BLASTP 2.2.22 [Sep-27-2009] Reference: Altschul, Stephen F., Thomas L. Madden, Alejandro A. Schaffer, Jinghui Zhang, Zheng Zhang, Webb Miller, and David J. Lipman (1997), "Gapped BLAST and PSI-BLAST: a new generation of protein database search programs", Nucleic Acids Res. 25:3389-3402. Reference for compositional score matrix adjustment: Altschul, Stephen F., John C. Wootton, E. Michael Gertz, Richa Agarwala, Aleksandr Morgulis, Alejandro A. Schaffer, and Yi-Kuo Yu (2005) "Protein database searches using compositionally adjusted substitution matrices", FEBS J. 272:5101-5109. Reference for composition-based statistics starting in round 2: Schaffer, Alejandro A., L. Aravind, Thomas L. Madden, Sergei Shavirin, John L. Spouge, Yuri I. Wolf, Eugene V. Koonin, and Stephen F. Altschul (2001), "Improving the accuracy of PSI-BLAST protein database searches with composition-based statistics and other refinements", Nucleic Acids Res. 29:2994-3005. Query= batch____ (236 letters) Database: uniref50.fasta 3,077,464 sequences; 1,040,396,356 total letters Searching..................................................done Results from round 1 Score E Sequences producing significant alignments: (bits) Value UniRef50_P77759 Putative uncharacterized protein ylbH n=12 Tax=E... 492 e-138 UniRef50_UPI0001B52595 rhsE element core protein RshE n=1 Tax=Es... 174 2e-42 UniRef50_P16919 Protein rhsD n=261 Tax=Bacteria RepID=RHSD_ECOLI 172 1e-41 UniRef50_O52663 Core protein (Fragment) n=5 Tax=Enterobacteriace... 171 2e-41 UniRef50_Q31U53 Putative uncharacterized protein n=1 Tax=Shigell... 157 4e-37 UniRef50_Q328Z1 RhsA protein in rhs element n=7 Tax=Enterobacter... 156 7e-37 UniRef50_A8A655 Rhs family protein n=14 Tax=Enterobacteriaceae R... 154 3e-36 UniRef50_B5PJT7 Protein RhsD n=2 Tax=Enterobacteriaceae RepID=B5... 153 5e-36 UniRef50_P32109 Putative uncharacterized protein yibJ n=14 Tax=B... 151 2e-35 UniRef50_P77779 Putative uncharacterized protein ybfO n=67 Tax=E... 150 2e-35 UniRef50_Q3YV37 Putative uncharacterized protein n=1 Tax=Shigell... 146 6e-34 UniRef50_UPI0001B52C8C protein, rhs-like protein n=4 Tax=Enterob... 145 9e-34 UniRef50_UPI0001C341D4 protein RhsA n=1 Tax=Citrobacter youngae ... 145 1e-33 UniRef50_B3X3P2 RhsH n=1 Tax=Shigella dysenteriae 1012 RepID=B3X... 141 2e-32 UniRef50_C1M8X5 Core protein n=2 Tax=Citrobacter RepID=C1M8X5_9ENTR 137 2e-31 UniRef50_UPI00019F181C rhsC element core protein RshC n=2 Tax=En... 135 9e-31 UniRef50_UPI00019F17FA RhsC core protein with extension n=1 Tax=... 130 3e-29 UniRef50_A9C0N8 YD repeat protein n=5 Tax=cellular organisms Rep... 110 3e-23 UniRef50_A1TV35 YD repeat protein n=2 Tax=Acidovorax RepID=A1TV3... 110 5e-23 UniRef50_C6EE82 Rhs family protein-like protein n=2 Tax=Escheric... 109 8e-23 UniRef50_D0KEV6 RHS protein n=2 Tax=Enterobacteriaceae RepID=D0K... 107 3e-22 UniRef50_D1T3Q3 YD repeat protein (Fragment) n=2 Tax=Acidovorax ... 105 2e-21 UniRef50_UPI0001BC4026 Rhs family protein n=6 Tax=Neisseria RepI... 102 1e-20 UniRef50_Q1LDW7 YD repeat n=2 Tax=Burkholderiaceae RepID=Q1LDW7_... 99 1e-19 UniRef50_A1TTQ1 Rhs family protein n=1 Tax=Acidovorax citrulli A... 98 3e-19 UniRef50_Q147B5 Rhs family protein n=2 Tax=Betaproteobacteria Re... 98 3e-19 UniRef50_Q2SPP2 Rhs family protein n=3 Tax=Hahella chejuensis KC... 97 3e-19 UniRef50_Q83LZ1 Putative Rhs-family protein n=1 Tax=Shigella fle... 97 4e-19 UniRef50_B2HXH5 Rhs family protein n=5 Tax=Acinetobacter baumann... 97 5e-19 UniRef50_Q13ML3 YD repeat protein n=10 Tax=Proteobacteria RepID=... 96 7e-19 UniRef50_Q4ZLF3 YD repeat n=5 Tax=Pseudomonas syringae group Rep... 96 9e-19 UniRef50_C6AKX3 Rhs family protein n=4 Tax=Aggregatibacter aphro... 96 9e-19 UniRef50_B4SV70 Rhs-family protein n=42 Tax=Enterobacteriaceae R... 96 1e-18 UniRef50_B2VH58 Rhs family protein n=5 Tax=Enterobacteriaceae Re... 96 1e-18 UniRef50_C8QGI4 YD repeat protein n=1 Tax=Pantoea sp. At-9b RepI... 96 1e-18 UniRef50_A1TQS0 YD repeat protein n=2 Tax=Acidovorax RepID=A1TQS... 96 1e-18 UniRef50_Q0JZD5 RHS family protein n=9 Tax=Bacteria RepID=Q0JZD5... 95 2e-18 UniRef50_B2K1J9 YD repeat protein n=23 Tax=Yersinia RepID=B2K1J9... 95 2e-18 UniRef50_C0Q6Q9 Rhs-family protein n=27 Tax=Enterobacteriaceae R... 95 2e-18 UniRef50_C5CXA7 YD repeat protein n=1 Tax=Variovorax paradoxus S... 95 2e-18 UniRef50_B1J8X0 YD repeat protein n=50 Tax=Gammaproteobacteria R... 94 3e-18 UniRef50_C5CZG8 RHS protein n=1 Tax=Variovorax paradoxus S110 Re... 94 4e-18 UniRef50_UPI0001B53B37 YD repeat protein n=1 Tax=Streptomyces sp... 94 5e-18 UniRef50_Q7NY44 Probable Rhs-family protein n=2 Tax=Chromobacter... 94 5e-18 UniRef50_Q2SFR1 Rhs family protein n=9 Tax=cellular organisms Re... 93 8e-18 UniRef50_A7FDJ9 RHS/YD repeat protein n=30 Tax=Enterobacteriacea... 93 9e-18 UniRef50_Q2SFR5 Rhs family protein n=1 Tax=Hahella chejuensis KC... 92 1e-17 UniRef50_B5MRU6 Rhs-family protein n=9 Tax=Gammaproteobacteria R... 92 2e-17 UniRef50_Q39K64 Rhs family protein n=22 Tax=Burkholderia RepID=Q... 91 3e-17 UniRef50_Q2SFS1 Rhs family protein n=1 Tax=Hahella chejuensis KC... 91 4e-17 UniRef50_D0KES6 YD repeat protein n=15 Tax=Gammaproteobacteria R... 91 4e-17 UniRef50_A6GLW0 Rhs family protein n=1 Tax=Limnobacter sp. MED10... 91 4e-17 UniRef50_Q6LUC4 Putative uncharacterized protein n=1 Tax=Photoba... 91 4e-17 UniRef50_Q6LUC6 Hypothetical nucleotidyltransferase n=1 Tax=Phot... 91 4e-17 UniRef50_D1T3N5 YD repeat protein n=1 Tax=Acidovorax avenae subs... 90 7e-17 UniRef50_A9EW02 Conserved exported carbohydrate-binding protein,... 90 7e-17 UniRef50_C0EPY5 Putative uncharacterized protein n=1 Tax=Neisser... 89 9e-17 UniRef50_C7MXD3 Rhs family protein n=1 Tax=Saccharomonospora vir... 89 1e-16 UniRef50_C6M5B1 Rhs-related protein n=7 Tax=Proteobacteria RepID... 89 1e-16 UniRef50_C6M9F5 RHS family protein n=2 Tax=Bacteria RepID=C6M9F5... 89 1e-16 UniRef50_D1YPL0 RHS repeat-associated core domain protein n=1 Ta... 89 1e-16 UniRef50_A1TTP6 YD repeat protein n=2 Tax=Acidovorax citrulli AA... 89 2e-16 UniRef50_D2TGW0 Putative Rhs protein n=2 Tax=Citrobacter RepID=D... 89 2e-16 UniRef50_Q48LL6 Rhs family protein n=1 Tax=Pseudomonas syringae ... 88 2e-16 UniRef50_C3JXH8 Putative Rhs protein n=1 Tax=Pseudomonas fluores... 88 2e-16 UniRef50_B1JCT8 RHS protein n=1 Tax=Pseudomonas putida W619 RepI... 88 2e-16 UniRef50_Q2SKM2 Rhs family protein n=3 Tax=Hahella chejuensis KC... 88 2e-16 UniRef50_B7LTT0 Putative uncharacterized protein n=4 Tax=Enterob... 88 3e-16 UniRef50_C9Y462 Putative uncharacterized protein n=1 Tax=Cronoba... 88 3e-16 UniRef50_Q87U70 Rhs family protein n=2 Tax=Pseudomonas RepID=Q87... 88 3e-16 UniRef50_UPI0001C34A7C Rhs family protein n=1 Tax=Neisseria subf... 88 3e-16 UniRef50_Q6D1M4 Rhs protein n=5 Tax=Enterobacteriaceae RepID=Q6D... 87 3e-16 UniRef50_B0VRR7 Putative uncharacterized protein n=1 Tax=Acineto... 87 4e-16 UniRef50_D0KES8 RHS protein n=3 Tax=Pectobacterium wasabiae WPP1... 87 4e-16 UniRef50_Q4K3M9 Rhs family protein n=5 Tax=Pseudomonas RepID=Q4K... 87 4e-16 UniRef50_A3NNM1 Protein RhsD n=20 Tax=pseudomallei group RepID=A... 87 4e-16 UniRef50_Q1K295 YD repeat n=4 Tax=Desulfuromonas acetoxidans DSM... 87 5e-16 UniRef50_UPI000190F33A Rhs-family protein n=2 Tax=Salmonella ent... 87 5e-16 UniRef50_UPI000196E06E Rhs family protein n=2 Tax=Neisseria muco... 87 5e-16 UniRef50_Q88FK6 RHS family protein, putative n=1 Tax=Pseudomonas... 87 5e-16 UniRef50_A1U3R9 YD repeat protein n=3 Tax=Gammaproteobacteria Re... 86 7e-16 UniRef50_A7K3Q8 Rhs family protein n=7 Tax=Vibrio RepID=A7K3Q8_V... 86 8e-16 UniRef50_C6M9F4 Rhs family protein n=8 Tax=Neisseria RepID=C6M9F... 86 9e-16 UniRef50_C9Y459 Putative uncharacterized protein n=1 Tax=Cronoba... 86 9e-16 UniRef50_D0KG38 Rhs family protein-like protein n=1 Tax=Pectobac... 86 1e-15 UniRef50_C0EPY1 Putative uncharacterized protein n=7 Tax=Neisser... 86 1e-15 UniRef50_Q395C2 Rhs family protein n=1 Tax=Burkholderia sp. 383 ... 86 2e-15 UniRef50_B5I7B2 Rhs repeat protein n=1 Tax=Streptomyces sviceus ... 85 2e-15 UniRef50_Q7MDR0 Rhs family protein n=3 Tax=Vibrio vulnificus Rep... 85 2e-15 UniRef50_B8FBZ2 YD repeat protein n=1 Tax=Desulfatibacillum alke... 85 2e-15 UniRef50_A7FDK7 YD/RHS repeat protein n=25 Tax=Enterobacteriacea... 85 2e-15 UniRef50_A9AE26 Rhs family protein n=10 Tax=cellular organisms R... 85 2e-15 UniRef50_Q399U9 Rhs family protein n=3 Tax=Proteobacteria RepID=... 85 2e-15 UniRef50_C5AIF2 Rhs family protein n=1 Tax=Burkholderia glumae B... 85 2e-15 UniRef50_D0KWY8 YD repeat protein n=1 Tax=Halothiobacillus neapo... 85 3e-15 UniRef50_A1TSU8 YD repeat protein n=4 Tax=Acidovorax RepID=A1TSU... 85 3e-15 UniRef50_A1AK54 YD repeat protein n=1 Tax=Pelobacter propionicus... 84 3e-15 UniRef50_D0HCV2 Rhs protein n=3 Tax=Vibrio mimicus VM223 RepID=D... 84 3e-15 UniRef50_C7Q0B8 YD repeat protein n=1 Tax=Catenulispora acidiphi... 84 3e-15 UniRef50_Q2SGE8 Rhs family protein n=1 Tax=Hahella chejuensis KC... 84 3e-15 UniRef50_Q12LF3 YD repeat n=2 Tax=Shewanella denitrificans OS217... 84 4e-15 UniRef50_B9B9U8 YD repeat protein n=1 Tax=Burkholderia multivora... 84 4e-15 UniRef50_A1TR18 YD repeat protein n=8 Tax=Acidovorax RepID=A1TR1... 84 5e-15 UniRef50_A1TSG3 RHS protein n=1 Tax=Acidovorax citrulli AAC00-1 ... 83 6e-15 UniRef50_C9NF93 YD repeat protein n=1 Tax=Streptomyces flavogris... 83 7e-15 UniRef50_C5ALM7 YD repeat protein n=19 Tax=Proteobacteria RepID=... 83 7e-15 UniRef50_A4FJ21 YD repeat protein n=1 Tax=Saccharopolyspora eryt... 83 7e-15 UniRef50_C8Q7Z5 YD repeat protein n=8 Tax=Enterobacteriaceae Rep... 82 1e-14 UniRef50_A5GE16 YD repeat protein n=1 Tax=Geobacter uraniireduce... 82 1e-14 UniRef50_B7H4M5 Uncharacterized protein ybfO n=3 Tax=Acinetobact... 82 1e-14 UniRef50_Q2SPP3 Rhs family protein n=1 Tax=Hahella chejuensis KC... 82 1e-14 UniRef50_D1S833 YD repeat protein n=1 Tax=Micromonospora auranti... 82 2e-14 UniRef50_C1M4X0 Predicted protein n=1 Tax=Citrobacter sp. 30_2 R... 81 3e-14 UniRef50_B5H9Z7 Rhs protein n=2 Tax=Streptomyces RepID=B5H9Z7_STRPR 81 3e-14 UniRef50_A0LJM9 YD repeat protein n=3 Tax=Syntrophobacter fumaro... 81 3e-14 UniRef50_D1VCV7 YD repeat protein n=1 Tax=Frankia sp. EuI1c RepI... 81 4e-14 UniRef50_UPI00016A9A82 Rhs family protein n=2 Tax=Burkholderia o... 80 4e-14 UniRef50_Q1I7Q5 Putative uncharacterized protein n=11 Tax=Pseudo... 80 4e-14 UniRef50_D1YPP7 RHS repeat-associated core domain protein n=1 Ta... 80 4e-14 UniRef50_UPI0001B56FBA YD repeat-containing protein n=1 Tax=Stre... 80 5e-14 UniRef50_A7FN18 RHS/YD repeat protein n=5 Tax=cellular organisms... 80 5e-14 UniRef50_A9EVR3 Conserved carbohydrate-binding protein, Rhs fami... 80 6e-14 UniRef50_A3RZQ8 Core protein n=21 Tax=Ralstonia solanacearum Rep... 80 6e-14 UniRef50_C7PJF4 YD repeat protein n=2 Tax=Chitinophaga pinensis ... 80 6e-14 UniRef50_A4SKJ3 Rhs family protein n=2 Tax=Bacteria RepID=A4SKJ3... 80 6e-14 UniRef50_B2PW78 Putative uncharacterized protein n=1 Tax=Provide... 80 6e-14 UniRef50_B4ETQ2 Rhs-family protein n=9 Tax=Enterobacteriaceae Re... 80 6e-14 UniRef50_C6WQE4 YD repeat protein n=1 Tax=Actinosynnema mirum DS... 80 8e-14 UniRef50_C5AA19 Rhs family protein n=1 Tax=Burkholderia glumae B... 80 8e-14 UniRef50_C7Q0A7 YD repeat protein n=2 Tax=Catenulispora acidiphi... 79 9e-14 UniRef50_C9Y441 Putative uncharacterized protein n=1 Tax=Cronoba... 79 9e-14 UniRef50_A9FBU5 Conserved carbohydrate-binding protein, Rhs fami... 79 1e-13 UniRef50_D0KZB0 YD repeat protein n=1 Tax=Halothiobacillus neapo... 79 1e-13 UniRef50_C6WJA6 YD repeat protein n=3 Tax=Actinosynnema mirum DS... 79 1e-13 UniRef50_A9GIJ6 Conserved carbohydrate-binding protein, Rhs fami... 79 2e-13 UniRef50_Q7N2G0 Complete genome; segment 11/17 n=4 Tax=Gammaprot... 79 2e-13 UniRef50_A1TJG7 YD repeat protein n=4 Tax=Proteobacteria RepID=A... 78 2e-13 UniRef50_B4ETQ7 Putative Rhs-family protein n=3 Tax=Enterobacter... 78 2e-13 UniRef50_C7QG23 YD repeat protein n=2 Tax=Catenulispora acidiphi... 78 2e-13 UniRef50_B1KGR6 YD repeat protein n=1 Tax=Shewanella woodyi ATCC... 78 3e-13 UniRef50_C6M9F8 RhsG core protein with extension n=1 Tax=Neisser... 78 3e-13 UniRef50_D1AA66 YD repeat-containing protein n=1 Tax=Thermomonos... 77 4e-13 UniRef50_C4KA75 YD repeat protein n=1 Tax=Thauera sp. MZ1T RepID... 77 4e-13 UniRef50_A3KUM5 Rhs family protein n=10 Tax=Pseudomonas aerugino... 77 4e-13 UniRef50_B4V6T6 Rhs protein n=2 Tax=Streptomyces RepID=B4V6T6_9ACTO 77 5e-13 UniRef50_D1SWH5 RHS protein (Fragment) n=1 Tax=Acidovorax avenae... 77 5e-13 UniRef50_UPI0001AF2680 RHS/YD repeat-containing protein n=1 Tax=... 77 6e-13 UniRef50_B2PVY9 Putative uncharacterized protein n=12 Tax=Entero... 77 7e-13 UniRef50_B7GX39 Protein rhsD n=4 Tax=Acinetobacter baumannii Rep... 76 7e-13 UniRef50_D1SVF0 Rhs family protein (Fragment) n=1 Tax=Acidovorax... 76 1e-12 UniRef50_Q4UTI2 Putative uncharacterized protein n=2 Tax=Xanthom... 76 1e-12 UniRef50_D1YPL4 RHS repeat-associated core domain protein n=1 Ta... 76 1e-12 UniRef50_C6CPH2 YD repeat protein n=11 Tax=Enterobacteriaceae Re... 76 1e-12 UniRef50_A9AKU8 Type VI secretion system Vgr family protein n=10... 76 1e-12 UniRef50_Q9L0E3 Putative Rhs protein n=2 Tax=Streptomyces RepID=... 75 1e-12 UniRef50_C6CNW6 YD repeat protein n=8 Tax=Enterobacteriaceae Rep... 75 1e-12 UniRef50_Q8GDM7 Rhs n=3 Tax=Photorhabdus RepID=Q8GDM7_PHOLU 75 1e-12 UniRef50_B6VUY0 Putative uncharacterized protein n=2 Tax=Bactero... 75 2e-12 UniRef50_UPI0001B58169 YD repeat-containing protein n=1 Tax=Stre... 75 2e-12 UniRef50_A9C2K3 YD repeat protein n=1 Tax=Delftia acidovorans SP... 75 2e-12 UniRef50_D1W448 RHS repeat-associated core domain protein n=1 Ta... 75 3e-12 UniRef50_B2HAQ4 RhsD protein n=4 Tax=Burkholderia RepID=B2HAQ4_B... 75 3e-12 UniRef50_B4EGW8 RHS-family protein n=9 Tax=Burkholderiaceae RepI... 74 3e-12 UniRef50_C7QAC6 YD repeat protein n=2 Tax=Bacteria RepID=C7QAC6_... 74 3e-12 UniRef50_C0FSB7 Putative uncharacterized protein n=1 Tax=Rosebur... 74 4e-12 UniRef50_B3PEN8 Rhsfamily protein n=2 Tax=Cellvibrio japonicus U... 74 4e-12 UniRef50_B4VFT3 Rhs protein n=4 Tax=Bacteria RepID=B4VFT3_9ACTO 74 4e-12 UniRef50_C4NV50 Rhs repeat family protein n=10 Tax=Gammaproteoba... 74 5e-12 UniRef50_A4B8X0 YD repeat n=1 Tax=Reinekea blandensis MED297 Rep... 74 5e-12 UniRef50_C7MZM5 Rhs family protein n=1 Tax=Saccharomonospora vir... 74 6e-12 UniRef50_C5T0D9 YD repeat protein n=1 Tax=Acidovorax delafieldii... 74 6e-12 UniRef50_A8ZSG6 YD repeat protein n=1 Tax=Desulfococcus oleovora... 73 7e-12 UniRef50_B4SL73 YD repeat protein n=4 Tax=Stenotrophomonas malto... 73 8e-12 UniRef50_D0BWK2 YD repeat protein n=1 Tax=Acinetobacter sp. RUH2... 73 8e-12 UniRef50_A9GD22 Conserved carbohydrate-binding protein, Rhs fami... 73 9e-12 UniRef50_Q8XSL8 Putative rsh-related protein n=1 Tax=Ralstonia s... 72 1e-11 UniRef50_A9C2L0 Putative uncharacterized protein n=1 Tax=Delftia... 72 1e-11 UniRef50_A8ZVU3 YD repeat protein n=1 Tax=Desulfococcus oleovora... 72 1e-11 UniRef50_Q3JSF4 RhsD protein n=23 Tax=Burkholderia pseudomallei ... 72 1e-11 UniRef50_B4V251 LipX3 n=2 Tax=Streptomyces RepID=B4V251_9ACTO 72 1e-11 UniRef50_D1T3N4 YD repeat protein n=2 Tax=Betaproteobacteria Rep... 72 1e-11 UniRef50_UPI00016B0868 Rhs family protein n=1 Tax=Burkholderia p... 72 2e-11 UniRef50_B2STD6 RHS Repeat family n=18 Tax=Bacteria RepID=B2STD6... 72 2e-11 UniRef50_B4SR21 YD repeat protein n=2 Tax=Stenotrophomonas malto... 72 2e-11 UniRef50_Q6FD94 Putative RHS-related protein n=1 Tax=Acinetobact... 72 2e-11 UniRef50_D1WVZ1 YD repeat protein n=1 Tax=Streptomyces sp. ACT-1... 71 3e-11 UniRef50_B1HKQ3 Rhs family protein n=1 Tax=Burkholderia pseudoma... 71 4e-11 UniRef50_C1B7W9 Putative uncharacterized protein n=1 Tax=Rhodoco... 70 4e-11 UniRef50_C2LFQ4 Putative uncharacterized protein n=1 Tax=Proteus... 70 5e-11 UniRef50_C6CI62 YD repeat protein n=7 Tax=Enterobacteriaceae Rep... 70 5e-11 UniRef50_B8FCM0 YD repeat protein n=1 Tax=Desulfatibacillum alke... 70 5e-11 UniRef50_D1PUV1 RHS family protein n=1 Tax=Prevotella bergensis ... 70 5e-11 UniRef50_D1JME6 Rhs family protein n=22 Tax=Bacteroides RepID=D1... 70 5e-11 UniRef50_A9C2K9 Putative uncharacterized protein n=1 Tax=Delftia... 70 6e-11 UniRef50_C0FSB4 Putative uncharacterized protein n=1 Tax=Rosebur... 70 7e-11 UniRef50_A9EZF2 Conserved carbohydrate-binding protein, Rhs fami... 70 8e-11 UniRef50_UPI0001B4DD67 Rhs protein n=1 Tax=Streptomyces hygrosco... 70 8e-11 UniRef50_C8W2A7 YD repeat protein n=3 Tax=Desulfotomaculum aceto... 69 1e-10 UniRef50_Q2T5B9 YD repeat protein n=39 Tax=Proteobacteria RepID=... 69 1e-10 UniRef50_B2PYS3 Putative uncharacterized protein n=1 Tax=Provide... 69 1e-10 UniRef50_A1WE65 Rhs family protein-like protein n=1 Tax=Verminep... 69 1e-10 UniRef50_C5ID01 RhsK n=50 Tax=cellular organisms RepID=C5ID01_ECOLX 69 1e-10 UniRef50_Q2SIG5 Rhs family protein n=1 Tax=Hahella chejuensis KC... 69 1e-10 UniRef50_B2Q762 Putative uncharacterized protein n=2 Tax=Provide... 69 1e-10 UniRef50_D2KTW4 Putative uncharacterized protein n=2 Tax=Strepto... 69 1e-10 UniRef50_A8ZTS1 YD repeat protein n=4 Tax=Desulfococcus oleovora... 69 2e-10 UniRef50_C5SNJ4 YD repeat protein n=1 Tax=Asticcacaulis excentri... 69 2e-10 UniRef50_B2PW71 Putative uncharacterized protein n=1 Tax=Provide... 69 2e-10 UniRef50_Q7NYN4 Probable rhs-related transmembrane protein relat... 68 2e-10 UniRef50_B7GLX9 Rhs family protein n=1 Tax=Anoxybacillus flavith... 68 3e-10 UniRef50_B2UQS6 Putative uncharacterized protein n=1 Tax=Akkerma... 68 3e-10 UniRef50_B7X4I1 Putative uncharacterized protein n=1 Tax=Comamon... 67 4e-10 UniRef50_A9GLF7 Conserved exported carbohydrate-binding protein,... 67 5e-10 UniRef50_D2UF28 Putative rhs family protein n=2 Tax=Xanthomonas ... 67 6e-10 UniRef50_A9C2K7 Rhs family protein-like protein n=1 Tax=Delftia ... 67 6e-10 UniRef50_UPI000196D8DA RHS family protein n=1 Tax=Neisseria muco... 67 7e-10 UniRef50_A8ZXK4 YD repeat protein n=3 Tax=Deltaproteobacteria Re... 66 8e-10 UniRef50_C0EGC9 Putative uncharacterized protein n=2 Tax=Clostri... 66 1e-09 UniRef50_B5S3P6 Probable rhs-related protein (Fragment) n=1 Tax=... 66 1e-09 UniRef50_C8QVK4 YD repeat protein n=4 Tax=Desulfurivibrio alkali... 66 1e-09 UniRef50_Q396B9 Rhs family protein n=1 Tax=Burkholderia sp. 383 ... 66 1e-09 UniRef50_D0KFB3 Putative uncharacterized protein n=1 Tax=Pectoba... 65 1e-09 UniRef50_C2M343 Rhs family protein n=2 Tax=Capnocytophaga gingiv... 65 2e-09 UniRef50_Q12LF5 Putative uncharacterized protein n=1 Tax=Shewane... 65 2e-09 UniRef50_B3PEK8 RHS Repeat family n=1 Tax=Cellvibrio japonicus U... 65 2e-09 UniRef50_C9PMJ2 Substrate-binding repeat protein n=2 Tax=Pasteur... 65 2e-09 UniRef50_B5JS46 NHL repeat containing protein n=2 Tax=gamma prot... 65 2e-09 UniRef50_B8FIJ1 YD repeat protein n=1 Tax=Desulfatibacillum alke... 65 2e-09 UniRef50_UPI0001AF39C0 YD repeat-containing protein n=1 Tax=Pseu... 65 2e-09 UniRef50_B7GLY4 Rhs family protein n=3 Tax=Anoxybacillus flavith... 65 3e-09 UniRef50_Q2T916 Rhs1 protein n=30 Tax=Burkholderia RepID=Q2T916_... 65 3e-09 UniRef50_Q30CR7 LipX3 n=4 Tax=Streptomyces RepID=Q30CR7_STRAU 65 3e-09 UniRef50_B9M2L6 YD repeat protein n=6 Tax=Geobacter sp. FRC-32 R... 64 3e-09 UniRef50_Q2Y6W6 RhsD protein n=1 Tax=Nitrosospira multiformis AT... 64 3e-09 UniRef50_UPI0001B52686 Rhs core protein with extension n=1 Tax=S... 64 4e-09 UniRef50_B9XHG9 YD repeat protein n=1 Tax=bacterium Ellin514 Rep... 64 4e-09 UniRef50_Q4ZU10 YD repeat n=2 Tax=Pseudomonas syringae group Rep... 64 4e-09 UniRef50_C3JCI6 Rhs family protein n=4 Tax=Bacteria RepID=C3JCI6... 64 4e-09 UniRef50_A3UDD5 Wall associated protein n=1 Tax=Oceanicaulis ale... 64 4e-09 UniRef50_Q2Y6N6 RhsD protein n=1 Tax=Nitrosospira multiformis AT... 64 4e-09 UniRef50_UPI0001B52689 Rhs core protein with extension n=1 Tax=S... 64 5e-09 UniRef50_Q2Y592 Peptidase C39, bacteriocin processing n=1 Tax=Ni... 64 5e-09 UniRef50_C1AD40 Putative uncharacterized protein n=2 Tax=Gemmati... 64 6e-09 UniRef50_C0ZAE5 Putative uncharacterized protein n=2 Tax=Breviba... 64 6e-09 UniRef50_B1HM94 Cell wall-associated protein n=5 Tax=Lysinibacil... 64 6e-09 UniRef50_B4UIF2 YD repeat protein n=1 Tax=Anaeromyxobacter sp. K... 64 6e-09 UniRef50_A9FQL9 Putative uncharacterized protein n=1 Tax=Sorangi... 64 6e-09 UniRef50_A7BNN3 Putative uncharacterized protein n=1 Tax=Beggiat... 63 6e-09 UniRef50_D0LTH3 YD repeat protein n=1 Tax=Haliangium ochraceum D... 63 7e-09 UniRef50_D1W844 RHS repeat-associated core domain protein (Fragm... 63 7e-09 >UniRef50_P77759 Putative uncharacterized protein ylbH n=12 Tax=Escherichia coli RepID=YLBH_ECOLI Length = 236 Score = 492 bits (1267), Expect = e-138, Method: Compositional matrix adjust. Identities = 236/236 (100%), Positives = 236/236 (100%) Query: 1 MLALMDADGNIAWSGEYDEWGNQLNEENPHHLHQPYRLPGQQYDKESGLYYNRNRYYDPL 60 MLALMDADGNIAWSGEYDEWGNQLNEENPHHLHQPYRLPGQQYDKESGLYYNRNRYYDPL Sbjct: 1 MLALMDADGNIAWSGEYDEWGNQLNEENPHHLHQPYRLPGQQYDKESGLYYNRNRYYDPL 60 Query: 61 QGRYITQDPIGLEGGWSLYAYPLNPVNGIDPLGLSPADVALIRRKDQLNHQRAWDILSDT 120 QGRYITQDPIGLEGGWSLYAYPLNPVNGIDPLGLSPADVALIRRKDQLNHQRAWDILSDT Sbjct: 61 QGRYITQDPIGLEGGWSLYAYPLNPVNGIDPLGLSPADVALIRRKDQLNHQRAWDILSDT 120 Query: 121 YEDMKRLNLGGTDQFFHCMAFCRVSKLNDAGVSRSAKGLGYEKEIRDYGLNLFGMYGRKV 180 YEDMKRLNLGGTDQFFHCMAFCRVSKLNDAGVSRSAKGLGYEKEIRDYGLNLFGMYGRKV Sbjct: 121 YEDMKRLNLGGTDQFFHCMAFCRVSKLNDAGVSRSAKGLGYEKEIRDYGLNLFGMYGRKV 180 Query: 181 KLSHSEMIEDNKKDLAVNDHGLTCPSTTDCSDRCSDYINPEHKKTIKALQDAGYLK 236 KLSHSEMIEDNKKDLAVNDHGLTCPSTTDCSDRCSDYINPEHKKTIKALQDAGYLK Sbjct: 181 KLSHSEMIEDNKKDLAVNDHGLTCPSTTDCSDRCSDYINPEHKKTIKALQDAGYLK 236 >UniRef50_UPI0001B52595 rhsE element core protein RshE n=1 Tax=Escherichia sp. 4_1_40B RepID=UPI0001B52595 Length = 273 Score = 174 bits (442), Expect = 2e-42, Method: Compositional matrix adjust. Identities = 98/233 (42%), Positives = 124/233 (53%), Gaps = 34/233 (14%) Query: 2 LALMDADGNIAWSGEYDEWGNQLNEENPHHLHQPYRLPGQQYDKESGLYYNRNRYYDPLQ 61 LAL+ DGN AW GEYDEWGNQLNEENPHHLHQPYRLPGQQ+D+ESGLYYNR+R+YDPLQ Sbjct: 32 LALISEDGNTAWRGEYDEWGNQLNEENPHHLHQPYRLPGQQHDEESGLYYNRHRHYDPLQ 91 Query: 62 GRYITQDPIGLEGGWSLYAYPLNPVNGIDPLGLSPAD--------------VALIRRKDQ 107 GRYIT DPIGL GGW++Y YPLNP+ IDP+GL + LI Sbjct: 92 GRYITPDPIGLRGGWNMYQYPLNPIQVIDPMGLDAIENMTSGGLIYAVSGVPGLIAANSI 151 Query: 108 LNHQRAWDILSDTYEDMKRLNLGGTDQFFHCMAFCRVSKLNDAGVSRSAKGLGYEKEIRD 167 N A+ D + + G D HC CR++K + ++ Sbjct: 152 TN--SAYQFGYDMDAIVGGAHNGAADAMRHCYLMCRMTKTFGSTIA-------------- 195 Query: 168 YGLNLFGMYGRKVKLSHSEMIEDNKKDLAVNDHGLTCPS-TTDCSDRCSDYIN 219 ++ G + ++ DL N G+ C + CSD C + N Sbjct: 196 ---DVIGKNHEAAGDRQGQPAKERIMDLKNNTVGIACGDFSAKCSDACIEKYN 245 >UniRef50_P16919 Protein rhsD n=261 Tax=Bacteria RepID=RHSD_ECOLI Length = 1426 Score = 172 bits (435), Expect = 1e-41, Method: Compositional matrix adjust. Identities = 75/93 (80%), Positives = 86/93 (92%) Query: 2 LALMDADGNIAWSGEYDEWGNQLNEENPHHLHQPYRLPGQQYDKESGLYYNRNRYYDPLQ 61 LAL+ DGN AWS EYDEWGNQLNEENPHH++QPYRLPGQQ+D+ESGLYYNR+RYYDPLQ Sbjct: 1166 LALISEDGNTAWSAEYDEWGNQLNEENPHHVYQPYRLPGQQHDEESGLYYNRHRYYDPLQ 1225 Query: 62 GRYITQDPIGLEGGWSLYAYPLNPVNGIDPLGL 94 GRYITQDP+GL+GGW+LY YPLNP+ IDP+GL Sbjct: 1226 GRYITQDPMGLKGGWNLYQYPLNPLQQIDPMGL 1258 >UniRef50_O52663 Core protein (Fragment) n=5 Tax=Enterobacteriaceae RepID=O52663_ECOLX Length = 350 Score = 171 bits (433), Expect = 2e-41, Method: Compositional matrix adjust. Identities = 77/93 (82%), Positives = 85/93 (91%) Query: 2 LALMDADGNIAWSGEYDEWGNQLNEENPHHLHQPYRLPGQQYDKESGLYYNRNRYYDPLQ 61 LAL+ DGN AW GEYDEWGNQLNEENP++LHQPYRLPGQQ+D+ESGLYYNRNRYYDPLQ Sbjct: 114 LALISEDGNTAWRGEYDEWGNQLNEENPYYLHQPYRLPGQQHDEESGLYYNRNRYYDPLQ 173 Query: 62 GRYITQDPIGLEGGWSLYAYPLNPVNGIDPLGL 94 GRYITQDPIGL GGW+LY YPLNP+ +DPLGL Sbjct: 174 GRYITQDPIGLAGGWNLYNYPLNPIIRMDPLGL 206 >UniRef50_Q31U53 Putative uncharacterized protein n=1 Tax=Shigella boydii Sb227 RepID=Q31U53_SHIBS Length = 927 Score = 157 bits (396), Expect = 4e-37, Method: Compositional matrix adjust. Identities = 70/93 (75%), Positives = 78/93 (83%) Query: 2 LALMDADGNIAWSGEYDEWGNQLNEENPHHLHQPYRLPGQQYDKESGLYYNRNRYYDPLQ 61 L L+ +G W EYDEWGN LNEENPHHL Q RLPGQQYD+ESGLYYNR+RYYDPLQ Sbjct: 810 LVLISTEGATEWCAEYDEWGNLLNEENPHHLQQLIRLPGQQYDEESGLYYNRHRYYDPLQ 869 Query: 62 GRYITQDPIGLEGGWSLYAYPLNPVNGIDPLGL 94 GRYITQDPIGL+GGW+ Y YPL+PVN +DPLGL Sbjct: 870 GRYITQDPIGLKGGWNFYQYPLSPVNSMDPLGL 902 >UniRef50_Q328Z1 RhsA protein in rhs element n=7 Tax=Enterobacteriaceae RepID=Q328Z1_SHIDS Length = 1213 Score = 156 bits (394), Expect = 7e-37, Method: Compositional matrix adjust. Identities = 71/94 (75%), Positives = 79/94 (84%) Query: 2 LALMDADGNIAWSGEYDEWGNQLNEENPHHLHQPYRLPGQQYDKESGLYYNRNRYYDPLQ 61 LAL+ +G W EYDEWGN LNEENPH L Q RLPGQQYD+ESGLYYNR+RYYDPLQ Sbjct: 1079 LALVSTEGATEWCAEYDEWGNLLNEENPHQLQQLIRLPGQQYDEESGLYYNRHRYYDPLQ 1138 Query: 62 GRYITQDPIGLEGGWSLYAYPLNPVNGIDPLGLS 95 GRYITQDPIGL+GGW+LY Y LNP++ IDPLGLS Sbjct: 1139 GRYITQDPIGLKGGWNLYGYQLNPISDIDPLGLS 1172 >UniRef50_A8A655 Rhs family protein n=14 Tax=Enterobacteriaceae RepID=A8A655_ECOHS Length = 314 Score = 154 bits (388), Expect = 3e-36, Method: Compositional matrix adjust. Identities = 72/108 (66%), Positives = 84/108 (77%), Gaps = 2/108 (1%) Query: 2 LALMDADGNIAWSGEYDEWGNQLNEENPHHLHQPYRLPGQQYDKESGLYYNRNRYYDPLQ 61 LAL+ +G AW EYDEWGN L++ENPHHL Q RLPGQQYD+ESGLYYNR+RYYDPLQ Sbjct: 60 LALISTEGATAWCAEYDEWGNLLSDENPHHLQQLIRLPGQQYDEESGLYYNRHRYYDPLQ 119 Query: 62 GRYITQDPIGLEGGWSLYAYPLNPVNGIDPLGLSPADVALIRRKDQLN 109 GRYITQDPIGL+GGW+ Y YPLNPV +DP GL D+ L D ++ Sbjct: 120 GRYITQDPIGLKGGWNFYQYPLNPVINVDPQGL--VDINLYPESDLIH 165 >UniRef50_B5PJT7 Protein RhsD n=2 Tax=Enterobacteriaceae RepID=B5PJT7_SALET Length = 429 Score = 153 bits (386), Expect = 5e-36, Method: Compositional matrix adjust. Identities = 73/99 (73%), Positives = 80/99 (80%) Query: 2 LALMDADGNIAWSGEYDEWGNQLNEENPHHLHQPYRLPGQQYDKESGLYYNRNRYYDPLQ 61 LAL+ D +AW GEYDEWGN EENP HL Q RLPGQQYD+ESGLYYNR+RYY+P Q Sbjct: 179 LALITPDNTVAWRGEYDEWGNLSGEENPAHLEQVIRLPGQQYDEESGLYYNRHRYYNPGQ 238 Query: 62 GRYITQDPIGLEGGWSLYAYPLNPVNGIDPLGLSPADVA 100 GRYITQDPIGL GGW+LY YPLNPV+ IDPLGLS D A Sbjct: 239 GRYITQDPIGLRGGWNLYNYPLNPVSEIDPLGLSMWDDA 277 >UniRef50_P32109 Putative uncharacterized protein yibJ n=14 Tax=Bacteria RepID=YIBJ_ECOLI Length = 233 Score = 151 bits (382), Expect = 2e-35, Method: Compositional matrix adjust. Identities = 71/108 (65%), Positives = 83/108 (76%), Gaps = 2/108 (1%) Query: 2 LALMDADGNIAWSGEYDEWGNQLNEENPHHLHQPYRLPGQQYDKESGLYYNRNRYYDPLQ 61 LAL+ +G AW EYDEWGN L++ENPHHL Q RLPGQQYD+ESGLYYNR+RYYDPL Sbjct: 60 LALISTEGATAWCAEYDEWGNLLSDENPHHLQQLIRLPGQQYDEESGLYYNRHRYYDPLL 119 Query: 62 GRYITQDPIGLEGGWSLYAYPLNPVNGIDPLGLSPADVALIRRKDQLN 109 GRYITQDPIGL+GGW+ Y YPLNPV +DP GL D+ L D ++ Sbjct: 120 GRYITQDPIGLKGGWNFYQYPLNPVINVDPQGL--VDINLYPESDLIH 165 >UniRef50_P77779 Putative uncharacterized protein ybfO n=67 Tax=Enterobacteriaceae RepID=YBFO_ECOLI Length = 477 Score = 150 bits (380), Expect = 2e-35, Method: Compositional matrix adjust. Identities = 69/94 (73%), Positives = 77/94 (81%) Query: 2 LALMDADGNIAWSGEYDEWGNQLNEENPHHLHQPYRLPGQQYDKESGLYYNRNRYYDPLQ 61 LAL+ +G W EYDEWGN LNEENPH L Q RLPGQQYD+ESGLYYNR+RYYDPLQ Sbjct: 233 LALVSTEGATEWCAEYDEWGNLLNEENPHQLQQLIRLPGQQYDEESGLYYNRHRYYDPLQ 292 Query: 62 GRYITQDPIGLEGGWSLYAYPLNPVNGIDPLGLS 95 GRYITQDPIGL+GGW+ Y YPLNPV ID +GL+ Sbjct: 293 GRYITQDPIGLKGGWNFYQYPLNPVQYIDSMGLA 326 >UniRef50_Q3YV37 Putative uncharacterized protein n=1 Tax=Shigella sonnei Ss046 RepID=Q3YV37_SHISS Length = 303 Score = 146 bits (369), Expect = 6e-34, Method: Compositional matrix adjust. Identities = 66/79 (83%), Positives = 72/79 (91%) Query: 16 EYDEWGNQLNEENPHHLHQPYRLPGQQYDKESGLYYNRNRYYDPLQGRYITQDPIGLEGG 75 EYDEWGN LNEENPH L Q RLPGQQYD+ESGLYYNR+RYYDPLQGRYITQDPIGL+GG Sbjct: 78 EYDEWGNLLNEENPHQLQQLIRLPGQQYDEESGLYYNRHRYYDPLQGRYITQDPIGLKGG 137 Query: 76 WSLYAYPLNPVNGIDPLGL 94 W+LY YPL+PVN +DPLGL Sbjct: 138 WNLYTYPLSPVNSMDPLGL 156 >UniRef50_UPI0001B52C8C protein, rhs-like protein n=4 Tax=Enterobacteriaceae RepID=UPI0001B52C8C Length = 243 Score = 145 bits (367), Expect = 9e-34, Method: Compositional matrix adjust. Identities = 67/92 (72%), Positives = 75/92 (81%) Query: 4 LMDADGNIAWSGEYDEWGNQLNEENPHHLHQPYRLPGQQYDKESGLYYNRNRYYDPLQGR 63 L+ +G W EYDEWGN LNEENPH L Q RLPGQQYD+ESGLYYNR+RYYDPLQGR Sbjct: 1 LVSTEGATEWCAEYDEWGNLLNEENPHQLQQLIRLPGQQYDEESGLYYNRHRYYDPLQGR 60 Query: 64 YITQDPIGLEGGWSLYAYPLNPVNGIDPLGLS 95 YITQDPIGL+GGW+ Y YPLNPV ID +GL+ Sbjct: 61 YITQDPIGLKGGWNFYQYPLNPVQYIDSMGLA 92 >UniRef50_UPI0001C341D4 protein RhsA n=1 Tax=Citrobacter youngae ATCC 29220 RepID=UPI0001C341D4 Length = 1365 Score = 145 bits (365), Expect = 1e-33, Method: Compositional matrix adjust. Identities = 70/101 (69%), Positives = 80/101 (79%), Gaps = 3/101 (2%) Query: 2 LALMDADGNIAWSGEYDEWGNQLNEENPHHLHQPYRLPGQQYDKESGLYYNRNRYYDPLQ 61 LAL+ DG I+W EYDEWGN L E+NPH+L Q RLPGQQYD ESGL+YNR+RYY+P Sbjct: 1135 LALISQDGAISWRAEYDEWGNVLREDNPHNLQQLIRLPGQQYDDESGLHYNRHRYYNPGL 1194 Query: 62 GRYITQDPIGLEGGWSLYAYPLNPVNGIDPLGLSPADVALI 102 GRYITQDPIGL+GGW+LY YPLNPV IDP GL DV L+ Sbjct: 1195 GRYITQDPIGLKGGWNLYKYPLNPVEYIDPSGL---DVRLV 1232 >UniRef50_B3X3P2 RhsH n=1 Tax=Shigella dysenteriae 1012 RepID=B3X3P2_SHIDY Length = 263 Score = 141 bits (355), Expect = 2e-32, Method: Compositional matrix adjust. Identities = 69/105 (65%), Positives = 77/105 (73%), Gaps = 1/105 (0%) Query: 2 LALMDADGNIAWSGEYDEWGNQLNEENPHHLHQPYRLPGQQYDKESGLYYNRNRYYDPLQ 61 L L+ DG W EYDEWGN LNEENP HL Q RLPGQQYD+ESGLYYNR+RYYDPLQ Sbjct: 23 LTLISPDGATEWCAEYDEWGNLLNEENPQHLQQLIRLPGQQYDEESGLYYNRHRYYDPLQ 82 Query: 62 GRYITQDPIGLEGGWSLYAYP-LNPVNGIDPLGLSPADVALIRRK 105 GRYITQDPIGLEGGW+ Y Y ++P IDPLGL + R+ Sbjct: 83 GRYITQDPIGLEGGWNQYVYASIHPTYSIDPLGLIDKPAPVFNRE 127 >UniRef50_C1M8X5 Core protein n=2 Tax=Citrobacter RepID=C1M8X5_9ENTR Length = 1359 Score = 137 bits (346), Expect = 2e-31, Method: Compositional matrix adjust. Identities = 65/93 (69%), Positives = 74/93 (79%) Query: 2 LALMDADGNIAWSGEYDEWGNQLNEENPHHLHQPYRLPGQQYDKESGLYYNRNRYYDPLQ 61 LAL+ DG I+W EYDEWGN L E+NPH+L Q RLPGQQYD ESGL+YNR+RYY+P Sbjct: 1136 LALIRQDGAISWRAEYDEWGNVLREDNPHNLQQLIRLPGQQYDDESGLHYNRHRYYNPGL 1195 Query: 62 GRYITQDPIGLEGGWSLYAYPLNPVNGIDPLGL 94 GRYITQDPIGL GG + Y YPLNPV +DPLGL Sbjct: 1196 GRYITQDPIGLAGGLNPYQYPLNPVTEVDPLGL 1228 >UniRef50_UPI00019F181C rhsC element core protein RshC n=2 Tax=Enterobacteriaceae RepID=UPI00019F181C Length = 260 Score = 135 bits (341), Expect = 9e-31, Method: Compositional matrix adjust. Identities = 63/92 (68%), Positives = 72/92 (78%) Query: 2 LALMDADGNIAWSGEYDEWGNQLNEENPHHLHQPYRLPGQQYDKESGLYYNRNRYYDPLQ 61 L L+ DG W EYDEWGN LNE+NP +L Q RLPGQQYD ES LYYNR+RYY+P Q Sbjct: 18 LTLIRTDGRTGWRAEYDEWGNLLNEDNPQNLQQLIRLPGQQYDDESELYYNRHRYYNPEQ 77 Query: 62 GRYITQDPIGLEGGWSLYAYPLNPVNGIDPLG 93 GRYITQDPIG++GG + YAYPLNPV +DPLG Sbjct: 78 GRYITQDPIGMKGGLNSYAYPLNPVESVDPLG 109 >UniRef50_UPI00019F17FA RhsC core protein with extension n=1 Tax=Citrobacter youngae ATCC 29220 RepID=UPI00019F17FA Length = 274 Score = 130 bits (328), Expect = 3e-29, Method: Compositional matrix adjust. Identities = 62/95 (65%), Positives = 73/95 (76%) Query: 2 LALMDADGNIAWSGEYDEWGNQLNEENPHHLHQPYRLPGQQYDKESGLYYNRNRYYDPLQ 61 LAL++ DG I+W EYDEWGN L E+NPH+L + RLPGQQ D+ESGLYYNR+RY P Q Sbjct: 27 LALINQDGAISWRAEYDEWGNVLREDNPHNLQRLIRLPGQQCDEESGLYYNRHRYDSPGQ 86 Query: 62 GRYITQDPIGLEGGWSLYAYPLNPVNGIDPLGLSP 96 RY+T DPIGLEGG + Y YP NP+ IDPLGL P Sbjct: 87 DRYLTLDPIGLEGGLNPYTYPRNPIRKIDPLGLQP 121 >UniRef50_A9C0N8 YD repeat protein n=5 Tax=cellular organisms RepID=A9C0N8_DELAS Length = 1528 Score = 110 bits (276), Expect = 3e-23, Method: Compositional matrix adjust. Identities = 51/99 (51%), Positives = 67/99 (67%), Gaps = 1/99 (1%) Query: 2 LALMDADGNIAWSGEYDEWGNQLNEENPHHLHQPYRLPGQQYDKESGLYYNRNRYYDPLQ 61 +AL D G +AW+ + D WGN L E NP +HQ RLPGQ +D+E+GLYYNR+RYYDP+ Sbjct: 1281 MALTDQTGQVAWAAKLDPWGNVLQEYNPQGIHQAIRLPGQHHDRETGLYYNRHRYYDPVV 1340 Query: 62 GRYITQDPIGLEGGWSLYAYP-LNPVNGIDPLGLSPADV 99 G Y+ QDPIGL GG + Y +P + IDP GL+ + Sbjct: 1341 GSYVNQDPIGLAGGVNKILYSESSPTSKIDPTGLNTVAI 1379 >UniRef50_A1TV35 YD repeat protein n=2 Tax=Acidovorax RepID=A1TV35_ACIAC Length = 1679 Score = 110 bits (274), Expect = 5e-23, Method: Composition-based stats. Identities = 50/99 (50%), Positives = 66/99 (66%), Gaps = 4/99 (4%) Query: 2 LALMDADGN----IAWSGEYDEWGNQLNEENPHHLHQPYRLPGQQYDKESGLYYNRNRYY 57 +A++DA+G + W+ Y WG E NP+ + QP R GQQ+D E+GL+YNR RYY Sbjct: 1432 IAMVDANGRHSGLLTWAATYHSWGALREEYNPNDISQPIRFQGQQFDAETGLHYNRLRYY 1491 Query: 58 DPLQGRYITQDPIGLEGGWSLYAYPLNPVNGIDPLGLSP 96 DP G+Y+TQDPIGL GG Y YP++P IDP GL+P Sbjct: 1492 DPSLGQYLTQDPIGLLGGNDKYIYPVSPTGWIDPTGLNP 1530 >UniRef50_C6EE82 Rhs family protein-like protein n=2 Tax=Escherichia coli RepID=C6EE82_ECOBD Length = 290 Score = 109 bits (272), Expect = 8e-23, Method: Compositional matrix adjust. Identities = 49/58 (84%), Positives = 53/58 (91%) Query: 37 RLPGQQYDKESGLYYNRNRYYDPLQGRYITQDPIGLEGGWSLYAYPLNPVNGIDPLGL 94 RLPGQQYD+ESGLYYNR+RYYDPLQGRYITQDPIGL+GGW+ Y YPLNPV DPLGL Sbjct: 103 RLPGQQYDEESGLYYNRHRYYDPLQGRYITQDPIGLKGGWNFYQYPLNPVTNTDPLGL 160 >UniRef50_D0KEV6 RHS protein n=2 Tax=Enterobacteriaceae RepID=D0KEV6_PECWW Length = 1348 Score = 107 bits (267), Expect = 3e-22, Method: Compositional matrix adjust. Identities = 51/98 (52%), Positives = 64/98 (65%), Gaps = 5/98 (5%) Query: 4 LMDADGNIAWSGEYDEWGNQLNEEN---PHH--LHQPYRLPGQQYDKESGLYYNRNRYYD 58 + DG + W EY WGN + E PH +HQP R GQ +D E+GL+YNR RYYD Sbjct: 1112 MTGQDGGLVWRAEYRVWGNTVRVEQVEVPHSEPIHQPLRYQGQYFDAETGLHYNRFRYYD 1171 Query: 59 PLQGRYITQDPIGLEGGWSLYAYPLNPVNGIDPLGLSP 96 P GR+++QDPIGL GG +LY Y NP+ IDPLGL+P Sbjct: 1172 PDAGRFVSQDPIGLAGGINLYQYAPNPITWIDPLGLTP 1209 >UniRef50_D1T3Q3 YD repeat protein (Fragment) n=2 Tax=Acidovorax avenae subsp. avenae ATCC 19860 RepID=D1T3Q3_9BURK Length = 561 Score = 105 bits (261), Expect = 2e-21, Method: Compositional matrix adjust. Identities = 50/97 (51%), Positives = 65/97 (67%), Gaps = 4/97 (4%) Query: 2 LALMDADGN----IAWSGEYDEWGNQLNEENPHHLHQPYRLPGQQYDKESGLYYNRNRYY 57 +AL+DA+G + W+ + WG E +P + QP R GQQ D E+GL+YNR+RYY Sbjct: 311 IALVDANGPQAGLVTWAATHHAWGAVREEYDPLGIGQPIRFQGQQLDAETGLHYNRHRYY 370 Query: 58 DPLQGRYITQDPIGLEGGWSLYAYPLNPVNGIDPLGL 94 DP+ G+Y+TQDPIGL GG AYPLNP+ DPLGL Sbjct: 371 DPMLGQYVTQDPIGLMGGIHKQAYPLNPIQASDPLGL 407 >UniRef50_UPI0001BC4026 Rhs family protein n=6 Tax=Neisseria RepID=UPI0001BC4026 Length = 1477 Score = 102 bits (253), Expect = 1e-20, Method: Compositional matrix adjust. Identities = 48/98 (48%), Positives = 63/98 (64%), Gaps = 1/98 (1%) Query: 4 LMDADGNIAWSGEYDEWGNQLNEEN-PHHLHQPYRLPGQQYDKESGLYYNRNRYYDPLQG 62 + D DG + W G+YD WG + E N HQP+RL Q +D+E+GL+YN RYYDP G Sbjct: 1159 MTDEDGKLLWFGKYDVWGKLVKETNITGSAHQPFRLQNQYFDRETGLHYNFFRYYDPDIG 1218 Query: 63 RYITQDPIGLEGGWSLYAYPLNPVNGIDPLGLSPADVA 100 R++ QDPIGL+GG +LY + N DPLGL P D+ Sbjct: 1219 RFVNQDPIGLDGGENLYGFAPNAAVWSDPLGLEPMDIG 1256 >UniRef50_Q1LDW7 YD repeat n=2 Tax=Burkholderiaceae RepID=Q1LDW7_RALME Length = 1626 Score = 99.0 bits (245), Expect = 1e-19, Method: Composition-based stats. Identities = 47/101 (46%), Positives = 61/101 (60%), Gaps = 6/101 (5%) Query: 4 LMDADGNIAWSGEYDEWG---NQLNEENPHHLHQPYRLPGQQYDKESGLYYNRNRYYDPL 60 L+D G + W Y WG +P R PGQ +D E+GL+YNR+RYYDP Sbjct: 1414 LVDESGKVVWLARYKAWGGLKTPRKSTDPTETTNAIRFPGQYHDVETGLHYNRHRYYDPG 1473 Query: 61 QGRYITQDPIGLEGGWSLYAYPLNPVNGIDPLGL---SPAD 98 GR+I++DP+GL GG ++Y Y NPV +DPLGL SPAD Sbjct: 1474 SGRFISKDPVGLAGGINVYTYAPNPVGWVDPLGLRCDSPAD 1514 >UniRef50_A1TTQ1 Rhs family protein n=1 Tax=Acidovorax citrulli AAC00-1 RepID=A1TTQ1_ACIAC Length = 357 Score = 97.8 bits (242), Expect = 3e-19, Method: Compositional matrix adjust. Identities = 53/130 (40%), Positives = 66/130 (50%), Gaps = 36/130 (27%) Query: 2 LALMDADGNIAWSGEYDEWG------------------------------------NQLN 25 L L DA G++AW+ +Y WG NQL+ Sbjct: 96 LELTDAQGHVAWAADYKVWGEAALRKVLKSATGTDALPGPRHKGHGPVLDEHDAYKNQLS 155 Query: 26 EENPHHLHQPYRLPGQQYDKESGLYYNRNRYYDPLQGRYITQDPIGLEGGWSLYAYPLNP 85 L QP+R GQQ+D E+GL+YNR RYYDP GR+ +QDP+GL GG +AY NP Sbjct: 156 HSVSPFLEQPFRFQGQQFDAETGLHYNRFRYYDPSIGRFFSQDPVGLHGGIHGFAYAPNP 215 Query: 86 VNGIDPLGLS 95 N IDPLGLS Sbjct: 216 NNWIDPLGLS 225 >UniRef50_Q147B5 Rhs family protein n=2 Tax=Betaproteobacteria RepID=Q147B5_BURXL Length = 1362 Score = 97.8 bits (242), Expect = 3e-19, Method: Compositional matrix adjust. Identities = 50/105 (47%), Positives = 62/105 (59%), Gaps = 9/105 (8%) Query: 6 DADGNIAWSGEYDEWGNQLNEENPH--------HLHQPYRLPGQQYDKESGLYYNRNRYY 57 D G W Y WG L H HQP R GQ +D+E+GL+YNR+RYY Sbjct: 1142 DDAGRTQWRARYAAWGRLLGANGGHEQMHESGRQAHQPLRFQGQYFDEETGLHYNRHRYY 1201 Query: 58 DPLQGRYITQDPIGLEGGWSLYAYPLNPVNGIDPLGLSPADVALI 102 DP GR++TQDPIGL GG +LY Y NP IDPLGL+ D++L+ Sbjct: 1202 DPDAGRFMTQDPIGLRGGINLYRYAPNPGRWIDPLGLA-VDLSLV 1245 >UniRef50_Q2SPP2 Rhs family protein n=3 Tax=Hahella chejuensis KCTC 2396 RepID=Q2SPP2_HAHCH Length = 1434 Score = 97.4 bits (241), Expect = 3e-19, Method: Compositional matrix adjust. Identities = 44/92 (47%), Positives = 60/92 (65%), Gaps = 1/92 (1%) Query: 4 LMDADGNIAWSGEYDEWGNQLNEENPHHLHQPYRLPGQQYDKESGLYYNRNRYYDPLQGR 63 + D++G + WS Y +G L ++ +H P R GQ YD+E+G +YNR+RYYDP GR Sbjct: 1125 MTDSEGTLVWSARYKAYG-ALALQDVESVHNPLRFQGQYYDEETGFHYNRHRYYDPQSGR 1183 Query: 64 YITQDPIGLEGGWSLYAYPLNPVNGIDPLGLS 95 +I QDPIGL GG + Y Y NPV +DP GL+ Sbjct: 1184 FINQDPIGLLGGANAYQYAPNPVGWVDPFGLT 1215 >UniRef50_Q83LZ1 Putative Rhs-family protein n=1 Tax=Shigella flexneri RepID=Q83LZ1_SHIFL Length = 211 Score = 97.4 bits (241), Expect = 4e-19, Method: Compositional matrix adjust. Identities = 46/97 (47%), Positives = 63/97 (64%) Query: 4 LMDADGNIAWSGEYDEWGNQLNEENPHHLHQPYRLPGQQYDKESGLYYNRNRYYDPLQGR 63 + D+DG I W Y WGN + E++ + Q R GQ D+E+GL+YN +RYYDP GR Sbjct: 1 MTDSDGKIVWETGYQVWGNTIQEKDHGGVEQNLRYQGQYLDRETGLHYNLHRYYDPDVGR 60 Query: 64 YITQDPIGLEGGWSLYAYPLNPVNGIDPLGLSPADVA 100 ++ DPIGL GG +LY+Y NP+ DPLGL+P V+ Sbjct: 61 FMVTDPIGLRGGLNLYSYAPNPLKYADPLGLTPCAVS 97 >UniRef50_B2HXH5 Rhs family protein n=5 Tax=Acinetobacter baumannii RepID=B2HXH5_ACIBC Length = 1635 Score = 96.7 bits (239), Expect = 5e-19, Method: Compositional matrix adjust. Identities = 53/130 (40%), Positives = 74/130 (56%), Gaps = 19/130 (14%) Query: 4 LMDADGNIAWSGEYDEWGN-QLNEENPHHLHQP------YRLPGQQYDKESGLYYNRNRY 56 + D G I W EY WG +L + N + R GQ +D+E+GL+YNR RY Sbjct: 1383 MSDQTGAIIWKAEYKAWGECKLEQTNSDFFEKSEIISNNIRFQGQYFDEETGLHYNRYRY 1442 Query: 57 YDPLQGRYITQDPIGLEGGWSLYAYPLNPVNGIDPLGLSPADVALIRRK-DQLNHQ---- 111 Y P GR+I++DPIGL GG+++YAY NPV +DP GL+P +L+R K D++ Q Sbjct: 1443 YSPYVGRFISKDPIGLLGGFNVYAYTANPVQWVDPYGLAPC--SLVRYKPDKVTPQAGSR 1500 Query: 112 -----RAWDI 116 RAW + Sbjct: 1501 QDAIDRAWSL 1510 >UniRef50_Q13ML3 YD repeat protein n=10 Tax=Proteobacteria RepID=Q13ML3_BURXL Length = 1531 Score = 96.3 bits (238), Expect = 7e-19, Method: Compositional matrix adjust. Identities = 48/94 (51%), Positives = 59/94 (62%), Gaps = 2/94 (2%) Query: 4 LMDADGNIAWSGEYDEWGNQLNEENP--HHLHQPYRLPGQQYDKESGLYYNRNRYYDPLQ 61 L ADG I W Y WGN + E P + + Q R GQ D+E+GL+YN RYYDP Sbjct: 1312 LTSADGRIVWQAMYQLWGNTVRESEPESYAVRQNLRYQGQYLDRETGLHYNTLRYYDPDI 1371 Query: 62 GRYITQDPIGLEGGWSLYAYPLNPVNGIDPLGLS 95 GR+ T DPIGL GG +LY Y NP++ IDP+GLS Sbjct: 1372 GRFTTPDPIGLAGGVNLYRYAPNPMSWIDPMGLS 1405 >UniRef50_Q4ZLF3 YD repeat n=5 Tax=Pseudomonas syringae group RepID=Q4ZLF3_PSEU2 Length = 451 Score = 95.9 bits (237), Expect = 9e-19, Method: Compositional matrix adjust. Identities = 47/93 (50%), Positives = 60/93 (64%), Gaps = 1/93 (1%) Query: 2 LALMDADGNIAWSGEYDEWGNQLNEENPHHLHQPYRLPGQQYDKESGLYYNRNRYYDPLQ 61 L + DA+G I W +Y WG + + + + Q R GQ +D E+GL+YN RYYDP Sbjct: 222 LEMTDAEGQIVWQAKYRAWG-AVEKLVVNEVEQNLRFQGQYFDAETGLHYNTFRYYDPEI 280 Query: 62 GRYITQDPIGLEGGWSLYAYPLNPVNGIDPLGL 94 GR+ITQDPIGL GG++LY Y NP IDPLGL Sbjct: 281 GRFITQDPIGLLGGFNLYGYCRNPTAWIDPLGL 313 >UniRef50_C6AKX3 Rhs family protein n=4 Tax=Aggregatibacter aphrophilus NJ8700 RepID=C6AKX3_AGGAN Length = 1917 Score = 95.9 bits (237), Expect = 9e-19, Method: Compositional matrix adjust. Identities = 47/98 (47%), Positives = 64/98 (65%), Gaps = 4/98 (4%) Query: 4 LMDADGNIAWSGEYDEWGNQLNE---ENPHHLHQPYRLPGQQYDKESGLYYNRNRYYDPL 60 + D+DG + W G YD WG+ + + E HQP+RL Q +D+E+GL+YN RYY+P+ Sbjct: 1689 MTDSDGKLIWKGRYDAWGSLIRDSYRETASDSHQPFRLQNQYFDEETGLHYNFFRYYEPV 1748 Query: 61 QGRYITQDPIGLEGGWSLYAYPLNPVNGIDPLGL-SPA 97 GR+ITQDPI L GG +LY + N DPLGL +PA Sbjct: 1749 LGRFITQDPIKLAGGNNLYRFEGTVQNQTDPLGLFAPA 1786 >UniRef50_B4SV70 Rhs-family protein n=42 Tax=Enterobacteriaceae RepID=B4SV70_SALNS Length = 1359 Score = 95.9 bits (237), Expect = 1e-18, Method: Compositional matrix adjust. Identities = 46/94 (48%), Positives = 62/94 (65%), Gaps = 2/94 (2%) Query: 4 LMDADGNIAWSGEYDEWGNQLNE--ENPHHLHQPYRLPGQQYDKESGLYYNRNRYYDPLQ 61 + ADG + W+G +G + + + HQP RLPGQ +D E+GL+YN RYY P Sbjct: 1159 VTAADGTLVWAGYIRGFGENAADISNSGAYFHQPLRLPGQYFDDETGLHYNLFRYYAPEC 1218 Query: 62 GRYITQDPIGLEGGWSLYAYPLNPVNGIDPLGLS 95 GR+++QDPIGL GG +LYAY NP+ IDPLGL+ Sbjct: 1219 GRFVSQDPIGLRGGLNLYAYAPNPIRWIDPLGLA 1252 >UniRef50_B2VH58 Rhs family protein n=5 Tax=Enterobacteriaceae RepID=B2VH58_ERWT9 Length = 1322 Score = 95.5 bits (236), Expect = 1e-18, Method: Compositional matrix adjust. Identities = 46/97 (47%), Positives = 60/97 (61%), Gaps = 6/97 (6%) Query: 4 LMDADGNIAWSGEYDEWGNQLN------EENPHHLHQPYRLPGQQYDKESGLYYNRNRYY 57 L + G+I W +Y WGN E + +HQP R GQ +D E+GL+YNR RYY Sbjct: 1108 LSNRSGDICWQADYRVWGNTRQVSYAQQEADAETIHQPLRYQGQYFDGETGLHYNRFRYY 1167 Query: 58 DPLQGRYITQDPIGLEGGWSLYAYPLNPVNGIDPLGL 94 DP GR+I++DP+GL GG +LY Y NP +DPLGL Sbjct: 1168 DPDIGRFISRDPVGLSGGMNLYQYAPNPYGWVDPLGL 1204 >UniRef50_C8QGI4 YD repeat protein n=1 Tax=Pantoea sp. At-9b RepID=C8QGI4_9ENTR Length = 465 Score = 95.5 bits (236), Expect = 1e-18, Method: Compositional matrix adjust. Identities = 48/102 (47%), Positives = 60/102 (58%), Gaps = 8/102 (7%) Query: 2 LALMDADGNIAWSGEYDEWGNQLNEENPHHL--------HQPYRLPGQQYDKESGLYYNR 53 L + DA G + WSG+Y +G + + + HQP R GQ D E+GL+YN Sbjct: 245 LEVTDASGKLRWSGQYGSFGEVTRQTDGVYRRASQTSLSHQPLRYAGQYADAETGLHYNL 304 Query: 54 NRYYDPLQGRYITQDPIGLEGGWSLYAYPLNPVNGIDPLGLS 95 RYYDP GR+ QDPIGL GGW+LY Y NP+ IDP GLS Sbjct: 305 FRYYDPQTGRFTVQDPIGLAGGWNLYQYAPNPLTWIDPTGLS 346 >UniRef50_A1TQS0 YD repeat protein n=2 Tax=Acidovorax RepID=A1TQS0_ACIAC Length = 1602 Score = 95.5 bits (236), Expect = 1e-18, Method: Composition-based stats. Identities = 47/130 (36%), Positives = 66/130 (50%), Gaps = 36/130 (27%) Query: 2 LALMDADGNIAWSGEYDEWGNQ------------------------------------LN 25 L L D +G IAW+ +Y WG + Sbjct: 1337 LELTDVNGQIAWAVDYKVWGEATLRAVPRSDTGTDGVPGPRRQGHGPEAKSHAADSEVVC 1396 Query: 26 EENPHHLHQPYRLPGQQYDKESGLYYNRNRYYDPLQGRYITQDPIGLEGGWSLYAYPLNP 85 P + QP+R GQQ+D+E+GL+YNR+RYYDP GR+I++DPIG GG + + YPL+P Sbjct: 1397 AREPQRVEQPFRFQGQQFDEETGLHYNRSRYYDPAVGRFISEDPIGFLGGINTFIYPLDP 1456 Query: 86 VNGIDPLGLS 95 + IDP GL+ Sbjct: 1457 YSWIDPTGLA 1466 >UniRef50_Q0JZD5 RHS family protein n=9 Tax=Bacteria RepID=Q0JZD5_RALEH Length = 1585 Score = 95.1 bits (235), Expect = 2e-18, Method: Compositional matrix adjust. Identities = 45/96 (46%), Positives = 60/96 (62%), Gaps = 4/96 (4%) Query: 4 LMDADGNIAWSGEYDEWGNQL----NEENPHHLHQPYRLPGQQYDKESGLYYNRNRYYDP 59 L D G +AWS +Y WG N + P R GQ YD E+GL+YNR+RYYDP Sbjct: 1377 LTDEAGELAWSAQYKAWGAAQEAISNAARKAGIQNPLRFQGQYYDHENGLHYNRHRYYDP 1436 Query: 60 LQGRYITQDPIGLEGGWSLYAYPLNPVNGIDPLGLS 95 GR++++DPIGL GG +L Y NP++ +DPLGL+ Sbjct: 1437 GTGRFVSKDPIGLAGGLNLNQYAPNPISWVDPLGLA 1472 >UniRef50_B2K1J9 YD repeat protein n=23 Tax=Yersinia RepID=B2K1J9_YERPB Length = 1494 Score = 94.7 bits (234), Expect = 2e-18, Method: Compositional matrix adjust. Identities = 44/93 (47%), Positives = 61/93 (65%), Gaps = 1/93 (1%) Query: 2 LALMDADGNIAWSGEYDEWGNQLNEENPHHLHQPYRLPGQQYDKESGLYYNRNRYYDPLQ 61 + L D +G + W ++ +G QL+ L QP R+ GQ YD ESGL+YNR RYYDP Sbjct: 1281 IRLQDGEGEVVWEAQFTPFG-QLSVTGTSQLRQPLRMQGQYYDTESGLHYNRYRYYDPAC 1339 Query: 62 GRYITQDPIGLEGGWSLYAYPLNPVNGIDPLGL 94 G +I+QDPIGL+GG + Y + +N + +DPLGL Sbjct: 1340 GVFISQDPIGLKGGLNPYQFAVNTLGWVDPLGL 1372 >UniRef50_C0Q6Q9 Rhs-family protein n=27 Tax=Enterobacteriaceae RepID=C0Q6Q9_SALPC Length = 1593 Score = 94.7 bits (234), Expect = 2e-18, Method: Compositional matrix adjust. Identities = 44/91 (48%), Positives = 58/91 (63%) Query: 4 LMDADGNIAWSGEYDEWGNQLNEENPHHLHQPYRLPGQQYDKESGLYYNRNRYYDPLQGR 63 + D GNI W Y WGN +E+ + Q R GQ D+E+GL+YN R+YDP G+ Sbjct: 1355 MTDGGGNIVWEAGYQVWGNLTHEKETRPVQQNLRFQGQYLDRETGLHYNLYRFYDPDIGK 1414 Query: 64 YITQDPIGLEGGWSLYAYPLNPVNGIDPLGL 94 +I+ DPI L GG +LYAY NP++ IDPLGL Sbjct: 1415 FISGDPISLRGGINLYAYAPNPISWIDPLGL 1445 >UniRef50_C5CXA7 YD repeat protein n=1 Tax=Variovorax paradoxus S110 RepID=C5CXA7_VARPS Length = 1434 Score = 94.7 bits (234), Expect = 2e-18, Method: Composition-based stats. Identities = 57/133 (42%), Positives = 75/133 (56%), Gaps = 3/133 (2%) Query: 2 LALMDADGNIAWSGEYDEWGNQLNEENPHHLHQPYRLPGQQYDKESGLYYNRNRYYDPLQ 61 L + DG I W+ YD WG + ++ L QP R GQQ D E+GL+YNR+RY+DP+ Sbjct: 1223 LRMTTRDGQIVWAVRYDVWGG-IARKDCELLAQPIRCQGQQEDAETGLFYNRHRYFDPII 1281 Query: 62 GRYITQDPIGLEGGWSLYAYPL-NPVNGIDPLGLSPADVALIRRKD-QLNHQRAWDILSD 119 G Y++ DPIGL GG + YAY NP IDPLGL+ + +R D QL R I Sbjct: 1282 GAYVSADPIGLRGGVNPYAYGCSNPYFWIDPLGLAASCERHLRSIDEQLQQGRRARIDVA 1341 Query: 120 TYEDMKRLNLGGT 132 + +D L L T Sbjct: 1342 SKDDAIELLLAYT 1354 >UniRef50_B1J8X0 YD repeat protein n=50 Tax=Gammaproteobacteria RepID=B1J8X0_PSEPW Length = 1411 Score = 94.0 bits (232), Expect = 3e-18, Method: Compositional matrix adjust. Identities = 44/101 (43%), Positives = 62/101 (61%), Gaps = 1/101 (0%) Query: 2 LALMDADGNIAWSGEYDEWGNQLNEENPHHLHQPYRLPGQQYDKESGLYYNRNRYYDPLQ 61 L L D++G I W Y WG + + + + Q R GQ +D+E+ L+YN RYYDP Sbjct: 1185 LELTDSEGKIVWQATYRSWG-AIEQLTVNEIDQNLRFQGQYFDRETSLHYNTLRYYDPDV 1243 Query: 62 GRYITQDPIGLEGGWSLYAYPLNPVNGIDPLGLSPADVALI 102 GR+I DPIGL GG +L+ Y +NP+ IDP GL+P V ++ Sbjct: 1244 GRFIGPDPIGLRGGVNLFRYNVNPIYWIDPTGLAPCQVRVV 1284 >UniRef50_C5CZG8 RHS protein n=1 Tax=Variovorax paradoxus S110 RepID=C5CZG8_VARPS Length = 1609 Score = 94.0 bits (232), Expect = 4e-18, Method: Compositional matrix adjust. Identities = 46/96 (47%), Positives = 61/96 (63%), Gaps = 4/96 (4%) Query: 4 LMDADGNIAWSGEYDEWG---NQLNEENPHH-LHQPYRLPGQQYDKESGLYYNRNRYYDP 59 L D +G IAWS +Y WG ++E P R GQ +D E+GL+YNR+RYYDP Sbjct: 1390 LTDHEGRIAWSAQYKAWGEAKQAISEAGRKAGFRNPIRFQGQYFDDETGLHYNRHRYYDP 1449 Query: 60 LQGRYITQDPIGLEGGWSLYAYPLNPVNGIDPLGLS 95 GR++++DPIGL GG +L Y NP+ IDPLGL+ Sbjct: 1450 SCGRFVSKDPIGLAGGSNLQQYAPNPLGWIDPLGLA 1485 >UniRef50_UPI0001B53B37 YD repeat protein n=1 Tax=Streptomyces sp. AA4 RepID=UPI0001B53B37 Length = 1465 Score = 93.6 bits (231), Expect = 5e-18, Method: Composition-based stats. Identities = 53/133 (39%), Positives = 70/133 (52%), Gaps = 15/133 (11%) Query: 4 LMDADGNIAWSGEYDEWGNQLNEENPHHLHQPYRLPGQQYDKESGLYYNRNRYYDPLQGR 63 L+D GN+AW + WG + + P R PGQ +D+ESGLYYN RYYDP GR Sbjct: 1268 LIDPGGNVAWHADRTLWGYRAGASQ-GGVSVPMRFPGQYHDEESGLYYNYFRYYDPETGR 1326 Query: 64 YITQDPIGLEGGWSLYAYPLNPVNGIDPLGLSPADVALIRRKDQLNHQRAWDILSDTYE- 122 Y + DP+GL GG + +AY NP + +DP GLS +QR W SD + Sbjct: 1327 YASPDPLGLHGGDNPHAYVANPTSWLDPFGLSA-------------NQRRWIDHSDGWRL 1373 Query: 123 DMKRLNLGGTDQF 135 + R +GG F Sbjct: 1374 GIDRFPIGGGSDF 1386 >UniRef50_Q7NY44 Probable Rhs-family protein n=2 Tax=Chromobacterium violaceum RepID=Q7NY44_CHRVO Length = 1513 Score = 93.6 bits (231), Expect = 5e-18, Method: Compositional matrix adjust. Identities = 45/97 (46%), Positives = 62/97 (63%), Gaps = 4/97 (4%) Query: 3 ALMDADGNIAWSGEYDEWGNQ----LNEENPHHLHQPYRLPGQQYDKESGLYYNRNRYYD 58 AL D G +A +Y WG + + P+R GQ +D ESGL+YNR+RYYD Sbjct: 1297 ALTDEHGALALEMDYQAWGQAREVIADAAGKAGIRNPFRFQGQYHDDESGLHYNRHRYYD 1356 Query: 59 PLQGRYITQDPIGLEGGWSLYAYPLNPVNGIDPLGLS 95 P GR+I++DPIGL+GG ++Y Y LNP+ +DPLGL+ Sbjct: 1357 PEIGRFISRDPIGLKGGINIYGYALNPIVWMDPLGLT 1393 >UniRef50_Q2SFR1 Rhs family protein n=9 Tax=cellular organisms RepID=Q2SFR1_HAHCH Length = 138 Score = 92.8 bits (229), Expect = 8e-18, Method: Compositional matrix adjust. Identities = 45/91 (49%), Positives = 60/91 (65%), Gaps = 1/91 (1%) Query: 4 LMDADGNIAWSGEYDEWGNQLNEENPHHLHQPYRLPGQQYDKESGLYYNRNRYYDPLQGR 63 + +A+G + WS Y +GN L ++ + P R GQ YD+E+GL+YNR+RYYDP R Sbjct: 46 MTNAEGEVVWSARYKAYGN-LALKDVEDVQNPLRFQGQYYDEETGLHYNRHRYYDPSAAR 104 Query: 64 YITQDPIGLEGGWSLYAYPLNPVNGIDPLGL 94 +I QDP GL GG S Y Y LNP+ +DPLGL Sbjct: 105 FINQDPAGLLGGESNYEYVLNPIEWVDPLGL 135 >UniRef50_A7FDJ9 RHS/YD repeat protein n=30 Tax=Enterobacteriaceae RepID=A7FDJ9_YERP3 Length = 1418 Score = 92.8 bits (229), Expect = 9e-18, Method: Compositional matrix adjust. Identities = 49/102 (48%), Positives = 60/102 (58%), Gaps = 7/102 (6%) Query: 2 LALMDADGNIAWSGEYDEWG--NQLNEENPHH-----LHQPYRLPGQQYDKESGLYYNRN 54 L + D +G WSG+Y WG + N +P QP R PGQ D E+GL+YN Sbjct: 1192 LDVTDGEGKHRWSGKYHAWGKVTRQNVSDPRQSTVSRFAQPLRYPGQYSDDETGLHYNTF 1251 Query: 55 RYYDPLQGRYITQDPIGLEGGWSLYAYPLNPVNGIDPLGLSP 96 RYYDP GR+ TQDPIGL GG +LY Y NP+ +DPLG P Sbjct: 1252 RYYDPEIGRFSTQDPIGLAGGINLYQYGPNPLGWVDPLGWMP 1293 >UniRef50_Q2SFR5 Rhs family protein n=1 Tax=Hahella chejuensis KCTC 2396 RepID=Q2SFR5_HAHCH Length = 471 Score = 92.4 bits (228), Expect = 1e-17, Method: Compositional matrix adjust. Identities = 43/92 (46%), Positives = 59/92 (64%), Gaps = 1/92 (1%) Query: 4 LMDADGNIAWSGEYDEWGNQLNEENPHHLHQPYRLPGQQYDKESGLYYNRNRYYDPLQGR 63 + +A+G + WS Y +GN L ++ + P R GQ YD+E+GL+YNR RYYDP R Sbjct: 46 MTNAEGEVVWSARYKAYGN-LALKDVEDVQNPLRFQGQYYDEETGLHYNRRRYYDPSAAR 104 Query: 64 YITQDPIGLEGGWSLYAYPLNPVNGIDPLGLS 95 +I QDP+GL GG + Y Y LNP +DP GL+ Sbjct: 105 FINQDPVGLLGGDNNYQYALNPTGWVDPYGLT 136 >UniRef50_B5MRU6 Rhs-family protein n=9 Tax=Gammaproteobacteria RepID=B5MRU6_SALET Length = 216 Score = 91.7 bits (226), Expect = 2e-17, Method: Compositional matrix adjust. Identities = 57/153 (37%), Positives = 79/153 (51%), Gaps = 5/153 (3%) Query: 4 LMDADGNIAWSGEYDEWGNQLNEENPHHLHQPYRLPGQQYDKESGLYYNRNRYYDPLQGR 63 + D GNI W Y WGN +E+ + Q R GQ D+E+GL+YN R+YDP G+ Sbjct: 1 MTDGGGNIVWEAGYQVWGNLTHEKETRPVQQNLRFQGQYLDRETGLHYNLYRFYDPDIGK 60 Query: 64 YITQDPIGLEGGWSLYAYPLNPVNGIDPLGLSPADVALIRRKDQLNHQRAWDI----LSD 119 +I+ DPI + GG +LY Y NP+ IDPLGL + K + H+ DI LSD Sbjct: 61 FISGDPISIRGGINLYQYAPNPIKWIDPLGLYNGEGQRELGKYHVFHEHNLDITEYGLSD 120 Query: 120 TYEDMKRLNLGGTDQFFHCMAFCRVSKLNDAGV 152 E R N +++ + AF R + GV Sbjct: 121 A-EHFSRGNQAISERMKNDPAFRREMQTKYPGV 152 >UniRef50_Q39K64 Rhs family protein n=22 Tax=Burkholderia RepID=Q39K64_BURS3 Length = 1560 Score = 91.3 bits (225), Expect = 3e-17, Method: Compositional matrix adjust. Identities = 43/101 (42%), Positives = 61/101 (60%), Gaps = 6/101 (5%) Query: 4 LMDADGNIAWSGEYDEWGN------QLNEENPHHLHQPYRLPGQQYDKESGLYYNRNRYY 57 L D +G++ W Y WG + ++ R GQQ D E+GL+YNR+RYY Sbjct: 1332 LTDDEGDVVWEASYKAWGEAREVIARASKVAGIVPRSSLRFQGQQVDDETGLHYNRHRYY 1391 Query: 58 DPLQGRYITQDPIGLEGGWSLYAYPLNPVNGIDPLGLSPAD 98 DP GR++++DPIGL GG ++Y Y NPV +DPLGLS ++ Sbjct: 1392 DPRSGRFVSKDPIGLAGGINVYQYAPNPVKWVDPLGLSKSE 1432 >UniRef50_Q2SFS1 Rhs family protein n=1 Tax=Hahella chejuensis KCTC 2396 RepID=Q2SFS1_HAHCH Length = 295 Score = 90.9 bits (224), Expect = 4e-17, Method: Compositional matrix adjust. Identities = 48/125 (38%), Positives = 72/125 (57%), Gaps = 2/125 (1%) Query: 4 LMDADGNIAWSGEYDEWGNQLNEENPHHLHQPYRLPGQQYDKESGLYYNRNRYYDPLQGR 63 + +A+G + WS Y +GN L ++ + P R GQ YD+E+GL+YNR RYYDP R Sbjct: 1 MTNAEGEVVWSARYKAYGN-LALQDVEDVQNPLRFQGQYYDEETGLHYNRRRYYDPSAAR 59 Query: 64 YITQDPIGLEGGWSLYAYPLNPVNGIDPLGLSPADVALIR-RKDQLNHQRAWDILSDTYE 122 +I QDP+GL GG + Y Y NP +DP GL+ + + + +KD H + +Y+ Sbjct: 60 FINQDPVGLLGGDNNYQYAPNPTGWVDPYGLTCKENSWNQFQKDTKGHFANSTEAAKSYQ 119 Query: 123 DMKRL 127 MK + Sbjct: 120 KMKEV 124 >UniRef50_D0KES6 YD repeat protein n=15 Tax=Gammaproteobacteria RepID=D0KES6_PECWW Length = 1379 Score = 90.5 bits (223), Expect = 4e-17, Method: Compositional matrix adjust. Identities = 48/102 (47%), Positives = 64/102 (62%), Gaps = 8/102 (7%) Query: 2 LALMDADGNIAWSGEYDEWG--NQLNEENPHHLH------QPYRLPGQQYDKESGLYYNR 53 L + DA+G + WSG+Y +G N +++ H Q R GQ D+E+GL+YN Sbjct: 1175 LEMTDAEGAVRWSGDYGSFGAINGQTQDSEGLRHGKPVESQSLRYAGQYADEETGLHYNL 1234 Query: 54 NRYYDPLQGRYITQDPIGLEGGWSLYAYPLNPVNGIDPLGLS 95 RYYDP GR+ TQDPIGL GG +LYAY NP+ +DPLGL+ Sbjct: 1235 FRYYDPTVGRFTTQDPIGLAGGLNLYAYAPNPLGWVDPLGLA 1276 >UniRef50_A6GLW0 Rhs family protein n=1 Tax=Limnobacter sp. MED105 RepID=A6GLW0_9BURK Length = 1598 Score = 90.5 bits (223), Expect = 4e-17, Method: Compositional matrix adjust. Identities = 51/120 (42%), Positives = 71/120 (59%), Gaps = 4/120 (3%) Query: 4 LMDADGNIAWSGEYDEWGNQLNEENPHHLHQPYRLPGQQYDKESGLYYNRNRYYDPLQGR 63 + +A + WS + +G L + + + P R GQ +D E+GL+YNR+RYYDP G+ Sbjct: 1371 ITNASAEVVWSSTFKTYG-ALVLAHVNEVENPLRFQGQYFDSETGLHYNRHRYYDPNCGQ 1429 Query: 64 YITQDPIGLEGGWSLYAYPLNPVNGIDPLGLSPADVA-LIRRKDQLNHQRAWDILSDTYE 122 + TQDPIGL GG + Y Y NP+ +DP GLS D A I K QL +R D L+ T+E Sbjct: 1430 FTTQDPIGLLGGMNTYQYAPNPMTWVDPWGLSCKDQANSIETKRQL--ERLPDDLAGTFE 1487 >UniRef50_Q6LUC4 Putative uncharacterized protein n=1 Tax=Photobacterium profundum RepID=Q6LUC4_PHOPR Length = 532 Score = 90.5 bits (223), Expect = 4e-17, Method: Compositional matrix adjust. Identities = 48/129 (37%), Positives = 72/129 (55%), Gaps = 3/129 (2%) Query: 3 ALMDADGNIAWSGEYDEWGNQLNEENPHHLHQPYRLPGQQYDKESGLYYNRNRYYDPLQG 62 + ++DG + W Y+ G + +H P R GQ +D+ESGL+YNR RYYDP G Sbjct: 272 TVTNSDGEVVWQATYNALGCAFISIDI--IHNPLRFQGQYHDQESGLHYNRFRYYDPSIG 329 Query: 63 RYITQDPIGLEGGWSLYAYPLNPVNGIDPLGLSPADVALIRRKDQLNHQRAWDILSDTYE 122 ++I QDPIGL GG + Y Y NP+ +DPLGLS + +I K + W+ + + Sbjct: 330 QFIHQDPIGLLGGINHYRYAPNPIQWVDPLGLSCKE-GIIELKKSYSSLNWWEKIKRGLD 388 Query: 123 DMKRLNLGG 131 +++GG Sbjct: 389 IFDSVDVGG 397 >UniRef50_Q6LUC6 Hypothetical nucleotidyltransferase n=1 Tax=Photobacterium profundum RepID=Q6LUC6_PHOPR Length = 1352 Score = 90.5 bits (223), Expect = 4e-17, Method: Compositional matrix adjust. Identities = 47/109 (43%), Positives = 65/109 (59%), Gaps = 3/109 (2%) Query: 3 ALMDADGNIAWSGEYDEWGNQLNEENPHHLHQPYRLPGQQYDKESGLYYNRNRYYDPLQG 62 + D++G + W Y+ G + +H P R GQ +D+ESGL+YNR RYYDP G Sbjct: 1070 TVTDSEGEVVWQATYNALGCAAISIDI--IHNPLRFQGQYHDQESGLHYNRFRYYDPSIG 1127 Query: 63 RYITQDPIGLEGGWSLYAYPLNPVNGIDPLGLSPADVALIRRKDQ-LNH 110 ++I QDPIGL GG + Y Y NP+ +DPLGLS + A + K +NH Sbjct: 1128 QFIHQDPIGLLGGINHYRYAPNPIQWVDPLGLSCKEAAFEKIKQSFVNH 1176 >UniRef50_D1T3N5 YD repeat protein n=1 Tax=Acidovorax avenae subsp. avenae ATCC 19860 RepID=D1T3N5_9BURK Length = 598 Score = 89.7 bits (221), Expect = 7e-17, Method: Compositional matrix adjust. Identities = 48/104 (46%), Positives = 64/104 (61%), Gaps = 5/104 (4%) Query: 2 LALMDADGN----IAWSGEYDEWGNQLNEENPHHLHQPYRLPGQQYDKESGLYYNRNRYY 57 +AL+DA+G + W+ Y WG E +PH + Q R GQQ+D E+GL+YNR RYY Sbjct: 357 IALVDANGPQAGLVTWAATYHAWGAVREEYDPHGIGQDIRFQGQQFDAETGLHYNRFRYY 416 Query: 58 DPLQGRYITQDPIGLEGGWSLYAYP-LNPVNGIDPLGLSPADVA 100 DP+ G+Y+TQDPIGL+GG + Y +P DP GL D A Sbjct: 417 DPMLGQYVTQDPIGLKGGLNKSNYSGSSPAINCDPKGLDFKDKA 460 >UniRef50_A9EW02 Conserved exported carbohydrate-binding protein,Rhs family n=1 Tax=Sorangium cellulosum 'So ce 56' RepID=A9EW02_SORC5 Length = 1310 Score = 89.7 bits (221), Expect = 7e-17, Method: Composition-based stats. Identities = 45/116 (38%), Positives = 71/116 (61%), Gaps = 4/116 (3%) Query: 4 LMDADGNIAWSGEYDEWGNQLNEENPHHLHQPYRLPGQQYDKESGLYYNRNRYYDPLQGR 63 L+ G +AW ++ WG L EE+P + P+ L G D+E+GL Y R+RY+DP R Sbjct: 1095 LVSGRGQVAWRADHTLWGRVLAEESPAGVRAPFSLLGHYVDEETGLAYVRHRYFDPETAR 1154 Query: 64 YITQDPIGLEGGWSLYAYPLNPVNGIDPLGL-SPAD-VALIR--RKDQLNHQRAWD 115 ++++DP+G +GG +L+ + P +DPLGL + AD V +R R+ ++ QRA D Sbjct: 1155 WLSRDPLGFDGGPNLFGFDGMPTEEVDPLGLMTRADFVQFMRQYRRQRIQAQRASD 1210 >UniRef50_C0EPY5 Putative uncharacterized protein n=1 Tax=Neisseria flavescens NRL30031/H210 RepID=C0EPY5_NEIFL Length = 193 Score = 89.4 bits (220), Expect = 9e-17, Method: Compositional matrix adjust. Identities = 48/104 (46%), Positives = 62/104 (59%), Gaps = 2/104 (1%) Query: 4 LMDADGNIAWSGEYDEWGNQLNEENPHH-LHQPYRLPGQQYDKESGLYYNRNRYYDPLQG 62 + D GN+ W GEY WG +E + HQP+RL Q YD E+GL+YN RYYD G Sbjct: 1 MTDIHGNLLWYGEYTAWGRLKKDERVYKDAHQPFRLQNQYYDSETGLHYNYFRYYDSETG 60 Query: 63 RYITQDPIGLEGGWSLYAYPLNPVNGIDPLGLSPADVALIRRKD 106 R+++QD IGL GG + Y + N + IDPLGL L+ RKD Sbjct: 61 RFVSQDVIGLVGGENFYQFSPNTQSWIDPLGLKEL-YYLVARKD 103 >UniRef50_C7MXD3 Rhs family protein n=1 Tax=Saccharomonospora viridis DSM 43017 RepID=C7MXD3_SACVD Length = 1485 Score = 89.4 bits (220), Expect = 1e-16, Method: Composition-based stats. Identities = 43/93 (46%), Positives = 58/93 (62%), Gaps = 5/93 (5%) Query: 4 LMDADGNIAWSGEYDEWGNQLNEENPHH--LHQPYRLPGQQYDKESGLYYNRNRYYDPLQ 61 L+D GN+AW WG + + HH P+R PGQ D E+GL+YN +RYYDP Sbjct: 1278 LLDGSGNLAWRNRTTLWGKTVIK---HHGSASTPWRFPGQYSDPETGLHYNYHRYYDPDT 1334 Query: 62 GRYITQDPIGLEGGWSLYAYPLNPVNGIDPLGL 94 GRY++ DP+GL + YAY +NP+ +DPLGL Sbjct: 1335 GRYVSCDPLGLRPSPNHYAYVVNPLRWLDPLGL 1367 >UniRef50_C6M5B1 Rhs-related protein n=7 Tax=Proteobacteria RepID=C6M5B1_NEISI Length = 1934 Score = 89.4 bits (220), Expect = 1e-16, Method: Compositional matrix adjust. Identities = 48/94 (51%), Positives = 61/94 (64%), Gaps = 3/94 (3%) Query: 4 LMDADGNIAWSGEYDEWGNQLNEENPHHL--HQPYRLPGQQYDKESGLYYNRNRYYDPLQ 61 + D +GNI WSG+Y WG +L +E L +QP+RL Q YD+E+GL+YN RYYDP Sbjct: 1734 MTDEEGNIVWSGDYSGWG-KLTQEGRLKLDVYQPFRLQNQYYDEETGLHYNFFRYYDPEI 1792 Query: 62 GRYITQDPIGLEGGWSLYAYPLNPVNGIDPLGLS 95 GR+ QDPI L GG SLYA N +D LGL+ Sbjct: 1793 GRFTQQDPIKLLGGESLYALAPNVFVWLDTLGLA 1826 >UniRef50_C6M9F5 RHS family protein n=2 Tax=Bacteria RepID=C6M9F5_NEISI Length = 448 Score = 89.0 bits (219), Expect = 1e-16, Method: Compositional matrix adjust. Identities = 45/93 (48%), Positives = 59/93 (63%), Gaps = 3/93 (3%) Query: 4 LMDADGNIAWSGEYDEWGNQLNEEN--PHHLHQPYRLPGQQYDKESGLYYNRNRYYDPLQ 61 + D DGN+ W G Y WG +L EE +QP+RL Q D+E+GL+YN RYY+P Sbjct: 225 MTDKDGNLLWFGNYTGWG-RLKEETKVTDSAYQPFRLQNQYADRETGLHYNFFRYYEPDA 283 Query: 62 GRYITQDPIGLEGGWSLYAYPLNPVNGIDPLGL 94 GR++ QDPIGLEGG +LY + N N +D GL Sbjct: 284 GRFVNQDPIGLEGGENLYKFAPNAQNWVDIFGL 316 >UniRef50_D1YPL0 RHS repeat-associated core domain protein n=1 Tax=Veillonella parvula ATCC 17745 RepID=D1YPL0_9FIRM Length = 216 Score = 89.0 bits (219), Expect = 1e-16, Method: Compositional matrix adjust. Identities = 47/99 (47%), Positives = 63/99 (63%), Gaps = 3/99 (3%) Query: 4 LMDADGNIAWSGEYDEWGNQLNEEN--PHHLHQPYRLPGQQYDKESGLYYNRNRYYDPLQ 61 + D DGN+ W G Y WG +L EE +QP+RL Q D+E+GL+YN RYY+P Sbjct: 1 MTDKDGNLLWFGNYTGWG-RLKEETKVTDSAYQPFRLQNQYCDRETGLHYNFFRYYEPDA 59 Query: 62 GRYITQDPIGLEGGWSLYAYPLNPVNGIDPLGLSPADVA 100 GR++ QDPIGLEGG ++Y + N + IDPLGL + A Sbjct: 60 GRFVNQDPIGLEGGDNIYLFSPNIQSWIDPLGLLSWNTA 98 >UniRef50_A1TTP6 YD repeat protein n=2 Tax=Acidovorax citrulli AAC00-1 RepID=A1TTP6_ACIAC Length = 1554 Score = 88.6 bits (218), Expect = 2e-16, Method: Composition-based stats. Identities = 47/130 (36%), Positives = 64/130 (49%), Gaps = 36/130 (27%) Query: 2 LALMDADGNIAWSGEYDEWGN------------------------------------QLN 25 L L DA G IAW+ +Y WG + Sbjct: 1296 LELTDAQGYIAWAADYKVWGEATLRAVPRTATGTDGVSGERRRGHGPVMDVHEGGGEKAR 1355 Query: 26 EENPHHLHQPYRLPGQQYDKESGLYYNRNRYYDPLQGRYITQDPIGLEGGWSLYAYPLNP 85 P + QP+R GQQ+D+E+GL+YNR RYY+P GR+++QDPIGL GG + + Y +P Sbjct: 1356 PTPPAIIEQPFRFQGQQFDEETGLHYNRFRYYEPSVGRFVSQDPIGLLGGVNSFTYAPSP 1415 Query: 86 VNGIDPLGLS 95 N +DP GLS Sbjct: 1416 NNWMDPFGLS 1425 >UniRef50_D2TGW0 Putative Rhs protein n=2 Tax=Citrobacter RepID=D2TGW0_CITRO Length = 1477 Score = 88.6 bits (218), Expect = 2e-16, Method: Compositional matrix adjust. Identities = 45/93 (48%), Positives = 54/93 (58%) Query: 4 LMDADGNIAWSGEYDEWGNQLNEENPHHLHQPYRLPGQQYDKESGLYYNRNRYYDPLQGR 63 L+ DG I W GE WG + R PGQ D+ESGLYYNR RYYD G+ Sbjct: 1252 LLTEDGTIVWRGEQQLWGREEGRNRDDAPACRLRFPGQYEDEESGLYYNRFRYYDCEAGQ 1311 Query: 64 YITQDPIGLEGGWSLYAYPLNPVNGIDPLGLSP 96 Y+ DP+GL GG + Y Y NP+ IDPLGL+P Sbjct: 1312 YLCADPVGLAGGLNPYGYVNNPLKYIDPLGLNP 1344 >UniRef50_Q48LL6 Rhs family protein n=1 Tax=Pseudomonas syringae pv. phaseolicola 1448A RepID=Q48LL6_PSE14 Length = 362 Score = 88.2 bits (217), Expect = 2e-16, Method: Compositional matrix adjust. Identities = 44/94 (46%), Positives = 59/94 (62%), Gaps = 1/94 (1%) Query: 2 LALMDADGNIAWSGEYDEWGNQLNEENPHHLHQPYRLPGQQYDKESGLYYNRNRYYDPLQ 61 L + DA+G I W +Y WG + + + + Q R GQ +D E+GL+YN RYYDP Sbjct: 138 LEMTDAEGQIVWQAKYRAWG-AVEKLVVNEVEQNLRFQGQYFDVETGLHYNTFRYYDPEI 196 Query: 62 GRYITQDPIGLEGGWSLYAYPLNPVNGIDPLGLS 95 GR+ITQDPIGL+GG +LY Y NP +DP G + Sbjct: 197 GRFITQDPIGLDGGDNLYKYVPNPTAWVDPWGWA 230 >UniRef50_C3JXH8 Putative Rhs protein n=1 Tax=Pseudomonas fluorescens SBW25 RepID=C3JXH8_PSEFS Length = 1597 Score = 88.2 bits (217), Expect = 2e-16, Method: Compositional matrix adjust. Identities = 44/92 (47%), Positives = 57/92 (61%), Gaps = 1/92 (1%) Query: 4 LMDADGNIAWSGEYDEWGNQLNEENPHHLHQPYRLPGQQYDKESGLYYNRNRYYDPLQGR 63 L ADG I WS Y +G ++ + + P R GQ +D+ESGL+YNR+RYY P GR Sbjct: 1342 LTAADGEIVWSAHYRAYG-EITRLDIGKIDNPLRFQGQYFDQESGLHYNRHRYYHPDIGR 1400 Query: 64 YITQDPIGLEGGWSLYAYPLNPVNGIDPLGLS 95 Y+T DP+ L GG + Y Y NP +DPLGLS Sbjct: 1401 YLTPDPVKLTGGINAYQYVPNPTGWVDPLGLS 1432 >UniRef50_B1JCT8 RHS protein n=1 Tax=Pseudomonas putida W619 RepID=B1JCT8_PSEPW Length = 231 Score = 88.2 bits (217), Expect = 2e-16, Method: Compositional matrix adjust. Identities = 42/98 (42%), Positives = 58/98 (59%), Gaps = 1/98 (1%) Query: 2 LALMDADGNIAWSGEYDEWGNQLNEENPHHLHQPYRLPGQQYDKESGLYYNRNRYYDPLQ 61 + L +++G I W Y WG + E H + Q R GQ ++ E+GL+YN RYYDP Sbjct: 1 MELSNSEGEIVWQATYRSWG-AIEELKVHDIEQNLRFQGQYFESETGLHYNTLRYYDPEV 59 Query: 62 GRYITQDPIGLEGGWSLYAYPLNPVNGIDPLGLSPADV 99 GR++TQDPIGL G + Y Y +PV +DP GL+ V Sbjct: 60 GRFVTQDPIGLGDGMNFYQYAPSPVMWVDPWGLAFKSV 97 >UniRef50_Q2SKM2 Rhs family protein n=3 Tax=Hahella chejuensis KCTC 2396 RepID=Q2SKM2_HAHCH Length = 1552 Score = 88.2 bits (217), Expect = 2e-16, Method: Compositional matrix adjust. Identities = 42/92 (45%), Positives = 58/92 (63%), Gaps = 1/92 (1%) Query: 4 LMDADGNIAWSGEYDEWGNQLNEENPHHLHQPYRLPGQQYDKESGLYYNRNRYYDPLQGR 63 + +A+G + WS Y +GN L ++ + P R GQ YD+E+GL+YNR+RYYDP R Sbjct: 1268 MTNAEGEVVWSARYKAYGN-LALKDVEDVQNPLRFQGQYYDEETGLHYNRHRYYDPSAAR 1326 Query: 64 YITQDPIGLEGGWSLYAYPLNPVNGIDPLGLS 95 +I QDP+GL GG + Y Y NP DP GL+ Sbjct: 1327 FINQDPVGLLGGDNNYQYAPNPTGWGDPFGLT 1358 >UniRef50_B7LTT0 Putative uncharacterized protein n=4 Tax=Enterobacteriaceae RepID=B7LTT0_ESCF3 Length = 1543 Score = 87.8 bits (216), Expect = 3e-16, Method: Compositional matrix adjust. Identities = 46/95 (48%), Positives = 56/95 (58%), Gaps = 4/95 (4%) Query: 4 LMDADGNIAWSGEYDEWGNQLNEEN----PHHLHQPYRLPGQQYDKESGLYYNRNRYYDP 59 + D+DG I W WGN EEN H Q R GQ D+E+GL+YN RY+ P Sbjct: 1308 MTDSDGGIVWRARVQLWGNIRFEENRDIYSVHPQQNLRFAGQYLDRETGLHYNTFRYFLP 1367 Query: 60 LQGRYITQDPIGLEGGWSLYAYPLNPVNGIDPLGL 94 GR+ DPIGL GG +LYAY NP++ IDPLGL Sbjct: 1368 ESGRFSQPDPIGLAGGLNLYAYAPNPLSYIDPLGL 1402 >UniRef50_C9Y462 Putative uncharacterized protein n=1 Tax=Cronobacter turicensis RepID=C9Y462_CROTZ Length = 252 Score = 87.8 bits (216), Expect = 3e-16, Method: Compositional matrix adjust. Identities = 44/94 (46%), Positives = 56/94 (59%), Gaps = 2/94 (2%) Query: 4 LMDADGNIAWSGEYDEWGNQLNEENPHHLHQP--YRLPGQQYDKESGLYYNRNRYYDPLQ 61 + DADG W G++ WG E + P R GQ D+ESGL+YN RYYDP+ Sbjct: 27 VTDADGQTVWRGQFSTWGETERELSVPQWQVPQNLRFQGQYLDRESGLHYNLFRYYDPVA 86 Query: 62 GRYITQDPIGLEGGWSLYAYPLNPVNGIDPLGLS 95 GRY DPIGL GG + YAY +P+ +DPLGL+ Sbjct: 87 GRYTQMDPIGLAGGINTYAYVGDPLTWVDPLGLA 120 >UniRef50_Q87U70 Rhs family protein n=2 Tax=Pseudomonas RepID=Q87U70_PSESM Length = 1572 Score = 87.8 bits (216), Expect = 3e-16, Method: Compositional matrix adjust. Identities = 44/92 (47%), Positives = 57/92 (61%), Gaps = 1/92 (1%) Query: 4 LMDADGNIAWSGEYDEWGNQLNEENPHHLHQPYRLPGQQYDKESGLYYNRNRYYDPLQGR 63 L D G I WS +Y +GN L + + P R GQ +D E+GL+YNR+RYY+P GR Sbjct: 1322 LTDYSGEIMWSAKYRAYGN-LATLDIAEIENPLRFQGQYFDAETGLHYNRHRYYNPGTGR 1380 Query: 64 YITQDPIGLEGGWSLYAYPLNPVNGIDPLGLS 95 ++T DPI L GG + Y Y NP +DPLGLS Sbjct: 1381 FLTPDPIKLAGGLNNYQYVPNPTGWVDPLGLS 1412 >UniRef50_UPI0001C34A7C Rhs family protein n=1 Tax=Neisseria subflava NJ9703 RepID=UPI0001C34A7C Length = 335 Score = 87.8 bits (216), Expect = 3e-16, Method: Compositional matrix adjust. Identities = 53/135 (39%), Positives = 68/135 (50%), Gaps = 8/135 (5%) Query: 4 LMDADGNIAWSGEYDEWGNQLNEEN-PHHLHQPYRLPGQQYDKESGLYYNRNRYYDPLQG 62 + D DGN+ W G Y WG +E N HQP+RL Q D+E+GL+YN RYY+P G Sbjct: 93 MTDEDGNLLWFGNYTGWGKLKSETNISGTAHQPFRLQNQYCDRETGLHYNFFRYYEPDAG 152 Query: 63 RYITQDPIGLEGGWSLYAYPLNPVNGIDPLGLSPADVALIRRKDQLNHQRAWDILSDTYE 122 R++ QDPIGL GG + Y + N +DPLGL Q DI Sbjct: 153 RFVNQDPIGLFGGSNFYMFAFNISRWLDPLGLKGKSKRSCEEIGQ-------DIDRLINR 205 Query: 123 DMKRLNLGGTDQFFH 137 D ++ N GGT H Sbjct: 206 DKRKCNNGGTHGLRH 220 >UniRef50_Q6D1M4 Rhs protein n=5 Tax=Enterobacteriaceae RepID=Q6D1M4_ERWCT Length = 1618 Score = 87.4 bits (215), Expect = 3e-16, Method: Compositional matrix adjust. Identities = 46/107 (42%), Positives = 58/107 (54%), Gaps = 12/107 (11%) Query: 4 LMDADGNIAWSGEYDEWGNQLNEENPHHLHQ------------PYRLPGQQYDKESGLYY 51 L +G I W GE WG E P L + R GQ YD E+GLYY Sbjct: 1398 LCSEEGEIRWRGEQGLWGAHREERRPIPLRRYLGDAANEEVYCELRYQGQLYDAETGLYY 1457 Query: 52 NRNRYYDPLQGRYITQDPIGLEGGWSLYAYPLNPVNGIDPLGLSPAD 98 NR+RYYD G+Y++ DPIGL GG Y Y NP++ +DPLGL+P + Sbjct: 1458 NRHRYYDAESGQYLSPDPIGLAGGKRAYGYVKNPLSWVDPLGLTPKE 1504 >UniRef50_B0VRR7 Putative uncharacterized protein n=1 Tax=Acinetobacter baumannii SDF RepID=B0VRR7_ACIBS Length = 296 Score = 87.4 bits (215), Expect = 4e-16, Method: Compositional matrix adjust. Identities = 44/99 (44%), Positives = 57/99 (57%), Gaps = 7/99 (7%) Query: 4 LMDADGNIAWSGEYDEWGNQLNE-------ENPHHLHQPYRLPGQQYDKESGLYYNRNRY 56 + D G I W EY WG E EN + R GQ +D+E+GL+YNR RY Sbjct: 57 MTDHTGAIIWKAEYKAWGECKAEKAKSNFFENSEIISNNIRFQGQYFDEETGLHYNRYRY 116 Query: 57 YDPLQGRYITQDPIGLEGGWSLYAYPLNPVNGIDPLGLS 95 Y P GR++++DPIGL GG + YAY +P +DPLGLS Sbjct: 117 YSPYVGRFVSKDPIGLLGGSNNYAYAPSPTEWVDPLGLS 155 >UniRef50_D0KES8 RHS protein n=3 Tax=Pectobacterium wasabiae WPP163 RepID=D0KES8_PECWW Length = 307 Score = 87.4 bits (215), Expect = 4e-16, Method: Compositional matrix adjust. Identities = 48/102 (47%), Positives = 63/102 (61%), Gaps = 8/102 (7%) Query: 2 LALMDADGNIAWSGEYDEWG--NQLNEENPHHLH------QPYRLPGQQYDKESGLYYNR 53 L + DA+G + WSG+Y +G N +++ H Q R GQ D+E+GL+YN Sbjct: 86 LEMTDAEGAVRWSGDYGSFGAVNGQTQDSEGLRHGKQAESQSLRYAGQYADEETGLHYNL 145 Query: 54 NRYYDPLQGRYITQDPIGLEGGWSLYAYPLNPVNGIDPLGLS 95 RYYDP GR+ TQDPIGL GG +LY Y NP+ IDPLGL+ Sbjct: 146 FRYYDPTVGRFTTQDPIGLAGGINLYQYAPNPLTWIDPLGLA 187 >UniRef50_Q4K3M9 Rhs family protein n=5 Tax=Pseudomonas RepID=Q4K3M9_PSEF5 Length = 1486 Score = 87.0 bits (214), Expect = 4e-16, Method: Compositional matrix adjust. Identities = 43/92 (46%), Positives = 57/92 (61%), Gaps = 2/92 (2%) Query: 4 LMDADGNIAWSGEYDEWGNQLNE-ENPHHLH-QPYRLPGQQYDKESGLYYNRNRYYDPLQ 61 L + DG+ W Y WGN + E P+++ Q R GQ D+E+GL++N R+YDP Sbjct: 1258 LCEPDGHSVWQARYQVWGNTVEEIREPYYIEEQNLRFQGQYLDRETGLHFNTFRFYDPDI 1317 Query: 62 GRYITQDPIGLEGGWSLYAYPLNPVNGIDPLG 93 GR+ T DPIGL GG +LY Y NP+ IDPLG Sbjct: 1318 GRFTTPDPIGLAGGLNLYQYAPNPIGWIDPLG 1349 >UniRef50_A3NNM1 Protein RhsD n=20 Tax=pseudomallei group RepID=A3NNM1_BURP6 Length = 1539 Score = 87.0 bits (214), Expect = 4e-16, Method: Compositional matrix adjust. Identities = 44/113 (38%), Positives = 68/113 (60%), Gaps = 3/113 (2%) Query: 2 LALMDADGNIAWSGEYDEWGNQLNEENPHHLHQPYRLPGQQYDKESGLYYNRNRYYDPLQ 61 + ++D +GN+AW YD G + + + QP RL GQ +D E+G+ YNR+RYYD Sbjct: 1309 VRMLDVEGNVAWEASYDANGG-IEQFGIQAMPQPLRLQGQYFDAETGMSYNRHRYYDARI 1367 Query: 62 GRYITQDPIGLEGGWSLYAYPLNPVNGIDPLGLSPADVALIRRKDQLNHQRAW 114 G+++++DPI L GG +LY Y +N ++ DPLGL V L ++L+ W Sbjct: 1368 GQFVSEDPIRLSGGENLYRYCVNSISWADPLGLD--RVPLFDPNNRLSFNAIW 1418 >UniRef50_Q1K295 YD repeat n=4 Tax=Desulfuromonas acetoxidans DSM 684 RepID=Q1K295_DESAC Length = 1468 Score = 87.0 bits (214), Expect = 5e-16, Method: Compositional matrix adjust. Identities = 47/97 (48%), Positives = 59/97 (60%), Gaps = 5/97 (5%) Query: 2 LALMDADGNIAWSGEYDEWG----NQLNEENPHHLHQPYRLPGQQYDKESGLYYNRNRYY 57 + L DA G WS +Y +G N + + + R PGQ +D ESGL+YN +RYY Sbjct: 1284 ILLTDATGTAVWSAQYAPFGQATINNDVDGDGTEVVCNLRFPGQYFDAESGLHYNWHRYY 1343 Query: 58 DPLQGRYITQDPIGLEGGWSLYAYP-LNPVNGIDPLG 93 +P GRYIT DPIGL GG +LYAY NPVN +DP G Sbjct: 1344 EPRSGRYITLDPIGLAGGINLYAYANRNPVNVVDPTG 1380 >UniRef50_UPI000190F33A Rhs-family protein n=2 Tax=Salmonella enterica subsp. enterica serovar Typhi RepID=UPI000190F33A Length = 138 Score = 87.0 bits (214), Expect = 5e-16, Method: Compositional matrix adjust. Identities = 40/64 (62%), Positives = 49/64 (76%) Query: 31 HLHQPYRLPGQQYDKESGLYYNRNRYYDPLQGRYITQDPIGLEGGWSLYAYPLNPVNGID 90 + HQP RLPGQ +D E+GL+YN RYY P GR+++QDPIGL GG +LYAY NP+ ID Sbjct: 3 YFHQPLRLPGQYFDDETGLHYNLFRYYAPECGRFVSQDPIGLRGGLNLYAYCPNPLTWID 62 Query: 91 PLGL 94 PLGL Sbjct: 63 PLGL 66 >UniRef50_UPI000196E06E Rhs family protein n=2 Tax=Neisseria mucosa ATCC 25996 RepID=UPI000196E06E Length = 280 Score = 86.7 bits (213), Expect = 5e-16, Method: Compositional matrix adjust. Identities = 44/92 (47%), Positives = 59/92 (64%), Gaps = 1/92 (1%) Query: 4 LMDADGNIAWSGEYDEWGNQLNEENPH-HLHQPYRLPGQQYDKESGLYYNRNRYYDPLQG 62 + D GN+ W GEY WG +E + + HQP+RL Q YD+E+GL+YN RYY+P G Sbjct: 51 MTDIHGNLLWYGEYTAWGRLKKDECVYRNAHQPFRLQNQYYDEETGLHYNLMRYYEPEAG 110 Query: 63 RYITQDPIGLEGGWSLYAYPLNPVNGIDPLGL 94 R++ QDPI L GG +LY++ N DPLGL Sbjct: 111 RFVNQDPILLLGGSNLYSFASNTNAWFDPLGL 142 >UniRef50_Q88FK6 RHS family protein, putative n=1 Tax=Pseudomonas putida KT2440 RepID=Q88FK6_PSEPK Length = 1530 Score = 86.7 bits (213), Expect = 5e-16, Method: Compositional matrix adjust. Identities = 45/98 (45%), Positives = 61/98 (62%), Gaps = 2/98 (2%) Query: 4 LMDADGNIAWSGEYDEWGNQLNEENPHH--LHQPYRLPGQQYDKESGLYYNRNRYYDPLQ 61 L D+DGN W ++ WG +NE + Q R GQ D+E+GL++N R+YDP Sbjct: 1315 LTDSDGNTIWRSDHHGWGKIINEWHSQQNGREQNLRNQGQYIDRETGLHFNIFRFYDPDI 1374 Query: 62 GRYITQDPIGLEGGWSLYAYPLNPVNGIDPLGLSPADV 99 GR+ T DP+G+EGG +LY+Y N VN DPLGL P +V Sbjct: 1375 GRFTTTDPLGIEGGVNLYSYAPNIVNYSDPLGLCPENV 1412 >UniRef50_A1U3R9 YD repeat protein n=3 Tax=Gammaproteobacteria RepID=A1U3R9_MARAV Length = 1611 Score = 86.3 bits (212), Expect = 7e-16, Method: Compositional matrix adjust. Identities = 42/92 (45%), Positives = 58/92 (63%), Gaps = 1/92 (1%) Query: 4 LMDADGNIAWSGEYDEWGNQLNEENPHHLHQPYRLPGQQYDKESGLYYNRNRYYDPLQGR 63 L + G + WS Y +GN + ++ + P R GQ +D E+GL+YNR+RYY+P GR Sbjct: 1368 LTNQQGRLVWSVTYRAYGNVVQQQVAE-IDNPLRFQGQYHDPETGLHYNRHRYYNPNTGR 1426 Query: 64 YITQDPIGLEGGWSLYAYPLNPVNGIDPLGLS 95 +IT DPIGL GG + Y Y NP +DPLGL+ Sbjct: 1427 FITPDPIGLAGGLNNYQYVPNPTGWVDPLGLA 1458 >UniRef50_A7K3Q8 Rhs family protein n=7 Tax=Vibrio RepID=A7K3Q8_VIBSE Length = 1384 Score = 86.3 bits (212), Expect = 8e-16, Method: Compositional matrix adjust. Identities = 40/91 (43%), Positives = 54/91 (59%), Gaps = 2/91 (2%) Query: 4 LMDADGNIAWSGEYDEWGNQLNEENPHHLHQPYRLPGQQYDKESGLYYNRNRYYDPLQGR 63 L+D +GN+ WS YD G + + P R GQ +D+E+ L+YN RYYDP GR Sbjct: 1043 LIDCEGNVVWSASYDAHG--FAHVHIEKVVNPLRFQGQYFDQETNLHYNLARYYDPKLGR 1100 Query: 64 YITQDPIGLEGGWSLYAYPLNPVNGIDPLGL 94 +I QDPI + GG + Y Y +NP+ IDP G Sbjct: 1101 FIQQDPISIAGGINHYQYAINPIQWIDPTGF 1131 >UniRef50_C6M9F4 Rhs family protein n=8 Tax=Neisseria RepID=C6M9F4_NEISI Length = 632 Score = 86.3 bits (212), Expect = 9e-16, Method: Compositional matrix adjust. Identities = 45/93 (48%), Positives = 58/93 (62%), Gaps = 3/93 (3%) Query: 4 LMDADGNIAWSGEYDEWGNQLNEEN--PHHLHQPYRLPGQQYDKESGLYYNRNRYYDPLQ 61 + D DGN+ W G Y WG +L EE +QP+RL Q D+E+GL+YN RYY+P Sbjct: 431 MTDKDGNLLWFGNYTGWG-RLKEETRVTDSAYQPFRLQNQYADRETGLHYNFFRYYEPDA 489 Query: 62 GRYITQDPIGLEGGWSLYAYPLNPVNGIDPLGL 94 GR++ QDPIGL GG + Y + N IDPLGL Sbjct: 490 GRFVNQDPIGLLGGANPYQFASNITEWIDPLGL 522 >UniRef50_C9Y459 Putative uncharacterized protein n=1 Tax=Cronobacter turicensis RepID=C9Y459_CROTZ Length = 1523 Score = 85.9 bits (211), Expect = 9e-16, Method: Compositional matrix adjust. Identities = 42/93 (45%), Positives = 54/93 (58%), Gaps = 2/93 (2%) Query: 4 LMDADGNIAWSGEYDEWGNQLNEENPHHLHQP--YRLPGQQYDKESGLYYNRNRYYDPLQ 61 + DA+G W G++ WG E + P R GQ D+ESGL+YN RYYDP+ Sbjct: 1299 VTDANGQTVWRGQFSTWGETERELSVPQWQVPQNLRFQGQYLDRESGLHYNLFRYYDPVA 1358 Query: 62 GRYITQDPIGLEGGWSLYAYPLNPVNGIDPLGL 94 GRY DPIGL GG + Y Y +P+ +DPLGL Sbjct: 1359 GRYTQMDPIGLLGGINTYGYVPDPLTWVDPLGL 1391 >UniRef50_D0KG38 Rhs family protein-like protein n=1 Tax=Pectobacterium wasabiae WPP163 RepID=D0KG38_PECWW Length = 230 Score = 85.9 bits (211), Expect = 1e-15, Method: Compositional matrix adjust. Identities = 46/109 (42%), Positives = 63/109 (57%), Gaps = 5/109 (4%) Query: 4 LMDADGNIAWSGEYDEWG-----NQLNEENPHHLHQPYRLPGQQYDKESGLYYNRNRYYD 58 + DA+ + WSG+Y +G Q +E H Q R GQ D+E+GL+YN RYYD Sbjct: 1 MTDAESAVRWSGDYGSFGAVNGQTQDSEGLRHGKSQSLRYAGQYADEETGLHYNLFRYYD 60 Query: 59 PLQGRYITQDPIGLEGGWSLYAYPLNPVNGIDPLGLSPADVALIRRKDQ 107 P GR+ TQ IGL GG +LY Y N + +DPLGL+P ++ KD+ Sbjct: 61 PTVGRFTTQGLIGLAGGLNLYQYAPNSLGWVDPLGLTPGEIIRYMGKDE 109 >UniRef50_C0EPY1 Putative uncharacterized protein n=7 Tax=Neisseria RepID=C0EPY1_NEIFL Length = 434 Score = 85.9 bits (211), Expect = 1e-15, Method: Compositional matrix adjust. Identities = 43/92 (46%), Positives = 57/92 (61%), Gaps = 1/92 (1%) Query: 4 LMDADGNIAWSGEYDEWGNQLNEENPH-HLHQPYRLPGQQYDKESGLYYNRNRYYDPLQG 62 + D GN+ W GEY WG + + H HQP+RL Q YD+E+GL+YN RYY+P G Sbjct: 223 MTDIRGNLLWYGEYTAWGRLKKDGRVYQHAHQPFRLQNQYYDRETGLHYNYFRYYEPETG 282 Query: 63 RYITQDPIGLEGGWSLYAYPLNPVNGIDPLGL 94 R+I+QDPIGL G +LY + N +D GL Sbjct: 283 RFISQDPIGLLGEDNLYWFGPNTAIWVDLFGL 314 >UniRef50_Q395C2 Rhs family protein n=1 Tax=Burkholderia sp. 383 RepID=Q395C2_BURS3 Length = 190 Score = 85.5 bits (210), Expect = 2e-15, Method: Compositional matrix adjust. Identities = 42/73 (57%), Positives = 48/73 (65%) Query: 30 HHLHQPYRLPGQQYDKESGLYYNRNRYYDPLQGRYITQDPIGLEGGWSLYAYPLNPVNGI 89 H + P R PGQ YD+ESGL+YNR RY DP GRYI QDPIGL+GG + Y Y NPV Sbjct: 7 HAVDNPIRFPGQYYDRESGLHYNRFRYCDPQVGRYINQDPIGLKGGANSYVYAHNPVTLS 66 Query: 90 DPLGLSPADVALI 102 DPLGL L+ Sbjct: 67 DPLGLQSTGPVLL 79 >UniRef50_B5I7B2 Rhs repeat protein n=1 Tax=Streptomyces sviceus ATCC 29083 RepID=B5I7B2_9ACTO Length = 1249 Score = 85.1 bits (209), Expect = 2e-15, Method: Composition-based stats. Identities = 42/92 (45%), Positives = 53/92 (57%), Gaps = 1/92 (1%) Query: 4 LMDADGNIAWSGEYDEWGNQLNEENPHHLHQPYRLPGQQYDKESGLYYNRNRYYDPLQGR 63 L+ DG IAW WG+ + + + P R PGQ +D E+GL+YN NRYYDP GR Sbjct: 1000 LIAPDGTIAWHSRSTAWGSTQSHRDAT-AYTPLRYPGQYFDPETGLHYNLNRYYDPELGR 1058 Query: 64 YITQDPIGLEGGWSLYAYPLNPVNGIDPLGLS 95 Y T DP+GL + Y Y NP DPLGL+ Sbjct: 1059 YTTPDPLGLAPAVNHYTYVPNPFTLADPLGLA 1090 >UniRef50_Q7MDR0 Rhs family protein n=3 Tax=Vibrio vulnificus RepID=Q7MDR0_VIBVY Length = 1498 Score = 85.1 bits (209), Expect = 2e-15, Method: Compositional matrix adjust. Identities = 44/93 (47%), Positives = 55/93 (59%), Gaps = 2/93 (2%) Query: 2 LALMDADGNIAWSGEYDEWGNQLNEENPHHLHQPYRLPGQQYDKESGLYYNRNRYYDPLQ 61 LAL+D G++ W YD +G E + P R GQ +D E+GL+YN RYYDP Sbjct: 1039 LALVDEQGSVVWQARYDTYGRAHIEVES--VGNPLRFQGQYHDVETGLHYNLARYYDPRT 1096 Query: 62 GRYITQDPIGLEGGWSLYAYPLNPVNGIDPLGL 94 GR+I DPIGL GG + Y Y NPV +DP GL Sbjct: 1097 GRFIQPDPIGLLGGINHYQYAPNPVMWVDPHGL 1129 >UniRef50_B8FBZ2 YD repeat protein n=1 Tax=Desulfatibacillum alkenivorans AK-01 RepID=B8FBZ2_DESAA Length = 685 Score = 85.1 bits (209), Expect = 2e-15, Method: Compositional matrix adjust. Identities = 45/93 (48%), Positives = 59/93 (63%), Gaps = 2/93 (2%) Query: 3 ALMDADGNIAWSGEYDEWGNQLNEENPHHLHQPYRLPGQQYDKESGLYYNRNRYYDPLQG 62 L+ A+G+ AWS Y +G + + + + R PGQ YD E+GL+YN NRYYDP G Sbjct: 252 VLVRANGSTAWSATYSAYG-KASVDPDSDVENNLRFPGQYYDAETGLHYNLNRYYDPEIG 310 Query: 63 RYITQDPIGLEGGWSLYAYPL-NPVNGIDPLGL 94 Y + DP+GL GG +LYAY NP N +DPLGL Sbjct: 311 AYRSPDPLGLGGGVNLYAYTAGNPANYVDPLGL 343 >UniRef50_A7FDK7 YD/RHS repeat protein n=25 Tax=Enterobacteriaceae RepID=A7FDK7_YERP3 Length = 1527 Score = 84.7 bits (208), Expect = 2e-15, Method: Compositional matrix adjust. Identities = 49/135 (36%), Positives = 68/135 (50%), Gaps = 8/135 (5%) Query: 4 LMDADGNIAWSGEYDEWGNQ------LNEENPHHLHQPYRLPGQQYDKESGLYYNRNRYY 57 L++ G + W+ WG NEE+ + R GQ D ESGL+YNR RYY Sbjct: 1295 LLNEQGKVVWASRLSTWGQAELWRQAANEED--RVSCNLRFAGQYADAESGLHYNRFRYY 1352 Query: 58 DPLQGRYITQDPIGLEGGWSLYAYPLNPVNGIDPLGLSPADVALIRRKDQLNHQRAWDIL 117 D G+Y+ DPIGL GG + Y Y NPV +DPLGL DVA R+ L ++I Sbjct: 1353 DGETGQYLCPDPIGLAGGLNPYGYVHNPVKYVDPLGLCKTDVARERQAQMLQDDVGYNIS 1412 Query: 118 SDTYEDMKRLNLGGT 132 +++ + G+ Sbjct: 1413 PKSWDQFPSIGRDGS 1427 >UniRef50_A9AE26 Rhs family protein n=10 Tax=cellular organisms RepID=A9AE26_BURM1 Length = 1547 Score = 84.7 bits (208), Expect = 2e-15, Method: Compositional matrix adjust. Identities = 40/97 (41%), Positives = 57/97 (58%), Gaps = 6/97 (6%) Query: 4 LMDADGNIAWSGEYDEWGN------QLNEENPHHLHQPYRLPGQQYDKESGLYYNRNRYY 57 L D DG++ W Y WG + ++ R GQQ D+E+GL+YNR RYY Sbjct: 1319 LTDDDGDVVWEASYKAWGEAREVIARASKAAGIVARNSLRFQGQQEDEETGLHYNRYRYY 1378 Query: 58 DPLQGRYITQDPIGLEGGWSLYAYPLNPVNGIDPLGL 94 DP GR++++DP+G+ GG ++Y Y N V +DPLGL Sbjct: 1379 DPNSGRFVSKDPVGMVGGINVYQYAPNAVAWVDPLGL 1415 >UniRef50_Q399U9 Rhs family protein n=3 Tax=Proteobacteria RepID=Q399U9_BURS3 Length = 1446 Score = 84.7 bits (208), Expect = 2e-15, Method: Composition-based stats. Identities = 45/113 (39%), Positives = 59/113 (52%), Gaps = 23/113 (20%) Query: 4 LMDADGNIAWSGEYDEWGNQLNEE---------------------NPHHLHQP--YRLPG 40 L +A+G + W Y WGN + EE P H+ +P R G Sbjct: 1201 LTNAEGELIWQARYKVWGNAVQEEWIARTSQQSVPEWGEVQLASATPAHVPRPQNLRFQG 1260 Query: 41 QQYDKESGLYYNRNRYYDPLQGRYITQDPIGLEGGWSLYAYPLNPVNGIDPLG 93 Q D+E+GL+YN R+YDP GR+I DPIGL GG +LYAY +P+ IDP G Sbjct: 1261 QYLDRETGLHYNTFRFYDPDIGRFINPDPIGLSGGHNLYAYAESPLIWIDPWG 1313 >UniRef50_C5AIF2 Rhs family protein n=1 Tax=Burkholderia glumae BGR1 RepID=C5AIF2_BURGB Length = 345 Score = 84.7 bits (208), Expect = 2e-15, Method: Compositional matrix adjust. Identities = 42/100 (42%), Positives = 57/100 (57%), Gaps = 7/100 (7%) Query: 2 LALMDADGNIAWSGEYDEWG--NQLNEENPHH-----LHQPYRLPGQQYDKESGLYYNRN 54 L + D G + W Y WG ++ E + P R GQQ+D E+G +YNR Sbjct: 119 LMMTDEAGELVWEASYRAWGEAQEVIERASAAAGIDVVRNPLRFQGQQFDDETGQHYNRY 178 Query: 55 RYYDPLQGRYITQDPIGLEGGWSLYAYPLNPVNGIDPLGL 94 RYYDP R++ +DPIGL GG ++Y Y NP++ IDPLGL Sbjct: 179 RYYDPGSSRFVNKDPIGLTGGINIYQYAPNPISWIDPLGL 218 >UniRef50_D0KWY8 YD repeat protein n=1 Tax=Halothiobacillus neapolitanus c2 RepID=D0KWY8_HALNC Length = 1338 Score = 84.7 bits (208), Expect = 3e-15, Method: Compositional matrix adjust. Identities = 45/97 (46%), Positives = 56/97 (57%), Gaps = 4/97 (4%) Query: 2 LALMDADGNIAWSGEYDEWGNQLNEENP---HHLHQPYRLPGQQYDKESGLYYNRNRYYD 58 +A DA + W Y +G QL+ + H R PGQ DKESGLYYN +RYYD Sbjct: 860 IAATDAQAQVIWRAHYGPYGQQLDVADSLVKDHFSLSLRNPGQWQDKESGLYYNDHRYYD 919 Query: 59 PLQGRYITQDPIGLEGGWSLYAY-PLNPVNGIDPLGL 94 P GRY++ DP+GL GG + YAY NP+ DP GL Sbjct: 920 PATGRYLSPDPLGLAGGLNAYAYVAANPIAYTDPYGL 956 >UniRef50_A1TSU8 YD repeat protein n=4 Tax=Acidovorax RepID=A1TSU8_ACIAC Length = 1586 Score = 84.7 bits (208), Expect = 3e-15, Method: Compositional matrix adjust. Identities = 47/121 (38%), Positives = 59/121 (48%), Gaps = 28/121 (23%) Query: 4 LMDADGNIAWSGEYDEWGNQ----------------LNEENP------------HHLHQP 35 L D G I W+ Y WG E P + + QP Sbjct: 1340 LTDEQGRIVWAASYQVWGQTRALQVMRTGTDDAAVFTQAERPLALAAKGDVQALNFVEQP 1399 Query: 36 YRLPGQQYDKESGLYYNRNRYYDPLQGRYITQDPIGLEGGWSLYAYPLNPVNGIDPLGLS 95 R GQ +D E+GL+YNR RYYDP+ GR++ QDPIGL GG +L+ Y NP+ DPLGL Sbjct: 1400 LRFQGQYFDGETGLHYNRFRYYDPVTGRFVHQDPIGLAGGNNLFFYAPNPLIWNDPLGLK 1459 Query: 96 P 96 P Sbjct: 1460 P 1460 >UniRef50_A1AK54 YD repeat protein n=1 Tax=Pelobacter propionicus DSM 2379 RepID=A1AK54_PELPD Length = 1352 Score = 84.3 bits (207), Expect = 3e-15, Method: Compositional matrix adjust. Identities = 43/96 (44%), Positives = 62/96 (64%), Gaps = 4/96 (4%) Query: 1 MLALMDADGNIAWSGEYDEWGNQLNEENPHHLHQPYRLPGQQYDKESGLYYNRNRYYDPL 60 ++AL D G + YD +GN ++ ++ QP+ G+++D+E+GLYY R RYYDP Sbjct: 1146 IVALTDRHGTVVQEYNYDSFGNP--DQRGENIDQPFSYTGREWDRETGLYYYRARYYDPK 1203 Query: 61 QGRYITQDPIGLEGG-WSLYAYPL-NPVNGIDPLGL 94 GR+I +DPI GG +LYAY L NP+N +DP GL Sbjct: 1204 IGRFIQKDPISFAGGDVNLYAYVLNNPINRLDPFGL 1239 >UniRef50_D0HCV2 Rhs protein n=3 Tax=Vibrio mimicus VM223 RepID=D0HCV2_VIBMI Length = 1617 Score = 84.3 bits (207), Expect = 3e-15, Method: Compositional matrix adjust. Identities = 46/105 (43%), Positives = 59/105 (56%), Gaps = 11/105 (10%) Query: 4 LMDADGNIAWSGEYDEWGN-QLNEENPHH----------LHQPYRLPGQQYDKESGLYYN 52 L +G + W GE WG+ Q P+H L+ R GQ D+ESGLYYN Sbjct: 1382 LCSENGEVVWQGEQALWGHYQQRNTFPNHGIREHAHNDELYCDLRYQGQIEDRESGLYYN 1441 Query: 53 RNRYYDPLQGRYITQDPIGLEGGWSLYAYPLNPVNGIDPLGLSPA 97 NRYYD G+Y++QDPIG GG AY NP+ +DPLGL+P+ Sbjct: 1442 VNRYYDADSGQYLSQDPIGFSGGLRPQAYVFNPLEWVDPLGLAPS 1486 >UniRef50_C7Q0B8 YD repeat protein n=1 Tax=Catenulispora acidiphila DSM 44928 RepID=C7Q0B8_CATAD Length = 1489 Score = 84.3 bits (207), Expect = 3e-15, Method: Composition-based stats. Identities = 43/95 (45%), Positives = 54/95 (56%), Gaps = 1/95 (1%) Query: 4 LMDADGNIAWSGEYDE-WGNQLNEENPHHLHQPYRLPGQQYDKESGLYYNRNRYYDPLQG 62 L+ ADG++ W WG + P P R PGQ +D E+GL+YN NRYYDP Sbjct: 1288 LVSADGHVVWQQRRASIWGLPADIVPPDADEFPLRFPGQYHDSETGLHYNLNRYYDPEAA 1347 Query: 63 RYITQDPIGLEGGWSLYAYPLNPVNGIDPLGLSPA 97 Y+T DP+GLE + Y Y NP+ DPLGL PA Sbjct: 1348 AYLTPDPLGLEPAPNQYGYVGNPLADSDPLGLYPA 1382 >UniRef50_Q2SGE8 Rhs family protein n=1 Tax=Hahella chejuensis KCTC 2396 RepID=Q2SGE8_HAHCH Length = 1452 Score = 84.0 bits (206), Expect = 3e-15, Method: Compositional matrix adjust. Identities = 52/136 (38%), Positives = 71/136 (52%), Gaps = 9/136 (6%) Query: 3 ALMDADGNIAWSGEYDEWGNQLNEENPHHLHQPYRLPGQQYDKESGLYYNRNRYYDPLQG 62 A+ D++G WS WG +L + P+R PGQ D+E+GLYYNR RYYDP G Sbjct: 1180 AMYDSEGRQVWSANISVWG-ELRNLKGNRGACPFRWPGQYEDEETGLYYNRFRYYDPDSG 1238 Query: 63 RYITQDPIGLEGGWSLYAYPLNPVNGIDPLGLSPADVALIRRKDQLNHQRAWDILSDTYE 122 +YI QDPI ++GG +LY Y + ID LGL + R ++ H +D E Sbjct: 1239 QYIRQDPIRIKGGLNLYKYVSDVTTWIDTLGLQGGSASYGRSGRRIGHAD-----TDGLE 1293 Query: 123 DMKRLN---LGGTDQF 135 + LN GG D+ Sbjct: 1294 KIGELNAVKAGGDDRL 1309 >UniRef50_Q12LF3 YD repeat n=2 Tax=Shewanella denitrificans OS217 RepID=Q12LF3_SHEDO Length = 927 Score = 84.0 bits (206), Expect = 4e-15, Method: Compositional matrix adjust. Identities = 44/93 (47%), Positives = 54/93 (58%), Gaps = 3/93 (3%) Query: 3 ALMDADGNIAWSGEYDEWGNQLNEENPHHLHQPYRLPGQQYDKESGLYYNRNRYYDPLQG 62 AL D+ G + W Y +G + + + + Q R PGQ YD+ESGL+YN R YDP G Sbjct: 694 ALTDSTGTVQWQAHYTPFGQTIVDIDK--IKQAIRFPGQYYDEESGLHYNYFRDYDPELG 751 Query: 63 RYITQDPIGLEGGWSLYAYPL-NPVNGIDPLGL 94 RYI DPIGL GG + Y Y NPV DP GL Sbjct: 752 RYIQSDPIGLAGGINTYGYAYQNPVMNTDPTGL 784 >UniRef50_B9B9U8 YD repeat protein n=1 Tax=Burkholderia multivorans CGD1 RepID=B9B9U8_9BURK Length = 345 Score = 84.0 bits (206), Expect = 4e-15, Method: Compositional matrix adjust. Identities = 42/98 (42%), Positives = 55/98 (56%), Gaps = 6/98 (6%) Query: 4 LMDADGNIAWSGEYDEWGN------QLNEENPHHLHQPYRLPGQQYDKESGLYYNRNRYY 57 L D DG++ W Y WG + ++ R GQQ D+E+GL+YNR RYY Sbjct: 185 LTDDDGDVVWEASYKAWGEAREVIARASKAAGIVARNSLRFQGQQEDEETGLHYNRYRYY 244 Query: 58 DPLQGRYITQDPIGLEGGWSLYAYPLNPVNGIDPLGLS 95 DP GR+ + DPI L GG ++Y Y LN V+ IDP G S Sbjct: 245 DPTSGRFTSADPIRLAGGSNVYQYALNLVSWIDPFGFS 282 >UniRef50_A1TR18 YD repeat protein n=8 Tax=Acidovorax RepID=A1TR18_ACIAC Length = 1654 Score = 83.6 bits (205), Expect = 5e-15, Method: Composition-based stats. Identities = 44/109 (40%), Positives = 56/109 (51%), Gaps = 19/109 (17%) Query: 4 LMDADGNIAWSGEYDEWGNQLNEE-------------------NPHHLHQPYRLPGQQYD 44 + D G + W + WG+ L E N L Q RL GQ D Sbjct: 1410 VTDEAGEVRWRASWRTWGSALEERWEAVRIDGSAIPAVQQRHRNEDTLEQNLRLQGQYLD 1469 Query: 45 KESGLYYNRNRYYDPLQGRYITQDPIGLEGGWSLYAYPLNPVNGIDPLG 93 +E+GL+YN RYYDP GR+I+ DPIGL GG +L Y NP++ IDPLG Sbjct: 1470 RETGLHYNTFRYYDPDVGRFISPDPIGLAGGLNLQRYAANPISWIDPLG 1518 >UniRef50_A1TSG3 RHS protein n=1 Tax=Acidovorax citrulli AAC00-1 RepID=A1TSG3_ACIAC Length = 384 Score = 83.2 bits (204), Expect = 6e-15, Method: Compositional matrix adjust. Identities = 63/197 (31%), Positives = 89/197 (45%), Gaps = 39/197 (19%) Query: 4 LMDADGNIAWSGEYDEWGNQLNEE------------NPHH------------LHQPYRLP 39 + D DG++AW +Y WG+ + EE P H + Q R+ Sbjct: 139 MSDRDGHLAWRAQYRVWGSAVAEEWQAFDGVGRPVEAPRHETGQRPDNSAAPMPQNLRMQ 198 Query: 40 GQQYDKESGLYYNRNRYYDPLQGRYITQDPIGLEGGWSLYAYPLNPVNGIDPLGLSPADV 99 GQ D+E+GL+YN RYY P G + T DPIGL GG +L+ Y NPV+ IDPLG +P Sbjct: 199 GQYLDRETGLHYNTFRYYGPDVGAFTTPDPIGLAGGVNLHQYAPNPVSWIDPLGWNP--- 255 Query: 100 ALIRRKDQLNHQRAWDILSDTYEDMKRLNLGGTDQFFHCMAFCRVSKLNDAGVSRSAKGL 159 + R Q H+ A + L GT + + R D VS +A+ Sbjct: 256 --VCRMSQTPHETASSLP---------LVRPGTSAWQKAVEAIRQGGKGDIRVSTAAEAK 304 Query: 160 GYEKEIRDYGLNLFGMY 176 +E R G++ MY Sbjct: 305 ALLQEARG-GMDRRKMY 320 >UniRef50_C9NF93 YD repeat protein n=1 Tax=Streptomyces flavogriseus ATCC 33331 RepID=C9NF93_9ACTO Length = 1536 Score = 83.2 bits (204), Expect = 7e-15, Method: Composition-based stats. Identities = 50/121 (41%), Positives = 67/121 (55%), Gaps = 9/121 (7%) Query: 4 LMDADGNIAWSGEYDEWGNQLNEENPHHLHQPYRLPGQQYDKESGLYYNRNRYYDPLQGR 63 L+D G I+W WG ++ + P R PGQ +D+ES L+YN RYYDP R Sbjct: 1339 LIDESGFISWRVRRSVWGTTEWAKDSS-AYTPLRFPGQYFDQESLLHYNYLRYYDPDVSR 1397 Query: 64 YITQDPIGLEGGWSLYAYPLNPVNGIDPLGLSPADVALIRRKDQL---NHQRAWDILSDT 120 YI+ DPIGLEGG + + Y NP IDPLGLS L R K +L N + W +++ Sbjct: 1398 YISPDPIGLEGGPNPHWYGPNPYTWIDPLGLS-----LCRVKPRLEDGNTKEGWQHINER 1452 Query: 121 Y 121 + Sbjct: 1453 H 1453 >UniRef50_C5ALM7 YD repeat protein n=19 Tax=Proteobacteria RepID=C5ALM7_BURGB Length = 1425 Score = 83.2 bits (204), Expect = 7e-15, Method: Composition-based stats. Identities = 40/88 (45%), Positives = 52/88 (59%), Gaps = 1/88 (1%) Query: 9 GNIAWSGEYDEWGNQL-NEENPHHLHQPYRLPGQQYDKESGLYYNRNRYYDPLQGRYITQ 67 G+I W+G+Y WG N+ P + QP R GQ D + L+YN R+YDP GR+I Q Sbjct: 1200 GDIVWAGQYSAWGKVAPNQHAPARIDQPLRYAGQYADDSTELHYNTFRFYDPDVGRFINQ 1259 Query: 68 DPIGLEGGWSLYAYPLNPVNGIDPLGLS 95 DPIGL GG +LY Y N + D GL+ Sbjct: 1260 DPIGLMGGLNLYQYAPNSIAWTDWWGLA 1287 >UniRef50_A4FJ21 YD repeat protein n=1 Tax=Saccharopolyspora erythraea NRRL 2338 RepID=A4FJ21_SACEN Length = 1670 Score = 83.2 bits (204), Expect = 7e-15, Method: Compositional matrix adjust. Identities = 42/92 (45%), Positives = 57/92 (61%), Gaps = 1/92 (1%) Query: 4 LMDADGNIAWSGEYDEWGNQLNEENPHHLHQPYRLPGQQYDKESGLYYNRNRYYDPLQGR 63 L+ +G +AW G WG +L + P+ + P R PGQ D E+GL YN +RYYDP GR Sbjct: 1273 LIAPNGVLAWHGRTSLWGKELPVQ-PNGVTTPLRFPGQYADAETGLNYNVHRYYDPATGR 1331 Query: 64 YITQDPIGLEGGWSLYAYPLNPVNGIDPLGLS 95 Y++QDP+GL + AY NP + DPLGL+ Sbjct: 1332 YLSQDPLGLAPAPNPVAYVDNPHSAADPLGLA 1363 >UniRef50_C8Q7Z5 YD repeat protein n=8 Tax=Enterobacteriaceae RepID=C8Q7Z5_9ENTR Length = 1507 Score = 82.4 bits (202), Expect = 1e-14, Method: Compositional matrix adjust. Identities = 42/94 (44%), Positives = 52/94 (55%), Gaps = 2/94 (2%) Query: 4 LMDADGNIAWSGEYDEWGNQLNEENPHHLHQP--YRLPGQQYDKESGLYYNRNRYYDPLQ 61 + D G+ W G + WG E P R GQ D+E+GL+YN RYYDP Sbjct: 1278 VTDIRGDTVWQGAFAAWGRTTRESTGVDWEVPQNLRFQGQYLDRETGLHYNTFRYYDPCG 1337 Query: 62 GRYITQDPIGLEGGWSLYAYPLNPVNGIDPLGLS 95 GRY DPIGL GG +LY Y + + GIDPLGL+ Sbjct: 1338 GRYTQLDPIGLMGGLNLYQYAPDVLTGIDPLGLA 1371 >UniRef50_A5GE16 YD repeat protein n=1 Tax=Geobacter uraniireducens Rf4 RepID=A5GE16_GEOUR Length = 1600 Score = 82.4 bits (202), Expect = 1e-14, Method: Composition-based stats. Identities = 41/98 (41%), Positives = 63/98 (64%), Gaps = 4/98 (4%) Query: 1 MLALMDADGNIAWSGEYDEWGNQLNEENPHHLHQPYRLPGQQYDKESGLYYNRNRYYDPL 60 ++++ DA+ N+ S EYD +G + Y G+++DKE+GLY+ R RYYDP+ Sbjct: 1411 IVSITDANRNVVQSYEYDSFGMV---KPSTVFANSYTYTGREWDKETGLYFYRARYYDPM 1467 Query: 61 QGRYITQDPIGLEGGWSLYAY-PLNPVNGIDPLGLSPA 97 +GR+I++DP+G +GG ++YAY N VN DP GL P Sbjct: 1468 EGRFISKDPVGFKGGINIYAYVSNNVVNDTDPSGLYPG 1505 >UniRef50_B7H4M5 Uncharacterized protein ybfO n=3 Tax=Acinetobacter baumannii RepID=B7H4M5_ACIB3 Length = 229 Score = 82.0 bits (201), Expect = 1e-14, Method: Compositional matrix adjust. Identities = 42/108 (38%), Positives = 59/108 (54%), Gaps = 7/108 (6%) Query: 4 LMDADGNIAWSGEYDEWGNQLNE-------ENPHHLHQPYRLPGQQYDKESGLYYNRNRY 56 + D G + W +Y WG E EN + R GQ +D E+GL+YNR Y Sbjct: 1 MTDHTGVVIWKAQYKAWGECKVEQAKSDFFENSEIISNNIRFQGQYFDGETGLHYNRYCY 60 Query: 57 YDPLQGRYITQDPIGLEGGWSLYAYPLNPVNGIDPLGLSPADVALIRR 104 Y P GR+I++DPIGL GG ++YAY NPV +D LGL+ +++ Sbjct: 61 YSPYVGRFISKDPIGLLGGSNIYAYAPNPVGWVDQLGLAKTPTRTLQK 108 >UniRef50_Q2SPP3 Rhs family protein n=1 Tax=Hahella chejuensis KCTC 2396 RepID=Q2SPP3_HAHCH Length = 265 Score = 82.0 bits (201), Expect = 1e-14, Method: Compositional matrix adjust. Identities = 38/76 (50%), Positives = 49/76 (64%) Query: 33 HQPYRLPGQQYDKESGLYYNRNRYYDPLQGRYITQDPIGLEGGWSLYAYPLNPVNGIDPL 92 H P R GQ YD+E+G +YNR+RYYDP GR+I Q PIGL GG + Y Y NPV +DP Sbjct: 17 HNPLRFQGQYYDEETGFHYNRHRYYDPQSGRFINQAPIGLLGGANAYQYAPNPVGWVDPF 76 Query: 93 GLSPADVALIRRKDQL 108 GL+ + R+ D + Sbjct: 77 GLTAKKESPKRQFDAI 92 >UniRef50_D1S833 YD repeat protein n=1 Tax=Micromonospora aurantiaca ATCC 27029 RepID=D1S833_9ACTO Length = 3829 Score = 82.0 bits (201), Expect = 2e-14, Method: Composition-based stats. Identities = 40/92 (43%), Positives = 52/92 (56%), Gaps = 5/92 (5%) Query: 4 LMDADGNIAWSGEYDEWGNQLNEENPHHLHQPYRLPGQQYDKESGLYYNRNRYYDPLQGR 63 L++ G + W D WG + P PGQ D E+GL+YNR RYYDP GR Sbjct: 2487 LVEPGGGLRWWSRGDLWGRGADRTA-----TPLAFPGQYVDAETGLHYNRFRYYDPATGR 2541 Query: 64 YITQDPIGLEGGWSLYAYPLNPVNGIDPLGLS 95 Y++ DP+GL GG + AY NP+ DPLGL+ Sbjct: 2542 YVSPDPLGLSGGPNPTAYVSNPLTVADPLGLT 2573 >UniRef50_C1M4X0 Predicted protein n=1 Tax=Citrobacter sp. 30_2 RepID=C1M4X0_9ENTR Length = 1494 Score = 81.3 bits (199), Expect = 3e-14, Method: Compositional matrix adjust. Identities = 44/103 (42%), Positives = 57/103 (55%), Gaps = 11/103 (10%) Query: 3 ALMDADGNIAWSGEYDEWGNQLNEENPHH-----------LHQPYRLPGQQYDKESGLYY 51 AL D DG + W + + WG +E + R GQ D+E+GL+Y Sbjct: 1270 ALTDEDGKLHWRQDVETWGETRSEYADEEGGRWRKIWGGAPEENLRFAGQYLDRETGLHY 1329 Query: 52 NRNRYYDPLQGRYITQDPIGLEGGWSLYAYPLNPVNGIDPLGL 94 N RYY P GR+IT DPIGL GG +LY+Y NP++ IDPLGL Sbjct: 1330 NTFRYYAPDMGRFITPDPIGLAGGINLYSYAPNPLSWIDPLGL 1372 >UniRef50_B5H9Z7 Rhs protein n=2 Tax=Streptomyces RepID=B5H9Z7_STRPR Length = 1054 Score = 80.9 bits (198), Expect = 3e-14, Method: Composition-based stats. Identities = 40/92 (43%), Positives = 53/92 (57%), Gaps = 1/92 (1%) Query: 4 LMDADGNIAWSGEYDEWGNQLNEENPHHLHQPYRLPGQQYDKESGLYYNRNRYYDPLQGR 63 L+D G +AW WG + + P R PGQ +D ESGL+YNR+R+YDP GR Sbjct: 749 LVDEQGKVAWRTRATLWGTTTWNRSAT-AYTPLRFPGQYFDPESGLHYNRHRHYDPESGR 807 Query: 64 YITQDPIGLEGGWSLYAYPLNPVNGIDPLGLS 95 Y++ DP+GL + Y NP IDPLGL+ Sbjct: 808 YLSPDPLGLVPAPNAVTYVDNPTRWIDPLGLA 839 >UniRef50_A0LJM9 YD repeat protein n=3 Tax=Syntrophobacter fumaroxidans MPOB RepID=A0LJM9_SYNFM Length = 1433 Score = 80.9 bits (198), Expect = 3e-14, Method: Composition-based stats. Identities = 41/92 (44%), Positives = 53/92 (57%), Gaps = 2/92 (2%) Query: 4 LMDADGNIAWSGEYDEWGNQLNEENPHHLHQPYRLPGQQYDKESGLYYNRNRYYDPLQGR 63 + D+ W YD +G ++ +R PGQ YD E+GL+YN +RYYDP GR Sbjct: 1213 MTDSTNTAVWEAAYDAFGEATIHPASTVVNN-FRFPGQYYDAETGLHYNWHRYYDPKTGR 1271 Query: 64 YITQDPIGLEGGWSLYAYPLN-PVNGIDPLGL 94 Y+T DPIGL GG + Y Y N P+N ID GL Sbjct: 1272 YMTPDPIGLAGGINPYTYAENDPINFIDLYGL 1303 >UniRef50_D1VCV7 YD repeat protein n=1 Tax=Frankia sp. EuI1c RepID=D1VCV7_9ACTO Length = 1572 Score = 80.9 bits (198), Expect = 4e-14, Method: Composition-based stats. Identities = 40/93 (43%), Positives = 50/93 (53%), Gaps = 1/93 (1%) Query: 4 LMDADGNIAWSGEYDEWGNQLNEENPHHLHQPYRLPGQQYDKESGLYYNRNRYYDPLQGR 63 L+D DG +AW WG P P R PGQ +D E+GL YN +RYYDP R Sbjct: 1364 LVDPDGRLAWHSRATLWGVS-PPSTPTTTDCPLRFPGQYHDPETGLNYNFHRYYDPATAR 1422 Query: 64 YITQDPIGLEGGWSLYAYPLNPVNGIDPLGLSP 96 Y T D +GL + + Y NP+ IDP GL+P Sbjct: 1423 YKTSDALGLSPAPNPWTYVTNPLTWIDPFGLAP 1455 >UniRef50_UPI00016A9A82 Rhs family protein n=2 Tax=Burkholderia oklahomensis RepID=UPI00016A9A82 Length = 1489 Score = 80.5 bits (197), Expect = 4e-14, Method: Composition-based stats. Identities = 45/102 (44%), Positives = 53/102 (51%), Gaps = 12/102 (11%) Query: 4 LMDADGNIAWSGEYDEWGNQLNEE-------NP-----HHLHQPYRLPGQQYDKESGLYY 51 L D G + W Y WGN + +E NP L Q RL GQ D E+G Y Sbjct: 1264 LTDGGGRVVWRTRYRAWGNTVLQEYAPEFQANPAGDVMQPLPQALRLQGQYEDLETGFCY 1323 Query: 52 NRNRYYDPLQGRYITQDPIGLEGGWSLYAYPLNPVNGIDPLG 93 + RYYDP GR+IT DPIGL GG + Y Y NP+ IDP G Sbjct: 1324 STFRYYDPDVGRFITPDPIGLAGGLNQYQYAPNPLTWIDPWG 1365 >UniRef50_Q1I7Q5 Putative uncharacterized protein n=11 Tax=Pseudomonas RepID=Q1I7Q5_PSEE4 Length = 1595 Score = 80.5 bits (197), Expect = 4e-14, Method: Compositional matrix adjust. Identities = 39/98 (39%), Positives = 56/98 (57%), Gaps = 5/98 (5%) Query: 2 LALMDADGNIAWSGEYDEWGNQL----NEENPHHLHQPYRLPGQQYDKESGLYYNRNRYY 57 + L D +G++ W+ Y WG+ + + P R GQ D+E+GL+YNR RYY Sbjct: 1338 MELTDEEGHVVWAAHYKAWGDLAELPGSSVAMSNARNPIRFQGQYQDQETGLHYNRFRYY 1397 Query: 58 DPLQGRYITQDPIGLEGGWSLYAYP-LNPVNGIDPLGL 94 DP RY+++DPIG GG + Y Y +PV DP+GL Sbjct: 1398 DPKSARYVSKDPIGFMGGANAYTYTGGSPVTATDPMGL 1435 >UniRef50_D1YPP7 RHS repeat-associated core domain protein n=1 Tax=Veillonella parvula ATCC 17745 RepID=D1YPP7_9FIRM Length = 237 Score = 80.5 bits (197), Expect = 4e-14, Method: Compositional matrix adjust. Identities = 42/99 (42%), Positives = 57/99 (57%), Gaps = 3/99 (3%) Query: 4 LMDADGNIAWSGEYDEWGNQLNEEN--PHHLHQPYRLPGQQYDKESGLYYNRNRYYDPLQ 61 + D +GN+ W EY W +L E HQP+RL Q D+E+GL+YN RYY+P Sbjct: 26 MTDKEGNLFWYVEYTIWA-RLKEATKVTDSAHQPFRLQNQYADRETGLHYNLMRYYEPEA 84 Query: 62 GRYITQDPIGLEGGWSLYAYPLNPVNGIDPLGLSPADVA 100 GR++ QDPIGL G +LY + N +DPLG V+ Sbjct: 85 GRFVNQDPIGLWGEENLYQFAPNATMWLDPLGWKGVTVS 123 >UniRef50_UPI0001B56FBA YD repeat-containing protein n=1 Tax=Streptomyces sp. AA4 RepID=UPI0001B56FBA Length = 1624 Score = 80.5 bits (197), Expect = 5e-14, Method: Composition-based stats. Identities = 40/91 (43%), Positives = 55/91 (60%), Gaps = 2/91 (2%) Query: 4 LMDADGNIAWSGEYDEWGNQLNEENPHHLHQPYRLPGQQYDKESGLYYNRNRYYDPLQGR 63 L+D G IAW +G + E + P R GQ +D+E+GL+YN +RYYDP+ R Sbjct: 1275 LLDDRGEIAWQARSSLFGVVVAESGGTGI--PLRFQGQYFDEETGLHYNFHRYYDPVLAR 1332 Query: 64 YITQDPIGLEGGWSLYAYPLNPVNGIDPLGL 94 Y++ DP+GL GG +AY NP +DPLGL Sbjct: 1333 YLSPDPLGLGGGLDPHAYVSNPHVSVDPLGL 1363 >UniRef50_A7FN18 RHS/YD repeat protein n=5 Tax=cellular organisms RepID=A7FN18_YERP3 Length = 1419 Score = 80.5 bits (197), Expect = 5e-14, Method: Compositional matrix adjust. Identities = 42/101 (41%), Positives = 55/101 (54%), Gaps = 9/101 (8%) Query: 4 LMDADGNIAWSGEYDEWGNQLNEENPH---------HLHQPYRLPGQQYDKESGLYYNRN 54 + +A G + WSG+Y +G + + QP R GQ D E+GL+Y Sbjct: 1194 VTNAQGEMVWSGQYGVFGQVTRQTDAMWRNVSKPLGQFRQPLRYAGQYLDDETGLHYTTY 1253 Query: 55 RYYDPLQGRYITQDPIGLEGGWSLYAYPLNPVNGIDPLGLS 95 RYY P GR+IT DPIGL GG +LY Y NP+ IDP GL+ Sbjct: 1254 RYYAPEVGRFITPDPIGLAGGLNLYQYAPNPLGWIDPWGLA 1294 >UniRef50_A9EVR3 Conserved carbohydrate-binding protein, Rhs family n=1 Tax=Sorangium cellulosum 'So ce 56' RepID=A9EVR3_SORC5 Length = 1367 Score = 80.1 bits (196), Expect = 6e-14, Method: Composition-based stats. Identities = 41/91 (45%), Positives = 51/91 (56%), Gaps = 1/91 (1%) Query: 4 LMDADGNIAWSGEYDEWGNQLNEENPHHLHQPYRLPGQQYDKESGLYYNRNRYYDPLQGR 63 L D G W+ E D G E+ P+R PGQ D E+GLYYNR RYYDP G Sbjct: 1148 LFDTSGVQVWAAETDTLGRTAVEQGAPE-DCPWRWPGQYEDPETGLYYNRFRYYDPDAGN 1206 Query: 64 YITQDPIGLEGGWSLYAYPLNPVNGIDPLGL 94 Y++ DP+GL G + YAY + + DPLGL Sbjct: 1207 YVSPDPLGLLAGTAEYAYAPDSLVWFDPLGL 1237 >UniRef50_A3RZQ8 Core protein n=21 Tax=Ralstonia solanacearum RepID=A3RZQ8_RALSO Length = 782 Score = 80.1 bits (196), Expect = 6e-14, Method: Composition-based stats. Identities = 46/106 (43%), Positives = 60/106 (56%), Gaps = 8/106 (7%) Query: 4 LMDADGNIAWS-GEYDEWGNQLNEENPHHL----HQPYRLPGQQYDKESGLYYNRNRYYD 58 + D + + W + D +G L +ENP L + P R PGQ YD+E+G +YN NR YD Sbjct: 527 ITDTNNLMVWRWDQTDPFGATLPDENPTSLGAFTYNP-RFPGQVYDQETGKHYNANRDYD 585 Query: 59 PLQGRYITQDPIGLEGG-WSLYAY-PLNPVNGIDPLGLSPADVALI 102 P GRY+ DPIGL GG WS YAY P DP GL +A++ Sbjct: 586 PASGRYVQSDPIGLNGGQWSTYAYVDGQPTRYTDPKGLCIGPLAVV 631 >UniRef50_C7PJF4 YD repeat protein n=2 Tax=Chitinophaga pinensis DSM 2588 RepID=C7PJF4_CHIPD Length = 1401 Score = 80.1 bits (196), Expect = 6e-14, Method: Compositional matrix adjust. Identities = 44/89 (49%), Positives = 52/89 (58%), Gaps = 1/89 (1%) Query: 6 DADGNIAWSGEYDEWGNQLNEENPHHLHQPYRLPGQQYDKESGLYYNRNRYYDPLQGRYI 65 DA G W GE D +G +L + P+R GQ D E+GLYYNR RYY PL+G YI Sbjct: 1162 DASGEKVWEGELDIYG-KLRKLAGASDFIPFRRQGQYEDVETGLYYNRFRYYSPLEGLYI 1220 Query: 66 TQDPIGLEGGWSLYAYPLNPVNGIDPLGL 94 +QDPI LEGG Y Y P +DP GL Sbjct: 1221 SQDPIRLEGGSRFYEYSRCPTLILDPFGL 1249 >UniRef50_A4SKJ3 Rhs family protein n=2 Tax=Bacteria RepID=A4SKJ3_AERS4 Length = 1590 Score = 80.1 bits (196), Expect = 6e-14, Method: Compositional matrix adjust. Identities = 44/103 (42%), Positives = 55/103 (53%), Gaps = 12/103 (11%) Query: 4 LMDADGNIAWSGEYDEWGNQLNEENPHHLHQ------------PYRLPGQQYDKESGLYY 51 L G+I W GE WGN + P L + R GQ YD+E+GLYY Sbjct: 1353 LCSEAGDIIWRGEQRLWGNYRADAIPQPLRRFLGDAANEETYCELRYQGQIYDQETGLYY 1412 Query: 52 NRNRYYDPLQGRYITQDPIGLEGGWSLYAYPLNPVNGIDPLGL 94 NR+RY+DP G+YI+ DPIG GG Y NP+ +DPLGL Sbjct: 1413 NRHRYFDPELGQYISPDPIGFAGGVRPQGYVHNPLEWVDPLGL 1455 >UniRef50_B2PW78 Putative uncharacterized protein n=1 Tax=Providencia stuartii ATCC 25827 RepID=B2PW78_PROST Length = 330 Score = 80.1 bits (196), Expect = 6e-14, Method: Compositional matrix adjust. Identities = 41/102 (40%), Positives = 58/102 (56%), Gaps = 9/102 (8%) Query: 2 LALMDADGNIAWSGEYDEWGNQLN---------EENPHHLHQPYRLPGQQYDKESGLYYN 52 L + + GN WSG+Y+ +G + + Q R GQ +D E+GL++N Sbjct: 118 LDVTNEQGNTVWSGKYERFGFVRSSPLSFYSDPDRKMESFEQNLRYAGQYFDNETGLHFN 177 Query: 53 RNRYYDPLQGRYITQDPIGLEGGWSLYAYPLNPVNGIDPLGL 94 R+YDP GR+I DPIGL GG +LYAY NP++ +DP GL Sbjct: 178 TFRFYDPQIGRFIMPDPIGLLGGMNLYAYAPNPMSWVDPFGL 219 >UniRef50_B4ETQ2 Rhs-family protein n=9 Tax=Enterobacteriaceae RepID=B4ETQ2_PROMH Length = 1703 Score = 80.1 bits (196), Expect = 6e-14, Method: Compositional matrix adjust. Identities = 40/100 (40%), Positives = 55/100 (55%), Gaps = 8/100 (8%) Query: 4 LMDADGNIAWSGEYDEWGNQL--------NEENPHHLHQPYRLPGQQYDKESGLYYNRNR 55 + G +W+G + WG E +P++ P+R GQ D+ESGLYYNR R Sbjct: 1442 IFSEGGQASWAGRLNTWGQMQFWRYRDGKAENDPNYTECPFRFAGQYEDEESGLYYNRFR 1501 Query: 56 YYDPLQGRYITQDPIGLEGGWSLYAYPLNPVNGIDPLGLS 95 YYD G+Y++ DPIGL GG + Y Y P +DP GL+ Sbjct: 1502 YYDRETGQYLSPDPIGLLGGLNPYGYVHCPTGWVDPFGLA 1541 >UniRef50_C6WQE4 YD repeat protein n=1 Tax=Actinosynnema mirum DSM 43827 RepID=C6WQE4_ACTMD Length = 2144 Score = 79.7 bits (195), Expect = 8e-14, Method: Composition-based stats. Identities = 33/66 (50%), Positives = 44/66 (66%) Query: 31 HLHQPYRLPGQQYDKESGLYYNRNRYYDPLQGRYITQDPIGLEGGWSLYAYPLNPVNGID 90 H P R PGQ +D E+GL+YN +RYYDP GRY+T+DP+GL + + Y NP +D Sbjct: 1928 HSWTPLRFPGQHHDAETGLHYNVHRYYDPATGRYLTRDPLGLAPAANPWTYADNPTAAVD 1987 Query: 91 PLGLSP 96 P+GL P Sbjct: 1988 PVGLVP 1993 >UniRef50_C5AA19 Rhs family protein n=1 Tax=Burkholderia glumae BGR1 RepID=C5AA19_BURGB Length = 1596 Score = 79.7 bits (195), Expect = 8e-14, Method: Composition-based stats. Identities = 39/95 (41%), Positives = 52/95 (54%), Gaps = 6/95 (6%) Query: 4 LMDADGNIAWSGEYDEWGN------QLNEENPHHLHQPYRLPGQQYDKESGLYYNRNRYY 57 + D G I W Y WG ++++ + P R GQ +D ESGL YNR+RYY Sbjct: 1340 ITDELGEIVWEARYQAWGEARDVIERVSKATGERVRNPLRFQGQHFDDESGLAYNRHRYY 1399 Query: 58 DPLQGRYITQDPIGLEGGWSLYAYPLNPVNGIDPL 92 GRY+++DP L GG + +AY NPV IDPL Sbjct: 1400 AADVGRYVSKDPAELLGGLNEFAYVPNPVQWIDPL 1434 >UniRef50_C7Q0A7 YD repeat protein n=2 Tax=Catenulispora acidiphila DSM 44928 RepID=C7Q0A7_CATAD Length = 1528 Score = 79.3 bits (194), Expect = 9e-14, Method: Composition-based stats. Identities = 39/92 (42%), Positives = 50/92 (54%) Query: 4 LMDADGNIAWSGEYDEWGNQLNEENPHHLHQPYRLPGQQYDKESGLYYNRNRYYDPLQGR 63 L+ DG + W WG + + P R PGQ +D ESGL+YN NRYYD Sbjct: 1304 LVTPDGRVVWHTTTSLWGRTIGTSAESGVDCPLRFPGQYHDDESGLHYNLNRYYDSETAA 1363 Query: 64 YITQDPIGLEGGWSLYAYPLNPVNGIDPLGLS 95 Y+T DP+GL + +AY NP+ DPLGLS Sbjct: 1364 YLTPDPLGLVPAPNDHAYVPNPLTVSDPLGLS 1395 >UniRef50_C9Y441 Putative uncharacterized protein n=1 Tax=Cronobacter turicensis RepID=C9Y441_CROTZ Length = 1394 Score = 79.3 bits (194), Expect = 9e-14, Method: Compositional matrix adjust. Identities = 40/94 (42%), Positives = 54/94 (57%), Gaps = 2/94 (2%) Query: 4 LMDADGNIAWSGEYDEWGNQLNEENP--HHLHQPYRLPGQQYDKESGLYYNRNRYYDPLQ 61 L D +G + W G WG +E P +Q R+ GQ D+E+GL+YN RYYDP Sbjct: 1156 LTDVEGRVRWEGRNSAWGKLAHESTPLPTGYNQNLRMQGQYLDRETGLHYNLFRYYDPDC 1215 Query: 62 GRYITQDPIGLEGGWSLYAYPLNPVNGIDPLGLS 95 GR+ DPIGL GG +LY + N + +DP GL+ Sbjct: 1216 GRFTQHDPIGLAGGINLYQFAPNALGWVDPWGLN 1249 >UniRef50_A9FBU5 Conserved carbohydrate-binding protein, Rhs family n=2 Tax=Proteobacteria RepID=A9FBU5_SORC5 Length = 1300 Score = 79.3 bits (194), Expect = 1e-13, Method: Composition-based stats. Identities = 41/92 (44%), Positives = 54/92 (58%), Gaps = 2/92 (2%) Query: 4 LMDADGNIAWSGEYDEWGNQLNEENPHHLHQPYRLPGQQYDKESGLYYNRNRYYDPLQGR 63 L+ DG I D WG+ + E P R GQ D+E+GL YNR RYYDP GR Sbjct: 1067 LVGPDGQIGCELARDPWGSATSAEGAQ-TSTPLRFRGQYADEETGLSYNRYRYYDPELGR 1125 Query: 64 YITQDPIGLEGGWSLYAYPLN-PVNGIDPLGL 94 YI+ DP+G+EGG +++AY N P + +D GL Sbjct: 1126 YISADPLGIEGGLNVFAYAANCPTSAVDVEGL 1157 >UniRef50_D0KZB0 YD repeat protein n=1 Tax=Halothiobacillus neapolitanus c2 RepID=D0KZB0_HALNC Length = 1467 Score = 79.0 bits (193), Expect = 1e-13, Method: Compositional matrix adjust. Identities = 40/91 (43%), Positives = 52/91 (57%), Gaps = 1/91 (1%) Query: 5 MDADGNIAWSGEYDEWGNQLNEENPHHLHQPYRLPGQQYDKESGLYYNRNRYYDPLQGRY 64 +A+ W D +G++ + + R PGQ YD E+GL+YN NRYYDP GRY Sbjct: 1193 TNANQQTVWRWNRDAFGDRQVNASSASIEMNLRYPGQYYDTETGLFYNWNRYYDPSTGRY 1252 Query: 65 ITQDPIGLEGGWSLYAY-PLNPVNGIDPLGL 94 T DPIGL GG + + Y NP+ IDP GL Sbjct: 1253 ATSDPIGLSGGVNTFGYVSANPLALIDPWGL 1283 >UniRef50_C6WJA6 YD repeat protein n=3 Tax=Actinosynnema mirum DSM 43827 RepID=C6WJA6_ACTMD Length = 1509 Score = 78.6 bits (192), Expect = 1e-13, Method: Composition-based stats. Identities = 42/99 (42%), Positives = 57/99 (57%), Gaps = 5/99 (5%) Query: 4 LMDADGNIAWSGEYDEWGNQLNE--ENPHHLHQPYRLPGQQYDKESGLYYNRNRYYDPLQ 61 L+D G + W + WG L E P P R GQ +D+E+GL+YN +RYYDP Sbjct: 1285 LLDDAGALVWRSQRTLWGAVLAELAGGPD---CPLRFAGQYHDRETGLFYNVHRYYDPET 1341 Query: 62 GRYITQDPIGLEGGWSLYAYPLNPVNGIDPLGLSPADVA 100 RY + DP+GL G + +AY +NP+ DPLGLS + A Sbjct: 1342 ARYSSPDPLGLLAGPNPHAYVVNPLRLTDPLGLSGCERA 1380 >UniRef50_A9GIJ6 Conserved carbohydrate-binding protein, Rhs family n=1 Tax=Sorangium cellulosum 'So ce 56' RepID=A9GIJ6_SORC5 Length = 1429 Score = 78.6 bits (192), Expect = 2e-13, Method: Composition-based stats. Identities = 43/92 (46%), Positives = 53/92 (57%), Gaps = 2/92 (2%) Query: 4 LMDADGNIAWSGEYDEWGNQLNEENPHHLHQPYRLPGQQYDKESGLYYNRNRYYDPLQGR 63 L+DA GN+A + WG E P R GQ YD E+GL YNR RYYDP GR Sbjct: 1056 LLDAAGNVACELDRTVWGAARPREGAR-TETPLRFLGQYYDDETGLAYNRYRYYDPAVGR 1114 Query: 64 YITQDPIGLEGGWSLYAYPLN-PVNGIDPLGL 94 YI+ DP+GL GG + ++Y N P +DP GL Sbjct: 1115 YISVDPVGLLGGQNGFSYAGNRPTKMVDPTGL 1146 >UniRef50_Q7N2G0 Complete genome; segment 11/17 n=4 Tax=Gammaproteobacteria RepID=Q7N2G0_PHOLL Length = 1498 Score = 78.6 bits (192), Expect = 2e-13, Method: Compositional matrix adjust. Identities = 39/97 (40%), Positives = 57/97 (58%), Gaps = 5/97 (5%) Query: 4 LMDADGNIAWSGEYDEWGNQ-----LNEENPHHLHQPYRLPGQQYDKESGLYYNRNRYYD 58 ++ G + W+G WG L +P +L +R GQ D+ESGL+YNR+RYY+ Sbjct: 1289 ILTEAGELIWAGRLLTWGEPECWPVLTVNDPRNLTCNFRFAGQYEDRESGLFYNRHRYYE 1348 Query: 59 PLQGRYITQDPIGLEGGWSLYAYPLNPVNGIDPLGLS 95 G+Y++ DP+ L GG + Y Y +PVN IDP GL+ Sbjct: 1349 SDTGQYLSPDPLNLSGGVNPYGYVHDPVNWIDPFGLA 1385 >UniRef50_A1TJG7 YD repeat protein n=4 Tax=Proteobacteria RepID=A1TJG7_ACIAC Length = 1604 Score = 78.2 bits (191), Expect = 2e-13, Method: Composition-based stats. Identities = 45/128 (35%), Positives = 63/128 (49%), Gaps = 26/128 (20%) Query: 4 LMDADGNIAWSGEYDEWGNQLNEE------------------------NPHHLHQPYRLP 39 + D +G++ W +Y WGN + EE + Q R+ Sbjct: 1374 MSDRNGHLVWRAQYRLWGNAVAEEWQAFDATGRPVNAPMAETGIRAQVSASPAPQNLRMQ 1433 Query: 40 GQQYDKESGLYYNRNRYYDPLQGRYITQDPIGLEGGWSLYAYPLNPVNGIDPLGLS--PA 97 GQ D+E+GL+YN RYYDP G + T DPIGL GG +L+ Y NP++ IDP G + P Sbjct: 1434 GQYLDRETGLHYNTFRYYDPDLGAFTTPDPIGLAGGLNLHGYAANPLSWIDPWGWACIPN 1493 Query: 98 DVALIRRK 105 VA R+ Sbjct: 1494 KVAGTARE 1501 >UniRef50_B4ETQ7 Putative Rhs-family protein n=3 Tax=Enterobacteriaceae RepID=B4ETQ7_PROMH Length = 380 Score = 78.2 bits (191), Expect = 2e-13, Method: Compositional matrix adjust. Identities = 40/95 (42%), Positives = 54/95 (56%), Gaps = 8/95 (8%) Query: 9 GNIAWSGEYDEWGNQL--------NEENPHHLHQPYRLPGQQYDKESGLYYNRNRYYDPL 60 G +W+G + WG E +P++ P+R GQ D+ESGLYYNR RYYD Sbjct: 173 GQASWAGRLNTWGQMQFWRYRDGKAENDPNYTECPFRFAGQYEDEESGLYYNRFRYYDRE 232 Query: 61 QGRYITQDPIGLEGGWSLYAYPLNPVNGIDPLGLS 95 G+Y++ DPIGL GG + Y Y P +DP GL+ Sbjct: 233 TGQYLSPDPIGLLGGLNPYGYVHCPTGWVDPFGLA 267 >UniRef50_C7QG23 YD repeat protein n=2 Tax=Catenulispora acidiphila DSM 44928 RepID=C7QG23_CATAD Length = 1528 Score = 78.2 bits (191), Expect = 2e-13, Method: Composition-based stats. Identities = 38/91 (41%), Positives = 52/91 (57%) Query: 4 LMDADGNIAWSGEYDEWGNQLNEENPHHLHQPYRLPGQQYDKESGLYYNRNRYYDPLQGR 63 L+ DG+I W +G + + H P R PGQ D E+GL+YN +RYYDP +G Sbjct: 1268 LVAPDGSIDWYTTTSLYGTTIATSSDHGADCPLRFPGQFRDDETGLHYNVHRYYDPERGS 1327 Query: 64 YITQDPIGLEGGWSLYAYPLNPVNGIDPLGL 94 Y++ DP+GL + AY NP+ DPLGL Sbjct: 1328 YLSPDPLGLAAAPNDQAYVANPMVSADPLGL 1358 >UniRef50_B1KGR6 YD repeat protein n=1 Tax=Shewanella woodyi ATCC 51908 RepID=B1KGR6_SHEWM Length = 1699 Score = 77.8 bits (190), Expect = 3e-13, Method: Compositional matrix adjust. Identities = 44/122 (36%), Positives = 61/122 (50%), Gaps = 16/122 (13%) Query: 4 LMDADGNIAWSGEYDEWGN--------QLNEENPHHLHQP--------YRLPGQQYDKES 47 L +G+I W GE WG Q ++ +L R GQ +D E+ Sbjct: 1461 LCTENGDIEWRGEQSLWGEHHKWRLSVQAKSKHKKYLEDAANDPVNCDLRYQGQVFDSET 1520 Query: 48 GLYYNRNRYYDPLQGRYITQDPIGLEGGWSLYAYPLNPVNGIDPLGLSPADVALIRRKDQ 107 GLYYNR+RYYDP +Y++ DPIG+ GG AY NP+ +DP GL+ A + Q Sbjct: 1521 GLYYNRHRYYDPETCQYLSPDPIGMAGGLRTQAYVHNPMEWVDPFGLAACPTAEAPQTHQ 1580 Query: 108 LN 109 +N Sbjct: 1581 IN 1582 >UniRef50_C6M9F8 RhsG core protein with extension n=1 Tax=Neisseria sicca ATCC 29256 RepID=C6M9F8_NEISI Length = 194 Score = 77.8 bits (190), Expect = 3e-13, Method: Compositional matrix adjust. Identities = 49/138 (35%), Positives = 65/138 (47%), Gaps = 21/138 (15%) Query: 4 LMDADGNIAWSGEYDEWGNQLNEE--NPHHLHQPYRLPGQQYDKESGLYYNRNRYYDPLQ 61 + D DGN+ W G Y WG +L EE +QP+RL Q D E+GL+YN RYY+ Sbjct: 1 MTDKDGNLLWFGNYTGWG-RLKEEIKVTDSAYQPFRLQNQYADPETGLHYNFFRYYESDA 59 Query: 62 GRYITQDPIGLEGGWSLYAYPLNPVNGIDPLGLSP------------------ADVALIR 103 GR++ QDPIGL GG + Y + N + D GL P A I Sbjct: 60 GRFVNQDPIGLWGGSNSYQFAPNTLKWTDTWGLLPKCSDAKRAECDKQAEIDEATCRQIP 119 Query: 104 RKDQLNHQRAWDILSDTY 121 KD+ R W + + Y Sbjct: 120 EKDKARRSRCWASVQERY 137 >UniRef50_D1AA66 YD repeat-containing protein n=1 Tax=Thermomonospora curvata DSM 43183 RepID=D1AA66_THECD Length = 1197 Score = 77.4 bits (189), Expect = 4e-13, Method: Composition-based stats. Identities = 38/93 (40%), Positives = 49/93 (52%), Gaps = 1/93 (1%) Query: 4 LMDADGNIAWSGEYDEWGNQLNEENPHHLHQPYRLPGQQYDKESGLYYNRNRYYDPLQGR 63 L+ DG+I+W WG + P PGQ YD E+GLY+N RYY P GR Sbjct: 1029 LVAPDGSISWRSHAAVWGAPY-VSRADQISCPLGFPGQYYDSETGLYFNYFRYYSPFDGR 1087 Query: 64 YITQDPIGLEGGWSLYAYPLNPVNGIDPLGLSP 96 Y++ DP+GL + Y Y +NP DP GL P Sbjct: 1088 YLSPDPLGLSPQPNPYIYVINPFVWADPFGLYP 1120 >UniRef50_C4KA75 YD repeat protein n=1 Tax=Thauera sp. MZ1T RepID=C4KA75_THASP Length = 1892 Score = 77.4 bits (189), Expect = 4e-13, Method: Composition-based stats. Identities = 40/92 (43%), Positives = 51/92 (55%), Gaps = 3/92 (3%) Query: 6 DADGNIAWSGEYDEWGNQLNEENPHH--LHQPYRLPGQQYDKESGLYYNRNRYYDPLQGR 63 DADG W + +G +P H RLPGQ +D E+GL+YNR RYY P G Sbjct: 1389 DADGEPLWRARHAPFGAATVTTSPRHPDFTLDLRLPGQVFDAETGLHYNRRRYYAPTLGE 1448 Query: 64 YITQDPIGLEGGWSLYAY-PLNPVNGIDPLGL 94 Y+T DP+G G + YAY NP+ +DP GL Sbjct: 1449 YLTPDPLGTPDGPNPYAYAAFNPLRNVDPDGL 1480 >UniRef50_A3KUM5 Rhs family protein n=10 Tax=Pseudomonas aeruginosa RepID=A3KUM5_PSEAE Length = 931 Score = 77.0 bits (188), Expect = 4e-13, Method: Compositional matrix adjust. Identities = 45/99 (45%), Positives = 57/99 (57%), Gaps = 4/99 (4%) Query: 5 MDADGNIAWSGEYDEWG-NQLNEENPHHLHQPYRLPGQQYDKESGLYYNRNRYYDPLQGR 63 DA G IAW + D +G + + ++ R PGQ YD ESGL+YN R YDP GR Sbjct: 690 TDASGQIAWQWQSDAFGRGEALSQGSTQVN--LRFPGQYYDAESGLHYNYFRDYDPETGR 747 Query: 64 YITQDPIGLEGGWSLYAYPL-NPVNGIDPLGLSPADVAL 101 Y+ DPIGL+GG + Y Y NP+ DP GL+PA L Sbjct: 748 YVESDPIGLKGGLNTYGYVYGNPLTYSDPKGLTPAAAGL 786 >UniRef50_B4V6T6 Rhs protein n=2 Tax=Streptomyces RepID=B4V6T6_9ACTO Length = 1263 Score = 77.0 bits (188), Expect = 5e-13, Method: Composition-based stats. Identities = 41/93 (44%), Positives = 53/93 (56%), Gaps = 1/93 (1%) Query: 4 LMDADGNIAWSGEYDEWGNQLNEENPHHLHQPYRLPGQQYDKESGLYYNRNRYYDPLQGR 63 L+D G IAW WG + + + P R PGQ +D ESGL+YN RYYDP R Sbjct: 1033 LVDEAGAIAWRTRSTLWGATTWNADAN-AYTPLRFPGQYFDPESGLHYNCFRYYDPETAR 1091 Query: 64 YITQDPIGLEGGWSLYAYPLNPVNGIDPLGLSP 96 Y++ DP+GL + AY NP + DPLGL+P Sbjct: 1092 YLSVDPLGLGPAPNPVAYVSNPHSWSDPLGLTP 1124 >UniRef50_D1SWH5 RHS protein (Fragment) n=1 Tax=Acidovorax avenae subsp. avenae ATCC 19860 RepID=D1SWH5_9BURK Length = 280 Score = 77.0 bits (188), Expect = 5e-13, Method: Compositional matrix adjust. Identities = 43/110 (39%), Positives = 56/110 (50%), Gaps = 19/110 (17%) Query: 4 LMDADGNIAWSGEYDEWGNQLNEE-------------------NPHHLHQPYRLPGQQYD 44 + D G + W + WG+ L E + L Q RL GQ D Sbjct: 29 VTDEAGEVRWRASWRTWGSALEERWEAVRIDGSAIPAVQQRHRDEDTLEQNLRLQGQYLD 88 Query: 45 KESGLYYNRNRYYDPLQGRYITQDPIGLEGGWSLYAYPLNPVNGIDPLGL 94 +E+GL+YN RYYDP GR+I+ DPIGL GG +L Y +NP+ IDP GL Sbjct: 89 RETGLHYNTFRYYDPDMGRFISPDPIGLAGGLNLQRYAINPLAWIDPWGL 138 >UniRef50_UPI0001AF2680 RHS/YD repeat-containing protein n=1 Tax=Streptomyces roseosporus NRRL 11379 RepID=UPI0001AF2680 Length = 1592 Score = 76.6 bits (187), Expect = 6e-13, Method: Composition-based stats. Identities = 40/93 (43%), Positives = 50/93 (53%), Gaps = 1/93 (1%) Query: 4 LMDADGNIAWSGEYDEWGNQLNEENPHHLHQPYRLPGQQYDKESGLYYNRNRYYDPLQGR 63 L+D G+IAW WG + P R PGQ D E+GL+YN R+YDP R Sbjct: 1364 LVDPSGDIAWRSRTTLWGATAWPRTST-AYTPLRFPGQYDDPETGLHYNFFRHYDPDAAR 1422 Query: 64 YITQDPIGLEGGWSLYAYPLNPVNGIDPLGLSP 96 Y++ DP+GL G + AY NP DPLGL P Sbjct: 1423 YVSPDPLGLAAGPNPVAYVDNPFTWCDPLGLMP 1455 >UniRef50_B2PVY9 Putative uncharacterized protein n=12 Tax=Enterobacteriaceae RepID=B2PVY9_PROST Length = 707 Score = 76.6 bits (187), Expect = 7e-13, Method: Compositional matrix adjust. Identities = 41/101 (40%), Positives = 55/101 (54%), Gaps = 9/101 (8%) Query: 2 LALMDADGNIAWSGEYDEWGNQLN---------EENPHHLHQPYRLPGQQYDKESGLYYN 52 L + + GN WSG+Y+ +G + E Q R GQ +D E+GL++N Sbjct: 488 LDVTNEQGNTVWSGKYERFGFVRSSPLSFYSDPERKMESFEQNLRYAGQYFDNETGLHFN 547 Query: 53 RNRYYDPLQGRYITQDPIGLEGGWSLYAYPLNPVNGIDPLG 93 R+YDP GR+I DPIGL GG +LY Y NP+ IDP G Sbjct: 548 TFRFYDPQIGRFIMPDPIGLLGGINLYQYAPNPLGWIDPWG 588 >UniRef50_B7GX39 Protein rhsD n=4 Tax=Acinetobacter baumannii RepID=B7GX39_ACIB3 Length = 1590 Score = 76.3 bits (186), Expect = 7e-13, Method: Compositional matrix adjust. Identities = 40/98 (40%), Positives = 55/98 (56%), Gaps = 6/98 (6%) Query: 3 ALMDADGNIAWSGEYDEWG-----NQLNEENPHHLHQPYRLPGQQYDKESGLYYNRNRYY 57 + + G W D WG LN++NP R GQ YD+E+ L+YNR RYY Sbjct: 1300 TMTNIRGECVWEILQDTWGAVSQIKALNQDNPFE-QNNLRFQGQYYDRETELHYNRYRYY 1358 Query: 58 DPLQGRYITQDPIGLEGGWSLYAYPLNPVNGIDPLGLS 95 +P RY+++DPIGLEGG + +Y +P IDP GL+ Sbjct: 1359 EPHSARYVSKDPIGLEGGMNTSSYVSDPNQWIDPKGLN 1396 >UniRef50_D1SVF0 Rhs family protein (Fragment) n=1 Tax=Acidovorax avenae subsp. avenae ATCC 19860 RepID=D1SVF0_9BURK Length = 218 Score = 75.9 bits (185), Expect = 1e-12, Method: Compositional matrix adjust. Identities = 42/112 (37%), Positives = 55/112 (49%), Gaps = 24/112 (21%) Query: 9 GNIAWSGEYDEWGNQLNE-----------------ENPHHLHQPY-------RLPGQQYD 44 GN+ W Y WG + E E H Q + R+ GQ D Sbjct: 2 GNLVWQARYLTWGATVQEHWQAFDAAGRPVDAPVAETGHRPQQSFVLIPQNLRMQGQYLD 61 Query: 45 KESGLYYNRNRYYDPLQGRYITQDPIGLEGGWSLYAYPLNPVNGIDPLGLSP 96 +E+GL+YN RYYDP G + T DPIGL GG +L+ Y LNP+ +DP G +P Sbjct: 62 RETGLHYNTFRYYDPDLGAFTTPDPIGLAGGINLHQYALNPIAWVDPWGWAP 113 >UniRef50_Q4UTI2 Putative uncharacterized protein n=2 Tax=Xanthomonas campestris pv. campestris RepID=Q4UTI2_XANC8 Length = 137 Score = 75.9 bits (185), Expect = 1e-12, Method: Compositional matrix adjust. Identities = 41/110 (37%), Positives = 58/110 (52%), Gaps = 5/110 (4%) Query: 116 ILSDTYEDMKRLNLGGTDQFFHCMAFCRVSKLN-DAGVSRSAKGLGYEKEIRDYGLNLFG 174 ILS +MK N+ G+DQF+HC+A CR ++ + G+ L KE +DY G Sbjct: 3 ILSSLNSEMKSRNIAGSDQFYHCLASCRATQATKNPGLVLEMMAL---KETKDYYAGRLG 59 Query: 175 MYGRKVKLSHSEMIEDNKKDLAVNDHGLTCPSTTDCSDRCSDYINPEHKK 224 +YG + H EM DN++D+A N G TC DC RC + PE + Sbjct: 60 LYGDGRRRGHYEMQSDNQQDMAANQLGATCQMGEDCPRRCMGLV-PERSR 108 >UniRef50_D1YPL4 RHS repeat-associated core domain protein n=1 Tax=Veillonella parvula ATCC 17745 RepID=D1YPL4_9FIRM Length = 167 Score = 75.9 bits (185), Expect = 1e-12, Method: Compositional matrix adjust. Identities = 41/95 (43%), Positives = 57/95 (60%), Gaps = 3/95 (3%) Query: 4 LMDADGNIAWSGEYDEWGNQLNEEN--PHHLHQPYRLPGQQYDKESGLYYNRNRYYDPLQ 61 + D DGN+ W G Y +WG L EE +QP+RL Q D+E+GL+YN R+Y+ Sbjct: 1 MTDKDGNLLWFGNYTDWG-HLKEETRVTDSAYQPFRLHKQYADRETGLHYNFFRHYETDA 59 Query: 62 GRYITQDPIGLEGGWSLYAYPLNPVNGIDPLGLSP 96 G + QDPIGL GG +LY++ N + D LG+ P Sbjct: 60 GPLVNQDPIGLVGGDNLYSFANNATSWTDCLGVLP 94 >UniRef50_C6CPH2 YD repeat protein n=11 Tax=Enterobacteriaceae RepID=C6CPH2_DICZE Length = 1423 Score = 75.9 bits (185), Expect = 1e-12, Method: Compositional matrix adjust. Identities = 42/103 (40%), Positives = 51/103 (49%) Query: 3 ALMDADGNIAWSGEYDEWGNQLNEENPHHLHQPYRLPGQQYDKESGLYYNRNRYYDPLQG 62 AL DG + W Q E GQ D ESGL YNR RYYDP G Sbjct: 1211 ALFTPDGTLRWQAPKATLWGQRQAEKSESPDPGLAFAGQLRDSESGLCYNRFRYYDPAGG 1270 Query: 63 RYITQDPIGLEGGWSLYAYPLNPVNGIDPLGLSPADVALIRRK 105 Y++ DPIG+ GG + YAY NP+ IDPLGL+ + + K Sbjct: 1271 CYVSPDPIGIAGGDNNYAYAPNPITWIDPLGLAGCSIQKLEEK 1313 >UniRef50_A9AKU8 Type VI secretion system Vgr family protein n=10 Tax=Burkholderia RepID=A9AKU8_BURM1 Length = 1981 Score = 75.9 bits (185), Expect = 1e-12, Method: Composition-based stats. Identities = 39/92 (42%), Positives = 51/92 (55%), Gaps = 2/92 (2%) Query: 4 LMDADGNIAWSGEYDEWGNQLNEENPHHLHQPYRLPGQQYDKESGLYYNRNRYYDPLQGR 63 L D + W+ + +G + P R PGQ D+ESGL+YNR RYYDP+ GR Sbjct: 1747 LYDEQREVLWAADLSAYGRTARWLT-RVVDNPIRFPGQYRDEESGLHYNRFRYYDPMVGR 1805 Query: 64 YITQDPIGLEGGWSLYAYPLNPVN-GIDPLGL 94 YI QDPI +GG + Y+Y + N DP GL Sbjct: 1806 YINQDPIAFDGGINFYSYADSAPNIAYDPKGL 1837 >UniRef50_Q9L0E3 Putative Rhs protein n=2 Tax=Streptomyces RepID=Q9L0E3_STRCO Length = 927 Score = 75.5 bits (184), Expect = 1e-12, Method: Composition-based stats. Identities = 44/105 (41%), Positives = 57/105 (54%), Gaps = 6/105 (5%) Query: 4 LMDADGNIAWSGEYDEWGNQLNEENPHHLHQPYRLPGQQYDKESGLYYNRNRYYDPLQGR 63 L+ DG+I W WG ++ P R PGQ +D E+GL+YN RYYDP GR Sbjct: 730 LVSPDGDIGWRLRTTLWGLPVDGSG-GSTDCPLRFPGQYHDPETGLHYNYFRYYDPGLGR 788 Query: 64 YITQDPIGLEGGWSLYAYPLNPVNGIDPLGLSPADVALIRRKDQL 108 Y + DP+GL GG + Y NP IDP GL AL R++ +L Sbjct: 789 YCSLDPLGLAGGPNPAWYTPNPTAWIDPFGL-----ALCRKRPRL 828 >UniRef50_C6CNW6 YD repeat protein n=8 Tax=Enterobacteriaceae RepID=C6CNW6_DICZE Length = 1679 Score = 75.5 bits (184), Expect = 1e-12, Method: Compositional matrix adjust. Identities = 40/103 (38%), Positives = 52/103 (50%), Gaps = 12/103 (11%) Query: 4 LMDADGNIAWSGEYDEWGNQLNEENPHHLHQ------------PYRLPGQQYDKESGLYY 51 L G + W GE WG E+ P L + R GQ YD E+GLYY Sbjct: 1461 LCSETGEVHWRGEQALWGAHREEKIPIPLRRWLGDAANEEVYCELRYQGQVYDSETGLYY 1520 Query: 52 NRNRYYDPLQGRYITQDPIGLEGGWSLYAYPLNPVNGIDPLGL 94 NR+RYYDP +Y++ DP+G+ GG Y NP+ +DP GL Sbjct: 1521 NRHRYYDPETAQYLSGDPLGIAGGLRPQGYVHNPMEWVDPFGL 1563 >UniRef50_Q8GDM7 Rhs n=3 Tax=Photorhabdus RepID=Q8GDM7_PHOLU Length = 1469 Score = 75.5 bits (184), Expect = 1e-12, Method: Compositional matrix adjust. Identities = 49/108 (45%), Positives = 62/108 (57%), Gaps = 6/108 (5%) Query: 2 LALMDADGNIAWSGEYDE-WGNQLNE--ENPHHLHQPYRLPGQQYDKESGLYYNRNRYYD 58 LAL D G W +G +L+ ENP L R GQ +D+ESGL+YNR RYY Sbjct: 1245 LALFDPTGKRVWRRPKQSLYGLRLSGHGENPQ-LDPGLRFAGQLFDEESGLFYNRFRYYL 1303 Query: 59 PLQGRYITQDPIGLEGGWSLYAYPLNPVNGIDPLGLS--PADVALIRR 104 P Y++ DP+GL GG + YAY NP N IDPLGL+ P D + R+ Sbjct: 1304 PEAACYLSPDPLGLNGGPNPYAYVHNPANWIDPLGLAGCPTDYSQKRK 1351 >UniRef50_B6VUY0 Putative uncharacterized protein n=2 Tax=Bacteroides RepID=B6VUY0_9BACE Length = 1442 Score = 75.1 bits (183), Expect = 2e-12, Method: Compositional matrix adjust. Identities = 42/94 (44%), Positives = 53/94 (56%), Gaps = 1/94 (1%) Query: 2 LALMDADGNIAWSGEYDEWGNQLNEENPHHLHQPYRLPGQQYDKESGLYYNRNRYYDPLQ 61 + + D GN W D +G L + P+R GQ D+E+GLYYNR RYYD Sbjct: 1219 IQMYDEQGNKTWDCTLDIYGKVLAIDKGTEFDCPFRYQGQYEDEETGLYYNRFRYYDSNA 1278 Query: 62 GRYITQDPIGLEG-GWSLYAYPLNPVNGIDPLGL 94 G YI+QDPIGLE + Y Y + +GIDPLGL Sbjct: 1279 GSYISQDPIGLESDTLNFYDYVCDLNDGIDPLGL 1312 >UniRef50_UPI0001B58169 YD repeat-containing protein n=1 Tax=Streptomyces sp. AA4 RepID=UPI0001B58169 Length = 1724 Score = 75.1 bits (183), Expect = 2e-12, Method: Compositional matrix adjust. Identities = 43/95 (45%), Positives = 52/95 (54%), Gaps = 6/95 (6%) Query: 4 LMDADGNIAWSGEYDEWGNQLNEENPHHLHQPYRLPGQQYDKESGLYYNRNRYYDPLQGR 63 L+D G +AWS WG P + P + PGQ D ESGL+YN RYYDP R Sbjct: 1222 LIDEHGGLAWSARRSLWGRV----QPGGI--PLQFPGQYADAESGLHYNVFRYYDPAAAR 1275 Query: 64 YITQDPIGLEGGWSLYAYPLNPVNGIDPLGLSPAD 98 YI+QDP+GLEGG + Y +P D LGL D Sbjct: 1276 YISQDPLGLEGGPNPSNYVPDPFAATDVLGLKCGD 1310 >UniRef50_A9C2K3 YD repeat protein n=1 Tax=Delftia acidovorans SPH-1 RepID=A9C2K3_DELAS Length = 1301 Score = 75.1 bits (183), Expect = 2e-12, Method: Composition-based stats. Identities = 34/59 (57%), Positives = 43/59 (72%), Gaps = 1/59 (1%) Query: 37 RLPGQQYDKESGLYYNRNRYYDPLQGRYITQDPIGLEGGWSLYAY-PLNPVNGIDPLGL 94 R PGQ +D+E+GL YN +RYYD GRYI DPIGL GGW+ + Y NP++ +DPLGL Sbjct: 1112 RYPGQVFDEETGLSYNLHRYYDAATGRYIQADPIGLAGGWNRFGYVGENPLSFVDPLGL 1170 >UniRef50_D1W448 RHS repeat-associated core domain protein n=1 Tax=Prevotella buccalis ATCC 35310 RepID=D1W448_9BACT Length = 195 Score = 74.7 bits (182), Expect = 3e-12, Method: Compositional matrix adjust. Identities = 40/105 (38%), Positives = 61/105 (58%), Gaps = 2/105 (1%) Query: 2 LALMDADGNIAWSGEYDEWGNQLNEENPHHLHQPYRLPGQQYDKESGLYYNRNRYYDPLQ 61 + +D+ G + W D +G+ L P+R GQ D E+GLYYNR RYYDP Sbjct: 53 IQALDSKGEVVWDCILDIYGDVLELRGKRDFI-PFRFQGQYEDGETGLYYNRFRYYDPNS 111 Query: 62 GRYITQDPIGLEGGWSLYAYPLNPVNGIDPLGLSP-ADVALIRRK 105 G +I+QDPI + GG+++YAY + + +D GLS + + ++ RK Sbjct: 112 GTFISQDPISILGGFNIYAYVHDVNSWVDVFGLSKYSPIEVLGRK 156 >UniRef50_B2HAQ4 RhsD protein n=4 Tax=Burkholderia RepID=B2HAQ4_BURPS Length = 1531 Score = 74.7 bits (182), Expect = 3e-12, Method: Compositional matrix adjust. Identities = 39/93 (41%), Positives = 55/93 (59%), Gaps = 6/93 (6%) Query: 4 LMDADGNIAWSGEYDEWGN--QLNEENPHHLHQPYRLPGQQYDKESGLYYNRNRYYDPLQ 61 + D+ G W+ YDE G +N + ++ P L GQ D E+GL+YNR+RYYDP Sbjct: 1291 MTDSSGREVWATGYDENGRLVPINAD----IYNPIHLQGQYRDAETGLHYNRHRYYDPAL 1346 Query: 62 GRYITQDPIGLEGGWSLYAYPLNPVNGIDPLGL 94 G +I++DP+GL G +LY Y N + DPLG Sbjct: 1347 GSFISKDPLGLAAGVNLYRYAPNSIGWADPLGF 1379 >UniRef50_B4EGW8 RHS-family protein n=9 Tax=Burkholderiaceae RepID=B4EGW8_BURCJ Length = 1515 Score = 74.3 bits (181), Expect = 3e-12, Method: Compositional matrix adjust. Identities = 47/131 (35%), Positives = 60/131 (45%), Gaps = 11/131 (8%) Query: 3 ALMDADGNIAWSGEYDEWGNQLNEENPHH--------LHQPYRLPGQQYDKESGLYYNRN 54 + D G W Y WG L + P + + R GQ D E+GL YN N Sbjct: 1266 TVFDEQGRPVWKAAYSLWGKLLPVKRPANDADCGATSIDTTLRFSGQWADDETGLNYNLN 1325 Query: 55 RYYDPLQGRYITQDPIGLEGGWSLYAYPLNPVNGIDPLGL---SPADVALIRRKDQLNHQ 111 RYYDP G+Y++ DPIGL GG AY +P IDPLGL PA + R + Sbjct: 1326 RYYDPDSGQYLSADPIGLLGGARTQAYVHDPSQWIDPLGLQGCKPAGKKISMRAYRYEMP 1385 Query: 112 RAWDILSDTYE 122 +D D +E Sbjct: 1386 ERFDTTWDAHE 1396 >UniRef50_C7QAC6 YD repeat protein n=2 Tax=Bacteria RepID=C7QAC6_CATAD Length = 1508 Score = 74.3 bits (181), Expect = 3e-12, Method: Composition-based stats. Identities = 39/95 (41%), Positives = 54/95 (56%), Gaps = 4/95 (4%) Query: 4 LMDADGNIAWSGEYDEWGNQL---NEENPHHLHQPYRLPGQQYDKESGLYYNRNRYYDPL 60 L+ DG +AW D +G + +P L P R GQ +D E+GL+YN RYYDP Sbjct: 1308 LVTPDGRVAWYQNTDLYGQSVAVATGGDPD-LECPLRFAGQYFDAETGLHYNVQRYYDPA 1366 Query: 61 QGRYITQDPIGLEGGWSLYAYPLNPVNGIDPLGLS 95 Y+T DP+GL + +AY NP+ +DPLGL+ Sbjct: 1367 IAAYLTPDPLGLAPALNDHAYVPNPLTMVDPLGLA 1401 >UniRef50_C0FSB7 Putative uncharacterized protein n=1 Tax=Roseburia inulinivorans DSM 16841 RepID=C0FSB7_9FIRM Length = 306 Score = 73.9 bits (180), Expect = 4e-12, Method: Compositional matrix adjust. Identities = 45/105 (42%), Positives = 53/105 (50%), Gaps = 11/105 (10%) Query: 6 DADGNIAWSGEYDEWGNQLN----------EENPHHLHQPYRLPGQQYDKESGLYYNRNR 55 D +G W E D +G N E P+R GQ DKE GLYYNR R Sbjct: 131 DGEGIKVWERELDIYGRVKNVGKGSDRSAAPETGEQCFIPFRFQGQYEDKEIGLYYNRFR 190 Query: 56 YYDPLQGRYITQDPIGLEGG-WSLYAYPLNPVNGIDPLGLSPADV 99 YYDP G+Y QDPIGL GG +LY Y N + +DP GL D+ Sbjct: 191 YYDPSLGQYTQQDPIGLAGGNPTLYGYVFNTMWELDPFGLDWKDL 235 >UniRef50_B3PEN8 Rhsfamily protein n=2 Tax=Cellvibrio japonicus Ueda107 RepID=B3PEN8_CELJU Length = 1401 Score = 73.9 bits (180), Expect = 4e-12, Method: Compositional matrix adjust. Identities = 42/98 (42%), Positives = 55/98 (56%), Gaps = 4/98 (4%) Query: 4 LMDADGNIAWSGEYDEWGNQLNEENPHHLHQPYRLPGQQYDKESGLYYNRNRYYDPLQGR 63 ++D GNI W +Y +G + R PGQ +D E+GL++N R YD GR Sbjct: 1188 MLDDAGNIVWEAQYSAFGKA--HITIDTVENNLRFPGQYFDSETGLHHNYFRDYDSALGR 1245 Query: 64 YITQDPIGLEGGWSLYAYPL-NPVNGIDPLGL-SPADV 99 YI DPIGL GG++ Y Y NP IDPLGL P+D+ Sbjct: 1246 YIQSDPIGLGGGFNTYVYAYQNPAVLIDPLGLYVPSDL 1283 >UniRef50_B4VFT3 Rhs protein n=4 Tax=Bacteria RepID=B4VFT3_9ACTO Length = 1253 Score = 73.9 bits (180), Expect = 4e-12, Method: Composition-based stats. Identities = 39/95 (41%), Positives = 49/95 (51%), Gaps = 1/95 (1%) Query: 4 LMDADGNIAWSGEYDEWGNQLNEENPHHLHQPYRLPGQQYDKESGLYYNRNRYYDPLQGR 63 L+ DG AW WG + + P R PGQ YD ESGL++N R YDP Sbjct: 1035 LLAEDGTTAWHTRATLWGTTTWNSDAT-AYTPLRFPGQYYDPESGLHHNYFRTYDPETAH 1093 Query: 64 YITQDPIGLEGGWSLYAYPLNPVNGIDPLGLSPAD 98 Y++ DP+GL + Y NP DPLGL+PAD Sbjct: 1094 YLSPDPLGLAPAPNPTTYVHNPHTWSDPLGLTPAD 1128 >UniRef50_C4NV50 Rhs repeat family protein n=10 Tax=Gammaproteobacteria RepID=C4NV50_ECOLX Length = 1374 Score = 73.9 bits (180), Expect = 5e-12, Method: Compositional matrix adjust. Identities = 40/93 (43%), Positives = 50/93 (53%), Gaps = 5/93 (5%) Query: 3 ALMDADGNIAWSGEYDEWGN-QLNEENPHHLHQPYRLPGQQYDKESGLYYNRNRYYDPLQ 61 AL D G + W Y +G + + P R PGQ YD E+G +YN RYYDP Sbjct: 1114 ALTDVSGQVVWKASYSPFGKASIIIQGPTF---NLRFPGQYYDAETGFHYNWRRYYDPAT 1170 Query: 62 GRYITQDPIGLEGGWSLYAYPL-NPVNGIDPLG 93 GRYIT DP+GL G + Y Y NP++ DP G Sbjct: 1171 GRYITSDPLGLIDGVNTYGYVHGNPMSNTDPTG 1203 >UniRef50_A4B8X0 YD repeat n=1 Tax=Reinekea blandensis MED297 RepID=A4B8X0_9GAMM Length = 1098 Score = 73.6 bits (179), Expect = 5e-12, Method: Compositional matrix adjust. Identities = 45/98 (45%), Positives = 56/98 (57%), Gaps = 2/98 (2%) Query: 4 LMDADGNIAWSGEYDEWGNQLNEENPHHLHQPYRLPGQQYDKESGLYYNRNRYYDPLQGR 63 L D+ GN+ W +G +++E + L PGQ D ESGL YN R YDP GR Sbjct: 858 LTDSLGNVVWQQHTTPFG-EVHETLGNGLGYLQSFPGQWRDSESGLSYNYYRDYDPSLGR 916 Query: 64 YITQDPIGLEGGWSLYAYP-LNPVNGIDPLGLSPADVA 100 YI DPIGL GG + YAY NP++ IDPLGL D+ Sbjct: 917 YIQSDPIGLGGGLNTYAYVGGNPISRIDPLGLDYEDMV 954 >UniRef50_C7MZM5 Rhs family protein n=1 Tax=Saccharomonospora viridis DSM 43017 RepID=C7MZM5_SACVD Length = 1259 Score = 73.6 bits (179), Expect = 6e-12, Method: Compositional matrix adjust. Identities = 39/95 (41%), Positives = 50/95 (52%), Gaps = 1/95 (1%) Query: 4 LMDADGNIAWSGEYDEWGNQLNEENPHHLHQPYRLPGQQYDKESGLYYNRNRYYDPLQGR 63 L++ G IAW WG ++ + P R PGQ D E+G YN R+YDP R Sbjct: 1052 LINIHGGIAWYHRTTLWGITTDQSR-TGAYTPLRFPGQYADPETGFNYNFQRHYDPASAR 1110 Query: 64 YITQDPIGLEGGWSLYAYPLNPVNGIDPLGLSPAD 98 Y DP+GL GG+ ++Y NP IDPLGL D Sbjct: 1111 YAGTDPLGLVGGFDPHSYVWNPYAWIDPLGLKKCD 1145 >UniRef50_C5T0D9 YD repeat protein n=1 Tax=Acidovorax delafieldii 2AN RepID=C5T0D9_ACIDE Length = 1001 Score = 73.6 bits (179), Expect = 6e-12, Method: Composition-based stats. Identities = 37/77 (48%), Positives = 49/77 (63%), Gaps = 4/77 (5%) Query: 37 RLPGQQYDKESGLYYNRNRYYDPLQGRYITQDPIGLEGGWSLYAY-PLNPVNGIDPLGLS 95 R PGQQ+D E+ L YN +RYYD GRYI DPIGL GGW+ + Y NP++ +DP+GL Sbjct: 802 RYPGQQWDAETKLAYNLHRYYDAATGRYIQADPIGLGGGWNRFGYVGGNPLSFVDPVGLQ 861 Query: 96 PADVALI---RRKDQLN 109 D+ + RR L+ Sbjct: 862 FLDLTTLAGARRNTTLD 878 >UniRef50_A8ZSG6 YD repeat protein n=1 Tax=Desulfococcus oleovorans Hxd3 RepID=A8ZSG6_DESOH Length = 423 Score = 73.2 bits (178), Expect = 7e-12, Method: Compositional matrix adjust. Identities = 40/92 (43%), Positives = 54/92 (58%), Gaps = 3/92 (3%) Query: 4 LMDADGNIAWSGEYDEWGNQLNEENPHHLHQPYRLPGQQYDKESGLYYNRNRYYDPLQGR 63 L ++G + WS +Y+ +G+ E + R PGQ +D ESGL+YN +RYY P GR Sbjct: 260 LTASNGAVVWSAKYESFGDATVEI--ETVENNLRFPGQYFDGESGLHYNLHRYYAPELGR 317 Query: 64 YITQDPIGLEGGWSLYAYPLNPV-NGIDPLGL 94 ++ DPIGL GG + Y Y N V N DP GL Sbjct: 318 FLKDDPIGLRGGINQYIYADNNVSNNTDPYGL 349 >UniRef50_B4SL73 YD repeat protein n=4 Tax=Stenotrophomonas maltophilia RepID=B4SL73_STRM5 Length = 1577 Score = 72.8 bits (177), Expect = 8e-12, Method: Composition-based stats. Identities = 38/88 (43%), Positives = 49/88 (55%), Gaps = 5/88 (5%) Query: 13 WSGEYDEWGNQLNEENPH----HLHQPYRLPGQQYDKESGLYYNRNRYYDPLQGRYITQD 68 WS + + +GNQ+ +P R PGQQ SGL+YN R YDP GRY D Sbjct: 1351 WSNKSEVFGNQIPSTDPDGDGVAFELALRFPGQQATDASGLFYNYRREYDPAVGRYTQSD 1410 Query: 69 PIGLEGGWSLYAY-PLNPVNGIDPLGLS 95 PIGL GG + +AY +P +DPLGL+ Sbjct: 1411 PIGLMGGLNSFAYGSGDPTGRVDPLGLA 1438 >UniRef50_D0BWK2 YD repeat protein n=1 Tax=Acinetobacter sp. RUH2624 RepID=D0BWK2_9GAMM Length = 1361 Score = 72.8 bits (177), Expect = 8e-12, Method: Composition-based stats. Identities = 41/106 (38%), Positives = 58/106 (54%), Gaps = 18/106 (16%) Query: 4 LMDADGNIAWSGEYDEWGNQLNEENPHHLHQP-------YRLPGQQYDKESGLYYNRNRY 56 L+D++G++ WS + +G L P R PGQ YD +G +YN NR+ Sbjct: 1054 LIDSNGSVVWSWDSTAFG----------LGSPVSTITFNLRFPGQYYDATTGQFYNHNRF 1103 Query: 57 YDPLQGRYITQDPIGLEGGWSLYAYPL-NPVNGIDPLGLSPADVAL 101 Y+P GRY+ DPIGL GG + Y Y L NPV +D G +P +A+ Sbjct: 1104 YNPELGRYMEPDPIGLAGGLNPYIYALNNPVMYVDMTGENPILIAM 1149 >UniRef50_A9GD22 Conserved carbohydrate-binding protein, Rhs family n=2 Tax=Sorangium cellulosum 'So ce 56' RepID=A9GD22_SORC5 Length = 1351 Score = 72.8 bits (177), Expect = 9e-12, Method: Composition-based stats. Identities = 33/63 (52%), Positives = 43/63 (68%), Gaps = 1/63 (1%) Query: 35 PYRLPGQQYDKESGLYYNRNRYYDPLQGRYITQDPIGLEGGWSLYAYPLN-PVNGIDPLG 93 P R PGQ D+E+GL YNR RY+DP GRY++ DP GL+GG++ + Y N P +DP G Sbjct: 1079 PLRFPGQYEDEETGLVYNRYRYFDPALGRYLSADPAGLDGGFNGFDYAGNAPTRFVDPSG 1138 Query: 94 LSP 96 L P Sbjct: 1139 LMP 1141 >UniRef50_Q8XSL8 Putative rsh-related protein n=1 Tax=Ralstonia solanacearum RepID=Q8XSL8_RALSO Length = 319 Score = 72.4 bits (176), Expect = 1e-11, Method: Compositional matrix adjust. Identities = 46/104 (44%), Positives = 59/104 (56%), Gaps = 8/104 (7%) Query: 3 ALMDADGNIAWS-GEYDEWGNQLNEENPHHL----HQPYRLPGQQYDKESGLYYNRNRYY 57 + D + + W + D +G L +ENP L + P R PGQ YD E+G +YN NR Y Sbjct: 66 VITDTNNLMVWRWDQADPFGATLPDENPTSLGTFAYNP-RFPGQVYDAETGKHYNANRDY 124 Query: 58 DPLQGRYITQDPIGLEGGW-SLYAYP-LNPVNGIDPLGLSPADV 99 DP+ GRY+ DPIGL GG S YAY +P+ IDP GL V Sbjct: 125 DPVSGRYVQSDPIGLNGGQPSTYAYVDSDPLGFIDPEGLGACTV 168 >UniRef50_A9C2L0 Putative uncharacterized protein n=1 Tax=Delftia acidovorans SPH-1 RepID=A9C2L0_DELAS Length = 187 Score = 72.4 bits (176), Expect = 1e-11, Method: Compositional matrix adjust. Identities = 38/79 (48%), Positives = 50/79 (63%), Gaps = 5/79 (6%) Query: 37 RLPGQQYDKESGLYYNRNRYYDPLQGRYITQDPIGLEGGWSLYAYPL-NPVNGIDPLGLS 95 R PGQ +D+E+GL YN +RYYD GRY+ DPIGLE GW+ + Y NP+N +DP GL Sbjct: 66 RYPGQVFDEETGLSYNLHRYYDAATGRYMQADPIGLEDGWNRFGYVANNPLNDVDPQGLH 125 Query: 96 PADVALIRRKDQLNHQRAW 114 P L+ R L ++ W Sbjct: 126 P----LLTRMQGLYYRYGW 140 >UniRef50_A8ZVU3 YD repeat protein n=1 Tax=Desulfococcus oleovorans Hxd3 RepID=A8ZVU3_DESOH Length = 2831 Score = 72.4 bits (176), Expect = 1e-11, Method: Composition-based stats. Identities = 44/95 (46%), Positives = 55/95 (57%), Gaps = 3/95 (3%) Query: 3 ALMDADGNIAWSGEYDEWGNQLNEENPHHLHQPYRLPGQQYDKESGLYYNRNRYYDPLQG 62 A+ DA GNI +YD +G LN+ P P+ G YDK++GL R YDP G Sbjct: 2603 AVADAAGNIVKQIDYDSFGFMLNDTYPG-FEIPFGFAGGLYDKDTGLVRFGYRDYDPNTG 2661 Query: 63 RYITQDPIGLEGGWS-LYAYPLN-PVNGIDPLGLS 95 R+ +DPIG GG S LY Y LN PVN ID +GL+ Sbjct: 2662 RWTAKDPIGFNGGASDLYGYCLNDPVNMIDGIGLA 2696 >UniRef50_Q3JSF4 RhsD protein n=23 Tax=Burkholderia pseudomallei RepID=Q3JSF4_BURP1 Length = 1593 Score = 72.4 bits (176), Expect = 1e-11, Method: Compositional matrix adjust. Identities = 43/111 (38%), Positives = 58/111 (52%), Gaps = 7/111 (6%) Query: 9 GNIAWSGEYDEWGNQLNEENPHHLHQPYRLPGQQYDKESGLYYNRNRYYDPLQGRYITQD 68 G I W Y G + E + QP RL GQ +D ESGL YNR RY+ G +++QD Sbjct: 1356 GRIVWEARYGPHGGIASIET-DVIQQPIRLQGQIFDWESGLSYNRYRYFLSSIGAFVSQD 1414 Query: 69 PIGLEGGWSLYAYPLNPVNGIDPLGLSPADVALIRRKDQLNHQRAWDILSD 119 PIGL GG +LY + N DPLGL R++ +N RA ++ + Sbjct: 1415 PIGLVGGVNLYRFAPNAFGWTDPLGLKK------RKRKFVNGVRATIVVGN 1459 >UniRef50_B4V251 LipX3 n=2 Tax=Streptomyces RepID=B4V251_9ACTO Length = 1253 Score = 72.4 bits (176), Expect = 1e-11, Method: Composition-based stats. Identities = 37/93 (39%), Positives = 51/93 (54%), Gaps = 1/93 (1%) Query: 4 LMDADGNIAWSGEYDEWGNQL-NEENPHHLHQPYRLPGQQYDKESGLYYNRNRYYDPLQG 62 L+ + G +AW WG + P R PGQ D E+GL+YN RYY+P Sbjct: 1052 LVASGGELAWQRRTTLWGTDFPAPTDTTSADCPIRFPGQYADSETGLHYNFFRYYEPESA 1111 Query: 63 RYITQDPIGLEGGWSLYAYPLNPVNGIDPLGLS 95 RYI+ DP+GLE + +AY +N + DPLGL+ Sbjct: 1112 RYISADPLGLEPAPNHHAYVVNALGWTDPLGLA 1144 >UniRef50_D1T3N4 YD repeat protein n=2 Tax=Betaproteobacteria RepID=D1T3N4_9BURK Length = 1584 Score = 72.0 bits (175), Expect = 1e-11, Method: Compositional matrix adjust. Identities = 41/114 (35%), Positives = 53/114 (46%), Gaps = 24/114 (21%) Query: 4 LMDADGNIAWSGEYDEWGNQLNE--------------------ENPHHLHQPY----RLP 39 + D GN+ W Y WG + E + P P R+ Sbjct: 1355 MSDTRGNLVWQARYLTWGATVQEHWQAFDATGRPADAPLAETCDRPQQSFAPMPQNLRMQ 1414 Query: 40 GQQYDKESGLYYNRNRYYDPLQGRYITQDPIGLEGGWSLYAYPLNPVNGIDPLG 93 GQ D+E+GL+YN RYYDP G + T DPIGL GG +L+ Y NP+ IDP G Sbjct: 1415 GQYLDRETGLHYNTFRYYDPDLGAFTTPDPIGLAGGINLHQYAPNPIAWIDPWG 1468 >UniRef50_UPI00016B0868 Rhs family protein n=1 Tax=Burkholderia pseudomallei 112 RepID=UPI00016B0868 Length = 242 Score = 72.0 bits (175), Expect = 2e-11, Method: Compositional matrix adjust. Identities = 39/93 (41%), Positives = 55/93 (59%), Gaps = 6/93 (6%) Query: 4 LMDADGNIAWSGEYDEWGN--QLNEENPHHLHQPYRLPGQQYDKESGLYYNRNRYYDPLQ 61 + D+ G W+ YDE G +N + ++ P L GQ D E+GL+YNR+RYYDP Sbjct: 1 MTDSSGREVWATGYDENGRLVPINAD----IYNPIHLQGQYRDAETGLHYNRHRYYDPAL 56 Query: 62 GRYITQDPIGLEGGWSLYAYPLNPVNGIDPLGL 94 G +I++DP+GL G +LY Y N + DPLG Sbjct: 57 GSFISKDPLGLAAGVNLYRYAPNSIGWADPLGF 89 >UniRef50_B2STD6 RHS Repeat family n=18 Tax=Bacteria RepID=B2STD6_XANOP Length = 1579 Score = 71.6 bits (174), Expect = 2e-11, Method: Composition-based stats. Identities = 36/88 (40%), Positives = 54/88 (61%), Gaps = 4/88 (4%) Query: 8 DGNIAWSGEYDEWGNQL-NEENPHHLHQPYRLPGQQYDKESGLYYNRNRYYDPLQGRYIT 66 + + WS Y G+++ + EN + P GQ D ESGL+YNR RYY+ G Y++ Sbjct: 1352 ESEVVWSASYGAHGDRIVDVEN---IFNPISGQGQYRDDESGLFYNRYRYYESSVGAYLS 1408 Query: 67 QDPIGLEGGWSLYAYPLNPVNGIDPLGL 94 QDP+G+ G ++Y+Y LN + +DPLGL Sbjct: 1409 QDPMGMLTGSNVYSYGLNAMGWVDPLGL 1436 >UniRef50_B4SR21 YD repeat protein n=2 Tax=Stenotrophomonas maltophilia RepID=B4SR21_STRM5 Length = 738 Score = 71.6 bits (174), Expect = 2e-11, Method: Compositional matrix adjust. Identities = 37/62 (59%), Positives = 40/62 (64%), Gaps = 1/62 (1%) Query: 38 LPGQQYDKESGLYYNRNRYYDPLQGRYITQDPIGLEGGWSLYAYPL-NPVNGIDPLGLSP 96 PGQ YD ESGL+YN R YD GRY+ DPIGL GG + YAY NPV G DPLGL Sbjct: 538 FPGQYYDVESGLWYNGFRDYDASIGRYVQSDPIGLAGGLNTYAYAEGNPVTGFDPLGLCK 597 Query: 97 AD 98 D Sbjct: 598 TD 599 >UniRef50_Q6FD94 Putative RHS-related protein n=1 Tax=Acinetobacter sp. ADP1 RepID=Q6FD94_ACIAD Length = 552 Score = 71.6 bits (174), Expect = 2e-11, Method: Compositional matrix adjust. Identities = 37/58 (63%), Positives = 40/58 (68%), Gaps = 1/58 (1%) Query: 38 LPGQQYDKESGLYYNRNRYYDPLQGRYITQDPIGLEGGWSLYAYPL-NPVNGIDPLGL 94 LPGQ YD ESGL+YN NRY+D GRY DPIGL GG + Y Y NPVN IDP GL Sbjct: 355 LPGQYYDVESGLWYNWNRYFDGTVGRYTQPDPIGLAGGVNTYTYVGNNPVNFIDPYGL 412 >UniRef50_D1WVZ1 YD repeat protein n=1 Tax=Streptomyces sp. ACT-1 RepID=D1WVZ1_9ACTO Length = 2294 Score = 70.9 bits (172), Expect = 3e-11, Method: Composition-based stats. Identities = 40/97 (41%), Positives = 52/97 (53%), Gaps = 4/97 (4%) Query: 1 MLALMDADGNIAWSGEYDEWGNQLNEENPHHLHQPYRLPGQQYDKESGLYYNRNRYYDPL 60 ++ + + DG IA YD G PY G++ D +GL Y RNRYYDP Sbjct: 1897 IVGMANTDGTIATRYTYDPNGQPTTSGAASS--NPYTFTGRESDG-TGLLYYRNRYYDPE 1953 Query: 61 QGRYITQDPIGLEGGWSLYAYPL-NPVNGIDPLGLSP 96 GR+I+QDPIG GG +LY Y L +P DP G +P Sbjct: 1954 SGRFISQDPIGHAGGTNLYQYALSSPTTYTDPSGNNP 1990 >UniRef50_B1HKQ3 Rhs family protein n=1 Tax=Burkholderia pseudomallei S13 RepID=B1HKQ3_BURPS Length = 1749 Score = 70.9 bits (172), Expect = 4e-11, Method: Composition-based stats. Identities = 40/95 (42%), Positives = 53/95 (55%), Gaps = 3/95 (3%) Query: 2 LALMDADGNIAWSGEYDEWGNQLNEENPHHLHQPYRLPGQQYDKESGLYYNRNRYYDPLQ 61 +AL D+ G I YD +GN ++ PY+ G++ D +GLYYNR RYY PL Sbjct: 1522 IALTDSAGAIRQRYSYDPYGNTEQSDSTTGFTNPYQYTGREMDA-AGLYYNRARYYSPLM 1580 Query: 62 GRYITQDPIGLEGG-WSLYAYP-LNPVNGIDPLGL 94 +I++DPI GG S Y Y +PVN D LGL Sbjct: 1581 SGFISEDPITFGGGQLSFYGYSDSDPVNHTDRLGL 1615 >UniRef50_C1B7W9 Putative uncharacterized protein n=1 Tax=Rhodococcus opacus B4 RepID=C1B7W9_RHOOB Length = 514 Score = 70.5 bits (171), Expect = 4e-11, Method: Compositional matrix adjust. Identities = 41/100 (41%), Positives = 50/100 (50%), Gaps = 15/100 (15%) Query: 3 ALMDADGNIAWSGEYDEWGNQLNEENPHHLHQPYRLPGQQYDKESGLYYNRNRYYDPLQG 62 A D G+ W G D HL R PGQ +D E+GL+YN +RYY+P Sbjct: 333 ATTDLWGHTTWRGATDT-----------HL----RFPGQYHDPETGLHYNLHRYYNPHTA 377 Query: 63 RYITQDPIGLEGGWSLYAYPLNPVNGIDPLGLSPADVALI 102 RY+TQDP+GL + YP NP DPLGL P I Sbjct: 378 RYLTQDPLGLAPSPNPNTYPHNPTGWTDPLGLVPCPPTTI 417 >UniRef50_C2LFQ4 Putative uncharacterized protein n=1 Tax=Proteus mirabilis ATCC 29906 RepID=C2LFQ4_PROMI Length = 214 Score = 70.5 bits (171), Expect = 5e-11, Method: Compositional matrix adjust. Identities = 35/70 (50%), Positives = 46/70 (65%) Query: 26 EENPHHLHQPYRLPGQQYDKESGLYYNRNRYYDPLQGRYITQDPIGLEGGWSLYAYPLNP 85 E +P++ P+R GQ D+ESGLYYNR RYYD G+Y++ DPIGL GG + Y Y P Sbjct: 12 ENDPNYTECPFRFAGQYEDEESGLYYNRFRYYDRETGQYLSPDPIGLLGGLNPYGYVHCP 71 Query: 86 VNGIDPLGLS 95 +DP GL+ Sbjct: 72 TGWVDPFGLA 81 >UniRef50_C6CI62 YD repeat protein n=7 Tax=Enterobacteriaceae RepID=C6CI62_DICZE Length = 1475 Score = 70.5 bits (171), Expect = 5e-11, Method: Compositional matrix adjust. Identities = 38/93 (40%), Positives = 46/93 (49%) Query: 3 ALMDADGNIAWSGEYDEWGNQLNEENPHHLHQPYRLPGQQYDKESGLYYNRNRYYDPLQG 62 AL DG + W Q E GQ D ESGL YNR RYYDP G Sbjct: 1226 ALFTPDGTLRWQAPTATLWGQRQAEKSESPDPGLAFAGQLRDSESGLCYNRFRYYDPAGG 1285 Query: 63 RYITQDPIGLEGGWSLYAYPLNPVNGIDPLGLS 95 Y++ DPIG+ GG + Y Y NP+ +DP GL+ Sbjct: 1286 CYVSPDPIGIAGGENNYGYVQNPMCWVDPFGLA 1318 >UniRef50_B8FCM0 YD repeat protein n=1 Tax=Desulfatibacillum alkenivorans AK-01 RepID=B8FCM0_DESAA Length = 1448 Score = 70.5 bits (171), Expect = 5e-11, Method: Compositional matrix adjust. Identities = 39/103 (37%), Positives = 60/103 (58%), Gaps = 3/103 (2%) Query: 1 MLALMDADGNIAWSGEYDEWGNQLNEENPHHLHQPYRLPGQQYDKESGLYYNRNRYYDPL 60 + AL++A GN YD +GN L+ N QP+R ++YD+E+GLYY R+Y P+ Sbjct: 1185 VTALLNATGNACAWYAYDPYGNLLH--NTGAPVQPFRFSTKEYDEETGLYYFGRRFYSPV 1242 Query: 61 QGRYITQDPIGLEGGWSLYAYP-LNPVNGIDPLGLSPADVALI 102 R++T+DP G +LY + NP+ DPLG A++A + Sbjct: 1243 MARWLTRDPKGEGASLNLYEFSRSNPLAYFDPLGAQDAELAAM 1285 >UniRef50_D1PUV1 RHS family protein n=1 Tax=Prevotella bergensis DSM 17361 RepID=D1PUV1_9BACT Length = 284 Score = 70.1 bits (170), Expect = 5e-11, Method: Compositional matrix adjust. Identities = 35/65 (53%), Positives = 43/65 (66%) Query: 35 PYRLPGQQYDKESGLYYNRNRYYDPLQGRYITQDPIGLEGGWSLYAYPLNPVNGIDPLGL 94 P+ GQ YD+E L YNR RYYDP GRYI++DP+ L GG +LY Y N + DPLGL Sbjct: 85 PFLFQGQYYDEEVKLAYNRFRYYDPELGRYISEDPVRLLGGSNLYRYVENTILWCDPLGL 144 Query: 95 SPADV 99 S A + Sbjct: 145 SSAKL 149 >UniRef50_D1JME6 Rhs family protein n=22 Tax=Bacteroides RepID=D1JME6_9BACE Length = 1494 Score = 70.1 bits (170), Expect = 5e-11, Method: Compositional matrix adjust. Identities = 42/90 (46%), Positives = 52/90 (57%), Gaps = 2/90 (2%) Query: 6 DADGNIAWSGEYDEWGNQLNEENPHHLHQPYRLPGQQYDKESGLYYNRNRYYDPLQGRYI 65 D +GN WS D GN + EE + P+ GQ YD+E+GL YNR RYY P G Y+ Sbjct: 1266 DTEGNEVWSRVLDMDGNVI-EETGNKGMVPFLFQGQYYDRETGLAYNRFRYYSPKMGVYV 1324 Query: 66 TQDPIGLEGG-WSLYAYPLNPVNGIDPLGL 94 +QDPIGL GG +LY Y + ID GL Sbjct: 1325 SQDPIGLGGGILNLYGYVDDTNVWIDSFGL 1354 >UniRef50_A9C2K9 Putative uncharacterized protein n=1 Tax=Delftia acidovorans SPH-1 RepID=A9C2K9_DELAS Length = 222 Score = 70.1 bits (170), Expect = 6e-11, Method: Compositional matrix adjust. Identities = 34/59 (57%), Positives = 42/59 (71%), Gaps = 1/59 (1%) Query: 37 RLPGQQYDKESGLYYNRNRYYDPLQGRYITQDPIGLEGGWSLYAYP-LNPVNGIDPLGL 94 R PGQ +D+E+GL Y+ +RYYD GRYI DPIGLEGGW+ + Y NP+N I P GL Sbjct: 40 RYPGQVFDEETGLSYSLHRYYDAATGRYIQADPIGLEGGWNWFGYVGGNPLNFIGPKGL 98 >UniRef50_C0FSB4 Putative uncharacterized protein n=1 Tax=Roseburia inulinivorans DSM 16841 RepID=C0FSB4_9FIRM Length = 223 Score = 69.7 bits (169), Expect = 7e-11, Method: Compositional matrix adjust. Identities = 40/92 (43%), Positives = 52/92 (56%), Gaps = 2/92 (2%) Query: 6 DADGNIAWSGEYDEWGNQLNEENPHHLHQ-PYRLPGQQYDKESGLYYNRNRYYDPLQGRY 64 D +G W D +G EE + P+R GQ D+E+GLYYNR RYY P +G Y Sbjct: 5 DEEGKKVWERNLDIYGRVKTEEALGEKNLIPFRFQGQYEDEETGLYYNRFRYYSPEEGCY 64 Query: 65 ITQDPIGLEGG-WSLYAYPLNPVNGIDPLGLS 95 QDPIGL GG +LY Y + + +DP GL+ Sbjct: 65 TQQDPIGLAGGNPTLYGYVYDTLCELDPFGLA 96 >UniRef50_A9EZF2 Conserved carbohydrate-binding protein, Rhs family n=1 Tax=Sorangium cellulosum 'So ce 56' RepID=A9EZF2_SORC5 Length = 1352 Score = 69.7 bits (169), Expect = 8e-11, Method: Composition-based stats. Identities = 35/95 (36%), Positives = 54/95 (56%), Gaps = 3/95 (3%) Query: 4 LMDADGNIAWSGEYDEWGNQLNEENPHH---LHQPYRLPGQQYDKESGLYYNRNRYYDPL 60 L+ +G +AWS + +G P + P+RL GQ +D+E+ L Y R RY+DP Sbjct: 1075 LIGPEGEVAWSAHHSAFGIITEAARPKGGPLVASPFRLLGQYHDEETELCYVRYRYFDPK 1134 Query: 61 QGRYITQDPIGLEGGWSLYAYPLNPVNGIDPLGLS 95 R+++ DP+ L G +L+A+ +P DPLGLS Sbjct: 1135 TARFLSPDPLELFGSRNLFAFDGSPTTHADPLGLS 1169 >UniRef50_UPI0001B4DD67 Rhs protein n=1 Tax=Streptomyces hygroscopicus ATCC 53653 RepID=UPI0001B4DD67 Length = 1730 Score = 69.7 bits (169), Expect = 8e-11, Method: Composition-based stats. Identities = 37/94 (39%), Positives = 51/94 (54%), Gaps = 3/94 (3%) Query: 4 LMDADGNIAWSGEYDEWGNQL-NEENPHHLHQPYRLPGQQYDKESGLYYNRNRYYDPLQG 62 L+D G+ AW WG N + ++ P R PGQ +D E+G +YN +R+YDP Sbjct: 1515 LVDDTGHTAWQSRTTLWGTTTWNTDAAAYI--PLRFPGQYHDLETGHHYNLHRHYDPETA 1572 Query: 63 RYITQDPIGLEGGWSLYAYPLNPVNGIDPLGLSP 96 RY+T DP+GL + Y NP DP GL+P Sbjct: 1573 RYLTPDPLGLAPAPNPTTYVHNPHVWADPDGLAP 1606 >UniRef50_C8W2A7 YD repeat protein n=3 Tax=Desulfotomaculum acetoxidans DSM 771 RepID=C8W2A7_DESAS Length = 1732 Score = 69.3 bits (168), Expect = 1e-10, Method: Compositional matrix adjust. Identities = 41/111 (36%), Positives = 58/111 (52%), Gaps = 8/111 (7%) Query: 4 LMDADGNIAWSGEYDEWGNQLNEENPHHLHQPYRLPGQQYDKESGLYYNRNRYYDPLQGR 63 L + DG YD WGNQ++ + P+R G YD+E+GLYY ++RYY P GR Sbjct: 1505 LTNIDGGFYAKYNYDPWGNQISYSG--WISAPFRYAGYYYDEETGLYYLKSRYYSPALGR 1562 Query: 64 YITQDPIGL-----EGGWSLYAYP-LNPVNGIDPLGLSPADVALIRRKDQL 108 ++T+D I +LY+Y NPVN +DP G P +D+L Sbjct: 1563 FLTKDSIKYIKYKNPQTLNLYSYAGSNPVNNVDPTGEIPVRATWKAFEDKL 1613 >UniRef50_Q2T5B9 YD repeat protein n=39 Tax=Proteobacteria RepID=Q2T5B9_BURTA Length = 1553 Score = 69.3 bits (168), Expect = 1e-10, Method: Compositional matrix adjust. Identities = 39/118 (33%), Positives = 53/118 (44%), Gaps = 21/118 (17%) Query: 4 LMDADGNIAWSGEYDEWGNQLNEENPHHLHQPYR---------------------LPGQQ 42 L +DG W WG+ ++ L R PGQ Sbjct: 1289 LYSSDGRALWRARRTAWGDTAGDDGRDSLRSAVREQLRLGHRDSDEFDPPDCELRFPGQW 1348 Query: 43 YDKESGLYYNRNRYYDPLQGRYITQDPIGLEGGWSLYAYPLNPVNGIDPLGLSPADVA 100 D+ESGL+YN +RYYDP G+Y++ DP+GL GG +AY +P+ DP GL D Sbjct: 1349 ADEESGLHYNLHRYYDPSTGQYLSADPVGLAGGLRTHAYVHDPMQWGDPFGLQGYDTV 1406 >UniRef50_B2PYS3 Putative uncharacterized protein n=1 Tax=Providencia stuartii ATCC 25827 RepID=B2PYS3_PROST Length = 191 Score = 69.3 bits (168), Expect = 1e-10, Method: Compositional matrix adjust. Identities = 34/66 (51%), Positives = 42/66 (63%) Query: 30 HHLHQPYRLPGQQYDKESGLYYNRNRYYDPLQGRYITQDPIGLEGGWSLYAYPLNPVNGI 89 Q R GQ +D E+GL++N R+YDP GR+I DPIGL GG +LY Y NP+ I Sbjct: 2 ESFEQNLRYAGQYFDNETGLHFNTFRFYDPQIGRFIMPDPIGLLGGVNLYQYAPNPLVWI 61 Query: 90 DPLGLS 95 DP GLS Sbjct: 62 DPWGLS 67 >UniRef50_A1WE65 Rhs family protein-like protein n=1 Tax=Verminephrobacter eiseniae EF01-2 RepID=A1WE65_VEREI Length = 303 Score = 69.3 bits (168), Expect = 1e-10, Method: Compositional matrix adjust. Identities = 45/113 (39%), Positives = 60/113 (53%), Gaps = 9/113 (7%) Query: 2 LALMDADGNIAWSGEYDEWG-----NQLNEENPHHLHQPYRLPGQQYDKESGLYYNRNRY 56 L D GN+ W+ Y+ +G + ++ RLPGQ D E+GL+YN +RY Sbjct: 78 LQATDNAGNVVWAANYNAFGRADVVTPRSATGDSRINSQLRLPGQYEDVETGLHYNFHRY 137 Query: 57 YDPLQGRYITQDPIGLEGGWSLYAYPL-NPVNGIDPLGLSPADVALIRRKDQL 108 YD GRY DPIGL GG + Y Y NP++ DPLGL DV L R ++ Sbjct: 138 YDLDIGRYSQIDPIGLRGGLNGYVYVNGNPLSFTDPLGL---DVELRCRPAEI 187 >UniRef50_C5ID01 RhsK n=50 Tax=cellular organisms RepID=C5ID01_ECOLX Length = 1616 Score = 69.3 bits (168), Expect = 1e-10, Method: Compositional matrix adjust. Identities = 44/111 (39%), Positives = 56/111 (50%), Gaps = 12/111 (10%) Query: 2 LALMDADGNIAWS-GEYDEWGNQL------NEENPHHLHQPYRLPGQQY-----DKESGL 49 L L +++G W G+ WG L + +P P PG Y D ESGL Sbjct: 1357 LMLFNSEGKTVWRPGQTSLWGLALSLPADTDYPDPRGERDPEADPGLLYAGQWQDAESGL 1416 Query: 50 YYNRNRYYDPLQGRYITQDPIGLEGGWSLYAYPLNPVNGIDPLGLSPADVA 100 YNR RYY+P G Y+ DP+GL+GG Y Y NP IDPLGL+ +A Sbjct: 1417 CYNRFRYYEPETGMYLVSDPLGLQGGEQTYRYVPNPCGYIDPLGLAICQLA 1467 >UniRef50_Q2SIG5 Rhs family protein n=1 Tax=Hahella chejuensis KCTC 2396 RepID=Q2SIG5_HAHCH Length = 1427 Score = 69.3 bits (168), Expect = 1e-10, Method: Compositional matrix adjust. Identities = 39/101 (38%), Positives = 61/101 (60%), Gaps = 7/101 (6%) Query: 4 LMDADGNIAWSGEYDEWGNQ--LNEENPHHLHQPYRLPGQQYDKESGLYYNRNRYYDPLQ 61 + +G I W Y+ +G ++ ++L R PGQ +D+ESG+++N R Y+P Sbjct: 1166 MASRNGQIVWKAAYEVFGRVKIFVDKAENNL----RFPGQYFDQESGMHHNYFRDYNPGY 1221 Query: 62 GRYITQDPIGLEGGWSLYAYP-LNPVNGIDPLGLSPADVAL 101 GRYI +DPI + GG ++YAY NPV +DPLGL+ +V + Sbjct: 1222 GRYIQRDPISVYGGINVYAYANGNPVVYMDPLGLAKMNVGV 1262 >UniRef50_B2Q762 Putative uncharacterized protein n=2 Tax=Providencia stuartii ATCC 25827 RepID=B2Q762_PROST Length = 173 Score = 68.9 bits (167), Expect = 1e-10, Method: Compositional matrix adjust. Identities = 34/64 (53%), Positives = 42/64 (65%) Query: 32 LHQPYRLPGQQYDKESGLYYNRNRYYDPLQGRYITQDPIGLEGGWSLYAYPLNPVNGIDP 91 Q R GQ +D E+GL++N R+YDP GR+I DPIGL GG +LYAY NP+ IDP Sbjct: 4 FEQNLRYAGQYFDNETGLHFNTFRFYDPQIGRFIMPDPIGLLGGINLYAYAPNPLGWIDP 63 Query: 92 LGLS 95 G S Sbjct: 64 WGWS 67 >UniRef50_D2KTW4 Putative uncharacterized protein n=2 Tax=Streptomyces RepID=D2KTW4_9ACTO Length = 1097 Score = 68.9 bits (167), Expect = 1e-10, Method: Compositional matrix adjust. Identities = 39/91 (42%), Positives = 48/91 (52%), Gaps = 1/91 (1%) Query: 4 LMDADGNIAWSGEYDEWGNQLNEENPHHLHQPYRLPGQQYDKESGLYYNRNRYYDPLQGR 63 L+ DG++AW WG L + P R PGQ D E+GL YN +RYYDP + Sbjct: 913 LITPDGHLAWQHRTTLWGTPLPTPS-DTTTCPLRFPGQYADPETGLNYNHHRYYDPETAQ 971 Query: 64 YITQDPIGLEGGWSLYAYPLNPVNGIDPLGL 94 Y+T DP+GL AY NP DPLGL Sbjct: 972 YLTPDPLGLAPAPHPRAYVHNPHTWQDPLGL 1002 >UniRef50_A8ZTS1 YD repeat protein n=4 Tax=Desulfococcus oleovorans Hxd3 RepID=A8ZTS1_DESOH Length = 1935 Score = 68.6 bits (166), Expect = 2e-10, Method: Composition-based stats. Identities = 36/89 (40%), Positives = 53/89 (59%), Gaps = 1/89 (1%) Query: 7 ADGNIAWSGEYDEWGNQLNEENPHHLHQPYRLPGQQYDKESGLYYNRNRYYDPLQGRYIT 66 ADG +A + EY +G +++ +R + +D ESGLY RYYDP GR+I+ Sbjct: 1667 ADGTVAAAYEYAPFGGLIHKSGVMADENVFRFSTKYWDGESGLYEYGLRYYDPETGRWIS 1726 Query: 67 QDPIGLEGGWSLYAYPLN-PVNGIDPLGL 94 +DP+G GG +LY++ +N P N D LGL Sbjct: 1727 RDPVGESGGLNLYSFVMNDPTNFFDLLGL 1755 >UniRef50_C5SNJ4 YD repeat protein n=1 Tax=Asticcacaulis excentricus CB 48 RepID=C5SNJ4_9CAUL Length = 1421 Score = 68.6 bits (166), Expect = 2e-10, Method: Composition-based stats. Identities = 37/93 (39%), Positives = 52/93 (55%), Gaps = 3/93 (3%) Query: 3 ALMDADGNIAWSGEYDEWGNQLNEENPHHLHQPYRLPGQQYDKESGLYYNRNRYYDPLQG 62 A+++++G + + YDE+G L +R GQ Y E GLYY + R Y P G Sbjct: 1103 AVLNSNGTVNSTYAYDEYGVPYTTSGS--LFSRFRYTGQAYLSEIGLYYYKARMYSPTLG 1160 Query: 63 RYITQDPIGLEGGWSLYAYPLN-PVNGIDPLGL 94 R++ DPIG + G + YAY N P+NG DP GL Sbjct: 1161 RFLQTDPIGYDDGMNWYAYVHNDPMNGKDPSGL 1193 >UniRef50_B2PW71 Putative uncharacterized protein n=1 Tax=Providencia stuartii ATCC 25827 RepID=B2PW71_PROST Length = 191 Score = 68.6 bits (166), Expect = 2e-10, Method: Compositional matrix adjust. Identities = 36/73 (49%), Positives = 45/73 (61%), Gaps = 2/73 (2%) Query: 32 LHQPYRLPGQQYDKESGLYYNRNRYYDPLQGRYITQDPIGLEGGWSLYAYPLNPVNGIDP 91 Q R GQ +D E+GL++N R+YDP GR+I DPIGL GG +LY Y NP+ IDP Sbjct: 4 FEQNLRYAGQYFDNETGLHFNTFRFYDPQIGRFIMPDPIGLLGGINLYQYAPNPLGWIDP 63 Query: 92 LGLS--PADVALI 102 GL+ PA I Sbjct: 64 WGLAGNPATATHI 76 >UniRef50_Q7NYN4 Probable rhs-related transmembrane protein related n=1 Tax=Chromobacterium violaceum RepID=Q7NYN4_CHRVO Length = 282 Score = 68.2 bits (165), Expect = 2e-10, Method: Compositional matrix adjust. Identities = 41/97 (42%), Positives = 54/97 (55%), Gaps = 6/97 (6%) Query: 4 LMDADGNIAWSGEYDEWGNQLNEENPHHLHQPY----RLPGQQYDKESGLYYNRNRYYDP 59 ++D++ W E D +GN+ ++P + + R PGQ YD E+G YN R YDP Sbjct: 47 VVDSNNVEVWRWEGDAYGNERPNQDPSNTGNAFVFNLRYPGQYYDTETGRMYNGWRDYDP 106 Query: 60 LQGRYITQDPIGLEGG-WSLYAYP-LNPVNGIDPLGL 94 GRY+ DP GL GG WS Y Y NP N IDP G+ Sbjct: 107 AVGRYVQSDPFGLLGGQWSTYGYVNGNPTNAIDPSGM 143 >UniRef50_B7GLX9 Rhs family protein n=1 Tax=Anoxybacillus flavithermus WK1 RepID=B7GLX9_ANOFW Length = 295 Score = 67.8 bits (164), Expect = 3e-10, Method: Compositional matrix adjust. Identities = 44/110 (40%), Positives = 57/110 (51%), Gaps = 11/110 (10%) Query: 1 MLALMDADGNIAWSGEYDEWGNQLNEENPHHLHQPYRLPGQQYDKESGLYYNRNRYYDPL 60 ++AL D GNI +YD WGN L++ PYR G QYD E+GLYY RYY P Sbjct: 71 VIALTDEQGNIVARYQYDAWGNILSQSGDLEDENPYRYAGYQYDNETGLYYLIARYYYPE 130 Query: 61 QGRYITQDP-------IGLEGGWSLYAYP-LNPVNGIDPLGLSPADVALI 102 G +++ DP I + G YAY NPV DP G +P + L+ Sbjct: 131 HGVFLSLDPDPGDADDILTQNG---YAYANNNPVMLTDPDGENPYIIILV 177 >UniRef50_B2UQS6 Putative uncharacterized protein n=1 Tax=Akkermansia muciniphila ATCC BAA-835 RepID=B2UQS6_AKKM8 Length = 284 Score = 67.8 bits (164), Expect = 3e-10, Method: Compositional matrix adjust. Identities = 38/95 (40%), Positives = 56/95 (58%), Gaps = 4/95 (4%) Query: 1 MLALMDADGNIAWSGEYDEWGNQLNEENPHHLHQPYRLPGQQYDKESGLYYNRNRYYDPL 60 + + DA G IA + +Y +G + + L QP + G+ +D+ES L Y R+Y+P Sbjct: 43 VTEVFDAQGTIAAAYDYSPYGAVTSTGS---LVQPVQWSGEMHDEESSLVYYNYRFYNPK 99 Query: 61 QGRYITQDPIGLEGGWSLYAYPLN-PVNGIDPLGL 94 GR+I +DPI EGGW+LYA+ N P + D LGL Sbjct: 100 DGRWINRDPIAEEGGWNLYAFLGNSPQDKFDALGL 134 >UniRef50_B7X4I1 Putative uncharacterized protein n=1 Tax=Comamonas testosteroni KF-1 RepID=B7X4I1_COMTE Length = 221 Score = 67.4 bits (163), Expect = 4e-10, Method: Compositional matrix adjust. Identities = 38/85 (44%), Positives = 50/85 (58%), Gaps = 5/85 (5%) Query: 16 EYDEWGNQLNEENP----HHLHQPYRLPGQQYDKESGLYYNRNRYYDPLQGRYITQDPIG 71 E D +G+ +P H H R PGQ D E+GL+YN R YDP GRY+ DPIG Sbjct: 7 EMDPFGDGDAATDPDGDGSHTHIALRFPGQIRDDETGLHYNYFRDYDPETGRYVQSDPIG 66 Query: 72 LEGGWSLYAYPL-NPVNGIDPLGLS 95 L G + YAY NP++ +DP+GL+ Sbjct: 67 LLEGINTYAYVRGNPLSNVDPMGLA 91 >UniRef50_A9GLF7 Conserved exported carbohydrate-binding protein,Rhs family n=1 Tax=Sorangium cellulosum 'So ce 56' RepID=A9GLF7_SORC5 Length = 773 Score = 67.0 bits (162), Expect = 5e-10, Method: Composition-based stats. Identities = 38/103 (36%), Positives = 58/103 (56%), Gaps = 4/103 (3%) Query: 2 LALMDADGNIAWSGEYDEWGNQLNEENPHHLHQPYRLPGQQYDKESGLYYNRNRYYDPLQ 61 L L++ADG++ E D +G + + L R PG Y+ E+GL+YNR RY+DP Sbjct: 447 LGLVNADGSVDQVFETDAYGRVIAGDAAATL---VRFPGHWYEPETGLHYNRWRYFDPAS 503 Query: 62 GRYITQDPIGLEGGWSLYAY-PLNPVNGIDPLGLSPADVALIR 103 Y++ +P+GLEGG Y Y P+ +D GL +D++ R Sbjct: 504 ATYLSPEPLGLEGGLEAYGYVSGRPLALVDLDGLMGSDLSTRR 546 >UniRef50_D2UF28 Putative rhs family protein n=2 Tax=Xanthomonas albilineans RepID=D2UF28_XANAL Length = 1812 Score = 66.6 bits (161), Expect = 6e-10, Method: Composition-based stats. Identities = 38/91 (41%), Positives = 51/91 (56%), Gaps = 4/91 (4%) Query: 4 LMDADGNIAWSGEYDEWGNQLNEENPHHLHQPYRLPGQQYDKESGLYYNRNRYYDPLQGR 63 L D +GN +YD +G ++ PY+ G++ D SGLYY R RYY P R Sbjct: 1556 LTDTNGNAVQRYDYDPYGTTTQSSASYN--NPYQYTGRERDA-SGLYYYRARYYTPELAR 1612 Query: 64 YITQDPIGLEGGWSLYAYP-LNPVNGIDPLG 93 +I++DPI L GG + YAY NPV DP+G Sbjct: 1613 FISEDPIKLAGGVNTYAYTGGNPVMYRDPVG 1643 >UniRef50_A9C2K7 Rhs family protein-like protein n=1 Tax=Delftia acidovorans SPH-1 RepID=A9C2K7_DELAS Length = 285 Score = 66.6 bits (161), Expect = 6e-10, Method: Compositional matrix adjust. Identities = 34/59 (57%), Positives = 41/59 (69%), Gaps = 1/59 (1%) Query: 37 RLPGQQYDKESGLYYNRNRYYDPLQGRYITQDPIGLEGGWSLYAYP-LNPVNGIDPLGL 94 R PGQ +D+E+GL YN +RYYD GRYI DPIGLEGGW+ + Y NP+ DP GL Sbjct: 71 RYPGQVFDEETGLSYNLHRYYDAATGRYIQADPIGLEGGWNRFGYVGGNPLIYGDPQGL 129 >UniRef50_UPI000196D8DA RHS family protein n=1 Tax=Neisseria mucosa ATCC 25996 RepID=UPI000196D8DA Length = 180 Score = 66.6 bits (161), Expect = 7e-10, Method: Compositional matrix adjust. Identities = 32/66 (48%), Positives = 44/66 (66%) Query: 38 LPGQQYDKESGLYYNRNRYYDPLQGRYITQDPIGLEGGWSLYAYPLNPVNGIDPLGLSPA 97 + Q D E+GL+YN RYY+P GR++ QDPIGL GG +LYA+ LN +DPLGL+ Sbjct: 1 MQNQYADCETGLHYNFFRYYEPDAGRFVNQDPIGLLGGENLYAFGLNTKAWMDPLGLASK 60 Query: 98 DVALIR 103 A ++ Sbjct: 61 ATAKMQ 66 >UniRef50_A8ZXK4 YD repeat protein n=3 Tax=Deltaproteobacteria RepID=A8ZXK4_DESOH Length = 2961 Score = 66.2 bits (160), Expect = 8e-10, Method: Composition-based stats. Identities = 42/98 (42%), Positives = 54/98 (55%), Gaps = 3/98 (3%) Query: 3 ALMDADGNIAWSGEYDEWGNQLNEENPHHLHQPYRLPGQQYDKESGLYYNRNRYYDPLQG 62 A+ D GNI +YD +G +N+ NP P+ G YDK++GL R Y+P G Sbjct: 2729 AVTDGSGNIVKQIDYDSFGFVINDTNPS-FSVPFGFAGGLYDKDTGLVRFGYRDYNPNTG 2787 Query: 63 RYITQDPIGLEGGWS-LYAYPL-NPVNGIDPLGLSPAD 98 R+ +DPIG GG S LY Y L + VN IDP GL D Sbjct: 2788 RWTAKDPIGFAGGSSDLYGYCLGDGVNLIDPDGLDFID 2825 >UniRef50_C0EGC9 Putative uncharacterized protein n=2 Tax=Clostridium methylpentosum DSM 5476 RepID=C0EGC9_9CLOT Length = 598 Score = 65.9 bits (159), Expect = 1e-09, Method: Compositional matrix adjust. Identities = 38/104 (36%), Positives = 58/104 (55%), Gaps = 9/104 (8%) Query: 1 MLALMDADGNIAWSGEYDEWGNQLNEE----NPHHLHQPYRLPGQQYDKESGLYYNRNRY 56 ++ ++D DG+ S YD WG ++ + + PYR G YD E+G YY ++RY Sbjct: 321 VIGILDRDGSQVVSYVYDSWGKLVSTSGSLADTIGVQNPYRYRGYYYDVETGFYYLQSRY 380 Query: 57 YDPLQGRYITQDP-IGLEGGW---SLYAYPLN-PVNGIDPLGLS 95 YDP+ GR+I D IG G +++AY N P+N +DP G + Sbjct: 381 YDPVTGRFINSDSLIGSTGELNTHNMFAYCGNEPINRVDPAGFA 424 >UniRef50_B5S3P6 Probable rhs-related protein (Fragment) n=1 Tax=Ralstonia solanacearum RepID=B5S3P6_RALSO Length = 445 Score = 65.9 bits (159), Expect = 1e-09, Method: Compositional matrix adjust. Identities = 51/171 (29%), Positives = 80/171 (46%), Gaps = 8/171 (4%) Query: 5 MDADGNIAWSGEYDEWGNQLNEENPHHLHQPYRLPGQQYDKESGLYYNRNRYYDPLQGRY 64 + DG EY +G + + + G QY SG+Y R YDP GR+ Sbjct: 103 LTPDGRAVTHTEYGPYGELVKSQGRAEYRSDFGYAGMQYHAASGMYLTLFRAYDPGTGRW 162 Query: 65 ITQDPIGLEGGWSLYAYP-LNPVNGIDP---LGLSPADVALIR-RKDQLNHQR-AWDILS 118 +++DPIG +GG +LYAY NP N +DP L + P + +I +K + + + AW Sbjct: 163 VSRDPIGEDGGENLYAYANGNPENYVDPNGMLAIWPTNSGVILGKKYRCKYGKDAWSNAR 222 Query: 119 DTYEDMKRLNLGGTDQFFHCMAFCRVSKLNDAGVSRSAKGLGYE--KEIRD 167 M+ LN G + + A+ VS + +A +GY KE+R+ Sbjct: 223 SDRNKMRGLNTGLRNAEHYLYAYDSVSSGEYNAGTMTALSIGYSIIKEVRN 273 >UniRef50_C8QVK4 YD repeat protein n=4 Tax=Desulfurivibrio alkaliphilus AHT2 RepID=C8QVK4_9DELT Length = 2439 Score = 65.9 bits (159), Expect = 1e-09, Method: Composition-based stats. Identities = 39/94 (41%), Positives = 53/94 (56%), Gaps = 4/94 (4%) Query: 2 LALMDADGNIAWSGEYDEWGNQLNEENPHHLHQPYRLPGQQYDKESGLYYNRNRYYDPLQ 61 L + + G IA +YDE+GN L + NP QP+ G +D+++ L R YDP Sbjct: 2207 LVVNTSTGEIAQRIDYDEFGNVLQDTNPGF--QPFGFAGGLHDRDTNLTRFGARDYDPQT 2264 Query: 62 GRYITQDPIGLEGG-WSLYAYPLN-PVNGIDPLG 93 GR+ +DPI GG +LY Y LN P+N IDP G Sbjct: 2265 GRWTAKDPILFAGGDTNLYGYVLNDPINWIDPEG 2298 >UniRef50_Q396B9 Rhs family protein n=1 Tax=Burkholderia sp. 383 RepID=Q396B9_BURS3 Length = 1556 Score = 65.9 bits (159), Expect = 1e-09, Method: Compositional matrix adjust. Identities = 29/59 (49%), Positives = 40/59 (67%) Query: 37 RLPGQQYDKESGLYYNRNRYYDPLQGRYITQDPIGLEGGWSLYAYPLNPVNGIDPLGLS 95 R PG + +E+GL+ NR R Y+P+ GRY+ DP+G GG +LY Y NPV +D LGL+ Sbjct: 1241 RWPGHLFQRETGLHVNRFRSYNPMLGRYLQSDPVGHAGGGNLYTYSANPVVEVDVLGLN 1299 >UniRef50_D0KFB3 Putative uncharacterized protein n=1 Tax=Pectobacterium wasabiae WPP163 RepID=D0KFB3_PECWW Length = 283 Score = 65.5 bits (158), Expect = 1e-09, Method: Compositional matrix adjust. Identities = 38/106 (35%), Positives = 52/106 (49%) Query: 3 ALMDADGNIAWSGEYDEWGNQLNEENPHHLHQPYRLPGQQYDKESGLYYNRNRYYDPLQG 62 AL DG + W Q + GQ D ESGL YNR RYYD G Sbjct: 49 ALYKPDGTLRWQAPKSTLWGQRRSAYADNADPGLGFAGQYRDTESGLCYNRFRYYDSNGG 108 Query: 63 RYITQDPIGLEGGWSLYAYPLNPVNGIDPLGLSPADVALIRRKDQL 108 Y++ DPIG+ GG + Y Y NP++ +DPLGL+ ++D++ Sbjct: 109 CYVSPDPIGVAGGDNNYGYVQNPLDWVDPLGLAGCSSKGFNKRDRI 154 >UniRef50_C2M343 Rhs family protein n=2 Tax=Capnocytophaga gingivalis ATCC 33624 RepID=C2M343_CAPGI Length = 336 Score = 65.1 bits (157), Expect = 2e-09, Method: Compositional matrix adjust. Identities = 38/91 (41%), Positives = 50/91 (54%), Gaps = 4/91 (4%) Query: 6 DADGNIAWSGEYDEWGNQLNEENPHHLHQPYRLPGQQYDKESGLYYNRNRYYDPLQGRYI 65 D +G W+ E D +GN + ++ P+ GQ YD+E GL YNR RYY P G YI Sbjct: 126 DENGEKVWARELDLYGNAIAGDSSF---IPFLYQGQYYDEEIGLAYNRFRYYSPESGTYI 182 Query: 66 TQDPIGLEG-GWSLYAYPLNPVNGIDPLGLS 95 +QDPI L G + Y Y + +D LGLS Sbjct: 183 SQDPIRLAGNNPNFYGYTFDSNTEVDVLGLS 213 >UniRef50_Q12LF5 Putative uncharacterized protein n=1 Tax=Shewanella denitrificans OS217 RepID=Q12LF5_SHEDO Length = 232 Score = 65.1 bits (157), Expect = 2e-09, Method: Compositional matrix adjust. Identities = 35/65 (53%), Positives = 43/65 (66%), Gaps = 1/65 (1%) Query: 32 LHQPYRLPGQQYDKESGLYYNRNRYYDPLQGRYITQDPIGLEGGWSLYAYP-LNPVNGID 90 + Q R P Q YD+ESGL+YN R YDP GRYI DPIGL GG + Y Y NP++ +D Sbjct: 6 IKQAIRFPRQYYDEESGLHYNYFRDYDPELGRYIQSDPIGLAGGINTYGYVGGNPISYVD 65 Query: 91 PLGLS 95 P GL+ Sbjct: 66 PYGLN 70 >UniRef50_B3PEK8 RHS Repeat family n=1 Tax=Cellvibrio japonicus Ueda107 RepID=B3PEK8_CELJU Length = 2245 Score = 65.1 bits (157), Expect = 2e-09, Method: Compositional matrix adjust. Identities = 32/60 (53%), Positives = 41/60 (68%), Gaps = 1/60 (1%) Query: 35 PYRLPGQQYDKESGLYYNRNRYYDPLQGRYITQDPIGLEGGWSLYAYPLN-PVNGIDPLG 93 P+R G++ D E+GLYY R RYYDP GR++ DPIG + +LYAY N P+N IDP G Sbjct: 2028 PFRYTGRRLDPETGLYYYRARYYDPGLGRFLQTDPIGYKDQMNLYAYVGNDPLNKIDPTG 2087 >UniRef50_C9PMJ2 Substrate-binding repeat protein n=2 Tax=Pasteurella dagmatis ATCC 43325 RepID=C9PMJ2_9PAST Length = 142 Score = 65.1 bits (157), Expect = 2e-09, Method: Compositional matrix adjust. Identities = 30/61 (49%), Positives = 41/61 (67%) Query: 36 YRLPGQQYDKESGLYYNRNRYYDPLQGRYITQDPIGLEGGWSLYAYPLNPVNGIDPLGLS 95 +R GQ YD+ES L+Y R RYY P G+Y++ D IGL+GG++ Y Y +P +D LGL Sbjct: 25 HRFAGQYYDEESELHYKRFRYYSPETGQYLSHDLIGLQGGFNPYGYVFDPTGWVDVLGLK 84 Query: 96 P 96 P Sbjct: 85 P 85 >UniRef50_B5JS46 NHL repeat containing protein n=2 Tax=gamma proteobacterium HTCC5015 RepID=B5JS46_9GAMM Length = 2515 Score = 65.1 bits (157), Expect = 2e-09, Method: Composition-based stats. Identities = 38/89 (42%), Positives = 49/89 (55%), Gaps = 3/89 (3%) Query: 7 ADGNIAWSGEYDEWGNQLNEENPHHLHQPYRLPGQQYDKESGLYYNRNRYYDPLQGRYIT 66 ADG IA YDE+G + + NP QP+ G YD+ + L R YD GR+ T Sbjct: 2338 ADGAIAQQMSYDEFGQVVEDSNPGF--QPFGFAGGIYDQHTKLTRFGARDYDAETGRWTT 2395 Query: 67 QDPIGLEGGWSLYAYPLN-PVNGIDPLGL 94 +DPI EGG +L+ Y N PVN +D GL Sbjct: 2396 KDPIRFEGGLNLFGYVANDPVNWVDIWGL 2424 >UniRef50_B8FIJ1 YD repeat protein n=1 Tax=Desulfatibacillum alkenivorans AK-01 RepID=B8FIJ1_DESAA Length = 1630 Score = 64.7 bits (156), Expect = 2e-09, Method: Composition-based stats. Identities = 38/105 (36%), Positives = 53/105 (50%), Gaps = 19/105 (18%) Query: 2 LALMDADGNIAWSGEYDEWG------------NQLNEENPHHLHQPYRLPGQQYDKESGL 49 +AL D+ G + + Y +G +Q NE NP + GQ+YD+E+GL Sbjct: 1318 IALTDSTGAVVETYRYTPYGQVSFFDGNGSSISQSNESNP------FLFTGQRYDEETGL 1371 Query: 50 YYNRNRYYDPLQGRYITQDPIGLEGGWSLYAYPL-NPVNGIDPLG 93 YY R RY P GR++ DP G G +LY Y + NP +DP G Sbjct: 1372 YYYRARYLHPELGRFLNPDPKGFVDGMNLYEYAMSNPARYVDPRG 1416 >UniRef50_UPI0001AF39C0 YD repeat-containing protein n=1 Tax=Pseudomonas syringae pv. oryzae str. 1_6 RepID=UPI0001AF39C0 Length = 179 Score = 64.7 bits (156), Expect = 2e-09, Method: Compositional matrix adjust. Identities = 32/58 (55%), Positives = 40/58 (68%) Query: 43 YDKESGLYYNRNRYYDPLQGRYITQDPIGLEGGWSLYAYPLNPVNGIDPLGLSPADVA 100 +D E+GL+YN RYYDP GR+ TQDPIGL GG++LY Y N IDPLG + D + Sbjct: 1 FDVETGLHYNTFRYYDPEIGRFTTQDPIGLLGGFNLYQYAPNTNGWIDPLGWTGLDAS 58 >UniRef50_B7GLY4 Rhs family protein n=3 Tax=Anoxybacillus flavithermus WK1 RepID=B7GLY4_ANOFW Length = 563 Score = 64.7 bits (156), Expect = 3e-09, Method: Compositional matrix adjust. Identities = 40/100 (40%), Positives = 54/100 (54%), Gaps = 9/100 (9%) Query: 1 MLALMDADGNIAWSGEYDEWGNQLNEENPHHLHQPYRLPGQQYDKESGLYYNRNRYYDPL 60 ++AL DA GN+ EYD WG ++ PYR G QYD+E+GLYY RYY P Sbjct: 314 VIALTDAQGNVVARYEYDTWGQIRSQTGVLADENPYRYAGYQYDEETGLYYLMARYYHPT 373 Query: 61 QGRYITQDP-------IGLEGGWSLYAYPLNPVNGIDPLG 93 G +++ DP I + G++ YA NPV +DP G Sbjct: 374 HGVFLSLDPDPGDADDILTQNGYT-YA-NNNPVMLVDPDG 411 >UniRef50_Q2T916 Rhs1 protein n=30 Tax=Burkholderia RepID=Q2T916_BURTA Length = 1150 Score = 64.7 bits (156), Expect = 3e-09, Method: Composition-based stats. Identities = 36/93 (38%), Positives = 48/93 (51%), Gaps = 2/93 (2%) Query: 2 LALMDADGNIAWSGEYDEWGNQLNEENPH-HLHQPYRLPGQQYDKESGLYYNRNRYYDPL 60 +A+ D G +AW+G Y WG L + + + QP R G D E L+ N RYYDP Sbjct: 1009 VAMTDDAGALAWAGRYSAWGRILPPTSLNVQVDQPLRFAGHYADDEVRLHLNGTRYYDPD 1068 Query: 61 QGRYITQDPIGLEGGWSLYAYPLNPVNGIDPLG 93 GRY++ D +E G S Y Y NP +P G Sbjct: 1069 TGRYLSPDRT-VEPGTSPYRYVSNPQTACNPTG 1100 >UniRef50_Q30CR7 LipX3 n=4 Tax=Streptomyces RepID=Q30CR7_STRAU Length = 1560 Score = 64.7 bits (156), Expect = 3e-09, Method: Compositional matrix adjust. Identities = 34/82 (41%), Positives = 41/82 (50%), Gaps = 1/82 (1%) Query: 4 LMDADGNIAWSGEYDEWGNQLNEENPH-HLHQPYRLPGQQYDKESGLYYNRNRYYDPLQG 62 L+ ADG I W + WG P P R PGQ D E+G YN RYYDP Sbjct: 1311 LVSADGRIVWHRRFTAWGQPTGTPTPGPEADCPLRFPGQYADAETGWNYNYFRYYDPETA 1370 Query: 63 RYITQDPIGLEGGWSLYAYPLN 84 RY++ DP+GL + YAY N Sbjct: 1371 RYVSADPLGLAPDPNDYAYVTN 1392 >UniRef50_B9M2L6 YD repeat protein n=6 Tax=Geobacter sp. FRC-32 RepID=B9M2L6_GEOSF Length = 1348 Score = 64.3 bits (155), Expect = 3e-09, Method: Compositional matrix adjust. Identities = 39/92 (42%), Positives = 49/92 (53%), Gaps = 3/92 (3%) Query: 4 LMDADGNIAWSGEYDEWGNQLNEENPHHLHQPYRLPGQQYDKESGLYYNRNRYYDPLQGR 63 + DA W E +G+ N L+ R PGQ D ESGL YN R Y+P GR Sbjct: 1130 MTDAAATKVWEIEARPFGDSANITGTASLN--LRFPGQYADSESGLNYNYFRDYNPGIGR 1187 Query: 64 YITQDPIGLEGGWSLYAYPLN-PVNGIDPLGL 94 YI DP+GL+ G +L+ Y N P+ DPLGL Sbjct: 1188 YIQADPVGLDEGVTLFIYSGNDPILNEDPLGL 1219 >UniRef50_Q2Y6W6 RhsD protein n=1 Tax=Nitrosospira multiformis ATCC 25196 RepID=Q2Y6W6_NITMU Length = 207 Score = 64.3 bits (155), Expect = 3e-09, Method: Compositional matrix adjust. Identities = 42/110 (38%), Positives = 57/110 (51%), Gaps = 8/110 (7%) Query: 17 YDEWGNQLNEENPHHLHQ---PYRLPGQQYDKESGLYYNRNRYYDPLQGRYITQDPIGLE 73 Y +G +NP L R PGQ +D E+GL+YN R YDP GRYI DPIGL Sbjct: 5 YGPFGANPPNQNPSGLGTFSYNLRYPGQYHDAETGLHYNYFRDYDPKTGRYIQSDPIGLA 64 Query: 74 GGWSLYAYPL-NPVNGIDPLGLSPADVALIRRKDQLNHQRAWDILSDTYE 122 GG + Y Y NP++ IDP G ++A + + A+D + Y+ Sbjct: 65 GGINTYVYVEGNPLSKIDPTG----EIAFVPILIGIGVGYAFDYALERYK 110 >UniRef50_UPI0001B52686 Rhs core protein with extension n=1 Tax=Shigella sp. D9 RepID=UPI0001B52686 Length = 205 Score = 64.3 bits (155), Expect = 4e-09, Method: Compositional matrix adjust. Identities = 31/59 (52%), Positives = 37/59 (62%) Query: 40 GQQYDKESGLYYNRNRYYDPLQGRYITQDPIGLEGGWSLYAYPLNPVNGIDPLGLSPAD 98 GQ D ESGL YNR RYY+P G Y+ DP+GL GG Y Y NP +DPLGL+ + Sbjct: 1 GQWQDAESGLCYNRFRYYEPETGMYLVSDPLGLLGGEQTYRYVPNPCGWVDPLGLAASS 59 >UniRef50_B9XHG9 YD repeat protein n=1 Tax=bacterium Ellin514 RepID=B9XHG9_9BACT Length = 1915 Score = 64.3 bits (155), Expect = 4e-09, Method: Compositional matrix adjust. Identities = 35/96 (36%), Positives = 57/96 (59%), Gaps = 1/96 (1%) Query: 1 MLALMDADGNIAWSGEYDEWGNQLNEENPHHLHQPYRLPGQQYDKESGLYYNRNRYYDPL 60 + AL++A I YD +GN L++ P Y+ +++ + SGL +RYYDP Sbjct: 1717 ITALINAQQVIVAKYLYDPFGNILSKSGPLAEANLYQFSSKEFHQNSGLVCYLHRYYDPN 1776 Query: 61 QGRYITQDPIGLEGGWSLYAYPLN-PVNGIDPLGLS 95 R++T+DP+G GG++LY + N P+ G+DP GL+ Sbjct: 1777 LQRWLTRDPLGELGGYNLYQFVGNDPMEGVDPFGLA 1812 >UniRef50_Q4ZU10 YD repeat n=2 Tax=Pseudomonas syringae group RepID=Q4ZU10_PSEU2 Length = 913 Score = 63.9 bits (154), Expect = 4e-09, Method: Compositional matrix adjust. Identities = 32/60 (53%), Positives = 41/60 (68%), Gaps = 1/60 (1%) Query: 37 RLPGQQYDKESGLYYNRNRYYDPLQGRYITQDPIGLEGGWSLYAYP-LNPVNGIDPLGLS 95 R PGQ YD ++ L YN R Y+P GRY+ DPIGL GG + YAY NP++ IDP+GL+ Sbjct: 729 RFPGQIYDAQTQLSYNYYRDYNPDTGRYVQSDPIGLGGGLNTYAYVNANPMSFIDPMGLA 788 >UniRef50_C3JCI6 Rhs family protein n=4 Tax=Bacteria RepID=C3JCI6_9PORP Length = 1387 Score = 63.9 bits (154), Expect = 4e-09, Method: Compositional matrix adjust. Identities = 31/61 (50%), Positives = 40/61 (65%) Query: 35 PYRLPGQQYDKESGLYYNRNRYYDPLQGRYITQDPIGLEGGWSLYAYPLNPVNGIDPLGL 94 P+ GQ +D+E L YNR RYYDP G YI+QDPI + G ++YAY + + IDP GL Sbjct: 1219 PFLFQGQYFDEEIDLCYNRFRYYDPSTGTYISQDPISIAGRLNVYAYVHDSNSWIDPFGL 1278 Query: 95 S 95 S Sbjct: 1279 S 1279 >UniRef50_A3UDD5 Wall associated protein n=1 Tax=Oceanicaulis alexandrii HTCC2633 RepID=A3UDD5_9RHOB Length = 1693 Score = 63.9 bits (154), Expect = 4e-09, Method: Composition-based stats. Identities = 32/61 (52%), Positives = 39/61 (63%), Gaps = 1/61 (1%) Query: 35 PYRLPGQQYDKESGLYYNRNRYYDPLQGRYITQDPIGLEGGWSLYAYPLN-PVNGIDPLG 93 P+R GQ+ D E+GLYY + RYYDP GR++ DPIG +LYAY N PVN D G Sbjct: 1429 PFRFTGQKLDPETGLYYYKARYYDPELGRFLQTDPIGYADQMNLYAYVGNDPVNLRDSSG 1488 Query: 94 L 94 L Sbjct: 1489 L 1489 >UniRef50_Q2Y6N6 RhsD protein n=1 Tax=Nitrosospira multiformis ATCC 25196 RepID=Q2Y6N6_NITMU Length = 187 Score = 63.9 bits (154), Expect = 4e-09, Method: Compositional matrix adjust. Identities = 38/71 (53%), Positives = 44/71 (61%), Gaps = 5/71 (7%) Query: 37 RLPGQQYDKESGLYYNRNRYYDPLQGRYITQDPIGLEGGWSLYAYPL-NPVNGIDPLG-- 93 R PGQ +D E+GLYYN R YDP GRYI DPIGL GG + YAY NPV+ +D G Sbjct: 24 RYPGQYHDAETGLYYNYFRDYDPKTGRYIQSDPIGLVGGINTYAYVEGNPVSKVDSTGEF 83 Query: 94 --LSPADVALI 102 PA A+I Sbjct: 84 AIAIPATPAVI 94 >UniRef50_UPI0001B52689 Rhs core protein with extension n=1 Tax=Shigella sp. D9 RepID=UPI0001B52689 Length = 186 Score = 63.9 bits (154), Expect = 5e-09, Method: Compositional matrix adjust. Identities = 31/60 (51%), Positives = 37/60 (61%) Query: 40 GQQYDKESGLYYNRNRYYDPLQGRYITQDPIGLEGGWSLYAYPLNPVNGIDPLGLSPADV 99 GQ D ESGL YNR RYY+P G Y+ DP+GL GG Y Y NP +DPLGL+ + Sbjct: 1 GQWQDAESGLCYNRFRYYEPESGMYLVSDPLGLLGGEQTYRYVPNPCGYVDPLGLAITSI 60 >UniRef50_Q2Y592 Peptidase C39, bacteriocin processing n=1 Tax=Nitrosospira multiformis ATCC 25196 RepID=Q2Y592_NITMU Length = 1599 Score = 63.5 bits (153), Expect = 5e-09, Method: Compositional matrix adjust. Identities = 34/80 (42%), Positives = 49/80 (61%), Gaps = 3/80 (3%) Query: 16 EYDEWGNQLNEENPHHLHQPYRLPGQQYDKESGLYYNRNRYYDPLQGRYITQDPIGLEGG 75 +YD +GNQ+ + +R G Y ++SGLY R YDP +++++DPIG +GG Sbjct: 1382 DYDPYGNQIAGSG--RISVDFRYAGMFYHQQSGLYLTNFRAYDPKTAKWLSRDPIGEKGG 1439 Query: 76 WSLYAYP-LNPVNGIDPLGL 94 +LY Y NP+N IDPLGL Sbjct: 1440 LNLYGYVGGNPINMIDPLGL 1459 >UniRef50_C1AD40 Putative uncharacterized protein n=2 Tax=Gemmatimonas aurantiaca T-27 RepID=C1AD40_GEMAT Length = 1219 Score = 63.5 bits (153), Expect = 6e-09, Method: Composition-based stats. Identities = 34/74 (45%), Positives = 42/74 (56%), Gaps = 6/74 (8%) Query: 27 ENPHHLHQPYRLPGQQYDKE---SGLYYNRNRYYDPLQGRYITQDPIGLEGGWSLYAYP- 82 E P +L P G D + +G +Y RNRYYDP GR+ +DPIGL GG +LY + Sbjct: 995 ERPKYL--PSAFGGTLLDDKQTATGTHYRRNRYYDPGAGRFTQEDPIGLAGGLNLYGFAG 1052 Query: 83 LNPVNGIDPLGLSP 96 NP N DP GL P Sbjct: 1053 SNPANFSDPFGLCP 1066 >UniRef50_C0ZAE5 Putative uncharacterized protein n=2 Tax=Brevibacillus brevis NBRC 100599 RepID=C0ZAE5_BREBN Length = 1821 Score = 63.5 bits (153), Expect = 6e-09, Method: Compositional matrix adjust. Identities = 36/102 (35%), Positives = 56/102 (54%), Gaps = 13/102 (12%) Query: 1 MLALMDADGNIAWSGEYDEWGNQLNEENPHHLHQPYRLPGQQYDKESGLYYNRNRYYDPL 60 ++ + +GN+ + +YD WGN + ++ + P+ G+ +DKESG YY R RYYDP Sbjct: 1508 VVKIKAPNGNVLNTYDYDIWGNLIADKVKETISNPFMYAGEMFDKESGFYYLRARYYDPK 1567 Query: 61 QGRYITQD--------PIGLEGGWSLYAY-PLNPVNGIDPLG 93 GR+I++D P+ L + Y Y NP+ IDP G Sbjct: 1568 IGRFISEDTYKGQVDNPLTL----NRYTYVSSNPLKYIDPSG 1605 >UniRef50_B1HM94 Cell wall-associated protein n=5 Tax=Lysinibacillus sphaericus C3-41 RepID=B1HM94_LYSSC Length = 995 Score = 63.5 bits (153), Expect = 6e-09, Method: Composition-based stats. Identities = 35/98 (35%), Positives = 56/98 (57%), Gaps = 5/98 (5%) Query: 1 MLALMDADGNIAWSGEYDEWGNQLNEENPHHLHQPYRLPGQQYDKESGLYYNRNRYYDPL 60 +LAL + +G+I YD WGN L++ PYR G +YD+++ LYY RYY+P Sbjct: 781 VLALTNTNGDIVAQYTYDAWGNILSQSGTMAAINPYRYAGYRYDEKTKLYYLMARYYNPD 840 Query: 61 QGRYITQDPIGLEG----GWSLYAYP-LNPVNGIDPLG 93 G ++++DP+ + ++ Y+Y NPV +DP G Sbjct: 841 TGVFLSRDPVRGDTMTPISFNGYSYTNNNPVMNVDPSG 878 >UniRef50_B4UIF2 YD repeat protein n=1 Tax=Anaeromyxobacter sp. K RepID=B4UIF2_ANASK Length = 2350 Score = 63.5 bits (153), Expect = 6e-09, Method: Composition-based stats. Identities = 34/93 (36%), Positives = 48/93 (51%), Gaps = 3/93 (3%) Query: 2 LALMDADGNIAWSGEYDEWGNQLNEENPHHLHQPYRLPGQQYDKESGLYYNRNRYYDPLQ 61 L + + G + +YDEWG L + NP QP+ G YD+++GL R YDP Sbjct: 2134 LVVNASTGAVVQRIDYDEWGQVLADSNPGF--QPFGFAGGLYDRDTGLVRFGARDYDPTV 2191 Query: 62 GRYITQDPIGLEGGWSLYAYPL-NPVNGIDPLG 93 GR+ +D GG + Y Y +PVN +DP G Sbjct: 2192 GRWTAKDRSRFRGGLNFYEYAASDPVNFVDPTG 2224 >UniRef50_A9FQL9 Putative uncharacterized protein n=1 Tax=Sorangium cellulosum 'So ce 56' RepID=A9FQL9_SORC5 Length = 2257 Score = 63.5 bits (153), Expect = 6e-09, Method: Composition-based stats. Identities = 40/101 (39%), Positives = 53/101 (52%), Gaps = 4/101 (3%) Query: 2 LALMDADGNIAWSGEYDEWGNQLNEENPHHLHQPYRLPGQQYDKESGLYYNRNRYYDPLQ 61 L + A G +A +YDE+G L + NP QP+ G YD E+ L R YD Sbjct: 2054 LVVNTATGAVAQRIDYDEFGRVLQDTNPGF--QPFGFAGGLYDAETKLVRFGARDYDAEV 2111 Query: 62 GRYITQDPIGLEGG-WSLYAYPLN-PVNGIDPLGLSPADVA 100 GR+ +DPI +GG +LY Y LN PVN DP G P ++ Sbjct: 2112 GRWTAKDPILFDGGDANLYGYVLNDPVNFTDPNGYGPIELG 2152 >UniRef50_A7BNN3 Putative uncharacterized protein n=1 Tax=Beggiatoa sp. SS RepID=A7BNN3_9GAMM Length = 212 Score = 63.2 bits (152), Expect = 6e-09, Method: Compositional matrix adjust. Identities = 45/138 (32%), Positives = 66/138 (47%), Gaps = 20/138 (14%) Query: 4 LMDADGNIAWSGEYDEWGNQLNEENPHHLHQPYRLPGQQYDKESGLYYNRNRYYDPLQGR 63 L+D+ GN+ W+ Y+ +G + N H R GQ +D E+ L+YN RYY+P GR Sbjct: 29 LIDSQGNVVWAAVYEAFGKARVDVNLVENH--LRFAGQYFDSETRLHYNYYRYYEPTIGR 86 Query: 64 YITQDPIGLEGGWSLYAYPL-NPVNGIDPLGLSPADVALIRRKDQLNHQRAWDILSDTYE 122 Y+ DPI + YAY NP++ +DP GL + D L D ++ Sbjct: 87 YLRVDPI---PSVNQYAYVSGNPLSYVDPFGLEKEIMM--------------DQLFDLFD 129 Query: 123 DMKRLNLGGTDQFFHCMA 140 +M L+L F C A Sbjct: 130 EMAALDLAFDYPKFGCEA 147 >UniRef50_D0LTH3 YD repeat protein n=1 Tax=Haliangium ochraceum DSM 14365 RepID=D0LTH3_HALO1 Length = 2387 Score = 63.2 bits (152), Expect = 7e-09, Method: Composition-based stats. Identities = 39/97 (40%), Positives = 52/97 (53%), Gaps = 4/97 (4%) Query: 2 LALMDADGNIAWSGEYDEWGNQLNEENPHHLHQPYRLPGQQYDKESGLYYNRNRYYDPLQ 61 L + A G +A +YD WG L + NP QP+ G YD ++GL + R YDP Sbjct: 1971 LVVDTATGAVAQRIDYDVWGRVLADSNPGF--QPFGFAGGLYDPDTGLVHFGARDYDPRT 2028 Query: 62 GRYITQDPIGLEGGW-SLYAYP-LNPVNGIDPLGLSP 96 GR++ DP GG+ +LYAY +PVN ID G P Sbjct: 2029 GRFLQTDPRLFGGGYDNLYAYSGFDPVNYIDRTGEVP 2065 >UniRef50_D1W844 RHS repeat-associated core domain protein (Fragment) n=1 Tax=Prevotella buccalis ATCC 35310 RepID=D1W844_9BACT Length = 186 Score = 63.2 bits (152), Expect = 7e-09, Method: Compositional matrix adjust. Identities = 36/94 (38%), Positives = 51/94 (54%), Gaps = 2/94 (2%) Query: 2 LALMDADGNIAWSGEYDEWGNQLNEENPHHLHQPYRLPGQQYDKESGLYYNRNRYYDPLQ 61 + +D+ G + W D +G+ L P+R GQ D+++GLYYNR RYY P Sbjct: 33 IQALDSKGEVVWDCILDIYGDVLELRGKRDF-IPFRFQGQYEDQKTGLYYNRFRYYSPQM 91 Query: 62 GRYITQDPIGLEG-GWSLYAYPLNPVNGIDPLGL 94 G YI+ DPIGL G +LY Y + + +D GL Sbjct: 92 GMYISSDPIGLAGNNPTLYGYVEDVNSYLDLFGL 125 Searching..................................................done Results from round 2 Score E Sequences producing significant alignments: (bits) Value Sequences used in model and found again: UniRef50_P77759 Putative uncharacterized protein ylbH n=12 Tax=E... 353 3e-96 UniRef50_UPI0001B52595 rhsE element core protein RshE n=1 Tax=Es... 224 2e-57 UniRef50_B5MRU6 Rhs-family protein n=9 Tax=Gammaproteobacteria R... 160 3e-38 UniRef50_D0KEV6 RHS protein n=2 Tax=Enterobacteriaceae RepID=D0K... 158 2e-37 UniRef50_Q6LUC4 Putative uncharacterized protein n=1 Tax=Photoba... 157 3e-37 UniRef50_O52663 Core protein (Fragment) n=5 Tax=Enterobacteriace... 157 3e-37 UniRef50_Q7MDR0 Rhs family protein n=3 Tax=Vibrio vulnificus Rep... 157 4e-37 UniRef50_P16919 Protein rhsD n=261 Tax=Bacteria RepID=RHSD_ECOLI 157 4e-37 UniRef50_Q4ZLF3 YD repeat n=5 Tax=Pseudomonas syringae group Rep... 156 5e-37 UniRef50_Q2SFS1 Rhs family protein n=1 Tax=Hahella chejuensis KC... 156 7e-37 UniRef50_A8A655 Rhs family protein n=14 Tax=Enterobacteriaceae R... 155 7e-37 UniRef50_A1TSG3 RHS protein n=1 Tax=Acidovorax citrulli AAC00-1 ... 155 1e-36 UniRef50_Q2SPP2 Rhs family protein n=3 Tax=Hahella chejuensis KC... 155 1e-36 UniRef50_P32109 Putative uncharacterized protein yibJ n=14 Tax=B... 155 1e-36 UniRef50_Q48LL6 Rhs family protein n=1 Tax=Pseudomonas syringae ... 154 3e-36 UniRef50_Q2SFR5 Rhs family protein n=1 Tax=Hahella chejuensis KC... 153 4e-36 UniRef50_UPI0001C341D4 protein RhsA n=1 Tax=Citrobacter youngae ... 153 6e-36 UniRef50_C1M8X5 Core protein n=2 Tax=Citrobacter RepID=C1M8X5_9ENTR 152 7e-36 UniRef50_B5PJT7 Protein RhsD n=2 Tax=Enterobacteriaceae RepID=B5... 152 1e-35 UniRef50_A7FDK7 YD/RHS repeat protein n=25 Tax=Enterobacteriacea... 151 2e-35 UniRef50_Q13ML3 YD repeat protein n=10 Tax=Proteobacteria RepID=... 151 2e-35 UniRef50_B2HXH5 Rhs family protein n=5 Tax=Acinetobacter baumann... 151 2e-35 UniRef50_Q0JZD5 RHS family protein n=9 Tax=Bacteria RepID=Q0JZD5... 150 4e-35 UniRef50_B1J8X0 YD repeat protein n=50 Tax=Gammaproteobacteria R... 150 4e-35 UniRef50_Q328Z1 RhsA protein in rhs element n=7 Tax=Enterobacter... 150 5e-35 UniRef50_A1U3R9 YD repeat protein n=3 Tax=Gammaproteobacteria Re... 149 5e-35 UniRef50_Q6LUC6 Hypothetical nucleotidyltransferase n=1 Tax=Phot... 149 5e-35 UniRef50_Q1LDW7 YD repeat n=2 Tax=Burkholderiaceae RepID=Q1LDW7_... 149 7e-35 UniRef50_B1JCT8 RHS protein n=1 Tax=Pseudomonas putida W619 RepI... 149 8e-35 UniRef50_Q83LZ1 Putative Rhs-family protein n=1 Tax=Shigella fle... 149 9e-35 UniRef50_C5CZG8 RHS protein n=1 Tax=Variovorax paradoxus S110 Re... 148 1e-34 UniRef50_B2VH58 Rhs family protein n=5 Tax=Enterobacteriaceae Re... 148 2e-34 UniRef50_Q2SKM2 Rhs family protein n=3 Tax=Hahella chejuensis KC... 147 2e-34 UniRef50_Q147B5 Rhs family protein n=2 Tax=Betaproteobacteria Re... 147 3e-34 UniRef50_Q87U70 Rhs family protein n=2 Tax=Pseudomonas RepID=Q87... 147 4e-34 UniRef50_UPI0001BC4026 Rhs family protein n=6 Tax=Neisseria RepI... 147 4e-34 UniRef50_P77779 Putative uncharacterized protein ybfO n=67 Tax=E... 147 4e-34 UniRef50_B0VRR7 Putative uncharacterized protein n=1 Tax=Acineto... 147 4e-34 UniRef50_C8QGI4 YD repeat protein n=1 Tax=Pantoea sp. At-9b RepI... 146 4e-34 UniRef50_A6GLW0 Rhs family protein n=1 Tax=Limnobacter sp. MED10... 146 5e-34 UniRef50_C3JXH8 Putative Rhs protein n=1 Tax=Pseudomonas fluores... 146 6e-34 UniRef50_UPI00019F17FA RhsC core protein with extension n=1 Tax=... 146 6e-34 UniRef50_Q31U53 Putative uncharacterized protein n=1 Tax=Shigell... 146 7e-34 UniRef50_A7FDJ9 RHS/YD repeat protein n=30 Tax=Enterobacteriacea... 145 9e-34 UniRef50_D0KES6 YD repeat protein n=15 Tax=Gammaproteobacteria R... 145 1e-33 UniRef50_B5S3P6 Probable rhs-related protein (Fragment) n=1 Tax=... 144 2e-33 UniRef50_Q7NY44 Probable Rhs-family protein n=2 Tax=Chromobacter... 144 2e-33 UniRef50_C0Q6Q9 Rhs-family protein n=27 Tax=Enterobacteriaceae R... 143 5e-33 UniRef50_UPI0001B53B37 YD repeat protein n=1 Tax=Streptomyces sp... 143 5e-33 UniRef50_Q39K64 Rhs family protein n=22 Tax=Burkholderia RepID=Q... 142 7e-33 UniRef50_A7FN18 RHS/YD repeat protein n=5 Tax=cellular organisms... 142 9e-33 UniRef50_UPI0001C34A7C Rhs family protein n=1 Tax=Neisseria subf... 142 1e-32 UniRef50_Q12LF3 YD repeat n=2 Tax=Shewanella denitrificans OS217... 142 1e-32 UniRef50_D0KES8 RHS protein n=3 Tax=Pectobacterium wasabiae WPP1... 141 1e-32 UniRef50_UPI0001B52C8C protein, rhs-like protein n=4 Tax=Enterob... 141 2e-32 UniRef50_Q2SFR1 Rhs family protein n=9 Tax=cellular organisms Re... 140 2e-32 UniRef50_B7LTT0 Putative uncharacterized protein n=4 Tax=Enterob... 140 3e-32 UniRef50_B2K1J9 YD repeat protein n=23 Tax=Yersinia RepID=B2K1J9... 140 4e-32 UniRef50_B7H4M5 Uncharacterized protein ybfO n=3 Tax=Acinetobact... 140 4e-32 UniRef50_D2TGW0 Putative Rhs protein n=2 Tax=Citrobacter RepID=D... 140 4e-32 UniRef50_Q2SGE8 Rhs family protein n=1 Tax=Hahella chejuensis KC... 140 5e-32 UniRef50_C5AIF2 Rhs family protein n=1 Tax=Burkholderia glumae B... 140 5e-32 UniRef50_Q4K3M9 Rhs family protein n=5 Tax=Pseudomonas RepID=Q4K... 140 5e-32 UniRef50_A1TV35 YD repeat protein n=2 Tax=Acidovorax RepID=A1TV3... 139 6e-32 UniRef50_A7K3Q8 Rhs family protein n=7 Tax=Vibrio RepID=A7K3Q8_V... 139 6e-32 UniRef50_UPI00019F181C rhsC element core protein RshC n=2 Tax=En... 139 6e-32 UniRef50_B3X3P2 RhsH n=1 Tax=Shigella dysenteriae 1012 RepID=B3X... 139 6e-32 UniRef50_B2PW78 Putative uncharacterized protein n=1 Tax=Provide... 139 8e-32 UniRef50_C9Y459 Putative uncharacterized protein n=1 Tax=Cronoba... 139 9e-32 UniRef50_A1TTQ1 Rhs family protein n=1 Tax=Acidovorax citrulli A... 139 1e-31 UniRef50_A1TSU8 YD repeat protein n=4 Tax=Acidovorax RepID=A1TSU... 139 1e-31 UniRef50_A9AE26 Rhs family protein n=10 Tax=cellular organisms R... 138 2e-31 UniRef50_C5CXA7 YD repeat protein n=1 Tax=Variovorax paradoxus S... 137 3e-31 UniRef50_D1YPL0 RHS repeat-associated core domain protein n=1 Ta... 137 3e-31 UniRef50_Q399U9 Rhs family protein n=3 Tax=Proteobacteria RepID=... 137 3e-31 UniRef50_C9NF93 YD repeat protein n=1 Tax=Streptomyces flavogris... 137 4e-31 UniRef50_B4SV70 Rhs-family protein n=42 Tax=Enterobacteriaceae R... 137 4e-31 UniRef50_D0KG38 Rhs family protein-like protein n=1 Tax=Pectobac... 137 4e-31 UniRef50_C5ALM7 YD repeat protein n=19 Tax=Proteobacteria RepID=... 136 6e-31 UniRef50_C1M4X0 Predicted protein n=1 Tax=Citrobacter sp. 30_2 R... 136 6e-31 UniRef50_B2PVY9 Putative uncharacterized protein n=12 Tax=Entero... 135 8e-31 UniRef50_A1TTP6 YD repeat protein n=2 Tax=Acidovorax citrulli AA... 135 8e-31 UniRef50_C6M9F5 RHS family protein n=2 Tax=Bacteria RepID=C6M9F5... 135 9e-31 UniRef50_C9Y441 Putative uncharacterized protein n=1 Tax=Cronoba... 135 1e-30 UniRef50_C9Y462 Putative uncharacterized protein n=1 Tax=Cronoba... 135 1e-30 UniRef50_C7MXD3 Rhs family protein n=1 Tax=Saccharomonospora vir... 135 1e-30 UniRef50_C0EPY5 Putative uncharacterized protein n=1 Tax=Neisser... 135 1e-30 UniRef50_A0LJM9 YD repeat protein n=3 Tax=Syntrophobacter fumaro... 135 2e-30 UniRef50_D1SWH5 RHS protein (Fragment) n=1 Tax=Acidovorax avenae... 135 2e-30 UniRef50_UPI00016A9A82 Rhs family protein n=2 Tax=Burkholderia o... 134 2e-30 UniRef50_C6M5B1 Rhs-related protein n=7 Tax=Proteobacteria RepID... 134 2e-30 UniRef50_A1AK54 YD repeat protein n=1 Tax=Pelobacter propionicus... 134 3e-30 UniRef50_Q3YV37 Putative uncharacterized protein n=1 Tax=Shigell... 134 3e-30 UniRef50_B9B9U8 YD repeat protein n=1 Tax=Burkholderia multivora... 133 4e-30 UniRef50_A1TJG7 YD repeat protein n=4 Tax=Proteobacteria RepID=A... 133 5e-30 UniRef50_Q1K295 YD repeat n=4 Tax=Desulfuromonas acetoxidans DSM... 133 6e-30 UniRef50_C6AKX3 Rhs family protein n=4 Tax=Aggregatibacter aphro... 132 7e-30 UniRef50_C8Q7Z5 YD repeat protein n=8 Tax=Enterobacteriaceae Rep... 132 8e-30 UniRef50_UPI00017448E8 YD repeat protein n=2 Tax=Verrucomicrobiu... 132 9e-30 UniRef50_Q9L0E3 Putative Rhs protein n=2 Tax=Streptomyces RepID=... 132 1e-29 UniRef50_B4V6T6 Rhs protein n=2 Tax=Streptomyces RepID=B4V6T6_9ACTO 132 1e-29 UniRef50_UPI0001AF2680 RHS/YD repeat-containing protein n=1 Tax=... 131 1e-29 UniRef50_A9C0N8 YD repeat protein n=5 Tax=cellular organisms Rep... 131 2e-29 UniRef50_C6M9F4 Rhs family protein n=8 Tax=Neisseria RepID=C6M9F... 131 2e-29 UniRef50_Q1I7Q5 Putative uncharacterized protein n=11 Tax=Pseudo... 131 2e-29 UniRef50_D1YPP7 RHS repeat-associated core domain protein n=1 Ta... 131 2e-29 UniRef50_UPI0001B4DD67 Rhs protein n=1 Tax=Streptomyces hygrosco... 131 2e-29 UniRef50_C6M9F8 RhsG core protein with extension n=1 Tax=Neisser... 131 2e-29 UniRef50_A1TR18 YD repeat protein n=8 Tax=Acidovorax RepID=A1TR1... 131 2e-29 UniRef50_A7BNN3 Putative uncharacterized protein n=1 Tax=Beggiat... 131 2e-29 UniRef50_B8FBZ2 YD repeat protein n=1 Tax=Desulfatibacillum alke... 130 3e-29 UniRef50_Q6D1M4 Rhs protein n=5 Tax=Enterobacteriaceae RepID=Q6D... 130 3e-29 UniRef50_Q2SPP3 Rhs family protein n=1 Tax=Hahella chejuensis KC... 130 3e-29 UniRef50_B5I7B2 Rhs repeat protein n=1 Tax=Streptomyces sviceus ... 130 3e-29 UniRef50_UPI000196E06E Rhs family protein n=2 Tax=Neisseria muco... 130 3e-29 UniRef50_A1TQS0 YD repeat protein n=2 Tax=Acidovorax RepID=A1TQS... 130 3e-29 UniRef50_B5H9Z7 Rhs protein n=2 Tax=Streptomyces RepID=B5H9Z7_STRPR 130 4e-29 UniRef50_D1T3Q3 YD repeat protein (Fragment) n=2 Tax=Acidovorax ... 130 5e-29 UniRef50_C6WJA6 YD repeat protein n=3 Tax=Actinosynnema mirum DS... 129 6e-29 UniRef50_A3NNM1 Protein RhsD n=20 Tax=pseudomallei group RepID=A... 129 6e-29 UniRef50_B4ETQ2 Rhs-family protein n=9 Tax=Enterobacteriaceae Re... 129 7e-29 UniRef50_Q88FK6 RHS family protein, putative n=1 Tax=Pseudomonas... 129 8e-29 UniRef50_B1KGR6 YD repeat protein n=1 Tax=Shewanella woodyi ATCC... 129 1e-28 UniRef50_UPI000190F33A Rhs-family protein n=2 Tax=Salmonella ent... 129 1e-28 UniRef50_B0SXS0 YD repeat protein n=1 Tax=Caulobacter sp. K31 Re... 129 1e-28 UniRef50_D1T3N4 YD repeat protein n=2 Tax=Betaproteobacteria Rep... 128 1e-28 UniRef50_UPI0001B56FBA YD repeat-containing protein n=1 Tax=Stre... 128 1e-28 UniRef50_D0KWY8 YD repeat protein n=1 Tax=Halothiobacillus neapo... 128 2e-28 UniRef50_C0EPY1 Putative uncharacterized protein n=7 Tax=Neisser... 128 2e-28 UniRef50_D1VCV7 YD repeat protein n=1 Tax=Frankia sp. EuI1c RepI... 127 3e-28 UniRef50_B7GLX9 Rhs family protein n=1 Tax=Anoxybacillus flavith... 127 4e-28 UniRef50_B4ETQ7 Putative Rhs-family protein n=3 Tax=Enterobacter... 126 6e-28 UniRef50_D1T3N5 YD repeat protein n=1 Tax=Acidovorax avenae subs... 126 6e-28 UniRef50_C5AA19 Rhs family protein n=1 Tax=Burkholderia glumae B... 126 6e-28 UniRef50_D1SVF0 Rhs family protein (Fragment) n=1 Tax=Acidovorax... 126 7e-28 UniRef50_A3KUM5 Rhs family protein n=10 Tax=Pseudomonas aerugino... 126 7e-28 UniRef50_Q7N2G0 Complete genome; segment 11/17 n=4 Tax=Gammaprot... 126 7e-28 UniRef50_D1PUV1 RHS family protein n=1 Tax=Prevotella bergensis ... 125 9e-28 UniRef50_C7Q0A7 YD repeat protein n=2 Tax=Catenulispora acidiphi... 125 9e-28 UniRef50_Q8GDM7 Rhs n=3 Tax=Photorhabdus RepID=Q8GDM7_PHOLU 125 1e-27 UniRef50_B4EGW8 RHS-family protein n=9 Tax=Burkholderiaceae RepI... 125 1e-27 UniRef50_A1WE65 Rhs family protein-like protein n=1 Tax=Verminep... 125 1e-27 UniRef50_B3JL94 Putative uncharacterized protein n=1 Tax=Bactero... 125 2e-27 UniRef50_C8W2T8 YD repeat protein n=2 Tax=Desulfotomaculum aceto... 124 2e-27 UniRef50_D1JME6 Rhs family protein n=22 Tax=Bacteroides RepID=D1... 124 2e-27 UniRef50_B3PEK8 RHS Repeat family n=1 Tax=Cellvibrio japonicus U... 124 2e-27 UniRef50_B2HAQ4 RhsD protein n=4 Tax=Burkholderia RepID=B2HAQ4_B... 124 3e-27 UniRef50_C6CPH2 YD repeat protein n=11 Tax=Enterobacteriaceae Re... 124 3e-27 UniRef50_B4VFT3 Rhs protein n=4 Tax=Bacteria RepID=B4VFT3_9ACTO 124 3e-27 UniRef50_A4SKJ3 Rhs family protein n=2 Tax=Bacteria RepID=A4SKJ3... 124 3e-27 UniRef50_A9EVR3 Conserved carbohydrate-binding protein, Rhs fami... 124 3e-27 UniRef50_D1S833 YD repeat protein n=1 Tax=Micromonospora auranti... 124 3e-27 UniRef50_B3PEN8 Rhsfamily protein n=2 Tax=Cellvibrio japonicus U... 123 4e-27 UniRef50_C4NV50 Rhs repeat family protein n=10 Tax=Gammaproteoba... 123 6e-27 UniRef50_Q12SZ6 Putative uncharacterized protein n=1 Tax=Shewane... 123 6e-27 UniRef50_D1AA66 YD repeat-containing protein n=1 Tax=Thermomonos... 123 6e-27 UniRef50_A5GE16 YD repeat protein n=1 Tax=Geobacter uraniireduce... 122 7e-27 UniRef50_B2PYS3 Putative uncharacterized protein n=1 Tax=Provide... 122 7e-27 UniRef50_C7Q0B8 YD repeat protein n=1 Tax=Catenulispora acidiphi... 122 8e-27 UniRef50_A4FJ21 YD repeat protein n=1 Tax=Saccharopolyspora eryt... 122 8e-27 UniRef50_B4V251 LipX3 n=2 Tax=Streptomyces RepID=B4V251_9ACTO 121 2e-26 UniRef50_UPI00016B0868 Rhs family protein n=1 Tax=Burkholderia p... 121 2e-26 UniRef50_B8FIJ1 YD repeat protein n=1 Tax=Desulfatibacillum alke... 121 2e-26 UniRef50_B7GX39 Protein rhsD n=4 Tax=Acinetobacter baumannii Rep... 121 2e-26 UniRef50_A9AKU8 Type VI secretion system Vgr family protein n=10... 120 3e-26 UniRef50_B7GLY4 Rhs family protein n=3 Tax=Anoxybacillus flavith... 120 3e-26 UniRef50_D2UF28 Putative rhs family protein n=2 Tax=Xanthomonas ... 120 3e-26 UniRef50_B5JS46 NHL repeat containing protein n=2 Tax=gamma prot... 120 3e-26 UniRef50_D1W448 RHS repeat-associated core domain protein n=1 Ta... 120 3e-26 UniRef50_C6CI62 YD repeat protein n=7 Tax=Enterobacteriaceae Rep... 120 3e-26 UniRef50_B3PHE6 RHS Repeat family n=5 Tax=cellular organisms Rep... 120 3e-26 UniRef50_Q2SIG5 Rhs family protein n=1 Tax=Hahella chejuensis KC... 120 4e-26 UniRef50_C9RZN2 YD repeat protein n=2 Tax=Geobacillus RepID=C9RZ... 120 5e-26 UniRef50_C6CNW6 YD repeat protein n=8 Tax=Enterobacteriaceae Rep... 120 5e-26 UniRef50_A8ZSG6 YD repeat protein n=1 Tax=Desulfococcus oleovora... 119 6e-26 UniRef50_B6VUY0 Putative uncharacterized protein n=2 Tax=Bactero... 119 6e-26 UniRef50_A9EW02 Conserved exported carbohydrate-binding protein,... 119 6e-26 UniRef50_A9GIJ6 Conserved carbohydrate-binding protein, Rhs fami... 119 7e-26 UniRef50_C0FSB7 Putative uncharacterized protein n=1 Tax=Rosebur... 119 7e-26 UniRef50_D1WVZ1 YD repeat protein n=1 Tax=Streptomyces sp. ACT-1... 119 8e-26 UniRef50_Q07833 Wall-associated protein n=18 Tax=Bacillaceae Rep... 119 8e-26 UniRef50_Q395C2 Rhs family protein n=1 Tax=Burkholderia sp. 383 ... 119 8e-26 UniRef50_A3UDD5 Wall associated protein n=1 Tax=Oceanicaulis ale... 119 8e-26 UniRef50_C3JCI6 Rhs family protein n=4 Tax=Bacteria RepID=C3JCI6... 119 8e-26 UniRef50_C7QAC6 YD repeat protein n=2 Tax=Bacteria RepID=C7QAC6_... 119 9e-26 UniRef50_D0HCV2 Rhs protein n=3 Tax=Vibrio mimicus VM223 RepID=D... 119 1e-25 UniRef50_C4KA75 YD repeat protein n=1 Tax=Thauera sp. MZ1T RepID... 119 1e-25 UniRef50_D0KZB0 YD repeat protein n=1 Tax=Halothiobacillus neapo... 119 1e-25 UniRef50_B3E8D4 YD repeat protein n=1 Tax=Geobacter lovleyi SZ R... 119 1e-25 UniRef50_A4B8X0 YD repeat n=1 Tax=Reinekea blandensis MED297 Rep... 119 1e-25 UniRef50_B8FCM0 YD repeat protein n=1 Tax=Desulfatibacillum alke... 118 1e-25 UniRef50_C2LFQ4 Putative uncharacterized protein n=1 Tax=Proteus... 118 2e-25 UniRef50_B1HM94 Cell wall-associated protein n=5 Tax=Lysinibacil... 118 2e-25 UniRef50_A8ZTS1 YD repeat protein n=4 Tax=Desulfococcus oleovora... 118 2e-25 UniRef50_Q0K1I3 Insecticidal toxin complex protein n=2 Tax=Prote... 118 2e-25 UniRef50_C0FSB4 Putative uncharacterized protein n=1 Tax=Rosebur... 117 2e-25 UniRef50_D0BWK2 YD repeat protein n=1 Tax=Acinetobacter sp. RUH2... 117 2e-25 UniRef50_C7QG23 YD repeat protein n=2 Tax=Catenulispora acidiphi... 117 3e-25 UniRef50_B2Q762 Putative uncharacterized protein n=2 Tax=Provide... 117 3e-25 UniRef50_C8W2V9 YD repeat protein n=3 Tax=Desulfotomaculum aceto... 117 3e-25 UniRef50_D2KTW4 Putative uncharacterized protein n=2 Tax=Strepto... 117 4e-25 UniRef50_C8W2A7 YD repeat protein n=3 Tax=Desulfotomaculum aceto... 117 4e-25 UniRef50_Q2Y6W6 RhsD protein n=1 Tax=Nitrosospira multiformis AT... 117 4e-25 UniRef50_A6GBQ3 Putative uncharacterized protein n=1 Tax=Plesioc... 116 7e-25 UniRef50_A3DF74 YD repeat protein n=17 Tax=Clostridium thermocel... 116 7e-25 UniRef50_Q3JSF4 RhsD protein n=23 Tax=Burkholderia pseudomallei ... 116 7e-25 UniRef50_A9C2K3 YD repeat protein n=1 Tax=Delftia acidovorans SP... 115 8e-25 UniRef50_UPI0001B58169 YD repeat-containing protein n=1 Tax=Stre... 115 8e-25 UniRef50_D1YPL4 RHS repeat-associated core domain protein n=1 Ta... 115 8e-25 UniRef50_Q2Y592 Peptidase C39, bacteriocin processing n=1 Tax=Ni... 115 9e-25 UniRef50_B2PW71 Putative uncharacterized protein n=1 Tax=Provide... 115 9e-25 UniRef50_C1B7W9 Putative uncharacterized protein n=1 Tax=Rhodoco... 115 9e-25 UniRef50_C6EE82 Rhs family protein-like protein n=2 Tax=Escheric... 115 1e-24 UniRef50_C8QVK4 YD repeat protein n=4 Tax=Desulfurivibrio alkali... 115 1e-24 UniRef50_Q16U81 Putative uncharacterized protein n=1 Tax=Aedes a... 115 1e-24 UniRef50_A0LQM7 NHL repeat containing protein n=1 Tax=Syntrophob... 115 1e-24 UniRef50_C7HJS3 YD repeat protein (Fragment) n=3 Tax=Clostridium... 115 2e-24 UniRef50_A9FBU5 Conserved carbohydrate-binding protein, Rhs fami... 114 2e-24 UniRef50_Q4UTI2 Putative uncharacterized protein n=2 Tax=Xanthom... 114 3e-24 UniRef50_A9GD22 Conserved carbohydrate-binding protein, Rhs fami... 114 3e-24 UniRef50_Q2T5B9 YD repeat protein n=39 Tax=Proteobacteria RepID=... 114 3e-24 UniRef50_Q73BZ0 Cell wall-associated protein n=4 Tax=Bacillus ce... 114 3e-24 UniRef50_C7MZM5 Rhs family protein n=1 Tax=Saccharomonospora vir... 113 3e-24 UniRef50_C0EFE6 Putative uncharacterized protein n=1 Tax=Clostri... 113 3e-24 UniRef50_A7BPL6 Protein containing RHS repeats n=2 Tax=Beggiatoa... 113 4e-24 UniRef50_C5ID01 RhsK n=50 Tax=cellular organisms RepID=C5ID01_ECOLX 113 4e-24 UniRef50_A8ZXK4 YD repeat protein n=3 Tax=Deltaproteobacteria Re... 113 4e-24 UniRef50_A9FQL9 Putative uncharacterized protein n=1 Tax=Sorangi... 113 4e-24 UniRef50_A9FZA9 Rhs family protein n=1 Tax=Sorangium cellulosum ... 113 6e-24 UniRef50_B9XHG9 YD repeat protein n=1 Tax=bacterium Ellin514 Rep... 113 6e-24 UniRef50_B2AU51 Predicted CDS Pa_1_17960 n=1 Tax=Podospora anser... 113 6e-24 UniRef50_A6EH31 Rhs family protein n=2 Tax=cellular organisms Re... 113 7e-24 UniRef50_C5SNJ4 YD repeat protein n=1 Tax=Asticcacaulis excentri... 112 8e-24 UniRef50_A6DLJ6 Putative uncharacterized protein n=1 Tax=Lentisp... 112 9e-24 UniRef50_C0ZAE5 Putative uncharacterized protein n=2 Tax=Breviba... 112 2e-23 UniRef50_D0KFB3 Putative uncharacterized protein n=1 Tax=Pectoba... 112 2e-23 UniRef50_A8ZVU3 YD repeat protein n=1 Tax=Desulfococcus oleovora... 111 2e-23 UniRef50_Q4VQB1 SGS1 n=4 Tax=Aedes/Ochlerotatus group RepID=Q4VQ... 111 2e-23 UniRef50_D0LTH3 YD repeat protein n=1 Tax=Haliangium ochraceum D... 111 2e-23 UniRef50_A9C2K7 Rhs family protein-like protein n=1 Tax=Delftia ... 111 2e-23 UniRef50_B2UQS6 Putative uncharacterized protein n=1 Tax=Akkerma... 111 2e-23 UniRef50_B4UIF2 YD repeat protein n=1 Tax=Anaeromyxobacter sp. K... 110 3e-23 UniRef50_C7IND8 YD repeat protein (Fragment) n=1 Tax=Clostridium... 110 3e-23 UniRef50_C0EGC9 Putative uncharacterized protein n=2 Tax=Clostri... 110 3e-23 UniRef50_Q2T916 Rhs1 protein n=30 Tax=Burkholderia RepID=Q2T916_... 110 3e-23 UniRef50_B1HKQ3 Rhs family protein n=1 Tax=Burkholderia pseudoma... 110 3e-23 UniRef50_C4DNE3 Putative uncharacterized protein n=1 Tax=Stackeb... 110 4e-23 UniRef50_C7PJF4 YD repeat protein n=2 Tax=Chitinophaga pinensis ... 110 4e-23 UniRef50_C3IG21 Wall associated protein n=2 Tax=Bacillus cereus ... 110 4e-23 Sequences not found previously or not previously below threshold: UniRef50_D1PV96 YD repeat protein n=1 Tax=Prevotella bergensis D... 114 2e-24 >UniRef50_P77759 Putative uncharacterized protein ylbH n=12 Tax=Escherichia coli RepID=YLBH_ECOLI Length = 236 Score = 353 bits (906), Expect = 3e-96, Method: Composition-based stats. Identities = 236/236 (100%), Positives = 236/236 (100%) Query: 1 MLALMDADGNIAWSGEYDEWGNQLNEENPHHLHQPYRLPGQQYDKESGLYYNRNRYYDPL 60 MLALMDADGNIAWSGEYDEWGNQLNEENPHHLHQPYRLPGQQYDKESGLYYNRNRYYDPL Sbjct: 1 MLALMDADGNIAWSGEYDEWGNQLNEENPHHLHQPYRLPGQQYDKESGLYYNRNRYYDPL 60 Query: 61 QGRYITQDPIGLEGGWSLYAYPLNPVNGIDPLGLSPADVALIRRKDQLNHQRAWDILSDT 120 QGRYITQDPIGLEGGWSLYAYPLNPVNGIDPLGLSPADVALIRRKDQLNHQRAWDILSDT Sbjct: 61 QGRYITQDPIGLEGGWSLYAYPLNPVNGIDPLGLSPADVALIRRKDQLNHQRAWDILSDT 120 Query: 121 YEDMKRLNLGGTDQFFHCMAFCRVSKLNDAGVSRSAKGLGYEKEIRDYGLNLFGMYGRKV 180 YEDMKRLNLGGTDQFFHCMAFCRVSKLNDAGVSRSAKGLGYEKEIRDYGLNLFGMYGRKV Sbjct: 121 YEDMKRLNLGGTDQFFHCMAFCRVSKLNDAGVSRSAKGLGYEKEIRDYGLNLFGMYGRKV 180 Query: 181 KLSHSEMIEDNKKDLAVNDHGLTCPSTTDCSDRCSDYINPEHKKTIKALQDAGYLK 236 KLSHSEMIEDNKKDLAVNDHGLTCPSTTDCSDRCSDYINPEHKKTIKALQDAGYLK Sbjct: 181 KLSHSEMIEDNKKDLAVNDHGLTCPSTTDCSDRCSDYINPEHKKTIKALQDAGYLK 236 >UniRef50_UPI0001B52595 rhsE element core protein RshE n=1 Tax=Escherichia sp. 4_1_40B RepID=UPI0001B52595 Length = 273 Score = 224 bits (572), Expect = 2e-57, Method: Composition-based stats. Identities = 97/231 (41%), Positives = 124/231 (53%), Gaps = 30/231 (12%) Query: 2 LALMDADGNIAWSGEYDEWGNQLNEENPHHLHQPYRLPGQQYDKESGLYYNRNRYYDPLQ 61 LAL+ DGN AW GEYDEWGNQLNEENPHHLHQPYRLPGQQ+D+ESGLYYNR+R+YDPLQ Sbjct: 32 LALISEDGNTAWRGEYDEWGNQLNEENPHHLHQPYRLPGQQHDEESGLYYNRHRHYDPLQ 91 Query: 62 GRYITQDPIGLEGGWSLYAYPLNPVNGIDPLGLSPAD----------VALIRRKDQLNH- 110 GRYIT DPIGL GGW++Y YPLNP+ IDP+GL + V+ + N Sbjct: 92 GRYITPDPIGLRGGWNMYQYPLNPIQVIDPMGLDAIENMTSGGLIYAVSGVPGLIAANSI 151 Query: 111 -QRAWDILSDTYEDMKRLNLGGTDQFFHCMAFCRVSKLNDAGVSRSAKGLGYEKEIRDYG 169 A+ D + + G D HC CR++K + + Sbjct: 152 TNSAYQFGYDMDAIVGGAHNGAADAMRHCYLMCRMTKTFGSTI----------------- 194 Query: 170 LNLFGMYGRKVKLSHSEMIEDNKKDLAVNDHGLTCPS-TTDCSDRCSDYIN 219 ++ G + ++ DL N G+ C + CSD C + N Sbjct: 195 ADVIGKNHEAAGDRQGQPAKERIMDLKNNTVGIACGDFSAKCSDACIEKYN 245 >UniRef50_B5MRU6 Rhs-family protein n=9 Tax=Gammaproteobacteria RepID=B5MRU6_SALET Length = 216 Score = 160 bits (406), Expect = 3e-38, Method: Composition-based stats. Identities = 53/153 (34%), Positives = 75/153 (49%), Gaps = 3/153 (1%) Query: 4 LMDADGNIAWSGEYDEWGNQLNEENPHHLHQPYRLPGQQYDKESGLYYNRNRYYDPLQGR 63 + D GNI W Y WGN +E+ + Q R GQ D+E+GL+YN R+YDP G+ Sbjct: 1 MTDGGGNIVWEAGYQVWGNLTHEKETRPVQQNLRFQGQYLDRETGLHYNLYRFYDPDIGK 60 Query: 64 YITQDPIGLEGGWSLYAYPLNPVNGIDPLGLSPADVALIRRKDQLNHQRAWD---ILSDT 120 +I+ DPI + GG +LY Y NP+ IDPLGL + K + H+ D Sbjct: 61 FISGDPISIRGGINLYQYAPNPIKWIDPLGLYNGEGQRELGKYHVFHEHNLDITEYGLSD 120 Query: 121 YEDMKRLNLGGTDQFFHCMAFCRVSKLNDAGVS 153 E R N +++ + AF R + GV Sbjct: 121 AEHFSRGNQAISERMKNDPAFRREMQTKYPGVV 153 >UniRef50_D0KEV6 RHS protein n=2 Tax=Enterobacteriaceae RepID=D0KEV6_PECWW Length = 1348 Score = 158 bits (399), Expect = 2e-37, Method: Composition-based stats. Identities = 49/99 (49%), Positives = 63/99 (63%), Gaps = 5/99 (5%) Query: 4 LMDADGNIAWSGEYDEWGNQLNEE-----NPHHLHQPYRLPGQQYDKESGLYYNRNRYYD 58 + DG + W EY WGN + E + +HQP R GQ +D E+GL+YNR RYYD Sbjct: 1112 MTGQDGGLVWRAEYRVWGNTVRVEQVEVPHSEPIHQPLRYQGQYFDAETGLHYNRFRYYD 1171 Query: 59 PLQGRYITQDPIGLEGGWSLYAYPLNPVNGIDPLGLSPA 97 P GR+++QDPIGL GG +LY Y NP+ IDPLGL+P Sbjct: 1172 PDAGRFVSQDPIGLAGGINLYQYAPNPITWIDPLGLTPC 1210 >UniRef50_Q6LUC4 Putative uncharacterized protein n=1 Tax=Photobacterium profundum RepID=Q6LUC4_PHOPR Length = 532 Score = 157 bits (397), Expect = 3e-37, Method: Composition-based stats. Identities = 48/129 (37%), Positives = 72/129 (55%), Gaps = 3/129 (2%) Query: 4 LMDADGNIAWSGEYDEWGNQLNEENPHHLHQPYRLPGQQYDKESGLYYNRNRYYDPLQGR 63 + ++DG + W Y+ G + +H P R GQ +D+ESGL+YNR RYYDP G+ Sbjct: 273 VTNSDGEVVWQATYNALGCAFISID--IIHNPLRFQGQYHDQESGLHYNRFRYYDPSIGQ 330 Query: 64 YITQDPIGLEGGWSLYAYPLNPVNGIDPLGLSPADVALIRRKDQLNHQRAWDILSDTYED 123 +I QDPIGL GG + Y Y NP+ +DPLGLS + +I K + W+ + + Sbjct: 331 FIHQDPIGLLGGINHYRYAPNPIQWVDPLGLSCKE-GIIELKKSYSSLNWWEKIKRGLDI 389 Query: 124 MKRLNLGGT 132 +++GG Sbjct: 390 FDSVDVGGG 398 >UniRef50_O52663 Core protein (Fragment) n=5 Tax=Enterobacteriaceae RepID=O52663_ECOLX Length = 350 Score = 157 bits (397), Expect = 3e-37, Method: Composition-based stats. Identities = 77/93 (82%), Positives = 85/93 (91%) Query: 2 LALMDADGNIAWSGEYDEWGNQLNEENPHHLHQPYRLPGQQYDKESGLYYNRNRYYDPLQ 61 LAL+ DGN AW GEYDEWGNQLNEENP++LHQPYRLPGQQ+D+ESGLYYNRNRYYDPLQ Sbjct: 114 LALISEDGNTAWRGEYDEWGNQLNEENPYYLHQPYRLPGQQHDEESGLYYNRNRYYDPLQ 173 Query: 62 GRYITQDPIGLEGGWSLYAYPLNPVNGIDPLGL 94 GRYITQDPIGL GGW+LY YPLNP+ +DPLGL Sbjct: 174 GRYITQDPIGLAGGWNLYNYPLNPIIRMDPLGL 206 >UniRef50_Q7MDR0 Rhs family protein n=3 Tax=Vibrio vulnificus RepID=Q7MDR0_VIBVY Length = 1498 Score = 157 bits (396), Expect = 4e-37, Method: Composition-based stats. Identities = 53/178 (29%), Positives = 76/178 (42%), Gaps = 6/178 (3%) Query: 2 LALMDADGNIAWSGEYDEWGNQLNEENPHHLHQPYRLPGQQYDKESGLYYNRNRYYDPLQ 61 LAL+D G++ W YD +G E + P R GQ +D E+GL+YN RYYDP Sbjct: 1039 LALVDEQGSVVWQARYDTYGRAHIE--VESVGNPLRFQGQYHDVETGLHYNLARYYDPRT 1096 Query: 62 GRYITQDPIGLEGGWSLYAYPLNPVNGIDPLGLSPAD--VALIRRKDQLNHQRAWDILSD 119 GR+I DPIGL GG + Y Y NPV +DP GL + A+ + Q+ Sbjct: 1097 GRFIQPDPIGLLGGINHYQYAPNPVMWVDPHGLCAKEGSPAIKAGVNDTQPQQPMFYAMG 1156 Query: 120 TYEDMKRLNLGGTDQFFHCMA--FCRVSKLNDAGVSRSAKGLGYEKEIRDYGLNLFGM 175 + + H +A + ++ A + G D L G+ Sbjct: 1157 SGNYASAVKTASPTYQLHAIAPDAIQQVGIDYALGATEMLAAGAYNTAVDAVAGLAGL 1214 >UniRef50_P16919 Protein rhsD n=261 Tax=Bacteria RepID=RHSD_ECOLI Length = 1426 Score = 157 bits (396), Expect = 4e-37, Method: Composition-based stats. Identities = 75/93 (80%), Positives = 86/93 (92%) Query: 2 LALMDADGNIAWSGEYDEWGNQLNEENPHHLHQPYRLPGQQYDKESGLYYNRNRYYDPLQ 61 LAL+ DGN AWS EYDEWGNQLNEENPHH++QPYRLPGQQ+D+ESGLYYNR+RYYDPLQ Sbjct: 1166 LALISEDGNTAWSAEYDEWGNQLNEENPHHVYQPYRLPGQQHDEESGLYYNRHRYYDPLQ 1225 Query: 62 GRYITQDPIGLEGGWSLYAYPLNPVNGIDPLGL 94 GRYITQDP+GL+GGW+LY YPLNP+ IDP+GL Sbjct: 1226 GRYITQDPMGLKGGWNLYQYPLNPLQQIDPMGL 1258 >UniRef50_Q4ZLF3 YD repeat n=5 Tax=Pseudomonas syringae group RepID=Q4ZLF3_PSEU2 Length = 451 Score = 156 bits (395), Expect = 5e-37, Method: Composition-based stats. Identities = 55/135 (40%), Positives = 70/135 (51%), Gaps = 2/135 (1%) Query: 2 LALMDADGNIAWSGEYDEWGNQLNEENPHHLHQPYRLPGQQYDKESGLYYNRNRYYDPLQ 61 L + DA+G I W +Y WG + + + Q R GQ +D E+GL+YN RYYDP Sbjct: 222 LEMTDAEGQIVWQAKYRAWGAV-EKLVVNEVEQNLRFQGQYFDAETGLHYNTFRYYDPEI 280 Query: 62 GRYITQDPIGLEGGWSLYAYPLNPVNGIDPLGLSPADVALIRRKDQLNHQRAWDILSDTY 121 GR+ITQDPIGL GG++LY Y NP IDPLGL + L H RA D S + Sbjct: 281 GRFITQDPIGLLGGFNLYGYCRNPTAWIDPLGL-DWNYFLSNEDGAYYHGRASDKTSLSD 339 Query: 122 EDMKRLNLGGTDQFF 136 + G D Sbjct: 340 VMRRHGKNKGVDGAR 354 >UniRef50_Q2SFS1 Rhs family protein n=1 Tax=Hahella chejuensis KCTC 2396 RepID=Q2SFS1_HAHCH Length = 295 Score = 156 bits (394), Expect = 7e-37, Method: Composition-based stats. Identities = 48/125 (38%), Positives = 72/125 (57%), Gaps = 2/125 (1%) Query: 4 LMDADGNIAWSGEYDEWGNQLNEENPHHLHQPYRLPGQQYDKESGLYYNRNRYYDPLQGR 63 + +A+G + WS Y +GN L ++ + P R GQ YD+E+GL+YNR RYYDP R Sbjct: 1 MTNAEGEVVWSARYKAYGN-LALQDVEDVQNPLRFQGQYYDEETGLHYNRRRYYDPSAAR 59 Query: 64 YITQDPIGLEGGWSLYAYPLNPVNGIDPLGLSPADVALIR-RKDQLNHQRAWDILSDTYE 122 +I QDP+GL GG + Y Y NP +DP GL+ + + + +KD H + +Y+ Sbjct: 60 FINQDPVGLLGGDNNYQYAPNPTGWVDPYGLTCKENSWNQFQKDTKGHFANSTEAAKSYQ 119 Query: 123 DMKRL 127 MK + Sbjct: 120 KMKEV 124 >UniRef50_A8A655 Rhs family protein n=14 Tax=Enterobacteriaceae RepID=A8A655_ECOHS Length = 314 Score = 155 bits (393), Expect = 7e-37, Method: Composition-based stats. Identities = 72/124 (58%), Positives = 86/124 (69%), Gaps = 2/124 (1%) Query: 2 LALMDADGNIAWSGEYDEWGNQLNEENPHHLHQPYRLPGQQYDKESGLYYNRNRYYDPLQ 61 LAL+ +G AW EYDEWGN L++ENPHHL Q RLPGQQYD+ESGLYYNR+RYYDPLQ Sbjct: 60 LALISTEGATAWCAEYDEWGNLLSDENPHHLQQLIRLPGQQYDEESGLYYNRHRYYDPLQ 119 Query: 62 GRYITQDPIGLEGGWSLYAYPLNPVNGIDPLGLSPADVALIRRKDQLNHQRAWDILSDTY 121 GRYITQDPIGL+GGW+ Y YPLNPV +DP GL D+ L D ++ + + Sbjct: 120 GRYITQDPIGLKGGWNFYQYPLNPVINVDPQGL--VDINLYPESDLIHSVADEINIPGVF 177 Query: 122 EDMK 125 Sbjct: 178 TIGG 181 >UniRef50_A1TSG3 RHS protein n=1 Tax=Acidovorax citrulli AAC00-1 RepID=A1TSG3_ACIAC Length = 384 Score = 155 bits (392), Expect = 1e-36, Method: Composition-based stats. Identities = 62/206 (30%), Positives = 89/206 (43%), Gaps = 39/206 (18%) Query: 4 LMDADGNIAWSGEYDEWGNQLNEE------------------------NPHHLHQPYRLP 39 + D DG++AW +Y WG+ + EE + + Q R+ Sbjct: 139 MSDRDGHLAWRAQYRVWGSAVAEEWQAFDGVGRPVEAPRHETGQRPDNSAAPMPQNLRMQ 198 Query: 40 GQQYDKESGLYYNRNRYYDPLQGRYITQDPIGLEGGWSLYAYPLNPVNGIDPLGLSPADV 99 GQ D+E+GL+YN RYY P G + T DPIGL GG +L+ Y NPV+ IDPLG +P Sbjct: 199 GQYLDRETGLHYNTFRYYGPDVGAFTTPDPIGLAGGVNLHQYAPNPVSWIDPLGWNP--- 255 Query: 100 ALIRRKDQLNHQRAWDILSDTYEDMKRLNLGGTDQFFHCMAFCRVSKLNDAGVSRSAKGL 159 + R Q H+ A + L GT + + R D VS +A+ Sbjct: 256 --VCRMSQTPHETASSLP---------LVRPGTSAWQKAVEAIRQGGKGDIRVSTAAEAK 304 Query: 160 GYEKEIRDYGLNLFGMYGRKVKLSHS 185 +E R G++ MY S Sbjct: 305 ALLQEARG-GMDRRKMYSDDGDYSKG 329 >UniRef50_Q2SPP2 Rhs family protein n=3 Tax=Hahella chejuensis KCTC 2396 RepID=Q2SPP2_HAHCH Length = 1434 Score = 155 bits (392), Expect = 1e-36, Method: Composition-based stats. Identities = 44/95 (46%), Positives = 60/95 (63%), Gaps = 1/95 (1%) Query: 4 LMDADGNIAWSGEYDEWGNQLNEENPHHLHQPYRLPGQQYDKESGLYYNRNRYYDPLQGR 63 + D++G + WS Y +G L ++ +H P R GQ YD+E+G +YNR+RYYDP GR Sbjct: 1125 MTDSEGTLVWSARYKAYG-ALALQDVESVHNPLRFQGQYYDEETGFHYNRHRYYDPQSGR 1183 Query: 64 YITQDPIGLEGGWSLYAYPLNPVNGIDPLGLSPAD 98 +I QDPIGL GG + Y Y NPV +DP GL+ Sbjct: 1184 FINQDPIGLLGGANAYQYAPNPVGWVDPFGLTAKP 1218 >UniRef50_P32109 Putative uncharacterized protein yibJ n=14 Tax=Bacteria RepID=YIBJ_ECOLI Length = 233 Score = 155 bits (392), Expect = 1e-36, Method: Composition-based stats. Identities = 71/124 (57%), Positives = 85/124 (68%), Gaps = 2/124 (1%) Query: 2 LALMDADGNIAWSGEYDEWGNQLNEENPHHLHQPYRLPGQQYDKESGLYYNRNRYYDPLQ 61 LAL+ +G AW EYDEWGN L++ENPHHL Q RLPGQQYD+ESGLYYNR+RYYDPL Sbjct: 60 LALISTEGATAWCAEYDEWGNLLSDENPHHLQQLIRLPGQQYDEESGLYYNRHRYYDPLL 119 Query: 62 GRYITQDPIGLEGGWSLYAYPLNPVNGIDPLGLSPADVALIRRKDQLNHQRAWDILSDTY 121 GRYITQDPIGL+GGW+ Y YPLNPV +DP GL D+ L D ++ + + Sbjct: 120 GRYITQDPIGLKGGWNFYQYPLNPVINVDPQGL--VDINLYPESDLIHSVADEINIPGVF 177 Query: 122 EDMK 125 Sbjct: 178 TIGG 181 >UniRef50_Q48LL6 Rhs family protein n=1 Tax=Pseudomonas syringae pv. phaseolicola 1448A RepID=Q48LL6_PSE14 Length = 362 Score = 154 bits (388), Expect = 3e-36, Method: Composition-based stats. Identities = 56/185 (30%), Positives = 78/185 (42%), Gaps = 5/185 (2%) Query: 2 LALMDADGNIAWSGEYDEWGNQLNEENPHHLHQPYRLPGQQYDKESGLYYNRNRYYDPLQ 61 L + DA+G I W +Y WG + + + Q R GQ +D E+GL+YN RYYDP Sbjct: 138 LEMTDAEGQIVWQAKYRAWGAV-EKLVVNEVEQNLRFQGQYFDVETGLHYNTFRYYDPEI 196 Query: 62 GRYITQDPIGLEGGWSLYAYPLNPVNGIDPLGLSPADVALIRRKDQLNHQRAWDILSDTY 121 GR+ITQDPIGL+GG +LY Y NP +DP G + + D H ++ Sbjct: 197 GRFITQDPIGLDGGDNLYKYVPNPTAWVDPWGWACNRPGGYKSGDVDTHG---NLSPGVN 253 Query: 122 EDMKRLNLGGTDQFF-HCMAFCRVSKLNDAGVSRSAKGLGYEKEIRDYGLNLFGMYGRKV 180 N+ H +K N AG R+A K + R Sbjct: 254 RAPGNKNIPSDKSVQSHHFIQDEWAKRNVAGYKRNAAPAVLLKSSSGESHAIVSSLQRTR 313 Query: 181 KLSHS 185 + Sbjct: 314 RRLGG 318 >UniRef50_Q2SFR5 Rhs family protein n=1 Tax=Hahella chejuensis KCTC 2396 RepID=Q2SFR5_HAHCH Length = 471 Score = 153 bits (387), Expect = 4e-36, Method: Composition-based stats. Identities = 43/95 (45%), Positives = 59/95 (62%), Gaps = 1/95 (1%) Query: 4 LMDADGNIAWSGEYDEWGNQLNEENPHHLHQPYRLPGQQYDKESGLYYNRNRYYDPLQGR 63 + +A+G + WS Y +GN L ++ + P R GQ YD+E+GL+YNR RYYDP R Sbjct: 46 MTNAEGEVVWSARYKAYGN-LALKDVEDVQNPLRFQGQYYDEETGLHYNRRRYYDPSAAR 104 Query: 64 YITQDPIGLEGGWSLYAYPLNPVNGIDPLGLSPAD 98 +I QDP+GL GG + Y Y LNP +DP GL+ Sbjct: 105 FINQDPVGLLGGDNNYQYALNPTGWVDPYGLTAKP 139 >UniRef50_UPI0001C341D4 protein RhsA n=1 Tax=Citrobacter youngae ATCC 29220 RepID=UPI0001C341D4 Length = 1365 Score = 153 bits (386), Expect = 6e-36, Method: Composition-based stats. Identities = 67/93 (72%), Positives = 76/93 (81%) Query: 2 LALMDADGNIAWSGEYDEWGNQLNEENPHHLHQPYRLPGQQYDKESGLYYNRNRYYDPLQ 61 LAL+ DG I+W EYDEWGN L E+NPH+L Q RLPGQQYD ESGL+YNR+RYY+P Sbjct: 1135 LALISQDGAISWRAEYDEWGNVLREDNPHNLQQLIRLPGQQYDDESGLHYNRHRYYNPGL 1194 Query: 62 GRYITQDPIGLEGGWSLYAYPLNPVNGIDPLGL 94 GRYITQDPIGL+GGW+LY YPLNPV IDP GL Sbjct: 1195 GRYITQDPIGLKGGWNLYKYPLNPVEYIDPSGL 1227 >UniRef50_C1M8X5 Core protein n=2 Tax=Citrobacter RepID=C1M8X5_9ENTR Length = 1359 Score = 152 bits (385), Expect = 7e-36, Method: Composition-based stats. Identities = 66/104 (63%), Positives = 76/104 (73%) Query: 2 LALMDADGNIAWSGEYDEWGNQLNEENPHHLHQPYRLPGQQYDKESGLYYNRNRYYDPLQ 61 LAL+ DG I+W EYDEWGN L E+NPH+L Q RLPGQQYD ESGL+YNR+RYY+P Sbjct: 1136 LALIRQDGAISWRAEYDEWGNVLREDNPHNLQQLIRLPGQQYDDESGLHYNRHRYYNPGL 1195 Query: 62 GRYITQDPIGLEGGWSLYAYPLNPVNGIDPLGLSPADVALIRRK 105 GRYITQDPIGL GG + Y YPLNPV +DPLGL + I Sbjct: 1196 GRYITQDPIGLAGGLNPYQYPLNPVTEVDPLGLWAFAIPAILEG 1239 >UniRef50_B5PJT7 Protein RhsD n=2 Tax=Enterobacteriaceae RepID=B5PJT7_SALET Length = 429 Score = 152 bits (383), Expect = 1e-35, Method: Composition-based stats. Identities = 73/99 (73%), Positives = 80/99 (80%) Query: 2 LALMDADGNIAWSGEYDEWGNQLNEENPHHLHQPYRLPGQQYDKESGLYYNRNRYYDPLQ 61 LAL+ D +AW GEYDEWGN EENP HL Q RLPGQQYD+ESGLYYNR+RYY+P Q Sbjct: 179 LALITPDNTVAWRGEYDEWGNLSGEENPAHLEQVIRLPGQQYDEESGLYYNRHRYYNPGQ 238 Query: 62 GRYITQDPIGLEGGWSLYAYPLNPVNGIDPLGLSPADVA 100 GRYITQDPIGL GGW+LY YPLNPV+ IDPLGLS D A Sbjct: 239 GRYITQDPIGLRGGWNLYNYPLNPVSEIDPLGLSMWDDA 277 >UniRef50_A7FDK7 YD/RHS repeat protein n=25 Tax=Enterobacteriaceae RepID=A7FDK7_YERP3 Length = 1527 Score = 151 bits (381), Expect = 2e-35, Method: Composition-based stats. Identities = 52/185 (28%), Positives = 81/185 (43%), Gaps = 11/185 (5%) Query: 4 LMDADGNIAWSGEYDEWGN----QLNEENPHHLHQPYRLPGQQYDKESGLYYNRNRYYDP 59 L++ G + W+ WG + + R GQ D ESGL+YNR RYYD Sbjct: 1295 LLNEQGKVVWASRLSTWGQAELWRQAANEEDRVSCNLRFAGQYADAESGLHYNRFRYYDG 1354 Query: 60 LQGRYITQDPIGLEGGWSLYAYPLNPVNGIDPLGLSPADVALIRRKDQLNHQRAWDILSD 119 G+Y+ DPIGL GG + Y Y NPV +DPLGL DVA R+ L ++I Sbjct: 1355 ETGQYLCPDPIGLAGGLNPYGYVHNPVKYVDPLGLCKTDVARERQAQMLQDDVGYNISPK 1414 Query: 120 TYEDMKRLNLGGT--DQFFHCMAFCRVSKLNDAGVSRSAKG-----LGYEKEIRDYGLNL 172 +++ + G+ + + + + +S+S +G + G N+ Sbjct: 1415 SWDQFPSIGRDGSFITDKKGALKYFNGMQTGNVTISKSVAASIEKDMGLSLGSLNGGFNI 1474 Query: 173 FGMYG 177 + G Sbjct: 1475 RKIDG 1479 >UniRef50_Q13ML3 YD repeat protein n=10 Tax=Proteobacteria RepID=Q13ML3_BURXL Length = 1531 Score = 151 bits (381), Expect = 2e-35, Method: Composition-based stats. Identities = 53/133 (39%), Positives = 71/133 (53%), Gaps = 3/133 (2%) Query: 4 LMDADGNIAWSGEYDEWGNQLNEENPH--HLHQPYRLPGQQYDKESGLYYNRNRYYDPLQ 61 L ADG I W Y WGN + E P + Q R GQ D+E+GL+YN RYYDP Sbjct: 1312 LTSADGRIVWQAMYQLWGNTVRESEPESYAVRQNLRYQGQYLDRETGLHYNTLRYYDPDI 1371 Query: 62 GRYITQDPIGLEGGWSLYAYPLNPVNGIDPLGLSPADVALIRRKDQLNHQRAWDILSDTY 121 GR+ T DPIGL GG +LY Y NP++ IDP+GLS L ++QL+ + + + Sbjct: 1372 GRFTTPDPIGLAGGVNLYRYAPNPMSWIDPMGLS-CTSNLKDIENQLSRGKGATVTVSSK 1430 Query: 122 EDMKRLNLGGTDQ 134 + + L T Sbjct: 1431 AEAEELLRAYTSG 1443 >UniRef50_B2HXH5 Rhs family protein n=5 Tax=Acinetobacter baumannii RepID=B2HXH5_ACIBC Length = 1635 Score = 151 bits (381), Expect = 2e-35, Method: Composition-based stats. Identities = 51/159 (32%), Positives = 75/159 (47%), Gaps = 11/159 (6%) Query: 4 LMDADGNIAWSGEYDEWGNQLNE-------ENPHHLHQPYRLPGQQYDKESGLYYNRNRY 56 + D G I W EY WG E E + R GQ +D+E+GL+YNR RY Sbjct: 1383 MSDQTGAIIWKAEYKAWGECKLEQTNSDFFEKSEIISNNIRFQGQYFDEETGLHYNRYRY 1442 Query: 57 YDPLQGRYITQDPIGLEGGWSLYAYPLNPVNGIDPLGLSPADVALIRRKDQL----NHQR 112 Y P GR+I++DPIGL GG+++YAY NPV +DP GL+P + + + Q Sbjct: 1443 YSPYVGRFISKDPIGLLGGFNVYAYTANPVQWVDPYGLAPCSLVRYKPDKVTPQAGSRQD 1502 Query: 113 AWDILSDTYEDMKRLNLGGTDQFFHCMAFCRVSKLNDAG 151 A D + + + GT + + + +G Sbjct: 1503 AIDRAWSLEKQLIQTTGTGTRDWSKAELDTILRTPSGSG 1541 >UniRef50_Q0JZD5 RHS family protein n=9 Tax=Bacteria RepID=Q0JZD5_RALEH Length = 1585 Score = 150 bits (379), Expect = 4e-35, Method: Composition-based stats. Identities = 45/107 (42%), Positives = 63/107 (58%), Gaps = 4/107 (3%) Query: 4 LMDADGNIAWSGEYDEWGN----QLNEENPHHLHQPYRLPGQQYDKESGLYYNRNRYYDP 59 L D G +AWS +Y WG N + P R GQ YD E+GL+YNR+RYYDP Sbjct: 1377 LTDEAGELAWSAQYKAWGAAQEAISNAARKAGIQNPLRFQGQYYDHENGLHYNRHRYYDP 1436 Query: 60 LQGRYITQDPIGLEGGWSLYAYPLNPVNGIDPLGLSPADVALIRRKD 106 GR++++DPIGL GG +L Y NP++ +DPLGL+ + ++ Sbjct: 1437 GTGRFVSKDPIGLAGGLNLNQYAPNPISWVDPLGLACSQTRRASLRE 1483 >UniRef50_B1J8X0 YD repeat protein n=50 Tax=Gammaproteobacteria RepID=B1J8X0_PSEPW Length = 1411 Score = 150 bits (379), Expect = 4e-35, Method: Composition-based stats. Identities = 52/153 (33%), Positives = 75/153 (49%), Gaps = 2/153 (1%) Query: 2 LALMDADGNIAWSGEYDEWGNQLNEENPHHLHQPYRLPGQQYDKESGLYYNRNRYYDPLQ 61 L L D++G I W Y WG + + + + Q R GQ +D+E+ L+YN RYYDP Sbjct: 1185 LELTDSEGKIVWQATYRSWG-AIEQLTVNEIDQNLRFQGQYFDRETSLHYNTLRYYDPDV 1243 Query: 62 GRYITQDPIGLEGGWSLYAYPLNPVNGIDPLGLSPADVALIRRKDQLNHQRAWDILS-DT 120 GR+I DPIGL GG +L+ Y +NP+ IDP GL+P V ++ + D Sbjct: 1244 GRFIGPDPIGLRGGVNLFRYNVNPIYWIDPTGLAPCQVRVVNNTKIHGRGQVDGTPGHDQ 1303 Query: 121 YEDMKRLNLGGTDQFFHCMAFCRVSKLNDAGVS 153 + + L + +F S N GVS Sbjct: 1304 FSEAIANKLAMSGRFSDVYLNRSYSFANGRGVS 1336 >UniRef50_Q328Z1 RhsA protein in rhs element n=7 Tax=Enterobacteriaceae RepID=Q328Z1_SHIDS Length = 1213 Score = 150 bits (378), Expect = 5e-35, Method: Composition-based stats. Identities = 71/94 (75%), Positives = 79/94 (84%) Query: 2 LALMDADGNIAWSGEYDEWGNQLNEENPHHLHQPYRLPGQQYDKESGLYYNRNRYYDPLQ 61 LAL+ +G W EYDEWGN LNEENPH L Q RLPGQQYD+ESGLYYNR+RYYDPLQ Sbjct: 1079 LALVSTEGATEWCAEYDEWGNLLNEENPHQLQQLIRLPGQQYDEESGLYYNRHRYYDPLQ 1138 Query: 62 GRYITQDPIGLEGGWSLYAYPLNPVNGIDPLGLS 95 GRYITQDPIGL+GGW+LY Y LNP++ IDPLGLS Sbjct: 1139 GRYITQDPIGLKGGWNLYGYQLNPISDIDPLGLS 1172 >UniRef50_A1U3R9 YD repeat protein n=3 Tax=Gammaproteobacteria RepID=A1U3R9_MARAV Length = 1611 Score = 149 bits (377), Expect = 5e-35, Method: Composition-based stats. Identities = 42/92 (45%), Positives = 58/92 (63%), Gaps = 1/92 (1%) Query: 4 LMDADGNIAWSGEYDEWGNQLNEENPHHLHQPYRLPGQQYDKESGLYYNRNRYYDPLQGR 63 L + G + WS Y +GN + ++ + P R GQ +D E+GL+YNR+RYY+P GR Sbjct: 1368 LTNQQGRLVWSVTYRAYGNVVQQQVAE-IDNPLRFQGQYHDPETGLHYNRHRYYNPNTGR 1426 Query: 64 YITQDPIGLEGGWSLYAYPLNPVNGIDPLGLS 95 +IT DPIGL GG + Y Y NP +DPLGL+ Sbjct: 1427 FITPDPIGLAGGLNNYQYVPNPTGWVDPLGLA 1458 >UniRef50_Q6LUC6 Hypothetical nucleotidyltransferase n=1 Tax=Photobacterium profundum RepID=Q6LUC6_PHOPR Length = 1352 Score = 149 bits (377), Expect = 5e-35, Method: Composition-based stats. Identities = 49/128 (38%), Positives = 68/128 (53%), Gaps = 2/128 (1%) Query: 4 LMDADGNIAWSGEYDEWGNQLNEENPHHLHQPYRLPGQQYDKESGLYYNRNRYYDPLQGR 63 + D++G + W Y+ G + +H P R GQ +D+ESGL+YNR RYYDP G+ Sbjct: 1071 VTDSEGEVVWQATYNALGCAAISID--IIHNPLRFQGQYHDQESGLHYNRFRYYDPSIGQ 1128 Query: 64 YITQDPIGLEGGWSLYAYPLNPVNGIDPLGLSPADVALIRRKDQLNHQRAWDILSDTYED 123 +I QDPIGL GG + Y Y NP+ +DPLGLS + A + K + D Sbjct: 1129 FIHQDPIGLLGGINHYRYAPNPIQWVDPLGLSCKEAAFEKIKQSFVNHILADFDLCRGGG 1188 Query: 124 MKRLNLGG 131 +N GG Sbjct: 1189 RVPVNFGG 1196 >UniRef50_Q1LDW7 YD repeat n=2 Tax=Burkholderiaceae RepID=Q1LDW7_RALME Length = 1626 Score = 149 bits (376), Expect = 7e-35, Method: Composition-based stats. Identities = 51/119 (42%), Positives = 66/119 (55%), Gaps = 7/119 (5%) Query: 4 LMDADGNIAWSGEYDEWG---NQLNEENPHHLHQPYRLPGQQYDKESGLYYNRNRYYDPL 60 L+D G + W Y WG +P R PGQ +D E+GL+YNR+RYYDP Sbjct: 1414 LVDESGKVVWLARYKAWGGLKTPRKSTDPTETTNAIRFPGQYHDVETGLHYNRHRYYDPG 1473 Query: 61 QGRYITQDPIGLEGGWSLYAYPLNPVNGIDPLGL---SPAD-VALIRRKDQLNHQRAWD 115 GR+I++DP+GL GG ++Y Y NPV +DPLGL SPAD +A R Q A Sbjct: 1474 SGRFISKDPVGLAGGINVYTYAPNPVGWVDPLGLRCDSPADKLARKLRALQKAQGNAAS 1532 >UniRef50_B1JCT8 RHS protein n=1 Tax=Pseudomonas putida W619 RepID=B1JCT8_PSEPW Length = 231 Score = 149 bits (376), Expect = 8e-35, Method: Composition-based stats. Identities = 42/100 (42%), Positives = 58/100 (58%), Gaps = 1/100 (1%) Query: 2 LALMDADGNIAWSGEYDEWGNQLNEENPHHLHQPYRLPGQQYDKESGLYYNRNRYYDPLQ 61 + L +++G I W Y WG + E H + Q R GQ ++ E+GL+YN RYYDP Sbjct: 1 MELSNSEGEIVWQATYRSWG-AIEELKVHDIEQNLRFQGQYFESETGLHYNTLRYYDPEV 59 Query: 62 GRYITQDPIGLEGGWSLYAYPLNPVNGIDPLGLSPADVAL 101 GR++TQDPIGL G + Y Y +PV +DP GL+ V Sbjct: 60 GRFVTQDPIGLGDGMNFYQYAPSPVMWVDPWGLAFKSVNF 99 >UniRef50_Q83LZ1 Putative Rhs-family protein n=1 Tax=Shigella flexneri RepID=Q83LZ1_SHIFL Length = 211 Score = 149 bits (375), Expect = 9e-35, Method: Composition-based stats. Identities = 46/97 (47%), Positives = 63/97 (64%) Query: 4 LMDADGNIAWSGEYDEWGNQLNEENPHHLHQPYRLPGQQYDKESGLYYNRNRYYDPLQGR 63 + D+DG I W Y WGN + E++ + Q R GQ D+E+GL+YN +RYYDP GR Sbjct: 1 MTDSDGKIVWETGYQVWGNTIQEKDHGGVEQNLRYQGQYLDRETGLHYNLHRYYDPDVGR 60 Query: 64 YITQDPIGLEGGWSLYAYPLNPVNGIDPLGLSPADVA 100 ++ DPIGL GG +LY+Y NP+ DPLGL+P V+ Sbjct: 61 FMVTDPIGLRGGLNLYSYAPNPLKYADPLGLTPCAVS 97 >UniRef50_C5CZG8 RHS protein n=1 Tax=Variovorax paradoxus S110 RepID=C5CZG8_VARPS Length = 1609 Score = 148 bits (374), Expect = 1e-34, Method: Composition-based stats. Identities = 45/109 (41%), Positives = 61/109 (55%), Gaps = 4/109 (3%) Query: 4 LMDADGNIAWSGEYDEWGN----QLNEENPHHLHQPYRLPGQQYDKESGLYYNRNRYYDP 59 L D +G IAWS +Y WG P R GQ +D E+GL+YNR+RYYDP Sbjct: 1390 LTDHEGRIAWSAQYKAWGEAKQAISEAGRKAGFRNPIRFQGQYFDDETGLHYNRHRYYDP 1449 Query: 60 LQGRYITQDPIGLEGGWSLYAYPLNPVNGIDPLGLSPADVALIRRKDQL 108 GR++++DPIGL GG +L Y NP+ IDPLGL+ + + + Sbjct: 1450 SCGRFVSKDPIGLAGGSNLQQYAPNPLGWIDPLGLAGKGITPNNKGTRT 1498 >UniRef50_B2VH58 Rhs family protein n=5 Tax=Enterobacteriaceae RepID=B2VH58_ERWT9 Length = 1322 Score = 148 bits (373), Expect = 2e-34, Method: Composition-based stats. Identities = 47/122 (38%), Positives = 65/122 (53%), Gaps = 6/122 (4%) Query: 4 LMDADGNIAWSGEYDEWGNQLN------EENPHHLHQPYRLPGQQYDKESGLYYNRNRYY 57 L + G+I W +Y WGN E + +HQP R GQ +D E+GL+YNR RYY Sbjct: 1108 LSNRSGDICWQADYRVWGNTRQVSYAQQEADAETIHQPLRYQGQYFDGETGLHYNRFRYY 1167 Query: 58 DPLQGRYITQDPIGLEGGWSLYAYPLNPVNGIDPLGLSPADVALIRRKDQLNHQRAWDIL 117 DP GR+I++DP+GL GG +LY Y NP +DPLGL + ++ + A Sbjct: 1168 DPDIGRFISRDPVGLSGGMNLYQYAPNPYGWVDPLGLMKCSPHKKTTYEGVSRRDALRQA 1227 Query: 118 SD 119 Sbjct: 1228 KR 1229 >UniRef50_Q2SKM2 Rhs family protein n=3 Tax=Hahella chejuensis KCTC 2396 RepID=Q2SKM2_HAHCH Length = 1552 Score = 147 bits (372), Expect = 2e-34, Method: Composition-based stats. Identities = 42/95 (44%), Positives = 59/95 (62%), Gaps = 1/95 (1%) Query: 4 LMDADGNIAWSGEYDEWGNQLNEENPHHLHQPYRLPGQQYDKESGLYYNRNRYYDPLQGR 63 + +A+G + WS Y +GN L ++ + P R GQ YD+E+GL+YNR+RYYDP R Sbjct: 1268 MTNAEGEVVWSARYKAYGN-LALKDVEDVQNPLRFQGQYYDEETGLHYNRHRYYDPSAAR 1326 Query: 64 YITQDPIGLEGGWSLYAYPLNPVNGIDPLGLSPAD 98 +I QDP+GL GG + Y Y NP DP GL+ + Sbjct: 1327 FINQDPVGLLGGDNNYQYAPNPTGWGDPFGLTCKE 1361 >UniRef50_Q147B5 Rhs family protein n=2 Tax=Betaproteobacteria RepID=Q147B5_BURXL Length = 1362 Score = 147 bits (371), Expect = 3e-34, Method: Composition-based stats. Identities = 48/99 (48%), Positives = 57/99 (57%), Gaps = 8/99 (8%) Query: 5 MDADGNIAWSGEYDEWGNQLNEENPH--------HLHQPYRLPGQQYDKESGLYYNRNRY 56 D G W Y WG L H HQP R GQ +D+E+GL+YNR+RY Sbjct: 1141 TDDAGRTQWRARYAAWGRLLGANGGHEQMHESGRQAHQPLRFQGQYFDEETGLHYNRHRY 1200 Query: 57 YDPLQGRYITQDPIGLEGGWSLYAYPLNPVNGIDPLGLS 95 YDP GR++TQDPIGL GG +LY Y NP IDPLGL+ Sbjct: 1201 YDPDAGRFMTQDPIGLRGGINLYRYAPNPGRWIDPLGLA 1239 >UniRef50_Q87U70 Rhs family protein n=2 Tax=Pseudomonas RepID=Q87U70_PSESM Length = 1572 Score = 147 bits (370), Expect = 4e-34, Method: Composition-based stats. Identities = 47/122 (38%), Positives = 66/122 (54%), Gaps = 1/122 (0%) Query: 4 LMDADGNIAWSGEYDEWGNQLNEENPHHLHQPYRLPGQQYDKESGLYYNRNRYYDPLQGR 63 L D G I WS +Y +GN L + + P R GQ +D E+GL+YNR+RYY+P GR Sbjct: 1322 LTDYSGEIMWSAKYRAYGN-LATLDIAEIENPLRFQGQYFDAETGLHYNRHRYYNPGTGR 1380 Query: 64 YITQDPIGLEGGWSLYAYPLNPVNGIDPLGLSPADVALIRRKDQLNHQRAWDILSDTYED 123 ++T DPI L GG + Y Y NP +DPLGLS + K + + D E+ Sbjct: 1381 FLTPDPIKLAGGLNNYQYVPNPTGWVDPLGLSGECPDSEKNKIPVEEASSPSNSYDNSEE 1440 Query: 124 MK 125 ++ Sbjct: 1441 LR 1442 >UniRef50_UPI0001BC4026 Rhs family protein n=6 Tax=Neisseria RepID=UPI0001BC4026 Length = 1477 Score = 147 bits (370), Expect = 4e-34, Method: Composition-based stats. Identities = 54/127 (42%), Positives = 75/127 (59%), Gaps = 4/127 (3%) Query: 4 LMDADGNIAWSGEYDEWGNQLNEEN-PHHLHQPYRLPGQQYDKESGLYYNRNRYYDPLQG 62 + D DG + W G+YD WG + E N HQP+RL Q +D+E+GL+YN RYYDP G Sbjct: 1159 MTDEDGKLLWFGKYDVWGKLVKETNITGSAHQPFRLQNQYFDRETGLHYNFFRYYDPDIG 1218 Query: 63 RYITQDPIGLEGGWSLYAYPLNPVNGIDPLGLSPADVALIRRKDQLNHQRAWDILSD--T 120 R++ QDPIGL+GG +LY + N DPLGL P D+ D+ ++ W + + Sbjct: 1219 RFVNQDPIGLDGGENLYGFAPNAAVWSDPLGLEPMDIGKYIP-DEYRPKQGWKPPFEMPS 1277 Query: 121 YEDMKRL 127 YED+ L Sbjct: 1278 YEDVDAL 1284 >UniRef50_P77779 Putative uncharacterized protein ybfO n=67 Tax=Enterobacteriaceae RepID=YBFO_ECOLI Length = 477 Score = 147 bits (370), Expect = 4e-34, Method: Composition-based stats. Identities = 69/96 (71%), Positives = 77/96 (80%) Query: 2 LALMDADGNIAWSGEYDEWGNQLNEENPHHLHQPYRLPGQQYDKESGLYYNRNRYYDPLQ 61 LAL+ +G W EYDEWGN LNEENPH L Q RLPGQQYD+ESGLYYNR+RYYDPLQ Sbjct: 233 LALVSTEGATEWCAEYDEWGNLLNEENPHQLQQLIRLPGQQYDEESGLYYNRHRYYDPLQ 292 Query: 62 GRYITQDPIGLEGGWSLYAYPLNPVNGIDPLGLSPA 97 GRYITQDPIGL+GGW+ Y YPLNPV ID +GL+ Sbjct: 293 GRYITQDPIGLKGGWNFYQYPLNPVQYIDSMGLASK 328 >UniRef50_B0VRR7 Putative uncharacterized protein n=1 Tax=Acinetobacter baumannii SDF RepID=B0VRR7_ACIBS Length = 296 Score = 147 bits (370), Expect = 4e-34, Method: Composition-based stats. Identities = 44/103 (42%), Positives = 58/103 (56%), Gaps = 7/103 (6%) Query: 4 LMDADGNIAWSGEYDEWGNQLNE-------ENPHHLHQPYRLPGQQYDKESGLYYNRNRY 56 + D G I W EY WG E EN + R GQ +D+E+GL+YNR RY Sbjct: 57 MTDHTGAIIWKAEYKAWGECKAEKAKSNFFENSEIISNNIRFQGQYFDEETGLHYNRYRY 116 Query: 57 YDPLQGRYITQDPIGLEGGWSLYAYPLNPVNGIDPLGLSPADV 99 Y P GR++++DPIGL GG + YAY +P +DPLGLS + Sbjct: 117 YSPYVGRFVSKDPIGLLGGSNNYAYAPSPTEWVDPLGLSCQKI 159 >UniRef50_C8QGI4 YD repeat protein n=1 Tax=Pantoea sp. At-9b RepID=C8QGI4_9ENTR Length = 465 Score = 146 bits (369), Expect = 4e-34, Method: Composition-based stats. Identities = 50/116 (43%), Positives = 61/116 (52%), Gaps = 8/116 (6%) Query: 2 LALMDADGNIAWSGEYDEWGNQLNEEN--------PHHLHQPYRLPGQQYDKESGLYYNR 53 L + DA G + WSG+Y +G + + HQP R GQ D E+GL+YN Sbjct: 245 LEVTDASGKLRWSGQYGSFGEVTRQTDGVYRRASQTSLSHQPLRYAGQYADAETGLHYNL 304 Query: 54 NRYYDPLQGRYITQDPIGLEGGWSLYAYPLNPVNGIDPLGLSPADVALIRRKDQLN 109 RYYDP GR+ QDPIGL GGW+LY Y NP+ IDP GLS D L Sbjct: 305 FRYYDPQTGRFTVQDPIGLAGGWNLYQYAPNPLTWIDPTGLSRYPGVDFSGSDALY 360 >UniRef50_A6GLW0 Rhs family protein n=1 Tax=Limnobacter sp. MED105 RepID=A6GLW0_9BURK Length = 1598 Score = 146 bits (369), Expect = 5e-34, Method: Composition-based stats. Identities = 46/126 (36%), Positives = 70/126 (55%), Gaps = 2/126 (1%) Query: 4 LMDADGNIAWSGEYDEWGNQLNEENPHHLHQPYRLPGQQYDKESGLYYNRNRYYDPLQGR 63 + +A + WS + +G + + + + P R GQ +D E+GL+YNR+RYYDP G+ Sbjct: 1371 ITNASAEVVWSSTFKTYGALVL-AHVNEVENPLRFQGQYFDSETGLHYNRHRYYDPNCGQ 1429 Query: 64 YITQDPIGLEGGWSLYAYPLNPVNGIDPLGLSPADVALIRRKDQLNHQRAWDILSDTYED 123 + TQDPIGL GG + Y Y NP+ +DP GLS D A + +R D L+ T+E Sbjct: 1430 FTTQDPIGLLGGMNTYQYAPNPMTWVDPWGLSCKDQANSIETKR-QLERLPDDLAGTFEG 1488 Query: 124 MKRLNL 129 + + Sbjct: 1489 GRYTSR 1494 >UniRef50_C3JXH8 Putative Rhs protein n=1 Tax=Pseudomonas fluorescens SBW25 RepID=C3JXH8_PSEFS Length = 1597 Score = 146 bits (368), Expect = 6e-34, Method: Composition-based stats. Identities = 44/95 (46%), Positives = 56/95 (58%), Gaps = 1/95 (1%) Query: 4 LMDADGNIAWSGEYDEWGNQLNEENPHHLHQPYRLPGQQYDKESGLYYNRNRYYDPLQGR 63 L ADG I WS Y +G + + + P R GQ +D+ESGL+YNR+RYY P GR Sbjct: 1342 LTAADGEIVWSAHYRAYGE-ITRLDIGKIDNPLRFQGQYFDQESGLHYNRHRYYHPDIGR 1400 Query: 64 YITQDPIGLEGGWSLYAYPLNPVNGIDPLGLSPAD 98 Y+T DP+ L GG + Y Y NP +DPLGLS Sbjct: 1401 YLTPDPVKLTGGINAYQYVPNPTGWVDPLGLSSCP 1435 >UniRef50_UPI00019F17FA RhsC core protein with extension n=1 Tax=Citrobacter youngae ATCC 29220 RepID=UPI00019F17FA Length = 274 Score = 146 bits (368), Expect = 6e-34, Method: Composition-based stats. Identities = 82/209 (39%), Positives = 106/209 (50%), Gaps = 21/209 (10%) Query: 2 LALMDADGNIAWSGEYDEWGNQLNEENPHHLHQPYRLPGQQYDKESGLYYNRNRYYDPLQ 61 LAL++ DG I+W EYDEWGN L E+NPH+L + RLPGQQ D+ESGLYYNR+RY P Q Sbjct: 27 LALINQDGAISWRAEYDEWGNVLREDNPHNLQRLIRLPGQQCDEESGLYYNRHRYDSPGQ 86 Query: 62 GRYITQDPIGLEGGWSLYAYPLNPVNGIDPLGLSPADVALIRRKDQLNHQRAWDILSDTY 121 RY+T DPIGLEGG + Y YP NP+ IDPLGL P + +N A + Sbjct: 87 DRYLTLDPIGLEGGLNPYTYPRNPIRKIDPLGLQPWN--------SINMGSATTERASLG 138 Query: 122 EDMKRLNLGGTDQFFHCMA-------FCRVSKLNDAGVSRSAKGLGYEKEIRDYGLNLFG 174 M + N TDQ +A R +++ S Y ++ G L G Sbjct: 139 LWMAQ-NGASTDQMVKALAPLPPPSSISRECRVSGIAALGSGLSGSYSYNEKNGGNVLIG 197 Query: 175 MYGRKVKLSHS-----EMIEDNKKDLAVN 198 + L S + + KDL N Sbjct: 198 APLAAIGLRGSLTCGLKFRSSDAKDLKTN 226 >UniRef50_Q31U53 Putative uncharacterized protein n=1 Tax=Shigella boydii Sb227 RepID=Q31U53_SHIBS Length = 927 Score = 146 bits (368), Expect = 7e-34, Method: Composition-based stats. Identities = 70/93 (75%), Positives = 78/93 (83%) Query: 2 LALMDADGNIAWSGEYDEWGNQLNEENPHHLHQPYRLPGQQYDKESGLYYNRNRYYDPLQ 61 L L+ +G W EYDEWGN LNEENPHHL Q RLPGQQYD+ESGLYYNR+RYYDPLQ Sbjct: 810 LVLISTEGATEWCAEYDEWGNLLNEENPHHLQQLIRLPGQQYDEESGLYYNRHRYYDPLQ 869 Query: 62 GRYITQDPIGLEGGWSLYAYPLNPVNGIDPLGL 94 GRYITQDPIGL+GGW+ Y YPL+PVN +DPLGL Sbjct: 870 GRYITQDPIGLKGGWNFYQYPLSPVNSMDPLGL 902 >UniRef50_A7FDJ9 RHS/YD repeat protein n=30 Tax=Enterobacteriaceae RepID=A7FDJ9_YERP3 Length = 1418 Score = 145 bits (367), Expect = 9e-34, Method: Composition-based stats. Identities = 47/102 (46%), Positives = 57/102 (55%), Gaps = 7/102 (6%) Query: 2 LALMDADGNIAWSGEYDEWGNQLNE-------ENPHHLHQPYRLPGQQYDKESGLYYNRN 54 L + D +G WSG+Y WG + QP R PGQ D E+GL+YN Sbjct: 1192 LDVTDGEGKHRWSGKYHAWGKVTRQNVSDPRQSTVSRFAQPLRYPGQYSDDETGLHYNTF 1251 Query: 55 RYYDPLQGRYITQDPIGLEGGWSLYAYPLNPVNGIDPLGLSP 96 RYYDP GR+ TQDPIGL GG +LY Y NP+ +DPLG P Sbjct: 1252 RYYDPEIGRFSTQDPIGLAGGINLYQYGPNPLGWVDPLGWMP 1293 >UniRef50_D0KES6 YD repeat protein n=15 Tax=Gammaproteobacteria RepID=D0KES6_PECWW Length = 1379 Score = 145 bits (365), Expect = 1e-33, Method: Composition-based stats. Identities = 46/102 (45%), Positives = 60/102 (58%), Gaps = 8/102 (7%) Query: 2 LALMDADGNIAWSGEYDEWGNQLNEE--------NPHHLHQPYRLPGQQYDKESGLYYNR 53 L + DA+G + WSG+Y +G + Q R GQ D+E+GL+YN Sbjct: 1175 LEMTDAEGAVRWSGDYGSFGAINGQTQDSEGLRHGKPVESQSLRYAGQYADEETGLHYNL 1234 Query: 54 NRYYDPLQGRYITQDPIGLEGGWSLYAYPLNPVNGIDPLGLS 95 RYYDP GR+ TQDPIGL GG +LYAY NP+ +DPLGL+ Sbjct: 1235 FRYYDPTVGRFTTQDPIGLAGGLNLYAYAPNPLGWVDPLGLA 1276 >UniRef50_B5S3P6 Probable rhs-related protein (Fragment) n=1 Tax=Ralstonia solanacearum RepID=B5S3P6_RALSO Length = 445 Score = 144 bits (364), Expect = 2e-33, Method: Composition-based stats. Identities = 51/176 (28%), Positives = 81/176 (46%), Gaps = 8/176 (4%) Query: 1 MLALMDADGNIAWSGEYDEWGNQLNEENPHHLHQPYRLPGQQYDKESGLYYNRNRYYDPL 60 + + DG EY +G + + + G QY SG+Y R YDP Sbjct: 99 VTDTLTPDGRAVTHTEYGPYGELVKSQGRAEYRSDFGYAGMQYHAASGMYLTLFRAYDPG 158 Query: 61 QGRYITQDPIGLEGGWSLYAYPL-NPVNGIDPLGL---SPADVALIR-RKDQLNHQR-AW 114 GR++++DPIG +GG +LYAY NP N +DP G+ P + +I +K + + + AW Sbjct: 159 TGRWVSRDPIGEDGGENLYAYANGNPENYVDPNGMLAIWPTNSGVILGKKYRCKYGKDAW 218 Query: 115 DILSDTYEDMKRLNLGGTDQFFHCMAFCRVSKLNDAGVSRSAKGLGYE--KEIRDY 168 M+ LN G + + A+ VS + +A +GY KE+R+ Sbjct: 219 SNARSDRNKMRGLNTGLRNAEHYLYAYDSVSSGEYNAGTMTALSIGYSIIKEVRNR 274 >UniRef50_Q7NY44 Probable Rhs-family protein n=2 Tax=Chromobacterium violaceum RepID=Q7NY44_CHRVO Length = 1513 Score = 144 bits (364), Expect = 2e-33, Method: Composition-based stats. Identities = 45/100 (45%), Positives = 62/100 (62%), Gaps = 4/100 (4%) Query: 3 ALMDADGNIAWSGEYDEWGN----QLNEENPHHLHQPYRLPGQQYDKESGLYYNRNRYYD 58 AL D G +A +Y WG + + P+R GQ +D ESGL+YNR+RYYD Sbjct: 1297 ALTDEHGALALEMDYQAWGQAREVIADAAGKAGIRNPFRFQGQYHDDESGLHYNRHRYYD 1356 Query: 59 PLQGRYITQDPIGLEGGWSLYAYPLNPVNGIDPLGLSPAD 98 P GR+I++DPIGL+GG ++Y Y LNP+ +DPLGL+ Sbjct: 1357 PEIGRFISRDPIGLKGGINIYGYALNPIVWMDPLGLTGKQ 1396 >UniRef50_C0Q6Q9 Rhs-family protein n=27 Tax=Enterobacteriaceae RepID=C0Q6Q9_SALPC Length = 1593 Score = 143 bits (361), Expect = 5e-33, Method: Composition-based stats. Identities = 44/91 (48%), Positives = 58/91 (63%) Query: 4 LMDADGNIAWSGEYDEWGNQLNEENPHHLHQPYRLPGQQYDKESGLYYNRNRYYDPLQGR 63 + D GNI W Y WGN +E+ + Q R GQ D+E+GL+YN R+YDP G+ Sbjct: 1355 MTDGGGNIVWEAGYQVWGNLTHEKETRPVQQNLRFQGQYLDRETGLHYNLYRFYDPDIGK 1414 Query: 64 YITQDPIGLEGGWSLYAYPLNPVNGIDPLGL 94 +I+ DPI L GG +LYAY NP++ IDPLGL Sbjct: 1415 FISGDPISLRGGINLYAYAPNPISWIDPLGL 1445 >UniRef50_UPI0001B53B37 YD repeat protein n=1 Tax=Streptomyces sp. AA4 RepID=UPI0001B53B37 Length = 1465 Score = 143 bits (360), Expect = 5e-33, Method: Composition-based stats. Identities = 53/133 (39%), Positives = 70/133 (52%), Gaps = 15/133 (11%) Query: 4 LMDADGNIAWSGEYDEWGNQLNEENPHHLHQPYRLPGQQYDKESGLYYNRNRYYDPLQGR 63 L+D GN+AW + WG + + P R PGQ +D+ESGLYYN RYYDP GR Sbjct: 1268 LIDPGGNVAWHADRTLWGYRAGASQG-GVSVPMRFPGQYHDEESGLYYNYFRYYDPETGR 1326 Query: 64 YITQDPIGLEGGWSLYAYPLNPVNGIDPLGLSPADVALIRRKDQLNHQRAWDILSDTYE- 122 Y + DP+GL GG + +AY NP + +DP GLS +QR W SD + Sbjct: 1327 YASPDPLGLHGGDNPHAYVANPTSWLDPFGLS-------------ANQRRWIDHSDGWRL 1373 Query: 123 DMKRLNLGGTDQF 135 + R +GG F Sbjct: 1374 GIDRFPIGGGSDF 1386 >UniRef50_Q39K64 Rhs family protein n=22 Tax=Burkholderia RepID=Q39K64_BURS3 Length = 1560 Score = 142 bits (359), Expect = 7e-33, Method: Composition-based stats. Identities = 43/101 (42%), Positives = 61/101 (60%), Gaps = 6/101 (5%) Query: 4 LMDADGNIAWSGEYDEWGN------QLNEENPHHLHQPYRLPGQQYDKESGLYYNRNRYY 57 L D +G++ W Y WG + ++ R GQQ D E+GL+YNR+RYY Sbjct: 1332 LTDDEGDVVWEASYKAWGEAREVIARASKVAGIVPRSSLRFQGQQVDDETGLHYNRHRYY 1391 Query: 58 DPLQGRYITQDPIGLEGGWSLYAYPLNPVNGIDPLGLSPAD 98 DP GR++++DPIGL GG ++Y Y NPV +DPLGLS ++ Sbjct: 1392 DPRSGRFVSKDPIGLAGGINVYQYAPNPVKWVDPLGLSKSE 1432 >UniRef50_A7FN18 RHS/YD repeat protein n=5 Tax=cellular organisms RepID=A7FN18_YERP3 Length = 1419 Score = 142 bits (358), Expect = 9e-33, Method: Composition-based stats. Identities = 43/113 (38%), Positives = 57/113 (50%), Gaps = 9/113 (7%) Query: 4 LMDADGNIAWSGEYDEWGNQLNEENP---------HHLHQPYRLPGQQYDKESGLYYNRN 54 + +A G + WSG+Y +G + + QP R GQ D E+GL+Y Sbjct: 1194 VTNAQGEMVWSGQYGVFGQVTRQTDAMWRNVSKPLGQFRQPLRYAGQYLDDETGLHYTTY 1253 Query: 55 RYYDPLQGRYITQDPIGLEGGWSLYAYPLNPVNGIDPLGLSPADVALIRRKDQ 107 RYY P GR+IT DPIGL GG +LY Y NP+ IDP GL+ + Q Sbjct: 1254 RYYAPEVGRFITPDPIGLAGGLNLYQYAPNPLGWIDPWGLAGSPTTATHITYQ 1306 >UniRef50_UPI0001C34A7C Rhs family protein n=1 Tax=Neisseria subflava NJ9703 RepID=UPI0001C34A7C Length = 335 Score = 142 bits (358), Expect = 1e-32, Method: Composition-based stats. Identities = 57/169 (33%), Positives = 77/169 (45%), Gaps = 11/169 (6%) Query: 4 LMDADGNIAWSGEYDEWGNQLNEEN-PHHLHQPYRLPGQQYDKESGLYYNRNRYYDPLQG 62 + D DGN+ W G Y WG +E N HQP+RL Q D+E+GL+YN RYY+P G Sbjct: 93 MTDEDGNLLWFGNYTGWGKLKSETNISGTAHQPFRLQNQYCDRETGLHYNFFRYYEPDAG 152 Query: 63 RYITQDPIGLEGGWSLYAYPLNPVNGIDPLGLSPADVALIRRKDQLNHQRAWDILSDTYE 122 R++ QDPIGL GG + Y + N +DPLGL Q DI Sbjct: 153 RFVNQDPIGLFGGSNFYMFAFNISRWLDPLGLKGKSKRSCEEIGQ-------DIDRLINR 205 Query: 123 DMKRLNLGGTDQFFHCMAF---CRVSKLNDAGVSRSAKGLGYEKEIRDY 168 D ++ N GGT H R + + + +K +RD Sbjct: 206 DKRKCNNGGTHGLRHRFNEQINGRNGPGTQSWKTHEQEIKNQQKSLRDR 254 >UniRef50_Q12LF3 YD repeat n=2 Tax=Shewanella denitrificans OS217 RepID=Q12LF3_SHEDO Length = 927 Score = 142 bits (357), Expect = 1e-32, Method: Composition-based stats. Identities = 54/159 (33%), Positives = 70/159 (44%), Gaps = 9/159 (5%) Query: 2 LALMDADGNIAWSGEYDEWGNQLNEENPHHLHQPYRLPGQQYDKESGLYYNRNRYYDPLQ 61 AL D+ G + W Y +G + + + + Q R PGQ YD+ESGL+YN R YDP Sbjct: 693 TALTDSTGTVQWQAHYTPFGQTIVDID--KIKQAIRFPGQYYDEESGLHYNYFRDYDPEL 750 Query: 62 GRYITQDPIGLEGGWSLYAYPL-NPVNGIDPLGLSPADVALIRRKDQLNHQRAWDILSDT 120 GRYI DPIGL GG + Y Y NPV DP GL + Sbjct: 751 GRYIQSDPIGLAGGINTYGYAYQNPVMNTDPTGLWVPQAIGALVNMGY---EGYTQYQSG 807 Query: 121 YEDMKRLNLGGTDQF---FHCMAFCRVSKLNDAGVSRSA 156 +M RL + G F A + + AG + SA Sbjct: 808 NFNMGRLFVAGATGALGGFGSSAIKAIGFGSLAGATNSA 846 >UniRef50_D0KES8 RHS protein n=3 Tax=Pectobacterium wasabiae WPP163 RepID=D0KES8_PECWW Length = 307 Score = 141 bits (356), Expect = 1e-32, Method: Composition-based stats. Identities = 46/102 (45%), Positives = 59/102 (57%), Gaps = 8/102 (7%) Query: 2 LALMDADGNIAWSGEYDEWGNQLNEE--------NPHHLHQPYRLPGQQYDKESGLYYNR 53 L + DA+G + WSG+Y +G + Q R GQ D+E+GL+YN Sbjct: 86 LEMTDAEGAVRWSGDYGSFGAVNGQTQDSEGLRHGKQAESQSLRYAGQYADEETGLHYNL 145 Query: 54 NRYYDPLQGRYITQDPIGLEGGWSLYAYPLNPVNGIDPLGLS 95 RYYDP GR+ TQDPIGL GG +LY Y NP+ IDPLGL+ Sbjct: 146 FRYYDPTVGRFTTQDPIGLAGGINLYQYAPNPLTWIDPLGLA 187 >UniRef50_UPI0001B52C8C protein, rhs-like protein n=4 Tax=Enterobacteriaceae RepID=UPI0001B52C8C Length = 243 Score = 141 bits (356), Expect = 2e-32, Method: Composition-based stats. Identities = 67/94 (71%), Positives = 75/94 (79%) Query: 4 LMDADGNIAWSGEYDEWGNQLNEENPHHLHQPYRLPGQQYDKESGLYYNRNRYYDPLQGR 63 L+ +G W EYDEWGN LNEENPH L Q RLPGQQYD+ESGLYYNR+RYYDPLQGR Sbjct: 1 LVSTEGATEWCAEYDEWGNLLNEENPHQLQQLIRLPGQQYDEESGLYYNRHRYYDPLQGR 60 Query: 64 YITQDPIGLEGGWSLYAYPLNPVNGIDPLGLSPA 97 YITQDPIGL+GGW+ Y YPLNPV ID +GL+ Sbjct: 61 YITQDPIGLKGGWNFYQYPLNPVQYIDSMGLASK 94 >UniRef50_Q2SFR1 Rhs family protein n=9 Tax=cellular organisms RepID=Q2SFR1_HAHCH Length = 138 Score = 140 bits (354), Expect = 2e-32, Method: Composition-based stats. Identities = 45/94 (47%), Positives = 60/94 (63%), Gaps = 1/94 (1%) Query: 4 LMDADGNIAWSGEYDEWGNQLNEENPHHLHQPYRLPGQQYDKESGLYYNRNRYYDPLQGR 63 + +A+G + WS Y +GN L ++ + P R GQ YD+E+GL+YNR+RYYDP R Sbjct: 46 MTNAEGEVVWSARYKAYGN-LALKDVEDVQNPLRFQGQYYDEETGLHYNRHRYYDPSAAR 104 Query: 64 YITQDPIGLEGGWSLYAYPLNPVNGIDPLGLSPA 97 +I QDP GL GG S Y Y LNP+ +DPLGL Sbjct: 105 FINQDPAGLLGGESNYEYVLNPIEWVDPLGLMAK 138 >UniRef50_B7LTT0 Putative uncharacterized protein n=4 Tax=Enterobacteriaceae RepID=B7LTT0_ESCF3 Length = 1543 Score = 140 bits (354), Expect = 3e-32, Method: Composition-based stats. Identities = 49/113 (43%), Positives = 61/113 (53%), Gaps = 4/113 (3%) Query: 4 LMDADGNIAWSGEYDEWGNQLNEENPH----HLHQPYRLPGQQYDKESGLYYNRNRYYDP 59 + D+DG I W WGN EEN H Q R GQ D+E+GL+YN RY+ P Sbjct: 1308 MTDSDGGIVWRARVQLWGNIRFEENRDIYSVHPQQNLRFAGQYLDRETGLHYNTFRYFLP 1367 Query: 60 LQGRYITQDPIGLEGGWSLYAYPLNPVNGIDPLGLSPADVALIRRKDQLNHQR 112 GR+ DPIGL GG +LYAY NP++ IDPLGL + R + N Q Sbjct: 1368 ESGRFSQPDPIGLAGGLNLYAYAPNPLSYIDPLGLCKLRGSDGRYRSAQNIQE 1420 >UniRef50_B2K1J9 YD repeat protein n=23 Tax=Yersinia RepID=B2K1J9_YERPB Length = 1494 Score = 140 bits (353), Expect = 4e-32, Method: Composition-based stats. Identities = 52/167 (31%), Positives = 78/167 (46%), Gaps = 5/167 (2%) Query: 2 LALMDADGNIAWSGEYDEWGNQLNEENPHHLHQPYRLPGQQYDKESGLYYNRNRYYDPLQ 61 + L D +G + W ++ +G QL+ L QP R+ GQ YD ESGL+YNR RYYDP Sbjct: 1281 IRLQDGEGEVVWEAQFTPFG-QLSVTGTSQLRQPLRMQGQYYDTESGLHYNRYRYYDPAC 1339 Query: 62 GRYITQDPIGLEGGWSLYAYPLNPVNGIDPLGLSPADVALIRRKDQLNHQRAWDILSDTY 121 G +I+QDPIGL+GG + Y + +N + +DPLGL + + + Sbjct: 1340 GVFISQDPIGLKGGLNPYQFAVNTLGWVDPLGLHRNSNNSMGYNELYIVHENDQPGAKIL 1399 Query: 122 EDMKRLN----LGGTDQFFHCMAFCRVSKLNDAGVSRSAKGLGYEKE 164 + K + GT++ H V+ + LGY Sbjct: 1400 KIGKAKSEDKMADGTNRRMHNSERAAKKAGYSDAVATPYRDLGYTST 1446 >UniRef50_B7H4M5 Uncharacterized protein ybfO n=3 Tax=Acinetobacter baumannii RepID=B7H4M5_ACIB3 Length = 229 Score = 140 bits (353), Expect = 4e-32, Method: Composition-based stats. Identities = 45/174 (25%), Positives = 79/174 (45%), Gaps = 20/174 (11%) Query: 4 LMDADGNIAWSGEYDEWGNQLNE-------ENPHHLHQPYRLPGQQYDKESGLYYNRNRY 56 + D G + W +Y WG E EN + R GQ +D E+GL+YNR Y Sbjct: 1 MTDHTGVVIWKAQYKAWGECKVEQAKSDFFENSEIISNNIRFQGQYFDGETGLHYNRYCY 60 Query: 57 YDPLQGRYITQDPIGLEGGWSLYAYPLNPVNGIDPLGLSPADVALIRRKDQLN------- 109 Y P GR+I++DPIGL GG ++YAY NPV +D LGL+ +++ + Sbjct: 61 YSPYVGRFISKDPIGLLGGSNIYAYAPNPVGWVDQLGLAKTPTRTLQKNWKNYVGCKHTN 120 Query: 110 ---HQRAWDILSDTYEDMKRLNLGGTDQFFHCMAFCRVSKLNDAGVSRSAKGLG 160 H + ++ ++++ +++ + + + G+ ++ G Sbjct: 121 LDIHHGFPEEYAERFKNIAGIDV---NNPQYYYNLPKEKHTKSPGIHTNSSRTG 171 >UniRef50_D2TGW0 Putative Rhs protein n=2 Tax=Citrobacter RepID=D2TGW0_CITRO Length = 1477 Score = 140 bits (352), Expect = 4e-32, Method: Composition-based stats. Identities = 45/96 (46%), Positives = 55/96 (57%) Query: 1 MLALMDADGNIAWSGEYDEWGNQLNEENPHHLHQPYRLPGQQYDKESGLYYNRNRYYDPL 60 + L+ DG I W GE WG + R PGQ D+ESGLYYNR RYYD Sbjct: 1249 IQELLTEDGTIVWRGEQQLWGREEGRNRDDAPACRLRFPGQYEDEESGLYYNRFRYYDCE 1308 Query: 61 QGRYITQDPIGLEGGWSLYAYPLNPVNGIDPLGLSP 96 G+Y+ DP+GL GG + Y Y NP+ IDPLGL+P Sbjct: 1309 AGQYLCADPVGLAGGLNPYGYVNNPLKYIDPLGLNP 1344 >UniRef50_Q2SGE8 Rhs family protein n=1 Tax=Hahella chejuensis KCTC 2396 RepID=Q2SGE8_HAHCH Length = 1452 Score = 140 bits (352), Expect = 5e-32, Method: Composition-based stats. Identities = 56/180 (31%), Positives = 83/180 (46%), Gaps = 9/180 (5%) Query: 3 ALMDADGNIAWSGEYDEWGNQLNEENPHHLHQPYRLPGQQYDKESGLYYNRNRYYDPLQG 62 A+ D++G WS WG N + P+R PGQ D+E+GLYYNR RYYDP G Sbjct: 1180 AMYDSEGRQVWSANISVWGELRNLKGNRGA-CPFRWPGQYEDEETGLYYNRFRYYDPDSG 1238 Query: 63 RYITQDPIGLEGGWSLYAYPLNPVNGIDPLGLSPADVALIRRKDQLNHQRAWDILSDTYE 122 +YI QDPI ++GG +LY Y + ID LGL + R ++ H +D E Sbjct: 1239 QYIRQDPIRIKGGLNLYKYVSDVTTWIDTLGLQGGSASYGRSGRRIGHA-----DTDGLE 1293 Query: 123 DMKRLN---LGGTDQFFHCMAFCRVSKLNDAGVSRSAKGLGYEKEIRDYGLNLFGMYGRK 179 + LN GG D+ + + L + + + K++ + G +G Sbjct: 1294 KIGELNAVKAGGDDRLPSILYEGDRTSLFHYTSEENLENILRTKKLFNSKGFEHGRHGDG 1353 >UniRef50_C5AIF2 Rhs family protein n=1 Tax=Burkholderia glumae BGR1 RepID=C5AIF2_BURGB Length = 345 Score = 140 bits (352), Expect = 5e-32, Method: Composition-based stats. Identities = 41/100 (41%), Positives = 54/100 (54%), Gaps = 7/100 (7%) Query: 2 LALMDADGNIAWSGEYDEWGNQLN-------EENPHHLHQPYRLPGQQYDKESGLYYNRN 54 L + D G + W Y WG + P R GQQ+D E+G +YNR Sbjct: 119 LMMTDEAGELVWEASYRAWGEAQEVIERASAAAGIDVVRNPLRFQGQQFDDETGQHYNRY 178 Query: 55 RYYDPLQGRYITQDPIGLEGGWSLYAYPLNPVNGIDPLGL 94 RYYDP R++ +DPIGL GG ++Y Y NP++ IDPLGL Sbjct: 179 RYYDPGSSRFVNKDPIGLTGGINIYQYAPNPISWIDPLGL 218 >UniRef50_Q4K3M9 Rhs family protein n=5 Tax=Pseudomonas RepID=Q4K3M9_PSEF5 Length = 1486 Score = 140 bits (352), Expect = 5e-32, Method: Composition-based stats. Identities = 45/104 (43%), Positives = 57/104 (54%), Gaps = 2/104 (1%) Query: 4 LMDADGNIAWSGEYDEWGNQLNE--ENPHHLHQPYRLPGQQYDKESGLYYNRNRYYDPLQ 61 L + DG+ W Y WGN + E E + Q R GQ D+E+GL++N R+YDP Sbjct: 1258 LCEPDGHSVWQARYQVWGNTVEEIREPYYIEEQNLRFQGQYLDRETGLHFNTFRFYDPDI 1317 Query: 62 GRYITQDPIGLEGGWSLYAYPLNPVNGIDPLGLSPADVALIRRK 105 GR+ T DPIGL GG +LY Y NP+ IDPLG RR Sbjct: 1318 GRFTTPDPIGLAGGLNLYQYAPNPIGWIDPLGWICKSAYSGRRG 1361 >UniRef50_A1TV35 YD repeat protein n=2 Tax=Acidovorax RepID=A1TV35_ACIAC Length = 1679 Score = 139 bits (351), Expect = 6e-32, Method: Composition-based stats. Identities = 50/99 (50%), Positives = 66/99 (66%), Gaps = 4/99 (4%) Query: 2 LALMDADGN----IAWSGEYDEWGNQLNEENPHHLHQPYRLPGQQYDKESGLYYNRNRYY 57 +A++DA+G + W+ Y WG E NP+ + QP R GQQ+D E+GL+YNR RYY Sbjct: 1432 IAMVDANGRHSGLLTWAATYHSWGALREEYNPNDISQPIRFQGQQFDAETGLHYNRLRYY 1491 Query: 58 DPLQGRYITQDPIGLEGGWSLYAYPLNPVNGIDPLGLSP 96 DP G+Y+TQDPIGL GG Y YP++P IDP GL+P Sbjct: 1492 DPSLGQYLTQDPIGLLGGNDKYIYPVSPTGWIDPTGLNP 1530 >UniRef50_A7K3Q8 Rhs family protein n=7 Tax=Vibrio RepID=A7K3Q8_VIBSE Length = 1384 Score = 139 bits (351), Expect = 6e-32, Method: Composition-based stats. Identities = 48/144 (33%), Positives = 70/144 (48%), Gaps = 7/144 (4%) Query: 4 LMDADGNIAWSGEYDEWGNQLNEENPHHLHQPYRLPGQQYDKESGLYYNRNRYYDPLQGR 63 L+D +GN+ WS YD G + + P R GQ +D+E+ L+YN RYYDP GR Sbjct: 1043 LIDCEGNVVWSASYDAHG--FAHVHIEKVVNPLRFQGQYFDQETNLHYNLARYYDPKLGR 1100 Query: 64 YITQDPIGLEGGWSLYAYPLNPVNGIDPLGLSPADVALIRRKDQLNHQRA----WDILSD 119 +I QDPI + GG + Y Y +NP+ IDP G + L R + L +A + D Sbjct: 1101 FIQQDPISIAGGINHYQYAINPIQWIDPTGFL-CEEGLKRLQQMLAEYQAQNNVPQEVCD 1159 Query: 120 TYEDMKRLNLGGTDQFFHCMAFCR 143 + + + G D + R Sbjct: 1160 QILEAAKESSVGEDGVRSQVKIRR 1183 >UniRef50_UPI00019F181C rhsC element core protein RshC n=2 Tax=Enterobacteriaceae RepID=UPI00019F181C Length = 260 Score = 139 bits (351), Expect = 6e-32, Method: Composition-based stats. Identities = 63/92 (68%), Positives = 72/92 (78%) Query: 2 LALMDADGNIAWSGEYDEWGNQLNEENPHHLHQPYRLPGQQYDKESGLYYNRNRYYDPLQ 61 L L+ DG W EYDEWGN LNE+NP +L Q RLPGQQYD ES LYYNR+RYY+P Q Sbjct: 18 LTLIRTDGRTGWRAEYDEWGNLLNEDNPQNLQQLIRLPGQQYDDESELYYNRHRYYNPEQ 77 Query: 62 GRYITQDPIGLEGGWSLYAYPLNPVNGIDPLG 93 GRYITQDPIG++GG + YAYPLNPV +DPLG Sbjct: 78 GRYITQDPIGMKGGLNSYAYPLNPVESVDPLG 109 >UniRef50_B3X3P2 RhsH n=1 Tax=Shigella dysenteriae 1012 RepID=B3X3P2_SHIDY Length = 263 Score = 139 bits (351), Expect = 6e-32, Method: Composition-based stats. Identities = 73/137 (53%), Positives = 84/137 (61%), Gaps = 6/137 (4%) Query: 2 LALMDADGNIAWSGEYDEWGNQLNEENPHHLHQPYRLPGQQYDKESGLYYNRNRYYDPLQ 61 L L+ DG W EYDEWGN LNEENP HL Q RLPGQQYD+ESGLYYNR+RYYDPLQ Sbjct: 23 LTLISPDGATEWCAEYDEWGNLLNEENPQHLQQLIRLPGQQYDEESGLYYNRHRYYDPLQ 82 Query: 62 GRYITQDPIGLEGGWSLYAYPL-NPVNGIDPLGLSPADVALIRRK-----DQLNHQRAWD 115 GRYITQDPIGLEGGW+ Y Y +P IDPLGL + R+ L + Sbjct: 83 GRYITQDPIGLEGGWNQYVYASIHPTYSIDPLGLIDKPAPVFNRELNSDPYYLAVNNCYS 142 Query: 116 ILSDTYEDMKRLNLGGT 132 + Y ++ GG Sbjct: 143 YALNRYGNLGSRIFGGG 159 >UniRef50_B2PW78 Putative uncharacterized protein n=1 Tax=Providencia stuartii ATCC 25827 RepID=B2PW78_PROST Length = 330 Score = 139 bits (350), Expect = 8e-32, Method: Composition-based stats. Identities = 41/102 (40%), Positives = 57/102 (55%), Gaps = 9/102 (8%) Query: 2 LALMDADGNIAWSGEYDEWGNQLNEENP---------HHLHQPYRLPGQQYDKESGLYYN 52 L + + GN WSG+Y+ +G + Q R GQ +D E+GL++N Sbjct: 118 LDVTNEQGNTVWSGKYERFGFVRSSPLSFYSDPDRKMESFEQNLRYAGQYFDNETGLHFN 177 Query: 53 RNRYYDPLQGRYITQDPIGLEGGWSLYAYPLNPVNGIDPLGL 94 R+YDP GR+I DPIGL GG +LYAY NP++ +DP GL Sbjct: 178 TFRFYDPQIGRFIMPDPIGLLGGMNLYAYAPNPMSWVDPFGL 219 >UniRef50_C9Y459 Putative uncharacterized protein n=1 Tax=Cronobacter turicensis RepID=C9Y459_CROTZ Length = 1523 Score = 139 bits (349), Expect = 9e-32, Method: Composition-based stats. Identities = 45/132 (34%), Positives = 63/132 (47%), Gaps = 3/132 (2%) Query: 4 LMDADGNIAWSGEYDEWGNQLNEENPHHLHQP--YRLPGQQYDKESGLYYNRNRYYDPLQ 61 + DA+G W G++ WG E + P R GQ D+ESGL+YN RYYDP+ Sbjct: 1299 VTDANGQTVWRGQFSTWGETERELSVPQWQVPQNLRFQGQYLDRESGLHYNLFRYYDPVA 1358 Query: 62 GRYITQDPIGLEGGWSLYAYPLNPVNGIDPLGLSPAD-VALIRRKDQLNHQRAWDILSDT 120 GRY DPIGL GG + Y Y +P+ +DPLGL + + + +D Sbjct: 1359 GRYTQMDPIGLLGGINTYGYVPDPLTWVDPLGLCFSKRLGDFGESHVKSQLERSGNYADV 1418 Query: 121 YEDMKRLNLGGT 132 + + N G Sbjct: 1419 FSVQNKSNNGID 1430 >UniRef50_A1TTQ1 Rhs family protein n=1 Tax=Acidovorax citrulli AAC00-1 RepID=A1TTQ1_ACIAC Length = 357 Score = 139 bits (349), Expect = 1e-31, Method: Composition-based stats. Identities = 54/161 (33%), Positives = 73/161 (45%), Gaps = 36/161 (22%) Query: 2 LALMDADGNIAWSGEYDEWG------------------------------------NQLN 25 L L DA G++AW+ +Y WG NQL+ Sbjct: 96 LELTDAQGHVAWAADYKVWGEAALRKVLKSATGTDALPGPRHKGHGPVLDEHDAYKNQLS 155 Query: 26 EENPHHLHQPYRLPGQQYDKESGLYYNRNRYYDPLQGRYITQDPIGLEGGWSLYAYPLNP 85 L QP+R GQQ+D E+GL+YNR RYYDP GR+ +QDP+GL GG +AY NP Sbjct: 156 HSVSPFLEQPFRFQGQQFDAETGLHYNRFRYYDPSIGRFFSQDPVGLHGGIHGFAYAPNP 215 Query: 86 VNGIDPLGLSPADVALIRRKDQLNHQRAWDILSDTYEDMKR 126 N IDPLGLS + + +N + ++ Sbjct: 216 NNWIDPLGLSNKCYNCLPKCTDINAPHKPKTAEEMAAELSN 256 >UniRef50_A1TSU8 YD repeat protein n=4 Tax=Acidovorax RepID=A1TSU8_ACIAC Length = 1586 Score = 139 bits (349), Expect = 1e-31, Method: Composition-based stats. Identities = 45/122 (36%), Positives = 58/122 (47%), Gaps = 28/122 (22%) Query: 4 LMDADGNIAWSGEYDEWGNQLN----------------------------EENPHHLHQP 35 L D G I W+ Y WG + + + QP Sbjct: 1340 LTDEQGRIVWAASYQVWGQTRALQVMRTGTDDAAVFTQAERPLALAAKGDVQALNFVEQP 1399 Query: 36 YRLPGQQYDKESGLYYNRNRYYDPLQGRYITQDPIGLEGGWSLYAYPLNPVNGIDPLGLS 95 R GQ +D E+GL+YNR RYYDP+ GR++ QDPIGL GG +L+ Y NP+ DPLGL Sbjct: 1400 LRFQGQYFDGETGLHYNRFRYYDPVTGRFVHQDPIGLAGGNNLFFYAPNPLIWNDPLGLK 1459 Query: 96 PA 97 P Sbjct: 1460 PK 1461 >UniRef50_A9AE26 Rhs family protein n=10 Tax=cellular organisms RepID=A9AE26_BURM1 Length = 1547 Score = 138 bits (347), Expect = 2e-31, Method: Composition-based stats. Identities = 59/218 (27%), Positives = 97/218 (44%), Gaps = 17/218 (7%) Query: 4 LMDADGNIAWSGEYDEWGN------QLNEENPHHLHQPYRLPGQQYDKESGLYYNRNRYY 57 L D DG++ W Y WG + ++ R GQQ D+E+GL+YNR RYY Sbjct: 1319 LTDDDGDVVWEASYKAWGEAREVIARASKAAGIVARNSLRFQGQQEDEETGLHYNRYRYY 1378 Query: 58 DPLQGRYITQDPIGLEGGWSLYAYPLNPVNGIDPLGLSPADVALIRRKDQLNHQRAWDIL 117 DP GR++++DP+G+ GG ++Y Y N V +DPLGL + + D L Sbjct: 1379 DPNSGRFVSKDPVGMVGGINVYQYAPNAVAWVDPLGLRKRIGCPGKFHSFHDFDLPQDKL 1438 Query: 118 SDTY-EDMKRLNLGGTDQFFHCMAFCRVSKLNDAGVSRSAKGLGYEKEIRDYGLNLFGM- 175 + + N ++ AF R + + L + K+ +D G + GM Sbjct: 1439 FASDGVQFRLANKALIERMNTDEAFRRNLLSRNPAL------LDWSKKAKDLGSSPPGMT 1492 Query: 176 YGRKVKLSHSEMIEDNKKDLAVNDHGLTCPSTTDCSDR 213 + ++ ++ ++ D N HG+ P T D+ Sbjct: 1493 WHHNDEVGRLNLV--DRSDHGDN-HGIYHPDGTGGRDK 1527 >UniRef50_C5CXA7 YD repeat protein n=1 Tax=Variovorax paradoxus S110 RepID=C5CXA7_VARPS Length = 1434 Score = 137 bits (345), Expect = 3e-31, Method: Composition-based stats. Identities = 57/133 (42%), Positives = 75/133 (56%), Gaps = 3/133 (2%) Query: 2 LALMDADGNIAWSGEYDEWGNQLNEENPHHLHQPYRLPGQQYDKESGLYYNRNRYYDPLQ 61 L + DG I W+ YD WG + ++ L QP R GQQ D E+GL+YNR+RY+DP+ Sbjct: 1223 LRMTTRDGQIVWAVRYDVWGG-IARKDCELLAQPIRCQGQQEDAETGLFYNRHRYFDPII 1281 Query: 62 GRYITQDPIGLEGGWSLYAY-PLNPVNGIDPLGLSPADVALIRRKD-QLNHQRAWDILSD 119 G Y++ DPIGL GG + YAY NP IDPLGL+ + +R D QL R I Sbjct: 1282 GAYVSADPIGLRGGVNPYAYGCSNPYFWIDPLGLAASCERHLRSIDEQLQQGRRARIDVA 1341 Query: 120 TYEDMKRLNLGGT 132 + +D L L T Sbjct: 1342 SKDDAIELLLAYT 1354 >UniRef50_D1YPL0 RHS repeat-associated core domain protein n=1 Tax=Veillonella parvula ATCC 17745 RepID=D1YPL0_9FIRM Length = 216 Score = 137 bits (345), Expect = 3e-31, Method: Composition-based stats. Identities = 46/124 (37%), Positives = 66/124 (53%), Gaps = 1/124 (0%) Query: 4 LMDADGNIAWSGEYDEWGNQLNEEN-PHHLHQPYRLPGQQYDKESGLYYNRNRYYDPLQG 62 + D DGN+ W G Y WG E +QP+RL Q D+E+GL+YN RYY+P G Sbjct: 1 MTDKDGNLLWFGNYTGWGRLKEETKVTDSAYQPFRLQNQYCDRETGLHYNFFRYYEPDAG 60 Query: 63 RYITQDPIGLEGGWSLYAYPLNPVNGIDPLGLSPADVALIRRKDQLNHQRAWDILSDTYE 122 R++ QDPIGLEGG ++Y + N + IDPLGL + A + + I + + Sbjct: 61 RFVNQDPIGLEGGDNIYLFSPNIQSWIDPLGLLSWNTARKQFWKAEAKKERDRIAKEKAK 120 Query: 123 DMKR 126 + Sbjct: 121 NPGS 124 >UniRef50_Q399U9 Rhs family protein n=3 Tax=Proteobacteria RepID=Q399U9_BURS3 Length = 1446 Score = 137 bits (345), Expect = 3e-31, Method: Composition-based stats. Identities = 53/155 (34%), Positives = 73/155 (47%), Gaps = 24/155 (15%) Query: 4 LMDADGNIAWSGEYDEWGNQLNEE---------------------NPHHLHQP--YRLPG 40 L +A+G + W Y WGN + EE P H+ +P R G Sbjct: 1201 LTNAEGELIWQARYKVWGNAVQEEWIARTSQQSVPEWGEVQLASATPAHVPRPQNLRFQG 1260 Query: 41 QQYDKESGLYYNRNRYYDPLQGRYITQDPIGLEGGWSLYAYPLNPVNGIDPLGLS-PADV 99 Q D+E+GL+YN R+YDP GR+I DPIGL GG +LYAY +P+ IDP G + Sbjct: 1261 QYLDRETGLHYNTFRFYDPDIGRFINPDPIGLSGGHNLYAYAESPLIWIDPWGWCRRGNQ 1320 Query: 100 ALIRRKDQLNHQRAWDILSDTYEDMKRLNLGGTDQ 134 A D + Q D +D + R L G ++ Sbjct: 1321 ATKNHMDGVRDQYLADNPTDRHVAGGRNALTGGER 1355 >UniRef50_C9NF93 YD repeat protein n=1 Tax=Streptomyces flavogriseus ATCC 33331 RepID=C9NF93_9ACTO Length = 1536 Score = 137 bits (344), Expect = 4e-31, Method: Composition-based stats. Identities = 47/118 (39%), Positives = 65/118 (55%), Gaps = 3/118 (2%) Query: 4 LMDADGNIAWSGEYDEWGNQLNEENPHHLHQPYRLPGQQYDKESGLYYNRNRYYDPLQGR 63 L+D G I+W WG ++ + P R PGQ +D+ES L+YN RYYDP R Sbjct: 1339 LIDESGFISWRVRRSVWGTTEWAKDSSA-YTPLRFPGQYFDQESLLHYNYLRYYDPDVSR 1397 Query: 64 YITQDPIGLEGGWSLYAYPLNPVNGIDPLGLSPADVALIRRKDQLNHQRAWDILSDTY 121 YI+ DPIGLEGG + + Y NP IDPLGLS + R + N + W +++ + Sbjct: 1398 YISPDPIGLEGGPNPHWYGPNPYTWIDPLGLSLC--RVKPRLEDGNTKEGWQHINERH 1453 >UniRef50_B4SV70 Rhs-family protein n=42 Tax=Enterobacteriaceae RepID=B4SV70_SALNS Length = 1359 Score = 137 bits (344), Expect = 4e-31, Method: Composition-based stats. Identities = 46/94 (48%), Positives = 62/94 (65%), Gaps = 2/94 (2%) Query: 4 LMDADGNIAWSGEYDEWGNQLNE--ENPHHLHQPYRLPGQQYDKESGLYYNRNRYYDPLQ 61 + ADG + W+G +G + + + HQP RLPGQ +D E+GL+YN RYY P Sbjct: 1159 VTAADGTLVWAGYIRGFGENAADISNSGAYFHQPLRLPGQYFDDETGLHYNLFRYYAPEC 1218 Query: 62 GRYITQDPIGLEGGWSLYAYPLNPVNGIDPLGLS 95 GR+++QDPIGL GG +LYAY NP+ IDPLGL+ Sbjct: 1219 GRFVSQDPIGLRGGLNLYAYAPNPIRWIDPLGLA 1252 >UniRef50_D0KG38 Rhs family protein-like protein n=1 Tax=Pectobacterium wasabiae WPP163 RepID=D0KG38_PECWW Length = 230 Score = 137 bits (344), Expect = 4e-31, Method: Composition-based stats. Identities = 45/109 (41%), Positives = 62/109 (56%), Gaps = 5/109 (4%) Query: 4 LMDADGNIAWSGEYDEWGNQ-----LNEENPHHLHQPYRLPGQQYDKESGLYYNRNRYYD 58 + DA+ + WSG+Y +G +E H Q R GQ D+E+GL+YN RYYD Sbjct: 1 MTDAESAVRWSGDYGSFGAVNGQTQDSEGLRHGKSQSLRYAGQYADEETGLHYNLFRYYD 60 Query: 59 PLQGRYITQDPIGLEGGWSLYAYPLNPVNGIDPLGLSPADVALIRRKDQ 107 P GR+ TQ IGL GG +LY Y N + +DPLGL+P ++ KD+ Sbjct: 61 PTVGRFTTQGLIGLAGGLNLYQYAPNSLGWVDPLGLTPGEIIRYMGKDE 109 >UniRef50_C5ALM7 YD repeat protein n=19 Tax=Proteobacteria RepID=C5ALM7_BURGB Length = 1425 Score = 136 bits (342), Expect = 6e-31, Method: Composition-based stats. Identities = 40/94 (42%), Positives = 54/94 (57%), Gaps = 1/94 (1%) Query: 5 MDADGNIAWSGEYDEWGNQL-NEENPHHLHQPYRLPGQQYDKESGLYYNRNRYYDPLQGR 63 + G+I W+G+Y WG N+ P + QP R GQ D + L+YN R+YDP GR Sbjct: 1196 TNEAGDIVWAGQYSAWGKVAPNQHAPARIDQPLRYAGQYADDSTELHYNTFRFYDPDVGR 1255 Query: 64 YITQDPIGLEGGWSLYAYPLNPVNGIDPLGLSPA 97 +I QDPIGL GG +LY Y N + D GL+ + Sbjct: 1256 FINQDPIGLMGGLNLYQYAPNSIAWTDWWGLAGS 1289 >UniRef50_C1M4X0 Predicted protein n=1 Tax=Citrobacter sp. 30_2 RepID=C1M4X0_9ENTR Length = 1494 Score = 136 bits (342), Expect = 6e-31, Method: Composition-based stats. Identities = 46/118 (38%), Positives = 60/118 (50%), Gaps = 11/118 (9%) Query: 3 ALMDADGNIAWSGEYDEWGNQLNEEN-----------PHHLHQPYRLPGQQYDKESGLYY 51 AL D DG + W + + WG +E + R GQ D+E+GL+Y Sbjct: 1270 ALTDEDGKLHWRQDVETWGETRSEYADEEGGRWRKIWGGAPEENLRFAGQYLDRETGLHY 1329 Query: 52 NRNRYYDPLQGRYITQDPIGLEGGWSLYAYPLNPVNGIDPLGLSPADVALIRRKDQLN 109 N RYY P GR+IT DPIGL GG +LY+Y NP++ IDPLGL + D N Sbjct: 1330 NTFRYYAPDMGRFITPDPIGLAGGINLYSYAPNPLSWIDPLGLLKCGLTGNEVGDASN 1387 >UniRef50_B2PVY9 Putative uncharacterized protein n=12 Tax=Enterobacteriaceae RepID=B2PVY9_PROST Length = 707 Score = 135 bits (341), Expect = 8e-31, Method: Composition-based stats. Identities = 43/134 (32%), Positives = 61/134 (45%), Gaps = 9/134 (6%) Query: 2 LALMDADGNIAWSGEYDEWGNQLN---------EENPHHLHQPYRLPGQQYDKESGLYYN 52 L + + GN WSG+Y+ +G + E Q R GQ +D E+GL++N Sbjct: 488 LDVTNEQGNTVWSGKYERFGFVRSSPLSFYSDPERKMESFEQNLRYAGQYFDNETGLHFN 547 Query: 53 RNRYYDPLQGRYITQDPIGLEGGWSLYAYPLNPVNGIDPLGLSPADVALIRRKDQLNHQR 112 R+YDP GR+I DPIGL GG +LY Y NP+ IDP G V D Sbjct: 548 TFRFYDPQIGRFIMPDPIGLLGGINLYQYAPNPLGWIDPWGWEIVRVYHYTSSDGYKGIM 607 Query: 113 AWDILSDTYEDMKR 126 ++ + + Sbjct: 608 GTGSINMSDPGARG 621 >UniRef50_A1TTP6 YD repeat protein n=2 Tax=Acidovorax citrulli AAC00-1 RepID=A1TTP6_ACIAC Length = 1554 Score = 135 bits (341), Expect = 8e-31, Method: Composition-based stats. Identities = 57/215 (26%), Positives = 84/215 (39%), Gaps = 41/215 (19%) Query: 2 LALMDADGNIAWSGEYDEWGNQLNEENPHH------------------------------ 31 L L DA G IAW+ +Y WG P Sbjct: 1296 LELTDAQGYIAWAADYKVWGEATLRAVPRTATGTDGVSGERRRGHGPVMDVHEGGGEKAR 1355 Query: 32 ------LHQPYRLPGQQYDKESGLYYNRNRYYDPLQGRYITQDPIGLEGGWSLYAYPLNP 85 + QP+R GQQ+D+E+GL+YNR RYY+P GR+++QDPIGL GG + + Y +P Sbjct: 1356 PTPPAIIEQPFRFQGQQFDEETGLHYNRFRYYEPSVGRFVSQDPIGLLGGVNSFTYAPSP 1415 Query: 86 VNGIDPLGLSPADVALIRRKDQLNHQRAWDILSDTYEDMKRLNLGGTDQFFHCMAFCRVS 145 N +DP GLS A D Y N + + Sbjct: 1416 NNWMDPFGLSCCPCPTGTATIHHYEGTA-DNPFGHYSIEVTANGTSLHTHQGVFQDGKQT 1474 Query: 146 KL----NDAGVSRSAKGLGYEKEIRDYGLNLFGMY 176 + ++G +++ + K +DY + G Y Sbjct: 1475 AILSNRGNSGTTQATVAIPDAKAAQDYQRKMMGQY 1509 >UniRef50_C6M9F5 RHS family protein n=2 Tax=Bacteria RepID=C6M9F5_NEISI Length = 448 Score = 135 bits (341), Expect = 9e-31, Method: Composition-based stats. Identities = 43/93 (46%), Positives = 56/93 (60%), Gaps = 1/93 (1%) Query: 4 LMDADGNIAWSGEYDEWGNQLNEEN-PHHLHQPYRLPGQQYDKESGLYYNRNRYYDPLQG 62 + D DGN+ W G Y WG E +QP+RL Q D+E+GL+YN RYY+P G Sbjct: 225 MTDKDGNLLWFGNYTGWGRLKEETKVTDSAYQPFRLQNQYADRETGLHYNFFRYYEPDAG 284 Query: 63 RYITQDPIGLEGGWSLYAYPLNPVNGIDPLGLS 95 R++ QDPIGLEGG +LY + N N +D GL Sbjct: 285 RFVNQDPIGLEGGENLYKFAPNAQNWVDIFGLW 317 >UniRef50_C9Y441 Putative uncharacterized protein n=1 Tax=Cronobacter turicensis RepID=C9Y441_CROTZ Length = 1394 Score = 135 bits (341), Expect = 1e-30, Method: Composition-based stats. Identities = 40/94 (42%), Positives = 54/94 (57%), Gaps = 2/94 (2%) Query: 4 LMDADGNIAWSGEYDEWGNQLNEENP--HHLHQPYRLPGQQYDKESGLYYNRNRYYDPLQ 61 L D +G + W G WG +E P +Q R+ GQ D+E+GL+YN RYYDP Sbjct: 1156 LTDVEGRVRWEGRNSAWGKLAHESTPLPTGYNQNLRMQGQYLDRETGLHYNLFRYYDPDC 1215 Query: 62 GRYITQDPIGLEGGWSLYAYPLNPVNGIDPLGLS 95 GR+ DPIGL GG +LY + N + +DP GL+ Sbjct: 1216 GRFTQHDPIGLAGGINLYQFAPNALGWVDPWGLN 1249 >UniRef50_C9Y462 Putative uncharacterized protein n=1 Tax=Cronobacter turicensis RepID=C9Y462_CROTZ Length = 252 Score = 135 bits (341), Expect = 1e-30, Method: Composition-based stats. Identities = 44/104 (42%), Positives = 59/104 (56%), Gaps = 2/104 (1%) Query: 4 LMDADGNIAWSGEYDEWGNQLNEENPHHLHQP--YRLPGQQYDKESGLYYNRNRYYDPLQ 61 + DADG W G++ WG E + P R GQ D+ESGL+YN RYYDP+ Sbjct: 27 VTDADGQTVWRGQFSTWGETERELSVPQWQVPQNLRFQGQYLDRESGLHYNLFRYYDPVA 86 Query: 62 GRYITQDPIGLEGGWSLYAYPLNPVNGIDPLGLSPADVALIRRK 105 GRY DPIGL GG + YAY +P+ +DPLGL+ + + + Sbjct: 87 GRYTQMDPIGLAGGINTYAYVGDPLTWVDPLGLAVDPLVKLEER 130 >UniRef50_C7MXD3 Rhs family protein n=1 Tax=Saccharomonospora viridis DSM 43017 RepID=C7MXD3_SACVD Length = 1485 Score = 135 bits (340), Expect = 1e-30, Method: Composition-based stats. Identities = 44/119 (36%), Positives = 65/119 (54%), Gaps = 1/119 (0%) Query: 4 LMDADGNIAWSGEYDEWGNQLNEENPHHLHQPYRLPGQQYDKESGLYYNRNRYYDPLQGR 63 L+D GN+AW WG + + + P+R PGQ D E+GL+YN +RYYDP GR Sbjct: 1278 LLDGSGNLAWRNRTTLWGKTVIKHHGSA-STPWRFPGQYSDPETGLHYNYHRYYDPDTGR 1336 Query: 64 YITQDPIGLEGGWSLYAYPLNPVNGIDPLGLSPADVALIRRKDQLNHQRAWDILSDTYE 122 Y++ DP+GL + YAY +NP+ +DPLGL + R +D +++T Sbjct: 1337 YVSCDPLGLRPSPNHYAYVVNPLRWLDPLGLMSCGDDDAVTLYRNVDGREFDAIAETGR 1395 >UniRef50_C0EPY5 Putative uncharacterized protein n=1 Tax=Neisseria flavescens NRL30031/H210 RepID=C0EPY5_NEIFL Length = 193 Score = 135 bits (340), Expect = 1e-30, Method: Composition-based stats. Identities = 52/166 (31%), Positives = 78/166 (46%), Gaps = 10/166 (6%) Query: 4 LMDADGNIAWSGEYDEWGNQLNEENPH-HLHQPYRLPGQQYDKESGLYYNRNRYYDPLQG 62 + D GN+ W GEY WG +E + HQP+RL Q YD E+GL+YN RYYD G Sbjct: 1 MTDIHGNLLWYGEYTAWGRLKKDERVYKDAHQPFRLQNQYYDSETGLHYNYFRYYDSETG 60 Query: 63 RYITQDPIGLEGGWSLYAYPLNPVNGIDPLGLSPADVALIRR---------KDQLNHQRA 113 R+++QD IGL GG + Y + N + IDPLGL + R+ + + Sbjct: 61 RFVSQDVIGLVGGENFYQFSPNTQSWIDPLGLKELYYLVARKDGFYPVMEWGKREPVGKV 120 Query: 114 WDILSDTYEDMKRLNLGGTDQFFHCMAFCRVSKLNDAGVSRSAKGL 159 W D ++ + N+ Q + + + L V + K + Sbjct: 121 WLKKGDIWKIGETKNIVNGVQRRYSQQWLDRNNLKYIRVMKGPKKV 166 >UniRef50_A0LJM9 YD repeat protein n=3 Tax=Syntrophobacter fumaroxidans MPOB RepID=A0LJM9_SYNFM Length = 1433 Score = 135 bits (339), Expect = 2e-30, Method: Composition-based stats. Identities = 41/93 (44%), Positives = 52/93 (55%), Gaps = 2/93 (2%) Query: 4 LMDADGNIAWSGEYDEWGNQLNEENPHHLHQPYRLPGQQYDKESGLYYNRNRYYDPLQGR 63 + D+ W YD +G + +R PGQ YD E+GL+YN +RYYDP GR Sbjct: 1213 MTDSTNTAVWEAAYDAFGEATIHPAS-TVVNNFRFPGQYYDAETGLHYNWHRYYDPKTGR 1271 Query: 64 YITQDPIGLEGGWSLYAYPLN-PVNGIDPLGLS 95 Y+T DPIGL GG + Y Y N P+N ID GL Sbjct: 1272 YMTPDPIGLAGGINPYTYAENDPINFIDLYGLW 1304 >UniRef50_D1SWH5 RHS protein (Fragment) n=1 Tax=Acidovorax avenae subsp. avenae ATCC 19860 RepID=D1SWH5_9BURK Length = 280 Score = 135 bits (339), Expect = 2e-30, Method: Composition-based stats. Identities = 43/115 (37%), Positives = 57/115 (49%), Gaps = 19/115 (16%) Query: 4 LMDADGNIAWSGEYDEWGNQLNEE-------------------NPHHLHQPYRLPGQQYD 44 + D G + W + WG+ L E + L Q RL GQ D Sbjct: 29 VTDEAGEVRWRASWRTWGSALEERWEAVRIDGSAIPAVQQRHRDEDTLEQNLRLQGQYLD 88 Query: 45 KESGLYYNRNRYYDPLQGRYITQDPIGLEGGWSLYAYPLNPVNGIDPLGLSPADV 99 +E+GL+YN RYYDP GR+I+ DPIGL GG +L Y +NP+ IDP GL + Sbjct: 89 RETGLHYNTFRYYDPDMGRFISPDPIGLAGGLNLQRYAINPLAWIDPWGLCGEKI 143 >UniRef50_UPI00016A9A82 Rhs family protein n=2 Tax=Burkholderia oklahomensis RepID=UPI00016A9A82 Length = 1489 Score = 134 bits (338), Expect = 2e-30, Method: Composition-based stats. Identities = 59/192 (30%), Positives = 76/192 (39%), Gaps = 18/192 (9%) Query: 4 LMDADGNIAWSGEYDEWGN-QLNEENPHHLHQP-----------YRLPGQQYDKESGLYY 51 L D G + W Y WGN L E P P RL GQ D E+G Y Sbjct: 1264 LTDGGGRVVWRTRYRAWGNTVLQEYAPEFQANPAGDVMQPLPQALRLQGQYEDLETGFCY 1323 Query: 52 NRNRYYDPLQGRYITQDPIGLEGGWSLYAYPLNPVNGIDPLGLSPADV-ALIRRKDQLNH 110 + RYYDP GR+IT DPIGL GG + Y Y NP+ IDP G + A L H Sbjct: 1324 STFRYYDPDVGRFITPDPIGLAGGLNQYQYAPNPLTWIDPWGWVETPLDAPGYSTYGLYH 1383 Query: 111 QRAWDILSDTYEDMKRLNLGGTDQFFHCMAFCRVSKLNDAGV----SRSAKGLGYEKEIR 166 A + + + LN D A +++ G ++ GYE+ R Sbjct: 1384 PGASEPYYVGH-TGQSLNDRLADHIDTKRAVRGQTEVRPLGGPEGTLTYSQAKGYEQAYR 1442 Query: 167 DYGLNLFGMYGR 178 + G G Sbjct: 1443 EKYKTKTGFPGN 1454 >UniRef50_C6M5B1 Rhs-related protein n=7 Tax=Proteobacteria RepID=C6M5B1_NEISI Length = 1934 Score = 134 bits (338), Expect = 2e-30, Method: Composition-based stats. Identities = 46/93 (49%), Positives = 58/93 (62%), Gaps = 1/93 (1%) Query: 4 LMDADGNIAWSGEYDEWGNQLNEEN-PHHLHQPYRLPGQQYDKESGLYYNRNRYYDPLQG 62 + D +GNI WSG+Y WG E ++QP+RL Q YD+E+GL+YN RYYDP G Sbjct: 1734 MTDEEGNIVWSGDYSGWGKLTQEGRLKLDVYQPFRLQNQYYDEETGLHYNFFRYYDPEIG 1793 Query: 63 RYITQDPIGLEGGWSLYAYPLNPVNGIDPLGLS 95 R+ QDPI L GG SLYA N +D LGL+ Sbjct: 1794 RFTQQDPIKLLGGESLYALAPNVFVWLDTLGLA 1826 >UniRef50_A1AK54 YD repeat protein n=1 Tax=Pelobacter propionicus DSM 2379 RepID=A1AK54_PELPD Length = 1352 Score = 134 bits (337), Expect = 3e-30, Method: Composition-based stats. Identities = 44/116 (37%), Positives = 65/116 (56%), Gaps = 4/116 (3%) Query: 1 MLALMDADGNIAWSGEYDEWGNQLNEENPHHLHQPYRLPGQQYDKESGLYYNRNRYYDPL 60 ++AL D G + YD +GN ++ ++ QP+ G+++D+E+GLYY R RYYDP Sbjct: 1146 IVALTDRHGTVVQEYNYDSFGNP--DQRGENIDQPFSYTGREWDRETGLYYYRARYYDPK 1203 Query: 61 QGRYITQDPIGLEGG-WSLYAYP-LNPVNGIDPLGLSPADVALIRRKDQLNHQRAW 114 GR+I +DPI GG +LYAY NP+N +DP GL + + N W Sbjct: 1204 IGRFIQKDPISFAGGDVNLYAYVLNNPINRLDPFGLWNSKTFPTNISNYANSALYW 1259 >UniRef50_Q3YV37 Putative uncharacterized protein n=1 Tax=Shigella sonnei Ss046 RepID=Q3YV37_SHISS Length = 303 Score = 134 bits (336), Expect = 3e-30, Method: Composition-based stats. Identities = 66/81 (81%), Positives = 72/81 (88%) Query: 14 SGEYDEWGNQLNEENPHHLHQPYRLPGQQYDKESGLYYNRNRYYDPLQGRYITQDPIGLE 73 EYDEWGN LNEENPH L Q RLPGQQYD+ESGLYYNR+RYYDPLQGRYITQDPIGL+ Sbjct: 76 RREYDEWGNLLNEENPHQLQQLIRLPGQQYDEESGLYYNRHRYYDPLQGRYITQDPIGLK 135 Query: 74 GGWSLYAYPLNPVNGIDPLGL 94 GGW+LY YPL+PVN +DPLGL Sbjct: 136 GGWNLYTYPLSPVNSMDPLGL 156 >UniRef50_B9B9U8 YD repeat protein n=1 Tax=Burkholderia multivorans CGD1 RepID=B9B9U8_9BURK Length = 345 Score = 133 bits (335), Expect = 4e-30, Method: Composition-based stats. Identities = 42/98 (42%), Positives = 55/98 (56%), Gaps = 6/98 (6%) Query: 4 LMDADGNIAWSGEYDEWGN------QLNEENPHHLHQPYRLPGQQYDKESGLYYNRNRYY 57 L D DG++ W Y WG + ++ R GQQ D+E+GL+YNR RYY Sbjct: 185 LTDDDGDVVWEASYKAWGEAREVIARASKAAGIVARNSLRFQGQQEDEETGLHYNRYRYY 244 Query: 58 DPLQGRYITQDPIGLEGGWSLYAYPLNPVNGIDPLGLS 95 DP GR+ + DPI L GG ++Y Y LN V+ IDP G S Sbjct: 245 DPTSGRFTSADPIRLAGGSNVYQYALNLVSWIDPFGFS 282 >UniRef50_A1TJG7 YD repeat protein n=4 Tax=Proteobacteria RepID=A1TJG7_ACIAC Length = 1604 Score = 133 bits (335), Expect = 5e-30, Method: Composition-based stats. Identities = 45/134 (33%), Positives = 66/134 (49%), Gaps = 26/134 (19%) Query: 4 LMDADGNIAWSGEYDEWGNQLNEE------------------------NPHHLHQPYRLP 39 + D +G++ W +Y WGN + EE + Q R+ Sbjct: 1374 MSDRNGHLVWRAQYRLWGNAVAEEWQAFDATGRPVNAPMAETGIRAQVSASPAPQNLRMQ 1433 Query: 40 GQQYDKESGLYYNRNRYYDPLQGRYITQDPIGLEGGWSLYAYPLNPVNGIDPLGLS--PA 97 GQ D+E+GL+YN RYYDP G + T DPIGL GG +L+ Y NP++ IDP G + P Sbjct: 1434 GQYLDRETGLHYNTFRYYDPDLGAFTTPDPIGLAGGLNLHGYAANPLSWIDPWGWACIPN 1493 Query: 98 DVALIRRKDQLNHQ 111 VA R+ ++ + Sbjct: 1494 KVAGTAREARVGAK 1507 >UniRef50_Q1K295 YD repeat n=4 Tax=Desulfuromonas acetoxidans DSM 684 RepID=Q1K295_DESAC Length = 1468 Score = 133 bits (334), Expect = 6e-30, Method: Composition-based stats. Identities = 46/97 (47%), Positives = 58/97 (59%), Gaps = 5/97 (5%) Query: 2 LALMDADGNIAWSGEYDEWGNQL----NEENPHHLHQPYRLPGQQYDKESGLYYNRNRYY 57 + L DA G WS +Y +G + + + R PGQ +D ESGL+YN +RYY Sbjct: 1284 ILLTDATGTAVWSAQYAPFGQATINNDVDGDGTEVVCNLRFPGQYFDAESGLHYNWHRYY 1343 Query: 58 DPLQGRYITQDPIGLEGGWSLYAYPL-NPVNGIDPLG 93 +P GRYIT DPIGL GG +LYAY NPVN +DP G Sbjct: 1344 EPRSGRYITLDPIGLAGGINLYAYANRNPVNVVDPTG 1380 >UniRef50_C6AKX3 Rhs family protein n=4 Tax=Aggregatibacter aphrophilus NJ8700 RepID=C6AKX3_AGGAN Length = 1917 Score = 132 bits (333), Expect = 7e-30, Method: Composition-based stats. Identities = 47/112 (41%), Positives = 64/112 (57%), Gaps = 3/112 (2%) Query: 4 LMDADGNIAWSGEYDEWGNQLNE---ENPHHLHQPYRLPGQQYDKESGLYYNRNRYYDPL 60 + D+DG + W G YD WG+ + + E HQP+RL Q +D+E+GL+YN RYY+P+ Sbjct: 1689 MTDSDGKLIWKGRYDAWGSLIRDSYRETASDSHQPFRLQNQYFDEETGLHYNFFRYYEPV 1748 Query: 61 QGRYITQDPIGLEGGWSLYAYPLNPVNGIDPLGLSPADVALIRRKDQLNHQR 112 GR+ITQDPI L GG +LY + N DPLGL + L L Sbjct: 1749 LGRFITQDPIKLAGGNNLYRFEGTVQNQTDPLGLFAPALLLAPELIALGKAA 1800 >UniRef50_C8Q7Z5 YD repeat protein n=8 Tax=Enterobacteriaceae RepID=C8Q7Z5_9ENTR Length = 1507 Score = 132 bits (333), Expect = 8e-30, Method: Composition-based stats. Identities = 48/153 (31%), Positives = 66/153 (43%), Gaps = 4/153 (2%) Query: 4 LMDADGNIAWSGEYDEWGNQLNEENPHHLHQP--YRLPGQQYDKESGLYYNRNRYYDPLQ 61 + D G+ W G + WG E P R GQ D+E+GL+YN RYYDP Sbjct: 1278 VTDIRGDTVWQGAFAAWGRTTRESTGVDWEVPQNLRFQGQYLDRETGLHYNTFRYYDPCG 1337 Query: 62 GRYITQDPIGLEGGWSLYAYPLNPVNGIDPLGLSPADVALIRRKDQLNHQRAWDILSDTY 121 GRY DPIGL GG +LY Y + + GIDPLGL+ + + H + + + Sbjct: 1338 GRYTQLDPIGLMGGLNLYQYAPDVLTGIDPLGLA-TRLNNGQYNVFQEHTINAEHIYSSD 1396 Query: 122 -EDMKRLNLGGTDQFFHCMAFCRVSKLNDAGVS 153 R N ++ F R +S Sbjct: 1397 GVQFNRANTEFINRMNTDATFRRDMLGRYPELS 1429 >UniRef50_UPI00017448E8 YD repeat protein n=2 Tax=Verrucomicrobium spinosum DSM 4136 RepID=UPI00017448E8 Length = 675 Score = 132 bits (332), Expect = 9e-30, Method: Composition-based stats. Identities = 56/198 (28%), Positives = 81/198 (40%), Gaps = 40/198 (20%) Query: 1 MLALMDA-DGNIAWSGEYDEWGNQLNEENPHHLHQPYRLPGQQYDKESGLYYNRNRYYDP 59 + AL++ +G +Y +G L + P+R + YD E+GL Y RYY Sbjct: 249 ITALINQVNGETVAKFDYTPYGE-LKVSSGDVNACPFRYQSKYYDAETGLSYFGFRYYSA 307 Query: 60 LQGRYITQDPIGLEGGWSLYAYPLN-PVNGIDPL------------------GLSPADVA 100 GR+I++DP+G GG++LY Y N PVN D L GL A V Sbjct: 308 KLGRWISRDPLGEAGGFNLYGYCGNDPVNRWDYLGMDSGIWDKVVATDDFMDGLLWATVQ 367 Query: 101 LIRRKDQLNHQRA-WDILSDTYEDMKRLNLGGTDQFFHCMAFCRVSKLNDAGVSRSAKGL 159 R D Q + +YE N+ G + HC + R+A+ L Sbjct: 368 FWTRGDMFGGQMPLYQSPKSSYEYGYNTNMTGYNVASHC-------------IGRTAREL 414 Query: 160 G-----YEKEIRDYGLNL 172 G + E+RD G L Sbjct: 415 GMLVSMIDTELRDRGFCL 432 >UniRef50_Q9L0E3 Putative Rhs protein n=2 Tax=Streptomyces RepID=Q9L0E3_STRCO Length = 927 Score = 132 bits (332), Expect = 1e-29, Method: Composition-based stats. Identities = 47/131 (35%), Positives = 66/131 (50%), Gaps = 10/131 (7%) Query: 4 LMDADGNIAWSGEYDEWGNQLNEENPHHLHQPYRLPGQQYDKESGLYYNRNRYYDPLQGR 63 L+ DG+I W WG ++ P R PGQ +D E+GL+YN RYYDP GR Sbjct: 730 LVSPDGDIGWRLRTTLWGLPVDGSGGST-DCPLRFPGQYHDPETGLHYNYFRYYDPGLGR 788 Query: 64 YITQDPIGLEGGWSLYAYPLNPVNGIDPLGLSPADVALIRRKDQLNHQRAWDILSDTYED 123 Y + DP+GL GG + Y NP IDP GL AL R++ +L L + Sbjct: 789 YCSLDPLGLAGGPNPAWYTPNPTAWIDPFGL-----ALCRKRPRLETG----DLKKGWLH 839 Query: 124 MKRLNLGGTDQ 134 ++ ++ GT+ Sbjct: 840 IESRHITGTNP 850 >UniRef50_B4V6T6 Rhs protein n=2 Tax=Streptomyces RepID=B4V6T6_9ACTO Length = 1263 Score = 132 bits (331), Expect = 1e-29, Method: Composition-based stats. Identities = 41/96 (42%), Positives = 53/96 (55%), Gaps = 1/96 (1%) Query: 4 LMDADGNIAWSGEYDEWGNQLNEENPHHLHQPYRLPGQQYDKESGLYYNRNRYYDPLQGR 63 L+D G IAW WG + + + P R PGQ +D ESGL+YN RYYDP R Sbjct: 1033 LVDEAGAIAWRTRSTLWGATTWNADANA-YTPLRFPGQYFDPESGLHYNCFRYYDPETAR 1091 Query: 64 YITQDPIGLEGGWSLYAYPLNPVNGIDPLGLSPADV 99 Y++ DP+GL + AY NP + DPLGL+P Sbjct: 1092 YLSVDPLGLGPAPNPVAYVSNPHSWSDPLGLTPCPP 1127 >UniRef50_UPI0001AF2680 RHS/YD repeat-containing protein n=1 Tax=Streptomyces roseosporus NRRL 11379 RepID=UPI0001AF2680 Length = 1592 Score = 131 bits (330), Expect = 1e-29, Method: Composition-based stats. Identities = 56/217 (25%), Positives = 82/217 (37%), Gaps = 27/217 (12%) Query: 4 LMDADGNIAWSGEYDEWGNQLNEENPHHLHQPYRLPGQQYDKESGLYYNRNRYYDPLQGR 63 L+D G+IAW WG + P R PGQ D E+GL+YN R+YDP R Sbjct: 1364 LVDPSGDIAWRSRTTLWGATAWP-RTSTAYTPLRFPGQYDDPETGLHYNFFRHYDPDAAR 1422 Query: 64 YITQDPIGLEGGWSLYAYPLNPVNGIDPLGLSPADVA---------LIRRKDQLNHQRAW 114 Y++ DP+GL G + AY NP DPLGL P +I R + +++ Sbjct: 1423 YVSPDPLGLAAGPNPVAYVDNPFTWCDPLGLMPKCPRERAQKVADQVIERAQEGKMRKSS 1482 Query: 115 DILSDTYEDMKRLNL---------------GGTDQFFHCMAFCRVSKLNDAGVSRSAKGL 159 + +DT + + F ++K AG Sbjct: 1483 NYHADTRHQFSDERVLEILKNPDAVYQSQGQRGNLTFRQGEDVVITKGPGAGGGDVITAY 1542 Query: 160 GYEKEIRDYGLNLFGMYGRKVKL--SHSEMIEDNKKD 194 G + G FG L +H +++ N D Sbjct: 1543 GPSGTRGESGAGAFGGSPDDPGLPVTHDDIVNGNIPD 1579 >UniRef50_A9C0N8 YD repeat protein n=5 Tax=cellular organisms RepID=A9C0N8_DELAS Length = 1528 Score = 131 bits (330), Expect = 2e-29, Method: Composition-based stats. Identities = 67/211 (31%), Positives = 91/211 (43%), Gaps = 17/211 (8%) Query: 2 LALMDADGNIAWSGEYDEWGNQLNEENPHHLHQPYRLPGQQYDKESGLYYNRNRYYDPLQ 61 +AL D G +AW+ + D WGN L E NP +HQ RLPGQ +D+E+GLYYNR+RYYDP+ Sbjct: 1281 MALTDQTGQVAWAAKLDPWGNVLQEYNPQGIHQAIRLPGQHHDRETGLYYNRHRYYDPVV 1340 Query: 62 GRYITQDPIGLEGGWSLYAYP-LNPVNGIDPLGLSPADVALIRRKDQLNHQRAWDILSDT 120 G Y+ QDPIGL GG + Y +P + IDP GL+ + Sbjct: 1341 GSYVNQDPIGLAGGVNKILYSESSPTSKIDPTGLNTVAIGAGVGASVGGP---------- 1390 Query: 121 YEDMKRLNLGGTDQFFHCMAFCRVSKLNDAGVSRSAKGLGYEKEIRDYGLNL---FGMYG 177 + + N VS++A L + G G Sbjct: 1391 --IGAVVGAAIGVAVLGVLWMASRPSSNTTSVSKTASELSIAHNNAAQIGDRSSYQGYGG 1448 Query: 178 RKVKLSHSEMIEDNKKDLAVNDHGLTCPSTT 208 H ++ +D KKD GLTC Sbjct: 1449 NCTPDDHDKL-DDEKKDACDKSKGLTCGKNE 1478 >UniRef50_C6M9F4 Rhs family protein n=8 Tax=Neisseria RepID=C6M9F4_NEISI Length = 632 Score = 131 bits (330), Expect = 2e-29, Method: Composition-based stats. Identities = 43/92 (46%), Positives = 55/92 (59%), Gaps = 1/92 (1%) Query: 4 LMDADGNIAWSGEYDEWGNQLNEEN-PHHLHQPYRLPGQQYDKESGLYYNRNRYYDPLQG 62 + D DGN+ W G Y WG E +QP+RL Q D+E+GL+YN RYY+P G Sbjct: 431 MTDKDGNLLWFGNYTGWGRLKEETRVTDSAYQPFRLQNQYADRETGLHYNFFRYYEPDAG 490 Query: 63 RYITQDPIGLEGGWSLYAYPLNPVNGIDPLGL 94 R++ QDPIGL GG + Y + N IDPLGL Sbjct: 491 RFVNQDPIGLLGGANPYQFASNITEWIDPLGL 522 >UniRef50_Q1I7Q5 Putative uncharacterized protein n=11 Tax=Pseudomonas RepID=Q1I7Q5_PSEE4 Length = 1595 Score = 131 bits (330), Expect = 2e-29, Method: Composition-based stats. Identities = 39/102 (38%), Positives = 56/102 (54%), Gaps = 5/102 (4%) Query: 2 LALMDADGNIAWSGEYDEWGNQLNEENPH----HLHQPYRLPGQQYDKESGLYYNRNRYY 57 + L D +G++ W+ Y WG+ + P R GQ D+E+GL+YNR RYY Sbjct: 1338 MELTDEEGHVVWAAHYKAWGDLAELPGSSVAMSNARNPIRFQGQYQDQETGLHYNRFRYY 1397 Query: 58 DPLQGRYITQDPIGLEGGWSLYAYP-LNPVNGIDPLGLSPAD 98 DP RY+++DPIG GG + Y Y +PV DP+GL + Sbjct: 1398 DPKSARYVSKDPIGFMGGANAYTYTGGSPVTATDPMGLKSWE 1439 >UniRef50_D1YPP7 RHS repeat-associated core domain protein n=1 Tax=Veillonella parvula ATCC 17745 RepID=D1YPP7_9FIRM Length = 237 Score = 131 bits (330), Expect = 2e-29, Method: Composition-based stats. Identities = 45/189 (23%), Positives = 74/189 (39%), Gaps = 2/189 (1%) Query: 4 LMDADGNIAWSGEYDEWGNQLNEEN-PHHLHQPYRLPGQQYDKESGLYYNRNRYYDPLQG 62 + D +GN+ W EY W HQP+RL Q D+E+GL+YN RYY+P G Sbjct: 26 MTDKEGNLFWYVEYTIWARLKEATKVTDSAHQPFRLQNQYADRETGLHYNLMRYYEPEAG 85 Query: 63 RYITQDPIGLEGGWSLYAYPLNPVNGIDPLGLSPADVALIRRKDQLNHQRAWDILSDTYE 122 R++ QDPIGL G +LY + N +DPLG V+ + + D E Sbjct: 86 RFVNQDPIGLWGEENLYQFAPNATMWLDPLGWKGVTVSKASKGSAFDFILKLDSSV-YLE 144 Query: 123 DMKRLNLGGTDQFFHCMAFCRVSKLNDAGVSRSAKGLGYEKEIRDYGLNLFGMYGRKVKL 182 + + + R + ++ +++F G+ + Sbjct: 145 TAQHIKDAIASGKPSVVTIDRKGAAGRRKEVLKGTKCAKGTDRDEWPMSMFKEGGKGASI 204 Query: 183 SHSEMIEDN 191 ++ Sbjct: 205 RKISPSDNR 213 >UniRef50_UPI0001B4DD67 Rhs protein n=1 Tax=Streptomyces hygroscopicus ATCC 53653 RepID=UPI0001B4DD67 Length = 1730 Score = 131 bits (330), Expect = 2e-29, Method: Composition-based stats. Identities = 41/137 (29%), Positives = 62/137 (45%), Gaps = 9/137 (6%) Query: 4 LMDADGNIAWSGEYDEWGNQLNEENPHHLHQPYRLPGQQYDKESGLYYNRNRYYDPLQGR 63 L+D G+ AW WG + + P R PGQ +D E+G +YN +R+YDP R Sbjct: 1515 LVDDTGHTAWQSRTTLWGTTTWNTDAAA-YIPLRFPGQYHDLETGHHYNLHRHYDPETAR 1573 Query: 64 YITQDPIGLEGGWSLYAYPLNPVNGIDPLGLSPADVALI--------RRKDQLNHQRAWD 115 Y+T DP+GL + Y NP DP GL+P + + R + + + R + Sbjct: 1574 YLTPDPLGLAPAPNPTTYVHNPHVWADPDGLAPKKPSYMVPEPLPSAPRGELVRYDRVFS 1633 Query: 116 ILSDTYEDMKRLNLGGT 132 + + RL G Sbjct: 1634 HRQEINLHVPRLITPGG 1650 >UniRef50_C6M9F8 RhsG core protein with extension n=1 Tax=Neisseria sicca ATCC 29256 RepID=C6M9F8_NEISI Length = 194 Score = 131 bits (329), Expect = 2e-29, Method: Composition-based stats. Identities = 47/137 (34%), Positives = 63/137 (45%), Gaps = 19/137 (13%) Query: 4 LMDADGNIAWSGEYDEWGNQLNE-ENPHHLHQPYRLPGQQYDKESGLYYNRNRYYDPLQG 62 + D DGN+ W G Y WG E + +QP+RL Q D E+GL+YN RYY+ G Sbjct: 1 MTDKDGNLLWFGNYTGWGRLKEEIKVTDSAYQPFRLQNQYADPETGLHYNFFRYYESDAG 60 Query: 63 RYITQDPIGLEGGWSLYAYPLNPVNGIDPLGLSP------------------ADVALIRR 104 R++ QDPIGL GG + Y + N + D GL P A I Sbjct: 61 RFVNQDPIGLWGGSNSYQFAPNTLKWTDTWGLLPKCSDAKRAECDKQAEIDEATCRQIPE 120 Query: 105 KDQLNHQRAWDILSDTY 121 KD+ R W + + Y Sbjct: 121 KDKARRSRCWASVQERY 137 >UniRef50_A1TR18 YD repeat protein n=8 Tax=Acidovorax RepID=A1TR18_ACIAC Length = 1654 Score = 131 bits (329), Expect = 2e-29, Method: Composition-based stats. Identities = 44/110 (40%), Positives = 56/110 (50%), Gaps = 19/110 (17%) Query: 4 LMDADGNIAWSGEYDEWGNQLNEE-------------------NPHHLHQPYRLPGQQYD 44 + D G + W + WG+ L E N L Q RL GQ D Sbjct: 1410 VTDEAGEVRWRASWRTWGSALEERWEAVRIDGSAIPAVQQRHRNEDTLEQNLRLQGQYLD 1469 Query: 45 KESGLYYNRNRYYDPLQGRYITQDPIGLEGGWSLYAYPLNPVNGIDPLGL 94 +E+GL+YN RYYDP GR+I+ DPIGL GG +L Y NP++ IDPLG Sbjct: 1470 RETGLHYNTFRYYDPDVGRFISPDPIGLAGGLNLQRYAANPISWIDPLGH 1519 >UniRef50_A7BNN3 Putative uncharacterized protein n=1 Tax=Beggiatoa sp. SS RepID=A7BNN3_9GAMM Length = 212 Score = 131 bits (329), Expect = 2e-29, Method: Composition-based stats. Identities = 44/138 (31%), Positives = 66/138 (47%), Gaps = 20/138 (14%) Query: 4 LMDADGNIAWSGEYDEWGNQLNEENPHHLHQPYRLPGQQYDKESGLYYNRNRYYDPLQGR 63 L+D+ GN+ W+ Y+ +G + N + R GQ +D E+ L+YN RYY+P GR Sbjct: 29 LIDSQGNVVWAAVYEAFGKARVDVN--LVENHLRFAGQYFDSETRLHYNYYRYYEPTIGR 86 Query: 64 YITQDPIGLEGGWSLYAYP-LNPVNGIDPLGLSPADVALIRRKDQLNHQRAWDILSDTYE 122 Y+ DPI + YAY NP++ +DP GL + D L D ++ Sbjct: 87 YLRVDPI---PSVNQYAYVSGNPLSYVDPFGLEKEIMM--------------DQLFDLFD 129 Query: 123 DMKRLNLGGTDQFFHCMA 140 +M L+L F C A Sbjct: 130 EMAALDLAFDYPKFGCEA 147 >UniRef50_B8FBZ2 YD repeat protein n=1 Tax=Desulfatibacillum alkenivorans AK-01 RepID=B8FBZ2_DESAA Length = 685 Score = 130 bits (328), Expect = 3e-29, Method: Composition-based stats. Identities = 45/93 (48%), Positives = 58/93 (62%), Gaps = 2/93 (2%) Query: 3 ALMDADGNIAWSGEYDEWGNQLNEENPHHLHQPYRLPGQQYDKESGLYYNRNRYYDPLQG 62 L+ A+G+ AWS Y +G + + + R PGQ YD E+GL+YN NRYYDP G Sbjct: 252 VLVRANGSTAWSATYSAYGKASVDPDSD-VENNLRFPGQYYDAETGLHYNLNRYYDPEIG 310 Query: 63 RYITQDPIGLEGGWSLYAY-PLNPVNGIDPLGL 94 Y + DP+GL GG +LYAY NP N +DPLGL Sbjct: 311 AYRSPDPLGLGGGVNLYAYTAGNPANYVDPLGL 343 >UniRef50_Q6D1M4 Rhs protein n=5 Tax=Enterobacteriaceae RepID=Q6D1M4_ERWCT Length = 1618 Score = 130 bits (328), Expect = 3e-29, Method: Composition-based stats. Identities = 52/187 (27%), Positives = 81/187 (43%), Gaps = 14/187 (7%) Query: 4 LMDADGNIAWSGEYDEWGNQLNEENP------------HHLHQPYRLPGQQYDKESGLYY 51 L +G I W GE WG E P ++ R GQ YD E+GLYY Sbjct: 1398 LCSEEGEIRWRGEQGLWGAHREERRPIPLRRYLGDAANEEVYCELRYQGQLYDAETGLYY 1457 Query: 52 NRNRYYDPLQGRYITQDPIGLEGGWSLYAYPLNPVNGIDPLGLSPADV--ALIRRKDQLN 109 NR+RYYD G+Y++ DPIGL GG Y Y NP++ +DPLGL+P + + D+ N Sbjct: 1458 NRHRYYDAESGQYLSPDPIGLAGGKRAYGYVKNPLSWVDPLGLTPKEGKNGSDKGPDKKN 1517 Query: 110 HQRAWDILSDTYEDMKRLNLGGTDQFFHCMAFCRVSKLNDAGVSRSAKGLGYEKEIRDYG 169 + + + D +++ L + + + +E+ + G Sbjct: 1518 SRAKINNIEDIFDNPDVLKGHTPESLKEELGGIPEGWKAGTMNKSRTEDGFTLRELNERG 1577 Query: 170 LNLFGMY 176 ++ Y Sbjct: 1578 NDVTDRY 1584 >UniRef50_Q2SPP3 Rhs family protein n=1 Tax=Hahella chejuensis KCTC 2396 RepID=Q2SPP3_HAHCH Length = 265 Score = 130 bits (328), Expect = 3e-29, Method: Composition-based stats. Identities = 38/80 (47%), Positives = 49/80 (61%) Query: 30 HHLHQPYRLPGQQYDKESGLYYNRNRYYDPLQGRYITQDPIGLEGGWSLYAYPLNPVNGI 89 H P R GQ YD+E+G +YNR+RYYDP GR+I Q PIGL GG + Y Y NPV + Sbjct: 14 ERAHNPLRFQGQYYDEETGFHYNRHRYYDPQSGRFINQAPIGLLGGANAYQYAPNPVGWV 73 Query: 90 DPLGLSPADVALIRRKDQLN 109 DP GL+ + R+ D + Sbjct: 74 DPFGLTAKKESPKRQFDAIA 93 >UniRef50_B5I7B2 Rhs repeat protein n=1 Tax=Streptomyces sviceus ATCC 29083 RepID=B5I7B2_9ACTO Length = 1249 Score = 130 bits (328), Expect = 3e-29, Method: Composition-based stats. Identities = 42/96 (43%), Positives = 53/96 (55%), Gaps = 1/96 (1%) Query: 4 LMDADGNIAWSGEYDEWGNQLNEENPHHLHQPYRLPGQQYDKESGLYYNRNRYYDPLQGR 63 L+ DG IAW WG+ + + + P R PGQ +D E+GL+YN NRYYDP GR Sbjct: 1000 LIAPDGTIAWHSRSTAWGSTQSHRDATA-YTPLRYPGQYFDPETGLHYNLNRYYDPELGR 1058 Query: 64 YITQDPIGLEGGWSLYAYPLNPVNGIDPLGLSPADV 99 Y T DP+GL + Y Y NP DPLGL+ Sbjct: 1059 YTTPDPLGLAPAVNHYTYVPNPFTLADPLGLAGCTA 1094 >UniRef50_UPI000196E06E Rhs family protein n=2 Tax=Neisseria mucosa ATCC 25996 RepID=UPI000196E06E Length = 280 Score = 130 bits (327), Expect = 3e-29, Method: Composition-based stats. Identities = 44/92 (47%), Positives = 58/92 (63%), Gaps = 1/92 (1%) Query: 4 LMDADGNIAWSGEYDEWGNQLNEE-NPHHLHQPYRLPGQQYDKESGLYYNRNRYYDPLQG 62 + D GN+ W GEY WG +E + HQP+RL Q YD+E+GL+YN RYY+P G Sbjct: 51 MTDIHGNLLWYGEYTAWGRLKKDECVYRNAHQPFRLQNQYYDEETGLHYNLMRYYEPEAG 110 Query: 63 RYITQDPIGLEGGWSLYAYPLNPVNGIDPLGL 94 R++ QDPI L GG +LY++ N DPLGL Sbjct: 111 RFVNQDPILLLGGSNLYSFASNTNAWFDPLGL 142 >UniRef50_A1TQS0 YD repeat protein n=2 Tax=Acidovorax RepID=A1TQS0_ACIAC Length = 1602 Score = 130 bits (327), Expect = 3e-29, Method: Composition-based stats. Identities = 61/221 (27%), Positives = 87/221 (39%), Gaps = 46/221 (20%) Query: 2 LALMDADGNIAWSGEYDEWGN------------------------------------QLN 25 L L D +G IAW+ +Y WG + Sbjct: 1337 LELTDVNGQIAWAVDYKVWGEATLRAVPRSDTGTDGVPGPRRQGHGPEAKSHAADSEVVC 1396 Query: 26 EENPHHLHQPYRLPGQQYDKESGLYYNRNRYYDPLQGRYITQDPIGLEGGWSLYAYPLNP 85 P + QP+R GQQ+D+E+GL+YNR+RYYDP GR+I++DPIG GG + + YPL+P Sbjct: 1397 AREPQRVEQPFRFQGQQFDEETGLHYNRSRYYDPAVGRFISEDPIGFLGGINTFIYPLDP 1456 Query: 86 VNGIDPLGLSPADVALIRRKDQLNHQRAWDILSDTYEDMKRLNLGGTDQFFHCMAFCRVS 145 + IDP GL+ KD L K +GG H + Sbjct: 1457 YSWIDPTGLAGFRACPCVCKDILAGLNVGPHS-------KIKKIGGLYDSHHIYQDKALE 1509 Query: 146 KLNDAGVSRSAKGLGYEKEIRDYGLNLFGMYGRKVKLSHSE 186 L G +R A + + R+ G K + Sbjct: 1510 GL--PGYTRGA-AVAISLQGRNADRTTRGTPHYKANRVQDQ 1547 >UniRef50_B5H9Z7 Rhs protein n=2 Tax=Streptomyces RepID=B5H9Z7_STRPR Length = 1054 Score = 130 bits (327), Expect = 4e-29, Method: Composition-based stats. Identities = 40/95 (42%), Positives = 52/95 (54%), Gaps = 1/95 (1%) Query: 4 LMDADGNIAWSGEYDEWGNQLNEENPHHLHQPYRLPGQQYDKESGLYYNRNRYYDPLQGR 63 L+D G +AW WG + P R PGQ +D ESGL+YNR+R+YDP GR Sbjct: 749 LVDEQGKVAWRTRATLWGTTTWN-RSATAYTPLRFPGQYFDPESGLHYNRHRHYDPESGR 807 Query: 64 YITQDPIGLEGGWSLYAYPLNPVNGIDPLGLSPAD 98 Y++ DP+GL + Y NP IDPLGL+ Sbjct: 808 YLSPDPLGLVPAPNAVTYVDNPTRWIDPLGLAGCP 842 >UniRef50_D1T3Q3 YD repeat protein (Fragment) n=2 Tax=Acidovorax avenae subsp. avenae ATCC 19860 RepID=D1T3Q3_9BURK Length = 561 Score = 130 bits (326), Expect = 5e-29, Method: Composition-based stats. Identities = 50/97 (51%), Positives = 65/97 (67%), Gaps = 4/97 (4%) Query: 2 LALMDADGN----IAWSGEYDEWGNQLNEENPHHLHQPYRLPGQQYDKESGLYYNRNRYY 57 +AL+DA+G + W+ + WG E +P + QP R GQQ D E+GL+YNR+RYY Sbjct: 311 IALVDANGPQAGLVTWAATHHAWGAVREEYDPLGIGQPIRFQGQQLDAETGLHYNRHRYY 370 Query: 58 DPLQGRYITQDPIGLEGGWSLYAYPLNPVNGIDPLGL 94 DP+ G+Y+TQDPIGL GG AYPLNP+ DPLGL Sbjct: 371 DPMLGQYVTQDPIGLMGGIHKQAYPLNPIQASDPLGL 407 >UniRef50_C6WJA6 YD repeat protein n=3 Tax=Actinosynnema mirum DSM 43827 RepID=C6WJA6_ACTMD Length = 1509 Score = 129 bits (325), Expect = 6e-29, Method: Composition-based stats. Identities = 47/146 (32%), Positives = 69/146 (47%), Gaps = 7/146 (4%) Query: 4 LMDADGNIAWSGEYDEWGNQLNEENPHHLHQPYRLPGQQYDKESGLYYNRNRYYDPLQGR 63 L+D G + W + WG L E P R GQ +D+E+GL+YN +RYYDP R Sbjct: 1285 LLDDAGALVWRSQRTLWGAVLAELAGGP-DCPLRFAGQYHDRETGLFYNVHRYYDPETAR 1343 Query: 64 YITQDPIGLEGGWSLYAYPLNPVNGIDPLGLSPADVA------LIRRKDQLNHQRAWDIL 117 Y + DP+GL G + +AY +NP+ DPLGLS + A ++ R Q + A + Sbjct: 1344 YSSPDPLGLLAGPNPHAYVVNPLRLTDPLGLSGCERARKIADRVVERAQQGRVREASNYH 1403 Query: 118 SDTYEDMKRLNLGGTDQFFHCMAFCR 143 + + L D +H Sbjct: 1404 GRLSRERELEILSNPDGVYHSTGSGG 1429 >UniRef50_A3NNM1 Protein RhsD n=20 Tax=pseudomallei group RepID=A3NNM1_BURP6 Length = 1539 Score = 129 bits (325), Expect = 6e-29, Method: Composition-based stats. Identities = 44/116 (37%), Positives = 68/116 (58%), Gaps = 3/116 (2%) Query: 2 LALMDADGNIAWSGEYDEWGNQLNEENPHHLHQPYRLPGQQYDKESGLYYNRNRYYDPLQ 61 + ++D +GN+AW YD G + + + QP RL GQ +D E+G+ YNR+RYYD Sbjct: 1309 VRMLDVEGNVAWEASYDANGG-IEQFGIQAMPQPLRLQGQYFDAETGMSYNRHRYYDARI 1367 Query: 62 GRYITQDPIGLEGGWSLYAYPLNPVNGIDPLGLSPADVALIRRKDQLNHQRAWDIL 117 G+++++DPI L GG +LY Y +N ++ DPLGL V L ++L+ W Sbjct: 1368 GQFVSEDPIRLSGGENLYRYCVNSISWADPLGL--DRVPLFDPNNRLSFNAIWAYT 1421 >UniRef50_B4ETQ2 Rhs-family protein n=9 Tax=Enterobacteriaceae RepID=B4ETQ2_PROMH Length = 1703 Score = 129 bits (325), Expect = 7e-29, Method: Composition-based stats. Identities = 40/100 (40%), Positives = 55/100 (55%), Gaps = 8/100 (8%) Query: 4 LMDADGNIAWSGEYDEWGNQL--------NEENPHHLHQPYRLPGQQYDKESGLYYNRNR 55 + G +W+G + WG E +P++ P+R GQ D+ESGLYYNR R Sbjct: 1442 IFSEGGQASWAGRLNTWGQMQFWRYRDGKAENDPNYTECPFRFAGQYEDEESGLYYNRFR 1501 Query: 56 YYDPLQGRYITQDPIGLEGGWSLYAYPLNPVNGIDPLGLS 95 YYD G+Y++ DPIGL GG + Y Y P +DP GL+ Sbjct: 1502 YYDRETGQYLSPDPIGLLGGLNPYGYVHCPTGWVDPFGLA 1541 >UniRef50_Q88FK6 RHS family protein, putative n=1 Tax=Pseudomonas putida KT2440 RepID=Q88FK6_PSEPK Length = 1530 Score = 129 bits (324), Expect = 8e-29, Method: Composition-based stats. Identities = 45/98 (45%), Positives = 62/98 (63%), Gaps = 2/98 (2%) Query: 4 LMDADGNIAWSGEYDEWGNQLNEENP--HHLHQPYRLPGQQYDKESGLYYNRNRYYDPLQ 61 L D+DGN W ++ WG +NE + + Q R GQ D+E+GL++N R+YDP Sbjct: 1315 LTDSDGNTIWRSDHHGWGKIINEWHSQQNGREQNLRNQGQYIDRETGLHFNIFRFYDPDI 1374 Query: 62 GRYITQDPIGLEGGWSLYAYPLNPVNGIDPLGLSPADV 99 GR+ T DP+G+EGG +LY+Y N VN DPLGL P +V Sbjct: 1375 GRFTTTDPLGIEGGVNLYSYAPNIVNYSDPLGLCPENV 1412 >UniRef50_B1KGR6 YD repeat protein n=1 Tax=Shewanella woodyi ATCC 51908 RepID=B1KGR6_SHEWM Length = 1699 Score = 129 bits (323), Expect = 1e-28, Method: Composition-based stats. Identities = 46/145 (31%), Positives = 64/145 (44%), Gaps = 16/145 (11%) Query: 4 LMDADGNIAWSGEYDEWGNQ----------------LNEENPHHLHQPYRLPGQQYDKES 47 L +G+I W GE WG L + ++ R GQ +D E+ Sbjct: 1461 LCTENGDIEWRGEQSLWGEHHKWRLSVQAKSKHKKYLEDAANDPVNCDLRYQGQVFDSET 1520 Query: 48 GLYYNRNRYYDPLQGRYITQDPIGLEGGWSLYAYPLNPVNGIDPLGLSPADVALIRRKDQ 107 GLYYNR+RYYDP +Y++ DPIG+ GG AY NP+ +DP GL+ A + Q Sbjct: 1521 GLYYNRHRYYDPETCQYLSPDPIGMAGGLRTQAYVHNPMEWVDPFGLAACPTAEAPQTHQ 1580 Query: 108 LNHQRAWDILSDTYEDMKRLNLGGT 132 +N + TY T Sbjct: 1581 INSAVDAPEGAGTYSFRDNRGNQYT 1605 >UniRef50_UPI000190F33A Rhs-family protein n=2 Tax=Salmonella enterica subsp. enterica serovar Typhi RepID=UPI000190F33A Length = 138 Score = 129 bits (323), Expect = 1e-28, Method: Composition-based stats. Identities = 40/66 (60%), Positives = 49/66 (74%) Query: 29 PHHLHQPYRLPGQQYDKESGLYYNRNRYYDPLQGRYITQDPIGLEGGWSLYAYPLNPVNG 88 + HQP RLPGQ +D E+GL+YN RYY P GR+++QDPIGL GG +LYAY NP+ Sbjct: 1 GAYFHQPLRLPGQYFDDETGLHYNLFRYYAPECGRFVSQDPIGLRGGLNLYAYCPNPLTW 60 Query: 89 IDPLGL 94 IDPLGL Sbjct: 61 IDPLGL 66 >UniRef50_B0SXS0 YD repeat protein n=1 Tax=Caulobacter sp. K31 RepID=B0SXS0_CAUSK Length = 1198 Score = 129 bits (323), Expect = 1e-28, Method: Composition-based stats. Identities = 52/197 (26%), Positives = 72/197 (36%), Gaps = 16/197 (8%) Query: 2 LALMDADGNIAWSGEYDEWGNQLNEENPHHLHQPYRLPGQQYDKESG-LYYNRNRYYDPL 60 L + D W + +G + RLPGQ ESG L N NR YDP Sbjct: 934 LVMTDDSKAKVWDAYVEPFGR-AQVFGTASANIDLRLPGQWAQMESGGLSQNWNRDYDPT 992 Query: 61 QGRYITQDPIGLEGGWSLYAYP-LNPVNGIDPLGLSPADVALIRRKDQLNHQRAWDILSD 119 RY+ D IGL GG +LYAY P DP G P + + Sbjct: 993 LARYVQADRIGLGGGQNLYAYVDGRPTEYSDPDGRIPLPLITGAIGAVIGGGS------- 1045 Query: 120 TYEDMKRLNLGGTDQFFHCMAFCRVSKLNDAGVSRSAKGLGYEKEIRDYGLNLFGMYGRK 179 + +L + G D F C+ + V G A Y + YG G YG Sbjct: 1046 --NILGQLYMNGGD--FSCINWKNVGVATLVGGVTGALAPFYGTTL--YGAAALGAYGNF 1099 Query: 180 VKLSHSEMIEDNKKDLA 196 + + ++M+ + L Sbjct: 1100 AQYTGTQMVNGDSLSLG 1116 >UniRef50_D1T3N4 YD repeat protein n=2 Tax=Betaproteobacteria RepID=D1T3N4_9BURK Length = 1584 Score = 128 bits (322), Expect = 1e-28, Method: Composition-based stats. Identities = 40/116 (34%), Positives = 55/116 (47%), Gaps = 24/116 (20%) Query: 4 LMDADGNIAWSGEYDEWGNQLNE------------------------ENPHHLHQPYRLP 39 + D GN+ W Y WG + E ++ + Q R+ Sbjct: 1355 MSDTRGNLVWQARYLTWGATVQEHWQAFDATGRPADAPLAETCDRPQQSFAPMPQNLRMQ 1414 Query: 40 GQQYDKESGLYYNRNRYYDPLQGRYITQDPIGLEGGWSLYAYPLNPVNGIDPLGLS 95 GQ D+E+GL+YN RYYDP G + T DPIGL GG +L+ Y NP+ IDP G + Sbjct: 1415 GQYLDRETGLHYNTFRYYDPDLGAFTTPDPIGLAGGINLHQYAPNPIAWIDPWGWN 1470 >UniRef50_UPI0001B56FBA YD repeat-containing protein n=1 Tax=Streptomyces sp. AA4 RepID=UPI0001B56FBA Length = 1624 Score = 128 bits (322), Expect = 1e-28, Method: Composition-based stats. Identities = 40/91 (43%), Positives = 55/91 (60%), Gaps = 2/91 (2%) Query: 4 LMDADGNIAWSGEYDEWGNQLNEENPHHLHQPYRLPGQQYDKESGLYYNRNRYYDPLQGR 63 L+D G IAW +G + E + P R GQ +D+E+GL+YN +RYYDP+ R Sbjct: 1275 LLDDRGEIAWQARSSLFGVVVAESGGTGI--PLRFQGQYFDEETGLHYNFHRYYDPVLAR 1332 Query: 64 YITQDPIGLEGGWSLYAYPLNPVNGIDPLGL 94 Y++ DP+GL GG +AY NP +DPLGL Sbjct: 1333 YLSPDPLGLGGGLDPHAYVSNPHVSVDPLGL 1363 >UniRef50_D0KWY8 YD repeat protein n=1 Tax=Halothiobacillus neapolitanus c2 RepID=D0KWY8_HALNC Length = 1338 Score = 128 bits (321), Expect = 2e-28, Method: Composition-based stats. Identities = 55/152 (36%), Positives = 72/152 (47%), Gaps = 15/152 (9%) Query: 2 LALMDADGNIAWSGEYDEWGNQLNEENP---HHLHQPYRLPGQQYDKESGLYYNRNRYYD 58 +A DA + W Y +G QL+ + H R PGQ DKESGLYYN +RYYD Sbjct: 860 IAATDAQAQVIWRAHYGPYGQQLDVADSLVKDHFSLSLRNPGQWQDKESGLYYNDHRYYD 919 Query: 59 PLQGRYITQDPIGLEGGWSLYAYP-LNPVNGIDPLGL---------SPADVALIRRKDQL 108 P GRY++ DP+GL GG + YAY NP+ DP GL + A L+ D Sbjct: 920 PATGRYLSPDPLGLAGGLNAYAYVAANPIAYTDPYGLMLFAFDGTNNGAPGHLVPGNDTS 979 Query: 109 NHQRAWDILSDTYEDMKR--LNLGGTDQFFHC 138 N R +D + D + +G T H Sbjct: 980 NVYRFYDAYQFSSADPRYYITGIGTTYPEKHQ 1011 >UniRef50_C0EPY1 Putative uncharacterized protein n=7 Tax=Neisseria RepID=C0EPY1_NEIFL Length = 434 Score = 128 bits (321), Expect = 2e-28, Method: Composition-based stats. Identities = 43/92 (46%), Positives = 56/92 (60%), Gaps = 1/92 (1%) Query: 4 LMDADGNIAWSGEYDEWGNQLNEENP-HHLHQPYRLPGQQYDKESGLYYNRNRYYDPLQG 62 + D GN+ W GEY WG + H HQP+RL Q YD+E+GL+YN RYY+P G Sbjct: 223 MTDIRGNLLWYGEYTAWGRLKKDGRVYQHAHQPFRLQNQYYDRETGLHYNYFRYYEPETG 282 Query: 63 RYITQDPIGLEGGWSLYAYPLNPVNGIDPLGL 94 R+I+QDPIGL G +LY + N +D GL Sbjct: 283 RFISQDPIGLLGEDNLYWFGPNTAIWVDLFGL 314 >UniRef50_D1VCV7 YD repeat protein n=1 Tax=Frankia sp. EuI1c RepID=D1VCV7_9ACTO Length = 1572 Score = 127 bits (319), Expect = 3e-28, Method: Composition-based stats. Identities = 40/94 (42%), Positives = 50/94 (53%), Gaps = 1/94 (1%) Query: 4 LMDADGNIAWSGEYDEWGNQLNEENPHHLHQPYRLPGQQYDKESGLYYNRNRYYDPLQGR 63 L+D DG +AW WG P P R PGQ +D E+GL YN +RYYDP R Sbjct: 1364 LVDPDGRLAWHSRATLWG-VSPPSTPTTTDCPLRFPGQYHDPETGLNYNFHRYYDPATAR 1422 Query: 64 YITQDPIGLEGGWSLYAYPLNPVNGIDPLGLSPA 97 Y T D +GL + + Y NP+ IDP GL+P Sbjct: 1423 YKTSDALGLSPAPNPWTYVTNPLTWIDPFGLAPC 1456 >UniRef50_B7GLX9 Rhs family protein n=1 Tax=Anoxybacillus flavithermus WK1 RepID=B7GLX9_ANOFW Length = 295 Score = 127 bits (318), Expect = 4e-28, Method: Composition-based stats. Identities = 42/117 (35%), Positives = 60/117 (51%), Gaps = 5/117 (4%) Query: 1 MLALMDADGNIAWSGEYDEWGNQLNEENPHHLHQPYRLPGQQYDKESGLYYNRNRYYDPL 60 ++AL D GNI +YD WGN L++ PYR G QYD E+GLYY RYY P Sbjct: 71 VIALTDEQGNIVARYQYDAWGNILSQSGDLEDENPYRYAGYQYDNETGLYYLIARYYYPE 130 Query: 61 QGRYITQDPIGLEGG----WSLYAYP-LNPVNGIDPLGLSPADVALIRRKDQLNHQR 112 G +++ DP + + YAY NPV DP G +P + L+ ++ ++ Sbjct: 131 HGVFLSLDPDPGDADDILTQNGYAYANNNPVMLTDPDGENPYIIILVSVGGRIVAKK 187 >UniRef50_B4ETQ7 Putative Rhs-family protein n=3 Tax=Enterobacteriaceae RepID=B4ETQ7_PROMH Length = 380 Score = 126 bits (317), Expect = 6e-28, Method: Composition-based stats. Identities = 41/105 (39%), Positives = 56/105 (53%), Gaps = 8/105 (7%) Query: 7 ADGNIAWSGEYDEWGNQL--------NEENPHHLHQPYRLPGQQYDKESGLYYNRNRYYD 58 G +W+G + WG E +P++ P+R GQ D+ESGLYYNR RYYD Sbjct: 171 EGGQASWAGRLNTWGQMQFWRYRDGKAENDPNYTECPFRFAGQYEDEESGLYYNRFRYYD 230 Query: 59 PLQGRYITQDPIGLEGGWSLYAYPLNPVNGIDPLGLSPADVALIR 103 G+Y++ DPIGL GG + Y Y P +DP GL+ I+ Sbjct: 231 RETGQYLSPDPIGLLGGLNPYGYVHCPTGWVDPFGLACCPPPKIK 275 >UniRef50_D1T3N5 YD repeat protein n=1 Tax=Acidovorax avenae subsp. avenae ATCC 19860 RepID=D1T3N5_9BURK Length = 598 Score = 126 bits (316), Expect = 6e-28, Method: Composition-based stats. Identities = 48/104 (46%), Positives = 64/104 (61%), Gaps = 5/104 (4%) Query: 2 LALMDADGN----IAWSGEYDEWGNQLNEENPHHLHQPYRLPGQQYDKESGLYYNRNRYY 57 +AL+DA+G + W+ Y WG E +PH + Q R GQQ+D E+GL+YNR RYY Sbjct: 357 IALVDANGPQAGLVTWAATYHAWGAVREEYDPHGIGQDIRFQGQQFDAETGLHYNRFRYY 416 Query: 58 DPLQGRYITQDPIGLEGGWSLYAYPL-NPVNGIDPLGLSPADVA 100 DP+ G+Y+TQDPIGL+GG + Y +P DP GL D A Sbjct: 417 DPMLGQYVTQDPIGLKGGLNKSNYSGSSPAINCDPKGLDFKDKA 460 >UniRef50_C5AA19 Rhs family protein n=1 Tax=Burkholderia glumae BGR1 RepID=C5AA19_BURGB Length = 1596 Score = 126 bits (316), Expect = 6e-28, Method: Composition-based stats. Identities = 39/95 (41%), Positives = 52/95 (54%), Gaps = 6/95 (6%) Query: 4 LMDADGNIAWSGEYDEWGN------QLNEENPHHLHQPYRLPGQQYDKESGLYYNRNRYY 57 + D G I W Y WG ++++ + P R GQ +D ESGL YNR+RYY Sbjct: 1340 ITDELGEIVWEARYQAWGEARDVIERVSKATGERVRNPLRFQGQHFDDESGLAYNRHRYY 1399 Query: 58 DPLQGRYITQDPIGLEGGWSLYAYPLNPVNGIDPL 92 GRY+++DP L GG + +AY NPV IDPL Sbjct: 1400 AADVGRYVSKDPAELLGGLNEFAYVPNPVQWIDPL 1434 >UniRef50_D1SVF0 Rhs family protein (Fragment) n=1 Tax=Acidovorax avenae subsp. avenae ATCC 19860 RepID=D1SVF0_9BURK Length = 218 Score = 126 bits (316), Expect = 7e-28, Method: Composition-based stats. Identities = 42/121 (34%), Positives = 57/121 (47%), Gaps = 24/121 (19%) Query: 9 GNIAWSGEYDEWGNQLNE------------------------ENPHHLHQPYRLPGQQYD 44 GN+ W Y WG + E ++ + Q R+ GQ D Sbjct: 2 GNLVWQARYLTWGATVQEHWQAFDAAGRPVDAPVAETGHRPQQSFVLIPQNLRMQGQYLD 61 Query: 45 KESGLYYNRNRYYDPLQGRYITQDPIGLEGGWSLYAYPLNPVNGIDPLGLSPADVALIRR 104 +E+GL+YN RYYDP G + T DPIGL GG +L+ Y LNP+ +DP G +P R Sbjct: 62 RETGLHYNTFRYYDPDLGAFTTPDPIGLAGGINLHQYALNPIAWVDPWGWAPYCRQKGRP 121 Query: 105 K 105 K Sbjct: 122 K 122 >UniRef50_A3KUM5 Rhs family protein n=10 Tax=Pseudomonas aeruginosa RepID=A3KUM5_PSEAE Length = 931 Score = 126 bits (316), Expect = 7e-28, Method: Composition-based stats. Identities = 45/99 (45%), Positives = 53/99 (53%), Gaps = 2/99 (2%) Query: 5 MDADGNIAWSGEYDEWGNQLNEENPHHLHQPYRLPGQQYDKESGLYYNRNRYYDPLQGRY 64 DA G IAW + D +G R PGQ YD ESGL+YN R YDP GRY Sbjct: 690 TDASGQIAWQWQSDAFGRGEALSQGSTQVN-LRFPGQYYDAESGLHYNYFRDYDPETGRY 748 Query: 65 ITQDPIGLEGGWSLYAYP-LNPVNGIDPLGLSPADVALI 102 + DPIGL+GG + Y Y NP+ DP GL+PA L Sbjct: 749 VESDPIGLKGGLNTYGYVYGNPLTYSDPKGLTPAAAGLC 787 >UniRef50_Q7N2G0 Complete genome; segment 11/17 n=4 Tax=Gammaproteobacteria RepID=Q7N2G0_PHOLL Length = 1498 Score = 126 bits (316), Expect = 7e-28, Method: Composition-based stats. Identities = 41/119 (34%), Positives = 61/119 (51%), Gaps = 5/119 (4%) Query: 1 MLALMDADGNIAWSGEYDEWGN-----QLNEENPHHLHQPYRLPGQQYDKESGLYYNRNR 55 + ++ G + W+G WG L +P +L +R GQ D+ESGL+YNR+R Sbjct: 1286 VREILTEAGELIWAGRLLTWGEPECWPVLTVNDPRNLTCNFRFAGQYEDRESGLFYNRHR 1345 Query: 56 YYDPLQGRYITQDPIGLEGGWSLYAYPLNPVNGIDPLGLSPADVALIRRKDQLNHQRAW 114 YY+ G+Y++ DP+ L GG + Y Y +PVN IDP GL+ + QR Sbjct: 1346 YYESDTGQYLSPDPLNLSGGVNPYGYVHDPVNWIDPFGLAACPTQKYEVSTFDDLQRRS 1404 >UniRef50_D1PUV1 RHS family protein n=1 Tax=Prevotella bergensis DSM 17361 RepID=D1PUV1_9BACT Length = 284 Score = 125 bits (315), Expect = 9e-28, Method: Composition-based stats. Identities = 56/225 (24%), Positives = 84/225 (37%), Gaps = 31/225 (13%) Query: 6 DADGNIAWSGEYDEWGNQLNEEN------PHHLHQPYRLPGQQYDKESGLYYNRNRYYDP 59 + G W D G + E + ++ P+ GQ YD+E L YNR RYYDP Sbjct: 50 NEQGEEVWYRRLDMNGKVIEERSMNYTSYKDYVKIPFLFQGQYYDEEVKLAYNRFRYYDP 109 Query: 60 LQGRYITQDPIGLEGGWSLYAYPLNPVNGIDPLGLSPADVALIRRKDQLNHQRAWDILSD 119 GRYI++DP+ L GG +LY Y N + DPLGLS A + + Q I Sbjct: 110 ELGRYISEDPVRLLGGSNLYRYVENTILWCDPLGLSSAKLNKALGGSKKGMQAHHLIPEK 169 Query: 120 TYEDMKRLNLGGTDQFFHCMAFCRVSKLNDAGVSRSAKGLGYEKEIRDYGLNLFGMYGRK 179 + +QF + + ++ G+ + Y Y R Sbjct: 170 VWSQ--------NEQFLNQIGLSGQCDVSSNGLHMYNSKELAIANGKAY-------YHRG 214 Query: 180 VKLSHSEMIEDNKKDLAVNDHGLTCPSTTDCSDRCSDYINPEHKK 224 S+S I + H SD + ++PE + Sbjct: 215 RHDSYSHAINRRISAIENQYH----------SDLQAGILSPEEAR 249 >UniRef50_C7Q0A7 YD repeat protein n=2 Tax=Catenulispora acidiphila DSM 44928 RepID=C7Q0A7_CATAD Length = 1528 Score = 125 bits (315), Expect = 9e-28, Method: Composition-based stats. Identities = 39/92 (42%), Positives = 50/92 (54%) Query: 4 LMDADGNIAWSGEYDEWGNQLNEENPHHLHQPYRLPGQQYDKESGLYYNRNRYYDPLQGR 63 L+ DG + W WG + + P R PGQ +D ESGL+YN NRYYD Sbjct: 1304 LVTPDGRVVWHTTTSLWGRTIGTSAESGVDCPLRFPGQYHDDESGLHYNLNRYYDSETAA 1363 Query: 64 YITQDPIGLEGGWSLYAYPLNPVNGIDPLGLS 95 Y+T DP+GL + +AY NP+ DPLGLS Sbjct: 1364 YLTPDPLGLVPAPNDHAYVPNPLTVSDPLGLS 1395 >UniRef50_Q8GDM7 Rhs n=3 Tax=Photorhabdus RepID=Q8GDM7_PHOLU Length = 1469 Score = 125 bits (315), Expect = 1e-27, Method: Composition-based stats. Identities = 50/147 (34%), Positives = 68/147 (46%), Gaps = 8/147 (5%) Query: 2 LALMDADGNIAWSG-EYDEWGNQLNEENPHHLHQP-YRLPGQQYDKESGLYYNRNRYYDP 59 LAL D G W + +G +L+ + P R GQ +D+ESGL+YNR RYY P Sbjct: 1245 LALFDPTGKRVWRRPKQSLYGLRLSGHGENPQLDPGLRFAGQLFDEESGLFYNRFRYYLP 1304 Query: 60 LQGRYITQDPIGLEGGWSLYAYPLNPVNGIDPLGLSPADVALIRRKDQLNHQRAWDILSD 119 Y++ DP+GL GG + YAY NP N IDPLGL+ D ++ W Sbjct: 1305 EAACYLSPDPLGLNGGPNPYAYVHNPANWIDPLGLAGC------PTDYSQKRKYWSSDPI 1358 Query: 120 TYEDMKRLNLGGTDQFFHCMAFCRVSK 146 T++ K H A+ K Sbjct: 1359 TFKGNKVYQRNDLFDPQHMSAWRDQGK 1385 >UniRef50_B4EGW8 RHS-family protein n=9 Tax=Burkholderiaceae RepID=B4EGW8_BURCJ Length = 1515 Score = 125 bits (314), Expect = 1e-27, Method: Composition-based stats. Identities = 47/130 (36%), Positives = 59/130 (45%), Gaps = 11/130 (8%) Query: 4 LMDADGNIAWSGEYDEWGNQLNEENPH--------HLHQPYRLPGQQYDKESGLYYNRNR 55 + D G W Y WG L + P + R GQ D E+GL YN NR Sbjct: 1267 VFDEQGRPVWKAAYSLWGKLLPVKRPANDADCGATSIDTTLRFSGQWADDETGLNYNLNR 1326 Query: 56 YYDPLQGRYITQDPIGLEGGWSLYAYPLNPVNGIDPLGLS---PADVALIRRKDQLNHQR 112 YYDP G+Y++ DPIGL GG AY +P IDPLGL PA + R + Sbjct: 1327 YYDPDSGQYLSADPIGLLGGARTQAYVHDPSQWIDPLGLQGCKPAGKKISMRAYRYEMPE 1386 Query: 113 AWDILSDTYE 122 +D D +E Sbjct: 1387 RFDTTWDAHE 1396 >UniRef50_A1WE65 Rhs family protein-like protein n=1 Tax=Verminephrobacter eiseniae EF01-2 RepID=A1WE65_VEREI Length = 303 Score = 125 bits (313), Expect = 1e-27, Method: Composition-based stats. Identities = 50/156 (32%), Positives = 69/156 (44%), Gaps = 15/156 (9%) Query: 2 LALMDADGNIAWSGEYDEWGN-----QLNEENPHHLHQPYRLPGQQYDKESGLYYNRNRY 56 L D GN+ W+ Y+ +G + ++ RLPGQ D E+GL+YN +RY Sbjct: 78 LQATDNAGNVVWAANYNAFGRADVVTPRSATGDSRINSQLRLPGQYEDVETGLHYNFHRY 137 Query: 57 YDPLQGRYITQDPIGLEGGWSLYAYPL-NPVNGIDPLGLSPADVALIRRKDQLNHQ---R 112 YD GRY DPIGL GG + Y Y NP++ DPLGL DV L R ++ Sbjct: 138 YDLDIGRYSQIDPIGLRGGLNGYVYVNGNPLSFTDPLGL---DVELRCRPAEIVAGLVNH 194 Query: 113 AW---DILSDTYEDMKRLNLGGTDQFFHCMAFCRVS 145 W D + + G + + VS Sbjct: 195 CWLKTDTIEAGMNKRITCSRAGNELVNLDSIWVVVS 230 >UniRef50_B3JL94 Putative uncharacterized protein n=1 Tax=Bacteroides coprocola DSM 17136 RepID=B3JL94_9BACE Length = 336 Score = 125 bits (313), Expect = 2e-27, Method: Composition-based stats. Identities = 42/160 (26%), Positives = 66/160 (41%), Gaps = 6/160 (3%) Query: 4 LMDADGNIAWSGEYDEWGNQLNEENPHHLHQPYRLPGQQYDKESGLYYNRNRYYDPLQGR 63 + + DG + EY +G EE + PY +++D+E+GLYY RYYDP Sbjct: 62 ITNLDGEVVQHIEYVPFGEVFIEERNSIWNTPYLFNAKEFDEETGLYYYGARYYDPRLSL 121 Query: 64 YITQDPIGLE-GGWSLYAY-PLNPVNGIDPLGLSPADVALIRRKDQLNHQRAWDILSDTY 121 +I+ DP+ + + Y Y NPV +DP G + A+ K +R W + T Sbjct: 122 WISTDPLQEKYPHINSYCYTANNPVLFVDPDGKAIVKGAVAAFKY---AKRIWSVYKKTG 178 Query: 122 E-DMKRLNLGGTDQFFHCMAFCRVSKLNDAGVSRSAKGLG 160 + L G D+F + DA + Sbjct: 179 KLTPSNLKKAGLDEFLDIAGDIQTIFTGDATTLDKVGAIA 218 >UniRef50_C8W2T8 YD repeat protein n=2 Tax=Desulfotomaculum acetoxidans DSM 771 RepID=C8W2T8_DESAS Length = 2349 Score = 124 bits (312), Expect = 2e-27, Method: Composition-based stats. Identities = 58/205 (28%), Positives = 83/205 (40%), Gaps = 32/205 (15%) Query: 2 LALMDADGNIAW--SGEYDEWGNQLNEENPHHLHQPYRLPGQQYDKESGLYYNRNRYYDP 59 L++ D GN +YD WG + E+ + P+R G YD+E+GLYY ++RYY P Sbjct: 2115 LSMTDDYGNTDQENRYDYDPWGTPICED--ESVKLPFRYAGYYYDEETGLYYLKSRYYSP 2172 Query: 60 LQGRYITQDPIGL-----EGGWSLYAYP-LNPVNGIDPLGLSPADVALIRRKDQ---LNH 110 GR++T+D +LYAY NPVN +DP G + D+ R N Sbjct: 2173 ALGRFLTRDDHSFINHADPQTLNLYAYCGNNPVNYVDPDGNTIDDIKSGLRNAGNAVYNE 2232 Query: 111 QRAWDILSDTYEDMKRLNLGGTDQFFHCMAFCRVSKLNDAGVSRSAKGLGYEKEIRDYGL 170 R W D + C G+ + KG+G K + Sbjct: 2233 ARTW-----------------ADAWLDCPFTGGAVFTGGIGIIKRIKGVGNFK--KYSPA 2273 Query: 171 NLFGMYGRKVKLSHSEMIEDNKKDL 195 + YG K H E+ D DL Sbjct: 2274 QIEKNYGLKKGQFHREIKGDILSDL 2298 >UniRef50_D1JME6 Rhs family protein n=22 Tax=Bacteroides RepID=D1JME6_9BACE Length = 1494 Score = 124 bits (312), Expect = 2e-27, Method: Composition-based stats. Identities = 50/159 (31%), Positives = 72/159 (45%), Gaps = 3/159 (1%) Query: 6 DADGNIAWSGEYDEWGNQLNEENPHHLHQPYRLPGQQYDKESGLYYNRNRYYDPLQGRYI 65 D +GN WS D GN + E + P+ GQ YD+E+GL YNR RYY P G Y+ Sbjct: 1266 DTEGNEVWSRVLDMDGNVIEETGNKGMV-PFLFQGQYYDRETGLAYNRFRYYSPKMGVYV 1324 Query: 66 TQDPIGLEGG-WSLYAYPLNPVNGIDPLGLSPADVALIRRKDQLNHQRAWDILSDTYEDM 124 +QDPIGL GG +LY Y + ID GL V + + + RA D + + Sbjct: 1325 SQDPIGLGGGILNLYGYVDDTNVWIDSFGLDWNYVLINSKGNVFYSGRASDNANLSDVAR 1384 Query: 125 KRLNLGGTDQFFHCMAFCRVSKLNDAGVSRSAKGLGYEK 163 + G D + ++ ++ A G E+ Sbjct: 1385 RHAGTKGADGTRF-GNGDTLVQITKPQITSYAAARGVEQ 1422 >UniRef50_B3PEK8 RHS Repeat family n=1 Tax=Cellvibrio japonicus Ueda107 RepID=B3PEK8_CELJU Length = 2245 Score = 124 bits (311), Expect = 2e-27, Method: Composition-based stats. Identities = 57/212 (26%), Positives = 87/212 (41%), Gaps = 21/212 (9%) Query: 1 MLALMDADGNIAWS--GEYDEWGNQLNEENPHHLHQPYRLPGQQYDKESGLYYNRNRYYD 58 ++A+ +A G + + Y +G P+R G++ D E+GLYY R RYYD Sbjct: 1997 VIAVSNASGQVTSNNIYTYSPYGEV-----NSAAGFPFRYTGRRLDPETGLYYYRARYYD 2051 Query: 59 PLQGRYITQDPIGLEGGWSLYAYPLN-PVNGIDPLGLSPADVALIRRKDQLNHQRAWDIL 117 P GR++ DPIG + +LYAY N P+N IDP G + L +L Sbjct: 2052 PGLGRFLQTDPIGYKDQMNLYAYVGNDPLNKIDPTGKNAKKPFLKEAAKELR-------- 2103 Query: 118 SDTYEDMKRLNLGGTDQFFHCMAFCRVSKLNDAGVSRSAKGLGYEKEIRDYGLNLFGMYG 177 + +K G + +L + G S+S + E+ + G L M G Sbjct: 2104 KEAKRQIKNAKAQGRREALKQ----ERQQLKETGESKSNLSDDRKTELLETG-KLKNMDG 2158 Query: 178 RKVKLSHSEMIEDNKKDLAVNDHGLTCPSTTD 209 S + K D+A N +T D Sbjct: 2159 HHEPSVSSGKTLEEKIDIAKNPDNITFMEKAD 2190 >UniRef50_B2HAQ4 RhsD protein n=4 Tax=Burkholderia RepID=B2HAQ4_BURPS Length = 1531 Score = 124 bits (311), Expect = 3e-27, Method: Composition-based stats. Identities = 38/97 (39%), Positives = 53/97 (54%), Gaps = 2/97 (2%) Query: 2 LALMDADGNIAWSGEYDEWGNQLNEENPHHLHQPYRLPGQQYDKESGLYYNRNRYYDPLQ 61 + D+ G W+ YDE G + ++ P L GQ D E+GL+YNR+RYYDP Sbjct: 1289 TQMTDSSGREVWATGYDENGRLVPI--NADIYNPIHLQGQYRDAETGLHYNRHRYYDPAL 1346 Query: 62 GRYITQDPIGLEGGWSLYAYPLNPVNGIDPLGLSPAD 98 G +I++DP+GL G +LY Y N + DPLG Sbjct: 1347 GSFISKDPLGLAAGVNLYRYAPNSIGWADPLGFQAKP 1383 >UniRef50_C6CPH2 YD repeat protein n=11 Tax=Enterobacteriaceae RepID=C6CPH2_DICZE Length = 1423 Score = 124 bits (311), Expect = 3e-27, Method: Composition-based stats. Identities = 44/104 (42%), Positives = 54/104 (51%), Gaps = 2/104 (1%) Query: 3 ALMDADGNIAWSG-EYDEWGNQLNEENPHHLHQPYRLPGQQYDKESGLYYNRNRYYDPLQ 61 AL DG + W + WG Q E GQ D ESGL YNR RYYDP Sbjct: 1211 ALFTPDGTLRWQAPKATLWG-QRQAEKSESPDPGLAFAGQLRDSESGLCYNRFRYYDPAG 1269 Query: 62 GRYITQDPIGLEGGWSLYAYPLNPVNGIDPLGLSPADVALIRRK 105 G Y++ DPIG+ GG + YAY NP+ IDPLGL+ + + K Sbjct: 1270 GCYVSPDPIGIAGGDNNYAYAPNPITWIDPLGLAGCSIQKLEEK 1313 >UniRef50_B4VFT3 Rhs protein n=4 Tax=Bacteria RepID=B4VFT3_9ACTO Length = 1253 Score = 124 bits (311), Expect = 3e-27, Method: Composition-based stats. Identities = 39/95 (41%), Positives = 49/95 (51%), Gaps = 1/95 (1%) Query: 4 LMDADGNIAWSGEYDEWGNQLNEENPHHLHQPYRLPGQQYDKESGLYYNRNRYYDPLQGR 63 L+ DG AW WG + + P R PGQ YD ESGL++N R YDP Sbjct: 1035 LLAEDGTTAWHTRATLWGTTTWNSDATA-YTPLRFPGQYYDPESGLHHNYFRTYDPETAH 1093 Query: 64 YITQDPIGLEGGWSLYAYPLNPVNGIDPLGLSPAD 98 Y++ DP+GL + Y NP DPLGL+PAD Sbjct: 1094 YLSPDPLGLAPAPNPTTYVHNPHTWSDPLGLTPAD 1128 >UniRef50_A4SKJ3 Rhs family protein n=2 Tax=Bacteria RepID=A4SKJ3_AERS4 Length = 1590 Score = 124 bits (311), Expect = 3e-27, Method: Composition-based stats. Identities = 43/103 (41%), Positives = 54/103 (52%), Gaps = 12/103 (11%) Query: 4 LMDADGNIAWSGEYDEWGNQLNEENP------------HHLHQPYRLPGQQYDKESGLYY 51 L G+I W GE WGN + P + R GQ YD+E+GLYY Sbjct: 1353 LCSEAGDIIWRGEQRLWGNYRADAIPQPLRRFLGDAANEETYCELRYQGQIYDQETGLYY 1412 Query: 52 NRNRYYDPLQGRYITQDPIGLEGGWSLYAYPLNPVNGIDPLGL 94 NR+RY+DP G+YI+ DPIG GG Y NP+ +DPLGL Sbjct: 1413 NRHRYFDPELGQYISPDPIGFAGGVRPQGYVHNPLEWVDPLGL 1455 >UniRef50_A9EVR3 Conserved carbohydrate-binding protein, Rhs family n=1 Tax=Sorangium cellulosum 'So ce 56' RepID=A9EVR3_SORC5 Length = 1367 Score = 124 bits (310), Expect = 3e-27, Method: Composition-based stats. Identities = 44/129 (34%), Positives = 62/129 (48%), Gaps = 4/129 (3%) Query: 4 LMDADGNIAWSGEYDEWGNQLNEENPHHLHQPYRLPGQQYDKESGLYYNRNRYYDPLQGR 63 L D G W+ E D G E+ P+R PGQ D E+GLYYNR RYYDP G Sbjct: 1148 LFDTSGVQVWAAETDTLGRTAVEQGAPE-DCPWRWPGQYEDPETGLYYNRFRYYDPDAGN 1206 Query: 64 YITQDPIGLEGGWSLYAYPLNPVNGIDPLGLSPADVALIRRKDQLNHQRAWDILS-DTYE 122 Y++ DP+GL G + YAY + + DPLGL + + D + + + Sbjct: 1207 YVSPDPLGLLAGTAEYAYAPDSLVWFDPLGLIV--LQQVPYNDHPLFGAVSEFIQGKSRS 1264 Query: 123 DMKRLNLGG 131 D++ N+ Sbjct: 1265 DLRGRNVAA 1273 >UniRef50_D1S833 YD repeat protein n=1 Tax=Micromonospora aurantiaca ATCC 27029 RepID=D1S833_9ACTO Length = 3829 Score = 124 bits (310), Expect = 3e-27, Method: Composition-based stats. Identities = 41/97 (42%), Positives = 52/97 (53%), Gaps = 5/97 (5%) Query: 4 LMDADGNIAWSGEYDEWGNQLNEENPHHLHQPYRLPGQQYDKESGLYYNRNRYYDPLQGR 63 L++ G + W D WG P PGQ D E+GL+YNR RYYDP GR Sbjct: 2487 LVEPGGGLRWWSRGDLWGR-----GADRTATPLAFPGQYVDAETGLHYNRFRYYDPATGR 2541 Query: 64 YITQDPIGLEGGWSLYAYPLNPVNGIDPLGLSPADVA 100 Y++ DP+GL GG + AY NP+ DPLGL+ A Sbjct: 2542 YVSPDPLGLSGGPNPTAYVSNPLTVADPLGLTSCTPA 2578 >UniRef50_B3PEN8 Rhsfamily protein n=2 Tax=Cellvibrio japonicus Ueda107 RepID=B3PEN8_CELJU Length = 1401 Score = 123 bits (309), Expect = 4e-27, Method: Composition-based stats. Identities = 40/92 (43%), Positives = 52/92 (56%), Gaps = 3/92 (3%) Query: 4 LMDADGNIAWSGEYDEWGNQLNEENPHHLHQPYRLPGQQYDKESGLYYNRNRYYDPLQGR 63 ++D GNI W +Y +G + + R PGQ +D E+GL++N R YD GR Sbjct: 1188 MLDDAGNIVWEAQYSAFGKAHITIDT--VENNLRFPGQYFDSETGLHHNYFRDYDSALGR 1245 Query: 64 YITQDPIGLEGGWSLYAYPL-NPVNGIDPLGL 94 YI DPIGL GG++ Y Y NP IDPLGL Sbjct: 1246 YIQSDPIGLGGGFNTYVYAYQNPAVLIDPLGL 1277 >UniRef50_C4NV50 Rhs repeat family protein n=10 Tax=Gammaproteobacteria RepID=C4NV50_ECOLX Length = 1374 Score = 123 bits (308), Expect = 6e-27, Method: Composition-based stats. Identities = 40/93 (43%), Positives = 50/93 (53%), Gaps = 5/93 (5%) Query: 3 ALMDADGNIAWSGEYDEWGNQ-LNEENPHHLHQPYRLPGQQYDKESGLYYNRNRYYDPLQ 61 AL D G + W Y +G + + P R PGQ YD E+G +YN RYYDP Sbjct: 1114 ALTDVSGQVVWKASYSPFGKASIIIQGPTF---NLRFPGQYYDAETGFHYNWRRYYDPAT 1170 Query: 62 GRYITQDPIGLEGGWSLYAYP-LNPVNGIDPLG 93 GRYIT DP+GL G + Y Y NP++ DP G Sbjct: 1171 GRYITSDPLGLIDGVNTYGYVHGNPMSNTDPTG 1203 >UniRef50_Q12SZ6 Putative uncharacterized protein n=1 Tax=Shewanella denitrificans OS217 RepID=Q12SZ6_SHEDO Length = 520 Score = 123 bits (308), Expect = 6e-27, Method: Composition-based stats. Identities = 51/175 (29%), Positives = 77/175 (44%), Gaps = 17/175 (9%) Query: 1 MLALMDADGNIAWSGEYDEWGNQLNEENPHHLHQPYRLPGQQYDKESGLYYNRNRYYDPL 60 ++A D GNI Y+ +G +L + G DK+ GL Y + RYYDPL Sbjct: 236 VVAESDEAGNIISRSHYEPFGKRLGGDKAG-----IGYTGHLQDKDLGLTYMQARYYDPL 290 Query: 61 QGRYITQDPIGLEG--GWSLYAYP-LNPVNGIDPLGLSPADVALIRRKDQLNHQ-----R 112 GR+ + DPI G ++ YAY NP IDP G + VA I N + Sbjct: 291 IGRFYSNDPISYRGVHSFNRYAYANNNPYKYIDPTGNNEEYVAGISVGIGFNAEDVNLIA 350 Query: 113 AWDILSDTYEDMKRLNLGGT---DQFFHCMAFCRVSKLND-AGVSRSAKGLGYEK 163 ++ Y ++ ++LG +Q F + +S L+ G++RS G K Sbjct: 351 TFENSVYNYNFLEGVSLGQRIVEEQMFQQLQLGELSSLSMLLGIARSNGKFGVVK 405 >UniRef50_D1AA66 YD repeat-containing protein n=1 Tax=Thermomonospora curvata DSM 43183 RepID=D1AA66_THECD Length = 1197 Score = 123 bits (308), Expect = 6e-27, Method: Composition-based stats. Identities = 38/95 (40%), Positives = 50/95 (52%), Gaps = 1/95 (1%) Query: 2 LALMDADGNIAWSGEYDEWGNQLNEENPHHLHQPYRLPGQQYDKESGLYYNRNRYYDPLQ 61 + L+ DG+I+W WG + P PGQ YD E+GLY+N RYY P Sbjct: 1027 VELVAPDGSISWRSHAAVWGAP-YVSRADQISCPLGFPGQYYDSETGLYFNYFRYYSPFD 1085 Query: 62 GRYITQDPIGLEGGWSLYAYPLNPVNGIDPLGLSP 96 GRY++ DP+GL + Y Y +NP DP GL P Sbjct: 1086 GRYLSPDPLGLSPQPNPYIYVINPFVWADPFGLYP 1120 >UniRef50_A5GE16 YD repeat protein n=1 Tax=Geobacter uraniireducens Rf4 RepID=A5GE16_GEOUR Length = 1600 Score = 122 bits (307), Expect = 7e-27, Method: Composition-based stats. Identities = 41/101 (40%), Positives = 62/101 (61%), Gaps = 4/101 (3%) Query: 1 MLALMDADGNIAWSGEYDEWGNQLNEENPHHLHQPYRLPGQQYDKESGLYYNRNRYYDPL 60 ++++ DA+ N+ S EYD +G Y G+++DKE+GLY+ R RYYDP+ Sbjct: 1411 IVSITDANRNVVQSYEYDSFGMVKPST---VFANSYTYTGREWDKETGLYFYRARYYDPM 1467 Query: 61 QGRYITQDPIGLEGGWSLYAYP-LNPVNGIDPLGLSPADVA 100 +GR+I++DP+G +GG ++YAY N VN DP GL P Sbjct: 1468 EGRFISKDPVGFKGGINIYAYVSNNVVNDTDPSGLYPGPCG 1508 >UniRef50_B2PYS3 Putative uncharacterized protein n=1 Tax=Providencia stuartii ATCC 25827 RepID=B2PYS3_PROST Length = 191 Score = 122 bits (307), Expect = 7e-27, Method: Composition-based stats. Identities = 34/66 (51%), Positives = 42/66 (63%) Query: 30 HHLHQPYRLPGQQYDKESGLYYNRNRYYDPLQGRYITQDPIGLEGGWSLYAYPLNPVNGI 89 Q R GQ +D E+GL++N R+YDP GR+I DPIGL GG +LY Y NP+ I Sbjct: 2 ESFEQNLRYAGQYFDNETGLHFNTFRFYDPQIGRFIMPDPIGLLGGVNLYQYAPNPLVWI 61 Query: 90 DPLGLS 95 DP GLS Sbjct: 62 DPWGLS 67 >UniRef50_C7Q0B8 YD repeat protein n=1 Tax=Catenulispora acidiphila DSM 44928 RepID=C7Q0B8_CATAD Length = 1489 Score = 122 bits (307), Expect = 8e-27, Method: Composition-based stats. Identities = 43/96 (44%), Positives = 54/96 (56%), Gaps = 1/96 (1%) Query: 4 LMDADGNIAWSGEYDE-WGNQLNEENPHHLHQPYRLPGQQYDKESGLYYNRNRYYDPLQG 62 L+ ADG++ W WG + P P R PGQ +D E+GL+YN NRYYDP Sbjct: 1288 LVSADGHVVWQQRRASIWGLPADIVPPDADEFPLRFPGQYHDSETGLHYNLNRYYDPEAA 1347 Query: 63 RYITQDPIGLEGGWSLYAYPLNPVNGIDPLGLSPAD 98 Y+T DP+GLE + Y Y NP+ DPLGL PA Sbjct: 1348 AYLTPDPLGLEPAPNQYGYVGNPLADSDPLGLYPAS 1383 >UniRef50_A4FJ21 YD repeat protein n=1 Tax=Saccharopolyspora erythraea NRRL 2338 RepID=A4FJ21_SACEN Length = 1670 Score = 122 bits (307), Expect = 8e-27, Method: Composition-based stats. Identities = 42/92 (45%), Positives = 57/92 (61%), Gaps = 1/92 (1%) Query: 4 LMDADGNIAWSGEYDEWGNQLNEENPHHLHQPYRLPGQQYDKESGLYYNRNRYYDPLQGR 63 L+ +G +AW G WG +L + P+ + P R PGQ D E+GL YN +RYYDP GR Sbjct: 1273 LIAPNGVLAWHGRTSLWGKELPVQ-PNGVTTPLRFPGQYADAETGLNYNVHRYYDPATGR 1331 Query: 64 YITQDPIGLEGGWSLYAYPLNPVNGIDPLGLS 95 Y++QDP+GL + AY NP + DPLGL+ Sbjct: 1332 YLSQDPLGLAPAPNPVAYVDNPHSAADPLGLA 1363 >UniRef50_B4V251 LipX3 n=2 Tax=Streptomyces RepID=B4V251_9ACTO Length = 1253 Score = 121 bits (303), Expect = 2e-26, Method: Composition-based stats. Identities = 50/187 (26%), Positives = 79/187 (42%), Gaps = 21/187 (11%) Query: 4 LMDADGNIAWSGEYDEWGNQLN-EENPHHLHQPYRLPGQQYDKESGLYYNRNRYYDPLQG 62 L+ + G +AW WG + P R PGQ D E+GL+YN RYY+P Sbjct: 1052 LVASGGELAWQRRTTLWGTDFPAPTDTTSADCPIRFPGQYADSETGLHYNFFRYYEPESA 1111 Query: 63 RYITQDPIGLEGGWSLYAYPLNPVNGIDPLGLSPADVALIRRKDQLNHQRAWDILSDTYE 122 RYI+ DP+GLE + +AY +N + DPLGL+ KD L+ + + DT++ Sbjct: 1112 RYISADPLGLEPAPNHHAYVVNALGWTDPLGLAARG-----PKDPLDLGQGYRGRLDTWK 1166 Query: 123 D---------------MKRLNLGGTDQFFHCMAFCRVSKLNDAGVSRSAKGLGYEKEIRD 167 + + + + G+D +F+ V + KG + R Sbjct: 1167 EGTKGTDFEIHVYDKSGREVGIFGSDGWFNKHNTIGADVEVPPSVENALKGRAVDTMRRS 1226 Query: 168 YGLNLFG 174 + G Sbjct: 1227 GRIGPRG 1233 >UniRef50_UPI00016B0868 Rhs family protein n=1 Tax=Burkholderia pseudomallei 112 RepID=UPI00016B0868 Length = 242 Score = 121 bits (303), Expect = 2e-26, Method: Composition-based stats. Identities = 38/95 (40%), Positives = 53/95 (55%), Gaps = 2/95 (2%) Query: 4 LMDADGNIAWSGEYDEWGNQLNEENPHHLHQPYRLPGQQYDKESGLYYNRNRYYDPLQGR 63 + D+ G W+ YDE G + ++ P L GQ D E+GL+YNR+RYYDP G Sbjct: 1 MTDSSGREVWATGYDENGRLVPI--NADIYNPIHLQGQYRDAETGLHYNRHRYYDPALGS 58 Query: 64 YITQDPIGLEGGWSLYAYPLNPVNGIDPLGLSPAD 98 +I++DP+GL G +LY Y N + DPLG Sbjct: 59 FISKDPLGLAAGVNLYRYAPNSIGWADPLGFQAKP 93 >UniRef50_B8FIJ1 YD repeat protein n=1 Tax=Desulfatibacillum alkenivorans AK-01 RepID=B8FIJ1_DESAA Length = 1630 Score = 121 bits (303), Expect = 2e-26, Method: Composition-based stats. Identities = 34/101 (33%), Positives = 49/101 (48%), Gaps = 7/101 (6%) Query: 2 LALMDADGNIAWSGEYDEWGNQLNEENPHH------LHQPYRLPGQQYDKESGLYYNRNR 55 +AL D+ G + + Y +G + P+ GQ+YD+E+GLYY R R Sbjct: 1318 IALTDSTGAVVETYRYTPYGQVSFFDGNGSSISQSNESNPFLFTGQRYDEETGLYYYRAR 1377 Query: 56 YYDPLQGRYITQDPIGLEGGWSLYAYP-LNPVNGIDPLGLS 95 Y P GR++ DP G G +LY Y NP +DP G + Sbjct: 1378 YLHPELGRFLNPDPKGFVDGMNLYEYAMSNPARYVDPRGTA 1418 >UniRef50_B7GX39 Protein rhsD n=4 Tax=Acinetobacter baumannii RepID=B7GX39_ACIB3 Length = 1590 Score = 121 bits (303), Expect = 2e-26, Method: Composition-based stats. Identities = 40/97 (41%), Positives = 55/97 (56%), Gaps = 6/97 (6%) Query: 4 LMDADGNIAWSGEYDEWGNQ-----LNEENPHHLHQPYRLPGQQYDKESGLYYNRNRYYD 58 + + G W D WG LN++NP R GQ YD+E+ L+YNR RYY+ Sbjct: 1301 MTNIRGECVWEILQDTWGAVSQIKALNQDNPFE-QNNLRFQGQYYDRETELHYNRYRYYE 1359 Query: 59 PLQGRYITQDPIGLEGGWSLYAYPLNPVNGIDPLGLS 95 P RY+++DPIGLEGG + +Y +P IDP GL+ Sbjct: 1360 PHSARYVSKDPIGLEGGMNTSSYVSDPNQWIDPKGLN 1396 >UniRef50_A9AKU8 Type VI secretion system Vgr family protein n=10 Tax=Burkholderia RepID=A9AKU8_BURM1 Length = 1981 Score = 120 bits (302), Expect = 3e-26, Method: Composition-based stats. Identities = 43/124 (34%), Positives = 61/124 (49%), Gaps = 2/124 (1%) Query: 4 LMDADGNIAWSGEYDEWGNQLNEENPHHLHQPYRLPGQQYDKESGLYYNRNRYYDPLQGR 63 L D + W+ + +G + P R PGQ D+ESGL+YNR RYYDP+ GR Sbjct: 1747 LYDEQREVLWAADLSAYGRTAR-WLTRVVDNPIRFPGQYRDEESGLHYNRFRYYDPMVGR 1805 Query: 64 YITQDPIGLEGGWSLYAYPLN-PVNGIDPLGLSPADVALIRRKDQLNHQRAWDILSDTYE 122 YI QDPI +GG + Y+Y + P DP GL +A+ + A +I + Sbjct: 1806 YINQDPIAFDGGINFYSYADSAPNIAYDPKGLFVPLIAVASFLGRGALGGAVEIAMQGAK 1865 Query: 123 DMKR 126 + R Sbjct: 1866 QVFR 1869 >UniRef50_B7GLY4 Rhs family protein n=3 Tax=Anoxybacillus flavithermus WK1 RepID=B7GLY4_ANOFW Length = 563 Score = 120 bits (302), Expect = 3e-26, Method: Composition-based stats. Identities = 38/99 (38%), Positives = 51/99 (51%), Gaps = 5/99 (5%) Query: 1 MLALMDADGNIAWSGEYDEWGNQLNEENPHHLHQPYRLPGQQYDKESGLYYNRNRYYDPL 60 ++AL DA GN+ EYD WG ++ PYR G QYD+E+GLYY RYY P Sbjct: 314 VIALTDAQGNVVARYEYDTWGQIRSQTGVLADENPYRYAGYQYDEETGLYYLMARYYHPT 373 Query: 61 QGRYITQDPIGLEGG----WSLYAYP-LNPVNGIDPLGL 94 G +++ DP + + Y Y NPV +DP G Sbjct: 374 HGVFLSLDPDPGDADDILTQNGYTYANNNPVMLVDPDGH 412 >UniRef50_D2UF28 Putative rhs family protein n=2 Tax=Xanthomonas albilineans RepID=D2UF28_XANAL Length = 1812 Score = 120 bits (302), Expect = 3e-26, Method: Composition-based stats. Identities = 47/172 (27%), Positives = 67/172 (38%), Gaps = 5/172 (2%) Query: 4 LMDADGNIAWSGEYDEWGNQLNEENPHHLHQPYRLPGQQYDKESGLYYNRNRYYDPLQGR 63 L D +GN +YD +G + PY+ G++ D SGLYY R RYY P R Sbjct: 1556 LTDTNGNAVQRYDYDPYGTTTQSSAS--YNNPYQYTGRERDA-SGLYYYRARYYTPELAR 1612 Query: 64 YITQDPIGLEGGWSLYAYP-LNPVNGIDPLGLSPADVALIRRKDQLNHQRAWDILSDTYE 122 +I++DPI L GG + YAY NPV DP+G A+ + A Sbjct: 1613 FISEDPIKLAGGVNTYAYTGGNPVMYRDPVGHEFVTAAIGAVLGGVAGYEAGGWQGAVAG 1672 Query: 123 DMKRLNLGGTDQFFHCMAFCRVSKLNDAGVSRSAKGLGYEKEIRDYGLNLFG 174 + +G ++ AG+ SA G + G Sbjct: 1673 GLVGGVVGFVAPQASAFV-GTLAGEGLAGMFASAGATVVIGGASGAGATVLG 1723 >UniRef50_B5JS46 NHL repeat containing protein n=2 Tax=gamma proteobacterium HTCC5015 RepID=B5JS46_9GAMM Length = 2515 Score = 120 bits (302), Expect = 3e-26, Method: Composition-based stats. Identities = 48/190 (25%), Positives = 73/190 (38%), Gaps = 11/190 (5%) Query: 2 LALMDADGNIAWSGEYDEWGNQLNEENPHHLHQPYRLPGQQYDKESGLYYNRNRYYDPLQ 61 + + ADG IA YDE+G + + NP QP+ G YD+ + L R YD Sbjct: 2333 MVVNTADGAIAQQMSYDEFGQVVEDSNPG--FQPFGFAGGIYDQHTKLTRFGARDYDAET 2390 Query: 62 GRYITQDPIGLEGGWSLYAYPLN-PVNGIDPLGLSPADVALIRRKDQLNHQRAWDILSDT 120 GR+ T+DPI EGG +L+ Y N PVN +D GL + + W + Sbjct: 2391 GRWTTKDPIRFEGGLNLFGYVANDPVNWVDIWGLEGEQATVELVNPR---GADWSEAREN 2447 Query: 121 YEDMKRLNLGGTDQFFHCMAFCRVSKLNDAGVSRSAKGLGYEKEIRDYGLNLFGMYGRKV 180 + G + R D V+ Y D+ ++ + Sbjct: 2448 NPHLPGPQYPGDYATEYTDKAER-----DVCVTCPNGEKYYTPIPGDHLPDIMDKFNNGW 2502 Query: 181 KLSHSEMIED 190 K H + E+ Sbjct: 2503 KKEHPDPTEE 2512 >UniRef50_D1W448 RHS repeat-associated core domain protein n=1 Tax=Prevotella buccalis ATCC 35310 RepID=D1W448_9BACT Length = 195 Score = 120 bits (302), Expect = 3e-26, Method: Composition-based stats. Identities = 40/123 (32%), Positives = 62/123 (50%), Gaps = 1/123 (0%) Query: 2 LALMDADGNIAWSGEYDEWGNQLNEENPHHLHQPYRLPGQQYDKESGLYYNRNRYYDPLQ 61 + +D+ G + W D +G+ L P+R GQ D E+GLYYNR RYYDP Sbjct: 53 IQALDSKGEVVWDCILDIYGDVLELRGKRDF-IPFRFQGQYEDGETGLYYNRFRYYDPNS 111 Query: 62 GRYITQDPIGLEGGWSLYAYPLNPVNGIDPLGLSPADVALIRRKDQLNHQRAWDILSDTY 121 G +I+QDPI + GG+++YAY + + +D GLS + + + + T+ Sbjct: 112 GTFISQDPISILGGFNIYAYVHDVNSWVDVFGLSKYSPIEVLGRKVYQNSADFGGGVPTF 171 Query: 122 EDM 124 D Sbjct: 172 VDP 174 >UniRef50_C6CI62 YD repeat protein n=7 Tax=Enterobacteriaceae RepID=C6CI62_DICZE Length = 1475 Score = 120 bits (302), Expect = 3e-26, Method: Composition-based stats. Identities = 53/200 (26%), Positives = 71/200 (35%), Gaps = 17/200 (8%) Query: 3 ALMDADGNIAWSGEYDEWGNQLNEENPHHLHQPYRLPGQQYDKESGLYYNRNRYYDPLQG 62 AL DG + W Q E GQ D ESGL YNR RYYDP G Sbjct: 1226 ALFTPDGTLRWQAPTATLWGQRQAEKSESPDPGLAFAGQLRDSESGLCYNRFRYYDPAGG 1285 Query: 63 RYITQDPIGLEGGWSLYAYPLNPVNGIDPLGLSPAD--------------VALIRRKDQL 108 Y++ DPIG+ GG + Y Y NP+ +DP GL+ Sbjct: 1286 CYVSPDPIGIAGGENNYGYVQNPMCWVDPFGLAGCSSYLNAWGGRNAKAFKNFWNNSSHA 1345 Query: 109 NHQRAWDILSDTYEDMKRLNLGGTDQFFHCMAFCRVSKLN---DAGVSRSAKGLGYEKEI 165 + +AW+ + RL GG + VS+ + G S Sbjct: 1346 DFMKAWNNPAFKKNIKARLRGGGLTNSGGYHEWLPVSQGDKFKKMGFSFEEYMSLTSPTN 1405 Query: 166 RDYGLNLFGMYGRKVKLSHS 185 + L+ FG YG + Sbjct: 1406 KVGFLDKFGNYGSHTSYTGG 1425 >UniRef50_B3PHE6 RHS Repeat family n=5 Tax=cellular organisms RepID=B3PHE6_CELJU Length = 3998 Score = 120 bits (301), Expect = 3e-26, Method: Composition-based stats. Identities = 43/167 (25%), Positives = 66/167 (39%), Gaps = 10/167 (5%) Query: 4 LMDADGNIAWSGEYDEWGNQLNEENPHHLHQPYRLPGQQYDKESGLYYNRNRYYDPLQGR 63 + D++GNI + YD +GN + + NP P+ G D ++GL R YDP GR Sbjct: 3781 ITDSNGNIVKTVSYDSYGNIIEDSNPE-FQIPFGFAGGLKDDDTGLIRFGYRDYDPETGR 3839 Query: 64 YITQDPIGLEGG-WSLYAYP-LNPVNGIDPLGLSPADVALIRRKDQLNHQRAW------- 114 + +DPIG EGG +LY Y + +N ID GL R Q Sbjct: 3840 WTARDPIGFEGGDTNLYGYVLGDAINFIDIDGLQRNTSNSWNRHQQYYGGTGMTRSQLSQ 3899 Query: 115 DILSDTYEDMKRLNLGGTDQFFHCMAFCRVSKLNDAGVSRSAKGLGY 161 +D + + + ++ +G S+ GY Sbjct: 3900 QYRADQLKQWASGRNPDLHWWLGNLPDNSQIQIVGSGGGTSSLIFGY 3946 >UniRef50_Q2SIG5 Rhs family protein n=1 Tax=Hahella chejuensis KCTC 2396 RepID=Q2SIG5_HAHCH Length = 1427 Score = 120 bits (300), Expect = 4e-26, Method: Composition-based stats. Identities = 38/99 (38%), Positives = 57/99 (57%), Gaps = 3/99 (3%) Query: 4 LMDADGNIAWSGEYDEWGNQLNEENPHHLHQPYRLPGQQYDKESGLYYNRNRYYDPLQGR 63 + +G I W Y+ +G + R PGQ +D+ESG+++N R Y+P GR Sbjct: 1166 MASRNGQIVWKAAYEVFGRVKIFVD--KAENNLRFPGQYFDQESGMHHNYFRDYNPGYGR 1223 Query: 64 YITQDPIGLEGGWSLYAYPL-NPVNGIDPLGLSPADVAL 101 YI +DPI + GG ++YAY NPV +DPLGL+ +V + Sbjct: 1224 YIQRDPISVYGGINVYAYANGNPVVYMDPLGLAKMNVGV 1262 >UniRef50_C9RZN2 YD repeat protein n=2 Tax=Geobacillus RepID=C9RZN2_GEOSY Length = 678 Score = 120 bits (300), Expect = 5e-26, Method: Composition-based stats. Identities = 39/98 (39%), Positives = 52/98 (53%), Gaps = 5/98 (5%) Query: 1 MLALMDADGNIAWSGEYDEWGNQLNEENPHHLHQPYRLPGQQYDKESGLYYNRNRYYDPL 60 ++AL D GNI +YD GN L++ PYR G QYD+E+GLYY RYY P Sbjct: 468 VIALTDEQGNIVARYQYDARGNILSQSGALADENPYRYAGYQYDQETGLYYLIARYYHPE 527 Query: 61 QGRYITQDPIGLEGG----WSLYAYP-LNPVNGIDPLG 93 G +++ DP + + YAY NPV +DP G Sbjct: 528 HGVFLSLDPDPGDADDLLTQNGYAYANNNPVMFVDPDG 565 >UniRef50_C6CNW6 YD repeat protein n=8 Tax=Enterobacteriaceae RepID=C6CNW6_DICZE Length = 1679 Score = 120 bits (300), Expect = 5e-26, Method: Composition-based stats. Identities = 53/200 (26%), Positives = 85/200 (42%), Gaps = 27/200 (13%) Query: 4 LMDADGNIAWSGEYDEWGNQLNEENP------------HHLHQPYRLPGQQYDKESGLYY 51 L G + W GE WG E+ P ++ R GQ YD E+GLYY Sbjct: 1461 LCSETGEVHWRGEQALWGAHREEKIPIPLRRWLGDAANEEVYCELRYQGQVYDSETGLYY 1520 Query: 52 NRNRYYDPLQGRYITQDPIGLEGGWSLYAYPLNPVNGIDPLGLSPADVALIRRKDQLNHQ 111 NR+RYYDP +Y++ DP+G+ GG Y NP+ +DP GL V +K +++ Sbjct: 1521 NRHRYYDPETAQYLSGDPLGIAGGLRPQGYVHNPMEWVDPFGLVGCPV----KKYEVS-- 1574 Query: 112 RAWDILSDTYEDMKRLNLGGTD-QFFHCMAFCRVSKLNDAGVSRSAKGLGYEKEIRDYGL 170 TY D+K ++ G + H M ++ ++A + K Sbjct: 1575 --------TYNDLKNRSVSGDELDIHHAMQKHPAGQVVPGYDPKTAPSIAIPKVEHQEIP 1626 Query: 171 NLFGMYGRKVKLSHSEMIED 190 + G Y + ++ ++D Sbjct: 1627 TMKGPYTGSARDLLAKDVKD 1646 >UniRef50_A8ZSG6 YD repeat protein n=1 Tax=Desulfococcus oleovorans Hxd3 RepID=A8ZSG6_DESOH Length = 423 Score = 119 bits (299), Expect = 6e-26, Method: Composition-based stats. Identities = 39/92 (42%), Positives = 53/92 (57%), Gaps = 3/92 (3%) Query: 4 LMDADGNIAWSGEYDEWGNQLNEENPHHLHQPYRLPGQQYDKESGLYYNRNRYYDPLQGR 63 L ++G + WS +Y+ +G+ E + R PGQ +D ESGL+YN +RYY P GR Sbjct: 260 LTASNGAVVWSAKYESFGDATVEI--ETVENNLRFPGQYFDGESGLHYNLHRYYAPELGR 317 Query: 64 YITQDPIGLEGGWSLYAYP-LNPVNGIDPLGL 94 ++ DPIGL GG + Y Y N N DP GL Sbjct: 318 FLKDDPIGLRGGINQYIYADNNVSNNTDPYGL 349 >UniRef50_B6VUY0 Putative uncharacterized protein n=2 Tax=Bacteroides RepID=B6VUY0_9BACE Length = 1442 Score = 119 bits (299), Expect = 6e-26, Method: Composition-based stats. Identities = 42/94 (44%), Positives = 53/94 (56%), Gaps = 1/94 (1%) Query: 2 LALMDADGNIAWSGEYDEWGNQLNEENPHHLHQPYRLPGQQYDKESGLYYNRNRYYDPLQ 61 + + D GN W D +G L + P+R GQ D+E+GLYYNR RYYD Sbjct: 1219 IQMYDEQGNKTWDCTLDIYGKVLAIDKGTEFDCPFRYQGQYEDEETGLYYNRFRYYDSNA 1278 Query: 62 GRYITQDPIGLE-GGWSLYAYPLNPVNGIDPLGL 94 G YI+QDPIGLE + Y Y + +GIDPLGL Sbjct: 1279 GSYISQDPIGLESDTLNFYDYVCDLNDGIDPLGL 1312 >UniRef50_A9EW02 Conserved exported carbohydrate-binding protein,Rhs family n=1 Tax=Sorangium cellulosum 'So ce 56' RepID=A9EW02_SORC5 Length = 1310 Score = 119 bits (299), Expect = 6e-26, Method: Composition-based stats. Identities = 43/120 (35%), Positives = 68/120 (56%), Gaps = 4/120 (3%) Query: 4 LMDADGNIAWSGEYDEWGNQLNEENPHHLHQPYRLPGQQYDKESGLYYNRNRYYDPLQGR 63 L+ G +AW ++ WG L EE+P + P+ L G D+E+GL Y R+RY+DP R Sbjct: 1095 LVSGRGQVAWRADHTLWGRVLAEESPAGVRAPFSLLGHYVDEETGLAYVRHRYFDPETAR 1154 Query: 64 YITQDPIGLEGGWSLYAYPLNPVNGIDPLGL-SPADVALIR---RKDQLNHQRAWDILSD 119 ++++DP+G +GG +L+ + P +DPLGL + AD R+ ++ QRA D Sbjct: 1155 WLSRDPLGFDGGPNLFGFDGMPTEEVDPLGLMTRADFVQFMRQYRRQRIQAQRASDQAQS 1214 >UniRef50_A9GIJ6 Conserved carbohydrate-binding protein, Rhs family n=1 Tax=Sorangium cellulosum 'So ce 56' RepID=A9GIJ6_SORC5 Length = 1429 Score = 119 bits (299), Expect = 7e-26, Method: Composition-based stats. Identities = 43/92 (46%), Positives = 53/92 (57%), Gaps = 2/92 (2%) Query: 4 LMDADGNIAWSGEYDEWGNQLNEENPHHLHQPYRLPGQQYDKESGLYYNRNRYYDPLQGR 63 L+DA GN+A + WG E P R GQ YD E+GL YNR RYYDP GR Sbjct: 1056 LLDAAGNVACELDRTVWGAARPREGART-ETPLRFLGQYYDDETGLAYNRYRYYDPAVGR 1114 Query: 64 YITQDPIGLEGGWSLYAYPLN-PVNGIDPLGL 94 YI+ DP+GL GG + ++Y N P +DP GL Sbjct: 1115 YISVDPVGLLGGQNGFSYAGNRPTKMVDPTGL 1146 >UniRef50_C0FSB7 Putative uncharacterized protein n=1 Tax=Roseburia inulinivorans DSM 16841 RepID=C0FSB7_9FIRM Length = 306 Score = 119 bits (298), Expect = 7e-26, Method: Composition-based stats. Identities = 44/105 (41%), Positives = 52/105 (49%), Gaps = 11/105 (10%) Query: 6 DADGNIAWSGEYDEWGNQ----------LNEENPHHLHQPYRLPGQQYDKESGLYYNRNR 55 D +G W E D +G E P+R GQ DKE GLYYNR R Sbjct: 131 DGEGIKVWERELDIYGRVKNVGKGSDRSAAPETGEQCFIPFRFQGQYEDKEIGLYYNRFR 190 Query: 56 YYDPLQGRYITQDPIGLEGG-WSLYAYPLNPVNGIDPLGLSPADV 99 YYDP G+Y QDPIGL GG +LY Y N + +DP GL D+ Sbjct: 191 YYDPSLGQYTQQDPIGLAGGNPTLYGYVFNTMWELDPFGLDWKDL 235 >UniRef50_D1WVZ1 YD repeat protein n=1 Tax=Streptomyces sp. ACT-1 RepID=D1WVZ1_9ACTO Length = 2294 Score = 119 bits (298), Expect = 8e-26, Method: Composition-based stats. Identities = 39/97 (40%), Positives = 52/97 (53%), Gaps = 4/97 (4%) Query: 1 MLALMDADGNIAWSGEYDEWGNQLNEENPHHLHQPYRLPGQQYDKESGLYYNRNRYYDPL 60 ++ + + DG IA YD G + PY G++ D +GL Y RNRYYDP Sbjct: 1897 IVGMANTDGTIATRYTYDPNGQPT--TSGAASSNPYTFTGRESDG-TGLLYYRNRYYDPE 1953 Query: 61 QGRYITQDPIGLEGGWSLYAYP-LNPVNGIDPLGLSP 96 GR+I+QDPIG GG +LY Y +P DP G +P Sbjct: 1954 SGRFISQDPIGHAGGTNLYQYALSSPTTYTDPSGNNP 1990 >UniRef50_Q07833 Wall-associated protein n=18 Tax=Bacillaceae RepID=WAPA_BACSU Length = 2334 Score = 119 bits (298), Expect = 8e-26, Method: Composition-based stats. Identities = 39/101 (38%), Positives = 52/101 (51%), Gaps = 6/101 (5%) Query: 1 MLALMDADGNIAWSGEYDEWGNQLNEENPHHL-HQPYRLPGQQYDKESGLYYNRNRYYDP 59 ++A+ D+ G +YD WGN E + YR G QYD+E+GLYY RYY+P Sbjct: 2088 IIAISDSTGKTVAKYQYDAWGNPTKTEASDEVKDNRYRYAGYQYDEETGLYYLMARYYEP 2147 Query: 60 LQGRYITQDPIGLEGGWSL----YAYP-LNPVNGIDPLGLS 95 G +++ DP G SL YAY NPV +DP G Sbjct: 2148 RNGVFLSLDPDPGSDGDSLDQNGYAYGNNNPVMNVDPDGHW 2188 >UniRef50_Q395C2 Rhs family protein n=1 Tax=Burkholderia sp. 383 RepID=Q395C2_BURS3 Length = 190 Score = 119 bits (298), Expect = 8e-26, Method: Composition-based stats. Identities = 41/66 (62%), Positives = 46/66 (69%) Query: 29 PHHLHQPYRLPGQQYDKESGLYYNRNRYYDPLQGRYITQDPIGLEGGWSLYAYPLNPVNG 88 H + P R PGQ YD+ESGL+YNR RY DP GRYI QDPIGL+GG + Y Y NPV Sbjct: 6 AHAVDNPIRFPGQYYDRESGLHYNRFRYCDPQVGRYINQDPIGLKGGANSYVYAHNPVTL 65 Query: 89 IDPLGL 94 DPLGL Sbjct: 66 SDPLGL 71 >UniRef50_A3UDD5 Wall associated protein n=1 Tax=Oceanicaulis alexandrii HTCC2633 RepID=A3UDD5_9RHOB Length = 1693 Score = 119 bits (298), Expect = 8e-26, Method: Composition-based stats. Identities = 36/99 (36%), Positives = 51/99 (51%), Gaps = 5/99 (5%) Query: 2 LALMDADGNIAWSGEYDEWGNQLNEENPHHLHQPYRLPGQQYDKESGLYYNRNRYYDPLQ 61 + + D+ G + + Y +G + P+R GQ+ D E+GLYY + RYYDP Sbjct: 1400 IVISDSAGAVIDTHTYSPFGQAGEGDGGF----PFRFTGQKLDPETGLYYYKARYYDPEL 1455 Query: 62 GRYITQDPIGLEGGWSLYAYPLN-PVNGIDPLGLSPADV 99 GR++ DPIG +LYAY N PVN D GL + Sbjct: 1456 GRFLQTDPIGYADQMNLYAYVGNDPVNLRDSSGLCGTRI 1494 >UniRef50_C3JCI6 Rhs family protein n=4 Tax=Bacteria RepID=C3JCI6_9PORP Length = 1387 Score = 119 bits (298), Expect = 8e-26, Method: Composition-based stats. Identities = 36/91 (39%), Positives = 49/91 (53%), Gaps = 3/91 (3%) Query: 5 MDADGNIAWSGEYDEWGNQLNEENPHHLHQPYRLPGQQYDKESGLYYNRNRYYDPLQGRY 64 D+ G W D +G ++ P+ GQ +D+E L YNR RYYDP G Y Sbjct: 1192 FDSHGAKVWERNLDIYGKIRTGDSTLV---PFLFQGQYFDEEIDLCYNRFRYYDPSTGTY 1248 Query: 65 ITQDPIGLEGGWSLYAYPLNPVNGIDPLGLS 95 I+QDPI + G ++YAY + + IDP GLS Sbjct: 1249 ISQDPISIAGRLNVYAYVHDSNSWIDPFGLS 1279 >UniRef50_C7QAC6 YD repeat protein n=2 Tax=Bacteria RepID=C7QAC6_CATAD Length = 1508 Score = 119 bits (298), Expect = 9e-26, Method: Composition-based stats. Identities = 38/94 (40%), Positives = 52/94 (55%), Gaps = 2/94 (2%) Query: 4 LMDADGNIAWSGEYDEWGNQLNEENPHH--LHQPYRLPGQQYDKESGLYYNRNRYYDPLQ 61 L+ DG +AW D +G + L P R GQ +D E+GL+YN RYYDP Sbjct: 1308 LVTPDGRVAWYQNTDLYGQSVAVATGGDPDLECPLRFAGQYFDAETGLHYNVQRYYDPAI 1367 Query: 62 GRYITQDPIGLEGGWSLYAYPLNPVNGIDPLGLS 95 Y+T DP+GL + +AY NP+ +DPLGL+ Sbjct: 1368 AAYLTPDPLGLAPALNDHAYVPNPLTMVDPLGLA 1401 >UniRef50_D0HCV2 Rhs protein n=3 Tax=Vibrio mimicus VM223 RepID=D0HCV2_VIBMI Length = 1617 Score = 119 bits (297), Expect = 1e-25, Method: Composition-based stats. Identities = 43/105 (40%), Positives = 55/105 (52%), Gaps = 11/105 (10%) Query: 4 LMDADGNIAWSGEYDEWG-----------NQLNEENPHHLHQPYRLPGQQYDKESGLYYN 52 L +G + W GE WG + L+ R GQ D+ESGLYYN Sbjct: 1382 LCSENGEVVWQGEQALWGHYQQRNTFPNHGIREHAHNDELYCDLRYQGQIEDRESGLYYN 1441 Query: 53 RNRYYDPLQGRYITQDPIGLEGGWSLYAYPLNPVNGIDPLGLSPA 97 NRYYD G+Y++QDPIG GG AY NP+ +DPLGL+P+ Sbjct: 1442 VNRYYDADSGQYLSQDPIGFSGGLRPQAYVFNPLEWVDPLGLAPS 1486 >UniRef50_C4KA75 YD repeat protein n=1 Tax=Thauera sp. MZ1T RepID=C4KA75_THASP Length = 1892 Score = 119 bits (297), Expect = 1e-25, Method: Composition-based stats. Identities = 40/93 (43%), Positives = 51/93 (54%), Gaps = 3/93 (3%) Query: 5 MDADGNIAWSGEYDEWGNQLNEENPHH--LHQPYRLPGQQYDKESGLYYNRNRYYDPLQG 62 DADG W + +G +P H RLPGQ +D E+GL+YNR RYY P G Sbjct: 1388 TDADGEPLWRARHAPFGAATVTTSPRHPDFTLDLRLPGQVFDAETGLHYNRRRYYAPTLG 1447 Query: 63 RYITQDPIGLEGGWSLYAY-PLNPVNGIDPLGL 94 Y+T DP+G G + YAY NP+ +DP GL Sbjct: 1448 EYLTPDPLGTPDGPNPYAYAAFNPLRNVDPDGL 1480 >UniRef50_D0KZB0 YD repeat protein n=1 Tax=Halothiobacillus neapolitanus c2 RepID=D0KZB0_HALNC Length = 1467 Score = 119 bits (297), Expect = 1e-25, Method: Composition-based stats. Identities = 40/91 (43%), Positives = 52/91 (57%), Gaps = 1/91 (1%) Query: 5 MDADGNIAWSGEYDEWGNQLNEENPHHLHQPYRLPGQQYDKESGLYYNRNRYYDPLQGRY 64 +A+ W D +G++ + + R PGQ YD E+GL+YN NRYYDP GRY Sbjct: 1193 TNANQQTVWRWNRDAFGDRQVNASSASIEMNLRYPGQYYDTETGLFYNWNRYYDPSTGRY 1252 Query: 65 ITQDPIGLEGGWSLYAYP-LNPVNGIDPLGL 94 T DPIGL GG + + Y NP+ IDP GL Sbjct: 1253 ATSDPIGLSGGVNTFGYVSANPLALIDPWGL 1283 >UniRef50_B3E8D4 YD repeat protein n=1 Tax=Geobacter lovleyi SZ RepID=B3E8D4_GEOLS Length = 1464 Score = 119 bits (297), Expect = 1e-25, Method: Composition-based stats. Identities = 41/191 (21%), Positives = 74/191 (38%), Gaps = 3/191 (1%) Query: 1 MLALMDADGNIAWSGEYDEWGNQLNEENPHHLHQPYRLPGQQYDKESGLYYNRNRYYDPL 60 + A++++ G++ S YD +G L L QP R + YD+ +GLYY +R+Y P Sbjct: 1190 VTAVLNSSGSVVASYAYDPFGGTLAASGT--LSQPIRYSTKLYDEGTGLYYFGHRFYSPQ 1247 Query: 61 QGRYITQDPIGLEGGWSLYAY-PLNPVNGIDPLGLSPADVALIRRKDQLNHQRAWDILSD 119 GR++++DP+ +LY + NP+ DP G + R + Q Q + L Sbjct: 1248 MGRWLSRDPLSEMASINLYRFAANNPLTHFDPFGAADNGGFWDRPEVQAQLQAQREALQA 1307 Query: 120 TYEDMKRLNLGGTDQFFHCMAFCRVSKLNDAGVSRSAKGLGYEKEIRDYGLNLFGMYGRK 179 + + ++RS + + G + Sbjct: 1308 KRASEVTTFEKVKNSIGSAFKWIGDKFKEQPEMARSIEEKVANTALEANKYTKAGKEWNE 1367 Query: 180 VKLSHSEMIED 190 + +M ED Sbjct: 1368 RINTGIQMAED 1378 >UniRef50_A4B8X0 YD repeat n=1 Tax=Reinekea blandensis MED297 RepID=A4B8X0_9GAMM Length = 1098 Score = 119 bits (297), Expect = 1e-25, Method: Composition-based stats. Identities = 44/92 (47%), Positives = 52/92 (56%), Gaps = 2/92 (2%) Query: 4 LMDADGNIAWSGEYDEWGNQLNEENPHHLHQPYRLPGQQYDKESGLYYNRNRYYDPLQGR 63 L D+ GN+ W +G +E + L PGQ D ESGL YN R YDP GR Sbjct: 858 LTDSLGNVVWQQHTTPFGEV-HETLGNGLGYLQSFPGQWRDSESGLSYNYYRDYDPSLGR 916 Query: 64 YITQDPIGLEGGWSLYAYP-LNPVNGIDPLGL 94 YI DPIGL GG + YAY NP++ IDPLGL Sbjct: 917 YIQSDPIGLGGGLNTYAYVGGNPISRIDPLGL 948 >UniRef50_B8FCM0 YD repeat protein n=1 Tax=Desulfatibacillum alkenivorans AK-01 RepID=B8FCM0_DESAA Length = 1448 Score = 118 bits (296), Expect = 1e-25, Method: Composition-based stats. Identities = 46/193 (23%), Positives = 78/193 (40%), Gaps = 5/193 (2%) Query: 1 MLALMDADGNIAWSGEYDEWGNQLNEENPHHLHQPYRLPGQQYDKESGLYYNRNRYYDPL 60 + AL++A GN YD +GN L+ N QP+R ++YD+E+GLYY R+Y P+ Sbjct: 1185 VTALLNATGNACAWYAYDPYGNLLH--NTGAPVQPFRFSTKEYDEETGLYYFGRRFYSPV 1242 Query: 61 QGRYITQDPIGLEGGWSLYAYPL-NPVNGIDPLGLSPADVALI--RRKDQLNHQRAWDIL 117 R++T+DP G +LY + NP+ DPLG A++A + R A Sbjct: 1243 MARWLTRDPKGEGASLNLYEFSRSNPLAYFDPLGAQDAELAAMDARWAQARAAMEASSQS 1302 Query: 118 SDTYEDMKRLNLGGTDQFFHCMAFCRVSKLNDAGVSRSAKGLGYEKEIRDYGLNLFGMYG 177 + T D ++ + + + + + G G Sbjct: 1303 TATASDSGGFGEMMSNTVGKAFKWVGDIIQGSPDAAGTVVDKVVDTALESNKFTKMGKTG 1362 Query: 178 RKVKLSHSEMIED 190 ++ +D Sbjct: 1363 YDAIQKGFDVAQD 1375 >UniRef50_C2LFQ4 Putative uncharacterized protein n=1 Tax=Proteus mirabilis ATCC 29906 RepID=C2LFQ4_PROMI Length = 214 Score = 118 bits (295), Expect = 2e-25, Method: Composition-based stats. Identities = 39/132 (29%), Positives = 53/132 (40%), Gaps = 5/132 (3%) Query: 13 WSGEYDEWGNQLNEENPHHLHQPYRLPGQQYDKESGLYYNRNRYYDPLQGRYITQDPIGL 72 W + E +P++ P+R GQ D+ESGLYYNR RYYD G+Y++ DPIGL Sbjct: 4 WRYR-----DGKAENDPNYTECPFRFAGQYEDEESGLYYNRFRYYDRETGQYLSPDPIGL 58 Query: 73 EGGWSLYAYPLNPVNGIDPLGLSPADVALIRRKDQLNHQRAWDILSDTYEDMKRLNLGGT 132 GG + Y Y P +DP GL+ + + G Sbjct: 59 LGGLNPYGYVHCPTGWVDPFGLAGGKGNKGELDGSGRPLASPRYSVAFETKIDSKFYPGR 118 Query: 133 DQFFHCMAFCRV 144 H R Sbjct: 119 SDKVHFQEANRN 130 >UniRef50_B1HM94 Cell wall-associated protein n=5 Tax=Lysinibacillus sphaericus C3-41 RepID=B1HM94_LYSSC Length = 995 Score = 118 bits (295), Expect = 2e-25, Method: Composition-based stats. Identities = 35/98 (35%), Positives = 56/98 (57%), Gaps = 5/98 (5%) Query: 1 MLALMDADGNIAWSGEYDEWGNQLNEENPHHLHQPYRLPGQQYDKESGLYYNRNRYYDPL 60 +LAL + +G+I YD WGN L++ PYR G +YD+++ LYY RYY+P Sbjct: 781 VLALTNTNGDIVAQYTYDAWGNILSQSGTMAAINPYRYAGYRYDEKTKLYYLMARYYNPD 840 Query: 61 QGRYITQDPIGLEG----GWSLYAYP-LNPVNGIDPLG 93 G ++++DP+ + ++ Y+Y NPV +DP G Sbjct: 841 TGVFLSRDPVRGDTMTPISFNGYSYTNNNPVMNVDPSG 878 >UniRef50_A8ZTS1 YD repeat protein n=4 Tax=Desulfococcus oleovorans Hxd3 RepID=A8ZTS1_DESOH Length = 1935 Score = 118 bits (295), Expect = 2e-25, Method: Composition-based stats. Identities = 55/221 (24%), Positives = 90/221 (40%), Gaps = 14/221 (6%) Query: 4 LMDADGNIAWSGEYDEWGNQLNEENPHHLHQPYRLPGQQYDKESGLYYNRNRYYDPLQGR 63 + ADG +A + EY +G +++ +R + +D ESGLY RYYDP GR Sbjct: 1664 VSAADGTVAAAYEYAPFGGLIHKSGVMADENVFRFSTKYWDGESGLYEYGLRYYDPETGR 1723 Query: 64 YITQDPIGLEGGWSLYAYPLN-PVNGIDPLGLSPADVALIRRKDQLNHQRAWDILSDTYE 122 +I++DP+G GG +LY++ +N P N D LGL + D + W Y Sbjct: 1724 WISRDPVGESGGLNLYSFVMNDPTNFFDLLGLVQVGGSEGYSSDSMYIDITWYKPKSLYL 1783 Query: 123 DMKRLNLGGTDQFFHCMAFCRVSKLND---AGVSRSAKGLGYEKEIRDYGLNLFGMYGRK 179 K G R ++ + + +LF Y R+ Sbjct: 1784 LFKCK---GCPDVNEAPVAGRTVDIDYDKSNFKVETIIKYVLKIVSGGEDADLFWTYNRE 1840 Query: 180 VKLSHSEMIEDNKKDLAVND---HGLTCPSTTDCS---DRC 214 + +S + + KKD+ + ++C +CS D C Sbjct: 1841 KRY-YSSLPGEAKKDIEIGRPCWVKISCEVNCECSCGGDAC 1880 >UniRef50_Q0K1I3 Insecticidal toxin complex protein n=2 Tax=Proteobacteria RepID=Q0K1I3_RALEH Length = 2644 Score = 118 bits (295), Expect = 2e-25, Method: Composition-based stats. Identities = 36/111 (32%), Positives = 52/111 (46%), Gaps = 3/111 (2%) Query: 5 MDADGNIAWSGEYDEWGNQLNEENPHHLHQ--PYRLPGQQYDKESGLYYNRNRYYDPLQG 62 +D I EY +G+ + YR G++ D+ESGLYY+ RYY P G Sbjct: 2261 LDEQAQIISYEEYAPYGSSTYQAVRSQTETAKRYRYTGKEQDEESGLYYHGARYYAPWLG 2320 Query: 63 RYITQDPIGLEGGWSLYAYP-LNPVNGIDPLGLSPADVALIRRKDQLNHQR 112 R+ DP G E G +LY Y NP +DP G S +A++ + + Sbjct: 2321 RWTACDPAGEEEGPNLYQYCFGNPTGFVDPDGQSGKKLAIVMHNESKPAGK 2371 >UniRef50_C0FSB4 Putative uncharacterized protein n=1 Tax=Roseburia inulinivorans DSM 16841 RepID=C0FSB4_9FIRM Length = 223 Score = 117 bits (294), Expect = 2e-25, Method: Composition-based stats. Identities = 40/92 (43%), Positives = 51/92 (55%), Gaps = 2/92 (2%) Query: 6 DADGNIAWSGEYDEWGNQLNEEN-PHHLHQPYRLPGQQYDKESGLYYNRNRYYDPLQGRY 64 D +G W D +G EE P+R GQ D+E+GLYYNR RYY P +G Y Sbjct: 5 DEEGKKVWERNLDIYGRVKTEEALGEKNLIPFRFQGQYEDEETGLYYNRFRYYSPEEGCY 64 Query: 65 ITQDPIGLEGG-WSLYAYPLNPVNGIDPLGLS 95 QDPIGL GG +LY Y + + +DP GL+ Sbjct: 65 TQQDPIGLAGGNPTLYGYVYDTLCELDPFGLA 96 >UniRef50_D0BWK2 YD repeat protein n=1 Tax=Acinetobacter sp. RUH2624 RepID=D0BWK2_9GAMM Length = 1361 Score = 117 bits (294), Expect = 2e-25, Method: Composition-based stats. Identities = 43/158 (27%), Positives = 70/158 (44%), Gaps = 4/158 (2%) Query: 4 LMDADGNIAWSGEYDEWGNQLNEENPHHLHQPYRLPGQQYDKESGLYYNRNRYYDPLQGR 63 L+D++G++ WS + +G + R PGQ YD +G +YN NR+Y+P GR Sbjct: 1054 LIDSNGSVVWSWDSTAFGL---GSPVSTITFNLRFPGQYYDATTGQFYNHNRFYNPELGR 1110 Query: 64 YITQDPIGLEGGWSLYAYP-LNPVNGIDPLGLSPADVALIRRKDQLNHQRAWDILSDTYE 122 Y+ DPIGL GG + Y Y NPV +D G +P +A+ ++ + Sbjct: 1111 YMEPDPIGLAGGLNPYIYALNNPVMYVDMTGENPILIAMGVSAATAGAFYTGEVFFNAVY 1170 Query: 123 DMKRLNLGGTDQFFHCMAFCRVSKLNDAGVSRSAKGLG 160 D + + D F + + + G + G Sbjct: 1171 DAYKNDQSILDSFSKSFSIKELGQQMVIGAAFGGVGKA 1208 >UniRef50_C7QG23 YD repeat protein n=2 Tax=Catenulispora acidiphila DSM 44928 RepID=C7QG23_CATAD Length = 1528 Score = 117 bits (293), Expect = 3e-25, Method: Composition-based stats. Identities = 47/174 (27%), Positives = 69/174 (39%), Gaps = 14/174 (8%) Query: 4 LMDADGNIAWSGEYDEWGNQLNEENPHHLHQPYRLPGQQYDKESGLYYNRNRYYDPLQGR 63 L+ DG+I W +G + + H P R PGQ D E+GL+YN +RYYDP +G Sbjct: 1268 LVAPDGSIDWYTTTSLYGTTIATSSDHGADCPLRFPGQFRDDETGLHYNVHRYYDPERGS 1327 Query: 64 YITQDPIGLEGGWSLYAYPLNPVNGIDPLGLSPADVALIRRKDQLNHQRAWDILSDTYED 123 Y++ DP+GL + AY NP+ DPLGL + A + D + Sbjct: 1328 YLSPDPLGLAAAPNDQAYVANPMVSADPLGLICLNAA--------------QQIKDRVDF 1373 Query: 124 MKRLNLGGTDQFFHCMAFCRVSKLNDAGVSRSAKGLGYEKEIRDYGLNLFGMYG 177 + Q + +A R V K + + G G Sbjct: 1374 LHGKFEAPYGQRENTVAIIRAMDKEGNIVHVVGWSGKESKTLFGDVDDEIGKNG 1427 >UniRef50_B2Q762 Putative uncharacterized protein n=2 Tax=Providencia stuartii ATCC 25827 RepID=B2Q762_PROST Length = 173 Score = 117 bits (293), Expect = 3e-25, Method: Composition-based stats. Identities = 35/93 (37%), Positives = 46/93 (49%) Query: 30 HHLHQPYRLPGQQYDKESGLYYNRNRYYDPLQGRYITQDPIGLEGGWSLYAYPLNPVNGI 89 Q R GQ +D E+GL++N R+YDP GR+I DPIGL GG +LYAY NP+ I Sbjct: 2 ESFEQNLRYAGQYFDNETGLHFNTFRFYDPQIGRFIMPDPIGLLGGINLYAYAPNPLGWI 61 Query: 90 DPLGLSPADVALIRRKDQLNHQRAWDILSDTYE 122 DP G S + + L + Sbjct: 62 DPWGWSHLNTNGATGNFGVYKIEIDGQLYKYGK 94 >UniRef50_C8W2V9 YD repeat protein n=3 Tax=Desulfotomaculum acetoxidans DSM 771 RepID=C8W2V9_DESAS Length = 2658 Score = 117 bits (293), Expect = 3e-25, Method: Composition-based stats. Identities = 49/213 (23%), Positives = 83/213 (38%), Gaps = 30/213 (14%) Query: 2 LALMDADGNIAW--SGEYDEWGNQLNEENPHHLHQPYRLPGQQYDKESGLYYNRNRYYDP 59 L++ D GN +YD WG + E+ + P+R G YD E+GLYY ++RYY P Sbjct: 2417 LSMTDDYGNTDQENRYDYDPWGTPICED--ESVKSPFRYAGYYYDTETGLYYLKSRYYSP 2474 Query: 60 LQGRYITQDPIGL-----EGGWSLYAYP-LNPVNGIDPLGLSPADVALIRRKDQLNHQRA 113 GR++T+DP +LYAY NPV+ +DP G + + Sbjct: 2475 ALGRFLTRDPHSFINHADPQTLNLYAYCGNNPVSTVDPTGHWDEEPS-----------EG 2523 Query: 114 WDILSDTYEDMKRLNLGGTDQFFHCMAFCRVSKLNDAGVSRSAKGLGYEKEIRDYGLNLF 173 + + D E+ + ++ F + + V + A + Sbjct: 2524 YVVSPDYLENTATARIINSNVFQNILMGISVGE------GLRALKFFKGAGNLKLTSDAL 2577 Query: 174 GMYGRKVKLSHSEMIE---DNKKDLAVNDHGLT 203 G K + S ++ D+ D N ++ Sbjct: 2578 KKIGTKAESSGIRAVKGTADDAWDFFRNQVNIS 2610 >UniRef50_D2KTW4 Putative uncharacterized protein n=2 Tax=Streptomyces RepID=D2KTW4_9ACTO Length = 1097 Score = 117 bits (293), Expect = 4e-25, Method: Composition-based stats. Identities = 39/91 (42%), Positives = 48/91 (52%), Gaps = 1/91 (1%) Query: 4 LMDADGNIAWSGEYDEWGNQLNEENPHHLHQPYRLPGQQYDKESGLYYNRNRYYDPLQGR 63 L+ DG++AW WG L + P R PGQ D E+GL YN +RYYDP + Sbjct: 913 LITPDGHLAWQHRTTLWGTPLPTPS-DTTTCPLRFPGQYADPETGLNYNHHRYYDPETAQ 971 Query: 64 YITQDPIGLEGGWSLYAYPLNPVNGIDPLGL 94 Y+T DP+GL AY NP DPLGL Sbjct: 972 YLTPDPLGLAPAPHPRAYVHNPHTWQDPLGL 1002 >UniRef50_C8W2A7 YD repeat protein n=3 Tax=Desulfotomaculum acetoxidans DSM 771 RepID=C8W2A7_DESAS Length = 1732 Score = 117 bits (292), Expect = 4e-25, Method: Composition-based stats. Identities = 42/129 (32%), Positives = 63/129 (48%), Gaps = 10/129 (7%) Query: 4 LMDADGNIAWSGEYDEWGNQLNEENPHHLHQPYRLPGQQYDKESGLYYNRNRYYDPLQGR 63 L + DG YD WGNQ++ + P+R G YD+E+GLYY ++RYY P GR Sbjct: 1505 LTNIDGGFYAKYNYDPWGNQISYSG--WISAPFRYAGYYYDEETGLYYLKSRYYSPALGR 1562 Query: 64 YITQDPIGL-----EGGWSLYAYPL-NPVNGIDPLGLSPADVALIRRKDQLNHQRAWDIL 117 ++T+D I +LY+Y NPVN +DP G P +D+L D Sbjct: 1563 FLTKDSIKYIKYKNPQTLNLYSYAGSNPVNNVDPTGEIPVRATWKAFEDKL--GELLDTA 1620 Query: 118 SDTYEDMKR 126 ++ + + Sbjct: 1621 KNSKKGLGN 1629 >UniRef50_Q2Y6W6 RhsD protein n=1 Tax=Nitrosospira multiformis ATCC 25196 RepID=Q2Y6W6_NITMU Length = 207 Score = 117 bits (292), Expect = 4e-25, Method: Composition-based stats. Identities = 42/111 (37%), Positives = 57/111 (51%), Gaps = 8/111 (7%) Query: 17 YDEWGNQLNEENPHHLHQ---PYRLPGQQYDKESGLYYNRNRYYDPLQGRYITQDPIGLE 73 Y +G +NP L R PGQ +D E+GL+YN R YDP GRYI DPIGL Sbjct: 5 YGPFGANPPNQNPSGLGTFSYNLRYPGQYHDAETGLHYNYFRDYDPKTGRYIQSDPIGLA 64 Query: 74 GGWSLYAYP-LNPVNGIDPLGLSPADVALIRRKDQLNHQRAWDILSDTYED 123 GG + Y Y NP++ IDP G ++A + + A+D + Y+ Sbjct: 65 GGINTYVYVEGNPLSKIDPTG----EIAFVPILIGIGVGYAFDYALERYKK 111 >UniRef50_A6GBQ3 Putative uncharacterized protein n=1 Tax=Plesiocystis pacifica SIR-1 RepID=A6GBQ3_9DELT Length = 2507 Score = 116 bits (290), Expect = 7e-25, Method: Composition-based stats. Identities = 37/125 (29%), Positives = 52/125 (41%), Gaps = 4/125 (3%) Query: 5 MDADGNIAWSGEYDEWGNQLNEENPHHLHQP---YRLPGQQYDKESGLYYNRNRYYDPLQ 61 + G I EY +G L P YR G + D+E+GL Y+ RYY P Sbjct: 2059 VSETGAIISHEEYHPYGTSAYRMVDSQLDVPPKRYRFTGMERDEETGLSYHSARYYAPWL 2118 Query: 62 GRYITQDPIGLEGGWSLYAYP-LNPVNGIDPLGLSPADVALIRRKDQLNHQRAWDILSDT 120 GR+ DPIGL G + +AY NP N +DP G + I N + + Sbjct: 2119 GRWTAADPIGLGDGVNRFAYVRGNPTNFVDPTGFAGESENHIYSDQWGNFAERAYQIGRS 2178 Query: 121 YEDMK 125 + + Sbjct: 2179 EQQLD 2183 >UniRef50_A3DF74 YD repeat protein n=17 Tax=Clostridium thermocellum RepID=A3DF74_CLOTH Length = 1959 Score = 116 bits (290), Expect = 7e-25, Method: Composition-based stats. Identities = 44/158 (27%), Positives = 62/158 (39%), Gaps = 18/158 (11%) Query: 4 LMDADGNIAWSGEYDEWGNQLNEENPHHLHQPYRLPGQQYDKESGLYYNRNRYYDPLQGR 63 L D+ G I + +YD +GN L +H +R G+QYD +G YY R R+Y P GR Sbjct: 1576 LADSSGKIVNTYDYDAFGNTL--SVKETIHNRFRYAGEQYDDFTGQYYLRARFYSPSLGR 1633 Query: 64 YITQDP----IGLEGGWSLYAYP-LNPVNGIDPLGLSPA--DVALIRRKDQLNHQRAW-- 114 + +D +LY Y NPV +DP G P D AL +++N W Sbjct: 1634 FTQEDTWRGFTYNPASLNLYTYVENNPVMFVDPTGHWPKFIDNALDWVGNKVNEAADWVG 1693 Query: 115 -------DILSDTYEDMKRLNLGGTDQFFHCMAFCRVS 145 D D D + + V Sbjct: 1694 NRVNDVVDWAGDRINDARNFITNTATGVKNWWVENNVG 1731 >UniRef50_Q3JSF4 RhsD protein n=23 Tax=Burkholderia pseudomallei RepID=Q3JSF4_BURP1 Length = 1593 Score = 116 bits (290), Expect = 7e-25, Method: Composition-based stats. Identities = 40/93 (43%), Positives = 51/93 (54%), Gaps = 1/93 (1%) Query: 2 LALMDADGNIAWSGEYDEWGNQLNEENPHHLHQPYRLPGQQYDKESGLYYNRNRYYDPLQ 61 + + D G I W Y G + E + QP RL GQ +D ESGL YNR RY+ Sbjct: 1349 VRIYDDCGRIVWEARYGPHGGIASIE-TDVIQQPIRLQGQIFDWESGLSYNRYRYFLSSI 1407 Query: 62 GRYITQDPIGLEGGWSLYAYPLNPVNGIDPLGL 94 G +++QDPIGL GG +LY + N DPLGL Sbjct: 1408 GAFVSQDPIGLVGGVNLYRFAPNAFGWTDPLGL 1440 >UniRef50_A9C2K3 YD repeat protein n=1 Tax=Delftia acidovorans SPH-1 RepID=A9C2K3_DELAS Length = 1301 Score = 115 bits (289), Expect = 8e-25, Method: Composition-based stats. Identities = 40/105 (38%), Positives = 52/105 (49%), Gaps = 14/105 (13%) Query: 4 LMDADGNIAWSGEYDEWGNQLNEENPHHLHQ-------------PYRLPGQQYDKESGLY 50 L + G +AW +G Q R PGQ +D+E+GL Sbjct: 1066 LTNQQGQVAWQWLISGFGEVRPTTGDRGYGQTVSGPSYAQAVKFDLRYPGQVFDEETGLS 1125 Query: 51 YNRNRYYDPLQGRYITQDPIGLEGGWSLYAYPL-NPVNGIDPLGL 94 YN +RYYD GRYI DPIGL GGW+ + Y NP++ +DPLGL Sbjct: 1126 YNLHRYYDAATGRYIQADPIGLAGGWNRFGYVGENPLSFVDPLGL 1170 >UniRef50_UPI0001B58169 YD repeat-containing protein n=1 Tax=Streptomyces sp. AA4 RepID=UPI0001B58169 Length = 1724 Score = 115 bits (289), Expect = 8e-25, Method: Composition-based stats. Identities = 43/103 (41%), Positives = 52/103 (50%), Gaps = 6/103 (5%) Query: 4 LMDADGNIAWSGEYDEWGNQLNEENPHHLHQPYRLPGQQYDKESGLYYNRNRYYDPLQGR 63 L+D G +AWS WG P + PGQ D ESGL+YN RYYDP R Sbjct: 1222 LIDEHGGLAWSARRSLWGRVQPGG------IPLQFPGQYADAESGLHYNVFRYYDPAAAR 1275 Query: 64 YITQDPIGLEGGWSLYAYPLNPVNGIDPLGLSPADVALIRRKD 106 YI+QDP+GLEGG + Y +P D LGL D + D Sbjct: 1276 YISQDPLGLEGGPNPSNYVPDPFAATDVLGLKCGDKGKGKATD 1318 >UniRef50_D1YPL4 RHS repeat-associated core domain protein n=1 Tax=Veillonella parvula ATCC 17745 RepID=D1YPL4_9FIRM Length = 167 Score = 115 bits (289), Expect = 8e-25, Method: Composition-based stats. Identities = 39/94 (41%), Positives = 56/94 (59%), Gaps = 1/94 (1%) Query: 4 LMDADGNIAWSGEYDEWGNQLNEEN-PHHLHQPYRLPGQQYDKESGLYYNRNRYYDPLQG 62 + D DGN+ W G Y +WG+ E +QP+RL Q D+E+GL+YN R+Y+ G Sbjct: 1 MTDKDGNLLWFGNYTDWGHLKEETRVTDSAYQPFRLHKQYADRETGLHYNFFRHYETDAG 60 Query: 63 RYITQDPIGLEGGWSLYAYPLNPVNGIDPLGLSP 96 + QDPIGL GG +LY++ N + D LG+ P Sbjct: 61 PLVNQDPIGLVGGDNLYSFANNATSWTDCLGVLP 94 >UniRef50_Q2Y592 Peptidase C39, bacteriocin processing n=1 Tax=Nitrosospira multiformis ATCC 25196 RepID=Q2Y592_NITMU Length = 1599 Score = 115 bits (289), Expect = 9e-25, Method: Composition-based stats. Identities = 42/131 (32%), Positives = 61/131 (46%), Gaps = 3/131 (2%) Query: 3 ALMDADGNIAWSGEYDEWGNQLNEENPHHLHQPYRLPGQQYDKESGLYYNRNRYYDPLQG 62 + +G S +YD +GNQ+ + +R G Y ++SGLY R YDP Sbjct: 1369 VMAAQNGAKVASYDYDPYGNQIAGSGRISVD--FRYAGMFYHQQSGLYLTNFRAYDPKTA 1426 Query: 63 RYITQDPIGLEGGWSLYAYP-LNPVNGIDPLGLSPADVALIRRKDQLNHQRAWDILSDTY 121 +++++DPIG +GG +LY Y NP+N IDPLGL L + DI + Sbjct: 1427 KWLSRDPIGEKGGLNLYGYVGGNPINMIDPLGLRALPWILGGASSDIATPDPSDIAWQKW 1486 Query: 122 EDMKRLNLGGT 132 L G T Sbjct: 1487 AGWAILITGAT 1497 >UniRef50_B2PW71 Putative uncharacterized protein n=1 Tax=Providencia stuartii ATCC 25827 RepID=B2PW71_PROST Length = 191 Score = 115 bits (289), Expect = 9e-25, Method: Composition-based stats. Identities = 34/78 (43%), Positives = 43/78 (55%) Query: 30 HHLHQPYRLPGQQYDKESGLYYNRNRYYDPLQGRYITQDPIGLEGGWSLYAYPLNPVNGI 89 Q R GQ +D E+GL++N R+YDP GR+I DPIGL GG +LY Y NP+ I Sbjct: 2 ESFEQNLRYAGQYFDNETGLHFNTFRFYDPQIGRFIMPDPIGLLGGINLYQYAPNPLGWI 61 Query: 90 DPLGLSPADVALIRRKDQ 107 DP GL+ Q Sbjct: 62 DPWGLAGNPATATHITYQ 79 >UniRef50_C1B7W9 Putative uncharacterized protein n=1 Tax=Rhodococcus opacus B4 RepID=C1B7W9_RHOOB Length = 514 Score = 115 bits (289), Expect = 9e-25, Method: Composition-based stats. Identities = 38/102 (37%), Positives = 50/102 (49%), Gaps = 4/102 (3%) Query: 2 LALMDA-DGNIAWSGEYDEWGNQLNEENPHHLHQPYRLPGQQYDKESGLYYNRNRYYDPL 60 + L+D + D WG+ R PGQ +D E+GL+YN +RYY+P Sbjct: 319 VELVDPRTADSVADATTDLWGHTTWRGATD---THLRFPGQYHDPETGLHYNLHRYYNPH 375 Query: 61 QGRYITQDPIGLEGGWSLYAYPLNPVNGIDPLGLSPADVALI 102 RY+TQDP+GL + YP NP DPLGL P I Sbjct: 376 TARYLTQDPLGLAPSPNPNTYPHNPTGWTDPLGLVPCPPTTI 417 >UniRef50_C6EE82 Rhs family protein-like protein n=2 Tax=Escherichia coli RepID=C6EE82_ECOBD Length = 290 Score = 115 bits (288), Expect = 1e-24, Method: Composition-based stats. Identities = 49/60 (81%), Positives = 53/60 (88%) Query: 35 PYRLPGQQYDKESGLYYNRNRYYDPLQGRYITQDPIGLEGGWSLYAYPLNPVNGIDPLGL 94 RLPGQQYD+ESGLYYNR+RYYDPLQGRYITQDPIGL+GGW+ Y YPLNPV DPLGL Sbjct: 101 DIRLPGQQYDEESGLYYNRHRYYDPLQGRYITQDPIGLKGGWNFYQYPLNPVTNTDPLGL 160 >UniRef50_C8QVK4 YD repeat protein n=4 Tax=Desulfurivibrio alkaliphilus AHT2 RepID=C8QVK4_9DELT Length = 2439 Score = 115 bits (288), Expect = 1e-24, Method: Composition-based stats. Identities = 49/196 (25%), Positives = 79/196 (40%), Gaps = 13/196 (6%) Query: 2 LALMDADGNIAWSGEYDEWGNQLNEENPHHLHQPYRLPGQQYDKESGLYYNRNRYYDPLQ 61 L + + G IA +YDE+GN L + NP QP+ G +D+++ L R YDP Sbjct: 2207 LVVNTSTGEIAQRIDYDEFGNVLQDTNPG--FQPFGFAGGLHDRDTNLTRFGARDYDPQT 2264 Query: 62 GRYITQDPIGLEGG-WSLYAYPLN-PVNGIDPLGLSPADVALIRRKDQLNHQR--AWDIL 117 GR+ +DPI GG +LY Y LN P+N IDP G A + A+ Sbjct: 2265 GRWTAKDPILFAGGDTNLYGYVLNDPINWIDPEGKIGIAGAAVGGVIGAVSGALGAYTSG 2324 Query: 118 SDTYEDMKRLNLGGTDQFFHCMAFCRVSKLNDAGVSRSAKGLGYEKEIRDYGLNLFGMYG 177 ++++ GG + L AG+ I+++G G+ Sbjct: 2325 GNSWDIASAAFTGGLGGAVY-------GFLPGAGLLAHIGKGAAIGGIQNFGAQFMGILN 2377 Query: 178 RKVKLSHSEMIEDNKK 193 + + + + Sbjct: 2378 DPCQRFNYGSLAGSIL 2393 >UniRef50_Q16U81 Putative uncharacterized protein n=1 Tax=Aedes aegypti RepID=Q16U81_AEDAE Length = 3340 Score = 115 bits (288), Expect = 1e-24, Method: Composition-based stats. Identities = 33/117 (28%), Positives = 54/117 (46%), Gaps = 4/117 (3%) Query: 8 DGNIAWSGEYDEWGNQLNEENPH-HLHQPYRLPGQQYDKESGLYYNRNRYYDPLQGRYIT 66 +G + + +Y +G + + H YR GQ+YD+E+GLY R YDP GR+ Sbjct: 2635 NGEVVAAYDYFPYGQLMRIYGSNPEAHIAYRYTGQEYDEETGLYNYHARLYDPDIGRFFQ 2694 Query: 67 QDPIGLEGGWSLYAYPLN-PVNGIDPLGLSPADVALIRRKDQLNHQRAWDILSDTYE 122 DP+ E S Y Y N PV+ IDP G + ++ + ++++ Sbjct: 2695 MDPM--EQYASPYKYAGNSPVSQIDPDGQIAITLIVMAIGALVGAYIGASSANNSWN 2749 >UniRef50_A0LQM7 NHL repeat containing protein n=1 Tax=Syntrophobacter fumaroxidans MPOB RepID=A0LQM7_SYNFM Length = 1763 Score = 115 bits (288), Expect = 1e-24, Method: Composition-based stats. Identities = 34/93 (36%), Positives = 50/93 (53%), Gaps = 3/93 (3%) Query: 4 LMDADGNIAWSGEYDEWGNQLNEENPHHLHQPYRLPGQQYDKESGLYYNRNRYYDPLQGR 63 + DA G + YD +GN + + NP + P+ G +D ++GL R YDP GR Sbjct: 1591 VTDASGAVVKELVYDSFGNLIGDTNP-GFYVPFGFAGGLHDPDTGLVRFGFRDYDPEVGR 1649 Query: 64 YITQDPIGLEGG-WSLYAYP-LNPVNGIDPLGL 94 + +DPIG GG +LY Y +P+N +D GL Sbjct: 1650 WTAKDPIGFGGGDTNLYGYCLSDPINWVDSHGL 1682 >UniRef50_C7HJS3 YD repeat protein (Fragment) n=3 Tax=Clostridium thermocellum DSM 2360 RepID=C7HJS3_CLOTM Length = 783 Score = 115 bits (287), Expect = 2e-24, Method: Composition-based stats. Identities = 34/103 (33%), Positives = 52/103 (50%), Gaps = 4/103 (3%) Query: 1 MLALMDADGNIAWSGEYDEWGNQLNEENPHHLHQPYRLPGQQYDKESGLYYNRNRYYDPL 60 + AL D G + S YD +GN L ++ + ++ G+ D +G YY R RYY+P Sbjct: 421 ITALTDGKGEVINSYSYDAFGNIL--DSVEKIENRFKYSGEMLDPVTGQYYLRARYYNPS 478 Query: 61 QGRYITQDPIGLEGGWSLYAYP-LNPVNGIDPLGLSPADVALI 102 GR++ +D G +LY Y NP+ IDP G + A+ Sbjct: 479 IGRFMQEDTF-RGDGLNLYTYVANNPIKYIDPTGHCKENAAIA 520 >UniRef50_A9FBU5 Conserved carbohydrate-binding protein, Rhs family n=2 Tax=Proteobacteria RepID=A9FBU5_SORC5 Length = 1300 Score = 114 bits (286), Expect = 2e-24, Method: Composition-based stats. Identities = 41/92 (44%), Positives = 54/92 (58%), Gaps = 2/92 (2%) Query: 4 LMDADGNIAWSGEYDEWGNQLNEENPHHLHQPYRLPGQQYDKESGLYYNRNRYYDPLQGR 63 L+ DG I D WG+ + E P R GQ D+E+GL YNR RYYDP GR Sbjct: 1067 LVGPDGQIGCELARDPWGSATSAEGA-QTSTPLRFRGQYADEETGLSYNRYRYYDPELGR 1125 Query: 64 YITQDPIGLEGGWSLYAYPLN-PVNGIDPLGL 94 YI+ DP+G+EGG +++AY N P + +D GL Sbjct: 1126 YISADPLGIEGGLNVFAYAANCPTSAVDVEGL 1157 >UniRef50_D1PV96 YD repeat protein n=1 Tax=Prevotella bergensis DSM 17361 RepID=D1PV96_9BACT Length = 394 Score = 114 bits (286), Expect = 2e-24, Method: Composition-based stats. Identities = 54/258 (20%), Positives = 94/258 (36%), Gaps = 40/258 (15%) Query: 4 LMDADGNIAWSGEYDEWGNQLNEE--------------NPHHLHQPYRLPGQQYDKESGL 49 + + DG + EY +G +E + + PY ++ D+E+G+ Sbjct: 43 ITNLDGEVVQHIEYVPYGEVFIDELELTRKSATAGKANCNNTWNTPYLFNAKELDEETGM 102 Query: 50 YYNRNRYYDPLQGRYITQDPIGL-EGGWSLYAY-PLNPVNGIDPLGLSPADVALIRRKDQ 107 YY RYY+P +++ DP+G ++Y Y NP IDP G + + Sbjct: 103 YYYGARYYEPRLSLWVSTDPLGETAPHITVYCYTANNPTILIDPDGKAWKPTK--NEETG 160 Query: 108 LNHQRAWDILSDTYEDMKRLNLGGTDQFFHCMAFCRVSKLNDA----------------- 150 N W + +Y+ +L G +Q K ++ Sbjct: 161 QNTGYEWINPAKSYDSKGKLLPGLYEQAIFFSNQGNNGKTFNSKNRFNMGSSIATVYCKD 220 Query: 151 GVSRSAKGLGYEKEIRDYGLNLFGMYGRKVKLSHSEMIEDNKKDLAVNDHGLTCPSTTDC 210 G + + Y + Y GMY KV + + + K L ++D G T +++ Sbjct: 221 GTTSEFEACTYPSNLDKYATVPEGMYEAKVGMHNGSSAQ--YKALRMSDIGTTDFNSSSI 278 Query: 211 SDRCSDYINPEHKKTIKA 228 NP + KT KA Sbjct: 279 E---LGKPNPSNSKTTKA 293 >UniRef50_Q4UTI2 Putative uncharacterized protein n=2 Tax=Xanthomonas campestris pv. campestris RepID=Q4UTI2_XANC8 Length = 137 Score = 114 bits (285), Expect = 3e-24, Method: Composition-based stats. Identities = 41/110 (37%), Positives = 58/110 (52%), Gaps = 5/110 (4%) Query: 116 ILSDTYEDMKRLNLGGTDQFFHCMAFCRVSK-LNDAGVSRSAKGLGYEKEIRDYGLNLFG 174 ILS +MK N+ G+DQF+HC+A CR ++ + G+ L KE +DY G Sbjct: 3 ILSSLNSEMKSRNIAGSDQFYHCLASCRATQATKNPGLVLEMMAL---KETKDYYAGRLG 59 Query: 175 MYGRKVKLSHSEMIEDNKKDLAVNDHGLTCPSTTDCSDRCSDYINPEHKK 224 +YG + H EM DN++D+A N G TC DC RC + PE + Sbjct: 60 LYGDGRRRGHYEMQSDNQQDMAANQLGATCQMGEDCPRRCMGLV-PERSR 108 >UniRef50_A9GD22 Conserved carbohydrate-binding protein, Rhs family n=2 Tax=Sorangium cellulosum 'So ce 56' RepID=A9GD22_SORC5 Length = 1351 Score = 114 bits (285), Expect = 3e-24, Method: Composition-based stats. Identities = 37/94 (39%), Positives = 53/94 (56%), Gaps = 3/94 (3%) Query: 4 LMDADGNIAWSGEYDEWGNQLNEENPHHLHQPYRLPGQQYDKESGLYYNRNRYYDPLQGR 63 L+ G + + +G ++ E P R PGQ D+E+GL YNR RY+DP GR Sbjct: 1050 LVGPAGEVVCELDRSAFGAKVKEGGRTT--TPLRFPGQYEDEETGLVYNRYRYFDPALGR 1107 Query: 64 YITQDPIGLEGGWSLYAYPLN-PVNGIDPLGLSP 96 Y++ DP GL+GG++ + Y N P +DP GL P Sbjct: 1108 YLSADPAGLDGGFNGFDYAGNAPTRFVDPSGLMP 1141 >UniRef50_Q2T5B9 YD repeat protein n=39 Tax=Proteobacteria RepID=Q2T5B9_BURTA Length = 1553 Score = 114 bits (285), Expect = 3e-24, Method: Composition-based stats. Identities = 43/146 (29%), Positives = 59/146 (40%), Gaps = 25/146 (17%) Query: 4 LMDADGNIAWSGEYDEWGNQLNEENPHHL---------------------HQPYRLPGQQ 42 L +DG W WG+ ++ L R PGQ Sbjct: 1289 LYSSDGRALWRARRTAWGDTAGDDGRDSLRSAVREQLRLGHRDSDEFDPPDCELRFPGQW 1348 Query: 43 YDKESGLYYNRNRYYDPLQGRYITQDPIGLEGGWSLYAYPLNPVNGIDPLGLSPADVALI 102 D+ESGL+YN +RYYDP G+Y++ DP+GL GG +AY +P+ DP GL D Sbjct: 1349 ADEESGLHYNLHRYYDPSTGQYLSADPVGLAGGLRTHAYVHDPMQWGDPFGLQGYDTVRN 1408 Query: 103 RRKDQLNHQRAWDILSDTYEDMKRLN 128 R + D + K N Sbjct: 1409 HR----AGNKQIDYDGQRWNVPKGKN 1430 >UniRef50_Q73BZ0 Cell wall-associated protein n=4 Tax=Bacillus cereus group RepID=Q73BZ0_BACC1 Length = 258 Score = 114 bits (285), Expect = 3e-24, Method: Composition-based stats. Identities = 35/101 (34%), Positives = 51/101 (50%), Gaps = 6/101 (5%) Query: 1 MLALMDADGNIAWSGEYDEWGNQL-NEENPHHLHQPYRLPGQQYDKESGLYYNRNRYYDP 59 ++A+ D + + + EYD WGN L ++ P+ G YDKE G+YY RYY+P Sbjct: 19 VVAMTDQNKEVVATYEYDSWGNVLKSDAKGIATDNPFGYAGYMYDKEIGMYYLIARYYNP 78 Query: 60 LQGRYITQDPI-GLEGGW---SLYAYP-LNPVNGIDPLGLS 95 G +++ DP G E + Y Y NPV +DP G Sbjct: 79 EHGVFLSVDPNPGDEDDPVTQNGYTYGDNNPVMMVDPDGHW 119 >UniRef50_C7MZM5 Rhs family protein n=1 Tax=Saccharomonospora viridis DSM 43017 RepID=C7MZM5_SACVD Length = 1259 Score = 113 bits (284), Expect = 3e-24, Method: Composition-based stats. Identities = 39/96 (40%), Positives = 50/96 (52%), Gaps = 1/96 (1%) Query: 4 LMDADGNIAWSGEYDEWGNQLNEENPHHLHQPYRLPGQQYDKESGLYYNRNRYYDPLQGR 63 L++ G IAW WG ++ + P R PGQ D E+G YN R+YDP R Sbjct: 1052 LINIHGGIAWYHRTTLWGITTDQSRTGA-YTPLRFPGQYADPETGFNYNFQRHYDPASAR 1110 Query: 64 YITQDPIGLEGGWSLYAYPLNPVNGIDPLGLSPADV 99 Y DP+GL GG+ ++Y NP IDPLGL D Sbjct: 1111 YAGTDPLGLVGGFDPHSYVWNPYAWIDPLGLKKCDP 1146 >UniRef50_C0EFE6 Putative uncharacterized protein n=1 Tax=Clostridium methylpentosum DSM 5476 RepID=C0EFE6_9CLOT Length = 292 Score = 113 bits (284), Expect = 3e-24, Method: Composition-based stats. Identities = 35/105 (33%), Positives = 52/105 (49%), Gaps = 12/105 (11%) Query: 1 MLALMDADGNIAWSGEYDEWGNQLNEENPHHL----HQPYRLPGQQYDKESGLYYNRNRY 56 ++ ++D+DGN S YD WG ++ P+R G YD+E+G YY +RY Sbjct: 5 IIGILDSDGNQVVSYIYDSWGKLISTSGSLAESVGKQNPFRYAGYYYDQETGFYYIESRY 64 Query: 57 YDPLQGRYITQDPIGLEGG-------WSLYAYP-LNPVNGIDPLG 93 YDP R++ D + ++L+AY NPVN DP G Sbjct: 65 YDPETHRFLNADDSAILTDETIELVAYNLFAYTKNNPVNLYDPDG 109 >UniRef50_A7BPL6 Protein containing RHS repeats n=2 Tax=Beggiatoa sp. PS RepID=A7BPL6_9GAMM Length = 2594 Score = 113 bits (284), Expect = 4e-24, Method: Composition-based stats. Identities = 43/126 (34%), Positives = 65/126 (51%), Gaps = 13/126 (10%) Query: 1 MLALMDADGNIAWSGEYDEWGNQLNEENPHHLHQPYRLPGQQYDKESGLYYNRNRYYDPL 60 ++ + D G++ +YD +GN L++ NP QP+ G YD +GL R Y+P Sbjct: 2367 VIDITD--GSVVQHMDYDTFGNILSDSNPG--FQPFGFAGGLYDVNTGLIRFGARDYEPE 2422 Query: 61 QGRYITQDPIGLEGG-WSLYAYPLN-PVNGIDPLGLSPADVALIRR------KDQLNHQR 112 GR+ +DPI +GG +LY Y +N P+N +D GL D IR K+ N+ + Sbjct: 2423 IGRWTAKDPILFDGGDTNLYGYVVNDPINFVDLFGLK-VDTGKIREDFRACLKEMTNNGQ 2481 Query: 113 AWDILS 118 D LS Sbjct: 2482 RLDKLS 2487 >UniRef50_C5ID01 RhsK n=50 Tax=cellular organisms RepID=C5ID01_ECOLX Length = 1616 Score = 113 bits (284), Expect = 4e-24, Method: Composition-based stats. Identities = 42/115 (36%), Positives = 54/115 (46%), Gaps = 12/115 (10%) Query: 2 LALMDADGNIAWS-GEYDEWGNQLNEENPHHLHQP-----------YRLPGQQYDKESGL 49 L L +++G W G+ WG L+ P GQ D ESGL Sbjct: 1357 LMLFNSEGKTVWRPGQTSLWGLALSLPADTDYPDPRGERDPEADPGLLYAGQWQDAESGL 1416 Query: 50 YYNRNRYYDPLQGRYITQDPIGLEGGWSLYAYPLNPVNGIDPLGLSPADVALIRR 104 YNR RYY+P G Y+ DP+GL+GG Y Y NP IDPLGL+ +A + Sbjct: 1417 CYNRFRYYEPETGMYLVSDPLGLQGGEQTYRYVPNPCGYIDPLGLAICQLARWTK 1471 >UniRef50_A8ZXK4 YD repeat protein n=3 Tax=Deltaproteobacteria RepID=A8ZXK4_DESOH Length = 2961 Score = 113 bits (283), Expect = 4e-24, Method: Composition-based stats. Identities = 43/122 (35%), Positives = 56/122 (45%), Gaps = 3/122 (2%) Query: 3 ALMDADGNIAWSGEYDEWGNQLNEENPHHLHQPYRLPGQQYDKESGLYYNRNRYYDPLQG 62 A+ D GNI +YD +G +N+ NP P+ G YDK++GL R Y+P G Sbjct: 2729 AVTDGSGNIVKQIDYDSFGFVINDTNP-SFSVPFGFAGGLYDKDTGLVRFGYRDYNPNTG 2787 Query: 63 RYITQDPIGLEGGWS-LYAYP-LNPVNGIDPLGLSPADVALIRRKDQLNHQRAWDILSDT 120 R+ +DPIG GG S LY Y + VN IDP GL D A D Sbjct: 2788 RWTAKDPIGFAGGSSDLYGYCLGDGVNLIDPDGLDFIDSMRALHAGASPTSNARDFHYSR 2847 Query: 121 YE 122 + Sbjct: 2848 NQ 2849 >UniRef50_A9FQL9 Putative uncharacterized protein n=1 Tax=Sorangium cellulosum 'So ce 56' RepID=A9FQL9_SORC5 Length = 2257 Score = 113 bits (283), Expect = 4e-24, Method: Composition-based stats. Identities = 40/103 (38%), Positives = 53/103 (51%), Gaps = 4/103 (3%) Query: 2 LALMDADGNIAWSGEYDEWGNQLNEENPHHLHQPYRLPGQQYDKESGLYYNRNRYYDPLQ 61 L + A G +A +YDE+G L + NP QP+ G YD E+ L R YD Sbjct: 2054 LVVNTATGAVAQRIDYDEFGRVLQDTNPG--FQPFGFAGGLYDAETKLVRFGARDYDAEV 2111 Query: 62 GRYITQDPIGLEGG-WSLYAYPLN-PVNGIDPLGLSPADVALI 102 GR+ +DPI +GG +LY Y LN PVN DP G P ++ Sbjct: 2112 GRWTAKDPILFDGGDANLYGYVLNDPVNFTDPNGYGPIELGQC 2154 >UniRef50_A9FZA9 Rhs family protein n=1 Tax=Sorangium cellulosum 'So ce 56' RepID=A9FZA9_SORC5 Length = 2407 Score = 113 bits (282), Expect = 6e-24, Method: Composition-based stats. Identities = 38/154 (24%), Positives = 60/154 (38%), Gaps = 4/154 (2%) Query: 5 MDADGNIAWSGEYDEWGNQLNEENPHHLH---QPYRLPGQQYDKESGLYYNRNRYYDPLQ 61 +D G + EY +G + + YR G + D+E+GLYY+ RYY P Sbjct: 2005 VDGAGLVIGYEEYHPFGTTAYWSAASGIEVSQRRYRYTGTEKDEETGLYYHGARYYAPWL 2064 Query: 62 GRYITQDPIGLEGGWSLYAYP-LNPVNGIDPLGLSPADVALIRRKDQLNHQRAWDILSDT 120 GR+ + DP G G +LY Y NP+ DP G D + + D H+ + + Sbjct: 2065 GRWTSADPAGFVDGPNLYEYVRGNPIRLRDPSGRESTDQRIAQMTDVQLHRHVKALSPEA 2124 Query: 121 YEDMKRLNLGGTDQFFHCMAFCRVSKLNDAGVSR 154 + G + + GV+ Sbjct: 2125 RAEFTASATGKFQERVSVTLARGKLESIRTGVTT 2158 >UniRef50_B9XHG9 YD repeat protein n=1 Tax=bacterium Ellin514 RepID=B9XHG9_9BACT Length = 1915 Score = 113 bits (282), Expect = 6e-24, Method: Composition-based stats. Identities = 35/96 (36%), Positives = 57/96 (59%), Gaps = 1/96 (1%) Query: 1 MLALMDADGNIAWSGEYDEWGNQLNEENPHHLHQPYRLPGQQYDKESGLYYNRNRYYDPL 60 + AL++A I YD +GN L++ P Y+ +++ + SGL +RYYDP Sbjct: 1717 ITALINAQQVIVAKYLYDPFGNILSKSGPLAEANLYQFSSKEFHQNSGLVCYLHRYYDPN 1776 Query: 61 QGRYITQDPIGLEGGWSLYAYPLN-PVNGIDPLGLS 95 R++T+DP+G GG++LY + N P+ G+DP GL+ Sbjct: 1777 LQRWLTRDPLGELGGYNLYQFVGNDPMEGVDPFGLA 1812 >UniRef50_B2AU51 Predicted CDS Pa_1_17960 n=1 Tax=Podospora anserina RepID=B2AU51_PODAN Length = 2454 Score = 113 bits (282), Expect = 6e-24, Method: Composition-based stats. Identities = 37/106 (34%), Positives = 58/106 (54%), Gaps = 3/106 (2%) Query: 5 MDADGNIAWSGEYDEWGNQLNEENPHHLHQP--YRLPGQQYDKESGLYYNRNRYYDPLQG 62 +D + N+ EY +G+ + + P YRL ++D+E+GLY+ RYY P G Sbjct: 2101 LDDEANLVSYEEYSPFGSVVYSAVYVEVEAPRKYRLARYEHDRETGLYHCGKRYYCPWLG 2160 Query: 63 RYITQDPIGLEGGWSLYAYPLN-PVNGIDPLGLSPADVALIRRKDQ 107 R+ + DP G G +LY+Y N PVN +DP G S V+ + +D+ Sbjct: 2161 RWTSADPAGTVDGPNLYSYVRNDPVNWVDPKGTSGKKVSPEKPQDK 2206 >UniRef50_A6EH31 Rhs family protein n=2 Tax=cellular organisms RepID=A6EH31_9SPHI Length = 1409 Score = 113 bits (282), Expect = 7e-24, Method: Composition-based stats. Identities = 36/88 (40%), Positives = 47/88 (53%), Gaps = 1/88 (1%) Query: 8 DGNIAWSGEYDEWGNQLNEENPHHLHQPYRLPGQQYDKESGLYYNRNRYYDPLQGRYITQ 67 DG + W E D +G + Y GQ D E+GL YNR RYY+P +G Y++Q Sbjct: 1186 DGELIWQRELDSYGRMKMQRGDAGFCN-YLYQGQLMDPETGLAYNRFRYYNPEEGIYVSQ 1244 Query: 68 DPIGLEGGWSLYAYPLNPVNGIDPLGLS 95 DPI L GG LY Y + +DP GL+ Sbjct: 1245 DPIRLLGGSRLYGYVKDTNIWLDPFGLA 1272 >UniRef50_C5SNJ4 YD repeat protein n=1 Tax=Asticcacaulis excentricus CB 48 RepID=C5SNJ4_9CAUL Length = 1421 Score = 112 bits (281), Expect = 8e-24, Method: Composition-based stats. Identities = 39/120 (32%), Positives = 58/120 (48%), Gaps = 3/120 (2%) Query: 1 MLALMDADGNIAWSGEYDEWGNQLNEENPHHLHQPYRLPGQQYDKESGLYYNRNRYYDPL 60 + A+++++G + + YDE+G + L +R GQ Y E GLYY + R Y P Sbjct: 1101 IQAVLNSNGTVNSTYAYDEYG--VPYTTSGSLFSRFRYTGQAYLSEIGLYYYKARMYSPT 1158 Query: 61 QGRYITQDPIGLEGGWSLYAYPLN-PVNGIDPLGLSPADVALIRRKDQLNHQRAWDILSD 119 GR++ DPIG + G + YAY N P+NG DP GL + + D D Sbjct: 1159 LGRFLQTDPIGYDDGMNWYAYVHNDPMNGKDPSGLCEESTPEVIVICGSKKGKGSDNGDD 1218 >UniRef50_A6DLJ6 Putative uncharacterized protein n=1 Tax=Lentisphaera araneosa HTCC2155 RepID=A6DLJ6_9BACT Length = 2320 Score = 112 bits (280), Expect = 9e-24, Method: Composition-based stats. Identities = 36/107 (33%), Positives = 54/107 (50%), Gaps = 7/107 (6%) Query: 1 MLALMDADGNIAWSGEYDEWG-NQLNEENPHHLHQPY-----RLPGQQYDKESGLYYNRN 54 ++A+ D GN+ S Y +G + + + G++YD ESGL++ RN Sbjct: 1850 VVAITDETGNLLESYSYTSFGIRTIYNQTGQEIANSAYGITAGYTGREYDSESGLWHYRN 1909 Query: 55 RYYDPLQGRYITQDPIGLEGGWSLYAYP-LNPVNGIDPLGLSPADVA 100 R Y GR++ DP G G +LYAY NP+N IDP GL ++ Sbjct: 1910 RMYSAEIGRFMQVDPAGFVDGLNLYAYVKNNPINFIDPWGLQALNLN 1956 >UniRef50_C0ZAE5 Putative uncharacterized protein n=2 Tax=Brevibacillus brevis NBRC 100599 RepID=C0ZAE5_BREBN Length = 1821 Score = 112 bits (279), Expect = 2e-23, Method: Composition-based stats. Identities = 35/99 (35%), Positives = 54/99 (54%), Gaps = 5/99 (5%) Query: 1 MLALMDADGNIAWSGEYDEWGNQLNEENPHHLHQPYRLPGQQYDKESGLYYNRNRYYDPL 60 ++ + +GN+ + +YD WGN + ++ + P+ G+ +DKESG YY R RYYDP Sbjct: 1508 VVKIKAPNGNVLNTYDYDIWGNLIADKVKETISNPFMYAGEMFDKESGFYYLRARYYDPK 1567 Query: 61 QGRYITQDPI-GLEGG---WSLYAYP-LNPVNGIDPLGL 94 GR+I++D G + Y Y NP+ IDP G Sbjct: 1568 IGRFISEDTYKGQVDNPLTLNRYTYVSSNPLKYIDPSGH 1606 >UniRef50_D0KFB3 Putative uncharacterized protein n=1 Tax=Pectobacterium wasabiae WPP163 RepID=D0KFB3_PECWW Length = 283 Score = 112 bits (279), Expect = 2e-23, Method: Composition-based stats. Identities = 38/106 (35%), Positives = 52/106 (49%) Query: 3 ALMDADGNIAWSGEYDEWGNQLNEENPHHLHQPYRLPGQQYDKESGLYYNRNRYYDPLQG 62 AL DG + W Q + GQ D ESGL YNR RYYD G Sbjct: 49 ALYKPDGTLRWQAPKSTLWGQRRSAYADNADPGLGFAGQYRDTESGLCYNRFRYYDSNGG 108 Query: 63 RYITQDPIGLEGGWSLYAYPLNPVNGIDPLGLSPADVALIRRKDQL 108 Y++ DPIG+ GG + Y Y NP++ +DPLGL+ ++D++ Sbjct: 109 CYVSPDPIGVAGGDNNYGYVQNPLDWVDPLGLAGCSSKGFNKRDRI 154 >UniRef50_A8ZVU3 YD repeat protein n=1 Tax=Desulfococcus oleovorans Hxd3 RepID=A8ZVU3_DESOH Length = 2831 Score = 111 bits (278), Expect = 2e-23, Method: Composition-based stats. Identities = 44/95 (46%), Positives = 55/95 (57%), Gaps = 3/95 (3%) Query: 3 ALMDADGNIAWSGEYDEWGNQLNEENPHHLHQPYRLPGQQYDKESGLYYNRNRYYDPLQG 62 A+ DA GNI +YD +G LN+ P P+ G YDK++GL R YDP G Sbjct: 2603 AVADAAGNIVKQIDYDSFGFMLNDTYP-GFEIPFGFAGGLYDKDTGLVRFGYRDYDPNTG 2661 Query: 63 RYITQDPIGLEGGWS-LYAYPLN-PVNGIDPLGLS 95 R+ +DPIG GG S LY Y LN PVN ID +GL+ Sbjct: 2662 RWTAKDPIGFNGGASDLYGYCLNDPVNMIDGIGLA 2696 >UniRef50_Q4VQB1 SGS1 n=4 Tax=Aedes/Ochlerotatus group RepID=Q4VQB1_AEDAE Length = 3060 Score = 111 bits (277), Expect = 2e-23, Method: Composition-based stats. Identities = 33/121 (27%), Positives = 56/121 (46%), Gaps = 4/121 (3%) Query: 4 LMDADGNIAWSGEYDEWGNQLNEENPH-HLHQPYRLPGQQYDKESGLYYNRNRYYDPLQG 62 L+ G + + +Y +G + + + H +R GQ++D+E+GLY R YDP G Sbjct: 2643 LVIHQGKVVAAYDYLPYGQMIRKYGSNPEAHIAFRYTGQEFDEETGLYNYHARLYDPDIG 2702 Query: 63 RYITQDPIGLEGGWSLYAYPLN-PVNGIDPLGLSPADVALIRRKDQLNHQRAWDILSDTY 121 R+ DP+ E S Y Y N PV+ IDP G + L+ + ++++ Sbjct: 2703 RFFQMDPM--EQYASPYKYAGNSPVSQIDPDGQIAVTLVLMIIGAIVGAYLGAASANNSW 2760 Query: 122 E 122 Sbjct: 2761 N 2761 >UniRef50_D0LTH3 YD repeat protein n=1 Tax=Haliangium ochraceum DSM 14365 RepID=D0LTH3_HALO1 Length = 2387 Score = 111 bits (277), Expect = 2e-23, Method: Composition-based stats. Identities = 39/97 (40%), Positives = 52/97 (53%), Gaps = 4/97 (4%) Query: 2 LALMDADGNIAWSGEYDEWGNQLNEENPHHLHQPYRLPGQQYDKESGLYYNRNRYYDPLQ 61 L + A G +A +YD WG L + NP QP+ G YD ++GL + R YDP Sbjct: 1971 LVVDTATGAVAQRIDYDVWGRVLADSNPG--FQPFGFAGGLYDPDTGLVHFGARDYDPRT 2028 Query: 62 GRYITQDPIGLEGGW-SLYAYPL-NPVNGIDPLGLSP 96 GR++ DP GG+ +LYAY +PVN ID G P Sbjct: 2029 GRFLQTDPRLFGGGYDNLYAYSGFDPVNYIDRTGEVP 2065 >UniRef50_A9C2K7 Rhs family protein-like protein n=1 Tax=Delftia acidovorans SPH-1 RepID=A9C2K7_DELAS Length = 285 Score = 111 bits (277), Expect = 2e-23, Method: Composition-based stats. Identities = 40/105 (38%), Positives = 50/105 (47%), Gaps = 14/105 (13%) Query: 4 LMDADGNIAWSGEYDEWGNQLNEENPHHLHQ-------------PYRLPGQQYDKESGLY 50 L + G +AW +G Q R PGQ +D+E+GL Sbjct: 25 LTNQQGQVAWQWLISGFGEVRPTTGDRGYGQTVSGPSYAQAVKFDLRYPGQVFDEETGLS 84 Query: 51 YNRNRYYDPLQGRYITQDPIGLEGGWSLYAYP-LNPVNGIDPLGL 94 YN +RYYD GRYI DPIGLEGGW+ + Y NP+ DP GL Sbjct: 85 YNLHRYYDAATGRYIQADPIGLEGGWNRFGYVGGNPLIYGDPQGL 129 >UniRef50_B2UQS6 Putative uncharacterized protein n=1 Tax=Akkermansia muciniphila ATCC BAA-835 RepID=B2UQS6_AKKM8 Length = 284 Score = 111 bits (277), Expect = 2e-23, Method: Composition-based stats. Identities = 38/95 (40%), Positives = 55/95 (57%), Gaps = 4/95 (4%) Query: 1 MLALMDADGNIAWSGEYDEWGNQLNEENPHHLHQPYRLPGQQYDKESGLYYNRNRYYDPL 60 + + DA G IA + +Y +G + L QP + G+ +D+ES L Y R+Y+P Sbjct: 43 VTEVFDAQGTIAAAYDYSPYGAVTS---TGSLVQPVQWSGEMHDEESSLVYYNYRFYNPK 99 Query: 61 QGRYITQDPIGLEGGWSLYAYPLN-PVNGIDPLGL 94 GR+I +DPI EGGW+LYA+ N P + D LGL Sbjct: 100 DGRWINRDPIAEEGGWNLYAFLGNSPQDKFDALGL 134 >UniRef50_B4UIF2 YD repeat protein n=1 Tax=Anaeromyxobacter sp. K RepID=B4UIF2_ANASK Length = 2350 Score = 110 bits (276), Expect = 3e-23, Method: Composition-based stats. Identities = 36/104 (34%), Positives = 52/104 (50%), Gaps = 5/104 (4%) Query: 2 LALMDADGNIAWSGEYDEWGNQLNEENPHHLHQPYRLPGQQYDKESGLYYNRNRYYDPLQ 61 L + + G + +YDEWG L + NP QP+ G YD+++GL R YDP Sbjct: 2134 LVVNASTGAVVQRIDYDEWGQVLADSNPG--FQPFGFAGGLYDRDTGLVRFGARDYDPTV 2191 Query: 62 GRYITQDPIGLEGGWSLYAYPL-NPVNGIDPLG--LSPADVALI 102 GR+ +D GG + Y Y +PVN +DP G +P A + Sbjct: 2192 GRWTAKDRSRFRGGLNFYEYAASDPVNFVDPTGNYFAPPPAAGV 2235 >UniRef50_C7IND8 YD repeat protein (Fragment) n=1 Tax=Clostridium papyrosolvens DSM 2782 RepID=C7IND8_9CLOT Length = 581 Score = 110 bits (276), Expect = 3e-23, Method: Composition-based stats. Identities = 40/146 (27%), Positives = 63/146 (43%), Gaps = 8/146 (5%) Query: 1 MLALMDADGNIAWSGEYDEWGNQLNEENPHHLHQPYRLPGQQYDKESGLYYNRNRYYDPL 60 + AL+ +G I + YD +GN + ++ G QYDKE+ LYY RYYD Sbjct: 106 VTALVGENGAIQATYYYDAFGNITEQTG--DVNNNITYAGYQYDKETDLYYLNARYYDSK 163 Query: 61 QGRYITQDPIGLEGG----WSLYAYPLN-PVNGIDPLGLS-PADVALIRRKDQLNHQRAW 114 R++++D +LY Y N PV +DP G +D LI+ Q Sbjct: 164 TARFLSEDTYTGNTNDPLSLNLYTYCHNEPVMYVDPSGHWQESDKNLIQSARIAISQLTD 223 Query: 115 DILSDTYEDMKRLNLGGTDQFFHCMA 140 ++ + ++KR +C A Sbjct: 224 IYVTTSDPEVKRACATQAAAIRNCSA 249 >UniRef50_C0EGC9 Putative uncharacterized protein n=2 Tax=Clostridium methylpentosum DSM 5476 RepID=C0EGC9_9CLOT Length = 598 Score = 110 bits (276), Expect = 3e-23, Method: Composition-based stats. Identities = 36/104 (34%), Positives = 57/104 (54%), Gaps = 9/104 (8%) Query: 1 MLALMDADGNIAWSGEYDEWGNQLNEE----NPHHLHQPYRLPGQQYDKESGLYYNRNRY 56 ++ ++D DG+ S YD WG ++ + + PYR G YD E+G YY ++RY Sbjct: 321 VIGILDRDGSQVVSYVYDSWGKLVSTSGSLADTIGVQNPYRYRGYYYDVETGFYYLQSRY 380 Query: 57 YDPLQGRYITQDPI----GLEGGWSLYAYPLN-PVNGIDPLGLS 95 YDP+ GR+I D + G +++AY N P+N +DP G + Sbjct: 381 YDPVTGRFINSDSLIGSTGELNTHNMFAYCGNEPINRVDPAGFA 424 >UniRef50_Q2T916 Rhs1 protein n=30 Tax=Burkholderia RepID=Q2T916_BURTA Length = 1150 Score = 110 bits (276), Expect = 3e-23, Method: Composition-based stats. Identities = 36/95 (37%), Positives = 49/95 (51%), Gaps = 2/95 (2%) Query: 2 LALMDADGNIAWSGEYDEWGNQLNEENPH-HLHQPYRLPGQQYDKESGLYYNRNRYYDPL 60 +A+ D G +AW+G Y WG L + + + QP R G D E L+ N RYYDP Sbjct: 1009 VAMTDDAGALAWAGRYSAWGRILPPTSLNVQVDQPLRFAGHYADDEVRLHLNGTRYYDPD 1068 Query: 61 QGRYITQDPIGLEGGWSLYAYPLNPVNGIDPLGLS 95 GRY++ D +E G S Y Y NP +P G + Sbjct: 1069 TGRYLSPDR-TVEPGTSPYRYVSNPQTACNPTGRA 1102 >UniRef50_B1HKQ3 Rhs family protein n=1 Tax=Burkholderia pseudomallei S13 RepID=B1HKQ3_BURPS Length = 1749 Score = 110 bits (275), Expect = 3e-23, Method: Composition-based stats. Identities = 40/95 (42%), Positives = 52/95 (54%), Gaps = 3/95 (3%) Query: 2 LALMDADGNIAWSGEYDEWGNQLNEENPHHLHQPYRLPGQQYDKESGLYYNRNRYYDPLQ 61 +AL D+ G I YD +GN ++ PY+ G++ D GLYYNR RYY PL Sbjct: 1522 IALTDSAGAIRQRYSYDPYGNTEQSDSTTGFTNPYQYTGREMDAA-GLYYNRARYYSPLM 1580 Query: 62 GRYITQDPIGLEGG-WSLYAYP-LNPVNGIDPLGL 94 +I++DPI GG S Y Y +PVN D LGL Sbjct: 1581 SGFISEDPITFGGGQLSFYGYSDSDPVNHTDRLGL 1615 >UniRef50_C4DNE3 Putative uncharacterized protein n=1 Tax=Stackebrandtia nassauensis DSM 44728 RepID=C4DNE3_9ACTO Length = 714 Score = 110 bits (275), Expect = 4e-23, Method: Composition-based stats. Identities = 34/92 (36%), Positives = 45/92 (48%), Gaps = 3/92 (3%) Query: 5 MDADGNIAWSGEYDEWGNQLNEENPHHLHQP--YRLPGQQYDKESGLYYNRNRYYDPLQG 62 +D I EY +G E P YR G++ D+ESGLYY+ RYY P G Sbjct: 285 LDDSARIISHEEYYPYGGTAVESVRSRTETPKRYRFTGKERDEESGLYYHGARYYAPGLG 344 Query: 63 RYITQDPIGLEGGWSLYAYP-LNPVNGIDPLG 93 R+ + DP G+ G + Y Y NP+ DP G Sbjct: 345 RWTSGDPKGIAEGPNPYVYTRNNPIVFADPDG 376 >UniRef50_C7PJF4 YD repeat protein n=2 Tax=Chitinophaga pinensis DSM 2588 RepID=C7PJF4_CHIPD Length = 1401 Score = 110 bits (275), Expect = 4e-23, Method: Composition-based stats. Identities = 43/89 (48%), Positives = 49/89 (55%), Gaps = 1/89 (1%) Query: 6 DADGNIAWSGEYDEWGNQLNEENPHHLHQPYRLPGQQYDKESGLYYNRNRYYDPLQGRYI 65 DA G W GE D +G P+R GQ D E+GLYYNR RYY PL+G YI Sbjct: 1162 DASGEKVWEGELDIYGKLRKLAGASDF-IPFRRQGQYEDVETGLYYNRFRYYSPLEGLYI 1220 Query: 66 TQDPIGLEGGWSLYAYPLNPVNGIDPLGL 94 +QDPI LEGG Y Y P +DP GL Sbjct: 1221 SQDPIRLEGGSRFYEYSRCPTLILDPFGL 1249 >UniRef50_C3IG21 Wall associated protein n=2 Tax=Bacillus cereus group RepID=C3IG21_BACTU Length = 263 Score = 110 bits (275), Expect = 4e-23, Method: Composition-based stats. Identities = 35/115 (30%), Positives = 52/115 (45%), Gaps = 6/115 (5%) Query: 1 MLALMDADGNIAWSGEYDEWGNQL-NEENPHHLHQPYRLPGQQYDKESGLYYNRNRYYDP 59 ++A+ D + + + EYD WGN L ++ P+ G YDKE G+YY RYY+P Sbjct: 18 VIAMTDQNREVVATYEYDSWGNVLKSDTKGIATENPFGYAGYMYDKEIGMYYLIARYYNP 77 Query: 60 LQGRYITQDPIGLEG----GWSLYAYP-LNPVNGIDPLGLSPADVALIRRKDQLN 109 +++ DP + + Y Y NPV IDP G P +I Sbjct: 78 DHAVFLSVDPDPGDEDDPVTMNGYTYVDNNPVMLIDPDGNIPVAPLVIAGARMAA 132 Searching..................................................done Results from round 3 Score E Sequences producing significant alignments: (bits) Value Sequences used in model and found again: UniRef50_P77759 Putative uncharacterized protein ylbH n=12 Tax=E... 292 5e-78 UniRef50_UPI0001B52595 rhsE element core protein RshE n=1 Tax=Es... 191 2e-47 UniRef50_Q7MDR0 Rhs family protein n=3 Tax=Vibrio vulnificus Rep... 167 2e-40 UniRef50_Q48LL6 Rhs family protein n=1 Tax=Pseudomonas syringae ... 164 2e-39 UniRef50_B5MRU6 Rhs-family protein n=9 Tax=Gammaproteobacteria R... 164 2e-39 UniRef50_Q4ZLF3 YD repeat n=5 Tax=Pseudomonas syringae group Rep... 164 2e-39 UniRef50_D0KEV6 RHS protein n=2 Tax=Enterobacteriaceae RepID=D0K... 163 5e-39 UniRef50_A9AE26 Rhs family protein n=10 Tax=cellular organisms R... 162 8e-39 UniRef50_B2HXH5 Rhs family protein n=5 Tax=Acinetobacter baumann... 161 1e-38 UniRef50_B0VRR7 Putative uncharacterized protein n=1 Tax=Acineto... 161 2e-38 UniRef50_Q2SPP2 Rhs family protein n=3 Tax=Hahella chejuensis KC... 161 3e-38 UniRef50_D1YPP7 RHS repeat-associated core domain protein n=1 Ta... 159 8e-38 UniRef50_Q2SFR5 Rhs family protein n=1 Tax=Hahella chejuensis KC... 159 8e-38 UniRef50_A1U3R9 YD repeat protein n=3 Tax=Gammaproteobacteria Re... 159 1e-37 UniRef50_C0Q6Q9 Rhs-family protein n=27 Tax=Enterobacteriaceae R... 158 1e-37 UniRef50_Q13ML3 YD repeat protein n=10 Tax=Proteobacteria RepID=... 157 2e-37 UniRef50_Q1LDW7 YD repeat n=2 Tax=Burkholderiaceae RepID=Q1LDW7_... 157 3e-37 UniRef50_B2VH58 Rhs family protein n=5 Tax=Enterobacteriaceae Re... 157 3e-37 UniRef50_Q87U70 Rhs family protein n=2 Tax=Pseudomonas RepID=Q87... 155 1e-36 UniRef50_C5CZG8 RHS protein n=1 Tax=Variovorax paradoxus S110 Re... 154 2e-36 UniRef50_Q2SFS1 Rhs family protein n=1 Tax=Hahella chejuensis KC... 154 2e-36 UniRef50_Q0JZD5 RHS family protein n=9 Tax=Bacteria RepID=Q0JZD5... 154 3e-36 UniRef50_B1J8X0 YD repeat protein n=50 Tax=Gammaproteobacteria R... 153 4e-36 UniRef50_P16919 Protein rhsD n=261 Tax=Bacteria RepID=RHSD_ECOLI 153 4e-36 UniRef50_Q6LUC6 Hypothetical nucleotidyltransferase n=1 Tax=Phot... 153 4e-36 UniRef50_C1M8X5 Core protein n=2 Tax=Citrobacter RepID=C1M8X5_9ENTR 153 4e-36 UniRef50_B2K1J9 YD repeat protein n=23 Tax=Yersinia RepID=B2K1J9... 153 5e-36 UniRef50_A6GLW0 Rhs family protein n=1 Tax=Limnobacter sp. MED10... 153 5e-36 UniRef50_C8QGI4 YD repeat protein n=1 Tax=Pantoea sp. At-9b RepI... 152 8e-36 UniRef50_A7FDK7 YD/RHS repeat protein n=25 Tax=Enterobacteriacea... 151 1e-35 UniRef50_Q2SKM2 Rhs family protein n=3 Tax=Hahella chejuensis KC... 151 2e-35 UniRef50_Q6LUC4 Putative uncharacterized protein n=1 Tax=Photoba... 150 2e-35 UniRef50_C3JXH8 Putative Rhs protein n=1 Tax=Pseudomonas fluores... 150 3e-35 UniRef50_Q83LZ1 Putative Rhs-family protein n=1 Tax=Shigella fle... 150 3e-35 UniRef50_Q7NY44 Probable Rhs-family protein n=2 Tax=Chromobacter... 150 3e-35 UniRef50_UPI00016A9A82 Rhs family protein n=2 Tax=Burkholderia o... 150 4e-35 UniRef50_B3PEK8 RHS Repeat family n=1 Tax=Cellvibrio japonicus U... 150 4e-35 UniRef50_B1JCT8 RHS protein n=1 Tax=Pseudomonas putida W619 RepI... 150 4e-35 UniRef50_Q147B5 Rhs family protein n=2 Tax=Betaproteobacteria Re... 150 5e-35 UniRef50_A1TTP6 YD repeat protein n=2 Tax=Acidovorax citrulli AA... 150 5e-35 UniRef50_B7H4M5 Uncharacterized protein ybfO n=3 Tax=Acinetobact... 149 6e-35 UniRef50_D0KES6 YD repeat protein n=15 Tax=Gammaproteobacteria R... 149 7e-35 UniRef50_C1M4X0 Predicted protein n=1 Tax=Citrobacter sp. 30_2 R... 149 7e-35 UniRef50_UPI0001C341D4 protein RhsA n=1 Tax=Citrobacter youngae ... 149 8e-35 UniRef50_UPI0001BC4026 Rhs family protein n=6 Tax=Neisseria RepI... 149 9e-35 UniRef50_Q2SGE8 Rhs family protein n=1 Tax=Hahella chejuensis KC... 149 9e-35 UniRef50_B7LTT0 Putative uncharacterized protein n=4 Tax=Enterob... 149 9e-35 UniRef50_D1PV96 YD repeat protein n=1 Tax=Prevotella bergensis D... 149 1e-34 UniRef50_A7FDJ9 RHS/YD repeat protein n=30 Tax=Enterobacteriacea... 148 1e-34 UniRef50_O52663 Core protein (Fragment) n=5 Tax=Enterobacteriace... 148 1e-34 UniRef50_D1PUV1 RHS family protein n=1 Tax=Prevotella bergensis ... 148 2e-34 UniRef50_Q39K64 Rhs family protein n=22 Tax=Burkholderia RepID=Q... 148 2e-34 UniRef50_UPI00019F17FA RhsC core protein with extension n=1 Tax=... 148 2e-34 UniRef50_P32109 Putative uncharacterized protein yibJ n=14 Tax=B... 147 2e-34 UniRef50_UPI0001AF2680 RHS/YD repeat-containing protein n=1 Tax=... 147 3e-34 UniRef50_A8A655 Rhs family protein n=14 Tax=Enterobacteriaceae R... 147 4e-34 UniRef50_A7FN18 RHS/YD repeat protein n=5 Tax=cellular organisms... 146 5e-34 UniRef50_B5PJT7 Protein RhsD n=2 Tax=Enterobacteriaceae RepID=B5... 146 5e-34 UniRef50_Q328Z1 RhsA protein in rhs element n=7 Tax=Enterobacter... 145 8e-34 UniRef50_Q2SFR1 Rhs family protein n=9 Tax=cellular organisms Re... 145 8e-34 UniRef50_C9Y459 Putative uncharacterized protein n=1 Tax=Cronoba... 145 1e-33 UniRef50_P77779 Putative uncharacterized protein ybfO n=67 Tax=E... 145 1e-33 UniRef50_A1TQS0 YD repeat protein n=2 Tax=Acidovorax RepID=A1TQS... 145 1e-33 UniRef50_Q12LF3 YD repeat n=2 Tax=Shewanella denitrificans OS217... 145 2e-33 UniRef50_D2TGW0 Putative Rhs protein n=2 Tax=Citrobacter RepID=D... 144 2e-33 UniRef50_D1YPL0 RHS repeat-associated core domain protein n=1 Ta... 144 3e-33 UniRef50_UPI0001B53B37 YD repeat protein n=1 Tax=Streptomyces sp... 144 3e-33 UniRef50_Q4K3M9 Rhs family protein n=5 Tax=Pseudomonas RepID=Q4K... 143 4e-33 UniRef50_A1TSG3 RHS protein n=1 Tax=Acidovorax citrulli AAC00-1 ... 143 4e-33 UniRef50_A9C0N8 YD repeat protein n=5 Tax=cellular organisms Rep... 143 5e-33 UniRef50_A1TTQ1 Rhs family protein n=1 Tax=Acidovorax citrulli A... 143 5e-33 UniRef50_C5AIF2 Rhs family protein n=1 Tax=Burkholderia glumae B... 143 5e-33 UniRef50_A7K3Q8 Rhs family protein n=7 Tax=Vibrio RepID=A7K3Q8_V... 142 6e-33 UniRef50_A8ZTS1 YD repeat protein n=4 Tax=Desulfococcus oleovora... 142 7e-33 UniRef50_D0KES8 RHS protein n=3 Tax=Pectobacterium wasabiae WPP1... 142 9e-33 UniRef50_C6M9F5 RHS family protein n=2 Tax=Bacteria RepID=C6M9F5... 141 2e-32 UniRef50_Q6D1M4 Rhs protein n=5 Tax=Enterobacteriaceae RepID=Q6D... 141 2e-32 UniRef50_UPI0001C34A7C Rhs family protein n=1 Tax=Neisseria subf... 141 2e-32 UniRef50_B3E8D4 YD repeat protein n=1 Tax=Geobacter lovleyi SZ R... 141 2e-32 UniRef50_B4SV70 Rhs-family protein n=42 Tax=Enterobacteriaceae R... 141 2e-32 UniRef50_B9B9U8 YD repeat protein n=1 Tax=Burkholderia multivora... 141 2e-32 UniRef50_C8Q7Z5 YD repeat protein n=8 Tax=Enterobacteriaceae Rep... 141 2e-32 UniRef50_C6WJA6 YD repeat protein n=3 Tax=Actinosynnema mirum DS... 141 2e-32 UniRef50_C5CXA7 YD repeat protein n=1 Tax=Variovorax paradoxus S... 140 3e-32 UniRef50_A1TSU8 YD repeat protein n=4 Tax=Acidovorax RepID=A1TSU... 140 3e-32 UniRef50_B2PW78 Putative uncharacterized protein n=1 Tax=Provide... 140 3e-32 UniRef50_A1TV35 YD repeat protein n=2 Tax=Acidovorax RepID=A1TV3... 140 4e-32 UniRef50_Q31U53 Putative uncharacterized protein n=1 Tax=Shigell... 140 4e-32 UniRef50_C0EPY5 Putative uncharacterized protein n=1 Tax=Neisser... 140 5e-32 UniRef50_A1TR18 YD repeat protein n=8 Tax=Acidovorax RepID=A1TR1... 139 6e-32 UniRef50_B2PVY9 Putative uncharacterized protein n=12 Tax=Entero... 139 6e-32 UniRef50_UPI0001B52C8C protein, rhs-like protein n=4 Tax=Enterob... 139 7e-32 UniRef50_A1AK54 YD repeat protein n=1 Tax=Pelobacter propionicus... 139 7e-32 UniRef50_D2UF28 Putative rhs family protein n=2 Tax=Xanthomonas ... 139 8e-32 UniRef50_B5H9Z7 Rhs protein n=2 Tax=Streptomyces RepID=B5H9Z7_STRPR 139 8e-32 UniRef50_D1SWH5 RHS protein (Fragment) n=1 Tax=Acidovorax avenae... 139 8e-32 UniRef50_C7MXD3 Rhs family protein n=1 Tax=Saccharomonospora vir... 139 9e-32 UniRef50_Q399U9 Rhs family protein n=3 Tax=Proteobacteria RepID=... 139 9e-32 UniRef50_A1TJG7 YD repeat protein n=4 Tax=Proteobacteria RepID=A... 139 9e-32 UniRef50_Q8GDM7 Rhs n=3 Tax=Photorhabdus RepID=Q8GDM7_PHOLU 138 1e-31 UniRef50_B4V6T6 Rhs protein n=2 Tax=Streptomyces RepID=B4V6T6_9ACTO 138 1e-31 UniRef50_C9Y462 Putative uncharacterized protein n=1 Tax=Cronoba... 138 1e-31 UniRef50_Q1I7Q5 Putative uncharacterized protein n=11 Tax=Pseudo... 138 2e-31 UniRef50_A0LJM9 YD repeat protein n=3 Tax=Syntrophobacter fumaro... 138 2e-31 UniRef50_B1KGR6 YD repeat protein n=1 Tax=Shewanella woodyi ATCC... 138 2e-31 UniRef50_C9NF93 YD repeat protein n=1 Tax=Streptomyces flavogris... 137 3e-31 UniRef50_C9Y441 Putative uncharacterized protein n=1 Tax=Cronoba... 137 3e-31 UniRef50_B7GLX9 Rhs family protein n=1 Tax=Anoxybacillus flavith... 137 3e-31 UniRef50_C6M5B1 Rhs-related protein n=7 Tax=Proteobacteria RepID... 136 4e-31 UniRef50_B8FCM0 YD repeat protein n=1 Tax=Desulfatibacillum alke... 136 5e-31 UniRef50_Q7N2G0 Complete genome; segment 11/17 n=4 Tax=Gammaprot... 136 5e-31 UniRef50_C5ALM7 YD repeat protein n=19 Tax=Proteobacteria RepID=... 136 5e-31 UniRef50_C6CNW6 YD repeat protein n=8 Tax=Enterobacteriaceae Rep... 136 6e-31 UniRef50_UPI0001B56FBA YD repeat-containing protein n=1 Tax=Stre... 136 7e-31 UniRef50_C6AKX3 Rhs family protein n=4 Tax=Aggregatibacter aphro... 135 8e-31 UniRef50_B5S3P6 Probable rhs-related protein (Fragment) n=1 Tax=... 135 9e-31 UniRef50_C7Q0A7 YD repeat protein n=2 Tax=Catenulispora acidiphi... 135 9e-31 UniRef50_B4ETQ2 Rhs-family protein n=9 Tax=Enterobacteriaceae Re... 135 9e-31 UniRef50_D1PWX0 YD repeat protein n=1 Tax=Prevotella bergensis D... 135 1e-30 UniRef50_B5I7B2 Rhs repeat protein n=1 Tax=Streptomyces sviceus ... 135 1e-30 UniRef50_UPI00019F181C rhsC element core protein RshC n=2 Tax=En... 135 2e-30 UniRef50_B3X3P2 RhsH n=1 Tax=Shigella dysenteriae 1012 RepID=B3X... 134 2e-30 UniRef50_C6M9F4 Rhs family protein n=8 Tax=Neisseria RepID=C6M9F... 134 2e-30 UniRef50_B6A882 YenC2 n=1 Tax=Yersinia sp. MH-1 RepID=B6A882_9ENTR 134 2e-30 UniRef50_Q1K295 YD repeat n=4 Tax=Desulfuromonas acetoxidans DSM... 134 3e-30 UniRef50_C6CI62 YD repeat protein n=7 Tax=Enterobacteriaceae Rep... 134 3e-30 UniRef50_B8FBZ2 YD repeat protein n=1 Tax=Desulfatibacillum alke... 134 4e-30 UniRef50_UPI0001B4DD67 Rhs protein n=1 Tax=Streptomyces hygrosco... 133 4e-30 UniRef50_D1JME6 Rhs family protein n=22 Tax=Bacteroides RepID=D1... 133 4e-30 UniRef50_B4ETQ7 Putative Rhs-family protein n=3 Tax=Enterobacter... 133 4e-30 UniRef50_B4V251 LipX3 n=2 Tax=Streptomyces RepID=B4V251_9ACTO 133 4e-30 UniRef50_A9FZA9 Rhs family protein n=1 Tax=Sorangium cellulosum ... 133 4e-30 UniRef50_B3JL94 Putative uncharacterized protein n=1 Tax=Bactero... 133 5e-30 UniRef50_C7QG23 YD repeat protein n=2 Tax=Catenulispora acidiphi... 133 6e-30 UniRef50_A3UDD5 Wall associated protein n=1 Tax=Oceanicaulis ale... 132 6e-30 UniRef50_C6M9F8 RhsG core protein with extension n=1 Tax=Neisser... 132 7e-30 UniRef50_C8W2V9 YD repeat protein n=3 Tax=Desulfotomaculum aceto... 132 8e-30 UniRef50_C8W2T8 YD repeat protein n=2 Tax=Desulfotomaculum aceto... 132 9e-30 UniRef50_D1T3Q3 YD repeat protein (Fragment) n=2 Tax=Acidovorax ... 132 9e-30 UniRef50_Q9L0E3 Putative Rhs protein n=2 Tax=Streptomyces RepID=... 132 1e-29 UniRef50_Q73BZ0 Cell wall-associated protein n=4 Tax=Bacillus ce... 132 1e-29 UniRef50_A4SKJ3 Rhs family protein n=2 Tax=Bacteria RepID=A4SKJ3... 132 1e-29 UniRef50_UPI000196E06E Rhs family protein n=2 Tax=Neisseria muco... 132 1e-29 UniRef50_Q2SPP3 Rhs family protein n=1 Tax=Hahella chejuensis KC... 132 1e-29 UniRef50_B7GLY4 Rhs family protein n=3 Tax=Anoxybacillus flavith... 132 1e-29 UniRef50_D0KWY8 YD repeat protein n=1 Tax=Halothiobacillus neapo... 132 1e-29 UniRef50_A1WE65 Rhs family protein-like protein n=1 Tax=Verminep... 132 1e-29 UniRef50_A3NNM1 Protein RhsD n=20 Tax=pseudomallei group RepID=A... 131 1e-29 UniRef50_UPI000190F33A Rhs-family protein n=2 Tax=Salmonella ent... 131 2e-29 UniRef50_D1T3N4 YD repeat protein n=2 Tax=Betaproteobacteria Rep... 131 2e-29 UniRef50_B2HAQ4 RhsD protein n=4 Tax=Burkholderia RepID=B2HAQ4_B... 131 2e-29 UniRef50_Q88FK6 RHS family protein, putative n=1 Tax=Pseudomonas... 131 2e-29 UniRef50_Q0K1I3 Insecticidal toxin complex protein n=2 Tax=Prote... 130 2e-29 UniRef50_C5AA19 Rhs family protein n=1 Tax=Burkholderia glumae B... 130 2e-29 UniRef50_UPI00017448E8 YD repeat protein n=2 Tax=Verrucomicrobiu... 130 3e-29 UniRef50_D1VCV7 YD repeat protein n=1 Tax=Frankia sp. EuI1c RepI... 130 3e-29 UniRef50_D0KG38 Rhs family protein-like protein n=1 Tax=Pectobac... 130 4e-29 UniRef50_D1T3N5 YD repeat protein n=1 Tax=Acidovorax avenae subs... 130 4e-29 UniRef50_C2LFQ4 Putative uncharacterized protein n=1 Tax=Proteus... 130 5e-29 UniRef50_C0EPY1 Putative uncharacterized protein n=7 Tax=Neisser... 130 5e-29 UniRef50_A9AKU8 Type VI secretion system Vgr family protein n=10... 129 1e-28 UniRef50_C6CPH2 YD repeat protein n=11 Tax=Enterobacteriaceae Re... 129 1e-28 UniRef50_Q5TP09 AGAP009916-PA n=4 Tax=Anopheles gambiae RepID=Q5... 129 1e-28 UniRef50_C2VMF9 Wall-associated domain protein n=5 Tax=Bacillus ... 128 1e-28 UniRef50_C3GXN0 Wall associated protein n=1 Tax=Bacillus thuring... 128 1e-28 UniRef50_B0XCM0 SGS3 n=1 Tax=Culex quinquefasciatus RepID=B0XCM0... 128 2e-28 UniRef50_UPI000023D9A2 hypothetical protein FG10566.1 n=1 Tax=Gi... 128 2e-28 UniRef50_Q3YV37 Putative uncharacterized protein n=1 Tax=Shigell... 128 2e-28 UniRef50_C7Q0B8 YD repeat protein n=1 Tax=Catenulispora acidiphi... 127 2e-28 UniRef50_C4NV50 Rhs repeat family protein n=10 Tax=Gammaproteoba... 127 2e-28 UniRef50_D1S833 YD repeat protein n=1 Tax=Micromonospora auranti... 127 2e-28 UniRef50_A6GBQ3 Putative uncharacterized protein n=1 Tax=Plesioc... 127 2e-28 UniRef50_B8FIJ1 YD repeat protein n=1 Tax=Desulfatibacillum alke... 127 3e-28 UniRef50_D0BWK2 YD repeat protein n=1 Tax=Acinetobacter sp. RUH2... 127 3e-28 UniRef50_B5JS46 NHL repeat containing protein n=2 Tax=gamma prot... 127 3e-28 UniRef50_Q07833 Wall-associated protein n=18 Tax=Bacillaceae Rep... 127 3e-28 UniRef50_Q16U81 Putative uncharacterized protein n=1 Tax=Aedes a... 127 3e-28 UniRef50_D1SVF0 Rhs family protein (Fragment) n=1 Tax=Acidovorax... 127 3e-28 UniRef50_C9RZN2 YD repeat protein n=2 Tax=Geobacillus RepID=C9RZ... 127 4e-28 UniRef50_C4DNE5 Rhs family protein n=1 Tax=Stackebrandtia nassau... 127 4e-28 UniRef50_A5GE16 YD repeat protein n=1 Tax=Geobacter uraniireduce... 127 4e-28 UniRef50_D1W448 RHS repeat-associated core domain protein n=1 Ta... 126 6e-28 UniRef50_B4VFT3 Rhs protein n=4 Tax=Bacteria RepID=B4VFT3_9ACTO 126 6e-28 UniRef50_B3PHE6 RHS Repeat family n=5 Tax=cellular organisms Rep... 126 6e-28 UniRef50_B4EGW8 RHS-family protein n=9 Tax=Burkholderiaceae RepI... 126 6e-28 UniRef50_A7BNN3 Putative uncharacterized protein n=1 Tax=Beggiat... 126 8e-28 UniRef50_A4FJ21 YD repeat protein n=1 Tax=Saccharopolyspora eryt... 125 8e-28 UniRef50_A9EVR3 Conserved carbohydrate-binding protein, Rhs fami... 125 9e-28 UniRef50_UPI00016B0868 Rhs family protein n=1 Tax=Burkholderia p... 125 1e-27 UniRef50_C8QVK4 YD repeat protein n=4 Tax=Desulfurivibrio alkali... 125 1e-27 UniRef50_B3PEN8 Rhsfamily protein n=2 Tax=Cellvibrio japonicus U... 125 1e-27 UniRef50_Q0IFS2 Putative uncharacterized protein (Fragment) n=1 ... 125 1e-27 UniRef50_D1AA66 YD repeat-containing protein n=1 Tax=Thermomonos... 125 1e-27 UniRef50_C7BJB9 Insecticidal toxin complex TccC n=1 Tax=Photorha... 125 1e-27 UniRef50_B0SBE5 Putative uncharacterized protein n=2 Tax=Leptosp... 125 2e-27 UniRef50_A3KUM5 Rhs family protein n=10 Tax=Pseudomonas aerugino... 124 2e-27 UniRef50_A9GT29 Insecticidal toxin complex-like protein n=1 Tax=... 124 2e-27 UniRef50_Q4VQB1 SGS1 n=4 Tax=Aedes/Ochlerotatus group RepID=Q4VQ... 124 2e-27 UniRef50_Q395C2 Rhs family protein n=1 Tax=Burkholderia sp. 383 ... 124 3e-27 UniRef50_C7QAC6 YD repeat protein n=2 Tax=Bacteria RepID=C7QAC6_... 124 3e-27 UniRef50_D1PTA9 YD repeat protein n=1 Tax=Prevotella bergensis D... 124 3e-27 UniRef50_C4DNE3 Putative uncharacterized protein n=1 Tax=Stackeb... 124 3e-27 UniRef50_A9GIJ6 Conserved carbohydrate-binding protein, Rhs fami... 124 3e-27 UniRef50_C3IAE1 Insecticidal toxin complex protein TccC (Toxin c... 124 3e-27 UniRef50_B2AU51 Predicted CDS Pa_1_17960 n=1 Tax=Podospora anser... 124 4e-27 UniRef50_B7GX39 Protein rhsD n=4 Tax=Acinetobacter baumannii Rep... 123 4e-27 UniRef50_B0XA86 Putative uncharacterized protein n=1 Tax=Culex q... 123 4e-27 UniRef50_Q12SZ6 Putative uncharacterized protein n=1 Tax=Shewane... 123 5e-27 UniRef50_C4KA75 YD repeat protein n=1 Tax=Thauera sp. MZ1T RepID... 123 6e-27 UniRef50_D1VXD6 RHS repeat-associated core domain protein n=4 Ta... 123 6e-27 UniRef50_Q2Y592 Peptidase C39, bacteriocin processing n=1 Tax=Ni... 123 6e-27 UniRef50_Q72U39 Cytoplasmic membrane protein n=2 Tax=Leptospira ... 123 6e-27 UniRef50_C1B7W9 Putative uncharacterized protein n=1 Tax=Rhodoco... 122 7e-27 UniRef50_B2PYS3 Putative uncharacterized protein n=1 Tax=Provide... 122 7e-27 UniRef50_A6DLJ6 Putative uncharacterized protein n=1 Tax=Lentisp... 122 7e-27 UniRef50_C0B4J4 Putative uncharacterized protein n=3 Tax=Coproco... 122 7e-27 UniRef50_A9C2K3 YD repeat protein n=1 Tax=Delftia acidovorans SP... 122 7e-27 UniRef50_C4SEH6 Insecticidal toxin complex protein n=5 Tax=Yersi... 122 8e-27 UniRef50_D0HCV2 Rhs protein n=3 Tax=Vibrio mimicus VM223 RepID=D... 122 8e-27 UniRef50_A4ACT6 Putative uncharacterized protein n=1 Tax=Congreg... 122 9e-27 UniRef50_B6VUY0 Putative uncharacterized protein n=2 Tax=Bactero... 122 1e-26 UniRef50_C9PY20 YD repeat protein n=1 Tax=Prevotella sp. oral ta... 122 1e-26 UniRef50_C7HJS3 YD repeat protein (Fragment) n=3 Tax=Clostridium... 122 1e-26 UniRef50_A9GD22 Conserved carbohydrate-binding protein, Rhs fami... 122 1e-26 UniRef50_D0KZB0 YD repeat protein n=1 Tax=Halothiobacillus neapo... 122 1e-26 UniRef50_A1WM30 Putative uncharacterized protein n=1 Tax=Vermine... 122 1e-26 UniRef50_B6A881 YenC1 n=1 Tax=Yersinia sp. MH-1 RepID=B6A881_9ENTR 122 1e-26 UniRef50_Q2SIG5 Rhs family protein n=1 Tax=Hahella chejuensis KC... 122 1e-26 UniRef50_D2KTW4 Putative uncharacterized protein n=2 Tax=Strepto... 121 2e-26 UniRef50_C0FSB4 Putative uncharacterized protein n=1 Tax=Rosebur... 121 2e-26 UniRef50_B3EU05 Putative uncharacterized protein n=1 Tax=Candida... 121 2e-26 UniRef50_Q8PMX0 Wall-associated protein n=2 Tax=Xanthomonas RepI... 121 2e-26 UniRef50_A3DF74 YD repeat protein n=17 Tax=Clostridium thermocel... 121 2e-26 UniRef50_Q4ZP55 YD repeat n=10 Tax=Pseudomonas syringae group Re... 121 2e-26 UniRef50_Q3JSF4 RhsD protein n=23 Tax=Burkholderia pseudomallei ... 121 2e-26 UniRef50_B0WKJ0 Putative uncharacterized protein n=1 Tax=Culex q... 121 2e-26 UniRef50_B1HM94 Cell wall-associated protein n=5 Tax=Lysinibacil... 121 2e-26 UniRef50_D1WVZ1 YD repeat protein n=1 Tax=Streptomyces sp. ACT-1... 121 2e-26 UniRef50_A8ZSG6 YD repeat protein n=1 Tax=Desulfococcus oleovora... 121 2e-26 UniRef50_C3IG21 Wall associated protein n=2 Tax=Bacillus cereus ... 121 2e-26 UniRef50_C0FSB7 Putative uncharacterized protein n=1 Tax=Rosebur... 121 2e-26 UniRef50_C4DPW8 Rhs family protein n=1 Tax=Stackebrandtia nassau... 120 3e-26 UniRef50_C3LC21 Wall-associated domain protein n=18 Tax=Bacillus... 120 3e-26 UniRef50_C7IND8 YD repeat protein (Fragment) n=1 Tax=Clostridium... 120 3e-26 UniRef50_Q73C66 Wall associated protein, putative n=73 Tax=Bacil... 120 4e-26 UniRef50_B8CMS0 Putative uncharacterized protein n=1 Tax=Shewane... 120 4e-26 UniRef50_B2Q762 Putative uncharacterized protein n=2 Tax=Provide... 120 4e-26 UniRef50_C3JCI6 Rhs family protein n=4 Tax=Bacteria RepID=C3JCI6... 120 4e-26 UniRef50_B2PW71 Putative uncharacterized protein n=1 Tax=Provide... 120 5e-26 UniRef50_Q2T5B9 YD repeat protein n=39 Tax=Proteobacteria RepID=... 120 5e-26 Sequences not found previously or not previously below threshold: >UniRef50_P77759 Putative uncharacterized protein ylbH n=12 Tax=Escherichia coli RepID=YLBH_ECOLI Length = 236 Score = 292 bits (748), Expect = 5e-78, Method: Composition-based stats. Identities = 235/235 (100%), Positives = 235/235 (100%) Query: 2 LALMDADGNIAWSGEYDEWGNQLNEENPHHLHQPYRLPGQQYDKESGLYYNRNRYYDPLQ 61 LALMDADGNIAWSGEYDEWGNQLNEENPHHLHQPYRLPGQQYDKESGLYYNRNRYYDPLQ Sbjct: 2 LALMDADGNIAWSGEYDEWGNQLNEENPHHLHQPYRLPGQQYDKESGLYYNRNRYYDPLQ 61 Query: 62 GRYITQDPIGLEGGWSLYAYPLNPVNGIDPLGLSPADVALIRRKDQLNHQRAWDILSDTY 121 GRYITQDPIGLEGGWSLYAYPLNPVNGIDPLGLSPADVALIRRKDQLNHQRAWDILSDTY Sbjct: 62 GRYITQDPIGLEGGWSLYAYPLNPVNGIDPLGLSPADVALIRRKDQLNHQRAWDILSDTY 121 Query: 122 EDMKRLNLGGTDQFFHCMAFCRVSKLNDAGVSRSAKGLGYEKEIRDYGLNLFGMYGRKVK 181 EDMKRLNLGGTDQFFHCMAFCRVSKLNDAGVSRSAKGLGYEKEIRDYGLNLFGMYGRKVK Sbjct: 122 EDMKRLNLGGTDQFFHCMAFCRVSKLNDAGVSRSAKGLGYEKEIRDYGLNLFGMYGRKVK 181 Query: 182 LSHSEMIEDNKKDLAVNDHGLTCPSTTDCSDRCSDYINPEHKKTIKALQDAGYLK 236 LSHSEMIEDNKKDLAVNDHGLTCPSTTDCSDRCSDYINPEHKKTIKALQDAGYLK Sbjct: 182 LSHSEMIEDNKKDLAVNDHGLTCPSTTDCSDRCSDYINPEHKKTIKALQDAGYLK 236 >UniRef50_UPI0001B52595 rhsE element core protein RshE n=1 Tax=Escherichia sp. 4_1_40B RepID=UPI0001B52595 Length = 273 Score = 191 bits (484), Expect = 2e-47, Method: Composition-based stats. Identities = 95/231 (41%), Positives = 120/231 (51%), Gaps = 30/231 (12%) Query: 2 LALMDADGNIAWSGEYDEWGNQLNEENPHHLHQPYRLPGQQYDKESGLYYNRNRYYDPLQ 61 LAL+ DGN AW GEYDEWGNQLNEENPHHLHQPYRLPGQQ+D+ESGLYYNR+R+YDPLQ Sbjct: 32 LALISEDGNTAWRGEYDEWGNQLNEENPHHLHQPYRLPGQQHDEESGLYYNRHRHYDPLQ 91 Query: 62 GRYITQDPIGLEGGWSLYAYPLNPVNGIDPLGLSPADVALIRRKDQLNHQ---------- 111 GRYIT DPIGL GGW++Y YPLNP+ IDP+GL + Sbjct: 92 GRYITPDPIGLRGGWNMYQYPLNPIQVIDPMGLDAIENMTSGGLIYAVSGVPGLIAANSI 151 Query: 112 --RAWDILSDTYEDMKRLNLGGTDQFFHCMAFCRVSKLNDAGVSRSAKGLGYEKEIRDYG 169 A+ D + + G D HC CR++K + + Sbjct: 152 TNSAYQFGYDMDAIVGGAHNGAADAMRHCYLMCRMTKTFGSTI----------------- 194 Query: 170 LNLFGMYGRKVKLSHSEMIEDNKKDLAVNDHGLTCPS-TTDCSDRCSDYIN 219 ++ G + ++ DL N G+ C + CSD C + N Sbjct: 195 ADVIGKNHEAAGDRQGQPAKERIMDLKNNTVGIACGDFSAKCSDACIEKYN 245 >UniRef50_Q7MDR0 Rhs family protein n=3 Tax=Vibrio vulnificus RepID=Q7MDR0_VIBVY Length = 1498 Score = 167 bits (423), Expect = 2e-40, Method: Composition-based stats. Identities = 53/178 (29%), Positives = 76/178 (42%), Gaps = 6/178 (3%) Query: 2 LALMDADGNIAWSGEYDEWGNQLNEENPHHLHQPYRLPGQQYDKESGLYYNRNRYYDPLQ 61 LAL+D G++ W YD +G E + P R GQ +D E+GL+YN RYYDP Sbjct: 1039 LALVDEQGSVVWQARYDTYGRAHIE--VESVGNPLRFQGQYHDVETGLHYNLARYYDPRT 1096 Query: 62 GRYITQDPIGLEGGWSLYAYPLNPVNGIDPLGLSPAD--VALIRRKDQLNHQRAWDILSD 119 GR+I DPIGL GG + Y Y NPV +DP GL + A+ + Q+ Sbjct: 1097 GRFIQPDPIGLLGGINHYQYAPNPVMWVDPHGLCAKEGSPAIKAGVNDTQPQQPMFYAMG 1156 Query: 120 TYEDMKRLNLGGTDQFFHCMA--FCRVSKLNDAGVSRSAKGLGYEKEIRDYGLNLFGM 175 + + H +A + ++ A + G D L G+ Sbjct: 1157 SGNYASAVKTASPTYQLHAIAPDAIQQVGIDYALGATEMLAAGAYNTAVDAVAGLAGL 1214 >UniRef50_Q48LL6 Rhs family protein n=1 Tax=Pseudomonas syringae pv. phaseolicola 1448A RepID=Q48LL6_PSE14 Length = 362 Score = 164 bits (416), Expect = 2e-39, Method: Composition-based stats. Identities = 56/185 (30%), Positives = 76/185 (41%), Gaps = 5/185 (2%) Query: 2 LALMDADGNIAWSGEYDEWGNQLNEENPHHLHQPYRLPGQQYDKESGLYYNRNRYYDPLQ 61 L + DA+G I W +Y WG + + + Q R GQ +D E+GL+YN RYYDP Sbjct: 138 LEMTDAEGQIVWQAKYRAWGAV-EKLVVNEVEQNLRFQGQYFDVETGLHYNTFRYYDPEI 196 Query: 62 GRYITQDPIGLEGGWSLYAYPLNPVNGIDPLGLSPADVALIRRKDQLNHQRAWDILSDTY 121 GR+ITQDPIGL+GG +LY Y NP +DP G + + D H Sbjct: 197 GRFITQDPIGLDGGDNLYKYVPNPTAWVDPWGWACNRPGGYKSGDVDTHGNLS---PGVN 253 Query: 122 EDMKRLNLGGTDQFF-HCMAFCRVSKLNDAGVSRSAKGLGYEKEIRDYGLNLFGMYGRKV 180 N+ H +K N AG R+A K + R Sbjct: 254 RAPGNKNIPSDKSVQSHHFIQDEWAKRNVAGYKRNAAPAVLLKSSSGESHAIVSSLQRTR 313 Query: 181 KLSHS 185 + Sbjct: 314 RRLGG 318 >UniRef50_B5MRU6 Rhs-family protein n=9 Tax=Gammaproteobacteria RepID=B5MRU6_SALET Length = 216 Score = 164 bits (415), Expect = 2e-39, Method: Composition-based stats. Identities = 60/216 (27%), Positives = 90/216 (41%), Gaps = 8/216 (3%) Query: 4 LMDADGNIAWSGEYDEWGNQLNEENPHHLHQPYRLPGQQYDKESGLYYNRNRYYDPLQGR 63 + D GNI W Y WGN +E+ + Q R GQ D+E+GL+YN R+YDP G+ Sbjct: 1 MTDGGGNIVWEAGYQVWGNLTHEKETRPVQQNLRFQGQYLDRETGLHYNLYRFYDPDIGK 60 Query: 64 YITQDPIGLEGGWSLYAYPLNPVNGIDPLGLSPADVALIRRKDQLNHQRAWD---ILSDT 120 +I+ DPI + GG +LY Y NP+ IDPLGL + K + H+ D Sbjct: 61 FISGDPISIRGGINLYQYAPNPIKWIDPLGLYNGEGQRELGKYHVFHEHNLDITEYGLSD 120 Query: 121 YEDMKRLNLGGTDQFFHCMAFCRVSKLNDAGVSRSAKGLGYEKEIRDYGLNLFGMYGRKV 180 E R N +++ + AF R + GV + K + L + K Sbjct: 121 AEHFSRGNQAISERMKNDPAFRREMQTKYPGVVEHVQPSSGGKFSTESPPGLTWHHENKP 180 Query: 181 KLSHSEMIEDNKKDLAVNDHGLTCPSTTDCSDRCSD 216 + D+K H + P + + Sbjct: 181 GVLSLVDRLDHKT-----YHKIYHPDGSGGRKKWGG 211 >UniRef50_Q4ZLF3 YD repeat n=5 Tax=Pseudomonas syringae group RepID=Q4ZLF3_PSEU2 Length = 451 Score = 164 bits (415), Expect = 2e-39, Method: Composition-based stats. Identities = 62/211 (29%), Positives = 90/211 (42%), Gaps = 7/211 (3%) Query: 2 LALMDADGNIAWSGEYDEWGNQLNEENPHHLHQPYRLPGQQYDKESGLYYNRNRYYDPLQ 61 L + DA+G I W +Y WG + + + Q R GQ +D E+GL+YN RYYDP Sbjct: 222 LEMTDAEGQIVWQAKYRAWGAV-EKLVVNEVEQNLRFQGQYFDAETGLHYNTFRYYDPEI 280 Query: 62 GRYITQDPIGLEGGWSLYAYPLNPVNGIDPLGLSPADVALIRRKDQLNHQRAWDILSDTY 121 GR+ITQDPIGL GG++LY Y NP IDPLGL + L H RA D S + Sbjct: 281 GRFITQDPIGLLGGFNLYGYCRNPTAWIDPLGLD-WNYFLSNEDGAYYHGRASDKTSLSD 339 Query: 122 EDMKRLNLGGTDQFF---HCMAFCRVSKLNDAGVSRSAKGLGY--EKEIRDYGLNLFGMY 176 + G D K D R + G + + + G Sbjct: 340 VMRRHGKNKGVDGARFRAGDTITQVTPKGTDIDTVRGIENAGVREKPILGRGSPKVRGNT 399 Query: 177 GRKVKLSHSEMIEDNKKDLAVNDHGLTCPST 207 + + ++ + + + A N + + Sbjct: 400 IQGISDANLDTPKGIVRTNAANSYLESHGVN 430 >UniRef50_D0KEV6 RHS protein n=2 Tax=Enterobacteriaceae RepID=D0KEV6_PECWW Length = 1348 Score = 163 bits (412), Expect = 5e-39, Method: Composition-based stats. Identities = 51/115 (44%), Positives = 66/115 (57%), Gaps = 5/115 (4%) Query: 3 ALMDADGNIAWSGEYDEWGNQLNEE-----NPHHLHQPYRLPGQQYDKESGLYYNRNRYY 57 + DG + W EY WGN + E + +HQP R GQ +D E+GL+YNR RYY Sbjct: 1111 EMTGQDGGLVWRAEYRVWGNTVRVEQVEVPHSEPIHQPLRYQGQYFDAETGLHYNRFRYY 1170 Query: 58 DPLQGRYITQDPIGLEGGWSLYAYPLNPVNGIDPLGLSPADVALIRRKDQLNHQR 112 DP GR+++QDPIGL GG +LY Y NP+ IDPLGL+P RK + Sbjct: 1171 DPDAGRFVSQDPIGLAGGINLYQYAPNPITWIDPLGLTPCMGLPSSRKAGGTGGK 1225 >UniRef50_A9AE26 Rhs family protein n=10 Tax=cellular organisms RepID=A9AE26_BURM1 Length = 1547 Score = 162 bits (410), Expect = 8e-39, Method: Composition-based stats. Identities = 59/220 (26%), Positives = 93/220 (42%), Gaps = 15/220 (6%) Query: 4 LMDADGNIAWSGEYDEWGN------QLNEENPHHLHQPYRLPGQQYDKESGLYYNRNRYY 57 L D DG++ W Y WG + ++ R GQQ D+E+GL+YNR RYY Sbjct: 1319 LTDDDGDVVWEASYKAWGEAREVIARASKAAGIVARNSLRFQGQQEDEETGLHYNRYRYY 1378 Query: 58 DPLQGRYITQDPIGLEGGWSLYAYPLNPVNGIDPLGLSPADVALIRRKDQLNHQRAWDIL 117 DP GR++++DP+G+ GG ++Y Y N V +DPLGL + + D L Sbjct: 1379 DPNSGRFVSKDPVGMVGGINVYQYAPNAVAWVDPLGLRKRIGCPGKFHSFHDFDLPQDKL 1438 Query: 118 SDTY-EDMKRLNLGGTDQFFHCMAFCRVSKLNDAGVSRSAKGLGYEKEIRDYGLNLFGMY 176 + + N ++ AF R + + L + K+ +D G + GM Sbjct: 1439 FASDGVQFRLANKALIERMNTDEAFRRNLLSRNPAL------LDWSKKAKDLGSSPPGMT 1492 Query: 177 GRKVKLSHSEMIEDNKKDLAVNDHGLTCPSTTDCSDRCSD 216 + ++ D N HG+ P T D+ Sbjct: 1493 WHH-NDEVGRLNLVDRSDHGDN-HGIYHPDGTGGRDKWGG 1530 >UniRef50_B2HXH5 Rhs family protein n=5 Tax=Acinetobacter baumannii RepID=B2HXH5_ACIBC Length = 1635 Score = 161 bits (408), Expect = 1e-38, Method: Composition-based stats. Identities = 51/160 (31%), Positives = 75/160 (46%), Gaps = 11/160 (6%) Query: 3 ALMDADGNIAWSGEYDEWGNQLNE-------ENPHHLHQPYRLPGQQYDKESGLYYNRNR 55 + D G I W EY WG E E + R GQ +D+E+GL+YNR R Sbjct: 1382 EMSDQTGAIIWKAEYKAWGECKLEQTNSDFFEKSEIISNNIRFQGQYFDEETGLHYNRYR 1441 Query: 56 YYDPLQGRYITQDPIGLEGGWSLYAYPLNPVNGIDPLGLSPADVALIRR----KDQLNHQ 111 YY P GR+I++DPIGL GG+++YAY NPV +DP GL+P + + + Q Sbjct: 1442 YYSPYVGRFISKDPIGLLGGFNVYAYTANPVQWVDPYGLAPCSLVRYKPDKVTPQAGSRQ 1501 Query: 112 RAWDILSDTYEDMKRLNLGGTDQFFHCMAFCRVSKLNDAG 151 A D + + + GT + + + +G Sbjct: 1502 DAIDRAWSLEKQLIQTTGTGTRDWSKAELDTILRTPSGSG 1541 >UniRef50_B0VRR7 Putative uncharacterized protein n=1 Tax=Acinetobacter baumannii SDF RepID=B0VRR7_ACIBS Length = 296 Score = 161 bits (407), Expect = 2e-38, Method: Composition-based stats. Identities = 55/235 (23%), Positives = 78/235 (33%), Gaps = 13/235 (5%) Query: 3 ALMDADGNIAWSGEYDEWGNQLNE-------ENPHHLHQPYRLPGQQYDKESGLYYNRNR 55 + D G I W EY WG E EN + R GQ +D+E+GL+YNR R Sbjct: 56 EMTDHTGAIIWKAEYKAWGECKAEKAKSNFFENSEIISNNIRFQGQYFDEETGLHYNRYR 115 Query: 56 YYDPLQGRYITQDPIGLEGGWSLYAYPLNPVNGIDPLGLSPADVALIRRKDQLNHQRAWD 115 YY P GR++++DPIGL GG + YAY +P +DPLGLS + + + Sbjct: 116 YYSPYVGRFVSKDPIGLLGGSNNYAYAPSPTEWVDPLGLSCQKIPKGFKS----FGQLKQ 171 Query: 116 ILSDTYEDMKRLNLGGTDQFFHCMAFCRVSKLNDAGVSRSAKGLGYEKEIRDYGLNLFGM 175 +L G + A S ++ Sbjct: 172 FGQAMQAGFSKLGFKGATMYMQGSAHSGRSFETGKAFDDGRVSDFDIAVVQPELFEKAKK 231 Query: 176 YGRKVKLSH--SEMIEDNKKDLAVNDHGLTCPSTTDCSDRCSDYINPEHKKTIKA 228 G E+ L +N D + KA Sbjct: 232 MGIAKGNRTLPIEINSVEANKLGINGVLQKMSKLAGGRDVNVMIFDSPESAKAKA 286 >UniRef50_Q2SPP2 Rhs family protein n=3 Tax=Hahella chejuensis KCTC 2396 RepID=Q2SPP2_HAHCH Length = 1434 Score = 161 bits (406), Expect = 3e-38, Method: Composition-based stats. Identities = 59/216 (27%), Positives = 88/216 (40%), Gaps = 10/216 (4%) Query: 3 ALMDADGNIAWSGEYDEWGNQLNEENPHHLHQPYRLPGQQYDKESGLYYNRNRYYDPLQG 62 + D++G + WS Y +G ++ +H P R GQ YD+E+G +YNR+RYYDP G Sbjct: 1124 EMTDSEGTLVWSARYKAYGALAL-QDVESVHNPLRFQGQYYDEETGFHYNRHRYYDPQSG 1182 Query: 63 RYITQDPIGLEGGWSLYAYPLNPVNGIDPLGLSPADVALIRRKDQLN---------HQRA 113 R+I QDPIGL GG + Y Y NPV +DP GL+ L QRA Sbjct: 1183 RFINQDPIGLLGGANAYQYAPNPVGWVDPFGLTAKPGDCPPTSPALKDSVYSAENIAQRA 1242 Query: 114 WDILSDTYEDMKRLNLGGTDQFFHCMAFCRVSKLNDAGVSRSAKGLGYEKEIRDYGLNLF 173 + D+K + Q + + K + ++ G Sbjct: 1243 AEFNYLRELDIKNNAMLSAQQRAKLIMEDIAQMPALKSIIAKVKEMDPSAKVGFRGSMTT 1302 Query: 174 GMYGRKVKLSHSEMIEDNKKDLAVNDHGLTCPSTTD 209 GM G + ++ ++ ND T D Sbjct: 1303 GMKGPHKLGQAKQRVKFDESVAFKNDKPYTDEQGWD 1338 >UniRef50_D1YPP7 RHS repeat-associated core domain protein n=1 Tax=Veillonella parvula ATCC 17745 RepID=D1YPP7_9FIRM Length = 237 Score = 159 bits (402), Expect = 8e-38, Method: Composition-based stats. Identities = 46/210 (21%), Positives = 77/210 (36%), Gaps = 2/210 (0%) Query: 3 ALMDADGNIAWSGEYDEWGNQLNEEN-PHHLHQPYRLPGQQYDKESGLYYNRNRYYDPLQ 61 + D +GN+ W EY W HQP+RL Q D+E+GL+YN RYY+P Sbjct: 25 EMTDKEGNLFWYVEYTIWARLKEATKVTDSAHQPFRLQNQYADRETGLHYNLMRYYEPEA 84 Query: 62 GRYITQDPIGLEGGWSLYAYPLNPVNGIDPLGLSPADVALIRRKDQLNHQRAWDILSDTY 121 GR++ QDPIGL G +LY + N +DPLG V+ + + D Sbjct: 85 GRFVNQDPIGLWGEENLYQFAPNATMWLDPLGWKGVTVSKASKGSAFDFILKLDSSV-YL 143 Query: 122 EDMKRLNLGGTDQFFHCMAFCRVSKLNDAGVSRSAKGLGYEKEIRDYGLNLFGMYGRKVK 181 E + + + R + ++ +++F G+ Sbjct: 144 ETAQHIKDAIASGKPSVVTIDRKGAAGRRKEVLKGTKCAKGTDRDEWPMSMFKEGGKGAS 203 Query: 182 LSHSEMIEDNKKDLAVNDHGLTCPSTTDCS 211 + ++ ++ P Sbjct: 204 IRKISPSDNRGAGSSIGHALSDIPDNAKIK 233 >UniRef50_Q2SFR5 Rhs family protein n=1 Tax=Hahella chejuensis KCTC 2396 RepID=Q2SFR5_HAHCH Length = 471 Score = 159 bits (402), Expect = 8e-38, Method: Composition-based stats. Identities = 42/96 (43%), Positives = 58/96 (60%), Gaps = 1/96 (1%) Query: 3 ALMDADGNIAWSGEYDEWGNQLNEENPHHLHQPYRLPGQQYDKESGLYYNRNRYYDPLQG 62 + +A+G + WS Y +GN + + + P R GQ YD+E+GL+YNR RYYDP Sbjct: 45 EMTNAEGEVVWSARYKAYGNLALK-DVEDVQNPLRFQGQYYDEETGLHYNRRRYYDPSAA 103 Query: 63 RYITQDPIGLEGGWSLYAYPLNPVNGIDPLGLSPAD 98 R+I QDP+GL GG + Y Y LNP +DP GL+ Sbjct: 104 RFINQDPVGLLGGDNNYQYALNPTGWVDPYGLTAKP 139 >UniRef50_A1U3R9 YD repeat protein n=3 Tax=Gammaproteobacteria RepID=A1U3R9_MARAV Length = 1611 Score = 159 bits (401), Expect = 1e-37, Method: Composition-based stats. Identities = 62/229 (27%), Positives = 94/229 (41%), Gaps = 24/229 (10%) Query: 3 ALMDADGNIAWSGEYDEWGNQLNEENPHHLHQPYRLPGQQYDKESGLYYNRNRYYDPLQG 62 L + G + WS Y +GN + ++ + P R GQ +D E+GL+YNR+RYY+P G Sbjct: 1367 ELTNQQGRLVWSVTYRAYGNVVQQQVAE-IDNPLRFQGQYHDPETGLHYNRHRYYNPNTG 1425 Query: 63 RYITQDPIGLEGGWSLYAYPLNPVNGIDPLGLS---------------------PADVAL 101 R+IT DPIGL GG + Y Y NP +DPLGL+ P Sbjct: 1426 RFITPDPIGLAGGLNNYQYVPNPTGWVDPLGLASRQKNDPDSAGSPEGIGGDSGPFGGEP 1485 Query: 102 IRRKDQLNHQRAWDILSDTYEDMKRLNLGGTD--QFFHCMAFCRVSKLNDAGVSRSAKGL 159 + + I ++ D ++N+ G D Q + + G R + Sbjct: 1486 KTPESEPVGLSGGKIPWGSWNDYDKVNVNGQDYAQVGNRLYSEHAVNRMQPGRRRHSSRG 1545 Query: 160 GYEKEIRDYGLNLFGMYGRKVKLSHSEMIEDNKKDLAVNDHGLTCPSTT 208 G Y G YGR V + E + K + + L+ S + Sbjct: 1546 GTGGLPEIYLAGTHGDYGRGVAPQYVEDAISSAKAVQQQNGNLSYLSGS 1594 >UniRef50_C0Q6Q9 Rhs-family protein n=27 Tax=Enterobacteriaceae RepID=C0Q6Q9_SALPC Length = 1593 Score = 158 bits (399), Expect = 1e-37, Method: Composition-based stats. Identities = 55/204 (26%), Positives = 79/204 (38%), Gaps = 5/204 (2%) Query: 3 ALMDADGNIAWSGEYDEWGNQLNEENPHHLHQPYRLPGQQYDKESGLYYNRNRYYDPLQG 62 + D GNI W Y WGN +E+ + Q R GQ D+E+GL+YN R+YDP G Sbjct: 1354 EMTDGGGNIVWEAGYQVWGNLTHEKETRPVQQNLRFQGQYLDRETGLHYNLYRFYDPDIG 1413 Query: 63 RYITQDPIGLEGGWSLYAYPLNPVNGIDPLGLSPADVALIRRKDQLNHQRAWDILSDTYE 122 ++I+ DPI L GG +LYAY NP++ IDPLGL + D + ++ Sbjct: 1414 KFISGDPISLRGGINLYAYAPNPISWIDPLGLRKSCSGFSSADDAARSALSKYNPMSIFK 1473 Query: 123 DMKRLNLGGTDQFFHCMAFCRVSKLNDAGVSRSAKGLGYEKEIRDYGLNLFGMYGRKVKL 182 + + + + + G K G G Y Sbjct: 1474 NREYGGIIFRAKDGSYGYTRGRLGTGRTAPTFKDSAGGLPK-----GSTPVGQYHTHGDY 1528 Query: 183 SHSEMIEDNKKDLAVNDHGLTCPS 206 S S NK N + Sbjct: 1529 SDSGFNRTNKAGDYHNSDQFSSKD 1552 >UniRef50_Q13ML3 YD repeat protein n=10 Tax=Proteobacteria RepID=Q13ML3_BURXL Length = 1531 Score = 157 bits (398), Expect = 2e-37, Method: Composition-based stats. Identities = 57/196 (29%), Positives = 81/196 (41%), Gaps = 8/196 (4%) Query: 3 ALMDADGNIAWSGEYDEWGNQLNEENPH--HLHQPYRLPGQQYDKESGLYYNRNRYYDPL 60 L ADG I W Y WGN + E P + Q R GQ D+E+GL+YN RYYDP Sbjct: 1311 ELTSADGRIVWQAMYQLWGNTVRESEPESYAVRQNLRYQGQYLDRETGLHYNTLRYYDPD 1370 Query: 61 QGRYITQDPIGLEGGWSLYAYPLNPVNGIDPLGLSPADVALIRRKDQLNHQRAWDILSDT 120 GR+ T DPIGL GG +LY Y NP++ IDP+GLS L ++QL+ + + + Sbjct: 1371 IGRFTTPDPIGLAGGVNLYRYAPNPMSWIDPMGLSCTS-NLKDIENQLSRGKGATVTVSS 1429 Query: 121 YEDMKRLNLGGTDQFFHCMAFCRVSKL-----NDAGVSRSAKGLGYEKEIRDYGLNLFGM 175 + + L T R + + + D+ G Sbjct: 1430 KAEAEELLRAYTSGPAGRRGAFRNTTGQEFPRDLFDPEKVVGPGSIPHNASDWLPAGRGA 1489 Query: 176 YGRKVKLSHSEMIEDN 191 + R+ Sbjct: 1490 FQREGTYHWDAANAGA 1505 >UniRef50_Q1LDW7 YD repeat n=2 Tax=Burkholderiaceae RepID=Q1LDW7_RALME Length = 1626 Score = 157 bits (397), Expect = 3e-37, Method: Composition-based stats. Identities = 48/170 (28%), Positives = 69/170 (40%), Gaps = 14/170 (8%) Query: 3 ALMDADGNIAWSGEYDEWGN---QLNEENPHHLHQPYRLPGQQYDKESGLYYNRNRYYDP 59 L+D G + W Y WG +P R PGQ +D E+GL+YNR+RYYDP Sbjct: 1413 ELVDESGKVVWLARYKAWGGLKTPRKSTDPTETTNAIRFPGQYHDVETGLHYNRHRYYDP 1472 Query: 60 LQGRYITQDPIGLEGGWSLYAYPLNPVNGIDPLGLSPADVA-----------LIRRKDQL 108 GR+I++DP+GL GG ++Y Y NPV +DPLGL A + Sbjct: 1473 GSGRFISKDPVGLAGGINVYTYAPNPVGWVDPLGLRCDSPADKLARKLRALQKAQGNAAS 1532 Query: 109 NHQRAWDILSDTYEDMKRLNLGGTDQFFHCMAFCRVSKLNDAGVSRSAKG 158 + ++ L G T H + + + Sbjct: 1533 ARGFPDGRIRYYDTEVPALTQGPTRGRSHVTEWNPATGQVRSWEETYNHA 1582 >UniRef50_B2VH58 Rhs family protein n=5 Tax=Enterobacteriaceae RepID=B2VH58_ERWT9 Length = 1322 Score = 157 bits (397), Expect = 3e-37, Method: Composition-based stats. Identities = 53/178 (29%), Positives = 78/178 (43%), Gaps = 15/178 (8%) Query: 3 ALMDADGNIAWSGEYDEWGNQLN------EENPHHLHQPYRLPGQQYDKESGLYYNRNRY 56 L + G+I W +Y WGN E + +HQP R GQ +D E+GL+YNR RY Sbjct: 1107 ELSNRSGDICWQADYRVWGNTRQVSYAQQEADAETIHQPLRYQGQYFDGETGLHYNRFRY 1166 Query: 57 YDPLQGRYITQDPIGLEGGWSLYAYPLNPVNGIDPLGLSPADVALIRRKDQLNHQRAWDI 116 YDP GR+I++DP+GL GG +LY Y NP +DPLGL + + ++ Sbjct: 1167 YDPDIGRFISRDPVGLSGGMNLYQYAPNPYGWVDPLGLMKCSP---------HKKTTYEG 1217 Query: 117 LSDTYEDMKRLNLGGTDQFFHCMAFCRVSKLNDAGVSRSAKGLGYEKEIRDYGLNLFG 174 +S + G H R + G K + + + + G Sbjct: 1218 VSRRDALRQAKRDAGIPNNQHPYEVSRAKLGDGYGDHVRDKNGIPLQTRQYHFKDQNG 1275 >UniRef50_Q87U70 Rhs family protein n=2 Tax=Pseudomonas RepID=Q87U70_PSESM Length = 1572 Score = 155 bits (391), Expect = 1e-36, Method: Composition-based stats. Identities = 47/165 (28%), Positives = 68/165 (41%), Gaps = 1/165 (0%) Query: 3 ALMDADGNIAWSGEYDEWGNQLNEENPHHLHQPYRLPGQQYDKESGLYYNRNRYYDPLQG 62 L D G I WS +Y +GN + + P R GQ +D E+GL+YNR+RYY+P G Sbjct: 1321 ELTDYSGEIMWSAKYRAYGNLATLDIAE-IENPLRFQGQYFDAETGLHYNRHRYYNPGTG 1379 Query: 63 RYITQDPIGLEGGWSLYAYPLNPVNGIDPLGLSPADVALIRRKDQLNHQRAWDILSDTYE 122 R++T DPI L GG + Y Y NP +DPLGLS + K + + D E Sbjct: 1380 RFLTPDPIKLAGGLNNYQYVPNPTGWVDPLGLSGECPDSEKNKIPVEEASSPSNSYDNSE 1439 Query: 123 DMKRLNLGGTDQFFHCMAFCRVSKLNDAGVSRSAKGLGYEKEIRD 167 +++ G + + Sbjct: 1440 ELRPPESNGRRYAAGKHYVNPQDINFSQRGVHGNEYEAKMSQGDW 1484 >UniRef50_C5CZG8 RHS protein n=1 Tax=Variovorax paradoxus S110 RepID=C5CZG8_VARPS Length = 1609 Score = 154 bits (390), Expect = 2e-36, Method: Composition-based stats. Identities = 45/109 (41%), Positives = 61/109 (55%), Gaps = 4/109 (3%) Query: 3 ALMDADGNIAWSGEYDEWGN----QLNEENPHHLHQPYRLPGQQYDKESGLYYNRNRYYD 58 L D +G IAWS +Y WG P R GQ +D E+GL+YNR+RYYD Sbjct: 1389 ELTDHEGRIAWSAQYKAWGEAKQAISEAGRKAGFRNPIRFQGQYFDDETGLHYNRHRYYD 1448 Query: 59 PLQGRYITQDPIGLEGGWSLYAYPLNPVNGIDPLGLSPADVALIRRKDQ 107 P GR++++DPIGL GG +L Y NP+ IDPLGL+ + + + Sbjct: 1449 PSCGRFVSKDPIGLAGGSNLQQYAPNPLGWIDPLGLAGKGITPNNKGTR 1497 >UniRef50_Q2SFS1 Rhs family protein n=1 Tax=Hahella chejuensis KCTC 2396 RepID=Q2SFS1_HAHCH Length = 295 Score = 154 bits (389), Expect = 2e-36, Method: Composition-based stats. Identities = 47/125 (37%), Positives = 71/125 (56%), Gaps = 2/125 (1%) Query: 4 LMDADGNIAWSGEYDEWGNQLNEENPHHLHQPYRLPGQQYDKESGLYYNRNRYYDPLQGR 63 + +A+G + WS Y +GN ++ + P R GQ YD+E+GL+YNR RYYDP R Sbjct: 1 MTNAEGEVVWSARYKAYGNLAL-QDVEDVQNPLRFQGQYYDEETGLHYNRRRYYDPSAAR 59 Query: 64 YITQDPIGLEGGWSLYAYPLNPVNGIDPLGLSPADVALIR-RKDQLNHQRAWDILSDTYE 122 +I QDP+GL GG + Y Y NP +DP GL+ + + + +KD H + +Y+ Sbjct: 60 FINQDPVGLLGGDNNYQYAPNPTGWVDPYGLTCKENSWNQFQKDTKGHFANSTEAAKSYQ 119 Query: 123 DMKRL 127 MK + Sbjct: 120 KMKEV 124 >UniRef50_Q0JZD5 RHS family protein n=9 Tax=Bacteria RepID=Q0JZD5_RALEH Length = 1585 Score = 154 bits (388), Expect = 3e-36, Method: Composition-based stats. Identities = 45/108 (41%), Positives = 63/108 (58%), Gaps = 4/108 (3%) Query: 3 ALMDADGNIAWSGEYDEWGN----QLNEENPHHLHQPYRLPGQQYDKESGLYYNRNRYYD 58 L D G +AWS +Y WG N + P R GQ YD E+GL+YNR+RYYD Sbjct: 1376 ELTDEAGELAWSAQYKAWGAAQEAISNAARKAGIQNPLRFQGQYYDHENGLHYNRHRYYD 1435 Query: 59 PLQGRYITQDPIGLEGGWSLYAYPLNPVNGIDPLGLSPADVALIRRKD 106 P GR++++DPIGL GG +L Y NP++ +DPLGL+ + ++ Sbjct: 1436 PGTGRFVSKDPIGLAGGLNLNQYAPNPISWVDPLGLACSQTRRASLRE 1483 >UniRef50_B1J8X0 YD repeat protein n=50 Tax=Gammaproteobacteria RepID=B1J8X0_PSEPW Length = 1411 Score = 153 bits (387), Expect = 4e-36, Method: Composition-based stats. Identities = 52/153 (33%), Positives = 72/153 (47%), Gaps = 2/153 (1%) Query: 2 LALMDADGNIAWSGEYDEWGNQLNEENPHHLHQPYRLPGQQYDKESGLYYNRNRYYDPLQ 61 L L D++G I W Y WG + Q R GQ +D+E+ L+YN RYYDP Sbjct: 1185 LELTDSEGKIVWQATYRSWGAIEQLTVNE-IDQNLRFQGQYFDRETSLHYNTLRYYDPDV 1243 Query: 62 GRYITQDPIGLEGGWSLYAYPLNPVNGIDPLGLSPADVALIRRKDQLNHQRAWDILS-DT 120 GR+I DPIGL GG +L+ Y +NP+ IDP GL+P V ++ + D Sbjct: 1244 GRFIGPDPIGLRGGVNLFRYNVNPIYWIDPTGLAPCQVRVVNNTKIHGRGQVDGTPGHDQ 1303 Query: 121 YEDMKRLNLGGTDQFFHCMAFCRVSKLNDAGVS 153 + + L + +F S N GVS Sbjct: 1304 FSEAIANKLAMSGRFSDVYLNRSYSFANGRGVS 1336 >UniRef50_P16919 Protein rhsD n=261 Tax=Bacteria RepID=RHSD_ECOLI Length = 1426 Score = 153 bits (387), Expect = 4e-36, Method: Composition-based stats. Identities = 87/220 (39%), Positives = 114/220 (51%), Gaps = 15/220 (6%) Query: 2 LALMDADGNIAWSGEYDEWGNQLNEENPHHLHQPYRLPGQQYDKESGLYYNRNRYYDPLQ 61 LAL+ DGN AWS EYDEWGNQLNEENPHH++QPYRLPGQQ+D+ESGLYYNR+RYYDPLQ Sbjct: 1166 LALISEDGNTAWSAEYDEWGNQLNEENPHHVYQPYRLPGQQHDEESGLYYNRHRYYDPLQ 1225 Query: 62 GRYITQDPIGLEGGWSLYAYPLNPVNGIDPLGLS---------PADVALIRRKDQLNHQR 112 GRYITQDP+GL+GGW+LY YPLNP+ IDP+GL + ++ Sbjct: 1226 GRYITQDPMGLKGGWNLYQYPLNPLQQIDPMGLLQTWDDARSGACTGGVCGVLSRIIGPS 1285 Query: 113 AWDILSDTYEDM--KRLNLGGTDQFFHCMAFCRVSKLNDAGVSRS----AKGLGYEKEIR 166 +D +D D + N + + C+ + K K Sbjct: 1286 KFDSTADAALDALKETQNRSLCNDMEYSGIVCKDTNGKYFASKAETDNLRKESYPLKRKC 1345 Query: 167 DYGLNLFGMYGRKVKLSHSEMIEDNKKDLAVNDHGLTCPS 206 G + Y SH + +++ N + Sbjct: 1346 PTGTDRVAAYHTHGADSHGDYVDEFFSSSDKNLVRSKDNN 1385 >UniRef50_Q6LUC6 Hypothetical nucleotidyltransferase n=1 Tax=Photobacterium profundum RepID=Q6LUC6_PHOPR Length = 1352 Score = 153 bits (387), Expect = 4e-36, Method: Composition-based stats. Identities = 49/128 (38%), Positives = 68/128 (53%), Gaps = 2/128 (1%) Query: 4 LMDADGNIAWSGEYDEWGNQLNEENPHHLHQPYRLPGQQYDKESGLYYNRNRYYDPLQGR 63 + D++G + W Y+ G + +H P R GQ +D+ESGL+YNR RYYDP G+ Sbjct: 1071 VTDSEGEVVWQATYNALGCAAI--SIDIIHNPLRFQGQYHDQESGLHYNRFRYYDPSIGQ 1128 Query: 64 YITQDPIGLEGGWSLYAYPLNPVNGIDPLGLSPADVALIRRKDQLNHQRAWDILSDTYED 123 +I QDPIGL GG + Y Y NP+ +DPLGLS + A + K + D Sbjct: 1129 FIHQDPIGLLGGINHYRYAPNPIQWVDPLGLSCKEAAFEKIKQSFVNHILADFDLCRGGG 1188 Query: 124 MKRLNLGG 131 +N GG Sbjct: 1189 RVPVNFGG 1196 >UniRef50_C1M8X5 Core protein n=2 Tax=Citrobacter RepID=C1M8X5_9ENTR Length = 1359 Score = 153 bits (387), Expect = 4e-36, Method: Composition-based stats. Identities = 81/227 (35%), Positives = 102/227 (44%), Gaps = 16/227 (7%) Query: 2 LALMDADGNIAWSGEYDEWGNQLNEENPHHLHQPYRLPGQQYDKESGLYYNRNRYYDPLQ 61 LAL+ DG I+W EYDEWGN L E+NPH+L Q RLPGQQYD ESGL+YNR+RYY+P Sbjct: 1136 LALIRQDGAISWRAEYDEWGNVLREDNPHNLQQLIRLPGQQYDDESGLHYNRHRYYNPGL 1195 Query: 62 GRYITQDPIGLEGGWSLYAYPLNPVNGIDPLGLSPADVALIRRKDQLNHQRAWDILSDTY 121 GRYITQDPIGL GG + Y YPLNPV +DPLGL + I W Sbjct: 1196 GRYITQDPIGLAGGLNPYQYPLNPVTEVDPLGLWAFAIPAIL------EGINWLFWGSAA 1249 Query: 122 EDMKRLNLGGTDQFFHC----MAFCRVSKLNDAGVSRSAKGLGYEKEIRDYGLNLFGMYG 177 L G + +A C K + S + + L G Sbjct: 1250 AGGTVLATSGDSEQSKTKTEHLAKCEEKKPCPPCKTISGDIVPRGTIGYRHDLVPPGKPH 1309 Query: 178 RKVKLSHSEMIEDNKKDLAVNDHGLTCPSTTDC-SDRCSDYINPEHK 223 H + + N+ N + C +D + P Sbjct: 1310 HPYTGDHYNLYKANQ-----NPNNCQCFWKESGAADAANGLPPPTGS 1351 >UniRef50_B2K1J9 YD repeat protein n=23 Tax=Yersinia RepID=B2K1J9_YERPB Length = 1494 Score = 153 bits (386), Expect = 5e-36, Method: Composition-based stats. Identities = 57/213 (26%), Positives = 88/213 (41%), Gaps = 6/213 (2%) Query: 2 LALMDADGNIAWSGEYDEWGNQLNEENPHHLHQPYRLPGQQYDKESGLYYNRNRYYDPLQ 61 + L D +G + W ++ +G L+ L QP R+ GQ YD ESGL+YNR RYYDP Sbjct: 1281 IRLQDGEGEVVWEAQFTPFGQ-LSVTGTSQLRQPLRMQGQYYDTESGLHYNRYRYYDPAC 1339 Query: 62 GRYITQDPIGLEGGWSLYAYPLNPVNGIDPLGLSPADVALIRRKDQLNHQRAWDILSDTY 121 G +I+QDPIGL+GG + Y + +N + +DPLGL + + + Sbjct: 1340 GVFISQDPIGLKGGLNPYQFAVNTLGWVDPLGLHRNSNNSMGYNELYIVHENDQPGAKIL 1399 Query: 122 EDMKRLN----LGGTDQFFHCMAFCRVSKLNDAGVSRSAKGLGYEKEIRDYGLNLFG-MY 176 + K + GT++ H V+ + LGY + Sbjct: 1400 KIGKAKSEDKMADGTNRRMHNSERAAKKAGYSDAVATPYRDLGYTSTGDAKKVEAKAVKD 1459 Query: 177 GRKVKLSHSEMIEDNKKDLAVNDHGLTCPSTTD 209 R E +K A +D +C + Sbjct: 1460 LRTKGDELPLNKERDKAYRADSDSNASCKNKKK 1492 >UniRef50_A6GLW0 Rhs family protein n=1 Tax=Limnobacter sp. MED105 RepID=A6GLW0_9BURK Length = 1598 Score = 153 bits (386), Expect = 5e-36, Method: Composition-based stats. Identities = 45/135 (33%), Positives = 67/135 (49%), Gaps = 1/135 (0%) Query: 3 ALMDADGNIAWSGEYDEWGNQLNEENPHHLHQPYRLPGQQYDKESGLYYNRNRYYDPLQG 62 + +A + WS + +G + + + + P R GQ +D E+GL+YNR+RYYDP G Sbjct: 1370 EITNASAEVVWSSTFKTYGALVL-AHVNEVENPLRFQGQYFDSETGLHYNRHRYYDPNCG 1428 Query: 63 RYITQDPIGLEGGWSLYAYPLNPVNGIDPLGLSPADVALIRRKDQLNHQRAWDILSDTYE 122 ++ TQDPIGL GG + Y Y NP+ +DP GLS D A + + D+ Sbjct: 1429 QFTTQDPIGLLGGMNTYQYAPNPMTWVDPWGLSCKDQANSIETKRQLERLPDDLAGTFEG 1488 Query: 123 DMKRLNLGGTDQFFH 137 + DQ H Sbjct: 1489 GRYTSRVLEEDQVLH 1503 >UniRef50_C8QGI4 YD repeat protein n=1 Tax=Pantoea sp. At-9b RepID=C8QGI4_9ENTR Length = 465 Score = 152 bits (384), Expect = 8e-36, Method: Composition-based stats. Identities = 56/206 (27%), Positives = 79/206 (38%), Gaps = 8/206 (3%) Query: 2 LALMDADGNIAWSGEYDEWGNQLNEEN--------PHHLHQPYRLPGQQYDKESGLYYNR 53 L + DA G + WSG+Y +G + + HQP R GQ D E+GL+YN Sbjct: 245 LEVTDASGKLRWSGQYGSFGEVTRQTDGVYRRASQTSLSHQPLRYAGQYADAETGLHYNL 304 Query: 54 NRYYDPLQGRYITQDPIGLEGGWSLYAYPLNPVNGIDPLGLSPADVALIRRKDQLNHQRA 113 RYYDP GR+ QDPIGL GGW+LY Y NP+ IDP GLS D L Sbjct: 305 FRYYDPQTGRFTVQDPIGLAGGWNLYQYAPNPLTWIDPTGLSRYPGVDFSGSDALYPDGE 364 Query: 114 WDILSDTYEDMKRLNLGGTDQFFHCMAFCRVSKLNDAGVSRSAKGLGYEKEIRDYGLNLF 173 + + + A ++ + Y+ + L Sbjct: 365 SIVKIQMTGSRYGDFKAANEIAGYANASGNITGKSHPENYTGHHLNDYDPVSNTSTMQLV 424 Query: 174 GMYGRKVKLSHSEMIEDNKKDLAVND 199 + HS + ++ V Sbjct: 425 DTSAHEATFPHSGSVSQFEQHHGVKY 450 >UniRef50_A7FDK7 YD/RHS repeat protein n=25 Tax=Enterobacteriaceae RepID=A7FDK7_YERP3 Length = 1527 Score = 151 bits (382), Expect = 1e-35, Method: Composition-based stats. Identities = 52/186 (27%), Positives = 81/186 (43%), Gaps = 11/186 (5%) Query: 3 ALMDADGNIAWSGEYDEWGN----QLNEENPHHLHQPYRLPGQQYDKESGLYYNRNRYYD 58 L++ G + W+ WG + + R GQ D ESGL+YNR RYYD Sbjct: 1294 ELLNEQGKVVWASRLSTWGQAELWRQAANEEDRVSCNLRFAGQYADAESGLHYNRFRYYD 1353 Query: 59 PLQGRYITQDPIGLEGGWSLYAYPLNPVNGIDPLGLSPADVALIRRKDQLNHQRAWDILS 118 G+Y+ DPIGL GG + Y Y NPV +DPLGL DVA R+ L ++I Sbjct: 1354 GETGQYLCPDPIGLAGGLNPYGYVHNPVKYVDPLGLCKTDVARERQAQMLQDDVGYNISP 1413 Query: 119 DTYEDMKRLNLGGT--DQFFHCMAFCRVSKLNDAGVSRSAKG-----LGYEKEIRDYGLN 171 +++ + G+ + + + + +S+S +G + G N Sbjct: 1414 KSWDQFPSIGRDGSFITDKKGALKYFNGMQTGNVTISKSVAASIEKDMGLSLGSLNGGFN 1473 Query: 172 LFGMYG 177 + + G Sbjct: 1474 IRKIDG 1479 >UniRef50_Q2SKM2 Rhs family protein n=3 Tax=Hahella chejuensis KCTC 2396 RepID=Q2SKM2_HAHCH Length = 1552 Score = 151 bits (381), Expect = 2e-35, Method: Composition-based stats. Identities = 41/96 (42%), Positives = 58/96 (60%), Gaps = 1/96 (1%) Query: 3 ALMDADGNIAWSGEYDEWGNQLNEENPHHLHQPYRLPGQQYDKESGLYYNRNRYYDPLQG 62 + +A+G + WS Y +GN + + + P R GQ YD+E+GL+YNR+RYYDP Sbjct: 1267 EMTNAEGEVVWSARYKAYGNLALK-DVEDVQNPLRFQGQYYDEETGLHYNRHRYYDPSAA 1325 Query: 63 RYITQDPIGLEGGWSLYAYPLNPVNGIDPLGLSPAD 98 R+I QDP+GL GG + Y Y NP DP GL+ + Sbjct: 1326 RFINQDPVGLLGGDNNYQYAPNPTGWGDPFGLTCKE 1361 >UniRef50_Q6LUC4 Putative uncharacterized protein n=1 Tax=Photobacterium profundum RepID=Q6LUC4_PHOPR Length = 532 Score = 150 bits (380), Expect = 2e-35, Method: Composition-based stats. Identities = 48/129 (37%), Positives = 72/129 (55%), Gaps = 3/129 (2%) Query: 4 LMDADGNIAWSGEYDEWGNQLNEENPHHLHQPYRLPGQQYDKESGLYYNRNRYYDPLQGR 63 + ++DG + W Y+ G + +H P R GQ +D+ESGL+YNR RYYDP G+ Sbjct: 273 VTNSDGEVVWQATYNALGCAFI--SIDIIHNPLRFQGQYHDQESGLHYNRFRYYDPSIGQ 330 Query: 64 YITQDPIGLEGGWSLYAYPLNPVNGIDPLGLSPADVALIRRKDQLNHQRAWDILSDTYED 123 +I QDPIGL GG + Y Y NP+ +DPLGLS + +I K + W+ + + Sbjct: 331 FIHQDPIGLLGGINHYRYAPNPIQWVDPLGLSCKE-GIIELKKSYSSLNWWEKIKRGLDI 389 Query: 124 MKRLNLGGT 132 +++GG Sbjct: 390 FDSVDVGGG 398 >UniRef50_C3JXH8 Putative Rhs protein n=1 Tax=Pseudomonas fluorescens SBW25 RepID=C3JXH8_PSEFS Length = 1597 Score = 150 bits (380), Expect = 3e-35, Method: Composition-based stats. Identities = 46/120 (38%), Positives = 58/120 (48%), Gaps = 1/120 (0%) Query: 3 ALMDADGNIAWSGEYDEWGNQLNEENPHHLHQPYRLPGQQYDKESGLYYNRNRYYDPLQG 62 L ADG I WS Y +G + + P R GQ +D+ESGL+YNR+RYY P G Sbjct: 1341 ELTAADGEIVWSAHYRAYGEITRL-DIGKIDNPLRFQGQYFDQESGLHYNRHRYYHPDIG 1399 Query: 63 RYITQDPIGLEGGWSLYAYPLNPVNGIDPLGLSPADVALIRRKDQLNHQRAWDILSDTYE 122 RY+T DP+ L GG + Y Y NP +DPLGLS + D E Sbjct: 1400 RYLTPDPVKLTGGINAYQYVPNPTGWVDPLGLSSCPGRDGCKPKTEAALTTETTSVDKGE 1459 >UniRef50_Q83LZ1 Putative Rhs-family protein n=1 Tax=Shigella flexneri RepID=Q83LZ1_SHIFL Length = 211 Score = 150 bits (379), Expect = 3e-35, Method: Composition-based stats. Identities = 57/204 (27%), Positives = 88/204 (43%), Gaps = 1/204 (0%) Query: 4 LMDADGNIAWSGEYDEWGNQLNEENPHHLHQPYRLPGQQYDKESGLYYNRNRYYDPLQGR 63 + D+DG I W Y WGN + E++ + Q R GQ D+E+GL+YN +RYYDP GR Sbjct: 1 MTDSDGKIVWETGYQVWGNTIQEKDHGGVEQNLRYQGQYLDRETGLHYNLHRYYDPDVGR 60 Query: 64 YITQDPIGLEGGWSLYAYPLNPVNGIDPLGLSPADVALIRRKDQLNHQRAW-DILSDTYE 122 ++ DPIGL GG +LY+Y NP+ DPLGL+P V+ + L+ + S + Sbjct: 61 FMVTDPIGLRGGLNLYSYAPNPLKYADPLGLTPCAVSNQKANRLLDSSETKVTVRSRSDA 120 Query: 123 DMKRLNLGGTDQFFHCMAFCRVSKLNDAGVSRSAKGLGYEKEIRDYGLNLFGMYGRKVKL 182 + ++ + + S N K D +V Sbjct: 121 EQLFMDRYLGHNYKNMTGESGPSTKNLMEYLTENKTKAGSYHWDDIKDPSVTKPSYRVSG 180 Query: 183 SHSEMIEDNKKDLAVNDHGLTCPS 206 + + L V+ HG + Sbjct: 181 HGPGNPDGDLPHLQVHQHGGSVRH 204 >UniRef50_Q7NY44 Probable Rhs-family protein n=2 Tax=Chromobacterium violaceum RepID=Q7NY44_CHRVO Length = 1513 Score = 150 bits (379), Expect = 3e-35, Method: Composition-based stats. Identities = 46/114 (40%), Positives = 64/114 (56%), Gaps = 4/114 (3%) Query: 3 ALMDADGNIAWSGEYDEWGNQLN----EENPHHLHQPYRLPGQQYDKESGLYYNRNRYYD 58 AL D G +A +Y WG + P+R GQ +D ESGL+YNR+RYYD Sbjct: 1297 ALTDEHGALALEMDYQAWGQAREVIADAAGKAGIRNPFRFQGQYHDDESGLHYNRHRYYD 1356 Query: 59 PLQGRYITQDPIGLEGGWSLYAYPLNPVNGIDPLGLSPADVALIRRKDQLNHQR 112 P GR+I++DPIGL+GG ++Y Y LNP+ +DPLGL+ + Q+ Sbjct: 1357 PEIGRFISRDPIGLKGGINIYGYALNPIVWMDPLGLTGKQFTGTVYRALTPKQK 1410 >UniRef50_UPI00016A9A82 Rhs family protein n=2 Tax=Burkholderia oklahomensis RepID=UPI00016A9A82 Length = 1489 Score = 150 bits (379), Expect = 4e-35, Method: Composition-based stats. Identities = 55/192 (28%), Positives = 71/192 (36%), Gaps = 16/192 (8%) Query: 3 ALMDADGNIAWSGEYDEWGN-QLNEENPHHLHQP-----------YRLPGQQYDKESGLY 50 L D G + W Y WGN L E P P RL GQ D E+G Sbjct: 1263 ELTDGGGRVVWRTRYRAWGNTVLQEYAPEFQANPAGDVMQPLPQALRLQGQYEDLETGFC 1322 Query: 51 YNRNRYYDPLQGRYITQDPIGLEGGWSLYAYPLNPVNGIDPLGLSPADVALIRRKDQLNH 110 Y+ RYYDP GR+IT DPIGL GG + Y Y NP+ IDP G + + Sbjct: 1323 YSTFRYYDPDVGRFITPDPIGLAGGLNQYQYAPNPLTWIDPWGWVETPLDAPGYSTYGLY 1382 Query: 111 QRAWDILSDTYEDMKRLNLGGTDQFFHCMAFCRVSKLNDAGVS----RSAKGLGYEKEIR 166 + LN D A +++ G ++ GYE+ R Sbjct: 1383 HPGASEPYYVGHTGQSLNDRLADHIDTKRAVRGQTEVRPLGGPEGTLTYSQAKGYEQAYR 1442 Query: 167 DYGLNLFGMYGR 178 + G G Sbjct: 1443 EKYKTKTGFPGN 1454 >UniRef50_B3PEK8 RHS Repeat family n=1 Tax=Cellvibrio japonicus Ueda107 RepID=B3PEK8_CELJU Length = 2245 Score = 150 bits (378), Expect = 4e-35, Method: Composition-based stats. Identities = 57/212 (26%), Positives = 88/212 (41%), Gaps = 21/212 (9%) Query: 1 MLALMDADGNIAWS--GEYDEWGNQLNEENPHHLHQPYRLPGQQYDKESGLYYNRNRYYD 58 ++A+ +A G + + Y +G P+R G++ D E+GLYY R RYYD Sbjct: 1997 VIAVSNASGQVTSNNIYTYSPYGEV-----NSAAGFPFRYTGRRLDPETGLYYYRARYYD 2051 Query: 59 PLQGRYITQDPIGLEGGWSLYAYPLN-PVNGIDPLGLSPADVALIRRKDQLNHQRAWDIL 117 P GR++ DPIG + +LYAY N P+N IDP G + L A ++ Sbjct: 2052 PGLGRFLQTDPIGYKDQMNLYAYVGNDPLNKIDPTGKNAKKPFLKE--------AAKELR 2103 Query: 118 SDTYEDMKRLNLGGTDQFFHCMAFCRVSKLNDAGVSRSAKGLGYEKEIRDYGLNLFGMYG 177 + +K G + +L + G S+S + E+ + G L M G Sbjct: 2104 KEAKRQIKNAKAQGRREALKQ----ERQQLKETGESKSNLSDDRKTELLETGK-LKNMDG 2158 Query: 178 RKVKLSHSEMIEDNKKDLAVNDHGLTCPSTTD 209 S + K D+A N +T D Sbjct: 2159 HHEPSVSSGKTLEEKIDIAKNPDNITFMEKAD 2190 >UniRef50_B1JCT8 RHS protein n=1 Tax=Pseudomonas putida W619 RepID=B1JCT8_PSEPW Length = 231 Score = 150 bits (378), Expect = 4e-35, Method: Composition-based stats. Identities = 43/105 (40%), Positives = 59/105 (56%), Gaps = 1/105 (0%) Query: 2 LALMDADGNIAWSGEYDEWGNQLNEENPHHLHQPYRLPGQQYDKESGLYYNRNRYYDPLQ 61 + L +++G I W Y WG + E H + Q R GQ ++ E+GL+YN RYYDP Sbjct: 1 MELSNSEGEIVWQATYRSWGA-IEELKVHDIEQNLRFQGQYFESETGLHYNTLRYYDPEV 59 Query: 62 GRYITQDPIGLEGGWSLYAYPLNPVNGIDPLGLSPADVALIRRKD 106 GR++TQDPIGL G + Y Y +PV +DP GL+ V D Sbjct: 60 GRFVTQDPIGLGDGMNFYQYAPSPVMWVDPWGLAFKSVNFEGSPD 104 >UniRef50_Q147B5 Rhs family protein n=2 Tax=Betaproteobacteria RepID=Q147B5_BURXL Length = 1362 Score = 150 bits (378), Expect = 5e-35, Method: Composition-based stats. Identities = 48/99 (48%), Positives = 57/99 (57%), Gaps = 8/99 (8%) Query: 5 MDADGNIAWSGEYDEWGNQLNEENPH--------HLHQPYRLPGQQYDKESGLYYNRNRY 56 D G W Y WG L H HQP R GQ +D+E+GL+YNR+RY Sbjct: 1141 TDDAGRTQWRARYAAWGRLLGANGGHEQMHESGRQAHQPLRFQGQYFDEETGLHYNRHRY 1200 Query: 57 YDPLQGRYITQDPIGLEGGWSLYAYPLNPVNGIDPLGLS 95 YDP GR++TQDPIGL GG +LY Y NP IDPLGL+ Sbjct: 1201 YDPDAGRFMTQDPIGLRGGINLYRYAPNPGRWIDPLGLA 1239 >UniRef50_A1TTP6 YD repeat protein n=2 Tax=Acidovorax citrulli AAC00-1 RepID=A1TTP6_ACIAC Length = 1554 Score = 150 bits (378), Expect = 5e-35, Method: Composition-based stats. Identities = 57/215 (26%), Positives = 84/215 (39%), Gaps = 41/215 (19%) Query: 2 LALMDADGNIAWSGEYDEWGNQLNEENPHH------------------------------ 31 L L DA G IAW+ +Y WG P Sbjct: 1296 LELTDAQGYIAWAADYKVWGEATLRAVPRTATGTDGVSGERRRGHGPVMDVHEGGGEKAR 1355 Query: 32 ------LHQPYRLPGQQYDKESGLYYNRNRYYDPLQGRYITQDPIGLEGGWSLYAYPLNP 85 + QP+R GQQ+D+E+GL+YNR RYY+P GR+++QDPIGL GG + + Y +P Sbjct: 1356 PTPPAIIEQPFRFQGQQFDEETGLHYNRFRYYEPSVGRFVSQDPIGLLGGVNSFTYAPSP 1415 Query: 86 VNGIDPLGLSPADVALIRRKDQLNHQRAWDILSDTYEDMKRLNLGGTDQFFHCMAFCRVS 145 N +DP GLS A D Y N + + Sbjct: 1416 NNWMDPFGLSCCPCPTGTATIHHYEGTA-DNPFGHYSIEVTANGTSLHTHQGVFQDGKQT 1474 Query: 146 KL----NDAGVSRSAKGLGYEKEIRDYGLNLFGMY 176 + ++G +++ + K +DY + G Y Sbjct: 1475 AILSNRGNSGTTQATVAIPDAKAAQDYQRKMMGQY 1509 >UniRef50_B7H4M5 Uncharacterized protein ybfO n=3 Tax=Acinetobacter baumannii RepID=B7H4M5_ACIB3 Length = 229 Score = 149 bits (377), Expect = 6e-35, Method: Composition-based stats. Identities = 46/171 (26%), Positives = 72/171 (42%), Gaps = 14/171 (8%) Query: 4 LMDADGNIAWSGEYDEWGNQLNE-------ENPHHLHQPYRLPGQQYDKESGLYYNRNRY 56 + D G + W +Y WG E EN + R GQ +D E+GL+YNR Y Sbjct: 1 MTDHTGVVIWKAQYKAWGECKVEQAKSDFFENSEIISNNIRFQGQYFDGETGLHYNRYCY 60 Query: 57 YDPLQGRYITQDPIGLEGGWSLYAYPLNPVNGIDPLGLSPADVALIRRKDQLNHQ----- 111 Y P GR+I++DPIGL GG ++YAY NPV +D LGL+ +++ + Sbjct: 61 YSPYVGRFISKDPIGLLGGSNIYAYAPNPVGWVDQLGLAKTPTRTLQKNWKNYVGCKHTN 120 Query: 112 --RAWDILSDTYEDMKRLNLGGTDQFFHCMAFCRVSKLNDAGVSRSAKGLG 160 + E K + + + + G+ ++ G Sbjct: 121 LDIHHGFPEEYAERFKNIAGIDVNNPQYYYNLPKEKHTKSPGIHTNSSRTG 171 >UniRef50_D0KES6 YD repeat protein n=15 Tax=Gammaproteobacteria RepID=D0KES6_PECWW Length = 1379 Score = 149 bits (376), Expect = 7e-35, Method: Composition-based stats. Identities = 55/205 (26%), Positives = 81/205 (39%), Gaps = 11/205 (5%) Query: 2 LALMDADGNIAWSGEYDEWGNQLNEENPHH--------LHQPYRLPGQQYDKESGLYYNR 53 L + DA+G + WSG+Y +G + Q R GQ D+E+GL+YN Sbjct: 1175 LEMTDAEGAVRWSGDYGSFGAINGQTQDSEGLRHGKPVESQSLRYAGQYADEETGLHYNL 1234 Query: 54 NRYYDPLQGRYITQDPIGLEGGWSLYAYPLNPVNGIDPLGLSP--ADVALIRRKDQLNHQ 111 RYYDP GR+ TQDPIGL GG +LYAY NP+ +DPLGL+ D + R + Sbjct: 1235 FRYYDPTVGRFTTQDPIGLAGGLNLYAYAPNPLGWVDPLGLAKILTDGVVYRMGSGTDSN 1294 Query: 112 RAWDILSDTYEDMKRLNLGGTDQFFHCMAFCRVSKLNDAGVSRSAKGLGYEKEIRDYGLN 171 DT + ++ + + V + K + D ++ Sbjct: 1295 LTPRPGKDTTTGLSTTIEKPSNG-KYQTLDVKTLNDGGLDVVKDGKNHASVRPSNDPDMS 1353 Query: 172 LFGMYGRKVKLSHSEMIEDNKKDLA 196 + S K Sbjct: 1354 RLREWADTRGTEKSSSYTKTVKKSC 1378 >UniRef50_C1M4X0 Predicted protein n=1 Tax=Citrobacter sp. 30_2 RepID=C1M4X0_9ENTR Length = 1494 Score = 149 bits (376), Expect = 7e-35, Method: Composition-based stats. Identities = 51/200 (25%), Positives = 75/200 (37%), Gaps = 12/200 (6%) Query: 3 ALMDADGNIAWSGEYDEWGNQLNEENPH-----------HLHQPYRLPGQQYDKESGLYY 51 AL D DG + W + + WG +E + R GQ D+E+GL+Y Sbjct: 1270 ALTDEDGKLHWRQDVETWGETRSEYADEEGGRWRKIWGGAPEENLRFAGQYLDRETGLHY 1329 Query: 52 NRNRYYDPLQGRYITQDPIGLEGGWSLYAYPLNPVNGIDPLGLSPADVALIRRKDQLNHQ 111 N RYY P GR+IT DPIGL GG +LY+Y NP++ IDPLGL + D N Sbjct: 1330 NTFRYYAPDMGRFITPDPIGLAGGINLYSYAPNPLSWIDPLGLLKCGLTGNEVGDASNLP 1389 Query: 112 RAWDILSDTYEDMKRLNLGGTDQFFHCMAFCRVSKLNDAGVSRSAKGLGYEKEIRDYGLN 171 + ++ + R + + + N Sbjct: 1390 -VIKPGTKEWKLAVDTMRNNGSGKPNFRVADRNAAEKLLKDANPKIPEYGQYTGSKNYRN 1448 Query: 172 LFGMYGRKVKLSHSEMIEDN 191 G + + E+N Sbjct: 1449 KAGYEHHPNESHTANAPENN 1468 >UniRef50_UPI0001C341D4 protein RhsA n=1 Tax=Citrobacter youngae ATCC 29220 RepID=UPI0001C341D4 Length = 1365 Score = 149 bits (376), Expect = 8e-35, Method: Composition-based stats. Identities = 76/151 (50%), Positives = 89/151 (58%), Gaps = 1/151 (0%) Query: 2 LALMDADGNIAWSGEYDEWGNQLNEENPHHLHQPYRLPGQQYDKESGLYYNRNRYYDPLQ 61 LAL+ DG I+W EYDEWGN L E+NPH+L Q RLPGQQYD ESGL+YNR+RYY+P Sbjct: 1135 LALISQDGAISWRAEYDEWGNVLREDNPHNLQQLIRLPGQQYDDESGLHYNRHRYYNPGL 1194 Query: 62 GRYITQDPIGLEGGWSLYAYPLNPVNGIDPLGLSPADVALIRRKDQLNHQRAWDILSDTY 121 GRYITQDPIGL+GGW+LY YPLNPV IDP GL V D + TY Sbjct: 1195 GRYITQDPIGLKGGWNLYKYPLNPVEYIDPSGLDVRLVNTDAVGGWHRKVEV-DEGTGTY 1253 Query: 122 EDMKRLNLGGTDQFFHCMAFCRVSKLNDAGV 152 + D + + V K D V Sbjct: 1254 GISFGVYTADPDIWGNVSGTPDVGKAGDGMV 1284 >UniRef50_UPI0001BC4026 Rhs family protein n=6 Tax=Neisseria RepID=UPI0001BC4026 Length = 1477 Score = 149 bits (375), Expect = 9e-35, Method: Composition-based stats. Identities = 59/202 (29%), Positives = 85/202 (42%), Gaps = 6/202 (2%) Query: 3 ALMDADGNIAWSGEYDEWGNQLNEEN-PHHLHQPYRLPGQQYDKESGLYYNRNRYYDPLQ 61 + D DG + W G+YD WG + E N HQP+RL Q +D+E+GL+YN RYYDP Sbjct: 1158 EMTDEDGKLLWFGKYDVWGKLVKETNITGSAHQPFRLQNQYFDRETGLHYNFFRYYDPDI 1217 Query: 62 GRYITQDPIGLEGGWSLYAYPLNPVNGIDPLGLSPADVALIRRKDQLNHQRAWDILSDTY 121 GR++ QDPIGL+GG +LY + N DPLGL P D+ D+ ++ W + Sbjct: 1218 GRFVNQDPIGLDGGENLYGFAPNAAVWSDPLGLEPMDIGKYIP-DEYRPKQGWKPPFEMP 1276 Query: 122 EDMKRLNLGGTDQFFHCMAFCRVSKLNDAGVSRSAKGLGYEKEIRDYGLNLFGMY----G 177 L + L+ A A K + G Sbjct: 1277 SYEDVDALSQKYLRKSLNEMLEDALLSMASGGMRAPKAKIPKMAPKPQTKMPQKPNCGVG 1336 Query: 178 RKVKLSHSEMIEDNKKDLAVND 199 + ++ + K ND Sbjct: 1337 KCAGKRDAQGKYRDSKGRYAND 1358 >UniRef50_Q2SGE8 Rhs family protein n=1 Tax=Hahella chejuensis KCTC 2396 RepID=Q2SGE8_HAHCH Length = 1452 Score = 149 bits (375), Expect = 9e-35, Method: Composition-based stats. Identities = 55/199 (27%), Positives = 87/199 (43%), Gaps = 5/199 (2%) Query: 3 ALMDADGNIAWSGEYDEWGNQLNEENPHHLHQPYRLPGQQYDKESGLYYNRNRYYDPLQG 62 A+ D++G WS WG N + P+R PGQ D+E+GLYYNR RYYDP G Sbjct: 1180 AMYDSEGRQVWSANISVWGELRNLKGNRGA-CPFRWPGQYEDEETGLYYNRFRYYDPDSG 1238 Query: 63 RYITQDPIGLEGGWSLYAYPLNPVNGIDPLGLSPADVALIRRKDQLNHQRAWDILSDTYE 122 +YI QDPI ++GG +LY Y + ID LGL + R ++ A + Sbjct: 1239 QYIRQDPIRIKGGLNLYKYVSDVTTWIDTLGLQGGSASYGRSGRRI--GHADTDGLEKIG 1296 Query: 123 DMKRLNLGGTDQFFHCMAFCRVSKLNDAGVSRSAKGLGYEKEIRDYGLNLFGMYGRKVKL 182 ++ + GG D+ + + L + + + K++ + G +G + Sbjct: 1297 ELNAVKAGGDDRLPSILYEGDRTSLFHYTSEENLENILRTKKLFNSKGFEHGRHGDGQYM 1356 Query: 183 SHSEMIEDNKKDLAVNDHG 201 + D +A D Sbjct: 1357 ADISP--DEIVAMAKGDLT 1373 >UniRef50_B7LTT0 Putative uncharacterized protein n=4 Tax=Enterobacteriaceae RepID=B7LTT0_ESCF3 Length = 1543 Score = 149 bits (375), Expect = 9e-35, Method: Composition-based stats. Identities = 60/206 (29%), Positives = 83/206 (40%), Gaps = 10/206 (4%) Query: 4 LMDADGNIAWSGEYDEWGNQLNEENPH----HLHQPYRLPGQQYDKESGLYYNRNRYYDP 59 + D+DG I W WGN EEN H Q R GQ D+E+GL+YN RY+ P Sbjct: 1308 MTDSDGGIVWRARVQLWGNIRFEENRDIYSVHPQQNLRFAGQYLDRETGLHYNTFRYFLP 1367 Query: 60 LQGRYITQDPIGLEGGWSLYAYPLNPVNGIDPLGLSPADVALIRRKDQLNHQRAWDILSD 119 GR+ DPIGL GG +LYAY NP++ IDPLGL + R + N Q L + Sbjct: 1368 ESGRFSQPDPIGLAGGLNLYAYAPNPLSYIDPLGLCKLRGSDGRYRSAQNIQEEIGGLPN 1427 Query: 120 TY-----EDMKRLNLGGTDQFFHCMAFCRVSKLNDAGVSRSAKGLGYEKEIRDYGLNLFG 174 E + L G +K G + + + ++ R Sbjct: 1428 FAGKTPGEIRQTLRNRGYSSVQAHSGGEVWTKALPDGNTVAVRLDPAKQRTRPKSFADEV 1487 Query: 175 MYGRKVKLSHSEMIEDNKKDLAVNDH 200 + L S ++ N N Sbjct: 1488 PHAHIESLPTSGVVNGNYGG-KNNPV 1512 >UniRef50_D1PV96 YD repeat protein n=1 Tax=Prevotella bergensis DSM 17361 RepID=D1PV96_9BACT Length = 394 Score = 149 bits (375), Expect = 1e-34, Method: Composition-based stats. Identities = 54/258 (20%), Positives = 94/258 (36%), Gaps = 40/258 (15%) Query: 4 LMDADGNIAWSGEYDEWGNQLNEEN--------------PHHLHQPYRLPGQQYDKESGL 49 + + DG + EY +G +E + + PY ++ D+E+G+ Sbjct: 43 ITNLDGEVVQHIEYVPYGEVFIDELELTRKSATAGKANCNNTWNTPYLFNAKELDEETGM 102 Query: 50 YYNRNRYYDPLQGRYITQDPIGL-EGGWSLYAY-PLNPVNGIDPLGLSPADVALIRRKDQ 107 YY RYY+P +++ DP+G ++Y Y NP IDP G + + Sbjct: 103 YYYGARYYEPRLSLWVSTDPLGETAPHITVYCYTANNPTILIDPDGKAWKPT--KNEETG 160 Query: 108 LNHQRAWDILSDTYEDMKRLNLGGTDQFFHCMAFCRVSKLNDA----------------- 150 N W + +Y+ +L G +Q K ++ Sbjct: 161 QNTGYEWINPAKSYDSKGKLLPGLYEQAIFFSNQGNNGKTFNSKNRFNMGSSIATVYCKD 220 Query: 151 GVSRSAKGLGYEKEIRDYGLNLFGMYGRKVKLSHSEMIEDNKKDLAVNDHGLTCPSTTDC 210 G + + Y + Y GMY KV + + + K L ++D G T +++ Sbjct: 221 GTTSEFEACTYPSNLDKYATVPEGMYEAKVGMHNGSSAQ--YKALRMSDIGTTDFNSSSI 278 Query: 211 SDRCSDYINPEHKKTIKA 228 NP + KT KA Sbjct: 279 E---LGKPNPSNSKTTKA 293 >UniRef50_A7FDJ9 RHS/YD repeat protein n=30 Tax=Enterobacteriaceae RepID=A7FDJ9_YERP3 Length = 1418 Score = 148 bits (374), Expect = 1e-34, Method: Composition-based stats. Identities = 48/114 (42%), Positives = 58/114 (50%), Gaps = 7/114 (6%) Query: 2 LALMDADGNIAWSGEYDEWGNQLNEENPHH-------LHQPYRLPGQQYDKESGLYYNRN 54 L + D +G WSG+Y WG + QP R PGQ D E+GL+YN Sbjct: 1192 LDVTDGEGKHRWSGKYHAWGKVTRQNVSDPRQSTVSRFAQPLRYPGQYSDDETGLHYNTF 1251 Query: 55 RYYDPLQGRYITQDPIGLEGGWSLYAYPLNPVNGIDPLGLSPADVALIRRKDQL 108 RYYDP GR+ TQDPIGL GG +LY Y NP+ +DPLG P L Sbjct: 1252 RYYDPEIGRFSTQDPIGLAGGINLYQYGPNPLGWVDPLGWMPWAWNPNGMGHHL 1305 >UniRef50_O52663 Core protein (Fragment) n=5 Tax=Enterobacteriaceae RepID=O52663_ECOLX Length = 350 Score = 148 bits (374), Expect = 1e-34, Method: Composition-based stats. Identities = 77/93 (82%), Positives = 85/93 (91%) Query: 2 LALMDADGNIAWSGEYDEWGNQLNEENPHHLHQPYRLPGQQYDKESGLYYNRNRYYDPLQ 61 LAL+ DGN AW GEYDEWGNQLNEENP++LHQPYRLPGQQ+D+ESGLYYNRNRYYDPLQ Sbjct: 114 LALISEDGNTAWRGEYDEWGNQLNEENPYYLHQPYRLPGQQHDEESGLYYNRNRYYDPLQ 173 Query: 62 GRYITQDPIGLEGGWSLYAYPLNPVNGIDPLGL 94 GRYITQDPIGL GGW+LY YPLNP+ +DPLGL Sbjct: 174 GRYITQDPIGLAGGWNLYNYPLNPIIRMDPLGL 206 >UniRef50_D1PUV1 RHS family protein n=1 Tax=Prevotella bergensis DSM 17361 RepID=D1PUV1_9BACT Length = 284 Score = 148 bits (373), Expect = 2e-34, Method: Composition-based stats. Identities = 56/225 (24%), Positives = 84/225 (37%), Gaps = 31/225 (13%) Query: 6 DADGNIAWSGEYDEWGNQLNEEN------PHHLHQPYRLPGQQYDKESGLYYNRNRYYDP 59 + G W D G + E + ++ P+ GQ YD+E L YNR RYYDP Sbjct: 50 NEQGEEVWYRRLDMNGKVIEERSMNYTSYKDYVKIPFLFQGQYYDEEVKLAYNRFRYYDP 109 Query: 60 LQGRYITQDPIGLEGGWSLYAYPLNPVNGIDPLGLSPADVALIRRKDQLNHQRAWDILSD 119 GRYI++DP+ L GG +LY Y N + DPLGLS A + + Q I Sbjct: 110 ELGRYISEDPVRLLGGSNLYRYVENTILWCDPLGLSSAKLNKALGGSKKGMQAHHLIPEK 169 Query: 120 TYEDMKRLNLGGTDQFFHCMAFCRVSKLNDAGVSRSAKGLGYEKEIRDYGLNLFGMYGRK 179 + +QF + + ++ G+ + Y Y R Sbjct: 170 VWSQ--------NEQFLNQIGLSGQCDVSSNGLHMYNSKELAIANGKAY-------YHRG 214 Query: 180 VKLSHSEMIEDNKKDLAVNDHGLTCPSTTDCSDRCSDYINPEHKK 224 S+S I + H SD + ++PE + Sbjct: 215 RHDSYSHAINRRISAIENQYH----------SDLQAGILSPEEAR 249 >UniRef50_Q39K64 Rhs family protein n=22 Tax=Burkholderia RepID=Q39K64_BURS3 Length = 1560 Score = 148 bits (373), Expect = 2e-34, Method: Composition-based stats. Identities = 49/185 (26%), Positives = 78/185 (42%), Gaps = 18/185 (9%) Query: 4 LMDADGNIAWSGEYDEWGN------QLNEENPHHLHQPYRLPGQQYDKESGLYYNRNRYY 57 L D +G++ W Y WG + ++ R GQQ D E+GL+YNR+RYY Sbjct: 1332 LTDDEGDVVWEASYKAWGEAREVIARASKVAGIVPRSSLRFQGQQVDDETGLHYNRHRYY 1391 Query: 58 DPLQGRYITQDPIGLEGGWSLYAYPLNPVNGIDPLGL--------SPADVALIRRKDQLN 109 DP GR++++DPIGL GG ++Y Y NPV +DPLGL + + K + Sbjct: 1392 DPRSGRFVSKDPIGLAGGINVYQYAPNPVKWVDPLGLSKSEGGCSCSCGINPVTGKPRGT 1451 Query: 110 HQRAWDILSDTYEDMKRLNLG----GTDQFFHCMAFCRVSKLNDAGVSRSAKGLGYEKEI 165 Q+ + + + +N G D+ A + G+ Sbjct: 1452 LQKLAQKIREAGKFKPAVNQRTIAVGQDESGGLFAGSSNGFDAGQREMADSLGIKRVPTS 1511 Query: 166 RDYGL 170 ++ Sbjct: 1512 KNKHA 1516 >UniRef50_UPI00019F17FA RhsC core protein with extension n=1 Tax=Citrobacter youngae ATCC 29220 RepID=UPI00019F17FA Length = 274 Score = 148 bits (373), Expect = 2e-34, Method: Composition-based stats. Identities = 82/209 (39%), Positives = 106/209 (50%), Gaps = 21/209 (10%) Query: 2 LALMDADGNIAWSGEYDEWGNQLNEENPHHLHQPYRLPGQQYDKESGLYYNRNRYYDPLQ 61 LAL++ DG I+W EYDEWGN L E+NPH+L + RLPGQQ D+ESGLYYNR+RY P Q Sbjct: 27 LALINQDGAISWRAEYDEWGNVLREDNPHNLQRLIRLPGQQCDEESGLYYNRHRYDSPGQ 86 Query: 62 GRYITQDPIGLEGGWSLYAYPLNPVNGIDPLGLSPADVALIRRKDQLNHQRAWDILSDTY 121 RY+T DPIGLEGG + Y YP NP+ IDPLGL P + +N A + Sbjct: 87 DRYLTLDPIGLEGGLNPYTYPRNPIRKIDPLGLQPW--------NSINMGSATTERASLG 138 Query: 122 EDMKRLNLGGTDQFFHCMA-------FCRVSKLNDAGVSRSAKGLGYEKEIRDYGLNLFG 174 M + N TDQ +A R +++ S Y ++ G L G Sbjct: 139 LWMAQ-NGASTDQMVKALAPLPPPSSISRECRVSGIAALGSGLSGSYSYNEKNGGNVLIG 197 Query: 175 MYGRKVKLSHS-----EMIEDNKKDLAVN 198 + L S + + KDL N Sbjct: 198 APLAAIGLRGSLTCGLKFRSSDAKDLKTN 226 >UniRef50_P32109 Putative uncharacterized protein yibJ n=14 Tax=Bacteria RepID=YIBJ_ECOLI Length = 233 Score = 147 bits (372), Expect = 2e-34, Method: Composition-based stats. Identities = 68/93 (73%), Positives = 77/93 (82%) Query: 2 LALMDADGNIAWSGEYDEWGNQLNEENPHHLHQPYRLPGQQYDKESGLYYNRNRYYDPLQ 61 LAL+ +G AW EYDEWGN L++ENPHHL Q RLPGQQYD+ESGLYYNR+RYYDPL Sbjct: 60 LALISTEGATAWCAEYDEWGNLLSDENPHHLQQLIRLPGQQYDEESGLYYNRHRYYDPLL 119 Query: 62 GRYITQDPIGLEGGWSLYAYPLNPVNGIDPLGL 94 GRYITQDPIGL+GGW+ Y YPLNPV +DP GL Sbjct: 120 GRYITQDPIGLKGGWNFYQYPLNPVINVDPQGL 152 >UniRef50_UPI0001AF2680 RHS/YD repeat-containing protein n=1 Tax=Streptomyces roseosporus NRRL 11379 RepID=UPI0001AF2680 Length = 1592 Score = 147 bits (371), Expect = 3e-34, Method: Composition-based stats. Identities = 55/224 (24%), Positives = 83/224 (37%), Gaps = 27/224 (12%) Query: 2 LALMDADGNIAWSGEYDEWGNQLNEENPHHLHQPYRLPGQQYDKESGLYYNRNRYYDPLQ 61 L+D G+IAW WG + P R PGQ D E+GL+YN R+YDP Sbjct: 1362 TELVDPSGDIAWRSRTTLWGATAW-PRTSTAYTPLRFPGQYDDPETGLHYNFFRHYDPDA 1420 Query: 62 GRYITQDPIGLEGGWSLYAYPLNPVNGIDPLGLSPADVA---------LIRRKDQLNHQR 112 RY++ DP+GL G + AY NP DPLGL P +I R + ++ Sbjct: 1421 ARYVSPDPLGLAAGPNPVAYVDNPFTWCDPLGLMPKCPRERAQKVADQVIERAQEGKMRK 1480 Query: 113 AWDILSDTYEDMKRLNL---------------GGTDQFFHCMAFCRVSKLNDAGVSRSAK 157 + + +DT + + F ++K AG Sbjct: 1481 SSNYHADTRHQFSDERVLEILKNPDAVYQSQGQRGNLTFRQGEDVVITKGPGAGGGDVIT 1540 Query: 158 GLGYEKEIRDYGLNLFGMYGR--KVKLSHSEMIEDNKKDLAVND 199 G + G FG + ++H +++ N D Sbjct: 1541 AYGPSGTRGESGAGAFGGSPDDPGLPVTHDDIVNGNIPDSRGGT 1584 >UniRef50_A8A655 Rhs family protein n=14 Tax=Enterobacteriaceae RepID=A8A655_ECOHS Length = 314 Score = 147 bits (370), Expect = 4e-34, Method: Composition-based stats. Identities = 79/187 (42%), Positives = 100/187 (53%), Gaps = 15/187 (8%) Query: 2 LALMDADGNIAWSGEYDEWGNQLNEENPHHLHQPYRLPGQQYDKESGLYYNRNRYYDPLQ 61 LAL+ +G AW EYDEWGN L++ENPHHL Q RLPGQQYD+ESGLYYNR+RYYDPLQ Sbjct: 60 LALISTEGATAWCAEYDEWGNLLSDENPHHLQQLIRLPGQQYDEESGLYYNRHRYYDPLQ 119 Query: 62 GRYITQDPIGLEGGWSLYAYPLNPVNGIDPLGL--------------SPADVALIRRKDQ 107 GRYITQDPIGL+GGW+ Y YPLNPV +DP GL ++ + Sbjct: 120 GRYITQDPIGLKGGWNFYQYPLNPVINVDPQGLVDINLYPESDLIHSVADEINIPGVFTI 179 Query: 108 LNHQRAWDILSDTYEDMKRLNLGGTDQFFHCMAFCRVSKLNDAGVSRSAKGLGYEKEIRD 167 H I S T M +L +F L + + + + ++ Sbjct: 180 GGHGTPTSIESATRSIMTAKDLAYLIKFDGNYKDGMTVWLF-SCNTGKGQNSFASQLAKE 238 Query: 168 YGLNLFG 174 N+ G Sbjct: 239 LHTNVIG 245 >UniRef50_A7FN18 RHS/YD repeat protein n=5 Tax=cellular organisms RepID=A7FN18_YERP3 Length = 1419 Score = 146 bits (369), Expect = 5e-34, Method: Composition-based stats. Identities = 45/137 (32%), Positives = 61/137 (44%), Gaps = 10/137 (7%) Query: 3 ALMDADGNIAWSGEYDEWGNQLNEENP---------HHLHQPYRLPGQQYDKESGLYYNR 53 + +A G + WSG+Y +G + + QP R GQ D E+GL+Y Sbjct: 1193 EVTNAQGEMVWSGQYGVFGQVTRQTDAMWRNVSKPLGQFRQPLRYAGQYLDDETGLHYTT 1252 Query: 54 NRYYDPLQGRYITQDPIGLEGGWSLYAYPLNPVNGIDPLGLSPADVALIRRKDQLNHQRA 113 RYY P GR+IT DPIGL GG +LY Y NP+ IDP GL+ + Q Sbjct: 1253 YRYYAPEVGRFITPDPIGLAGGLNLYQYAPNPLGWIDPWGLAGSPTTATHITYQ-GIDAI 1311 Query: 114 WDILSDTYEDMKRLNLG 130 Y M+ + Sbjct: 1312 TGKPYVGYASMQGNQIA 1328 >UniRef50_B5PJT7 Protein RhsD n=2 Tax=Enterobacteriaceae RepID=B5PJT7_SALET Length = 429 Score = 146 bits (369), Expect = 5e-34, Method: Composition-based stats. Identities = 73/100 (73%), Positives = 80/100 (80%) Query: 2 LALMDADGNIAWSGEYDEWGNQLNEENPHHLHQPYRLPGQQYDKESGLYYNRNRYYDPLQ 61 LAL+ D +AW GEYDEWGN EENP HL Q RLPGQQYD+ESGLYYNR+RYY+P Q Sbjct: 179 LALITPDNTVAWRGEYDEWGNLSGEENPAHLEQVIRLPGQQYDEESGLYYNRHRYYNPGQ 238 Query: 62 GRYITQDPIGLEGGWSLYAYPLNPVNGIDPLGLSPADVAL 101 GRYITQDPIGL GGW+LY YPLNPV+ IDPLGLS D A Sbjct: 239 GRYITQDPIGLRGGWNLYNYPLNPVSEIDPLGLSMWDDAK 278 >UniRef50_Q328Z1 RhsA protein in rhs element n=7 Tax=Enterobacteriaceae RepID=Q328Z1_SHIDS Length = 1213 Score = 145 bits (367), Expect = 8e-34, Method: Composition-based stats. Identities = 72/100 (72%), Positives = 81/100 (81%) Query: 2 LALMDADGNIAWSGEYDEWGNQLNEENPHHLHQPYRLPGQQYDKESGLYYNRNRYYDPLQ 61 LAL+ +G W EYDEWGN LNEENPH L Q RLPGQQYD+ESGLYYNR+RYYDPLQ Sbjct: 1079 LALVSTEGATEWCAEYDEWGNLLNEENPHQLQQLIRLPGQQYDEESGLYYNRHRYYDPLQ 1138 Query: 62 GRYITQDPIGLEGGWSLYAYPLNPVNGIDPLGLSPADVAL 101 GRYITQDPIGL+GGW+LY Y LNP++ IDPLGLS + A Sbjct: 1139 GRYITQDPIGLKGGWNLYGYQLNPISDIDPLGLSMWEDAK 1178 >UniRef50_Q2SFR1 Rhs family protein n=9 Tax=cellular organisms RepID=Q2SFR1_HAHCH Length = 138 Score = 145 bits (367), Expect = 8e-34, Method: Composition-based stats. Identities = 44/95 (46%), Positives = 59/95 (62%), Gaps = 1/95 (1%) Query: 3 ALMDADGNIAWSGEYDEWGNQLNEENPHHLHQPYRLPGQQYDKESGLYYNRNRYYDPLQG 62 + +A+G + WS Y +GN + + + P R GQ YD+E+GL+YNR+RYYDP Sbjct: 45 EMTNAEGEVVWSARYKAYGNLALK-DVEDVQNPLRFQGQYYDEETGLHYNRHRYYDPSAA 103 Query: 63 RYITQDPIGLEGGWSLYAYPLNPVNGIDPLGLSPA 97 R+I QDP GL GG S Y Y LNP+ +DPLGL Sbjct: 104 RFINQDPAGLLGGESNYEYVLNPIEWVDPLGLMAK 138 >UniRef50_C9Y459 Putative uncharacterized protein n=1 Tax=Cronobacter turicensis RepID=C9Y459_CROTZ Length = 1523 Score = 145 bits (366), Expect = 1e-33, Method: Composition-based stats. Identities = 54/199 (27%), Positives = 79/199 (39%), Gaps = 8/199 (4%) Query: 4 LMDADGNIAWSGEYDEWGNQLNEENPHHLH--QPYRLPGQQYDKESGLYYNRNRYYDPLQ 61 + DA+G W G++ WG E + Q R GQ D+ESGL+YN RYYDP+ Sbjct: 1299 VTDANGQTVWRGQFSTWGETERELSVPQWQVPQNLRFQGQYLDRESGLHYNLFRYYDPVA 1358 Query: 62 GRYITQDPIGLEGGWSLYAYPLNPVNGIDPLGLSPADVALIRRKDQLNHQRAWDILSDTY 121 GRY DPIGL GG + Y Y +P+ +DPLGL + + + Q Sbjct: 1359 GRYTQMDPIGLLGGINTYGYVPDPLTWVDPLGLCFSKRLGDFGESHVKSQLERSGNYADV 1418 Query: 122 EDMKRLNLGGTDQFFHCMAFCRVSKLNDAGVSRSAKGLGYEKEIRDYGLNLFGMY--GRK 179 ++ + G D K + V + G + R F G Sbjct: 1419 FSVQNKSNNGIDLV----GLRHDGKYDFFEVKTNTTGSVGKLSDRQLSSKDFIETVLGTD 1474 Query: 180 VKLSHSEMIEDNKKDLAVN 198 V+ ++ D+ N Sbjct: 1475 VRKGGYDLNGHRVNDMLDN 1493 >UniRef50_P77779 Putative uncharacterized protein ybfO n=67 Tax=Enterobacteriaceae RepID=YBFO_ECOLI Length = 477 Score = 145 bits (366), Expect = 1e-33, Method: Composition-based stats. Identities = 71/125 (56%), Positives = 82/125 (65%) Query: 2 LALMDADGNIAWSGEYDEWGNQLNEENPHHLHQPYRLPGQQYDKESGLYYNRNRYYDPLQ 61 LAL+ +G W EYDEWGN LNEENPH L Q RLPGQQYD+ESGLYYNR+RYYDPLQ Sbjct: 233 LALVSTEGATEWCAEYDEWGNLLNEENPHQLQQLIRLPGQQYDEESGLYYNRHRYYDPLQ 292 Query: 62 GRYITQDPIGLEGGWSLYAYPLNPVNGIDPLGLSPADVALIRRKDQLNHQRAWDILSDTY 121 GRYITQDPIGL+GGW+ Y YPLNPV ID +GL+ L + Sbjct: 293 GRYITQDPIGLKGGWNFYQYPLNPVQYIDSMGLASKYGHLNNGGYGARPNKPPTPDPSKL 352 Query: 122 EDMKR 126 D+ + Sbjct: 353 PDIAK 357 >UniRef50_A1TQS0 YD repeat protein n=2 Tax=Acidovorax RepID=A1TQS0_ACIAC Length = 1602 Score = 145 bits (366), Expect = 1e-33, Method: Composition-based stats. Identities = 60/225 (26%), Positives = 88/225 (39%), Gaps = 46/225 (20%) Query: 2 LALMDADGNIAWSGEYDEWGN------QLNEENPHHL----------------------- 32 L L D +G IAW+ +Y WG ++ + Sbjct: 1337 LELTDVNGQIAWAVDYKVWGEATLRAVPRSDTGTDGVPGPRRQGHGPEAKSHAADSEVVC 1396 Query: 33 -------HQPYRLPGQQYDKESGLYYNRNRYYDPLQGRYITQDPIGLEGGWSLYAYPLNP 85 QP+R GQQ+D+E+GL+YNR+RYYDP GR+I++DPIG GG + + YPL+P Sbjct: 1397 AREPQRVEQPFRFQGQQFDEETGLHYNRSRYYDPAVGRFISEDPIGFLGGINTFIYPLDP 1456 Query: 86 VNGIDPLGLSPADVALIRRKDQLNHQRAWDILSDTYEDMKRLNLGGTDQFFHCMAFCRVS 145 + IDP GL+ KD L + K +GG H + Sbjct: 1457 YSWIDPTGLAGFRACPCVCKDILAG-------LNVGPHSKIKKIGGLYDSHHIYQDKALE 1509 Query: 146 KLNDAGVSRSAKGLGYEKEIRDYGLNLFGMYGRKVKLSHSEMIED 190 L G +R A + + R+ G K + Sbjct: 1510 GL--PGYTRGA-AVAISLQGRNADRTTRGTPHYKANRVQDQAGGG 1551 >UniRef50_Q12LF3 YD repeat n=2 Tax=Shewanella denitrificans OS217 RepID=Q12LF3_SHEDO Length = 927 Score = 145 bits (365), Expect = 2e-33, Method: Composition-based stats. Identities = 51/158 (32%), Positives = 66/158 (41%), Gaps = 3/158 (1%) Query: 2 LALMDADGNIAWSGEYDEWGNQLNEENPHHLHQPYRLPGQQYDKESGLYYNRNRYYDPLQ 61 AL D+ G + W Y +G + + + Q R PGQ YD+ESGL+YN R YDP Sbjct: 693 TALTDSTGTVQWQAHYTPFGQTIVD--IDKIKQAIRFPGQYYDEESGLHYNYFRDYDPEL 750 Query: 62 GRYITQDPIGLEGGWSLYAYPL-NPVNGIDPLGLSPADVALIRRKDQLNHQRAWDILSDT 120 GRYI DPIGL GG + Y Y NPV DP GL + + Sbjct: 751 GRYIQSDPIGLAGGINTYGYAYQNPVMNTDPTGLWVPQAIGALVNMGYEGYTQYQSGNFN 810 Query: 121 YEDMKRLNLGGTDQFFHCMAFCRVSKLNDAGVSRSAKG 158 + G F A + + AG + SA Sbjct: 811 MGRLFVAGATGALGGFGSSAIKAIGFGSLAGATNSAYQ 848 >UniRef50_D2TGW0 Putative Rhs protein n=2 Tax=Citrobacter RepID=D2TGW0_CITRO Length = 1477 Score = 144 bits (364), Expect = 2e-33, Method: Composition-based stats. Identities = 45/100 (45%), Positives = 56/100 (56%) Query: 1 MLALMDADGNIAWSGEYDEWGNQLNEENPHHLHQPYRLPGQQYDKESGLYYNRNRYYDPL 60 + L+ DG I W GE WG + R PGQ D+ESGLYYNR RYYD Sbjct: 1249 IQELLTEDGTIVWRGEQQLWGREEGRNRDDAPACRLRFPGQYEDEESGLYYNRFRYYDCE 1308 Query: 61 QGRYITQDPIGLEGGWSLYAYPLNPVNGIDPLGLSPADVA 100 G+Y+ DP+GL GG + Y Y NP+ IDPLGL+P + Sbjct: 1309 AGQYLCADPVGLAGGLNPYGYVNNPLKYIDPLGLNPLALP 1348 >UniRef50_D1YPL0 RHS repeat-associated core domain protein n=1 Tax=Veillonella parvula ATCC 17745 RepID=D1YPL0_9FIRM Length = 216 Score = 144 bits (362), Expect = 3e-33, Method: Composition-based stats. Identities = 46/124 (37%), Positives = 66/124 (53%), Gaps = 1/124 (0%) Query: 4 LMDADGNIAWSGEYDEWGNQLNEEN-PHHLHQPYRLPGQQYDKESGLYYNRNRYYDPLQG 62 + D DGN+ W G Y WG E +QP+RL Q D+E+GL+YN RYY+P G Sbjct: 1 MTDKDGNLLWFGNYTGWGRLKEETKVTDSAYQPFRLQNQYCDRETGLHYNFFRYYEPDAG 60 Query: 63 RYITQDPIGLEGGWSLYAYPLNPVNGIDPLGLSPADVALIRRKDQLNHQRAWDILSDTYE 122 R++ QDPIGLEGG ++Y + N + IDPLGL + A + + I + + Sbjct: 61 RFVNQDPIGLEGGDNIYLFSPNIQSWIDPLGLLSWNTARKQFWKAEAKKERDRIAKEKAK 120 Query: 123 DMKR 126 + Sbjct: 121 NPGS 124 >UniRef50_UPI0001B53B37 YD repeat protein n=1 Tax=Streptomyces sp. AA4 RepID=UPI0001B53B37 Length = 1465 Score = 144 bits (362), Expect = 3e-33, Method: Composition-based stats. Identities = 51/146 (34%), Positives = 71/146 (48%), Gaps = 8/146 (5%) Query: 2 LALMDADGNIAWSGEYDEWGNQLNEENPHHLHQPYRLPGQQYDKESGLYYNRNRYYDPLQ 61 L+D GN+AW + WG + + P R PGQ +D+ESGLYYN RYYDP Sbjct: 1266 TELIDPGGNVAWHADRTLWG-YRAGASQGGVSVPMRFPGQYHDEESGLYYNYFRYYDPET 1324 Query: 62 GRYITQDPIGLEGGWSLYAYPLNPVNGIDPLGLSPADVALIRRK-------DQLNHQRAW 114 GRY + DP+GL GG + +AY NP + +DP GLS I D+ Sbjct: 1325 GRYASPDPLGLHGGDNPHAYVANPTSWLDPFGLSANQRRWIDHSDGWRLGIDRFPIGGGS 1384 Query: 115 DILSDTYEDMKRLNLGGTDQFFHCMA 140 D Y + + L G++ +F+ Sbjct: 1385 DFEIHVYRNGEEKGLWGSEGWFNKHG 1410 >UniRef50_Q4K3M9 Rhs family protein n=5 Tax=Pseudomonas RepID=Q4K3M9_PSEF5 Length = 1486 Score = 143 bits (361), Expect = 4e-33, Method: Composition-based stats. Identities = 44/107 (41%), Positives = 55/107 (51%), Gaps = 2/107 (1%) Query: 7 ADGNIAWSGEYDEWGNQLNE--ENPHHLHQPYRLPGQQYDKESGLYYNRNRYYDPLQGRY 64 DG+ W Y WGN + E E + Q R GQ D+E+GL++N R+YDP GR+ Sbjct: 1261 PDGHSVWQARYQVWGNTVEEIREPYYIEEQNLRFQGQYLDRETGLHFNTFRFYDPDIGRF 1320 Query: 65 ITQDPIGLEGGWSLYAYPLNPVNGIDPLGLSPADVALIRRKDQLNHQ 111 T DPIGL GG +LY Y NP+ IDPLG RR Sbjct: 1321 TTPDPIGLAGGLNLYQYAPNPIGWIDPLGWICKSAYSGRRGTTKAKA 1367 >UniRef50_A1TSG3 RHS protein n=1 Tax=Acidovorax citrulli AAC00-1 RepID=A1TSG3_ACIAC Length = 384 Score = 143 bits (361), Expect = 4e-33, Method: Composition-based stats. Identities = 62/207 (29%), Positives = 89/207 (42%), Gaps = 39/207 (18%) Query: 3 ALMDADGNIAWSGEYDEWGNQLNEE------------------------NPHHLHQPYRL 38 + D DG++AW +Y WG+ + EE + + Q R+ Sbjct: 138 EMSDRDGHLAWRAQYRVWGSAVAEEWQAFDGVGRPVEAPRHETGQRPDNSAAPMPQNLRM 197 Query: 39 PGQQYDKESGLYYNRNRYYDPLQGRYITQDPIGLEGGWSLYAYPLNPVNGIDPLGLSPAD 98 GQ D+E+GL+YN RYY P G + T DPIGL GG +L+ Y NPV+ IDPLG +P Sbjct: 198 QGQYLDRETGLHYNTFRYYGPDVGAFTTPDPIGLAGGVNLHQYAPNPVSWIDPLGWNP-- 255 Query: 99 VALIRRKDQLNHQRAWDILSDTYEDMKRLNLGGTDQFFHCMAFCRVSKLNDAGVSRSAKG 158 + R Q H+ A + L GT + + R D VS +A+ Sbjct: 256 ---VCRMSQTPHETASSLP---------LVRPGTSAWQKAVEAIRQGGKGDIRVSTAAEA 303 Query: 159 LGYEKEIRDYGLNLFGMYGRKVKLSHS 185 +E R G++ MY S Sbjct: 304 KALLQEARG-GMDRRKMYSDDGDYSKG 329 >UniRef50_A9C0N8 YD repeat protein n=5 Tax=cellular organisms RepID=A9C0N8_DELAS Length = 1528 Score = 143 bits (361), Expect = 5e-33, Method: Composition-based stats. Identities = 67/211 (31%), Positives = 91/211 (43%), Gaps = 17/211 (8%) Query: 2 LALMDADGNIAWSGEYDEWGNQLNEENPHHLHQPYRLPGQQYDKESGLYYNRNRYYDPLQ 61 +AL D G +AW+ + D WGN L E NP +HQ RLPGQ +D+E+GLYYNR+RYYDP+ Sbjct: 1281 MALTDQTGQVAWAAKLDPWGNVLQEYNPQGIHQAIRLPGQHHDRETGLYYNRHRYYDPVV 1340 Query: 62 GRYITQDPIGLEGGWSLYAYP-LNPVNGIDPLGLSPADVALIRRKDQLNHQRAWDILSDT 120 G Y+ QDPIGL GG + Y +P + IDP GL+ + Sbjct: 1341 GSYVNQDPIGLAGGVNKILYSESSPTSKIDPTGLNTVAIGAGVGASVGG----------- 1389 Query: 121 YEDMKRLNLGGTDQFFHCMAFCRVSKLNDAGVSRSAKGLGYEKEIRDYGLNL---FGMYG 177 + + N VS++A L + G G Sbjct: 1390 -PIGAVVGAAIGVAVLGVLWMASRPSSNTTSVSKTASELSIAHNNAAQIGDRSSYQGYGG 1448 Query: 178 RKVKLSHSEMIEDNKKDLAVNDHGLTCPSTT 208 H ++ +D KKD GLTC Sbjct: 1449 NCTPDDHDKL-DDEKKDACDKSKGLTCGKNE 1478 >UniRef50_A1TTQ1 Rhs family protein n=1 Tax=Acidovorax citrulli AAC00-1 RepID=A1TTQ1_ACIAC Length = 357 Score = 143 bits (360), Expect = 5e-33, Method: Composition-based stats. Identities = 55/192 (28%), Positives = 79/192 (41%), Gaps = 36/192 (18%) Query: 2 LALMDADGNIAWSGEYDEWG------------------------------------NQLN 25 L L DA G++AW+ +Y WG NQL+ Sbjct: 96 LELTDAQGHVAWAADYKVWGEAALRKVLKSATGTDALPGPRHKGHGPVLDEHDAYKNQLS 155 Query: 26 EENPHHLHQPYRLPGQQYDKESGLYYNRNRYYDPLQGRYITQDPIGLEGGWSLYAYPLNP 85 L QP+R GQQ+D E+GL+YNR RYYDP GR+ +QDP+GL GG +AY NP Sbjct: 156 HSVSPFLEQPFRFQGQQFDAETGLHYNRFRYYDPSIGRFFSQDPVGLHGGIHGFAYAPNP 215 Query: 86 VNGIDPLGLSPADVALIRRKDQLNHQRAWDILSDTYEDMKRLNLGGTDQFFHCMAFCRVS 145 N IDPLGLS + + +N + ++ + F + Sbjct: 216 NNWIDPLGLSNKCYNCLPKCTDINAPHKPKTAEEMAAELSNQINKNSVTFSTPTKQGHID 275 Query: 146 KLNDAGVSRSAK 157 ++ + Sbjct: 276 LQGRTHYDKATE 287 >UniRef50_C5AIF2 Rhs family protein n=1 Tax=Burkholderia glumae BGR1 RepID=C5AIF2_BURGB Length = 345 Score = 143 bits (360), Expect = 5e-33, Method: Composition-based stats. Identities = 48/220 (21%), Positives = 76/220 (34%), Gaps = 9/220 (4%) Query: 2 LALMDADGNIAWSGEYDEWGNQ-------LNEENPHHLHQPYRLPGQQYDKESGLYYNRN 54 L + D G + W Y WG + P R GQQ+D E+G +YNR Sbjct: 119 LMMTDEAGELVWEASYRAWGEAQEVIERASAAAGIDVVRNPLRFQGQQFDDETGQHYNRY 178 Query: 55 RYYDPLQGRYITQDPIGLEGGWSLYAYPLNPVNGIDPLGLSPADVALIRRKDQLNHQRAW 114 RYYDP R++ +DPIGL GG ++Y Y NP++ IDPLGL + W Sbjct: 179 RYYDPGSSRFVNKDPIGLTGGINIYQYAPNPISWIDPLGLQRTLTGT-TKGGMSRCSSGW 237 Query: 115 DILSDTYEDMKRLNLGGTDQFFHCMAFCRVSKLNDAGVSRSAKGLGYEKEIRDYGLNLFG 174 + + + ++ + + + G Sbjct: 238 RDRYGPASMREHHLIPQA-MMNNDNFMAQMKNAGVSDPEDYIHRQIAQISNAQHIDVHEG 296 Query: 175 MYGRKVKLSHSEMIEDNKKDLAVNDHGLTCPSTTDCSDRC 214 + + + + +KDL + S R Sbjct: 297 GWNKDWQSWYQNNPNFTRKDLEAQTKSMMKDYNIPRSSRN 336 >UniRef50_A7K3Q8 Rhs family protein n=7 Tax=Vibrio RepID=A7K3Q8_VIBSE Length = 1384 Score = 142 bits (359), Expect = 6e-33, Method: Composition-based stats. Identities = 49/183 (26%), Positives = 78/183 (42%), Gaps = 5/183 (2%) Query: 4 LMDADGNIAWSGEYDEWGNQLNEENPHHLHQPYRLPGQQYDKESGLYYNRNRYYDPLQGR 63 L+D +GN+ WS YD G + + P R GQ +D+E+ L+YN RYYDP GR Sbjct: 1043 LIDCEGNVVWSASYDAHG--FAHVHIEKVVNPLRFQGQYFDQETNLHYNLARYYDPKLGR 1100 Query: 64 YITQDPIGLEGGWSLYAYPLNPVNGIDPLGLSPADVALI---RRKDQLNHQRAWDILSDT 120 +I QDPI + GG + Y Y +NP+ IDP G + + + D Sbjct: 1101 FIQQDPISIAGGINHYQYAINPIQWIDPTGFLCEEGLKRLQQMLAEYQAQNNVPQEVCDQ 1160 Query: 121 YEDMKRLNLGGTDQFFHCMAFCRVSKLNDAGVSRSAKGLGYEKEIRDYGLNLFGMYGRKV 180 + + + G D + R N + + +K + ++ GRK Sbjct: 1161 ILEAAKESSVGEDGVRSQVKIRRPDGQNHIRYEYDLERIDSQKNEITFYRHINYSDGRKH 1220 Query: 181 KLS 183 ++ Sbjct: 1221 EVH 1223 >UniRef50_A8ZTS1 YD repeat protein n=4 Tax=Desulfococcus oleovorans Hxd3 RepID=A8ZTS1_DESOH Length = 1935 Score = 142 bits (359), Expect = 7e-33, Method: Composition-based stats. Identities = 52/220 (23%), Positives = 88/220 (40%), Gaps = 8/220 (3%) Query: 3 ALMDADGNIAWSGEYDEWGNQLNEENPHHLHQPYRLPGQQYDKESGLYYNRNRYYDPLQG 62 + ADG +A + EY +G +++ +R + +D ESGLY RYYDP G Sbjct: 1663 MVSAADGTVAAAYEYAPFGGLIHKSGVMADENVFRFSTKYWDGESGLYEYGLRYYDPETG 1722 Query: 63 RYITQDPIGLEGGWSLYAYP-LNPVNGIDPLGLSPADVALIRRKDQLNHQRAWDILSDTY 121 R+I++DP+G GG +LY++ +P N D LGL + D + W Y Sbjct: 1723 RWISRDPVGESGGLNLYSFVMNDPTNFFDLLGLVQVGGSEGYSSDSMYIDITWYKPKSLY 1782 Query: 122 EDMKRLNLGGTDQFFHCMAFCRVSKLNDAGVSRSAKGLGYEKEIRDYGLNLFGMYGRKVK 181 K ++ + + + +LF Y R+ + Sbjct: 1783 LLFKCKGCPDVNEAPVAGRTVDIDYDKSNFKVETIIKYVLKIVSGGEDADLFWTYNREKR 1842 Query: 182 LSHSEMIEDNKKDLAVND---HGLTCPSTTDCS---DRCS 215 +S + + KKD+ + ++C +CS D C Sbjct: 1843 Y-YSSLPGEAKKDIEIGRPCWVKISCEVNCECSCGGDACH 1881 >UniRef50_D0KES8 RHS protein n=3 Tax=Pectobacterium wasabiae WPP163 RepID=D0KES8_PECWW Length = 307 Score = 142 bits (358), Expect = 9e-33, Method: Composition-based stats. Identities = 46/105 (43%), Positives = 59/105 (56%), Gaps = 8/105 (7%) Query: 2 LALMDADGNIAWSGEYDEWGNQLNEE--------NPHHLHQPYRLPGQQYDKESGLYYNR 53 L + DA+G + WSG+Y +G + Q R GQ D+E+GL+YN Sbjct: 86 LEMTDAEGAVRWSGDYGSFGAVNGQTQDSEGLRHGKQAESQSLRYAGQYADEETGLHYNL 145 Query: 54 NRYYDPLQGRYITQDPIGLEGGWSLYAYPLNPVNGIDPLGLSPAD 98 RYYDP GR+ TQDPIGL GG +LY Y NP+ IDPLGL+ Sbjct: 146 FRYYDPTVGRFTTQDPIGLAGGINLYQYAPNPLTWIDPLGLACTK 190 >UniRef50_C6M9F5 RHS family protein n=2 Tax=Bacteria RepID=C6M9F5_NEISI Length = 448 Score = 141 bits (356), Expect = 2e-32, Method: Composition-based stats. Identities = 54/197 (27%), Positives = 82/197 (41%), Gaps = 4/197 (2%) Query: 3 ALMDADGNIAWSGEYDEWGNQLNEEN-PHHLHQPYRLPGQQYDKESGLYYNRNRYYDPLQ 61 + D DGN+ W G Y WG E +QP+RL Q D+E+GL+YN RYY+P Sbjct: 224 EMTDKDGNLLWFGNYTGWGRLKEETKVTDSAYQPFRLQNQYADRETGLHYNFFRYYEPDA 283 Query: 62 GRYITQDPIGLEGGWSLYAYPLNPVNGIDPLGLSPADVALIRRKDQLNHQRAWDILSDTY 121 GR++ QDPIGLEGG +LY + N N +D GL L + +I Sbjct: 284 GRFVNQDPIGLEGGENLYKFAPNAQNWVDIFGLWSFGDPLPQWLVDGAAGFGDNISMGIT 343 Query: 122 EDMKRL--NLGGTDQFFHCMAFCRVSKLNDAGVSRSAKGLGYEKEIRDYG-LNLFGMYGR 178 + ++L GG + + ++ Y+ R R Sbjct: 344 KKARQLLGISGGVTESTAYKTGDWLGEIPGLINPAKKVNAAYKGLKRAKLWKCEMKKISR 403 Query: 179 KVKLSHSEMIEDNKKDL 195 K + ++ + K +L Sbjct: 404 TAKRTRAKRVTKAKANL 420 >UniRef50_Q6D1M4 Rhs protein n=5 Tax=Enterobacteriaceae RepID=Q6D1M4_ERWCT Length = 1618 Score = 141 bits (356), Expect = 2e-32, Method: Composition-based stats. Identities = 51/188 (27%), Positives = 81/188 (43%), Gaps = 14/188 (7%) Query: 3 ALMDADGNIAWSGEYDEWG------------NQLNEENPHHLHQPYRLPGQQYDKESGLY 50 L +G I W GE WG L + ++ R GQ YD E+GLY Sbjct: 1397 ELCSEEGEIRWRGEQGLWGAHREERRPIPLRRYLGDAANEEVYCELRYQGQLYDAETGLY 1456 Query: 51 YNRNRYYDPLQGRYITQDPIGLEGGWSLYAYPLNPVNGIDPLGLSPADV--ALIRRKDQL 108 YNR+RYYD G+Y++ DPIGL GG Y Y NP++ +DPLGL+P + + D+ Sbjct: 1457 YNRHRYYDAESGQYLSPDPIGLAGGKRAYGYVKNPLSWVDPLGLTPKEGKNGSDKGPDKK 1516 Query: 109 NHQRAWDILSDTYEDMKRLNLGGTDQFFHCMAFCRVSKLNDAGVSRSAKGLGYEKEIRDY 168 N + + + D +++ L + + + +E+ + Sbjct: 1517 NSRAKINNIEDIFDNPDVLKGHTPESLKEELGGIPEGWKAGTMNKSRTEDGFTLRELNER 1576 Query: 169 GLNLFGMY 176 G ++ Y Sbjct: 1577 GNDVTDRY 1584 >UniRef50_UPI0001C34A7C Rhs family protein n=1 Tax=Neisseria subflava NJ9703 RepID=UPI0001C34A7C Length = 335 Score = 141 bits (356), Expect = 2e-32, Method: Composition-based stats. Identities = 57/170 (33%), Positives = 77/170 (45%), Gaps = 11/170 (6%) Query: 3 ALMDADGNIAWSGEYDEWGNQLNEEN-PHHLHQPYRLPGQQYDKESGLYYNRNRYYDPLQ 61 + D DGN+ W G Y WG +E N HQP+RL Q D+E+GL+YN RYY+P Sbjct: 92 EMTDEDGNLLWFGNYTGWGKLKSETNISGTAHQPFRLQNQYCDRETGLHYNFFRYYEPDA 151 Query: 62 GRYITQDPIGLEGGWSLYAYPLNPVNGIDPLGLSPADVALIRRKDQLNHQRAWDILSDTY 121 GR++ QDPIGL GG + Y + N +DPLGL Q DI Sbjct: 152 GRFVNQDPIGLFGGSNFYMFAFNISRWLDPLGLKGKSKRSCEEIGQ-------DIDRLIN 204 Query: 122 EDMKRLNLGGTDQFFHCMAF---CRVSKLNDAGVSRSAKGLGYEKEIRDY 168 D ++ N GGT H R + + + +K +RD Sbjct: 205 RDKRKCNNGGTHGLRHRFNEQINGRNGPGTQSWKTHEQEIKNQQKSLRDR 254 >UniRef50_B3E8D4 YD repeat protein n=1 Tax=Geobacter lovleyi SZ RepID=B3E8D4_GEOLS Length = 1464 Score = 141 bits (356), Expect = 2e-32, Method: Composition-based stats. Identities = 41/191 (21%), Positives = 74/191 (38%), Gaps = 3/191 (1%) Query: 1 MLALMDADGNIAWSGEYDEWGNQLNEENPHHLHQPYRLPGQQYDKESGLYYNRNRYYDPL 60 + A++++ G++ S YD +G L L QP R + YD+ +GLYY +R+Y P Sbjct: 1190 VTAVLNSSGSVVASYAYDPFGGTLAASGT--LSQPIRYSTKLYDEGTGLYYFGHRFYSPQ 1247 Query: 61 QGRYITQDPIGLEGGWSLYAY-PLNPVNGIDPLGLSPADVALIRRKDQLNHQRAWDILSD 119 GR++++DP+ +LY + NP+ DP G + R + Q Q + L Sbjct: 1248 MGRWLSRDPLSEMASINLYRFAANNPLTHFDPFGAADNGGFWDRPEVQAQLQAQREALQA 1307 Query: 120 TYEDMKRLNLGGTDQFFHCMAFCRVSKLNDAGVSRSAKGLGYEKEIRDYGLNLFGMYGRK 179 + + ++RS + + G + Sbjct: 1308 KRASEVTTFEKVKNSIGSAFKWIGDKFKEQPEMARSIEEKVANTALEANKYTKAGKEWNE 1367 Query: 180 VKLSHSEMIED 190 + +M ED Sbjct: 1368 RINTGIQMAED 1378 >UniRef50_B4SV70 Rhs-family protein n=42 Tax=Enterobacteriaceae RepID=B4SV70_SALNS Length = 1359 Score = 141 bits (355), Expect = 2e-32, Method: Composition-based stats. Identities = 48/186 (25%), Positives = 74/186 (39%), Gaps = 2/186 (1%) Query: 3 ALMDADGNIAWSGEYDEWGNQLNEENPHH--LHQPYRLPGQQYDKESGLYYNRNRYYDPL 60 + ADG + W+G +G + + HQP RLPGQ +D E+GL+YN RYY P Sbjct: 1158 EVTAADGTLVWAGYIRGFGENAADISNSGAYFHQPLRLPGQYFDDETGLHYNLFRYYAPE 1217 Query: 61 QGRYITQDPIGLEGGWSLYAYPLNPVNGIDPLGLSPADVALIRRKDQLNHQRAWDILSDT 120 GR+++QDPIGL GG +LYAY NP+ IDPLGL+ + + + + Sbjct: 1218 CGRFVSQDPIGLRGGLNLYAYAPNPIRWIDPLGLAILEHQSNFDAARRTGFENAGMTNPE 1277 Query: 121 YEDMKRLNLGGTDQFFHCMAFCRVSKLNDAGVSRSAKGLGYEKEIRDYGLNLFGMYGRKV 180 +++ + + + G G Sbjct: 1278 DVTFSKVDPKTGTVVEFKGPNGAKVAYDAPHADMDVTAGHDKPHVGWQSAGKRGSGGANR 1337 Query: 181 KLSHSE 186 + Sbjct: 1338 GNITYD 1343 >UniRef50_B9B9U8 YD repeat protein n=1 Tax=Burkholderia multivorans CGD1 RepID=B9B9U8_9BURK Length = 345 Score = 141 bits (355), Expect = 2e-32, Method: Composition-based stats. Identities = 42/107 (39%), Positives = 56/107 (52%), Gaps = 6/107 (5%) Query: 4 LMDADGNIAWSGEYDEWGN------QLNEENPHHLHQPYRLPGQQYDKESGLYYNRNRYY 57 L D DG++ W Y WG + ++ R GQQ D+E+GL+YNR RYY Sbjct: 185 LTDDDGDVVWEASYKAWGEAREVIARASKAAGIVARNSLRFQGQQEDEETGLHYNRYRYY 244 Query: 58 DPLQGRYITQDPIGLEGGWSLYAYPLNPVNGIDPLGLSPADVALIRR 104 DP GR+ + DPI L GG ++Y Y LN V+ IDP G S + Sbjct: 245 DPTSGRFTSADPIRLAGGSNVYQYALNLVSWIDPFGFSSTTDGTGNQ 291 >UniRef50_C8Q7Z5 YD repeat protein n=8 Tax=Enterobacteriaceae RepID=C8Q7Z5_9ENTR Length = 1507 Score = 141 bits (355), Expect = 2e-32, Method: Composition-based stats. Identities = 56/216 (25%), Positives = 84/216 (38%), Gaps = 13/216 (6%) Query: 4 LMDADGNIAWSGEYDEWGNQLNEENPHHLH--QPYRLPGQQYDKESGLYYNRNRYYDPLQ 61 + D G+ W G + WG E Q R GQ D+E+GL+YN RYYDP Sbjct: 1278 VTDIRGDTVWQGAFAAWGRTTRESTGVDWEVPQNLRFQGQYLDRETGLHYNTFRYYDPCG 1337 Query: 62 GRYITQDPIGLEGGWSLYAYPLNPVNGIDPLGLSPADVALIRRKDQLNHQRAWDILSDTY 121 GRY DPIGL GG +LY Y + + GIDPLGL+ + + H + + + Sbjct: 1338 GRYTQLDPIGLMGGLNLYQYAPDVLTGIDPLGLA-TRLNNGQYNVFQEHTINAEHIYSSD 1396 Query: 122 -EDMKRLNLGGTDQFFHCMAFCRVSKLNDAGVSRSAKGLGYEKEIRDYGLNLFGMYGRKV 180 R N ++ F R +S K + + G+ Sbjct: 1397 GVQFNRANTEFINRMNTDATFRRDMLGRYPELSDWMK-------NPNKASSPPGLTWHHH 1449 Query: 181 KLSHSEMIEDNKKDLAVNDHGLTCPSTTDCSDRCSD 216 + + + ++ D A N H L P+ + Sbjct: 1450 ED-YGRLTLVDRADHADN-HSLYHPTGKGGREMWGG 1483 >UniRef50_C6WJA6 YD repeat protein n=3 Tax=Actinosynnema mirum DSM 43827 RepID=C6WJA6_ACTMD Length = 1509 Score = 141 bits (355), Expect = 2e-32, Method: Composition-based stats. Identities = 50/184 (27%), Positives = 77/184 (41%), Gaps = 10/184 (5%) Query: 3 ALMDADGNIAWSGEYDEWGNQLNEENPHHLHQPYRLPGQQYDKESGLYYNRNRYYDPLQG 62 L+D G + W + WG L E P R GQ +D+E+GL+YN +RYYDP Sbjct: 1284 ELLDDAGALVWRSQRTLWGAVLAELAGGP-DCPLRFAGQYHDRETGLFYNVHRYYDPETA 1342 Query: 63 RYITQDPIGLEGGWSLYAYPLNPVNGIDPLGLSPADVA------LIRRKDQLNHQRAWDI 116 RY + DP+GL G + +AY +NP+ DPLGLS + A ++ R Q + A + Sbjct: 1343 RYSSPDPLGLLAGPNPHAYVVNPLRLTDPLGLSGCERARKIADRVVERAQQGRVREASNY 1402 Query: 117 LSDTYEDMKRLNLGGTDQFFHCMAFCRVS---KLNDAGVSRSAKGLGYEKEIRDYGLNLF 173 + + L D +H + +D ++ + Sbjct: 1403 HGRLSRERELEILSNPDGVYHSTGSGGRLIFRQGDDILITEGPGSSAGQLVTSYGPSGPR 1462 Query: 174 GMYG 177 G G Sbjct: 1463 GESG 1466 >UniRef50_C5CXA7 YD repeat protein n=1 Tax=Variovorax paradoxus S110 RepID=C5CXA7_VARPS Length = 1434 Score = 140 bits (353), Expect = 3e-32, Method: Composition-based stats. Identities = 63/211 (29%), Positives = 86/211 (40%), Gaps = 12/211 (5%) Query: 2 LALMDADGNIAWSGEYDEWGNQLNEENPHHLHQPYRLPGQQYDKESGLYYNRNRYYDPLQ 61 L + DG I W+ YD WG + + L QP R GQQ D E+GL+YNR+RY+DP+ Sbjct: 1223 LRMTTRDGQIVWAVRYDVWGGIARK-DCELLAQPIRCQGQQEDAETGLFYNRHRYFDPII 1281 Query: 62 GRYITQDPIGLEGGWSLYAY-PLNPVNGIDPLGLSPADVALIRRKD-QLNHQRAWDILSD 119 G Y++ DPIGL GG + YAY NP IDPLGL+ + +R D QL R I Sbjct: 1282 GAYVSADPIGLRGGVNPYAYGCSNPYFWIDPLGLAASCERHLRSIDEQLQQGRRARIDVA 1341 Query: 120 TYEDMKRLNLGGTDQFFHCMAFCRVSKLNDAGVSRSAKGLGYEKEIRDYGLNLFGMYGRK 179 + +D L L T + + D+ G Y R+ Sbjct: 1342 SKDDAIELLLAYTTGP------AGRRGAFRNTTDQPLPEGKKSESGSDWLPGGRGAYQRE 1395 Query: 180 VKLSHSEM---IEDNKKDLAVNDHGLTCPST 207 + L N + Sbjct: 1396 GTYHWDAADPNAKAGDHALEGNHLQIHNFDG 1426 >UniRef50_A1TSU8 YD repeat protein n=4 Tax=Acidovorax RepID=A1TSU8_ACIAC Length = 1586 Score = 140 bits (353), Expect = 3e-32, Method: Composition-based stats. Identities = 48/206 (23%), Positives = 66/206 (32%), Gaps = 29/206 (14%) Query: 3 ALMDADGNIAWSGEYDEWGNQLN----EENPHH------------------------LHQ 34 L D G I W+ Y WG + Q Sbjct: 1339 ELTDEQGRIVWAASYQVWGQTRALQVMRTGTDDAAVFTQAERPLALAAKGDVQALNFVEQ 1398 Query: 35 PYRLPGQQYDKESGLYYNRNRYYDPLQGRYITQDPIGLEGGWSLYAYPLNPVNGIDPLGL 94 P R GQ +D E+GL+YNR RYYDP+ GR++ QDPIGL GG +L+ Y NP+ DPLGL Sbjct: 1399 PLRFQGQYFDGETGLHYNRFRYYDPVTGRFVHQDPIGLAGGNNLFFYAPNPLIWNDPLGL 1458 Query: 95 SPADVALIRRKDQLNHQRAWDILSDTYEDMKRLNLGGTDQFFHCMAFCRVSKLNDAGVSR 154 P + + + S + Sbjct: 1459 KPKRCGCRPDPCGIAK-HGHQPSPRPSDTESHHIIQDAWAKSQIGKQGGYSTYGAPAILL 1517 Query: 155 SAKGLGYEKEIRDYGLNLFGMYGRKV 180 ++ + GR Sbjct: 1518 QNGPHDIANAAQNARRDARLATGRGK 1543 >UniRef50_B2PW78 Putative uncharacterized protein n=1 Tax=Providencia stuartii ATCC 25827 RepID=B2PW78_PROST Length = 330 Score = 140 bits (353), Expect = 3e-32, Method: Composition-based stats. Identities = 49/226 (21%), Positives = 78/226 (34%), Gaps = 25/226 (11%) Query: 2 LALMDADGNIAWSGEYDEWGNQLNEENP---------HHLHQPYRLPGQQYDKESGLYYN 52 L + + GN WSG+Y+ +G + Q R GQ +D E+GL++N Sbjct: 118 LDVTNEQGNTVWSGKYERFGFVRSSPLSFYSDPDRKMESFEQNLRYAGQYFDNETGLHFN 177 Query: 53 RNRYYDPLQGRYITQDPIGLEGGWSLYAYPLNPVNGIDPLGLSPADVALIRRKDQLNHQR 112 R+YDP GR+I DPIGL GG +LYAY NP++ +DP GL + + + + Sbjct: 178 TFRFYDPQIGRFIMPDPIGLLGGMNLYAYAPNPMSWVDPFGLMTLYRGMNISEYESLMKT 237 Query: 113 AWDILSDTYEDMKRLNLGGTDQFFHCMAFCRVSKLNDAGVSRSAKGLGYEKEIRDYGLNL 172 + K + + + D Sbjct: 238 GTWTSPPNALEGKWFATTYDNAVIWGNKMGHGGDTF----------KVVQINVPDEIA-- 285 Query: 173 FGMYGRKVKLSHSEMIEDNKK-DLAVNDHGLTCPST--TDCSDRCS 215 + L DL ++ +T + RC Sbjct: 286 -KKWHFDPHLDSIGPARFATLDDLNNSNVKITWSKNVTAKGNQRCI 330 >UniRef50_A1TV35 YD repeat protein n=2 Tax=Acidovorax RepID=A1TV35_ACIAC Length = 1679 Score = 140 bits (353), Expect = 4e-32, Method: Composition-based stats. Identities = 62/211 (29%), Positives = 87/211 (41%), Gaps = 18/211 (8%) Query: 2 LALMDADGN----IAWSGEYDEWGNQLNEENPHHLHQPYRLPGQQYDKESGLYYNRNRYY 57 +A++DA+G + W+ Y WG E NP+ + QP R GQQ+D E+GL+YNR RYY Sbjct: 1432 IAMVDANGRHSGLLTWAATYHSWGALREEYNPNDISQPIRFQGQQFDAETGLHYNRLRYY 1491 Query: 58 DPLQGRYITQDPIGLEGGWSLYAYPLNPVNGIDPLGLSPADVALIRRKDQLNHQRAWDIL 117 DP G+Y+TQDPIGL GG Y YP++P IDP GL+P + A + Sbjct: 1492 DPSLGQYLTQDPIGLLGGNDKYIYPVSPTGWIDPTGLNPLAACAVPGPTMAACAVAAEKA 1551 Query: 118 SDTYEDMKRLNLGGTDQFFHCMAFCRVSKLNDAGVSRSAKGLGYEKEIRDYGLNLFGMYG 177 + A+ L E G G Sbjct: 1552 MSLLALGTAALAT-------------AMIPSSTSQMSDAERLSSAHEAAMGNRTAPGYGG 1598 Query: 178 RKVKLSHSEMIEDNKKDLAVNDHGLTCPSTT 208 H + + D K+ + GL+C Sbjct: 1599 NCSPEEH-DALRDKKEKACDDAKGLSCQKGE 1628 >UniRef50_Q31U53 Putative uncharacterized protein n=1 Tax=Shigella boydii Sb227 RepID=Q31U53_SHIBS Length = 927 Score = 140 bits (353), Expect = 4e-32, Method: Composition-based stats. Identities = 70/93 (75%), Positives = 78/93 (83%) Query: 2 LALMDADGNIAWSGEYDEWGNQLNEENPHHLHQPYRLPGQQYDKESGLYYNRNRYYDPLQ 61 L L+ +G W EYDEWGN LNEENPHHL Q RLPGQQYD+ESGLYYNR+RYYDPLQ Sbjct: 810 LVLISTEGATEWCAEYDEWGNLLNEENPHHLQQLIRLPGQQYDEESGLYYNRHRYYDPLQ 869 Query: 62 GRYITQDPIGLEGGWSLYAYPLNPVNGIDPLGL 94 GRYITQDPIGL+GGW+ Y YPL+PVN +DPLGL Sbjct: 870 GRYITQDPIGLKGGWNFYQYPLSPVNSMDPLGL 902 >UniRef50_C0EPY5 Putative uncharacterized protein n=1 Tax=Neisseria flavescens NRL30031/H210 RepID=C0EPY5_NEIFL Length = 193 Score = 140 bits (352), Expect = 5e-32, Method: Composition-based stats. Identities = 52/167 (31%), Positives = 77/167 (46%), Gaps = 10/167 (5%) Query: 4 LMDADGNIAWSGEYDEWGNQLNEENP-HHLHQPYRLPGQQYDKESGLYYNRNRYYDPLQG 62 + D GN+ W GEY WG +E HQP+RL Q YD E+GL+YN RYYD G Sbjct: 1 MTDIHGNLLWYGEYTAWGRLKKDERVYKDAHQPFRLQNQYYDSETGLHYNYFRYYDSETG 60 Query: 63 RYITQDPIGLEGGWSLYAYPLNPVNGIDPLGLSPADVALIRR---------KDQLNHQRA 113 R+++QD IGL GG + Y + N + IDPLGL + R+ + + Sbjct: 61 RFVSQDVIGLVGGENFYQFSPNTQSWIDPLGLKELYYLVARKDGFYPVMEWGKREPVGKV 120 Query: 114 WDILSDTYEDMKRLNLGGTDQFFHCMAFCRVSKLNDAGVSRSAKGLG 160 W D ++ + N+ Q + + + L V + K + Sbjct: 121 WLKKGDIWKIGETKNIVNGVQRRYSQQWLDRNNLKYIRVMKGPKKVM 167 >UniRef50_A1TR18 YD repeat protein n=8 Tax=Acidovorax RepID=A1TR18_ACIAC Length = 1654 Score = 139 bits (351), Expect = 6e-32, Method: Composition-based stats. Identities = 59/224 (26%), Positives = 84/224 (37%), Gaps = 32/224 (14%) Query: 3 ALMDADGNIAWSGEYDEWGNQLNEE-------------------NPHHLHQPYRLPGQQY 43 + D G + W + WG+ L E N L Q RL GQ Sbjct: 1409 EVTDEAGEVRWRASWRTWGSALEERWEAVRIDGSAIPAVQQRHRNEDTLEQNLRLQGQYL 1468 Query: 44 DKESGLYYNRNRYYDPLQGRYITQDPIGLEGGWSLYAYPLNPVNGIDPLGLSPADVALIR 103 D+E+GL+YN RYYDP GR+I+ DPIGL GG +L Y NP++ IDPLG + +I Sbjct: 1469 DRETGLHYNTFRYYDPDVGRFISPDPIGLAGGLNLQRYAANPISWIDPLGH---ENYVII 1525 Query: 104 RKDQLNHQRAWDILSDTYEDMKRLNLGGTDQFFHCMAFCRVSKLNDAGVSRSAKGLGYEK 163 + Q R I+SD + D + + S + + Sbjct: 1526 GEGQDAVNRYAKIMSDNPKLKGHEFRTIQDDWK--PMMRKSGASRLEFGSAEWERKAIQA 1583 Query: 164 EIRD--------YGLNLFGMYGRKVKLSHSEMIEDNKKDLAVND 199 + Y G G + S + +D L V Sbjct: 1584 NVDWIKDRHAEGYKFIDIGEDGSPNRSSFYKAEKDALSALGVKP 1627 >UniRef50_B2PVY9 Putative uncharacterized protein n=12 Tax=Enterobacteriaceae RepID=B2PVY9_PROST Length = 707 Score = 139 bits (351), Expect = 6e-32, Method: Composition-based stats. Identities = 43/136 (31%), Positives = 61/136 (44%), Gaps = 9/136 (6%) Query: 2 LALMDADGNIAWSGEYDEWGNQLN---------EENPHHLHQPYRLPGQQYDKESGLYYN 52 L + + GN WSG+Y+ +G + E Q R GQ +D E+GL++N Sbjct: 488 LDVTNEQGNTVWSGKYERFGFVRSSPLSFYSDPERKMESFEQNLRYAGQYFDNETGLHFN 547 Query: 53 RNRYYDPLQGRYITQDPIGLEGGWSLYAYPLNPVNGIDPLGLSPADVALIRRKDQLNHQR 112 R+YDP GR+I DPIGL GG +LY Y NP+ IDP G V D Sbjct: 548 TFRFYDPQIGRFIMPDPIGLLGGINLYQYAPNPLGWIDPWGWEIVRVYHYTSSDGYKGIM 607 Query: 113 AWDILSDTYEDMKRLN 128 ++ + + Sbjct: 608 GTGSINMSDPGARGKG 623 >UniRef50_UPI0001B52C8C protein, rhs-like protein n=4 Tax=Enterobacteriaceae RepID=UPI0001B52C8C Length = 243 Score = 139 bits (351), Expect = 7e-32, Method: Composition-based stats. Identities = 69/123 (56%), Positives = 80/123 (65%) Query: 4 LMDADGNIAWSGEYDEWGNQLNEENPHHLHQPYRLPGQQYDKESGLYYNRNRYYDPLQGR 63 L+ +G W EYDEWGN LNEENPH L Q RLPGQQYD+ESGLYYNR+RYYDPLQGR Sbjct: 1 LVSTEGATEWCAEYDEWGNLLNEENPHQLQQLIRLPGQQYDEESGLYYNRHRYYDPLQGR 60 Query: 64 YITQDPIGLEGGWSLYAYPLNPVNGIDPLGLSPADVALIRRKDQLNHQRAWDILSDTYED 123 YITQDPIGL+GGW+ Y YPLNPV ID +GL+ L + D Sbjct: 61 YITQDPIGLKGGWNFYQYPLNPVQYIDSMGLASKYGHLNNGGYGARPNKPPTPDPSKLPD 120 Query: 124 MKR 126 + + Sbjct: 121 IAK 123 >UniRef50_A1AK54 YD repeat protein n=1 Tax=Pelobacter propionicus DSM 2379 RepID=A1AK54_PELPD Length = 1352 Score = 139 bits (350), Expect = 7e-32, Method: Composition-based stats. Identities = 44/116 (37%), Positives = 65/116 (56%), Gaps = 4/116 (3%) Query: 1 MLALMDADGNIAWSGEYDEWGNQLNEENPHHLHQPYRLPGQQYDKESGLYYNRNRYYDPL 60 ++AL D G + YD +GN ++ ++ QP+ G+++D+E+GLYY R RYYDP Sbjct: 1146 IVALTDRHGTVVQEYNYDSFGNP--DQRGENIDQPFSYTGREWDRETGLYYYRARYYDPK 1203 Query: 61 QGRYITQDPIGLEGG-WSLYAYP-LNPVNGIDPLGLSPADVALIRRKDQLNHQRAW 114 GR+I +DPI GG +LYAY NP+N +DP GL + + N W Sbjct: 1204 IGRFIQKDPISFAGGDVNLYAYVLNNPINRLDPFGLWNSKTFPTNISNYANSALYW 1259 >UniRef50_D2UF28 Putative rhs family protein n=2 Tax=Xanthomonas albilineans RepID=D2UF28_XANAL Length = 1812 Score = 139 bits (350), Expect = 8e-32, Method: Composition-based stats. Identities = 47/172 (27%), Positives = 67/172 (38%), Gaps = 5/172 (2%) Query: 4 LMDADGNIAWSGEYDEWGNQLNEENPHHLHQPYRLPGQQYDKESGLYYNRNRYYDPLQGR 63 L D +GN +YD +G + PY+ G++ D SGLYY R RYY P R Sbjct: 1556 LTDTNGNAVQRYDYDPYGTTTQSSAS--YNNPYQYTGRERDA-SGLYYYRARYYTPELAR 1612 Query: 64 YITQDPIGLEGGWSLYAYP-LNPVNGIDPLGLSPADVALIRRKDQLNHQRAWDILSDTYE 122 +I++DPI L GG + YAY NPV DP+G A+ + A Sbjct: 1613 FISEDPIKLAGGVNTYAYTGGNPVMYRDPVGHEFVTAAIGAVLGGVAGYEAGGWQGAVAG 1672 Query: 123 DMKRLNLGGTDQFFHCMAFCRVSKLNDAGVSRSAKGLGYEKEIRDYGLNLFG 174 + +G ++ AG+ SA G + G Sbjct: 1673 GLVGGVVGFVAPQASAF-VGTLAGEGLAGMFASAGATVVIGGASGAGATVLG 1723 >UniRef50_B5H9Z7 Rhs protein n=2 Tax=Streptomyces RepID=B5H9Z7_STRPR Length = 1054 Score = 139 bits (350), Expect = 8e-32, Method: Composition-based stats. Identities = 44/158 (27%), Positives = 64/158 (40%), Gaps = 1/158 (0%) Query: 3 ALMDADGNIAWSGEYDEWGNQLNEENPHHLHQPYRLPGQQYDKESGLYYNRNRYYDPLQG 62 L+D G +AW WG + P R PGQ +D ESGL+YNR+R+YDP G Sbjct: 748 ELVDEQGKVAWRTRATLWGTTTWN-RSATAYTPLRFPGQYFDPESGLHYNRHRHYDPESG 806 Query: 63 RYITQDPIGLEGGWSLYAYPLNPVNGIDPLGLSPADVALIRRKDQLNHQRAWDILSDTYE 122 RY++ DP+GL + Y NP IDPLGL+ + + + + Sbjct: 807 RYLSPDPLGLVPAPNAVTYVDNPTRWIDPLGLAGCPHRNGEHRHSVVLGVDMEPHRQSES 866 Query: 123 DMKRLNLGGTDQFFHCMAFCRVSKLNDAGVSRSAKGLG 160 + L + G + S + G Sbjct: 867 LARYLRNDPSYPNHDPNRPQDPGAHTYNGNAYSGEEAG 904 >UniRef50_D1SWH5 RHS protein (Fragment) n=1 Tax=Acidovorax avenae subsp. avenae ATCC 19860 RepID=D1SWH5_9BURK Length = 280 Score = 139 bits (350), Expect = 8e-32, Method: Composition-based stats. Identities = 43/116 (37%), Positives = 57/116 (49%), Gaps = 19/116 (16%) Query: 3 ALMDADGNIAWSGEYDEWGNQLNEE-------------------NPHHLHQPYRLPGQQY 43 + D G + W + WG+ L E + L Q RL GQ Sbjct: 28 EVTDEAGEVRWRASWRTWGSALEERWEAVRIDGSAIPAVQQRHRDEDTLEQNLRLQGQYL 87 Query: 44 DKESGLYYNRNRYYDPLQGRYITQDPIGLEGGWSLYAYPLNPVNGIDPLGLSPADV 99 D+E+GL+YN RYYDP GR+I+ DPIGL GG +L Y +NP+ IDP GL + Sbjct: 88 DRETGLHYNTFRYYDPDMGRFISPDPIGLAGGLNLQRYAINPLAWIDPWGLCGEKI 143 >UniRef50_C7MXD3 Rhs family protein n=1 Tax=Saccharomonospora viridis DSM 43017 RepID=C7MXD3_SACVD Length = 1485 Score = 139 bits (350), Expect = 9e-32, Method: Composition-based stats. Identities = 48/139 (34%), Positives = 69/139 (49%), Gaps = 5/139 (3%) Query: 2 LALMDADGNIAWSGEYDEWGNQLNEENPHHLHQPYRLPGQQYDKESGLYYNRNRYYDPLQ 61 L+D GN+AW WG + + + P+R PGQ D E+GL+YN +RYYDP Sbjct: 1276 TELLDGSGNLAWRNRTTLWGKTVIKHHGSA-STPWRFPGQYSDPETGLHYNYHRYYDPDT 1334 Query: 62 GRYITQDPIGLEGGWSLYAYPLNPVNGIDPLGLSPADVALIRRKDQLNHQRAWDILSDTY 121 GRY++ DP+GL + YAY +NP+ +DPLGL + R +D +++T Sbjct: 1335 GRYVSCDPLGLRPSPNHYAYVVNPLRWLDPLGLMSCGDDDAVTLYRNVDGREFDAIAET- 1393 Query: 122 EDMKRLNLGGTDQFFHCMA 140 R GG A Sbjct: 1394 ---GRFESGGGSSEGKWFA 1409 >UniRef50_Q399U9 Rhs family protein n=3 Tax=Proteobacteria RepID=Q399U9_BURS3 Length = 1446 Score = 139 bits (349), Expect = 9e-32, Method: Composition-based stats. Identities = 51/156 (32%), Positives = 69/156 (44%), Gaps = 24/156 (15%) Query: 3 ALMDADGNIAWSGEYDEWGNQLNEE-----------------------NPHHLHQPYRLP 39 L +A+G + W Y WGN + EE Q R Sbjct: 1200 ELTNAEGELIWQARYKVWGNAVQEEWIARTSQQSVPEWGEVQLASATPAHVPRPQNLRFQ 1259 Query: 40 GQQYDKESGLYYNRNRYYDPLQGRYITQDPIGLEGGWSLYAYPLNPVNGIDPLGLS-PAD 98 GQ D+E+GL+YN R+YDP GR+I DPIGL GG +LYAY +P+ IDP G + Sbjct: 1260 GQYLDRETGLHYNTFRFYDPDIGRFINPDPIGLSGGHNLYAYAESPLIWIDPWGWCRRGN 1319 Query: 99 VALIRRKDQLNHQRAWDILSDTYEDMKRLNLGGTDQ 134 A D + Q D +D + R L G ++ Sbjct: 1320 QATKNHMDGVRDQYLADNPTDRHVAGGRNALTGGER 1355 >UniRef50_A1TJG7 YD repeat protein n=4 Tax=Proteobacteria RepID=A1TJG7_ACIAC Length = 1604 Score = 139 bits (349), Expect = 9e-32, Method: Composition-based stats. Identities = 43/157 (27%), Positives = 65/157 (41%), Gaps = 24/157 (15%) Query: 3 ALMDADGNIAWSGEYDEWGNQLNEE------------------------NPHHLHQPYRL 38 + D +G++ W +Y WGN + EE + Q R+ Sbjct: 1373 EMSDRNGHLVWRAQYRLWGNAVAEEWQAFDATGRPVNAPMAETGIRAQVSASPAPQNLRM 1432 Query: 39 PGQQYDKESGLYYNRNRYYDPLQGRYITQDPIGLEGGWSLYAYPLNPVNGIDPLGLSPAD 98 GQ D+E+GL+YN RYYDP G + T DPIGL GG +L+ Y NP++ IDP G + Sbjct: 1433 QGQYLDRETGLHYNTFRYYDPDLGAFTTPDPIGLAGGLNLHGYAANPLSWIDPWGWACIP 1492 Query: 99 VALIRRKDQLNHQRAWDILSDTYEDMKRLNLGGTDQF 135 + + D ++ L + Sbjct: 1493 NKVAGTAREARVGAKLDGKFGASNVLRERYLRDANGK 1529 >UniRef50_Q8GDM7 Rhs n=3 Tax=Photorhabdus RepID=Q8GDM7_PHOLU Length = 1469 Score = 138 bits (348), Expect = 1e-31, Method: Composition-based stats. Identities = 59/206 (28%), Positives = 84/206 (40%), Gaps = 19/206 (9%) Query: 2 LALMDADGNIAWSG-EYDEWGNQLNEENPHHLHQP-YRLPGQQYDKESGLYYNRNRYYDP 59 LAL D G W + +G +L+ + P R GQ +D+ESGL+YNR RYY P Sbjct: 1245 LALFDPTGKRVWRRPKQSLYGLRLSGHGENPQLDPGLRFAGQLFDEESGLFYNRFRYYLP 1304 Query: 60 LQGRYITQDPIGLEGGWSLYAYPLNPVNGIDPLGLSPADVALIRRKDQLNHQRAWDILSD 119 Y++ DP+GL GG + YAY NP N IDPLGL+ D ++ W Sbjct: 1305 EAACYLSPDPLGLNGGPNPYAYVHNPANWIDPLGLAGCPT------DYSQKRKYWSSDPI 1358 Query: 120 TYEDMKRLNLGGTDQFFHCMAFCRVSKLNDAGVSRSAKGLGYEKEIRDYGLNLFGMYGRK 179 T++ K H A+ K+ G E G G G+ Sbjct: 1359 TFKGNKVYQRNDLFDPQHMSAWRDQGKV----------IRGTNIERMASGRAPVGHDGKA 1408 Query: 180 VKLSH-SEMIEDNKKDLAVNDHGLTC 204 V L H + +++ H + Sbjct: 1409 VNLHHMLQTQNGPIAEMSQTFHKVNH 1434 >UniRef50_B4V6T6 Rhs protein n=2 Tax=Streptomyces RepID=B4V6T6_9ACTO Length = 1263 Score = 138 bits (348), Expect = 1e-31, Method: Composition-based stats. Identities = 41/98 (41%), Positives = 53/98 (54%), Gaps = 1/98 (1%) Query: 2 LALMDADGNIAWSGEYDEWGNQLNEENPHHLHQPYRLPGQQYDKESGLYYNRNRYYDPLQ 61 L+D G IAW WG + + + P R PGQ +D ESGL+YN RYYDP Sbjct: 1031 TELVDEAGAIAWRTRSTLWGATTWNADANA-YTPLRFPGQYFDPESGLHYNCFRYYDPET 1089 Query: 62 GRYITQDPIGLEGGWSLYAYPLNPVNGIDPLGLSPADV 99 RY++ DP+GL + AY NP + DPLGL+P Sbjct: 1090 ARYLSVDPLGLGPAPNPVAYVSNPHSWSDPLGLTPCPP 1127 >UniRef50_C9Y462 Putative uncharacterized protein n=1 Tax=Cronobacter turicensis RepID=C9Y462_CROTZ Length = 252 Score = 138 bits (348), Expect = 1e-31, Method: Composition-based stats. Identities = 50/189 (26%), Positives = 74/189 (39%), Gaps = 10/189 (5%) Query: 4 LMDADGNIAWSGEYDEWGNQLNEENPHHLH--QPYRLPGQQYDKESGLYYNRNRYYDPLQ 61 + DADG W G++ WG E + Q R GQ D+ESGL+YN RYYDP+ Sbjct: 27 VTDADGQTVWRGQFSTWGETERELSVPQWQVPQNLRFQGQYLDRESGLHYNLFRYYDPVA 86 Query: 62 GRYITQDPIGLEGGWSLYAYPLNPVNGIDPLGLSPADVALIRRKDQLNHQRAWDILSDTY 121 GRY DPIGL GG + YAY +P+ +DPLGL+ + + + + D Sbjct: 87 GRYTQMDPIGLAGGINTYAYVGDPLTWVDPLGLAVDPLVKLEERGYSGVVKTPGGGLDYS 146 Query: 122 EDMKRLNLGGTDQFFHCMAFCRVSKLNDAGVSRSAKGLGYEKEIRDYGLNLFGMYGRKVK 181 N + K+ +G + + G + Sbjct: 147 NSNALYN--------KKPGVNPIVKIEYSGDYAIDFERANYEAGLNQKTTPSGYVWHHLD 198 Query: 182 LSHSEMIED 190 +E + Sbjct: 199 DYDAETNKG 207 >UniRef50_Q1I7Q5 Putative uncharacterized protein n=11 Tax=Pseudomonas RepID=Q1I7Q5_PSEE4 Length = 1595 Score = 138 bits (347), Expect = 2e-31, Method: Composition-based stats. Identities = 39/102 (38%), Positives = 56/102 (54%), Gaps = 5/102 (4%) Query: 2 LALMDADGNIAWSGEYDEWGNQLNEENPH----HLHQPYRLPGQQYDKESGLYYNRNRYY 57 + L D +G++ W+ Y WG+ + P R GQ D+E+GL+YNR RYY Sbjct: 1338 MELTDEEGHVVWAAHYKAWGDLAELPGSSVAMSNARNPIRFQGQYQDQETGLHYNRFRYY 1397 Query: 58 DPLQGRYITQDPIGLEGGWSLYAYP-LNPVNGIDPLGLSPAD 98 DP RY+++DPIG GG + Y Y +PV DP+GL + Sbjct: 1398 DPKSARYVSKDPIGFMGGANAYTYTGGSPVTATDPMGLKSWE 1439 >UniRef50_A0LJM9 YD repeat protein n=3 Tax=Syntrophobacter fumaroxidans MPOB RepID=A0LJM9_SYNFM Length = 1433 Score = 138 bits (347), Expect = 2e-31, Method: Composition-based stats. Identities = 40/93 (43%), Positives = 52/93 (55%), Gaps = 2/93 (2%) Query: 4 LMDADGNIAWSGEYDEWGNQLNEENPHHLHQPYRLPGQQYDKESGLYYNRNRYYDPLQGR 63 + D+ W YD +G + +R PGQ YD E+GL+YN +RYYDP GR Sbjct: 1213 MTDSTNTAVWEAAYDAFGEATIHPAS-TVVNNFRFPGQYYDAETGLHYNWHRYYDPKTGR 1271 Query: 64 YITQDPIGLEGGWSLYAYP-LNPVNGIDPLGLS 95 Y+T DPIGL GG + Y Y +P+N ID GL Sbjct: 1272 YMTPDPIGLAGGINPYTYAENDPINFIDLYGLW 1304 >UniRef50_B1KGR6 YD repeat protein n=1 Tax=Shewanella woodyi ATCC 51908 RepID=B1KGR6_SHEWM Length = 1699 Score = 138 bits (347), Expect = 2e-31, Method: Composition-based stats. Identities = 46/151 (30%), Positives = 65/151 (43%), Gaps = 16/151 (10%) Query: 3 ALMDADGNIAWSGEYDEWGNQ----------------LNEENPHHLHQPYRLPGQQYDKE 46 L +G+I W GE WG L + ++ R GQ +D E Sbjct: 1460 ELCTENGDIEWRGEQSLWGEHHKWRLSVQAKSKHKKYLEDAANDPVNCDLRYQGQVFDSE 1519 Query: 47 SGLYYNRNRYYDPLQGRYITQDPIGLEGGWSLYAYPLNPVNGIDPLGLSPADVALIRRKD 106 +GLYYNR+RYYDP +Y++ DPIG+ GG AY NP+ +DP GL+ A + Sbjct: 1520 TGLYYNRHRYYDPETCQYLSPDPIGMAGGLRTQAYVHNPMEWVDPFGLAACPTAEAPQTH 1579 Query: 107 QLNHQRAWDILSDTYEDMKRLNLGGTDQFFH 137 Q+N + TY T + Sbjct: 1580 QINSAVDAPEGAGTYSFRDNRGNQYTGSTNN 1610 >UniRef50_C9NF93 YD repeat protein n=1 Tax=Streptomyces flavogriseus ATCC 33331 RepID=C9NF93_9ACTO Length = 1536 Score = 137 bits (346), Expect = 3e-31, Method: Composition-based stats. Identities = 48/130 (36%), Positives = 65/130 (50%), Gaps = 3/130 (2%) Query: 2 LALMDADGNIAWSGEYDEWGNQLNEENPHHLHQPYRLPGQQYDKESGLYYNRNRYYDPLQ 61 L+D G I+W WG + P R PGQ +D+ES L+YN RYYDP Sbjct: 1337 TELIDESGFISWRVRRSVWG-TTEWAKDSSAYTPLRFPGQYFDQESLLHYNYLRYYDPDV 1395 Query: 62 GRYITQDPIGLEGGWSLYAYPLNPVNGIDPLGLSPADVALIRRKDQLNHQRAWDILSDTY 121 RYI+ DPIGLEGG + + Y NP IDPLGLS + R + N + W +++ + Sbjct: 1396 SRYISPDPIGLEGGPNPHWYGPNPYTWIDPLGLSLC--RVKPRLEDGNTKEGWQHINERH 1453 Query: 122 EDMKRLNLGG 131 + G Sbjct: 1454 ISGTAAHGHG 1463 >UniRef50_C9Y441 Putative uncharacterized protein n=1 Tax=Cronobacter turicensis RepID=C9Y441_CROTZ Length = 1394 Score = 137 bits (345), Expect = 3e-31, Method: Composition-based stats. Identities = 40/97 (41%), Positives = 54/97 (55%), Gaps = 2/97 (2%) Query: 4 LMDADGNIAWSGEYDEWGNQLNEENP--HHLHQPYRLPGQQYDKESGLYYNRNRYYDPLQ 61 L D +G + W G WG +E P +Q R+ GQ D+E+GL+YN RYYDP Sbjct: 1156 LTDVEGRVRWEGRNSAWGKLAHESTPLPTGYNQNLRMQGQYLDRETGLHYNLFRYYDPDC 1215 Query: 62 GRYITQDPIGLEGGWSLYAYPLNPVNGIDPLGLSPAD 98 GR+ DPIGL GG +LY + N + +DP GL+ Sbjct: 1216 GRFTQHDPIGLAGGINLYQFAPNALGWVDPWGLNKTT 1252 >UniRef50_B7GLX9 Rhs family protein n=1 Tax=Anoxybacillus flavithermus WK1 RepID=B7GLX9_ANOFW Length = 295 Score = 137 bits (344), Expect = 3e-31, Method: Composition-based stats. Identities = 45/154 (29%), Positives = 66/154 (42%), Gaps = 6/154 (3%) Query: 1 MLALMDADGNIAWSGEYDEWGNQLNEENPHHLHQPYRLPGQQYDKESGLYYNRNRYYDPL 60 ++AL D GNI +YD WGN L++ PYR G QYD E+GLYY RYY P Sbjct: 71 VIALTDEQGNIVARYQYDAWGNILSQSGDLEDENPYRYAGYQYDNETGLYYLIARYYYPE 130 Query: 61 QGRYITQDPI-GLEGGW---SLYAYP-LNPVNGIDPLGLSPADVALIRRKDQLNHQRAWD 115 G +++ DP G + YAY NPV DP G +P + L+ ++ ++ Sbjct: 131 HGVFLSLDPDPGDADDILTQNGYAYANNNPVMLTDPDGENPYIIILVSVGGRIVAKKVAK 190 Query: 116 ILSDTYEDMKRLNLGGTDQFFHCMAFCRVSKLND 149 L + + + R + Sbjct: 191 SLIKKAVK-RTVKKAAKRYTKSNLKLGRQMHKSY 223 >UniRef50_C6M5B1 Rhs-related protein n=7 Tax=Proteobacteria RepID=C6M5B1_NEISI Length = 1934 Score = 136 bits (343), Expect = 4e-31, Method: Composition-based stats. Identities = 48/121 (39%), Positives = 62/121 (51%), Gaps = 1/121 (0%) Query: 3 ALMDADGNIAWSGEYDEWGNQLNEEN-PHHLHQPYRLPGQQYDKESGLYYNRNRYYDPLQ 61 + D +GNI WSG+Y WG E ++QP+RL Q YD+E+GL+YN RYYDP Sbjct: 1733 EMTDEEGNIVWSGDYSGWGKLTQEGRLKLDVYQPFRLQNQYYDEETGLHYNFFRYYDPEI 1792 Query: 62 GRYITQDPIGLEGGWSLYAYPLNPVNGIDPLGLSPADVALIRRKDQLNHQRAWDILSDTY 121 GR+ QDPI L GG SLYA N +D LGL+ ++ TY Sbjct: 1793 GRFTQQDPIKLLGGESLYALAPNVFVWLDTLGLAKYKCIADEKEGGGTRGYKNGRKLCTY 1852 Query: 122 E 122 Sbjct: 1853 S 1853 >UniRef50_B8FCM0 YD repeat protein n=1 Tax=Desulfatibacillum alkenivorans AK-01 RepID=B8FCM0_DESAA Length = 1448 Score = 136 bits (343), Expect = 5e-31, Method: Composition-based stats. Identities = 45/194 (23%), Positives = 76/194 (39%), Gaps = 5/194 (2%) Query: 1 MLALMDADGNIAWSGEYDEWGNQLNEENPHHLHQPYRLPGQQYDKESGLYYNRNRYYDPL 60 + AL++A GN YD +GN L+ QP+R ++YD+E+GLYY R+Y P+ Sbjct: 1185 VTALLNATGNACAWYAYDPYGNLLHNTG--APVQPFRFSTKEYDEETGLYYFGRRFYSPV 1242 Query: 61 QGRYITQDPIGLEGGWSLYAYP-LNPVNGIDPLGLSPADVAL--IRRKDQLNHQRAWDIL 117 R++T+DP G +LY + NP+ DPLG A++A R A Sbjct: 1243 MARWLTRDPKGEGASLNLYEFSRSNPLAYFDPLGAQDAELAAMDARWAQARAAMEASSQS 1302 Query: 118 SDTYEDMKRLNLGGTDQFFHCMAFCRVSKLNDAGVSRSAKGLGYEKEIRDYGLNLFGMYG 177 + T D ++ + + + + + G G Sbjct: 1303 TATASDSGGFGEMMSNTVGKAFKWVGDIIQGSPDAAGTVVDKVVDTALESNKFTKMGKTG 1362 Query: 178 RKVKLSHSEMIEDN 191 ++ +D Sbjct: 1363 YDAIQKGFDVAQDA 1376 >UniRef50_Q7N2G0 Complete genome; segment 11/17 n=4 Tax=Gammaproteobacteria RepID=Q7N2G0_PHOLL Length = 1498 Score = 136 bits (343), Expect = 5e-31, Method: Composition-based stats. Identities = 47/195 (24%), Positives = 77/195 (39%), Gaps = 18/195 (9%) Query: 1 MLALMDADGNIAWSGEYDEWGN-----QLNEENPHHLHQPYRLPGQQYDKESGLYYNRNR 55 + ++ G + W+G WG L +P +L +R GQ D+ESGL+YNR+R Sbjct: 1286 VREILTEAGELIWAGRLLTWGEPECWPVLTVNDPRNLTCNFRFAGQYEDRESGLFYNRHR 1345 Query: 56 YYDPLQGRYITQDPIGLEGGWSLYAYPLNPVNGIDPLGLSPADVALIRRKDQLNHQRAWD 115 YY+ G+Y++ DP+ L GG + Y Y +PVN IDP GL+ + QR Sbjct: 1346 YYESDTGQYLSPDPLNLSGGVNPYGYVHDPVNWIDPFGLAACPTQKYEVSTFDDLQRRSK 1405 Query: 116 ILSDTYEDMKRLNLGGTDQFFHCMAFCRVSKLNDAGVSRSAKGLGYEKEIRDYGLNLFGM 175 + H ++ + A + + + G Sbjct: 1406 VGDKL-------------DIHHTAQKHPAGQVITGYDPKVAPSIALPRGEHKLIPTMKGP 1452 Query: 176 YGRKVKLSHSEMIED 190 Y + ++ I D Sbjct: 1453 YTGSARDLLAKDIRD 1467 >UniRef50_C5ALM7 YD repeat protein n=19 Tax=Proteobacteria RepID=C5ALM7_BURGB Length = 1425 Score = 136 bits (343), Expect = 5e-31, Method: Composition-based stats. Identities = 43/129 (33%), Positives = 61/129 (47%), Gaps = 4/129 (3%) Query: 5 MDADGNIAWSGEYDEWGNQLNEE-NPHHLHQPYRLPGQQYDKESGLYYNRNRYYDPLQGR 63 + G+I W+G+Y WG + P + QP R GQ D + L+YN R+YDP GR Sbjct: 1196 TNEAGDIVWAGQYSAWGKVAPNQHAPARIDQPLRYAGQYADDSTELHYNTFRFYDPDVGR 1255 Query: 64 YITQDPIGLEGGWSLYAYPLNPVNGIDPLGLSPADVALIRRKDQLNHQRAWDILSDTYED 123 +I QDPIGL GG +LY Y N + D GL+ + Q++ + T Sbjct: 1256 FINQDPIGLMGGLNLYQYAPNSIAWTDWWGLAG---SYTLGSYQISAPQLPAYNGQTVGT 1312 Query: 124 MKRLNLGGT 132 +N G Sbjct: 1313 FYYVNGAGG 1321 >UniRef50_C6CNW6 YD repeat protein n=8 Tax=Enterobacteriaceae RepID=C6CNW6_DICZE Length = 1679 Score = 136 bits (342), Expect = 6e-31, Method: Composition-based stats. Identities = 51/201 (25%), Positives = 78/201 (38%), Gaps = 27/201 (13%) Query: 3 ALMDADGNIAWSGEYDEWGNQLNE------------ENPHHLHQPYRLPGQQYDKESGLY 50 L G + W GE WG E ++ R GQ YD E+GLY Sbjct: 1460 ELCSETGEVHWRGEQALWGAHREEKIPIPLRRWLGDAANEEVYCELRYQGQVYDSETGLY 1519 Query: 51 YNRNRYYDPLQGRYITQDPIGLEGGWSLYAYPLNPVNGIDPLGLSPADVALIRRKDQLNH 110 YNR+RYYDP +Y++ DP+G+ GG Y NP+ +DP GL V Sbjct: 1520 YNRHRYYDPETAQYLSGDPLGIAGGLRPQGYVHNPMEWVDPFGLVGCPVKK--------- 1570 Query: 111 QRAWDILSDTYEDMKRLNLGGTD-QFFHCMAFCRVSKLNDAGVSRSAKGLGYEKEIRDYG 169 TY D+K ++ G + H M ++ ++A + K Sbjct: 1571 -----YEVSTYNDLKNRSVSGDELDIHHAMQKHPAGQVVPGYDPKTAPSIAIPKVEHQEI 1625 Query: 170 LNLFGMYGRKVKLSHSEMIED 190 + G Y + ++ ++D Sbjct: 1626 PTMKGPYTGSARDLLAKDVKD 1646 >UniRef50_UPI0001B56FBA YD repeat-containing protein n=1 Tax=Streptomyces sp. AA4 RepID=UPI0001B56FBA Length = 1624 Score = 136 bits (342), Expect = 7e-31, Method: Composition-based stats. Identities = 45/155 (29%), Positives = 65/155 (41%), Gaps = 4/155 (2%) Query: 2 LALMDADGNIAWSGEYDEWGNQLNEENPHHLHQPYRLPGQQYDKESGLYYNRNRYYDPLQ 61 L+D G IAW +G + E + P R GQ +D+E+GL+YN +RYYDP+ Sbjct: 1273 TELLDDRGEIAWQARSSLFGVVVAESGGTGI--PLRFQGQYFDEETGLHYNFHRYYDPVL 1330 Query: 62 GRYITQDPIGLEGGWSLYAYPLNPVNGIDPLGLSPADVALIRRKDQLNHQRAWDILSDTY 121 RY++ DP+GL GG +AY NP +DPLGL R + Sbjct: 1331 ARYLSPDPLGLGGGLDPHAYVSNPHVSVDPLGLVGGSCNTSRGLGGGQGPGGGSQRGRSR 1390 Query: 122 EDMKRLNLGGTDQFFHCMAFCRVSKLNDAGVSRSA 156 G + RV + ++ A Sbjct: 1391 LRSPSPGPNGERRVRS--PGGRVYQEGPPTIAAGA 1423 >UniRef50_C6AKX3 Rhs family protein n=4 Tax=Aggregatibacter aphrophilus NJ8700 RepID=C6AKX3_AGGAN Length = 1917 Score = 135 bits (341), Expect = 8e-31, Method: Composition-based stats. Identities = 58/199 (29%), Positives = 88/199 (44%), Gaps = 12/199 (6%) Query: 3 ALMDADGNIAWSGEYDEWGNQLNEENPHHL---HQPYRLPGQQYDKESGLYYNRNRYYDP 59 + D+DG + W G YD WG+ + + HQP+RL Q +D+E+GL+YN RYY+P Sbjct: 1688 EMTDSDGKLIWKGRYDAWGSLIRDSYRETASDSHQPFRLQNQYFDEETGLHYNFFRYYEP 1747 Query: 60 LQGRYITQDPIGLEGGWSLYAYPLNPVNGIDPLGLSPADVALIRRKDQLNHQRAWDILSD 119 + GR+ITQDPI L GG +LY + N DPLGL AL+ + + +A I Sbjct: 1748 VLGRFITQDPIKLAGGNNLYRFEGTVQNQTDPLGLFA--PALLLAPELIALGKAALITLG 1805 Query: 120 TYEDMKRLNLGGTDQFFHCMAFCRVSKLN----DAGVSRSAKGLGYEKEIRDYGLNLF-- 173 + ++ S ++ S + K I L++ Sbjct: 1806 VLAVGAAVEETVKNRARASTQAKTESTSKCKKCPPCITTS-GRIVPPKTIGYRPLDVIPD 1864 Query: 174 GMYGRKVKLSHSEMIEDNK 192 V SH + E N+ Sbjct: 1865 DKMEHGVYGSHHNIFESNQ 1883 >UniRef50_B5S3P6 Probable rhs-related protein (Fragment) n=1 Tax=Ralstonia solanacearum RepID=B5S3P6_RALSO Length = 445 Score = 135 bits (341), Expect = 9e-31, Method: Composition-based stats. Identities = 46/168 (27%), Positives = 70/168 (41%), Gaps = 6/168 (3%) Query: 1 MLALMDADGNIAWSGEYDEWGNQLNEENPHHLHQPYRLPGQQYDKESGLYYNRNRYYDPL 60 + + DG EY +G + + + G QY SG+Y R YDP Sbjct: 99 VTDTLTPDGRAVTHTEYGPYGELVKSQGRAEYRSDFGYAGMQYHAASGMYLTLFRAYDPG 158 Query: 61 QGRYITQDPIGLEGGWSLYAYPL-NPVNGIDPLG-LSPADVALIRRKDQ----LNHQRAW 114 GR++++DPIG +GG +LYAY NP N +DP G L+ + + AW Sbjct: 159 TGRWVSRDPIGEDGGENLYAYANGNPENYVDPNGMLAIWPTNSGVILGKKYRCKYGKDAW 218 Query: 115 DILSDTYEDMKRLNLGGTDQFFHCMAFCRVSKLNDAGVSRSAKGLGYE 162 M+ LN G + + A+ VS + +A +GY Sbjct: 219 SNARSDRNKMRGLNTGLRNAEHYLYAYDSVSSGEYNAGTMTALSIGYS 266 >UniRef50_C7Q0A7 YD repeat protein n=2 Tax=Catenulispora acidiphila DSM 44928 RepID=C7Q0A7_CATAD Length = 1528 Score = 135 bits (341), Expect = 9e-31, Method: Composition-based stats. Identities = 52/200 (26%), Positives = 74/200 (37%), Gaps = 1/200 (0%) Query: 2 LALMDADGNIAWSGEYDEWGNQLNEENPHHLHQPYRLPGQQYDKESGLYYNRNRYYDPLQ 61 L+ DG + W WG + + P R PGQ +D ESGL+YN NRYYD Sbjct: 1302 TELVTPDGRVVWHTTTSLWGRTIGTSAESGVDCPLRFPGQYHDDESGLHYNLNRYYDSET 1361 Query: 62 GRYITQDPIGLEGGWSLYAYPLNPVNGIDPLGLSPADVALIRRKDQLNHQRAWDILSDTY 121 Y+T DP+GL + +AY NP+ DPLGLS L+ Sbjct: 1362 AAYLTPDPLGLVPAPNDHAYVPNPLTVSDPLGLSYEGPNGQTMYPNNMPGTLDTELAQAD 1421 Query: 122 EDMKRLNLGGTDQFFHCMAFCRVSKLNDAGVSRSAKGLGYEKEIRDYGLNLFGMYGRKVK 181 ++ GT +F +A V S + + + G + Sbjct: 1422 RLGVVVSSPGTSEFDSAIASGTVKWAVKDDGSIVVMPKFVNGQEISHSVLTRGASVQAAG 1481 Query: 182 LSHS-EMIEDNKKDLAVNDH 200 + D L +NDH Sbjct: 1482 EAEIAGSGADGYFGLDINDH 1501 >UniRef50_B4ETQ2 Rhs-family protein n=9 Tax=Enterobacteriaceae RepID=B4ETQ2_PROMH Length = 1703 Score = 135 bits (341), Expect = 9e-31, Method: Composition-based stats. Identities = 50/213 (23%), Positives = 74/213 (34%), Gaps = 19/213 (8%) Query: 3 ALMDADGNIAWSGEYDEWGNQ--------LNEENPHHLHQPYRLPGQQYDKESGLYYNRN 54 + G +W+G + WG E +P++ P+R GQ D+ESGLYYNR Sbjct: 1441 EIFSEGGQASWAGRLNTWGQMQFWRYRDGKAENDPNYTECPFRFAGQYEDEESGLYYNRF 1500 Query: 55 RYYDPLQGRYITQDPIGLEGGWSLYAYPLNPVNGIDPLGLSPADVALIRR-------KDQ 107 RYYD G+Y++ DPIGL GG + Y Y P +DP GL+ D Sbjct: 1501 RYYDRETGQYLSPDPIGLLGGLNPYGYVHCPTGWVDPFGLAGGKGNKGAPVTSSFINDDI 1560 Query: 108 LNHQRAWDILSDTYEDMKRLNLGGTDQFFHCMAFCRVSKLNDAGVSRSAKGLGYEKEIR- 166 +NH D + + + +A Y +R Sbjct: 1561 INHSAKGDWKEASSMPPRDRKTFPNGRLSGGGHGQSAILELEARGILYNIEHTYPNGVRV 1620 Query: 167 ---DYGLNLFGMYGRKVKLSHSEMIEDNKKDLA 196 + G + + KD Sbjct: 1621 GNIPSHASKAKRSGTAQSWFPENWSDADIKDAG 1653 >UniRef50_D1PWX0 YD repeat protein n=1 Tax=Prevotella bergensis DSM 17361 RepID=D1PWX0_9BACT Length = 318 Score = 135 bits (340), Expect = 1e-30, Method: Composition-based stats. Identities = 44/208 (21%), Positives = 82/208 (39%), Gaps = 7/208 (3%) Query: 4 LMDADGNIAWSGEYDEWGNQLNEENPHHLHQPYRLPGQQYDKESGLYYNRNRYYDPLQGR 63 + + DG + EY +G EE + + Y +++D+E+G+YY RY++P Sbjct: 16 ITNLDGEVMQHIEYVPYGEVFIEERNNTWNTAYLFNAKEFDEETGMYYYGARYHEPRLSL 75 Query: 64 YITQDPIGLEGGW-SLYAYP-LNPVNGIDPLGLSPADVALIRRKDQLNHQRAWDILSDTY 121 +++ DPI W S YAY NP+ IDP G ++ + + H + + Sbjct: 76 WMSVDPIANLKNWISPYAYTKDNPLRYIDPDGQDEWEINK--QGKIVKHIKTDKHDAFYI 133 Query: 122 EDMKRLNLGGTDQFFHCMAFCRVSKLNDAGVSRSAKGLGYEKEIRDYGLNLFGMYGRKVK 181 + + + G + V K R Y+ G +LF + + Sbjct: 134 VNYRGKRIRGKSI---SFTYGTVEKATGQKNDRGLLYDVYQVRGDKQGTSLFEYFSKNTS 190 Query: 182 LSHSEMIEDNKKDLAVNDHGLTCPSTTD 209 + S+M K +N + +T+ Sbjct: 191 VEWSQMQTGFKGKRGLNYITTSHQHSTE 218 >UniRef50_B5I7B2 Rhs repeat protein n=1 Tax=Streptomyces sviceus ATCC 29083 RepID=B5I7B2_9ACTO Length = 1249 Score = 135 bits (339), Expect = 1e-30, Method: Composition-based stats. Identities = 42/104 (40%), Positives = 51/104 (49%), Gaps = 1/104 (0%) Query: 2 LALMDADGNIAWSGEYDEWGNQLNEENPHHLHQPYRLPGQQYDKESGLYYNRNRYYDPLQ 61 L+ DG IAW WG+ + P R PGQ +D E+GL+YN NRYYDP Sbjct: 998 TDLIAPDGTIAWHSRSTAWGST-QSHRDATAYTPLRYPGQYFDPETGLHYNLNRYYDPEL 1056 Query: 62 GRYITQDPIGLEGGWSLYAYPLNPVNGIDPLGLSPADVALIRRK 105 GRY T DP+GL + Y Y NP DPLGL+ Sbjct: 1057 GRYTTPDPLGLAPAVNHYTYVPNPFTLADPLGLAGCTADPTWGG 1100 >UniRef50_UPI00019F181C rhsC element core protein RshC n=2 Tax=Enterobacteriaceae RepID=UPI00019F181C Length = 260 Score = 135 bits (339), Expect = 2e-30, Method: Composition-based stats. Identities = 69/169 (40%), Positives = 84/169 (49%), Gaps = 9/169 (5%) Query: 8 DGNIAWSGEYDEWGNQLNEENPHHLHQPYRLPGQQYDKESGLYYNRNRYYDPLQGRYITQ 67 DG W EYDEWGN LNE+NP +L Q RLPGQQYD ES LYYNR+RYY+P QGRYITQ Sbjct: 24 DGRTGWRAEYDEWGNLLNEDNPQNLQQLIRLPGQQYDDESELYYNRHRYYNPEQGRYITQ 83 Query: 68 DPIGLEGGWSLYAYPLNPVNGIDPLG---------LSPADVALIRRKDQLNHQRAWDILS 118 DPIG++GG + YAYPLNPV +DPLG L + + + Sbjct: 84 DPIGMKGGLNSYAYPLNPVESVDPLGSETLKCIKPLHSIGGIGEKSGPDIWGNPLYHQYL 143 Query: 119 DTYEDMKRLNLGGTDQFFHCMAFCRVSKLNDAGVSRSAKGLGYEKEIRD 167 GG DQ + + +R G E + Sbjct: 144 CVSNGKNGYICGGQDQRGELSNDGILGPGKASQDTREGAGRCDSVESNN 192 >UniRef50_B3X3P2 RhsH n=1 Tax=Shigella dysenteriae 1012 RepID=B3X3P2_SHIDY Length = 263 Score = 134 bits (338), Expect = 2e-30, Method: Composition-based stats. Identities = 78/165 (47%), Positives = 94/165 (56%), Gaps = 6/165 (3%) Query: 2 LALMDADGNIAWSGEYDEWGNQLNEENPHHLHQPYRLPGQQYDKESGLYYNRNRYYDPLQ 61 L L+ DG W EYDEWGN LNEENP HL Q RLPGQQYD+ESGLYYNR+RYYDPLQ Sbjct: 23 LTLISPDGATEWCAEYDEWGNLLNEENPQHLQQLIRLPGQQYDEESGLYYNRHRYYDPLQ 82 Query: 62 GRYITQDPIGLEGGWSLYAYPL-NPVNGIDPLGLSPADVALIRRK-----DQLNHQRAWD 115 GRYITQDPIGLEGGW+ Y Y +P IDPLGL + R+ L + Sbjct: 83 GRYITQDPIGLEGGWNQYVYASIHPTYSIDPLGLIDKPAPVFNRELNSDPYYLAVNNCYS 142 Query: 116 ILSDTYEDMKRLNLGGTDQFFHCMAFCRVSKLNDAGVSRSAKGLG 160 + Y ++ GG ++ SKL + ++K G Sbjct: 143 YALNRYGNLGSRIFGGGGLQPGELSGKEFSKLTCGSIFEASKNDG 187 >UniRef50_C6M9F4 Rhs family protein n=8 Tax=Neisseria RepID=C6M9F4_NEISI Length = 632 Score = 134 bits (338), Expect = 2e-30, Method: Composition-based stats. Identities = 46/131 (35%), Positives = 63/131 (48%), Gaps = 10/131 (7%) Query: 3 ALMDADGNIAWSGEYDEWGNQLNEEN-PHHLHQPYRLPGQQYDKESGLYYNRNRYYDPLQ 61 + D DGN+ W G Y WG E +QP+RL Q D+E+GL+YN RYY+P Sbjct: 430 EMTDKDGNLLWFGNYTGWGRLKEETRVTDSAYQPFRLQNQYADRETGLHYNFFRYYEPDA 489 Query: 62 GRYITQDPIGLEGGWSLYAYPLNPVNGIDPLGLSPADVALIRRKDQLNHQRAWDILSDTY 121 GR++ QDPIGL GG + Y + N IDPLGL K + W+ Sbjct: 490 GRFVNQDPIGLLGGANPYQFASNITEWIDPLGL---------VKSSQSGGLNWNWKGVGD 540 Query: 122 EDMKRLNLGGT 132 ++ G+ Sbjct: 541 RQAHVIDNHGS 551 >UniRef50_B6A882 YenC2 n=1 Tax=Yersinia sp. MH-1 RepID=B6A882_9ENTR Length = 970 Score = 134 bits (338), Expect = 2e-30, Method: Composition-based stats. Identities = 48/210 (22%), Positives = 77/210 (36%), Gaps = 7/210 (3%) Query: 5 MDADGNIAWSGEYDEWGNQLNEENPHHLHQPY---RLPGQQYDKESGLYYNRNRYYDPLQ 61 +D DG + EY +G H+ Y R G++ D +GLYY RYY P Sbjct: 598 VDGDGLVISMEEYYPYGGTAVWAARSHIETAYKTVRYSGKERDA-TGLYYYGFRYYQPWA 656 Query: 62 GRYITQDPIGLEGGWSLYAYP-LNPVNGIDPLGLSPADVALIRRKDQLNHQRAWDILSDT 120 GR+++ DP G G +LY NP+ DP G++P D + + + Sbjct: 657 GRWLSADPAGTVDGLNLYRMVRNNPLRLTDPDGMAPLDWLDLDTTNASRDIVKAIYQLNQ 716 Query: 121 YEDMKRLNLGGTDQFFHCMAFCRVSKLNDAGVSRSAKGLGYEKEIRD--YGLNLFGMYGR 178 + R + LN+ V + K EK+ R + + Y Sbjct: 717 IDGPHRGVRDTYQRMTESTGMILQETLNNEAVLKGIKQKDKEKKSRGMKFTNSKLKTYAA 776 Query: 179 KVKLSHSEMIEDNKKDLAVNDHGLTCPSTT 208 + ++ + KD +N G T Sbjct: 777 HAGVLNTLQPDPVYKDGFLNLPGSLGNKNT 806 >UniRef50_Q1K295 YD repeat n=4 Tax=Desulfuromonas acetoxidans DSM 684 RepID=Q1K295_DESAC Length = 1468 Score = 134 bits (337), Expect = 3e-30, Method: Composition-based stats. Identities = 46/97 (47%), Positives = 58/97 (59%), Gaps = 5/97 (5%) Query: 2 LALMDADGNIAWSGEYDEWGNQL----NEENPHHLHQPYRLPGQQYDKESGLYYNRNRYY 57 + L DA G WS +Y +G + + + R PGQ +D ESGL+YN +RYY Sbjct: 1284 ILLTDATGTAVWSAQYAPFGQATINNDVDGDGTEVVCNLRFPGQYFDAESGLHYNWHRYY 1343 Query: 58 DPLQGRYITQDPIGLEGGWSLYAYPL-NPVNGIDPLG 93 +P GRYIT DPIGL GG +LYAY NPVN +DP G Sbjct: 1344 EPRSGRYITLDPIGLAGGINLYAYANRNPVNVVDPTG 1380 >UniRef50_C6CI62 YD repeat protein n=7 Tax=Enterobacteriaceae RepID=C6CI62_DICZE Length = 1475 Score = 134 bits (337), Expect = 3e-30, Method: Composition-based stats. Identities = 54/201 (26%), Positives = 73/201 (36%), Gaps = 19/201 (9%) Query: 3 ALMDADGNIAWSG-EYDEWGNQLNEENPHHLHQPYRLPGQQYDKESGLYYNRNRYYDPLQ 61 AL DG + W WG + E GQ D ESGL YNR RYYDP Sbjct: 1226 ALFTPDGTLRWQAPTATLWGQRQAE-KSESPDPGLAFAGQLRDSESGLCYNRFRYYDPAG 1284 Query: 62 GRYITQDPIGLEGGWSLYAYPLNPVNGIDPLGLSPAD--------------VALIRRKDQ 107 G Y++ DPIG+ GG + Y Y NP+ +DP GL+ Sbjct: 1285 GCYVSPDPIGIAGGENNYGYVQNPMCWVDPFGLAGCSSYLNAWGGRNAKAFKNFWNNSSH 1344 Query: 108 LNHQRAWDILSDTYEDMKRLNLGGTDQFFHCMAFCRVSKLN---DAGVSRSAKGLGYEKE 164 + +AW+ + RL GG + VS+ + G S Sbjct: 1345 ADFMKAWNNPAFKKNIKARLRGGGLTNSGGYHEWLPVSQGDKFKKMGFSFEEYMSLTSPT 1404 Query: 165 IRDYGLNLFGMYGRKVKLSHS 185 + L+ FG YG + Sbjct: 1405 NKVGFLDKFGNYGSHTSYTGG 1425 >UniRef50_B8FBZ2 YD repeat protein n=1 Tax=Desulfatibacillum alkenivorans AK-01 RepID=B8FBZ2_DESAA Length = 685 Score = 134 bits (336), Expect = 4e-30, Method: Composition-based stats. Identities = 52/181 (28%), Positives = 75/181 (41%), Gaps = 2/181 (1%) Query: 3 ALMDADGNIAWSGEYDEWGNQLNEENPHHLHQPYRLPGQQYDKESGLYYNRNRYYDPLQG 62 L+ A+G+ AWS Y +G + + + R PGQ YD E+GL+YN NRYYDP G Sbjct: 252 VLVRANGSTAWSATYSAYGKASVDPDSD-VENNLRFPGQYYDAETGLHYNLNRYYDPEIG 310 Query: 63 RYITQDPIGLEGGWSLYAY-PLNPVNGIDPLGLSPADVALIRRKDQLNHQRAWDILSDTY 121 Y + DP+GL GG +LYAY NP N +DPLGL + + + D + Sbjct: 311 AYRSPDPLGLGGGVNLYAYTAGNPANYVDPLGLLFSSMDNPTGYLVMETILLGDKILSVV 370 Query: 122 EDMKRLNLGGTDQFFHCMAFCRVSKLNDAGVSRSAKGLGYEKEIRDYGLNLFGMYGRKVK 181 E +K + C+ K + G + G + Sbjct: 371 ETVKTIKEAIDVVKDPCLTDEEKYDYFKGVAGDLIIDALIGKALGKLGDIMGGAGKKGAD 430 Query: 182 L 182 Sbjct: 431 Y 431 >UniRef50_UPI0001B4DD67 Rhs protein n=1 Tax=Streptomyces hygroscopicus ATCC 53653 RepID=UPI0001B4DD67 Length = 1730 Score = 133 bits (335), Expect = 4e-30, Method: Composition-based stats. Identities = 42/155 (27%), Positives = 62/155 (40%), Gaps = 9/155 (5%) Query: 4 LMDADGNIAWSGEYDEWGNQLNEENPHHLHQPYRLPGQQYDKESGLYYNRNRYYDPLQGR 63 L+D G+ AW WG + + P R PGQ +D E+G +YN +R+YDP R Sbjct: 1515 LVDDTGHTAWQSRTTLWGTTTWNTDA-AAYIPLRFPGQYHDLETGHHYNLHRHYDPETAR 1573 Query: 64 YITQDPIGLEGGWSLYAYPLNPVNGIDPLGLSPA--------DVALIRRKDQLNHQRAWD 115 Y+T DP+GL + Y NP DP GL+P + R + + + R + Sbjct: 1574 YLTPDPLGLAPAPNPTTYVHNPHVWADPDGLAPKKPSYMVPEPLPSAPRGELVRYDRVFS 1633 Query: 116 ILSDTYEDMKRLNLGGTDQFFHCMAFCRVSKLNDA 150 + + RL G A Sbjct: 1634 HRQEINLHVPRLITPGGRTLSAHAAERVAGSGPGR 1668 >UniRef50_D1JME6 Rhs family protein n=22 Tax=Bacteroides RepID=D1JME6_9BACE Length = 1494 Score = 133 bits (335), Expect = 4e-30, Method: Composition-based stats. Identities = 51/207 (24%), Positives = 84/207 (40%), Gaps = 9/207 (4%) Query: 6 DADGNIAWSGEYDEWGNQLNEENPHHLHQPYRLPGQQYDKESGLYYNRNRYYDPLQGRYI 65 D +GN WS D GN + E + P+ GQ YD+E+GL YNR RYY P G Y+ Sbjct: 1266 DTEGNEVWSRVLDMDGNVIEETGNKGMV-PFLFQGQYYDRETGLAYNRFRYYSPKMGVYV 1324 Query: 66 TQDPIGLEGG-WSLYAYPLNPVNGIDPLGLSPADVALIRRKDQLNHQRAWDILSDTYEDM 124 +QDPIGL GG +LY Y + ID GL V + + + RA D + + Sbjct: 1325 SQDPIGLGGGILNLYGYVDDTNVWIDSFGLDWNYVLINSKGNVFYSGRASDNANLSDVAR 1384 Query: 125 KRLNLGGTDQFFHCMAFCRVSKLNDAGVSRSAKGLGYEKEIRDYGLNLFGMYGRKVKLSH 184 + G D + ++ ++ A G E+ + + Sbjct: 1385 RHAGTKGADGTRFG-NGDTLVQITKPQITSYAAARGVEQ------FGIMNSSTPLLGYRS 1437 Query: 185 SEMIEDNKKDLAVNDHGLTCPSTTDCS 211 + + + ++ ++ + + S Sbjct: 1438 TNVRGNKIAGISSSNRNIEIYMSEGES 1464 >UniRef50_B4ETQ7 Putative Rhs-family protein n=3 Tax=Enterobacteriaceae RepID=B4ETQ7_PROMH Length = 380 Score = 133 bits (335), Expect = 4e-30, Method: Composition-based stats. Identities = 41/108 (37%), Positives = 56/108 (51%), Gaps = 8/108 (7%) Query: 7 ADGNIAWSGEYDEWGNQ--------LNEENPHHLHQPYRLPGQQYDKESGLYYNRNRYYD 58 G +W+G + WG E +P++ P+R GQ D+ESGLYYNR RYYD Sbjct: 171 EGGQASWAGRLNTWGQMQFWRYRDGKAENDPNYTECPFRFAGQYEDEESGLYYNRFRYYD 230 Query: 59 PLQGRYITQDPIGLEGGWSLYAYPLNPVNGIDPLGLSPADVALIRRKD 106 G+Y++ DPIGL GG + Y Y P +DP GL+ I+ Sbjct: 231 RETGQYLSPDPIGLLGGLNPYGYVHCPTGWVDPFGLACCPPPKIKATQ 278 >UniRef50_B4V251 LipX3 n=2 Tax=Streptomyces RepID=B4V251_9ACTO Length = 1253 Score = 133 bits (335), Expect = 4e-30, Method: Composition-based stats. Identities = 48/185 (25%), Positives = 72/185 (38%), Gaps = 11/185 (5%) Query: 2 LALMDADGNIAWSGEYDEWGNQLNE-ENPHHLHQPYRLPGQQYDKESGLYYNRNRYYDPL 60 L+ + G +AW WG + P R PGQ D E+GL+YN RYY+P Sbjct: 1050 TELVASGGELAWQRRTTLWGTDFPAPTDTTSADCPIRFPGQYADSETGLHYNFFRYYEPE 1109 Query: 61 QGRYITQDPIGLEGGWSLYAYPLNPVNGIDPLGLSPADVALIRRKDQLNHQRAWDILSDT 120 RYI+ DP+GLE + +AY +N + DPLGL+ Q R T Sbjct: 1110 SARYISADPLGLEPAPNHHAYVVNALGWTDPLGLAARGPKDPLDLGQGYRGRLDTWKEGT 1169 Query: 121 Y----------EDMKRLNLGGTDQFFHCMAFCRVSKLNDAGVSRSAKGLGYEKEIRDYGL 170 + + + + G+D +F+ V + KG + R + Sbjct: 1170 KGTDFEIHVYDKSGREVGIFGSDGWFNKHNTIGADVEVPPSVENALKGRAVDTMRRSGRI 1229 Query: 171 NLFGM 175 G Sbjct: 1230 GPRGT 1234 >UniRef50_A9FZA9 Rhs family protein n=1 Tax=Sorangium cellulosum 'So ce 56' RepID=A9FZA9_SORC5 Length = 2407 Score = 133 bits (335), Expect = 4e-30, Method: Composition-based stats. Identities = 38/154 (24%), Positives = 60/154 (38%), Gaps = 4/154 (2%) Query: 5 MDADGNIAWSGEYDEWGNQLNEENPHHLH---QPYRLPGQQYDKESGLYYNRNRYYDPLQ 61 +D G + EY +G + + YR G + D+E+GLYY+ RYY P Sbjct: 2005 VDGAGLVIGYEEYHPFGTTAYWSAASGIEVSQRRYRYTGTEKDEETGLYYHGARYYAPWL 2064 Query: 62 GRYITQDPIGLEGGWSLYAYP-LNPVNGIDPLGLSPADVALIRRKDQLNHQRAWDILSDT 120 GR+ + DP G G +LY Y NP+ DP G D + + D H+ + + Sbjct: 2065 GRWTSADPAGFVDGPNLYEYVRGNPIRLRDPSGRESTDQRIAQMTDVQLHRHVKALSPEA 2124 Query: 121 YEDMKRLNLGGTDQFFHCMAFCRVSKLNDAGVSR 154 + G + + GV+ Sbjct: 2125 RAEFTASATGKFQERVSVTLARGKLESIRTGVTT 2158 >UniRef50_B3JL94 Putative uncharacterized protein n=1 Tax=Bacteroides coprocola DSM 17136 RepID=B3JL94_9BACE Length = 336 Score = 133 bits (335), Expect = 5e-30, Method: Composition-based stats. Identities = 52/235 (22%), Positives = 85/235 (36%), Gaps = 14/235 (5%) Query: 4 LMDADGNIAWSGEYDEWGNQLNEENPHHLHQPYRLPGQQYDKESGLYYNRNRYYDPLQGR 63 + + DG + EY +G EE + PY +++D+E+GLYY RYYDP Sbjct: 62 ITNLDGEVVQHIEYVPFGEVFIEERNSIWNTPYLFNAKEFDEETGLYYYGARYYDPRLSL 121 Query: 64 YITQDPIGLE-GGWSLYAY-PLNPVNGIDPLGLSPADVALIRRKDQLNHQRAWDILSDTY 121 +I+ DP+ + + Y Y NPV +DP G + A+ K +R W + T Sbjct: 122 WISTDPLQEKYPHINSYCYTANNPVLFVDPDGKAIVKGAVAAFK---YAKRIWSVYKKTG 178 Query: 122 E-DMKRLNLGGTDQFFHCMAFCRVSKLNDAGVSRSAKGLGYE-------KEIRDYGLNLF 173 + L G D+F + DA + + + NL Sbjct: 179 KLTPSNLKKAGLDEFLDIAGDIQTIFTGDATTLDKVGAIADLIIGTDFNSKGKTTVSNLL 238 Query: 174 GMYGRKVKLSHSEMIEDNKKDLAVNDHGLTCPSTTDCSDRCSDYINPEHKKTIKA 228 G+ + + KK L + L T + + +P K Sbjct: 239 GLTESTLGKEKGSL-SGTKKALGIAKSKLGLDRTESLPKNKAKFGSPSRGDKRKG 292 >UniRef50_C7QG23 YD repeat protein n=2 Tax=Catenulispora acidiphila DSM 44928 RepID=C7QG23_CATAD Length = 1528 Score = 133 bits (334), Expect = 6e-30, Method: Composition-based stats. Identities = 47/176 (26%), Positives = 68/176 (38%), Gaps = 14/176 (7%) Query: 2 LALMDADGNIAWSGEYDEWGNQLNEENPHHLHQPYRLPGQQYDKESGLYYNRNRYYDPLQ 61 L+ DG+I W +G + + H P R PGQ D E+GL+YN +RYYDP + Sbjct: 1266 TELVAPDGSIDWYTTTSLYGTTIATSSDHGADCPLRFPGQFRDDETGLHYNVHRYYDPER 1325 Query: 62 GRYITQDPIGLEGGWSLYAYPLNPVNGIDPLGLSPADVALIRRKDQLNHQRAWDILSDTY 121 G Y++ DP+GL + AY NP+ DPLGL A + D Sbjct: 1326 GSYLSPDPLGLAAAPNDQAYVANPMVSADPLGLICL--------------NAAQQIKDRV 1371 Query: 122 EDMKRLNLGGTDQFFHCMAFCRVSKLNDAGVSRSAKGLGYEKEIRDYGLNLFGMYG 177 + + Q + +A R V K + + G G Sbjct: 1372 DFLHGKFEAPYGQRENTVAIIRAMDKEGNIVHVVGWSGKESKTLFGDVDDEIGKNG 1427 >UniRef50_A3UDD5 Wall associated protein n=1 Tax=Oceanicaulis alexandrii HTCC2633 RepID=A3UDD5_9RHOB Length = 1693 Score = 132 bits (333), Expect = 6e-30, Method: Composition-based stats. Identities = 44/161 (27%), Positives = 66/161 (40%), Gaps = 13/161 (8%) Query: 2 LALMDADGNIAWSGEYDEWGNQLNEENPHHLHQPYRLPGQQYDKESGLYYNRNRYYDPLQ 61 + + D+ G + + Y +G + P+R GQ+ D E+GLYY + RYYDP Sbjct: 1400 IVISDSAGAVIDTHTYSPFGQAGEGDGG----FPFRFTGQKLDPETGLYYYKARYYDPEL 1455 Query: 62 GRYITQDPIGLEGGWSLYAYPLN-PVNGIDPLGLSPADVALIRRKDQ--------LNHQR 112 GR++ DPIG +LYAY N PVN D GL + ++ LN Sbjct: 1456 GRFLQTDPIGYADQMNLYAYVGNDPVNLRDSSGLCGTRIKDENGDNREGANCSGKLNLGS 1515 Query: 113 AWDILSDTYEDMKRLNLGGTDQFFHCMAFCRVSKLNDAGVS 153 D + + +L ++ S A VS Sbjct: 1516 LSDQRGHSDPEASARSLPTIQGVSSQISARAASFEMPADVS 1556 >UniRef50_C6M9F8 RhsG core protein with extension n=1 Tax=Neisseria sicca ATCC 29256 RepID=C6M9F8_NEISI Length = 194 Score = 132 bits (333), Expect = 7e-30, Method: Composition-based stats. Identities = 40/95 (42%), Positives = 53/95 (55%), Gaps = 1/95 (1%) Query: 4 LMDADGNIAWSGEYDEWGNQLNE-ENPHHLHQPYRLPGQQYDKESGLYYNRNRYYDPLQG 62 + D DGN+ W G Y WG E + +QP+RL Q D E+GL+YN RYY+ G Sbjct: 1 MTDKDGNLLWFGNYTGWGRLKEEIKVTDSAYQPFRLQNQYADPETGLHYNFFRYYESDAG 60 Query: 63 RYITQDPIGLEGGWSLYAYPLNPVNGIDPLGLSPA 97 R++ QDPIGL GG + Y + N + D GL P Sbjct: 61 RFVNQDPIGLWGGSNSYQFAPNTLKWTDTWGLLPK 95 >UniRef50_C8W2V9 YD repeat protein n=3 Tax=Desulfotomaculum acetoxidans DSM 771 RepID=C8W2V9_DESAS Length = 2658 Score = 132 bits (333), Expect = 8e-30, Method: Composition-based stats. Identities = 50/213 (23%), Positives = 83/213 (38%), Gaps = 30/213 (14%) Query: 2 LALMDADGNIAW--SGEYDEWGNQLNEENPHHLHQPYRLPGQQYDKESGLYYNRNRYYDP 59 L++ D GN +YD WG + E+ + P+R G YD E+GLYY ++RYY P Sbjct: 2417 LSMTDDYGNTDQENRYDYDPWGTPICED--ESVKSPFRYAGYYYDTETGLYYLKSRYYSP 2474 Query: 60 LQGRYITQDPIGL-----EGGWSLYAYP-LNPVNGIDPLGLSPADVALIRRKDQLNHQRA 113 GR++T+DP +LYAY NPV+ +DP G + + Sbjct: 2475 ALGRFLTRDPHSFINHADPQTLNLYAYCGNNPVSTVDPTGHWDEEPS-----------EG 2523 Query: 114 WDILSDTYEDMKRLNLGGTDQFFHCMAFCRVSKLNDAGVSRSAKGLGYEKEIRDYGLNLF 173 + + D E+ + ++ F + + V G A + Sbjct: 2524 YVVSPDYLENTATARIINSNVFQNILMGISV------GEGLRALKFFKGAGNLKLTSDAL 2577 Query: 174 GMYGRKVKLSHSEMIE---DNKKDLAVNDHGLT 203 G K + S ++ D+ D N ++ Sbjct: 2578 KKIGTKAESSGIRAVKGTADDAWDFFRNQVNIS 2610 >UniRef50_C8W2T8 YD repeat protein n=2 Tax=Desulfotomaculum acetoxidans DSM 771 RepID=C8W2T8_DESAS Length = 2349 Score = 132 bits (332), Expect = 9e-30, Method: Composition-based stats. Identities = 55/202 (27%), Positives = 81/202 (40%), Gaps = 26/202 (12%) Query: 2 LALMDADGNIAW--SGEYDEWGNQLNEENPHHLHQPYRLPGQQYDKESGLYYNRNRYYDP 59 L++ D GN +YD WG + E+ + P+R G YD+E+GLYY ++RYY P Sbjct: 2115 LSMTDDYGNTDQENRYDYDPWGTPICED--ESVKLPFRYAGYYYDEETGLYYLKSRYYSP 2172 Query: 60 LQGRYITQDPIGL-----EGGWSLYAYP-LNPVNGIDPLGLSPADVALIRRKDQLNHQRA 113 GR++T+D +LYAY NPVN +DP G + D+ R Sbjct: 2173 ALGRFLTRDDHSFINHADPQTLNLYAYCGNNPVNYVDPDGNTIDDIKSGLRNAG------ 2226 Query: 114 WDILSDTYEDMKRLNLGGTDQFFHCMAFCRVSKLNDAGVSRSAKGLGYEKEIRDYGLNLF 173 + D + C G+ + KG+G K + + Sbjct: 2227 --------NAVYNEARTWADAWLDCPFTGGAVFTGGIGIIKRIKGVGNFK--KYSPAQIE 2276 Query: 174 GMYGRKVKLSHSEMIEDNKKDL 195 YG K H E+ D DL Sbjct: 2277 KNYGLKKGQFHREIKGDILSDL 2298 >UniRef50_D1T3Q3 YD repeat protein (Fragment) n=2 Tax=Acidovorax avenae subsp. avenae ATCC 19860 RepID=D1T3Q3_9BURK Length = 561 Score = 132 bits (332), Expect = 9e-30, Method: Composition-based stats. Identities = 51/112 (45%), Positives = 66/112 (58%), Gaps = 4/112 (3%) Query: 2 LALMDADG----NIAWSGEYDEWGNQLNEENPHHLHQPYRLPGQQYDKESGLYYNRNRYY 57 +AL+DA+G + W+ + WG E +P + QP R GQQ D E+GL+YNR+RYY Sbjct: 311 IALVDANGPQAGLVTWAATHHAWGAVREEYDPLGIGQPIRFQGQQLDAETGLHYNRHRYY 370 Query: 58 DPLQGRYITQDPIGLEGGWSLYAYPLNPVNGIDPLGLSPADVALIRRKDQLN 109 DP+ G+Y+TQDPIGL GG AYPLNP+ DPLGL D Sbjct: 371 DPMLGQYVTQDPIGLMGGIHKQAYPLNPIQASDPLGLFEFPSVTTPIFDGTG 422 >UniRef50_Q9L0E3 Putative Rhs protein n=2 Tax=Streptomyces RepID=Q9L0E3_STRCO Length = 927 Score = 132 bits (332), Expect = 1e-29, Method: Composition-based stats. Identities = 41/110 (37%), Positives = 52/110 (47%), Gaps = 1/110 (0%) Query: 2 LALMDADGNIAWSGEYDEWGNQLNEENPHHLHQPYRLPGQQYDKESGLYYNRNRYYDPLQ 61 L+ DG+I W WG ++ P R PGQ +D E+GL+YN RYYDP Sbjct: 728 TELVSPDGDIGWRLRTTLWGLPVDGSGGST-DCPLRFPGQYHDPETGLHYNYFRYYDPGL 786 Query: 62 GRYITQDPIGLEGGWSLYAYPLNPVNGIDPLGLSPADVALIRRKDQLNHQ 111 GRY + DP+GL GG + Y NP IDP GL+ L Sbjct: 787 GRYCSLDPLGLAGGPNPAWYTPNPTAWIDPFGLALCRKRPRLETGDLKKG 836 >UniRef50_Q73BZ0 Cell wall-associated protein n=4 Tax=Bacillus cereus group RepID=Q73BZ0_BACC1 Length = 258 Score = 132 bits (331), Expect = 1e-29, Method: Composition-based stats. Identities = 45/209 (21%), Positives = 77/209 (36%), Gaps = 11/209 (5%) Query: 1 MLALMDADGNIAWSGEYDEWGNQL-NEENPHHLHQPYRLPGQQYDKESGLYYNRNRYYDP 59 ++A+ D + + + EYD WGN L ++ P+ G YDKE G+YY RYY+P Sbjct: 19 VVAMTDQNKEVVATYEYDSWGNVLKSDAKGIATDNPFGYAGYMYDKEIGMYYLIARYYNP 78 Query: 60 LQGRYITQDPI-GLEGGW---SLYAYP-LNPVNGIDPLGLSPADV--ALIRRKDQLNHQR 112 G +++ DP G E + Y Y NPV +DP G A D + Sbjct: 79 EHGVFLSVDPNPGDEDDPVTQNGYTYGDNNPVMMVDPDGHWVWFAVNAGFAAYDGYKAYK 138 Query: 113 AWDILSDTYEDMKRLNLGGTDQFFHCMAFCRVSKLNDAGVSRSAKGLGYEKEIRDYGLNL 172 + + +GG V G + A ++ Sbjct: 139 SGKGWKGVAKAAAISAIGGGKLKLTKTITKPVYFTKLLGKNARAHKSRINTDLPGGKAVA 198 Query: 173 FGMYGRKVKLSHSEMIEDNKKDLAVNDHG 201 ++ + + + +++K +V H Sbjct: 199 KSIF---RHYTKGQKVINHRKGNSVRRHT 224 >UniRef50_A4SKJ3 Rhs family protein n=2 Tax=Bacteria RepID=A4SKJ3_AERS4 Length = 1590 Score = 132 bits (331), Expect = 1e-29, Method: Composition-based stats. Identities = 43/129 (33%), Positives = 57/129 (44%), Gaps = 12/129 (9%) Query: 3 ALMDADGNIAWSGEYDEWGNQLNEE------------NPHHLHQPYRLPGQQYDKESGLY 50 L G+I W GE WGN + + R GQ YD+E+GLY Sbjct: 1352 ELCSEAGDIIWRGEQRLWGNYRADAIPQPLRRFLGDAANEETYCELRYQGQIYDQETGLY 1411 Query: 51 YNRNRYYDPLQGRYITQDPIGLEGGWSLYAYPLNPVNGIDPLGLSPADVALIRRKDQLNH 110 YNR+RY+DP G+YI+ DPIG GG Y NP+ +DPLGL + + Sbjct: 1412 YNRHRYFDPELGQYISPDPIGFAGGVRPQGYVHNPLEWVDPLGLKGFFTRTVFHAPSGST 1471 Query: 111 QRAWDILSD 119 + D Sbjct: 1472 HTVYQQAID 1480 >UniRef50_UPI000196E06E Rhs family protein n=2 Tax=Neisseria mucosa ATCC 25996 RepID=UPI000196E06E Length = 280 Score = 132 bits (331), Expect = 1e-29, Method: Composition-based stats. Identities = 44/93 (47%), Positives = 58/93 (62%), Gaps = 1/93 (1%) Query: 3 ALMDADGNIAWSGEYDEWGNQLNEE-NPHHLHQPYRLPGQQYDKESGLYYNRNRYYDPLQ 61 + D GN+ W GEY WG +E + HQP+RL Q YD+E+GL+YN RYY+P Sbjct: 50 EMTDIHGNLLWYGEYTAWGRLKKDECVYRNAHQPFRLQNQYYDEETGLHYNLMRYYEPEA 109 Query: 62 GRYITQDPIGLEGGWSLYAYPLNPVNGIDPLGL 94 GR++ QDPI L GG +LY++ N DPLGL Sbjct: 110 GRFVNQDPILLLGGSNLYSFASNTNAWFDPLGL 142 >UniRef50_Q2SPP3 Rhs family protein n=1 Tax=Hahella chejuensis KCTC 2396 RepID=Q2SPP3_HAHCH Length = 265 Score = 132 bits (331), Expect = 1e-29, Method: Composition-based stats. Identities = 38/81 (46%), Positives = 49/81 (60%) Query: 30 HHLHQPYRLPGQQYDKESGLYYNRNRYYDPLQGRYITQDPIGLEGGWSLYAYPLNPVNGI 89 H P R GQ YD+E+G +YNR+RYYDP GR+I Q PIGL GG + Y Y NPV + Sbjct: 14 ERAHNPLRFQGQYYDEETGFHYNRHRYYDPQSGRFINQAPIGLLGGANAYQYAPNPVGWV 73 Query: 90 DPLGLSPADVALIRRKDQLNH 110 DP GL+ + R+ D + Sbjct: 74 DPFGLTAKKESPKRQFDAIAA 94 >UniRef50_B7GLY4 Rhs family protein n=3 Tax=Anoxybacillus flavithermus WK1 RepID=B7GLY4_ANOFW Length = 563 Score = 132 bits (331), Expect = 1e-29, Method: Composition-based stats. Identities = 46/193 (23%), Positives = 66/193 (34%), Gaps = 5/193 (2%) Query: 1 MLALMDADGNIAWSGEYDEWGNQLNEENPHHLHQPYRLPGQQYDKESGLYYNRNRYYDPL 60 ++AL DA GN+ EYD WG ++ PYR G QYD+E+GLYY RYY P Sbjct: 314 VIALTDAQGNVVARYEYDTWGQIRSQTGVLADENPYRYAGYQYDEETGLYYLMARYYHPT 373 Query: 61 QGRYITQDPI-GLEGGW---SLYAYP-LNPVNGIDPLGLSPADVALIRRKDQLNHQRAWD 115 G +++ DP G + Y Y NPV +DP G ++ Sbjct: 374 HGVFLSLDPDPGDADDILTQNGYTYANNNPVMLVDPDGHFVWMAINAGFAAYDGYKAFKK 433 Query: 116 ILSDTYEDMKRLNLGGTDQFFHCMAFCRVSKLNDAGVSRSAKGLGYEKEIRDYGLNLFGM 175 + L G F + + +AK I ++ Sbjct: 434 GGWKAAAVAVGVGLVGGAAFKAYRIYKAKETAKIMRILLAAKHGKGNTTIEVGRVSKRLA 493 Query: 176 YGRKVKLSHSEMI 188 S Sbjct: 494 KKAGKAWVGSGAK 506 >UniRef50_D0KWY8 YD repeat protein n=1 Tax=Halothiobacillus neapolitanus c2 RepID=D0KWY8_HALNC Length = 1338 Score = 132 bits (331), Expect = 1e-29, Method: Composition-based stats. Identities = 57/202 (28%), Positives = 78/202 (38%), Gaps = 8/202 (3%) Query: 2 LALMDADGNIAWSGEYDEWGNQLNEENP---HHLHQPYRLPGQQYDKESGLYYNRNRYYD 58 +A DA + W Y +G QL+ + H R PGQ DKESGLYYN +RYYD Sbjct: 860 IAATDAQAQVIWRAHYGPYGQQLDVADSLVKDHFSLSLRNPGQWQDKESGLYYNDHRYYD 919 Query: 59 PLQGRYITQDPIGLEGGWSLYAYP-LNPVNGIDPLGLSPADVALIRRKDQLNHQRAWDIL 117 P GRY++ DP+GL GG + YAY NP+ DP GL H + Sbjct: 920 PATGRYLSPDPLGLAGGLNAYAYVAANPIAYTDPYGLM-LFAFDGTNNGAPGHLVPGNDT 978 Query: 118 SDTYEDMKRLNLGGTDQFFHCMAFCRV---SKLNDAGVSRSAKGLGYEKEIRDYGLNLFG 174 S+ Y D ++ N G RS G LN F Sbjct: 979 SNVYRFYDAYQFSSADPRYYITGIGTTYPEKHQNYRGNPRSGDGFTERLSYATADLNTFI 1038 Query: 175 MYGRKVKLSHSEMIEDNKKDLA 196 + +++ ++ Sbjct: 1039 ARHNPTDTLNIDVVGFSRGAAE 1060 >UniRef50_A1WE65 Rhs family protein-like protein n=1 Tax=Verminephrobacter eiseniae EF01-2 RepID=A1WE65_VEREI Length = 303 Score = 132 bits (331), Expect = 1e-29, Method: Composition-based stats. Identities = 45/151 (29%), Positives = 62/151 (41%), Gaps = 9/151 (5%) Query: 5 MDADGNIAWSGEYDEWGN-----QLNEENPHHLHQPYRLPGQQYDKESGLYYNRNRYYDP 59 D GN+ W+ Y+ +G + ++ RLPGQ D E+GL+YN +RYYD Sbjct: 81 TDNAGNVVWAANYNAFGRADVVTPRSATGDSRINSQLRLPGQYEDVETGLHYNFHRYYDL 140 Query: 60 LQGRYITQDPIGLEGGWSLYAYPL-NPVNGIDPLGLSPADVALIRRKDQLNHQRAW---D 115 GRY DPIGL GG + Y Y NP++ DPLGL W D Sbjct: 141 DIGRYSQIDPIGLRGGLNGYVYVNGNPLSFTDPLGLDVELRCRPAEIVAGLVNHCWLKTD 200 Query: 116 ILSDTYEDMKRLNLGGTDQFFHCMAFCRVSK 146 + + G + + VS Sbjct: 201 TIEAGMNKRITCSRAGNELVNLDSIWVVVSN 231 >UniRef50_A3NNM1 Protein RhsD n=20 Tax=pseudomallei group RepID=A3NNM1_BURP6 Length = 1539 Score = 131 bits (330), Expect = 1e-29, Method: Composition-based stats. Identities = 44/117 (37%), Positives = 68/117 (58%), Gaps = 3/117 (2%) Query: 2 LALMDADGNIAWSGEYDEWGNQLNEENPHHLHQPYRLPGQQYDKESGLYYNRNRYYDPLQ 61 + ++D +GN+AW YD G + + + QP RL GQ +D E+G+ YNR+RYYD Sbjct: 1309 VRMLDVEGNVAWEASYDANGG-IEQFGIQAMPQPLRLQGQYFDAETGMSYNRHRYYDARI 1367 Query: 62 GRYITQDPIGLEGGWSLYAYPLNPVNGIDPLGLSPADVALIRRKDQLNHQRAWDILS 118 G+++++DPI L GG +LY Y +N ++ DPLGL V L ++L+ W Sbjct: 1368 GQFVSEDPIRLSGGENLYRYCVNSISWADPLGLD--RVPLFDPNNRLSFNAIWAYTG 1422 >UniRef50_UPI000190F33A Rhs-family protein n=2 Tax=Salmonella enterica subsp. enterica serovar Typhi RepID=UPI000190F33A Length = 138 Score = 131 bits (330), Expect = 2e-29, Method: Composition-based stats. Identities = 40/63 (63%), Positives = 48/63 (76%) Query: 32 LHQPYRLPGQQYDKESGLYYNRNRYYDPLQGRYITQDPIGLEGGWSLYAYPLNPVNGIDP 91 HQP RLPGQ +D E+GL+YN RYY P GR+++QDPIGL GG +LYAY NP+ IDP Sbjct: 4 FHQPLRLPGQYFDDETGLHYNLFRYYAPECGRFVSQDPIGLRGGLNLYAYCPNPLTWIDP 63 Query: 92 LGL 94 LGL Sbjct: 64 LGL 66 >UniRef50_D1T3N4 YD repeat protein n=2 Tax=Betaproteobacteria RepID=D1T3N4_9BURK Length = 1584 Score = 131 bits (329), Expect = 2e-29, Method: Composition-based stats. Identities = 40/117 (34%), Positives = 55/117 (47%), Gaps = 24/117 (20%) Query: 3 ALMDADGNIAWSGEYDEWGNQLNE------------------------ENPHHLHQPYRL 38 + D GN+ W Y WG + E ++ + Q R+ Sbjct: 1354 EMSDTRGNLVWQARYLTWGATVQEHWQAFDATGRPADAPLAETCDRPQQSFAPMPQNLRM 1413 Query: 39 PGQQYDKESGLYYNRNRYYDPLQGRYITQDPIGLEGGWSLYAYPLNPVNGIDPLGLS 95 GQ D+E+GL+YN RYYDP G + T DPIGL GG +L+ Y NP+ IDP G + Sbjct: 1414 QGQYLDRETGLHYNTFRYYDPDLGAFTTPDPIGLAGGINLHQYAPNPIAWIDPWGWN 1470 >UniRef50_B2HAQ4 RhsD protein n=4 Tax=Burkholderia RepID=B2HAQ4_BURPS Length = 1531 Score = 131 bits (329), Expect = 2e-29, Method: Composition-based stats. Identities = 38/97 (39%), Positives = 53/97 (54%), Gaps = 2/97 (2%) Query: 2 LALMDADGNIAWSGEYDEWGNQLNEENPHHLHQPYRLPGQQYDKESGLYYNRNRYYDPLQ 61 + D+ G W+ YDE G + ++ P L GQ D E+GL+YNR+RYYDP Sbjct: 1289 TQMTDSSGREVWATGYDENGRLVPI--NADIYNPIHLQGQYRDAETGLHYNRHRYYDPAL 1346 Query: 62 GRYITQDPIGLEGGWSLYAYPLNPVNGIDPLGLSPAD 98 G +I++DP+GL G +LY Y N + DPLG Sbjct: 1347 GSFISKDPLGLAAGVNLYRYAPNSIGWADPLGFQAKP 1383 >UniRef50_Q88FK6 RHS family protein, putative n=1 Tax=Pseudomonas putida KT2440 RepID=Q88FK6_PSEPK Length = 1530 Score = 131 bits (329), Expect = 2e-29, Method: Composition-based stats. Identities = 53/197 (26%), Positives = 84/197 (42%), Gaps = 13/197 (6%) Query: 3 ALMDADGNIAWSGEYDEWGNQLNEENP--HHLHQPYRLPGQQYDKESGLYYNRNRYYDPL 60 L D+DGN W ++ WG +NE + + Q R GQ D+E+GL++N R+YDP Sbjct: 1314 QLTDSDGNTIWRSDHHGWGKIINEWHSQQNGREQNLRNQGQYIDRETGLHFNIFRFYDPD 1373 Query: 61 QGRYITQDPIGLEGGWSLYAYPLNPVNGIDPLGLSPADVALIRRKDQLNHQRAWDILSDT 120 GR+ T DP+G+EGG +LY+Y N VN DPLGL P +V + Sbjct: 1374 IGRFTTTDPLGIEGGVNLYSYAPNIVNYSDPLGLCPENVKEYDITPYRPSNSPLENHHGI 1433 Query: 121 YEDMKRLNLGGTDQFFHCMAFCRVSKLNDAGVSRSAKGLGYEKEI-RDYGLNLFGMYGRK 179 + + ++ + S+K K + R++ G Sbjct: 1434 LDIWAA---------QNVPSYGGKYPKGSPTIVLSSKNHAATKAVYREWLREKTGKP-VG 1483 Query: 180 VKLSHSEMIEDNKKDLA 196 K+ S++ L+ Sbjct: 1484 GKVDWSQISNREVMQLS 1500 >UniRef50_Q0K1I3 Insecticidal toxin complex protein n=2 Tax=Proteobacteria RepID=Q0K1I3_RALEH Length = 2644 Score = 130 bits (328), Expect = 2e-29, Method: Composition-based stats. Identities = 36/111 (32%), Positives = 52/111 (46%), Gaps = 3/111 (2%) Query: 5 MDADGNIAWSGEYDEWGNQLNEENPHHLHQ--PYRLPGQQYDKESGLYYNRNRYYDPLQG 62 +D I EY +G+ + YR G++ D+ESGLYY+ RYY P G Sbjct: 2261 LDEQAQIISYEEYAPYGSSTYQAVRSQTETAKRYRYTGKEQDEESGLYYHGARYYAPWLG 2320 Query: 63 RYITQDPIGLEGGWSLYAYP-LNPVNGIDPLGLSPADVALIRRKDQLNHQR 112 R+ DP G E G +LY Y NP +DP G S +A++ + + Sbjct: 2321 RWTACDPAGEEEGPNLYQYCFGNPTGFVDPDGQSGKKLAIVMHNESKPAGK 2371 >UniRef50_C5AA19 Rhs family protein n=1 Tax=Burkholderia glumae BGR1 RepID=C5AA19_BURGB Length = 1596 Score = 130 bits (328), Expect = 2e-29, Method: Composition-based stats. Identities = 39/96 (40%), Positives = 52/96 (54%), Gaps = 6/96 (6%) Query: 3 ALMDADGNIAWSGEYDEWGN------QLNEENPHHLHQPYRLPGQQYDKESGLYYNRNRY 56 + D G I W Y WG ++++ + P R GQ +D ESGL YNR+RY Sbjct: 1339 MITDELGEIVWEARYQAWGEARDVIERVSKATGERVRNPLRFQGQHFDDESGLAYNRHRY 1398 Query: 57 YDPLQGRYITQDPIGLEGGWSLYAYPLNPVNGIDPL 92 Y GRY+++DP L GG + +AY NPV IDPL Sbjct: 1399 YAADVGRYVSKDPAELLGGLNEFAYVPNPVQWIDPL 1434 >UniRef50_UPI00017448E8 YD repeat protein n=2 Tax=Verrucomicrobium spinosum DSM 4136 RepID=UPI00017448E8 Length = 675 Score = 130 bits (328), Expect = 3e-29, Method: Composition-based stats. Identities = 45/161 (27%), Positives = 65/161 (40%), Gaps = 22/161 (13%) Query: 1 MLALMDA-DGNIAWSGEYDEWGNQLNEENPHHLHQPYRLPGQQYDKESGLYYNRNRYYDP 59 + AL++ +G +Y +G P+R + YD E+GL Y RYY Sbjct: 249 ITALINQVNGETVAKFDYTPYGELKVSSG-DVNACPFRYQSKYYDAETGLSYFGFRYYSA 307 Query: 60 LQGRYITQDPIGLEGGWSLYAYPLN-PVNGIDP------------------LGLSPADVA 100 GR+I++DP+G GG++LY Y N PVN D GL A V Sbjct: 308 KLGRWISRDPLGEAGGFNLYGYCGNDPVNRWDYLGMDSGIWDKVVATDDFMDGLLWATVQ 367 Query: 101 LIRRKDQLNHQR-AWDILSDTYEDMKRLNLGGTDQFFHCMA 140 R D Q + +YE N+ G + HC+ Sbjct: 368 FWTRGDMFGGQMPLYQSPKSSYEYGYNTNMTGYNVASHCIG 408 >UniRef50_D1VCV7 YD repeat protein n=1 Tax=Frankia sp. EuI1c RepID=D1VCV7_9ACTO Length = 1572 Score = 130 bits (327), Expect = 3e-29, Method: Composition-based stats. Identities = 41/111 (36%), Positives = 53/111 (47%), Gaps = 1/111 (0%) Query: 2 LALMDADGNIAWSGEYDEWGNQLNEENPHHLHQPYRLPGQQYDKESGLYYNRNRYYDPLQ 61 L+D DG +AW WG P P R PGQ +D E+GL YN +RYYDP Sbjct: 1362 TELVDPDGRLAWHSRATLWG-VSPPSTPTTTDCPLRFPGQYHDPETGLNYNFHRYYDPAT 1420 Query: 62 GRYITQDPIGLEGGWSLYAYPLNPVNGIDPLGLSPADVALIRRKDQLNHQR 112 RY T D +GL + + Y NP+ IDP GL+P L + + Sbjct: 1421 ARYKTSDALGLSPAPNPWTYVTNPLTWIDPFGLAPCKDLLHAFGNAQGPRA 1471 >UniRef50_D0KG38 Rhs family protein-like protein n=1 Tax=Pectobacterium wasabiae WPP163 RepID=D0KG38_PECWW Length = 230 Score = 130 bits (326), Expect = 4e-29, Method: Composition-based stats. Identities = 45/110 (40%), Positives = 62/110 (56%), Gaps = 5/110 (4%) Query: 4 LMDADGNIAWSGEYDEWGNQ-----LNEENPHHLHQPYRLPGQQYDKESGLYYNRNRYYD 58 + DA+ + WSG+Y +G +E H Q R GQ D+E+GL+YN RYYD Sbjct: 1 MTDAESAVRWSGDYGSFGAVNGQTQDSEGLRHGKSQSLRYAGQYADEETGLHYNLFRYYD 60 Query: 59 PLQGRYITQDPIGLEGGWSLYAYPLNPVNGIDPLGLSPADVALIRRKDQL 108 P GR+ TQ IGL GG +LY Y N + +DPLGL+P ++ KD+ Sbjct: 61 PTVGRFTTQGLIGLAGGLNLYQYAPNSLGWVDPLGLTPGEIIRYMGKDEA 110 >UniRef50_D1T3N5 YD repeat protein n=1 Tax=Acidovorax avenae subsp. avenae ATCC 19860 RepID=D1T3N5_9BURK Length = 598 Score = 130 bits (326), Expect = 4e-29, Method: Composition-based stats. Identities = 50/153 (32%), Positives = 72/153 (47%), Gaps = 5/153 (3%) Query: 2 LALMDADG----NIAWSGEYDEWGNQLNEENPHHLHQPYRLPGQQYDKESGLYYNRNRYY 57 +AL+DA+G + W+ Y WG E +PH + Q R GQQ+D E+GL+YNR RYY Sbjct: 357 IALVDANGPQAGLVTWAATYHAWGAVREEYDPHGIGQDIRFQGQQFDAETGLHYNRFRYY 416 Query: 58 DPLQGRYITQDPIGLEGGWSLYAYPL-NPVNGIDPLGLSPADVALIRRKDQLNHQRAWDI 116 DP+ G+Y+TQDPIGL+GG + Y +P DP GL D A + A Sbjct: 417 DPMLGQYVTQDPIGLKGGLNKSNYSGSSPAINCDPKGLDFKDKATKGGEVGYEVYDAKST 476 Query: 117 LSDTYEDMKRLNLGGTDQFFHCMAFCRVSKLND 149 + + Q + ++ Sbjct: 477 YESGSKRACLIRQITNRQIAYQELLGKMGSGKP 509 >UniRef50_C2LFQ4 Putative uncharacterized protein n=1 Tax=Proteus mirabilis ATCC 29906 RepID=C2LFQ4_PROMI Length = 214 Score = 130 bits (326), Expect = 5e-29, Method: Composition-based stats. Identities = 45/156 (28%), Positives = 59/156 (37%), Gaps = 2/156 (1%) Query: 25 NEENPHHLHQPYRLPGQQYDKESGLYYNRNRYYDPLQGRYITQDPIGLEGGWSLYAYPLN 84 E +P++ P+R GQ D+ESGLYYNR RYYD G+Y++ DPIGL GG + Y Y Sbjct: 11 AENDPNYTECPFRFAGQYEDEESGLYYNRFRYYDRETGQYLSPDPIGLLGGLNPYGYVHC 70 Query: 85 PVNGIDPLGLSPADVALIRRKDQLNHQRAWDILSDTYEDMKRLNLGGTDQFFHCMAFCRV 144 P +DP GL+ + + G H R Sbjct: 71 PTGWVDPFGLAGGKGNKGELDGSGRPLASPRYSVAFETKIDSKFYPGRSDKVHFQEANRN 130 Query: 145 SKLNDAGVSRSAKGL--GYEKEIRDYGLNLFGMYGR 178 AK L Y + G Y R Sbjct: 131 LHEAMKADPIFAKKLEDMYPGITQGVQPGPRGAYPR 166 >UniRef50_C0EPY1 Putative uncharacterized protein n=7 Tax=Neisseria RepID=C0EPY1_NEIFL Length = 434 Score = 130 bits (326), Expect = 5e-29, Method: Composition-based stats. Identities = 43/93 (46%), Positives = 56/93 (60%), Gaps = 1/93 (1%) Query: 3 ALMDADGNIAWSGEYDEWGNQLNEENP-HHLHQPYRLPGQQYDKESGLYYNRNRYYDPLQ 61 + D GN+ W GEY WG + H HQP+RL Q YD+E+GL+YN RYY+P Sbjct: 222 EMTDIRGNLLWYGEYTAWGRLKKDGRVYQHAHQPFRLQNQYYDRETGLHYNYFRYYEPET 281 Query: 62 GRYITQDPIGLEGGWSLYAYPLNPVNGIDPLGL 94 GR+I+QDPIGL G +LY + N +D GL Sbjct: 282 GRFISQDPIGLLGEDNLYWFGPNTAIWVDLFGL 314 >UniRef50_A9AKU8 Type VI secretion system Vgr family protein n=10 Tax=Burkholderia RepID=A9AKU8_BURM1 Length = 1981 Score = 129 bits (323), Expect = 1e-28, Method: Composition-based stats. Identities = 43/125 (34%), Positives = 61/125 (48%), Gaps = 2/125 (1%) Query: 3 ALMDADGNIAWSGEYDEWGNQLNEENPHHLHQPYRLPGQQYDKESGLYYNRNRYYDPLQG 62 L D + W+ + +G + P R PGQ D+ESGL+YNR RYYDP+ G Sbjct: 1746 ELYDEQREVLWAADLSAYGRTARW-LTRVVDNPIRFPGQYRDEESGLHYNRFRYYDPMVG 1804 Query: 63 RYITQDPIGLEGGWSLYAYPLN-PVNGIDPLGLSPADVALIRRKDQLNHQRAWDILSDTY 121 RYI QDPI +GG + Y+Y + P DP GL +A+ + A +I Sbjct: 1805 RYINQDPIAFDGGINFYSYADSAPNIAYDPKGLFVPLIAVASFLGRGALGGAVEIAMQGA 1864 Query: 122 EDMKR 126 + + R Sbjct: 1865 KQVFR 1869 >UniRef50_C6CPH2 YD repeat protein n=11 Tax=Enterobacteriaceae RepID=C6CPH2_DICZE Length = 1423 Score = 129 bits (323), Expect = 1e-28, Method: Composition-based stats. Identities = 45/142 (31%), Positives = 60/142 (42%), Gaps = 2/142 (1%) Query: 3 ALMDADGNIAWSG-EYDEWGNQLNEENPHHLHQPYRLPGQQYDKESGLYYNRNRYYDPLQ 61 AL DG + W + WG + E GQ D ESGL YNR RYYDP Sbjct: 1211 ALFTPDGTLRWQAPKATLWGQRQAE-KSESPDPGLAFAGQLRDSESGLCYNRFRYYDPAG 1269 Query: 62 GRYITQDPIGLEGGWSLYAYPLNPVNGIDPLGLSPADVALIRRKDQLNHQRAWDILSDTY 121 G Y++ DPIG+ GG + YAY NP+ IDPLGL+ + + K + + D Sbjct: 1270 GCYVSPDPIGIAGGDNNYAYAPNPITWIDPLGLAGCSIQKLEEKGFTGVKSTKNGGLDYA 1329 Query: 122 EDMKRLNLGGTDQFFHCMAFCR 143 + G + Sbjct: 1330 NSNALYSKEGVNPIRRIEYTGN 1351 >UniRef50_Q5TP09 AGAP009916-PA n=4 Tax=Anopheles gambiae RepID=Q5TP09_ANOGA Length = 3321 Score = 129 bits (323), Expect = 1e-28, Method: Composition-based stats. Identities = 42/224 (18%), Positives = 69/224 (30%), Gaps = 22/224 (9%) Query: 2 LALMDADGNIAWSGEYDEWGNQLNEENPHHL-HQPYRLPGQQYDKESGLYYNRNRYYDPL 60 + L+ +G I + +Y +G L H YR GQ++D+E+ LY R YDP Sbjct: 2594 VRLVIKNGEIVAAYDYLPYGELLRSYGDDPDGHLDYRFTGQEWDEETNLYNFHARLYDPE 2653 Query: 61 QGRYITQDPIGLEGGWSLYAYPLN-PVNGIDPLGLSPADVALIRRKDQLNHQRAWDILSD 119 GR++ DP E S Y Y N PV+ IDP G + + + + Sbjct: 2654 LGRFLQLDPK--EQYASPYLYAGNSPVSLIDPDGQFAILLIVSIVTAAVGAYLGASAANK 2711 Query: 120 TYEDMK-RLNLGGTDQFFHCMAFCRVSKLNDAGVSRSAKGLGYEKEIRDYGLNLFG---- 174 ++ K + + ++ A +G + G Sbjct: 2712 SWNPAKWEVKKAIVGATMGAIVGGFAPVGIAGSITFLAGAVGTTAAVGITAATSVGFAYV 2771 Query: 175 ------MYGRKVKLSHSEM-------IEDNKKDLAVNDHGLTCP 205 +K S+ N G Sbjct: 2772 STASATKSWNPIKWDWSQPGTWNALFTGSISGASFYNTIGTIHQ 2815 >UniRef50_C2VMF9 Wall-associated domain protein n=5 Tax=Bacillus cereus RepID=C2VMF9_BACCE Length = 257 Score = 128 bits (322), Expect = 1e-28, Method: Composition-based stats. Identities = 42/203 (20%), Positives = 69/203 (33%), Gaps = 12/203 (5%) Query: 1 MLALMDADGNIAWSGEYDEWGNQL-NEENPHHLHQPYRLPGQQYDKESGLYYNRNRYYDP 59 ++A+ D +G + + EYD WGN L ++ P+ G YDKE G+YY RYY+P Sbjct: 18 VIAMTDQNGQVVANYEYDAWGNVLKSDAKGIAAENPFGYAGYMYDKEIGMYYLIARYYNP 77 Query: 60 LQGRYITQDPIGLEG-------GWSLYAYP-LNPVNGIDPLGLSPADVALIRRKDQLNHQ 111 G +++ I + + Y Y NPV +DP G L ++ Sbjct: 78 EHGVFLS---IDPDPGDEDDPVTQNGYTYGDNNPVMMVDPDGHLAWFAPLAIHGARIAAP 134 Query: 112 RAWDILSDTYEDMKRLNLGGTDQFFHCMAFCRVSKLNDAGVSRSAKGLGYEKEIRDYGLN 171 + LN F SK + + +K + + Sbjct: 135 HVARYAAKKGIKRATLNAMKKKPFQVHHFATNKSKKYTSQFQKISKKYDLDLDAGWNKKR 194 Query: 172 LFGMYGRKVKLSHSEMIEDNKKD 194 + + K D Sbjct: 195 MRHQGRHPNAYHEFMLRNMKKID 217 >UniRef50_C3GXN0 Wall associated protein n=1 Tax=Bacillus thuringiensis serovar huazhongensis BGSC 4BD1 RepID=C3GXN0_BACTU Length = 228 Score = 128 bits (322), Expect = 1e-28, Method: Composition-based stats. Identities = 43/192 (22%), Positives = 65/192 (33%), Gaps = 6/192 (3%) Query: 4 LMDADGNIAWSGEYDEWGNQL-NEENPHHLHQPYRLPGQQYDKESGLYYNRNRYYDPLQG 62 + + + + + EYD WGN L +E P+ G YDKE G+YY RYY+P G Sbjct: 1 MTNQNKEVVATYEYDSWGNVLKSEVKGIAADNPFGYAGYMYDKEIGMYYLIARYYNPDHG 60 Query: 63 RYITQDPIGLEG----GWSLYAYP-LNPVNGIDPLGLSPADVALIRRKDQLNHQRAWDIL 117 +++ DP + + Y Y NPV +DP G V + Sbjct: 61 VFLSVDPDPGDEDDPVTMNGYTYADNNPVMMVDPDGHVAWWVGSAAIGAGFGVAKYLYQN 120 Query: 118 SDTYEDMKRLNLGGTDQFFHCMAFCRVSKLNDAGVSRSAKGLGYEKEIRDYGLNLFGMYG 177 T K + V ++ +K I+ L F G Sbjct: 121 RKTGYTWKGGLKAAGKGALTGLIGGGVGRVFGFITPMQGGIRVVKKTIQGRALIPFKQVG 180 Query: 178 RKVKLSHSEMIE 189 R V + Sbjct: 181 RNVSRFIGNPKK 192 >UniRef50_B0XCM0 SGS3 n=1 Tax=Culex quinquefasciatus RepID=B0XCM0_CULQU Length = 2047 Score = 128 bits (322), Expect = 2e-28, Method: Composition-based stats. Identities = 42/205 (20%), Positives = 68/205 (33%), Gaps = 15/205 (7%) Query: 2 LALMDADGNIAWSGEYDEWGNQLNEENPHH-LHQPYRLPGQQYDKESGLYYNRNRYYDPL 60 + L+ +G + + +Y +G L E N YR GQ++D+E+ LY R YDP Sbjct: 1440 IRLVIRNGEVVSAYDYLPYGQLLREYNTDPDAAIAYRYTGQEWDEETNLYNYHARLYDPE 1499 Query: 61 QGRYITQDPIGLEGGWSLYAYPLN-PVNGIDPLGLSPADVAL---IRRKDQLNHQRAWDI 116 GR+ DP E S Y Y N P++ +DP G + L Sbjct: 1500 IGRFYQIDPK--EQYPSPYVYAGNSPISLVDPDGQFAFLIPLAFAAVGGYLGASAANGSF 1557 Query: 117 LSDTYEDMKRLNLGGTDQFFHCMAFCRVSK-----LNDAGVSRSAKGLGYEKEIRDYG-- 169 ++ L G +A + G+S A G + Sbjct: 1558 NPTKWKLKPTLIGGVLGAVMGGLAPAGIGASFTFLTGTVGLSSVAAGFVIGTTAGGFAFL 1617 Query: 170 -LNLFGMYGRKVKLSHSEMIEDNKK 193 ++ + S+ N Sbjct: 1618 SAASANQNWNPLEWNWSDPATFNAL 1642 >UniRef50_UPI000023D9A2 hypothetical protein FG10566.1 n=1 Tax=Gibberella zeae PH-1 RepID=UPI000023D9A2 Length = 2439 Score = 128 bits (321), Expect = 2e-28, Method: Composition-based stats. Identities = 51/221 (23%), Positives = 82/221 (37%), Gaps = 16/221 (7%) Query: 5 MDADGNIAWSGEYDEWGNQLNEENPHHLHQP--YRLPGQQYDKESGLYYNRNRYYDPLQG 62 D + EY +G + ++ P YR ++D E+GLY+ RYY P G Sbjct: 2062 FDDQAQLISYEEYSPFGAVVYAAMYGNIEAPRAYRFARYEHDSETGLYHCGQRYYCPWLG 2121 Query: 63 RYITQDPIGLEGGWSLYAYPLN-PVNGIDPLGLSPADVALIRRK-----DQLNHQRAWDI 116 R+ + DP+G G +L+ Y N PVN DP G S R+ D +R D Sbjct: 2122 RWTSPDPLGDVDGPNLFVYVNNDPVNSHDPSGTSGKKTKEGTREMYAAPDDQGKRRLVDE 2181 Query: 117 LSDTYEDMKRLNLGGTDQFFH-CMAFCRVSKLNDAGVSRSAKGLGYEKEIRDYGLNLFGM 175 + + + Q A R+S + SR+ +G N G Sbjct: 2182 NKAVADRIAKYERKLQRQERKQQRAIARMSGTDPILGSRARYAVGI-----AAMGNALGR 2236 Query: 176 YGRKVKLSHSEMIE--DNKKDLAVNDHGLTCPSTTDCSDRC 214 +L H+ E + D+ +N + + + C Sbjct: 2237 ISGSTELHHTYPQEYREEFSDIDINVDRTSVSISKEAHYIC 2277 >UniRef50_Q3YV37 Putative uncharacterized protein n=1 Tax=Shigella sonnei Ss046 RepID=Q3YV37_SHISS Length = 303 Score = 128 bits (321), Expect = 2e-28, Method: Composition-based stats. Identities = 66/81 (81%), Positives = 72/81 (88%) Query: 14 SGEYDEWGNQLNEENPHHLHQPYRLPGQQYDKESGLYYNRNRYYDPLQGRYITQDPIGLE 73 EYDEWGN LNEENPH L Q RLPGQQYD+ESGLYYNR+RYYDPLQGRYITQDPIGL+ Sbjct: 76 RREYDEWGNLLNEENPHQLQQLIRLPGQQYDEESGLYYNRHRYYDPLQGRYITQDPIGLK 135 Query: 74 GGWSLYAYPLNPVNGIDPLGL 94 GGW+LY YPL+PVN +DPLGL Sbjct: 136 GGWNLYTYPLSPVNSMDPLGL 156 >UniRef50_C7Q0B8 YD repeat protein n=1 Tax=Catenulispora acidiphila DSM 44928 RepID=C7Q0B8_CATAD Length = 1489 Score = 127 bits (320), Expect = 2e-28, Method: Composition-based stats. Identities = 52/200 (26%), Positives = 69/200 (34%), Gaps = 5/200 (2%) Query: 2 LALMDADGNIAWS-GEYDEWGNQLNEENPHHLHQPYRLPGQQYDKESGLYYNRNRYYDPL 60 L+ ADG++ W WG + P P R PGQ +D E+GL+YN NRYYDP Sbjct: 1286 TELVSADGHVVWQQRRASIWGLPADIVPPDADEFPLRFPGQYHDSETGLHYNLNRYYDPE 1345 Query: 61 QGRYITQDPIGLEGGWSLYAYPLNPVNGIDPLGLSPADVALIRRKDQLNHQRAWDILSDT 120 Y+T DP+GLE + Y Y NP+ DPLGL PA + D Sbjct: 1346 AAAYLTPDPLGLEPAPNQYGYVGNPLADSDPLGLYPASGNARGGNGSNENVMPKSQADDV 1405 Query: 121 YEDMKRLNLG---GTDQFFHCMAFCRVSKLNDAGVSRSAKGLGYEKEIR-DYGLNLFGMY 176 + + Q + R+ G K + Sbjct: 1406 AKYLGYKKTKEMSAGKQPIWVNKKAGGGQPKYITYDRTGHSGGIFKGASFEKPFQTTKDT 1465 Query: 177 GRKVKLSHSEMIEDNKKDLA 196 GR N L Sbjct: 1466 GRDGTYDLDVDGSGNVSGLK 1485 >UniRef50_C4NV50 Rhs repeat family protein n=10 Tax=Gammaproteobacteria RepID=C4NV50_ECOLX Length = 1374 Score = 127 bits (320), Expect = 2e-28, Method: Composition-based stats. Identities = 49/182 (26%), Positives = 64/182 (35%), Gaps = 6/182 (3%) Query: 3 ALMDADGNIAWSGEYDEWGNQLNEENPHHLHQPYRLPGQQYDKESGLYYNRNRYYDPLQG 62 AL D G + W Y +G R PGQ YD E+G +YN RYYDP G Sbjct: 1114 ALTDVSGQVVWKASYSPFGKASIIIQGPTF--NLRFPGQYYDAETGFHYNWRRYYDPATG 1171 Query: 63 RYITQDPIGLEGGWSLYAYP-LNPVNGIDPLGLSPADVALIRRKDQLNHQ---RAWDILS 118 RYIT DP+GL G + Y Y NP++ DP G A I +L Q Sbjct: 1172 RYITSDPLGLIDGVNTYGYVHGNPMSNTDPTGEFAFVGAGIGAGLELLSQLIENNGSWKC 1231 Query: 119 DTYEDMKRLNLGGTDQFFHCMAFCRVSKLNDAGVSRSAKGLGYEKEIRDYGLNLFGMYGR 178 ++ + G R + + S K +R G Sbjct: 1232 VSWSKVGIAGAIGAIGGGWASGVFRHASSGKSWFKLSQKWSNVSPRVRKVQGVPRGNELH 1291 Query: 179 KV 180 Sbjct: 1292 HW 1293 >UniRef50_D1S833 YD repeat protein n=1 Tax=Micromonospora aurantiaca ATCC 27029 RepID=D1S833_9ACTO Length = 3829 Score = 127 bits (320), Expect = 2e-28, Method: Composition-based stats. Identities = 41/101 (40%), Positives = 52/101 (51%), Gaps = 5/101 (4%) Query: 4 LMDADGNIAWSGEYDEWGNQLNEENPHHLHQPYRLPGQQYDKESGLYYNRNRYYDPLQGR 63 L++ G + W D WG P PGQ D E+GL+YNR RYYDP GR Sbjct: 2487 LVEPGGGLRWWSRGDLWGR-----GADRTATPLAFPGQYVDAETGLHYNRFRYYDPATGR 2541 Query: 64 YITQDPIGLEGGWSLYAYPLNPVNGIDPLGLSPADVALIRR 104 Y++ DP+GL GG + AY NP+ DPLGL+ A Sbjct: 2542 YVSPDPLGLSGGPNPTAYVSNPLTVADPLGLTSCTPAPTTP 2582 >UniRef50_A6GBQ3 Putative uncharacterized protein n=1 Tax=Plesiocystis pacifica SIR-1 RepID=A6GBQ3_9DELT Length = 2507 Score = 127 bits (320), Expect = 2e-28, Method: Composition-based stats. Identities = 37/126 (29%), Positives = 53/126 (42%), Gaps = 4/126 (3%) Query: 5 MDADGNIAWSGEYDEWGNQLNEENPHHLHQP---YRLPGQQYDKESGLYYNRNRYYDPLQ 61 + G I EY +G L P YR G + D+E+GL Y+ RYY P Sbjct: 2059 VSETGAIISHEEYHPYGTSAYRMVDSQLDVPPKRYRFTGMERDEETGLSYHSARYYAPWL 2118 Query: 62 GRYITQDPIGLEGGWSLYAYP-LNPVNGIDPLGLSPADVALIRRKDQLNHQRAWDILSDT 120 GR+ DPIGL G + +AY NP N +DP G + I N + + Sbjct: 2119 GRWTAADPIGLGDGVNRFAYVRGNPTNFVDPTGFAGESENHIYSDQWGNFAERAYQIGRS 2178 Query: 121 YEDMKR 126 + + + Sbjct: 2179 EQQLDQ 2184 >UniRef50_B8FIJ1 YD repeat protein n=1 Tax=Desulfatibacillum alkenivorans AK-01 RepID=B8FIJ1_DESAA Length = 1630 Score = 127 bits (320), Expect = 3e-28, Method: Composition-based stats. Identities = 34/99 (34%), Positives = 48/99 (48%), Gaps = 7/99 (7%) Query: 2 LALMDADGNIAWSGEYDEWGNQLNEENPHH------LHQPYRLPGQQYDKESGLYYNRNR 55 +AL D+ G + + Y +G + P+ GQ+YD+E+GLYY R R Sbjct: 1318 IALTDSTGAVVETYRYTPYGQVSFFDGNGSSISQSNESNPFLFTGQRYDEETGLYYYRAR 1377 Query: 56 YYDPLQGRYITQDPIGLEGGWSLYAYP-LNPVNGIDPLG 93 Y P GR++ DP G G +LY Y NP +DP G Sbjct: 1378 YLHPELGRFLNPDPKGFVDGMNLYEYAMSNPARYVDPRG 1416 >UniRef50_D0BWK2 YD repeat protein n=1 Tax=Acinetobacter sp. RUH2624 RepID=D0BWK2_9GAMM Length = 1361 Score = 127 bits (320), Expect = 3e-28, Method: Composition-based stats. Identities = 43/164 (26%), Positives = 70/164 (42%), Gaps = 4/164 (2%) Query: 4 LMDADGNIAWSGEYDEWGNQLNEENPHHLHQPYRLPGQQYDKESGLYYNRNRYYDPLQGR 63 L+D++G++ WS + +G + R PGQ YD +G +YN NR+Y+P GR Sbjct: 1054 LIDSNGSVVWSWDSTAFG---LGSPVSTITFNLRFPGQYYDATTGQFYNHNRFYNPELGR 1110 Query: 64 YITQDPIGLEGGWSLYAYP-LNPVNGIDPLGLSPADVALIRRKDQLNHQRAWDILSDTYE 122 Y+ DPIGL GG + Y Y NPV +D G +P +A+ ++ + Sbjct: 1111 YMEPDPIGLAGGLNPYIYALNNPVMYVDMTGENPILIAMGVSAATAGAFYTGEVFFNAVY 1170 Query: 123 DMKRLNLGGTDQFFHCMAFCRVSKLNDAGVSRSAKGLGYEKEIR 166 D + + D F + + + G + G Sbjct: 1171 DAYKNDQSILDSFSKSFSIKELGQQMVIGAAFGGVGKAAFLAAD 1214 >UniRef50_B5JS46 NHL repeat containing protein n=2 Tax=gamma proteobacterium HTCC5015 RepID=B5JS46_9GAMM Length = 2515 Score = 127 bits (319), Expect = 3e-28, Method: Composition-based stats. Identities = 46/190 (24%), Positives = 72/190 (37%), Gaps = 11/190 (5%) Query: 2 LALMDADGNIAWSGEYDEWGNQLNEENPHHLHQPYRLPGQQYDKESGLYYNRNRYYDPLQ 61 + + ADG IA YDE+G + + NP QP+ G YD+ + L R YD Sbjct: 2333 MVVNTADGAIAQQMSYDEFGQVVEDSNPG--FQPFGFAGGIYDQHTKLTRFGARDYDAET 2390 Query: 62 GRYITQDPIGLEGGWSLYAYP-LNPVNGIDPLGLSPADVALIRRKDQLNHQRAWDILSDT 120 GR+ T+DPI EGG +L+ Y +PVN +D GL + + W + Sbjct: 2391 GRWTTKDPIRFEGGLNLFGYVANDPVNWVDIWGLEGEQATVELVNPR---GADWSEAREN 2447 Query: 121 YEDMKRLNLGGTDQFFHCMAFCRVSKLNDAGVSRSAKGLGYEKEIRDYGLNLFGMYGRKV 180 + G + R V+ Y D+ ++ + Sbjct: 2448 NPHLPGPQYPGDYATEYTDKAERDV-----CVTCPNGEKYYTPIPGDHLPDIMDKFNNGW 2502 Query: 181 KLSHSEMIED 190 K H + E+ Sbjct: 2503 KKEHPDPTEE 2512 >UniRef50_Q07833 Wall-associated protein n=18 Tax=Bacillaceae RepID=WAPA_BACSU Length = 2334 Score = 127 bits (319), Expect = 3e-28, Method: Composition-based stats. Identities = 40/143 (27%), Positives = 58/143 (40%), Gaps = 6/143 (4%) Query: 1 MLALMDADGNIAWSGEYDEWGNQLNEENPHHL-HQPYRLPGQQYDKESGLYYNRNRYYDP 59 ++A+ D+ G +YD WGN E + YR G QYD+E+GLYY RYY+P Sbjct: 2088 IIAISDSTGKTVAKYQYDAWGNPTKTEASDEVKDNRYRYAGYQYDEETGLYYLMARYYEP 2147 Query: 60 LQGRYITQDPIGLEGG----WSLYAYP-LNPVNGIDPLGLSPADVALIRRKDQLNHQRAW 114 G +++ DP G + YAY NPV +DP G V ++ Sbjct: 2148 RNGVFLSLDPDPGSDGDSLDQNGYAYGNNNPVMNVDPDGHWVWLVVNAGFAAYDGYKAYK 2207 Query: 115 DILSDTYEDMKRLNLGGTDQFFH 137 + G + F Sbjct: 2208 SGKGWKGAAWAAASNFGPGKIFK 2230 >UniRef50_Q16U81 Putative uncharacterized protein n=1 Tax=Aedes aegypti RepID=Q16U81_AEDAE Length = 3340 Score = 127 bits (319), Expect = 3e-28, Method: Composition-based stats. Identities = 34/122 (27%), Positives = 55/122 (45%), Gaps = 4/122 (3%) Query: 8 DGNIAWSGEYDEWGNQLNEENPH-HLHQPYRLPGQQYDKESGLYYNRNRYYDPLQGRYIT 66 +G + + +Y +G + + H YR GQ+YD+E+GLY R YDP GR+ Sbjct: 2635 NGEVVAAYDYFPYGQLMRIYGSNPEAHIAYRYTGQEYDEETGLYNYHARLYDPDIGRFFQ 2694 Query: 67 QDPIGLEGGWSLYAYPLN-PVNGIDPLGLSPADVALIRRKDQLNHQRAWDILSDTYEDMK 125 DP+ E S Y Y N PV+ IDP G + ++ + ++++ K Sbjct: 2695 MDPM--EQYASPYKYAGNSPVSQIDPDGQIAITLIVMAIGALVGAYIGASSANNSWNPAK 2752 Query: 126 RL 127 Sbjct: 2753 WA 2754 >UniRef50_D1SVF0 Rhs family protein (Fragment) n=1 Tax=Acidovorax avenae subsp. avenae ATCC 19860 RepID=D1SVF0_9BURK Length = 218 Score = 127 bits (319), Expect = 3e-28, Method: Composition-based stats. Identities = 42/121 (34%), Positives = 57/121 (47%), Gaps = 24/121 (19%) Query: 9 GNIAWSGEYDEWGNQLNE------------------------ENPHHLHQPYRLPGQQYD 44 GN+ W Y WG + E ++ + Q R+ GQ D Sbjct: 2 GNLVWQARYLTWGATVQEHWQAFDAAGRPVDAPVAETGHRPQQSFVLIPQNLRMQGQYLD 61 Query: 45 KESGLYYNRNRYYDPLQGRYITQDPIGLEGGWSLYAYPLNPVNGIDPLGLSPADVALIRR 104 +E+GL+YN RYYDP G + T DPIGL GG +L+ Y LNP+ +DP G +P R Sbjct: 62 RETGLHYNTFRYYDPDLGAFTTPDPIGLAGGINLHQYALNPIAWVDPWGWAPYCRQKGRP 121 Query: 105 K 105 K Sbjct: 122 K 122 >UniRef50_C9RZN2 YD repeat protein n=2 Tax=Geobacillus RepID=C9RZN2_GEOSY Length = 678 Score = 127 bits (318), Expect = 4e-28, Method: Composition-based stats. Identities = 40/98 (40%), Positives = 52/98 (53%), Gaps = 5/98 (5%) Query: 1 MLALMDADGNIAWSGEYDEWGNQLNEENPHHLHQPYRLPGQQYDKESGLYYNRNRYYDPL 60 ++AL D GNI +YD GN L++ PYR G QYD+E+GLYY RYY P Sbjct: 468 VIALTDEQGNIVARYQYDARGNILSQSGALADENPYRYAGYQYDQETGLYYLIARYYHPE 527 Query: 61 QGRYITQDPI-GLEGGW---SLYAYP-LNPVNGIDPLG 93 G +++ DP G + YAY NPV +DP G Sbjct: 528 HGVFLSLDPDPGDADDLLTQNGYAYANNNPVMFVDPDG 565 >UniRef50_C4DNE5 Rhs family protein n=1 Tax=Stackebrandtia nassauensis DSM 44728 RepID=C4DNE5_9ACTO Length = 2442 Score = 127 bits (318), Expect = 4e-28, Method: Composition-based stats. Identities = 37/141 (26%), Positives = 54/141 (38%), Gaps = 6/141 (4%) Query: 5 MDADGNIAWSGEYDEWGNQLNEENPHHLHQP--YRLPGQQYDKESGLYYNRNRYYDPLQG 62 +D + EY +G + H P YR G++ D ESGLYY RYY P G Sbjct: 1972 LDDQAQVISFEEYYPFGGTSFQSTREHTETPKRYRFTGKERDTESGLYYQGARYYAPWLG 2031 Query: 63 RYITQDPIGLEGGWSLYAYP-LNPVNGIDPLGLSPADVALIRRKDQLNHQRAWDILSDTY 121 R+ + DP G G YAY NP+ DP GL + + W ++ T Sbjct: 2032 RWTSADPAGPRDGACPYAYVGNNPLRRTDPSGLEGDEPEQLNFAQSPAF---WQQVTATA 2088 Query: 122 EDMKRLNLGGTDQFFHCMAFC 142 + + + + Sbjct: 2089 KGRGISDATRANLQKVAQMWG 2109 >UniRef50_A5GE16 YD repeat protein n=1 Tax=Geobacter uraniireducens Rf4 RepID=A5GE16_GEOUR Length = 1600 Score = 127 bits (318), Expect = 4e-28, Method: Composition-based stats. Identities = 41/101 (40%), Positives = 62/101 (61%), Gaps = 4/101 (3%) Query: 1 MLALMDADGNIAWSGEYDEWGNQLNEENPHHLHQPYRLPGQQYDKESGLYYNRNRYYDPL 60 ++++ DA+ N+ S EYD +G Y G+++DKE+GLY+ R RYYDP+ Sbjct: 1411 IVSITDANRNVVQSYEYDSFGMVKPSTV---FANSYTYTGREWDKETGLYFYRARYYDPM 1467 Query: 61 QGRYITQDPIGLEGGWSLYAYP-LNPVNGIDPLGLSPADVA 100 +GR+I++DP+G +GG ++YAY N VN DP GL P Sbjct: 1468 EGRFISKDPVGFKGGINIYAYVSNNVVNDTDPSGLYPGPCG 1508 >UniRef50_D1W448 RHS repeat-associated core domain protein n=1 Tax=Prevotella buccalis ATCC 35310 RepID=D1W448_9BACT Length = 195 Score = 126 bits (317), Expect = 6e-28, Method: Composition-based stats. Identities = 40/129 (31%), Positives = 62/129 (48%), Gaps = 1/129 (0%) Query: 2 LALMDADGNIAWSGEYDEWGNQLNEENPHHLHQPYRLPGQQYDKESGLYYNRNRYYDPLQ 61 + +D+ G + W D +G+ L P+R GQ D E+GLYYNR RYYDP Sbjct: 53 IQALDSKGEVVWDCILDIYGDVLELRGKRDF-IPFRFQGQYEDGETGLYYNRFRYYDPNS 111 Query: 62 GRYITQDPIGLEGGWSLYAYPLNPVNGIDPLGLSPADVALIRRKDQLNHQRAWDILSDTY 121 G +I+QDPI + GG+++YAY + + +D GLS + + + + T+ Sbjct: 112 GTFISQDPISILGGFNIYAYVHDVNSWVDVFGLSKYSPIEVLGRKVYQNSADFGGGVPTF 171 Query: 122 EDMKRLNLG 130 D Sbjct: 172 VDPIVGKTN 180 >UniRef50_B4VFT3 Rhs protein n=4 Tax=Bacteria RepID=B4VFT3_9ACTO Length = 1253 Score = 126 bits (316), Expect = 6e-28, Method: Composition-based stats. Identities = 40/103 (38%), Positives = 51/103 (49%), Gaps = 1/103 (0%) Query: 4 LMDADGNIAWSGEYDEWGNQLNEENPHHLHQPYRLPGQQYDKESGLYYNRNRYYDPLQGR 63 L+ DG AW WG + + P R PGQ YD ESGL++N R YDP Sbjct: 1035 LLAEDGTTAWHTRATLWGTTTWNSDATA-YTPLRFPGQYYDPESGLHHNYFRTYDPETAH 1093 Query: 64 YITQDPIGLEGGWSLYAYPLNPVNGIDPLGLSPADVALIRRKD 106 Y++ DP+GL + Y NP DPLGL+PAD R + Sbjct: 1094 YLSPDPLGLAPAPNPTTYVHNPHTWSDPLGLTPADECKYRVEQ 1136 >UniRef50_B3PHE6 RHS Repeat family n=5 Tax=cellular organisms RepID=B3PHE6_CELJU Length = 3998 Score = 126 bits (316), Expect = 6e-28, Method: Composition-based stats. Identities = 43/168 (25%), Positives = 67/168 (39%), Gaps = 10/168 (5%) Query: 4 LMDADGNIAWSGEYDEWGNQLNEENPHHLHQPYRLPGQQYDKESGLYYNRNRYYDPLQGR 63 + D++GNI + YD +GN + + NP P+ G D ++GL R YDP GR Sbjct: 3781 ITDSNGNIVKTVSYDSYGNIIEDSNPE-FQIPFGFAGGLKDDDTGLIRFGYRDYDPETGR 3839 Query: 64 YITQDPIGLEGG-WSLYAYP-LNPVNGIDPLGLSPADVALIRRKDQLNHQRAW------- 114 + +DPIG EGG +LY Y + +N ID GL R Q Sbjct: 3840 WTARDPIGFEGGDTNLYGYVLGDAINFIDIDGLQRNTSNSWNRHQQYYGGTGMTRSQLSQ 3899 Query: 115 DILSDTYEDMKRLNLGGTDQFFHCMAFCRVSKLNDAGVSRSAKGLGYE 162 +D + + + ++ +G S+ GY+ Sbjct: 3900 QYRADQLKQWASGRNPDLHWWLGNLPDNSQIQIVGSGGGTSSLIFGYQ 3947 >UniRef50_B4EGW8 RHS-family protein n=9 Tax=Burkholderiaceae RepID=B4EGW8_BURCJ Length = 1515 Score = 126 bits (316), Expect = 6e-28, Method: Composition-based stats. Identities = 45/166 (27%), Positives = 62/166 (37%), Gaps = 13/166 (7%) Query: 4 LMDADGNIAWSGEYDEWGNQL--------NEENPHHLHQPYRLPGQQYDKESGLYYNRNR 55 + D G W Y WG L + + R GQ D E+GL YN NR Sbjct: 1267 VFDEQGRPVWKAAYSLWGKLLPVKRPANDADCGATSIDTTLRFSGQWADDETGLNYNLNR 1326 Query: 56 YYDPLQGRYITQDPIGLEGGWSLYAYPLNPVNGIDPLGLSPADVALIRRKDQLNHQRAWD 115 YYDP G+Y++ DPIGL GG AY +P IDPLGL A + + + Sbjct: 1327 YYDPDSGQYLSADPIGLLGGARTQAYVHDPSQWIDPLGLQGCKPAGKKISMRAYRYEMPE 1386 Query: 116 ILSDTYEDMKRLNLGGTDQFFHCMAFCRVSKLNDAGVSRSAKGLGY 161 T++ + H + + A +A Sbjct: 1387 RFDTTWDAHEWNVAA-----RHRYTKKGLGGVYGADSPATALAEVT 1427 >UniRef50_A7BNN3 Putative uncharacterized protein n=1 Tax=Beggiatoa sp. SS RepID=A7BNN3_9GAMM Length = 212 Score = 126 bits (316), Expect = 8e-28, Method: Composition-based stats. Identities = 43/138 (31%), Positives = 66/138 (47%), Gaps = 20/138 (14%) Query: 4 LMDADGNIAWSGEYDEWGNQLNEENPHHLHQPYRLPGQQYDKESGLYYNRNRYYDPLQGR 63 L+D+ GN+ W+ Y+ +G + + + R GQ +D E+ L+YN RYY+P GR Sbjct: 29 LIDSQGNVVWAAVYEAFGKARVD--VNLVENHLRFAGQYFDSETRLHYNYYRYYEPTIGR 86 Query: 64 YITQDPIGLEGGWSLYAYP-LNPVNGIDPLGLSPADVALIRRKDQLNHQRAWDILSDTYE 122 Y+ DPI + YAY NP++ +DP GL + D L D ++ Sbjct: 87 YLRVDPI---PSVNQYAYVSGNPLSYVDPFGLEKEIMM--------------DQLFDLFD 129 Query: 123 DMKRLNLGGTDQFFHCMA 140 +M L+L F C A Sbjct: 130 EMAALDLAFDYPKFGCEA 147 >UniRef50_A4FJ21 YD repeat protein n=1 Tax=Saccharopolyspora erythraea NRRL 2338 RepID=A4FJ21_SACEN Length = 1670 Score = 125 bits (315), Expect = 8e-28, Method: Composition-based stats. Identities = 46/151 (30%), Positives = 65/151 (43%), Gaps = 2/151 (1%) Query: 2 LALMDADGNIAWSGEYDEWGNQLNEENPHHLHQPYRLPGQQYDKESGLYYNRNRYYDPLQ 61 L+ +G +AW G WG +L + P+ + P R PGQ D E+GL YN +RYYDP Sbjct: 1271 TDLIAPNGVLAWHGRTSLWGKELPVQ-PNGVTTPLRFPGQYADAETGLNYNVHRYYDPAT 1329 Query: 62 GRYITQDPIGLEGGWSLYAYPLNPVNGIDPLGLSPADVALIRRKDQLNHQRAWDILSDTY 121 GRY++QDP+GL + AY NP + DPLGL+ + T Sbjct: 1330 GRYLSQDPLGLAPAPNPVAYVDNPHSAADPLGLAKGGTKRKHDGSGNQPPTTGNGNKRTK 1389 Query: 122 EDMKRLNLGGTDQFFHCMAFCRVSKLNDAGV 152 +D + + S G Sbjct: 1390 KDP-WDRKPFSHDTEKKTQADKTSTPPGTGY 1419 >UniRef50_A9EVR3 Conserved carbohydrate-binding protein, Rhs family n=1 Tax=Sorangium cellulosum 'So ce 56' RepID=A9EVR3_SORC5 Length = 1367 Score = 125 bits (315), Expect = 9e-28, Method: Composition-based stats. Identities = 44/129 (34%), Positives = 62/129 (48%), Gaps = 4/129 (3%) Query: 4 LMDADGNIAWSGEYDEWGNQLNEENPHHLHQPYRLPGQQYDKESGLYYNRNRYYDPLQGR 63 L D G W+ E D G E+ P+R PGQ D E+GLYYNR RYYDP G Sbjct: 1148 LFDTSGVQVWAAETDTLGRTAVEQGA-PEDCPWRWPGQYEDPETGLYYNRFRYYDPDAGN 1206 Query: 64 YITQDPIGLEGGWSLYAYPLNPVNGIDPLGLSPADVALIRRKDQLNHQRAWDIL-SDTYE 122 Y++ DP+GL G + YAY + + DPLGL + + D + + + Sbjct: 1207 YVSPDPLGLLAGTAEYAYAPDSLVWFDPLGL--IVLQQVPYNDHPLFGAVSEFIQGKSRS 1264 Query: 123 DMKRLNLGG 131 D++ N+ Sbjct: 1265 DLRGRNVAA 1273 >UniRef50_UPI00016B0868 Rhs family protein n=1 Tax=Burkholderia pseudomallei 112 RepID=UPI00016B0868 Length = 242 Score = 125 bits (315), Expect = 1e-27, Method: Composition-based stats. Identities = 38/95 (40%), Positives = 53/95 (55%), Gaps = 2/95 (2%) Query: 4 LMDADGNIAWSGEYDEWGNQLNEENPHHLHQPYRLPGQQYDKESGLYYNRNRYYDPLQGR 63 + D+ G W+ YDE G + ++ P L GQ D E+GL+YNR+RYYDP G Sbjct: 1 MTDSSGREVWATGYDENGRLVPINA--DIYNPIHLQGQYRDAETGLHYNRHRYYDPALGS 58 Query: 64 YITQDPIGLEGGWSLYAYPLNPVNGIDPLGLSPAD 98 +I++DP+GL G +LY Y N + DPLG Sbjct: 59 FISKDPLGLAAGVNLYRYAPNSIGWADPLGFQAKP 93 >UniRef50_C8QVK4 YD repeat protein n=4 Tax=Desulfurivibrio alkaliphilus AHT2 RepID=C8QVK4_9DELT Length = 2439 Score = 125 bits (315), Expect = 1e-27, Method: Composition-based stats. Identities = 48/201 (23%), Positives = 80/201 (39%), Gaps = 13/201 (6%) Query: 2 LALMDADGNIAWSGEYDEWGNQLNEENPHHLHQPYRLPGQQYDKESGLYYNRNRYYDPLQ 61 L + + G IA +YDE+GN L + NP QP+ G +D+++ L R YDP Sbjct: 2207 LVVNTSTGEIAQRIDYDEFGNVLQDTNPG--FQPFGFAGGLHDRDTNLTRFGARDYDPQT 2264 Query: 62 GRYITQDPIGLEGG-WSLYAYP-LNPVNGIDPLGLSPADVALIRRKDQLNHQR--AWDIL 117 GR+ +DPI GG +LY Y +P+N IDP G A + A+ Sbjct: 2265 GRWTAKDPILFAGGDTNLYGYVLNDPINWIDPEGKIGIAGAAVGGVIGAVSGALGAYTSG 2324 Query: 118 SDTYEDMKRLNLGGTDQFFHCMAFCRVSKLNDAGVSRSAKGLGYEKEIRDYGLNLFGMYG 177 ++++ GG + L AG+ I+++G G+ Sbjct: 2325 GNSWDIASAAFTGGLGGAVY-------GFLPGAGLLAHIGKGAAIGGIQNFGAQFMGILN 2377 Query: 178 RKVKLSHSEMIEDNKKDLAVN 198 + + + + A+ Sbjct: 2378 DPCQRFNYGSLAGSILGGAIG 2398 >UniRef50_B3PEN8 Rhsfamily protein n=2 Tax=Cellvibrio japonicus Ueda107 RepID=B3PEN8_CELJU Length = 1401 Score = 125 bits (314), Expect = 1e-27, Method: Composition-based stats. Identities = 40/93 (43%), Positives = 52/93 (55%), Gaps = 3/93 (3%) Query: 3 ALMDADGNIAWSGEYDEWGNQLNEENPHHLHQPYRLPGQQYDKESGLYYNRNRYYDPLQG 62 ++D GNI W +Y +G + + R PGQ +D E+GL++N R YD G Sbjct: 1187 QMLDDAGNIVWEAQYSAFGKAHITIDT--VENNLRFPGQYFDSETGLHHNYFRDYDSALG 1244 Query: 63 RYITQDPIGLEGGWSLYAYPL-NPVNGIDPLGL 94 RYI DPIGL GG++ Y Y NP IDPLGL Sbjct: 1245 RYIQSDPIGLGGGFNTYVYAYQNPAVLIDPLGL 1277 >UniRef50_Q0IFS2 Putative uncharacterized protein (Fragment) n=1 Tax=Aedes aegypti RepID=Q0IFS2_AEDAE Length = 2805 Score = 125 bits (314), Expect = 1e-27, Method: Composition-based stats. Identities = 32/135 (23%), Positives = 57/135 (42%), Gaps = 13/135 (9%) Query: 3 ALMDADG---------NIAWSGEYDEWGNQLNEENPHHL-HQPYRLPGQQYDKESGLYYN 52 + D +G + + +Y +GN + + H YR GQ++D+E+GLY Sbjct: 2553 VITDHEGSIRLVVKGDEVVAAYDYLPYGNLMRVYGNNPEGHISYRYTGQEWDEETGLYNY 2612 Query: 53 RNRYYDPLQGRYITQDPIGLEGGWSLYAYPLN-PVNGIDPLGLSPADVALIRRKDQLNHQ 111 R+YDP GR+ DP G +S Y Y N P++ +DP G + + + Sbjct: 2613 HARFYDPSIGRFYQIDPKG--QYFSPYKYAGNSPISMVDPDGQFAWFLIPLIVGLAIGGA 2670 Query: 112 RAWDILSDTYEDMKR 126 ++ + + Sbjct: 2671 YLGGSAANRNWNPAK 2685 >UniRef50_D1AA66 YD repeat-containing protein n=1 Tax=Thermomonospora curvata DSM 43183 RepID=D1AA66_THECD Length = 1197 Score = 125 bits (313), Expect = 1e-27, Method: Composition-based stats. Identities = 38/95 (40%), Positives = 50/95 (52%), Gaps = 1/95 (1%) Query: 2 LALMDADGNIAWSGEYDEWGNQLNEENPHHLHQPYRLPGQQYDKESGLYYNRNRYYDPLQ 61 + L+ DG+I+W WG + P PGQ YD E+GLY+N RYY P Sbjct: 1027 VELVAPDGSISWRSHAAVWGAP-YVSRADQISCPLGFPGQYYDSETGLYFNYFRYYSPFD 1085 Query: 62 GRYITQDPIGLEGGWSLYAYPLNPVNGIDPLGLSP 96 GRY++ DP+GL + Y Y +NP DP GL P Sbjct: 1086 GRYLSPDPLGLSPQPNPYIYVINPFVWADPFGLYP 1120 >UniRef50_C7BJB9 Insecticidal toxin complex TccC n=1 Tax=Photorhabdus asymbiotica subsp. asymbiotica ATCC 43949 RepID=C7BJB9_PHOAA Length = 926 Score = 125 bits (313), Expect = 1e-27, Method: Composition-based stats. Identities = 51/253 (20%), Positives = 84/253 (33%), Gaps = 25/253 (9%) Query: 5 MDADGNIAWSGEYDEWGNQLNEENPHHLHQPY---RLPGQQYDKESGLYYNRNRYYDPLQ 61 +D G I EY +G + + Y R G++ DK +GLYY R+RYY P Sbjct: 592 LDTKGKIISQEEYYPYGGTAIWTARNQIEASYKTVRYSGKERDK-TGLYYYRHRYYQPWL 650 Query: 62 GRYITQDPIGLEGGWSLYAYP-LNPVNGIDPLGLSPADVALIRRKDQLNHQRA------- 113 GR+++ DP G G +LY NP+ D G + D A K+ Sbjct: 651 GRWLSADPAGTVDGLNLYRMVKNNPIRYQDESGTNANDKAQAIFKEGKKIAINQLKIASN 710 Query: 114 ---WDILSDTYEDMKRLNLGGTDQFFHCMAFCRVSKLNDAG---------VSRSAKGLGY 161 S+ ++ R+ GG + + G V G Sbjct: 711 FLKDSKNSENALEIYRIFFGGHQDIEQLPQWKKRIDSVIYGLDKLKTTKHVHYQQDKSGS 770 Query: 162 EKEIRDYGLNLFGMYGRKVKLSHSEMIEDNKKDL-AVNDHGLTCPSTTDCSDRCSDYINP 220 + D ++ + + K + + D K + G + + + Sbjct: 771 SSTVADLNVDEYKKWSEGNKSIYVNVYADALKRVYEDPLLGREHVAHIAIHELSHGVLRT 830 Query: 221 EHKKTIKALQDAG 233 + K I L G Sbjct: 831 QDHKYIGVLSSPG 843 >UniRef50_B0SBE5 Putative uncharacterized protein n=2 Tax=Leptospira biflexa serovar Patoc RepID=B0SBE5_LEPBA Length = 623 Score = 125 bits (313), Expect = 2e-27, Method: Composition-based stats. Identities = 45/229 (19%), Positives = 78/229 (34%), Gaps = 19/229 (8%) Query: 1 MLALMDADGNIAW--------SGEYDEWGNQLNEENPHHLHQPYRLPGQQYDKESGLYYN 52 + + D +GN+ Y +G L ++ ++ GQ+ D+ESGLYY Sbjct: 303 ITMITDGNGNVLAGGERGGKSHITYKPYGEILRTDSYGPDITKFKYTGQEEDQESGLYYY 362 Query: 53 RNRYYDPLQGRYITQDPIGLEG---GWSLYAYP-LNPVNGIDPLGLSPADVALIRRKDQL 108 ++RYYD R+++ D + G + Y NP+ D G + L Sbjct: 363 KSRYYDSNIARFVSNDGMVFPDKDQGMNRMMYVEGNPLKWRDRSGNRISTPLAWGIMGAL 422 Query: 109 NHQRAWDILSDTYEDMKRLNLGGTD---QFFHCMAFCRVSKLNDAGVSRSAKGLGYEKEI 165 + QR ++ + + + +F +F N A + EK Sbjct: 423 SAQRYGLSAAEGFALGYGVGNRINNRSKRFNLKRSFDNTLGNNGAHGWLTRNAFSTEKIS 482 Query: 166 RDYGLNLFGMYGRKVKLSHSEMIEDNKKDLAVN-DHGLTCPSTTDCSDR 213 R Y G G + S +N AV+ + C Sbjct: 483 RWYN---RGKSGFGIGSFGSPDKYENTHQYAVHTTLNIMLRDGETCKRL 528 >UniRef50_A3KUM5 Rhs family protein n=10 Tax=Pseudomonas aeruginosa RepID=A3KUM5_PSEAE Length = 931 Score = 124 bits (312), Expect = 2e-27, Method: Composition-based stats. Identities = 45/99 (45%), Positives = 53/99 (53%), Gaps = 2/99 (2%) Query: 5 MDADGNIAWSGEYDEWGNQLNEENPHHLHQPYRLPGQQYDKESGLYYNRNRYYDPLQGRY 64 DA G IAW + D +G R PGQ YD ESGL+YN R YDP GRY Sbjct: 690 TDASGQIAWQWQSDAFGRGEALSQGSTQVN-LRFPGQYYDAESGLHYNYFRDYDPETGRY 748 Query: 65 ITQDPIGLEGGWSLYAYP-LNPVNGIDPLGLSPADVALI 102 + DPIGL+GG + Y Y NP+ DP GL+PA L Sbjct: 749 VESDPIGLKGGLNTYGYVYGNPLTYSDPKGLTPAAAGLC 787 >UniRef50_A9GT29 Insecticidal toxin complex-like protein n=1 Tax=Sorangium cellulosum 'So ce 56' RepID=A9GT29_SORC5 Length = 2426 Score = 124 bits (312), Expect = 2e-27, Method: Composition-based stats. Identities = 31/93 (33%), Positives = 45/93 (48%), Gaps = 4/93 (4%) Query: 5 MDADGNIAWSGEYDEWGNQLNEENPHHLH---QPYRLPGQQYDKESGLYYNRNRYYDPLQ 61 +D G + EY +G + + YR G++ D+E+GLYY+ RYY P Sbjct: 2046 VDGAGLVIGYEEYHPFGTTAYWSAASGIEVSQRRYRYTGKEKDEETGLYYHGARYYAPWL 2105 Query: 62 GRYITQDPIGLEGGWSLYAYP-LNPVNGIDPLG 93 GR+ DP G+ G +LY Y P+ DP G Sbjct: 2106 GRWTAADPAGMVDGPNLYMYVSGRPITLTDPSG 2138 >UniRef50_Q4VQB1 SGS1 n=4 Tax=Aedes/Ochlerotatus group RepID=Q4VQB1_AEDAE Length = 3060 Score = 124 bits (312), Expect = 2e-27, Method: Composition-based stats. Identities = 39/175 (22%), Positives = 66/175 (37%), Gaps = 15/175 (8%) Query: 2 LALMDADGNIAWSGEYDEWGNQLNEENPH-HLHQPYRLPGQQYDKESGLYYNRNRYYDPL 60 L+ G + + +Y +G + + + H +R GQ++D+E+GLY R YDP Sbjct: 2641 TRLVIHQGKVVAAYDYLPYGQMIRKYGSNPEAHIAFRYTGQEFDEETGLYNYHARLYDPD 2700 Query: 61 QGRYITQDPIGLEGGWSLYAYPLN-PVNGIDPLGLSPADVALIRRKDQLNHQRAWDILSD 119 GR+ DP+ E S Y Y N PV+ IDP G + L+ + ++ Sbjct: 2701 IGRFFQMDPM--EQYASPYKYAGNSPVSQIDPDGQIAVTLVLMIIGAIVGAYLGAASANN 2758 Query: 120 TYEDMKRLNLGGTDQFFHCMAFCRVSKLNDAGVSRSAKGLGYEKEIRDYGLNLFG 174 ++ + + AG A + Y +FG Sbjct: 2759 SW-----------NPAKWAWGDKKTWIGLFAGAIMGAFAVYGGAATFSYFTAMFG 2802 >UniRef50_Q395C2 Rhs family protein n=1 Tax=Burkholderia sp. 383 RepID=Q395C2_BURS3 Length = 190 Score = 124 bits (311), Expect = 3e-27, Method: Composition-based stats. Identities = 42/80 (52%), Positives = 49/80 (61%) Query: 29 PHHLHQPYRLPGQQYDKESGLYYNRNRYYDPLQGRYITQDPIGLEGGWSLYAYPLNPVNG 88 H + P R PGQ YD+ESGL+YNR RY DP GRYI QDPIGL+GG + Y Y NPV Sbjct: 6 AHAVDNPIRFPGQYYDRESGLHYNRFRYCDPQVGRYINQDPIGLKGGANSYVYAHNPVTL 65 Query: 89 IDPLGLSPADVALIRRKDQL 108 DPLGL L+ + Sbjct: 66 SDPLGLQSTGPVLLGMGNLY 85 >UniRef50_C7QAC6 YD repeat protein n=2 Tax=Bacteria RepID=C7QAC6_CATAD Length = 1508 Score = 124 bits (311), Expect = 3e-27, Method: Composition-based stats. Identities = 38/97 (39%), Positives = 52/97 (53%), Gaps = 2/97 (2%) Query: 2 LALMDADGNIAWSGEYDEWGN--QLNEENPHHLHQPYRLPGQQYDKESGLYYNRNRYYDP 59 L+ DG +AW D +G + L P R GQ +D E+GL+YN RYYDP Sbjct: 1306 TELVTPDGRVAWYQNTDLYGQSVAVATGGDPDLECPLRFAGQYFDAETGLHYNVQRYYDP 1365 Query: 60 LQGRYITQDPIGLEGGWSLYAYPLNPVNGIDPLGLSP 96 Y+T DP+GL + +AY NP+ +DPLGL+ Sbjct: 1366 AIAAYLTPDPLGLAPALNDHAYVPNPLTMVDPLGLAS 1402 >UniRef50_D1PTA9 YD repeat protein n=1 Tax=Prevotella bergensis DSM 17361 RepID=D1PTA9_9BACT Length = 320 Score = 124 bits (311), Expect = 3e-27, Method: Composition-based stats. Identities = 40/189 (21%), Positives = 73/189 (38%), Gaps = 7/189 (3%) Query: 4 LMDADGNIAWSGEYDEWGNQLNEENPHHLHQPYRLPGQQYDKESGLYYNRNRYYDPLQGR 63 + + DG ++ EY +G EE + + PY ++ D+E GLYY RYY+P Sbjct: 131 ITNLDGEVSQHIEYVPFGEVFIEERNNTWNTPYLFNAKELDEEIGLYYYGARYYEPRLSL 190 Query: 64 YITQDPIGLE-GGWSLYAYP-LNPVNGIDPLGLSPADVALIRRKDQLNHQRAWDILSDTY 121 +++ DP E ++Y Y NP+ +DP G ++ + D + + + + Sbjct: 191 WMSVDPSAEEKPWLTIYCYTRNNPIILVDPDGRDEWEIN--SKGDIVKQIKTDKHDAFFF 248 Query: 122 EDMKRLNLGGTDQFFHCMAFCRVSKLNDAGVSRSAKGLGYEKEIRDYGLNLFGMYGRKVK 181 + G F + S G +R Y+ + LF + + Sbjct: 249 VSKAGKRVKGKSLSFEYGTIEKASGQKTDGGTRYD---VYQVRGDKHAKQLFESFAKNTS 305 Query: 182 LSHSEMIED 190 + S M Sbjct: 306 VEWSIMQTG 314 >UniRef50_C4DNE3 Putative uncharacterized protein n=1 Tax=Stackebrandtia nassauensis DSM 44728 RepID=C4DNE3_9ACTO Length = 714 Score = 124 bits (311), Expect = 3e-27, Method: Composition-based stats. Identities = 42/212 (19%), Positives = 69/212 (32%), Gaps = 8/212 (3%) Query: 5 MDADGNIAWSGEYDEWGNQLNEENPHHLHQP--YRLPGQQYDKESGLYYNRNRYYDPLQG 62 +D I EY +G E P YR G++ D+ESGLYY+ RYY P G Sbjct: 285 LDDSARIISHEEYYPYGGTAVESVRSRTETPKRYRFTGKERDEESGLYYHGARYYAPGLG 344 Query: 63 RYITQDPIGLEGGWSLYAYP-LNPVNGIDPLGLSPADVALIRRKDQLNHQRAWDILSDTY 121 R+ + DP G+ G + Y Y NP+ DP G + + + H Y Sbjct: 345 RWTSGDPKGIAEGPNPYVYTRNNPIVFADPDGREARLII-----NPVKHTVTVRTTVHLY 399 Query: 122 EDMKRLNLGGTDQFFHCMAFCRVSKLNDAGVSRSAKGLGYEKEIRDYGLNLFGMYGRKVK 181 + A+ + +A R G + Sbjct: 400 AATDAERTALREVAKKAEAYWANPTVATESEVNTAVAAKTSIPNRGTGATINNQQWTLNY 459 Query: 182 LSHSEMIEDNKKDLAVNDHGLTCPSTTDCSDR 213 ++ + + ++ G +D+ Sbjct: 460 DIKYQVHDSPTAPIKIDKSGFVVDEKQAAADK 491 >UniRef50_A9GIJ6 Conserved carbohydrate-binding protein, Rhs family n=1 Tax=Sorangium cellulosum 'So ce 56' RepID=A9GIJ6_SORC5 Length = 1429 Score = 124 bits (311), Expect = 3e-27, Method: Composition-based stats. Identities = 43/97 (44%), Positives = 54/97 (55%), Gaps = 2/97 (2%) Query: 4 LMDADGNIAWSGEYDEWGNQLNEENPHHLHQPYRLPGQQYDKESGLYYNRNRYYDPLQGR 63 L+DA GN+A + WG E P R GQ YD E+GL YNR RYYDP GR Sbjct: 1056 LLDAAGNVACELDRTVWGAARPREGART-ETPLRFLGQYYDDETGLAYNRYRYYDPAVGR 1114 Query: 64 YITQDPIGLEGGWSLYAYPLN-PVNGIDPLGLSPADV 99 YI+ DP+GL GG + ++Y N P +DP GL + Sbjct: 1115 YISVDPVGLLGGQNGFSYAGNRPTKMVDPTGLMFSTT 1151 >UniRef50_C3IAE1 Insecticidal toxin complex protein TccC (Toxin complex protein) n=1 Tax=Bacillus thuringiensis IBL 200 RepID=C3IAE1_BACTU Length = 921 Score = 124 bits (310), Expect = 3e-27, Method: Composition-based stats. Identities = 36/144 (25%), Positives = 55/144 (38%), Gaps = 5/144 (3%) Query: 5 MDADGNIAWSGEYDEWGNQLNEENPHHLHQPY---RLPGQQYDKESGLYYNRNRYYDPLQ 61 +D G I E+ +G + Y R G++ D +GLYY +RYY P Sbjct: 577 LDKQGKIISKEEFYPYGGTALWTARTEIEANYKTIRYSGKERDA-TGLYYYGHRYYIPWA 635 Query: 62 GRYITQDPIGLEGGWSLYAYP-LNPVNGIDPLGLSPADVALIRRKDQLNHQRAWDILSDT 120 GR++ DP G G +LY NP+N IDP G +P + +++ + Sbjct: 636 GRWLNPDPAGTVDGLNLYRMVRNNPINLIDPDGNAPIQITNYSKENGDLFYGLANERGRY 695 Query: 121 YEDMKRLNLGGTDQFFHCMAFCRV 144 E R +D Sbjct: 696 IEAALRGKTFVSDSAESEPMIIDQ 719 >UniRef50_B2AU51 Predicted CDS Pa_1_17960 n=1 Tax=Podospora anserina RepID=B2AU51_PODAN Length = 2454 Score = 124 bits (310), Expect = 4e-27, Method: Composition-based stats. Identities = 45/181 (24%), Positives = 76/181 (41%), Gaps = 10/181 (5%) Query: 5 MDADGNIAWSGEYDEWGNQLNEENPHHLHQP--YRLPGQQYDKESGLYYNRNRYYDPLQG 62 +D + N+ EY +G+ + + P YRL ++D+E+GLY+ RYY P G Sbjct: 2101 LDDEANLVSYEEYSPFGSVVYSAVYVEVEAPRKYRLARYEHDRETGLYHCGKRYYCPWLG 2160 Query: 63 RYITQDPIGLEGGWSLYAYPLN-PVNGIDPLGLSPADVALIRRKDQLN----HQRAWDIL 117 R+ + DP G G +LY+Y N PVN +DP G S V+ + +D+ Sbjct: 2161 RWTSADPAGTVDGPNLYSYVRNDPVNWVDPKGTSGKKVSPEKPQDKGGITGDQPEINSRQ 2220 Query: 118 SDTYEDMKRLNLGGTDQFFHCMAFCRVSKLND---AGVSRSAKGLGYEKEIRDYGLNLFG 174 + +G D + AF ++ + G+ + K R + + G Sbjct: 2221 VILGDIAGSEIVGTLDTPENINAFRKLVENKLKKEEGIIKGLWNKFSGKIARKAAVIVLG 2280 Query: 175 M 175 Sbjct: 2281 T 2281 >UniRef50_B7GX39 Protein rhsD n=4 Tax=Acinetobacter baumannii RepID=B7GX39_ACIB3 Length = 1590 Score = 123 bits (309), Expect = 4e-27, Method: Composition-based stats. Identities = 36/97 (37%), Positives = 51/97 (52%), Gaps = 4/97 (4%) Query: 4 LMDADGNIAWSGEYDEWGNQ----LNEENPHHLHQPYRLPGQQYDKESGLYYNRNRYYDP 59 + + G W D WG ++ R GQ YD+E+ L+YNR RYY+P Sbjct: 1301 MTNIRGECVWEILQDTWGAVSQIKALNQDNPFEQNNLRFQGQYYDRETELHYNRYRYYEP 1360 Query: 60 LQGRYITQDPIGLEGGWSLYAYPLNPVNGIDPLGLSP 96 RY+++DPIGLEGG + +Y +P IDP GL+ Sbjct: 1361 HSARYVSKDPIGLEGGMNTSSYVSDPNQWIDPKGLNS 1397 >UniRef50_B0XA86 Putative uncharacterized protein n=1 Tax=Culex quinquefasciatus RepID=B0XA86_CULQU Length = 2714 Score = 123 bits (309), Expect = 4e-27, Method: Composition-based stats. Identities = 40/190 (21%), Positives = 68/190 (35%), Gaps = 11/190 (5%) Query: 2 LALMDADGNIAWSGEYDEWGNQLNEENPHHL-HQPYRLPGQQYDKESGLYYNRNRYYDPL 60 L+ +G + + +Y +G +L N H Y+ GQ++D+E+GLY R YDP Sbjct: 2085 TRLVVKNGEVVAAYDYLPYGQRLRFYNTDPDGHVAYQYTGQEFDEETGLYNYHARLYDPE 2144 Query: 61 QGRYITQDPIGLEGGWSLYAYPLN-PVNGIDPLGLSPADVA------LIRRKDQLNHQRA 113 GR+ DP + S Y Y N PV +DP G +A + + ++ Sbjct: 2145 LGRFYQTDP--QDQYPSPYKYAGNSPVMMVDPDGEFALMIACIVFAIVGSYLGAASVNQS 2202 Query: 114 WDILSDTYEDMKRLNLGGTDQFFHCMA-FCRVSKLNDAGVSRSAKGLGYEKEIRDYGLNL 172 W+ L+ ++ + + V+ A + Sbjct: 2203 WNPLAWNWKSGHTYLGMFSGAVAGAVLPIGGVASFGYFSALGGATFASFATASVALAGAY 2262 Query: 173 FGMYGRKVKL 182 GM G Sbjct: 2263 LGMAGAANDW 2272 >UniRef50_Q12SZ6 Putative uncharacterized protein n=1 Tax=Shewanella denitrificans OS217 RepID=Q12SZ6_SHEDO Length = 520 Score = 123 bits (308), Expect = 5e-27, Method: Composition-based stats. Identities = 51/177 (28%), Positives = 77/177 (43%), Gaps = 17/177 (9%) Query: 1 MLALMDADGNIAWSGEYDEWGNQLNEENPHHLHQPYRLPGQQYDKESGLYYNRNRYYDPL 60 ++A D GNI Y+ +G +L + G DK+ GL Y + RYYDPL Sbjct: 236 VVAESDEAGNIISRSHYEPFGKRLGGDKAG-----IGYTGHLQDKDLGLTYMQARYYDPL 290 Query: 61 QGRYITQDPIGLEG--GWSLYAYP-LNPVNGIDPLGLSPADVALIRRKDQLNHQ-----R 112 GR+ + DPI G ++ YAY NP IDP G + VA I N + Sbjct: 291 IGRFYSNDPISYRGVHSFNRYAYANNNPYKYIDPTGNNEEYVAGISVGIGFNAEDVNLIA 350 Query: 113 AWDILSDTYEDMKRLNLGGT---DQFFHCMAFCRVSKLND-AGVSRSAKGLGYEKEI 165 ++ Y ++ ++LG +Q F + +S L+ G++RS G K Sbjct: 351 TFENSVYNYNFLEGVSLGQRIVEEQMFQQLQLGELSSLSMLLGIARSNGKFGVVKGG 407 >UniRef50_C4KA75 YD repeat protein n=1 Tax=Thauera sp. MZ1T RepID=C4KA75_THASP Length = 1892 Score = 123 bits (308), Expect = 6e-27, Method: Composition-based stats. Identities = 40/93 (43%), Positives = 51/93 (54%), Gaps = 3/93 (3%) Query: 5 MDADGNIAWSGEYDEWGNQLNEENPHHLHQ--PYRLPGQQYDKESGLYYNRNRYYDPLQG 62 DADG W + +G +P H RLPGQ +D E+GL+YNR RYY P G Sbjct: 1388 TDADGEPLWRARHAPFGAATVTTSPRHPDFTLDLRLPGQVFDAETGLHYNRRRYYAPTLG 1447 Query: 63 RYITQDPIGLEGGWSLYAY-PLNPVNGIDPLGL 94 Y+T DP+G G + YAY NP+ +DP GL Sbjct: 1448 EYLTPDPLGTPDGPNPYAYAAFNPLRNVDPDGL 1480 >UniRef50_D1VXD6 RHS repeat-associated core domain protein n=4 Tax=Prevotella RepID=D1VXD6_9BACT Length = 678 Score = 123 bits (308), Expect = 6e-27, Method: Composition-based stats. Identities = 40/171 (23%), Positives = 68/171 (39%), Gaps = 9/171 (5%) Query: 4 LMDADGNIAWSGEYDEWGNQLNEENPHHLHQPYRLPGQQYDKESGLYYNRNRYYDPLQGR 63 + D D NI Y +G L +E+ PY+ G+++D+E+GLYY RY +P Sbjct: 370 ITDKDANITQFDAYLPYGELLVDEHSSSEELPYKFNGKEFDEETGLYYYGARYMNPNTSL 429 Query: 64 YITQDPIGLEGGWSLYAYP---LNPVNGIDPLGLSPADVAL-----IRRKDQLNHQRAWD 115 + D + + ++ Y NP+ +DP G SP A+ ++ W Sbjct: 430 WYGVDALTEK-NPNVTGYCYTFNNPIKLLDPDGNSPIGAAVEGISAFVVSAGVDFISNWI 488 Query: 116 ILSDTYEDMKRLNLGGTDQFFHCMAFCRVSKLNDAGVSRSAKGLGYEKEIR 166 Y+ R GT F ++ AG +++ + K R Sbjct: 489 FEGMDYKTAFRHVRWGTAAFDGISTAAISLLVDGAGTTKTIGKIANSKAGR 539 >UniRef50_Q2Y592 Peptidase C39, bacteriocin processing n=1 Tax=Nitrosospira multiformis ATCC 25196 RepID=Q2Y592_NITMU Length = 1599 Score = 123 bits (308), Expect = 6e-27, Method: Composition-based stats. Identities = 42/131 (32%), Positives = 61/131 (46%), Gaps = 3/131 (2%) Query: 3 ALMDADGNIAWSGEYDEWGNQLNEENPHHLHQPYRLPGQQYDKESGLYYNRNRYYDPLQG 62 + +G S +YD +GNQ+ + +R G Y ++SGLY R YDP Sbjct: 1369 VMAAQNGAKVASYDYDPYGNQIAGSGRISVD--FRYAGMFYHQQSGLYLTNFRAYDPKTA 1426 Query: 63 RYITQDPIGLEGGWSLYAYP-LNPVNGIDPLGLSPADVALIRRKDQLNHQRAWDILSDTY 121 +++++DPIG +GG +LY Y NP+N IDPLGL L + DI + Sbjct: 1427 KWLSRDPIGEKGGLNLYGYVGGNPINMIDPLGLRALPWILGGASSDIATPDPSDIAWQKW 1486 Query: 122 EDMKRLNLGGT 132 L G T Sbjct: 1487 AGWAILITGAT 1497 >UniRef50_Q72U39 Cytoplasmic membrane protein n=2 Tax=Leptospira interrogans RepID=Q72U39_LEPIC Length = 2379 Score = 123 bits (308), Expect = 6e-27, Method: Composition-based stats. Identities = 48/218 (22%), Positives = 83/218 (38%), Gaps = 15/218 (6%) Query: 1 MLALMDADGNI-------AWSGEYDEWGNQLNEENPHHLHQPYRLPGQQYDKESGLYYNR 53 + + D GN Y+ +G+ + ++ Y+ GQ DKE+GLYY + Sbjct: 2070 ITMITDGAGNPASGPEPGVSFVSYEPYGSIIRNDSYGPDIFRYKFTGQIEDKETGLYYYK 2129 Query: 54 NRYYDPLQGRYITQDPIGLEG---GWSLYAYP-LNPVNGIDPLGLSPADVALIRRKDQLN 109 +R+Y+P GR++ D + G + Y Y NPV+ DP G + + Sbjct: 2130 SRFYEPTLGRFLQADSVIAPDSVNGMNRYMYVDGNPVSYRDPSGHISGPDMMHMLNRIVG 2189 Query: 110 HQRAWDILSDTYEDMKRLN--LGGTDQFFHCMAFCRV-SKLNDAGVSRSAKGLGYEKEIR 166 H D S + N G ++F H F +KL S LGY Sbjct: 2190 HAMGKDFNSKGLDKKLSTNGISKGVNRFVHNATFVHNPTKLRGMRDSTKGAILGYLMTGS 2249 Query: 167 DYGLNLFGMYGRKVKLSHSE-MIEDNKKDLAVNDHGLT 203 G ++G + + ++ K ++D ++ Sbjct: 2250 FEGAFDGYLWGNAKDQTKIDNKRKNYKWAFDLSDVVIS 2287 >UniRef50_C1B7W9 Putative uncharacterized protein n=1 Tax=Rhodococcus opacus B4 RepID=C1B7W9_RHOOB Length = 514 Score = 122 bits (307), Expect = 7e-27, Method: Composition-based stats. Identities = 38/107 (35%), Positives = 51/107 (47%), Gaps = 4/107 (3%) Query: 2 LALMDA-DGNIAWSGEYDEWGNQLNEENPHHLHQPYRLPGQQYDKESGLYYNRNRYYDPL 60 + L+D + D WG+ R PGQ +D E+GL+YN +RYY+P Sbjct: 319 VELVDPRTADSVADATTDLWGHTTWRGATD---THLRFPGQYHDPETGLHYNLHRYYNPH 375 Query: 61 QGRYITQDPIGLEGGWSLYAYPLNPVNGIDPLGLSPADVALIRRKDQ 107 RY+TQDP+GL + YP NP DPLGL P I + Sbjct: 376 TARYLTQDPLGLAPSPNPNTYPHNPTGWTDPLGLVPCPPTTIEGPNG 422 >UniRef50_B2PYS3 Putative uncharacterized protein n=1 Tax=Providencia stuartii ATCC 25827 RepID=B2PYS3_PROST Length = 191 Score = 122 bits (307), Expect = 7e-27, Method: Composition-based stats. Identities = 34/66 (51%), Positives = 42/66 (63%) Query: 30 HHLHQPYRLPGQQYDKESGLYYNRNRYYDPLQGRYITQDPIGLEGGWSLYAYPLNPVNGI 89 Q R GQ +D E+GL++N R+YDP GR+I DPIGL GG +LY Y NP+ I Sbjct: 2 ESFEQNLRYAGQYFDNETGLHFNTFRFYDPQIGRFIMPDPIGLLGGVNLYQYAPNPLVWI 61 Query: 90 DPLGLS 95 DP GLS Sbjct: 62 DPWGLS 67 >UniRef50_A6DLJ6 Putative uncharacterized protein n=1 Tax=Lentisphaera araneosa HTCC2155 RepID=A6DLJ6_9BACT Length = 2320 Score = 122 bits (307), Expect = 7e-27, Method: Composition-based stats. Identities = 46/163 (28%), Positives = 67/163 (41%), Gaps = 7/163 (4%) Query: 1 MLALMDADGNIAWSGEYDEWG------NQLNEENPHHLHQPYRLPGQQYDKESGLYYNRN 54 ++A+ D GN+ S Y +G E G++YD ESGL++ RN Sbjct: 1850 VVAITDETGNLLESYSYTSFGIRTIYNQTGQEIANSAYGITAGYTGREYDSESGLWHYRN 1909 Query: 55 RYYDPLQGRYITQDPIGLEGGWSLYAYP-LNPVNGIDPLGLSPADVALIRRKDQLNHQRA 113 R Y GR++ DP G G +LYAY NP+N IDP GL ++ + Sbjct: 1910 RMYSAEIGRFMQVDPAGFVDGLNLYAYVKNNPINFIDPWGLQALNLNSRGEGNFDWSTGK 1969 Query: 114 WDILSDTYEDMKRLNLGGTDQFFHCMAFCRVSKLNDAGVSRSA 156 +S + L G + F+ R N+ VS S Sbjct: 1970 VTPMSMFEDQGIGLGPDGRYRKFNTNTGKRELYYNNTVVSESG 2012 >UniRef50_C0B4J4 Putative uncharacterized protein n=3 Tax=Coprococcus comes ATCC 27758 RepID=C0B4J4_9FIRM Length = 558 Score = 122 bits (307), Expect = 7e-27, Method: Composition-based stats. Identities = 40/184 (21%), Positives = 67/184 (36%), Gaps = 16/184 (8%) Query: 1 MLALMDADGNIAWSGEYDEWGNQLNEENPHHLH----QPYRLPGQQYDKESGLYYNRNRY 56 ++ ++D++G+ YD WG L + P R G YD+E+GLYY + RY Sbjct: 306 IIGIVDSEGSQVVVYRYDAWGEVLVSSDASGFGLSQINPLRYRGYYYDQETGLYYLQTRY 365 Query: 57 YDPLQGRYITQDPIGLEGGW-------SLYAYP-LNPVNGIDPLGL----SPADVALIRR 104 YDP R++ D + +LYAY NPV D G+ S A+ Sbjct: 366 YDPKVRRFLNADDASVLTKDPEQLTEKNLYAYCDDNPVMYRDDTGMFDIVSGIFGAVTNV 425 Query: 105 KDQLNHQRAWDILSDTYEDMKRLNLGGTDQFFHCMAFCRVSKLNDAGVSRSAKGLGYEKE 164 + ++ G + + +GV + L + Sbjct: 426 ATTYFAAKVTGQECGVWDLAVAAFAGLVSGMTKSSLYITLVSAFISGVGTTIGSLAGGSD 485 Query: 165 IRDY 168 I++ Sbjct: 486 IKEA 489 >UniRef50_A9C2K3 YD repeat protein n=1 Tax=Delftia acidovorans SPH-1 RepID=A9C2K3_DELAS Length = 1301 Score = 122 bits (307), Expect = 7e-27, Method: Composition-based stats. Identities = 41/125 (32%), Positives = 55/125 (44%), Gaps = 14/125 (11%) Query: 4 LMDADGNIAWSGEYDEWGNQLNEENPHHLHQ-------------PYRLPGQQYDKESGLY 50 L + G +AW +G Q R PGQ +D+E+GL Sbjct: 1066 LTNQQGQVAWQWLISGFGEVRPTTGDRGYGQTVSGPSYAQAVKFDLRYPGQVFDEETGLS 1125 Query: 51 YNRNRYYDPLQGRYITQDPIGLEGGWSLYAYPL-NPVNGIDPLGLSPADVALIRRKDQLN 109 YN +RYYD GRYI DPIGL GGW+ + Y NP++ +DPLGL + R Sbjct: 1126 YNLHRYYDAATGRYIQADPIGLAGGWNRFGYVGENPLSFVDPLGLEKVILIPTREGWTFA 1185 Query: 110 HQRAW 114 + Sbjct: 1186 AALTY 1190 >UniRef50_C4SEH6 Insecticidal toxin complex protein n=5 Tax=Yersinia RepID=C4SEH6_YERMO Length = 939 Score = 122 bits (307), Expect = 8e-27, Method: Composition-based stats. Identities = 39/159 (24%), Positives = 58/159 (36%), Gaps = 6/159 (3%) Query: 5 MDADGNIAWSGEYDEWGNQLNEENPHHLHQPY---RLPGQQYDKESGLYYNRNRYYDPLQ 61 +D G+I EY +G H L Y R G++ D +GLYY RYY P Sbjct: 574 LDTHGDIISQEEYYPFGGTAVFAARHTLEAKYKTIRYSGKERDA-TGLYYYGFRYYMPWL 632 Query: 62 GRYITQDPIGLEGGWSLYAYP-LNPVNGIDPLGLSP-ADVALIRRKDQLNHQRAWDILSD 119 GR+++ DP G G +LY NPV +D GLSP I + Sbjct: 633 GRWLSADPAGTVDGLNLYRMVRNNPVGLMDEDGLSPGVKYNFISKGYDTLADIGKRQNIK 692 Query: 120 TYEDMKRLNLGGTDQFFHCMAFCRVSKLNDAGVSRSAKG 158 + KR+ + +++ + Sbjct: 693 HFTQAKRIVTNSYLDANSIIKSAIEKLNDNSLETTQVLE 731 >UniRef50_D0HCV2 Rhs protein n=3 Tax=Vibrio mimicus VM223 RepID=D0HCV2_VIBMI Length = 1617 Score = 122 bits (307), Expect = 8e-27, Method: Composition-based stats. Identities = 48/176 (27%), Positives = 69/176 (39%), Gaps = 21/176 (11%) Query: 3 ALMDADGNIAWSGEYDEWG-----------NQLNEENPHHLHQPYRLPGQQYDKESGLYY 51 L +G + W GE WG + L+ R GQ D+ESGLYY Sbjct: 1381 ELCSENGEVVWQGEQALWGHYQQRNTFPNHGIREHAHNDELYCDLRYQGQIEDRESGLYY 1440 Query: 52 NRNRYYDPLQGRYITQDPIGLEGGWSLYAYPLNPVNGIDPLGLSPADVALIRRK------ 105 N NRYYD G+Y++QDPIG GG AY NP+ +DPLGL+P+ + Sbjct: 1441 NVNRYYDADSGQYLSQDPIGFSGGLRPQAYVFNPLEWVDPLGLAPSGCKDASGRPLSSSQ 1500 Query: 106 -DQLNHQRAWDILSDTYEDMKRLNLGGT---DQFFHCMAFCRVSKLNDAGVSRSAK 157 + + D D + +F + + G++ K Sbjct: 1501 YSVVYEAKIDDKYYPGRSDKVHFQDANKKLHEAMLADSSFAKSMEDMYPGITEGVK 1556 >UniRef50_A4ACT6 Putative uncharacterized protein n=1 Tax=Congregibacter litoralis KT71 RepID=A4ACT6_9GAMM Length = 336 Score = 122 bits (306), Expect = 9e-27, Method: Composition-based stats. Identities = 39/231 (16%), Positives = 75/231 (32%), Gaps = 20/231 (8%) Query: 2 LALMDADGNIAWSGEYDEWGNQLNEENPHH-------------LHQPYRLPGQQYDKESG 48 + D G + W Y +G +L EE+ G+ + ++G Sbjct: 42 VVATDETGAVKWREAYAPYGGRLLEESREQDCSVNPCESTNSPWENRQFYTGKYDEADTG 101 Query: 49 LYYNRNRYYDPLQGRYITQDPIGLEGG----WSLYAYP-LNPVNGIDPLGLSPADVALIR 103 L Y R+YDP GR+++ DP+ + G ++ Y+Y NP +DP G L+ Sbjct: 102 LTYFGARWYDPSLGRFLSVDPVEFQEGSVFSFNRYSYANNNPYLYVDPDGRFALFGFLVG 161 Query: 104 RKDQLNHQRAWDILSDTYEDMKRLNLGGTDQFFHCMAFCRVSKLNDAGVSRSAKGLGYEK 163 + Q LS ++ ++++ G V+ L + R+ Sbjct: 162 AGLEAARQAITGELSLSWSSAGKISISGVSGTTGVGLGRGVANLTGNVLVRAGANASAGA 221 Query: 164 EIRDYGLNLFGM-YGRK-VKLSHSEMIEDNKKDLAVNDHGLTCPSTTDCSD 212 + GR+ + G ++ + Sbjct: 222 VLGASVTAANNAVDGREMTAGVGIGAATGAIGGFGGSLIGDGLDASISLAR 272 >UniRef50_B6VUY0 Putative uncharacterized protein n=2 Tax=Bacteroides RepID=B6VUY0_9BACE Length = 1442 Score = 122 bits (306), Expect = 1e-26, Method: Composition-based stats. Identities = 42/94 (44%), Positives = 53/94 (56%), Gaps = 1/94 (1%) Query: 2 LALMDADGNIAWSGEYDEWGNQLNEENPHHLHQPYRLPGQQYDKESGLYYNRNRYYDPLQ 61 + + D GN W D +G L + P+R GQ D+E+GLYYNR RYYD Sbjct: 1219 IQMYDEQGNKTWDCTLDIYGKVLAIDKGTEFDCPFRYQGQYEDEETGLYYNRFRYYDSNA 1278 Query: 62 GRYITQDPIGLE-GGWSLYAYPLNPVNGIDPLGL 94 G YI+QDPIGLE + Y Y + +GIDPLGL Sbjct: 1279 GSYISQDPIGLESDTLNFYDYVCDLNDGIDPLGL 1312 >UniRef50_C9PY20 YD repeat protein n=1 Tax=Prevotella sp. oral taxon 472 str. F0295 RepID=C9PY20_9BACT Length = 522 Score = 122 bits (306), Expect = 1e-26, Method: Composition-based stats. Identities = 41/184 (22%), Positives = 73/184 (39%), Gaps = 10/184 (5%) Query: 4 LMDADGNIAWSGEYDEWGNQLNEENPHHLHQPYRLPGQQYDKESGLYYNRNRYYDPLQGR 63 + DA NI Y +G L +E+ PY+ G+++D+E+GLYY RY +P+ Sbjct: 187 ITDAKANITQFDAYLPYGELLVDEHSSSEEMPYKFNGKEFDQETGLYYYGARYMNPVTSL 246 Query: 64 YITQDPIGLE-GGWSLYAYP-LNPVNGIDPLGLSPADVALIRRKDQLNHQRAWDILSDTY 121 + DP+ Y Y NP++ D G S D + + Q W ++ + Sbjct: 247 WYGVDPMAESYDALGAYVYCAGNPISLSDVNGESFFDPS--------DDQSIWSGVTSYF 298 Query: 122 EDMKRLNLGGTDQFFHCMAFCRVSKLNDAGVSRSAKGLGYEKEIRDYGLNLFGMYGRKVK 181 + + + D H + R +AK + + + R G+N + R Sbjct: 299 KGVGKSFKNTYDMVRHPVNTGRALWHAARHPVNTAKAVWHTAKARWIGMNSSDINERGKA 358 Query: 182 LSHS 185 + Sbjct: 359 WGET 362 >UniRef50_C7HJS3 YD repeat protein (Fragment) n=3 Tax=Clostridium thermocellum DSM 2360 RepID=C7HJS3_CLOTM Length = 783 Score = 122 bits (306), Expect = 1e-26, Method: Composition-based stats. Identities = 34/103 (33%), Positives = 52/103 (50%), Gaps = 4/103 (3%) Query: 1 MLALMDADGNIAWSGEYDEWGNQLNEENPHHLHQPYRLPGQQYDKESGLYYNRNRYYDPL 60 + AL D G + S YD +GN L ++ + ++ G+ D +G YY R RYY+P Sbjct: 421 ITALTDGKGEVINSYSYDAFGNIL--DSVEKIENRFKYSGEMLDPVTGQYYLRARYYNPS 478 Query: 61 QGRYITQDPIGLEGGWSLYAYP-LNPVNGIDPLGLSPADVALI 102 GR++ +D G +LY Y NP+ IDP G + A+ Sbjct: 479 IGRFMQEDTFR-GDGLNLYTYVANNPIKYIDPTGHCKENAAIA 520 >UniRef50_A9GD22 Conserved carbohydrate-binding protein, Rhs family n=2 Tax=Sorangium cellulosum 'So ce 56' RepID=A9GD22_SORC5 Length = 1351 Score = 122 bits (305), Expect = 1e-26, Method: Composition-based stats. Identities = 42/142 (29%), Positives = 62/142 (43%), Gaps = 9/142 (6%) Query: 4 LMDADGNIAWSGEYDEWGNQLNEENPHHLHQPYRLPGQQYDKESGLYYNRNRYYDPLQGR 63 L+ G + + +G ++ E P R PGQ D+E+GL YNR RY+DP GR Sbjct: 1050 LVGPAGEVVCELDRSAFGAKVKEGGRTT--TPLRFPGQYEDEETGLVYNRYRYFDPALGR 1107 Query: 64 YITQDPIGLEGGWSLYAYPLN-PVNGIDPLGLSPADVALIRRKDQLNHQRAWDILSDTYE 122 Y++ DP GL+GG++ + Y N P +DP GL P + D T Sbjct: 1108 YLSADPAGLDGGFNGFDYAGNAPTRFVDPSGLMPFST------VRNAAGNNPDNKDFTDP 1161 Query: 123 DMKRLNLGGTDQFFHCMAFCRV 144 +R G + + R Sbjct: 1162 KNRRGVPGDIEGKSQGVRGGRT 1183 >UniRef50_D0KZB0 YD repeat protein n=1 Tax=Halothiobacillus neapolitanus c2 RepID=D0KZB0_HALNC Length = 1467 Score = 122 bits (305), Expect = 1e-26, Method: Composition-based stats. Identities = 52/235 (22%), Positives = 83/235 (35%), Gaps = 8/235 (3%) Query: 5 MDADGNIAWSGEYDEWGNQLNEENPHHLHQPYRLPGQQYDKESGLYYNRNRYYDPLQGRY 64 +A+ W D +G++ + + R PGQ YD E+GL+YN NRYYDP GRY Sbjct: 1193 TNANQQTVWRWNRDAFGDRQVNASSASIEMNLRYPGQYYDTETGLFYNWNRYYDPSTGRY 1252 Query: 65 ITQDPIGLEGGWSLYAYPL-NPVNGIDPLGL------SPADVALIRRKDQLNHQRAWDIL 117 T DPIGL GG + + Y NP+ IDP GL SP + + + Sbjct: 1253 ATSDPIGLSGGVNTFGYVSANPLALIDPWGLFGSAQVSPIPPGYNSGDVRGAYDSYGNSS 1312 Query: 118 SDTYEDMKRLNLGGTDQFFHCMAFCRVSKLNDAGVSRSAKGLGYEKEIRDYGLNLF-GMY 176 + A ++ + + E + + Sbjct: 1313 VYEPGYYDPNRAAAIIMALLAAGMGPAADEFAAILAHAEEASAIGGECEAAASSKLPSLD 1372 Query: 177 GRKVKLSHSEMIEDNKKDLAVNDHGLTCPSTTDCSDRCSDYINPEHKKTIKALQD 231 S S+ +K ++ HG S D + + + + + D Sbjct: 1373 DLSRAASASDRNGFSKAGRSLQKHGSRPGSKWGQEDVNVNNPSEANSRAQGLVDD 1427 >UniRef50_A1WM30 Putative uncharacterized protein n=1 Tax=Verminephrobacter eiseniae EF01-2 RepID=A1WM30_VEREI Length = 311 Score = 122 bits (305), Expect = 1e-26, Method: Composition-based stats. Identities = 44/219 (20%), Positives = 70/219 (31%), Gaps = 8/219 (3%) Query: 5 MDADGNIAWSGEYDEWGNQLNEENPHHLHQPYRLPGQQYDKESGLYYNRNRYYDPLQGRY 64 +A G + W Y +G + N G+ YD +GL Y RYY PL GR+ Sbjct: 50 TNASGAVIWKESYLPYGQRQQLPNASA-GNKLWFTGKPYDANTGLSYMGARYYMPLTGRF 108 Query: 65 ITQDPIGLEG----GWSLYAYP-LNPVNGIDPLGLSPADVALIRRKDQLNHQRAWDILSD 119 DP L ++ YAY NP +DP G V L Q D Sbjct: 109 TGMDPKDLVPEQPHSFNRYAYANNNPNKYVDPDGKIAETV-WDAFNLALGFQSLVDNARA 167 Query: 120 TYEDMKRLNLGGTDQFFHCMAFCRVSKLNDAGVSRSAKGLGYEKEIRDYGLNLFGMYGRK 179 ++L G V G+ + + R + +L ++ + Sbjct: 168 GNWANAAIDLAGITLDAAATTVPGVPGGASTGIKAYREAAEPKGAARAFHASLRNLHHNE 227 Query: 180 VKLSHSEMIEDNKKDLA-VNDHGLTCPSTTDCSDRCSDY 217 + + K+ + L+ + D D Sbjct: 228 RIARIRSKLAEVAKNNGWKKNSRLSRMNKRDVYDAPDGI 266 >UniRef50_B6A881 YenC1 n=1 Tax=Yersinia sp. MH-1 RepID=B6A881_9ENTR Length = 974 Score = 122 bits (305), Expect = 1e-26, Method: Composition-based stats. Identities = 30/96 (31%), Positives = 44/96 (45%), Gaps = 5/96 (5%) Query: 5 MDADGNIAWSGEYDEWGNQLNE---ENPHHLHQPYRLPGQQYDKESGLYYNRNRYYDPLQ 61 +D G + EY +G + Y G++ D +GLYY RYY P Sbjct: 588 VDGTGQLISQEEYYPYGGTAVWMARSQREASDKAYGYSGKERDA-TGLYYYGFRYYQPWA 646 Query: 62 GRYITQDPIGLEGGWSLYAYP-LNPVNGIDPLGLSP 96 GR+++ DP G G +L+ NP+ DP GL+P Sbjct: 647 GRWLSADPAGTIDGLNLFRMVRNNPIVLHDPDGLAP 682 >UniRef50_Q2SIG5 Rhs family protein n=1 Tax=Hahella chejuensis KCTC 2396 RepID=Q2SIG5_HAHCH Length = 1427 Score = 122 bits (305), Expect = 1e-26, Method: Composition-based stats. Identities = 38/101 (37%), Positives = 56/101 (55%), Gaps = 3/101 (2%) Query: 3 ALMDADGNIAWSGEYDEWGNQLNEENPHHLHQPYRLPGQQYDKESGLYYNRNRYYDPLQG 62 + +G I W Y+ +G R PGQ +D+ESG+++N R Y+P G Sbjct: 1165 QMASRNGQIVWKAAYEVFGRVKIF--VDKAENNLRFPGQYFDQESGMHHNYFRDYNPGYG 1222 Query: 63 RYITQDPIGLEGGWSLYAYPL-NPVNGIDPLGLSPADVALI 102 RYI +DPI + GG ++YAY NPV +DPLGL+ +V + Sbjct: 1223 RYIQRDPISVYGGINVYAYANGNPVVYMDPLGLAKMNVGVG 1263 >UniRef50_D2KTW4 Putative uncharacterized protein n=2 Tax=Streptomyces RepID=D2KTW4_9ACTO Length = 1097 Score = 121 bits (304), Expect = 2e-26, Method: Composition-based stats. Identities = 39/96 (40%), Positives = 47/96 (48%), Gaps = 1/96 (1%) Query: 2 LALMDADGNIAWSGEYDEWGNQLNEENPHHLHQPYRLPGQQYDKESGLYYNRNRYYDPLQ 61 L+ DG++AW WG L P R PGQ D E+GL YN +RYYDP Sbjct: 911 TELITPDGHLAWQHRTTLWGTPLP-TPSDTTTCPLRFPGQYADPETGLNYNHHRYYDPET 969 Query: 62 GRYITQDPIGLEGGWSLYAYPLNPVNGIDPLGLSPA 97 +Y+T DP+GL AY NP DPLGL Sbjct: 970 AQYLTPDPLGLAPAPHPRAYVHNPHTWQDPLGLEGC 1005 >UniRef50_C0FSB4 Putative uncharacterized protein n=1 Tax=Roseburia inulinivorans DSM 16841 RepID=C0FSB4_9FIRM Length = 223 Score = 121 bits (304), Expect = 2e-26, Method: Composition-based stats. Identities = 46/187 (24%), Positives = 67/187 (35%), Gaps = 9/187 (4%) Query: 6 DADGNIAWSGEYDEWGNQLNEEN-PHHLHQPYRLPGQQYDKESGLYYNRNRYYDPLQGRY 64 D +G W D +G EE P+R GQ D+E+GLYYNR RYY P +G Y Sbjct: 5 DEEGKKVWERNLDIYGRVKTEEALGEKNLIPFRFQGQYEDEETGLYYNRFRYYSPEEGCY 64 Query: 65 ITQDPIGLEGG-WSLYAYPLNPVNGIDPLGLSPADVALIRRKDQLNHQRAWDILSDTYED 123 QDPIGL GG +LY Y + + +DP GL+ I + + Sbjct: 65 TQQDPIGLAGGNPTLYGYVYDTLCELDPFGLA------ILFETGTYGGLNSSVHVGDGLQ 118 Query: 124 MKRLNLGGTDQFFHCMAFCRVSKLNDAGVSRSAKGLGYEKEIRDYGLNLFGMYGRKVKLS 183 L + RV + ++ + + G Sbjct: 119 AHELIRH-KYLVQKKLTSQRVRLSGNPAIALDNVHHTRVGGAHWWETQIRKSQGLGRNQF 177 Query: 184 HSEMIED 190 H + + Sbjct: 178 HPNLKRE 184 >UniRef50_B3EU05 Putative uncharacterized protein n=1 Tax=Candidatus Amoebophilus asiaticus 5a2 RepID=B3EU05_AMOA5 Length = 2534 Score = 121 bits (304), Expect = 2e-26, Method: Composition-based stats. Identities = 38/135 (28%), Positives = 58/135 (42%), Gaps = 4/135 (2%) Query: 5 MDADGNIAWSGEYDEWGNQLNEENPHHLHQ---PYRLPGQQYDKESGLYYNRNRYYDPLQ 61 +D + EY +G + N + YR G + D+E+GL Y+ RYY P Sbjct: 2180 LDDSAQVISYEEYHPYGTTAYQANNAAIKAAAKRYRYTGMERDEETGLEYHSARYYVPWL 2239 Query: 62 GRYITQDPIGLEGGWSLYAYP-LNPVNGIDPLGLSPADVALIRRKDQLNHQRAWDILSDT 120 GR+ + DPIG+ G ++Y Y NPVN DP G + + N + D + Sbjct: 2240 GRWCSADPIGIGDGVNVYRYVGNNPVNLHDPTGALEFENYDAYKAYVGNKALSADKIGSQ 2299 Query: 121 YEDMKRLNLGGTDQF 135 +K TD + Sbjct: 2300 GHWLKSDRENKTDVW 2314 >UniRef50_Q8PMX0 Wall-associated protein n=2 Tax=Xanthomonas RepID=Q8PMX0_XANAC Length = 1199 Score = 121 bits (304), Expect = 2e-26, Method: Composition-based stats. Identities = 36/210 (17%), Positives = 65/210 (30%), Gaps = 11/210 (5%) Query: 2 LALMDADGNIAWSGEYDEWGNQLNEENPHHLHQPYRLPGQQYDKESGLYYNRNRYYDPLQ 61 +A+ D G + +Y +G+ + + G D +GL Y + RYYD Sbjct: 941 IAVTDTAGQVVERTDYQPYGSPIGKTVDG-----IGYTGHAMDGATGLTYMQQRYYDQDL 995 Query: 62 GRYITQDPIG----LEGGWSLYAYP-LNPVNGIDPLGLSPADVALIRRKDQLNHQRAWDI 116 GR+++ DP+ ++ Y Y NP DP G A + + Sbjct: 996 GRFLSVDPVAADSVFAANFNRYWYANNNPYKFTDPDGRQSAADRYYGAAVGYMLRNDPEK 1055 Query: 117 LSDTYEDMKRLNLGGTDQFFHCMAFCRVSKLNDAGVSRSAKGLGYEKEIRDYGLNLFGMY 176 L + + Q A + S + + G Sbjct: 1056 LR-IWMAGEAAATTEGSQAEQGAAMGQAVGEFVDAGDFSNEAIAGALLKAAAAGVTRGRS 1114 Query: 177 GRKVKLSHSEMIEDNKKDLAVNDHGLTCPS 206 GR + + +++ ND + C Sbjct: 1115 GRPGDFTRGQRNAFKRENAQRNDGQMRCDD 1144 >UniRef50_A3DF74 YD repeat protein n=17 Tax=Clostridium thermocellum RepID=A3DF74_CLOTH Length = 1959 Score = 121 bits (304), Expect = 2e-26, Method: Composition-based stats. Identities = 44/165 (26%), Positives = 64/165 (38%), Gaps = 18/165 (10%) Query: 4 LMDADGNIAWSGEYDEWGNQLNEENPHHLHQPYRLPGQQYDKESGLYYNRNRYYDPLQGR 63 L D+ G I + +YD +GN L +H +R G+QYD +G YY R R+Y P GR Sbjct: 1576 LADSSGKIVNTYDYDAFGNTL--SVKETIHNRFRYAGEQYDDFTGQYYLRARFYSPSLGR 1633 Query: 64 YITQDP----IGLEGGWSLYAYP-LNPVNGIDPLGLSPADV--ALIRRKDQLNHQRAW-- 114 + +D +LY Y NPV +DP G P + AL +++N W Sbjct: 1634 FTQEDTWRGFTYNPASLNLYTYVENNPVMFVDPTGHWPKFIDNALDWVGNKVNEAADWVG 1693 Query: 115 -------DILSDTYEDMKRLNLGGTDQFFHCMAFCRVSKLNDAGV 152 D D D + + V G+ Sbjct: 1694 NRVNDVVDWAGDRINDARNFITNTATGVKNWWVENNVGAYVVGGL 1738 >UniRef50_Q4ZP55 YD repeat n=10 Tax=Pseudomonas syringae group RepID=Q4ZP55_PSEU2 Length = 955 Score = 121 bits (304), Expect = 2e-26, Method: Composition-based stats. Identities = 38/204 (18%), Positives = 62/204 (30%), Gaps = 11/204 (5%) Query: 5 MDADGNIAWSGEYDEWGNQLNEENPHHLHQPY---RLPGQQYDKESGLYYNRNRYYDPLQ 61 +D G + Y +G + Y R G++ D SGLYY RYY P Sbjct: 554 LDQQGGLISQESYYPFGGTAWWAARSAVEAKYKTVRYSGKERDA-SGLYYYGFRYYAPWL 612 Query: 62 GRYITQDPIGLEGGWSLYAYP-LNPVNGIDPLGLSPADVALIRRKD----QLNHQRAWDI 116 R+I DP G G +L+ + NP+N DP G S + Sbjct: 613 QRWINPDPAGDVDGLNLFGFVGNNPLNLFDPDGQSSQKARARWLTAMDILREQRDTNDVS 672 Query: 117 LSDTYEDMKRLNLGGTDQFFHCMAFC--RVSKLNDAGVSRSAKGLGYEKEIRDYGLNLFG 174 + + +++ GT++ F + R + S K Sbjct: 673 VRTLAPGVIGVSIQGTNETFAQIGMGLPRAFEGRRELTYLSIDSTYRSKVSGGSYPFAIK 732 Query: 175 MYGRKVKLSHSEMIEDNKKDLAVN 198 +E+ + Sbjct: 733 SASGAAIYVEAEVPGSSHLAFING 756 >UniRef50_Q3JSF4 RhsD protein n=23 Tax=Burkholderia pseudomallei RepID=Q3JSF4_BURP1 Length = 1593 Score = 121 bits (304), Expect = 2e-26, Method: Composition-based stats. Identities = 40/93 (43%), Positives = 51/93 (54%), Gaps = 1/93 (1%) Query: 2 LALMDADGNIAWSGEYDEWGNQLNEENPHHLHQPYRLPGQQYDKESGLYYNRNRYYDPLQ 61 + + D G I W Y G + E + QP RL GQ +D ESGL YNR RY+ Sbjct: 1349 VRIYDDCGRIVWEARYGPHGGIASIE-TDVIQQPIRLQGQIFDWESGLSYNRYRYFLSSI 1407 Query: 62 GRYITQDPIGLEGGWSLYAYPLNPVNGIDPLGL 94 G +++QDPIGL GG +LY + N DPLGL Sbjct: 1408 GAFVSQDPIGLVGGVNLYRFAPNAFGWTDPLGL 1440 >UniRef50_B0WKJ0 Putative uncharacterized protein n=1 Tax=Culex quinquefasciatus RepID=B0WKJ0_CULQU Length = 2185 Score = 121 bits (304), Expect = 2e-26, Method: Composition-based stats. Identities = 35/123 (28%), Positives = 54/123 (43%), Gaps = 4/123 (3%) Query: 2 LALMDADGNIAWSGEYDEWGNQLNEENPHH-LHQPYRLPGQQYDKESGLYYNRNRYYDPL 60 L+ + + +Y +G + + H YR GQ+YD+E+GLY R YDP Sbjct: 1677 TRLVIHQNRVVAAYDYMPYGQLMRKFGSSAEAHIAYRYTGQEYDEETGLYNYHARLYDPD 1736 Query: 61 QGRYITQDPIGLEGGWSLYAYPLN-PVNGIDPLGLSPADVALIRRKDQLNHQRAWDILSD 119 GR+ DP+ E S Y Y N PV+ IDP G + +I L ++ Sbjct: 1737 IGRFYQLDPM--EQYPSPYKYAGNSPVSQIDPDGQVAITLIVIGIGALLGAYFGAASANN 1794 Query: 120 TYE 122 ++ Sbjct: 1795 SWN 1797 >UniRef50_B1HM94 Cell wall-associated protein n=5 Tax=Lysinibacillus sphaericus C3-41 RepID=B1HM94_LYSSC Length = 995 Score = 121 bits (304), Expect = 2e-26, Method: Composition-based stats. Identities = 35/98 (35%), Positives = 56/98 (57%), Gaps = 5/98 (5%) Query: 1 MLALMDADGNIAWSGEYDEWGNQLNEENPHHLHQPYRLPGQQYDKESGLYYNRNRYYDPL 60 +LAL + +G+I YD WGN L++ PYR G +YD+++ LYY RYY+P Sbjct: 781 VLALTNTNGDIVAQYTYDAWGNILSQSGTMAAINPYRYAGYRYDEKTKLYYLMARYYNPD 840 Query: 61 QGRYITQDPIGLEG----GWSLYAYP-LNPVNGIDPLG 93 G ++++DP+ + ++ Y+Y NPV +DP G Sbjct: 841 TGVFLSRDPVRGDTMTPISFNGYSYTNNNPVMNVDPSG 878 >UniRef50_D1WVZ1 YD repeat protein n=1 Tax=Streptomyces sp. ACT-1 RepID=D1WVZ1_9ACTO Length = 2294 Score = 121 bits (303), Expect = 2e-26, Method: Composition-based stats. Identities = 39/97 (40%), Positives = 52/97 (53%), Gaps = 4/97 (4%) Query: 1 MLALMDADGNIAWSGEYDEWGNQLNEENPHHLHQPYRLPGQQYDKESGLYYNRNRYYDPL 60 ++ + + DG IA YD G + PY G++ D +GL Y RNRYYDP Sbjct: 1897 IVGMANTDGTIATRYTYDPNGQPT--TSGAASSNPYTFTGRESDG-TGLLYYRNRYYDPE 1953 Query: 61 QGRYITQDPIGLEGGWSLYAYP-LNPVNGIDPLGLSP 96 GR+I+QDPIG GG +LY Y +P DP G +P Sbjct: 1954 SGRFISQDPIGHAGGTNLYQYALSSPTTYTDPSGNNP 1990 >UniRef50_A8ZSG6 YD repeat protein n=1 Tax=Desulfococcus oleovorans Hxd3 RepID=A8ZSG6_DESOH Length = 423 Score = 121 bits (303), Expect = 2e-26, Method: Composition-based stats. Identities = 39/97 (40%), Positives = 53/97 (54%), Gaps = 3/97 (3%) Query: 4 LMDADGNIAWSGEYDEWGNQLNEENPHHLHQPYRLPGQQYDKESGLYYNRNRYYDPLQGR 63 L ++G + WS +Y+ +G+ E + R PGQ +D ESGL+YN +RYY P GR Sbjct: 260 LTASNGAVVWSAKYESFGDATVE--IETVENNLRFPGQYFDGESGLHYNLHRYYAPELGR 317 Query: 64 YITQDPIGLEGGWSLYAYP-LNPVNGIDPLGLSPADV 99 ++ DPIGL GG + Y Y N N DP GL Sbjct: 318 FLKDDPIGLRGGINQYIYADNNVSNNTDPYGLFSKKT 354 >UniRef50_C3IG21 Wall associated protein n=2 Tax=Bacillus cereus group RepID=C3IG21_BACTU Length = 263 Score = 121 bits (303), Expect = 2e-26, Method: Composition-based stats. Identities = 40/171 (23%), Positives = 65/171 (38%), Gaps = 6/171 (3%) Query: 1 MLALMDADGNIAWSGEYDEWGNQL-NEENPHHLHQPYRLPGQQYDKESGLYYNRNRYYDP 59 ++A+ D + + + EYD WGN L ++ P+ G YDKE G+YY RYY+P Sbjct: 18 VIAMTDQNREVVATYEYDSWGNVLKSDTKGIATENPFGYAGYMYDKEIGMYYLIARYYNP 77 Query: 60 LQGRYITQDPIGLEG----GWSLYAYP-LNPVNGIDPLGLSPADVALIRRKDQLNHQRAW 114 +++ DP + + Y Y NPV IDP G P +I A Sbjct: 78 DHAVFLSVDPDPGDEDDPVTMNGYTYVDNNPVMLIDPDGNIPVAPLVIAGARMAAPHIAR 137 Query: 115 DILSDTYEDMKRLNLGGTDQFFHCMAFCRVSKLNDAGVSRSAKGLGYEKEI 165 + R + A R +L + + + + + Sbjct: 138 YAAKQAAKQAARKAKKRATLAANKRAGARAEELAKSALKKKGYDVLGSQVS 188 >UniRef50_C0FSB7 Putative uncharacterized protein n=1 Tax=Roseburia inulinivorans DSM 16841 RepID=C0FSB7_9FIRM Length = 306 Score = 121 bits (303), Expect = 2e-26, Method: Composition-based stats. Identities = 48/151 (31%), Positives = 59/151 (39%), Gaps = 17/151 (11%) Query: 6 DADGNIAWSGEYDEWGNQ----------LNEENPHHLHQPYRLPGQQYDKESGLYYNRNR 55 D +G W E D +G E P+R GQ DKE GLYYNR R Sbjct: 131 DGEGIKVWERELDIYGRVKNVGKGSDRSAAPETGEQCFIPFRFQGQYEDKEIGLYYNRFR 190 Query: 56 YYDPLQGRYITQDPIGLEGG-WSLYAYPLNPVNGIDPLGLSPADVALIRRKDQLNHQRAW 114 YYDP G+Y QDPIGL GG +LY Y N + +DP GL D+ L Sbjct: 191 YYDPSLGQYTQQDPIGLAGGNPTLYGYVFNTMWELDPFGLDWKDLLETGLGHHLFPGSVA 250 Query: 115 ------DILSDTYEDMKRLNLGGTDQFFHCM 139 + T G+ +F Sbjct: 251 KKLGIEEWAKLTAYSWYPYETAGSGRFIKNF 281 >UniRef50_C4DPW8 Rhs family protein n=1 Tax=Stackebrandtia nassauensis DSM 44728 RepID=C4DPW8_9ACTO Length = 2607 Score = 120 bits (302), Expect = 3e-26, Method: Composition-based stats. Identities = 33/108 (30%), Positives = 48/108 (44%), Gaps = 4/108 (3%) Query: 5 MDADGNIAWSGEYDEWGNQLNEENPHHLH---QPYRLPGQQYDKESGLYYNRNRYYDPLQ 61 +D G + EY +G + YR G++ D +GLYY +RYY Sbjct: 1838 LDEAGAVVSYEEYFPFGGTAFVAGDSLREVRMRDYRFSGKENDDATGLYYFGHRYYAAWT 1897 Query: 62 GRYITQDPIGLEGGWSLYAYP-LNPVNGIDPLGLSPADVALIRRKDQL 108 GR+I+ DP+G G +LY Y NPV DP GL + + + Sbjct: 1898 GRWISADPMGPVDGTNLYVYVRNNPVTFADPNGLQTTTTETQQGEHHV 1945 >UniRef50_C3LC21 Wall-associated domain protein n=18 Tax=Bacillus cereus group RepID=C3LC21_BACAC Length = 262 Score = 120 bits (302), Expect = 3e-26, Method: Composition-based stats. Identities = 31/131 (23%), Positives = 56/131 (42%), Gaps = 6/131 (4%) Query: 1 MLALMDADGNIAWSGEYDEWGNQL-NEENPHHLHQPYRLPGQQYDKESGLYYNRNRYYDP 59 ++ + + D + + EYD WGN + ++ + P+ G YDKE G+YY RYY+P Sbjct: 36 VVTITNQDKEVVATYEYDAWGNVVKSDTKGIAVDNPFGYAGYMYDKEIGMYYLIARYYNP 95 Query: 60 LQGRYITQDPIGLEG----GWSLYAYP-LNPVNGIDPLGLSPADVALIRRKDQLNHQRAW 114 G +++ DP + + Y Y NPV I+P G + + + ++ Sbjct: 96 EHGVFLSVDPDPGDEDDPVTMNGYTYGDNNPVMMINPDGHLAWFIPVAVHEARIAAPHVG 155 Query: 115 DILSDTYEDMK 125 + Sbjct: 156 RFVGKQLAKRA 166 >UniRef50_C7IND8 YD repeat protein (Fragment) n=1 Tax=Clostridium papyrosolvens DSM 2782 RepID=C7IND8_9CLOT Length = 581 Score = 120 bits (301), Expect = 3e-26, Method: Composition-based stats. Identities = 43/176 (24%), Positives = 68/176 (38%), Gaps = 11/176 (6%) Query: 1 MLALMDADGNIAWSGEYDEWGNQLNEENPHHLHQPYRLPGQQYDKESGLYYNRNRYYDPL 60 + AL+ +G I + YD +GN + ++ G QYDKE+ LYY RYYD Sbjct: 106 VTALVGENGAIQATYYYDAFGNITEQTG--DVNNNITYAGYQYDKETDLYYLNARYYDSK 163 Query: 61 QGRYITQDPIGLEG----GWSLYAYPLN-PVNGIDPLGLS-PADVALIRRKDQLNHQRAW 114 R++++D +LY Y N PV +DP G +D LI+ Q Sbjct: 164 TARFLSEDTYTGNTNDPLSLNLYTYCHNEPVMYVDPSGHWQESDKNLIQSARIAISQLTD 223 Query: 115 DILSDTYEDMKRLNLGGTDQFFHCMAFCRVS---KLNDAGVSRSAKGLGYEKEIRD 167 ++ + ++KR +C A + + K GY Sbjct: 224 IYVTTSDPEVKRACATQAAAIRNCSANIAKTPQYSEVGMLYGQQLKKDGYVSASDW 279 >UniRef50_Q73C66 Wall associated protein, putative n=73 Tax=Bacillus cereus group RepID=Q73C66_BACC1 Length = 2246 Score = 120 bits (301), Expect = 4e-26, Method: Composition-based stats. Identities = 39/155 (25%), Positives = 66/155 (42%), Gaps = 7/155 (4%) Query: 1 MLALMDADGNIAWSGEYDEWGNQL-NEENPHHLHQPYRLPGQQYDKESGLYYNRNRYYDP 59 ++A+ + D + + EYD WGN L ++ P+ G YDKE G+YY RYY+P Sbjct: 1987 VVAMTNQDKEVVATYEYDSWGNVLKSDTKGIAADNPFGYAGYMYDKEIGMYYLVARYYNP 2046 Query: 60 LQGRYITQDPIGLEG----GWSLYAYP-LNPVNGIDPLGLSPADVALIRRKDQLNHQRAW 114 G +++ DP + + Y Y NPV DP G +P +A+ + +A+ Sbjct: 2047 DHGVFLSVDPDPGDEDDPITMNGYTYGDNNPVMMTDPDGHAPW-LAINAGFAVYDGYKAY 2105 Query: 115 DILSDTYEDMKRLNLGGTDQFFHCMAFCRVSKLND 149 + +++ LG A + Sbjct: 2106 KSGASKKVILRKAALGFVGGGKLKYAGKITKAIGG 2140 >UniRef50_B8CMS0 Putative uncharacterized protein n=1 Tax=Shewanella piezotolerans WP3 RepID=B8CMS0_SHEPW Length = 510 Score = 120 bits (301), Expect = 4e-26, Method: Composition-based stats. Identities = 42/207 (20%), Positives = 76/207 (36%), Gaps = 18/207 (8%) Query: 5 MDADGNIAWSGEYDEWGNQLNEENPHHLHQPYRLPGQQYDKESGLYYNRNRYYDPLQGRY 64 DA+G + Y+ +G +L E G D + GL Y + RYYDPL GR+ Sbjct: 259 TDAEGKVVSRSVYEPFGKRLGGEKAG-----IGYTGHLQDTDLGLTYMQARYYDPLIGRF 313 Query: 65 ITQDPIGLEG---------GWSLYAYP-LNPVNGIDPLGLSPADVALIRRKDQLNHQRAW 114 + DP+ G G++ Y Y NP +DP G + L+ + A+ Sbjct: 314 YSNDPVNATGHIGRGNPVHGFNRYTYANNNPYKYVDPDGEFAFLIPLV--GAVIGGYTAF 371 Query: 115 DILSDTYEDMKRLNLGG-TDQFFHCMAFCRVSKLNDAGVSRSAKGLGYEKEIRDYGLNLF 173 + D + G F ++ + GV +A+ + ++ + Sbjct: 372 NQAKSMGADNSEAMVAGVAGAFVGALSGGTIGTAAGFGVKIAAQQAVKQSVVKTASKVVG 431 Query: 174 GMYGRKVKLSHSEMIEDNKKDLAVNDH 200 S ++ + D D++ + Sbjct: 432 SGVSGGASGSLTQAVADASGDISNGEV 458 >UniRef50_B2Q762 Putative uncharacterized protein n=2 Tax=Providencia stuartii ATCC 25827 RepID=B2Q762_PROST Length = 173 Score = 120 bits (301), Expect = 4e-26, Method: Composition-based stats. Identities = 37/127 (29%), Positives = 52/127 (40%) Query: 30 HHLHQPYRLPGQQYDKESGLYYNRNRYYDPLQGRYITQDPIGLEGGWSLYAYPLNPVNGI 89 Q R GQ +D E+GL++N R+YDP GR+I DPIGL GG +LYAY NP+ I Sbjct: 2 ESFEQNLRYAGQYFDNETGLHFNTFRFYDPQIGRFIMPDPIGLLGGINLYAYAPNPLGWI 61 Query: 90 DPLGLSPADVALIRRKDQLNHQRAWDILSDTYEDMKRLNLGGTDQFFHCMAFCRVSKLND 149 DP G S + + L + T+ R + Sbjct: 62 DPWGWSHLNTNGATGNFGVYKIEIDGQLYKYGKADLNRVTQSTNLPTRLHQQVRKLQDLY 121 Query: 150 AGVSRSA 156 + + Sbjct: 122 PDKTITG 128 >UniRef50_C3JCI6 Rhs family protein n=4 Tax=Bacteria RepID=C3JCI6_9PORP Length = 1387 Score = 120 bits (300), Expect = 4e-26, Method: Composition-based stats. Identities = 44/177 (24%), Positives = 65/177 (36%), Gaps = 3/177 (1%) Query: 5 MDADGNIAWSGEYDEWGNQLNEENPHHLHQPYRLPGQQYDKESGLYYNRNRYYDPLQGRY 64 D+ G W D +G ++ P+ GQ +D+E L YNR RYYDP G Y Sbjct: 1192 FDSHGAKVWERNLDIYGKIRTGDSTLV---PFLFQGQYFDEEIDLCYNRFRYYDPSTGTY 1248 Query: 65 ITQDPIGLEGGWSLYAYPLNPVNGIDPLGLSPADVALIRRKDQLNHQRAWDILSDTYEDM 124 I+QDPI + G ++YAY + + IDP GLS K + T + + Sbjct: 1249 ISQDPISIAGRLNVYAYVHDSNSWIDPFGLSGTHGHHSDPKFMGGDPKQSLTNIKTEDHI 1308 Query: 125 KRLNLGGTDQFFHCMAFCRVSKLNDAGVSRSAKGLGYEKEIRDYGLNLFGMYGRKVK 181 T S S + + ++ L Y + K Sbjct: 1309 DLHRDMNTYLETKTKVVNGESVSMRPKRGNSGRIIRQNFTRQERLDALAEFYTKNKK 1365 >UniRef50_B2PW71 Putative uncharacterized protein n=1 Tax=Providencia stuartii ATCC 25827 RepID=B2PW71_PROST Length = 191 Score = 120 bits (300), Expect = 5e-26, Method: Composition-based stats. Identities = 48/191 (25%), Positives = 68/191 (35%), Gaps = 11/191 (5%) Query: 30 HHLHQPYRLPGQQYDKESGLYYNRNRYYDPLQGRYITQDPIGLEGGWSLYAYPLNPVNGI 89 Q R GQ +D E+GL++N R+YDP GR+I DPIGL GG +LY Y NP+ I Sbjct: 2 ESFEQNLRYAGQYFDNETGLHFNTFRFYDPQIGRFIMPDPIGLLGGINLYQYAPNPLGWI 61 Query: 90 DPLGLSPADVALIRRKDQLNHQRAWDILSDTYEDMKRLNLG-------GTDQFFHCMAFC 142 DP GL+ + A +A Y M+ + F Sbjct: 62 DPWGLAG-NPATATHITYQGIDKATGKPYVGYASMQGNQTAQNVLKYRYGNDFSRFGGTP 120 Query: 143 RVSKLNDAGVSRSAKGLGYEKEIRDYGLNLFGMYGRKVKLSHSEMIED---NKKDLAVND 199 V + G G E+ + L G ++ + N D +N Sbjct: 121 PVILYDGYGQKGKDIARGLEQRRFEQLGGLEGTANKQNPVGQGNARRTEYLNAADEHLNS 180 Query: 200 HGLTCPSTTDC 210 C Sbjct: 181 KQTGTKKGGKC 191 >UniRef50_Q2T5B9 YD repeat protein n=39 Tax=Proteobacteria RepID=Q2T5B9_BURTA Length = 1553 Score = 120 bits (300), Expect = 5e-26, Method: Composition-based stats. Identities = 43/146 (29%), Positives = 59/146 (40%), Gaps = 25/146 (17%) Query: 4 LMDADGNIAWSGEYDEWGNQLNEENPHHL---------------------HQPYRLPGQQ 42 L +DG W WG+ ++ L R PGQ Sbjct: 1289 LYSSDGRALWRARRTAWGDTAGDDGRDSLRSAVREQLRLGHRDSDEFDPPDCELRFPGQW 1348 Query: 43 YDKESGLYYNRNRYYDPLQGRYITQDPIGLEGGWSLYAYPLNPVNGIDPLGLSPADVALI 102 D+ESGL+YN +RYYDP G+Y++ DP+GL GG +AY +P+ DP GL D Sbjct: 1349 ADEESGLHYNLHRYYDPSTGQYLSADPVGLAGGLRTHAYVHDPMQWGDPFGLQGYDTVRN 1408 Query: 103 RRKDQLNHQRAWDILSDTYEDMKRLN 128 R + D + K N Sbjct: 1409 HR----AGNKQIDYDGQRWNVPKGKN 1430 Database: uniref50.fasta Posted date: Mar 8, 2010 10:38 AM Number of letters in database: 1,040,396,356 Number of sequences in database: 3,077,464 Lambda K H 0.311 0.133 0.428 Lambda K H 0.267 0.0409 0.140 Matrix: BLOSUM62 Gap Penalties: Existence: 11, Extension: 1 Number of Hits to DB: 1,423,700,014 Number of Sequences: 3077464 Number of extensions: 70935159 Number of successful extensions: 153638 Number of sequences better than 1.0e-01: 250 Number of HSP's better than 0.1 without gapping: 1890 Number of HSP's successfully gapped in prelim test: 736 Number of HSP's that attempted gapping in prelim test: 144152 Number of HSP's gapped (non-prelim): 6641 length of query: 236 length of database: 1,040,396,356 effective HSP length: 125 effective length of query: 111 effective length of database: 655,713,356 effective search space: 72784182516 effective search space used: 72784182516 T: 11 A: 40 X1: 16 ( 7.2 bits) X2: 38 (14.6 bits) X3: 64 (24.7 bits) S1: 41 (21.3 bits) S2: 91 (39.7 bits)