BLASTP 2.2.22 [Sep-27-2009]

Reference: Altschul, Stephen F., Thomas L. Madden, Alejandro A. Schaffer,
Jinghui Zhang, Zheng Zhang, Webb Miller, and David J. Lipman (1997),
"Gapped BLAST and PSI-BLAST: a new generation of protein database search
programs", Nucleic Acids Res. 25:3389-3402.

Reference for compositional score matrix adjustment: Altschul, Stephen F.,
John C. Wootton, E. Michael Gertz, Richa Agarwala, Aleksandr Morgulis,
Alejandro A. Schaffer, and Yi-Kuo Yu (2005) "Protein database searches
using compositionally adjusted substitution matrices", FEBS J. 272:5101-5109.

Reference for composition-based statistics starting in round 2:
Schaffer, Alejandro A., L. Aravind, Thomas L. Madden, Sergei Shavirin,
John L. Spouge, Yuri I. Wolf, Eugene V. Koonin, and Stephen F. Altschul (2001),
"Improving the accuracy of PSI-BLAST protein database searches with
composition-based statistics and other refinements", Nucleic Acids Res. 29:2994-3005.

Query= batch____
         (277 letters)

Database: uniref50.fasta
           3,077,464 sequences; 1,040,396,356 total letters

Searching..................................................done

Results from round 1

                                                                    Score    E
Sequences producing significant alignments:                         (bits) Value

UniRef50_A8AI45 Curli production assembly/transport component cs...   561   e-158
UniRef50_Q0TJ37 Curli production assembly/transport component Cs...   417   e-115
UniRef50_Q084E4 Curli production assembly/transport component Cs...   281   2e-74
UniRef50_Q3KES0 Assembly/transport component in curli production...   270   4e-71
UniRef50_Q392B7 Curli production assembly/transport component Cs...   233   5e-60
UniRef50_C6X5S1 Curli production assembly/transport component Cs...   223   5e-57
UniRef50_A6EMN0 Putative assembly or transport protein for curli...   217   3e-55
UniRef50_A9L3G5 Curli production assembly/transport component Cs...   211   2e-53
UniRef50_A5FHE8 Curli production assembly/transport component Cs...   208   1e-52
UniRef50_A3XJL0 Putative assembly or transport protein for curli...   208   2e-52
UniRef50_D2LDX9 Curli production assembly/transport component Cs...   194   2e-48
UniRef50_D2QPP9 Curli production assembly/transport component Cs...   193   6e-48
UniRef50_Q1NBA5 Putative curli production assembly/transport com...   187   4e-46
UniRef50_Q1YMP1 Putative Curli production assembly/transport com...   171   3e-41
UniRef50_B6QZQ0 Curli fiber membrane-associated lipoprotein CsgG...   168   2e-40
UniRef50_B3QD06 Curli production assembly/transport component Cs...   154   3e-36
UniRef50_B3QHQ8 Curli production assembly/transport component Cs...   154   4e-36
UniRef50_Q1QYN5 Curli production assembly/transport component Cs...    75   3e-12
UniRef50_C5D1Z0 Putative uncharacterized protein n=1 Tax=Variovo...    60   5e-08
UniRef50_C8R0R5 Curli production assembly/transport component Cs...    58   3e-07
UniRef50_A7HWH6 HfaB protein n=1 Tax=Parvibaculum lavamentivoran...    58   4e-07
UniRef50_Q0AL09 HfaB protein n=1 Tax=Maricaulis maris MCS10 RepI...    54   7e-06
UniRef50_C5SPX6 HfaB protein n=1 Tax=Asticcacaulis excentricus C...    54   8e-06
UniRef50_B4W728 Putative uncharacterized protein n=1 Tax=Brevund...    50   6e-05
UniRef50_A6GPY8 HfaB protein n=1 Tax=Limnobacter sp. MED105 RepI...    49   1e-04
UniRef50_A4CEI5 Putative uncharacterized protein n=1 Tax=Pseudoa...    49   1e-04
UniRef50_Q0IE08 Curli production assembly/transport component Cs...    48   4e-04
UniRef50_C7YQN8 Predicted protein n=1 Tax=Nectria haematococca m...    43   0.009
UniRef50_Q05S73 Putative uncharacterized protein n=1 Tax=Synecho...    43   0.011
UniRef50_P27343 Putative transcription activator protein hfaB n=...    42   0.016
UniRef50_A2CA24 Uncharacterized protein involved in formation of...    42   0.023
UniRef50_A1BH58 Curli production assembly/transport component Cs...    40   0.080
UniRef50_B8HPG7 CsgG family protein, putative n=1 Tax=Cyanothece...    40   0.092

>UniRef50_A8AI45 Curli production assembly/transport component csgG n=90 Tax=Enterobacteriaceae RepID=CSGG_CITK8
          Length = 277

 Score = 561 bits (1445), Expect = e-158, Method: Compositional matrix adjust.
 Identities = 271/277 (97%), Positives = 274/277 (98%)

Query: 1   MQRLFLLVAVMLLSGCLTAPPKEAARPTLMPRAQSYKDLTHLPAPTGKIFVSVYNIQDET 60
           MQRL +LVAV LLSGCLTAPPKEAA+PTLMPRAQSYKDLTHLP PTGKIFVSVYNIQDET
Sbjct: 1   MQRLLILVAVCLLSGCLTAPPKEAAKPTLMPRAQSYKDLTHLPMPTGKIFVSVYNIQDET 60

Query: 61  GQFKPYPASNFSTAVPQSATAMLVTALKDSRWFIPLERQGLQNLLNERKIIRAAQENGTV 120
           GQFKPYPASNFSTAVPQSATAMLVTALKDSRWFIPLERQGLQNLLNERKIIRAAQENGTV
Sbjct: 61  GQFKPYPASNFSTAVPQSATAMLVTALKDSRWFIPLERQGLQNLLNERKIIRAAQENGTV 120

Query: 121 AINNRIPLQSLTAANIMVEGSIIGYESNVKSGGVGARYFGIGADTQYQLDQIAVNLRVVN 180
           AINNRIPLQSLTAANIMVEGSIIGYESNVKSGGVGARYFGIGADTQYQLDQIAVNLRVVN
Sbjct: 121 AINNRIPLQSLTAANIMVEGSIIGYESNVKSGGVGARYFGIGADTQYQLDQIAVNLRVVN 180

Query: 181 VSTGEILSSVNTSKTILSYEVQAGVFRFIDYQRLLEGEVGYTSNEPVMLCLMSAIETGVI 240
           VSTGEILSSVNTSKTILSYEVQAGVFRFIDYQRLLEGE+GYTSNEPVMLCLMSAIETGVI
Sbjct: 181 VSTGEILSSVNTSKTILSYEVQAGVFRFIDYQRLLEGEIGYTSNEPVMLCLMSAIETGVI 240

Query: 241 FLINDGIDRGLWDLQNKAERQNDILVKYRHMSVPPES 277
           FLINDGIDRGLWDLQNKAERQNDILVKYRHMSVPPES
Sbjct: 241 FLINDGIDRGLWDLQNKAERQNDILVKYRHMSVPPES 277

>UniRef50_Q0TJ37 Curli production assembly/transport component CsgG n=1 Tax=Escherichia coli 536 RepID=Q0TJ37_ECOL5
          Length = 228

 Score = 417 bits (1071), Expect = e-115, Method: Compositional matrix adjust.
 Identities = 206/217 (94%), Positives = 206/217 (94%)

Query: 61  GQFKPYPASNFSTAVPQSATAMLVTALKDSRWFIPLERQGLQNLLNERKIIRAAQENGTV 120
           G   P        AVPQSATAMLVTALKDSRWFIPLERQGLQNLLNERKIIRAAQENGTV
Sbjct: 12  GNLNPTRQVTSPLAVPQSATAMLVTALKDSRWFIPLERQGLQNLLNERKIIRAAQENGTV 71

Query: 121 AINNRIPLQSLTAANIMVEGSIIGYESNVKSGGVGARYFGIGADTQYQLDQIAVNLRVVN 180
           AINNRIPLQSLTAANIMVEGSIIGYESNVKSGGVGARYFGIGADTQYQLDQIAVNLRVVN
Sbjct: 72  AINNRIPLQSLTAANIMVEGSIIGYESNVKSGGVGARYFGIGADTQYQLDQIAVNLRVVN 131

Query: 181 VSTGEILSSVNTSKTILSYEVQAGVFRFIDYQRLLEGEVGYTSNEPVMLCLMSAIETGVI 240
           VSTGEILSSVNTSKTILSYEVQAGVFRFIDYQRLLEGEVGYTSNEPVMLCLMSAIETGVI
Sbjct: 132 VSTGEILSSVNTSKTILSYEVQAGVFRFIDYQRLLEGEVGYTSNEPVMLCLMSAIETGVI 191

Query: 241 FLINDGIDRGLWDLQNKAERQNDILVKYRHMSVPPES 277
           FLINDGIDRGLWDLQNKAERQNDILVKYRHMSVPPES
Sbjct: 192 FLINDGIDRGLWDLQNKAERQNDILVKYRHMSVPPES 228

>UniRef50_Q084E4 Curli production assembly/transport component CsgG n=10 Tax=Gammaproteobacteria RepID=Q084E4_SHEFN
          Length = 283

 Score = 281 bits (719), Expect = 2e-74, Method: Compositional matrix adjust.
Identities = 139/272 (51%), Positives = 189/272 (69%), Gaps = 10/272 (3%) Query: 1 MQRLFLLVAVMLLSGCLTAPPKE---AARPTLMPRAQSYKDLTHLPAPTGKIFVSVYNIQ 57 M+RL L + ++ LS C + + A +LMP+ ++Y DL LP+P G + +VY+ + Sbjct: 1 MKRLVLSLFILSLSACSSIESEFDGIEATTSLMPKGETYYDLVSLPSPQGSMVAAVYDFR 60 Query: 58 DETGQFKPYPASNFSTAVPQSATAMLVTALKDSRWFIPLERQGLQNLLNERKIIRAAQEN 117 D+TGQ+KP P+SNFSTAVPQS TA L AL DS WF+P+ER+GLQNLL ERKI+RA N Sbjct: 61 DQTGQYKPIPSSNFSTAVPQSGTAFLAQALNDSAWFVPVEREGLQNLLTERKIVRAGL-N 119 Query: 118 GTVAINNRIPLQSLTAANIMVEGSIIGYESNVKSGGVGARYFGIGADTQYQLDQIAVNLR 177 G + L L +A I++EG I+ Y++N+K+GG GARY GIG QY++D I VNLR Sbjct: 120 GDAS-----KLPQLNSAQILMEGGIVAYDTNIKTGGAGARYLGIGVSGQYRVDSITVNLR 174 Query: 178 VVNVSTGEILSSVNTSKTILSYEVQAGVFRFIDYQRLLEGEVGYTSNEPVMLCLMSAIET 237 V++ TG +LSSV T+K ++S E+ AGVF+FID Q LLE EVGYTSNEPV LC+ +AIE+ Sbjct: 175 AVDIRTGRLLSSVTTTKAVISKEITAGVFKFIDAQELLESEVGYTSNEPVSLCIAAAIES 234 Query: 238 GVIFLINDGIDRGLWDLQNKAER-QNDILVKY 268 V+ +I DGI + W+L + A +N L KY Sbjct: 235 AVVHMIADGIWKRAWNLLDAASGVKNPTLQKY 266 >UniRef50_Q3KES0 Assembly/transport component in curli production n=8 Tax=Gammaproteobacteria RepID=Q3KES0_PSEPF Length = 286 Score = 270 bits (690), Expect = 4e-71, Method: Compositional matrix adjust. 
Identities = 131/262 (50%), Positives = 182/262 (69%), Gaps = 6/262 (2%) Query: 13 LSGCLTAPPKEAAR----PTLMPRAQSYKDLTHLPAPTGKIFVSVYNIQDETGQFKPYPA 68 L GC P A + PTL PRA +Y DL +P P G++ VY +D+TGQ+KP PA Sbjct: 14 LQGCSLREPMPAEQDTDTPTLTPRASTYYDLLKMPRPKGRLMAVVYGFRDQTGQYKPTPA 73 Query: 69 SNFSTAVPQSATAMLVTALKDSRWFIPLERQGLQNLLNERKIIRAAQENGTVAINNRIPL 128 S+FST+V Q A +ML+ A++ S WF+ LER+GLQNLL ERKIIRA+Q+ +N + L Sbjct: 74 SSFSTSVTQGAASMLMDAMQASGWFVVLEREGLQNLLTERKIIRASQKKPNTPVNIQGEL 133 Query: 129 QSLTAANIMVEGSIIGYESNVKSGGVGARYFGIGADTQYQLDQIAVNLRVVNVSTGEILS 188 L AAN+M+EG II Y++NV+SGG GARY GI +Y++DQ+ VNLR V+V +G++L+ Sbjct: 134 PPLQAANMMLEGGIIAYDTNVRSGGEGARYLGIDLSREYRVDQVTVNLRAVDVRSGQVLA 193 Query: 189 SVNTSKTILSYEVQAGVFRFIDYQRLLEGEVGYTSNEPVMLCLMSAIETGVIFLINDGID 248 +V TSKTI S AG+F+FI++++LLE EVGYT+NEP LC++SAIE V ++ GI+ Sbjct: 194 NVMTSKTIYSVARSAGIFKFIEFKKLLEAEVGYTTNEPAQLCVLSAIEAAVGHMVAQGIE 253 Query: 249 RGLWDLQNKAE--RQNDILVKY 268 R LW + A Q+D+L +Y Sbjct: 254 RRLWQVAGDASTPSQDDVLNRY 275 >UniRef50_Q392B7 Curli production assembly/transport component CsgG n=2 Tax=Proteobacteria RepID=Q392B7_BURS3 Length = 312 Score = 233 bits (594), Expect = 5e-60, Method: Compositional matrix adjust. 
Identities = 115/262 (43%), Positives = 171/262 (65%), Gaps = 3/262 (1%) Query: 9 AVMLLS--GCLTAPPKEAARPTLMPRAQSYKDLTHLPAPTGKIFVSVYNIQDETGQFKPY 66 AV+LLS GC+T P + TL P ++ +DLTHLP P GKI +VY +D TGQ+K Sbjct: 21 AVLLLSLVGCVTRPMPALSNATLTPPTRTTRDLTHLPPPKGKIVAAVYGFRDLTGQYKAS 80 Query: 67 PASNFSTAVPQSATAMLVTALKDSRWFIPLERQGLQNLLNERKIIRAAQENGTVAINNRI 126 P S+FS+ V Q + LV A++DS WF P+ER+ LQ+LL ERKI+RA + N Sbjct: 81 PDSSFSSQVTQGGASFLVKAMRDSGWFTPVERENLQDLLTERKIMRATDGSDAKKAQND- 139 Query: 127 PLQSLTAANIMVEGSIIGYESNVKSGGVGARYFGIGADTQYQLDQIAVNLRVVNVSTGEI 186 + L ANI++EG I+GY++NV++GG G Y GI TQY++DQ+ VNLR +++ TG++ Sbjct: 140 AMAPLMPANIVLEGGIVGYDTNVRTGGAGVAYLGISGSTQYRIDQVTVNLRAIDIRTGQV 199 Query: 187 LSSVNTSKTILSYEVQAGVFRFIDYQRLLEGEVGYTSNEPVMLCLMSAIETGVIFLINDG 246 L+SV+T+KT+ SY+V G++RF+ ++ LL+ E G T NEP +C+ AIE+ + LI G Sbjct: 200 LNSVSTTKTVYSYQVDTGIYRFVGFKDLLQAEAGLTRNEPAQICVNEAIESALTHLIVQG 259 Query: 247 IDRGLWDLQNKAERQNDILVKY 268 + W L+N + + + +Y Sbjct: 260 VANQTWVLKNDQDWYDPTMQRY 281 >UniRef50_C6X5S1 Curli production assembly/transport component CsgG n=3 Tax=Bacteroidetes RepID=C6X5S1_FLAB3 Length = 456 Score = 223 bits (568), Expect = 5e-57, Method: Compositional matrix adjust. 
Identities = 121/267 (45%), Positives = 169/267 (63%), Gaps = 5/267 (1%) Query: 1 MQRLFLLVAVMLLSGC-LTAPPKEAARPTLMPRAQSYKDLTHLPAPTGKIFVSVYNIQDE 59 + ++ LL ++ LS C L P + + TL ++ +LPAP KI V VY +D+ Sbjct: 6 LTKIALLTPLIFLSSCTLFNLPTNSEKSTLGEVTPYTPEIKNLPAPKEKIVVGVYKFRDQ 65 Query: 60 TGQFKPYP-ASNFSTAVPQSATAMLVTALKDSRWFIPLERQGLQNLLNERKIIRAA-QEN 117 TGQ+K +++STA+PQ T +L+ AL+DSRWF P+ER+ + NLLNER+IIR+ +E Sbjct: 66 TGQYKAAENGASWSTAIPQGTTTILLKALEDSRWFTPIERENIGNLLNERQIIRSTRKEY 125 Query: 118 GTVAINNRIPLQSLTAANIMVEGSIIGYESNVKSGGVGARYFGIGADTQYQLDQIAVNLR 177 N L L A I++EG +I Y++NV +GG+GARYFG+GA QY+ D+I V LR Sbjct: 126 AGNDANEAALLPPLLFAGIILEGGVISYDTNVMTGGIGARYFGLGAGAQYRQDRITVYLR 185 Query: 178 VVNVSTGEILSSVNTSKTILSYEVQAGVFRFIDYQRLLEGEVGYTSNEPVMLCLMSAIET 237 V+ S+GEIL +V TSKTILS + FRFID +RLLE ++G T NEPV L + AIE Sbjct: 186 AVSTSSGEILKTVYTSKTILSTSINGNFFRFIDTERLLESDIGITQNEPVHLAVTEAIEK 245 Query: 238 GVIFLINDGIDRGLWDLQNKAERQNDI 264 V+ LI +G+ LW NK + ND Sbjct: 246 AVLSLIVEGVRDNLW--TNKQKTPNDF 270 >UniRef50_A6EMN0 Putative assembly or transport protein for curli synthesis n=1 Tax=unidentified eubacterium SCB49 RepID=A6EMN0_9BACT Length = 458 Score = 217 bits (553), Expect = 3e-55, Method: Compositional matrix adjust. 
Identities = 117/283 (41%), Positives = 170/283 (60%), Gaps = 9/283 (3%) Query: 1 MQRLFLLVAVMLLSGC---LTAP--PKEAARPTLMPRAQSYKDLTHLPAPTGKIFVSVYN 55 ++L LL + L+ C L P +EA L + + L++LP + K+ V VYN Sbjct: 4 FEKLVLLFVFVTLTSCGAMLNQPYNVQEARTGELTGKNNA---LSNLPKASDKVVVGVYN 60 Query: 56 IQDETGQFKPYP-ASNFSTAVPQSATAMLVTALKDSRWFIPLERQGLQNLLNERKIIRAA 114 +D+TGQFK S+FSTAV Q TA+L+ AL+DS WF P+ER+ L NLLNER II Sbjct: 61 FRDQTGQFKLTDVGSSFSTAVSQGTTAILLKALEDSEWFRPIERENLNNLLNERSIIEKT 120 Query: 115 QENGTVAINNRIPLQSLTAANIMVEGSIIGYESNVKSGGVGARYFGIGADTQYQLDQIAV 174 + + T A L+ L A I++EG ++ Y++N+ +GG GARYFG G Y+ D+I V Sbjct: 121 RRDYTPAGQQPQKLKPLLFAGILLEGGVVSYDTNILTGGAGARYFGAGGSVSYRQDRITV 180 Query: 175 NLRVVNVSTGEILSSVNTSKTILSYEVQAGVFRFIDYQRLLEGEVGYTSNEPVMLCLMSA 234 LR ++ STGE+L +V SKTILS AG+FRF+ ++RLLE E+G+T NEP L + A Sbjct: 181 YLRAISTSTGEVLKTVYVSKTILSQGADAGIFRFVKFERLLEAEMGFTKNEPAELAVKEA 240 Query: 235 IETGVIFLINDGIDRGLWDLQNKAERQNDILVKYRHMSVPPES 277 IE V+ L+ +GI LW+++ + + +L Y E+ Sbjct: 241 IEKAVVDLVYEGIKDNLWNMEGEEDDVIGVLETYEREKAAEEA 283 >UniRef50_A9L3G5 Curli production assembly/transport component CsgG n=20 Tax=Alteromonadales RepID=A9L3G5_SHEB9 Length = 268 Score = 211 bits (537), Expect = 2e-53, Method: Compositional matrix adjust. 
Identities = 113/256 (44%), Positives = 167/256 (65%), Gaps = 10/256 (3%) Query: 1 MQRL-FLLVAVMLLSGCLTAP-PKEAARPT-LMPRAQSYKDLTHLPAPTGKIFVSVYNIQ 57 M RL F + ++ +S C P P P + P ++ + L P P I V+VY+ + Sbjct: 1 MARLVFWGLLLLSMSACSLIPKPDLNITPAEVNPLSEVMRGLQTQPGPKFPIPVAVYSFR 60 Query: 58 DETGQFKPYP-ASNFSTAVPQSATAMLVTALKDSRWFIPLERQGLQNLLNERKIIRAAQE 116 D+TGQ+KP S+FSTAV Q AT+ML+ L DS+WF P+ER+GLQNLL ERKI + ++ Sbjct: 61 DQTGQYKPQANVSSFSTAVTQGATSMLMQTLLDSKWFTPVEREGLQNLLTERKI--SNKQ 118 Query: 117 NGTVAINNRIPLQSLTAANIMVEGSIIGYESNVKSGGVGARYFGIGADTQYQLDQIAVNL 176 +GT + +P+ L+ A +++EG +I YE+N +GG G Y+GIGA Y+ DQ+ + L Sbjct: 119 SGTKG--DDVPV--LSTARLLLEGGVISYETNTSTGGSGVEYYGIGASEMYREDQVTIYL 174 Query: 177 RVVNVSTGEILSSVNTSKTILSYEVQAGVFRFIDYQRLLEGEVGYTSNEPVMLCLMSAIE 236 R V+V TG+++ SV+TSK +LS E++AG+FR+ RL E E+G+T+NEPV C++ AIE Sbjct: 175 RAVDVHTGKVMMSVSTSKRVLSQEMRAGLFRYTSLNRLAEAEIGFTTNEPVQFCVLQAIE 234 Query: 237 TGVIFLINDGIDRGLW 252 V LI+ GI +G W Sbjct: 235 LAVAELIDKGIKQGYW 250 >UniRef50_A5FHE8 Curli production assembly/transport component CsgG n=3 Tax=Flavobacteria RepID=A5FHE8_FLAJ1 Length = 454 Score = 208 bits (530), Expect = 1e-52, Method: Compositional matrix adjust. 
Identities = 112/268 (41%), Positives = 168/268 (62%), Gaps = 6/268 (2%) Query: 4 LFLLVAVMLLSGC--LTAPPKEAARPTLMPRAQSYKDLTHLPAPTGKIFVSVYNIQDETG 61 LF+L+A L +GC P + L + L LP P ++ V VY +D+TG Sbjct: 7 LFILIA-FLFAGCGAYYNQPTGVQKAILGESTPATSLLKDLPKPKEQVVVGVYKFRDQTG 65 Query: 62 QFKPYP-ASNFSTAVPQSATAMLVTALKDSRWFIPLERQGLQNLLNERKIIRAAQENGTV 120 Q+KP S+FSTAV Q AT++L+ AL+DS+WFIP+ER+ + NLL ER +IRA ++ Sbjct: 66 QYKPQENGSSFSTAVTQGATSILIKALEDSKWFIPIERENIGNLLQERNLIRATRQEYVK 125 Query: 121 AIN-NRIPLQSLTAANIMVEGSIIGYESNVKSGGVGARYFGIGADTQYQLDQIAVNLRVV 179 N N L L A +++EG I+ Y+SN+ +GG GARYFG GA +Y+ D++ + LR++ Sbjct: 126 NANPNEPQLTPLLYAGVLLEGGIVSYDSNIITGGFGARYFGAGASVKYRQDRVTIYLRMI 185 Query: 180 NVSTGEILSSVNTSKTILSYEVQAGVFRFIDYQRLLEGEVGYTSNEPVMLCLMSAIETGV 239 + S G+IL SV SKTILS + +FR+++++RLLE E GYT+NEPV + + AIE V Sbjct: 186 STSNGKILKSVYISKTILSQAIDESLFRYVNFKRLLEVETGYTTNEPVHMAVTEAIEKAV 245 Query: 240 IFLINDGIDRGLWDLQNKAERQNDILVK 267 L+ +G+ +W+ + + Q D L+K Sbjct: 246 ESLVLEGLQDNIWE-ADAPKWQVDNLIK 272 >UniRef50_A3XJL0 Putative assembly or transport protein for curli synthesis n=2 Tax=Bacteria RepID=A3XJL0_9FLAO Length = 470 Score = 208 bits (529), Expect = 2e-52, Method: Compositional matrix adjust. 
Identities = 108/216 (50%), Positives = 142/216 (65%), Gaps = 2/216 (0%) Query: 39 LTHLPAPTGKIFVSVYNIQDETGQFKPYP-ASNFSTAVPQSATAMLVTALKDSRWFIPLE 97 LT P P + VYN +D+TGQ+K S FSTAV Q AT ML+ AL+DS+WF P+E Sbjct: 44 LTQFPEPAEPVVAGVYNFKDQTGQYKNVENGSTFSTAVSQGATTMLIKALEDSKWFTPIE 103 Query: 98 RQGLQNLLNERKIIRAAQ-ENGTVAINNRIPLQSLTAANIMVEGSIIGYESNVKSGGVGA 156 R+ L NLLNER IIR+ + E N L L A +++EG II Y+SN+ +GG+GA Sbjct: 104 RENLGNLLNERNIIRSTRDEYRKNNNPNEPNLPPLLYAGVLLEGGIISYDSNIITGGLGA 163 Query: 157 RYFGIGADTQYQLDQIAVNLRVVNVSTGEILSSVNTSKTILSYEVQAGVFRFIDYQRLLE 216 RYFG+G TQY+ D++ V LR V+ S+G +L +V SKTILS + A +FR++++QRLLE Sbjct: 164 RYFGVGGSTQYRQDRLTVYLRAVSTSSGRVLKTVYVSKTILSQAIDASLFRYVNFQRLLE 223 Query: 217 GEVGYTSNEPVMLCLMSAIETGVIFLINDGIDRGLW 252 E GYT NEPV L + AIE V LI +GI LW Sbjct: 224 VETGYTKNEPVQLAMKDAIEKAVESLIIEGIKDNLW 259 >UniRef50_D2LDX9 Curli production assembly/transport component CsgG n=1 Tax=Rhodomicrobium vannielii ATCC 17100 RepID=D2LDX9_RHOVA Length = 397 Score = 194 bits (494), Expect = 2e-48, Method: Compositional matrix adjust. 
Identities = 107/240 (44%), Positives = 143/240 (59%), Gaps = 10/240 (4%) Query: 27 PTLMPRAQSYKDLTHLPAPTGKIFVSVYNIQDETGQFKPYPAS----NFSTAVPQSATAM 82 PT+ P + L LP P + V+VY D+TGQFKP + S AV Q AT++ Sbjct: 44 PTVNPINSTNSKLRELPPPKAPVAVAVYGYGDQTGQFKPVAEGANVQSLSRAVTQGATSI 103 Query: 83 LVTALKDS---RWFIPLERQGLQNLLNERKIIRAAQEN--GTVAINNRIPLQSLTAANIM 137 L+ AL+D+ RWF +ER+ L NLL ER+II ++ G ++ L L A ++ Sbjct: 104 LMKALQDAGNGRWFTVVERERLDNLLKERRIIADMRQRYLGEQVVDPAA-LPPLLFAGVL 162 Query: 138 VEGSIIGYESNVKSGGVGARYFGIGADTQYQLDQIAVNLRVVNVSTGEILSSVNTSKTIL 197 ++G IIGY+SN K+GG GA+YFGIG D +Y D + V LR +V TG++L SV +KTIL Sbjct: 163 IDGGIIGYDSNTKTGGAGAKYFGIGGDVKYSEDTVTVYLRATSVKTGQVLLSVVANKTIL 222 Query: 198 SYEVQAGVFRFIDYQRLLEGEVGYTSNEPVMLCLMSAIETGVIFLINDGIDRGLWDLQNK 257 SY VQ FRF+ Y RL E E G T NEP L + AIE V+ I +G RGLW Q+K Sbjct: 223 SYGVQGSAFRFVTYNRLFEAEGGLTMNEPGSLAVEQAIEKAVLTFIVEGSARGLWSFQDK 282 >UniRef50_D2QPP9 Curli production assembly/transport component CsgG n=2 Tax=Spirosoma linguale DSM 74 RepID=D2QPP9_9SPHI Length = 478 Score = 193 bits (490), Expect = 6e-48, Method: Compositional matrix adjust. 
Identities = 107/268 (39%), Positives = 157/268 (58%), Gaps = 11/268 (4%) Query: 3 RLFLLVAVMLLSGCLT--APPKEAARPTLMPRAQSYKDLTHLPAPTGKIFVSVYNIQDET 60 R+ + +LSGC P R L + L +LP K+ V+VY +D+T Sbjct: 12 RIIPFSCLWVLSGCAAYLHQPTGLQRARLGEETTTTAALRNLPKAKEKVVVAVYKFRDQT 71 Query: 61 GQFK-PYPASNFSTAVPQSATAMLVTALKDSRWFIPLERQGLQNLLNERKIIRAAQENGT 119 GQ+K S FST V Q T +L+ AL++S WF +ER+ + NLLNERKIIR++ Sbjct: 72 GQYKLSETGSTFSTVVSQGTTNILLKALEESGWFTTIERENVSNLLNERKIIRSSVAQYK 131 Query: 120 VAINNRIPLQSLTAANIMVEGSIIGYESNVKSGGVGARYFGIGADTQYQLDQIAVNLRVV 179 N L L A +++EG I+ Y++N+ +GG G RYF G TQY+ D++ V LR V Sbjct: 132 EGEN----LPPLLFAGVILEGGIVSYDANIITGGAGLRYFATGGSTQYRQDRVTVYLRAV 187 Query: 180 NVSTGEILSSVNTSKTILSYEVQAGVFRFIDYQRLLEGEVGYTSNEPVMLCLMSAIETGV 239 +G+IL +V TSKTILS V AG+FR++ +++LLE E G+++NEP + + AIE V Sbjct: 188 ATRSGKILKTVYTSKTILSQSVDAGIFRYVTFKKLLEAETGFSTNEPSQMAVTEAIEKAV 247 Query: 240 IFLINDGIDRGLWDLQNK----AERQND 263 L+ +GI GLW + +K A+R+ D Sbjct: 248 QALVLEGIQDGLWAVSDKDTGVAKRELD 275 >UniRef50_Q1NBA5 Putative curli production assembly/transport component csgg n=3 Tax=Sphingomonadales RepID=Q1NBA5_9SPHN Length = 336 Score = 187 bits (474), Expect = 4e-46, Method: Compositional matrix adjust. 
Identities = 112/285 (39%), Positives = 169/285 (59%), Gaps = 17/285 (5%) Query: 1 MQRLFLLV-AVMLLSGCLT-APPKEAARPTLMPRA------QSYKDLTHLPAPTGKIFVS 52 ++++FL A +LL GC + A P++ P A Q+ + L LP P + ++ Sbjct: 2 IRQIFLASGAALLLGGCNSLATTGRDDIPSMNPMAVYPRYTQAQRQLMDLPPPQRPVAIA 61 Query: 53 VYNIQDETGQFKPYPASN--FSTAVPQSATAMLVTALKDS---RWFIPLERQGLQNLLNE 107 VYN D+TGQ++ S AV Q A ++LV AL+D+ +WF +ER+ L+NLLNE Sbjct: 62 VYNFSDQTGQYRVGEGGTQTLSRAVTQGAASILVRALQDAGNRKWFTIVEREQLRNLLNE 121 Query: 108 RKIIRAAQENGTVAINNRIP--LQSLTAANIMVEGSIIGYESNVKSGGVGARYFGIGADT 165 R+IIR +E + NN P L ++ A +++EG II +++N +GG GA + GIGA T Sbjct: 122 RQIIREMRER-YLGENNVNPQALPAMLFAGVLLEGGIISFDTNTVTGGAGASFLGIGAST 180 Query: 166 QYQLDQIAVNLRVVNVSTGEILSSVNTSKTILSYEVQAGVFRFIDYQRLLEGEVGYTSNE 225 QY+ D + V LR ++V TGE+L++V SKTI S + A FRF+ ++ LLE EVG T+NE Sbjct: 181 QYRQDTVTVYLRAISVRTGEVLTTVTASKTIASQSLGASAFRFVGFKELLEAEVGMTTNE 240 Query: 226 PVMLCLMSAIETGVIFLINDGIDRGLWDLQNKAERQNDILVKYRH 270 P + L AIE V L+ +GID LW+ + + +L +YR Sbjct: 241 PDHIALQQAIEKAVYGLVMEGIDLNLWNFAD-TQAGWPMLWRYRQ 284 >UniRef50_Q1YMP1 Putative Curli production assembly/transport component n=1 Tax=Aurantimonas manganoxydans SI85-9A1 RepID=Q1YMP1_MOBAS Length = 400 Score = 171 bits (432), Expect = 3e-41, Method: Compositional matrix adjust. 
Identities = 107/260 (41%), Positives = 147/260 (56%), Gaps = 9/260 (3%) Query: 6 LLVAVMLLSGCLTAPPKEAARPTLMPRAQSYKDLTHLPAPTGKIFVSVYNIQDETGQFKP 65 ++VA+ LSGC+T P + P ++ DL +P P K V+VY +D TGQFK Sbjct: 16 MIVALASLSGCVTQEAFTDTPPVIAPVSRPNDDLRRVPPPRQKTVVAVYGYEDLTGQFKE 75 Query: 66 YP-ASNFSTAVPQSATAMLVTALKDS---RWFIPLERQGLQNLLNERKII---RAAQENG 118 + S AV Q +ML+ AL+D+ RWF LER L NLL ER+II R N Sbjct: 76 RENVQSLSRAVTQGGASMLIQALQDAGERRWFTVLERAELDNLLKERQIITEMRRLYRNE 135 Query: 119 TVAINNRIPLQSLTAANIMVEGSIIGYESNVKSGGVGARYFGIGADTQYQLDQIAVNLRV 178 T +P L ANI++EG IIGY++N+ +GGVGA + GI ADT+Y D + V LR Sbjct: 136 TQLDPKVVP--PLLHANIIIEGGIIGYDTNIMTGGVGAGFLGISADTKYIHDVVTVTLRA 193 Query: 179 VNVSTGEILSSVNTSKTILSYEVQAGVFRFIDYQRLLEGEVGYTSNEPVMLCLMSAIETG 238 V+ TGE+L++V K + SY +Q G FR++ L E G T NEP + + SAIE Sbjct: 194 VSTKTGEVLTTVTVRKAVASYALQGGAFRYVKIDELFMAEAGVTYNEPKQIAVQSAIEKA 253 Query: 239 VIFLINDGIDRGLWDLQNKA 258 V LI +G D +W+ + A Sbjct: 254 VEGLIVEGADLSIWEFSDPA 273 >UniRef50_B6QZQ0 Curli fiber membrane-associated lipoprotein CsgG n=1 Tax=Pseudovibrio sp. JE062 RepID=B6QZQ0_9RHOB Length = 316 Score = 168 bits (426), Expect = 2e-40, Method: Compositional matrix adjust. 
Identities = 95/268 (35%), Positives = 148/268 (55%), Gaps = 12/268 (4%) Query: 1 MQRLFLLVAVMLLSGCLTAPPKEAA---RPTLMPRAQSYKDLTHLPAPTGKIFVSVYNIQ 57 M++ ++ + LS C P A P L + L LP P ++ +VY+ Q Sbjct: 2 MKKSIVVAMALSLSAC--GPVTRMAFSPGPQLATVSVQASKLKKLPPPKEPVYAAVYSYQ 59 Query: 58 DETGQFKPY-PASNFSTAVPQSATAMLVTALKDS---RWFIPLERQGLQNLLNERKIIRA 113 D TGQ+KP S +V Q A MLV AL+D+ +WF +ER L LL ER+I+ Sbjct: 60 DLTGQYKPSDKVQTLSRSVTQGADTMLVRALQDAGDRKWFRVVERGNLDALLKERQIVTQ 119 Query: 114 AQEN--GTVAINNRIPLQSLTAANIMVEGSIIGYESNVKSGGVGARYFGIGADTQYQLDQ 171 ++ G ++ ++ L L A ++ EG ++GY+SN ++GG GARY GIG + Y+ D Sbjct: 120 IRKIYLGEDKVDPKV-LPPLLYAGVLFEGGVVGYDSNTRTGGAGARYLGIGGNADYRQDD 178 Query: 172 IAVNLRVVNVSTGEILSSVNTSKTILSYEVQAGVFRFIDYQRLLEGEVGYTSNEPVMLCL 231 + V+LR V+ TGE++++V K+++S ++ G FR+I +LE E G T NEPV L + Sbjct: 179 VTVSLRAVSTRTGEVMANVMVQKSVVSVGLKGGAFRYIALDEILEAEAGITKNEPVTLAV 238 Query: 232 MSAIETGVIFLINDGIDRGLWDLQNKAE 259 AIE V ++ +G G W +N A+ Sbjct: 239 QQAIEKAVYAIVMEGARVGAWSFENPAQ 266 >UniRef50_B3QD06 Curli production assembly/transport component CsgG n=4 Tax=Alphaproteobacteria RepID=B3QD06_RHOPT Length = 291 Score = 154 bits (389), Expect = 3e-36, Method: Compositional matrix adjust. 
Identities = 98/274 (35%), Positives = 145/274 (52%), Gaps = 28/274 (10%) Query: 3 RLFLLVAVML---LSGC-LTAPPKEAARP--TLMPRAQSYKDLTHLPAPTGKIFVSVYNI 56 R +L V+L L GC +T K+ P T++ ++ L LP PT K+ V+VYN Sbjct: 4 RAGVLACVLLAVSLGGCAITGTDKDPVTPPATMVASTKTGVVLEQLPPPTKKLDVAVYNF 63 Query: 57 QDETGQFKPYPA-SNFSTAVPQSATAMLVTAL---KDSRWFIPLERQGLQNLLNERKIIR 112 D TGQ K + FS AV Q +A+L L WF ER LQ LL ER+II+ Sbjct: 64 PDLTGQNKSNDNFAEFSRAVTQGGSAILTDVLLTAGGGHWFDVAERADLQPLLQERQIIQ 123 Query: 113 AAQENGTVAINNRIPLQSLTA--------ANIMVEGSIIGYESNVKSGGVGARYFGIGAD 164 N R LQ A A +++EG I+GY++N +GG+GA Y G+G + Sbjct: 124 ----------NTRSALQGEKAQSLPPLRFAGVLLEGGIVGYDTNETTGGIGANYLGLGGN 173 Query: 165 TQYQLDQIAVNLRVVNVSTGEILSSVNTSKTILSYEVQAGVFRFIDYQRLLEGEVGYTSN 224 QY+ D + V+LR V+V TG +L++V T+K I S V F+++ LL+ + G+T N Sbjct: 174 MQYRQDIVTVSLRAVSVQTGRVLAAVTTTKIIYSVNVSGSGFKYVAIDSLLQADAGFTKN 233 Query: 225 EPVMLCLMSAIETGVIFLINDGIDRGLWDLQNKA 258 P L + I+ V LI +G+ + LW+ ++ A Sbjct: 234 SPTTLAVREGIQLAVYSLIFEGVKKELWNFKDPA 267 >UniRef50_B3QHQ8 Curli production assembly/transport component CsgG n=6 Tax=Rhizobiales RepID=B3QHQ8_RHOPT Length = 313 Score = 154 bits (388), Expect = 4e-36, Method: Compositional matrix adjust. 
Identities = 82/225 (36%), Positives = 130/225 (57%), Gaps = 6/225 (2%) Query: 38 DLTHLPAPTGKIFVSVYNIQDETGQFKPYP-ASNFSTAVPQSATAMLVTALKDS---RWF 93 +L LP P KI +++Y D TG+ +P + FS AV Q + + ALK + WF Sbjct: 46 NLETLPPPKQKIDIAIYQFPDLTGKNEPNDNVAVFSRAVTQGGAGLAIDALKRAGGGAWF 105 Query: 94 IPLERQGLQNLLNERKIIRAAQENGTVAINNRIPLQSLTAANIMVEGSIIGYESNVKSGG 153 +ER GL +LL ER+++RA ++ + PL + A +++EG I+ +++N +GG Sbjct: 106 RVVERNGLNDLLQERQLVRATRQE--FDRDRAKPLPPMRFAGLLIEGGIVAFDANYMTGG 163 Query: 154 VGARYFGIGADTQYQLDQIAVNLRVVNVSTGEILSSVNTSKTILSYEVQAGVFRFIDYQR 213 +GA Y GIGADT+++ D + V LRV +V TGE+L+SV T+KT+ S +Q ++++ + Sbjct: 164 IGANYLGIGADTKFRRDMVTVALRVASVQTGEVLTSVTTTKTVYSVSLQGNTYKYVALDK 223 Query: 214 LLEGEVGYTSNEPVMLCLMSAIETGVIFLINDGIDRGLWDLQNKA 258 LL+ E G T EP L + AI+ V I +G LW + A Sbjct: 224 LLQIEAGITRTEPTQLAVRQAIDLAVYSTIMEGARDKLWRFADPA 268 >UniRef50_Q1QYN5 Curli production assembly/transport component CsgG n=1 Tax=Chromohalobacter salexigens DSM 3043 RepID=Q1QYN5_CHRSD Length = 271 Score = 75.1 bits (183), Expect = 3e-12, Method: Compositional matrix adjust. 
Identities = 74/251 (29%), Positives = 121/251 (48%), Gaps = 22/251 (8%) Query: 15 GCL---TAP--PKEAAR--PTLMPRAQSYKDLTHLPAPTGKIFVSVYNIQDETGQFKPYP 67 GC TAP P+E +R P + L+ P + VSV I D TGQ Y Sbjct: 21 GCASIGTAPVAPREGSRVMTNHTPYTRCLSALSQQPGENLPV-VSVGQILDRTGQVS-YS 78 Query: 68 ASNFSTAVPQSATAMLVTALKDSRWFIPLERQGLQNLLNERKIIRA-AQENGTVAINNRI 126 S + Q + ML++AL +R ER ++ L ER++ A A + A+N Sbjct: 79 TITESRVLTQGVSEMLISALYKTRKVRLAERLDIRIPLAERQLKDAGAMQRAPAALN--- 135 Query: 127 PLQSLTAANIMVEGSIIGYESNVKSGGVGAR-YFG-IGADTQYQLDQIAVNLRVVNVSTG 184 + N ++ G++ N+ + G AR Y G IGA + + + ++LRVV+ +T Sbjct: 136 ----VQPVNFVILGALTELNYNILTQG--ARLYVGLIGASNREAVINVGLDLRVVDATTF 189 Query: 185 EILSSVNTSKTILSYEVQAGVFRFIDYQRLLEGEVGYTSNEPVMLCLMSAIETGVIFLIN 244 + + + K I+ +V+AGV+RF D Q L+E + G NEP+ L + S +E ++ Sbjct: 190 QTVYVTSLQKQIVGNQVEAGVYRFFDNQ-LVEFDAGTVRNEPLQLGVRSVVEMAAYQILT 248 Query: 245 DGIDRGLWDLQ 255 +G+ + D Q Sbjct: 249 EGLGLPINDTQ 259 >UniRef50_C5D1Z0 Putative uncharacterized protein n=1 Tax=Variovorax paradoxus S110 RepID=C5D1Z0_VARPS Length = 467 Score = 60.5 bits (145), Expect = 5e-08, Method: Compositional matrix adjust. 
Identities = 69/262 (26%), Positives = 117/262 (44%), Gaps = 44/262 (16%) Query: 15 GCLTAP-----PKEA-------ARPTLMPRAQSYKDLTHLPAPTGK--IFVSVYNIQDET 60 GC+TAP P EA R + P A T + I ++V +++D T Sbjct: 31 GCVTAPQQRMAPNEAPIVLGPAVRENVTPMEVVLACFGDHVAATQRQPIVITVGDVKDYT 90 Query: 61 GQFKPYPASNFSTAVPQSATAMLVTAL-------KDSRWFIPL--ERQ---GLQNLLNER 108 G++ + N A+ Q M+ +AL + + F P+ ER+ + L + Sbjct: 91 GKY----SINEGNAITQGGALMVYSALGKLGGAVQAAERFDPVIAERELGYADRRQLGDG 146 Query: 109 KIIRAAQENGTVAINNRIPL-----QSLTAANIMVEGSIIGYESNVKSGG--VGARYFGI 161 + + A NG +P S+ ++ + G I N+ SGG +G G+ Sbjct: 147 RTHQLAGPNG----GQTVPWLPYFGGSINKSDYFIVGGITELNYNIHSGGGEIGVNQIGV 202 Query: 162 GADTQYQLDQIAVNLRVVNVSTGEILSSVNTSKTILSYEVQAGVFRFIDYQRLLEGEVGY 221 A T Q + V+LR+V+ T I+ +++ +K YEV VFRF +L + +VG Sbjct: 203 KARTFSQ--SVGVDLRIVDTKTLMIVKTISLTKQFNGYEVGFNVFRFFG-SKLYDIDVGA 259 Query: 222 TSNEPVMLCLMSAIETGVIFLI 243 EPV + + +A+E GV+ L+ Sbjct: 260 KGQEPVQMGVRAALEEGVVRLV 281 >UniRef50_C8R0R5 Curli production assembly/transport component CsgG n=1 Tax=Desulfurivibrio alkaliphilus AHT2 RepID=C8R0R5_9DELT Length = 247 Score = 58.2 bits (139), Expect = 3e-07, Method: Compositional matrix adjust. 
Identities = 55/215 (25%), Positives = 92/215 (42%), Gaps = 34/215 (15%) Query: 4 LFLLVAVMLLSGCLTAPPKEAARPTLMPRAQSYKDLTHLPAPTGKIFVSVYNIQDET--- 60 FLL A + TA + RA+ Y H P K ++V + +D+T Sbjct: 12 FFLLAACSFIIPAATAQAQSGGPDMGTARAEDY----HGP----KAAIAVADFEDKTVGR 63 Query: 61 GQFKPYPASNFSTAVPQSATAMLVTALKDSRWFIPLERQGLQNLLNERKIIRAAQENGTV 120 GQ++ MLVT L ++ FI LER+ L ++ E+ + +G Sbjct: 64 GQYRREYGRGMQD--------MLVTELFNTNRFIVLEREKLSAVIAEQDL----GASGRF 111 Query: 121 AINNRIPLQSLTAANIMVEGSIIGYESNV---------KSGGVGARYFGIGADTQYQLDQ 171 + P+ L A +MV ++ G++ +SG +G R + A Q Sbjct: 112 RQDTTAPIGELEGAQLMVIAAVTGFDPGTSGTKGTVRGRSGLLGDRLGSLTAGVQQA--H 169 Query: 172 IAVNLRVVNVSTGEILSSVNTSKTILSYEVQAGVF 206 +A++LRVV+ +TG ILS+ N S+++ F Sbjct: 170 VALDLRVVDTATGRILSATNVEGKARSFDLGGSAF 204 >UniRef50_A7HWH6 HfaB protein n=1 Tax=Parvibaculum lavamentivorans DS-1 RepID=A7HWH6_PARL1 Length = 342 Score = 57.8 bits (138), Expect = 4e-07, Method: Compositional matrix adjust. Identities = 69/267 (25%), Positives = 118/267 (44%), Gaps = 39/267 (14%) Query: 4 LFLLVAVMLLSGCLTAPPKEAAR--------PTLM---PRAQSYKDLT-HLPAPTGKIFV 51 L L L+GC++A R P + P + + + L H+P+ + Sbjct: 19 LGALAMAFALTGCVSANAGSDGRYVAPIGNAPVITNETPYSSALRCLAGHVPSAANTTRI 78 Query: 52 SVYNIQDETGQFKPYPASNFSTAVPQSATAMLVTALKDSRWFIPL-ER-----QGLQNLL 105 +V NI+D TG+ + A V Q A+ M ++AL + +PL ER L+ Sbjct: 79 AVGNIRDYTGKAE---ADGTGMKVTQGASLMAMSAL--GKAGVPLVERYDTSISELEMKY 133 Query: 106 NERKIIRAAQENGTVAINNRIPLQSLTAANIMVEGSIIGYESNVKSGGVGARYFGIGAD- 164 K+I A + +I + ++ + G I N++SGG+ Y G D Sbjct: 134 TNNKLIGDAVD----GEYRKIYAGEIRGSDYYLVGGITELNFNIQSGGINGSY-AEGGDM 188 Query: 165 --------TQYQLDQIAVNLRVVNVSTGEILSSVNTSKTILSYEVQAGVFRFIDYQRLLE 216 Y ++ I ++LR+VN T E++ V+ K I+ E++AGVF F L Sbjct: 189 GAAANASAATYVMN-IGLDLRLVNSRTLEVVDYVSYQKQIVGREIKAGVFDFFG-GNLFS 246 Query: 217 GEVGYTSNEPVMLCLMSAIETGVIFLI 243 VG ++ EP+ L + + +E V+ L+ Sbjct: 247 LGVGSSAQEPIQLAVRAVVERAVLKLL 273 >UniRef50_Q0AL09 HfaB protein n=1 Tax=Maricaulis maris MCS10 RepID=Q0AL09_MARMM Length = 287 Score = 53.5 bits (127), 
Expect = 7e-06, Method: Compositional matrix adjust. Identities = 59/217 (27%), Positives = 96/217 (44%), Gaps = 23/217 (10%) Query: 38 DLTHLPAPTGKIFVSVYNIQDETGQFKPYPASNFSTAVPQSATAMLVTALKDSRWFIPLE 97 D A G+ ++V + D TG+F S+ V TA++V + D F E Sbjct: 53 DCLSRTAMAGRPRIAVGEVNDLTGRF-----SSLDGTVATQGTALMVMSALDRAGFPLAE 107 Query: 98 R------QGLQNLLNERKIIRAAQ--ENGTVAINNRIPLQSLTAANIMVEGSIIGYESNV 149 R Q N R I Q EN R+ S+ ++ ++ G + N+ Sbjct: 108 RLDTRVAQQELEFANSRLIGPDGQPTEN-----YRRVMAGSIAGSDYIILGGVTELNFNL 162 Query: 150 KSGGVGARYFG--IGADTQYQLDQIAVNLRVVNVSTGEILSSVNTSKTILSYEVQAGVFR 207 SG AR G IG Y + + ++LR+V+ + ++ V+ SK I +E++AGVF Sbjct: 163 HSGVAEAR-IGPLIGGRRHYAM-TVGLDLRLVDATDLRVIDIVSQSKVIRGHEIRAGVFE 220 Query: 208 FIDYQRLLEGEVGYTSNEPVMLCLMSAIETGVIFLIN 244 FI L G +G EPV + + +E+ V L++ Sbjct: 221 FIGDTTLDIG-MGERVQEPVHTAIRTIVESAVFDLVS 256 >UniRef50_C5SPX6 HfaB protein n=1 Tax=Asticcacaulis excentricus CB 48 RepID=C5SPX6_9CAUL Length = 316 Score = 53.5 bits (127), Expect = 8e-06, Method: Compositional matrix adjust. 
Identities = 67/274 (24%), Positives = 122/274 (44%), Gaps = 49/274 (17%) Query: 1 MQRLFLLVAVMLLSGC--LTAPPKEA--ARP--------TLMPRA------QSYKDLTHL 42 M+RL + +LLS C +TA P+ A+P L P + Q +TH Sbjct: 1 MRRLVFCLP-LLLSACASVTADPQTGLYAKPVGNAPATANLTPYSADLTCLQQAALVTHK 59 Query: 43 PAPTGKIFVSVYNIQDETGQFKPYPASNFSTAVPQSATAMLVTALKDSRWFIPLERQGLQ 102 P P V+V I D TG+ Y +N + + A + L SR +P +G + Sbjct: 60 PLPR----VAVSRIDDLTGKRDFYTGANITQGIALFAQSAL------SRAGLPQVERGDR 109 Query: 103 NLLNERKIIRAAQEN---------GTVAINNR-IPLQSLTAANIMVEGSIIGYESNVKSG 152 ++ + ++AA ++ G N R + + ++ + G + N++S Sbjct: 110 DISDYE--LKAAMDHVLSDTPDQAGNDPDNFRKVYAGQIAGSDYYISGGLTELNYNIRSD 167 Query: 153 GVGARYFGIGAD-------TQYQLDQIAVNLRVVNVSTGEILSSVNTSKTILSYEVQAGV 205 GV R G G+D ++ + IA++LR+++ + EI K I+ +EV+ G+ Sbjct: 168 GVDLRAGGTGSDDPAGSFVSRRFVMNIALDLRLIDTRSQEIKRVTAYQKQIVGHEVKPGL 227 Query: 206 FRFIDYQRLLEGEVGYTSNEPVMLCLMSAIETGV 239 F +D +L+ G++ EP+ L + + +E V Sbjct: 228 FTLLD-GTMLDLSGGFSEMEPIQLGVRTLVERSV 260 >UniRef50_B4W728 Putative uncharacterized protein n=1 Tax=Brevundimonas sp. BAL3 RepID=B4W728_9CAUL Length = 344 Score = 50.4 bits (119), Expect = 6e-05, Method: Compositional matrix adjust. Identities = 54/214 (25%), Positives = 96/214 (44%), Gaps = 30/214 (14%) Query: 51 VSVYNIQDETGQFKPYPASNFSTA--VPQSATAMLVTAL---------KDSRWFIPLERQ 99 ++V +I D TG+ ++ T + Q A+ VTAL + R +ERQ Sbjct: 59 LAVGDIADLTGR------NDLETGRKISQGASLFAVTALTKAGVPTVERQDRGVSEVERQ 112 Query: 100 GLQNLLNERKIIRAAQENGTVAINNR-IPLQSLTAANIMVEGSIIGYESNVKSGGVGARY 158 Q+ L + Q G A N R I + + V G + N++S GV A Sbjct: 113 YAQSHL----LSDTPQAAGESAENFRPIYAGQIAGSRYYVVGGVTELNYNLRSSGVDASA 168 Query: 159 FGI-------GADTQYQLDQIAVNLRVVNVSTGEILSSVNTSKTILSYEVQAGVFRFIDY 211 GI G + + IA++LR+V+ + E++++ + K ++ E++ GVF F + Sbjct: 169 GGIEASGVKGGLTSSGYVMNIAIDLRLVDTRSQEVVATASYQKQLVGREIRVGVFDFT-H 227 Query: 212 QRLLEGEVGYTSNEPVMLCLMSAIETGVIFLIND 245 + + G + EP+ + +AIE G+ + D Sbjct: 228 GNIFDLSAGASGMEPIQFAVRTAIERGLYDFVAD 261 >UniRef50_A6GPY8 HfaB protein n=1 Tax=Limnobacter sp. 
MED105 RepID=A6GPY8_9BURK Length = 370 Score = 49.3 bits (116), Expect = 1e-04, Method: Compositional matrix adjust. Identities = 32/116 (27%), Positives = 58/116 (50%), Gaps = 1/116 (0%) Query: 130 SLTAANIMVEGSIIGYESNVKSGGVGARYFGIGADTQYQLDQIAVNLRVVNVSTGEILSS 189 SL ++ + G I N++SG + + IGA +Y + +AV+LRVVN T E++++ Sbjct: 146 SLPGSDYHIVGGITEVNYNIRSGSLESSIRFIGAAARYFVMNVAVDLRVVNTKTLEVVNT 205 Query: 190 VNTSKTILSYEVQAGVFRFIDYQRLLEGEVGYTSNEPVMLCLMSAIETGVIFLIND 245 + K I+ E+ G FR L++ + EP+ + IE V ++ + Sbjct: 206 QSLQKQIIGTELNGGYFRLFS-DGLVDVSAAERTQEPIQKGVRMVIEHAVFNMLTE 260 >UniRef50_A4CEI5 Putative uncharacterized protein n=1 Tax=Pseudoalteromonas tunicata D2 RepID=A4CEI5_9GAMM Length = 490 Score = 49.3 bits (116), Expect = 1e-04, Method: Compositional matrix adjust. Identities = 43/178 (24%), Positives = 77/178 (43%), Gaps = 12/178 (6%) Query: 71 FSTAVPQSATAMLVTALKDSRWFIPLERQGLQNLLNERKIIRAAQENGTVAINNRIPLQS 130 +S + + L TAL + FI LERQ L +L+E+ ++ A G V+ N+ Sbjct: 67 WSKEIGNGMSDQLTTALVGTNRFIVLERQALDAVLSEQDLVTA----GRVSANSGAAFGE 122 Query: 131 LTAANIMVEGSIIGYESNVKSGGVGARYFGIGADTQ-----YQLDQIAVNLRVVNVSTGE 185 + A I++ S+ ++ + +G F IG Q +A++LR+++ T Sbjct: 123 IEGAEIVIVASVTEFDDDASGARIGGMGF-IGDMVQSVSAGLSNTHMAIDLRLIDTRTSR 181 Query: 186 ILSSVNTSKTILSYEVQAGVFRFIDYQRLLEGEVGYTSNEPVMLCLMSAIETGVIFLI 243 IL++ +++ A F +L G + SN P L I+ V F + Sbjct: 182 ILAATTVEGGSKDFDIAAAATNF--GSSILGGNLSAWSNTPKEKALREIIQKAVEFTL 237 >UniRef50_Q0IE08 Curli production assembly/transport component CsgG subfamily protein n=1 Tax=Synechococcus sp. CC9311 RepID=Q0IE08_SYNS3 Length = 301 Score = 47.8 bits (112), Expect = 4e-04, Method: Compositional matrix adjust. 
Identities = 42/154 (27%), Positives = 74/154 (48%), Gaps = 18/154 (11%) Query: 39 LTHLPAPTGKIFVSVYNIQDETGQFKPYPASNFSTAVPQSATAMLVTALKDSRWFIPLER 98 + PT VSV + ++E GQ + S V + L L + +ER Sbjct: 51 FASIAGPT----VSVPDFKNEVGQLAWW-----SPRVSRQLADALSNELSAAGGLTVVER 101 Query: 99 QGLQNLLNERKIIRAAQENGTVAINNRIPLQS-LTAANIMVEGSIIGYESNV--KSGGVG 155 Q ++ +L+E+++ E G V N+R +T + ++ G + G+E+NV K G G Sbjct: 102 QNVRAVLSEQEM----AELGIVRNNDRAAKSGQMTGSQYVILGRVSGFENNVETKQSGSG 157 Query: 156 ARYFGIGA--DTQYQLDQIAVNLRVVNVSTGEIL 187 R+ G G D ++++LRVV+ +TGE++ Sbjct: 158 MRFLGFGGSKDVAETKAYVSLDLRVVDTTTGEVV 191 >UniRef50_C7YQN8 Predicted protein n=1 Tax=Nectria haematococca mpVI 77-13-4 RepID=C7YQN8_NECH7 Length = 1209 Score = 43.1 bits (100), Expect = 0.009, Method: Compositional matrix adjust. Identities = 27/134 (20%), Positives = 66/134 (49%), Gaps = 7/134 (5%) Query: 144 GYESNVKSGGVGARYFGI--GADTQYQLDQIAVNLRVVNVSTGEILSSVNTSKTILSYEV 201 G+++ + G +G R+ G + +L+ + V +++NV+ G++ +++ +KT+ S + Sbjct: 867 GFDAKLDEGRLGRRFSKALSGLRSNTRLEHLRVRSQMLNVNIGDLAEAISANKTLHSLDC 926 Query: 202 QAGVFRFIDYQRLLEGEVGYTSNEPVMLCLMSAIETGVIFLINDGIDRGLWDLQNKAERQ 261 + F +++ L V + S+ + + + + I +D + + A RQ Sbjct: 927 EGNDFNLSNFRHL----VKHLSDSTTIRYFSAFSDRDLAQTIQKSVDNAVAAATSTARRQ 982 Query: 262 NDILVKYRHMSVPP 275 + ++ + RH VPP Sbjct: 983 S-MMSRLRHDKVPP 995 >UniRef50_Q05S73 Putative uncharacterized protein n=1 Tax=Synechococcus sp. RS9916 RepID=Q05S73_9SYNE Length = 256 Score = 43.1 bits (100), Expect = 0.011, Method: Compositional matrix adjust. 
Identities = 36/123 (29%), Positives = 60/123 (48%), Gaps = 8/123 (6%) Query: 71 FSTAVPQSATAMLVTALKDSRWFIPLERQGLQNLLNERKIIRAAQENGTVAINNRIPLQS 130 +S V + T ML LK + F +ER GL+ +L+E+++ + A + Sbjct: 55 WSPRVSKQLTDMLSNELKATGNFTLVERAGLKKVLDEQELAELGITRQSTAPKRGM---- 110 Query: 131 LTAANIMVEGSIIGYE--SNVKSGGVGARYFGIGADTQYQLDQ--IAVNLRVVNVSTGEI 186 +T A V G++ Y+ + KSGG G G G + +A+++RVV+ +TGEI Sbjct: 111 VTGAKYYVLGAVSDYQQGTETKSGGGGFNIMGFGQRKSSSESKAYVALDVRVVDTTTGEI 170 Query: 187 LSS 189 S Sbjct: 171 AYS 173 >UniRef50_P27343 Putative transcription activator protein hfaB n=6 Tax=Alphaproteobacteria RepID=HFAB_CAUCR Length = 337 Score = 42.4 bits (98), Expect = 0.016, Method: Compositional matrix adjust. Identities = 33/116 (28%), Positives = 58/116 (50%), Gaps = 10/116 (8%) Query: 133 AANIMVEGSIIGYESNVKSGGVGARYFGI----GADTQYQ----LDQIAVNLRVVNVSTG 184 ++ V G I N++S G+ A Y G G ++ + IA++LR+VN T Sbjct: 155 GSDFYVIGGITELNYNIRSAGIDA-YAGDKDTDGLKGNFRRRVFIMNIALDLRLVNTRTL 213 Query: 185 EILSSVNTSKTILSYEVQAGVFRFIDYQRLLEGEVGYTSNEPVMLCLMSAIETGVI 240 E++ ++ K ++ EV AGVF F++ L + G + EP+ L + + IE + Sbjct: 214 EVVDVISYQKQVVGREVSAGVFDFLN-GNLFDISAGRGALEPMQLAVRALIERATV 268 >UniRef50_A2CA24 Uncharacterized protein involved in formation of curli polymers n=3 Tax=Cyanobacteria RepID=A2CA24_PROM3 Length = 275 Score = 42.0 bits (97), Expect = 0.023, Method: Compositional matrix adjust. 
Identities = 31/115 (26%), Positives = 56/115 (48%), Gaps = 11/115 (9%) Query: 96 LERQGLQNLLNERKIIRAA--QENGTVAINNRIPLQSLTAANIMVEGSIIGYESNV--KS 151 +ERQ L+ +L+E+++ + +G A + + + A ++ G + GYE V K Sbjct: 73 VERQNLKAVLSEQELAELGIVRNDGDAARS-----RQMRGARYLIMGRVSGYEDGVETKQ 127 Query: 152 GGVGARYFGIGADTQYQLDQ--IAVNLRVVNVSTGEILSSVNTSKTILSYEVQAG 204 G G R+ G G + ++++LRVV+ STGE++ + S Q G Sbjct: 128 SGSGMRFMGFGGSKTVSESKAYVSIDLRVVDSSTGEVVGARTVEGRATSTAKQKG 182 >UniRef50_A1BH58 Curli production assembly/transport component CsgG n=3 Tax=Chlorobium/Pelodictyon group RepID=A1BH58_CHLPD Length = 240 Score = 40.0 bits (92), Expect = 0.080, Method: Compositional matrix adjust. Identities = 38/130 (29%), Positives = 60/130 (46%), Gaps = 12/130 (9%) Query: 82 MLVTALKDSRWFIPLERQGLQNLLNERKIIRAAQENGTVAINNRIPLQSLTAANIMVEGS 141 ML++ L + F LER L ++ E+ + +G + R L +T A +V + Sbjct: 54 MLISELASTNSFRVLERNELDAVIREQDL----GASGRINPGTRSKLGKITGAKYLVAAT 109 Query: 142 IIGYESNVKSGGVGARY----FGIGADTQYQLDQIAVNLRVVNVSTGEILSSVNTSKTIL 197 + +E N GG G Y G D Y +AV+L+V++V TGEI + T Sbjct: 110 VSAFEHNTSGGGGGISYGGISLGGKQDKAY----MAVDLKVIDVQTGEIYDARTVEATSK 165 Query: 198 SYEVQAGVFR 207 S + GV+R Sbjct: 166 SSGISVGVYR 175 >UniRef50_B8HPG7 CsgG family protein, putative n=1 Tax=Cyanothece sp. PCC 7425 RepID=B8HPG7_CYAP4 Length = 330 Score = 40.0 bits (92), Expect = 0.092, Method: Compositional matrix adjust. 
Identities = 36/119 (30%), Positives = 61/119 (51%), Gaps = 11/119 (9%) Query: 73 TAVPQSATAMLVTAL--KDSRWFIPLERQGLQNLLNERKIIRAAQ-ENGTVAINNRIPLQ 129 +A P A + L+T L KD + + +ER + +L E+ + +A + E T A RI Sbjct: 68 SAGPSKAVSTLLTNLLVKDGTYVV-VERSRIDAVLAEQNLGQAGRIEPTTAAQVGRI--- 123 Query: 130 SLTAANIMVEGSIIGYESNVKSGGVGARYFGIGADTQYQLDQIAVNLRVVNVSTGEILS 188 + +V GS+ + K GGV G G+ + Q+ ++ + R+V+ +TGEILS Sbjct: 124 --LGVDAVVIGSVTEFGLEQKKGGV--NILGFGSQKETQIARVQLAARIVSTTTGEILS 178 Searching..................................................done Results from round 2 Score E Sequences producing significant alignments: (bits) Value Sequences used in model and found again: UniRef50_A8AI45 Curli production assembly/transport component cs... 408 e-112 UniRef50_A6EMN0 Putative assembly or transport protein for curli... 327 4e-88 UniRef50_C6X5S1 Curli production assembly/transport component Cs... 321 2e-86 UniRef50_Q3KES0 Assembly/transport component in curli production... 321 2e-86 UniRef50_A5FHE8 Curli production assembly/transport component Cs... 320 2e-86 UniRef50_Q084E4 Curli production assembly/transport component Cs... 318 2e-85 UniRef50_Q392B7 Curli production assembly/transport component Cs... 317 2e-85 UniRef50_Q0TJ37 Curli production assembly/transport component Cs... 308 1e-82 UniRef50_D2QPP9 Curli production assembly/transport component Cs... 307 2e-82 UniRef50_Q1NBA5 Putative curli production assembly/transport com... 303 4e-81 UniRef50_A3XJL0 Putative assembly or transport protein for curli... 299 8e-80 UniRef50_B6QZQ0 Curli fiber membrane-associated lipoprotein CsgG... 285 1e-75 UniRef50_A9L3G5 Curli production assembly/transport component Cs... 280 3e-74 UniRef50_D2LDX9 Curli production assembly/transport component Cs... 271 2e-71 UniRef50_Q1YMP1 Putative Curli production assembly/transport com... 267 4e-70 UniRef50_B3QD06 Curli production assembly/transport component Cs... 261 2e-68 UniRef50_B3QHQ8 Curli production assembly/transport component Cs... 
261 2e-68
UniRef50_Q1QYN5 Curli production assembly/transport component Cs... 204 3e-51
UniRef50_A7HWH6 HfaB protein n=1 Tax=Parvibaculum lavamentivoran... 200 4e-50
UniRef50_C5SPX6 HfaB protein n=1 Tax=Asticcacaulis excentricus C... 194 4e-48
UniRef50_C5D1Z0 Putative uncharacterized protein n=1 Tax=Variovo... 180 5e-44
UniRef50_B4W728 Putative uncharacterized protein n=1 Tax=Brevund... 173 6e-42
UniRef50_Q0AL09 HfaB protein n=1 Tax=Maricaulis maris MCS10 RepI... 173 7e-42
UniRef50_A6GPY8 HfaB protein n=1 Tax=Limnobacter sp. MED105 RepI... 172 1e-41
UniRef50_A4CEI5 Putative uncharacterized protein n=1 Tax=Pseudoa... 160 5e-38
UniRef50_C8R0R5 Curli production assembly/transport component Cs... 152 1e-35
UniRef50_Q0IE08 Curli production assembly/transport component Cs... 116 9e-25

Sequences not found previously or not previously below threshold:

UniRef50_C6XNU9 Holdfast attachment protein HfaB n=3 Tax=Alphapr... 157 4e-37
UniRef50_P27343 Putative transcription activator protein hfaB n=... 149 9e-35
UniRef50_C5SPX7 HfaB protein n=2 Tax=Caulobacteraceae RepID=C5SP... 147 3e-34
UniRef50_B9L5Z4 Putative curli production assembly/transport com... 112 2e-23
UniRef50_A2CA24 Uncharacterized protein involved in formation of... 101 2e-20
UniRef50_D0SYN2 Predicted protein n=1 Tax=Acinetobacter lwoffii ... 99 2e-19
UniRef50_A1BH58 Curli production assembly/transport component Cs... 96 1e-18
UniRef50_P73111 Sll1835 protein n=3 Tax=Chroococcales RepID=P731... 92 2e-17
UniRef50_Q05S73 Putative uncharacterized protein n=1 Tax=Synecho... 91 5e-17
UniRef50_D0XIZ9 HfaB protein n=1 Tax=Brevundimonas subvibrioides... 86 1e-15
UniRef50_Q2JVB7 CsgG family protein n=4 Tax=Cyanobacteria RepID=... 84 3e-15
UniRef50_Q31MJ4 Putative uncharacterized protein n=2 Tax=Synecho... 78 3e-13
UniRef50_D0XJ00 Putative uncharacterized protein n=1 Tax=Brevund... 77 7e-13
UniRef50_O67219 Putative uncharacterized protein n=1 Tax=Aquifex... 76 2e-12
UniRef50_A6DA55 Putative uncharacterized protein n=1 Tax=Caminib... 75 2e-12
UniRef50_B0VF47 Putative curli production assembly/transport com... 73 1e-11
UniRef50_D1B8K8 Curli production assembly/transport component Cs... 72 2e-11
UniRef50_B8HPG7 CsgG family protein, putative n=1 Tax=Cyanothece... 72 2e-11
UniRef50_Q21E38 Curli production assembly/transport component Cs... 71 3e-11
UniRef50_B0CFQ9 CsgG family protein, putative n=1 Tax=Acaryochlo... 70 7e-11
UniRef50_A6DK47 Curli production assembly/transport component Cs... 69 2e-10
UniRef50_Q9RTM7 Putative uncharacterized protein n=1 Tax=Deinoco... 69 2e-10
UniRef50_D1Y6S9 Tetratricopeptide repeat domain protein n=1 Tax=... 67 8e-10
UniRef50_A9L616 Curli production assembly/transport component Cs... 66 2e-09
UniRef50_B0JU00 Curli production assembly/transport component n=... 66 2e-09
UniRef50_B8CKS1 Curli production assembly/transport component Cs... 66 2e-09
UniRef50_Q4KAD5 CsgG family protein n=20 Tax=Proteobacteria RepI... 65 3e-09
UniRef50_B8JAW6 Curli production assembly/transport component Cs... 64 6e-09
UniRef50_C3WB28 Curli production assembly/transport component Cs... 64 6e-09
UniRef50_Q7DDH4 Putative lipoprotein NMB1126/NMB1164 n=130 Tax=P... 64 7e-09
UniRef50_D1Y6S8 Putative curli production assembly/transport com... 63 1e-08
UniRef50_Q1IPN7 Curli production assembly/transport component Cs... 62 2e-08
UniRef50_C1XFI9 Uncharacterized protein involved in formation of... 61 4e-08
UniRef50_Q2RW21 Putative uncharacterized protein n=1 Tax=Rhodosp... 61 5e-08
UniRef50_C1CWI5 Putative Curli production assembly/transport com... 61 5e-08
UniRef50_C7RID3 Peptidoglycan-binding domain 1 protein n=1 Tax=C... 60 8e-08
UniRef50_C6W063 Uncharacterized protein involved in formation of... 60 9e-08
UniRef50_A3WBM1 Putative uncharacterized protein n=1 Tax=Erythro... 60 1e-07
UniRef50_C2MBZ3 Curli production assembly/transport component Cs... 59 1e-07
UniRef50_A1VWR2 Curli production assembly/transport component Cs... 59 2e-07
UniRef50_C9KNZ5 Putative uncharacterized protein n=1 Tax=Mitsuok... 59 2e-07
UniRef50_A3JNY2 Putative uncharacterized protein n=1 Tax=Rhodoba... 58 4e-07
UniRef50_C6JM48 Putative uncharacterized protein n=2 Tax=Fusobac... 55 3e-06
UniRef50_Q93HR4 Putative uncharacterized protein (Fragment) n=1 ... 54 4e-06
UniRef50_A5PA28 Curli production assembly/transport component Cs... 53 9e-06
UniRef50_B7A9M7 Putative uncharacterized protein n=1 Tax=Thermus... 53 1e-05
UniRef50_B9CYA0 Peptidoglycan-binding domain 1 protein n=2 Tax=C... 52 2e-05
UniRef50_B8GVR1 Putative uncharacterized protein n=2 Tax=Cauloba... 52 3e-05
UniRef50_C1A9F6 Putative uncharacterized protein n=1 Tax=Gemmati... 52 3e-05
UniRef50_Q7UMD1 Putative uncharacterized protein n=1 Tax=Rhodopi... 51 6e-05
UniRef50_A8ZXP3 Uncharacterized protein involved in formation of... 51 6e-05
UniRef50_B0SIM6 Hypothetical lipoprotein n=2 Tax=Leptospira bifl... 49 1e-04
UniRef50_Q30SV1 Putative uncharacterized protein n=1 Tax=Sulfuri... 49 2e-04
UniRef50_A2SLP6 Putative uncharacterized protein n=1 Tax=Methyli... 49 2e-04
UniRef50_C1AEH8 Putative uncharacterized protein n=1 Tax=Gemmati... 49 2e-04
UniRef50_C1QBW4 Putative uncharacterized protein n=1 Tax=Brachys... 49 2e-04
UniRef50_B4SIB8 Putative uncharacterized protein n=3 Tax=Bacteri... 49 2e-04
UniRef50_C1TRF7 Tetratricopeptide repeat protein n=1 Tax=Dethios... 48 3e-04
UniRef50_A8UTU4 Curli production assembly/transport component Cs... 48 3e-04
UniRef50_B6BP75 Peptidoglycan-binding domain 1 protein n=1 Tax=C... 48 4e-04
UniRef50_B7QTV8 Putative peptidoglycan binding domain protein n=... 47 7e-04
UniRef50_B7IGU6 Putative uncharacterized protein n=1 Tax=Thermos... 47 0.001
UniRef50_Q3B6Q5 Periplasmic protein n=2 Tax=Chlorobium/Pelodicty... 46 0.002
UniRef50_C0QSR9 Putative lipoprotein n=1 Tax=Persephonella marin...
45 0.004 UniRef50_A8ZSW9 Tetratricopeptide TPR_2 repeat protein n=1 Tax=D... 44 0.004 UniRef50_Q0VKU1 Putative uncharacterized protein n=2 Tax=Alcaniv... 44 0.004 UniRef50_A7BYD5 Putative uncharacterized protein n=1 Tax=Beggiat... 44 0.007 UniRef50_Q1LBB8 Putative uncharacterized protein n=1 Tax=Cupriav... 44 0.008 >UniRef50_A8AI45 Curli production assembly/transport component csgG n=90 Tax=Enterobacteriaceae RepID=CSGG_CITK8 Length = 277 Score = 408 bits (1048), Expect = e-112, Method: Composition-based stats. Identities = 271/277 (97%), Positives = 274/277 (98%) Query: 1 MQRLFLLVAVMLLSGCLTAPPKEAARPTLMPRAQSYKDLTHLPAPTGKIFVSVYNIQDET 60 MQRL +LVAV LLSGCLTAPPKEAA+PTLMPRAQSYKDLTHLP PTGKIFVSVYNIQDET Sbjct: 1 MQRLLILVAVCLLSGCLTAPPKEAAKPTLMPRAQSYKDLTHLPMPTGKIFVSVYNIQDET 60 Query: 61 GQFKPYPASNFSTAVPQSATAMLVTALKDSRWFIPLERQGLQNLLNERKIIRAAQENGTV 120 GQFKPYPASNFSTAVPQSATAMLVTALKDSRWFIPLERQGLQNLLNERKIIRAAQENGTV Sbjct: 61 GQFKPYPASNFSTAVPQSATAMLVTALKDSRWFIPLERQGLQNLLNERKIIRAAQENGTV 120 Query: 121 AINNRIPLQSLTAANIMVEGSIIGYESNVKSGGVGARYFGIGADTQYQLDQIAVNLRVVN 180 AINNRIPLQSLTAANIMVEGSIIGYESNVKSGGVGARYFGIGADTQYQLDQIAVNLRVVN Sbjct: 121 AINNRIPLQSLTAANIMVEGSIIGYESNVKSGGVGARYFGIGADTQYQLDQIAVNLRVVN 180 Query: 181 VSTGEILSSVNTSKTILSYEVQAGVFRFIDYQRLLEGEVGYTSNEPVMLCLMSAIETGVI 240 VSTGEILSSVNTSKTILSYEVQAGVFRFIDYQRLLEGE+GYTSNEPVMLCLMSAIETGVI Sbjct: 181 VSTGEILSSVNTSKTILSYEVQAGVFRFIDYQRLLEGEIGYTSNEPVMLCLMSAIETGVI 240 Query: 241 FLINDGIDRGLWDLQNKAERQNDILVKYRHMSVPPES 277 FLINDGIDRGLWDLQNKAERQNDILVKYRHMSVPPES Sbjct: 241 FLINDGIDRGLWDLQNKAERQNDILVKYRHMSVPPES 277 >UniRef50_A6EMN0 Putative assembly or transport protein for curli synthesis n=1 Tax=unidentified eubacterium SCB49 RepID=A6EMN0_9BACT Length = 458 Score = 327 bits (837), Expect = 4e-88, Method: Composition-based stats. 
Identities = 113/280 (40%), Positives = 164/280 (58%), Gaps = 3/280 (1%) Query: 1 MQRLFLLVAVMLLSGC--LTAPPKEAARPTLMPRAQSYKDLTHLPAPTGKIFVSVYNIQD 58 ++L LL + L+ C + P L++LP + K+ V VYN +D Sbjct: 4 FEKLVLLFVFVTLTSCGAMLNQPYNVQEARTGELTGKNNALSNLPKASDKVVVGVYNFRD 63 Query: 59 ETGQFKPYP-ASNFSTAVPQSATAMLVTALKDSRWFIPLERQGLQNLLNERKIIRAAQEN 117 +TGQFK S+FSTAV Q TA+L+ AL+DS WF P+ER+ L NLLNER II + + Sbjct: 64 QTGQFKLTDVGSSFSTAVSQGTTAILLKALEDSEWFRPIERENLNNLLNERSIIEKTRRD 123 Query: 118 GTVAINNRIPLQSLTAANIMVEGSIIGYESNVKSGGVGARYFGIGADTQYQLDQIAVNLR 177 T A L+ L A I++EG ++ Y++N+ +GG GARYFG G Y+ D+I V LR Sbjct: 124 YTPAGQQPQKLKPLLFAGILLEGGVVSYDTNILTGGAGARYFGAGGSVSYRQDRITVYLR 183 Query: 178 VVNVSTGEILSSVNTSKTILSYEVQAGVFRFIDYQRLLEGEVGYTSNEPVMLCLMSAIET 237 ++ STGE+L +V SKTILS AG+FRF+ ++RLLE E+G+T NEP L + AIE Sbjct: 184 AISTSTGEVLKTVYVSKTILSQGADAGIFRFVKFERLLEAEMGFTKNEPAELAVKEAIEK 243 Query: 238 GVIFLINDGIDRGLWDLQNKAERQNDILVKYRHMSVPPES 277 V+ L+ +GI LW+++ + + +L Y E+ Sbjct: 244 AVVDLVYEGIKDNLWNMEGEEDDVIGVLETYEREKAAEEA 283 >UniRef50_C6X5S1 Curli production assembly/transport component CsgG n=3 Tax=Bacteroidetes RepID=C6X5S1_FLAB3 Length = 456 Score = 321 bits (823), Expect = 2e-86, Method: Composition-based stats. 
Identities = 121/267 (45%), Positives = 169/267 (63%), Gaps = 5/267 (1%) Query: 1 MQRLFLLVAVMLLSGC-LTAPPKEAARPTLMPRAQSYKDLTHLPAPTGKIFVSVYNIQDE 59 + ++ LL ++ LS C L P + + TL ++ +LPAP KI V VY +D+ Sbjct: 6 LTKIALLTPLIFLSSCTLFNLPTNSEKSTLGEVTPYTPEIKNLPAPKEKIVVGVYKFRDQ 65 Query: 60 TGQFKPYP-ASNFSTAVPQSATAMLVTALKDSRWFIPLERQGLQNLLNERKIIRAAQ-EN 117 TGQ+K +++STA+PQ T +L+ AL+DSRWF P+ER+ + NLLNER+IIR+ + E Sbjct: 66 TGQYKAAENGASWSTAIPQGTTTILLKALEDSRWFTPIERENIGNLLNERQIIRSTRKEY 125 Query: 118 GTVAINNRIPLQSLTAANIMVEGSIIGYESNVKSGGVGARYFGIGADTQYQLDQIAVNLR 177 N L L A I++EG +I Y++NV +GG+GARYFG+GA QY+ D+I V LR Sbjct: 126 AGNDANEAALLPPLLFAGIILEGGVISYDTNVMTGGIGARYFGLGAGAQYRQDRITVYLR 185 Query: 178 VVNVSTGEILSSVNTSKTILSYEVQAGVFRFIDYQRLLEGEVGYTSNEPVMLCLMSAIET 237 V+ S+GEIL +V TSKTILS + FRFID +RLLE ++G T NEPV L + AIE Sbjct: 186 AVSTSSGEILKTVYTSKTILSTSINGNFFRFIDTERLLESDIGITQNEPVHLAVTEAIEK 245 Query: 238 GVIFLINDGIDRGLWDLQNKAERQNDI 264 V+ LI +G+ LW NK + ND Sbjct: 246 AVLSLIVEGVRDNLW--TNKQKTPNDF 270 >UniRef50_Q3KES0 Assembly/transport component in curli production n=8 Tax=Gammaproteobacteria RepID=Q3KES0_PSEPF Length = 286 Score = 321 bits (822), Expect = 2e-86, Method: Composition-based stats. 
Identities = 134/285 (47%), Positives = 193/285 (67%), Gaps = 8/285 (2%) Query: 1 MQRLFLL-VAVMLLSGCLTAPP----KEAARPTLMPRAQSYKDLTHLPAPTGKIFVSVYN 55 M+++ L + + L GC P ++ PTL PRA +Y DL +P P G++ VY Sbjct: 1 MKKIIALGLMLAALQGCSLREPMPAEQDTDTPTLTPRASTYYDLLKMPRPKGRLMAVVYG 60 Query: 56 IQDETGQFKPYPASNFSTAVPQSATAMLVTALKDSRWFIPLERQGLQNLLNERKIIRAAQ 115 +D+TGQ+KP PAS+FST+V Q A +ML+ A++ S WF+ LER+GLQNLL ERKIIRA+Q Sbjct: 61 FRDQTGQYKPTPASSFSTSVTQGAASMLMDAMQASGWFVVLEREGLQNLLTERKIIRASQ 120 Query: 116 ENGTVAINNRIPLQSLTAANIMVEGSIIGYESNVKSGGVGARYFGIGADTQYQLDQIAVN 175 + +N + L L AAN+M+EG II Y++NV+SGG GARY GI +Y++DQ+ VN Sbjct: 121 KKPNTPVNIQGELPPLQAANMMLEGGIIAYDTNVRSGGEGARYLGIDLSREYRVDQVTVN 180 Query: 176 LRVVNVSTGEILSSVNTSKTILSYEVQAGVFRFIDYQRLLEGEVGYTSNEPVMLCLMSAI 235 LR V+V +G++L++V TSKTI S AG+F+FI++++LLE EVGYT+NEP LC++SAI Sbjct: 181 LRAVDVRSGQVLANVMTSKTIYSVARSAGIFKFIEFKKLLEAEVGYTTNEPAQLCVLSAI 240 Query: 236 ETGVIFLINDGIDRGLWDLQNKAE--RQNDILVKY-RHMSVPPES 277 E V ++ GI+R LW + A Q+D+L +Y V P++ Sbjct: 241 EAAVGHMVAQGIERRLWQVAGDASTPSQDDVLNRYLTQNKVDPDA 285 >UniRef50_A5FHE8 Curli production assembly/transport component CsgG n=3 Tax=Flavobacteria RepID=A5FHE8_FLAJ1 Length = 454 Score = 320 bits (821), Expect = 2e-86, Method: Composition-based stats. 
Identities = 108/267 (40%), Positives = 164/267 (61%), Gaps = 5/267 (1%) Query: 5 FLLVAVMLLSGC--LTAPPKEAARPTLMPRAQSYKDLTHLPAPTGKIFVSVYNIQDETGQ 62 ++ L +GC P + L + L LP P ++ V VY +D+TGQ Sbjct: 7 LFILIAFLFAGCGAYYNQPTGVQKAILGESTPATSLLKDLPKPKEQVVVGVYKFRDQTGQ 66 Query: 63 FKPYP-ASNFSTAVPQSATAMLVTALKDSRWFIPLERQGLQNLLNERKIIRAAQENGTVA 121 +KP S+FSTAV Q AT++L+ AL+DS+WFIP+ER+ + NLL ER +IRA ++ Sbjct: 67 YKPQENGSSFSTAVTQGATSILIKALEDSKWFIPIERENIGNLLQERNLIRATRQEYVKN 126 Query: 122 IN-NRIPLQSLTAANIMVEGSIIGYESNVKSGGVGARYFGIGADTQYQLDQIAVNLRVVN 180 N N L L A +++EG I+ Y+SN+ +GG GARYFG GA +Y+ D++ + LR+++ Sbjct: 127 ANPNEPQLTPLLYAGVLLEGGIVSYDSNIITGGFGARYFGAGASVKYRQDRVTIYLRMIS 186 Query: 181 VSTGEILSSVNTSKTILSYEVQAGVFRFIDYQRLLEGEVGYTSNEPVMLCLMSAIETGVI 240 S G+IL SV SKTILS + +FR+++++RLLE E GYT+NEPV + + AIE V Sbjct: 187 TSNGKILKSVYISKTILSQAIDESLFRYVNFKRLLEVETGYTTNEPVHMAVTEAIEKAVE 246 Query: 241 FLINDGIDRGLWDLQNKAERQNDILVK 267 L+ +G+ +W+ + + Q D L+K Sbjct: 247 SLVLEGLQDNIWE-ADAPKWQVDNLIK 272 >UniRef50_Q084E4 Curli production assembly/transport component CsgG n=10 Tax=Gammaproteobacteria RepID=Q084E4_SHEFN Length = 283 Score = 318 bits (814), Expect = 2e-85, Method: Composition-based stats. 
Identities = 137/272 (50%), Positives = 187/272 (68%), Gaps = 10/272 (3%) Query: 1 MQRLFLLVAVMLLSGCLTAPPKEA---ARPTLMPRAQSYKDLTHLPAPTGKIFVSVYNIQ 57 M+RL L + ++ LS C + + A +LMP+ ++Y DL LP+P G + +VY+ + Sbjct: 1 MKRLVLSLFILSLSACSSIESEFDGIEATTSLMPKGETYYDLVSLPSPQGSMVAAVYDFR 60 Query: 58 DETGQFKPYPASNFSTAVPQSATAMLVTALKDSRWFIPLERQGLQNLLNERKIIRAAQEN 117 D+TGQ+KP P+SNFSTAVPQS TA L AL DS WF+P+ER+GLQNLL ERKI+RA Sbjct: 61 DQTGQYKPIPSSNFSTAVPQSGTAFLAQALNDSAWFVPVEREGLQNLLTERKIVRAGL-- 118 Query: 118 GTVAINNRIPLQSLTAANIMVEGSIIGYESNVKSGGVGARYFGIGADTQYQLDQIAVNLR 177 + L L +A I++EG I+ Y++N+K+GG GARY GIG QY++D I VNLR Sbjct: 119 ----NGDASKLPQLNSAQILMEGGIVAYDTNIKTGGAGARYLGIGVSGQYRVDSITVNLR 174 Query: 178 VVNVSTGEILSSVNTSKTILSYEVQAGVFRFIDYQRLLEGEVGYTSNEPVMLCLMSAIET 237 V++ TG +LSSV T+K ++S E+ AGVF+FID Q LLE EVGYTSNEPV LC+ +AIE+ Sbjct: 175 AVDIRTGRLLSSVTTTKAVISKEITAGVFKFIDAQELLESEVGYTSNEPVSLCIAAAIES 234 Query: 238 GVIFLINDGIDRGLWDLQNKAER-QNDILVKY 268 V+ +I DGI + W+L + A +N L KY Sbjct: 235 AVVHMIADGIWKRAWNLLDAASGVKNPTLQKY 266 >UniRef50_Q392B7 Curli production assembly/transport component CsgG n=2 Tax=Proteobacteria RepID=Q392B7_BURS3 Length = 312 Score = 317 bits (812), Expect = 2e-85, Method: Composition-based stats. 
Identities = 114/272 (41%), Positives = 172/272 (63%), Gaps = 2/272 (0%) Query: 1 MQRLFLLVAVML-LSGCLTAPPKEAARPTLMPRAQSYKDLTHLPAPTGKIFVSVYNIQDE 59 M R+ + ++L L GC+T P + TL P ++ +DLTHLP P GKI +VY +D Sbjct: 14 MTRIAMGAVLLLSLVGCVTRPMPALSNATLTPPTRTTRDLTHLPPPKGKIVAAVYGFRDL 73 Query: 60 TGQFKPYPASNFSTAVPQSATAMLVTALKDSRWFIPLERQGLQNLLNERKIIRAAQENGT 119 TGQ+K P S+FS+ V Q + LV A++DS WF P+ER+ LQ+LL ERKI+RA + Sbjct: 74 TGQYKASPDSSFSSQVTQGGASFLVKAMRDSGWFTPVERENLQDLLTERKIMRATDGSDA 133 Query: 120 VAINNRIPLQSLTAANIMVEGSIIGYESNVKSGGVGARYFGIGADTQYQLDQIAVNLRVV 179 N L ANI++EG I+GY++NV++GG G Y GI TQY++DQ+ VNLR + Sbjct: 134 KKAQNDAMA-PLMPANIVLEGGIVGYDTNVRTGGAGVAYLGISGSTQYRIDQVTVNLRAI 192 Query: 180 NVSTGEILSSVNTSKTILSYEVQAGVFRFIDYQRLLEGEVGYTSNEPVMLCLMSAIETGV 239 ++ TG++L+SV+T+KT+ SY+V G++RF+ ++ LL+ E G T NEP +C+ AIE+ + Sbjct: 193 DIRTGQVLNSVSTTKTVYSYQVDTGIYRFVGFKDLLQAEAGLTRNEPAQICVNEAIESAL 252 Query: 240 IFLINDGIDRGLWDLQNKAERQNDILVKYRHM 271 LI G+ W L+N + + + +Y Sbjct: 253 THLIVQGVANQTWVLKNDQDWYDPTMQRYLQE 284 >UniRef50_Q0TJ37 Curli production assembly/transport component CsgG n=1 Tax=Escherichia coli 536 RepID=Q0TJ37_ECOL5 Length = 228 Score = 308 bits (789), Expect = 1e-82, Method: Composition-based stats. 
Identities = 206/222 (92%), Positives = 208/222 (93%) Query: 56 IQDETGQFKPYPASNFSTAVPQSATAMLVTALKDSRWFIPLERQGLQNLLNERKIIRAAQ 115 + + G P AVPQSATAMLVTALKDSRWFIPLERQGLQNLLNERKIIRAAQ Sbjct: 7 FRTKPGNLNPTRQVTSPLAVPQSATAMLVTALKDSRWFIPLERQGLQNLLNERKIIRAAQ 66 Query: 116 ENGTVAINNRIPLQSLTAANIMVEGSIIGYESNVKSGGVGARYFGIGADTQYQLDQIAVN 175 ENGTVAINNRIPLQSLTAANIMVEGSIIGYESNVKSGGVGARYFGIGADTQYQLDQIAVN Sbjct: 67 ENGTVAINNRIPLQSLTAANIMVEGSIIGYESNVKSGGVGARYFGIGADTQYQLDQIAVN 126 Query: 176 LRVVNVSTGEILSSVNTSKTILSYEVQAGVFRFIDYQRLLEGEVGYTSNEPVMLCLMSAI 235 LRVVNVSTGEILSSVNTSKTILSYEVQAGVFRFIDYQRLLEGEVGYTSNEPVMLCLMSAI Sbjct: 127 LRVVNVSTGEILSSVNTSKTILSYEVQAGVFRFIDYQRLLEGEVGYTSNEPVMLCLMSAI 186 Query: 236 ETGVIFLINDGIDRGLWDLQNKAERQNDILVKYRHMSVPPES 277 ETGVIFLINDGIDRGLWDLQNKAERQNDILVKYRHMSVPPES Sbjct: 187 ETGVIFLINDGIDRGLWDLQNKAERQNDILVKYRHMSVPPES 228 >UniRef50_D2QPP9 Curli production assembly/transport component CsgG n=2 Tax=Spirosoma linguale DSM 74 RepID=D2QPP9_9SPHI Length = 478 Score = 307 bits (787), Expect = 2e-82, Method: Composition-based stats. 
Identities = 109/278 (39%), Positives = 160/278 (57%), Gaps = 14/278 (5%) Query: 3 RLFLLVAVMLLSGCLT--APPKEAARPTLMPRAQSYKDLTHLPAPTGKIFVSVYNIQDET 60 R+ + +LSGC P R L + L +LP K+ V+VY +D+T Sbjct: 12 RIIPFSCLWVLSGCAAYLHQPTGLQRARLGEETTTTAALRNLPKAKEKVVVAVYKFRDQT 71 Query: 61 GQFK-PYPASNFSTAVPQSATAMLVTALKDSRWFIPLERQGLQNLLNERKIIRAAQENGT 119 GQ+K S FST V Q T +L+ AL++S WF +ER+ + NLLNERKIIR++ Sbjct: 72 GQYKLSETGSTFSTVVSQGTTNILLKALEESGWFTTIERENVSNLLNERKIIRSSVAQYK 131 Query: 120 VAINNRIPLQSLTAANIMVEGSIIGYESNVKSGGVGARYFGIGADTQYQLDQIAVNLRVV 179 N L L A +++EG I+ Y++N+ +GG G RYF G TQY+ D++ V LR V Sbjct: 132 EGEN----LPPLLFAGVILEGGIVSYDANIITGGAGLRYFATGGSTQYRQDRVTVYLRAV 187 Query: 180 NVSTGEILSSVNTSKTILSYEVQAGVFRFIDYQRLLEGEVGYTSNEPVMLCLMSAIETGV 239 +G+IL +V TSKTILS V AG+FR++ +++LLE E G+++NEP + + AIE V Sbjct: 188 ATRSGKILKTVYTSKTILSQSVDAGIFRYVTFKKLLEAETGFSTNEPSQMAVTEAIEKAV 247 Query: 240 IFLINDGIDRGLWDLQNK----AERQNDILVKYRHMSV 273 L+ +GI GLW + +K A+R+ D +Y V Sbjct: 248 QALVLEGIQDGLWAVSDKDTGVAKRELD---RYDAEKV 282 >UniRef50_Q1NBA5 Putative curli production assembly/transport component csgg n=3 Tax=Sphingomonadales RepID=Q1NBA5_9SPHN Length = 336 Score = 303 bits (776), Expect = 4e-81, Method: Composition-based stats. 
Identities = 110/286 (38%), Positives = 166/286 (58%), Gaps = 15/286 (5%) Query: 1 MQRLFLLV-AVMLLSGCLT-APPKEAARPTLMP------RAQSYKDLTHLPAPTGKIFVS 52 ++++FL A +LL GC + A P++ P Q+ + L LP P + ++ Sbjct: 2 IRQIFLASGAALLLGGCNSLATTGRDDIPSMNPMAVYPRYTQAQRQLMDLPPPQRPVAIA 61 Query: 53 VYNIQDETGQFKPYPA--SNFSTAVPQSATAMLVTALKDS---RWFIPLERQGLQNLLNE 107 VYN D+TGQ++ S AV Q A ++LV AL+D+ +WF +ER+ L+NLLNE Sbjct: 62 VYNFSDQTGQYRVGEGGTQTLSRAVTQGAASILVRALQDAGNRKWFTIVEREQLRNLLNE 121 Query: 108 RKIIRAAQENGTVAIN-NRIPLQSLTAANIMVEGSIIGYESNVKSGGVGARYFGIGADTQ 166 R+IIR +E N N L ++ A +++EG II +++N +GG GA + GIGA TQ Sbjct: 122 RQIIREMRERYLGENNVNPQALPAMLFAGVLLEGGIISFDTNTVTGGAGASFLGIGASTQ 181 Query: 167 YQLDQIAVNLRVVNVSTGEILSSVNTSKTILSYEVQAGVFRFIDYQRLLEGEVGYTSNEP 226 Y+ D + V LR ++V TGE+L++V SKTI S + A FRF+ ++ LLE EVG T+NEP Sbjct: 182 YRQDTVTVYLRAISVRTGEVLTTVTASKTIASQSLGASAFRFVGFKELLEAEVGMTTNEP 241 Query: 227 VMLCLMSAIETGVIFLINDGIDRGLWDLQNKAERQNDILVKYRHMS 272 + L AIE V L+ +GID LW+ + + +L +YR Sbjct: 242 DHIALQQAIEKAVYGLVMEGIDLNLWNFAD-TQAGWPMLWRYRQER 286 >UniRef50_A3XJL0 Putative assembly or transport protein for curli synthesis n=2 Tax=Bacteria RepID=A3XJL0_9FLAO Length = 470 Score = 299 bits (765), Expect = 8e-80, Method: Composition-based stats. 
Identities = 112/265 (42%), Positives = 150/265 (56%), Gaps = 2/265 (0%) Query: 15 GCLTAPPKEAARPTLMPRAQSYKDLTHLPAPTGKIFVSVYNIQDETGQFKPYP-ASNFST 73 G P L LT P P + VYN +D+TGQ+K S FST Sbjct: 20 GSYFNQPLSQQDARLGEVTSHTTTLTQFPEPAEPVVAGVYNFKDQTGQYKNVENGSTFST 79 Query: 74 AVPQSATAMLVTALKDSRWFIPLERQGLQNLLNERKIIRAAQENGTVAINNRIPLQSLT- 132 AV Q AT ML+ AL+DS+WF P+ER+ L NLLNER IIR+ ++ N P Sbjct: 80 AVSQGATTMLIKALEDSKWFTPIERENLGNLLNERNIIRSTRDEYRKNNNPNEPNLPPLL 139 Query: 133 AANIMVEGSIIGYESNVKSGGVGARYFGIGADTQYQLDQIAVNLRVVNVSTGEILSSVNT 192 A +++EG II Y+SN+ +GG+GARYFG+G TQY+ D++ V LR V+ S+G +L +V Sbjct: 140 YAGVLLEGGIISYDSNIITGGLGARYFGVGGSTQYRQDRLTVYLRAVSTSSGRVLKTVYV 199 Query: 193 SKTILSYEVQAGVFRFIDYQRLLEGEVGYTSNEPVMLCLMSAIETGVIFLINDGIDRGLW 252 SKTILS + A +FR++++QRLLE E GYT NEPV L + AIE V LI +GI LW Sbjct: 200 SKTILSQAIDASLFRYVNFQRLLEVETGYTKNEPVQLAMKDAIEKAVESLIIEGIKDNLW 259 Query: 253 DLQNKAERQNDILVKYRHMSVPPES 277 + ++ Y ES Sbjct: 260 SSKEGVTVNEALIENYEKEKELEES 284 >UniRef50_B6QZQ0 Curli fiber membrane-associated lipoprotein CsgG n=1 Tax=Pseudovibrio sp. JE062 RepID=B6QZQ0_9RHOB Length = 316 Score = 285 bits (730), Expect = 1e-75, Method: Composition-based stats. 
Identities = 96/281 (34%), Positives = 152/281 (54%), Gaps = 11/281 (3%) Query: 1 MQRLFLLVAVMLLSGC--LTAPPKEAARPTLMPRAQSYKDLTHLPAPTGKIFVSVYNIQD 58 M++ ++ + LS C +T P L + L LP P ++ +VY+ QD Sbjct: 2 MKKSIVVAMALSLSACGPVTRMAFSPG-PQLATVSVQASKLKKLPPPKEPVYAAVYSYQD 60 Query: 59 ETGQFKPYP-ASNFSTAVPQSATAMLVTALKDS---RWFIPLERQGLQNLLNERKIIRAA 114 TGQ+KP S +V Q A MLV AL+D+ +WF +ER L LL ER+I+ Sbjct: 61 LTGQYKPSDKVQTLSRSVTQGADTMLVRALQDAGDRKWFRVVERGNLDALLKERQIVTQI 120 Query: 115 QE--NGTVAINNRIPLQSLTAANIMVEGSIIGYESNVKSGGVGARYFGIGADTQYQLDQI 172 ++ G ++ ++ L L A ++ EG ++GY+SN ++GG GARY GIG + Y+ D + Sbjct: 121 RKIYLGEDKVDPKV-LPPLLYAGVLFEGGVVGYDSNTRTGGAGARYLGIGGNADYRQDDV 179 Query: 173 AVNLRVVNVSTGEILSSVNTSKTILSYEVQAGVFRFIDYQRLLEGEVGYTSNEPVMLCLM 232 V+LR V+ TGE++++V K+++S ++ G FR+I +LE E G T NEPV L + Sbjct: 180 TVSLRAVSTRTGEVMANVMVQKSVVSVGLKGGAFRYIALDEILEAEAGITKNEPVTLAVQ 239 Query: 233 SAIETGVIFLINDGIDRGLWDLQNKAERQNDILVKYRHMSV 273 AIE V ++ +G G W +N A+ N + +Y Sbjct: 240 QAIEKAVYAIVMEGARVGAWSFENPAQA-NALYQEYTSEKA 279 >UniRef50_A9L3G5 Curli production assembly/transport component CsgG n=20 Tax=Alteromonadales RepID=A9L3G5_SHEB9 Length = 268 Score = 280 bits (717), Expect = 3e-74, Method: Composition-based stats. 
Identities = 112/257 (43%), Positives = 165/257 (64%), Gaps = 10/257 (3%) Query: 1 MQRL-FLLVAVMLLSGCLTAP-PKEAARPT-LMPRAQSYKDLTHLPAPTGKIFVSVYNIQ 57 M RL F + ++ +S C P P P + P ++ + L P P I V+VY+ + Sbjct: 1 MARLVFWGLLLLSMSACSLIPKPDLNITPAEVNPLSEVMRGLQTQPGPKFPIPVAVYSFR 60 Query: 58 DETGQFKPYP-ASNFSTAVPQSATAMLVTALKDSRWFIPLERQGLQNLLNERKIIRAAQE 116 D+TGQ+KP S+FSTAV Q AT+ML+ L DS+WF P+ER+GLQNLL ERKI + ++ Sbjct: 61 DQTGQYKPQANVSSFSTAVTQGATSMLMQTLLDSKWFTPVEREGLQNLLTERKI--SNKQ 118 Query: 117 NGTVAINNRIPLQSLTAANIMVEGSIIGYESNVKSGGVGARYFGIGADTQYQLDQIAVNL 176 +GT + + L+ A +++EG +I YE+N +GG G Y+GIGA Y+ DQ+ + L Sbjct: 119 SGTKGDD----VPVLSTARLLLEGGVISYETNTSTGGSGVEYYGIGASEMYREDQVTIYL 174 Query: 177 RVVNVSTGEILSSVNTSKTILSYEVQAGVFRFIDYQRLLEGEVGYTSNEPVMLCLMSAIE 236 R V+V TG+++ SV+TSK +LS E++AG+FR+ RL E E+G+T+NEPV C++ AIE Sbjct: 175 RAVDVHTGKVMMSVSTSKRVLSQEMRAGLFRYTSLNRLAEAEIGFTTNEPVQFCVLQAIE 234 Query: 237 TGVIFLINDGIDRGLWD 253 V LI+ GI +G W Sbjct: 235 LAVAELIDKGIKQGYWS 251 >UniRef50_D2LDX9 Curli production assembly/transport component CsgG n=1 Tax=Rhodomicrobium vannielii ATCC 17100 RepID=D2LDX9_RHOVA Length = 397 Score = 271 bits (693), Expect = 2e-71, Method: Composition-based stats. 
Identities = 111/261 (42%), Positives = 150/261 (57%), Gaps = 9/261 (3%) Query: 17 LTAPPKEAARPTLMPRAQSYKDLTHLPAPTGKIFVSVYNIQDETGQFKPYP----ASNFS 72 + AP PT+ P + L LP P + V+VY D+TGQFKP + S Sbjct: 34 VAAPNTGLLPPTVNPINSTNSKLRELPPPKAPVAVAVYGYGDQTGQFKPVAEGANVQSLS 93 Query: 73 TAVPQSATAMLVTALKDS---RWFIPLERQGLQNLLNERKIIRAAQENG-TVAINNRIPL 128 AV Q AT++L+ AL+D+ RWF +ER+ L NLL ER+II ++ + + L Sbjct: 94 RAVTQGATSILMKALQDAGNGRWFTVVERERLDNLLKERRIIADMRQRYLGEQVVDPAAL 153 Query: 129 QSLTAANIMVEGSIIGYESNVKSGGVGARYFGIGADTQYQLDQIAVNLRVVNVSTGEILS 188 L A ++++G IIGY+SN K+GG GA+YFGIG D +Y D + V LR +V TG++L Sbjct: 154 PPLLFAGVLIDGGIIGYDSNTKTGGAGAKYFGIGGDVKYSEDTVTVYLRATSVKTGQVLL 213 Query: 189 SVNTSKTILSYEVQAGVFRFIDYQRLLEGEVGYTSNEPVMLCLMSAIETGVIFLINDGID 248 SV +KTILSY VQ FRF+ Y RL E E G T NEP L + AIE V+ I +G Sbjct: 214 SVVANKTILSYGVQGSAFRFVTYNRLFEAEGGLTMNEPGSLAVEQAIEKAVLTFIVEGSA 273 Query: 249 RGLWDLQNKAERQNDILVKYR 269 RGLW Q+K Q+ I+ Y Sbjct: 274 RGLWSFQDKT-FQSRIIQDYE 293 >UniRef50_Q1YMP1 Putative Curli production assembly/transport component n=1 Tax=Aurantimonas manganoxydans SI85-9A1 RepID=Q1YMP1_MOBAS Length = 400 Score = 267 bits (681), Expect = 4e-70, Method: Composition-based stats. 
Identities = 103/264 (39%), Positives = 145/264 (54%), Gaps = 5/264 (1%) Query: 2 QRLFLLVAVMLLSGCLTAPPKEAARPTLMPRAQSYKDLTHLPAPTGKIFVSVYNIQDETG 61 + ++VA+ LSGC+T P + P ++ DL +P P K V+VY +D TG Sbjct: 12 RLAVMIVALASLSGCVTQEAFTDTPPVIAPVSRPNDDLRRVPPPRQKTVVAVYGYEDLTG 71 Query: 62 QFKPYP-ASNFSTAVPQSATAMLVTALKDS---RWFIPLERQGLQNLLNERKIIRAAQEN 117 QFK + S AV Q +ML+ AL+D+ RWF LER L NLL ER+II + Sbjct: 72 QFKERENVQSLSRAVTQGGASMLIQALQDAGERRWFTVLERAELDNLLKERQIITEMRRL 131 Query: 118 GTVAINNRIPL-QSLTAANIMVEGSIIGYESNVKSGGVGARYFGIGADTQYQLDQIAVNL 176 + L ANI++EG IIGY++N+ +GGVGA + GI ADT+Y D + V L Sbjct: 132 YRNETQLDPKVVPPLLHANIIIEGGIIGYDTNIMTGGVGAGFLGISADTKYIHDVVTVTL 191 Query: 177 RVVNVSTGEILSSVNTSKTILSYEVQAGVFRFIDYQRLLEGEVGYTSNEPVMLCLMSAIE 236 R V+ TGE+L++V K + SY +Q G FR++ L E G T NEP + + SAIE Sbjct: 192 RAVSTKTGEVLTTVTVRKAVASYALQGGAFRYVKIDELFMAEAGVTYNEPKQIAVQSAIE 251 Query: 237 TGVIFLINDGIDRGLWDLQNKAER 260 V LI +G D +W+ + A Sbjct: 252 KAVEGLIVEGADLSIWEFSDPAAG 275 >UniRef50_B3QD06 Curli production assembly/transport component CsgG n=4 Tax=Alphaproteobacteria RepID=B3QD06_RHOPT Length = 291 Score = 261 bits (667), Expect = 2e-68, Method: Composition-based stats. 
Identities = 95/268 (35%), Positives = 144/268 (53%), Gaps = 12/268 (4%) Query: 3 RLFLLVAVML---LSGCL-TAPPKEAARP--TLMPRAQSYKDLTHLPAPTGKIFVSVYNI 56 R +L V+L L GC T K+ P T++ ++ L LP PT K+ V+VYN Sbjct: 4 RAGVLACVLLAVSLGGCAITGTDKDPVTPPATMVASTKTGVVLEQLPPPTKKLDVAVYNF 63 Query: 57 QDETGQFKPYPA-SNFSTAVPQSATAMLVTALKDSR---WFIPLERQGLQNLLNERKIIR 112 D TGQ K + FS AV Q +A+L L + WF ER LQ LL ER+II+ Sbjct: 64 PDLTGQNKSNDNFAEFSRAVTQGGSAILTDVLLTAGGGHWFDVAERADLQPLLQERQIIQ 123 Query: 113 AAQENGTVAINNRIPLQSLTAANIMVEGSIIGYESNVKSGGVGARYFGIGADTQYQLDQI 172 + + L L A +++EG I+GY++N +GG+GA Y G+G + QY+ D + Sbjct: 124 NTRSA--LQGEKAQSLPPLRFAGVLLEGGIVGYDTNETTGGIGANYLGLGGNMQYRQDIV 181 Query: 173 AVNLRVVNVSTGEILSSVNTSKTILSYEVQAGVFRFIDYQRLLEGEVGYTSNEPVMLCLM 232 V+LR V+V TG +L++V T+K I S V F+++ LL+ + G+T N P L + Sbjct: 182 TVSLRAVSVQTGRVLAAVTTTKIIYSVNVSGSGFKYVAIDSLLQADAGFTKNSPTTLAVR 241 Query: 233 SAIETGVIFLINDGIDRGLWDLQNKAER 260 I+ V LI +G+ + LW+ ++ A Sbjct: 242 EGIQLAVYSLIFEGVKKELWNFKDPAAG 269 >UniRef50_B3QHQ8 Curli production assembly/transport component CsgG n=6 Tax=Rhizobiales RepID=B3QHQ8_RHOPT Length = 313 Score = 261 bits (666), Expect = 2e-68, Method: Composition-based stats. 
Identities = 82/225 (36%), Positives = 130/225 (57%), Gaps = 6/225 (2%) Query: 38 DLTHLPAPTGKIFVSVYNIQDETGQFKPYP-ASNFSTAVPQSATAMLVTALKDSR---WF 93 +L LP P KI +++Y D TG+ +P + FS AV Q + + ALK + WF Sbjct: 46 NLETLPPPKQKIDIAIYQFPDLTGKNEPNDNVAVFSRAVTQGGAGLAIDALKRAGGGAWF 105 Query: 94 IPLERQGLQNLLNERKIIRAAQENGTVAINNRIPLQSLTAANIMVEGSIIGYESNVKSGG 153 +ER GL +LL ER+++RA ++ + PL + A +++EG I+ +++N +GG Sbjct: 106 RVVERNGLNDLLQERQLVRATRQ--EFDRDRAKPLPPMRFAGLLIEGGIVAFDANYMTGG 163 Query: 154 VGARYFGIGADTQYQLDQIAVNLRVVNVSTGEILSSVNTSKTILSYEVQAGVFRFIDYQR 213 +GA Y GIGADT+++ D + V LRV +V TGE+L+SV T+KT+ S +Q ++++ + Sbjct: 164 IGANYLGIGADTKFRRDMVTVALRVASVQTGEVLTSVTTTKTVYSVSLQGNTYKYVALDK 223 Query: 214 LLEGEVGYTSNEPVMLCLMSAIETGVIFLINDGIDRGLWDLQNKA 258 LL+ E G T EP L + AI+ V I +G LW + A Sbjct: 224 LLQIEAGITRTEPTQLAVRQAIDLAVYSTIMEGARDKLWRFADPA 268 >UniRef50_Q1QYN5 Curli production assembly/transport component CsgG n=1 Tax=Chromohalobacter salexigens DSM 3043 RepID=Q1QYN5_CHRSD Length = 271 Score = 204 bits (518), Expect = 3e-51, Method: Composition-based stats. 
Identities = 70/250 (28%), Positives = 117/250 (46%), Gaps = 18/250 (7%) Query: 14 SGCL---TAP--PKEAAR--PTLMPRAQSYKDLTHLPAPTGKIFVSVYNIQDETGQFKPY 66 GC TAP P+E +R P + L+ P + VSV I D TGQ Y Sbjct: 20 GGCASIGTAPVAPREGSRVMTNHTPYTRCLSALSQQPGENLPV-VSVGQILDRTGQVS-Y 77 Query: 67 PASNFSTAVPQSATAMLVTALKDSRWFIPLERQGLQNLLNERKIIRA-AQENGTVAINNR 125 S + Q + ML++AL +R ER ++ L ER++ A A + A+N Sbjct: 78 STITESRVLTQGVSEMLISALYKTRKVRLAERLDIRIPLAERQLKDAGAMQRAPAALN-- 135 Query: 126 IPLQSLTAANIMVEGSIIGYESNVKSGGVGARYFGIGADTQYQLDQIAVNLRVVNVSTGE 185 + N ++ G++ N+ + G IGA + + + ++LRVV+ +T + Sbjct: 136 -----VQPVNFVILGALTELNYNILTQGARLYVGLIGASNREAVINVGLDLRVVDATTFQ 190 Query: 186 ILSSVNTSKTILSYEVQAGVFRFIDYQRLLEGEVGYTSNEPVMLCLMSAIETGVIFLIND 245 + + K I+ +V+AGV+RF D Q L+E + G NEP+ L + S +E ++ + Sbjct: 191 TVYVTSLQKQIVGNQVEAGVYRFFDNQ-LVEFDAGTVRNEPLQLGVRSVVEMAAYQILTE 249 Query: 246 GIDRGLWDLQ 255 G+ + D Q Sbjct: 250 GLGLPINDTQ 259 >UniRef50_A7HWH6 HfaB protein n=1 Tax=Parvibaculum lavamentivorans DS-1 RepID=A7HWH6_PARL1 Length = 342 Score = 200 bits (508), Expect = 4e-50, Method: Composition-based stats. 
Identities = 64/267 (23%), Positives = 111/267 (41%), Gaps = 33/267 (12%) Query: 2 QRLFLLVAVMLLSGCLTAPPKEAAR-----------PTLMPRAQSYKDLT-HLPAPTGKI 49 + L L L+GC++A R P + + + L H+P+ Sbjct: 17 RTLGALAMAFALTGCVSANAGSDGRYVAPIGNAPVITNETPYSSALRCLAGHVPSAANTT 76 Query: 50 FVSVYNIQDETGQFKPYPASNFSTAVPQSATAMLVTALKDSRWFIPLER-----QGLQNL 104 ++V NI+D TG+ A V Q A+ M ++AL + +ER L+ Sbjct: 77 RIAVGNIRDYTGK---AEADGTGMKVTQGASLMAMSALGKA-GVPLVERYDTSISELEMK 132 Query: 105 LNERKIIRAAQENGTVAINNRIPLQSLTAANIMVEGSIIGYESNVKSGGVGARYFGIGAD 164 K+I A + +I + ++ + G I N++SGG+ Y G Sbjct: 133 YTNNKLIGDAVD----GEYRKIYAGEIRGSDYYLVGGITELNFNIQSGGINGSYAEGGDM 188 Query: 165 TQY-------QLDQIAVNLRVVNVSTGEILSSVNTSKTILSYEVQAGVFRFIDYQRLLEG 217 + I ++LR+VN T E++ V+ K I+ E++AGVF F L Sbjct: 189 GAAANASAATYVMNIGLDLRLVNSRTLEVVDYVSYQKQIVGREIKAGVFDFFG-GNLFSL 247 Query: 218 EVGYTSNEPVMLCLMSAIETGVIFLIN 244 VG ++ EP+ L + + +E V+ L+ Sbjct: 248 GVGSSAQEPIQLAVRAVVERAVLKLLV 274 >UniRef50_C5SPX6 HfaB protein n=1 Tax=Asticcacaulis excentricus CB 48 RepID=C5SPX6_9CAUL Length = 316 Score = 194 bits (492), Expect = 4e-48, Method: Composition-based stats. 
Identities = 68/294 (23%), Positives = 121/294 (41%), Gaps = 39/294 (13%) Query: 1 MQRLFLLVAVMLLSGC--LTAPPKEA----------ARPTLMPRAQSYKDLTHLPAPTGK 48 M+RL + +LLS C +TA P+ A L P + L T K Sbjct: 1 MRRLVFCLP-LLLSACASVTADPQTGLYAKPVGNAPATANLTPYSADLTCLQQAALVTHK 59 Query: 49 I--FVSVYNIQDETGQFKPYPASNFSTAVPQSATAMLVTALKDSRWFIPLERQGLQNLLN 106 V+V I D TG+ Y +N + Q +AL SR +P +G +++ + Sbjct: 60 PLPRVAVSRIDDLTGKRDFYTGAN----ITQGIALFAQSAL--SRAGLPQVERGDRDISD 113 Query: 107 ERK-------IIRAAQENGTVAINNR-IPLQSLTAANIMVEGSIIGYESNVKSGGVGARY 158 + + G N R + + ++ + G + N++S GV R Sbjct: 114 YELKAAMDHVLSDTPDQAGNDPDNFRKVYAGQIAGSDYYISGGLTELNYNIRSDGVDLRA 173 Query: 159 FGIGAD-------TQYQLDQIAVNLRVVNVSTGEILSSVNTSKTILSYEVQAGVFRFIDY 211 G G+D ++ + IA++LR+++ + EI K I+ +EV+ G+F +D Sbjct: 174 GGTGSDDPAGSFVSRRFVMNIALDLRLIDTRSQEIKRVTAYQKQIVGHEVKPGLFTLLD- 232 Query: 212 QRLLEGEVGYTSNEPVMLCLMSAIETGVIFL--INDGIDRGLWDLQNKAERQND 263 +L+ G++ EP+ L + + +E V + G+D + A QN Sbjct: 233 GTMLDLSGGFSEMEPIQLGVRTLVERSVYDFAVVLYGMDPSVCRNGGVATTQNP 286 >UniRef50_C5D1Z0 Putative uncharacterized protein n=1 Tax=Variovorax paradoxus S110 RepID=C5D1Z0_VARPS Length = 467 Score = 180 bits (456), Expect = 5e-44, Method: Composition-based stats. 
Identities = 63/257 (24%), Positives = 110/257 (42%), Gaps = 32/257 (12%) Query: 15 GCLTAPPKE------------AARPTLMPRAQSYKDLTHLPAPTG--KIFVSVYNIQDET 60 GC+TAP + A R + P A T I ++V +++D T Sbjct: 31 GCVTAPQQRMAPNEAPIVLGPAVRENVTPMEVVLACFGDHVAATQRQPIVITVGDVKDYT 90 Query: 61 GQFKPYPASNFSTAVPQSATAMLVTAL-------KDSRWFIPL--ERQ---GLQNLLNER 108 G++ + N A+ Q M+ +AL + + F P+ ER+ + L + Sbjct: 91 GKY----SINEGNAITQGGALMVYSALGKLGGAVQAAERFDPVIAERELGYADRRQLGDG 146 Query: 109 KIIRAAQENGTVAINNRIPLQ-SLTAANIMVEGSIIGYESNVKSGGVGARYFGIGADTQY 167 + + A NG + S+ ++ + G I N+ SGG IG + Sbjct: 147 RTHQLAGPNGGQTVPWLPYFGGSINKSDYFIVGGITELNYNIHSGGGEIGVNQIGVKART 206 Query: 168 QLDQIAVNLRVVNVSTGEILSSVNTSKTILSYEVQAGVFRFIDYQRLLEGEVGYTSNEPV 227 + V+LR+V+ T I+ +++ +K YEV VFRF +L + +VG EPV Sbjct: 207 FSQSVGVDLRIVDTKTLMIVKTISLTKQFNGYEVGFNVFRFFG-SKLYDIDVGAKGQEPV 265 Query: 228 MLCLMSAIETGVIFLIN 244 + + +A+E GV+ L+ Sbjct: 266 QMGVRAALEEGVVRLVA 282 >UniRef50_B4W728 Putative uncharacterized protein n=1 Tax=Brevundimonas sp. BAL3 RepID=B4W728_9CAUL Length = 344 Score = 173 bits (438), Expect = 6e-42, Method: Composition-based stats. 
Identities = 64/270 (23%), Positives = 112/270 (41%), Gaps = 40/270 (14%) Query: 7 LVAVMLLSGCLTA----------PPKEAARPTL--MPRAQSYKDLTHLPAPTGK--IFVS 52 ++A + L+GC TA P A T + + + L G+ ++ Sbjct: 1 MIAAVGLAGCTTARYDPATGLYANPIGGAPATGNDTAYSAALRCLASAGQAEGRSAPRLA 60 Query: 53 VYNIQDETGQFKPYPASNFSTAVPQSATAMLVTAL---------KDSRWFIPLERQGLQN 103 V +I D TG+ + Q A+ VTAL + R +ERQ Q+ Sbjct: 61 VGDIADLTGRNDLETG----RKISQGASLFAVTALTKAGVPTVERQDRGVSEVERQYAQS 116 Query: 104 LLNERKIIRAAQENGTVAINNR-IPLQSLTAANIMVEGSIIGYESNVKSGGVGARYFGI- 161 L + Q G A N R I + + V G + N++S GV A GI Sbjct: 117 HL----LSDTPQAAGESAENFRPIYAGQIAGSRYYVVGGVTELNYNLRSSGVDASAGGIE 172 Query: 162 ------GADTQYQLDQIAVNLRVVNVSTGEILSSVNTSKTILSYEVQAGVFRFIDYQRLL 215 G + + IA++LR+V+ + E++++ + K ++ E++ GVF F + + Sbjct: 173 ASGVKGGLTSSGYVMNIAIDLRLVDTRSQEVVATASYQKQLVGREIRVGVFDFT-HGNIF 231 Query: 216 EGEVGYTSNEPVMLCLMSAIETGVIFLIND 245 + G + EP+ + +AIE G+ + D Sbjct: 232 DLSAGASGMEPIQFAVRTAIERGLYDFVAD 261 >UniRef50_Q0AL09 HfaB protein n=1 Tax=Maricaulis maris MCS10 RepID=Q0AL09_MARMM Length = 287 Score = 173 bits (438), Expect = 7e-42, Method: Composition-based stats. 
Identities = 54/253 (21%), Positives = 99/253 (39%), Gaps = 26/253 (10%) Query: 8 VAVMLLSGCLT----------APPKEAARPTLMPRAQSYKDLTHLPAPTGKIFVSVYNIQ 57 + + ++GC + + L A G+ ++V + Sbjct: 14 LTLCAMTGCASLSGDFHDHMAGYTGARVIDNSTAYSSDLDCL-SRTAMAGRPRIAVGEVN 72 Query: 58 DETGQFKPYPASNFSTAVPQSATAMLVTALKDSRWFIPLER------QGLQNLLNERKII 111 D TG+F T Q M+++AL D F ER Q N R I Sbjct: 73 DLTGRFSSLDG----TVATQGTALMVMSAL-DRAGFPLAERLDTRVAQQELEFANSRLIG 127 Query: 112 RAAQENGTVAINNRIPLQSLTAANIMVEGSIIGYESNVKSGGVGARYFGIGADTQYQLDQ 171 Q R+ S+ ++ ++ G + N+ SG AR + ++ Sbjct: 128 PDGQ---PTENYRRVMAGSIAGSDYIILGGVTELNFNLHSGVAEARIGPLIGGRRHYAMT 184 Query: 172 IAVNLRVVNVSTGEILSSVNTSKTILSYEVQAGVFRFIDYQRLLEGEVGYTSNEPVMLCL 231 + ++LR+V+ + ++ V+ SK I +E++AGVF FI L + +G EPV + Sbjct: 185 VGLDLRLVDATDLRVIDIVSQSKVIRGHEIRAGVFEFIGDTTL-DIGMGERVQEPVHTAI 243 Query: 232 MSAIETGVIFLIN 244 + +E+ V L++ Sbjct: 244 RTIVESAVFDLVS 256 >UniRef50_A6GPY8 HfaB protein n=1 Tax=Limnobacter sp. MED105 RepID=A6GPY8_9BURK Length = 370 Score = 172 bits (436), Expect = 1e-41, Method: Composition-based stats. 
Identities = 56/259 (21%), Positives = 102/259 (39%), Gaps = 27/259 (10%) Query: 5 FLLVAVMLLSGCLTAPPKEAAR-----------PTLMPRAQSYKDLTHLPAPTGK--IFV 51 + V S C T P A + + + L L G Sbjct: 11 LISVIAASASACTTLPSFSEAEYVSPFDGASVVENTTRYSPALECLKPLVGGRGPNAKRF 70 Query: 52 SVYNIQDETGQFKPYPASNFSTAVPQSATAMLVTALKDSRWFIPLER-----QGLQNLLN 106 +V + D TG+ + Q A M+++AL + +ER ++ Sbjct: 71 AVGRVSDFTGKEDLVNG----KRITQGAALMVISALAKT-GVPMVERFDTTIADMELKYA 125 Query: 107 ERKIIRAAQENGTVAINNRIPLQSLTAANIMVEGSIIGYESNVKSGGVGARYFGIGADTQ 166 + K+I +N + +I SL ++ + G I N++SG + + IGA + Sbjct: 126 DNKLIT---DNPDSKAHRQIFSGSLPGSDYHIVGGITEVNYNIRSGSLESSIRFIGAAAR 182 Query: 167 YQLDQIAVNLRVVNVSTGEILSSVNTSKTILSYEVQAGVFRFIDYQRLLEGEVGYTSNEP 226 Y + +AV+LRVVN T E++++ + K I+ E+ G FR L++ + EP Sbjct: 183 YFVMNVAVDLRVVNTKTLEVVNTQSLQKQIIGTELNGGYFRLFS-DGLVDVSAAERTQEP 241 Query: 227 VMLCLMSAIETGVIFLIND 245 + + IE V ++ + Sbjct: 242 IQKGVRMVIEHAVFNMLTE 260 >UniRef50_A4CEI5 Putative uncharacterized protein n=1 Tax=Pseudoalteromonas tunicata D2 RepID=A4CEI5_9GAMM Length = 490 Score = 160 bits (405), Expect = 5e-38, Method: Composition-based stats. 
Identities = 54/253 (21%), Positives = 95/253 (37%), Gaps = 27/253 (10%) Query: 1 MQRLFLLVAVMLLSGC---LTAPPKEAARPTLMPRAQSYKDLTHLPAPTGKIFVSVYNIQ 57 + L L + L C T P + Q + K ++V Sbjct: 6 FRPLVLASVITSLVACQSTSTQVTSNTNTPNVNQVQQEQYN-------GAKARIAVARFT 58 Query: 58 DETGQFKPYPASNFSTAVPQSATAMLVTALKDSRWFIPLERQGLQNLLNERKIIRAAQEN 117 DE+ +S + + L TAL + FI LERQ L +L+E+ ++ A Sbjct: 59 DESNNHHW-----WSKEIGNGMSDQLTTALVGTNRFIVLERQALDAVLSEQDLVTA---- 109 Query: 118 GTVAINNRIPLQSLTAANIMVEGSIIGYESNVKSGGVGARYFGIGADTQ-----YQLDQI 172 G V+ N+ + A I++ S+ ++ + +G F IG Q + Sbjct: 110 GRVSANSGAAFGEIEGAEIVIVASVTEFDDDASGARIGGMGF-IGDMVQSVSAGLSNTHM 168 Query: 173 AVNLRVVNVSTGEILSSVNTSKTILSYEVQAGVFRFIDYQRLLEGEVGYTSNEPVMLCLM 232 A++LR+++ T IL++ +++ A F +L G + SN P L Sbjct: 169 AIDLRLIDTRTSRILAATTVEGGSKDFDIAAAATNF--GSSILGGNLSAWSNTPKEKALR 226 Query: 233 SAIETGVIFLIND 245 I+ V F + Sbjct: 227 EIIQKAVEFTLTK 239 >UniRef50_C6XNU9 Holdfast attachment protein HfaB n=3 Tax=Alphaproteobacteria RepID=C6XNU9_HIRBI Length = 342 Score = 157 bits (396), Expect = 4e-37, Method: Composition-based stats. 
Identities = 52/269 (19%), Positives = 105/269 (39%), Gaps = 41/269 (15%) Query: 2 QRLFLLVAVMLLSGCLTAPPK-----------EAARPTLMPRAQSYKDLTHLPAPTG--K 48 + + LL + ++++GC++ P + S + G + Sbjct: 11 KSVALLASALMVTGCVSPVATKSGNYTKPIGGSPVTANPTPYSTSLVCMGDYAHQVGLGQ 70 Query: 49 IFVSVYNIQDETGQFKPYPASNFSTAVPQSATAMLVTALKDSRWFIPLER-----QGLQN 103 ++V I D TG+ V Q A+ M ++A + +ER L+ Sbjct: 71 PRIAVGRILDYTGKEDFEGG----RRVTQGASLMAISAFAKAGA-RLVERFDTSVSELEL 125 Query: 104 LLNERKIIRAAQENGTVAINNRIPLQSLTAANIMVEGSIIGYESNVKSGGVGARYFGIG- 162 K+I A + +I S+ ++ + G I SN++S G+ IG Sbjct: 126 KYANNKLIGDAADQ----DFRKITAGSIPGSDFYLVGGITELNSNIRSVGIDG---FIGD 178 Query: 163 ---------ADTQYQLDQIAVNLRVVNVSTGEILSSVNTSKTILSYEVQAGVFRFIDYQR 213 + + + ++LR+V + E++ ++ K I+ E+ AG+F F Sbjct: 179 RDVEDPKGNGGAKMFVINVGLDLRLVETESLEVVDVISYQKQIIGREISAGIFDF-ANNN 237 Query: 214 LLEGEVGYTSNEPVMLCLMSAIETGVIFL 242 + + +G + EP+ L + S IE V+ + Sbjct: 238 VFDIGLGERAQEPIQLAVRSVIERAVLEM 266 >UniRef50_C8R0R5 Curli production assembly/transport component CsgG n=1 Tax=Desulfurivibrio alkaliphilus AHT2 RepID=C8R0R5_9DELT Length = 247 Score = 152 bits (384), Expect = 1e-35, Method: Composition-based stats. 
Identities = 61/254 (24%), Positives = 104/254 (40%), Gaps = 39/254 (15%) Query: 4 LFLLVAVMLLSGCLTAPPKEAARPTLMPRAQSYKDLTHLPAPTGKIFVSVYNIQDET--- 60 FLL A + TA + RA+ Y H P K ++V + +D+T Sbjct: 12 FFLLAACSFIIPAATAQAQSGGPDMGTARAEDY----HGP----KAAIAVADFEDKTVGR 63 Query: 61 GQFKPYPASNFSTAVPQSATAMLVTALKDSRWFIPLERQGLQNLLNERKIIRAAQENGTV 120 GQ++ + MLVT L ++ FI LER+ L ++ E+ + +G Sbjct: 64 GQYRREYG--------RGMQDMLVTELFNTNRFIVLEREKLSAVIAEQDL----GASGRF 111 Query: 121 AINNRIPLQSLTAANIMVEGSIIGYESNV---------KSGGVGARYFGIGADTQYQLDQ 171 + P+ L A +MV ++ G++ +SG +G R + A Q Sbjct: 112 RQDTTAPIGELEGAQLMVIAAVTGFDPGTSGTKGTVRGRSGLLGDRLGSLTAGVQ--QAH 169 Query: 172 IAVNLRVVNVSTGEILSSVNTSKTILSYEVQAGVFRFIDYQRLLEGEVGYTSNEPVMLCL 231 +A++LRVV+ +TG ILS+ N S+++ F L + P+ + Sbjct: 170 VALDLRVVDTATGRILSATNVEGKARSFDLGGSAFGSQGSGGL-----STFARTPMEQAI 224 Query: 232 MSAIETGVIFLIND 245 AI V F++ Sbjct: 225 RRAIAEAVDFVVEQ 238 >UniRef50_P27343 Putative transcription activator protein hfaB n=6 Tax=Alphaproteobacteria RepID=HFAB_CAUCR Length = 337 Score = 149 bits (376), Expect = 9e-35, Method: Composition-based stats. 
Identities = 47/236 (19%), Positives = 89/236 (37%), Gaps = 19/236 (8%) Query: 22 KEAARPTLMPRAQSYKDLTHLPAPTG--KIFVSVYNIQDETGQFKPYPASNFSTAVPQSA 79 + + L +++ I D TG+ + V Q A Sbjct: 39 TAPVTANPTDYSSALVCLNQYARTNRIVAPRIAIGRIADYTGK---EESDGSGRKVTQGA 95 Query: 80 TAMLVTALKDSR-----WFIPLERQGLQNLLNERKIIRAAQENGTVAINNR-IPLQSLTA 133 + M V+A + F + N + I + R I + Sbjct: 96 SLMAVSAFAKAGMPLVERFDTSVSEFELKYANNKLISDRPNPAPDAPADFRKILAGQVPG 155 Query: 134 ANIMVEGSIIGYESNVKSGGVGARYF-----GIGADTQYQL--DQIAVNLRVVNVSTGEI 186 ++ V G I N++S G+ A G+ + + ++ IA++LR+VN T E+ Sbjct: 156 SDFYVIGGITELNYNIRSAGIDAYAGDKDTDGLKGNFRRRVFIMNIALDLRLVNTRTLEV 215 Query: 187 LSSVNTSKTILSYEVQAGVFRFIDYQRLLEGEVGYTSNEPVMLCLMSAIETGVIFL 242 + ++ K ++ EV AGVF F++ L + G + EP+ L + + IE + + Sbjct: 216 VDVISYQKQVVGREVSAGVFDFLN-GNLFDISAGRGALEPMQLAVRALIERATVEM 270 >UniRef50_C5SPX7 HfaB protein n=2 Tax=Caulobacteraceae RepID=C5SPX7_9CAUL Length = 281 Score = 147 bits (372), Expect = 3e-34, Method: Composition-based stats. Identities = 53/273 (19%), Positives = 99/273 (36%), Gaps = 29/273 (10%) Query: 3 RLFLLVAVML--LSG--CLTAPPKEAARPTLMPRAQSYKDLTHLPAPTGKIFVSVYNIQD 58 RLF ++ L+ + PP+ P + + L P+ G ++V I D Sbjct: 6 RLFFALSCSAGALTPGLAVATPPQAEVTLNETPVTPALRCLARRPSLNGLPRLAVGRIGD 65 Query: 59 ETGQFKPYPASNFSTAVPQSATAMLVTALKDSRWFIPLER------QGLQNLLNERKIIR 112 TG+ + + Q A+ V+AL +ER + N ++ + Sbjct: 66 LTGKIDFDTGA----KITQGASLFAVSAL-GYAGVPVVERLDNSVAEIELNYARQKLLSD 120 Query: 113 AAQENGTVAINNR-IPLQSLTAANIMVEGSIIGYESNVKSGGVGARYFGIGA-------D 164 + G N R I + + + G I N++S G A Sbjct: 121 TPERAGQSGDNFRPILAGQIAGSRYYIVGGITELNYNIRSDGYDAAIGSQALPGAQGQIS 180 Query: 165 TQYQLDQIAVNLRVVNVSTGEILSSVNTSKTILSYEVQAGVFRFIDYQRLLEGEVGYTSN 224 + + +AV+LR+VN T +++ +V K ++ + + LL + G + Sbjct: 181 GRTYVLNVAVDLRLVNTQTQQVVDTVTFQKQVIGVTNDRRLTGGSEDIGLL-LQGGNSRQ 239 Query: 225 EPVMLCLMSAIETGVIFLINDGIDRGLWDLQNK 257 EP+ + + +E V LI LW + Sbjct: 240 EPLQMSVRELVERSVYHLIA-----PLWTATDA 267 >UniRef50_Q0IE08 Curli production assembly/transport component CsgG subfamily protein n=1 
Tax=Synechococcus sp. CC9311 RepID=Q0IE08_SYNS3 Length = 301 Score = 116 bits (290), Expect = 9e-25, Method: Composition-based stats. Identities = 46/228 (20%), Positives = 85/228 (37%), Gaps = 41/228 (17%) Query: 39 LTHLPAPTGKIFVSVYNIQDETGQFKPYPASNFSTAVPQSATAMLVTALKDSRWFIPLER 98 + PT VSV + ++E GQ +S V + L L + +ER Sbjct: 51 FASIAGPT----VSVPDFKNEVGQLAW-----WSPRVSRQLADALSNELSAAGGLTVVER 101 Query: 99 QGLQNLLNERKIIRAAQENGTVAINNRIPL-QSLTAANIMVEGSIIGYESNVKS--GGVG 155 Q ++ +L+E+++ E G V N+R +T + ++ G + G+E+NV++ G G Sbjct: 102 QNVRAVLSEQEM----AELGIVRNNDRAAKSGQMTGSQYVILGRVSGFENNVETKQSGSG 157 Query: 156 ARYFGIGA--DTQYQLDQIAVNLRVVNVSTGEILSSVNTSKTILSY-------------- 199 R+ G G D ++++LRVV+ +TGE++ + Sbjct: 158 MRFLGFGGSKDVAETKAYVSLDLRVVDTTTGEVVGYKTVEGRAKNTAKVKGSGGSLAPLA 217 Query: 200 --------EVQAGVFRFIDYQRL-LEGEVGYTSNEPVMLCLMSAIETG 238 G + L T P + +A+ Sbjct: 218 GLVGGLTGASGTGAYGLAAAGTLSFNESSSETKKTPASKAVRAALIAA 265 >UniRef50_B9L5Z4 Putative curli production assembly/transport component CsgG subfamily n=1 Tax=Nautilia profundicola AmH RepID=B9L5Z4_NAUPA Length = 443 Score = 112 bits (279), Expect = 2e-23, Method: Composition-based stats. 
Identities = 58/286 (20%), Positives = 110/286 (38%), Gaps = 36/286 (12%) Query: 1 MQRLF-LLVAVMLLSGCLTAPPKEAARPTLMPRAQSYKDLTHLPAPTGKIFVSVYNIQDE 59 ++RLF +V + LSGC+ + Q+ ++ P + ++V + + Sbjct: 3 LRRLFAFMVGIFFLSGCV------GTTTNVTTSNQNVNEIVKYKGPKAR--IAVASFK-- 52 Query: 60 TGQFKPYPASNFSTAVPQSATAMLVTALKDSRWFIPLERQGLQNLLNERKIIRAAQENGT 119 A+ + + ML TAL +S FI +ER ER++ G Sbjct: 53 ------CKAAKCNGQIGSGIADMLTTALFNSGKFIVIERSNEGFSAVEREL---QLSQGM 103 Query: 120 VAINNRIPLQSLTAANIMVEGSIIGYESNVKSGGVGA------RYFGIGADTQYQLD--Q 171 + N +I +L A+I+V G+I +E K+GG+ A R + ++ D Sbjct: 104 IKQNRQIN--NLEGADILVVGAITAFEP--KAGGISAGGIVIPRGVPVIGGIKFGKDEAY 159 Query: 172 IAVNLRVVNVSTGEILSSVNTSKTILSYEVQAGVFRFIDYQRLLEGEVGYTSNEPVMLCL 231 IA ++R+++V TG I+++ + + F L + N P+ + Sbjct: 160 IAADIRLIDVKTGRIINATTVEGQASKWNIGGIGGGFTGNVAL-GAGLSTYKNTPMEKAI 218 Query: 232 MSAIETGVIFLINDGIDRGL--WDLQNKAERQNDILVKYRHMSVPP 275 I V I I ++ Q + + + +V P Sbjct: 219 RDMINKAVEK-IAQLIPDNYYRYNANGTVNNQLNTTINTQQNTVKP 263 >UniRef50_A2CA24 Uncharacterized protein involved in formation of curli polymers n=3 Tax=Cyanobacteria RepID=A2CA24_PROM3 Length = 275 Score = 101 bits (252), Expect = 2e-20, Method: Composition-based stats. Identities = 39/161 (24%), Positives = 70/161 (43%), Gaps = 14/161 (8%) Query: 49 IFVSVYNIQDETGQFKPYPASNFSTAVPQSATAMLVTALKDSRWFIPLERQGLQNLLNER 108 VSV + +++ G+ +S V + L L + +ERQ L+ +L+E+ Sbjct: 31 PTVSVPDFKNQVGRLSW-----WSPRVSRQLADALSNELALAGGLTVVERQNLKAVLSEQ 85 Query: 109 KIIRAAQENGTVAINNRIPLQ-SLTAANIMVEGSIIGYESNVKS--GGVGARYFGIGAD- 164 ++ E G V + + A ++ G + GYE V++ G G R+ G G Sbjct: 86 EL----AELGIVRNDGDAARSRQMRGARYLIMGRVSGYEDGVETKQSGSGMRFMGFGGSK 141 Query: 165 -TQYQLDQIAVNLRVVNVSTGEILSSVNTSKTILSYEVQAG 204 ++++LRVV+ STGE++ + S Q G Sbjct: 142 TVSESKAYVSIDLRVVDSSTGEVVGARTVEGRATSTAKQKG 182 >UniRef50_D0SYN2 Predicted protein n=1 Tax=Acinetobacter lwoffii SH145 RepID=D0SYN2_ACILW Length = 362 Score = 99.1 bits (245), Expect = 2e-19, Method: Composition-based stats. 
Identities = 40/225 (17%), Positives = 81/225 (36%), Gaps = 21/225 (9%) Query: 22 KEAARPTLMPRAQSYKDLTHLPAPTGKIFVSVYNIQDETGQFKPYPASNFSTAVPQSATA 81 P + L + SV I D+TG+ V Q A Sbjct: 4 GPPITDIFTPFDMALSCLKGQ--LRSDVSFSVGAILDQTGK-DVVTNGGSGKMVTQGAGD 60 Query: 82 MLVTALKDSRWFIPLERQGLQNLLNERKI-IRAAQENGTVAINNRIPLQSLTAANIMVEG 140 M+ +AL + + R+ + + +E K IR ++ + A++ V G Sbjct: 61 MVQSALFQA-GVSLMNRRDPRIIESEAKWGIRDPRQ--------------IQASDYYVTG 105 Query: 141 SIIGYESNVKSGGVGARYFGIGADTQYQLDQIAVNLRVVNVSTGEILSSVNTSKTILSYE 200 SI + + GG + G+G + + ++L + + + +++ +V+ K I + + Sbjct: 106 SINSLDF-IPGGGFDMQIAGVGPNYSQTRIMVGLDLSLTDTRSSKVVGNVSLQKQIAAQD 164 Query: 201 VQAGVFRFIDYQRLLEGEVGYTSNEPVMLCLMSAIETGVIFLIND 245 RF + LL ++G E L + L++ Sbjct: 165 YGLSAGRF-AGRTLLNIQIGKGEREATNFALRQMLNLATFELLSQ 208 >UniRef50_A1BH58 Curli production assembly/transport component CsgG n=3 Tax=Chlorobium/Pelodictyon group RepID=A1BH58_CHLPD Length = 240 Score = 96.0 bits (237), Expect = 1e-18, Method: Composition-based stats. Identities = 52/220 (23%), Positives = 90/220 (40%), Gaps = 18/220 (8%) Query: 45 PTGKIFVSVYNIQDETGQFKPYPASNFSTAVPQSATAMLVTALKDSRWFIPLERQGLQNL 104 K + V + T A+ + V ML++ L + F LER L + Sbjct: 23 AQEKPRIGVLRFTNNT------YATWWRGGVGTDLQDMLISELASTNSFRVLERNELDAV 76 Query: 105 LNERKIIRAAQENGTVAINNRIPLQSLTAANIMVEGSIIGYESNVKSGGVGARYFGIGAD 164 + E+ + +G + R L +T A +V ++ +E N GG G Y GI Sbjct: 77 IREQDL----GASGRINPGTRSKLGKITGAKYLVAATVSAFEHNTSGGGGGISYGGISLG 132 Query: 165 TQYQLDQIAVNLRVVNVSTGEILSSVNTSKTILSYEVQAGVFRFIDYQRLLEGEVGYTSN 224 + +AV+L+V++V TGEI + T S + GV+R G + + Sbjct: 133 GKQDKAYMAVDLKVIDVQTGEIYDARTVEATSKSSGISVGVYR-----GGFGGHLNQYKD 187 Query: 225 EPVMLCLMS-AIETG--VIFLINDGIDRGLWDLQNKAERQ 261 PV + + IE + + +G + G D N+ +R+ Sbjct: 188 TPVGKAIRACVIEIAEYLECSLVEGKNSGCMDEYNQKDRK 227 >UniRef50_P73111 Sll1835 protein n=3 Tax=Chroococcales RepID=P73111_SYNY3 Length = 265 Score = 92.1 bits (227), Expect = 2e-17, Method: Composition-based stats. 
Identities = 47/201 (23%), Positives = 82/201 (40%), Gaps = 20/201 (9%) Query: 45 PTGKIFVSVYNIQDETGQFKPYPASNFSTAVPQSATAMLVTALKDSRWFIPLERQGLQNL 104 GK +SV +++T + + N S + L L + F +ERQ L ++ Sbjct: 36 AQGKPTISVPEFKNDTNMSWWWWSGNTSREL----ADALSNELTSTGNFQVVERQNLGSV 91 Query: 105 LNERKIIRAAQENGTVAINNRIPLQSLTAANIMVEGSIIGYESNVKSGGVGARY------ 158 L+E+++ E G LT A +V G + YE V S G + Sbjct: 92 LSEQEL----AELGLTRPETSAQRGQLTGAQYIVLGRVTAYEEGVSSESGGNNFGLNLGL 147 Query: 159 FGIGADTQY--QLDQIAVNLRVVNVSTGEILSSVNTSKTILSYEVQAGVFRFIDYQRLLE 216 F IG + Q +A++LRVV+ STGE++ + + + A ++ ++ Sbjct: 148 FSIGNSERQAKQEAYVAIDLRVVDSSTGEVVYARTVEG--RATDTAASSANNVNILGIVN 205 Query: 217 G--EVGYTSNEPVMLCLMSAI 235 + TS PV L + + Sbjct: 206 TGQDNQSTSRAPVGKALRAGL 226 >UniRef50_Q05S73 Putative uncharacterized protein n=1 Tax=Synechococcus sp. RS9916 RepID=Q05S73_9SYNE Length = 256 Score = 90.6 bits (223), Expect = 5e-17, Method: Composition-based stats. Identities = 42/156 (26%), Positives = 68/156 (43%), Gaps = 13/156 (8%) Query: 44 APTGKIFVSVYNIQDETGQFKPYPASNFSTAVPQSATAMLVTALKDSRWFIPLERQGLQN 103 AP + V+V I + +S V + T ML LK + F +ER GL+ Sbjct: 33 APRQPVTVAVKEITN-----NASGVWWWSPRVSKQLTDMLSNELKATGNFTLVERAGLKK 87 Query: 104 LLNERKIIRAAQENGTVAINNRIPLQSLTAANIMVEGSIIGYE--SNVKSGGVGARYFGI 161 +L+E+++ E G + +T A V G++ Y+ + KSGG G G Sbjct: 88 VLDEQEL----AELGITRQSTAPKRGMVTGAKYYVLGAVSDYQQGTETKSGGGGFNIMGF 143 Query: 162 G--ADTQYQLDQIAVNLRVVNVSTGEILSSVNTSKT 195 G + +A+++RVV+ +TGEI S Sbjct: 144 GQRKSSSESKAYVALDVRVVDTTTGEIAYSRTIEGK 179 >UniRef50_D0XIZ9 HfaB protein n=1 Tax=Brevundimonas subvibrioides ATCC 15264 RepID=D0XIZ9_9CAUL Length = 245 Score = 86.0 bits (211), Expect = 1e-15, Method: Composition-based stats. 
Identities = 40/239 (16%), Positives = 90/239 (37%), Gaps = 24/239 (10%) Query: 20 PPKEAARPTLMPRAQSYKDLTH-LPAPTGKIFVSVYNIQDETGQFKPYPASNFSTAVPQS 78 P A + P + + L + ++V NI+D +G+ + PQ Sbjct: 2 QPGAVAPQAIAP--DTVRCLQQTGTGTGSRPRIAVGNIRDLSGRVSLESGALL----PQG 55 Query: 79 ATAMLVTALKDSRWFIPLER-----QGLQNLLNERKIIRAAQENG--TVAINNRIPLQSL 131 A+ ++AL + + +ER ++ ++++ E RI + Sbjct: 56 ASMFAISALLEM-GYPVVERFDMAIAEIEINYARQQLLSDTPELAGQVQDNYRRIYPGQI 114 Query: 132 TAANIMVEGSIIGYESNVKS-------GGVGARYFGIGADTQYQLDQIAVNLRVVNVSTG 184 + + G++ + V S V + + A Y+ +A++LR+V+ + Sbjct: 115 AGSRFYLTGALTELNTGVSSLSGAASATAVSSAIASLSASGGYERASVALDLRLVDTLSQ 174 Query: 185 EILSSVNTSKTILSYEVQAGVFRFIDYQRLLEGEVGYTSNEPVMLCLMSAIETGVIFLI 243 E++ S +++ S + G ++ G+V Y + V + +E V L+ Sbjct: 175 EVVGSATVRRSLSSGNLSVGAIG--ATAPVVSGQVAYARSSEVQYAVRGMVEEAVRVLV 231 >UniRef50_Q2JVB7 CsgG family protein n=4 Tax=Cyanobacteria RepID=Q2JVB7_SYNJA Length = 364 Score = 84.4 bits (207), Expect = 3e-15, Method: Composition-based stats. Identities = 45/219 (20%), Positives = 80/219 (36%), Gaps = 27/219 (12%) Query: 34 QSYKDLTHLPAPTGKIFVSVYNIQDETGQFKPYPASNFSTAVPQSATAMLVTALKDSRWF 93 + + P PT + V+V + F S + + +LV L F Sbjct: 60 SAPSQVGQAPQPTTRPRVAVLDFD-----FSSLSNSYSLREASRGVSDLLVDRLVRDGTF 114 Query: 94 IPLERQGLQNLLNERKIIRAAQENGTVAINNRIPLQSLTAANIMVEGSIIGYESNVKSGG 153 +E L +L E+ + +G + N + + + ++ GS+ ++ +V+ G Sbjct: 115 SVIEPSRLDAILAEQNL----GLSGRLDANTAAQVGRILGVDAVILGSVTQFDVSVRRSG 170 Query: 154 VGARY--------FGIGADTQYQLDQIAVNLRVVNVSTGEILSSVNTSKTILSYEVQAGV 205 AR +GA+ + +N R+V+ ST EIL+ V + + V Sbjct: 171 GEARVLTPFGSFPLAVGAEVVDADANVQLNARLVSTSTAEILAVVEGRGNVSQSDSTVTV 230 Query: 206 FRFIDYQRLLEGEVGYTSNEPVM--LCLMSAIETGVIFL 242 F G TSNE + L A+E L Sbjct: 231 ADF--------GGGSATSNEEKLLVLASQQAVEQIAQQL 261 >UniRef50_Q31MJ4 Putative uncharacterized protein n=2 Tax=Synechococcus elongatus RepID=Q31MJ4_SYNE7 Length = 332 Score = 78.3 bits (191), Expect = 3e-13, Method: Composition-based stats. 
Identities = 43/193 (22%), Positives = 86/193 (44%), Gaps = 11/193 (5%) Query: 5 FLLVAVMLLSGCLTAPP-KEAARPTLMPRAQSYKDLTHLPAPTGKIFVSVYNIQDETGQF 63 FL++ + LL+G + P +A +L+ +A+ L K ++V + D + Sbjct: 7 FLVLNLSLLTGLIQTPAIATSAEQSLVIKAKQPLLLAQN---QAKRRIAVLDF-DFSNVS 62 Query: 64 KPYPASNFSTAVPQSATAMLVTALKDSRWFIPLERQGLQNLLNERKIIRAAQENGTVAIN 123 P S F V + + +LV L + +ER + +LNE+ + +G + + Sbjct: 63 SPSVLSAFPN-VSKGVSDILVNRLVKDGTYTLIERSRIDAVLNEQNL----GASGRIDPS 117 Query: 124 NRIPLQSLTAANIMVEGSIIGYESNVKSGGVGARYFGIGADTQYQLDQIAVNLRVVNVST 183 + + + ++ GS+ + + G G+ FGIG ++ +N+R+V+ ST Sbjct: 118 TAAQIGKILGVDAVIIGSVTRLDLQTRQSG-GSFLFGIGGNSTDVDAYAQINIRMVSTST 176 Query: 184 GEILSSVNTSKTI 196 EIL+ + I Sbjct: 177 AEILAVAEGTGNI 189 >UniRef50_D0XJ00 Putative uncharacterized protein n=1 Tax=Brevundimonas subvibrioides ATCC 15264 RepID=D0XJ00_9CAUL Length = 338 Score = 76.7 bits (187), Expect = 7e-13, Method: Composition-based stats. Identities = 40/242 (16%), Positives = 85/242 (35%), Gaps = 19/242 (7%) Query: 6 LLVAVMLLSGCLTAPP---KEAARPTLMPRAQSYKDLTHLPAPTGKIFVSVYNIQDETGQ 62 + + +LL+GC T+P E + +S A + ++V ++ D TG+ Sbjct: 4 IGLVALLLAGCATSPSLTTFEREFARTGRQTESLAQCLAASAEATRPILAVGSVADLTGR 63 Query: 63 FKPYPASNFSTAVPQSATAMLVTALKDSRWFIPLERQGLQNLLNERKIIRAAQENGTVAI 122 V Q A L +A + ERQ L +E +++ V+ Sbjct: 64 QTFATG----RVVTQGAGLFL-SADLATFGIRLAERQDTSVLDSETRLL--------VSD 110 Query: 123 NNRIPLQSLTAANIMVEGSIIGYESNVKSGGVGARYFGIGADTQYQLD--QIAVNLRVVN 180 + + V G+I+ + +G G G G ++ V++R+++ Sbjct: 111 TTPGETGRIAGSRYYVSGAIVTADPADAAGLQGQAIGGAGVTLSSTEMRRRVVVSIRIID 170 Query: 181 VSTGEILSSVNTSKTILSYEVQAGVFRFIDYQRLLEGEVGYTSNEPVMLCLMSAIETGVI 240 + I + + +S + + +L + + N+ + L AI Sbjct: 171 SRSLLIAAVGTYERIAVSTDQSLSISDPSSLD-VLRFDARRSENDGLDLATRLAIREAAR 229 Query: 241 FL 242 + Sbjct: 230 DI 231 >UniRef50_O67219 Putative uncharacterized protein n=1 Tax=Aquifex aeolicus RepID=O67219_AQUAE Length = 462 Score = 75.6 bits (184), Expect = 2e-12, Method: Composition-based stats. 
Identities = 52/265 (19%), Positives = 86/265 (32%), Gaps = 37/265 (13%) Query: 1 MQRLFLLVAVMLLSGCLTAPPKEAARPTLMPRAQSYKDLT-HLPAPTGKIFVSVY-NIQD 58 M L ++ + L+ C + + + T A+ + LP I V + Sbjct: 1 MFPLLVISVIFLIFSCGPVAQQASTQTTQGEYAKDIRQREPELPKCDRPIGTIVARGFKC 60 Query: 59 ETGQ-------FKPYPASNFSTAV-PQSATAMLVTALKDSRWFIPLERQGLQNLLNERKI 110 + Q F P S V + MLVTAL + F LER+ LQ + E ++ Sbjct: 61 KAAQCAGDRIVFGPNYTVEVSPKVLGDGLSDMLVTALVKTGCFRVLERETLQEIKEELEM 120 Query: 111 IRAAQENGTVAINNRIPLQSLTAANIMVEGSIIGYESNVKSGGVGAR------YFGIGAD 164 + P ++L A+ ++ GSI E G G G+G Sbjct: 121 LG------------VQPKKALKGADFLLTGSITALEMKASGMGGGGVVVPLPFLGGVGVK 168 Query: 165 TQYQLDQIAVNLRVVNVSTGEILSSVNTSKTILSYEVQAGVFRFIDYQRLLEGEVGYT-- 222 IA++LR+V V E+L + ++ G Sbjct: 169 AGKSSAHIALDLRLVRVRDAEVLLAETVEGKSDRWKFGV-----GGGGIFGTTIAGGWFE 223 Query: 223 --SNEPVMLCLMSAIETGVIFLIND 245 N P+ I V ++ Sbjct: 224 AFKNTPMEEATRDLIYHAVKLIVAQ 248 >UniRef50_A6DA55 Putative uncharacterized protein n=1 Tax=Caminibacter mediatlanticus TB-2 RepID=A6DA55_9PROT Length = 174 Score = 75.2 bits (183), Expect = 2e-12, Method: Composition-based stats. 
Identities = 44/198 (22%), Positives = 78/198 (39%), Gaps = 39/198 (19%) Query: 12 LLSGCLTAPPKEAARPTLMPRAQSYKDLTHLPAPTGKIFVSVYNIQDETGQFKPYPASNF 71 SGC T + + Q+ D+ K ++V + + A+ Sbjct: 5 FFSGCGT------SISNVSTSKQNINDVASYQG--KKARIAVASFK--------CKAAKC 48 Query: 72 STAVPQSATAMLVTALKDSRWFIPLER-----QGLQNLLNERKIIRAAQENGTVAINNRI 126 + ++ + +L TAL + FI LER + +QN LN + I+ N Sbjct: 49 NGSIGSGISDILTTALMKTNKFIVLERDSEAMRAIQNELNNQIIMTNRHANR-------- 100 Query: 127 PLQSLTAANIMVEGSIIGYESNVKSGGVGA-----RYFGIGADTQYQLD-QIAVNLRVVN 180 + +I+V G+I +E G+G IG + D IA++LR+V+ Sbjct: 101 ----MEGTDILVVGAITAFEPKAGGFGIGGVTIPLNVPVIGGIKFAKNDAYIALDLRLVD 156 Query: 181 VSTGEILSSVNTSKTILS 198 +STG +L++ S Sbjct: 157 ISTGRVLAATTIEGKASS 174 >UniRef50_B0VF47 Putative curli production assembly/transport component CsgG n=1 Tax=Candidatus Cloacamonas acidaminovorans RepID=B0VF47_9BACT Length = 314 Score = 72.9 bits (177), Expect = 1e-11, Method: Composition-based stats. Identities = 45/211 (21%), Positives = 81/211 (38%), Gaps = 34/211 (16%) Query: 5 FLLVAVMLLSGCLTAP-PKEAARPTLMPRAQSYKDLTHLPAPTGKIFVSVYNIQDET--G 61 FL + +L+S C P + LMP + D K +++ +ET Sbjct: 7 FLCIGALLISACAQNQAPAKVEVVNLMPAEKQITDEQIH----LKKKIAIGRFTNETRLA 62 Query: 62 QFKPYPASNFSTAVPQSATAMLVTALKDSRWFIPLERQGLQNLLNERKIIRAAQENGTVA 121 SN + + ++A +L + L + FI +ERQ L +++ AQ + Sbjct: 63 NSFLNEGSNTGSRMSKAANDILASKLAITNRFILIERQDELILDINQQVADIAQYH---- 118 Query: 122 INNRIPLQSLTAANIMVEGSIIGYESNVKSGGVGARYFGIGADTQYQLDQIAVNLRVVNV 181 A+ ++ GSI + G G+ T+ Q VNLR+++ Sbjct: 119 ----------IPADYIILGSITEF------GTSNTGNVGLIDRTKKQTAFAKVNLRILDT 162 Query: 182 STGEILS-------SVNTSKTILSYEVQAGV 205 TG ++ + ++ T+L QAG Sbjct: 163 HTGRVIYGEEGAGEASTSTSTVLGMGSQAGY 193 >UniRef50_D1B8K8 Curli production assembly/transport component CsgG n=1 Tax=Thermanaerovibrio acidaminovorans DSM 6589 RepID=D1B8K8_THEAS Length = 305 Score = 72.1 bits (175), Expect = 2e-11, Method: Composition-based stats. 
Identities = 31/195 (15%), Positives = 75/195 (38%), Gaps = 19/195 (9%) Query: 44 APTGKIFVSVYNIQDETGQFKPYPASNFSTAVPQSATAMLVTALKDSRWFIPLERQGLQN 103 K ++V + QD S+ + A + M+ T L ++ F +ER + Sbjct: 21 PSLAKARIAVLSFQD----------SSGAGAPAAAIADMMTTELFNTGLFSVVERSRIDQ 70 Query: 104 LLNERKIIRAAQENGTVAINNRIPLQSLTAANIMVEGSIIGYESNVKSGGVGARYFGI-G 162 + E+++ G + ++ + + L A ++ GSI Y G + + G+ G Sbjct: 71 IAMEQRMS----AQGLTSPSSAVQMGQLLGAEYLMTGSITQYRYEASGGVIPLPFGGLSG 126 Query: 163 ADTQYQLDQIAVNLRVVNVSTGEILSSVNTSKTILSYEVQAGVFRFIDYQRLLEGEVGYT 222 + + +++R++N +TGE++++ + + + G+ G Sbjct: 127 VAVGSETAYVTLDVRLINAATGEVITTARAEGAANQTQ-GGLAYDSAVFGT---GKAGGL 182 Query: 223 SNEPVMLCLMSAIET 237 + + +E Sbjct: 183 LGQATYKAVTKIVEQ 197 >UniRef50_B8HPG7 CsgG family protein, putative n=1 Tax=Cyanothece sp. PCC 7425 RepID=B8HPG7_CYAP4 Length = 330 Score = 72.1 bits (175), Expect = 2e-11, Method: Composition-based stats. Identities = 40/186 (21%), Positives = 76/186 (40%), Gaps = 21/186 (11%) Query: 28 TLMPRAQSYKDLTHLPAPTGKIFVSVYNIQ----DETGQ-FKPYPASNFSTAVPQSATAM 82 T P + L P K+ ++V N TG + ++ S AV + + Sbjct: 27 TTAPSSPGLVPLQARP----KVRIAVLNFDFSNIGLTGAVYSFTDSAGPSKAV----STL 78 Query: 83 LVTALKDSRWFIPLERQGLQNLLNERKIIRAAQENGTVAINNRIPLQSLTAANIMVEGSI 142 L L ++ +ER + +L E+ + +A G + + + + +V GS+ Sbjct: 79 LTNLLVKDGTYVVVERSRIDAVLAEQNLGQA----GRIEPTTAAQVGRILGVDAVVIGSV 134 Query: 143 IGYESNVKSGGVGARYFGIGADTQYQLDQIAVNLRVVNVSTGEILSSVNTSK--TILSYE 200 + K GGV G G+ + Q+ ++ + R+V+ +TGEILS T + Sbjct: 135 TEFGLEQKKGGV--NILGFGSQKETQIARVQLAARIVSTTTGEILSVAEAKGEATQVDES 192 Query: 201 VQAGVF 206 + G + Sbjct: 193 ISVGGY 198 >UniRef50_Q21E38 Curli production assembly/transport component CsgG n=1 Tax=Saccharophagus degradans 2-40 RepID=Q21E38_SACD2 Length = 321 Score = 71.3 bits (173), Expect = 3e-11, Method: Composition-based stats. 
Identities = 48/243 (19%), Positives = 96/243 (39%), Gaps = 32/243 (13%) Query: 1 MQRLFLLVAVMLLSGCLTAPPK-EAARPTLMPRAQSYKDLTHLPAPTGKIF-----VSVY 54 M+ + + V LL C + P+ + PT+ + Q L A K ++V Sbjct: 1 MKIVTTGLMVALLCSCASQDPRLKDVEPTISEQQQREAQAKLLEAQATKTLALKRKIAVG 60 Query: 55 NIQDETGQFKPYPASNFSTAVPQSATAMLVTALKDSRWFIPLERQGLQNLLNERKIIRAA 114 + +ET K S+ + + + + M V +L +S ++ ER L+ L NE ++ Sbjct: 61 RLSNETSYGKSLLGSSKNDVLGEKVSDMFVQSLANSGNYLIFERPDLELLENEARLTGET 120 Query: 115 QENGTVAINNRIPLQSLTAANIMVEGSIIGYESNVKSGGVGARYFGIGADTQYQLDQIAV 174 L + +V GS+ + N G + ++ Q V Sbjct: 121 VN--------------LIGVDTLVIGSLTQFGRNTTGES------GFLSSSKKQEATATV 160 Query: 175 NLRVVNVSTGEILSSVNTSKTILSYEVQAGVFRFIDYQRLLEGEVGYTSNEPVMLCLMSA 234 +LR+V TG + +SV + + S E R + + + + G +++ + + +A Sbjct: 161 DLRLVETKTGRVFASVTGTGSS-STETA----RTMGFGSVAGYD-GSINDQAIGAAVNAA 214 Query: 235 IET 237 +E Sbjct: 215 VEK 217 >UniRef50_B0CFQ9 CsgG family protein, putative n=1 Tax=Acaryochloris marina MBIC11017 RepID=B0CFQ9_ACAM1 Length = 336 Score = 70.2 bits (170), Expect = 7e-11, Method: Composition-based stats. Identities = 29/155 (18%), Positives = 64/155 (41%), Gaps = 12/155 (7%) Query: 46 TGKIFVSVYNIQ----DETGQFKPYPASNFSTAVPQSATAMLVTALKDSRWFIPLERQGL 101 ++V + +TG N + + +L L ++ +ER + Sbjct: 45 KESRRIAVLDFDYANVSKTGISYGLYGKN---GASRGISNLLTNELVKDGTYVLVERSKI 101 Query: 102 QNLLNERKIIRAAQENGTVAINNRIPLQSLTAANIMVEGSIIGYESNVKS-GGVGARYFG 160 +L E+ + +G + + + + ++ GSI + +S GG +FG Sbjct: 102 DTILAEQNL----GASGRIEPTTAAQIGRVLGVDAVLIGSITQFHIEEQSKGGSIGGFFG 157 Query: 161 IGADTQYQLDQIAVNLRVVNVSTGEILSSVNTSKT 195 +G + QL + ++ R+V+ +TGEIL++ + Sbjct: 158 LGGKQKTQLATVQLSTRLVSTATGEILTAAEGTGQ 192 >UniRef50_A6DK47 Curli production assembly/transport component CsgG n=1 Tax=Lentisphaera araneosa HTCC2155 RepID=A6DK47_9BACT Length = 311 Score = 68.6 bits (166), Expect = 2e-10, Method: Composition-based stats. 
Identities = 33/209 (15%), Positives = 76/209 (36%), Gaps = 22/209 (10%) Query: 46 TGKIFVSVYNIQDETG-QFKPYPASNFSTAVPQSATAMLVTALKDSRWFIPLERQGLQNL 104 ++V D++ +++ V + L+ L S+ + L R+ + + Sbjct: 30 KKIPTIAVLEFADKSHFRYRWN--------VGEGIRDSLIDELVQSKRYKVLTRKNIDAV 81 Query: 105 LNERKIIRAAQENGTVAINNRIPLQSLTAANIMVEGSIIGYESNVKSGGVGARYFGIGAD 164 + E I Q++ ++ L +++GS+ + K+ G A + G Sbjct: 82 IGELNI----QQDKLFRPEGKVARGRLKNVQYLLKGSVTDFAHVAKT-GASAFFSNWGFS 136 Query: 165 TQYQLDQIAVNLRVVNVSTGEILSSVNTSKTILSYEVQ-AGVFRFIDYQRLLEGEVGYTS 223 + + V L ++ V +GEI++S + + AG ++ + + G Sbjct: 137 GSTHVAVVTVTLYIIEVESGEIIASKQVEGKAHATSLDVAGQYKNMSFGS------GSFY 190 Query: 224 NEPVMLCLMSAIETGVIFLINDGIDRGLW 252 P+ + ++ IN I W Sbjct: 191 RTPLGKACKELMHQALLE-INKTIADKKW 218 >UniRef50_Q9RTM7 Putative uncharacterized protein n=1 Tax=Deinococcus radiodurans RepID=Q9RTM7_DEIRA Length = 212 Score = 68.6 bits (166), Expect = 2e-10, Method: Composition-based stats. Identities = 32/172 (18%), Positives = 59/172 (34%), Gaps = 27/172 (15%) Query: 74 AVPQSATAMLVTALKDSRWFIPLERQGLQNLLNERKIIRAAQENGTVAINNRIPLQSLTA 133 + + L TAL ++ F ER+ L E + A + Sbjct: 60 GLGEGIGDALTTALLNTGKFAVYERENTAQLTEEAFLNGGA---------------TFQG 104 Query: 134 ANIMVEGSIIGYESNVKSGGVGARYFGIGADTQYQLDQIAVNLRVVNVSTGEILSSVNTS 193 A++++ G+I YE SGG+ +G + IA++LR+V+ T I+ + Sbjct: 105 ADVLIFGAITQYEPQASSGGLSFMGVSVGKKSS----TIAMDLRIVDAKTRRIIGATQVQ 160 Query: 194 KTILSYEVQAGVFRFIDYQRLLEGEVGYTSNEPVMLCLMSAIETGVIFLIND 245 + LL VG S+ + + + L+ Sbjct: 161 GKAEGNN--------FNVSGLLPVNVGAQSSPQLEAAISQMLNNAAQQLLLK 204 >UniRef50_D1Y6S9 Tetratricopeptide repeat domain protein n=1 Tax=Pyramidobacter piscolens W5455 RepID=D1Y6S9_9BACT Length = 471 Score = 66.7 bits (161), Expect = 8e-10, Method: Composition-based stats. 
Identities = 37/201 (18%), Positives = 75/201 (37%), Gaps = 22/201 (10%) Query: 42 LPAPTGKIFVSVYNIQDETGQFKPYPASNFSTAVPQSATAMLVTALKDSRWFIPLERQGL 101 L I + V + A S V + T M +T L +S F ER L Sbjct: 19 LSPAAAVIRIGVDRFR--------SGAPGVSPDVADALTEMFITELSNSGSFQVYERTAL 70 Query: 102 QNLLNERKIIRAAQENGTVAINNRIPLQSLTAANIMVEGSIIGYESNVKSGGVGARYFGI 161 + + E+++ + G V+ + + + L ++ G++ + G + FG+ Sbjct: 71 EKVAREQRLSMS----GLVSESTLVKVGRLAGVEWIITGAVTQSDEKQTGGVLPIHGFGL 126 Query: 162 GADTQYQLDQIAVNLRVVNVSTGEILSSVNTSKTILSYEVQAGVFRFIDYQRLLEGEVGY 221 T + + +++R ++ +TG I +++ + S + V+ G VG Sbjct: 127 AVGTN--VGTVTLDVRTIDTTTGAITAALRKTGAA-SRAIAGAVYEGTVIGTTQYGGVGS 183 Query: 222 TSNEPVMLCLMSAIETGVIFL 242 M A++ V L Sbjct: 184 Q-------AAMKAVKRTVREL 197 >UniRef50_A9L616 Curli production assembly/transport component CsgG n=2 Tax=Alteromonadales RepID=A9L616_SHEB9 Length = 323 Score = 65.9 bits (159), Expect = 2e-09, Method: Composition-based stats. Identities = 46/206 (22%), Positives = 74/206 (35%), Gaps = 41/206 (19%) Query: 4 LFLLVAVMLLSGCLT---------APPKEAARPTLMPRAQSYKDLTHLPAPTGKIFVSVY 54 L ++ L + C T A P A T A + K L V++ Sbjct: 7 LIFVMCASLTTACATVNKKQVVTTAQPAAAISATQTEVALNTKLLKRK--------VAIG 58 Query: 55 NIQDET--GQFKPYPASNFSTAVPQSATAMLVTALKDSRWFIPLERQGLQNLLNERKIIR 112 +ET GQ N + + A +L + L + FI LER L + E + Sbjct: 59 RFTNETTYGQGFFIDEDN--NRIGKQAMDILSSKLFQTGKFIMLERADLGKIEKELAM-- 114 Query: 113 AAQENGTVAINNRIPLQSLTAANIMVEGSIIGYESNVKSGGVGARYFGIGADTQYQLDQI 172 N+ + AA+ ++ GSI + G G+ + + Q Sbjct: 115 --------GGNSTLK----NAADYLIVGSITEF------GRKEVSDVGVFSRVKKQEANA 156 Query: 173 AVNLRVVNVSTGEILSSVNTSKTILS 198 VN+R+V+V+TG I+ S S Sbjct: 157 KVNIRIVDVATGLIIYSEEGKGIAYS 182 >UniRef50_B0JU00 Curli production assembly/transport component n=2 Tax=Microcystis aeruginosa RepID=B0JU00_MICAN Length = 333 Score = 65.5 bits (158), Expect = 2e-09, Method: Composition-based stats. 
Identities = 40/212 (18%), Positives = 80/212 (37%), Gaps = 25/212 (11%) Query: 46 TGKIFVSVYNIQDETGQFKPYPASNFSTAVPQSATAMLVTALKDSRWFIPLERQGLQNLL 105 + K+ V+V + D +G P F + +LV L +S + +ER + +L Sbjct: 48 SEKVRVAVLDF-DYSGLSNPQWL-TFLNGGASGVSDILVNRLVESGRYTVIERSRIDAVL 105 Query: 106 NERKIIRAAQENGTVAINNRIPLQSLTAANIMVEGSIIGYESNVKSGGVGARYFGIGADT 165 E+ + +G V + + ++++ GSI ++ K G + Sbjct: 106 REQNL----GASGRVDAATAAQIGQILGVDVVIIGSITQFDLQKKQSG--GSFIIFSTAK 159 Query: 166 QYQLDQIAVNLRVVNVSTGEILSSVNTSKTILSYEVQAGVFRFIDYQRLLEGEVGY-TSN 224 + +N+R +N +T EI+++ T + +L G TSN Sbjct: 160 TETDAFVKLNVRAINTTTAEIITTAQGDGTANQSD---------GSTVVLGVGGGSQTSN 210 Query: 225 EPVMLCLMSAIETGVIFLINDGIDRGLWDLQN 256 E +L + A + V ++ L D + Sbjct: 211 EGKLLSI--ATDKAVARVV-----DNLNDKAD 235 >UniRef50_B8CKS1 Curli production assembly/transport component CsgG, putative n=1 Tax=Shewanella piezotolerans WP3 RepID=B8CKS1_SHEPW Length = 310 Score = 65.5 bits (158), Expect = 2e-09, Method: Composition-based stats. Identities = 41/190 (21%), Positives = 75/190 (39%), Gaps = 23/190 (12%) Query: 1 MQRLFLLVAVMLLSGCLTAPPKEAARPTLMPRAQSYKDLTHLPAPTGKIFVSVYNIQDET 60 M++ + L CL +P + + L KI ++ +ET Sbjct: 1 MKKWNIFTLTCLAIFCLQSPVSASGLDATKASTTVASN-DSLTFLKRKIAIA--RFSNET 57 Query: 61 GQFKPYPASNFSTAVPQSATAMLVTALKDSRWFIPLERQGLQNLLNERKIIRAAQENGTV 120 + + + + + A +L L D+ FI ER ++ LN I++ ++G Sbjct: 58 QAANSFLVDSSNNRIGKQAADILSARLADTNKFIMFERLDTED-LNSENILKGISDSGV- 115 Query: 121 AINNRIPLQSLTAANIMVEGSIIGYESNVKSGGVGARYFGIGADTQYQLDQIAVNLRVVN 180 A + ++ GS+ + G GI + ++ Q VN+R+V+ Sbjct: 116 ------------AVDYLIVGSVSEF------GRSAESTTGIFSQSKIQKAYTKVNVRLVD 157 Query: 181 VSTGEILSSV 190 VSTG I+SSV Sbjct: 158 VSTGRIISSV 167 >UniRef50_Q4KAD5 CsgG family protein n=20 Tax=Proteobacteria RepID=Q4KAD5_PSEF5 Length = 242 Score = 64.8 bits (156), Expect = 3e-09, Method: Composition-based stats. 
Identities = 47/259 (18%), Positives = 95/259 (36%), Gaps = 51/259 (19%) Query: 1 MQRLFLLVAVMLLSGCLTAPPKEAARPTLMPRAQSYKDLTHLPAPTGKIFVSVYNIQDET 60 M L + +SGC T E++R + + QS + ++ ++V + + Sbjct: 24 MLSAVALAVLAGMSGCAT----ESSRALPVEKVQSASQVWTGA----RVPMAVGKFDNRS 75 Query: 61 GQFKPYPASNFSTAVPQSATAMLVTALKDSRWFIPLERQGLQNLLNERKIIRAAQENGTV 120 + + Q A +L+T L+ + F L+R + + E I AQ+ Sbjct: 76 SYMRGIFSDGVDRLGGQ-AKTILITHLQQTNRFSVLDRDNMGEIQQEAAIKGQAQK---- 130 Query: 121 AINNRIPLQSLTAANIMVEGSIIGYESNVKSGGVGARYFGIGADTQYQLDQIAVNLRVVN 180 L A+ +V G + + + + FGI + Q+ V L +VN Sbjct: 131 ----------LKGADFVVTGDVTEFG---RKETGDHQLFGILGRGKTQVAYAKVALNIVN 177 Query: 181 VSTGEILSSVN-------TSKTILSYEVQAGVFRFIDYQRLLEGEVGYTSNEPVMLCLMS 233 +ST E++ S +++ ++ + + ++L+ L + Sbjct: 178 ISTSEVVYSTQGAGEYALSNREVIGFG-GTAAYDSTLNGKVLD------------LAMRE 224 Query: 234 AIETGVIFLINDGIDRGLW 252 A+ V + ID G W Sbjct: 225 AVNRMV-----EAIDAGAW 238 >UniRef50_B8JAW6 Curli production assembly/transport component CsgG n=5 Tax=Proteobacteria RepID=B8JAW6_ANAD2 Length = 321 Score = 63.6 bits (153), Expect = 6e-09, Method: Composition-based stats. Identities = 45/187 (24%), Positives = 70/187 (37%), Gaps = 26/187 (13%) Query: 3 RLFLLVAVMLLSGCLT--APPKEAARPT---LMPRAQSYKDLTHLPAPTGKIFVSVYNIQ 57 R L VA + L C T PP E P AQ A K +++ Sbjct: 4 RFALPVAALALQACATVSQPPVEVESPVPKAAQVAAQQQAQAPAPSAKRYKTRIAIARFT 63 Query: 58 DETGQFKPYPASNFSTAVPQSATAMLVTALKDSRWFIPLERQGLQNLLNERKIIRAAQEN 117 +ET + + + A+ ML + L S F+ LER LQ L E+ + A Sbjct: 64 NETSYGRSLLNDADLDRIGKQASDMLASRLVMSGNFVVLERPDLQKLEREQALRGVAG-- 121 Query: 118 GTVAINNRIPLQSLTAANIMVEGSIIGYESNVKSGGVGARYFGIGADTQYQLDQIAVNLR 177 L A+ ++ GS+ + G G + T+ Q + V++R Sbjct: 122 -------------LVGADTVISGSVTEF------GRSVGGKKGFLSSTKVQTARAKVDIR 162 Query: 178 VVNVSTG 184 +V+V TG Sbjct: 163 LVDVKTG 169 >UniRef50_C3WB28 Curli production assembly/transport component CsgG n=1 Tax=Fusobacterium mortiferum ATCC 9817 RepID=C3WB28_FUSMR Length = 305 Score = 63.6 bits (153), Expect = 6e-09, Method: Composition-based stats. 
Identities = 36/244 (14%), Positives = 86/244 (35%), Gaps = 32/244 (13%) Query: 1 MQR-LFLLVAVMLLSGCLTAPPKEAARPTLMPRA-QSYKDLTHLPAPTGKIFVSVYNIQD 58 M++ + + +A + L C + + + Y AP ++ + +++ Sbjct: 4 MKKYMGIFLAALFLVSCSNKEIRSTVKKEDNISTLRDYNTYKENLAPKRRVVI--GKVKN 61 Query: 59 ETGQFKPYPASNFSTAVPQSATAMLVTALKDSRWFIPLERQGLQNLLNERKIIRAAQENG 118 T +F + + +L++ +S F LER+ L +++ E + E Sbjct: 62 YT-RFGTQRTDSITK-------DILISEFANSGRFNVLEREDLDSVMEELAFSNSLGEKS 113 Query: 119 TVAINNRIPLQSLTAANIMVEGSIIGYESNVKSGGVGARYFGIGADTQYQLDQIAVNLRV 178 +A + +V GS+ YE N + + ++ Q + + L+V Sbjct: 114 ILAKQK------FLDTDFIVVGSVTKYELNTTGSKS------LFSKSKEQRAEAVIELKV 161 Query: 179 VNVSTGEILSSVNTSKTILSYEVQAGVFRFIDYQRLLEGEVGYTSNEPVMLCLMSAIETG 238 ++V G++ + + + G + Y L E ++ +E Sbjct: 162 IDVLNGKVWTETGEGSASVKFGTVLGAGTYGSYGSL--------EQEAFRAAVIQGVEKI 213 Query: 239 VIFL 242 V + Sbjct: 214 VKKI 217 >UniRef50_Q7DDH4 Putative lipoprotein NMB1126/NMB1164 n=130 Tax=Proteobacteria RepID=Y1126_NEIMB Length = 223 Score = 63.6 bits (153), Expect = 7e-09, Method: Composition-based stats. 
Identities = 45/247 (18%), Positives = 82/247 (33%), Gaps = 51/247 (20%) Query: 14 SGCLTAPPKEAARPTLMPRAQSYKDLTHLPAPTGKIFVSVYNIQDETGQFKPYPASNFST 73 +GC T + + Y + + +SV + + K + Sbjct: 18 TGCATESSRSLEVEKVASYNTQYHGV--------RTPISVGTFDNRSSFQKGIFSDGEDR 69 Query: 74 AVPQSATAMLVTALKDSRWFIPLERQGLQNLLNERKIIRAAQENGTVAINNRIPLQSLTA 133 Q A +LVT L+ + F L R L L E I A L Sbjct: 70 LGSQ-AKTILVTHLQQTNRFNVLNRTNLNALKQESGISGKAHN--------------LKG 114 Query: 134 ANIMVEGSIIGYESNVKSGGVGARYFGIGADTQYQLDQIAVNLRVVNVSTGEILSSVN-- 191 A+ +V G + + + + FGI + Q+ V L +VNV+T EI+ S Sbjct: 115 ADYVVTGDVTEFG---RRDVGDHQLFGILGRGKSQIAYAKVALNIVNVNTSEIVYSAQGA 171 Query: 192 -----TSKTILSYEVQAGVFRFIDYQRLLEGEVGYTSNEPVMLCLMSAIETGVIFLINDG 246 +++ I+ + + ++L+ L + A+ + + Sbjct: 172 GEYALSNREIIGFG-GTSGYDATLNGKVLD------------LAIREAVNS-----LVQA 213 Query: 247 IDRGLWD 253 +D G W Sbjct: 214 VDNGAWQ 220 >UniRef50_D1Y6S8 Putative curli production assembly/transport component n=1 Tax=Pyramidobacter piscolens W5455 RepID=D1Y6S8_9BACT Length = 309 Score = 62.8 bits (151), Expect = 1e-08, Method: Composition-based stats. Identities = 32/188 (17%), Positives = 70/188 (37%), Gaps = 22/188 (11%) Query: 51 VSVYNIQDETGQFKPYPASNFSTAVPQSATAMLVTALKDSRWFIPLERQGLQNLLNERKI 110 +SV ++ +G+ P + M++T L +S F +ER L + E+++ Sbjct: 27 ISVETFRNSSGRHVPVDSI----------MDMMITELVNSGTFQVVERDRLDVIAREQRM 76 Query: 111 IRAAQENGTVAINNRIPLQSLTAANIMVEGSIIGYESNVKSGGVGARYFGIGADTQYQLD 170 ++G + N L A M+ G++ Y ++ +GG + Sbjct: 77 ----GQSGLIDSNTASRTGRLAGAQYMMTGAVTKYSASDTAGGGIIGGGSSLIGGLINTN 132 Query: 171 --QIAVNLRVVNVSTGEILSSVNTSKTILSYEVQAGVF-RFIDYQRLLEGEVGYTSNEPV 227 + +++R+V+ +TG I+ + V G+ R+ + G G Sbjct: 133 TAYVTLDVRIVDTTTGAIVYAGRAEGA--GTNVMGGLLSRYAGFGT---GRSGGQLATAT 187 Query: 228 MLCLMSAI 235 + + Sbjct: 188 HKAITKVV 195 >UniRef50_Q1IPN7 Curli production assembly/transport component CsgG n=2 Tax=Acidobacteria RepID=Q1IPN7_ACIBL Length = 327 Score = 62.5 bits (150), Expect = 2e-08, Method: Composition-based stats. 
Identities = 33/172 (19%), Positives = 66/172 (38%), Gaps = 12/172 (6%) Query: 39 LTHLPAPTGKIFVSVYNIQDETGQFKPYPASNFSTAVPQSATAMLVTALKDSRWFIPLER 98 ++ P T K V++ + T V + + +L+ L + + +ER Sbjct: 17 VSAFPQATRKKRVAIMSFDYGTVHSSVAAIFGSDQDVGKGISDLLIQKLVNDGDYSVIER 76 Query: 99 QGLQNLLNERKIIRAAQENGTVAINNRIPLQSLTAANIMVEGSIIGY-----ESNVKSGG 153 L ++ E+ + + + N+ + L + ++ GSI + +NV GG Sbjct: 77 AQLDKIMAEQNFSNSDRADP----NSAAKIGRLLGVDAIITGSITQFGRDDQHTNVGGGG 132 Query: 154 VGARYFGI---GADTQYQLDQIAVNLRVVNVSTGEILSSVNTSKTILSYEVQ 202 G G T + + R+V+V+T EIL++ + T V Sbjct: 133 YGGITGRYGIGGVGTHSAKAVVGITARLVDVNTAEILAACTGTGTSKRSGVS 184 >UniRef50_C1XFI9 Uncharacterized protein involved in formation of curli polymers n=2 Tax=Thermaceae RepID=C1XFI9_MEIRU Length = 167 Score = 61.3 bits (147), Expect = 4e-08, Method: Composition-based stats. Identities = 43/164 (26%), Positives = 67/164 (40%), Gaps = 17/164 (10%) Query: 82 MLVTALKDSRWFIPLERQGLQNLLNERKIIRAAQENGTVAINNRIPLQSLTAANIMVEGS 141 ML TAL S F+ +R + L +E + + Q+N S T A++++ G Sbjct: 1 MLNTALVSSNHFVVYDRSIITQLRSEAAL--SNQQN------------SFTGADLIITGV 46 Query: 142 IIGYESNVK-SGGVGARYFGIGADTQYQLDQIAVNLRVVNVSTGEILSSVNTSKTILSYE 200 I G+E N S G+GA F IG Q + I V++R+V+ TG I+++ Sbjct: 47 ITGFEPNASGSSGLGAIPF-IGGLVQQKKSYIRVDMRIVDTRTGAIIAAFPVEAEATDTN 105 Query: 201 VQAGVFRFIDYQRLLEGEVGYT-SNEPVMLCLMSAIETGVIFLI 243 + T SN P+ L IE +I Sbjct: 106 FAGVGAGLLPGGLGGLVGGLRTYSNTPMAKALALMIEAATQAII 149 >UniRef50_Q2RW21 Putative uncharacterized protein n=1 Tax=Rhodospirillum rubrum ATCC 11170 RepID=Q2RW21_RHORT Length = 517 Score = 60.9 bits (146), Expect = 5e-08, Method: Composition-based stats. 
Identities = 42/292 (14%), Positives = 98/292 (33%), Gaps = 42/292 (14%) Query: 6 LLVAVMLLSGCLT----------APPKEAARPTLMPRAQSYKDLTHLPAPTGK--IFVSV 53 +L ++L+GC T A P + + + + L GK I ++ Sbjct: 14 VLAGALVLAGCQTMSNPQTASVVAQPDTPVVKNMTSFTNALRCMDDLFLAYGKRDIIITS 73 Query: 54 YNIQDETGQFKPYPASNFSTAVPQSATAMLVTALKDSRWFIPLERQGLQNLLNERKIIRA 113 + D+TG+ + +A+ + +TA ++ F+ +ER G + + Sbjct: 74 DGLPDQTGEVRAGTKEMMISALSK------MTAKSNAFRFVDVERSGDAVFYFNQILTNH 127 Query: 114 AQENGTVAINNRIPLQSLTAANIMVEGSIIGYESNVKS--GGVGARY--FGIGADTQYQL 169 + + + G+ + + G+G + +G + + Sbjct: 128 DTQRKSFPS-------------YYIRGAFTQVDRGILQDNQGIGVAFDFVSLGYEQDQLV 174 Query: 170 DQIAVNLRVVNVSTGEILSSVNTSKTILSY--EVQAGVFRFIDYQRLLEGEVGYTSNEPV 227 I+++L + + EIL ++++ TI + A V + + + Sbjct: 175 SLISMDLNMGKTTDLEILPGISSTNTIATVKSGRGAEVEGLVPKANIY-LNFSNDRAQGT 233 Query: 228 MLCLMSAIETGVIFLINDGIDRGLWDLQNKAERQNDILVK----YRHMSVPP 275 + +E G+I L+ W +++ + Y MS Sbjct: 234 HAAARTLVELGLIELLGKFTRVPYWRCLEIESTNPEMMAQIRDWYDQMSPAD 285 >UniRef50_C1CWI5 Putative Curli production assembly/transport component CsgG, n=1 Tax=Deinococcus deserti VCD115 RepID=C1CWI5_DEIDV Length = 211 Score = 60.9 bits (146), Expect = 5e-08, Method: Composition-based stats. Identities = 29/171 (16%), Positives = 62/171 (36%), Gaps = 25/171 (14%) Query: 72 STAVPQSATAMLVTALKDSRWFIPLERQGLQNLLNERKIIRAAQENGTVAINNRIPLQSL 131 ++ + L+ AL ++ F ER+ + L+ I + L + Sbjct: 56 TSELGTGLADALMNALSETGKFAVYERENVPQLVQNNMI---------AGTDPTAALSPV 106 Query: 132 TAANIMVEGSIIGYESNVKSGGVGARYFGIGADTQYQLDQIAVNLRVVNVSTGEILSSVN 191 +++V G+I YE SG +G + +I +LR+V+ TG +L++ Sbjct: 107 ---DVLVFGNINVYEPESSSGQGCFMGVCLGG----KESRIGADLRIVDSKTGRVLATTK 159 Query: 192 TSKTILSYEVQAGVFRFIDYQRLLEGEVGYTSNEPVMLCLMSAIETGVIFL 242 S ++ + L +G + + + + + V L Sbjct: 160 VEGK--SSTTGGSIY----FNGL---SLGGKQSSGLDKAVGAMLTQAVQVL 201 >UniRef50_C7RID3 Peptidoglycan-binding domain 1 protein n=1 Tax=Candidatus Accumulibacter phosphatis clade IIA str. 
UW-1 RepID=C7RID3_9PROT Length = 519 Score = 60.2 bits (144), Expect = 8e-08, Method: Composition-based stats. Identities = 50/290 (17%), Positives = 99/290 (34%), Gaps = 42/290 (14%) Query: 1 MQRLFLLVAVMLLSGC------------LTAPPKEAARPTLMPRAQSYKDLTHLPAPTGK 48 M++ L + + L A PK A T+ QS + + L GK Sbjct: 1 MKKFGLSLIALSLLPIPGWGNDIGAEVQSAAAPKTPAIKTITNFTQSLRCMDELLYAYGK 60 Query: 49 IFVSVY--NIQDETGQFKPYPASNFSTAVPQSATAMLVTALKDSRWFIPLERQGLQNLLN 106 +++ I DETG+ K ML+TA+ + + ++ Sbjct: 61 QGIAITSTGIPDETGKVK------------TGTKEMLITAVSK-----MTVKSNAFDFID 103 Query: 107 ERKIIRAAQENGTVAINNRIPLQSLTAANIMVEGSIIGYESNV--KSGGVGAR--YFGIG 162 + + + + GSI + N K+ GVG + +G Sbjct: 104 FHSGADDLGALFAARGDQNRLMP-----DYYIRGSITQMDDNSVRKNKGVGFSLPFLDLG 158 Query: 163 ADTQYQLDQIAVNLRVVNVSTGEILSSVNTSKT-ILSYEVQAGVFRFIDYQRLLEGEVGY 221 D I+++L + + +T +I+ +TS T +L +G + L + Sbjct: 159 VSKDDAYDLISMDLSIGDAATRKIIPITSTSNTLVLMKGGISGEGGGKIGKVGLSFNIDV 218 Query: 222 TSNEPVMLCLMSAIETGVIFLINDGIDRGLWDLQNKAERQNDILVKYRHM 271 + +E V + +E G+I + W + + N ++ + Sbjct: 219 SRSEGVGAATRTLVELGLIETLGKFTQVPYWKCLD-TDLTNPLIREQARE 267 >UniRef50_C6W063 Uncharacterized protein involved in formation of curli polymers-like protein n=1 Tax=Dyadobacter fermentans DSM 18053 RepID=C6W063_DYAFD Length = 574 Score = 60.2 bits (144), Expect = 9e-08, Method: Composition-based stats. 
Identities = 39/209 (18%), Positives = 73/209 (34%), Gaps = 27/209 (12%) Query: 38 DLTHLPAPTGKIFVSVYNIQDETGQFKPYPASNFSTAVPQSATAMLVTALKDSRWFIPLE 97 +LP ++ + V T S F + ML +AL+ + F +E Sbjct: 50 KCQNLPRA-QRVIIKVARFSVST--KAAQARSTFGDEL----ATMLTSALQQTNCFRVME 102 Query: 98 -RQGLQNLLNERKIIRAAQENGTVAINNRIPLQSLTAANIMVEGSIIGYESNVKSGGVGA 156 + L + +E + NG+ + A ++V G I + S + Sbjct: 103 TNKNLSDATSEMAFAQDGFTNGSGPQ-----AGQMLGAQLIVTGEITDFSEGSSSKSI-- 155 Query: 157 RYFGIGADTQYQLDQIAVNLRVVNVSTGEILSSVNTSKTILSYEVQAGVFRFIDYQRLLE 216 +G +++ + NL+V+N TGE+L S + + + + V + Sbjct: 156 ----LGVESKSNQATVGFNLKVLNPQTGELLFSKDVN--MKGHN-SGKVLDIFGVKT--- 205 Query: 217 GEVGYTSNEPVMLCLMSAIETGVIFLIND 245 N V L AI V L ++ Sbjct: 206 --SSSNENRAVQDALQKAIIKAVEILADE 232 >UniRef50_A3WBM1 Putative uncharacterized protein n=1 Tax=Erythrobacter sp. NAP1 RepID=A3WBM1_9SPHN Length = 328 Score = 59.8 bits (143), Expect = 1e-07, Method: Composition-based stats. Identities = 35/164 (21%), Positives = 67/164 (40%), Gaps = 21/164 (12%) Query: 47 GKIFVSVYNIQDETGQFKPYPASNFSTAVPQSATAMLVTALKDSRWFIPLERQGLQNLLN 106 + V + ++D TG N V AM+ TA+ S F +ER L L+ Sbjct: 51 NRPIVGIAQMEDLTG------GGNADNFV-----AMIETAIIGSGKFRIIERARLATLME 99 Query: 107 ERKIIRAAQENGTVAINNRIPLQSLTAANIMVEGSIIGYESNVKSGGVGARYFGIGADTQ 166 E+ + +GT N + + +V G+I + +S G+ G+ ++ Sbjct: 100 EQGL----ALSGTTTTNRPGQVGGFEGVDYLVYGTISSISATNRSDIGGSMLRGLLGGSR 155 Query: 167 YQLD------QIAVNLRVVNVSTGEILSSVNTSKTILSYEVQAG 204 + D ++ ++R+ + +TGE+ + S+ S V G Sbjct: 156 NRPDCYKTRVRMEADIRITDTNTGEVRYATRISEEQDSATVCGG 199 >UniRef50_C2MBZ3 Curli production assembly/transport component CsgG n=1 Tax=Porphyromonas uenonis 60-3 RepID=C2MBZ3_9PORP Length = 300 Score = 59.4 bits (142), Expect = 1e-07, Method: Composition-based stats. 
Identities = 41/227 (18%), Positives = 78/227 (34%), Gaps = 34/227 (14%) Query: 41 HLPAPTGKIFVSVYNIQDETGQFKPYPASNFSTAVPQSATAMLVTALKDSRWFIPLERQG 100 + K V++ +ET K + + + A +L L S FI LER Sbjct: 27 QETGKSLKWKVAIGRFSNETQYGKGIFYDRENDPIAKQAQDILAAKLVASGKFILLERSD 86 Query: 101 LQNLLNERKIIRAAQENGTVAINNRIPLQSLTAANIMVEGSIIGYESNVKSGGVGARYFG 160 + + E + G++ A+ ++ GS+ + G Sbjct: 87 AEAVAQE---VTDGTSEGSIK------------ADYVILGSVTEF------GRKTTGQTS 125 Query: 161 IGADTQYQLDQIAVNLRVVNVSTGEILSSVNTSKTILSYEVQAGVFRF-IDYQRLLEGEV 219 + + Q + AVNLR+V+V+TG S+ Y + + + Sbjct: 126 LFTAEKTQQVEAAVNLRLVDVATG----IATYSEEAKGYANNVSKSTLGLGGTSGYDASL 181 Query: 220 GYTSNEPVMLCLMSAIETGVIFLINDGIDRGLWDLQNKAERQNDILV 266 G + +AI+ V +IN D+ W + Q+ ++ Sbjct: 182 G-------DKAISAAIDQLVENIINKCSDKP-WRTYLISMDQDGTII 220 >UniRef50_A1VWR2 Curli production assembly/transport component CsgG n=2 Tax=Proteobacteria RepID=A1VWR2_POLNA Length = 337 Score = 59.0 bits (141), Expect = 2e-07, Method: Composition-based stats. Identities = 43/261 (16%), Positives = 90/261 (34%), Gaps = 40/261 (15%) Query: 1 MQRLFLLVAVMLLSGCLTAPPKEAARPT-----LMPRAQSYKDLTHLPAPTGKIFVSVYN 55 + + L++ L+ C P A + AQ PT K +++ Sbjct: 16 VTKPLLIILAAGLTACAVQAPPVAQKDAPQSLATQKAAQQAVASQAPATPTLKRKIALGR 75 Query: 56 IQDETGQFKPYPASNFSTAVPQSATAMLVTALKDSRWFIPLERQGLQNLLNERKIIRAAQ 115 I +ET + + + T ++ AL +S ++ ER + + E ++ Sbjct: 76 ITNETSYGQSLLRDRHDDPLGKQVTDLMSKALTESGAYLVFERPDIGRIQAEGRLTDTKL 135 Query: 116 ENGTVAINNRIPLQSLTAANIMVEGSIIGYESNVKSGGVGARYFGIGADTQYQLDQIAVN 175 + + ++ GS+ + G G + ++ Q+ V+ Sbjct: 136 N--------------IVGVDALIIGSLTEF------GRKAIGATGFVSSSKRQVAFAKVD 175 Query: 176 LRVVNVSTGEILSSVNTSKTILSYEVQAGVFRFIDYQRLLEGEVGY--TSNEPVMLCLMS 233 +RVV+V+TG + + + + + A F F + GY T N+ + Sbjct: 176 IRVVDVNTGHVFFATSGAGEASTE--TASTFGF-------GSQAGYDGTLND---AAIRQ 223 Query: 234 AIETGVIFLINDGIDRGLWDL 254 A+ + L + R W Sbjct: 224 AVAEAINRLSVEMSGRP-WQT 243 >UniRef50_C9KNZ5 Putative uncharacterized protein n=1 Tax=Mitsuokella multacida DSM 20544 RepID=C9KNZ5_9FIRM Length = 234 Score = 
58.6 bits (140), Expect = 2e-07, Method: Composition-based stats. Identities = 44/194 (22%), Positives = 73/194 (37%), Gaps = 22/194 (11%) Query: 68 ASNFSTAVPQSATAMLVTALKDSRWFIPLERQGLQNLLNERKIIRAAQENGTVAINNRIP 127 + + +A + T L+DS F L+R + L +E A +G V + Sbjct: 62 SEGWDREEMGAAVEYVYTDLQDSGRFKLLDRTRQRALTDE----YAHDMSGLVDEDTAPV 117 Query: 128 LQSLTAANIMVEGSIIGYESNVKSGGVGARYFGIGADTQYQLDQIAVNLRVVNVSTGEIL 187 + A ++ GSIIG + V +GA T+ V+LR+V+ TGE++ Sbjct: 118 IGDQYGAQYLLMGSIIGVTTRRSETTV------VGAGTKRAQVTATVSLRLVDTETGEVV 171 Query: 188 SSVNTSKTILSYEVQAGVFRFIDYQRLLEGEVGYTSNEPVMLCLMSAIETGVIFLINDGI 247 + + V+A + L+ E V L AI V DG Sbjct: 172 LAATGRSRKNNTLVKAPL-------GLIRIGTEQVDKEQVNEALEDAIHDAV-----DGP 219 Query: 248 DRGLWDLQNKAERQ 261 L + KA+ + Sbjct: 220 RGLLARMDGKAKSK 233 >UniRef50_A3JNY2 Putative uncharacterized protein n=1 Tax=Rhodobacterales bacterium HTCC2150 RepID=A3JNY2_9RHOB Length = 389 Score = 57.8 bits (138), Expect = 4e-07, Method: Composition-based stats. Identities = 48/261 (18%), Positives = 92/261 (35%), Gaps = 39/261 (14%) Query: 10 VMLLSGCLTAPPKEAARPTLMPRAQSYKDLTHLP------------APTGKIFVSVYNIQ 57 ++ LSGC T P A PR ++ T A + +S I Sbjct: 1 MIALSGCATINP-SLAPKVAHPRTPPARNFTSFNDTLRCMDNMLARAGRKTVLISSSGIP 59 Query: 58 DETGQFKPYPASNFSTAVPQSATAMLVTALKDSRWFI--PLERQGLQNLLNERKIIRAAQ 115 D T + + AV Q + S FI PLER+ Q I+ + Sbjct: 60 DLTSKIRVGADDMLVNAVNQ------MNVNSKSYVFIDQPLERRDAQ-------IVWLTK 106 Query: 116 ENGTVAINNRIPLQSLTAANIMVEGSII--GYESNVKSGGVGARYFGIGADT-QYQLDQI 172 G + + + ++ ++ + + + R G++ +L + Sbjct: 107 REGDLTPQFY-----IRGSISQLDEGVVKDSFSFGINNDLAPNRDVESGSNRFSRRLSVV 161 Query: 173 AVNLRVVNVSTGEILSSVNTSKTILSYEVQAGVFRFIDYQRLLEGEVGYTSNEPVMLCLM 232 V+L +V +I+ + + +++ V AG+ ID + +G E + + Sbjct: 162 TVDLHLVTYPDRKIIPGGSVANSMV--IVGAGLTGIIDLSEI-GVTIGMERIESIGQAVR 218 Query: 233 SAIETGVIFLINDGIDRGLWD 253 + +E GVI L+ W+ Sbjct: 219 NLVELGVIELLGKHSRLPYWN 239 >UniRef50_C6JM48 Putative uncharacterized protein n=2 Tax=Fusobacterium RepID=C6JM48_FUSVA Length = 305 Score = 55.1 bits (131), Expect = 
3e-06, Method: Composition-based stats. Identities = 40/241 (16%), Positives = 85/241 (35%), Gaps = 33/241 (13%) Query: 4 LFLLVAVMLLSGC-LTAPPKEAARPTLMPRAQSYKDLTHLPAPTGKIFVSVYNIQDETGQ 62 + ++ +LL C T + + + Y +L P ++ + G+ Sbjct: 7 IGMIFISLLLLSCGKTGVESNIKKDDKIVSLREYNNLKETALPKRRVVI---------GK 57 Query: 63 FKPYPASNFSTAVPQSAT-AMLVTALKDSRWFIPLERQGLQNLLNERKIIRAAQENGTVA 121 K Y S F T T +L + +S F LER L +++ E E ++ Sbjct: 58 VKNY--SRFGTQRTDITTKDILASEFSNSGRFNVLERSDLDSVIEELAFSNTLGEKSLLS 115 Query: 122 INNRIPLQSLTAANIMVEGSIIGYESNVKSGGVGARYFGIGADTQYQLDQIAVNLRVVNV 181 + ++ GS+ Y N + + ++ Q ++ + L+V++V Sbjct: 116 KQK------FLDTDFVIIGSVTKYALNTTGNKS------LFSKSKEQKAEVVIELKVIDV 163 Query: 182 STGEILSSVNTSKTILSYEVQAGVFRFIDYQRLLEGEVGYTSNEPVMLCLMSAIETGVIF 241 + G++ + +++ G + Y L E E ++ +E V Sbjct: 164 TNGKVWIETGEGSSSVTFGTVLGAGTYGSYTSLEE--------EAFRAAVIQGVEKIVKK 215 Query: 242 L 242 L Sbjct: 216 L 216 >UniRef50_Q93HR4 Putative uncharacterized protein (Fragment) n=1 Tax=Thermus thermophilus RepID=Q93HR4_THETH Length = 149 Score = 54.4 bits (129), Expect = 4e-06, Method: Composition-based stats. Identities = 37/160 (23%), Positives = 52/160 (32%), Gaps = 34/160 (21%) Query: 1 MQRLFLLVAVMLLSGCLTAPPKEAARPTLMPRAQSYKDLTHLPAPTGKIFVSVYNIQDET 60 M+R +LL+ V LS C L P P K+ V+ + + E Sbjct: 1 MKRAWLLLGVFALSACAP-QVTTKVDTGLSPDNPY----ATYTGPRAKVVVASFPCKAEK 55 Query: 61 ---------------GQFKPYPASNFSTAVPQSATAMLVTALKDSRWFIPLERQGLQNLL 105 G+ S + ML TAL +S FI ER L L Sbjct: 56 CGPGVDSSQVGAAVLGKLFGIEVSTSKGDIGAGIADMLTTALINSNHFIVYERSVLDQLQ 115 Query: 106 NERKIIRAAQENGTVAINNRIPLQSLTAANIMVEGSIIGY 145 E +I AQ+ L A I++ G+I Sbjct: 116 KESQIGNQAQQ--------------LQGAEILITGTITAL 141 >UniRef50_A5PA28 Curli production assembly/transport component CsgG n=1 Tax=Erythrobacter sp. SD-21 RepID=A5PA28_9SPHN Length = 310 Score = 53.2 bits (126), Expect = 9e-06, Method: Composition-based stats. 
Identities = 38/197 (19%), Positives = 64/197 (32%), Gaps = 30/197 (15%) Query: 48 KIFVSVYNIQDETGQFKPYPASNFSTAVPQSATAMLVTALKDSRWFIPLERQGLQNLLNE 107 K V++ +ET + + + + A ++ L DS FI +ER +L E Sbjct: 43 KRRVAIGRFTNETRYGQTLLRDSDLDPLGKQAADIMAAYLIDSNAFIVVERTDANEVLKE 102 Query: 108 RKIIRAAQENGTVAINNRIPLQSLTAANIMVEGSIIGYESNVKSGGVGARYFGIGADTQY 167 + + L A+ ++ GSI+ + G + + Sbjct: 103 QGVGGET--------------SGLIGADTIIVGSIVEF------GRADEGERAVFKRERT 142 Query: 168 QLDQIAVNLRVVNVSTGEILSSVNTSKTILSYEVQAGVFRFIDYQRLLEGEVGYTSNEPV 227 Q V +R+V+V TG S S + +L G Sbjct: 143 QKAYAKVAIRLVDVRTGVAFHSATGSGEATT----------TTKTKLFSGTTARYDGTLT 192 Query: 228 MLCLMSAIETGVIFLIN 244 L AIE + LIN Sbjct: 193 DKALSVAIEDVLEDLIN 209 >UniRef50_B7A9M7 Putative uncharacterized protein n=1 Tax=Thermus aquaticus Y51MC23 RepID=B7A9M7_THEAQ Length = 225 Score = 53.2 bits (126), Expect = 1e-05, Method: Composition-based stats. Identities = 36/192 (18%), Positives = 68/192 (35%), Gaps = 23/192 (11%) Query: 4 LFLLVAVMLLSGCLTAPPKEAARPTLMPRAQSYKDLTHLPAPTGKIFVSVYNIQDETGQF 63 +L+ LL GC+ P+ A T Q+ + P T V+V + + Sbjct: 13 FLMLLFGALLVGCV---PQAATPTTPGSLPQTVQIRYDGPRET----VAVIDTTNI---- 61 Query: 64 KPYPASNFSTAVPQSATAMLVTALKDSRWFIPLERQGLQNLLNERKIIRAAQENGTVAIN 123 S T L+ F +ER L +L E+++ A Sbjct: 62 -RLSGSPLKERFLAILTEELIVHPYFKDRFSLVERVKLDQVLKEQRLSAAGLSPTDAPR- 119 Query: 124 NRIPLQSLTAANIMVEGSIIGYESNVKSGGVGARYFGIGADTQYQLDQIAVNLRVVNVST 183 + L A ++ S++ +++ +S G +G + V L +V+ Sbjct: 120 ----IGKLLGARYLLLASVVNAKTSRRSTGA------LGIRIDEVQGTVEVALSLVDSEN 169 Query: 184 GEILSSVNTSKT 195 G +L+ V S++ Sbjct: 170 GRVLARVLVSQS 181 >UniRef50_B9CYA0 Peptidoglycan-binding domain 1 protein n=2 Tax=Campylobacter RepID=B9CYA0_WOLRE Length = 272 Score = 52.1 bits (123), Expect = 2e-05, Method: Composition-based stats. 
Identities = 46/253 (18%), Positives = 89/253 (35%), Gaps = 27/253 (10%) Query: 3 RLFLLVAVMLLSGCLTAPPKEAARPTLMPRAQSYKDLTHLPAPTGK----IFVSVYNIQD 58 ++ L LL+GC+++ + + P +V +D Sbjct: 10 KIAALALPFLLTGCMSSMSMGSPGAKTTATGAAAGSNAQNTNPGLTRCTETMGTVTIYED 69 Query: 59 ETGQFKPYPASNFSTAVPQSATAMLVTALKDSRWFIPLERQGLQNLLNERKIIRAAQENG 118 Y + + + + + A + S F+ +ER N + E RA E+G Sbjct: 70 R--NSNWYSVATRQYKLTSTIPVLRLLA-QQSNCFVVVERSKAFNQMLEE---RALMESG 123 Query: 119 TVAINNRIPLQSLTAANIMVEGSIIGYESNVK--SGGVGARYFGIGADTQ--YQLDQIAV 174 + N+ + AA+ + +I ESN SG VGA + + + ++ Sbjct: 124 ELRENSNFKKGQMVAADYTLTPTITFSESNTSGLSGVVGALFGSVAGSVAGGFSTSDVST 183 Query: 175 NLRVVNVSTGEILSSVNTSKTILSYEVQAGVFRFIDYQRLLEGEVGYT----SNEPVMLC 230 L ++ +G L++ E A F L G+ G + +N P Sbjct: 184 VLTLIENRSGVQLAAA---------EGSARNTDFAGLGSLFGGKAGGSLRAYANTPEGKI 234 Query: 231 LMSAIETGVIFLI 243 +++A + LI Sbjct: 235 IIAAFTDSMNNLI 247 >UniRef50_B8GVR1 Putative uncharacterized protein n=2 Tax=Caulobacter vibrioides RepID=B8GVR1_CAUCN Length = 275 Score = 51.7 bits (122), Expect = 3e-05, Method: Composition-based stats. 
Identities = 46/259 (17%), Positives = 85/259 (32%), Gaps = 40/259 (15%) Query: 1 MQRLFLLVAVMLLSGCLTAPPKEAARPTLMPRAQSYKDLTHLPAPTGKIFVSVYNIQDET 60 M+ + V + L++G A K AAR L+H AP + V+ + + Sbjct: 19 MRPAIIAVGLSLIAGSAIAETKVAARDL-------SSLLSHCEAPVAALTVTAFKCKASA 71 Query: 61 GQFKPYPASNFSTA-----------------VPQSATAMLVTALKDSRWFIPLERQGLQN 103 P P SN + + L TALK + F + R+ ++ Sbjct: 72 CSVAPAPGSNTGLGALMSMAQAAQGLQTFPNIGDGLSNALTTALKTTGCFKVMAREDFED 131 Query: 104 LLNERKIIRAAQENGTVAINNRIPLQSLTAANIMVEGSIIGYESNVKSGGVGARYFGIGA 163 L E + ++ + A+ +V G+I K+ G + + Sbjct: 132 LRREAEAAGITLKSAS--------------ADYLVTGAITSLAVGAKTQSFGGGFVPLVG 177 Query: 164 DTQYQL--DQIAVNLRVVNVSTGEILSSVNTSKTILSYEVQAGVFRFIDYQRLLEGEVGY 221 I++++R+V+V E+ +S + AG + L Sbjct: 178 AVSRSTKSANISIDVRLVDVKASEVKASQTFDVSNERSSWGAGGAGWGGSGALFGAASST 237 Query: 222 TSNEPVMLCLMSAIETGVI 240 S E + S I+ Sbjct: 238 QSPELDSVANESVIQAANY 256 >UniRef50_C1A9F6 Putative uncharacterized protein n=1 Tax=Gemmatimonas aurantiaca T-27 RepID=C1A9F6_GEMAT Length = 260 Score = 51.7 bits (122), Expect = 3e-05, Method: Composition-based stats. Identities = 32/152 (21%), Positives = 51/152 (33%), Gaps = 35/152 (23%) Query: 44 APTGKIFVSVYNIQDETGQFKPYPASNFSTAVPQSATAMLVTALKDSRWFIPLERQGLQN 103 AP K V+V + TG+ + Q AM+ T L LERQ L + Sbjct: 24 APARKT-VAVLAFDNNTGKTDY-------DHLGQGMAAMMTTDLAAVDEIQLLERQRLAD 75 Query: 104 LLNERKIIRAAQENGTVAINNRIPLQSLTAANIMVEGSIIGYESNVKSGGVGARYFGIGA 163 + E Q + + + L A ++ GS+ E Sbjct: 76 VTKE----IDNQRSQYFDSTTAVKVGRLAGAQYIIVGSLAAVEP---------------- 115 Query: 164 DTQYQLDQIAVNLRVVNVSTGEILSSVNTSKT 195 Q+ ++ R+V V TG I+ + S Sbjct: 116 -------QVRIDTRIVRVETGAIVKTAKVSGK 140 >UniRef50_Q7UMD1 Putative uncharacterized protein n=1 Tax=Rhodopirellula baltica RepID=Q7UMD1_RHOBA Length = 530 Score = 50.5 bits (119), Expect = 6e-05, Method: Composition-based stats. 
Identities = 31/149 (20%), Positives = 61/149 (40%), Gaps = 36/149 (24%) Query: 45 PTGKIFVSVYNIQDETGQFKPYPASNFSTAVPQSATAMLVTALKDSRWFIPLERQGLQNL 104 P + ++V I+D+ G ++ + + + T LV R ++R+ L ++ Sbjct: 200 PNLDLRIAVCPIRDQNGN-----TADETLVMAEDLTTRLVN-----RRVPVVDRESLGSV 249 Query: 105 LNERKIIRAAQENGTVAINNRIPLQSLTAANIMVEGSIIGYESNVKSGGVGARYFGIGAD 164 L+E AQ + L L+ A +V G I+ N ++ G+ Sbjct: 250 LDE----LLAQNSILFDPKTAQKLGELSGATHVVAGKIVA---NGRTRGI---------- 292 Query: 165 TQYQLDQIAVNLRVVNVSTGEILSSVNTS 193 V +R+++V TG I+ + +TS Sbjct: 293 ---------VYVRLIDVQTGRIVVATSTS 312 >UniRef50_A8ZXP3 Uncharacterized protein involved in formation of curli polymers-like protein n=1 Tax=Desulfococcus oleovorans Hxd3 RepID=A8ZXP3_DESOH Length = 315 Score = 50.5 bits (119), Expect = 6e-05, Method: Composition-based stats. Identities = 29/137 (21%), Positives = 62/137 (45%), Gaps = 18/137 (13%) Query: 48 KIFVSVYNIQDETGQFKPYPASNFSTAVPQSATAMLVTALKDSRWFIPLERQGLQNLLNE 107 K+ V V N Q++T + + ++A +L T L+ + FI + +Q + ++LN+ Sbjct: 53 KLRVGVVNFQNKT--------PSRVLGIGEAAADILGTILQKTDRFIIIPQQDMSSILNQ 104 Query: 108 RKIIRAAQENGTVAINNRIPLQSLTAANIMVEGSIIGYESNVKSGGVGARYFGIGADTQY 167 + + +G + + + N +V G+I Y S + I + Sbjct: 105 QSM----GASGVIDPTTAAKMGKVLGLNAIVTGAITAY-----SEAEEGQDLLI-YQKKK 154 Query: 168 QLDQIAVNLRVVNVSTG 184 Q+ ++ V+ R+V+ +TG Sbjct: 155 QIARVTVDYRIVDTTTG 171 >UniRef50_B0SIM6 Hypothetical lipoprotein n=2 Tax=Leptospira biflexa serovar Patoc RepID=B0SIM6_LEPBA Length = 166 Score = 49.4 bits (116), Expect = 1e-04, Method: Composition-based stats. 
Identities = 35/191 (18%), Positives = 63/191 (32%), Gaps = 38/191 (19%) Query: 1 MQRLFLLVAVMLLSGCLTAPPKE--AARPTLMPRAQSYKDLTHLPAPTGKIFVSVYNIQD 58 + L + + + S C +E +PT+ P Q L+ + V D Sbjct: 3 FKGFILALVLGVFSACYLGEERESKPKKPTVPPLEQLAISLSEKGFYFQPERLVVLTFLD 62 Query: 59 ETGQFKPYPASNFSTAVPQSATAMLVTALKDSRWFIPLERQGLQNLLNERKIIRAAQENG 118 G+ PY L T L F+ L+R Q +L E + + ++ Sbjct: 63 NEGKKSPYGEI---------LAEKLTTELVKKDRFLILDRLANQKVLKEAGL---SLDSP 110 Query: 119 TVAINNRIPLQSLTAANIMVEGSIIGYESNVKSGGVGARYFGIGADTQYQLDQIAVNLRV 178 T R + + +++ G + Y D + VN R+ Sbjct: 111 TDTATLR-KIGEVLKVGVIITGIVTPY-----------------------QDGVFVNTRL 146 Query: 179 VNVSTGEILSS 189 + + TG IL + Sbjct: 147 IEIKTGLILKA 157 >UniRef50_Q30SV1 Putative uncharacterized protein n=1 Tax=Sulfurimonas denitrificans DSM 1251 RepID=Q30SV1_SULDN Length = 334 Score = 49.0 bits (115), Expect = 2e-04, Method: Composition-based stats. Identities = 36/204 (17%), Positives = 81/204 (39%), Gaps = 32/204 (15%) Query: 1 MQRLFLLVAVMLLSGCLTAPPKEAARPTLMPRAQSYKDLTHLPAPTGKIFVSVYNIQDET 60 MQ L L+ LL+ L A L P LT +SV N ++ Sbjct: 1 MQVLKWLILAFLLTVFLAGCSHRVAIRALEPAEIDRATLTRK--------ISVTNFEN-- 50 Query: 61 GQFKPYPASNFSTAVPQSATAMLVTALKDSRWFIPLERQGLQNLLNERKIIRAAQENGTV 120 + S + T ++ + D +F + R+ +++E+KI Q +G V Sbjct: 51 ------DSVGLSNKIE---TKIISKKIDDKSYFTLISRKDFDKIISEQKI----QNSGLV 97 Query: 121 AINNRIPLQSLTAANIMVEGSIIGYESNVKS---GGVGARYFGIGADTQY------QLDQ 171 I+ + + ++ A ++ G + N + + + ++Y + Sbjct: 98 DISTAVEVGNILGAEAIISGGVGRVAFNDTTYYERRIRCNDKKCKSVSEYSVRCIKRNIG 157 Query: 172 IAVNLRVVNVSTGEILSSVNTSKT 195 ++ +LR+++++ G+I+ + +K+ Sbjct: 158 LSADLRMIDIAKGDIIYANTFNKS 181 >UniRef50_A2SLP6 Putative uncharacterized protein n=1 Tax=Methylibium petroleiphilum PM1 RepID=A2SLP6_METPP Length = 292 Score = 49.0 bits (115), Expect = 2e-04, Method: Composition-based stats. 
Identities = 48/230 (20%), Positives = 83/230 (36%), Gaps = 26/230 (11%) Query: 13 LSGCLTAPPKEAARPTLMPRAQSYKDLTHLPAPTGKIFVSVYNIQDETG-----QFKPYP 67 ++GC T+ + + + P V + G + + Sbjct: 18 IAGCSTSKTEIGGPSDMA--------IADQAPPQEG---GVGRCEKRLGTVAITESEVNS 66 Query: 68 ASNFSTAVPQSATAMLVTALKDSRWFIPLERQGLQNLL-NERKIIRAAQENGTVAINNRI 126 + S +P+S ++ L S F ++R +LL ER++ + + + Sbjct: 67 QALMSAGLPRSMAPLVRHLLIRSGCFNVVDRGAAYSLLEAERRLREQLGTDANATVARHL 126 Query: 127 -PLQSLTAANIMVEGSIIGYESNVKSGGVGARYFGIGADTQYQLDQIAVNLRVVNVSTGE 185 PL + A I+ I G V G G GIG QY + V L VV+ T E Sbjct: 127 QPLDYILRAEIVFAEQI-GQSKGVLGGVFGDVIGGIGG--QYNKKEAVVLLSVVDARTSE 183 Query: 186 ILSSVNTSKTILSYEVQAGVFRFIDYQRLLEGEVGYTSNEPVMLCLMSAI 235 I SSV T S AG+ + + + G ++ P + +A+ Sbjct: 184 ITSSVFGRGTSDS----AGLGSLVLSSGVFAIDGG-WADTPQAKTVAAAL 228 >UniRef50_C1AEH8 Putative uncharacterized protein n=1 Tax=Gemmatimonas aurantiaca T-27 RepID=C1AEH8_GEMAT Length = 273 Score = 49.0 bits (115), Expect = 2e-04, Method: Composition-based stats. Identities = 30/176 (17%), Positives = 61/176 (34%), Gaps = 36/176 (20%) Query: 75 VPQSATAMLVTALKDSRWFIPLERQGLQNLLNERKIIRAAQENGTVAINNRIPLQSLTAA 134 + + ML+T L + +ER LQ+LL+E+ + +G VA + A Sbjct: 87 LSKGLAEMLITELSGNENIRVVERDRLQSLLDEQNL----GASGRVATETAAKIGKTLGA 142 Query: 135 NIMVEGSIIGYESNVKSGGVGARYFGIGADTQYQLDQIAVNLRVVNVSTGEILSSVNTSK 194 M+ GS + + + + +++R +N T E+ + + + Sbjct: 143 LHMLMGSFV-IDP---------------------KNTMRMDVRAINTETSELEYATSVTG 180 Query: 195 TI---------LSYEVQAGVFRFIDYQRLLEGEVGYTSNEPVMLCLMSAIETGVIF 241 L ++ G+ + QR E + P L M + + Sbjct: 181 KADKMLELLGELGTKLNTGL-KLPSVQRGFEEGKAVGAKGPNQLKSMMLLSRALEQ 235 >UniRef50_C1QBW4 Putative uncharacterized protein n=1 Tax=Brachyspira murdochii DSM 12563 RepID=C1QBW4_9SPIR Length = 490 Score = 49.0 bits (115), Expect = 2e-04, Method: Composition-based stats. 
Identities = 32/201 (15%), Positives = 76/201 (37%), Gaps = 45/201 (22%) Query: 1 MQRLFLLVAVMLLSGCLTAPPKEAARPTLMPRAQSYKDLTHLPAPTGKIFVSVYNIQDET 60 M+ + + ++ S T P++ R + ++V+ I+D + Sbjct: 1 MKNIIFTLFFLIFS--FTLFPQQMNREIGKTYTKEN--------------IAVFEIEDSS 44 Query: 61 GQFKPYPASNFSTAVPQSATAMLVTALKDSRWFIPLERQGLQNLLNERKIIRAAQENGTV 120 ++ S + + TA++ +L F ++R+ L L E ++ + + Sbjct: 45 SRY--------SQGLGKQLTALIEDSLTKMNRFNIVDRKNLDKYLKEMEL-----QLTGI 91 Query: 121 AINNRIPLQSLTAANIMVEGSI----IGYESNVKSGGVGARYFGIGADTQYQLDQIAVNL 176 + I + + + V+G+I + Y + + G G+ Y Q+ + L Sbjct: 92 TESQVIEVGKIYGYSKAVKGNIVNADVSYNYDSDT-GSGSLYG-----------QVEMVL 139 Query: 177 RVVNVSTGEILSSVNTSKTIL 197 ++V+V T +I+ S Sbjct: 140 QIVDVETTKIMYSSKLQGISY 160 >UniRef50_B4SIB8 Putative uncharacterized protein n=3 Tax=Bacteria RepID=B4SIB8_STRM5 Length = 551 Score = 48.6 bits (114), Expect = 2e-04, Method: Composition-based stats. Identities = 30/163 (18%), Positives = 62/163 (38%), Gaps = 18/163 (11%) Query: 43 PAPTGKIFVSVYNIQDETGQFKPYPASNFSTAVPQSATAMLVTALKDSRWFIPLERQGLQ 102 P GK + V + + G + + V + A L L ++ FI L+R+ Sbjct: 235 PDEQGKPKIVVALPRTKAGSYAVGDGRVDADEVADAIRARLSDTLTQTQRFIVLDREFGD 294 Query: 103 NLLNERKIIRAAQENGTVAINNRIPLQSLTAANIMVEGSIIGYES-----NVKSGGVGAR 157 L E I + G V + + + A ++++ +I +E N++ Sbjct: 295 ELQAEIDHINS----GNVRLQDTARVGQQLATDLILIPTIERFEYPRSVRNLRMSDRQVT 350 Query: 158 YFGIGADTQYQLDQIAVNLRVVNVSTGEILSSVNTSKTILSYE 200 + G + LR++N +TG+++ S + + S Sbjct: 351 SYSGGGR---------ITLRLINATTGQVVMSDSFDHQLASTG 384 >UniRef50_C1TRF7 Tetratricopeptide repeat protein n=1 Tax=Dethiosulfovibrio peptidovorans DSM 11002 RepID=C1TRF7_9BACT Length = 382 Score = 48.2 bits (113), Expect = 3e-04, Method: Composition-based stats. 
Identities = 20/100 (20%), Positives = 43/100 (43%), Gaps = 14/100 (14%) Query: 46 TGKIFVSVYNIQDETGQFKPYPASNFSTAVPQSATAMLVTALKDSRWFIPLERQGLQNLL 105 + + V++ + A S +V QS ML + L +R F +ER L + Sbjct: 24 SAPMTVAIGDFN----------ARGASYSVGQSVVEMLYSRLAGNRAFRLVERGQLDQVA 73 Query: 106 NERKIIRAAQENGTVAINNRIPLQSLTAANIMVEGSIIGY 145 +++I + G V+ + + + + A V+G++ + Sbjct: 74 RQQRITMS----GMVSQESAVEIGRIVGAKYYVQGAVSHF 109 >UniRef50_A8UTU4 Curli production assembly/transport component CsgG n=2 Tax=Hydrogenivirga sp. 128-5-R1-1 RepID=A8UTU4_9AQUI Length = 258 Score = 48.2 bits (113), Expect = 3e-04, Method: Composition-based stats. Identities = 33/152 (21%), Positives = 56/152 (36%), Gaps = 31/152 (20%) Query: 98 RQGLQNLLNERKIIRAAQENGTVAINNRIPLQSLTAANIMVEGSIIGYESNVKSGGVGAR 157 R LQ +L E+K Q +G V N + + L +V GS+ G G + Sbjct: 6 RNDLQKVLQEQKF----QMSGLVDPNTAVQIGQLAGVKYIVTGSVNNINLKWVDVGEGVK 61 Query: 158 ------------YFGIGADTQYQ-LDQIAVNLRVVNVSTGEILSSVNTSKTILSYEVQAG 204 +GA TQ I + ++V++ TGE++ +KT+ EV Sbjct: 62 RGLSEHLGLLGTALAVGASTQEGWNLSIDIVVKVIDTETGEVV----LTKTVSGREVLGK 117 Query: 205 VFRFIDYQRLLEGEVGYTSNEPVMLCLMSAIE 236 F ++ ++ G A+E Sbjct: 118 TPTF-NFDSIIGG---------AKKAAQEALE 139 >UniRef50_B6BP75 Peptidoglycan-binding domain 1 protein n=1 Tax=Candidatus Pelagibacter sp. HTCC7211 RepID=B6BP75_9RICK Length = 346 Score = 47.8 bits (112), Expect = 4e-04, Method: Composition-based stats. 
Identities = 35/166 (21%), Positives = 68/166 (40%), Gaps = 22/166 (13%) Query: 80 TAMLVTALKDSRWFIPLERQ-GLQNLLNERKIIRAAQENGTVAINNRIPLQSLTAANIMV 138 T+++ ++ S FI LER +QNLL ER + + G + + + + A+ ++ Sbjct: 90 TSLIRLIVQQSNCFIVLERGIAMQNLLQERTLSSS----GELKQDQNMGKGQMITADYIL 145 Query: 139 EGSIIGYESNVKSGG----------VGARYFGIGADTQYQLDQIAVNLRVVNVSTGEILS 188 +II E+N G G+ IG ++ Q ++ L + +G ++ Sbjct: 146 TPTIIFKEANTGGVGGLLGGLLPGNAGSVAGIIGGSLKFSESQTSLTL--ADTRSGIQVA 203 Query: 189 SVNTSKTILSYEVQAGVFRFIDYQRLLEGEVGYTSNEPVMLCLMSA 234 + + S+ V AG+ +G SN P + +A Sbjct: 204 AAAGASKKSSFGVVAGL-----GGSSAAAGLGAYSNTPEGKVVAAA 244 >UniRef50_B7QTV8 Putative peptidoglycan binding domain protein n=1 Tax=Ruegeria sp. R11 RepID=B7QTV8_9RHOB Length = 424 Score = 47.1 bits (110), Expect = 7e-04, Method: Composition-based stats. Identities = 54/277 (19%), Positives = 89/277 (32%), Gaps = 44/277 (15%) Query: 4 LFLLVAVMLLSGC---------LTAPPKEAARPTLMPRAQSYKDLTHLPA--PTGKIFVS 52 + LV + L+GC A P A L ++ + + L A P VS Sbjct: 8 VGSLVLALTLAGCGARYPALTPAIAQPNARAARNLTSFSEPLRCMDGLFAQLPRQSYLVS 67 Query: 53 VYNIQDETGQFKPYPASNFSTAVPQSATAMLVTALKDSRWFIPLERQGLQNLLNERKIIR 112 +I DET V A ML+ A+ R L++ +I Sbjct: 68 SSDIPDET------------RRVSVGADDMLINAMNQMNR-----RSQRYVFLDQARISG 110 Query: 113 AAQENGTVAINNRIPLQSLTAANIMVEGSIIGYESNVKSGGVGARYFGIGADT------- 165 Q T L + GSI +S+ VGA + G+ Sbjct: 111 FGQLELTTTRKKGEVKPQL-----YIRGSISQLDSDRVDAEVGAVHSTEGSSGLTKSLYK 165 Query: 166 -QYQLDQIAVNLRVVNVSTGEIL--SSVNTSKTILSYEVQAGVFRFIDYQRLLEGEVGYT 222 +L ++V+L +V + ++ +SV S ++ + V ID + Sbjct: 166 GFRKLSVVSVDLHLVEYPSRRVVPGASVANSMVVVRRGLNGTVTGIIDSVT-GGVPISVE 224 Query: 223 SNEPVMLCLMSAIETGVIFLINDGIDRGLWDLQNKAE 259 E + + IE G+I L+ W Sbjct: 225 RIESQGQAVRNLIELGLIELLGKHAGVPYWQCLEAPS 261 >UniRef50_B7IGU6 Putative uncharacterized protein n=1 Tax=Thermosipho africanus TCF52B RepID=B7IGU6_THEAB Length = 396 Score = 46.7 bits (109), Expect = 0.001, Method: Composition-based stats. 
Identities = 19/133 (14%), Positives = 44/133 (33%), Gaps = 20/133 (15%) Query: 68 ASNFSTAVPQSATAMLVTALKDSRWFIPLERQGLQNLLNERKIIRAAQENGTVAINNRIP 127 + ++ ++L + F R L ++ ER + + Sbjct: 36 GNGWNMDEADFLISILEQKALELGRFRVFSRNDLDMIVKERNLGDLGI------VEQTFE 89 Query: 128 LQSLTAANIMVEGSIIGYESNVKSGGVGARYFGIGADTQYQLDQIAVNLRVVNVSTGEIL 187 + A + ++ SN + G A ++L++ ++ TGE+L Sbjct: 90 AGKILGARYAILLTLTELTSNYEKNGYTASLR--------------LSLKLYDLKTGELL 135 Query: 188 SSVNTSKTILSYE 200 +S +K + E Sbjct: 136 ASTPFAKETYTEE 148 >UniRef50_Q3B6Q5 Periplasmic protein n=2 Tax=Chlorobium/Pelodictyon group RepID=Q3B6Q5_PELLD Length = 435 Score = 45.5 bits (106), Expect = 0.002, Method: Composition-based stats. Identities = 34/168 (20%), Positives = 63/168 (37%), Gaps = 32/168 (19%) Query: 33 AQSYKDLTHLPAP----TGKIFVSVYNIQD------ETGQFKPYPASNFSTAVPQSATAM 82 A+ TH +P + + ++V + GQ P V Sbjct: 119 ARVLVAFTHYKSPGLNASNRRRIAVMPFRTAGIPMLLDGQRVPAE------EVSAELVQQ 172 Query: 83 LVTALKDSRWFIPLERQGLQNLLNERKIIRAAQENGTVAINNRIPLQSLTAANIMVEGSI 142 +VT L SR F L+R + L+E+ ++ + + + + + + M+ GSI Sbjct: 173 VVTELTQSRKFTVLDRDYMDAYLSEKSLLLSPDGE----ESEMMKMGRVLGVDYMLVGSI 228 Query: 143 IGYESNVKSGGVGARYFGI----GADTQYQLDQIAVNLRVVNVSTGEI 186 SGGV R + G Q+ + V+ R++ + T E+ Sbjct: 229 --------SGGVERRAEDVLALTGERVQHGAASLNVDYRIIVMPTREV 268 >UniRef50_C0QSR9 Putative lipoprotein n=1 Tax=Persephonella marina EX-H1 RepID=C0QSR9_PERMH Length = 399 Score = 44.7 bits (104), Expect = 0.004, Method: Composition-based stats. 
Identities = 30/165 (18%), Positives = 66/165 (40%), Gaps = 24/165 (14%) Query: 51 VSVYNIQDETGQFKPYPASNFSTAVPQSATAMLVTALKDSRWFIPLERQGLQNLLNERKI 110 + + Q + + +V + TA LV + ++ + R+ LQ ++ E++ Sbjct: 110 ATAGDYQKQLEMVTRDVNARLGESVAEGVTAQLV-EMGGAKVYT---RRDLQKVMQEQQF 165 Query: 111 IRAAQENGTVAINNRIPLQSLTAANIMVEGSIIGYESNVKSG-----GVGARYFGIGADT 165 Q++G +N + L L ++ GS+ S G+ +GA Sbjct: 166 ----QQSGLTDVNTLVQLGKLAGVKYIITGSVNNVNLKWISAEYAKKGLSQHLGLVGAIA 221 Query: 166 QYQLD-----QIAVNL--RVVNVSTGEILSSVNTSKTILSYEVQA 203 ++ ++ +L ++++V TGE++ +K I EV Sbjct: 222 AAAIETQEGWNLSTDLTIKIIDVETGEVV----LAKNISGREVLG 262 >UniRef50_A8ZSW9 Tetratricopeptide TPR_2 repeat protein n=1 Tax=Desulfococcus oleovorans Hxd3 RepID=A8ZSW9_DESOH Length = 212 Score = 44.4 bits (103), Expect = 0.004, Method: Composition-based stats. Identities = 21/123 (17%), Positives = 50/123 (40%), Gaps = 28/123 (22%) Query: 78 SATAMLVTALKDSRW-FIPLERQGLQNLLNERKIIRAAQENGTVAINNRIPLQSLTAANI 136 + ML+T L +S F +ER+ ++ LL+E + + G + + + + + A Sbjct: 42 GLSVMLMTELANSEAAFTLVEREKIRALLDEITL----GQTGVIDASTAVKMGKMLGAQA 97 Query: 137 MVEGSIIGYESNVKSGGVGARYFGIGADTQYQLDQIAVNLRVVNVSTGEILSSVNTSKTI 196 + G+ + NV+ +++R+V V TG ++ + + + Sbjct: 98 IGFGAFMVMGKNVR-----------------------IDMRMVEVETGALIMAESITGKT 134 Query: 197 LSY 199 + Sbjct: 135 DDF 137 >UniRef50_Q0VKU1 Putative uncharacterized protein n=2 Tax=Alcanivorax RepID=Q0VKU1_ALCBS Length = 430 Score = 44.4 bits (103), Expect = 0.004, Method: Composition-based stats. 
Identities = 28/151 (18%), Positives = 56/151 (37%), Gaps = 16/151 (10%) Query: 37 KDLTHLPAPTGKIFVSVYNIQDETGQFKPYPASNFSTAVPQSATAMLVTALKDSRWFIPL 96 D +HLP +++ + + ++ V L A S + L Sbjct: 132 PDRSHLPG------IAIATFESAKSSYDLGDIKVPASQVQHQLQDNLTMAFSQSGRYRVL 185 Query: 97 ERQGLQNLLNERKIIRAAQENGTVAINNRIPLQSLTAANIMVEGSIIGYESNVKSGGVGA 156 +R L ++ E ++ G++A L A++++ G+I ++ A Sbjct: 186 DRTYLADVDEELGVV----AQGSIAPEEMARLGQRKGADLLLVGTIEDFQ---IGDSAQA 238 Query: 157 RYFGIGADTQYQLDQIAVNLRVVNVSTGEIL 187 Y GA + V R+++ +T EIL Sbjct: 239 FY---GAKMGGYAPYVRVRYRLIDTTTTEIL 266 >UniRef50_A7BYD5 Putative uncharacterized protein n=1 Tax=Beggiatoa sp. PS RepID=A7BYD5_9GAMM Length = 334 Score = 43.6 bits (101), Expect = 0.007, Method: Composition-based stats. Identities = 21/143 (14%), Positives = 55/143 (38%), Gaps = 16/143 (11%) Query: 8 VAVMLLSGCLTAPPKEAARPTLMPRAQSYKDLTHLPAPTGKIFVSVYNIQDETGQFKPYP 67 V ++ +S TA E + + + + K V+V + D +G Sbjct: 17 VPIIFMSVNATAYEYEYEK----EITRLSATMAEKISAANKTKVAVVDFTDISGN----- 67 Query: 68 ASNFSTAVPQSATAMLVTALKDSRWFIPLERQGLQNLLNERKIIRAAQENGTVAINNRIP 127 ++ + + + LV+A + F ++R L +++ E K+ + G + Sbjct: 68 VTHLGRFIAEEFSVALVSA---GKGFQVVDRIHLHSIIKEHKLSKT----GLIDPKTARE 120 Query: 128 LQSLTAANIMVEGSIIGYESNVK 150 L + ++ G++ + +++ Sbjct: 121 LGKIAGVEALITGTLTPFGDSIR 143 >UniRef50_Q1LBB8 Putative uncharacterized protein n=1 Tax=Cupriavidus metallidurans CH34 RepID=Q1LBB8_RALME Length = 447 Score = 43.6 bits (101), Expect = 0.008, Method: Composition-based stats. 
Identities = 34/160 (21%), Positives = 59/160 (36%), Gaps = 24/160 (15%) Query: 43 PAPTGKIFVSVY--NIQDETGQFKPYPASNFSTAVPQSATAMLVTALKDSRWFIPLERQG 100 PA TGK+ +++ I G A+ + AL S F LER Sbjct: 160 PADTGKLRIAIAPLRIGHVGGNNAERIAAELRQRIT--------DALTQSGRFTVLERGD 211 Query: 101 LQNLLNERKIIRAAQENGTVAINNRIPLQSLTAANIMVEGSIIGYESNVKSGGVGARYFG 160 L E + I +G V+ + + A+++ GS+ +E+ + Sbjct: 212 APELYGEIERI----ASGEVSNDQFSKIGQGLGADLIWFGSVNAFETGRPTD-------- 259 Query: 161 IGADTQYQLDQIAVNLRVVNVSTGEILSSVNTSKTILSYE 200 AD + + +V+ + VNV+T E+L S Sbjct: 260 --ADQTGRAGRWSVSQKFVNVTTREVLFSNTVDGGAAEQS 297 Searching..................................................done Results from round 3 Score E Sequences producing significant alignments: (bits) Value Sequences used in model and found again: UniRef50_A8AI45 Curli production assembly/transport component cs... 317 3e-85 UniRef50_A6EMN0 Putative assembly or transport protein for curli... 276 5e-73 UniRef50_A5FHE8 Curli production assembly/transport component Cs... 264 2e-69 UniRef50_Q3KES0 Assembly/transport component in curli production... 260 4e-68 UniRef50_C6X5S1 Curli production assembly/transport component Cs... 256 8e-67 UniRef50_Q392B7 Curli production assembly/transport component Cs... 254 3e-66 UniRef50_A3XJL0 Putative assembly or transport protein for curli... 250 4e-65 UniRef50_Q084E4 Curli production assembly/transport component Cs... 248 1e-64 UniRef50_Q1NBA5 Putative curli production assembly/transport com... 244 2e-63 UniRef50_D2QPP9 Curli production assembly/transport component Cs... 243 6e-63 UniRef50_Q0TJ37 Curli production assembly/transport component Cs... 241 3e-62 UniRef50_B6QZQ0 Curli fiber membrane-associated lipoprotein CsgG... 236 7e-61 UniRef50_A9L3G5 Curli production assembly/transport component Cs... 225 9e-58 UniRef50_B3QHQ8 Curli production assembly/transport component Cs... 218 2e-55 UniRef50_Q1YMP1 Putative Curli production assembly/transport com... 
218 2e-55
UniRef50_D2LDX9 Curli production assembly/transport component Cs... 217 3e-55
UniRef50_B3QD06 Curli production assembly/transport component Cs... 213 7e-54
UniRef50_A7HWH6 HfaB protein n=1 Tax=Parvibaculum lavamentivoran... 185 1e-45
UniRef50_C5SPX6 HfaB protein n=1 Tax=Asticcacaulis excentricus C... 185 1e-45
UniRef50_A6GPY8 HfaB protein n=1 Tax=Limnobacter sp. MED105 RepI... 171 2e-41
UniRef50_Q1QYN5 Curli production assembly/transport component Cs... 168 3e-40
UniRef50_B9L5Z4 Putative curli production assembly/transport com... 167 3e-40
UniRef50_P27343 Putative transcription activator protein hfaB n=... 163 7e-39
UniRef50_C6XNU9 Holdfast attachment protein HfaB n=3 Tax=Alphapr... 162 1e-38
UniRef50_Q0AL09 HfaB protein n=1 Tax=Maricaulis maris MCS10 RepI... 161 2e-38
UniRef50_B4W728 Putative uncharacterized protein n=1 Tax=Brevund... 161 3e-38
UniRef50_A4CEI5 Putative uncharacterized protein n=1 Tax=Pseudoa... 161 3e-38
UniRef50_C5SPX7 HfaB protein n=2 Tax=Caulobacteraceae RepID=C5SP... 155 1e-36
UniRef50_C5D1Z0 Putative uncharacterized protein n=1 Tax=Variovo... 152 1e-35
UniRef50_A1BH58 Curli production assembly/transport component Cs... 144 4e-33
UniRef50_C8R0R5 Curli production assembly/transport component Cs... 143 7e-33
UniRef50_Q2RW21 Putative uncharacterized protein n=1 Tax=Rhodosp... 138 2e-31
UniRef50_C7RID3 Peptidoglycan-binding domain 1 protein n=1 Tax=C... 133 6e-30
UniRef50_O67219 Putative uncharacterized protein n=1 Tax=Aquifex... 129 8e-29
UniRef50_P73111 Sll1835 protein n=3 Tax=Chroococcales RepID=P731... 128 3e-28
UniRef50_A6DK47 Curli production assembly/transport component Cs... 123 8e-27
UniRef50_Q05S73 Putative uncharacterized protein n=1 Tax=Synecho... 123 9e-27
UniRef50_Q0IE08 Curli production assembly/transport component Cs... 122 1e-26
UniRef50_D0SYN2 Predicted protein n=1 Tax=Acinetobacter lwoffii ... 121 2e-26
UniRef50_A1VWR2 Curli production assembly/transport component Cs... 119 7e-26
UniRef50_Q21E38 Curli production assembly/transport component Cs... 119 7e-26
UniRef50_B0VF47 Putative curli production assembly/transport com... 117 3e-25
UniRef50_Q4KAD5 CsgG family protein n=20 Tax=Proteobacteria RepI... 117 4e-25
UniRef50_Q2JVB7 CsgG family protein n=4 Tax=Cyanobacteria RepID=... 115 1e-24
UniRef50_B8CKS1 Curli production assembly/transport component Cs... 115 2e-24
UniRef50_Q31MJ4 Putative uncharacterized protein n=2 Tax=Synecho... 114 3e-24
UniRef50_A9L616 Curli production assembly/transport component Cs... 114 3e-24
UniRef50_A3JNY2 Putative uncharacterized protein n=1 Tax=Rhodoba... 113 6e-24
UniRef50_B8JAW6 Curli production assembly/transport component Cs... 112 9e-24
UniRef50_D0XIZ9 HfaB protein n=1 Tax=Brevundimonas subvibrioides... 112 1e-23
UniRef50_Q7DDH4 Putative lipoprotein NMB1126/NMB1164 n=130 Tax=P... 112 2e-23
UniRef50_C3WB28 Curli production assembly/transport component Cs... 111 2e-23
UniRef50_B7QTV8 Putative peptidoglycan binding domain protein n=... 111 2e-23
UniRef50_C2MBZ3 Curli production assembly/transport component Cs... 110 6e-23
UniRef50_D1B8K8 Curli production assembly/transport component Cs... 110 7e-23
UniRef50_D1Y6S9 Tetratricopeptide repeat domain protein n=1 Tax=... 109 1e-22
UniRef50_A2CA24 Uncharacterized protein involved in formation of... 109 1e-22
UniRef50_D0XJ00 Putative uncharacterized protein n=1 Tax=Brevund... 109 1e-22
UniRef50_B8HPG7 CsgG family protein, putative n=1 Tax=Cyanothece... 108 3e-22
UniRef50_Q1IPN7 Curli production assembly/transport component Cs... 106 7e-22
UniRef50_C6JM48 Putative uncharacterized protein n=2 Tax=Fusobac... 106 8e-22
UniRef50_B0CFQ9 CsgG family protein, putative n=1 Tax=Acaryochlo... 105 2e-21
UniRef50_D1Y6S8 Putative curli production assembly/transport com... 103 5e-21
UniRef50_A6DA55 Putative uncharacterized protein n=1 Tax=Caminib... 101 3e-20
UniRef50_Q9RTM7 Putative uncharacterized protein n=1 Tax=Deinoco... 101 3e-20
UniRef50_A5PA28 Curli production assembly/transport component Cs... 99 1e-19
UniRef50_C9KNZ5 Putative uncharacterized protein n=1 Tax=Mitsuok... 98 4e-19
UniRef50_B9CYA0 Peptidoglycan-binding domain 1 protein n=2 Tax=C... 96 2e-18
UniRef50_B0JU00 Curli production assembly/transport component n=... 95 3e-18
UniRef50_C1CWI5 Putative Curli production assembly/transport com... 95 4e-18
UniRef50_C1XFI9 Uncharacterized protein involved in formation of... 93 9e-18
UniRef50_A8ZXP3 Uncharacterized protein involved in formation of... 92 2e-17
UniRef50_C6W063 Uncharacterized protein involved in formation of... 91 4e-17
UniRef50_B8GVR1 Putative uncharacterized protein n=2 Tax=Cauloba... 90 6e-17
UniRef50_B7A9M7 Putative uncharacterized protein n=1 Tax=Thermus... 89 1e-16
UniRef50_A2SLP6 Putative uncharacterized protein n=1 Tax=Methyli... 89 2e-16
UniRef50_Q30SV1 Putative uncharacterized protein n=1 Tax=Sulfuri... 84 6e-15
UniRef50_B0SIM6 Hypothetical lipoprotein n=2 Tax=Leptospira bifl... 83 8e-15
UniRef50_A3WBM1 Putative uncharacterized protein n=1 Tax=Erythro... 82 2e-14
UniRef50_C1QBW4 Putative uncharacterized protein n=1 Tax=Brachys... 82 2e-14
UniRef50_Q93HR4 Putative uncharacterized protein (Fragment) n=1 ... 82 3e-14
UniRef50_C1AEH8 Putative uncharacterized protein n=1 Tax=Gemmati... 81 4e-14
UniRef50_B4SIB8 Putative uncharacterized protein n=3 Tax=Bacteri... 77 6e-13
UniRef50_C1A9F6 Putative uncharacterized protein n=1 Tax=Gemmati... 76 9e-13
UniRef50_B6BP75 Peptidoglycan-binding domain 1 protein n=1 Tax=C... 76 2e-12
UniRef50_Q7UMD1 Putative uncharacterized protein n=1 Tax=Rhodopi... 69 2e-10
UniRef50_B7IGU6 Putative uncharacterized protein n=1 Tax=Thermos... 66 1e-09
UniRef50_A8UTU4 Curli production assembly/transport component Cs... 65 3e-09
UniRef50_C1TRF7 Tetratricopeptide repeat protein n=1 Tax=Dethios... 64 6e-09

Sequences not found previously or not previously below threshold:

UniRef50_A9BZV7 Peptidoglycan-binding domain 1 protein n=4 Tax=C... 76 1e-12
UniRef50_A6VLC8 Peptidoglycan-binding domain 1 protein n=2 Tax=P... 73 1e-11
UniRef50_A7ZCR3 Peptidoglycan-binding domain 1 protein n=3 Tax=B... 68 5e-10
UniRef50_C0QZW0 Curli production assembly/transport component Cs... 67 6e-10
UniRef50_C0QSR9 Putative lipoprotein n=1 Tax=Persephonella marin... 67 9e-10
UniRef50_Q0VKU1 Putative uncharacterized protein n=2 Tax=Alcaniv... 66 9e-10
UniRef50_B5WBR1 Putative uncharacterized protein n=1 Tax=Burkhol... 64 4e-09
UniRef50_A7BYD5 Putative uncharacterized protein n=1 Tax=Beggiat... 64 5e-09
UniRef50_C1SMS2 Putative uncharacterized protein n=1 Tax=Denitro... 63 1e-08
UniRef50_Q3B6Q5 Periplasmic protein n=2 Tax=Chlorobium/Pelodicty... 61 3e-08
UniRef50_A8ZSW9 Tetratricopeptide TPR_2 repeat protein n=1 Tax=D... 60 9e-08
UniRef50_B6IWB1 Putative uncharacterized protein n=2 Tax=Rhodosp... 60 1e-07
UniRef50_Q1LBB8 Putative uncharacterized protein n=1 Tax=Cupriav... 59 1e-07
UniRef50_C8PSK0 Putative uncharacterized protein n=1 Tax=Trepone... 59 2e-07
UniRef50_B9M4B1 Membrane lipoprotein lipid attachment site n=2 T... 59 2e-07
UniRef50_A6WLI2 Putative uncharacterized protein n=1 Tax=Shewane... 59 2e-07
UniRef50_C6BZA8 Putative lipoprotein n=1 Tax=Desulfovibrio salex... 58 3e-07
UniRef50_A6Q5P4 Putative uncharacterized protein n=1 Tax=Nitrati... 58 5e-07
UniRef50_B0VIM9 Putative uncharacterized protein n=1 Tax=Candida... 56 1e-06
UniRef50_D1N7A7 Putative uncharacterized protein n=1 Tax=Victiva... 56 2e-06
UniRef50_C6BWK5 Putative uncharacterized protein n=1 Tax=Desulfo... 55 3e-06
UniRef50_A6LK65 Putative uncharacterized protein n=1 Tax=Thermos... 55 3e-06
UniRef50_Q31E50 Putative uncharacterized protein n=2 Tax=Proteob... 54 4e-06
UniRef50_B2KEV3 Putative uncharacterized protein n=1 Tax=Elusimi... 54 4e-06
UniRef50_Q4UYT5 Putative uncharacterized protein n=6 Tax=Xanthom... 54 7e-06
UniRef50_A9BUH8 Putative uncharacterized protein n=1 Tax=Delftia... 54 8e-06
UniRef50_D1N8P2 Putative uncharacterized protein n=1 Tax=Victiva... 53 9e-06
UniRef50_B9XL67 Putative uncharacterized protein n=1 Tax=bacteri... 53 1e-05
UniRef50_C1AEH5 Putative uncharacterized protein n=1 Tax=Gemmati... 53 1e-05
UniRef50_C0QAM6 Putative uncharacterized protein n=1 Tax=Desulfo... 53 1e-05
UniRef50_C6BSE2 Putative lipoprotein n=1 Tax=Desulfovibrio salex... 53 1e-05
UniRef50_C9KNZ7 Putative curli production assembly/transport com... 53 1e-05
UniRef50_UPI00016C46F9 putative serine/threonine-protein kinase ... 52 2e-05
UniRef50_Q1PV59 Putative uncharacterized protein n=1 Tax=Candida... 52 2e-05
UniRef50_Q2LUS2 Tetratricopeptide repeat domain containing prote... 52 3e-05
UniRef50_Q6AQQ4 Probable periplasmic protein n=1 Tax=Desulfotale... 52 3e-05
UniRef50_Q2LUR8 Fibronectin type III domain containing protein n... 52 3e-05
UniRef50_Q8EXG4 Putative uncharacterized protein n=2 Tax=Leptosp... 52 3e-05
UniRef50_C9KTB5 Putative uncharacterized protein n=3 Tax=Bactero... 51 5e-05
UniRef50_B3E5P1 Putative uncharacterized protein n=1 Tax=Geobact... 51 6e-05
UniRef50_Q2BZ14 Putative uncharacterized protein n=2 Tax=Photoba... 51 6e-05
UniRef50_UPI00016C46FA hypothetical protein GobsU_15593 n=1 Tax=... 51 6e-05
UniRef50_B1Y5R8 Peptidoglycan-binding domain 1 protein n=6 Tax=P... 50 7e-05
UniRef50_B2V7H8 Putative uncharacterized protein n=1 Tax=Sulfuri... 50 7e-05
UniRef50_C4XP04 Putative uncharacterized protein n=1 Tax=Desulfo... 50 9e-05
UniRef50_Q1Q2L7 Putative uncharacterized protein n=1 Tax=Candida... 50 1e-04
UniRef50_C1TRF9 Putative uncharacterized protein n=1 Tax=Dethios... 50 1e-04
UniRef50_A6EQ48 Putative uncharacterized protein n=1 Tax=unident... 50 1e-04
UniRef50_Q4HPW6 Probable periplasmic protein Cj0093 n=14 Tax=Cam... 50 1e-04
UniRef50_B3E9Y8 Tetratricopeptide TPR_2 repeat protein n=1 Tax=G... 49 2e-04
UniRef50_C4LBF5 Putative uncharacterized protein n=1 Tax=Tolumon... 49 2e-04
UniRef50_A8ZSW8 Tetratricopeptide TPR_2 repeat protein n=1 Tax=D... 48 3e-04
UniRef50_A8UWV5 Putative uncharacterized protein n=1 Tax=Hydroge... 48 3e-04
UniRef50_A8V0I3 Putative uncharacterized protein n=1 Tax=Hydroge... 48 4e-04
UniRef50_A7HKS6 Putative uncharacterized protein n=1 Tax=Fervido... 48 5e-04
UniRef50_B0SDG2 Putative uncharacterized protein n=2 Tax=Leptosp... 48 5e-04
UniRef50_A6GKK1 Putative lipoprotein n=1 Tax=Plesiocystis pacifi... 48 5e-04
UniRef50_A3EPA7 Putative uncharacterized protein n=3 Tax=Leptosp... 47 6e-04
UniRef50_A0L966 Putative uncharacterized protein n=1 Tax=Magneto... 46 0.001
UniRef50_A7HBE1 Lipoprotein, putative n=1 Tax=Anaeromyxobacter s... 46 0.001
UniRef50_Q1K1J1 Lipoprotein, putative n=1 Tax=Desulfuromonas ace... 46 0.001
UniRef50_C8R2C8 Putative uncharacterized protein n=1 Tax=Desulfu... 46 0.002
UniRef50_A1WBB7 Putative lipoprotein n=1 Tax=Acidovorax sp. JS42... 46 0.002
UniRef50_B9B5R2 Peptidoglycan-binding domain 1 protein n=29 Tax=... 45 0.002
UniRef50_Q1IJH0 Serine/threonine protein kinase with TPR repeats... 45 0.003
UniRef50_D1N8P3 Putative uncharacterized protein n=1 Tax=Victiva... 44 0.004
UniRef50_A9BP36 Putative uncharacterized protein n=5 Tax=Comamon... 44 0.004
UniRef50_Q6MLD6 Putative uncharacterized protein n=1 Tax=Bdellov... 44 0.004
UniRef50_Q2BQZ0 Putative lipoprotein n=1 Tax=Neptuniibacter caes... 44 0.004
UniRef50_A7HI06 Serine/threonine protein kinase n=1 Tax=Anaeromy... 44 0.005
UniRef50_B1ZST0 Putative uncharacterized protein n=2 Tax=Verruco... 44 0.007
UniRef50_B9XQI4 Calcium-binding EF-hand-containing protein n=2 T... 44 0.007
UniRef50_B8L8X5 Peptidoglycan-binding domain 1 protein n=1 Tax=S... 44 0.008
UniRef50_B8FNQ3 Integrin-like repeat-containing protein n=1 Tax=... 44 0.008
UniRef50_D0KYL0 Putative uncharacterized protein n=1 Tax=Halothi... 43 0.009
UniRef50_A3EPC9 Putative uncharacterized protein n=2 Tax=Leptosp...
43 0.010 UniRef50_UPI00016C4204 hypothetical protein GobsU_09179 n=1 Tax=... 43 0.012 UniRef50_Q1N7X1 Putative uncharacterized protein n=1 Tax=Sphingo... 43 0.012 UniRef50_A0LJ31 Putative uncharacterized protein n=1 Tax=Syntrop... 43 0.012 UniRef50_Q6W140 Probable adenylate class-3/4/guanylyl cyclase n=... 42 0.020 UniRef50_B5EG18 Putative uncharacterized protein n=1 Tax=Geobact... 42 0.028 UniRef50_B9XQI7 Putative uncharacterized protein n=2 Tax=Verruco... 41 0.040 UniRef50_B0TXJ5 Putative uncharacterized protein n=18 Tax=Franci... 41 0.053 UniRef50_C0AAC4 Putative uncharacterized protein n=1 Tax=Opituta... 40 0.083 UniRef50_B2SFZ2 Lipoprotein, putative n=18 Tax=Francisella RepID... 40 0.099 >UniRef50_A8AI45 Curli production assembly/transport component csgG n=90 Tax=Enterobacteriaceae RepID=CSGG_CITK8 Length = 277 Score = 317 bits (811), Expect = 3e-85, Method: Composition-based stats. Identities = 271/277 (97%), Positives = 274/277 (98%) Query: 1 MQRLFLLVAVMLLSGCLTAPPKEAARPTLMPRAQSYKDLTHLPAPTGKIFVSVYNIQDET 60 MQRL +LVAV LLSGCLTAPPKEAA+PTLMPRAQSYKDLTHLP PTGKIFVSVYNIQDET Sbjct: 1 MQRLLILVAVCLLSGCLTAPPKEAAKPTLMPRAQSYKDLTHLPMPTGKIFVSVYNIQDET 60 Query: 61 GQFKPYPASNFSTAVPQSATAMLVTALKDSRWFIPLERQGLQNLLNERKIIRAAQENGTV 120 GQFKPYPASNFSTAVPQSATAMLVTALKDSRWFIPLERQGLQNLLNERKIIRAAQENGTV Sbjct: 61 GQFKPYPASNFSTAVPQSATAMLVTALKDSRWFIPLERQGLQNLLNERKIIRAAQENGTV 120 Query: 121 AINNRIPLQSLTAANIMVEGSIIGYESNVKSGGVGARYFGIGADTQYQLDQIAVNLRVVN 180 AINNRIPLQSLTAANIMVEGSIIGYESNVKSGGVGARYFGIGADTQYQLDQIAVNLRVVN Sbjct: 121 AINNRIPLQSLTAANIMVEGSIIGYESNVKSGGVGARYFGIGADTQYQLDQIAVNLRVVN 180 Query: 181 VSTGEILSSVNTSKTILSYEVQAGVFRFIDYQRLLEGEVGYTSNEPVMLCLMSAIETGVI 240 VSTGEILSSVNTSKTILSYEVQAGVFRFIDYQRLLEGE+GYTSNEPVMLCLMSAIETGVI Sbjct: 181 VSTGEILSSVNTSKTILSYEVQAGVFRFIDYQRLLEGEIGYTSNEPVMLCLMSAIETGVI 240 Query: 241 FLINDGIDRGLWDLQNKAERQNDILVKYRHMSVPPES 277 FLINDGIDRGLWDLQNKAERQNDILVKYRHMSVPPES Sbjct: 241 FLINDGIDRGLWDLQNKAERQNDILVKYRHMSVPPES 277 >UniRef50_A6EMN0 Putative assembly or transport protein 
for curli synthesis n=1 Tax=unidentified eubacterium SCB49 RepID=A6EMN0_9BACT Length = 458 Score = 276 bits (706), Expect = 5e-73, Method: Composition-based stats. Identities = 113/280 (40%), Positives = 164/280 (58%), Gaps = 3/280 (1%) Query: 1 MQRLFLLVAVMLLSGC--LTAPPKEAARPTLMPRAQSYKDLTHLPAPTGKIFVSVYNIQD 58 ++L LL + L+ C + P L++LP + K+ V VYN +D Sbjct: 4 FEKLVLLFVFVTLTSCGAMLNQPYNVQEARTGELTGKNNALSNLPKASDKVVVGVYNFRD 63 Query: 59 ETGQFKPYP-ASNFSTAVPQSATAMLVTALKDSRWFIPLERQGLQNLLNERKIIRAAQEN 117 +TGQFK S+FSTAV Q TA+L+ AL+DS WF P+ER+ L NLLNER II + + Sbjct: 64 QTGQFKLTDVGSSFSTAVSQGTTAILLKALEDSEWFRPIERENLNNLLNERSIIEKTRRD 123 Query: 118 GTVAINNRIPLQSLTAANIMVEGSIIGYESNVKSGGVGARYFGIGADTQYQLDQIAVNLR 177 T A L+ L A I++EG ++ Y++N+ +GG GARYFG G Y+ D+I V LR Sbjct: 124 YTPAGQQPQKLKPLLFAGILLEGGVVSYDTNILTGGAGARYFGAGGSVSYRQDRITVYLR 183 Query: 178 VVNVSTGEILSSVNTSKTILSYEVQAGVFRFIDYQRLLEGEVGYTSNEPVMLCLMSAIET 237 ++ STGE+L +V SKTILS AG+FRF+ ++RLLE E+G+T NEP L + AIE Sbjct: 184 AISTSTGEVLKTVYVSKTILSQGADAGIFRFVKFERLLEAEMGFTKNEPAELAVKEAIEK 243 Query: 238 GVIFLINDGIDRGLWDLQNKAERQNDILVKYRHMSVPPES 277 V+ L+ +GI LW+++ + + +L Y E+ Sbjct: 244 AVVDLVYEGIKDNLWNMEGEEDDVIGVLETYEREKAAEEA 283 >UniRef50_A5FHE8 Curli production assembly/transport component CsgG n=3 Tax=Flavobacteria RepID=A5FHE8_FLAJ1 Length = 454 Score = 264 bits (674), Expect = 2e-69, Method: Composition-based stats. 
Identities = 108/272 (39%), Positives = 164/272 (60%), Gaps = 6/272 (2%) Query: 5 FLLVAVMLLSGCLT--APPKEAARPTLMPRAQSYKDLTHLPAPTGKIFVSVYNIQDETGQ 62 ++ L +GC P + L + L LP P ++ V VY +D+TGQ Sbjct: 7 LFILIAFLFAGCGAYYNQPTGVQKAILGESTPATSLLKDLPKPKEQVVVGVYKFRDQTGQ 66 Query: 63 FKPYP-ASNFSTAVPQSATAMLVTALKDSRWFIPLERQGLQNLLNERKIIRAAQENGTVA 121 +KP S+FSTAV Q AT++L+ AL+DS+WFIP+ER+ + NLL ER +IRA ++ Sbjct: 67 YKPQENGSSFSTAVTQGATSILIKALEDSKWFIPIERENIGNLLQERNLIRATRQEYVKN 126 Query: 122 INNR-IPLQSLTAANIMVEGSIIGYESNVKSGGVGARYFGIGADTQYQLDQIAVNLRVVN 180 N L L A +++EG I+ Y+SN+ +GG GARYFG GA +Y+ D++ + LR+++ Sbjct: 127 ANPNEPQLTPLLYAGVLLEGGIVSYDSNIITGGFGARYFGAGASVKYRQDRVTIYLRMIS 186 Query: 181 VSTGEILSSVNTSKTILSYEVQAGVFRFIDYQRLLEGEVGYTSNEPVMLCLMSAIETGVI 240 S G+IL SV SKTILS + +FR+++++RLLE E GYT+NEPV + + AIE V Sbjct: 187 TSNGKILKSVYISKTILSQAIDESLFRYVNFKRLLEVETGYTTNEPVHMAVTEAIEKAVE 246 Query: 241 FLINDGIDRGLWDLQNKAERQNDILVK-YRHM 271 L+ +G+ +W+ + + Q D L+K Y Sbjct: 247 SLVLEGLQDNIWE-ADAPKWQVDNLIKAYNEE 277 >UniRef50_Q3KES0 Assembly/transport component in curli production n=8 Tax=Gammaproteobacteria RepID=Q3KES0_PSEPF Length = 286 Score = 260 bits (664), Expect = 4e-68, Method: Composition-based stats. 
Identities = 134/285 (47%), Positives = 193/285 (67%), Gaps = 8/285 (2%) Query: 1 MQRLFLL-VAVMLLSGCLTAPP----KEAARPTLMPRAQSYKDLTHLPAPTGKIFVSVYN 55 M+++ L + + L GC P ++ PTL PRA +Y DL +P P G++ VY Sbjct: 1 MKKIIALGLMLAALQGCSLREPMPAEQDTDTPTLTPRASTYYDLLKMPRPKGRLMAVVYG 60 Query: 56 IQDETGQFKPYPASNFSTAVPQSATAMLVTALKDSRWFIPLERQGLQNLLNERKIIRAAQ 115 +D+TGQ+KP PAS+FST+V Q A +ML+ A++ S WF+ LER+GLQNLL ERKIIRA+Q Sbjct: 61 FRDQTGQYKPTPASSFSTSVTQGAASMLMDAMQASGWFVVLEREGLQNLLTERKIIRASQ 120 Query: 116 ENGTVAINNRIPLQSLTAANIMVEGSIIGYESNVKSGGVGARYFGIGADTQYQLDQIAVN 175 + +N + L L AAN+M+EG II Y++NV+SGG GARY GI +Y++DQ+ VN Sbjct: 121 KKPNTPVNIQGELPPLQAANMMLEGGIIAYDTNVRSGGEGARYLGIDLSREYRVDQVTVN 180 Query: 176 LRVVNVSTGEILSSVNTSKTILSYEVQAGVFRFIDYQRLLEGEVGYTSNEPVMLCLMSAI 235 LR V+V +G++L++V TSKTI S AG+F+FI++++LLE EVGYT+NEP LC++SAI Sbjct: 181 LRAVDVRSGQVLANVMTSKTIYSVARSAGIFKFIEFKKLLEAEVGYTTNEPAQLCVLSAI 240 Query: 236 ETGVIFLINDGIDRGLWDLQNKAER--QNDILVKY-RHMSVPPES 277 E V ++ GI+R LW + A Q+D+L +Y V P++ Sbjct: 241 EAAVGHMVAQGIERRLWQVAGDASTPSQDDVLNRYLTQNKVDPDA 285 >UniRef50_C6X5S1 Curli production assembly/transport component CsgG n=3 Tax=Bacteroidetes RepID=C6X5S1_FLAB3 Length = 456 Score = 256 bits (653), Expect = 8e-67, Method: Composition-based stats. 
Identities = 118/281 (41%), Positives = 171/281 (60%), Gaps = 4/281 (1%) Query: 1 MQRLFLLVAVMLLSGCLT-APPKEAARPTLMPRAQSYKDLTHLPAPTGKIFVSVYNIQDE 59 + ++ LL ++ LS C P + + TL ++ +LPAP KI V VY +D+ Sbjct: 6 LTKIALLTPLIFLSSCTLFNLPTNSEKSTLGEVTPYTPEIKNLPAPKEKIVVGVYKFRDQ 65 Query: 60 TGQFKPYP-ASNFSTAVPQSATAMLVTALKDSRWFIPLERQGLQNLLNERKIIRAAQENG 118 TGQ+K +++STA+PQ T +L+ AL+DSRWF P+ER+ + NLLNER+IIR+ ++ Sbjct: 66 TGQYKAAENGASWSTAIPQGTTTILLKALEDSRWFTPIERENIGNLLNERQIIRSTRKEY 125 Query: 119 -TVAINNRIPLQSLTAANIMVEGSIIGYESNVKSGGVGARYFGIGADTQYQLDQIAVNLR 177 N L L A I++EG +I Y++NV +GG+GARYFG+GA QY+ D+I V LR Sbjct: 126 AGNDANEAALLPPLLFAGIILEGGVISYDTNVMTGGIGARYFGLGAGAQYRQDRITVYLR 185 Query: 178 VVNVSTGEILSSVNTSKTILSYEVQAGVFRFIDYQRLLEGEVGYTSNEPVMLCLMSAIET 237 V+ S+GEIL +V TSKTILS + FRFID +RLLE ++G T NEPV L + AIE Sbjct: 186 AVSTSSGEILKTVYTSKTILSTSINGNFFRFIDTERLLESDIGITQNEPVHLAVTEAIEK 245 Query: 238 GVIFLINDGIDRGLW-DLQNKAERQNDILVKYRHMSVPPES 277 V+ LI +G+ LW + Q + I+ Y ++ Sbjct: 246 AVLSLIVEGVRDNLWTNKQKTPNDFDKIIEGYTSEVKVNDA 286 >UniRef50_Q392B7 Curli production assembly/transport component CsgG n=2 Tax=Proteobacteria RepID=Q392B7_BURS3 Length = 312 Score = 254 bits (648), Expect = 3e-66, Method: Composition-based stats. 
Identities = 114/272 (41%), Positives = 172/272 (63%), Gaps = 2/272 (0%) Query: 1 MQRLFLLVAVML-LSGCLTAPPKEAARPTLMPRAQSYKDLTHLPAPTGKIFVSVYNIQDE 59 M R+ + ++L L GC+T P + TL P ++ +DLTHLP P GKI +VY +D Sbjct: 14 MTRIAMGAVLLLSLVGCVTRPMPALSNATLTPPTRTTRDLTHLPPPKGKIVAAVYGFRDL 73 Query: 60 TGQFKPYPASNFSTAVPQSATAMLVTALKDSRWFIPLERQGLQNLLNERKIIRAAQENGT 119 TGQ+K P S+FS+ V Q + LV A++DS WF P+ER+ LQ+LL ERKI+RA + Sbjct: 74 TGQYKASPDSSFSSQVTQGGASFLVKAMRDSGWFTPVERENLQDLLTERKIMRATDGSDA 133 Query: 120 VAINNRIPLQSLTAANIMVEGSIIGYESNVKSGGVGARYFGIGADTQYQLDQIAVNLRVV 179 N L ANI++EG I+GY++NV++GG G Y GI TQY++DQ+ VNLR + Sbjct: 134 KKAQNDAMAP-LMPANIVLEGGIVGYDTNVRTGGAGVAYLGISGSTQYRIDQVTVNLRAI 192 Query: 180 NVSTGEILSSVNTSKTILSYEVQAGVFRFIDYQRLLEGEVGYTSNEPVMLCLMSAIETGV 239 ++ TG++L+SV+T+KT+ SY+V G++RF+ ++ LL+ E G T NEP +C+ AIE+ + Sbjct: 193 DIRTGQVLNSVSTTKTVYSYQVDTGIYRFVGFKDLLQAEAGLTRNEPAQICVNEAIESAL 252 Query: 240 IFLINDGIDRGLWDLQNKAERQNDILVKYRHM 271 LI G+ W L+N + + + +Y Sbjct: 253 THLIVQGVANQTWVLKNDQDWYDPTMQRYLQE 284 >UniRef50_A3XJL0 Putative assembly or transport protein for curli synthesis n=2 Tax=Bacteria RepID=A3XJL0_9FLAO Length = 470 Score = 250 bits (638), Expect = 4e-65, Method: Composition-based stats. 
Identities = 113/266 (42%), Positives = 151/266 (56%), Gaps = 2/266 (0%) Query: 14 SGCLTAPPKEAARPTLMPRAQSYKDLTHLPAPTGKIFVSVYNIQDETGQFK-PYPASNFS 72 G P L LT P P + VYN +D+TGQ+K S FS Sbjct: 19 CGSYFNQPLSQQDARLGEVTSHTTTLTQFPEPAEPVVAGVYNFKDQTGQYKNVENGSTFS 78 Query: 73 TAVPQSATAMLVTALKDSRWFIPLERQGLQNLLNERKIIRAAQENGT-VAINNRIPLQSL 131 TAV Q AT ML+ AL+DS+WF P+ER+ L NLLNER IIR+ ++ N L L Sbjct: 79 TAVSQGATTMLIKALEDSKWFTPIERENLGNLLNERNIIRSTRDEYRKNNNPNEPNLPPL 138 Query: 132 TAANIMVEGSIIGYESNVKSGGVGARYFGIGADTQYQLDQIAVNLRVVNVSTGEILSSVN 191 A +++EG II Y+SN+ +GG+GARYFG+G TQY+ D++ V LR V+ S+G +L +V Sbjct: 139 LYAGVLLEGGIISYDSNIITGGLGARYFGVGGSTQYRQDRLTVYLRAVSTSSGRVLKTVY 198 Query: 192 TSKTILSYEVQAGVFRFIDYQRLLEGEVGYTSNEPVMLCLMSAIETGVIFLINDGIDRGL 251 SKTILS + A +FR++++QRLLE E GYT NEPV L + AIE V LI +GI L Sbjct: 199 VSKTILSQAIDASLFRYVNFQRLLEVETGYTKNEPVQLAMKDAIEKAVESLIIEGIKDNL 258 Query: 252 WDLQNKAERQNDILVKYRHMSVPPES 277 W + ++ Y ES Sbjct: 259 WSSKEGVTVNEALIENYEKEKELEES 284 >UniRef50_Q084E4 Curli production assembly/transport component CsgG n=10 Tax=Gammaproteobacteria RepID=Q084E4_SHEFN Length = 283 Score = 248 bits (634), Expect = 1e-64, Method: Composition-based stats. 
Identities = 137/272 (50%), Positives = 187/272 (68%), Gaps = 10/272 (3%) Query: 1 MQRLFLLVAVMLLSGCLTAPPKEA---ARPTLMPRAQSYKDLTHLPAPTGKIFVSVYNIQ 57 M+RL L + ++ LS C + + A +LMP+ ++Y DL LP+P G + +VY+ + Sbjct: 1 MKRLVLSLFILSLSACSSIESEFDGIEATTSLMPKGETYYDLVSLPSPQGSMVAAVYDFR 60 Query: 58 DETGQFKPYPASNFSTAVPQSATAMLVTALKDSRWFIPLERQGLQNLLNERKIIRAAQEN 117 D+TGQ+KP P+SNFSTAVPQS TA L AL DS WF+P+ER+GLQNLL ERKI+RA Sbjct: 61 DQTGQYKPIPSSNFSTAVPQSGTAFLAQALNDSAWFVPVEREGLQNLLTERKIVRAGL-- 118 Query: 118 GTVAINNRIPLQSLTAANIMVEGSIIGYESNVKSGGVGARYFGIGADTQYQLDQIAVNLR 177 + L L +A I++EG I+ Y++N+K+GG GARY GIG QY++D I VNLR Sbjct: 119 ----NGDASKLPQLNSAQILMEGGIVAYDTNIKTGGAGARYLGIGVSGQYRVDSITVNLR 174 Query: 178 VVNVSTGEILSSVNTSKTILSYEVQAGVFRFIDYQRLLEGEVGYTSNEPVMLCLMSAIET 237 V++ TG +LSSV T+K ++S E+ AGVF+FID Q LLE EVGYTSNEPV LC+ +AIE+ Sbjct: 175 AVDIRTGRLLSSVTTTKAVISKEITAGVFKFIDAQELLESEVGYTSNEPVSLCIAAAIES 234 Query: 238 GVIFLINDGIDRGLWDLQNKAER-QNDILVKY 268 V+ +I DGI + W+L + A +N L KY Sbjct: 235 AVVHMIADGIWKRAWNLLDAASGVKNPTLQKY 266 >UniRef50_Q1NBA5 Putative curli production assembly/transport component csgg n=3 Tax=Sphingomonadales RepID=Q1NBA5_9SPHN Length = 336 Score = 244 bits (623), Expect = 2e-63, Method: Composition-based stats. 
Identities = 111/287 (38%), Positives = 160/287 (55%), Gaps = 16/287 (5%) Query: 1 MQRLFLLV--AVMLLSGC-LTAPPKEAARPTLMP------RAQSYKDLTHLPAPTGKIFV 51 M R L A +LL GC A P++ P Q+ + L LP P + + Sbjct: 1 MIRQIFLASGAALLLGGCNSLATTGRDDIPSMNPMAVYPRYTQAQRQLMDLPPPQRPVAI 60 Query: 52 SVYNIQDETGQFKPYPA--SNFSTAVPQSATAMLVTALKDSR---WFIPLERQGLQNLLN 106 +VYN D+TGQ++ S AV Q A ++LV AL+D+ WF +ER+ L+NLLN Sbjct: 61 AVYNFSDQTGQYRVGEGGTQTLSRAVTQGAASILVRALQDAGNRKWFTIVEREQLRNLLN 120 Query: 107 ERKIIRAAQENGTVAIN-NRIPLQSLTAANIMVEGSIIGYESNVKSGGVGARYFGIGADT 165 ER+IIR +E N N L ++ A +++EG II +++N +GG GA + GIGA T Sbjct: 121 ERQIIREMRERYLGENNVNPQALPAMLFAGVLLEGGIISFDTNTVTGGAGASFLGIGAST 180 Query: 166 QYQLDQIAVNLRVVNVSTGEILSSVNTSKTILSYEVQAGVFRFIDYQRLLEGEVGYTSNE 225 QY+ D + V LR ++V TGE+L++V SKTI S + A FRF+ ++ LLE EVG T+NE Sbjct: 181 QYRQDTVTVYLRAISVRTGEVLTTVTASKTIASQSLGASAFRFVGFKELLEAEVGMTTNE 240 Query: 226 PVMLCLMSAIETGVIFLINDGIDRGLWDLQNKAERQNDILVKYRHMS 272 P + L AIE V L+ +GID LW+ + +L +YR Sbjct: 241 PDHIALQQAIEKAVYGLVMEGIDLNLWNFADTQAG-WPMLWRYRQER 286 >UniRef50_D2QPP9 Curli production assembly/transport component CsgG n=2 Tax=Spirosoma linguale DSM 74 RepID=D2QPP9_9SPHI Length = 478 Score = 243 bits (619), Expect = 6e-63, Method: Composition-based stats. 
Identities = 107/275 (38%), Positives = 156/275 (56%), Gaps = 8/275 (2%) Query: 3 RLFLLVAVMLLSGCLT--APPKEAARPTLMPRAQSYKDLTHLPAPTGKIFVSVYNIQDET 60 R+ + +LSGC P R L + L +LP K+ V+VY +D+T Sbjct: 12 RIIPFSCLWVLSGCAAYLHQPTGLQRARLGEETTTTAALRNLPKAKEKVVVAVYKFRDQT 71 Query: 61 GQFK-PYPASNFSTAVPQSATAMLVTALKDSRWFIPLERQGLQNLLNERKIIRAAQENGT 119 GQ+K S FST V Q T +L+ AL++S WF +ER+ + NLLNERKIIR++ Sbjct: 72 GQYKLSETGSTFSTVVSQGTTNILLKALEESGWFTTIERENVSNLLNERKIIRSSVAQYK 131 Query: 120 VAINNRIPLQSLTAANIMVEGSIIGYESNVKSGGVGARYFGIGADTQYQLDQIAVNLRVV 179 N L L A +++EG I+ Y++N+ +GG G RYF G TQY+ D++ V LR V Sbjct: 132 EGEN----LPPLLFAGVILEGGIVSYDANIITGGAGLRYFATGGSTQYRQDRVTVYLRAV 187 Query: 180 NVSTGEILSSVNTSKTILSYEVQAGVFRFIDYQRLLEGEVGYTSNEPVMLCLMSAIETGV 239 +G+IL +V TSKTILS V AG+FR++ +++LLE E G+++NEP + + AIE V Sbjct: 188 ATRSGKILKTVYTSKTILSQSVDAGIFRYVTFKKLLEAETGFSTNEPSQMAVTEAIEKAV 247 Query: 240 IFLINDGIDRGLWDLQNKAER-QNDILVKYRHMSV 273 L+ +GI GLW + +K L +Y V Sbjct: 248 QALVLEGIQDGLWAVSDKDTGVAKRELDRYDAEKV 282 >UniRef50_Q0TJ37 Curli production assembly/transport component CsgG n=1 Tax=Escherichia coli 536 RepID=Q0TJ37_ECOL5 Length = 228 Score = 241 bits (614), Expect = 3e-62, Method: Composition-based stats. 
Identities = 206/222 (92%), Positives = 208/222 (93%) Query: 56 IQDETGQFKPYPASNFSTAVPQSATAMLVTALKDSRWFIPLERQGLQNLLNERKIIRAAQ 115 + + G P AVPQSATAMLVTALKDSRWFIPLERQGLQNLLNERKIIRAAQ Sbjct: 7 FRTKPGNLNPTRQVTSPLAVPQSATAMLVTALKDSRWFIPLERQGLQNLLNERKIIRAAQ 66 Query: 116 ENGTVAINNRIPLQSLTAANIMVEGSIIGYESNVKSGGVGARYFGIGADTQYQLDQIAVN 175 ENGTVAINNRIPLQSLTAANIMVEGSIIGYESNVKSGGVGARYFGIGADTQYQLDQIAVN Sbjct: 67 ENGTVAINNRIPLQSLTAANIMVEGSIIGYESNVKSGGVGARYFGIGADTQYQLDQIAVN 126 Query: 176 LRVVNVSTGEILSSVNTSKTILSYEVQAGVFRFIDYQRLLEGEVGYTSNEPVMLCLMSAI 235 LRVVNVSTGEILSSVNTSKTILSYEVQAGVFRFIDYQRLLEGEVGYTSNEPVMLCLMSAI Sbjct: 127 LRVVNVSTGEILSSVNTSKTILSYEVQAGVFRFIDYQRLLEGEVGYTSNEPVMLCLMSAI 186 Query: 236 ETGVIFLINDGIDRGLWDLQNKAERQNDILVKYRHMSVPPES 277 ETGVIFLINDGIDRGLWDLQNKAERQNDILVKYRHMSVPPES Sbjct: 187 ETGVIFLINDGIDRGLWDLQNKAERQNDILVKYRHMSVPPES 228 >UniRef50_B6QZQ0 Curli fiber membrane-associated lipoprotein CsgG n=1 Tax=Pseudovibrio sp. JE062 RepID=B6QZQ0_9RHOB Length = 316 Score = 236 bits (601), Expect = 7e-61, Method: Composition-based stats. 
Identities = 94/280 (33%), Positives = 145/280 (51%), Gaps = 9/280 (3%) Query: 1 MQRLFLLVAVMLLSGC--LTAPPKEAARPTLMPRAQSYKDLTHLPAPTGKIFVSVYNIQD 58 M++ ++ + LS C +T L + L LP P ++ +VY+ QD Sbjct: 2 MKKSIVVAMALSLSACGPVTRMAFSPGPQ-LATVSVQASKLKKLPPPKEPVYAAVYSYQD 60 Query: 59 ETGQFKPYP-ASNFSTAVPQSATAMLVTALKDSR---WFIPLERQGLQNLLNERKIIRAA 114 TGQ+KP S +V Q A MLV AL+D+ WF +ER L LL ER+I+ Sbjct: 61 LTGQYKPSDKVQTLSRSVTQGADTMLVRALQDAGDRKWFRVVERGNLDALLKERQIVTQI 120 Query: 115 QENGTVAINNRIP-LQSLTAANIMVEGSIIGYESNVKSGGVGARYFGIGADTQYQLDQIA 173 ++ L L A ++ EG ++GY+SN ++GG GARY GIG + Y+ D + Sbjct: 121 RKIYLGEDKVDPKVLPPLLYAGVLFEGGVVGYDSNTRTGGAGARYLGIGGNADYRQDDVT 180 Query: 174 VNLRVVNVSTGEILSSVNTSKTILSYEVQAGVFRFIDYQRLLEGEVGYTSNEPVMLCLMS 233 V+LR V+ TGE++++V K+++S ++ G FR+I +LE E G T NEPV L + Sbjct: 181 VSLRAVSTRTGEVMANVMVQKSVVSVGLKGGAFRYIALDEILEAEAGITKNEPVTLAVQQ 240 Query: 234 AIETGVIFLINDGIDRGLWDLQNKAERQNDILVKYRHMSV 273 AIE V ++ +G G W +N A+ N + +Y Sbjct: 241 AIEKAVYAIVMEGARVGAWSFENPAQA-NALYQEYTSEKA 279 >UniRef50_A9L3G5 Curli production assembly/transport component CsgG n=20 Tax=Alteromonadales RepID=A9L3G5_SHEB9 Length = 268 Score = 225 bits (574), Expect = 9e-58, Method: Composition-based stats. 
Identities = 107/255 (41%), Positives = 161/255 (63%), Gaps = 9/255 (3%) Query: 2 QRLFLLVAVMLLSGCLTAPPK--EAARPTLMPRAQSYKDLTHLPAPTGKIFVSVYNIQDE 59 + +F + ++ +S C P + P ++ + L P P I V+VY+ +D+ Sbjct: 3 RLVFWGLLLLSMSACSLIPKPDLNITPAEVNPLSEVMRGLQTQPGPKFPIPVAVYSFRDQ 62 Query: 60 TGQFKPYPA-SNFSTAVPQSATAMLVTALKDSRWFIPLERQGLQNLLNERKIIRAAQENG 118 TGQ+KP S+FSTAV Q AT+ML+ L DS+WF P+ER+GLQNLL ERKI +++G Sbjct: 63 TGQYKPQANVSSFSTAVTQGATSMLMQTLLDSKWFTPVEREGLQNLLTERKISN--KQSG 120 Query: 119 TVAINNRIPLQSLTAANIMVEGSIIGYESNVKSGGVGARYFGIGADTQYQLDQIAVNLRV 178 T + + L+ A +++EG +I YE+N +GG G Y+GIGA Y+ DQ+ + LR Sbjct: 121 TKGDD----VPVLSTARLLLEGGVISYETNTSTGGSGVEYYGIGASEMYREDQVTIYLRA 176 Query: 179 VNVSTGEILSSVNTSKTILSYEVQAGVFRFIDYQRLLEGEVGYTSNEPVMLCLMSAIETG 238 V+V TG+++ SV+TSK +LS E++AG+FR+ RL E E+G+T+NEPV C++ AIE Sbjct: 177 VDVHTGKVMMSVSTSKRVLSQEMRAGLFRYTSLNRLAEAEIGFTTNEPVQFCVLQAIELA 236 Query: 239 VIFLINDGIDRGLWD 253 V LI+ GI +G W Sbjct: 237 VAELIDKGIKQGYWS 251 >UniRef50_B3QHQ8 Curli production assembly/transport component CsgG n=6 Tax=Rhizobiales RepID=B3QHQ8_RHOPT Length = 313 Score = 218 bits (555), Expect = 2e-55, Method: Composition-based stats. 
Identities = 84/245 (34%), Positives = 135/245 (55%), Gaps = 8/245 (3%) Query: 36 YKDLTHLPAPTGKIFVSVYNIQDETGQFKPYPA-SNFSTAVPQSATAMLVTALKDSR--- 91 +L LP P KI +++Y D TG+ +P + FS AV Q + + ALK + Sbjct: 44 AVNLETLPPPKQKIDIAIYQFPDLTGKNEPNDNVAVFSRAVTQGGAGLAIDALKRAGGGA 103 Query: 92 WFIPLERQGLQNLLNERKIIRAAQENGTVAINNRIPLQSLTAANIMVEGSIIGYESNVKS 151 WF +ER GL +LL ER+++RA ++ + PL + A +++EG I+ +++N + Sbjct: 104 WFRVVERNGLNDLLQERQLVRATRQE--FDRDRAKPLPPMRFAGLLIEGGIVAFDANYMT 161 Query: 152 GGVGARYFGIGADTQYQLDQIAVNLRVVNVSTGEILSSVNTSKTILSYEVQAGVFRFIDY 211 GG+GA Y GIGADT+++ D + V LRV +V TGE+L+SV T+KT+ S +Q ++++ Sbjct: 162 GGIGANYLGIGADTKFRRDMVTVALRVASVQTGEVLTSVTTTKTVYSVSLQGNTYKYVAL 221 Query: 212 QRLLEGEVGYTSNEPVMLCLMSAIETGVIFLINDGIDRGLWDLQNKAERQNDILVKY-RH 270 +LL+ E G T EP L + AI+ V I +G LW + A + ++ Y Sbjct: 222 DKLLQIEAGITRTEPTQLAVRQAIDLAVYSTIMEGARDKLWRFADPAM-EAKLIRDYLDR 280 Query: 271 MSVPP 275 P Sbjct: 281 DKPQP 285 >UniRef50_Q1YMP1 Putative Curli production assembly/transport component n=1 Tax=Aurantimonas manganoxydans SI85-9A1 RepID=Q1YMP1_MOBAS Length = 400 Score = 218 bits (554), Expect = 2e-55, Method: Composition-based stats. 
Identities = 102/265 (38%), Positives = 145/265 (54%), Gaps = 5/265 (1%) Query: 2 QRLFLLVAVMLLSGCLTAPPKEAARPTLMPRAQSYKDLTHLPAPTGKIFVSVYNIQDETG 61 + ++VA+ LSGC+T P + P ++ DL +P P K V+VY +D TG Sbjct: 12 RLAVMIVALASLSGCVTQEAFTDTPPVIAPVSRPNDDLRRVPPPRQKTVVAVYGYEDLTG 71 Query: 62 QFK-PYPASNFSTAVPQSATAMLVTALKDSR---WFIPLERQGLQNLLNERKIIRAAQEN 117 QFK + S AV Q +ML+ AL+D+ WF LER L NLL ER+II + Sbjct: 72 QFKERENVQSLSRAVTQGGASMLIQALQDAGERRWFTVLERAELDNLLKERQIITEMRRL 131 Query: 118 GTVAINNRIPL-QSLTAANIMVEGSIIGYESNVKSGGVGARYFGIGADTQYQLDQIAVNL 176 + L ANI++EG IIGY++N+ +GGVGA + GI ADT+Y D + V L Sbjct: 132 YRNETQLDPKVVPPLLHANIIIEGGIIGYDTNIMTGGVGAGFLGISADTKYIHDVVTVTL 191 Query: 177 RVVNVSTGEILSSVNTSKTILSYEVQAGVFRFIDYQRLLEGEVGYTSNEPVMLCLMSAIE 236 R V+ TGE+L++V K + SY +Q G FR++ L E G T NEP + + SAIE Sbjct: 192 RAVSTKTGEVLTTVTVRKAVASYALQGGAFRYVKIDELFMAEAGVTYNEPKQIAVQSAIE 251 Query: 237 TGVIFLINDGIDRGLWDLQNKAERQ 261 V LI +G D +W+ + A + Sbjct: 252 KAVEGLIVEGADLSIWEFSDPAAGR 276 >UniRef50_D2LDX9 Curli production assembly/transport component CsgG n=1 Tax=Rhodomicrobium vannielii ATCC 17100 RepID=D2LDX9_RHOVA Length = 397 Score = 217 bits (553), Expect = 3e-55, Method: Composition-based stats. 
Identities = 113/287 (39%), Positives = 151/287 (52%), Gaps = 22/287 (7%) Query: 4 LFLLVAVMLLSGCL-------------TAPPKEAARPTLMPRAQSYKDLTHLPAPTGKIF 50 L + GC AP PT+ P + L LP P + Sbjct: 8 LASASIALSAGGCADYSYLKDQIGETVAAPNTGLLPPTVNPINSTNSKLRELPPPKAPVA 67 Query: 51 VSVYNIQDETGQFKPYP----ASNFSTAVPQSATAMLVTALKDSR---WFIPLERQGLQN 103 V+VY D+TGQFKP + S AV Q AT++L+ AL+D+ WF +ER+ L N Sbjct: 68 VAVYGYGDQTGQFKPVAEGANVQSLSRAVTQGATSILMKALQDAGNGRWFTVVERERLDN 127 Query: 104 LLNERKIIRAAQENGTVAIN-NRIPLQSLTAANIMVEGSIIGYESNVKSGGVGARYFGIG 162 LL ER+II ++ + L L A ++++G IIGY+SN K+GG GA+YFGIG Sbjct: 128 LLKERRIIADMRQRYLGEQVVDPAALPPLLFAGVLIDGGIIGYDSNTKTGGAGAKYFGIG 187 Query: 163 ADTQYQLDQIAVNLRVVNVSTGEILSSVNTSKTILSYEVQAGVFRFIDYQRLLEGEVGYT 222 D +Y D + V LR +V TG++L SV +KTILSY VQ FRF+ Y RL E E G T Sbjct: 188 GDVKYSEDTVTVYLRATSVKTGQVLLSVVANKTILSYGVQGSAFRFVTYNRLFEAEGGLT 247 Query: 223 SNEPVMLCLMSAIETGVIFLINDGIDRGLWDLQNKAERQNDILVKYR 269 NEP L + AIE V+ I +G RGLW Q+K Q+ I+ Y Sbjct: 248 MNEPGSLAVEQAIEKAVLTFIVEGSARGLWSFQDKT-FQSRIIQDYE 293 >UniRef50_B3QD06 Curli production assembly/transport component CsgG n=4 Tax=Alphaproteobacteria RepID=B3QD06_RHOPT Length = 291 Score = 213 bits (541), Expect = 7e-54, Method: Composition-based stats. 
Identities = 90/265 (33%), Positives = 139/265 (52%), Gaps = 9/265 (3%) Query: 4 LFLLVAVMLLSGCL---TAPPKEAARPTLMPRAQSYKDLTHLPAPTGKIFVSVYNIQDET 60 L ++ + L GC T T++ ++ L LP PT K+ V+VYN D T Sbjct: 8 LACVLLAVSLGGCAITGTDKDPVTPPATMVASTKTGVVLEQLPPPTKKLDVAVYNFPDLT 67 Query: 61 GQFKPYPA-SNFSTAVPQSATAMLVTALKDSR---WFIPLERQGLQNLLNERKIIRAAQE 116 GQ K + FS AV Q +A+L L + WF ER LQ LL ER+II+ + Sbjct: 68 GQNKSNDNFAEFSRAVTQGGSAILTDVLLTAGGGHWFDVAERADLQPLLQERQIIQNTRS 127 Query: 117 NGTVAINNRIPLQSLTAANIMVEGSIIGYESNVKSGGVGARYFGIGADTQYQLDQIAVNL 176 + L L A +++EG I+GY++N +GG+GA Y G+G + QY+ D + V+L Sbjct: 128 A--LQGEKAQSLPPLRFAGVLLEGGIVGYDTNETTGGIGANYLGLGGNMQYRQDIVTVSL 185 Query: 177 RVVNVSTGEILSSVNTSKTILSYEVQAGVFRFIDYQRLLEGEVGYTSNEPVMLCLMSAIE 236 R V+V TG +L++V T+K I S V F+++ LL+ + G+T N P L + I+ Sbjct: 186 RAVSVQTGRVLAAVTTTKIIYSVNVSGSGFKYVAIDSLLQADAGFTKNSPTTLAVREGIQ 245 Query: 237 TGVIFLINDGIDRGLWDLQNKAERQ 261 V LI +G+ + LW+ ++ A Sbjct: 246 LAVYSLIFEGVKKELWNFKDPAAGT 270 >UniRef50_A7HWH6 HfaB protein n=1 Tax=Parvibaculum lavamentivorans DS-1 RepID=A7HWH6_PARL1 Length = 342 Score = 185 bits (470), Expect = 1e-45, Method: Composition-based stats. 
Identities = 66/297 (22%), Positives = 118/297 (39%), Gaps = 34/297 (11%) Query: 2 QRLFLLVAVMLLSGCLTAPPK-----------EAARPTLMPRAQSYKDLT-HLPAPTGKI 49 + L L L+GC++A P + + + L H+P+ Sbjct: 17 RTLGALAMAFALTGCVSANAGSDGRYVAPIGNAPVITNETPYSSALRCLAGHVPSAANTT 76 Query: 50 FVSVYNIQDETGQFKPYPASNFSTAVPQSATAMLVTALKDSRWFIPLER-----QGLQNL 104 ++V NI+D TG+ A V Q A+ M ++AL + +ER L+ Sbjct: 77 RIAVGNIRDYTGK---AEADGTGMKVTQGASLMAMSALGKAG-VPLVERYDTSISELEMK 132 Query: 105 LNERKIIRAAQENGTVAINNRIPLQSLTAANIMVEGSIIGYESNVKSGGVGARYFGIGAD 164 K+I A + +I + ++ + G I N++SGG+ Y G Sbjct: 133 YTNNKLIGDAVD----GEYRKIYAGEIRGSDYYLVGGITELNFNIQSGGINGSYAEGGDM 188 Query: 165 TQ-------YQLDQIAVNLRVVNVSTGEILSSVNTSKTILSYEVQAGVFRFIDYQRLLEG 217 + I ++LR+VN T E++ V+ K I+ E++AGVF F L Sbjct: 189 GAAANASAATYVMNIGLDLRLVNSRTLEVVDYVSYQKQIVGREIKAGVFDFFG-GNLFSL 247 Query: 218 EVGYTSNEPVMLCLMSAIETGVIFLINDGIDRGLWDLQNKAERQNDI-LVKYRHMSV 273 VG ++ EP+ L + + +E V+ L+ D ++ + + YR + Sbjct: 248 GVGSSAQEPIQLAVRAVVERAVLKLLVPLYGVKPADCARFPGNKDPMGEIDYRQANP 304 >UniRef50_C5SPX6 HfaB protein n=1 Tax=Asticcacaulis excentricus CB 48 RepID=C5SPX6_9CAUL Length = 316 Score = 185 bits (469), Expect = 1e-45, Method: Composition-based stats. 
Identities = 68/293 (23%), Positives = 116/293 (39%), Gaps = 37/293 (12%) Query: 1 MQRLFLLVAVMLLSGCL--TAPPK----------EAARPTLMPRAQSYKDLTHLPAPTGK 48 M+RL + +LLS C TA P+ A L P + L T K Sbjct: 1 MRRLVFCLP-LLLSACASVTADPQTGLYAKPVGNAPATANLTPYSADLTCLQQAALVTHK 59 Query: 49 I--FVSVYNIQDETGQFKPYPASNFSTAVPQSATAMLVTALKDSRWFIPLERQGLQNLLN 106 V+V I D TG+ Y +N + Q +AL + +ER Sbjct: 60 PLPRVAVSRIDDLTGKRDFYTGAN----ITQGIALFAQSALSRAG-LPQVERGDRDISDY 114 Query: 107 ERK------IIRAAQENGTVAINNR-IPLQSLTAANIMVEGSIIGYESNVKSGGVGARYF 159 E K + + G N R + + ++ + G + N++S GV R Sbjct: 115 ELKAAMDHVLSDTPDQAGNDPDNFRKVYAGQIAGSDYYISGGLTELNYNIRSDGVDLRAG 174 Query: 160 GIGAD-------TQYQLDQIAVNLRVVNVSTGEILSSVNTSKTILSYEVQAGVFRFIDYQ 212 G G+D ++ + IA++LR+++ + EI K I+ +EV+ G+F +D Sbjct: 175 GTGSDDPAGSFVSRRFVMNIALDLRLIDTRSQEIKRVTAYQKQIVGHEVKPGLFTLLD-G 233 Query: 213 RLLEGEVGYTSNEPVMLCLMSAIETGVIFL--INDGIDRGLWDLQNKAERQND 263 +L+ G++ EP+ L + + +E V + G+D + A QN Sbjct: 234 TMLDLSGGFSEMEPIQLGVRTLVERSVYDFAVVLYGMDPSVCRNGGVATTQNP 286 >UniRef50_A6GPY8 HfaB protein n=1 Tax=Limnobacter sp. MED105 RepID=A6GPY8_9BURK Length = 370 Score = 171 bits (433), Expect = 2e-41, Method: Composition-based stats. 
Identities = 56/284 (19%), Positives = 107/284 (37%), Gaps = 27/284 (9%) Query: 2 QRLFLLVAVMLLSGCLTAPPK-----------EAARPTLMPRAQSYKDLTHLPAPTGK-- 48 + + V S C T P + + + + L L G Sbjct: 8 KFALISVIAASASACTTLPSFSEAEYVSPFDGASVVENTTRYSPALECLKPLVGGRGPNA 67 Query: 49 IFVSVYNIQDETGQFKPYPASNFSTAVPQSATAMLVTALKDSRWFIPLER-----QGLQN 103 +V + D TG+ + Q A M+++AL + +ER ++ Sbjct: 68 KRFAVGRVSDFTGKEDLVNG----KRITQGAALMVISALAKTG-VPMVERFDTTIADMEL 122 Query: 104 LLNERKIIRAAQENGTVAINNRIPLQSLTAANIMVEGSIIGYESNVKSGGVGARYFGIGA 163 + K+I N + +I SL ++ + G I N++SG + + IGA Sbjct: 123 KYADNKLITD---NPDSKAHRQIFSGSLPGSDYHIVGGITEVNYNIRSGSLESSIRFIGA 179 Query: 164 DTQYQLDQIAVNLRVVNVSTGEILSSVNTSKTILSYEVQAGVFRFIDYQRLLEGEVGYTS 223 +Y + +AV+LRVVN T E++++ + K I+ E+ G FR L++ + Sbjct: 180 AARYFVMNVAVDLRVVNTKTLEVVNTQSLQKQIIGTELNGGYFRLFS-DGLVDVSAAERT 238 Query: 224 NEPVMLCLMSAIETGVIFLINDGIDRGLWDLQNKAERQNDILVK 267 EP+ + IE V ++ + + + ++ L Sbjct: 239 QEPIQKGVRMVIEHAVFNMLTEMNNVSAQSCARLSTAKSGELRN 282 >UniRef50_Q1QYN5 Curli production assembly/transport component CsgG n=1 Tax=Chromohalobacter salexigens DSM 3043 RepID=Q1QYN5_CHRSD Length = 271 Score = 168 bits (424), Expect = 3e-40, Method: Composition-based stats. 
Identities = 62/248 (25%), Positives = 107/248 (43%), Gaps = 16/248 (6%) Query: 15 GCLTAPPKEAAR-------PTLMPRAQSYKDLTHLPAPTGKIFVSVYNIQDETGQFKPYP 67 GC + A P + L+ P + VSV I D TGQ Y Sbjct: 21 GCASIGTAPVAPREGSRVMTNHTPYTRCLSALSQQPGENLPV-VSVGQILDRTGQVS-YS 78 Query: 68 ASNFSTAVPQSATAMLVTALKDSRWFIPLERQGLQNLLNERKIIRAAQENGTVAINNRIP 127 S + Q + ML++AL +R ER ++ L ER++ A Sbjct: 79 TITESRVLTQGVSEMLISALYKTRKVRLAERLDIRIPLAERQLKDAGAM------QRAPA 132 Query: 128 LQSLTAANIMVEGSIIGYESNVKSGGVGARYFGIGADTQYQLDQIAVNLRVVNVSTGEIL 187 ++ N ++ G++ N+ + G IGA + + + ++LRVV+ +T + + Sbjct: 133 ALNVQPVNFVILGALTELNYNILTQGARLYVGLIGASNREAVINVGLDLRVVDATTFQTV 192 Query: 188 SSVNTSKTILSYEVQAGVFRFIDYQRLLEGEVGYTSNEPVMLCLMSAIETGVIFLINDGI 247 + K I+ +V+AGV+RF D Q L+E + G NEP+ L + S +E ++ +G+ Sbjct: 193 YVTSLQKQIVGNQVEAGVYRFFDNQ-LVEFDAGTVRNEPLQLGVRSVVEMAAYQILTEGL 251 Query: 248 DRGLWDLQ 255 + D Q Sbjct: 252 GLPINDTQ 259 >UniRef50_B9L5Z4 Putative curli production assembly/transport component CsgG subfamily n=1 Tax=Nautilia profundicola AmH RepID=B9L5Z4_NAUPA Length = 443 Score = 167 bits (423), Expect = 3e-40, Method: Composition-based stats. 
Identities = 52/282 (18%), Positives = 100/282 (35%), Gaps = 31/282 (10%) Query: 2 QRLFLLVAVMLLSGCLTAPPKEAARPTLMPRAQSYKDLTHLPAPTGKIFVSVYNIQDETG 61 + +V + LSGC+ + Q+ ++ P K ++V + + + Sbjct: 5 RLFAFMVGIFFLSGCV------GTTTNVTTSNQNVNEIVKYKGP--KARIAVASFKCKAA 56 Query: 62 QFKPYPASNFSTAVPQSATAMLVTALKDSRWFIPLERQGLQNLLNERKIIRAAQENGTVA 121 + + + ML TAL +S FI +ER ER++ G + Sbjct: 57 K--------CNGQIGSGIADMLTTALFNSGKFIVIERSNEGFSAVEREL---QLSQGMIK 105 Query: 122 INNRIPLQSLTAANIMVEGSIIGYESNVKSGGVGA----RYFGIGADTQYQLD--QIAVN 175 N +I +L A+I+V G+I +E G R + ++ D IA + Sbjct: 106 QNRQIN--NLEGADILVVGAITAFEPKAGGISAGGIVIPRGVPVIGGIKFGKDEAYIAAD 163 Query: 176 LRVVNVSTGEILSSVNTSKTILSYEVQAGVFRFIDYQRLLEGEVGYTSNEPVMLCLMSAI 235 +R+++V TG I+++ + + F L + N P+ + I Sbjct: 164 IRLIDVKTGRIINATTVEGQASKWNIGGIGGGFTGNVAL-GAGLSTYKNTPMEKAIRDMI 222 Query: 236 ETGVIFLINDGIDRGLWD--LQNKAERQNDILVKYRHMSVPP 275 V I I + Q + + + +V P Sbjct: 223 NKAVEK-IAQLIPDNYYRYNANGTVNNQLNTTINTQQNTVKP 263 >UniRef50_P27343 Putative transcription activator protein hfaB n=6 Tax=Alphaproteobacteria RepID=HFAB_CAUCR Length = 337 Score = 163 bits (411), Expect = 7e-39, Method: Composition-based stats. 
Identities = 48/245 (19%), Positives = 92/245 (37%), Gaps = 21/245 (8%) Query: 22 KEAARPTLMPRAQSYKDLTHLPAPTG--KIFVSVYNIQDETGQFKPYPASNFSTAVPQSA 79 + + L +++ I D TG+ + V Q A Sbjct: 39 TAPVTANPTDYSSALVCLNQYARTNRIVAPRIAIGRIADYTGK---EESDGSGRKVTQGA 95 Query: 80 TAMLVTALKDSRWFIPLER------QGLQNLLNERKIIRAAQENGTVAINNR-IPLQSLT 132 + M V+A + +ER + N + I + R I + Sbjct: 96 SLMAVSAFAKAG-MPLVERFDTSVSEFELKYANNKLISDRPNPAPDAPADFRKILAGQVP 154 Query: 133 AANIMVEGSIIGYESNVKSGGVGARYF-----GIGADTQYQL--DQIAVNLRVVNVSTGE 185 ++ V G I N++S G+ A G+ + + ++ IA++LR+VN T E Sbjct: 155 GSDFYVIGGITELNYNIRSAGIDAYAGDKDTDGLKGNFRRRVFIMNIALDLRLVNTRTLE 214 Query: 186 ILSSVNTSKTILSYEVQAGVFRFIDYQRLLEGEVGYTSNEPVMLCLMSAIETGVIFLIND 245 ++ ++ K ++ EV AGVF F++ L + G + EP+ L + + IE + + + Sbjct: 215 VVDVISYQKQVVGREVSAGVFDFLN-GNLFDISAGRGALEPMQLAVRALIERATVEMAAN 273 Query: 246 GIDRG 250 Sbjct: 274 LYGMP 278 >UniRef50_C6XNU9 Holdfast attachment protein HfaB n=3 Tax=Alphaproteobacteria RepID=C6XNU9_HIRBI Length = 342 Score = 162 bits (410), Expect = 1e-38, Method: Composition-based stats. 
Identities = 50/266 (18%), Positives = 104/266 (39%), Gaps = 35/266 (13%) Query: 2 QRLFLLVAVMLLSGCLTAPPK-----------EAARPTLMPRAQSYKDLTHLPAPTG--K 48 + + LL + ++++GC++ P + S + G + Sbjct: 11 KSVALLASALMVTGCVSPVATKSGNYTKPIGGSPVTANPTPYSTSLVCMGDYAHQVGLGQ 70 Query: 49 IFVSVYNIQDETGQFKPYPASNFSTAVPQSATAMLVTALKDSRWFIPLER-----QGLQN 103 ++V I D TG+ V Q A+ M ++A + +ER L+ Sbjct: 71 PRIAVGRILDYTGKEDFEGG----RRVTQGASLMAISAFAKAGA-RLVERFDTSVSELEL 125 Query: 104 LLNERKIIRAAQENGTVAINNRIPLQSLTAANIMVEGSIIGYESNVKSGGVGARYFG--- 160 K+I A + +I S+ ++ + G I SN++S G+ Sbjct: 126 KYANNKLIGDAADQ----DFRKITAGSIPGSDFYLVGGITELNSNIRSVGIDGFIGDRDV 181 Query: 161 ----IGADTQYQLDQIAVNLRVVNVSTGEILSSVNTSKTILSYEVQAGVFRFIDYQRLLE 216 + + + ++LR+V + E++ ++ K I+ E+ AG+F F + + + Sbjct: 182 EDPKGNGGAKMFVINVGLDLRLVETESLEVVDVISYQKQIIGREISAGIFDFAN-NNVFD 240 Query: 217 GEVGYTSNEPVMLCLMSAIETGVIFL 242 +G + EP+ L + S IE V+ + Sbjct: 241 IGLGERAQEPIQLAVRSVIERAVLEM 266 >UniRef50_Q0AL09 HfaB protein n=1 Tax=Maricaulis maris MCS10 RepID=Q0AL09_MARMM Length = 287 Score = 161 bits (408), Expect = 2e-38, Method: Composition-based stats. 
Identities = 53/267 (19%), Positives = 106/267 (39%), Gaps = 21/267 (7%) Query: 1 MQRLFLLVAVMLLSGCLT----------APPKEAARPTLMPRAQSYKDLTHLPAPTGKIF 50 ++ + + ++GC + + L+ A G+ Sbjct: 7 LKAGVSALTLCAMTGCASLSGDFHDHMAGYTGARVIDNSTAYSSDLDCLSR-TAMAGRPR 65 Query: 51 VSVYNIQDETGQFKPYPASNFSTAVPQSATAMLVTALKDSRWFIPLERQGLQNLLNERKI 110 ++V + D TG+F T Q M+++AL + F ER + E + Sbjct: 66 IAVGEVNDLTGRFSSLDG----TVATQGTALMVMSALDRAG-FPLAERLDTRVAQQELEF 120 Query: 111 --IRAAQENGTVAIN-NRIPLQSLTAANIMVEGSIIGYESNVKSGGVGARYFGIGADTQY 167 R +G N R+ S+ ++ ++ G + N+ SG AR + ++ Sbjct: 121 ANSRLIGPDGQPTENYRRVMAGSIAGSDYIILGGVTELNFNLHSGVAEARIGPLIGGRRH 180 Query: 168 QLDQIAVNLRVVNVSTGEILSSVNTSKTILSYEVQAGVFRFIDYQRLLEGEVGYTSNEPV 227 + ++LR+V+ + ++ V+ SK I +E++AGVF FI L + +G EPV Sbjct: 181 YAMTVGLDLRLVDATDLRVIDIVSQSKVIRGHEIRAGVFEFIGDTTL-DIGMGERVQEPV 239 Query: 228 MLCLMSAIETGVIFLINDGIDRGLWDL 254 + + +E+ V L++ G+ Sbjct: 240 HTAIRTIVESAVFDLVS-GLAGPAAQT 265 >UniRef50_B4W728 Putative uncharacterized protein n=1 Tax=Brevundimonas sp. BAL3 RepID=B4W728_9CAUL Length = 344 Score = 161 bits (407), Expect = 3e-38, Method: Composition-based stats. 
Identities = 62/282 (21%), Positives = 111/282 (39%), Gaps = 34/282 (12%) Query: 7 LVAVMLLSGCLTAP------------PKEAARPTLMPRAQSYKDLTHLPAPTGK--IFVS 52 ++A + L+GC TA A + + + L G+ ++ Sbjct: 1 MIAAVGLAGCTTARYDPATGLYANPIGGAPATGNDTAYSAALRCLASAGQAEGRSAPRLA 60 Query: 53 VYNIQDETGQFKPYPASNFSTAVPQSATAMLVTALKDSRWFIPLERQGLQNLLNERK--- 109 V +I D TG+ + Q A+ VTAL + +ERQ ER+ Sbjct: 61 VGDIADLTGRNDLETG----RKISQGASLFAVTALTKAG-VPTVERQDRGVSEVERQYAQ 115 Query: 110 ---IIRAAQENGTVAINNR-IPLQSLTAANIMVEGSIIGYESNVKSGGVGARYFGI---- 161 + Q G A N R I + + V G + N++S GV A GI Sbjct: 116 SHLLSDTPQAAGESAENFRPIYAGQIAGSRYYVVGGVTELNYNLRSSGVDASAGGIEASG 175 Query: 162 ---GADTQYQLDQIAVNLRVVNVSTGEILSSVNTSKTILSYEVQAGVFRFIDYQRLLEGE 218 G + + IA++LR+V+ + E++++ + K ++ E++ GVF F + + + Sbjct: 176 VKGGLTSSGYVMNIAIDLRLVDTRSQEVVATASYQKQLVGREIRVGVFDFT-HGNIFDLS 234 Query: 219 VGYTSNEPVMLCLMSAIETGVIFLINDGIDRGLWDLQNKAER 260 G + EP+ + +AIE G+ + D + E Sbjct: 235 AGASGMEPIQFAVRTAIERGLYDFVADLYAIPRDRCLARPET 276 >UniRef50_A4CEI5 Putative uncharacterized protein n=1 Tax=Pseudoalteromonas tunicata D2 RepID=A4CEI5_9GAMM Length = 490 Score = 161 bits (406), Expect = 3e-38, Method: Composition-based stats. 
Identities = 52/256 (20%), Positives = 96/256 (37%), Gaps = 21/256 (8%) Query: 1 MQRLFLLVAVMLLSGCLTAPPKEAARPTLMPRAQSYKDLTHLPAPTGKIFVSVYNIQDET 60 + L L + L C + + + + + K ++V DE+ Sbjct: 6 FRPLVLASVITSLVACQSTSTQVTSNTN----TPNVNQVQQEQYNGAKARIAVARFTDES 61 Query: 61 GQFKPYPASNFSTAVPQSATAMLVTALKDSRWFIPLERQGLQNLLNERKIIRAAQENGTV 120 +S + + L TAL + FI LERQ L +L+E+ ++ A G V Sbjct: 62 NNHHW-----WSKEIGNGMSDQLTTALVGTNRFIVLERQALDAVLSEQDLVTA----GRV 112 Query: 121 AINNRIPLQSLTAANIMVEGSIIGYESNVKSGGVGARYFGIGADTQYQLD-----QIAVN 175 + N+ + A I++ S+ ++ + +G F IG Q +A++ Sbjct: 113 SANSGAAFGEIEGAEIVIVASVTEFDDDASGARIGGMGF-IGDMVQSVSAGLSNTHMAID 171 Query: 176 LRVVNVSTGEILSSVNTSKTILSYEVQAGVFRFIDYQRLLEGEVGYTSNEPVMLCLMSAI 235 LR+++ T IL++ +++ A F +L G + SN P L I Sbjct: 172 LRLIDTRTSRILAATTVEGGSKDFDIAAAATNF--GSSILGGNLSAWSNTPKEKALREII 229 Query: 236 ETGVIFLINDGIDRGL 251 + V F + D Sbjct: 230 QKAVEFTLTKIPDVYY 245 >UniRef50_C5SPX7 HfaB protein n=2 Tax=Caulobacteraceae RepID=C5SPX7_9CAUL Length = 281 Score = 155 bits (392), Expect = 1e-36, Method: Composition-based stats. 
Identities = 49/255 (19%), Positives = 92/255 (36%), Gaps = 25/255 (9%) Query: 17 LTAPPKEAARPTLMPRAQSYKDLTHLPAPTGKIFVSVYNIQDETGQFKPYPASNFSTAVP 76 + PP+ P + + L P+ G ++V I D TG+ + + Sbjct: 24 VATPPQAEVTLNETPVTPALRCLARRPSLNGLPRLAVGRIGDLTGKIDFDTGA----KIT 79 Query: 77 QSATAMLVTALKDSRWFIPLERQG------LQNLLNERKIIRAAQENGTVAINNR-IPLQ 129 Q A+ V+AL + +ER N ++ + + G N R I Sbjct: 80 QGASLFAVSALGYAG-VPVVERLDNSVAEIELNYARQKLLSDTPERAGQSGDNFRPILAG 138 Query: 130 SLTAANIMVEGSIIGYESNVKSGGVGARYFGIG-------ADTQYQLDQIAVNLRVVNVS 182 + + + G I N++S G A + + +AV+LR+VN Sbjct: 139 QIAGSRYYIVGGITELNYNIRSDGYDAAIGSQALPGAQGQISGRTYVLNVAVDLRLVNTQ 198 Query: 183 TGEILSSVNTSKTILSYEVQAGVFRFIDYQRLLEGEVGYTSNEPVMLCLMSAIETGVIFL 242 T +++ +V K ++ + + LL + G + EP+ + + +E V L Sbjct: 199 TQQVVDTVTFQKQVIGVTNDRRLTGGSEDIGLL-LQGGNSRQEPLQMSVRELVERSVYHL 257 Query: 243 INDGIDRGLWDLQNK 257 I LW + Sbjct: 258 IA-----PLWTATDA 267 >UniRef50_C5D1Z0 Putative uncharacterized protein n=1 Tax=Variovorax paradoxus S110 RepID=C5D1Z0_VARPS Length = 467 Score = 152 bits (384), Expect = 1e-35, Method: Composition-based stats. 
Identities = 60/257 (23%), Positives = 106/257 (41%), Gaps = 32/257 (12%) Query: 15 GCLTAPPKEAAR------------PTLMPRAQSYKDLTHLPAP--TGKIFVSVYNIQDET 60 GC+TAP + A + P A I ++V +++D T Sbjct: 31 GCVTAPQQRMAPNEAPIVLGPAVRENVTPMEVVLACFGDHVAATQRQPIVITVGDVKDYT 90 Query: 61 GQFKPYPASNFSTAVPQSATAMLVTAL-------KDSRWFIPL--ERQ---GLQNLLNER 108 G++ A+ Q M+ +AL + + F P+ ER+ + L + Sbjct: 91 GKYSINEG----NAITQGGALMVYSALGKLGGAVQAAERFDPVIAERELGYADRRQLGDG 146 Query: 109 KIIRAAQENGTVAINNRIPLQ-SLTAANIMVEGSIIGYESNVKSGGVGARYFGIGADTQY 167 + + A NG + S+ ++ + G I N+ SGG IG + Sbjct: 147 RTHQLAGPNGGQTVPWLPYFGGSINKSDYFIVGGITELNYNIHSGGGEIGVNQIGVKART 206 Query: 168 QLDQIAVNLRVVNVSTGEILSSVNTSKTILSYEVQAGVFRFIDYQRLLEGEVGYTSNEPV 227 + V+LR+V+ T I+ +++ +K YEV VFRF +L + +VG EPV Sbjct: 207 FSQSVGVDLRIVDTKTLMIVKTISLTKQFNGYEVGFNVFRFFG-SKLYDIDVGAKGQEPV 265 Query: 228 MLCLMSAIETGVIFLIN 244 + + +A+E GV+ L+ Sbjct: 266 QMGVRAALEEGVVRLVA 282 >UniRef50_A1BH58 Curli production assembly/transport component CsgG n=3 Tax=Chlorobium/Pelodictyon group RepID=A1BH58_CHLPD Length = 240 Score = 144 bits (362), Expect = 4e-33, Method: Composition-based stats. 
Identities = 52/220 (23%), Positives = 90/220 (40%), Gaps = 18/220 (8%) Query: 45 PTGKIFVSVYNIQDETGQFKPYPASNFSTAVPQSATAMLVTALKDSRWFIPLERQGLQNL 104 K + V + T A+ + V ML++ L + F LER L + Sbjct: 23 AQEKPRIGVLRFTNNT------YATWWRGGVGTDLQDMLISELASTNSFRVLERNELDAV 76 Query: 105 LNERKIIRAAQENGTVAINNRIPLQSLTAANIMVEGSIIGYESNVKSGGVGARYFGIGAD 164 + E+ + +G + R L +T A +V ++ +E N GG G Y GI Sbjct: 77 IREQDL----GASGRINPGTRSKLGKITGAKYLVAATVSAFEHNTSGGGGGISYGGISLG 132 Query: 165 TQYQLDQIAVNLRVVNVSTGEILSSVNTSKTILSYEVQAGVFRFIDYQRLLEGEVGYTSN 224 + +AV+L+V++V TGEI + T S + GV+R G + + Sbjct: 133 GKQDKAYMAVDLKVIDVQTGEIYDARTVEATSKSSGISVGVYR-----GGFGGHLNQYKD 187 Query: 225 EPVMLCLMS-AIETG--VIFLINDGIDRGLWDLQNKAERQ 261 PV + + IE + + +G + G D N+ +R+ Sbjct: 188 TPVGKAIRACVIEIAEYLECSLVEGKNSGCMDEYNQKDRK 227 >UniRef50_C8R0R5 Curli production assembly/transport component CsgG n=1 Tax=Desulfurivibrio alkaliphilus AHT2 RepID=C8R0R5_9DELT Length = 247 Score = 143 bits (360), Expect = 7e-33, Method: Composition-based stats. Identities = 57/251 (22%), Positives = 99/251 (39%), Gaps = 33/251 (13%) Query: 4 LFLLVAVMLLSGCLTAPPKEAARPTLMPRAQSYKDLTHLPAPTGKIFVSVYNIQDETGQF 63 FLL A + TA + RA+ Y K ++V + +D+T Sbjct: 12 FFLLAACSFIIPAATAQAQSGGPDMGTARAEDYH--------GPKAAIAVADFEDKT--- 60 Query: 64 KPYPASNFSTAVPQSATAMLVTALKDSRWFIPLERQGLQNLLNERKIIRAAQENGTVAIN 123 + + MLVT L ++ FI LER+ L ++ E+ + +G + Sbjct: 61 --VGRGQYRREYGRGMQDMLVTELFNTNRFIVLEREKLSAVIAEQDL----GASGRFRQD 114 Query: 124 NRIPLQSLTAANIMVEGSIIGYESNV---------KSGGVGARYFGIGADTQYQLDQIAV 174 P+ L A +MV ++ G++ +SG +G R + A Q +A+ Sbjct: 115 TTAPIGELEGAQLMVIAAVTGFDPGTSGTKGTVRGRSGLLGDRLGSLTAGV--QQAHVAL 172 Query: 175 NLRVVNVSTGEILSSVNTSKTILSYEVQAGVFRFIDYQRLLEGEVGYTSNEPVMLCLMSA 234 +LRVV+ +TG ILS+ N S+++ F L + P+ + A Sbjct: 173 DLRVVDTATGRILSATNVEGKARSFDLGGSAFGSQGSGGL-----STFARTPMEQAIRRA 227 Query: 235 IETGVIFLIND 245 I V F++ Sbjct: 228 IAEAVDFVVEQ 238 >UniRef50_Q2RW21 Putative uncharacterized protein n=1 Tax=Rhodospirillum rubrum ATCC 11170 RepID=Q2RW21_RHORT Length = 517 
Score = 138 bits (347), Expect = 2e-31, Method: Composition-based stats. Identities = 42/292 (14%), Positives = 97/292 (33%), Gaps = 42/292 (14%) Query: 6 LLVAVMLLSGCLT----------APPKEAARPTLMPRAQSYKDLTHLPAPTGK--IFVSV 53 +L ++L+GC T A P + + + + L GK I ++ Sbjct: 14 VLAGALVLAGCQTMSNPQTASVVAQPDTPVVKNMTSFTNALRCMDDLFLAYGKRDIIITS 73 Query: 54 YNIQDETGQFKPYPASNFSTAVPQSATAMLVTALKDSRWFIPLERQGLQNLLNERKIIRA 113 + D+TG+ + +A+ + +TA ++ F+ +ER G + + Sbjct: 74 DGLPDQTGEVRAGTKEMMISALSK------MTAKSNAFRFVDVERSGDAVFYFNQILTNH 127 Query: 114 AQENGTVAINNRIPLQSLTAANIMVEGSIIGYESNVKSG----GVGARYFGIGADTQYQL 169 + + + G+ + + GV + +G + + Sbjct: 128 DTQRKSFPS-------------YYIRGAFTQVDRGILQDNQGIGVAFDFVSLGYEQDQLV 174 Query: 170 DQIAVNLRVVNVSTGEILSSVNTSKTILS--YEVQAGVFRFIDYQRLLEGEVGYTSNEPV 227 I+++L + + EIL ++++ TI + A V + + + Sbjct: 175 SLISMDLNMGKTTDLEILPGISSTNTIATVKSGRGAEVEGLVPKANIY-LNFSNDRAQGT 233 Query: 228 MLCLMSAIETGVIFLINDGIDRGLWDLQNKAERQNDILVK----YRHMSVPP 275 + +E G+I L+ W +++ + Y MS Sbjct: 234 HAAARTLVELGLIELLGKFTRVPYWRCLEIESTNPEMMAQIRDWYDQMSPAD 285 >UniRef50_C7RID3 Peptidoglycan-binding domain 1 protein n=1 Tax=Candidatus Accumulibacter phosphatis clade IIA str. UW-1 RepID=C7RID3_9PROT Length = 519 Score = 133 bits (334), Expect = 6e-30, Method: Composition-based stats. 
Identities = 48/290 (16%), Positives = 96/290 (33%), Gaps = 42/290 (14%) Query: 1 MQRLFLLVAVMLLSGC------------LTAPPKEAARPTLMPRAQSYKDLTHLPAPTGK 48 M++ L + + L A PK A T+ QS + + L GK Sbjct: 1 MKKFGLSLIALSLLPIPGWGNDIGAEVQSAAAPKTPAIKTITNFTQSLRCMDELLYAYGK 60 Query: 49 IFVSVY--NIQDETGQFKPYPASNFSTAVPQSATAMLVTALKDSRWFIPLERQGLQNLLN 106 +++ I DETG+ K ML+TA+ + + ++ Sbjct: 61 QGIAITSTGIPDETGKVK------------TGTKEMLITAVSK-----MTVKSNAFDFID 103 Query: 107 ERKIIRAAQENGTVAINNRIPLQSLTAANIMVEGSIIGYESNV----KSGGVGARYFGIG 162 + + + + GSI + N K G + +G Sbjct: 104 FHSGADDLGALFAARGDQNRLMP-----DYYIRGSITQMDDNSVRKNKGVGFSLPFLDLG 158 Query: 163 ADTQYQLDQIAVNLRVVNVSTGEILSSVNTSKT-ILSYEVQAGVFRFIDYQRLLEGEVGY 221 D I+++L + + +T +I+ +TS T +L +G + L + Sbjct: 159 VSKDDAYDLISMDLSIGDAATRKIIPITSTSNTLVLMKGGISGEGGGKIGKVGLSFNIDV 218 Query: 222 TSNEPVMLCLMSAIETGVIFLINDGIDRGLWDLQNKAERQNDILVKYRHM 271 + +E V + +E G+I + W + + N ++ + Sbjct: 219 SRSEGVGAATRTLVELGLIETLGKFTQVPYWKCLD-TDLTNPLIREQARE 267 >UniRef50_O67219 Putative uncharacterized protein n=1 Tax=Aquifex aeolicus RepID=O67219_AQUAE Length = 462 Score = 129 bits (325), Expect = 8e-29, Method: Composition-based stats. 
Identities = 51/266 (19%), Positives = 87/266 (32%), Gaps = 29/266 (10%) Query: 1 MQRLFLLVAVMLLSGCLTAPPKEAARPTLMPRAQSYKDLT-HLPAPTGKIFVSVYN-IQD 58 M L ++ + L+ C + + + T A+ + LP I V + Sbjct: 1 MFPLLVISVIFLIFSCGPVAQQASTQTTQGEYAKDIRQREPELPKCDRPIGTIVARGFKC 60 Query: 59 ETGQFKP--------YPASNFSTAVPQSATAMLVTALKDSRWFIPLERQGLQNLLNERKI 110 + Q Y + + MLVTAL + F LER+ LQ + E ++ Sbjct: 61 KAAQCAGDRIVFGPNYTVEVSPKVLGDGLSDMLVTALVKTGCFRVLERETLQEIKEELEM 120 Query: 111 IRAAQENGTVAINNRIPLQSLTAANIMVEGSIIGYESNVKSGGVGAR------YFGIGAD 164 + P ++L A+ ++ GSI E G G G+G Sbjct: 121 LG------------VQPKKALKGADFLLTGSITALEMKASGMGGGGVVVPLPFLGGVGVK 168 Query: 165 TQYQLDQIAVNLRVVNVSTGEILSSVNTSKTILSYEVQAGVFRFIDYQRLLEGEVGYTSN 224 IA++LR+V V E+L + ++ I + G N Sbjct: 169 AGKSSAHIALDLRLVRVRDAEVLLAETVEGKSDRWKFGV-GGGGIFGTTIAGGWFEAFKN 227 Query: 225 EPVMLCLMSAIETGVIFLINDGIDRG 250 P+ I V ++ D Sbjct: 228 TPMEEATRDLIYHAVKLIVAQVRDLP 253 >UniRef50_P73111 Sll1835 protein n=3 Tax=Chroococcales RepID=P73111_SYNY3 Length = 265 Score = 128 bits (320), Expect = 3e-28, Method: Composition-based stats. Identities = 47/201 (23%), Positives = 77/201 (38%), Gaps = 16/201 (7%) Query: 43 PAPTGKIFVSVYNIQDETGQFKPYPASNFSTAVPQSATAMLVTALKDSRWFIPLERQGLQ 102 GK +SV +++T + + N S + L L + F +ERQ L Sbjct: 34 AQAQGKPTISVPEFKNDTNMSWWWWSGNTSREL----ADALSNELTSTGNFQVVERQNLG 89 Query: 103 NLLNERKIIRAAQENGTVAINNRIPLQSLTAANIMVEGSIIGYESNVKSGGVGARY---- 158 ++L+E+++ E G LT A +V G + YE V S G + Sbjct: 90 SVLSEQELA----ELGLTRPETSAQRGQLTGAQYIVLGRVTAYEEGVSSESGGNNFGLNL 145 Query: 159 --FGIGADTQY--QLDQIAVNLRVVNVSTGEILSSVNTSKTILSYEVQAGVFRFIDYQRL 214 F IG + Q +A++LRVV+ STGE++ + + I Sbjct: 146 GLFSIGNSERQAKQEAYVAIDLRVVDSSTGEVVYARTVEGRATDTAASSANNVNILGIVN 205 Query: 215 LEGEVGYTSNEPVMLCLMSAI 235 + TS PV L + + Sbjct: 206 TGQDNQSTSRAPVGKALRAGL 226 >UniRef50_A6DK47 Curli production assembly/transport component CsgG n=1 Tax=Lentisphaera araneosa HTCC2155 RepID=A6DK47_9BACT Length = 311 Score = 123 bits (307), Expect = 8e-27, Method: Composition-based stats. 
Identities = 42/254 (16%), Positives = 88/254 (34%), Gaps = 38/254 (14%) Query: 1 MQRLFLLVAVMLLSGCLTAPPKEAARPTLMPRAQSYKDLTHLPAPTGKIFVSVYNIQDET 60 M+ L LL V+L C T + A ++V D++ Sbjct: 1 MKLLSLLSLVILFCSCQTNSKRLPADVID----------------KKIPTIAVLEFADKS 44 Query: 61 G-QFKPYPASNFSTAVPQSATAMLVTALKDSRWFIPLERQGLQNLLNERKIIRAAQENGT 119 +++ V + L+ L S+ + L R+ + ++ E I Q++ Sbjct: 45 HFRYRWN--------VGEGIRDSLIDELVQSKRYKVLTRKNIDAVIGELNI----QQDKL 92 Query: 120 VAINNRIPLQSLTAANIMVEGSIIGYESNVKSGGVGARYFGIGADTQYQLDQIAVNLRVV 179 ++ L +++GS+ + K+ G A + G + + V L ++ Sbjct: 93 FRPEGKVARGRLKNVQYLLKGSVTDFAHVAKT-GASAFFSNWGFSGSTHVAVVTVTLYII 151 Query: 180 NVSTGEILSSVNTSKTILSYEVQ-AGVFRFIDYQRLLEGEVGYTSNEPVMLCLMSAIETG 238 V +GEI++S + + AG ++ + + G P+ + Sbjct: 152 EVESGEIIASKQVEGKAHATSLDVAGQYKNMSFGS------GSFYRTPLGKACKELMHQA 205 Query: 239 VIFLINDGIDRGLW 252 ++ IN I W Sbjct: 206 LLE-INKTIADKKW 218 >UniRef50_Q05S73 Putative uncharacterized protein n=1 Tax=Synechococcus sp. RS9916 RepID=Q05S73_9SYNE Length = 256 Score = 123 bits (307), Expect = 9e-27, Method: Composition-based stats. Identities = 47/203 (23%), Positives = 80/203 (39%), Gaps = 14/203 (6%) Query: 44 APTGKIFVSVYNIQDETGQFKPYPASNFSTAVPQSATAMLVTALKDSRWFIPLERQGLQN 103 AP + V+V I + +S V + T ML LK + F +ER GL+ Sbjct: 33 APRQPVTVAVKEITNNASGVWW-----WSPRVSKQLTDMLSNELKATGNFTLVERAGLKK 87 Query: 104 LLNERKIIRAAQENGTVAINNRIPLQSLTAANIMVEGSIIGYE--SNVKSGGVGARYFGI 161 +L+E+++ E G + +T A V G++ Y+ + KSGG G G Sbjct: 88 VLDEQELA----ELGITRQSTAPKRGMVTGAKYYVLGAVSDYQQGTETKSGGGGFNIMGF 143 Query: 162 G--ADTQYQLDQIAVNLRVVNVSTGEILSSVNTSKTILSYEVQAGVFRFIDYQRLLEGEV 219 G + +A+++RVV+ +TGEI S S + Y Sbjct: 144 GQRKSSSESKAYVALDVRVVDTTTGEIAYSRTIEGKATSKSESKSTSGGL-YGLSFSDSQ 202 Query: 220 GYTSNEPVMLCLMSAIETGVIFL 242 ++ P + +A+ +L Sbjct: 203 SSSNKVPASKAVRAAMIEVSEYL 225 >UniRef50_Q0IE08 Curli production assembly/transport component CsgG subfamily protein n=1 Tax=Synechococcus sp. 
CC9311 RepID=Q0IE08_SYNS3 Length = 301 Score = 122 bits (306), Expect = 1e-26, Method: Composition-based stats. Identities = 38/217 (17%), Positives = 76/217 (35%), Gaps = 35/217 (16%) Query: 49 IFVSVYNIQDETGQFKPYPASNFSTAVPQSATAMLVTALKDSRWFIPLERQGLQNLLNER 108 VSV + ++E GQ +S V + L L + +ERQ ++ +L+E+ Sbjct: 57 PTVSVPDFKNEVGQLAW-----WSPRVSRQLADALSNELSAAGGLTVVERQNVRAVLSEQ 111 Query: 109 KIIRAAQENGTVAINNRIPLQSLTAANIMVEGSIIGYESNVKS--GGVGARYFGIGADTQ 166 ++ + +T + ++ G + G+E+NV++ G G R+ G G Sbjct: 112 EMAELGIVR---NNDRAAKSGQMTGSQYVILGRVSGFENNVETKQSGSGMRFLGFGGSKD 168 Query: 167 --YQLDQIAVNLRVVNVSTGEILSSVNTSKTILSY----------------------EVQ 202 ++++LRVV+ +TGE++ + Sbjct: 169 VAETKAYVSLDLRVVDTTTGEVVGYKTVEGRAKNTAKVKGSGGSLAPLAGLVGGLTGASG 228 Query: 203 AGVFRFIDYQRL-LEGEVGYTSNEPVMLCLMSAIETG 238 G + L T P + +A+ Sbjct: 229 TGAYGLAAAGTLSFNESSSETKKTPASKAVRAALIAA 265 >UniRef50_D0SYN2 Predicted protein n=1 Tax=Acinetobacter lwoffii SH145 RepID=D0SYN2_ACILW Length = 362 Score = 121 bits (304), Expect = 2e-26, Method: Composition-based stats. Identities = 38/222 (17%), Positives = 77/222 (34%), Gaps = 19/222 (8%) Query: 24 AARPTLMPRAQSYKDLTHLPAPTGKIFVSVYNIQDETGQFKPYPASNFSTAVPQSATAML 83 P + L + SV I D+TG+ V Q A M+ Sbjct: 6 PITDIFTPFDMALSCLKGQL--RSDVSFSVGAILDQTGK-DVVTNGGSGKMVTQGAGDMV 62 Query: 84 VTALKDSRWFIPLERQGLQNLLNERKIIRAAQENGTVAINNRIPLQSLTAANIMVEGSII 143 +AL + + R+ + + +E K + A++ V GSI Sbjct: 63 QSALFQAG-VSLMNRRDPRIIESEAKWGIRDPR-------------QIQASDYYVTGSIN 108 Query: 144 GYESNVKSGGVGARYFGIGADTQYQLDQIAVNLRVVNVSTGEILSSVNTSKTILSYEVQA 203 + + GG + G+G + + ++L + + + +++ +V+ K I + + Sbjct: 109 SLDF-IPGGGFDMQIAGVGPNYSQTRIMVGLDLSLTDTRSSKVVGNVSLQKQIAAQDYGL 167 Query: 204 GVFRFIDYQRLLEGEVGYTSNEPVMLCLMSAIETGVIFLIND 245 RF + LL ++G E L + L++ Sbjct: 168 SAGRFAG-RTLLNIQIGKGEREATNFALRQMLNLATFELLSQ 208 >UniRef50_A1VWR2 Curli production assembly/transport component CsgG n=2 Tax=Proteobacteria RepID=A1VWR2_POLNA Length = 337 Score = 119 bits (299), Expect = 7e-26, Method: Composition-based stats. 
Identities = 39/260 (15%), Positives = 86/260 (33%), Gaps = 36/260 (13%) Query: 1 MQRLFLLVAVMLLSGCLTAPPKEAARPTLMPRAQSYKDLTHLPA-----PTGKIFVSVYN 55 + + L++ L+ C P A + A + + PT K +++ Sbjct: 16 VTKPLLIILAAGLTACAVQAPPVAQKDAPQSLATQKAAQQAVASQAPATPTLKRKIALGR 75 Query: 56 IQDETGQFKPYPASNFSTAVPQSATAMLVTALKDSRWFIPLERQGLQNLLNERKIIRAAQ 115 I +ET + + + T ++ AL +S ++ ER + + E ++ Sbjct: 76 ITNETSYGQSLLRDRHDDPLGKQVTDLMSKALTESGAYLVFERPDIGRIQAEGRLTDTKL 135 Query: 116 ENGTVAINNRIPLQSLTAANIMVEGSIIGYESNVKSGGVGARYFGIGADTQYQLDQIAVN 175 + + ++ GS+ + G G + ++ Q+ V+ Sbjct: 136 N--------------IVGVDALIIGSLTEF------GRKAIGATGFVSSSKRQVAFAKVD 175 Query: 176 LRVVNVSTGEILSSVNTSKTILSYEVQAGVFRFIDYQRLLEGEVGYTSNEPVMLCLMSAI 235 +RVV+V+TG + + + + + A F F G + A+ Sbjct: 176 IRVVDVNTGHVFFATSGAGEASTE--TASTFGF--------GSQAGYDGTLNDAAIRQAV 225 Query: 236 ETGVIFLINDGIDRGLWDLQ 255 + L + R W Sbjct: 226 AEAINRLSVEMSGRP-WQTY 244 >UniRef50_Q21E38 Curli production assembly/transport component CsgG n=1 Tax=Saccharophagus degradans 2-40 RepID=Q21E38_SACD2 Length = 321 Score = 119 bits (299), Expect = 7e-26, Method: Composition-based stats. 
Identities = 46/251 (18%), Positives = 97/251 (38%), Gaps = 32/251 (12%) Query: 1 MQRLFLLVAVMLLSGCLTAPPK-EAARPTLMPRAQSYKDLTHLPAPTGKI-----FVSVY 54 M+ + + V LL C + P+ + PT+ + Q L A K ++V Sbjct: 1 MKIVTTGLMVALLCSCASQDPRLKDVEPTISEQQQREAQAKLLEAQATKTLALKRKIAVG 60 Query: 55 NIQDETGQFKPYPASNFSTAVPQSATAMLVTALKDSRWFIPLERQGLQNLLNERKIIRAA 114 + +ET K S+ + + + + M V +L +S ++ ER L+ L NE ++ Sbjct: 61 RLSNETSYGKSLLGSSKNDVLGEKVSDMFVQSLANSGNYLIFERPDLELLENEARLTGET 120 Query: 115 QENGTVAINNRIPLQSLTAANIMVEGSIIGYESNVKSGGVGARYFGIGADTQYQLDQIAV 174 L + +V GS+ + N G + ++ Q V Sbjct: 121 VN--------------LIGVDTLVIGSLTQFGRNTTGES------GFLSSSKKQEATATV 160 Query: 175 NLRVVNVSTGEILSSVNTSKTILSYEVQAGVFRFIDYQRLLEGEVGYTSNEPVMLCLMSA 234 +LR+V TG + +SV + + + R + + + + G +++ + + +A Sbjct: 161 DLRLVETKTGRVFASVTGTGSSSTET-----ARTMGFGSVAGYD-GSINDQAIGAAVNAA 214 Query: 235 IETGVIFLIND 245 +E ++ Sbjct: 215 VEKLNKIMLEK 225 >UniRef50_B0VF47 Putative curli production assembly/transport component CsgG n=1 Tax=Candidatus Cloacamonas acidaminovorans RepID=B0VF47_9BACT Length = 314 Score = 117 bits (294), Expect = 3e-25, Method: Composition-based stats. 
Identities = 47/271 (17%), Positives = 93/271 (34%), Gaps = 40/271 (14%) Query: 1 MQRL--FLLVAVMLLSGCLTAP-PKEAARPTLMPRAQSYKDLTHLPAPTGKIFVSVYNIQ 57 M ++ FL + +L+S C P + LMP + D K +++ Sbjct: 1 MTKILYFLCIGALLISACAQNQAPAKVEVVNLMPAEKQITDEQIH----LKKKIAIGRFT 56 Query: 58 DET--GQFKPYPASNFSTAVPQSATAMLVTALKDSRWFIPLERQGLQNLLNERKIIRAAQ 115 +ET SN + + ++A +L + L + FI +ERQ L +++ AQ Sbjct: 57 NETRLANSFLNEGSNTGSRMSKAANDILASKLAITNRFILIERQDELILDINQQVADIAQ 116 Query: 116 ENGTVAINNRIPLQSLTAANIMVEGSIIGYESNVKSGGVGARYFGIGADTQYQLDQIAVN 175 + A+ ++ GSI + G G+ T+ Q VN Sbjct: 117 YH--------------IPADYIILGSITEF------GTSNTGNVGLIDRTKKQTAFAKVN 156 Query: 176 LRVVNVSTGEILSSVNTSKTILSYEVQAGVFRFIDYQRLLEGEVGYTSNEPVMLCLMSAI 235 LR+++ TG ++ + + L G + +AI Sbjct: 157 LRILDTHTGRVIYGEEGAGEASTS----------TSTVLGMGSQAGYDTSLADKAIDAAI 206 Query: 236 ETGVIFLINDGIDRGLWDLQNKAERQNDILV 266 + + ++ W + N +++ Sbjct: 207 SSVIDNIVAKLSQDK-WRSYVLQQENNQLII 236 >UniRef50_Q4KAD5 CsgG family protein n=20 Tax=Proteobacteria RepID=Q4KAD5_PSEF5 Length = 242 Score = 117 bits (293), Expect = 4e-25, Method: Composition-based stats. 
Identities = 49/254 (19%), Positives = 89/254 (35%), Gaps = 39/254 (15%) Query: 1 MQRLFLLVAVMLLSGCLTAPPKEAARPTLMPRAQSYKDLTHLPAPTGKIFVSVYNIQDET 60 M L + +SGC T + + +Q + ++ ++V + + Sbjct: 24 MLSAVALAVLAGMSGCATESSRALPVEKVQSASQVWT--------GARVPMAVGKFDNRS 75 Query: 61 GQFKPYPASNFSTAVPQSATAMLVTALKDSRWFIPLERQGLQNLLNERKIIRAAQENGTV 120 + + + A +L+T L+ + F L+R + + E I AQ Sbjct: 76 SYMRGIFSDGVDR-LGGQAKTILITHLQQTNRFSVLDRDNMGEIQQEAAIKGQAQ----- 129 Query: 121 AINNRIPLQSLTAANIMVEGSIIGYESNVKSGGVGARYFGIGADTQYQLDQIAVNLRVVN 180 L A+ +V G + + + + FGI + Q+ V L +VN Sbjct: 130 ---------KLKGADFVVTGDVTEFG---RKETGDHQLFGILGRGKTQVAYAKVALNIVN 177 Query: 181 VSTGEILSSVNTSKT-ILSYEVQAGVFRFIDYQRLLEGEVGYTSNEPVMLCLMSAIETGV 239 +ST E++ S + LS G Y L G+V + L + A+ V Sbjct: 178 ISTSEVVYSTQGAGEYALSNREVIGFGGTAAYDSTLNGKV-------LDLAMREAVNRMV 230 Query: 240 IFLINDGIDRGLWD 253 + ID G W Sbjct: 231 -----EAIDAGAWK 239 >UniRef50_Q2JVB7 CsgG family protein n=4 Tax=Cyanobacteria RepID=Q2JVB7_SYNJA Length = 364 Score = 115 bits (288), Expect = 1e-24, Method: Composition-based stats. 
Identities = 45/226 (19%), Positives = 80/226 (35%), Gaps = 27/226 (11%) Query: 34 QSYKDLTHLPAPTGKIFVSVYNIQDETGQFKPYPASNFSTAVPQSATAMLVTALKDSRWF 93 + + P PT + V+V + F S + + +LV L F Sbjct: 60 SAPSQVGQAPQPTTRPRVAVLDFD-----FSSLSNSYSLREASRGVSDLLVDRLVRDGTF 114 Query: 94 IPLERQGLQNLLNERKIIRAAQENGTVAINNRIPLQSLTAANIMVEGSIIGYESNVKSGG 153 +E L +L E+ + +G + N + + + ++ GS+ ++ +V+ G Sbjct: 115 SVIEPSRLDAILAEQNL----GLSGRLDANTAAQVGRILGVDAVILGSVTQFDVSVRRSG 170 Query: 154 VGARYF--------GIGADTQYQLDQIAVNLRVVNVSTGEILSSVNTSKTILSYEVQAGV 205 AR +GA+ + +N R+V+ ST EIL+ V + + V Sbjct: 171 GEARVLTPFGSFPLAVGAEVVDADANVQLNARLVSTSTAEILAVVEGRGNVSQSDSTVTV 230 Query: 206 FRFIDYQRLLEGEVGYTSNEPVM--LCLMSAIETGVIFLINDGIDR 249 F G TSNE + L A+E L Sbjct: 231 ADF--------GGGSATSNEEKLLVLASQQAVEQIAQQLAGFASRL 268 >UniRef50_B8CKS1 Curli production assembly/transport component CsgG, putative n=1 Tax=Shewanella piezotolerans WP3 RepID=B8CKS1_SHEPW Length = 310 Score = 115 bits (287), Expect = 2e-24, Method: Composition-based stats. Identities = 45/255 (17%), Positives = 86/255 (33%), Gaps = 34/255 (13%) Query: 1 MQRLFLLVAVMLLSGCLTAPPKEAARPTLMPRAQSYKDLTHLPAPTGKIFVSVYNIQDET 60 M++ + L CL +P + + L K +++ +ET Sbjct: 1 MKKWNIFTLTCLAIFCLQSPVSASGLDATKASTTVASN-DSLTFLKRK--IAIARFSNET 57 Query: 61 GQFKPYPASNFSTAVPQSATAMLVTALKDSRWFIPLERQGLQNLLNERKIIRAAQENGTV 120 + + + + + A +L L D+ FI ER ++L +E + + Sbjct: 58 QAANSFLVDSSNNRIGKQAADILSARLADTNKFIMFERLDTEDLNSENILKGISDSG--- 114 Query: 121 AINNRIPLQSLTAANIMVEGSIIGYESNVKSGGVGARYFGIGADTQYQLDQIAVNLRVVN 180 A + ++ GS+ + G GI + ++ Q VN+R+V+ Sbjct: 115 -----------VAVDYLIVGSVSEF------GRSAESTTGIFSQSKIQKAYTKVNVRLVD 157 Query: 181 VSTGEILSSVNTSKTILSYEVQAGVFRFIDYQRLLEGEVGYTSNEPVMLCLMSAIETGVI 240 VSTG I+SSV + + + L G L AI + Sbjct: 158 VSTGRIISSVEGAGEASTE----------TKKTLGAGTSAAFDQSLTDKALSQAISQMIS 207 Query: 241 FLINDGIDRGLWDLQ 255 L+ + + W Sbjct: 208 NLVENMTAKP-WKSY 221 >UniRef50_Q31MJ4 Putative uncharacterized protein n=2 Tax=Synechococcus elongatus RepID=Q31MJ4_SYNE7 Length = 332 Score = 114 bits 
(285), Expect = 3e-24, Method: Composition-based stats. Identities = 48/240 (20%), Positives = 91/240 (37%), Gaps = 15/240 (6%) Query: 5 FLLVAVMLLSGCLTAPPKEAARPTLMPRAQSYKDLTHLPAPTGKIFVSVYNIQDETGQFK 64 FL++ + LL+G A K L K ++V + D + Sbjct: 7 FLVLNLSLLTG--LIQTPAIATSAEQSLVIKAKQPLLLAQNQAKRRIAVLDF-DFSNVSS 63 Query: 65 PYPASNFSTAVPQSATAMLVTALKDSRWFIPLERQGLQNLLNERKIIRAAQENGTVAINN 124 P S F V + + +LV L + +ER + +LNE+ + +G + + Sbjct: 64 PSVLSAFPN-VSKGVSDILVNRLVKDGTYTLIERSRIDAVLNEQNL----GASGRIDPST 118 Query: 125 RIPLQSLTAANIMVEGSIIGYESNVKSGGVGARYFGIGADTQYQLDQIAVNLRVVNVSTG 184 + + + ++ GS+ + + G G+ FGIG ++ +N+R+V+ ST Sbjct: 119 AAQIGKILGVDAVIIGSVTRLDLQTRQSG-GSFLFGIGGNSTDVDAYAQINIRMVSTSTA 177 Query: 185 EILSSVNTSKTILSYEVQAGVFRFIDYQRLLEGEVGYTSNEPVMLCLMSAIETGVIFLIN 244 EIL+ + I + + V + E V + AI+ +IN Sbjct: 178 EILAVAEGTGNISQSDSRVTVLGIGGGSQTFNPEK------LVFIATEQAIDQVAKEVIN 231 >UniRef50_A9L616 Curli production assembly/transport component CsgG n=2 Tax=Alteromonadales RepID=A9L616_SHEB9 Length = 323 Score = 114 bits (285), Expect = 3e-24, Method: Composition-based stats. 
Identities = 45/245 (18%), Positives = 84/245 (34%), Gaps = 29/245 (11%) Query: 3 RLFLLVAVMLLSGCLTAPPKEAARPTLMPRAQSYKDLT-HLPAPTGKIFVSVYNIQDETG 61 L ++ L + C T K+ A S L K V++ +ET Sbjct: 6 PLIFVMCASLTTACATVNKKQVVTTAQPAAAISATQTEVALNTKLLKRKVAIGRFTNETT 65 Query: 62 QFKPYPASNFSTAVPQSATAMLVTALKDSRWFIPLERQGLQNLLNERKIIRAAQENGTVA 121 + + + + + A +L + L + FI LER L + E + + Sbjct: 66 YGQGFFIDEDNNRIGKQAMDILSSKLFQTGKFIMLERADLGKIEKELAMGGNSTLK---- 121 Query: 122 INNRIPLQSLTAANIMVEGSIIGYESNVKSGGVGARYFGIGADTQYQLDQIAVNLRVVNV 181 AA+ ++ GSI + G G+ + + Q VN+R+V+V Sbjct: 122 ----------NAADYLIVGSITEF------GRKEVSDVGVFSRVKKQEANAKVNIRIVDV 165 Query: 182 STGEILSSVNTSKTILSYEVQA-GVFRFIDYQRLLEGEVGYTSNEPVMLCLMSAIETGVI 240 +TG I+ S S GV Y L +V + + + + + Sbjct: 166 ATGLIIYSEEGKGIAYSEAGSVMGVGDKAGYDSSLNDKV-------LDVAITNLASNIIE 218 Query: 241 FLIND 245 +++ Sbjct: 219 NMLDK 223 >UniRef50_A3JNY2 Putative uncharacterized protein n=1 Tax=Rhodobacterales bacterium HTCC2150 RepID=A3JNY2_9RHOB Length = 389 Score = 113 bits (283), Expect = 6e-24, Method: Composition-based stats. 
Identities = 46/270 (17%), Positives = 98/270 (36%), Gaps = 37/270 (13%) Query: 10 VMLLSGCLT---------APPKEAARPTLMPRAQSYKDLTHLPAPTGK--IFVSVYNIQD 58 ++ LSGC T A P+ + + + ++ A G+ + +S I D Sbjct: 1 MIALSGCATINPSLAPKVAHPRTPPARNFTSFNDTLRCMDNMLARAGRKTVLISSSGIPD 60 Query: 59 ETGQFKPYPASNFSTAVPQSATAMLVTALKDSRWFI--PLERQGLQNLLNERKIIRAAQE 116 T + + AV Q + S FI PLER+ Q I+ + Sbjct: 61 LTSKIRVGADDMLVNAVNQ------MNVNSKSYVFIDQPLERRDAQ-------IVWLTKR 107 Query: 117 NGTVAINNRIPLQSLTAANIMVEGSII--GYESNVKSGGVGARYFGIGADT-QYQLDQIA 173 G + + + ++ ++ + + + R G++ +L + Sbjct: 108 EGDLTPQF-----YIRGSISQLDEGVVKDSFSFGINNDLAPNRDVESGSNRFSRRLSVVT 162 Query: 174 VNLRVVNVSTGEILSSVNTSKTILSYEVQAGVFRFIDYQRLLEGEVGYTSNEPVMLCLMS 233 V+L +V +I+ + + +++ V AG+ ID + +G E + + + Sbjct: 163 VDLHLVTYPDRKIIPGGSVANSMV--IVGAGLTGIIDLSEI-GVTIGMERIESIGQAVRN 219 Query: 234 AIETGVIFLINDGIDRGLWDLQNKAERQND 263 +E GVI L+ W+ + + + Sbjct: 220 LVELGVIELLGKHSRLPYWNCLSLPTVKTE 249 >UniRef50_B8JAW6 Curli production assembly/transport component CsgG n=5 Tax=Proteobacteria RepID=B8JAW6_ANAD2 Length = 321 Score = 112 bits (281), Expect = 9e-24, Method: Composition-based stats. 
Identities = 48/256 (18%), Positives = 82/256 (32%), Gaps = 37/256 (14%) Query: 3 RLFLLVAVMLLSGCLTA-----PPKEAARPTLMPRAQSYKDLTHLPAPTGKIFVSVYNIQ 57 R L VA + L C T + AQ A K +++ Sbjct: 4 RFALPVAALALQACATVSQPPVEVESPVPKAAQVAAQQQAQAPAPSAKRYKTRIAIARFT 63 Query: 58 DETGQFKPYPASNFSTAVPQSATAMLVTALKDSRWFIPLERQGLQNLLNERKIIRAAQEN 117 +ET + + + A+ ML + L S F+ LER LQ L E+ + A Sbjct: 64 NETSYGRSLLNDADLDRIGKQASDMLASRLVMSGNFVVLERPDLQKLEREQALRGVAG-- 121 Query: 118 GTVAINNRIPLQSLTAANIMVEGSIIGYESNVKSGGVGARYFGIGADTQYQLDQIAVNLR 177 L A+ ++ GS+ + G G + T+ Q + V++R Sbjct: 122 -------------LVGADTVISGSVTEF------GRSVGGKKGFLSSTKVQTARAKVDIR 162 Query: 178 VVNVSTGEILSSVNTSKTILSYEVQAGVFRFIDYQRLLEGEVGYTSNEPVMLCLMSAIET 237 +V+V TG S + + + F + +AI Sbjct: 163 LVDVKTGHAYFSALGAGEASTESGEIAGFG----------SRAEYDATLNDRAIAAAISD 212 Query: 238 GVIFLINDGIDRGLWD 253 + L++ R W Sbjct: 213 VIDRLVSTLAARP-WR 227 >UniRef50_D0XIZ9 HfaB protein n=1 Tax=Brevundimonas subvibrioides ATCC 15264 RepID=D0XIZ9_9CAUL Length = 245 Score = 112 bits (280), Expect = 1e-23, Method: Composition-based stats. 
Identities = 43/240 (17%), Positives = 92/240 (38%), Gaps = 24/240 (10%) Query: 20 PPKEAARPTLMPRAQSYKDLTH-LPAPTGKIFVSVYNIQDETGQFKPYPASNFSTAVPQS 78 P A + P + L + ++V NI+D +G+ + +PQ Sbjct: 2 QPGAVAPQAIAPDT--VRCLQQTGTGTGSRPRIAVGNIRDLSGRVSLESGAL----LPQG 55 Query: 79 ATAMLVTALKDSRWFIPLERQGLQ------NLLNERKIIRAAQENGTVAIN-NRIPLQSL 131 A+ ++AL + + +ER + N ++ + + G V N RI + Sbjct: 56 ASMFAISALLEMG-YPVVERFDMAIAEIEINYARQQLLSDTPELAGQVQDNYRRIYPGQI 114 Query: 132 TAANIMVEGSIIGYESNVKS-------GGVGARYFGIGADTQYQLDQIAVNLRVVNVSTG 184 + + G++ + V S V + + A Y+ +A++LR+V+ + Sbjct: 115 AGSRFYLTGALTELNTGVSSLSGAASATAVSSAIASLSASGGYERASVALDLRLVDTLSQ 174 Query: 185 EILSSVNTSKTILSYEVQAGVFRFIDYQRLLEGEVGYTSNEPVMLCLMSAIETGVIFLIN 244 E++ S +++ S + G ++ G+V Y + V + +E V L+ Sbjct: 175 EVVGSATVRRSLSSGNLSVGAIGATA--PVVSGQVAYARSSEVQYAVRGMVEEAVRVLVG 232 >UniRef50_Q7DDH4 Putative lipoprotein NMB1126/NMB1164 n=130 Tax=Proteobacteria RepID=Y1126_NEIMB Length = 223 Score = 112 bits (279), Expect = 2e-23, Method: Composition-based stats. 
Identities = 48/240 (20%), Positives = 78/240 (32%), Gaps = 37/240 (15%) Query: 14 SGCLTAPPKEAARPTLMPRAQSYKDLTHLPAPTGKIFVSVYNIQDETGQFKPYPASNFST 73 +GC T + + Y + + +SV + + K + Sbjct: 18 TGCATESSRSLEVEKVASYNTQYHGV--------RTPISVGTFDNRSSFQKGIFSDGEDR 69 Query: 74 AVPQSATAMLVTALKDSRWFIPLERQGLQNLLNERKIIRAAQENGTVAINNRIPLQSLTA 133 + A +LVT L+ + F L R L L E I A L Sbjct: 70 -LGSQAKTILVTHLQQTNRFNVLNRTNLNALKQESGISGKAHN--------------LKG 114 Query: 134 ANIMVEGSIIGYESNVKSGGVGARYFGIGADTQYQLDQIAVNLRVVNVSTGEILSSVNTS 193 A+ +V G + + + + FGI + Q+ V L +VNV+T EI+ S Sbjct: 115 ADYVVTGDVTEFG---RRDVGDHQLFGILGRGKSQIAYAKVALNIVNVNTSEIVYSAQ-- 169 Query: 194 KTILSYEVQAGVFRFIDYQRLLEGEVGYTSNEPVMLCLMSAIETGVIFLINDGIDRGLWD 253 AG + + + + G L AI V L+ +D G W Sbjct: 170 --------GAGEYALSNREIIGFGGTSGYDATLNGKVLDLAIREAVNSLV-QAVDNGAWQ 220 >UniRef50_C3WB28 Curli production assembly/transport component CsgG n=1 Tax=Fusobacterium mortiferum ATCC 9817 RepID=C3WB28_FUSMR Length = 305 Score = 111 bits (278), Expect = 2e-23, Method: Composition-based stats. 
Identities = 39/269 (14%), Positives = 93/269 (34%), Gaps = 37/269 (13%) Query: 1 MQR-LFLLVAVMLLSGCLTAPPKEAARPTLMPRA-QSYKDLTHLPAPTGKIFVSVYNIQD 58 M++ + + +A + L C + + + Y AP ++ + +++ Sbjct: 4 MKKYMGIFLAALFLVSCSNKEIRSTVKKEDNISTLRDYNTYKENLAPKRRVVI--GKVKN 61 Query: 59 ETGQFKPYPASNFSTAVPQSATAMLVTALKDSRWFIPLERQGLQNLLNERKIIRAAQENG 118 T + + ++ + +L++ +S F LER+ L +++ E + E Sbjct: 62 YT-----RFGTQRTDSITK---DILISEFANSGRFNVLEREDLDSVMEELAFSNSLGEKS 113 Query: 119 TVAINNRIPLQSLTAANIMVEGSIIGYESNVKSGGVGARYFGIGADTQYQLDQIAVNLRV 178 +A + +V GS+ YE N + + ++ Q + + L+V Sbjct: 114 ILAKQ------KFLDTDFIVVGSVTKYELNTTGS------KSLFSKSKEQRAEAVIELKV 161 Query: 179 VNVSTGEILSSVNTSKTILSYEVQAGVFRFIDYQRLLEGEVGYTSNEPVMLCLMSAIETG 238 ++V G++ + + + G + Y L E ++ +E Sbjct: 162 IDVLNGKVWTETGEGSASVKFGTVLGAGTYGSYGSL--------EQEAFRAAVIQGVEK- 212 Query: 239 VIFLINDGIDRGLWDLQNKAERQNDILVK 267 I ID W + I++ Sbjct: 213 ----IVKKIDSMPWTAAVVKKDGKRIIIN 237 >UniRef50_B7QTV8 Putative peptidoglycan binding domain protein n=1 Tax=Ruegeria sp. R11 RepID=B7QTV8_9RHOB Length = 424 Score = 111 bits (278), Expect = 2e-23, Method: Composition-based stats. 
Identities = 51/277 (18%), Positives = 88/277 (31%), Gaps = 44/277 (15%) Query: 4 LFLLVAVMLLSGC---------LTAPPKEAARPTLMPRAQSYKDLTHLPA--PTGKIFVS 52 + LV + L+GC A P A L ++ + + L A P VS Sbjct: 8 VGSLVLALTLAGCGARYPALTPAIAQPNARAARNLTSFSEPLRCMDGLFAQLPRQSYLVS 67 Query: 53 VYNIQDETGQFKPYPASNFSTAVPQSATAMLVTALKDSRWFIPLERQGLQNLLNERKIIR 112 +I DET V A ML+ A+ R L++ +I Sbjct: 68 SSDIPDET------------RRVSVGADDMLINAMNQMNR-----RSQRYVFLDQARISG 110 Query: 113 AAQENGTVAINNRIPLQSLTAANIMVEGSIIGYESNVKSGGVGARYFGIGADT------- 165 Q T L + GSI +S+ VGA + G+ Sbjct: 111 FGQLELTTTRKKGEVKPQL-----YIRGSISQLDSDRVDAEVGAVHSTEGSSGLTKSLYK 165 Query: 166 -QYQLDQIAVNLRVVNVSTGEILSSVNTSKT--ILSYEVQAGVFRFIDYQRLLEGEVGYT 222 +L ++V+L +V + ++ + + + ++ + V ID + Sbjct: 166 GFRKLSVVSVDLHLVEYPSRRVVPGASVANSMVVVRRGLNGTVTGIIDSVTG-GVPISVE 224 Query: 223 SNEPVMLCLMSAIETGVIFLINDGIDRGLWDLQNKAE 259 E + + IE G+I L+ W Sbjct: 225 RIESQGQAVRNLIELGLIELLGKHAGVPYWQCLEAPS 261 >UniRef50_C2MBZ3 Curli production assembly/transport component CsgG n=1 Tax=Porphyromonas uenonis 60-3 RepID=C2MBZ3_9PORP Length = 300 Score = 110 bits (274), Expect = 6e-23, Method: Composition-based stats. 
Identities = 41/226 (18%), Positives = 76/226 (33%), Gaps = 32/226 (14%) Query: 41 HLPAPTGKIFVSVYNIQDETGQFKPYPASNFSTAVPQSATAMLVTALKDSRWFIPLERQG 100 + K V++ +ET K + + + A +L L S FI LER Sbjct: 27 QETGKSLKWKVAIGRFSNETQYGKGIFYDRENDPIAKQAQDILAAKLVASGKFILLERSD 86 Query: 101 LQNLLNERKIIRAAQENGTVAINNRIPLQSLTAANIMVEGSIIGYESNVKSGGVGARYFG 160 + + E + G++ A+ ++ GS+ + Sbjct: 87 AEAVAQE---VTDGTSEGSIK------------ADYVILGSVTEFGRKTTGQ------TS 125 Query: 161 IGADTQYQLDQIAVNLRVVNVSTGEILSSVNTSKTILSYEVQAGVFRFIDYQRLLEGEVG 220 + + Q + AVNLR+V+V+TG S+ Y + L G Sbjct: 126 LFTAEKTQQVEAAVNLRLVDVATG----IATYSEEAKGYANN------VSKSTLGLGGTS 175 Query: 221 YTSNEPVMLCLMSAIETGVIFLINDGIDRGLWDLQNKAERQNDILV 266 + +AI+ V +IN D+ W + Q+ ++ Sbjct: 176 GYDASLGDKAISAAIDQLVENIINKCSDKP-WRTYLISMDQDGTII 220 >UniRef50_D1B8K8 Curli production assembly/transport component CsgG n=1 Tax=Thermanaerovibrio acidaminovorans DSM 6589 RepID=D1B8K8_THEAS Length = 305 Score = 110 bits (274), Expect = 7e-23, Method: Composition-based stats. Identities = 31/192 (16%), Positives = 75/192 (39%), Gaps = 19/192 (9%) Query: 47 GKIFVSVYNIQDETGQFKPYPASNFSTAVPQSATAMLVTALKDSRWFIPLERQGLQNLLN 106 K ++V + QD S+ + A + M+ T L ++ F +ER + + Sbjct: 24 AKARIAVLSFQD----------SSGAGAPAAAIADMMTTELFNTGLFSVVERSRIDQIAM 73 Query: 107 ERKIIRAAQENGTVAINNRIPLQSLTAANIMVEGSIIGYESNVKSGGVGARYFGI-GADT 165 E+++ G + ++ + + L A ++ GSI Y G + + G+ G Sbjct: 74 EQRMS----AQGLTSPSSAVQMGQLLGAEYLMTGSITQYRYEASGGVIPLPFGGLSGVAV 129 Query: 166 QYQLDQIAVNLRVVNVSTGEILSSVNTSKTILSYEVQAGVFRFIDYQRLLEGEVGYTSNE 225 + + +++R++N +TGE++++ + + + G+ G + Sbjct: 130 GSETAYVTLDVRLINAATGEVITTARAEGAANQTQ-GGLAYDSAVFGT---GKAGGLLGQ 185 Query: 226 PVMLCLMSAIET 237 + +E Sbjct: 186 ATYKAVTKIVEQ 197 >UniRef50_D1Y6S9 Tetratricopeptide repeat domain protein n=1 Tax=Pyramidobacter piscolens W5455 RepID=D1Y6S9_9BACT Length = 471 Score = 109 bits (272), Expect = 1e-22, Method: Composition-based stats. 
Identities = 37/201 (18%), Positives = 75/201 (37%), Gaps = 22/201 (10%) Query: 42 LPAPTGKIFVSVYNIQDETGQFKPYPASNFSTAVPQSATAMLVTALKDSRWFIPLERQGL 101 L I + V + A S V + T M +T L +S F ER L Sbjct: 19 LSPAAAVIRIGVDRFR--------SGAPGVSPDVADALTEMFITELSNSGSFQVYERTAL 70 Query: 102 QNLLNERKIIRAAQENGTVAINNRIPLQSLTAANIMVEGSIIGYESNVKSGGVGARYFGI 161 + + E+++ +G V+ + + + L ++ G++ + G + FG+ Sbjct: 71 EKVAREQRLS----MSGLVSESTLVKVGRLAGVEWIITGAVTQSDEKQTGGVLPIHGFGL 126 Query: 162 GADTQYQLDQIAVNLRVVNVSTGEILSSVNTSKTILSYEVQAGVFRFIDYQRLLEGEVGY 221 T + + +++R ++ +TG I +++ + S + V+ G VG Sbjct: 127 AVGT--NVGTVTLDVRTIDTTTGAITAALRKTG-AASRAIAGAVYEGTVIGTTQYGGVGS 183 Query: 222 TSNEPVMLCLMSAIETGVIFL 242 M A++ V L Sbjct: 184 Q-------AAMKAVKRTVREL 197 >UniRef50_A2CA24 Uncharacterized protein involved in formation of curli polymers n=3 Tax=Cyanobacteria RepID=A2CA24_PROM3 Length = 275 Score = 109 bits (272), Expect = 1e-22, Method: Composition-based stats. Identities = 37/171 (21%), Positives = 68/171 (39%), Gaps = 12/171 (7%) Query: 44 APTGKIFVSVYNIQDETGQFKPYPASNFSTAVPQSATAMLVTALKDSRWFIPLERQGLQN 103 A VSV + +++ G+ +S V + L L + +ERQ L+ Sbjct: 26 AAWAGPTVSVPDFKNQVGRLSW-----WSPRVSRQLADALSNELALAGGLTVVERQNLKA 80 Query: 104 LLNERKIIRAAQENGTVAINNRIPLQSLTAANIMVEGSIIGYESNVKS--GGVGARYFGI 161 +L+E+++ + A ++ G + GYE V++ G G R+ G Sbjct: 81 VLSEQELAELGIVRNDGDAARSR---QMRGARYLIMGRVSGYEDGVETKQSGSGMRFMGF 137 Query: 162 GAD--TQYQLDQIAVNLRVVNVSTGEILSSVNTSKTILSYEVQAGVFRFID 210 G ++++LRVV+ STGE++ + S Q G + Sbjct: 138 GGSKTVSESKAYVSIDLRVVDSSTGEVVGARTVEGRATSTAKQKGSGGSLA 188 >UniRef50_D0XJ00 Putative uncharacterized protein n=1 Tax=Brevundimonas subvibrioides ATCC 15264 RepID=D0XJ00_9CAUL Length = 338 Score = 109 bits (271), Expect = 1e-22, Method: Composition-based stats. 
Identities = 40/243 (16%), Positives = 83/243 (34%), Gaps = 19/243 (7%) Query: 5 FLLVAVMLLSGCLTAP---PKEAARPTLMPRAQSYKDLTHLPAPTGKIFVSVYNIQDETG 61 + + +LL+GC T+P E + +S A + ++V ++ D TG Sbjct: 3 AIGLVALLLAGCATSPSLTTFEREFARTGRQTESLAQCLAASAEATRPILAVGSVADLTG 62 Query: 62 QFKPYPASNFSTAVPQSATAMLVTALKDSRWFIPLERQGLQNLLNERKIIRAAQENGTVA 121 + V Q A L L + ERQ L +E ++ V+ Sbjct: 63 RQTFATG----RVVTQGAGLFLSADLA-TFGIRLAERQDTSVLDSETRL--------LVS 109 Query: 122 INNRIPLQSLTAANIMVEGSIIGYESNVKSGGVGARYFGIGADTQYQLD--QIAVNLRVV 179 + + V G+I+ + +G G G G ++ V++R++ Sbjct: 110 DTTPGETGRIAGSRYYVSGAIVTADPADAAGLQGQAIGGAGVTLSSTEMRRRVVVSIRII 169 Query: 180 NVSTGEILSSVNTSKTILSYEVQAGVFRFIDYQRLLEGEVGYTSNEPVMLCLMSAIETGV 239 + + I + + +S + + +L + + N+ + L AI Sbjct: 170 DSRSLLIAAVGTYERIAVSTDQSLSISDPSSLD-VLRFDARRSENDGLDLATRLAIREAA 228 Query: 240 IFL 242 + Sbjct: 229 RDI 231 >UniRef50_B8HPG7 CsgG family protein, putative n=1 Tax=Cyanothece sp. PCC 7425 RepID=B8HPG7_CYAP4 Length = 330 Score = 108 bits (269), Expect = 3e-22, Method: Composition-based stats. Identities = 38/221 (17%), Positives = 79/221 (35%), Gaps = 16/221 (7%) Query: 27 PTLMPRAQSYKDLTHLPAPTGKIFVSVYNIQ-DETGQFKPYPASNFSTAVPQSATAMLVT 85 T P + L P K+ ++V N G + S ++ + +L Sbjct: 26 QTTAPSSPGLVPLQARP----KVRIAVLNFDFSNIGLTGAVYSFTDSAGPSKAVSTLLTN 81 Query: 86 ALKDSRWFIPLERQGLQNLLNERKIIRAAQENGTVAINNRIPLQSLTAANIMVEGSIIGY 145 L ++ +ER + +L E+ + + G + + + + +V GS+ + Sbjct: 82 LLVKDGTYVVVERSRIDAVLAEQNL----GQAGRIEPTTAAQVGRILGVDAVVIGSVTEF 137 Query: 146 ESNVKSGGVGARYFGIGADTQYQLDQIAVNLRVVNVSTGEILSSVNTSKTILSYEVQAGV 205 K GGV G G+ + Q+ ++ + R+V+ +TGEILS + V Sbjct: 138 GLEQKKGGV--NILGFGSQKETQIARVQLAARIVSTTTGEILSVAEAKGEATQVDESISV 195 Query: 206 FRFIDYQRLLEGEVGYTSNEPVMLCLMSAIETGVIFLINDG 246 + + S+ + A+E + + Sbjct: 196 GGYGST-----AQGSNASDRLLSTAAQQALEEVKTKFVAEA 231 >UniRef50_Q1IPN7 Curli production assembly/transport component CsgG n=2 Tax=Acidobacteria RepID=Q1IPN7_ACIBL Length = 327 Score = 106 bits (265), Expect = 7e-22, Method: Composition-based stats. 
Identities = 37/229 (16%), Positives = 77/229 (33%), Gaps = 15/229 (6%) Query: 36 YKDLTHLPAPTGKIFVSVYNIQDETGQFKPYPASNFSTAVPQSATAMLVTALKDSRWFIP 95 ++ P T K V++ + T V + + +L+ L + + Sbjct: 14 LLSVSAFPQATRKKRVAIMSFDYGTVHSSVAAIFGSDQDVGKGISDLLIQKLVNDGDYSV 73 Query: 96 LERQGLQNLLNERKIIRAAQENGTVAINNRIPLQSLTAANIMVEGSIIGYESN------- 148 +ER L ++ E+ + + N+ + L + ++ GSI + + Sbjct: 74 IERAQLDKIMAEQNFSNSDRA----DPNSAAKIGRLLGVDAIITGSITQFGRDDQHTNVG 129 Query: 149 -VKSGGVGARYFGIGADTQYQLDQIAVNLRVVNVSTGEILSSVNTSKTILSYEVQ---AG 204 GG+ RY G T + + R+V+V+T EIL++ + T V AG Sbjct: 130 GGGYGGITGRYGIGGVGTHSAKAVVGITARLVDVNTAEILAACTGTGTSKRSGVSLLGAG 189 Query: 205 VFRFIDYQRLLEGEVGYTSNEPVMLCLMSAIETGVIFLINDGIDRGLWD 253 + L+ + + A+++ L Sbjct: 190 GSGWNGGGGSLDMGSSNFGETILGEAVHQAVDSLGAQLDAKAGALPTNK 238 >UniRef50_C6JM48 Putative uncharacterized protein n=2 Tax=Fusobacterium RepID=C6JM48_FUSVA Length = 305 Score = 106 bits (264), Expect = 8e-22, Method: Composition-based stats. Identities = 42/266 (15%), Positives = 90/266 (33%), Gaps = 38/266 (14%) Query: 4 LFLLVAVMLLSGCL-TAPPKEAARPTLMPRAQSYKDLTHLPAPTGKIFVSVYNIQDETGQ 62 + ++ +LL C T + + + Y +L P ++ + Sbjct: 7 IGMIFISLLLLSCGKTGVESNIKKDDKIVSLREYNNLKETALPKRRVVIG---------- 56 Query: 63 FKPYPASNFSTAVPQSAT-AMLVTALKDSRWFIPLERQGLQNLLNERKIIRAAQENGTVA 121 K S F T T +L + +S F LER L +++ E E ++ Sbjct: 57 -KVKNYSRFGTQRTDITTKDILASEFSNSGRFNVLERSDLDSVIEELAFSNTLGEKSLLS 115 Query: 122 INNRIPLQSLTAANIMVEGSIIGYESNVKSGGVGARYFGIGADTQYQLDQIAVNLRVVNV 181 + ++ GS+ Y N + + ++ Q ++ + L+V++V Sbjct: 116 KQ------KFLDTDFVIIGSVTKYALNTTGN------KSLFSKSKEQKAEVVIELKVIDV 163 Query: 182 STGEILSSVNTSKTILSYEVQAGVFRFIDYQRLLEGEVGYTSNEPVMLCLMSAIETGVIF 241 + G++ + +++ G + Y L E E ++ +E V Sbjct: 164 TNGKVWIETGEGSSSVTFGTVLGAGTYGSYTSLEE--------EAFRAAVIQGVEKIVKK 215 Query: 242 LINDGIDRGLWDLQNKAERQNDILVK 267 L D W + N+I++ Sbjct: 216 L-----DSMPWSASIVKKSGNNIIIN 236 >UniRef50_B0CFQ9 CsgG family protein, putative n=1 Tax=Acaryochloris marina MBIC11017 RepID=B0CFQ9_ACAM1 Length = 336 Score = 105 bits (261), 
Expect = 2e-21, Method: Composition-based stats. Identities = 35/246 (14%), Positives = 85/246 (34%), Gaps = 13/246 (5%) Query: 1 MQRLFLLVAVMLLSGCLTAPPKEAARPTLMPRAQSYKDLTHLPAPTGKIFVSVYNIQ-DE 59 M+ L +A + ++ + P + + + ++V + Sbjct: 1 MKHTVLPIAKLSIALTVLTAFCVPGSVYANPFSAPGIGVEQI-KVKESRRIAVLDFDYAN 59 Query: 60 TGQFKPYPASNFSTAVPQSATAMLVTALKDSRWFIPLERQGLQNLLNERKIIRAAQENGT 119 + + + +L L ++ +ER + +L E+ + +G Sbjct: 60 VSKTGISYGLYGKNGASRGISNLLTNELVKDGTYVLVERSKIDTILAEQNL----GASGR 115 Query: 120 VAINNRIPLQSLTAANIMVEGSIIGYESNVKS-GGVGARYFGIGADTQYQLDQIAVNLRV 178 + + + + ++ GSI + +S GG +FG+G + QL + ++ R+ Sbjct: 116 IEPTTAAQIGRVLGVDAVLIGSITQFHIEEQSKGGSIGGFFGLGGKQKTQLATVQLSTRL 175 Query: 179 VNVSTGEILSSVNTSKTILSYEVQAGVFRFIDYQRLLEGEVGYTSNEPVMLCLMSAIETG 238 V+ +TGEIL++ + + +F +S + A++ Sbjct: 176 VSTATGEILTAAEGTGQADKSGGRGRIFGIGVDSN------SDSSERLLGEASGQAVDKI 229 Query: 239 VIFLIN 244 V L Sbjct: 230 VSQLAA 235 >UniRef50_D1Y6S8 Putative curli production assembly/transport component n=1 Tax=Pyramidobacter piscolens W5455 RepID=D1Y6S8_9BACT Length = 309 Score = 103 bits (257), Expect = 5e-21, Method: Composition-based stats. Identities = 32/189 (16%), Positives = 69/189 (36%), Gaps = 22/189 (11%) Query: 50 FVSVYNIQDETGQFKPYPASNFSTAVPQSATAMLVTALKDSRWFIPLERQGLQNLLNERK 109 +SV ++ +G+ P + M++T L +S F +ER L + E++ Sbjct: 26 TISVETFRNSSGRHVPVDS----------IMDMMITELVNSGTFQVVERDRLDVIAREQR 75 Query: 110 IIRAAQENGTVAINNRIPLQSLTAANIMVEGSIIGYESNVKSGGVGARYFGIGADTQ--Y 167 + ++G + N L A M+ G++ Y ++ +GG Sbjct: 76 M----GQSGLIDSNTASRTGRLAGAQYMMTGAVTKYSASDTAGGGIIGGGSSLIGGLINT 131 Query: 168 QLDQIAVNLRVVNVSTGEILSSVNTSKTILSYEVQAG-VFRFIDYQRLLEGEVGYTSNEP 226 + +++R+V+ +TG I+ + V G + R+ + G G Sbjct: 132 NTAYVTLDVRIVDTTTGAIVYAGRAEG--AGTNVMGGLLSRYAGFGT---GRSGGQLATA 186 Query: 227 VMLCLMSAI 235 + + Sbjct: 187 THKAITKVV 195 >UniRef50_A6DA55 Putative uncharacterized protein n=1 Tax=Caminibacter mediatlanticus TB-2 RepID=A6DA55_9PROT Length = 174 Score = 101 bits (251), Expect = 3e-20, Method: Composition-based stats. 
Identities = 41/202 (20%), Positives = 78/202 (38%), Gaps = 39/202 (19%) Query: 8 VAVMLLSGCLTAPPKEAARPTLMPRAQSYKDLTHLPAPTGKIFVSVYNIQDETGQFKPYP 67 + + SGC T+ + Q+ D+ K ++V + + + + Sbjct: 1 MLLNFFSGCGTS------ISNVSTSKQNINDVASYQG--KKARIAVASFKCKAAK----- 47 Query: 68 ASNFSTAVPQSATAMLVTALKDSRWFIPLER-----QGLQNLLNERKIIRAAQENGTVAI 122 + ++ + +L TAL + FI LER + +QN LN + I+ N Sbjct: 48 ---CNGSIGSGISDILTTALMKTNKFIVLERDSEAMRAIQNELNNQIIMTNRHAN----- 99 Query: 123 NNRIPLQSLTAANIMVEGSIIGYESNVKSGGVGARYFGI------GADTQYQLDQIAVNL 176 + +I+V G+I +E G+G + G IA++L Sbjct: 100 -------RMEGTDILVVGAITAFEPKAGGFGIGGVTIPLNVPVIGGIKFAKNDAYIALDL 152 Query: 177 RVVNVSTGEILSSVNTSKTILS 198 R+V++STG +L++ S Sbjct: 153 RLVDISTGRVLAATTIEGKASS 174 >UniRef50_Q9RTM7 Putative uncharacterized protein n=1 Tax=Deinococcus radiodurans RepID=Q9RTM7_DEIRA Length = 212 Score = 101 bits (250), Expect = 3e-20, Method: Composition-based stats. Identities = 39/245 (15%), Positives = 75/245 (30%), Gaps = 41/245 (16%) Query: 1 MQRLFLLVAVMLLSGCLTAPPKEAARPTLMPRAQSYKDLTHLPAPTGKIFVSVYNIQDET 60 M+++ V A P G++ ++V +++ Sbjct: 1 MKKVLTAVLATAFL------VAPVAVAQTAAAPTPAPVAAAPALPQGQVNIAVGSLK--- 51 Query: 61 GQFKPYPASNFSTAVPQSATAMLVTALKDSRWFIPLERQGLQNLLNERKIIRAAQENGTV 120 A + + L TAL ++ F ER+ L E + A Sbjct: 52 -----CKAEKCYGGLGEGIGDALTTALLNTGKFAVYERENTAQLTEEAFLNGGAT----- 101 Query: 121 AINNRIPLQSLTAANIMVEGSIIGYESNVKSGGVGARYFGIGADTQYQLDQIAVNLRVVN 180 A++++ G+I YE SGG+ +G + IA++LR+V+ Sbjct: 102 ----------FQGADVLIFGAITQYEPQASSGGLSFMGVSVGKKS----STIAMDLRIVD 147 Query: 181 VSTGEILSSVNTSKTILSYEVQAGVFRFIDYQRLLEGEVGYTSNEPVMLCLMSAIETGVI 240 T I+ + + LL VG S+ + + + Sbjct: 148 AKTRRIIGATQVQGKAEGNN--------FNVSGLLPVNVGAQSSPQLEAAISQMLNNAAQ 199 Query: 241 FLIND 245 L+ Sbjct: 200 QLLLK 204 >UniRef50_A5PA28 Curli production assembly/transport component CsgG n=1 Tax=Erythrobacter sp. SD-21 RepID=A5PA28_9SPHN Length = 310 Score = 99.1 bits (245), Expect = 1e-19, Method: Composition-based stats. 
Identities = 41/230 (17%), Positives = 69/230 (30%), Gaps = 31/230 (13%) Query: 26 RPTLMPRAQSYKDLTHLPAPTGKIFVSVYNIQDETGQFKPYPASNFSTAVPQSATAMLVT 85 + S + K V++ +ET + + + + A ++ Sbjct: 21 TAVAQEVSASQQVAASGDEIVLKRRVAIGRFTNETRYGQTLLRDSDLDPLGKQAADIMAA 80 Query: 86 ALKDSRWFIPLERQGLQNLLNERKIIRAAQENGTVAINNRIPLQSLTAANIMVEGSIIGY 145 L DS FI +ER +L E+ + L A+ ++ GSI+ + Sbjct: 81 YLIDSNAFIVVERTDANEVLKEQGVGGET--------------SGLIGADTIIVGSIVEF 126 Query: 146 ESNVKSGGVGARYFGIGADTQYQLDQIAVNLRVVNVSTGEILSSVNTSKTILSYEVQAGV 205 G + + Q V +R+V+V TG S S + Sbjct: 127 ------GRADEGERAVFKRERTQKAYAKVAIRLVDVRTGVAFHSATGSGEATTT------ 174 Query: 206 FRFIDYQRLLEGEVGYTSNEPVMLCLMSAIETGVIFLINDGIDRGLWDLQ 255 +L G L AIE + LIN R W Sbjct: 175 ----TKTKLFSGTTARYDGTLTDKALSVAIEDVLEDLINSLSARE-WRTD 219 >UniRef50_C9KNZ5 Putative uncharacterized protein n=1 Tax=Mitsuokella multacida DSM 20544 RepID=C9KNZ5_9FIRM Length = 234 Score = 97.6 bits (241), Expect = 4e-19, Method: Composition-based stats. Identities = 45/213 (21%), Positives = 79/213 (37%), Gaps = 27/213 (12%) Query: 49 IFVSVYNIQDETGQFKPYPASNFSTAVPQSATAMLVTALKDSRWFIPLERQGLQNLLNER 108 V + +++ + + + +A + T L+DS F L+R + L +E Sbjct: 48 PTVGILPLEN-----RGLVSEGWDREEMGAAVEYVYTDLQDSGRFKLLDRTRQRALTDE- 101 Query: 109 KIIRAAQENGTVAINNRIPLQSLTAANIMVEGSIIGYESNVKSGGVGARYFGIGADTQYQ 168 A +G V + + A ++ GSIIG + V +GA T+ Sbjct: 102 ---YAHDMSGLVDEDTAPVIGDQYGAQYLLMGSIIGVTTRRSETTV------VGAGTKRA 152 Query: 169 LDQIAVNLRVVNVSTGEILSSVNTSKTILSYEVQAGVFRFIDYQRLLEGEVGYTSNEPVM 228 V+LR+V+ TGE++ + + V+A + L+ E V Sbjct: 153 QVTATVSLRLVDTETGEVVLAATGRSRKNNTLVKAPL-------GLIRIGTEQVDKEQVN 205 Query: 229 LCLMSAIETGVIFLINDGIDRGLWDLQNKAERQ 261 L AI V DG L + KA+ + Sbjct: 206 EALEDAIHDAV-----DGPRGLLARMDGKAKSK 233 >UniRef50_B9CYA0 Peptidoglycan-binding domain 1 protein n=2 Tax=Campylobacter RepID=B9CYA0_WOLRE Length = 272 Score = 95.7 bits (236), Expect = 2e-18, Method: Composition-based stats. 
Identities = 42/249 (16%), Positives = 86/249 (34%), Gaps = 19/249 (7%) Query: 3 RLFLLVAVMLLSGCLTAPPKEAARPTLMPRAQSYKDLTHLPAPTGK----IFVSVYNIQD 58 ++ L LL+GC+++ + + P +V +D Sbjct: 10 KIAALALPFLLTGCMSSMSMGSPGAKTTATGAAAGSNAQNTNPGLTRCTETMGTVTIYED 69 Query: 59 ETGQFKPYPASNFSTAVPQSATAMLVTALKDSRWFIPLERQGLQNLLNERKIIRAAQENG 118 Y + + + + + A + S F+ +ER N + E + A E+G Sbjct: 70 R--NSNWYSVATRQYKLTSTIPVLRLLA-QQSNCFVVVERSKAFNQMLEER---ALMESG 123 Query: 119 TVAINNRIPLQSLTAANIMVEGSIIGYESNVK--SGGVGARYFGIGAD--TQYQLDQIAV 174 + N+ + AA+ + +I ESN SG VGA + + + ++ Sbjct: 124 ELRENSNFKKGQMVAADYTLTPTITFSESNTSGLSGVVGALFGSVAGSVAGGFSTSDVST 183 Query: 175 NLRVVNVSTGEILSSVNTSKTILSYEVQAGVFRFIDYQRLLEGEVGYTSNEPVMLCLMSA 234 L ++ +G L++ S + +F L +N P +++A Sbjct: 184 VLTLIENRSGVQLAAAEGSARNTDFAGLGSLFGGKAGGSL-----RAYANTPEGKIIIAA 238 Query: 235 IETGVIFLI 243 + LI Sbjct: 239 FTDSMNNLI 247 >UniRef50_B0JU00 Curli production assembly/transport component n=2 Tax=Microcystis aeruginosa RepID=B0JU00_MICAN Length = 333 Score = 94.5 bits (233), Expect = 3e-18, Method: Composition-based stats. 
Identities = 50/262 (19%), Positives = 92/262 (35%), Gaps = 31/262 (11%) Query: 4 LFLLVAVMLLSGCLTAPPKEAARPTLMPRAQSYKDLTHLPAPTGKIFVSVYNIQDETGQF 63 + L+ L AP A +P L + K+ V+V + D +G Sbjct: 14 ITLVGVASSLGFSFNAPAFSQAISNNLPLL--------LAQKSEKVRVAVLDF-DYSGLS 64 Query: 64 KPYPASNFSTAVPQSATAMLVTALKDSRWFIPLERQGLQNLLNERKIIRAAQENGTVAIN 123 P F + +LV L +S + +ER + +L E+ + +G V Sbjct: 65 NPQWL-TFLNGGASGVSDILVNRLVESGRYTVIERSRIDAVLREQNL----GASGRVDAA 119 Query: 124 NRIPLQSLTAANIMVEGSIIGYESNVKSGGVGARYFGIGADTQYQLDQIAVNLRVVNVST 183 + + ++++ GSI ++ K G F + +N+R +N +T Sbjct: 120 TAAQIGQILGVDVVIIGSITQFDLQKKQSGGSFIIFS--TAKTETDAFVKLNVRAINTTT 177 Query: 184 GEILSSVNTSKTILSYEVQAGVFRFIDYQRLLEGEVGYTSNEPVMLCLMSAIETGVIFLI 243 EI+++ T + V L G TSNE +L + A + V ++ Sbjct: 178 AEIITTAQGDGTANQSDGSTVV--------LGVGGGSQTSNEGKLLSI--ATDKAVARVV 227 Query: 244 NDGIDRGLWDLQNKAERQNDIL 265 L D ++ L Sbjct: 228 -----DNLNDKADQIAATPRSL 244 >UniRef50_C1CWI5 Putative Curli production assembly/transport component CsgG, n=1 Tax=Deinococcus deserti VCD115 RepID=C1CWI5_DEIDV Length = 211 Score = 94.5 bits (233), Expect = 4e-18, Method: Composition-based stats. 
Identities = 32/245 (13%), Positives = 76/245 (31%), Gaps = 41/245 (16%) Query: 1 MQRLFLLVAVMLLSGCLTAPPKEAARPTLMPRAQSYKDLTHLPAPTGKIFVSVYNIQDET 60 M+ ++ + L A ++ P ++ ++V + Sbjct: 1 MKMFRAILVMSALCA--------GQVLAQTAPAAPPAPVSAPPVSEPQVHIAVGVFK--- 49 Query: 61 GQFKPYPASNFSTAVPQSATAMLVTALKDSRWFIPLERQGLQNLLNERKIIRAAQENGTV 120 ++ + L+ AL ++ F ER+ + L+ I Sbjct: 50 -----CNVRLCTSELGTGLADALMNALSETGKFAVYERENVPQLVQNNMIAGTDPTAA-- 102 Query: 121 AINNRIPLQSLTAANIMVEGSIIGYESNVKSGGVGARYFGIGADTQYQLDQIAVNLRVVN 180 L+ +++V G+I YE SG +G + +I +LR+V+ Sbjct: 103 ----------LSPVDVLVFGNINVYEPESSSGQGCFMGVCLGG----KESRIGADLRIVD 148 Query: 181 VSTGEILSSVNTSKTILSYEVQAGVFRFIDYQRLLEGEVGYTSNEPVMLCLMSAIETGVI 240 TG +L++ + ++ +G + + + + + V Sbjct: 149 SKTGRVLATTKVEGKSSTTG--GSIY-------FNGLSLGGKQSSGLDKAVGAMLTQAVQ 199 Query: 241 FLIND 245 L + Sbjct: 200 VLQSK 204 >UniRef50_C1XFI9 Uncharacterized protein involved in formation of curli polymers n=2 Tax=Thermaceae RepID=C1XFI9_MEIRU Length = 167 Score = 93.3 bits (230), Expect = 9e-18, Method: Composition-based stats. Identities = 36/163 (22%), Positives = 57/163 (34%), Gaps = 15/163 (9%) Query: 82 MLVTALKDSRWFIPLERQGLQNLLNERKIIRAAQENGTVAINNRIPLQSLTAANIMVEGS 141 ML TAL S F+ +R + L +E + S T A++++ G Sbjct: 1 MLNTALVSSNHFVVYDRSIITQLRSEAALSNQQN--------------SFTGADLIITGV 46 Query: 142 IIGYESNVKSGGVGARYFGIGADTQYQLDQIAVNLRVVNVSTGEILSSVNTSKTILSYEV 201 I G+E N IG Q + I V++R+V+ TG I+++ Sbjct: 47 ITGFEPNASGSSGLGAIPFIGGLVQQKKSYIRVDMRIVDTRTGAIIAAFPVEAEATDTNF 106 Query: 202 QAGVFRFIDYQRLLEGEVGYT-SNEPVMLCLMSAIETGVIFLI 243 + T SN P+ L IE +I Sbjct: 107 AGVGAGLLPGGLGGLVGGLRTYSNTPMAKALALMIEAATQAII 149 >UniRef50_A8ZXP3 Uncharacterized protein involved in formation of curli polymers-like protein n=1 Tax=Desulfococcus oleovorans Hxd3 RepID=A8ZXP3_DESOH Length = 315 Score = 92.2 bits (227), Expect = 2e-17, Method: Composition-based stats. 
Identities = 40/233 (17%), Positives = 86/233 (36%), Gaps = 31/233 (13%) Query: 14 SGCLTAPPKEAARPTLMPRAQSYKDLTHLPAPTG-KIFVSVYNIQDETGQFKPYPASNFS 72 SGC P A R +L TG K+ V V N Q++T + Sbjct: 20 SGCA--PKISGAVKDDSSRTGIKDELAPTGGYTGPKLRVGVVNFQNKT--------PSRV 69 Query: 73 TAVPQSATAMLVTALKDSRWFIPLERQGLQNLLNERKIIRAAQENGTVAINNRIPLQSLT 132 + ++A +L T L+ + FI + +Q + ++LN++ + +G + + + Sbjct: 70 LGIGEAAADILGTILQKTDRFIIIPQQDMSSILNQQSM----GASGVIDPTTAAKMGKVL 125 Query: 133 AANIMVEGSIIGYESNVKSGGVGARYFGIGADTQYQLDQIAVNLRVVNVSTGEILSSVNT 192 N +V G+I Y + + + + Q+ ++ V+ R+V+ +TG + + + Sbjct: 126 GLNAIVTGAITAYSEAEEGQDL------LIYQKKKQIARVTVDYRIVDTTTGIQIMADSG 179 Query: 193 SKTILSYEVQAGVFRFIDYQRLLEGEVGYTSNEPVMLCLMSAIETGVIFLIND 245 + L G + L A+ ++ ++ Sbjct: 180 QGE----------YAKSTGGALGLGSRSTYDADLRDGALRDALTKAMVNMLKQ 222 >UniRef50_C6W063 Uncharacterized protein involved in formation of curli polymers-like protein n=1 Tax=Dyadobacter fermentans DSM 18053 RepID=C6W063_DYAFD Length = 574 Score = 90.6 bits (223), Expect = 4e-17, Method: Composition-based stats. 
Identities = 41/246 (16%), Positives = 79/246 (32%), Gaps = 28/246 (11%) Query: 2 QRLFLLVAVMLLSGCLTAPPKEAARPTLMPRA-QSYKDLTHLPAPTGKIFVSVYNIQDET 60 Q + L + L +T + + K ++ + V T Sbjct: 13 QSIILCAFTVALQ-IITHESLAQKKSKEPEITIEDVKQKCQNLPRAQRVIIKVARFSVST 71 Query: 61 GQFKPYPASNFSTAVPQSATAMLVTALKDSRWFIPLE-RQGLQNLLNERKIIRAAQENGT 119 A+ + ML +AL+ + F +E + L + +E + NG Sbjct: 72 ------KAAQARSTFGDELATMLTSALQQTNCFRVMETNKNLSDATSEMAFAQDGFTNG- 124 Query: 120 VAINNRIPLQSLTAANIMVEGSIIGYESNVKSGGVGARYFGIGADTQYQLDQIAVNLRVV 179 + + A ++V G I + S + +G +++ + NL+V+ Sbjct: 125 ----SGPQAGQMLGAQLIVTGEITDFSEGSSSKSI------LGVESKSNQATVGFNLKVL 174 Query: 180 NVSTGEILSSVNTSKTILSYEVQAGVFRFIDYQRLLEGEVGYTSNEPVMLCLMSAIETGV 239 N TGE+L S + + + +F N V L AI V Sbjct: 175 NPQTGELLFSKDVNMKGHNSGKVLDIFGVKTS--------SSNENRAVQDALQKAIIKAV 226 Query: 240 IFLIND 245 L ++ Sbjct: 227 EILADE 232 >UniRef50_B8GVR1 Putative uncharacterized protein n=2 Tax=Caulobacter vibrioides RepID=B8GVR1_CAUCN Length = 275 Score = 90.3 bits (222), Expect = 6e-17, Method: Composition-based stats. 
Identities = 46/259 (17%), Positives = 86/259 (33%), Gaps = 40/259 (15%) Query: 1 MQRLFLLVAVMLLSGCLTAPPKEAARPTLMPRAQSYKDLTHLPAPTGKIFVSVYNIQDET 60 M+ + V + L++G A K AAR L+H AP + V+ + + Sbjct: 19 MRPAIIAVGLSLIAGSAIAETKVAARDL-------SSLLSHCEAPVAALTVTAFKCKASA 71 Query: 61 GQFKPYPASNFSTA-----------------VPQSATAMLVTALKDSRWFIPLERQGLQN 103 P P SN + + L TALK + F + R+ ++ Sbjct: 72 CSVAPAPGSNTGLGALMSMAQAAQGLQTFPNIGDGLSNALTTALKTTGCFKVMAREDFED 131 Query: 104 LLNERKIIRAAQENGTVAINNRIPLQSLTAANIMVEGSIIGYESNVKSGGVGARYFGIGA 163 L E + ++ + A+ +V G+I K+ G + + Sbjct: 132 LRREAEAAGITLKSAS--------------ADYLVTGAITSLAVGAKTQSFGGGFVPLVG 177 Query: 164 --DTQYQLDQIAVNLRVVNVSTGEILSSVNTSKTILSYEVQAGVFRFIDYQRLLEGEVGY 221 + I++++R+V+V E+ +S + AG + L Sbjct: 178 AVSRSTKSANISIDVRLVDVKASEVKASQTFDVSNERSSWGAGGAGWGGSGALFGAASST 237 Query: 222 TSNEPVMLCLMSAIETGVI 240 S E + S I+ Sbjct: 238 QSPELDSVANESVIQAANY 256 >UniRef50_B7A9M7 Putative uncharacterized protein n=1 Tax=Thermus aquaticus Y51MC23 RepID=B7A9M7_THEAQ Length = 225 Score = 89.5 bits (220), Expect = 1e-16, Method: Composition-based stats. 
Identities = 40/241 (16%), Positives = 85/241 (35%), Gaps = 28/241 (11%) Query: 2 QRLFLLVAVMLLSGCLTAPPKEAARPTLMPRAQSYKDLTHLPAPTGKIFVSVYNIQDETG 61 +L+ LL GC+ P+ A T Q+ + P T V+V + + Sbjct: 11 TSFLMLLFGALLVGCV---PQAATPTTPGSLPQTVQIRYDGPRET----VAVIDTTNI-- 61 Query: 62 QFKPYPASNFSTAVPQSATAMLVTALKDSRWFIPLERQGLQNLLNERKIIRAAQENGTVA 121 S T L+ F +ER L +L E+++ A ++ Sbjct: 62 ---RLSGSPLKERFLAILTEELIVHPYFKDRFSLVERVKLDQVLKEQRLSAAG-----LS 113 Query: 122 INNRIPLQSLTAANIMVEGSIIGYESNVKSGGVGARYFGIGADTQYQLDQIAVNLRVVNV 181 + + L A ++ S++ +++ +S G +G + V L +V+ Sbjct: 114 PTDAPRIGKLLGARYLLLASVVNAKTSRRSTGA------LGIRIDEVQGTVEVALSLVDS 167 Query: 182 STGEILSSVNTSKTILSYEVQAGVFRFIDYQRLLEGEVGYTSNEPVMLCLMSAIETGVIF 241 G +L+ V S++ + ++ E ++ + + A+E ++ Sbjct: 168 ENGRVLARVLVSQSDTRVARVSTGLAGVNLDPSEEL-----VSDLLRKAVKEALEKMLVQ 222 Query: 242 L 242 L Sbjct: 223 L 223 >UniRef50_A2SLP6 Putative uncharacterized protein n=1 Tax=Methylibium petroleiphilum PM1 RepID=A2SLP6_METPP Length = 292 Score = 88.7 bits (218), Expect = 2e-16, Method: Composition-based stats. 
Identities = 48/240 (20%), Positives = 83/240 (34%), Gaps = 26/240 (10%) Query: 13 LSGCLTAPPKEAARPTLMPRAQSYKDLTHLPAPTGKIFVSVYNIQDETG-----QFKPYP 67 ++GC T+ + + + P V + G + + Sbjct: 18 IAGCSTSKTEIGGPSDMA--------IADQAPPQEG---GVGRCEKRLGTVAITESEVNS 66 Query: 68 ASNFSTAVPQSATAMLVTALKDSRWFIPLERQGLQNLL-NERKIIRAAQENGTVAINNR- 125 + S +P+S ++ L S F ++R +LL ER++ + + Sbjct: 67 QALMSAGLPRSMAPLVRHLLIRSGCFNVVDRGAAYSLLEAERRLREQLGTDANATVARHL 126 Query: 126 IPLQSLTAANIMVEGSIIGYESNVKSGGVGARYFGIGADTQYQLDQIAVNLRVVNVSTGE 185 PL + A I+ I G V G G GIG QY + V L VV+ T E Sbjct: 127 QPLDYILRAEIVFAEQI-GQSKGVLGGVFGDVIGGIGG--QYNKKEAVVLLSVVDARTSE 183 Query: 186 ILSSVNTSKTILSYEVQAGVFRFIDYQRLLEGEVGYTSNEPVMLCLMSAIETGVIFLIND 245 I SSV T S AG+ + + + G ++ P + +A+ + Sbjct: 184 ITSSVFGRGTSDS----AGLGSLVLSSGVFAIDGG-WADTPQAKTVAAALVDAWNRTLPK 238 >UniRef50_Q30SV1 Putative uncharacterized protein n=1 Tax=Sulfurimonas denitrificans DSM 1251 RepID=Q30SV1_SULDN Length = 334 Score = 83.7 bits (205), Expect = 6e-15, Method: Composition-based stats. Identities = 36/204 (17%), Positives = 81/204 (39%), Gaps = 32/204 (15%) Query: 1 MQRLFLLVAVMLLSGCLTAPPKEAARPTLMPRAQSYKDLTHLPAPTGKIFVSVYNIQDET 60 MQ L L+ LL+ L A L P LT +SV N ++ Sbjct: 1 MQVLKWLILAFLLTVFLAGCSHRVAIRALEPAEIDRATLTR--------KISVTNFEN-- 50 Query: 61 GQFKPYPASNFSTAVPQSATAMLVTALKDSRWFIPLERQGLQNLLNERKIIRAAQENGTV 120 + S + T ++ + D +F + R+ +++E+KI Q +G V Sbjct: 51 ------DSVGLSNKIE---TKIISKKIDDKSYFTLISRKDFDKIISEQKI----QNSGLV 97 Query: 121 AINNRIPLQSLTAANIMVEGSIIGYESNVKS---GGVGARYFGIGADTQY------QLDQ 171 I+ + + ++ A ++ G + N + + + ++Y + Sbjct: 98 DISTAVEVGNILGAEAIISGGVGRVAFNDTTYYERRIRCNDKKCKSVSEYSVRCIKRNIG 157 Query: 172 IAVNLRVVNVSTGEILSSVNTSKT 195 ++ +LR+++++ G+I+ + +K+ Sbjct: 158 LSADLRMIDIAKGDIIYANTFNKS 181 >UniRef50_B0SIM6 Hypothetical lipoprotein n=2 Tax=Leptospira biflexa serovar Patoc RepID=B0SIM6_LEPBA Length = 166 Score = 83.3 bits (204), Expect = 8e-15, Method: Composition-based stats. 
Identities = 34/200 (17%), Positives = 61/200 (30%), Gaps = 38/200 (19%) Query: 1 MQRLFLLVAVMLLSGCLTAPPKE--AARPTLMPRAQSYKDLTHLPAPTGKIFVSVYNIQD 58 + L + + + S C +E +PT+ P Q L+ + V D Sbjct: 3 FKGFILALVLGVFSACYLGEERESKPKKPTVPPLEQLAISLSEKGFYFQPERLVVLTFLD 62 Query: 59 ETGQFKPYPASNFSTAVPQSATAMLVTALKDSRWFIPLERQGLQNLLNERKIIRAAQENG 118 G+ PY + L T L F+ L+R Q +L E + + Sbjct: 63 NEGKKSPY---------GEILAEKLTTELVKKDRFLILDRLANQKVLKEAGLSLDS---- 109 Query: 119 TVAINNRIPLQSLTAANIMVEGSIIGYESNVKSGGVGARYFGIGADTQYQLDQIAVNLRV 178 + + +++ G + Y D + VN R+ Sbjct: 110 PTDTATLRKIGEVLKVGVIITGIVTPY-----------------------QDGVFVNTRL 146 Query: 179 VNVSTGEILSSVNTSKTILS 198 + + TG IL + I Sbjct: 147 IEIKTGLILKADEVYVRIDG 166 >UniRef50_A3WBM1 Putative uncharacterized protein n=1 Tax=Erythrobacter sp. NAP1 RepID=A3WBM1_9SPHN Length = 328 Score = 82.2 bits (201), Expect = 2e-14, Method: Composition-based stats. Identities = 34/164 (20%), Positives = 66/164 (40%), Gaps = 21/164 (12%) Query: 48 KIFVSVYNIQDETGQFKPYPASNFSTAVPQSATAMLVTALKDSRWFIPLERQGLQNLLNE 107 + V + ++D TG N V M+ TA+ S F +ER L L+ E Sbjct: 52 RPIVGIAQMEDLTG------GGNADNFVA-----MIETAIIGSGKFRIIERARLATLMEE 100 Query: 108 RKIIRAAQENGTVAINNRIPLQSLTAANIMVEGSIIGYESNVKSGGVGARYFGIGADTQY 167 + + +GT N + + +V G+I + +S G+ G+ ++ Sbjct: 101 QGLA----LSGTTTTNRPGQVGGFEGVDYLVYGTISSISATNRSDIGGSMLRGLLGGSRN 156 Query: 168 QLD------QIAVNLRVVNVSTGEILSSVNTSKTILSYEVQAGV 205 + D ++ ++R+ + +TGE+ + S+ S V G Sbjct: 157 RPDCYKTRVRMEADIRITDTNTGEVRYATRISEEQDSATVCGGG 200 >UniRef50_C1QBW4 Putative uncharacterized protein n=1 Tax=Brachyspira murdochii DSM 12563 RepID=C1QBW4_9SPIR Length = 490 Score = 81.8 bits (200), Expect = 2e-14, Method: Composition-based stats. 
Identities = 31/199 (15%), Positives = 74/199 (37%), Gaps = 41/199 (20%) Query: 1 MQRLFLLVAVMLLSGCLTAPPKEAARPTLMPRAQSYKDLTHLPAPTGKIFVSVYNIQDET 60 M+ + + ++ S T P++ R + ++V+ I+D + Sbjct: 1 MKNIIFTLFFLIFS--FTLFPQQMNREIGKTYTKEN--------------IAVFEIEDSS 44 Query: 61 GQFKPYPASNFSTAVPQSATAMLVTALKDSRWFIPLERQGLQNLLNERKIIRAAQENGTV 120 ++ S + + TA++ +L F ++R+ L L E ++ + + Sbjct: 45 SRY--------SQGLGKQLTALIEDSLTKMNRFNIVDRKNLDKYLKEMEL-----QLTGI 91 Query: 121 AINNRIPLQSLTAANIMVEGSIIGYE--SNVKSGGVGARYFGIGADTQYQLDQIAVNLRV 178 + I + + + V+G+I+ + N S +G Q+ + L++ Sbjct: 92 TESQVIEVGKIYGYSKAVKGNIVNADVSYNYDSDTGSGSLYG----------QVEMVLQI 141 Query: 179 VNVSTGEILSSVNTSKTIL 197 V+V T +I+ S Sbjct: 142 VDVETTKIMYSSKLQGISY 160 >UniRef50_Q93HR4 Putative uncharacterized protein (Fragment) n=1 Tax=Thermus thermophilus RepID=Q93HR4_THETH Length = 149 Score = 81.8 bits (200), Expect = 3e-14, Method: Composition-based stats. Identities = 37/161 (22%), Positives = 51/161 (31%), Gaps = 34/161 (21%) Query: 1 MQRLFLLVAVMLLSGCLTAPPKEAARPTLMPRAQSYKDLTHLPAPTGKIFVSVYNIQDET 60 M+R +LL+ V LS C L P P K+ V+ + + E Sbjct: 1 MKRAWLLLGVFALSACAP-QVTTKVDTGLSPDNP----YATYTGPRAKVVVASFPCKAEK 55 Query: 61 ---------------GQFKPYPASNFSTAVPQSATAMLVTALKDSRWFIPLERQGLQNLL 105 G+ S + ML TAL +S FI ER L L Sbjct: 56 CGPGVDSSQVGAAVLGKLFGIEVSTSKGDIGAGIADMLTTALINSNHFIVYERSVLDQLQ 115 Query: 106 NERKIIRAAQENGTVAINNRIPLQSLTAANIMVEGSIIGYE 146 E +I AQ L A I++ G+I Sbjct: 116 KESQIGNQAQ--------------QLQGAEILITGTITALS 142 >UniRef50_C1AEH8 Putative uncharacterized protein n=1 Tax=Gemmatimonas aurantiaca T-27 RepID=C1AEH8_GEMAT Length = 273 Score = 81.0 bits (198), Expect = 4e-14, Method: Composition-based stats. 
Identities = 32/203 (15%), Positives = 66/203 (32%), Gaps = 41/203 (20%) Query: 48 KIFVSVYNIQDETGQFKPYPASNFSTAVPQSATAMLVTALKDSRWFIPLERQGLQNLLNE 107 K V++ + + + + ML+T L + +ER LQ+LL+E Sbjct: 65 KPTVAIMYFTN-----GAISNNAEYAPLSKGLAEMLITELSGNENIRVVERDRLQSLLDE 119 Query: 108 RKIIRAAQENGTVAINNRIPLQSLTAANIMVEGSIIGYESNVKSGGVGARYFGIGADTQY 167 + + +G VA + A M+ GS + Sbjct: 120 QNL----GASGRVATETAAKIGKTLGALHMLMGSFV----------------------ID 153 Query: 168 QLDQIAVNLRVVNVSTGEILSSVNTSKTI---------LSYEVQAGVFRFIDYQRLLEGE 218 + + +++R +N T E+ + + + L ++ G+ + QR E Sbjct: 154 PKNTMRMDVRAINTETSELEYATSVTGKADKMLELLGELGTKLNTGL-KLPSVQRGFEEG 212 Query: 219 VGYTSNEPVMLCLMSAIETGVIF 241 + P L M + + Sbjct: 213 KAVGAKGPNQLKSMMLLSRALEQ 235 >UniRef50_B4SIB8 Putative uncharacterized protein n=3 Tax=Bacteria RepID=B4SIB8_STRM5 Length = 551 Score = 77.2 bits (188), Expect = 6e-13, Method: Composition-based stats. Identities = 30/165 (18%), Positives = 62/165 (37%), Gaps = 18/165 (10%) Query: 41 HLPAPTGKIFVSVYNIQDETGQFKPYPASNFSTAVPQSATAMLVTALKDSRWFIPLERQG 100 P GK + V + + G + + V + A L L ++ FI L+R+ Sbjct: 233 RAPDEQGKPKIVVALPRTKAGSYAVGDGRVDADEVADAIRARLSDTLTQTQRFIVLDREF 292 Query: 101 LQNLLNERKIIRAAQENGTVAINNRIPLQSLTAANIMVEGSIIGYES-----NVKSGGVG 155 L E I +G V + + + A ++++ +I +E N++ Sbjct: 293 GDELQAEIDHIN----SGNVRLQDTARVGQQLATDLILIPTIERFEYPRSVRNLRMSDRQ 348 Query: 156 ARYFGIGADTQYQLDQIAVNLRVVNVSTGEILSSVNTSKTILSYE 200 + G + LR++N +TG+++ S + + S Sbjct: 349 VTSYSGGGR---------ITLRLINATTGQVVMSDSFDHQLASTG 384 >UniRef50_C1A9F6 Putative uncharacterized protein n=1 Tax=Gemmatimonas aurantiaca T-27 RepID=C1A9F6_GEMAT Length = 260 Score = 76.4 bits (186), Expect = 9e-13, Method: Composition-based stats. 
Identities = 30/152 (19%), Positives = 50/152 (32%), Gaps = 34/152 (22%) Query: 44 APTGKIFVSVYNIQDETGQFKPYPASNFSTAVPQSATAMLVTALKDSRWFIPLERQGLQN 103 A + V+V + TG+ + Q AM+ T L LERQ L + Sbjct: 23 AAPARKTVAVLAFDNNTGKTDY-------DHLGQGMAAMMTTDLAAVDEIQLLERQRLAD 75 Query: 104 LLNERKIIRAAQENGTVAINNRIPLQSLTAANIMVEGSIIGYESNVKSGGVGARYFGIGA 163 + E Q + + + L A ++ GS+ E Sbjct: 76 VTKE----IDNQRSQYFDSTTAVKVGRLAGAQYIIVGSLAAVEP---------------- 115 Query: 164 DTQYQLDQIAVNLRVVNVSTGEILSSVNTSKT 195 Q+ ++ R+V V TG I+ + S Sbjct: 116 -------QVRIDTRIVRVETGAIVKTAKVSGK 140 >UniRef50_A9BZV7 Peptidoglycan-binding domain 1 protein n=4 Tax=Comamonadaceae RepID=A9BZV7_DELAS Length = 586 Score = 76.0 bits (185), Expect = 1e-12, Method: Composition-based stats. Identities = 44/290 (15%), Positives = 88/290 (30%), Gaps = 33/290 (11%) Query: 1 MQRLF--LLVAVMLLSGCLTAPPKEAARPTLMPRAQSYKDLTHLPAPTGKIFVSVYNIQD 58 M+ + + +++LLSGC ++ A+ + P + S+ + D Sbjct: 42 MRAIACAVSASIVLLSGCAPLDARKDAK--------YLDKVYAADRPVVRPVRSISSFSD 93 Query: 59 E--------------TGQFKPYPASNFSTAVPQSATAMLVTALKDSRWFIPLERQGLQNL 104 T ++S V +A M+VTAL + + Sbjct: 94 SLMCMDRMFRNSHMPTVLITSKQLPDYSGRVAVAAKEMVVTALSQMSRVS----SAFRYV 149 Query: 105 LNERKIIRAAQENGTVAINNRIPLQSLTAANIMVEGSIIGYESNVKSG----GVGARYFG 160 E I R I L + V G++ + NV G A Sbjct: 150 DYEVDIARQDTVQNLTTILLNNNQIQLQRPALYVSGAVSFMDQNVLRNNVDIGTSASRLE 209 Query: 161 IGADTQYQLDQIAVNLRVVNVSTGEILSSV-NTSKTILSYEVQAGVFRFIDYQRLLEGEV 219 G + + L + + T I+ + + ++ ++ Q ++ V Sbjct: 210 TGYSANRSGAVLGMELHLGDFRTRTIIPGLDSANEVVIGNGSQGLDLSGRIGTYGVQFSV 269 Query: 220 GYTSNEPVMLCLMSAIETGVIFLINDGIDRGLWDLQNKAERQNDILVKYR 269 G + + + +E G+I L+ W + D + R Sbjct: 270 GRDYAQGSGPAVRTLVELGMIELLGKWSRLPYWQCLTLDQTHPDFQRQLR 319 >UniRef50_B6BP75 Peptidoglycan-binding domain 1 protein n=1 Tax=Candidatus Pelagibacter sp. HTCC7211 RepID=B6BP75_9RICK Length = 346 Score = 75.6 bits (184), Expect = 2e-12, Method: Composition-based stats. 
Identities = 35/166 (21%), Positives = 68/166 (40%), Gaps = 22/166 (13%) Query: 80 TAMLVTALKDSRWFIPLERQ-GLQNLLNERKIIRAAQENGTVAINNRIPLQSLTAANIMV 138 T+++ ++ S FI LER +QNLL ER + +G + + + + A+ ++ Sbjct: 90 TSLIRLIVQQSNCFIVLERGIAMQNLLQERTLS----SSGELKQDQNMGKGQMITADYIL 145 Query: 139 EGSIIGYESNVKSGG----------VGARYFGIGADTQYQLDQIAVNLRVVNVSTGEILS 188 +II E+N G G+ IG ++ Q ++ L + +G ++ Sbjct: 146 TPTIIFKEANTGGVGGLLGGLLPGNAGSVAGIIGGSLKFSESQTSLTL--ADTRSGIQVA 203 Query: 189 SVNTSKTILSYEVQAGVFRFIDYQRLLEGEVGYTSNEPVMLCLMSA 234 + + S+ V AG+ +G SN P + +A Sbjct: 204 AAAGASKKSSFGVVAGL-----GGSSAAAGLGAYSNTPEGKVVAAA 244 >UniRef50_A6VLC8 Peptidoglycan-binding domain 1 protein n=2 Tax=Pasteurellaceae RepID=A6VLC8_ACTSZ Length = 528 Score = 72.9 bits (177), Expect = 1e-11, Method: Composition-based stats. Identities = 42/294 (14%), Positives = 94/294 (31%), Gaps = 51/294 (17%) Query: 4 LFLLVAVMLLSGCLTAP-----------------PKEAARPTLMPRAQSYKDLTHLPAP- 45 VA +L+GC T P + + S + +L Sbjct: 7 FLSYVAASVLAGCSTGTQLHGDRPYIESTTTHTRPVSEPVRAMTSFSDSLNCMDNLLLQS 66 Query: 46 -TGKIFVSVYNIQDETGQFKPYPASNFSTAVPQSATAMLVTALKD----SRWFIPLERQG 100 G+ V++ I D +G+ +A M+VTAL S F + Sbjct: 67 NAGQTVVAIKTINDPSGKALV------------AANDMIVTALSQMSRTSGAFRVV---D 111 Query: 101 LQNLLNERKIIRAAQENGTVAINNRIPLQSLTAANIMVEGSIIGYESNV--KSGGVGARY 158 + ++ ++ + +IP + + G++ + NV KS G Sbjct: 112 FEINPMKQDTVQTLSSLLLPTGSMQIPAPQI-----YISGAVSYVDQNVLKKSDSAGISV 166 Query: 159 ---FGIGADTQYQLDQIAVNLRVVNVSTGEILSSVNTSKTILSYEVQAGVFRFIDYQRLL 215 +G + + L + + T + ++++ I++ G+ R + Sbjct: 167 SDDVELGISGDLITTALGMELHIGDFLTRTLYPGIDSANEIVAANKGFGL-DSGAKIRKV 225 Query: 216 EGEVGYTSN--EPVMLCLMSAIETGVIFLINDGIDRGLWDLQNKAERQNDILVK 267 + + N + V + + + G+I L+ W + + + + Sbjct: 226 GIQFSFERNLSQGVGSAVRTLTDLGMIELVGKYAQVPYWQCLSLDQSHPEFQRQ 279 >UniRef50_Q7UMD1 Putative uncharacterized protein n=1 Tax=Rhodopirellula baltica RepID=Q7UMD1_RHOBA Length = 530 Score = 68.7 bits (166), Expect = 2e-10, Method: Composition-based stats. 
Identities = 31/156 (19%), Positives = 62/156 (39%), Gaps = 36/156 (23%) Query: 38 DLTHLPAPTGKIFVSVYNIQDETGQFKPYPASNFSTAVPQSATAMLVTALKDSRWFIPLE 97 + P + ++V I+D+ G ++ + + + T LV R ++ Sbjct: 193 QVGDEVEPNLDLRIAVCPIRDQNGN-----TADETLVMAEDLTTRLVN-----RRVPVVD 242 Query: 98 RQGLQNLLNERKIIRAAQENGTVAINNRIPLQSLTAANIMVEGSIIGYESNVKSGGVGAR 157 R+ L ++L+E AQ + L L+ A +V G I+ N ++ G+ Sbjct: 243 RESLGSVLDEL----LAQNSILFDPKTAQKLGELSGATHVVAGKIVA---NGRTRGI--- 292 Query: 158 YFGIGADTQYQLDQIAVNLRVVNVSTGEILSSVNTS 193 V +R+++V TG I+ + +TS Sbjct: 293 ----------------VYVRLIDVQTGRIVVATSTS 312 >UniRef50_A7ZCR3 Peptidoglycan-binding domain 1 protein n=3 Tax=Bacteria RepID=A7ZCR3_CAMC1 Length = 264 Score = 67.5 bits (163), Expect = 5e-10, Method: Composition-based stats. Identities = 33/252 (13%), Positives = 86/252 (34%), Gaps = 29/252 (11%) Query: 2 QRLFLLVAVMLLSGCLTAPPKEAARPTLMPRAQSYKDLTHLPAPTGKIFVSVYNIQDETG 61 + + A+ L +GC ++ + + G + +D++ Sbjct: 9 KVFSSVAALCLFAGCASSNSGVTGAAAGDTAKNANTKIERCSQTLGTLSF----YEDQSS 64 Query: 62 QFKPYPASNFSTAVPQSATAMLVTALKDSRWFIPLERQG-LQNLLNERKIIRAAQENGTV 120 Y + + + + A + + F+ +ER + N++ ER + + +G + Sbjct: 65 --SWYSYLTRDYQLGSTVPVLRILA-QQTGCFVIVERGRSMDNMMQERAL----EASGEL 117 Query: 121 AINNRIPLQSLTAANIMVEGSIIGY--ESNVKSGGVGARYFGIGA--DTQYQLDQIAVNL 176 ++ + AA+ ++ I ++ SG VGA + + + + +L Sbjct: 118 RKGSKFHKGQVVAADYTMQPEITFSKEDTGGISGLVGAVFGNVAGKVSGGFSKSETQTSL 177 Query: 177 RVVNVSTGEILSSVNTSKTILSYEVQAGVFRFIDYQ----RLLEGEVGYTSNEPVMLCLM 232 +++ +G ++ S F F + +G + P ++ Sbjct: 178 LLIDNRSGVQIAGAVGSD---------SNFDFFGMGSNSFSRVSAGLGGYTKTPEGRMIV 228 Query: 233 SAIETGVIFLIN 244 +A + LI Sbjct: 229 NAFMDAMNQLIV 240 >UniRef50_C0QZW0 Curli production assembly/transport component CsgG n=1 Tax=Brachyspira hyodysenteriae WA1 RepID=C0QZW0_BRAHW Length = 489 Score = 67.2 bits (162), Expect = 6e-10, Method: Composition-based stats. 
Identities = 26/155 (16%), Positives = 55/155 (35%), Gaps = 22/155 (14%) Query: 40 THLPAPTGKIFVSVYNIQDETGQFKPYPASNFSTAVPQSATAMLVTALKDSRWFIPLERQ 99 + K ++V+ IQ +S + + TA++ +L F ++R+ Sbjct: 24 KEIGETYEKENIAVFEIQ--------STSSGYGEELGPKMTALIENSLTRMNRFNIVDRK 75 Query: 100 GLQNLLNERKIIRAAQENGTVAINNRIPLQSLTAANIMVEGSIIGYESNVKSGGVGARYF 159 L L E ++ + + I + + + V G I+ V+ G+ Sbjct: 76 NLDKYLKEMEL-----QLTGITEKQVIEVGKIYGYSKAVTGKIVSANVTVEYNDDGSFSL 130 Query: 160 GIGADTQYQLDQIAVNLRVVNVSTGEILSSVNTSK 194 + + L++V+V T +IL S Sbjct: 131 ---------YSTVDMVLQIVDVETTKILYSSQLQG 156 >UniRef50_C0QSR9 Putative lipoprotein n=1 Tax=Persephonella marina EX-H1 RepID=C0QSR9_PERMH Length = 399 Score = 66.8 bits (161), Expect = 9e-10, Method: Composition-based stats. Identities = 33/192 (17%), Positives = 65/192 (33%), Gaps = 30/192 (15%) Query: 72 STAVPQSATAMLVTALKDSRWFIPLERQGLQNLLNERKIIRAAQENGTVAINNRIPLQSL 131 + + +S + L + R+ LQ ++ E++ Q++G +N + L L Sbjct: 127 NARLGESVAEGVTAQLVEMGGAKVYTRRDLQKVMQEQQF----QQSGLTDVNTLVQLGKL 182 Query: 132 TAANIMVEGSIIGYESNVKS------------GGVGARYFGIGADTQYQLDQIAVNLRVV 179 ++ GS+ S G VGA + + ++++ Sbjct: 183 AGVKYIITGSVNNVNLKWISAEYAKKGLSQHLGLVGAIAAAAIETQEGWNLSTDLTIKII 242 Query: 180 NVSTGEILSSVNTSKTILSYEVQAGVFRFIDYQRLLEGEVGYTSNEPVMLCLMSAIETGV 239 +V TGE++ + K I EV + + +G + M+AI Sbjct: 243 DVETGEVVLA----KNISGREVLGKTPQLT-----FDAIIG-----GIKKAAMNAIAEAK 288 Query: 240 IFLINDGIDRGL 251 L RG Sbjct: 289 EDLSKYFKVRGY 300 >UniRef50_Q0VKU1 Putative uncharacterized protein n=2 Tax=Alcanivorax RepID=Q0VKU1_ALCBS Length = 430 Score = 66.4 bits (160), Expect = 9e-10, Method: Composition-based stats. 
Identities = 23/145 (15%), Positives = 51/145 (35%), Gaps = 10/145 (6%) Query: 49 IFVSVYNIQDETGQFKPYPASNFSTAVPQSATAMLVTALKDSRWFIPLERQGLQNLLNER 108 +++ + + ++ V L A S + L+R L ++ E Sbjct: 138 PGIAIATFESAKSSYDLGDIKVPASQVQHQLQDNLTMAFSQSGRYRVLDRTYLADVDEEL 197 Query: 109 KIIRAAQENGTVAINNRIPLQSLTAANIMVEGSIIGYESNVKSGGVGARYFGIGADTQYQ 168 ++ G++A L A++++ G+I ++ + GA Sbjct: 198 GVV----AQGSIAPEEMARLGQRKGADLLLVGTIEDFQIGDSAQAF------YGAKMGGY 247 Query: 169 LDQIAVNLRVVNVSTGEILSSVNTS 193 + V R+++ +T EIL S Sbjct: 248 APYVRVRYRLIDTTTTEILWSDLYE 272 >UniRef50_B7IGU6 Putative uncharacterized protein n=1 Tax=Thermosipho africanus TCF52B RepID=B7IGU6_THEAB Length = 396 Score = 66.0 bits (159), Expect = 1e-09, Method: Composition-based stats. Identities = 18/133 (13%), Positives = 44/133 (33%), Gaps = 20/133 (15%) Query: 68 ASNFSTAVPQSATAMLVTALKDSRWFIPLERQGLQNLLNERKIIRAAQENGTVAINNRIP 127 + ++ ++L + F R L ++ ER + + Sbjct: 36 GNGWNMDEADFLISILEQKALELGRFRVFSRNDLDMIVKERNLGDLGI------VEQTFE 89 Query: 128 LQSLTAANIMVEGSIIGYESNVKSGGVGARYFGIGADTQYQLDQIAVNLRVVNVSTGEIL 187 + A + ++ SN + G + ++L++ ++ TGE+L Sbjct: 90 AGKILGARYAILLTLTELTSNYEKNG--------------YTASLRLSLKLYDLKTGELL 135 Query: 188 SSVNTSKTILSYE 200 +S +K + E Sbjct: 136 ASTPFAKETYTEE 148 >UniRef50_A8UTU4 Curli production assembly/transport component CsgG n=2 Tax=Hydrogenivirga sp. 128-5-R1-1 RepID=A8UTU4_9AQUI Length = 258 Score = 65.2 bits (157), Expect = 3e-09, Method: Composition-based stats. 
Identities = 36/170 (21%), Positives = 58/170 (34%), Gaps = 31/170 (18%) Query: 95 PLERQGLQNLLNERKIIRAAQENGTVAINNRIPLQSLTAANIMVEGSIIGYESNVKSGGV 154 R LQ +L E+K Q +G V N + + L +V GS+ G Sbjct: 3 IYTRNDLQKVLQEQKF----QMSGLVDPNTAVQIGQLAGVKYIVTGSVNNINLKWVDVGE 58 Query: 155 G------------ARYFGIGADTQYQ-LDQIAVNLRVVNVSTGEILSSVNTSKTILSYEV 201 G +GA TQ I + ++V++ TGE++ +KT+ EV Sbjct: 59 GVKRGLSEHLGLLGTALAVGASTQEGWNLSIDIVVKVIDTETGEVV----LTKTVSGREV 114 Query: 202 QAGVFRFIDYQRLLEGEVGYTSNEPVMLCLMSAIETGVIFLINDGIDRGL 251 F ++ ++ G A+E L RG Sbjct: 115 LGKTPTF-NFDSIIGG---------AKKAAQEALEDIRPELSKLFPLRGY 154 >UniRef50_B5WBR1 Putative uncharacterized protein n=1 Tax=Burkholderia sp. H160 RepID=B5WBR1_9BURK Length = 267 Score = 64.5 bits (155), Expect = 4e-09, Method: Composition-based stats. Identities = 36/260 (13%), Positives = 77/260 (29%), Gaps = 36/260 (13%) Query: 1 MQRLFLLVAVMLLSGCLTAP-----PKEAARPTLMPRAQSYKDLTHLPAPTGKIFV---- 51 L + M L+ C + A + ++ AP G I + Sbjct: 5 FSTLAAVAFAMSLTACSQMQLAGGGQSPDSTAVAGSTAPTADNMHRCAAPLGTIAIQEDT 64 Query: 52 SVYNIQDETGQFKPYPASNFSTAVPQSATAMLVTALKDSRWFIPLERQG-LQNLLNERKI 110 + TGQ++ + Q S F+ ++R L + ER + Sbjct: 65 ASPWYSLLTGQYQLGSTVPVLKMLVQ-----------QSNCFVIVDRGRALNQAMQERAL 113 Query: 111 IRAAQENGTVAINNRIPLQSLTAANIMVEGSIIGYESNVKSGGVGARYFGIGADTQYQLD 170 + G + +N+I + A+ + SI N S + Q+ Sbjct: 114 ----GDAGELRASNKIKKGKMVVADYTMTPSITFSNQNAGSIAGVLSVIPVVGGVAAQVA 169 Query: 171 ------QIAVNLRVVNVSTGEILSSVNTSKTILSYEVQAGVFRFIDYQRLLEGEVGYTSN 224 + L +V + +++ S + + + G S+ Sbjct: 170 GQVNTKSASTTLTLVENRSTVQIAAATGSARNMDIGAIGSLMSSHSGGTV-----GGYSD 224 Query: 225 EPVMLCLMSAIETGVIFLIN 244 P +++A + L++ Sbjct: 225 TPEGKVIVAAFTDSLNNLVD 244 >UniRef50_A7BYD5 Putative uncharacterized protein n=1 Tax=Beggiatoa sp. PS RepID=A7BYD5_9GAMM Length = 334 Score = 64.1 bits (154), Expect = 5e-09, Method: Composition-based stats. 
Identities = 24/186 (12%), Positives = 57/186 (30%), Gaps = 43/186 (23%) Query: 11 MLLSGCLTAPPKEAARPTLMPR------AQSYKDLTHLPAPTGKIFVSVYNIQDETGQFK 64 + L C + T + + + K V+V + D +G Sbjct: 10 LFLWVCAVPIIFMSVNATAYEYEYEKEITRLSATMAEKISAANKTKVAVVDFTDISGNV- 68 Query: 65 PYPASNFSTAVPQSATAMLVTALKDSRW-FIPLERQGLQNLLNERKIIRAAQENGTVAIN 123 T + + AL + F ++R L +++ E K+ + G + Sbjct: 69 --------THLGRFIAEEFSVALVSAGKGFQVVDRIHLHSIIKEHKLSKT----GLIDPK 116 Query: 124 NRIPLQSLTAANIMVEGSIIGYESNVKSGGVGARYFGIGADTQYQLDQIAVNLRVVNVST 183 L + ++ G++ + D I + +++++ ST Sbjct: 117 TARELGKIAGVEALITGTLTPFG-----------------------DSIRIVVKILDTST 153 Query: 184 GEILSS 189 I+ + Sbjct: 154 AVIIDA 159 >UniRef50_C1TRF7 Tetratricopeptide repeat protein n=1 Tax=Dethiosulfovibrio peptidovorans DSM 11002 RepID=C1TRF7_9BACT Length = 382 Score = 64.1 bits (154), Expect = 6e-09, Method: Composition-based stats. Identities = 23/117 (19%), Positives = 49/117 (41%), Gaps = 14/117 (11%) Query: 29 LMPRAQSYKDLTHLPAPTGKIFVSVYNIQDETGQFKPYPASNFSTAVPQSATAMLVTALK 88 ++ A + L A + + V++ + A S +V QS ML + L Sbjct: 7 IVSVAVAVGLLFGTAAFSAPMTVAIGDFN----------ARGASYSVGQSVVEMLYSRLA 56 Query: 89 DSRWFIPLERQGLQNLLNERKIIRAAQENGTVAINNRIPLQSLTAANIMVEGSIIGY 145 +R F +ER L + +++I +G V+ + + + + A V+G++ + Sbjct: 57 GNRAFRLVERGQLDQVARQQRIT----MSGMVSQESAVEIGRIVGAKYYVQGAVSHF 109 >UniRef50_C1SMS2 Putative uncharacterized protein n=1 Tax=Denitrovibrio acetiphilus DSM 12809 RepID=C1SMS2_9BACT Length = 397 Score = 62.9 bits (151), Expect = 1e-08, Method: Composition-based stats. 
Identities = 29/219 (13%), Positives = 68/219 (31%), Gaps = 32/219 (14%) Query: 46 TGKIFVSVYNIQ-DETGQFKPYPASNFSTAVPQSATAMLVTALKDS----RWFIPLERQG 100 + + V + +T +F + + +A++++ +R Sbjct: 98 ATPVGIGVGGVSASKTDTNYSGNIDSFMREIAPNIGTYAQSAVENTMSGIGGMKIYDRSH 157 Query: 101 LQNLLNERKIIRAAQENGTVAINNRIPLQSLTAANIMVEGSI----IGYESNVKSGGVGA 156 LQ ++NE+K N + L + ++ G++ Y +K Sbjct: 158 LQKIMNEQKFQMTIG-----DPNTAVQLGKMAGVQYIITGTVDNITTKYVEKIKDDKSLG 212 Query: 157 RYFGIGADTQYQLD----QIAVNLRVVNVSTGEILSSVNTSKTILSYEVQAGVFRFIDYQ 212 F + + + + +++++V+TGE L +K + +EV F Sbjct: 213 SLFTLASIAVNTQAGWNVNVEMTIKLLDVATGEQL----LNKKVTGHEVAGNQPNFNPEM 268 Query: 213 RLLEGEVGYTSNEPVMLCLMSAIETGVIFLINDGIDRGL 251 + + A+E RG Sbjct: 269 SI----------TAAKKAMGEAVEDLRPDFSQRFAQRGY 297 >UniRef50_Q3B6Q5 Periplasmic protein n=2 Tax=Chlorobium/Pelodictyon group RepID=Q3B6Q5_PELLD Length = 435 Score = 61.4 bits (147), Expect = 3e-08, Method: Composition-based stats. Identities = 24/147 (16%), Positives = 52/147 (35%), Gaps = 8/147 (5%) Query: 45 PTGKIFVSVYNIQDETGQFKPYPASNFSTAVPQSATAMLVTALKDSRWFIPLERQGLQNL 104 + + ++V + + V +VT L SR F L+R + Sbjct: 135 ASNRRRIAVMPFRTAGIPMLLDGQRVPAEEVSAELVQQVVTELTQSRKFTVLDRDYMDAY 194 Query: 105 LNERKIIRAAQENGTVAINNRIPLQSLTAANIMVEGSIIGYESNVKSGGVGARYFGIGAD 164 L+E+ ++ + + + + + + M+ GSI S G Sbjct: 195 LSEKSLL----LSPDGEESEMMKMGRVLGVDYMLVGSI----SGGVERRAEDVLALTGER 246 Query: 165 TQYQLDQIAVNLRVVNVSTGEILSSVN 191 Q+ + V+ R++ + T E+ S + Sbjct: 247 VQHGAASLNVDYRIIVMPTREVKWSGS 273 >UniRef50_A8ZSW9 Tetratricopeptide TPR_2 repeat protein n=1 Tax=Desulfococcus oleovorans Hxd3 RepID=A8ZSW9_DESOH Length = 212 Score = 59.8 bits (143), Expect = 9e-08, Method: Composition-based stats. 
Identities = 22/158 (13%), Positives = 59/158 (37%), Gaps = 33/158 (20%) Query: 43 PAPTGKIFVSVYNIQDETGQFKPYPASNFSTAVPQSATAMLVTALKDSRW-FIPLERQGL 101 +P G +++ ++ + + + ML+T L +S F +ER+ + Sbjct: 12 ASPDGDRTLAILPFENNS-----VTTPETYDPLKSGLSVMLMTELANSEAAFTLVEREKI 66 Query: 102 QNLLNERKIIRAAQENGTVAINNRIPLQSLTAANIMVEGSIIGYESNVKSGGVGARYFGI 161 + LL+E + + G + + + + + A + G+ + N Sbjct: 67 RALLDEITL----GQTGVIDASTAVKMGKMLGAQAIGFGAFMVMGKN------------- 109 Query: 162 GADTQYQLDQIAVNLRVVNVSTGEILSSVNTSKTILSY 199 + +++R+V V TG ++ + + + + Sbjct: 110 ----------VRIDMRMVEVETGALIMAESITGKTDDF 137 >UniRef50_B6IWB1 Putative uncharacterized protein n=2 Tax=Rhodospirillum centenum RepID=B6IWB1_RHOCS Length = 346 Score = 59.8 bits (143), Expect = 1e-07, Method: Composition-based stats. Identities = 36/256 (14%), Positives = 84/256 (32%), Gaps = 20/256 (7%) Query: 4 LFLLVAVMLLSGCL---TAPPKEAARPTLMPRAQSYKDLTHLPAPTGKIFVSVYNIQDET 60 + L +A +LS C+ T ++ T A ++ A + ++ ++++ Sbjct: 8 VALALAGSMLSACVSTGTTMGGGSSIATGSAGAAGTQNANEQLARCDRPLGTIALVENQ- 66 Query: 61 GQFKPYPASNFSTAVPQSATAMLVTALKDSRWFIPLERQGLQNLLNERKIIRAAQENGTV 120 P + A + ++ + F + R + + + A G++ Sbjct: 67 ----PKSPILVMMGLTSPIPA-VRLVMQQTGCFRVVARGDDFERIQQER---ALAAGGSL 118 Query: 121 AINNRIPLQSLTAANIMVEGSIIGYESNVKSGGVGARYF------GIGADTQYQLDQIAV 174 + + L AA+ +V+ ++ N G F + + + V Sbjct: 119 QAGSNLGGGQLAAADFIVDVHVLSQNENSGGNAAGLGAFVPGIAGAVLGGLRTKESTANV 178 Query: 175 NLRVVNVSTGEILSSVNTSKTILSYEVQAGVFRFIDYQRLLEGEVGYTSNEPVMLCLMSA 234 L V +V TG + + S ++ + G +G N + +++A Sbjct: 179 MLTVTDVRTGVQEAVAT--GSAESRDIGWAFGAGGFGALPIAGGLGGYENTEMGKTMIAA 236 Query: 235 IETGVIFLINDGIDRG 250 I V L+ Sbjct: 237 IMDAVNKLVPQFRAMP 252 >UniRef50_Q1LBB8 Putative uncharacterized protein n=1 Tax=Cupriavidus metallidurans CH34 RepID=Q1LBB8_RALME Length = 447 Score = 59.4 bits (142), Expect = 1e-07, Method: Composition-based stats. 
Identities = 32/155 (20%), Positives = 61/155 (39%), Gaps = 20/155 (12%) Query: 43 PAPTGKIFVSVYNIQDETGQFKPYPASNFSTAVPQSATAMLVTALKDSRWFIPLERQGLQ 102 PA TGK+ +++ ++ + N + + + AL S F LER Sbjct: 160 PADTGKLRIAIAPLR------IGHVGGNNAERIAAELRQRITDALTQSGRFTVLERGDAP 213 Query: 103 NLLNERKIIRAAQENGTVAINNRIPLQSLTAANIMVEGSIIGYESNVKSGGVGARYFGIG 162 L E + I +G V+ + + A+++ GS+ +E+ + Sbjct: 214 ELYGEIERI----ASGEVSNDQFSKIGQGLGADLIWFGSVNAFETGRPTD---------- 259 Query: 163 ADTQYQLDQIAVNLRVVNVSTGEILSSVNTSKTIL 197 AD + + +V+ + VNV+T E+L S Sbjct: 260 ADQTGRAGRWSVSQKFVNVTTREVLFSNTVDGGAA 294 >UniRef50_C8PSK0 Putative uncharacterized protein n=1 Tax=Treponema vincentii ATCC 35580 RepID=C8PSK0_9SPIO Length = 275 Score = 59.1 bits (141), Expect = 2e-07, Method: Composition-based stats. Identities = 24/142 (16%), Positives = 53/142 (37%), Gaps = 36/142 (25%) Query: 52 SVYNIQDETGQFKPYPASNFSTAVPQSATAMLVTALKDSRWFIPLERQGLQNLLNERKII 111 +V++ +F+ V T +L+ L + +ER+ + + +E Sbjct: 26 AVFDFD--------CEDPDFADKVGM-MTDLLIHELVKASGVTVVERKNIDKVFSE---- 72 Query: 112 RAAQENGTVAINNRIPLQSLTAANIMVEGSIIGYESNVKSGGVGARYFGIGADTQYQLDQ 171 + Q N + + + L A+ ++ GSI G+G + Sbjct: 73 YSFQANPYIDLKSAKKLGKGLGADCIIVGSI---------AGLGCPLY------------ 111 Query: 172 IAVNLRVVNVSTGEILSSVNTS 193 V R+++V +G++L S + Sbjct: 112 --VTARMIDVESGKVLHSAKMT 131 >UniRef50_B9M4B1 Membrane lipoprotein lipid attachment site n=2 Tax=Geobacter RepID=B9M4B1_GEOSF Length = 202 Score = 58.7 bits (140), Expect = 2e-07, Method: Composition-based stats. 
Identities = 31/164 (18%), Positives = 55/164 (33%), Gaps = 31/164 (18%) Query: 4 LFLLVAVMLLSGCLTAPPKEAARPTLMPRAQS--YKDLTHLPA----------------P 45 +FL+ A L SGC T + + + DL + A Sbjct: 11 IFLIAAATLFSGCATPSVEYGDPLSQQALSTDFGSADLQQIAATMVDSLITFPPVAEITA 70 Query: 46 TGKIFVSVYNIQDETGQFKPYPASNFSTAVPQSATAMLVTALKDSRWFIPLERQGLQNLL 105 T + +SV I++++ Q ++ T + L S F ++R +L Sbjct: 71 TRRPVISVDRIKNKSMQHID----------MEAVTDSIRARLIKSGKFTFVDRTTEAAVL 120 Query: 106 NERKIIRAAQENGTVAINNRIPLQSLTAANIMVEGSIIGYESNV 149 E K Q +G V + + A ++ G+ E V Sbjct: 121 EELK---YQQNSGMVDQEKAVEMGKQYGAEYILSGNFAEIEHKV 161 >UniRef50_A6WLI2 Putative uncharacterized protein n=1 Tax=Shewanella baltica OS185 RepID=A6WLI2_SHEB8 Length = 423 Score = 58.7 bits (140), Expect = 2e-07, Method: Composition-based stats. Identities = 28/158 (17%), Positives = 61/158 (38%), Gaps = 8/158 (5%) Query: 48 KIFVSVYNIQDETGQFKPYPASNFSTAVPQSATAMLVTALKDSRWFIPLERQGLQNLLNE 107 + ++V + F S + V + ++ ++ L +R F L+R ++ +E Sbjct: 131 RQRIAVLPFKTTKQAFSLAGQSTKAETVANTLSSSVIEKLVTTRKFAILDRDFDESTKSE 190 Query: 108 RKIIRAAQENGTVAINNRIPLQSLTAANIMVEGSIIGYESNVKSGGVGARYFGIGADTQY 167 I G VA L A+ ++ G+I ++ + + Sbjct: 191 LARIST----GDVAPAEFSRLTQGLIADYVLVGAIDALNFDLYERTMRTSDKKFVSG--- 243 Query: 168 QLDQIAVNLRVVNVSTGEILSSVNTSKTILSYEVQAGV 205 +I V+ R++ V+T +I+ S + S + + GV Sbjct: 244 -QGRIKVSFRLIEVATSQIVFSGSASSELTDKNLANGV 280 >UniRef50_C6BZA8 Putative lipoprotein n=1 Tax=Desulfovibrio salexigens DSM 2638 RepID=C6BZA8_DESAD Length = 186 Score = 58.3 bits (139), Expect = 3e-07, Method: Composition-based stats. 
Identities = 33/197 (16%), Positives = 67/197 (34%), Gaps = 45/197 (22%) Query: 1 MQRLFL--LVAVMLLSGCLTAPPKEAARPTLMPRAQSYKDLTHLPAPTGKIFVSVYNIQD 58 M+R F L+ V+ L+GC K+ +P + V+V + + Sbjct: 1 MKRSFFTTLIFVLFLAGCSGTYMKDYVQPNGVASEA--------------RHVAVLPLVN 46 Query: 59 ETGQFKPYPASNFSTAVPQSATAMLVTALKDSRWFIPLERQGLQNLLNERKIIRAAQENG 118 T + + +L T + S F +E + + Sbjct: 47 LTN----------TPNAGRMVGDLLTTEIYASTKFDLMESTEMFKRIK----GDEDDLEF 92 Query: 119 TVAINNRIPLQSLTAANIMVEGSIIGYESNVKSGGVGARYFGIGADTQYQLDQIAVNLRV 178 + L S + ++ GS+ Y+ Q + +NLR+ Sbjct: 93 VMEDVVAQKLGSKLGVDTVIYGSVTEYQYK---------------RGVNQSPSVGINLRM 137 Query: 179 VNVSTGEILSSVNTSKT 195 ++VS+G +L + ++SK+ Sbjct: 138 IDVSSGNVLWASSSSKS 154 >UniRef50_A6Q5P4 Putative uncharacterized protein n=1 Tax=Nitratiruptor sp. SB155-2 RepID=A6Q5P4_NITSB Length = 408 Score = 57.5 bits (137), Expect = 5e-07, Method: Composition-based stats. Identities = 26/168 (15%), Positives = 53/168 (31%), Gaps = 16/168 (9%) Query: 29 LMPRAQSYKDLTHLPAPTGKIFVSVYNI---QDETGQFKPYPASNFSTAVPQSATAMLVT 85 + R YK P + ++V + S + QS + Sbjct: 114 VTKRTTRYKYRDAGHNPHNRRTLAVLPFEYKPTYSLHGITIDGRELSRRLTQSI----IN 169 Query: 86 ALKDSRWFIPLERQGLQNLLNERKIIRAAQENGTVAINNRIPLQSLTAANIMVEGSIIGY 145 + +R F L+RQ + E+ + + + A+ + G I+ + Sbjct: 170 KITQTRKFTILDRQNSKYYEFEKSFLLSPG----TDPVELARIGKRLGADYFIIGQILDF 225 Query: 146 ESNVKSGGVGARYFGIGADTQYQLDQIAVNLRVVNVSTGEILSSVNTS 193 ++ K G + +G D + + RV+ V +I S Sbjct: 226 GTDKKEG-----NYLLGPDETTEEAYATIAYRVLYVPRQQIKWSDTID 268 >UniRef50_B0VIM9 Putative uncharacterized protein n=1 Tax=Candidatus Cloacamonas acidaminovorans RepID=B0VIM9_9BACT Length = 423 Score = 56.0 bits (133), Expect = 1e-06, Method: Composition-based stats. 
Identities = 32/195 (16%), Positives = 65/195 (33%), Gaps = 35/195 (17%) Query: 28 TLMPRAQSYKDLTHLPAPTGKIFVSVYNIQDETGQFKPYPASNFSTAVPQSATAMLVTAL 87 L+P Q ++V D++ Y A S + + L++A Sbjct: 172 ALIPNYQDADLRYEQCRKLAIKRIAVSPFTDKSNTSGKYGA--VSDILTDHIVSRLISAA 229 Query: 88 KDSRWFIPLERQGLQNLLNERKIIRAAQENGTVAINNRIPLQSLTAANIMVEGSI----- 142 ++ + + R L+ ++ E+++ +G V + + L + AN ++ GSI Sbjct: 230 VNNEFVAIISRSQLETVMKEQQLS----ASGLVNDASSVHLGQILGANEILAGSILQISV 285 Query: 143 -------IGYESNVKSGGVGARYFGIGADTQYQLDQIAVNLR-----------------V 178 + E + Y TQ + + V R + Sbjct: 286 SPERTVSVQSEDETEVVLRTEEYTDDEGSTQEREIKGKVYFRYRKFTKTASVSISTSYSI 345 Query: 179 VNVSTGEILSSVNTS 193 ++V TG+IL Sbjct: 346 LDVETGKILLQETVE 360 >UniRef50_D1N7A7 Putative uncharacterized protein n=1 Tax=Victivallis vadensis ATCC BAA-548 RepID=D1N7A7_9BACT Length = 310 Score = 55.6 bits (132), Expect = 2e-06, Method: Composition-based stats. Identities = 34/198 (17%), Positives = 61/198 (30%), Gaps = 48/198 (24%) Query: 1 MQRLFLLVAVMLLSGCLTAPPKEAARPTLMPRAQSYKDLTHLPAPTGKIFVSVYNIQDET 60 M RL +++A + AA P P S ++V Sbjct: 1 MNRLLMILAAVF----TVHSAAVAADPAPQPETASLPV----------PTIAVLPFDSRG 46 Query: 61 GQFKPYPASNFSTAVPQSATAMLVTALKDSRWFIPLERQGLQNLLNERKIIRAAQENGTV 120 + + + +S +L L F +ER L +L E + G V Sbjct: 47 ARAQ-------DENLGKSIAELLSVELATQGDFELVERAELDKILTELHLS----ATGLV 95 Query: 121 AINNRIPLQSLTAANIMVEGSIIGYESNVKSGGVGARYFGIGADTQYQLDQIAVNLRVVN 180 ++ L LT A I++ GS+ D V +++ Sbjct: 96 DKETQLKLGQLTGAKILITGSV-----------------------FRSGDNNYVVAKLIG 132 Query: 181 VSTGEILSSVNTSKTILS 198 V TG++L + + + Sbjct: 133 VETGKVLPAAAKGGSAAT 150 >UniRef50_C6BWK5 Putative uncharacterized protein n=1 Tax=Desulfovibrio salexigens DSM 2638 RepID=C6BWK5_DESAD Length = 277 Score = 54.8 bits (130), Expect = 3e-06, Method: Composition-based stats. 
Identities = 23/156 (14%), Positives = 50/156 (32%), Gaps = 32/156 (20%) Query: 4 LFLLVAVMLLSGCLTAPPKEAA-------------RPTLMPRAQSYKDLTHLPAPTGKIF 50 L L +L GC+ P+ A + P + P K Sbjct: 12 LVLAAVSVLGLGCIPRQPRAPAYPIPPKSIYETIQEAVVTPMNLDSFRMHKRPLTKEKT- 70 Query: 51 VSVYNIQ--DETGQFKPYPASNFSTAVPQSATAMLVTALKDSRWFIPLERQGLQNLLNER 108 ++ N + D T + L+ L ++ + ++R+ + ++ E+ Sbjct: 71 FAIMNFRANDNTS--------------GSMVSDRLIIEL-KTKGYHVIDREEIDKVVREQ 115 Query: 109 KIIRAAQENGTVAINNRIPLQSLTAANIMVEGSIIG 144 ++ + G + + L A+ V GS+ Sbjct: 116 AMMSE-HKTGLTDLEIAQRIGRLVHADYFVFGSVTD 150 >UniRef50_A6LK65 Putative uncharacterized protein n=1 Tax=Thermosipho melanesiensis BI429 RepID=A6LK65_THEM4 Length = 394 Score = 54.8 bits (130), Expect = 3e-06, Method: Composition-based stats. Identities = 16/136 (11%), Positives = 42/136 (30%), Gaps = 20/136 (14%) Query: 68 ASNFSTAVPQSATAMLVTALKDSRWFIPLERQGLQNLLNERKIIRAAQENGTVAINNRIP 127 + ++ ++L F R L ++ ER + + Sbjct: 34 GNGWNMDEADYLISLLEEKALSLGRFRVYSRNDLDAIVKERNLSELGI------VEKTFE 87 Query: 128 LQSLTAANIMVEGSIIGYESNVKSGGVGARYFGIGADTQYQLDQIAVNLRVVNVSTGEIL 187 + A + ++ S + G + ++L++ ++S+GE+L Sbjct: 88 AGKILGAKYAILLTLTELTSQYEDNG--------------YTASLRLSLKLYDLSSGELL 133 Query: 188 SSVNTSKTILSYEVQA 203 ++ + E A Sbjct: 134 AAKTFDDSTYVEEETA 149 >UniRef50_Q31E50 Putative uncharacterized protein n=2 Tax=Proteobacteria RepID=Q31E50_THICR Length = 337 Score = 54.4 bits (129), Expect = 4e-06, Method: Composition-based stats. 
Identities = 28/204 (13%), Positives = 66/204 (32%), Gaps = 33/204 (16%) Query: 1 MQRLFLLVAVMLLSGCLTAPPKEAARPTLMPRAQSYKDLTHLPAPTGKIFVSVYNIQDET 60 M+ F + + +T K L P ++V + +D+T Sbjct: 5 MKGTFFALTAWAVMS-ITGCSKNVKIQVLEPAKID--------RAAQTKQIAVSDFKDDT 55 Query: 61 GQFKPYPASNFSTAVPQSATAMLVTALKDSRWFIPLERQGLQNLLNERKIIRAAQENGTV 120 + + L ++F + R + +L+E+K+ Q +G V Sbjct: 56 -----------VGLAGKIEALLAKQTLDGQKYFTTISRDEMDRILDEQKL----QYSGVV 100 Query: 121 AINNRIPLQSLTAANIMVEGSIIGYE------SNVKSGGVGAR---YFGIGADTQYQLDQ 171 + + + A + G +I ++ V + Sbjct: 101 NDSKIVEAGEILGAQAFISGEVISASVKDNRHYEKRTKCVDDNCKTTRAYLVSCLSRTID 160 Query: 172 IAVNLRVVNVSTGEILSSVNTSKT 195 ++ N+++ +VS +I+ + SK+ Sbjct: 161 LSANIKMTDVSKADIIYADAYSKS 184 >UniRef50_B2KEV3 Putative uncharacterized protein n=1 Tax=Elusimicrobium minutum Pei191 RepID=B2KEV3_ELUMP Length = 228 Score = 54.4 bits (129), Expect = 4e-06, Method: Composition-based stats. Identities = 35/235 (14%), Positives = 69/235 (29%), Gaps = 71/235 (30%) Query: 1 MQRLFL--LVAVMLLSGCLTAPPKEAARPTLMPRAQSYKDLTHLPAPTGKIFVSVYNIQD 58 M+++ +V+ +LL GC T +Q Y+ ++ Sbjct: 1 MKKILFSGIVSCLLLFGCATK----------TVISQD------------------YDFKN 32 Query: 59 ETGQFKPYPASNFSTAVPQSATAMLVTALKDSRWFIPLERQGLQNLLNERKIIRAAQENG 118 S + + A + L + F +ER ++ +L E + G Sbjct: 33 VKRIGIMAFDSPWQSF--TGAENLFAKYLLE-NGFTIIERAKIEQVLQEHNLSIT----G 85 Query: 119 TVAINNRIPLQSLTAANIMVEGSIIGYESNVKS------------------------GGV 154 ++ L + ++++ G I Y K+ G V Sbjct: 86 YLSPETTKMLGKILGVDLLLMGEITSYTPEKKTLTMVETRNYRTEPVFSTQMVKKPDGSV 145 Query: 155 GARYFGIGADTQYQL----------DQIAVNLRVVNVSTGEILSSVNTSKTILSY 199 A G Q Q+ V ++V+V T E++ + S Sbjct: 146 VATSRPSGQRVTNQREVTPTEYTISAQVGVIAKLVDVETAEVVWIGTDTAQDYSS 200 >UniRef50_Q4UYT5 Putative uncharacterized protein n=6 Tax=Xanthomonas RepID=Q4UYT5_XANC8 Length = 321 Score = 53.7 bits (127), Expect = 7e-06, Method: Composition-based stats. 
Identities = 36/255 (14%), Positives = 76/255 (29%), Gaps = 21/255 (8%) Query: 1 MQRLFLLVAVMLLSGCLTAPPKEAARPTLMPRAQSYKDLTHLPAPTGKIFVSVYNIQDET 60 M L + + ++AP + AQ + P + ++ Sbjct: 1 MSAAVLSSVLGIALASVSAPASAGLKDAFKGSAQDQRKDAVAQIPVCAKPLGSLSV---- 56 Query: 61 GQFKPYPASNFSTAVPQSATAMLVTALKD-SRWFIPLER-QGLQNLLNERKIIRAAQENG 118 +P A N+ + A + L+ + SR F ++R G+ ER + NG Sbjct: 57 --IEPEDAVNWWSGQQLPAPSKLIKVFVNRSRCFTLVDRGAGMAASQRERDMA----ANG 110 Query: 119 TVAINNRIPLQSLTAANIMVEGSIIGYESNVK--------SGGVGARYFGIGADTQYQLD 170 + + + + AA+ ++ +I N G VG + Sbjct: 111 DLRARSNMGKGQIRAADYVMTPDLISQNRNAGGSAIAGMLGGLVGGNAGNLVGGLNLSKK 170 Query: 171 QIAVNLRVVNVSTGEILSSVNTSKTILSYEVQAGVFRFIDYQRLLEGEVGYTSNEPVMLC 230 V L + +V + E ++ + A G +N + Sbjct: 171 TADVVLTITDVRSSEQVAMAEGNAKKTDLGWGAR-GNVFGGSDYGAAGAGGYANTEIGQV 229 Query: 231 LMSAIETGVIFLIND 245 + A +++ Sbjct: 230 ITLAYLQAYTDIVSQ 244 >UniRef50_A9BUH8 Putative uncharacterized protein n=1 Tax=Delftia acidovorans SPH-1 RepID=A9BUH8_DELAS Length = 484 Score = 53.7 bits (127), Expect = 8e-06, Method: Composition-based stats. Identities = 27/157 (17%), Positives = 58/157 (36%), Gaps = 8/157 (5%) Query: 44 APTGKIFVSVYNIQDETGQFKPYPASNFSTAVPQSATAMLVTALKDSRWFIPLERQGLQN 103 A KI V V ++ E S + V + + AL + F L+R+ Sbjct: 154 AEMQKIKVVVGPVRFEQASLPMGDRSVSAAEVGATLRQRISDALVQTGRFAVLDREFSPE 213 Query: 104 LLNERKIIRAAQENGTVAINNRIPLQSLTAANIMVEGSIIGYESNVKSGGVGARYFGIGA 163 + E II G L +A+++ + + N + + + + Sbjct: 214 IEQELAII----ATGQAPSAELAKLSQAASADLVWSARVSSFAYNRHARQLKTSDRQLVS 269 Query: 164 DTQYQLDQIAVNLRVVNVSTGEILSSVNTSKTILSYE 200 + A++ ++VNV+T ++ +S + ++ S Sbjct: 270 YSGGW----ALSQKLVNVATRQVTASDSLRGSVPSTA 302 >UniRef50_D1N8P2 Putative uncharacterized protein n=1 Tax=Victivallis vadensis ATCC BAA-548 RepID=D1N8P2_9BACT Length = 439 Score = 53.3 bits (126), Expect = 9e-06, Method: Composition-based stats. 
Identities = 28/178 (15%), Positives = 58/178 (32%), Gaps = 17/178 (9%) Query: 65 PYPASNFSTAVPQSATAMLVTALKDSRWFIPLERQGLQNLLNERKIIRAAQENGTVAINN 124 A + A L L S F L R L + E ++ A Q Sbjct: 172 VQAAPGIGREDGSAFGAKLNEFLLKSGSFELLNRDALNLVAMESALVDAEQAA----PGQ 227 Query: 125 RIPLQSLTAANIMVEGSIIGYESNVKSGGVGARYFGIGADTQYQLDQIAVNLRVVNVSTG 184 + + + + ++ + E N S G I + ++ + LR+++V TG Sbjct: 228 YVKVGQIAVGDWLIAVKLKRLEVNRLSSG-----TAIAGVSTREVATLEAELRIIDVKTG 282 Query: 185 EILSSVNTSKTILSYEVQAGVFRFIDYQRLLEGEVGYTSNEPVMLCLMSAIETGVIFL 242 +++ N + + ++ A + R + N+ + A + L Sbjct: 283 ALVAIENVTCRRKTTDIPASIRR--------DWTAADYRNDLMEQAAEQAGRKLLERL 332 >UniRef50_B9XL67 Putative uncharacterized protein n=1 Tax=bacterium Ellin514 RepID=B9XL67_9BACT Length = 1037 Score = 52.9 bits (125), Expect = 1e-05, Method: Composition-based stats. Identities = 36/198 (18%), Positives = 75/198 (37%), Gaps = 35/198 (17%) Query: 49 IFVSVYNIQDETGQFKPYPASNFSTAVPQSATAMLVTALKDSRWFIPLERQGLQNLLNER 108 + + + P + S+A+ +L L + + +ERQ + ++ E Sbjct: 32 VRLGIGRF-----NLGVEPNAEQSSAL---LADLLTARLSEVKEVELVERQAMDRIVKEM 83 Query: 109 KIIRAAQENGTVAINNRIPLQSLTAANIMVEGSIIGYESNVKSGGVGARYFGIGADTQYQ 168 + + + + + + L A ++ GS KSGG+ + I Q Sbjct: 84 SLS----LSQPMKASEAVQVGKLAGAEWLLTGS------RQKSGGIDSLVIKILDAQTGQ 133 Query: 169 LDQIAVNLRVV-----NV-STG-EILSSVNTSKTILSYEVQAGVFRFIDYQRL-LEGEVG 220 + +L ++ +V TG I+ V S I S+ Q VF +++ L ++ G Sbjct: 134 IR----DLALIPVQNNDVNQTGDRIVDFVKNSTRIYSHHEQRIVFGIGNFENLSVDDRFG 189 Query: 221 YTSNEPVMLCLMSAIETG 238 N L +++E+ Sbjct: 190 SFGN-----ALRTSLESS 202 >UniRef50_C1AEH5 Putative uncharacterized protein n=1 Tax=Gemmatimonas aurantiaca T-27 RepID=C1AEH5_GEMAT Length = 341 Score = 52.9 bits (125), Expect = 1e-05, Method: Composition-based stats. 
Identities = 32/176 (18%), Positives = 55/176 (31%), Gaps = 20/176 (11%) Query: 3 RLFLLVAVMLLSGCLTAPPKEAARPTLMPRAQSYKDLTHLPAPTGKIFVSVYNIQDETGQ 62 + ++V LL+ C P RA + P + V V Sbjct: 34 PVASTLSVWLLAACGGGPTPAPRPAAADNRALAS---EQARGPGAQGTVGVPPF------ 84 Query: 63 FKPYPASNFSTAVPQSATAMLVTALKDSRWFIPLERQGLQNLLNERKIIRAAQENGTVAI 122 + T + + ++ T L S +ER L +L E + + G V Sbjct: 85 -AARGSDTTLTPLAFALAELVSTDLSRSGKVRVVERARLGEVLRELDLA----QTGRVDS 139 Query: 123 NNRIPLQSLTAANIMVEGSIIGYESNVKSGGVGARYFGIGADTQYQLDQIAVNLRV 178 + L +A +V GS+ ++N G D + AV+ R Sbjct: 140 ATAPRVGRLVSAERLVFGSVEPVDANTLRLGARI------GDVERSTVSNAVDARA 189 >UniRef50_C0QAM6 Putative uncharacterized protein n=1 Tax=Desulfobacterium autotrophicum HRM2 RepID=C0QAM6_DESAH Length = 204 Score = 52.9 bits (125), Expect = 1e-05, Method: Composition-based stats. Identities = 22/152 (14%), Positives = 52/152 (34%), Gaps = 23/152 (15%) Query: 42 LPAPTGKIFVSVYNIQDETGQFKPYPASNFSTAVPQSATAMLVTALKDSRWFIPLERQGL 101 P K + VY + +ET + T + L S F + Sbjct: 64 FPVQASKPILIVYPVVNETSEHIS----------TGGITDEIRMKLIQSGKFRFINETQR 113 Query: 102 QNLLNERKIIRAAQENGTVAINNRIPLQSLTAANIMVEGSIIGYESNVKSGGVGARYFGI 161 +N+ E + Q G V + R+ A+ ++ G++ + + Sbjct: 114 KNIQKETR----YQSRGYVDPSMRVDQGRQLGADYILSGTLRSIKKDQPRQWR------- 162 Query: 162 GADTQYQLDQIAVNLRVVNVSTGEILSSVNTS 193 + + ++++ + +++TGEI+ + Sbjct: 163 --LNKSERIYYSLDMTLTDLTTGEIVYADQAE 192 >UniRef50_C6BSE2 Putative lipoprotein n=1 Tax=Desulfovibrio salexigens DSM 2638 RepID=C6BSE2_DESAD Length = 218 Score = 52.5 bits (124), Expect = 1e-05, Method: Composition-based stats. 
Identities = 39/253 (15%), Positives = 79/253 (31%), Gaps = 48/253 (18%) Query: 1 MQRLFLLVAVM-----LLSGCLTAPPKEAARPTLMPRAQSYKDLTHLPAPTGKIFVSVYN 55 M+RLF L+ + LSGC + D+ LP P+GK ++V Sbjct: 1 MKRLFPLLILCVFALGALSGCASK-----------------SDIVKLPRPSGK--IAVAG 41 Query: 56 IQD------ETGQFKPYPASNFSTAVPQSATAMLVTALKDSRWFIPLERQGLQNLLNERK 109 + + P+ + Q A +V L R + E + Sbjct: 42 FTNPVFNWELLAGYLPHEGKPIKKDILQELDAKMVGVLSKHG-VTAFARPAITRQCQEIE 100 Query: 110 IIRAAQENGTVAINNRIPLQSLTAANIMVEGSIIGYESNVKSGGVGARYFGIGADTQYQL 169 + A + + +A+ ++ I+ ++ Q Sbjct: 101 VFENLGGRREAAFAYWVKVGQCMSADYILVPQILFWQDLRGMQKADFNI---------QP 151 Query: 170 DQIAVNLRVVNVSTGEILSSVNTSKTILSYEVQAGVFRFIDYQRLLEGEVGYTSNEPVML 229 + ++L +++V+ I+ + E Q + + G + L Sbjct: 152 ASVIIDLYLIDVNNRRIIRRFHYD------ETQQPLTENMLEAGTFFKRGGKWV-TAMQL 204 Query: 230 CLMSAIETGVIFL 242 A+ETG++ L Sbjct: 205 A-DEALETGLMEL 216 >UniRef50_C9KNZ7 Putative curli production assembly/transport component CsgG n=1 Tax=Mitsuokella multacida DSM 20544 RepID=C9KNZ7_9FIRM Length = 214 Score = 52.5 bits (124), Expect = 1e-05, Method: Composition-based stats. 
Identities = 31/214 (14%), Positives = 61/214 (28%), Gaps = 31/214 (14%) Query: 39 LTHLPAPTGKIFVSVYNIQDETGQFKPYPASNFST--AVPQSATAMLVTALKDSRWFIPL 96 + + A + V++ + + A L L + F Sbjct: 26 FSSITAEAAQPRVAIVGFTSRIHKTDLTLTDLKEPLPELLSVARDQLTAELAGEQKFALY 85 Query: 97 E--RQGLQNLLNERKIIRAAQENGTVAINNRIPLQSLTAANIMVEGSIIGYESNVKSGGV 154 + Q + +E + A E GT+A A+ V G + G V Sbjct: 86 DFGEQTTKARCDEAAFVDALGE-GTIAPELAGK------ADYYVFGYLTTL------GKV 132 Query: 155 GARYFGIGADTQYQLDQIAVNLRVVNVSTGEILSSVNTSKTILSYEVQAGVFRFIDYQRL 214 A+ +G + + ++LRV++ TG I+ + L Sbjct: 133 KAQSGALGLSGRDKTVYAELSLRVMDAHTGAIVFVTKADSRRKA--------------EL 178 Query: 215 LEGEVGYTSNEPVMLCLMSAIETGVIFLINDGID 248 + + + +A+E I L Sbjct: 179 SYNAIWQRHDSGEEDAIRAALEDAAINLAAQFKQ 212 >UniRef50_UPI00016C46F9 putative serine/threonine-protein kinase n=1 Tax=Gemmata obscuriglobus UQM 2246 RepID=UPI00016C46F9 Length = 768 Score = 52.1 bits (123), Expect = 2e-05, Method: Composition-based stats. Identities = 24/140 (17%), Positives = 51/140 (36%), Gaps = 11/140 (7%) Query: 7 LVAVMLLSGCLTAPPKEAARPTLMPRAQSYKDLTHLPAPTGKIFVSVYNIQDETGQFKPY 66 VA++L +G P P P Q+ P +V + Sbjct: 427 FVALLLAAGTRQWLPSATVAPAEPPAQQT-----EPAKPPDPKRAAVRAVP--VALLGFD 479 Query: 67 PASNFSTAVPQSATAMLVTALKDSRWFIPLERQGLQNLLNERKIIRAAQENGTVAINNRI 126 ++ + + + +L L F ++R L+ L E+++ +G V+ + Sbjct: 480 ERGGAASGLGGAVSDLLFAKLVAKPGFHLVDRTDLKKTLEEQRLS----LSGAVSPTEAV 535 Query: 127 PLQSLTAANIMVEGSIIGYE 146 + L A ++V GS++ + Sbjct: 536 RVGQLIGARLIVTGSVVRAD 555 >UniRef50_Q1PV59 Putative uncharacterized protein n=1 Tax=Candidatus Kuenenia stuttgartiensis RepID=Q1PV59_9BACT Length = 320 Score = 51.7 bits (122), Expect = 2e-05, Method: Composition-based stats. 
Identities = 24/194 (12%), Positives = 72/194 (37%), Gaps = 25/194 (12%) Query: 1 MQRLFLLVAVMLLSGCLTAPPKEAARPTLMPRAQSYKDLTHLPAPTGKIFVSVYNIQDET 60 M++ ++ + L GC A + +++ + + + + ++ + + Sbjct: 21 MKKPTMMKILFL--GCALAFLM---------QTRAFAERETMGIASVRPTQAIIAGANRS 69 Query: 61 GQFKPYPASNFSTAVPQSATAMLVTALKDSRWFIPLERQGLQNLLNERKIIRAAQENGTV 120 G + V QS L + +R F + R L +++ E++ + + Sbjct: 70 G------SRLSLDRVMQSMNGQLADRINATRKFQIVARNDLDDIVKEQEFAHSGNVS--A 121 Query: 121 AINNRIPLQSLTAANIMVEGSIIGY-ESNVKSGGVGARYFGIGADTQYQLDQIAVNLRVV 179 + ++ ++ +I + + N A + G ++ +++ ++ Sbjct: 122 DDSTAAKQFKISGIKYLLVATIDDFQDYNEL-----ATFKKTGRTATKRVIRLSCIGKIY 176 Query: 180 NVSTGEILSSVNTS 193 + +TG++L S N Sbjct: 177 DTTTGKLLESANFQ 190 >UniRef50_Q2LUS2 Tetratricopeptide repeat domain containing protein n=1 Tax=Syntrophus aciditrophicus SB RepID=Q2LUS2_SYNAS Length = 663 Score = 51.7 bits (122), Expect = 3e-05, Method: Composition-based stats. Identities = 20/125 (16%), Positives = 45/125 (36%), Gaps = 16/125 (12%) Query: 23 EAARPTLMPRAQSYKDLTHLPAPTGKIFVSVYNIQDETGQFKPYPASNFSTAVPQSATAM 82 E+ P+ Q D+ K ++V++ + + A Sbjct: 351 ESVNPSHPELFQKLLDVKDHINKRIKKSIAVFDF----------GSPANDRDAGKIAANK 400 Query: 83 LVTAL--KDSRWFIPLERQGLQNLLNERKIIRAAQENGTVAINNRIPLQSLTAANIMVEG 140 L+ L S +ER+ LQ++L E ++ + G V + + + + + + G Sbjct: 401 LIAYLHRNASGDLRIIERENLQSILREMQL----GQTGIVDMKSAQNVGKMRGIDTFIMG 456 Query: 141 SIIGY 145 ++ Y Sbjct: 457 DVLHY 461 >UniRef50_Q6AQQ4 Probable periplasmic protein n=1 Tax=Desulfotalea psychrophila RepID=Q6AQQ4_DESPS Length = 398 Score = 51.7 bits (122), Expect = 3e-05, Method: Composition-based stats. 
Identities = 22/175 (12%), Positives = 56/175 (32%), Gaps = 8/175 (4%) Query: 40 THLPAPTGKIFVSVYNIQDETGQFKPYPASNFSTAVPQSATAMLVTALKDSRWFIPLERQ 99 + + ++V +F + + V + L+ SR L+RQ Sbjct: 128 EQGYSADNRRKLAVVPFATGKSRFMLLGDATPAAKVEEEFRNRLIDLFTQSRRLSILDRQ 187 Query: 100 GLQNLLNERKIIRAAQENGTVAINNRIPLQSLTAANIMVEGSIIGYESNVKSGGVGARYF 159 + E+ + + L ++ + +V G+I ++ S + Sbjct: 188 YGEAFETEKDLWLSDDAASGET----ARLGNVRGVDYLVVGTI----RSIWSKRYVEKIQ 239 Query: 160 GIGADTQYQLDQIAVNLRVVNVSTGEILSSVNTSKTILSYEVQAGVFRFIDYQRL 214 G + V+ +++ +T ++ S + ++ + RF Q Sbjct: 240 LTGETISTYAGKAQVDYKIIQAATRQVKWSDTITVKFSDRNIRRMLSRFGSSQAG 294 >UniRef50_Q2LUR8 Fibronectin type III domain containing protein n=1 Tax=Syntrophus aciditrophicus SB RepID=Q2LUR8_SYNAS Length = 927 Score = 51.7 bits (122), Expect = 3e-05, Method: Composition-based stats. Identities = 32/186 (17%), Positives = 60/186 (32%), Gaps = 29/186 (15%) Query: 4 LFLLVAVMLLSGCLTAPPKEAARPTLMPRAQSYKDLTHLPAPTGKIFVSVYNIQDETGQF 63 FLL +++L GCL A P + Q + V+V ++ + Sbjct: 12 AFLLSVLIILQGCLAATKNTVISPQIRQLFQGTYVVDPYMEKHTPRTVAVLPFRNSSNSQ 71 Query: 64 KPYPASNFSTAVPQSATAMLVTALKDSRWFIPLERQGLQNLLNERKIIRAAQENGTVAIN 123 S V + S F +E + +LL + + G + Sbjct: 72 AG------SNEVRKG----FYNHFS-SLPFKDMELHRVDDLL----LKAGLTDPGVIRNT 116 Query: 124 NRIPLQSLTAANIMVEGSIIGYESNVKSGGVGARYFGIGADTQYQLDQIAVNLRVVNVST 183 + + L + + +V G I ++ R F + Y + +R+ + T Sbjct: 117 SPVKLGEILGVDAVVFGEISNFD----------RLFALV----YSQVSVGAEIRMYDAKT 162 Query: 184 GEILSS 189 G L S Sbjct: 163 GHFLWS 168 >UniRef50_Q8EXG4 Putative uncharacterized protein n=2 Tax=Leptospira interrogans RepID=Q8EXG4_LEPIN Length = 485 Score = 51.7 bits (122), Expect = 3e-05, Method: Composition-based stats. 
Identities = 27/192 (14%), Positives = 61/192 (31%), Gaps = 45/192 (23%) Query: 5 FLLVAVMLLSGCLTAPPKEAARPTLMPRAQSYKDLTHLPAPTGKIFVSVYNI---QDETG 61 F + L C+ T + + + + V + + +T Sbjct: 6 FFKFTYLFLIVCVIQC-------TTTGSSPIESVILNFNKNKKSYKIGVIDFIHSEKQTN 58 Query: 62 QFKPYPASNFSTAVPQSATAMLVTALKDSRWFIPLERQGLQNLLNERKIIRAAQENGTVA 121 ++ + + + L I +ER L LL E + + +G + Sbjct: 59 KYNSM------------ISDLFIVELSKDSSNILVERTKLAELLTEHSL----EYSGLLD 102 Query: 122 INNRIPLQSLTAANIMVEGSIIGYESNVKSGGVGARYFGIGADTQYQLDQIAVNLRVVNV 181 + L + ++++ GS ++I ++ R ++V Sbjct: 103 SDQARKLGKIIPIDLILTGS-------------------YSIQKIQTQEEIKISGRFIHV 143 Query: 182 STGEILSSVNTS 193 TGEI+ + NT+ Sbjct: 144 VTGEIVYAFNTT 155 >UniRef50_C9KTB5 Putative uncharacterized protein n=3 Tax=Bacteroidales RepID=C9KTB5_9BACE Length = 278 Score = 50.6 bits (119), Expect = 5e-05, Method: Composition-based stats. Identities = 23/150 (15%), Positives = 48/150 (32%), Gaps = 22/150 (14%) Query: 48 KIFVSVYNIQDETGQFKPYPASNFSTAVPQSATAMLVTALKDSRWFIPLERQGLQNLLNE 107 + V V + S + V + ML S+ F ++R + + E Sbjct: 26 RQVVGVAEFSCK-------ENSPYIGLVTEKVVEMLTN----SKRFRVVDRTSREKITQE 74 Query: 108 RKIIRAAQENGTVAINNRIPLQSLTAANIMVEGSIIGYESNVKSGGVGARYFGIGADTQY 167 ++ ++ + N + A M+ G II G G + Sbjct: 75 LELQKS---EAFIDSENLVEQDVAVGAEKMITGEIIKIPVYRMKNGDG--------TVRG 123 Query: 168 QLDQIAVNLRVVNVSTGEILSSVNTSKTIL 197 +A +++V+V+TG + + Sbjct: 124 YKASVAFQMKIVDVATGLSTEATSFEGKAS 153 >UniRef50_B3E5P1 Putative uncharacterized protein n=1 Tax=Geobacter lovleyi SZ RepID=B3E5P1_GEOLS Length = 340 Score = 50.6 bits (119), Expect = 6e-05, Method: Composition-based stats. 
Identities = 34/263 (12%), Positives = 70/263 (26%), Gaps = 37/263 (14%) Query: 1 MQRLFLLV----AVMLLSGCLTAPPKEAARPTLMPRAQSYKDLTHLPAPTGKIFVSV--- 53 M+ + + LSGC + + + + R + + V++ Sbjct: 3 MKSIIGCLLGLGIASGLSGCAHTVEVGSFKESELDRKVAETLPPQHVIDKKQPKVAILPL 62 Query: 54 YNIQDETGQFKPYPASNFSTAVPQSATAMLVTALKDSRWFIPLERQGLQNLLNERKIIRA 113 + G+ P + + +ER Q + +E+K + + Sbjct: 63 GEPPEYAGRLSPGAQEGITQIAAKGCGM------------EVVERSQAQRIFDEKKFVWS 110 Query: 114 AQENGTVAINNRIPLQSLTAANIMVEGSIIGYESNVK---SGGVGARYFGIGAD--TQYQ 168 + + + + + +V GSI + K S + Sbjct: 111 LDLSADFSEIRSM----VNGIDYIVLGSITNPATGAKFTPSQTSCDSKGKCSTSKPSCTV 166 Query: 169 LDQIAVNLRVVNVSTGEILSSVNTSKTILSYEVQAGVFRFIDYQRLLEGEVGYTSNEPVM 228 VN+RV+ STG + + V VG Sbjct: 167 KGSATVNIRVIQASTGSVAQAF--------EPFTGSVSNSFAVTAQYACRVGNPVAVLTQ 218 Query: 229 LCLMSAIETGVIFLINDGIDRGL 251 + +A+ LI G Sbjct: 219 -AVANAVSKAKRPLIEAFPRYGY 240 >UniRef50_Q2BZ14 Putative uncharacterized protein n=2 Tax=Photobacterium RepID=Q2BZ14_9GAMM Length = 196 Score = 50.6 bits (119), Expect = 6e-05, Method: Composition-based stats. Identities = 34/206 (16%), Positives = 69/206 (33%), Gaps = 48/206 (23%) Query: 1 MQR--LFLLVAVMLLSGCL-------TAPPKEAAR-----------PTLMPRAQSYKDLT 40 M++ L LL A LL GC T P+ + L + S ++ Sbjct: 1 MKKSVLVLLTASSLLGGCAQTVDYVNTHLPETPSVNLGSNDLDSTASDLTTKMLSSPAVS 60 Query: 41 HLPAPTGKIFVSVYNIQDETGQFKPYPASNFSTAVPQSATAMLVTALKDSRWFIPLERQG 100 + A V V NIQ+ T +S T + + + S F +++ Sbjct: 61 SITAGGKHPIVIVDNIQNNTS----------GHVDTKSLTNTIKSKISRSGKFNLVDKSR 110 Query: 101 LQNLLNERKIIRAAQENGTVAINNRIPLQSLTAANIMVEGSIIGYESNVKSGGVGARYFG 160 + + + + V I +T A M+ G + ++ + G ++ Sbjct: 111 VDAVRKQMNFSENDR---FVNQGTAIQFAKMTGAQYMLYGHLT--NTSKRQDGQNVPFY- 164 Query: 161 IGADTQYQLDQIAVNLRVVNVSTGEI 186 + +R+++ ++G I Sbjct: 165 ------------KMTMRLMDTNSGSI 178 >UniRef50_UPI00016C46FA hypothetical protein GobsU_15593 n=1 Tax=Gemmata obscuriglobus UQM 2246 RepID=UPI00016C46FA Length = 338 Score = 50.6 bits (119), Expect = 6e-05, Method: Composition-based stats. 
Identities = 34/187 (18%), Positives = 66/187 (35%), Gaps = 20/187 (10%) Query: 3 RLFLLVAVMLLSGCLTAPPKEAARPTLMPRAQSYKDLTHLPAPTGKIFVSVYNIQDETGQ 62 R+ L V+ +L+ G L A + +A+ + P V++ Sbjct: 5 RIVLAVSAVLVCGLLPAAAFGHGPDKVSDKAKDKQTKGGKDKPVLP--VALLGFD----- 57 Query: 63 FKPYPASNFSTAVPQSATAMLVTALKDSRWFIPLERQGLQNLLNERKIIRAAQENGTVAI 122 + + T +L L F ++R L+ L+E+ + G V Sbjct: 58 ----ERGAGAKGLGPKVTDLLFLKLAARTEFFLVDRADLKKALDEQVLSLT----GAVKA 109 Query: 123 NNRIPLQSLTAANIMVEGSIIGYESNVK-----SGGVGARYFGIGADTQYQLDQIAVNLR 177 + + + LT A +++ GS++ + V R G A+ D + + Sbjct: 110 DTAVTVGQLTGAKLLITGSVVQIDKRVHLIAKVISSETGRVVGASAEGTLSDDLEGLVAQ 169 Query: 178 VVNVSTG 184 + NV TG Sbjct: 170 LANVITG 176 >UniRef50_B1Y5R8 Peptidoglycan-binding domain 1 protein n=6 Tax=Proteobacteria RepID=B1Y5R8_LEPCP Length = 312 Score = 50.2 bits (118), Expect = 7e-05, Method: Composition-based stats. Identities = 34/233 (14%), Positives = 80/233 (34%), Gaps = 14/233 (6%) Query: 15 GCLTAPPKEAARPTLMPRAQSYKDLTHLPAPTGKIFVSVYNIQDETGQFKPYPASNFSTA 74 GCL+ PK + A + + + K S+ I + P+ Sbjct: 30 GCLSTAPKLGENKGTVSGAAGGETAENANSQLEKCEESLGTIAVQEDTNAPWYFQLRERQ 89 Query: 75 VPQSATAMLVTALKDSRWFIPLER-QGLQNLLNERKIIRAAQENGTVAINNRIPLQSLTA 133 + + + + ++ S F+ +ER G+ N++ ER + +++G + + + + Sbjct: 90 LGSTVPVIRL-MIQQSNCFVVVERGAGMNNMMAERNL----EKSGEMRGGSNFGGGQMVS 144 Query: 134 ANIMVEGSI---IGYESNVKSGGVGARYFGIGADTQYQLDQIAVNLRVVNVSTGEILSSV 190 A+ + I G GA Q ++ + L +++ +G +S+ Sbjct: 145 ADYTLRPEIQFSGKTGGGGGFLGGGALGLIGAVAGQMGKNEASTTLLLIDNRSGVQISAS 204 Query: 191 NTSKTILSYEVQAGVFRFIDYQRLLEGEVGYTSNEPVMLCLMSAIETGVIFLI 243 + + + + F G S P ++SA ++ Sbjct: 205 EGTGSNYDFGLFGAAFTGGLGGG-----GGGYSKTPQGKVIVSAFADSYNQMV 252 >UniRef50_B2V7H8 Putative uncharacterized protein n=1 Tax=Sulfurihydrogenibium sp. YO3AOP1 RepID=B2V7H8_SULSY Length = 225 Score = 50.2 bits (118), Expect = 7e-05, Method: Composition-based stats. 
Identities = 21/136 (15%), Positives = 48/136 (35%), Gaps = 16/136 (11%) Query: 68 ASNFSTAVPQSATAMLVTALKDSRWFIPLERQGLQNLLNERKIIRAAQENGTVAINNRIP 127 S + + ML+TALK+S F + L +K + A + ++ Sbjct: 32 GSANVQGIGKGLGNMLITALKESGCFKVI---DLDQFEQVKKKLEATGQ--------KVQ 80 Query: 128 LQSLTAANIMVEGSIIGYESNVKSGGVGARYFGIGADTQ--YQLDQIAVNLRVVNVSTGE 185 + + + +I + SG +G I + Q ++ +++ +++ T E Sbjct: 81 PPKI---DKFINLTITQIALSRSSGALGGGLIPIIGAIKKDTQSAEVGIDVALMDPVTLE 137 Query: 186 ILSSVNTSKTILSYEV 201 I + + Sbjct: 138 ISEAKSFKANSEKTSW 153 >UniRef50_C4XP04 Putative uncharacterized protein n=1 Tax=Desulfovibrio magneticus RS-1 RepID=C4XP04_DESMR Length = 426 Score = 49.8 bits (117), Expect = 9e-05, Method: Composition-based stats. Identities = 37/266 (13%), Positives = 74/266 (27%), Gaps = 46/266 (17%) Query: 1 MQR---LFLLVAVMLLSGCLTAPPKEAARPTLMPRAQSYKDLTHLPAPTGKIFVSVYNIQ 57 M+R L +++++LLSGC+ A K + P P V V Sbjct: 1 MKRSGILGAVLSLVLLSGCVGAGKKATELADQALYKPVEYENAAAPGPE----VVVLP-- 54 Query: 58 DETGQFKPYPASNFSTAVPQSATAMLVTALKDSRWFIPLERQGLQNLLNERKIIRAAQEN 117 G+ K + + L F+ L+R Q E + A Sbjct: 55 ---GKIKSSNYAFVQKVTSNNLRDFAEIELSKDN-FVVLDRADSQPFFQEIAL---AANL 107 Query: 118 GTVAINNRIPLQSLTAANIMVEGSIIGYESNV-KSGGVGARYFG---------------- 160 G AA ++ +I+ E V + G G Sbjct: 108 GDADALKVFRKGKFKAARWLLTFNILKAEPTVYVTKGFDGETAGAAIELIALIANKGEKS 167 Query: 161 -IGADTQYQLDQ-----------IAVNLRVVNVSTGEILSSVNTSKTILSYEVQAGVFRF 208 +G+ + + + ++V+ STG+ ++ + + Sbjct: 168 SLGSMVGKTVASAKAADTSAIWLVGLQYKIVDASTGQQMAQGYIEDKMEATTSMNSFLGA 227 Query: 209 IDYQRLLEGEVGYTSNEPVMLCLMSA 234 + Q + + + + Sbjct: 228 TNQQST-SITLDTMAQRLIQKAVAEI 252 >UniRef50_Q1Q2L7 Putative uncharacterized protein n=1 Tax=Candidatus Kuenenia stuttgartiensis RepID=Q1Q2L7_9BACT Length = 197 Score = 49.8 bits (117), Expect = 1e-04, Method: Composition-based stats. 
Identities = 32/243 (13%), Positives = 72/243 (29%), Gaps = 57/243 (23%) Query: 4 LFLLVAVMLLSGCLTAPPKEAARPTLMPRAQSYKDLTHLPAPTGKIFVSVYNIQDETGQF 63 + + +L GC T +++ + A ++V + D+ Sbjct: 7 IIAFIICLLCFGCKTI-------------SKTNIIVDSNDASRSIKTIAVMHFNDQLLPK 53 Query: 64 KPYPASNFSTAVPQSATAMLVT----ALKDSRWFIPLERQGLQNLLNERKIIRAAQENGT 119 K + T +A ML + L + R + ++N A E Sbjct: 54 KGVTGTLVKTISTANAGEMLASIMSRELSGLGIYEVRSRTDIAKIIN----KSKANEKEL 109 Query: 120 VAINNRIPLQSLTAANIMVEGSIIGYESNVKSGGVGARYFGIGADTQYQLDQIAVNLRVV 179 V + L L + +V G I+ ++ + + Y+ ++ + Sbjct: 110 VERRDYDRLGKLLGVDSVVIGKILEFK--------------LSSSLIYERGTVSFVAECI 155 Query: 180 NVSTGEILSSVNTSKTILSYEVQAGVFRFIDYQRLLEGEVGYTSNEPVMLCLMSAIETGV 239 + G IL +++ +++ + E + +AIE Sbjct: 156 DTKNGNILWTIDVNESAAYED----------------------EIELAGKAMRTAIEKLK 193 Query: 240 IFL 242 L Sbjct: 194 KEL 196 >UniRef50_C1TRF9 Putative uncharacterized protein n=1 Tax=Dethiosulfovibrio peptidovorans DSM 11002 RepID=C1TRF9_9BACT Length = 159 Score = 49.8 bits (117), Expect = 1e-04, Method: Composition-based stats. Identities = 21/129 (16%), Positives = 47/129 (36%), Gaps = 29/129 (22%) Query: 75 VPQSATAMLVTALKDSRWFI--PLERQGLQNLLNERKIIRAAQENGTVAINNRIPLQSLT 132 + + + ++ L ++R + E+KI +G VA + + Sbjct: 40 LGAAVSDIVRNELSTLPEITMVVVDRNHIVKAAGEQKI----GMSGLVAPETAAKVGRIV 95 Query: 133 AANIMVEGSIIGYESNVKSGGVGARYFGIGADTQYQLDQIAVNLRVVNVSTGEILSSVNT 192 A ++ GS+ GG +I+++ R+V+V +GE++ S + Sbjct: 96 GARYVLVGSVNSL------GG-----------------EISLDSRLVDVESGEVIESFSA 132 Query: 193 SKTILSYEV 201 S + Sbjct: 133 SSDAGQEGL 141 >UniRef50_A6EQ48 Putative uncharacterized protein n=1 Tax=unidentified eubacterium SCB49 RepID=A6EQ48_9BACT Length = 299 Score = 49.8 bits (117), Expect = 1e-04, Method: Composition-based stats. 
Identities = 33/222 (14%), Positives = 69/222 (31%), Gaps = 30/222 (13%) Query: 45 PTGKIFVSVYNIQDETGQFKPYPASNFSTAVPQSATAMLVTALKDSRWFIPLERQGLQNL 104 KI ++ I Y + S A Q+ ++ +R F ++R+ L L Sbjct: 19 AQEKISIAFIPI--------SYDETMISKADAQTIQQSVLNKFVTARKFSVVDREQLDEL 70 Query: 105 LNERKIIRAAQENGTVAINNRIPLQSLTAANIMVEGSIIGYESNVKSGGVGARYFGIGAD 164 NE+ + R + + A+ ++ SI+ + G + Sbjct: 71 ENEKNLQRT---ESFMDSEPNVTDGVSLGASYLISTSILSLRHSEIRRGWESML------ 121 Query: 165 TQYQLDQIAVNLRVVNVSTGEILSSVNTSKTIL--SYEVQAGVFRFIDYQRLLEGEVGYT 222 + ++V+++STG+IL++ N S + S + ++ L Sbjct: 122 --------QLQIKVLDISTGQILATENISSEFIEPSNLILKARKEYLSKDELKVISDKED 173 Query: 223 SNEPVMLCLMSAIETGVIFL---INDGIDRGLWDLQNKAERQ 261 E + A + L I + Sbjct: 174 RLEEIQSHKEDAFIMALQRLELNIQKFTGKNFPVALEIINWN 215 >UniRef50_Q4HPW6 Probable periplasmic protein Cj0093 n=14 Tax=Campylobacterales RepID=Q4HPW6_CAMUP Length = 400 Score = 49.8 bits (117), Expect = 1e-04, Method: Composition-based stats. Identities = 21/171 (12%), Positives = 53/171 (30%), Gaps = 20/171 (11%) Query: 27 PTLMPRAQSYKDLTHLPAPTGKIFVSVYNIQDETGQFKPYPASNFSTAVPQSATAMLVTA 86 + + P + S+ + + ++T Sbjct: 112 ANVTIFKTTTTKKYQAPGLSADNRRSISVFD--------STPDPTKRGIGAALQQKIITN 163 Query: 87 LKDSRWFIPLERQGLQNLLNERKIIRAAQENGTVAINNRIPLQSLTAANIMVEGSIIGYE 146 L SR F L+R E+ +I ++G + L+++ + ++ SI + Sbjct: 164 LLQSRKFNVLDRDSNGYYEMEKALI----QSGNATKDEIYKLKNVLGTDYILLFSISALD 219 Query: 147 SNVKSGGVGARYFGIGADTQYQLDQIAVNLRVVNVSTGEILSSVNTSKTIL 197 K+ + ++ ++ RV+ +T +I + S + Sbjct: 220 GKQKTSNL--------TGKSKMEAEVVIDYRVLLFATRQIKFANTLSMKVA 262 >UniRef50_B3E9Y8 Tetratricopeptide TPR_2 repeat protein n=1 Tax=Geobacter lovleyi SZ RepID=B3E9Y8_GEOLS Length = 660 Score = 49.0 bits (115), Expect = 2e-04, Method: Composition-based stats. 
Identities = 36/276 (13%), Positives = 74/276 (26%), Gaps = 70/276 (25%) Query: 31 PRAQSYKDLTHLPAPTGKIFVSVYNIQDETGQFKPYPASNFSTAVPQSATAMLVTALKD- 89 Q + L ++V + T ST + T L++ + Sbjct: 354 ESQQRLQTLKDRLRQRVVKKIAVMDFSPPTN----------STDAGKLVTDSLLSYMTKN 403 Query: 90 -SRWFIPLERQGLQNLLNERKIIRAAQENGTVAINNRIPLQSLTAANIMVEGSIIGYESN 148 S L R L ++ E + + G I + L +I + GS++ Y Sbjct: 404 ASSDVKILARDVLGAIIKEIEF----GQAGLYDIESAKKTGKLKGTDIFIFGSVLQYNVE 459 Query: 149 VKSGGVGARYFGIGADTQ------------------------------------------ 166 + + A Q Sbjct: 460 KSAEEGSKMVNVVVATKQVPNPAYQSWLLSHPSPNEKEMALAPPALMKEEIRETVKYKVG 519 Query: 167 --YQLDQIAVNLRVVNVSTGEILSSVNTSKTILSYEVQAGVFRFIDYQRLLEGEVGYTSN 224 + +A++ RV++V +GE++ + + + + F + + Sbjct: 520 THKKTANVALSFRVIDVESGEVVITTTIKSRKEAEDKFSEGVEF--------ANIPFDPM 571 Query: 225 EPVMLCLMSAIETGVIFLINDGIDRGLWDLQNKAER 260 E + +E V I + L QN+ Sbjct: 572 ELPSDAV--LLEKAVDEGIAELGRLVLTRFQNRQAS 605 >UniRef50_C4LBF5 Putative uncharacterized protein n=1 Tax=Tolumonas auensis DSM 9187 RepID=C4LBF5_TOLAT Length = 197 Score = 48.7 bits (114), Expect = 2e-04, Method: Composition-based stats. Identities = 27/155 (17%), Positives = 55/155 (35%), Gaps = 18/155 (11%) Query: 4 LFLLVAVMLLSGCLTAPPKEAARP-----------TLMPRAQSYKDLTHLPAPTGKIFV- 51 L + +L+GC T K + T P+ DL L T K + Sbjct: 14 WLLWLTAFVLTGCNTTMYKSSGTRPSGQTVTTGTGTAQPQQVKPVDLQPLAKRTAKSVIK 73 Query: 52 --SVYNIQDE-TGQFKPYPASNFSTAVPQSATAMLVTALKDSRWFIPLERQGLQNLLNER 108 + +++ + T S T++L L S F + L+ + Sbjct: 74 RSAAFDLGKKPTLYVDMLRNSTGRPQDTAKITSVLHKELARSGRFQLIP---LEKNAAYQ 130 Query: 109 KIIRAAQENGTVAINNRIPLQSLTAANIMVEGSII 143 + + Q G++ + + L T A++++ G++ Sbjct: 131 QSLDYQQSEGSMNPSTAVQLGKQTGADLILYGNVS 165 >UniRef50_A8ZSW8 Tetratricopeptide TPR_2 repeat protein n=1 Tax=Desulfococcus oleovorans Hxd3 RepID=A8ZSW8_DESOH Length = 401 Score = 48.3 bits (113), Expect = 3e-04, Method: Composition-based stats. 
Identities = 23/117 (19%), Positives = 42/117 (35%), Gaps = 18/117 (15%) Query: 52 SVYNIQDETGQFKPYPASNFSTAVPQSATAMLVTALKDSRWFIPLERQGLQNLLNERKII 111 +V+ +D+T P N + A ++ AM ++ L +ER LQ L+ E + Sbjct: 163 AVFYYEDKT------PGKNMA-AFQKALAAMTISNLSHINSIQVVERLRLQALMEEMAL- 214 Query: 112 RAAQENGTVAINNRIPLQSLTAANIMVEGSI-------IGYESNVKSGGVGARYFGI 161 G V L L A ++ G++ S + +G + Sbjct: 215 ---GRTGIVDSKTAPRLGRLVGAEHLIVGTLSKDIRTDTALASTTRRKVIGNAALTV 268 >UniRef50_A8UWV5 Putative uncharacterized protein n=1 Tax=Hydrogenivirga sp. 128-5-R1-1 RepID=A8UWV5_9AQUI Length = 250 Score = 48.3 bits (113), Expect = 3e-04, Method: Composition-based stats. Identities = 30/211 (14%), Positives = 70/211 (33%), Gaps = 26/211 (12%) Query: 4 LFLLVAVMLLS-GCLTAPPKEAARPT---LMPRAQSYKDLTHLPAPTGKIFVSVYNIQDE 59 ++ + L+S C ++ + L + + G F+ ++ Sbjct: 3 FVVIALLFLISLSCAPKVSQQESIRVAVLLPAKGKWEYRQYENGREVGTYFLEYLFFENY 62 Query: 60 TGQFKPYPASNFSTAVPQSA--TAMLVTALKDSR----WFIPLERQGLQNLLNERKIIRA 113 +S + +S +L A+++ + ++R+ + + E+K Sbjct: 63 RSDKGSIYSSMARDRIEESTLCDGVLSKAIRNIKGKGLRVDIVDRRNISKIFEEQKF--- 119 Query: 114 AQENGTVAINNRIPLQSLTAANIMVEGSIIGYESNVKSGGVGARYFGIGADTQYQL---- 169 Q +G V N + + + A+ +V + IG Y+ Sbjct: 120 -QYSGFVDKNTMVKIGKILGADYLV---FVEPRYISLKKDSYVDVRKIGVPLVYEKGKHF 175 Query: 170 -----DQIAVNLRVVNVSTGEILSSVNTSKT 195 + + + VVNV TGE++ + T Sbjct: 176 CLSFASSVRLKISVVNVETGEMVVTRIYKGT 206 >UniRef50_A8V0I3 Putative uncharacterized protein n=1 Tax=Hydrogenivirga sp. 128-5-R1-1 RepID=A8V0I3_9AQUI Length = 191 Score = 47.9 bits (112), Expect = 4e-04, Method: Composition-based stats. 
Identities = 29/211 (13%), Positives = 73/211 (34%), Gaps = 36/211 (17%) Query: 1 MQRLFLLVA-VMLLSGCLTAPPKEAARPTLMPRAQSYKDLTHLPAPTGKIFVSVYNIQDE 59 M+++F+ + + L+ C K+ +L P + P ++V Sbjct: 1 MRKIFIFLFFISLIISCA----KKVQVSSLNP--------PNYPQVMKYKKIAVLPF--- 45 Query: 60 TGQFKPYPASNFSTAVPQSATAMLVTALKDSRWFIPLERQGLQNLLNERKIIRAAQENGT 119 K A + + + + + ++ +RQ ++ +L+E+K +G Sbjct: 46 ----KGKGADLLALEIESALANIYIN---GKPFYQIADRQTIKQVLSEQKFS----SSGL 94 Query: 120 VAINNRIPLQSLTAANIMVEGSI----IGYESNVKSGGVGARYFGIGADTQY-QLDQIAV 174 V + + + L + S+ + Y + Y + I Sbjct: 95 VNEKDAVKIGKLLGVQGIYTVSVLKSGVSYNQTYEKRVKCLDDKCKKTREYYVRCKNITA 154 Query: 175 NL----RVVNVSTGEILSSVNTSKTILSYEV 201 R+++V TG+++ + N ++ + V Sbjct: 155 VFSFIPRLIDVETGKVVFAKNYTRQESAKTV 185 >UniRef50_A7HKS6 Putative uncharacterized protein n=1 Tax=Fervidobacterium nodosum Rt17-B1 RepID=A7HKS6_FERNB Length = 390 Score = 47.5 bits (111), Expect = 5e-04, Method: Composition-based stats. Identities = 18/135 (13%), Positives = 45/135 (33%), Gaps = 19/135 (14%) Query: 68 ASNFSTAVPQSATAMLVTALKDSRWFIPLERQGLQNLLNERKIIRAAQENGTVAINNRIP 127 + ++ ++L + F R L+ +L ER + + ++ + Sbjct: 41 GNGWNMDEVDYLLSILEEQALELGRFQLFPRADLEKILKERNLT-------ELGVSEAVE 93 Query: 128 LQSLTAANIMVEGSIIGYESNVKSGGVGARYFGIGADTQYQLDQIAVNLRVVNVSTGEIL 187 + L + + ++ ++ S T +++ N+ GE+L Sbjct: 94 IGKLGGSKYALLLTLTELSASWSS------------KTNSYQAVSRYTIKLYNIENGELL 141 Query: 188 SSVNTSKTILSYEVQ 202 +S + T S E Sbjct: 142 ASKSMESTGSSKETS 156 >UniRef50_B0SDG2 Putative uncharacterized protein n=2 Tax=Leptospira biflexa serovar Patoc RepID=B0SDG2_LEPBA Length = 248 Score = 47.5 bits (111), Expect = 5e-04, Method: Composition-based stats. 
Identities = 18/139 (12%), Positives = 47/139 (33%), Gaps = 28/139 (20%) Query: 49 IFVSVYNIQDETGQFKPYPASNFSTAVPQSATAMLVTALKDSRWFIPLERQGLQNLLNER 108 V+V + + + + + +L F +ER+ L ++NE+ Sbjct: 82 TKVAVLVFD--------IEEAKWGDEFTDAVSLQIAKSLP----FKVIEREQLSKVVNEQ 129 Query: 109 KIIRAAQENGTVAINNRIPLQSLTAANIMVEGSIIGYESNVKSGGVGARYFGIGADTQYQ 168 + G + + L + + +V G + + G + Sbjct: 130 SFSKT----GIIDTQTAVRLGKVLGVDALVFGRGSALKKFDEKGKLIPNLVD-------- 177 Query: 169 LDQIAVNLRVVNVSTGEIL 187 V+L++V++ +G ++ Sbjct: 178 ----TVSLKIVHIESGHVI 192 >UniRef50_A6GKK1 Putative lipoprotein n=1 Tax=Plesiocystis pacifica SIR-1 RepID=A6GKK1_9DELT Length = 208 Score = 47.5 bits (111), Expect = 5e-04, Method: Composition-based stats. Identities = 19/140 (13%), Positives = 44/140 (31%), Gaps = 27/140 (19%) Query: 47 GKIFVSVYNIQDETGQFKPYPASNFSTAVPQSATAMLVTALKDSRWFIPLERQGLQNLLN 106 V+++ I++ T Q + + T + +S + R+ ++ Sbjct: 82 EAPIVAIWPIKNSTDQHIDDQMLTLLSEI--------ETQMINSGAVNVVSRERQAEMVA 133 Query: 107 ERKIIRAAQENGTVAINNRIPLQSLTAANIMVEGSIIGYESNVKSGGVGARYFGIGADTQ 166 E ++ Q + L + A + G I + + Sbjct: 134 EAQL----QNSDLFNPATAAQLGAQLGAKYYITGKITSTDERFD---------------K 174 Query: 167 YQLDQIAVNLRVVNVSTGEI 186 + Q ++ L+V+ V T I Sbjct: 175 ERRVQYSLFLQVIEVETSMI 194 >UniRef50_A3EPA7 Putative uncharacterized protein n=3 Tax=Leptospirillum RepID=A3EPA7_9BACT Length = 205 Score = 47.1 bits (110), Expect = 6e-04, Method: Composition-based stats. 
Identities = 34/213 (15%), Positives = 67/213 (31%), Gaps = 32/213 (15%) Query: 4 LFLLVAVMLLSGCLTAPPKEAARPTLMPRAQSYKDLTHLPAPTGKIFVSV-----YNIQD 58 L ++ +L GC + K + T + D I ++ N Q+ Sbjct: 12 LGTILLAPVLQGCSSYSVKRVSVDTEGGVTSRWTDTDARLTAKKLIKKALDTPWLTNFQE 71 Query: 59 ETGQFKPYPASNFSTAVPQSATAML-----VTALKDSR--WFIPLERQGLQNLLNERKII 111 + G+ Q L L +S F+ + L ER Sbjct: 72 KHGRRPVVELGQMINRSDQHINTRLFLNHFQDELINSGKVRFVTASEAHRRALQEER--- 128 Query: 112 RAAQENGTVAINNRIPLQSLTAANIMVEGSIIGYESNVKSGGVGARYFGIGADTQYQLDQ 171 A + + T A+ ++ GSI Y + GG R++ Sbjct: 129 --AYQMKHARSSTVHGPGEQTGADFLLTGSINSY--MARRGGKTVRFY------------ 172 Query: 172 IAVNLRVVNVSTGEILSSVNTSKTILSYEVQAG 204 +L+ ++++T EI+ ++++ G Sbjct: 173 -ETHLKAIDLTTNEIIWGTEYRVKKIAHQSGYG 204 >UniRef50_A0L966 Putative uncharacterized protein n=1 Tax=Magnetococcus sp. MC-1 RepID=A0L966_MAGSM Length = 302 Score = 46.4 bits (108), Expect = 0.001, Method: Composition-based stats. Identities = 33/173 (19%), Positives = 67/173 (38%), Gaps = 16/173 (9%) Query: 1 MQRLFLLVAVMLLSGCLTAPPKEAARPTLMPRAQSYKDLTHLPAPTGKIFVSVYNIQDET 60 M+ ++ ++LLSGC +E A + + L A V V ++ + Sbjct: 1 MRLWVGIMFLLLLSGCGGKSIEEGAESLV-------QQLVEQGAVFNGKRVVVADLLER- 52 Query: 61 GQFKPYPASNFSTAVPQSATAMLVTALKDSRWFIPLERQGLQNLLNERKIIRAAQENGTV 120 S ++TA+ ++ L+ L R LER L+ +L E K Q + V Sbjct: 53 ---HSNQTSVYTTALSENLRNGLIPHLGRLRS-SLLERSMLEQVLKEFKF----QSSFWV 104 Query: 121 AINNRIPLQSLTAANIMVEGSIIGYESNVKSGGVGARYFGIGADTQYQLDQIA 173 + +A+++V G++ V + F + A + ++ ++ Sbjct: 105 DQEKAKQVGKALSADLVVMGTVDKTTGQALVRVVELQSFSVMAGARERIFMVS 157 >UniRef50_A7HBE1 Lipoprotein, putative n=1 Tax=Anaeromyxobacter sp. Fw109-5 RepID=A7HBE1_ANADF Length = 389 Score = 46.0 bits (107), Expect = 0.001, Method: Composition-based stats. 
Identities = 24/144 (16%), Positives = 51/144 (35%), Gaps = 29/144 (20%) Query: 43 PAPTGKIFVSVYNIQDETGQFKPYPASNFSTAVPQSATAMLVTALKDSRWFIPLERQGLQ 102 P + ++V Q+ T + A+ L AL R + LE ++ Sbjct: 231 PPKDRAVTIAVVPFQNRTQREHAGEV----------ASLQLTRALAGIRGYRVLEPAVVR 280 Query: 103 NLLNERKIIRAAQENGTVAINNRIPLQSLTAANIMVEGSIIGYESNVKSGGVGARYFGIG 162 + R+I+ + G R+ L A+ ++ G + Y+ + G+ A Sbjct: 281 EEMLRRRIV---VQEGVSRETVRMLRGGLE-ADYVIGGMVTRYDEARGAKGIPA------ 330 Query: 163 ADTQYQLDQIAVNLRVVNVSTGEI 186 + V + ++ +TG + Sbjct: 331 ---------VDVTVTMLETATGRV 345 >UniRef50_Q1K1J1 Lipoprotein, putative n=1 Tax=Desulfuromonas acetoxidans DSM 684 RepID=Q1K1J1_DESAC Length = 189 Score = 46.0 bits (107), Expect = 0.001, Method: Composition-based stats. Identities = 32/209 (15%), Positives = 68/209 (32%), Gaps = 44/209 (21%) Query: 5 FLLVAVMLLSGCLTAPPKEAARPTLMPRAQSYKDLTHLPAPTGKIFVSVYNIQDETGQFK 64 LL+AV+L +GC T L ++ +++ +++ T + Sbjct: 7 LLLIAVLLCAGCA---------KTGHHYTDPAVGLGYI------KKIAILPLENFTARKG 51 Query: 65 PYPASNFSTAVPQSATAMLVTALKDSRWFIPLERQGLQNLLNERKIIRAAQENGTVAINN 124 S +L T + + +E+ L L + + E + Sbjct: 52 IEERSR----------ELLTTRILGHGLYEVVEKGELHRFLRDEI---RSNEKELIDQRV 98 Query: 125 RIPLQSLTAANIMVEGSIIGYESNVKSGGVGARYFGIGADTQYQLDQIAVNLRVVNVSTG 184 + + GS+ + + V++G Y +IA++LR+V++ TG Sbjct: 99 AKRMAREFNIEAYIAGSVDEF-TEVRNG-------------SYTYPEIAISLRMVDIKTG 144 Query: 185 EILSSVNTSKTILSYEVQAGVFRFIDYQR 213 ++ S Y +F Sbjct: 145 NVVW--KASHHANGYSTAGRLFGLTAEDT 171 >UniRef50_C8R2C8 Putative uncharacterized protein n=1 Tax=Desulfurivibrio alkaliphilus AHT2 RepID=C8R2C8_9DELT Length = 188 Score = 45.6 bits (106), Expect = 0.002, Method: Composition-based stats. 
Identities = 26/151 (17%), Positives = 52/151 (34%), Gaps = 29/151 (19%) Query: 50 FVSVYNIQDETGQFKPYPASNFSTAVPQSATAMLVTALKDSRWFIPLERQGLQNLLNERK 109 V+V +++ G + + V +++T + S F ++ + ++L E Sbjct: 40 RVAVLPLENLGG------GAGQAERV----REIVITQVLASGLFDVADKGRVDSVLREEA 89 Query: 110 IIRAAQENGTVAINNRIPLQSLTAANIMVEGSIIGYESNVKSGGVGARYFGIGADTQYQL 169 I A + L + GS+ E +S G A Sbjct: 90 IAPGAA----IEDATLRRLGQTLGTEAFILGSV---EQGTESRGGAAYP----------- 131 Query: 170 DQIAVNLRVVNVSTGEILSSVNTSKTILSYE 200 +I + LR+V+ +G IL + + S Sbjct: 132 -EINLTLRLVDSESGLILWQASGRGSGYSVS 161 >UniRef50_A1WBB7 Putative lipoprotein n=1 Tax=Acidovorax sp. JS42 RepID=A1WBB7_ACISJ Length = 100 Score = 45.6 bits (106), Expect = 0.002, Method: Composition-based stats. Identities = 14/91 (15%), Positives = 34/91 (37%), Gaps = 10/91 (10%) Query: 4 LFLLVAVMLLSGCLTA-PPKEAARPTLMPRAQSYKDLTHLPAPTGKIFVSVYNIQDETGQ 62 ++ L +GC+T + P A++Y+ + + +S+ + + Sbjct: 10 ALVIAVAALTAGCVTTEQSRTLETPKPTASARAYQGV--------RSPISIGKFANRSNF 61 Query: 63 FKPYPASNFSTAVPQSATAMLVTALKDSRWF 93 + + + A +L+T L+ S F Sbjct: 62 QRGIFSDGVDR-LGGQAQTILMTHLQQSGRF 91 >UniRef50_B9B5R2 Peptidoglycan-binding domain 1 protein n=29 Tax=Proteobacteria RepID=B9B5R2_9BURK Length = 261 Score = 45.2 bits (105), Expect = 0.002, Method: Composition-based stats. 
Identities = 33/250 (13%), Positives = 79/250 (31%), Gaps = 23/250 (9%) Query: 6 LLVAVMLLSGC---LTAPPKEAARPTLMPRAQSYKDLTHLPAPTGKI-FVSVYNIQDETG 61 + ++GC +T + A S T L T + ++V + ++ Sbjct: 1 MAALCASVAGCGGMVTPGGQNAGVTGAAAGGTSAGADTQLQRCTTPLGTIAVDDGRN--- 57 Query: 62 QFKPYPASNFSTAVPQSATAMLVTALKDSRWFIPLERQGLQNLLNERKIIRAAQENGTVA 121 + +T V S +L A++ S F+ + +I + + +G Sbjct: 58 -ADWWGPFGSATKVT-SIDPLLRLAVQQSNCFVITSLGNQKTDARLSRITQLQRNSGEYR 115 Query: 122 INNRIPLQSLTAANIMVEGSIIGYES-------NVKSGGVGARYFGIGADTQYQLDQIAV 174 ++ AA+ +E I+ +S + + + Q + + + Sbjct: 116 AGSKQQKGQRVAADYYMEPQIVISDSPIGGIGSMIGGLIGNSAVAAVAGHLQTKASVVTL 175 Query: 175 NLRVVNVSTGEILSSVNTSKTILSYEVQAGVFRFIDYQRLLEGEVGYTSNEPVMLCLMSA 234 L +V + +++ S T +Y + G + S+ P + A Sbjct: 176 TL--FDVRSAVQIAAAEGSSTATNYGAAL-----GGLGGGVGGGLAGFSSTPEGKATVVA 228 Query: 235 IETGVIFLIN 244 ++ Sbjct: 229 FIDAWNKMVV 238 >UniRef50_Q1IJH0 Serine/threonine protein kinase with TPR repeats n=3 Tax=Candidatus Koribacter versatilis Ellin345 RepID=Q1IJH0_ACIBL Length = 1023 Score = 45.2 bits (105), Expect = 0.003, Method: Composition-based stats. Identities = 21/145 (14%), Positives = 45/145 (31%), Gaps = 34/145 (23%) Query: 51 VSVYNIQDETGQFKPYPASNFSTAVPQSATAMLVTALKDSRWFIPLERQGLQNLLNERKI 110 V+V + T N T ML T L + L + + + Sbjct: 384 VAVLYFSNLTQDPALNWLDN-------GLTDMLTTNLAQVKGLDVLASDRVMSAVQ---- 432 Query: 111 IRAAQENGTVAINNRIPLQSLTAANIMVEGSIIGYESNVKSGGVGARYFGIGADTQYQLD 170 +A+++ T+ + A+ + G++ + Sbjct: 433 -KASKDGKTLDPAQAQKIARDAGADTYITGAL----------------------LKIGPT 469 Query: 171 QIAVNLRVVNVSTGEILSSVNTSKT 195 Q+ +++R + STG+I+ S Sbjct: 470 QLRLDVRAQDTSTGQIVYSDKLEGQ 494 >UniRef50_D1N8P3 Putative uncharacterized protein n=1 Tax=Victivallis vadensis ATCC BAA-548 RepID=D1N8P3_9BACT Length = 276 Score = 44.4 bits (103), Expect = 0.004, Method: Composition-based stats. 
Identities = 26/143 (18%), Positives = 51/143 (35%), Gaps = 11/143 (7%) Query: 70 NFSTAVPQSATAMLVTALKDS-RWFIPLERQGLQNLLNERKIIRAAQENGTVAINNRIPL 128 N + ++ AL D F P++R+ L + L + + Sbjct: 118 NSPQRSTVIGSEYMIAALGDYPEAFSPVDRRALDDSL----LAIELGTQADAMAQAKEKF 173 Query: 129 QSLTAANIMVEGSIIGYESNVKSGGVGARYFGIGADTQYQLDQIAVNLRVVNVSTGEILS 188 LT A + G + ++ S + G G +T+ +L + V ++VV + T ++ Sbjct: 174 GKLTGATHALYGVVSDFQVEETS------FKGYGIETKNKLYTLDVIVKVVELGTNRVVF 227 Query: 189 SVNTSKTILSYEVQAGVFRFIDY 211 S + I + A R Sbjct: 228 SGLFTGKIKRLDHGATTRRDTGL 250 >UniRef50_A9BP36 Putative uncharacterized protein n=5 Tax=Comamonadaceae RepID=A9BP36_DELAS Length = 271 Score = 44.4 bits (103), Expect = 0.004, Method: Composition-based stats. Identities = 29/170 (17%), Positives = 57/170 (33%), Gaps = 24/170 (14%) Query: 88 KDSRWFIPLER-QGLQNLLNERKIIRAAQENGTVAINNRIPLQSLTAANIMVEGSIIGYE 146 + S F ++R GL+ + E+ + A G + N+ + A + S+ E Sbjct: 92 QQSGCFRVVDRGAGLRGTVQEQDLKNA----GVLRENSTVRKGRGYEAQYTLTPSLTFSE 147 Query: 147 SNVKSGGVG-----------ARYFGIGADTQYQLDQIAVNLRVVNVSTGEILSSVNTSKT 195 + G G A G+ +++ Q A+ L + T E +++ + Sbjct: 148 QDAGRGLAGVVAMIPVLRDIAGLVGMVEQVKFKEAQTALLLS--DNETTEQVAAATGAAR 205 Query: 196 ILSYEVQAGVFRFIDYQRLLEGEVGYTSNEPVMLCLMSAIETGVIFLIND 245 V + RL G SN + +A L++ Sbjct: 206 TTDLGVGG-----MVLGRLGAAGAG-WSNTNEGKVIAAAFLDAHNQLVHQ 249 >UniRef50_Q6MLD6 Putative uncharacterized protein n=1 Tax=Bdellovibrio bacteriovorus RepID=Q6MLD6_BDEBA Length = 309 Score = 44.4 bits (103), Expect = 0.004, Method: Composition-based stats. 
Identities = 36/203 (17%), Positives = 64/203 (31%), Gaps = 22/203 (10%) Query: 1 MQRLFLLVAVMLLSGCLTAPPKEAARPTLMPRAQSYKDLTHLPAPTGKIFVSVYNIQDET 60 M++LF L V ++ C T + +Y+ AP ++ V D Sbjct: 1 MKKLFCLAIVFCVTACATLDRSANPTTRREIKDVNYEARKDDSAPRKRMM--VLPFLD-- 56 Query: 61 GQFKPYPASNFSTAVPQSATAMLVTALKDSRWFIPLERQGLQNLLNERKIIRAAQENGTV 120 + A A + L + I L+ + L+ L ENG Sbjct: 57 ------AGDKRPQELRDQARAAFIADLNRTGEVIALDSRELKVDLA------KMIENGQY 104 Query: 121 AINNRIPLQSLTAANIMVEGSIIGYESNVKSGGVGARYFGIGADTQYQLDQIAVNLRVVN 180 ++ N ++EG II K+ VG ++ +RVV Sbjct: 105 KLSEVAKAAQALGVNTVLEGKIIDIRIKRKADNVGV------VRHLTTAFEVVAQVRVVT 158 Query: 181 VSTGEILSSVNTSKTILSYEVQA 203 G + + + T+ V+ Sbjct: 159 GRAGREVFNTVKTVTVEEQGVRV 181 >UniRef50_Q2BQZ0 Putative lipoprotein n=1 Tax=Neptuniibacter caesariensis RepID=Q2BQZ0_9GAMM Length = 243 Score = 44.4 bits (103), Expect = 0.004, Method: Composition-based stats. Identities = 51/255 (20%), Positives = 90/255 (35%), Gaps = 40/255 (15%) Query: 1 MQRLFLLVAVMLLSGCLT------APPKEAARPTLMP-------------RAQSYKDLTH 41 M+++ LLV V +LSGCL APP+ ++ P A+ L Sbjct: 1 MKKVLLLVFVAMLSGCLQQIKKEDAPPERVHIESMSPEVVAELHKLEEAHIAEKEALLDK 60 Query: 42 LPAPTGKIFVSVYNIQDETGQFKPYPASNFSTAVPQSATAMLVTALKDSR--WFIPLERQ 99 A + V V ++ + S AV Q A M V L+ +R F Sbjct: 61 EQAMLQRQVVQVDSLPN--------GLDPLSEAVAQMAVQMNV-GLQQNRVKRFPVAV-V 110 Query: 100 GLQNLLNERKIIRAAQENGTVAINNRIPLQSLTAANIMVEGSIIGYESNVKSGGVGA--- 156 NL NER++ R + I + G + + + A Sbjct: 111 PFTNLHNERRVGRFGERLEQAFIYQLQQHGYNM-VDYRAAGLTTSTKQPLSKQNLSALRV 169 Query: 157 ---RYFGIGADTQYQLDQIAVNLRVVNVSTGEILSSVNTSKTILSYEVQAGVFRFIDYQR 213 YF + D + +N RV++ +T ++L+ + + ++ G+ + + Sbjct: 170 RYKIYFLVTGTYAQHSDGMVINARVIDTTTRQVLA--TGQSHVSNARLEGGIPGYNPLEA 227 Query: 214 LLEGEVGYTSNEPVM 228 L +G + PV Sbjct: 228 LNKGMIIENRGGPVG 242 >UniRef50_A7HI06 Serine/threonine protein kinase n=1 Tax=Anaeromyxobacter sp. Fw109-5 RepID=A7HI06_ANADF Length = 990 Score = 44.0 bits (102), Expect = 0.005, Method: Composition-based stats. 
Identities = 23/115 (20%), Positives = 46/115 (40%), Gaps = 17/115 (14%) Query: 34 QSYKDLTHLPAPTG--KIFVSVYNIQDETGQFKPYPASNFSTAVPQSATAMLVTALKDSR 91 + + L PA G ++ V+V + ++ TG+ + + ML+T+L+ SR Sbjct: 349 RQRQGLDSPPAGAGEARVMVAVADFENHTGEPELG-----------GLSGMLITSLEQSR 397 Query: 92 WFIPLERQGLQNLLNERKIIRAAQENGTVAINNRIPLQSLTAANIMVEGSIIGYE 146 L R + ++L R++ R V + +V SI ++ Sbjct: 398 RLSVLTRVRMLDIL--RQLGRENVA--GVDEALGREVARSAGVRALVLASIRRFD 448 >UniRef50_B1ZST0 Putative uncharacterized protein n=2 Tax=Verrucomicrobia RepID=B1ZST0_OPITP Length = 211 Score = 43.7 bits (101), Expect = 0.007, Method: Composition-based stats. Identities = 29/220 (13%), Positives = 62/220 (28%), Gaps = 36/220 (16%) Query: 1 MQRLFLLVAVMLLSGCLTAPPKEAARPTLMPRAQSYKDLTHLPAPTGKIFVSVYN----- 55 L + + LL GC + K + + + V+V + Sbjct: 9 FTTLAIALTSALLGGCASQGVKNPTGVPVTEMRADERGFVAGTGVESQDLVAVTDKMARS 68 Query: 56 -------IQDETGQFKPYPASNFSTAVP---QSATAMLVTALKDS--RWFIPLERQGLQN 103 + +T T P + L + L R + Sbjct: 69 ILAIPEISRAQTAPRIVLDPVVNDTRFPLNKDIFNDRIRIELNKNAQGRVRFLARDRMAT 128 Query: 104 LLNERKIIRAAQENGTVAINNRIPLQSLTAANIMVEGSIIGYESNVKSGGVGARYFGIGA 163 L ER++ +++G V + + A+ + G + G + +G Sbjct: 129 LEREREL----KQSGQVTASADPSVTEFRGADYFLTGKLSGMATRTSAGTS--------- 175 Query: 164 DTQYQLDQIAVNLRVVNVSTGEILSSVNTSKTILSYEVQA 203 D + + ++++ T EI+ + E A Sbjct: 176 ------DYVLYSFQLIDARTSEIVWEDSAEIKKQGLEDAA 209 >UniRef50_B9XQI4 Calcium-binding EF-hand-containing protein n=2 Tax=bacterium Ellin514 RepID=B9XQI4_9BACT Length = 1431 Score = 43.7 bits (101), Expect = 0.007, Method: Composition-based stats. Identities = 13/73 (17%), Positives = 31/73 (42%), Gaps = 5/73 (6%) Query: 78 SATAMLVTALKDSRWFIPLERQGLQNLLNERKIIRAAQENGTVAINNRIPLQSLTAANIM 137 A +L L + LER + +L E+K+ ++ ++I I + + A+ + Sbjct: 41 QAGDLLAVELSGQTNVVLLERNEIDKILKEQKLAQSG-----ISIKESIDVGHILGADGL 95 Query: 138 VEGSIIGYESNVK 150 + ++ +N Sbjct: 96 LTLGLVQSGTNSH 108 >UniRef50_B8L8X5 Peptidoglycan-binding domain 1 protein n=1 Tax=Stenotrophomonas sp. 
SKA14 RepID=B8L8X5_9GAMM Length = 330 Score = 43.7 bits (101), Expect = 0.008, Method: Composition-based stats. Identities = 37/255 (14%), Positives = 76/255 (29%), Gaps = 14/255 (5%) Query: 8 VAVMLLSGCLTAPPKEAARPTLMPRAQSYKDLTHLPAPTGKIFVSV-YNIQDETGQFKPY 66 V L G A L + + + G+ + V +P Sbjct: 10 VLACALLGAGLLGVSAPASAGLRDSFANSRTSAQEQSKKGQAEIPVCTKPLGSISVIEPE 69 Query: 67 PASNFSTAVPQSATAMLVTALK-DSRWFIPLERQGLQNLLNERKIIRAAQENGTVAINNR 125 N+ T A + L+ S+ F ++R + + RA G + + Sbjct: 70 DVVNWWTGQQLPAPSKLIKVFVNKSKCFTLVDR---GVGMAAAQAERALASEGQLRGRSN 126 Query: 126 IPLQSLTAANIMVEGSIIGYESNV--------KSGGVGARYFGIGADTQYQLDQIAVNLR 177 I + AA+ ++ +I SN G +G + + + ++ V L Sbjct: 127 IGKGQIRAADYVLVPDLISQNSNAGGNAIGGLLGGLIGGKAGAVVSGLNFRSKTADVTLT 186 Query: 178 VVNVSTGEILSSVNTSKTILSYEVQAGVFRFIDYQRLLEGEVGYTSNEPVMLCLMSAIET 237 + +V + E ++ V + AG F G +N + + A Sbjct: 187 LTDVRSSEQVAIVEGNSKKTDLGWGAGGGLFGGGGIGAMGVG-GYANTEIGQVITLAYLQ 245 Query: 238 GVIFLINDGIDRGLW 252 ++ + Sbjct: 246 AYTNMVAELGGLPAN 260 >UniRef50_B8FNQ3 Integrin-like repeat-containing protein n=1 Tax=Desulfatibacillum alkenivorans AK-01 RepID=B8FNQ3_DESAA Length = 551 Score = 43.7 bits (101), Expect = 0.008, Method: Composition-based stats. Identities = 14/106 (13%), Positives = 35/106 (33%), Gaps = 16/106 (15%) Query: 45 PTGKIFVSVYNIQDETGQFKPYPASNFSTAVPQSATAMLVTALKDSRWFIPLERQGLQNL 104 + V++ A + + ML T L +E++ +Q Sbjct: 30 AQEPVKVAIMPFT--------MNADKDLSFLQSGIQDMLTTRLAYEGEVTVVEKKAVQE- 80 Query: 105 LNERKIIRAAQENGTVAINNRIPLQSLTAANIMVEGSIIGYESNVK 150 + A +V + + + A+ ++ GS+ + ++V Sbjct: 81 -------KVAALGDSVDKDKARKIGAELGADYVLFGSLTVFGASVS 119 >UniRef50_D0KYL0 Putative uncharacterized protein n=1 Tax=Halothiobacillus neapolitanus c2 RepID=D0KYL0_HALNC Length = 310 Score = 43.3 bits (100), Expect = 0.009, Method: Composition-based stats. 
Identities = 30/224 (13%), Positives = 64/224 (28%), Gaps = 36/224 (16%) Query: 14 SGCL---TAPPKEAARPTLMPRAQSYKDLTHLPAPTGKIFVSVYNIQDETGQFKPYPASN 70 +GC TAPP E+A+ A K V + + P N Sbjct: 21 TGCAPGMTAPPGESAQANAQTPAPKTDGALQCAEARFKKTVMLTRV---WLPSPPQDVVN 77 Query: 71 FSTAVPQSATAMLVTALKDSRWFIPLERQGLQNLLNERKIIRAAQENGTVAINNRIPLQS 130 A+ ++ T L+++ F + + + Sbjct: 78 LGAALSKTIA----TDLQNTGHFHV----------------ETTEATQSTGPEPAVDAFK 117 Query: 131 LTAANIMVEGSIIGYESNVKSGGVGARYFGIGADTQYQLDQIAVNLRVVNVSTGEILSSV 190 + I G+ + + G + + +++R+V+ T + ++ Sbjct: 118 SHGTPYFIR--ISGHNFGLSGQASMWSFLGPSLNPRGG----TLDIRIVDALTTQTIAQT 171 Query: 191 NTSKTILSYEVQAGVFRFIDYQRLLEGEVGYTSNEPVMLCLMSA 234 S + V+AGV+ + + L +A Sbjct: 172 TVS----APSVKAGVYDPVIDALGTDFGASAYGRAMNQLAGEAA 211 >UniRef50_A3EPC9 Putative uncharacterized protein n=2 Tax=Leptospirillum sp. Group II RepID=A3EPC9_9BACT Length = 202 Score = 43.3 bits (100), Expect = 0.010, Method: Composition-based stats. Identities = 23/184 (12%), Positives = 60/184 (32%), Gaps = 16/184 (8%) Query: 9 AVMLLSGCLTAPPKEAARPTLMPRAQSYKDLTHLPAPTGKIFVSVYNIQDETG------- 61 A+ +L+GC T P P + L + ++ + +I D T Sbjct: 22 ALSVLAGCQTTPAVPYVPPKETGVTFPTEPSETLGSLVDRMGKQLEDILDRTSGGEIGLY 81 Query: 62 --QFKPYPASNFSTAVPQSATAMLVTALKDSRWFIPLERQGLQNLLNERKIIRAAQENGT 119 K + + + + L L + + ++ + + + + A G Sbjct: 82 VAIVKYTDSGGVAHSFGRVLALKLGEKLAKTGRYHVIDPGRIH-----QALRQEALNQGI 136 Query: 120 VAINNRIPLQSLTAANIMVEGSIIGYESNVK--SGGVGARYFGIGADTQYQLDQIAVNLR 177 V + + A+ +V GS ++ + + + +D+ + + Sbjct: 137 VDTRSVVRAARKAGADRVVLGSYTDLGPQIELNTRVIRISDGFVLGQFSEAVDRGSAIMN 196 Query: 178 VVNV 181 +++V Sbjct: 197 LIHV 200 >UniRef50_UPI00016C4204 hypothetical protein GobsU_09179 n=1 Tax=Gemmata obscuriglobus UQM 2246 RepID=UPI00016C4204 Length = 303 Score = 42.9 bits (99), Expect = 0.012, Method: Composition-based stats. 
Identities = 17/98 (17%), Positives = 32/98 (32%), Gaps = 7/98 (7%) Query: 50 FVSVYNIQDETGQFKPYPASNFSTAVPQSATAMLVTALKDSRWFIPLERQGLQNLLNERK 109 V V ++ETG + P V + A + L F + L + Sbjct: 54 RVVVLPFRNETGFTRNEPGYT---RVGEEAREAFIAELNKVGRFEVIP----TALDDRAY 106 Query: 110 IIRAAQENGTVAINNRIPLQSLTAANIMVEGSIIGYES 147 + ++ G + T A+++V G + Y Sbjct: 107 LAALSRCGGRFDEPLLYDIAKATNADVVVYGVVTNYSP 144 >UniRef50_Q1N7X1 Putative uncharacterized protein n=1 Tax=Sphingomonas sp. SKA58 RepID=Q1N7X1_9SPHN Length = 270 Score = 42.9 bits (99), Expect = 0.012, Method: Composition-based stats. Identities = 24/113 (21%), Positives = 43/113 (38%), Gaps = 11/113 (9%) Query: 81 AMLVTALKDSRWFIPLERQGLQNLLNERKIIRAAQENGTVAINNRIPLQSLTAANIMVEG 140 A+L ++ S F + R + R + RA + G + + + + AA+ +E Sbjct: 86 AILKVFVQQSGCFRMVNRGR---SMQSRAMERAMADAGELQAGSNLGKGQVKAADYYLEP 142 Query: 141 SIIGYESNVKSGGVGARYFGIGADTQY--------QLDQIAVNLRVVNVSTGE 185 I+ N GG+G G+ + + V L +VN T E Sbjct: 143 DIVSSNRNSGGGGIGGALGGLMGGFGGAILGGLNIKKKEANVTLSIVNARTTE 195 >UniRef50_A0LJ31 Putative uncharacterized protein n=1 Tax=Syntrophobacter fumaroxidans MPOB RepID=A0LJ31_SYNFM Length = 614 Score = 42.9 bits (99), Expect = 0.012, Method: Composition-based stats. Identities = 15/135 (11%), Positives = 34/135 (25%), Gaps = 19/135 (14%) Query: 32 RAQSYKDLTHLPAPTGKIFVSVYNIQDETGQFKPYPASNFSTAVPQSATAMLVTALKDSR 91 + G + V+V A + + ML + L Sbjct: 46 SGPCAPAAWSAESQAGPVKVAVLPFT--------MHAPSDLAYLQSGVRDMLTSRLAWQG 97 Query: 92 WFIPLERQGLQNLLNERKIIRAAQENGTVAINNRIPLQSLTAANIMVEGSIIGYESNVKS 151 L+R + ++ + + + A+ ++ GSI + Sbjct: 98 KVQVLDRSVTDQAVR--------SPKSDLSPAEAMKIGNTLRADYVLYGSITALGQAIS- 148 Query: 152 GGVGARYFGIGADTQ 166 + AR + Sbjct: 149 --IDARMVPVSGKGD 161 >UniRef50_Q6W140 Probable adenylate class-3/4/guanylyl cyclase n=4 Tax=Rhizobium RepID=Q6W140_RHISN Length = 769 Score = 42.1 bits (97), Expect = 0.020, Method: Composition-based stats. 
Identities = 25/208 (12%), Positives = 56/208 (26%), Gaps = 48/208 (23%) Query: 41 HLPAPTGKIFVSVYNIQDETGQFKPYPASNFSTAVPQSATAMLVTALKDSRWFIPLERQG 100 LP P K ++V ++ + + + T ++T L + R Sbjct: 227 ALPLPQ-KPSIAVLPFENLS-------SDAGQSYFADGMTDDVITELSKLSGIFVIARNS 278 Query: 101 LQNLLNERKIIRAAQENGTVAINNRIPLQSLTAANIMVEGSIIGYESNVKSGGVGARYFG 160 + + ++EGS+ Sbjct: 279 TFA--------------YKGKPTKVQQVAKELGVHYILEGSV------------------ 306 Query: 161 IGADTQYQLDQIAVNLRVVNVSTGEILSSVNTSKTILSYEVQAGVFRFIDYQRLLEGEVG 220 + + + + VN ++++ G L + + G+ + Q + V Sbjct: 307 -----RREGNHVRVNAQLIDALDGHHLWAQRYDGEMSGV---FGLQDRVIGQIVSTLSVK 358 Query: 221 YTSNEPVMLCLMSAIETGVIFLINDGID 248 TS E + + I + G D Sbjct: 359 LTSAEKSVAAVPETINPRAYDTLLQGWD 386 >UniRef50_B5EG18 Putative uncharacterized protein n=1 Tax=Geobacter bemidjiensis Bem RepID=B5EG18_GEOBB Length = 312 Score = 41.7 bits (96), Expect = 0.028, Method: Composition-based stats. Identities = 15/100 (15%), Positives = 32/100 (32%), Gaps = 11/100 (11%) Query: 50 FVSVYNIQDETGQFKPYPASNFSTAVPQSATAMLVTALKDSRWFIPLERQGLQNLLNERK 109 ++V Q K + Q + L L S ++R + L+E K Sbjct: 179 TIAVLPFTIHAAQNKY-------DHMSQGFSDDLTCYLMKSEDIKIIDRNTVDKALSEIK 231 Query: 110 IIRAAQENGTVAINNRIPLQSLTAANIMVEGSIIGYESNV 149 + +G + + A ++ G++ +S Sbjct: 232 ----SSNSGILDTTTAQNIGKAIGAKFVILGNVEVIDSEA 267 >UniRef50_B9XQI7 Putative uncharacterized protein n=2 Tax=Verrucomicrobia RepID=B9XQI7_9BACT Length = 317 Score = 41.0 bits (94), Expect = 0.040, Method: Composition-based stats. 
Identities = 23/142 (16%), Positives = 48/142 (33%), Gaps = 35/142 (24%) Query: 1 MQRLFLLVAVMLLSGCLTAPPKEAARPTLMPRAQSYKDLTHLPAPTGKIFVSVYNIQDET 60 M+ + + L C TA + A L V++++ + Sbjct: 8 MKLSLSTIVSLFLLLCATA--RAADPDVLT--------------------VAIFDFE--- 42 Query: 61 GQFKPYPASNFSTAVPQSATAMLVTALKDSRWFIPLERQGLQNLLNERKIIRAAQENGTV 120 + ++ L I +ER L+ +L E+++ +GTV Sbjct: 43 ------SKDEAVHDLGPKIATLVNANLSAEPQIITVERAELEKVLGEQEL----GLSGTV 92 Query: 121 AINNRIPLQSLTAANIMVEGSI 142 + + + LT A ++V G + Sbjct: 93 SADTAAKVGHLTGAKVLVTGRV 114 >UniRef50_B0TXJ5 Putative uncharacterized protein n=18 Tax=Francisella RepID=B0TXJ5_FRAP2 Length = 331 Score = 40.6 bits (93), Expect = 0.053, Method: Composition-based stats. Identities = 27/201 (13%), Positives = 61/201 (30%), Gaps = 16/201 (7%) Query: 50 FVSVYNIQDETGQFKPYPASNFSTAVPQSATAMLVTALKDSRWFIPLERQGLQNLLNERK 109 V++ + + + +++ + +R F R E + Sbjct: 70 TVAILPFDT---KGVTNESIYSEKQLRNRLNQIIIAQITQTRKFRVSNRDAKDEKAYEEE 126 Query: 110 IIRAAQENGTVAINNRIPLQSLTAANIMVEGSIIGYESNVKSGGVGARYFGIGADTQYQL 169 I R N + + L A+ ++ G I+G + K G D Sbjct: 127 IRRIINSNDSSEKD---KLNQRIGADFILTGDILGLNISKKKSSY------YGEDFTTLN 177 Query: 170 DQIAVNLRVVNVSTGEILSSVNTSKTILSYEVQAGVFRFIDYQRLLEGEVGYTSNEPVML 229 +V R++ ++T E+ S + + + +Y ++ E + Sbjct: 178 VSASVAYRMIELATMEVKWSNVVRLQVPANIAN----EYANYDNGDHSQILDYVAEQIGQ 233 Query: 230 CLMSAIETGVIFLINDGIDRG 250 + I + L +D G Sbjct: 234 TISEQIVGAIYPLQVLKVDDG 254 >UniRef50_C0AAC4 Putative uncharacterized protein n=1 Tax=Opitutaceae bacterium TAV2 RepID=C0AAC4_9BACT Length = 298 Score = 40.2 bits (92), Expect = 0.083, Method: Composition-based stats. 
Identities = 33/201 (16%), Positives = 71/201 (35%), Gaps = 21/201 (10%) Query: 40 THLPAPTGKIFVSVYNIQDETGQFKPYPASNFS-TAVPQSATAMLVTALKDSRWFIPLER 98 T +P P G +++ I + S V QS A L + ++R F + R Sbjct: 31 TSVPQPLGLKVLAIGKIAATPAAAEAASKKRVSMQRVTQSLDAQLADRVHNTRRFEVISR 90 Query: 99 QGLQNLLNERKIIRAAQENGTVAINNRIPLQSLTAANIMVEGSIIGYESNVKSGGVGARY 158 +L+ E A G A+ ++ ++ ++ ++G Sbjct: 91 SDSAHLVEEAAATGRAFAFGN--------------ADYLLTVTVDDFQDVAQTG-----D 131 Query: 159 FGIGADTQYQLDQIAVNLRVVNVSTGEILSSVNTSK-TILSYEVQAGVFRFIDYQRLLEG 217 FG ++ +++ ++ + T +++ + N + L E QA V D L Sbjct: 132 FGWAGKVTKRVIRLSAVGKIHDAKTNKLIETANFQEVKNLVEEKQAQVTEDGDLSDSLLV 191 Query: 218 EVGYTSNEPVMLCLMSAIETG 238 E+ E + ++ + Sbjct: 192 EMSRGIAEKIANKVVDVVYPA 212 >UniRef50_B2SFZ2 Lipoprotein, putative n=18 Tax=Francisella RepID=B2SFZ2_FRATM Length = 202 Score = 39.8 bits (91), Expect = 0.099, Method: Composition-based stats. Identities = 25/174 (14%), Positives = 54/174 (31%), Gaps = 19/174 (10%) Query: 17 LTAPPKEAARPTLMPRAQSYKDLTHLPAPTGKIFVSVYNI---QDETGQFKPYPASNFST 73 P ++ + + +T+ A +V I T F Sbjct: 27 AYEDPNGVDTTSINFSSTDLQAITNKMAEDMLNSPAVKRITAMDTPTLFFSNIRNETREH 86 Query: 74 AVPQSATAMLVTALKDSRWFIPLERQGLQNLLNERKIIRAAQENGTVAINNRIPLQSLTA 133 + T + S F + ++N+ R+ + +G V + Sbjct: 87 INTTMLSNTAQTQIIKSGLFQVTDMSQIKNV---REQLGYQANSGMVDQATATKIGQHIG 143 Query: 134 ANIMVEGSIIGYESNVKSGGVGARYFGIGADTQYQLDQIAVNLRVVNVSTGEIL 187 A MV GSI ++ G +++F L+++++ TG ++ Sbjct: 144 ARYMVYGSIQDIDNTNVDGDKRSKFFL-------------ATLKMMDLKTGLVV 184

  Database: uniref50.fasta
    Posted date:  Mar 8, 2010 10:38 AM
  Number of letters in database: 1,040,396,356
  Number of sequences in database:  3,077,464

Lambda     K      H
   0.308    0.122    0.286

Lambda     K      H
   0.267   0.0373    0.140

Matrix: BLOSUM62
Gap Penalties: Existence: 11, Extension: 1
Number of Hits to DB: 1,184,539,349
Number of Sequences: 3077464
Number of extensions: 37147119
Number of successful extensions: 159329
Number of sequences better than 1.0e-01: 188
Number of HSP's better than 0.1 without gapping: 136
Number of HSP's successfully gapped in prelim test: 187
Number of HSP's that attempted gapping in prelim test: 158680
Number of HSP's gapped (non-prelim): 346
length of query: 277
length of database: 1,040,396,356
effective HSP length: 127
effective length of query: 150
effective length of database: 649,558,428
effective search space: 97433764200
effective search space used: 97433764200
T: 11
A: 40
X1: 16 ( 7.1 bits)
X2: 38 (14.6 bits)
X3: 64 (24.7 bits)
S1: 41 (21.2 bits)
S2: 92 (40.2 bits)