BLASTP 2.2.22 [Sep-27-2009] Reference: Altschul, Stephen F., Thomas L. Madden, Alejandro A. Schaffer, Jinghui Zhang, Zheng Zhang, Webb Miller, and David J. Lipman (1997), "Gapped BLAST and PSI-BLAST: a new generation of protein database search programs", Nucleic Acids Res. 25:3389-3402. Reference for compositional score matrix adjustment: Altschul, Stephen F., John C. Wootton, E. Michael Gertz, Richa Agarwala, Aleksandr Morgulis, Alejandro A. Schaffer, and Yi-Kuo Yu (2005) "Protein database searches using compositionally adjusted substitution matrices", FEBS J. 272:5101-5109. Reference for composition-based statistics starting in round 2: Schaffer, Alejandro A., L. Aravind, Thomas L. Madden, Sergei Shavirin, John L. Spouge, Yuri I. Wolf, Eugene V. Koonin, and Stephen F. Altschul (2001), "Improving the accuracy of PSI-BLAST protein database searches with composition-based statistics and other refinements", Nucleic Acids Res. 29:2994-3005. Query= batch____ (138 letters) Database: uniref50.fasta 3,077,464 sequences; 1,040,396,356 total letters Searching..................................................done Results from round 1 Score E Sequences producing significant alignments: (bits) Value UniRef50_P0AEA0 Curli production assembly/transport component cs... 280 1e-74 UniRef50_A5KYB2 Curli production assembly/transport component Cs... 93 2e-18 UniRef50_A4VNF3 Curli fiber protein CsgF, putative n=10 Tax=Gamm... 91 1e-17 UniRef50_A3QDP7 Curli production assembly/transport component Cs... 82 7e-15 UniRef50_Q392B8 Putative uncharacterized protein n=1 Tax=Burkhol... 77 2e-13 UniRef50_D2QT92 Curli production assembly/transport component Cs... 62 6e-09 UniRef50_A3XJK9 Curli production assembly/transport component, C... 61 1e-08 UniRef50_Q5QXH1 Curli production assembly/transport component, C... 59 4e-08 UniRef50_A6EMN1 Curli production assembly/transport component Cs... 49 6e-05 UniRef50_Q12QL8 Curli production assembly/transport component Cs... 48 1e-04 UniRef50_C2G2I9 Curli production assembly/transport component Cs... 44 0.002 UniRef50_B4RW66 Putative curli production assembly/transport com... 44 0.002 UniRef50_A5PB14 Putative uncharacterized protein n=3 Tax=Sphingo... 40 0.021 UniRef50_C6X5S2 Curli production assembly/transport component Cs... 40 0.024 >UniRef50_P0AEA0 Curli production assembly/transport component csgF n=93 Tax=Enterobacteriaceae RepID=CSGF_ECO57 Length = 138 Score = 280 bits (716), Expect = 1e-74, Method: Compositional matrix adjust. Identities = 138/138 (100%), Positives = 138/138 (100%) Query: 1 MRVKHAVVLLMLISPLSWAGTMTFQFRNPNFGGNPNNGAFLLNSAQAQNSYKDPSYNDDF 60 MRVKHAVVLLMLISPLSWAGTMTFQFRNPNFGGNPNNGAFLLNSAQAQNSYKDPSYNDDF Sbjct: 1 MRVKHAVVLLMLISPLSWAGTMTFQFRNPNFGGNPNNGAFLLNSAQAQNSYKDPSYNDDF 60 Query: 61 GIETPSALDNFTQAIQSQILGGLLSNINTGKPGRMVTNDYIVDIANRDGQLQLNVTDRKT 120 GIETPSALDNFTQAIQSQILGGLLSNINTGKPGRMVTNDYIVDIANRDGQLQLNVTDRKT Sbjct: 61 GIETPSALDNFTQAIQSQILGGLLSNINTGKPGRMVTNDYIVDIANRDGQLQLNVTDRKT 120 Query: 121 GQTSTIQVSGLQNNSTDF 138 GQTSTIQVSGLQNNSTDF Sbjct: 121 GQTSTIQVSGLQNNSTDF 138 >UniRef50_A5KYB2 Curli production assembly/transport component CsgF n=4 Tax=Vibrionales RepID=A5KYB2_9GAMM Length = 136 Score = 93.2 bits (230), Expect = 2e-18, Method: Compositional matrix adjust. Identities = 46/128 (35%), Positives = 77/128 (60%), Gaps = 1/128 (0%) Query: 4 KHAVVLLMLISPLSWAGTMTFQFRNPNFGGNPNNGAFLLNSAQAQNSYKDPSYNDDFGIE 63 + A ++ +L SP +A + + NP+FGGN N + L N A A N Y+DPS D F E Sbjct: 6 RKAALIALLASPSIYASELVYTPVNPSFGGNALNSSHLFNHANAINDYEDPSARDIFD-E 64 Query: 64 TPSALDNFTQAIQSQILGGLLSNINTGKPGRMVTNDYIVDIANRDGQLQLNVTDRKTGQT 123 SALD +++S+++ LL+++ G G++ T D+ ++I + G L + + D+ TG++ Sbjct: 65 QESALDRLASSLESRLISQLLADVGNGNTGQLETEDFFLNIVDDSGTLLVQIVDKATGES 124 Query: 124 STIQVSGL 131 + I VSGL Sbjct: 125 TEISVSGL 132 >UniRef50_A4VNF3 Curli fiber protein CsgF, putative n=10 Tax=Gammaproteobacteria RepID=A4VNF3_PSEU5 Length = 138 Score = 90.9 bits (224), Expect = 1e-17, Method: Compositional matrix adjust. Identities = 53/105 (50%), Positives = 73/105 (69%), Gaps = 6/105 (5%) Query: 28 NPNFGGNPNNGAFLLNSAQAQNSYKDPSYNDD---FGIETPSALDNFTQAIQSQILGGLL 84 NP+FGG+P NGA+LL +AQAQN+ KDP D FG + SALD FT ++S++LG LL Sbjct: 33 NPSFGGSPLNGAWLLGNAQAQNNKKDPDALDRSSLFGNQ--SALDRFTSQLESRLLGDLL 90 Query: 85 SNINTGKPGRMVTNDYIVDIANRD-GQLQLNVTDRKTGQTSTIQV 128 S +++GK G + T+D+IV + N D G L + +TDR TG+ S I V Sbjct: 91 SGVSSGKTGTVTTDDFIVRVYNGDAGMLIVEITDRLTGEMSEIIV 135 >UniRef50_A3QDP7 Curli production assembly/transport component CsgF n=4 Tax=Shewanella RepID=A3QDP7_SHELP Length = 139 Score = 81.6 bits (200), Expect = 7e-15, Method: Compositional matrix adjust. Identities = 46/131 (35%), Positives = 77/131 (58%), Gaps = 5/131 (3%) Query: 6 AVVLLMLISPLSWAGTMTFQFRNPNFGGNPNNGAFLLNSAQAQNSYKDPSYNDDFGIETP 65 A++L S S A + + NPNFGG+ NG++LL +A AQN +K S G P Sbjct: 11 ALLLAATFSLSSQATQLIYTPVNPNFGGSYLNGSYLLANASAQNDHKGGS-----GYTPP 65 Query: 66 SALDNFTQAIQSQILGGLLSNINTGKPGRMVTNDYIVDIANRDGQLQLNVTDRKTGQTST 125 SAL+ +++S+++ L ++ G G + T D+ + + N DG L +++TD TG+T+ Sbjct: 66 SALERMASSLESRLMSQLFNDAANGGEGYLKTTDFEIQVVNEDGMLLVHITDLLTGETTV 125 Query: 126 IQVSGLQNNST 136 I+V GL ++S+ Sbjct: 126 IEVGGLSDSSS 136 >UniRef50_Q392B8 Putative uncharacterized protein n=1 Tax=Burkholderia sp. 383 RepID=Q392B8_BURS3 Length = 145 Score = 76.6 bits (187), Expect = 2e-13, Method: Compositional matrix adjust. Identities = 56/133 (42%), Positives = 78/133 (58%), Gaps = 9/133 (6%) Query: 6 AVVLLMLISPLSWAGTMTFQFRNPNFGGNPNNGAFLLNSAQAQNSYKDPSYNDDFGIETP 65 A+V + +SP +W+ T+ +Q NPNFGGNP NG LLN A AQN + DPS + + Sbjct: 10 ALVAMTAVSPYAWSSTLVYQPNNPNFGGNPANGPNLLNEANAQNKHTDPSLSAYSSPTST 69 Query: 66 SALDNFT----QAIQSQILGGLLSNI----NTGKPGRMVTNDY-IVDIANRDGQLQLNVT 116 S+LD F QAI S++ G + ++I KPG + T ++ IV A +G LQ+ T Sbjct: 70 SSLDQFNAQLQQAILSRVAGSVTNSIVGTNGELKPGTISTGNFTIVVSAAGNGSLQVTTT 129 Query: 117 DRKTGQTSTIQVS 129 D+ TG TST VS Sbjct: 130 DKTTGATSTFIVS 142 >UniRef50_D2QT92 Curli production assembly/transport component CsgF n=2 Tax=Spirosoma linguale DSM 74 RepID=D2QT92_9SPHI Length = 140 Score = 62.0 bits (149), Expect = 6e-09, Method: Compositional matrix adjust. Identities = 41/127 (32%), Positives = 66/127 (51%), Gaps = 11/127 (8%) Query: 9 LLMLISPLSWAGTMTFQFR--NPNFGGNPNNGAFLLNSAQAQNSYKDP--SYNDDFGIET 64 L ++ + F + NPNFGGN N +++L+SAQAQ+ DP + G Sbjct: 8 FLFVVCLAGYCSAQAFVYHPNNPNFGGNTFNYSWMLSSAQAQDRTTDPTATVRTANGTTQ 67 Query: 65 PSALDNFTQAIQSQILGGLLSNI-------NTGKPGRMVTNDYIVDIANRDGQLQLNVTD 117 +L++F Q++QSQ+L + S++ T KPG +Y VDI+N + + + D Sbjct: 68 TGSLNSFAQSLQSQLLSRITSSLVGSQFGEGTLKPGTYKFGEYQVDISNGASGVVVRIVD 127 Query: 118 RKTGQTS 124 K G+TS Sbjct: 128 GKGGETS 134 >UniRef50_A3XJK9 Curli production assembly/transport component, CsgF n=3 Tax=Flavobacteriaceae RepID=A3XJK9_9FLAO Length = 133 Score = 61.2 bits (147), Expect = 1e-08, Method: Compositional matrix adjust. Identities = 43/133 (32%), Positives = 71/133 (53%), Gaps = 10/133 (7%) Query: 3 VKHAVVLLMLI-SPLSWAGTMTFQFRNPNFGGNPNNGAFLLNSAQAQNSYKDPSYNDDFG 61 +K+ + LL LI S ++A +T+Q +NPNFGG N +LL+SA AQNS+KDP N Sbjct: 1 MKYILFLLFLIASSTTFAQQLTYQPKNPNFGGETFNYQWLLSSANAQNSFKDPEEN---A 57 Query: 62 IETPSALDNFTQAIQSQILGGLLSNINTGKPGRMVT------NDYIVDIANRDGQLQLNV 115 + S L++F + Q+L + + T + G + ++I + + L +N+ Sbjct: 58 AKDQSELESFASNLNRQLLSQISRGLFTAQLGEGLEAGTYNFGSLALEIYDSNEGLVINI 117 Query: 116 TDRKTGQTSTIQV 128 D TG+ + I V Sbjct: 118 LDTNTGEQTQIIV 130 >UniRef50_Q5QXH1 Curli production assembly/transport component, CsgF n=1 Tax=Idiomarina loihiensis RepID=Q5QXH1_IDILO Length = 127 Score = 59.3 bits (142), Expect = 4e-08, Method: Compositional matrix adjust. Identities = 37/129 (28%), Positives = 66/129 (51%), Gaps = 10/129 (7%) Query: 1 MRVKHAVVLLMLISPLSWAGTMTFQFRNPNFGGNPNNGAFLLNSAQAQNSYKDPSYNDDF 60 M+++ + + + + A + +Q NP+FGGNP NG FLL AQ QN++ P + F Sbjct: 1 MKLRWLFIGFLFTAVNAAATEIVYQPINPSFGGNPLNGNFLLQKAQTQNAHSAPDSDRSF 60 Query: 61 GIETPSALDNFTQAIQSQILGGLLSNINTGK--PGRMVTNDYIVDIANR-DGQLQLNVTD 117 +D F +A++ I+ L I G+ G T +Y V++ + DG + + +T Sbjct: 61 -------VDRFREALERNIINSLTRRIADGEIVEGLYDTGEYTVEVVGQPDGSVIVYITH 113 Query: 118 RKTGQTSTI 126 +G+ + I Sbjct: 114 NGSGEQTVI 122 >UniRef50_A6EMN1 Curli production assembly/transport component CsgF n=2 Tax=Bacteroidetes RepID=A6EMN1_9BACT Length = 133 Score = 48.5 bits (114), Expect = 6e-05, Method: Compositional matrix adjust. Identities = 38/124 (30%), Positives = 61/124 (49%), Gaps = 12/124 (9%) Query: 12 LISPLSWAGTMTFQFRNPNFGGNPNNGAFLLNSAQAQNSYKDPSYNDDFGIETPSALDNF 71 L+S +S A + ++ NP FGG N +LL+SA AQNS +DPS +D +L+ F Sbjct: 13 LVSSVS-AQQLVYKPVNPAFGGETFNYQWLLSSASAQNSIEDPSVRED----EIGSLEQF 67 Query: 72 TQAIQSQILGGL-----LSNINTG--KPGRMVTNDYIVDIANRDGQLQLNVTDRKTGQTS 124 +++ SQ+L L TG +PG ++I L +N+ D T + + Sbjct: 68 NESLNSQLLAQLSRTAFADQFGTGEFEPGTFSFGSLELEIVESLNGLVINILDTTTAEQT 127 Query: 125 TIQV 128 I + Sbjct: 128 QIII 131 >UniRef50_Q12QL8 Curli production assembly/transport component CsgF, putative n=14 Tax=Shewanella RepID=Q12QL8_SHEDO Length = 135 Score = 47.8 bits (112), Expect = 1e-04, Method: Compositional matrix adjust. Identities = 35/122 (28%), Positives = 64/122 (52%), Gaps = 12/122 (9%) Query: 10 LMLISPLSWAGTMTFQFRNPNFGGNPNNGAFLLNSAQAQNSYKDPSYND-DFGIETPSAL 68 L+ + L+ A + + NP+FGG+ NG+FLLN AQAQN + ND DF + Sbjct: 11 LISFAALTQATELVYTPVNPSFGGSALNGSFLLNKAQAQN--DNARTNDKDF-------V 61 Query: 69 DNFTQAIQSQILGGLLSNINTGK--PGRMVTNDYIVDIANRDGQLQLNVTDRKTGQTSTI 126 F ++++ I+ + + G+ G T D+ V+++ + L +T+ +TG + I Sbjct: 62 TRFQESLERNIINSITRGVANGEITDGVYDTGDFRVEVSPTANGVMLTITNLETGDVTVI 121 Query: 127 QV 128 ++ Sbjct: 122 EM 123 >UniRef50_C2G2I9 Curli production assembly/transport component CsgF n=2 Tax=Sphingobacterium spiritivorum RepID=C2G2I9_9SPHI Length = 141 Score = 43.5 bits (101), Expect = 0.002, Method: Compositional matrix adjust. Identities = 35/118 (29%), Positives = 56/118 (47%), Gaps = 10/118 (8%) Query: 18 WAGTMTFQFRNPNFGGNPNNGAFLLNSAQAQNSYKDPSYNDDFGIETPSALDNFTQAIQS 77 +A ++ NP FGG+ N +LL+SA AQN + D + +A+ +FT + Sbjct: 23 YAQQFVYKPVNPAFGGDTFNYQWLLSSANAQNQFDDSKQQS---YKPATAIGSFTDNLNR 79 Query: 78 QILGGLLSNI---NTG----KPGRMVTNDYIVDIANRDGQLQLNVTDRKTGQTSTIQV 128 QIL + ++ TG KPG + IA G L +N+ D TG+ + I + Sbjct: 80 QILNKISRDLFGDETGEKPMKPGVYSLGTMNITIAEYYGGLNINIIDINTGEQTVINI 137 >UniRef50_B4RW66 Putative curli production assembly/transport component n=5 Tax=Alteromonadales RepID=B4RW66_ALTMD Length = 151 Score = 43.5 bits (101), Expect = 0.002, Method: Compositional matrix adjust. Identities = 33/125 (26%), Positives = 60/125 (48%), Gaps = 13/125 (10%) Query: 17 SWAGTMTFQFRNPNFGGNPNNGAFLLNSAQAQNSYKDP----SYNDDFGIETPSALDNFT 72 ++A + ++ NP+FGGNP NG+FLL+ A +QN++ P SY++ A N Sbjct: 23 AFATELVYEPINPSFGGNPLNGSFLLSKANSQNAHSAPLSERSYDERLQESLERAYINRI 82 Query: 73 QAIQSQILGGLLSNINTGKP--------GRMVTNDYIVD-IANRDGQLQLNVTDRKTGQT 123 + I G G P V+ D+ V+ I + + +N+T+ TG+ Sbjct: 83 VREITDIAFGEQEYDEDGNPIDSIFNQDSIFVSGDFQVELITSNPDSIVVNITNLLTGEV 142 Query: 124 STIQV 128 + +++ Sbjct: 143 TVVEI 147 >UniRef50_A5PB14 Putative uncharacterized protein n=3 Tax=Sphingomonadales RepID=A5PB14_9SPHN Length = 141 Score = 40.0 bits (92), Expect = 0.021, Method: Compositional matrix adjust. Identities = 25/60 (41%), Positives = 32/60 (53%), Gaps = 5/60 (8%) Query: 28 NPNFGGNPNNGAFLLNSAQAQNSYKDPSYNDDFGIETPSALDNFTQAIQSQILGGLLSNI 87 +P FGGNP N +L A AQN KDP+ T S D F + +QS++L L S I Sbjct: 32 SPTFGGNPFNSNHILGIANAQNKTKDPN-----AASTNSQADIFARQLQSRLLSALSSQI 86 >UniRef50_C6X5S2 Curli production assembly/transport component CsgF n=1 Tax=Flavobacteriaceae bacterium 3519-10 RepID=C6X5S2_FLAB3 Length = 136 Score = 40.0 bits (92), Expect = 0.024, Method: Compositional matrix adjust. Identities = 38/137 (27%), Positives = 67/137 (48%), Gaps = 12/137 (8%) Query: 3 VKHAVVLLMLISPLSWAGTMTFQFR--NPNFGGNPNNGAFLLNSAQAQNSYKDPSYNDDF 60 +K+ ++++LI+P +A + F ++ NP FGG+ N +LL+SA AQN + + Sbjct: 1 MKNLGIIVLLITPF-YAASQQFVYKPINPAFGGDTFNYQWLLSSASAQNQFDNNLRGSL- 58 Query: 61 GIETPSALDNFTQAIQSQILGGLLSNI-------NTGKPGRMVTNDYIVDIANRDGQLQL 113 S+LD+FT + QIL L + +PG V ++I L + Sbjct: 59 -GSGGSSLDSFTDNLNRQILSELSRKLFGDQFGDGDLQPGTYVFGSIYLEILQTGQGLLI 117 Query: 114 NVTDRKTGQTSTIQVSG 130 ++ + TG+ S I + G Sbjct: 118 SILNTDTGEQSEILIPG 134 Searching..................................................done Results from round 2 Score E Sequences producing significant alignments: (bits) Value Sequences used in model and found again: UniRef50_P0AEA0 Curli production assembly/transport component cs... 174 6e-43 UniRef50_A5KYB2 Curli production assembly/transport component Cs... 149 3e-35 UniRef50_A3QDP7 Curli production assembly/transport component Cs... 147 1e-34 UniRef50_Q5QXH1 Curli production assembly/transport component, C... 141 5e-33 UniRef50_A4VNF3 Curli fiber protein CsgF, putative n=10 Tax=Gamm... 135 3e-31 UniRef50_A3XJK9 Curli production assembly/transport component, C... 129 3e-29 UniRef50_D2QT92 Curli production assembly/transport component Cs... 126 3e-28 UniRef50_A6EMN1 Curli production assembly/transport component Cs... 121 7e-27 UniRef50_Q12QL8 Curli production assembly/transport component Cs... 114 8e-25 UniRef50_Q392B8 Putative uncharacterized protein n=1 Tax=Burkhol... 105 4e-22 Sequences not found previously or not previously below threshold: UniRef50_C6X5S2 Curli production assembly/transport component Cs... 100 1e-20 UniRef50_C2G2I9 Curli production assembly/transport component Cs... 100 2e-20 UniRef50_B4RW66 Putative curli production assembly/transport com... 92 5e-18 UniRef50_Q1YMP0 Putative uncharacterized protein n=2 Tax=Alphapr... 83 3e-15 UniRef50_A5PB14 Putative uncharacterized protein n=3 Tax=Sphingo... 79 3e-14 UniRef50_B0UH01 Putative uncharacterized protein n=6 Tax=Rhizobi... 57 2e-07 UniRef50_B3QD05 Putative uncharacterized protein n=3 Tax=Rhodops... 44 0.002 >UniRef50_P0AEA0 Curli production assembly/transport component csgF n=93 Tax=Enterobacteriaceae RepID=CSGF_ECO57 Length = 138 Score = 174 bits (442), Expect = 6e-43, Method: Composition-based stats. Identities = 138/138 (100%), Positives = 138/138 (100%) Query: 1 MRVKHAVVLLMLISPLSWAGTMTFQFRNPNFGGNPNNGAFLLNSAQAQNSYKDPSYNDDF 60 MRVKHAVVLLMLISPLSWAGTMTFQFRNPNFGGNPNNGAFLLNSAQAQNSYKDPSYNDDF Sbjct: 1 MRVKHAVVLLMLISPLSWAGTMTFQFRNPNFGGNPNNGAFLLNSAQAQNSYKDPSYNDDF 60 Query: 61 GIETPSALDNFTQAIQSQILGGLLSNINTGKPGRMVTNDYIVDIANRDGQLQLNVTDRKT 120 GIETPSALDNFTQAIQSQILGGLLSNINTGKPGRMVTNDYIVDIANRDGQLQLNVTDRKT Sbjct: 61 GIETPSALDNFTQAIQSQILGGLLSNINTGKPGRMVTNDYIVDIANRDGQLQLNVTDRKT 120 Query: 121 GQTSTIQVSGLQNNSTDF 138 GQTSTIQVSGLQNNSTDF Sbjct: 121 GQTSTIQVSGLQNNSTDF 138 >UniRef50_A5KYB2 Curli production assembly/transport component CsgF n=4 Tax=Vibrionales RepID=A5KYB2_9GAMM Length = 136 Score = 149 bits (375), Expect = 3e-35, Method: Composition-based stats. Identities = 46/129 (35%), Positives = 77/129 (59%), Gaps = 1/129 (0%) Query: 4 KHAVVLLMLISPLSWAGTMTFQFRNPNFGGNPNNGAFLLNSAQAQNSYKDPSYNDDFGIE 63 + A ++ +L SP +A + + NP+FGGN N + L N A A N Y+DPS D F E Sbjct: 6 RKAALIALLASPSIYASELVYTPVNPSFGGNALNSSHLFNHANAINDYEDPSARDIFD-E 64 Query: 64 TPSALDNFTQAIQSQILGGLLSNINTGKPGRMVTNDYIVDIANRDGQLQLNVTDRKTGQT 123 SALD +++S+++ LL+++ G G++ T D+ ++I + G L + + D+ TG++ Sbjct: 65 QESALDRLASSLESRLISQLLADVGNGNTGQLETEDFFLNIVDDSGTLLVQIVDKATGES 124 Query: 124 STIQVSGLQ 132 + I VSGL Sbjct: 125 TEISVSGLN 133 >UniRef50_A3QDP7 Curli production assembly/transport component CsgF n=4 Tax=Shewanella RepID=A3QDP7_SHELP Length = 139 Score = 147 bits (371), Expect = 1e-34, Method: Composition-based stats. Identities = 46/131 (35%), Positives = 77/131 (58%), Gaps = 5/131 (3%) Query: 6 AVVLLMLISPLSWAGTMTFQFRNPNFGGNPNNGAFLLNSAQAQNSYKDPSYNDDFGIETP 65 A++L S S A + + NPNFGG+ NG++LL +A AQN +K S G P Sbjct: 11 ALLLAATFSLSSQATQLIYTPVNPNFGGSYLNGSYLLANASAQNDHKGGS-----GYTPP 65 Query: 66 SALDNFTQAIQSQILGGLLSNINTGKPGRMVTNDYIVDIANRDGQLQLNVTDRKTGQTST 125 SAL+ +++S+++ L ++ G G + T D+ + + N DG L +++TD TG+T+ Sbjct: 66 SALERMASSLESRLMSQLFNDAANGGEGYLKTTDFEIQVVNEDGMLLVHITDLLTGETTV 125 Query: 126 IQVSGLQNNST 136 I+V GL ++S+ Sbjct: 126 IEVGGLSDSSS 136 >UniRef50_Q5QXH1 Curli production assembly/transport component, CsgF n=1 Tax=Idiomarina loihiensis RepID=Q5QXH1_IDILO Length = 127 Score = 141 bits (356), Expect = 5e-33, Method: Composition-based stats. Identities = 37/131 (28%), Positives = 67/131 (51%), Gaps = 10/131 (7%) Query: 1 MRVKHAVVLLMLISPLSWAGTMTFQFRNPNFGGNPNNGAFLLNSAQAQNSYKDPSYNDDF 60 M+++ + + + + A + +Q NP+FGGNP NG FLL AQ QN++ P + F Sbjct: 1 MKLRWLFIGFLFTAVNAAATEIVYQPINPSFGGNPLNGNFLLQKAQTQNAHSAPDSDRSF 60 Query: 61 GIETPSALDNFTQAIQSQILGGLLSNINTGK--PGRMVTNDYIVDIANR-DGQLQLNVTD 117 +D F +A++ I+ L I G+ G T +Y V++ + DG + + +T Sbjct: 61 -------VDRFREALERNIINSLTRRIADGEIVEGLYDTGEYTVEVVGQPDGSVIVYITH 113 Query: 118 RKTGQTSTIQV 128 +G+ + I + Sbjct: 114 NGSGEQTVITM 124 >UniRef50_A4VNF3 Curli fiber protein CsgF, putative n=10 Tax=Gammaproteobacteria RepID=A4VNF3_PSEU5 Length = 138 Score = 135 bits (340), Expect = 3e-31, Method: Composition-based stats. Identities = 51/112 (45%), Positives = 72/112 (64%), Gaps = 2/112 (1%) Query: 20 GTMTFQFRNPNFGGNPNNGAFLLNSAQAQNSYKDPSYNDDFG-IETPSALDNFTQAIQSQ 78 + + NP+FGG+P NGA+LL +AQAQN+ KDP D SALD FT ++S+ Sbjct: 25 TELVYTPVNPSFGGSPLNGAWLLGNAQAQNNKKDPDALDRSSLFGNQSALDRFTSQLESR 84 Query: 79 ILGGLLSNINTGKPGRMVTNDYIVDIANRD-GQLQLNVTDRKTGQTSTIQVS 129 +LG LLS +++GK G + T+D+IV + N D G L + +TDR TG+ S I V Sbjct: 85 LLGDLLSGVSSGKTGTVTTDDFIVRVYNGDAGMLIVEITDRLTGEMSEIIVG 136 >UniRef50_A3XJK9 Curli production assembly/transport component, CsgF n=3 Tax=Flavobacteriaceae RepID=A3XJK9_9FLAO Length = 133 Score = 129 bits (324), Expect = 3e-29, Method: Composition-based stats. Identities = 43/133 (32%), Positives = 70/133 (52%), Gaps = 10/133 (7%) Query: 3 VKHAVVLLMLI-SPLSWAGTMTFQFRNPNFGGNPNNGAFLLNSAQAQNSYKDPSYNDDFG 61 +K+ + LL LI S ++A +T+Q +NPNFGG N +LL+SA AQNS+KDP N Sbjct: 1 MKYILFLLFLIASSTTFAQQLTYQPKNPNFGGETFNYQWLLSSANAQNSFKDPEEN---A 57 Query: 62 IETPSALDNFTQAIQSQILGGLLSNINTG------KPGRMVTNDYIVDIANRDGQLQLNV 115 + S L++F + Q+L + + T + G ++I + + L +N+ Sbjct: 58 AKDQSELESFASNLNRQLLSQISRGLFTAQLGEGLEAGTYNFGSLALEIYDSNEGLVINI 117 Query: 116 TDRKTGQTSTIQV 128 D TG+ + I V Sbjct: 118 LDTNTGEQTQIIV 130 >UniRef50_D2QT92 Curli production assembly/transport component CsgF n=2 Tax=Spirosoma linguale DSM 74 RepID=D2QT92_9SPHI Length = 140 Score = 126 bits (315), Expect = 3e-28, Method: Composition-based stats. Identities = 41/138 (29%), Positives = 68/138 (49%), Gaps = 12/138 (8%) Query: 1 MRVKHAVVLLMLISPLSWAGTM--TFQFRNPNFGGNPNNGAFLLNSAQAQNSYKDPSY-- 56 M+ + L ++ + + NPNFGGN N +++L+SAQAQ+ DP+ Sbjct: 1 MKTR-LFTFLFVVCLAGYCSAQAFVYHPNNPNFGGNTFNYSWMLSSAQAQDRTTDPTATV 59 Query: 57 NDDFGIETPSALDNFTQAIQSQILGGLLSNI-------NTGKPGRMVTNDYIVDIANRDG 109 G +L++F Q++QSQ+L + S++ T KPG +Y VDI+N Sbjct: 60 RTANGTTQTGSLNSFAQSLQSQLLSRITSSLVGSQFGEGTLKPGTYKFGEYQVDISNGAS 119 Query: 110 QLQLNVTDRKTGQTSTIQ 127 + + + D K G+TS Sbjct: 120 GVVVRIVDGKGGETSITI 137 >UniRef50_A6EMN1 Curli production assembly/transport component CsgF n=2 Tax=Bacteroidetes RepID=A6EMN1_9BACT Length = 133 Score = 121 bits (303), Expect = 7e-27, Method: Composition-based stats. Identities = 35/135 (25%), Positives = 62/135 (45%), Gaps = 13/135 (9%) Query: 3 VKHAVVLLMLISPLSW--AGTMTFQFRNPNFGGNPNNGAFLLNSAQAQNSYKDPSYNDDF 60 + ++ L+ + +S A + ++ NP FGG N +LL+SA AQNS +DPS +D Sbjct: 1 MYKSLFFLLFVGLVSSVSAQQLVYKPVNPAFGGETFNYQWLLSSASAQNSIEDPSVRED- 59 Query: 61 GIETPSALDNFTQAIQSQILGGLLS-------NINTGKPGRMVTNDYIVDIANRDGQLQL 113 +L+ F +++ SQ+L L +PG ++I L + Sbjct: 60 ---EIGSLEQFNESLNSQLLAQLSRTAFADQFGTGEFEPGTFSFGSLELEIVESLNGLVI 116 Query: 114 NVTDRKTGQTSTIQV 128 N+ D T + + I + Sbjct: 117 NILDTTTAEQTQIII 131 >UniRef50_Q12QL8 Curli production assembly/transport component CsgF, putative n=14 Tax=Shewanella RepID=Q12QL8_SHEDO Length = 135 Score = 114 bits (285), Expect = 8e-25, Method: Composition-based stats. Identities = 32/122 (26%), Positives = 61/122 (50%), Gaps = 10/122 (8%) Query: 9 LLMLISPLSWAGTMTFQFRNPNFGGNPNNGAFLLNSAQAQNSYKDPSYNDDFGIETPSAL 68 L+ + L+ A + + NP+FGG+ NG+FLLN AQAQN D+ + Sbjct: 10 GLISFAALTQATELVYTPVNPSFGGSALNGSFLLNKAQAQN--------DNARTNDKDFV 61 Query: 69 DNFTQAIQSQILGGLLSNINTGK--PGRMVTNDYIVDIANRDGQLQLNVTDRKTGQTSTI 126 F ++++ I+ + + G+ G T D+ V+++ + L +T+ +TG + I Sbjct: 62 TRFQESLERNIINSITRGVANGEITDGVYDTGDFRVEVSPTANGVMLTITNLETGDVTVI 121 Query: 127 QV 128 ++ Sbjct: 122 EM 123 >UniRef50_Q392B8 Putative uncharacterized protein n=1 Tax=Burkholderia sp. 383 RepID=Q392B8_BURS3 Length = 145 Score = 105 bits (262), Expect = 4e-22, Method: Composition-based stats. Identities = 54/133 (40%), Positives = 78/133 (58%), Gaps = 9/133 (6%) Query: 6 AVVLLMLISPLSWAGTMTFQFRNPNFGGNPNNGAFLLNSAQAQNSYKDPSYNDDFGIETP 65 A+V + +SP +W+ T+ +Q NPNFGGNP NG LLN A AQN + DPS + + Sbjct: 10 ALVAMTAVSPYAWSSTLVYQPNNPNFGGNPANGPNLLNEANAQNKHTDPSLSAYSSPTST 69 Query: 66 SALDNFT----QAIQSQILGGLLSNI----NTGKPGRMVTNDYIVDI-ANRDGQLQLNVT 116 S+LD F QAI S++ G + ++I KPG + T ++ + + A +G LQ+ T Sbjct: 70 SSLDQFNAQLQQAILSRVAGSVTNSIVGTNGELKPGTISTGNFTIVVSAAGNGSLQVTTT 129 Query: 117 DRKTGQTSTIQVS 129 D+ TG TST VS Sbjct: 130 DKTTGATSTFIVS 142 >UniRef50_C6X5S2 Curli production assembly/transport component CsgF n=1 Tax=Flavobacteriaceae bacterium 3519-10 RepID=C6X5S2_FLAB3 Length = 136 Score = 100 bits (249), Expect = 1e-20, Method: Composition-based stats. Identities = 36/136 (26%), Positives = 65/136 (47%), Gaps = 10/136 (7%) Query: 3 VKHAVVLLMLISP-LSWAGTMTFQFRNPNFGGNPNNGAFLLNSAQAQNSYKDPSYNDDFG 61 +K+ ++++LI+P + + ++ NP FGG+ N +LL+SA AQN + + Sbjct: 1 MKNLGIIVLLITPFYAASQQFVYKPINPAFGGDTFNYQWLLSSASAQNQFDN--NLRGSL 58 Query: 62 IETPSALDNFTQAIQSQILGGLLSNI-------NTGKPGRMVTNDYIVDIANRDGQLQLN 114 S+LD+FT + QIL L + +PG V ++I L ++ Sbjct: 59 GSGGSSLDSFTDNLNRQILSELSRKLFGDQFGDGDLQPGTYVFGSIYLEILQTGQGLLIS 118 Query: 115 VTDRKTGQTSTIQVSG 130 + + TG+ S I + G Sbjct: 119 ILNTDTGEQSEILIPG 134 >UniRef50_C2G2I9 Curli production assembly/transport component CsgF n=2 Tax=Sphingobacterium spiritivorum RepID=C2G2I9_9SPHI Length = 141 Score = 100 bits (248), Expect = 2e-20, Method: Composition-based stats. Identities = 33/122 (27%), Positives = 56/122 (45%), Gaps = 10/122 (8%) Query: 18 WAGTMTFQFRNPNFGGNPNNGAFLLNSAQAQNSYKDPSYNDDFGIETPSALDNFTQAIQS 77 +A ++ NP FGG+ N +LL+SA AQN + D + + +A+ +FT + Sbjct: 23 YAQQFVYKPVNPAFGGDTFNYQWLLSSANAQNQFDD---SKQQSYKPATAIGSFTDNLNR 79 Query: 78 QILGGLLSNINT-------GKPGRMVTNDYIVDIANRDGQLQLNVTDRKTGQTSTIQVSG 130 QIL + ++ KPG + IA G L +N+ D TG+ + I + Sbjct: 80 QILNKISRDLFGDETGEKPMKPGVYSLGTMNITIAEYYGGLNINIIDINTGEQTVINIPN 139 Query: 131 LQ 132 + Sbjct: 140 IS 141 >UniRef50_B4RW66 Putative curli production assembly/transport component n=5 Tax=Alteromonadales RepID=B4RW66_ALTMD Length = 151 Score = 92.1 bits (227), Expect = 5e-18, Method: Composition-based stats. Identities = 27/143 (18%), Positives = 63/143 (44%), Gaps = 27/143 (18%) Query: 6 AVVLLMLISPLSWAGTMTFQFRNPNFGGNPNNGAFLLNSAQAQNSYKDPSYNDDFGIETP 65 + L + ++A + ++ NP+FGGNP NG+FLL+ A +QN++ P + Sbjct: 12 LITLGAVFQFSAFATELVYEPINPSFGGNPLNGSFLLSKANSQNAHSAPLSERSYD---- 67 Query: 66 SALDNFTQAIQSQILGGLLSNINTGKPGR-------------------MVTNDYIVDIAN 106 + ++++ + ++ I G V+ D+ V++ Sbjct: 68 ---ERLQESLERAYINRIVREITDIAFGEQEYDEDGNPIDSIFNQDSIFVSGDFQVELIT 124 Query: 107 RD-GQLQLNVTDRKTGQTSTIQV 128 + + +N+T+ TG+ + +++ Sbjct: 125 SNPDSIVVNITNLLTGEVTVVEI 147 >UniRef50_Q1YMP0 Putative uncharacterized protein n=2 Tax=Alphaproteobacteria RepID=Q1YMP0_MOBAS Length = 174 Score = 82.9 bits (203), Expect = 3e-15, Method: Composition-based stats. Identities = 33/148 (22%), Positives = 54/148 (36%), Gaps = 17/148 (11%) Query: 5 HAVVLLMLISPLSWAGTMTFQFRNPNFGGNPNNGAFLLNSAQAQNSYKDPSYNDDFGIET 64 AV++ S + AG + + NP+FGGNP N + L A AQ +D + Sbjct: 25 WAVLIFAAASHTATAGDLVYTPVNPSFGGNPLNSSHLFAIANAQRDATASDADDGLSGGS 84 Query: 65 PSAL----------DNFTQAIQSQILGGLLSNINTG-------KPGRMVTNDYIVDIANR 107 D F +Q ++L L S + G + D V + Sbjct: 85 SIGGGGGGAGNSQADLFVSQLQGRLLSALASQVTEAIFGENPQDSGTVTFGDTSVTFSRT 144 Query: 108 DGQLQLNVTDRKTGQTSTIQVSGLQNNS 135 ++L + + G + I V L +S Sbjct: 145 LDSIRLTIVNGADGTVTDIVVPQLVTSS 172 >UniRef50_A5PB14 Putative uncharacterized protein n=3 Tax=Sphingomonadales RepID=A5PB14_9SPHN Length = 141 Score = 79.4 bits (194), Expect = 3e-14, Method: Composition-based stats. Identities = 34/133 (25%), Positives = 53/133 (39%), Gaps = 14/133 (10%) Query: 5 HAVVLLMLISPL--SWAGTMTFQFRNPNFGGNPNNGAFLLNSAQAQNSYKDPSYNDDFGI 62 A++ + A + + +P FGGNP N +L A AQN KDP Sbjct: 7 WAIIGAGAACVAVPANAQDIVHEPISPTFGGNPFNSNHILGIANAQNKTKDP-----NAA 61 Query: 63 ETPSALDNFTQAIQSQILGGLLSNINTG-------KPGRMVTNDYIVDIANRDGQLQLNV 115 T S D F + +QS++L L S I + G + + + L + Sbjct: 62 STNSQADIFARQLQSRLLSALSSQIVDAIFGDDPQESGVISFGGQTISFFRSLDSVTLII 121 Query: 116 TDRKTGQTSTIQV 128 T+ TG+ + I V Sbjct: 122 TNDDTGEETRIVV 134 >UniRef50_B0UH01 Putative uncharacterized protein n=6 Tax=Rhizobiales RepID=B0UH01_METS4 Length = 152 Score = 57.1 bits (136), Expect = 2e-07, Method: Composition-based stats. Identities = 25/130 (19%), Positives = 49/130 (37%), Gaps = 16/130 (12%) Query: 6 AVVLLMLISPLSWAGTMTFQFRNPNFGGNPNNGAFLLNSAQAQNSYKDPSYNDDFGIETP 65 A ++L+ + + A ++ +Q NP FGG NG++L A AQN + + Sbjct: 12 AGLVLLHAARPAEASSLVYQPVNPAFGGPQLNGSWLQAEANAQNIPQAAAQRRQQITGAA 71 Query: 66 SAL---------DNFTQAIQSQILGGLLSNIN------TGKP-GRMVTNDYIVDIANRDG 109 ++ F Q +Q+Q+ L + I +P G+ + Sbjct: 72 ASAARGGILTPGQAFAQQLQTQLYSSLANQITRAIFGENAQPNGQFSFQGTTISYTRVGS 131 Query: 110 QLQLNVTDRK 119 + + + D Sbjct: 132 NVSITINDGS 141 >UniRef50_B3QD05 Putative uncharacterized protein n=3 Tax=Rhodopseudomonas palustris RepID=B3QD05_RHOPT Length = 94 Score = 43.6 bits (101), Expect = 0.002, Method: Composition-based stats. Identities = 16/38 (42%), Positives = 26/38 (68%) Query: 11 MLISPLSWAGTMTFQFRNPNFGGNPNNGAFLLNSAQAQ 48 ++++ S+A ++ +P F GNP NG FLL++AQAQ Sbjct: 13 VILTAPSFATEQIYRPVSPTFNGNPLNGNFLLSTAQAQ 50 Searching..................................................done Results from round 3 Score E Sequences producing significant alignments: (bits) Value Sequences used in model and found again: UniRef50_P0AEA0 Curli production assembly/transport component cs... 154 7e-37 UniRef50_Q5QXH1 Curli production assembly/transport component, C... 139 2e-32 UniRef50_A5KYB2 Curli production assembly/transport component Cs... 137 1e-31 UniRef50_A3QDP7 Curli production assembly/transport component Cs... 134 9e-31 UniRef50_A4VNF3 Curli fiber protein CsgF, putative n=10 Tax=Gamm... 129 2e-29 UniRef50_A3XJK9 Curli production assembly/transport component, C... 126 2e-28 UniRef50_C2G2I9 Curli production assembly/transport component Cs... 123 2e-27 UniRef50_A6EMN1 Curli production assembly/transport component Cs... 121 7e-27 UniRef50_D2QT92 Curli production assembly/transport component Cs... 120 1e-26 UniRef50_Q1YMP0 Putative uncharacterized protein n=2 Tax=Alphapr... 120 2e-26 UniRef50_C6X5S2 Curli production assembly/transport component Cs... 119 2e-26 UniRef50_A5PB14 Putative uncharacterized protein n=3 Tax=Sphingo... 116 2e-25 UniRef50_Q12QL8 Curli production assembly/transport component Cs... 112 3e-24 UniRef50_B4RW66 Putative curli production assembly/transport com... 111 6e-24 UniRef50_Q392B8 Putative uncharacterized protein n=1 Tax=Burkhol... 97 2e-19 UniRef50_B0UH01 Putative uncharacterized protein n=6 Tax=Rhizobi... 91 1e-17 UniRef50_B3QD05 Putative uncharacterized protein n=3 Tax=Rhodops... 47 1e-04 Sequences not found previously or not previously below threshold: CONVERGED! >UniRef50_P0AEA0 Curli production assembly/transport component csgF n=93 Tax=Enterobacteriaceae RepID=CSGF_ECO57 Length = 138 Score = 154 bits (389), Expect = 7e-37, Method: Composition-based stats. Identities = 138/138 (100%), Positives = 138/138 (100%) Query: 1 MRVKHAVVLLMLISPLSWAGTMTFQFRNPNFGGNPNNGAFLLNSAQAQNSYKDPSYNDDF 60 MRVKHAVVLLMLISPLSWAGTMTFQFRNPNFGGNPNNGAFLLNSAQAQNSYKDPSYNDDF Sbjct: 1 MRVKHAVVLLMLISPLSWAGTMTFQFRNPNFGGNPNNGAFLLNSAQAQNSYKDPSYNDDF 60 Query: 61 GIETPSALDNFTQAIQSQILGGLLSNINTGKPGRMVTNDYIVDIANRDGQLQLNVTDRKT 120 GIETPSALDNFTQAIQSQILGGLLSNINTGKPGRMVTNDYIVDIANRDGQLQLNVTDRKT Sbjct: 61 GIETPSALDNFTQAIQSQILGGLLSNINTGKPGRMVTNDYIVDIANRDGQLQLNVTDRKT 120 Query: 121 GQTSTIQVSGLQNNSTDF 138 GQTSTIQVSGLQNNSTDF Sbjct: 121 GQTSTIQVSGLQNNSTDF 138 >UniRef50_Q5QXH1 Curli production assembly/transport component, CsgF n=1 Tax=Idiomarina loihiensis RepID=Q5QXH1_IDILO Length = 127 Score = 139 bits (351), Expect = 2e-32, Method: Composition-based stats. Identities = 37/134 (27%), Positives = 68/134 (50%), Gaps = 10/134 (7%) Query: 1 MRVKHAVVLLMLISPLSWAGTMTFQFRNPNFGGNPNNGAFLLNSAQAQNSYKDPSYNDDF 60 M+++ + + + + A + +Q NP+FGGNP NG FLL AQ QN++ P + F Sbjct: 1 MKLRWLFIGFLFTAVNAAATEIVYQPINPSFGGNPLNGNFLLQKAQTQNAHSAPDSDRSF 60 Query: 61 GIETPSALDNFTQAIQSQILGGLLSNINTGK--PGRMVTNDYIVDIANR-DGQLQLNVTD 117 +D F +A++ I+ L I G+ G T +Y V++ + DG + + +T Sbjct: 61 -------VDRFREALERNIINSLTRRIADGEIVEGLYDTGEYTVEVVGQPDGSVIVYITH 113 Query: 118 RKTGQTSTIQVSGL 131 +G+ + I + + Sbjct: 114 NGSGEQTVITMPSI 127 >UniRef50_A5KYB2 Curli production assembly/transport component CsgF n=4 Tax=Vibrionales RepID=A5KYB2_9GAMM Length = 136 Score = 137 bits (345), Expect = 1e-31, Method: Composition-based stats. Identities = 46/129 (35%), Positives = 77/129 (59%), Gaps = 1/129 (0%) Query: 4 KHAVVLLMLISPLSWAGTMTFQFRNPNFGGNPNNGAFLLNSAQAQNSYKDPSYNDDFGIE 63 + A ++ +L SP +A + + NP+FGGN N + L N A A N Y+DPS D F E Sbjct: 6 RKAALIALLASPSIYASELVYTPVNPSFGGNALNSSHLFNHANAINDYEDPSARDIFD-E 64 Query: 64 TPSALDNFTQAIQSQILGGLLSNINTGKPGRMVTNDYIVDIANRDGQLQLNVTDRKTGQT 123 SALD +++S+++ LL+++ G G++ T D+ ++I + G L + + D+ TG++ Sbjct: 65 QESALDRLASSLESRLISQLLADVGNGNTGQLETEDFFLNIVDDSGTLLVQIVDKATGES 124 Query: 124 STIQVSGLQ 132 + I VSGL Sbjct: 125 TEISVSGLN 133 >UniRef50_A3QDP7 Curli production assembly/transport component CsgF n=4 Tax=Shewanella RepID=A3QDP7_SHELP Length = 139 Score = 134 bits (337), Expect = 9e-31, Method: Composition-based stats. Identities = 46/131 (35%), Positives = 77/131 (58%), Gaps = 5/131 (3%) Query: 6 AVVLLMLISPLSWAGTMTFQFRNPNFGGNPNNGAFLLNSAQAQNSYKDPSYNDDFGIETP 65 A++L S S A + + NPNFGG+ NG++LL +A AQN +K S G P Sbjct: 11 ALLLAATFSLSSQATQLIYTPVNPNFGGSYLNGSYLLANASAQNDHKGGS-----GYTPP 65 Query: 66 SALDNFTQAIQSQILGGLLSNINTGKPGRMVTNDYIVDIANRDGQLQLNVTDRKTGQTST 125 SAL+ +++S+++ L ++ G G + T D+ + + N DG L +++TD TG+T+ Sbjct: 66 SALERMASSLESRLMSQLFNDAANGGEGYLKTTDFEIQVVNEDGMLLVHITDLLTGETTV 125 Query: 126 IQVSGLQNNST 136 I+V GL ++S+ Sbjct: 126 IEVGGLSDSSS 136 >UniRef50_A4VNF3 Curli fiber protein CsgF, putative n=10 Tax=Gammaproteobacteria RepID=A4VNF3_PSEU5 Length = 138 Score = 129 bits (325), Expect = 2e-29, Method: Composition-based stats. Identities = 51/113 (45%), Positives = 72/113 (63%), Gaps = 2/113 (1%) Query: 20 GTMTFQFRNPNFGGNPNNGAFLLNSAQAQNSYKDPSYNDDFG-IETPSALDNFTQAIQSQ 78 + + NP+FGG+P NGA+LL +AQAQN+ KDP D SALD FT ++S+ Sbjct: 25 TELVYTPVNPSFGGSPLNGAWLLGNAQAQNNKKDPDALDRSSLFGNQSALDRFTSQLESR 84 Query: 79 ILGGLLSNINTGKPGRMVTNDYIVDIANRD-GQLQLNVTDRKTGQTSTIQVSG 130 +LG LLS +++GK G + T+D+IV + N D G L + +TDR TG+ S I V Sbjct: 85 LLGDLLSGVSSGKTGTVTTDDFIVRVYNGDAGMLIVEITDRLTGEMSEIIVGQ 137 >UniRef50_A3XJK9 Curli production assembly/transport component, CsgF n=3 Tax=Flavobacteriaceae RepID=A3XJK9_9FLAO Length = 133 Score = 126 bits (316), Expect = 2e-28, Method: Composition-based stats. Identities = 39/131 (29%), Positives = 66/131 (50%), Gaps = 9/131 (6%) Query: 6 AVVLLMLISPLSWAGTMTFQFRNPNFGGNPNNGAFLLNSAQAQNSYKDPSYNDDFGIETP 65 +L ++ S ++A +T+Q +NPNFGG N +LL+SA AQNS+KDP N + Sbjct: 5 LFLLFLIASSTTFAQQLTYQPKNPNFGGETFNYQWLLSSANAQNSFKDPEEN---AAKDQ 61 Query: 66 SALDNFTQAIQSQILGGLLSNINTG------KPGRMVTNDYIVDIANRDGQLQLNVTDRK 119 S L++F + Q+L + + T + G ++I + + L +N+ D Sbjct: 62 SELESFASNLNRQLLSQISRGLFTAQLGEGLEAGTYNFGSLALEIYDSNEGLVINILDTN 121 Query: 120 TGQTSTIQVSG 130 TG+ + I V Sbjct: 122 TGEQTQIIVPN 132 >UniRef50_C2G2I9 Curli production assembly/transport component CsgF n=2 Tax=Sphingobacterium spiritivorum RepID=C2G2I9_9SPHI Length = 141 Score = 123 bits (309), Expect = 2e-27, Method: Composition-based stats. Identities = 33/122 (27%), Positives = 56/122 (45%), Gaps = 10/122 (8%) Query: 18 WAGTMTFQFRNPNFGGNPNNGAFLLNSAQAQNSYKDPSYNDDFGIETPSALDNFTQAIQS 77 +A ++ NP FGG+ N +LL+SA AQN + D + + +A+ +FT + Sbjct: 23 YAQQFVYKPVNPAFGGDTFNYQWLLSSANAQNQFDD---SKQQSYKPATAIGSFTDNLNR 79 Query: 78 QILGGLLSNINT-------GKPGRMVTNDYIVDIANRDGQLQLNVTDRKTGQTSTIQVSG 130 QIL + ++ KPG + IA G L +N+ D TG+ + I + Sbjct: 80 QILNKISRDLFGDETGEKPMKPGVYSLGTMNITIAEYYGGLNINIIDINTGEQTVINIPN 139 Query: 131 LQ 132 + Sbjct: 140 IS 141 >UniRef50_A6EMN1 Curli production assembly/transport component CsgF n=2 Tax=Bacteroidetes RepID=A6EMN1_9BACT Length = 133 Score = 121 bits (303), Expect = 7e-27, Method: Composition-based stats. Identities = 35/137 (25%), Positives = 62/137 (45%), Gaps = 13/137 (9%) Query: 3 VKHAVVLLMLISPLSW--AGTMTFQFRNPNFGGNPNNGAFLLNSAQAQNSYKDPSYNDDF 60 + ++ L+ + +S A + ++ NP FGG N +LL+SA AQNS +DPS +D Sbjct: 1 MYKSLFFLLFVGLVSSVSAQQLVYKPVNPAFGGETFNYQWLLSSASAQNSIEDPSVRED- 59 Query: 61 GIETPSALDNFTQAIQSQILGGLLS-------NINTGKPGRMVTNDYIVDIANRDGQLQL 113 +L+ F +++ SQ+L L +PG ++I L + Sbjct: 60 ---EIGSLEQFNESLNSQLLAQLSRTAFADQFGTGEFEPGTFSFGSLELEIVESLNGLVI 116 Query: 114 NVTDRKTGQTSTIQVSG 130 N+ D T + + I + Sbjct: 117 NILDTTTAEQTQIIIPN 133 >UniRef50_D2QT92 Curli production assembly/transport component CsgF n=2 Tax=Spirosoma linguale DSM 74 RepID=D2QT92_9SPHI Length = 140 Score = 120 bits (301), Expect = 1e-26, Method: Composition-based stats. Identities = 42/138 (30%), Positives = 69/138 (50%), Gaps = 12/138 (8%) Query: 1 MRVKHAVVLLMLISPLSW--AGTMTFQFRNPNFGGNPNNGAFLLNSAQAQNSYKDPSY-- 56 M+ + L ++ + A + NPNFGGN N +++L+SAQAQ+ DP+ Sbjct: 1 MKTR-LFTFLFVVCLAGYCSAQAFVYHPNNPNFGGNTFNYSWMLSSAQAQDRTTDPTATV 59 Query: 57 NDDFGIETPSALDNFTQAIQSQILGGLLSNI-------NTGKPGRMVTNDYIVDIANRDG 109 G +L++F Q++QSQ+L + S++ T KPG +Y VDI+N Sbjct: 60 RTANGTTQTGSLNSFAQSLQSQLLSRITSSLVGSQFGEGTLKPGTYKFGEYQVDISNGAS 119 Query: 110 QLQLNVTDRKTGQTSTIQ 127 + + + D K G+TS Sbjct: 120 GVVVRIVDGKGGETSITI 137 >UniRef50_Q1YMP0 Putative uncharacterized protein n=2 Tax=Alphaproteobacteria RepID=Q1YMP0_MOBAS Length = 174 Score = 120 bits (300), Expect = 2e-26, Method: Composition-based stats. Identities = 35/148 (23%), Positives = 55/148 (37%), Gaps = 17/148 (11%) Query: 5 HAVVLLMLISPLSWAGTMTFQFRNPNFGGNPNNGAFLLNSAQAQNSYKDPSYND------ 58 AV++ S + AG + + NP+FGGNP N + L A AQ +D Sbjct: 25 WAVLIFAAASHTATAGDLVYTPVNPSFGGNPLNSSHLFAIANAQRDATASDADDGLSGGS 84 Query: 59 ----DFGIETPSALDNFTQAIQSQILGGLLSNINTG-------KPGRMVTNDYIVDIANR 107 G S D F +Q ++L L S + G + D V + Sbjct: 85 SIGGGGGGAGNSQADLFVSQLQGRLLSALASQVTEAIFGENPQDSGTVTFGDTSVTFSRT 144 Query: 108 DGQLQLNVTDRKTGQTSTIQVSGLQNNS 135 ++L + + G + I V L +S Sbjct: 145 LDSIRLTIVNGADGTVTDIVVPQLVTSS 172 >UniRef50_C6X5S2 Curli production assembly/transport component CsgF n=1 Tax=Flavobacteriaceae bacterium 3519-10 RepID=C6X5S2_FLAB3 Length = 136 Score = 119 bits (299), Expect = 2e-26, Method: Composition-based stats. Identities = 36/136 (26%), Positives = 65/136 (47%), Gaps = 10/136 (7%) Query: 3 VKHAVVLLMLISP-LSWAGTMTFQFRNPNFGGNPNNGAFLLNSAQAQNSYKDPSYNDDFG 61 +K+ ++++LI+P + + ++ NP FGG+ N +LL+SA AQN + + Sbjct: 1 MKNLGIIVLLITPFYAASQQFVYKPINPAFGGDTFNYQWLLSSASAQNQFDN--NLRGSL 58 Query: 62 IETPSALDNFTQAIQSQILGGLLSNI-------NTGKPGRMVTNDYIVDIANRDGQLQLN 114 S+LD+FT + QIL L + +PG V ++I L ++ Sbjct: 59 GSGGSSLDSFTDNLNRQILSELSRKLFGDQFGDGDLQPGTYVFGSIYLEILQTGQGLLIS 118 Query: 115 VTDRKTGQTSTIQVSG 130 + + TG+ S I + G Sbjct: 119 ILNTDTGEQSEILIPG 134 >UniRef50_A5PB14 Putative uncharacterized protein n=3 Tax=Sphingomonadales RepID=A5PB14_9SPHN Length = 141 Score = 116 bits (291), Expect = 2e-25, Method: Composition-based stats. Identities = 34/134 (25%), Positives = 53/134 (39%), Gaps = 14/134 (10%) Query: 5 HAVVLLMLISP--LSWAGTMTFQFRNPNFGGNPNNGAFLLNSAQAQNSYKDPSYNDDFGI 62 A++ + A + + +P FGGNP N +L A AQN KDP Sbjct: 7 WAIIGAGAACVAVPANAQDIVHEPISPTFGGNPFNSNHILGIANAQNKTKDP-----NAA 61 Query: 63 ETPSALDNFTQAIQSQILGGLLSNINTG-------KPGRMVTNDYIVDIANRDGQLQLNV 115 T S D F + +QS++L L S I + G + + + L + Sbjct: 62 STNSQADIFARQLQSRLLSALSSQIVDAIFGDDPQESGVISFGGQTISFFRSLDSVTLII 121 Query: 116 TDRKTGQTSTIQVS 129 T+ TG+ + I V Sbjct: 122 TNDDTGEETRIVVP 135 >UniRef50_Q12QL8 Curli production assembly/transport component CsgF, putative n=14 Tax=Shewanella RepID=Q12QL8_SHEDO Length = 135 Score = 112 bits (281), Expect = 3e-24, Method: Composition-based stats. Identities = 34/139 (24%), Positives = 66/139 (47%), Gaps = 11/139 (7%) Query: 1 MRVKHAVVL-LMLISPLSWAGTMTFQFRNPNFGGNPNNGAFLLNSAQAQNSYKDPSYNDD 59 M+ + + L+ + L+ A + + NP+FGG+ NG+FLLN AQAQN D+ Sbjct: 1 MKNRMLLAGGLISFAALTQATELVYTPVNPSFGGSALNGSFLLNKAQAQN--------DN 52 Query: 60 FGIETPSALDNFTQAIQSQILGGLLSNINTGK--PGRMVTNDYIVDIANRDGQLQLNVTD 117 + F ++++ I+ + + G+ G T D+ V+++ + L +T+ Sbjct: 53 ARTNDKDFVTRFQESLERNIINSITRGVANGEITDGVYDTGDFRVEVSPTANGVMLTITN 112 Query: 118 RKTGQTSTIQVSGLQNNST 136 +TG + I++ T Sbjct: 113 LETGDVTVIEMPTFPTAPT 131 >UniRef50_B4RW66 Putative curli production assembly/transport component n=5 Tax=Alteromonadales RepID=B4RW66_ALTMD Length = 151 Score = 111 bits (278), Expect = 6e-24, Method: Composition-based stats. Identities = 27/144 (18%), Positives = 63/144 (43%), Gaps = 27/144 (18%) Query: 6 AVVLLMLISPLSWAGTMTFQFRNPNFGGNPNNGAFLLNSAQAQNSYKDPSYNDDFGIETP 65 + L + ++A + ++ NP+FGGNP NG+FLL+ A +QN++ P + Sbjct: 12 LITLGAVFQFSAFATELVYEPINPSFGGNPLNGSFLLSKANSQNAHSAPLSERSYD---- 67 Query: 66 SALDNFTQAIQSQILGGLLSNINTGKPGR-------------------MVTNDYIVDIAN 106 + ++++ + ++ I G V+ D+ V++ Sbjct: 68 ---ERLQESLERAYINRIVREITDIAFGEQEYDEDGNPIDSIFNQDSIFVSGDFQVELIT 124 Query: 107 RD-GQLQLNVTDRKTGQTSTIQVS 129 + + +N+T+ TG+ + +++ Sbjct: 125 SNPDSIVVNITNLLTGEVTVVEIP 148 >UniRef50_Q392B8 Putative uncharacterized protein n=1 Tax=Burkholderia sp. 383 RepID=Q392B8_BURS3 Length = 145 Score = 96.8 bits (239), Expect = 2e-19, Method: Composition-based stats. Identities = 51/134 (38%), Positives = 74/134 (55%), Gaps = 9/134 (6%) Query: 6 AVVLLMLISPLSWAGTMTFQFRNPNFGGNPNNGAFLLNSAQAQNSYKDPSYNDDFGIETP 65 A+V + +SP +W+ T+ +Q NPNFGGNP NG LLN A AQN + DPS + + Sbjct: 10 ALVAMTAVSPYAWSSTLVYQPNNPNFGGNPANGPNLLNEANAQNKHTDPSLSAYSSPTST 69 Query: 66 SALDNFTQAIQSQILGGLLSNINT--------GKPGRMVTNDYIVDI-ANRDGQLQLNVT 116 S+LD F +Q IL + ++ KPG + T ++ + + A +G LQ+ T Sbjct: 70 SSLDQFNAQLQQAILSRVAGSVTNSIVGTNGELKPGTISTGNFTIVVSAAGNGSLQVTTT 129 Query: 117 DRKTGQTSTIQVSG 130 D+ TG TST VS Sbjct: 130 DKTTGATSTFIVSN 143 >UniRef50_B0UH01 Putative uncharacterized protein n=6 Tax=Rhizobiales RepID=B0UH01_METS4 Length = 152 Score = 90.6 bits (223), Expect = 1e-17, Method: Composition-based stats. Identities = 25/130 (19%), Positives = 48/130 (36%), Gaps = 16/130 (12%) Query: 6 AVVLLMLISPLSWAGTMTFQFRNPNFGGNPNNGAFLLNSAQAQNSYKDPSYNDDFGIETP 65 A ++L+ + + A ++ +Q NP FGG NG++L A AQN + + Sbjct: 12 AGLVLLHAARPAEASSLVYQPVNPAFGGPQLNGSWLQAEANAQNIPQAAAQRRQQITGAA 71 Query: 66 ---------SALDNFTQAIQSQILGGLLSNIN------TGKP-GRMVTNDYIVDIANRDG 109 + F Q +Q+Q+ L + I +P G+ + Sbjct: 72 ASAARGGILTPGQAFAQQLQTQLYSSLANQITRAIFGENAQPNGQFSFQGTTISYTRVGS 131 Query: 110 QLQLNVTDRK 119 + + + D Sbjct: 132 NVSITINDGS 141 >UniRef50_B3QD05 Putative uncharacterized protein n=3 Tax=Rhodopseudomonas palustris RepID=B3QD05_RHOPT Length = 94 Score = 47.5 bits (111), Expect = 1e-04, Method: Composition-based stats. Identities = 16/38 (42%), Positives = 26/38 (68%) Query: 11 MLISPLSWAGTMTFQFRNPNFGGNPNNGAFLLNSAQAQ 48 ++++ S+A ++ +P F GNP NG FLL++AQAQ Sbjct: 13 VILTAPSFATEQIYRPVSPTFNGNPLNGNFLLSTAQAQ 50 Database: uniref50.fasta Posted date: Mar 8, 2010 10:38 AM Number of letters in database: 1,040,396,356 Number of sequences in database: 3,077,464 Lambda K H 0.306 0.126 0.299 Lambda K H 0.267 0.0387 0.140 Matrix: BLOSUM62 Gap Penalties: Existence: 11, Extension: 1 Number of Hits to DB: 594,397,505 Number of Sequences: 3077464 Number of extensions: 18441955 Number of successful extensions: 64164 Number of sequences better than 1.0e-01: 20 Number of HSP's better than 0.1 without gapping: 46 Number of HSP's successfully gapped in prelim test: 9 Number of HSP's that attempted gapping in prelim test: 64057 Number of HSP's gapped (non-prelim): 55 length of query: 138 length of database: 1,040,396,356 effective HSP length: 102 effective length of query: 36 effective length of database: 726,495,028 effective search space: 26153821008 effective search space used: 26153821008 T: 11 A: 40 X1: 16 ( 7.1 bits) X2: 38 (14.6 bits) X3: 64 (24.7 bits) S1: 41 (21.1 bits) S2: 87 (38.2 bits)