BLASTP 2.2.22 [Sep-27-2009] Reference: Altschul, Stephen F., Thomas L. Madden, Alejandro A. Schaffer, Jinghui Zhang, Zheng Zhang, Webb Miller, and David J. Lipman (1997), "Gapped BLAST and PSI-BLAST: a new generation of protein database search programs", Nucleic Acids Res. 25:3389-3402. Reference for compositional score matrix adjustment: Altschul, Stephen F., John C. Wootton, E. Michael Gertz, Richa Agarwala, Aleksandr Morgulis, Alejandro A. Schaffer, and Yi-Kuo Yu (2005) "Protein database searches using compositionally adjusted substitution matrices", FEBS J. 272:5101-5109. Reference for composition-based statistics starting in round 2: Schaffer, Alejandro A., L. Aravind, Thomas L. Madden, Sergei Shavirin, John L. Spouge, Yuri I. Wolf, Eugene V. Koonin, and Stephen F. Altschul (2001), "Improving the accuracy of PSI-BLAST protein database searches with composition-based statistics and other refinements", Nucleic Acids Res. 29:2994-3005. Query= batch____ (110 letters) Database: uniref50.fasta 3,077,464 sequences; 1,040,396,356 total letters Searching..................................................done Results from round 1 Score E Sequences producing significant alignments: (bits) Value UniRef50_B7MII3 Curli assembly protein csgC n=58 Tax=Enterobacte... 215 3e-55 UniRef50_A9MH19 Curli assembly protein csgC n=35 Tax=Enterobacte... 157 7e-38 UniRef50_C9Y0V6 Putative curli production protein csgC n=1 Tax=C... 87 2e-16 >UniRef50_B7MII3 Curli assembly protein csgC n=58 Tax=Enterobacteriaceae RepID=CSGC_ECO45 Length = 110 Score = 215 bits (548), Expect = 3e-55, Method: Compositional matrix adjust. Identities = 108/110 (98%), Positives = 109/110 (99%) Query: 1 MNTLLLLAALSSQITFNTTQQGDVYTIIPEVTLTQSCLCRVQILSLREGSSGQSQTKQEK 60 MN LLLLAALSSQITFNTTQQGD+YTIIPEVTLTQSCLCRVQILSLREGSSGQSQTKQEK Sbjct: 1 MNALLLLAALSSQITFNTTQQGDMYTIIPEVTLTQSCLCRVQILSLREGSSGQSQTKQEK 60 Query: 61 TLSLPANQPIALTKLSLNISPDDRVKIVVTVSDGQSLHLSQQWPPSSEKS 110 TLSLPANQPIALTKLSLNISPDDRVKIVVTVSDGQSLHLSQQWPPSSEKS Sbjct: 61 TLSLPANQPIALTKLSLNISPDDRVKIVVTVSDGQSLHLSQQWPPSSEKS 110 >UniRef50_A9MH19 Curli assembly protein csgC n=35 Tax=Enterobacteriaceae RepID=CSGC_SALAR Length = 108 Score = 157 bits (398), Expect = 7e-38, Method: Compositional matrix adjust. Identities = 75/108 (69%), Positives = 97/108 (89%) Query: 1 MNTLLLLAALSSQITFNTTQQGDVYTIIPEVTLTQSCLCRVQILSLREGSSGQSQTKQEK 60 M+TLLLLAALS+QITF TTQQGD+YT+IP+V L++ C+C+VQILS+R G+ GQS T+Q++ Sbjct: 1 MHTLLLLAALSNQITFITTQQGDIYTVIPQVILSEPCVCQVQILSVRNGTWGQSHTQQKQ 60 Query: 61 TLSLPANQPIALTKLSLNISPDDRVKIVVTVSDGQSLHLSQQWPPSSE 108 TLSLPANQPI L++LS+NIS +D VKI++TVS+GQSLHLSQQWP S+ Sbjct: 61 TLSLPANQPIELSRLSVNISSEDSVKIILTVSNGQSLHLSQQWPSSAR 108 >UniRef50_C9Y0V6 Putative curli production protein csgC n=1 Tax=Cronobacter turicensis RepID=C9Y0V6_CROTZ Length = 109 Score = 86.7 bits (213), Expect = 2e-16, Method: Compositional matrix adjust. Identities = 42/105 (40%), Positives = 64/105 (60%) Query: 1 MNTLLLLAALSSQITFNTTQQGDVYTIIPEVTLTQSCLCRVQILSLREGSSGQSQTKQEK 60 M LL++AALS ++ F Q + + P VT+ +CLCR+++ R+GSSG T+Q K Sbjct: 1 MLDLLMIAALSGEVAFKAQPQDAGWQVTPVVTVAHACLCRIEMELRRQGSSGSVVTQQRK 60 Query: 61 TLSLPANQPIALTKLSLNISPDDRVKIVVTVSDGQSLHLSQQWPP 105 L+L N P+AL+ L+++ DD V + VTVSD + + QW P Sbjct: 61 VLTLSPNAPLALSTFFLSVAADDTVTLRVTVSDSGTASFTGQWSP 105 Searching..................................................done Results from round 2 Score E Sequences producing significant alignments: (bits) Value Sequences used in model and found again: UniRef50_B7MII3 Curli assembly protein csgC n=58 Tax=Enterobacte... 156 2e-37 UniRef50_C9Y0V6 Putative curli production protein csgC n=1 Tax=C... 149 3e-35 UniRef50_A9MH19 Curli assembly protein csgC n=35 Tax=Enterobacte... 147 1e-34 Sequences not found previously or not previously below threshold: CONVERGED! >UniRef50_B7MII3 Curli assembly protein csgC n=58 Tax=Enterobacteriaceae RepID=CSGC_ECO45 Length = 110 Score = 156 bits (393), Expect = 2e-37, Method: Composition-based stats. Identities = 108/110 (98%), Positives = 109/110 (99%) Query: 1 MNTLLLLAALSSQITFNTTQQGDVYTIIPEVTLTQSCLCRVQILSLREGSSGQSQTKQEK 60 MN LLLLAALSSQITFNTTQQGD+YTIIPEVTLTQSCLCRVQILSLREGSSGQSQTKQEK Sbjct: 1 MNALLLLAALSSQITFNTTQQGDMYTIIPEVTLTQSCLCRVQILSLREGSSGQSQTKQEK 60 Query: 61 TLSLPANQPIALTKLSLNISPDDRVKIVVTVSDGQSLHLSQQWPPSSEKS 110 TLSLPANQPIALTKLSLNISPDDRVKIVVTVSDGQSLHLSQQWPPSSEKS Sbjct: 61 TLSLPANQPIALTKLSLNISPDDRVKIVVTVSDGQSLHLSQQWPPSSEKS 110 >UniRef50_C9Y0V6 Putative curli production protein csgC n=1 Tax=Cronobacter turicensis RepID=C9Y0V6_CROTZ Length = 109 Score = 149 bits (375), Expect = 3e-35, Method: Composition-based stats. Identities = 42/108 (38%), Positives = 64/108 (59%) Query: 1 MNTLLLLAALSSQITFNTTQQGDVYTIIPEVTLTQSCLCRVQILSLREGSSGQSQTKQEK 60 M LL++AALS ++ F Q + + P VT+ +CLCR+++ R+GSSG T+Q K Sbjct: 1 MLDLLMIAALSGEVAFKAQPQDAGWQVTPVVTVAHACLCRIEMELRRQGSSGSVVTQQRK 60 Query: 61 TLSLPANQPIALTKLSLNISPDDRVKIVVTVSDGQSLHLSQQWPPSSE 108 L+L N P+AL+ L+++ DD V + VTVSD + + QW P Sbjct: 61 VLTLSPNAPLALSTFFLSVAADDTVTLRVTVSDSGTASFTGQWSPPVR 108 >UniRef50_A9MH19 Curli assembly protein csgC n=35 Tax=Enterobacteriaceae RepID=CSGC_SALAR Length = 108 Score = 147 bits (370), Expect = 1e-34, Method: Composition-based stats. Identities = 75/108 (69%), Positives = 97/108 (89%) Query: 1 MNTLLLLAALSSQITFNTTQQGDVYTIIPEVTLTQSCLCRVQILSLREGSSGQSQTKQEK 60 M+TLLLLAALS+QITF TTQQGD+YT+IP+V L++ C+C+VQILS+R G+ GQS T+Q++ Sbjct: 1 MHTLLLLAALSNQITFITTQQGDIYTVIPQVILSEPCVCQVQILSVRNGTWGQSHTQQKQ 60 Query: 61 TLSLPANQPIALTKLSLNISPDDRVKIVVTVSDGQSLHLSQQWPPSSE 108 TLSLPANQPI L++LS+NIS +D VKI++TVS+GQSLHLSQQWP S+ Sbjct: 61 TLSLPANQPIELSRLSVNISSEDSVKIILTVSNGQSLHLSQQWPSSAR 108 Database: uniref50.fasta Posted date: Mar 8, 2010 10:38 AM Number of letters in database: 1,040,396,356 Number of sequences in database: 3,077,464 Lambda K H 0.303 0.124 0.322 Lambda K H 0.267 0.0384 0.140 Matrix: BLOSUM62 Gap Penalties: Existence: 11, Extension: 1 Number of Hits to DB: 346,683,309 Number of Sequences: 3077464 Number of extensions: 10326610 Number of successful extensions: 23582 Number of sequences better than 1.0e-01: 3 Number of HSP's better than 0.1 without gapping: 6 Number of HSP's successfully gapped in prelim test: 0 Number of HSP's that attempted gapping in prelim test: 23576 Number of HSP's gapped (non-prelim): 6 length of query: 110 length of database: 1,040,396,356 effective HSP length: 78 effective length of query: 32 effective length of database: 800,354,164 effective search space: 25611333248 effective search space used: 25611333248 T: 11 A: 40 X1: 16 ( 7.0 bits) X2: 38 (14.6 bits) X3: 64 (24.7 bits) S1: 42 (21.4 bits) S2: 87 (38.2 bits)