BLASTP 2.2.22 [Sep-27-2009] Reference: Altschul, Stephen F., Thomas L. Madden, Alejandro A. Schaffer, Jinghui Zhang, Zheng Zhang, Webb Miller, and David J. Lipman (1997), "Gapped BLAST and PSI-BLAST: a new generation of protein database search programs", Nucleic Acids Res. 25:3389-3402. Reference for compositional score matrix adjustment: Altschul, Stephen F., John C. Wootton, E. Michael Gertz, Richa Agarwala, Aleksandr Morgulis, Alejandro A. Schaffer, and Yi-Kuo Yu (2005) "Protein database searches using compositionally adjusted substitution matrices", FEBS J. 272:5101-5109. Reference for composition-based statistics starting in round 2: Schaffer, Alejandro A., L. Aravind, Thomas L. Madden, Sergei Shavirin, John L. Spouge, Yuri I. Wolf, Eugene V. Koonin, and Stephen F. Altschul (2001), "Improving the accuracy of PSI-BLAST protein database searches with composition-based statistics and other refinements", Nucleic Acids Res. 29:2994-3005. Query= batch____ (139 letters) Database: uniref50.fasta 3,077,464 sequences; 1,040,396,356 total letters Searching..................................................done Results from round 1 Score E Sequences producing significant alignments: (bits) Value UniRef50_P03825 Protein pioO n=23 Tax=Enterobacteriaceae RepID=P... 286 2e-76 UniRef50_D2TMI1 Putative T2SS component n=1 Tax=Citrobacter rode... 60 2e-08 UniRef50_B3DWC9 Uncharacterized conserved protein n=1 Tax=Methyl... 39 0.059 >UniRef50_P03825 Protein pioO n=23 Tax=Enterobacteriaceae RepID=PIOO_ECOLI Length = 139 Score = 286 bits (731), Expect = 2e-76, Method: Compositional matrix adjust. Identities = 139/139 (100%), Positives = 139/139 (100%) Query: 1 MFEFYIAAREQKETGHPGIFSRQKHSTIIYVICLLLICLWFAGMVLVGGYARQLWVLWIV 60 MFEFYIAAREQKETGHPGIFSRQKHSTIIYVICLLLICLWFAGMVLVGGYARQLWVLWIV Sbjct: 1 MFEFYIAAREQKETGHPGIFSRQKHSTIIYVICLLLICLWFAGMVLVGGYARQLWVLWIV 60 Query: 61 KAEVTVEAETPAFKQSTQHYFFKKQPLPVVESVEEEDDPGVAVENAPSSSEDEENTVEES 120 KAEVTVEAETPAFKQSTQHYFFKKQPLPVVESVEEEDDPGVAVENAPSSSEDEENTVEES Sbjct: 61 KAEVTVEAETPAFKQSTQHYFFKKQPLPVVESVEEEDDPGVAVENAPSSSEDEENTVEES 120 Query: 121 EEKAGLRERVKNALNELER 139 EEKAGLRERVKNALNELER Sbjct: 121 EEKAGLRERVKNALNELER 139 >UniRef50_D2TMI1 Putative T2SS component n=1 Tax=Citrobacter rodentium ICC168 RepID=D2TMI1_CITRO Length = 132 Score = 60.5 bits (145), Expect = 2e-08, Method: Compositional matrix adjust. Identities = 46/141 (32%), Positives = 70/141 (49%), Gaps = 14/141 (9%) Query: 1 MFEFYIAAREQKETGHPGIFSRQKHSTIIYVICLLLICLWFAGMVLVGGYARQLWVLWIV 60 MF Y AR+ ++ + TI +V+ L L+ AG ++GGYA Q Sbjct: 1 MFHLYCRARQVATLSEQSMWHERLTGTIGFVLQLCGWLLFAAGCFVLGGYAYQ-----AC 55 Query: 61 KAEVTVEAETPAFKQSTQHYFFKKQPLPVVE--SVEEEDDPGVAVENAPSSSEDEENTVE 118 + T +P + Q+ +H+ FKK+PLPV E SV+EE E+ P++ E E Sbjct: 56 QERRTEVPTSPDYYQTQEHFVFKKKPLPVPEEVSVDEEQ------ESRPAAEAPTEPVAE 109 Query: 119 ESEE-KAGLRERVKNALNELE 138 E + A L+ RVK A+ EL+ Sbjct: 110 EGDSANAELKRRVKEAMAELD 130 >UniRef50_B3DWC9 Uncharacterized conserved protein n=1 Tax=Methylacidiphilum infernorum V4 RepID=B3DWC9_METI4 Length = 399 Score = 38.5 bits (88), Expect = 0.059, Method: Compositional matrix adjust. Identities = 19/73 (26%), Positives = 37/73 (50%), Gaps = 3/73 (4%) Query: 17 PGIFSRQKHSTIIYVICL---LLICLWFAGMVLVGGYARQLWVLWIVKAEVTVEAETPAF 73 PG+ S ++ +Y+ + L W + G + WVL +VK + VE+E+P Sbjct: 68 PGLLSLRRKKPFVYLFAIDPYSLYAYWIIDQKELRGAWKNQWVLRLVKNDSIVESESPLE 127 Query: 74 KQSTQHYFFKKQP 86 ++ + Y+F+ +P Sbjct: 128 LKTQKKYYFQAKP 140 Searching..................................................done Results from round 2 Score E Sequences producing significant alignments: (bits) Value Sequences used in model and found again: UniRef50_P03825 Protein pioO n=23 Tax=Enterobacteriaceae RepID=P... 221 7e-57 UniRef50_D2TMI1 Putative T2SS component n=1 Tax=Citrobacter rode... 186 2e-46 Sequences not found previously or not previously below threshold: UniRef50_C4LB53 General secretion pathway protein B n=1 Tax=Tolu... 49 5e-05 >UniRef50_P03825 Protein pioO n=23 Tax=Enterobacteriaceae RepID=PIOO_ECOLI Length = 139 Score = 221 bits (562), Expect = 7e-57, Method: Composition-based stats. Identities = 139/139 (100%), Positives = 139/139 (100%) Query: 1 MFEFYIAAREQKETGHPGIFSRQKHSTIIYVICLLLICLWFAGMVLVGGYARQLWVLWIV 60 MFEFYIAAREQKETGHPGIFSRQKHSTIIYVICLLLICLWFAGMVLVGGYARQLWVLWIV Sbjct: 1 MFEFYIAAREQKETGHPGIFSRQKHSTIIYVICLLLICLWFAGMVLVGGYARQLWVLWIV 60 Query: 61 KAEVTVEAETPAFKQSTQHYFFKKQPLPVVESVEEEDDPGVAVENAPSSSEDEENTVEES 120 KAEVTVEAETPAFKQSTQHYFFKKQPLPVVESVEEEDDPGVAVENAPSSSEDEENTVEES Sbjct: 61 KAEVTVEAETPAFKQSTQHYFFKKQPLPVVESVEEEDDPGVAVENAPSSSEDEENTVEES 120 Query: 121 EEKAGLRERVKNALNELER 139 EEKAGLRERVKNALNELER Sbjct: 121 EEKAGLRERVKNALNELER 139 >UniRef50_D2TMI1 Putative T2SS component n=1 Tax=Citrobacter rodentium ICC168 RepID=D2TMI1_CITRO Length = 132 Score = 186 bits (473), Expect = 2e-46, Method: Composition-based stats. Identities = 46/141 (32%), Positives = 70/141 (49%), Gaps = 14/141 (9%) Query: 1 MFEFYIAAREQKETGHPGIFSRQKHSTIIYVICLLLICLWFAGMVLVGGYARQLWVLWIV 60 MF Y AR+ ++ + TI +V+ L L+ AG ++GGYA Q Sbjct: 1 MFHLYCRARQVATLSEQSMWHERLTGTIGFVLQLCGWLLFAAGCFVLGGYAYQ-----AC 55 Query: 61 KAEVTVEAETPAFKQSTQHYFFKKQPLPVVE--SVEEEDDPGVAVENAPSSSEDEENTVE 118 + T +P + Q+ +H+ FKK+PLPV E SV+EE E+ P++ E E Sbjct: 56 QERRTEVPTSPDYYQTQEHFVFKKKPLPVPEEVSVDEEQ------ESRPAAEAPTEPVAE 109 Query: 119 ESEE-KAGLRERVKNALNELE 138 E + A L+ RVK A+ EL+ Sbjct: 110 EGDSANAELKRRVKEAMAELD 130 >UniRef50_C4LB53 General secretion pathway protein B n=1 Tax=Tolumonas auensis DSM 9187 RepID=C4LB53_TOLAT Length = 235 Score = 49.0 bits (115), Expect = 5e-05, Method: Composition-based stats. Identities = 30/135 (22%), Positives = 50/135 (37%), Gaps = 10/135 (7%) Query: 7 AAREQKETGHPGIF-SRQKHSTIIYVICLLLICLWFAGMVLVGGYARQLWVLW----IVK 61 A+R+ K F + H L+ L ++G W L +V+ Sbjct: 7 ASRQAKRVEQAHFFMPERLHGERGSKASWLVWLLGLPVCAVLGAATNYGWHLLHNEPVVQ 66 Query: 62 AEVTVEAETPAFKQSTQHYFFKKQPLPVVESVEEEDDPGVAVENAPSSSEDEENTVEESE 121 FK Q+Y F +PLP E E+ ++ A E+ EE++ Sbjct: 67 RVEVPHPTELPFKPLNQNYEFVTEPLP-----EAEEQNLAMIDPAALPLPAEQPAAEEAD 121 Query: 122 EKAGLRERVKNALNE 136 ++ L +R A+NE Sbjct: 122 TRSSLEKRFMQAVNE 136 Searching..................................................done Results from round 3 Score E Sequences producing significant alignments: (bits) Value Sequences used in model and found again: UniRef50_P03825 Protein pioO n=23 Tax=Enterobacteriaceae RepID=P... 196 2e-49 UniRef50_C4LB53 General secretion pathway protein B n=1 Tax=Tolu... 163 2e-39 UniRef50_D2TMI1 Putative T2SS component n=1 Tax=Citrobacter rode... 163 2e-39 Sequences not found previously or not previously below threshold: UniRef50_UPI000182659F hypothetical protein EcanA3_01905 n=1 Tax... 45 8e-04 >UniRef50_P03825 Protein pioO n=23 Tax=Enterobacteriaceae RepID=PIOO_ECOLI Length = 139 Score = 196 bits (498), Expect = 2e-49, Method: Composition-based stats. Identities = 139/139 (100%), Positives = 139/139 (100%) Query: 1 MFEFYIAAREQKETGHPGIFSRQKHSTIIYVICLLLICLWFAGMVLVGGYARQLWVLWIV 60 MFEFYIAAREQKETGHPGIFSRQKHSTIIYVICLLLICLWFAGMVLVGGYARQLWVLWIV Sbjct: 1 MFEFYIAAREQKETGHPGIFSRQKHSTIIYVICLLLICLWFAGMVLVGGYARQLWVLWIV 60 Query: 61 KAEVTVEAETPAFKQSTQHYFFKKQPLPVVESVEEEDDPGVAVENAPSSSEDEENTVEES 120 KAEVTVEAETPAFKQSTQHYFFKKQPLPVVESVEEEDDPGVAVENAPSSSEDEENTVEES Sbjct: 61 KAEVTVEAETPAFKQSTQHYFFKKQPLPVVESVEEEDDPGVAVENAPSSSEDEENTVEES 120 Query: 121 EEKAGLRERVKNALNELER 139 EEKAGLRERVKNALNELER Sbjct: 121 EEKAGLRERVKNALNELER 139 >UniRef50_C4LB53 General secretion pathway protein B n=1 Tax=Tolumonas auensis DSM 9187 RepID=C4LB53_TOLAT Length = 235 Score = 163 bits (411), Expect = 2e-39, Method: Composition-based stats. Identities = 30/135 (22%), Positives = 50/135 (37%), Gaps = 10/135 (7%) Query: 7 AAREQKETGHPGIF-SRQKHSTIIYVICLLLICLWFAGMVLVGGYARQLWVLW----IVK 61 A+R+ K F + H L+ L ++G W L +V+ Sbjct: 7 ASRQAKRVEQAHFFMPERLHGERGSKASWLVWLLGLPVCAVLGAATNYGWHLLHNEPVVQ 66 Query: 62 AEVTVEAETPAFKQSTQHYFFKKQPLPVVESVEEEDDPGVAVENAPSSSEDEENTVEESE 121 FK Q+Y F +PLP E E+ ++ A E+ EE++ Sbjct: 67 RVEVPHPTELPFKPLNQNYEFVTEPLP-----EAEEQNLAMIDPAALPLPAEQPAAEEAD 121 Query: 122 EKAGLRERVKNALNE 136 ++ L +R A+NE Sbjct: 122 TRSSLEKRFMQAVNE 136 >UniRef50_D2TMI1 Putative T2SS component n=1 Tax=Citrobacter rodentium ICC168 RepID=D2TMI1_CITRO Length = 132 Score = 163 bits (411), Expect = 2e-39, Method: Composition-based stats. Identities = 43/139 (30%), Positives = 69/139 (49%), Gaps = 10/139 (7%) Query: 1 MFEFYIAAREQKETGHPGIFSRQKHSTIIYVICLLLICLWFAGMVLVGGYARQLWVLWIV 60 MF Y AR+ ++ + TI +V+ L L+ AG ++GGYA Q Sbjct: 1 MFHLYCRARQVATLSEQSMWHERLTGTIGFVLQLCGWLLFAAGCFVLGGYAYQ-----AC 55 Query: 61 KAEVTVEAETPAFKQSTQHYFFKKQPLPVVESVEEEDDPGVAVENAPSSSEDEENTVEES 120 + T +P + Q+ +H+ FKK+PLPV E V +++ E+ P++ E EE Sbjct: 56 QERRTEVPTSPDYYQTQEHFVFKKKPLPVPEEVSVDEEQ----ESRPAAEAPTEPVAEEG 111 Query: 121 EE-KAGLRERVKNALNELE 138 + A L+ RVK A+ EL+ Sbjct: 112 DSANAELKRRVKEAMAELD 130 >UniRef50_UPI000182659F hypothetical protein EcanA3_01905 n=1 Tax=Enterobacter cancerogenus ATCC 35316 RepID=UPI000182659F Length = 210 Score = 44.8 bits (104), Expect = 8e-04, Method: Composition-based stats. Identities = 27/131 (20%), Positives = 40/131 (30%), Gaps = 9/131 (6%) Query: 1 MFEFYIAAREQKETGHPGIFSRQKHSTIIYVICLLLICLWFAGMVLVGGYARQLWVLWIV 60 M +AA+ ETG FS + L L +++ G YA W L Sbjct: 1 MSTICLAAQRSYETGEAVWFSYRFPGLKKIFGTTFLWLLGLCVLIVAGLYAHVYWNLRHP 60 Query: 61 KAEVTVEAE-TPAFKQSTQHYFFKKQPLPVV--------ESVEEEDDPGVAVENAPSSSE 111 T P S HY + +P P E V D + ++A Sbjct: 61 APVTTKVNTVKPEATLSDMHYIYVSKPFPHPQPKPIPVHEDVPAMQDVPINSDDADWQQA 120 Query: 112 DEENTVEESEE 122 + E + Sbjct: 121 PDGVAEPEHDT 131 Database: uniref50.fasta Posted date: Mar 8, 2010 10:38 AM Number of letters in database: 1,040,396,356 Number of sequences in database: 3,077,464 Lambda K H 0.304 0.120 0.303 Lambda K H 0.267 0.0357 0.140 Matrix: BLOSUM62 Gap Penalties: Existence: 11, Extension: 1 Number of Hits to DB: 677,759,358 Number of Sequences: 3077464 Number of extensions: 23721728 Number of successful extensions: 221124 Number of sequences better than 1.0e-01: 13 Number of HSP's better than 0.1 without gapping: 7 Number of HSP's successfully gapped in prelim test: 19 Number of HSP's that attempted gapping in prelim test: 221000 Number of HSP's gapped (non-prelim): 110 length of query: 139 length of database: 1,040,396,356 effective HSP length: 103 effective length of query: 36 effective length of database: 723,417,564 effective search space: 26043032304 effective search space used: 26043032304 T: 11 A: 40 X1: 16 ( 7.0 bits) X2: 38 (14.6 bits) X3: 64 (24.7 bits) S1: 41 (21.0 bits) S2: 86 (37.9 bits)