BLASTP 2.2.22 [Sep-27-2009] Reference: Altschul, Stephen F., Thomas L. Madden, Alejandro A. Schaffer, Jinghui Zhang, Zheng Zhang, Webb Miller, and David J. Lipman (1997), "Gapped BLAST and PSI-BLAST: a new generation of protein database search programs", Nucleic Acids Res. 25:3389-3402. Reference for compositional score matrix adjustment: Altschul, Stephen F., John C. Wootton, E. Michael Gertz, Richa Agarwala, Aleksandr Morgulis, Alejandro A. Schaffer, and Yi-Kuo Yu (2005) "Protein database searches using compositionally adjusted substitution matrices", FEBS J. 272:5101-5109. Reference for composition-based statistics starting in round 2: Schaffer, Alejandro A., L. Aravind, Thomas L. Madden, Sergei Shavirin, John L. Spouge, Yuri I. Wolf, Eugene V. Koonin, and Stephen F. Altschul (2001), "Improving the accuracy of PSI-BLAST protein database searches with composition-based statistics and other refinements", Nucleic Acids Res. 29:2994-3005. Query= batch____ (362 letters) Database: uniref50.fasta 3,077,464 sequences; 1,040,396,356 total letters Searching..................................................done Results from round 1 Score E Sequences producing significant alignments: (bits) Value UniRef50_P10030 Transcriptional repressor pifC n=6 Tax=Enterobac... 731 0.0 UniRef50_C9Y5U4 Putative uncharacterized protein n=1 Tax=Cronoba... 207 3e-52 >UniRef50_P10030 Transcriptional repressor pifC n=6 Tax=Enterobacteriaceae RepID=PIFC_ECOLI Length = 362 Score = 731 bits (1888), Expect = 0.0, Method: Compositional matrix adjust. Identities = 362/362 (100%), Positives = 362/362 (100%) Query: 1 MLSQLNLRFHKKLIEALKTRAGRENTSVNALAERFLDDGLKTVAPGDGYFQLIADPEATV 60 MLSQLNLRFHKKLIEALKTRAGRENTSVNALAERFLDDGLKTVAPGDGYFQLIADPEATV Sbjct: 1 MLSQLNLRFHKKLIEALKTRAGRENTSVNALAERFLDDGLKTVAPGDGYFQLIADPEATV 60 Query: 61 RQLYRHIILGQTFGTSALSRDELRFVLVHVREAFLRGHNRLATLPALDTLLDITGNLLAW 120 RQLYRHIILGQTFGTSALSRDELRFVLVHVREAFLRGHNRLATLPALDTLLDITGNLLAW Sbjct: 61 RQLYRHIILGQTFGTSALSRDELRFVLVHVREAFLRGHNRLATLPALDTLLDITGNLLAW 120 Query: 121 QVEHDRPVDGHYLKGIFRLAGKNWTEEFEAFRAALRPVVDQMYAEHLLRPLESDCFGLAE 180 QVEHDRPVDGHYLKGIFRLAGKNWTEEFEAFRAALRPVVDQMYAEHLLRPLESDCFGLAE Sbjct: 121 QVEHDRPVDGHYLKGIFRLAGKNWTEEFEAFRAALRPVVDQMYAEHLLRPLESDCFGLAE 180 Query: 181 VPDAVLAEIFTLPRLKAVFPLMLRGLDWNTEQARTLAQELRPVISAVTETIEAGTLRLEI 240 VPDAVLAEIFTLPRLKAVFPLMLRGLDWNTEQARTLAQELRPVISAVTETIEAGTLRLEI Sbjct: 181 VPDAVLAEIFTLPRLKAVFPLMLRGLDWNTEQARTLAQELRPVISAVTETIEAGTLRLEI 240 Query: 241 RVDGQHPGERPGAWYTTPRLHLLITGQDFVVPYGWEALSELLGLFTLYARHPEALTHGHQ 300 RVDGQHPGERPGAWYTTPRLHLLITGQDFVVPYGWEALSELLGLFTLYARHPEALTHGHQ Sbjct: 241 RVDGQHPGERPGAWYTTPRLHLLITGQDFVVPYGWEALSELLGLFTLYARHPEALTHGHQ 300 Query: 301 GERVMFSPPGNVTPEGFFGIDGLRIFMPAEAFETLVRELATRCQEGPLAEALTGLRCLYG 360 GERVMFSPPGNVTPEGFFGIDGLRIFMPAEAFETLVRELATRCQEGPLAEALTGLRCLYG Sbjct: 301 GERVMFSPPGNVTPEGFFGIDGLRIFMPAEAFETLVRELATRCQEGPLAEALTGLRCLYG 360 Query: 361 DL 362 DL Sbjct: 361 DL 362 >UniRef50_C9Y5U4 Putative uncharacterized protein n=1 Tax=Cronobacter turicensis RepID=C9Y5U4_CROTZ Length = 365 Score = 207 bits (528), Expect = 3e-52, Method: Compositional matrix adjust. Identities = 131/364 (35%), Positives = 185/364 (50%), Gaps = 5/364 (1%) Query: 1 MLSQLNLRFHKKLIEALKTRAGRENTSVNALAERFLDDGLKTVAPGDGYFQLIADPEATV 60 MLSQLNLRF KKLIE+LK+RA E TSVNALA RF+++ L + APGD L ADP T Sbjct: 5 MLSQLNLRFPKKLIESLKSRASAEATSVNALAGRFIEEKLMSAAPGDDSLALNADPAGTR 64 Query: 61 RQLYRHIILGQTFGTSALSRDELRFVLVHVREAFLRGHNRLATLPALDTLLDITGNLLAW 120 LYR I+ G+ FG L ELR++ H A L G + + P ++ L++IT + L + Sbjct: 65 ESLYRKIVRGEFFGRQTLRHAELRWLFDHAHRACLYGSGYV-SWPVIEALMNITFDALLY 123 Query: 121 QVEHDRPVDGHYLKGIFRLAGKNWTEEFEAFRAALRPVVDQMYAEHLLRPLESDCFGLAE 180 H VD Y+ F GKN+ EE + F A + VD +AE+LLRPL S L Sbjct: 124 AEAHKIEVDTFYINRTFDFPGKNYPEETQRFMAVMPRHVDPSWAEYLLRPLSSGALELQN 183 Query: 181 VPDAVLAEIFTLPRLKAVFPLMLRGLDWNTEQARTLAQELRPVISAVTETIEAGTLRLEI 240 PD LA+I + RL+ +FPL+++ + ++ + V + T E G +RL + Sbjct: 184 FPDEALAQICSPDRLRLIFPLVVKAQALDEQEMKAWVAATGLVTEDLNLTAEVGDIRLHV 243 Query: 241 RVDGQHPGERPGAWYTTPRLHLLITGQDFVVPYGWEALSELLGLFTLYARHPEALTHG-- 298 +V G + PG + P L+++ V GWE S L+ L AR + + HG Sbjct: 244 QVSGNRAPQLPGREWEAPTFGLIVSAGCVVTAMGWEVFSALVR--QLQARAAQPVLHGWH 301 Query: 299 HQGERVMFSPPGNVTPEGFFGIDGLRIFMPAEAFETLVRELATRCQEGPLAEALTGLRCL 358 + + V P + G+ G+ I M A+ + L A L LR L Sbjct: 302 SRDKHVSVYIPRAEGTDVILGLSGIHISMTADNYLALETAFLAEVNAPAAAPVLAELRAL 361 Query: 359 YGDL 362 YGDL Sbjct: 362 YGDL 365 Searching..................................................done Results from round 2 Score E Sequences producing significant alignments: (bits) Value Sequences used in model and found again: UniRef50_P10030 Transcriptional repressor pifC n=6 Tax=Enterobac... 598 e-169 UniRef50_C9Y5U4 Putative uncharacterized protein n=1 Tax=Cronoba... 576 e-163 Sequences not found previously or not previously below threshold: UniRef50_B1F9C7 Putative uncharacterized protein n=1 Tax=Burkhol... 43 0.021 CONVERGED! >UniRef50_P10030 Transcriptional repressor pifC n=6 Tax=Enterobacteriaceae RepID=PIFC_ECOLI Length = 362 Score = 598 bits (1542), Expect = e-169, Method: Composition-based stats. Identities = 362/362 (100%), Positives = 362/362 (100%) Query: 1 MLSQLNLRFHKKLIEALKTRAGRENTSVNALAERFLDDGLKTVAPGDGYFQLIADPEATV 60 MLSQLNLRFHKKLIEALKTRAGRENTSVNALAERFLDDGLKTVAPGDGYFQLIADPEATV Sbjct: 1 MLSQLNLRFHKKLIEALKTRAGRENTSVNALAERFLDDGLKTVAPGDGYFQLIADPEATV 60 Query: 61 RQLYRHIILGQTFGTSALSRDELRFVLVHVREAFLRGHNRLATLPALDTLLDITGNLLAW 120 RQLYRHIILGQTFGTSALSRDELRFVLVHVREAFLRGHNRLATLPALDTLLDITGNLLAW Sbjct: 61 RQLYRHIILGQTFGTSALSRDELRFVLVHVREAFLRGHNRLATLPALDTLLDITGNLLAW 120 Query: 121 QVEHDRPVDGHYLKGIFRLAGKNWTEEFEAFRAALRPVVDQMYAEHLLRPLESDCFGLAE 180 QVEHDRPVDGHYLKGIFRLAGKNWTEEFEAFRAALRPVVDQMYAEHLLRPLESDCFGLAE Sbjct: 121 QVEHDRPVDGHYLKGIFRLAGKNWTEEFEAFRAALRPVVDQMYAEHLLRPLESDCFGLAE 180 Query: 181 VPDAVLAEIFTLPRLKAVFPLMLRGLDWNTEQARTLAQELRPVISAVTETIEAGTLRLEI 240 VPDAVLAEIFTLPRLKAVFPLMLRGLDWNTEQARTLAQELRPVISAVTETIEAGTLRLEI Sbjct: 181 VPDAVLAEIFTLPRLKAVFPLMLRGLDWNTEQARTLAQELRPVISAVTETIEAGTLRLEI 240 Query: 241 RVDGQHPGERPGAWYTTPRLHLLITGQDFVVPYGWEALSELLGLFTLYARHPEALTHGHQ 300 RVDGQHPGERPGAWYTTPRLHLLITGQDFVVPYGWEALSELLGLFTLYARHPEALTHGHQ Sbjct: 241 RVDGQHPGERPGAWYTTPRLHLLITGQDFVVPYGWEALSELLGLFTLYARHPEALTHGHQ 300 Query: 301 GERVMFSPPGNVTPEGFFGIDGLRIFMPAEAFETLVRELATRCQEGPLAEALTGLRCLYG 360 GERVMFSPPGNVTPEGFFGIDGLRIFMPAEAFETLVRELATRCQEGPLAEALTGLRCLYG Sbjct: 301 GERVMFSPPGNVTPEGFFGIDGLRIFMPAEAFETLVRELATRCQEGPLAEALTGLRCLYG 360 Query: 361 DL 362 DL Sbjct: 361 DL 362 >UniRef50_C9Y5U4 Putative uncharacterized protein n=1 Tax=Cronobacter turicensis RepID=C9Y5U4_CROTZ Length = 365 Score = 576 bits (1485), Expect = e-163, Method: Composition-based stats. Identities = 131/364 (35%), Positives = 185/364 (50%), Gaps = 5/364 (1%) Query: 1 MLSQLNLRFHKKLIEALKTRAGRENTSVNALAERFLDDGLKTVAPGDGYFQLIADPEATV 60 MLSQLNLRF KKLIE+LK+RA E TSVNALA RF+++ L + APGD L ADP T Sbjct: 5 MLSQLNLRFPKKLIESLKSRASAEATSVNALAGRFIEEKLMSAAPGDDSLALNADPAGTR 64 Query: 61 RQLYRHIILGQTFGTSALSRDELRFVLVHVREAFLRGHNRLATLPALDTLLDITGNLLAW 120 LYR I+ G+ FG L ELR++ H A L G + + P ++ L++IT + L + Sbjct: 65 ESLYRKIVRGEFFGRQTLRHAELRWLFDHAHRACLYGSGYV-SWPVIEALMNITFDALLY 123 Query: 121 QVEHDRPVDGHYLKGIFRLAGKNWTEEFEAFRAALRPVVDQMYAEHLLRPLESDCFGLAE 180 H VD Y+ F GKN+ EE + F A + VD +AE+LLRPL S L Sbjct: 124 AEAHKIEVDTFYINRTFDFPGKNYPEETQRFMAVMPRHVDPSWAEYLLRPLSSGALELQN 183 Query: 181 VPDAVLAEIFTLPRLKAVFPLMLRGLDWNTEQARTLAQELRPVISAVTETIEAGTLRLEI 240 PD LA+I + RL+ +FPL+++ + ++ + V + T E G +RL + Sbjct: 184 FPDEALAQICSPDRLRLIFPLVVKAQALDEQEMKAWVAATGLVTEDLNLTAEVGDIRLHV 243 Query: 241 RVDGQHPGERPGAWYTTPRLHLLITGQDFVVPYGWEALSELLGLFTLYARHPEALTHG-- 298 +V G + PG + P L+++ V GWE S L+ L AR + + HG Sbjct: 244 QVSGNRAPQLPGREWEAPTFGLIVSAGCVVTAMGWEVFSALVR--QLQARAAQPVLHGWH 301 Query: 299 HQGERVMFSPPGNVTPEGFFGIDGLRIFMPAEAFETLVRELATRCQEGPLAEALTGLRCL 358 + + V P + G+ G+ I M A+ + L A L LR L Sbjct: 302 SRDKHVSVYIPRAEGTDVILGLSGIHISMTADNYLALETAFLAEVNAPAAAPVLAELRAL 361 Query: 359 YGDL 362 YGDL Sbjct: 362 YGDL 365 >UniRef50_B1F9C7 Putative uncharacterized protein n=1 Tax=Burkholderia ambifaria IOP40-10 RepID=B1F9C7_9BURK Length = 358 Score = 42.7 bits (99), Expect = 0.021, Method: Composition-based stats. Identities = 53/254 (20%), Positives = 97/254 (38%), Gaps = 22/254 (8%) Query: 5 LNLRFHKKLIEAL-KTRAGRENTSVNALAERFLDDGL-----KTVAPGDGYFQLIADPEA 58 +R + L++A+ K + + +S + +A R ++ GL + D L+ DP A Sbjct: 6 FTMRISESLVQAVEKIKGQWQESSTSEVARRLIELGLDADRRQREEHADLRSALLDDPRA 65 Query: 59 TVRQLYRHIILGQTFGTSALSRDELRFVLVHVREAFLRGHNRLATLPALDTLLDITGNLL 118 + Q + + L R+E F+ V +A+ R T L +L T L Sbjct: 66 ALVQF-----RDRYYRNEPLQREEWAFIAQRVHDAYGRARRTFVTRAVLVDVLQATRALF 120 Query: 119 AWQVEH----DRPVDGHYLKGIFRLAGKNWTEEFEAFRAALRPVVDQMYAEHLLRPLESD 174 + H D DG++ + ++ + A L DQ YAE L R + Sbjct: 121 IARTRHTGRVDYVADGYHRSKLNLRDDESLVDGIARVAAGLPAWPDQSYAEWLSRSF-TG 179 Query: 175 CF--GLAEVPDAVLAEIFTLPRLKAVFPLMLRGLDWNTEQARTLAQELRPVISAVTETIE 232 F ++PDA + E P ++ L +R + E+ + +T ++ Sbjct: 180 FFNGEEPDLPDAAIHEALA-PYFDSLLGLAVRA--YWVEKGEPIVAADSAFARPLT-VVK 235 Query: 233 AGTLRLEIRVDGQH 246 L ++ GQ Sbjct: 236 LDDLSVQFMFTGQR 249 Database: uniref50.fasta Posted date: Mar 8, 2010 10:38 AM Number of letters in database: 1,040,396,356 Number of sequences in database: 3,077,464 Lambda K H 0.314 0.138 0.374 Lambda K H 0.267 0.0406 0.140 Matrix: BLOSUM62 Gap Penalties: Existence: 11, Extension: 1 Number of Hits to DB: 1,343,755,457 Number of Sequences: 3077464 Number of extensions: 53127917 Number of successful extensions: 136110 Number of sequences better than 1.0e-01: 3 Number of HSP's better than 0.1 without gapping: 4 Number of HSP's successfully gapped in prelim test: 2 Number of HSP's that attempted gapping in prelim test: 136096 Number of HSP's gapped (non-prelim): 11 length of query: 362 length of database: 1,040,396,356 effective HSP length: 130 effective length of query: 232 effective length of database: 640,326,036 effective search space: 148555640352 effective search space used: 148555640352 T: 11 A: 40 X1: 16 ( 7.2 bits) X2: 38 (14.6 bits) X3: 64 (24.7 bits) S1: 41 (21.4 bits) S2: 93 (40.4 bits)