BLASTP 2.2.22 [Sep-27-2009] Reference: Altschul, Stephen F., Thomas L. Madden, Alejandro A. Schaffer, Jinghui Zhang, Zheng Zhang, Webb Miller, and David J. Lipman (1997), "Gapped BLAST and PSI-BLAST: a new generation of protein database search programs", Nucleic Acids Res. 25:3389-3402. Reference for compositional score matrix adjustment: Altschul, Stephen F., John C. Wootton, E. Michael Gertz, Richa Agarwala, Aleksandr Morgulis, Alejandro A. Schaffer, and Yi-Kuo Yu (2005) "Protein database searches using compositionally adjusted substitution matrices", FEBS J. 272:5101-5109. Reference for composition-based statistics starting in round 2: Schaffer, Alejandro A., L. Aravind, Thomas L. Madden, Sergei Shavirin, John L. Spouge, Yuri I. Wolf, Eugene V. Koonin, and Stephen F. Altschul (2001), "Improving the accuracy of PSI-BLAST protein database searches with composition-based statistics and other refinements", Nucleic Acids Res. 29:2994-3005. Query= batch____ (67 letters) Database: uniref50.fasta 3,077,464 sequences; 1,040,396,356 total letters Searching..................................................done Results from round 1 Score E Sequences producing significant alignments: (bits) Value UniRef50_P64465 Putative selenoprotein ydfZ n=64 Tax=Enterobacte... 135 4e-31 UniRef50_D0YA57 Selenium-binding protein YdfZ n=36 Tax=Enterobac... 86 3e-16 UniRef50_C4LFP5 Selenium-binding protein YdfZ n=24 Tax=Gammaprot... 61 1e-08 UniRef50_A1JTE1 Putative uncharacterized protein n=8 Tax=Yersini... 57 2e-07 UniRef50_C6CD67 Selenium-binding protein YdfZ n=1 Tax=Dickeya da... 56 3e-07 >UniRef50_P64465 Putative selenoprotein ydfZ n=64 Tax=Enterobacteriaceae RepID=YDFZ_ECO57 Length = 67 Score = 135 bits (340), Expect = 4e-31, Method: Compositional matrix adjust. Identities = 67/67 (100%), Positives = 67/67 (100%) Query: 1 MTTYDRNRNAITTGSRVMVSGTGHTGKILSIDTEGLTAEQIRRGKTVVVEGCEEKLAPLD 60 MTTYDRNRNAITTGSRVMVSGTGHTGKILSIDTEGLTAEQIRRGKTVVVEGCEEKLAPLD Sbjct: 1 MTTYDRNRNAITTGSRVMVSGTGHTGKILSIDTEGLTAEQIRRGKTVVVEGCEEKLAPLD 60 Query: 61 LIRLGMN 67 LIRLGMN Sbjct: 61 LIRLGMN 67 >UniRef50_D0YA57 Selenium-binding protein YdfZ n=36 Tax=Enterobacteriaceae RepID=D0YA57_KLEVA Length = 67 Score = 86.3 bits (212), Expect = 3e-16, Method: Compositional matrix adjust. Identities = 41/67 (61%), Positives = 49/67 (73%) Query: 1 MTTYDRNRNAITTGSRVMVSGTGHTGKILSIDTEGLTAEQIRRGKTVVVEGCEEKLAPLD 60 M TYDRNRNAI TGS VM++G G TG I +I EG TAEQ+RR V ++GCE + PLD Sbjct: 1 MKTYDRNRNAIATGSMVMIAGNGTTGVIKAIHGEGKTAEQLRRADCVEIDGCEGRFCPLD 60 Query: 61 LIRLGMN 67 LIRLG + Sbjct: 61 LIRLGFH 67 >UniRef50_C4LFP5 Selenium-binding protein YdfZ n=24 Tax=Gammaproteobacteria RepID=C4LFP5_TOLAT Length = 66 Score = 60.8 bits (146), Expect = 1e-08, Method: Compositional matrix adjust. Identities = 27/62 (43%), Positives = 42/62 (67%) Query: 4 YDRNRNAITTGSRVMVSGTGHTGKILSIDTEGLTAEQIRRGKTVVVEGCEEKLAPLDLIR 63 YDRNRN ++ G RVM+ GTG + + T+GLT+ Q R K V++ +E+ AP++L++ Sbjct: 5 YDRNRNLLSAGQRVMIVGTGEIDVLETALTDGLTSSQAERSKCVLLTHAQERYAPIELVK 64 Query: 64 LG 65 LG Sbjct: 65 LG 66 >UniRef50_A1JTE1 Putative uncharacterized protein n=8 Tax=Yersinia RepID=A1JTE1_YERE8 Length = 70 Score = 57.0 bits (136), Expect = 2e-07, Method: Compositional matrix adjust. Identities = 28/65 (43%), Positives = 35/65 (53%) Query: 1 MTTYDRNRNAITTGSRVMVSGTGHTGKILSIDTEGLTAEQIRRGKTVVVEGCEEKLAPLD 60 + YDRNRN I G RVMV+ TG +I EGL Q R V ++ E+ P D Sbjct: 6 IKVYDRNRNQIIPGKRVMVAQTGEIDVAKAIHAEGLAPVQAEREPVVELQHTGERYVPFD 65 Query: 61 LIRLG 65 L+RLG Sbjct: 66 LVRLG 70 >UniRef50_C6CD67 Selenium-binding protein YdfZ n=1 Tax=Dickeya dadantii Ech703 RepID=C6CD67_DICDC Length = 76 Score = 56.2 bits (134), Expect = 3e-07, Method: Compositional matrix adjust. Identities = 28/66 (42%), Positives = 40/66 (60%) Query: 2 TTYDRNRNAITTGSRVMVSGTGHTGKILSIDTEGLTAEQIRRGKTVVVEGCEEKLAPLDL 61 T YDR RN + G +VM +G+ G I +I + LT +Q+R K V ++G E AP +L Sbjct: 3 TAYDRYRNPLDAGCQVMQNGSRRVGTIAAIHADNLTRDQVRLTKCVELKGVEGYFAPEEL 62 Query: 62 IRLGMN 67 +RLG N Sbjct: 63 MRLGRN 68 Searching..................................................done Results from round 2 Score E Sequences producing significant alignments: (bits) Value Sequences used in model and found again: UniRef50_D0YA57 Selenium-binding protein YdfZ n=36 Tax=Enterobac... 106 3e-22 UniRef50_C6CD67 Selenium-binding protein YdfZ n=1 Tax=Dickeya da... 99 5e-20 UniRef50_A1JTE1 Putative uncharacterized protein n=8 Tax=Yersini... 97 2e-19 UniRef50_P64465 Putative selenoprotein ydfZ n=64 Tax=Enterobacte... 96 3e-19 UniRef50_C4LFP5 Selenium-binding protein YdfZ n=24 Tax=Gammaprot... 92 4e-18 Sequences not found previously or not previously below threshold: CONVERGED! >UniRef50_D0YA57 Selenium-binding protein YdfZ n=36 Tax=Enterobacteriaceae RepID=D0YA57_KLEVA Length = 67 Score = 106 bits (264), Expect = 3e-22, Method: Composition-based stats. Identities = 41/67 (61%), Positives = 49/67 (73%) Query: 1 MTTYDRNRNAITTGSRVMVSGTGHTGKILSIDTEGLTAEQIRRGKTVVVEGCEEKLAPLD 60 M TYDRNRNAI TGS VM++G G TG I +I EG TAEQ+RR V ++GCE + PLD Sbjct: 1 MKTYDRNRNAIATGSMVMIAGNGTTGVIKAIHGEGKTAEQLRRADCVEIDGCEGRFCPLD 60 Query: 61 LIRLGMN 67 LIRLG + Sbjct: 61 LIRLGFH 67 >UniRef50_C6CD67 Selenium-binding protein YdfZ n=1 Tax=Dickeya dadantii Ech703 RepID=C6CD67_DICDC Length = 76 Score = 98.6 bits (244), Expect = 5e-20, Method: Composition-based stats. Identities = 28/66 (42%), Positives = 40/66 (60%) Query: 2 TTYDRNRNAITTGSRVMVSGTGHTGKILSIDTEGLTAEQIRRGKTVVVEGCEEKLAPLDL 61 T YDR RN + G +VM +G+ G I +I + LT +Q+R K V ++G E AP +L Sbjct: 3 TAYDRYRNPLDAGCQVMQNGSRRVGTIAAIHADNLTRDQVRLTKCVELKGVEGYFAPEEL 62 Query: 62 IRLGMN 67 +RLG N Sbjct: 63 MRLGRN 68 >UniRef50_A1JTE1 Putative uncharacterized protein n=8 Tax=Yersinia RepID=A1JTE1_YERE8 Length = 70 Score = 96.7 bits (239), Expect = 2e-19, Method: Composition-based stats. Identities = 28/65 (43%), Positives = 35/65 (53%) Query: 1 MTTYDRNRNAITTGSRVMVSGTGHTGKILSIDTEGLTAEQIRRGKTVVVEGCEEKLAPLD 60 + YDRNRN I G RVMV+ TG +I EGL Q R V ++ E+ P D Sbjct: 6 IKVYDRNRNQIIPGKRVMVAQTGEIDVAKAIHAEGLAPVQAEREPVVELQHTGERYVPFD 65 Query: 61 LIRLG 65 L+RLG Sbjct: 66 LVRLG 70 >UniRef50_P64465 Putative selenoprotein ydfZ n=64 Tax=Enterobacteriaceae RepID=YDFZ_ECO57 Length = 67 Score = 95.9 bits (237), Expect = 3e-19, Method: Composition-based stats. Identities = 67/67 (100%), Positives = 67/67 (100%) Query: 1 MTTYDRNRNAITTGSRVMVSGTGHTGKILSIDTEGLTAEQIRRGKTVVVEGCEEKLAPLD 60 MTTYDRNRNAITTGSRVMVSGTGHTGKILSIDTEGLTAEQIRRGKTVVVEGCEEKLAPLD Sbjct: 1 MTTYDRNRNAITTGSRVMVSGTGHTGKILSIDTEGLTAEQIRRGKTVVVEGCEEKLAPLD 60 Query: 61 LIRLGMN 67 LIRLGMN Sbjct: 61 LIRLGMN 67 >UniRef50_C4LFP5 Selenium-binding protein YdfZ n=24 Tax=Gammaproteobacteria RepID=C4LFP5_TOLAT Length = 66 Score = 92.4 bits (228), Expect = 4e-18, Method: Composition-based stats. Identities = 27/64 (42%), Positives = 42/64 (65%) Query: 2 TTYDRNRNAITTGSRVMVSGTGHTGKILSIDTEGLTAEQIRRGKTVVVEGCEEKLAPLDL 61 YDRNRN ++ G RVM+ GTG + + T+GLT+ Q R K V++ +E+ AP++L Sbjct: 3 KIYDRNRNLLSAGQRVMIVGTGEIDVLETALTDGLTSSQAERSKCVLLTHAQERYAPIEL 62 Query: 62 IRLG 65 ++LG Sbjct: 63 VKLG 66 Database: uniref50.fasta Posted date: Mar 8, 2010 10:38 AM Number of letters in database: 1,040,396,356 Number of sequences in database: 3,077,464 Lambda K H 0.311 0.134 0.346 Lambda K H 0.267 0.0413 0.140 Matrix: BLOSUM62 Gap Penalties: Existence: 11, Extension: 1 Number of Hits to DB: 226,999,379 Number of Sequences: 3077464 Number of extensions: 6098181 Number of successful extensions: 17985 Number of sequences better than 1.0e-01: 5 Number of HSP's better than 0.1 without gapping: 10 Number of HSP's successfully gapped in prelim test: 0 Number of HSP's that attempted gapping in prelim test: 17975 Number of HSP's gapped (non-prelim): 10 length of query: 67 length of database: 1,040,396,356 effective HSP length: 39 effective length of query: 28 effective length of database: 920,375,260 effective search space: 25770507280 effective search space used: 25770507280 T: 11 A: 40 X1: 16 ( 7.2 bits) X2: 38 (14.6 bits) X3: 64 (24.7 bits) S1: 41 (21.3 bits) S2: 87 (38.1 bits)