BLASTP 2.2.22 [Sep-27-2009] Reference: Altschul, Stephen F., Thomas L. Madden, Alejandro A. Schaffer, Jinghui Zhang, Zheng Zhang, Webb Miller, and David J. Lipman (1997), "Gapped BLAST and PSI-BLAST: a new generation of protein database search programs", Nucleic Acids Res. 25:3389-3402. Reference for compositional score matrix adjustment: Altschul, Stephen F., John C. Wootton, E. Michael Gertz, Richa Agarwala, Aleksandr Morgulis, Alejandro A. Schaffer, and Yi-Kuo Yu (2005) "Protein database searches using compositionally adjusted substitution matrices", FEBS J. 272:5101-5109. Reference for composition-based statistics starting in round 2: Schaffer, Alejandro A., L. Aravind, Thomas L. Madden, Sergei Shavirin, John L. Spouge, Yuri I. Wolf, Eugene V. Koonin, and Stephen F. Altschul (2001), "Improving the accuracy of PSI-BLAST protein database searches with composition-based statistics and other refinements", Nucleic Acids Res. 29:2994-3005. Query= batch____ (78 letters) Database: uniref50.fasta 3,077,464 sequences; 1,040,396,356 total letters Searching..................................................done Results from round 1 Score E Sequences producing significant alignments: (bits) Value UniRef50_P64472 Uncharacterized protein ydhI n=103 Tax=Gammaprot... 155 5e-37 UniRef50_C9Y3L9 Uncharacterized protein ydhI n=2 Tax=Enterobacte... 101 6e-21 UniRef50_UPI000190D31E hypothetical protein SentesTyp_27517 n=1 ... 63 3e-09 UniRef50_B4EDQ7 Putative membrane protein n=2 Tax=Burkholderia R... 42 0.007 >UniRef50_P64472 Uncharacterized protein ydhI n=103 Tax=Gammaproteobacteria RepID=YDHI_ECO57 Length = 78 Score = 155 bits (391), Expect = 5e-37, Method: Compositional matrix adjust. Identities = 78/78 (100%), Positives = 78/78 (100%) Query: 1 MKFMLNATGLPLQDLVFGASVYFPPFFKAFAFGFVIWLVVHRLLRGWIYAGDIWHPLLMD 60 MKFMLNATGLPLQDLVFGASVYFPPFFKAFAFGFVIWLVVHRLLRGWIYAGDIWHPLLMD Sbjct: 1 MKFMLNATGLPLQDLVFGASVYFPPFFKAFAFGFVIWLVVHRLLRGWIYAGDIWHPLLMD 60 Query: 61 LSLFAICVCLALAILIAW 78 LSLFAICVCLALAILIAW Sbjct: 61 LSLFAICVCLALAILIAW 78 >UniRef50_C9Y3L9 Uncharacterized protein ydhI n=2 Tax=Enterobacteriaceae RepID=C9Y3L9_CROTZ Length = 74 Score = 101 bits (252), Expect = 6e-21, Method: Compositional matrix adjust. Identities = 49/72 (68%), Positives = 54/72 (75%) Query: 7 ATGLPLQDLVFGASVYFPPFFKAFAFGFVIWLVVHRLLRGWIYAGDIWHPLLMDLSLFAI 66 +G L DL+ GASVYFPP FKA GF IWL+VHRLLR W+YAGDIWHP LMDLSLF + Sbjct: 3 TSGFALSDLIAGASVYFPPLFKAVMLGFFIWLMVHRLLRDWMYAGDIWHPTLMDLSLFIL 62 Query: 67 CVCLALAILIAW 78 V L L ILI W Sbjct: 63 SVSLGLVILIVW 74 >UniRef50_UPI000190D31E hypothetical protein SentesTyp_27517 n=1 Tax=Salmonella enterica subsp. enterica serovar Typhi str. E98-2068 RepID=UPI000190D31E Length = 76 Score = 62.8 bits (151), Expect = 3e-09, Method: Compositional matrix adjust. Identities = 36/61 (59%), Positives = 43/61 (70%) Query: 18 GASVYFPPFFKAFAFGFVIWLVVHRLLRGWIYAGDIWHPLLMDLSLFAICVCLALAILIA 77 G+ + FA GFVIWL +HRLLR IY+ +IWHPLLMDLSLF +CVCL L +LI Sbjct: 16 GSICLLSAILQTFALGFVIWLFIHRLLRDRIYSDEIWHPLLMDLSLFTLCVCLGLVLLIV 75 Query: 78 W 78 W Sbjct: 76 W 76 >UniRef50_B4EDQ7 Putative membrane protein n=2 Tax=Burkholderia RepID=B4EDQ7_BURCJ Length = 67 Score = 41.6 bits (96), Expect = 0.007, Method: Compositional matrix adjust. Identities = 26/61 (42%), Positives = 33/61 (54%), Gaps = 3/61 (4%) Query: 16 VFGASVYFPPFFKAFAFGFVIWLVVHRLLRGWIYAGDIWHPLLMDLSLFAICVCLALAIL 75 VFG VYFP F G +I LV +RLL + +WHP L SL +C+C AL + Sbjct: 8 VFG--VYFPAVMLTFIAGALISLVANRLLSRTRFCEHVWHPSLFRASLM-LCICCALGLT 64 Query: 76 I 76 I Sbjct: 65 I 65 Searching..................................................done Results from round 2 Score E Sequences producing significant alignments: (bits) Value Sequences used in model and found again: UniRef50_P64472 Uncharacterized protein ydhI n=103 Tax=Gammaprot... 98 7e-20 UniRef50_C9Y3L9 Uncharacterized protein ydhI n=2 Tax=Enterobacte... 84 2e-15 UniRef50_UPI000190D31E hypothetical protein SentesTyp_27517 n=1 ... 66 2e-10 Sequences not found previously or not previously below threshold: CONVERGED! >UniRef50_P64472 Uncharacterized protein ydhI n=103 Tax=Gammaproteobacteria RepID=YDHI_ECO57 Length = 78 Score = 98.4 bits (244), Expect = 7e-20, Method: Composition-based stats. Identities = 78/78 (100%), Positives = 78/78 (100%) Query: 1 MKFMLNATGLPLQDLVFGASVYFPPFFKAFAFGFVIWLVVHRLLRGWIYAGDIWHPLLMD 60 MKFMLNATGLPLQDLVFGASVYFPPFFKAFAFGFVIWLVVHRLLRGWIYAGDIWHPLLMD Sbjct: 1 MKFMLNATGLPLQDLVFGASVYFPPFFKAFAFGFVIWLVVHRLLRGWIYAGDIWHPLLMD 60 Query: 61 LSLFAICVCLALAILIAW 78 LSLFAICVCLALAILIAW Sbjct: 61 LSLFAICVCLALAILIAW 78 >UniRef50_C9Y3L9 Uncharacterized protein ydhI n=2 Tax=Enterobacteriaceae RepID=C9Y3L9_CROTZ Length = 74 Score = 83.8 bits (206), Expect = 2e-15, Method: Composition-based stats. Identities = 49/72 (68%), Positives = 54/72 (75%) Query: 7 ATGLPLQDLVFGASVYFPPFFKAFAFGFVIWLVVHRLLRGWIYAGDIWHPLLMDLSLFAI 66 +G L DL+ GASVYFPP FKA GF IWL+VHRLLR W+YAGDIWHP LMDLSLF + Sbjct: 3 TSGFALSDLIAGASVYFPPLFKAVMLGFFIWLMVHRLLRDWMYAGDIWHPTLMDLSLFIL 62 Query: 67 CVCLALAILIAW 78 V L L ILI W Sbjct: 63 SVSLGLVILIVW 74 >UniRef50_UPI000190D31E hypothetical protein SentesTyp_27517 n=1 Tax=Salmonella enterica subsp. enterica serovar Typhi str. E98-2068 RepID=UPI000190D31E Length = 76 Score = 66.4 bits (161), Expect = 2e-10, Method: Composition-based stats. Identities = 36/61 (59%), Positives = 43/61 (70%) Query: 18 GASVYFPPFFKAFAFGFVIWLVVHRLLRGWIYAGDIWHPLLMDLSLFAICVCLALAILIA 77 G+ + FA GFVIWL +HRLLR IY+ +IWHPLLMDLSLF +CVCL L +LI Sbjct: 16 GSICLLSAILQTFALGFVIWLFIHRLLRDRIYSDEIWHPLLMDLSLFTLCVCLGLVLLIV 75 Query: 78 W 78 W Sbjct: 76 W 76 Database: uniref50.fasta Posted date: Mar 8, 2010 10:38 AM Number of letters in database: 1,040,396,356 Number of sequences in database: 3,077,464 Lambda K H 0.332 0.152 0.528 Lambda K H 0.267 0.0470 0.140 Matrix: BLOSUM62 Gap Penalties: Existence: 11, Extension: 1 Number of Hits to DB: 382,366,874 Number of Sequences: 3077464 Number of extensions: 15571488 Number of successful extensions: 75363 Number of sequences better than 1.0e-01: 9 Number of HSP's better than 0.1 without gapping: 8 Number of HSP's successfully gapped in prelim test: 6 Number of HSP's that attempted gapping in prelim test: 75352 Number of HSP's gapped (non-prelim): 15 length of query: 78 length of database: 1,040,396,356 effective HSP length: 49 effective length of query: 29 effective length of database: 889,600,620 effective search space: 25798417980 effective search space used: 25798417980 T: 11 A: 40 X1: 16 ( 7.7 bits) X2: 38 (14.6 bits) X3: 64 (24.7 bits) S1: 39 (21.4 bits) S2: 87 (37.9 bits)