BLASTP 2.2.22 [Sep-27-2009] Reference: Altschul, Stephen F., Thomas L. Madden, Alejandro A. Schaffer, Jinghui Zhang, Zheng Zhang, Webb Miller, and David J. Lipman (1997), "Gapped BLAST and PSI-BLAST: a new generation of protein database search programs", Nucleic Acids Res. 25:3389-3402. Reference for compositional score matrix adjustment: Altschul, Stephen F., John C. Wootton, E. Michael Gertz, Richa Agarwala, Aleksandr Morgulis, Alejandro A. Schaffer, and Yi-Kuo Yu (2005) "Protein database searches using compositionally adjusted substitution matrices", FEBS J. 272:5101-5109. Reference for composition-based statistics starting in round 2: Schaffer, Alejandro A., L. Aravind, Thomas L. Madden, Sergei Shavirin, John L. Spouge, Yuri I. Wolf, Eugene V. Koonin, and Stephen F. Altschul (2001), "Improving the accuracy of PSI-BLAST protein database searches with composition-based statistics and other refinements", Nucleic Acids Res. 29:2994-3005. Query= batch____ (77 letters) Database: uniref50.fasta 3,077,464 sequences; 1,040,396,356 total letters Searching..................................................done Results from round 1 Score E Sequences producing significant alignments: (bits) Value UniRef50_P64457 Uncharacterized protein ydcY n=98 Tax=Enterobact... 153 2e-36 UniRef50_B4EUA3 Putative uncharacterized protein n=3 Tax=Proteus... 65 9e-10 UniRef50_B2PZN6 Putative uncharacterized protein n=4 Tax=Provide... 40 0.025 >UniRef50_P64457 Uncharacterized protein ydcY n=98 Tax=Enterobacteriaceae RepID=YDCY_ECO57 Length = 77 Score = 153 bits (386), Expect = 2e-36, Method: Compositional matrix adjust. Identities = 77/77 (100%), Positives = 77/77 (100%) Query: 1 MSHLDEVIARVDAAIEESVIAHMNELLIALSDDAELSREDRYTQQQRLRTAIAHHGRKHK 60 MSHLDEVIARVDAAIEESVIAHMNELLIALSDDAELSREDRYTQQQRLRTAIAHHGRKHK Sbjct: 1 MSHLDEVIARVDAAIEESVIAHMNELLIALSDDAELSREDRYTQQQRLRTAIAHHGRKHK 60 Query: 61 EDMEARHEQLTKGGTIL 77 EDMEARHEQLTKGGTIL Sbjct: 61 EDMEARHEQLTKGGTIL 77 >UniRef50_B4EUA3 Putative uncharacterized protein n=3 Tax=Proteus RepID=B4EUA3_PROMH Length = 60 Score = 64.7 bits (156), Expect = 9e-10, Method: Compositional matrix adjust. Identities = 32/60 (53%), Positives = 40/60 (66%) Query: 1 MSHLDEVIARVDAAIEESVIAHMNELLIALSDDAELSREDRYTQQQRLRTAIAHHGRKHK 60 MSH +EVI +VD AI + I MNELL+ L D ++ RY QQ+RLRTAI HHG K + Sbjct: 1 MSHYEEVIKQVDEAIASASIQTMNELLVELGKDNTIAFAQRYEQQERLRTAIFHHGEKQR 60 >UniRef50_B2PZN6 Putative uncharacterized protein n=4 Tax=Providencia RepID=B2PZN6_PROST Length = 58 Score = 40.0 bits (92), Expect = 0.025, Method: Compositional matrix adjust. Identities = 28/55 (50%), Positives = 38/55 (69%) Query: 1 MSHLDEVIARVDAAIEESVIAHMNELLIALSDDAELSREDRYTQQQRLRTAIAHH 55 M H D ++ VDA + E+ I++MN LL LS DA+L++E R+ QQQRLR AI H Sbjct: 1 MQHYDNIVKHVDALLAENSISNMNILLAQLSHDAQLTQEQRFEQQQRLRKAIFKH 55 Searching..................................................done Results from round 2 Score E Sequences producing significant alignments: (bits) Value Sequences used in model and found again: UniRef50_P64457 Uncharacterized protein ydcY n=98 Tax=Enterobact... 105 5e-22 UniRef50_B4EUA3 Putative uncharacterized protein n=3 Tax=Proteus... 82 4e-15 Sequences not found previously or not previously below threshold: UniRef50_B2PZN6 Putative uncharacterized protein n=4 Tax=Provide... 42 0.004 CONVERGED! >UniRef50_P64457 Uncharacterized protein ydcY n=98 Tax=Enterobacteriaceae RepID=YDCY_ECO57 Length = 77 Score = 105 bits (262), Expect = 5e-22, Method: Composition-based stats. Identities = 77/77 (100%), Positives = 77/77 (100%) Query: 1 MSHLDEVIARVDAAIEESVIAHMNELLIALSDDAELSREDRYTQQQRLRTAIAHHGRKHK 60 MSHLDEVIARVDAAIEESVIAHMNELLIALSDDAELSREDRYTQQQRLRTAIAHHGRKHK Sbjct: 1 MSHLDEVIARVDAAIEESVIAHMNELLIALSDDAELSREDRYTQQQRLRTAIAHHGRKHK 60 Query: 61 EDMEARHEQLTKGGTIL 77 EDMEARHEQLTKGGTIL Sbjct: 61 EDMEARHEQLTKGGTIL 77 >UniRef50_B4EUA3 Putative uncharacterized protein n=3 Tax=Proteus RepID=B4EUA3_PROMH Length = 60 Score = 82.4 bits (202), Expect = 4e-15, Method: Composition-based stats. Identities = 32/60 (53%), Positives = 40/60 (66%) Query: 1 MSHLDEVIARVDAAIEESVIAHMNELLIALSDDAELSREDRYTQQQRLRTAIAHHGRKHK 60 MSH +EVI +VD AI + I MNELL+ L D ++ RY QQ+RLRTAI HHG K + Sbjct: 1 MSHYEEVIKQVDEAIASASIQTMNELLVELGKDNTIAFAQRYEQQERLRTAIFHHGEKQR 60 >UniRef50_B2PZN6 Putative uncharacterized protein n=4 Tax=Providencia RepID=B2PZN6_PROST Length = 58 Score = 42.4 bits (98), Expect = 0.004, Method: Composition-based stats. Identities = 28/57 (49%), Positives = 38/57 (66%) Query: 1 MSHLDEVIARVDAAIEESVIAHMNELLIALSDDAELSREDRYTQQQRLRTAIAHHGR 57 M H D ++ VDA + E+ I++MN LL LS DA+L++E R+ QQQRLR AI H Sbjct: 1 MQHYDNIVKHVDALLAENSISNMNILLAQLSHDAQLTQEQRFEQQQRLRKAIFKHHE 57 Database: uniref50.fasta Posted date: Mar 8, 2010 10:38 AM Number of letters in database: 1,040,396,356 Number of sequences in database: 3,077,464 Lambda K H 0.311 0.132 0.332 Lambda K H 0.267 0.0403 0.140 Matrix: BLOSUM62 Gap Penalties: Existence: 11, Extension: 1 Number of Hits to DB: 258,911,505 Number of Sequences: 3077464 Number of extensions: 7624323 Number of successful extensions: 40048 Number of sequences better than 1.0e-01: 3 Number of HSP's better than 0.1 without gapping: 6 Number of HSP's successfully gapped in prelim test: 0 Number of HSP's that attempted gapping in prelim test: 40042 Number of HSP's gapped (non-prelim): 6 length of query: 77 length of database: 1,040,396,356 effective HSP length: 48 effective length of query: 29 effective length of database: 892,678,084 effective search space: 25887664436 effective search space used: 25887664436 T: 11 A: 40 X1: 16 ( 7.2 bits) X2: 38 (14.6 bits) X3: 64 (24.7 bits) S1: 41 (21.3 bits) S2: 87 (38.1 bits)