BLASTP 2.2.22 [Sep-27-2009] Reference: Altschul, Stephen F., Thomas L. Madden, Alejandro A. Schaffer, Jinghui Zhang, Zheng Zhang, Webb Miller, and David J. Lipman (1997), "Gapped BLAST and PSI-BLAST: a new generation of protein database search programs", Nucleic Acids Res. 25:3389-3402. Reference for compositional score matrix adjustment: Altschul, Stephen F., John C. Wootton, E. Michael Gertz, Richa Agarwala, Aleksandr Morgulis, Alejandro A. Schaffer, and Yi-Kuo Yu (2005) "Protein database searches using compositionally adjusted substitution matrices", FEBS J. 272:5101-5109. Reference for composition-based statistics starting in round 2: Schaffer, Alejandro A., L. Aravind, Thomas L. Madden, Sergei Shavirin, John L. Spouge, Yuri I. Wolf, Eugene V. Koonin, and Stephen F. Altschul (2001), "Improving the accuracy of PSI-BLAST protein database searches with composition-based statistics and other refinements", Nucleic Acids Res. 29:2994-3005. Query= batch____ (222 letters) Database: uniref50.fasta 3,077,464 sequences; 1,040,396,356 total letters Searching..................................................done Results from round 1 Score E Sequences producing significant alignments: (bits) Value UniRef50_P77180 Uncharacterized protein ykgH n=35 Tax=Enterobact... 461 e-129 UniRef50_B1LS00 Putative uncharacterized protein n=1 Tax=Escheri... 103 6e-21 UniRef50_A9MK41 Putative uncharacterized protein n=34 Tax=Salmon... 46 0.001 >UniRef50_P77180 Uncharacterized protein ykgH n=35 Tax=Enterobacteriaceae RepID=YKGH_ECOLI Length = 222 Score = 461 bits (1186), Expect = e-129, Method: Compositional matrix adjust. Identities = 222/222 (100%), Positives = 222/222 (100%) Query: 1 MREQIKQDIDLIEILFYLKKKIRVILFIMAICMAMVLLFLYINKDNIKVIYSLKINQTTP 60 MREQIKQDIDLIEILFYLKKKIRVILFIMAICMAMVLLFLYINKDNIKVIYSLKINQTTP Sbjct: 1 MREQIKQDIDLIEILFYLKKKIRVILFIMAICMAMVLLFLYINKDNIKVIYSLKINQTTP 60 Query: 61 GILVSCDSNNNFACQTTMTEDVIQRITTFFHTSPDVKNREIRLEWSGDKRALPTAEEEIS 120 GILVSCDSNNNFACQTTMTEDVIQRITTFFHTSPDVKNREIRLEWSGDKRALPTAEEEIS Sbjct: 61 GILVSCDSNNNFACQTTMTEDVIQRITTFFHTSPDVKNREIRLEWSGDKRALPTAEEEIS 120 Query: 121 RVQASIIKWYASEYHNGRQVLDEIQTPSAINSELYTKMIYLTRNWSLYPNGDGCVTISSP 180 RVQASIIKWYASEYHNGRQVLDEIQTPSAINSELYTKMIYLTRNWSLYPNGDGCVTISSP Sbjct: 121 RVQASIIKWYASEYHNGRQVLDEIQTPSAINSELYTKMIYLTRNWSLYPNGDGCVTISSP 180 Query: 181 EIKNKYPAAICLALGFFLSIVISVMFCLVKKMVDEYQQNSGQ 222 EIKNKYPAAICLALGFFLSIVISVMFCLVKKMVDEYQQNSGQ Sbjct: 181 EIKNKYPAAICLALGFFLSIVISVMFCLVKKMVDEYQQNSGQ 222 >UniRef50_B1LS00 Putative uncharacterized protein n=1 Tax=Escherichia coli SMS-3-5 RepID=B1LS00_ECOSM Length = 220 Score = 103 bits (256), Expect = 6e-21, Method: Compositional matrix adjust. Identities = 57/213 (26%), Positives = 117/213 (54%), Gaps = 8/213 (3%) Query: 8 DIDLIEILFYLKKKIRVILFIMAICMAMVLLFLYINKDNIKVIYSLKINQTTPGILVSCD 67 DID+IE+ +LKKKI I+ + + + + +FL INK+ I + Y L + +P ++++C Sbjct: 7 DIDIIELFLFLKKKIVSIILFVVLSLILSSIFLLINKNKINIKYELNMIANSPSMIINCG 66 Query: 68 SNNNFACQTTMTEDVIQRITTFFHTSPDVKNREIRLEWSGDKRALPTAEEEISRVQASII 127 S+ F C+ + + +I + ++ D + ++I L W+GD R P E++ + +I Sbjct: 67 SD--FYCKANIIQSIINNKSKSITSNIDERGKKIILSWAGDDRYKPLVTAEVNAIHKAID 124 Query: 128 KWYASEYHNGRQVLDEIQTPSAIN-SELYTKMIYLTRNWSLYPNGD-GCVTISSPEIKNK 185 WY +Y + +++ Q + IN +E Y K+ LT+ + G+ ++I+ + K Sbjct: 125 DWYIQDYKTYKSIVNN-QDSNYINGTETYAKVALLTK---INTAGEKNFISINDEIVNKK 180 Query: 186 YPAAICLALGFFLSIVISVMFCLVKKMVDEYQQ 218 Y A+ +AL ++++ S + ++K+ + EY+ Sbjct: 181 YKPALIMALTLIIAVIFSFSYHIIKRSILEYKN 213 >UniRef50_A9MK41 Putative uncharacterized protein n=34 Tax=Salmonella enterica RepID=A9MK41_SALAR Length = 227 Score = 45.8 bits (107), Expect = 0.001, Method: Compositional matrix adjust. Identities = 53/219 (24%), Positives = 96/219 (43%), Gaps = 23/219 (10%) Query: 8 DIDLIEILFYLKKKIRVILFIMAICMAMVLLFLYINKDNIKVIYSLKINQTTPGILVSCD 67 +IDL++I +L + I+ I + M +F+ + D V +K N TP C Sbjct: 7 EIDLVDISLFLIRNWLSIVMTTFIFVVMGYVFVSVQHDTKIVSIKIKPNLDTPATYALCG 66 Query: 68 SNNNFACQTTMTEDVIQRITTFF------HTSPDVKNREIRLEWSGDKRALPTAEEEISR 121 N+ C+TT+ ++ R + F S + KN+ I + G K ++ IS Sbjct: 67 YVNDIQCKTTV---ILNRFSRFLPADYKNKISFEAKNQLILFKAQGRKESVNNI---ISA 120 Query: 122 VQASIIK---WYASEYHNGRQVLDEIQTPSAINSELYTKMIYLTRNWSLYPNGDGCVTIS 178 ++ S+ K WY + R L + + + +E Y K+ L+ N + Sbjct: 121 LKNSVDKMAVWYIDDGDIKRYSLHK----NMLQTETYAKLSLLSDAVRNNINIEVLDVSI 176 Query: 179 SPEIKNKYPAAICLALGFFLSIVISVMFCLVKKMVDEYQ 217 +P+ K + A+C +GF S I+ + C K+ + EY+ Sbjct: 177 TPKYKMRLVLALCGLMGFIFS--IATLLC--KRALTEYR 211 Searching..................................................done Results from round 2 Score E Sequences producing significant alignments: (bits) Value Sequences used in model and found again: UniRef50_P77180 Uncharacterized protein ykgH n=35 Tax=Enterobact... 302 8e-81 UniRef50_B1LS00 Putative uncharacterized protein n=1 Tax=Escheri... 210 3e-53 UniRef50_A9MK41 Putative uncharacterized protein n=34 Tax=Salmon... 208 1e-52 Sequences not found previously or not previously below threshold: CONVERGED! >UniRef50_P77180 Uncharacterized protein ykgH n=35 Tax=Enterobacteriaceae RepID=YKGH_ECOLI Length = 222 Score = 302 bits (772), Expect = 8e-81, Method: Composition-based stats. Identities = 222/222 (100%), Positives = 222/222 (100%) Query: 1 MREQIKQDIDLIEILFYLKKKIRVILFIMAICMAMVLLFLYINKDNIKVIYSLKINQTTP 60 MREQIKQDIDLIEILFYLKKKIRVILFIMAICMAMVLLFLYINKDNIKVIYSLKINQTTP Sbjct: 1 MREQIKQDIDLIEILFYLKKKIRVILFIMAICMAMVLLFLYINKDNIKVIYSLKINQTTP 60 Query: 61 GILVSCDSNNNFACQTTMTEDVIQRITTFFHTSPDVKNREIRLEWSGDKRALPTAEEEIS 120 GILVSCDSNNNFACQTTMTEDVIQRITTFFHTSPDVKNREIRLEWSGDKRALPTAEEEIS Sbjct: 61 GILVSCDSNNNFACQTTMTEDVIQRITTFFHTSPDVKNREIRLEWSGDKRALPTAEEEIS 120 Query: 121 RVQASIIKWYASEYHNGRQVLDEIQTPSAINSELYTKMIYLTRNWSLYPNGDGCVTISSP 180 RVQASIIKWYASEYHNGRQVLDEIQTPSAINSELYTKMIYLTRNWSLYPNGDGCVTISSP Sbjct: 121 RVQASIIKWYASEYHNGRQVLDEIQTPSAINSELYTKMIYLTRNWSLYPNGDGCVTISSP 180 Query: 181 EIKNKYPAAICLALGFFLSIVISVMFCLVKKMVDEYQQNSGQ 222 EIKNKYPAAICLALGFFLSIVISVMFCLVKKMVDEYQQNSGQ Sbjct: 181 EIKNKYPAAICLALGFFLSIVISVMFCLVKKMVDEYQQNSGQ 222 >UniRef50_B1LS00 Putative uncharacterized protein n=1 Tax=Escherichia coli SMS-3-5 RepID=B1LS00_ECOSM Length = 220 Score = 210 bits (534), Expect = 3e-53, Method: Composition-based stats. Identities = 58/218 (26%), Positives = 118/218 (54%), Gaps = 8/218 (3%) Query: 3 EQIKQDIDLIEILFYLKKKIRVILFIMAICMAMVLLFLYINKDNIKVIYSLKINQTTPGI 62 E DID+IE+ +LKKKI I+ + + + + +FL INK+ I + Y L + +P + Sbjct: 2 ENNSPDIDIIELFLFLKKKIVSIILFVVLSLILSSIFLLINKNKINIKYELNMIANSPSM 61 Query: 63 LVSCDSNNNFACQTTMTEDVIQRITTFFHTSPDVKNREIRLEWSGDKRALPTAEEEISRV 122 +++C S +F C+ + + +I + ++ D + ++I L W+GD R P E++ + Sbjct: 62 IINCGS--DFYCKANIIQSIINNKSKSITSNIDERGKKIILSWAGDDRYKPLVTAEVNAI 119 Query: 123 QASIIKWYASEYHNGRQVLDEIQTPSAIN-SELYTKMIYLTRNWSLYPNGD-GCVTISSP 180 +I WY +Y + +++ Q + IN +E Y K+ LT+ + G+ ++I+ Sbjct: 120 HKAIDDWYIQDYKTYKSIVNN-QDSNYINGTETYAKVALLTK---INTAGEKNFISINDE 175 Query: 181 EIKNKYPAAICLALGFFLSIVISVMFCLVKKMVDEYQQ 218 + KY A+ +AL ++++ S + ++K+ + EY+ Sbjct: 176 IVNKKYKPALIMALTLIIAVIFSFSYHIIKRSILEYKN 213 >UniRef50_A9MK41 Putative uncharacterized protein n=34 Tax=Salmonella enterica RepID=A9MK41_SALAR Length = 227 Score = 208 bits (529), Expect = 1e-52, Method: Composition-based stats. Identities = 53/219 (24%), Positives = 96/219 (43%), Gaps = 23/219 (10%) Query: 8 DIDLIEILFYLKKKIRVILFIMAICMAMVLLFLYINKDNIKVIYSLKINQTTPGILVSCD 67 +IDL++I +L + I+ I + M +F+ + D V +K N TP C Sbjct: 7 EIDLVDISLFLIRNWLSIVMTTFIFVVMGYVFVSVQHDTKIVSIKIKPNLDTPATYALCG 66 Query: 68 SNNNFACQTTMTEDVIQRITTFF------HTSPDVKNREIRLEWSGDKRALPTAEEEISR 121 N+ C+TT+ ++ R + F S + KN+ I + G K ++ IS Sbjct: 67 YVNDIQCKTTV---ILNRFSRFLPADYKNKISFEAKNQLILFKAQGRKESVNNI---ISA 120 Query: 122 VQASIIK---WYASEYHNGRQVLDEIQTPSAINSELYTKMIYLTRNWSLYPNGDGCVTIS 178 ++ S+ K WY + R L + + + +E Y K+ L+ N + Sbjct: 121 LKNSVDKMAVWYIDDGDIKRYSLHK----NMLQTETYAKLSLLSDAVRNNINIEVLDVSI 176 Query: 179 SPEIKNKYPAAICLALGFFLSIVISVMFCLVKKMVDEYQ 217 +P+ K + A+C +GF S I+ + C K+ + EY+ Sbjct: 177 TPKYKMRLVLALCGLMGFIFS--IATLLC--KRALTEYR 211 Database: uniref50.fasta Posted date: Mar 8, 2010 10:38 AM Number of letters in database: 1,040,396,356 Number of sequences in database: 3,077,464 Lambda K H 0.319 0.128 0.319 Lambda K H 0.267 0.0394 0.140 Matrix: BLOSUM62 Gap Penalties: Existence: 11, Extension: 1 Number of Hits to DB: 725,548,102 Number of Sequences: 3077464 Number of extensions: 25127377 Number of successful extensions: 85610 Number of sequences better than 1.0e-01: 13 Number of HSP's better than 0.1 without gapping: 6 Number of HSP's successfully gapped in prelim test: 11 Number of HSP's that attempted gapping in prelim test: 85592 Number of HSP's gapped (non-prelim): 18 length of query: 222 length of database: 1,040,396,356 effective HSP length: 124 effective length of query: 98 effective length of database: 658,790,820 effective search space: 64561500360 effective search space used: 64561500360 T: 11 A: 40 X1: 16 ( 7.4 bits) X2: 38 (14.6 bits) X3: 64 (24.7 bits) S1: 41 (21.8 bits) S2: 90 (39.3 bits)