BLASTP 2.2.22 [Sep-27-2009] Reference: Altschul, Stephen F., Thomas L. Madden, Alejandro A. Schaffer, Jinghui Zhang, Zheng Zhang, Webb Miller, and David J. Lipman (1997), "Gapped BLAST and PSI-BLAST: a new generation of protein database search programs", Nucleic Acids Res. 25:3389-3402. Reference for compositional score matrix adjustment: Altschul, Stephen F., John C. Wootton, E. Michael Gertz, Richa Agarwala, Aleksandr Morgulis, Alejandro A. Schaffer, and Yi-Kuo Yu (2005) "Protein database searches using compositionally adjusted substitution matrices", FEBS J. 272:5101-5109. Reference for composition-based statistics starting in round 2: Schaffer, Alejandro A., L. Aravind, Thomas L. Madden, Sergei Shavirin, John L. Spouge, Yuri I. Wolf, Eugene V. Koonin, and Stephen F. Altschul (2001), "Improving the accuracy of PSI-BLAST protein database searches with composition-based statistics and other refinements", Nucleic Acids Res. 29:2994-3005. Query= batch____ (97 letters) Database: uniref50.fasta 3,077,464 sequences; 1,040,396,356 total letters Searching..................................................done Results from round 1 Score E Sequences producing significant alignments: (bits) Value UniRef50_Q31US3 Uncharacterized protein yicS n=28 Tax=Enterobact... 204 8e-52 UniRef50_Q0TB41 Uncharacterized protein yicS n=62 Tax=Enterobact... 181 5e-45 UniRef50_A4W4V0 Putative uncharacterized protein n=2 Tax=Enterob... 81 9e-15 UniRef50_C4X3B7 Putative secreted protein n=4 Tax=Klebsiella Rep... 68 7e-11 UniRef50_C8QEH1 Putative uncharacterized protein n=1 Tax=Pantoea... 52 9e-06 >UniRef50_Q31US3 Uncharacterized protein yicS n=28 Tax=Enterobacteriaceae RepID=YICS_SHIBS Length = 97 Score = 204 bits (518), Expect = 8e-52, Method: Compositional matrix adjust. Identities = 97/97 (100%), Positives = 97/97 (100%) Query: 1 MKPTTLLLIFTFFAMPGIVYAESPFSSLQSAKEKTTVLQDLRKICTPQASLSDEAWEKLM 60 MKPTTLLLIFTFFAMPGIVYAESPFSSLQSAKEKTTVLQDLRKICTPQASLSDEAWEKLM Sbjct: 1 MKPTTLLLIFTFFAMPGIVYAESPFSSLQSAKEKTTVLQDLRKICTPQASLSDEAWEKLM 60 Query: 61 LSDENNKQHIREAIVAMERNNQSNYWEALGKVECPDM 97 LSDENNKQHIREAIVAMERNNQSNYWEALGKVECPDM Sbjct: 61 LSDENNKQHIREAIVAMERNNQSNYWEALGKVECPDM 97 >UniRef50_Q0TB41 Uncharacterized protein yicS n=62 Tax=Enterobacteriaceae RepID=YICS_ECOL5 Length = 97 Score = 181 bits (460), Expect = 5e-45, Method: Compositional matrix adjust. Identities = 87/97 (89%), Positives = 88/97 (90%) Query: 1 MKPTTLLLIFTFFAMPGIVYAESPFSSLQSAKEKTTVLQDLRKICTPQASLSDEAWEKLM 60 MKPT LL+I F P I AESPFSSLQSAKEKTTVLQDLRKICTPQASLSDEAWEKLM Sbjct: 1 MKPTMLLMITVFLIFPAISQAESPFSSLQSAKEKTTVLQDLRKICTPQASLSDEAWEKLM 60 Query: 61 LSDENNKQHIREAIVAMERNNQSNYWEALGKVECPDM 97 LSDENNKQHIREAIVAMERNNQSNYWEALGKVECPDM Sbjct: 61 LSDENNKQHIREAIVAMERNNQSNYWEALGKVECPDM 97 >UniRef50_A4W4V0 Putative uncharacterized protein n=2 Tax=Enterobacter RepID=A4W4V0_ENT38 Length = 98 Score = 81.3 bits (199), Expect = 9e-15, Method: Compositional matrix adjust. Identities = 35/94 (37%), Positives = 57/94 (60%) Query: 1 MKPTTLLLIFTFFAMPGIVYAESPFSSLQSAKEKTTVLQDLRKICTPQASLSDEAWEKLM 60 MK L + +A+ + K K V++D++K+C+PQ+ +D+ W+ ++ Sbjct: 1 MKVAHLFCLVVCLLFAAFAHAKETRDPAKDEKIKQVVMKDIKKVCSPQSKQTDKQWQTMI 60 Query: 61 LSDENNKQHIREAIVAMERNNQSNYWEALGKVEC 94 LS E NK I+ A++AMER+N NYWEA+GKV+C Sbjct: 61 LSSEANKLLIKNAVLAMERDNLDNYWEAVGKVDC 94 >UniRef50_C4X3B7 Putative secreted protein n=4 Tax=Klebsiella RepID=C4X3B7_KLEPN Length = 116 Score = 68.2 bits (165), Expect = 7e-11, Method: Compositional matrix adjust. Identities = 35/95 (36%), Positives = 56/95 (58%), Gaps = 1/95 (1%) Query: 1 MKPTTLLLIFTFFAMPGIVYAESPFSSLQSAKEKTTVLQDLRKICTPQASLSDEAWEKLM 60 MK ++ + T + +AESP SLQ ++K VL+ +++ C P + LSD + + Sbjct: 21 MKACWMVCLVTALSATS-AWAESPLQSLQFEQQKQQVLKAVKEKCAPASHLSDNDFANKV 79 Query: 61 LSDENNKQHIREAIVAMERNNQSNYWEALGKVECP 95 L+ + NK +REA +A ERNNQ +Y A+ K+ CP Sbjct: 80 LATDGNKNAVREATLAKERNNQKSYQAAIDKIVCP 114 >UniRef50_C8QEH1 Putative uncharacterized protein n=1 Tax=Pantoea sp. At-9b RepID=C8QEH1_9ENTR Length = 102 Score = 51.6 bits (122), Expect = 9e-06, Method: Compositional matrix adjust. Identities = 25/93 (26%), Positives = 50/93 (53%), Gaps = 2/93 (2%) Query: 4 TTLLLIFTFFAMPGIVYAESPFSSLQSAKEKTTVLQDLRKICTPQASLSDEAWEKLMLSD 63 T L+LI PG+ A+S + +L A + ++ DL+ C A ++DE ++ L+ Sbjct: 5 TLLVLIINLCPFPGM--AKSAYDNLAYALREQQIIGDLKLHCHIPAGVTDEHIRQVFLNS 62 Query: 64 ENNKQHIREAIVAMERNNQSNYWEALGKVECPD 96 ++N + +A A++ + +Y + + +V CPD Sbjct: 63 KDNHDAVIDAASALKAQHHDSYQQQIARVRCPD 95 Searching..................................................done Results from round 2 Score E Sequences producing significant alignments: (bits) Value Sequences used in model and found again: UniRef50_Q31US3 Uncharacterized protein yicS n=28 Tax=Enterobact... 150 1e-35 UniRef50_Q0TB41 Uncharacterized protein yicS n=62 Tax=Enterobact... 145 4e-34 UniRef50_A4W4V0 Putative uncharacterized protein n=2 Tax=Enterob... 127 1e-28 UniRef50_C8QEH1 Putative uncharacterized protein n=1 Tax=Pantoea... 126 2e-28 UniRef50_C4X3B7 Putative secreted protein n=4 Tax=Klebsiella Rep... 121 8e-27 Sequences not found previously or not previously below threshold: CONVERGED! >UniRef50_Q31US3 Uncharacterized protein yicS n=28 Tax=Enterobacteriaceae RepID=YICS_SHIBS Length = 97 Score = 150 bits (379), Expect = 1e-35, Method: Composition-based stats. Identities = 97/97 (100%), Positives = 97/97 (100%) Query: 1 MKPTTLLLIFTFFAMPGIVYAESPFSSLQSAKEKTTVLQDLRKICTPQASLSDEAWEKLM 60 MKPTTLLLIFTFFAMPGIVYAESPFSSLQSAKEKTTVLQDLRKICTPQASLSDEAWEKLM Sbjct: 1 MKPTTLLLIFTFFAMPGIVYAESPFSSLQSAKEKTTVLQDLRKICTPQASLSDEAWEKLM 60 Query: 61 LSDENNKQHIREAIVAMERNNQSNYWEALGKVECPDM 97 LSDENNKQHIREAIVAMERNNQSNYWEALGKVECPDM Sbjct: 61 LSDENNKQHIREAIVAMERNNQSNYWEALGKVECPDM 97 >UniRef50_Q0TB41 Uncharacterized protein yicS n=62 Tax=Enterobacteriaceae RepID=YICS_ECOL5 Length = 97 Score = 145 bits (365), Expect = 4e-34, Method: Composition-based stats. Identities = 87/97 (89%), Positives = 88/97 (90%) Query: 1 MKPTTLLLIFTFFAMPGIVYAESPFSSLQSAKEKTTVLQDLRKICTPQASLSDEAWEKLM 60 MKPT LL+I F P I AESPFSSLQSAKEKTTVLQDLRKICTPQASLSDEAWEKLM Sbjct: 1 MKPTMLLMITVFLIFPAISQAESPFSSLQSAKEKTTVLQDLRKICTPQASLSDEAWEKLM 60 Query: 61 LSDENNKQHIREAIVAMERNNQSNYWEALGKVECPDM 97 LSDENNKQHIREAIVAMERNNQSNYWEALGKVECPDM Sbjct: 61 LSDENNKQHIREAIVAMERNNQSNYWEALGKVECPDM 97 >UniRef50_A4W4V0 Putative uncharacterized protein n=2 Tax=Enterobacter RepID=A4W4V0_ENT38 Length = 98 Score = 127 bits (318), Expect = 1e-28, Method: Composition-based stats. Identities = 35/94 (37%), Positives = 57/94 (60%) Query: 1 MKPTTLLLIFTFFAMPGIVYAESPFSSLQSAKEKTTVLQDLRKICTPQASLSDEAWEKLM 60 MK L + +A+ + K K V++D++K+C+PQ+ +D+ W+ ++ Sbjct: 1 MKVAHLFCLVVCLLFAAFAHAKETRDPAKDEKIKQVVMKDIKKVCSPQSKQTDKQWQTMI 60 Query: 61 LSDENNKQHIREAIVAMERNNQSNYWEALGKVEC 94 LS E NK I+ A++AMER+N NYWEA+GKV+C Sbjct: 61 LSSEANKLLIKNAVLAMERDNLDNYWEAVGKVDC 94 >UniRef50_C8QEH1 Putative uncharacterized protein n=1 Tax=Pantoea sp. At-9b RepID=C8QEH1_9ENTR Length = 102 Score = 126 bits (316), Expect = 2e-28, Method: Composition-based stats. Identities = 25/93 (26%), Positives = 50/93 (53%), Gaps = 2/93 (2%) Query: 4 TTLLLIFTFFAMPGIVYAESPFSSLQSAKEKTTVLQDLRKICTPQASLSDEAWEKLMLSD 63 T L+LI PG+ A+S + +L A + ++ DL+ C A ++DE ++ L+ Sbjct: 5 TLLVLIINLCPFPGM--AKSAYDNLAYALREQQIIGDLKLHCHIPAGVTDEHIRQVFLNS 62 Query: 64 ENNKQHIREAIVAMERNNQSNYWEALGKVECPD 96 ++N + +A A++ + +Y + + +V CPD Sbjct: 63 KDNHDAVIDAASALKAQHHDSYQQQIARVRCPD 95 >UniRef50_C4X3B7 Putative secreted protein n=4 Tax=Klebsiella RepID=C4X3B7_KLEPN Length = 116 Score = 121 bits (303), Expect = 8e-27, Method: Composition-based stats. Identities = 35/95 (36%), Positives = 56/95 (58%), Gaps = 1/95 (1%) Query: 1 MKPTTLLLIFTFFAMPGIVYAESPFSSLQSAKEKTTVLQDLRKICTPQASLSDEAWEKLM 60 MK ++ + T + +AESP SLQ ++K VL+ +++ C P + LSD + + Sbjct: 21 MKACWMVCLVTALSATS-AWAESPLQSLQFEQQKQQVLKAVKEKCAPASHLSDNDFANKV 79 Query: 61 LSDENNKQHIREAIVAMERNNQSNYWEALGKVECP 95 L+ + NK +REA +A ERNNQ +Y A+ K+ CP Sbjct: 80 LATDGNKNAVREATLAKERNNQKSYQAAIDKIVCP 114 Database: uniref50.fasta Posted date: Mar 8, 2010 10:38 AM Number of letters in database: 1,040,396,356 Number of sequences in database: 3,077,464 Lambda K H 0.305 0.123 0.314 Lambda K H 0.267 0.0389 0.140 Matrix: BLOSUM62 Gap Penalties: Existence: 11, Extension: 1 Number of Hits to DB: 304,010,630 Number of Sequences: 3077464 Number of extensions: 8393372 Number of successful extensions: 27584 Number of sequences better than 1.0e-01: 5 Number of HSP's better than 0.1 without gapping: 10 Number of HSP's successfully gapped in prelim test: 0 Number of HSP's that attempted gapping in prelim test: 27571 Number of HSP's gapped (non-prelim): 10 length of query: 97 length of database: 1,040,396,356 effective HSP length: 66 effective length of query: 31 effective length of database: 837,283,732 effective search space: 25955795692 effective search space used: 25955795692 T: 11 A: 40 X1: 16 ( 7.0 bits) X2: 38 (14.6 bits) X3: 64 (24.7 bits) S1: 41 (21.0 bits) S2: 87 (38.2 bits)