BLASTP 2.2.22 [Sep-27-2009] Reference: Altschul, Stephen F., Thomas L. Madden, Alejandro A. Schaffer, Jinghui Zhang, Zheng Zhang, Webb Miller, and David J. Lipman (1997), "Gapped BLAST and PSI-BLAST: a new generation of protein database search programs", Nucleic Acids Res. 25:3389-3402. Reference for composition-based statistics: Schaffer, Alejandro A., L. Aravind, Thomas L. Madden, Sergei Shavirin, John L. Spouge, Yuri I. Wolf, Eugene V. Koonin, and Stephen F. Altschul (2001), "Improving the accuracy of PSI-BLAST protein database searches with composition-based statistics and other refinements", Nucleic Acids Res. 29:2994-3005. Query= batch____ (71 letters) Database: uniref50.fasta 3,077,464 sequences; 1,040,396,356 total letters Searching..................................................done Results from round 1 Score E Sequences producing significant alignments: (bits) Value UniRef50_B1XBH4 Lysis S family protein n=2 Tax=Enterobacteriacea... 97 2e-19 UniRef50_C8SXC9 Putative uncharacterized protein n=1 Tax=Klebsie... 89 4e-17 UniRef50_C8T826 Putative uncharacterized protein n=1 Tax=Klebsie... 89 4e-17 UniRef50_B5YTY8 Conserved domain protein n=2 Tax=Escherichia col... 80 2e-14 UniRef50_B4AC67 Conserved domain protein n=1 Tax=Salmonella ente... 79 3e-14 UniRef50_D2AA42 Lysis protein S n=2 Tax=Shigella flexneri RepID=... 44 0.002 UniRef50_Q1I0Z2 Hol n=1 Tax=Pasteurella phage F108 RepID=Q1I0Z2_... 43 0.003 UniRef50_B2HW61 Putative uncharacterized protein n=2 Tax=Acineto... 42 0.005 UniRef50_Q4QK06 Putative uncharacterized protein n=7 Tax=Haemoph... 41 0.015 UniRef50_A7MNN3 Putative uncharacterized protein n=1 Tax=Cronoba... 40 0.024 UniRef50_Q7Y3V4 Putative uncharacterized protein n=1 Tax=Yersini... 40 0.030 UniRef50_A6XRP5 Conserved domain protein n=1 Tax=Vibrio cholerae... 39 0.042 >UniRef50_B1XBH4 Lysis S family protein n=2 Tax=Enterobacteriaceae RepID=B1XBH4_ECODH Length = 74 Score = 96.8 bits (239), Expect = 2e-19, Method: Composition-based stats. Identities = 37/69 (53%), Positives = 47/69 (68%), Gaps = 3/69 (4%) Query: 1 MKSMDKLTTGVAYGTSAGSAGYWFLQLLDKVTPSQWAAIGVLGSLVFGLLTYLTNLYFKI 60 M MDKLTTG AYG SAGS +L+ +P QW AIGVL ++ ++TYLTNLYFKI Sbjct: 1 MYRMDKLTTGAAYGASAGSI---LNGMLNAYSPEQWNAIGVLVGIIIAVMTYLTNLYFKI 57 Query: 61 KEDKRKAAR 69 +ED R++ Sbjct: 58 REDNRRSRS 66 >UniRef50_C8SXC9 Putative uncharacterized protein n=1 Tax=Klebsiella pneumoniae subsp. rhinoscleromatis ATCC 13884 RepID=C8SXC9_KLEPR Length = 82 Score = 89.0 bits (219), Expect = 4e-17, Method: Composition-based stats. Identities = 20/61 (32%), Positives = 30/61 (49%) Query: 1 MKSMDKLTTGVAYGTSAGSAGYWFLQLLDKVTPSQWAAIGVLGSLVFGLLTYLTNLYFKI 60 MK DK+ + +Y +S G + D W I + ++ G+ TYLTNLYFK Sbjct: 1 MKMPDKIFSAASYCSSGGLICTGLARTYDWFHGLDWNFIALASGVIIGVATYLTNLYFKR 60 Query: 61 K 61 + Sbjct: 61 R 61 >UniRef50_C8T826 Putative uncharacterized protein n=1 Tax=Klebsiella pneumoniae subsp. rhinoscleromatis ATCC 13884 RepID=C8T826_KLEPR Length = 71 Score = 89.0 bits (219), Expect = 4e-17, Method: Composition-based stats. Identities = 37/71 (52%), Positives = 44/71 (61%) Query: 1 MKSMDKLTTGVAYGTSAGSAGYWFLQLLDKVTPSQWAAIGVLGSLVFGLLTYLTNLYFKI 60 MK +++ + YGTS SA YWF QLLD TP QWAAIGV+GSL F LT N+YFK Sbjct: 1 MKMTHRVSEVITYGTSTVSATYWFSQLLDSYTPGQWAAIGVIGSLAFTALTAFVNIYFKW 60 Query: 61 KEDKRKAARGE 71 +R GE Sbjct: 61 LAYRRGKLSGE 71 >UniRef50_B5YTY8 Conserved domain protein n=2 Tax=Escherichia coli O157:H7 RepID=B5YTY8_ECO5E Length = 54 Score = 79.8 bits (195), Expect = 2e-14, Method: Composition-based stats. Identities = 43/50 (86%), Positives = 47/50 (94%) Query: 22 YWFLQLLDKVTPSQWAAIGVLGSLVFGLLTYLTNLYFKIKEDKRKAARGE 71 YW LQLLDKV+PSQW AIGVLGSL+FGLLTYLTNLYFKI+ED+RK ARGE Sbjct: 5 YWLLQLLDKVSPSQWVAIGVLGSLLFGLLTYLTNLYFKIREDRRKVARGE 54 >UniRef50_B4AC67 Conserved domain protein n=1 Tax=Salmonella enterica subsp. enterica serovar Newport str. SL317 RepID=B4AC67_SALNE Length = 71 Score = 79.4 bits (194), Expect = 3e-14, Method: Composition-based stats. Identities = 29/61 (47%), Positives = 42/61 (68%), Gaps = 3/61 (4%) Query: 4 MDKLTTGVAYGTSAGSAGYWFLQLLDKVTPSQWAAIGVLGSLVFGLLTYLTNLYFKIKED 63 MD+ TTG +YG SA + Y F LD T +WAAIG++G L FG LT++TN+YF+ + + Sbjct: 1 MDRTTTGASYGVSAATMIYSF---LDSFTHDEWAAIGIMGGLFFGALTWITNVYFQRQRN 57 Query: 64 K 64 + Sbjct: 58 R 58 >UniRef50_D2AA42 Lysis protein S n=2 Tax=Shigella flexneri RepID=D2AA42_SHIF2 Length = 55 Score = 44.0 bits (102), Expect = 0.002, Method: Composition-based stats. Identities = 22/30 (73%), Positives = 27/30 (90%) Query: 1 MKSMDKLTTGVAYGTSAGSAGYWFLQLLDK 30 M M+K++TG+AYGTSAGSAGYWFLQ LD+ Sbjct: 26 MYQMEKISTGIAYGTSAGSAGYWFLQWLDQ 55 >UniRef50_Q1I0Z2 Hol n=1 Tax=Pasteurella phage F108 RepID=Q1I0Z2_9CAUD Length = 70 Score = 42.8 bits (99), Expect = 0.003, Method: Composition-based stats. Identities = 20/41 (48%), Positives = 23/41 (56%) Query: 31 VTPSQWAAIGVLGSLVFGLLTYLTNLYFKIKEDKRKAARGE 71 W IG + ++FGLLT LTN YFK KEDKR E Sbjct: 19 FAQLSWGDIGAIFGILFGLLTVLTNWYFKRKEDKRAEKALE 59 >UniRef50_B2HW61 Putative uncharacterized protein n=2 Tax=Acinetobacter RepID=B2HW61_ACIBC Length = 97 Score = 42.1 bits (97), Expect = 0.005, Method: Composition-based stats. Identities = 19/55 (34%), Positives = 29/55 (52%), Gaps = 9/55 (16%) Query: 12 AYGTSAGSAGYWFLQLLDKVTPSQWAAIGVLGSLVFGLLTYLTNLYFKIKEDKRK 66 YG G + + +D V+ S++ G+ T+LTNLYFK ++DKRK Sbjct: 24 TYGYVVGGSLIGVIGKIDW---------AVVFSILIGIATFLTNLYFKKRDDKRK 69 >UniRef50_Q4QK06 Putative uncharacterized protein n=7 Tax=Haemophilus influenzae RepID=Q4QK06_HAEI8 Length = 73 Score = 40.5 bits (93), Expect = 0.015, Method: Composition-based stats. Identities = 15/63 (23%), Positives = 31/63 (49%), Gaps = 2/63 (3%) Query: 4 MDKLTTGVAYGTSAGSAGYWFLQLLDKVTPSQWAAIGVLGSLVFGLLTYLTNLYFKIKED 63 M + +Y ++G + ++ D + WA + + +V G+ T+L N Y+K K+ Sbjct: 1 MHDAPSKASY--TSGIFAFLIGRIADMFSNVNWADVASITGIVIGVATFLVNWYYKKKDF 58 Query: 64 KRK 66 + K Sbjct: 59 ELK 61 >UniRef50_A7MNN3 Putative uncharacterized protein n=1 Tax=Cronobacter sakazakii ATCC BAA-894 RepID=A7MNN3_ENTS8 Length = 78 Score = 40.1 bits (92), Expect = 0.024, Method: Composition-based stats. Identities = 15/37 (40%), Positives = 25/37 (67%) Query: 25 LQLLDKVTPSQWAAIGVLGSLVFGLLTYLTNLYFKIK 61 LL +++P +W+A+GV+ +V LLT+ N Y+K K Sbjct: 19 NGLLTRLSPDEWSAVGVIAGIVVALLTFGINWYYKRK 55 >UniRef50_Q7Y3V4 Putative uncharacterized protein n=1 Tax=Yersinia phage PY54 RepID=Q7Y3V4_9CAUD Length = 88 Score = 39.7 bits (91), Expect = 0.030, Method: Composition-based stats. Identities = 14/60 (23%), Positives = 28/60 (46%), Gaps = 3/60 (5%) Query: 3 SMDKLTTGVAYGTSAGSAGYWFLQLLDKVTPSQWAAIGVLGSLVFGLLTYLTNLYFKIKE 62 +K+ YG GS + + +PS+W +G+L +V ++ T ++FK + Sbjct: 4 MQEKIADNALYGGGIGSVLFGL---ITYFSPSEWMVLGILVGIVTTIIGCGTGIWFKCQR 60 >UniRef50_A6XRP5 Conserved domain protein n=1 Tax=Vibrio cholerae AM-19226 RepID=A6XRP5_VIBCH Length = 70 Score = 39.0 bits (89), Expect = 0.042, Method: Composition-based stats. Identities = 20/61 (32%), Positives = 30/61 (49%), Gaps = 9/61 (14%) Query: 3 SMDKLTTGVAYGTSAGSAGYWFLQLLDKVTPSQWAAIGVLGSLVFGLLTYLTNLYFKIKE 62 +K+++ +Y T+ AG+ L L D V+ L L+F LTY TN Y+K K Sbjct: 1 MQEKISSFCSYLTAGVFAGFGALTLQDWVS---------LLGLLFVALTYFTNRYYKKKS 51 Query: 63 D 63 Sbjct: 52 Y 52 Searching..................................................done Results from round 2 Score E Sequences producing significant alignments: (bits) Value Sequences used in model and found again: UniRef50_B1XBH4 Lysis S family protein n=2 Tax=Enterobacteriacea... 97 2e-19 UniRef50_C8SXC9 Putative uncharacterized protein n=1 Tax=Klebsie... 89 4e-17 UniRef50_C8T826 Putative uncharacterized protein n=1 Tax=Klebsie... 89 4e-17 UniRef50_B5YTY8 Conserved domain protein n=2 Tax=Escherichia col... 80 2e-14 UniRef50_B4AC67 Conserved domain protein n=1 Tax=Salmonella ente... 79 3e-14 UniRef50_D2AA42 Lysis protein S n=2 Tax=Shigella flexneri RepID=... 44 0.002 Sequences not found previously or not previously below threshold: UniRef50_Q1I0Z2 Hol n=1 Tax=Pasteurella phage F108 RepID=Q1I0Z2_... 43 0.003 UniRef50_B2HW61 Putative uncharacterized protein n=2 Tax=Acineto... 42 0.005 UniRef50_Q4QK06 Putative uncharacterized protein n=7 Tax=Haemoph... 41 0.015 UniRef50_A7MNN3 Putative uncharacterized protein n=1 Tax=Cronoba... 40 0.024 UniRef50_Q7Y3V4 Putative uncharacterized protein n=1 Tax=Yersini... 40 0.030 UniRef50_A6XRP5 Conserved domain protein n=1 Tax=Vibrio cholerae... 39 0.042 CONVERGED! >UniRef50_B1XBH4 Lysis S family protein n=2 Tax=Enterobacteriaceae RepID=B1XBH4_ECODH Length = 74 Score = 96.8 bits (239), Expect = 2e-19, Method: Composition-based stats. Identities = 37/69 (53%), Positives = 47/69 (68%), Gaps = 3/69 (4%) Query: 1 MKSMDKLTTGVAYGTSAGSAGYWFLQLLDKVTPSQWAAIGVLGSLVFGLLTYLTNLYFKI 60 M MDKLTTG AYG SAGS +L+ +P QW AIGVL ++ ++TYLTNLYFKI Sbjct: 1 MYRMDKLTTGAAYGASAGSI---LNGMLNAYSPEQWNAIGVLVGIIIAVMTYLTNLYFKI 57 Query: 61 KEDKRKAAR 69 +ED R++ Sbjct: 58 REDNRRSRS 66 >UniRef50_C8SXC9 Putative uncharacterized protein n=1 Tax=Klebsiella pneumoniae subsp. rhinoscleromatis ATCC 13884 RepID=C8SXC9_KLEPR Length = 82 Score = 89.0 bits (219), Expect = 4e-17, Method: Composition-based stats. Identities = 20/61 (32%), Positives = 30/61 (49%) Query: 1 MKSMDKLTTGVAYGTSAGSAGYWFLQLLDKVTPSQWAAIGVLGSLVFGLLTYLTNLYFKI 60 MK DK+ + +Y +S G + D W I + ++ G+ TYLTNLYFK Sbjct: 1 MKMPDKIFSAASYCSSGGLICTGLARTYDWFHGLDWNFIALASGVIIGVATYLTNLYFKR 60 Query: 61 K 61 + Sbjct: 61 R 61 >UniRef50_C8T826 Putative uncharacterized protein n=1 Tax=Klebsiella pneumoniae subsp. rhinoscleromatis ATCC 13884 RepID=C8T826_KLEPR Length = 71 Score = 89.0 bits (219), Expect = 4e-17, Method: Composition-based stats. Identities = 37/71 (52%), Positives = 44/71 (61%) Query: 1 MKSMDKLTTGVAYGTSAGSAGYWFLQLLDKVTPSQWAAIGVLGSLVFGLLTYLTNLYFKI 60 MK +++ + YGTS SA YWF QLLD TP QWAAIGV+GSL F LT N+YFK Sbjct: 1 MKMTHRVSEVITYGTSTVSATYWFSQLLDSYTPGQWAAIGVIGSLAFTALTAFVNIYFKW 60 Query: 61 KEDKRKAARGE 71 +R GE Sbjct: 61 LAYRRGKLSGE 71 >UniRef50_B5YTY8 Conserved domain protein n=2 Tax=Escherichia coli O157:H7 RepID=B5YTY8_ECO5E Length = 54 Score = 79.8 bits (195), Expect = 2e-14, Method: Composition-based stats. Identities = 43/50 (86%), Positives = 47/50 (94%) Query: 22 YWFLQLLDKVTPSQWAAIGVLGSLVFGLLTYLTNLYFKIKEDKRKAARGE 71 YW LQLLDKV+PSQW AIGVLGSL+FGLLTYLTNLYFKI+ED+RK ARGE Sbjct: 5 YWLLQLLDKVSPSQWVAIGVLGSLLFGLLTYLTNLYFKIREDRRKVARGE 54 >UniRef50_B4AC67 Conserved domain protein n=1 Tax=Salmonella enterica subsp. enterica serovar Newport str. SL317 RepID=B4AC67_SALNE Length = 71 Score = 79.4 bits (194), Expect = 3e-14, Method: Composition-based stats. Identities = 29/61 (47%), Positives = 42/61 (68%), Gaps = 3/61 (4%) Query: 4 MDKLTTGVAYGTSAGSAGYWFLQLLDKVTPSQWAAIGVLGSLVFGLLTYLTNLYFKIKED 63 MD+ TTG +YG SA + Y F LD T +WAAIG++G L FG LT++TN+YF+ + + Sbjct: 1 MDRTTTGASYGVSAATMIYSF---LDSFTHDEWAAIGIMGGLFFGALTWITNVYFQRQRN 57 Query: 64 K 64 + Sbjct: 58 R 58 >UniRef50_D2AA42 Lysis protein S n=2 Tax=Shigella flexneri RepID=D2AA42_SHIF2 Length = 55 Score = 44.0 bits (102), Expect = 0.002, Method: Composition-based stats. Identities = 22/30 (73%), Positives = 27/30 (90%) Query: 1 MKSMDKLTTGVAYGTSAGSAGYWFLQLLDK 30 M M+K++TG+AYGTSAGSAGYWFLQ LD+ Sbjct: 26 MYQMEKISTGIAYGTSAGSAGYWFLQWLDQ 55 >UniRef50_Q1I0Z2 Hol n=1 Tax=Pasteurella phage F108 RepID=Q1I0Z2_9CAUD Length = 70 Score = 42.8 bits (99), Expect = 0.003, Method: Composition-based stats. Identities = 20/41 (48%), Positives = 23/41 (56%) Query: 31 VTPSQWAAIGVLGSLVFGLLTYLTNLYFKIKEDKRKAARGE 71 W IG + ++FGLLT LTN YFK KEDKR E Sbjct: 19 FAQLSWGDIGAIFGILFGLLTVLTNWYFKRKEDKRAEKALE 59 >UniRef50_B2HW61 Putative uncharacterized protein n=2 Tax=Acinetobacter RepID=B2HW61_ACIBC Length = 97 Score = 42.1 bits (97), Expect = 0.005, Method: Composition-based stats. Identities = 19/55 (34%), Positives = 29/55 (52%), Gaps = 9/55 (16%) Query: 12 AYGTSAGSAGYWFLQLLDKVTPSQWAAIGVLGSLVFGLLTYLTNLYFKIKEDKRK 66 YG G + + +D V+ S++ G+ T+LTNLYFK ++DKRK Sbjct: 24 TYGYVVGGSLIGVIGKIDW---------AVVFSILIGIATFLTNLYFKKRDDKRK 69 >UniRef50_Q4QK06 Putative uncharacterized protein n=7 Tax=Haemophilus influenzae RepID=Q4QK06_HAEI8 Length = 73 Score = 40.5 bits (93), Expect = 0.015, Method: Composition-based stats. Identities = 15/63 (23%), Positives = 31/63 (49%), Gaps = 2/63 (3%) Query: 4 MDKLTTGVAYGTSAGSAGYWFLQLLDKVTPSQWAAIGVLGSLVFGLLTYLTNLYFKIKED 63 M + +Y ++G + ++ D + WA + + +V G+ T+L N Y+K K+ Sbjct: 1 MHDAPSKASY--TSGIFAFLIGRIADMFSNVNWADVASITGIVIGVATFLVNWYYKKKDF 58 Query: 64 KRK 66 + K Sbjct: 59 ELK 61 >UniRef50_A7MNN3 Putative uncharacterized protein n=1 Tax=Cronobacter sakazakii ATCC BAA-894 RepID=A7MNN3_ENTS8 Length = 78 Score = 40.1 bits (92), Expect = 0.024, Method: Composition-based stats. Identities = 15/37 (40%), Positives = 25/37 (67%) Query: 25 LQLLDKVTPSQWAAIGVLGSLVFGLLTYLTNLYFKIK 61 LL +++P +W+A+GV+ +V LLT+ N Y+K K Sbjct: 19 NGLLTRLSPDEWSAVGVIAGIVVALLTFGINWYYKRK 55 >UniRef50_Q7Y3V4 Putative uncharacterized protein n=1 Tax=Yersinia phage PY54 RepID=Q7Y3V4_9CAUD Length = 88 Score = 39.7 bits (91), Expect = 0.030, Method: Composition-based stats. Identities = 14/60 (23%), Positives = 28/60 (46%), Gaps = 3/60 (5%) Query: 3 SMDKLTTGVAYGTSAGSAGYWFLQLLDKVTPSQWAAIGVLGSLVFGLLTYLTNLYFKIKE 62 +K+ YG GS + + +PS+W +G+L +V ++ T ++FK + Sbjct: 4 MQEKIADNALYGGGIGSVLFGL---ITYFSPSEWMVLGILVGIVTTIIGCGTGIWFKCQR 60 >UniRef50_A6XRP5 Conserved domain protein n=1 Tax=Vibrio cholerae AM-19226 RepID=A6XRP5_VIBCH Length = 70 Score = 39.0 bits (89), Expect = 0.042, Method: Composition-based stats. Identities = 20/61 (32%), Positives = 30/61 (49%), Gaps = 9/61 (14%) Query: 3 SMDKLTTGVAYGTSAGSAGYWFLQLLDKVTPSQWAAIGVLGSLVFGLLTYLTNLYFKIKE 62 +K+++ +Y T+ AG+ L L D V+ L L+F LTY TN Y+K K Sbjct: 1 MQEKISSFCSYLTAGVFAGFGALTLQDWVS---------LLGLLFVALTYFTNRYYKKKS 51 Query: 63 D 63 Sbjct: 52 Y 52 Database: uniref50.fasta Posted date: Mar 8, 2010 10:38 AM Number of letters in database: 1,040,396,356 Number of sequences in database: 3,077,464 Lambda K H 0.308 0.127 0.339 Lambda K H 0.267 0.0392 0.140 Matrix: BLOSUM62 Gap Penalties: Existence: 11, Extension: 1 Number of Hits to DB: 227,836,172 Number of Sequences: 3077464 Number of extensions: 5682442 Number of successful extensions: 26822 Number of sequences better than 1.0e-01: 14 Number of HSP's better than 0.1 without gapping: 18 Number of HSP's successfully gapped in prelim test: 10 Number of HSP's that attempted gapping in prelim test: 26794 Number of HSP's gapped (non-prelim): 28 length of query: 71 length of database: 1,040,396,356 effective HSP length: 43 effective length of query: 28 effective length of database: 908,065,404 effective search space: 25425831312 effective search space used: 25425831312 T: 11 A: 40 X1: 16 ( 7.1 bits) X2: 38 (14.6 bits) X3: 64 (24.7 bits) S1: 41 (21.2 bits) S2: 87 (38.2 bits)