BLASTP 2.2.22 [Sep-27-2009] Reference: Altschul, Stephen F., Thomas L. Madden, Alejandro A. Schaffer, Jinghui Zhang, Zheng Zhang, Webb Miller, and David J. Lipman (1997), "Gapped BLAST and PSI-BLAST: a new generation of protein database search programs", Nucleic Acids Res. 25:3389-3402. Reference for compositional score matrix adjustment: Altschul, Stephen F., John C. Wootton, E. Michael Gertz, Richa Agarwala, Aleksandr Morgulis, Alejandro A. Schaffer, and Yi-Kuo Yu (2005) "Protein database searches using compositionally adjusted substitution matrices", FEBS J. 272:5101-5109. Reference for composition-based statistics starting in round 2: Schaffer, Alejandro A., L. Aravind, Thomas L. Madden, Sergei Shavirin, John L. Spouge, Yuri I. Wolf, Eugene V. Koonin, and Stephen F. Altschul (2001), "Improving the accuracy of PSI-BLAST protein database searches with composition-based statistics and other refinements", Nucleic Acids Res. 29:2994-3005. Query= batch____ (63 letters) Database: uniref50.fasta 3,077,464 sequences; 1,040,396,356 total letters Searching..................................................done Results from round 1 Score E Sequences producing significant alignments: (bits) Value UniRef50_P0ADJ5 Uncharacterized protein yhjT n=91 Tax=Enterobact... 126 2e-28 UniRef50_D2TJX0 Putative membrane protein n=2 Tax=Enterobacteria... 106 3e-22 UniRef50_B5XN21 Cellulose biosynthesis protein BcsF n=6 Tax=Ente... 88 7e-17 UniRef50_D0KCJ2 Celllulose biosynthesis operon protein BcsF/YhjT... 46 4e-04 UniRef50_D0ZGP8 Putative uncharacterized protein n=1 Tax=Edwards... 38 0.081 >UniRef50_P0ADJ5 Uncharacterized protein yhjT n=91 Tax=Enterobacteriaceae RepID=YHJT_ECOLI Length = 63 Score = 126 bits (317), Expect = 2e-28, Method: Compositional matrix adjust. Identities = 63/63 (100%), Positives = 63/63 (100%) Query: 1 MMTISDIIEIIVVCALIFFPLGYLARHSLRRIRDTLRLFFAKPRYVKPAGTLRRTEKARA 60 MMTISDIIEIIVVCALIFFPLGYLARHSLRRIRDTLRLFFAKPRYVKPAGTLRRTEKARA Sbjct: 1 MMTISDIIEIIVVCALIFFPLGYLARHSLRRIRDTLRLFFAKPRYVKPAGTLRRTEKARA 60 Query: 61 TKK 63 TKK Sbjct: 61 TKK 63 >UniRef50_D2TJX0 Putative membrane protein n=2 Tax=Enterobacteriaceae RepID=D2TJX0_CITRO Length = 63 Score = 106 bits (264), Expect = 3e-22, Method: Compositional matrix adjust. Identities = 51/63 (80%), Positives = 57/63 (90%) Query: 1 MMTISDIIEIIVVCALIFFPLGYLARHSLRRIRDTLRLFFAKPRYVKPAGTLRRTEKARA 60 MM+ISDI++IIVVCALIFFPLGYLARHSLRRI + +RL FAKPRYVKPAGTLRRT +RA Sbjct: 1 MMSISDILQIIVVCALIFFPLGYLARHSLRRIGNIVRLIFAKPRYVKPAGTLRRTVISRA 60 Query: 61 TKK 63 KK Sbjct: 61 NKK 63 >UniRef50_B5XN21 Cellulose biosynthesis protein BcsF n=6 Tax=Enterobacteriaceae RepID=B5XN21_KLEP3 Length = 65 Score = 88.2 bits (217), Expect = 7e-17, Method: Compositional matrix adjust. Identities = 40/62 (64%), Positives = 49/62 (79%) Query: 1 MMTISDIIEIIVVCALIFFPLGYLARHSLRRIRDTLRLFFAKPRYVKPAGTLRRTEKARA 60 MM+I+DI++++V+CAL+FFPLGYL RH RRIR TLRL F KPRYVKPAG LRR + Sbjct: 1 MMSIADILQLVVLCALLFFPLGYLTRHYQRRIRTTLRLMFFKPRYVKPAGVLRRGATVKQ 60 Query: 61 TK 62 K Sbjct: 61 GK 62 >UniRef50_D0KCJ2 Celllulose biosynthesis operon protein BcsF/YhjT n=7 Tax=Enterobacteriaceae RepID=D0KCJ2_PECWW Length = 68 Score = 45.8 bits (107), Expect = 4e-04, Method: Compositional matrix adjust. Identities = 22/62 (35%), Positives = 37/62 (59%), Gaps = 8/62 (12%) Query: 1 MMTISDIIEIIVVCALIFFPLGYLAR----HSLRRIRDTLRLFFAKPRYVKPAGTLRRTE 56 MM + DI++++++ A++FF LGYLA H L+ ++ L PRY+KPA R+ Sbjct: 1 MMNLIDIVQLVLLSAVVFFTLGYLAHRVIPHWLQYWKNRL----LSPRYLKPASVWMRSS 56 Query: 57 KA 58 + Sbjct: 57 SS 58 >UniRef50_D0ZGP8 Putative uncharacterized protein n=1 Tax=Edwardsiella tarda EIB202 RepID=D0ZGP8_EDWTE Length = 68 Score = 38.1 bits (87), Expect = 0.081, Method: Compositional matrix adjust. Identities = 20/40 (50%), Positives = 29/40 (72%), Gaps = 4/40 (10%) Query: 1 MMTISDIIEIIVVCALIFFPLG-YLAR---HSLRRIRDTL 36 MMTI+DI++I+V+CAL+ P G YL + H LR +R T+ Sbjct: 1 MMTITDILQILVLCALVMLPGGFYLHKYYPHPLRTLRRTI 40 Searching..................................................done Results from round 2 Score E Sequences producing significant alignments: (bits) Value Sequences used in model and found again: UniRef50_P0ADJ5 Uncharacterized protein yhjT n=91 Tax=Enterobact... 87 2e-16 UniRef50_D2TJX0 Putative membrane protein n=2 Tax=Enterobacteria... 86 4e-16 UniRef50_B5XN21 Cellulose biosynthesis protein BcsF n=6 Tax=Ente... 84 1e-15 UniRef50_D0KCJ2 Celllulose biosynthesis operon protein BcsF/YhjT... 65 5e-10 Sequences not found previously or not previously below threshold: UniRef50_C4SLW2 Putative uncharacterized protein n=2 Tax=Yersini... 42 0.007 UniRef50_D0ZGP8 Putative uncharacterized protein n=1 Tax=Edwards... 38 0.090 CONVERGED! >UniRef50_P0ADJ5 Uncharacterized protein yhjT n=91 Tax=Enterobacteriaceae RepID=YHJT_ECOLI Length = 63 Score = 86.6 bits (213), Expect = 2e-16, Method: Composition-based stats. Identities = 63/63 (100%), Positives = 63/63 (100%) Query: 1 MMTISDIIEIIVVCALIFFPLGYLARHSLRRIRDTLRLFFAKPRYVKPAGTLRRTEKARA 60 MMTISDIIEIIVVCALIFFPLGYLARHSLRRIRDTLRLFFAKPRYVKPAGTLRRTEKARA Sbjct: 1 MMTISDIIEIIVVCALIFFPLGYLARHSLRRIRDTLRLFFAKPRYVKPAGTLRRTEKARA 60 Query: 61 TKK 63 TKK Sbjct: 61 TKK 63 >UniRef50_D2TJX0 Putative membrane protein n=2 Tax=Enterobacteriaceae RepID=D2TJX0_CITRO Length = 63 Score = 85.8 bits (211), Expect = 4e-16, Method: Composition-based stats. Identities = 51/63 (80%), Positives = 57/63 (90%) Query: 1 MMTISDIIEIIVVCALIFFPLGYLARHSLRRIRDTLRLFFAKPRYVKPAGTLRRTEKARA 60 MM+ISDI++IIVVCALIFFPLGYLARHSLRRI + +RL FAKPRYVKPAGTLRRT +RA Sbjct: 1 MMSISDILQIIVVCALIFFPLGYLARHSLRRIGNIVRLIFAKPRYVKPAGTLRRTVISRA 60 Query: 61 TKK 63 KK Sbjct: 61 NKK 63 >UniRef50_B5XN21 Cellulose biosynthesis protein BcsF n=6 Tax=Enterobacteriaceae RepID=B5XN21_KLEP3 Length = 65 Score = 83.9 bits (206), Expect = 1e-15, Method: Composition-based stats. Identities = 40/62 (64%), Positives = 49/62 (79%) Query: 1 MMTISDIIEIIVVCALIFFPLGYLARHSLRRIRDTLRLFFAKPRYVKPAGTLRRTEKARA 60 MM+I+DI++++V+CAL+FFPLGYL RH RRIR TLRL F KPRYVKPAG LRR + Sbjct: 1 MMSIADILQLVVLCALLFFPLGYLTRHYQRRIRTTLRLMFFKPRYVKPAGVLRRGATVKQ 60 Query: 61 TK 62 K Sbjct: 61 GK 62 >UniRef50_D0KCJ2 Celllulose biosynthesis operon protein BcsF/YhjT n=7 Tax=Enterobacteriaceae RepID=D0KCJ2_PECWW Length = 68 Score = 65.4 bits (158), Expect = 5e-10, Method: Composition-based stats. Identities = 22/62 (35%), Positives = 37/62 (59%), Gaps = 8/62 (12%) Query: 1 MMTISDIIEIIVVCALIFFPLGYLAR----HSLRRIRDTLRLFFAKPRYVKPAGTLRRTE 56 MM + DI++++++ A++FF LGYLA H L+ ++ L PRY+KPA R+ Sbjct: 1 MMNLIDIVQLVLLSAVVFFTLGYLAHRVIPHWLQYWKNRL----LSPRYLKPASVWMRSS 56 Query: 57 KA 58 + Sbjct: 57 SS 58 >UniRef50_C4SLW2 Putative uncharacterized protein n=2 Tax=Yersinia RepID=C4SLW2_YERFR Length = 53 Score = 41.9 bits (97), Expect = 0.007, Method: Composition-based stats. Identities = 16/53 (30%), Positives = 27/53 (50%) Query: 11 IVVCALIFFPLGYLARHSLRRIRDTLRLFFAKPRYVKPAGTLRRTEKARATKK 63 +++ A+I FPLGY+A + + + F RY+K G R + + KK Sbjct: 1 MLLGAVISFPLGYMACRRFPKWWEKKQHLFLSARYLKSEGIWLRDDSSSQNKK 53 >UniRef50_D0ZGP8 Putative uncharacterized protein n=1 Tax=Edwardsiella tarda EIB202 RepID=D0ZGP8_EDWTE Length = 68 Score = 38.0 bits (87), Expect = 0.090, Method: Composition-based stats. Identities = 24/63 (38%), Positives = 33/63 (52%), Gaps = 1/63 (1%) Query: 1 MMTISDIIEIIVVCALIFFPLGYLARHSLRRIRDTLRLFFAKPRYVKPAGTL-RRTEKAR 59 MMTI+DI++I+V+CAL+ P G+ TLR RY++ L RR AR Sbjct: 1 MMTITDILQILVLCALVMLPGGFYLHKYYPHPLRTLRRTIFPTRYLQRVTVLHRRPLSAR 60 Query: 60 ATK 62 K Sbjct: 61 VDK 63 Database: uniref50.fasta Posted date: Mar 8, 2010 10:38 AM Number of letters in database: 1,040,396,356 Number of sequences in database: 3,077,464 Lambda K H 0.327 0.141 0.403 Lambda K H 0.267 0.0425 0.140 Matrix: BLOSUM62 Gap Penalties: Existence: 11, Extension: 1 Number of Hits to DB: 248,779,645 Number of Sequences: 3077464 Number of extensions: 7515492 Number of successful extensions: 36170 Number of sequences better than 1.0e-01: 7 Number of HSP's better than 0.1 without gapping: 10 Number of HSP's successfully gapped in prelim test: 2 Number of HSP's that attempted gapping in prelim test: 36158 Number of HSP's gapped (non-prelim): 12 length of query: 63 length of database: 1,040,396,356 effective HSP length: 35 effective length of query: 28 effective length of database: 932,685,116 effective search space: 26115183248 effective search space used: 26115183248 T: 11 A: 40 X1: 16 ( 7.5 bits) X2: 38 (14.6 bits) X3: 64 (24.7 bits) S1: 39 (21.2 bits) S2: 87 (38.1 bits)