BLASTP 2.2.22 [Sep-27-2009] Reference: Altschul, Stephen F., Thomas L. Madden, Alejandro A. Schaffer, Jinghui Zhang, Zheng Zhang, Webb Miller, and David J. Lipman (1997), "Gapped BLAST and PSI-BLAST: a new generation of protein database search programs", Nucleic Acids Res. 25:3389-3402. Reference for compositional score matrix adjustment: Altschul, Stephen F., John C. Wootton, E. Michael Gertz, Richa Agarwala, Aleksandr Morgulis, Alejandro A. Schaffer, and Yi-Kuo Yu (2005) "Protein database searches using compositionally adjusted substitution matrices", FEBS J. 272:5101-5109. Reference for composition-based statistics starting in round 2: Schaffer, Alejandro A., L. Aravind, Thomas L. Madden, Sergei Shavirin, John L. Spouge, Yuri I. Wolf, Eugene V. Koonin, and Stephen F. Altschul (2001), "Improving the accuracy of PSI-BLAST protein database searches with composition-based statistics and other refinements", Nucleic Acids Res. 29:2994-3005. Query= batch____ (57 letters) Database: uniref50.fasta 3,077,464 sequences; 1,040,396,356 total letters Searching..................................................done Results from round 1 Score E Sequences producing significant alignments: (bits) Value UniRef50_UPI0001C341B5 hypothetical protein CATC2_04606 n=2 Tax=... 89 5e-17 UniRef50_B5F569 UPF0509 protein yciZ n=32 Tax=Enterobacteriaceae... 78 9e-14 UniRef50_A6T7Z3 UPF0509 protein KPN78578_12530 n=56 Tax=Enteroba... 66 3e-10 >UniRef50_UPI0001C341B5 hypothetical protein CATC2_04606 n=2 Tax=Enterobacteriaceae RepID=UPI0001C341B5 Length = 60 Score = 88.6 bits (218), Expect = 5e-17, Method: Compositional matrix adjust. Identities = 45/60 (75%), Positives = 48/60 (80%), Gaps = 3/60 (5%) Query: 1 MSEFDAQRVAERIDIVLDILVAGDYHSAIHNLEILKAELLRQVAESTPDI---PKAPWEI 57 MS+ DAQR+AERID VLDILVAGDYHSAIHNLEILK ELL V E T + PKAPWEI Sbjct: 1 MSDIDAQRLAERIDTVLDILVAGDYHSAIHNLEILKTELLNTVKEDTQSLSAKPKAPWEI 60 >UniRef50_B5F569 UPF0509 protein yciZ n=32 Tax=Enterobacteriaceae RepID=YCIZ_SALA4 Length = 59 Score = 77.8 bits (190), Expect = 9e-14, Method: Compositional matrix adjust. Identities = 40/59 (67%), Positives = 47/59 (79%), Gaps = 2/59 (3%) Query: 1 MSEFDAQRVAERIDIVLDILVAGDYHSAIHNLEILKAELLRQVAE--STPDIPKAPWEI 57 MS+ +AQR+A RID VLDILVAGDYHSAI+NLEIL+AELL QV + S P +PWEI Sbjct: 1 MSDIEAQRIAARIDTVLDILVAGDYHSAINNLEILRAELLDQVKDGISPSQAPGSPWEI 59 >UniRef50_A6T7Z3 UPF0509 protein KPN78578_12530 n=56 Tax=Enterobacteriaceae RepID=Y1253_KLEP7 Length = 59 Score = 66.2 bits (160), Expect = 3e-10, Method: Compositional matrix adjust. Identities = 40/59 (67%), Positives = 46/59 (77%), Gaps = 2/59 (3%) Query: 1 MSEFDAQRVAERIDIVLDILVAGDYHSAIHNLEILKAELLRQVAEST--PDIPKAPWEI 57 MS+ +A +A+RID VLDILVAGDYHSAIHNLEILKAELL A+ + PKAPWEI Sbjct: 1 MSDVNAHLLAQRIDTVLDILVAGDYHSAIHNLEILKAELLALAADDAEQQNQPKAPWEI 59 Searching..................................................done Results from round 2 Score E Sequences producing significant alignments: (bits) Value Sequences used in model and found again: UniRef50_UPI0001C341B5 hypothetical protein CATC2_04606 n=2 Tax=... 84 1e-15 UniRef50_B5F569 UPF0509 protein yciZ n=32 Tax=Enterobacteriaceae... 83 4e-15 UniRef50_A6T7Z3 UPF0509 protein KPN78578_12530 n=56 Tax=Enteroba... 70 3e-11 Sequences not found previously or not previously below threshold: CONVERGED! >UniRef50_UPI0001C341B5 hypothetical protein CATC2_04606 n=2 Tax=Enterobacteriaceae RepID=UPI0001C341B5 Length = 60 Score = 83.9 bits (206), Expect = 1e-15, Method: Composition-based stats. Identities = 45/60 (75%), Positives = 48/60 (80%), Gaps = 3/60 (5%) Query: 1 MSEFDAQRVAERIDIVLDILVAGDYHSAIHNLEILKAELLRQVAESTPDI---PKAPWEI 57 MS+ DAQR+AERID VLDILVAGDYHSAIHNLEILK ELL V E T + PKAPWEI Sbjct: 1 MSDIDAQRLAERIDTVLDILVAGDYHSAIHNLEILKTELLNTVKEDTQSLSAKPKAPWEI 60 >UniRef50_B5F569 UPF0509 protein yciZ n=32 Tax=Enterobacteriaceae RepID=YCIZ_SALA4 Length = 59 Score = 82.8 bits (203), Expect = 4e-15, Method: Composition-based stats. Identities = 40/59 (67%), Positives = 47/59 (79%), Gaps = 2/59 (3%) Query: 1 MSEFDAQRVAERIDIVLDILVAGDYHSAIHNLEILKAELLRQVAE--STPDIPKAPWEI 57 MS+ +AQR+A RID VLDILVAGDYHSAI+NLEIL+AELL QV + S P +PWEI Sbjct: 1 MSDIEAQRIAARIDTVLDILVAGDYHSAINNLEILRAELLDQVKDGISPSQAPGSPWEI 59 >UniRef50_A6T7Z3 UPF0509 protein KPN78578_12530 n=56 Tax=Enterobacteriaceae RepID=Y1253_KLEP7 Length = 59 Score = 69.7 bits (169), Expect = 3e-11, Method: Composition-based stats. Identities = 40/59 (67%), Positives = 46/59 (77%), Gaps = 2/59 (3%) Query: 1 MSEFDAQRVAERIDIVLDILVAGDYHSAIHNLEILKAELLRQVAEST--PDIPKAPWEI 57 MS+ +A +A+RID VLDILVAGDYHSAIHNLEILKAELL A+ + PKAPWEI Sbjct: 1 MSDVNAHLLAQRIDTVLDILVAGDYHSAIHNLEILKAELLALAADDAEQQNQPKAPWEI 59 Database: uniref50.fasta Posted date: Mar 8, 2010 10:38 AM Number of letters in database: 1,040,396,356 Number of sequences in database: 3,077,464 Lambda K H 0.318 0.139 0.397 Lambda K H 0.267 0.0420 0.140 Matrix: BLOSUM62 Gap Penalties: Existence: 11, Extension: 1 Number of Hits to DB: 217,220,883 Number of Sequences: 3077464 Number of extensions: 5879809 Number of successful extensions: 16660 Number of sequences better than 1.0e-01: 3 Number of HSP's better than 0.1 without gapping: 6 Number of HSP's successfully gapped in prelim test: 0 Number of HSP's that attempted gapping in prelim test: 16649 Number of HSP's gapped (non-prelim): 6 length of query: 57 length of database: 1,040,396,356 effective HSP length: 30 effective length of query: 27 effective length of database: 948,072,436 effective search space: 25597955772 effective search space used: 25597955772 T: 11 A: 40 X1: 16 ( 7.3 bits) X2: 38 (14.6 bits) X3: 64 (24.7 bits) S1: 41 (21.7 bits) S2: 87 (38.1 bits)