BLASTP 2.2.22 [Sep-27-2009] Reference: Altschul, Stephen F., Thomas L. Madden, Alejandro A. Schaffer, Jinghui Zhang, Zheng Zhang, Webb Miller, and David J. Lipman (1997), "Gapped BLAST and PSI-BLAST: a new generation of protein database search programs", Nucleic Acids Res. 25:3389-3402. Reference for compositional score matrix adjustment: Altschul, Stephen F., John C. Wootton, E. Michael Gertz, Richa Agarwala, Aleksandr Morgulis, Alejandro A. Schaffer, and Yi-Kuo Yu (2005) "Protein database searches using compositionally adjusted substitution matrices", FEBS J. 272:5101-5109. Reference for composition-based statistics starting in round 2: Schaffer, Alejandro A., L. Aravind, Thomas L. Madden, Sergei Shavirin, John L. Spouge, Yuri I. Wolf, Eugene V. Koonin, and Stephen F. Altschul (2001), "Improving the accuracy of PSI-BLAST protein database searches with composition-based statistics and other refinements", Nucleic Acids Res. 29:2994-3005. Query= batch____ (66 letters) Database: uniref50.fasta 3,077,464 sequences; 1,040,396,356 total letters Searching..................................................done Results from round 1 Score E Sequences producing significant alignments: (bits) Value UniRef50_A7ZPR6 UPF0370 protein ypfN n=151 Tax=Enterobacteriacea... 128 7e-29 UniRef50_D0ZEZ5 Putative uncharacterized protein n=2 Tax=Edwards... 68 9e-11 UniRef50_D2U230 Putative uncharacterized protein n=1 Tax=Arsenop... 54 2e-06 >UniRef50_A7ZPR6 UPF0370 protein ypfN n=151 Tax=Enterobacteriaceae RepID=YPFN_ECO24 Length = 66 Score = 128 bits (321), Expect = 7e-29, Method: Compositional matrix adjust. Identities = 66/66 (100%), Positives = 66/66 (100%) Query: 1 MDWLAKYWWILVIVFLVGVLLNVIKDLKRVDHKKFLANKPELPPHRDFNDKWDDDDDWPK 60 MDWLAKYWWILVIVFLVGVLLNVIKDLKRVDHKKFLANKPELPPHRDFNDKWDDDDDWPK Sbjct: 1 MDWLAKYWWILVIVFLVGVLLNVIKDLKRVDHKKFLANKPELPPHRDFNDKWDDDDDWPK 60 Query: 61 KDQPKK 66 KDQPKK Sbjct: 61 KDQPKK 66 >UniRef50_D0ZEZ5 Putative uncharacterized protein n=2 Tax=Edwardsiella RepID=D0ZEZ5_EDWTE Length = 71 Score = 67.8 bits (164), Expect = 9e-11, Method: Compositional matrix adjust. Identities = 34/65 (52%), Positives = 50/65 (76%), Gaps = 2/65 (3%) Query: 1 MDWLAKYWWILVIVFLVGVLLNVIKDLKRVDHKKFLANKPELPPHRDFNDKWDDDDDWPK 60 M WLA YWW+++++ L+G++ N IK L RV+ K +L ++P LPPHRD N +W DDDDWP+ Sbjct: 1 MSWLADYWWVILLI-LIGMIWNGIKALMRVNPKHYLDDRPPLPPHRDNNAQW-DDDDWPR 58 Query: 61 KDQPK 65 +D+ K Sbjct: 59 QDKDK 63 >UniRef50_D2U230 Putative uncharacterized protein n=1 Tax=Arsenophonus nasoniae RepID=D2U230_9ENTR Length = 38 Score = 53.9 bits (128), Expect = 2e-06, Method: Compositional matrix adjust. Identities = 23/35 (65%), Positives = 32/35 (91%) Query: 27 LKRVDHKKFLANKPELPPHRDFNDKWDDDDDWPKK 61 ++++D K+FL NKP+LPPHRDFN+KWDD+DDWP + Sbjct: 1 MQKIDPKRFLDNKPKLPPHRDFNNKWDDEDDWPNQ 35 Searching..................................................done Results from round 2 Score E Sequences producing significant alignments: (bits) Value Sequences used in model and found again: UniRef50_A7ZPR6 UPF0370 protein ypfN n=151 Tax=Enterobacteriacea... 90 2e-17 UniRef50_D0ZEZ5 Putative uncharacterized protein n=2 Tax=Edwards... 81 9e-15 UniRef50_D2U230 Putative uncharacterized protein n=1 Tax=Arsenop... 50 2e-05 Sequences not found previously or not previously below threshold: CONVERGED! >UniRef50_A7ZPR6 UPF0370 protein ypfN n=151 Tax=Enterobacteriaceae RepID=YPFN_ECO24 Length = 66 Score = 90.0 bits (222), Expect = 2e-17, Method: Composition-based stats. Identities = 66/66 (100%), Positives = 66/66 (100%) Query: 1 MDWLAKYWWILVIVFLVGVLLNVIKDLKRVDHKKFLANKPELPPHRDFNDKWDDDDDWPK 60 MDWLAKYWWILVIVFLVGVLLNVIKDLKRVDHKKFLANKPELPPHRDFNDKWDDDDDWPK Sbjct: 1 MDWLAKYWWILVIVFLVGVLLNVIKDLKRVDHKKFLANKPELPPHRDFNDKWDDDDDWPK 60 Query: 61 KDQPKK 66 KDQPKK Sbjct: 61 KDQPKK 66 >UniRef50_D0ZEZ5 Putative uncharacterized protein n=2 Tax=Edwardsiella RepID=D0ZEZ5_EDWTE Length = 71 Score = 81.2 bits (199), Expect = 9e-15, Method: Composition-based stats. Identities = 34/65 (52%), Positives = 50/65 (76%), Gaps = 2/65 (3%) Query: 1 MDWLAKYWWILVIVFLVGVLLNVIKDLKRVDHKKFLANKPELPPHRDFNDKWDDDDDWPK 60 M WLA YWW+++++ L+G++ N IK L RV+ K +L ++P LPPHRD N +W DDDDWP+ Sbjct: 1 MSWLADYWWVILLI-LIGMIWNGIKALMRVNPKHYLDDRPPLPPHRDNNAQW-DDDDWPR 58 Query: 61 KDQPK 65 +D+ K Sbjct: 59 QDKDK 63 >UniRef50_D2U230 Putative uncharacterized protein n=1 Tax=Arsenophonus nasoniae RepID=D2U230_9ENTR Length = 38 Score = 50.0 bits (118), Expect = 2e-05, Method: Composition-based stats. Identities = 23/37 (62%), Positives = 33/37 (89%) Query: 27 LKRVDHKKFLANKPELPPHRDFNDKWDDDDDWPKKDQ 63 ++++D K+FL NKP+LPPHRDFN+KWDD+DDWP + + Sbjct: 1 MQKIDPKRFLDNKPKLPPHRDFNNKWDDEDDWPNQKK 37 Database: uniref50.fasta Posted date: Mar 8, 2010 10:38 AM Number of letters in database: 1,040,396,356 Number of sequences in database: 3,077,464 Lambda K H 0.314 0.143 0.531 Lambda K H 0.267 0.0448 0.140 Matrix: BLOSUM62 Gap Penalties: Existence: 11, Extension: 1 Number of Hits to DB: 356,360,748 Number of Sequences: 3077464 Number of extensions: 13068349 Number of successful extensions: 45806 Number of sequences better than 1.0e-01: 3 Number of HSP's better than 0.1 without gapping: 6 Number of HSP's successfully gapped in prelim test: 0 Number of HSP's that attempted gapping in prelim test: 45796 Number of HSP's gapped (non-prelim): 6 length of query: 66 length of database: 1,040,396,356 effective HSP length: 38 effective length of query: 28 effective length of database: 923,452,724 effective search space: 25856676272 effective search space used: 25856676272 T: 11 A: 40 X1: 16 ( 7.2 bits) X2: 38 (14.6 bits) X3: 64 (24.7 bits) S1: 41 (21.4 bits) S2: 87 (38.0 bits)