BLASTP 2.2.22 [Sep-27-2009] Reference: Altschul, Stephen F., Thomas L. Madden, Alejandro A. Schaffer, Jinghui Zhang, Zheng Zhang, Webb Miller, and David J. Lipman (1997), "Gapped BLAST and PSI-BLAST: a new generation of protein database search programs", Nucleic Acids Res. 25:3389-3402. Reference for compositional score matrix adjustment: Altschul, Stephen F., John C. Wootton, E. Michael Gertz, Richa Agarwala, Aleksandr Morgulis, Alejandro A. Schaffer, and Yi-Kuo Yu (2005) "Protein database searches using compositionally adjusted substitution matrices", FEBS J. 272:5101-5109. Reference for composition-based statistics starting in round 2: Schaffer, Alejandro A., L. Aravind, Thomas L. Madden, Sergei Shavirin, John L. Spouge, Yuri I. Wolf, Eugene V. Koonin, and Stephen F. Altschul (2001), "Improving the accuracy of PSI-BLAST protein database searches with composition-based statistics and other refinements", Nucleic Acids Res. 29:2994-3005. Query= batch____ (78 letters) Database: uniref50.fasta 3,077,464 sequences; 1,040,396,356 total letters Searching..................................................done Results from round 1 Score E Sequences producing significant alignments: (bits) Value UniRef50_P75991 Uncharacterized protein ycgZ n=29 Tax=Enterobact... 160 1e-38 UniRef50_C9Y5C0 Uncharacterized protein ycgZ n=4 Tax=Enterobacte... 102 5e-21 UniRef50_C4X5Q8 Putative uncharacterized protein n=5 Tax=Enterob... 96 4e-19 UniRef50_D0FT16 Conserved uncharacterized protein YcgZ n=7 Tax=E... 86 3e-16 UniRef50_UPI0001C34448 hypothetical protein EcanA3_04832 n=1 Tax... 83 3e-15 UniRef50_C8QGJ1 Putative uncharacterized protein n=2 Tax=Pantoea... 55 5e-07 UniRef50_C6DKW6 Putative uncharacterized protein n=5 Tax=Pectoba... 39 0.074 >UniRef50_P75991 Uncharacterized protein ycgZ n=29 Tax=Enterobacteriaceae RepID=YCGZ_ECOLI Length = 78 Score = 160 bits (404), Expect = 1e-38, Method: Compositional matrix adjust. Identities = 78/78 (100%), Positives = 78/78 (100%) Query: 1 MHQNSVTLDSAGAITRYFAKANLHTQQETLGEIVTEILKDGRNLSRKSLCAKLLCRLEHA 60 MHQNSVTLDSAGAITRYFAKANLHTQQETLGEIVTEILKDGRNLSRKSLCAKLLCRLEHA Sbjct: 1 MHQNSVTLDSAGAITRYFAKANLHTQQETLGEIVTEILKDGRNLSRKSLCAKLLCRLEHA 60 Query: 61 TGEEEQKHYNALIGLLFE 78 TGEEEQKHYNALIGLLFE Sbjct: 61 TGEEEQKHYNALIGLLFE 78 >UniRef50_C9Y5C0 Uncharacterized protein ycgZ n=4 Tax=Enterobacteriaceae RepID=C9Y5C0_CROTZ Length = 79 Score = 102 bits (253), Expect = 5e-21, Method: Compositional matrix adjust. Identities = 49/78 (62%), Positives = 58/78 (74%) Query: 1 MHQNSVTLDSAGAITRYFAKANLHTQQETLGEIVTEILKDGRNLSRKSLCAKLLCRLEHA 60 M QN D+A AI RYF KA L TQQETLG+IV EIL+DGR L++KS+C KLL RLE A Sbjct: 1 MRQNGYIPDTANAIARYFDKAALPTQQETLGQIVVEILRDGRLLNKKSICTKLLSRLEMA 60 Query: 61 TGEEEQKHYNALIGLLFE 78 + EE+ HY LIGLLF+ Sbjct: 61 SSVEEESHYQTLIGLLFD 78 >UniRef50_C4X5Q8 Putative uncharacterized protein n=5 Tax=Enterobacteriaceae RepID=C4X5Q8_KLEPN Length = 81 Score = 95.9 bits (237), Expect = 4e-19, Method: Compositional matrix adjust. Identities = 45/77 (58%), Positives = 57/77 (74%) Query: 1 MHQNSVTLDSAGAITRYFAKANLHTQQETLGEIVTEILKDGRNLSRKSLCAKLLCRLEHA 60 M QN D+A AI +YF KA+L +QQETLG+IV +IL +GR+L+RK+LC KLL RL+ A Sbjct: 1 MQQNGYIPDTANAIAQYFNKASLPSQQETLGQIVMDILNEGRHLNRKALCTKLLSRLDRA 60 Query: 61 TGEEEQKHYNALIGLLF 77 EE+ HY LIGLLF Sbjct: 61 RAPEEESHYQTLIGLLF 77 >UniRef50_D0FT16 Conserved uncharacterized protein YcgZ n=7 Tax=Enterobacteriaceae RepID=D0FT16_ERWPY Length = 81 Score = 86.3 bits (212), Expect = 3e-16, Method: Compositional matrix adjust. Identities = 37/64 (57%), Positives = 53/64 (82%) Query: 14 ITRYFAKANLHTQQETLGEIVTEILKDGRNLSRKSLCAKLLCRLEHATGEEEQKHYNALI 73 I+RYF + + +QQETLG+IV EIL+ GR+L+RK++C+KLL RLE A+G E+++HY LI Sbjct: 14 ISRYFVEVIMPSQQETLGQIVVEILRSGRHLNRKAICSKLLTRLELASGPEQERHYQQLI 73 Query: 74 GLLF 77 G+LF Sbjct: 74 GILF 77 >UniRef50_UPI0001C34448 hypothetical protein EcanA3_04832 n=1 Tax=Enterobacter cancerogenus ATCC 35316 RepID=UPI0001C34448 Length = 79 Score = 82.8 bits (203), Expect = 3e-15, Method: Compositional matrix adjust. Identities = 41/78 (52%), Positives = 54/78 (69%) Query: 1 MHQNSVTLDSAGAITRYFAKANLHTQQETLGEIVTEILKDGRNLSRKSLCAKLLCRLEHA 60 M +NSV + AIT YF K N+ T+QETLG IV+EI+K+ LSR ++C KLL R+E + Sbjct: 1 MRRNSVPSTTEHAITLYFNKNNIPTKQETLGLIVSEIIKENVQLSRMTICTKLLRRIEES 60 Query: 61 TGEEEQKHYNALIGLLFE 78 + E E+ HYN LI L FE Sbjct: 61 SCEHEKAHYNGLIALFFE 78 >UniRef50_C8QGJ1 Putative uncharacterized protein n=2 Tax=Pantoea sp. At-9b RepID=C8QGJ1_9ENTR Length = 79 Score = 55.5 bits (132), Expect = 5e-07, Method: Compositional matrix adjust. Identities = 27/77 (35%), Positives = 41/77 (53%) Query: 1 MHQNSVTLDSAGAITRYFAKANLHTQQETLGEIVTEILKDGRNLSRKSLCAKLLCRLEHA 60 M Q+ D ++ F++ Q E LG I EIL+ R ++R ++C KL+ RL+ A Sbjct: 1 MRQDGFPPDRQAGLSGCFSRVTHPRQTEILGAIAMEILRQDRKITRINVCLKLIARLDAA 60 Query: 61 TGEEEQKHYNALIGLLF 77 E + H LIG+LF Sbjct: 61 GDEADAGHLRELIGMLF 77 >UniRef50_C6DKW6 Putative uncharacterized protein n=5 Tax=Pectobacterium RepID=C6DKW6_PECCP Length = 84 Score = 38.5 bits (88), Expect = 0.074, Method: Compositional matrix adjust. Identities = 19/49 (38%), Positives = 31/49 (63%) Query: 28 ETLGEIVTEILKDGRNLSRKSLCAKLLCRLEHATGEEEQKHYNALIGLL 76 E LG IV EI++ G ++S K++ AKL+ ++E Q+ Y AL+ L+ Sbjct: 26 EILGNIVEEIIQSGHSVSNKAIIAKLIHKIETEANAAFQEKYRALLELI 74 Searching..................................................done Results from round 2 Score E Sequences producing significant alignments: (bits) Value Sequences used in model and found again: UniRef50_C9Y5C0 Uncharacterized protein ycgZ n=4 Tax=Enterobacte... 130 1e-29 UniRef50_C4X5Q8 Putative uncharacterized protein n=5 Tax=Enterob... 129 4e-29 UniRef50_P75991 Uncharacterized protein ycgZ n=29 Tax=Enterobact... 122 3e-27 UniRef50_C8QGJ1 Putative uncharacterized protein n=2 Tax=Pantoea... 120 2e-26 UniRef50_UPI0001C34448 hypothetical protein EcanA3_04832 n=1 Tax... 118 5e-26 UniRef50_D0FT16 Conserved uncharacterized protein YcgZ n=7 Tax=E... 112 4e-24 Sequences not found previously or not previously below threshold: CONVERGED! >UniRef50_C9Y5C0 Uncharacterized protein ycgZ n=4 Tax=Enterobacteriaceae RepID=C9Y5C0_CROTZ Length = 79 Score = 130 bits (328), Expect = 1e-29, Method: Composition-based stats. Identities = 49/78 (62%), Positives = 58/78 (74%) Query: 1 MHQNSVTLDSAGAITRYFAKANLHTQQETLGEIVTEILKDGRNLSRKSLCAKLLCRLEHA 60 M QN D+A AI RYF KA L TQQETLG+IV EIL+DGR L++KS+C KLL RLE A Sbjct: 1 MRQNGYIPDTANAIARYFDKAALPTQQETLGQIVVEILRDGRLLNKKSICTKLLSRLEMA 60 Query: 61 TGEEEQKHYNALIGLLFE 78 + EE+ HY LIGLLF+ Sbjct: 61 SSVEEESHYQTLIGLLFD 78 >UniRef50_C4X5Q8 Putative uncharacterized protein n=5 Tax=Enterobacteriaceae RepID=C4X5Q8_KLEPN Length = 81 Score = 129 bits (323), Expect = 4e-29, Method: Composition-based stats. Identities = 45/77 (58%), Positives = 57/77 (74%) Query: 1 MHQNSVTLDSAGAITRYFAKANLHTQQETLGEIVTEILKDGRNLSRKSLCAKLLCRLEHA 60 M QN D+A AI +YF KA+L +QQETLG+IV +IL +GR+L+RK+LC KLL RL+ A Sbjct: 1 MQQNGYIPDTANAIAQYFNKASLPSQQETLGQIVMDILNEGRHLNRKALCTKLLSRLDRA 60 Query: 61 TGEEEQKHYNALIGLLF 77 EE+ HY LIGLLF Sbjct: 61 RAPEEESHYQTLIGLLF 77 >UniRef50_P75991 Uncharacterized protein ycgZ n=29 Tax=Enterobacteriaceae RepID=YCGZ_ECOLI Length = 78 Score = 122 bits (307), Expect = 3e-27, Method: Composition-based stats. Identities = 78/78 (100%), Positives = 78/78 (100%) Query: 1 MHQNSVTLDSAGAITRYFAKANLHTQQETLGEIVTEILKDGRNLSRKSLCAKLLCRLEHA 60 MHQNSVTLDSAGAITRYFAKANLHTQQETLGEIVTEILKDGRNLSRKSLCAKLLCRLEHA Sbjct: 1 MHQNSVTLDSAGAITRYFAKANLHTQQETLGEIVTEILKDGRNLSRKSLCAKLLCRLEHA 60 Query: 61 TGEEEQKHYNALIGLLFE 78 TGEEEQKHYNALIGLLFE Sbjct: 61 TGEEEQKHYNALIGLLFE 78 >UniRef50_C8QGJ1 Putative uncharacterized protein n=2 Tax=Pantoea sp. At-9b RepID=C8QGJ1_9ENTR Length = 79 Score = 120 bits (300), Expect = 2e-26, Method: Composition-based stats. Identities = 27/77 (35%), Positives = 41/77 (53%) Query: 1 MHQNSVTLDSAGAITRYFAKANLHTQQETLGEIVTEILKDGRNLSRKSLCAKLLCRLEHA 60 M Q+ D ++ F++ Q E LG I EIL+ R ++R ++C KL+ RL+ A Sbjct: 1 MRQDGFPPDRQAGLSGCFSRVTHPRQTEILGAIAMEILRQDRKITRINVCLKLIARLDAA 60 Query: 61 TGEEEQKHYNALIGLLF 77 E + H LIG+LF Sbjct: 61 GDEADAGHLRELIGMLF 77 >UniRef50_UPI0001C34448 hypothetical protein EcanA3_04832 n=1 Tax=Enterobacter cancerogenus ATCC 35316 RepID=UPI0001C34448 Length = 79 Score = 118 bits (296), Expect = 5e-26, Method: Composition-based stats. Identities = 41/78 (52%), Positives = 54/78 (69%) Query: 1 MHQNSVTLDSAGAITRYFAKANLHTQQETLGEIVTEILKDGRNLSRKSLCAKLLCRLEHA 60 M +NSV + AIT YF K N+ T+QETLG IV+EI+K+ LSR ++C KLL R+E + Sbjct: 1 MRRNSVPSTTEHAITLYFNKNNIPTKQETLGLIVSEIIKENVQLSRMTICTKLLRRIEES 60 Query: 61 TGEEEQKHYNALIGLLFE 78 + E E+ HYN LI L FE Sbjct: 61 SCEHEKAHYNGLIALFFE 78 >UniRef50_D0FT16 Conserved uncharacterized protein YcgZ n=7 Tax=Enterobacteriaceae RepID=D0FT16_ERWPY Length = 81 Score = 112 bits (280), Expect = 4e-24, Method: Composition-based stats. Identities = 38/77 (49%), Positives = 55/77 (71%) Query: 1 MHQNSVTLDSAGAITRYFAKANLHTQQETLGEIVTEILKDGRNLSRKSLCAKLLCRLEHA 60 M + I+RYF + + +QQETLG+IV EIL+ GR+L+RK++C+KLL RLE A Sbjct: 1 MRLDGSNPMIENDISRYFVEVIMPSQQETLGQIVVEILRSGRHLNRKAICSKLLTRLELA 60 Query: 61 TGEEEQKHYNALIGLLF 77 +G E+++HY LIG+LF Sbjct: 61 SGPEQERHYQQLIGILF 77 Database: uniref50.fasta Posted date: Mar 8, 2010 10:38 AM Number of letters in database: 1,040,396,356 Number of sequences in database: 3,077,464 Lambda K H 0.304 0.136 0.353 Lambda K H 0.267 0.0423 0.140 Matrix: BLOSUM62 Gap Penalties: Existence: 11, Extension: 1 Number of Hits to DB: 282,166,801 Number of Sequences: 3077464 Number of extensions: 8248700 Number of successful extensions: 20238 Number of sequences better than 1.0e-01: 6 Number of HSP's better than 0.1 without gapping: 13 Number of HSP's successfully gapped in prelim test: 0 Number of HSP's that attempted gapping in prelim test: 20225 Number of HSP's gapped (non-prelim): 13 length of query: 78 length of database: 1,040,396,356 effective HSP length: 49 effective length of query: 29 effective length of database: 889,600,620 effective search space: 25798417980 effective search space used: 25798417980 T: 11 A: 40 X1: 16 ( 7.0 bits) X2: 38 (14.6 bits) X3: 64 (24.7 bits) S1: 41 (20.8 bits) S2: 87 (38.1 bits)