BLASTP 2.2.22 [Sep-27-2009] Reference: Altschul, Stephen F., Thomas L. Madden, Alejandro A. Schaffer, Jinghui Zhang, Zheng Zhang, Webb Miller, and David J. Lipman (1997), "Gapped BLAST and PSI-BLAST: a new generation of protein database search programs", Nucleic Acids Res. 25:3389-3402. Reference for compositional score matrix adjustment: Altschul, Stephen F., John C. Wootton, E. Michael Gertz, Richa Agarwala, Aleksandr Morgulis, Alejandro A. Schaffer, and Yi-Kuo Yu (2005) "Protein database searches using compositionally adjusted substitution matrices", FEBS J. 272:5101-5109. Reference for composition-based statistics starting in round 2: Schaffer, Alejandro A., L. Aravind, Thomas L. Madden, Sergei Shavirin, John L. Spouge, Yuri I. Wolf, Eugene V. Koonin, and Stephen F. Altschul (2001), "Improving the accuracy of PSI-BLAST protein database searches with composition-based statistics and other refinements", Nucleic Acids Res. 29:2994-3005. Query= batch____ (114 letters) Database: uniref50.fasta 3,077,464 sequences; 1,040,396,356 total letters Searching..................................................done Results from round 1 Score E Sequences producing significant alignments: (bits) Value UniRef50_B7LJT2 UPF0757 protein ymgG n=47 Tax=Proteobacteria Rep... 127 9e-29 UniRef50_C5BD73 UPF0757 protein ymgG n=2 Tax=Edwardsiella RepID=... 78 7e-14 UniRef50_B8CKK4 Putative uncharacterized protein n=1 Tax=Shewane... 42 0.007 >UniRef50_B7LJT2 UPF0757 protein ymgG n=47 Tax=Proteobacteria RepID=YMGG_ESCF3 Length = 114 Score = 127 bits (320), Expect = 9e-29, Method: Compositional matrix adjust. Identities = 82/98 (83%), Positives = 88/98 (89%) Query: 1 MKKKILAFGLISALFCSTPAMADMNRTTKGALLGAGVGLLTGNGVNGVLKGAAVGAGVGA 60 MKK LA GLISALF S A AD++RT+KGALLGAGVGLL+GNGVNGVLKGAAVGAGVGA Sbjct: 1 MKKHTLALGLISALFFSFQAHADLSRTSKGALLGAGVGLLSGNGVNGVLKGAAVGAGVGA 60 Query: 61 VTEKGRDGKNARKGAKVGAAVGAVTGVLTGNGLEGAIK 98 VTEKG+DGK ARKGAKVGA +GAV GVLTGNGLEGAIK Sbjct: 61 VTEKGKDGKKARKGAKVGATLGAVAGVLTGNGLEGAIK 98 >UniRef50_C5BD73 UPF0757 protein ymgG n=2 Tax=Edwardsiella RepID=YMGG_EDWI9 Length = 114 Score = 78.2 bits (191), Expect = 7e-14, Method: Compositional matrix adjust. Identities = 66/98 (67%), Positives = 80/98 (81%) Query: 1 MKKKILAFGLISALFCSTPAMADMNRTTKGALLGAGVGLLTGNGVNGVLKGAAVGAGVGA 60 M+ ++A +S L ST ++A +RTTKGALLGA GLL+GNGVNGVLKGAA+GAGVGA Sbjct: 1 MRNTLIAVTTVSLLAFSTASLAGPSRTTKGALLGATAGLLSGNGVNGVLKGAALGAGVGA 60 Query: 61 VTEKGRDGKNARKGAKVGAAVGAVTGVLTGNGLEGAIK 98 ++EKGR GK ARKGA +GA VGAV GVL+GNG+EGAIK Sbjct: 61 ISEKGRKGKKARKGATIGATVGAVAGVLSGNGIEGAIK 98 >UniRef50_B8CKK4 Putative uncharacterized protein n=1 Tax=Shewanella piezotolerans WP3 RepID=B8CKK4_SHEPW Length = 117 Score = 42.0 bits (97), Expect = 0.007, Method: Compositional matrix adjust. Identities = 35/92 (38%), Positives = 50/92 (54%), Gaps = 16/92 (17%) Query: 1 MKKKILAFGLISALFC-STPAMADMNRTTKGALLGAGVGLLTGNGVNGVLKGAAVGAGVG 59 MK K L +L C +T + AD T GN V G L+G VG G+G Sbjct: 1 MKLKHLLITATLSLSCVATSSFADYCDDT------------LGNVVGGALEGGLVGTGLG 48 Query: 60 AVTEKGRDGKNARKGAKVGAAVGAVTGVLTGN 91 A+ + GR+G A++GA++GAAVGA+ G++ G Sbjct: 49 AIAD-GREG--AKRGAQIGAAVGAIDGLIEGE 77 Searching..................................................done Results from round 2 Score E Sequences producing significant alignments: (bits) Value Sequences used in model and found again: UniRef50_B7LJT2 UPF0757 protein ymgG n=47 Tax=Proteobacteria Rep... 86 3e-16 UniRef50_C5BD73 UPF0757 protein ymgG n=2 Tax=Edwardsiella RepID=... 67 2e-10 Sequences not found previously or not previously below threshold: UniRef50_Q0EW85 Putative uncharacterized protein n=1 Tax=Maripro... 42 0.005 CONVERGED! >UniRef50_B7LJT2 UPF0757 protein ymgG n=47 Tax=Proteobacteria RepID=YMGG_ESCF3 Length = 114 Score = 86.2 bits (212), Expect = 3e-16, Method: Composition-based stats. Identities = 95/113 (84%), Positives = 102/113 (90%) Query: 1 MKKKILAFGLISALFCSTPAMADMNRTTKGALLGAGVGLLTGNGVNGVLKGAAVGAGVGA 60 MKK LA GLISALF S A AD++RT+KGALLGAGVGLL+GNGVNGVLKGAAVGAGVGA Sbjct: 1 MKKHTLALGLISALFFSFQAHADLSRTSKGALLGAGVGLLSGNGVNGVLKGAAVGAGVGA 60 Query: 61 VTEKGRDGKNARKGAKVGAAVGAVTGVLTGNGLEGAIKGAVIGGTGGAILGKM 113 VTEKG+DGK ARKGAKVGA +GAV GVLTGNGLEGAIKGAVIGG GGAI+GKM Sbjct: 61 VTEKGKDGKKARKGAKVGATLGAVAGVLTGNGLEGAIKGAVIGGAGGAIVGKM 113 >UniRef50_C5BD73 UPF0757 protein ymgG n=2 Tax=Edwardsiella RepID=YMGG_EDWI9 Length = 114 Score = 66.5 bits (161), Expect = 2e-10, Method: Composition-based stats. Identities = 66/98 (67%), Positives = 80/98 (81%) Query: 1 MKKKILAFGLISALFCSTPAMADMNRTTKGALLGAGVGLLTGNGVNGVLKGAAVGAGVGA 60 M+ ++A +S L ST ++A +RTTKGALLGA GLL+GNGVNGVLKGAA+GAGVGA Sbjct: 1 MRNTLIAVTTVSLLAFSTASLAGPSRTTKGALLGATAGLLSGNGVNGVLKGAALGAGVGA 60 Query: 61 VTEKGRDGKNARKGAKVGAAVGAVTGVLTGNGLEGAIK 98 ++EKGR GK ARKGA +GA VGAV GVL+GNG+EGAIK Sbjct: 61 ISEKGRKGKKARKGATIGATVGAVAGVLSGNGIEGAIK 98 >UniRef50_Q0EW85 Putative uncharacterized protein n=1 Tax=Mariprofundus ferrooxydans PV-1 RepID=Q0EW85_9PROT Length = 161 Score = 42.2 bits (98), Expect = 0.005, Method: Composition-based stats. Identities = 19/38 (50%), Positives = 24/38 (63%) Query: 74 GAKVGAAVGAVTGVLTGNGLEGAIKGAVIGGTGGAILG 111 GA VGA GAV G + L+GAI GAV+G GA++ Sbjct: 43 GAVVGATTGAVAGSSSDQALQGAIFGAVLGTIAGAVIA 80 Database: uniref50.fasta Posted date: Mar 8, 2010 10:38 AM Number of letters in database: 1,040,396,356 Number of sequences in database: 3,077,464 Lambda K H 0.323 0.145 0.396 Lambda K H 0.267 0.0454 0.140 Matrix: BLOSUM62 Gap Penalties: Existence: 11, Extension: 1 Number of Hits to DB: 494,754,910 Number of Sequences: 3077464 Number of extensions: 24334426 Number of successful extensions: 146306 Number of sequences better than 1.0e-01: 250 Number of HSP's better than 0.1 without gapping: 168 Number of HSP's successfully gapped in prelim test: 820 Number of HSP's that attempted gapping in prelim test: 136743 Number of HSP's gapped (non-prelim): 7378 length of query: 114 length of database: 1,040,396,356 effective HSP length: 81 effective length of query: 33 effective length of database: 791,121,772 effective search space: 26107018476 effective search space used: 26107018476 T: 11 A: 40 X1: 16 ( 7.5 bits) X2: 38 (14.6 bits) X3: 64 (24.7 bits) S1: 42 (22.4 bits) S2: 87 (38.0 bits)