BLASTP 2.2.22 [Sep-27-2009] Reference: Altschul, Stephen F., Thomas L. Madden, Alejandro A. Schaffer, Jinghui Zhang, Zheng Zhang, Webb Miller, and David J. Lipman (1997), "Gapped BLAST and PSI-BLAST: a new generation of protein database search programs", Nucleic Acids Res. 25:3389-3402. Reference for compositional score matrix adjustment: Altschul, Stephen F., John C. Wootton, E. Michael Gertz, Richa Agarwala, Aleksandr Morgulis, Alejandro A. Schaffer, and Yi-Kuo Yu (2005) "Protein database searches using compositionally adjusted substitution matrices", FEBS J. 272:5101-5109. Reference for composition-based statistics starting in round 2: Schaffer, Alejandro A., L. Aravind, Thomas L. Madden, Sergei Shavirin, John L. Spouge, Yuri I. Wolf, Eugene V. Koonin, and Stephen F. Altschul (2001), "Improving the accuracy of PSI-BLAST protein database searches with composition-based statistics and other refinements", Nucleic Acids Res. 29:2994-3005. Query= batch____ (63 letters) Database: uniref50.fasta 3,077,464 sequences; 1,040,396,356 total letters Searching..................................................done Results from round 1 Score E Sequences producing significant alignments: (bits) Value UniRef50_P64546 Uncharacterized protein yfgG n=74 Tax=Enterobact... 128 6e-29 UniRef50_A8AD83 Putative uncharacterized protein n=3 Tax=Enterob... 119 4e-26 UniRef50_C5BBK8 Putative uncharacterized protein n=1 Tax=Edwards... 58 1e-07 UniRef50_D0ZE62 Putative uncharacterized protein n=1 Tax=Edwards... 58 1e-07 UniRef50_C4GRQ4 Putative exported protein n=20 Tax=Yersinia RepI... 52 6e-06 UniRef50_D0KKK8 Putative uncharacterized protein n=3 Tax=Pectoba... 49 4e-05 UniRef50_C8Q4H6 Putative uncharacterized protein n=1 Tax=Pantoea... 49 5e-05 UniRef50_A8GHQ0 Putative uncharacterized protein n=2 Tax=Serrati... 48 1e-04 UniRef50_D2BTI4 Putative uncharacterized protein n=1 Tax=Dickeya... 45 6e-04 >UniRef50_P64546 Uncharacterized protein yfgG n=74 Tax=Enterobacteriaceae RepID=YFGG_ECO57 Length = 63 Score = 128 bits (321), Expect = 6e-29, Method: Compositional matrix adjust. Identities = 63/63 (100%), Positives = 63/63 (100%) Query: 1 MSQATSMRKRHRFNSRMTRIVLLISFIFFFGRFIYSSVGAWQHHQSKKEAQQSTLSVESP 60 MSQATSMRKRHRFNSRMTRIVLLISFIFFFGRFIYSSVGAWQHHQSKKEAQQSTLSVESP Sbjct: 1 MSQATSMRKRHRFNSRMTRIVLLISFIFFFGRFIYSSVGAWQHHQSKKEAQQSTLSVESP 60 Query: 61 VQR 63 VQR Sbjct: 61 VQR 63 >UniRef50_A8AD83 Putative uncharacterized protein n=3 Tax=Enterobacteriaceae RepID=A8AD83_CITK8 Length = 63 Score = 119 bits (297), Expect = 4e-26, Method: Compositional matrix adjust. Identities = 55/63 (87%), Positives = 59/63 (93%) Query: 1 MSQATSMRKRHRFNSRMTRIVLLISFIFFFGRFIYSSVGAWQHHQSKKEAQQSTLSVESP 60 MSQATSMRKRHRFNSRMTRIVL ISF+FFFGRF+YSS+GAW HHQ KKEAQQSTLSVE+P Sbjct: 1 MSQATSMRKRHRFNSRMTRIVLFISFLFFFGRFVYSSIGAWHHHQDKKEAQQSTLSVETP 60 Query: 61 VQR 63 QR Sbjct: 61 AQR 63 >UniRef50_C5BBK8 Putative uncharacterized protein n=1 Tax=Edwardsiella ictaluri 93-146 RepID=C5BBK8_EDWI9 Length = 65 Score = 57.8 bits (138), Expect = 1e-07, Method: Compositional matrix adjust. Identities = 28/52 (53%), Positives = 35/52 (67%), Gaps = 3/52 (5%) Query: 10 RHRFNSRMTRIVLLISFIFFFGRFIYSSVGAWQHHQSKKEAQQSTLSVESPV 61 + R N RMTRIVLLISFI FGR +Y+S+GA HH+S+ Q S + PV Sbjct: 8 KRRTNIRMTRIVLLISFIILFGRLLYASIGALNHHRSQ---QNSPVEQSQPV 56 >UniRef50_D0ZE62 Putative uncharacterized protein n=1 Tax=Edwardsiella tarda EIB202 RepID=D0ZE62_EDWTE Length = 66 Score = 57.8 bits (138), Expect = 1e-07, Method: Compositional matrix adjust. Identities = 30/49 (61%), Positives = 36/49 (73%), Gaps = 5/49 (10%) Query: 10 RHRFNSRMTRIVLLISFIFFFGRFIYSSVGAWQHHQSKKEAQQSTLSVE 58 + R N RMTRIVLLISFI FGR +Y+S+GA HH+S QQ+T SVE Sbjct: 9 KRRTNIRMTRIVLLISFIILFGRLLYASIGALNHHRS----QQNT-SVE 52 >UniRef50_C4GRQ4 Putative exported protein n=20 Tax=Yersinia RepID=C4GRQ4_YERPN Length = 70 Score = 52.0 bits (123), Expect = 6e-06, Method: Compositional matrix adjust. Identities = 21/49 (42%), Positives = 38/49 (77%), Gaps = 1/49 (2%) Query: 10 RHRFNSRMTRIVLLISFIFFFGRFIYSSVGAWQHHQSKKEAQQSTLSVE 58 R R ++++T+I+LL+SF+ FGR +Y+++ + HHQ ++++QQ LSVE Sbjct: 10 RRRPSTQLTKIILLVSFLILFGRLLYAAIASISHHQ-ERQSQQIELSVE 57 >UniRef50_D0KKK8 Putative uncharacterized protein n=3 Tax=Pectobacterium RepID=D0KKK8_PECWW Length = 71 Score = 49.3 bits (116), Expect = 4e-05, Method: Compositional matrix adjust. Identities = 25/56 (44%), Positives = 36/56 (64%), Gaps = 1/56 (1%) Query: 6 SMRKRHRFNSRMTRIVLLISFIFFFGRFIYSSVGAWQHHQSKKEAQQSTLSVESPV 61 S+R R R SR+ R VLLISF+ GRF YS++ A+ HHQ K++ + L + + V Sbjct: 11 SIRPR-RSGSRIARAVLLISFVILLGRFAYSTITAFGHHQDKQQQRAEQLLLPTNV 65 >UniRef50_C8Q4H6 Putative uncharacterized protein n=1 Tax=Pantoea sp. At-9b RepID=C8Q4H6_9ENTR Length = 61 Score = 48.9 bits (115), Expect = 5e-05, Method: Compositional matrix adjust. Identities = 25/48 (52%), Positives = 31/48 (64%), Gaps = 1/48 (2%) Query: 1 MSQATSMRKRHRFNSRMTRIVLLISFIFFFGRFIYSSVGAWQHHQSKK 48 M+ A R+R + S MTRIVLLISF GR I++ GA +HHQ KK Sbjct: 1 MNNAFPARRRPKTGS-MTRIVLLISFFILVGRLIFTIPGAIEHHQQKK 47 >UniRef50_A8GHQ0 Putative uncharacterized protein n=2 Tax=Serratia RepID=A8GHQ0_SERP5 Length = 59 Score = 47.8 bits (112), Expect = 1e-04, Method: Compositional matrix adjust. Identities = 21/47 (44%), Positives = 29/47 (61%) Query: 10 RHRFNSRMTRIVLLISFIFFFGRFIYSSVGAWQHHQSKKEAQQSTLS 56 R + +MT+IVL +SFI GR +Y++V A HHQ KK A T + Sbjct: 2 RKKTTGQMTKIVLFVSFIILVGRLLYAAVVAVPHHQEKKLAPYQTTT 48 >UniRef50_D2BTI4 Putative uncharacterized protein n=1 Tax=Dickeya dadantii Ech586 RepID=D2BTI4_DICD5 Length = 75 Score = 45.4 bits (106), Expect = 6e-04, Method: Compositional matrix adjust. Identities = 18/36 (50%), Positives = 27/36 (75%) Query: 14 NSRMTRIVLLISFIFFFGRFIYSSVGAWQHHQSKKE 49 +R+ R +L+ISFI GRF YS+VGA+ HHQ+ ++ Sbjct: 19 GTRIARTILMISFIILLGRFAYSAVGAFFHHQNIQQ 54 Searching..................................................done Results from round 2 Score E Sequences producing significant alignments: (bits) Value Sequences used in model and found again: UniRef50_A8AD83 Putative uncharacterized protein n=3 Tax=Enterob... 84 2e-15 UniRef50_P64546 Uncharacterized protein yfgG n=74 Tax=Enterobact... 81 2e-14 UniRef50_A8GHQ0 Putative uncharacterized protein n=2 Tax=Serrati... 63 2e-09 UniRef50_C8Q4H6 Putative uncharacterized protein n=1 Tax=Pantoea... 61 1e-08 UniRef50_D0KKK8 Putative uncharacterized protein n=3 Tax=Pectoba... 59 7e-08 UniRef50_C4GRQ4 Putative exported protein n=20 Tax=Yersinia RepI... 57 2e-07 UniRef50_D0ZE62 Putative uncharacterized protein n=1 Tax=Edwards... 54 1e-06 UniRef50_C5BBK8 Putative uncharacterized protein n=1 Tax=Edwards... 53 4e-06 UniRef50_D2BTI4 Putative uncharacterized protein n=1 Tax=Dickeya... 50 3e-05 Sequences not found previously or not previously below threshold: UniRef50_Q492F2 Putative uncharacterized protein n=1 Tax=Candida... 43 0.003 CONVERGED! >UniRef50_A8AD83 Putative uncharacterized protein n=3 Tax=Enterobacteriaceae RepID=A8AD83_CITK8 Length = 63 Score = 83.6 bits (205), Expect = 2e-15, Method: Composition-based stats. Identities = 55/63 (87%), Positives = 59/63 (93%) Query: 1 MSQATSMRKRHRFNSRMTRIVLLISFIFFFGRFIYSSVGAWQHHQSKKEAQQSTLSVESP 60 MSQATSMRKRHRFNSRMTRIVL ISF+FFFGRF+YSS+GAW HHQ KKEAQQSTLSVE+P Sbjct: 1 MSQATSMRKRHRFNSRMTRIVLFISFLFFFGRFVYSSIGAWHHHQDKKEAQQSTLSVETP 60 Query: 61 VQR 63 QR Sbjct: 61 AQR 63 >UniRef50_P64546 Uncharacterized protein yfgG n=74 Tax=Enterobacteriaceae RepID=YFGG_ECO57 Length = 63 Score = 80.5 bits (197), Expect = 2e-14, Method: Composition-based stats. Identities = 63/63 (100%), Positives = 63/63 (100%) Query: 1 MSQATSMRKRHRFNSRMTRIVLLISFIFFFGRFIYSSVGAWQHHQSKKEAQQSTLSVESP 60 MSQATSMRKRHRFNSRMTRIVLLISFIFFFGRFIYSSVGAWQHHQSKKEAQQSTLSVESP Sbjct: 1 MSQATSMRKRHRFNSRMTRIVLLISFIFFFGRFIYSSVGAWQHHQSKKEAQQSTLSVESP 60 Query: 61 VQR 63 VQR Sbjct: 61 VQR 63 >UniRef50_A8GHQ0 Putative uncharacterized protein n=2 Tax=Serratia RepID=A8GHQ0_SERP5 Length = 59 Score = 63.2 bits (152), Expect = 2e-09, Method: Composition-based stats. Identities = 21/47 (44%), Positives = 29/47 (61%) Query: 10 RHRFNSRMTRIVLLISFIFFFGRFIYSSVGAWQHHQSKKEAQQSTLS 56 R + +MT+IVL +SFI GR +Y++V A HHQ KK A T + Sbjct: 2 RKKTTGQMTKIVLFVSFIILVGRLLYAAVVAVPHHQEKKLAPYQTTT 48 >UniRef50_C8Q4H6 Putative uncharacterized protein n=1 Tax=Pantoea sp. At-9b RepID=C8Q4H6_9ENTR Length = 61 Score = 60.9 bits (146), Expect = 1e-08, Method: Composition-based stats. Identities = 25/48 (52%), Positives = 31/48 (64%), Gaps = 1/48 (2%) Query: 1 MSQATSMRKRHRFNSRMTRIVLLISFIFFFGRFIYSSVGAWQHHQSKK 48 M+ A R+R + S MTRIVLLISF GR I++ GA +HHQ KK Sbjct: 1 MNNAFPARRRPKTGS-MTRIVLLISFFILVGRLIFTIPGAIEHHQQKK 47 >UniRef50_D0KKK8 Putative uncharacterized protein n=3 Tax=Pectobacterium RepID=D0KKK8_PECWW Length = 71 Score = 58.5 bits (140), Expect = 7e-08, Method: Composition-based stats. Identities = 25/56 (44%), Positives = 36/56 (64%), Gaps = 1/56 (1%) Query: 6 SMRKRHRFNSRMTRIVLLISFIFFFGRFIYSSVGAWQHHQSKKEAQQSTLSVESPV 61 S+R R R SR+ R VLLISF+ GRF YS++ A+ HHQ K++ + L + + V Sbjct: 11 SIRPR-RSGSRIARAVLLISFVILLGRFAYSTITAFGHHQDKQQQRAEQLLLPTNV 65 >UniRef50_C4GRQ4 Putative exported protein n=20 Tax=Yersinia RepID=C4GRQ4_YERPN Length = 70 Score = 57.0 bits (136), Expect = 2e-07, Method: Composition-based stats. Identities = 21/49 (42%), Positives = 38/49 (77%), Gaps = 1/49 (2%) Query: 10 RHRFNSRMTRIVLLISFIFFFGRFIYSSVGAWQHHQSKKEAQQSTLSVE 58 R R ++++T+I+LL+SF+ FGR +Y+++ + HHQ ++++QQ LSVE Sbjct: 10 RRRPSTQLTKIILLVSFLILFGRLLYAAIASISHHQ-ERQSQQIELSVE 57 >UniRef50_D0ZE62 Putative uncharacterized protein n=1 Tax=Edwardsiella tarda EIB202 RepID=D0ZE62_EDWTE Length = 66 Score = 53.9 bits (128), Expect = 1e-06, Method: Composition-based stats. Identities = 25/53 (47%), Positives = 34/53 (64%) Query: 8 RKRHRFNSRMTRIVLLISFIFFFGRFIYSSVGAWQHHQSKKEAQQSTLSVESP 60 + + R N RMTRIVLLISFI FGR +Y+S+GA HH+S++ +P Sbjct: 7 KMKRRTNIRMTRIVLLISFIILFGRLLYASIGALNHHRSQQNTSVEQSQPLTP 59 >UniRef50_C5BBK8 Putative uncharacterized protein n=1 Tax=Edwardsiella ictaluri 93-146 RepID=C5BBK8_EDWI9 Length = 65 Score = 52.8 bits (125), Expect = 4e-06, Method: Composition-based stats. Identities = 24/47 (51%), Positives = 33/47 (70%) Query: 8 RKRHRFNSRMTRIVLLISFIFFFGRFIYSSVGAWQHHQSKKEAQQST 54 + + R N RMTRIVLLISFI FGR +Y+S+GA HH+S++ + Sbjct: 6 KMKRRTNIRMTRIVLLISFIILFGRLLYASIGALNHHRSQQNSPVEQ 52 >UniRef50_D2BTI4 Putative uncharacterized protein n=1 Tax=Dickeya dadantii Ech586 RepID=D2BTI4_DICD5 Length = 75 Score = 49.7 bits (117), Expect = 3e-05, Method: Composition-based stats. Identities = 18/47 (38%), Positives = 30/47 (63%) Query: 13 FNSRMTRIVLLISFIFFFGRFIYSSVGAWQHHQSKKEAQQSTLSVES 59 +R+ R +L+ISFI GRF YS+VGA+ HHQ+ ++ + + + Sbjct: 18 SGTRIARTILMISFIILLGRFAYSAVGAFFHHQNIQQQRVTQPVAPT 64 >UniRef50_Q492F2 Putative uncharacterized protein n=1 Tax=Candidatus Blochmannia pennsylvanicus str. BPEN RepID=Q492F2_BLOPB Length = 66 Score = 42.8 bits (99), Expect = 0.003, Method: Composition-based stats. Identities = 14/47 (29%), Positives = 27/47 (57%) Query: 1 MSQATSMRKRHRFNSRMTRIVLLISFIFFFGRFIYSSVGAWQHHQSK 47 ++ + + + R+ + +LLISF F R IY S+ AW +H+++ Sbjct: 5 LNNFHIIFMKRKKKIRVAKWILLISFSILFARLIYVSINAWNYHKNR 51 Database: uniref50.fasta Posted date: Mar 8, 2010 10:38 AM Number of letters in database: 1,040,396,356 Number of sequences in database: 3,077,464 Lambda K H 0.313 0.133 0.337 Lambda K H 0.267 0.0399 0.140 Matrix: BLOSUM62 Gap Penalties: Existence: 11, Extension: 1 Number of Hits to DB: 192,448,019 Number of Sequences: 3077464 Number of extensions: 4796728 Number of successful extensions: 36963 Number of sequences better than 1.0e-01: 12 Number of HSP's better than 0.1 without gapping: 23 Number of HSP's successfully gapped in prelim test: 0 Number of HSP's that attempted gapping in prelim test: 36938 Number of HSP's gapped (non-prelim): 23 length of query: 63 length of database: 1,040,396,356 effective HSP length: 35 effective length of query: 28 effective length of database: 932,685,116 effective search space: 26115183248 effective search space used: 26115183248 T: 11 A: 40 X1: 16 ( 7.2 bits) X2: 38 (14.6 bits) X3: 64 (24.7 bits) S1: 40 (21.0 bits) S2: 87 (38.2 bits)