BLASTP 2.2.22 [Sep-27-2009] Reference: Altschul, Stephen F., Thomas L. Madden, Alejandro A. Schaffer, Jinghui Zhang, Zheng Zhang, Webb Miller, and David J. Lipman (1997), "Gapped BLAST and PSI-BLAST: a new generation of protein database search programs", Nucleic Acids Res. 25:3389-3402. Reference for compositional score matrix adjustment: Altschul, Stephen F., John C. Wootton, E. Michael Gertz, Richa Agarwala, Aleksandr Morgulis, Alejandro A. Schaffer, and Yi-Kuo Yu (2005) "Protein database searches using compositionally adjusted substitution matrices", FEBS J. 272:5101-5109. Reference for composition-based statistics starting in round 2: Schaffer, Alejandro A., L. Aravind, Thomas L. Madden, Sergei Shavirin, John L. Spouge, Yuri I. Wolf, Eugene V. Koonin, and Stephen F. Altschul (2001), "Improving the accuracy of PSI-BLAST protein database searches with composition-based statistics and other refinements", Nucleic Acids Res. 29:2994-3005. Query= batch____ (74 letters) Database: uniref50.fasta 3,077,464 sequences; 1,040,396,356 total letters Searching..................................................done Results from round 1 Score E Sequences producing significant alignments: (bits) Value UniRef50_P0AD08 Uncharacterized protein yecF n=137 Tax=Enterobac... 150 1e-35 UniRef50_C9XV62 Uncharacterized protein yecF n=8 Tax=Enterobacte... 123 2e-27 UniRef50_B2Q4D4 Putative uncharacterized protein n=3 Tax=Provide... 75 7e-13 UniRef50_D2TVT7 Putative uncharacterized protein n=1 Tax=Arsenop... 73 3e-12 UniRef50_Q7N5C5 Similar to unknown protein YecF of Escherichia c... 73 3e-12 UniRef50_UPI000190E69C hypothetical protein SentesTyp_17354 n=1 ... 52 7e-06 >UniRef50_P0AD08 Uncharacterized protein yecF n=137 Tax=Enterobacteriaceae RepID=YECF_ECOL6 Length = 74 Score = 150 bits (379), Expect = 1e-35, Method: Compositional matrix adjust. Identities = 74/74 (100%), Positives = 74/74 (100%) Query: 1 MSTPDFSTAENNQELANEVSCLKAMLTLMLQAMGQADAGRVMLKMEKQLALIEDETQAAV 60 MSTPDFSTAENNQELANEVSCLKAMLTLMLQAMGQADAGRVMLKMEKQLALIEDETQAAV Sbjct: 1 MSTPDFSTAENNQELANEVSCLKAMLTLMLQAMGQADAGRVMLKMEKQLALIEDETQAAV 60 Query: 61 FSKTVKQIKQAYRQ 74 FSKTVKQIKQAYRQ Sbjct: 61 FSKTVKQIKQAYRQ 74 >UniRef50_C9XV62 Uncharacterized protein yecF n=8 Tax=Enterobacteriaceae RepID=C9XV62_CROTZ Length = 74 Score = 123 bits (308), Expect = 2e-27, Method: Compositional matrix adjust. Identities = 59/74 (79%), Positives = 67/74 (90%) Query: 1 MSTPDFSTAENNQELANEVSCLKAMLTLMLQAMGQADAGRVMLKMEKQLALIEDETQAAV 60 MS PD STAENNQELA EVSCLK++LTLMLQAMGQADAGRV+LKMEKQ+A +ED QAAV Sbjct: 1 MSAPDISTAENNQELATEVSCLKSLLTLMLQAMGQADAGRVILKMEKQIAEMEDAEQAAV 60 Query: 61 FSKTVKQIKQAYRQ 74 ++ TVKQIKQAYR+ Sbjct: 61 YTNTVKQIKQAYRR 74 >UniRef50_B2Q4D4 Putative uncharacterized protein n=3 Tax=Providencia RepID=B2Q4D4_PROST Length = 73 Score = 75.1 bits (183), Expect = 7e-13, Method: Compositional matrix adjust. Identities = 33/69 (47%), Positives = 51/69 (73%) Query: 6 FSTAENNQELANEVSCLKAMLTLMLQAMGQADAGRVMLKMEKQLALIEDETQAAVFSKTV 65 F T + ELA EV+C+K +L ML+A+GQADAG++++KME+++ +EDE QA + T+ Sbjct: 5 FPTQPDVNELAAEVTCIKNLLAHMLKAIGQADAGKILIKMEREIVSMEDEKQAETYRNTL 64 Query: 66 KQIKQAYRQ 74 +QIK +RQ Sbjct: 65 EQIKAGFRQ 73 >UniRef50_D2TVT7 Putative uncharacterized protein n=1 Tax=Arsenophonus nasoniae RepID=D2TVT7_9ENTR Length = 73 Score = 73.2 bits (178), Expect = 3e-12, Method: Compositional matrix adjust. Identities = 32/71 (45%), Positives = 49/71 (69%) Query: 3 TPDFSTAENNQELANEVSCLKAMLTLMLQAMGQADAGRVMLKMEKQLALIEDETQAAVFS 62 T D S+ N + +A E+ CLK +T +L+A+ QADAGR +L +EK++ ++D QA VF Sbjct: 2 TNDCSSTPNTETIATEIGCLKTFITHILKALNQADAGRAILNIEKEMLTMQDPKQAEVFK 61 Query: 63 KTVKQIKQAYR 73 + ++QIK AYR Sbjct: 62 RIIEQIKTAYR 72 >UniRef50_Q7N5C5 Similar to unknown protein YecF of Escherichia coli n=1 Tax=Photorhabdus luminescens subsp. laumondii RepID=Q7N5C5_PHOLL Length = 74 Score = 72.8 bits (177), Expect = 3e-12, Method: Compositional matrix adjust. Identities = 35/74 (47%), Positives = 50/74 (67%) Query: 1 MSTPDFSTAENNQELANEVSCLKAMLTLMLQAMGQADAGRVMLKMEKQLALIEDETQAAV 60 M+ D + + + LA EV+CLK +L +L+ MGQA AGRV+L +E+ +A + DE QA + Sbjct: 1 MNKIDLIPSSDTETLATEVTCLKVLLASILKTMGQAHAGRVVLNLERVIAEMGDEKQAKI 60 Query: 61 FSKTVKQIKQAYRQ 74 F TV+QIK YRQ Sbjct: 61 FENTVQQIKALYRQ 74 >UniRef50_UPI000190E69C hypothetical protein SentesTyp_17354 n=1 Tax=Salmonella enterica subsp. enterica serovar Typhi str. E98-2068 RepID=UPI000190E69C Length = 73 Score = 51.6 bits (122), Expect = 7e-06, Method: Compositional matrix adjust. Identities = 23/29 (79%), Positives = 27/29 (93%) Query: 33 MGQADAGRVMLKMEKQLALIEDETQAAVF 61 MGQADAGRV+LKMEKQ+A ++DE QAAVF Sbjct: 1 MGQADAGRVILKMEKQIAQMDDEAQAAVF 29 Searching..................................................done Results from round 2 Score E Sequences producing significant alignments: (bits) Value Sequences used in model and found again: UniRef50_Q7N5C5 Similar to unknown protein YecF of Escherichia c... 100 1e-20 UniRef50_D2TVT7 Putative uncharacterized protein n=1 Tax=Arsenop... 100 2e-20 UniRef50_B2Q4D4 Putative uncharacterized protein n=3 Tax=Provide... 98 8e-20 UniRef50_C9XV62 Uncharacterized protein yecF n=8 Tax=Enterobacte... 96 3e-19 UniRef50_P0AD08 Uncharacterized protein yecF n=137 Tax=Enterobac... 95 5e-19 UniRef50_UPI000190E69C hypothetical protein SentesTyp_17354 n=1 ... 51 1e-05 Sequences not found previously or not previously below threshold: CONVERGED! >UniRef50_Q7N5C5 Similar to unknown protein YecF of Escherichia coli n=1 Tax=Photorhabdus luminescens subsp. laumondii RepID=Q7N5C5_PHOLL Length = 74 Score = 100 bits (249), Expect = 1e-20, Method: Composition-based stats. Identities = 35/74 (47%), Positives = 50/74 (67%) Query: 1 MSTPDFSTAENNQELANEVSCLKAMLTLMLQAMGQADAGRVMLKMEKQLALIEDETQAAV 60 M+ D + + + LA EV+CLK +L +L+ MGQA AGRV+L +E+ +A + DE QA + Sbjct: 1 MNKIDLIPSSDTETLATEVTCLKVLLASILKTMGQAHAGRVVLNLERVIAEMGDEKQAKI 60 Query: 61 FSKTVKQIKQAYRQ 74 F TV+QIK YRQ Sbjct: 61 FENTVQQIKALYRQ 74 >UniRef50_D2TVT7 Putative uncharacterized protein n=1 Tax=Arsenophonus nasoniae RepID=D2TVT7_9ENTR Length = 73 Score = 99.8 bits (247), Expect = 2e-20, Method: Composition-based stats. Identities = 32/71 (45%), Positives = 49/71 (69%) Query: 3 TPDFSTAENNQELANEVSCLKAMLTLMLQAMGQADAGRVMLKMEKQLALIEDETQAAVFS 62 T D S+ N + +A E+ CLK +T +L+A+ QADAGR +L +EK++ ++D QA VF Sbjct: 2 TNDCSSTPNTETIATEIGCLKTFITHILKALNQADAGRAILNIEKEMLTMQDPKQAEVFK 61 Query: 63 KTVKQIKQAYR 73 + ++QIK AYR Sbjct: 62 RIIEQIKTAYR 72 >UniRef50_B2Q4D4 Putative uncharacterized protein n=3 Tax=Providencia RepID=B2Q4D4_PROST Length = 73 Score = 98.3 bits (243), Expect = 8e-20, Method: Composition-based stats. Identities = 33/69 (47%), Positives = 51/69 (73%) Query: 6 FSTAENNQELANEVSCLKAMLTLMLQAMGQADAGRVMLKMEKQLALIEDETQAAVFSKTV 65 F T + ELA EV+C+K +L ML+A+GQADAG++++KME+++ +EDE QA + T+ Sbjct: 5 FPTQPDVNELAAEVTCIKNLLAHMLKAIGQADAGKILIKMEREIVSMEDEKQAETYRNTL 64 Query: 66 KQIKQAYRQ 74 +QIK +RQ Sbjct: 65 EQIKAGFRQ 73 >UniRef50_C9XV62 Uncharacterized protein yecF n=8 Tax=Enterobacteriaceae RepID=C9XV62_CROTZ Length = 74 Score = 95.9 bits (237), Expect = 3e-19, Method: Composition-based stats. Identities = 59/74 (79%), Positives = 67/74 (90%) Query: 1 MSTPDFSTAENNQELANEVSCLKAMLTLMLQAMGQADAGRVMLKMEKQLALIEDETQAAV 60 MS PD STAENNQELA EVSCLK++LTLMLQAMGQADAGRV+LKMEKQ+A +ED QAAV Sbjct: 1 MSAPDISTAENNQELATEVSCLKSLLTLMLQAMGQADAGRVILKMEKQIAEMEDAEQAAV 60 Query: 61 FSKTVKQIKQAYRQ 74 ++ TVKQIKQAYR+ Sbjct: 61 YTNTVKQIKQAYRR 74 >UniRef50_P0AD08 Uncharacterized protein yecF n=137 Tax=Enterobacteriaceae RepID=YECF_ECOL6 Length = 74 Score = 95.2 bits (235), Expect = 5e-19, Method: Composition-based stats. Identities = 74/74 (100%), Positives = 74/74 (100%) Query: 1 MSTPDFSTAENNQELANEVSCLKAMLTLMLQAMGQADAGRVMLKMEKQLALIEDETQAAV 60 MSTPDFSTAENNQELANEVSCLKAMLTLMLQAMGQADAGRVMLKMEKQLALIEDETQAAV Sbjct: 1 MSTPDFSTAENNQELANEVSCLKAMLTLMLQAMGQADAGRVMLKMEKQLALIEDETQAAV 60 Query: 61 FSKTVKQIKQAYRQ 74 FSKTVKQIKQAYRQ Sbjct: 61 FSKTVKQIKQAYRQ 74 >UniRef50_UPI000190E69C hypothetical protein SentesTyp_17354 n=1 Tax=Salmonella enterica subsp. enterica serovar Typhi str. E98-2068 RepID=UPI000190E69C Length = 73 Score = 50.9 bits (120), Expect = 1e-05, Method: Composition-based stats. Identities = 23/29 (79%), Positives = 27/29 (93%) Query: 33 MGQADAGRVMLKMEKQLALIEDETQAAVF 61 MGQADAGRV+LKMEKQ+A ++DE QAAVF Sbjct: 1 MGQADAGRVILKMEKQIAQMDDEAQAAVF 29 Database: uniref50.fasta Posted date: Mar 8, 2010 10:38 AM Number of letters in database: 1,040,396,356 Number of sequences in database: 3,077,464 Lambda K H 0.305 0.130 0.318 Lambda K H 0.267 0.0406 0.140 Matrix: BLOSUM62 Gap Penalties: Existence: 11, Extension: 1 Number of Hits to DB: 212,110,409 Number of Sequences: 3077464 Number of extensions: 5101631 Number of successful extensions: 17583 Number of sequences better than 1.0e-01: 6 Number of HSP's better than 0.1 without gapping: 12 Number of HSP's successfully gapped in prelim test: 0 Number of HSP's that attempted gapping in prelim test: 17571 Number of HSP's gapped (non-prelim): 12 length of query: 74 length of database: 1,040,396,356 effective HSP length: 45 effective length of query: 29 effective length of database: 901,910,476 effective search space: 26155403804 effective search space used: 26155403804 T: 11 A: 40 X1: 16 ( 7.0 bits) X2: 38 (14.6 bits) X3: 64 (24.7 bits) S1: 41 (21.0 bits) S2: 87 (38.1 bits)