BLASTP 2.2.22 [Sep-27-2009] Reference: Altschul, Stephen F., Thomas L. Madden, Alejandro A. Schaffer, Jinghui Zhang, Zheng Zhang, Webb Miller, and David J. Lipman (1997), "Gapped BLAST and PSI-BLAST: a new generation of protein database search programs", Nucleic Acids Res. 25:3389-3402. Reference for compositional score matrix adjustment: Altschul, Stephen F., John C. Wootton, E. Michael Gertz, Richa Agarwala, Aleksandr Morgulis, Alejandro A. Schaffer, and Yi-Kuo Yu (2005) "Protein database searches using compositionally adjusted substitution matrices", FEBS J. 272:5101-5109. Reference for composition-based statistics starting in round 2: Schaffer, Alejandro A., L. Aravind, Thomas L. Madden, Sergei Shavirin, John L. Spouge, Yuri I. Wolf, Eugene V. Koonin, and Stephen F. Altschul (2001), "Improving the accuracy of PSI-BLAST protein database searches with composition-based statistics and other refinements", Nucleic Acids Res. 29:2994-3005. Query= batch____ (78 letters) Database: uniref50.fasta 3,077,464 sequences; 1,040,396,356 total letters Searching..................................................done Results from round 1 Score E Sequences producing significant alignments: (bits) Value UniRef50_P64504 Uncharacterized protein yebV n=58 Tax=Bacteria R... 164 9e-40 UniRef50_C9XUI3 Uncharacterized protein yebV n=46 Tax=Enterobact... 144 8e-34 UniRef50_Q2NTD7 Putative uncharacterized protein n=24 Tax=Entero... 94 2e-18 UniRef50_Q7N3N0 Similar to unknown protein Yebv of Escherichia c... 93 3e-18 UniRef50_B2Q4E3 Putative uncharacterized protein n=4 Tax=Provide... 79 4e-14 UniRef50_D2TVU2 Putative uncharacterized protein n=1 Tax=Arsenop... 74 1e-12 UniRef50_B4EVN5 Putative uncharacterized protein n=3 Tax=Proteus... 74 1e-12 UniRef50_C8Q9A9 Putative uncharacterized protein n=1 Tax=Pantoea... 65 7e-10 >UniRef50_P64504 Uncharacterized protein yebV n=58 Tax=Bacteria RepID=YEBV_ECOL6 Length = 78 Score = 164 bits (415), Expect = 9e-40, Method: Compositional matrix adjust. Identities = 78/78 (100%), Positives = 78/78 (100%) Query: 1 MKTSVRIGAFEIDDGELHGESPGDRTLTIPCKSDPDLCMQLDAWDAETSIPALLNGEHSV 60 MKTSVRIGAFEIDDGELHGESPGDRTLTIPCKSDPDLCMQLDAWDAETSIPALLNGEHSV Sbjct: 1 MKTSVRIGAFEIDDGELHGESPGDRTLTIPCKSDPDLCMQLDAWDAETSIPALLNGEHSV 60 Query: 61 LYRTRYDQQSDAWIMRLA 78 LYRTRYDQQSDAWIMRLA Sbjct: 61 LYRTRYDQQSDAWIMRLA 78 >UniRef50_C9XUI3 Uncharacterized protein yebV n=46 Tax=Enterobacteriaceae RepID=C9XUI3_CROTZ Length = 79 Score = 144 bits (363), Expect = 8e-34, Method: Compositional matrix adjust. Identities = 65/77 (84%), Positives = 71/77 (92%) Query: 2 KTSVRIGAFEIDDGELHGESPGDRTLTIPCKSDPDLCMQLDAWDAETSIPALLNGEHSVL 61 KTSV+IG FEIDD ELHG +PG+R L+IPCKSDPDLCMQLDAWDAETSIPA+LNGEHSVL Sbjct: 3 KTSVKIGTFEIDDAELHGNAPGERKLSIPCKSDPDLCMQLDAWDAETSIPAVLNGEHSVL 62 Query: 62 YRTRYDQQSDAWIMRLA 78 YR YDQQ+DAWIMRLA Sbjct: 63 YREHYDQQADAWIMRLA 79 >UniRef50_Q2NTD7 Putative uncharacterized protein n=24 Tax=Enterobacteriaceae RepID=Q2NTD7_SODGM Length = 81 Score = 93.6 bits (231), Expect = 2e-18, Method: Compositional matrix adjust. Identities = 44/81 (54%), Positives = 57/81 (70%), Gaps = 8/81 (9%) Query: 2 KTSVRIGAFEIDDGELHGESPGD-----RTLTIPCKSDPDLCMQLDAWDAETSIPALLNG 56 KT+V+IG+F +DD L S GD ++IPCKSDPDLC+QLD WD +T IPA L+G Sbjct: 3 KTTVQIGSFTVDDATL---SSGDDQRVKNIVSIPCKSDPDLCLQLDGWDEQTGIPAFLDG 59 Query: 57 EHSVLYRTRYDQQSDAWIMRL 77 + +L+R YD +DAWIMRL Sbjct: 60 KEHILFRQHYDHDNDAWIMRL 80 >UniRef50_Q7N3N0 Similar to unknown protein Yebv of Escherichia coli n=6 Tax=Enterobacteriaceae RepID=Q7N3N0_PHOLL Length = 80 Score = 93.2 bits (230), Expect = 3e-18, Method: Compositional matrix adjust. Identities = 43/78 (55%), Positives = 55/78 (70%), Gaps = 1/78 (1%) Query: 2 KTSVRIGAFEIDDGELHGESPGDR-TLTIPCKSDPDLCMQLDAWDAETSIPALLNGEHSV 60 +T +RI ++EIDD L S D T++IPC SD +LCMQLD WD TSIPALLN +H + Sbjct: 3 QTKLRISSYEIDDAILSSPSQEDDITISIPCNSDQELCMQLDGWDEHTSIPALLNDKHIL 62 Query: 61 LYRTRYDQQSDAWIMRLA 78 LYR YD+Q AW+MR+ Sbjct: 63 LYRKHYDRQKHAWVMRVV 80 >UniRef50_B2Q4E3 Putative uncharacterized protein n=4 Tax=Providencia RepID=B2Q4E3_PROST Length = 77 Score = 79.0 bits (193), Expect = 4e-14, Method: Compositional matrix adjust. Identities = 35/76 (46%), Positives = 51/76 (67%), Gaps = 2/76 (2%) Query: 3 TSVRIGAFEIDDGELHGESPGDRTLTIPCKSDPDLCMQLDAWDAETSIPALLNGEHSVLY 62 +++I ++EI+D + TL+IPC SD + CMQLD WD TSIPAL+N + +LY Sbjct: 4 VNLKISSYEINDAIMSDHKSD--TLSIPCDSDTEFCMQLDGWDENTSIPALINEKPVLLY 61 Query: 63 RTRYDQQSDAWIMRLA 78 R RYD+ + W+MR+A Sbjct: 62 RHRYDKDNHHWVMRVA 77 >UniRef50_D2TVU2 Putative uncharacterized protein n=1 Tax=Arsenophonus nasoniae RepID=D2TVU2_9ENTR Length = 82 Score = 74.3 bits (181), Expect = 1e-12, Method: Compositional matrix adjust. Identities = 32/77 (41%), Positives = 48/77 (62%), Gaps = 2/77 (2%) Query: 2 KTSVRIGAFEIDDGELHGESPGDRTLTIPCKSDPDLCMQLDAWDAETSIPALLNGEHSVL 61 + ++I + IDD L + T++IPC+SD + CMQLD WD TSIPA L+ + +L Sbjct: 8 RAKLKISTYVIDDALL--SDAAENTISIPCESDTEFCMQLDGWDENTSIPATLDNKPVLL 65 Query: 62 YRTRYDQQSDAWIMRLA 78 YR YD++ W+MR+ Sbjct: 66 YRQYYDKKKHHWVMRIV 82 >UniRef50_B4EVN5 Putative uncharacterized protein n=3 Tax=Proteus RepID=B4EVN5_PROMH Length = 80 Score = 73.9 bits (180), Expect = 1e-12, Method: Compositional matrix adjust. Identities = 34/78 (43%), Positives = 50/78 (64%), Gaps = 1/78 (1%) Query: 2 KTSVRIGAFEIDDGELHGESPGDRT-LTIPCKSDPDLCMQLDAWDAETSIPALLNGEHSV 60 KT+++I +EI+D L S T + IPC D + CMQLD WD TSIPA L ++ + Sbjct: 3 KTTLKIACYEIEDVTLKHSSDNQLTYVHIPCDYDKEFCMQLDGWDENTSIPAQLKDKNIL 62 Query: 61 LYRTRYDQQSDAWIMRLA 78 LYR YD+++ WI+++A Sbjct: 63 LYRHAYDKENHHWILKVA 80 >UniRef50_C8Q9A9 Putative uncharacterized protein n=1 Tax=Pantoea sp. At-9b RepID=C8Q9A9_9ENTR Length = 77 Score = 65.1 bits (157), Expect = 7e-10, Method: Compositional matrix adjust. Identities = 32/76 (42%), Positives = 46/76 (60%), Gaps = 2/76 (2%) Query: 1 MKTSVRIGAFEIDDGELHGESPGDRTLTIPCKSDPDLCMQLDAWDAETSIPALLNGEHSV 60 M V+IG +EI D EL+ T++IPC ++P L QLD WD ETS+PA ++G Sbjct: 1 MGHVVKIGRYEIIDAELNAGKL--ETVSIPCLTNPGLSYQLDGWDHETSVPAWIDGNPVD 58 Query: 61 LYRTRYDQQSDAWIMR 76 L YD+Q D W+++ Sbjct: 59 LEIGHYDKQQDRWVLK 74 Searching..................................................done Results from round 2 Score E Sequences producing significant alignments: (bits) Value Sequences used in model and found again: UniRef50_P64504 Uncharacterized protein yebV n=58 Tax=Bacteria R... 132 3e-30 UniRef50_C9XUI3 Uncharacterized protein yebV n=46 Tax=Enterobact... 131 6e-30 UniRef50_D2TVU2 Putative uncharacterized protein n=1 Tax=Arsenop... 120 9e-27 UniRef50_Q7N3N0 Similar to unknown protein Yebv of Escherichia c... 120 2e-26 UniRef50_B2Q4E3 Putative uncharacterized protein n=4 Tax=Provide... 119 4e-26 UniRef50_B4EVN5 Putative uncharacterized protein n=3 Tax=Proteus... 117 9e-26 UniRef50_Q2NTD7 Putative uncharacterized protein n=24 Tax=Entero... 115 3e-25 UniRef50_C8Q9A9 Putative uncharacterized protein n=1 Tax=Pantoea... 112 3e-24 Sequences not found previously or not previously below threshold: CONVERGED! >UniRef50_P64504 Uncharacterized protein yebV n=58 Tax=Bacteria RepID=YEBV_ECOL6 Length = 78 Score = 132 bits (333), Expect = 3e-30, Method: Composition-based stats. Identities = 78/78 (100%), Positives = 78/78 (100%) Query: 1 MKTSVRIGAFEIDDGELHGESPGDRTLTIPCKSDPDLCMQLDAWDAETSIPALLNGEHSV 60 MKTSVRIGAFEIDDGELHGESPGDRTLTIPCKSDPDLCMQLDAWDAETSIPALLNGEHSV Sbjct: 1 MKTSVRIGAFEIDDGELHGESPGDRTLTIPCKSDPDLCMQLDAWDAETSIPALLNGEHSV 60 Query: 61 LYRTRYDQQSDAWIMRLA 78 LYRTRYDQQSDAWIMRLA Sbjct: 61 LYRTRYDQQSDAWIMRLA 78 >UniRef50_C9XUI3 Uncharacterized protein yebV n=46 Tax=Enterobacteriaceae RepID=C9XUI3_CROTZ Length = 79 Score = 131 bits (330), Expect = 6e-30, Method: Composition-based stats. Identities = 65/77 (84%), Positives = 71/77 (92%) Query: 2 KTSVRIGAFEIDDGELHGESPGDRTLTIPCKSDPDLCMQLDAWDAETSIPALLNGEHSVL 61 KTSV+IG FEIDD ELHG +PG+R L+IPCKSDPDLCMQLDAWDAETSIPA+LNGEHSVL Sbjct: 3 KTSVKIGTFEIDDAELHGNAPGERKLSIPCKSDPDLCMQLDAWDAETSIPAVLNGEHSVL 62 Query: 62 YRTRYDQQSDAWIMRLA 78 YR YDQQ+DAWIMRLA Sbjct: 63 YREHYDQQADAWIMRLA 79 >UniRef50_D2TVU2 Putative uncharacterized protein n=1 Tax=Arsenophonus nasoniae RepID=D2TVU2_9ENTR Length = 82 Score = 120 bits (302), Expect = 9e-27, Method: Composition-based stats. Identities = 32/77 (41%), Positives = 48/77 (62%), Gaps = 2/77 (2%) Query: 2 KTSVRIGAFEIDDGELHGESPGDRTLTIPCKSDPDLCMQLDAWDAETSIPALLNGEHSVL 61 + ++I + IDD L + T++IPC+SD + CMQLD WD TSIPA L+ + +L Sbjct: 8 RAKLKISTYVIDDALL--SDAAENTISIPCESDTEFCMQLDGWDENTSIPATLDNKPVLL 65 Query: 62 YRTRYDQQSDAWIMRLA 78 YR YD++ W+MR+ Sbjct: 66 YRQYYDKKKHHWVMRIV 82 >UniRef50_Q7N3N0 Similar to unknown protein Yebv of Escherichia coli n=6 Tax=Enterobacteriaceae RepID=Q7N3N0_PHOLL Length = 80 Score = 120 bits (300), Expect = 2e-26, Method: Composition-based stats. Identities = 43/78 (55%), Positives = 55/78 (70%), Gaps = 1/78 (1%) Query: 2 KTSVRIGAFEIDDGELHGESPGDR-TLTIPCKSDPDLCMQLDAWDAETSIPALLNGEHSV 60 +T +RI ++EIDD L S D T++IPC SD +LCMQLD WD TSIPALLN +H + Sbjct: 3 QTKLRISSYEIDDAILSSPSQEDDITISIPCNSDQELCMQLDGWDEHTSIPALLNDKHIL 62 Query: 61 LYRTRYDQQSDAWIMRLA 78 LYR YD+Q AW+MR+ Sbjct: 63 LYRKHYDRQKHAWVMRVV 80 >UniRef50_B2Q4E3 Putative uncharacterized protein n=4 Tax=Providencia RepID=B2Q4E3_PROST Length = 77 Score = 119 bits (297), Expect = 4e-26, Method: Composition-based stats. Identities = 35/76 (46%), Positives = 51/76 (67%), Gaps = 2/76 (2%) Query: 3 TSVRIGAFEIDDGELHGESPGDRTLTIPCKSDPDLCMQLDAWDAETSIPALLNGEHSVLY 62 +++I ++EI+D + TL+IPC SD + CMQLD WD TSIPAL+N + +LY Sbjct: 4 VNLKISSYEINDAIMSDHKS--DTLSIPCDSDTEFCMQLDGWDENTSIPALINEKPVLLY 61 Query: 63 RTRYDQQSDAWIMRLA 78 R RYD+ + W+MR+A Sbjct: 62 RHRYDKDNHHWVMRVA 77 >UniRef50_B4EVN5 Putative uncharacterized protein n=3 Tax=Proteus RepID=B4EVN5_PROMH Length = 80 Score = 117 bits (294), Expect = 9e-26, Method: Composition-based stats. Identities = 34/78 (43%), Positives = 50/78 (64%), Gaps = 1/78 (1%) Query: 2 KTSVRIGAFEIDDGELHGESPGDRT-LTIPCKSDPDLCMQLDAWDAETSIPALLNGEHSV 60 KT+++I +EI+D L S T + IPC D + CMQLD WD TSIPA L ++ + Sbjct: 3 KTTLKIACYEIEDVTLKHSSDNQLTYVHIPCDYDKEFCMQLDGWDENTSIPAQLKDKNIL 62 Query: 61 LYRTRYDQQSDAWIMRLA 78 LYR YD+++ WI+++A Sbjct: 63 LYRHAYDKENHHWILKVA 80 >UniRef50_Q2NTD7 Putative uncharacterized protein n=24 Tax=Enterobacteriaceae RepID=Q2NTD7_SODGM Length = 81 Score = 115 bits (289), Expect = 3e-25, Method: Composition-based stats. Identities = 41/78 (52%), Positives = 54/78 (69%), Gaps = 2/78 (2%) Query: 2 KTSVRIGAFEIDDGELHGESPG--DRTLTIPCKSDPDLCMQLDAWDAETSIPALLNGEHS 59 KT+V+IG+F +DD L ++IPCKSDPDLC+QLD WD +T IPA L+G+ Sbjct: 3 KTTVQIGSFTVDDATLSSGDDQRVKNIVSIPCKSDPDLCLQLDGWDEQTGIPAFLDGKEH 62 Query: 60 VLYRTRYDQQSDAWIMRL 77 +L+R YD +DAWIMRL Sbjct: 63 ILFRQHYDHDNDAWIMRL 80 >UniRef50_C8Q9A9 Putative uncharacterized protein n=1 Tax=Pantoea sp. At-9b RepID=C8Q9A9_9ENTR Length = 77 Score = 112 bits (281), Expect = 3e-24, Method: Composition-based stats. Identities = 32/76 (42%), Positives = 46/76 (60%), Gaps = 2/76 (2%) Query: 1 MKTSVRIGAFEIDDGELHGESPGDRTLTIPCKSDPDLCMQLDAWDAETSIPALLNGEHSV 60 M V+IG +EI D EL+ T++IPC ++P L QLD WD ETS+PA ++G Sbjct: 1 MGHVVKIGRYEIIDAELNAGKL--ETVSIPCLTNPGLSYQLDGWDHETSVPAWIDGNPVD 58 Query: 61 LYRTRYDQQSDAWIMR 76 L YD+Q D W+++ Sbjct: 59 LEIGHYDKQQDRWVLK 74 Database: uniref50.fasta Posted date: Mar 8, 2010 10:38 AM Number of letters in database: 1,040,396,356 Number of sequences in database: 3,077,464 Lambda K H 0.307 0.135 0.410 Lambda K H 0.267 0.0411 0.140 Matrix: BLOSUM62 Gap Penalties: Existence: 11, Extension: 1 Number of Hits to DB: 320,796,202 Number of Sequences: 3077464 Number of extensions: 10603506 Number of successful extensions: 22603 Number of sequences better than 1.0e-01: 8 Number of HSP's better than 0.1 without gapping: 16 Number of HSP's successfully gapped in prelim test: 0 Number of HSP's that attempted gapping in prelim test: 22578 Number of HSP's gapped (non-prelim): 16 length of query: 78 length of database: 1,040,396,356 effective HSP length: 49 effective length of query: 29 effective length of database: 889,600,620 effective search space: 25798417980 effective search space used: 25798417980 T: 11 A: 40 X1: 16 ( 7.1 bits) X2: 38 (14.6 bits) X3: 64 (24.7 bits) S1: 41 (21.1 bits) S2: 87 (38.1 bits)