BLASTP 2.2.22 [Sep-27-2009] Reference: Altschul, Stephen F., Thomas L. Madden, Alejandro A. Schaffer, Jinghui Zhang, Zheng Zhang, Webb Miller, and David J. Lipman (1997), "Gapped BLAST and PSI-BLAST: a new generation of protein database search programs", Nucleic Acids Res. 25:3389-3402. Reference for compositional score matrix adjustment: Altschul, Stephen F., John C. Wootton, E. Michael Gertz, Richa Agarwala, Aleksandr Morgulis, Alejandro A. Schaffer, and Yi-Kuo Yu (2005) "Protein database searches using compositionally adjusted substitution matrices", FEBS J. 272:5101-5109. Reference for composition-based statistics starting in round 2: Schaffer, Alejandro A., L. Aravind, Thomas L. Madden, Sergei Shavirin, John L. Spouge, Yuri I. Wolf, Eugene V. Koonin, and Stephen F. Altschul (2001), "Improving the accuracy of PSI-BLAST protein database searches with composition-based statistics and other refinements", Nucleic Acids Res. 29:2994-3005. Query= batch____ (535 letters) Database: uniref50.fasta 3,077,464 sequences; 1,040,396,356 total letters Searching..................................................done Results from round 1 Score E Sequences producing significant alignments: (bits) Value UniRef50_P37629 Putative uncharacterized protein yhiL n=40 Tax=E... 1113 0.0 UniRef50_P37627 Uncharacterized protein yhiJ n=45 Tax=Bacteria R... 237 8e-61 >UniRef50_P37629 Putative uncharacterized protein yhiL n=40 Tax=Enterobacteriaceae RepID=YHIL_ECOLI Length = 535 Score = 1113 bits (2879), Expect = 0.0, Method: Compositional matrix adjust. Identities = 535/535 (100%), Positives = 535/535 (100%) Query: 1 MKAIDNQIRNISSSHQDKHSDKVNSHQHHGKVDKTHRAKIVEFDKLDNDSQIDNDFGLHI 60 MKAIDNQIRNISSSHQDKHSDKVNSHQHHGKVDKTHRAKIVEFDKLDNDSQIDNDFGLHI Sbjct: 1 MKAIDNQIRNISSSHQDKHSDKVNSHQHHGKVDKTHRAKIVEFDKLDNDSQIDNDFGLHI 60 Query: 61 IYFLQHGHWKVNDRSHQMEKVWFYNSEPSIDIQEYNRFADNTTDTFIFTIIPDNNHVLKL 120 IYFLQHGHWKVNDRSHQMEKVWFYNSEPSIDIQEYNRFADNTTDTFIFTIIPDNNHVLKL Sbjct: 61 IYFLQHGHWKVNDRSHQMEKVWFYNSEPSIDIQEYNRFADNTTDTFIFTIIPDNNHVLKL 120 Query: 121 SSPITVTVECKGGYYFINSSGDKSDIIYKVDGLSIIARNFFTLLSGNFKPDWRWDVSKET 180 SSPITVTVECKGGYYFINSSGDKSDIIYKVDGLSIIARNFFTLLSGNFKPDWRWDVSKET Sbjct: 121 SSPITVTVECKGGYYFINSSGDKSDIIYKVDGLSIIARNFFTLLSGNFKPDWRWDVSKET 180 Query: 181 FTKEKFDSYVKSVFSKIDFYKQCGVINPQNANTAYFGDTDGRVGAVLYALLVSGHIGIRE 240 FTKEKFDSYVKSVFSKIDFYKQCGVINPQNANTAYFGDTDGRVGAVLYALLVSGHIGIRE Sbjct: 181 FTKEKFDSYVKSVFSKIDFYKQCGVINPQNANTAYFGDTDGRVGAVLYALLVSGHIGIRE 240 Query: 241 KGWSLLCELLKHEEMASSAYKHKNNKVLYDLLNTRDMILNELHQHVFLKDDAITPCIFLG 300 KGWSLLCELLKHEEMASSAYKHKNNKVLYDLLNTRDMILNELHQHVFLKDDAITPCIFLG Sbjct: 241 KGWSLLCELLKHEEMASSAYKHKNNKVLYDLLNTRDMILNELHQHVFLKDDAITPCIFLG 300 Query: 301 DHTGDRFSTIFGDKYILTLLNSMRNMEGNKDSRINKNVVVLAGNHEINFNGNYTARLANH 360 DHTGDRFSTIFGDKYILTLLNSMRNMEGNKDSRINKNVVVLAGNHEINFNGNYTARLANH Sbjct: 301 DHTGDRFSTIFGDKYILTLLNSMRNMEGNKDSRINKNVVVLAGNHEINFNGNYTARLANH 360 Query: 361 KLSAGDTYNLIKTLDVCNYDSERQVLTSHHGIIRDEEKKCYCLGALQVPFNQMKNPTDPE 420 KLSAGDTYNLIKTLDVCNYDSERQVLTSHHGIIRDEEKKCYCLGALQVPFNQMKNPTDPE Sbjct: 361 KLSAGDTYNLIKTLDVCNYDSERQVLTSHHGIIRDEEKKCYCLGALQVPFNQMKNPTDPE 420 Query: 421 ELANIFNKKHKEHMDDPLFHLIRSNTLKPTPVYANYFDNTTDFRPARERIFICGETLKGE 480 ELANIFNKKHKEHMDDPLFHLIRSNTLKPTPVYANYFDNTTDFRPARERIFICGETLKGE Sbjct: 421 ELANIFNKKHKEHMDDPLFHLIRSNTLKPTPVYANYFDNTTDFRPARERIFICGETLKGE 480 Query: 481 DPSKYIRQKYGHHGPGVDHNQQFDNGIMGLNSLKEARDKNNKIIYSSGLSCFQLH 535 DPSKYIRQKYGHHGPGVDHNQQFDNGIMGLNSLKEARDKNNKIIYSSGLSCFQLH Sbjct: 481 DPSKYIRQKYGHHGPGVDHNQQFDNGIMGLNSLKEARDKNNKIIYSSGLSCFQLH 535 >UniRef50_P37627 Uncharacterized protein yhiJ n=45 Tax=Bacteria RepID=YHIJ_ECOLI Length = 540 Score = 237 bits (605), Expect = 8e-61, Method: Compositional matrix adjust. Identities = 136/396 (34%), Positives = 210/396 (53%), Gaps = 27/396 (6%) Query: 126 VTVECKG-GYYFINSSGDKSDIIYKVDGLSIIARNFFTLLSGNFKPDWRWDVSKETFTKE 184 VT+ K G N+ + I+Y ++ +++ ++ ++KP W W +S+ + Sbjct: 110 VTINSKNYGCSLDNTDINWCSIVYLLNNMTVNDNANDVAVTESYKPIWNWKISQYNVSDI 169 Query: 185 KFDSYVKSVFSKIDFYKQCGVINPQNANTAYFGDTDGRVGAVLYALLVSGHIGIREKGWS 244 KF++ +K F+ ++ C ++P + YFGDTDG VGAVL+AL +GH+GI +G + Sbjct: 170 KFETMIKPQFADRIYFSNCLPVDPTSTRPTYFGDTDGSVGAVLFALFATGHLGIMAEGEN 229 Query: 245 LLCELLKHEEMASSAYKHKN--------NKVLYDLLNTRDMILNELHQHVFLKDDAITPC 296 L +LL E+ + +N + +LN RD+IL L ++ + DA+TPC Sbjct: 230 FLSQLLNIEDEVLNVLLRENFNEQLNTNVNTIISILNRRDIILESLQPYLVINKDAVTPC 289 Query: 297 IFLGDHTGDRFSTIFGDKYILTLLNSMRNMEGNKDSRINKNVVVLAGNHEINFNGNYTAR 356 FLGD TGDRFS I GD++I+ LL + + IN+NV VLAGNHE N NGNY Sbjct: 290 TFLGDQTGDRFSNICGDQFIIDLLKRIMS--------INENVHVLAGNHETNCNGNYMQN 341 Query: 357 LANHKLSAGDTYNLIKTLDVCNYDSERQVLTSHHGIIRDEEKKCYCLGALQVPFNQMKNP 416 K DTY+ IK VC YD + +++ +HHGI D+++K Y +G + V ++M N Sbjct: 342 FTRMKPLDEDTYSGIKDYPVCFYDPKYKIMANHHGITFDDQRKRYIIGPITVSIDEMTNA 401 Query: 417 TDPEELANIFNKKHKEHMDDPLFHLIRSNTLKPTPVYANYFDNTTDFRPARERIFICGET 476 DP ELA I NKKH ++ F R+ + + + YF +TD+RP E + C + Sbjct: 402 LDPVELAAIINKKHHAIINGKKFKTSRAISCRS---FNRYFSVSTDYRPKLEALLACSQM 458 Query: 477 LKGEDPSKYIRQKYGHHGPGVDHNQQFDNGIMGLNS 512 L I Q H+G G ++GLN+ Sbjct: 459 LG-------INQVVAHNGNGGRERIGETGTVLGLNA 487 Searching..................................................done Results from round 2 Score E Sequences producing significant alignments: (bits) Value Sequences used in model and found again: UniRef50_P37629 Putative uncharacterized protein yhiL n=40 Tax=E... 942 0.0 UniRef50_P37627 Uncharacterized protein yhiJ n=45 Tax=Bacteria R... 595 e-168 Sequences not found previously or not previously below threshold: CONVERGED! >UniRef50_P37629 Putative uncharacterized protein yhiL n=40 Tax=Enterobacteriaceae RepID=YHIL_ECOLI Length = 535 Score = 942 bits (2434), Expect = 0.0, Method: Composition-based stats. Identities = 535/535 (100%), Positives = 535/535 (100%) Query: 1 MKAIDNQIRNISSSHQDKHSDKVNSHQHHGKVDKTHRAKIVEFDKLDNDSQIDNDFGLHI 60 MKAIDNQIRNISSSHQDKHSDKVNSHQHHGKVDKTHRAKIVEFDKLDNDSQIDNDFGLHI Sbjct: 1 MKAIDNQIRNISSSHQDKHSDKVNSHQHHGKVDKTHRAKIVEFDKLDNDSQIDNDFGLHI 60 Query: 61 IYFLQHGHWKVNDRSHQMEKVWFYNSEPSIDIQEYNRFADNTTDTFIFTIIPDNNHVLKL 120 IYFLQHGHWKVNDRSHQMEKVWFYNSEPSIDIQEYNRFADNTTDTFIFTIIPDNNHVLKL Sbjct: 61 IYFLQHGHWKVNDRSHQMEKVWFYNSEPSIDIQEYNRFADNTTDTFIFTIIPDNNHVLKL 120 Query: 121 SSPITVTVECKGGYYFINSSGDKSDIIYKVDGLSIIARNFFTLLSGNFKPDWRWDVSKET 180 SSPITVTVECKGGYYFINSSGDKSDIIYKVDGLSIIARNFFTLLSGNFKPDWRWDVSKET Sbjct: 121 SSPITVTVECKGGYYFINSSGDKSDIIYKVDGLSIIARNFFTLLSGNFKPDWRWDVSKET 180 Query: 181 FTKEKFDSYVKSVFSKIDFYKQCGVINPQNANTAYFGDTDGRVGAVLYALLVSGHIGIRE 240 FTKEKFDSYVKSVFSKIDFYKQCGVINPQNANTAYFGDTDGRVGAVLYALLVSGHIGIRE Sbjct: 181 FTKEKFDSYVKSVFSKIDFYKQCGVINPQNANTAYFGDTDGRVGAVLYALLVSGHIGIRE 240 Query: 241 KGWSLLCELLKHEEMASSAYKHKNNKVLYDLLNTRDMILNELHQHVFLKDDAITPCIFLG 300 KGWSLLCELLKHEEMASSAYKHKNNKVLYDLLNTRDMILNELHQHVFLKDDAITPCIFLG Sbjct: 241 KGWSLLCELLKHEEMASSAYKHKNNKVLYDLLNTRDMILNELHQHVFLKDDAITPCIFLG 300 Query: 301 DHTGDRFSTIFGDKYILTLLNSMRNMEGNKDSRINKNVVVLAGNHEINFNGNYTARLANH 360 DHTGDRFSTIFGDKYILTLLNSMRNMEGNKDSRINKNVVVLAGNHEINFNGNYTARLANH Sbjct: 301 DHTGDRFSTIFGDKYILTLLNSMRNMEGNKDSRINKNVVVLAGNHEINFNGNYTARLANH 360 Query: 361 KLSAGDTYNLIKTLDVCNYDSERQVLTSHHGIIRDEEKKCYCLGALQVPFNQMKNPTDPE 420 KLSAGDTYNLIKTLDVCNYDSERQVLTSHHGIIRDEEKKCYCLGALQVPFNQMKNPTDPE Sbjct: 361 KLSAGDTYNLIKTLDVCNYDSERQVLTSHHGIIRDEEKKCYCLGALQVPFNQMKNPTDPE 420 Query: 421 ELANIFNKKHKEHMDDPLFHLIRSNTLKPTPVYANYFDNTTDFRPARERIFICGETLKGE 480 ELANIFNKKHKEHMDDPLFHLIRSNTLKPTPVYANYFDNTTDFRPARERIFICGETLKGE Sbjct: 421 ELANIFNKKHKEHMDDPLFHLIRSNTLKPTPVYANYFDNTTDFRPARERIFICGETLKGE 480 Query: 481 DPSKYIRQKYGHHGPGVDHNQQFDNGIMGLNSLKEARDKNNKIIYSSGLSCFQLH 535 DPSKYIRQKYGHHGPGVDHNQQFDNGIMGLNSLKEARDKNNKIIYSSGLSCFQLH Sbjct: 481 DPSKYIRQKYGHHGPGVDHNQQFDNGIMGLNSLKEARDKNNKIIYSSGLSCFQLH 535 >UniRef50_P37627 Uncharacterized protein yhiJ n=45 Tax=Bacteria RepID=YHIJ_ECOLI Length = 540 Score = 595 bits (1533), Expect = e-168, Method: Composition-based stats. Identities = 148/446 (33%), Positives = 230/446 (51%), Gaps = 41/446 (9%) Query: 90 IDIQEYNRFADNTTDTFI-FTIIPDNN-----------HVLKLSS--PITVTVECKG-GY 134 ID + RF + D I FTI D++ H+L+ ++ VT+ K G Sbjct: 60 IDDWQIERFQQSIQDDKISFTIQTDHSEKYSMLSGMRAHILRRNNNYQFIVTINSKNYGC 119 Query: 135 YFINSSGDKSDIIYKVDGLSIIARNFFTLLSGNFKPDWRWDVSKETFTKEKFDSYVKSVF 194 N+ + I+Y ++ +++ ++ ++KP W W +S+ + KF++ +K F Sbjct: 120 SLDNTDINWCSIVYLLNNMTVNDNANDVAVTESYKPIWNWKISQYNVSDIKFETMIKPQF 179 Query: 195 SKIDFYKQCGVINPQNANTAYFGDTDGRVGAVLYALLVSGHIGIREKGWSLLCELLKHEE 254 + ++ C ++P + YFGDTDG VGAVL+AL +GH+GI +G + L +LL E+ Sbjct: 180 ADRIYFSNCLPVDPTSTRPTYFGDTDGSVGAVLFALFATGHLGIMAEGENFLSQLLNIED 239 Query: 255 MASSAYKHK--------NNKVLYDLLNTRDMILNELHQHVFLKDDAITPCIFLGDHTGDR 306 + + N + +LN RD+IL L ++ + DA+TPC FLGD TGDR Sbjct: 240 EVLNVLLRENFNEQLNTNVNTIISILNRRDIILESLQPYLVINKDAVTPCTFLGDQTGDR 299 Query: 307 FSTIFGDKYILTLLNSMRNMEGNKDSRINKNVVVLAGNHEINFNGNYTARLANHKLSAGD 366 FS I GD++I+ LL + + IN+NV VLAGNHE N NGNY K D Sbjct: 300 FSNICGDQFIIDLLKRIMS--------INENVHVLAGNHETNCNGNYMQNFTRMKPLDED 351 Query: 367 TYNLIKTLDVCNYDSERQVLTSHHGIIRDEEKKCYCLGALQVPFNQMKNPTDPEELANIF 426 TY+ IK VC YD + +++ +HHGI D+++K Y +G + V ++M N DP ELA I Sbjct: 352 TYSGIKDYPVCFYDPKYKIMANHHGITFDDQRKRYIIGPITVSIDEMTNALDPVELAAII 411 Query: 427 NKKHKEHMDDPLFHLIRSNTLKPTPVYANYFDNTTDFRPARERIFICGETLKGEDPSKYI 486 NKKH ++ F R+ + + + YF +TD+RP E + C + L I Sbjct: 412 NKKHHAIINGKKFKTSRAISCRS---FNRYFSVSTDYRPKLEALLACSQMLG-------I 461 Query: 487 RQKYGHHGPGVDHNQQFDNGIMGLNS 512 Q H+G G ++GLN+ Sbjct: 462 NQVVAHNGNGGRERIGETGTVLGLNA 487 Database: uniref50.fasta Posted date: Mar 8, 2010 10:38 AM Number of letters in database: 1,040,396,356 Number of sequences in database: 3,077,464 Lambda K H 0.319 0.135 0.387 Lambda K H 0.267 0.0414 0.140 Matrix: BLOSUM62 Gap Penalties: Existence: 11, Extension: 1 Number of Hits to DB: 2,253,380,540 Number of Sequences: 3077464 Number of extensions: 99246956 Number of successful extensions: 250781 Number of sequences better than 1.0e-01: 2 Number of HSP's better than 0.1 without gapping: 4 Number of HSP's successfully gapped in prelim test: 0 Number of HSP's that attempted gapping in prelim test: 250770 Number of HSP's gapped (non-prelim): 4 length of query: 535 length of database: 1,040,396,356 effective HSP length: 134 effective length of query: 401 effective length of database: 628,016,180 effective search space: 251834488180 effective search space used: 251834488180 T: 11 A: 40 X1: 16 ( 7.4 bits) X2: 38 (14.6 bits) X3: 64 (24.7 bits) S1: 41 (21.7 bits) S2: 96 (41.6 bits)