BLASTP 2.2.22 [Sep-27-2009] Reference: Altschul, Stephen F., Thomas L. Madden, Alejandro A. Schaffer, Jinghui Zhang, Zheng Zhang, Webb Miller, and David J. Lipman (1997), "Gapped BLAST and PSI-BLAST: a new generation of protein database search programs", Nucleic Acids Res. 25:3389-3402. Reference for compositional score matrix adjustment: Altschul, Stephen F., John C. Wootton, E. Michael Gertz, Richa Agarwala, Aleksandr Morgulis, Alejandro A. Schaffer, and Yi-Kuo Yu (2005) "Protein database searches using compositionally adjusted substitution matrices", FEBS J. 272:5101-5109. Reference for composition-based statistics starting in round 2: Schaffer, Alejandro A., L. Aravind, Thomas L. Madden, Sergei Shavirin, John L. Spouge, Yuri I. Wolf, Eugene V. Koonin, and Stephen F. Altschul (2001), "Improving the accuracy of PSI-BLAST protein database searches with composition-based statistics and other refinements", Nucleic Acids Res. 29:2994-3005. Query= batch____ (328 letters) Database: uniref50.fasta 3,077,464 sequences; 1,040,396,356 total letters Searching..................................................done Results from round 1 Score E Sequences producing significant alignments: (bits) Value UniRef50_P75910 Uncharacterized protein ycdU n=32 Tax=Enterobact... 657 0.0 UniRef50_C8Q4Z7 Putative uncharacterized protein n=1 Tax=Pantoea... 221 3e-56 >UniRef50_P75910 Uncharacterized protein ycdU n=32 Tax=Enterobacteriaceae RepID=YCDU_ECOLI Length = 328 Score = 657 bits (1694), Expect = 0.0, Method: Compositional matrix adjust. Identities = 328/328 (100%), Positives = 328/328 (100%) Query: 1 MIPDYLTFIRFQDKRNLIYIYAIGLILIGFYWKNAGFTFPSEDIGVVSGILALVLYNFIF 60 MIPDYLTFIRFQDKRNLIYIYAIGLILIGFYWKNAGFTFPSEDIGVVSGILALVLYNFIF Sbjct: 1 MIPDYLTFIRFQDKRNLIYIYAIGLILIGFYWKNAGFTFPSEDIGVVSGILALVLYNFIF 60 Query: 61 DLKAYWAYKCVTKNIDFSWFKKKQNHKIELFLTQPLVAGFLSLIMLSAMSWGLYQLLPSL 120 DLKAYWAYKCVTKNIDFSWFKKKQNHKIELFLTQPLVAGFLSLIMLSAMSWGLYQLLPSL Sbjct: 61 DLKAYWAYKCVTKNIDFSWFKKKQNHKIELFLTQPLVAGFLSLIMLSAMSWGLYQLLPSL 120 Query: 121 YALFLISLLGPLVIFLLFRMIRTSYVKQVAISVAKKVKYKSLTRYVLLSVCISTVVNLLT 180 YALFLISLLGPLVIFLLFRMIRTSYVKQVAISVAKKVKYKSLTRYVLLSVCISTVVNLLT Sbjct: 121 YALFLISLLGPLVIFLLFRMIRTSYVKQVAISVAKKVKYKSLTRYVLLSVCISTVVNLLT 180 Query: 181 ISPLRNSDSFVTEGQWLTFKSIIALLILCGVVLAINLFFLRFSKRYAFLGRLFLQEIDLF 240 ISPLRNSDSFVTEGQWLTFKSIIALLILCGVVLAINLFFLRFSKRYAFLGRLFLQEIDLF Sbjct: 181 ISPLRNSDSFVTEGQWLTFKSIIALLILCGVVLAINLFFLRFSKRYAFLGRLFLQEIDLF 240 Query: 241 FSSENALSTFFAKPLWLRLFILLVIEVMWITLVSVLATLVEWRIWFEAYFLLCYVPCLIY 300 FSSENALSTFFAKPLWLRLFILLVIEVMWITLVSVLATLVEWRIWFEAYFLLCYVPCLIY Sbjct: 241 FSSENALSTFFAKPLWLRLFILLVIEVMWITLVSVLATLVEWRIWFEAYFLLCYVPCLIY 300 Query: 301 YFFYCRFLWHNDFMMACDMYFRWGHFNK 328 YFFYCRFLWHNDFMMACDMYFRWGHFNK Sbjct: 301 YFFYCRFLWHNDFMMACDMYFRWGHFNK 328 >UniRef50_C8Q4Z7 Putative uncharacterized protein n=1 Tax=Pantoea sp. At-9b RepID=C8Q4Z7_9ENTR Length = 278 Score = 221 bits (563), Expect = 3e-56, Method: Compositional matrix adjust. Identities = 117/279 (41%), Positives = 179/279 (64%), Gaps = 8/279 (2%) Query: 54 VLYNFIFDLKAYWAYKCVTKNIDFSWFKKKQNHKIELFLTQPLVAGFLSLIMLSAMSWGL 113 +L++ I+DLKA WAY CV +D F ++ H + +++P++ LSL A+ W + Sbjct: 1 MLFHIIYDLKASWAYSCVLGRVDVDRFNGQRCHPVLNVVSRPILLSALSL----AVFWTV 56 Query: 114 YQ--LLPS--LYALFLISLLGPLVIFLLFRMIRTSYVKQVAISVAKKVKYKSLTRYVLLS 169 LL S Y + +SL+ P I+L+FR +R Y++QV +VA VKY+ + +YV+ Sbjct: 57 ANVCLLSSAKFYPMLYLSLISPFTIYLVFRWVRPLYIRQVLTAVAPGVKYRHMYQYVMCY 116 Query: 170 VCISTVVNLLTISPLRNSDSFVTEGQWLTFKSIIALLILCGVVLAINLFFLRFSKRYAFL 229 +C TV+NL+++SPL S F + + + ++A+L+LC +VLAINLFF R KRY FL Sbjct: 117 LCSVTVLNLISVSPLAASADFSIREGFFSARLMVAMLMLCTIVLAINLFFTRLPKRYVFL 176 Query: 230 GRLFLQEIDLFFSSENALSTFFAKPLWLRLFILLVIEVMWITLVSVLATLVEWRIWFEAY 289 GR+ L+EID +FS+ L+TF P +RL I+ + ++WI L+S L+ W+ FE Y Sbjct: 177 GRILLKEIDFYFSTSIPLNTFSMMPFIVRLLIVEWVLLLWIILISSALNLLHWQPGFELY 236 Query: 290 FLLCYVPCLIYYFFYCRFLWHNDFMMACDMYFRWGHFNK 328 F LC++PC+ YYF + + WH++FMMACDMYFR+ FNK Sbjct: 237 FFLCFLPCIGYYFLHTWWKWHHEFMMACDMYFRYDVFNK 275 Searching..................................................done Results from round 2 Score E Sequences producing significant alignments: (bits) Value Sequences used in model and found again: UniRef50_P75910 Uncharacterized protein ycdU n=32 Tax=Enterobact... 430 e-119 UniRef50_C8Q4Z7 Putative uncharacterized protein n=1 Tax=Pantoea... 345 2e-93 Sequences not found previously or not previously below threshold: CONVERGED! >UniRef50_P75910 Uncharacterized protein ycdU n=32 Tax=Enterobacteriaceae RepID=YCDU_ECOLI Length = 328 Score = 430 bits (1106), Expect = e-119, Method: Composition-based stats. Identities = 328/328 (100%), Positives = 328/328 (100%) Query: 1 MIPDYLTFIRFQDKRNLIYIYAIGLILIGFYWKNAGFTFPSEDIGVVSGILALVLYNFIF 60 MIPDYLTFIRFQDKRNLIYIYAIGLILIGFYWKNAGFTFPSEDIGVVSGILALVLYNFIF Sbjct: 1 MIPDYLTFIRFQDKRNLIYIYAIGLILIGFYWKNAGFTFPSEDIGVVSGILALVLYNFIF 60 Query: 61 DLKAYWAYKCVTKNIDFSWFKKKQNHKIELFLTQPLVAGFLSLIMLSAMSWGLYQLLPSL 120 DLKAYWAYKCVTKNIDFSWFKKKQNHKIELFLTQPLVAGFLSLIMLSAMSWGLYQLLPSL Sbjct: 61 DLKAYWAYKCVTKNIDFSWFKKKQNHKIELFLTQPLVAGFLSLIMLSAMSWGLYQLLPSL 120 Query: 121 YALFLISLLGPLVIFLLFRMIRTSYVKQVAISVAKKVKYKSLTRYVLLSVCISTVVNLLT 180 YALFLISLLGPLVIFLLFRMIRTSYVKQVAISVAKKVKYKSLTRYVLLSVCISTVVNLLT Sbjct: 121 YALFLISLLGPLVIFLLFRMIRTSYVKQVAISVAKKVKYKSLTRYVLLSVCISTVVNLLT 180 Query: 181 ISPLRNSDSFVTEGQWLTFKSIIALLILCGVVLAINLFFLRFSKRYAFLGRLFLQEIDLF 240 ISPLRNSDSFVTEGQWLTFKSIIALLILCGVVLAINLFFLRFSKRYAFLGRLFLQEIDLF Sbjct: 181 ISPLRNSDSFVTEGQWLTFKSIIALLILCGVVLAINLFFLRFSKRYAFLGRLFLQEIDLF 240 Query: 241 FSSENALSTFFAKPLWLRLFILLVIEVMWITLVSVLATLVEWRIWFEAYFLLCYVPCLIY 300 FSSENALSTFFAKPLWLRLFILLVIEVMWITLVSVLATLVEWRIWFEAYFLLCYVPCLIY Sbjct: 241 FSSENALSTFFAKPLWLRLFILLVIEVMWITLVSVLATLVEWRIWFEAYFLLCYVPCLIY 300 Query: 301 YFFYCRFLWHNDFMMACDMYFRWGHFNK 328 YFFYCRFLWHNDFMMACDMYFRWGHFNK Sbjct: 301 YFFYCRFLWHNDFMMACDMYFRWGHFNK 328 >UniRef50_C8Q4Z7 Putative uncharacterized protein n=1 Tax=Pantoea sp. At-9b RepID=C8Q4Z7_9ENTR Length = 278 Score = 345 bits (885), Expect = 2e-93, Method: Composition-based stats. Identities = 117/279 (41%), Positives = 179/279 (64%), Gaps = 8/279 (2%) Query: 54 VLYNFIFDLKAYWAYKCVTKNIDFSWFKKKQNHKIELFLTQPLVAGFLSLIMLSAMSWGL 113 +L++ I+DLKA WAY CV +D F ++ H + +++P++ LSL A+ W + Sbjct: 1 MLFHIIYDLKASWAYSCVLGRVDVDRFNGQRCHPVLNVVSRPILLSALSL----AVFWTV 56 Query: 114 YQ--LLPS--LYALFLISLLGPLVIFLLFRMIRTSYVKQVAISVAKKVKYKSLTRYVLLS 169 LL S Y + +SL+ P I+L+FR +R Y++QV +VA VKY+ + +YV+ Sbjct: 57 ANVCLLSSAKFYPMLYLSLISPFTIYLVFRWVRPLYIRQVLTAVAPGVKYRHMYQYVMCY 116 Query: 170 VCISTVVNLLTISPLRNSDSFVTEGQWLTFKSIIALLILCGVVLAINLFFLRFSKRYAFL 229 +C TV+NL+++SPL S F + + + ++A+L+LC +VLAINLFF R KRY FL Sbjct: 117 LCSVTVLNLISVSPLAASADFSIREGFFSARLMVAMLMLCTIVLAINLFFTRLPKRYVFL 176 Query: 230 GRLFLQEIDLFFSSENALSTFFAKPLWLRLFILLVIEVMWITLVSVLATLVEWRIWFEAY 289 GR+ L+EID +FS+ L+TF P +RL I+ + ++WI L+S L+ W+ FE Y Sbjct: 177 GRILLKEIDFYFSTSIPLNTFSMMPFIVRLLIVEWVLLLWIILISSALNLLHWQPGFELY 236 Query: 290 FLLCYVPCLIYYFFYCRFLWHNDFMMACDMYFRWGHFNK 328 F LC++PC+ YYF + + WH++FMMACDMYFR+ FNK Sbjct: 237 FFLCFLPCIGYYFLHTWWKWHHEFMMACDMYFRYDVFNK 275 Database: uniref50.fasta Posted date: Mar 8, 2010 10:38 AM Number of letters in database: 1,040,396,356 Number of sequences in database: 3,077,464 Lambda K H 0.332 0.148 0.477 Lambda K H 0.267 0.0452 0.140 Matrix: BLOSUM62 Gap Penalties: Existence: 11, Extension: 1 Number of Hits to DB: 1,476,923,725 Number of Sequences: 3077464 Number of extensions: 65831645 Number of successful extensions: 343514 Number of sequences better than 1.0e-01: 44 Number of HSP's better than 0.1 without gapping: 6 Number of HSP's successfully gapped in prelim test: 51 Number of HSP's that attempted gapping in prelim test: 343441 Number of HSP's gapped (non-prelim): 102 length of query: 328 length of database: 1,040,396,356 effective HSP length: 129 effective length of query: 199 effective length of database: 643,403,500 effective search space: 128037296500 effective search space used: 128037296500 T: 11 A: 40 X1: 16 ( 7.7 bits) X2: 38 (14.6 bits) X3: 64 (24.7 bits) S1: 39 (21.4 bits) S2: 93 (40.3 bits)