BLASTP 2.2.22 [Sep-27-2009] Reference: Altschul, Stephen F., Thomas L. Madden, Alejandro A. Schaffer, Jinghui Zhang, Zheng Zhang, Webb Miller, and David J. Lipman (1997), "Gapped BLAST and PSI-BLAST: a new generation of protein database search programs", Nucleic Acids Res. 25:3389-3402. Reference for compositional score matrix adjustment: Altschul, Stephen F., John C. Wootton, E. Michael Gertz, Richa Agarwala, Aleksandr Morgulis, Alejandro A. Schaffer, and Yi-Kuo Yu (2005) "Protein database searches using compositionally adjusted substitution matrices", FEBS J. 272:5101-5109. Reference for composition-based statistics starting in round 2: Schaffer, Alejandro A., L. Aravind, Thomas L. Madden, Sergei Shavirin, John L. Spouge, Yuri I. Wolf, Eugene V. Koonin, and Stephen F. Altschul (2001), "Improving the accuracy of PSI-BLAST protein database searches with composition-based statistics and other refinements", Nucleic Acids Res. 29:2994-3005. Query= batch____ (418 letters) Database: uniref50.fasta 3,077,464 sequences; 1,040,396,356 total letters Searching..................................................done Results from round 1 Score E Sequences producing significant alignments: (bits) Value UniRef50_P36661 Uncharacterized protein yccE n=37 Tax=Enterobact... 860 0.0 UniRef50_B1ERU3 Putative uncharacterized protein n=1 Tax=Escheri... 48 7e-04 >UniRef50_P36661 Uncharacterized protein yccE n=37 Tax=Enterobacteriaceae RepID=YCCE_ECOLI Length = 418 Score = 860 bits (2223), Expect = 0.0, Method: Compositional matrix adjust. Identities = 418/418 (100%), Positives = 418/418 (100%) Query: 1 MGSNIHGISCTANNYLKQAWNDIKNEYEKNQTYSITLFENTLVCFMRLYNELRRKVNEED 60 MGSNIHGISCTANNYLKQAWNDIKNEYEKNQTYSITLFENTLVCFMRLYNELRRKVNEED Sbjct: 1 MGSNIHGISCTANNYLKQAWNDIKNEYEKNQTYSITLFENTLVCFMRLYNELRRKVNEED 60 Query: 61 TPCLECESLEKEFEEMQNDNDLSLFMRILRTNDTQIYSGVSGGITYTIQYVRDIDIVRVS 120 TPCLECESLEKEFEEMQNDNDLSLFMRILRTNDTQIYSGVSGGITYTIQYVRDIDIVRVS Sbjct: 61 TPCLECESLEKEFEEMQNDNDLSLFMRILRTNDTQIYSGVSGGITYTIQYVRDIDIVRVS 120 Query: 121 LPGRASESITDFKGYYWYNFMEYIENINACDDVFSEYCFDDENISVQPERINTPGISDLD 180 LPGRASESITDFKGYYWYNFMEYIENINACDDVFSEYCFDDENISVQPERINTPGISDLD Sbjct: 121 LPGRASESITDFKGYYWYNFMEYIENINACDDVFSEYCFDDENISVQPERINTPGISDLD 180 Query: 181 SDIDLSGISFIQRETNQALGLKYAPVDGDGYCLLRAILVLKQHDYSWALVSYKMQKEVYN 240 SDIDLSGISFIQRETNQALGLKYAPVDGDGYCLLRAILVLKQHDYSWALVSYKMQKEVYN Sbjct: 181 SDIDLSGISFIQRETNQALGLKYAPVDGDGYCLLRAILVLKQHDYSWALVSYKMQKEVYN 240 Query: 241 EFIKMVDKKTIEALVDTAFYNLREDVKTLFGVDLQSDNQIQGQSSLMSWSFLFFKKQFID 300 EFIKMVDKKTIEALVDTAFYNLREDVKTLFGVDLQSDNQIQGQSSLMSWSFLFFKKQFID Sbjct: 241 EFIKMVDKKTIEALVDTAFYNLREDVKTLFGVDLQSDNQIQGQSSLMSWSFLFFKKQFID 300 Query: 301 SCLNNEKCILHLPEFIFNDNKNLLALDTDTSDRIKAVKNFLVVLSDSICSLFIVNSNVAS 360 SCLNNEKCILHLPEFIFNDNKNLLALDTDTSDRIKAVKNFLVVLSDSICSLFIVNSNVAS Sbjct: 301 SCLNNEKCILHLPEFIFNDNKNLLALDTDTSDRIKAVKNFLVVLSDSICSLFIVNSNVAS 360 Query: 361 ISLGNESFSTDEDLEYGYLMNTGNHYDVYLPPELFAQAYKLNNKEMNAQLDYLNRYAI 418 ISLGNESFSTDEDLEYGYLMNTGNHYDVYLPPELFAQAYKLNNKEMNAQLDYLNRYAI Sbjct: 361 ISLGNESFSTDEDLEYGYLMNTGNHYDVYLPPELFAQAYKLNNKEMNAQLDYLNRYAI 418 >UniRef50_B1ERU3 Putative uncharacterized protein n=1 Tax=Escherichia albertii TW07627 RepID=B1ERU3_9ESCH Length = 175 Score = 47.8 bits (112), Expect = 7e-04, Method: Compositional matrix adjust. Identities = 44/193 (22%), Positives = 86/193 (44%), Gaps = 37/193 (19%) Query: 213 LLRAILVLKQHDYSWALVSYKMQKEVYNEFIKMVDKKTIEALVDTAFYNLREDVKTLFGV 272 + RAIL + D SWA + K EVY F D+ L + Sbjct: 1 MFRAILAILDKDSSWADKNVKTSSEVYGSF--------------------ASDIANLNQI 40 Query: 273 DLQSDNQIQGQSSLMSWSFLF--FKKQFIDSCLNNEKCILHLPEFIFND-NKNLLALDTD 329 + +++G +S L F++ + C + +C+++ P IFN + + +++D + Sbjct: 41 AVNVCEELEGMEFFLSDYPLLEDFEENVVKKCFRDNECVIYSPTGIFNACSVSEMSIDDN 100 Query: 330 TSDRIKAVKNFLVVLSDSICSLFIVNSNVASISLGNES--FSTDED----LEYGYLMNT- 382 SD +L + + S + + + + + ++ F D+D ++ GY++N Sbjct: 101 YSD-------YLSIWASSFAAKLLEHYGIVIKKIADDGTEFWEDQDKLNHIDNGYVLNVN 153 Query: 383 GNHYDVYLPPELF 395 GNHY+V LP ++F Sbjct: 154 GNHYNVRLPLDIF 166 Searching..................................................done Results from round 2 Score E Sequences producing significant alignments: (bits) Value Sequences used in model and found again: UniRef50_P36661 Uncharacterized protein yccE n=37 Tax=Enterobact... 764 0.0 UniRef50_B1ERU3 Putative uncharacterized protein n=1 Tax=Escheri... 212 2e-53 Sequences not found previously or not previously below threshold: UniRef50_UPI0001850E26 hypothetical protein Bcoam_11275 n=1 Tax=... 41 0.098 CONVERGED! >UniRef50_P36661 Uncharacterized protein yccE n=37 Tax=Enterobacteriaceae RepID=YCCE_ECOLI Length = 418 Score = 764 bits (1973), Expect = 0.0, Method: Composition-based stats. Identities = 418/418 (100%), Positives = 418/418 (100%) Query: 1 MGSNIHGISCTANNYLKQAWNDIKNEYEKNQTYSITLFENTLVCFMRLYNELRRKVNEED 60 MGSNIHGISCTANNYLKQAWNDIKNEYEKNQTYSITLFENTLVCFMRLYNELRRKVNEED Sbjct: 1 MGSNIHGISCTANNYLKQAWNDIKNEYEKNQTYSITLFENTLVCFMRLYNELRRKVNEED 60 Query: 61 TPCLECESLEKEFEEMQNDNDLSLFMRILRTNDTQIYSGVSGGITYTIQYVRDIDIVRVS 120 TPCLECESLEKEFEEMQNDNDLSLFMRILRTNDTQIYSGVSGGITYTIQYVRDIDIVRVS Sbjct: 61 TPCLECESLEKEFEEMQNDNDLSLFMRILRTNDTQIYSGVSGGITYTIQYVRDIDIVRVS 120 Query: 121 LPGRASESITDFKGYYWYNFMEYIENINACDDVFSEYCFDDENISVQPERINTPGISDLD 180 LPGRASESITDFKGYYWYNFMEYIENINACDDVFSEYCFDDENISVQPERINTPGISDLD Sbjct: 121 LPGRASESITDFKGYYWYNFMEYIENINACDDVFSEYCFDDENISVQPERINTPGISDLD 180 Query: 181 SDIDLSGISFIQRETNQALGLKYAPVDGDGYCLLRAILVLKQHDYSWALVSYKMQKEVYN 240 SDIDLSGISFIQRETNQALGLKYAPVDGDGYCLLRAILVLKQHDYSWALVSYKMQKEVYN Sbjct: 181 SDIDLSGISFIQRETNQALGLKYAPVDGDGYCLLRAILVLKQHDYSWALVSYKMQKEVYN 240 Query: 241 EFIKMVDKKTIEALVDTAFYNLREDVKTLFGVDLQSDNQIQGQSSLMSWSFLFFKKQFID 300 EFIKMVDKKTIEALVDTAFYNLREDVKTLFGVDLQSDNQIQGQSSLMSWSFLFFKKQFID Sbjct: 241 EFIKMVDKKTIEALVDTAFYNLREDVKTLFGVDLQSDNQIQGQSSLMSWSFLFFKKQFID 300 Query: 301 SCLNNEKCILHLPEFIFNDNKNLLALDTDTSDRIKAVKNFLVVLSDSICSLFIVNSNVAS 360 SCLNNEKCILHLPEFIFNDNKNLLALDTDTSDRIKAVKNFLVVLSDSICSLFIVNSNVAS Sbjct: 301 SCLNNEKCILHLPEFIFNDNKNLLALDTDTSDRIKAVKNFLVVLSDSICSLFIVNSNVAS 360 Query: 361 ISLGNESFSTDEDLEYGYLMNTGNHYDVYLPPELFAQAYKLNNKEMNAQLDYLNRYAI 418 ISLGNESFSTDEDLEYGYLMNTGNHYDVYLPPELFAQAYKLNNKEMNAQLDYLNRYAI Sbjct: 361 ISLGNESFSTDEDLEYGYLMNTGNHYDVYLPPELFAQAYKLNNKEMNAQLDYLNRYAI 418 >UniRef50_B1ERU3 Putative uncharacterized protein n=1 Tax=Escherichia albertii TW07627 RepID=B1ERU3_9ESCH Length = 175 Score = 212 bits (540), Expect = 2e-53, Method: Composition-based stats. Identities = 44/193 (22%), Positives = 86/193 (44%), Gaps = 37/193 (19%) Query: 213 LLRAILVLKQHDYSWALVSYKMQKEVYNEFIKMVDKKTIEALVDTAFYNLREDVKTLFGV 272 + RAIL + D SWA + K EVY F D+ L + Sbjct: 1 MFRAILAILDKDSSWADKNVKTSSEVYGSF--------------------ASDIANLNQI 40 Query: 273 DLQSDNQIQGQSSLMSWSFLF--FKKQFIDSCLNNEKCILHLPEFIFND-NKNLLALDTD 329 + +++G +S L F++ + C + +C+++ P IFN + + +++D + Sbjct: 41 AVNVCEELEGMEFFLSDYPLLEDFEENVVKKCFRDNECVIYSPTGIFNACSVSEMSIDDN 100 Query: 330 TSDRIKAVKNFLVVLSDSICSLFIVNSNVASISLGNES--FSTDED----LEYGYLMNT- 382 SD +L + + S + + + + + ++ F D+D ++ GY++N Sbjct: 101 YSD-------YLSIWASSFAAKLLEHYGIVIKKIADDGTEFWEDQDKLNHIDNGYVLNVN 153 Query: 383 GNHYDVYLPPELF 395 GNHY+V LP ++F Sbjct: 154 GNHYNVRLPLDIF 166 >UniRef50_UPI0001850E26 hypothetical protein Bcoam_11275 n=1 Tax=Bacillus coahuilensis m4-4 RepID=UPI0001850E26 Length = 355 Score = 40.8 bits (94), Expect = 0.098, Method: Composition-based stats. Identities = 33/143 (23%), Positives = 58/143 (40%), Gaps = 5/143 (3%) Query: 233 KMQKEVYNEFIKMVDKKTIEALVDTAFYNLREDVKTLFGVDLQSDNQIQGQSSLMSWSFL 292 K+ E+Y E I +++K+ E VD + +R + G Q +Q+ G S+M W Sbjct: 168 KLTGELYKEIIDILEKE--EFPVDPSIITIRLSGYQVAGKTTQQSSQLLGMDSVMYWYRF 225 Query: 293 FFKKQFIDSCLNNEKCILHLPEFIFNDNKNLLALDTDTSDRIKAVKNFLVVLSDSICSLF 352 +I + + + F+ D + L T+ + VK L + S Sbjct: 226 LEGITYIVKLIKDNQREFPHLSFLVADLMSDFTLTESTTKTYELVKRGLSLKEISSARKL 285 Query: 353 ---IVNSNVASISLGNESFSTDE 372 + +V I+L + SFS D Sbjct: 286 RLSTIEDHVIEIALNDSSFSIDH 308 Database: uniref50.fasta Posted date: Mar 8, 2010 10:38 AM Number of letters in database: 1,040,396,356 Number of sequences in database: 3,077,464 Lambda K H 0.318 0.132 0.375 Lambda K H 0.267 0.0405 0.140 Matrix: BLOSUM62 Gap Penalties: Existence: 11, Extension: 1 Number of Hits to DB: 1,658,254,956 Number of Sequences: 3077464 Number of extensions: 70162186 Number of successful extensions: 193840 Number of sequences better than 1.0e-01: 3 Number of HSP's better than 0.1 without gapping: 3 Number of HSP's successfully gapped in prelim test: 4 Number of HSP's that attempted gapping in prelim test: 193828 Number of HSP's gapped (non-prelim): 7 length of query: 418 length of database: 1,040,396,356 effective HSP length: 131 effective length of query: 287 effective length of database: 637,248,572 effective search space: 182890340164 effective search space used: 182890340164 T: 11 A: 40 X1: 16 ( 7.3 bits) X2: 38 (14.6 bits) X3: 64 (24.7 bits) S1: 41 (21.7 bits) S2: 94 (40.8 bits)