BLASTP 2.2.22 [Sep-27-2009] Reference: Altschul, Stephen F., Thomas L. Madden, Alejandro A. Schaffer, Jinghui Zhang, Zheng Zhang, Webb Miller, and David J. Lipman (1997), "Gapped BLAST and PSI-BLAST: a new generation of protein database search programs", Nucleic Acids Res. 25:3389-3402. Reference for compositional score matrix adjustment: Altschul, Stephen F., John C. Wootton, E. Michael Gertz, Richa Agarwala, Aleksandr Morgulis, Alejandro A. Schaffer, and Yi-Kuo Yu (2005) "Protein database searches using compositionally adjusted substitution matrices", FEBS J. 272:5101-5109. Reference for composition-based statistics starting in round 2: Schaffer, Alejandro A., L. Aravind, Thomas L. Madden, Sergei Shavirin, John L. Spouge, Yuri I. Wolf, Eugene V. Koonin, and Stephen F. Altschul (2001), "Improving the accuracy of PSI-BLAST protein database searches with composition-based statistics and other refinements", Nucleic Acids Res. 29:2994-3005. Query= batch____ (405 letters) Database: uniref50.fasta 3,077,464 sequences; 1,040,396,356 total letters Searching..................................................done Results from round 1 Score E Sequences producing significant alignments: (bits) Value UniRef50_P71238 Putative colanic acid polymerase n=105 Tax=Enter... 800 0.0 UniRef50_A5CRE7 Wzy1 protein n=1 Tax=Clavibacter michiganensis s... 80 1e-13 >UniRef50_P71238 Putative colanic acid polymerase n=105 Tax=Enterobacteriaceae RepID=WCAD_ECOLI Length = 405 Score = 800 bits (2066), Expect = 0.0, Method: Compositional matrix adjust. Identities = 405/405 (100%), Positives = 405/405 (100%) Query: 1 MSTSIRICSYLLLPLIYLLVNVKIAQLGESFPITIVTFLPVLLLLFLERISVKKLMIALG 60 MSTSIRICSYLLLPLIYLLVNVKIAQLGESFPITIVTFLPVLLLLFLERISVKKLMIALG Sbjct: 1 MSTSIRICSYLLLPLIYLLVNVKIAQLGESFPITIVTFLPVLLLLFLERISVKKLMIALG 60 Query: 61 IGAGLTAFNYLFGQSLDASKYVTSTMLFVYIVIIIGMVWSIRFKTISPHNHRKILRFFYL 120 IGAGLTAFNYLFGQSLDASKYVTSTMLFVYIVIIIGMVWSIRFKTISPHNHRKILRFFYL Sbjct: 61 IGAGLTAFNYLFGQSLDASKYVTSTMLFVYIVIIIGMVWSIRFKTISPHNHRKILRFFYL 120 Query: 121 VVGLVVALAAVEMAQIILTGGSSIMESISKYLIYSNSYVLNFIKFGGKRTTALYFEPAFF 180 VVGLVVALAAVEMAQIILTGGSSIMESISKYLIYSNSYVLNFIKFGGKRTTALYFEPAFF Sbjct: 121 VVGLVVALAAVEMAQIILTGGSSIMESISKYLIYSNSYVLNFIKFGGKRTTALYFEPAFF 180 Query: 181 ALALISIWLSIKQFGIKTPKTDAMILAGIILSGSFSGVMTFILFYLLEWAFQYLNKEAIK 240 ALALISIWLSIKQFGIKTPKTDAMILAGIILSGSFSGVMTFILFYLLEWAFQYLNKEAIK Sbjct: 181 ALALISIWLSIKQFGIKTPKTDAMILAGIILSGSFSGVMTFILFYLLEWAFQYLNKEAIK 240 Query: 241 KKLPLALISLAVFLVGVVIAFPYISTRLGDLGTEGSSSYYRIVGPLVMVGYSLTHIDGVV 300 KKLPLALISLAVFLVGVVIAFPYISTRLGDLGTEGSSSYYRIVGPLVMVGYSLTHIDGVV Sbjct: 241 KKLPLALISLAVFLVGVVIAFPYISTRLGDLGTEGSSSYYRIVGPLVMVGYSLTHIDGVV 300 Query: 301 RFGSLYEYVASFGIFNGADVGKTIDNGLYLLIIYFSWFAVFLSLWYMGKVIKMMINAFGD 360 RFGSLYEYVASFGIFNGADVGKTIDNGLYLLIIYFSWFAVFLSLWYMGKVIKMMINAFGD Sbjct: 301 RFGSLYEYVASFGIFNGADVGKTIDNGLYLLIIYFSWFAVFLSLWYMGKVIKMMINAFGD 360 Query: 361 NRNFRVQLYLFTPVSLFFTGSIFSPEYAFLIVCPFILRKALNITR 405 NRNFRVQLYLFTPVSLFFTGSIFSPEYAFLIVCPFILRKALNITR Sbjct: 361 NRNFRVQLYLFTPVSLFFTGSIFSPEYAFLIVCPFILRKALNITR 405 >UniRef50_A5CRE7 Wzy1 protein n=1 Tax=Clavibacter michiganensis subsp. michiganensis NCPPB 382 RepID=A5CRE7_CLAM3 Length = 427 Score = 80.1 bits (196), Expect = 1e-13, Method: Compositional matrix adjust. Identities = 48/153 (31%), Positives = 82/153 (53%), Gaps = 2/153 (1%) Query: 238 AIKKKLPLALISLAVFLVGVVIAFPYISTRLGDLGTEGSSSYYRIVGPLVMVGYSLTHID 297 A++ + +++ L+ V V + Y+ +RL T GSS+ YR+V PL ++G +L H Sbjct: 226 ALRSRPGASIVVLSGLAVVGVASGTYLQSRLESFSTSGSSTNYRLVAPLQVLGDTLLHTP 285 Query: 298 GVVRFGSLYEYVASFGIFNGADVGKTIDNGLYLLIIYFSWFAVFLSLWYMGKVIKMMINA 357 GS+ + + ++NGA++G ++DNGLY+L+ YF W + L ++ M+ Sbjct: 286 IGHALGSVSNILLGYDLYNGAELGTSLDNGLYVLVFYFGWIGLVLIAALAVMALRAMLRG 345 Query: 358 FGDNRNFRVQLYLFTPVSLFFTGSIFSPEYAFL 390 L+LF +LFF+G I PEYA + Sbjct: 346 GRSTWGAVAPLWLFG--TLFFSGGIMLPEYAVM 376 Searching..................................................done Results from round 2 Score E Sequences producing significant alignments: (bits) Value Sequences used in model and found again: UniRef50_P71238 Putative colanic acid polymerase n=105 Tax=Enter... 600 e-170 UniRef50_A5CRE7 Wzy1 protein n=1 Tax=Clavibacter michiganensis s... 207 8e-52 Sequences not found previously or not previously below threshold: CONVERGED! >UniRef50_P71238 Putative colanic acid polymerase n=105 Tax=Enterobacteriaceae RepID=WCAD_ECOLI Length = 405 Score = 600 bits (1547), Expect = e-170, Method: Composition-based stats. Identities = 405/405 (100%), Positives = 405/405 (100%) Query: 1 MSTSIRICSYLLLPLIYLLVNVKIAQLGESFPITIVTFLPVLLLLFLERISVKKLMIALG 60 MSTSIRICSYLLLPLIYLLVNVKIAQLGESFPITIVTFLPVLLLLFLERISVKKLMIALG Sbjct: 1 MSTSIRICSYLLLPLIYLLVNVKIAQLGESFPITIVTFLPVLLLLFLERISVKKLMIALG 60 Query: 61 IGAGLTAFNYLFGQSLDASKYVTSTMLFVYIVIIIGMVWSIRFKTISPHNHRKILRFFYL 120 IGAGLTAFNYLFGQSLDASKYVTSTMLFVYIVIIIGMVWSIRFKTISPHNHRKILRFFYL Sbjct: 61 IGAGLTAFNYLFGQSLDASKYVTSTMLFVYIVIIIGMVWSIRFKTISPHNHRKILRFFYL 120 Query: 121 VVGLVVALAAVEMAQIILTGGSSIMESISKYLIYSNSYVLNFIKFGGKRTTALYFEPAFF 180 VVGLVVALAAVEMAQIILTGGSSIMESISKYLIYSNSYVLNFIKFGGKRTTALYFEPAFF Sbjct: 121 VVGLVVALAAVEMAQIILTGGSSIMESISKYLIYSNSYVLNFIKFGGKRTTALYFEPAFF 180 Query: 181 ALALISIWLSIKQFGIKTPKTDAMILAGIILSGSFSGVMTFILFYLLEWAFQYLNKEAIK 240 ALALISIWLSIKQFGIKTPKTDAMILAGIILSGSFSGVMTFILFYLLEWAFQYLNKEAIK Sbjct: 181 ALALISIWLSIKQFGIKTPKTDAMILAGIILSGSFSGVMTFILFYLLEWAFQYLNKEAIK 240 Query: 241 KKLPLALISLAVFLVGVVIAFPYISTRLGDLGTEGSSSYYRIVGPLVMVGYSLTHIDGVV 300 KKLPLALISLAVFLVGVVIAFPYISTRLGDLGTEGSSSYYRIVGPLVMVGYSLTHIDGVV Sbjct: 241 KKLPLALISLAVFLVGVVIAFPYISTRLGDLGTEGSSSYYRIVGPLVMVGYSLTHIDGVV 300 Query: 301 RFGSLYEYVASFGIFNGADVGKTIDNGLYLLIIYFSWFAVFLSLWYMGKVIKMMINAFGD 360 RFGSLYEYVASFGIFNGADVGKTIDNGLYLLIIYFSWFAVFLSLWYMGKVIKMMINAFGD Sbjct: 301 RFGSLYEYVASFGIFNGADVGKTIDNGLYLLIIYFSWFAVFLSLWYMGKVIKMMINAFGD 360 Query: 361 NRNFRVQLYLFTPVSLFFTGSIFSPEYAFLIVCPFILRKALNITR 405 NRNFRVQLYLFTPVSLFFTGSIFSPEYAFLIVCPFILRKALNITR Sbjct: 361 NRNFRVQLYLFTPVSLFFTGSIFSPEYAFLIVCPFILRKALNITR 405 >UniRef50_A5CRE7 Wzy1 protein n=1 Tax=Clavibacter michiganensis subsp. michiganensis NCPPB 382 RepID=A5CRE7_CLAM3 Length = 427 Score = 207 bits (526), Expect = 8e-52, Method: Composition-based stats. Identities = 48/153 (31%), Positives = 82/153 (53%), Gaps = 2/153 (1%) Query: 238 AIKKKLPLALISLAVFLVGVVIAFPYISTRLGDLGTEGSSSYYRIVGPLVMVGYSLTHID 297 A++ + +++ L+ V V + Y+ +RL T GSS+ YR+V PL ++G +L H Sbjct: 226 ALRSRPGASIVVLSGLAVVGVASGTYLQSRLESFSTSGSSTNYRLVAPLQVLGDTLLHTP 285 Query: 298 GVVRFGSLYEYVASFGIFNGADVGKTIDNGLYLLIIYFSWFAVFLSLWYMGKVIKMMINA 357 GS+ + + ++NGA++G ++DNGLY+L+ YF W + L ++ M+ Sbjct: 286 IGHALGSVSNILLGYDLYNGAELGTSLDNGLYVLVFYFGWIGLVLIAALAVMALRAMLRG 345 Query: 358 FGDNRNFRVQLYLFTPVSLFFTGSIFSPEYAFL 390 L+LF +LFF+G I PEYA + Sbjct: 346 GRSTWGAVAPLWLFG--TLFFSGGIMLPEYAVM 376 Database: uniref50.fasta Posted date: Mar 8, 2010 10:38 AM Number of letters in database: 1,040,396,356 Number of sequences in database: 3,077,464 Lambda K H 0.330 0.146 0.418 Lambda K H 0.267 0.0454 0.140 Matrix: BLOSUM62 Gap Penalties: Existence: 11, Extension: 1 Number of Hits to DB: 1,777,333,582 Number of Sequences: 3077464 Number of extensions: 83293259 Number of successful extensions: 327742 Number of sequences better than 1.0e-01: 60 Number of HSP's better than 0.1 without gapping: 4 Number of HSP's successfully gapped in prelim test: 65 Number of HSP's that attempted gapping in prelim test: 327693 Number of HSP's gapped (non-prelim): 88 length of query: 405 length of database: 1,040,396,356 effective HSP length: 131 effective length of query: 274 effective length of database: 637,248,572 effective search space: 174606108728 effective search space used: 174606108728 T: 11 A: 40 X1: 16 ( 7.6 bits) X2: 38 (14.6 bits) X3: 64 (24.7 bits) S1: 40 (21.8 bits) S2: 94 (40.7 bits)