BLASTP 2.2.22 [Sep-27-2009] Reference: Altschul, Stephen F., Thomas L. Madden, Alejandro A. Schaffer, Jinghui Zhang, Zheng Zhang, Webb Miller, and David J. Lipman (1997), "Gapped BLAST and PSI-BLAST: a new generation of protein database search programs", Nucleic Acids Res. 25:3389-3402. Reference for compositional score matrix adjustment: Altschul, Stephen F., John C. Wootton, E. Michael Gertz, Richa Agarwala, Aleksandr Morgulis, Alejandro A. Schaffer, and Yi-Kuo Yu (2005) "Protein database searches using compositionally adjusted substitution matrices", FEBS J. 272:5101-5109. Reference for composition-based statistics starting in round 2: Schaffer, Alejandro A., L. Aravind, Thomas L. Madden, Sergei Shavirin, John L. Spouge, Yuri I. Wolf, Eugene V. Koonin, and Stephen F. Altschul (2001), "Improving the accuracy of PSI-BLAST protein database searches with composition-based statistics and other refinements", Nucleic Acids Res. 29:2994-3005. Query= batch____ (464 letters) Database: uniref50.fasta 3,077,464 sequences; 1,040,396,356 total letters Searching..................................................done Results from round 1 Score E Sequences producing significant alignments: (bits) Value UniRef50_P26389 Colanic acid biosynthesis protein wcaM n=103 Tax... 752 0.0 >UniRef50_P26389 Colanic acid biosynthesis protein wcaM n=103 Tax=Enterobacteriaceae RepID=WCAM_SALTY Length = 467 Score = 752 bits (1942), Expect = 0.0, Method: Compositional matrix adjust. Identities = 359/466 (77%), Positives = 408/466 (87%), Gaps = 7/466 (1%) Query: 1 MPFKKLSRRTFLTASSALAFLHTPFARALPA-----RQSVNINDYNPHDWIASFKQAFSE 55 MP K SRRT LTA SALA L PF RALP R++V+I DY D IASFKQAF++ Sbjct: 1 MPATKFSRRTLLTAGSALAVL--PFLRALPVQAREPRETVDIKDYPADDGIASFKQAFAD 58 Query: 56 GQTVVVPAGLVCDNINTGIFIPPGKTLHILGSLRGNGRGRFVLQDGSQVTGEDGGSMHNI 115 GQTVVVP G VC+NIN I IP GKTL + G++RGNGRGRF+LQDG QV GE GGS+HN+ Sbjct: 59 GQTVVVPPGWVCENINAAITIPAGKTLRVQGAVRGNGRGRFILQDGCQVVGEQGGSLHNV 118 Query: 116 TLDVRGSDCTIKGLTMSGFGPVTQIYIGGKNKRVMRNLIIDNLTVSHANYAILRQGFHNQ 175 TLDVRGSDC IKG+ MSGFGPV QI+IGGK +VMRNLIID++TV+HANYAILRQGFHNQ Sbjct: 119 TLDVRGSDCVIKGVAMSGFGPVAQIFIGGKEPQVMRNLIIDDITVTHANYAILRQGFHNQ 178 Query: 176 IIGANITNCKFSDLQGDAIEWNVAINDRDILISDHVIERINCTNGKINWGIGIGLAGSTY 235 + GA IT+ +FSDLQGDAIEWNVAI+DRDILISDHVIERINCTNGKINWGIGIGLAGSTY Sbjct: 179 MDGARITHSRFSDLQGDAIEWNVAIHDRDILISDHVIERINCTNGKINWGIGIGLAGSTY 238 Query: 236 DNNYPEDQAVKNFVVANITGSDCRQLIHVENGKHFVIRNIKARNITPDFSKKAGIDNATV 295 DN+YPEDQAVKNFVVANITGSDCRQL+HVENGKHFVIRN+KA+NITP FSK AGIDNAT+ Sbjct: 239 DNSYPEDQAVKNFVVANITGSDCRQLVHVENGKHFVIRNVKAKNITPGFSKNAGIDNATI 298 Query: 296 AIYGCDNFVIDNIEMINSAGMLIGYGVIKGKYLSIPQNFRVNDIQLDNTHLAYKLRGIQI 355 AIYGCDNFVIDNI+M NSAGMLIGYGV+KGKYLSIPQNF++N I+LDN +AYKLRGIQI Sbjct: 299 AIYGCDNFVIDNIDMTNSAGMLIGYGVVKGKYLSIPQNFKLNAIRLDNRQVAYKLRGIQI 358 Query: 356 SAGNAVSFVALTNIEMKRASLELHNKPQHFFMRNINVMQESSVGPALSMNFDMRKDVRGV 415 S+GN SFVA+TN+ M RA+LELHN+PQH F+RNINVMQ S++GPAL M+FD+RKDVRG Sbjct: 359 SSGNTPSFVAITNVRMTRATLELHNQPQHLFLRNINVMQTSAIGPALKMHFDLRKDVRGQ 418 Query: 416 FMAKKETLLSLANVHAVNEKGQSSVDIDRINHHIVNVEKINFRLPE 461 FMA+++TLLSLANVHA+NE GQSSVDIDRINH VNVE +NF LP+ Sbjct: 419 FMARQDTLLSLANVHAINENGQSSVDIDRINHQTVNVEAVNFSLPK 464 Searching..................................................done Results from round 2 Score E Sequences producing significant alignments: (bits) Value Sequences used in model and found again: UniRef50_P26389 Colanic acid biosynthesis protein wcaM n=103 Tax... 856 0.0 Sequences not found previously or not previously below threshold: CONVERGED! >UniRef50_P26389 Colanic acid biosynthesis protein wcaM n=103 Tax=Enterobacteriaceae RepID=WCAM_SALTY Length = 467 Score = 856 bits (2213), Expect = 0.0, Method: Composition-based stats. Identities = 359/466 (77%), Positives = 408/466 (87%), Gaps = 7/466 (1%) Query: 1 MPFKKLSRRTFLTASSALAFLHTPFARALPA-----RQSVNINDYNPHDWIASFKQAFSE 55 MP K SRRT LTA SALA L PF RALP R++V+I DY D IASFKQAF++ Sbjct: 1 MPATKFSRRTLLTAGSALAVL--PFLRALPVQAREPRETVDIKDYPADDGIASFKQAFAD 58 Query: 56 GQTVVVPAGLVCDNINTGIFIPPGKTLHILGSLRGNGRGRFVLQDGSQVTGEDGGSMHNI 115 GQTVVVP G VC+NIN I IP GKTL + G++RGNGRGRF+LQDG QV GE GGS+HN+ Sbjct: 59 GQTVVVPPGWVCENINAAITIPAGKTLRVQGAVRGNGRGRFILQDGCQVVGEQGGSLHNV 118 Query: 116 TLDVRGSDCTIKGLTMSGFGPVTQIYIGGKNKRVMRNLIIDNLTVSHANYAILRQGFHNQ 175 TLDVRGSDC IKG+ MSGFGPV QI+IGGK +VMRNLIID++TV+HANYAILRQGFHNQ Sbjct: 119 TLDVRGSDCVIKGVAMSGFGPVAQIFIGGKEPQVMRNLIIDDITVTHANYAILRQGFHNQ 178 Query: 176 IIGANITNCKFSDLQGDAIEWNVAINDRDILISDHVIERINCTNGKINWGIGIGLAGSTY 235 + GA IT+ +FSDLQGDAIEWNVAI+DRDILISDHVIERINCTNGKINWGIGIGLAGSTY Sbjct: 179 MDGARITHSRFSDLQGDAIEWNVAIHDRDILISDHVIERINCTNGKINWGIGIGLAGSTY 238 Query: 236 DNNYPEDQAVKNFVVANITGSDCRQLIHVENGKHFVIRNIKARNITPDFSKKAGIDNATV 295 DN+YPEDQAVKNFVVANITGSDCRQL+HVENGKHFVIRN+KA+NITP FSK AGIDNAT+ Sbjct: 239 DNSYPEDQAVKNFVVANITGSDCRQLVHVENGKHFVIRNVKAKNITPGFSKNAGIDNATI 298 Query: 296 AIYGCDNFVIDNIEMINSAGMLIGYGVIKGKYLSIPQNFRVNDIQLDNTHLAYKLRGIQI 355 AIYGCDNFVIDNI+M NSAGMLIGYGV+KGKYLSIPQNF++N I+LDN +AYKLRGIQI Sbjct: 299 AIYGCDNFVIDNIDMTNSAGMLIGYGVVKGKYLSIPQNFKLNAIRLDNRQVAYKLRGIQI 358 Query: 356 SAGNAVSFVALTNIEMKRASLELHNKPQHFFMRNINVMQESSVGPALSMNFDMRKDVRGV 415 S+GN SFVA+TN+ M RA+LELHN+PQH F+RNINVMQ S++GPAL M+FD+RKDVRG Sbjct: 359 SSGNTPSFVAITNVRMTRATLELHNQPQHLFLRNINVMQTSAIGPALKMHFDLRKDVRGQ 418 Query: 416 FMAKKETLLSLANVHAVNEKGQSSVDIDRINHHIVNVEKINFRLPE 461 FMA+++TLLSLANVHA+NE GQSSVDIDRINH VNVE +NF LP+ Sbjct: 419 FMARQDTLLSLANVHAINENGQSSVDIDRINHQTVNVEAVNFSLPK 464 Database: uniref50.fasta Posted date: Mar 8, 2010 10:38 AM Number of letters in database: 1,040,396,356 Number of sequences in database: 3,077,464 Lambda K H 0.317 0.143 0.426 Lambda K H 0.267 0.0438 0.140 Matrix: BLOSUM62 Gap Penalties: Existence: 11, Extension: 1 Number of Hits to DB: 1,983,686,956 Number of Sequences: 3077464 Number of extensions: 89764617 Number of successful extensions: 185289 Number of sequences better than 1.0e-01: 2 Number of HSP's better than 0.1 without gapping: 2 Number of HSP's successfully gapped in prelim test: 3 Number of HSP's that attempted gapping in prelim test: 185279 Number of HSP's gapped (non-prelim): 5 length of query: 464 length of database: 1,040,396,356 effective HSP length: 132 effective length of query: 332 effective length of database: 634,171,108 effective search space: 210544807856 effective search space used: 210544807856 T: 11 A: 40 X1: 16 ( 7.3 bits) X2: 38 (14.6 bits) X3: 64 (24.7 bits) S1: 41 (21.6 bits) S2: 95 (41.1 bits)