BLASTP 2.2.22 [Sep-27-2009] Reference: Altschul, Stephen F., Thomas L. Madden, Alejandro A. Schaffer, Jinghui Zhang, Zheng Zhang, Webb Miller, and David J. Lipman (1997), "Gapped BLAST and PSI-BLAST: a new generation of protein database search programs", Nucleic Acids Res. 25:3389-3402. Reference for compositional score matrix adjustment: Altschul, Stephen F., John C. Wootton, E. Michael Gertz, Richa Agarwala, Aleksandr Morgulis, Alejandro A. Schaffer, and Yi-Kuo Yu (2005) "Protein database searches using compositionally adjusted substitution matrices", FEBS J. 272:5101-5109. Reference for composition-based statistics starting in round 2: Schaffer, Alejandro A., L. Aravind, Thomas L. Madden, Sergei Shavirin, John L. Spouge, Yuri I. Wolf, Eugene V. Koonin, and Stephen F. Altschul (2001), "Improving the accuracy of PSI-BLAST protein database searches with composition-based statistics and other refinements", Nucleic Acids Res. 29:2994-3005. Query= batch____ (395 letters) Database: uniref50.fasta 3,077,464 sequences; 1,040,396,356 total letters Searching..................................................done Results from round 1 Score E Sequences producing significant alignments: (bits) Value UniRef50_P11864 Uncharacterized protein yhaC n=45 Tax=Enterobact... 796 0.0 >UniRef50_P11864 Uncharacterized protein yhaC n=45 Tax=Enterobacteriaceae RepID=YHAC_ECOLI Length = 395 Score = 796 bits (2056), Expect = 0.0, Method: Compositional matrix adjust. Identities = 395/395 (100%), Positives = 395/395 (100%) Query: 1 MFPVSSIGNDISSDLVRRKMNDLPESPTGNNLEALAPGIEKLKQTSIEMVTLLNTLQPGG 60 MFPVSSIGNDISSDLVRRKMNDLPESPTGNNLEALAPGIEKLKQTSIEMVTLLNTLQPGG Sbjct: 1 MFPVSSIGNDISSDLVRRKMNDLPESPTGNNLEALAPGIEKLKQTSIEMVTLLNTLQPGG 60 Query: 61 KCIITGDFQKELAYLQNVILYNVSSLRLDFLGYNAQIIQRSDNTCELTINEPLKNQEIST 120 KCIITGDFQKELAYLQNVILYNVSSLRLDFLGYNAQIIQRSDNTCELTINEPLKNQEIST Sbjct: 61 KCIITGDFQKELAYLQNVILYNVSSLRLDFLGYNAQIIQRSDNTCELTINEPLKNQEIST 120 Query: 121 GNININCPLKDIYNEIRRLNVIFSCGTGDIVDLSSLDLRNVDLDYYDFTDKHMANTILNP 180 GNININCPLKDIYNEIRRLNVIFSCGTGDIVDLSSLDLRNVDLDYYDFTDKHMANTILNP Sbjct: 121 GNININCPLKDIYNEIRRLNVIFSCGTGDIVDLSSLDLRNVDLDYYDFTDKHMANTILNP 180 Query: 181 FKLNSTNFTNANMFQVNFVSSTQNATISWDYLLKITPVLISISDMYSEEKIKFVESCLNE 240 FKLNSTNFTNANMFQVNFVSSTQNATISWDYLLKITPVLISISDMYSEEKIKFVESCLNE Sbjct: 181 FKLNSTNFTNANMFQVNFVSSTQNATISWDYLLKITPVLISISDMYSEEKIKFVESCLNE 240 Query: 241 PGDITEEQLKIMRFAIIKSIPRATLTDKLENELTKEIYKSSSKIINCLNRIKLTEMKEFS 300 PGDITEEQLKIMRFAIIKSIPRATLTDKLENELTKEIYKSSSKIINCLNRIKLTEMKEFS Sbjct: 241 PGDITEEQLKIMRFAIIKSIPRATLTDKLENELTKEIYKSSSKIINCLNRIKLTEMKEFS 300 Query: 301 SEKIYDYIDIIIEDYENTKENAYLVVPQINYTMDLNIEDSSSEELLSDNTLEKDENSPDN 360 SEKIYDYIDIIIEDYENTKENAYLVVPQINYTMDLNIEDSSSEELLSDNTLEKDENSPDN Sbjct: 301 SEKIYDYIDIIIEDYENTKENAYLVVPQINYTMDLNIEDSSSEELLSDNTLEKDENSPDN 360 Query: 361 GFEVGEYNTYEAYNSEKQYFTREDYTYDYDLLNAI 395 GFEVGEYNTYEAYNSEKQYFTREDYTYDYDLLNAI Sbjct: 361 GFEVGEYNTYEAYNSEKQYFTREDYTYDYDLLNAI 395 Searching..................................................done Results from round 2 Score E Sequences producing significant alignments: (bits) Value Sequences used in model and found again: UniRef50_P11864 Uncharacterized protein yhaC n=45 Tax=Enterobact... 785 0.0 Sequences not found previously or not previously below threshold: UniRef50_C4QG00 Kif1, putative n=1 Tax=Schistosoma mansoni RepID... 46 0.002 UniRef50_B6G285 Putative uncharacterized protein n=1 Tax=Clostri... 45 0.007 UniRef50_B0C228 Pentapeptide repeat protein n=1 Tax=Acaryochlori... 42 0.049 UniRef50_B0SFM7 Methyl-accepting chemotaxis protein n=2 Tax=Lept... 41 0.059 UniRef50_Q23RG4 SNF2 family N-terminal domain containing protein... 41 0.094 CONVERGED! >UniRef50_P11864 Uncharacterized protein yhaC n=45 Tax=Enterobacteriaceae RepID=YHAC_ECOLI Length = 395 Score = 785 bits (2027), Expect = 0.0, Method: Composition-based stats. Identities = 395/395 (100%), Positives = 395/395 (100%) Query: 1 MFPVSSIGNDISSDLVRRKMNDLPESPTGNNLEALAPGIEKLKQTSIEMVTLLNTLQPGG 60 MFPVSSIGNDISSDLVRRKMNDLPESPTGNNLEALAPGIEKLKQTSIEMVTLLNTLQPGG Sbjct: 1 MFPVSSIGNDISSDLVRRKMNDLPESPTGNNLEALAPGIEKLKQTSIEMVTLLNTLQPGG 60 Query: 61 KCIITGDFQKELAYLQNVILYNVSSLRLDFLGYNAQIIQRSDNTCELTINEPLKNQEIST 120 KCIITGDFQKELAYLQNVILYNVSSLRLDFLGYNAQIIQRSDNTCELTINEPLKNQEIST Sbjct: 61 KCIITGDFQKELAYLQNVILYNVSSLRLDFLGYNAQIIQRSDNTCELTINEPLKNQEIST 120 Query: 121 GNININCPLKDIYNEIRRLNVIFSCGTGDIVDLSSLDLRNVDLDYYDFTDKHMANTILNP 180 GNININCPLKDIYNEIRRLNVIFSCGTGDIVDLSSLDLRNVDLDYYDFTDKHMANTILNP Sbjct: 121 GNININCPLKDIYNEIRRLNVIFSCGTGDIVDLSSLDLRNVDLDYYDFTDKHMANTILNP 180 Query: 181 FKLNSTNFTNANMFQVNFVSSTQNATISWDYLLKITPVLISISDMYSEEKIKFVESCLNE 240 FKLNSTNFTNANMFQVNFVSSTQNATISWDYLLKITPVLISISDMYSEEKIKFVESCLNE Sbjct: 181 FKLNSTNFTNANMFQVNFVSSTQNATISWDYLLKITPVLISISDMYSEEKIKFVESCLNE 240 Query: 241 PGDITEEQLKIMRFAIIKSIPRATLTDKLENELTKEIYKSSSKIINCLNRIKLTEMKEFS 300 PGDITEEQLKIMRFAIIKSIPRATLTDKLENELTKEIYKSSSKIINCLNRIKLTEMKEFS Sbjct: 241 PGDITEEQLKIMRFAIIKSIPRATLTDKLENELTKEIYKSSSKIINCLNRIKLTEMKEFS 300 Query: 301 SEKIYDYIDIIIEDYENTKENAYLVVPQINYTMDLNIEDSSSEELLSDNTLEKDENSPDN 360 SEKIYDYIDIIIEDYENTKENAYLVVPQINYTMDLNIEDSSSEELLSDNTLEKDENSPDN Sbjct: 301 SEKIYDYIDIIIEDYENTKENAYLVVPQINYTMDLNIEDSSSEELLSDNTLEKDENSPDN 360 Query: 361 GFEVGEYNTYEAYNSEKQYFTREDYTYDYDLLNAI 395 GFEVGEYNTYEAYNSEKQYFTREDYTYDYDLLNAI Sbjct: 361 GFEVGEYNTYEAYNSEKQYFTREDYTYDYDLLNAI 395 >UniRef50_C4QG00 Kif1, putative n=1 Tax=Schistosoma mansoni RepID=C4QG00_SCHMA Length = 1736 Score = 46.2 bits (108), Expect = 0.002, Method: Composition-based stats. Identities = 39/129 (30%), Positives = 61/129 (47%), Gaps = 14/129 (10%) Query: 168 FTDKHMANTILNPFKLNSTNFTNANMFQVNFVSSTQNATISWDYLLKITPVLISISDMYS 227 TDK +NT N L ST+ N+ QVN ++ST N T + L IT L ++++ Sbjct: 1484 LTDKRESNT--NNKSLKSTSNIEQNIKQVNEINSTTNETEASRNHLLITKCLTALTNYLP 1541 Query: 228 EEK--IKFVESCLNEPGDITEEQLKIMRFAIIKSIPRATLTDKLENELTKEIYKSSSKII 285 K I+ LN G +F IPR+ TD ++N L E Y SSS ++ Sbjct: 1542 INKCLIRHFGVTLNYGG----------QFLYSSGIPRSATTDSIQNILNGESYFSSSYLV 1591 Query: 286 NCLNRIKLT 294 + ++++ Sbjct: 1592 GECDEVRVS 1600 >UniRef50_B6G285 Putative uncharacterized protein n=1 Tax=Clostridium hiranonis DSM 13275 RepID=B6G285_9CLOT Length = 253 Score = 44.7 bits (104), Expect = 0.007, Method: Composition-based stats. Identities = 22/56 (39%), Positives = 34/56 (60%) Query: 151 VDLSSLDLRNVDLDYYDFTDKHMANTILNPFKLNSTNFTNANMFQVNFVSSTQNAT 206 +DLS LDLRNVDLD+ DF + +AN L++ +A++ +F + T +AT Sbjct: 33 MDLSELDLRNVDLDWSDFENVKLANVDFENSNLSNVFLEHADLSNASFKNCTLHAT 88 >UniRef50_B0C228 Pentapeptide repeat protein n=1 Tax=Acaryochloris marina MBIC11017 RepID=B0C228_ACAM1 Length = 502 Score = 41.6 bits (96), Expect = 0.049, Method: Composition-based stats. Identities = 38/136 (27%), Positives = 57/136 (41%), Gaps = 14/136 (10%) Query: 143 FSCGTGDIVDLSSLDLRNVDLDYYDFTDKHMANTILNPFKLNSTNFTNANMFQVNFVSST 202 FS G VDLSS +L VDL +D + +L L NFT +F NF ++T Sbjct: 228 FSRVDGCEVDLSSANLTGVDL-----SDADLQQAVLKEANLRDVNFTRCRLFGTNFAAAT 282 Query: 203 QNATISWDYLLKITPVLISISDMYS----EEKIKFVESCLNEPGDI----TEEQLKIMRF 254 D T L+ + Y+ ++ + + EPGD+ TE Q + Sbjct: 283 LTGACIEDISPNATTTLVDVVCDYAYQRHNQQERIPKQGYFEPGDVKTFFTEAQ-ATLNL 341 Query: 255 AIIKSIPRATLTDKLE 270 + K + A LT L+ Sbjct: 342 GLPKDVHWAALTYSLQ 357 >UniRef50_B0SFM7 Methyl-accepting chemotaxis protein n=2 Tax=Leptospira biflexa serovar Patoc RepID=B0SFM7_LEPBA Length = 517 Score = 41.2 bits (95), Expect = 0.059, Method: Composition-based stats. Identities = 33/151 (21%), Positives = 72/151 (47%), Gaps = 3/151 (1%) Query: 199 VSSTQNATISWDYLLKITPVLISISDMYSEEKIKFVESCLNEPGDITEEQLKIMRFAIIK 258 V + A D LL++ +++ D+ +++IK ++ E I+E Q + F I Sbjct: 194 VEKQKEAKKVLDNLLEMAKTILNTIDLM-KQQIKNQDTISEELKQISEHQASSL-FEIST 251 Query: 259 SIPRATLTDKLENELTKEIYKSSSKIINCLNRIK-LTEMKEFSSEKIYDYIDIIIEDYEN 317 S+ + N++ K +Y+ S I +N +K + E + + IY +D++ E + Sbjct: 252 SLQELSSKADSNNKIAKSLYQESETSIQSVNDLKAINETVQTGTGSIYKNLDVVTEYSND 311 Query: 318 TKENAYLVVPQINYTMDLNIEDSSSEELLSD 348 T ++ +L + + N + + E S +++D Sbjct: 312 TSKHIHLSIDKFNILQEKSREISEFVSVIND 342 >UniRef50_Q23RG4 SNF2 family N-terminal domain containing protein n=1 Tax=Tetrahymena thermophila SB210 RepID=Q23RG4_TETTH Length = 1016 Score = 40.8 bits (94), Expect = 0.094, Method: Composition-based stats. Identities = 49/216 (22%), Positives = 93/216 (43%), Gaps = 19/216 (8%) Query: 181 FKLNSTNFTNANMFQVNFVSSTQNATISWDYLLKITPVLISISDMYSEEKIKFVESCLNE 240 F L+ N N F + + + +TIS D + I +D ++E K VE + Sbjct: 514 FNLDQHNIKNITQFGADEIFKAKGSTISDDDIDMILAKGEKKTDEQNKEIQKLVEEKMKG 573 Query: 241 PGDITEEQLKIM-------RFAIIKSIPRATLTDKLENELTKEIYKSSSKIINCLNRIKL 293 D+ + +K R +I +A L +LE + K+IY+ K N L ++K+ Sbjct: 574 LFDLEVKSIKCKELFGEDPRLQDEIAIKQAALQSELEEK--KKIYEDQIKK-NKLIKVKI 630 Query: 294 TEMKEFSSEKIYDYIDIIIEDYENTKENAYLVVPQINYTMDLN----IEDSSSEELLSDN 349 ++ E+ + ++ +++ KE YL I Y + L I++ + L+D+ Sbjct: 631 IQLPEYQLFESKSRLEQLLQ-----KEEEYLARGGIMYKVGLYSEEFIQNMAKNNGLTDD 685 Query: 350 TLEKDENSPDNGFEVGEYNTYEAYNSEKQYFTREDY 385 L++ E GF + N Y+ + + R D+ Sbjct: 686 ELKEKEQLMQQGFPYWDANDYKEFLKASYLYGRGDF 721 Database: uniref50.fasta Posted date: Mar 8, 2010 10:38 AM Number of letters in database: 1,040,396,356 Number of sequences in database: 3,077,464 Lambda K H 0.314 0.133 0.371 Lambda K H 0.267 0.0412 0.140 Matrix: BLOSUM62 Gap Penalties: Existence: 11, Extension: 1 Number of Hits to DB: 1,542,474,651 Number of Sequences: 3077464 Number of extensions: 65770107 Number of successful extensions: 180853 Number of sequences better than 1.0e-01: 12 Number of HSP's better than 0.1 without gapping: 6 Number of HSP's successfully gapped in prelim test: 25 Number of HSP's that attempted gapping in prelim test: 180789 Number of HSP's gapped (non-prelim): 77 length of query: 395 length of database: 1,040,396,356 effective HSP length: 131 effective length of query: 264 effective length of database: 637,248,572 effective search space: 168233623008 effective search space used: 168233623008 T: 11 A: 40 X1: 16 ( 7.3 bits) X2: 38 (14.6 bits) X3: 64 (24.7 bits) S1: 42 (22.0 bits) S2: 94 (40.8 bits)