BLASTP 2.2.22 [Sep-27-2009] Reference: Altschul, Stephen F., Thomas L. Madden, Alejandro A. Schaffer, Jinghui Zhang, Zheng Zhang, Webb Miller, and David J. Lipman (1997), "Gapped BLAST and PSI-BLAST: a new generation of protein database search programs", Nucleic Acids Res. 25:3389-3402. Reference for compositional score matrix adjustment: Altschul, Stephen F., John C. Wootton, E. Michael Gertz, Richa Agarwala, Aleksandr Morgulis, Alejandro A. Schaffer, and Yi-Kuo Yu (2005) "Protein database searches using compositionally adjusted substitution matrices", FEBS J. 272:5101-5109. Reference for composition-based statistics starting in round 2: Schaffer, Alejandro A., L. Aravind, Thomas L. Madden, Sergei Shavirin, John L. Spouge, Yuri I. Wolf, Eugene V. Koonin, and Stephen F. Altschul (2001), "Improving the accuracy of PSI-BLAST protein database searches with composition-based statistics and other refinements", Nucleic Acids Res. 29:2994-3005. Query= batch____ (138 letters) Database: uniref50.fasta 3,077,464 sequences; 1,040,396,356 total letters Searching..................................................done Results from round 1 Score E Sequences producing significant alignments: (bits) Value UniRef50_P46856 Putative uncharacterized protein yrhA n=34 Tax=E... 278 4e-74 UniRef50_UPI00018265F6 hypothetical protein EcanA3_08239 n=1 Tax... 149 2e-35 UniRef50_A9MS31 Putative uncharacterized protein n=1 Tax=Salmone... 56 4e-07 UniRef50_C4X9I2 Putative uncharacterized protein n=4 Tax=Klebsie... 55 8e-07 UniRef50_A7KG42 Putative uncharacterized protein n=3 Tax=Enterob... 53 3e-06 >UniRef50_P46856 Putative uncharacterized protein yrhA n=34 Tax=Enterobacteriaceae RepID=YRHA_ECOLI Length = 138 Score = 278 bits (711), Expect = 4e-74, Method: Compositional matrix adjust. Identities = 138/138 (100%), Positives = 138/138 (100%) Query: 1 MMTKTQINKLIKMMNDLDYPFEAPLKESFIESIIQIEFNSNSTNCLEKLCNEVSILFKNQ 60 MMTKTQINKLIKMMNDLDYPFEAPLKESFIESIIQIEFNSNSTNCLEKLCNEVSILFKNQ Sbjct: 1 MMTKTQINKLIKMMNDLDYPFEAPLKESFIESIIQIEFNSNSTNCLEKLCNEVSILFKNQ 60 Query: 61 PDYLTFLRAMDGFEVNGLRLFSLSIPEPSVKNLFAVNEFYRNNDDFINPDLQERLVIGDY 120 PDYLTFLRAMDGFEVNGLRLFSLSIPEPSVKNLFAVNEFYRNNDDFINPDLQERLVIGDY Sbjct: 61 PDYLTFLRAMDGFEVNGLRLFSLSIPEPSVKNLFAVNEFYRNNDDFINPDLQERLVIGDY 120 Query: 121 SISIFTYDIKGDAANLLI 138 SISIFTYDIKGDAANLLI Sbjct: 121 SISIFTYDIKGDAANLLI 138 >UniRef50_UPI00018265F6 hypothetical protein EcanA3_08239 n=1 Tax=Enterobacter cancerogenus ATCC 35316 RepID=UPI00018265F6 Length = 166 Score = 149 bits (377), Expect = 2e-35, Method: Compositional matrix adjust. Identities = 74/124 (59%), Positives = 95/124 (76%) Query: 7 INKLIKMMNDLDYPFEAPLKESFIESIIQIEFNSNSTNCLEKLCNEVSILFKNQPDYLTF 66 +N L M + +YP E P+ + +IESI + + S+ + LEKL +EV +LF NQPDY+TF Sbjct: 9 LNDLKSKMTNWNYPIEEPITDQYIESITRRKSTSSLSPMLEKLGDEVCLLFVNQPDYVTF 68 Query: 67 LRAMDGFEVNGLRLFSLSIPEPSVKNLFAVNEFYRNNDDFINPDLQERLVIGDYSISIFT 126 L MDGFE +GL LFSLSIPEP V+NLF +N+FYRNN+D+INPDL ERLVIG+ SIS+FT Sbjct: 69 LETMDGFEFDGLILFSLSIPEPLVRNLFVMNDFYRNNNDYINPDLSERLVIGNDSISVFT 128 Query: 127 YDIK 130 YD K Sbjct: 129 YDSK 132 >UniRef50_A9MS31 Putative uncharacterized protein n=1 Tax=Salmonella enterica subsp. arizonae serovar 62:z4,z23:-- RepID=A9MS31_SALAR Length = 228 Score = 55.8 bits (133), Expect = 4e-07, Method: Compositional matrix adjust. Identities = 34/126 (26%), Positives = 63/126 (50%), Gaps = 3/126 (2%) Query: 10 LIKMMNDLDYPFEAPLKESFIESIIQIEFNSNSTNCLEKLCNEVSILFKNQPDYLTFLRA 69 L ++ D +YP P+ + +++ + K+ ++ L ++ DY Sbjct: 70 LKTLLVDNNYPVANPVDPDVMADLMRKAAPAEWDAQKRKVFEDIQRLMIHRLDYSDMFEI 129 Query: 70 MDGFEVNGLRLFSLSIPE---PSVKNLFAVNEFYRNNDDFINPDLQERLVIGDYSISIFT 126 MDG E NGL L+SL E P N++ N R+ND +++P+L ++++IG+ +S+F Sbjct: 130 MDGLEYNGLTLYSLVQAENDEPVWSNIYIRNFETRDNDIYVDPNLSDKVLIGEDGMSVFA 189 Query: 127 YDIKGD 132 Y D Sbjct: 190 YSFNDD 195 >UniRef50_C4X9I2 Putative uncharacterized protein n=4 Tax=Klebsiella RepID=C4X9I2_KLEPN Length = 227 Score = 55.1 bits (131), Expect = 8e-07, Method: Compositional matrix adjust. Identities = 34/129 (26%), Positives = 62/129 (48%), Gaps = 3/129 (2%) Query: 7 INKLIKMMNDLDYPFEAPLKESFIESIIQIEFNSNSTNCLEKLCNEVSILFKNQPDYLTF 66 I L +M + +YP P+ + ++ + ++ ++ L ++PDY Sbjct: 67 IRNLKSLMVEGNYPVADPVDPDVMADLMHKVAPVEWEDKKRRVYEDIQRLLISRPDYADM 126 Query: 67 LRAMDGFEVNGLRLFSLSIPE---PSVKNLFAVNEFYRNNDDFINPDLQERLVIGDYSIS 123 MDG E NG+ L+SL E P+ N+F N R N+ +++P+L ++VIG+ +S Sbjct: 127 FNIMDGGEYNGMTLYSLVQAENGEPAWSNIFVRNFDTRINEIYVDPNLIGKVVIGEEGMS 186 Query: 124 IFTYDIKGD 132 + Y D Sbjct: 187 VIVYSFTDD 195 >UniRef50_A7KG42 Putative uncharacterized protein n=3 Tax=Enterobacteriaceae RepID=A7KG42_KLEPN Length = 168 Score = 52.8 bits (125), Expect = 3e-06, Method: Compositional matrix adjust. Identities = 34/133 (25%), Positives = 67/133 (50%), Gaps = 11/133 (8%) Query: 7 INKLIKMMNDLDYPFEAPLKESFIESII----QIEFNSNSTNCLEKLCNEVSILFKNQPD 62 ++ L ++M YP +P+ ++ +I +E+ + + ++ L K + D Sbjct: 8 LSSLKRLMVADHYPLASPVAPEVLKDLICNPPPVEWADHKKSAY----IDIQKLIKTRLD 63 Query: 63 YLTFLRAMDGFEVNGLRLFSLSIPEPS---VKNLFAVNEFYRNNDDFINPDLQERLVIGD 119 Y AMDGFE NGL ++L E N++ N R+N+ +++P+L ++++IG+ Sbjct: 64 YAQVFNAMDGFEYNGLTFYNLVQAENENLLWSNIYIRNFEARDNEIYVDPNLTDKVLIGE 123 Query: 120 YSISIFTYDIKGD 132 +S+F Y D Sbjct: 124 DGMSLFAYSFADD 136 Searching..................................................done Results from round 2 Score E Sequences producing significant alignments: (bits) Value Sequences used in model and found again: UniRef50_P46856 Putative uncharacterized protein yrhA n=34 Tax=E... 190 1e-47 UniRef50_C4X9I2 Putative uncharacterized protein n=4 Tax=Klebsie... 188 6e-47 UniRef50_A9MS31 Putative uncharacterized protein n=1 Tax=Salmone... 184 6e-46 UniRef50_UPI00018265F6 hypothetical protein EcanA3_08239 n=1 Tax... 168 4e-41 UniRef50_A7KG42 Putative uncharacterized protein n=3 Tax=Enterob... 167 7e-41 Sequences not found previously or not previously below threshold: CONVERGED! >UniRef50_P46856 Putative uncharacterized protein yrhA n=34 Tax=Enterobacteriaceae RepID=YRHA_ECOLI Length = 138 Score = 190 bits (483), Expect = 1e-47, Method: Composition-based stats. Identities = 138/138 (100%), Positives = 138/138 (100%) Query: 1 MMTKTQINKLIKMMNDLDYPFEAPLKESFIESIIQIEFNSNSTNCLEKLCNEVSILFKNQ 60 MMTKTQINKLIKMMNDLDYPFEAPLKESFIESIIQIEFNSNSTNCLEKLCNEVSILFKNQ Sbjct: 1 MMTKTQINKLIKMMNDLDYPFEAPLKESFIESIIQIEFNSNSTNCLEKLCNEVSILFKNQ 60 Query: 61 PDYLTFLRAMDGFEVNGLRLFSLSIPEPSVKNLFAVNEFYRNNDDFINPDLQERLVIGDY 120 PDYLTFLRAMDGFEVNGLRLFSLSIPEPSVKNLFAVNEFYRNNDDFINPDLQERLVIGDY Sbjct: 61 PDYLTFLRAMDGFEVNGLRLFSLSIPEPSVKNLFAVNEFYRNNDDFINPDLQERLVIGDY 120 Query: 121 SISIFTYDIKGDAANLLI 138 SISIFTYDIKGDAANLLI Sbjct: 121 SISIFTYDIKGDAANLLI 138 >UniRef50_C4X9I2 Putative uncharacterized protein n=4 Tax=Klebsiella RepID=C4X9I2_KLEPN Length = 227 Score = 188 bits (477), Expect = 6e-47, Method: Composition-based stats. Identities = 34/129 (26%), Positives = 62/129 (48%), Gaps = 3/129 (2%) Query: 7 INKLIKMMNDLDYPFEAPLKESFIESIIQIEFNSNSTNCLEKLCNEVSILFKNQPDYLTF 66 I L +M + +YP P+ + ++ + ++ ++ L ++PDY Sbjct: 67 IRNLKSLMVEGNYPVADPVDPDVMADLMHKVAPVEWEDKKRRVYEDIQRLLISRPDYADM 126 Query: 67 LRAMDGFEVNGLRLFSLSIP---EPSVKNLFAVNEFYRNNDDFINPDLQERLVIGDYSIS 123 MDG E NG+ L+SL EP+ N+F N R N+ +++P+L ++VIG+ +S Sbjct: 127 FNIMDGGEYNGMTLYSLVQAENGEPAWSNIFVRNFDTRINEIYVDPNLIGKVVIGEEGMS 186 Query: 124 IFTYDIKGD 132 + Y D Sbjct: 187 VIVYSFTDD 195 >UniRef50_A9MS31 Putative uncharacterized protein n=1 Tax=Salmonella enterica subsp. arizonae serovar 62:z4,z23:-- RepID=A9MS31_SALAR Length = 228 Score = 184 bits (468), Expect = 6e-46, Method: Composition-based stats. Identities = 34/126 (26%), Positives = 63/126 (50%), Gaps = 3/126 (2%) Query: 10 LIKMMNDLDYPFEAPLKESFIESIIQIEFNSNSTNCLEKLCNEVSILFKNQPDYLTFLRA 69 L ++ D +YP P+ + +++ + K+ ++ L ++ DY Sbjct: 70 LKTLLVDNNYPVANPVDPDVMADLMRKAAPAEWDAQKRKVFEDIQRLMIHRLDYSDMFEI 129 Query: 70 MDGFEVNGLRLFSLSIP---EPSVKNLFAVNEFYRNNDDFINPDLQERLVIGDYSISIFT 126 MDG E NGL L+SL EP N++ N R+ND +++P+L ++++IG+ +S+F Sbjct: 130 MDGLEYNGLTLYSLVQAENDEPVWSNIYIRNFETRDNDIYVDPNLSDKVLIGEDGMSVFA 189 Query: 127 YDIKGD 132 Y D Sbjct: 190 YSFNDD 195 >UniRef50_UPI00018265F6 hypothetical protein EcanA3_08239 n=1 Tax=Enterobacter cancerogenus ATCC 35316 RepID=UPI00018265F6 Length = 166 Score = 168 bits (426), Expect = 4e-41, Method: Composition-based stats. Identities = 74/124 (59%), Positives = 95/124 (76%) Query: 7 INKLIKMMNDLDYPFEAPLKESFIESIIQIEFNSNSTNCLEKLCNEVSILFKNQPDYLTF 66 +N L M + +YP E P+ + +IESI + + S+ + LEKL +EV +LF NQPDY+TF Sbjct: 9 LNDLKSKMTNWNYPIEEPITDQYIESITRRKSTSSLSPMLEKLGDEVCLLFVNQPDYVTF 68 Query: 67 LRAMDGFEVNGLRLFSLSIPEPSVKNLFAVNEFYRNNDDFINPDLQERLVIGDYSISIFT 126 L MDGFE +GL LFSLSIPEP V+NLF +N+FYRNN+D+INPDL ERLVIG+ SIS+FT Sbjct: 69 LETMDGFEFDGLILFSLSIPEPLVRNLFVMNDFYRNNNDYINPDLSERLVIGNDSISVFT 128 Query: 127 YDIK 130 YD K Sbjct: 129 YDSK 132 >UniRef50_A7KG42 Putative uncharacterized protein n=3 Tax=Enterobacteriaceae RepID=A7KG42_KLEPN Length = 168 Score = 167 bits (424), Expect = 7e-41, Method: Composition-based stats. Identities = 33/129 (25%), Positives = 64/129 (49%), Gaps = 3/129 (2%) Query: 7 INKLIKMMNDLDYPFEAPLKESFIESIIQIEFNSNSTNCLEKLCNEVSILFKNQPDYLTF 66 ++ L ++M YP +P+ ++ +I + + ++ L K + DY Sbjct: 8 LSSLKRLMVADHYPLASPVAPEVLKDLICNPPPVEWADHKKSAYIDIQKLIKTRLDYAQV 67 Query: 67 LRAMDGFEVNGLRLFSLSIPEPS---VKNLFAVNEFYRNNDDFINPDLQERLVIGDYSIS 123 AMDGFE NGL ++L E N++ N R+N+ +++P+L ++++IG+ +S Sbjct: 68 FNAMDGFEYNGLTFYNLVQAENENLLWSNIYIRNFEARDNEIYVDPNLTDKVLIGEDGMS 127 Query: 124 IFTYDIKGD 132 +F Y D Sbjct: 128 LFAYSFADD 136 Database: uniref50.fasta Posted date: Mar 8, 2010 10:38 AM Number of letters in database: 1,040,396,356 Number of sequences in database: 3,077,464 Lambda K H 0.313 0.136 0.362 Lambda K H 0.267 0.0418 0.140 Matrix: BLOSUM62 Gap Penalties: Existence: 11, Extension: 1 Number of Hits to DB: 499,907,448 Number of Sequences: 3077464 Number of extensions: 18357652 Number of successful extensions: 50625 Number of sequences better than 1.0e-01: 6 Number of HSP's better than 0.1 without gapping: 8 Number of HSP's successfully gapped in prelim test: 3 Number of HSP's that attempted gapping in prelim test: 50610 Number of HSP's gapped (non-prelim): 11 length of query: 138 length of database: 1,040,396,356 effective HSP length: 102 effective length of query: 36 effective length of database: 726,495,028 effective search space: 26153821008 effective search space used: 26153821008 T: 11 A: 40 X1: 16 ( 7.2 bits) X2: 38 (14.6 bits) X3: 64 (24.7 bits) S1: 41 (21.4 bits) S2: 87 (38.1 bits)