BLASTP 2.2.22 [Sep-27-2009] Reference: Altschul, Stephen F., Thomas L. Madden, Alejandro A. Schaffer, Jinghui Zhang, Zheng Zhang, Webb Miller, and David J. Lipman (1997), "Gapped BLAST and PSI-BLAST: a new generation of protein database search programs", Nucleic Acids Res. 25:3389-3402. Reference for compositional score matrix adjustment: Altschul, Stephen F., John C. Wootton, E. Michael Gertz, Richa Agarwala, Aleksandr Morgulis, Alejandro A. Schaffer, and Yi-Kuo Yu (2005) "Protein database searches using compositionally adjusted substitution matrices", FEBS J. 272:5101-5109. Reference for composition-based statistics starting in round 2: Schaffer, Alejandro A., L. Aravind, Thomas L. Madden, Sergei Shavirin, John L. Spouge, Yuri I. Wolf, Eugene V. Koonin, and Stephen F. Altschul (2001), "Improving the accuracy of PSI-BLAST protein database searches with composition-based statistics and other refinements", Nucleic Acids Res. 29:2994-3005. Query= batch____ (173 letters) Database: uniref50.fasta 3,077,464 sequences; 1,040,396,356 total letters Searching..................................................done Results from round 1 Score E Sequences producing significant alignments: (bits) Value UniRef50_P09129 Protein traS n=2 Tax=Escherichia coli RepID=TRAS... 340 8e-93 UniRef50_D2U4N7 Conjugal transfer entry exclusion protein n=1 Ta... 86 4e-16 UniRef50_C8T3Z2 Putative uncharacterized protein n=1 Tax=Klebsie... 86 5e-16 UniRef50_A6TII8 Surface exclusion n=1 Tax=Klebsiella pneumoniae ... 69 4e-11 UniRef50_B1P7R2 TraS n=10 Tax=Enterobacteriaceae RepID=B1P7R2_ECOLX 54 2e-06 UniRef50_A4SUE2 TraS protein n=1 Tax=Aeromonas salmonicida subsp... 48 1e-04 >UniRef50_P09129 Protein traS n=2 Tax=Escherichia coli RepID=TRAS1_ECOLI Length = 173 Score = 340 bits (873), Expect = 8e-93, Method: Compositional matrix adjust. Identities = 173/173 (100%), Positives = 173/173 (100%) Query: 1 MITQQIISSELEVLKKHIDSGDIRIPSLWQGLKPGLIIMGWMIFCPLLMSFLITQKTSET 60 MITQQIISSELEVLKKHIDSGDIRIPSLWQGLKPGLIIMGWMIFCPLLMSFLITQKTSET Sbjct: 1 MITQQIISSELEVLKKHIDSGDIRIPSLWQGLKPGLIIMGWMIFCPLLMSFLITQKTSET 60 Query: 61 LTAVLAGGWLGLIILFIVARIRMLYFSLPEEFLKTSSVMRVISSKLKVYFIVYMGVIFLW 120 LTAVLAGGWLGLIILFIVARIRMLYFSLPEEFLKTSSVMRVISSKLKVYFIVYMGVIFLW Sbjct: 61 LTAVLAGGWLGLIILFIVARIRMLYFSLPEEFLKTSSVMRVISSKLKVYFIVYMGVIFLW 120 Query: 121 SFLGGGIIYGFGAILVTVIMAFLIQLDIGRYQFVGVIDAINSYVKNKKLSRVK 173 SFLGGGIIYGFGAILVTVIMAFLIQLDIGRYQFVGVIDAINSYVKNKKLSRVK Sbjct: 121 SFLGGGIIYGFGAILVTVIMAFLIQLDIGRYQFVGVIDAINSYVKNKKLSRVK 173 >UniRef50_D2U4N7 Conjugal transfer entry exclusion protein n=1 Tax=Arsenophonus nasoniae RepID=D2U4N7_9ENTR Length = 175 Score = 86.3 bits (212), Expect = 4e-16, Method: Compositional matrix adjust. Identities = 57/174 (32%), Positives = 90/174 (51%), Gaps = 9/174 (5%) Query: 1 MITQQIISSELEVLKKHIDSGDIRIPSLWQGLKPGLIIMGWMIFCPLLMSFLITQKTSET 60 MIT +I+S E + LK+ + +G++ IP+ W+ +PGL MIF L +S K ET Sbjct: 1 MITHKILSKEAKELKEILKNGEMTIPTFWECCRPGLFYFVGMIFWVLFISNFTLNK--ET 58 Query: 61 LTAVLAGGWLGLIILFIVARIRMLYFSLPEEFLKTSSVMRVISSKLKVYFIVYMGVI--- 117 L ++L G++ +L VA R Y S+P +F + S +R+I KLK Y I+++ +I Sbjct: 59 LFSLLFTGFICFFVLLAVAGTRGNYLSIPSDFRQKSICLRMIRDKLKRYIIIHLSLIIAV 118 Query: 118 -FLWSFLGGGIIY--GFGAILVTVIMAFLIQLDIGRYQFVGVIDAINSYVKNKK 168 L I+ GF + + I A + +DI RYQF + I + K Sbjct: 119 GILVKLYTAMFIFFDGFHLVFIVCICA-IFSVDISRYQFSAFVAIIEQFKDQHK 171 >UniRef50_C8T3Z2 Putative uncharacterized protein n=1 Tax=Klebsiella pneumoniae subsp. rhinoscleromatis ATCC 13884 RepID=C8T3Z2_KLEPR Length = 176 Score = 85.9 bits (211), Expect = 5e-16, Method: Compositional matrix adjust. Identities = 55/172 (31%), Positives = 86/172 (50%), Gaps = 5/172 (2%) Query: 1 MITQQIISSELEVLKKHIDSGDIRIPSLWQGLKPGLIIMGWMIFCPLLMSFLITQKTSET 60 MIT +I+ E+E LK + G IPS W+ L PG++++ W I C L+ Sbjct: 2 MITNKILIEEVEELKSLLSQGKAEIPSFWECLWPGVVLVAWNILCALISVGGGYLSVKFI 61 Query: 61 LTAVLAGGWLGLIILFIVARIRMLYFSLPEEFLKTSSVMRVISSKLKVYFIVYMGVI--- 117 ++ G + I+L VA R L+ S+P + S V R IS K+ VY +VYM +I Sbjct: 62 FWVLVFPGVISFIVLLGVASARSLFLSVPRTYRMRSQVYRFISRKVMVYALVYMALILAL 121 Query: 118 --FLWSFLGGGIIYGFGAILVTVIMAFLIQLDIGRYQFVGVIDAINSYVKNK 167 F + + F ++ T+ + F++ +D GRYQ + AINS+ K Sbjct: 122 ALFNRVYYASPFPFSFMILIATIFVWFVMNIDFGRYQLSVLSSAINSFRAEK 173 >UniRef50_A6TII8 Surface exclusion n=1 Tax=Klebsiella pneumoniae subsp. pneumoniae MGH 78578 RepID=A6TII8_KLEP7 Length = 182 Score = 69.3 bits (168), Expect = 4e-11, Method: Compositional matrix adjust. Identities = 51/183 (27%), Positives = 87/183 (47%), Gaps = 18/183 (9%) Query: 1 MITQQIISSELEVLKKHIDSGDIRIPSLWQGLKPGLIIMGWMIFCPLLMS----FLITQK 56 MIT + I +E+E L + G+ IPS+W PGL M W++ P+ M F + Sbjct: 1 MITNKQILTEVEQLSRLFKDGEADIPSMWNCFWPGLAAMFWLLLWPIAMFSYKLFFVELP 60 Query: 57 TSETL---TAVLAGGWLGLIILFIVARIRMLYFSLPEEFLKTSSVMRVISSKLKVYFIVY 113 + + +VL GL ++ + R LY S+P+ + K S++ + I K K Y VY Sbjct: 61 NAAQMGIAASVLVAFVAGLFMMLYIVNFRALYLSIPDTYRKQSALCQFIVGKAKFYCRVY 120 Query: 114 MGVIFLWSFLGGGIIYG-FGAILVTVI-MAF------LIQLDIGRYQFVGVIDAINSYVK 165 + + FL Y +GAI TV+ + F ++ +D+ RYQ + + ++ Sbjct: 121 L---LTYVFLIISCTYPVYGAIYSTVLFICFSFCLMQIVNVDLSRYQLTALTSLLETFKT 177 Query: 166 NKK 168 +K Sbjct: 178 SKS 180 >UniRef50_B1P7R2 TraS n=10 Tax=Enterobacteriaceae RepID=B1P7R2_ECOLX Length = 169 Score = 54.3 bits (129), Expect = 2e-06, Method: Compositional matrix adjust. Identities = 44/167 (26%), Positives = 76/167 (45%), Gaps = 2/167 (1%) Query: 1 MITQQIISSELEVLKKHIDSGDIRIPSLWQGLKPGLIIMGWMIFCPLLMSFLITQ-KTSE 59 MIT I E + LK + + IP L Q +KPG + W++ P + L + + Sbjct: 1 MITHYDIKMETQKLKDVLSVEGVNIPPLLQVIKPGGYVFLWVLLWPTFLRLLADKVDIRD 60 Query: 60 TLTAVLAGGWLGLIILFIVARIRMLYFSLPEEFLKTSSVMRVISSKLKVYFIVYMGVIFL 119 + G +G I+ + + MLY ++PE+F S V+ + K K Y + ++ + Sbjct: 61 AGFDICFSGVMGFILFVAITNVMMLYLAIPEKFRDESKVISFMYDKNKNYILSFLIAFSM 120 Query: 120 WSFLGGGIIYGFGAILVTVIMAFLIQLDIGRYQFVGVIDAINSYVKN 166 SF ++Y F I + +I F+ +DI RY + I + K Sbjct: 121 VSF-SHTLLYEFLLIALFIIFFFIYAIDINRYNLSAIASVIGLFKKE 166 >UniRef50_A4SUE2 TraS protein n=1 Tax=Aeromonas salmonicida subsp. salmonicida A449 RepID=A4SUE2_AERS4 Length = 189 Score = 48.1 bits (113), Expect = 1e-04, Method: Compositional matrix adjust. Identities = 45/183 (24%), Positives = 85/183 (46%), Gaps = 17/183 (9%) Query: 1 MITQQIISSELEVLKKHIDSGDIRIPSLWQGLKPGLIIMGWMIFCPLLMSFLITQKTSET 60 MIT++ I+ ++ + + ++ IPS + PG+I++ M+ + ++F I S Sbjct: 2 MITKKAINDDIAFIVATLKHKEVTIPSFFSCAGPGIIVLLGMVIWQIYVAFPILHHNSMA 61 Query: 61 LTAVLA--GG--WLGLIILFIVARIRMLYFSLPEEFLKTSSVMRVISSKLKVYFIVYMGV 116 T GG +LG +I + +R Y S+P E K S ++R+ +K K F Y+G+ Sbjct: 62 RTDAYGSVGGTLFLGFMIFITLTSMRGKYLSMPVEIRKNSLIVRL--TKHKAMF--YVGL 117 Query: 117 IFLWSFLGG---------GIIYGFGAILVTVIMAFLIQLDIGRYQFVGVIDAINSYVKNK 167 + + +GG +I ++ V + F+ +D+GRY I ++ Sbjct: 118 WVVINIVGGVCIKMSALSPVISCAVQLVALVFIWFIAFVDLGRYDLAVFSSVIKAWRDGD 177 Query: 168 KLS 170 KL Sbjct: 178 KLD 180 Searching..................................................done Results from round 2 Score E Sequences producing significant alignments: (bits) Value Sequences used in model and found again: UniRef50_P09129 Protein traS n=2 Tax=Escherichia coli RepID=TRAS... 167 2e-40 UniRef50_C8T3Z2 Putative uncharacterized protein n=1 Tax=Klebsie... 152 5e-36 UniRef50_D2U4N7 Conjugal transfer entry exclusion protein n=1 Ta... 147 2e-34 UniRef50_A4SUE2 TraS protein n=1 Tax=Aeromonas salmonicida subsp... 142 4e-33 UniRef50_B1P7R2 TraS n=10 Tax=Enterobacteriaceae RepID=B1P7R2_ECOLX 136 3e-31 UniRef50_A6TII8 Surface exclusion n=1 Tax=Klebsiella pneumoniae ... 134 1e-30 Sequences not found previously or not previously below threshold: CONVERGED! >UniRef50_P09129 Protein traS n=2 Tax=Escherichia coli RepID=TRAS1_ECOLI Length = 173 Score = 167 bits (422), Expect = 2e-40, Method: Composition-based stats. Identities = 173/173 (100%), Positives = 173/173 (100%) Query: 1 MITQQIISSELEVLKKHIDSGDIRIPSLWQGLKPGLIIMGWMIFCPLLMSFLITQKTSET 60 MITQQIISSELEVLKKHIDSGDIRIPSLWQGLKPGLIIMGWMIFCPLLMSFLITQKTSET Sbjct: 1 MITQQIISSELEVLKKHIDSGDIRIPSLWQGLKPGLIIMGWMIFCPLLMSFLITQKTSET 60 Query: 61 LTAVLAGGWLGLIILFIVARIRMLYFSLPEEFLKTSSVMRVISSKLKVYFIVYMGVIFLW 120 LTAVLAGGWLGLIILFIVARIRMLYFSLPEEFLKTSSVMRVISSKLKVYFIVYMGVIFLW Sbjct: 61 LTAVLAGGWLGLIILFIVARIRMLYFSLPEEFLKTSSVMRVISSKLKVYFIVYMGVIFLW 120 Query: 121 SFLGGGIIYGFGAILVTVIMAFLIQLDIGRYQFVGVIDAINSYVKNKKLSRVK 173 SFLGGGIIYGFGAILVTVIMAFLIQLDIGRYQFVGVIDAINSYVKNKKLSRVK Sbjct: 121 SFLGGGIIYGFGAILVTVIMAFLIQLDIGRYQFVGVIDAINSYVKNKKLSRVK 173 >UniRef50_C8T3Z2 Putative uncharacterized protein n=1 Tax=Klebsiella pneumoniae subsp. rhinoscleromatis ATCC 13884 RepID=C8T3Z2_KLEPR Length = 176 Score = 152 bits (384), Expect = 5e-36, Method: Composition-based stats. Identities = 55/172 (31%), Positives = 86/172 (50%), Gaps = 5/172 (2%) Query: 1 MITQQIISSELEVLKKHIDSGDIRIPSLWQGLKPGLIIMGWMIFCPLLMSFLITQKTSET 60 MIT +I+ E+E LK + G IPS W+ L PG++++ W I C L+ Sbjct: 2 MITNKILIEEVEELKSLLSQGKAEIPSFWECLWPGVVLVAWNILCALISVGGGYLSVKFI 61 Query: 61 LTAVLAGGWLGLIILFIVARIRMLYFSLPEEFLKTSSVMRVISSKLKVYFIVYMGVI--- 117 ++ G + I+L VA R L+ S+P + S V R IS K+ VY +VYM +I Sbjct: 62 FWVLVFPGVISFIVLLGVASARSLFLSVPRTYRMRSQVYRFISRKVMVYALVYMALILAL 121 Query: 118 --FLWSFLGGGIIYGFGAILVTVIMAFLIQLDIGRYQFVGVIDAINSYVKNK 167 F + + F ++ T+ + F++ +D GRYQ + AINS+ K Sbjct: 122 ALFNRVYYASPFPFSFMILIATIFVWFVMNIDFGRYQLSVLSSAINSFRAEK 173 >UniRef50_D2U4N7 Conjugal transfer entry exclusion protein n=1 Tax=Arsenophonus nasoniae RepID=D2U4N7_9ENTR Length = 175 Score = 147 bits (370), Expect = 2e-34, Method: Composition-based stats. Identities = 56/174 (32%), Positives = 89/174 (51%), Gaps = 9/174 (5%) Query: 1 MITQQIISSELEVLKKHIDSGDIRIPSLWQGLKPGLIIMGWMIFCPLLMSFLITQKTSET 60 MIT +I+S E + LK+ + +G++ IP+ W+ +PGL MIF L +S ET Sbjct: 1 MITHKILSKEAKELKEILKNGEMTIPTFWECCRPGLFYFVGMIFWVLFISNFTL--NKET 58 Query: 61 LTAVLAGGWLGLIILFIVARIRMLYFSLPEEFLKTSSVMRVISSKLKVYFIVYMGVI--- 117 L ++L G++ +L VA R Y S+P +F + S +R+I KLK Y I+++ +I Sbjct: 59 LFSLLFTGFICFFVLLAVAGTRGNYLSIPSDFRQKSICLRMIRDKLKRYIIIHLSLIIAV 118 Query: 118 -FLWSFLGGGIIY--GFGAILVTVIMAFLIQLDIGRYQFVGVIDAINSYVKNKK 168 L I+ GF + + I A + +DI RYQF + I + K Sbjct: 119 GILVKLYTAMFIFFDGFHLVFIVCICA-IFSVDISRYQFSAFVAIIEQFKDQHK 171 >UniRef50_A4SUE2 TraS protein n=1 Tax=Aeromonas salmonicida subsp. salmonicida A449 RepID=A4SUE2_AERS4 Length = 189 Score = 142 bits (358), Expect = 4e-33, Method: Composition-based stats. Identities = 43/183 (23%), Positives = 83/183 (45%), Gaps = 17/183 (9%) Query: 1 MITQQIISSELEVLKKHIDSGDIRIPSLWQGLKPGLIIMGWMIFCPLLMSFLITQKTSET 60 MIT++ I+ ++ + + ++ IPS + PG+I++ M+ + ++F I S Sbjct: 2 MITKKAINDDIAFIVATLKHKEVTIPSFFSCAGPGIIVLLGMVIWQIYVAFPILHHNSMA 61 Query: 61 LTAVLAG----GWLGLIILFIVARIRMLYFSLPEEFLKTSSVMRVISSKLKVYFIVYMGV 116 T +LG +I + +R Y S+P E K S ++R+ +K K F Y+G+ Sbjct: 62 RTDAYGSVGGTLFLGFMIFITLTSMRGKYLSMPVEIRKNSLIVRL--TKHKAMF--YVGL 117 Query: 117 IFLWSFLGG---------GIIYGFGAILVTVIMAFLIQLDIGRYQFVGVIDAINSYVKNK 167 + + +GG +I ++ V + F+ +D+GRY I ++ Sbjct: 118 WVVINIVGGVCIKMSALSPVISCAVQLVALVFIWFIAFVDLGRYDLAVFSSVIKAWRDGD 177 Query: 168 KLS 170 KL Sbjct: 178 KLD 180 >UniRef50_B1P7R2 TraS n=10 Tax=Enterobacteriaceae RepID=B1P7R2_ECOLX Length = 169 Score = 136 bits (343), Expect = 3e-31, Method: Composition-based stats. Identities = 44/167 (26%), Positives = 76/167 (45%), Gaps = 2/167 (1%) Query: 1 MITQQIISSELEVLKKHIDSGDIRIPSLWQGLKPGLIIMGWMIFCPLLMSFLITQ-KTSE 59 MIT I E + LK + + IP L Q +KPG + W++ P + L + + Sbjct: 1 MITHYDIKMETQKLKDVLSVEGVNIPPLLQVIKPGGYVFLWVLLWPTFLRLLADKVDIRD 60 Query: 60 TLTAVLAGGWLGLIILFIVARIRMLYFSLPEEFLKTSSVMRVISSKLKVYFIVYMGVIFL 119 + G +G I+ + + MLY ++PE+F S V+ + K K Y + ++ + Sbjct: 61 AGFDICFSGVMGFILFVAITNVMMLYLAIPEKFRDESKVISFMYDKNKNYILSFLIAFSM 120 Query: 120 WSFLGGGIIYGFGAILVTVIMAFLIQLDIGRYQFVGVIDAINSYVKN 166 SF ++Y F I + +I F+ +DI RY + I + K Sbjct: 121 VSF-SHTLLYEFLLIALFIIFFFIYAIDINRYNLSAIASVIGLFKKE 166 >UniRef50_A6TII8 Surface exclusion n=1 Tax=Klebsiella pneumoniae subsp. pneumoniae MGH 78578 RepID=A6TII8_KLEP7 Length = 182 Score = 134 bits (337), Expect = 1e-30, Method: Composition-based stats. Identities = 51/183 (27%), Positives = 86/183 (46%), Gaps = 18/183 (9%) Query: 1 MITQQIISSELEVLKKHIDSGDIRIPSLWQGLKPGLIIMGWMIFCPLLMS----FLITQK 56 MIT + I +E+E L + G+ IPS+W PGL M W++ P+ M F + Sbjct: 1 MITNKQILTEVEQLSRLFKDGEADIPSMWNCFWPGLAAMFWLLLWPIAMFSYKLFFVELP 60 Query: 57 TSETL---TAVLAGGWLGLIILFIVARIRMLYFSLPEEFLKTSSVMRVISSKLKVYFIVY 113 + + +VL GL ++ + R LY S+P+ + K S++ + I K K Y VY Sbjct: 61 NAAQMGIAASVLVAFVAGLFMMLYIVNFRALYLSIPDTYRKQSALCQFIVGKAKFYCRVY 120 Query: 114 MGVIFLWSFLGGGIIYG-FGAILVTV-IMAF------LIQLDIGRYQFVGVIDAINSYVK 165 + + FL Y +GAI TV + F ++ +D+ RYQ + + ++ Sbjct: 121 L---LTYVFLIISCTYPVYGAIYSTVLFICFSFCLMQIVNVDLSRYQLTALTSLLETFKT 177 Query: 166 NKK 168 +K Sbjct: 178 SKS 180 Database: uniref50.fasta Posted date: Mar 8, 2010 10:38 AM Number of letters in database: 1,040,396,356 Number of sequences in database: 3,077,464 Lambda K H 0.330 0.143 0.380 Lambda K H 0.267 0.0428 0.140 Matrix: BLOSUM62 Gap Penalties: Existence: 11, Extension: 1 Number of Hits to DB: 595,352,056 Number of Sequences: 3077464 Number of extensions: 19655603 Number of successful extensions: 122230 Number of sequences better than 1.0e-01: 40 Number of HSP's better than 0.1 without gapping: 14 Number of HSP's successfully gapped in prelim test: 33 Number of HSP's that attempted gapping in prelim test: 122163 Number of HSP's gapped (non-prelim): 72 length of query: 173 length of database: 1,040,396,356 effective HSP length: 120 effective length of query: 53 effective length of database: 671,100,676 effective search space: 35568335828 effective search space used: 35568335828 T: 11 A: 40 X1: 16 ( 7.6 bits) X2: 38 (14.6 bits) X3: 64 (24.7 bits) S1: 40 (21.9 bits) S2: 88 (38.4 bits)