BLASTP 2.2.22 [Sep-27-2009]

Reference: Altschul, Stephen F., Thomas L. Madden, Alejandro A. Schaffer,
Jinghui Zhang, Zheng Zhang, Webb Miller, and David J. Lipman (1997),
"Gapped BLAST and PSI-BLAST: a new generation of protein database search
programs", Nucleic Acids Res. 25:3389-3402.

Reference for compositional score matrix adjustment: Altschul, Stephen F.,
John C. Wootton, E. Michael Gertz, Richa Agarwala, Aleksandr Morgulis,
Alejandro A. Schaffer, and Yi-Kuo Yu (2005) "Protein database searches
using compositionally adjusted substitution matrices", FEBS J. 272:5101-5109.

Reference for composition-based statistics starting in round 2:
Schaffer, Alejandro A., L. Aravind, Thomas L. Madden, Sergei Shavirin,
John L. Spouge, Yuri I. Wolf, Eugene V. Koonin, and Stephen F. Altschul
(2001), "Improving the accuracy of PSI-BLAST protein database searches
with composition-based statistics and other refinements", Nucleic Acids
Res. 29:2994-3005.

Query= batch____
         (121 letters)

Database: uniref50.fasta
           3,077,464 sequences; 1,040,396,356 total letters

Searching..................................................done

Results from round 1

                                                                   Score     E
Sequences producing significant alignments:                        (bits)  Value

UniRef50_B1VC86 Pilin n=55 Tax=root RepID=PIL2_ECOLX                  231  5e-60
UniRef50_A4TSY5 Conjugative transfer fimbrial subunit n=2 Tax=Ye...   129  3e-29
UniRef50_A4SUC7 TraA fimbrial protein n=1 Tax=Aeromonas salmonic...    52  7e-06
UniRef50_P12060 Pilin n=7 Tax=Enterobacteriaceae RepID=PIL1_SALTI      51  1e-05
UniRef50_B2VB05 TraA protein n=1 Tax=Erwinia tasmaniensis RepID=...    51  1e-05
UniRef50_D2U005 Conjugal transfer pilus assembly protein TraA n=...    47  3e-04
UniRef50_A4WGS0 Putative uncharacterized protein n=1 Tax=Enterob...    44  0.001
UniRef50_B6ICF0 Putative TraA protein n=1 Tax=Escherichia coli S...    42  0.006
UniRef50_C8QE96 Putative uncharacterized protein n=2 Tax=Pantoea...    38  0.082

>UniRef50_B1VC86 Pilin n=55 Tax=root RepID=PIL2_ECOLX
          Length = 121

 Score = 231 bits (589), Expect = 5e-60, Method: Compositional matrix adjust.
 Identities = 119/121 (98%), Positives = 120/121 (99%)

Query: 1   MNAVLSVQGASAPVKKKSFFSKFTRLNMLRLARAVIPAAVLMMFFPQLAMAAGSSGQDLM 60
           M+AVLSVQG SAPVKKKSFFSKFTRLNMLRLARAVIPAAVLMMFFPQLAMAAGSSGQDLM
Sbjct: 1   MDAVLSVQGVSAPVKKKSFFSKFTRLNMLRLARAVIPAAVLMMFFPQLAMAAGSSGQDLM 60

Query: 61  ASGNTTVKATFGKDSSVVKWVVLAEVLVGAVMYMMTKNVKFLAGFAIISVFIAVGMAVVG 120
           ASGNTTVKATFGKDSSVVKWVVLAEVLVGAVMYMMTKNVKFLAGFAIISVFIAVGMAVVG
Sbjct: 61  ASGNTTVKATFGKDSSVVKWVVLAEVLVGAVMYMMTKNVKFLAGFAIISVFIAVGMAVVG 120

Query: 121 L 121
           L
Sbjct: 121 L 121

>UniRef50_A4TSY5 Conjugative transfer fimbrial subunit n=2 Tax=Yersinia pestis RepID=A4TSY5_YERPP
          Length = 116

 Score = 129 bits (324), Expect = 3e-29, Method: Compositional matrix adjust.
 Identities = 69/110 (62%), Positives = 85/110 (77%), Gaps = 6/110 (5%)

Query: 11  SAPVKKKSFFSKFTRLNMLRLARAVIPAAVLMMFFPQLAMAAGSSGQDLMASGNTTVKAT 70
           + P KKK  FSK   L  L+ A   +P A L  FFP+  +A+ ++G+DLM+SG+ TVK T
Sbjct: 12  ATPAKKK--FSKVALLKALKFA---LPVAALAAFFPETVLAS-TAGKDLMSSGDATVKGT 65

Query: 71  FGKDSSVVKWVVLAEVLVGAVMYMMTKNVKFLAGFAIISVFIAVGMAVVG 120
           FGKDSSVVKWV+LAEVLVGAVMYM TKN+KFLAGFAI+SVF+ VGM+V G
Sbjct: 66  FGKDSSVVKWVILAEVLVGAVMYMTTKNLKFLAGFAILSVFVTVGMSVAG 115

>UniRef50_A4SUC7 TraA fimbrial protein n=1 Tax=Aeromonas salmonicida subsp. salmonicida A449 RepID=A4SUC7_AERS4
          Length = 103

 Score = 51.6 bits (122), Expect = 7e-06, Method: Compositional matrix adjust.
Identities = 33/89 (37%), Positives = 51/89 (57%), Gaps = 10/89 (11%) Query: 20 FSKFTRLNMLRLAR--AVIPAAVL--MMFFPQLAMAAGSSGQDLMASGNTTVKATFGKDS 75 + T+LN L R A++ AVL ++ F Q + D++A + TV TFG +S Sbjct: 5 LQERTQLNPKHLLRVMALMAVAVLVTLLTFGQ------AHAVDMLAGQSGTVNDTFGANS 58 Query: 76 SVVKWVVLAEVLVGAVMYMMTKNVKFLAG 104 +V KW++LAEV++G Y+ TKN+ L G Sbjct: 59 TVAKWIILAEVIIGVASYIKTKNLLLLFG 87 >UniRef50_P12060 Pilin n=7 Tax=Enterobacteriaceae RepID=PIL1_SALTI Length = 119 Score = 50.8 bits (120), Expect = 1e-05, Method: Compositional matrix adjust. Identities = 29/66 (43%), Positives = 43/66 (65%) Query: 54 SSGQDLMASGNTTVKATFGKDSSVVKWVVLAEVLVGAVMYMMTKNVKFLAGFAIISVFIA 113 +S DL+A G VKATFG DS V+ +++AE++VG MY+ TKN+ L G ++ VF Sbjct: 53 ASATDLLAGGKDDVKATFGADSFVMMCIIIAELIVGVAMYIRTKNLLILLGLVVVIVFTT 112 Query: 114 VGMAVV 119 VG+ + Sbjct: 113 VGLTFI 118 >UniRef50_B2VB05 TraA protein n=1 Tax=Erwinia tasmaniensis RepID=B2VB05_ERWT9 Length = 105 Score = 50.8 bits (120), Expect = 1e-05, Method: Compositional matrix adjust. Identities = 28/99 (28%), Positives = 53/99 (53%), Gaps = 3/99 (3%) Query: 22 KFTRLNMLRLARAVIPAAVLMMFFPQLAMAAGSSGQDLMASGNTTVKATFGKDSSVVKWV 81 F+R N +L+ ++ AAV + P++A +A SS DL+ + T+ +TFG SS+ K+ Sbjct: 10 NFSRANKKKLS--IVMAAVTVCMLPEIAFSADSS-TDLLQAQQATINSTFGHGSSLEKYF 66 Query: 82 VLAEVLVGAVMYMMTKNVKFLAGFAIISVFIAVGMAVVG 120 +AEV + + Y + G ++ +F + ++G Sbjct: 67 YIAEVFMSLIAYFRARTPMVFIGLIMVIIFTRIAFGIIG 105 >UniRef50_D2U005 Conjugal transfer pilus assembly protein TraA n=1 Tax=Arsenophonus nasoniae RepID=D2U005_9ENTR Length = 117 Score = 46.6 bits (109), Expect = 3e-04, Method: Compositional matrix adjust. 
Identities = 28/75 (37%), Positives = 42/75 (56%), Gaps = 1/75 (1%) Query: 41 LMMFFPQLAMAAGS-SGQDLMASGNTTVKATFGKDSSVVKWVVLAEVLVGAVMYMMTKNV 99 L+MF P + + A + DL + G VK+TFGK S+VV + + E+L Y+ TKN+ Sbjct: 38 LLMFIPAMQVYADPPTTGDLFSGGKEVVKSTFGKGSTVVWILYVLEILAAIFAYVKTKNL 97 Query: 100 KFLAGFAIISVFIAV 114 G A + VF+ V Sbjct: 98 AVFGGIAAVIVFVNV 112 >UniRef50_A4WGS0 Putative uncharacterized protein n=1 Tax=Enterobacter sp. 638 RepID=A4WGS0_ENT38 Length = 117 Score = 44.3 bits (103), Expect = 0.001, Method: Compositional matrix adjust. Identities = 25/78 (32%), Positives = 42/78 (53%), Gaps = 5/78 (6%) Query: 41 LMMFFPQLAMAAGSSGQDLMASGNTTVKATFGKDSSVVKWVVLAEVLVGAVMYMMTKNVK 100 LM+ P LA +G DL+A TV +TFG SS++KW +AE+++G +Y+ ++ Sbjct: 43 LMVAHPVLA-----AGTDLLAPQQATVNSTFGSGSSLIKWFYIAEIVMGLFIYIKARSPL 97 Query: 101 FLAGFAIISVFIAVGMAV 118 G +F V ++ Sbjct: 98 VFVGIVGCIIFTRVAFSI 115 >UniRef50_B6ICF0 Putative TraA protein n=1 Tax=Escherichia coli SE11 RepID=B6ICF0_ECOSE Length = 123 Score = 42.0 bits (97), Expect = 0.006, Method: Compositional matrix adjust. Identities = 22/61 (36%), Positives = 38/61 (62%), Gaps = 3/61 (4%) Query: 54 SSGQDLMASGNTTVKATFGKDSSVVKWVVLAEVLVGAVMYMMTKNVKFLAGFAIISVFIA 113 + DL ++G V+ATFG+DS+++ + ++AEV + ++Y+ N A F +I VFI Sbjct: 56 AQATDLASAGKADVEATFGEDSTMMYYFMIAEVFLVFMIYLRNHNP---ATFVLIPVFIV 112 Query: 114 V 114 V Sbjct: 113 V 113 >UniRef50_C8QE96 Putative uncharacterized protein n=2 Tax=Pantoea sp. At-9b RepID=C8QE96_9ENTR Length = 130 Score = 38.1 bits (87), Expect = 0.082, Method: Compositional matrix adjust. 
Identities = 25/87 (28%), Positives = 41/87 (47%), Gaps = 5/87 (5%) Query: 18 SFFSKFTRLNMLRLARAVIPAAVLMMFFPQLAMAAGSSGQDLMASGNTTVKATFGKDSSV 77 + + F+ L R+ +I AVL P L A +DL++S K TFG S++ Sbjct: 32 TLYRSFSLLWRHRVNVLLIALAVLFFALPHLVRA-----EDLLSSQKQDAKDTFGHGSTI 86 Query: 78 VKWVVLAEVLVGAVMYMMTKNVKFLAG 104 + +AEV++ Y+ T+N G Sbjct: 87 EWGLYIAEVIISIGAYIKTRNPMLFVG 113 Searching..................................................done Results from round 2 Score E Sequences producing significant alignments: (bits) Value Sequences used in model and found again: UniRef50_B1VC86 Pilin n=55 Tax=root RepID=PIL2_ECOLX 123 2e-27 UniRef50_B2VB05 TraA protein n=1 Tax=Erwinia tasmaniensis RepID=... 92 6e-18 UniRef50_A4TSY5 Conjugative transfer fimbrial subunit n=2 Tax=Ye... 82 4e-15 UniRef50_A4WGS0 Putative uncharacterized protein n=1 Tax=Enterob... 79 5e-14 UniRef50_D2U005 Conjugal transfer pilus assembly protein TraA n=... 74 1e-12 UniRef50_A4SUC7 TraA fimbrial protein n=1 Tax=Aeromonas salmonic... 70 2e-11 UniRef50_P12060 Pilin n=7 Tax=Enterobacteriaceae RepID=PIL1_SALTI 67 2e-10 Sequences not found previously or not previously below threshold: UniRef50_C8QE96 Putative uncharacterized protein n=2 Tax=Pantoea... 65 5e-10 UniRef50_B6ICF0 Putative TraA protein n=1 Tax=Escherichia coli S... 48 8e-05 UniRef50_D1RMP4 Type IV conjugative transfer system pilin TraA-l... 48 1e-04 UniRef50_A5KZF1 Putative uncharacterized protein n=1 Tax=Vibrion... 39 0.055 >UniRef50_B1VC86 Pilin n=55 Tax=root RepID=PIL2_ECOLX Length = 121 Score = 123 bits (309), Expect = 2e-27, Method: Composition-based stats. 
Identities = 119/121 (98%), Positives = 120/121 (99%) Query: 1 MNAVLSVQGASAPVKKKSFFSKFTRLNMLRLARAVIPAAVLMMFFPQLAMAAGSSGQDLM 60 M+AVLSVQG SAPVKKKSFFSKFTRLNMLRLARAVIPAAVLMMFFPQLAMAAGSSGQDLM Sbjct: 1 MDAVLSVQGVSAPVKKKSFFSKFTRLNMLRLARAVIPAAVLMMFFPQLAMAAGSSGQDLM 60 Query: 61 ASGNTTVKATFGKDSSVVKWVVLAEVLVGAVMYMMTKNVKFLAGFAIISVFIAVGMAVVG 120 ASGNTTVKATFGKDSSVVKWVVLAEVLVGAVMYMMTKNVKFLAGFAIISVFIAVGMAVVG Sbjct: 61 ASGNTTVKATFGKDSSVVKWVVLAEVLVGAVMYMMTKNVKFLAGFAIISVFIAVGMAVVG 120 Query: 121 L 121 L Sbjct: 121 L 121 >UniRef50_B2VB05 TraA protein n=1 Tax=Erwinia tasmaniensis RepID=B2VB05_ERWT9 Length = 105 Score = 91.7 bits (226), Expect = 6e-18, Method: Composition-based stats. Identities = 28/99 (28%), Positives = 53/99 (53%), Gaps = 3/99 (3%) Query: 22 KFTRLNMLRLARAVIPAAVLMMFFPQLAMAAGSSGQDLMASGNTTVKATFGKDSSVVKWV 81 F+R N +L+ ++ AAV + P++A +A SS DL+ + T+ +TFG SS+ K+ Sbjct: 10 NFSRANKKKLS--IVMAAVTVCMLPEIAFSADSS-TDLLQAQQATINSTFGHGSSLEKYF 66 Query: 82 VLAEVLVGAVMYMMTKNVKFLAGFAIISVFIAVGMAVVG 120 +AEV + + Y + G ++ +F + ++G Sbjct: 67 YIAEVFMSLIAYFRARTPMVFIGLIMVIIFTRIAFGIIG 105 >UniRef50_A4TSY5 Conjugative transfer fimbrial subunit n=2 Tax=Yersinia pestis RepID=A4TSY5_YERPP Length = 116 Score = 82.4 bits (202), Expect = 4e-15, Method: Composition-based stats. Identities = 69/110 (62%), Positives = 85/110 (77%), Gaps = 6/110 (5%) Query: 11 SAPVKKKSFFSKFTRLNMLRLARAVIPAAVLMMFFPQLAMAAGSSGQDLMASGNTTVKAT 70 + P KKK FSK L L+ A +P A L FFP+ +A+ ++G+DLM+SG+ TVK T Sbjct: 12 ATPAKKK--FSKVALLKALKFA---LPVAALAAFFPETVLAS-TAGKDLMSSGDATVKGT 65 Query: 71 FGKDSSVVKWVVLAEVLVGAVMYMMTKNVKFLAGFAIISVFIAVGMAVVG 120 FGKDSSVVKWV+LAEVLVGAVMYM TKN+KFLAGFAI+SVF+ VGM+V G Sbjct: 66 FGKDSSVVKWVILAEVLVGAVMYMTTKNLKFLAGFAILSVFVTVGMSVAG 115 >UniRef50_A4WGS0 Putative uncharacterized protein n=1 Tax=Enterobacter sp. 638 RepID=A4WGS0_ENT38 Length = 117 Score = 79.0 bits (193), Expect = 5e-14, Method: Composition-based stats. 
Identities = 30/118 (25%), Positives = 56/118 (47%), Gaps = 7/118 (5%) Query: 2 NAVLSVQGASAPVKKKSFFSKFTRLNMLRLARAVIPAAVLMMFFPQLAMAAGSSGQDLMA 61 NA+ V+G + ++ L+ ++ + LM+ P LA +G DL+A Sbjct: 6 NALNVVKGWAVASLFRTALENRQMKKFLKNGGLILIS--LMVAHPVLA-----AGTDLLA 58 Query: 62 SGNTTVKATFGKDSSVVKWVVLAEVLVGAVMYMMTKNVKFLAGFAIISVFIAVGMAVV 119 TV +TFG SS++KW +AE+++G +Y+ ++ G +F V ++ Sbjct: 59 PQQATVNSTFGSGSSLIKWFYIAEIVMGLFIYIKARSPLVFVGIVGCIIFTRVAFSIA 116 >UniRef50_D2U005 Conjugal transfer pilus assembly protein TraA n=1 Tax=Arsenophonus nasoniae RepID=D2U005_9ENTR Length = 117 Score = 74.3 bits (181), Expect = 1e-12, Method: Composition-based stats. Identities = 28/80 (35%), Positives = 44/80 (55%), Gaps = 1/80 (1%) Query: 41 LMMFFPQLAMAAGS-SGQDLMASGNTTVKATFGKDSSVVKWVVLAEVLVGAVMYMMTKNV 99 L+MF P + + A + DL + G VK+TFGK S+VV + + E+L Y+ TKN+ Sbjct: 38 LLMFIPAMQVYADPPTTGDLFSGGKEVVKSTFGKGSTVVWILYVLEILAAIFAYVKTKNL 97 Query: 100 KFLAGFAIISVFIAVGMAVV 119 G A + VF+ V ++ Sbjct: 98 AVFGGIAAVIVFVNVVFGLI 117 >UniRef50_A4SUC7 TraA fimbrial protein n=1 Tax=Aeromonas salmonicida subsp. salmonicida A449 RepID=A4SUC7_AERS4 Length = 103 Score = 70.1 bits (170), Expect = 2e-11, Method: Composition-based stats. Identities = 35/101 (34%), Positives = 54/101 (53%), Gaps = 2/101 (1%) Query: 20 FSKFTRLNMLRLARAVIPAAVLMMFFPQLAMAAGSSGQDLMASGNTTVKATFGKDSSVVK 79 + T+LN L R + AV ++ L + D++A + TV TFG +S+V K Sbjct: 5 LQERTQLNPKHLLRVMALMAVAVLV--TLLTFGQAHAVDMLAGQSGTVNDTFGANSTVAK 62 Query: 80 WVVLAEVLVGAVMYMMTKNVKFLAGFAIISVFIAVGMAVVG 120 W++LAEV++G Y+ TKN+ L G I+ VF VG + Sbjct: 63 WIILAEVIIGVASYIKTKNLLLLFGVIIVVVFTTVGFQLAA 103 >UniRef50_P12060 Pilin n=7 Tax=Enterobacteriaceae RepID=PIL1_SALTI Length = 119 Score = 66.6 bits (161), Expect = 2e-10, Method: Composition-based stats. 
Identities = 41/123 (33%), Positives = 66/123 (53%), Gaps = 9/123 (7%) Query: 1 MNAVLSVQGASAPVKKKSF----FSKFTRLNMLRLARAVIPAAVLMMFFPQLAMAAGSSG 56 MN + G APVK +++ + + L+R + +L++ Q+A S Sbjct: 1 MNLSFAKGGLPAPVKNRAWQYCQMAWRGVTSKKALSRLAALSPLLLLGVGQMA-----SA 55 Query: 57 QDLMASGNTTVKATFGKDSSVVKWVVLAEVLVGAVMYMMTKNVKFLAGFAIISVFIAVGM 116 DL+A G VKATFG DS V+ +++AE++VG MY+ TKN+ L G ++ VF VG+ Sbjct: 56 TDLLAGGKDDVKATFGADSFVMMCIIIAELIVGVAMYIRTKNLLILLGLVVVIVFTTVGL 115 Query: 117 AVV 119 + Sbjct: 116 TFI 118 >UniRef50_C8QE96 Putative uncharacterized protein n=2 Tax=Pantoea sp. At-9b RepID=C8QE96_9ENTR Length = 130 Score = 65.5 bits (158), Expect = 5e-10, Method: Composition-based stats. Identities = 27/104 (25%), Positives = 46/104 (44%), Gaps = 6/104 (5%) Query: 18 SFFSKFTRLNMLRLARAVIPAAVLMMFFPQLAMAAGSSGQDLMASGNTTVKATFGKDSSV 77 + + F+ L R+ +I AVL P L +DL++S K TFG S++ Sbjct: 32 TLYRSFSLLWRHRVNVLLIALAVLFFALPHLV-----RAEDLLSSQKQDAKDTFGHGSTI 86 Query: 78 VKWVVLAEVLVGAVMYMMTKNVKFL-AGFAIISVFIAVGMAVVG 120 + +AEV++ Y+ T+N G + V V ++ G Sbjct: 87 EWGLYIAEVIISIGAYIKTRNPMLFVGGLGFLIVVTRVLFSLAG 130 >UniRef50_B6ICF0 Putative TraA protein n=1 Tax=Escherichia coli SE11 RepID=B6ICF0_ECOSE Length = 123 Score = 48.1 bits (113), Expect = 8e-05, Method: Composition-based stats. Identities = 25/118 (21%), Positives = 51/118 (43%) Query: 3 AVLSVQGASAPVKKKSFFSKFTRLNMLRLARAVIPAAVLMMFFPQLAMAAGSSGQDLMAS 62 AVL+ ++ K+K + + + V LA + DL ++ Sbjct: 5 AVLNNVMTTSAKKRKPGLLSRCLAVVNKSTAKALVKYVAAPLALWLAAQGMAQATDLASA 64 Query: 63 GNTTVKATFGKDSSVVKWVVLAEVLVGAVMYMMTKNVKFLAGFAIISVFIAVGMAVVG 120 G V+ATFG+DS+++ + ++AEV + ++Y+ N + V V +++ Sbjct: 65 GKADVEATFGEDSTMMYYFMIAEVFLVFMIYLRNHNPATFVLIPVFIVVTKVIFSMIA 122 >UniRef50_D1RMP4 Type IV conjugative transfer system pilin TraA-like protein n=2 Tax=Legionella RepID=D1RMP4_LEGLO Length = 107 Score = 47.8 bits (112), Expect = 1e-04, Method: Composition-based stats. 
Identities = 16/67 (23%), Positives = 35/67 (52%) Query: 54 SSGQDLMASGNTTVKATFGKDSSVVKWVVLAEVLVGAVMYMMTKNVKFLAGFAIISVFIA 113 ++GQ+ ++ V ATFG++S + ++ + E L V++ K+ G ++ +F Sbjct: 41 AAGQNYLSPMKADVSATFGQNSDLPGYLYMGETLGAGVIWWQKKSPWVFVGLPLLMIFTH 100 Query: 114 VGMAVVG 120 G++ V Sbjct: 101 WGLSYVA 107 >UniRef50_A5KZF1 Putative uncharacterized protein n=1 Tax=Vibrionales bacterium SWAT-3 RepID=A5KZF1_9GAMM Length = 100 Score = 38.9 bits (89), Expect = 0.055, Method: Composition-based stats. Identities = 27/74 (36%), Positives = 41/74 (55%), Gaps = 1/74 (1%) Query: 49 AMAAGSSGQDLMASGNTTVKATFGKDSSVVKWVVLAEVLVGAVMYMMTKN-VKFLAGFAI 107 A A+ + DL ASG TV T G SS ++++A ++ G ++ ++ KN V + GF + Sbjct: 27 ACASPAYADDLFASGKDTVTNTVGTGSSGEFYILVAGLIGGIIVGIIQKNWVGGIIGFFV 86 Query: 108 ISVFIAVGMAVVGL 121 VF VG VGL Sbjct: 87 GVVFWEVGKGFVGL 100 Searching..................................................done Results from round 3 Score E Sequences producing significant alignments: (bits) Value Sequences used in model and found again: UniRef50_B1VC86 Pilin n=55 Tax=root RepID=PIL2_ECOLX 100 1e-20 UniRef50_A4WGS0 Putative uncharacterized protein n=1 Tax=Enterob... 93 3e-18 UniRef50_B6ICF0 Putative TraA protein n=1 Tax=Escherichia coli S... 91 8e-18 UniRef50_C8QE96 Putative uncharacterized protein n=2 Tax=Pantoea... 87 2e-16 UniRef50_B2VB05 TraA protein n=1 Tax=Erwinia tasmaniensis RepID=... 86 5e-16 UniRef50_D1RMP4 Type IV conjugative transfer system pilin TraA-l... 81 1e-14 UniRef50_P12060 Pilin n=7 Tax=Enterobacteriaceae RepID=PIL1_SALTI 78 7e-14 UniRef50_A4TSY5 Conjugative transfer fimbrial subunit n=2 Tax=Ye... 71 1e-11 UniRef50_A4SUC7 TraA fimbrial protein n=1 Tax=Aeromonas salmonic... 69 6e-11 UniRef50_D2U005 Conjugal transfer pilus assembly protein TraA n=... 68 8e-11 Sequences not found previously or not previously below threshold: CONVERGED! >UniRef50_B1VC86 Pilin n=55 Tax=root RepID=PIL2_ECOLX Length = 121 Score = 100 bits (249), Expect = 1e-20, Method: Composition-based stats. 
Identities = 119/121 (98%), Positives = 120/121 (99%) Query: 1 MNAVLSVQGASAPVKKKSFFSKFTRLNMLRLARAVIPAAVLMMFFPQLAMAAGSSGQDLM 60 M+AVLSVQG SAPVKKKSFFSKFTRLNMLRLARAVIPAAVLMMFFPQLAMAAGSSGQDLM Sbjct: 1 MDAVLSVQGVSAPVKKKSFFSKFTRLNMLRLARAVIPAAVLMMFFPQLAMAAGSSGQDLM 60 Query: 61 ASGNTTVKATFGKDSSVVKWVVLAEVLVGAVMYMMTKNVKFLAGFAIISVFIAVGMAVVG 120 ASGNTTVKATFGKDSSVVKWVVLAEVLVGAVMYMMTKNVKFLAGFAIISVFIAVGMAVVG Sbjct: 61 ASGNTTVKATFGKDSSVVKWVVLAEVLVGAVMYMMTKNVKFLAGFAIISVFIAVGMAVVG 120 Query: 121 L 121 L Sbjct: 121 L 121 >UniRef50_A4WGS0 Putative uncharacterized protein n=1 Tax=Enterobacter sp. 638 RepID=A4WGS0_ENT38 Length = 117 Score = 92.8 bits (229), Expect = 3e-18, Method: Composition-based stats. Identities = 30/119 (25%), Positives = 56/119 (47%), Gaps = 7/119 (5%) Query: 2 NAVLSVQGASAPVKKKSFFSKFTRLNMLRLARAVIPAAVLMMFFPQLAMAAGSSGQDLMA 61 NA+ V+G + ++ L+ ++ + LM+ P LA +G DL+A Sbjct: 6 NALNVVKGWAVASLFRTALENRQMKKFLKNGGLILIS--LMVAHPVLA-----AGTDLLA 58 Query: 62 SGNTTVKATFGKDSSVVKWVVLAEVLVGAVMYMMTKNVKFLAGFAIISVFIAVGMAVVG 120 TV +TFG SS++KW +AE+++G +Y+ ++ G +F V ++ Sbjct: 59 PQQATVNSTFGSGSSLIKWFYIAEIVMGLFIYIKARSPLVFVGIVGCIIFTRVAFSIAS 117 >UniRef50_B6ICF0 Putative TraA protein n=1 Tax=Escherichia coli SE11 RepID=B6ICF0_ECOSE Length = 123 Score = 91.3 bits (225), Expect = 8e-18, Method: Composition-based stats. Identities = 25/118 (21%), Positives = 51/118 (43%) Query: 3 AVLSVQGASAPVKKKSFFSKFTRLNMLRLARAVIPAAVLMMFFPQLAMAAGSSGQDLMAS 62 AVL+ ++ K+K + + + V LA + DL ++ Sbjct: 5 AVLNNVMTTSAKKRKPGLLSRCLAVVNKSTAKALVKYVAAPLALWLAAQGMAQATDLASA 64 Query: 63 GNTTVKATFGKDSSVVKWVVLAEVLVGAVMYMMTKNVKFLAGFAIISVFIAVGMAVVG 120 G V+ATFG+DS+++ + ++AEV + ++Y+ N + V V +++ Sbjct: 65 GKADVEATFGEDSTMMYYFMIAEVFLVFMIYLRNHNPATFVLIPVFIVVTKVIFSMIA 122 >UniRef50_C8QE96 Putative uncharacterized protein n=2 Tax=Pantoea sp. At-9b RepID=C8QE96_9ENTR Length = 130 Score = 86.7 bits (213), Expect = 2e-16, Method: Composition-based stats. 
Identities = 27/104 (25%), Positives = 46/104 (44%), Gaps = 6/104 (5%) Query: 18 SFFSKFTRLNMLRLARAVIPAAVLMMFFPQLAMAAGSSGQDLMASGNTTVKATFGKDSSV 77 + + F+ L R+ +I AVL P L +DL++S K TFG S++ Sbjct: 32 TLYRSFSLLWRHRVNVLLIALAVLFFALPHLV-----RAEDLLSSQKQDAKDTFGHGSTI 86 Query: 78 VKWVVLAEVLVGAVMYMMTKNVKFLA-GFAIISVFIAVGMAVVG 120 + +AEV++ Y+ T+N G + V V ++ G Sbjct: 87 EWGLYIAEVIISIGAYIKTRNPMLFVGGLGFLIVVTRVLFSLAG 130 >UniRef50_B2VB05 TraA protein n=1 Tax=Erwinia tasmaniensis RepID=B2VB05_ERWT9 Length = 105 Score = 85.5 bits (210), Expect = 5e-16, Method: Composition-based stats. Identities = 28/101 (27%), Positives = 53/101 (52%), Gaps = 3/101 (2%) Query: 20 FSKFTRLNMLRLARAVIPAAVLMMFFPQLAMAAGSSGQDLMASGNTTVKATFGKDSSVVK 79 F+R N +L+ ++ AAV + P++A +A SS DL+ + T+ +TFG SS+ K Sbjct: 8 ILNFSRANKKKLS--IVMAAVTVCMLPEIAFSADSS-TDLLQAQQATINSTFGHGSSLEK 64 Query: 80 WVVLAEVLVGAVMYMMTKNVKFLAGFAIISVFIAVGMAVVG 120 + +AEV + + Y + G ++ +F + ++G Sbjct: 65 YFYIAEVFMSLIAYFRARTPMVFIGLIMVIIFTRIAFGIIG 105 >UniRef50_D1RMP4 Type IV conjugative transfer system pilin TraA-like protein n=2 Tax=Legionella RepID=D1RMP4_LEGLO Length = 107 Score = 80.5 bits (197), Expect = 1e-14, Method: Composition-based stats. Identities = 16/69 (23%), Positives = 35/69 (50%) Query: 52 AGSSGQDLMASGNTTVKATFGKDSSVVKWVVLAEVLVGAVMYMMTKNVKFLAGFAIISVF 111 ++GQ+ ++ V ATFG++S + ++ + E L V++ K+ G ++ +F Sbjct: 39 GHAAGQNYLSPMKADVSATFGQNSDLPGYLYMGETLGAGVIWWQKKSPWVFVGLPLLMIF 98 Query: 112 IAVGMAVVG 120 G++ V Sbjct: 99 THWGLSYVA 107 >UniRef50_P12060 Pilin n=7 Tax=Enterobacteriaceae RepID=PIL1_SALTI Length = 119 Score = 78.2 bits (191), Expect = 7e-14, Method: Composition-based stats. 
Identities = 41/123 (33%), Positives = 66/123 (53%), Gaps = 9/123 (7%) Query: 1 MNAVLSVQGASAPVKKKSF----FSKFTRLNMLRLARAVIPAAVLMMFFPQLAMAAGSSG 56 MN + G APVK +++ + + L+R + +L++ Q+A S Sbjct: 1 MNLSFAKGGLPAPVKNRAWQYCQMAWRGVTSKKALSRLAALSPLLLLGVGQMA-----SA 55 Query: 57 QDLMASGNTTVKATFGKDSSVVKWVVLAEVLVGAVMYMMTKNVKFLAGFAIISVFIAVGM 116 DL+A G VKATFG DS V+ +++AE++VG MY+ TKN+ L G ++ VF VG+ Sbjct: 56 TDLLAGGKDDVKATFGADSFVMMCIIIAELIVGVAMYIRTKNLLILLGLVVVIVFTTVGL 115 Query: 117 AVV 119 + Sbjct: 116 TFI 118 >UniRef50_A4TSY5 Conjugative transfer fimbrial subunit n=2 Tax=Yersinia pestis RepID=A4TSY5_YERPP Length = 116 Score = 70.9 bits (172), Expect = 1e-11, Method: Composition-based stats. Identities = 63/108 (58%), Positives = 83/108 (76%), Gaps = 1/108 (0%) Query: 13 PVKKKSFFSKFTRLNMLRLARAVIPAAVLMMFFPQLAMAAGSSGQDLMASGNTTVKATFG 72 V KF+++ +L+ + +P A L FFP+ +A+ ++G+DLM+SG+ TVK TFG Sbjct: 9 AVPATPAKKKFSKVALLKALKFALPVAALAAFFPETVLAS-TAGKDLMSSGDATVKGTFG 67 Query: 73 KDSSVVKWVVLAEVLVGAVMYMMTKNVKFLAGFAIISVFIAVGMAVVG 120 KDSSVVKWV+LAEVLVGAVMYM TKN+KFLAGFAI+SVF+ VGM+V G Sbjct: 68 KDSSVVKWVILAEVLVGAVMYMTTKNLKFLAGFAILSVFVTVGMSVAG 115 >UniRef50_A4SUC7 TraA fimbrial protein n=1 Tax=Aeromonas salmonicida subsp. salmonicida A449 RepID=A4SUC7_AERS4 Length = 103 Score = 68.6 bits (166), Expect = 6e-11, Method: Composition-based stats. Identities = 35/102 (34%), Positives = 54/102 (52%), Gaps = 2/102 (1%) Query: 19 FFSKFTRLNMLRLARAVIPAAVLMMFFPQLAMAAGSSGQDLMASGNTTVKATFGKDSSVV 78 + T+LN L R + AV ++ L + D++A + TV TFG +S+V Sbjct: 4 ALQERTQLNPKHLLRVMALMAVAVLV--TLLTFGQAHAVDMLAGQSGTVNDTFGANSTVA 61 Query: 79 KWVVLAEVLVGAVMYMMTKNVKFLAGFAIISVFIAVGMAVVG 120 KW++LAEV++G Y+ TKN+ L G I+ VF VG + Sbjct: 62 KWIILAEVIIGVASYIKTKNLLLLFGVIIVVVFTTVGFQLAA 103 >UniRef50_D2U005 Conjugal transfer pilus assembly protein TraA n=1 Tax=Arsenophonus nasoniae RepID=D2U005_9ENTR Length = 117 Score = 68.2 bits (165), Expect = 8e-11, Method: Composition-based stats. 
 Identities = 28/80 (35%), Positives = 44/80 (55%), Gaps = 1/80 (1%)

Query: 41  LMMFFPQLAMAAGS-SGQDLMASGNTTVKATFGKDSSVVKWVVLAEVLVGAVMYMMTKNV 99
           L+MF P + + A   +  DL + G   VK+TFGK S+VV  + + E+L     Y+ TKN+
Sbjct: 38  LLMFIPAMQVYADPPTTGDLFSGGKEVVKSTFGKGSTVVWILYVLEILAAIFAYVKTKNL 97

Query: 100 KFLAGFAIISVFIAVGMAVV 119
               G A + VF+ V   ++
Sbjct: 98  AVFGGIAAVIVFVNVVFGLI 117

  Database: uniref50.fasta
    Posted date: Mar 8, 2010 10:38 AM
  Number of letters in database: 1,040,396,356
  Number of sequences in database: 3,077,464

Lambda     K      H
   0.323    0.131    0.302

Lambda     K      H
   0.267   0.0394    0.140

Matrix: BLOSUM62
Gap Penalties: Existence: 11, Extension: 1
Number of Hits to DB: 498,307,128
Number of Sequences: 3077464
Number of extensions: 14061408
Number of successful extensions: 78275
Number of sequences better than 1.0e-01: 16
Number of HSP's better than 0.1 without gapping: 34
Number of HSP's successfully gapped in prelim test: 12
Number of HSP's that attempted gapping in prelim test: 78221
Number of HSP's gapped (non-prelim): 47
length of query: 121
length of database: 1,040,396,356
effective HSP length: 88
effective length of query: 33
effective length of database: 769,579,524
effective search space: 25396124292
effective search space used: 25396124292
T: 11
A: 40
X1: 16 ( 7.5 bits)
X2: 38 (14.6 bits)
X3: 64 (24.7 bits)
S1: 40 (21.6 bits)
S2: 87 (38.2 bits)
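
The figures in the statistics footer can be cross-checked by hand. The sketch below is not part of the BLAST output. It recomputes the effective lengths and search space from the query length, database size and effective HSP length, and converts the S1 and S2 raw scores to bits with the standard Karlin-Altschul relation S' = (lambda * S - ln K) / ln 2. Two things are assumptions: the function and constant names are made up for illustration, and the first Lambda/K pair is taken to be the ungapped one and the second the gapped one, since the report does not label them.

```python
import math

# Values copied from the statistics footer of the report.
QUERY_LENGTH = 121
DB_LETTERS = 1_040_396_356
DB_SEQUENCES = 3_077_464
EFFECTIVE_HSP_LENGTH = 88

# Assumption: the first Lambda/K pair in the footer is the ungapped one
# and the second is the gapped one (the report does not label them).
UNGAPPED = (0.323, 0.131)
GAPPED = (0.267, 0.0394)


def effective_search_space(query_len, db_letters, db_seqs, hsp_len):
    """Return (effective query length, effective database length, product)."""
    eff_query = query_len - hsp_len
    # One HSP length is subtracted per database sequence.
    eff_db = db_letters - db_seqs * hsp_len
    return eff_query, eff_db, eff_query * eff_db


def bit_score(raw_score, lam, k):
    """Convert a raw score to bits: S' = (lambda * S - ln K) / ln 2."""
    return (lam * raw_score - math.log(k)) / math.log(2)


def expect_value(bits, search_space):
    """Karlin-Altschul expectation: E = search_space * 2 ** (-S')."""
    return search_space * 2.0 ** (-bits)


if __name__ == "__main__":
    eff_q, eff_db, space = effective_search_space(
        QUERY_LENGTH, DB_LETTERS, DB_SEQUENCES, EFFECTIVE_HSP_LENGTH
    )
    # Compare with "effective length of query", "effective length of
    # database" and "effective search space" in the footer.
    print(eff_q, eff_db, space)
    # Compare with the "S1: 40" and "S2: 87" lines in the footer.
    print(round(bit_score(40, *UNGAPPED), 1), round(bit_score(87, *GAPPED), 1))
    # E-value implied by a raw score of 87 under the gapped parameters.
    print(expect_value(bit_score(87, *GAPPED), space))
```

Individual hits may deviate slightly from this calculation, because the compositional adjustments named in the "Method:" fields rescale scores per alignment. The footer parameters are best read as the unadjusted baseline.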