BLASTP 2.2.22 [Sep-27-2009]

Reference: Altschul, Stephen F., Thomas L. Madden, Alejandro A. Schaffer, Jinghui Zhang, Zheng Zhang, Webb Miller, and David J. Lipman (1997), "Gapped BLAST and PSI-BLAST: a new generation of protein database search programs", Nucleic Acids Res. 25:3389-3402.

Reference for compositional score matrix adjustment: Altschul, Stephen F., John C. Wootton, E. Michael Gertz, Richa Agarwala, Aleksandr Morgulis, Alejandro A. Schaffer, and Yi-Kuo Yu (2005) "Protein database searches using compositionally adjusted substitution matrices", FEBS J. 272:5101-5109.

Reference for composition-based statistics starting in round 2: Schaffer, Alejandro A., L. Aravind, Thomas L. Madden, Sergei Shavirin, John L. Spouge, Yuri I. Wolf, Eugene V. Koonin, and Stephen F. Altschul (2001), "Improving the accuracy of PSI-BLAST protein database searches with composition-based statistics and other refinements", Nucleic Acids Res. 29:2994-3005.

Query= batch____
         (166 letters)

Database: uniref50.fasta
           3,077,464 sequences; 1,040,396,356 total letters

Searching..................................................done

Results from round 1

                                                                 Score    E
Sequences producing significant alignments:                      (bits) Value

UniRef50_P33227 Putative protein stfE (Fragment) n=60 Tax=Entero...  338  3e-92
UniRef50_B3I9S3 DNA inversion product n=5 Tax=Escherichia coli R...  188  6e-47
UniRef50_C6V0Q3 Phage tail fiber protein n=13 Tax=Enterobacteria...  186  3e-46
UniRef50_Q9MCR6 Tail fiber n=1 Tax=Enterobacteria phage HK97 Rep...  184  1e-45
UniRef50_B7MJL6 Putative phage tail protein n=3 Tax=Escherichia ...  178  4e-44
UniRef50_B3I8J5 DNA inversion product n=3 Tax=Enterobacteriaceae...  175  6e-43
UniRef50_UPI0001B5347E putative variable tail fiber protein n=1 ...  174  9e-43
UniRef50_B7US81 Predicted tail fiber protein n=16 Tax=root RepID...  167  1e-40
UniRef50_D2TR91 Putative phage tail fibre protein n=1 Tax=Citrob...  166  4e-40
UniRef50_UPI000190EC42 bacteriophage tail fiber protein n=1 Tax=...  161  8e-39
UniRef50_B7NJP1 Putative side tail fiber protein homolog from la...  157  9e-38
UniRef50_A9R3H4 Tail collar domain protein n=20 Tax=Yersinia Rep...  151  8e-36
UniRef50_A4WEL3 Phage Tail Collar domain protein n=2 Tax=Enterob...  141  7e-33
UniRef50_D1TPQ4 Phage tail collar domain protein n=1 Tax=Yersini...  138  6e-32
UniRef50_Q9AHZ5 YdaB n=29 Tax=Enterobacteriaceae RepID=Q9AHZ5_PHOLU  134  2e-30
UniRef50_C6CGA0 Tail Collar domain protein n=5 Tax=Enterobacteri...  133  2e-30
UniRef50_C7BSQ1 Side tail fiber protein homolog from lambdoid pr...  133  2e-30
UniRef50_Q7N2R3 Similar to tail fiber assembly protein from bact...  133  3e-30
UniRef50_C6DE08 Tail Collar domain protein n=1 Tax=Pectobacteriu...  132  4e-30
UniRef50_B2PZV1 Putative uncharacterized protein n=2 Tax=Gammapr...  132  5e-30
UniRef50_C6CGA4 Tail Collar domain protein n=7 Tax=Enterobacteri...  131  9e-30
UniRef50_Q6D3Y6 Probable bacteriophage variable tail fiber prote...  130  2e-29
UniRef50_C6C5D4 Tail Collar domain protein n=1 Tax=Dickeya dadan...  129  3e-29
UniRef50_D2BYH6 Tail Collar domain protein n=1 Tax=Dickeya dadan...  127  1e-28
UniRef50_Q7N5C0 Similarities with DNA inversion product n=5 Tax=...  127  1e-28
UniRef50_C6CP84 Tail Collar domain protein n=2 Tax=Dickeya RepID...  126  3e-28
UniRef50_C4UEH4 Variable tail fiber protein n=3 Tax=Yersinia Rep...  125  5e-28
UniRef50_C6CP88 Tail Collar domain protein n=5 Tax=Enterobacteri...  124  1e-27
UniRef50_C6DA10 Tail Collar domain protein n=7 Tax=Enterobacteri...  123  2e-27
UniRef50_D2BT14 Tail Collar domain protein n=1 Tax=Dickeya dadan...  122  3e-27
UniRef50_C6C5D2 Tail Collar domain protein n=2 Tax=Dickeya dadan...  122  4e-27
UniRef50_Q7N2Q1 Similar to Sc/SvQ protein of Escherichia coli pl...  120  1e-26
UniRef50_C6C6Z0 Tail Collar domain protein n=1 Tax=Dickeya dadan...  120  2e-26
UniRef50_C7BSQ6 Putative tail fiber protein n=1 Tax=Photorhabdus...  118  6e-26
UniRef50_C4S5W0 Putative uncharacterized protein n=2 Tax=Yersini...  117  2e-25
UniRef50_Q7N348 Similarities with tail fiber protein n=1 Tax=Pho...  116  2e-25
UniRef50_A7FIU0 Tail collar domain protein n=1 Tax=Yersinia pseu...  116  3e-25
UniRef50_D0KLI8 Tail Collar domain protein n=1 Tax=Pectobacteriu...  116  3e-25
UniRef50_C6CG98 Tail Collar domain protein n=1 Tax=Dickeya zeae ...  116  3e-25
UniRef50_Q65WH4 Putative uncharacterized protein n=1 Tax=Mannhei...  115  6e-25
UniRef50_C4TTI1 Tail fiber protein n=2 Tax=Yersinia RepID=C4TTI1...  115  7e-25
UniRef50_C5BH14 Tail fiber n=2 Tax=Enterobacteriaceae RepID=C5BH...  114  1e-24
UniRef50_Q7NAA0 Complete genome; segment 1/17 n=2 Tax=Photorhabd...  113  2e-24
UniRef50_Q7N6T1 Similarities with DNA inversion product n=2 Tax=...  112  4e-24
UniRef50_Q6D2U8 Bacteriophage tail fiber protein n=1 Tax=Pectoba...  112  6e-24
UniRef50_D2U2G6 Phage tail assembly protein n=1 Tax=Arsenophonus...  111  1e-23
UniRef50_D2U1K0 Phage tail protein n=1 Tax=Arsenophonus nasoniae...  110  2e-23
UniRef50_C7BL21 Similarities with phage tail fiber protein n=1 T...  107  1e-22
UniRef50_UPI0001A44C27 bacteriophage tail fiber protein n=2 Tax=...  105  4e-22
UniRef50_B6VKW8 Sc/svq protein n=2 Tax=Photorhabdus asymbiotica ...  102  6e-21
UniRef50_UPI00019136B5 bacteriophage tail fiber protein n=7 Tax=...  97  1e-19
UniRef50_B3HKW0 Phage Tail Collar Domain protein n=11 Tax=Entero...  96  4e-19
UniRef50_Q1I687 Putative phage variable tail fibre protein n=1 T...  96  6e-19
UniRef50_C6BT48 Tail Collar domain protein n=1 Tax=Desulfovibrio...  95  9e-19
UniRef50_B7UGJ3 Predicted tai fiber protein n=15 Tax=Escherichia...  95  9e-19
UniRef50_Q4KHC6 Tail fibre protein, putative n=1 Tax=Pseudomonas...  89  4e-17
UniRef50_B2FIY3 Putative phage collar protein n=1 Tax=Stenotroph...  82  9e-15
UniRef50_Q3KH70 Putative phage tail fiber-related protein n=1 Ta...  82  9e-15
UniRef50_B5S308 Phage tail collar protein n=2 Tax=Ralstonia sola...  80  3e-14
UniRef50_Q7N047 Similarities with bacteriophage protein n=3 Tax=...  79  6e-14
UniRef50_Q99362 Protein 37 (Fragment) n=1 Tax=Enterobacteria pha...  79  8e-14
UniRef50_D0KGE3 Tail Collar domain protein n=1 Tax=Pectobacteriu...  78  8e-14
UniRef50_C3KCU2 Putative phage tail fiber-related protein n=1 Ta...  77  2e-13
UniRef50_C3X8Y3 Putative uncharacterized protein n=1 Tax=Oxaloba...  77  2e-13
UniRef50_Q7P172 Probable bacteriophge tail fiber protein n=1 Tax...  76  3e-13
UniRef50_P03764 Side tail fiber protein n=2 Tax=root RepID=STF_L...  76  5e-13
UniRef50_C8PDQ5 Phage Tail Collar Domain protein n=1 Tax=Campylo...  75  8e-13
UniRef50_Q31Q92 Putative uncharacterized protein n=2 Tax=Synecho...  75  9e-13
UniRef50_D0KGE5 Tail Collar domain protein n=4 Tax=Pectobacteriu...  75  1e-12
UniRef50_D0KGE1 Shufflon domain protein n=1 Tax=Pectobacterium w...  74  2e-12
UniRef50_Q93D60 PilV n=7 Tax=Escherichia coli RepID=Q93D60_ECOLX  74  2e-12
UniRef50_C3X8U2 Phage Tail Collar Domain containing protein n=1 ...  74  3e-12
UniRef50_A1VSH6 Phage Tail Collar domain protein n=1 Tax=Polarom...  73  3e-12
UniRef50_C6ABW9 Phage tail collar protein n=1 Tax=Bartonella gra...  73  4e-12
UniRef50_B2ZY49 Phage tail collar domain protein n=1 Tax=Ralston...  73  4e-12
UniRef50_A9DEM1 Tail protein 2 n=1 Tax=Yersinia phage PY100 RepI...  72  8e-12
UniRef50_C5J9F2 Putative uncharacterized protein (Fragment) n=1 ...  71  1e-11
UniRef50_A9IRI0 Phage related protein n=9 Tax=Bartonella RepID=A...  71  1e-11
UniRef50_A4TT73 Phage tail protein n=30 Tax=Enterobacteriaceae R...  70  2e-11
UniRef50_B8DPV9 Tail Collar domain protein n=1 Tax=Desulfovibrio...  69  4e-11
UniRef50_A4PE45 Tail fiber protein gpH n=3 Tax=root RepID=A4PE45...  69  5e-11
UniRef50_A6VBH2 Putative tail fiber protein n=2 Tax=Pseudomonas ...  68  1e-10
UniRef50_C3X3G6 Putative uncharacterized protein n=1 Tax=Oxaloba...  68  1e-10
UniRef50_B7LKX7 Putative side tail phage protein n=2 Tax=Escheri...  68  1e-10
UniRef50_A0NQ95 Putative tail fiber-related protein n=1 Tax=Labr...  67  2e-10
UniRef50_B4T041 Gp19 n=1 Tax=Salmonella enterica subsp. enterica...  66  4e-10
UniRef50_B3XIH6 GpH n=7 Tax=Enterobacteriaceae RepID=B3XIH6_ECOLX  66  4e-10
UniRef50_B3I2W7 Phage Tail Collar Domain family n=1 Tax=Escheric...  66  4e-10
UniRef50_D0ZBI4 Putative tail fiber protein n=1 Tax=Edwardsiella...  65  7e-10
UniRef50_P76072 Side tail fiber protein homolog from lambdoid pr...  65  9e-10
UniRef50_Q3ZL14 Tail fiber protein n=2 Tax=Enterobacteriaceae Re...  64  2e-09
UniRef50_B7L485 Putative tail fiber protein n=4 Tax=Escherichia ...  64  2e-09
UniRef50_Q3YZL1 Phage protein-related n=42 Tax=Enterobacteriacea...  63  3e-09
UniRef50_C4GFX3 Putative uncharacterized protein n=2 Tax=Kingell...  63  4e-09
UniRef50_B5FQX9 Side tail fiber protein n=2 Tax=Salmonella enter...  62  8e-09
UniRef50_Q7P176 Probable bacteriophge tail fiber protein n=1 Tax...  62  9e-09
UniRef50_B5PP06 Side tail fiber protein n=2 Tax=Salmonella enter...  61  1e-08
UniRef50_C5ABB4 Putative uncharacterized protein n=1 Tax=Burkhol...  60  2e-08
UniRef50_B3X2T1 Side tail fiber protein n=1 Tax=Shigella dysente...  60  3e-08
UniRef50_D0IJ09 Tail fiber protein H putative n=1 Tax=Vibrio sp....  60  3e-08
UniRef50_B9BDD9 Bacteriophage protein n=3 Tax=Burkholderia RepID...  60  3e-08
UniRef50_Q0BEK5 Phage Tail Collar domain protein n=1 Tax=Burkhol...  59  5e-08
UniRef50_A7INV5 Tail Collar domain protein n=1 Tax=Xanthobacter ...  59  5e-08
UniRef50_B8FJJ3 Tail Collar domain protein n=1 Tax=Desulfatibaci...  59  5e-08
UniRef50_Q116W7 Phage Tail Collar n=1 Tax=Trichodesmium erythrae...  58  1e-07
UniRef50_C3X912 Phage tail collar domain-containing protein n=1 ...  57  2e-07
UniRef50_A9IXL3 Phage-related protein n=6 Tax=Bartonella RepID=A...  56  5e-07
UniRef50_A9ITY4 Phage related protein n=6 Tax=Bartonella RepID=A...  55  9e-07
UniRef50_Q2T5M0 Phage-related tail fiber protein n=19 Tax=root R...  55  9e-07
UniRef50_A3YFP9 35 kDa protein-like n=1 Tax=Marinomonas sp. MED1...  54  2e-06
UniRef50_Q4TVW2 Tail fiber protein n=4 Tax=Viruses RepID=Q4TVW2_...  54  2e-06
UniRef50_C5A8Q3 Phage-related tail fiber protein n=1 Tax=Burkhol...  54  2e-06
UniRef50_UPI00016A4B89 phage-related tail fiber protein n=2 Tax=...  54  3e-06
UniRef50_Q56BI6 Gp12 short tail fibers n=1 Tax=Enterobacteria ph...  54  3e-06
UniRef50_B3R3K1 Bacteriophage large tail fiber protein n=1 Tax=C...  53  5e-06
UniRef50_A3GRX3 Tail fiber like-protein n=1 Tax=Vibrio cholerae ...  52  9e-06
UniRef50_Q727X4 Tail fiber protein, putative n=4 Tax=Desulfovibr...  51  1e-05
UniRef50_B5TAB1 Gp47 n=2 Tax=root RepID=B5TAB1_9CAUD  50  3e-05
UniRef50_Q03314 Protein rhiB n=2 Tax=Rhizobium leguminosarum bv....  50  3e-05
UniRef50_A9ITX5 Phage-related protein n=6 Tax=Bartonella RepID=A...  50  4e-05
UniRef50_D2L4G0 Tail Collar domain protein n=1 Tax=Desulfovibrio...  49  5e-05
UniRef50_B0USC5 Phage Tail Collar domain protein n=1 Tax=Haemoph...  49  5e-05
UniRef50_Q7Y2B3 Gp12 Short tail fibers n=2 Tax=unclassified T4-l...  49  5e-05
UniRef50_C2I7P2 Phage-related tail fiber protein n=1 Tax=Vibrio ...  49  7e-05
UniRef50_B4EF34 Putative phage tail protein n=1 Tax=Burkholderia...  49  8e-05
UniRef50_C5B185 Putative uncharacterized protein n=1 Tax=Methylo...  48  1e-04
UniRef50_B8HZW5 Tail Collar domain protein n=2 Tax=Clostridium R...  48  1e-04
UniRef50_A8YDB4 Genome sequencing data, contig C291 n=2 Tax=Micr...  48  1e-04
UniRef50_B8HZW4 Tail Collar domain protein n=2 Tax=Clostridium R...  47  2e-04
UniRef50_B5JYG6 Phage Tail Collar Domain family (Fragment) n=1 T...  46  4e-04
UniRef50_A3GUE7 Tail fiber protein H, putative (Fragment) n=1 Ta...  46  5e-04
UniRef50_A6E6G6 Phage Tail Collar n=1 Tax=Pedobacter sp. BAL39 R...  45  9e-04
UniRef50_P51735 Probable tail fiber protein n=27 Tax=root RepID=...  45  0.001
UniRef50_C3XAA4 Putative uncharacterized protein n=1 Tax=Oxaloba...  45  0.001
UniRef50_C3X8V5 Bacteriophage tail fiber protein n=2 Tax=Oxaloba...  45  0.001
UniRef50_A4NHY2 Probable tail fiber protein n=1 Tax=Haemophilus ...  44  0.001
UniRef50_C0DSG4 Putative uncharacterized protein n=1 Tax=Eikenel...  44  0.002
UniRef50_C3X3K6 Predicted protein n=2 Tax=Oxalobacter formigenes...  44  0.002
UniRef50_B5TK79 Tail collar protein n=2 Tax=root RepID=B5TK79_9VIRU  44  0.003
UniRef50_A4P195 Putative phage tail fibre protein (Fragment) n=1...  42  0.007
UniRef50_A4YX40 Putative uncharacterized protein n=1 Tax=Bradyrh...  42  0.009
UniRef50_C3X8I7 Putative uncharacterized protein n=1 Tax=Oxaloba...  42  0.009
UniRef50_B0UTN0 Phage Tail Collar domain protein n=1 Tax=Haemoph...  42  0.010
UniRef50_C3X3W1 Predicted protein n=2 Tax=Oxalobacter formigenes...  41  0.011
UniRef50_A3YA17 Prophage MuSo2, tail fiber protein, putative n=1...  41  0.012
UniRef50_C7BVI0 Structural protein n=1 Tax=Synechococcus phage S...  41  0.018
UniRef50_C3X1Y2 Tail fiber protein gpH n=1 Tax=Oxalobacter formi...  40  0.019
UniRef50_D1NFN8 Apo-citrate lyase phosphoribosyl-dephospho-CoA t...  40  0.019
UniRef50_B7LN99 Putative tail fiber protein n=2 Tax=Enterobacter...  40  0.026
UniRef50_C3X192 Predicted protein n=1 Tax=Oxalobacter formigenes...  40  0.030
UniRef50_A6EAB8 Phage Tail Collar n=1 Tax=Pedobacter sp. BAL39 R...  39  0.050
UniRef50_B6S308 ORF32 (Fragment) n=3 Tax=Enterobacteriaceae RepI...  39  0.051
UniRef50_C3X3R8 Predicted protein n=4 Tax=Oxalobacter formigenes...  39  0.069

>UniRef50_P33227 Putative protein stfE (Fragment) n=60 Tax=Enterobacteriaceae RepID=STFE_ECOLI
          Length = 166

 Score = 338 bits (868), Expect = 3e-92, Method: Compositional matrix adjust.
Identities = 166/166 (100%), Positives = 166/166 (100%) Query: 1 VGLGEGSALPVGVPVPWPSATPPTGWLKCNGAAFSAEEYPELAKAYPTNKLPDLRGEFIR 60 VGLGEGSALPVGVPVPWPSATPPTGWLKCNGAAFSAEEYPELAKAYPTNKLPDLRGEFIR Sbjct: 1 VGLGEGSALPVGVPVPWPSATPPTGWLKCNGAAFSAEEYPELAKAYPTNKLPDLRGEFIR 60 Query: 61 GWDDGRGIDTGRSILSIQGYATEDHAHGLPSRSTIVTDATINFYFDEIWVNSGTDIIKRG 120 GWDDGRGIDTGRSILSIQGYATEDHAHGLPSRSTIVTDATINFYFDEIWVNSGTDIIKRG Sbjct: 61 GWDDGRGIDTGRSILSIQGYATEDHAHGLPSRSTIVTDATINFYFDEIWVNSGTDIIKRG 120 Query: 121 NTNDAGLPAPDYGTFKTYKQSVDGLGAAASETRPRNIAFNYIVRAA 166 NTNDAGLPAPDYGTFKTYKQSVDGLGAAASETRPRNIAFNYIVRAA Sbjct: 121 NTNDAGLPAPDYGTFKTYKQSVDGLGAAASETRPRNIAFNYIVRAA 166 >UniRef50_B3I9S3 DNA inversion product n=5 Tax=Escherichia coli RepID=B3I9S3_ECOLX Length = 546 Score = 188 bits (477), Expect = 6e-47, Method: Compositional matrix adjust. Identities = 104/192 (54%), Positives = 119/192 (61%), Gaps = 49/192 (25%) Query: 1 VGLGEGSALPVGVPVPWPSATPPTGWLKCNGAAFSAEEYPELAKAYPTNKLPDLRGEFIR 60 +GLGEGSALPVGVPVPWPSATPPTGWLKCNGAAFS EEYPELAKAYPTNKLPDLRGEFIR Sbjct: 378 LGLGEGSALPVGVPVPWPSATPPTGWLKCNGAAFSVEEYPELAKAYPTNKLPDLRGEFIR 437 Query: 61 GWDDGRGIDTGRSILSIQGYATEDHAHGLPSRSTIVTDATINFYFDEIWVNSGTD----- 115 GWDDGRGIDTGR++L+ Q + DHAH + E+W G Sbjct: 438 GWDDGRGIDTGRALLNWQPHTILDHAH-----------------YMELWTGDGLAAGSAR 480 Query: 116 ---------------IIKRGNTNDAGLPAP------DYGTFKTYKQSVDGLGAAASETRP 154 I+K T++ GL P + K Y + + +G +ETRP Sbjct: 481 EGVNPGILATYGDGGIVK---TDEPGLKVPSSLRAISSRSVKRYGEISENVG---TETRP 534 Query: 155 RNIAFNYIVRAA 166 RNIAFNYIVRAA Sbjct: 535 RNIAFNYIVRAA 546 >UniRef50_C6V0Q3 Phage tail fiber protein n=13 Tax=Enterobacteriaceae RepID=C6V0Q3_ECO5T Length = 439 Score = 186 bits (471), Expect = 3e-46, Method: Compositional matrix adjust. 
Identities = 99/171 (57%), Positives = 116/171 (67%), Gaps = 12/171 (7%) Query: 1 VGLGEGSALPVGVPVPWPSATPPTGWLKCNGAAFSAEEYPELAKAYPTNKLPDLRGEFIR 60 +GLGEGSALPVGVPVPWPSATPPTGWLKCNGAAFSAEEYPELAK YPTNKLPDLRGEFIR Sbjct: 276 LGLGEGSALPVGVPVPWPSATPPTGWLKCNGAAFSAEEYPELAKVYPTNKLPDLRGEFIR 335 Query: 61 GWDDGRGIDTGRSILSIQGYATEDHAHGLPSRSTIVTDATINFYFDEIWVNSGTDIIKRG 120 GWDDGRG+D GR +L++Q A H H ++ D T+ + +GT I+K+ Sbjct: 336 GWDDGRGVDNGRGLLTLQDGAIVSHNHYWGIWTSRTNDQTLESF-------TGTTILKQI 388 Query: 121 N-----TNDAGLPAPDYGTFKTYKQSVDGLGAAASETRPRNIAFNYIVRAA 166 N P P+ + + A A+ETRPRN+AFNYIVRAA Sbjct: 389 TPLSPAINFDNYPIPNPAITEGGVVAATTKPAGANETRPRNVAFNYIVRAA 439 >UniRef50_Q9MCR6 Tail fiber n=1 Tax=Enterobacteria phage HK97 RepID=Q9MCR6_BPHK7 Length = 321 Score = 184 bits (466), Expect = 1e-45, Method: Compositional matrix adjust. Identities = 104/166 (62%), Positives = 114/166 (68%), Gaps = 5/166 (3%) Query: 1 VGLGEGSALPVGVPVPWPSATPPTGWLKCNGAAFSAEEYPELAKAYPTNKLPDLRGEFIR 60 +GLGEGSALPVGVPVPWPSATPPTGWLKCNGA FSAEEYPELAKAYPTNKLPDLRGEFIR Sbjct: 159 LGLGEGSALPVGVPVPWPSATPPTGWLKCNGAVFSAEEYPELAKAYPTNKLPDLRGEFIR 218 Query: 61 GWDDGRGIDTGRSILSIQGYATEDHAHGLPSRSTIVTDATINFY-FDEIWVNSGTDIIKR 119 GWDDGRGID GR ILS QG A + T V +A+I+FY D ++V Sbjct: 219 GWDDGRGIDAGREILSAQGDAIRNITGTFGDGETEV-NASISFYRADGVFVTQKKLRNTI 277 Query: 120 GNTNDAGLPAPDYGTFKTYKQSVDGLGAAASETRPRNIAFNYIVRA 165 GNT P+ + S + ASE RPRNIAFNYIVRA Sbjct: 278 GNTTIIA-DTPNNPYLINFDAS--RVVPTASENRPRNIAFNYIVRA 320 >UniRef50_B7MJL6 Putative phage tail protein n=3 Tax=Escherichia RepID=B7MJL6_ECO45 Length = 247 Score = 178 bits (452), Expect = 4e-44, Method: Compositional matrix adjust. 
Identities = 100/167 (59%), Positives = 112/167 (67%), Gaps = 17/167 (10%) Query: 1 VGLGEGSALPVGVPVPWPSATPPTGWLKCNGAAFSAEEYPELAKAYPTNKLPDLRGEFIR 60 +GLGEGSALPVGVPVPWPSATPPTGWLKCNGAAFSAEEYPELAKAYPTNKLPDLRGEFIR Sbjct: 97 LGLGEGSALPVGVPVPWPSATPPTGWLKCNGAAFSAEEYPELAKAYPTNKLPDLRGEFIR 156 Query: 61 GWDDGRGIDTGRSILSIQGYATEDHAHGLPSRSTIVTDATINFYFDEIWVNSGTDIIKRG 120 GWDDGRG+D+ R++LS Q L S ++ + F D + + S + I Sbjct: 157 GWDDGRGVDSRRAVLSTQEPTVGTFYVELAIISGTLSGSGAKFT-DSVGIGSTSSNITVS 215 Query: 121 NTNDAGLPAPDYGTFKTYKQSVDGLGAAA-SETRPRNIAFNYIVRAA 166 N ND QSV G A +TRPRNIAFNYIVRAA Sbjct: 216 NGND---------------QSVSGTVAVNPVDTRPRNIAFNYIVRAA 247 >UniRef50_B3I8J5 DNA inversion product n=3 Tax=Enterobacteriaceae RepID=B3I8J5_ECOLX Length = 263 Score = 175 bits (443), Expect = 6e-43, Method: Compositional matrix adjust. Identities = 97/182 (53%), Positives = 111/182 (60%), Gaps = 29/182 (15%) Query: 1 VGLGEGSALPVGVPVPWPSATPPTGWLKCNGAAFSAEEYPELAKAYPTNKLPDLRGEFIR 60 +GLGEGSALPVG PVPWPS TPPTGWLKCNGAAFSAEEYPELAKAYPTNKLPDLRGEFIR Sbjct: 95 LGLGEGSALPVGAPVPWPSETPPTGWLKCNGAAFSAEEYPELAKAYPTNKLPDLRGEFIR 154 Query: 61 GWDDGRGIDTGRSILSIQGYATEDHAHGLPSRSTIVTDATINFYFDEIWVNSGTDII--- 117 GWDD RGIDTGRS+LS Q +T + A ++Y ++ N I Sbjct: 155 GWDDSRGIDTGRSLLSGQA-------------ATFIRTALQDYYGYDLNTNVKVGIAFAT 201 Query: 118 -------------KRGNTNDAGLPAPDYGTFKTYKQSVDGLGAAASETRPRNIAFNYIVR 164 K GN +D + D T + + D A RPRN++FNYIVR Sbjct: 202 ADSVITVGNPANPKAGNNSDYVPASADNSITGTQRTAEDNFTGAWISMRPRNLSFNYIVR 261 Query: 165 AA 166 AA Sbjct: 262 AA 263 >UniRef50_UPI0001B5347E putative variable tail fiber protein n=1 Tax=Shigella sp. D9 RepID=UPI0001B5347E Length = 550 Score = 174 bits (441), Expect = 9e-43, Method: Compositional matrix adjust. 
Identities = 96/182 (52%), Positives = 112/182 (61%), Gaps = 29/182 (15%) Query: 1 VGLGEGSALPVGVPVPWPSATPPTGWLKCNGAAFSAEEYPELAKAYPTNKLPDLRGEFIR 60 +GLG+GSALPVGVPVPWPSATPPTGWLKCNGAAFSAEEYP+LAK YPTNKLPDLRGEFIR Sbjct: 382 LGLGDGSALPVGVPVPWPSATPPTGWLKCNGAAFSAEEYPKLAKVYPTNKLPDLRGEFIR 441 Query: 61 GWDDGRGIDTGRSILSIQGYATEDHAHGLPSRSTIVTDATINFYFDEIWVNSGTDII--- 117 GWDD RGIDTGRS+LS Q +T + A ++Y ++ N I Sbjct: 442 GWDDSRGIDTGRSLLSGQA-------------ATFIRTALQDYYGYDLNTNVKVGIAFAT 488 Query: 118 -------------KRGNTNDAGLPAPDYGTFKTYKQSVDGLGAAASETRPRNIAFNYIVR 164 K GN +D + D T + + D A RPRN++FNYIVR Sbjct: 489 ADSVITVGNPANPKAGNNSDYVPASADNSITGTQRTAEDNFTGAWISMRPRNLSFNYIVR 548 Query: 165 AA 166 AA Sbjct: 549 AA 550 >UniRef50_B7US81 Predicted tail fiber protein n=16 Tax=root RepID=B7US81_ECO27 Length = 521 Score = 167 bits (423), Expect = 1e-40, Method: Compositional matrix adjust. Identities = 94/166 (56%), Positives = 106/166 (63%), Gaps = 22/166 (13%) Query: 1 VGLGEGSALPVGVPVPWPSATPPTGWLKCNGAAFSAEEYPELAKAYPTNKLPDLRGEFIR 60 +GLGEGSALPVGVPVPW SATPPTGWLKCNGAAFS+E YP LA+AYPTNKLPDLRGEFIR Sbjct: 378 LGLGEGSALPVGVPVPWSSATPPTGWLKCNGAAFSSEMYPRLARAYPTNKLPDLRGEFIR 437 Query: 61 GWDDGRGIDTGRSILSIQGYATEDHAHGLPSRSTIVTDATINFYFDEIWVNSGTDIIKRG 120 GWDDGRGID GR++LS Q + H G I + +IN Y D+I N Sbjct: 438 GWDDGRGIDAGRTLLSGQDGTSFSHYGG---NFDIGSGHSINNY-DQIVSNQ-------- 485 Query: 121 NTNDAGLPAPDYGTFKTYKQSVDGLGAAASETRPRNIAFNYIVRAA 166 P + F ++ G G RPRNI FNYIVRAA Sbjct: 486 ---------PGFSRF-SFAGPSRGDGVNYVTIRPRNITFNYIVRAA 521 >UniRef50_D2TR91 Putative phage tail fibre protein n=1 Tax=Citrobacter rodentium ICC168 RepID=D2TR91_CITRO Length = 279 Score = 166 bits (419), Expect = 4e-40, Method: Compositional matrix adjust. 
Identities = 95/172 (55%), Positives = 106/172 (61%), Gaps = 33/172 (19%) Query: 1 VGLGEGSALPVGVPVPWPSATPPTGWLKCNGAAFSAEEYPELAKAYPTNKLPDLRGEFIR 60 +GLGEGSALPVGVPVPWPSATPP GWLKCNGA FS+ YP+L AYP+ KLPDLRGEFIR Sbjct: 135 LGLGEGSALPVGVPVPWPSATPPEGWLKCNGATFSSSLYPKLGLAYPSGKLPDLRGEFIR 194 Query: 61 GWDDGRGIDTGRSILSIQGYATEDHAHGLPSRSTIVTDATINFYFDEIW------VNSGT 114 GWDDGRG D GRS+LS QG A H+H FD W +G Sbjct: 195 GWDDGRGADNGRSLLSSQGDAFRSHSHN----------------FDRSWGLENFDATAGY 238 Query: 115 DIIKRGNTNDAGLPAPDYGTFKTYKQSVDGLGAAASETRPRNIAFNYIVRAA 166 D++ + N + P T SV G SETRPRNIAFNYIVRAA Sbjct: 239 DVVT-ADINGKIVNQPTRSTV-----SVGG-----SETRPRNIAFNYIVRAA 279 >UniRef50_UPI000190EC42 bacteriophage tail fiber protein n=1 Tax=Salmonella enterica subsp. enterica serovar Typhi str. E01-6750 RepID=UPI000190EC42 Length = 317 Score = 161 bits (407), Expect = 8e-39, Method: Compositional matrix adjust. Identities = 96/177 (54%), Positives = 105/177 (59%), Gaps = 33/177 (18%) Query: 1 VGLGEGSALPVGVPVPWPSATPPTGWLKCNGAAFSAEEYPELAKAYPTNKLPDLRGEFIR 60 +GLGEGSALPVGVPVPWPSAT P GWLKCNGAAFS+E YP+LAKAYPTNKLPDLRGEFIR Sbjct: 163 LGLGEGSALPVGVPVPWPSATLPEGWLKCNGAAFSSEMYPKLAKAYPTNKLPDLRGEFIR 222 Query: 61 GWDDGRGIDTGRSILSIQGYATEDHAHGLPSRSTIVTDATINFYFDEIWVNSGTDIIKRG 120 GWDDGRGID GR ILS Q + TI FD+ N DI Sbjct: 223 GWDDGRGIDAGREILSFQ-------------------EGTIVSGFDD---NDTGDISSLS 260 Query: 121 NTNDA---GLPAPDYGTFKTYKQSVDGLGAAASE--------TRPRNIAFNYIVRAA 166 +T L + +G K D A + RPRNIAFNYIVRAA Sbjct: 261 STQYGFGDTLSSNQWGAINGKKWIFDASSKGAQKYDWWAYVSARPRNIAFNYIVRAA 317 >UniRef50_B7NJP1 Putative side tail fiber protein homolog from lambdoid prophage n=3 Tax=Escherichia coli RepID=B7NJP1_ECO7I Length = 686 Score = 157 bits (398), Expect = 9e-38, Method: Compositional matrix adjust. 
Identities = 92/174 (52%), Positives = 111/174 (63%), Gaps = 23/174 (13%) Query: 1 VGLGEGSALPVGVPVPWPSATPPTGWLKCNGAAFSAEEYPELAKAYPTNKLPDLRGEFIR 60 +GLGEGSALPVGVPVPWP+ATPP GWLKC+G AF+ E+YP LA+AYPT +LPDLRGEFIR Sbjct: 528 LGLGEGSALPVGVPVPWPTATPPEGWLKCDGRAFTKEQYPVLARAYPTLRLPDLRGEFIR 587 Query: 61 GWDDGRGIDTGRSILSIQGYATEDHAHGLPSRSTIVTDATIN-FYFDEIWVNSGTDIIKR 119 GWDDGR ID GR +LS Q + T+V N D ++++G +I Sbjct: 588 GWDDGRKIDEGRKLLSWQ-------------KGTLVGGHDDNDSALDISYMSNGNNIDYG 634 Query: 120 GNTNDAGLPAPDY-------GTFKTYKQSVDGLGAAASETRPRNIAFNYIVRAA 166 G+ AG DY GT K ++ GA + TRPRNIAFNYIVRAA Sbjct: 635 GDKVFAGNYRSDYLWYAVLGGTNSRAKAELN--GAFFNITRPRNIAFNYIVRAA 686 >UniRef50_A9R3H4 Tail collar domain protein n=20 Tax=Yersinia RepID=A9R3H4_YERPG Length = 259 Score = 151 bits (381), Expect = 8e-36, Method: Compositional matrix adjust. Identities = 90/184 (48%), Positives = 109/184 (59%), Gaps = 38/184 (20%) Query: 1 VGLGEGSALPVGVPVPWPSATPPTGWLKCNGAAFSAEEYPELAKAYPTNKLPDLRGEFIR 60 +GLGEGSA+ VGVP+PWP+ATPP GWLKCNGA F +YP+LA AYP+ LPDLRGEFIR Sbjct: 96 LGLGEGSAILVGVPLPWPTATPPEGWLKCNGAIFDKVKYPKLALAYPSGILPDLRGEFIR 155 Query: 61 GWDDGRGIDTGRSILSIQGYATEDHAHGLPSRSTIVTDATINFYFDEIWVNSGTDIIKRG 120 GWDDG G+D GR ILSIQG A + + G+ R+ +AT F Sbjct: 156 GWDDGLGVDAGREILSIQGDAIRNISGGIQGRN----EATSARLF--------------- 196 Query: 121 NTNDAGLPAPDYGTFKTYKQSVD-GLGA-----------------AASETRPRNIAFNYI 162 ++N G+ D G F +Y S D +G A+E RPRNIAFNYI Sbjct: 197 SSNATGVFRTD-GQFGSYAASADVAVGVTDDRLAELFFDASRSVPTANENRPRNIAFNYI 255 Query: 163 VRAA 166 VRAA Sbjct: 256 VRAA 259 >UniRef50_A4WEL3 Phage Tail Collar domain protein n=2 Tax=Enterobacteriaceae RepID=A4WEL3_ENT38 Length = 340 Score = 141 bits (356), Expect = 7e-33, Method: Compositional matrix adjust. 
Identities = 80/161 (49%), Positives = 97/161 (60%), Gaps = 13/161 (8%) Query: 9 LPVGVPVPWPSATPPTGWLKCNGAAFSAEEYPELAKAYPTNKLPDLRGEFIRGWDDGRGI 68 LPVG P+PWP ATPP GWLKCNGA F +YP+LA AYP+ LPDLRGEFIRGWDDGRG+ Sbjct: 190 LPVGFPLPWPQATPPQGWLKCNGAPFDKVKYPKLAVAYPSGLLPDLRGEFIRGWDDGRGV 249 Query: 69 DTGRSILSIQGYATEDHAHGLPSRSTIVTDATINFYFDEIWVNSGTDIIKRGNT---NDA 125 D+GR L+ QG A + + A F + SG + KRG+ N + Sbjct: 250 DSGRVALTTQGDAVQKMTGAASN------GAATGFVNNSTSRVSG--VFKRGSVIYPNTS 301 Query: 126 GLPAPDYGTFKTYKQSVDGLGAAASETRPRNIAFNYIVRAA 166 A G + S+ + +A ETRPRNIAFNYIVRAA Sbjct: 302 AQNADYQGVDLVFDSSL--MVRSAEETRPRNIAFNYIVRAA 340 >UniRef50_D1TPQ4 Phage tail collar domain protein n=1 Tax=Yersinia pestis KIM D27 RepID=D1TPQ4_YERPE Length = 262 Score = 138 bits (348), Expect = 6e-32, Method: Compositional matrix adjust. Identities = 63/93 (67%), Positives = 75/93 (80%) Query: 1 VGLGEGSALPVGVPVPWPSATPPTGWLKCNGAAFSAEEYPELAKAYPTNKLPDLRGEFIR 60 +GLGEGSA+PVGVP+PWP+ATPP GWLKCNGA F +YP+LA AYP+ LPDLRGEFIR Sbjct: 96 LGLGEGSAIPVGVPLPWPTATPPEGWLKCNGAIFDKVKYPKLALAYPSGILPDLRGEFIR 155 Query: 61 GWDDGRGIDTGRSILSIQGYATEDHAHGLPSRS 93 GWDDG G+D GR ILSIQG A + + G+ R+ Sbjct: 156 GWDDGLGVDAGREILSIQGDAIRNISGGIQGRN 188 >UniRef50_Q9AHZ5 YdaB n=29 Tax=Enterobacteriaceae RepID=Q9AHZ5_PHOLU Length = 296 Score = 134 bits (336), Expect = 2e-30, Method: Compositional matrix adjust. 
Identities = 75/164 (45%), Positives = 94/164 (57%), Gaps = 17/164 (10%) Query: 9 LPVGVPVPWPSATPPTGWLKCNGAAFSAEEYPELAKAYPTNKLPDLRGEFIRGWDDGRGI 68 +PVG P+PWP+A PP GWL+CNGA F ++PELAKAYP+ LPDLRGEFIRGWD+GRG+ Sbjct: 144 IPVGTPIPWPTAIPPVGWLQCNGAVFDKSKFPELAKAYPSGYLPDLRGEFIRGWDNGRGV 203 Query: 69 DTGRSILSIQGYATEDHAHGLPSRSTIVTD----ATINFYFDEIWVNSGTDIIKRGNTND 124 D GR + QG A + P + D AT ++ +I + TD G T Sbjct: 204 DPGRVCSTWQGDAIRNITGSFPG---AIADNYHLATKEAFYGKINLGIATD----GTTKS 256 Query: 125 AGLPAPD--YGTFKTYKQSVDGLGAAASETRPRNIAFNYIVRAA 166 + PD YG + + TRPRNIAFNYIVRA Sbjct: 257 KNIHNPDNPYG----FGFDASRVVPVPQRTRPRNIAFNYIVRAV 296 >UniRef50_C6CGA0 Tail Collar domain protein n=5 Tax=Enterobacteriaceae RepID=C6CGA0_DICZE Length = 166 Score = 133 bits (334), Expect = 2e-30, Method: Compositional matrix adjust. Identities = 76/162 (46%), Positives = 95/162 (58%), Gaps = 24/162 (14%) Query: 11 VGVPVPWPSATPPTGWLKCNGAAFSAEEYPELAKAYPTNKLPDLRGEFIRGWDDGRGIDT 70 VG+P+PWP ATPP GWLKCNG AF +P+LA+ YP+ LPDLRGEFIRGWDDGRG+D+ Sbjct: 23 VGIPLPWPQATPPAGWLKCNGQAFDKNAFPKLAQVYPSGVLPDLRGEFIRGWDDGRGVDS 82 Query: 71 GRSILSIQGYATEDHAHGLPSRSTIVTDATINFYFDEIWVNS---GTDIIKRGNTN---D 124 R++LS QG A + G S + D ++D NS G+ I+ + N + D Sbjct: 83 NRNLLSSQGDAIR-NITGFVSGVYVGFDGYSGAFYDTGSRNSISPGSTIVAQLNDDFAFD 141 Query: 125 AGLPAPDYGTFKTYKQSVDGLGAAASETRPRNIAFNYIVRAA 166 A P A+E RPRNIAFNYIVRAA Sbjct: 142 ASRVVP-----------------TANENRPRNIAFNYIVRAA 166 >UniRef50_C7BSQ1 Side tail fiber protein homolog from lambdoid prophage e14 n=3 Tax=Photorhabdus RepID=C7BSQ1_PHOAA Length = 166 Score = 133 bits (334), Expect = 2e-30, Method: Compositional matrix adjust. 
Identities = 78/174 (44%), Positives = 96/174 (55%), Gaps = 32/174 (18%) Query: 9 LPVGVPVPWPSATPPTGWLKCNGAAFSAEEYPELAKAYPTNKLPDLRGEFIRGWDDGRGI 68 +PVG+P+PWP+ PP GW+KCNGA F YP+LA AYP+ LPDLRGEFIRGWDDGRG+ Sbjct: 9 IPVGIPLPWPTDIPPYGWVKCNGAIFDKYLYPKLAVAYPSGNLPDLRGEFIRGWDDGRGV 68 Query: 69 DTGRSILSIQGYATEDHAH----------------GLPSRSTIVTDATINFYFDEIWVNS 112 D GR +LS Q H+H G PSR +N + VN Sbjct: 69 DIGRYVLSTQLADIAPHSHRIGRMWSNSNAGAEGLGTPSR-------ILNSVYQG--VNY 119 Query: 113 GTDIIKRGNTNDAGLPAPDYGTFKTYKQSVDGLGAAASETRPRNIAFNYIVRAA 166 G D RG G+ + +G + G+ ETRPRN+AFNYIVRAA Sbjct: 120 GID--TRGLGIAIGMGSGGFGYMDNAVAASTGI-----ETRPRNVAFNYIVRAA 166 >UniRef50_Q7N2R3 Similar to tail fiber assembly protein from bacteriophage n=1 Tax=Photorhabdus luminescens subsp. laumondii RepID=Q7N2R3_PHOLL Length = 233 Score = 133 bits (334), Expect = 3e-30, Method: Compositional matrix adjust. Identities = 76/166 (45%), Positives = 93/166 (56%), Gaps = 34/166 (20%) Query: 10 PVGVPVPWPSATPPTGWLKCNGAAFSAEEYPELAKAYPTNKLPDLRGEFIRGWDDGRGID 69 PVGVP+P+PS P G+L CNG AF YP+LA AYP+ LPDLRGEFIRGWDD RG+D Sbjct: 93 PVGVPLPYPSRYTPAGYLTCNGQAFDKSRYPQLAIAYPSGILPDLRGEFIRGWDDSRGVD 152 Query: 70 TGRSILSIQGYATEDHAHGLPSRSTIVTDATINFYFDEIWVNSGTDIIKRGNTNDAGLPA 129 GR +LS Q +DH H +V D++ GN + Sbjct: 153 MGRGMLSWQPAGIQDHMHYKVISKQVV-----------------EDLVLAGNQS------ 189 Query: 130 PDYGTFK--TYKQSVDG-------LGAAASETRPRNIAFNYIVRAA 166 +GT K TY +S+D +G +ETRPRNIAFNYIVRAA Sbjct: 190 --WGTEKNSTYTRSLDQNISTGGVIGTTVNETRPRNIAFNYIVRAA 233 >UniRef50_C6DE08 Tail Collar domain protein n=1 Tax=Pectobacterium carotovorum subsp. carotovorum PC1 RepID=C6DE08_PECCP Length = 682 Score = 132 bits (333), Expect = 4e-30, Method: Compositional matrix adjust. 
Identities = 73/160 (45%), Positives = 90/160 (56%), Gaps = 14/160 (8%) Query: 11 VGVPVPWPSATPPTGWLKCNGAAFSAEEYPELAKAYPTNKLPDLRGEFIRGWDDGRGIDT 70 VG+P+PWP T P+GWLKCNG F YP+LA+ YP LPDLRGEFIRGWDD RG+DT Sbjct: 533 VGMPMPWPQTTAPSGWLKCNGQTFDKNIYPKLAQIYPAGILPDLRGEFIRGWDDSRGVDT 592 Query: 71 GRSILSIQGYATEDHAHGLPSRSTIVTDATINFYFDEIWVNSGTDIIKRGNT----NDAG 126 GR++LS QG A + + T A F + + N ++ K + AG Sbjct: 593 GRTLLSTQGDAIRNIVGEI-----WTTAANYQFLGENLLSNGAFELFKEFTVGAIPDAAG 647 Query: 127 LPAPDYGTFKTYKQSVDGLGAAASETRPRNIAFNYIVRAA 166 P F + + ASE RPRNIAFNYIVRAA Sbjct: 648 NSCPSRMKFDASR-----IVPTASENRPRNIAFNYIVRAA 682 >UniRef50_B2PZV1 Putative uncharacterized protein n=2 Tax=Gammaproteobacteria RepID=B2PZV1_PROST Length = 526 Score = 132 bits (331), Expect = 5e-30, Method: Compositional matrix adjust. Identities = 79/174 (45%), Positives = 91/174 (52%), Gaps = 25/174 (14%) Query: 2 GLGEGSALPVGVPVPWPSATPPTGWLKCNGAAFSAEEYPELAKAYPTNKLPDLRGEFIRG 61 G E S PVG P+PWP AT P+G+L CNG AF+ YP L KAYP+ KLPDLRGEFIRG Sbjct: 369 GSSELSDCPVGAPIPWPQATAPSGYLICNGQAFNKTTYPLLTKAYPSGKLPDLRGEFIRG 428 Query: 62 WDDGRGIDTGRSILSIQGYATEDHAH----GLPSRSTIVTDATINFYFDEIWVNSGTDII 117 D GR ID GR +LS Q ATE H H G S + + T + Sbjct: 429 LDAGRNIDNGRVVLSFQRCATEHHKHISGWGEASNANAIFGKT----------------V 472 Query: 118 KRGNTNDAGLPAPDYGTFKTYKQSVDG-----LGAAASETRPRNIAFNYIVRAA 166 K G A +Y + G G A+ETRPRNIAF YIVRAA Sbjct: 473 KNGYVGSASTDRDNYLFYTNDGSEFQGSNPNSTGIMANETRPRNIAFLYIVRAA 526 >UniRef50_C6CGA4 Tail Collar domain protein n=7 Tax=Enterobacteriaceae RepID=C6CGA4_DICZE Length = 401 Score = 131 bits (329), Expect = 9e-30, Method: Compositional matrix adjust. 
Identities = 79/165 (47%), Positives = 95/165 (57%), Gaps = 20/165 (12%) Query: 11 VGVPVPWPSATPPTGWLKCNGAAFSAEEYPELAKAYPTNKLPDLRGEFIRGWDDGRGIDT 70 VG+P+PWP ATPP GWLKCNG AF +P+LA+AYP LPDLRGEFIRGWDDGRG+D Sbjct: 248 VGIPLPWPQATPPAGWLKCNGQAFDKNAFPKLAQAYPGGVLPDLRGEFIRGWDDGRGVDV 307 Query: 71 GRSILSIQ-GYATEDHAHGLPSRSTIVTDATINFYFD--EIWVNSGTDIIKRGNTNDAGL 127 R +LS Q G T P+ S + A I+ D + + G DI+ N +D + Sbjct: 308 ARELLSWQKGTLTISD----PNLSAVNVGALIHANNDSANTYKSMGFDIV---NKSDYAM 360 Query: 128 PAPDYGTFKTYKQSVD------GLGAAASETRPRNIAFNYIVRAA 166 Q +D G GA TRPRNIAFNYIVRAA Sbjct: 361 LRAAINVETVGAQDLDSNGWQFGYGA----TRPRNIAFNYIVRAA 401 >UniRef50_Q6D3Y6 Probable bacteriophage variable tail fiber protein H n=2 Tax=Pectobacterium atrosepticum RepID=Q6D3Y6_ERWCT Length = 536 Score = 130 bits (327), Expect = 2e-29, Method: Compositional matrix adjust. Identities = 75/164 (45%), Positives = 89/164 (54%), Gaps = 16/164 (9%) Query: 8 ALPVGVPVPWPSATPPTGWLKCNGAAFSAEEYPELAKAYPTNKLPDLRGEFIRGWDDGRG 67 AL G+P PWP AT P GWLKCNG +F +P LA AYP+ LPDLRGEFIRGWDDGRG Sbjct: 384 ALTAGMPKPWPRATAPAGWLKCNGQSFDISAFPHLAAAYPSGVLPDLRGEFIRGWDDGRG 443 Query: 68 IDTGRSILSIQGYATEDHAHGLPSRSTIVTDATINFYFDEIWVNSGT-----DIIKRGNT 122 +D+GRS+LS Q A + I T A + E +SG + Sbjct: 444 VDSGRSLLSAQSDAIRNIV------GEIWTSAVSQQFLGETLSSSGVFELLYEFAVGAIP 497 Query: 123 NDAGLPAPDYGTFKTYKQSVDGLGAAASETRPRNIAFNYIVRAA 166 + AG P F + A+E RPRNIAFNYIVRAA Sbjct: 498 DAAGNSCPSRMRFDASRAV-----PTAAENRPRNIAFNYIVRAA 536 >UniRef50_C6C5D4 Tail Collar domain protein n=1 Tax=Dickeya dadantii Ech703 RepID=C6C5D4_DICDC Length = 557 Score = 129 bits (324), Expect = 3e-29, Method: Compositional matrix adjust. 
Identities = 70/155 (45%), Positives = 86/155 (55%), Gaps = 22/155 (14%) Query: 12 GVPVPWPSATPPTGWLKCNGAAFSAEEYPELAKAYPTNKLPDLRGEFIRGWDDGRGIDTG 71 G+P+PWP AT PTGWLKCNG +F YP+L AYP+ LPDLRGEFIRGWDDGRG+D+G Sbjct: 425 GIPLPWPQATAPTGWLKCNGQSFDKTLYPKLTAAYPSGTLPDLRGEFIRGWDDGRGVDSG 484 Query: 72 RSILSIQGYATEDHAHGLPSRSTIVTDATINFYFDEIWVNSGTDIIKRGNTNDAGLPAPD 131 R++LS+Q DAT + I N+ I+ N D+ + Sbjct: 485 RAVLSVQ-------------------DAT--WIQPNIESNTAATTIRIDNV-DSTFNTDE 522 Query: 132 YGTFKTYKQSVDGLGAAASETRPRNIAFNYIVRAA 166 Y A S RPRN+AFNYIVRAA Sbjct: 523 YSAVSNLPSYEHNGSRARSYVRPRNVAFNYIVRAA 557 >UniRef50_D2BYH6 Tail Collar domain protein n=1 Tax=Dickeya dadantii Ech586 RepID=D2BYH6_DICD5 Length = 198 Score = 127 bits (319), Expect = 1e-28, Method: Compositional matrix adjust. Identities = 69/155 (44%), Positives = 85/155 (54%), Gaps = 16/155 (10%) Query: 11 VGVPVPWPSATPPTGWLKCNGAAFSAEEYPELAKAYPTNKLPDLRGEFIRGWDDGRGIDT 70 VG+P WP A P GWLKCNG AF +YP+LAK YP LPDLRGEFIRGWDDGRG+DT Sbjct: 59 VGIPQAWPLADAPEGWLKCNGQAFDKTKYPQLAKLYPAGTLPDLRGEFIRGWDDGRGVDT 118 Query: 71 GRSILSIQGYATEDHAHGLPSRSTIVTDATINFYFDEIWVNSGTDIIKRGNTNDAGLPAP 130 R ILS Q E H H +P + + Y ++ + D + G + L + Sbjct: 119 NRQILSAQSGMLESHNHMMPVSDPSKWNGAVYGYANDQPSANIEDFSQSGVSTSRELTSL 178 Query: 131 DYGTFKTYKQSVDGLGAAASETRPRNIAFNYIVRA 165 G +ETRPRNIAF+YIV+A Sbjct: 179 TGG----------------NETRPRNIAFSYIVKA 197 >UniRef50_Q7N5C0 Similarities with DNA inversion product n=5 Tax=Photorhabdus RepID=Q7N5C0_PHOLL Length = 239 Score = 127 bits (319), Expect = 1e-28, Method: Compositional matrix adjust. 
Identities = 70/161 (43%), Positives = 94/161 (58%), Gaps = 23/161 (14%) Query: 7 SALPVGVPVPWPSATPPTGWLKCNGAAFSAEEYPELAKAYPTNKLPDLRGEFIRGWDDGR 66 S++PVG P+PWP + PP+G+ CNG+AFS +YP+LA+AYP ++PDLRGEFIRGWDDGR Sbjct: 99 SSIPVGSPIPWPLSHPPSGYFTCNGSAFSRSQYPKLAEAYPDGRIPDLRGEFIRGWDDGR 158 Query: 67 GIDTGRSILSIQGYATE--DHAHGLPSRSTIVTDATINFYFDEIWVNSGTDIIKRGNTND 124 G+D+GR ILS Q T+ GLP + ++ D G D++ Sbjct: 159 GVDSGRVILSAQTDNTKRIQLTKGLPDGQFL---SSYQGPVDRYQFPLGRDVL------- 208 Query: 125 AGLPAPDYGTFKTYKQSVDGLGAAASETRPRNIAFNYIVRA 165 + T + + G ETRPRNIAFNYIV+A Sbjct: 209 ------ESATVTSIANNTGG-----HETRPRNIAFNYIVKA 238 >UniRef50_C6CP84 Tail Collar domain protein n=2 Tax=Dickeya RepID=C6CP84_DICZE Length = 646 Score = 126 bits (316), Expect = 3e-28, Method: Compositional matrix adjust. Identities = 70/155 (45%), Positives = 88/155 (56%), Gaps = 11/155 (7%) Query: 12 GVPVPWPSATPPTGWLKCNGAAFSAEEYPELAKAYPTNKLPDLRGEFIRGWDDGRGIDTG 71 G+P+PWP AT PTGWLKCNG +F + YP LA+ YP+ LPDLRGEFIRGWDDGRG+D Sbjct: 503 GIPLPWPQATAPTGWLKCNGQSFDKKLYPRLAQVYPSGVLPDLRGEFIRGWDDGRGVDNN 562 Query: 72 RSILSIQGYATEDHAHGLPSRSTIVTDATINFYFDEIWVNSGTDIIKRGNTNDAGLPAPD 131 R +LS QG + S ++ D + + + I N+N G Sbjct: 563 RGLLSSQGDTIRNIVA-----SFVMDDQAVTINAPTGAMFPSSQIAYDANSNVGG----T 613 Query: 132 YGTFKTYKQSVDGLGAAASETRPRNIAFNYIVRAA 166 G + S + A+E RPRNIAFNYIVRAA Sbjct: 614 MGFNVVFDAS--RVVPTANENRPRNIAFNYIVRAA 646 >UniRef50_C4UEH4 Variable tail fiber protein n=3 Tax=Yersinia RepID=C4UEH4_YERAL Length = 387 Score = 125 bits (314), Expect = 5e-28, Method: Compositional matrix adjust. 
Identities = 71/157 (45%), Positives = 92/157 (58%), Gaps = 9/157 (5%) Query: 10 PVGVPVPWPSATPPTGWLKCNGAAFSAEEYPELAKAYPTNKLPDLRGEFIRGWDDGRGID 69 P+G+P+P+P TPP G+LKCNGAAF YP LA YPT+KLPDLRGEFIRG+DDGRGID Sbjct: 238 PIGIPLPYPGTTPPAGYLKCNGAAFYPYRYPTLATLYPTHKLPDLRGEFIRGFDDGRGID 297 Query: 70 TGRSILSIQGYATEDHAHGLPSRS-TIVTDATINFYFDEIWVNSGTDIIKRGNTNDAGLP 128 T R++LS Q A ++ G+ S ++ A NF +G ND Sbjct: 298 TSRTLLSAQTDALQNITGGINGVSESLGIAAESNF--------TGAFAKAESVGNDNTPH 349 Query: 129 APDYGTFKTYKQSVDGLGAAASETRPRNIAFNYIVRA 165 D ++ + A+ETRPRNI+F YI+RA Sbjct: 350 HTDITHCGSFDFDASRVVRTAAETRPRNISFCYILRA 386 >UniRef50_C6CP88 Tail Collar domain protein n=5 Tax=Enterobacteriaceae RepID=C6CP88_DICZE Length = 485 Score = 124 bits (311), Expect = 1e-27, Method: Compositional matrix adjust. Identities = 68/159 (42%), Positives = 86/159 (54%), Gaps = 22/159 (13%) Query: 12 GVPVPWPSATPPTGWLKCNGAAFSAEEYPELAKAYPTNKLPDLRGEFIRGWDDGRGIDTG 71 G+P+PWP A PTGWLKCNG AF YP LA+ YP+ LPDLRGEFIRGWDDGRG+D+G Sbjct: 345 GIPLPWPQAAVPTGWLKCNGQAFDKNRYPRLAQVYPSGVLPDLRGEFIRGWDDGRGVDSG 404 Query: 72 RSILSIQGYATEDHAHGLPSRSTIVTDATINFYFDEIWVNSGTDIIKRGNTNDAGLPAPD 131 R +LS Q R ++ IN+ + S + + + A + Sbjct: 405 REVLSQQ-------------RGSL-----INYDGPDSAPTSDSLRLSVSAAQADAVSASE 446 Query: 132 YG----TFKTYKQSVDGLGAAASETRPRNIAFNYIVRAA 166 Y ++ Y + TRPRNIAFNYIVRAA Sbjct: 447 YAGVMLSYTAYNITTVSAAGYVGATRPRNIAFNYIVRAA 485 >UniRef50_C6DA10 Tail Collar domain protein n=7 Tax=Enterobacteriaceae RepID=C6DA10_PECCP Length = 689 Score = 123 bits (309), Expect = 2e-27, Method: Compositional matrix adjust. 
Identities = 70/155 (45%), Positives = 89/155 (57%), Gaps = 12/155 (7%) Query: 12 GVPVPWPSATPPTGWLKCNGAAFSAEEYPELAKAYPTNKLPDLRGEFIRGWDDGRGIDTG 71 G+P+P+P A PTGWLKCNG +F +YP LA YP+ LPDLRGEF+RGWDDGRG D Sbjct: 547 GIPLPFPGAVAPTGWLKCNGQSFDKSQYPILASRYPSGVLPDLRGEFVRGWDDGRGADAS 606 Query: 72 RSILSIQGYATEDHAHGLPSRSTIVTDATINFYFDEIWVNSGTDIIKRGNTNDAGLPAPD 131 R++LS QG A R+ + T +N + D K + +GL + Sbjct: 607 RALLSAQGDAI---------RNIVGTIGQLNDRVNTTETAGVFDANKYTGAH-SGLTGGN 656 Query: 132 YGTFKTYKQSVDGLGAAASETRPRNIAFNYIVRAA 166 G T+ S + A+E RPRNIAFNYIVRAA Sbjct: 657 GGRIATFDAS--KVVPTAAENRPRNIAFNYIVRAA 689 >UniRef50_D2BT14 Tail Collar domain protein n=1 Tax=Dickeya dadantii Ech586 RepID=D2BT14_DICD5 Length = 534 Score = 122 bits (307), Expect = 3e-27, Method: Compositional matrix adjust. Identities = 72/159 (45%), Positives = 91/159 (57%), Gaps = 16/159 (10%) Query: 11 VGVPVPWPSATPPTGWLKCNGAAFSAEEYPELAKAYPTNKLPDLRGEFIRGWDDGRGIDT 70 VG+P+P+P AT P GWLKCNG +F+ +P LA+ YP+ LPDLRGEFIRGWDD RG+D Sbjct: 389 VGIPLPYPGATAPDGWLKCNGQSFNKAAFPLLAQRYPSGFLPDLRGEFIRGWDDSRGVDP 448 Query: 71 GRSILSIQGYATEDHAHGLPSRSTIVTDATINFYFDEIWVNSGTDIIKRGNTNDAGLPAP 130 GR +LS Q H+HG V D + +++ + G+ + DA Sbjct: 449 GRGLLSFQESQNLTHSHG-------VNDPGHSHPYNKYEGSVGSGLAGFDYDQDAWNATV 501 Query: 131 DYGTFKTYKQSVDGLGAAAS---ETRPRNIAFNYIVRAA 166 G T G+ AAS E RPRNIAFNYIVRAA Sbjct: 502 YTGHVGT------GISIAASGGHEARPRNIAFNYIVRAA 534 >UniRef50_C6C5D2 Tail Collar domain protein n=2 Tax=Dickeya dadantii RepID=C6C5D2_DICDC Length = 498 Score = 122 bits (306), Expect = 4e-27, Method: Compositional matrix adjust. 
Identities = 75/177 (42%), Positives = 92/177 (51%), Gaps = 34/177 (19%) Query: 11 VGVPVPWPSATPPTGWLKCNGAAFSAEEYPELAKAYPTNKLPDLRGEFIRGWDDGRGIDT 70 VG+P+PWP AT PTGWLKCNG +F YP+LA YP+ LPDLRGEFIRGWDDGRG+D Sbjct: 335 VGIPLPWPQATAPTGWLKCNGQSFDKALYPKLATVYPSGVLPDLRGEFIRGWDDGRGVDA 394 Query: 71 GRSILSIQGYATEDHAHGLPSRSTIVTDATINFY---FDEIWVNSGT-----DIIKRGNT 122 GR+IL+ Q T + +++ D I V G D + + Sbjct: 395 GRAILTAQ-------------NPTYLRTGMMDYNGSDVDNIGVYIGMGYAEADTAAKSIS 441 Query: 123 NDAG-LPAPDYGTFKTYKQSVDGLGAAAS------------ETRPRNIAFNYIVRAA 166 AG AP+ +G+ AS TRPRNIAFNYIVRAA Sbjct: 442 APAGAFRAPNNIDLTEQASRDNGVNGTASNTVYASEGSVWVSTRPRNIAFNYIVRAA 498 >UniRef50_Q7N2Q1 Similar to Sc/SvQ protein of Escherichia coli plasmid p15B n=1 Tax=Photorhabdus luminescens subsp. laumondii RepID=Q7N2Q1_PHOLL Length = 478 Score = 120 bits (301), Expect = 1e-26, Method: Compositional matrix adjust. Identities = 69/161 (42%), Positives = 87/161 (54%), Gaps = 20/161 (12%) Query: 9 LPVGVPVPWPSATPPTGWLKCNGAAFSAEEYPELAKAYPTNKLPDLRGEFIRGWDDGRGI 68 + VG P+PWP P G+L CNG +F+ YP+LA AYP+ LPDLRGEFIRGWDDGRG+ Sbjct: 335 ISVGSPIPWPLPNVPAGYLACNGQSFNKSLYPQLAIAYPSGVLPDLRGEFIRGWDDGRGV 394 Query: 69 DTGRSILSIQGYATED---HAHGLPSRSTIVTDATINFYFDEIWVNSGTDIIKRGNTNDA 125 D GR +L+ QG A + + G R + ++ N TD+ ++ DA Sbjct: 395 DRGRGVLTHQGDAIRNITGYTPGTILRGNNSYGGCFSLSGEKAPGNEYTDVWQKQVLFDA 454 Query: 126 GLPAPDYGTFKTYKQSVDGLGAAASETRPRNIAFNYIVRAA 166 P ASE RPRNIAFNYIVRAA Sbjct: 455 SRVVP-----------------VASENRPRNIAFNYIVRAA 478 >UniRef50_C6C6Z0 Tail Collar domain protein n=1 Tax=Dickeya dadantii Ech703 RepID=C6C6Z0_DICDC Length = 183 Score = 120 bits (300), Expect = 2e-26, Method: Compositional matrix adjust. 
Identities = 71/162 (43%), Positives = 84/162 (51%), Gaps = 32/162 (19%) Query: 11 VGVPVPWPSATPPTGWLKCNGAAFSAEEYPELAKAYPTNKLPDLRGEFIRGWDDGRGIDT 70 +G+P PWP A P GWLKCNG AF +YPELAK YP+ LPDLRGEFIRGWDDGRG+DT Sbjct: 46 IGIPQPWPLADAPEGWLKCNGQAFDTAKYPELAKCYPSGTLPDLRGEFIRGWDDGRGVDT 105 Query: 71 GRSILSIQG--YATEDHAHGLPSRSTIVTDATINFYFDEIWVNSGTDIIKRGNTNDAGLP 128 R ++S Q Y T D + PS I GN + + Sbjct: 106 SRELVSAQSGTYITGD-SDSQPSVQGI------------------------GNITECHVD 140 Query: 129 APDYGTFKTY---KQSVDGLGAAA--SETRPRNIAFNYIVRA 165 +PD Y D L TRPRNI+FNYIV+A Sbjct: 141 SPDSNARSIYWIPATKTDRLTGPTYWGVTRPRNISFNYIVKA 182 >UniRef50_C7BSQ6 Putative tail fiber protein n=1 Tax=Photorhabdus asymbiotica subsp. asymbiotica ATCC 43949 RepID=C7BSQ6_PHOAA Length = 318 Score = 118 bits (296), Expect = 6e-26, Method: Compositional matrix adjust. Identities = 48/70 (68%), Positives = 57/70 (81%) Query: 9 LPVGVPVPWPSATPPTGWLKCNGAAFSAEEYPELAKAYPTNKLPDLRGEFIRGWDDGRGI 68 +PVGVP+PWP+A PPTGWL+CNGAAF ++P+L AY + LPDLRGEFIRGWD RG+ Sbjct: 219 IPVGVPIPWPTAIPPTGWLQCNGAAFDKSKFPQLVAAYSSGVLPDLRGEFIRGWDSSRGV 278 Query: 69 DTGRSILSIQ 78 DT RSILS Q Sbjct: 279 DTNRSILSTQ 288 >UniRef50_C4S5W0 Putative uncharacterized protein n=2 Tax=Yersinia bercovieri ATCC 43970 RepID=C4S5W0_YERBE Length = 388 Score = 117 bits (292), Expect = 2e-25, Method: Compositional matrix adjust. 
Identities = 69/160 (43%), Positives = 91/160 (56%), Gaps = 20/160 (12%) Query: 11 VGVPVPWPSATPPTGWLKCNGAAFSAEEYPELAKAYPTNKLPDLRGEFIRGWDDGRGIDT 70 +G+P+P+P + P G+LKCNGAAFS YP+LA YP+ LPD+RG IRGWDDGRG+D Sbjct: 243 IGIPIPYPLPSVPVGYLKCNGAAFSTVTYPKLALKYPSGVLPDMRGNAIRGWDDGRGVDA 302 Query: 71 GRSILSIQGYATED-----HAHGLPSRSTIVTDATINFYFDEIWVNSGTDIIKRGNTNDA 125 GR++LS Q A ++ + G + +VT T F E++ G + GN Sbjct: 303 GRALLSQQLDALQNITGNFYMGGSKQVAGVVT--TGAFGPMEVYNALGNQVTTAGNIGGI 360 Query: 126 GLPAPDYGTFKTYKQSVDGLGAAASETRPRNIAFNYIVRA 165 TF + S A+ETR RNIAFNYIVRA Sbjct: 361 --------TFDASRVS-----RTAAETRMRNIAFNYIVRA 387 >UniRef50_Q7N348 Similarities with tail fiber protein n=1 Tax=Photorhabdus luminescens subsp. laumondii RepID=Q7N348_PHOLL Length = 440 Score = 116 bits (291), Expect = 2e-25, Method: Compositional matrix adjust. Identities = 66/158 (41%), Positives = 83/158 (52%), Gaps = 13/158 (8%) Query: 9 LPVGVPVPWPSATPPTGWLKCNGAAFSAEEYPELAKAYPTNKLPDLRGEFIRGWDDGRGI 68 +P GVP+P+P P G+L CNG F YP+LA+AYP ++PDLRGEFIRGWDD RG+ Sbjct: 296 VPAGVPMPYPHRYTPPGYLTCNGQTFDKSLYPKLAEAYPAGRVPDLRGEFIRGWDDSRGV 355 Query: 69 DTGRSILSIQGYATEDHAHGLPSRSTIVTDATINFYFDEIWVNSGTDIIKRGNTNDAGLP 128 D GR + Q DH H + +V D + D W S + + + Sbjct: 356 DPGRVCGTWQADCIPDHNHYKVASKQLVEDLVLT--GDAGWYTSSGSSTRTRSLDQ---- 409 Query: 129 APDYGTFKTYKQSVDGLGAAASETRPRNIAFNYIVRAA 166 TY V A+ETRPRNIAFNYIVRA Sbjct: 410 -------NTYTGGVTEAQVIANETRPRNIAFNYIVRAV 440 >UniRef50_A7FIU0 Tail collar domain protein n=1 Tax=Yersinia pseudotuberculosis IP 31758 RepID=A7FIU0_YERP3 Length = 402 Score = 116 bits (290), Expect = 3e-25, Method: Compositional matrix adjust. 
Identities = 69/171 (40%), Positives = 89/171 (52%), Gaps = 43/171 (25%) Query: 10 PVGVPVPWPSATPPTGWLKCNGAAFSAEEYPELAKAYPTNKLPDLRGEFIRGWDDGRGID 69 PVG+P+PWP+ PP+GWLKCNGA F+ ++P+LA Y LPDLRGEFIRGWDDG+ D Sbjct: 259 PVGIPMPWPAHIPPSGWLKCNGATFNKAQFPQLASVYTRGVLPDLRGEFIRGWDDGKLAD 318 Query: 70 TGRSILSIQ------GYATED---------HAHGLPSRSTIVTDATINFYFDEIWVNSGT 114 GR +LS Q GY D ++ G + T +IN + W+ +G Sbjct: 319 PGRGLLSFQEGTVVGGYDDNDTGDISSIGLYSSGFGDQLTNTQWVSIN---GKRWITAGV 375 Query: 115 DIIKRGNTNDAGLPAPDYGTFKTYKQSVDGLGAAASETRPRNIAFNYIVRA 165 I+ ++ Y A TRPRNIAFNYIVRA Sbjct: 376 SSIR----------------YEWY---------AYLSTRPRNIAFNYIVRA 401 >UniRef50_D0KLI8 Tail Collar domain protein n=1 Tax=Pectobacterium wasabiae WPP163 RepID=D0KLI8_PECWW Length = 621 Score = 116 bits (290), Expect = 3e-25, Method: Compositional matrix adjust. Identities = 75/167 (44%), Positives = 96/167 (57%), Gaps = 18/167 (10%) Query: 7 SALPVGVPVPWPSATPPTGWLKCNGAAFSAEEYPELAKAYPTNKLPDLRGEFIRGWDDGR 66 SA VG+P +P A P GWLKCNG F +YP LA YP+ LPDLRGEF+RGWDD R Sbjct: 466 SAELVGMPQVFPGAVAPAGWLKCNGQQFDTAQYPILASRYPSGFLPDLRGEFVRGWDDER 525 Query: 67 GIDTGRSILSIQGYATED-----HAHGLPSRSTIVTDATINFYFDEIWVNSGTDIIKRGN 121 G+D GR++LS QG A + A +P T DA D ++ D G+ Sbjct: 526 GVDAGRALLSEQGDAIRNITGTMRASDVPYGHTQFVDA---LKADGVFAPIAGDKSWTGD 582 Query: 122 TN-DAGLPAPDYG-TFKTYKQSVDGLGAAASETRPRNIAFNYIVRAA 166 ++ +AG P +G +F T + + A+E RPRNIAFNYIVRAA Sbjct: 583 SSGNAGNP---WGVSFDTSR-----VVPTANENRPRNIAFNYIVRAA 621 >UniRef50_C6CG98 Tail Collar domain protein n=1 Tax=Dickeya zeae Ech1591 RepID=C6CG98_DICZE Length = 196 Score = 116 bits (290), Expect = 3e-25, Method: Compositional matrix adjust. 
Identities = 68/161 (42%), Positives = 82/161 (50%), Gaps = 30/161 (18%) Query: 11 VGVPVPWPSATPPTGWLKCNGAAFSAEEYPELAKAYPTNKLPDLRGEFIRGWDDGRGIDT 70 +G+P PWP A P GWLKCNG F +YP+LAK YP LPDLRGEFIRGWDD RG+DT Sbjct: 59 IGIPQPWPLAEAPEGWLKCNGQTFDTAKYPQLAKLYPAGTLPDLRGEFIRGWDDERGVDT 118 Query: 71 GRSILSIQ-GYATEDHAHGLPSRSTIVTDATINFYFDEIWVNSGTDIIKRGNTNDAGLPA 129 R +LS Q G G P+ ++I GN ++ Sbjct: 119 DRKLLSAQAGTHILGDDGGYPTLNSI------------------------GNLSECNADK 154 Query: 130 PDYGTFKTYKQSVDGLGAAASE-----TRPRNIAFNYIVRA 165 PD Y + ASE TRPRNIAF+YIV+A Sbjct: 155 PDGNVRTLYWLDTNKSEKLASEKFWGATRPRNIAFSYIVKA 195 >UniRef50_Q65WH4 Putative uncharacterized protein n=1 Tax=Mannheimia succiniciproducens MBEL55E RepID=Q65WH4_MANSM Length = 296 Score = 115 bits (287), Expect = 6e-25, Method: Compositional matrix adjust. Identities = 69/169 (40%), Positives = 88/169 (52%), Gaps = 23/169 (13%) Query: 9 LPVGVPVPWPSATPPTGWLKCNGAAFSAEEYPELAKAYPTNKLPDLRGEFIRGWDDGRGI 68 L +G+P P+P + P G L NG FS YPELAK YP+ +LPDLRGEFIRGWD+GRG+ Sbjct: 138 LLIGIPFPYPLSAVPDGCLAFNGQTFSTTTYPELAKKYPSGRLPDLRGEFIRGWDNGRGV 197 Query: 69 DTGRSILSIQGYATEDHAHGLPSR----------STIVTDATINFYFDEIWVNSGTDIIK 118 D+ R +L QG H H + + I T + IN + W+ SG D + Sbjct: 198 DSSRELLRSQGAELSAHTHYVTVTRYANSSGEFGAKISTFSAIN---NSGWLLSGADGLL 254 Query: 119 RGNTNDAGLPAPDYGTFKTYKQSVDGL--GAAASETRPRNIAFNYIVRA 165 L A G + K SV L +ETRPRN+AF YI A Sbjct: 255 --------LAANKSGEIVSEKNSVANLISNTGGNETRPRNVAFQYICLA 295 >UniRef50_C4TTI1 Tail fiber protein n=2 Tax=Yersinia RepID=C4TTI1_YERKR Length = 402 Score = 115 bits (287), Expect = 7e-25, Method: Compositional matrix adjust. 
Identities = 67/156 (42%), Positives = 86/156 (55%), Gaps = 32/156 (20%) Query: 11 VGVPVPWPSATPPTGWLKCNGAAFSAEEYPELAKAYPTNKLPDLRGEFIRGWDDGRGIDT 70 +G P+PWP P G+LKCNGA F+ +YP+LA AYP+ LPDLRGEFIRG+DDGRG+ Sbjct: 277 IGTPIPWPLTIAPAGYLKCNGAPFNKTQYPKLALAYPSGVLPDLRGEFIRGFDDGRGVRP 336 Query: 71 GRSILSIQGYATEDHAHGLPSRSTI-VTDATINFYFDEIWVNSGTDIIKRGNTNDAGLPA 129 + +L QG + H HG+ + VT N +F T+ I +TN++G Sbjct: 337 NQPLLGWQGSEIQSHNHGITNFEIRGVTGGPTNAWFPS------TNGI---STNNSG--- 384 Query: 130 PDYGTFKTYKQSVDGLGAAASETRPRNIAFNYIVRA 165 ETRPRNIAFNYIVRA Sbjct: 385 -------------------GDETRPRNIAFNYIVRA 401 >UniRef50_C5BH14 Tail fiber n=2 Tax=Enterobacteriaceae RepID=C5BH14_EDWI9 Length = 593 Score = 114 bits (285), Expect = 1e-24, Method: Compositional matrix adjust. Identities = 65/156 (41%), Positives = 83/156 (53%), Gaps = 14/156 (8%) Query: 10 PVGVPVPWPSATPPTGWLKCNGAAFSAEEYPELAKAYPTNKLPDLRGEFIRGWDDGRGID 69 PVG P PWP+ + P+GW+KC G +FS YPELAKAYP +LPDLRGEFIRG+DD G D Sbjct: 451 PVGTPQPWPNTSIPSGWIKCAGQSFSTSSYPELAKAYPNGRLPDLRGEFIRGYDDYGGTD 510 Query: 70 TGRSILSIQGYATEDHAHGLPSRSTIVTDATINFYFDEIWVNSGTDIIKRGNTNDAGLPA 129 + R ILS QG A + + T F D+ + T + + + Sbjct: 511 SQRQILSWQGDA--------------MRNITGTFGVDDQTIEQVTGVFREYGRFSYDARS 556 Query: 130 PDYGTFKTYKQSVDGLGAAASETRPRNIAFNYIVRA 165 G + + A+E RPRNIAF YIVRA Sbjct: 557 ERNGAGRIIYFDASQVVPTANENRPRNIAFLYIVRA 592 >UniRef50_Q7NAA0 Complete genome; segment 1/17 n=2 Tax=Photorhabdus RepID=Q7NAA0_PHOLL Length = 351 Score = 113 bits (282), Expect = 2e-24, Method: Compositional matrix adjust. 
Identities = 64/163 (39%), Positives = 90/163 (55%), Gaps = 19/163 (11%) Query: 11 VGVPVPWPSATPPTGWLKCNGAAFSAEEYPELAKAYPTNKLPDLRGEFIRGWDDGRGIDT 70 VG+P+PW T P G+L C+G F YP+L +AYP+ LPDLRGEFIRGWD+GR ID+ Sbjct: 201 VGIPLPWSKPTAPAGYLICSGQQFDKSMYPKLGEAYPSGALPDLRGEFIRGWDNGRSIDS 260 Query: 71 GRSILSIQGYATEDHAHGLPSRSTIVTDATINFY----FDEIWVNSGTDIIKRGNTNDAG 126 GR ILS Q ++ LP+ T I + N ++I+ + Sbjct: 261 GREILSHQ------NSTKLPNLYTHAASENIGLLVSPPINHFSSNYPSEIMA------SD 308 Query: 127 LPAPDYGTFKTYKQSVDGLGAAASET---RPRNIAFNYIVRAA 166 ++G+ + + ++ G+ + T RPRNIAFNYIVRAA Sbjct: 309 FEEAEFGSGQYFSTPLNPTGSVSLSTFRVRPRNIAFNYIVRAA 351 >UniRef50_Q7N6T1 Similarities with DNA inversion product n=2 Tax=Photorhabdus luminescens subsp. laumondii RepID=Q7N6T1_PHOLL Length = 300 Score = 112 bits (280), Expect = 4e-24, Method: Compositional matrix adjust. Identities = 75/168 (44%), Positives = 100/168 (59%), Gaps = 29/168 (17%) Query: 9 LPVGVPVPWPSATPPTGWLKCNGAAFSAEEYPELAKAYPTNKLPDLRGEFIRGWDDGRGI 68 +PVG P+PWP PP G+L CNG+AF+ +YP+LA+AYP +LPDLRGEFIRGWDDGRG+ Sbjct: 152 IPVGSPIPWPLPYPPVGYLTCNGSAFNKLQYPKLAEAYPDGRLPDLRGEFIRGWDDGRGV 211 Query: 69 DTGRSILSIQGYATEDHAHGLPSRSTIVTDATINFYFDEIWVNSGTDIIKRGNTNDAGLP 128 D GR++LS QG A + R T +A +G ++ R + + +G+ Sbjct: 212 DMGRTMLSWQGDAMQ--------RMTGFLEA-----------GNGIGLMTRPHDSTSGVF 252 Query: 129 AP-DYGTFK-------TYKQSVDG--LGAAASETRPRNIAFNYIVRAA 166 D T +Y S D + A+ETRPRNIAFNY+VRAA Sbjct: 253 LEGDLRTISHVTQNGTSYAVSFDSSRVARTANETRPRNIAFNYVVRAA 300 >UniRef50_Q6D2U8 Bacteriophage tail fiber protein n=1 Tax=Pectobacterium atrosepticum RepID=Q6D2U8_ERWCT Length = 619 Score = 112 bits (279), Expect = 6e-24, Method: Compositional matrix adjust. 
Identities = 68/171 (39%), Positives = 91/171 (53%), Gaps = 20/171 (11%) Query: 8 ALPV----GVPVPWPSATPPTGWLKCNGAAFSAEEYPELAKAYPTNKLPDLRGEFIRGWD 63 ALP G+P+P+P A P G+LKCNG F ++P LA YP+ LPDLRGEF+RGWD Sbjct: 457 ALPTSELAGIPLPFPGAVAPAGYLKCNGQQFDTAQFPVLASRYPSGFLPDLRGEFVRGWD 516 Query: 64 DGRGIDTGRSILSIQGYATEDHAHGL--------PSRSTIVTDATINFYFDEIWVNSGTD 115 DGRGIDT R+++S QG A + L P +T + + Y++ T+ Sbjct: 517 DGRGIDTVRALMSAQGDAIRNIVGSLFYGYDADVPVLNTNSSSGAL--YYEMSTALRDTE 574 Query: 116 IIKRGNTNDAGLPAPDYGTFKTYKQSVDGLGAAASETRPRNIAFNYIVRAA 166 + T+ + K + A+E RPRNIAFNYIVRAA Sbjct: 575 SLLSLVTDSVA------NNWYPAKLDASRVVPTATENRPRNIAFNYIVRAA 619 >UniRef50_D2U2G6 Phage tail assembly protein n=1 Tax=Arsenophonus nasoniae RepID=D2U2G6_9ENTR Length = 580 Score = 111 bits (277), Expect = 1e-23, Method: Compositional matrix adjust. Identities = 68/160 (42%), Positives = 88/160 (55%), Gaps = 14/160 (8%) Query: 10 PVGVPVPWPSATPPTGWLKCNGAAFSAEEYPELAKAYPTNKLPDLRGEFIRGWDDGRGID 69 PVG P+PWP ATPP G+ C+G F +YP+LA AYP+ KLP L GEFIRG D GR +D Sbjct: 432 PVGAPIPWPQATPPNGYFVCDGNYFDKAKYPQLALAYPSGKLPLLYGEFIRGLDLGRKVD 491 Query: 70 TGRSILSIQGYATEDHAHGLPSRSTIVTDATINFYFDEIWVNSGTDIIKRG---NTNDAG 126 GR++LS QG D + R E V +G + +R N N A Sbjct: 492 PGRTVLSNQG----DAIRNITGRIGYARHGGT-----EPPVVNGEGVFRRDSNHNVNIAN 542 Query: 127 LPAPDYGTFKTYKQSVDGLGAAASETRPRNIAFNYIVRAA 166 D+G+ ++ S + A+E RPRN+AF YIVRAA Sbjct: 543 GRGDDWGSVMSFNAS--RVVPTANENRPRNVAFLYIVRAA 580 >UniRef50_D2U1K0 Phage tail protein n=1 Tax=Arsenophonus nasoniae RepID=D2U1K0_9ENTR Length = 366 Score = 110 bits (274), Expect = 2e-23, Method: Compositional matrix adjust. 
Identities = 66/157 (42%), Positives = 82/157 (52%), Gaps = 9/157 (5%) Query: 10 PVGVPVPWPSATPPTGWLKCNGAAFSAEEYPELAKAYPTNKLPDLRGEFIRGWDDGRGID 69 PVG P+PWP ATPP G+L CNG F + P+L AYP+ KLPDLRG FIRGWD G+G+D Sbjct: 219 PVGAPIPWPQATPPKGYLICNGEPFDKVKCPKLLIAYPSGKLPDLRGYFIRGWDAGKGVD 278 Query: 70 TGRSILSIQGYATEDHAHGLPSRSTIVTDATINFYFDEIWVNSGTDIIKRGNTNDAGLPA 129 GR + S Q ED + R + + I N A Sbjct: 279 PGREVFSYQ----EDAIRNITGRIGFARRGGAE---PPVSADGAFVITDWCNVRVADGAN 331 Query: 130 PDYGTFKTYKQSVDGLGAAASETRPRNIAFNYIVRAA 166 D+G ++ S + A+E RPRNIAFNYIVR A Sbjct: 332 DDWGGVASFDPS--RVVPTANENRPRNIAFNYIVREA 366 >UniRef50_C7BL21 Similarities with phage tail fiber protein n=1 Tax=Photorhabdus asymbiotica subsp. asymbiotica ATCC 43949 RepID=C7BL21_PHOAA Length = 452 Score = 107 bits (267), Expect = 1e-22, Method: Compositional matrix adjust. Identities = 60/157 (38%), Positives = 80/157 (50%), Gaps = 8/157 (5%) Query: 10 PVGVPVPWPSATPPTGWLKCNGAAFSAEEYPELAKAYPTNKLPDLRGEFIRGWDDGRGID 69 PVG P+P+P P G+L CNG F YP+LA+AYP+ ++PDLRGEFIRGWDD RG+D Sbjct: 304 PVGAPIPYPHRYTPVGYLTCNGQTFDKSLYPKLAEAYPSGRVPDLRGEFIRGWDDSRGVD 363 Query: 70 TGRSILSIQGYATEDHAHGLPSRSTIVTDATINFYFDEIWVNSGTDIIKRGNTNDAGLPA 129 GR S Q + H H + + + K+ T G+ Sbjct: 364 PGRVCGSWQDSDNKAHIHD--------DEFCYGGGDAGGDSGTMSAFAKKYCTPKDGVNG 415 Query: 130 PDYGTFKTYKQSVDGLGAAASETRPRNIAFNYIVRAA 166 + + L + +E RPRN+AFNYIVRAA Sbjct: 416 RPTSGWLPASAGLHSLPSGGNEARPRNVAFNYIVRAA 452 >UniRef50_UPI0001A44C27 bacteriophage tail fiber protein n=2 Tax=Pectobacterium carotovorum subsp. carotovorum WPP14 RepID=UPI0001A44C27 Length = 195 Score = 105 bits (263), Expect = 4e-22, Method: Compositional matrix adjust. 
Identities = 49/87 (56%), Positives = 59/87 (67%), Gaps = 1/87 (1%) Query: 1 VGLGEGSALPVGVPVPWPSATPPTGWLKCNGAAFSAEEYPELAKAYPTNKLPDLRGEFIR 60 +G +G+ L VG+P P P T P GWL C G +F YP LA YP +LPDLRGEFIR Sbjct: 79 IGAIQGNEL-VGIPQPCPLVTAPEGWLACAGQSFDTSRYPVLASRYPQGRLPDLRGEFIR 137 Query: 61 GWDDGRGIDTGRSILSIQGYATEDHAH 87 GWD+GRG+DTGR LS Q ++TE H H Sbjct: 138 GWDNGRGVDTGRGNLSSQSFSTEPHTH 164 >UniRef50_B6VKW8 Sc/svq protein n=2 Tax=Photorhabdus asymbiotica subsp. asymbiotica ATCC 43949 RepID=B6VKW8_PHOAA Length = 316 Score = 102 bits (253), Expect = 6e-21, Method: Compositional matrix adjust. Identities = 65/163 (39%), Positives = 88/163 (53%), Gaps = 21/163 (12%) Query: 3 LGEGSALPVGVPVPWPSATPPTGWLKCNGAAFSAEEYPELAKAYPTNKLPDLRGEFIRGW 62 + + S +PVG P+PWP PP G++ CNG+AF+ +YP+LA+AYP +LPDLRGEFIRGW Sbjct: 174 IKKTSEIPVGSPIPWPLPHPPFGYVTCNGSAFNRSQYPKLAEAYPNGRLPDLRGEFIRGW 233 Query: 63 DDGRGIDTGRSILSIQGYATEDHAHGLPSRSTIVTDATINFYFDEIWVNSGTDIIKRGNT 122 DDGRG D GR +LS Q + ++ Y +I +R Sbjct: 234 DDGRGADNGRKLLSWQ------------------EGSALSEYLGSFTTGVAQNIHQR--- 272 Query: 123 NDAGLPAPDYGTFKTYKQSVDGLGAAASETRPRNIAFNYIVRA 165 + D+ +K + G G RPRNIAFNYIV+A Sbjct: 273 DGVTYHDKDHKRYKIPSLEIIGTGVDYFRFRPRNIAFNYIVKA 315 >UniRef50_UPI00019136B5 bacteriophage tail fiber protein n=7 Tax=Salmonella enterica subsp. enterica serovar Typhi RepID=UPI00019136B5 Length = 137 Score = 97.4 bits (241), Expect = 1e-19, Method: Compositional matrix adjust. 
Identities = 61/146 (41%), Positives = 78/146 (53%), Gaps = 33/146 (22%) Query: 39 YPELAKAYPTNKLPDLRGEFIRGWDDGRGIDTGRSILSIQGYATEDHAH----------- 87 YP LAKAYPTNKLPDLRGEFIRGWDDGRG+D GR++L +Q + E H H Sbjct: 2 YPNLAKAYPTNKLPDLRGEFIRGWDDGRGVDAGRALLRLQDDSFEAHRHESFFYAGISRN 61 Query: 88 -----GLPSRSTIVTDATINFYFDEIWVNSGTDIIKRGNTNDAGLPAPDYGTF----KTY 138 LPS ++T ++ +++ +I GN DY K Sbjct: 62 EIPLKNLPSSDEMLTLSSTTNALSPDGIDATNSLI--GND--------DYNCLIEGNKNN 111 Query: 139 KQSVDGLGAA---ASETRPRNIAFNY 161 K++ GL + A+ETRPRNIAFNY Sbjct: 112 KRTATGLSTSIVGATETRPRNIAFNY 137 >UniRef50_B3HKW0 Phage Tail Collar Domain protein n=11 Tax=Enterobacteriaceae RepID=B3HKW0_ECOLX Length = 164 Score = 95.9 bits (237), Expect = 4e-19, Method: Compositional matrix adjust. Identities = 72/174 (41%), Positives = 84/174 (48%), Gaps = 32/174 (18%) Query: 1 VGLGEGSALPVGVPVPWPSATPPTG---------WLKCNGAAFSAEEYPELAKAYPTNKL 51 VGLGEG A +GVP WPSA P +LK NGA FSA +YP LAK +P+ L Sbjct: 13 VGLGEG-APAIGVPFFWPSAAMPNTVIDSWSGMVFLKFNGAKFSATDYPVLAKVFPSLVL 71 Query: 52 PDLRGEFIRGWDDGRGIDTGRSILSIQGYATEDHAHGLPSRSTIVTDATINFYFDEIWVN 111 P+ RG+FIR WDDGRG D GR +LS Q G INF+ D I N Sbjct: 72 PEARGDFIRIWDDGRGADGGRELLSWQEATNFSQFAGNIGGG---AGHAINFH-DGIAGN 127 Query: 112 SGTDIIKRGNTNDAGLPAPDYGTFKTYKQSVDGLGAAASETRPRNIAFNYIVRA 165 P + F SV G G RPRNIAFN++VRA Sbjct: 128 Q-----------------PGFSRFNFTSNSV-GDGVNFVAVRPRNIAFNFLVRA 163 >UniRef50_Q1I687 Putative phage variable tail fibre protein n=1 Tax=Pseudomonas entomophila L48 RepID=Q1I687_PSEE4 Length = 898 Score = 95.5 bits (236), Expect = 6e-19, Method: Compositional matrix adjust. 
Identities = 68/190 (35%), Positives = 94/190 (49%), Gaps = 39/190 (20%) Query: 3 LGEGSALPVGVPVPWPSATPPTGWLKCNGAAFSAEEYPELAK----AYPTN-------KL 51 L SALPVG +P+P T P G+L+ +G+ SA YP+LA A+ T +L Sbjct: 378 LNTASALPVGTMLPFPRGTVPAGFLEVDGSTQSAAVYPDLAAYLGGAFNTGNEAAGFFRL 437 Query: 52 PDLRGEFIRGWDDGRGIDTGRSILSIQGYATEDHAHGLPSRSTIVTDATINFYFDEIWVN 111 PD RGEF+RGWD GRG+D+GR++ S QG + + H H + F D + Sbjct: 438 PDTRGEFLRGWDHGRGVDSGRAVGSTQGESFKAHTH-----------KDVGF-IDNVGGG 485 Query: 112 SGTDIIKRGNTNDAGLPAPDYG-----TFKTYKQSVDG-LGAAA----------SETRPR 155 SG + + + YG T K YK+S G LG A SETRPR Sbjct: 486 SGASAVTGATGDVTSIYGKAYGNSASATAKAYKESAPGALGGAIAGLISGSTGDSETRPR 545 Query: 156 NIAFNYIVRA 165 N+A + ++A Sbjct: 546 NLAVMWCIKA 555 Score = 71.6 bits (174), Expect = 9e-12, Method: Compositional matrix adjust. Identities = 55/178 (30%), Positives = 87/178 (48%), Gaps = 30/178 (16%) Query: 7 SALPVGVPVPWPSATPPTGWLKCNGAAFSAEEYPELAKAYPTN-----------KLPDLR 55 S+ PVG +P+P A P G+L+ +G+ S YP+LA + +LPD R Sbjct: 579 SSTPVGAILPFPKAEVPAGYLELDGSLQSVATYPDLAAYLGASYNNGTEPAGYFRLPDYR 638 Query: 56 GEFIRGWDDGRGIDTGRSILSIQGYATEDHAHGLPSRS------TIVTDATINF--YFDE 107 GEF+RGWD GRG+D GR + + Q A ++ + R ++ A+ F F E Sbjct: 639 GEFLRGWDHGRGVDPGRGMGTSQSDAIQNITGSIGLRGGAGVGLGVMGGASGAFSTVFGE 698 Query: 108 IWVNSGTDIIKRGNTNDAGLPAPDYGTFKTYKQSVDGLGAAASETRPRNIAFNYIVRA 165 ++ + I R + + + A D F K + AA+ETRPRN + + ++A Sbjct: 699 ---STSANTITR---DASSIAASDIARFDASK-----VVRAAAETRPRNQSVMWCIKA 745 >UniRef50_C6BT48 Tail Collar domain protein n=1 Tax=Desulfovibrio salexigens DSM 2638 RepID=C6BT48_DESAD Length = 208 Score = 94.7 bits (234), Expect = 9e-19, Method: Compositional matrix adjust. 
Identities = 64/167 (38%), Positives = 84/167 (50%), Gaps = 32/167 (19%) Query: 10 PVGVPVPWPSATPPTGWLKCNGAAFSAEEYPELAKAYPTNKLPDLRGEFIRGWDDGRGID 69 P+G + TPP GWL+CNG S YPELA N +PDLRGEFIRG D GRG+D Sbjct: 62 PIGAVAAYRGDTPPVGWLECNGQ--STTGYPELAAVVGAN-VPDLRGEFIRGLDSGRGVD 118 Query: 70 TGRSILSIQGYATEDHAHGLPSRSTIVTDATINFYFDEIWVNSGTDIIKRGNTNDAG--- 126 GR++ S Q A E H+H ++TI SG + + AG Sbjct: 119 AGRALGSAQADAMERHSH----QTTITV--------------SGRTSVTASPYHSAGAAR 160 Query: 127 --LPAPDYGT-FKTYKQSVDGLGAAAS-----ETRPRNIAFNYIVRA 165 + P++G+ F S G G + S ETRPRN+A YI++A Sbjct: 161 SLVTTPNFGSPFGGASFSASGTGTSTSVGSGAETRPRNVALMYIIKA 207 >UniRef50_B7UGJ3 Predicted tai fiber protein n=15 Tax=Escherichia coli RepID=B7UGJ3_ECO27 Length = 221 Score = 94.7 bits (234), Expect = 9e-19, Method: Compositional matrix adjust. Identities = 70/176 (39%), Positives = 89/176 (50%), Gaps = 36/176 (20%) Query: 1 VGLGEGSALPVGVPVPWPSATPP---------TGWLKCNGAAFSAEEYPELAKAYPTNKL 51 +GLGEG A +GVP WPSA P +LK NGA FSA +YP LAK +P+ L Sbjct: 70 LGLGEG-APAIGVPFFWPSAAMPDTVIESWSGMVFLKFNGAKFSATDYPVLAKVFPSLVL 128 Query: 52 PDLRGEFIRGWDDGRGIDTGRSILSIQGYATEDHAHG-LPSRS-TIVTDATINFYFDEIW 109 P+ RG+FIR WDDGRG D+GR++LS Q + G P S + D +D I Sbjct: 129 PEARGDFIRIWDDGRGADSGRALLSWQAATSLSQFGGNYPEGSGHAIAD------YDGIS 182 Query: 110 VNSGTDIIKRGNTNDAGLPAPDYGTFKTYKQSVDGLGAAASETRPRNIAFNYIVRA 165 + P + F+ SV G G RPRNIAFN++VRA Sbjct: 183 AHE-----------------PGFSRFQYTSNSV-GDGVNFVAVRPRNIAFNFLVRA 220 >UniRef50_Q4KHC6 Tail fibre protein, putative n=1 Tax=Pseudomonas fluorescens Pf-5 RepID=Q4KHC6_PSEF5 Length = 369 Score = 89.4 bits (220), Expect = 4e-17, Method: Compositional matrix adjust. 
Identities = 61/174 (35%), Positives = 85/174 (48%), Gaps = 32/174 (18%) Query: 3 LGEGSALPVGVPVPWPSATPPTGWLKCNGAAFSAEEYPELAKAYPTN-----------KL 51 L SALPVG VP+P T P G+L+ +G+ SA YP+LA T +L Sbjct: 107 LKNMSALPVGAMVPFPKGTVPAGFLEVDGSVQSAATYPDLAAYLGTMFNTGGEGAGNFRL 166 Query: 52 PDLRGEFIRGWDDGRGIDTGRSILSIQGYATEDHAHGLPSRSTIVTDATINFYFDEIWVN 111 P+ RGEF+RGWD GRG+D GR++ S Q +A H H + + T + + W + Sbjct: 167 PESRGEFLRGWDHGRGVDVGRALGSYQAHAVGSHQHPMNYWAWRDGTGTGTHNYAKPWGD 226 Query: 112 SGTDIIKRGNTNDAGLPAPDYGTFKTYKQSVDGLGAAASETRPRNIAFNYIVRA 165 +G +K D GT G A SETRPRN+A + ++A Sbjct: 227 TGITGVK------------DPGT---------GANAGDSETRPRNLAVMWCIKA 259 >UniRef50_B2FIY3 Putative phage collar protein n=1 Tax=Stenotrophomonas maltophilia K279a RepID=B2FIY3_STRMK Length = 410 Score = 81.6 bits (200), Expect = 9e-15, Method: Compositional matrix adjust. Identities = 41/84 (48%), Positives = 52/84 (61%), Gaps = 10/84 (11%) Query: 9 LPVGVPVPWPSATPPTGWLKCNGAAFSAEEYPELAKAY---------PTNKLPDLRGEFI 59 LP G+ +P+ PP GWL+CNGA S Y +L T +LPDLRGEF+ Sbjct: 254 LPAGMVAHFPTGGPPPGWLRCNGADVSRTTYADLFAVIGTLFGSANDMTFRLPDLRGEFV 313 Query: 60 RGWDDGRGIDTGRSILSIQGYATE 83 RGWDDGRG+D GR++ S+Q ATE Sbjct: 314 RGWDDGRGVDGGRALGSLQA-ATE 336 >UniRef50_Q3KH70 Putative phage tail fiber-related protein n=1 Tax=Pseudomonas fluorescens Pf0-1 RepID=Q3KH70_PSEPF Length = 817 Score = 81.6 bits (200), Expect = 9e-15, Method: Compositional matrix adjust. 
Identities = 59/175 (33%), Positives = 87/175 (49%), Gaps = 27/175 (15%) Query: 6 GSALPVGVPVPWPSATPPTGWLKCNGAAFSAEEYPELAKAYPTN-----------KLPDL 54 GSA+PVG + +P+ P G+L+ NG+ + YP+LA T +LP+ Sbjct: 505 GSAVPVGAVMAFPTGIVPPGFLELNGSVQNTSTYPDLAAYLGTTYNKGDEGAGNFRLPES 564 Query: 55 RGEFIRGWDDGRGIDTGRSILSIQGYATEDHAHGLPSRSTIVTDA--TINFYFDEIWVNS 112 RGEF+RGWD GRG+D GR I + QG + DH H + + DA +N + V S Sbjct: 565 RGEFLRGWDHGRGVDAGRGIGTNQGQSMVDHYH-----TVLTADAGGVLNPIAGNL-VGS 618 Query: 113 GTDIIKRGNTNDAGLPAPDYGTFKTYKQSVDGLGA--AASETRPRNIAFNYIVRA 165 T++ AG+ T S+ G A +ETRPRN+A + ++A Sbjct: 619 FTNLAPISKPAGAGVLG------ATLTSSIHGPAAEKGGTETRPRNLAVMWCIKA 667 Score = 81.3 bits (199), Expect = 1e-14, Method: Compositional matrix adjust. Identities = 60/176 (34%), Positives = 91/176 (51%), Gaps = 23/176 (13%) Query: 3 LGEGSALPVGVPVPWPSATPPTGWLKCNGAAFSAEEYPELAKAYPTNK------------ 50 + + SALPVG V +P +PP G+L+ + + S+ YP+L+ AY K Sbjct: 317 IAKASALPVGSIVAFPVDSPPPGFLELDNSVKSSATYPDLS-AYLGGKFNKGDEGVGNFR 375 Query: 51 LPDLRGEFIRGWDDGRGIDTGRSILSIQGYATEDHAHGLPSRSTIVTDATINFYFDEIWV 110 LP+ RGEF+RGWD GRG+D GR+ S Q + + H H +P+ S N + + Sbjct: 376 LPEARGEFLRGWDHGRGVDGGRAQGSSQTDSLKAHYHLIPTGSGGGQAVDPNGEIPTVVL 435 Query: 111 -NSGTDIIKRGNTNDAGLPAPDYGTFKTYKQSVDGLGAAASETRPRNIAFNYIVRA 165 ++ D + R ++A L G +TY AA+ETRPRNIA + ++A Sbjct: 436 KDTAADWVLRTEGDNAEL---SIGRVRTYNF------GAATETRPRNIAVMWCIKA 482 >UniRef50_B5S308 Phage tail collar protein n=2 Tax=Ralstonia solanacearum RepID=B5S308_RALSO Length = 225 Score = 79.7 bits (195), Expect = 3e-14, Method: Compositional matrix adjust. 
Identities = 57/168 (33%), Positives = 76/168 (45%), Gaps = 22/168 (13%) Query: 11 VGVPVPWPSATPPTGWLKCNGAAFSAEEYPELAKAYPTN----------KLPDLRGEFIR 60 G + TPP GWLKCNGAA S Y L K T LP+LR EF R Sbjct: 65 TGSVAMFACKTPPAGWLKCNGAAVSRTTYERLFKLIGTTFGAGDGAATFNLPELRAEFPR 124 Query: 61 GWDDGRGIDTGRSILSIQGYATEDHAHGLPSRSTIVTDATINFYFDEIWVNSGTDI---- 116 GWDDGRG+D+GR+ S Q A H H ++ + D + F + + S T I Sbjct: 125 GWDDGRGVDSGRAFGSSQAQALSSHQH----KTAVGFDGSNLFGWGD---GSATPIFGSE 177 Query: 117 IKRGNTNDAGLPAPDYGTFKTYKQSVDGLGAAASETRPRNIAFNYIVR 164 ++ G G G + V +G + ETRPRN+A ++ Sbjct: 178 VQAGVLRVVGAVTQSGGAARIGYTDVTPMGVSG-ETRPRNVALLACIK 224 >UniRef50_Q7N047 Similarities with bacteriophage protein n=3 Tax=Photorhabdus RepID=Q7N047_PHOLL Length = 602 Score = 78.6 bits (192), Expect = 6e-14, Method: Compositional matrix adjust. Identities = 39/77 (50%), Positives = 50/77 (64%), Gaps = 1/77 (1%) Query: 9 LPVGVPVPWPSATP-PTGWLKCNGAAFSAEEYPELAKAYPTNKLPDLRGEFIRGWDDGRG 67 +P+G + W S P P G+ G AF A +YPELAK +P KLPD RG F RG D GRG Sbjct: 458 VPIGATIEWHSTAPIPAGYEPNEGRAFRAADYPELAKIFPDLKLPDDRGLFKRGLDRGRG 517 Query: 68 IDTGRSILSIQGYATED 84 +D+GRS+ S+QG A + Sbjct: 518 LDSGRSLGSVQGDAIRN 534 >UniRef50_Q99362 Protein 37 (Fragment) n=1 Tax=Enterobacteria phage T4 RepID=Q99362_BPT4 Length = 382 Score = 78.6 bits (192), Expect = 8e-14, Method: Compositional matrix adjust. Identities = 34/88 (38%), Positives = 52/88 (59%), Gaps = 6/88 (6%) Query: 7 SALPVGVPVPWPSATPPTGWLKCNGAAFSAEEYPELAKAYPTNKLPDLRGEFIRGWDDGR 66 S+ P+G P+PWP+ TPP G+ G F YP+LA AYP+ +PD+RG+ I+G Sbjct: 130 SSYPIGAPIPWPTDTPPNGYALMEGQTFDTRAYPKLAAAYPSGTIPDMRGQTIKGK---- 185 Query: 67 GIDTGRSILSIQGYATEDHAHGLPSRST 94 +GR++LS + + H HG + +T Sbjct: 186 --PSGRAVLSTEADGVKSHTHGASASNT 211 >UniRef50_D0KGE3 Tail Collar domain protein n=1 Tax=Pectobacterium wasabiae WPP163 RepID=D0KGE3_PECWW Length = 144 Score = 78.2 bits (191), Expect = 8e-14, Method: Compositional matrix adjust. 
Identities = 54/152 (35%), Positives = 78/152 (51%), Gaps = 13/152 (8%) Query: 17 WPSATPPTGWLKCNGAAFSAEEYPELAKAYPTNKLPDLRGEFIRGWDDGRGI--DTGRSI 74 W + PP GWL+ NG F+ P LA YP++++PD RG F RGWD+G GI D+ RS+ Sbjct: 2 WGTPVPPEGWLELNGQLFNPSGNPVLADLYPSSRVPDFRGYFPRGWDNGAGIDPDSSRSV 61 Query: 75 LSIQGYATEDHAHGLPSRSTIVTDATINFYFDEIWVNSGTDIIKRGNTNDAGLPAPDYGT 134 LS Q H H + + S A F + ++ +IK T PD G+ Sbjct: 62 LSYQDDEIISHKHAI-TMSHEHHGAADGAGFPQ--TDASGPMIKHAETE------PD-GS 111 Query: 135 FKTYKQSVDGLGA-AASETRPRNIAFNYIVRA 165 F + + + + SETRP NIA +I++A Sbjct: 112 FPERSGAGNPMFSFGGSETRPHNIAVMFIIKA 143 >UniRef50_C3KCU2 Putative phage tail fiber-related protein n=1 Tax=Pseudomonas fluorescens SBW25 RepID=C3KCU2_PSEFS Length = 658 Score = 77.4 bits (189), Expect = 2e-13, Method: Compositional matrix adjust. Identities = 59/174 (33%), Positives = 79/174 (45%), Gaps = 36/174 (20%) Query: 3 LGEGSALPVGVPVPWPSATPPTGWLKCNGAAFSAEEYPELAKAYPTN-----------KL 51 + + SALPVG V +P P G+L+ +G+ SA YP+LAK T +L Sbjct: 174 IAQSSALPVGSMVAFPIDKVPVGFLEIDGSVKSATAYPDLAKFLGTAFNKGDEGAGNFRL 233 Query: 52 PDLRGEFIRGWDDGRGIDTGRSILSIQGYATEDHAHGLPSRSTIVTDATINFYFDEIWVN 111 P+ RGEF+RGWD GRG+D GR S Q + H H T+ N D I Sbjct: 234 PESRGEFLRGWDHGRGVDAGRLAGSYQTDQFKSHTH---EYDTMQGGGAANSVSDTIAAQ 290 Query: 112 SGTDIIKRGNTNDAGLPAPDYGTFKTYKQSVDGLGAAASETRPRNIAFNYIVRA 165 S T+ G + G GA SETRPRN+A + ++A Sbjct: 291 SNA-------TSQTG--------------HITG-GAGGSETRPRNLAVMWCIKA 322 Score = 68.6 bits (166), Expect = 8e-11, Method: Compositional matrix adjust. 
Identities = 39/103 (37%), Positives = 56/103 (54%), Gaps = 11/103 (10%) Query: 7 SALPVGVPVPWPSATPPTGWLKCNGAAFSAEEYPELAKAYPTN-----------KLPDLR 55 SA+PVG +P+ A P G+L+ +G+ S YP+LA T +LP+ R Sbjct: 346 SAVPVGSIIPFLKAAVPPGYLELDGSVQSIATYPDLAAYLGTTFNTGSEPAGYFRLPESR 405 Query: 56 GEFIRGWDDGRGIDTGRSILSIQGYATEDHAHGLPSRSTIVTD 98 GEF+RGWD GRG+D GR + S Q + +P+ TI T+ Sbjct: 406 GEFLRGWDHGRGMDAGREVGSWQKGSMVAVDTNIPATQTIATN 448 >UniRef50_C3X8Y3 Putative uncharacterized protein n=1 Tax=Oxalobacter formigenes OXCC13 RepID=C3X8Y3_OXAFO Length = 270 Score = 77.0 bits (188), Expect = 2e-13, Method: Compositional matrix adjust. Identities = 52/167 (31%), Positives = 74/167 (44%), Gaps = 35/167 (20%) Query: 8 ALPVGVPVPWPSATPPTGWLKCNGAAFSAEEYPELAKA----------YPTNKLPDLRGE 57 A+P G V + S P G+LK +G+A EEY EL A T LPDLRGE Sbjct: 128 AVPAGTVVYFCSHKAPYGYLKADGSAVGREEYKELFAAIGVYFGSGDGVSTFNLPDLRGE 187 Query: 58 FIRGWDDGRGIDTGRSILSIQGYATEDHAHGLPSRSTIVTDATINFYFDEIWVNSGTDII 117 FIR D+GRG+D GR + ++Q + H HG R + ++ + W ++ Sbjct: 188 FIRSLDNGRGVDAGRELGNVQMDEFKSHYHGFLDRPNMRLESGVY-----TWTPQVMEVA 242 Query: 118 KRGNTNDAGLPAPDYGTFKTYKQSVDGLGAAASETRPRNIAFNYIVR 164 + + S+ A SETRPRNIA ++ Sbjct: 243 E--------------------QDSISTTRAGGSETRPRNIALLACIK 269 >UniRef50_Q7P172 Probable bacteriophge tail fiber protein n=1 Tax=Chromobacterium violaceum RepID=Q7P172_CHRVO Length = 591 Score = 76.3 bits (186), Expect = 3e-13, Method: Compositional matrix adjust. 
Identities = 59/159 (37%), Positives = 74/159 (46%), Gaps = 20/159 (12%) Query: 20 ATPPTGWLKCNGAAFSAEEYPELAKAY----------PTNKLPDLRGEFIRGWDDGRGID 69 + PP GWLK NGAA S ++YP L A T LPDLRGEF+RGWDDGRG+D Sbjct: 438 SAPPLGWLKANGAAVSRKDYPSLFAALGTYYGAGDGSTTFNLPDLRGEFVRGWDDGRGVD 497 Query: 70 TGRSILSIQ-GYAT-EDHAHGLPSRSTIV--TDATINFYFDEIWVNSGTDIIKRGNTNDA 125 GR + Q G T D + P +++V D T+ Y D G D + + N D Sbjct: 498 NGRGFGTWQKGTLTFSDPSLTSPCVASLVHRNDNTVIGYLDL-----GADPVDK-NKYDL 551 Query: 126 GLPAPDYGTFKTYKQSVDGLGAAASETRPRNIAFNYIVR 164 GL G TRPRNIA ++ Sbjct: 552 GLSVSTANGVYLPDLDSGGWANGYGSTRPRNIALLACIK 590 >UniRef50_P03764 Side tail fiber protein n=2 Tax=root RepID=STF_LAMBD Length = 774 Score = 75.9 bits (185), Expect = 5e-13, Method: Compositional matrix adjust. Identities = 37/87 (42%), Positives = 51/87 (58%), Gaps = 5/87 (5%) Query: 1 VGLGEGSALPVGVPVPWPSATPPTGWLKCNGAAFSAEEYPELAKAYPTNKLPDLRGEFIR 60 +G GE SA P G P+PWPS P+G++ G AF YP+LA AYP+ LPD+RG I+ Sbjct: 522 LGAGENSAFPAGAPIPWPSDIVPSGYVLMQGQAFDKSAYPKLAVAYPSGVLPDMRGWTIK 581 Query: 61 GWDDGRGIDTGRSILSIQGYATEDHAH 87 G +GR++LS + + H H Sbjct: 582 GKP-----ASGRAVLSQEQDGIKSHTH 603 >UniRef50_C8PDQ5 Phage Tail Collar Domain protein n=1 Tax=Campylobacter gracilis RM3268 RepID=C8PDQ5_9PROT Length = 391 Score = 75.1 bits (183), Expect = 8e-13, Method: Compositional matrix adjust. 
Identities = 58/175 (33%), Positives = 81/175 (46%), Gaps = 25/175 (14%) Query: 3 LGEGSALPVGVPVPWPSATPPTGWLKCNGAAFSAEEYPELAKAYPTN----------KLP 52 L + LPVG + P G+L CNGAA S Y +L A T +P Sbjct: 228 LLSSTILPVGTIITSARTPAPDGFLLCNGAAISRSAYTDLFSAIGTAYGAGDGSSSFNIP 287 Query: 53 DLRGEFIRGWDDGRGIDTGRSILSIQGYATED---HAHGLPSRSTIVTDATINFYFDEIW 109 DLRGEFIRG D+GRG+D GR++ S QG A + A G+ R++I T Sbjct: 288 DLRGEFIRGADNGRGVDGGRALGSAQGDAIRNITARAIGMGDRNSIPT-----------L 336 Query: 110 VNSGTDIIKRGNTNDAGLPAPDYGTFKTYKQSVDGLGAAASETRPRNIAFNYIVR 164 + + I K G D G F+ + + A+E RPRN+A N+ ++ Sbjct: 337 LGALYGIQKSTRIESVGDVLGDGGYFE-WGFDASKVVPVANENRPRNVAVNFYIK 390 >UniRef50_Q31Q92 Putative uncharacterized protein n=2 Tax=Synechococcus elongatus RepID=Q31Q92_SYNE7 Length = 387 Score = 74.7 bits (182), Expect = 9e-13, Method: Compositional matrix adjust. Identities = 46/111 (41%), Positives = 57/111 (51%), Gaps = 26/111 (23%) Query: 8 ALPVGVPVPWPSATPPTGWLKCNGAAFSAEEYPEL---AKA-----------------YP 47 A+P GV + TPPTG++K NGA S Y L A+A Y Sbjct: 235 AVPAGVAIWVTGNTPPTGYIKANGALLSRTTYARLWAYAQASGNIVSDAAWTGGATGSYS 294 Query: 48 TN------KLPDLRGEFIRGWDDGRGIDTGRSILSIQGYATEDHAHGLPSR 92 T ++PDLRGEFIRGW DGR +DTGR+I S Q + HAH L +R Sbjct: 295 TGDGSTTFRVPDLRGEFIRGWADGRSVDTGRAIGSTQADELKAHAHYLDTR 345 >UniRef50_D0KGE5 Tail Collar domain protein n=4 Tax=Pectobacterium RepID=D0KGE5_PECWW Length = 157 Score = 74.7 bits (182), Expect = 1e-12, Method: Compositional matrix adjust. 
Identities = 56/161 (34%), Positives = 79/161 (49%), Gaps = 19/161 (11%) Query: 7 SALPVGVPVPWPSATPPTGWLKCNGAAFSAEEYPELAKAYPTNKLPDLRGEFIRGWDDGR 66 S++ G W + PP GWL+ NG F+ P LA YP++++PD RG F RGWD+G Sbjct: 13 SSIQPGTITMWGTPVPPEGWLELNGQPFNPSGNPVLASLYPSSQVPDFRGYFPRGWDNGA 72 Query: 67 GID-TGRSILSIQGYATEDHAHGL-PSRSTIVTDATINFYFDEIWVNSGTDIIKRGNTND 124 GID R+ILS+QG A + P S+ + Y NSG+ ND Sbjct: 73 GIDPDSRAILSVQGDAIRNITGEFNPGGSSNWGKGVFSSYGWPYPSNSGS-------AND 125 Query: 125 AGLPAPDYGTFKTYKQSVDGLGAAASETRPRNIAFNYIVRA 165 A + TF + + A+E RP NIA +I++A Sbjct: 126 ASII-----TFDASR-----VVPTAAENRPTNIAVMFIIKA 156 >UniRef50_D0KGE1 Shufflon domain protein n=1 Tax=Pectobacterium wasabiae WPP163 RepID=D0KGE1_PECWW Length = 532 Score = 73.9 bits (180), Expect = 2e-12, Method: Compositional matrix adjust. Identities = 56/161 (34%), Positives = 79/161 (49%), Gaps = 19/161 (11%) Query: 7 SALPVGVPVPWPSATPPTGWLKCNGAAFSAEEYPELAKAYPTNKLPDLRGEFIRGWDDGR 66 S++ G W + PP GWL+ NG F+ P LA YP++++PD RG F RGWD+G Sbjct: 388 SSIQPGTITMWGTPVPPEGWLELNGQPFNPSGNPVLASLYPSSQVPDFRGYFPRGWDNGA 447 Query: 67 GID-TGRSILSIQGYATEDHAHGL-PSRSTIVTDATINFYFDEIWVNSGTDIIKRGNTND 124 GID R+ILS+QG A + P S+ + Y NSG+ ND Sbjct: 448 GIDPDSRAILSVQGDAIRNITGEFNPGGSSNWGKGVFSSYGWPYPSNSGS-------AND 500 Query: 125 AGLPAPDYGTFKTYKQSVDGLGAAASETRPRNIAFNYIVRA 165 A + TF + + A+E RP NIA +I++A Sbjct: 501 ASII-----TFDASR-----VVPTAAENRPTNIAVMFIIKA 531 >UniRef50_Q93D60 PilV n=7 Tax=Escherichia coli RepID=Q93D60_ECOLX Length = 456 Score = 73.9 bits (180), Expect = 2e-12, Method: Compositional matrix adjust. Identities = 32/51 (62%), Positives = 38/51 (74%) Query: 8 ALPVGVPVPWPSATPPTGWLKCNGAAFSAEEYPELAKAYPTNKLPDLRGEF 58 + PVG P+PWPSATPP G+L NG +FS YP+LA+AYP KLPDLR F Sbjct: 337 SYPVGSPIPWPSATPPQGYLVMNGQSFSCSRYPQLARAYPGCKLPDLRRCF 387 >UniRef50_C3X8U2 Phage Tail Collar Domain containing protein n=1 Tax=Oxalobacter formigenes OXCC13 RepID=C3X8U2_OXAFO Length = 266 Score = 73.6 bits (179), Expect = 3e-12, Method: Compositional matrix adjust. 
Identities = 54/168 (32%), Positives = 76/168 (45%), Gaps = 21/168 (12%) Query: 9 LPVGVPVPWPSATPPTGWLKCNGAAFSAEEYPELAKAYPTN----------KLPDLRGEF 58 +P G + PP G+LK +GA +YP L A T LPDLRGEF Sbjct: 107 VPTGTIAFFAMTAPPAGYLKADGAIIQRTDYPALFTAIGTTFGEGDGTTTFTLPDLRGEF 166 Query: 59 IRGWDDGRGIDTGRSILSIQGYATEDHAHGLPSRSTIVTDATINFYFDEIWVNSGTDIIK 118 IRGWD+GR ID R+ SIQG A + L +D+ +N+ W T + + Sbjct: 167 IRGWDNGRNIDCERAFGSIQGDAIRNVTGQLRYAGPQNSDSVMNYQSALQW----TSVSQ 222 Query: 119 RGNTNDAGLPAPDYGTFKTYKQSVDGLGAA--ASETRPRNIAFNYIVR 164 + + +Y Y+ + D + ASE RPRNIA ++ Sbjct: 223 KSPYSAQSSQGSNY-----YEINFDASRSVPTASENRPRNIALLACIK 265 >UniRef50_A1VSH6 Phage Tail Collar domain protein n=1 Tax=Polaromonas naphthalenivorans CJ2 RepID=A1VSH6_POLNA Length = 483 Score = 72.8 bits (177), Expect = 3e-12, Method: Compositional matrix adjust. Identities = 60/177 (33%), Positives = 76/177 (42%), Gaps = 37/177 (20%) Query: 20 ATPPTGWLKCNGAAFSAEEYPELAKA----------YPTNKLPDLRGEFIRGWDDGRGID 69 +T P GWLK NGA S Y L A + T LPDLRGEFIRGWDDGRG+D Sbjct: 311 STAPPGWLKANGAGISRTAYAALFAAIGTTFGVGDGFNTFNLPDLRGEFIRGWDDGRGVD 370 Query: 70 TGRSILSIQGYATEDH------------AHGL--PSRSTIVTDATINFYFDEIWVNSGTD 115 RS+ S Q T H AHG+ P S VT + +G + Sbjct: 371 GSRSLGSSQAGETASHGHTGSTSAAGIHAHGVNDPGHSHQVTQE--GGRNTSLAYQNGPN 428 Query: 116 IIKRGNT--------NDAGLPAPDYGTFKTYKQSVDGLGAAASETRPRNIAFNYIVR 164 RG N G+ + G + +V SETRPRN+A +++ Sbjct: 429 SAFRGEVSTLLETTRNATGIGISENGN---HSHTVTISATGGSETRPRNLALLAVIK 482 >UniRef50_C6ABW9 Phage tail collar protein n=1 Tax=Bartonella grahamii as4aup RepID=C6ABW9_BARGA Length = 370 Score = 72.8 bits (177), Expect = 4e-12, Method: Compositional matrix adjust. 
Identities = 38/85 (44%), Positives = 49/85 (57%), Gaps = 10/85 (11%) Query: 7 SALPVGVPVPWPSATPPTGWLKCNGAAFSAEEYPELAKAYPTN----------KLPDLRG 56 +++PVG + +P+ T P GWLK NGA S +Y +L T +LPDLRG Sbjct: 219 NSMPVGTVIYYPALTVPKGWLKANGALISRSDYAQLFAVIGTTYGAGDGKTTFRLPDLRG 278 Query: 57 EFIRGWDDGRGIDTGRSILSIQGYA 81 EF+RG DD R ID R+I S QG A Sbjct: 279 EFLRGVDDERNIDPNRTIGSQQGDA 303 >UniRef50_B2ZY49 Phage tail collar domain protein n=1 Tax=Ralstonia phage RSL1 RepID=B2ZY49_9CAUD Length = 498 Score = 72.8 bits (177), Expect = 4e-12, Method: Compositional matrix adjust. Identities = 63/196 (32%), Positives = 79/196 (40%), Gaps = 57/196 (29%) Query: 9 LPVGVPVPWPSATPPTGWLKCNGAAFS----AEEYPELAKAY------PTNKLPDLRGEF 58 +P G +P+ T P G+L CN AA S A Y + Y T LPDLRG F Sbjct: 319 VPPGTILPFAGTTIPAGYLACNAAAISRTGFASLYSVIGTTYGVGNGSTTFNLPDLRGVF 378 Query: 59 IRGWDDGRGIDTGRSILSIQG-------YATED--HAHGL--PSRSTIVTDATI------ 101 +RGWD+GRG D GR + QG +A D HAHG+ P S T T+ Sbjct: 379 VRGWDNGRGQDPGRVFGTYQGDAFRSHNHAVSDPGHAHGVYDPGHSHTWTLGTLRQSGGD 438 Query: 102 -------------NFYFDEIWVNSGTDIIKRGNTNDAGLPAPDYGTFKTYKQSVDGLGAA 148 F F E GT I GN G L Sbjct: 439 TSCYVPSARYGGGEFQFTETTAAVGTGIGIYGNVTGIGT-----------------LVNG 481 Query: 149 ASETRPRNIAFNYIVR 164 +ET P+N+A NYI++ Sbjct: 482 GAETTPKNVAMNYIIK 497 >UniRef50_A9DEM1 Tail protein 2 n=1 Tax=Yersinia phage PY100 RepID=A9DEM1_9CAUD Length = 255 Score = 71.6 bits (174), Expect = 8e-12, Method: Compositional matrix adjust. 
Identities = 39/98 (39%), Positives = 52/98 (53%), Gaps = 6/98 (6%) Query: 6 GSALPVGVPVPWPSATPPTGWLKCNGAAFSAEEYPELAKAYPTNKLPDLRGEFIRGWDDG 65 G+A PVG P+ WPS T P GW G F +YP LAK YP+ LPD+RG I+ D Sbjct: 68 GNATPVGAPLAWPSDTAPDGWALMIGQTFDKVKYPLLAKVYPSGVLPDMRGRVIKAKPD- 126 Query: 66 RGIDTGRSILSIQGYATEDHAHGLPSRSTIVTDATINF 103 GR++LS++ + H H + + T AT F Sbjct: 127 -----GRAVLSLEEDQVKSHTHTGKAATAGGTRATSTF 159 >UniRef50_C5J9F2 Putative uncharacterized protein (Fragment) n=1 Tax=Erwinia phage phiAT1 RepID=C5J9F2_9VIRU Length = 240 Score = 71.2 bits (173), Expect = 1e-11, Method: Compositional matrix adjust. Identities = 35/90 (38%), Positives = 50/90 (55%), Gaps = 3/90 (3%) Query: 5 EGSALPVGVPVPWPSATPPTGWLKCNGAAFSAEEYPELAKAYPTNKLPDLRGEFIRGW-- 62 E +P+G +PWP AT P GWL+C+G F+ + P+L N +PD RG F+RGW Sbjct: 149 EPRLVPIGAVIPWPGATVPDGWLECSGQVFNTGQNPKLYSVLGRNVVPDYRGLFLRGWAH 208 Query: 63 -DDGRGIDTGRSILSIQGYATEDHAHGLPS 91 D D GR++ S+QG A + P+ Sbjct: 209 GSDANDPDAGRALGSVQGDAIRNITGYFPA 238 >UniRef50_A9IRI0 Phage related protein n=9 Tax=Bartonella RepID=A9IRI0_BART1 Length = 324 Score = 70.9 bits (172), Expect = 1e-11, Method: Compositional matrix adjust. Identities = 34/89 (38%), Positives = 46/89 (51%), Gaps = 9/89 (10%) Query: 8 ALPVGVPVPWPSATPPTGWLKCNGAAFSAEEYPELAKAY---------PTNKLPDLRGEF 58 + P G + P GWL C+G A+ E+YP+L KA T K+PD RG F Sbjct: 160 SFPAGFIATFAMRNIPNGWLLCDGTAYKREDYPQLFKAIGDKWGKNSDTTFKVPDFRGMF 219 Query: 59 IRGWDDGRGIDTGRSILSIQGYATEDHAH 87 +RG+DDGRG+D R Q + + H H Sbjct: 220 LRGFDDGRGLDNDRKFADEQQDSIKSHTH 248 >UniRef50_A4TT73 Phage tail protein n=30 Tax=Enterobacteriaceae RepID=A4TT73_YERPP Length = 962 Score = 70.5 bits (171), Expect = 2e-11, Method: Compositional matrix adjust. 
Identities = 34/85 (40%), Positives = 48/85 (56%), Gaps = 6/85 (7%) Query: 10 PVGVPVPWPSATPPTGWLKCNGAAFSAEEYPELAKAYPTNKLPDLRGEFIRGWDDGRGID 69 PVG P+PWP+ P+G+ G F YP+LA AYP+ LPD+RG I+G Sbjct: 722 PVGAPIPWPNDVAPSGFAIMQGQTFDKSVYPKLAAAYPSGVLPDMRGWMIKGK------P 775 Query: 70 TGRSILSIQGYATEDHAHGLPSRST 94 T R++LS++ + HAH + ST Sbjct: 776 TSRAVLSLEQDGIKSHAHNAAASST 800 >UniRef50_B8DPV9 Tail Collar domain protein n=1 Tax=Desulfovibrio vulgaris str. 'Miyazaki F' RepID=B8DPV9_DESVM Length = 530 Score = 69.3 bits (168), Expect = 4e-11, Method: Compositional matrix adjust. Identities = 52/164 (31%), Positives = 75/164 (45%), Gaps = 19/164 (11%) Query: 9 LPVGVPVPWPSATPPTGWLKCNGAAFSAEEYPELAKAYPTN-------KLPDLRGEFIRG 61 +P+G + +P T PTG+L C G + YP+L Y T LPDLRGEF RG Sbjct: 212 VPIGAILDFPVNTVPTGFLVCAGQVVTRTAYPDLVT-YLTGGTVAVNATLPDLRGEFRRG 270 Query: 62 WDDGRGIDTGRSILSIQGYATEDHAHGLPSRSTIVTDATINFYFDEIWVNSGTDIIKRGN 121 D GRG+D GR + S QG A + +T + N+ + +G + + Sbjct: 271 ADLGRGVDAGRVVGSAQGDAIRN-----------ITGSLYNYIQNNASQENGALRTQVAS 319 Query: 122 TNDAGLPAPDYGTFKTYKQSVDGLGAAASETRPRNIAFNYIVRA 165 T ++ A ++ T ASE RPRNIA ++A Sbjct: 320 TLNSPFGAGTIMSWSTLSIDASRQVPTASENRPRNIAVVPCIKA 363 >UniRef50_A4PE45 Tail fiber protein gpH n=3 Tax=root RepID=A4PE45_9CAUD Length = 554 Score = 69.3 bits (168), Expect = 5e-11, Method: Compositional matrix adjust. Identities = 36/69 (52%), Positives = 44/69 (63%), Gaps = 10/69 (14%) Query: 20 ATPPTGWLKCNGAAFS----AEEYPELAKAY------PTNKLPDLRGEFIRGWDDGRGID 69 +T P+GWLK NGAA S A Y E+ + T LPDLRGEF+RGWDDGRG+D Sbjct: 409 STAPSGWLKANGAAVSRTTYAALYAEIGTTFGAGDGAATFNLPDLRGEFLRGWDDGRGVD 468 Query: 70 TGRSILSIQ 78 +GR I + Q Sbjct: 469 SGRGIGTWQ 477 >UniRef50_A6VBH2 Putative tail fiber protein n=2 Tax=Pseudomonas aeruginosa PA7 RepID=A6VBH2_PSEA7 Length = 654 Score = 67.8 bits (164), Expect = 1e-10, Method: Compositional matrix adjust. 
Identities = 37/93 (39%), Positives = 46/93 (49%), Gaps = 10/93 (10%) Query: 9 LPVGVPVPWPSATPPTGWLKCNGAAFSAEEYPEL----------AKAYPTNKLPDLRGEF 58 +P G V + +PP G+LK NGAA S Y L T LPD RGEF Sbjct: 465 VPAGAVVAFAMYSPPAGYLKANGAAVSRTAYAALFATIGTYYGAGDGSTTFNLPDYRGEF 524 Query: 59 IRGWDDGRGIDTGRSILSIQGYATEDHAHGLPS 91 +R DDGRG+D GR + ++Q H HG S Sbjct: 525 LRALDDGRGLDLGRQLGTLQSSQNLAHTHGASS 557 >UniRef50_C3X3G6 Putative uncharacterized protein n=1 Tax=Oxalobacter formigenes HOxBLS RepID=C3X3G6_OXAFO Length = 237 Score = 67.8 bits (164), Expect = 1e-10, Method: Compositional matrix adjust. Identities = 51/158 (32%), Positives = 66/158 (41%), Gaps = 38/158 (24%) Query: 19 SATPPTGWLKCNGAAFSAEEYPELAKA-----------YPTNKLPDLRGEFIRGWDDGRG 67 S TPP GWL +G+ YP+L A T +LPDLRGEFIR D GRG Sbjct: 105 SETPPDGWLVADGSMLLVAAYPDLFAAIGTAFGSGDNGMTTFRLPDLRGEFIRCLDKGRG 164 Query: 68 IDTGRSILSIQGYATEDHAHGLPSRSTIVTDATINFYFDEIWVNSGTDIIK-RGNTNDAG 126 +D GR + S+QG +H HG + D V G+ + + Sbjct: 165 LDDGRPLGSVQGDEIRNHNHG---------------FLDIPKVQFGSGVYSWTPQVMEVA 209 Query: 127 LPAPDYGTFKTYKQSVDGLGAAASETRPRNIAFNYIVR 164 AP T+ SETRPRNIA ++ Sbjct: 210 EHAPIATTW-----------TGGSETRPRNIALTACIK 236 >UniRef50_B7LKX7 Putative side tail phage protein n=2 Tax=Escherichia RepID=B7LKX7_ESCF3 Length = 567 Score = 67.8 bits (164), Expect = 1e-10, Method: Compositional matrix adjust. Identities = 32/80 (40%), Positives = 48/80 (60%), Gaps = 5/80 (6%) Query: 8 ALPVGVPVPWPSATPPTGWLKCNGAAFSAEEYPELAKAYPTNKLPDLRGEFIRGWDDGRG 67 + PVG+P+PWPS + P+G+ G F+ YP+LA AYP+ +PD+RG I+G Sbjct: 361 SCPVGMPIPWPSDSVPSGYALMTGQTFNKTSYPKLAIAYPSGVIPDMRGWIIKGKP---- 416 Query: 68 IDTGRSILSIQGYATEDHAH 87 +GR+ILS + + H H Sbjct: 417 -SSGRAILSTELDGVKSHNH 435 >UniRef50_A0NQ95 Putative tail fiber-related protein n=1 Tax=Labrenzia aggregata IAM 12614 RepID=A0NQ95_9RHOB Length = 329 Score = 67.4 bits (163), Expect = 2e-10, Method: Compositional matrix adjust. 
Identities = 36/86 (41%), Positives = 42/86 (48%), Gaps = 10/86 (11%) Query: 12 GVPVPWPSATPPTGWLKCNGAAFSAEEYPELAKAY----------PTNKLPDLRGEFIRG 61 G + +T P GWLK NGA S Y +L A T LPDLRGEF+RG Sbjct: 147 GCVAYYAMSTAPDGWLKANGAEISRTAYADLFAAIGTIFGVGDGNSTFNLPDLRGEFLRG 206 Query: 62 WDDGRGIDTGRSILSIQGYATEDHAH 87 WDD RG+D R + S Q H H Sbjct: 207 WDDARGVDGARVLGSSQSDQNASHTH 232 >UniRef50_B4T041 Gp19 n=1 Tax=Salmonella enterica subsp. enterica serovar Newport str. SL254 RepID=B4T041_SALNS Length = 580 Score = 66.2 bits (160), Expect = 4e-10, Method: Compositional matrix adjust. Identities = 60/192 (31%), Positives = 87/192 (45%), Gaps = 40/192 (20%) Query: 8 ALPVGVPVPWPSATPPTGWLKCNGAAFSAEEYPELAKAYPTNKLPDLRGEFIRGWDDGRG 67 + P GVP+PWPS T P G+ G AF YP LA AYP+ +PD+RG I+ G+ Sbjct: 396 SCPPGVPLPWPSDTIPAGYALMQGQAFDKNVYPLLAIAYPSGTIPDMRGWTIK----GKP 451 Query: 68 IDTGRSILSIQGYATEDHAHGLPSRST-IVTDATINFYFDEIWVNS-------------- 112 + +GR++LS + + H+HG + T + T T +F + N+ Sbjct: 452 V-SGRAVLSQELDGNKSHSHGARALDTDLGTKGTSSFDYGTKSSNTTGGHNHSAGGTYGG 510 Query: 113 ------------GTDIIKRGNTNDAG---LPAPDYGTF---KTYKQSVDGLGAAASETRP 154 G D + N + A + D+ + + VD G A ET Sbjct: 511 DSIGGKARVQRDGNDQLTSWNGDHAHTTWIGPHDHTVYIGPHGHVVIVDADGNA--ETTV 568 Query: 155 RNIAFNYIVRAA 166 +NIAFNYIVR A Sbjct: 569 KNIAFNYIVRLA 580 >UniRef50_B3XIH6 GpH n=7 Tax=Enterobacteriaceae RepID=B3XIH6_ECOLX Length = 710 Score = 65.9 bits (159), Expect = 4e-10, Method: Compositional matrix adjust. Identities = 33/87 (37%), Positives = 47/87 (54%), Gaps = 5/87 (5%) Query: 8 ALPVGVPVPWPSATPPTGWLKCNGAAFSAEEYPELAKAYPTNKLPDLRGEFIRGWDDGRG 67 + PVG +PWPS + PTG+ G F YP LA AYP+ LPD+RG I+G Sbjct: 493 SFPVGAAIPWPSDSVPTGYAVMQGQTFDKTTYPLLAAAYPSGVLPDMRGWTIKGKP---- 548 Query: 68 IDTGRSILSIQGYATEDHAHGLPSRST 94 +GR +LS++ + H H + +T Sbjct: 549 -ASGRDVLSLEQDGIKSHTHSASASNT 574 >UniRef50_B3I2W7 Phage Tail Collar Domain family n=1 Tax=Escherichia coli E22 RepID=B3I2W7_ECOLX Length = 654 Score = 65.9 bits (159), Expect = 4e-10, Method: Compositional matrix adjust. 
Identities = 33/81 (40%), Positives = 44/81 (54%), Gaps = 6/81 (7%) Query: 8 ALPVGVPVPWPSATPPTGWLKCNGAAFSAEEYPELAKAYPTNKLPDLRGEFIRGWDDGRG 67 + PVG P+PWPS PTG+ G F YP LA AYP +PD+RG+ I+G + Sbjct: 445 SYPVGAPIPWPSDVTPTGYALMQGQPFDKAVYPLLAIAYPAGIIPDMRGQTIKGKPN--- 501 Query: 68 IDTGRSILSIQGYATEDHAHG 88 GR++LS + H HG Sbjct: 502 ---GRAVLSYEQDGVISHTHG 519 >UniRef50_D0ZBI4 Putative tail fiber protein n=1 Tax=Edwardsiella tarda EIB202 RepID=D0ZBI4_EDWTE Length = 718 Score = 65.5 bits (158), Expect = 7e-10, Method: Compositional matrix adjust. Identities = 52/170 (30%), Positives = 76/170 (44%), Gaps = 33/170 (19%) Query: 11 VGVPVPWP-SATPPTGWLKCN-------GAAFSAEEYPELAKAYPTNKLP-DLRGEFIRG 61 +G +PW P W C G +F E +P+L YP N+LP D+RG RG Sbjct: 566 IGSLIPWALERMPQEIWPNCGMHFIPYMGQSFDPELFPKLHDVYPDNRLPTDMRGYTARG 625 Query: 62 WDDGRGIDTGRSILSIQGYATEDHAHGLPSRSTIVTDATINFYFDEIWV-NSGTDIIKRG 120 WD+GRGID GR++LS Q DA N W+ +G+ + G Sbjct: 626 WDNGRGIDIGRALLSYQ------------------DDAIQNITGQFGWMPFNGSSPVASG 667 Query: 121 NTNDAGLPAPDYGTFKTYKQSVDGLGAA-----ASETRPRNIAFNYIVRA 165 + + A +G + G A+ A +TR +++A+NYI RA Sbjct: 668 AFSVDKIGANVWGGGTERRDCAIGFNASNVVRTAEQTRVKSVAWNYITRA 717 >UniRef50_P76072 Side tail fiber protein homolog from lambdoid prophage Rac n=23 Tax=root RepID=STFR_ECOLI Length = 1120 Score = 65.1 bits (157), Expect = 9e-10, Method: Compositional matrix adjust. Identities = 31/69 (44%), Positives = 43/69 (62%), Gaps = 5/69 (7%) Query: 10 PVGVPVPWPSATPPTGWLKCNGAAFSAEEYPELAKAYPTNKLPDLRGEFIRGWDDGRGID 69 PVG P+PWPS T P+G+ G AF YP+LA AYP+ +PD+RG I+G Sbjct: 905 PVGAPIPWPSDTVPSGYALMQGQAFDKSAYPKLAAAYPSGVIPDMRGWTIKGKP-----A 959 Query: 70 TGRSILSIQ 78 +GR++LS + Sbjct: 960 SGRAVLSQE 968 >UniRef50_Q3ZL14 Tail fiber protein n=2 Tax=Enterobacteriaceae RepID=Q3ZL14_ESCBL Length = 289 Score = 63.9 bits (154), Expect = 2e-09, Method: Compositional matrix adjust. 
Identities = 30/79 (37%), Positives = 46/79 (58%), Gaps = 5/79 (6%) Query: 9 LPVGVPVPWPSATPPTGWLKCNGAAFSAEEYPELAKAYPTNKLPDLRGEFIRGWDDGRGI 68 LP G+ + WP AT PTG+ G F YP LA+AYP+ +PD+RG+ I+ Sbjct: 83 LPPGIALAWPGATAPTGFALMLGQTFDTTAYPRLAQAYPSGVIPDMRGQTIKFLP----- 137 Query: 69 DTGRSILSIQGYATEDHAH 87 +GR++LS++ + H+H Sbjct: 138 ASGRTLLSLEADGVKSHSH 156 >UniRef50_B7L485 Putative tail fiber protein n=4 Tax=Escherichia coli RepID=B7L485_ECO55 Length = 1056 Score = 63.9 bits (154), Expect = 2e-09, Method: Compositional matrix adjust. Identities = 30/67 (44%), Positives = 42/67 (62%), Gaps = 5/67 (7%) Query: 10 PVGVPVPWPSATPPTGWLKCNGAAFSAEEYPELAKAYPTNKLPDLRGEFIRGWDDGRGID 69 PVG P+PWPS T P+G+ G F+ YP+LA AYP+ +PD+RG I+G Sbjct: 824 PVGAPIPWPSDTVPSGYALMQGQTFNKSAYPKLAAAYPSGVIPDMRGWTIKGKP-----A 878 Query: 70 TGRSILS 76 +GR++LS Sbjct: 879 SGRAVLS 885 >UniRef50_Q3YZL1 Phage protein-related n=42 Tax=Enterobacteriaceae RepID=Q3YZL1_SHISS Length = 1029 Score = 63.2 bits (152), Expect = 3e-09, Method: Compositional matrix adjust. Identities = 30/69 (43%), Positives = 42/69 (60%), Gaps = 5/69 (7%) Query: 10 PVGVPVPWPSATPPTGWLKCNGAAFSAEEYPELAKAYPTNKLPDLRGEFIRGWDDGRGID 69 PVG P+PWPS T P+G+ G F YP+LA AYP+ +PD+RG I+G Sbjct: 798 PVGAPIPWPSDTVPSGYALMQGQTFDKSAYPKLAAAYPSGVIPDMRGWTIKGKP-----A 852 Query: 70 TGRSILSIQ 78 +GR++LS + Sbjct: 853 SGRAVLSQE 861 >UniRef50_C4GFX3 Putative uncharacterized protein n=2 Tax=Kingella oralis ATCC 51147 RepID=C4GFX3_9NEIS Length = 310 Score = 62.8 bits (151), Expect = 4e-09, Method: Compositional matrix adjust. Identities = 39/93 (41%), Positives = 45/93 (48%), Gaps = 10/93 (10%) Query: 2 GLGEGSALPVGVPVPWPSATPPTGWLKCNGAAFSAEEYPELAKA----------YPTNKL 51 G S P G + + PTGWLK NGA S Y L A + T L Sbjct: 147 GYTANSYCPSGQIGLFATDYAPTGWLKANGAVLSRTVYTNLFAAIGTRFGAGDGHSTFNL 206 Query: 52 PDLRGEFIRGWDDGRGIDTGRSILSIQGYATED 84 PDLRGEF R WDDGRG+D GR + S Q A + Sbjct: 207 PDLRGEFPRFWDDGRGVDAGRVLGSWQSDAIRN 239 >UniRef50_B5FQX9 Side tail fiber protein n=2 Tax=Salmonella enterica subsp. 
enterica RepID=B5FQX9_SALDC Length = 569 Score = 61.6 bits (148), Expect = 8e-09, Method: Compositional matrix adjust. Identities = 59/193 (30%), Positives = 85/193 (44%), Gaps = 40/193 (20%) Query: 7 SALPVGVPVPWPSATPPTGWLKCNGAAFSAEEYPELAKAYPTNKLPDLRGEFIRGWDDGR 66 ++ PVG + WPS P G+ G +F YP LA AYP+ +PD+RG I+ G+ Sbjct: 384 NSYPVGAAIAWPSDATPAGYALMQGQSFDKSAYPLLAIAYPSGIIPDMRGWTIK----GK 439 Query: 67 GIDTGRSILSIQGYATEDHAHGLPSRST-IVTDATINF-------------------YFD 106 I +GR++LS + + H+H ++ T + T +T +F Y + Sbjct: 440 PI-SGRAVLSQEMDGNKSHSHSARAQDTDLGTKSTSSFDYGTKSTNTTGNHTHQFGGYIN 498 Query: 107 EIWVNSGTDIIKRGN---TNDAGLPAPD----------YGTFKTYKQSVDGLGAAASETR 153 W +S + G T AG A Y + VD G A ET Sbjct: 499 SYWGDSNHTSFQPGGGAWTQAAGDHAHTVYIGGHEHTMYIGPHGHVVIVDADGNA--ETT 556 Query: 154 PRNIAFNYIVRAA 166 +NIAFNYIVR A Sbjct: 557 VKNIAFNYIVRLA 569 >UniRef50_Q7P176 Probable bacteriophge tail fiber protein n=1 Tax=Chromobacterium violaceum RepID=Q7P176_CHRVO Length = 435 Score = 61.6 bits (148), Expect = 9e-09, Method: Compositional matrix adjust. Identities = 34/82 (41%), Positives = 45/82 (54%), Gaps = 10/82 (12%) Query: 7 SALPVGVPVPWPSATPPTGWLKCNGAAFSAEEYPELAKAY----------PTNKLPDLRG 56 +A P G+ + P GWL +G + ++YP L A T LP+L G Sbjct: 281 AAAPAGMVAYFAMKDAPAGWLIADGRTVARKDYPALFAAIGGLYGNGDGSTTFGLPNLCG 340 Query: 57 EFIRGWDDGRGIDTGRSILSIQ 78 EFIRGWD+GRG+DTGR+I S Q Sbjct: 341 EFIRGWDNGRGVDTGRAIGSSQ 362 >UniRef50_B5PP06 Side tail fiber protein n=2 Tax=Salmonella enterica subsp. enterica RepID=B5PP06_SALHA Length = 534 Score = 61.2 bits (147), Expect = 1e-08, Method: Compositional matrix adjust. 
Identities = 35/97 (36%), Positives = 52/97 (53%), Gaps = 6/97 (6%) Query: 10 PVGVPVPWPSATPPTGWLKCNGAAFSAEEYPELAKAYPTNKLPDLRGEFIRGWDDGRGID 69 PVG P+ WPS P G+ G +F YP LA AYP+ +PD+RG I+G Sbjct: 243 PVGAPIAWPSDATPAGYALMQGQSFDKSAYPLLAIAYPSGVIPDMRGWTIKGKP-----A 297 Query: 70 TGRSILSIQGYATEDHAHGLPSRST-IVTDATINFYF 105 +GR+ILS + + H+H ++ T + T T +F + Sbjct: 298 SGRAILSQEMDGNKSHSHSARAQDTDLGTKTTSSFDY 334 >UniRef50_C5ABB4 Putative uncharacterized protein n=1 Tax=Burkholderia glumae BGR1 RepID=C5ABB4_BURGB Length = 670 Score = 60.5 bits (145), Expect = 2e-08, Method: Compositional matrix adjust. Identities = 36/108 (33%), Positives = 50/108 (46%), Gaps = 27/108 (25%) Query: 11 VGVPVPWPSATPPTGWLKCNGAAFSAEEYPEL--------------------------AK 44 +G V +P G++KC+G+ + +YP L A Sbjct: 434 IGQIVMEARTSPRAGYVKCDGSQYKRADYPALWAYAQASGALVSEAEYTDGRWGGFSTAD 493 Query: 45 AYPTNKLPDLRGEFIRGWDDGRG-IDTGRSILSIQGYATEDHAHGLPS 91 ++PDLRGEF+R W DGRG +D GR+I S QG + HAHG S Sbjct: 494 GQTYFRVPDLRGEFLRCWSDGRGDVDPGRAIGSFQGGQNQAHAHGASS 541 >UniRef50_B3X2T1 Side tail fiber protein n=1 Tax=Shigella dysenteriae 1012 RepID=B3X2T1_SHIDY Length = 488 Score = 60.1 bits (144), Expect = 3e-08, Method: Compositional matrix adjust. Identities = 29/67 (43%), Positives = 40/67 (59%), Gaps = 5/67 (7%) Query: 10 PVGVPVPWPSATPPTGWLKCNGAAFSAEEYPELAKAYPTNKLPDLRGEFIRGWDDGRGID 69 P G P+PWPS T P+G+ G F YP+LA AYP+ +PD+RG I+G Sbjct: 260 PPGAPIPWPSDTVPSGYALMQGQTFDKSAYPKLAVAYPSGVIPDMRGWTIKGKP-----A 314 Query: 70 TGRSILS 76 +GR++LS Sbjct: 315 SGRAVLS 321 >UniRef50_D0IJ09 Tail fiber protein H putative n=1 Tax=Vibrio sp. RC586 RepID=D0IJ09_9VIBR Length = 368 Score = 60.1 bits (144), Expect = 3e-08, Method: Compositional matrix adjust. 
Identities = 37/89 (41%), Positives = 45/89 (50%), Gaps = 8/89 (8%) Query: 10 PVGVPVPWPSATPPTGWLKCNGAAFSAEEYPELAKAYPTNKLPDLRGEFIRGWDDGRGID 69 PVGVP+PWPS P G+ G AF PELAK YP L DLRG + G + Sbjct: 204 PVGVPLPWPSDIAPEGFAIHKGQAFDKVANPELAKLYPDGILKDLRGMAVVGKKE----- 258 Query: 70 TGRSILSIQGYATEDHAHGLPSRSTIVTD 98 G ILS + A + HG P+ + TD Sbjct: 259 -GEIILSYE--ADQVKQHGYPNSTVSSTD 284 >UniRef50_B9BDD9 Bacteriophage protein n=3 Tax=Burkholderia RepID=B9BDD9_9BURK Length = 536 Score = 59.7 bits (143), Expect = 3e-08, Method: Compositional matrix adjust. Identities = 36/107 (33%), Positives = 47/107 (43%), Gaps = 26/107 (24%) Query: 11 VGVPVPWPSATPPTGWLKCNGAAFSAEEYPEL--------------------------AK 44 +G V P + G+LK NGA + +YP L Sbjct: 301 IGTIVFEPRTSVRAGFLKLNGALVNRSDYPALWAYAQASGALVAESAWGQNNWGCFSTGD 360 Query: 45 AYPTNKLPDLRGEFIRGWDDGRGIDTGRSILSIQGYATEDHAHGLPS 91 T +LP+LRGEF+R WDDGRG D+ R I + Q + HAHG S Sbjct: 361 GATTFRLPELRGEFLRCWDDGRGADSARGIGTFQSFQNAWHAHGASS 407 >UniRef50_Q0BEK5 Phage Tail Collar domain protein n=1 Tax=Burkholderia ambifaria AMMD RepID=Q0BEK5_BURCM Length = 735 Score = 59.3 bits (142), Expect = 5e-08, Method: Compositional matrix adjust. 
Identities = 60/214 (28%), Positives = 78/214 (36%), Gaps = 64/214 (29%) Query: 15 VPWPSATPP-TGWLKCNGAAFSAEEYPEL--------------------------AKAYP 47 + W + T P G+LK NG +YP L Sbjct: 522 IVWEARTAPRAGFLKLNGTELKRADYPLLWAYAQGSGALVADADWGKGRHGCFSSGDGNT 581 Query: 48 TNKLPDLRGEFIRGWDDGRGIDTGRSILSIQ------------GYATEDHAHGLPSRSTI 95 T +LPDLRGEFIR WDD RG D R I S Q A DH+HG + S Sbjct: 582 TFRLPDLRGEFIRCWDDARGTDAQRQIGSWQDSLNRLHAHGASAAAVGDHSHGAWTDSQG 641 Query: 96 VTDATIN--------------FYFDEIWVNSGTDIIKRGNTNDAGLPAPDYGTFKTYKQS 141 +IN Y EI +N G KR + G+ G + Sbjct: 642 WHGHSINDPGHDHGIPVASGGGYIGEINLNGGGRGDKRTTGSGTGISINGDGAHG-HNVG 700 Query: 142 VDGLGA----------AASETRPRNIAFNYIVRA 165 + G GA +E+RPRN+A ++RA Sbjct: 701 IGGAGAHSHTISIGADGGNESRPRNVALLVMIRA 734 >UniRef50_A7INV5 Tail Collar domain protein n=1 Tax=Xanthobacter autotrophicus Py2 RepID=A7INV5_XANP2 Length = 492 Score = 59.3 bits (142), Expect = 5e-08, Method: Compositional matrix adjust. Identities = 36/106 (33%), Positives = 47/106 (44%), Gaps = 10/106 (9%) Query: 12 GVPVPWPSATPPTGWLKCNGAAFSAEEYPELAKAYPTN----------KLPDLRGEFIRG 61 G WP++TPP+G L NGA S Y L T +P+ G F+RG Sbjct: 358 GTIAMWPASTPPSGALVRNGATLSRTVYASLFAVIGTTFGAGDGATTFGVPNDLGIFVRG 417 Query: 62 WDDGRGIDTGRSILSIQGYATEDHAHGLPSRSTIVTDATINFYFDE 107 WD+GRG DTGR S Q + H H + S + T F + Sbjct: 418 WDNGRGYDTGRVFGSEQADDNKSHDHARQTVSGVFTAGGAGFALQD 463 >UniRef50_B8FJJ3 Tail Collar domain protein n=1 Tax=Desulfatibacillum alkenivorans AK-01 RepID=B8FJJ3_DESAA Length = 264 Score = 58.9 bits (141), Expect = 5e-08, Method: Compositional matrix adjust. 
Identities = 51/165 (30%), Positives = 70/165 (42%), Gaps = 27/165 (16%) Query: 10 PVGVPVPWPSATPPTGWLKCNGAAFSAEEYPELAKAYPTN----------KLPDLRGEFI 59 P G V + A+ P+GWL+C+GAA S Y L T LPDLRG F+ Sbjct: 116 PTGSVVAFMGASAPSGWLECSGAAVSRTTYDNLFSVISTMYGVGDGSTTFNLPDLRGYFL 175 Query: 60 RGWDDGRGIDTGRSILSIQGYATEDHAHGLPSRSTIVTDATINFYFDEIWVNSGTDIIKR 119 RGW G G D + +G T G A+ Y DE D++ Sbjct: 176 RGWSHGSGKDPDAGSRTDRGDGTCGDYVGTRQEDEF---ASHTHYDDE-------DLL-- 223 Query: 120 GNTNDAGLPAPDYGTFKTYKQSVDGLGAAASETRPRNIAFNYIVR 164 T D G P G+ + +V +ETRP+N+A YI++ Sbjct: 224 --TFDGGGPV---GSNSSGMSAVLPGSVGGAETRPKNVAVMYIIK 263 >UniRef50_Q116W7 Phage Tail Collar n=1 Tax=Trichodesmium erythraeum IMS101 RepID=Q116W7_TRIEI Length = 671 Score = 58.2 bits (139), Expect = 1e-07, Method: Compositional matrix adjust. Identities = 49/165 (29%), Positives = 71/165 (43%), Gaps = 21/165 (12%) Query: 6 GSALPVGVPVPWPSATPPTGWLKCNGAAFSAEEYPELAKAYPTNK-LPDLRGEFIRGWDD 64 G +PVG VP+ T P GWL CNG ++ E+Y EL K K LPDL+G FI G D Sbjct: 523 GWVVPVGTIVPYAGLTAPEGWLLCNGQSYDWEQYSELYKVLDEIKVLPDLKGRFIIGVGD 582 Query: 65 GRGIDTGRSILSIQGYATEDHAHGLPSRSTIVTDATINFYFD----EIWVNSGTDIIKRG 120 GY+ +A G + T+ D + + + + Sbjct: 583 K------------DGYSYSLNAKGGEEKHTLTKDEMPSHDHSKGEYKFILKKDGKVTTSN 630 Query: 121 NTNDAGLPAPDYGTFKTYKQSVDGLGAAASETRPRNIAFNYIVRA 165 N N++ L P+ G+ + + V G E RP A NYI++ Sbjct: 631 NVNNS-LREPNLGSCEALQ--VIG-NNKPFENRPPYYALNYIIKT 671 >UniRef50_C3X912 Phage tail collar domain-containing protein n=1 Tax=Oxalobacter formigenes OXCC13 RepID=C3X912_OXAFO Length = 436 Score = 57.4 bits (137), Expect = 2e-07, Method: Compositional matrix adjust. 
Identities = 33/87 (37%), Positives = 44/87 (50%), Gaps = 10/87 (11%) Query: 8 ALPVGVPVPWPSATPPTGWLKCNGAAFSAEEYPELAKAYPTN----------KLPDLRGE 57 ++P G + + TPP G+L NGA S Y L A T +LPDLRGE Sbjct: 284 SVPAGSVHYFATQTPPDGYLVANGALVSRTVYARLFSAIGTTFGEGDGGSTFQLPDLRGE 343 Query: 58 FIRGWDDGRGIDTGRSILSIQGYATED 84 F+RGWD R +D R ++QG A + Sbjct: 344 FLRGWDAARNLDPERGFGTVQGDAIRN 370 >UniRef50_A9IXL3 Phage-related protein n=6 Tax=Bartonella RepID=A9IXL3_BART1 Length = 334 Score = 55.8 bits (133), Expect = 5e-07, Method: Compositional matrix adjust. Identities = 46/164 (28%), Positives = 67/164 (40%), Gaps = 24/164 (14%) Query: 17 WPSATPPTGWLKCNGAAFSAEEYPEL----------AKAYPTNKLPDLRGEFIRGWDDGR 66 + S P+GWL C+G +S + Y L T +PDLRG F+RG D G+ Sbjct: 178 FASEKIPSGWLLCDGKEYSRKNYANLFAVLGETWGKGDGKTTFNVPDLRGMFLRGLDSGK 237 Query: 67 GIDTGRSILSIQGYATEDHAHGLPSRSTIVTDATINFYFDEIWVNSGTDIIKRGNTNDAG 126 ID GR + S Q + + H H + ST + + DI++ + G Sbjct: 238 EIDKGRLLGSRQEESFKSHTHEGKTDSTGKHQHS--------YPTIKNDILRYKREDYKG 289 Query: 127 LPAPDYGTFK------TYKQSVDGLGAAASETRPRNIAFNYIVR 164 A Y T ++ V ETRP N+A Y V+ Sbjct: 290 YVAVVYKTDTLTEPAGEHEHKVLLQKTGGDETRPVNMAVVYAVK 333 >UniRef50_A9ITY4 Phage related protein n=6 Tax=Bartonella RepID=A9ITY4_BART1 Length = 376 Score = 55.1 bits (131), Expect = 9e-07, Method: Compositional matrix adjust. Identities = 32/89 (35%), Positives = 41/89 (46%), Gaps = 10/89 (11%) Query: 9 LPVGVPVPWPSATPPTGWLKCNGAAFSAEEYPELAKAYPTN----------KLPDLRGEF 58 LP G+ P+ P GWL C+G A+S Y L T +PD RG F Sbjct: 159 LPSGLIGPFAMERLPDGWLLCDGRAYSRRTYRALFDGIGTTWGEGDGSTTFNVPDFRGMF 218 Query: 59 IRGWDDGRGIDTGRSILSIQGYATEDHAH 87 +RG D R +D RS S QG + + H H Sbjct: 219 LRGMDYERNLDPWRSFASQQGCSLKAHEH 247 >UniRef50_Q2T5M0 Phage-related tail fiber protein n=19 Tax=root RepID=Q2T5M0_BURTA Length = 790 Score = 55.1 bits (131), Expect = 9e-07, Method: Compositional matrix adjust. 
Identities = 38/109 (34%), Positives = 46/109 (42%), Gaps = 27/109 (24%) Query: 7 SALPVGVPVPWPSATPPTGWLKCNGAAFSAEEYPEL------------------------ 42 SA +G V P T G+LK NG + +YPEL Sbjct: 551 SATTIGQIVFEPRTTVRPGFLKANGVLVNRADYPELWAYAQASGALVSDADWMKDRWGCF 610 Query: 43 --AKAYPTNKLPDLRGEFIRGWDDGR-GIDTGRSILSIQGYATEDHAHG 88 T +LP+LRGEFIR W D R G+D R I + QG HAHG Sbjct: 611 STGDGATTFRLPELRGEFIRCWSDARGGVDATRQIGAFQGDQNHTHAHG 659 >UniRef50_A3YFP9 35 kDa protein-like n=1 Tax=Marinomonas sp. MED121 RepID=A3YFP9_9GAMM Length = 207 Score = 53.9 bits (128), Expect = 2e-06, Method: Compositional matrix adjust. Identities = 38/116 (32%), Positives = 51/116 (43%), Gaps = 34/116 (29%) Query: 6 GSALPVGVPV------------PWPSATPPTGWLKCNGAAFSAEEYPELAKAY------- 46 G A+PVG + P+ + P WLKC+G++ +YPEL A Sbjct: 29 GDAMPVGSVIAFAGEIRTSGDKPFETNLPMFNWLKCDGSSLEVAQYPELFSALGYRYGGS 88 Query: 47 -PTNKLPDLRGEFIRGWD-----------DGR-GIDTG--RSILSIQGYATEDHAH 87 LPDLRGEF+RG D +GR G G + S QG+A + H H Sbjct: 89 GQKFNLPDLRGEFLRGVDVDSSNNKKASLEGRKGAANGGNHEVGSTQGFALQSHVH 144 >UniRef50_Q4TVW2 Tail fiber protein n=4 Tax=Viruses RepID=Q4TVW2_9CAUD Length = 760 Score = 53.9 bits (128), Expect = 2e-06, Method: Compositional matrix adjust. Identities = 22/53 (41%), Positives = 35/53 (66%), Gaps = 1/53 (1%) Query: 8 ALPVGVPVPWPSATPPTGWLKCNGAAFSAEEYPELAKAYPTNKLPDLRGEFIR 60 A+P+G P+ TPP G+L C+G+ FS +EYP+L + LPD+RG +++ Sbjct: 263 AVPIGSIFPFVK-TPPAGYLTCDGSTFSKDEYPDLYAYLGSTTLPDMRGRYLK 314 >UniRef50_C5A8Q3 Phage-related tail fiber protein n=1 Tax=Burkholderia glumae BGR1 RepID=C5A8Q3_BURGB Length = 865 Score = 53.5 bits (127), Expect = 2e-06, Method: Compositional matrix adjust. 
Identities = 59/224 (26%), Positives = 87/224 (38%), Gaps = 65/224 (29%) Query: 7 SALPVGVPVPWPSATPPTGWLKCNGAAFSAEEYPEL--------------------AKAY 46 S+ +G V P T G+LK NG+ +YP L + + Sbjct: 641 SSSSIGQIVFEPRTTTRAGFLKANGSLLERADYPALWAYAQASGALISDAAWWAGQSGCF 700 Query: 47 PTN------KLPDLRGEFIRGWDDGRGIDTGRSILSIQGYATEDHAH-------GLPSRS 93 T ++P+LRGEF+R DDGRG+DT R+ S+Q H+H G + Sbjct: 701 STGTTGTNFRIPELRGEFLRCLDDGRGLDTSRAAGSLQLSQNAKHSHDASSTVGGSHTHG 760 Query: 94 TIVTDA-TINFYFDE------IWVNS-GTDIIKRG------------------NTNDAGL 127 T A + N D+ W+ S + RG N N A L Sbjct: 761 AFTTGAGSHNHAIDQQPHAHDTWLGSVQVSGVDRGGGFGPYNGRVGEAWSDPANANIAIL 820 Query: 128 PAPDY----GTFKT--YKQSVDGLGAAASETRPRNIAFNYIVRA 165 P D+ GT+ + ++ + E RPRNIA ++RA Sbjct: 821 PTGDHVHGAGTYPAGDHNHAIAVQPSGGDEARPRNIALLAMIRA 864 >UniRef50_UPI00016A4B89 phage-related tail fiber protein n=2 Tax=Burkholderia thailandensis RepID=UPI00016A4B89 Length = 654 Score = 53.5 bits (127), Expect = 3e-06, Method: Compositional matrix adjust. Identities = 35/110 (31%), Positives = 44/110 (40%), Gaps = 26/110 (23%) Query: 4 GEGSALPVGVPVPWPSATPPTGWLKCNGAAFSAEEYPEL--------------------- 42 GE ++ VG V G+LKCNGA +YP L Sbjct: 412 GELASAMVGQIVFEMRTAARAGYLKCNGALVKRADYPALWAYAQGSGALVAEKDWMSGNF 471 Query: 43 -----AKAYPTNKLPDLRGEFIRGWDDGRGIDTGRSILSIQGYATEDHAH 87 T ++P+LRGEF+R WDDGRG D R I + Q H H Sbjct: 472 GCFSDGDGSATFRIPELRGEFLRCWDDGRGSDADRKIGTWQDSMNRTHGH 521 >UniRef50_Q56BI6 Gp12 short tail fibers n=1 Tax=Enterobacteria phage RB43 RepID=Q56BI6_9CAUD Length = 463 Score = 53.5 bits (127), Expect = 3e-06, Method: Compositional matrix adjust. 
Identities = 44/174 (25%), Positives = 67/174 (38%), Gaps = 40/174 (22%) Query: 7 SALPVGVPVPWPSATPPTGWLKCNGAAFSAEEYPELAKAYPTN--------KLPDLRGEF 58 ++LP+G + + NG EYPEL LPD+RG Sbjct: 312 ASLPIGCMMMAAFNSDYGNLCIANGRGMYTYEYPELFALIGYTYGGSGNIFNLPDMRGVV 371 Query: 59 IRGWDDGRGIDTGRSILSIQGYATEDHAHGLPSRSTIVTDATINFYFDEIWVNSGTDI-- 116 RG+D GRG+D GR + Q + + H H L ++ SG ++ Sbjct: 372 ARGFDAGRGLDPGRGFGTYQHHEVQSHEHPL-----------------QMIYQSGGNLPS 414 Query: 117 ------IKRGNTNDAGLPAPDYGTFKTYKQSVDGLGAAASETRPRNIAFNYIVR 164 ++ ND L PD K + +ETR +N+A NY++R Sbjct: 415 WQCVYELRTAEKNDQQLYWPDPSLSK-------AMAVGGNETRMKNLAINYVIR 461 >UniRef50_B3R3K1 Bacteriophage large tail fiber protein n=1 Tax=Cupriavidus taiwanensis RepID=B3R3K1_CUPTR Length = 1045 Score = 52.8 bits (125), Expect = 5e-06, Method: Compositional matrix adjust. Identities = 34/103 (33%), Positives = 44/103 (42%), Gaps = 26/103 (25%) Query: 11 VGVPVPWPSATPPTGWLKCNGAAFSAEEYPEL---AKAY--------------------- 46 VG + P T G LK NGA +YPEL A+A Sbjct: 825 VGQIIIEPRTTARAGCLKLNGALLKRADYPELWAYAQASGAIVTDAAWLAGSWGCFSHGD 884 Query: 47 --PTNKLPDLRGEFIRGWDDGRGIDTGRSILSIQGYATEDHAH 87 T ++P+ RGE++R WDD RG D GR I Q + H+H Sbjct: 885 GNTTFRIPEYRGEYLRFWDDARGADAGRGIGVFQDSQNKTHSH 927 >UniRef50_A3GRX3 Tail fiber like-protein n=1 Tax=Vibrio cholerae NCTC 8457 RepID=A3GRX3_VIBCH Length = 182 Score = 51.6 bits (122), Expect = 9e-06, Method: Compositional matrix adjust. Identities = 23/60 (38%), Positives = 32/60 (53%) Query: 9 LPVGVPVPWPSATPPTGWLKCNGAAFSAEEYPELAKAYPTNKLPDLRGEFIRGWDDGRGI 68 PVG +PW + P G+ G AF Y ELAK +P +PD+RG + G +DG + Sbjct: 17 FPVGGAIPWFTDVAPEGFGMFKGQAFDVNVYTELAKVFPNGIIPDMRGCGVIGKEDGEAV 76 >UniRef50_Q727X4 Tail fiber protein, putative n=4 Tax=Desulfovibrio vulgaris RepID=Q727X4_DESVH Length = 296 Score = 51.2 bits (121), Expect = 1e-05, Method: Compositional matrix adjust. 
Identities = 46/148 (31%), Positives = 63/148 (42%), Gaps = 28/148 (18%) Query: 19 SATPPTGWLKCNGAAFSAEEYPELAKAYPTNKLPDLRGEFIRGWDDGRGIDTGRSILSIQ 78 +AT P W +CN + + +L D RGEF RGWD GRG+D GR + S Q Sbjct: 172 NATAP-AWYRCNASGVRDATGDHI-------RLQDRRGEFARGWDHGRGVDAGRVLGSAQ 223 Query: 79 GYATEDHAHGLPSRSTIVTDATINFYFDEIWVNSGTDIIKRGNTNDAGL---PAPDYGTF 135 G A + + S + +V SG + + AG P D+ TF Sbjct: 224 GDAIRNIVGSMGSITAVVAGTA-----------SGAFTVTTPSNRSAGSSTGPTCDF-TF 271 Query: 136 KTYKQSVDGLGAAASETRPRNIAFNYIV 163 + + ASE R RNIA Y+V Sbjct: 272 DASR-----VVPTASENRTRNIATLYLV 294 >UniRef50_B5TAB1 Gp47 n=2 Tax=root RepID=B5TAB1_9CAUD Length = 325 Score = 49.7 bits (117), Expect = 3e-05, Method: Compositional matrix adjust. Identities = 22/42 (52%), Positives = 29/42 (69%) Query: 50 KLPDLRGEFIRGWDDGRGIDTGRSILSIQGYATEDHAHGLPS 91 +LPD+RGE +R WD+GRG+D R++ S QG A E H H S Sbjct: 183 RLPDVRGEGLRLWDNGRGVDQARTLGSWQGGAIESHGHAANS 224 >UniRef50_Q03314 Protein rhiB n=2 Tax=Rhizobium leguminosarum bv. viciae RepID=RHIB_RHILV Length = 219 Score = 49.7 bits (117), Expect = 3e-05, Method: Compositional matrix adjust. Identities = 45/165 (27%), Positives = 71/165 (43%), Gaps = 51/165 (30%) Query: 25 GWLKCNGAAFSAEEYPEL------------AKAYPTNKLPDLRGEFIRGWDDGRGID--- 69 GW+ C+G A YPEL + A ++PD RG F+RG+D G G+D Sbjct: 79 GWMLCDGRYLRAAVYPELYAVLGGLYGERNSTADLEFRIPDYRGLFLRGFDAGGGMDPDA 138 Query: 70 ------TGRSIL----SIQGYATEDHAHGLPSRSTIVTDATINFYFDEIWVNSGTDIIKR 119 TG ++ S+Q A + HAH EI +G I ++ Sbjct: 139 KRRLDPTGNNVANVVGSLQCDALQVHAHPY-----------------EITTPAG--ISQQ 179 Query: 120 GNTNDAGLPAPDYGTFKTYKQSVDGLGAAASETRPRNIAFNYIVR 164 GN + + G+ + ++ A ETRP+N+A NY+++ Sbjct: 180 GNAAGTSISSKSTGSPENPART-------ALETRPKNVAVNYLIK 217 >UniRef50_A9ITX5 Phage-related protein n=6 Tax=Bartonella RepID=A9ITX5_BART1 Length = 333 Score = 49.7 bits (117), Expect = 4e-05, Method: Compositional matrix adjust. 
Identities = 28/93 (30%), Positives = 40/93 (43%), Gaps = 10/93 (10%) Query: 5 EGSALPVGVPVPWPSATPPTGWLKCNGAAFSAEEYPELAK----------AYPTNKLPDL 54 E S P G + P WL C+G A+ +Y +L + + T +PD Sbjct: 154 ESSLYPTGFIGTFGMRDVPKDWLICDGKAYLRRDYRDLFETIGTVWGEGDSVTTFNVPDF 213 Query: 55 RGEFIRGWDDGRGIDTGRSILSIQGYATEDHAH 87 RG F+RG D G +D R S+Q + H H Sbjct: 214 RGMFLRGVDGGSNLDPNRRFASVQTDLIQSHQH 246 >UniRef50_D2L4G0 Tail Collar domain protein n=1 Tax=Desulfovibrio sp. FW1012B RepID=D2L4G0_9DELT Length = 319 Score = 49.3 bits (116), Expect = 5e-05, Method: Compositional matrix adjust. Identities = 23/57 (40%), Positives = 33/57 (57%), Gaps = 4/57 (7%) Query: 9 LPVGVPVPWPSATPP---TGWLKCNGAAF-SAEEYPELAKAYPTNKLPDLRGEFIRG 61 +PVG +PWPS + P T WL+CNG A S +Y L + +P+ G+F+RG Sbjct: 41 IPVGTVIPWPSTSMPADATRWLECNGQAVPSGSQYDRLRVVLGSKPIPNYNGQFLRG 97 >UniRef50_B0USC5 Phage Tail Collar domain protein n=1 Tax=Haemophilus somnus 2336 RepID=B0USC5_HAES2 Length = 652 Score = 48.9 bits (115), Expect = 5e-05, Method: Compositional matrix adjust. Identities = 25/52 (48%), Positives = 33/52 (63%), Gaps = 2/52 (3%) Query: 6 GSALPVGVPVPWPSATP-PTGWLKCNGAAFSAEEYPELAKAY-PTNKLPDLR 55 G LPVG + +P A P G+LKC+G+ F YP+L +A +NKLPDLR Sbjct: 224 GDGLPVGSVLAFPVAVQNPQGFLKCDGSTFGRTTYPDLYRALGNSNKLPDLR 275 >UniRef50_Q7Y2B3 Gp12 Short tail fibers n=2 Tax=unclassified T4-like viruses RepID=Q7Y2B3_9CAUD Length = 466 Score = 48.9 bits (115), Expect = 5e-05, Method: Compositional matrix adjust. 
Identities = 50/169 (29%), Positives = 72/169 (42%), Gaps = 28/169 (16%) Query: 8 ALPVGVPVPWPSATPPTGWLKCNGAAFSAEEYPELAKAY--------PTNKLPDLRGEFI 59 A+P+G + +L CNG + + +YP+L A LPD+RG Sbjct: 312 AMPIGGIILSGFNADRGDFLICNGRSLNKNQYPQLFSAIGYTFGGSGDNFNLPDMRGLVA 371 Query: 60 RGWDDGRGIDTGRSILSIQGYATEDHAHGLPSRSTIVTDATINFYFDEIWVNSGTDIIKR 119 RG D GR +D GR S Q A + P V D +Y G +R Sbjct: 372 RGCDHGRNLDPGRRFGSYQEDAMQRITGKFP-----VADRWRGWY-------GGAFTAQR 419 Query: 120 G--NTNDAGLPAPDYGTFKTYK--QSVDGLGAAASETRPRNIAFNYIVR 164 G +TN D+GT + +SV A+ETR +++A NYI+R Sbjct: 420 GQWSTNYKNGGGDDWGTTVNFDSGRSV----RTANETRVKSLALNYIIR 464 >UniRef50_C2I7P2 Phage-related tail fiber protein n=1 Tax=Vibrio cholerae TM 11079-80 RepID=C2I7P2_VIBCH Length = 406 Score = 48.5 bits (114), Expect = 7e-05, Method: Compositional matrix adjust. Identities = 45/154 (29%), Positives = 59/154 (38%), Gaps = 34/154 (22%) Query: 11 VGVPVPWPSATPPTGWLKCNGAAFSAEEYPELAKAYPTNKLPD------LRGEFIRGWDD 64 VG+P W + P + Y LA+ YP D +RGEF+R D Sbjct: 274 VGMPFYWLDTSAPEWAVMEINVNLPIAVYWRLARRYPQLVRDDYINTGEIRGEFLRVLDQ 333 Query: 65 GRGIDTGRSILSIQGYATEDHAHGLPSRSTIVTDATINFYFDEIWVNSGTDIIKRGNTND 124 GRG+D GRSI S Q E H H + +I + T II + Sbjct: 334 GRGVDAGRSIQSYQDDELERHTHTFSAPFSITANT------------GSTGIIISASH-- 379 Query: 125 AGLPAPDYGTFKTYKQSVDGLGAAASETRPRNIA 158 P++ T T +ETRPRNIA Sbjct: 380 ----VPNWNTTYT----------GGNETRPRNIA 399 >UniRef50_B4EF34 Putative phage tail protein n=1 Tax=Burkholderia cenocepacia J2315 RepID=B4EF34_BURCJ Length = 883 Score = 48.5 bits (114), Expect = 8e-05, Method: Compositional matrix adjust. 
Identities = 30/96 (31%), Positives = 40/96 (41%), Gaps = 26/96 (27%) Query: 12 GVPVPWPSATPPTGWLKCNGAAFSAEEYPEL--------------------------AKA 45 G V P T G+LK NGA +YP L Sbjct: 653 GTVVFEPRTTARAGFLKLNGALLKRADYPALWAYAQASGALSTETDWAAGWSGTFSTGDG 712 Query: 46 YPTNKLPDLRGEFIRGWDDGRGIDTGRSILSIQGYA 81 T ++P+LRGEF+R WDD RG+D R + + Q +A Sbjct: 713 TTTFRIPELRGEFVRCWDDTRGVDPNRGLGASQNFA 748 >UniRef50_C5B185 Putative uncharacterized protein n=1 Tax=Methylobacterium extorquens AM1 RepID=C5B185_METEA Length = 449 Score = 48.1 bits (113), Expect = 1e-04, Method: Compositional matrix adjust. Identities = 45/167 (26%), Positives = 67/167 (40%), Gaps = 36/167 (21%) Query: 10 PVGVPVPWPSATPPTGWLKCNGAAFSAEEYPELAKAYPTN----------KLPDLRGEFI 59 P G+ + + P GW+ G A ++ L T +PDLRG F+ Sbjct: 308 PPGMISAYAGQSCPVGWVDATGLALLRSDFSALFAVIGTRWGAGDGSTTFNVPDLRGYFL 367 Query: 60 RGWDDGRGIDTGRSILSIQGYATEDHAHGLP-SRSTIVTDATINFYFDEIWVNSGTDIIK 118 R D G G D GR + S Q + H H +P + +T + T NF + + +GT + Sbjct: 368 RMQDAGAGRDPGRDLGSAQAGSVGPHQHNVPVANATAGSGTTNNFVYP---LAAGTSSVP 424 Query: 119 RGNTNDAGLPAPDYGTFKTYKQSVDGLGAAASETRPRNIAFNYIVRA 165 + PAP A ETRP NIA Y +++ Sbjct: 425 TTGQD----PAP------------------AGETRPINIAVWYCIKS 449 >UniRef50_B8HZW5 Tail Collar domain protein n=2 Tax=Clostridium RepID=B8HZW5_CLOCE Length = 368 Score = 48.1 bits (113), Expect = 1e-04, Method: Compositional matrix adjust. 
Identities = 44/185 (23%), Positives = 68/185 (36%), Gaps = 34/185 (18%) Query: 9 LPVGVPVPW-----PSATPPTGWLKCNGAAFSAEEYPEL---------AKAYPTNKLPDL 54 PVG+ +P+ +GW+ C+G +Y EL P +PDL Sbjct: 5 FPVGMVIPFAGPLKEDQLKSSGWVPCDGRVLDKTQYSELFDVIGTKYGGDGIPNFNIPDL 64 Query: 55 RGEFIRGWDDGRGID--------------TGRSILSIQGYATEDHAHGLPSRSTIVTD-A 99 RG F+R D GRG D G + S+Q YAT P + I D Sbjct: 65 RGRFVRATDHGRGYDPDAQRRKASKSGGAAGDNTGSVQEYATAK-----PKNNFITNDKG 119 Query: 100 TINFYFDEIWVNSGTDIIKRGNTNDAGLPAPDYGTFKTYKQSVDGLGAAASETRPRNIAF 159 N D + + + A P + + + S + SE+RP N+ Sbjct: 120 NHNHLVDHLPTDYWNAACAITSNEGANFPGRTATSGEAGQHSHTIVSGGDSESRPVNLYM 179 Query: 160 NYIVR 164 +I++ Sbjct: 180 YWIIK 184 Score = 44.7 bits (104), Expect = 0.001, Method: Compositional matrix adjust. Identities = 45/153 (29%), Positives = 71/153 (46%), Gaps = 16/153 (10%) Query: 25 GWLKCNGAAFSAEEYPELAKAYPT------NK--LPDLRGEFIRGWDDGRGIDTGRSILS 76 GWL C G+++ A +YP+L + NK +PDLRG FIRG + G + Sbjct: 219 GWLPCIGSSYEANKYPDLYENISNIYGGDQNKFNVPDLRGLFIRGVNSNTSETPGVHGAT 278 Query: 77 IQGYATEDHAHGLPS--RSTIVTDATINFYFDEIWVNSGTDIIKRGNTNDAGLPAPDY-G 133 G TED++ LP T+ TD ++ + + G+ A P+ Y G Sbjct: 279 RVG-QTEDYSTALPKTLNFTLSTDGAHTHSAPKLPQDKYIENYCAGH-EVANFPSNQYTG 336 Query: 134 TFKTYKQSVDGLGAAASETRPRNIAFNYIVRAA 166 + ++ G A ETRP NI +YI++++ Sbjct: 337 NNGNHAHTIAGGDA---ETRPVNIYLDYIIKSS 366 >UniRef50_A8YDB4 Genome sequencing data, contig C291 n=2 Tax=Microcystis aeruginosa RepID=A8YDB4_MICAE Length = 166 Score = 47.8 bits (112), Expect = 1e-04, Method: Compositional matrix adjust. 
Identities = 35/105 (33%), Positives = 48/105 (45%), Gaps = 25/105 (23%) Query: 5 EGSALPVGVPVPWPSATPPTGWLKCNGAAFSAEEYPELAKAYPT-------NK--LPDLR 55 + +L +P+ +P A GW+ C+G + YPEL T NK LPD R Sbjct: 39 QAESLNANIPITYPEAY---GWMLCDGRYLEIDAYPELFAVIGTLYGKQGDNKFRLPDYR 95 Query: 56 GEFIRGWDDGRGID-----------TGRS--ILSIQGYATEDHAH 87 G F+RG D G G+D G+S I S+Q A + H H Sbjct: 96 GLFMRGVDAGSGLDPDAAERIGPEGMGKSSGIGSLQCDALQQHQH 140 >UniRef50_B8HZW4 Tail Collar domain protein n=2 Tax=Clostridium RepID=B8HZW4_CLOCE Length = 200 Score = 47.0 bits (110), Expect = 2e-04, Method: Compositional matrix adjust. Identities = 46/180 (25%), Positives = 70/180 (38%), Gaps = 56/180 (31%) Query: 25 GWLKCNGAAFSAEEYPELAKAYPTNK--------LPDLRGEFIRG------WDDGRGID- 69 GWL C+G+ EYP+L +A LPD + +FIRG + GR +D Sbjct: 31 GWLICDGSKLKIAEYPDLFQAIGKAHGGDNTYFYLPDTQSKFIRGVNGDSVGESGRLMDP 90 Query: 70 -------------TGRSILSIQGYAT------------EDHAHGLPSRSTIVTDATINFY 104 TG ++ S Q +AT H H LP + D + N Y Sbjct: 91 DVAKRTFAKPGGNTGNNVGSYQDFATGLPKVSLTTDFIGSHTHSLPH----LPDGSHNAY 146 Query: 105 FDEIWVNSGTDIIKRGNTNDAGLPAPDYGTFKTYKQSVDGLGAAASETRPRNIAFNYIVR 164 I + G + T ++G S + +G ETRPRN+ ++I++ Sbjct: 147 AGSIGRDGGKEAGDNTRTGESG------------SHSHEIIGGGDPETRPRNMNLHFIIK 194 >UniRef50_B5JYG6 Phage Tail Collar Domain family (Fragment) n=1 Tax=gamma proteobacterium HTCC5015 RepID=B5JYG6_9GAMM Length = 400 Score = 46.2 bits (108), Expect = 4e-04, Method: Compositional matrix adjust. Identities = 26/69 (37%), Positives = 32/69 (46%), Gaps = 10/69 (14%) Query: 10 PVGVPVPWPSATPPTGWLKCNGAAFSAEEYPELAKAY----------PTNKLPDLRGEFI 59 P G + TPP GWL C+G+ S +YP L A T LPDLR +F Sbjct: 212 PAGRTEDFAGTTPPGGWLFCDGSEVSRTQYPALFTAIGTLWGDGDGSTTFNLPDLRNDFR 271 Query: 60 RGWDDGRGI 68 RG D R + Sbjct: 272 RGCSDTRSV 280 >UniRef50_A3GUE7 Tail fiber protein H, putative (Fragment) n=1 Tax=Vibrio cholerae NCTC 8457 RepID=A3GUE7_VIBCH Length = 250 Score = 45.8 bits (107), Expect = 5e-04, Method: Compositional matrix adjust. 
Identities = 20/48 (41%), Positives = 26/48 (54%) Query: 9 LPVGVPVPWPSATPPTGWLKCNGAAFSAEEYPELAKAYPTNKLPDLRG 56 PVG +PW + P G+ G AF Y ELAK +P +PD+RG Sbjct: 203 FPVGGAIPWFTDVAPEGFGMFKGQAFDVNVYTELAKVFPNGIIPDMRG 250 >UniRef50_A6E6G6 Phage Tail Collar n=1 Tax=Pedobacter sp. BAL39 RepID=A6E6G6_9SPHI Length = 731 Score = 45.1 bits (105), Expect = 9e-04, Method: Composition-based stats. Identities = 46/166 (27%), Positives = 64/166 (38%), Gaps = 19/166 (11%) Query: 9 LPVGVPVPWPSATPPTGWLKCNGAAFSAEEYPELAKAYPTNK-LPDLRGEFIRG------ 61 PVG V + P WL C+G YP+L + K LPDLRG F+ G Sbjct: 575 FPVGGIVAFYGEKVPDHWLLCDGKPVDHSLYPDLYRLLGGEKRLPDLRGRFLVGAGSKYS 634 Query: 62 WDDGRGIDTGRSILSIQGYATEDHAHGLPSRSTIVTDATINFYFDEIWVNSGTDIIKRGN 121 D G+D LS+ H H + + + + F E+ + + RG Sbjct: 635 LGDMGGVDE----LSLNVDQMPQHDHQIKAVKSYESP------FKEVNMGWAREESLRGG 684 Query: 122 T--NDAGLPAPDYGTFKTYKQSVDGLGAAASETRPRNIAFNYIVRA 165 D A Y ++ G A E RP +A NYI+RA Sbjct: 685 VYGTDRDNGADKYFVTRSNSPVKSEGGGKAHENRPPYLAVNYIIRA 730 >UniRef50_P51735 Probable tail fiber protein n=27 Tax=root RepID=VPH_BPHP1 Length = 925 Score = 45.1 bits (105), Expect = 0.001, Method: Compositional matrix adjust. Identities = 21/51 (41%), Positives = 32/51 (62%), Gaps = 2/51 (3%) Query: 6 GSALPVGVPVPWPSA-TPPTGWLKCNGAAFSAEEYPELAKAY-PTNKLPDL 54 G +P+G V +P A T P G+LK NG F+ + +P+L + +N+LPDL Sbjct: 532 GDGVPIGSVVSFPRAVTNPVGFLKANGTTFNQQTFPDLYRTLGDSNQLPDL 582 >UniRef50_C3XAA4 Putative uncharacterized protein n=1 Tax=Oxalobacter formigenes OXCC13 RepID=C3XAA4_OXAFO Length = 305 Score = 45.1 bits (105), Expect = 0.001, Method: Compositional matrix adjust. 
Identities = 35/100 (35%), Positives = 43/100 (43%), Gaps = 21/100 (21%) Query: 2 GLGEGSALPVGVPVPWPSATPPTGWLKCNGAAFSAEEYPELAKAYPTN----------KL 51 G+ S +PVG + T P G+LK NGAA E YPEL T L Sbjct: 7 GITPSSGVPVGTIEYFAMVTSPAGYLKANGAAVGRETYPELYATIGTTFGEGDGSSTFNL 66 Query: 52 PDLRGEFIRGWDD-GRGIDTGRSILSIQGYATEDHAHGLP 90 PDL F +G + G+ I+ G S DH H LP Sbjct: 67 PDLIDRFAQGSNTPGQKIEAGLS----------DHNHTLP 96 >UniRef50_C3X8V5 Bacteriophage tail fiber protein n=2 Tax=Oxalobacter formigenes OXCC13 RepID=C3X8V5_OXAFO Length = 480 Score = 44.7 bits (104), Expect = 0.001, Method: Compositional matrix adjust. Identities = 26/65 (40%), Positives = 34/65 (52%), Gaps = 10/65 (15%) Query: 9 LPVGVPVPWPSATPPTGWLKCNGAAFSAEEYPELAKAY----------PTNKLPDLRGEF 58 +PVG + ++TPP G+LK +GAA E YP+L A T LPDL G F Sbjct: 196 VPVGTIEYFATSTPPAGYLKADGAAVGRETYPDLFAAIGTAFGEGDGSTTFNLPDLIGRF 255 Query: 59 IRGWD 63 +G D Sbjct: 256 AQGSD 260 >UniRef50_A4NHY2 Probable tail fiber protein n=1 Tax=Haemophilus influenzae PittAA RepID=A4NHY2_HAEIN Length = 556 Score = 44.3 bits (103), Expect = 0.001, Method: Compositional matrix adjust. Identities = 21/51 (41%), Positives = 32/51 (62%), Gaps = 2/51 (3%) Query: 6 GSALPVGVPVPWPSA-TPPTGWLKCNGAAFSAEEYPELAKAY-PTNKLPDL 54 G +P+G V +P A T P G+LK NG F+ + +P+L + +N+LPDL Sbjct: 175 GKGVPIGAVVSFPRAVTNPVGFLKANGTTFNQQTFPDLYRTLGNSNQLPDL 225 >UniRef50_C0DSG4 Putative uncharacterized protein n=1 Tax=Eikenella corrodens ATCC 23834 RepID=C0DSG4_EIKCO Length = 436 Score = 44.3 bits (103), Expect = 0.002, Method: Compositional matrix adjust. Identities = 22/50 (44%), Positives = 31/50 (62%), Gaps = 1/50 (2%) Query: 6 GSALPVGVPVPWPSA-TPPTGWLKCNGAAFSAEEYPELAKAYPTNKLPDL 54 G LPVG V +P A + P G+LK +G+ F+ YP+L + NKLP+L Sbjct: 69 GKGLPVGAVVGFPRAISSPEGYLKADGSTFAQATYPDLYRVLGGNKLPNL 118 >UniRef50_C3X3K6 Predicted protein n=2 Tax=Oxalobacter formigenes HOxBLS RepID=C3X3K6_OXAFO Length = 500 Score = 43.5 bits (101), Expect = 0.002, Method: Compositional matrix adjust. 
Identities = 23/63 (36%), Positives = 32/63 (50%), Gaps = 10/63 (15%) Query: 9 LPVGVPVPWPSATPPTGWLKCNGAAFSAEEYPELAKAY----------PTNKLPDLRGEF 58 +PVG V + ++ P G+LKC+GAA + YP+L A T LPD+ G F Sbjct: 213 IPVGTVVMFSASEAPAGYLKCDGAAVGRDTYPDLFAAIGTVFGAGDGETTFNLPDMIGRF 272 Query: 59 IRG 61 G Sbjct: 273 AEG 275 >UniRef50_B5TK79 Tail collar protein n=2 Tax=root RepID=B5TK79_9VIRU Length = 364 Score = 43.5 bits (101), Expect = 0.003, Method: Compositional matrix adjust. Identities = 30/112 (26%), Positives = 55/112 (49%), Gaps = 5/112 (4%) Query: 48 TNKLPDLRGEFIRGWDDGRGIDTGRSILSIQGYATEDHAHGLPSRSTIVTDATINFYFDE 107 T + P+ RGEF+R D+GR +D+GR++ + Q H+H L ++ + + Sbjct: 251 TFRAPEGRGEFLRILDEGRSVDSGRAMGTFQPGTV--HSHALGAQGAGAVGSRWSDSLST 308 Query: 108 IWVNSGTDIIKRGNTNDAGLPAPDYGTFKTYKQSVDGLGAAASETRPRNIAF 159 + N+ +I G+ + G P + TY+ + ++RPRNIA+ Sbjct: 309 VGANTREEIKIIGDLVNGG---PTFPAGTTYQMDTANTLLYSFKSRPRNIAY 357 >UniRef50_A4P195 Putative phage tail fibre protein (Fragment) n=1 Tax=Haemophilus influenzae 22.4-21 RepID=A4P195_HAEIN Length = 458 Score = 42.0 bits (97), Expect = 0.007, Method: Compositional matrix adjust. Identities = 20/48 (41%), Positives = 32/48 (66%), Gaps = 2/48 (4%) Query: 9 LPVGVPVPWPSA-TPPTGWLKCNGAAFSAEEYPELAKAY-PTNKLPDL 54 +P+G V +P A T P G+L+ +G+ FS + +P+L + +NKLPDL Sbjct: 393 IPIGAVVSFPRAVTNPVGFLRADGSTFSQQTFPDLYRTLGNSNKLPDL 440 >UniRef50_A4YX40 Putative uncharacterized protein n=1 Tax=Bradyrhizobium sp. ORS278 RepID=A4YX40_BRASO Length = 549 Score = 41.6 bits (96), Expect = 0.009, Method: Compositional matrix adjust. 
Identities = 26/63 (41%), Positives = 31/63 (49%), Gaps = 12/63 (19%) Query: 23 PTGWLKCNGA--AFSAEEYPELAKAYPTN---------KLPDLRGEFIRGWDDGRGIDTG 71 PTGW+ CNG A SA P A +N +PDLRG+F+RG G G D Sbjct: 213 PTGWIYCNGMPQAISASS-PAFANTLGSNFGGDGVSVFNVPDLRGQFLRGTSHGTGRDPN 271 Query: 72 RSI 74 SI Sbjct: 272 ASI 274 >UniRef50_C3X8I7 Putative uncharacterized protein n=1 Tax=Oxalobacter formigenes OXCC13 RepID=C3X8I7_OXAFO Length = 369 Score = 41.6 bits (96), Expect = 0.009, Method: Compositional matrix adjust. Identities = 24/63 (38%), Positives = 32/63 (50%), Gaps = 10/63 (15%) Query: 9 LPVGVPVPWPSATPPTGWLKCNGAAFSAEEYPELAKAY----------PTNKLPDLRGEF 58 +PVG + ++TPP G+LK +G+ E YPEL A T LPDL G F Sbjct: 106 IPVGSIDYFATSTPPAGYLKADGSEVGRETYPELFTAIGTVFGEGNGDSTFNLPDLMGRF 165 Query: 59 IRG 61 +G Sbjct: 166 AQG 168 >UniRef50_B0UTN0 Phage Tail Collar domain protein n=1 Tax=Haemophilus somnus 2336 RepID=B0UTN0_HAES2 Length = 699 Score = 41.6 bits (96), Expect = 0.010, Method: Compositional matrix adjust. Identities = 21/51 (41%), Positives = 29/51 (56%), Gaps = 2/51 (3%) Query: 6 GSALPVGVPVPWPSA-TPPTGWLKCNGAAFSAEEYPELAKAY-PTNKLPDL 54 G +P+G V +P A T PTG+LKC+G YP+L + N LP+L Sbjct: 347 GDGVPLGAIVAFPKAITNPTGFLKCDGTTIDQRTYPDLYRTLGNKNTLPNL 397 >UniRef50_C3X3W1 Predicted protein n=2 Tax=Oxalobacter formigenes HOxBLS RepID=C3X3W1_OXAFO Length = 365 Score = 41.2 bits (95), Expect = 0.011, Method: Compositional matrix adjust. Identities = 24/70 (34%), Positives = 31/70 (44%), Gaps = 9/70 (12%) Query: 3 LGEGSALPVGVPVPWPSATPPTGWLKCNGAAFSAEEYPELAKA---------YPTNKLPD 53 L + LP G + P G+L CNGA+ S YPEL T LPD Sbjct: 212 LDKAEKLPAGTIIAVGGNITPEGFLYCNGASLSPSAYPELCAVIGGTYGGDGLTTFNLPD 271 Query: 54 LRGEFIRGWD 63 RG +++G D Sbjct: 272 FRGRWMQGND 281 >UniRef50_A3YA17 Prophage MuSo2, tail fiber protein, putative n=1 Tax=Marinomonas sp. MED121 RepID=A3YA17_9GAMM Length = 341 Score = 41.2 bits (95), Expect = 0.012, Method: Compositional matrix adjust. 
Identities = 21/41 (51%), Positives = 25/41 (60%) Query: 48 TNKLPDLRGEFIRGWDDGRGIDTGRSILSIQGYATEDHAHG 88 T LP + GEFIR +DDGRG+D GR S Q A + H H Sbjct: 243 TFTLPIVGGEFIRMFDDGRGVDDGRVFGSFQEDAFQGHWHA 283 >UniRef50_C7BVI0 Structural protein n=1 Tax=Synechococcus phage S-RSM4 RepID=C7BVI0_9CAUD Length = 428 Score = 40.8 bits (94), Expect = 0.018, Method: Compositional matrix adjust. Identities = 23/100 (23%), Positives = 44/100 (44%), Gaps = 18/100 (18%) Query: 23 PTGWLKCNGAAFSAEEYPELAKAY-----------------PTNKLPDLRGEFIRGWD-D 64 P G+++C+G+ ++ YP LA+ ++PDLR +FIR Sbjct: 41 PAGYIRCDGSVYNENTYPALAQILGLGDACVFKQPDVTLNADQFQVPDLRSKFIRASSAS 100 Query: 65 GRGIDTGRSILSIQGYATEDHAHGLPSRSTIVTDATINFY 104 +G+ ++LS G E G+ S + + A ++ + Sbjct: 101 DQGVINDNTVLSATGLTVEKSGVGVQVSSNVGSTAVVDLF 140 >UniRef50_C3X1Y2 Tail fiber protein gpH n=1 Tax=Oxalobacter formigenes HOxBLS RepID=C3X1Y2_OXAFO Length = 480 Score = 40.4 bits (93), Expect = 0.019, Method: Compositional matrix adjust. Identities = 24/65 (36%), Positives = 31/65 (47%), Gaps = 10/65 (15%) Query: 7 SALPVGVPVPWPSATPPTGWLKCNGAAFSAEEYPELAKAYPTN----------KLPDLRG 56 S +P+G + ATPP G+LK +GAA YP+L A T LPD+ G Sbjct: 187 SGVPIGTVEYFAMATPPAGYLKADGAAVGRATYPDLFAAIGTTFGAGDGETTFNLPDMIG 246 Query: 57 EFIRG 61 F G Sbjct: 247 RFAEG 251 >UniRef50_D1NFN8 Apo-citrate lyase phosphoribosyl-dephospho-CoA transferase (Fragment) n=1 Tax=Haemophilus influenzae HK1212 RepID=D1NFN8_HAEIN Length = 301 Score = 40.4 bits (93), Expect = 0.019, Method: Compositional matrix adjust. Identities = 20/48 (41%), Positives = 31/48 (64%), Gaps = 2/48 (4%) Query: 9 LPVGVPVPWPSA-TPPTGWLKCNGAAFSAEEYPELAKAY-PTNKLPDL 54 +P G V +P A T P G+LK NG+ F+ + +P+L + +N+LPDL Sbjct: 65 IPTGAVVSFPRAVTNPVGFLKANGSTFNQQTFPDLYRVLGNSNQLPDL 112 >UniRef50_B7LN99 Putative tail fiber protein n=2 Tax=Enterobacteriaceae RepID=B7LN99_ESCF3 Length = 593 Score = 40.0 bits (92), Expect = 0.026, Method: Compositional matrix adjust. 
Identities = 29/70 (41%), Positives = 39/70 (55%), Gaps = 5/70 (7%) Query: 7 SALPVGVPVPWPSATPPTGWLKCNGAAFSAEEYPELAKAYPTNKLPDLRGEFIRGWDDGR 66 +A PVG P+ WPS P G+ G F YP LA AYP+ +PD+RG I+G Sbjct: 389 NAFPVGAPIAWPSDIVPEGYAIMQGQTFDKAAYPLLAAAYPSGVIPDMRGWTIKGKP--- 445 Query: 67 GIDTGRSILS 76 +GR++LS Sbjct: 446 --ASGRAVLS 453 >UniRef50_C3X192 Predicted protein n=1 Tax=Oxalobacter formigenes HOxBLS RepID=C3X192_OXAFO Length = 361 Score = 40.0 bits (92), Expect = 0.030, Method: Compositional matrix adjust. Identities = 24/65 (36%), Positives = 32/65 (49%), Gaps = 10/65 (15%) Query: 7 SALPVGVPVPWPSATPPTGWLKCNGAAFSAEEYPELAKAYPTN----------KLPDLRG 56 S +P+G + ATPP G+LK +GAA YP+L A T LPD+ G Sbjct: 78 SGVPIGTVEYFAMATPPAGYLKADGAAVGRATYPDLFAAIGTTFGAGDGETTFNLPDMIG 137 Query: 57 EFIRG 61 +F G Sbjct: 138 QFAEG 142 >UniRef50_A6EAB8 Phage Tail Collar n=1 Tax=Pedobacter sp. BAL39 RepID=A6EAB8_9SPHI Length = 183 Score = 39.3 bits (90), Expect = 0.050, Method: Compositional matrix adjust. Identities = 30/97 (30%), Positives = 44/97 (45%), Gaps = 19/97 (19%) Query: 23 PTGWLKCNGAAFSAE----EYPELAKAYPTN-----KLPDLRGEFIRGWDDGRGIDTGRS 73 P W+ CNGA + + Y + Y +N K+PDLRG G G G+ T R Sbjct: 18 PVDWMMCNGATLTVQGNEALYSLIGSTYGSNGPTDFKVPDLRGRLTVGQGLGTGL-TSRI 76 Query: 74 ILSIQGYATE--------DHAHGLPSRSTIVTDATIN 102 + S+ G T H H L + ST+ + A++N Sbjct: 77 LGSVGGAETVALTEAQLPAHNHNL-TVSTVTSPASVN 112 >UniRef50_B6S308 ORF32 (Fragment) n=3 Tax=Enterobacteriaceae RepID=B6S308_SALDU Length = 427 Score = 39.3 bits (90), Expect = 0.051, Method: Compositional matrix adjust. Identities = 17/39 (43%), Positives = 21/39 (53%) Query: 10 PVGVPVPWPSATPPTGWLKCNGAAFSAEEYPELAKAYPT 48 PVG + WPS P G+ G +F YP LA AYP+ Sbjct: 387 PVGAAIAWPSDATPAGYALMQGQSFDKSAYPLLAIAYPS 425 >UniRef50_C3X3R8 Predicted protein n=4 Tax=Oxalobacter formigenes HOxBLS RepID=C3X3R8_OXAFO Length = 365 Score = 38.9 bits (89), Expect = 0.069, Method: Compositional matrix adjust. 
Identities = 24/64 (37%), Positives = 32/64 (50%), Gaps = 12/64 (18%) Query: 9 LPVGVPVPWPSA-TPPTGWLKCNGAAFSAEEYPELAKAYPTN----------KLPDLRGE 57 +PVG + W + PP G+LKC+GAA + YP+L A T LPD+ G Sbjct: 88 VPVG-SIDWLAVPEPPAGYLKCDGAAIGRDTYPDLFAAIGTTFGAGDGETTFNLPDMIGR 146 Query: 58 FIRG 61 F G Sbjct: 147 FAEG 150 Searching..................................................done Results from round 2 Score E Sequences producing significant alignments: (bits) Value Sequences used in model and found again: UniRef50_P33227 Putative protein stfE (Fragment) n=60 Tax=Entero... 240 1e-62 UniRef50_C6V0Q3 Phage tail fiber protein n=13 Tax=Enterobacteria... 204 7e-52 UniRef50_B3I9S3 DNA inversion product n=5 Tax=Escherichia coli R... 194 9e-49 UniRef50_Q9MCR6 Tail fiber n=1 Tax=Enterobacteria phage HK97 Rep... 191 6e-48 UniRef50_A9R3H4 Tail collar domain protein n=20 Tax=Yersinia Rep... 191 8e-48 UniRef50_D2TR91 Putative phage tail fibre protein n=1 Tax=Citrob... 187 1e-46 UniRef50_Q9AHZ5 YdaB n=29 Tax=Enterobacteriaceae RepID=Q9AHZ5_PHOLU 181 5e-45 UniRef50_Q6D3Y6 Probable bacteriophage variable tail fiber prote... 179 3e-44 UniRef50_B7MJL6 Putative phage tail protein n=3 Tax=Escherichia ... 176 3e-43 UniRef50_D2U1K0 Phage tail protein n=1 Tax=Arsenophonus nasoniae... 176 3e-43 UniRef50_A4WEL3 Phage Tail Collar domain protein n=2 Tax=Enterob... 175 6e-43 UniRef50_D2U2G6 Phage tail assembly protein n=1 Tax=Arsenophonus... 175 6e-43 UniRef50_C6DE08 Tail Collar domain protein n=1 Tax=Pectobacteriu... 174 1e-42 UniRef50_D2BT14 Tail Collar domain protein n=1 Tax=Dickeya dadan... 173 2e-42 UniRef50_C6CGA0 Tail Collar domain protein n=5 Tax=Enterobacteri... 172 3e-42 UniRef50_B7NJP1 Putative side tail fiber protein homolog from la... 172 3e-42 UniRef50_D0KLI8 Tail Collar domain protein n=1 Tax=Pectobacteriu... 171 6e-42 UniRef50_B2PZV1 Putative uncharacterized protein n=2 Tax=Gammapr... 171 8e-42 UniRef50_UPI000190EC42 bacteriophage tail fiber protein n=1 Tax=... 
171 9e-42 UniRef50_B7US81 Predicted tail fiber protein n=16 Tax=root RepID... 171 1e-41 UniRef50_UPI0001B5347E putative variable tail fiber protein n=1 ... 170 2e-41 UniRef50_D2BYH6 Tail Collar domain protein n=1 Tax=Dickeya dadan... 169 3e-41 UniRef50_C6CP84 Tail Collar domain protein n=2 Tax=Dickeya RepID... 169 3e-41 UniRef50_Q7N348 Similarities with tail fiber protein n=1 Tax=Pho... 168 5e-41 UniRef50_Q7N2Q1 Similar to Sc/SvQ protein of Escherichia coli pl... 167 1e-40 UniRef50_B3I8J5 DNA inversion product n=3 Tax=Enterobacteriaceae... 167 1e-40 UniRef50_C7BL21 Similarities with phage tail fiber protein n=1 T... 167 1e-40 UniRef50_Q7N5C0 Similarities with DNA inversion product n=5 Tax=... 166 2e-40 UniRef50_C6DA10 Tail Collar domain protein n=7 Tax=Enterobacteri... 166 3e-40 UniRef50_C4UEH4 Variable tail fiber protein n=3 Tax=Yersinia Rep... 164 8e-40 UniRef50_Q7N2R3 Similar to tail fiber assembly protein from bact... 164 1e-39 UniRef50_C7BSQ1 Side tail fiber protein homolog from lambdoid pr... 163 2e-39 UniRef50_C5BH14 Tail fiber n=2 Tax=Enterobacteriaceae RepID=C5BH... 162 4e-39 UniRef50_Q6D2U8 Bacteriophage tail fiber protein n=1 Tax=Pectoba... 161 7e-39 UniRef50_C6CGA4 Tail Collar domain protein n=7 Tax=Enterobacteri... 160 2e-38 UniRef50_Q7N6T1 Similarities with DNA inversion product n=2 Tax=... 157 8e-38 UniRef50_C6C5D2 Tail Collar domain protein n=2 Tax=Dickeya dadan... 157 1e-37 UniRef50_D1TPQ4 Phage tail collar domain protein n=1 Tax=Yersini... 156 3e-37 UniRef50_C6CP88 Tail Collar domain protein n=5 Tax=Enterobacteri... 154 8e-37 UniRef50_C4S5W0 Putative uncharacterized protein n=2 Tax=Yersini... 153 2e-36 UniRef50_C6C5D4 Tail Collar domain protein n=1 Tax=Dickeya dadan... 151 6e-36 UniRef50_C6CG98 Tail Collar domain protein n=1 Tax=Dickeya zeae ... 150 2e-35 UniRef50_C6C6Z0 Tail Collar domain protein n=1 Tax=Dickeya dadan... 150 2e-35 UniRef50_Q65WH4 Putative uncharacterized protein n=1 Tax=Mannhei... 
148 6e-35 UniRef50_C4TTI1 Tail fiber protein n=2 Tax=Yersinia RepID=C4TTI1... 148 7e-35 UniRef50_Q7NAA0 Complete genome; segment 1/17 n=2 Tax=Photorhabd... 148 7e-35 UniRef50_A7FIU0 Tail collar domain protein n=1 Tax=Yersinia pseu... 145 7e-34 UniRef50_C3X8U2 Phage Tail Collar Domain containing protein n=1 ... 144 1e-33 UniRef50_B6VKW8 Sc/svq protein n=2 Tax=Photorhabdus asymbiotica ... 143 3e-33 UniRef50_Q1I687 Putative phage variable tail fibre protein n=1 T... 142 6e-33 UniRef50_Q4KHC6 Tail fibre protein, putative n=1 Tax=Pseudomonas... 141 7e-33 UniRef50_C4GFX3 Putative uncharacterized protein n=2 Tax=Kingell... 140 1e-32 UniRef50_C8PDQ5 Phage Tail Collar Domain protein n=1 Tax=Campylo... 139 2e-32 UniRef50_C6ABW9 Phage tail collar protein n=1 Tax=Bartonella gra... 139 3e-32 UniRef50_B8DPV9 Tail Collar domain protein n=1 Tax=Desulfovibrio... 138 6e-32 UniRef50_D0KGE1 Shufflon domain protein n=1 Tax=Pectobacterium w... 135 6e-31 UniRef50_C6BT48 Tail Collar domain protein n=1 Tax=Desulfovibrio... 135 7e-31 UniRef50_C3KCU2 Putative phage tail fiber-related protein n=1 Ta... 134 8e-31 UniRef50_C3X8Y3 Putative uncharacterized protein n=1 Tax=Oxaloba... 134 1e-30 UniRef50_B5S308 Phage tail collar protein n=2 Tax=Ralstonia sola... 132 4e-30 UniRef50_Q3KH70 Putative phage tail fiber-related protein n=1 Ta... 132 5e-30 UniRef50_A9DEM1 Tail protein 2 n=1 Tax=Yersinia phage PY100 RepI... 132 5e-30 UniRef50_B4T041 Gp19 n=1 Tax=Salmonella enterica subsp. enterica... 131 1e-29 UniRef50_B2ZY49 Phage tail collar domain protein n=1 Tax=Ralston... 130 1e-29 UniRef50_C3X912 Phage tail collar domain-containing protein n=1 ... 128 6e-29 UniRef50_A0NQ95 Putative tail fiber-related protein n=1 Tax=Labr... 128 7e-29 UniRef50_C7BSQ6 Putative tail fiber protein n=1 Tax=Photorhabdus... 127 2e-28 UniRef50_C3X3G6 Putative uncharacterized protein n=1 Tax=Oxaloba... 126 2e-28 UniRef50_P03764 Side tail fiber protein n=2 Tax=root RepID=STF_L... 
125 4e-28 UniRef50_B7UGJ3 Predicted tai fiber protein n=15 Tax=Escherichia... 124 1e-27 UniRef50_B7LKX7 Putative side tail phage protein n=2 Tax=Escheri... 123 2e-27 UniRef50_B5FQX9 Side tail fiber protein n=2 Tax=Salmonella enter... 123 2e-27 UniRef50_D0KGE5 Tail Collar domain protein n=4 Tax=Pectobacteriu... 123 2e-27 UniRef50_B2FIY3 Putative phage collar protein n=1 Tax=Stenotroph... 123 2e-27 UniRef50_A9IXL3 Phage-related protein n=6 Tax=Bartonella RepID=A... 121 7e-27 UniRef50_A9IRI0 Phage related protein n=9 Tax=Bartonella RepID=A... 121 7e-27 UniRef50_Q7P172 Probable bacteriophge tail fiber protein n=1 Tax... 120 1e-26 UniRef50_D0KGE3 Tail Collar domain protein n=1 Tax=Pectobacteriu... 120 1e-26 UniRef50_A1VSH6 Phage Tail Collar domain protein n=1 Tax=Polarom... 120 2e-26 UniRef50_Q7Y2B3 Gp12 Short tail fibers n=2 Tax=unclassified T4-l... 120 2e-26 UniRef50_Q7N047 Similarities with bacteriophage protein n=3 Tax=... 119 3e-26 UniRef50_A6VBH2 Putative tail fiber protein n=2 Tax=Pseudomonas ... 118 7e-26 UniRef50_A6E6G6 Phage Tail Collar n=1 Tax=Pedobacter sp. BAL39 R... 117 1e-25 UniRef50_B3XIH6 GpH n=7 Tax=Enterobacteriaceae RepID=B3XIH6_ECOLX 116 3e-25 UniRef50_P76072 Side tail fiber protein homolog from lambdoid pr... 116 3e-25 UniRef50_B3HKW0 Phage Tail Collar Domain protein n=11 Tax=Entero... 115 4e-25 UniRef50_UPI0001A44C27 bacteriophage tail fiber protein n=2 Tax=... 114 9e-25 UniRef50_B3I2W7 Phage Tail Collar Domain family n=1 Tax=Escheric... 113 2e-24 UniRef50_A4TT73 Phage tail protein n=30 Tax=Enterobacteriaceae R... 113 3e-24 UniRef50_C2I7P2 Phage-related tail fiber protein n=1 Tax=Vibrio ... 112 3e-24 UniRef50_Q99362 Protein 37 (Fragment) n=1 Tax=Enterobacteria pha... 112 4e-24 UniRef50_Q116W7 Phage Tail Collar n=1 Tax=Trichodesmium erythrae... 112 5e-24 UniRef50_B8FJJ3 Tail Collar domain protein n=1 Tax=Desulfatibaci... 112 5e-24 UniRef50_UPI00019136B5 bacteriophage tail fiber protein n=7 Tax=... 
111 1e-23 UniRef50_A4PE45 Tail fiber protein gpH n=3 Tax=root RepID=A4PE45... 110 1e-23 UniRef50_C5J9F2 Putative uncharacterized protein (Fragment) n=1 ... 110 2e-23 UniRef50_A9ITX5 Phage-related protein n=6 Tax=Bartonella RepID=A... 109 4e-23 UniRef50_D0ZBI4 Putative tail fiber protein n=1 Tax=Edwardsiella... 109 4e-23 UniRef50_D0IJ09 Tail fiber protein H putative n=1 Tax=Vibrio sp.... 107 2e-22 UniRef50_Q7P176 Probable bacteriophge tail fiber protein n=1 Tax... 106 2e-22 UniRef50_Q727X4 Tail fiber protein, putative n=4 Tax=Desulfovibr... 106 3e-22 UniRef50_Q0BEK5 Phage Tail Collar domain protein n=1 Tax=Burkhol... 105 4e-22 UniRef50_Q3ZL14 Tail fiber protein n=2 Tax=Enterobacteriaceae Re... 105 4e-22 UniRef50_B8HZW5 Tail Collar domain protein n=2 Tax=Clostridium R... 105 6e-22 UniRef50_Q31Q92 Putative uncharacterized protein n=2 Tax=Synecho... 104 8e-22 UniRef50_B5PP06 Side tail fiber protein n=2 Tax=Salmonella enter... 104 2e-21 UniRef50_Q56BI6 Gp12 short tail fibers n=1 Tax=Enterobacteria ph... 101 1e-20 UniRef50_B7L485 Putative tail fiber protein n=4 Tax=Escherichia ... 100 3e-20 UniRef50_UPI00016A4B89 phage-related tail fiber protein n=2 Tax=... 99 4e-20 UniRef50_Q3YZL1 Phage protein-related n=42 Tax=Enterobacteriacea... 99 4e-20 UniRef50_B5JYG6 Phage Tail Collar Domain family (Fragment) n=1 T... 99 4e-20 UniRef50_A7INV5 Tail Collar domain protein n=1 Tax=Xanthobacter ... 98 1e-19 UniRef50_B9BDD9 Bacteriophage protein n=3 Tax=Burkholderia RepID... 98 1e-19 UniRef50_C5B185 Putative uncharacterized protein n=1 Tax=Methylo... 96 3e-19 UniRef50_C5ABB4 Putative uncharacterized protein n=1 Tax=Burkhol... 96 4e-19 UniRef50_C5A8Q3 Phage-related tail fiber protein n=1 Tax=Burkhol... 95 7e-19 UniRef50_B3X2T1 Side tail fiber protein n=1 Tax=Shigella dysente... 95 9e-19 UniRef50_A9ITY4 Phage related protein n=6 Tax=Bartonella RepID=A... 93 3e-18 UniRef50_Q2T5M0 Phage-related tail fiber protein n=19 Tax=root R... 
93 3e-18 UniRef50_B4EF34 Putative phage tail protein n=1 Tax=Burkholderia... 92 5e-18 UniRef50_B8HZW4 Tail Collar domain protein n=2 Tax=Clostridium R... 92 8e-18 UniRef50_Q03314 Protein rhiB n=2 Tax=Rhizobium leguminosarum bv.... 89 4e-17 UniRef50_B3R3K1 Bacteriophage large tail fiber protein n=1 Tax=C... 88 1e-16 UniRef50_Q93D60 PilV n=7 Tax=Escherichia coli RepID=Q93D60_ECOLX 87 2e-16 UniRef50_A3YFP9 35 kDa protein-like n=1 Tax=Marinomonas sp. MED1... 84 2e-15 UniRef50_A3GRX3 Tail fiber like-protein n=1 Tax=Vibrio cholerae ... 83 3e-15 UniRef50_B5TAB1 Gp47 n=2 Tax=root RepID=B5TAB1_9CAUD 81 1e-14 UniRef50_A8YDB4 Genome sequencing data, contig C291 n=2 Tax=Micr... 78 1e-13 UniRef50_Q4TVW2 Tail fiber protein n=4 Tax=Viruses RepID=Q4TVW2_... 77 3e-13 UniRef50_C3XAA4 Putative uncharacterized protein n=1 Tax=Oxaloba... 75 1e-12 UniRef50_C0DSG4 Putative uncharacterized protein n=1 Tax=Eikenel... 72 5e-12 UniRef50_C3X8V5 Bacteriophage tail fiber protein n=2 Tax=Oxaloba... 71 2e-11 UniRef50_P51735 Probable tail fiber protein n=27 Tax=root RepID=... 69 7e-11 UniRef50_A4NHY2 Probable tail fiber protein n=1 Tax=Haemophilus ... 67 2e-10 UniRef50_A3GUE7 Tail fiber protein H, putative (Fragment) n=1 Ta... 67 2e-10 UniRef50_D2L4G0 Tail Collar domain protein n=1 Tax=Desulfovibrio... 65 1e-09 UniRef50_B0USC5 Phage Tail Collar domain protein n=1 Tax=Haemoph... 65 1e-09 Sequences not found previously or not previously below threshold: UniRef50_C4MYW8 Gp12 Short tail fibers n=1 Tax=Enterobacteria ph... 98 9e-20 UniRef50_B7LN99 Putative tail fiber protein n=2 Tax=Enterobacter... 93 3e-18 UniRef50_A1HR57 Putative uncharacterized protein n=1 Tax=Thermos... 90 3e-17 UniRef50_C3X8I7 Putative uncharacterized protein n=1 Tax=Oxaloba... 74 2e-12 UniRef50_D2V5I7 Microcystin-dependent protein n=1 Tax=Naegleria ... 74 2e-12 UniRef50_P10930 Short tail fiber protein n=8 Tax=Myoviridae RepI... 74 2e-12 UniRef50_C3X971 Predicted protein n=6 Tax=Oxalobacter formigenes... 
72 6e-12 UniRef50_D2MH12 Tail Collar domain protein n=1 Tax=Rhodopseudomo... 70 3e-11 UniRef50_C9QG11 Probable tail fiber protein n=1 Tax=Vibrio orien... 70 3e-11 UniRef50_C3X909 Predicted protein n=2 Tax=Oxalobacter formigenes... 70 3e-11 UniRef50_C3X1Y2 Tail fiber protein gpH n=1 Tax=Oxalobacter formi... 69 5e-11 UniRef50_C3X3W1 Predicted protein n=2 Tax=Oxalobacter formigenes... 69 6e-11 UniRef50_C3X192 Predicted protein n=1 Tax=Oxalobacter formigenes... 68 1e-10 UniRef50_B5TK79 Tail collar protein n=2 Tax=root RepID=B5TK79_9VIRU 67 2e-10 UniRef50_A9AVC5 Tail Collar domain protein n=1 Tax=Herpetosiphon... 67 2e-10 UniRef50_A3YA17 Prophage MuSo2, tail fiber protein, putative n=1... 66 3e-10 UniRef50_UPI00016C4891 hypothetical protein GobsU_00190 n=1 Tax=... 66 5e-10 UniRef50_A3Y8Q8 Putative uncharacterized protein n=1 Tax=Marinom... 66 6e-10 UniRef50_A9LZ37 Tail fibre protein, putative n=21 Tax=Neisseria ... 65 1e-09 UniRef50_A6EAC0 Microcystin-dependent protein n=1 Tax=Pedobacter... 64 1e-09 UniRef50_A9AVE2 Tail Collar domain protein n=1 Tax=Herpetosiphon... 64 2e-09 UniRef50_C3X3K6 Predicted protein n=2 Tax=Oxalobacter formigenes... 64 2e-09 UniRef50_C7PCL6 Putative uncharacterized protein n=1 Tax=Chitino... 64 2e-09 UniRef50_C3X8R9 Bacteriophage tail fiber protein n=6 Tax=Oxaloba... 64 3e-09 UniRef50_Q094A8 Phage Tail Collar Domain family n=1 Tax=Stigmate... 63 3e-09 UniRef50_C6S6V6 Putative phage tail fibre protein n=1 Tax=Neisse... 63 4e-09 UniRef50_Q4C9U4 Phage Tail Collar n=1 Tax=Crocosphaera watsonii ... 62 6e-09 UniRef50_A6EAB8 Phage Tail Collar n=1 Tax=Pedobacter sp. BAL39 R... 61 1e-08 UniRef50_B6S308 ORF32 (Fragment) n=3 Tax=Enterobacteriaceae RepI... 60 2e-08 UniRef50_A4YX40 Putative uncharacterized protein n=1 Tax=Bradyrh... 60 2e-08 UniRef50_B5JF21 Phage Tail Collar Domain family n=1 Tax=Verrucom... 60 2e-08 UniRef50_B9M3Z7 Tail Collar domain protein n=1 Tax=Geobacter sp.... 
60 2e-08 UniRef50_Q2W7B2 Microcystin-dependent protein n=1 Tax=Magnetospi... 60 3e-08 UniRef50_B3QRT1 Tail Collar domain protein n=1 Tax=Chloroherpeto... 60 3e-08 UniRef50_D0LMW0 Tail Collar domain protein n=1 Tax=Haliangium oc... 59 4e-08 UniRef50_C3X3R8 Predicted protein n=4 Tax=Oxalobacter formigenes... 59 7e-08 UniRef50_B0UTN0 Phage Tail Collar domain protein n=1 Tax=Haemoph... 58 1e-07 UniRef50_A1TNG3 Phage Tail Collar domain protein n=9 Tax=Bacteri... 57 1e-07 UniRef50_D1BW55 Tail Collar domain protein n=1 Tax=Xylanimonas c... 57 2e-07 UniRef50_C5RJD9 Tail Collar domain protein n=1 Tax=Clostridium c... 57 3e-07 UniRef50_C6X0H3 Microcystin dependent protein n=1 Tax=Flavobacte... 56 3e-07 UniRef50_Q8PR98 Microcystin dependent protein n=1 Tax=Xanthomona... 56 5e-07 UniRef50_D1NFN8 Apo-citrate lyase phosphoribosyl-dephospho-CoA t... 55 6e-07 UniRef50_B1JGT8 Putative uncharacterized protein n=1 Tax=Yersini... 55 7e-07 UniRef50_B5RPA6 Uncharacterized conserved protein n=73 Tax=Borre... 55 8e-07 UniRef50_Q11LT1 Microcystin-dependent protein-like n=1 Tax=Chela... 55 8e-07 UniRef50_C5RN01 Tail Collar domain protein n=1 Tax=Clostridium c... 55 9e-07 UniRef50_B5ZGB2 Tail Collar domain protein n=4 Tax=Gluconacetoba... 55 1e-06 UniRef50_Q4ZMK7 Putative uncharacterized protein n=2 Tax=Pseudom... 54 2e-06 UniRef50_UPI0001BC923E Phage tail Collar n=1 Tax=Pseudomonas syr... 54 2e-06 UniRef50_Q4KAW3 Putative uncharacterized protein n=1 Tax=Pseudom... 54 2e-06 UniRef50_Q1QPI5 Phage Tail Collar n=10 Tax=Proteobacteria RepID=... 54 2e-06 UniRef50_D2QTE9 Tail Collar domain protein n=1 Tax=Spirosoma lin... 54 3e-06 UniRef50_B8DLJ2 Tail fiber protein, putative n=3 Tax=Desulfovibr... 54 3e-06 UniRef50_Q0F1S9 Putative uncharacterized protein n=1 Tax=Maripro... 53 3e-06 UniRef50_Q72P75 Putative uncharacterized protein n=4 Tax=Leptosp... 53 4e-06 UniRef50_A4P195 Putative phage tail fibre protein (Fragment) n=1... 
53 4e-06 UniRef50_Q55EP2 Putative uncharacterized protein n=1 Tax=Dictyos... 53 4e-06 UniRef50_Q8PR97 Microcystin dependent protein n=1 Tax=Xanthomona... 52 5e-06 UniRef50_Q73NL1 Tail fiber domain protein n=1 Tax=Treponema dent... 52 5e-06 UniRef50_Q4UNP6 Microcystin dependent protein n=8 Tax=Bacteria R... 52 6e-06 UniRef50_Q2W7B1 Microcystin-dependent protein n=3 Tax=Proteobact... 52 7e-06 UniRef50_UPI0001A44BB4 microcystin dependent protein n=1 Tax=Pec... 52 8e-06 UniRef50_UPI0001AF6092 hypothetical protein Psyrpo1_27141 n=2 Ta... 52 1e-05 UniRef50_B3FYL6 Gp17 n=1 Tax=Salmonella phage phiSG-JL2 RepID=B3... 52 1e-05 UniRef50_Q8Y365 Putative uncharacterized protein n=4 Tax=Ralston... 51 1e-05 UniRef50_B5RPA7 Uncharacterized conserved protein n=10 Tax=Borre... 51 1e-05 UniRef50_B1M1N8 Tail Collar domain protein n=1 Tax=Methylobacter... 51 2e-05 UniRef50_C9PG79 Putative phage tail protein n=1 Tax=Vibrio furni... 51 2e-05 UniRef50_A9DEL7 Tail fiber protein 2 n=1 Tax=Yersinia phage PY10... 50 2e-05 UniRef50_C7BVI0 Structural protein n=1 Tax=Synechococcus phage S... 50 2e-05 UniRef50_C6DJW4 Tail Collar domain protein n=2 Tax=Pectobacteriu... 50 2e-05 UniRef50_Q2RUE1 Phage Tail Collar n=5 Tax=Proteobacteria RepID=Q... 50 2e-05 UniRef50_C0YLU9 Phage tail collar domain-containing protein n=1 ... 50 2e-05 UniRef50_Q66BF2 Hypothetical phage protein n=1 Tax=Yersinia pseu... 50 3e-05 UniRef50_A6EAB9 Microcystin-dependent protein n=1 Tax=Pedobacter... 50 3e-05 UniRef50_C7PNC3 Tail Collar domain protein n=1 Tax=Chitinophaga ... 50 3e-05 UniRef50_C6X0H2 Phage tail collar domain protein n=1 Tax=Flavoba... 49 4e-05 UniRef50_C7PNC2 Tail Collar domain protein n=1 Tax=Chitinophaga ... 49 4e-05 UniRef50_A0A7D3 Putative uncharacterized protein n=1 Tax=Microcy... 49 5e-05 UniRef50_D1Y7E0 Collagen alpha 1 n=1 Tax=Pyramidobacter piscolen... 49 7e-05 UniRef50_C6MD19 Tail Collar domain protein n=2 Tax=Proteobacteri... 
49 7e-05 UniRef50_A6N211 Probable tail fiber protein n=1 Tax=Microbacteri... 49 8e-05 UniRef50_B1KMR6 Tail Collar domain protein n=1 Tax=Shewanella wo... 49 9e-05 UniRef50_A5GA41 Phage Tail Collar domain protein n=1 Tax=Geobact... 48 1e-04 UniRef50_C2FWA0 Phage tail collar domain protein n=2 Tax=Sphingo... 48 1e-04 UniRef50_A1SXZ3 Phage Tail Collar domain protein n=4 Tax=Bacteri... 48 1e-04 UniRef50_D0KG77 Tail Collar domain protein n=1 Tax=Pectobacteriu... 48 1e-04 UniRef50_B2SVF7 Phage-related protein n=3 Tax=Xanthomonas oryzae... 47 2e-04 UniRef50_C5BRC7 Phage tail collar domain protein n=1 Tax=Teredin... 47 2e-04 UniRef50_Q89L34 Blr4714 protein n=1 Tax=Bradyrhizobium japonicum... 47 2e-04 UniRef50_C7PE74 Tail Collar domain protein n=3 Tax=Bacteria RepI... 47 3e-04 UniRef50_B9K0L5 Putative uncharacterized protein n=1 Tax=Agrobac... 47 3e-04 UniRef50_Q58MY1 Predicted protein n=1 Tax=Prochlorococcus phage ... 47 3e-04 UniRef50_B1J270 Tail Collar domain protein n=8 Tax=Bacteria RepI... 47 3e-04 UniRef50_C3X3W3 Predicted protein n=1 Tax=Oxalobacter formigenes... 47 3e-04 UniRef50_Q12HS4 Phage Tail Collar n=1 Tax=Shewanella denitrifica... 47 3e-04 UniRef50_C7PNC4 Tail Collar domain protein n=1 Tax=Chitinophaga ... 46 4e-04 UniRef50_A1TUY7 Phage Tail Collar domain protein n=4 Tax=Acidovo... 46 4e-04 UniRef50_B4VMZ3 Phage Tail Collar Domain family n=1 Tax=Microcol... 46 5e-04 UniRef50_Q4KAW2 Phage tail collar domain protein n=1 Tax=Pseudom... 46 5e-04 UniRef50_A8T9J8 Putative uncharacterized protein n=1 Tax=Vibrio ... 46 5e-04 UniRef50_B9Z2Z1 Putative uncharacterized protein n=1 Tax=Lutiell... 46 5e-04 UniRef50_B3PJI6 Microcystin dependent protein; MdpB n=1 Tax=Cell... 46 5e-04 UniRef50_Q2S9H9 Microcystin-dependent protein n=3 Tax=Proteobact... 46 6e-04 UniRef50_Q4KAW4 Putative uncharacterized protein n=1 Tax=Pseudom... 45 6e-04 UniRef50_B9M3Z9 Tail Collar domain protein n=2 Tax=Bacteria RepI... 
45 7e-04 UniRef50_Q1QPI6 Phage Tail Collar n=2 Tax=Proteobacteria RepID=Q... 45 7e-04 UniRef50_Q5GQB8 Putative short tail fibre n=1 Tax=Synechococcus ... 45 7e-04 UniRef50_B0SX68 Tail Collar domain protein n=6 Tax=Bacteria RepI... 45 9e-04 UniRef50_A9C0W7 Tail Collar domain protein n=6 Tax=Proteobacteri... 45 9e-04 UniRef50_C7ID92 Tail Collar domain protein n=1 Tax=Clostridium p... 45 0.001 UniRef50_C2FWA1 Phage tail collar domain protein n=2 Tax=Sphingo... 45 0.001 >UniRef50_P33227 Putative protein stfE (Fragment) n=60 Tax=Enterobacteriaceae RepID=STFE_ECOLI Length = 166 Score = 240 bits (612), Expect = 1e-62, Method: Composition-based stats. Identities = 166/166 (100%), Positives = 166/166 (100%) Query: 1 VGLGEGSALPVGVPVPWPSATPPTGWLKCNGAAFSAEEYPELAKAYPTNKLPDLRGEFIR 60 VGLGEGSALPVGVPVPWPSATPPTGWLKCNGAAFSAEEYPELAKAYPTNKLPDLRGEFIR Sbjct: 1 VGLGEGSALPVGVPVPWPSATPPTGWLKCNGAAFSAEEYPELAKAYPTNKLPDLRGEFIR 60 Query: 61 GWDDGRGIDTGRSILSIQGYATEDHAHGLPSRSTIVTDATINFYFDEIWVNSGTDIIKRG 120 GWDDGRGIDTGRSILSIQGYATEDHAHGLPSRSTIVTDATINFYFDEIWVNSGTDIIKRG Sbjct: 61 GWDDGRGIDTGRSILSIQGYATEDHAHGLPSRSTIVTDATINFYFDEIWVNSGTDIIKRG 120 Query: 121 NTNDAGLPAPDYGTFKTYKQSVDGLGAAASETRPRNIAFNYIVRAA 166 NTNDAGLPAPDYGTFKTYKQSVDGLGAAASETRPRNIAFNYIVRAA Sbjct: 121 NTNDAGLPAPDYGTFKTYKQSVDGLGAAASETRPRNIAFNYIVRAA 166 >UniRef50_C6V0Q3 Phage tail fiber protein n=13 Tax=Enterobacteriaceae RepID=C6V0Q3_ECO5T Length = 439 Score = 204 bits (520), Expect = 7e-52, Method: Composition-based stats. 
Identities = 96/166 (57%), Positives = 112/166 (67%), Gaps = 2/166 (1%) Query: 1 VGLGEGSALPVGVPVPWPSATPPTGWLKCNGAAFSAEEYPELAKAYPTNKLPDLRGEFIR 60 +GLGEGSALPVGVPVPWPSATPPTGWLKCNGAAFSAEEYPELAK YPTNKLPDLRGEFIR Sbjct: 276 LGLGEGSALPVGVPVPWPSATPPTGWLKCNGAAFSAEEYPELAKVYPTNKLPDLRGEFIR 335 Query: 61 GWDDGRGIDTGRSILSIQGYATEDHAHGLPSRSTIVTDATINFYFDEIWVNSGTDIIKRG 120 GWDDGRG+D GR +L++Q A H H ++ D T+ + + T + Sbjct: 336 GWDDGRGVDNGRGLLTLQDGAIVSHNHYWGIWTSRTNDQTLESFTGTTILKQITPL--SP 393 Query: 121 NTNDAGLPAPDYGTFKTYKQSVDGLGAAASETRPRNIAFNYIVRAA 166 N P P+ + + A A+ETRPRN+AFNYIVRAA Sbjct: 394 AINFDNYPIPNPAITEGGVVAATTKPAGANETRPRNVAFNYIVRAA 439 >UniRef50_B3I9S3 DNA inversion product n=5 Tax=Escherichia coli RepID=B3I9S3_ECOLX Length = 546 Score = 194 bits (493), Expect = 9e-49, Method: Composition-based stats. Identities = 97/169 (57%), Positives = 114/169 (67%), Gaps = 3/169 (1%) Query: 1 VGLGEGSALPVGVPVPWPSATPPTGWLKCNGAAFSAEEYPELAKAYPTNKLPDLRGEFIR 60 +GLGEGSALPVGVPVPWPSATPPTGWLKCNGAAFS EEYPELAKAYPTNKLPDLRGEFIR Sbjct: 378 LGLGEGSALPVGVPVPWPSATPPTGWLKCNGAAFSVEEYPELAKAYPTNKLPDLRGEFIR 437 Query: 61 GWDDGRGIDTGRSILSIQGYATEDHAHGLP--SRSTIVTDATINFYFDEIWVNSG-TDII 117 GWDDGRGIDTGR++L+ Q + DHAH + + + + I G I+ Sbjct: 438 GWDDGRGIDTGRALLNWQPHTILDHAHYMELWTGDGLAAGSAREGVNPGILATYGDGGIV 497 Query: 118 KRGNTNDAGLPAPDYGTFKTYKQSVDGLGAAASETRPRNIAFNYIVRAA 166 K + + ++ K+ + +ETRPRNIAFNYIVRAA Sbjct: 498 KTDEPGLKVPSSLRAISSRSVKRYGEISENVGTETRPRNIAFNYIVRAA 546 >UniRef50_Q9MCR6 Tail fiber n=1 Tax=Enterobacteria phage HK97 RepID=Q9MCR6_BPHK7 Length = 321 Score = 191 bits (486), Expect = 6e-48, Method: Composition-based stats. 
Identities = 99/165 (60%), Positives = 107/165 (64%), Gaps = 3/165 (1%) Query: 1 VGLGEGSALPVGVPVPWPSATPPTGWLKCNGAAFSAEEYPELAKAYPTNKLPDLRGEFIR 60 +GLGEGSALPVGVPVPWPSATPPTGWLKCNGA FSAEEYPELAKAYPTNKLPDLRGEFIR Sbjct: 159 LGLGEGSALPVGVPVPWPSATPPTGWLKCNGAVFSAEEYPELAKAYPTNKLPDLRGEFIR 218 Query: 61 GWDDGRGIDTGRSILSIQGYATEDHAHGLPSRSTIVTDATINFYFDEIWVNSGTDIIKRG 120 GWDDGRGID GR ILS QG A + T V + + D ++V G Sbjct: 219 GWDDGRGIDAGREILSAQGDAIRNITGTFGDGETEVNASISFYRADGVFVTQKKLRNTIG 278 Query: 121 NTNDAGLPAPDYGTFKTYKQSVDGLGAAASETRPRNIAFNYIVRA 165 NT + A + ASE RPRNIAFNYIVRA Sbjct: 279 NTT---IIADTPNNPYLINFDASRVVPTASENRPRNIAFNYIVRA 320 >UniRef50_A9R3H4 Tail collar domain protein n=20 Tax=Yersinia RepID=A9R3H4_YERPG Length = 259 Score = 191 bits (485), Expect = 8e-48, Method: Composition-based stats. Identities = 81/166 (48%), Positives = 100/166 (60%), Gaps = 2/166 (1%) Query: 1 VGLGEGSALPVGVPVPWPSATPPTGWLKCNGAAFSAEEYPELAKAYPTNKLPDLRGEFIR 60 +GLGEGSA+ VGVP+PWP+ATPP GWLKCNGA F +YP+LA AYP+ LPDLRGEFIR Sbjct: 96 LGLGEGSAILVGVPLPWPTATPPEGWLKCNGAIFDKVKYPKLALAYPSGILPDLRGEFIR 155 Query: 61 GWDDGRGIDTGRSILSIQGYATEDHAHGLPSRSTIVTDATINFYFDEIWVNSGTDIIKRG 120 GWDDG G+D GR ILSIQG A + + G+ R+ + + ++ G Sbjct: 156 GWDDGLGVDAGREILSIQGDAIRNISGGIQGRNEATSARLFSSNATGVFRTDGQFGSYAA 215 Query: 121 NTNDAGLPAPDYGTFKTYKQSVDGLGAAASETRPRNIAFNYIVRAA 166 + + A D A+E RPRNIAFNYIVRAA Sbjct: 216 SADVAVGVTDD--RLAELFFDASRSVPTANENRPRNIAFNYIVRAA 259 >UniRef50_D2TR91 Putative phage tail fibre protein n=1 Tax=Citrobacter rodentium ICC168 RepID=D2TR91_CITRO Length = 279 Score = 187 bits (475), Expect = 1e-46, Method: Composition-based stats. 
Identities = 87/166 (52%), Positives = 102/166 (61%), Gaps = 21/166 (12%) Query: 1 VGLGEGSALPVGVPVPWPSATPPTGWLKCNGAAFSAEEYPELAKAYPTNKLPDLRGEFIR 60 +GLGEGSALPVGVPVPWPSATPP GWLKCNGA FS+ YP+L AYP+ KLPDLRGEFIR Sbjct: 135 LGLGEGSALPVGVPVPWPSATPPEGWLKCNGATFSSSLYPKLGLAYPSGKLPDLRGEFIR 194 Query: 61 GWDDGRGIDTGRSILSIQGYATEDHAHGLPSRSTIVTDATINFYFDEIWVNSGTDIIKRG 120 GWDDGRG D GRS+LS QG A H+H ++ + +G D++ Sbjct: 195 GWDDGRGADNGRSLLSSQGDAFRSHSHNF----------DRSWGLENFDATAGYDVVTA- 243 Query: 121 NTNDAGLPAPDYGTFKTYKQSVDGLGAAASETRPRNIAFNYIVRAA 166 D + + + SETRPRNIAFNYIVRAA Sbjct: 244 ----------DINGKIVNQPTRSTVSVGGSETRPRNIAFNYIVRAA 279 >UniRef50_Q9AHZ5 YdaB n=29 Tax=Enterobacteriaceae RepID=Q9AHZ5_PHOLU Length = 296 Score = 181 bits (460), Expect = 5e-45, Method: Composition-based stats. Identities = 69/165 (41%), Positives = 89/165 (53%), Gaps = 5/165 (3%) Query: 1 VGLGEGSALPVGVPVPWPSATPPTGWLKCNGAAFSAEEYPELAKAYPTNKLPDLRGEFIR 60 + + + +PVG P+PWP+A PP GWL+CNGA F ++PELAKAYP+ LPDLRGEFIR Sbjct: 136 INSSKTNDIPVGTPIPWPTAIPPVGWLQCNGAVFDKSKFPELAKAYPSGYLPDLRGEFIR 195 Query: 61 GWDDGRGIDTGRSILSIQGYATEDHAHGLPSRSTIVTDATINFYFDEIWVNSGTDIIKRG 120 GWD+GRG+D GR + QG A + P + D + + I G Sbjct: 196 GWDNGRGVDPGRVCSTWQGDAIRNITGSFPG---AIADNYHLATKEAFYGKINLGIATDG 252 Query: 121 NTNDAGLPAPDYGTFKTYKQSVDGLGAAASETRPRNIAFNYIVRA 165 T + PD + + TRPRNIAFNYIVRA Sbjct: 253 TTKSKNIHNPDN--PYGFGFDASRVVPVPQRTRPRNIAFNYIVRA 295 >UniRef50_Q6D3Y6 Probable bacteriophage variable tail fiber protein H n=2 Tax=Pectobacterium atrosepticum RepID=Q6D3Y6_ERWCT Length = 536 Score = 179 bits (454), Expect = 3e-44, Method: Composition-based stats. 
Identities = 73/159 (45%), Positives = 86/159 (54%), Gaps = 6/159 (3%) Query: 8 ALPVGVPVPWPSATPPTGWLKCNGAAFSAEEYPELAKAYPTNKLPDLRGEFIRGWDDGRG 67 AL G+P PWP AT P GWLKCNG +F +P LA AYP+ LPDLRGEFIRGWDDGRG Sbjct: 384 ALTAGMPKPWPRATAPAGWLKCNGQSFDISAFPHLAAAYPSGVLPDLRGEFIRGWDDGRG 443 Query: 68 IDTGRSILSIQGYATEDHAHGLPSRSTIVTDATINFYFDEIWVNSGTDIIKRGNTNDAGL 127 +D+GRS+LS Q A + I T A + E +SG + A Sbjct: 444 VDSGRSLLSAQSDAIRNIVG------EIWTSAVSQQFLGETLSSSGVFELLYEFAVGAIP 497 Query: 128 PAPDYGTFKTYKQSVDGLGAAASETRPRNIAFNYIVRAA 166 A + A+E RPRNIAFNYIVRAA Sbjct: 498 DAAGNSCPSRMRFDASRAVPTAAENRPRNIAFNYIVRAA 536 >UniRef50_B7MJL6 Putative phage tail protein n=3 Tax=Escherichia RepID=B7MJL6_ECO45 Length = 247 Score = 176 bits (445), Expect = 3e-43, Method: Composition-based stats. Identities = 95/166 (57%), Positives = 109/166 (65%), Gaps = 15/166 (9%) Query: 1 VGLGEGSALPVGVPVPWPSATPPTGWLKCNGAAFSAEEYPELAKAYPTNKLPDLRGEFIR 60 +GLGEGSALPVGVPVPWPSATPPTGWLKCNGAAFSAEEYPELAKAYPTNKLPDLRGEFIR Sbjct: 97 LGLGEGSALPVGVPVPWPSATPPTGWLKCNGAAFSAEEYPELAKAYPTNKLPDLRGEFIR 156 Query: 61 GWDDGRGIDTGRSILSIQGYATEDHAHGLPSRSTIVTDATINFYFDEIWVNSGTDIIKRG 120 GWDDGRG+D+ R++LS Q L S ++ + F D + + S + I Sbjct: 157 GWDDGRGVDSRRAVLSTQEPTVGTFYVELAIISGTLSGSGAKFT-DSVGIGSTSSNITVS 215 Query: 121 NTNDAGLPAPDYGTFKTYKQSVDGLGAAASETRPRNIAFNYIVRAA 166 N ND + + +TRPRNIAFNYIVRAA Sbjct: 216 NGNDQSVSG--------------TVAVNPVDTRPRNIAFNYIVRAA 247 >UniRef50_D2U1K0 Phage tail protein n=1 Tax=Arsenophonus nasoniae RepID=D2U1K0_9ENTR Length = 366 Score = 176 bits (445), Expect = 3e-43, Method: Composition-based stats. 
Identities = 65/160 (40%), Positives = 83/160 (51%), Gaps = 9/160 (5%) Query: 7 SALPVGVPVPWPSATPPTGWLKCNGAAFSAEEYPELAKAYPTNKLPDLRGEFIRGWDDGR 66 + PVG P+PWP ATPP G+L CNG F + P+L AYP+ KLPDLRG FIRGWD G+ Sbjct: 216 NNYPVGAPIPWPQATPPKGYLICNGEPFDKVKCPKLLIAYPSGKLPDLRGYFIRGWDAGK 275 Query: 67 GIDTGRSILSIQGYATEDHAHGLPSRSTIVTDATINFYFDEIWVNSGTDIIKRGNTNDAG 126 G+D GR + S Q A + + A + + I N A Sbjct: 276 GVDPGREVFSYQEDAIRNITG-------RIGFARRGGAEPPVSADGAFVITDWCNVRVAD 328 Query: 127 LPAPDYGTFKTYKQSVDGLGAAASETRPRNIAFNYIVRAA 166 D+G ++ S + A+E RPRNIAFNYIVR A Sbjct: 329 GANDDWGGVASFDPS--RVVPTANENRPRNIAFNYIVREA 366 >UniRef50_A4WEL3 Phage Tail Collar domain protein n=2 Tax=Enterobacteriaceae RepID=A4WEL3_ENT38 Length = 340 Score = 175 bits (443), Expect = 6e-43, Method: Composition-based stats. Identities = 75/160 (46%), Positives = 89/160 (55%), Gaps = 7/160 (4%) Query: 7 SALPVGVPVPWPSATPPTGWLKCNGAAFSAEEYPELAKAYPTNKLPDLRGEFIRGWDDGR 66 + LPVG P+PWP ATPP GWLKCNGA F +YP+LA AYP+ LPDLRGEFIRGWDDGR Sbjct: 188 NYLPVGFPLPWPQATPPQGWLKCNGAPFDKVKYPKLAVAYPSGLLPDLRGEFIRGWDDGR 247 Query: 67 GIDTGRSILSIQGYATEDHAHGLPSRSTIVTDATINFYFDEIWVNSGTDIIKRGNTNDAG 126 G+D+GR L+ QG A + + A F + SG + Sbjct: 248 GVDSGRVALTTQGDAVQKMTGAASN------GAATGFVNNSTSRVSGVFKRGSVIYPNTS 301 Query: 127 LPAPDYGTFKTYKQSVDGLGAAASETRPRNIAFNYIVRAA 166 DY + +A ETRPRNIAFNYIVRAA Sbjct: 302 AQNADYQGVD-LVFDSSLMVRSAEETRPRNIAFNYIVRAA 340 >UniRef50_D2U2G6 Phage tail assembly protein n=1 Tax=Arsenophonus nasoniae RepID=D2U2G6_9ENTR Length = 580 Score = 175 bits (443), Expect = 6e-43, Method: Composition-based stats. 
Identities = 67/163 (41%), Positives = 89/163 (54%), Gaps = 14/163 (8%) Query: 7 SALPVGVPVPWPSATPPTGWLKCNGAAFSAEEYPELAKAYPTNKLPDLRGEFIRGWDDGR 66 + PVG P+PWP ATPP G+ C+G F +YP+LA AYP+ KLP L GEFIRG D GR Sbjct: 429 NNYPVGAPIPWPQATPPNGYFVCDGNYFDKAKYPQLALAYPSGKLPLLYGEFIRGLDLGR 488 Query: 67 GIDTGRSILSIQGYATEDHAHGLPSRSTIVTDATINFYFDEIWVNSGTDIIKRG---NTN 123 +D GR++LS QG A + + E V +G + +R N N Sbjct: 489 KVDPGRTVLSNQGDAIRNITGRI---------GYARHGGTEPPVVNGEGVFRRDSNHNVN 539 Query: 124 DAGLPAPDYGTFKTYKQSVDGLGAAASETRPRNIAFNYIVRAA 166 A D+G+ ++ S + A+E RPRN+AF YIVRAA Sbjct: 540 IANGRGDDWGSVMSFNAS--RVVPTANENRPRNVAFLYIVRAA 580 >UniRef50_C6DE08 Tail Collar domain protein n=1 Tax=Pectobacterium carotovorum subsp. carotovorum PC1 RepID=C6DE08_PECCP Length = 682 Score = 174 bits (440), Expect = 1e-42, Method: Composition-based stats. Identities = 73/156 (46%), Positives = 89/156 (57%), Gaps = 6/156 (3%) Query: 11 VGVPVPWPSATPPTGWLKCNGAAFSAEEYPELAKAYPTNKLPDLRGEFIRGWDDGRGIDT 70 VG+P+PWP T P+GWLKCNG F YP+LA+ YP LPDLRGEFIRGWDD RG+DT Sbjct: 533 VGMPMPWPQTTAPSGWLKCNGQTFDKNIYPKLAQIYPAGILPDLRGEFIRGWDDSRGVDT 592 Query: 71 GRSILSIQGYATEDHAHGLPSRSTIVTDATINFYFDEIWVNSGTDIIKRGNTNDAGLPAP 130 GR++LS QG A + I T A + E +++G + + T A A Sbjct: 593 GRTLLSTQGDAIRNIVG------EIWTTAANYQFLGENLLSNGAFELFKEFTVGAIPDAA 646 Query: 131 DYGTFKTYKQSVDGLGAAASETRPRNIAFNYIVRAA 166 K + ASE RPRNIAFNYIVRAA Sbjct: 647 GNSCPSRMKFDASRIVPTASENRPRNIAFNYIVRAA 682 >UniRef50_D2BT14 Tail Collar domain protein n=1 Tax=Dickeya dadantii Ech586 RepID=D2BT14_DICD5 Length = 534 Score = 173 bits (438), Expect = 2e-42, Method: Composition-based stats. 
Identities = 68/156 (43%), Positives = 88/156 (56%), Gaps = 10/156 (6%) Query: 11 VGVPVPWPSATPPTGWLKCNGAAFSAEEYPELAKAYPTNKLPDLRGEFIRGWDDGRGIDT 70 VG+P+P+P AT P GWLKCNG +F+ +P LA+ YP+ LPDLRGEFIRGWDD RG+D Sbjct: 389 VGIPLPYPGATAPDGWLKCNGQSFNKAAFPLLAQRYPSGFLPDLRGEFIRGWDDSRGVDP 448 Query: 71 GRSILSIQGYATEDHAHGLPSRSTIVTDATINFYFDEIWVNSGTDIIKRGNTNDAGLPAP 130 GR +LS Q H+HG V D + +++ + G+ + DA Sbjct: 449 GRGLLSFQESQNLTHSHG-------VNDPGHSHPYNKYEGSVGSGLAGFDYDQDAWNATV 501 Query: 131 DYGTFKTYKQSVDGLGAAASETRPRNIAFNYIVRAA 166 G T + + E RPRNIAFNYIVRAA Sbjct: 502 YTGHVGTG---ISIAASGGHEARPRNIAFNYIVRAA 534 >UniRef50_C6CGA0 Tail Collar domain protein n=5 Tax=Enterobacteriaceae RepID=C6CGA0_DICZE Length = 166 Score = 172 bits (437), Expect = 3e-42, Method: Composition-based stats. Identities = 73/159 (45%), Positives = 94/159 (59%), Gaps = 18/159 (11%) Query: 11 VGVPVPWPSATPPTGWLKCNGAAFSAEEYPELAKAYPTNKLPDLRGEFIRGWDDGRGIDT 70 VG+P+PWP ATPP GWLKCNG AF +P+LA+ YP+ LPDLRGEFIRGWDDGRG+D+ Sbjct: 23 VGIPLPWPQATPPAGWLKCNGQAFDKNAFPKLAQVYPSGVLPDLRGEFIRGWDDGRGVDS 82 Query: 71 GRSILSIQGYATEDHAHGLPSRSTIVTDATINFYFDEIWVNS---GTDIIKRGNTNDAGL 127 R++LS QG A + G S + D ++D NS G+ I+ + N + Sbjct: 83 NRNLLSSQGDAIRNIT-GFVSGVYVGFDGYSGAFYDTGSRNSISPGSTIVAQLNDD---- 137 Query: 128 PAPDYGTFKTYKQSVDGLGAAASETRPRNIAFNYIVRAA 166 + + A+E RPRNIAFNYIVRAA Sbjct: 138 ----------FAFDASRVVPTANENRPRNIAFNYIVRAA 166 >UniRef50_B7NJP1 Putative side tail fiber protein homolog from lambdoid prophage n=3 Tax=Escherichia coli RepID=B7NJP1_ECO7I Length = 686 Score = 172 bits (436), Expect = 3e-42, Method: Composition-based stats. 
Identities = 87/167 (52%), Positives = 105/167 (62%), Gaps = 9/167 (5%) Query: 1 VGLGEGSALPVGVPVPWPSATPPTGWLKCNGAAFSAEEYPELAKAYPTNKLPDLRGEFIR 60 +GLGEGSALPVGVPVPWP+ATPP GWLKC+G AF+ E+YP LA+AYPT +LPDLRGEFIR Sbjct: 528 LGLGEGSALPVGVPVPWPTATPPEGWLKCDGRAFTKEQYPVLARAYPTLRLPDLRGEFIR 587 Query: 61 GWDDGRGIDTGRSILSIQ-GYATEDHAHGLPSRSTIVTDATINFYFDEIWVNSGTDIIKR 119 GWDDGR ID GR +LS Q G H + I++ + ++ G D + Sbjct: 588 GWDDGRKIDEGRKLLSWQKGTLVGGHDDNDSALD-------ISYMSNGNNIDYGGDKVFA 640 Query: 120 GNTNDAGLPAPDYGTFKTYKQSVDGLGAAASETRPRNIAFNYIVRAA 166 GN L G + + GA + TRPRNIAFNYIVRAA Sbjct: 641 GNYRSDYLWYAVLGG-TNSRAKAELNGAFFNITRPRNIAFNYIVRAA 686 >UniRef50_D0KLI8 Tail Collar domain protein n=1 Tax=Pectobacterium wasabiae WPP163 RepID=D0KLI8_PECWW Length = 621 Score = 171 bits (434), Expect = 6e-42, Method: Composition-based stats. Identities = 65/160 (40%), Positives = 82/160 (51%), Gaps = 4/160 (2%) Query: 7 SALPVGVPVPWPSATPPTGWLKCNGAAFSAEEYPELAKAYPTNKLPDLRGEFIRGWDDGR 66 SA VG+P +P A P GWLKCNG F +YP LA YP+ LPDLRGEF+RGWDD R Sbjct: 466 SAELVGMPQVFPGAVAPAGWLKCNGQQFDTAQYPILASRYPSGFLPDLRGEFVRGWDDER 525 Query: 67 GIDTGRSILSIQGYATEDHAHGLPSRSTIVTDATINFYFDEIWVNSGTDIIKRGNTNDAG 126 G+D GR++LS QG A + + + V + K + +G Sbjct: 526 GVDAGRALLSEQGDAIRNITGTMRASDVPYGHTQFVDALKADGVFAPIAGDKSWTGDSSG 585 Query: 127 LPAPDYGTFKTYKQSVDGLGAAASETRPRNIAFNYIVRAA 166 +G + A+E RPRNIAFNYIVRAA Sbjct: 586 NAGNPWGV----SFDTSRVVPTANENRPRNIAFNYIVRAA 621 >UniRef50_B2PZV1 Putative uncharacterized protein n=2 Tax=Gammaproteobacteria RepID=B2PZV1_PROST Length = 526 Score = 171 bits (433), Expect = 8e-42, Method: Composition-based stats. 
Identities = 76/165 (46%), Positives = 93/165 (56%), Gaps = 7/165 (4%) Query: 2 GLGEGSALPVGVPVPWPSATPPTGWLKCNGAAFSAEEYPELAKAYPTNKLPDLRGEFIRG 61 G E S PVG P+PWP AT P+G+L CNG AF+ YP L KAYP+ KLPDLRGEFIRG Sbjct: 369 GSSELSDCPVGAPIPWPQATAPSGYLICNGQAFNKTTYPLLTKAYPSGKLPDLRGEFIRG 428 Query: 62 WDDGRGIDTGRSILSIQGYATEDHAHGLPSRSTIVTDATINFYFDEIWVNSGTDIIKRGN 121 D GR ID GR +LS Q ATE H H + +A+ V +G + Sbjct: 429 LDAGRNIDNGRVVLSFQRCATEHHKH-----ISGWGEASNANAIFGKTVKNGYVGSASTD 483 Query: 122 TNDAGLPAPDYGTFKTYKQSVDGLGAAASETRPRNIAFNYIVRAA 166 ++ D F+ + G+ A+ETRPRNIAF YIVRAA Sbjct: 484 RDNYLFYTNDGSEFQGSNPNSTGI--MANETRPRNIAFLYIVRAA 526 >UniRef50_UPI000190EC42 bacteriophage tail fiber protein n=1 Tax=Salmonella enterica subsp. enterica serovar Typhi str. E01-6750 RepID=UPI000190EC42 Length = 317 Score = 171 bits (433), Expect = 9e-42, Method: Composition-based stats. Identities = 90/166 (54%), Positives = 100/166 (60%), Gaps = 11/166 (6%) Query: 1 VGLGEGSALPVGVPVPWPSATPPTGWLKCNGAAFSAEEYPELAKAYPTNKLPDLRGEFIR 60 +GLGEGSALPVGVPVPWPSAT P GWLKCNGAAFS+E YP+LAKAYPTNKLPDLRGEFIR Sbjct: 163 LGLGEGSALPVGVPVPWPSATLPEGWLKCNGAAFSSEMYPKLAKAYPTNKLPDLRGEFIR 222 Query: 61 GWDDGRGIDTGRSILSIQGYATEDHAHGLPSRSTIVTDATINFYFDEIWVNSGTDIIKRG 120 GWDDGRGID GR ILS Q + +T + D + N I + Sbjct: 223 GWDDGRGIDAGREILSFQEGTIVSGFDDNDTGDISSLSSTQYGFGDTLSSNQWGAINGKK 282 Query: 121 NTNDAGLPAPDYGTFKTYKQSVDGLGAAASETRPRNIAFNYIVRAA 166 DA + Y + RPRNIAFNYIVRAA Sbjct: 283 WIFDASSKGAQKYDWWAYVSA-----------RPRNIAFNYIVRAA 317 >UniRef50_B7US81 Predicted tail fiber protein n=16 Tax=root RepID=B7US81_ECO27 Length = 521 Score = 171 bits (432), Expect = 1e-41, Method: Composition-based stats. 
Identities = 90/166 (54%), Positives = 101/166 (60%), Gaps = 22/166 (13%) Query: 1 VGLGEGSALPVGVPVPWPSATPPTGWLKCNGAAFSAEEYPELAKAYPTNKLPDLRGEFIR 60 +GLGEGSALPVGVPVPW SATPPTGWLKCNGAAFS+E YP LA+AYPTNKLPDLRGEFIR Sbjct: 378 LGLGEGSALPVGVPVPWSSATPPTGWLKCNGAAFSSEMYPRLARAYPTNKLPDLRGEFIR 437 Query: 61 GWDDGRGIDTGRSILSIQGYATEDHAHGLPSRSTIVTDATINFYFDEIWVNSGTDIIKRG 120 GWDDGRGID GR++LS Q + H G I + +IN Y + G Sbjct: 438 GWDDGRGIDAGRTLLSGQDGTSFSHYGG---NFDIGSGHSINNYDQIVSNQPGFSRF--- 491 Query: 121 NTNDAGLPAPDYGTFKTYKQSVDGLGAAASETRPRNIAFNYIVRAA 166 ++ G G RPRNI FNYIVRAA Sbjct: 492 ----------------SFAGPSRGDGVNYVTIRPRNITFNYIVRAA 521 >UniRef50_UPI0001B5347E putative variable tail fiber protein n=1 Tax=Shigella sp. D9 RepID=UPI0001B5347E Length = 550 Score = 170 bits (430), Expect = 2e-41, Method: Composition-based stats. Identities = 96/171 (56%), Positives = 110/171 (64%), Gaps = 7/171 (4%) Query: 1 VGLGEGSALPVGVPVPWPSATPPTGWLKCNGAAFSAEEYPELAKAYPTNKLPDLRGEFIR 60 +GLG+GSALPVGVPVPWPSATPPTGWLKCNGAAFSAEEYP+LAK YPTNKLPDLRGEFIR Sbjct: 382 LGLGDGSALPVGVPVPWPSATPPTGWLKCNGAAFSAEEYPKLAKVYPTNKLPDLRGEFIR 441 Query: 61 GWDDGRGIDTGRSILSIQGY-----ATEDHAHGLPSRSTIVTDATINFYFDEIWVNSGTD 115 GWDD RGIDTGRS+LS Q A +D+ +T V D + Sbjct: 442 GWDDSRGIDTGRSLLSGQAATFIRTALQDYYGY--DLNTNVKVGIAFATADSVITVGNPA 499 Query: 116 IIKRGNTNDAGLPAPDYGTFKTYKQSVDGLGAAASETRPRNIAFNYIVRAA 166 K GN +D + D T + + D A RPRN++FNYIVRAA Sbjct: 500 NPKAGNNSDYVPASADNSITGTQRTAEDNFTGAWISMRPRNLSFNYIVRAA 550 >UniRef50_D2BYH6 Tail Collar domain protein n=1 Tax=Dickeya dadantii Ech586 RepID=D2BYH6_DICD5 Length = 198 Score = 169 bits (428), Expect = 3e-41, Method: Composition-based stats. 
Identities = 68/155 (43%), Positives = 84/155 (54%), Gaps = 16/155 (10%) Query: 11 VGVPVPWPSATPPTGWLKCNGAAFSAEEYPELAKAYPTNKLPDLRGEFIRGWDDGRGIDT 70 VG+P WP A P GWLKCNG AF +YP+LAK YP LPDLRGEFIRGWDDGRG+DT Sbjct: 59 VGIPQAWPLADAPEGWLKCNGQAFDKTKYPQLAKLYPAGTLPDLRGEFIRGWDDGRGVDT 118 Query: 71 GRSILSIQGYATEDHAHGLPSRSTIVTDATINFYFDEIWVNSGTDIIKRGNTNDAGLPAP 130 R ILS Q E H H +P + + Y ++ + D + G + L + Sbjct: 119 NRQILSAQSGMLESHNHMMPVSDPSKWNGAVYGYANDQPSANIEDFSQSGVSTSRELTSL 178 Query: 131 DYGTFKTYKQSVDGLGAAASETRPRNIAFNYIVRA 165 +ETRPRNIAF+YIV+A Sbjct: 179 T----------------GGNETRPRNIAFSYIVKA 197 >UniRef50_C6CP84 Tail Collar domain protein n=2 Tax=Dickeya RepID=C6CP84_DICZE Length = 646 Score = 169 bits (428), Expect = 3e-41, Method: Composition-based stats. Identities = 68/156 (43%), Positives = 86/156 (55%), Gaps = 11/156 (7%) Query: 11 VGVPVPWPSATPPTGWLKCNGAAFSAEEYPELAKAYPTNKLPDLRGEFIRGWDDGRGIDT 70 G+P+PWP AT PTGWLKCNG +F + YP LA+ YP+ LPDLRGEFIRGWDDGRG+D Sbjct: 502 AGIPLPWPQATAPTGWLKCNGQSFDKKLYPRLAQVYPSGVLPDLRGEFIRGWDDGRGVDN 561 Query: 71 GRSILSIQGYATEDHAHGLPSRSTIVTDATINFYFDEIWVNSGTDIIKRGNTNDAGLPAP 130 R +LS QG + S ++ D + + + I N+N G Sbjct: 562 NRGLLSSQGDTIRNIV-----ASFVMDDQAVTINAPTGAMFPSSQIAYDANSNVGGTMGF 616 Query: 131 DYGTFKTYKQSVDGLGAAASETRPRNIAFNYIVRAA 166 + + A+E RPRNIAFNYIVRAA Sbjct: 617 N------VVFDASRVVPTANENRPRNIAFNYIVRAA 646 >UniRef50_Q7N348 Similarities with tail fiber protein n=1 Tax=Photorhabdus luminescens subsp. laumondii RepID=Q7N348_PHOLL Length = 440 Score = 168 bits (426), Expect = 5e-41, Method: Composition-based stats. 
Identities = 64/157 (40%), Positives = 82/157 (52%), Gaps = 13/157 (8%) Query: 9 LPVGVPVPWPSATPPTGWLKCNGAAFSAEEYPELAKAYPTNKLPDLRGEFIRGWDDGRGI 68 +P GVP+P+P P G+L CNG F YP+LA+AYP ++PDLRGEFIRGWDD RG+ Sbjct: 296 VPAGVPMPYPHRYTPPGYLTCNGQTFDKSLYPKLAEAYPAGRVPDLRGEFIRGWDDSRGV 355 Query: 69 DTGRSILSIQGYATEDHAHGLPSRSTIVTDATINFYFDEIWVNSGTDIIKRGNTNDAGLP 128 D GR + Q DH H + +V D + + + + + N Sbjct: 356 DPGRVCGTWQADCIPDHNHYKVASKQLVEDLVLTGDAGWYTSSGSSTRTRSLDQN----- 410 Query: 129 APDYGTFKTYKQSVDGLGAAASETRPRNIAFNYIVRA 165 TY V A+ETRPRNIAFNYIVRA Sbjct: 411 --------TYTGGVTEAQVIANETRPRNIAFNYIVRA 439 >UniRef50_Q7N2Q1 Similar to Sc/SvQ protein of Escherichia coli plasmid p15B n=1 Tax=Photorhabdus luminescens subsp. laumondii RepID=Q7N2Q1_PHOLL Length = 478 Score = 167 bits (423), Expect = 1e-40, Method: Composition-based stats. Identities = 68/161 (42%), Positives = 86/161 (53%), Gaps = 20/161 (12%) Query: 9 LPVGVPVPWPSATPPTGWLKCNGAAFSAEEYPELAKAYPTNKLPDLRGEFIRGWDDGRGI 68 + VG P+PWP P G+L CNG +F+ YP+LA AYP+ LPDLRGEFIRGWDDGRG+ Sbjct: 335 ISVGSPIPWPLPNVPAGYLACNGQSFNKSLYPQLAIAYPSGVLPDLRGEFIRGWDDGRGV 394 Query: 69 DTGRSILSIQGYATEDHAHGLPS---RSTIVTDATINFYFDEIWVNSGTDIIKRGNTNDA 125 D GR +L+ QG A + P R + ++ N TD+ ++ DA Sbjct: 395 DRGRGVLTHQGDAIRNITGYTPGTILRGNNSYGGCFSLSGEKAPGNEYTDVWQKQVLFDA 454 Query: 126 GLPAPDYGTFKTYKQSVDGLGAAASETRPRNIAFNYIVRAA 166 + ASE RPRNIAFNYIVRAA Sbjct: 455 -----------------SRVVPVASENRPRNIAFNYIVRAA 478 >UniRef50_B3I8J5 DNA inversion product n=3 Tax=Enterobacteriaceae RepID=B3I8J5_ECOLX Length = 263 Score = 167 bits (423), Expect = 1e-40, Method: Composition-based stats. 
Identities = 96/169 (56%), Positives = 107/169 (63%), Gaps = 3/169 (1%) Query: 1 VGLGEGSALPVGVPVPWPSATPPTGWLKCNGAAFSAEEYPELAKAYPTNKLPDLRGEFIR 60 +GLGEGSALPVG PVPWPS TPPTGWLKCNGAAFSAEEYPELAKAYPTNKLPDLRGEFIR Sbjct: 95 LGLGEGSALPVGAPVPWPSETPPTGWLKCNGAAFSAEEYPELAKAYPTNKLPDLRGEFIR 154 Query: 61 GWDDGRGIDTGRSILSIQGYA---TEDHAHGLPSRSTIVTDATINFYFDEIWVNSGTDII 117 GWDD RGIDTGRS+LS Q T + +T V D + Sbjct: 155 GWDDSRGIDTGRSLLSGQAATFIRTALQDYYGYDLNTNVKVGIAFATADSVITVGNPANP 214 Query: 118 KRGNTNDAGLPAPDYGTFKTYKQSVDGLGAAASETRPRNIAFNYIVRAA 166 K GN +D + D T + + D A RPRN++FNYIVRAA Sbjct: 215 KAGNNSDYVPASADNSITGTQRTAEDNFTGAWISMRPRNLSFNYIVRAA 263 >UniRef50_C7BL21 Similarities with phage tail fiber protein n=1 Tax=Photorhabdus asymbiotica subsp. asymbiotica ATCC 43949 RepID=C7BL21_PHOAA Length = 452 Score = 167 bits (423), Expect = 1e-40, Method: Composition-based stats. Identities = 60/157 (38%), Positives = 80/157 (50%), Gaps = 8/157 (5%) Query: 10 PVGVPVPWPSATPPTGWLKCNGAAFSAEEYPELAKAYPTNKLPDLRGEFIRGWDDGRGID 69 PVG P+P+P P G+L CNG F YP+LA+AYP+ ++PDLRGEFIRGWDD RG+D Sbjct: 304 PVGAPIPYPHRYTPVGYLTCNGQTFDKSLYPKLAEAYPSGRVPDLRGEFIRGWDDSRGVD 363 Query: 70 TGRSILSIQGYATEDHAHGLPSRSTIVTDATINFYFDEIWVNSGTDIIKRGNTNDAGLPA 129 GR S Q + H H + + + K+ T G+ Sbjct: 364 PGRVCGSWQDSDNKAHIHD--------DEFCYGGGDAGGDSGTMSAFAKKYCTPKDGVNG 415 Query: 130 PDYGTFKTYKQSVDGLGAAASETRPRNIAFNYIVRAA 166 + + L + +E RPRN+AFNYIVRAA Sbjct: 416 RPTSGWLPASAGLHSLPSGGNEARPRNVAFNYIVRAA 452 >UniRef50_Q7N5C0 Similarities with DNA inversion product n=5 Tax=Photorhabdus RepID=Q7N5C0_PHOLL Length = 239 Score = 166 bits (421), Expect = 2e-40, Method: Composition-based stats. 
Identities = 68/161 (42%), Positives = 90/161 (55%), Gaps = 23/161 (14%) Query: 7 SALPVGVPVPWPSATPPTGWLKCNGAAFSAEEYPELAKAYPTNKLPDLRGEFIRGWDDGR 66 S++PVG P+PWP + PP+G+ CNG+AFS +YP+LA+AYP ++PDLRGEFIRGWDDGR Sbjct: 99 SSIPVGSPIPWPLSHPPSGYFTCNGSAFSRSQYPKLAEAYPDGRIPDLRGEFIRGWDDGR 158 Query: 67 GIDTGRSILSIQGYATEDH--AHGLPSRSTIVTDATINFYFDEIWVNSGTDIIKRGNTND 124 G+D+GR ILS Q T+ GLP + ++ D G D+++ Sbjct: 159 GVDSGRVILSAQTDNTKRIQLTKGLPDGQFL---SSYQGPVDRYQFPLGRDVLESATVTS 215 Query: 125 AGLPAPDYGTFKTYKQSVDGLGAAASETRPRNIAFNYIVRA 165 ETRPRNIAFNYIV+A Sbjct: 216 IAN------------------NTGGHETRPRNIAFNYIVKA 238 >UniRef50_C6DA10 Tail Collar domain protein n=7 Tax=Enterobacteriaceae RepID=C6DA10_PECCP Length = 689 Score = 166 bits (419), Expect = 3e-40, Method: Composition-based stats. Identities = 67/166 (40%), Positives = 83/166 (50%), Gaps = 13/166 (7%) Query: 1 VGLGEGSALPVGVPVPWPSATPPTGWLKCNGAAFSAEEYPELAKAYPTNKLPDLRGEFIR 60 +G S L G+P+P+P A PTGWLKCNG +F +YP LA YP+ LPDLRGEF+R Sbjct: 537 IGAMPASEL-AGIPLPFPGAVAPTGWLKCNGQSFDKSQYPILASRYPSGVLPDLRGEFVR 595 Query: 61 GWDDGRGIDTGRSILSIQGYATEDHAHGLPSRSTIVTDATINFYFDEIWVNSGTDIIKRG 120 GWDDGRG D R++LS QG A + TI D + + Sbjct: 596 GWDDGRGADASRALLSAQGDAIRNIV------------GTIGQLNDRVNTTETAGVFDAN 643 Query: 121 NTNDAGLPAPDYGTFKTYKQSVDGLGAAASETRPRNIAFNYIVRAA 166 A + + A+E RPRNIAFNYIVRAA Sbjct: 644 KYTGAHSGLTGGNGGRIATFDASKVVPTAAENRPRNIAFNYIVRAA 689 >UniRef50_C4UEH4 Variable tail fiber protein n=3 Tax=Yersinia RepID=C4UEH4_YERAL Length = 387 Score = 164 bits (416), Expect = 8e-40, Method: Composition-based stats. 
Identities = 69/156 (44%), Positives = 91/156 (58%), Gaps = 7/156 (4%) Query: 10 PVGVPVPWPSATPPTGWLKCNGAAFSAEEYPELAKAYPTNKLPDLRGEFIRGWDDGRGID 69 P+G+P+P+P TPP G+LKCNGAAF YP LA YPT+KLPDLRGEFIRG+DDGRGID Sbjct: 238 PIGIPLPYPGTTPPAGYLKCNGAAFYPYRYPTLATLYPTHKLPDLRGEFIRGFDDGRGID 297 Query: 70 TGRSILSIQGYATEDHAHGLPSRSTIVTDATINFYFDEIWVNSGTDIIKRGNTNDAGLPA 129 T R++LS Q A ++ G+ S + A + + +G ND Sbjct: 298 TSRTLLSAQTDALQNITGGINGVSESLGIAAESNF-------TGAFAKAESVGNDNTPHH 350 Query: 130 PDYGTFKTYKQSVDGLGAAASETRPRNIAFNYIVRA 165 D ++ + A+ETRPRNI+F YI+RA Sbjct: 351 TDITHCGSFDFDASRVVRTAAETRPRNISFCYILRA 386 >UniRef50_Q7N2R3 Similar to tail fiber assembly protein from bacteriophage n=1 Tax=Photorhabdus luminescens subsp. laumondii RepID=Q7N2R3_PHOLL Length = 233 Score = 164 bits (414), Expect = 1e-39, Method: Composition-based stats. Identities = 72/158 (45%), Positives = 86/158 (54%), Gaps = 18/158 (11%) Query: 10 PVGVPVPWPSATPPTGWLKCNGAAFSAEEYPELAKAYPTNKLPDLRGEFIRGWDDGRGID 69 PVGVP+P+PS P G+L CNG AF YP+LA AYP+ LPDLRGEFIRGWDD RG+D Sbjct: 93 PVGVPLPYPSRYTPAGYLTCNGQAFDKSRYPQLAIAYPSGILPDLRGEFIRGWDDSRGVD 152 Query: 70 TGRSILSIQGYATEDHAHGLPSRSTIVTDATINFYFDEIWVNSGTDIIKRGNTNDAGLPA 129 GR +LS Q +DH H + I D++ GN + Sbjct: 153 MGRGMLSWQPAGIQDHMH-----------------YKVISKQVVEDLVLAGNQSWGTEKN 195 Query: 130 PDYGTFKTYKQSVDGL-GAAASETRPRNIAFNYIVRAA 166 Y S G+ G +ETRPRNIAFNYIVRAA Sbjct: 196 STYTRSLDQNISTGGVIGTTVNETRPRNIAFNYIVRAA 233 >UniRef50_C7BSQ1 Side tail fiber protein homolog from lambdoid prophage e14 n=3 Tax=Photorhabdus RepID=C7BSQ1_PHOAA Length = 166 Score = 163 bits (412), Expect = 2e-39, Method: Composition-based stats. 
Identities = 69/158 (43%), Positives = 86/158 (54%) Query: 9 LPVGVPVPWPSATPPTGWLKCNGAAFSAEEYPELAKAYPTNKLPDLRGEFIRGWDDGRGI 68 +PVG+P+PWP+ PP GW+KCNGA F YP+LA AYP+ LPDLRGEFIRGWDDGRG+ Sbjct: 9 IPVGIPLPWPTDIPPYGWVKCNGAIFDKYLYPKLAVAYPSGNLPDLRGEFIRGWDDGRGV 68 Query: 69 DTGRSILSIQGYATEDHAHGLPSRSTIVTDATINFYFDEIWVNSGTDIIKRGNTNDAGLP 128 D GR +LS Q H+H + + +NS + G Sbjct: 69 DIGRYVLSTQLADIAPHSHRIGRMWSNSNAGAEGLGTPSRILNSVYQGVNYGIDTRGLGI 128 Query: 129 APDYGTFKTYKQSVDGLGAAASETRPRNIAFNYIVRAA 166 A G+ + ETRPRN+AFNYIVRAA Sbjct: 129 AIGMGSGGFGYMDNAVAASTGIETRPRNVAFNYIVRAA 166 >UniRef50_C5BH14 Tail fiber n=2 Tax=Enterobacteriaceae RepID=C5BH14_EDWI9 Length = 593 Score = 162 bits (409), Expect = 4e-39, Method: Composition-based stats. Identities = 65/158 (41%), Positives = 82/158 (51%), Gaps = 14/158 (8%) Query: 8 ALPVGVPVPWPSATPPTGWLKCNGAAFSAEEYPELAKAYPTNKLPDLRGEFIRGWDDGRG 67 A PVG P PWP+ + P+GW+KC G +FS YPELAKAYP +LPDLRGEFIRG+DD G Sbjct: 449 APPVGTPQPWPNTSIPSGWIKCAGQSFSTSSYPELAKAYPNGRLPDLRGEFIRGYDDYGG 508 Query: 68 IDTGRSILSIQGYATEDHAHGLPSRSTIVTDATINFYFDEIWVNSGTDIIKRGNTNDAGL 127 D+ R ILS QG A + F D+ + T + + Sbjct: 509 TDSQRQILSWQGDAMRNITG--------------TFGVDDQTIEQVTGVFREYGRFSYDA 554 Query: 128 PAPDYGTFKTYKQSVDGLGAAASETRPRNIAFNYIVRA 165 + G + + A+E RPRNIAF YIVRA Sbjct: 555 RSERNGAGRIIYFDASQVVPTANENRPRNIAFLYIVRA 592 >UniRef50_Q6D2U8 Bacteriophage tail fiber protein n=1 Tax=Pectobacterium atrosepticum RepID=Q6D2U8_ERWCT Length = 619 Score = 161 bits (408), Expect = 7e-39, Method: Composition-based stats. 
Identities = 63/166 (37%), Positives = 85/166 (51%), Gaps = 1/166 (0%) Query: 1 VGLGEGSALPVGVPVPWPSATPPTGWLKCNGAAFSAEEYPELAKAYPTNKLPDLRGEFIR 60 +G S L G+P+P+P A P G+LKCNG F ++P LA YP+ LPDLRGEF+R Sbjct: 455 IGALPTSEL-AGIPLPFPGAVAPAGYLKCNGQQFDTAQFPVLASRYPSGFLPDLRGEFVR 513 Query: 61 GWDDGRGIDTGRSILSIQGYATEDHAHGLPSRSTIVTDATINFYFDEIWVNSGTDIIKRG 120 GWDDGRGIDT R+++S QG A + L + ++ Sbjct: 514 GWDDGRGIDTVRALMSAQGDAIRNIVGSLFYGYDADVPVLNTNSSSGALYYEMSTALRDT 573 Query: 121 NTNDAGLPAPDYGTFKTYKQSVDGLGAAASETRPRNIAFNYIVRAA 166 + + + + K + A+E RPRNIAFNYIVRAA Sbjct: 574 ESLLSLVTDSVANNWYPAKLDASRVVPTATENRPRNIAFNYIVRAA 619 >UniRef50_C6CGA4 Tail Collar domain protein n=7 Tax=Enterobacteriaceae RepID=C6CGA4_DICZE Length = 401 Score = 160 bits (404), Expect = 2e-38, Method: Composition-based stats. Identities = 72/157 (45%), Positives = 86/157 (54%), Gaps = 4/157 (2%) Query: 11 VGVPVPWPSATPPTGWLKCNGAAFSAEEYPELAKAYPTNKLPDLRGEFIRGWDDGRGIDT 70 VG+P+PWP ATPP GWLKCNG AF +P+LA+AYP LPDLRGEFIRGWDDGRG+D Sbjct: 248 VGIPLPWPQATPPAGWLKCNGQAFDKNAFPKLAQAYPGGVLPDLRGEFIRGWDDGRGVDV 307 Query: 71 GRSILSIQGYATEDHAHGLPSRSTIVTDATINFYFDEIWVNSGTDIIKRGNTNDAGLPAP 130 R +LS Q P+ S + A I+ D ++ A L A Sbjct: 308 ARELLSWQKGTL---TISDPNLSAVNVGALIHANNDSANTYKSMGFDIVNKSDYAMLRAA 364 Query: 131 -DYGTFKTYKQSVDGLGAAASETRPRNIAFNYIVRAA 166 + T +G TRPRNIAFNYIVRAA Sbjct: 365 INVETVGAQDLDSNGWQFGYGATRPRNIAFNYIVRAA 401 >UniRef50_Q7N6T1 Similarities with DNA inversion product n=2 Tax=Photorhabdus luminescens subsp. laumondii RepID=Q7N6T1_PHOLL Length = 300 Score = 157 bits (398), Expect = 8e-38, Method: Composition-based stats. 
Identities = 70/158 (44%), Positives = 92/158 (58%), Gaps = 9/158 (5%) Query: 9 LPVGVPVPWPSATPPTGWLKCNGAAFSAEEYPELAKAYPTNKLPDLRGEFIRGWDDGRGI 68 +PVG P+PWP PP G+L CNG+AF+ +YP+LA+AYP +LPDLRGEFIRGWDDGRG+ Sbjct: 152 IPVGSPIPWPLPYPPVGYLTCNGSAFNKLQYPKLAEAYPDGRLPDLRGEFIRGWDDGRGV 211 Query: 69 DTGRSILSIQGYATEDHAHGLPSRSTIVTDATINFYFDEIWVNSGTDIIKRGNTNDAGLP 128 D GR++LS QG A + L + + I +S + + G+ Sbjct: 212 DMGRTMLSWQGDAMQRMTGFLEAGNGIG--------LMTRPHDSTSGVFLEGDLRTIS-H 262 Query: 129 APDYGTFKTYKQSVDGLGAAASETRPRNIAFNYIVRAA 166 GT + A+ETRPRNIAFNY+VRAA Sbjct: 263 VTQNGTSYAVSFDSSRVARTANETRPRNIAFNYVVRAA 300 >UniRef50_C6C5D2 Tail Collar domain protein n=2 Tax=Dickeya dadantii RepID=C6C5D2_DICDC Length = 498 Score = 157 bits (398), Expect = 1e-37, Method: Composition-based stats. Identities = 69/168 (41%), Positives = 84/168 (50%), Gaps = 16/168 (9%) Query: 11 VGVPVPWPSATPPTGWLKCNGAAFSAEEYPELAKAYPTNKLPDLRGEFIRGWDDGRGIDT 70 VG+P+PWP AT PTGWLKCNG +F YP+LA YP+ LPDLRGEFIRGWDDGRG+D Sbjct: 335 VGIPLPWPQATAPTGWLKCNGQSFDKALYPKLATVYPSGVLPDLRGEFIRGWDDGRGVDA 394 Query: 71 GRSILSIQGYATEDHAHGLPSRSTIVTDATINFYFDEIWVNSGTDIIKRGNTNDAGLPAP 130 GR+IL+ Q +D + + K + AP Sbjct: 395 GRAILTAQNPTYLR----TGMMDYNGSDVDNIGVYIGMGYAEADTAAKSISAPAGAFRAP 450 Query: 131 DYGTFKTYKQSVDGLGAAAS------------ETRPRNIAFNYIVRAA 166 + +G+ AS TRPRNIAFNYIVRAA Sbjct: 451 NNIDLTEQASRDNGVNGTASNTVYASEGSVWVSTRPRNIAFNYIVRAA 498 >UniRef50_D1TPQ4 Phage tail collar domain protein n=1 Tax=Yersinia pestis KIM D27 RepID=D1TPQ4_YERPE Length = 262 Score = 156 bits (393), Expect = 3e-37, Method: Composition-based stats. 
Identities = 66/131 (50%), Positives = 84/131 (64%) Query: 1 VGLGEGSALPVGVPVPWPSATPPTGWLKCNGAAFSAEEYPELAKAYPTNKLPDLRGEFIR 60 +GLGEGSA+PVGVP+PWP+ATPP GWLKCNGA F +YP+LA AYP+ LPDLRGEFIR Sbjct: 96 LGLGEGSAIPVGVPLPWPTATPPEGWLKCNGAIFDKVKYPKLALAYPSGILPDLRGEFIR 155 Query: 61 GWDDGRGIDTGRSILSIQGYATEDHAHGLPSRSTIVTDATINFYFDEIWVNSGTDIIKRG 120 GWDDG G+D GR ILSIQG A + + G+ R+ + + ++ G Sbjct: 156 GWDDGLGVDAGREILSIQGDAIRNISGGIQGRNEATSARLFSSNATGVFRTDGQFGSYAA 215 Query: 121 NTNDAGLPAPD 131 + + A D Sbjct: 216 SADVAVGVTDD 226 >UniRef50_C6CP88 Tail Collar domain protein n=5 Tax=Enterobacteriaceae RepID=C6CP88_DICZE Length = 485 Score = 154 bits (390), Expect = 8e-37, Method: Composition-based stats. Identities = 66/156 (42%), Positives = 86/156 (55%), Gaps = 14/156 (8%) Query: 11 VGVPVPWPSATPPTGWLKCNGAAFSAEEYPELAKAYPTNKLPDLRGEFIRGWDDGRGIDT 70 G+P+PWP A PTGWLKCNG AF YP LA+ YP+ LPDLRGEFIRGWDDGRG+D+ Sbjct: 344 AGIPLPWPQAAVPTGWLKCNGQAFDKNRYPRLAQVYPSGVLPDLRGEFIRGWDDGRGVDS 403 Query: 71 GRSILSIQGYATEDHAHGLPSRSTIVTDATINFYFDEIWVNSGTDIIKRGNTNDAGLPAP 130 GR +LS Q + ++ + ++ D + ++ + ++ Sbjct: 404 GREVLSQQRGSLINYDGPDSAPTS-----------DSLRLSVSAAQADAVSASEYAGVML 452 Query: 131 DYGTFKTYKQSVDGLGAAASETRPRNIAFNYIVRAA 166 Y + S G A TRPRNIAFNYIVRAA Sbjct: 453 SYTAYNITTVSAAGYVGA---TRPRNIAFNYIVRAA 485 >UniRef50_C4S5W0 Putative uncharacterized protein n=2 Tax=Yersinia bercovieri ATCC 43970 RepID=C4S5W0_YERBE Length = 388 Score = 153 bits (387), Expect = 2e-36, Method: Composition-based stats. 
Identities = 64/158 (40%), Positives = 84/158 (53%), Gaps = 16/158 (10%) Query: 11 VGVPVPWPSATPPTGWLKCNGAAFSAEEYPELAKAYPTNKLPDLRGEFIRGWDDGRGIDT 70 +G+P+P+P + P G+LKCNGAAFS YP+LA YP+ LPD+RG IRGWDDGRG+D Sbjct: 243 IGIPIPYPLPSVPVGYLKCNGAAFSTVTYPKLALKYPSGVLPDMRGNAIRGWDDGRGVDA 302 Query: 71 GRSILSIQGYATEDHAHGL---PSRSTIVTDATINFYFDEIWVNSGTDIIKRGNTNDAGL 127 GR++LS Q A ++ S+ T F E++ G + GN Sbjct: 303 GRALLSQQLDALQNITGNFYMGGSKQVAGVVTTGAFGPMEVYNALGNQVTTAGNIGGITF 362 Query: 128 PAPDYGTFKTYKQSVDGLGAAASETRPRNIAFNYIVRA 165 + A+ETR RNIAFNYIVRA Sbjct: 363 -------------DASRVSRTAAETRMRNIAFNYIVRA 387 >UniRef50_C6C5D4 Tail Collar domain protein n=1 Tax=Dickeya dadantii Ech703 RepID=C6C5D4_DICDC Length = 557 Score = 151 bits (382), Expect = 6e-36, Method: Composition-based stats. Identities = 67/156 (42%), Positives = 83/156 (53%), Gaps = 22/156 (14%) Query: 11 VGVPVPWPSATPPTGWLKCNGAAFSAEEYPELAKAYPTNKLPDLRGEFIRGWDDGRGIDT 70 G+P+PWP AT PTGWLKCNG +F YP+L AYP+ LPDLRGEFIRGWDDGRG+D+ Sbjct: 424 AGIPLPWPQATAPTGWLKCNGQSFDKTLYPKLTAAYPSGTLPDLRGEFIRGWDDGRGVDS 483 Query: 71 GRSILSIQGYATEDHAHGLPSRSTIVTDATINFYFDEIWVNSGTDIIKRGNTNDAGLPAP 130 GR++LS+Q + I N+ I+ N D+ Sbjct: 484 GRAVLSVQD---------------------ATWIQPNIESNTAATTIRIDNV-DSTFNTD 521 Query: 131 DYGTFKTYKQSVDGLGAAASETRPRNIAFNYIVRAA 166 +Y A S RPRN+AFNYIVRAA Sbjct: 522 EYSAVSNLPSYEHNGSRARSYVRPRNVAFNYIVRAA 557 >UniRef50_C6CG98 Tail Collar domain protein n=1 Tax=Dickeya zeae Ech1591 RepID=C6CG98_DICZE Length = 196 Score = 150 bits (379), Expect = 2e-35, Method: Composition-based stats. 
Identities = 68/161 (42%), Positives = 82/161 (50%), Gaps = 30/161 (18%) Query: 11 VGVPVPWPSATPPTGWLKCNGAAFSAEEYPELAKAYPTNKLPDLRGEFIRGWDDGRGIDT 70 +G+P PWP A P GWLKCNG F +YP+LAK YP LPDLRGEFIRGWDD RG+DT Sbjct: 59 IGIPQPWPLAEAPEGWLKCNGQTFDTAKYPQLAKLYPAGTLPDLRGEFIRGWDDERGVDT 118 Query: 71 GRSILSIQ-GYATEDHAHGLPSRSTIVTDATINFYFDEIWVNSGTDIIKRGNTNDAGLPA 129 R +LS Q G G P+ ++I GN ++ Sbjct: 119 DRKLLSAQAGTHILGDDGGYPTLNSI------------------------GNLSECNADK 154 Query: 130 PDYGTFKTYKQSVDGLGAAASE-----TRPRNIAFNYIVRA 165 PD Y + ASE TRPRNIAF+YIV+A Sbjct: 155 PDGNVRTLYWLDTNKSEKLASEKFWGATRPRNIAFSYIVKA 195 >UniRef50_C6C6Z0 Tail Collar domain protein n=1 Tax=Dickeya dadantii Ech703 RepID=C6C6Z0_DICDC Length = 183 Score = 150 bits (379), Expect = 2e-35, Method: Composition-based stats. Identities = 64/155 (41%), Positives = 76/155 (49%), Gaps = 18/155 (11%) Query: 11 VGVPVPWPSATPPTGWLKCNGAAFSAEEYPELAKAYPTNKLPDLRGEFIRGWDDGRGIDT 70 +G+P PWP A P GWLKCNG AF +YPELAK YP+ LPDLRGEFIRGWDDGRG+DT Sbjct: 46 IGIPQPWPLADAPEGWLKCNGQAFDTAKYPELAKCYPSGTLPDLRGEFIRGWDDGRGVDT 105 Query: 71 GRSILSIQGYATEDHAHGLPSRSTIVTDATINFYFDEIWVNSGTDIIKRGNTNDAGLPAP 130 R ++S Q + I D G I + + A Sbjct: 106 SRELVSAQ------------------SGTYITGDSDSQPSVQGIGNITECHVDSPDSNAR 147 Query: 131 DYGTFKTYKQSVDGLGAAASETRPRNIAFNYIVRA 165 K TRPRNI+FNYIV+A Sbjct: 148 SIYWIPATKTDRLTGPTYWGVTRPRNISFNYIVKA 182 >UniRef50_Q65WH4 Putative uncharacterized protein n=1 Tax=Mannheimia succiniciproducens MBEL55E RepID=Q65WH4_MANSM Length = 296 Score = 148 bits (374), Expect = 6e-35, Method: Composition-based stats. 
Identities = 61/157 (38%), Positives = 80/157 (50%), Gaps = 3/157 (1%) Query: 11 VGVPVPWPSATPPTGWLKCNGAAFSAEEYPELAKAYPTNKLPDLRGEFIRGWDDGRGIDT 70 +G+P P+P + P G L NG FS YPELAK YP+ +LPDLRGEFIRGWD+GRG+D+ Sbjct: 140 IGIPFPYPLSAVPDGCLAFNGQTFSTTTYPELAKKYPSGRLPDLRGEFIRGWDNGRGVDS 199 Query: 71 GRSILSIQGYATEDHAHGLPSRSTIVTDATINFYFDEIWVNSGTDIIKRGNTNDAGLPAP 130 R +L QG H H + + + + + G + L A Sbjct: 200 SRELLRSQGAELSAHTHYVTVTRYANSSGEFGAKISTFSAINNSGWLLSG-ADGLLLAAN 258 Query: 131 DYGTFKTYKQSVDGL--GAAASETRPRNIAFNYIVRA 165 G + K SV L +ETRPRN+AF YI A Sbjct: 259 KSGEIVSEKNSVANLISNTGGNETRPRNVAFQYICLA 295 >UniRef50_C4TTI1 Tail fiber protein n=2 Tax=Yersinia RepID=C4TTI1_YERKR Length = 402 Score = 148 bits (373), Expect = 7e-35, Method: Composition-based stats. Identities = 60/155 (38%), Positives = 77/155 (49%), Gaps = 30/155 (19%) Query: 11 VGVPVPWPSATPPTGWLKCNGAAFSAEEYPELAKAYPTNKLPDLRGEFIRGWDDGRGIDT 70 +G P+PWP P G+LKCNGA F+ +YP+LA AYP+ LPDLRGEFIRG+DDGRG+ Sbjct: 277 IGTPIPWPLTIAPAGYLKCNGAPFNKTQYPKLALAYPSGVLPDLRGEFIRGFDDGRGVRP 336 Query: 71 GRSILSIQGYATEDHAHGLPSRSTIVTDATINFYFDEIWVNSGTDIIKRGNTNDAGLPAP 130 + +L QG + H HG+ N G+ Sbjct: 337 NQPLLGWQGSEIQSHNHGI------------------------------TNFEIRGVTGG 366 Query: 131 DYGTFKTYKQSVDGLGAAASETRPRNIAFNYIVRA 165 + + + ETRPRNIAFNYIVRA Sbjct: 367 PTNAWFPSTNGISTNNSGGDETRPRNIAFNYIVRA 401 >UniRef50_Q7NAA0 Complete genome; segment 1/17 n=2 Tax=Photorhabdus RepID=Q7NAA0_PHOLL Length = 351 Score = 148 bits (373), Expect = 7e-35, Method: Composition-based stats. 
Identities = 61/161 (37%), Positives = 84/161 (52%), Gaps = 15/161 (9%) Query: 11 VGVPVPWPSATPPTGWLKCNGAAFSAEEYPELAKAYPTNKLPDLRGEFIRGWDDGRGIDT 70 VG+P+PW T P G+L C+G F YP+L +AYP+ LPDLRGEFIRGWD+GR ID+ Sbjct: 201 VGIPLPWSKPTAPAGYLICSGQQFDKSMYPKLGEAYPSGALPDLRGEFIRGWDNGRSIDS 260 Query: 71 GRSILSIQGYATEDH--AHGLPSRSTIVTDATINFYFDEIWVNSGTDIIKRGNTNDAGLP 128 GR ILS Q + H ++ IN + + Sbjct: 261 GREILSHQNSTKLPNLYTHAASENIGLLVSPPINHFSSNYPS----------EIMASDFE 310 Query: 129 APDYGTFKTYKQSVDGLGAAASET---RPRNIAFNYIVRAA 166 ++G+ + + ++ G+ + T RPRNIAFNYIVRAA Sbjct: 311 EAEFGSGQYFSTPLNPTGSVSLSTFRVRPRNIAFNYIVRAA 351 >UniRef50_A7FIU0 Tail collar domain protein n=1 Tax=Yersinia pseudotuberculosis IP 31758 RepID=A7FIU0_YERP3 Length = 402 Score = 145 bits (365), Expect = 7e-34, Method: Composition-based stats. Identities = 65/156 (41%), Positives = 80/156 (51%), Gaps = 13/156 (8%) Query: 10 PVGVPVPWPSATPPTGWLKCNGAAFSAEEYPELAKAYPTNKLPDLRGEFIRGWDDGRGID 69 PVG+P+PWP+ PP+GWLKCNGA F+ ++P+LA Y LPDLRGEFIRGWDDG+ D Sbjct: 259 PVGIPMPWPAHIPPSGWLKCNGATFNKAQFPQLASVYTRGVLPDLRGEFIRGWDDGKLAD 318 Query: 70 TGRSILSIQGYATEDHAHGLPSRSTIVTDATINFYFDEIWVNSGTDIIKRGNTNDAGLPA 129 GR +LS Q D I + S + NT + Sbjct: 319 PGRGLLSFQEGTV-----------VGGYDDNDTGDISSIGLYSSGFGDQLTNTQWVSING 367 Query: 130 PDYGTFKTYKQSVDGLGAAASETRPRNIAFNYIVRA 165 + T S+ A TRPRNIAFNYIVRA Sbjct: 368 KRWIT--AGVSSIRYEWYAYLSTRPRNIAFNYIVRA 401 >UniRef50_C3X8U2 Phage Tail Collar Domain containing protein n=1 Tax=Oxalobacter formigenes OXCC13 RepID=C3X8U2_OXAFO Length = 266 Score = 144 bits (362), Expect = 1e-33, Method: Composition-based stats. 
Identities = 52/168 (30%), Positives = 72/168 (42%), Gaps = 17/168 (10%) Query: 7 SALPVGVPVPWPSATPPTGWLKCNGAAFSAEEYPELAKAYPT----------NKLPDLRG 56 + +P G + PP G+LK +GA +YP L A T LPDLRG Sbjct: 105 NGVPTGTIAFFAMTAPPAGYLKADGAIIQRTDYPALFTAIGTTFGEGDGTTTFTLPDLRG 164 Query: 57 EFIRGWDDGRGIDTGRSILSIQGYATEDHAHGLPSRSTIVTDATINFYFDEIWVNSGTDI 116 EFIRGWD+GR ID R+ SIQG A + L +D+ +N+ W T + Sbjct: 165 EFIRGWDNGRNIDCERAFGSIQGDAIRNVTGQLRYAGPQNSDSVMNYQSALQW----TSV 220 Query: 117 IKRGNTNDAGLPAPDYGTFKTYKQSVDGLGAAASETRPRNIAFNYIVR 164 ++ + +Y ASE RPRNIA ++ Sbjct: 221 SQKSPYSAQSSQGSNY---YEINFDASRSVPTASENRPRNIALLACIK 265 >UniRef50_B6VKW8 Sc/svq protein n=2 Tax=Photorhabdus asymbiotica subsp. asymbiotica ATCC 43949 RepID=B6VKW8_PHOAA Length = 316 Score = 143 bits (360), Expect = 3e-33, Method: Composition-based stats. Identities = 65/163 (39%), Positives = 88/163 (53%), Gaps = 21/163 (12%) Query: 3 LGEGSALPVGVPVPWPSATPPTGWLKCNGAAFSAEEYPELAKAYPTNKLPDLRGEFIRGW 62 + + S +PVG P+PWP PP G++ CNG+AF+ +YP+LA+AYP +LPDLRGEFIRGW Sbjct: 174 IKKTSEIPVGSPIPWPLPHPPFGYVTCNGSAFNRSQYPKLAEAYPNGRLPDLRGEFIRGW 233 Query: 63 DDGRGIDTGRSILSIQGYATEDHAHGLPSRSTIVTDATINFYFDEIWVNSGTDIIKRGNT 122 DDGRG D GR +LS Q + ++ Y +I +R Sbjct: 234 DDGRGADNGRKLLSWQE------------------GSALSEYLGSFTTGVAQNIHQR--- 272 Query: 123 NDAGLPAPDYGTFKTYKQSVDGLGAAASETRPRNIAFNYIVRA 165 + D+ +K + G G RPRNIAFNYIV+A Sbjct: 273 DGVTYHDKDHKRYKIPSLEIIGTGVDYFRFRPRNIAFNYIVKA 315 >UniRef50_Q1I687 Putative phage variable tail fibre protein n=1 Tax=Pseudomonas entomophila L48 RepID=Q1I687_PSEE4 Length = 898 Score = 142 bits (357), Expect = 6e-33, Method: Composition-based stats. 
Identities = 53/178 (29%), Positives = 80/178 (44%), Gaps = 15/178 (8%) Query: 3 LGEGSALPVGVPVPWPSATPPTGWLKCNGAAFSAEEYPELAKAYPT-----------NKL 51 L SALPVG +P+P T P G+L+ +G+ SA YP+LA +L Sbjct: 378 LNTASALPVGTMLPFPRGTVPAGFLEVDGSTQSAAVYPDLAAYLGGAFNTGNEAAGFFRL 437 Query: 52 PDLRGEFIRGWDDGRGIDTGRSILSIQGYATEDHAHG----LPSRSTIVTDATINFYFDE 107 PD RGEF+RGWD GRG+D+GR++ S QG + + H H + + + + + Sbjct: 438 PDTRGEFLRGWDHGRGVDSGRAVGSTQGESFKAHTHKDVGFIDNVGGGSGASAVTGATGD 497 Query: 108 IWVNSGTDIIKRGNTNDAGLPAPDYGTFKTYKQSVDGLGAAASETRPRNIAFNYIVRA 165 + G + G + SETRPRN+A + ++A Sbjct: 498 VTSIYGKAYGNSASATAKAYKESAPGALGGAIAGLISGSTGDSETRPRNLAVMWCIKA 555 Score = 130 bits (328), Expect = 1e-29, Method: Composition-based stats. Identities = 50/171 (29%), Positives = 74/171 (43%), Gaps = 16/171 (9%) Query: 7 SALPVGVPVPWPSATPPTGWLKCNGAAFSAEEYPELAKAYPT-----------NKLPDLR 55 S+ PVG +P+P A P G+L+ +G+ S YP+LA +LPD R Sbjct: 579 SSTPVGAILPFPKAEVPAGYLELDGSLQSVATYPDLAAYLGASYNNGTEPAGYFRLPDYR 638 Query: 56 GEFIRGWDDGRGIDTGRSILSIQGYATEDHAHGLPSRSTIVTD-ATINFYFDEIWVNSGT 114 GEF+RGWD GRG+D GR + + Q A ++ + R + G Sbjct: 639 GEFLRGWDHGRGVDPGRGMGTSQSDAIQNITGSIGLRGGAGVGLGVMGGASGAFSTVFGE 698 Query: 115 DIIKRGNTNDAGLPAPDYGTFKTYKQSVDGLGAAASETRPRNIAFNYIVRA 165 T DA A + + AA+ETRPRN + + ++A Sbjct: 699 STSANTITRDASSIAAS----DIARFDASKVVRAAAETRPRNQSVMWCIKA 745 >UniRef50_Q4KHC6 Tail fibre protein, putative n=1 Tax=Pseudomonas fluorescens Pf-5 RepID=Q4KHC6_PSEF5 Length = 369 Score = 141 bits (356), Expect = 7e-33, Method: Composition-based stats. 
Identities = 59/174 (33%), Positives = 83/174 (47%), Gaps = 32/174 (18%) Query: 3 LGEGSALPVGVPVPWPSATPPTGWLKCNGAAFSAEEYPELAKAYPT-----------NKL 51 L SALPVG VP+P T P G+L+ +G+ SA YP+LA T +L Sbjct: 107 LKNMSALPVGAMVPFPKGTVPAGFLEVDGSVQSAATYPDLAAYLGTMFNTGGEGAGNFRL 166 Query: 52 PDLRGEFIRGWDDGRGIDTGRSILSIQGYATEDHAHGLPSRSTIVTDATINFYFDEIWVN 111 P+ RGEF+RGWD GRG+D GR++ S Q +A H H + + T + + W + Sbjct: 167 PESRGEFLRGWDHGRGVDVGRALGSYQAHAVGSHQHPMNYWAWRDGTGTGTHNYAKPWGD 226 Query: 112 SGTDIIKRGNTNDAGLPAPDYGTFKTYKQSVDGLGAAASETRPRNIAFNYIVRA 165 +G +K T G A SETRPRN+A + ++A Sbjct: 227 TGITGVKDPGT---------------------GANAGDSETRPRNLAVMWCIKA 259 >UniRef50_C4GFX3 Putative uncharacterized protein n=2 Tax=Kingella oralis ATCC 51147 RepID=C4GFX3_9NEIS Length = 310 Score = 140 bits (353), Expect = 1e-32, Method: Composition-based stats. Identities = 52/173 (30%), Positives = 67/173 (38%), Gaps = 20/173 (11%) Query: 2 GLGEGSALPVGVPVPWPSATPPTGWLKCNGAAFSAEEYPELAKAYPT----------NKL 51 G S P G + + PTGWLK NGA S Y L A T L Sbjct: 147 GYTANSYCPSGQIGLFATDYAPTGWLKANGAVLSRTVYTNLFAAIGTRFGAGDGHSTFNL 206 Query: 52 PDLRGEFIRGWDDGRGIDTGRSILSIQGYATEDHAHGLPSRSTIVTDATINFYFDEIWVN 111 PDLRGEF R WDDGRG+D GR + S Q A + D + + Sbjct: 207 PDLRGEFPRFWDDGRGVDAGRVLGSWQSDAIRNIT---AQMYLYGQDGSSSQGAFGFRKQ 263 Query: 112 SGTDIIKRGNTNDAGLPAPDYGTFKTYKQSVDGLGAAASETRPRNIAFNYIVR 164 ++ N N+AG+ + + A E RPRNIA ++ Sbjct: 264 GERGLVWSRNDNNAGVVMDFW-------LDASKVVPTAHENRPRNIALLACIK 309 >UniRef50_C8PDQ5 Phage Tail Collar Domain protein n=1 Tax=Campylobacter gracilis RM3268 RepID=C8PDQ5_9PROT Length = 391 Score = 139 bits (351), Expect = 2e-32, Method: Composition-based stats. 
Identities = 54/172 (31%), Positives = 74/172 (43%), Gaps = 19/172 (11%) Query: 3 LGEGSALPVGVPVPWPSATPPTGWLKCNGAAFSAEEYPELAKAYPT----------NKLP 52 L + LPVG + P G+L CNGAA S Y +L A T +P Sbjct: 228 LLSSTILPVGTIITSARTPAPDGFLLCNGAAISRSAYTDLFSAIGTAYGAGDGSSSFNIP 287 Query: 53 DLRGEFIRGWDDGRGIDTGRSILSIQGYATEDHAHGLPSRSTIVTDATINFYFDEIWVNS 112 DLRGEFIRG D+GRG+D GR++ S QG A + T + + + Sbjct: 288 DLRGEFIRGADNGRGVDGGRALGSAQGDAIRNI--------TARAIGMGDRNSIPTLLGA 339 Query: 113 GTDIIKRGNTNDAGLPAPDYGTFKTYKQSVDGLGAAASETRPRNIAFNYIVR 164 I K G D G F+ + A+E RPRN+A N+ ++ Sbjct: 340 LYGIQKSTRIESVGDVLGDGGYFEWG-FDASKVVPVANENRPRNVAVNFYIK 390 >UniRef50_C6ABW9 Phage tail collar protein n=1 Tax=Bartonella grahamii as4aup RepID=C6ABW9_BARGA Length = 370 Score = 139 bits (351), Expect = 3e-32, Method: Composition-based stats. Identities = 53/169 (31%), Positives = 73/169 (43%), Gaps = 27/169 (15%) Query: 6 GSALPVGVPVPWPSATPPTGWLKCNGAAFSAEEYPELAKAYPT----------NKLPDLR 55 +++PVG + +P+ T P GWLK NGA S +Y +L T +LPDLR Sbjct: 218 NNSMPVGTVIYYPALTVPKGWLKANGALISRSDYAQLFAVIGTTYGAGDGKTTFRLPDLR 277 Query: 56 GEFIRGWDDGRGIDTGRSILSIQGYATEDHAHGLPSRSTIVTDATINFYFDEIWVNSGTD 115 GEF+RG DD R ID R+I S QG A + L + A+ F + + +S T Sbjct: 278 GEFLRGVDDERNIDPNRTIGSQQGDAIRNITGEL-NFDAKAKAASGAFKYGGVSNSSNTS 336 Query: 116 IIKRGNTNDAGLPAPDYGTFKTYKQSVDGLGAAASETRPRNIAFNYIVR 164 K A+E RPRNIA ++R Sbjct: 337 SGSSSTI----------------KFDASRSVPTANENRPRNIALLALIR 369 >UniRef50_B8DPV9 Tail Collar domain protein n=1 Tax=Desulfovibrio vulgaris str. 'Miyazaki F' RepID=B8DPV9_DESVM Length = 530 Score = 138 bits (348), Expect = 6e-32, Method: Composition-based stats. 
Identities = 50/163 (30%), Positives = 71/163 (43%), Gaps = 17/163 (10%) Query: 9 LPVGVPVPWPSATPPTGWLKCNGAAFSAEEYPELAKAYPTNK------LPDLRGEFIRGW 62 +P+G + +P T PTG+L C G + YP+L LPDLRGEF RG Sbjct: 212 VPIGAILDFPVNTVPTGFLVCAGQVVTRTAYPDLVTYLTGGTVAVNATLPDLRGEFRRGA 271 Query: 63 DDGRGIDTGRSILSIQGYATEDHAHGLPSRSTIVTDATINFYFDEIWVNSGTDIIKRGNT 122 D GRG+D GR + S QG A + L N+ + +G + +T Sbjct: 272 DLGRGVDAGRVVGSAQGDAIRNITGSL-----------YNYIQNNASQENGALRTQVAST 320 Query: 123 NDAGLPAPDYGTFKTYKQSVDGLGAAASETRPRNIAFNYIVRA 165 ++ A ++ T ASE RPRNIA ++A Sbjct: 321 LNSPFGAGTIMSWSTLSIDASRQVPTASENRPRNIAVVPCIKA 363 >UniRef50_D0KGE1 Shufflon domain protein n=1 Tax=Pectobacterium wasabiae WPP163 RepID=D0KGE1_PECWW Length = 532 Score = 135 bits (339), Expect = 6e-31, Method: Composition-based stats. Identities = 54/162 (33%), Positives = 76/162 (46%), Gaps = 19/162 (11%) Query: 6 GSALPVGVPVPWPSATPPTGWLKCNGAAFSAEEYPELAKAYPTNKLPDLRGEFIRGWDDG 65 S++ G W + PP GWL+ NG F+ P LA YP++++PD RG F RGWD+G Sbjct: 387 SSSIQPGTITMWGTPVPPEGWLELNGQPFNPSGNPVLASLYPSSQVPDFRGYFPRGWDNG 446 Query: 66 RGIDT-GRSILSIQGYATEDHAHGL-PSRSTIVTDATINFYFDEIWVNSGTDIIKRGNTN 123 GID R+ILS+QG A + P S+ + Y NSG + N Sbjct: 447 AGIDPDSRAILSVQGDAIRNITGEFNPGGSSNWGKGVFSSYGWPYPSNSG-------SAN 499 Query: 124 DAGLPAPDYGTFKTYKQSVDGLGAAASETRPRNIAFNYIVRA 165 DA + + A+E RP NIA +I++A Sbjct: 500 DASIIT----------FDASRVVPTAAENRPTNIAVMFIIKA 531 >UniRef50_C6BT48 Tail Collar domain protein n=1 Tax=Desulfovibrio salexigens DSM 2638 RepID=C6BT48_DESAD Length = 208 Score = 135 bits (339), Expect = 7e-31, Method: Composition-based stats. 
Identities = 58/160 (36%), Positives = 77/160 (48%), Gaps = 10/160 (6%) Query: 6 GSALPVGVPVPWPSATPPTGWLKCNGAAFSAEEYPELAKAYPTNKLPDLRGEFIRGWDDG 65 S P+G + TPP GWL+CNG S YPELA N +PDLRGEFIRG D G Sbjct: 58 ASDYPIGAVAAYRGDTPPVGWLECNGQ--STTGYPELAAVVGAN-VPDLRGEFIRGLDSG 114 Query: 66 RGIDTGRSILSIQGYATEDHAHGLPSRSTIVTDATINFYFDEIWVNSGTDIIKRGNTNDA 125 RG+D GR++ S Q A E H+H + T T + Y + ++ N Sbjct: 115 RGVDAGRALGSAQADAMERHSHQTTITVSGRTSVTASPYH---SAGAARSLVTTPNFGSP 171 Query: 126 GLPAPDYGTFKTYKQSVDGLGAAASETRPRNIAFNYIVRA 165 A +F + +ETRPRN+A YI++A Sbjct: 172 FGGA----SFSASGTGTSTSVGSGAETRPRNVALMYIIKA 207 >UniRef50_C3KCU2 Putative phage tail fiber-related protein n=1 Tax=Pseudomonas fluorescens SBW25 RepID=C3KCU2_PSEFS Length = 658 Score = 134 bits (338), Expect = 8e-31, Method: Composition-based stats. Identities = 57/174 (32%), Positives = 76/174 (43%), Gaps = 36/174 (20%) Query: 3 LGEGSALPVGVPVPWPSATPPTGWLKCNGAAFSAEEYPELAKAYPT-----------NKL 51 + + SALPVG V +P P G+L+ +G+ SA YP+LAK T +L Sbjct: 174 IAQSSALPVGSMVAFPIDKVPVGFLEIDGSVKSATAYPDLAKFLGTAFNKGDEGAGNFRL 233 Query: 52 PDLRGEFIRGWDDGRGIDTGRSILSIQGYATEDHAHGLPSRSTIVTDATINFYFDEIWVN 111 P+ RGEF+RGWD GRG+D GR S Q + H H T+ N D I Sbjct: 234 PESRGEFLRGWDHGRGVDAGRLAGSYQTDQFKSHTHEY---DTMQGGGAANSVSDTIAAQ 290 Query: 112 SGTDIIKRGNTNDAGLPAPDYGTFKTYKQSVDGLGAAASETRPRNIAFNYIVRA 165 S T + GA SETRPRN+A + ++A Sbjct: 291 SNA----------------------TSQTGHITGGAGGSETRPRNLAVMWCIKA 322 Score = 118 bits (296), Expect = 6e-26, Method: Composition-based stats. 
Identities = 47/170 (27%), Positives = 72/170 (42%), Gaps = 17/170 (10%) Query: 7 SALPVGVPVPWPSATPPTGWLKCNGAAFSAEEYPELAKAYPT-----------NKLPDLR 55 SA+PVG +P+ A P G+L+ +G+ S YP+LA T +LP+ R Sbjct: 346 SAVPVGSIIPFLKAAVPPGYLELDGSVQSIATYPDLAAYLGTTFNTGSEPAGYFRLPESR 405 Query: 56 GEFIRGWDDGRGIDTGRSILSIQGYATEDHAHGLPSRSTIVTDATINFYFDEIWVNSGTD 115 GEF+RGWD GRG+D GR + S Q + +P+ TI T+ D Sbjct: 406 GEFLRGWDHGRGMDAGREVGSWQKGSMVAVDTNIPATQTIATNLVDAAAARMRGGYDSGD 465 Query: 116 IIKRGNTNDAGLPAPDYGTFKTYKQSVDGLGAAASETRPRNIAFNYIVRA 165 + G+ + + TRP N+A + ++A Sbjct: 466 VGLYSGITLMGVN------PQANVALPGNIEVTYGITRPNNLAVMWCIKA 509 >UniRef50_C3X8Y3 Putative uncharacterized protein n=1 Tax=Oxalobacter formigenes OXCC13 RepID=C3X8Y3_OXAFO Length = 270 Score = 134 bits (337), Expect = 1e-30, Method: Composition-based stats. Identities = 52/167 (31%), Positives = 73/167 (43%), Gaps = 35/167 (20%) Query: 8 ALPVGVPVPWPSATPPTGWLKCNGAAFSAEEYPELAKAYP----------TNKLPDLRGE 57 A+P G V + S P G+LK +G+A EEY EL A T LPDLRGE Sbjct: 128 AVPAGTVVYFCSHKAPYGYLKADGSAVGREEYKELFAAIGVYFGSGDGVSTFNLPDLRGE 187 Query: 58 FIRGWDDGRGIDTGRSILSIQGYATEDHAHGLPSRSTIVTDATINFYFDEIWVNSGTDII 117 FIR D+GRG+D GR + ++Q + H HG R + ++ + W ++ Sbjct: 188 FIRSLDNGRGVDAGRELGNVQMDEFKSHYHGFLDRPNMRLESGVY-----TWTPQVMEV- 241 Query: 118 KRGNTNDAGLPAPDYGTFKTYKQSVDGLGAAASETRPRNIAFNYIVR 164 + S+ A SETRPRNIA ++ Sbjct: 242 -------------------AEQDSISTTRAGGSETRPRNIALLACIK 269 >UniRef50_B5S308 Phage tail collar protein n=2 Tax=Ralstonia solanacearum RepID=B5S308_RALSO Length = 225 Score = 132 bits (332), Expect = 4e-30, Method: Composition-based stats. 
Identities = 54/164 (32%), Positives = 73/164 (44%), Gaps = 16/164 (9%) Query: 12 GVPVPWPSATPPTGWLKCNGAAFSAEEYPELAKAYPT----------NKLPDLRGEFIRG 61 G + TPP GWLKCNGAA S Y L K T LP+LR EF RG Sbjct: 66 GSVAMFACKTPPAGWLKCNGAAVSRTTYERLFKLIGTTFGAGDGAATFNLPELRAEFPRG 125 Query: 62 WDDGRGIDTGRSILSIQGYATEDHAHGLPSRSTIVTDATINFYFDEIW-VNSGTDIIKRG 120 WDDGRG+D+GR+ S Q A H H ++ + D + F + + ++ G Sbjct: 126 WDDGRGVDSGRAFGSSQAQALSSHQH----KTAVGFDGSNLFGWGDGSATPIFGSEVQAG 181 Query: 121 NTNDAGLPAPDYGTFKTYKQSVDGLGAAASETRPRNIAFNYIVR 164 G G + V +G + ETRPRN+A ++ Sbjct: 182 VLRVVGAVTQSGGAARIGYTDVTPMGVSG-ETRPRNVALLACIK 224 >UniRef50_Q3KH70 Putative phage tail fiber-related protein n=1 Tax=Pseudomonas fluorescens Pf0-1 RepID=Q3KH70_PSEPF Length = 817 Score = 132 bits (331), Expect = 5e-30, Method: Composition-based stats. Identities = 55/175 (31%), Positives = 87/175 (49%), Gaps = 21/175 (12%) Query: 3 LGEGSALPVGVPVPWPSATPPTGWLKCNGAAFSAEEYPELAKAYP-----------TNKL 51 + + SALPVG V +P +PP G+L+ + + S+ YP+L+ +L Sbjct: 317 IAKASALPVGSIVAFPVDSPPPGFLELDNSVKSSATYPDLSAYLGGKFNKGDEGVGNFRL 376 Query: 52 PDLRGEFIRGWDDGRGIDTGRSILSIQGYATEDHAHGLPSRSTIVTDATINFYFDEIWV- 110 P+ RGEF+RGWD GRG+D GR+ S Q + + H H +P+ S N + + Sbjct: 377 PEARGEFLRGWDHGRGVDGGRAQGSSQTDSLKAHYHLIPTGSGGGQAVDPNGEIPTVVLK 436 Query: 111 NSGTDIIKRGNTNDAGLPAPDYGTFKTYKQSVDGLGAAASETRPRNIAFNYIVRA 165 ++ D + R ++A L T+ AA+ETRPRNIA + ++A Sbjct: 437 DTAADWVLRTEGDNAELSIGRVRTYNF---------GAATETRPRNIAVMWCIKA 482 Score = 132 bits (331), Expect = 5e-30, Method: Composition-based stats. 
Identities = 55/171 (32%), Positives = 82/171 (47%), Gaps = 19/171 (11%) Query: 6 GSALPVGVPVPWPSATPPTGWLKCNGAAFSAEEYPELAKAYPT-----------NKLPDL 54 GSA+PVG + +P+ P G+L+ NG+ + YP+LA T +LP+ Sbjct: 505 GSAVPVGAVMAFPTGIVPPGFLELNGSVQNTSTYPDLAAYLGTTYNKGDEGAGNFRLPES 564 Query: 55 RGEFIRGWDDGRGIDTGRSILSIQGYATEDHAHGLPSRSTIVTDATINFYFDEIWVNSGT 114 RGEF+RGWD GRG+D GR I + QG + DH H + + +N + V S T Sbjct: 565 RGEFLRGWDHGRGVDAGRGIGTNQGQSMVDHYHTVLTAD---AGGVLNPIAGNL-VGSFT 620 Query: 115 DIIKRGNTNDAGLPAPDYGTFKTYKQSVDGLGAAASETRPRNIAFNYIVRA 165 ++ AG+ G T +ETRPRN+A + ++A Sbjct: 621 NLAPISKPAGAGV----LGATLTSSIHGPAAEKGGTETRPRNLAVMWCIKA 667 >UniRef50_A9DEM1 Tail protein 2 n=1 Tax=Yersinia phage PY100 RepID=A9DEM1_9CAUD Length = 255 Score = 132 bits (331), Expect = 5e-30, Method: Composition-based stats. Identities = 51/194 (26%), Positives = 75/194 (38%), Gaps = 39/194 (20%) Query: 6 GSALPVGVPVPWPSATPPTGWLKCNGAAFSAEEYPELAKAYPTNKLPDLRGEFIRGWDDG 65 G+A PVG P+ WPS T P GW G F +YP LAK YP+ LPD+RG I+ D Sbjct: 68 GNATPVGAPLAWPSDTAPDGWALMIGQTFDKVKYPLLAKVYPSGVLPDMRGRVIKAKPD- 126 Query: 66 RGIDTGRSILSIQGYATEDHAHGLPSRSTIVTDATINFYFDEIWV--------------- 110 GR++LS++ + H H + + T AT F Sbjct: 127 -----GRAVLSLEEDQVKSHTHTGKAATAGGTRATSTFDHGNKRTTTNGNHTHGSPQGAR 181 Query: 111 ------------------NSGTDIIKRGNTNDAGLPAPDYGTFKTYKQSVDGLGAAASET 152 N + +D + ++ ++ ++ +E Sbjct: 182 HGGSGQYTSGDDETNSVFNWPATSAAGDHFHDVQIGPHNHNVDINHEHTLQIDATGGTEN 241 Query: 153 RPRNIAFNYIVRAA 166 +NIA NYIVR A Sbjct: 242 TVKNIAMNYIVRLA 255 >UniRef50_B4T041 Gp19 n=1 Tax=Salmonella enterica subsp. enterica serovar Newport str. SL254 RepID=B4T041_SALNS Length = 580 Score = 131 bits (329), Expect = 1e-29, Method: Composition-based stats. 
Identities = 57/190 (30%), Positives = 83/190 (43%), Gaps = 36/190 (18%) Query: 8 ALPVGVPVPWPSATPPTGWLKCNGAAFSAEEYPELAKAYPTNKLPDLRGEFIRGWDDGRG 67 + P GVP+PWPS T P G+ G AF YP LA AYP+ +PD+RG I+G Sbjct: 396 SCPPGVPLPWPSDTIPAGYALMQGQAFDKNVYPLLAIAYPSGTIPDMRGWTIKGKPV--- 452 Query: 68 IDTGRSILSIQGYATEDHAHGLPSRST-IVTDATINFYFDEIWVNS-------------- 112 +GR++LS + + H+HG + T + T T +F + N+ Sbjct: 453 --SGRAVLSQELDGNKSHSHGARALDTDLGTKGTSSFDYGTKSSNTTGGHNHSAGGTYGG 510 Query: 113 ------------GTDIIKRGNTNDAG---LPAPDYGTF-KTYKQSVDGLGAAASETRPRN 156 G D + N + A + D+ + + V +ET +N Sbjct: 511 DSIGGKARVQRDGNDQLTSWNGDHAHTTWIGPHDHTVYIGPHGHVVIVDADGNAETTVKN 570 Query: 157 IAFNYIVRAA 166 IAFNYIVR A Sbjct: 571 IAFNYIVRLA 580 >UniRef50_B2ZY49 Phage tail collar domain protein n=1 Tax=Ralstonia phage RSL1 RepID=B2ZY49_9CAUD Length = 498 Score = 130 bits (327), Expect = 1e-29, Method: Composition-based stats. Identities = 48/185 (25%), Positives = 69/185 (37%), Gaps = 23/185 (12%) Query: 3 LGEGSALPVGVPVPWPSATPPTGWLKCNGAAFSAEEYPELAKAYPT----------NKLP 52 L +P G +P+ T P G+L CN AA S + L T LP Sbjct: 313 LNPPQLVPPGTILPFAGTTIPAGYLACNAAAISRTGFASLYSVIGTTYGVGNGSTTFNLP 372 Query: 53 DLRGEFIRGWDDGRGIDTGRSILSIQGYATEDHAHGLP----SRSTIVTDATINFYFDEI 108 DLRG F+RGWD+GRG D GR + QG A H H + + + + + Sbjct: 373 DLRGVFVRGWDNGRGQDPGRVFGTYQGDAFRSHNHAVSDPGHAHGVYDPGHSHTWTLGTL 432 Query: 109 WVNSGTDIIKRGNTNDAGL---------PAPDYGTFKTYKQSVDGLGAAASETRPRNIAF 159 + G + G + L +ET P+N+A Sbjct: 433 RQSGGDTSCYVPSARYGGGEFQFTETTAAVGTGIGIYGNVTGIGTLVNGGAETTPKNVAM 492 Query: 160 NYIVR 164 NYI++ Sbjct: 493 NYIIK 497 >UniRef50_C3X912 Phage tail collar domain-containing protein n=1 Tax=Oxalobacter formigenes OXCC13 RepID=C3X912_OXAFO Length = 436 Score = 128 bits (322), Expect = 6e-29, Method: Composition-based stats. 
Identities = 43/168 (25%), Positives = 62/168 (36%), Gaps = 25/168 (14%) Query: 7 SALPVGVPVPWPSATPPTGWLKCNGAAFSAEEYPELAKAYPT----------NKLPDLRG 56 ++P G + + TPP G+L NGA S Y L A T +LPDLRG Sbjct: 283 DSVPAGSVHYFATQTPPDGYLVANGALVSRTVYARLFSAIGTTFGEGDGGSTFQLPDLRG 342 Query: 57 EFIRGWDDGRGIDTGRSILSIQGYATEDHAHGLPSRSTIVTDATINFYFDEIWVNSGTDI 116 EF+RGWD R +D R ++QG A + + FY+ Sbjct: 343 EFLRGWDAARNLDPERGFGTVQGDAIRNIIGTFGGNDQERRFLSGPFYY----------- 391 Query: 117 IKRGNTNDAGLPAPDYGTFKTYKQSVDGLGAAASETRPRNIAFNYIVR 164 D G + + A+E RP N+A ++ Sbjct: 392 ----IGTDGGGKTGSSNGTDNFGFDASRVVPTANENRPHNVALLACIK 435 >UniRef50_A0NQ95 Putative tail fiber-related protein n=1 Tax=Labrenzia aggregata IAM 12614 RepID=A0NQ95_9RHOB Length = 329 Score = 128 bits (321), Expect = 7e-29, Method: Composition-based stats. Identities = 52/187 (27%), Positives = 70/187 (37%), Gaps = 29/187 (15%) Query: 7 SALPVGVPVPWPSATPPTGWLKCNGAAFSAEEYPELAKAYPT----------NKLPDLRG 56 + + G + +T P GWLK NGA S Y +L A T LPDLRG Sbjct: 142 NGVAPGCVAYYAMSTAPDGWLKANGAEISRTAYADLFAAIGTIFGVGDGNSTFNLPDLRG 201 Query: 57 EFIRGWDDGRGIDTGRSILSIQGYATEDHAHGLPSRSTIVTDATINFYFD--------EI 108 EF+RGWDD RG+D R + S Q H H + S + E Sbjct: 202 EFLRGWDDARGVDGARVLGSSQSDQNASHTHTGSTSSDSHSHTGTTNTTGNHTHNMAYEG 261 Query: 109 WVNSGTDIIKRGNTNDAGLPAPDYGTFKTYKQS-----------VDGLGAAASETRPRNI 157 N+GT + + P P + + V + SE RPRNI Sbjct: 262 GTNAGTGLAAPATSRSNTSPGPTVNYSGNHSHTFSTSSDSHSHSVTTDASGGSEARPRNI 321 Query: 158 AFNYIVR 164 A ++ Sbjct: 322 ALLACIK 328 >UniRef50_C7BSQ6 Putative tail fiber protein n=1 Tax=Photorhabdus asymbiotica subsp. asymbiotica ATCC 43949 RepID=C7BSQ6_PHOAA Length = 318 Score = 127 bits (318), Expect = 2e-28, Method: Composition-based stats. 
Identities = 49/87 (56%), Positives = 63/87 (72%) Query: 7 SALPVGVPVPWPSATPPTGWLKCNGAAFSAEEYPELAKAYPTNKLPDLRGEFIRGWDDGR 66 + +PVGVP+PWP+A PPTGWL+CNGAAF ++P+L AY + LPDLRGEFIRGWD R Sbjct: 217 NNIPVGVPIPWPTAIPPTGWLQCNGAAFDKSKFPQLVAAYSSGVLPDLRGEFIRGWDSSR 276 Query: 67 GIDTGRSILSIQGYATEDHAHGLPSRS 93 G+DT RSILS Q ++ + S + Sbjct: 277 GVDTNRSILSTQIDTMQNITGKVDSHN 303 >UniRef50_C3X3G6 Putative uncharacterized protein n=1 Tax=Oxalobacter formigenes HOxBLS RepID=C3X3G6_OXAFO Length = 237 Score = 126 bits (317), Expect = 2e-28, Method: Composition-based stats. Identities = 50/169 (29%), Positives = 70/169 (41%), Gaps = 36/169 (21%) Query: 7 SALPVGVPVPWPSATPPTGWLKCNGAAFSAEEYPELAKAYPT-----------NKLPDLR 55 + +P G + S TPP GWL +G+ YP+L A T +LPDLR Sbjct: 93 NGVPPGSVLYLCSETPPDGWLVADGSMLLVAAYPDLFAAIGTAFGSGDNGMTTFRLPDLR 152 Query: 56 GEFIRGWDDGRGIDTGRSILSIQGYATEDHAHGLPSRSTIVTDATINFYFDEIWVNSGTD 115 GEFIR D GRG+D GR + S+QG +H HG + + + + + Sbjct: 153 GEFIRCLDKGRGLDDGRPLGSVQGDEIRNHNHGFLDIPKVQFGSGVYSWTPQ-------- 204 Query: 116 IIKRGNTNDAGLPAPDYGTFKTYKQSVDGLGAAASETRPRNIAFNYIVR 164 + AP T+ SETRPRNIA ++ Sbjct: 205 ------VMEVAEHAPIATTWT-----------GGSETRPRNIALTACIK 236 >UniRef50_P03764 Side tail fiber protein n=2 Tax=root RepID=STF_LAMBD Length = 774 Score = 125 bits (315), Expect = 4e-28, Method: Composition-based stats. Identities = 46/144 (31%), Positives = 66/144 (45%), Gaps = 6/144 (4%) Query: 1 VGLGEGSALPVGVPVPWPSATPPTGWLKCNGAAFSAEEYPELAKAYPTNKLPDLRGEFIR 60 +G GE SA P G P+PWPS P+G++ G AF YP+LA AYP+ LPD+RG I+ Sbjct: 522 LGAGENSAFPAGAPIPWPSDIVPSGYVLMQGQAFDKSAYPKLAVAYPSGVLPDMRGWTIK 581 Query: 61 GWDDGRGIDTGRSILSIQGYATEDHAHGLPSRST-IVTDATINFYFDEIWVNSGTDIIKR 119 G +GR++LS + + H H + T + T T +F + S K Sbjct: 582 GKPA-----SGRAVLSQEQDGIKSHTHSASASGTDLGTKTTSSFDYGTKTTGSFDYGTKS 636 Query: 120 GNTNDAGLPAPDYGTFKTYKQSVD 143 N A + T + Sbjct: 637 TNNTGAHAHSLSGSTGAAGAHAHT 660 Score = 43.9 bits (102), Expect = 0.002, Method: Composition-based stats. 
Identities = 22/91 (24%), Positives = 34/91 (37%), Gaps = 5/91 (5%) Query: 76 SIQGYATEDHAHGLPSRSTIVTDATINFYFDEIWVNSGTDIIKRGNTNDAGLPAPDYGTF 135 S QG A S S ++ ++ V G + A + G+ Sbjct: 689 STQGIAYLSKTDSQGSHSHSLSGTAVSAGAHAHTVGIGA--HQHPVVIGAHAHSFSIGS- 745 Query: 136 KTYKQSVDGLGAAASETRPRNIAFNYIVRAA 166 + ++ A +E +NIAFNYIVR A Sbjct: 746 --HGHTITVNAAGNAENTVKNIAFNYIVRLA 774 >UniRef50_B7UGJ3 Predicted tai fiber protein n=15 Tax=Escherichia coli RepID=B7UGJ3_ECO27 Length = 221 Score = 124 bits (311), Expect = 1e-27, Method: Composition-based stats. Identities = 67/174 (38%), Positives = 87/174 (50%), Gaps = 32/174 (18%) Query: 1 VGLGEGSALPVGVPVPWPSATPPTG---------WLKCNGAAFSAEEYPELAKAYPTNKL 51 +GLGEG A +GVP WPSA P +LK NGA FSA +YP LAK +P+ L Sbjct: 70 LGLGEG-APAIGVPFFWPSAAMPDTVIESWSGMVFLKFNGAKFSATDYPVLAKVFPSLVL 128 Query: 52 PDLRGEFIRGWDDGRGIDTGRSILSIQGYATEDHAHGLPSRSTIVTDATINFYFDEIWVN 111 P+ RG+FIR WDDGRG D+GR++LS Q +++ + Sbjct: 129 PEARGDFIRIWDDGRGADSGRALLSWQ------------------AATSLSQFGGNYPEG 170 Query: 112 SGTDIIKRGNTNDAGLPAPDYGTFKTYKQSVDGLGAAASETRPRNIAFNYIVRA 165 SG I + P + F+ SV G G RPRNIAFN++VRA Sbjct: 171 SGHAIADY---DGISAHEPGFSRFQYTSNSV-GDGVNFVAVRPRNIAFNFLVRA 220 >UniRef50_B7LKX7 Putative side tail phage protein n=2 Tax=Escherichia RepID=B7LKX7_ESCF3 Length = 567 Score = 123 bits (309), Expect = 2e-27, Method: Composition-based stats. 
Identities = 56/214 (26%), Positives = 79/214 (36%), Gaps = 58/214 (27%) Query: 6 GSALPVGVPVPWPSATPPTGWLKCNGAAFSAEEYPELAKAYPTNKLPDLRGEFIRGWDDG 65 + PVG+P+PWPS + P+G+ G F+ YP+LA AYP+ +PD+RG I+G Sbjct: 359 AESCPVGMPIPWPSDSVPSGYALMTGQTFNKTSYPKLAIAYPSGVIPDMRGWIIKGKPS- 417 Query: 66 RGIDTGRSILSIQGYATEDHAHGLPSRST------------------------------- 94 +GR+ILS + + H H ST Sbjct: 418 ----SGRAILSTELDGVKSHNHTGSISSTNLGTITSTSTDLGTKTTASFNHGSRNTSTSG 473 Query: 95 -----IVTDATINFYFDEIWVNSGTDIIKRGNTNDAGLPAPDYG---------------- 133 I TD +W + +D R T AG Sbjct: 474 EHTHRIPTDGAEGKDGPSLWNSPNSDENYREPTESAGSHYHSITIGAHAHTIALGSHTHN 533 Query: 134 -TFKTYKQSVDGLGAAASETRPRNIAFNYIVRAA 166 T+ S+ +E +NIAFNYIVR A Sbjct: 534 IVLGTHNHSIIINNTGNTENTVKNIAFNYIVRLA 567 >UniRef50_B5FQX9 Side tail fiber protein n=2 Tax=Salmonella enterica subsp. enterica RepID=B5FQX9_SALDC Length = 569 Score = 123 bits (309), Expect = 2e-27, Method: Composition-based stats. Identities = 51/193 (26%), Positives = 79/193 (40%), Gaps = 36/193 (18%) Query: 5 EGSALPVGVPVPWPSATPPTGWLKCNGAAFSAEEYPELAKAYPTNKLPDLRGEFIRGWDD 64 ++ PVG + WPS P G+ G +F YP LA AYP+ +PD+RG I+G Sbjct: 382 PPNSYPVGAAIAWPSDATPAGYALMQGQSFDKSAYPLLAIAYPSGIIPDMRGWTIKGKPI 441 Query: 65 GRGIDTGRSILSIQGYATEDHAHGLPSRST-IVTDATINFYFDEIWVNS--------GTD 115 +GR++LS + + H+H ++ T + T +T +F + N+ G Sbjct: 442 -----SGRAVLSQEMDGNKSHSHSARAQDTDLGTKSTSSFDYGTKSTNTTGNHTHQFGGY 496 Query: 116 IIKRGNTNDAGLPAPDYGTFK----------------------TYKQSVDGLGAAASETR 153 I ++ P G + + V +ET Sbjct: 497 INSYWGDSNHTSFQPGGGAWTQAAGDHAHTVYIGGHEHTMYIGPHGHVVIVDADGNAETT 556 Query: 154 PRNIAFNYIVRAA 166 +NIAFNYIVR A Sbjct: 557 VKNIAFNYIVRLA 569 >UniRef50_D0KGE5 Tail Collar domain protein n=4 Tax=Pectobacterium RepID=D0KGE5_PECWW Length = 157 Score = 123 bits (308), Expect = 2e-27, Method: Composition-based stats. 
Identities = 54/162 (33%), Positives = 75/162 (46%), Gaps = 19/162 (11%) Query: 6 GSALPVGVPVPWPSATPPTGWLKCNGAAFSAEEYPELAKAYPTNKLPDLRGEFIRGWDDG 65 S++ G W + PP GWL+ NG F+ P LA YP++++PD RG F RGWD+G Sbjct: 12 SSSIQPGTITMWGTPVPPEGWLELNGQPFNPSGNPVLASLYPSSQVPDFRGYFPRGWDNG 71 Query: 66 RGIDT-GRSILSIQGYATEDHAHGL-PSRSTIVTDATINFYFDEIWVNSGTDIIKRGNTN 123 GID R+ILS+QG A + P S+ + Y NSG+ T Sbjct: 72 AGIDPDSRAILSVQGDAIRNITGEFNPGGSSNWGKGVFSSYGWPYPSNSGSANDASIITF 131 Query: 124 DAGLPAPDYGTFKTYKQSVDGLGAAASETRPRNIAFNYIVRA 165 DA + A+E RP NIA +I++A Sbjct: 132 DA-----------------SRVVPTAAENRPTNIAVMFIIKA 156 >UniRef50_B2FIY3 Putative phage collar protein n=1 Tax=Stenotrophomonas maltophilia K279a RepID=B2FIY3_STRMK Length = 410 Score = 123 bits (308), Expect = 2e-27, Method: Composition-based stats. Identities = 47/165 (28%), Positives = 71/165 (43%), Gaps = 18/165 (10%) Query: 9 LPVGVPVPWPSATPPTGWLKCNGAAFSAEEYPELAKAYPT---------NKLPDLRGEFI 59 LP G+ +P+ PP GWL+CNGA S Y +L T +LPDLRGEF+ Sbjct: 254 LPAGMVAHFPTGGPPPGWLRCNGADVSRTTYADLFAVIGTLFGSANDMTFRLPDLRGEFV 313 Query: 60 RGWDDGRGIDTGRSILSIQGYATEDHAHGLPSRSTIVTDATINFYFDEIWVNSGTDIIKR 119 RGWDDGRG+D GR++ S+Q + + + + + Sbjct: 314 RGWDDGRGVDGGRALGSLQA---------ATEVLSSWGASAGGLVSGQYQYSLADFGVHT 364 Query: 120 GNTNDAGLPAPDYGTFKTYKQSVDGLGAAASETRPRNIAFNYIVR 164 N + + + S++G G RPRN+A ++ Sbjct: 365 TNADSSRQVNNVGSGRLSRMDSINGGGLTLIGVRPRNVALLACIK 409 >UniRef50_A9IXL3 Phage-related protein n=6 Tax=Bartonella RepID=A9IXL3_BART1 Length = 334 Score = 121 bits (304), Expect = 7e-27, Method: Composition-based stats. 
Identities = 47/173 (27%), Positives = 69/173 (39%), Gaps = 24/173 (13%) Query: 8 ALPVGVPVPWPSATPPTGWLKCNGAAFSAEEYPELAKAYP----------TNKLPDLRGE 57 + G + S P+GWL C+G +S + Y L T +PDLRG Sbjct: 169 SFSPGFIGTFASEKIPSGWLLCDGKEYSRKNYANLFAVLGETWGKGDGKTTFNVPDLRGM 228 Query: 58 FIRGWDDGRGIDTGRSILSIQGYATEDHAHGLPSRSTIVTDATINFYFDEIWVNSGTDII 117 F+RG D G+ ID GR + S Q + + H H + ST + + DI+ Sbjct: 229 FLRGLDSGKEIDKGRLLGSRQEESFKSHTHEGKTDSTGKHQHS--------YPTIKNDIL 280 Query: 118 KRGNTNDAGLPAPDYGTFK------TYKQSVDGLGAAASETRPRNIAFNYIVR 164 + + G A Y T ++ V ETRP N+A Y V+ Sbjct: 281 RYKREDYKGYVAVVYKTDTLTEPAGEHEHKVLLQKTGGDETRPVNMAVVYAVK 333 >UniRef50_A9IRI0 Phage related protein n=9 Tax=Bartonella RepID=A9IRI0_BART1 Length = 324 Score = 121 bits (304), Expect = 7e-27, Method: Composition-based stats. Identities = 43/168 (25%), Positives = 66/168 (39%), Gaps = 11/168 (6%) Query: 7 SALPVGVPVPWPSATPPTGWLKCNGAAFSAEEYPELAKAYP---------TNKLPDLRGE 57 + P G + P GWL C+G A+ E+YP+L KA T K+PD RG Sbjct: 159 ESFPAGFIATFAMRNIPNGWLLCDGTAYKREDYPQLFKAIGDKWGKNSDTTFKVPDFRGM 218 Query: 58 FIRGWDDGRGIDTGRSILSIQGYATEDHAHGLPSRSTIVTDATINFYFDEIWVNSGTDII 117 F+RG+DDGRG+D R Q + + H H + NF + + +G Sbjct: 219 FLRGFDDGRGLDNDRKFADEQQDSIKSHTHIGTVEESGA--HVHNFEYKGVGWPTGNIGR 276 Query: 118 KRGNTNDAGLPAPDYGTFKTYKQSVDGLGAAASETRPRNIAFNYIVRA 165 + + + +ETRP N Y +++ Sbjct: 277 LPNYYTYNTTLKGKTDSAGAHTHKITLSHTGEAETRPVNTTVIYAIKS 324 >UniRef50_Q7P172 Probable bacteriophge tail fiber protein n=1 Tax=Chromobacterium violaceum RepID=Q7P172_CHRVO Length = 591 Score = 120 bits (302), Expect = 1e-26, Method: Composition-based stats. 
Identities = 55/165 (33%), Positives = 73/165 (44%), Gaps = 16/165 (9%) Query: 12 GVPVPWPSATPPTGWLKCNGAAFSAEEYPELAKAYPT----------NKLPDLRGEFIRG 61 G + + PP GWLK NGAA S ++YP L A T LPDLRGEF+RG Sbjct: 430 GQVAFFAMSAPPLGWLKANGAAVSRKDYPSLFAALGTYYGAGDGSTTFNLPDLRGEFVRG 489 Query: 62 WDDGRGIDTGRSILSIQGYAT--EDHAHGLPSRSTIVTDATINFYFDEIWVNSGTDIIKR 119 WDDGRG+D GR + Q D + P +++V N +++ G D + + Sbjct: 490 WDDGRGVDNGRGFGTWQKGTLTFSDPSLTSPCVASLVHR---NDNTVIGYLDLGADPVDK 546 Query: 120 GNTNDAGLPAPDYGTFKTYKQSVDGLGAAASETRPRNIAFNYIVR 164 N D GL G TRPRNIA ++ Sbjct: 547 -NKYDLGLSVSTANGVYLPDLDSGGWANGYGSTRPRNIALLACIK 590 >UniRef50_D0KGE3 Tail Collar domain protein n=1 Tax=Pectobacterium wasabiae WPP163 RepID=D0KGE3_PECWW Length = 144 Score = 120 bits (302), Expect = 1e-26, Method: Composition-based stats. Identities = 48/153 (31%), Positives = 69/153 (45%), Gaps = 13/153 (8%) Query: 16 PWPSATPPTGWLKCNGAAFSAEEYPELAKAYPTNKLPDLRGEFIRGWDDGRGIDT--GRS 73 W + PP GWL+ NG F+ P LA YP++++PD RG F RGWD+G GID RS Sbjct: 1 MWGTPVPPEGWLELNGQLFNPSGNPVLADLYPSSRVPDFRGYFPRGWDNGAGIDPDSSRS 60 Query: 74 ILSIQGYATEDHAHGLPSRSTIVTDATINFYFDEIWVNSGTDIIKRGNTNDAGLPAPDYG 133 +LS Q H H + T++ +G G Sbjct: 61 VLSYQDDEIISHKHAI----------TMSHEHHGAADGAGFPQTDASGPMIKHAETEPDG 110 Query: 134 TFKTYKQSVDGLGA-AASETRPRNIAFNYIVRA 165 +F + + + + SETRP NIA +I++A Sbjct: 111 SFPERSGAGNPMFSFGGSETRPHNIAVMFIIKA 143 >UniRef50_A1VSH6 Phage Tail Collar domain protein n=1 Tax=Polaromonas naphthalenivorans CJ2 RepID=A1VSH6_POLNA Length = 483 Score = 120 bits (301), Expect = 2e-26, Method: Composition-based stats. 
Identities = 53/181 (29%), Positives = 70/181 (38%), Gaps = 27/181 (14%) Query: 11 VGVPVPWPSATPPTGWLKCNGAAFSAEEYPELAKAYPT----------NKLPDLRGEFIR 60 G +T P GWLK NGA S Y L A T LPDLRGEFIR Sbjct: 302 PGHINYTARSTAPPGWLKANGAGISRTAYAALFAAIGTTFGVGDGFNTFNLPDLRGEFIR 361 Query: 61 GWDDGRGIDTGRSILSIQGYATEDHAHGLPSRSTIVTDATIN------------FYFDEI 108 GWDDGRG+D RS+ S Q T H H + + + +N + Sbjct: 362 GWDDGRGVDGSRSLGSSQAGETASHGHTGSTSAAGIHAHGVNDPGHSHQVTQEGGRNTSL 421 Query: 109 WVNSGTDIIKRGNTN-----DAGLPAPDYGTFKTYKQSVDGLGAAASETRPRNIAFNYIV 163 +G + RG + + +V SETRPRN+A ++ Sbjct: 422 AYQNGPNSAFRGEVSTLLETTRNATGIGISENGNHSHTVTISATGGSETRPRNLALLAVI 481 Query: 164 R 164 + Sbjct: 482 K 482 >UniRef50_Q7Y2B3 Gp12 Short tail fibers n=2 Tax=unclassified T4-like viruses RepID=Q7Y2B3_9CAUD Length = 466 Score = 120 bits (300), Expect = 2e-26, Method: Composition-based stats. Identities = 46/165 (27%), Positives = 67/165 (40%), Gaps = 20/165 (12%) Query: 8 ALPVGVPVPWPSATPPTGWLKCNGAAFSAEEYPELAKAYP--------TNKLPDLRGEFI 59 A+P+G + +L CNG + + +YP+L A LPD+RG Sbjct: 312 AMPIGGIILSGFNADRGDFLICNGRSLNKNQYPQLFSAIGYTFGGSGDNFNLPDMRGLVA 371 Query: 60 RGWDDGRGIDTGRSILSIQGYATEDHAHGLPSRSTIVTDATINFYFDEIWVNSGTDIIKR 119 RG D GR +D GR S Q A + P V D +Y G + Sbjct: 372 RGCDHGRNLDPGRRFGSYQEDAMQRITGKFP-----VADRWRGWYGGAFTAQRG-----Q 421 Query: 120 GNTNDAGLPAPDYGTFKTYKQSVDGLGAAASETRPRNIAFNYIVR 164 +TN D+GT + A+ETR +++A NYI+R Sbjct: 422 WSTNYKNGGGDDWGTTVNF--DSGRSVRTANETRVKSLALNYIIR 464 >UniRef50_Q7N047 Similarities with bacteriophage protein n=3 Tax=Photorhabdus RepID=Q7N047_PHOLL Length = 602 Score = 119 bits (298), Expect = 3e-26, Method: Composition-based stats. 
Identities = 53/158 (33%), Positives = 68/158 (43%), Gaps = 15/158 (9%) Query: 8 ALPVGVPVPWPSATP-PTGWLKCNGAAFSAEEYPELAKAYPTNKLPDLRGEFIRGWDDGR 66 +P+G + W S P P G+ G AF A +YPELAK +P KLPD RG F RG D GR Sbjct: 457 GVPIGATIEWHSTAPIPAGYEPNEGRAFRAADYPELAKIFPDLKLPDDRGLFKRGLDRGR 516 Query: 67 GIDTGRSILSIQGYATEDHAHGLPSRSTIVTDATINFYFDEIWVNSGTDIIKRGNTNDAG 126 G+D+GRS+ S+QG A + L I S G Sbjct: 517 GLDSGRSLGSVQGDAIRNITGSLGK--------------PTIESGSNASGAFSYQYKAGG 562 Query: 127 LPAPDYGTFKTYKQSVDGLGAAASETRPRNIAFNYIVR 164 A G + + A+E RP N + YI R Sbjct: 563 RAAGAGGGVIAWTFDASRVVPTANENRPVNKSVIYITR 600 >UniRef50_A6VBH2 Putative tail fiber protein n=2 Tax=Pseudomonas aeruginosa PA7 RepID=A6VBH2_PSEA7 Length = 654 Score = 118 bits (295), Expect = 7e-26, Method: Composition-based stats. Identities = 47/195 (24%), Positives = 70/195 (35%), Gaps = 33/195 (16%) Query: 3 LGEGSALPVGVPVPWPSATPPTGWLKCNGAAFSAEEYPELAKAYPT----------NKLP 52 L + +P G V + +PP G+LK NGAA S Y L T LP Sbjct: 459 LNPQAIVPAGAVVAFAMYSPPAGYLKANGAAVSRTAYAALFATIGTYYGAGDGSTTFNLP 518 Query: 53 DLRGEFIRGWDDGRGIDTGRSILSIQGYATEDHAHGLPSRSTIVTDATINFYFDEIW--- 109 D RGEF+R DDGRG+D GR + ++Q H HG S T+ Sbjct: 519 DYRGEFLRALDDGRGLDLGRQLGTLQSSQNLAHTHGASSSGNGGHTHTVTGTAAAAGAHS 578 Query: 110 --------------------VNSGTDIIKRGNTNDAGLPAPDYGTFKTYKQSVDGLGAAA 149 V + ++ + T+ ++ + Sbjct: 579 HSIASVNATALVSGTRLATLVGNASNSTTDVAGDHTHAVTGVAALEGTHNHTIYVESSGG 638 Query: 150 SETRPRNIAFNYIVR 164 SE RPRN++ ++ Sbjct: 639 SEARPRNVSVLICIK 653 >UniRef50_A6E6G6 Phage Tail Collar n=1 Tax=Pedobacter sp. BAL39 RepID=A6E6G6_9SPHI Length = 731 Score = 117 bits (294), Expect = 1e-25, Method: Composition-based stats. 
Identities = 45/166 (27%), Positives = 64/166 (38%), Gaps = 19/166 (11%) Query: 9 LPVGVPVPWPSATPPTGWLKCNGAAFSAEEYPELAKAY-PTNKLPDLRGEFIRG------ 61 PVG V + P WL C+G YP+L + +LPDLRG F+ G Sbjct: 575 FPVGGIVAFYGEKVPDHWLLCDGKPVDHSLYPDLYRLLGGEKRLPDLRGRFLVGAGSKYS 634 Query: 62 WDDGRGIDTGRSILSIQGYATEDHAHGLPSRSTIVTDATINFYFDEIWVNSGTDIIKRGN 121 D G+D LS+ H H + + + + F E+ + + RG Sbjct: 635 LGDMGGVDE----LSLNVDQMPQHDHQIKAVKSYESP------FKEVNMGWAREESLRGG 684 Query: 122 T--NDAGLPAPDYGTFKTYKQSVDGLGAAASETRPRNIAFNYIVRA 165 D A Y ++ G A E RP +A NYI+RA Sbjct: 685 VYGTDRDNGADKYFVTRSNSPVKSEGGGKAHENRPPYLAVNYIIRA 730 >UniRef50_B3XIH6 GpH n=7 Tax=Enterobacteriaceae RepID=B3XIH6_ECOLX Length = 710 Score = 116 bits (290), Expect = 3e-25, Method: Composition-based stats. Identities = 59/227 (25%), Positives = 84/227 (37%), Gaps = 69/227 (30%) Query: 4 GEGSALPVGVPVPWPSATPPTGWLKCNGAAFSAEEYPELAKAYPTNKLPDLRGEFIRGWD 63 + PVG +PWPS + PTG+ G F YP LA AYP+ LPD+RG I+G Sbjct: 489 TPQDSFPVGAAIPWPSDSVPTGYAVMQGQTFDKTTYPLLAAAYPSGVLPDMRGWTIKGKP 548 Query: 64 DGRGIDTGRSILSIQGYATEDHAHGLPSRST-IVTDATINFYFDEIWVN---------SG 113 +GR +LS++ + H H + +T + T T +F + N SG Sbjct: 549 A-----SGRDVLSLEQDGIKSHTHSASASNTDLGTKTTSSFDYGTKSTNNTGAHTHNVSG 603 Query: 114 TDIIKRGNTNDAGLPAPDYG----------------------TFKTYKQSVDGLGAAA-- 149 T +T+ L P+ G + + SV G +A Sbjct: 604 TANSAGAHTHTVPLRRPNSGGMNFDWLDGASSGTVVGNGTVPSSGAHTHSVSGTATSAGA 663 Query: 150 ------------------------------SETRPRNIAFNYIVRAA 166 +E +NIAFNYIVR A Sbjct: 664 HAHTVGIGAHTHSVAIGSHGHTITVNAAGNAENTVKNIAFNYIVRLA 710 >UniRef50_P76072 Side tail fiber protein homolog from lambdoid prophage Rac n=23 Tax=root RepID=STFR_ECOLI Length = 1120 Score = 116 bits (290), Expect = 3e-25, Method: Composition-based stats. 
Identities = 52/228 (22%), Positives = 76/228 (33%), Gaps = 73/228 (32%) Query: 5 EGSALPVGVPVPWPSATPPTGWLKCNGAAFSAEEYPELAKAYPTNKLPDLRGEFIRGWDD 64 + PVG P+PWPS T P+G+ G AF YP+LA AYP+ +PD+RG I+G Sbjct: 900 PPESYPVGAPIPWPSDTVPSGYALMQGQAFDKSAYPKLAAAYPSGVIPDMRGWTIKGKPA 959 Query: 65 GRGIDTGRSILSIQGYATE--------------------------------DHAHGLPSR 92 +GR++LS + + H H + Sbjct: 960 -----SGRAVLSQEQDGIKSHTHSASASSTDLGTKTTSSFDYGTKSTNNTGAHTHSVSGS 1014 Query: 93 STIVTDATINFYFDEIWVNSGTDIIKRGNTNDAGLPAPDYGTFKTYKQSVDGLGAAAS-- 150 + + + S +T + + +Y T + G AAS Sbjct: 1015 TN--SAGAHTHSLANVNTASANSGAGSASTRLSVVHNQNYATSSAGAHTHSLSGTAASAG 1072 Query: 151 --------------------------------ETRPRNIAFNYIVRAA 166 E +NIAFNYIVR A Sbjct: 1073 AHAHTVGIGAHTHSVAIGSHGHTITVNAAGNAENTVKNIAFNYIVRLA 1120 >UniRef50_B3HKW0 Phage Tail Collar Domain protein n=11 Tax=Enterobacteriaceae RepID=B3HKW0_ECOLX Length = 164 Score = 115 bits (289), Expect = 4e-25, Method: Composition-based stats. Identities = 68/174 (39%), Positives = 81/174 (46%), Gaps = 32/174 (18%) Query: 1 VGLGEGSALPVGVPVPWPSATPPTG---------WLKCNGAAFSAEEYPELAKAYPTNKL 51 VGLGEG A +GVP WPSA P +LK NGA FSA +YP LAK +P+ L Sbjct: 13 VGLGEG-APAIGVPFFWPSAAMPNTVIDSWSGMVFLKFNGAKFSATDYPVLAKVFPSLVL 71 Query: 52 PDLRGEFIRGWDDGRGIDTGRSILSIQGYATEDHAHGLPSRSTIVTDATINFYFDEIWVN 111 P+ RG+FIR WDDGRG D GR +LS Q INF+ Sbjct: 72 PEARGDFIRIWDDGRGADGGRELLSWQEATNFS---QFAGNIGGGAGHAINFH------- 121 Query: 112 SGTDIIKRGNTNDAGLPAPDYGTFKTYKQSVDGLGAAASETRPRNIAFNYIVRA 165 + P + F SV G G RPRNIAFN++VRA Sbjct: 122 -----------DGIAGNQPGFSRFNFTSNSV-GDGVNFVAVRPRNIAFNFLVRA 163 >UniRef50_UPI0001A44C27 bacteriophage tail fiber protein n=2 Tax=Pectobacterium carotovorum subsp. carotovorum WPP14 RepID=UPI0001A44C27 Length = 195 Score = 114 bits (286), Expect = 9e-25, Method: Composition-based stats. 
Identities = 55/118 (46%), Positives = 70/118 (59%), Gaps = 6/118 (5%) Query: 1 VGLGEGSALPVGVPVPWPSATPPTGWLKCNGAAFSAEEYPELAKAYPTNKLPDLRGEFIR 60 +G +G+ L VG+P P P T P GWL C G +F YP LA YP +LPDLRGEFIR Sbjct: 79 IGAIQGNEL-VGIPQPCPLVTAPEGWLACAGQSFDTSRYPVLASRYPQGRLPDLRGEFIR 137 Query: 61 GWDDGRGIDTGRSILSIQGYATEDHAH-----GLPSRSTIVTDATINFYFDEIWVNSG 113 GWD+GRG+DTGR LS Q ++TE H H GL S + I T + ++ +G Sbjct: 138 GWDNGRGVDTGRGNLSSQSFSTEPHTHDGGTLGLGSGAPIYTGKGLQDGAATLYSQTG 195 >UniRef50_B3I2W7 Phage Tail Collar Domain family n=1 Tax=Escherichia coli E22 RepID=B3I2W7_ECOLX Length = 654 Score = 113 bits (283), Expect = 2e-24, Method: Composition-based stats. Identities = 53/217 (24%), Positives = 75/217 (34%), Gaps = 63/217 (29%) Query: 7 SALPVGVPVPWPSATPPTGWLKCNGAAFSAEEYPELAKAYPTNKLPDLRGEFIRGWDDGR 66 + PVG P+PWPS PTG+ G F YP LA AYP +PD+RG+ I+G + Sbjct: 444 DSYPVGAPIPWPSDVTPTGYALMQGQPFDKAVYPLLAIAYPAGIIPDMRGQTIKGKPN-- 501 Query: 67 GIDTGRSILSIQGYATEDHAHGLPSRST-IVTDATINFYFDEIWVNS------------- 112 GR++LS + H HG T + T T +F + S Sbjct: 502 ----GRAVLSYEQDGVISHTHGASISDTDLGTKYTSSFDYGSKPTTSFDYGNKSSTEGGW 557 Query: 113 --------GTDIIKRGNTNDAGLPAPD--------------------------------- 131 T + G+ + + Sbjct: 558 HAHNFRYCATSAYRDTPGQGLGMHSSNVSWAAGDRIEGSGNHAHVTWIGPHDHWVGIGAH 617 Query: 132 --YGTFKTYKQSVDGLGAAASETRPRNIAFNYIVRAA 166 Y + + A +E +NIAFNYIVR A Sbjct: 618 NHYVVMGYHGHTATVHAAGNAENTVKNIAFNYIVRLA 654 >UniRef50_A4TT73 Phage tail protein n=30 Tax=Enterobacteriaceae RepID=A4TT73_YERPP Length = 962 Score = 113 bits (282), Expect = 3e-24, Method: Composition-based stats. 
Identities = 42/147 (28%), Positives = 64/147 (43%), Gaps = 7/147 (4%) Query: 3 LGEGSALPVGVPVPWPSATPPTGWLKCNGAAFSAEEYPELAKAYPTNKLPDLRGEFIRGW 62 L + PVG P+PWP+ P+G+ G F YP+LA AYP+ LPD+RG I+G Sbjct: 715 LPPPESYPVGAPIPWPNDVAPSGFAIMQGQTFDKSVYPKLAAAYPSGVLPDMRGWMIKGK 774 Query: 63 DDGRGIDTGRSILSIQGYATEDHAHGLPSRST-IVTDATINFYFDEIWVNSGTDIIKRGN 121 T R++LS++ + HAH + ST + T T F + + K N Sbjct: 775 P------TSRAVLSLEQDGIKSHAHNAAASSTDLGTKPTTTFDYGTKTSSGFDYGTKSSN 828 Query: 122 TNDAGLPAPDYGTFKTYKQSVDGLGAA 148 + A + T + + Sbjct: 829 STGAHAHSLSGSTSSSGAHAHTVTAHT 855 Score = 38.5 bits (88), Expect = 0.074, Method: Composition-based stats. Identities = 14/97 (14%), Positives = 27/97 (27%), Gaps = 8/97 (8%) Query: 78 QGYATEDHAHGLPSRSTIVTDATINFYFDEIWVNSGTDIIKRGNTNDAGLPAPDYG---- 133 Q + + + + + + +T G A Sbjct: 866 QNAVGKQYNTQQTTANAFNVWTSSAGDHAHSISGTAVSAGAHAHTVGIGAHAHSLSIGSH 925 Query: 134 ----TFKTYKQSVDGLGAAASETRPRNIAFNYIVRAA 166 + ++ +E +NIA+NYIVR A Sbjct: 926 SHSVAIGAHSHTITIAACGNAENTVKNIAYNYIVRLA 962 >UniRef50_C2I7P2 Phage-related tail fiber protein n=1 Tax=Vibrio cholerae TM 11079-80 RepID=C2I7P2_VIBCH Length = 406 Score = 112 bits (281), Expect = 3e-24, Method: Composition-based stats. Identities = 42/154 (27%), Positives = 56/154 (36%), Gaps = 34/154 (22%) Query: 11 VGVPVPWPSATPPTGWLKCNGAAFSAEEYPELAKAYPTNKLPD------LRGEFIRGWDD 64 VG+P W + P + Y LA+ YP D +RGEF+R D Sbjct: 274 VGMPFYWLDTSAPEWAVMEINVNLPIAVYWRLARRYPQLVRDDYINTGEIRGEFLRVLDQ 333 Query: 65 GRGIDTGRSILSIQGYATEDHAHGLPSRSTIVTDATINFYFDEIWVNSGTDIIKRGNTND 124 GRG+D GRSI S Q E H H + +I + Sbjct: 334 GRGVDAGRSIQSYQDDELERHTHTFSAPFSITANTGSTGII------------------I 375 Query: 125 AGLPAPDYGTFKTYKQSVDGLGAAASETRPRNIA 158 + P++ T T +ETRPRNIA Sbjct: 376 SASHVPNWNTTYT----------GGNETRPRNIA 399 >UniRef50_Q99362 Protein 37 (Fragment) n=1 Tax=Enterobacteria phage T4 RepID=Q99362_BPT4 Length = 382 Score = 112 bits (280), Expect = 4e-24, Method: Composition-based stats. 
Identities = 43/142 (30%), Positives = 65/142 (45%), Gaps = 7/142 (4%) Query: 7 SALPVGVPVPWPSATPPTGWLKCNGAAFSAEEYPELAKAYPTNKLPDLRGEFIRGWDDGR 66 S+ P+G P+PWP+ TPP G+ G F YP+LA AYP+ +PD+RG+ I+G Sbjct: 130 SSYPIGAPIPWPTDTPPNGYALMEGQTFDTRAYPKLAAAYPSGTIPDMRGQTIKGKP--- 186 Query: 67 GIDTGRSILSIQGYATEDHAHGLPSRST-IVTDATINFYFDEIWVNSGTDIIKRGNTNDA 125 +GR++LS + + H HG + +T + T T +F + +S K NT Sbjct: 187 ---SGRAVLSTEADGVKSHTHGASASNTDLGTKTTSSFDYGTKTTSSFDYGTKTSNTTGN 243 Query: 126 GLPAPDYGTFKTYKQSVDGLGA 147 T G Sbjct: 244 HNHTVSGTTSSAGAHQHARSGP 265 Score = 40.1 bits (92), Expect = 0.030, Method: Composition-based stats. Identities = 29/137 (21%), Positives = 42/137 (30%), Gaps = 11/137 (8%) Query: 31 GAAFSAEEYPELAKAYPTNKLPDLRGEFIRGWDDGRGIDTGRSILSIQGYATEDHAHGLP 90 GA A P+L+ TN PD + + +G I S Sbjct: 256 GAHQHARSGPQLSNGISTNIFPDGYSD---VGTNYNSKFSGTVIGSSVPCIIGK------ 306 Query: 91 SRSTIVTDATINFYFDEIWVNSGTDIIKRGNTNDAGLPAPDYGTFKTYKQSVDGLGAAAS 150 S N + +T+ G+ A + + A + Sbjct: 307 -TSNDGAHTHTWSGTTSTTGNHAHTVGIGAHTHTVGIGAHTHTVAIGSHGHTITVNATGN 365 Query: 151 -ETRPRNIAFNYIVRAA 166 E +NIAFNYIVR A Sbjct: 366 TENTVKNIAFNYIVRLA 382 >UniRef50_Q116W7 Phage Tail Collar n=1 Tax=Trichodesmium erythraeum IMS101 RepID=Q116W7_TRIEI Length = 671 Score = 112 bits (280), Expect = 5e-24, Method: Composition-based stats. 
Identities = 45/164 (27%), Positives = 68/164 (41%), Gaps = 21/164 (12%) Query: 6 GSALPVGVPVPWPSATPPTGWLKCNGAAFSAEEYPELAKAYPT-NKLPDLRGEFIRGWDD 64 G +PVG VP+ T P GWL CNG ++ E+Y EL K LPDL+G FI G Sbjct: 523 GWVVPVGTIVPYAGLTAPEGWLLCNGQSYDWEQYSELYKVLDEIKVLPDLKGRFIIG--- 579 Query: 65 GRGIDTGRSILSIQGYATEDHAHGLPSRSTIVTDATINFYFD----EIWVNSGTDIIKRG 120 + GY+ +A G + T+ D + + + + Sbjct: 580 ---------VGDKDGYSYSLNAKGGEEKHTLTKDEMPSHDHSKGEYKFILKKDGKVTTSN 630 Query: 121 NTNDAGLPAPDYGTFKTYKQSVDGLGAAASETRPRNIAFNYIVR 164 N N++ L P+ G+ + + E RP A NYI++ Sbjct: 631 NVNNS-LREPNLGSCEALQVI---GNNKPFENRPPYYALNYIIK 670 >UniRef50_B8FJJ3 Tail Collar domain protein n=1 Tax=Desulfatibacillum alkenivorans AK-01 RepID=B8FJJ3_DESAA Length = 264 Score = 112 bits (279), Expect = 5e-24, Method: Composition-based stats. Identities = 45/165 (27%), Positives = 65/165 (39%), Gaps = 27/165 (16%) Query: 10 PVGVPVPWPSATPPTGWLKCNGAAFSAEEYPELAKAY----------PTNKLPDLRGEFI 59 P G V + A+ P+GWL+C+GAA S Y L T LPDLRG F+ Sbjct: 116 PTGSVVAFMGASAPSGWLECSGAAVSRTTYDNLFSVISTMYGVGDGSTTFNLPDLRGYFL 175 Query: 60 RGWDDGRGIDTGRSILSIQGYATEDHAHGLPSRSTIVTDATINFYFDEIWVNSGTDIIKR 119 RGW G G D + +G T G + T D + + G + Sbjct: 176 RGWSHGSGKDPDAGSRTDRGDGTCGDYVGTRQEDEFAS-HTHYDDEDLLTFDGGGPV--- 231 Query: 120 GNTNDAGLPAPDYGTFKTYKQSVDGLGAAASETRPRNIAFNYIVR 164 G+ + +V +ETRP+N+A YI++ Sbjct: 232 -------------GSNSSGMSAVLPGSVGGAETRPKNVAVMYIIK 263 >UniRef50_UPI00019136B5 bacteriophage tail fiber protein n=7 Tax=Salmonella enterica subsp. enterica serovar Typhi RepID=UPI00019136B5 Length = 137 Score = 111 bits (277), Expect = 1e-23, Method: Composition-based stats. 
Identities = 55/139 (39%), Positives = 71/139 (51%), Gaps = 19/139 (13%) Query: 39 YPELAKAYPTNKLPDLRGEFIRGWDDGRGIDTGRSILSIQGYATEDHAH----------- 87 YP LAKAYPTNKLPDLRGEFIRGWDDGRG+D GR++L +Q + E H H Sbjct: 2 YPNLAKAYPTNKLPDLRGEFIRGWDDGRGVDAGRALLRLQDDSFEAHRHESFFYAGISRN 61 Query: 88 -----GLPSRSTIVTDATINFYFDEIWVNSGTDIIKRGNTNDAGLPAPDYGTFKTYKQSV 142 LPS ++T ++ +++ +I +D K + Sbjct: 62 EIPLKNLPSSDEMLTLSSTTNALSPDGIDATNSLI---GNDDYNCLIEGNKNNKRTATGL 118 Query: 143 DGLGAAASETRPRNIAFNY 161 A+ETRPRNIAFNY Sbjct: 119 STSIVGATETRPRNIAFNY 137 >UniRef50_A4PE45 Tail fiber protein gpH n=3 Tax=root RepID=A4PE45_9CAUD Length = 554 Score = 110 bits (276), Expect = 1e-23, Method: Composition-based stats. Identities = 50/166 (30%), Positives = 67/166 (40%), Gaps = 24/166 (14%) Query: 11 VGVPVPWPSATPPTGWLKCNGAAFSAEEYPELAKAYPT----------NKLPDLRGEFIR 60 G+ + +T P+GWLK NGAA S Y L T LPDLRGEF+R Sbjct: 400 AGLIGYFARSTAPSGWLKANGAAVSRTTYAALYAEIGTTFGAGDGAATFNLPDLRGEFLR 459 Query: 61 GWDDGRGIDTGRSILSIQGYATEDHAHGLPSRSTIVTDATINFYFDEIWVNSGTDIIKRG 120 GWDDGRG+D+GR I + Q S S +V D I + Sbjct: 460 GWDDGRGVDSGRGIGTWQ------------SGSPVVHDDVGGIASFNITALGDGTNVAWS 507 Query: 121 NTNDAGLPAPDYGTFKTYKQSVDGLGAAASET--RPRNIAFNYIVR 164 N D + A + + + RPRN+AF ++ Sbjct: 508 NIADPWVGAFPLTMYDSSAATFVDANNKGFINMARPRNVAFLPCIK 553 >UniRef50_C5J9F2 Putative uncharacterized protein (Fragment) n=1 Tax=Erwinia phage phiAT1 RepID=C5J9F2_9VIRU Length = 240 Score = 110 bits (274), Expect = 2e-23, Method: Composition-based stats. 
Identities = 35/90 (38%), Positives = 50/90 (55%), Gaps = 3/90 (3%) Query: 5 EGSALPVGVPVPWPSATPPTGWLKCNGAAFSAEEYPELAKAYPTNKLPDLRGEFIRGWDD 64 E +P+G +PWP AT P GWL+C+G F+ + P+L N +PD RG F+RGW Sbjct: 149 EPRLVPIGAVIPWPGATVPDGWLECSGQVFNTGQNPKLYSVLGRNVVPDYRGLFLRGWAH 208 Query: 65 GRGI---DTGRSILSIQGYATEDHAHGLPS 91 G D GR++ S+QG A + P+ Sbjct: 209 GSDANDPDAGRALGSVQGDAIRNITGYFPA 238 >UniRef50_A9ITX5 Phage-related protein n=6 Tax=Bartonella RepID=A9ITX5_BART1 Length = 333 Score = 109 bits (272), Expect = 4e-23, Method: Composition-based stats. Identities = 42/179 (23%), Positives = 65/179 (36%), Gaps = 19/179 (10%) Query: 5 EGSALPVGVPVPWPSATPPTGWLKCNGAAFSAEEYPELAKAYPT----------NKLPDL 54 E S P G + P WL C+G A+ +Y +L + T +PD Sbjct: 154 ESSLYPTGFIGTFGMRDVPKDWLICDGKAYLRRDYRDLFETIGTVWGEGDSVTTFNVPDF 213 Query: 55 RGEFIRGWDDGRGIDTGRSILSIQGYATEDHAHGLPSRSTIVTDATINFYFDEIWVNSGT 114 RG F+RG D G +D R S+Q + H H + S + NF+ G Sbjct: 214 RGMFLRGVDGGSNLDPNRRFASVQTDLIQSHQHEGQTLSMPHFTSNENFWDGNTTEVLGY 273 Query: 115 DIIKRGNTNDAGLPAPDYGTFKTY---------KQSVDGLGAAASETRPRNIAFNYIVR 164 + G A + K++ Q V ETRP N++ + ++ Sbjct: 274 RLGLFGGGALANFMGIESENLKSHVATPYSFDENQEVILESTGEGETRPVNVSVLFAIK 332 >UniRef50_D0ZBI4 Putative tail fiber protein n=1 Tax=Edwardsiella tarda EIB202 RepID=D0ZBI4_EDWTE Length = 718 Score = 109 bits (272), Expect = 4e-23, Method: Composition-based stats. 
Identities = 48/169 (28%), Positives = 73/169 (43%), Gaps = 31/169 (18%) Query: 11 VGVPVPWPSATPP-TGWLKCN-------GAAFSAEEYPELAKAYPTNKLP-DLRGEFIRG 61 +G +PW P W C G +F E +P+L YP N+LP D+RG RG Sbjct: 566 IGSLIPWALERMPQEIWPNCGMHFIPYMGQSFDPELFPKLHDVYPDNRLPTDMRGYTARG 625 Query: 62 WDDGRGIDTGRSILSIQGYATEDHAHGLPSRSTIVTDATINFYFDEIWVNSGTDIIKRGN 121 WD+GRGID GR++LS Q A ++ +G+ + G Sbjct: 626 WDNGRGIDIGRALLSYQDDAIQNITGQF-----------------GWMPFNGSSPVASGA 668 Query: 122 TNDAGLPAPDYGTFKTYKQSVDG-----LGAAASETRPRNIAFNYIVRA 165 + + A +G + G + A +TR +++A+NYI RA Sbjct: 669 FSVDKIGANVWGGGTERRDCAIGFNASNVVRTAEQTRVKSVAWNYITRA 717 >UniRef50_D0IJ09 Tail fiber protein H putative n=1 Tax=Vibrio sp. RC586 RepID=D0IJ09_9VIBR Length = 368 Score = 107 bits (266), Expect = 2e-22, Method: Composition-based stats. Identities = 50/170 (29%), Positives = 63/170 (37%), Gaps = 20/170 (11%) Query: 9 LPVGVPVPWPSATPPTGWLKCNGAAFSAEEYPELAKAYPTNKLPDLRGEFIRGWDDGRGI 68 PVGVP+PWPS P G+ G AF PELAK YP L DLRG + G +G Sbjct: 203 CPVGVPLPWPSDIAPEGFAIHKGQAFDKVANPELAKLYPDGILKDLRGMAVVGKKEGE-- 260 Query: 69 DTGRSILSIQGYATEDH-----AHGLPSRSTIVTDATINFYFDEIWVNSGTDIIKRGNTN 123 ILS + + H + T+ T N S +T Sbjct: 261 ----IILSYEADQVKQHGYPNSTVSSTDLGSRNTNTTGNHAHGYPAGTSNGPNGPYLDTA 316 Query: 124 DAGLPAPDYGTFKTYKQSVDGLGA---------AASETRPRNIAFNYIVR 164 A T + SV A+E +NI FN+IVR Sbjct: 317 HASYGYRYTTTEGNHYHSVAIGSHAHSIAIALFGATENTIKNIKFNWIVR 366 >UniRef50_Q7P176 Probable bacteriophge tail fiber protein n=1 Tax=Chromobacterium violaceum RepID=Q7P176_CHRVO Length = 435 Score = 106 bits (265), Expect = 2e-22, Method: Composition-based stats. 
Identities = 46/170 (27%), Positives = 64/170 (37%), Gaps = 26/170 (15%) Query: 6 GSALPVGVPVPWPSATPPTGWLKCNGAAFSAEEYPELAKAYP----------TNKLPDLR 55 +A P G+ + P GWL +G + ++YP L A T LP+L Sbjct: 280 NAAAPAGMVAYFAMKDAPAGWLIADGRTVARKDYPALFAAIGGLYGNGDGSTTFGLPNLC 339 Query: 56 GEFIRGWDDGRGIDTGRSILSIQ-GYATEDHAHGLPSRSTIVTDATINFYFDEIWVNSGT 114 GEFIRGWD+GRG+DTGR+I S Q GL + I + + Sbjct: 340 GEFIRGWDNGRGVDTGRAIGSSQISTQLLVDNDGLQTVGAIDWSSNNLSALGYEPAQANA 399 Query: 115 DIIKRGNTNDAGLPAPDYGTFKTYKQSVDGLGAAASETRPRNIAFNYIVR 164 + N+ PA RPRNIA ++ Sbjct: 400 ANLHFINSTTISNPADSSFIRSI---------------RPRNIALLACIK 434 >UniRef50_Q727X4 Tail fiber protein, putative n=4 Tax=Desulfovibrio vulgaris RepID=Q727X4_DESVH Length = 296 Score = 106 bits (264), Expect = 3e-22, Method: Composition-based stats. Identities = 43/145 (29%), Positives = 59/145 (40%), Gaps = 22/145 (15%) Query: 19 SATPPTGWLKCNGAAFSAEEYPELAKAYPTNKLPDLRGEFIRGWDDGRGIDTGRSILSIQ 78 +AT P W +CN + + +L D RGEF RGWD GRG+D GR + S Q Sbjct: 172 NATAPA-WYRCNASGVRDATGDHI-------RLQDRRGEFARGWDHGRGVDAGRVLGSAQ 223 Query: 79 GYATEDHAHGLPSRSTIVTDATINFYFDEIWVNSGTDIIKRGNTNDAGLPAPDYGTFKTY 138 G A + + S + +V SG + + AG G + Sbjct: 224 GDAIRNIVGSMGSITAVVAGTA-----------SGAFTVTTPSNRSAGSST---GPTCDF 269 Query: 139 KQSVDGLGAAASETRPRNIAFNYIV 163 + ASE R RNIA Y+V Sbjct: 270 TFDASRVVPTASENRTRNIATLYLV 294 >UniRef50_Q0BEK5 Phage Tail Collar domain protein n=1 Tax=Burkholderia ambifaria AMMD RepID=Q0BEK5_BURCM Length = 735 Score = 105 bits (263), Expect = 4e-22, Method: Composition-based stats. 
Identities = 58/218 (26%), Positives = 76/218 (34%), Gaps = 61/218 (27%) Query: 9 LPVGVPVPWPSATPPTGWLKCNGAAFSAEEYPEL-------------------------- 42 + +G V P G+LK NG +YP L Sbjct: 517 VQIGQIVWEARTAPRAGFLKLNGTELKRADYPLLWAYAQGSGALVADADWGKGRHGCFSS 576 Query: 43 AKAYPTNKLPDLRGEFIRGWDDGRGIDTGRSILSIQGY------------ATEDHAHGLP 90 T +LPDLRGEFIR WDD RG D R I S Q A DH+HG Sbjct: 577 GDGNTTFRLPDLRGEFIRCWDDARGTDAQRQIGSWQDSLNRLHAHGASAAAVGDHSHGAW 636 Query: 91 SRSTIVTDATIN--------------FYFDEIWVNSGTDIIKRG---------NTNDAGL 127 + S +IN Y EI +N G KR N + A Sbjct: 637 TDSQGWHGHSINDPGHDHGIPVASGGGYIGEINLNGGGRGDKRTTGSGTGISINGDGAHG 696 Query: 128 PAPDYGTFKTYKQSVDGLGAAASETRPRNIAFNYIVRA 165 G + ++ +E+RPRN+A ++RA Sbjct: 697 HNVGIGGAGAHSHTISIGADGGNESRPRNVALLVMIRA 734 >UniRef50_Q3ZL14 Tail fiber protein n=2 Tax=Enterobacteriaceae RepID=Q3ZL14_ESCBL Length = 289 Score = 105 bits (263), Expect = 4e-22, Method: Composition-based stats. Identities = 49/212 (23%), Positives = 72/212 (33%), Gaps = 59/212 (27%) Query: 9 LPVGVPVPWPSATPPTGWLKCNGAAFSAEEYPELAKAYPTNKLPDLRGEFIRGWDDGRGI 68 LP G+ + WP AT PTG+ G F YP LA+AYP+ +PD+RG+ I+ Sbjct: 83 LPPGIALAWPGATAPTGFALMLGQTFDTTAYPRLAQAYPSGVIPDMRGQTIKFLPA---- 138 Query: 69 DTGRSILSIQGYATEDHAHGLPSRST----------------IVTDATINFYFDEIWVNS 112 +GR++LS++ + H+H +T D N D + Sbjct: 139 -SGRTLLSLEADGVKSHSHSGSISTTDLGTATAADTDLGTKQTSQDGLHNHVSDSRFNKL 197 Query: 113 GTDIIKRGNTNDAGLPAPDYG--------------------------------------T 134 TN+ G D Sbjct: 198 MARSSDIDGTNNTGDVDSDNPESEHRVSGMNDSLWAASVIADSGLHMHTVYIGPHAHSVY 257 Query: 135 FKTYKQSVDGLGAAASETRPRNIAFNYIVRAA 166 + +V +E +NIAFN IVR A Sbjct: 258 IGPHGHTVTISNFGNTENTVKNIAFNAIVRLA 289 >UniRef50_B8HZW5 Tail Collar domain protein n=2 Tax=Clostridium RepID=B8HZW5_CLOCE Length = 368 Score = 105 bits (262), Expect = 6e-22, Method: Composition-based stats. 
Identities = 41/180 (22%), Positives = 65/180 (36%), Gaps = 24/180 (13%) Query: 9 LPVGVPVPWPSATP-----PTGWLKCNGAAFSAEEYPELAKAYPT---------NKLPDL 54 PVG+ +P+ +GW+ C+G +Y EL T +PDL Sbjct: 5 FPVGMVIPFAGPLKEDQLKSSGWVPCDGRVLDKTQYSELFDVIGTKYGGDGIPNFNIPDL 64 Query: 55 RGEFIRGWDDGRGIDTG--RSILSIQGYATEDHAHGLPSRSTIVTD--------ATINFY 104 RG F+R D GRG D R S G A D+ + +T N Sbjct: 65 RGRFVRATDHGRGYDPDAQRRKASKSGGAAGDNTGSVQEYATAKPKNNFITNDKGNHNHL 124 Query: 105 FDEIWVNSGTDIIKRGNTNDAGLPAPDYGTFKTYKQSVDGLGAAASETRPRNIAFNYIVR 164 D + + + A P + + + S + SE+RP N+ +I++ Sbjct: 125 VDHLPTDYWNAACAITSNEGANFPGRTATSGEAGQHSHTIVSGGDSESRPVNLYMYWIIK 184 Score = 79.8 bits (195), Expect = 3e-14, Method: Composition-based stats. Identities = 43/179 (24%), Positives = 72/179 (40%), Gaps = 21/179 (11%) Query: 5 EGSALPVGVPVPWPSATPP-------TGWLKCNGAAFSAEEYPELAKAYPT--------N 49 E LP G V + + GWL C G+++ A +YP+L + Sbjct: 192 ESILLPAGSIVSFAGDSVKKSNELIANGWLPCIGSSYEANKYPDLYENISNIYGGDQNKF 251 Query: 50 KLPDLRGEFIRGWDDGRGIDTGRSILSIQGYATEDHAHGLPS--RSTIVTDATINFYFDE 107 +PDLRG FIRG + G + + TED++ LP T+ TD + Sbjct: 252 NVPDLRGLFIRGVNSNTSETPG-VHGATRVGQTEDYSTALPKTLNFTLSTDGAHTHSAPK 310 Query: 108 IWVNSGTDIIKRGNTNDAGLPAPDYGTFKTYKQSVDGLGAAASETRPRNIAFNYIVRAA 166 + + + G+ G + ++ G +ETRP NI +YI++++ Sbjct: 311 LPQDKYIENYCAGHEVANFPSNQYTGNNGNHAHTIAG---GDAETRPVNIYLDYIIKSS 366 >UniRef50_Q31Q92 Putative uncharacterized protein n=2 Tax=Synechococcus elongatus RepID=Q31Q92_SYNE7 Length = 387 Score = 104 bits (260), Expect = 8e-22, Method: Composition-based stats. 
Identities = 52/183 (28%), Positives = 71/183 (38%), Gaps = 57/183 (31%) Query: 8 ALPVGVPVPWPSATPPTGWLKCNGAAFSAEEYPEL------------------------- 42 A+P GV + TPPTG++K NGA S Y L Sbjct: 235 AVPAGVAIWVTGNTPPTGYIKANGALLSRTTYARLWAYAQASGNIVSDAAWTGGATGSYS 294 Query: 43 -AKAYPTNKLPDLRGEFIRGWDDGRGIDTGRSILSIQGYATEDHAHGLPSRSTIVTDATI 101 T ++PDLRGEFIRGW DGR +DTGR+I S Q + HAH L +R+ T Sbjct: 295 TGDGSTTFRVPDLRGEFIRGWADGRSVDTGRAIGSTQADELKAHAHYLDTRTAPTGGGTA 354 Query: 102 NFYFDEIWVNSGTDIIKRGNTNDAGLPAPDYGTFKTYKQSVDGLGAAASETRPRNIAFNY 161 + + + + +ETRPRNIA+ Sbjct: 355 ATTYTTGTAVTTSSV-------------------------------GGTETRPRNIAYLA 383 Query: 162 IVR 164 ++ Sbjct: 384 CIK 386 >UniRef50_B5PP06 Side tail fiber protein n=2 Tax=Salmonella enterica subsp. enterica RepID=B5PP06_SALHA Length = 534 Score = 104 bits (258), Expect = 2e-21, Method: Composition-based stats. Identities = 39/145 (26%), Positives = 66/145 (45%), Gaps = 7/145 (4%) Query: 5 EGSALPVGVPVPWPSATPPTGWLKCNGAAFSAEEYPELAKAYPTNKLPDLRGEFIRGWDD 64 + PVG P+ WPS P G+ G +F YP LA AYP+ +PD+RG I+G Sbjct: 238 PPDSHPVGAPIAWPSDATPAGYALMQGQSFDKSAYPLLAIAYPSGVIPDMRGWTIKGKPA 297 Query: 65 GRGIDTGRSILSIQGYATEDHAHGLPSRST-IVTDATINFYFDEIWVN-SGTDIIKRGNT 122 +GR+ILS + + H+H ++ T + T T +F + N +G + G Sbjct: 298 -----SGRAILSQEMDGNKSHSHSARAQDTDLGTKTTSSFDYGTKSTNTTGNHTNQFGGY 352 Query: 123 NDAGLPAPDYGTFKTYKQSVDGLGA 147 ++ ++ +F+ + Sbjct: 353 INSYWGDSNHTSFQPGGGAWTQAAG 377 >UniRef50_Q56BI6 Gp12 short tail fibers n=1 Tax=Enterobacteria phage RB43 RepID=Q56BI6_9CAUD Length = 463 Score = 101 bits (251), Expect = 1e-20, Method: Composition-based stats. 
Identities = 43/166 (25%), Positives = 63/166 (37%), Gaps = 24/166 (14%) Query: 7 SALPVGVPVPWPSATPPTGWLKCNGAAFSAEEYPELAKAYPT--------NKLPDLRGEF 58 ++LP+G + + NG EYPEL LPD+RG Sbjct: 312 ASLPIGCMMMAAFNSDYGNLCIANGRGMYTYEYPELFALIGYTYGGSGNIFNLPDMRGVV 371 Query: 59 IRGWDDGRGIDTGRSILSIQGYATEDHAHGLPSRSTIVTDATINFYFDEIWVNSGTDIIK 118 RG+D GRG+D GR + Q + + H H L I + ++ Sbjct: 372 ARGFDAGRGLDPGRGFGTYQHHEVQSHEHPLQ---------MIYQSGGNLPSWQCVYELR 422 Query: 119 RGNTNDAGLPAPDYGTFKTYKQSVDGLGAAASETRPRNIAFNYIVR 164 ND L PD K + +ETR +N+A NY++R Sbjct: 423 TAEKNDQQLYWPDPSLSKA-------MAVGGNETRMKNLAINYVIR 461 >UniRef50_B7L485 Putative tail fiber protein n=4 Tax=Escherichia coli RepID=B7L485_ECO55 Length = 1056 Score = 99.8 bits (247), Expect = 3e-20, Method: Composition-based stats. Identities = 43/132 (32%), Positives = 65/132 (49%), Gaps = 8/132 (6%) Query: 5 EGSALPVGVPVPWPSATPPTGWLKCNGAAFSAEEYPELAKAYPTNKLPDLRGEFIRGWDD 64 + PVG P+PWPS T P+G+ G F+ YP+LA AYP+ +PD+RG I+G Sbjct: 819 PPESYPVGAPIPWPSDTVPSGYALMQGQTFNKSAYPKLAAAYPSGVIPDMRGWTIKGKPA 878 Query: 65 GRGIDTGRSILSIQGYATEDHAHGLPSRST-IVTDATINFYFDEIWVNS--GTDIIKRGN 121 +GR++LS + + H H + ST + T T +F + N+ G+ Sbjct: 879 -----SGRAVLSQEQDGIKSHTHSASASSTDLGTKTTSSFDYGTKSTNNTGAHTHSLSGS 933 Query: 122 TNDAGLPAPDYG 133 T AG+ G Sbjct: 934 TGSAGVHTHGNG 945 >UniRef50_UPI00016A4B89 phage-related tail fiber protein n=2 Tax=Burkholderia thailandensis RepID=UPI00016A4B89 Length = 654 Score = 99.4 bits (246), Expect = 4e-20, Method: Composition-based stats. 
Identities = 57/243 (23%), Positives = 75/243 (30%), Gaps = 80/243 (32%) Query: 3 LGEGSALPVGVPVPWPSATPPTGWLKCNGAAFSAEEYPEL-------------------- 42 GE ++ VG V G+LKCNGA +YP L Sbjct: 411 AGELASAMVGQIVFEMRTAARAGYLKCNGALVKRADYPALWAYAQGSGALVAEKDWMSGN 470 Query: 43 ------AKAYPTNKLPDLRGEFIRGWDDGRGIDTGRSILSIQGYATEDHAHGL------- 89 T ++P+LRGEF+R WDDGRG D R I + Q H H Sbjct: 471 FGCFSDGDGSATFRIPELRGEFLRCWDDGRGSDADRKIGTWQDSMNRTHGHAAGADGVGD 530 Query: 90 -------------------------------------PSRSTIVTDATINFYFDEIWVNS 112 P + +T + E V Sbjct: 531 HGHNAWTDNQGWHGHHGWTGTNGNHNHNNDIFSRLLRPPYNGSLTGSDTAGSGSEQAVGG 590 Query: 113 GTDIIKRG--------NTNDAGLPAPDYGTFKTYKQSVDGLGA--AASETRPRNIAFNYI 162 G R NT AG A + G + S A +E RPRN+A + Sbjct: 591 GDSADIRWAGDHNHEFNTEGAGTHAHNVGVAASGAHSHAIHVAADGGNEARPRNLAVLAM 650 Query: 163 VRA 165 +RA Sbjct: 651 IRA 653 >UniRef50_Q3YZL1 Phage protein-related n=42 Tax=Enterobacteriaceae RepID=Q3YZL1_SHISS Length = 1029 Score = 99.4 bits (246), Expect = 4e-20, Method: Composition-based stats. Identities = 46/147 (31%), Positives = 67/147 (45%), Gaps = 8/147 (5%) Query: 9 LPVGVPVPWPSATPPTGWLKCNGAAFSAEEYPELAKAYPTNKLPDLRGEFIRGWDDGRGI 68 PVG P+PWPS T P+G+ G F YP+LA AYP+ +PD+RG I+G Sbjct: 797 YPVGAPIPWPSDTVPSGYALMQGQTFDKSAYPKLAAAYPSGVIPDMRGWTIKGKPA---- 852 Query: 69 DTGRSILSIQGYATEDHAHGLPSRST-IVTDATINFYFDEIWVNS--GTDIIKRGNTNDA 125 +GR++LS + + H H + ST + T T +F + N+ G+T+ A Sbjct: 853 -SGRAVLSQEQDGIKSHTHSASASSTDLGTKTTSSFDYGTKSTNNTGAHTHSLSGSTSSA 911 Query: 126 GLPAPDYGTFKTYKQSVDGLGAAASET 152 G +T S A T Sbjct: 912 GAHQHSQTGPRTNSGSQPTGMFPAGST 938 Score = 43.9 bits (102), Expect = 0.002, Method: Composition-based stats. 
Identities = 22/95 (23%), Positives = 38/95 (40%), Gaps = 4/95 (4%) Query: 75 LSIQGYATED--HAHGLPSRSTIVTDATINFYFDEIWVNSGTDIIKRGNTNDAGLPAPDY 132 S Q T + L S ++ + + + SGT + + G+ A + Sbjct: 936 GSTQVSGTNQVGISGSLTSGTSQWVGKSSS-EGNHTHSLSGTAASAGAHAHTVGIGAHTH 994 Query: 133 G-TFKTYKQSVDGLGAAASETRPRNIAFNYIVRAA 166 ++ ++ A +E +NIAFNYIVR A Sbjct: 995 SVAIGSHGHTITVNAAGNAENTVKNIAFNYIVRLA 1029 >UniRef50_B5JYG6 Phage Tail Collar Domain family (Fragment) n=1 Tax=gamma proteobacterium HTCC5015 RepID=B5JYG6_9GAMM Length = 400 Score = 99.4 bits (246), Expect = 4e-20, Method: Composition-based stats. Identities = 44/194 (22%), Positives = 63/194 (32%), Gaps = 44/194 (22%) Query: 10 PVGVPVPWPSATPPTGWLKCNGAAFSAEEYPELAKAYP----------TNKLPDLRGEFI 59 P G + TPP GWL C+G+ S +YP L A T LPDLR +F Sbjct: 212 PAGRTEDFAGTTPPGGWLFCDGSEVSRTQYPALFTAIGTLWGDGDGSTTFNLPDLRNDFR 271 Query: 60 RGWDDGRGIDTGRSILSIQGYATEDHAHGLPSRSTIVTDA------TINFYFDEIWVNSG 113 RG D RS+ + + H+H S + + W S Sbjct: 272 RGCSDT------RSVGDSESDQIKSHSHSASSEDSGAHTHGGRSSDSGAHKHRSGWGESN 325 Query: 114 TDIIKRGNTNDAGLPAPDYGTFK----------------------TYKQSVDGLGAAASE 151 G T+ +G + ++ ++ E Sbjct: 326 RSDAPFGATSGSGHRGSGDSDWDNYLYYTDTAQPHFHWLIINQAGSHSHPINIEPTGGDE 385 Query: 152 TRPRNIAFNYIVRA 165 TRPRN I+RA Sbjct: 386 TRPRNKVLMPIIRA 399 >UniRef50_C4MYW8 Gp12 Short tail fibers n=1 Tax=Enterobacteria phage JSE RepID=C4MYW8_9CAUD Length = 467 Score = 98.2 bits (243), Expect = 9e-20, Method: Composition-based stats. 
Identities = 36/166 (21%), Positives = 60/166 (36%), Gaps = 22/166 (13%) Query: 9 LPVGVPVPWPSAT-PPTGWLKCNGAAFSAEEYPELAKAYP---------TNKLPDLRGEF 58 +P+G + + + CNG + +YP L LPD+RG Sbjct: 312 MPIGGIILTAFNSFDHAQFKICNGQWLNKHQYPVLFSRIGFTYGGDGGDNFALPDMRGLV 371 Query: 59 IRGWDDGRGIDTGRSILSIQGYATEDHAHGLPSRSTIVTDATINFYFDEIWVNSGTDIIK 118 RG D GRG+D GR + Q + P + F Sbjct: 372 ARGCDHGRGLDPGRGFGTYQDDTMQHMTGNFPVANRWRGWTGGVFAITGG---------- 421 Query: 119 RGNTNDAGLPAPDYGTFKTYKQSVDGLGAAASETRPRNIAFNYIVR 164 + +TN D+G+ + + + ETR +++A NY++R Sbjct: 422 QWSTNYKNGGGDDWGSIVNFDSA--RQVRTSGETRVKSLALNYMIR 465 >UniRef50_A7INV5 Tail Collar domain protein n=1 Tax=Xanthobacter autotrophicus Py2 RepID=A7INV5_XANP2 Length = 492 Score = 97.9 bits (242), Expect = 1e-19, Method: Composition-based stats. Identities = 42/163 (25%), Positives = 60/163 (36%), Gaps = 39/163 (23%) Query: 11 VGVPVPWPSATPPTGWLKCNGAAFSAEEYPELAKAYPT----------NKLPDLRGEFIR 60 G WP++TPP+G L NGA S Y L T +P+ G F+R Sbjct: 357 PGTIAMWPASTPPSGALVRNGATLSRTVYASLFAVIGTTFGAGDGATTFGVPNDLGIFVR 416 Query: 61 GWDDGRGIDTGRSILSIQGYATEDHAHGLPSRSTIVTDATINFYFDEIWVNSGTDIIKRG 120 GWD+GRG DTGR S Q + H H + S + T F + Sbjct: 417 GWDNGRGYDTGRVFGSEQADDNKSHDHARQTVSGVFTAGGAGFALQD------------- 463 Query: 121 NTNDAGLPAPDYGTFKTYKQSVDGLGAAASETRPRNIAFNYIV 163 + + + +E RP+N A+ I+ Sbjct: 464 ----------------SGSTTQRVASSGGAEARPKNRAYLPII 490 >UniRef50_B9BDD9 Bacteriophage protein n=3 Tax=Burkholderia RepID=B9BDD9_9BURK Length = 536 Score = 97.9 bits (242), Expect = 1e-19, Method: Composition-based stats. 
Identities = 55/235 (23%), Positives = 78/235 (33%), Gaps = 80/235 (34%) Query: 11 VGVPVPWPSATPPTGWLKCNGAAFSAEEYPEL--------------------------AK 44 +G V P + G+LK NGA + +YP L Sbjct: 301 IGTIVFEPRTSVRAGFLKLNGALVNRSDYPALWAYAQASGALVAESAWGQNNWGCFSTGD 360 Query: 45 AYPTNKLPDLRGEFIRGWDDGRGIDTGRSILSIQGYATEDHAHGLPS------------- 91 T +LP+LRGEF+R WDDGRG D+ R I + Q + HAHG S Sbjct: 361 GATTFRLPELRGEFLRCWDDGRGADSARGIGTFQSFQNAWHAHGASSAAVGDHTHGAWTD 420 Query: 92 ---------------------------------RSTIVTDATINFYFDEIWVNSGTDIIK 118 S +D + + DI Sbjct: 421 AQGWHGHHGWTGGGGGHNHNNGIFSRLLRPPYGGSLTGSDQAGSGSEQAVGAGDSADIAW 480 Query: 119 RG------NTNDAGLPAPD--YGTFKTYKQSVDGLGAAASETRPRNIAFNYIVRA 165 G NT +G + + G + ++ G +E RPRNIA ++RA Sbjct: 481 SGDHAHEFNTEGSGTHSHNVGIGGAGAHAHAITVNGDGGNEARPRNIAMLAMIRA 535 >UniRef50_C5B185 Putative uncharacterized protein n=1 Tax=Methylobacterium extorquens AM1 RepID=C5B185_METEA Length = 449 Score = 96.3 bits (238), Expect = 3e-19, Method: Composition-based stats. Identities = 42/169 (24%), Positives = 62/169 (36%), Gaps = 34/169 (20%) Query: 7 SALPVGVPVPWPSATPPTGWLKCNGAAFSAEEYPELAKAYPT----------NKLPDLRG 56 S P G+ + + P GW+ G A ++ L T +PDLRG Sbjct: 305 SKSPPGMISAYAGQSCPVGWVDATGLALLRSDFSALFAVIGTRWGAGDGSTTFNVPDLRG 364 Query: 57 EFIRGWDDGRGIDTGRSILSIQGYATEDHAHGLPSRSTIVTDATINFYFDEIWVNSGTDI 116 F+R D G G D GR + S Q + H H +P + T N + + +GT Sbjct: 365 YFLRMQDAGAGRDPGRDLGSAQAGSVGPHQHNVPVANATAGSGTTNNFV--YPLAAGTSS 422 Query: 117 IKRGNTNDAGLPAPDYGTFKTYKQSVDGLGAAASETRPRNIAFNYIVRA 165 + + A A ETRP NIA Y +++ Sbjct: 423 VPTTGQDPAP----------------------AGETRPINIAVWYCIKS 449 >UniRef50_C5ABB4 Putative uncharacterized protein n=1 Tax=Burkholderia glumae BGR1 RepID=C5ABB4_BURGB Length = 670 Score = 95.9 bits (237), Expect = 4e-19, Method: Composition-based stats. 
Identities = 51/236 (21%), Positives = 77/236 (32%), Gaps = 81/236 (34%) Query: 11 VGVPVPWPSATPPTGWLKCNGAAFSAEEYPEL--------------------------AK 44 +G V +P G++KC+G+ + +YP L A Sbjct: 434 IGQIVMEARTSPRAGYVKCDGSQYKRADYPALWAYAQASGALVSEAEYTDGRWGGFSTAD 493 Query: 45 AYPTNKLPDLRGEFIRGWDDGRG-IDTGRSILSIQGYATEDHAHGLPS------------ 91 ++PDLRGEF+R W DGRG +D GR+I S QG + HAHG S Sbjct: 494 GQTYFRVPDLRGEFLRCWSDGRGDVDPGRAIGSFQGGQNQAHAHGASSDPDGAHVHDAWT 553 Query: 92 --------RSTIVTDATINFYFD-------EIWVNSGTDIIKRGNTNDAGLPAPDY---- 132 N ++ S T G+ N+ + D Sbjct: 554 GGAGWHSHHGVTGGGGMHNHANGVFSRLLRPPYLGSLTGSDTDGSGNEQAVGGGDSADIA 613 Query: 133 -----------------------GTFKTYKQSVDGLGAAASETRPRNIAFNYIVRA 165 G + ++ +E RPRN+A ++RA Sbjct: 614 WAGEHQHEFWTDGAGDHVHAVGIGNAGGHAHAIHVQADGGAEARPRNVALLAMIRA 669 >UniRef50_C5A8Q3 Phage-related tail fiber protein n=1 Tax=Burkholderia glumae BGR1 RepID=C5A8Q3_BURGB Length = 865 Score = 95.2 bits (235), Expect = 7e-19, Method: Composition-based stats. Identities = 48/228 (21%), Positives = 75/228 (32%), Gaps = 65/228 (28%) Query: 3 LGEGSALPVGVPVPWPSATPPTGWLKCNGAAFSAEEYPEL-------------------- 42 L S+ +G V P T G+LK NG+ +YP L Sbjct: 637 LSALSSSSIGQIVFEPRTTTRAGFLKANGSLLERADYPALWAYAQASGALISDAAWWAGQ 696 Query: 43 ------AKAYPTNKLPDLRGEFIRGWDDGRGIDTGRSILSIQGYATEDHAHGLPS----- 91 ++P+LRGEF+R DDGRG+DT R+ S+Q H+H S Sbjct: 697 SGCFSTGTTGTNFRIPELRGEFLRCLDDGRGLDTSRAAGSLQLSQNAKHSHDASSTVGGS 756 Query: 92 ---RSTIVTDATINFYFDEIW--VNSGTDIIKRGNTNDAGLPAPDYGTFKT--------- 137 + + N D+ ++ ++ + G P G Sbjct: 757 HTHGAFTTGAGSHNHAIDQQPHAHDTWLGSVQVSGVDRGGGFGPYNGRVGEAWSDPANAN 816 Query: 138 --------------------YKQSVDGLGAAASETRPRNIAFNYIVRA 165 + ++ + E RPRNIA ++RA Sbjct: 817 IAILPTGDHVHGAGTYPAGDHNHAIAVQPSGGDEARPRNIALLAMIRA 864 >UniRef50_B3X2T1 Side tail fiber protein n=1 Tax=Shigella dysenteriae 1012 RepID=B3X2T1_SHIDY Length = 488 Score = 94.8 bits (234), Expect = 9e-19, Method: Composition-based stats. 
Identities = 38/142 (26%), Positives = 62/142 (43%), Gaps = 6/142 (4%) Query: 9 LPVGVPVPWPSATPPTGWLKCNGAAFSAEEYPELAKAYPTNKLPDLRGEFIRGWDDGRGI 68 P G P+PWPS T P+G+ G F YP+LA AYP+ +PD+RG I+G Sbjct: 259 YPPGAPIPWPSDTVPSGYALMQGQTFDKSAYPKLAVAYPSGVIPDMRGWTIKGKPA---- 314 Query: 69 DTGRSILSIQGYATEDHAHGLPSRST-IVTDATINFYFDEIWVNSGTDIIKRGNTNDAGL 127 +GR++LS + + H H + ST + T T +F + N+ + Sbjct: 315 -SGRAVLSQEQDGIKSHTHSASASSTDLGTKTTSSFDYGTKSTNNTGAHTHSISGTANSA 373 Query: 128 PAPDYGTFKTYKQSVDGLGAAA 149 A + + + + + Sbjct: 374 GAHQHKSSGAFGGTNTSIFPNG 395 >UniRef50_A9ITY4 Phage related protein n=6 Tax=Bartonella RepID=A9ITY4_BART1 Length = 376 Score = 93.2 bits (230), Expect = 3e-18, Method: Composition-based stats. Identities = 32/89 (35%), Positives = 41/89 (46%), Gaps = 10/89 (11%) Query: 9 LPVGVPVPWPSATPPTGWLKCNGAAFSAEEYPELAKAYPT----------NKLPDLRGEF 58 LP G+ P+ P GWL C+G A+S Y L T +PD RG F Sbjct: 159 LPSGLIGPFAMERLPDGWLLCDGRAYSRRTYRALFDGIGTTWGEGDGSTTFNVPDFRGMF 218 Query: 59 IRGWDDGRGIDTGRSILSIQGYATEDHAH 87 +RG D R +D RS S QG + + H H Sbjct: 219 LRGMDYERNLDPWRSFASQQGCSLKAHEH 247 >UniRef50_B7LN99 Putative tail fiber protein n=2 Tax=Enterobacteriaceae RepID=B7LN99_ESCF3 Length = 593 Score = 93.2 bits (230), Expect = 3e-18, Method: Composition-based stats. 
Identities = 49/215 (22%), Positives = 70/215 (32%), Gaps = 55/215 (25%) Query: 2 GLGEGSALPVGVPVPWPSATPPTGWLKCNGAAFSAEEYPELAKAYPTNKLPDLRGEFIRG 61 G +A PVG P+ WPS P G+ G F YP LA AYP+ +PD+RG I+G Sbjct: 384 GFEPVNAFPVGAPIAWPSDIVPEGYAIMQGQTFDKAAYPLLAAAYPSGVIPDMRGWTIKG 443 Query: 62 WDDGRGIDTGRSILSIQGYATE-------------------------------------- 83 +GR++LS + + Sbjct: 444 KPA-----SGRAVLSQEQDGIKSHTHSASASSTDLGTKTTSSFDYGTKTVSTFNHGTKTT 498 Query: 84 ----DHAHGLPSRSTIVTDATINFYFDEIWVNSGTDIIKRGNTNDAGLPAPDYG------ 133 H H + R + + +T D G G Sbjct: 499 NNTGAHTHTVGGRYGGDSIGGKQRVQVSGTNQVSSSDGAHAHTVDIGQHNHTVGIGAHAH 558 Query: 134 --TFKTYKQSVDGLGAAASETRPRNIAFNYIVRAA 166 + ++ A +E +NIAFNYIVR A Sbjct: 559 TVALGAHGHTITVNAAGNAENTVKNIAFNYIVRLA 593 >UniRef50_Q2T5M0 Phage-related tail fiber protein n=19 Tax=root RepID=Q2T5M0_BURTA Length = 790 Score = 93.2 bits (230), Expect = 3e-18, Method: Composition-based stats. Identities = 52/239 (21%), Positives = 74/239 (30%), Gaps = 80/239 (33%) Query: 7 SALPVGVPVPWPSATPPTGWLKCNGAAFSAEEYPEL------------------------ 42 SA +G V P T G+LK NG + +YPEL Sbjct: 551 SATTIGQIVFEPRTTVRPGFLKANGVLVNRADYPELWAYAQASGALVSDADWMKDRWGCF 610 Query: 43 --AKAYPTNKLPDLRGEFIRGWDDGR-GIDTGRSILSIQGYATEDHAHGLPS-------- 91 T +LP+LRGEFIR W D R G+D R I + QG HAHG + Sbjct: 611 STGDGATTFRLPELRGEFIRCWSDARGGVDATRQIGAFQGDQNHTHAHGAAASEAPDHVH 670 Query: 92 ---------------------RSTIVTDATINFYFDEIWVNSGTDIIKRGNTND------ 124 + ++ W G + +D Sbjct: 671 TAWTDVQGWHGHHGWTNAVGDHQHVSPWGEHPQMYNPPWGTWGAANNRGAEGSDNDNVYG 730 Query: 125 ----AGLPAPDYGTFKT--------------YKQSVDGLGAAASETRPRNIAFNYIVRA 165 AG ++ T + ++ E RPRN+A ++RA Sbjct: 731 MTSPAGNHNHEFNTEGNGNHGHAVGIGGGGRHAHTIAVQPDGGDEARPRNVALLALIRA 789 >UniRef50_B4EF34 Putative phage tail protein n=1 Tax=Burkholderia cenocepacia J2315 RepID=B4EF34_BURCJ Length = 883 Score = 92.5 bits (228), Expect = 5e-18, Method: Composition-based stats. 
Identities = 50/230 (21%), Positives = 73/230 (31%), Gaps = 76/230 (33%) Query: 12 GVPVPWPSATPPTGWLKCNGAAFSAEEYPEL--------------------------AKA 45 G V P T G+LK NGA +YP L Sbjct: 653 GTVVFEPRTTARAGFLKLNGALLKRADYPALWAYAQASGALSTETDWAAGWSGTFSTGDG 712 Query: 46 YPTNKLPDLRGEFIRGWDDGRGIDTGRSILSIQGYAT----------------------- 82 T ++P+LRGEF+R WDD RG+D R + + Q +A Sbjct: 713 TTTFRIPELRGEFVRCWDDTRGVDPNRGLGASQNFANAWHAHGASAAASGDHVHSAWTDV 772 Query: 83 -------------EDHAHGLP------------SRSTIVTDATINFYFDEIWVNSGTDII 117 DH H P S + + + ++ + + Sbjct: 773 QGWHGHHGWTASVGDHQHVAPYSESGIAPFGTHSTNQVGSHGGVDNDNPWAFTSGAGGHN 832 Query: 118 KRGNTNDAGLPAPDYGTFKTYKQSVDGLGA--AASETRPRNIAFNYIVRA 165 NT AG + G S A+E+RPRN+A ++RA Sbjct: 833 HEFNTEGAGNHGHNVGIGAAGNHSHAITVNGDGANESRPRNVALLAMIRA 882 >UniRef50_B8HZW4 Tail Collar domain protein n=2 Tax=Clostridium RepID=B8HZW4_CLOCE Length = 200 Score = 91.7 bits (226), Expect = 8e-18, Method: Composition-based stats. Identities = 46/204 (22%), Positives = 74/204 (36%), Gaps = 64/204 (31%) Query: 9 LPVGVPVPWPSATPPT--------GWLKCNGAAFSAEEYPELAKAYPT--------NKLP 52 +P+G + + GWL C+G+ EYP+L +A LP Sbjct: 7 MPIGSVISFAGEIKSEMVNRLYRMGWLICDGSKLKIAEYPDLFQAIGKAHGGDNTYFYLP 66 Query: 53 DLRGEFIRGW------DDGRGIDT--------------GRSILSIQGYA----------- 81 D + +FIRG + GR +D G ++ S Q +A Sbjct: 67 DTQSKFIRGVNGDSVGESGRLMDPDVAKRTFAKPGGNTGNNVGSYQDFATGLPKVSLTTD 126 Query: 82 -TEDHAHGLPSRSTIVTDATINFYFDEIWVNSGTDIIKRGNTNDAGLPAPDYGTFKTYKQ 140 H H LP + D + N Y I + G + T ++G Sbjct: 127 FIGSHTHSLPH----LPDGSHNAYAGSIGRDGGKEAGDNTRTGESG------------SH 170 Query: 141 SVDGLGAAASETRPRNIAFNYIVR 164 S + +G ETRPRN+ ++I++ Sbjct: 171 SHEIIGGGDPETRPRNMNLHFIIK 194 >UniRef50_A1HR57 Putative uncharacterized protein n=1 Tax=Thermosinus carboxydivorans Nor1 RepID=A1HR57_9FIRM Length = 269 Score = 90.2 bits (222), Expect = 3e-17, Method: Composition-based stats. 
Identities = 53/184 (28%), Positives = 65/184 (35%), Gaps = 50/184 (27%) Query: 7 SALPVGVPVPWPSATPPTGWLKCNGAAFSAEEYPEL------------------------ 42 PVG V S G+LK NGA S YP L Sbjct: 109 DGTPVGRIVAEISPICRPGYLKANGALVSRAAYPRLWAYVQARGLVVPDTVWPANYWGCF 168 Query: 43 --AKAYPTNKLPDLRGEFIRGWDDGRGIDTGRSILSIQGYATEDHAHGLPSRSTIVTDAT 100 T +LPDLRGEFIRG DDGRG+D GR+ S Q + H H S+ + Sbjct: 169 STGDGSTTFRLPDLRGEFIRGGDDGRGVDGGRAFGSWQADGIKSHNHPYQSQPYLF---- 224 Query: 101 INFYFDEIWVNSGTDIIKRGNTNDAGLPAPDYGTFKTYKQSVDGLGAAASETRPRNIAFN 160 + G D+I T + ETRPRNIA Sbjct: 225 -------VESFDGGDVIAE-------------RTSTAKWVTHYTSNFGGPETRPRNIALL 264 Query: 161 YIVR 164 Y ++ Sbjct: 265 YCIK 268 >UniRef50_Q03314 Protein rhiB n=2 Tax=Rhizobium leguminosarum bv. viciae RepID=RHIB_RHILV Length = 219 Score = 89.4 bits (220), Expect = 4e-17, Method: Composition-based stats. Identities = 43/186 (23%), Positives = 69/186 (37%), Gaps = 54/186 (29%) Query: 4 GEGSALPVGVPVPWPSATPPTGWLKCNGAAFSAEEYPELAKAYP------------TNKL 51 GE + P+ + A GW+ C+G A YPEL ++ Sbjct: 61 GEAAGTNAEAPISYVEA---QGWMLCDGRYLRAAVYPELYAVLGGLYGERNSTADLEFRI 117 Query: 52 PDLRGEFIRGWDDGRGIDTG-------------RSILSIQGYATEDHAHGLPSRSTIVTD 98 PD RG F+RG+D G G+D + S+Q A + HAH Sbjct: 118 PDYRGLFLRGFDAGGGMDPDAKRRLDPTGNNVANVVGSLQCDALQVHAHPYE-------- 169 Query: 99 ATINFYFDEIWVNSGTDIIKRGNTNDAGLPAPDYGTFKTYKQSVDGLGAAASETRPRNIA 158 + + I ++GN + + G+ + A ETRP+N+A Sbjct: 170 -----------ITTPAGISQQGNAAGTSISSKSTGSPENPA-------RTALETRPKNVA 211 Query: 159 FNYIVR 164 NY+++ Sbjct: 212 VNYLIK 217 >UniRef50_B3R3K1 Bacteriophage large tail fiber protein n=1 Tax=Cupriavidus taiwanensis RepID=B3R3K1_CUPTR Length = 1045 Score = 87.8 bits (216), Expect = 1e-16, Method: Composition-based stats. 
Identities = 50/220 (22%), Positives = 68/220 (30%), Gaps = 65/220 (29%) Query: 11 VGVPVPWPSATPPTGWLKCNGAAFSAEEYPEL--------------------------AK 44 VG + P T G LK NGA +YPEL Sbjct: 825 VGQIIIEPRTTARAGCLKLNGALLKRADYPELWAYAQASGAIVTDAAWLAGSWGCFSHGD 884 Query: 45 AYPTNKLPDLRGEFIRGWDDGRGIDTGRSILSIQGYATE------------DHAHGLPSR 92 T ++P+ RGE++R WDD RG D GR I Q + DH HG + Sbjct: 885 GNTTFRIPEYRGEYLRFWDDARGADAGRGIGVFQDSQNKTHSHAASATPVGDHNHGAWTD 944 Query: 93 STIVTDATIN------------------------FYFDEIWVNSGTDIIKRG-NTNDAGL 127 + +N Y +GT G + G Sbjct: 945 AQGWHGHGVNDPGHAHSFQTWTGGGATGAGRVSGSYVTNADAWAGTSASYTGISIAGDGS 1004 Query: 128 PAPDYGTFKTYKQSVDGLGA--AASETRPRNIAFNYIVRA 165 A + G S +E R RNI+ ++RA Sbjct: 1005 HAHNVGVGYAGNHSHAITVNADGGAEVRVRNISALAMIRA 1044 >UniRef50_Q93D60 PilV n=7 Tax=Escherichia coli RepID=Q93D60_ECOLX Length = 456 Score = 86.7 bits (213), Expect = 2e-16, Method: Composition-based stats. Identities = 50/160 (31%), Positives = 67/160 (41%), Gaps = 39/160 (24%) Query: 7 SALPVGVPVPWPSATPPTGWLKCNGAAFSAEEYPELAKAYPTNKLPDLRGEFIRGWDDGR 66 + PVG P+PWPSATPP G+L NG +FS YP+LA+AYP KLPDLR F Sbjct: 336 ESYPVGSPIPWPSATPPQGYLVMNGQSFSCSRYPQLARAYPGCKLPDLRRCF-------- 387 Query: 67 GIDTGRSILSIQGYATEDHAHGLPSRSTIVTDATINFYFDEIWVNSGTDIIKRGNTNDAG 126 + + T + E + I G ++ G Sbjct: 388 --------------------------YSWLGQRTWAGWRSEQTSPELSRPINPGVYHERG 421 Query: 127 LPAPDYGTFKTYKQSVDGLGAAASETRPRNIAFNYIVRAA 166 + + Y G A P+NIAFNYIV+A+ Sbjct: 422 GWLKGHHSGMAYLGPGKHNGNA-----PQNIAFNYIVKAS 456 >UniRef50_A3YFP9 35 kDa protein-like n=1 Tax=Marinomonas sp. MED121 RepID=A3YFP9_9GAMM Length = 207 Score = 84.0 bits (206), Expect = 2e-15, Method: Composition-based stats. 
Identities = 45/194 (23%), Positives = 70/194 (36%), Gaps = 53/194 (27%) Query: 6 GSALPVGVPV------------PWPSATPPTGWLKCNGAAFSAEEYPELAKAYP------ 47 G A+PVG + P+ + P WLKC+G++ +YPEL A Sbjct: 29 GDAMPVGSVIAFAGEIRTSGDKPFETNLPMFNWLKCDGSSLEVAQYPELFSALGYRYGGS 88 Query: 48 --TNKLPDLRGEFIRGWD-----------DGRGIDT---GRSILSIQGYATEDHAHGLPS 91 LPDLRGEF+RG D +GR + S QG+A + H H Sbjct: 89 GQKFNLPDLRGEFLRGVDVDSSNNKKASLEGRKGAANGGNHEVGSTQGFALQSHVHTYQK 148 Query: 92 RSTIVTDATINFYFDEIWVNSGTDIIKRGNTNDAGLP-APDYGTFKTYKQSVDGLGAAAS 150 I+ + +P + + + Q + L + Sbjct: 149 PK------------------RAMPILAEPGVSTTQIPLSQEDTSTPKSSQKNENLALSDK 190 Query: 151 ETRPRNIAFNYIVR 164 ETRP N ++++ Sbjct: 191 ETRPVNTFVYWLIK 204 >UniRef50_A3GRX3 Tail fiber like-protein n=1 Tax=Vibrio cholerae NCTC 8457 RepID=A3GRX3_VIBCH Length = 182 Score = 83.2 bits (204), Expect = 3e-15, Method: Composition-based stats. Identities = 41/175 (23%), Positives = 64/175 (36%), Gaps = 26/175 (14%) Query: 9 LPVGVPVPWPSATPPTGWLKCNGAAFSAEEYPELAKAYPTNKLPDLRGEFIRGWDDGRGI 68 PVG +PW + P G+ G AF Y ELAK +P +PD+RG + G +DG Sbjct: 17 FPVGGAIPWFTDVAPEGFGMFKGQAFDVNVYTELAKVFPNGIIPDMRGCGVIGKEDGE-- 74 Query: 69 DTGRSILSIQGYATEDHAHGLPSRSTIVTDATINFYFDEIWVNSGTDIIKRGN----TND 124 ++ + + ++H H + S+I + SG G+ T+ Sbjct: 75 ----AVGAYEEGQVKNHGHPNSTVSSIDLGSKNTANGGNHTHFSGIAAFGGGSHRYQTDV 130 Query: 125 AGLPAPDYGTFKTYKQSVDGLGAAASETRPRNIAF-------------NYIVRAA 166 G + +G+ A IA N+IVR A Sbjct: 131 NGSGGNINTSAAGNHYHSIPMGSHAHAVT---IALFGALKNTINHRKINWIVRLA 182 >UniRef50_B5TAB1 Gp47 n=2 Tax=root RepID=B5TAB1_9CAUD Length = 325 Score = 80.9 bits (198), Expect = 1e-14, Method: Composition-based stats. 
Identities = 33/144 (22%), Positives = 54/144 (37%), Gaps = 27/144 (18%) Query: 48 TNKLPDLRGEFIRGWDDGRGIDTGRSILSIQGYATEDHAHGLPSRST-------IVTDAT 100 +LPD+RGE +R WD+GRG+D R++ S QG A E H H S + Sbjct: 181 QFRLPDVRGEGLRLWDNGRGVDQARTLGSWQGGAIESHGHAANSGDAGAVADRRTGSGGG 240 Query: 101 INFYFD-------EIWVNSGTDIIKRGNTNDAGLPAPDYG-------------TFKTYKQ 140 N +V S T + ++ + D ++ Sbjct: 241 HNHNNGIFTRLLRAPYVGSITGSDTTNSGDEQAVGGGDSADIAAVGDHDHLIPGVGPHRH 300 Query: 141 SVDGLGAAASETRPRNIAFNYIVR 164 + +ETR RN+A +++ Sbjct: 301 DISISATGGNETRMRNVAVAALIK 324 >UniRef50_A8YDB4 Genome sequencing data, contig C291 n=2 Tax=Microcystis aeruginosa RepID=A8YDB4_MICAE Length = 166 Score = 77.8 bits (190), Expect = 1e-13, Method: Composition-based stats. Identities = 31/113 (27%), Positives = 48/113 (42%), Gaps = 25/113 (22%) Query: 4 GEGSALPVGVPVPWPSATPPTGWLKCNGAAFSAEEYPELAKAYPT---------NKLPDL 54 + +L +P+ +P A GW+ C+G + YPEL T +LPD Sbjct: 38 AQAESLNANIPITYPEAY---GWMLCDGRYLEIDAYPELFAVIGTLYGKQGDNKFRLPDY 94 Query: 55 RGEFIRGWDDGRGIDTGRS-------------ILSIQGYATEDHAHGLPSRST 94 RG F+RG D G G+D + I S+Q A + H H + ++ Sbjct: 95 RGLFMRGVDAGSGLDPDAAERIGPEGMGKSSGIGSLQCDALQQHQHDYNASNS 147 >UniRef50_Q4TVW2 Tail fiber protein n=4 Tax=Viruses RepID=Q4TVW2_9CAUD Length = 760 Score = 76.7 bits (187), Expect = 3e-13, Method: Composition-based stats. 
Identities = 34/139 (24%), Positives = 55/139 (39%), Gaps = 4/139 (2%) Query: 8 ALPVGVPVPWPSATPPTGWLKCNGAAFSAEEYPELAKAYPTNKLPDLRGEFIRGWDDGRG 67 A+P+G P+ TPP G+L C+G+ FS +EYP+L + LPD+RG +++ D Sbjct: 263 AVPIGSIFPF-VKTPPAGYLTCDGSTFSKDEYPDLYAYLGSTTLPDMRGRYLKMPSDLAN 321 Query: 68 IDTG--RSILSIQGYATEDHAHGLPSRSTIVTDATINFYFDEIWVNSGTDIIKRGNTNDA 125 I I ++ H H ++ T+ E +V SG + Sbjct: 322 IYQKFPAIIPALLHDVDISHTHTASQQAHAHDRGTME-IGGEFFVGSGHGLYIATGAYGG 380 Query: 126 GLPAPDYGTFKTYKQSVDG 144 + G G Sbjct: 381 AFFSDSPGGADNNGGGASG 399 >UniRef50_C3XAA4 Putative uncharacterized protein n=1 Tax=Oxalobacter formigenes OXCC13 RepID=C3XAA4_OXAFO Length = 305 Score = 74.7 bits (182), Expect = 1e-12, Method: Composition-based stats. Identities = 42/157 (26%), Positives = 53/157 (33%), Gaps = 30/157 (19%) Query: 2 GLGEGSALPVGVPVPWPSATPPTGWLKCNGAAFSAEEYPELAKAYP----------TNKL 51 G+ S +PVG + T P G+LK NGAA E YPEL T L Sbjct: 7 GITPSSGVPVGTIEYFAMVTSPAGYLKANGAAVGRETYPELYATIGTTFGEGDGSSTFNL 66 Query: 52 PDLRGEFIRGWDD-GRGIDTGRSILSIQGYATEDHAHGLPSRSTIVTDATINFYFDEIWV 110 PDL F +G + G+ I+ G DH H LP Sbjct: 67 PDLIDRFAQGSNTPGQKIEAG----------LSDHNHTLPLALEETGTGYAAH------- 109 Query: 111 NSGTDIIKRGNTNDAGLPAPDYGTFKTYKQSVDGLGA 147 G++I A P YG T + L Sbjct: 110 --GSNISSGTTVGYASASNPIYGASNTVQPPALTLLP 144 >UniRef50_C3X8I7 Putative uncharacterized protein n=1 Tax=Oxalobacter formigenes OXCC13 RepID=C3X8I7_OXAFO Length = 369 Score = 74.0 bits (180), Expect = 2e-12, Method: Composition-based stats. 
Identities = 43/154 (27%), Positives = 57/154 (37%), Gaps = 24/154 (15%) Query: 8 ALPVGVPVPWPSATPPTGWLKCNGAAFSAEEYPELAKAYP----------TNKLPDLRGE 57 +PVG + ++TPP G+LK +G+ E YPEL A T LPDL G Sbjct: 105 GIPVGSIDYFATSTPPAGYLKADGSEVGRETYPELFTAIGTVFGEGNGDSTFNLPDLMGR 164 Query: 58 FIRGWD-DGRGIDTGRSILSIQGYATEDHAH--GLPSRSTIVT-DATINFYFDEIWVNSG 113 F +G G+ I G DH H G + + I SG Sbjct: 165 FAQGSTIVGQRIKAG----------LPDHKHIEGFAGVNPNSSYGVATTAPQGNINTQSG 214 Query: 114 TDIIKRGNTNDAGLPAPDYGTFKTYKQSVDGLGA 147 T + T+ A L P YG T + L Sbjct: 215 TSVSNHPYTSPASLSNPIYGASDTVQPPALTLLP 248 >UniRef50_D2V5I7 Microcystin-dependent protein n=1 Tax=Naegleria gruberi RepID=D2V5I7_NAEGR Length = 191 Score = 73.6 bits (179), Expect = 2e-12, Method: Composition-based stats. Identities = 35/175 (20%), Positives = 56/175 (32%), Gaps = 19/175 (10%) Query: 9 LPVGVPVPWPSATPPTGWLKCNGAAFSAE--EYPELAKAYP----------TNKLPDLRG 56 +PVG+ + T P GWL C+GA + +Y L + + +PDLRG Sbjct: 16 IPVGIVNAFAGTTIPAGWLLCDGATYPNSHPDYIRLFQTIGNAYGSTGGPHSFNVPDLRG 75 Query: 57 EFIRGWDDGRGIDT-------GRSILSIQGYATEDHAHGLPSRSTIVTDATINFYFDEIW 109 + G G G+ G +Q H+H + I I Sbjct: 76 RAVVGIGHGAGLSNRTLAQKVGEESHQLQISELPSHSHSGTTGKANKQPYIIVHQSGPIS 135 Query: 110 VNSGTDIIKRGNTNDAGLPAPDYGTFKTYKQSVDGLGAAASETRPRNIAFNYIVR 164 T G + + G +A E ++ NYI++ Sbjct: 136 DVFHTPGWCGGPATHKDDDNFTGANHTHNFTTNEVGGNSAHENMQPSLVLNYIIK 190 >UniRef50_P10930 Short tail fiber protein n=8 Tax=Myoviridae RepID=VG12_BPT4 Length = 527 Score = 73.6 bits (179), Expect = 2e-12, Method: Composition-based stats. 
Identities = 42/184 (22%), Positives = 63/184 (34%), Gaps = 28/184 (15%) Query: 9 LPVGVPVPWPSATPPT-GWLKCNGAAFSAEEYPELAKAYPTN--------KLPDLRGEFI 59 +PVG + W + + P+ W C+G SA + P A T LPD+RG F+ Sbjct: 341 IPVGAIMMWAADSLPSDAWRFCHGGTVSASDCPLYASRIGTRYGGNPSNPGLPDMRGLFV 400 Query: 60 RGWDDG--------RGID-----------TGRSILSIQGYATEDHAHGLPSRSTIVTDAT 100 RG G G D TG + +Q H H A Sbjct: 401 RGSGRGSHLTNPNVNGNDQFGKPRLGVGCTGGYVGEVQIQQMSYHKHAGGFGEHDDLGAF 460 Query: 101 INFYFDEIWVNSGTDIIKRGNTNDAGLPAPDYGTFKTYKQSVDGLGAAASETRPRNIAFN 160 N + D + + K +++ +ETRP NI+ N Sbjct: 461 GNTRRSNFVGTRKGLDWDNRSYFTNDGYEIDPESQRNSKYTLNRPELIGNETRPWNISLN 520 Query: 161 YIVR 164 YI++ Sbjct: 521 YIIK 524 >UniRef50_C0DSG4 Putative uncharacterized protein n=1 Tax=Eikenella corrodens ATCC 23834 RepID=C0DSG4_EIKCO Length = 436 Score = 72.4 bits (176), Expect = 5e-12, Method: Composition-based stats. Identities = 36/210 (17%), Positives = 62/210 (29%), Gaps = 52/210 (24%) Query: 6 GSALPVGVPVPWPSA-TPPTGWLKCNGAAFSAEEYPELAKAYPTNKLPD-------LRGE 57 G LPVG V +P A + P G+LK +G+ F+ YP+L + NKLP+ + Sbjct: 69 GKGLPVGAVVGFPRAISSPEGYLKADGSTFAQATYPDLYRVLGGNKLPNLTRSDVGMTAY 128 Query: 58 F-IRGWDDG-----------------------------------------RGIDTGRSIL 75 F I DG R ++ Sbjct: 129 FPIEAIPDGWIKYDEVATKVTQSAYPELYRLLVAQYGSIDAVPKAEDRFIRNASGSLAVG 188 Query: 76 SIQGYATEDHAHGLPSRSTIVTDATINFYFDEIWVNSGTDIIKRGNTNDAGLPAPDYGTF 135 + QG + G+ + + ++ + + + Sbjct: 189 TQQGDTIRNITGGIEALYSGYRYTLYTKADGAFTMDLDDG--ANSTFSSSKGDSDHNNRK 246 Query: 136 KTYKQSVDGLGAAASETRPRNIAFNYIVRA 165 K A E RP+ +A ++A Sbjct: 247 KRVVFDASRSVPTADEVRPKALAMVLCIKA 276 >UniRef50_C3X971 Predicted protein n=6 Tax=Oxalobacter formigenes OXCC13 RepID=C3X971_OXAFO Length = 534 Score = 72.4 bits (176), Expect = 6e-12, Method: Composition-based stats. 
Identities = 36/176 (20%), Positives = 63/176 (35%), Gaps = 30/176 (17%) Query: 2 GLGEGSALPVGVPVPWPSATPPTGWLKCNGAAFSAEEYPELAKAYP----------TNKL 51 G+ + +P+G + +TPP G+LK +G A E Y EL T L Sbjct: 242 GISPLNGVPIGTVEYFAMSTPPAGYLKADGRAVGRETYAELYSVIGTTFGEGDEQTTFNL 301 Query: 52 PDLRGEFIRGWDD-GRGIDTGRSILSIQGYATEDHAHGLPSRSTIVTDATINFYFDEIWV 110 PDL F +G + G+ I+ G LP+ ++T++ + Sbjct: 302 PDLIDRFAQGSNTPGQKIEAG-----------------LPNIEGVITNSGSILWAGNEDA 344 Query: 111 NSGTDIIKRGNTNDAGLPAPDYGTFKTYKQSVDGLGAAASET-RPRNIAFNYIVRA 165 SG + + ++ S AS+T +P + ++A Sbjct: 345 -SGAFSLTGASPRANTATVGAGANTLSFNASQSNQIYGASDTVQPPALTLLPCIKA 399 >UniRef50_C3X8V5 Bacteriophage tail fiber protein n=2 Tax=Oxalobacter formigenes OXCC13 RepID=C3X8V5_OXAFO Length = 480 Score = 70.5 bits (171), Expect = 2e-11, Method: Composition-based stats. Identities = 28/76 (36%), Positives = 39/76 (51%), Gaps = 11/76 (14%) Query: 7 SALPVGVPVPWPSATPPTGWLKCNGAAFSAEEYPELAKAYP----------TNKLPDLRG 56 +PVG + ++TPP G+LK +GAA E YP+L A T LPDL G Sbjct: 194 KGVPVGTIEYFATSTPPAGYLKADGAAVGRETYPDLFAAIGTAFGEGDGSTTFNLPDLIG 253 Query: 57 EFIRGWD-DGRGIDTG 71 F +G D G+ ++ G Sbjct: 254 RFAQGSDVPGQKLEAG 269 >UniRef50_D2MH12 Tail Collar domain protein n=1 Tax=Rhodopseudomonas palustris DX-1 RepID=D2MH12_RHOPA Length = 346 Score = 70.1 bits (170), Expect = 3e-11, Method: Composition-based stats. 
Identities = 30/193 (15%), Positives = 59/193 (30%), Gaps = 39/193 (20%) Query: 8 ALPVGVPVPWPSATPPTGWLKCNGAAFSAEEYPELAKAY--------PTNKLPDLRGEFI 59 ++P G + + TPP GWL C+G S + +L A +P+L F Sbjct: 156 SIPPGFILDFAGPTPPEGWLTCDGQLVSTVTFADLFAAIGYTWGGSGGQFAVPNLVKRFR 215 Query: 60 RGWDDGRGIDTGRSILSIQGYATEDHAHGLPSRSTIVTDATINFYFDEIWVNSGTD---- 115 R G G G + ++Q H+H + D ++ + + ++ Sbjct: 216 R--HRGDGTVAG-GVGTLQTNQIGLHSHSASMDAQGHHDHYLDLWSSGMNRSNPHSHPAS 272 Query: 116 ------------------------IIKRGNTNDAGLPAPDYGTFKTYKQSVDGLGAAASE 151 I + N + + ++ +E Sbjct: 273 GSGIGVSGGFDTGVYAPQGPLNGVSIGATDINHEHRVTGNTAGNGGHIHNITVAANGGNE 332 Query: 152 TRPRNIAFNYIVR 164 TRP + ++ Sbjct: 333 TRPDSATVMACIK 345 >UniRef50_C9QG11 Probable tail fiber protein n=1 Tax=Vibrio orientalis CIP 102891 RepID=C9QG11_VIBOR Length = 497 Score = 70.1 bits (170), Expect = 3e-11, Method: Composition-based stats. Identities = 33/154 (21%), Positives = 52/154 (33%), Gaps = 36/154 (23%) Query: 11 VGVPVPWPSATPPTGWLKCNGAAFSAEEYPELAKAYPTNKLPD------LRGEFIRGWDD 64 VG+P W + P + A Y LA+ YP D +R EF+R D Sbjct: 367 VGMPFYWLDTSAPEWAVLEINVDLPAVVYWRLARRYPALVSDDSINTGEIRAEFLRVLDL 426 Query: 65 GRGIDTGRSILSIQGYATEDHAHGLPSRSTIVTDATINFYFDEIWVNSGTDIIKRGNTND 124 GRGI+ + + + +H+H P+ Sbjct: 427 GRGINPAQGLNEFSDASVGEHSHRYPTGG------------------------------V 456 Query: 125 AGLPAPDYGTFKTYKQSVDGLGAAASETRPRNIA 158 AG+ +GT T + E +PR++A Sbjct: 457 AGIGPYLHGTSWTGGYATTEPFNQGQENKPRSVA 490 >UniRef50_C3X909 Predicted protein n=2 Tax=Oxalobacter formigenes OXCC13 RepID=C3X909_OXAFO Length = 549 Score = 69.7 bits (169), Expect = 3e-11, Method: Composition-based stats. 
Identities = 36/174 (20%), Positives = 57/174 (32%), Gaps = 23/174 (13%) Query: 2 GLGEGSALPVGVPVPWPSATPPTGWLKCNGAAFSAEEYPELAKAYP----------TNKL 51 G+ S +PVG + TPP G+LK +G+A S YP+L A T L Sbjct: 257 GVVCPSGVPVGAIGYFAMQTPPAGYLKADGSAVSRATYPDLFGAIGTTFGEGDGSTTFNL 316 Query: 52 PDLRGEFIRGWDDGRGIDTGRSILSIQGYATEDHAHGLPSRSTIVTDATINFYFDEIWVN 111 PDL F +G G I + L + + + + F + Sbjct: 317 PDLIDRFAQG-----NATPGLKI----EAGLPNITGSL-TVTASNQGSAASGAFSRTQIG 366 Query: 112 SGTDIIKRGNTNDAGLPAPDYGTFKTYKQSVDGLGAAASETRPRNIAFNYIVRA 165 + + G N +G Y + G P + ++A Sbjct: 367 AVGGGLGGGQYNSSGCGPNLYSFDSRVSNPIYGASNTVQ---PPALTLLPCIKA 417 >UniRef50_C3X1Y2 Tail fiber protein gpH n=1 Tax=Oxalobacter formigenes HOxBLS RepID=C3X1Y2_OXAFO Length = 480 Score = 69.0 bits (167), Expect = 5e-11, Method: Composition-based stats. Identities = 31/175 (17%), Positives = 58/175 (33%), Gaps = 26/175 (14%) Query: 1 VGLGEGSALPVGVPVPWPSATPPTGWLKCNGAAFSAEEYPELAKAYP----------TNK 50 +G S +P+G + ATPP G+LK +GAA YP+L A T Sbjct: 181 MGWKYPSGVPIGTVEYFAMATPPAGYLKADGAAVGRATYPDLFAAIGTTFGAGDGETTFN 240 Query: 51 LPDLRGEFIRGWDDGRGIDTGRSILSIQGYATEDHAHGLPSRSTIVTDATINFYFDEIWV 110 LPD+ G F G + +++ + + +++ F ++ Sbjct: 241 LPDMIGRFAEG---------SATPGTVKEAGLPNITGEINGH----FGSSVAFGTGSLFT 287 Query: 111 NSGTDIIKRGNTNDAGLPAPDYGTFKTYKQSVDGLGAAASETRPRNIAFNYIVRA 165 + G R G + + + +P + ++A Sbjct: 288 SIGGS---RYRATPDGTGGEAFFAAFISASRSSPIYGNSDTVQPPALTLLPCIKA 339 >UniRef50_C3X3W1 Predicted protein n=2 Tax=Oxalobacter formigenes HOxBLS RepID=C3X3W1_OXAFO Length = 365 Score = 68.6 bits (166), Expect = 6e-11, Method: Composition-based stats. 
Identities = 37/171 (21%), Positives = 59/171 (34%), Gaps = 27/171 (15%) Query: 3 LGEGSALPVGVPVPWPSATPPTGWLKCNGAAFSAEEYPEL---------AKAYPTNKLPD 53 L + LP G + P G+L CNGA+ S YPEL T LPD Sbjct: 212 LDKAEKLPAGTIIAVGGNITPEGFLYCNGASLSPSAYPELCAVIGGTYGGDGLTTFNLPD 271 Query: 54 LRGEFIRGWDDGRGIDTGRSILSIQGYATEDHAHGLPSRSTIVTDATINFYFDEIWVNSG 113 RG +++G D GR + + GLP+ + + I + Sbjct: 272 FRGRWMQGNDT-----AGRVLAA-----------GLPNVTGTIVSGAIAHA--TAYQTGA 313 Query: 114 TDIIKRGNTNDAGLPAPDYGTFKTYKQSVDGLGAAASETRPRNIAFNYIVR 164 I G + ++ + + A+ RP +I Y ++ Sbjct: 314 FYNIDVGAFGGYHAGSQNHYRAGFEASRSNPIYGASDTVRPPSITVRYCIK 364 >UniRef50_P51735 Probable tail fiber protein n=27 Tax=root RepID=VPH_BPHP1 Length = 925 Score = 68.6 bits (166), Expect = 7e-11, Method: Composition-based stats. Identities = 43/211 (20%), Positives = 72/211 (34%), Gaps = 58/211 (27%) Query: 6 GSALPVGVPVPWPSA-TPPTGWLKCNGAAFSAEEYPELAKAYP-TNKLPDL-------RG 56 G +P+G V +P A T P G+LK NG F+ + +P+L + +N+LPDL Sbjct: 532 GDGVPIGSVVSFPRAVTNPVGFLKANGTTFNQQTFPDLYRTLGDSNQLPDLTRSDVGMTA 591 Query: 57 EFIR-----GW---DDGRG--------------ID--------------------TGRSI 74 F GW D R +D G +I Sbjct: 592 YFAVDNIPSGWIAFDSIRSTVTQQNYPELYQYLVDKYSSISNVPLAEDRFIRNTGNGLNI 651 Query: 75 LSIQGYATEDHAHGLPSRSTIVTDATINFYFDEIWVNSGTDIIKRGNTNDAGLPAPDYGT 134 Q + H H + T D++ + F + ++ T D L + Sbjct: 652 GQTQSDEIKKHVHRVR---THWADSSDSSIFYDKTKTVIDSRLRTATTTDDNLSDNGFM- 707 Query: 135 FKTYKQSVDGLGAAASETRPRNIAFNYIVRA 165 + + ETRP+++ ++A Sbjct: 708 ---HPLLDTPMATGGDETRPKSLILKLCIKA 735 >UniRef50_C3X192 Predicted protein n=1 Tax=Oxalobacter formigenes HOxBLS RepID=C3X192_OXAFO Length = 361 Score = 67.8 bits (164), Expect = 1e-10, Method: Composition-based stats. 
Identities = 29/173 (16%), Positives = 61/173 (35%), Gaps = 25/173 (14%) Query: 3 LGEGSALPVGVPVPWPSATPPTGWLKCNGAAFSAEEYPELAKAYP----------TNKLP 52 + + S +P+G + ATPP G+LK +GAA YP+L A T LP Sbjct: 74 INKRSGVPIGTVEYFAMATPPAGYLKADGAAVGRATYPDLFAAIGTTFGAGDGETTFNLP 133 Query: 53 DLRGEFIRGWDDGRGIDTGRSILSIQGYATEDHAHGLPSRSTIVTDATINFYFDEIWVNS 112 D+ G+F G + +++ + + + ++ +A+ I S Sbjct: 134 DMIGQFAEG---------SATPGAVKEAGLPNIIGSISNVASGGANASSASGALSIAARS 184 Query: 113 GTDIIKRGNTNDAGLPAPDYGTFKTYKQSVDGLGAAASETRPRNIAFNYIVRA 165 ++ + + + ++ +P + ++A Sbjct: 185 NNNMTPGSSAYGHTF------ALAINASDFNPIYGKSNTVQPPALTLLPCIKA 231 >UniRef50_B5TK79 Tail collar protein n=2 Tax=root RepID=B5TK79_9VIRU Length = 364 Score = 67.4 bits (163), Expect = 2e-10, Method: Composition-based stats. Identities = 40/182 (21%), Positives = 63/182 (34%), Gaps = 31/182 (17%) Query: 3 LGEGSALPVGVPVPWPS-ATPPTGWLKCNGAAFSAEEYPEL------------------- 42 +G P+G P + P G+ NG E+P L Sbjct: 180 MGRFDNTPLGRPTFETTIQLSPGGYGALNGTVMKRAEWPWLWDHAQQSGMLGTEATREGN 239 Query: 43 ------AKAYPTNKLPDLRGEFIRGWDDGRGIDTGRSILSIQGYATEDHAHGLPSRSTIV 96 T + P+ RGEF+R D+GR +D+GR++ + Q HA G Sbjct: 240 EGKWSSGDGALTFRAPEGRGEFLRILDEGRSVDSGRAMGTFQPGTVHSHALGAQ-----G 294 Query: 97 TDATINFYFDEIWVNSGTDIIKRGNTNDAGLPAPDYGTFKTYKQSVDGLGAAASETRPRN 156 A + + D + + D P + TY+ + ++RPRN Sbjct: 295 AGAVGSRWSDSLSTVGANTREEIKIIGDLVNGGPTFPAGTTYQMDTANTLLYSFKSRPRN 354 Query: 157 IA 158 IA Sbjct: 355 IA 356 >UniRef50_A9AVC5 Tail Collar domain protein n=1 Tax=Herpetosiphon aurantiacus ATCC 23779 RepID=A9AVC5_HERA2 Length = 934 Score = 67.0 bits (162), Expect = 2e-10, Method: Composition-based stats. 
Identities = 37/169 (21%), Positives = 55/169 (32%), Gaps = 26/169 (15%) Query: 6 GSALPVGVPVPW--PSATPPTGWLKCNGAAFSAEEYPELAKAYPTNKLPDLRGEFIRGWD 63 GS++P G W P GWL CNG N PDLR F+ G Sbjct: 781 GSSIPSGTINMWSGADNALPGGWLLCNGQ----------------NGTPDLRNRFVVGAG 824 Query: 64 DGRGIDT--GRSILSIQGYATEDHAHGLPSRSTIVTDATINFYFDEIWVNSGTDIIKRGN 121 + T G +++ H H + + + T+ F G D+ K + Sbjct: 825 AAYPVGTTGGADSVTLAVNQMPSHNHAASTSNDGQHNHTLYFDTGGGGNGPGGDMAKTND 884 Query: 122 T---NDAGLPAPDYGTFKTYKQSVDGLGAAAS---ETRPRNIAFNYIVR 164 N + + SV + E RP A YI++ Sbjct: 885 GLQKNVIANFSVKTDKDGNHSHSVTIQNNGGNQAHENRPPFYALCYIMK 933 >UniRef50_A4NHY2 Probable tail fiber protein n=1 Tax=Haemophilus influenzae PittAA RepID=A4NHY2_HAEIN Length = 556 Score = 67.0 bits (162), Expect = 2e-10, Method: Composition-based stats. Identities = 35/177 (19%), Positives = 61/177 (34%), Gaps = 22/177 (12%) Query: 3 LGEGSALP------VGVPVPWPSATPPTGWLKCN--GAAFSAEEYPELAK----AYPT-N 49 LG + LP VG+ + P GW+ + + + YPEL + Y + N Sbjct: 216 LGNSNQLPDLTRSDVGMTAYFAVDNIPAGWIAFDEIATQVTEQRYPELYRHLIDKYGSIN 275 Query: 50 KLPDLRGEFIRGWDDGRGIDTGRSILSIQGYATEDHAHGLPSRSTIVTDATINFYFDEIW 109 +P + F+R G S+ IQ + H H +P D + + + Sbjct: 276 SVPKVADRFLR------NAGNGLSVGQIQEDDLKRHVHRVPIDYDSWFDDSSQGRNNSYF 329 Query: 110 -VNSGTDIIKRGNTNDAGLPAPDYGTFKTYKQSVDGLGAAASETRPRNIAFNYIVRA 165 + +T D G S + ETRP+++ ++A Sbjct: 330 DYTTFAQSSDLWSTLGYDNADGDNGFVSPKDTS--QMATGGDETRPKSLVLKLCIKA 384 Score = 56.6 bits (135), Expect = 3e-07, Method: Composition-based stats. Identities = 21/51 (41%), Positives = 32/51 (62%), Gaps = 2/51 (3%) Query: 6 GSALPVGVPVPWPSA-TPPTGWLKCNGAAFSAEEYPELAKAYP-TNKLPDL 54 G +P+G V +P A T P G+LK NG F+ + +P+L + +N+LPDL Sbjct: 175 GKGVPIGAVVSFPRAVTNPVGFLKANGTTFNQQTFPDLYRTLGNSNQLPDL 225 >UniRef50_A3GUE7 Tail fiber protein H, putative (Fragment) n=1 Tax=Vibrio cholerae NCTC 8457 RepID=A3GUE7_VIBCH Length = 250 Score = 67.0 bits (162), Expect = 2e-10, Method: Composition-based stats. 
Identities = 20/48 (41%), Positives = 26/48 (54%) Query: 9 LPVGVPVPWPSATPPTGWLKCNGAAFSAEEYPELAKAYPTNKLPDLRG 56 PVG +PW + P G+ G AF Y ELAK +P +PD+RG Sbjct: 203 FPVGGAIPWFTDVAPEGFGMFKGQAFDVNVYTELAKVFPNGIIPDMRG 250 >UniRef50_A3YA17 Prophage MuSo2, tail fiber protein, putative n=1 Tax=Marinomonas sp. MED121 RepID=A3YA17_9GAMM Length = 341 Score = 66.3 bits (160), Expect = 3e-10, Method: Composition-based stats. Identities = 31/112 (27%), Positives = 41/112 (36%), Gaps = 19/112 (16%) Query: 47 PTNKLPDLRGEFIRGWDDGRGIDTGRSILSIQGYATEDHAHGLPSRSTIVTDATINFYFD 106 T LP + GEFIR +DDGRG+D GR S Q A + H H + Sbjct: 242 STFTLPIVGGEFIRMFDDGRGVDDGRVFGSFQEDAFQGHWHATAEGGDTLAGNGYLA--- 298 Query: 107 EIWVNSGTDIIKRGNTNDAGLPAPDYGTFKTYKQSVDGLGAAASETRPRNIA 158 N + D +G+ A+ETR R+IA Sbjct: 299 ----------------NSPSYSSMDNNARDAVTDGQNGVPRMANETRSRSIA 334 >UniRef50_UPI00016C4891 hypothetical protein GobsU_00190 n=1 Tax=Gemmata obscuriglobus UQM 2246 RepID=UPI00016C4891 Length = 252 Score = 65.9 bits (159), Expect = 5e-10, Method: Composition-based stats. Identities = 38/199 (19%), Positives = 60/199 (30%), Gaps = 41/199 (20%) Query: 7 SALPVGVPVPWPSATPPT------------GWLKCNGAAFS-------AEEYPELAKAYP 47 ++ PVG + PP GWL C+G + + EL Sbjct: 52 TSPPVGTVTAFAGTWPPKRSDGGVWTEAEIGWLLCDGRKWEDKSLDGVRADLWELRAVLD 111 Query: 48 ------------TNKLPDLRGEFIRGWDD-------GRGIDTGRSILSIQGYATEDHAHG 88 LPD RG F+RG D GR R++ QGYAT A Sbjct: 112 GPNYIPQRSAPHALHLPDYRGYFLRGLDTSPFMGPAGRDKGEPRTVGLSQGYATARPAGK 171 Query: 89 LPSRSTIVTDATINFYFDEIWVNSGTDIIKRGNTNDAGLPAPDYGTF---KTYKQSVDGL 145 + + + + +T + K + Sbjct: 172 DAFTTDKKGAHSHPLKMELKASRKAGGAAENAHTVTSINDENKKSQLPLEKDGDHVHEIT 231 Query: 146 GAAASETRPRNIAFNYIVR 164 G +ETRP N+ ++++ Sbjct: 232 GGGDAETRPVNVVVYWVIK 250 >UniRef50_A3Y8Q8 Putative uncharacterized protein n=1 Tax=Marinomonas sp. MED121 RepID=A3Y8Q8_9GAMM Length = 303 Score = 65.5 bits (158), Expect = 6e-10, Method: Composition-based stats. 
Identities = 31/120 (25%), Positives = 40/120 (33%), Gaps = 21/120 (17%) Query: 39 YPELAKAYPTNKLPDLRGEFIRGWDDGRGIDTGRSILSIQGYATEDHAHGLPSRSTIVTD 98 Y T LP + GEFIR +DDGRG+D GR Q T+ + S Sbjct: 198 YWGEGNGVTTFTLPIVGGEFIRMFDDGRGVDAGRGFADYQSDLTKIPNGVILRVSNFGNG 257 Query: 99 ATINFYFDEIWVNSGTDIIKRGNTNDAGLPAPDYGTFKTYKQSVDGLGAAASETRPRNIA 158 +G D+ N + + ETRPRNIA Sbjct: 258 DGSYDLSGTSTSANGNDLHTAPRGNSSEVYY---------------------ETRPRNIA 296 >UniRef50_D2L4G0 Tail Collar domain protein n=1 Tax=Desulfovibrio sp. FW1012B RepID=D2L4G0_9DELT Length = 319 Score = 64.7 bits (156), Expect = 1e-09, Method: Composition-based stats. Identities = 31/149 (20%), Positives = 53/149 (35%), Gaps = 13/149 (8%) Query: 9 LPVGVPVPWPSATPPTG---WLKCNGAAF-SAEEYPELAKAYPTNKLPDLRGEFIRGWDD 64 +PVG +PWPS + P WL+CNG A S +Y L + +P+ G+F+RG Sbjct: 41 IPVGTVIPWPSTSMPADATRWLECNGQAVPSGSQYDRLRVVLGSKPIPNYNGQFLRG--- 97 Query: 65 GRGIDTGRSILSIQGYATEDHAHGLPSRSTIVTDATINFYFDEIWVNSGTDIIKRGNTND 124 + +T H H + + V+ + + + Sbjct: 98 ---TTVSSEVGQTVADSTRAHDHLIDAHQHTVSGTASGQSYGGAIASVSISGSTSSQSYS 154 Query: 125 ---AGLPAPDYGTFKTYKQSVDGLGAAAS 150 AG + + Y ++ G S Sbjct: 155 GTIAGQHITGATSGQAYGGNIAGQHVTGS 183 >UniRef50_A9LZ37 Tail fibre protein, putative n=21 Tax=Neisseria RepID=A9LZ37_NEIM0 Length = 658 Score = 64.7 bits (156), Expect = 1e-09, Method: Composition-based stats. 
Identities = 46/214 (21%), Positives = 69/214 (32%), Gaps = 62/214 (28%) Query: 6 GSALPVGVPVPWPSAT-PPTGWLKCNGAAFSAEEYPELAKAYP-TNKLPDLR-------G 56 +PVG V +P A P G+L+ +G F+ +P+L +A +N+LPDL Sbjct: 247 SDGIPVGAIVSFPKAVRNPAGYLRADGTTFAQNTFPDLYRALGNSNRLPDLSRTDIGITA 306 Query: 57 EFIR-----GW---DDGR----------------------------------GIDTGRSI 74 F GW DD R ++ Sbjct: 307 WFPSDQIPTGWLAFDDIRTRVTETAYPELYRLLTGKYGSIQNVPQAEDRFIRNAGNSLAV 366 Query: 75 LSIQGYATEDHAHGLPSRSTIVTDATINFYFDEIWVNSGTDIIKRG--NTNDAGLPAPDY 132 + Q + H H + S T TDA Y D + N ND G P Sbjct: 367 GTKQEDEIKRHVHKVFSHWTNHTDAAALGYEDRNERQRSALVSTWTDENLNDNGFLTP-- 424 Query: 133 GTFKTYKQSVDGLGAAASETRPRNIAFNYIVRAA 166 +S + E RP+ + ++AA Sbjct: 425 -------RSDSKMATGGDENRPKALVLKLCIKAA 451 >UniRef50_B0USC5 Phage Tail Collar domain protein n=1 Tax=Haemophilus somnus 2336 RepID=B0USC5_HAES2 Length = 652 Score = 64.7 bits (156), Expect = 1e-09, Method: Composition-based stats. Identities = 25/52 (48%), Positives = 33/52 (63%), Gaps = 2/52 (3%) Query: 6 GSALPVGVPVPWPSAT-PPTGWLKCNGAAFSAEEYPELAKAYP-TNKLPDLR 55 G LPVG + +P A P G+LKC+G+ F YP+L +A +NKLPDLR Sbjct: 224 GDGLPVGSVLAFPVAVQNPQGFLKCDGSTFGRTTYPDLYRALGNSNKLPDLR 275 Score = 46.6 bits (109), Expect = 3e-04, Method: Composition-based stats. Identities = 32/198 (16%), Positives = 60/198 (30%), Gaps = 35/198 (17%) Query: 3 LGEGSALP------VGVPVPWPSATPPTGWLKCN--GAAFSAEEYPELAKA----YPTN- 49 LG + LP VG+ + + P GW+ + E YPEL K Y + Sbjct: 265 LGNSNKLPDLRRSNVGMTAYFATDKIPEGWIAFDEIKEKVKKETYPELYKYLIEKYTSID 324 Query: 50 KLPDLRGEFIRGWDDG---RGIDTGRSILSIQGYATEDHAHGLPSRSTIVTDATINFYFD 106 +P F+R +G + G I + + + + + FD Sbjct: 325 NVPKAEDRFLRNAHNGLKVGDVQLGSLIGTDSIDGNGAFSPYVKAIKNTYQETVDQVGFD 384 Query: 107 EIWVNSGTDIIKRGNTNDAGLPAPDYGTFKTYKQSVDGLGAAASE--------------- 151 + + + N + + +GA +E Sbjct: 385 PLLIGDIGNGRTAVNNEHSPDSGQPETKKQYNAGLSWSVGAGRNEDLNQPIQNNKSGNGH 444 Query: 152 ----TRPRNIAFNYIVRA 165 TRP+++ ++A Sbjct: 445 FVGVTRPKSLVLKLCIKA 462 >UniRef50_A6EAC0 Microcystin-dependent protein n=1 Tax=Pedobacter sp. 
BAL39 RepID=A6EAC0_9SPHI Length = 185 Score = 64.3 bits (155), Expect = 1e-09, Method: Composition-based stats. Identities = 36/171 (21%), Positives = 47/171 (27%), Gaps = 18/171 (10%) Query: 11 VGVPVPWPSATPPTGWLKCNGAAFSAEEYPELAKAYPT---------NKLPDLRGEFIRG 61 +G P+ P GWL CNGA + +Y L T K+P+L+GE I G Sbjct: 5 IGEVRPFAFDWIPDGWLACNGATYPLAQYQALYSVIGTVYGGTLGQNFKVPNLQGEAIIG 64 Query: 62 WDDGRGIDT-------GRSILSIQGYATEDHAHGLPSRSTIVTDATINFYFDEIWVNSGT 114 G G + +H H T N G Sbjct: 65 AGQGPTTSAYTLAQTGGTEKAGLTVNQIPNHDHVFNGAIGATGFRTNTAGNTSYLTNFGY 124 Query: 115 DIIKRGNTNDAGLPAPDYG--TFKTYKQSVDGLGAAASETRPRNIAFNYIV 163 A P T G A E R +A Y + Sbjct: 125 GGAGATTFTSASGYVPPGTPDTLLNPSSVTQTGGGGAHENRQPYLAVTYAI 175 >UniRef50_A9AVE2 Tail Collar domain protein n=1 Tax=Herpetosiphon aurantiacus ATCC 23779 RepID=A9AVE2_HERA2 Length = 865 Score = 64.0 bits (154), Expect = 2e-09, Method: Composition-based stats. Identities = 38/192 (19%), Positives = 61/192 (31%), Gaps = 44/192 (22%) Query: 3 LGEGSALPVGVPVPWPSATPPTGWLKCNGAAFSAEEYPELAKAYPTNKLPDLRGEFIRGW 62 GS +P G W P GW C+G + + PDLR FI G Sbjct: 687 YSNGSPIPCGTIQMWSGMEVPEGWAICDGREAN------------GLRTPDLRNRFIVG- 733 Query: 63 DDGRGIDTGR--SILSIQG------------YATEDHAHGLPSRSTIVTDATIN------ 102 G D+G + QG H HG + + + Sbjct: 734 -AGANYDSGNLSVYGTNQGTTGGSDVVALTLDQMPRHTHGGSTNAAGDHSHWVEGTDADG 792 Query: 103 ------FYFDEIWVNSGTDIIKRGNTND---AGLPAPDYGTFKTYKQSVDGLGAA-ASET 152 ++ + V+ G + + ND G D ++ + +G + A E Sbjct: 793 LAKRRRHHWGDTTVDMGFGGGRNADPNDERWRGRVNTDNAGTHSHGLMIGEVGGSQAHEN 852 Query: 153 RPRNIAFNYIVR 164 RP A +I++ Sbjct: 853 RPPFYALAFIMK 864 >UniRef50_C3X3K6 Predicted protein n=2 Tax=Oxalobacter formigenes HOxBLS RepID=C3X3K6_OXAFO Length = 500 Score = 64.0 bits (154), Expect = 2e-09, Method: Composition-based stats. 
Identities = 23/64 (35%), Positives = 32/64 (50%), Gaps = 10/64 (15%) Query: 8 ALPVGVPVPWPSATPPTGWLKCNGAAFSAEEYPELAKAYP----------TNKLPDLRGE 57 +PVG V + ++ P G+LKC+GAA + YP+L A T LPD+ G Sbjct: 212 GIPVGTVVMFSASEAPAGYLKCDGAAVGRDTYPDLFAAIGTVFGAGDGETTFNLPDMIGR 271 Query: 58 FIRG 61 F G Sbjct: 272 FAEG 275 >UniRef50_C7PCL6 Putative uncharacterized protein n=1 Tax=Chitinophaga pinensis DSM 2588 RepID=C7PCL6_CHIPD Length = 439 Score = 63.6 bits (153), Expect = 2e-09, Method: Composition-based stats. Identities = 29/130 (22%), Positives = 40/130 (30%), Gaps = 29/130 (22%) Query: 35 SAEEYPELAKAYPTNKLPDLRGEFIRGWDDGRGIDTGRSI-------LSIQGYATEDHAH 87 SA+ Y T ++PDLRG F R D G ID R S Q + H H Sbjct: 324 SAKTYWGWGDGVNTLQVPDLRGYFPRWLDLGANIDADRVASSLQNKPGSAQSDEFKSHTH 383 Query: 88 GLPSRSTIVTDATINFYFDEIWVNSGTDIIKRGNTNDAGLPAPDYGTFKTYKQSVDGLGA 147 + ++ + + + G T Sbjct: 384 TWRAETSNDSLGGTGWV----------------------TSSSGNGGAGTNTLHAANDAT 421 Query: 148 AASETRPRNI 157 SETRP+NI Sbjct: 422 GGSETRPKNI 431 >UniRef50_C3X8R9 Bacteriophage tail fiber protein n=6 Tax=Oxalobacter formigenes OXCC13 RepID=C3X8R9_OXAFO Length = 398 Score = 63.6 bits (153), Expect = 3e-09, Method: Composition-based stats. Identities = 32/166 (19%), Positives = 50/166 (30%), Gaps = 30/166 (18%) Query: 10 PVGVPVPWPSATPPTGWLKCNGAAFSAEEYPELAKAYP----------TNKLPDLRGEFI 59 P+G + A P+G+LK +GA E YP+L A T LPDL G F Sbjct: 108 PIGSIDYFAMAALPSGYLKADGAEVGRETYPDLFAAIGTVFGEGNGETTFNLPDLIGRFP 167 Query: 60 RGWDDGRGIDTGRSILSIQGYATEDHAHGLPSRSTIVTDATINFYFDEIWVNSGTDIIKR 119 +G + R +Q + +++ FY +D Sbjct: 168 QG--------SARPGQRVQA-GLPNITGKFRAKAAAGEIPGGAFYGIGNIGGGSSDNSA- 217 Query: 120 GNTNDAGLPAPDYGTFKTYKQSVDGLGAAASETRPRNIAFNYIVRA 165 N + G A +V ++A Sbjct: 218 PNYEEIGFDASKSNLIYGASDTVQPAALT----------LLACIKA 253 >UniRef50_Q094A8 Phage Tail Collar Domain family n=1 Tax=Stigmatella aurantiaca DW4/3-1 RepID=Q094A8_STIAU Length = 645 Score = 63.2 bits (152), Expect = 3e-09, Method: Composition-based stats. 
Identities = 29/171 (16%), Positives = 54/171 (31%), Gaps = 14/171 (8%) Query: 6 GSALPVGVPVPWPSATPPTGWLKCNGAAFSAEEYPELAKAY------------PTNKLPD 53 G +PVG + + ++ P GWL C+G+ S Y +L +LP Sbjct: 476 GWLVPVGTIIAYGGSSAPEGWLLCDGSTKSKTAYADLFAVIGDTYKGSSAPPSGQFRLPS 535 Query: 54 LRGEFIRGWDDGRGIDTGRSILSIQGYATEDHAHGLPSRSTIVTDATINFYFDEIWVNSG 113 L G + + + T + +P V D + Sbjct: 536 LMARVPMGASVSSPHNYPLGTMGGEFTHTLTIS-EMPVHDHYVNDPGHSHSITTTNAEGS 594 Query: 114 TDIIKRGNTNDAGLPAPDYGTFKTYKQSVDGLGAAASETRPRNIAFNYIVR 164 D+ + + + P +G G A + +P N+I++ Sbjct: 595 GDLRPNRDASKGHVDIPTNHVTTGVTLDTNGGGQAHNNMQP-YTTVNFIIK 644 >UniRef50_C6S6V6 Putative phage tail fibre protein n=1 Tax=Neisseria meningitidis alpha14 RepID=C6S6V6_NEIML Length = 728 Score = 62.8 bits (151), Expect = 4e-09, Method: Composition-based stats. Identities = 42/212 (19%), Positives = 67/212 (31%), Gaps = 58/212 (27%) Query: 6 GSALPVGVPVPWPSAT-PPTGWLKCNGAAFSAEEYPELAKAYP-TNKLPDLR-------G 56 +PVG V +P A P G+L+ +G F+ +P+L +A +N+LPDL Sbjct: 317 SDGIPVGAIVSFPKAVRNPAGYLRADGTTFAQNTFPDLYRALGNSNRLPDLSRTDIGITA 376 Query: 57 EFIR-----GW---DDGR----------------------------------GIDTGRSI 74 F GW DD R ++ Sbjct: 377 WFPSDQIPTGWLAFDDIRTRVTETAYPELYRLLTGKYGSIQNVPQAEDRFIRNAGNSLAV 436 Query: 75 LSIQGYATEDHAHGLPSRSTIVTDATINFYFDEIWVNSGTDIIKRGNTNDAGLPAPDYGT 134 + Q + H H + S T TD Y D + + N + Sbjct: 437 GTKQEDEIKRHTHKVFSHWTSHTDVAAVGYEDGNERQRSALVSTWTDENLSD------NG 490 Query: 135 FKTYKQSVDGLGAAASETRPRNIAFNYIVRAA 166 F T + + E RP+ + ++AA Sbjct: 491 FLTPRLD-SKMATGGDENRPKALVLKLCIKAA 521 >UniRef50_Q4C9U4 Phage Tail Collar n=1 Tax=Crocosphaera watsonii WH 8501 RepID=Q4C9U4_CROWT Length = 253 Score = 62.4 bits (150), Expect = 6e-09, Method: Composition-based stats. 
Identities = 33/163 (20%), Positives = 53/163 (32%), Gaps = 15/163 (9%) Query: 6 GSALPVGVPVPWPSATPPTGWLKCNGAAFSAEEYPELAKAYPT--------NKLPDLRGE 57 GS +P V + A P GWL C+G + YP+L A ++PD+R Sbjct: 69 GSIIPKSSIVVFGGAVAPNGWLFCDGTPYDPSTYPQLFSAIGYGFGQVGSLFRVPDMRDR 128 Query: 58 FIRGWDDG--RGIDTGRSILSIQGYATEDHAHGL--PSRSTIVTDATINFYFDEIWVNSG 113 G RG G + S+ H+H + P + + + ++ Sbjct: 129 SPVGAGISFDRGTFGGSATTSLSVDNMPAHSHNVIDPGHTHSMNHGPGQHSAVALDYHNA 188 Query: 114 TDIIKRGNTNDAGLPAPDYGTFKTYKQSVDGLGAAASETRPRN 156 + + G Y + G G S RN Sbjct: 189 GNGVDAYVPQWGGHAHTIYASGVGISLENTGSGTPVS---VRN 228 >UniRef50_A6EAB8 Phage Tail Collar n=1 Tax=Pedobacter sp. BAL39 RepID=A6EAB8_9SPHI Length = 183 Score = 61.3 bits (147), Expect = 1e-08, Method: Composition-based stats. Identities = 37/169 (21%), Positives = 59/169 (34%), Gaps = 19/169 (11%) Query: 12 GVPVPWPSATPPTGWLKCNGAAF----SAEEYPELAKAYPT-----NKLPDLRGEFIRGW 62 G + P W+ CNGA + Y + Y + K+PDLRG G Sbjct: 7 GEIRAFAGTYAPVDWMMCNGATLTVQGNEALYSLIGSTYGSNGPTDFKVPDLRGRLTVGQ 66 Query: 63 DDGRGIDTGRSILSIQG--------YATEDHAHGLPSRSTIVTDATINFYFDEIWVNSGT 114 G G+ T R + S+ G H H L + ST+ + A++N + ++ Sbjct: 67 GLGTGL-TSRILGSVGGAETVALTEAQLPAHNHNL-TVSTVTSPASVNAPSNTSYLGVVN 124 Query: 115 DIIKRGNTNDAGLPAPDYGTFKTYKQSVDGLGAAASETRPRNIAFNYIV 163 G G + + G+ A +A NYI+ Sbjct: 125 SSAGAGVGYVPGNATGASVRALDTQVLSNTGGSQAHANIMPFLALNYII 173 >UniRef50_B6S308 ORF32 (Fragment) n=3 Tax=Enterobacteriaceae RepID=B6S308_SALDU Length = 427 Score = 60.5 bits (145), Expect = 2e-08, Method: Composition-based stats. Identities = 17/46 (36%), Positives = 23/46 (50%) Query: 5 EGSALPVGVPVPWPSATPPTGWLKCNGAAFSAEEYPELAKAYPTNK 50 ++ PVG + WPS P G+ G +F YP LA AYP+ Sbjct: 382 PPNSYPVGAAIAWPSDATPAGYALMQGQSFDKSAYPLLAIAYPSGI 427 >UniRef50_A4YX40 Putative uncharacterized protein n=1 Tax=Bradyrhizobium sp. ORS278 RepID=A4YX40_BRASO Length = 549 Score = 60.5 bits (145), Expect = 2e-08, Method: Composition-based stats. 
Identities = 38/164 (23%), Positives = 51/164 (31%), Gaps = 24/164 (14%) Query: 23 PTGWLKCNG--AAFSAEEYPELAKAYPT---------NKLPDLRGEFIRGWDDGRGIDTG 71 PTGW+ CNG A SA P A + +PDLRG+F+RG G G D Sbjct: 213 PTGWIYCNGMPQAISASS-PAFANTLGSNFGGDGVSVFNVPDLRGQFLRGTSHGTGRDPN 271 Query: 72 RSI------LSIQGYATEDHAHG------LPSRSTIVTDATINFYFDEIWVNSGTDIIKR 119 SI G A H +P D T + Sbjct: 272 ASIRYALLGGGNTGDAVGSAQHYSTANGVIPISVAATGDHTHAQALVPANDHHAAYGASG 331 Query: 120 GNTNDAGLPAPDYGTFKTYKQSVDGLGAAASETRPRNIAFNYIV 163 + D+ + ETRP NI ++ + Sbjct: 332 PAAYNTMEWTDDWTNTTKAGAHTHSVTGGDKETRPVNIYLDWFI 375 Score = 57.0 bits (136), Expect = 2e-07, Method: Composition-based stats. Identities = 34/177 (19%), Positives = 52/177 (29%), Gaps = 30/177 (16%) Query: 7 SALPVGVPVPWPSATPP---------TGWLKCNGAAFSAEE--YPELAKAYP-------- 47 A P+G + GWL C G + Y L K Sbjct: 382 DAPPIGSVTAYGGDVTSIDNLTSLLADGWLPCVGQKLKKNDPTYAALYKVIGATFGQDNL 441 Query: 48 TNKLPDLRGEFIRGWDDGRGIDTGRSILSIQGYATEDHAHGLPSRSTIVTDATINFYFDE 107 LPDLRG F+ G + + +IQ + + T Sbjct: 442 NFYLPDLRGYFVMGAGQAK-------VGAIQA---QSTTCQPITPFTTTPIGDHTHQVTG 491 Query: 108 IWVNSGTDIIKRGNTNDAGLPAPDYGTFKTYKQSVDGLGAAASETRPRNIAFNYIVR 164 I ++ I + A + S + +E+RP NI +YI+R Sbjct: 492 IPTDT-HTIDVVAGWDLAENNPNTTASTVAGNHSHQIVAGGDAESRPVNINVDYIIR 547 Score = 55.9 bits (133), Expect = 4e-07, Method: Composition-based stats. 
Identities = 37/193 (19%), Positives = 67/193 (34%), Gaps = 38/193 (19%) Query: 3 LGEGSALPVGVPVPWP---------SATPPTGWLKCNGAAFSAEEYPELAKAY------- 46 + S LP+G + SA GWL C+G++++ +Y L A Sbjct: 1 MASTSNLPIGFVCMFAGDLSVSTVRSALIAAGWLPCDGSSYATSQYAALYTAIGNAHGGS 60 Query: 47 -PTNKLPDLRGEFIRGWDDGRGIDT--------------GRSILSIQGYATEDHAHGLPS 91 +P+L G F+RG ID G ++ S+Q AT LP+ Sbjct: 61 GGNFNVPNLTGRFVRGTTATATIDPDAGTRTAAAPGGATGNAVGSLQAGAT-----ALPT 115 Query: 92 RSTIVTDATINFYFDEIWVNSGTDIIKRGNTNDAGLPAPDYGTFKTYKQSVDGLGAAASE 151 ++ +F+ + + ++ + D+ T + Sbjct: 116 NPWVLAQDGDHFHAYQHLDTNMHEV--WSGSTDSMARWSTTVTIGAAGGHFHTMSGGDPA 173 Query: 152 TRPRNIAFNYIVR 164 T P N A +++R Sbjct: 174 TLPVNAALYWVIR 186 >UniRef50_B5JF21 Phage Tail Collar Domain family n=1 Tax=Verrucomicrobiae bacterium DG1235 RepID=B5JF21_9BACT Length = 373 Score = 60.5 bits (145), Expect = 2e-08, Method: Composition-based stats. Identities = 37/165 (22%), Positives = 59/165 (35%), Gaps = 25/165 (15%) Query: 9 LPVGVPVPWPSAT--PPTGWLKCNGAAFSAEEYPELAKAYPTNKLPDLRGEFIRGWDDGR 66 +P+G + W +T P GW CNG + P+LR FI G G Sbjct: 223 IPIGGIIMWSGSTSNIPAGWRLCNG----------------SGGTPNLRDRFIVGAGGGY 266 Query: 67 GIDT--GRSILSIQGYATEDHAHGLPSRSTIVTDATINFYFDEIWVNSGTDIIKRGNTND 124 G++ G S +++ H H + +RS + +G + +D Sbjct: 267 GVNATGGASSVTLTTAQMPSHDHDVWTRSGDGKHWHYRSVDNSAPHPNGDGRVDLSTESD 326 Query: 125 -----AGLPAPDYGTFKTYKQSVDGLGAAASETRPRNIAFNYIVR 164 A L + G + Y + A E RP A +I+R Sbjct: 327 NANWNANLNSSADGAHQHYVDTPKRGSGQAHENRPPYYALAFIMR 371 >UniRef50_B9M3Z7 Tail Collar domain protein n=1 Tax=Geobacter sp. FRC-32 RepID=B9M3Z7_GEOSF Length = 173 Score = 60.1 bits (144), Expect = 2e-08, Method: Composition-based stats. 
Identities = 30/165 (18%), Positives = 48/165 (29%), Gaps = 19/165 (11%) Query: 11 VGVPVPWPSATPPTGWLKCNGAAFSAEEYPELAKAYPT---------NKLPDLRGEFIRG 61 +G + P W C+G+ +Y L T KLPD RG Sbjct: 6 IGEIRMFGGNFAPVDWALCDGSTLQISQYDVLYAVIGTYFGGDGITNFKLPDFRGRIPVH 65 Query: 62 WDDGRGIDT---GRSILSIQGYATEDHAHGLPSRSTIVTDATINFYFDEIWVNSGTDIIK 118 G+G+ G + + Q H ++ +AT NS + Sbjct: 66 MGTGQGLTPRGIGNAFGTEQETLQVAHIPAHNHVVSVGANATTAAPAGNYLGNSSNFSLY 125 Query: 119 RGNTNDAGLPAPDYGTFKTYKQSVDGLGAAASETRPRNIAFNYIV 163 D+ L G F A ++ N+I+ Sbjct: 126 STAAADSLLNQDTVGFFPA-------APAQPHSNMMPSLCVNFII 163 >UniRef50_Q2W7B2 Microcystin-dependent protein n=1 Tax=Magnetospirillum magneticum AMB-1 RepID=Q2W7B2_MAGSA Length = 192 Score = 60.1 bits (144), Expect = 3e-08, Method: Composition-based stats. Identities = 33/183 (18%), Positives = 51/183 (27%), Gaps = 36/183 (19%) Query: 12 GVPVPWPSATPPTGWLKCNGAAFSAEEYPELAKAYP---------TNKLPDLRGEFIRGW 62 G + + + P W C+G S +YP L T LPDLR G+ Sbjct: 6 GQIILFSGSYAPVNWAVCDGHQLSVSQYPALFSLLGTQFGGNGTTTFGLPDLRSRLAMGF 65 Query: 63 DDGRGIDT-----------------GRSILSIQGYATEDHAHGLPSRSTIVTDATINFYF 105 G +D G +++ H H L + V + Sbjct: 66 GTGH-VDPKASNSAPLTPYGFATNGGVETVTLTQAQIPPHTHTLNASGDPVVSPNPSGGV 124 Query: 106 DEIWVNSGTDIIKRGNTNDAGLPAPDYGTFKTYKQSVDGLGAAA----SETRPRNIAFNY 161 + + D P P T + + A E R + Y Sbjct: 125 PASFTDG-----THVAYFDTPNPIPSGMTITPKQLGASMVTTAGASQPHENRMPYLGLMY 179 Query: 162 IVR 164 I+R Sbjct: 180 IIR 182 >UniRef50_B3QRT1 Tail Collar domain protein n=1 Tax=Chloroherpeton thalassium ATCC 35110 RepID=B3QRT1_CHLT3 Length = 176 Score = 59.7 bits (143), Expect = 3e-08, Method: Composition-based stats. 
Identities = 32/170 (18%), Positives = 52/170 (30%), Gaps = 28/170 (16%) Query: 11 VGVPVPWPSATPPTGWLKCNGAAF----SAEEYPELAKAYP-----TNKLPDLRGEFIRG 61 +G + P GW +CNG + Y L Y T +PDLRG + G Sbjct: 7 IGEIRLFGFGWAPDGWAQCNGQLLLINENQALYSLLGTMYGGDARSTFGVPDLRGRAVIG 66 Query: 62 WDDGRGID--------TGRSILSIQGYATEDHAHGLPSRSTIVTDATINFYFDEIWVNSG 113 + + G +++ H H L + T +N Sbjct: 67 YGQSPKLSYSYQMSQWGGEETVTLGVAQIPAHNHTLIADGATGT-----------LLNPQ 115 Query: 114 TDIIKRGNTNDAGLPAPDYGTFKTYKQSVDGLGAAASETRPRNIAFNYIV 163 + + G A + D + G+ E R +A NY + Sbjct: 116 NNYLAEGAFPGAAFYSADKSVAMNQGTIGNTGGSQPHENRSPYLALNYCI 165 >UniRef50_D0LMW0 Tail Collar domain protein n=1 Tax=Haliangium ochraceum DSM 14365 RepID=D0LMW0_HALO1 Length = 264 Score = 59.3 bits (142), Expect = 4e-08, Method: Composition-based stats. Identities = 31/137 (22%), Positives = 44/137 (32%), Gaps = 18/137 (13%) Query: 11 VGVPVPWPSATPPTGWLKCNGAAFSAEEYPELAKAYP----------TNKLPDLRGEFIR 60 G + T P GWL C+G+ ++YPEL A T LPD RG + Sbjct: 112 AGTLALSAAETAPDGWLFCDGSPLIRDDYPELFAAIGETYGAGDGVNTFVLPDCRGRTLI 171 Query: 61 GWDDGRGIDT---GRSILSIQGY----ATEDHAHGLPSRSTIVTDATINFYFDEIWVNSG 113 G G G+ G + + + H H T + W N Sbjct: 172 GAGQGNGLSDRQRGDVVGAEEHTLTIPEMPSHTH-AEHPGTGTLWFQVFERGPGTWPNER 230 Query: 114 TDIIKRGNTNDAGLPAP 130 + +T G P Sbjct: 231 SGNTLGQSTGATGGNQP 247 >UniRef50_C3X3R8 Predicted protein n=4 Tax=Oxalobacter formigenes HOxBLS RepID=C3X3R8_OXAFO Length = 365 Score = 58.6 bits (140), Expect = 7e-08, Method: Composition-based stats. Identities = 23/64 (35%), Positives = 29/64 (45%), Gaps = 10/64 (15%) Query: 8 ALPVGVPVPWPSATPPTGWLKCNGAAFSAEEYPELAKAYP----------TNKLPDLRGE 57 +PVG PP G+LKC+GAA + YP+L A T LPD+ G Sbjct: 87 GVPVGSIDWLAVPEPPAGYLKCDGAAIGRDTYPDLFAAIGTTFGAGDGETTFNLPDMIGR 146 Query: 58 FIRG 61 F G Sbjct: 147 FAEG 150 >UniRef50_B0UTN0 Phage Tail Collar domain protein n=1 Tax=Haemophilus somnus 2336 RepID=B0UTN0_HAES2 Length = 699 Score = 58.2 bits (139), Expect = 1e-07, Method: Composition-based stats. 
Identities = 39/194 (20%), Positives = 66/194 (34%), Gaps = 34/194 (17%) Query: 6 GSALPVGVPVPWPSA-TPPTGWLKCNGAAFSAEEYPELAKAYPT-NKLPDL-RGE----- 57 G +P+G V +P A T PTG+LKC+G YP+L + N LP+L R + Sbjct: 347 GDGVPLGAIVAFPKAITNPTGFLKCDGTTIDQRTYPDLYRTLGNKNTLPNLTRSDVGMTA 406 Query: 58 ------FIRGW------DDGRGIDTGRSILSIQGYATEDHAHGLPSRSTIVTDATINFYF 105 GW + DT + + + + +A Sbjct: 407 YFATDNIPDGWIAFDEIKEKVKEDTYPELYKYLIEKYTSIDNVPKAEDRFLRNAANELVV 466 Query: 106 D------------EIWVNSGTDIIKRGNTNDAGLPAPDYGTFKTYKQSVDGLG--AAASE 151 N + + N+ + + T YK +G + A E Sbjct: 467 GRVQEDAIKTHYLNYGTNHNSSNYQFHVDNNDTIATGNNKTTDNYKIRTNGAIFYSGAEE 526 Query: 152 TRPRNIAFNYIVRA 165 TRP+++ ++A Sbjct: 527 TRPKSLVLKLCIKA 540 >UniRef50_A1TNG3 Phage Tail Collar domain protein n=9 Tax=Bacteria RepID=A1TNG3_ACIAC Length = 176 Score = 57.4 bits (137), Expect = 1e-07, Method: Composition-based stats. Identities = 32/168 (19%), Positives = 49/168 (29%), Gaps = 25/168 (14%) Query: 12 GVPVPWPSATPPTGWLKCNGAAFSAEEYPELAKAYP---------TNKLPDLRGEFIRGW 62 G + PP GW C G + L T LPDLRG G Sbjct: 8 GEISMFAGNFPPKGWAFCQGQILPIAQNSALFALLGTTYGGNGQTTFALPDLRGRVPLGQ 67 Query: 63 DDGRGIDT-------GRSILSIQGYATEDHAHGLPSRSTIVTDATINFYFDEIWVNSGTD 115 G G+ G+ +++QG H H T +++ +G Sbjct: 68 GQGPGLQPYSQGQVGGQETVTLQGNQMPMHTH--------TTSVSVSSNAGNSAAPNGRY 119 Query: 116 IIKRGNTNDAGLPAPDYGTFKTYKQSVDGLGAAASETRPRNIAFNYIV 163 + ND G+ G + E + N+I+ Sbjct: 120 LAASDQRNDQYTDQSGNGSLAGVTTGFAG-NSLPHENMQPYLCINFII 166 >UniRef50_D1BW55 Tail Collar domain protein n=1 Tax=Xylanimonas cellulosilytica DSM 15894 RepID=D1BW55_XYLCX Length = 443 Score = 57.4 bits (137), Expect = 2e-07, Method: Composition-based stats. 
Identities = 36/160 (22%), Positives = 54/160 (33%), Gaps = 25/160 (15%) Query: 6 GSALPVGVPVPWPSATPPTGWLKCNGAAFSAEEYPELAKAYP----------TNKLPDLR 55 +A PVG+ + T P GWL+ NGAA S YP L Y T LP+ + Sbjct: 223 NAACPVGMEAGF--HTVPPGWLEHNGAAVSRTTYPALFAHYGTTYGAGDGSTTFNLPNAK 280 Query: 56 GEFIRGWDDGR------GIDTGRSILSIQGYATEDHAHGLPSRSTIVTDATINFYFDEIW 109 G G D + G G ++ H H + T + D Sbjct: 281 GRTPVGLDTAQAEFNAVGKTGGAKTHTLSTAEMPSHTHTSAA-------HTHSINHDHAA 333 Query: 110 VNSGTDIIKRGNTNDAGLPAPDYGTFKTYKQSVDGLGAAA 149 V S + ++ +G+ Y + S +G Sbjct: 334 VTSSSAGSHTHGSSTSGITDRAYFARGSAPASSATVGTNG 373 >UniRef50_C5RJD9 Tail Collar domain protein n=1 Tax=Clostridium cellulovorans 743B RepID=C5RJD9_CLOCL Length = 199 Score = 56.6 bits (135), Expect = 3e-07, Method: Composition-based stats. Identities = 35/148 (23%), Positives = 48/148 (32%), Gaps = 16/148 (10%) Query: 13 VPVPWPSATPPTGWLKCNGAAFSAEEYPELAKAYP---------TNKLPDLRGEFIRGWD 63 + WP P GWL C G +Y L T KLPDLRG G Sbjct: 7 QIILWPGNFVPRGWLACEGQELPINQYTALYSLLGTTYGGNGSTTFKLPDLRGRVPVGSG 66 Query: 64 DGRGID------TGRSILSIQGYATEDHAHGLP-SRSTIVTDATINFYFDEIWVNSGTDI 116 GI+ G +++ H H ++ + + I F E N+ + Sbjct: 67 ICGGINFQQGNSGGNFNVTLTQQQMPAHTHSTTVTQGAVTVNGGIPFNGGEGTTNTPSAS 126 Query: 117 IKRGNTNDAGLPAPDYGTFKTYKQSVDG 144 K AG P+ SV G Sbjct: 127 SKLAVGITAGGDIPNIYNTSEATGSVTG 154 >UniRef50_C6X0H3 Microcystin dependent protein n=1 Tax=Flavobacteriaceae bacterium 3519-10 RepID=C6X0H3_FLAB3 Length = 188 Score = 56.3 bits (134), Expect = 3e-07, Method: Composition-based stats. 
Identities = 27/151 (17%), Positives = 41/151 (27%), Gaps = 13/151 (8%) Query: 12 GVPVPWPSATPPTGWLKCNGAAFSAEEYPELAKAYP---------TNKLPDLRGEFIRGW 62 G P GW +CNG + L T LPD+RG + Sbjct: 42 GQIAFVAFTFAPKGWAECNGQLLPISQNTALFSLLGTTYGGNGQTTFALPDMRGRVLIHN 101 Query: 63 DDGRGIDTGRSILSIQGYATEDHAHGLPSRSTIVTDATINFYFDEIWVNSGTDIIKRGNT 122 G G+ + + Q TE+H + + + V +G + Sbjct: 102 GQGNGL-SNYELG--QTGGTENHTLTIAEMPQHIHNVNAVSAEGNQNVPTGN-LPANTKA 157 Query: 123 NDAGLPAPDYGTFKTYKQSVDGLGAAASETR 153 D T G+ E R Sbjct: 158 LDKEYADSTANTTMNLGMISPAGGSQPHENR 188 >UniRef50_Q8PR98 Microcystin dependent protein n=1 Tax=Xanthomonas axonopodis pv. citri RepID=Q8PR98_XANAC Length = 195 Score = 55.9 bits (133), Expect = 5e-07, Method: Composition-based stats. Identities = 32/170 (18%), Positives = 50/170 (29%), Gaps = 28/170 (16%) Query: 11 VGVPVPWPSATPPTGWLKCNGAAFSAEEYPELAKAYP---------TNKLPDLRGEFIRG 61 +G +P P GWL C G S +Y L T LPDLRG + G Sbjct: 6 IGEVRAFPYNFAPEGWLDCMGQTVSINQYQALFGVIGFAYGGDKQTTFGLPDLRGRAVTG 65 Query: 62 WDDGRGIDTGRSILSIQG---------YATEDHAHGLPS---RSTIVTDATINFYFDEIW 109 G G+ + +I +QG H H + + A +N + Sbjct: 66 QGQGPGL-SNYTIGQLQGTDSVALVSSTQLPAHTHSITTMFLPPATAPGAAVNTPSSSSY 124 Query: 110 VNSGTDIIKRGNTNDAGLPAPDYGTFKTYKQSVDGLGAAAS------ETR 153 ++ + T+ + + S E R Sbjct: 125 LSRLLNPTTSPPTSYKAYAPATTTPMVQLSPNALAPFPSGSQAVQAHENR 174 >UniRef50_D1NFN8 Apo-citrate lyase phosphoribosyl-dephospho-CoA transferase (Fragment) n=1 Tax=Haemophilus influenzae HK1212 RepID=D1NFN8_HAEIN Length = 301 Score = 55.5 bits (132), Expect = 6e-07, Method: Composition-based stats. Identities = 20/50 (40%), Positives = 31/50 (62%), Gaps = 2/50 (4%) Query: 7 SALPVGVPVPWPSA-TPPTGWLKCNGAAFSAEEYPELAKAYP-TNKLPDL 54 +P G V +P A T P G+LK NG+ F+ + +P+L + +N+LPDL Sbjct: 63 DGIPTGAVVSFPRAVTNPVGFLKANGSTFNQQTFPDLYRVLGNSNQLPDL 112 >UniRef50_B1JGT8 Putative uncharacterized protein n=1 Tax=Yersinia pseudotuberculosis YPIII RepID=B1JGT8_YERPY Length = 472 Score = 55.5 bits (132), Expect = 7e-07, Method: Composition-based stats. 
Identities = 34/184 (18%), Positives = 57/184 (30%), Gaps = 43/184 (23%) Query: 19 SATPPTGWLKCNGAAFSAEEYPE----------------------------LAKAYPTNK 50 P GW +G +P+ + T + Sbjct: 150 RTYIPEGWAPADGIILDRALWPDAWDAIQVGYSRVTDESWIRDPILRGCFSIGNGSTTFR 209 Query: 51 LPDLRGE--------FIRGWDDGRGIDTGRSILSIQGYATEDHAHGLPSRSTIVTDATIN 102 +PDL G+ F+RG G ++ I IQG A + S + +A Sbjct: 210 IPDLNGKSEGSLGAAFLRG----DGKNSFGEIGRIQGDAIRNITGDFGSLGGQINNAYGI 265 Query: 103 FYFDEIWVNSGTDIIKRGNTNDAGLPAPDYGTFKTYKQSVDGLGAAASETRPRNIAFNYI 162 + V G + G A + P G+ + + A++ RP N YI Sbjct: 266 VIGSKNGVFVGHG--ENGRPTSANIGQPALGS-EFVAFDASRVVPTAADNRPVNATGCYI 322 Query: 163 VRAA 166 ++ A Sbjct: 323 IKLA 326 >UniRef50_B5RPA6 Uncharacterized conserved protein n=73 Tax=Borrelia RepID=B5RPA6_BORDL Length = 265 Score = 55.1 bits (131), Expect = 8e-07, Method: Composition-based stats. Identities = 27/138 (19%), Positives = 48/138 (34%), Gaps = 9/138 (6%) Query: 26 WLKCNGAAFSAEEYPELAKAYPTNKLPDLRGEFIRGWDDGRGIDTGRSILSIQGYATEDH 85 + +G + + Y + P L G F+R +D RS+ QGYA ++H Sbjct: 129 FCLPDGRSLPSNCYAT--RVLGITSAPSLSGRFLRQYDAS----NSRSLGDTQGYALKNH 182 Query: 86 AHGLPSRSTIVTDATINFYFDEIWVNSGTDIIKRGNTNDAGLPAPDYGTFKTYKQSVDGL 145 H + + + N + + + G G Y Sbjct: 183 QHRI-NYERVNYGYGQNLFKEAYTNSKGFSDSWYWKDGSYGFVHNLKLGDPIYSHRFSRY 241 Query: 146 GA--AASETRPRNIAFNY 161 +SETRP+N+A+ + Sbjct: 242 TGYEGSSETRPKNLAYLW 259 >UniRef50_Q11LT1 Microcystin-dependent protein-like n=1 Tax=Chelativorans sp. BNC1 RepID=Q11LT1_MESSB Length = 268 Score = 55.1 bits (131), Expect = 8e-07, Method: Composition-based stats. 
Identities = 39/195 (20%), Positives = 62/195 (31%), Gaps = 39/195 (20%) Query: 6 GSALPVGVPVPWPSATPPTGWLKCNGAAFSAEE-YPE-----------LAKAYPTNKLPD 53 G +P+G V + +T P GW C G A ++ YP + ++PD Sbjct: 78 GQLVPIGTIVDYALSTAPEGWTFCYGQALTSSTPYPLLRAALLAAGSPFGTSGSDPRVPD 137 Query: 54 LRGEFIRGWDDGRGIDTGR---SILSIQGYATED------HA--------HGLPSRSTIV 96 RG G D+ G R + G D H H + Sbjct: 138 YRGRVGAGKDNMGGTSANRLTNQSGGVNGDVLGDTGGAETHTLSVGQMPSHNHSGSTGSG 197 Query: 97 TDATINFYFDEIWVNSGTDIIKRGNTNDAGLPAPDYGTFKTYKQSVDGLGAAASETRPRN 156 + T Y + SG + + +G Y + + S ++ P N Sbjct: 198 GNHTHTMYVKNLSAGSGGNPVT---GTPSGTIDSTYQSDPSGSHSHSIPSQGGND--PHN 252 Query: 157 -----IAFNYIVRAA 166 I N I++AA Sbjct: 253 NVQPTIIVNKIIKAA 267 >UniRef50_C5RN01 Tail Collar domain protein n=1 Tax=Clostridium cellulovorans 743B RepID=C5RN01_CLOCL Length = 123 Score = 55.1 bits (131), Expect = 9e-07, Method: Composition-based stats. Identities = 23/69 (33%), Positives = 27/69 (39%), Gaps = 10/69 (14%) Query: 3 LGEGSALPVGVPVPWPSATPPTGWLKCNGAAFSAEEYPELAKAYP----------TNKLP 52 L S G + T P GWL C+G+A S E Y L A T LP Sbjct: 53 LNGSSGTFPGKIDMTATTTAPQGWLICDGSAVSRETYANLYTAIGTTYGNGDGTTTFNLP 112 Query: 53 DLRGEFIRG 61 D+RG G Sbjct: 113 DMRGRVPIG 121 >UniRef50_B5ZGB2 Tail Collar domain protein n=4 Tax=Gluconacetobacter diazotrophicus PAl 5 RepID=B5ZGB2_GLUDA Length = 300 Score = 54.7 bits (130), Expect = 1e-06, Method: Composition-based stats. 
Identities = 30/145 (20%), Positives = 39/145 (26%), Gaps = 12/145 (8%) Query: 13 VPVPWPSATPPTGWLKCNGAAFSAEEYPELAKAYP----------TNKLPDLRGEFIRGW 62 + AT P GW+ C G A S Y L T LPDLRG G Sbjct: 106 TVADYAGATAPAGWMLCCGQAVSRATYAALFAVIGTTFGAGDGATTFGLPDLRGRVAAGV 165 Query: 63 DDGRGIDTGRSILSIQGYATEDHAHGLPSRSTIVTDATINFYFDEIWVNSGTDIIKRGNT 122 D G ++L++ G G S + T G Sbjct: 166 DSMGGTAA--NLLTMAGAGINGVQLGAAGGSQMAPSHTHAVTDPGHAHAVTDPGHAHGPG 223 Query: 123 NDAGLPAPDYGTFKTYKQSVDGLGA 147 + G P + L Sbjct: 224 SGTGFVVPQGTGGEIVTFDGGSLTP 248 >UniRef50_Q4ZMK7 Putative uncharacterized protein n=2 Tax=Pseudomonas syringae group RepID=Q4ZMK7_PSEU2 Length = 440 Score = 53.9 bits (128), Expect = 2e-06, Method: Composition-based stats. Identities = 40/193 (20%), Positives = 60/193 (31%), Gaps = 39/193 (20%) Query: 3 LGEGSALPVGVPVPWPSAT-PPTGWLKCNGAAFSAEEYPEL------------------- 42 +G + P+G PV + + P G+ NG + ++P L Sbjct: 242 IGRYDSTPLGRPVFETTTSFSPGGYGALNGGLLNRADWPWLWDHAQKSGMLYTEAARTGK 301 Query: 43 ------AKAYPTNKLPDLRGEFIRGWDDGRGIDTGRSILSIQGYATEDHAHGLPSRSTIV 96 T + P+ RGEF+R D+ RG+DT R S Q T + + Sbjct: 302 EGGWTSGDGALTFRGPEGRGEFLRVLDESRGVDTSRVAGSWQ-DGTWLRTIAQEWSGSDI 360 Query: 97 TDATINFYFDEIWVNSGTDIIKRGNTNDAGLPAPDYGTFKTYKQSVDGLGAAASET---- 152 + T + G G P GT + D A E Sbjct: 361 STGTYPLGNAHAQADGRISSTGPGGALPTGALIP-AGTLGYLSDTTDNGVMGAVEVNAQL 419 Query: 153 -------RPRNIA 158 R RN+A Sbjct: 420 INNWIRFRSRNMA 432 >UniRef50_UPI0001BC923E Phage tail Collar n=1 Tax=Pseudomonas syringae pv. tabaci ATCC 11528 RepID=UPI0001BC923E Length = 196 Score = 53.9 bits (128), Expect = 2e-06, Method: Composition-based stats. 
Identities = 29/155 (18%), Positives = 46/155 (29%), Gaps = 18/155 (11%) Query: 12 GVPVPWPSATPPTGWLKCNGAAFSAEEYPELAKAYP---------TNKLPDLRGEFIRGW 62 G + + P GW++CNG + +Y L LP+L+G Sbjct: 6 GSIMTFGFPFAPAGWMQCNGQTLNISQYNALYALLGVIYGGNPSQNFMLPNLQGRVPINQ 65 Query: 63 DDGRGIDTGRSILSIQG--------YATEDHAHGLPSRSTIVTDATINFYFDEIWVNSGT 114 G + T R I S+ G H H + + + T N + T Sbjct: 66 GTGVNL-TNRVIGSVSGVEKVTVAIANMPAHVHQMSTLTANTTITLANPAVTGATIAPTT 124 Query: 115 DIIKRGNTNDAGLPAPDYGTFKTYKQSVDGLGAAA 149 D G + A + V + A Sbjct: 125 DNAFIGASTSGPTSANIFSPNAGTAPVVQKGVSTA 159 >UniRef50_Q4KAW3 Putative uncharacterized protein n=1 Tax=Pseudomonas fluorescens Pf-5 RepID=Q4KAW3_PSEF5 Length = 191 Score = 53.6 bits (127), Expect = 2e-06, Method: Composition-based stats. Identities = 35/175 (20%), Positives = 50/175 (28%), Gaps = 23/175 (13%) Query: 12 GVPVPWPSATPPTGWLKCNGAAFSAEEYPELAKAYP---------TNKLPDLRGEFIRGW 62 G + P GW CNGA + L T LPD RG G Sbjct: 7 GEIRMFAGNFAPRGWALCNGAQLLIRNFEALYTLIGTTYGGDGSNTFCLPDYRGRTPIGQ 66 Query: 63 DDGRGIDT---GRSILSIQ----GYATEDHAHGLPSRSTIVTDATINFYFDEIWVNSGTD 115 +G G G+++ + Q T H HG S+ VT A + Sbjct: 67 GNGPGFTPRALGQAVGTEQVTMSALNTPPHIHGFQVSSSEVTSANPLPANQPANSYTFGK 126 Query: 116 IIKRGNTNDAGLPAPDYGTF----KTYKQSVDGLGAAASE---TRPRNIAFNYIV 163 G+ + G+ A E ++A YI+ Sbjct: 127 FKLEGSFTGLYSKGDSTSAVVSMSPNFLSPALGIPNKAVEPHSNMMGSLAITYII 181 >UniRef50_Q1QPI5 Phage Tail Collar n=10 Tax=Proteobacteria RepID=Q1QPI5_NITHX Length = 180 Score = 53.6 bits (127), Expect = 2e-06, Method: Composition-based stats. 
Identities = 31/171 (18%), Positives = 53/171 (30%), Gaps = 27/171 (15%) Query: 11 VGVPVPWPSATPPTGWLKCNGAAFSAEEYPELAKAYP---------TNKLPDLRGEFIRG 61 +G + P W CNGA + + L T LP+ G G Sbjct: 5 IGQIQIFGFNYAPRNWAFCNGATLAIRQNTALFSLLGTMYGGDGVTTFMLPNFAGRT--G 62 Query: 62 WDDGRGID-TGRSILSIQG--------YATEDHAHGLPSRSTIVTDATINFYFDEIWVNS 112 + G+G+ T R+I G H+H + T + + Sbjct: 63 CNQGQGVGLTARTIGEAFGENSVALVSEEMPSHSHSFTVYNQTDTTKRTSAPAN------ 116 Query: 113 GTDIIKRGNTNDAGLPAPDYGTFKTYKQSVDGLGAAASETRPRNIAFNYIV 163 G+ ++ N+ F + + G G E R +A N+ + Sbjct: 117 GSSLVVPQNSTPFSSSGTANTQFSPHMGGLTG-GNQPHENRQPYLAMNFCI 166 >UniRef50_D2QTE9 Tail Collar domain protein n=1 Tax=Spirosoma linguale DSM 74 RepID=D2QTE9_9SPHI Length = 172 Score = 53.6 bits (127), Expect = 3e-06, Method: Composition-based stats. Identities = 32/168 (19%), Positives = 50/168 (29%), Gaps = 29/168 (17%) Query: 11 VGVPVPWPSATPPTGWLKCNGAAFSAEEYPELAKAY---------PTNKLPDLRGEFIR- 60 +G + + G++ CNG +Y L T LPDLRG Sbjct: 5 IGQIILFAGNYEIRGYVFCNGQLLDISKYTALYSLLGTTYGGNGTTTFGLPDLRGRMPIH 64 Query: 61 -GWDDG-RGIDTGRSILSIQG----YATEDHAHGLPSRSTIVTDATINFYFDEIWVNSGT 114 G + G R G+ S + H H L + S T + Sbjct: 65 FGQEPGKRSYVLGQRSGSYETTLTVDNLPAHNHALNAFSETGTASAP------------A 112 Query: 115 DIIKRGNTNDAGLPAPDYGTFKTYKQSVDGLGAA-ASETRPRNIAFNY 161 + PD + +++ G +T P +A NY Sbjct: 113 GALLANTGLGDTEYLPDGTLVQMSTKAIGKTGNGRPVDTMPPYLALNY 160 >UniRef50_B8DLJ2 Tail fiber protein, putative n=3 Tax=Desulfovibrio vulgaris RepID=B8DLJ2_DESVM Length = 505 Score = 53.6 bits (127), Expect = 3e-06, Method: Composition-based stats. 
Identities = 37/190 (19%), Positives = 58/190 (30%), Gaps = 53/190 (27%) Query: 8 ALPVGVPVPWPSATPPTGWLKCN-GAAFSAEEYPEL------------------------ 42 +P+G+ WP TPP G L N G E YP+L Sbjct: 212 GMPIGMTFWWPGTTPPAGSLAINDGPLLPREAYPQLWAMAQASGNIITEAAWQAQAAVQS 271 Query: 43 -------AKAYPTNKLPDLRGEFIRGWDDGRGIDTGRSILSIQGYATEDHAHGLPSRSTI 95 T + P LR +F+RG GR++ + Q +ATE + Sbjct: 272 SVGAFSSGDGATTFRCPRLR-DFVRG----ANPSGGRAVGAWQAHATEGLFVPMDGDGET 326 Query: 96 VTDATINFYFDEIWVNSGTDIIKRGNTNDAGLPAPDYGTFKTYKQSVDGLGAAASETRPR 155 V ++ + SG P + + ET PR Sbjct: 327 VIGVVPSWGPATHDLTSG----------------PGTVNVTASTSGILTIPTGTGETLPR 370 Query: 156 NIAFNYIVRA 165 + + ++A Sbjct: 371 TVNWLPCIKA 380 >UniRef50_Q0F1S9 Putative uncharacterized protein n=1 Tax=Mariprofundus ferrooxydans PV-1 RepID=Q0F1S9_9PROT Length = 383 Score = 53.2 bits (126), Expect = 3e-06, Method: Composition-based stats. Identities = 28/126 (22%), Positives = 44/126 (34%), Gaps = 9/126 (7%) Query: 6 GSALPVGVPVPWPSATPPTGWLKCN--------GAAFSAEEYPELAKAYPTNKLPDLRGE 57 G+ P G + P G L + G AF L T P++R E Sbjct: 247 GTYFPAGTTILSIDIAGPGGQLTVSANATGNGVGTAFEISP-WGLGDGATTFNKPEVRNE 305 Query: 58 FIRGWDDGRGIDTGRSILSIQGYATEDHAHGLPSRSTIVTDATINFYFDEIWVNSGTDII 117 F+R DDGRG++ G + S+ + H H P + + ++ + T Sbjct: 306 FVRFADDGRGVNVGSILGSVHADSVGPHTHPTPIGGSAGGSSGFWGPSTDVSGPTDTGSN 365 Query: 118 KRGNTN 123 T Sbjct: 366 TGTETQ 371 >UniRef50_Q72P75 Putative uncharacterized protein n=4 Tax=Leptospira interrogans RepID=Q72P75_LEPIC Length = 306 Score = 52.8 bits (125), Expect = 4e-06, Method: Composition-based stats. 
Identities = 31/132 (23%), Positives = 38/132 (28%), Gaps = 16/132 (12%) Query: 43 AKAYPTNKLPDLRGEFIRG--------WDDGRGIDTGRSILSIQGYATEDHAHGLPSRST 94 T +PD RG F RG G D G ++ H H L S Sbjct: 183 GDGSTTYNIPDRRGIFARGAGVHGSRSKAAGGNYDGG-AVGYAGQDQLFRHVHELWLNSN 241 Query: 95 IVTDATINFYFDEIWVNSGTDIIKRGNTNDAGLPAPDYGTFKTYKQSVDGLGAAASETRP 154 T Y N ++ A +P Y +G A E P Sbjct: 242 NNTVGGTTAYSSGAGPN-------TPSSASANGASPGYSIRSVISDGSNGTPRAGDENTP 294 Query: 155 RNIAFNYIVRAA 166 IA Y VR A Sbjct: 295 AYIAVKYKVRVA 306 >UniRef50_A4P195 Putative phage tail fibre protein (Fragment) n=1 Tax=Haemophilus influenzae 22.4-21 RepID=A4P195_HAEIN Length = 458 Score = 52.8 bits (125), Expect = 4e-06, Method: Composition-based stats. Identities = 20/50 (40%), Positives = 32/50 (64%), Gaps = 2/50 (4%) Query: 7 SALPVGVPVPWPSA-TPPTGWLKCNGAAFSAEEYPELAKAYP-TNKLPDL 54 +P+G V +P A T P G+L+ +G+ FS + +P+L + +NKLPDL Sbjct: 391 DGIPIGAVVSFPRAVTNPVGFLRADGSTFSQQTFPDLYRTLGNSNKLPDL 440 >UniRef50_Q55EP2 Putative uncharacterized protein n=1 Tax=Dictyostelium discoideum RepID=Q55EP2_DICDI Length = 166 Score = 52.8 bits (125), Expect = 4e-06, Method: Composition-based stats. Identities = 22/109 (20%), Positives = 34/109 (31%), Gaps = 12/109 (11%) Query: 7 SALPVGVPVPWPSATPPTGWLKCNGAAFSAEEYPELAKAYPT--------NKLPDLRGEF 58 +++ G + P GW C+GA + YPEL + PD RG+ Sbjct: 23 NSIQPGSVNIFTGIEIPVGWSLCDGAPLNKLTYPELYRQIGDAFGSSEHEFSKPDFRGKC 82 Query: 59 IRGWDDGRGIDTGRSILSIQGYATEDHAHGLPSRSTIVTDATINFYFDE 107 G +G G+ +S H H + F Sbjct: 83 PIGAGNGVGLTNHLLTVS----ELPSHDHPVIDPGHTWHSIGGGFSSGP 127 >UniRef50_Q8PR97 Microcystin dependent protein n=1 Tax=Xanthomonas axonopodis pv. citri RepID=Q8PR97_XANAC Length = 183 Score = 52.4 bits (124), Expect = 5e-06, Method: Composition-based stats. 
Identities = 27/163 (16%), Positives = 41/163 (25%), Gaps = 22/163 (13%) Query: 12 GVPVPWPSATPPTGWLKCNGAAFSAEEYPELAKAYP---------TNKLPDLRGEFIRGW 62 G + + P GW C+G Y L T LPDLRG Sbjct: 6 GQIILFAGNYEPQGWAFCDGRQLQINTYMALYSLIGTTYGGDGRTTFNLPDLRGRVAISQ 65 Query: 63 DDGRGIDT-------------GRSILSIQGYATEDHAHGLPSRSTIVTDATINFYFDEIW 109 G G +S+Q H H L + ++ + T + Sbjct: 66 GQGIARAPTPQLTARVLGQQFGTETVSLQLAEMPAHRHTLQAFNSPASSLTPTGQLPAVT 125 Query: 110 VNSGTDIIKRGNTNDAGLPAPDYGTFKTYKQSVDGLGAAASET 152 T + + S A++T Sbjct: 126 QGGNTGYLTPPAGSTPAASTLATNAVNVAGASQPHDNHMATQT 168 >UniRef50_Q73NL1 Tail fiber domain protein n=1 Tax=Treponema denticola RepID=Q73NL1_TREDE Length = 527 Score = 52.4 bits (124), Expect = 5e-06, Method: Composition-based stats. Identities = 30/173 (17%), Positives = 54/173 (31%), Gaps = 31/173 (17%) Query: 11 VGVPVPWPSATPPTGWLKCNGAAFSAEEYPELAK---------------AYPTN---KLP 52 +G + + G+L NG +F E YPE + Y + KLP Sbjct: 365 IGEVRYFTNKKYTYGYLYANGYSFIPELYPEFYQFWLENFGDRNKKNYLGYDSFGYPKLP 424 Query: 53 DLRGEFIRGWDDGRGIDTGRSILSIQGYATEDHAHGLPSRSTIVTDATINFYFDEIWVNS 112 DLRG +R DDG L QG A + + ++ + N Sbjct: 425 DLRGVALRAVDDGSDRGGAALALEFQGDAIRN------------LKGRVGVQGNDGYPNL 472 Query: 113 GTDIIKRGNTNDAGLPAPDYGTF-KTYKQSVDGLGAAASETRPRNIAFNYIVR 164 + +T ++ + + A + R ++ ++ Sbjct: 473 TAGVFHTLDTGYIDSGTSQASSYLRLLGFDASRVVPTAEDNRVKSYGVYPFIK 525 >UniRef50_Q4UNP6 Microcystin dependent protein n=8 Tax=Bacteria RepID=Q4UNP6_XANC8 Length = 175 Score = 52.4 bits (124), Expect = 6e-06, Method: Composition-based stats. 
Identities = 29/169 (17%), Positives = 47/169 (27%), Gaps = 25/169 (14%) Query: 11 VGVPVPWPSATPPTGWLKCNGAAFSAEEYPELAKAYP---------TNKLPDLRGEFIRG 61 +G + P GW C+G+ S +Y L T +PDLRG Sbjct: 6 IGEIRMFGFGRTPQGWQACDGSLLSISDYEVLFMLIGNTYGGDGQNTFAVPDLRGRVPLH 65 Query: 62 WDDGRGID-------TGRSILSIQGYATEDHAHGLPSRSTIVTDATINFYFDEIWVNSGT 114 G G+ G +++ G H H L + + T + Sbjct: 66 QGQGPGLSNYVIAQTAGTESVALTGLQLPAHTHTLVATTAAATATAPSGLLPGTVT---- 121 Query: 115 DIIKRGNTNDAGLPAPDYGTFKTYKQSVDGLGAAASETRPRNIAFNYIV 163 G+ A + + G A E + Y + Sbjct: 122 -----GDVFYATDTTGATAAPMATQSTTITGGGLAHENTMPTLTVQYCI 165 >UniRef50_Q2W7B1 Microcystin-dependent protein n=3 Tax=Proteobacteria RepID=Q2W7B1_MAGSA Length = 177 Score = 52.0 bits (123), Expect = 7e-06, Method: Composition-based stats. Identities = 32/169 (18%), Positives = 51/169 (30%), Gaps = 25/169 (14%) Query: 12 GVPVPWPSATPPTGWLKCNGAAFSAEEYPELAKAYP---------TNKLPDLRGEFI--R 60 G +P PTGWL C+G + L T LPDLRG I + Sbjct: 7 GEIRLFPLNWAPTGWLPCDGRSMQVSANAALFSLLGNQFGGDAKTTFFLPDLRGRTIMGQ 66 Query: 61 GWDDGRGID------TGRSILSIQGYATEDHAHGLPSRSTIVTDATINFYFDEIWVNSGT 114 G + G+ G +++ H H + T+ + + + + +GT Sbjct: 67 GKNPVTGVSYVTGAYGGTESVTLTTAQLPSHQHQVVGDQTVGATNPADDNYLAVPIYNGT 126 Query: 115 DIIKRGNTNDAGLPAPDYGTFKTYKQSVDGLGAAASETRPRNIAFNYIV 163 + P G AA +A Y + Sbjct: 127 QKSLYNSGTKPVPLNP--------ASVSTVGGGAAHTNTQPYLALGYCI 167 >UniRef50_UPI0001A44BB4 microcystin dependent protein n=1 Tax=Pectobacterium carotovorum subsp. brasiliensis PBR1692 RepID=UPI0001A44BB4 Length = 269 Score = 52.0 bits (123), Expect = 8e-06, Method: Composition-based stats. 
Identities = 33/165 (20%), Positives = 47/165 (28%), Gaps = 17/165 (10%) Query: 5 EGSALPVGVPVPWPSATPPTGWLKCNGAAFSAEEYPELAKAYP----------TNKLPDL 54 +G A +G ++ P+G+L G + S Y L LPDL Sbjct: 31 DGDAPYIGSVCYMVTSYCPSGYLPAAGQSVSISTYQALYALIGNIWGGSPQTNNFTLPDL 90 Query: 55 RGEFIRGWDDG-------RGIDTGRSILSIQGYATEDHAHGLPSRSTIVTDATINFYFDE 107 RG I G G RG G ++ H H T D + Sbjct: 91 RGRSIVGAGQGTGLSLIQRGQSLGAETATLSASNVAPHTHPTAQSLTTTFDVLVPATTGN 150 Query: 108 IWVNSGTDIIKRGNTNDAGLPAPDYGTFKTYKQSVDGLGAAASET 152 + V + I PA +V A + T Sbjct: 151 LTVGATLPIATTTPATTGTTPANGANFLTALSATVPVGAATQNAT 195 >UniRef50_UPI0001AF6092 hypothetical protein Psyrpo1_27141 n=2 Tax=Pseudomonas syringae group RepID=UPI0001AF6092 Length = 486 Score = 51.6 bits (122), Expect = 1e-05, Method: Composition-based stats. Identities = 37/172 (21%), Positives = 55/172 (31%), Gaps = 27/172 (15%) Query: 3 LGEGSALPVGVPVPWPSAT-PPTGWLKCNGAAFSAEEYPEL------------------- 42 +G P+G PV + P G+ NGA S E+P L Sbjct: 288 IGRYDNTPLGRPVFETTTLFSPGGYGALNGALLSRAEWPWLWDHAQKSGMVYTEAARTGK 347 Query: 43 ------AKAYPTNKLPDLRGEFIRGWDDGRGIDTGRSILSIQGYATEDHAHGLPSRSTIV 96 T + P+ RGEF+R D+ RG+DT R S Q T + V Sbjct: 348 EGGWTSGDGAQTFRGPEGRGEFLRVLDESRGVDTSRVAGSWQ-DGTWLRTVAQEWSGSDV 406 Query: 97 TDATINFYFDEIWVNSGTDIIKRGNTNDAGLPAPDYGTFKTYKQSVDGLGAA 148 + + G G P G+ + + +G+ A Sbjct: 407 ETGSYLLGNGHAQADGRLSSTGPGGLLPPGALVPAGGSAYLPETTDNGVMAT 458 >UniRef50_B3FYL6 Gp17 n=1 Tax=Salmonella phage phiSG-JL2 RepID=B3FYL6_9CAUD Length = 658 Score = 51.6 bits (122), Expect = 1e-05, Method: Composition-based stats. 
Identities = 36/203 (17%), Positives = 61/203 (30%), Gaps = 42/203 (20%) Query: 1 VGLGEGSALPVGVPVPWPSATPPTGWLKCNGAAFSAEEYPELAKAYPTNKLPDLRGEFIR 60 + L +PVG PTGW++ G F YP LA+ +P+ + P + Sbjct: 268 LALQSTGNVPVGTVAMITHTKIPTGWVRA-GEDFDVNTYPALAELFPSGRTPSFDDRYPI 326 Query: 61 G----WDDGRGIDTG------------------------RSILSIQGYATEDHAHGLP-- 90 G G+ ID R+ S +G H+HG Sbjct: 327 GNSTVLTPGQLIDQSVPAHSHTFDVPVNVSGATAAGGEYRARTSHEGD----HSHGFSLP 382 Query: 91 ------SRSTIVTDATINFYFDEIWVNSGTDIIKRGNTNDAGLPAPDYGTFKTYKQSVDG 144 + + + N + + + + + SV Sbjct: 383 IQNNTGAYTGRLVGGGNNPNYPQDLRFNTGGGGAHSHEFYVPSHSHTLNASGRAAGSVSS 442 Query: 145 LGAAAS-ETRPRNIAFNYIVRAA 166 G S RP + +I++AA Sbjct: 443 SGIGNSPYVRPYSTVVIFIIKAA 465 >UniRef50_Q8Y365 Putative uncharacterized protein n=4 Tax=Ralstonia solanacearum RepID=Q8Y365_RALSO Length = 182 Score = 51.2 bits (121), Expect = 1e-05, Method: Composition-based stats. Identities = 33/167 (19%), Positives = 51/167 (30%), Gaps = 15/167 (8%) Query: 12 GVPVPWPSATPPTGWLKCNGAAFSAEEYPELAKAYP---------TNKLPDLRGEFIRGW 62 G + PP GW CNG + L T LPDLRG Sbjct: 6 GEIRLCAFSYPPKGWAACNGTLLPIAQNTALFSLLGTQYGGDGVRTFALPDLRGRTPLHR 65 Query: 63 DDGR---GIDTGRSILSIQGYATEDHAHGLPSRSTIVTDATINFYFDEIWVNSGTDIIKR 119 D G G +++ H+H + S+ T + + + S Sbjct: 66 DYVNSVVGSVGGAETVTLVSSQLPVHSHLFNASSSPATSTNVGATQNHVLAASNLYSSTD 125 Query: 120 GNTNDAG---LPAPDYGTFKTYKQSVDGLGAAASETRPRNIAFNYIV 163 + +G AP + + GA E ++ NYI+ Sbjct: 126 PTISGSGTALYAAPGPLAALSGEACGSTGGAQPHENMQPSLVLNYII 172 >UniRef50_B5RPA7 Uncharacterized conserved protein n=10 Tax=Borrelia RepID=B5RPA7_BORDL Length = 259 Score = 51.2 bits (121), Expect = 1e-05, Method: Composition-based stats. 
Identities = 28/107 (26%), Positives = 44/107 (41%), Gaps = 5/107 (4%) Query: 52 PDLRGEFIRGWDDGRGIDTGRSILSIQGYATEDHAHGLPSRSTIVTDATINFYFDEIWVN 111 P L G F+R +D D RSI Q + + H H + +R +I D + FD+++ Sbjct: 149 PSLSGRFLRHYDPHGSYDYVRSIGDTQSDSFQKHDHSI-NRDSINFDGKLVRSFDQLYKT 207 Query: 112 SGTDIIKRGNTNDAGLPAPDYGTFKTYKQSVDGLGAAASETRPRNIA 158 D + DY + + + SETRP N+A Sbjct: 208 --GDWSTGDYRFVYNMRLGDYISKYDFPSYTEYY--GDSETRPVNLA 250 >UniRef50_B1M1N8 Tail Collar domain protein n=1 Tax=Methylobacterium radiotolerans JCM 2831 RepID=B1M1N8_METRJ Length = 414 Score = 50.9 bits (120), Expect = 2e-05, Method: Composition-based stats. Identities = 19/67 (28%), Positives = 25/67 (37%), Gaps = 10/67 (14%) Query: 11 VGVPVPWPSATPPTGWLKCNGAAFSAEEYPELAK----------AYPTNKLPDLRGEFIR 60 G + P+GW C+G A S Y L + T LPD RG + Sbjct: 145 AGAIKAFAGPNVPSGWEICDGRAVSRTAYAALFATISTGWGNGDGFTTFNLPDARGRTLF 204 Query: 61 GWDDGRG 67 G + G G Sbjct: 205 GANRGTG 211 >UniRef50_C9PG79 Putative phage tail protein n=1 Tax=Vibrio furnissii CIP 102972 RepID=C9PG79_VIBFU Length = 410 Score = 50.9 bits (120), Expect = 2e-05, Method: Composition-based stats. Identities = 24/158 (15%), Positives = 41/158 (25%), Gaps = 23/158 (14%) Query: 7 SALPVGVPVPWPSATPPTGWLKCNGAAFSAEEYPELAKAYPT---------NKLPDLRGE 57 S+ G + W S P + G Y LA A P +PD RG Sbjct: 257 SSKTPGETMAWDSELVPEHMIVAMGQQLPVTVYHSLAAAKPEWIDDTNPLVLNIPDRRGR 316 Query: 58 FIRGWDDGRGIDTGRSILSIQGYATEDHAHGLPSRSTIVTDATINFYFDEIWVNSGTDII 117 F R D + A + + T + ++ + I Sbjct: 317 FTRAADGSHWLA-----GQSHDDAIRNITGSFNASGTTGSASS---------TKTQGAIA 362 Query: 118 KRGNTNDAGLPAPDYGTFKTYKQSVDGLGAAASETRPR 155 ++ G + + E +P+ Sbjct: 363 LSNTSSWPNYVNGQSGAGYNLLFDASNVVPTSEENQPK 400 >UniRef50_A9DEL7 Tail fiber protein 2 n=1 Tax=Yersinia phage PY100 RepID=A9DEL7_9CAUD Length = 640 Score = 50.5 bits (119), Expect = 2e-05, Method: Composition-based stats. 
Identities = 39/202 (19%), Positives = 56/202 (27%), Gaps = 50/202 (24%) Query: 15 VPWPSA--TPPTGWLKCNGAAFSAEEYPELAKAYPTNKL-----------PDLRGEFIRG 61 V W S P G L +G +P L + ++ P LRG + G Sbjct: 63 VMWHSTQKHLPAGCLLSDGQEVDRATWPSLFEEIEAGRVPVVPEADWLANPKLRGSYTLG 122 Query: 62 --WDDGRGID-TGRSILSI------------------QGYATEDHAHGLPSRSTIV---- 96 + R D GRS+ S+ Q A + H H + Sbjct: 123 DVVNTFRVPDYNGRSVGSLGRIFLGGDGQNAGLDGQIQESANKRHNHAITDNGHSHGVND 182 Query: 97 TDATINFYFDEIWVNSGTDIIKRGNTNDAGLPAPDYGTFKTYKQSVDGLGAAASET---- 152 + G I + A S G+ SE+ Sbjct: 183 AGHSHEKSAWVANPAGGGQIYRDPEVWITTNAADKVEVHYKTGVSTSGISLQESESGITL 242 Query: 153 --------RPRNIAFNYIVRAA 166 RP N+A YI+R A Sbjct: 243 AEDGEADARPSNVAGCYIIRGA 264 Score = 41.6 bits (96), Expect = 0.008, Method: Composition-based stats. Identities = 31/191 (16%), Positives = 54/191 (28%), Gaps = 51/191 (26%) Query: 5 EGSALPVGVPVPWP-SATPPTGWLKCNGAAFSAEEYPEL--------------------- 42 +G+A VG P + P G + +G S E YP L Sbjct: 295 QGNAGYVGKVDWHPLRESVPHGRIPADGQLLSRELYPALWEAVRDRRVPVTTEELWNSDG 354 Query: 43 --------AKAYPTNKLPDLRGE--------FIRGWDDGRGIDTGRSILSIQGYATEDHA 86 ++PD G+ F+RG G+++ IQG A + Sbjct: 355 KRRGCYTEGDGSTNFRVPDYNGKTSGSLGAGFLRG----DGLNSLSESGMIQGDAIRNIK 410 Query: 87 HGLPSRSTIVTDATINFYFDEIWVNSGTDIIKRGNTNDAGLPAPDYGTFKTYKQSVDGLG 146 + + A+ + +G G + + V Sbjct: 411 GYAGAVANYKALASSGALSLD---------ADQGPMYVNGANTGVWAALRNMSIDVSKAV 461 Query: 147 AAASETRPRNI 157 A++ P N+ Sbjct: 462 PTAADNHPVNV 472 >UniRef50_C7BVI0 Structural protein n=1 Tax=Synechococcus phage S-RSM4 RepID=C7BVI0_9CAUD Length = 428 Score = 50.5 bits (119), Expect = 2e-05, Method: Composition-based stats. 
Identities = 37/175 (21%), Positives = 57/175 (32%), Gaps = 58/175 (33%) Query: 12 GVPVPWPSATP-------------PTGWLKCNGAAFSAEEYPELAKAYP----------- 47 G + +P P G+++C+G+ ++ YP LA+ Sbjct: 17 GTIIAFPKELDINDPAIGVGLNLLPAGYIRCDGSVYNENTYPALAQILGLGDACVFKQPD 76 Query: 48 ------TNKLPDLRGEFIRGWDDGRGIDTGRSILSIQGYATEDHAHGLPSRSTIVTDATI 101 ++PDLR +FIR S S QG ++ D T+ Sbjct: 77 VTLNADQFQVPDLRSKFIRA-----------SSASDQG---------------VINDNTV 110 Query: 102 NFYFDEIWVNSGTDIIKRGNTNDAGLPAPDYGTFKTYKQSVDGLGAAASETRPRN 156 SG + N + +G F+ S D G A TRPRN Sbjct: 111 LSATGLTVEKSGVGVQVSSNVGSTAV-VDLFGQFRIPALSEDLRGNVAF-TRPRN 163 >UniRef50_C6DJW4 Tail Collar domain protein n=2 Tax=Pectobacterium carotovorum subsp. carotovorum RepID=C6DJW4_PECCP Length = 246 Score = 50.5 bits (119), Expect = 2e-05, Method: Composition-based stats. Identities = 27/159 (16%), Positives = 47/159 (29%), Gaps = 17/159 (10%) Query: 11 VGVPVPWPSATPPTGWLKCNGAAFSAEEYPELAKAY----------PTNKLPDLRGEFIR 60 +G ++ P G+L NG + +Y L LPD+RG Sbjct: 36 IGSICYMVTSYCPQGYLPANGQTVTINQYQALYALIGNIWGGSPQQGNFVLPDMRGRVPV 95 Query: 61 GWDDGRGIDT---GRSIL----SIQGYATEDHAHGLPSRSTIVTDATINFYFDEIWVNSG 113 G G G+ G+ ++ H H S+ T + + + Sbjct: 96 GAGQGTGLANVTRGQVFGVENVALTTSNVAPHIHPATVASSGGVSGTASIAIPVVNGAAT 155 Query: 114 TDIIKRGNTNDAGLPAPDYGTFKTYKQSVDGLGAAASET 152 T++ + P+ D + AA T Sbjct: 156 TNVPDNTTSLATTSPSFDLSSVGGVDSPAKIYSNAAPTT 194 >UniRef50_Q2RUE1 Phage Tail Collar n=5 Tax=Proteobacteria RepID=Q2RUE1_RHORT Length = 187 Score = 50.5 bits (119), Expect = 2e-05, Method: Composition-based stats. 
Identities = 33/174 (18%), Positives = 48/174 (27%), Gaps = 18/174 (10%) Query: 11 VGVPVPWPSATPPTGWLKCNGAAFSAEEYPELAKAYP---------TNKLPDLRGEFIRG 61 +G + P W C G + + LA T LP+L+G G Sbjct: 6 IGEIRIFGFNYAPVDWAFCAGQTVAIAQNQALAVVLGQAFGGDGRTTFGLPNLQGSVPIG 65 Query: 62 WDDGRGIDT-------GRSILSIQGYATEDHAHGLPSRSTIVT--DATINFYFDEIWVNS 112 G G+ G +S+ T H H + S T A N + + Sbjct: 66 AGSGPGLTPRPYAQQAGTDRVSLTLAQTPPHNHSITVASASGTLRTAGPNATAPLSFCSF 125 Query: 113 GTDIIKRGNTNDAGLPAPDYGTFKTYKQSVDGLGAAASETRPRNIAFNYIVRAA 166 GT + G A E R + NY + A Sbjct: 126 VATKATPPKPQTTFTQTAPDGTLAPGALAPFVGGGDAHENRQPYLVLNYCISLA 179 >UniRef50_C0YLU9 Phage tail collar domain-containing protein n=1 Tax=Chryseobacterium gleum ATCC 35910 RepID=C0YLU9_9FLAO Length = 183 Score = 50.5 bits (119), Expect = 2e-05, Method: Composition-based stats. Identities = 27/169 (15%), Positives = 46/169 (27%), Gaps = 16/169 (9%) Query: 11 VGVPVPWPSATPPTGWLKCNGAAFSAEEYPELA---------KAYPTNKLPDLRGEFIRG 61 +G+ + P GW+ C+G+ S L T LP+L+G G Sbjct: 5 IGIVKLFAGNFAPRGWMFCDGSLLSISRNSALFSILGTTYGGDGITTFALPNLKGRMALG 64 Query: 62 ---WDDGRGIDTGRSILSIQ----GYATEDHAHGLPSRSTIVTDATINFYFDEIWVNSGT 114 + G G + Q G + + SGT Sbjct: 65 AGNVNSGENYPLGIVSGTTQNTLLSSNLPSIGAGFQLKVANKNANSSTPTATSTIAISGT 124 Query: 115 DIIKRGNTNDAGLPAPDYGTFKTYKQSVDGLGAAASETRPRNIAFNYIV 163 + + N + + + T P + NYI+ Sbjct: 125 QVGRDFNAVPSFVNDANPDTTINPLSISFTGQGLPLNNMPPYLGLNYII 173 >UniRef50_Q66BF2 Hypothetical phage protein n=1 Tax=Yersinia pseudotuberculosis RepID=Q66BF2_YERPS Length = 711 Score = 50.1 bits (118), Expect = 3e-05, Method: Composition-based stats. 
Identities = 34/181 (18%), Positives = 53/181 (29%), Gaps = 35/181 (19%) Query: 19 SATPPTGWLKCNGAAFSAEEYP-----ELAKAYP------------------------TN 49 P GW +G S +P L+ YP T Sbjct: 386 RNYIPAGWAPADGQLLSRNLFPFALAEILSAKYPIVADDSWLFYKDQRSSFSVGDGSTTF 445 Query: 50 KLPDLRGE----FIRGWDDGRGIDTGRSILSIQGYATEDHAHGLPSRSTIVTDATINFYF 105 ++PDL G+ R + G G ++ + IQG A+ G + + Sbjct: 446 RIPDLNGKSHDSMGRVFLGGDGKNSLGEMGRIQGDASRRIT-GTFGGIGGQLNVSYGLVI 504 Query: 106 DEIWVNSGTDIIKRGNTNDAGLPAPDYGTFKTYKQSVDGLGAAASETRPRNIAFNYIVRA 165 E + G + + P G + A E RP N YI++ Sbjct: 505 GETAGAFTRTGVATGRPVPSNIGEPALGELGV-SFDSALVNPTAIENRPINATGCYIIKL 563 Query: 166 A 166 A Sbjct: 564 A 564 >UniRef50_A6EAB9 Microcystin-dependent protein n=1 Tax=Pedobacter sp. BAL39 RepID=A6EAB9_9SPHI Length = 198 Score = 49.7 bits (117), Expect = 3e-05, Method: Composition-based stats. Identities = 25/154 (16%), Positives = 43/154 (27%), Gaps = 18/154 (11%) Query: 12 GVPVPWPSATPPTGWLKCNGAAF----SAEEYPELAKAYP-----TNKLPDLRGEF---- 58 G + + P GW C+G+ + Y L + T LPDLRG Sbjct: 27 GEIRAFACSYAPEGWALCDGSLLPLSQNQALYSLLGTRFGGNGTTTFALPDLRGRVPVGT 86 Query: 59 -IRGWDDGRGIDTGRSILS----IQGYATEDHAHGLPSRSTIVTDATINFYFDEIWVNSG 113 +RG G + S ++ H H + +++ + + S Sbjct: 87 GVRGASPAYTYTIGNNGGSETVALETATMPPHNHYVSAKNALGSVGLAGGILAIPNGGST 146 Query: 114 TDIIKRGNTNDAGLPAPDYGTFKTYKQSVDGLGA 147 I + PD + Sbjct: 147 QVNIYNTSAGATTTLNPDTVGNTGAGSPHSNMQP 180 >UniRef50_C7PNC3 Tail Collar domain protein n=1 Tax=Chitinophaga pinensis DSM 2588 RepID=C7PNC3_CHIPD Length = 183 Score = 49.7 bits (117), Expect = 3e-05, Method: Composition-based stats. 
Identities = 30/169 (17%), Positives = 47/169 (27%), Gaps = 18/169 (10%) Query: 12 GVPVPWPSATPPTGWLKCNGAAFSAEEYPELAKAYP---------TNKLPDLRGEFIRGW 62 G +P P GW+ C+G + L LP+L G I G Sbjct: 6 GEIRLFPYTQIPRGWVSCSGQTLPIAQNQALFALLGVYYGGNGTTNFMLPNLNGRAIVGT 65 Query: 63 DDGR-------GIDTGRSILSIQGYATEDHAHGLPSRSTIVTDATINFYFDEIWVNSGTD 115 G +G +++ H+H + + + YF S + Sbjct: 66 GQSTSGTNYNIGQASGTETVTLVTNNLPAHSHPVKVNVSYDQGSPNTNYFANANTPS-SP 124 Query: 116 IIKRGNTNDAGLPAPDYGTFKTYKQSVDGLGAA-ASETRPRNIAFNYIV 163 NT L +P SV G R + Y + Sbjct: 125 TQPGQNTGTVNLFSPAVTPLVEMAPSVTSTGGGLPHANRMPYLTLIYCI 173 >UniRef50_C6X0H2 Phage tail collar domain protein n=1 Tax=Flavobacteriaceae bacterium 3519-10 RepID=C6X0H2_FLAB3 Length = 193 Score = 49.3 bits (116), Expect = 4e-05, Method: Composition-based stats. Identities = 25/114 (21%), Positives = 40/114 (35%), Gaps = 20/114 (17%) Query: 11 VGVPVPWPSATPPTGWLKCNGAAFSAEEYPELAKAYP---------TNKLPDLRGEFIRG 61 VG + P P GW C+G+ S E L T +PD+RG + Sbjct: 26 VGQIMFVPYNFSPQGWHNCDGSLLSISENEVLFTLIGTTYGGDGQTTFAVPDMRGRVMI- 84 Query: 62 WDDGRGID---------TGRSILSIQGYATEDHAHGLPSRSTIVTDATINFYFD 106 DDG+G +G + + H+H + + S T + + Sbjct: 85 -DDGQGNTLSSFTLGQMSGTETVQLTQAQMPAHSHTVNAVSGAGTSESPTSHLP 137 >UniRef50_C7PNC2 Tail Collar domain protein n=1 Tax=Chitinophaga pinensis DSM 2588 RepID=C7PNC2_CHIPD Length = 184 Score = 49.3 bits (116), Expect = 4e-05, Method: Composition-based stats. 
Identities = 29/172 (16%), Positives = 48/172 (27%), Gaps = 16/172 (9%) Query: 11 VGVPVPWPSATPPTGWLKCNGAAFSAEEYPELAKAYP---------TNKLPDLRGEFIRG 61 +G + PTGWL CNG + + Y L T +PDLRG G Sbjct: 6 IGEIRAFGFNFTPTGWLPCNGGLYPIQSYSTLFAILGTNFGGNGTTTFAVPDLRGVAAIG 65 Query: 62 WDDGR------GIDTGRSILSIQGYATEDHAHGLPSRSTIVTDATINFYFDEIWVNSGTD 115 + G+ G +++ H H + + T T+ Sbjct: 66 INLQNPSFGVPGVKGGSENVTLTIATIPAHTHMMQAVVRTSLAQTAAAISQPGPNAYLTN 125 Query: 116 IIKRG-NTNDAGLPAPDYGTFKTYKQSVDGLGAAASETRPRNIAFNYIVRAA 166 G + G + + +A Y + A+ Sbjct: 126 AFSSGPSKGVVAYSNNTSGATLNPQAIGITGSSTPHNNMDPYLAMTYCICAS 177 >UniRef50_A0A7D3 Putative uncharacterized protein n=1 Tax=Microcystis aeruginosa phage Ma-LMM01 RepID=A0A7D3_9CAUD Length = 335 Score = 48.9 bits (115), Expect = 5e-05, Method: Composition-based stats. Identities = 15/48 (31%), Positives = 20/48 (41%), Gaps = 4/48 (8%) Query: 6 GSALPVGVPVPW----PSATPPTGWLKCNGAAFSAEEYPELAKAYPTN 49 + P+G + W + PT WL NG S +YPEL T Sbjct: 256 AAGAPIGSIIMWWPLIVTQQHPTNWLPLNGQEISRTQYPELFAVIGTF 303 >UniRef50_D1Y7E0 Collagen alpha 1 n=1 Tax=Pyramidobacter piscolens W5455 RepID=D1Y7E0_9BACT Length = 386 Score = 48.9 bits (115), Expect = 7e-05, Method: Composition-based stats. Identities = 16/49 (32%), Positives = 23/49 (46%) Query: 6 GSALPVGVPVPWPSATPPTGWLKCNGAAFSAEEYPELAKAYPTNKLPDL 54 S +P+G V + G+L NGA + +YP LA LPD+ Sbjct: 108 SSGVPIGATVMFKKGQQEPGYLLANGAPYDTAKYPYLADCLGAANLPDM 156 >UniRef50_C6MD19 Tail Collar domain protein n=2 Tax=Proteobacteria RepID=C6MD19_9PROT Length = 241 Score = 48.9 bits (115), Expect = 7e-05, Method: Composition-based stats. 
Identities = 27/158 (17%), Positives = 48/158 (30%), Gaps = 16/158 (10%) Query: 11 VGVPVPWPSATPPTGWLKCNGAAFSAEEYPELAKAYP---------TNKLPDLRGEFI-- 59 VG P GW +C+G +Y L T LPD+RG+ Sbjct: 35 VGEISYVAFNFAPQGWYQCDGQILPINQYQALFSLLGTNYGGDGTTTFALPDMRGKVPVH 94 Query: 60 RGWDDGR-----GIDTGRSILSIQGYATEDHAHGLPSRSTIVTDATINFYFDEIWVNSGT 114 +G G G +G +++ H H + S + + Sbjct: 95 QGQHPGGSMFTLGQTSGAENVTLTLNNMPAHNHPATATSASTSALAPGGTATSTLKAVNS 154 Query: 115 DIIKRGNTNDAGLPAPDYGTFKTYKQSVDGLGAAASET 152 D + ++ A + + + +A+ ET Sbjct: 155 DADIKTAAGNSLANAKGLNSAYSASAPNVSMSSASIET 192 >UniRef50_A6N211 Probable tail fiber protein n=1 Tax=Microbacterium phage Min1 RepID=A6N211_9CAUD Length = 250 Score = 48.6 bits (114), Expect = 8e-05, Method: Composition-based stats. Identities = 16/57 (28%), Positives = 24/57 (42%), Gaps = 10/57 (17%) Query: 8 ALPVGVPVPWPSATPPTGWLKCNGAAFSAEEYPELAKAYP----------TNKLPDL 54 ++PVG + W + P GW+ +G A S +P L T +PDL Sbjct: 135 SVPVGTVMMWLAGPAPDGWVLLDGRAVSRAAFPTLFTLIGTTFGSGNGGTTFNVPDL 191 >UniRef50_B1KMR6 Tail Collar domain protein n=1 Tax=Shewanella woodyi ATCC 51908 RepID=B1KMR6_SHEWM Length = 179 Score = 48.6 bits (114), Expect = 9e-05, Method: Composition-based stats. Identities = 30/169 (17%), Positives = 54/169 (31%), Gaps = 21/169 (12%) Query: 11 VGVPVPWPSATPPTGWLKCNGAAFSAEEYPELAKAYP---------TNKLPDLRGEFIRG 61 +G P + KC+GA + + T LPDLRG Sbjct: 6 IGEIKMVGFNFAPRSYAKCDGALLPISQNTAMFSLLGTEFGGDGRTTFGLPDLRGRTPMH 65 Query: 62 WDDGRGIDT---GRSILS----IQGYATEDHAHGLPSRSTIVTDATINFYFDEIWVNSGT 114 +G G+ G+S S +Q H H + + + S + Sbjct: 66 QGNGPGLSPKTMGQSSGSESNTLQLNQMPKHTHSAQLDAVSTEGTSAVPDNNMYLAKSSS 125 Query: 115 DIIKRGNTNDAGLPAPDYGTFKTYKQSVDGLGAAASETRPRNIAFNYIV 163 + + N PD ++ + G +A + +P + N+I+ Sbjct: 126 GL---TSVNSYSNGTPD-TVISPHETNTAGGNSAINNMQPYQV-VNFII 169 >UniRef50_A5GA41 Phage Tail Collar domain protein n=1 Tax=Geobacter uraniireducens Rf4 RepID=A5GA41_GEOUR Length = 205 Score = 47.8 bits (112), Expect = 1e-04, Method: Composition-based stats. 
Identities = 26/146 (17%), Positives = 38/146 (26%), Gaps = 16/146 (10%) Query: 12 GVPVPWPSATPPTGWLKCNGAAFSAEEYPELAKAYPT---------NKLPDLRGEFIRGW 62 G + P W C+G Y L T LPDLRG G Sbjct: 32 GEIRMFGGDYAPENWHFCDGTLLPISGYDALYSLIGTAYGGDGINNFALPDLRGRLPIGQ 91 Query: 63 DDGR-------GIDTGRSILSIQGYATEDHAHGLPSRSTIVTDATINFYFDEIWVNSGTD 115 G G G + + T H H + + S T + Sbjct: 92 GQGTDLTNHPVGEKNGTETVGLTLAQTPAHTHTVNAASGTGTQPSPENGVWASLAAVNQF 151 Query: 116 IIKRGNTNDAGLPAPDYGTFKTYKQS 141 I + + + + T Q+ Sbjct: 152 ITPAEVKSPSIIHDMNSAAIGTGYQA 177 >UniRef50_C2FWA0 Phage tail collar domain protein n=2 Tax=Sphingobacterium spiritivorum RepID=C2FWA0_9SPHI Length = 196 Score = 47.8 bits (112), Expect = 1e-04, Method: Composition-based stats. Identities = 33/176 (18%), Positives = 48/176 (27%), Gaps = 29/176 (16%) Query: 17 WPSATPPTGWLKCNGAAFSAEEYPELAKAYP---------TNKLPDLRGEFIRGWDDGRG 67 + P GW+ C+G S Y L T +PDLRG G G G Sbjct: 11 FAGNFAPAGWILCDGRLLSINNYQVLYTVIGTTYGGDGVNTFGVPDLRGRVPIGTGQGPG 70 Query: 68 IDT---GRSILSIQ----GYATEDHAHGLPSRSTI------VTDATINFYFDEIWVNSGT 114 + G+ I + H H +T V+ A + G Sbjct: 71 LTNVVLGQKIGTETVTLLPANLPVHTHTAAVNATNVPFAVKVSAAAATLHAAATGSQLGQ 130 Query: 115 DIIKRGNTNDAGLPAPDYGTFKTYKQSVDGLGAAA-------SETRPRNIAFNYIV 163 + T PD + + A E + NYI+ Sbjct: 131 PMTDSIPTLGYNAANPDKTMGDSSLNTSGLTVNTAMMGSSLPHENMQPFLTTNYII 186 >UniRef50_A1SXZ3 Phage Tail Collar domain protein n=4 Tax=Bacteria RepID=A1SXZ3_PSYIN Length = 195 Score = 47.8 bits (112), Expect = 1e-04, Method: Composition-based stats. 
Identities = 30/153 (19%), Positives = 45/153 (29%), Gaps = 29/153 (18%) Query: 12 GVPVPWPSATPPTGWLKCNGAAFSAEEYPELAKAYP---------TNKLPDLRGEFIRGW 62 G P P GW C+G ++ L T LPD RG + Sbjct: 31 GEIAWVPYNFAPRGWASCDGQLLPITQHNALFSLLGTVYGGDGRTTFALPDARGRVMIHE 90 Query: 63 DDGRGIDTGRSIL--------SIQGYATEDHAH-----------GLPSRSTIVTDATINF 103 G G+ T R + ++Q H H P + + + + Sbjct: 91 GQGPGL-TNRRLGDKWGEEQVTLQTSQIPSHTHRQQASSGSPSSTSPEENVLASPSRTQL 149 Query: 104 YFDEIWVNSGTDIIKRGNTNDAGLPAPDYGTFK 136 Y D+ ++ D I N A Y T Sbjct: 150 YADDADIDMSADNISYTGGNLAHNNMQPYTTLH 182 >UniRef50_D0KG77 Tail Collar domain protein n=1 Tax=Pectobacterium wasabiae WPP163 RepID=D0KG77_PECWW Length = 270 Score = 47.8 bits (112), Expect = 1e-04, Method: Composition-based stats. Identities = 26/157 (16%), Positives = 42/157 (26%), Gaps = 17/157 (10%) Query: 11 VGVPVPWPSATPPTGWLKCNGAAFSAEEYPELAKAY----------PTNKLPDLRGEFIR 60 +G ++ P+G+L G + S Y L LPDLRG + Sbjct: 38 IGSVCYMVTSYCPSGYLPAAGQSLSINTYQALYSLIGNLWGGSQQTGNFTLPDLRGRSLV 97 Query: 61 GWDDG-------RGIDTGRSILSIQGYATEDHAHGLPSRSTIVTDATINFYFDEIWVNSG 113 G G RG G ++ H H T + + + V + Sbjct: 98 GSGQGTGLSLITRGQSLGAETATLAASNIAPHTHPTTQSLTNTFNVLVPATTGNLNVTAA 157 Query: 114 TDIIKRGNTNDAGLPAPDYGTFKTYKQSVDGLGAAAS 150 + P +V A + Sbjct: 158 LPLAAATPATGGATPTAGANFLTAISATVPVGAATQN 194 >UniRef50_B2SVF7 Phage-related protein n=3 Tax=Xanthomonas oryzae pv. oryzae RepID=B2SVF7_XANOP Length = 501 Score = 47.4 bits (111), Expect = 2e-04, Method: Composition-based stats. 
Identities = 30/161 (18%), Positives = 46/161 (28%), Gaps = 20/161 (12%) Query: 7 SALPVGVPVPWPSATPPTGWLKCNGAAFSAEEYPELAKAYP----------TNKLPDLRG 56 S L G V S PP G L C+GA S +Y L A T +P ++ Sbjct: 246 SFLLPGQIVVMASLYPPNGLLVCDGAEISRAKYAALFAAIGTVYGAGDGSTTFNVPKIKE 305 Query: 57 EFIRGWDDGRGIDTGRSILSIQGYATEDHAHGLPS-----RSTIVTDATINFYFDEIWVN 111 + ++ S H HG + + + Sbjct: 306 GTVI-----THTSAATAVGSYDPGQVISHTHGASAAAVGDHAHYTAINAAGNHAHGASAG 360 Query: 112 SGTDIIKRGNTNDAGLPAPDYGTFKTYKQSVDGLGAAASET 152 + D T+ G A T + G+ +AS Sbjct: 361 AAGDHAHYAWTDAQGHHAHGGSTSASGDHQHPGVIPSASIN 401 >UniRef50_C5BRC7 Phage tail collar domain protein n=1 Tax=Teredinibacter turnerae T7901 RepID=C5BRC7_TERTT Length = 195 Score = 47.4 bits (111), Expect = 2e-04, Method: Composition-based stats. Identities = 29/157 (18%), Positives = 38/157 (24%), Gaps = 20/157 (12%) Query: 11 VGVPVPWPSATPPTGWLKCNGAAFSAEEYPELAKAYPT---------NKLPDLRGEFIRG 61 +G + GW C G + ++ L T +PDLRG G Sbjct: 5 IGDIHLFGFNFGQEGWALCQGQLMAIQDNTALYSLIGTQYGGDGRSSFGIPDLRGRVPLG 64 Query: 62 ---------WDDGRGIDTGRSILSIQGYATEDHAHGLPSRSTIVTDATINFYFDEIWVNS 112 W GR D G IQ H+H S T Sbjct: 65 TGNPPGGSQWPMGR--DAGAETCVIQESQMPTHSHPANFSSQSSLFGTTEPADLTTPETG 122 Query: 113 GTDIIKRGNTNDAGLPAPDYGTFKTYKQSVDGLGAAA 149 A P Y ++ GL Sbjct: 123 AVLANVVAGGTGADKPEKIYTVTTANPVTLGGLDVTG 159 >UniRef50_Q89L34 Blr4714 protein n=1 Tax=Bradyrhizobium japonicum RepID=Q89L34_BRAJA Length = 1861 Score = 47.0 bits (110), Expect = 2e-04, Method: Composition-based stats. Identities = 20/60 (33%), Positives = 24/60 (40%), Gaps = 9/60 (15%) Query: 11 VGVPVPWPSATPPTGWLKCNGAAFSAEEYPELAKAYP---------TNKLPDLRGEFIRG 61 G VP+ S P G+L G + +YP L A T LPDL G I G Sbjct: 459 AGEIVPFLSNFAPAGYLLAAGQVLNIADYPNLYNAIGTTYGGDGVTTFALPDLTGRTIIG 518 >UniRef50_C7PE74 Tail Collar domain protein n=3 Tax=Bacteria RepID=C7PE74_CHIPD Length = 196 Score = 47.0 bits (110), Expect = 3e-04, Method: Composition-based stats. 
Identities = 34/179 (18%), Positives = 47/179 (26%), Gaps = 29/179 (16%) Query: 14 PVPWPSATPPTGWLKCNGAAFSAEEYPELAKAYP---------TNKLPDLRGEFIRGWDD 64 + P GW C G S + L T LPDLRG G Sbjct: 8 IAIFGGNFNPRGWYFCQGQIMSIAQNTALFSLLGTTYGGNGQTTFALPDLRGRAPIGVGQ 67 Query: 65 GRGIDT-------GRSILSIQGYATEDHAHGLPSRSTIVTDATINFYFDEIWVNSGTDII 117 G G+ G ++ H H S V A ++ Sbjct: 68 GPGLQPYAWGQLGGSETHTLIITEMPAHNHTALVNSLTVNPAASTAAGTTNIPDATMVPA 127 Query: 118 KRGNTNDAGLPAP--------DYGTFKTYKQSVDGLGAAASETRP-----RNIAFNYIV 163 K N P + T K + A ++P +A NYI+ Sbjct: 128 KLPNIGSGPTAQPIKGYAVADNTTTLAPAKVTGSVTVGIAGGSQPFSIQNPYLAVNYII 186 >UniRef50_B9K0L5 Putative uncharacterized protein n=1 Tax=Agrobacterium vitis S4 RepID=B9K0L5_AGRVS Length = 224 Score = 46.6 bits (109), Expect = 3e-04, Method: Composition-based stats. Identities = 28/153 (18%), Positives = 46/153 (30%), Gaps = 11/153 (7%) Query: 13 VPVPWPSATPPTGWLKCNGAAFSAEEYPELAKAY---------PTNKLPDLRGEFIRGWD 63 +P P GWL C G + +Y + LPDLRG+ G+ Sbjct: 7 TILPVGFNYAPDGWLMCWGQKLTINQYNAVYSLVSNFYGGDQQTYFNLPDLRGQMPIGYG 66 Query: 64 DGRGIDTGRSILSIQG-YATEDHAHGLPSRSTIVTDATINFYFDEIWVNSGTDIIKRGNT 122 +I + G ++ +P+ + I +GT + Sbjct: 67 QRTPTSPNYAIGNKGGNDTVSLNSTQIPAHTHAAVFTPTGNATVNIPAQTGTQTATMKAS 126 Query: 123 NDAGLPA-PDYGTFKTYKQSVDGLGAAASETRP 154 AG P G+ + A+ T P Sbjct: 127 PAAGTSQLPTAGSALAGGNTAATRIYGAASTTP 159 >UniRef50_Q58MY1 Predicted protein n=1 Tax=Prochlorococcus phage P-SSM2 RepID=Q58MY1_BPPRM Length = 597 Score = 46.6 bits (109), Expect = 3e-04, Method: Composition-based stats. 
Identities = 26/137 (18%), Positives = 47/137 (34%), Gaps = 22/137 (16%) Query: 9 LPVGVPVPW--PSATPPTGWLKCNGAAFSAEEYPELAKAYPTNKLPDLRGEFIRGWDDGR 66 +P GV V W P+GW+ C+G S PDLR +F+ G Sbjct: 356 IPAGVVVMWSGAQNAIPSGWVLCDGNNSS----------------PDLRDKFVIGAGSNY 399 Query: 67 GIDTGRSILSIQGYATEDHAHGLPSRSTIVTDATINFYFDEIWVNSGTDIIKRGNTNDAG 126 +D + DH+H + + T +F + +S + + +G Sbjct: 400 AVD---NTGGSADAVVVDHSHSASTSVSGAGAHTHSFSASDSHTHSFSGSG-SDTFSGSG 455 Query: 127 LPAPDYGTFKTYKQSVD 143 + ++ S+ Sbjct: 456 SHTHSFSGSGSHGHSLS 472 >UniRef50_B1J270 Tail Collar domain protein n=8 Tax=Bacteria RepID=B1J270_PSEPW Length = 195 Score = 46.6 bits (109), Expect = 3e-04, Method: Composition-based stats. Identities = 30/148 (20%), Positives = 43/148 (29%), Gaps = 22/148 (14%) Query: 12 GVPVPWPSATPPTGWLKCNGAAFSAEEYPELAKAYP---------TNKLPDLRGEFIRGW 62 G + P GW C G S + L T LPDLRG G+ Sbjct: 6 GEIKMFAGNFAPRGWAFCQGQLMSIAQNNALFALLGTTYGGDGKTTFALPDLRGRGPIGF 65 Query: 63 DDGRGIDTGRSILSIQGYATEDHAHGLPSRSTIVTDATINFYFDEIWVNSGTDIIKRGNT 122 G G+ + A G+ + + ++ + + + Sbjct: 66 GTGPGL----------ADVVQGEAGGVNDVTLLQSNMPMQ---QAVIPAQTVSVAIPAVE 112 Query: 123 NDAGLPAPDYGTFKTYKQSVDGLGAAAS 150 DA AP G G GAAA Sbjct: 113 GDANAAAPSSGNVLAKSFDSSGAGAAAD 140 >UniRef50_C3X3W3 Predicted protein n=1 Tax=Oxalobacter formigenes HOxBLS RepID=C3X3W3_OXAFO Length = 315 Score = 46.6 bits (109), Expect = 3e-04, Method: Composition-based stats. 
Identities = 28/164 (17%), Positives = 47/164 (28%), Gaps = 21/164 (12%) Query: 3 LGEGSALPVGVPVPWPSAT--PPTGWLKCNGAAFSAEEYPELAKAYPTNKLPDLRGEFIR 60 L + +P G+ + A P+GWL CNG N PDLR F+ Sbjct: 170 LDKSEGIPSGLIAMYSGAADHIPSGWLLCNG----------------ENGTPDLRDRFVV 213 Query: 61 GWDDGRGIDTGRSILSIQGYATEDHAHGLPSRSTIVTDATINFYFDEIWVNSGTDIIKRG 120 G + + A + + + + + Sbjct: 214 GAGKAYAV---YAKGGATTGAVSGQTGETTLTINQIPSHNHGVGYYISRSGNAGNGFQVE 270 Query: 121 NTNDAGLPAPDYGTFKTYKQSVDGLGAAASETRPRNIAFNYIVR 164 T D Y T + Q + + T P A +I++ Sbjct: 271 RTTDNFAFTYLYTTVQGGNQPHSHSLSGSVSTVPPYYALCFILK 314 >UniRef50_Q12HS4 Phage Tail Collar n=1 Tax=Shewanella denitrificans OS217 RepID=Q12HS4_SHEDO Length = 198 Score = 46.6 bits (109), Expect = 3e-04, Method: Composition-based stats. Identities = 34/183 (18%), Positives = 54/183 (29%), Gaps = 30/183 (16%) Query: 11 VGVPVPWPSATPPTGWLKCNGAAFSAEEYPELAKAYP---------TNKLPDLRGEFIRG 61 +G + + P W CNG E L + LPDLRG Sbjct: 6 IGEIRMFAGSYAPQYWAFCNGQLLPIAENQALFSLLGYVYGGTQGVSFALPDLRGRVPVH 65 Query: 62 WDDGRGIDT-------GRSILSIQGYATEDHAHGLPSRSTIVTDATINFYFDEIWVNSGT 114 G G+ + G +S+ HAH + ++T + ++ + Sbjct: 66 VGTGAGLSSKALGQRGGTEYVSLTSAQLPAHAHMVDLKATGEVNVKMSASSAKGDTAIPG 125 Query: 115 DIIKRGNTNDAGLPAPDYGTFK-------TYKQSVDGLGAAA--SETRPRNI-----AFN 160 +P Y T +V+ G A RP I A N Sbjct: 126 PTTVPAQVLSGLIPLNAYSTSPDTTLLPVNTSTTVNVSGNTAMMGAGRPVVIEQPFLAIN 185 Query: 161 YIV 163 +I+ Sbjct: 186 FII 188 >UniRef50_C7PNC4 Tail Collar domain protein n=1 Tax=Chitinophaga pinensis DSM 2588 RepID=C7PNC4_CHIPD Length = 192 Score = 46.2 bits (108), Expect = 4e-04, Method: Composition-based stats. 
Identities = 39/178 (21%), Positives = 56/178 (31%), Gaps = 26/178 (14%) Query: 12 GVPVPWPSATPPTGWLKCNGAAFSAEEYPELA---------KAYPTNKLPDLR------- 55 G + P W CNGA + +Y L T LPDLR Sbjct: 6 GEIRLFAGNFAPVNWNVCNGALLAISQYDALFSLIGTQYGGDGITTFALPDLRVRVPISM 65 Query: 56 GEFIRGWDDGR---GIDTGRSILSIQGYATEDHAHGLPSRS-TIVTDATINFYFDEIWVN 111 G+ G G G +++ +H H L + + T T +N N Sbjct: 66 GQISASGGTGNYVLGQAAGTPNITLLTSNIPNHTHPLVAVNATATTGDPVNNMLAVTNGN 125 Query: 112 SGTDIIKRGNTN-DAGLPAPDYGTFKTYK----QSVDGLGAA-ASETRPRNIAFNYIV 163 + T + N LP P GT S+ G A + + NYI+ Sbjct: 126 NNTGPTAYPDVNLYTTLPLPGGGTTIPNALMDPASISPTGGTQAHDNMMPYVTINYII 183 >UniRef50_A1TUY7 Phage Tail Collar domain protein n=4 Tax=Acidovorax RepID=A1TUY7_ACIAC Length = 204 Score = 46.2 bits (108), Expect = 4e-04, Method: Composition-based stats. Identities = 17/61 (27%), Positives = 25/61 (40%), Gaps = 9/61 (14%) Query: 11 VGVPVPWPSATPPTGWLKCNGAAFSAEEYPELAKAYP---------TNKLPDLRGEFIRG 61 +G + W +A P GW C+G+ + + P L T +LPDLR G Sbjct: 6 IGTVLLWTAAFVPRGWALCDGSVLNITQNPALFAILGNRFGGDGRTTFQLPDLRNRVPMG 65 Query: 62 W 62 Sbjct: 66 L 66 >UniRef50_B4VMZ3 Phage Tail Collar Domain family n=1 Tax=Microcoleus chthonoplastes PCC 7420 RepID=B4VMZ3_9CYAN Length = 215 Score = 45.9 bits (107), Expect = 5e-04, Method: Composition-based stats. 
Identities = 31/175 (17%), Positives = 52/175 (29%), Gaps = 28/175 (16%) Query: 11 VGVPVPWPSATPPTGWLKCNGAAFSA--EEYPELAKAY------------PTNKLPDLRG 56 +G P GW C+G +E LA T ++PDLRG Sbjct: 38 IGQIAMVAFDFAPDGWYLCDGTLHDIISDENDILASILAGKYNQPGDPTQGTFRVPDLRG 97 Query: 57 EFIRGWDDGRG-IDTGRS-------ILSIQGYATEDHAHGLPSRSTIVTDATINFYFDEI 108 G + G D R+ S Q T+ + I Sbjct: 98 RVPLGINPMAGNSDNDRNSYGLGDKSGSEQVELTQA------NLPEIQMKLKATNADSNE 151 Query: 109 WVNSGTDIIKRGNTNDAGLPAPDYGTFKTYKQSVDGLGAAASETRPRNIAFNYIV 163 SG+ ++ + ++ A ++ S+ A +A N+I+ Sbjct: 152 TSPSGSALLSKPRSSIYATGATNFVEMDCISSSIGSSENNAHNNLQPYLAINFII 206 >UniRef50_Q4KAW2 Phage tail collar domain protein n=1 Tax=Pseudomonas fluorescens Pf-5 RepID=Q4KAW2_PSEF5 Length = 185 Score = 45.9 bits (107), Expect = 5e-04, Method: Composition-based stats. Identities = 28/159 (17%), Positives = 48/159 (30%), Gaps = 19/159 (11%) Query: 12 GVPVPWPSATPPTGWLKCNGAAFSAEEYPELAKAYP---------TNKLPDLRGEFIRGW 62 G +P P W C+G+ + YP L + LP+L+ G Sbjct: 6 GEIRLFPFNFAPDNWAVCDGSPLLVQNYPALYSVIGLTYGGTAGTSFNLPNLKSRVTIGT 65 Query: 63 DDGRGIDTGRSILSIQGYATED--------HAHGLPSRSTIVTDATINFYFDEIWVNSGT 114 G + T RS+ G T H H + ++ T ++ ++ Sbjct: 66 GQGANL-TNRSLGQAVGGDTTTLLPAHFAPHTHHVLAKDGTDTTGALDLANGTAYLAQPR 124 Query: 115 DIIKRGNTNDAGLPAPDYGTFKTYKQSVDGLGAAASETR 153 + + N AP + + A TR Sbjct: 125 GV-RLYNGTVPPTAAPVPSLHPSTVTTNGTEAAKTGSTR 162 >UniRef50_A8T9J8 Putative uncharacterized protein n=1 Tax=Vibrio sp. AND4 RepID=A8T9J8_9VIBR Length = 242 Score = 45.9 bits (107), Expect = 5e-04, Method: Composition-based stats. 
Identities = 26/114 (22%), Positives = 41/114 (35%), Gaps = 20/114 (17%) Query: 6 GSALPVGVPVPWPSAT--PPTGWLKCNGAAFSAEEYPELAKAYPTNKLPDLRGEFIRG-- 61 + +PVG V W S + PP GW C+G+ T +PDL G F+ G Sbjct: 84 NALIPVGTIVAWGSTSNNPPKGWALCDGS---------------TAGVPDLTGCFLMGNK 128 Query: 62 -WDDGRGIDTGRSILSIQGYATEDHAHGLPSRSTIVTDATINFYFDEIWVNSGT 114 + D R + + ++ H L D + + +GT Sbjct: 129 TYGTNAVSDNRRILGTSSNASSLVLGHKLNINQIPSHDHQMTIMQEHSKSKNGT 182 >UniRef50_B9Z2Z1 Putative uncharacterized protein n=1 Tax=Lutiella nitroferrum 2002 RepID=B9Z2Z1_9NEIS Length = 373 Score = 45.9 bits (107), Expect = 5e-04, Method: Composition-based stats. Identities = 14/43 (32%), Positives = 24/43 (55%) Query: 3 LGEGSALPVGVPVPWPSATPPTGWLKCNGAAFSAEEYPELAKA 45 + + S +P G+P+ WP TPP+ + +G+A + Y L A Sbjct: 94 IAKRSGMPAGIPLDWPGITPPSWAVVRDGSALNRASYASLFAA 136 >UniRef50_B3PJI6 Microcystin dependent protein; MdpB n=1 Tax=Cellvibrio japonicus Ueda107 RepID=B3PJI6_CELJU Length = 175 Score = 45.9 bits (107), Expect = 5e-04, Method: Composition-based stats. Identities = 30/162 (18%), Positives = 56/162 (34%), Gaps = 13/162 (8%) Query: 12 GVPVPWPSATPPTGWLKCNGAAFSAEEYPELAKAYP---------TNKLPDLRGEFIRGW 62 G +P + P + C G + ++ L T LP+L+G+ + Sbjct: 7 GEIRQFPYSFAPRNFSYCQGQILTIQQNAPLFSLLGTLYGGNGQTTFALPNLQGQVLMHQ 66 Query: 63 DDGRGIDTGRSILSIQGYA-TEDHAHGLPSRSTIVTDATINFYFDEIWVNSGTDIIKRGN 121 G G+ R++ G A +P+ + ++ T+N + V + Sbjct: 67 GSGPGLTP-RTVGESSGSAGVSLIQAEMPNHNHLMVAKTVN-PAASVNVAEDAYLSISRA 124 Query: 122 TNDAGLPAPDYGTFKTYKQSVDGLGAAASETRPRNIAFNYIV 163 + + + SV G GA A E R IA + + Sbjct: 125 QTAYSPQQDNLVSLEPTMLSVTGSGA-AHENRQPYIAMPFCI 165 >UniRef50_Q2S9H9 Microcystin-dependent protein n=3 Tax=Proteobacteria RepID=Q2S9H9_HAHCH Length = 179 Score = 45.9 bits (107), Expect = 6e-04, Method: Composition-based stats. 
Identities = 28/170 (16%), Positives = 49/170 (28%), Gaps = 23/170 (13%) Query: 11 VGVPVPWPSATPPTGWLKCNGAAFSAEEYPELAKAYP---------TNKLPDLRGEFIRG 61 +G + + P W C+G +A + L T LPD+RG Sbjct: 6 IGEIRIFAATFAPRNWSFCDGQVLAASQQAALFSLLGSFYGGDGRTTFALPDMRGRLPLH 65 Query: 62 WDDGRGIDT-------GRSILSIQGYATEDHAHGL-PSRSTIVTDATINFYFDEIWVNSG 113 + G G+ G +++ H H L S + D + + + + Sbjct: 66 FGQGPGLTPYAIGARVGVESVTVTMENMPPHTHTLMASNDAVTVDVSPSNQVTGVTDPAA 125 Query: 114 TDIIKRGNTNDAGLPAPDYGTFKTYKQSVDGLGAAASETRPRNIAFNYIV 163 GN P Y + +A N+I+ Sbjct: 126 PFYTTTGNI------TPLASEAVGYAGGAQNQQTSPHSIMMPYLALNFII 169 >UniRef50_Q4KAW4 Putative uncharacterized protein n=1 Tax=Pseudomonas fluorescens Pf-5 RepID=Q4KAW4_PSEF5 Length = 181 Score = 45.5 bits (106), Expect = 6e-04, Method: Composition-based stats. Identities = 27/123 (21%), Positives = 33/123 (26%), Gaps = 20/123 (16%) Query: 12 GVPVPWPSATPPTGWLKCNGAAFSAEEYPELAKAYP---------TNKLPDLRGEFIRGW 62 G +P A P GWL C G Y LA T LPDLRG G Sbjct: 6 GEIRLFPWAWAPQGWLLCQGQILDVVNYTALASLLGDRYGGDGRTTFGLPDLRGRAALGE 65 Query: 63 D---DGRGIDTGRSILSIQGYA--------TEDHAHGLPSRSTIVTDATINFYFDEIWVN 111 + + + S+ G H H T T + Sbjct: 66 NPVASTSPVLGVHELGSMDGAEWVALTLNNLPAHNHVANVAVTAGTGGPAGNIPAISSTS 125 Query: 112 SGT 114 G Sbjct: 126 KGA 128 >UniRef50_B9M3Z9 Tail Collar domain protein n=2 Tax=Bacteria RepID=B9M3Z9_GEOSF Length = 184 Score = 45.5 bits (106), Expect = 7e-04, Method: Composition-based stats. 
Identities = 24/158 (15%), Positives = 40/158 (25%), Gaps = 16/158 (10%) Query: 11 VGVPVPWPSATPPTGWLKCNGAAFSAEEYPELAKAY---------PTNKLPDLRGEFIRG 61 +G + P GW C+G + ++ L T LP+L G G Sbjct: 6 IGEIRAFAFTYAPYGWATCDGQIMNVQQNTALFSIISNTYGGDGRTTFGLPNLSGRAPMG 65 Query: 62 WDDGRGIDT-------GRSILSIQGYATEDHAHGLPSRSTIVTDATINFYFDEIWVNSGT 114 + G + G + +S+ H HG + S S Sbjct: 66 FGTGPALTPQTLGQSLGEASVSLATNNFPPHTHGFNAVSNTTATLATAADSYVAKAPSSG 125 Query: 115 DIIKRGNTNDAGLPAPDYGTFKTYKQSVDGLGAAASET 152 N+ A + G + Sbjct: 126 KPPVTTNSFQPSASAGTQLAADAVLLAGTAPGTMPHKN 163 >UniRef50_Q1QPI6 Phage Tail Collar n=2 Tax=Proteobacteria RepID=Q1QPI6_NITHX Length = 176 Score = 45.5 bits (106), Expect = 7e-04, Method: Composition-based stats. Identities = 32/158 (20%), Positives = 51/158 (32%), Gaps = 23/158 (14%) Query: 11 VGVPVPWPSATPPTGWLKCNGAAFSAEEYPELA---------KAYPTNKLPDLRGEFIRG 61 VG + + P GWL CNG+ +Y L T P+L G Sbjct: 6 VGEIRLFGFSRVPQGWLPCNGSLQPISQYEVLFSLVGTTYGGDGVTTFGTPNLSGRVPVH 65 Query: 62 WDDGRGIDTGRSILSIQGYATEDHAHGLPSRSTIVTDATINFYFDEIWVNSGTDIIKRGN 121 G G+ R I I G + T+++ + + + Sbjct: 66 SGTGPGVSP-RVIGEI----------GGSEKVTLLSAHMPYHDHPMVATTGPANSSQITP 114 Query: 122 TNDAGLPAPDYGTFKTYKQSVDGLGAAASETRP--RNI 157 + + G A D + + + V G A + T RNI Sbjct: 115 SLELGTVAGDTM-YTSDVKDVGGANTAPTSTSMAGRNI 151 >UniRef50_Q5GQB8 Putative short tail fibre n=1 Tax=Synechococcus phage S-PM2 RepID=Q5GQB8_BPSYP Length = 449 Score = 45.5 bits (106), Expect = 7e-04, Method: Composition-based stats. Identities = 16/46 (34%), Positives = 26/46 (56%), Gaps = 2/46 (4%) Query: 2 GLGEGSALPVGVPVPWPSA--TPPTGWLKCNGAAFSAEEYPELAKA 45 G+ A +G +PW + P GW+ C+G++ A +YP LA+A Sbjct: 6 GIKSAKAAAIGTIMPWTGNISSIPDGWIICDGSSIPARDYPLLARA 51 >UniRef50_B0SX68 Tail Collar domain protein n=6 Tax=Bacteria RepID=B0SX68_CAUSK Length = 179 Score = 45.1 bits (105), Expect = 9e-04, Method: Composition-based stats. 
Identities = 26/116 (22%), Positives = 38/116 (32%), Gaps = 15/116 (12%) Query: 11 VGVPVPWPSATPPTGWLKCNGAAFSAEEYPELAKAYP---------TNKLPDLRGEFI-- 59 VG + PP+GW C+G + E L T +PDLRG Sbjct: 6 VGEIRIFGGNFPPSGWAFCDGQLMAISENDTLFNLIGTTYGGDGQETFGIPDLRGRAPVH 65 Query: 60 RGWDDGR----GIDTGRSILSIQGYATEDHAHGLPSRSTIVTDATINFYFDEIWVN 111 +G G G G +++ H H L + ST + T + Sbjct: 66 QGTQAGTTYVIGERAGVESVTLTANQMAQHTHPLMAASTAGSVGTPTGQTMLSSMG 121 >UniRef50_A9C0W7 Tail Collar domain protein n=6 Tax=Proteobacteria RepID=A9C0W7_DELAS Length = 166 Score = 45.1 bits (105), Expect = 9e-04, Method: Composition-based stats. Identities = 33/163 (20%), Positives = 50/163 (30%), Gaps = 23/163 (14%) Query: 12 GVPVPWPSATPPTGWLKCNGAAFSAEEYPELAKAYPT---------NKLPDLRGEFIRGW 62 G + P GWL CNGA L T LPDLRG G+ Sbjct: 6 GEIRAFAFGQVPRGWLLCNGAILPISTNQALFALLGTQYGGNGTSNFALPDLRGRAPIGY 65 Query: 63 DDGRGIDT--GRSILSIQGYATEDHAHGLPSRSTIVTDATINFYFDEIWVNSGTDIIKRG 120 G + G +++ H H L S + + T G +++ Sbjct: 66 GGGVVLGLIDGTESVTLIPSQMPLHTHQLLSSAAVATTNVP-----------GGNVMAEA 114 Query: 121 NTNDAGLPAPDYGTFKTYKQSVDGLGAAASETRPRNIAFNYIV 163 + AP S G G+ E ++ N+ + Sbjct: 115 ANGLSAYGAPTNSFMAAPAVSTSG-GSQPHENMQPSLVINWCI 156 >UniRef50_C7ID92 Tail Collar domain protein n=1 Tax=Clostridium papyrosolvens DSM 2782 RepID=C7ID92_9CLOT Length = 81 Score = 45.1 bits (105), Expect = 0.001, Method: Composition-based stats. Identities = 18/58 (31%), Positives = 24/58 (41%), Gaps = 10/58 (17%) Query: 9 LPV-GVPVPWPSATPPTGWLKCNGAAFSAEEYPELAKAYP---------TNKLPDLRG 56 P+ G+ +P + P GW CNGA + + L T LPDLRG Sbjct: 3 CPIMGMIKLFPFSYVPRGWAICNGAILNIQSNTALYSLLGVQFGGNGSTTFGLPDLRG 60 >UniRef50_C2FWA1 Phage tail collar domain protein n=2 Tax=Sphingobacterium spiritivorum RepID=C2FWA1_9SPHI Length = 199 Score = 45.1 bits (105), Expect = 0.001, Method: Composition-based stats. 
Identities = 32/180 (17%), Positives = 52/180 (28%), Gaps = 34/180 (18%) Query: 17 WPSATPPTGWLKCNGAAFSAEEYPELAKAYP---------TNKLPDLRGEFIRGWDDGRG 67 + P W CNG + L T +LPD RG G G+ Sbjct: 11 FAGNFAPKYWALCNGQTLAINTNQALFSLLGTTYGGNGVTTFQLPDFRGRIPVGTGSGQT 70 Query: 68 IDTGRSILSIQGYATED----------HAHGLPSRSTIVTDATINFYFDEIWVNSGTDII 117 + G +I S Q TE H H + + + + ++ + Sbjct: 71 VGMG-TITSGQKVGTESVTLVENNLPAHFHQVSISGKLPLTVSKSIADKSTVIDGLSLAA 129 Query: 118 KRGNTNDAGLPAPDYGTFKTY------KQSVDGL--------GAAASETRPRNIAFNYIV 163 + A P Y + + G+ G A + R + NYI+ Sbjct: 130 PARSAGRAKTPTLGYNSQTGSISLNPASIDLSGMTLTLAPIGGNAVHDNRQPALGLNYII 189 Searching..................................................done Results from round 3 Score E Sequences producing significant alignments: (bits) Value Sequences used in model and found again: UniRef50_P33227 Putative protein stfE (Fragment) n=60 Tax=Entero... 190 1e-47 UniRef50_C6V0Q3 Phage tail fiber protein n=13 Tax=Enterobacteria... 181 1e-44 UniRef50_A9R3H4 Tail collar domain protein n=20 Tax=Yersinia Rep... 173 1e-42 UniRef50_Q9AHZ5 YdaB n=29 Tax=Enterobacteriaceae RepID=Q9AHZ5_PHOLU 169 4e-41 UniRef50_B3I9S3 DNA inversion product n=5 Tax=Escherichia coli R... 167 2e-40 UniRef50_D2TR91 Putative phage tail fibre protein n=1 Tax=Citrob... 165 4e-40 UniRef50_Q9MCR6 Tail fiber n=1 Tax=Enterobacteria phage HK97 Rep... 164 7e-40 UniRef50_D2U1K0 Phage tail protein n=1 Tax=Arsenophonus nasoniae... 162 4e-39 UniRef50_Q6D3Y6 Probable bacteriophage variable tail fiber prote... 161 1e-38 UniRef50_D2U2G6 Phage tail assembly protein n=1 Tax=Arsenophonus... 160 2e-38 UniRef50_D2BT14 Tail Collar domain protein n=1 Tax=Dickeya dadan... 157 8e-38 UniRef50_A4WEL3 Phage Tail Collar domain protein n=2 Tax=Enterob... 157 1e-37 UniRef50_D0KLI8 Tail Collar domain protein n=1 Tax=Pectobacteriu... 156 2e-37 UniRef50_C6DE08 Tail Collar domain protein n=1 Tax=Pectobacteriu... 156 3e-37 UniRef50_C6CGA0 Tail Collar domain protein n=5 Tax=Enterobacteri... 
156 3e-37 UniRef50_Q7N348 Similarities with tail fiber protein n=1 Tax=Pho... 156 3e-37 UniRef50_B2PZV1 Putative uncharacterized protein n=2 Tax=Gammapr... 155 4e-37 UniRef50_B7MJL6 Putative phage tail protein n=3 Tax=Escherichia ... 154 1e-36 UniRef50_Q7N5C0 Similarities with DNA inversion product n=5 Tax=... 152 4e-36 UniRef50_UPI000190EC42 bacteriophage tail fiber protein n=1 Tax=... 152 4e-36 UniRef50_C6CP84 Tail Collar domain protein n=2 Tax=Dickeya RepID... 151 1e-35 UniRef50_Q7N2Q1 Similar to Sc/SvQ protein of Escherichia coli pl... 150 1e-35 UniRef50_C7BSQ1 Side tail fiber protein homolog from lambdoid pr... 150 2e-35 UniRef50_C7BL21 Similarities with phage tail fiber protein n=1 T... 150 2e-35 UniRef50_C4UEH4 Variable tail fiber protein n=3 Tax=Yersinia Rep... 150 2e-35 UniRef50_D2BYH6 Tail Collar domain protein n=1 Tax=Dickeya dadan... 149 3e-35 UniRef50_Q6D2U8 Bacteriophage tail fiber protein n=1 Tax=Pectoba... 149 4e-35 UniRef50_C6DA10 Tail Collar domain protein n=7 Tax=Enterobacteri... 148 6e-35 UniRef50_UPI0001B5347E putative variable tail fiber protein n=1 ... 147 1e-34 UniRef50_C5BH14 Tail fiber n=2 Tax=Enterobacteriaceae RepID=C5BH... 147 1e-34 UniRef50_Q7N6T1 Similarities with DNA inversion product n=2 Tax=... 147 2e-34 UniRef50_B7NJP1 Putative side tail fiber protein homolog from la... 146 2e-34 UniRef50_B3I8J5 DNA inversion product n=3 Tax=Enterobacteriaceae... 146 3e-34 UniRef50_B7US81 Predicted tail fiber protein n=16 Tax=root RepID... 145 5e-34 UniRef50_Q7N2R3 Similar to tail fiber assembly protein from bact... 142 3e-33 UniRef50_C3X8U2 Phage Tail Collar Domain containing protein n=1 ... 141 7e-33 UniRef50_C6CGA4 Tail Collar domain protein n=7 Tax=Enterobacteri... 140 2e-32 UniRef50_C6ABW9 Phage tail collar protein n=1 Tax=Bartonella gra... 140 2e-32 UniRef50_D1TPQ4 Phage tail collar domain protein n=1 Tax=Yersini... 139 3e-32 UniRef50_C4S5W0 Putative uncharacterized protein n=2 Tax=Yersini... 
138 5e-32 UniRef50_C4GFX3 Putative uncharacterized protein n=2 Tax=Kingell... 138 7e-32 UniRef50_C6C6Z0 Tail Collar domain protein n=1 Tax=Dickeya dadan... 137 1e-31 UniRef50_Q65WH4 Putative uncharacterized protein n=1 Tax=Mannhei... 136 2e-31 UniRef50_Q1I687 Putative phage variable tail fibre protein n=1 T... 136 3e-31 UniRef50_B8DPV9 Tail Collar domain protein n=1 Tax=Desulfovibrio... 135 5e-31 UniRef50_C8PDQ5 Phage Tail Collar Domain protein n=1 Tax=Campylo... 135 6e-31 UniRef50_C6CP88 Tail Collar domain protein n=5 Tax=Enterobacteri... 134 8e-31 UniRef50_C6C5D2 Tail Collar domain protein n=2 Tax=Dickeya dadan... 134 9e-31 UniRef50_D0KGE1 Shufflon domain protein n=1 Tax=Pectobacterium w... 133 2e-30 UniRef50_Q7NAA0 Complete genome; segment 1/17 n=2 Tax=Photorhabd... 133 3e-30 UniRef50_A9IRI0 Phage related protein n=9 Tax=Bartonella RepID=A... 130 1e-29 UniRef50_B2ZY49 Phage tail collar domain protein n=1 Tax=Ralston... 129 4e-29 UniRef50_A9DEM1 Tail protein 2 n=1 Tax=Yersinia phage PY100 RepI... 129 4e-29 UniRef50_C4TTI1 Tail fiber protein n=2 Tax=Yersinia RepID=C4TTI1... 129 5e-29 UniRef50_C3X8Y3 Putative uncharacterized protein n=1 Tax=Oxaloba... 128 7e-29 UniRef50_A7FIU0 Tail collar domain protein n=1 Tax=Yersinia pseu... 128 8e-29 UniRef50_C6CG98 Tail Collar domain protein n=1 Tax=Dickeya zeae ... 128 8e-29 UniRef50_C6BT48 Tail Collar domain protein n=1 Tax=Desulfovibrio... 127 1e-28 UniRef50_C3X912 Phage tail collar domain-containing protein n=1 ... 126 3e-28 UniRef50_A6VBH2 Putative tail fiber protein n=2 Tax=Pseudomonas ... 125 4e-28 UniRef50_C3KCU2 Putative phage tail fiber-related protein n=1 Ta... 125 6e-28 UniRef50_C6C5D4 Tail Collar domain protein n=1 Tax=Dickeya dadan... 124 8e-28 UniRef50_Q4KHC6 Tail fibre protein, putative n=1 Tax=Pseudomonas... 124 9e-28 UniRef50_Q3KH70 Putative phage tail fiber-related protein n=1 Ta... 124 1e-27 UniRef50_A6E6G6 Phage Tail Collar n=1 Tax=Pedobacter sp. BAL39 R... 
124 1e-27 UniRef50_B5S308 Phage tail collar protein n=2 Tax=Ralstonia sola... 123 2e-27 UniRef50_C3X3G6 Putative uncharacterized protein n=1 Tax=Oxaloba... 123 2e-27 UniRef50_B6VKW8 Sc/svq protein n=2 Tax=Photorhabdus asymbiotica ... 122 4e-27 UniRef50_B2FIY3 Putative phage collar protein n=1 Tax=Stenotroph... 121 8e-27 UniRef50_A0NQ95 Putative tail fiber-related protein n=1 Tax=Labr... 121 1e-26 UniRef50_A1VSH6 Phage Tail Collar domain protein n=1 Tax=Polarom... 121 1e-26 UniRef50_Q7Y2B3 Gp12 Short tail fibers n=2 Tax=unclassified T4-l... 120 1e-26 UniRef50_B4T041 Gp19 n=1 Tax=Salmonella enterica subsp. enterica... 119 3e-26 UniRef50_D0KGE5 Tail Collar domain protein n=4 Tax=Pectobacteriu... 119 3e-26 UniRef50_Q7N047 Similarities with bacteriophage protein n=3 Tax=... 117 9e-26 UniRef50_A9IXL3 Phage-related protein n=6 Tax=Bartonella RepID=A... 117 1e-25 UniRef50_C7BSQ6 Putative tail fiber protein n=1 Tax=Photorhabdus... 115 4e-25 UniRef50_Q116W7 Phage Tail Collar n=1 Tax=Trichodesmium erythrae... 115 5e-25 UniRef50_D0KGE3 Tail Collar domain protein n=1 Tax=Pectobacteriu... 114 1e-24 UniRef50_P76072 Side tail fiber protein homolog from lambdoid pr... 113 2e-24 UniRef50_Q7P172 Probable bacteriophge tail fiber protein n=1 Tax... 112 4e-24 UniRef50_P03764 Side tail fiber protein n=2 Tax=root RepID=STF_L... 112 4e-24 UniRef50_A9ITX5 Phage-related protein n=6 Tax=Bartonella RepID=A... 111 7e-24 UniRef50_B5FQX9 Side tail fiber protein n=2 Tax=Salmonella enter... 111 8e-24 UniRef50_B5JYG6 Phage Tail Collar Domain family (Fragment) n=1 T... 108 5e-23 UniRef50_C5J9F2 Putative uncharacterized protein (Fragment) n=1 ... 108 7e-23 UniRef50_A4PE45 Tail fiber protein gpH n=3 Tax=root RepID=A4PE45... 107 2e-22 UniRef50_B7LKX7 Putative side tail phage protein n=2 Tax=Escheri... 107 2e-22 UniRef50_D0IJ09 Tail fiber protein H putative n=1 Tax=Vibrio sp.... 106 2e-22 UniRef50_A4TT73 Phage tail protein n=30 Tax=Enterobacteriaceae R... 
105 6e-22 UniRef50_C4MYW8 Gp12 Short tail fibers n=1 Tax=Enterobacteria ph... 105 7e-22 UniRef50_D0ZBI4 Putative tail fiber protein n=1 Tax=Edwardsiella... 104 2e-21 UniRef50_B3XIH6 GpH n=7 Tax=Enterobacteriaceae RepID=B3XIH6_ECOLX 103 2e-21 UniRef50_B3I2W7 Phage Tail Collar Domain family n=1 Tax=Escheric... 103 3e-21 UniRef50_D2V5I7 Microcystin-dependent protein n=1 Tax=Naegleria ... 102 3e-21 UniRef50_B8HZW5 Tail Collar domain protein n=2 Tax=Clostridium R... 102 4e-21 UniRef50_Q7P176 Probable bacteriophge tail fiber protein n=1 Tax... 102 5e-21 UniRef50_Q094A8 Phage Tail Collar Domain family n=1 Tax=Stigmate... 101 7e-21 UniRef50_B7UGJ3 Predicted tai fiber protein n=15 Tax=Escherichia... 101 9e-21 UniRef50_UPI00019136B5 bacteriophage tail fiber protein n=7 Tax=... 100 2e-20 UniRef50_B8FJJ3 Tail Collar domain protein n=1 Tax=Desulfatibaci... 100 3e-20 UniRef50_Q99362 Protein 37 (Fragment) n=1 Tax=Enterobacteria pha... 99 7e-20 UniRef50_UPI0001A44C27 bacteriophage tail fiber protein n=2 Tax=... 98 8e-20 UniRef50_Q0BEK5 Phage Tail Collar domain protein n=1 Tax=Burkhol... 98 9e-20 UniRef50_A4NHY2 Probable tail fiber protein n=1 Tax=Haemophilus ... 98 1e-19 UniRef50_A1TNG3 Phage Tail Collar domain protein n=9 Tax=Bacteri... 97 1e-19 UniRef50_C2I7P2 Phage-related tail fiber protein n=1 Tax=Vibrio ... 97 1e-19 UniRef50_Q56BI6 Gp12 short tail fibers n=1 Tax=Enterobacteria ph... 97 2e-19 UniRef50_B3QRT1 Tail Collar domain protein n=1 Tax=Chloroherpeto... 97 2e-19 UniRef50_Q3YZL1 Phage protein-related n=42 Tax=Enterobacteriacea... 97 2e-19 UniRef50_B3HKW0 Phage Tail Collar Domain protein n=11 Tax=Entero... 96 4e-19 UniRef50_Q8Y365 Putative uncharacterized protein n=4 Tax=Ralston... 96 5e-19 UniRef50_B5PP06 Side tail fiber protein n=2 Tax=Salmonella enter... 96 6e-19 UniRef50_B7L485 Putative tail fiber protein n=4 Tax=Escherichia ... 95 7e-19 UniRef50_C3X8I7 Putative uncharacterized protein n=1 Tax=Oxaloba... 
94 2e-18 UniRef50_Q31Q92 Putative uncharacterized protein n=2 Tax=Synecho... 94 2e-18 UniRef50_B9M3Z7 Tail Collar domain protein n=1 Tax=Geobacter sp.... 94 2e-18 UniRef50_Q727X4 Tail fiber protein, putative n=4 Tax=Desulfovibr... 94 2e-18 UniRef50_UPI00016A4B89 phage-related tail fiber protein n=2 Tax=... 94 2e-18 UniRef50_B7LN99 Putative tail fiber protein n=2 Tax=Enterobacter... 94 2e-18 UniRef50_Q3ZL14 Tail fiber protein n=2 Tax=Enterobacteriaceae Re... 94 2e-18 UniRef50_D2MH12 Tail Collar domain protein n=1 Tax=Rhodopseudomo... 94 2e-18 UniRef50_A6EAC0 Microcystin-dependent protein n=1 Tax=Pedobacter... 92 4e-18 UniRef50_A9AVC5 Tail Collar domain protein n=1 Tax=Herpetosiphon... 92 6e-18 UniRef50_P10930 Short tail fiber protein n=8 Tax=Myoviridae RepI... 92 6e-18 UniRef50_Q2W7B2 Microcystin-dependent protein n=1 Tax=Magnetospi... 91 1e-17 UniRef50_C3X1Y2 Tail fiber protein gpH n=1 Tax=Oxalobacter formi... 91 1e-17 UniRef50_C5ABB4 Putative uncharacterized protein n=1 Tax=Burkhol... 91 1e-17 UniRef50_B9BDD9 Bacteriophage protein n=3 Tax=Burkholderia RepID... 91 2e-17 UniRef50_C5B185 Putative uncharacterized protein n=1 Tax=Methylo... 91 2e-17 UniRef50_A9ITY4 Phage related protein n=6 Tax=Bartonella RepID=A... 91 2e-17 UniRef50_C5A8Q3 Phage-related tail fiber protein n=1 Tax=Burkhol... 91 2e-17 UniRef50_A9AVE2 Tail Collar domain protein n=1 Tax=Herpetosiphon... 90 3e-17 UniRef50_Q2RUE1 Phage Tail Collar n=5 Tax=Proteobacteria RepID=Q... 89 3e-17 UniRef50_Q4UNP6 Microcystin dependent protein n=8 Tax=Bacteria R... 89 4e-17 UniRef50_C2FWA0 Phage tail collar domain protein n=2 Tax=Sphingo... 89 4e-17 UniRef50_C3X971 Predicted protein n=6 Tax=Oxalobacter formigenes... 89 4e-17 UniRef50_Q2S9H9 Microcystin-dependent protein n=3 Tax=Proteobact... 89 4e-17 UniRef50_A5GA41 Phage Tail Collar domain protein n=1 Tax=Geobact... 89 4e-17 UniRef50_Q8PR98 Microcystin dependent protein n=1 Tax=Xanthomona... 
89 5e-17 UniRef50_Q4TVW2 Tail fiber protein n=4 Tax=Viruses RepID=Q4TVW2_... 89 7e-17 UniRef50_B1KMR6 Tail Collar domain protein n=1 Tax=Shewanella wo... 88 8e-17 UniRef50_C3X3W1 Predicted protein n=2 Tax=Oxalobacter formigenes... 88 8e-17 UniRef50_A7INV5 Tail Collar domain protein n=1 Tax=Xanthobacter ... 87 1e-16 UniRef50_Q2W7B1 Microcystin-dependent protein n=3 Tax=Proteobact... 87 2e-16 UniRef50_C0DSG4 Putative uncharacterized protein n=1 Tax=Eikenel... 87 2e-16 UniRef50_A1HR57 Putative uncharacterized protein n=1 Tax=Thermos... 87 2e-16 UniRef50_C3X909 Predicted protein n=2 Tax=Oxalobacter formigenes... 87 2e-16 UniRef50_B8HZW4 Tail Collar domain protein n=2 Tax=Clostridium R... 87 2e-16 UniRef50_Q4KAW3 Putative uncharacterized protein n=1 Tax=Pseudom... 87 2e-16 UniRef50_B0SX68 Tail Collar domain protein n=6 Tax=Bacteria RepI... 86 3e-16 UniRef50_A6EAB8 Phage Tail Collar n=1 Tax=Pedobacter sp. BAL39 R... 86 4e-16 UniRef50_Q8EKB1 Putative uncharacterized protein n=1 Tax=Shewane... 86 4e-16 UniRef50_A3YFP9 35 kDa protein-like n=1 Tax=Marinomonas sp. MED1... 86 4e-16 UniRef50_B5TAB1 Gp47 n=2 Tax=root RepID=B5TAB1_9CAUD 86 4e-16 UniRef50_C3X192 Predicted protein n=1 Tax=Oxalobacter formigenes... 86 5e-16 UniRef50_Q1QPI5 Phage Tail Collar n=10 Tax=Proteobacteria RepID=... 86 5e-16 UniRef50_Q2T5M0 Phage-related tail fiber protein n=19 Tax=root R... 86 5e-16 UniRef50_B4EF34 Putative phage tail protein n=1 Tax=Burkholderia... 85 8e-16 UniRef50_B3X2T1 Side tail fiber protein n=1 Tax=Shigella dysente... 85 8e-16 UniRef50_C6S6V6 Putative phage tail fibre protein n=1 Tax=Neisse... 85 8e-16 UniRef50_Q4C9U4 Phage Tail Collar n=1 Tax=Crocosphaera watsonii ... 84 1e-15 UniRef50_C7PNC3 Tail Collar domain protein n=1 Tax=Chitinophaga ... 84 1e-15 UniRef50_P51735 Probable tail fiber protein n=27 Tax=root RepID=... 84 2e-15 UniRef50_Q12HS4 Phage Tail Collar n=1 Tax=Shewanella denitrifica... 84 2e-15 UniRef50_C3X3K6 Predicted protein n=2 Tax=Oxalobacter formigenes... 
84 2e-15 UniRef50_C7PNC2 Tail Collar domain protein n=1 Tax=Chitinophaga ... 84 2e-15 UniRef50_B0UTN0 Phage Tail Collar domain protein n=1 Tax=Haemoph... 84 2e-15 UniRef50_UPI00016C4891 hypothetical protein GobsU_00190 n=1 Tax=... 84 2e-15 UniRef50_A9LZ37 Tail fibre protein, putative n=21 Tax=Neisseria ... 84 2e-15 UniRef50_B9M3Z9 Tail Collar domain protein n=2 Tax=Bacteria RepI... 84 2e-15 UniRef50_A9C0W7 Tail Collar domain protein n=6 Tax=Proteobacteri... 84 2e-15 UniRef50_C6X0H3 Microcystin dependent protein n=1 Tax=Flavobacte... 83 2e-15 UniRef50_D2QTE9 Tail Collar domain protein n=1 Tax=Spirosoma lin... 83 3e-15 UniRef50_A1SXZ3 Phage Tail Collar domain protein n=4 Tax=Bacteri... 83 3e-15 UniRef50_Q21FE8 Phage Tail Collar n=9 Tax=Bacteria RepID=Q21FE8_... 82 5e-15 UniRef50_C3X8V5 Bacteriophage tail fiber protein n=2 Tax=Oxaloba... 82 5e-15 UniRef50_A6EAB9 Microcystin-dependent protein n=1 Tax=Pedobacter... 82 6e-15 UniRef50_C3XAA4 Putative uncharacterized protein n=1 Tax=Oxaloba... 82 6e-15 UniRef50_B5JF21 Phage Tail Collar Domain family n=1 Tax=Verrucom... 82 7e-15 UniRef50_Q93D60 PilV n=7 Tax=Escherichia coli RepID=Q93D60_ECOLX 82 8e-15 UniRef50_Q03314 Protein rhiB n=2 Tax=Rhizobium leguminosarum bv.... 81 9e-15 UniRef50_D0LMW0 Tail Collar domain protein n=1 Tax=Haliangium oc... 81 1e-14 UniRef50_C0YLU9 Phage tail collar domain-containing protein n=1 ... 81 1e-14 UniRef50_A3GRX3 Tail fiber like-protein n=1 Tax=Vibrio cholerae ... 81 1e-14 UniRef50_B3PJI6 Microcystin dependent protein; MdpB n=1 Tax=Cell... 81 1e-14 UniRef50_C5RJD9 Tail Collar domain protein n=1 Tax=Clostridium c... 81 2e-14 UniRef50_B3R3K1 Bacteriophage large tail fiber protein n=1 Tax=C... 81 2e-14 UniRef50_C3X8R9 Bacteriophage tail fiber protein n=6 Tax=Oxaloba... 81 2e-14 UniRef50_Q8PR97 Microcystin dependent protein n=1 Tax=Xanthomona... 80 4e-14 UniRef50_C7PNC4 Tail Collar domain protein n=1 Tax=Chitinophaga ... 79 4e-14 UniRef50_Q55EP2 Putative uncharacterized protein n=1 Tax=Dictyos... 
79 4e-14 UniRef50_C6MD19 Tail Collar domain protein n=2 Tax=Proteobacteri... 79 4e-14 UniRef50_B7JYU3 Tail Collar domain protein n=26 Tax=Bacteria Rep... 79 4e-14 UniRef50_C2FWA1 Phage tail collar domain protein n=2 Tax=Sphingo... 79 6e-14 UniRef50_Q2RUD9 Phage Tail Collar n=1 Tax=Rhodospirillum rubrum ... 78 1e-13 UniRef50_C6X0H2 Phage tail collar domain protein n=1 Tax=Flavoba... 77 2e-13 UniRef50_Q4KAW2 Phage tail collar domain protein n=1 Tax=Pseudom... 77 3e-13 UniRef50_C7PE74 Tail Collar domain protein n=3 Tax=Bacteria RepI... 76 4e-13 UniRef50_C9PG79 Putative phage tail protein n=1 Tax=Vibrio furni... 76 4e-13 UniRef50_UPI0001BC923E Phage tail Collar n=1 Tax=Pseudomonas syr... 76 5e-13 UniRef50_B4VMZ3 Phage Tail Collar Domain family n=1 Tax=Microcol... 76 6e-13 UniRef50_C3X3R8 Predicted protein n=4 Tax=Oxalobacter formigenes... 75 8e-13 UniRef50_Q4KAW4 Putative uncharacterized protein n=1 Tax=Pseudom... 74 2e-12 UniRef50_B1J270 Tail Collar domain protein n=8 Tax=Bacteria RepI... 74 2e-12 UniRef50_Q11LT1 Microcystin-dependent protein-like n=1 Tax=Chela... 73 3e-12 UniRef50_Q8EKA9 Putative uncharacterized protein n=1 Tax=Shewane... 73 3e-12 UniRef50_Q1QPI6 Phage Tail Collar n=2 Tax=Proteobacteria RepID=Q... 73 4e-12 UniRef50_C3X3W3 Predicted protein n=1 Tax=Oxalobacter formigenes... 73 4e-12 UniRef50_B5RPA6 Uncharacterized conserved protein n=73 Tax=Borre... 72 5e-12 UniRef50_B1JGT8 Putative uncharacterized protein n=1 Tax=Yersini... 72 6e-12 UniRef50_A8YDB4 Genome sequencing data, contig C291 n=2 Tax=Micr... 72 6e-12 UniRef50_Q73NL1 Tail fiber domain protein n=1 Tax=Treponema dent... 72 8e-12 UniRef50_C6DJW4 Tail Collar domain protein n=2 Tax=Pectobacteriu... 71 1e-11 UniRef50_B3FYL6 Gp17 n=1 Tax=Salmonella phage phiSG-JL2 RepID=B3... 71 1e-11 UniRef50_A1AK23 Phage Tail Collar domain protein n=1 Tax=Pelobac... 71 1e-11 UniRef50_UPI0001A44BB4 microcystin dependent protein n=1 Tax=Pec... 
71 2e-11 UniRef50_C7PCL6 Putative uncharacterized protein n=1 Tax=Chitino... 70 2e-11 UniRef50_C5BRC7 Phage tail collar domain protein n=1 Tax=Teredin... 70 2e-11 UniRef50_B6IWH6 Putative uncharacterized protein n=2 Tax=Bacteri... 70 3e-11 UniRef50_A3YA17 Prophage MuSo2, tail fiber protein, putative n=1... 70 3e-11 UniRef50_B5TK79 Tail collar protein n=2 Tax=root RepID=B5TK79_9VIRU 69 7e-11 UniRef50_A6E583 Phage tail collar domain protein n=2 Tax=Roseova... 68 8e-11 UniRef50_D0KG77 Tail Collar domain protein n=1 Tax=Pectobacteriu... 68 1e-10 UniRef50_B0USC5 Phage Tail Collar domain protein n=1 Tax=Haemoph... 68 1e-10 UniRef50_Q66BF2 Hypothetical phage protein n=1 Tax=Yersinia pseu... 67 2e-10 UniRef50_D1BW55 Tail Collar domain protein n=1 Tax=Xylanimonas c... 67 2e-10 UniRef50_C9QG11 Probable tail fiber protein n=1 Tax=Vibrio orien... 67 3e-10 UniRef50_B9K0L5 Putative uncharacterized protein n=1 Tax=Agrobac... 67 3e-10 UniRef50_A3Y8Q8 Putative uncharacterized protein n=1 Tax=Marinom... 67 3e-10 UniRef50_A9DEL7 Tail fiber protein 2 n=1 Tax=Yersinia phage PY10... 66 3e-10 Sequences not found previously or not previously below threshold: UniRef50_B3PJI8 Conserved domain protein n=1 Tax=Cellvibrio japo... 86 3e-16 UniRef50_A5GA42 Phage Tail Collar domain protein n=2 Tax=Bacteri... 86 3e-16 UniRef50_B1HUA7 Microcystin dependent protein MdpB n=2 Tax=Bacte... 85 9e-16 UniRef50_Q2RUE0 Phage Tail Collar n=1 Tax=Rhodospirillum rubrum ... 81 1e-14 UniRef50_A9C0W8 Tail Collar domain protein n=1 Tax=Delftia acido... 80 3e-14 UniRef50_B4D821 Tail Collar domain protein n=3 Tax=Bacteria RepI... 79 5e-14 UniRef50_B0SX66 Tail Collar domain protein n=13 Tax=Bacteria Rep... 76 6e-13 UniRef50_Q2SHC1 Microcystin-dependent protein n=1 Tax=Hahella ch... 75 1e-12 UniRef50_Q4UNP7 Microcystin dependent protein n=7 Tax=Proteobact... 74 2e-12 UniRef50_B9M3Z8 Tail Collar domain protein n=3 Tax=Proteobacteri... 73 3e-12 UniRef50_C1F4Q4 Phage tail collar domain protein n=1 Tax=Acidoba... 
73 4e-12 UniRef50_B2SIA5 Microcystin dependent protein n=4 Tax=Proteobact... 72 5e-12 UniRef50_C4ZIZ4 Tail Collar domain protein n=4 Tax=Proteobacteri... 70 3e-11 UniRef50_A1SXZ2 Phage Tail Collar domain protein n=2 Tax=Gammapr... 69 4e-11 UniRef50_Q2S9I0 Microcystin-dependent protein n=3 Tax=Bacteria R... 69 4e-11 UniRef50_A5GA43 Phage Tail Collar domain protein n=1 Tax=Geobact... 69 5e-11 UniRef50_A6E584 Microcystin dependent protein, putative n=1 Tax=... 69 8e-11 UniRef50_C5AIB0 Phage Tail Collar n=1 Tax=Burkholderia glumae BG... 67 3e-10 UniRef50_B8CRW5 Phage Tail Collar n=1 Tax=Shewanella piezotolera... 66 4e-10 >UniRef50_P33227 Putative protein stfE (Fragment) n=60 Tax=Enterobacteriaceae RepID=STFE_ECOLI Length = 166 Score = 190 bits (483), Expect = 1e-47, Method: Composition-based stats. Identities = 166/166 (100%), Positives = 166/166 (100%) Query: 1 VGLGEGSALPVGVPVPWPSATPPTGWLKCNGAAFSAEEYPELAKAYPTNKLPDLRGEFIR 60 VGLGEGSALPVGVPVPWPSATPPTGWLKCNGAAFSAEEYPELAKAYPTNKLPDLRGEFIR Sbjct: 1 VGLGEGSALPVGVPVPWPSATPPTGWLKCNGAAFSAEEYPELAKAYPTNKLPDLRGEFIR 60 Query: 61 GWDDGRGIDTGRSILSIQGYATEDHAHGLPSRSTIVTDATINFYFDEIWVNSGTDIIKRG 120 GWDDGRGIDTGRSILSIQGYATEDHAHGLPSRSTIVTDATINFYFDEIWVNSGTDIIKRG Sbjct: 61 GWDDGRGIDTGRSILSIQGYATEDHAHGLPSRSTIVTDATINFYFDEIWVNSGTDIIKRG 120 Query: 121 NTNDAGLPAPDYGTFKTYKQSVDGLGAAASETRPRNIAFNYIVRAA 166 NTNDAGLPAPDYGTFKTYKQSVDGLGAAASETRPRNIAFNYIVRAA Sbjct: 121 NTNDAGLPAPDYGTFKTYKQSVDGLGAAASETRPRNIAFNYIVRAA 166 >UniRef50_C6V0Q3 Phage tail fiber protein n=13 Tax=Enterobacteriaceae RepID=C6V0Q3_ECO5T Length = 439 Score = 181 bits (458), Expect = 1e-44, Method: Composition-based stats. 
Identities = 96/166 (57%), Positives = 113/166 (68%), Gaps = 2/166 (1%) Query: 1 VGLGEGSALPVGVPVPWPSATPPTGWLKCNGAAFSAEEYPELAKAYPTNKLPDLRGEFIR 60 +GLGEGSALPVGVPVPWPSATPPTGWLKCNGAAFSAEEYPELAK YPTNKLPDLRGEFIR Sbjct: 276 LGLGEGSALPVGVPVPWPSATPPTGWLKCNGAAFSAEEYPELAKVYPTNKLPDLRGEFIR 335 Query: 61 GWDDGRGIDTGRSILSIQGYATEDHAHGLPSRSTIVTDATINFYFDEIWVNSGTDIIKRG 120 GWDDGRG+D GR +L++Q A H H ++ D T+ + + T + Sbjct: 336 GWDDGRGVDNGRGLLTLQDGAIVSHNHYWGIWTSRTNDQTLESFTGTTILKQITPLSPAI 395 Query: 121 NTNDAGLPAPDYGTFKTYKQSVDGLGAAASETRPRNIAFNYIVRAA 166 N + P P+ + + A A+ETRPRN+AFNYIVRAA Sbjct: 396 NFD--NYPIPNPAITEGGVVAATTKPAGANETRPRNVAFNYIVRAA 439 >UniRef50_A9R3H4 Tail collar domain protein n=20 Tax=Yersinia RepID=A9R3H4_YERPG Length = 259 Score = 173 bits (439), Expect = 1e-42, Method: Composition-based stats. Identities = 81/166 (48%), Positives = 100/166 (60%), Gaps = 2/166 (1%) Query: 1 VGLGEGSALPVGVPVPWPSATPPTGWLKCNGAAFSAEEYPELAKAYPTNKLPDLRGEFIR 60 +GLGEGSA+ VGVP+PWP+ATPP GWLKCNGA F +YP+LA AYP+ LPDLRGEFIR Sbjct: 96 LGLGEGSAILVGVPLPWPTATPPEGWLKCNGAIFDKVKYPKLALAYPSGILPDLRGEFIR 155 Query: 61 GWDDGRGIDTGRSILSIQGYATEDHAHGLPSRSTIVTDATINFYFDEIWVNSGTDIIKRG 120 GWDDG G+D GR ILSIQG A + + G+ R+ + + ++ G Sbjct: 156 GWDDGLGVDAGREILSIQGDAIRNISGGIQGRNEATSARLFSSNATGVFRTDGQFGSYAA 215 Query: 121 NTNDAGLPAPDYGTFKTYKQSVDGLGAAASETRPRNIAFNYIVRAA 166 + + A D A+E RPRNIAFNYIVRAA Sbjct: 216 SADVAVGVTDD--RLAELFFDASRSVPTANENRPRNIAFNYIVRAA 259 >UniRef50_Q9AHZ5 YdaB n=29 Tax=Enterobacteriaceae RepID=Q9AHZ5_PHOLU Length = 296 Score = 169 bits (427), Expect = 4e-41, Method: Composition-based stats. 
Identities = 69/165 (41%), Positives = 89/165 (53%), Gaps = 5/165 (3%) Query: 1 VGLGEGSALPVGVPVPWPSATPPTGWLKCNGAAFSAEEYPELAKAYPTNKLPDLRGEFIR 60 + + + +PVG P+PWP+A PP GWL+CNGA F ++PELAKAYP+ LPDLRGEFIR Sbjct: 136 INSSKTNDIPVGTPIPWPTAIPPVGWLQCNGAVFDKSKFPELAKAYPSGYLPDLRGEFIR 195 Query: 61 GWDDGRGIDTGRSILSIQGYATEDHAHGLPSRSTIVTDATINFYFDEIWVNSGTDIIKRG 120 GWD+GRG+D GR + QG A + P + D + + I G Sbjct: 196 GWDNGRGVDPGRVCSTWQGDAIRNITGSFP---GAIADNYHLATKEAFYGKINLGIATDG 252 Query: 121 NTNDAGLPAPDYGTFKTYKQSVDGLGAAASETRPRNIAFNYIVRA 165 T + PD + + TRPRNIAFNYIVRA Sbjct: 253 TTKSKNIHNPDN--PYGFGFDASRVVPVPQRTRPRNIAFNYIVRA 295 >UniRef50_B3I9S3 DNA inversion product n=5 Tax=Escherichia coli RepID=B3I9S3_ECOLX Length = 546 Score = 167 bits (422), Expect = 2e-40, Method: Composition-based stats. Identities = 97/169 (57%), Positives = 116/169 (68%), Gaps = 3/169 (1%) Query: 1 VGLGEGSALPVGVPVPWPSATPPTGWLKCNGAAFSAEEYPELAKAYPTNKLPDLRGEFIR 60 +GLGEGSALPVGVPVPWPSATPPTGWLKCNGAAFS EEYPELAKAYPTNKLPDLRGEFIR Sbjct: 378 LGLGEGSALPVGVPVPWPSATPPTGWLKCNGAAFSVEEYPELAKAYPTNKLPDLRGEFIR 437 Query: 61 GWDDGRGIDTGRSILSIQGYATEDHAHGL--PSRSTIVTDATINFYFDEIWVNSGTDIIK 118 GWDDGRGIDTGR++L+ Q + DHAH + + + + I G I Sbjct: 438 GWDDGRGIDTGRALLNWQPHTILDHAHYMELWTGDGLAAGSAREGVNPGILATYGDGGIV 497 Query: 119 RGNTNDAGLPAP-DYGTFKTYKQSVDGLGAAASETRPRNIAFNYIVRAA 166 + + +P+ + ++ K+ + +ETRPRNIAFNYIVRAA Sbjct: 498 KTDEPGLKVPSSLRAISSRSVKRYGEISENVGTETRPRNIAFNYIVRAA 546 >UniRef50_D2TR91 Putative phage tail fibre protein n=1 Tax=Citrobacter rodentium ICC168 RepID=D2TR91_CITRO Length = 279 Score = 165 bits (418), Expect = 4e-40, Method: Composition-based stats. 
Identities = 86/166 (51%), Positives = 97/166 (58%), Gaps = 21/166 (12%) Query: 1 VGLGEGSALPVGVPVPWPSATPPTGWLKCNGAAFSAEEYPELAKAYPTNKLPDLRGEFIR 60 +GLGEGSALPVGVPVPWPSATPP GWLKCNGA FS+ YP+L AYP+ KLPDLRGEFIR Sbjct: 135 LGLGEGSALPVGVPVPWPSATPPEGWLKCNGATFSSSLYPKLGLAYPSGKLPDLRGEFIR 194 Query: 61 GWDDGRGIDTGRSILSIQGYATEDHAHGLPSRSTIVTDATINFYFDEIWVNSGTDIIKRG 120 GWDDGRG D GRS+LS QG A H+H + Sbjct: 195 GWDDGRGADNGRSLLSSQGDAFRSHSHNFDRSW---------------------GLENFD 233 Query: 121 NTNDAGLPAPDYGTFKTYKQSVDGLGAAASETRPRNIAFNYIVRAA 166 T + D + + + SETRPRNIAFNYIVRAA Sbjct: 234 ATAGYDVVTADINGKIVNQPTRSTVSVGGSETRPRNIAFNYIVRAA 279 >UniRef50_Q9MCR6 Tail fiber n=1 Tax=Enterobacteria phage HK97 RepID=Q9MCR6_BPHK7 Length = 321 Score = 164 bits (416), Expect = 7e-40, Method: Composition-based stats. Identities = 99/165 (60%), Positives = 107/165 (64%), Gaps = 3/165 (1%) Query: 1 VGLGEGSALPVGVPVPWPSATPPTGWLKCNGAAFSAEEYPELAKAYPTNKLPDLRGEFIR 60 +GLGEGSALPVGVPVPWPSATPPTGWLKCNGA FSAEEYPELAKAYPTNKLPDLRGEFIR Sbjct: 159 LGLGEGSALPVGVPVPWPSATPPTGWLKCNGAVFSAEEYPELAKAYPTNKLPDLRGEFIR 218 Query: 61 GWDDGRGIDTGRSILSIQGYATEDHAHGLPSRSTIVTDATINFYFDEIWVNSGTDIIKRG 120 GWDDGRGID GR ILS QG A + T V + + D ++V G Sbjct: 219 GWDDGRGIDAGREILSAQGDAIRNITGTFGDGETEVNASISFYRADGVFVTQKKLRNTIG 278 Query: 121 NTNDAGLPAPDYGTFKTYKQSVDGLGAAASETRPRNIAFNYIVRA 165 NT + A + ASE RPRNIAFNYIVRA Sbjct: 279 NTT---IIADTPNNPYLINFDASRVVPTASENRPRNIAFNYIVRA 320 >UniRef50_D2U1K0 Phage tail protein n=1 Tax=Arsenophonus nasoniae RepID=D2U1K0_9ENTR Length = 366 Score = 162 bits (410), Expect = 4e-39, Method: Composition-based stats. 
Identities = 64/160 (40%), Positives = 80/160 (50%), Gaps = 9/160 (5%) Query: 7 SALPVGVPVPWPSATPPTGWLKCNGAAFSAEEYPELAKAYPTNKLPDLRGEFIRGWDDGR 66 + PVG P+PWP ATPP G+L CNG F + P+L AYP+ KLPDLRG FIRGWD G+ Sbjct: 216 NNYPVGAPIPWPQATPPKGYLICNGEPFDKVKCPKLLIAYPSGKLPDLRGYFIRGWDAGK 275 Query: 67 GIDTGRSILSIQGYATEDHAHGLPSRSTIVTDATINFYFDEIWVNSGTDIIKRGNTNDAG 126 G+D GR + S Q A + + A + + I N A Sbjct: 276 GVDPGREVFSYQEDAIRNITGRI-------GFARRGGAEPPVSADGAFVITDWCNVRVAD 328 Query: 127 LPAPDYGTFKTYKQSVDGLGAAASETRPRNIAFNYIVRAA 166 D+G + A+E RPRNIAFNYIVR A Sbjct: 329 GANDDWGGV--ASFDPSRVVPTANENRPRNIAFNYIVREA 366 >UniRef50_Q6D3Y6 Probable bacteriophage variable tail fiber protein H n=2 Tax=Pectobacterium atrosepticum RepID=Q6D3Y6_ERWCT Length = 536 Score = 161 bits (406), Expect = 1e-38, Method: Composition-based stats. Identities = 72/159 (45%), Positives = 86/159 (54%), Gaps = 6/159 (3%) Query: 8 ALPVGVPVPWPSATPPTGWLKCNGAAFSAEEYPELAKAYPTNKLPDLRGEFIRGWDDGRG 67 AL G+P PWP AT P GWLKCNG +F +P LA AYP+ LPDLRGEFIRGWDDGRG Sbjct: 384 ALTAGMPKPWPRATAPAGWLKCNGQSFDISAFPHLAAAYPSGVLPDLRGEFIRGWDDGRG 443 Query: 68 IDTGRSILSIQGYATEDHAHGLPSRSTIVTDATINFYFDEIWVNSGTDIIKRGNTNDAGL 127 +D+GRS+LS Q A + + T A + E +SG + A Sbjct: 444 VDSGRSLLSAQSDAIRNIVGEI------WTSAVSQQFLGETLSSSGVFELLYEFAVGAIP 497 Query: 128 PAPDYGTFKTYKQSVDGLGAAASETRPRNIAFNYIVRAA 166 A + A+E RPRNIAFNYIVRAA Sbjct: 498 DAAGNSCPSRMRFDASRAVPTAAENRPRNIAFNYIVRAA 536 >UniRef50_D2U2G6 Phage tail assembly protein n=1 Tax=Arsenophonus nasoniae RepID=D2U2G6_9ENTR Length = 580 Score = 160 bits (404), Expect = 2e-38, Method: Composition-based stats. 
Identities = 64/160 (40%), Positives = 85/160 (53%), Gaps = 8/160 (5%) Query: 7 SALPVGVPVPWPSATPPTGWLKCNGAAFSAEEYPELAKAYPTNKLPDLRGEFIRGWDDGR 66 + PVG P+PWP ATPP G+ C+G F +YP+LA AYP+ KLP L GEFIRG D GR Sbjct: 429 NNYPVGAPIPWPQATPPNGYFVCDGNYFDKAKYPQLALAYPSGKLPLLYGEFIRGLDLGR 488 Query: 67 GIDTGRSILSIQGYATEDHAHGLPSRSTIVTDATINFYFDEIWVNSGTDIIKRGNTNDAG 126 +D GR++LS QG A + + T+ + +S N N A Sbjct: 489 KVDPGRTVLSNQGDAIRNITGRIGYARHGGTEPPVVNGEGVFRRDSNH------NVNIAN 542 Query: 127 LPAPDYGTFKTYKQSVDGLGAAASETRPRNIAFNYIVRAA 166 D+G+ + + A+E RPRN+AF YIVRAA Sbjct: 543 GRGDDWGSVM--SFNASRVVPTANENRPRNVAFLYIVRAA 580 >UniRef50_D2BT14 Tail Collar domain protein n=1 Tax=Dickeya dadantii Ech586 RepID=D2BT14_DICD5 Length = 534 Score = 157 bits (398), Expect = 8e-38, Method: Composition-based stats. Identities = 67/156 (42%), Positives = 88/156 (56%), Gaps = 10/156 (6%) Query: 11 VGVPVPWPSATPPTGWLKCNGAAFSAEEYPELAKAYPTNKLPDLRGEFIRGWDDGRGIDT 70 VG+P+P+P AT P GWLKCNG +F+ +P LA+ YP+ LPDLRGEFIRGWDD RG+D Sbjct: 389 VGIPLPYPGATAPDGWLKCNGQSFNKAAFPLLAQRYPSGFLPDLRGEFIRGWDDSRGVDP 448 Query: 71 GRSILSIQGYATEDHAHGLPSRSTIVTDATINFYFDEIWVNSGTDIIKRGNTNDAGLPAP 130 GR +LS Q H+HG+ D + +++ + G+ + DA Sbjct: 449 GRGLLSFQESQNLTHSHGVN-------DPGHSHPYNKYEGSVGSGLAGFDYDQDAWNATV 501 Query: 131 DYGTFKTYKQSVDGLGAAASETRPRNIAFNYIVRAA 166 G T + + E RPRNIAFNYIVRAA Sbjct: 502 YTGHVGTG---ISIAASGGHEARPRNIAFNYIVRAA 534 >UniRef50_A4WEL3 Phage Tail Collar domain protein n=2 Tax=Enterobacteriaceae RepID=A4WEL3_ENT38 Length = 340 Score = 157 bits (397), Expect = 1e-37, Method: Composition-based stats. 
Identities = 69/160 (43%), Positives = 84/160 (52%), Gaps = 7/160 (4%) Query: 7 SALPVGVPVPWPSATPPTGWLKCNGAAFSAEEYPELAKAYPTNKLPDLRGEFIRGWDDGR 66 + LPVG P+PWP ATPP GWLKCNGA F +YP+LA AYP+ LPDLRGEFIRGWDDGR Sbjct: 188 NYLPVGFPLPWPQATPPQGWLKCNGAPFDKVKYPKLAVAYPSGLLPDLRGEFIRGWDDGR 247 Query: 67 GIDTGRSILSIQGYATEDHAHGLPSRSTIVTDATINFYFDEIWVNSGTDIIKRGNTNDAG 126 G+D+GR L+ QG A + + + + + Sbjct: 248 GVDSGRVALTTQGDAVQKMTGAAS-------NGAATGFVNNSTSRVSGVFKRGSVIYPNT 300 Query: 127 LPAPDYGTFKTYKQSVDGLGAAASETRPRNIAFNYIVRAA 166 + +A ETRPRNIAFNYIVRAA Sbjct: 301 SAQNADYQGVDLVFDSSLMVRSAEETRPRNIAFNYIVRAA 340 >UniRef50_D0KLI8 Tail Collar domain protein n=1 Tax=Pectobacterium wasabiae WPP163 RepID=D0KLI8_PECWW Length = 621 Score = 156 bits (395), Expect = 2e-37, Method: Composition-based stats. Identities = 65/166 (39%), Positives = 84/166 (50%), Gaps = 5/166 (3%) Query: 1 VGLGEGSALPVGVPVPWPSATPPTGWLKCNGAAFSAEEYPELAKAYPTNKLPDLRGEFIR 60 +G + L VG+P +P A P GWLKCNG F +YP LA YP+ LPDLRGEF+R Sbjct: 461 IGAMPSAEL-VGMPQVFPGAVAPAGWLKCNGQQFDTAQYPILASRYPSGFLPDLRGEFVR 519 Query: 61 GWDDGRGIDTGRSILSIQGYATEDHAHGLPSRSTIVTDATINFYFDEIWVNSGTDIIKRG 120 GWDD RG+D GR++LS QG A + + + V + K Sbjct: 520 GWDDERGVDAGRALLSEQGDAIRNITGTMRASDVPYGHTQFVDALKADGVFAPIAGDKSW 579 Query: 121 NTNDAGLPAPDYGTFKTYKQSVDGLGAAASETRPRNIAFNYIVRAA 166 + +G +G + A+E RPRNIAFNYIVRAA Sbjct: 580 TGDSSGNAGNPWG----VSFDTSRVVPTANENRPRNIAFNYIVRAA 621 >UniRef50_C6DE08 Tail Collar domain protein n=1 Tax=Pectobacterium carotovorum subsp. carotovorum PC1 RepID=C6DE08_PECCP Length = 682 Score = 156 bits (394), Expect = 3e-37, Method: Composition-based stats. 
Identities = 72/156 (46%), Positives = 89/156 (57%), Gaps = 6/156 (3%) Query: 11 VGVPVPWPSATPPTGWLKCNGAAFSAEEYPELAKAYPTNKLPDLRGEFIRGWDDGRGIDT 70 VG+P+PWP T P+GWLKCNG F YP+LA+ YP LPDLRGEFIRGWDD RG+DT Sbjct: 533 VGMPMPWPQTTAPSGWLKCNGQTFDKNIYPKLAQIYPAGILPDLRGEFIRGWDDSRGVDT 592 Query: 71 GRSILSIQGYATEDHAHGLPSRSTIVTDATINFYFDEIWVNSGTDIIKRGNTNDAGLPAP 130 GR++LS QG A + + T A + E +++G + + T A A Sbjct: 593 GRTLLSTQGDAIRNIVGEI------WTTAANYQFLGENLLSNGAFELFKEFTVGAIPDAA 646 Query: 131 DYGTFKTYKQSVDGLGAAASETRPRNIAFNYIVRAA 166 K + ASE RPRNIAFNYIVRAA Sbjct: 647 GNSCPSRMKFDASRIVPTASENRPRNIAFNYIVRAA 682 >UniRef50_C6CGA0 Tail Collar domain protein n=5 Tax=Enterobacteriaceae RepID=C6CGA0_DICZE Length = 166 Score = 156 bits (393), Expect = 3e-37, Method: Composition-based stats. Identities = 66/156 (42%), Positives = 85/156 (54%), Gaps = 12/156 (7%) Query: 11 VGVPVPWPSATPPTGWLKCNGAAFSAEEYPELAKAYPTNKLPDLRGEFIRGWDDGRGIDT 70 VG+P+PWP ATPP GWLKCNG AF +P+LA+ YP+ LPDLRGEFIRGWDDGRG+D+ Sbjct: 23 VGIPLPWPQATPPAGWLKCNGQAFDKNAFPKLAQVYPSGVLPDLRGEFIRGWDDGRGVDS 82 Query: 71 GRSILSIQGYATEDHAHGLPSRSTIVTDATINFYFDEIWVNSGTDIIKRGNTNDAGLPAP 130 R++LS QG A + ++ + SG + + Sbjct: 83 NRNLLSSQGDAIRNIT------------GFVSGVYVGFDGYSGAFYDTGSRNSISPGSTI 130 Query: 131 DYGTFKTYKQSVDGLGAAASETRPRNIAFNYIVRAA 166 + + A+E RPRNIAFNYIVRAA Sbjct: 131 VAQLNDDFAFDASRVVPTANENRPRNIAFNYIVRAA 166 >UniRef50_Q7N348 Similarities with tail fiber protein n=1 Tax=Photorhabdus luminescens subsp. laumondii RepID=Q7N348_PHOLL Length = 440 Score = 156 bits (393), Expect = 3e-37, Method: Composition-based stats. 
Identities = 62/162 (38%), Positives = 80/162 (49%), Gaps = 13/162 (8%) Query: 4 GEGSALPVGVPVPWPSATPPTGWLKCNGAAFSAEEYPELAKAYPTNKLPDLRGEFIRGWD 63 +P GVP+P+P P G+L CNG F YP+LA+AYP ++PDLRGEFIRGWD Sbjct: 291 ARYDNVPAGVPMPYPHRYTPPGYLTCNGQTFDKSLYPKLAEAYPAGRVPDLRGEFIRGWD 350 Query: 64 DGRGIDTGRSILSIQGYATEDHAHGLPSRSTIVTDATINFYFDEIWVNSGTDIIKRGNTN 123 D RG+D GR + Q DH H + +V D + + + + + N Sbjct: 351 DSRGVDPGRVCGTWQADCIPDHNHYKVASKQLVEDLVLTGDAGWYTSSGSSTRTRSLDQN 410 Query: 124 DAGLPAPDYGTFKTYKQSVDGLGAAASETRPRNIAFNYIVRA 165 V A+ETRPRNIAFNYIVRA Sbjct: 411 TYTGG-------------VTEAQVIANETRPRNIAFNYIVRA 439 >UniRef50_B2PZV1 Putative uncharacterized protein n=2 Tax=Gammaproteobacteria RepID=B2PZV1_PROST Length = 526 Score = 155 bits (392), Expect = 4e-37, Method: Composition-based stats. Identities = 76/165 (46%), Positives = 93/165 (56%), Gaps = 7/165 (4%) Query: 2 GLGEGSALPVGVPVPWPSATPPTGWLKCNGAAFSAEEYPELAKAYPTNKLPDLRGEFIRG 61 G E S PVG P+PWP AT P+G+L CNG AF+ YP L KAYP+ KLPDLRGEFIRG Sbjct: 369 GSSELSDCPVGAPIPWPQATAPSGYLICNGQAFNKTTYPLLTKAYPSGKLPDLRGEFIRG 428 Query: 62 WDDGRGIDTGRSILSIQGYATEDHAHGLPSRSTIVTDATINFYFDEIWVNSGTDIIKRGN 121 D GR ID GR +LS Q ATE H H + +A+ V +G + Sbjct: 429 LDAGRNIDNGRVVLSFQRCATEHHKH-----ISGWGEASNANAIFGKTVKNGYVGSASTD 483 Query: 122 TNDAGLPAPDYGTFKTYKQSVDGLGAAASETRPRNIAFNYIVRAA 166 ++ D F+ + G+ A+ETRPRNIAF YIVRAA Sbjct: 484 RDNYLFYTNDGSEFQGSNPNSTGI--MANETRPRNIAFLYIVRAA 526 >UniRef50_B7MJL6 Putative phage tail protein n=3 Tax=Escherichia RepID=B7MJL6_ECO45 Length = 247 Score = 154 bits (388), Expect = 1e-36, Method: Composition-based stats. 
Identities = 95/166 (57%), Positives = 109/166 (65%), Gaps = 15/166 (9%) Query: 1 VGLGEGSALPVGVPVPWPSATPPTGWLKCNGAAFSAEEYPELAKAYPTNKLPDLRGEFIR 60 +GLGEGSALPVGVPVPWPSATPPTGWLKCNGAAFSAEEYPELAKAYPTNKLPDLRGEFIR Sbjct: 97 LGLGEGSALPVGVPVPWPSATPPTGWLKCNGAAFSAEEYPELAKAYPTNKLPDLRGEFIR 156 Query: 61 GWDDGRGIDTGRSILSIQGYATEDHAHGLPSRSTIVTDATINFYFDEIWVNSGTDIIKRG 120 GWDDGRG+D+ R++LS Q L S ++ + F D + + S + I Sbjct: 157 GWDDGRGVDSRRAVLSTQEPTVGTFYVELAIISGTLSGSGAKFT-DSVGIGSTSSNITVS 215 Query: 121 NTNDAGLPAPDYGTFKTYKQSVDGLGAAASETRPRNIAFNYIVRAA 166 N ND + + +TRPRNIAFNYIVRAA Sbjct: 216 NGNDQSVSG--------------TVAVNPVDTRPRNIAFNYIVRAA 247 >UniRef50_Q7N5C0 Similarities with DNA inversion product n=5 Tax=Photorhabdus RepID=Q7N5C0_PHOLL Length = 239 Score = 152 bits (384), Expect = 4e-36, Method: Composition-based stats. Identities = 65/159 (40%), Positives = 86/159 (54%), Gaps = 19/159 (11%) Query: 7 SALPVGVPVPWPSATPPTGWLKCNGAAFSAEEYPELAKAYPTNKLPDLRGEFIRGWDDGR 66 S++PVG P+PWP + PP+G+ CNG+AFS +YP+LA+AYP ++PDLRGEFIRGWDDGR Sbjct: 99 SSIPVGSPIPWPLSHPPSGYFTCNGSAFSRSQYPKLAEAYPDGRIPDLRGEFIRGWDDGR 158 Query: 67 GIDTGRSILSIQGYATEDHAHGLPSRSTIVTDATINFYFDEIWVNSGTDIIKRGNTNDAG 126 G+D+GR ILS Q T+ ++ D G D+++ Sbjct: 159 GVDSGRVILSAQTDNTKRIQLT-KGLPDGQFLSSYQGPVDRYQFPLGRDVLESATVTSIA 217 Query: 127 LPAPDYGTFKTYKQSVDGLGAAASETRPRNIAFNYIVRA 165 ETRPRNIAFNYIV+A Sbjct: 218 N------------------NTGGHETRPRNIAFNYIVKA 238 >UniRef50_UPI000190EC42 bacteriophage tail fiber protein n=1 Tax=Salmonella enterica subsp. enterica serovar Typhi str. E01-6750 RepID=UPI000190EC42 Length = 317 Score = 152 bits (383), Expect = 4e-36, Method: Composition-based stats. 
Identities = 90/166 (54%), Positives = 100/166 (60%), Gaps = 11/166 (6%) Query: 1 VGLGEGSALPVGVPVPWPSATPPTGWLKCNGAAFSAEEYPELAKAYPTNKLPDLRGEFIR 60 +GLGEGSALPVGVPVPWPSAT P GWLKCNGAAFS+E YP+LAKAYPTNKLPDLRGEFIR Sbjct: 163 LGLGEGSALPVGVPVPWPSATLPEGWLKCNGAAFSSEMYPKLAKAYPTNKLPDLRGEFIR 222 Query: 61 GWDDGRGIDTGRSILSIQGYATEDHAHGLPSRSTIVTDATINFYFDEIWVNSGTDIIKRG 120 GWDDGRGID GR ILS Q + +T + D + N I + Sbjct: 223 GWDDGRGIDAGREILSFQEGTIVSGFDDNDTGDISSLSSTQYGFGDTLSSNQWGAINGKK 282 Query: 121 NTNDAGLPAPDYGTFKTYKQSVDGLGAAASETRPRNIAFNYIVRAA 166 DA + Y + RPRNIAFNYIVRAA Sbjct: 283 WIFDASSKGAQKYDWWAYVSA-----------RPRNIAFNYIVRAA 317 >UniRef50_C6CP84 Tail Collar domain protein n=2 Tax=Dickeya RepID=C6CP84_DICZE Length = 646 Score = 151 bits (380), Expect = 1e-35, Method: Composition-based stats. Identities = 64/156 (41%), Positives = 76/156 (48%), Gaps = 11/156 (7%) Query: 11 VGVPVPWPSATPPTGWLKCNGAAFSAEEYPELAKAYPTNKLPDLRGEFIRGWDDGRGIDT 70 G+P+PWP AT PTGWLKCNG +F + YP LA+ YP+ LPDLRGEFIRGWDDGRG+D Sbjct: 502 AGIPLPWPQATAPTGWLKCNGQSFDKKLYPRLAQVYPSGVLPDLRGEFIRGWDDGRGVDN 561 Query: 71 GRSILSIQGYATEDHAHGLPSRSTIVTDATINFYFDEIWVNSGTDIIKRGNTNDAGLPAP 130 R +LS QG + VT + Sbjct: 562 NRGLLSSQGDTIRNIVASFVMDDQAVTINAPT-----------GAMFPSSQIAYDANSNV 610 Query: 131 DYGTFKTYKQSVDGLGAAASETRPRNIAFNYIVRAA 166 + A+E RPRNIAFNYIVRAA Sbjct: 611 GGTMGFNVVFDASRVVPTANENRPRNIAFNYIVRAA 646 >UniRef50_Q7N2Q1 Similar to Sc/SvQ protein of Escherichia coli plasmid p15B n=1 Tax=Photorhabdus luminescens subsp. laumondii RepID=Q7N2Q1_PHOLL Length = 478 Score = 150 bits (379), Expect = 1e-35, Method: Composition-based stats. 
Identities = 63/159 (39%), Positives = 79/159 (49%), Gaps = 14/159 (8%) Query: 8 ALPVGVPVPWPSATPPTGWLKCNGAAFSAEEYPELAKAYPTNKLPDLRGEFIRGWDDGRG 67 + VG P+PWP P G+L CNG +F+ YP+LA AYP+ LPDLRGEFIRGWDDGRG Sbjct: 334 NISVGSPIPWPLPNVPAGYLACNGQSFNKSLYPQLAIAYPSGVLPDLRGEFIRGWDDGRG 393 Query: 68 IDTGRSILSIQGYATEDHAHGLPSRSTIVTDATINFYFDEIWVNSGTDIIKRGNTNDAGL 127 +D GR +L+ QG A + P ++ + G + Sbjct: 394 VDRGRGVLTHQGDAIRNITGYTPGTILRGNNSYGGCFSLSGEKAPGNEYTDVWQKQ---- 449 Query: 128 PAPDYGTFKTYKQSVDGLGAAASETRPRNIAFNYIVRAA 166 + ASE RPRNIAFNYIVRAA Sbjct: 450 ----------VLFDASRVVPVASENRPRNIAFNYIVRAA 478 >UniRef50_C7BSQ1 Side tail fiber protein homolog from lambdoid prophage e14 n=3 Tax=Photorhabdus RepID=C7BSQ1_PHOAA Length = 166 Score = 150 bits (378), Expect = 2e-35, Method: Composition-based stats. Identities = 69/166 (41%), Positives = 88/166 (53%) Query: 1 VGLGEGSALPVGVPVPWPSATPPTGWLKCNGAAFSAEEYPELAKAYPTNKLPDLRGEFIR 60 + + +PVG+P+PWP+ PP GW+KCNGA F YP+LA AYP+ LPDLRGEFIR Sbjct: 1 MSISILEEIPVGIPLPWPTDIPPYGWVKCNGAIFDKYLYPKLAVAYPSGNLPDLRGEFIR 60 Query: 61 GWDDGRGIDTGRSILSIQGYATEDHAHGLPSRSTIVTDATINFYFDEIWVNSGTDIIKRG 120 GWDDGRG+D GR +LS Q H+H + + +NS + G Sbjct: 61 GWDDGRGVDIGRYVLSTQLADIAPHSHRIGRMWSNSNAGAEGLGTPSRILNSVYQGVNYG 120 Query: 121 NTNDAGLPAPDYGTFKTYKQSVDGLGAAASETRPRNIAFNYIVRAA 166 A G+ + ETRPRN+AFNYIVRAA Sbjct: 121 IDTRGLGIAIGMGSGGFGYMDNAVAASTGIETRPRNVAFNYIVRAA 166 >UniRef50_C7BL21 Similarities with phage tail fiber protein n=1 Tax=Photorhabdus asymbiotica subsp. asymbiotica ATCC 43949 RepID=C7BL21_PHOAA Length = 452 Score = 150 bits (378), Expect = 2e-35, Method: Composition-based stats. 
Identities = 60/165 (36%), Positives = 81/165 (49%), Gaps = 8/165 (4%) Query: 2 GLGEGSALPVGVPVPWPSATPPTGWLKCNGAAFSAEEYPELAKAYPTNKLPDLRGEFIRG 61 + PVG P+P+P P G+L CNG F YP+LA+AYP+ ++PDLRGEFIRG Sbjct: 296 AILPPEQHPVGAPIPYPHRYTPVGYLTCNGQTFDKSLYPKLAEAYPSGRVPDLRGEFIRG 355 Query: 62 WDDGRGIDTGRSILSIQGYATEDHAHGLPSRSTIVTDATINFYFDEIWVNSGTDIIKRGN 121 WDD RG+D GR S Q + H H + + + K+ Sbjct: 356 WDDSRGVDPGRVCGSWQDSDNKAHIH--------DDEFCYGGGDAGGDSGTMSAFAKKYC 407 Query: 122 TNDAGLPAPDYGTFKTYKQSVDGLGAAASETRPRNIAFNYIVRAA 166 T G+ + + L + +E RPRN+AFNYIVRAA Sbjct: 408 TPKDGVNGRPTSGWLPASAGLHSLPSGGNEARPRNVAFNYIVRAA 452 >UniRef50_C4UEH4 Variable tail fiber protein n=3 Tax=Yersinia RepID=C4UEH4_YERAL Length = 387 Score = 150 bits (378), Expect = 2e-35, Method: Composition-based stats. Identities = 69/157 (43%), Positives = 91/157 (57%), Gaps = 7/157 (4%) Query: 9 LPVGVPVPWPSATPPTGWLKCNGAAFSAEEYPELAKAYPTNKLPDLRGEFIRGWDDGRGI 68 P+G+P+P+P TPP G+LKCNGAAF YP LA YPT+KLPDLRGEFIRG+DDGRGI Sbjct: 237 TPIGIPLPYPGTTPPAGYLKCNGAAFYPYRYPTLATLYPTHKLPDLRGEFIRGFDDGRGI 296 Query: 69 DTGRSILSIQGYATEDHAHGLPSRSTIVTDATINFYFDEIWVNSGTDIIKRGNTNDAGLP 128 DT R++LS Q A ++ G+ S + A + + +G ND Sbjct: 297 DTSRTLLSAQTDALQNITGGINGVSESLGIAAESNF-------TGAFAKAESVGNDNTPH 349 Query: 129 APDYGTFKTYKQSVDGLGAAASETRPRNIAFNYIVRA 165 D ++ + A+ETRPRNI+F YI+RA Sbjct: 350 HTDITHCGSFDFDASRVVRTAAETRPRNISFCYILRA 386 >UniRef50_D2BYH6 Tail Collar domain protein n=1 Tax=Dickeya dadantii Ech586 RepID=D2BYH6_DICD5 Length = 198 Score = 149 bits (376), Expect = 3e-35, Method: Composition-based stats. 
Identities = 68/155 (43%), Positives = 83/155 (53%), Gaps = 16/155 (10%) Query: 11 VGVPVPWPSATPPTGWLKCNGAAFSAEEYPELAKAYPTNKLPDLRGEFIRGWDDGRGIDT 70 VG+P WP A P GWLKCNG AF +YP+LAK YP LPDLRGEFIRGWDDGRG+DT Sbjct: 59 VGIPQAWPLADAPEGWLKCNGQAFDKTKYPQLAKLYPAGTLPDLRGEFIRGWDDGRGVDT 118 Query: 71 GRSILSIQGYATEDHAHGLPSRSTIVTDATINFYFDEIWVNSGTDIIKRGNTNDAGLPAP 130 R ILS Q E H H +P + + Y ++ + D + G + L Sbjct: 119 NRQILSAQSGMLESHNHMMPVSDPSKWNGAVYGYANDQPSANIEDFSQSGVSTSREL--- 175 Query: 131 DYGTFKTYKQSVDGLGAAASETRPRNIAFNYIVRA 165 +ETRPRNIAF+YIV+A Sbjct: 176 -------------TSLTGGNETRPRNIAFSYIVKA 197 >UniRef50_Q6D2U8 Bacteriophage tail fiber protein n=1 Tax=Pectobacterium atrosepticum RepID=Q6D2U8_ERWCT Length = 619 Score = 149 bits (375), Expect = 4e-35, Method: Composition-based stats. Identities = 63/166 (37%), Positives = 85/166 (51%), Gaps = 1/166 (0%) Query: 1 VGLGEGSALPVGVPVPWPSATPPTGWLKCNGAAFSAEEYPELAKAYPTNKLPDLRGEFIR 60 +G S L G+P+P+P A P G+LKCNG F ++P LA YP+ LPDLRGEF+R Sbjct: 455 IGALPTSEL-AGIPLPFPGAVAPAGYLKCNGQQFDTAQFPVLASRYPSGFLPDLRGEFVR 513 Query: 61 GWDDGRGIDTGRSILSIQGYATEDHAHGLPSRSTIVTDATINFYFDEIWVNSGTDIIKRG 120 GWDDGRGIDT R+++S QG A + L + ++ Sbjct: 514 GWDDGRGIDTVRALMSAQGDAIRNIVGSLFYGYDADVPVLNTNSSSGALYYEMSTALRDT 573 Query: 121 NTNDAGLPAPDYGTFKTYKQSVDGLGAAASETRPRNIAFNYIVRAA 166 + + + + K + A+E RPRNIAFNYIVRAA Sbjct: 574 ESLLSLVTDSVANNWYPAKLDASRVVPTATENRPRNIAFNYIVRAA 619 >UniRef50_C6DA10 Tail Collar domain protein n=7 Tax=Enterobacteriaceae RepID=C6DA10_PECCP Length = 689 Score = 148 bits (374), Expect = 6e-35, Method: Composition-based stats. 
Identities = 65/166 (39%), Positives = 83/166 (50%), Gaps = 13/166 (7%) Query: 1 VGLGEGSALPVGVPVPWPSATPPTGWLKCNGAAFSAEEYPELAKAYPTNKLPDLRGEFIR 60 +G S L G+P+P+P A PTGWLKCNG +F +YP LA YP+ LPDLRGEF+R Sbjct: 537 IGAMPASEL-AGIPLPFPGAVAPTGWLKCNGQSFDKSQYPILASRYPSGVLPDLRGEFVR 595 Query: 61 GWDDGRGIDTGRSILSIQGYATEDHAHGLPSRSTIVTDATINFYFDEIWVNSGTDIIKRG 120 GWDDGRG D R++LS QG A + + + D + + Sbjct: 596 GWDDGRGADASRALLSAQGDAIRNIVGTIGQLN------------DRVNTTETAGVFDAN 643 Query: 121 NTNDAGLPAPDYGTFKTYKQSVDGLGAAASETRPRNIAFNYIVRAA 166 A + + A+E RPRNIAFNYIVRAA Sbjct: 644 KYTGAHSGLTGGNGGRIATFDASKVVPTAAENRPRNIAFNYIVRAA 689 >UniRef50_UPI0001B5347E putative variable tail fiber protein n=1 Tax=Shigella sp. D9 RepID=UPI0001B5347E Length = 550 Score = 147 bits (371), Expect = 1e-34, Method: Composition-based stats. Identities = 94/169 (55%), Positives = 107/169 (63%), Gaps = 3/169 (1%) Query: 1 VGLGEGSALPVGVPVPWPSATPPTGWLKCNGAAFSAEEYPELAKAYPTNKLPDLRGEFIR 60 +GLG+GSALPVGVPVPWPSATPPTGWLKCNGAAFSAEEYP+LAK YPTNKLPDLRGEFIR Sbjct: 382 LGLGDGSALPVGVPVPWPSATPPTGWLKCNGAAFSAEEYPKLAKVYPTNKLPDLRGEFIR 441 Query: 61 GWDDGRGIDTGRSILSIQGYATED---HAHGLPSRSTIVTDATINFYFDEIWVNSGTDII 117 GWDD RGIDTGRS+LS Q + +T V D + Sbjct: 442 GWDDSRGIDTGRSLLSGQAATFIRTALQDYYGYDLNTNVKVGIAFATADSVITVGNPANP 501 Query: 118 KRGNTNDAGLPAPDYGTFKTYKQSVDGLGAAASETRPRNIAFNYIVRAA 166 K GN +D + D T + + D A RPRN++FNYIVRAA Sbjct: 502 KAGNNSDYVPASADNSITGTQRTAEDNFTGAWISMRPRNLSFNYIVRAA 550 >UniRef50_C5BH14 Tail fiber n=2 Tax=Enterobacteriaceae RepID=C5BH14_EDWI9 Length = 593 Score = 147 bits (370), Expect = 1e-34, Method: Composition-based stats. 
Identities = 63/158 (39%), Positives = 79/158 (50%), Gaps = 14/158 (8%) Query: 8 ALPVGVPVPWPSATPPTGWLKCNGAAFSAEEYPELAKAYPTNKLPDLRGEFIRGWDDGRG 67 A PVG P PWP+ + P+GW+KC G +FS YPELAKAYP +LPDLRGEFIRG+DD G Sbjct: 449 APPVGTPQPWPNTSIPSGWIKCAGQSFSTSSYPELAKAYPNGRLPDLRGEFIRGYDDYGG 508 Query: 68 IDTGRSILSIQGYATEDHAHGLPSRSTIVTDATINFYFDEIWVNSGTDIIKRGNTNDAGL 127 D+ R ILS QG A + + T + + Sbjct: 509 TDSQRQILSWQGDAMRNITGTFGVDDQTIEQVT--------------GVFREYGRFSYDA 554 Query: 128 PAPDYGTFKTYKQSVDGLGAAASETRPRNIAFNYIVRA 165 + G + + A+E RPRNIAF YIVRA Sbjct: 555 RSERNGAGRIIYFDASQVVPTANENRPRNIAFLYIVRA 592 >UniRef50_Q7N6T1 Similarities with DNA inversion product n=2 Tax=Photorhabdus luminescens subsp. laumondii RepID=Q7N6T1_PHOLL Length = 300 Score = 147 bits (370), Expect = 2e-34, Method: Composition-based stats. Identities = 70/158 (44%), Positives = 92/158 (58%), Gaps = 9/158 (5%) Query: 9 LPVGVPVPWPSATPPTGWLKCNGAAFSAEEYPELAKAYPTNKLPDLRGEFIRGWDDGRGI 68 +PVG P+PWP PP G+L CNG+AF+ +YP+LA+AYP +LPDLRGEFIRGWDDGRG+ Sbjct: 152 IPVGSPIPWPLPYPPVGYLTCNGSAFNKLQYPKLAEAYPDGRLPDLRGEFIRGWDDGRGV 211 Query: 69 DTGRSILSIQGYATEDHAHGLPSRSTIVTDATINFYFDEIWVNSGTDIIKRGNTNDAGLP 128 D GR++LS QG A + L + + I +S + + G+ Sbjct: 212 DMGRTMLSWQGDAMQRMTGFLEAGNGIG--------LMTRPHDSTSGVFLEGDLRTIS-H 262 Query: 129 APDYGTFKTYKQSVDGLGAAASETRPRNIAFNYIVRAA 166 GT + A+ETRPRNIAFNY+VRAA Sbjct: 263 VTQNGTSYAVSFDSSRVARTANETRPRNIAFNYVVRAA 300 >UniRef50_B7NJP1 Putative side tail fiber protein homolog from lambdoid prophage n=3 Tax=Escherichia coli RepID=B7NJP1_ECO7I Length = 686 Score = 146 bits (369), Expect = 2e-34, Method: Composition-based stats. 
Identities = 85/166 (51%), Positives = 104/166 (62%), Gaps = 7/166 (4%) Query: 1 VGLGEGSALPVGVPVPWPSATPPTGWLKCNGAAFSAEEYPELAKAYPTNKLPDLRGEFIR 60 +GLGEGSALPVGVPVPWP+ATPP GWLKC+G AF+ E+YP LA+AYPT +LPDLRGEFIR Sbjct: 528 LGLGEGSALPVGVPVPWPTATPPEGWLKCDGRAFTKEQYPVLARAYPTLRLPDLRGEFIR 587 Query: 61 GWDDGRGIDTGRSILSIQGYATEDHAHGLPSRSTIVTDATINFYFDEIWVNSGTDIIKRG 120 GWDDGR ID GR +LS Q + + I++ + ++ G D + G Sbjct: 588 GWDDGRKIDEGRKLLSWQKGTL------VGGHDDNDSALDISYMSNGNNIDYGGDKVFAG 641 Query: 121 NTNDAGLPAPDYGTFKTYKQSVDGLGAAASETRPRNIAFNYIVRAA 166 N L G + + GA + TRPRNIAFNYIVRAA Sbjct: 642 NYRSDYLWYAVLGGT-NSRAKAELNGAFFNITRPRNIAFNYIVRAA 686 >UniRef50_B3I8J5 DNA inversion product n=3 Tax=Enterobacteriaceae RepID=B3I8J5_ECOLX Length = 263 Score = 146 bits (368), Expect = 3e-34, Method: Composition-based stats. Identities = 95/169 (56%), Positives = 106/169 (62%), Gaps = 3/169 (1%) Query: 1 VGLGEGSALPVGVPVPWPSATPPTGWLKCNGAAFSAEEYPELAKAYPTNKLPDLRGEFIR 60 +GLGEGSALPVG PVPWPS TPPTGWLKCNGAAFSAEEYPELAKAYPTNKLPDLRGEFIR Sbjct: 95 LGLGEGSALPVGAPVPWPSETPPTGWLKCNGAAFSAEEYPELAKAYPTNKLPDLRGEFIR 154 Query: 61 GWDDGRGIDTGRSILSIQGYATED---HAHGLPSRSTIVTDATINFYFDEIWVNSGTDII 117 GWDD RGIDTGRS+LS Q + +T V D + Sbjct: 155 GWDDSRGIDTGRSLLSGQAATFIRTALQDYYGYDLNTNVKVGIAFATADSVITVGNPANP 214 Query: 118 KRGNTNDAGLPAPDYGTFKTYKQSVDGLGAAASETRPRNIAFNYIVRAA 166 K GN +D + D T + + D A RPRN++FNYIVRAA Sbjct: 215 KAGNNSDYVPASADNSITGTQRTAEDNFTGAWISMRPRNLSFNYIVRAA 263 >UniRef50_B7US81 Predicted tail fiber protein n=16 Tax=root RepID=B7US81_ECO27 Length = 521 Score = 145 bits (366), Expect = 5e-34, Method: Composition-based stats. 
Identities = 84/166 (50%), Positives = 94/166 (56%), Gaps = 22/166 (13%) Query: 1 VGLGEGSALPVGVPVPWPSATPPTGWLKCNGAAFSAEEYPELAKAYPTNKLPDLRGEFIR 60 +GLGEGSALPVGVPVPW SATPPTGWLKCNGAAFS+E YP LA+AYPTNKLPDLRGEFIR Sbjct: 378 LGLGEGSALPVGVPVPWSSATPPTGWLKCNGAAFSSEMYPRLARAYPTNKLPDLRGEFIR 437 Query: 61 GWDDGRGIDTGRSILSIQGYATEDHAHGLPSRSTIVTDATINFYFDEIWVNSGTDIIKRG 120 GWDDGRGID GR++LS Q + H G + + + S Sbjct: 438 GWDDGRGIDAGRTLLSGQDGTSFSHYGGNFDIGSGHSINNYDQIVSNQPGFSRFSFAGPS 497 Query: 121 NTNDAGLPAPDYGTFKTYKQSVDGLGAAASETRPRNIAFNYIVRAA 166 + RPRNI FNYIVRAA Sbjct: 498 RGDGVNYVTI----------------------RPRNITFNYIVRAA 521 >UniRef50_Q7N2R3 Similar to tail fiber assembly protein from bacteriophage n=1 Tax=Photorhabdus luminescens subsp. laumondii RepID=Q7N2R3_PHOLL Length = 233 Score = 142 bits (358), Expect = 3e-33, Method: Composition-based stats. Identities = 72/158 (45%), Positives = 86/158 (54%), Gaps = 18/158 (11%) Query: 10 PVGVPVPWPSATPPTGWLKCNGAAFSAEEYPELAKAYPTNKLPDLRGEFIRGWDDGRGID 69 PVGVP+P+PS P G+L CNG AF YP+LA AYP+ LPDLRGEFIRGWDD RG+D Sbjct: 93 PVGVPLPYPSRYTPAGYLTCNGQAFDKSRYPQLAIAYPSGILPDLRGEFIRGWDDSRGVD 152 Query: 70 TGRSILSIQGYATEDHAHGLPSRSTIVTDATINFYFDEIWVNSGTDIIKRGNTNDAGLPA 129 GR +LS Q +DH H +V D ++ GN + Sbjct: 153 MGRGMLSWQPAGIQDHMHYKVISKQVVED-----------------LVLAGNQSWGTEKN 195 Query: 130 PDYGTFKTYKQSVDGL-GAAASETRPRNIAFNYIVRAA 166 Y S G+ G +ETRPRNIAFNYIVRAA Sbjct: 196 STYTRSLDQNISTGGVIGTTVNETRPRNIAFNYIVRAA 233 >UniRef50_C3X8U2 Phage Tail Collar Domain containing protein n=1 Tax=Oxalobacter formigenes OXCC13 RepID=C3X8U2_OXAFO Length = 266 Score = 141 bits (356), Expect = 7e-33, Method: Composition-based stats. 
Identities = 50/168 (29%), Positives = 67/168 (39%), Gaps = 17/168 (10%) Query: 7 SALPVGVPVPWPSATPPTGWLKCNGAAFSAEEYPELAKAYPT----------NKLPDLRG 56 + +P G + PP G+LK +GA +YP L A T LPDLRG Sbjct: 105 NGVPTGTIAFFAMTAPPAGYLKADGAIIQRTDYPALFTAIGTTFGEGDGTTTFTLPDLRG 164 Query: 57 EFIRGWDDGRGIDTGRSILSIQGYATEDHAHGLPSRSTIVTDATINFYFDEIWVNSGTDI 116 EFIRGWD+GR ID R+ SIQG A + L +D+ +N S Sbjct: 165 EFIRGWDNGRNIDCERAFGSIQGDAIRNVTGQLRYAGPQNSDSVMN-------YQSALQW 217 Query: 117 IKRGNTNDAGLPAPDYGTFKTYKQSVDGLGAAASETRPRNIAFNYIVR 164 + + + ASE RPRNIA ++ Sbjct: 218 TSVSQKSPYSAQSSQGSNYYEINFDASRSVPTASENRPRNIALLACIK 265 >UniRef50_C6CGA4 Tail Collar domain protein n=7 Tax=Enterobacteriaceae RepID=C6CGA4_DICZE Length = 401 Score = 140 bits (352), Expect = 2e-32, Method: Composition-based stats. Identities = 69/156 (44%), Positives = 82/156 (52%), Gaps = 2/156 (1%) Query: 11 VGVPVPWPSATPPTGWLKCNGAAFSAEEYPELAKAYPTNKLPDLRGEFIRGWDDGRGIDT 70 VG+P+PWP ATPP GWLKCNG AF +P+LA+AYP LPDLRGEFIRGWDDGRG+D Sbjct: 248 VGIPLPWPQATPPAGWLKCNGQAFDKNAFPKLAQAYPGGVLPDLRGEFIRGWDDGRGVDV 307 Query: 71 GRSILSIQGYATEDHAHGLPSRSTIVTDATINFYFDEIWVNSGTDIIKRGNTNDAGLPAP 130 R +LS Q L + + N + S I + A Sbjct: 308 ARELLSWQKGTLTISDPNLSAVNVGALIHANNDSANTY--KSMGFDIVNKSDYAMLRAAI 365 Query: 131 DYGTFKTYKQSVDGLGAAASETRPRNIAFNYIVRAA 166 + T +G TRPRNIAFNYIVRAA Sbjct: 366 NVETVGAQDLDSNGWQFGYGATRPRNIAFNYIVRAA 401 >UniRef50_C6ABW9 Phage tail collar protein n=1 Tax=Bartonella grahamii as4aup RepID=C6ABW9_BARGA Length = 370 Score = 140 bits (352), Expect = 2e-32, Method: Composition-based stats. 
Identities = 52/169 (30%), Positives = 71/169 (42%), Gaps = 27/169 (15%) Query: 6 GSALPVGVPVPWPSATPPTGWLKCNGAAFSAEEYPELAKAYPT----------NKLPDLR 55 +++PVG + +P+ T P GWLK NGA S +Y +L T +LPDLR Sbjct: 218 NNSMPVGTVIYYPALTVPKGWLKANGALISRSDYAQLFAVIGTTYGAGDGKTTFRLPDLR 277 Query: 56 GEFIRGWDDGRGIDTGRSILSIQGYATEDHAHGLPSRSTIVTDATINFYFDEIWVNSGTD 115 GEF+RG DD R ID R+I S QG A + L + + Y Sbjct: 278 GEFLRGVDDERNIDPNRTIGSQQGDAIRNITGELNFDAKAKAASGAFKYGG--------- 328 Query: 116 IIKRGNTNDAGLPAPDYGTFKTYKQSVDGLGAAASETRPRNIAFNYIVR 164 + G+ T K A+E RPRNIA ++R Sbjct: 329 --------VSNSSNTSSGSSSTIKFDASRSVPTANENRPRNIALLALIR 369 >UniRef50_D1TPQ4 Phage tail collar domain protein n=1 Tax=Yersinia pestis KIM D27 RepID=D1TPQ4_YERPE Length = 262 Score = 139 bits (350), Expect = 3e-32, Method: Composition-based stats. Identities = 66/131 (50%), Positives = 84/131 (64%) Query: 1 VGLGEGSALPVGVPVPWPSATPPTGWLKCNGAAFSAEEYPELAKAYPTNKLPDLRGEFIR 60 +GLGEGSA+PVGVP+PWP+ATPP GWLKCNGA F +YP+LA AYP+ LPDLRGEFIR Sbjct: 96 LGLGEGSAIPVGVPLPWPTATPPEGWLKCNGAIFDKVKYPKLALAYPSGILPDLRGEFIR 155 Query: 61 GWDDGRGIDTGRSILSIQGYATEDHAHGLPSRSTIVTDATINFYFDEIWVNSGTDIIKRG 120 GWDDG G+D GR ILSIQG A + + G+ R+ + + ++ G Sbjct: 156 GWDDGLGVDAGREILSIQGDAIRNISGGIQGRNEATSARLFSSNATGVFRTDGQFGSYAA 215 Query: 121 NTNDAGLPAPD 131 + + A D Sbjct: 216 SADVAVGVTDD 226 >UniRef50_C4S5W0 Putative uncharacterized protein n=2 Tax=Yersinia bercovieri ATCC 43970 RepID=C4S5W0_YERBE Length = 388 Score = 138 bits (348), Expect = 5e-32, Method: Composition-based stats. 
Identities = 60/155 (38%), Positives = 78/155 (50%), Gaps = 10/155 (6%) Query: 11 VGVPVPWPSATPPTGWLKCNGAAFSAEEYPELAKAYPTNKLPDLRGEFIRGWDDGRGIDT 70 +G+P+P+P + P G+LKCNGAAFS YP+LA YP+ LPD+RG IRGWDDGRG+D Sbjct: 243 IGIPIPYPLPSVPVGYLKCNGAAFSTVTYPKLALKYPSGVLPDMRGNAIRGWDDGRGVDA 302 Query: 71 GRSILSIQGYATEDHAHGLPSRSTIVTDATINFYFDEIWVNSGTDIIKRGNTNDAGLPAP 130 GR++LS Q A ++ + + N G Sbjct: 303 GRALLSQQLDALQNITGNFYMGGSKQVAGVVT----------TGAFGPMEVYNALGNQVT 352 Query: 131 DYGTFKTYKQSVDGLGAAASETRPRNIAFNYIVRA 165 G + A+ETR RNIAFNYIVRA Sbjct: 353 TAGNIGGITFDASRVSRTAAETRMRNIAFNYIVRA 387 >UniRef50_C4GFX3 Putative uncharacterized protein n=2 Tax=Kingella oralis ATCC 51147 RepID=C4GFX3_9NEIS Length = 310 Score = 138 bits (347), Expect = 7e-32, Method: Composition-based stats. Identities = 52/173 (30%), Positives = 67/173 (38%), Gaps = 20/173 (11%) Query: 2 GLGEGSALPVGVPVPWPSATPPTGWLKCNGAAFSAEEYPELAKAYPT----------NKL 51 G S P G + + PTGWLK NGA S Y L A T L Sbjct: 147 GYTANSYCPSGQIGLFATDYAPTGWLKANGAVLSRTVYTNLFAAIGTRFGAGDGHSTFNL 206 Query: 52 PDLRGEFIRGWDDGRGIDTGRSILSIQGYATEDHAHGLPSRSTIVTDATINFYFDEIWVN 111 PDLRGEF R WDDGRG+D GR + S Q A + D + + Sbjct: 207 PDLRGEFPRFWDDGRGVDAGRVLGSWQSDAIRNIT---AQMYLYGQDGSSSQGAFGFRKQ 263 Query: 112 SGTDIIKRGNTNDAGLPAPDYGTFKTYKQSVDGLGAAASETRPRNIAFNYIVR 164 ++ N N+AG+ + + A E RPRNIA ++ Sbjct: 264 GERGLVWSRNDNNAGVVMDFW-------LDASKVVPTAHENRPRNIALLACIK 309 >UniRef50_C6C6Z0 Tail Collar domain protein n=1 Tax=Dickeya dadantii Ech703 RepID=C6C6Z0_DICDC Length = 183 Score = 137 bits (346), Expect = 1e-31, Method: Composition-based stats. 
Identities = 64/155 (41%), Positives = 76/155 (49%), Gaps = 18/155 (11%) Query: 11 VGVPVPWPSATPPTGWLKCNGAAFSAEEYPELAKAYPTNKLPDLRGEFIRGWDDGRGIDT 70 +G+P PWP A P GWLKCNG AF +YPELAK YP+ LPDLRGEFIRGWDDGRG+DT Sbjct: 46 IGIPQPWPLADAPEGWLKCNGQAFDTAKYPELAKCYPSGTLPDLRGEFIRGWDDGRGVDT 105 Query: 71 GRSILSIQGYATEDHAHGLPSRSTIVTDATINFYFDEIWVNSGTDIIKRGNTNDAGLPAP 130 R ++S Q + I D G I + + A Sbjct: 106 SRELVSAQ------------------SGTYITGDSDSQPSVQGIGNITECHVDSPDSNAR 147 Query: 131 DYGTFKTYKQSVDGLGAAASETRPRNIAFNYIVRA 165 K TRPRNI+FNYIV+A Sbjct: 148 SIYWIPATKTDRLTGPTYWGVTRPRNISFNYIVKA 182 >UniRef50_Q65WH4 Putative uncharacterized protein n=1 Tax=Mannheimia succiniciproducens MBEL55E RepID=Q65WH4_MANSM Length = 296 Score = 136 bits (343), Expect = 2e-31, Method: Composition-based stats. Identities = 53/153 (34%), Positives = 71/153 (46%), Gaps = 1/153 (0%) Query: 11 VGVPVPWPSATPPTGWLKCNGAAFSAEEYPELAKAYPTNKLPDLRGEFIRGWDDGRGIDT 70 +G+P P+P + P G L NG FS YPELAK YP+ +LPDLRGEFIRGWD+GRG+D+ Sbjct: 140 IGIPFPYPLSAVPDGCLAFNGQTFSTTTYPELAKKYPSGRLPDLRGEFIRGWDNGRGVDS 199 Query: 71 GRSILSIQGYATEDHAHGLPSRSTIVTDATINFYFDEIWVNSGTDIIKRGNTNDAGLPAP 130 R +L QG H H + + + + + G Sbjct: 200 SRELLRSQGAELSAHTHYVTVTRYANSSGEFGAKISTFSAINNSGWLLSGADGLLLAANK 259 Query: 131 DYGTFKTYKQSVD-GLGAAASETRPRNIAFNYI 162 + +ETRPRN+AF YI Sbjct: 260 SGEIVSEKNSVANLISNTGGNETRPRNVAFQYI 292 >UniRef50_Q1I687 Putative phage variable tail fibre protein n=1 Tax=Pseudomonas entomophila L48 RepID=Q1I687_PSEE4 Length = 898 Score = 136 bits (342), Expect = 3e-31, Method: Composition-based stats. 
Identities = 53/178 (29%), Positives = 80/178 (44%), Gaps = 15/178 (8%) Query: 3 LGEGSALPVGVPVPWPSATPPTGWLKCNGAAFSAEEYPELAKAYPT-----------NKL 51 L SALPVG +P+P T P G+L+ +G+ SA YP+LA +L Sbjct: 378 LNTASALPVGTMLPFPRGTVPAGFLEVDGSTQSAAVYPDLAAYLGGAFNTGNEAAGFFRL 437 Query: 52 PDLRGEFIRGWDDGRGIDTGRSILSIQGYATEDHAHG----LPSRSTIVTDATINFYFDE 107 PD RGEF+RGWD GRG+D+GR++ S QG + + H H + + + + + Sbjct: 438 PDTRGEFLRGWDHGRGVDSGRAVGSTQGESFKAHTHKDVGFIDNVGGGSGASAVTGATGD 497 Query: 108 IWVNSGTDIIKRGNTNDAGLPAPDYGTFKTYKQSVDGLGAAASETRPRNIAFNYIVRA 165 + G + G + SETRPRN+A + ++A Sbjct: 498 VTSIYGKAYGNSASATAKAYKESAPGALGGAIAGLISGSTGDSETRPRNLAVMWCIKA 555 Score = 127 bits (319), Expect = 1e-28, Method: Composition-based stats. Identities = 47/170 (27%), Positives = 74/170 (43%), Gaps = 14/170 (8%) Query: 7 SALPVGVPVPWPSATPPTGWLKCNGAAFSAEEYPELAKAYPT-----------NKLPDLR 55 S+ PVG +P+P A P G+L+ +G+ S YP+LA +LPD R Sbjct: 579 SSTPVGAILPFPKAEVPAGYLELDGSLQSVATYPDLAAYLGASYNNGTEPAGYFRLPDYR 638 Query: 56 GEFIRGWDDGRGIDTGRSILSIQGYATEDHAHGLPSRSTIVTDATINFYFDEIWVNSGTD 115 GEF+RGWD GRG+D GR + + Q A ++ + R + + Sbjct: 639 GEFLRGWDHGRGVDPGRGMGTSQSDAIQNITGSIGLRGGAGVGLGVMGGASGAFSTVFG- 697 Query: 116 IIKRGNTNDAGLPAPDYGTFKTYKQSVDGLGAAASETRPRNIAFNYIVRA 165 + + N A + + AA+ETRPRN + + ++A Sbjct: 698 --ESTSANTITRDASSIAASDIARFDASKVVRAAAETRPRNQSVMWCIKA 745 >UniRef50_B8DPV9 Tail Collar domain protein n=1 Tax=Desulfovibrio vulgaris str. 'Miyazaki F' RepID=B8DPV9_DESVM Length = 530 Score = 135 bits (340), Expect = 5e-31, Method: Composition-based stats. 
Identities = 51/169 (30%), Positives = 73/169 (43%), Gaps = 17/169 (10%) Query: 3 LGEGSALPVGVPVPWPSATPPTGWLKCNGAAFSAEEYPELAKAYPTNK------LPDLRG 56 L + +P+G + +P T PTG+L C G + YP+L LPDLRG Sbjct: 206 LACAAFVPIGAILDFPVNTVPTGFLVCAGQVVTRTAYPDLVTYLTGGTVAVNATLPDLRG 265 Query: 57 EFIRGWDDGRGIDTGRSILSIQGYATEDHAHGLPSRSTIVTDATINFYFDEIWVNSGTDI 116 EF RG D GRG+D GR + S QG A + L N+ + +G Sbjct: 266 EFRRGADLGRGVDAGRVVGSAQGDAIRNITGSL-----------YNYIQNNASQENGALR 314 Query: 117 IKRGNTNDAGLPAPDYGTFKTYKQSVDGLGAAASETRPRNIAFNYIVRA 165 + +T ++ A ++ T ASE RPRNIA ++A Sbjct: 315 TQVASTLNSPFGAGTIMSWSTLSIDASRQVPTASENRPRNIAVVPCIKA 363 >UniRef50_C8PDQ5 Phage Tail Collar Domain protein n=1 Tax=Campylobacter gracilis RM3268 RepID=C8PDQ5_9PROT Length = 391 Score = 135 bits (339), Expect = 6e-31, Method: Composition-based stats. Identities = 51/172 (29%), Positives = 74/172 (43%), Gaps = 19/172 (11%) Query: 3 LGEGSALPVGVPVPWPSATPPTGWLKCNGAAFSAEEYPELAKAYPT----------NKLP 52 L + LPVG + P G+L CNGAA S Y +L A T +P Sbjct: 228 LLSSTILPVGTIITSARTPAPDGFLLCNGAAISRSAYTDLFSAIGTAYGAGDGSSSFNIP 287 Query: 53 DLRGEFIRGWDDGRGIDTGRSILSIQGYATEDHAHGLPSRSTIVTDATINFYFDEIWVNS 112 DLRGEFIRG D+GRG+D GR++ S QG A + + ++ + Sbjct: 288 DLRGEFIRGADNGRGVDGGRALGSAQGDAIRNIT--ARAIGMGDRNSIPTLLGALYGIQK 345 Query: 113 GTDIIKRGNTNDAGLPAPDYGTFKTYKQSVDGLGAAASETRPRNIAFNYIVR 164 T I G+ G + + + A+E RPRN+A N+ ++ Sbjct: 346 STRIESVGDVLGD-------GGYFEWGFDASKVVPVANENRPRNVAVNFYIK 390 >UniRef50_C6CP88 Tail Collar domain protein n=5 Tax=Enterobacteriaceae RepID=C6CP88_DICZE Length = 485 Score = 134 bits (338), Expect = 8e-31, Method: Composition-based stats. 
Identities = 66/156 (42%), Positives = 86/156 (55%), Gaps = 14/156 (8%) Query: 11 VGVPVPWPSATPPTGWLKCNGAAFSAEEYPELAKAYPTNKLPDLRGEFIRGWDDGRGIDT 70 G+P+PWP A PTGWLKCNG AF YP LA+ YP+ LPDLRGEFIRGWDDGRG+D+ Sbjct: 344 AGIPLPWPQAAVPTGWLKCNGQAFDKNRYPRLAQVYPSGVLPDLRGEFIRGWDDGRGVDS 403 Query: 71 GRSILSIQGYATEDHAHGLPSRSTIVTDATINFYFDEIWVNSGTDIIKRGNTNDAGLPAP 130 GR +LS Q + ++ + ++ D + ++ + ++ Sbjct: 404 GREVLSQQRGSLINYDGPDSAPTS-----------DSLRLSVSAAQADAVSASEYAGVML 452 Query: 131 DYGTFKTYKQSVDGLGAAASETRPRNIAFNYIVRAA 166 Y + S G A TRPRNIAFNYIVRAA Sbjct: 453 SYTAYNITTVSAAGYVGA---TRPRNIAFNYIVRAA 485 >UniRef50_C6C5D2 Tail Collar domain protein n=2 Tax=Dickeya dadantii RepID=C6C5D2_DICDC Length = 498 Score = 134 bits (338), Expect = 9e-31, Method: Composition-based stats. Identities = 69/168 (41%), Positives = 86/168 (51%), Gaps = 16/168 (9%) Query: 11 VGVPVPWPSATPPTGWLKCNGAAFSAEEYPELAKAYPTNKLPDLRGEFIRGWDDGRGIDT 70 VG+P+PWP AT PTGWLKCNG +F YP+LA YP+ LPDLRGEFIRGWDDGRG+D Sbjct: 335 VGIPLPWPQATAPTGWLKCNGQSFDKALYPKLATVYPSGVLPDLRGEFIRGWDDGRGVDA 394 Query: 71 GRSILSIQGYATEDHAHGLPSRSTIVTDATINFYFDEIWVNSGTDIIKRGNTNDAGLPAP 130 GR+IL+ Q + + V + + ++ I AP Sbjct: 395 GRAILTAQNPTYLR-TGMMDYNGSDVDNIGVYIGMGYAEADTAAKSISAPAG---AFRAP 450 Query: 131 DYGTFKTYKQSVDGLGAAAS------------ETRPRNIAFNYIVRAA 166 + +G+ AS TRPRNIAFNYIVRAA Sbjct: 451 NNIDLTEQASRDNGVNGTASNTVYASEGSVWVSTRPRNIAFNYIVRAA 498 >UniRef50_D0KGE1 Shufflon domain protein n=1 Tax=Pectobacterium wasabiae WPP163 RepID=D0KGE1_PECWW Length = 532 Score = 133 bits (334), Expect = 2e-30, Method: Composition-based stats. 
Identities = 51/166 (30%), Positives = 72/166 (43%), Gaps = 19/166 (11%) Query: 2 GLGEGSALPVGVPVPWPSATPPTGWLKCNGAAFSAEEYPELAKAYPTNKLPDLRGEFIRG 61 S++ G W + PP GWL+ NG F+ P LA YP++++PD RG F RG Sbjct: 383 AWKSSSSIQPGTITMWGTPVPPEGWLELNGQPFNPSGNPVLASLYPSSQVPDFRGYFPRG 442 Query: 62 WDDGRGIDT-GRSILSIQGYATEDHAHGL-PSRSTIVTDATINFYFDEIWVNSGTDIIKR 119 WD+G GID R+ILS+QG A + P S+ + Y NSG+ Sbjct: 443 WDNGAGIDPDSRAILSVQGDAIRNITGEFNPGGSSNWGKGVFSSYGWPYPSNSGSANDAS 502 Query: 120 GNTNDAGLPAPDYGTFKTYKQSVDGLGAAASETRPRNIAFNYIVRA 165 + A+E RP NIA +I++A Sbjct: 503 -----------------IITFDASRVVPTAAENRPTNIAVMFIIKA 531 >UniRef50_Q7NAA0 Complete genome; segment 1/17 n=2 Tax=Photorhabdus RepID=Q7NAA0_PHOLL Length = 351 Score = 133 bits (334), Expect = 3e-30, Method: Composition-based stats. Identities = 59/162 (36%), Positives = 78/162 (48%), Gaps = 9/162 (5%) Query: 7 SALPVGVPVPWPSATPPTGWLKCNGAAFSAEEYPELAKAYPTNKLPDLRGEFIRGWDDGR 66 + VG+P+PW T P G+L C+G F YP+L +AYP+ LPDLRGEFIRGWD+GR Sbjct: 197 DDILVGIPLPWSKPTAPAGYLICSGQQFDKSMYPKLGEAYPSGALPDLRGEFIRGWDNGR 256 Query: 67 GIDTGRSILSIQGYA-TED-HAHGLPSRSTIVTDATINFYFDEIWVNSGTDIIKRGNTND 124 ID+GR ILS Q + + H ++ IN + + Sbjct: 257 SIDSGREILSHQNSTKLPNLYTHAASENIGLLVSPPINHFSSNYPSEIMASDFEEAEFGS 316 Query: 125 AGLPAPDYGTFKTYKQSVDGLGAAASETRPRNIAFNYIVRAA 166 + + S + RPRNIAFNYIVRAA Sbjct: 317 GQYFSTPLNPTGSVSLSTFRV-------RPRNIAFNYIVRAA 351 >UniRef50_A9IRI0 Phage related protein n=9 Tax=Bartonella RepID=A9IRI0_BART1 Length = 324 Score = 130 bits (327), Expect = 1e-29, Method: Composition-based stats. 
Identities = 43/167 (25%), Positives = 65/167 (38%), Gaps = 11/167 (6%) Query: 7 SALPVGVPVPWPSATPPTGWLKCNGAAFSAEEYPELAKAYP---------TNKLPDLRGE 57 + P G + P GWL C+G A+ E+YP+L KA T K+PD RG Sbjct: 159 ESFPAGFIATFAMRNIPNGWLLCDGTAYKREDYPQLFKAIGDKWGKNSDTTFKVPDFRGM 218 Query: 58 FIRGWDDGRGIDTGRSILSIQGYATEDHAHGLPSRSTIVTDATINFYFDEIWVNSGTDII 117 F+RG+DDGRG+D R Q + + H H + NF + + +G Sbjct: 219 FLRGFDDGRGLDNDRKFADEQQDSIKSHTHIGTVEESG--AHVHNFEYKGVGWPTGNIGR 276 Query: 118 KRGNTNDAGLPAPDYGTFKTYKQSVDGLGAAASETRPRNIAFNYIVR 164 + + + +ETRP N Y ++ Sbjct: 277 LPNYYTYNTTLKGKTDSAGAHTHKITLSHTGEAETRPVNTTVIYAIK 323 >UniRef50_B2ZY49 Phage tail collar domain protein n=1 Tax=Ralstonia phage RSL1 RepID=B2ZY49_9CAUD Length = 498 Score = 129 bits (324), Expect = 4e-29, Method: Composition-based stats. Identities = 48/185 (25%), Positives = 68/185 (36%), Gaps = 23/185 (12%) Query: 3 LGEGSALPVGVPVPWPSATPPTGWLKCNGAAFSAEEYPELAKAYPT----------NKLP 52 L +P G +P+ T P G+L CN AA S + L T LP Sbjct: 313 LNPPQLVPPGTILPFAGTTIPAGYLACNAAAISRTGFASLYSVIGTTYGVGNGSTTFNLP 372 Query: 53 DLRGEFIRGWDDGRGIDTGRSILSIQGYATEDHAHGLPSRSTIVT----DATINFYFDEI 108 DLRG F+RGWD+GRG D GR + QG A H H + + + + Sbjct: 373 DLRGVFVRGWDNGRGQDPGRVFGTYQGDAFRSHNHAVSDPGHAHGVYDPGHSHTWTLGTL 432 Query: 109 WVNSGTDIIKRGNTNDAGL---------PAPDYGTFKTYKQSVDGLGAAASETRPRNIAF 159 + G + G + L +ET P+N+A Sbjct: 433 RQSGGDTSCYVPSARYGGGEFQFTETTAAVGTGIGIYGNVTGIGTLVNGGAETTPKNVAM 492 Query: 160 NYIVR 164 NYI++ Sbjct: 493 NYIIK 497 >UniRef50_A9DEM1 Tail protein 2 n=1 Tax=Yersinia phage PY100 RepID=A9DEM1_9CAUD Length = 255 Score = 129 bits (324), Expect = 4e-29, Method: Composition-based stats. 
Identities = 50/194 (25%), Positives = 73/194 (37%), Gaps = 39/194 (20%) Query: 6 GSALPVGVPVPWPSATPPTGWLKCNGAAFSAEEYPELAKAYPTNKLPDLRGEFIRGWDDG 65 G+A PVG P+ WPS T P GW G F +YP LAK YP+ LPD+RG I+ DG Sbjct: 68 GNATPVGAPLAWPSDTAPDGWALMIGQTFDKVKYPLLAKVYPSGVLPDMRGRVIKAKPDG 127 Query: 66 RGIDTGRSILSIQGYATEDHAHGLPSRSTIVTDATINFYFDEIWVNSGTDIIKRGNTNDA 125 R++LS++ + H H + + T AT F + + Sbjct: 128 ------RAVLSLEEDQVKSHTHTGKAATAGGTRATSTFDHGNKRTTTNGNHTHGSPQGAR 181 Query: 126 GLPAPDYGTFK---------------------------------TYKQSVDGLGAAASET 152 + Y + ++ ++ +E Sbjct: 182 HGGSGQYTSGDDETNSVFNWPATSAAGDHFHDVQIGPHNHNVDINHEHTLQIDATGGTEN 241 Query: 153 RPRNIAFNYIVRAA 166 +NIA NYIVR A Sbjct: 242 TVKNIAMNYIVRLA 255 >UniRef50_C4TTI1 Tail fiber protein n=2 Tax=Yersinia RepID=C4TTI1_YERKR Length = 402 Score = 129 bits (323), Expect = 5e-29, Method: Composition-based stats. Identities = 60/155 (38%), Positives = 77/155 (49%), Gaps = 30/155 (19%) Query: 11 VGVPVPWPSATPPTGWLKCNGAAFSAEEYPELAKAYPTNKLPDLRGEFIRGWDDGRGIDT 70 +G P+PWP P G+LKCNGA F+ +YP+LA AYP+ LPDLRGEFIRG+DDGRG+ Sbjct: 277 IGTPIPWPLTIAPAGYLKCNGAPFNKTQYPKLALAYPSGVLPDLRGEFIRGFDDGRGVRP 336 Query: 71 GRSILSIQGYATEDHAHGLPSRSTIVTDATINFYFDEIWVNSGTDIIKRGNTNDAGLPAP 130 + +L QG + H HG+ N G+ Sbjct: 337 NQPLLGWQGSEIQSHNHGI------------------------------TNFEIRGVTGG 366 Query: 131 DYGTFKTYKQSVDGLGAAASETRPRNIAFNYIVRA 165 + + + ETRPRNIAFNYIVRA Sbjct: 367 PTNAWFPSTNGISTNNSGGDETRPRNIAFNYIVRA 401 >UniRef50_C3X8Y3 Putative uncharacterized protein n=1 Tax=Oxalobacter formigenes OXCC13 RepID=C3X8Y3_OXAFO Length = 270 Score = 128 bits (321), Expect = 7e-29, Method: Composition-based stats. 
Identities = 51/167 (30%), Positives = 73/167 (43%), Gaps = 35/167 (20%) Query: 8 ALPVGVPVPWPSATPPTGWLKCNGAAFSAEEYPELAKAYP----------TNKLPDLRGE 57 A+P G V + S P G+LK +G+A EEY EL A T LPDLRGE Sbjct: 128 AVPAGTVVYFCSHKAPYGYLKADGSAVGREEYKELFAAIGVYFGSGDGVSTFNLPDLRGE 187 Query: 58 FIRGWDDGRGIDTGRSILSIQGYATEDHAHGLPSRSTIVTDATINFYFDEIWVNSGTDII 117 FIR D+GRG+D GR + ++Q + H HG R + ++ + + ++ Sbjct: 188 FIRSLDNGRGVDAGRELGNVQMDEFKSHYHGFLDRPNMRLESGVYTWTPQV--------- 238 Query: 118 KRGNTNDAGLPAPDYGTFKTYKQSVDGLGAAASETRPRNIAFNYIVR 164 + S+ A SETRPRNIA ++ Sbjct: 239 ----------------MEVAEQDSISTTRAGGSETRPRNIALLACIK 269 >UniRef50_A7FIU0 Tail collar domain protein n=1 Tax=Yersinia pseudotuberculosis IP 31758 RepID=A7FIU0_YERP3 Length = 402 Score = 128 bits (321), Expect = 8e-29, Method: Composition-based stats. Identities = 64/160 (40%), Positives = 79/160 (49%), Gaps = 13/160 (8%) Query: 6 GSALPVGVPVPWPSATPPTGWLKCNGAAFSAEEYPELAKAYPTNKLPDLRGEFIRGWDDG 65 PVG+P+PWP+ PP+GWLKCNGA F+ ++P+LA Y LPDLRGEFIRGWDDG Sbjct: 255 NDVAPVGIPMPWPAHIPPSGWLKCNGATFNKAQFPQLASVYTRGVLPDLRGEFIRGWDDG 314 Query: 66 RGIDTGRSILSIQGYATEDHAHGLPSRSTIVTDATINFYFDEIWVNSGTDIIKRGNTNDA 125 + D GR +LS Q D I + S + NT Sbjct: 315 KLADPGRGLLSFQEGTVV-----------GGYDDNDTGDISSIGLYSSGFGDQLTNTQWV 363 Query: 126 GLPAPDYGTFKTYKQSVDGLGAAASETRPRNIAFNYIVRA 165 + + S+ A TRPRNIAFNYIVRA Sbjct: 364 SINGKRW--ITAGVSSIRYEWYAYLSTRPRNIAFNYIVRA 401 >UniRef50_C6CG98 Tail Collar domain protein n=1 Tax=Dickeya zeae Ech1591 RepID=C6CG98_DICZE Length = 196 Score = 128 bits (321), Expect = 8e-29, Method: Composition-based stats. 
Identities = 67/160 (41%), Positives = 79/160 (49%), Gaps = 28/160 (17%) Query: 11 VGVPVPWPSATPPTGWLKCNGAAFSAEEYPELAKAYPTNKLPDLRGEFIRGWDDGRGIDT 70 +G+P PWP A P GWLKCNG F +YP+LAK YP LPDLRGEFIRGWDD RG+DT Sbjct: 59 IGIPQPWPLAEAPEGWLKCNGQTFDTAKYPQLAKLYPAGTLPDLRGEFIRGWDDERGVDT 118 Query: 71 GRSILSIQGYATEDHAHGLPSRSTIVTDATINFYFDEIWVNSGTDIIKRGNTNDAGLPAP 130 R +LS Q H L T + GN ++ P Sbjct: 119 DRKLLSAQAG-----THILGDDGGYPT------------------LNSIGNLSECNADKP 155 Query: 131 DYGTFKTYKQSVDGLGAAASE-----TRPRNIAFNYIVRA 165 D Y + ASE TRPRNIAF+YIV+A Sbjct: 156 DGNVRTLYWLDTNKSEKLASEKFWGATRPRNIAFSYIVKA 195 >UniRef50_C6BT48 Tail Collar domain protein n=1 Tax=Desulfovibrio salexigens DSM 2638 RepID=C6BT48_DESAD Length = 208 Score = 127 bits (320), Expect = 1e-28, Method: Composition-based stats. Identities = 56/160 (35%), Positives = 73/160 (45%), Gaps = 10/160 (6%) Query: 6 GSALPVGVPVPWPSATPPTGWLKCNGAAFSAEEYPELAKAYPTNKLPDLRGEFIRGWDDG 65 S P+G + TPP GWL+CNG S YPELA +PDLRGEFIRG D G Sbjct: 58 ASDYPIGAVAAYRGDTPPVGWLECNGQ--STTGYPELAAVVGA-NVPDLRGEFIRGLDSG 114 Query: 66 RGIDTGRSILSIQGYATEDHAHGLPSRSTIVTDATINFYFDEIWVNSGTDIIKRGNTNDA 125 RG+D GR++ S Q A E H+H + T T + Y + T Sbjct: 115 RGVDAGRALGSAQADAMERHSHQTTITVSGRTSVTASPYHS-------AGAARSLVTTPN 167 Query: 126 GLPAPDYGTFKTYKQSVDGLGAAASETRPRNIAFNYIVRA 165 +F + +ETRPRN+A YI++A Sbjct: 168 FGSPFGGASFSASGTGTSTSVGSGAETRPRNVALMYIIKA 207 >UniRef50_C3X912 Phage tail collar domain-containing protein n=1 Tax=Oxalobacter formigenes OXCC13 RepID=C3X912_OXAFO Length = 436 Score = 126 bits (316), Expect = 3e-28, Method: Composition-based stats. 
Identities = 43/168 (25%), Positives = 62/168 (36%), Gaps = 25/168 (14%) Query: 7 SALPVGVPVPWPSATPPTGWLKCNGAAFSAEEYPELAKAYPT----------NKLPDLRG 56 ++P G + + TPP G+L NGA S Y L A T +LPDLRG Sbjct: 283 DSVPAGSVHYFATQTPPDGYLVANGALVSRTVYARLFSAIGTTFGEGDGGSTFQLPDLRG 342 Query: 57 EFIRGWDDGRGIDTGRSILSIQGYATEDHAHGLPSRSTIVTDATINFYFDEIWVNSGTDI 116 EF+RGWD R +D R ++QG A + + FY+ Sbjct: 343 EFLRGWDAARNLDPERGFGTVQGDAIRNIIGTFGGNDQERRFLSGPFYY----------- 391 Query: 117 IKRGNTNDAGLPAPDYGTFKTYKQSVDGLGAAASETRPRNIAFNYIVR 164 D G + + A+E RP N+A ++ Sbjct: 392 ----IGTDGGGKTGSSNGTDNFGFDASRVVPTANENRPHNVALLACIK 435 >UniRef50_A6VBH2 Putative tail fiber protein n=2 Tax=Pseudomonas aeruginosa PA7 RepID=A6VBH2_PSEA7 Length = 654 Score = 125 bits (314), Expect = 4e-28, Method: Composition-based stats. Identities = 47/196 (23%), Positives = 71/196 (36%), Gaps = 33/196 (16%) Query: 2 GLGEGSALPVGVPVPWPSATPPTGWLKCNGAAFSAEEYPELAKAYPT----------NKL 51 L + +P G V + +PP G+LK NGAA S Y L T L Sbjct: 458 NLNPQAIVPAGAVVAFAMYSPPAGYLKANGAAVSRTAYAALFATIGTYYGAGDGSTTFNL 517 Query: 52 PDLRGEFIRGWDDGRGIDTGRSILSIQGYATEDHAHGLPSRSTIVTDAT----------- 100 PD RGEF+R DDGRG+D GR + ++Q H HG S T Sbjct: 518 PDYRGEFLRALDDGRGLDLGRQLGTLQSSQNLAHTHGASSSGNGGHTHTVTGTAAAAGAH 577 Query: 101 ------------INFYFDEIWVNSGTDIIKRGNTNDAGLPAPDYGTFKTYKQSVDGLGAA 148 ++ V + ++ + T+ ++ + Sbjct: 578 SHSIASVNATALVSGTRLATLVGNASNSTTDVAGDHTHAVTGVAALEGTHNHTIYVESSG 637 Query: 149 ASETRPRNIAFNYIVR 164 SE RPRN++ ++ Sbjct: 638 GSEARPRNVSVLICIK 653 >UniRef50_C3KCU2 Putative phage tail fiber-related protein n=1 Tax=Pseudomonas fluorescens SBW25 RepID=C3KCU2_PSEFS Length = 658 Score = 125 bits (314), Expect = 6e-28, Method: Composition-based stats. 
Identities = 57/174 (32%), Positives = 76/174 (43%), Gaps = 36/174 (20%) Query: 3 LGEGSALPVGVPVPWPSATPPTGWLKCNGAAFSAEEYPELAKAYPT-----------NKL 51 + + SALPVG V +P P G+L+ +G+ SA YP+LAK T +L Sbjct: 174 IAQSSALPVGSMVAFPIDKVPVGFLEIDGSVKSATAYPDLAKFLGTAFNKGDEGAGNFRL 233 Query: 52 PDLRGEFIRGWDDGRGIDTGRSILSIQGYATEDHAHGLPSRSTIVTDATINFYFDEIWVN 111 P+ RGEF+RGWD GRG+D GR S Q + H H T+ N D I Sbjct: 234 PESRGEFLRGWDHGRGVDAGRLAGSYQTDQFKSHTHEY---DTMQGGGAANSVSDTIAAQ 290 Query: 112 SGTDIIKRGNTNDAGLPAPDYGTFKTYKQSVDGLGAAASETRPRNIAFNYIVRA 165 S T + GA SETRPRN+A + ++A Sbjct: 291 SNA----------------------TSQTGHITGGAGGSETRPRNLAVMWCIKA 322 Score = 110 bits (274), Expect = 2e-23, Method: Composition-based stats. Identities = 45/170 (26%), Positives = 68/170 (40%), Gaps = 17/170 (10%) Query: 7 SALPVGVPVPWPSATPPTGWLKCNGAAFSAEEYPELAKAYPT-----------NKLPDLR 55 SA+PVG +P+ A P G+L+ +G+ S YP+LA T +LP+ R Sbjct: 346 SAVPVGSIIPFLKAAVPPGYLELDGSVQSIATYPDLAAYLGTTFNTGSEPAGYFRLPESR 405 Query: 56 GEFIRGWDDGRGIDTGRSILSIQGYATEDHAHGLPSRSTIVTDATINFYFDEIWVNSGTD 115 GEF+RGWD GRG+D GR + S Q + +P+ T D Sbjct: 406 GEFLRGWDHGRGMDAGREVGSWQKGSMVAVDTNIPA-----TQTIATNLVDAAAARMRGG 460 Query: 116 IIKRGNTNDAGLPAPDYGTFKTYKQSVDGLGAAASETRPRNIAFNYIVRA 165 +G+ + TRP N+A + ++A Sbjct: 461 YDSGDVGLYSGITLMGVNPQANVALPGNIEVTYGI-TRPNNLAVMWCIKA 509 >UniRef50_C6C5D4 Tail Collar domain protein n=1 Tax=Dickeya dadantii Ech703 RepID=C6C5D4_DICDC Length = 557 Score = 124 bits (312), Expect = 8e-28, Method: Composition-based stats. 
Identities = 64/156 (41%), Positives = 81/156 (51%), Gaps = 22/156 (14%) Query: 11 VGVPVPWPSATPPTGWLKCNGAAFSAEEYPELAKAYPTNKLPDLRGEFIRGWDDGRGIDT 70 G+P+PWP AT PTGWLKCNG +F YP+L AYP+ LPDLRGEFIRGWDDGRG+D+ Sbjct: 424 AGIPLPWPQATAPTGWLKCNGQSFDKTLYPKLTAAYPSGTLPDLRGEFIRGWDDGRGVDS 483 Query: 71 GRSILSIQGYATEDHAHGLPSRSTIVTDATINFYFDEIWVNSGTDIIKRGNTNDAGLPAP 130 GR++LS+Q + ++ R + D+ Sbjct: 484 GRAVLSVQDAT----------------------WIQPNIESNTAATTIRIDNVDSTFNTD 521 Query: 131 DYGTFKTYKQSVDGLGAAASETRPRNIAFNYIVRAA 166 +Y A S RPRN+AFNYIVRAA Sbjct: 522 EYSAVSNLPSYEHNGSRARSYVRPRNVAFNYIVRAA 557 >UniRef50_Q4KHC6 Tail fibre protein, putative n=1 Tax=Pseudomonas fluorescens Pf-5 RepID=Q4KHC6_PSEF5 Length = 369 Score = 124 bits (312), Expect = 9e-28, Method: Composition-based stats. Identities = 59/174 (33%), Positives = 83/174 (47%), Gaps = 32/174 (18%) Query: 3 LGEGSALPVGVPVPWPSATPPTGWLKCNGAAFSAEEYPELAKAYPT-----------NKL 51 L SALPVG VP+P T P G+L+ +G+ SA YP+LA T +L Sbjct: 107 LKNMSALPVGAMVPFPKGTVPAGFLEVDGSVQSAATYPDLAAYLGTMFNTGGEGAGNFRL 166 Query: 52 PDLRGEFIRGWDDGRGIDTGRSILSIQGYATEDHAHGLPSRSTIVTDATINFYFDEIWVN 111 P+ RGEF+RGWD GRG+D GR++ S Q +A H H + + T + + W + Sbjct: 167 PESRGEFLRGWDHGRGVDVGRALGSYQAHAVGSHQHPMNYWAWRDGTGTGTHNYAKPWGD 226 Query: 112 SGTDIIKRGNTNDAGLPAPDYGTFKTYKQSVDGLGAAASETRPRNIAFNYIVRA 165 +G +K T G A SETRPRN+A + ++A Sbjct: 227 TGITGVKDPGT---------------------GANAGDSETRPRNLAVMWCIKA 259 >UniRef50_Q3KH70 Putative phage tail fiber-related protein n=1 Tax=Pseudomonas fluorescens Pf0-1 RepID=Q3KH70_PSEPF Length = 817 Score = 124 bits (311), Expect = 1e-27, Method: Composition-based stats. 
Identities = 51/174 (29%), Positives = 79/174 (45%), Gaps = 19/174 (10%) Query: 3 LGEGSALPVGVPVPWPSATPPTGWLKCNGAAFSAEEYPELAKAYP-----------TNKL 51 + + SALPVG V +P +PP G+L+ + + S+ YP+L+ +L Sbjct: 317 IAKASALPVGSIVAFPVDSPPPGFLELDNSVKSSATYPDLSAYLGGKFNKGDEGVGNFRL 376 Query: 52 PDLRGEFIRGWDDGRGIDTGRSILSIQGYATEDHAHGLPSRSTIVTDATINFYFDEIWVN 111 P+ RGEF+RGWD GRG+D GR+ S Q + + H H +P+ S N + + Sbjct: 377 PEARGEFLRGWDHGRGVDGGRAQGSSQTDSLKAHYHLIPTGSGGGQAVDPNGEIPTVVLK 436 Query: 112 SGTDIIKRGNTNDAGLPAPDYGTFKTYKQSVDGLGAAASETRPRNIAFNYIVRA 165 D + + AA+ETRPRNIA + ++A Sbjct: 437 DTAADWVLRTEGDNAELSIGRVRTYNF--------GAATETRPRNIAVMWCIKA 482 Score = 122 bits (305), Expect = 5e-27, Method: Composition-based stats. Identities = 53/175 (30%), Positives = 79/175 (45%), Gaps = 27/175 (15%) Query: 6 GSALPVGVPVPWPSATPPTGWLKCNGAAFSAEEYPELAKAYPT-----------NKLPDL 54 GSA+PVG + +P+ P G+L+ NG+ + YP+LA T +LP+ Sbjct: 505 GSAVPVGAVMAFPTGIVPPGFLELNGSVQNTSTYPDLAAYLGTTYNKGDEGAGNFRLPES 564 Query: 55 RGEFIRGWDDGRGIDTGRSILSIQGYATEDHAHGLPSRSTIVTDATINFYFDEIWVNSGT 114 RGEF+RGWD GRG+D GR I + QG + DH H + + +N Sbjct: 565 RGEFLRGWDHGRGVDAGRGIGTNQGQSMVDHYHTVLTADAGGV------------LNPIA 612 Query: 115 DIIKRGNTNDAGLPAPDYGTFKTYKQSVDGLGA----AASETRPRNIAFNYIVRA 165 + TN A + P + G +ETRPRN+A + ++A Sbjct: 613 GNLVGSFTNLAPISKPAGAGVLGATLTSSIHGPAAEKGGTETRPRNLAVMWCIKA 667 >UniRef50_A6E6G6 Phage Tail Collar n=1 Tax=Pedobacter sp. BAL39 RepID=A6E6G6_9SPHI Length = 731 Score = 124 bits (310), Expect = 1e-27, Method: Composition-based stats. 
Identities = 39/160 (24%), Positives = 57/160 (35%), Gaps = 7/160 (4%) Query: 9 LPVGVPVPWPSATPPTGWLKCNGAAFSAEEYPELAKAYP-TNKLPDLRGEFIRGWDDGRG 67 PVG V + P WL C+G YP+L + +LPDLRG F+ G Sbjct: 575 FPVGGIVAFYGEKVPDHWLLCDGKPVDHSLYPDLYRLLGGEKRLPDLRGRFLVGAGSKYS 634 Query: 68 ID--TGRSILSIQGYATEDHAHGLPSRSTIVTDATINFYFDEIWVNSGTDIIKRGNTNDA 125 + G LS+ H H + + + + + + D Sbjct: 635 LGDMGGVDELSLNVDQMPQHDHQIKAVKSYESPFKEVNMGWAREESLRGGVY----GTDR 690 Query: 126 GLPAPDYGTFKTYKQSVDGLGAAASETRPRNIAFNYIVRA 165 A Y ++ G A E RP +A NYI+RA Sbjct: 691 DNGADKYFVTRSNSPVKSEGGGKAHENRPPYLAVNYIIRA 730 >UniRef50_B5S308 Phage tail collar protein n=2 Tax=Ralstonia solanacearum RepID=B5S308_RALSO Length = 225 Score = 123 bits (309), Expect = 2e-27, Method: Composition-based stats. Identities = 53/163 (32%), Positives = 69/163 (42%), Gaps = 14/163 (8%) Query: 12 GVPVPWPSATPPTGWLKCNGAAFSAEEYPELAKAYPT----------NKLPDLRGEFIRG 61 G + TPP GWLKCNGAA S Y L K T LP+LR EF RG Sbjct: 66 GSVAMFACKTPPAGWLKCNGAAVSRTTYERLFKLIGTTFGAGDGAATFNLPELRAEFPRG 125 Query: 62 WDDGRGIDTGRSILSIQGYATEDHAHGLPSRSTIVTDATINFYFDEIWVNSGTDIIKRGN 121 WDDGRG+D+GR+ S Q A H H + + + D ++ G Sbjct: 126 WDDGRGVDSGRAFGSSQAQALSSHQHKTAVGFD---GSNLFGWGDGSATPIFGSEVQAGV 182 Query: 122 TNDAGLPAPDYGTFKTYKQSVDGLGAAASETRPRNIAFNYIVR 164 G G + V +G + ETRPRN+A ++ Sbjct: 183 LRVVGAVTQSGGAARIGYTDVTPMGVSG-ETRPRNVALLACIK 224 >UniRef50_C3X3G6 Putative uncharacterized protein n=1 Tax=Oxalobacter formigenes HOxBLS RepID=C3X3G6_OXAFO Length = 237 Score = 123 bits (308), Expect = 2e-27, Method: Composition-based stats. 
Identities = 50/169 (29%), Positives = 69/169 (40%), Gaps = 36/169 (21%) Query: 7 SALPVGVPVPWPSATPPTGWLKCNGAAFSAEEYPELAKAYPT-----------NKLPDLR 55 + +P G + S TPP GWL +G+ YP+L A T +LPDLR Sbjct: 93 NGVPPGSVLYLCSETPPDGWLVADGSMLLVAAYPDLFAAIGTAFGSGDNGMTTFRLPDLR 152 Query: 56 GEFIRGWDDGRGIDTGRSILSIQGYATEDHAHGLPSRSTIVTDATINFYFDEIWVNSGTD 115 GEFIR D GRG+D GR + S+QG +H HG + + + + Sbjct: 153 GEFIRCLDKGRGLDDGRPLGSVQGDEIRNHNHGFLDIPKVQFGSGVYSW----------- 201 Query: 116 IIKRGNTNDAGLPAPDYGTFKTYKQSVDGLGAAASETRPRNIAFNYIVR 164 + AP T+ SETRPRNIA ++ Sbjct: 202 ---TPQVMEVAEHAPIATTW-----------TGGSETRPRNIALTACIK 236 >UniRef50_B6VKW8 Sc/svq protein n=2 Tax=Photorhabdus asymbiotica subsp. asymbiotica ATCC 43949 RepID=B6VKW8_PHOAA Length = 316 Score = 122 bits (306), Expect = 4e-27, Method: Composition-based stats. Identities = 65/163 (39%), Positives = 88/163 (53%), Gaps = 21/163 (12%) Query: 3 LGEGSALPVGVPVPWPSATPPTGWLKCNGAAFSAEEYPELAKAYPTNKLPDLRGEFIRGW 62 + + S +PVG P+PWP PP G++ CNG+AF+ +YP+LA+AYP +LPDLRGEFIRGW Sbjct: 174 IKKTSEIPVGSPIPWPLPHPPFGYVTCNGSAFNRSQYPKLAEAYPNGRLPDLRGEFIRGW 233 Query: 63 DDGRGIDTGRSILSIQGYATEDHAHGLPSRSTIVTDATINFYFDEIWVNSGTDIIKRGNT 122 DDGRG D GR +LS Q + ++ Y +I +R Sbjct: 234 DDGRGADNGRKLLSWQE------------------GSALSEYLGSFTTGVAQNIHQR--- 272 Query: 123 NDAGLPAPDYGTFKTYKQSVDGLGAAASETRPRNIAFNYIVRA 165 + D+ +K + G G RPRNIAFNYIV+A Sbjct: 273 DGVTYHDKDHKRYKIPSLEIIGTGVDYFRFRPRNIAFNYIVKA 315 >UniRef50_B2FIY3 Putative phage collar protein n=1 Tax=Stenotrophomonas maltophilia K279a RepID=B2FIY3_STRMK Length = 410 Score = 121 bits (304), Expect = 8e-27, Method: Composition-based stats. 
Identities = 47/165 (28%), Positives = 71/165 (43%), Gaps = 18/165 (10%) Query: 9 LPVGVPVPWPSATPPTGWLKCNGAAFSAEEYPELAKAYPT---------NKLPDLRGEFI 59 LP G+ +P+ PP GWL+CNGA S Y +L T +LPDLRGEF+ Sbjct: 254 LPAGMVAHFPTGGPPPGWLRCNGADVSRTTYADLFAVIGTLFGSANDMTFRLPDLRGEFV 313 Query: 60 RGWDDGRGIDTGRSILSIQGYATEDHAHGLPSRSTIVTDATINFYFDEIWVNSGTDIIKR 119 RGWDDGRG+D GR++ S+Q + + + + + Sbjct: 314 RGWDDGRGVDGGRALGSLQA---------ATEVLSSWGASAGGLVSGQYQYSLADFGVHT 364 Query: 120 GNTNDAGLPAPDYGTFKTYKQSVDGLGAAASETRPRNIAFNYIVR 164 N + + + S++G G RPRN+A ++ Sbjct: 365 TNADSSRQVNNVGSGRLSRMDSINGGGLTLIGVRPRNVALLACIK 409 >UniRef50_A0NQ95 Putative tail fiber-related protein n=1 Tax=Labrenzia aggregata IAM 12614 RepID=A0NQ95_9RHOB Length = 329 Score = 121 bits (303), Expect = 1e-26, Method: Composition-based stats. Identities = 53/188 (28%), Positives = 72/188 (38%), Gaps = 29/188 (15%) Query: 6 GSALPVGVPVPWPSATPPTGWLKCNGAAFSAEEYPELAKAYPT----------NKLPDLR 55 + + G + +T P GWLK NGA S Y +L A T LPDLR Sbjct: 141 TNGVAPGCVAYYAMSTAPDGWLKANGAEISRTAYADLFAAIGTIFGVGDGNSTFNLPDLR 200 Query: 56 GEFIRGWDDGRGIDTGRSILSIQGYATEDHAHG--------LPSRSTIVTDATINFYFDE 107 GEF+RGWDD RG+D R + S Q H H + +T T + E Sbjct: 201 GEFLRGWDDARGVDGARVLGSSQSDQNASHTHTGSTSSDSHSHTGTTNTTGNHTHNMAYE 260 Query: 108 IWVNSGTDIIKRGNTNDAGLPAPDYGTFKTYKQS-----------VDGLGAAASETRPRN 156 N+GT + + P P + + V + SE RPRN Sbjct: 261 GGTNAGTGLAAPATSRSNTSPGPTVNYSGNHSHTFSTSSDSHSHSVTTDASGGSEARPRN 320 Query: 157 IAFNYIVR 164 IA ++ Sbjct: 321 IALLACIK 328 >UniRef50_A1VSH6 Phage Tail Collar domain protein n=1 Tax=Polaromonas naphthalenivorans CJ2 RepID=A1VSH6_POLNA Length = 483 Score = 121 bits (303), Expect = 1e-26, Method: Composition-based stats. 
Identities = 52/181 (28%), Positives = 66/181 (36%), Gaps = 27/181 (14%) Query: 11 VGVPVPWPSATPPTGWLKCNGAAFSAEEYPELAKAYPT----------NKLPDLRGEFIR 60 G +T P GWLK NGA S Y L A T LPDLRGEFIR Sbjct: 302 PGHINYTARSTAPPGWLKANGAGISRTAYAALFAAIGTTFGVGDGFNTFNLPDLRGEFIR 361 Query: 61 GWDDGRGIDTGRSILSIQGYATEDHAHGLPSRSTIVTDATINFYFDEIWVNSGTDIIKRG 120 GWDDGRG+D RS+ S Q T H H + + + +N V Sbjct: 362 GWDDGRGVDGSRSLGSSQAGETASHGHTGSTSAAGIHAHGVNDPGHSHQVTQEGGRNTSL 421 Query: 121 NTNDAGLPA-----------------PDYGTFKTYKQSVDGLGAAASETRPRNIAFNYIV 163 + A + +V SETRPRN+A ++ Sbjct: 422 AYQNGPNSAFRGEVSTLLETTRNATGIGISENGNHSHTVTISATGGSETRPRNLALLAVI 481 Query: 164 R 164 + Sbjct: 482 K 482 >UniRef50_Q7Y2B3 Gp12 Short tail fibers n=2 Tax=unclassified T4-like viruses RepID=Q7Y2B3_9CAUD Length = 466 Score = 120 bits (302), Expect = 1e-26, Method: Composition-based stats. Identities = 43/165 (26%), Positives = 62/165 (37%), Gaps = 20/165 (12%) Query: 8 ALPVGVPVPWPSATPPTGWLKCNGAAFSAEEYPELAKAYP--------TNKLPDLRGEFI 59 A+P+G + +L CNG + + +YP+L A LPD+RG Sbjct: 312 AMPIGGIILSGFNADRGDFLICNGRSLNKNQYPQLFSAIGYTFGGSGDNFNLPDMRGLVA 371 Query: 60 RGWDDGRGIDTGRSILSIQGYATEDHAHGLPSRSTIVTDATINFYFDEIWVNSGTDIIKR 119 RG D GR +D GR S Q A + P F + Sbjct: 372 RGCDHGRNLDPGRRFGSYQEDAMQRITGKFPVADRWRGWYGGAFTAQ----------RGQ 421 Query: 120 GNTNDAGLPAPDYGTFKTYKQSVDGLGAAASETRPRNIAFNYIVR 164 +TN D+GT A+ETR +++A NYI+R Sbjct: 422 WSTNYKNGGGDDWGTTVN--FDSGRSVRTANETRVKSLALNYIIR 464 >UniRef50_B4T041 Gp19 n=1 Tax=Salmonella enterica subsp. enterica serovar Newport str. SL254 RepID=B4T041_SALNS Length = 580 Score = 119 bits (299), Expect = 3e-26, Method: Composition-based stats. 
Identities = 54/190 (28%), Positives = 77/190 (40%), Gaps = 36/190 (18%) Query: 8 ALPVGVPVPWPSATPPTGWLKCNGAAFSAEEYPELAKAYPTNKLPDLRGEFIRGWDDGRG 67 + P GVP+PWPS T P G+ G AF YP LA AYP+ +PD+RG I+G Sbjct: 396 SCPPGVPLPWPSDTIPAGYALMQGQAFDKNVYPLLAIAYPSGTIPDMRGWTIKGKPV--- 452 Query: 68 IDTGRSILSIQGYATEDHAHGLPSRST-IVTDATINFYFDEIWVNSGTDIIKRGNTNDAG 126 +GR++LS + + H+HG + T + T T +F + N+ G Sbjct: 453 --SGRAVLSQELDGNKSHSHGARALDTDLGTKGTSSFDYGTKSSNTTGGHNHSAGGTYGG 510 Query: 127 --------------------LPAPDYGTF----------KTYKQSVDGLGAAASETRPRN 156 + T+ + V +ET +N Sbjct: 511 DSIGGKARVQRDGNDQLTSWNGDHAHTTWIGPHDHTVYIGPHGHVVIVDADGNAETTVKN 570 Query: 157 IAFNYIVRAA 166 IAFNYIVR A Sbjct: 571 IAFNYIVRLA 580 >UniRef50_D0KGE5 Tail Collar domain protein n=4 Tax=Pectobacterium RepID=D0KGE5_PECWW Length = 157 Score = 119 bits (299), Expect = 3e-26, Method: Composition-based stats. Identities = 45/165 (27%), Positives = 69/165 (41%), Gaps = 17/165 (10%) Query: 2 GLGEGSALPVGVPVPWPSATPPTGWLKCNGAAFSAEEYPELAKAYPTNKLPDLRGEFIRG 61 S++ G W + PP GWL+ NG F+ P LA YP++++PD RG F RG Sbjct: 8 AWKSSSSIQPGTITMWGTPVPPEGWLELNGQPFNPSGNPVLASLYPSSQVPDFRGYFPRG 67 Query: 62 WDDGRGIDT-GRSILSIQGYATEDHAHGLPSRSTIVTDATINFYFDEIWVNSGTDIIKRG 120 WD+G GID R+ILS+QG A + + + + + ++ Sbjct: 68 WDNGAGIDPDSRAILSVQGDAIRNITGEFNPGGSSNWGKGVFSSYGWPYPSNSGSANDAS 127 Query: 121 NTNDAGLPAPDYGTFKTYKQSVDGLGAAASETRPRNIAFNYIVRA 165 + A+E RP NIA +I++A Sbjct: 128 IIT----------------FDASRVVPTAAENRPTNIAVMFIIKA 156 >UniRef50_Q7N047 Similarities with bacteriophage protein n=3 Tax=Photorhabdus RepID=Q7N047_PHOLL Length = 602 Score = 117 bits (294), Expect = 9e-26, Method: Composition-based stats. 
Identities = 53/158 (33%), Positives = 68/158 (43%), Gaps = 15/158 (9%) Query: 8 ALPVGVPVPWPSATP-PTGWLKCNGAAFSAEEYPELAKAYPTNKLPDLRGEFIRGWDDGR 66 +P+G + W S P P G+ G AF A +YPELAK +P KLPD RG F RG D GR Sbjct: 457 GVPIGATIEWHSTAPIPAGYEPNEGRAFRAADYPELAKIFPDLKLPDDRGLFKRGLDRGR 516 Query: 67 GIDTGRSILSIQGYATEDHAHGLPSRSTIVTDATINFYFDEIWVNSGTDIIKRGNTNDAG 126 G+D+GRS+ S+QG A + L I S G Sbjct: 517 GLDSGRSLGSVQGDAIRNITGSL--------------GKPTIESGSNASGAFSYQYKAGG 562 Query: 127 LPAPDYGTFKTYKQSVDGLGAAASETRPRNIAFNYIVR 164 A G + + A+E RP N + YI R Sbjct: 563 RAAGAGGGVIAWTFDASRVVPTANENRPVNKSVIYITR 600 >UniRef50_A9IXL3 Phage-related protein n=6 Tax=Bartonella RepID=A9IXL3_BART1 Length = 334 Score = 117 bits (294), Expect = 1e-25, Method: Composition-based stats. Identities = 43/167 (25%), Positives = 64/167 (38%), Gaps = 12/167 (7%) Query: 8 ALPVGVPVPWPSATPPTGWLKCNGAAFSAEEYPELAKAYP----------TNKLPDLRGE 57 + G + S P+GWL C+G +S + Y L T +PDLRG Sbjct: 169 SFSPGFIGTFASEKIPSGWLLCDGKEYSRKNYANLFAVLGETWGKGDGKTTFNVPDLRGM 228 Query: 58 FIRGWDDGRGIDTGRSILSIQGYATEDHAHGLPSRSTIVTDATINFYFDEIWVNSGTDII 117 F+RG D G+ ID GR + S Q + + H H + ST + ++I D Sbjct: 229 FLRGLDSGKEIDKGRLLGSRQEESFKSHTHEGKTDSTGKHQHSYPTIKNDILRYKREDYK 288 Query: 118 KRGNTNDAGLPAPDYGTFKTYKQSVDGLGAAASETRPRNIAFNYIVR 164 + ++ V ETRP N+A Y V+ Sbjct: 289 GYVAVVYKTDTLTEP--AGEHEHKVLLQKTGGDETRPVNMAVVYAVK 333 >UniRef50_C7BSQ6 Putative tail fiber protein n=1 Tax=Photorhabdus asymbiotica subsp. asymbiotica ATCC 43949 RepID=C7BSQ6_PHOAA Length = 318 Score = 115 bits (289), Expect = 4e-25, Method: Composition-based stats. 
Identities = 49/91 (53%), Positives = 64/91 (70%) Query: 3 LGEGSALPVGVPVPWPSATPPTGWLKCNGAAFSAEEYPELAKAYPTNKLPDLRGEFIRGW 62 + + +PVGVP+PWP+A PPTGWL+CNGAAF ++P+L AY + LPDLRGEFIRGW Sbjct: 213 INTVNNIPVGVPIPWPTAIPPTGWLQCNGAAFDKSKFPQLVAAYSSGVLPDLRGEFIRGW 272 Query: 63 DDGRGIDTGRSILSIQGYATEDHAHGLPSRS 93 D RG+DT RSILS Q ++ + S + Sbjct: 273 DSSRGVDTNRSILSTQIDTMQNITGKVDSHN 303 >UniRef50_Q116W7 Phage Tail Collar n=1 Tax=Trichodesmium erythraeum IMS101 RepID=Q116W7_TRIEI Length = 671 Score = 115 bits (288), Expect = 5e-25, Method: Composition-based stats. Identities = 43/166 (25%), Positives = 62/166 (37%), Gaps = 25/166 (15%) Query: 6 GSALPVGVPVPWPSATPPTGWLKCNGAAFSAEEYPELAKAYPT-NKLPDLRGEFIRGWDD 64 G +PVG VP+ T P GWL CNG ++ E+Y EL K LPDL+G FI G D Sbjct: 523 GWVVPVGTIVPYAGLTAPEGWLLCNGQSYDWEQYSELYKVLDEIKVLPDLKGRFIIGVGD 582 Query: 65 GRGID------TGRSILSIQGYATEDHAHGLPSRSTIVTDATINFYFDEIWVNSGTDIIK 118 G G ++ H H + ++ + Sbjct: 583 KDGYSYSLNAKGGEEKHTLTKDEMPSHDHS---------------KGEYKFILKKDGKVT 627 Query: 119 RGNTNDAGLPAPDYGTFKTYKQSVDGLGAAASETRPRNIAFNYIVR 164 N + L P+ G+ + + E RP A NYI++ Sbjct: 628 TSNNVNNSLREPNLGSCEALQV---IGNNKPFENRPPYYALNYIIK 670 >UniRef50_D0KGE3 Tail Collar domain protein n=1 Tax=Pectobacterium wasabiae WPP163 RepID=D0KGE3_PECWW Length = 144 Score = 114 bits (284), Expect = 1e-24, Method: Composition-based stats. 
Identities = 47/152 (30%), Positives = 67/152 (44%), Gaps = 11/152 (7%) Query: 16 PWPSATPPTGWLKCNGAAFSAEEYPELAKAYPTNKLPDLRGEFIRGWDDGRGIDT--GRS 73 W + PP GWL+ NG F+ P LA YP++++PD RG F RGWD+G GID RS Sbjct: 1 MWGTPVPPEGWLELNGQLFNPSGNPVLADLYPSSRVPDFRGYFPRGWDNGAGIDPDSSRS 60 Query: 74 ILSIQGYATEDHAHGLPSRSTIVTDATINFYFDEIWVNSGTDIIKRGNTNDAGLPAPDYG 133 +LS Q H H + T+ + + + + + P+ Sbjct: 61 VLSYQDDEIISHKHAI----TMSHEHHGAADGAGFPQTDASGPMIKHAETEPDGSFPERS 116 Query: 134 TFKTYKQSVDGLGAAASETRPRNIAFNYIVRA 165 S SETRP NIA +I++A Sbjct: 117 GAGNPMFSF-----GGSETRPHNIAVMFIIKA 143 >UniRef50_P76072 Side tail fiber protein homolog from lambdoid prophage Rac n=23 Tax=root RepID=STFR_ECOLI Length = 1120 Score = 113 bits (283), Expect = 2e-24, Method: Composition-based stats. Identities = 48/221 (21%), Positives = 70/221 (31%), Gaps = 59/221 (26%) Query: 5 EGSALPVGVPVPWPSATPPTGWLKCNGAAFSAEEYPELAKAYPTNKLPDLRGEFIRGW-- 62 + PVG P+PWPS T P+G+ G AF YP+LA AYP+ +PD+RG I+G Sbjct: 900 PPESYPVGAPIPWPSDTVPSGYALMQGQAFDKSAYPKLAAAYPSGVIPDMRGWTIKGKPA 959 Query: 63 -------------------------DDGRGIDTGRSILSIQGYATEDHAHGLPSRSTIVT 97 D G + + T H H + + Sbjct: 960 SGRAVLSQEQDGIKSHTHSASASSTDLGTKTTSSFDYGTKSTNNTGAHTHSVSGSTNSAG 1019 Query: 98 DATINFYFDEIWVNSGTDIIKRG----------NTNDAGLPAPDYGTF------------ 135 T + + T+ AG Sbjct: 1020 AHTHSLANVNTASANSGAGSASTRLSVVHNQNYATSSAGAHTHSLSGTAASAGAHAHTVG 1079 Query: 136 ----------KTYKQSVDGLGAAASETRPRNIAFNYIVRAA 166 ++ ++ A +E +NIAFNYIVR A Sbjct: 1080 IGAHTHSVAIGSHGHTITVNAAGNAENTVKNIAFNYIVRLA 1120 >UniRef50_Q7P172 Probable bacteriophge tail fiber protein n=1 Tax=Chromobacterium violaceum RepID=Q7P172_CHRVO Length = 591 Score = 112 bits (281), Expect = 4e-24, Method: Composition-based stats. 
Identities = 52/163 (31%), Positives = 63/163 (38%), Gaps = 12/163 (7%) Query: 12 GVPVPWPSATPPTGWLKCNGAAFSAEEYPELAKAYPT----------NKLPDLRGEFIRG 61 G + + PP GWLK NGAA S ++YP L A T LPDLRGEF+RG Sbjct: 430 GQVAFFAMSAPPLGWLKANGAAVSRKDYPSLFAALGTYYGAGDGSTTFNLPDLRGEFVRG 489 Query: 62 WDDGRGIDTGRSILSIQGYATEDHAHGLPSRSTIVTDATINFYFDEIWVNSGTDIIKRGN 121 WDDGRG+D GR + Q L S + + K N Sbjct: 490 WDDGRGVDNGRGFGTWQKGTLTFSDPSLTSPCVASLVHRNDNTVIGYLDLGADPVDK--N 547 Query: 122 TNDAGLPAPDYGTFKTYKQSVDGLGAAASETRPRNIAFNYIVR 164 D GL G TRPRNIA ++ Sbjct: 548 KYDLGLSVSTANGVYLPDLDSGGWANGYGSTRPRNIALLACIK 590 >UniRef50_P03764 Side tail fiber protein n=2 Tax=root RepID=STF_LAMBD Length = 774 Score = 112 bits (280), Expect = 4e-24, Method: Composition-based stats. Identities = 46/146 (31%), Positives = 66/146 (45%), Gaps = 6/146 (4%) Query: 1 VGLGEGSALPVGVPVPWPSATPPTGWLKCNGAAFSAEEYPELAKAYPTNKLPDLRGEFIR 60 +G GE SA P G P+PWPS P+G++ G AF YP+LA AYP+ LPD+RG I+ Sbjct: 522 LGAGENSAFPAGAPIPWPSDIVPSGYVLMQGQAFDKSAYPKLAVAYPSGVLPDMRGWTIK 581 Query: 61 GWDDGRGIDTGRSILSIQGYATEDHAHGLPSRST-IVTDATINFYFDEIWVNSGTDIIKR 119 G +GR++LS + + H H + T + T T +F + S K Sbjct: 582 GKPA-----SGRAVLSQEQDGIKSHTHSASASGTDLGTKTTSSFDYGTKTTGSFDYGTKS 636 Query: 120 GNTNDAGLPAPDYGTFKTYKQSVDGL 145 N A + T + Sbjct: 637 TNNTGAHAHSLSGSTGAAGAHAHTSG 662 >UniRef50_A9ITX5 Phage-related protein n=6 Tax=Bartonella RepID=A9ITX5_BART1 Length = 333 Score = 111 bits (278), Expect = 7e-24, Method: Composition-based stats. 
Identities = 41/179 (22%), Positives = 65/179 (36%), Gaps = 19/179 (10%) Query: 5 EGSALPVGVPVPWPSATPPTGWLKCNGAAFSAEEYPELAKAYPT----------NKLPDL 54 E S P G + P WL C+G A+ +Y +L + T +PD Sbjct: 154 ESSLYPTGFIGTFGMRDVPKDWLICDGKAYLRRDYRDLFETIGTVWGEGDSVTTFNVPDF 213 Query: 55 RGEFIRGWDDGRGIDTGRSILSIQGYATEDHAHGLPSRSTIVTDATINFYFDEIWVNSGT 114 RG F+RG D G +D R S+Q + H H + S + NF+ G Sbjct: 214 RGMFLRGVDGGSNLDPNRRFASVQTDLIQSHQHEGQTLSMPHFTSNENFWDGNTTEVLGY 273 Query: 115 DIIKRGNTNDAGLPAPDYGTFKTYKQS---------VDGLGAAASETRPRNIAFNYIVR 164 + G A + K++ + V ETRP N++ + ++ Sbjct: 274 RLGLFGGGALANFMGIESENLKSHVATPYSFDENQEVILESTGEGETRPVNVSVLFAIK 332 >UniRef50_B5FQX9 Side tail fiber protein n=2 Tax=Salmonella enterica subsp. enterica RepID=B5FQX9_SALDC Length = 569 Score = 111 bits (278), Expect = 8e-24, Method: Composition-based stats. Identities = 47/193 (24%), Positives = 71/193 (36%), Gaps = 36/193 (18%) Query: 5 EGSALPVGVPVPWPSATPPTGWLKCNGAAFSAEEYPELAKAYPTNKLPDLRGEFIRGWDD 64 ++ PVG + WPS P G+ G +F YP LA AYP+ +PD+RG I+G Sbjct: 382 PPNSYPVGAAIAWPSDATPAGYALMQGQSFDKSAYPLLAIAYPSGIIPDMRGWTIKGKPI 441 Query: 65 GRGIDTGRSILSIQGYATEDHAHGLPSRSTIVT---------DATINFYFDEIWVNSGTD 115 +GR++LS + + H+H ++ T + G Sbjct: 442 -----SGRAVLSQEMDGNKSHSHSARAQDTDLGTKSTSSFDYGTKSTNTTGNHTHQFGGY 496 Query: 116 IIKRGNTNDAGLPAPDYGTF----------------------KTYKQSVDGLGAAASETR 153 I ++ P G + + V +ET Sbjct: 497 INSYWGDSNHTSFQPGGGAWTQAAGDHAHTVYIGGHEHTMYIGPHGHVVIVDADGNAETT 556 Query: 154 PRNIAFNYIVRAA 166 +NIAFNYIVR A Sbjct: 557 VKNIAFNYIVRLA 569 >UniRef50_B5JYG6 Phage Tail Collar Domain family (Fragment) n=1 Tax=gamma proteobacterium HTCC5015 RepID=B5JYG6_9GAMM Length = 400 Score = 108 bits (270), Expect = 5e-23, Method: Composition-based stats. 
Identities = 44/194 (22%), Positives = 61/194 (31%), Gaps = 44/194 (22%) Query: 10 PVGVPVPWPSATPPTGWLKCNGAAFSAEEYPELAKAYP----------TNKLPDLRGEFI 59 P G + TPP GWL C+G+ S +YP L A T LPDLR +F Sbjct: 212 PAGRTEDFAGTTPPGGWLFCDGSEVSRTQYPALFTAIGTLWGDGDGSTTFNLPDLRNDFR 271 Query: 60 RGWDDGRGIDTGRSILSIQGYATEDHAHGLPSRSTIVTDAT------INFYFDEIWVNSG 113 RG D RS+ + + H+H S + W S Sbjct: 272 RGCSDT------RSVGDSESDQIKSHSHSASSEDSGAHTHGGRSSDSGAHKHRSGWGESN 325 Query: 114 TDIIKRGNTNDAGLPAPDYGTFKT----------------------YKQSVDGLGAAASE 151 G T+ +G + + ++ E Sbjct: 326 RSDAPFGATSGSGHRGSGDSDWDNYLYYTDTAQPHFHWLIINQAGSHSHPINIEPTGGDE 385 Query: 152 TRPRNIAFNYIVRA 165 TRPRN I+RA Sbjct: 386 TRPRNKVLMPIIRA 399 >UniRef50_C5J9F2 Putative uncharacterized protein (Fragment) n=1 Tax=Erwinia phage phiAT1 RepID=C5J9F2_9VIRU Length = 240 Score = 108 bits (269), Expect = 7e-23, Method: Composition-based stats. Identities = 35/91 (38%), Positives = 50/91 (54%), Gaps = 3/91 (3%) Query: 5 EGSALPVGVPVPWPSATPPTGWLKCNGAAFSAEEYPELAKAYPTNKLPDLRGEFIRGWDD 64 E +P+G +PWP AT P GWL+C+G F+ + P+L N +PD RG F+RGW Sbjct: 149 EPRLVPIGAVIPWPGATVPDGWLECSGQVFNTGQNPKLYSVLGRNVVPDYRGLFLRGWAH 208 Query: 65 G---RGIDTGRSILSIQGYATEDHAHGLPSR 92 G D GR++ S+QG A + P+ Sbjct: 209 GSDANDPDAGRALGSVQGDAIRNITGYFPAD 239 >UniRef50_A4PE45 Tail fiber protein gpH n=3 Tax=root RepID=A4PE45_9CAUD Length = 554 Score = 107 bits (266), Expect = 2e-22, Method: Composition-based stats. 
Identities = 49/167 (29%), Positives = 69/167 (41%), Gaps = 26/167 (15%) Query: 11 VGVPVPWPSATPPTGWLKCNGAAFSAEEYPELAKAYPT----------NKLPDLRGEFIR 60 G+ + +T P+GWLK NGAA S Y L T LPDLRGEF+R Sbjct: 400 AGLIGYFARSTAPSGWLKANGAAVSRTTYAALYAEIGTTFGAGDGAATFNLPDLRGEFLR 459 Query: 61 GWDDGRGIDTGRSILSIQGYATEDHA---HGLPSRSTIVTDATINFYFDEIWVNSGTDII 117 GWDDGRG+D+GR I + Q + H T + D T + + G + Sbjct: 460 GWDDGRGVDSGRGIGTWQSGSPVVHDDVGGIASFNITALGDGTNVAWSNIADPWVGAFPL 519 Query: 118 KRGNTNDAGLPAPDYGTFKTYKQSVDGLGAAASETRPRNIAFNYIVR 164 +++ A + F RPRN+AF ++ Sbjct: 520 TMYDSSAATFVDANNKGFINMA-------------RPRNVAFLPCIK 553 >UniRef50_B7LKX7 Putative side tail phage protein n=2 Tax=Escherichia RepID=B7LKX7_ESCF3 Length = 567 Score = 107 bits (266), Expect = 2e-22, Method: Composition-based stats. Identities = 55/214 (25%), Positives = 80/214 (37%), Gaps = 58/214 (27%) Query: 6 GSALPVGVPVPWPSATPPTGWLKCNGAAFSAEEYPELAKAYPTNKLPDLRGEFIRGWDDG 65 + PVG+P+PWPS + P+G+ G F+ YP+LA AYP+ +PD+RG I+G Sbjct: 359 AESCPVGMPIPWPSDSVPSGYALMTGQTFNKTSYPKLAIAYPSGVIPDMRGWIIKGKP-- 416 Query: 66 RGIDTGRSILSIQGYATEDHAHGLPSRSTI-----------VTDATINFYFDEIWVNSGT 114 +GR+ILS + + H H ST T T +F ++ Sbjct: 417 ---SSGRAILSTELDGVKSHNHTGSISSTNLGTITSTSTDLGTKTTASFNHGSRNTSTSG 473 Query: 115 DIIKRGNTNDA----------------------------------GLPAPD--------Y 132 + R T+ A G A Sbjct: 474 EHTHRIPTDGAEGKDGPSLWNSPNSDENYREPTESAGSHYHSITIGAHAHTIALGSHTHN 533 Query: 133 GTFKTYKQSVDGLGAAASETRPRNIAFNYIVRAA 166 T+ S+ +E +NIAFNYIVR A Sbjct: 534 IVLGTHNHSIIINNTGNTENTVKNIAFNYIVRLA 567 >UniRef50_D0IJ09 Tail fiber protein H putative n=1 Tax=Vibrio sp. RC586 RepID=D0IJ09_9VIBR Length = 368 Score = 106 bits (265), Expect = 2e-22, Method: Composition-based stats. 
Identities = 50/170 (29%), Positives = 63/170 (37%), Gaps = 20/170 (11%) Query: 9 LPVGVPVPWPSATPPTGWLKCNGAAFSAEEYPELAKAYPTNKLPDLRGEFIRGWDDGRGI 68 PVGVP+PWPS P G+ G AF PELAK YP L DLRG + G +G Sbjct: 203 CPVGVPLPWPSDIAPEGFAIHKGQAFDKVANPELAKLYPDGILKDLRGMAVVGKKEGE-- 260 Query: 69 DTGRSILSIQGYATEDH-----AHGLPSRSTIVTDATINFYFDEIWVNSGTDIIKRGNTN 123 ILS + + H + T+ T N S +T Sbjct: 261 ----IILSYEADQVKQHGYPNSTVSSTDLGSRNTNTTGNHAHGYPAGTSNGPNGPYLDTA 316 Query: 124 DAGLPAPDYGTFKTYKQSVDGLGA---------AASETRPRNIAFNYIVR 164 A T + SV A+E +NI FN+IVR Sbjct: 317 HASYGYRYTTTEGNHYHSVAIGSHAHSIAIALFGATENTIKNIKFNWIVR 366 >UniRef50_A4TT73 Phage tail protein n=30 Tax=Enterobacteriaceae RepID=A4TT73_YERPP Length = 962 Score = 105 bits (262), Expect = 6e-22, Method: Composition-based stats. Identities = 42/147 (28%), Positives = 64/147 (43%), Gaps = 7/147 (4%) Query: 3 LGEGSALPVGVPVPWPSATPPTGWLKCNGAAFSAEEYPELAKAYPTNKLPDLRGEFIRGW 62 L + PVG P+PWP+ P+G+ G F YP+LA AYP+ LPD+RG I+G Sbjct: 715 LPPPESYPVGAPIPWPNDVAPSGFAIMQGQTFDKSVYPKLAAAYPSGVLPDMRGWMIKGK 774 Query: 63 DDGRGIDTGRSILSIQGYATEDHAHGLPSRST-IVTDATINFYFDEIWVNSGTDIIKRGN 121 T R++LS++ + HAH + ST + T T F + + K N Sbjct: 775 P------TSRAVLSLEQDGIKSHAHNAAASSTDLGTKPTTTFDYGTKTSSGFDYGTKSSN 828 Query: 122 TNDAGLPAPDYGTFKTYKQSVDGLGAA 148 + A + T + + Sbjct: 829 STGAHAHSLSGSTSSSGAHAHTVTAHT 855 >UniRef50_C4MYW8 Gp12 Short tail fibers n=1 Tax=Enterobacteria phage JSE RepID=C4MYW8_9CAUD Length = 467 Score = 105 bits (261), Expect = 7e-22, Method: Composition-based stats. 
Identities = 36/166 (21%), Positives = 57/166 (34%), Gaps = 22/166 (13%) Query: 9 LPVGVPVPWPSA-TPPTGWLKCNGAAFSAEEYPELAKAYP---------TNKLPDLRGEF 58 +P+G + + CNG + +YP L LPD+RG Sbjct: 312 MPIGGIILTAFNSFDHAQFKICNGQWLNKHQYPVLFSRIGFTYGGDGGDNFALPDMRGLV 371 Query: 59 IRGWDDGRGIDTGRSILSIQGYATEDHAHGLPSRSTIVTDATINFYFDEIWVNSGTDIIK 118 RG D GRG+D GR + Q + P + F Sbjct: 372 ARGCDHGRGLDPGRGFGTYQDDTMQHMTGNFPVANRWRGWTGGVFAITGG---------- 421 Query: 119 RGNTNDAGLPAPDYGTFKTYKQSVDGLGAAASETRPRNIAFNYIVR 164 + +TN D+G+ + ETR +++A NY++R Sbjct: 422 QWSTNYKNGGGDDWGSIVN--FDSARQVRTSGETRVKSLALNYMIR 465 >UniRef50_D0ZBI4 Putative tail fiber protein n=1 Tax=Edwardsiella tarda EIB202 RepID=D0ZBI4_EDWTE Length = 718 Score = 104 bits (258), Expect = 2e-21, Method: Composition-based stats. Identities = 47/168 (27%), Positives = 70/168 (41%), Gaps = 22/168 (13%) Query: 8 ALP-VGVPVPWPSATPP-TGWLKC-------NGAAFSAEEYPELAKAYPTNKLP-DLRGE 57 P +G +PW P W C G +F E +P+L YP N+LP D+RG Sbjct: 562 GCPLIGSLIPWALERMPQEIWPNCGMHFIPYMGQSFDPELFPKLHDVYPDNRLPTDMRGY 621 Query: 58 FIRGWDDGRGIDTGRSILSIQGYATEDHAHGLPSRSTIVTDATINFYFDEIWVNSGTDII 117 RGWD+GRGID GR++LS Q A ++ + + V+ + Sbjct: 622 TARGWDNGRGIDIGRALLSYQDDAIQNITGQFGWMP---FNGSSPVASGAFSVDKIGANV 678 Query: 118 KRGNTNDAGLPAPDYGTFKTYKQSVDGLGAAASETRPRNIAFNYIVRA 165 G T + + A +TR +++A+NYI RA Sbjct: 679 WGGGTERRDC---------AIGFNASNVVRTAEQTRVKSVAWNYITRA 717 >UniRef50_B3XIH6 GpH n=7 Tax=Enterobacteriaceae RepID=B3XIH6_ECOLX Length = 710 Score = 103 bits (257), Expect = 2e-21, Method: Composition-based stats. 
Identities = 49/224 (21%), Positives = 68/224 (30%), Gaps = 59/224 (26%) Query: 2 GLGEGSALPVGVPVPWPSATPPTGWLKCNGAAFSAEEYPELAKAYPTNKLPDLRGEFIRG 61 + PVG +PWPS + PTG+ G F YP LA AYP+ LPD+RG I+G Sbjct: 487 AWTPQDSFPVGAAIPWPSDSVPTGYAVMQGQTFDKTTYPLLAAAYPSGVLPDMRGWTIKG 546 Query: 62 W---------------------------DDGRGIDTGRSILSIQGYATEDHAHGLPSRST 94 D G + + T H H + + Sbjct: 547 KPASGRDVLSLEQDGIKSHTHSASASNTDLGTKTTSSFDYGTKSTNNTGAHTHNVSGTAN 606 Query: 95 IVTDATINF------------------------YFDEIWVNSGTDIIKRGNTNDAGLPAP 130 T + + G AG A Sbjct: 607 SAGAHTHTVPLRRPNSGGMNFDWLDGASSGTVVGNGTVPSSGAHTHSVSGTATSAGAHAH 666 Query: 131 DYG--------TFKTYKQSVDGLGAAASETRPRNIAFNYIVRAA 166 G ++ ++ A +E +NIAFNYIVR A Sbjct: 667 TVGIGAHTHSVAIGSHGHTITVNAAGNAENTVKNIAFNYIVRLA 710 >UniRef50_B3I2W7 Phage Tail Collar Domain family n=1 Tax=Escherichia coli E22 RepID=B3I2W7_ECOLX Length = 654 Score = 103 bits (256), Expect = 3e-21, Method: Composition-based stats. Identities = 51/219 (23%), Positives = 70/219 (31%), Gaps = 63/219 (28%) Query: 5 EGSALPVGVPVPWPSATPPTGWLKCNGAAFSAEEYPELAKAYPTNKLPDLRGEFIRGWDD 64 + PVG P+PWPS PTG+ G F YP LA AYP +PD+RG+ I+G Sbjct: 442 PEDSYPVGAPIPWPSDVTPTGYALMQGQPFDKAVYPLLAIAYPAGIIPDMRGQTIKGKP- 500 Query: 65 GRGIDTGRSILSIQGYATEDHAHGLPSRST-----------IVTDATINFYFDEIWVNSG 113 GR++LS + H HG T + T +F + G Sbjct: 501 -----NGRAVLSYEQDGVISHTHGASISDTDLGTKYTSSFDYGSKPTTSFDYGNKSSTEG 555 Query: 114 T-----------------------------DIIKRGNTNDAGLPAPD--------YGTFK 136 +G A + Sbjct: 556 GWHAHNFRYCATSAYRDTPGQGLGMHSSNVSWAAGDRIEGSGNHAHVTWIGPHDHWVGIG 615 Query: 137 TYKQ---------SVDGLGAAASETRPRNIAFNYIVRAA 166 + + A +E +NIAFNYIVR A Sbjct: 616 AHNHYVVMGYHGHTATVHAAGNAENTVKNIAFNYIVRLA 654 >UniRef50_D2V5I7 Microcystin-dependent protein n=1 Tax=Naegleria gruberi RepID=D2V5I7_NAEGR Length = 191 Score = 102 bits (255), Expect = 3e-21, Method: Composition-based stats. 
Identities = 35/175 (20%), Positives = 56/175 (32%), Gaps = 19/175 (10%) Query: 9 LPVGVPVPWPSATPPTGWLKCNGAAFSAE--EYPELAKAYP----------TNKLPDLRG 56 +PVG+ + T P GWL C+GA + +Y L + + +PDLRG Sbjct: 16 IPVGIVNAFAGTTIPAGWLLCDGATYPNSHPDYIRLFQTIGNAYGSTGGPHSFNVPDLRG 75 Query: 57 EFIRGWDDGRGIDT-------GRSILSIQGYATEDHAHGLPSRSTIVTDATINFYFDEIW 109 + G G G+ G +Q H+H + I I Sbjct: 76 RAVVGIGHGAGLSNRTLAQKVGEESHQLQISELPSHSHSGTTGKANKQPYIIVHQSGPIS 135 Query: 110 VNSGTDIIKRGNTNDAGLPAPDYGTFKTYKQSVDGLGAAASETRPRNIAFNYIVR 164 T G + + G +A E ++ NYI++ Sbjct: 136 DVFHTPGWCGGPATHKDDDNFTGANHTHNFTTNEVGGNSAHENMQPSLVLNYIIK 190 >UniRef50_B8HZW5 Tail Collar domain protein n=2 Tax=Clostridium RepID=B8HZW5_CLOCE Length = 368 Score = 102 bits (255), Expect = 4e-21, Method: Composition-based stats. Identities = 41/180 (22%), Positives = 65/180 (36%), Gaps = 24/180 (13%) Query: 9 LPVGVPVPWPSATPP-----TGWLKCNGAAFSAEEYPELAKAYPT---------NKLPDL 54 PVG+ +P+ +GW+ C+G +Y EL T +PDL Sbjct: 5 FPVGMVIPFAGPLKEDQLKSSGWVPCDGRVLDKTQYSELFDVIGTKYGGDGIPNFNIPDL 64 Query: 55 RGEFIRGWDDGRGIDTG--RSILSIQGYATEDHAHGLPSRST--------IVTDATINFY 104 RG F+R D GRG D R S G A D+ + +T N Sbjct: 65 RGRFVRATDHGRGYDPDAQRRKASKSGGAAGDNTGSVQEYATAKPKNNFITNDKGNHNHL 124 Query: 105 FDEIWVNSGTDIIKRGNTNDAGLPAPDYGTFKTYKQSVDGLGAAASETRPRNIAFNYIVR 164 D + + + A P + + + S + SE+RP N+ +I++ Sbjct: 125 VDHLPTDYWNAACAITSNEGANFPGRTATSGEAGQHSHTIVSGGDSESRPVNLYMYWIIK 184 Score = 80.5 bits (197), Expect = 2e-14, Method: Composition-based stats. 
Identities = 43/181 (23%), Positives = 72/181 (39%), Gaps = 21/181 (11%) Query: 3 LGEGSALPVGVPVPWPSATPPT-------GWLKCNGAAFSAEEYPELAKAYPT------- 48 E LP G V + + GWL C G+++ A +YP+L + Sbjct: 190 YDESILLPAGSIVSFAGDSVKKSNELIANGWLPCIGSSYEANKYPDLYENISNIYGGDQN 249 Query: 49 -NKLPDLRGEFIRGWDDGRGIDTGRSILSIQGYATEDHAHGLP--SRSTIVTDATINFYF 105 +PDLRG FIRG + G + + TED++ LP T+ TD Sbjct: 250 KFNVPDLRGLFIRGVNSNTSETPG-VHGATRVGQTEDYSTALPKTLNFTLSTDGAHTHSA 308 Query: 106 DEIWVNSGTDIIKRGNTNDAGLPAPDYGTFKTYKQSVDGLGAAASETRPRNIAFNYIVRA 165 ++ + + G+ G + ++ G +ETRP NI +YI+++ Sbjct: 309 PKLPQDKYIENYCAGHEVANFPSNQYTGNNGNHAHTIAG---GDAETRPVNIYLDYIIKS 365 Query: 166 A 166 + Sbjct: 366 S 366 >UniRef50_Q7P176 Probable bacteriophge tail fiber protein n=1 Tax=Chromobacterium violaceum RepID=Q7P176_CHRVO Length = 435 Score = 102 bits (254), Expect = 5e-21, Method: Composition-based stats. Identities = 46/170 (27%), Positives = 64/170 (37%), Gaps = 26/170 (15%) Query: 6 GSALPVGVPVPWPSATPPTGWLKCNGAAFSAEEYPELAKAYP----------TNKLPDLR 55 +A P G+ + P GWL +G + ++YP L A T LP+L Sbjct: 280 NAAAPAGMVAYFAMKDAPAGWLIADGRTVARKDYPALFAAIGGLYGNGDGSTTFGLPNLC 339 Query: 56 GEFIRGWDDGRGIDTGRSILSIQ-GYATEDHAHGLPSRSTIVTDATINFYFDEIWVNSGT 114 GEFIRGWD+GRG+DTGR+I S Q GL + I + + Sbjct: 340 GEFIRGWDNGRGVDTGRAIGSSQISTQLLVDNDGLQTVGAIDWSSNNLSALGYEPAQANA 399 Query: 115 DIIKRGNTNDAGLPAPDYGTFKTYKQSVDGLGAAASETRPRNIAFNYIVR 164 + N+ PA RPRNIA ++ Sbjct: 400 ANLHFINSTTISNPADSSFIRSI---------------RPRNIALLACIK 434 >UniRef50_Q094A8 Phage Tail Collar Domain family n=1 Tax=Stigmatella aurantiaca DW4/3-1 RepID=Q094A8_STIAU Length = 645 Score = 101 bits (252), Expect = 7e-21, Method: Composition-based stats. 
Identities = 29/177 (16%), Positives = 50/177 (28%), Gaps = 26/177 (14%) Query: 6 GSALPVGVPVPWPSATPPTGWLKCNGAAFSAEEYPELAKAYPT------------NKLPD 53 G +PVG + + ++ P GWL C+G+ S Y +L +LP Sbjct: 476 GWLVPVGTIIAYGGSSAPEGWLLCDGSTKSKTAYADLFAVIGDTYKGSSAPPSGQFRLPS 535 Query: 54 LRGEFIRGWDDGR------GIDTGRSILSIQGYATEDHAHGLPSRSTIVTDATINFYFDE 107 L G G G ++ H H + D + Sbjct: 536 LMARVPMGASVSSPHNYPLGTMGGEFTHTLTISEMPVHDHYVN-------DPGHSHSITT 588 Query: 108 IWVNSGTDIIKRGNTNDAGLPAPDYGTFKTYKQSVDGLGAAASETRPRNIAFNYIVR 164 D+ + + + P +G G A N+I++ Sbjct: 589 TNAEGSGDLRPNRDASKGHVDIPTNHVTTGVTLDTNGGG-QAHNNMQPYTTVNFIIK 644 >UniRef50_B7UGJ3 Predicted tai fiber protein n=15 Tax=Escherichia coli RepID=B7UGJ3_ECO27 Length = 221 Score = 101 bits (251), Expect = 9e-21, Method: Composition-based stats. Identities = 67/174 (38%), Positives = 87/174 (50%), Gaps = 32/174 (18%) Query: 1 VGLGEGSALPVGVPVPWPSATPPTG---------WLKCNGAAFSAEEYPELAKAYPTNKL 51 +GLGEG A +GVP WPSA P +LK NGA FSA +YP LAK +P+ L Sbjct: 70 LGLGEG-APAIGVPFFWPSAAMPDTVIESWSGMVFLKFNGAKFSATDYPVLAKVFPSLVL 128 Query: 52 PDLRGEFIRGWDDGRGIDTGRSILSIQGYATEDHAHGLPSRSTIVTDATINFYFDEIWVN 111 P+ RG+FIR WDDGRG D+GR++LS Q +++ + Sbjct: 129 PEARGDFIRIWDDGRGADSGRALLSWQAA------------------TSLSQFGGNYPEG 170 Query: 112 SGTDIIKRGNTNDAGLPAPDYGTFKTYKQSVDGLGAAASETRPRNIAFNYIVRA 165 SG I + P + F+ SV G G RPRNIAFN++VRA Sbjct: 171 SGHAIAD---YDGISAHEPGFSRFQYTSNSV-GDGVNFVAVRPRNIAFNFLVRA 220 >UniRef50_UPI00019136B5 bacteriophage tail fiber protein n=7 Tax=Salmonella enterica subsp. enterica serovar Typhi RepID=UPI00019136B5 Length = 137 Score = 100 bits (248), Expect = 2e-20, Method: Composition-based stats. 
Identities = 52/136 (38%), Positives = 67/136 (49%), Gaps = 13/136 (9%) Query: 39 YPELAKAYPTNKLPDLRGEFIRGWDDGRGIDTGRSILSIQGYATEDHAH----------- 87 YP LAKAYPTNKLPDLRGEFIRGWDDGRG+D GR++L +Q + E H H Sbjct: 2 YPNLAKAYPTNKLPDLRGEFIRGWDDGRGVDAGRALLRLQDDSFEAHRHESFFYAGISRN 61 Query: 88 --GLPSRSTIVTDATINFYFDEIWVNSGTDIIKRGNTNDAGLPAPDYGTFKTYKQSVDGL 145 L + + T++ + + + +D K + Sbjct: 62 EIPLKNLPSSDEMLTLSSTTNALSPDGIDATNSLIGNDDYNCLIEGNKNNKRTATGLSTS 121 Query: 146 GAAASETRPRNIAFNY 161 A+ETRPRNIAFNY Sbjct: 122 IVGATETRPRNIAFNY 137 >UniRef50_B8FJJ3 Tail Collar domain protein n=1 Tax=Desulfatibacillum alkenivorans AK-01 RepID=B8FJJ3_DESAA Length = 264 Score = 99.8 bits (247), Expect = 3e-20, Method: Composition-based stats. Identities = 45/165 (27%), Positives = 65/165 (39%), Gaps = 27/165 (16%) Query: 10 PVGVPVPWPSATPPTGWLKCNGAAFSAEEYPELAKAY----------PTNKLPDLRGEFI 59 P G V + A+ P+GWL+C+GAA S Y L T LPDLRG F+ Sbjct: 116 PTGSVVAFMGASAPSGWLECSGAAVSRTTYDNLFSVISTMYGVGDGSTTFNLPDLRGYFL 175 Query: 60 RGWDDGRGIDTGRSILSIQGYATEDHAHGLPSRSTIVTDATINFYFDEIWVNSGTDIIKR 119 RGW G G D + +G T G + T D + + G + Sbjct: 176 RGWSHGSGKDPDAGSRTDRGDGTCGDYVGTRQEDEFAS-HTHYDDEDLLTFDGGGPV--- 231 Query: 120 GNTNDAGLPAPDYGTFKTYKQSVDGLGAAASETRPRNIAFNYIVR 164 G+ + +V +ETRP+N+A YI++ Sbjct: 232 -------------GSNSSGMSAVLPGSVGGAETRPKNVAVMYIIK 263 >UniRef50_Q99362 Protein 37 (Fragment) n=1 Tax=Enterobacteria phage T4 RepID=Q99362_BPT4 Length = 382 Score = 98.6 bits (244), Expect = 7e-20, Method: Composition-based stats. 
Identities = 43/143 (30%), Positives = 65/143 (45%), Gaps = 7/143 (4%) Query: 7 SALPVGVPVPWPSATPPTGWLKCNGAAFSAEEYPELAKAYPTNKLPDLRGEFIRGWDDGR 66 S+ P+G P+PWP+ TPP G+ G F YP+LA AYP+ +PD+RG+ I+G Sbjct: 130 SSYPIGAPIPWPTDTPPNGYALMEGQTFDTRAYPKLAAAYPSGTIPDMRGQTIKGKP--- 186 Query: 67 GIDTGRSILSIQGYATEDHAHGLPSRST-IVTDATINFYFDEIWVNSGTDIIKRGNTNDA 125 +GR++LS + + H HG + +T + T T +F + +S K NT Sbjct: 187 ---SGRAVLSTEADGVKSHTHGASASNTDLGTKTTSSFDYGTKTTSSFDYGTKTSNTTGN 243 Query: 126 GLPAPDYGTFKTYKQSVDGLGAA 148 T G Sbjct: 244 HNHTVSGTTSSAGAHQHARSGPQ 266 Score = 45.5 bits (106), Expect = 6e-04, Method: Composition-based stats. Identities = 19/109 (17%), Positives = 31/109 (28%), Gaps = 2/109 (1%) Query: 58 FIRGWDDGRGIDTGRSILSIQGYATEDHAHGLPSRSTIVTDATINFYFDEIWVNSGTDII 117 F G+ D + ++ G + T I Sbjct: 276 FPDGYSDVGTNYNSKFSGTVIGSSVPCIIGKTS-NDGAHTHTWSGTTSTTGNHAHTVGIG 334 Query: 118 KRGNTNDAGLPAPDYGTFKTYKQSVDGLGAAASETRPRNIAFNYIVRAA 166 +T G ++ ++ +E +NIAFNYIVR A Sbjct: 335 AHTHTVGIGAHTHTV-AIGSHGHTITVNATGNTENTVKNIAFNYIVRLA 382 >UniRef50_UPI0001A44C27 bacteriophage tail fiber protein n=2 Tax=Pectobacterium carotovorum subsp. carotovorum WPP14 RepID=UPI0001A44C27 Length = 195 Score = 98.3 bits (243), Expect = 8e-20, Method: Composition-based stats. Identities = 51/121 (42%), Positives = 67/121 (55%), Gaps = 4/121 (3%) Query: 1 VGLGEGSALPVGVPVPWPSATPPTGWLKCNGAAFSAEEYPELAKAYPTNKLPDLRGEFIR 60 +G +G+ L VG+P P P T P GWL C G +F YP LA YP +LPDLRGEFIR Sbjct: 79 IGAIQGNEL-VGIPQPCPLVTAPEGWLACAGQSFDTSRYPVLASRYPQGRLPDLRGEFIR 137 Query: 61 GWDDGRGIDTGRSILSIQGYATEDHAHGLPSRSTIVTDATINFYFDEIWVNSGTDIIKRG 120 GWD+GRG+DTGR LS Q ++TE H H T+ + Y + + + + Sbjct: 138 GWDNGRGVDTGRGNLSSQSFSTEPHTH---DGGTLGLGSGAPIYTGKGLQDGAATLYSQT 194 Query: 121 N 121 Sbjct: 195 G 195 >UniRef50_Q0BEK5 Phage Tail Collar domain protein n=1 Tax=Burkholderia ambifaria AMMD RepID=Q0BEK5_BURCM Length = 735 Score = 98.3 bits (243), Expect = 9e-20, Method: Composition-based stats. 
Identities = 52/218 (23%), Positives = 73/218 (33%), Gaps = 61/218 (27%) Query: 9 LPVGVPVPWPSATPPTGWLKCNGAAFSAEEYPEL-------------------------- 42 + +G V P G+LK NG +YP L Sbjct: 517 VQIGQIVWEARTAPRAGFLKLNGTELKRADYPLLWAYAQGSGALVADADWGKGRHGCFSS 576 Query: 43 AKAYPTNKLPDLRGEFIRGWDDGRGIDTGRSILSIQ------------GYATEDHAHGLP 90 T +LPDLRGEFIR WDD RG D R I S Q A DH+HG Sbjct: 577 GDGNTTFRLPDLRGEFIRCWDDARGTDAQRQIGSWQDSLNRLHAHGASAAAVGDHSHGAW 636 Query: 91 SRSTIVTDATINFYFDEIWV-----------------------NSGTDIIKRGNTNDAGL 127 + S +IN + + +G+ N + A Sbjct: 637 TDSQGWHGHSINDPGHDHGIPVASGGGYIGEINLNGGGRGDKRTTGSGTGISINGDGAHG 696 Query: 128 PAPDYGTFKTYKQSVDGLGAAASETRPRNIAFNYIVRA 165 G + ++ +E+RPRN+A ++RA Sbjct: 697 HNVGIGGAGAHSHTISIGADGGNESRPRNVALLVMIRA 734 >UniRef50_A4NHY2 Probable tail fiber protein n=1 Tax=Haemophilus influenzae PittAA RepID=A4NHY2_HAEIN Length = 556 Score = 97.9 bits (242), Expect = 1e-19, Method: Composition-based stats. Identities = 33/177 (18%), Positives = 58/177 (32%), Gaps = 22/177 (12%) Query: 3 LGEGSALP------VGVPVPWPSATPPTGWLKCN--GAAFSAEEYPELAKAY-----PTN 49 LG + LP VG+ + P GW+ + + + YPEL + N Sbjct: 216 LGNSNQLPDLTRSDVGMTAYFAVDNIPAGWIAFDEIATQVTEQRYPELYRHLIDKYGSIN 275 Query: 50 KLPDLRGEFIRGWDDGRGIDTGRSILSIQGYATEDHAHGLPSRSTIVTDATINFYFDEIW 109 +P + F+R G S+ IQ + H H +P D + + + Sbjct: 276 SVPKVADRFLR------NAGNGLSVGQIQEDDLKRHVHRVPIDYDSWFDDSSQGRNNSYF 329 Query: 110 -VNSGTDIIKRGNTNDAGLPAPDYGTFKTYKQSVDGLGAAASETRPRNIAFNYIVRA 165 + +T D G + ETRP+++ ++A Sbjct: 330 DYTTFAQSSDLWSTLGYDNADGDNGFVSPK--DTSQMATGGDETRPKSLVLKLCIKA 384 >UniRef50_A1TNG3 Phage Tail Collar domain protein n=9 Tax=Bacteria RepID=A1TNG3_ACIAC Length = 176 Score = 97.5 bits (241), Expect = 1e-19, Method: Composition-based stats. 
Identities = 32/168 (19%), Positives = 49/168 (29%), Gaps = 25/168 (14%) Query: 12 GVPVPWPSATPPTGWLKCNGAAFSAEEYPELAKAYP---------TNKLPDLRGEFIRGW 62 G + PP GW C G + L T LPDLRG G Sbjct: 8 GEISMFAGNFPPKGWAFCQGQILPIAQNSALFALLGTTYGGNGQTTFALPDLRGRVPLGQ 67 Query: 63 DDGRGIDT-------GRSILSIQGYATEDHAHGLPSRSTIVTDATINFYFDEIWVNSGTD 115 G G+ G+ +++QG H H T +++ +G Sbjct: 68 GQGPGLQPYSQGQVGGQETVTLQGNQMPMHTHT--------TSVSVSSNAGNSAAPNGRY 119 Query: 116 IIKRGNTNDAGLPAPDYGTFKTYKQSVDGLGAAASETRPRNIAFNYIV 163 + ND G+ G + E + N+I+ Sbjct: 120 LAASDQRNDQYTDQSGNGSLAGVTTGFAG-NSLPHENMQPYLCINFII 166 >UniRef50_C2I7P2 Phage-related tail fiber protein n=1 Tax=Vibrio cholerae TM 11079-80 RepID=C2I7P2_VIBCH Length = 406 Score = 97.5 bits (241), Expect = 1e-19, Method: Composition-based stats. Identities = 41/154 (26%), Positives = 55/154 (35%), Gaps = 34/154 (22%) Query: 11 VGVPVPWPSATPPTGWLKCNGAAFSAEEYPELAKAYPTNKLPD------LRGEFIRGWDD 64 VG+P W + P + Y LA+ YP D +RGEF+R D Sbjct: 274 VGMPFYWLDTSAPEWAVMEINVNLPIAVYWRLARRYPQLVRDDYINTGEIRGEFLRVLDQ 333 Query: 65 GRGIDTGRSILSIQGYATEDHAHGLPSRSTIVTDATINFYFDEIWVNSGTDIIKRGNTND 124 GRG+D GRSI S Q E H H + +I + Sbjct: 334 GRGVDAGRSIQSYQDDELERHTHTFSAPFSITANTGSTGII------------------I 375 Query: 125 AGLPAPDYGTFKTYKQSVDGLGAAASETRPRNIA 158 + P++ T +ETRPRNIA Sbjct: 376 SASHVPNWNTTY----------TGGNETRPRNIA 399 >UniRef50_Q56BI6 Gp12 short tail fibers n=1 Tax=Enterobacteria phage RB43 RepID=Q56BI6_9CAUD Length = 463 Score = 97.1 bits (240), Expect = 2e-19, Method: Composition-based stats. 
Identities = 42/166 (25%), Positives = 62/166 (37%), Gaps = 24/166 (14%) Query: 7 SALPVGVPVPWPSATPPTGWLKCNGAAFSAEEYPELAKAYPT--------NKLPDLRGEF 58 ++LP+G + + NG EYPEL LPD+RG Sbjct: 312 ASLPIGCMMMAAFNSDYGNLCIANGRGMYTYEYPELFALIGYTYGGSGNIFNLPDMRGVV 371 Query: 59 IRGWDDGRGIDTGRSILSIQGYATEDHAHGLPSRSTIVTDATINFYFDEIWVNSGTDIIK 118 RG+D GRG+D GR + Q + + H H L + ++ Sbjct: 372 ARGFDAGRGLDPGRGFGTYQHHEVQSHEHPLQMIYQSG---------GNLPSWQCVYELR 422 Query: 119 RGNTNDAGLPAPDYGTFKTYKQSVDGLGAAASETRPRNIAFNYIVR 164 ND L PD K + +ETR +N+A NY++R Sbjct: 423 TAEKNDQQLYWPDPSLSKA-------MAVGGNETRMKNLAINYVIR 461 >UniRef50_B3QRT1 Tail Collar domain protein n=1 Tax=Chloroherpeton thalassium ATCC 35110 RepID=B3QRT1_CHLT3 Length = 176 Score = 96.7 bits (239), Expect = 2e-19, Method: Composition-based stats. Identities = 30/170 (17%), Positives = 48/170 (28%), Gaps = 28/170 (16%) Query: 11 VGVPVPWPSATPPTGWLKCNGAAFSAEEYPELAKAYP---------TNKLPDLRGEFIRG 61 +G + P GW +CNG E L T +PDLRG + G Sbjct: 7 IGEIRLFGFGWAPDGWAQCNGQLLLINENQALYSLLGTMYGGDARSTFGVPDLRGRAVIG 66 Query: 62 WDDGRGID--------TGRSILSIQGYATEDHAHGLPSRSTIVTDATINFYFDEIWVNSG 113 + + G +++ H H L + T + Sbjct: 67 YGQSPKLSYSYQMSQWGGEETVTLGVAQIPAHNHTLIADGATGTLLNPQNNY-------- 118 Query: 114 TDIIKRGNTNDAGLPAPDYGTFKTYKQSVDGLGAAASETRPRNIAFNYIV 163 + G A + D + G+ E R +A NY + Sbjct: 119 ---LAEGAFPGAAFYSADKSVAMNQGTIGNTGGSQPHENRSPYLALNYCI 165 >UniRef50_Q3YZL1 Phage protein-related n=42 Tax=Enterobacteriaceae RepID=Q3YZL1_SHISS Length = 1029 Score = 96.7 bits (239), Expect = 2e-19, Method: Composition-based stats. 
Identities = 50/237 (21%), Positives = 69/237 (29%), Gaps = 75/237 (31%) Query: 5 EGSALPVGVPVPWPSATPPTGWLKCNGAAFSAEEYPELAKAYPTNKLPDLRGEFIRGW-- 62 PVG P+PWPS T P+G+ G F YP+LA AYP+ +PD+RG I+G Sbjct: 793 PAEFYPVGAPIPWPSDTVPSGYALMQGQTFDKSAYPKLAAAYPSGVIPDMRGWTIKGKPA 852 Query: 63 -------------------------DDGRGIDTGRSILSIQGYATEDHAHGLPSRSTIVT 97 D G + + T H H L ++ Sbjct: 853 SGRAVLSQEQDGIKSHTHSASASSTDLGTKTTSSFDYGTKSTNNTGAHTHSLSGSTSSAG 912 Query: 98 DATINFYFDEIWVNS----------------------------------------GTDII 117 + S Sbjct: 913 AHQHSQTGPRTNSGSQPTGMFPAGSTQVSGTNQVGISGSLTSGTSQWVGKSSSEGNHTHS 972 Query: 118 KRGNTNDAGLPAPDYG--------TFKTYKQSVDGLGAAASETRPRNIAFNYIVRAA 166 G AG A G ++ ++ A +E +NIAFNYIVR A Sbjct: 973 LSGTAASAGAHAHTVGIGAHTHSVAIGSHGHTITVNAAGNAENTVKNIAFNYIVRLA 1029 >UniRef50_B3HKW0 Phage Tail Collar Domain protein n=11 Tax=Enterobacteriaceae RepID=B3HKW0_ECOLX Length = 164 Score = 95.9 bits (237), Expect = 4e-19, Method: Composition-based stats. Identities = 63/174 (36%), Positives = 76/174 (43%), Gaps = 32/174 (18%) Query: 1 VGLGEGSALPVGVPVPWPSATPPTG---------WLKCNGAAFSAEEYPELAKAYPTNKL 51 VGLGEG A +GVP WPSA P +LK NGA FSA +YP LAK +P+ L Sbjct: 13 VGLGEG-APAIGVPFFWPSAAMPNTVIDSWSGMVFLKFNGAKFSATDYPVLAKVFPSLVL 71 Query: 52 PDLRGEFIRGWDDGRGIDTGRSILSIQGYATEDHAHGLPSRSTIVTDATINFYFDEIWVN 111 P+ RG+FIR WDDGRG D GR +LS Q INF+ Sbjct: 72 PEARGDFIRIWDDGRGADGGRELLSWQEATNFS---QFAGNIGGGAGHAINFHDGIAGNQ 128 Query: 112 SGTDIIKRGNTNDAGLPAPDYGTFKTYKQSVDGLGAAASETRPRNIAFNYIVRA 165 G + + RPRNIAFN++VRA Sbjct: 129 PGFSRFNFTSNSVGDGVNFVA-------------------VRPRNIAFNFLVRA 163 >UniRef50_Q8Y365 Putative uncharacterized protein n=4 Tax=Ralstonia solanacearum RepID=Q8Y365_RALSO Length = 182 Score = 95.6 bits (236), Expect = 5e-19, Method: Composition-based stats. 
Identities = 33/167 (19%), Positives = 51/167 (30%), Gaps = 15/167 (8%) Query: 12 GVPVPWPSATPPTGWLKCNGAAFSAEEYPELAKAYP---------TNKLPDLRGEFIRGW 62 G + PP GW CNG + L T LPDLRG Sbjct: 6 GEIRLCAFSYPPKGWAACNGTLLPIAQNTALFSLLGTQYGGDGVRTFALPDLRGRTPLHR 65 Query: 63 DDGR---GIDTGRSILSIQGYATEDHAHGLPSRSTIVTDATINFYFDEIWVNSGTDIIKR 119 D G G +++ H+H + S+ T + + + S Sbjct: 66 DYVNSVVGSVGGAETVTLVSSQLPVHSHLFNASSSPATSTNVGATQNHVLAASNLYSSTD 125 Query: 120 GNTNDAG---LPAPDYGTFKTYKQSVDGLGAAASETRPRNIAFNYIV 163 + +G AP + + GA E ++ NYI+ Sbjct: 126 PTISGSGTALYAAPGPLAALSGEACGSTGGAQPHENMQPSLVLNYII 172 >UniRef50_B5PP06 Side tail fiber protein n=2 Tax=Salmonella enterica subsp. enterica RepID=B5PP06_SALHA Length = 534 Score = 95.6 bits (236), Expect = 6e-19, Method: Composition-based stats. Identities = 38/147 (25%), Positives = 59/147 (40%), Gaps = 6/147 (4%) Query: 5 EGSALPVGVPVPWPSATPPTGWLKCNGAAFSAEEYPELAKAYPTNKLPDLRGEFIRGWDD 64 + PVG P+ WPS P G+ G +F YP LA AYP+ +PD+RG I+G Sbjct: 238 PPDSHPVGAPIAWPSDATPAGYALMQGQSFDKSAYPLLAIAYPSGVIPDMRGWTIKGKPA 297 Query: 65 GRGIDTGRSILSIQGYATEDHAHGLPSRST-IVTDATINFYFDEIWVNSGTDIIKRGNTN 123 +GR+ILS + + H+H ++ T + T T +F + N+ + + Sbjct: 298 -----SGRAILSQEMDGNKSHSHSARAQDTDLGTKTTSSFDYGTKSTNTTGNHTNQFGGY 352 Query: 124 DAGLPAPDYGTFKTYKQSVDGLGAAAS 150 T A Sbjct: 353 INSYWGDSNHTSFQPGGGAWTQAAGDH 379 >UniRef50_B7L485 Putative tail fiber protein n=4 Tax=Escherichia coli RepID=B7L485_ECO55 Length = 1056 Score = 95.2 bits (235), Expect = 7e-19, Method: Composition-based stats. 
Identities = 50/238 (21%), Positives = 72/238 (30%), Gaps = 76/238 (31%) Query: 5 EGSALPVGVPVPWPSATPPTGWLKCNGAAFSAEEYPELAKAYPTNKLPDLRGEFIRGW-- 62 + PVG P+PWPS T P+G+ G F+ YP+LA AYP+ +PD+RG I+G Sbjct: 819 PPESYPVGAPIPWPSDTVPSGYALMQGQTFNKSAYPKLAAAYPSGVIPDMRGWTIKGKPA 878 Query: 63 -------------------------DDGRGIDTGRSILSIQGYATEDHAHGLPSRSTIVT 97 D G + + T H H L + Sbjct: 879 SGRAVLSQEQDGIKSHTHSASASSTDLGTKTTSSFDYGTKSTNNTGAHTHSLSGSTGSAG 938 Query: 98 DATINF---------------------------------------YFDEIWVNSGTDIIK 118 T Y+ I S Sbjct: 939 VHTHGNGIRWPGGGGSALAFYDGGGFTYVQNSQYQVSPGTSSYRSYYQRIQTQSAGAHTH 998 Query: 119 RGNTNDAGLPAPDYG----------TFKTYKQSVDGLGAAASETRPRNIAFNYIVRAA 166 + A A + ++ ++ A +E +NIAFNYIVR A Sbjct: 999 SLSGTAASSGAHAHTVGIGAHTHSVAIGSHGHTITVNAAGNAENTVKNIAFNYIVRLA 1056 >UniRef50_C3X8I7 Putative uncharacterized protein n=1 Tax=Oxalobacter formigenes OXCC13 RepID=C3X8I7_OXAFO Length = 369 Score = 94.0 bits (232), Expect = 2e-18, Method: Composition-based stats. Identities = 40/171 (23%), Positives = 55/171 (32%), Gaps = 36/171 (21%) Query: 8 ALPVGVPVPWPSATPPTGWLKCNGAAFSAEEYPELAKAYP----------TNKLPDLRGE 57 +PVG + ++TPP G+LK +G+ E YPEL A T LPDL G Sbjct: 105 GIPVGSIDYFATSTPPAGYLKADGSEVGRETYPELFTAIGTVFGEGNGDSTFNLPDLMGR 164 Query: 58 FIRGWDDGRGIDTGRSILSIQGYATEDHAHG---LPSRSTIVTDATINFYFDEIWVNSGT 114 F +G + DH H I SGT Sbjct: 165 FAQG---------STIVGQRIKAGLPDHKHIEGFAGVNPNSSYGVATTAPQGNINTQSGT 215 Query: 115 DIIKRGNTNDAGLPAPDYGTFKTYKQSVDGLGAAASETRPRNIAFNYIVRA 165 + T+ A L P YG T +P + ++A Sbjct: 216 SVSNHPYTSPASLSNPIYGASDT--------------VQPPALTLLPCIKA 252 >UniRef50_Q31Q92 Putative uncharacterized protein n=2 Tax=Synechococcus elongatus RepID=Q31Q92_SYNE7 Length = 387 Score = 94.0 bits (232), Expect = 2e-18, Method: Composition-based stats. 
Identities = 52/183 (28%), Positives = 71/183 (38%), Gaps = 57/183 (31%) Query: 8 ALPVGVPVPWPSATPPTGWLKCNGAAFSAEEYPEL------------------------- 42 A+P GV + TPPTG++K NGA S Y L Sbjct: 235 AVPAGVAIWVTGNTPPTGYIKANGALLSRTTYARLWAYAQASGNIVSDAAWTGGATGSYS 294 Query: 43 -AKAYPTNKLPDLRGEFIRGWDDGRGIDTGRSILSIQGYATEDHAHGLPSRSTIVTDATI 101 T ++PDLRGEFIRGW DGR +DTGR+I S Q + HAH L +R+ T Sbjct: 295 TGDGSTTFRVPDLRGEFIRGWADGRSVDTGRAIGSTQADELKAHAHYLDTRTAPTGGGTA 354 Query: 102 NFYFDEIWVNSGTDIIKRGNTNDAGLPAPDYGTFKTYKQSVDGLGAAASETRPRNIAFNY 161 + + + + +ETRPRNIA+ Sbjct: 355 ATTYTTGTAVTTSSV-------------------------------GGTETRPRNIAYLA 383 Query: 162 IVR 164 ++ Sbjct: 384 CIK 386 >UniRef50_B9M3Z7 Tail Collar domain protein n=1 Tax=Geobacter sp. FRC-32 RepID=B9M3Z7_GEOSF Length = 173 Score = 93.6 bits (231), Expect = 2e-18, Method: Composition-based stats. Identities = 30/165 (18%), Positives = 48/165 (29%), Gaps = 19/165 (11%) Query: 11 VGVPVPWPSATPPTGWLKCNGAAFSAEEYPELAKAYPT---------NKLPDLRGEFIRG 61 +G + P W C+G+ +Y L T KLPD RG Sbjct: 6 IGEIRMFGGNFAPVDWALCDGSTLQISQYDVLYAVIGTYFGGDGITNFKLPDFRGRIPVH 65 Query: 62 WDDGRGIDT---GRSILSIQGYATEDHAHGLPSRSTIVTDATINFYFDEIWVNSGTDIIK 118 G+G+ G + + Q H ++ +AT NS + Sbjct: 66 MGTGQGLTPRGIGNAFGTEQETLQVAHIPAHNHVVSVGANATTAAPAGNYLGNSSNFSLY 125 Query: 119 RGNTNDAGLPAPDYGTFKTYKQSVDGLGAAASETRPRNIAFNYIV 163 D+ L G F A ++ N+I+ Sbjct: 126 STAAADSLLNQDTVGFFPA-------APAQPHSNMMPSLCVNFII 163 >UniRef50_Q727X4 Tail fiber protein, putative n=4 Tax=Desulfovibrio vulgaris RepID=Q727X4_DESVH Length = 296 Score = 93.6 bits (231), Expect = 2e-18, Method: Composition-based stats. 
Identities = 40/145 (27%), Positives = 55/145 (37%), Gaps = 22/145 (15%) Query: 19 SATPPTGWLKCNGAAFSAEEYPELAKAYPTNKLPDLRGEFIRGWDDGRGIDTGRSILSIQ 78 +AT P W +CN + + +L D RGEF RGWD GRG+D GR + S Q Sbjct: 172 NATAPA-WYRCNASGVRDATGDHI-------RLQDRRGEFARGWDHGRGVDAGRVLGSAQ 223 Query: 79 GYATEDHAHGLPSRSTIVTDATINFYFDEIWVNSGTDIIKRGNTNDAGLPAPDYGTFKTY 138 G A + + S + +V + N G + Sbjct: 224 GDAIRNIVGSMGSITAVVAGTASGAFTVTTPSNRSAGSST--------------GPTCDF 269 Query: 139 KQSVDGLGAAASETRPRNIAFNYIV 163 + ASE R RNIA Y+V Sbjct: 270 TFDASRVVPTASENRTRNIATLYLV 294 >UniRef50_UPI00016A4B89 phage-related tail fiber protein n=2 Tax=Burkholderia thailandensis RepID=UPI00016A4B89 Length = 654 Score = 93.6 bits (231), Expect = 2e-18, Method: Composition-based stats. Identities = 51/243 (20%), Positives = 70/243 (28%), Gaps = 80/243 (32%) Query: 3 LGEGSALPVGVPVPWPSATPPTGWLKCNGAAFSAEEYPEL-------------------- 42 GE ++ VG V G+LKCNGA +YP L Sbjct: 411 AGELASAMVGQIVFEMRTAARAGYLKCNGALVKRADYPALWAYAQGSGALVAEKDWMSGN 470 Query: 43 ------AKAYPTNKLPDLRGEFIRGWDDGRGIDTGRSILSIQGYATEDHAHGLPSRSTIV 96 T ++P+LRGEF+R WDDGRG D R I + Q H H + Sbjct: 471 FGCFSDGDGSATFRIPELRGEFLRCWDDGRGSDADRKIGTWQDSMNRTHGHAAGADGVGD 530 Query: 97 TDAT----------------------------------------------------INFY 104 Sbjct: 531 HGHNAWTDNQGWHGHHGWTGTNGNHNHNNDIFSRLLRPPYNGSLTGSDTAGSGSEQAVGG 590 Query: 105 FDEIWVNSGTDIIKRGNTNDAGLPAPDYG--TFKTYKQSVDGLGAAASETRPRNIAFNYI 162 D + D NT AG A + G + ++ +E RPRN+A + Sbjct: 591 GDSADIRWAGDHNHEFNTEGAGTHAHNVGVAASGAHSHAIHVAADGGNEARPRNLAVLAM 650 Query: 163 VRA 165 +RA Sbjct: 651 IRA 653 >UniRef50_B7LN99 Putative tail fiber protein n=2 Tax=Enterobacteriaceae RepID=B7LN99_ESCF3 Length = 593 Score = 93.6 bits (231), Expect = 2e-18, Method: Composition-based stats. 
Identities = 48/210 (22%), Positives = 68/210 (32%), Gaps = 45/210 (21%) Query: 2 GLGEGSALPVGVPVPWPSATPPTGWLKCNGAAFSAEEYPELAKAYPTNKLPDLRGEFIRG 61 G +A PVG P+ WPS P G+ G F YP LA AYP+ +PD+RG I+G Sbjct: 384 GFEPVNAFPVGAPIAWPSDIVPEGYAIMQGQTFDKAAYPLLAAAYPSGVIPDMRGWTIKG 443 Query: 62 -------------------------------------WDDGRGIDTGRSILSIQGYATED 84 +D G + + + T Sbjct: 444 KPASGRAVLSQEQDGIKSHTHSASASSTDLGTKTTSSFDYGTKTVSTFNHGTKTTNNTGA 503 Query: 85 HAHGLPSRSTIVTDATINFYFDEIWVNSGTDIIKRGNTNDAGLPAPDYG--------TFK 136 H H + R + + +T D G G Sbjct: 504 HTHTVGGRYGGDSIGGKQRVQVSGTNQVSSSDGAHAHTVDIGQHNHTVGIGAHAHTVALG 563 Query: 137 TYKQSVDGLGAAASETRPRNIAFNYIVRAA 166 + ++ A +E +NIAFNYIVR A Sbjct: 564 AHGHTITVNAAGNAENTVKNIAFNYIVRLA 593 >UniRef50_Q3ZL14 Tail fiber protein n=2 Tax=Enterobacteriaceae RepID=Q3ZL14_ESCBL Length = 289 Score = 93.6 bits (231), Expect = 2e-18, Method: Composition-based stats. Identities = 49/212 (23%), Positives = 72/212 (33%), Gaps = 59/212 (27%) Query: 9 LPVGVPVPWPSATPPTGWLKCNGAAFSAEEYPELAKAYPTNKLPDLRGEFIRGWDDGRGI 68 LP G+ + WP AT PTG+ G F YP LA+AYP+ +PD+RG+ I+ Sbjct: 83 LPPGIALAWPGATAPTGFALMLGQTFDTTAYPRLAQAYPSGVIPDMRGQTIKFLPA---- 138 Query: 69 DTGRSILSIQGYATEDHAHGLPSRST----------------IVTDATINFYFDEIWVNS 112 +GR++LS++ + H+H +T D N D + Sbjct: 139 -SGRTLLSLEADGVKSHSHSGSISTTDLGTATAADTDLGTKQTSQDGLHNHVSDSRFNKL 197 Query: 113 GTDIIKRGNTNDAGLPAPDYGTF------------------------------------- 135 TN+ G D Sbjct: 198 MARSSDIDGTNNTGDVDSDNPESEHRVSGMNDSLWAASVIADSGLHMHTVYIGPHAHSVY 257 Query: 136 -KTYKQSVDGLGAAASETRPRNIAFNYIVRAA 166 + +V +E +NIAFN IVR A Sbjct: 258 IGPHGHTVTISNFGNTENTVKNIAFNAIVRLA 289 >UniRef50_D2MH12 Tail Collar domain protein n=1 Tax=Rhodopseudomonas palustris DX-1 RepID=D2MH12_RHOPA Length = 346 Score = 93.6 bits (231), Expect = 2e-18, Method: Composition-based stats. 
Identities = 29/193 (15%), Positives = 58/193 (30%), Gaps = 39/193 (20%) Query: 8 ALPVGVPVPWPSATPPTGWLKCNGAAFSAEEYPELAKAY--------PTNKLPDLRGEFI 59 ++P G + + TPP GWL C+G S + +L A +P+L F Sbjct: 156 SIPPGFILDFAGPTPPEGWLTCDGQLVSTVTFADLFAAIGYTWGGSGGQFAVPNLVKRFR 215 Query: 60 RGWDDGRGIDTGRSILSIQGYATEDHAHGLPSRSTIVTDATINFYFDEIWVNS------- 112 R DG + ++Q H+H + D ++ + + ++ Sbjct: 216 RHRGDGTVAGG---VGTLQTNQIGLHSHSASMDAQGHHDHYLDLWSSGMNRSNPHSHPAS 272 Query: 113 ---------------------GTDIIKRGNTNDAGLPAPDYGTFKTYKQSVDGLGAAASE 151 I + N + + ++ +E Sbjct: 273 GSGIGVSGGFDTGVYAPQGPLNGVSIGATDINHEHRVTGNTAGNGGHIHNITVAANGGNE 332 Query: 152 TRPRNIAFNYIVR 164 TRP + ++ Sbjct: 333 TRPDSATVMACIK 345 >UniRef50_A6EAC0 Microcystin-dependent protein n=1 Tax=Pedobacter sp. BAL39 RepID=A6EAC0_9SPHI Length = 185 Score = 92.5 bits (228), Expect = 4e-18, Method: Composition-based stats. Identities = 36/171 (21%), Positives = 47/171 (27%), Gaps = 18/171 (10%) Query: 11 VGVPVPWPSATPPTGWLKCNGAAFSAEEYPELAKAYPT---------NKLPDLRGEFIRG 61 +G P+ P GWL CNGA + +Y L T K+P+L+GE I G Sbjct: 5 IGEVRPFAFDWIPDGWLACNGATYPLAQYQALYSVIGTVYGGTLGQNFKVPNLQGEAIIG 64 Query: 62 WDDGRGIDT-------GRSILSIQGYATEDHAHGLPSRSTIVTDATINFYFDEIWVNSGT 114 G G + +H H T N G Sbjct: 65 AGQGPTTSAYTLAQTGGTEKAGLTVNQIPNHDHVFNGAIGATGFRTNTAGNTSYLTNFGY 124 Query: 115 DIIKRGNTNDAGLPAPDYG--TFKTYKQSVDGLGAAASETRPRNIAFNYIV 163 A P T G A E R +A Y + Sbjct: 125 GGAGATTFTSASGYVPPGTPDTLLNPSSVTQTGGGGAHENRQPYLAVTYAI 175 >UniRef50_A9AVC5 Tail Collar domain protein n=1 Tax=Herpetosiphon aurantiacus ATCC 23779 RepID=A9AVC5_HERA2 Length = 934 Score = 92.1 bits (227), Expect = 6e-18, Method: Composition-based stats. 
Identities = 38/170 (22%), Positives = 54/170 (31%), Gaps = 26/170 (15%) Query: 5 EGSALPVGVPVPW--PSATPPTGWLKCNGAAFSAEEYPELAKAYPTNKLPDLRGEFIRGW 62 GS++P G W P GWL CNG N PDLR F+ G Sbjct: 780 NGSSIPSGTINMWSGADNALPGGWLLCNGQ----------------NGTPDLRNRFVVGA 823 Query: 63 DDGR--GIDTGRSILSIQGYATEDHAHGLPSRSTIVTDATINFYFDEIWVNSGTDIIKRG 120 G G +++ H H + + + T+ F G D+ K Sbjct: 824 GAAYPVGTTGGADSVTLAVNQMPSHNHAASTSNDGQHNHTLYFDTGGGGNGPGGDMAKTN 883 Query: 121 NT---NDAGLPAPDYGTFKTYKQSVDGLGAA---ASETRPRNIAFNYIVR 164 + N + + SV A E RP A YI++ Sbjct: 884 DGLQKNVIANFSVKTDKDGNHSHSVTIQNNGGNQAHENRPPFYALCYIMK 933 >UniRef50_P10930 Short tail fiber protein n=8 Tax=Myoviridae RepID=VG12_BPT4 Length = 527 Score = 92.1 bits (227), Expect = 6e-18, Method: Composition-based stats. Identities = 39/184 (21%), Positives = 61/184 (33%), Gaps = 28/184 (15%) Query: 9 LPVGVPVPWPSATPPTG-WLKCNGAAFSAEEYPELAKAYPT--------NKLPDLRGEFI 59 +PVG + W + + P+ W C+G SA + P A T LPD+RG F+ Sbjct: 341 IPVGAIMMWAADSLPSDAWRFCHGGTVSASDCPLYASRIGTRYGGNPSNPGLPDMRGLFV 400 Query: 60 RGWDDGRGIDT-------------------GRSILSIQGYATEDHAHGLPSRSTIVTDAT 100 RG G + G + +Q H H A Sbjct: 401 RGSGRGSHLTNPNVNGNDQFGKPRLGVGCTGGYVGEVQIQQMSYHKHAGGFGEHDDLGAF 460 Query: 101 INFYFDEIWVNSGTDIIKRGNTNDAGLPAPDYGTFKTYKQSVDGLGAAASETRPRNIAFN 160 N + D + + K +++ +ETRP NI+ N Sbjct: 461 GNTRRSNFVGTRKGLDWDNRSYFTNDGYEIDPESQRNSKYTLNRPELIGNETRPWNISLN 520 Query: 161 YIVR 164 YI++ Sbjct: 521 YIIK 524 >UniRef50_Q2W7B2 Microcystin-dependent protein n=1 Tax=Magnetospirillum magneticum AMB-1 RepID=Q2W7B2_MAGSA Length = 192 Score = 91.3 bits (225), Expect = 1e-17, Method: Composition-based stats. 
Identities = 28/178 (15%), Positives = 46/178 (25%), Gaps = 26/178 (14%) Query: 12 GVPVPWPSATPPTGWLKCNGAAFSAEEYPELAKAYP---------TNKLPDLRGEFIRGW 62 G + + + P W C+G S +YP L T LPDLR G+ Sbjct: 6 GQIILFSGSYAPVNWAVCDGHQLSVSQYPALFSLLGTQFGGNGTTTFGLPDLRSRLAMGF 65 Query: 63 DDGR----------------GIDTGRSILSIQGYATEDHAHGLPSRSTIVTDATINFYFD 106 G + G +++ H H L + V + Sbjct: 66 GTGHVDPKASNSAPLTPYGFATNGGVETVTLTQAQIPPHTHTLNASGDPVVSPNPSGGVP 125 Query: 107 EIWVNSGTDIIKRGNTNDAGLPAPDYGTFKTYKQSVDGLGAAASETRPRNIAFNYIVR 164 + + + G + E R + YI+R Sbjct: 126 ASFTDGTHVAYFDTPNPIPSGMTITPKQLGASMVTTAG-ASQPHENRMPYLGLMYIIR 182 >UniRef50_C3X1Y2 Tail fiber protein gpH n=1 Tax=Oxalobacter formigenes HOxBLS RepID=C3X1Y2_OXAFO Length = 480 Score = 90.9 bits (224), Expect = 1e-17, Method: Composition-based stats. Identities = 31/175 (17%), Positives = 58/175 (33%), Gaps = 26/175 (14%) Query: 1 VGLGEGSALPVGVPVPWPSATPPTGWLKCNGAAFSAEEYPELAKAYP----------TNK 50 +G S +P+G + ATPP G+LK +GAA YP+L A T Sbjct: 181 MGWKYPSGVPIGTVEYFAMATPPAGYLKADGAAVGRATYPDLFAAIGTTFGAGDGETTFN 240 Query: 51 LPDLRGEFIRGWDDGRGIDTGRSILSIQGYATEDHAHGLPSRSTIVTDATINFYFDEIWV 110 LPD+ G F G + +++ + + +++ F ++ Sbjct: 241 LPDMIGRFAEG---------SATPGTVKEAGLPNITGEINGH----FGSSVAFGTGSLFT 287 Query: 111 NSGTDIIKRGNTNDAGLPAPDYGTFKTYKQSVDGLGAAASETRPRNIAFNYIVRA 165 + G R G + + + +P + ++A Sbjct: 288 SIGGS---RYRATPDGTGGEAFFAAFISASRSSPIYGNSDTVQPPALTLLPCIKA 339 >UniRef50_C5ABB4 Putative uncharacterized protein n=1 Tax=Burkholderia glumae BGR1 RepID=C5ABB4_BURGB Length = 670 Score = 90.9 bits (224), Expect = 1e-17, Method: Composition-based stats. 
Identities = 49/236 (20%), Positives = 73/236 (30%), Gaps = 81/236 (34%) Query: 11 VGVPVPWPSATPPTGWLKCNGAAFSAEEYPEL--------------------------AK 44 +G V +P G++KC+G+ + +YP L A Sbjct: 434 IGQIVMEARTSPRAGYVKCDGSQYKRADYPALWAYAQASGALVSEAEYTDGRWGGFSTAD 493 Query: 45 AYPTNKLPDLRGEFIRGWDDGRG-IDTGRSILSIQGYATEDHAHGLPSRSTIVTDA---- 99 ++PDLRGEF+R W DGRG +D GR+I S QG + HAHG S Sbjct: 494 GQTYFRVPDLRGEFLRCWSDGRGDVDPGRAIGSFQGGQNQAHAHGASSDPDGAHVHDAWT 553 Query: 100 TINFYFDEIWVNSGTDIIKRGN-------------------------------------- 121 + V G + N Sbjct: 554 GGAGWHSHHGVTGGGGMHNHANGVFSRLLRPPYLGSLTGSDTDGSGNEQAVGGGDSADIA 613 Query: 122 ------------TNDAGLPAPDYGTFKTYKQSVDGLGAAASETRPRNIAFNYIVRA 165 + A G + ++ +E RPRN+A ++RA Sbjct: 614 WAGEHQHEFWTDGAGDHVHAVGIGNAGGHAHAIHVQADGGAEARPRNVALLAMIRA 669 >UniRef50_B9BDD9 Bacteriophage protein n=3 Tax=Burkholderia RepID=B9BDD9_9BURK Length = 536 Score = 90.5 bits (223), Expect = 2e-17, Method: Composition-based stats. Identities = 52/235 (22%), Positives = 74/235 (31%), Gaps = 80/235 (34%) Query: 11 VGVPVPWPSATPPTGWLKCNGAAFSAEEYPEL--------------------------AK 44 +G V P + G+LK NGA + +YP L Sbjct: 301 IGTIVFEPRTSVRAGFLKLNGALVNRSDYPALWAYAQASGALVAESAWGQNNWGCFSTGD 360 Query: 45 AYPTNKLPDLRGEFIRGWDDGRGIDTGRSILSIQGYATEDHAHGLPSRSTIVTDAT---- 100 T +LP+LRGEF+R WDDGRG D+ R I + Q + HAHG S + Sbjct: 361 GATTFRLPELRGEFLRCWDDGRGADSARGIGTFQSFQNAWHAHGASSAAVGDHTHGAWTD 420 Query: 101 ------------------------------------------------INFYFDEIWVNS 112 D + Sbjct: 421 AQGWHGHHGWTGGGGGHNHNNGIFSRLLRPPYGGSLTGSDQAGSGSEQAVGAGDSADIAW 480 Query: 113 GTDIIKRGNTNDAGLPAPDY--GTFKTYKQSVDGLGAAASETRPRNIAFNYIVRA 165 D NT +G + + G + ++ G +E RPRNIA ++RA Sbjct: 481 SGDHAHEFNTEGSGTHSHNVGIGGAGAHAHAITVNGDGGNEARPRNIAMLAMIRA 535 >UniRef50_C5B185 Putative uncharacterized protein n=1 Tax=Methylobacterium extorquens AM1 RepID=C5B185_METEA Length = 449 Score = 90.5 bits (223), Expect = 2e-17, Method: Composition-based stats. 
Identities = 40/168 (23%), Positives = 59/168 (35%), Gaps = 34/168 (20%) Query: 7 SALPVGVPVPWPSATPPTGWLKCNGAAFSAEEYPELAKAYPT----------NKLPDLRG 56 S P G+ + + P GW+ G A ++ L T +PDLRG Sbjct: 305 SKSPPGMISAYAGQSCPVGWVDATGLALLRSDFSALFAVIGTRWGAGDGSTTFNVPDLRG 364 Query: 57 EFIRGWDDGRGIDTGRSILSIQGYATEDHAHGLPSRSTIVTDATINFYFDEIWVNSGTDI 116 F+R D G G D GR + S Q + H H +P + T N + + + + Sbjct: 365 YFLRMQDAGAGRDPGRDLGSAQAGSVGPHQHNVPVANATAGSGTTNNFVYPLAAGTSSVP 424 Query: 117 IKRGNTNDAGLPAPDYGTFKTYKQSVDGLGAAASETRPRNIAFNYIVR 164 + AG ETRP NIA Y ++ Sbjct: 425 TTGQDPAPAG------------------------ETRPINIAVWYCIK 448 >UniRef50_A9ITY4 Phage related protein n=6 Tax=Bartonella RepID=A9ITY4_BART1 Length = 376 Score = 90.5 bits (223), Expect = 2e-17, Method: Composition-based stats. Identities = 32/89 (35%), Positives = 41/89 (46%), Gaps = 10/89 (11%) Query: 9 LPVGVPVPWPSATPPTGWLKCNGAAFSAEEYPELAKAYP----------TNKLPDLRGEF 58 LP G+ P+ P GWL C+G A+S Y L T +PD RG F Sbjct: 159 LPSGLIGPFAMERLPDGWLLCDGRAYSRRTYRALFDGIGTTWGEGDGSTTFNVPDFRGMF 218 Query: 59 IRGWDDGRGIDTGRSILSIQGYATEDHAH 87 +RG D R +D RS S QG + + H H Sbjct: 219 LRGMDYERNLDPWRSFASQQGCSLKAHEH 247 >UniRef50_C5A8Q3 Phage-related tail fiber protein n=1 Tax=Burkholderia glumae BGR1 RepID=C5A8Q3_BURGB Length = 865 Score = 90.5 bits (223), Expect = 2e-17, Method: Composition-based stats. 
Identities = 47/228 (20%), Positives = 67/228 (29%), Gaps = 65/228 (28%) Query: 3 LGEGSALPVGVPVPWPSATPPTGWLKCNGAAFSAEEYPEL-------------------- 42 L S+ +G V P T G+LK NG+ +YP L Sbjct: 637 LSALSSSSIGQIVFEPRTTTRAGFLKANGSLLERADYPALWAYAQASGALISDAAWWAGQ 696 Query: 43 ------AKAYPTNKLPDLRGEFIRGWDDGRGIDTGRSILSIQGYATEDHAHGLPSRSTIV 96 ++P+LRGEF+R DDGRG+DT R+ S+Q H+H S Sbjct: 697 SGCFSTGTTGTNFRIPELRGEFLRCLDDGRGLDTSRAAGSLQLSQNAKHSHDASSTVGGS 756 Query: 97 TDATI---------------------------------NFYFDEIWVNSGTDIIKRGNTN 123 F G N N Sbjct: 757 HTHGAFTTGAGSHNHAIDQQPHAHDTWLGSVQVSGVDRGGGFGPYNGRVGEAWSDPANAN 816 Query: 124 ------DAGLPAPDYGTFKTYKQSVDGLGAAASETRPRNIAFNYIVRA 165 + + ++ + E RPRNIA ++RA Sbjct: 817 IAILPTGDHVHGAGTYPAGDHNHAIAVQPSGGDEARPRNIALLAMIRA 864 >UniRef50_A9AVE2 Tail Collar domain protein n=1 Tax=Herpetosiphon aurantiacus ATCC 23779 RepID=A9AVE2_HERA2 Length = 865 Score = 89.8 bits (221), Expect = 3e-17, Method: Composition-based stats. Identities = 35/190 (18%), Positives = 61/190 (32%), Gaps = 40/190 (21%) Query: 3 LGEGSALPVGVPVPWPSATPPTGWLKCNGAAFSAEEYPELAKAYPTNKLPDLRGEFIRG- 61 GS +P G W P GW C+G + + PDLR FI G Sbjct: 687 YSNGSPIPCGTIQMWSGMEVPEGWAICDGREAN------------GLRTPDLRNRFIVGA 734 Query: 62 -----------WDDGRGIDTGRSILSIQGYATEDHAHGLPSRSTIVTDATIN-------- 102 + +G G ++++ H HG + + + Sbjct: 735 GANYDSGNLSVYGTNQGTTGGSDVVALTLDQMPRHTHGGSTNAAGDHSHWVEGTDADGLA 794 Query: 103 ----FYFDEIWVNSGTDIIKRGNTND----AGLPAPDYGTFKTYKQSVDGLGAAASETRP 154 ++ + V+ G + + ND + + GT + G+ A E RP Sbjct: 795 KRRRHHWGDTTVDMGFGGGRNADPNDERWRGRVNTDNAGTHSHGLMIGEVGGSQAHENRP 854 Query: 155 RNIAFNYIVR 164 A +I++ Sbjct: 855 PFYALAFIMK 864 >UniRef50_Q2RUE1 Phage Tail Collar n=5 Tax=Proteobacteria RepID=Q2RUE1_RHORT Length = 187 Score = 89.4 bits (220), Expect = 3e-17, Method: Composition-based stats. 
Identities = 33/174 (18%), Positives = 48/174 (27%), Gaps = 18/174 (10%) Query: 11 VGVPVPWPSATPPTGWLKCNGAAFSAEEYPELAKAYP---------TNKLPDLRGEFIRG 61 +G + P W C G + + LA T LP+L+G G Sbjct: 6 IGEIRIFGFNYAPVDWAFCAGQTVAIAQNQALAVVLGQAFGGDGRTTFGLPNLQGSVPIG 65 Query: 62 WDDGRGIDT-------GRSILSIQGYATEDHAHGLPSRSTIVT--DATINFYFDEIWVNS 112 G G+ G +S+ T H H + S T A N + + Sbjct: 66 AGSGPGLTPRPYAQQAGTDRVSLTLAQTPPHNHSITVASASGTLRTAGPNATAPLSFCSF 125 Query: 113 GTDIIKRGNTNDAGLPAPDYGTFKTYKQSVDGLGAAASETRPRNIAFNYIVRAA 166 GT + G A E R + NY + A Sbjct: 126 VATKATPPKPQTTFTQTAPDGTLAPGALAPFVGGGDAHENRQPYLVLNYCISLA 179 >UniRef50_Q4UNP6 Microcystin dependent protein n=8 Tax=Bacteria RepID=Q4UNP6_XANC8 Length = 175 Score = 89.4 bits (220), Expect = 4e-17, Method: Composition-based stats. Identities = 29/169 (17%), Positives = 47/169 (27%), Gaps = 25/169 (14%) Query: 11 VGVPVPWPSATPPTGWLKCNGAAFSAEEYPELAKAYP---------TNKLPDLRGEFIRG 61 +G + P GW C+G+ S +Y L T +PDLRG Sbjct: 6 IGEIRMFGFGRTPQGWQACDGSLLSISDYEVLFMLIGNTYGGDGQNTFAVPDLRGRVPLH 65 Query: 62 WDDGRGI-------DTGRSILSIQGYATEDHAHGLPSRSTIVTDATINFYFDEIWVNSGT 114 G G+ G +++ G H H L + + T + Sbjct: 66 QGQGPGLSNYVIAQTAGTESVALTGLQLPAHTHTLVATTAAATATAPSGLLPGTVT---- 121 Query: 115 DIIKRGNTNDAGLPAPDYGTFKTYKQSVDGLGAAASETRPRNIAFNYIV 163 G+ A + + G A E + Y + Sbjct: 122 -----GDVFYATDTTGATAAPMATQSTTITGGGLAHENTMPTLTVQYCI 165 >UniRef50_C2FWA0 Phage tail collar domain protein n=2 Tax=Sphingobacterium spiritivorum RepID=C2FWA0_9SPHI Length = 196 Score = 89.4 bits (220), Expect = 4e-17, Method: Composition-based stats. 
Identities = 33/184 (17%), Positives = 50/184 (27%), Gaps = 29/184 (15%) Query: 11 VGVPVPWPSATPPTGWLKCNGAAFSAEEYPELAKAYP---------TNKLPDLRGEFIRG 61 + + P GW+ C+G S Y L T +PDLRG G Sbjct: 5 IAEVRNFAGNFAPAGWILCDGRLLSINNYQVLYTVIGTTYGGDGVNTFGVPDLRGRVPIG 64 Query: 62 WDDGRGIDT-------GRSILSIQGYATEDHAHGLPSRSTI------VTDATINFYFDEI 108 G G+ G +++ H H +T V+ A + Sbjct: 65 TGQGPGLTNVVLGQKIGTETVTLLPANLPVHTHTAAVNATNVPFAVKVSAAAATLHAAAT 124 Query: 109 WVNSGTDIIKRGNTNDAGLPAPDYGTFKTYKQSVDGLGAAA-------SETRPRNIAFNY 161 G + T PD + + A E + NY Sbjct: 125 GSQLGQPMTDSIPTLGYNAANPDKTMGDSSLNTSGLTVNTAMMGSSLPHENMQPFLTTNY 184 Query: 162 IVRA 165 I+ A Sbjct: 185 IICA 188 >UniRef50_C3X971 Predicted protein n=6 Tax=Oxalobacter formigenes OXCC13 RepID=C3X971_OXAFO Length = 534 Score = 89.4 bits (220), Expect = 4e-17, Method: Composition-based stats. Identities = 32/175 (18%), Positives = 60/175 (34%), Gaps = 28/175 (16%) Query: 2 GLGEGSALPVGVPVPWPSATPPTGWLKCNGAAFSAEEYPELAKAYP----------TNKL 51 G+ + +P+G + +TPP G+LK +G A E Y EL T L Sbjct: 242 GISPLNGVPIGTVEYFAMSTPPAGYLKADGRAVGRETYAELYSVIGTTFGEGDEQTTFNL 301 Query: 52 PDLRGEFIRGWDD-GRGIDTGRSILSIQGYATEDHAHGLPSRSTIVTDATINFYFDEIWV 110 PDL F +G + G+ I+ G LP+ ++T++ + Sbjct: 302 PDLIDRFAQGSNTPGQKIEAG-----------------LPNIEGVITNSGSILWAGNEDA 344 Query: 111 NSGTDIIKRGNTNDAGLPAPDYGTFKTYKQSVDGLGAAASETRPRNIAFNYIVRA 165 + + + T + + A+ +P + ++A Sbjct: 345 SGAFSLTGASPRANTATVGAGANTLSFNASQSNQIYGASDTVQPPALTLLPCIKA 399 >UniRef50_Q2S9H9 Microcystin-dependent protein n=3 Tax=Proteobacteria RepID=Q2S9H9_HAHCH Length = 179 Score = 89.4 bits (220), Expect = 4e-17, Method: Composition-based stats. 
Identities = 26/169 (15%), Positives = 46/169 (27%), Gaps = 21/169 (12%) Query: 11 VGVPVPWPSATPPTGWLKCNGAAFSAEEYPELAKAYP---------TNKLPDLRGEFIRG 61 +G + + P W C+G +A + L T LPD+RG Sbjct: 6 IGEIRIFAATFAPRNWSFCDGQVLAASQQAALFSLLGSFYGGDGRTTFALPDMRGRLPLH 65 Query: 62 WDDGRGIDT-------GRSILSIQGYATEDHAHGLPSRSTIVTDATINFYFDEIWVNSGT 114 + G G+ G +++ H H L + + + +G Sbjct: 66 FGQGPGLTPYAIGARVGVESVTVTMENMPPHTHTLMASND-----AVTVDVSPSNQVTGV 120 Query: 115 DIIKRGNTNDAGLPAPDYGTFKTYKQSVDGLGAAASETRPRNIAFNYIV 163 G P Y + +A N+I+ Sbjct: 121 TDPAAPFYTTTGNITPLASEAVGYAGGAQNQQTSPHSIMMPYLALNFII 169 >UniRef50_A5GA41 Phage Tail Collar domain protein n=1 Tax=Geobacter uraniireducens Rf4 RepID=A5GA41_GEOUR Length = 205 Score = 89.4 bits (220), Expect = 4e-17, Method: Composition-based stats. Identities = 30/168 (17%), Positives = 45/168 (26%), Gaps = 20/168 (11%) Query: 12 GVPVPWPSATPPTGWLKCNGAAFSAEEYPELAKAYPT---------NKLPDLRGEFIRGW 62 G + P W C+G Y L T LPDLRG G Sbjct: 32 GEIRMFGGDYAPENWHFCDGTLLPISGYDALYSLIGTAYGGDGINNFALPDLRGRLPIGQ 91 Query: 63 DDGR-------GIDTGRSILSIQGYATEDHAHGLPSRSTIVTDATINFYFDEIWVNSGTD 115 G G G + + T H H + + S T + Sbjct: 92 GQGTDLTNHPVGEKNGTETVGLTLAQTPAHTHTVNAASGTGTQPSPENGVWASLAAVNQF 151 Query: 116 IIKRGNTNDAGLPAPDYGTFKTYKQSVDGLGAAASETRPRNIAFNYIV 163 I + + + + T Q G AA + ++I+ Sbjct: 152 ITPAEVKSPSIIHDMNSAAIGTGYQ----AGGAAHLNMMPSFPLSFII 195 >UniRef50_Q8PR98 Microcystin dependent protein n=1 Tax=Xanthomonas axonopodis pv. citri RepID=Q8PR98_XANAC Length = 195 Score = 89.0 bits (219), Expect = 5e-17, Method: Composition-based stats. 
Identities = 34/182 (18%), Positives = 49/182 (26%), Gaps = 28/182 (15%) Query: 11 VGVPVPWPSATPPTGWLKCNGAAFSAEEYPELAKAYP---------TNKLPDLRGEFIRG 61 +G +P P GWL C G S +Y L T LPDLRG + G Sbjct: 6 IGEVRAFPYNFAPEGWLDCMGQTVSINQYQALFGVIGFAYGGDKQTTFGLPDLRGRAVTG 65 Query: 62 WDDGRGIDTGRSILSIQG---------YATEDHAHGLPSRSTIVTDATINFYFDEIWVNS 112 G G+ +I +QG H H + + A + Sbjct: 66 QGQGPGLSN-YTIGQLQGTDSVALVSSTQLPAHTHSITTMFLPPATAPGAAVNTPSSSSY 124 Query: 113 GTDIIKRGNTNDAGLPAPDYGTFKT---------YKQSVDGLGAAASETRPRNIAFNYIV 163 + ++ + A T A E R Y + Sbjct: 125 LSRLLNPTTSPPTSYKAYAPATTTPMVQLSPNALAPFPSGSQAVQAHENRQPFTTIRYCI 184 Query: 164 RA 165 A Sbjct: 185 CA 186 >UniRef50_Q4TVW2 Tail fiber protein n=4 Tax=Viruses RepID=Q4TVW2_9CAUD Length = 760 Score = 88.6 bits (218), Expect = 7e-17, Method: Composition-based stats. Identities = 34/142 (23%), Positives = 55/142 (38%), Gaps = 4/142 (2%) Query: 8 ALPVGVPVPWPSATPPTGWLKCNGAAFSAEEYPELAKAYPTNKLPDLRGEFIRGWDDGRG 67 A+P+G P+ TPP G+L C+G+ FS +EYP+L + LPD+RG +++ D Sbjct: 263 AVPIGSIFPF-VKTPPAGYLTCDGSTFSKDEYPDLYAYLGSTTLPDMRGRYLKMPSDLAN 321 Query: 68 IDT--GRSILSIQGYATEDHAHGLPSRSTIVTDATINFYFDEIWVNSGTDIIKRGNTNDA 125 I I ++ H H ++ T+ E +V SG + Sbjct: 322 IYQKFPAIIPALLHDVDISHTHTASQQAHAHDRGTME-IGGEFFVGSGHGLYIATGAYGG 380 Query: 126 GLPAPDYGTFKTYKQSVDGLGA 147 + G G Sbjct: 381 AFFSDSPGGADNNGGGASGGLN 402 >UniRef50_B1KMR6 Tail Collar domain protein n=1 Tax=Shewanella woodyi ATCC 51908 RepID=B1KMR6_SHEWM Length = 179 Score = 88.2 bits (217), Expect = 8e-17, Method: Composition-based stats. 
Identities = 26/169 (15%), Positives = 47/169 (27%), Gaps = 21/169 (12%) Query: 11 VGVPVPWPSATPPTGWLKCNGAAFSAEEYPELAKAYP---------TNKLPDLRGEFIRG 61 +G P + KC+GA + + T LPDLRG Sbjct: 6 IGEIKMVGFNFAPRSYAKCDGALLPISQNTAMFSLLGTEFGGDGRTTFGLPDLRGRTPMH 65 Query: 62 WDDGRGIDT-------GRSILSIQGYATEDHAHGLPSRSTIVTDATINFYFDEIWVNSGT 114 +G G+ G ++Q H H + + + S + Sbjct: 66 QGNGPGLSPKTMGQSSGSESNTLQLNQMPKHTHSAQLDAVSTEGTSAVPDNNMYLAKSSS 125 Query: 115 DIIKRGNTNDAGLPAPDYGTFKTYKQSVDGLGAAASETRPRNIAFNYIV 163 + T+ T + ++ G +A N+I+ Sbjct: 126 GL-----TSVNSYSNGTPDTVISPHETNTAGGNSAINNMQPYQVVNFII 169 >UniRef50_C3X3W1 Predicted protein n=2 Tax=Oxalobacter formigenes HOxBLS RepID=C3X3W1_OXAFO Length = 365 Score = 88.2 bits (217), Expect = 8e-17, Method: Composition-based stats. Identities = 33/171 (19%), Positives = 54/171 (31%), Gaps = 27/171 (15%) Query: 3 LGEGSALPVGVPVPWPSATPPTGWLKCNGAAFSAEEYPELAKAYP---------TNKLPD 53 L + LP G + P G+L CNGA+ S YPEL T LPD Sbjct: 212 LDKAEKLPAGTIIAVGGNITPEGFLYCNGASLSPSAYPELCAVIGGTYGGDGLTTFNLPD 271 Query: 54 LRGEFIRGWDDGRGIDTGRSILSIQGYATEDHAHGLPSRSTIVTDATINFYFDEIWVNSG 113 RG +++G D GR + + + + + Sbjct: 272 FRGRWMQGNDT-----AGRVL----AAGLPNVTGTI---------VSGAIAHATAYQTGA 313 Query: 114 TDIIKRGNTNDAGLPAPDYGTFKTYKQSVDGLGAAASETRPRNIAFNYIVR 164 I G + ++ + + A+ RP +I Y ++ Sbjct: 314 FYNIDVGAFGGYHAGSQNHYRAGFEASRSNPIYGASDTVRPPSITVRYCIK 364 >UniRef50_A7INV5 Tail Collar domain protein n=1 Tax=Xanthobacter autotrophicus Py2 RepID=A7INV5_XANP2 Length = 492 Score = 87.5 bits (215), Expect = 1e-16, Method: Composition-based stats. 
Identities = 42/163 (25%), Positives = 59/163 (36%), Gaps = 39/163 (23%) Query: 11 VGVPVPWPSATPPTGWLKCNGAAFSAEEYPELAKAYPT----------NKLPDLRGEFIR 60 G WP++TPP+G L NGA S Y L T +P+ G F+R Sbjct: 357 PGTIAMWPASTPPSGALVRNGATLSRTVYASLFAVIGTTFGAGDGATTFGVPNDLGIFVR 416 Query: 61 GWDDGRGIDTGRSILSIQGYATEDHAHGLPSRSTIVTDATINFYFDEIWVNSGTDIIKRG 120 GWD+GRG DTGR S Q + H H + S + T F Sbjct: 417 GWDNGRGYDTGRVFGSEQADDNKSHDHARQTVSGVFTAGGAGFALQ-------------- 462 Query: 121 NTNDAGLPAPDYGTFKTYKQSVDGLGAAASETRPRNIAFNYIV 163 + + + +E RP+N A+ I+ Sbjct: 463 ---------------DSGSTTQRVASSGGAEARPKNRAYLPII 490 >UniRef50_Q2W7B1 Microcystin-dependent protein n=3 Tax=Proteobacteria RepID=Q2W7B1_MAGSA Length = 177 Score = 87.1 bits (214), Expect = 2e-16, Method: Composition-based stats. Identities = 32/169 (18%), Positives = 48/169 (28%), Gaps = 25/169 (14%) Query: 12 GVPVPWPSATPPTGWLKCNGAAFSAEEYPELAKAYP---------TNKLPDLRGEFIRGW 62 G +P PTGWL C+G + L T LPDLRG I G Sbjct: 7 GEIRLFPLNWAPTGWLPCDGRSMQVSANAALFSLLGNQFGGDAKTTFFLPDLRGRTIMGQ 66 Query: 63 DDGR--------GIDTGRSILSIQGYATEDHAHGLPSRSTIVTDATINFYFDEIWVNSGT 114 G G +++ H H + T+ + + + + +GT Sbjct: 67 GKNPVTGVSYVTGAYGGTESVTLTTAQLPSHQHQVVGDQTVGATNPADDNYLAVPIYNGT 126 Query: 115 DIIKRGNTNDAGLPAPDYGTFKTYKQSVDGLGAAASETRPRNIAFNYIV 163 + P G AA +A Y + Sbjct: 127 QKSLYNSGTKPVPLNP--------ASVSTVGGGAAHTNTQPYLALGYCI 167 >UniRef50_C0DSG4 Putative uncharacterized protein n=1 Tax=Eikenella corrodens ATCC 23834 RepID=C0DSG4_EIKCO Length = 436 Score = 87.1 bits (214), Expect = 2e-16, Method: Composition-based stats. 
Identities = 36/210 (17%), Positives = 61/210 (29%), Gaps = 52/210 (24%) Query: 6 GSALPVGVPVPWPSA-TPPTGWLKCNGAAFSAEEYPELAKAYPTNKLPDL-------RGE 57 G LPVG V +P A + P G+LK +G+ F+ YP+L + NKLP+L Sbjct: 69 GKGLPVGAVVGFPRAISSPEGYLKADGSTFAQATYPDLYRVLGGNKLPNLTRSDVGMTAY 128 Query: 58 FIR-GWDDG-----------------------------------------RGIDTGRSIL 75 F DG R ++ Sbjct: 129 FPIEAIPDGWIKYDEVATKVTQSAYPELYRLLVAQYGSIDAVPKAEDRFIRNASGSLAVG 188 Query: 76 SIQGYATEDHAHGLPSRSTIVTDATINFYFDEIWVNSGTDIIKRGNTNDAGLPAPDYGTF 135 + QG + G+ + + ++ + + + Sbjct: 189 TQQGDTIRNITGGIEALYSGYRYTLYTKADGAFTMDLDDG--ANSTFSSSKGDSDHNNRK 246 Query: 136 KTYKQSVDGLGAAASETRPRNIAFNYIVRA 165 K A E RP+ +A ++A Sbjct: 247 KRVVFDASRSVPTADEVRPKALAMVLCIKA 276 >UniRef50_A1HR57 Putative uncharacterized protein n=1 Tax=Thermosinus carboxydivorans Nor1 RepID=A1HR57_9FIRM Length = 269 Score = 87.1 bits (214), Expect = 2e-16, Method: Composition-based stats. Identities = 52/184 (28%), Positives = 64/184 (34%), Gaps = 50/184 (27%) Query: 7 SALPVGVPVPWPSATPPTGWLKCNGAAFSAEEYPEL------------------------ 42 PVG V S G+LK NGA S YP L Sbjct: 109 DGTPVGRIVAEISPICRPGYLKANGALVSRAAYPRLWAYVQARGLVVPDTVWPANYWGCF 168 Query: 43 --AKAYPTNKLPDLRGEFIRGWDDGRGIDTGRSILSIQGYATEDHAHGLPSRSTIVTDAT 100 T +LPDLRGEFIRG DDGRG+D GR+ S Q + H H Sbjct: 169 STGDGSTTFRLPDLRGEFIRGGDDGRGVDGGRAFGSWQADGIKSHNHPYQ---------- 218 Query: 101 INFYFDEIWVNSGTDIIKRGNTNDAGLPAPDYGTFKTYKQSVDGLGAAASETRPRNIAFN 160 + + + G D+I T + ETRPRNIA Sbjct: 219 -SQPYLFVESFDGGDVIAER-------------TSTAKWVTHYTSNFGGPETRPRNIALL 264 Query: 161 YIVR 164 Y ++ Sbjct: 265 YCIK 268 >UniRef50_C3X909 Predicted protein n=2 Tax=Oxalobacter formigenes OXCC13 RepID=C3X909_OXAFO Length = 549 Score = 87.1 bits (214), Expect = 2e-16, Method: Composition-based stats. 
Identities = 36/174 (20%), Positives = 58/174 (33%), Gaps = 23/174 (13%) Query: 2 GLGEGSALPVGVPVPWPSATPPTGWLKCNGAAFSAEEYPELAKAYP----------TNKL 51 G+ S +PVG + TPP G+LK +G+A S YP+L A T L Sbjct: 257 GVVCPSGVPVGAIGYFAMQTPPAGYLKADGSAVSRATYPDLFGAIGTTFGEGDGSTTFNL 316 Query: 52 PDLRGEFIRGWDDGRGIDTGRSILSIQGYATEDHAHGLPSRSTIVTDATINFYFDEIWVN 111 PDL F +G G I + L + + + + F + Sbjct: 317 PDLIDRFAQG-----NATPGLKI----EAGLPNITGSL-TVTASNQGSAASGAFSRTQIG 366 Query: 112 SGTDIIKRGNTNDAGLPAPDYGTFKTYKQSVDGLGAAASETRPRNIAFNYIVRA 165 + + G N +G Y + G +P + ++A Sbjct: 367 AVGGGLGGGQYNSSGCGPNLYSFDSRVSNPIYGASNT---VQPPALTLLPCIKA 417 >UniRef50_B8HZW4 Tail Collar domain protein n=2 Tax=Clostridium RepID=B8HZW4_CLOCE Length = 200 Score = 87.1 bits (214), Expect = 2e-16, Method: Composition-based stats. Identities = 38/198 (19%), Positives = 67/198 (33%), Gaps = 40/198 (20%) Query: 3 LGEGSALPVGVPVPWPSATPPT--------GWLKCNGAAFSAEEYPELAKAYP------- 47 + +P+G + + GWL C+G+ EYP+L +A Sbjct: 1 MASTERMPIGSVISFAGEIKSEMVNRLYRMGWLICDGSKLKIAEYPDLFQAIGKAHGGDN 60 Query: 48 -TNKLPDLRGEFIRGWD------DGRGIDT--------------GRSILSIQGYATEDHA 86 LPD + +FIRG + GR +D G ++ S Q +A Sbjct: 61 TYFYLPDTQSKFIRGVNGDSVGESGRLMDPDVAKRTFAKPGGNTGNNVGSYQDFA----T 116 Query: 87 HGLPSRSTIVTDATINFYFDEIWVNSGTDIIKRGNTNDAGLPAPDYGTFKTYKQSVDGLG 146 T + + S + + T ++ S + +G Sbjct: 117 GLPKVSLTTDFIGSHTHSLPHLPDGSHNAYAGSIGRDGGKEAGDNTRTGESGSHSHEIIG 176 Query: 147 AAASETRPRNIAFNYIVR 164 ETRPRN+ ++I++ Sbjct: 177 GGDPETRPRNMNLHFIIK 194 >UniRef50_Q4KAW3 Putative uncharacterized protein n=1 Tax=Pseudomonas fluorescens Pf-5 RepID=Q4KAW3_PSEF5 Length = 191 Score = 86.7 bits (213), Expect = 2e-16, Method: Composition-based stats. 
Identities = 32/175 (18%), Positives = 47/175 (26%), Gaps = 23/175 (13%) Query: 12 GVPVPWPSATPPTGWLKCNGAAFSAEEYPELAKAYP---------TNKLPDLRGEFIRGW 62 G + P GW CNGA + L T LPD RG G Sbjct: 7 GEIRMFAGNFAPRGWALCNGAQLLIRNFEALYTLIGTTYGGDGSNTFCLPDYRGRTPIGQ 66 Query: 63 DDGRGIDT---GRSILSIQGY----ATEDHAHGLPSRSTIVTDATINFYFDEIWVNSGTD 115 +G G G+++ + Q T H HG S+ VT A + Sbjct: 67 GNGPGFTPRALGQAVGTEQVTMSALNTPPHIHGFQVSSSEVTSANPLPANQPANSYTFGK 126 Query: 116 IIKRGNTNDAGLPAPDYGTFKTYKQSVDGLGAA-------ASETRPRNIAFNYIV 163 G+ + + ++A YI+ Sbjct: 127 FKLEGSFTGLYSKGDSTSAVVSMSPNFLSPALGIPNKAVEPHSNMMGSLAITYII 181 >UniRef50_B0SX68 Tail Collar domain protein n=6 Tax=Bacteria RepID=B0SX68_CAUSK Length = 179 Score = 86.3 bits (212), Expect = 3e-16, Method: Composition-based stats. Identities = 29/168 (17%), Positives = 45/168 (26%), Gaps = 19/168 (11%) Query: 11 VGVPVPWPSATPPTGWLKCNGAAFSAEEYPELAKAYP---------TNKLPDLRGEFIRG 61 VG + PP+GW C+G + E L T +PDLRG Sbjct: 6 VGEIRIFGGNFPPSGWAFCDGQLMAISENDTLFNLIGTTYGGDGQETFGIPDLRGRAPVH 65 Query: 62 WDDGRGID------TGRSILSIQGYATEDHAHGLPSRSTIVTDATINFYFDEIWVNSGTD 115 G G +++ H H L + ST + T + Sbjct: 66 QGTQAGTTYVIGERAGVESVTLTANQMAQHTHPLMAASTAGSVGTPTGQTMLSSMGPTGI 125 Query: 116 IIKRGNTNDAGLPAPDYGTFKTYKQSVDGLGAAASETRPRNIAFNYIV 163 + D T G + + N+I+ Sbjct: 126 SLNAYLPYDPANEQ----VALTPASLTPVGGNQPHDNMQPYLGLNFII 169 >UniRef50_B3PJI8 Conserved domain protein n=1 Tax=Cellvibrio japonicus Ueda107 RepID=B3PJI8_CELJU Length = 243 Score = 86.3 bits (212), Expect = 3e-16, Method: Composition-based stats. 
Identities = 23/168 (13%), Positives = 44/168 (26%), Gaps = 22/168 (13%) Query: 11 VGVPVPWPSATPPTGWLKCNGAAFSAEEYPELAKAYP---------TNKLPDLRGEFIRG 61 +G + P W C+G + + L T LPD RG G Sbjct: 73 IGEVRMFAITYAPRHWTDCSGQLLAINQNQALFSLLGVNFGGNGTTTFGLPDYRGRTPIG 132 Query: 62 WDDG------RGIDTGRSILSIQGYATEDHAHGLPSRSTIVTDATINFYFDEIWVNSGTD 115 RG G +++ H H L +++ + GT Sbjct: 133 EGSFQGNTYVRGNQGGSESVTLTVAQIPPHNHLLFAQNAPGS-------APNPIPPGGTG 185 Query: 116 IIKRGNTNDAGLPAPDYGTFKTYKQSVDGLGAAASETRPRNIAFNYIV 163 + ++ D + + A ++ + + Sbjct: 186 SLCAISSGDYYVVPDTNLVQLASDAVTNTGSNVAHSNLQPSLVLRFAI 233 >UniRef50_A5GA42 Phage Tail Collar domain protein n=2 Tax=Bacteria RepID=A5GA42_GEOUR Length = 181 Score = 86.3 bits (212), Expect = 3e-16, Method: Composition-based stats. Identities = 28/170 (16%), Positives = 40/170 (23%), Gaps = 23/170 (13%) Query: 12 GVPVPWPSATPPTGWLKCNGAAFSAEEYPELAKAYPT---------NKLPDLRGEFIRGW 62 G +P P GW CNGA + L T LPDL+G Sbjct: 7 GEIRIFPFDFAPRGWALCNGALLPIVQNQALYSLLNTTFGGDGKTNFGLPDLQGRVPMPP 66 Query: 63 DDGR--------GIDTGRSILSIQGYATEDHAHGLPSRS-TIVTDATINFYFDEIWVNSG 113 G G +++ H H + S + + IW Sbjct: 67 GTNPVCGNIVAAGKKDGSETVTLTTSQIPPHTHSALANSINADFASPVTLAAGNIWAK-- 124 Query: 114 TDIIKRGNTNDAGLPAPDYGTFKTYKQSVDGLGAAASETRPRNIAFNYIV 163 ++N G A NY + Sbjct: 125 ---ADDPSSNPVNAYESGANAVMDQSALSTAGGGGAHNNMQPYQVVNYCI 171 >UniRef50_A6EAB8 Phage Tail Collar n=1 Tax=Pedobacter sp. BAL39 RepID=A6EAB8_9SPHI Length = 183 Score = 85.9 bits (211), Expect = 4e-16, Method: Composition-based stats. 
Identities = 33/168 (19%), Positives = 57/168 (33%), Gaps = 17/168 (10%) Query: 12 GVPVPWPSATPPTGWLKCNGAAFSAEEYPELAKAYPT---------NKLPDLRGEFIRGW 62 G + P W+ CNGA + + L + K+PDLRG G Sbjct: 7 GEIRAFAGTYAPVDWMMCNGATLTVQGNEALYSLIGSTYGSNGPTDFKVPDLRGRLTVGQ 66 Query: 63 DDGRGIDT-------GRSILSIQGYATEDHAHGLPSRSTIVTDATINFYFDEIWVNSGTD 115 G G+ + G +++ H H L + ST+ + A++N + ++ Sbjct: 67 GLGTGLTSRILGSVGGAETVALTEAQLPAHNHNL-TVSTVTSPASVNAPSNTSYLGVVNS 125 Query: 116 IIKRGNTNDAGLPAPDYGTFKTYKQSVDGLGAAASETRPRNIAFNYIV 163 G G + + G+ A +A NYI+ Sbjct: 126 SAGAGVGYVPGNATGASVRALDTQVLSNTGGSQAHANIMPFLALNYII 173 >UniRef50_Q8EKB1 Putative uncharacterized protein n=1 Tax=Shewanella oneidensis RepID=Q8EKB1_SHEON Length = 183 Score = 85.9 bits (211), Expect = 4e-16, Method: Composition-based stats. Identities = 29/174 (16%), Positives = 51/174 (29%), Gaps = 30/174 (17%) Query: 13 VPVPWPSATPPTGWLKCNGAAFSAEEYPELAKAYP---------TNKLPDLRGEFIRGWD 63 + P GW KC+G+ S + L T LPDLRG Sbjct: 7 EIRMFSFEWAPKGWAKCDGSLMSIAQNNALFALLGVQFGGNGTTTFALPDLRGRAPVHVG 66 Query: 64 DGR-----------GIDTGRSILSIQGYATEDHAHGLPSRSTIVTDATINFYFDEIWVNS 112 G G +S+ H H +V +T++ + + Sbjct: 67 VSDRNSPSYSTFKIGAVGGTENVSLTQSQMPAHNH-------LVAASTVSGTVKPLANDI 119 Query: 113 GTDIIKRGNTNDAGLPAPDYGTFK---TYKQSVDGLGAAASETRPRNIAFNYIV 163 + + N + + AP + AA + ++A N+ + Sbjct: 120 IGAGLNKQNGQPSHVYAPYNPATQVPLATDVVSVQGAGAAHQNCQPSLAINFCI 173 >UniRef50_A3YFP9 35 kDa protein-like n=1 Tax=Marinomonas sp. MED121 RepID=A3YFP9_9GAMM Length = 207 Score = 85.9 bits (211), Expect = 4e-16, Method: Composition-based stats. 
Identities = 42/193 (21%), Positives = 66/193 (34%), Gaps = 51/193 (26%) Query: 6 GSALPVGVPVPWPS------------ATPPTGWLKCNGAAFSAEEYPELAKAYPT----- 48 G A+PVG + + P WLKC+G++ +YPEL A Sbjct: 29 GDAMPVGSVIAFAGEIRTSGDKPFETNLPMFNWLKCDGSSLEVAQYPELFSALGYRYGGS 88 Query: 49 ---NKLPDLRGEFIRGWD--------------DGRGIDTGRSILSIQGYATEDHAHGLPS 91 LPDLRGEF+RG D G + S QG+A + H H Sbjct: 89 GQKFNLPDLRGEFLRGVDVDSSNNKKASLEGRKGAANGGNHEVGSTQGFALQSHVHTYQK 148 Query: 92 RSTIVTDATINFYFDEIWVNSGTDIIKRGNTNDAGLPAPDYGTFKTYKQSVDGLGAAASE 151 + + + G + + + + Q + L + E Sbjct: 149 PKRAMPI-----------------LAEPGVSTTQIPLSQEDTSTPKSSQKNENLALSDKE 191 Query: 152 TRPRNIAFNYIVR 164 TRP N ++++ Sbjct: 192 TRPVNTFVYWLIK 204 >UniRef50_B5TAB1 Gp47 n=2 Tax=root RepID=B5TAB1_9CAUD Length = 325 Score = 85.9 bits (211), Expect = 4e-16, Method: Composition-based stats. Identities = 35/180 (19%), Positives = 56/180 (31%), Gaps = 41/180 (22%) Query: 12 GVPVPWPSATPPTGWLKCNGAAFSAEEYPELAKAYPTNKLPDLRGEFIRGWDDGRGIDTG 71 G+ V + P + C+ +LPD+RGE +R WD+GRG+D Sbjct: 159 GLIVA-AANWSPGVYAFCD-------------IDANQFRLPDVRGEGLRLWDNGRGVDQA 204 Query: 72 RSILSIQGYATEDHAHGLPSRSTIVTDATINFYFDEIWVNSGT----------------- 114 R++ S QG A E H H S N+G Sbjct: 205 RTLGSWQGGAIESHGHAANSGDAGAVADRRTGSGGGHNHNNGIFTRLLRAPYVGSITGSD 264 Query: 115 ----------DIIKRGNTNDAGLPAPDYGTFKTYKQSVDGLGAAASETRPRNIAFNYIVR 164 + G ++ + +ETR RN+A +++ Sbjct: 265 TTNSGDEQAVGGGDSADIAAVGDHDHLIPGVGPHRHDISISATGGNETRMRNVAVAALIK 324 >UniRef50_C3X192 Predicted protein n=1 Tax=Oxalobacter formigenes HOxBLS RepID=C3X192_OXAFO Length = 361 Score = 85.9 bits (211), Expect = 5e-16, Method: Composition-based stats. 
Identities = 29/174 (16%), Positives = 61/174 (35%), Gaps = 25/174 (14%) Query: 2 GLGEGSALPVGVPVPWPSATPPTGWLKCNGAAFSAEEYPELAKAYP----------TNKL 51 + + S +P+G + ATPP G+LK +GAA YP+L A T L Sbjct: 73 AINKRSGVPIGTVEYFAMATPPAGYLKADGAAVGRATYPDLFAAIGTTFGAGDGETTFNL 132 Query: 52 PDLRGEFIRGWDDGRGIDTGRSILSIQGYATEDHAHGLPSRSTIVTDATINFYFDEIWVN 111 PD+ G+F G + +++ + + + ++ +A+ I Sbjct: 133 PDMIGQFAEG---------SATPGAVKEAGLPNIIGSISNVASGGANASSASGALSIAAR 183 Query: 112 SGTDIIKRGNTNDAGLPAPDYGTFKTYKQSVDGLGAAASETRPRNIAFNYIVRA 165 S ++ + + + ++ +P + ++A Sbjct: 184 SNNNMTPGSSAYGHTF------ALAINASDFNPIYGKSNTVQPPALTLLPCIKA 231 >UniRef50_Q1QPI5 Phage Tail Collar n=10 Tax=Proteobacteria RepID=Q1QPI5_NITHX Length = 180 Score = 85.5 bits (210), Expect = 5e-16, Method: Composition-based stats. Identities = 27/169 (15%), Positives = 50/169 (29%), Gaps = 23/169 (13%) Query: 11 VGVPVPWPSATPPTGWLKCNGAAFSAEEYPELAKAYPT---------NKLPDLRGEFIRG 61 +G + P W CNGA + + L T LP+ G Sbjct: 5 IGQIQIFGFNYAPRNWAFCNGATLAIRQNTALFSLLGTMYGGDGVTTFMLPNFAGRTGCN 64 Query: 62 WDDGRGIDT-------GRSILSIQGYATEDHAHGLPSRSTIVTDATINFYFDEIWVNSGT 114 G G+ G + +++ H+H + T + +G+ Sbjct: 65 QGQGVGLTARTIGEAFGENSVALVSEEMPSHSHSFTVYNQTDTTKRTSAPA------NGS 118 Query: 115 DIIKRGNTNDAGLPAPDYGTFKTYKQSVDGLGAAASETRPRNIAFNYIV 163 ++ N+ F + + G G E R +A N+ + Sbjct: 119 SLVVPQNSTPFSSSGTANTQFSPHMGGLTG-GNQPHENRQPYLAMNFCI 166 >UniRef50_Q2T5M0 Phage-related tail fiber protein n=19 Tax=root RepID=Q2T5M0_BURTA Length = 790 Score = 85.5 bits (210), Expect = 5e-16, Method: Composition-based stats. 
Identities = 50/239 (20%), Positives = 68/239 (28%), Gaps = 80/239 (33%) Query: 7 SALPVGVPVPWPSATPPTGWLKCNGAAFSAEEYPEL------------------------ 42 SA +G V P T G+LK NG + +YPEL Sbjct: 551 SATTIGQIVFEPRTTVRPGFLKANGVLVNRADYPELWAYAQASGALVSDADWMKDRWGCF 610 Query: 43 --AKAYPTNKLPDLRGEFIRGWDDGR-GIDTGRSILSIQGYATEDHAHGLPSRSTIVTDA 99 T +LP+LRGEFIR W D R G+D R I + QG HAHG + Sbjct: 611 STGDGATTFRLPELRGEFIRCWSDARGGVDATRQIGAFQGDQNHTHAHGAAASEAPDHVH 670 Query: 100 T-----------------------------------------------------INFYFD 106 T + + Sbjct: 671 TAWTDVQGWHGHHGWTNAVGDHQHVSPWGEHPQMYNPPWGTWGAANNRGAEGSDNDNVYG 730 Query: 107 EIWVNSGTDIIKRGNTNDAGLPAPDYGTFKTYKQSVDGLGAAASETRPRNIAFNYIVRA 165 + N A G + ++ E RPRN+A ++RA Sbjct: 731 MTSPAGNHNHEFNTEGNGNHGHAVGIGGGGRHAHTIAVQPDGGDEARPRNVALLALIRA 789 >UniRef50_B4EF34 Putative phage tail protein n=1 Tax=Burkholderia cenocepacia J2315 RepID=B4EF34_BURCJ Length = 883 Score = 85.2 bits (209), Expect = 8e-16, Method: Composition-based stats. Identities = 49/230 (21%), Positives = 75/230 (32%), Gaps = 76/230 (33%) Query: 12 GVPVPWPSATPPTGWLKCNGAAFSAEEYPEL--------------------------AKA 45 G V P T G+LK NGA +YP L Sbjct: 653 GTVVFEPRTTARAGFLKLNGALLKRADYPALWAYAQASGALSTETDWAAGWSGTFSTGDG 712 Query: 46 YPTNKLPDLRGEFIRGWDDGRGIDTGRSILSIQ--------------------------- 78 T ++P+LRGEF+R WDD RG+D R + + Q Sbjct: 713 TTTFRIPELRGEFVRCWDDTRGVDPNRGLGASQNFANAWHAHGASAAASGDHVHSAWTDV 772 Query: 79 ---------GYATEDHAHGLP------------SRSTIVTDATINFYFDEIWVNSGTDII 117 + DH H P S + + + ++ + + Sbjct: 773 QGWHGHHGWTASVGDHQHVAPYSESGIAPFGTHSTNQVGSHGGVDNDNPWAFTSGAGGHN 832 Query: 118 KRGNTNDAGLPAPDY--GTFKTYKQSVDGLGAAASETRPRNIAFNYIVRA 165 NT AG + G + ++ G A+E+RPRN+A ++RA Sbjct: 833 HEFNTEGAGNHGHNVGIGAAGNHSHAITVNGDGANESRPRNVALLAMIRA 882 >UniRef50_B3X2T1 Side tail fiber protein n=1 Tax=Shigella dysenteriae 1012 RepID=B3X2T1_SHIDY Length = 488 Score = 85.2 bits (209), Expect = 8e-16, Method: Composition-based stats. 
Identities = 47/230 (20%), Positives = 70/230 (30%), Gaps = 72/230 (31%) Query: 9 LPVGVPVPWPSATPPTGWLKCNGAAFSAEEYPELAKAYPTNKLPDLRGEFIRGW------ 62 P G P+PWPS T P+G+ G F YP+LA AYP+ +PD+RG I+G Sbjct: 259 YPPGAPIPWPSDTVPSGYALMQGQTFDKSAYPKLAVAYPSGVIPDMRGWTIKGKPASGRA 318 Query: 63 ---------------------DDGRGIDTGRSILSIQGYATEDHAHGLPSRS-------- 93 D G + + T H H + + Sbjct: 319 VLSQEQDGIKSHTHSASASSTDLGTKTTSSFDYGTKSTNNTGAHTHSISGTANSAGAHQH 378 Query: 94 -----------------------------TIVTDATINFYFDEIWVNSGTDIIKRGNTND 124 + + + + + G Sbjct: 379 KSSGAFGGTNTSIFPNGYTAISNLSAGIMSTTSGSGQTRNAGKTSSDGAHTHSLSGTAAS 438 Query: 125 AGLPAPDYG--------TFKTYKQSVDGLGAAASETRPRNIAFNYIVRAA 166 AG A G ++ ++ A +E +NIAFNYIVR A Sbjct: 439 AGAHAHTVGIGAHTHSVAIGSHGHTITVNAAGNAENTVKNIAFNYIVRLA 488 >UniRef50_C6S6V6 Putative phage tail fibre protein n=1 Tax=Neisseria meningitidis alpha14 RepID=C6S6V6_NEIML Length = 728 Score = 85.2 bits (209), Expect = 8e-16, Method: Composition-based stats. Identities = 37/178 (20%), Positives = 59/178 (33%), Gaps = 26/178 (14%) Query: 2 GLGEGSALP------VGVPVPWPSATPPTGWLKCNG--AAFSAEEYPELAKAYPTN---- 49 LG + LP +G+ +PS PTGWL + + YPEL + Sbjct: 357 ALGNSNRLPDLSRTDIGITAWFPSDQIPTGWLAFDDIRTRVTETAYPELYRLLTGKYGSI 416 Query: 50 -KLPDLRGEFIRGWDDGRGIDTGRSILSIQGYATEDHAHGLPSRSTIVTDATINFYFDEI 108 +P FIR ++ + Q + H H + S T TD Y D Sbjct: 417 QNVPQAEDRFIR------NAGNSLAVGTKQEDEIKRHTHKVFSHWTSHTDVAAVGYEDGN 470 Query: 109 WVNSGTDIIKRGNTNDAGLPAPDYGTFKTYKQSVDGLGAAASETRPRNIAFNYIVRAA 166 + + N + F T + + E RP+ + ++AA Sbjct: 471 ERQRSALVSTWTDENLSD------NGFLTPRLD-SKMATGGDENRPKALVLKLCIKAA 521 >UniRef50_B1HUA7 Microcystin dependent protein MdpB n=2 Tax=Bacteria RepID=B1HUA7_LYSSC Length = 178 Score = 84.8 bits (208), Expect = 9e-16, Method: Composition-based stats. 
Identities = 26/169 (15%), Positives = 47/169 (27%), Gaps = 22/169 (13%) Query: 11 VGVPVPWPSATPPTGWLKCNGAAFSAEEYPELAKAYP---------TNKLPDLRGEFIRG 61 +G + P GW CNG + ++ L T LP+L Sbjct: 6 IGEIRIFSGNFAPKGWALCNGQLMNIQQNTALYSILGVQYGGDGKTTFALPNLMASAPMN 65 Query: 62 WDDGRGIDT-------GRSILSIQGYATEDHAHGLPSRSTIVTDATINFYFDEIWVNSGT 114 G G+ G +++ H+H + ++ T F V +G Sbjct: 66 QGSGMGLTPRKVGEAVGTQTVTLLESQIPAHSHTPVAIQSVGTSGNPIGLFWAEGVGAGR 125 Query: 115 DIIKRGNTNDAGLPAPDYGTFKTYKQSVDGLGAAASETRPRNIAFNYIV 163 + L + + G+ + NYI+ Sbjct: 126 PPKQP------HLYDTNLDVQMNTQALGLTGGSQPHNNMQPFLVMNYII 168 >UniRef50_Q4C9U4 Phage Tail Collar n=1 Tax=Crocosphaera watsonii WH 8501 RepID=Q4C9U4_CROWT Length = 253 Score = 84.4 bits (207), Expect = 1e-15, Method: Composition-based stats. Identities = 29/165 (17%), Positives = 46/165 (27%), Gaps = 11/165 (6%) Query: 6 GSALPVGVPVPWPSATPPTGWLKCNGAAFSAEEYPELAKAYPT--------NKLPDLRGE 57 GS +P V + A P GWL C+G + YP+L A ++PD+R Sbjct: 69 GSIIPKSSIVVFGGAVAPNGWLFCDGTPYDPSTYPQLFSAIGYGFGQVGSLFRVPDMRDR 128 Query: 58 FIRGWDDG--RGIDTGRSILSIQGYATEDHAHGLPSRSTIVT-DATINFYFDEIWVNSGT 114 G RG G + S+ H+H + + + + Sbjct: 129 SPVGAGISFDRGTFGGSATTSLSVDNMPAHSHNVIDPGHTHSMNHGPGQHSAVALDYHNA 188 Query: 115 DIIKRGNTNDAGLPAPDYGTFKTYKQSVDGLGAAASETRPRNIAF 159 G A + R + F Sbjct: 189 GNGVDAYVPQWGGHAHTIYASGVGISLENTGSGTPVSVRNPYVGF 233 >UniRef50_C7PNC3 Tail Collar domain protein n=1 Tax=Chitinophaga pinensis DSM 2588 RepID=C7PNC3_CHIPD Length = 183 Score = 84.4 bits (207), Expect = 1e-15, Method: Composition-based stats. 
Identities = 24/168 (14%), Positives = 41/168 (24%), Gaps = 16/168 (9%) Query: 12 GVPVPWPSATPPTGWLKCNGAAFSAEEYPELAKAYP---------TNKLPDLRGEFIRGW 62 G +P P GW+ C+G + L LP+L G I G Sbjct: 6 GEIRLFPYTQIPRGWVSCSGQTLPIAQNQALFALLGVYYGGNGTTNFMLPNLNGRAIVGT 65 Query: 63 DDGR-------GIDTGRSILSIQGYATEDHAHGLPSRSTIVTDATINFYFDEIWVNSGTD 115 G +G +++ H+H + + + YF S Sbjct: 66 GQSTSGTNYNIGQASGTETVTLVTNNLPAHSHPVKVNVSYDQGSPNTNYFANANTPSSPT 125 Query: 116 IIKRGNTNDAGLPAPDYGTFKTYKQSVDGLGAAASETRPRNIAFNYIV 163 + + G R + Y + Sbjct: 126 QPGQNTGTVNLFSPAVTPLVEMAPSVTSTGGGLPHANRMPYLTLIYCI 173 >UniRef50_P51735 Probable tail fiber protein n=27 Tax=root RepID=VPH_BPHP1 Length = 925 Score = 84.0 bits (206), Expect = 2e-15, Method: Composition-based stats. Identities = 32/176 (18%), Positives = 63/176 (35%), Gaps = 26/176 (14%) Query: 3 LGEGSALP------VGVPVPWPSATPPTGWLKCNG--AAFSAEEYPELAKAY-----PTN 49 LG+ + LP VG+ + P+GW+ + + + + YPEL + + Sbjct: 573 LGDSNQLPDLTRSDVGMTAYFAVDNIPSGWIAFDSIRSTVTQQNYPELYQYLVDKYSSIS 632 Query: 50 KLPDLRGEFIRGWDDGRGIDTGRSILSIQGYATEDHAHGLPSRSTIVTDATINFYFDEIW 109 +P FIR G +I Q + H H + + D++ + F + Sbjct: 633 NVPLAEDRFIR------NTGNGLNIGQTQSDEIKKHVHRVRTH---WADSSDSSIFYDKT 683 Query: 110 VNSGTDIIKRGNTNDAGLPAPDYGTFKTYKQSVDGLGAAASETRPRNIAFNYIVRA 165 ++ T D L + + + ETRP+++ ++A Sbjct: 684 KTVIDSRLRTATTTDDNLSDNGFM----HPLLDTPMATGGDETRPKSLILKLCIKA 735 >UniRef50_Q12HS4 Phage Tail Collar n=1 Tax=Shewanella denitrificans OS217 RepID=Q12HS4_SHEDO Length = 198 Score = 84.0 bits (206), Expect = 2e-15, Method: Composition-based stats. 
Identities = 33/183 (18%), Positives = 54/183 (29%), Gaps = 30/183 (16%) Query: 11 VGVPVPWPSATPPTGWLKCNGAAFSAEEYPELAKAYP---------TNKLPDLRGEFIRG 61 +G + + P W CNG E L + LPDLRG Sbjct: 6 IGEIRMFAGSYAPQYWAFCNGQLLPIAENQALFSLLGYVYGGTQGVSFALPDLRGRVPVH 65 Query: 62 WDDGRGIDT-------GRSILSIQGYATEDHAHGLPSRSTIVTDATINFYFDEIWVNSGT 114 G G+ + G +S+ HAH + ++T + ++ + Sbjct: 66 VGTGAGLSSKALGQRGGTEYVSLTSAQLPAHAHMVDLKATGEVNVKMSASSAKGDTAIPG 125 Query: 115 DIIKRGNTNDAGLPAPDYGTFK-------TYKQSVDGLGAAA--SETRP-----RNIAFN 160 +P Y T +V+ G A RP +A N Sbjct: 126 PTTVPAQVLSGLIPLNAYSTSPDTTLLPVNTSTTVNVSGNTAMMGAGRPVVIEQPFLAIN 185 Query: 161 YIV 163 +I+ Sbjct: 186 FII 188 >UniRef50_C3X3K6 Predicted protein n=2 Tax=Oxalobacter formigenes HOxBLS RepID=C3X3K6_OXAFO Length = 500 Score = 84.0 bits (206), Expect = 2e-15, Method: Composition-based stats. Identities = 31/178 (17%), Positives = 54/178 (30%), Gaps = 23/178 (12%) Query: 2 GLGEGSALPVGVPVPWPSATPPTGWLKCNGAAFSAEEYPELAKAYP----------TNKL 51 + +PVG V + ++ P G+LKC+GAA + YP+L A T L Sbjct: 206 AVASQRGIPVGTVVMFSASEAPAGYLKCDGAAVGRDTYPDLFAAIGTVFGAGDGETTFNL 265 Query: 52 PDLRGEFIRGWDDGRGIDTGRSILSIQGYATEDHAHGLPSRSTIVTDAT----INFYFDE 107 PD+ G F G + +++ D + +A I Sbjct: 266 PDMIGRFAEG---------SLTPGTVKEAGLPDVTGTIRLSDNSQINAVEADKIATANGA 316 Query: 108 IWVNSGTDIIKRGNTNDAGLPAPDYGTFKTYKQSVDGLGAAASETRPRNIAFNYIVRA 165 + Y S + L + +P + ++A Sbjct: 317 FSRVRTNSPTAYSTASVDVATTNKYDRVDFSLASQNPLYGNSDTVQPPALTLLPCIKA 374 >UniRef50_C7PNC2 Tail Collar domain protein n=1 Tax=Chitinophaga pinensis DSM 2588 RepID=C7PNC2_CHIPD Length = 184 Score = 84.0 bits (206), Expect = 2e-15, Method: Composition-based stats. 
Identities = 27/172 (15%), Positives = 46/172 (26%), Gaps = 16/172 (9%) Query: 11 VGVPVPWPSATPPTGWLKCNGAAFSAEEYPELAKAYP---------TNKLPDLRGEFIRG 61 +G + PTGWL CNG + + Y L T +PDLRG G Sbjct: 6 IGEIRAFGFNFTPTGWLPCNGGLYPIQSYSTLFAILGTNFGGNGTTTFAVPDLRGVAAIG 65 Query: 62 WDDG------RGIDTGRSILSIQGYATEDHAHGLPSRS-TIVTDATINFYFDEIWVNSGT 114 + G+ G +++ H H + + T + Sbjct: 66 INLQNPSFGVPGVKGGSENVTLTIATIPAHTHMMQAVVRTSLAQTAAAISQPGPNAYLTN 125 Query: 115 DIIKRGNTNDAGLPAPDYGTFKTYKQSVDGLGAAASETRPRNIAFNYIVRAA 166 + G + + +A Y + A+ Sbjct: 126 AFSSGPSKGVVAYSNNTSGATLNPQAIGITGSSTPHNNMDPYLAMTYCICAS 177 >UniRef50_B0UTN0 Phage Tail Collar domain protein n=1 Tax=Haemophilus somnus 2336 RepID=B0UTN0_HAES2 Length = 699 Score = 83.6 bits (205), Expect = 2e-15, Method: Composition-based stats. Identities = 37/194 (19%), Positives = 63/194 (32%), Gaps = 34/194 (17%) Query: 6 GSALPVGVPVPWPSA-TPPTGWLKCNGAAFSAEEYPELAKAYPTNK-LPDL--------- 54 G +P+G V +P A T PTG+LKC+G YP+L + LP+L Sbjct: 347 GDGVPLGAIVAFPKAITNPTGFLKCDGTTIDQRTYPDLYRTLGNKNTLPNLTRSDVGMTA 406 Query: 55 ---RGEFIRGW------DDGRGIDTGRSILSIQGYATEDHAHGLPSRSTIVTDATINFYF 105 GW + DT + + + + +A Sbjct: 407 YFATDNIPDGWIAFDEIKEKVKEDTYPELYKYLIEKYTSIDNVPKAEDRFLRNAANELVV 466 Query: 106 D------------EIWVNSGTDIIKRGNTNDAGLPAPDYGTFKTYKQSVDGLG--AAASE 151 N + + N+ + + T YK +G + A E Sbjct: 467 GRVQEDAIKTHYLNYGTNHNSSNYQFHVDNNDTIATGNNKTTDNYKIRTNGAIFYSGAEE 526 Query: 152 TRPRNIAFNYIVRA 165 TRP+++ ++A Sbjct: 527 TRPKSLVLKLCIKA 540 >UniRef50_UPI00016C4891 hypothetical protein GobsU_00190 n=1 Tax=Gemmata obscuriglobus UQM 2246 RepID=UPI00016C4891 Length = 252 Score = 83.6 bits (205), Expect = 2e-15, Method: Composition-based stats. 
Identities = 37/199 (18%), Positives = 59/199 (29%), Gaps = 41/199 (20%) Query: 7 SALPVGVPVPWPSATPPT------------GWLKCNGAAFS-------AEEYPELAKAYP 47 ++ PVG + PP GWL C+G + + EL Sbjct: 52 TSPPVGTVTAFAGTWPPKRSDGGVWTEAEIGWLLCDGRKWEDKSLDGVRADLWELRAVLD 111 Query: 48 TNK------------LPDLRGEFIRGWDD-------GRGIDTGRSILSIQGYATEDHAHG 88 LPD RG F+RG D GR R++ QGYAT A Sbjct: 112 GPNYIPQRSAPHALHLPDYRGYFLRGLDTSPFMGPAGRDKGEPRTVGLSQGYATARPAGK 171 Query: 89 LPSRSTIVTDATINFYFDEIWVNSGTDIIKRGNTNDAGLPAPDYGTFK---TYKQSVDGL 145 + + + + +T + + Sbjct: 172 DAFTTDKKGAHSHPLKMELKASRKAGGAAENAHTVTSINDENKKSQLPLEKDGDHVHEIT 231 Query: 146 GAAASETRPRNIAFNYIVR 164 G +ETRP N+ ++++ Sbjct: 232 GGGDAETRPVNVVVYWVIK 250 >UniRef50_A9LZ37 Tail fibre protein, putative n=21 Tax=Neisseria RepID=A9LZ37_NEIM0 Length = 658 Score = 83.6 bits (205), Expect = 2e-15, Method: Composition-based stats. Identities = 38/178 (21%), Positives = 58/178 (32%), Gaps = 26/178 (14%) Query: 2 GLGEGSALP------VGVPVPWPSATPPTGWLKCNG--AAFSAEEYPELAKAYPTN---- 49 LG + LP +G+ +PS PTGWL + + YPEL + Sbjct: 287 ALGNSNRLPDLSRTDIGITAWFPSDQIPTGWLAFDDIRTRVTETAYPELYRLLTGKYGSI 346 Query: 50 -KLPDLRGEFIRGWDDGRGIDTGRSILSIQGYATEDHAHGLPSRSTIVTDATINFYFDEI 108 +P FIR ++ + Q + H H + S T TDA Y D Sbjct: 347 QNVPQAEDRFIR------NAGNSLAVGTKQEDEIKRHVHKVFSHWTNHTDAAALGYEDRN 400 Query: 109 WVNSGTDIIKRGNTNDAGLPAPDYGTFKTYKQSVDGLGAAASETRPRNIAFNYIVRAA 166 + + N D G + E RP+ + ++AA Sbjct: 401 ERQRSALVSTWTDEN-----LNDNGFLTPRSD--SKMATGGDENRPKALVLKLCIKAA 451 >UniRef50_B9M3Z9 Tail Collar domain protein n=2 Tax=Bacteria RepID=B9M3Z9_GEOSF Length = 184 Score = 83.6 bits (205), Expect = 2e-15, Method: Composition-based stats. 
Identities = 24/169 (14%), Positives = 42/169 (24%), Gaps = 16/169 (9%) Query: 11 VGVPVPWPSATPPTGWLKCNGAAFSAEEYPELAKAY---------PTNKLPDLRGEFIRG 61 +G + P GW C+G + ++ L T LP+L G G Sbjct: 6 IGEIRAFAFTYAPYGWATCDGQIMNVQQNTALFSIISNTYGGDGRTTFGLPNLSGRAPMG 65 Query: 62 WDDGRGIDT-------GRSILSIQGYATEDHAHGLPSRSTIVTDATINFYFDEIWVNSGT 114 + G + G + +S+ H HG + S S Sbjct: 66 FGTGPALTPQTLGQSLGEASVSLATNNFPPHTHGFNAVSNTTATLATAADSYVAKAPSSG 125 Query: 115 DIIKRGNTNDAGLPAPDYGTFKTYKQSVDGLGAAASETRPRNIAFNYIV 163 N+ A + G + + + Sbjct: 126 KPPVTTNSFQPSASAGTQLAADAVLLAGTAPGTMPHKNMQPYLPVLLCI 174 >UniRef50_A9C0W7 Tail Collar domain protein n=6 Tax=Proteobacteria RepID=A9C0W7_DELAS Length = 166 Score = 83.6 bits (205), Expect = 2e-15, Method: Composition-based stats. Identities = 30/163 (18%), Positives = 45/163 (27%), Gaps = 23/163 (14%) Query: 12 GVPVPWPSATPPTGWLKCNGAAFSAEEYPELAKAYPT---------NKLPDLRGEFIRGW 62 G + P GWL CNGA L T LPDLRG G+ Sbjct: 6 GEIRAFAFGQVPRGWLLCNGAILPISTNQALFALLGTQYGGNGTSNFALPDLRGRAPIGY 65 Query: 63 DDGRGIDT--GRSILSIQGYATEDHAHGLPSRSTIVTDATINFYFDEIWVNSGTDIIKRG 120 G + G +++ H H L S + + N + Sbjct: 66 GGGVVLGLIDGTESVTLIPSQMPLHTHQLLSSAAV------------ATTNVPGGNVMAE 113 Query: 121 NTNDAGLPAPDYGTFKTYKQSVDGLGAAASETRPRNIAFNYIV 163 N +F G+ E ++ N+ + Sbjct: 114 AANGLSAYGAPTNSFMAAPAVSTSGGSQPHENMQPSLVINWCI 156 >UniRef50_C6X0H3 Microcystin dependent protein n=1 Tax=Flavobacteriaceae bacterium 3519-10 RepID=C6X0H3_FLAB3 Length = 188 Score = 83.2 bits (204), Expect = 2e-15, Method: Composition-based stats. 
Identities = 27/151 (17%), Positives = 41/151 (27%), Gaps = 13/151 (8%) Query: 12 GVPVPWPSATPPTGWLKCNGAAFSAEEYPELAKAYP---------TNKLPDLRGEFIRGW 62 G P GW +CNG + L T LPD+RG + Sbjct: 42 GQIAFVAFTFAPKGWAECNGQLLPISQNTALFSLLGTTYGGNGQTTFALPDMRGRVLIHN 101 Query: 63 DDGRGIDTGRSILSIQGYATEDHAHGLPSRSTIVTDATINFYFDEIWVNSGTDIIKRGNT 122 G G+ + Q TE+H + + + V +G ++ Sbjct: 102 GQGNGLSN-YELG--QTGGTENHTLTIAEMPQHIHNVNAVSAEGNQNVPTG-NLPANTKA 157 Query: 123 NDAGLPAPDYGTFKTYKQSVDGLGAAASETR 153 D T G+ E R Sbjct: 158 LDKEYADSTANTTMNLGMISPAGGSQPHENR 188 >UniRef50_D2QTE9 Tail Collar domain protein n=1 Tax=Spirosoma linguale DSM 74 RepID=D2QTE9_9SPHI Length = 172 Score = 83.2 bits (204), Expect = 3e-15, Method: Composition-based stats. Identities = 30/168 (17%), Positives = 49/168 (29%), Gaps = 29/168 (17%) Query: 11 VGVPVPWPSATPPTGWLKCNGAAFSAEEYPELAKAY---------PTNKLPDLRGEFIRG 61 +G + + G++ CNG +Y L T LPDLRG Sbjct: 5 IGQIILFAGNYEIRGYVFCNGQLLDISKYTALYSLLGTTYGGNGTTTFGLPDLRGRMPIH 64 Query: 62 WDDGRGIDT---GRSILSIQG----YATEDHAHGLPSRSTIVTDATINFYFDEIWVNSGT 114 + G + G+ S + H H L + S T + Sbjct: 65 FGQEPGKRSYVLGQRSGSYETTLTVDNLPAHNHALNAFSETGTASAP------------A 112 Query: 115 DIIKRGNTNDAGLPAPDYGTFKTYKQSVDGLGAA-ASETRPRNIAFNY 161 + PD + +++ G +T P +A NY Sbjct: 113 GALLANTGLGDTEYLPDGTLVQMSTKAIGKTGNGRPVDTMPPYLALNY 160 >UniRef50_A1SXZ3 Phage Tail Collar domain protein n=4 Tax=Bacteria RepID=A1SXZ3_PSYIN Length = 195 Score = 83.2 bits (204), Expect = 3e-15, Method: Composition-based stats. 
Identities = 27/168 (16%), Positives = 43/168 (25%), Gaps = 29/168 (17%) Query: 12 GVPVPWPSATPPTGWLKCNGAAFSAEEYPELAKAYP---------TNKLPDLRGEFIRGW 62 G P P GW C+G ++ L T LPD RG + Sbjct: 31 GEIAWVPYNFAPRGWASCDGQLLPITQHNALFSLLGTVYGGDGRTTFALPDARGRVMIHE 90 Query: 63 DDGRGIDT-------GRSILSIQGYATEDHAHGLPSRSTIVTDATINFYFDEIWVNSGTD 115 G G+ G +++Q H H + S + + Sbjct: 91 GQGPGLTNRRLGDKWGEEQVTLQTSQIPSHTHRQQASSGSPSSTSPEENV---------- 140 Query: 116 IIKRGNTNDAGLPAPDYGTFKTYKQSVDGLGAAASETRPRNIAFNYIV 163 + + L A D + G A + I+ Sbjct: 141 ---LASPSRTQLYADDADIDMSADNISYTGGNLAHNNMQPYTTLHCII 185 >UniRef50_Q21FE8 Phage Tail Collar n=9 Tax=Bacteria RepID=Q21FE8_SACD2 Length = 176 Score = 82.5 bits (202), Expect = 5e-15, Method: Composition-based stats. Identities = 25/169 (14%), Positives = 43/169 (25%), Gaps = 24/169 (14%) Query: 11 VGVPVPWPSATPPTGWLKCNGAAFSAEEYPELAKAYPTN---------KLPDLRGEFIRG 61 + + + P W C+G L T LP+L+G Sbjct: 6 IAEVRIFAGSFAPRDWAFCDGQLLPIANNAALFSLIGTTYGGDGRATVGLPNLQGRAAMH 65 Query: 62 WDDGRGIDTGR-------SILSIQGYATEDHAHGLPSRSTIVTDATINFYFDEIWVNSGT 114 G G+ R +++ H H L S+ + + G Sbjct: 66 PGRGPGLTARRLGERVGVETVTLNEAQIPSHTHQLRGSSSAGAETVPSPTASMAQPRGGR 125 Query: 115 DIIKRGNTNDAGLPAPDYGTFKTYKQSVDGLGAAASETRPRNIAFNYIV 163 + T A G+ + IA N+I+ Sbjct: 126 IYNNQAQTQPLESLASLAAGD--------TGGSQSHNNMQPYIAMNFII 166 >UniRef50_C3X8V5 Bacteriophage tail fiber protein n=2 Tax=Oxalobacter formigenes OXCC13 RepID=C3X8V5_OXAFO Length = 480 Score = 82.5 bits (202), Expect = 5e-15, Method: Composition-based stats. 
Identities = 37/169 (21%), Positives = 57/169 (33%), Gaps = 27/169 (15%) Query: 8 ALPVGVPVPWPSATPPTGWLKCNGAAFSAEEYPELAKAYP----------TNKLPDLRGE 57 +PVG + ++TPP G+LK +GAA E YP+L A T LPDL G Sbjct: 195 GVPVGTIEYFATSTPPAGYLKADGAAVGRETYPDLFAAIGTAFGEGDGSTTFNLPDLIGR 254 Query: 58 FIRGWDDGRGIDTGRSILSIQGYATEDHAHGLPSRSTIVTDATINFYFDEIWVNSGTDII 117 F +G D + L Y +G+ + Sbjct: 255 FAQGSD---------VPGQKLEAGLPNAIGKLS-----GFFGFTPVYKSGALSTTGSAGV 300 Query: 118 KRGNTNDAGLPAPDYGTFKTYKQSVDGLGAAASET-RPRNIAFNYIVRA 165 + AG + + S AS+T +P + ++A Sbjct: 301 QFETIGVAGGASSNK--IINLDLSESNPIYGASDTVQPPALTLLPCIKA 347 >UniRef50_A6EAB9 Microcystin-dependent protein n=1 Tax=Pedobacter sp. BAL39 RepID=A6EAB9_9SPHI Length = 198 Score = 82.1 bits (201), Expect = 6e-15, Method: Composition-based stats. Identities = 24/170 (14%), Positives = 45/170 (26%), Gaps = 26/170 (15%) Query: 12 GVPVPWPSATPPTGWLKCNGAAFSAEEYPELAKAYP---------TNKLPDLRGEFIRGW 62 G + + P GW C+G+ + L T LPDLRG G Sbjct: 27 GEIRAFACSYAPEGWALCDGSLLPLSQNQALYSLLGTRFGGNGTTTFALPDLRGRVPVGT 86 Query: 63 DDGR---------GIDTGRSILSIQGYATEDHAHGLPSRSTIVTDATINFYFDEIWVNSG 113 G + G ++++ H H + +++ + + S Sbjct: 87 GVRGASPAYTYTIGNNGGSETVALETATMPPHNHYVSAKNALGSVGLAGGILAIPNGGST 146 Query: 114 TDIIKRGNTNDAGLPAPDYGTFKTYKQSVDGLGAAASETRPRNIAFNYIV 163 I + PD + + N+ + Sbjct: 147 QVNIYNTSAGATTTLNPDTVG--------NTGAGSPHSNMQPFQTINFCI 188 >UniRef50_C3XAA4 Putative uncharacterized protein n=1 Tax=Oxalobacter formigenes OXCC13 RepID=C3XAA4_OXAFO Length = 305 Score = 82.1 bits (201), Expect = 6e-15, Method: Composition-based stats. 
Identities = 41/174 (23%), Positives = 59/174 (33%), Gaps = 42/174 (24%) Query: 2 GLGEGSALPVGVPVPWPSATPPTGWLKCNGAAFSAEEYPELAKAYP----------TNKL 51 G+ S +PVG + T P G+LK NGAA E YPEL T L Sbjct: 7 GITPSSGVPVGTIEYFAMVTSPAGYLKANGAAVGRETYPELYATIGTTFGEGDGSSTFNL 66 Query: 52 PDLRGEFIRGWDDGRGIDTGRSILSIQGYATEDHAHGLPSRSTIVTDATINFYFDEIWVN 111 PDL F +G + G+ I DH H LP Sbjct: 67 PDLIDRFAQGSNT-----PGQKI----EAGLSDHNHTLPL-------------------- 97 Query: 112 SGTDIIKRGNTNDAGLPAPDYGTFKTYKQSVDGLGAAASETRPRNIAFNYIVRA 165 + + G A GT Y + + + A++ +P + ++A Sbjct: 98 ---ALEETGTGYAAHGSNISSGTTVGYASASNPIYGASNTVQPPALTLLPCIKA 148 >UniRef50_B5JF21 Phage Tail Collar Domain family n=1 Tax=Verrucomicrobiae bacterium DG1235 RepID=B5JF21_9BACT Length = 373 Score = 82.1 bits (201), Expect = 7e-15, Method: Composition-based stats. Identities = 33/165 (20%), Positives = 54/165 (32%), Gaps = 25/165 (15%) Query: 9 LPVGVPVPWPSAT--PPTGWLKCNGAAFSAEEYPELAKAYPTNKLPDLRGEFIRGWDDGR 66 +P+G + W +T P GW CNG+ P+LR FI G G Sbjct: 223 IPIGGIIMWSGSTSNIPAGWRLCNGS----------------GGTPNLRDRFIVGAGGGY 266 Query: 67 GIDT--GRSILSIQGYATEDHAHGLPSRSTIVTDATINFYFDEIWVNSGTDIIKRGNTND 124 G++ G S +++ H H + +RS + +G + +D Sbjct: 267 GVNATGGASSVTLTTAQMPSHDHDVWTRSGDGKHWHYRSVDNSAPHPNGDGRVDLSTESD 326 Query: 125 AGLPAPDYGTFKTYKQSV-----DGLGAAASETRPRNIAFNYIVR 164 + + A E RP A +I+R Sbjct: 327 NANWNANLNSSADGAHQHYVDTPKRGSGQAHENRPPYYALAFIMR 371 >UniRef50_Q93D60 PilV n=7 Tax=Escherichia coli RepID=Q93D60_ECOLX Length = 456 Score = 81.7 bits (200), Expect = 8e-15, Method: Composition-based stats. 
Identities = 50/160 (31%), Positives = 67/160 (41%), Gaps = 39/160 (24%) Query: 7 SALPVGVPVPWPSATPPTGWLKCNGAAFSAEEYPELAKAYPTNKLPDLRGEFIRGWDDGR 66 + PVG P+PWPSATPP G+L NG +FS YP+LA+AYP KLPDLR F Sbjct: 336 ESYPVGSPIPWPSATPPQGYLVMNGQSFSCSRYPQLARAYPGCKLPDLRRCF-------- 387 Query: 67 GIDTGRSILSIQGYATEDHAHGLPSRSTIVTDATINFYFDEIWVNSGTDIIKRGNTNDAG 126 + + T + E + I G ++ G Sbjct: 388 --------------------------YSWLGQRTWAGWRSEQTSPELSRPINPGVYHERG 421 Query: 127 LPAPDYGTFKTYKQSVDGLGAAASETRPRNIAFNYIVRAA 166 + + Y G A P+NIAFNYIV+A+ Sbjct: 422 GWLKGHHSGMAYLGPGKHNGNA-----PQNIAFNYIVKAS 456 >UniRef50_Q03314 Protein rhiB n=2 Tax=Rhizobium leguminosarum bv. viciae RepID=RHIB_RHILV Length = 219 Score = 81.3 bits (199), Expect = 9e-15, Method: Composition-based stats. Identities = 43/217 (19%), Positives = 69/217 (31%), Gaps = 85/217 (39%) Query: 7 SALPVGVPVPWPSATPP----------------------------------TGWLKCNGA 32 + P+G P+ P GW+ C+G Sbjct: 27 AGPPIGAVCPFAGQVAPISSSVNTIWSNTPCASSGEAAGTNAEAPISYVEAQGWMLCDGR 86 Query: 33 AFSAEEYPELAKAYP------------TNKLPDLRGEFIRGWDDGRGIDT---------- 70 A YPEL ++PD RG F+RG+D G G+D Sbjct: 87 YLRAAVYPELYAVLGGLYGERNSTADLEFRIPDYRGLFLRGFDAGGGMDPDAKRRLDPTG 146 Query: 71 ---GRSILSIQGYATEDHAHGLPSRSTIVTDATINFYFDEIWVNSGTDIIKRGNTNDAGL 127 + S+Q A + HAH + + I ++GN + Sbjct: 147 NNVANVVGSLQCDALQVHAHP-------------------YEITTPAGISQQGNAAGTSI 187 Query: 128 PAPDYGTFKTYKQSVDGLGAAASETRPRNIAFNYIVR 164 + G+ + A ETRP+N+A NY+++ Sbjct: 188 SSKSTGSPENP-------ARTALETRPKNVAVNYLIK 217 >UniRef50_D0LMW0 Tail Collar domain protein n=1 Tax=Haliangium ochraceum DSM 14365 RepID=D0LMW0_HALO1 Length = 264 Score = 80.9 bits (198), Expect = 1e-14, Method: Composition-based stats. 
Identities = 28/171 (16%), Positives = 47/171 (27%), Gaps = 37/171 (21%) Query: 11 VGVPVPWPSATPPTGWLKCNGAAFSAEEYPELAKAYP----------TNKLPDLRGEFIR 60 G + T P GWL C+G+ ++YPEL A T LPD RG + Sbjct: 112 AGTLALSAAETAPDGWLFCDGSPLIRDDYPELFAAIGETYGAGDGVNTFVLPDCRGRTLI 171 Query: 61 GWDDGRGIDT-------GRSILSIQGYATEDHAHGLPSRSTIVTDATINFYFDEIWVNSG 113 G G G+ G ++ H H Sbjct: 172 GAGQGNGLSDRQRGDVVGAEEHTLTIPEMPSHTH--------------------AEHPGT 211 Query: 114 TDIIKRGNTNDAGLPAPDYGTFKTYKQSVDGLGAAASETRPRNIAFNYIVR 164 + + G + + + G ++ +I++ Sbjct: 212 GTLWFQVFERGPGTWPNERSGNTLGQSTGATGGNQPHNIMQPSLTVQFIIK 262 >UniRef50_C0YLU9 Phage tail collar domain-containing protein n=1 Tax=Chryseobacterium gleum ATCC 35910 RepID=C0YLU9_9FLAO Length = 183 Score = 80.9 bits (198), Expect = 1e-14, Method: Composition-based stats. Identities = 27/169 (15%), Positives = 46/169 (27%), Gaps = 16/169 (9%) Query: 11 VGVPVPWPSATPPTGWLKCNGAAFSAEEYPELAKAYP---------TNKLPDLRGEFIRG 61 +G+ + P GW+ C+G+ S L T LP+L+G G Sbjct: 5 IGIVKLFAGNFAPRGWMFCDGSLLSISRNSALFSILGTTYGGDGITTFALPNLKGRMALG 64 Query: 62 WDD---GRGIDTGRSILSIQ----GYATEDHAHGLPSRSTIVTDATINFYFDEIWVNSGT 114 + G G + Q G + + SGT Sbjct: 65 AGNVNSGENYPLGIVSGTTQNTLLSSNLPSIGAGFQLKVANKNANSSTPTATSTIAISGT 124 Query: 115 DIIKRGNTNDAGLPAPDYGTFKTYKQSVDGLGAAASETRPRNIAFNYIV 163 + + N + + + T P + NYI+ Sbjct: 125 QVGRDFNAVPSFVNDANPDTTINPLSISFTGQGLPLNNMPPYLGLNYII 173 >UniRef50_Q2RUE0 Phage Tail Collar n=1 Tax=Rhodospirillum rubrum ATCC 11170 RepID=Q2RUE0_RHORT Length = 178 Score = 80.9 bits (198), Expect = 1e-14, Method: Composition-based stats. 
Identities = 24/167 (14%), Positives = 42/167 (25%), Gaps = 22/167 (13%) Query: 12 GVPVPWPSATPPTGWLKCNGAAFSAEEYPELAKAYP---------TNKLPDLRGEFIRGW 62 G + P ++ C+G+ ++ L T +PDLRG Sbjct: 7 GEIRLFAGNWVPQQFVACDGSLLPIKQNEALFAVLGTVFGGDSVATFGVPDLRGRVPLHK 66 Query: 63 DDGRGIDT-------GRSILSIQGYATEDHAHGLPSRSTIVTDATINFYFDEIWVNSGTD 115 G G+ G +++ H H L + S T + T Sbjct: 67 GQGTGLTPRVLGQAVGTETVTVAAAEMPAHNHTLFACSDPATANSPINALPATAPAGYTQ 126 Query: 116 IIKRGNTNDAGLPAPDYGTFKTYKQSVDGLGAAASETRPRNIAFNYI 162 + G A ++A +I Sbjct: 127 FVTNQPGQTLTKLQ------LAEAAISTEGGGLAHANVMPSLALTFI 167 >UniRef50_A3GRX3 Tail fiber like-protein n=1 Tax=Vibrio cholerae NCTC 8457 RepID=A3GRX3_VIBCH Length = 182 Score = 80.9 bits (198), Expect = 1e-14, Method: Composition-based stats. Identities = 41/175 (23%), Positives = 63/175 (36%), Gaps = 26/175 (14%) Query: 9 LPVGVPVPWPSATPPTGWLKCNGAAFSAEEYPELAKAYPTNKLPDLRGEFIRGWDDGRGI 68 PVG +PW + P G+ G AF Y ELAK +P +PD+RG + G +DG Sbjct: 17 FPVGGAIPWFTDVAPEGFGMFKGQAFDVNVYTELAKVFPNGIIPDMRGCGVIGKEDGE-- 74 Query: 69 DTGRSILSIQGYATEDHAHGLPSRSTIVTDATINFYFDEIWVNSG----TDIIKRGNTND 124 ++ + + ++H H + S+I + SG R T+ Sbjct: 75 ----AVGAYEEGQVKNHGHPNSTVSSIDLGSKNTANGGNHTHFSGIAAFGGGSHRYQTDV 130 Query: 125 AGLPAPDYGTFKTYKQSVDGLGAAASETRPRNIAF-------------NYIVRAA 166 G + +G+ A IA N+IVR A Sbjct: 131 NGSGGNINTSAAGNHYHSIPMGSHAHAVT---IALFGALKNTINHRKINWIVRLA 182 >UniRef50_B3PJI6 Microcystin dependent protein; MdpB n=1 Tax=Cellvibrio japonicus Ueda107 RepID=B3PJI6_CELJU Length = 175 Score = 80.9 bits (198), Expect = 1e-14, Method: Composition-based stats. 
Identities = 28/168 (16%), Positives = 54/168 (32%), Gaps = 25/168 (14%) Query: 12 GVPVPWPSATPPTGWLKCNGAAFSAEEYPELAKAYP---------TNKLPDLRGEFIRGW 62 G +P + P + C G + ++ L T LP+L+G+ + Sbjct: 7 GEIRQFPYSFAPRNFSYCQGQILTIQQNAPLFSLLGTLYGGNGQTTFALPNLQGQVLMHQ 66 Query: 63 DDGRGIDT-------GRSILSIQGYATEDHAHGLPSRSTIVTDATINFYFDEIWVNSGTD 115 G G+ G + +S+ +H H + +++ + V Sbjct: 67 GSGPGLTPRTVGESSGSAGVSLIQAEMPNHNHLMVAKT--------VNPAASVNVAEDAY 118 Query: 116 IIKRGNTNDAGLPAPDYGTFKTYKQSVDGLGAAASETRPRNIAFNYIV 163 + + + + SV G G AA E R IA + + Sbjct: 119 LSISRAQTAYSPQQDNLVSLEPTMLSVTGSG-AAHENRQPYIAMPFCI 165 >UniRef50_C5RJD9 Tail Collar domain protein n=1 Tax=Clostridium cellulovorans 743B RepID=C5RJD9_CLOCL Length = 199 Score = 80.5 bits (197), Expect = 2e-14, Method: Composition-based stats. Identities = 39/183 (21%), Positives = 54/183 (29%), Gaps = 32/183 (17%) Query: 13 VPVPWPSATPPTGWLKCNGAAFSAEEYPELAKAYP---------TNKLPDLRGEFIRGWD 63 + WP P GWL C G +Y L T KLPDLRG G Sbjct: 7 QIILWPGNFVPRGWLACEGQELPINQYTALYSLLGTTYGGNGSTTFKLPDLRGRVPVGSG 66 Query: 64 DGRGID------TGRSILSIQGYATEDHAHGLP-SRSTIVTDATINFYFDEIWVNSGTDI 116 GI+ G +++ H H ++ + + I F E N+ + Sbjct: 67 ICGGINFQQGNSGGNFNVTLTQQQMPAHTHSTTVTQGAVTVNGGIPFNGGEGTTNTPSAS 126 Query: 117 IKRGNTNDAGLPAPDYGTFKTYKQSVDG----------------LGAAASETRPRNIAFN 160 K AG P+ SV G G +A Sbjct: 127 SKLAVGITAGGDIPNIYNTSEATGSVTGSFTGQVTGTTVTVGSAGGNQGVNVMQPFLALR 186 Query: 161 YIV 163 YI+ Sbjct: 187 YII 189 >UniRef50_B3R3K1 Bacteriophage large tail fiber protein n=1 Tax=Cupriavidus taiwanensis RepID=B3R3K1_CUPTR Length = 1045 Score = 80.5 bits (197), Expect = 2e-14, Method: Composition-based stats. 
Identities = 44/220 (20%), Positives = 65/220 (29%), Gaps = 65/220 (29%) Query: 11 VGVPVPWPSATPPTGWLKCNGAAFSAEEYPEL--------------------------AK 44 VG + P T G LK NGA +YPEL Sbjct: 825 VGQIIIEPRTTARAGCLKLNGALLKRADYPELWAYAQASGAIVTDAAWLAGSWGCFSHGD 884 Query: 45 AYPTNKLPDLRGEFIRGWDDGRGIDTGRSILSIQGYATEDHAHGLPSRSTIVTDATI--- 101 T ++P+ RGE++R WDD RG D GR I Q + H+H + + Sbjct: 885 GNTTFRIPEYRGEYLRFWDDARGADAGRGIGVFQDSQNKTHSHAASATPVGDHNHGAWTD 944 Query: 102 ---------------------------------NFYFDEIWVNSGTDIIKRG---NTNDA 125 Y +GT G + + Sbjct: 945 AQGWHGHGVNDPGHAHSFQTWTGGGATGAGRVSGSYVTNADAWAGTSASYTGISIAGDGS 1004 Query: 126 GLPAPDYGTFKTYKQSVDGLGAAASETRPRNIAFNYIVRA 165 G + ++ +E R RNI+ ++RA Sbjct: 1005 HAHNVGVGYAGNHSHAITVNADGGAEVRVRNISALAMIRA 1044 >UniRef50_C3X8R9 Bacteriophage tail fiber protein n=6 Tax=Oxalobacter formigenes OXCC13 RepID=C3X8R9_OXAFO Length = 398 Score = 80.5 bits (197), Expect = 2e-14, Method: Composition-based stats. Identities = 31/166 (18%), Positives = 51/166 (30%), Gaps = 30/166 (18%) Query: 10 PVGVPVPWPSATPPTGWLKCNGAAFSAEEYPELAKAYP----------TNKLPDLRGEFI 59 P+G + A P+G+LK +GA E YP+L A T LPDL G F Sbjct: 108 PIGSIDYFAMAALPSGYLKADGAEVGRETYPDLFAAIGTVFGEGNGETTFNLPDLIGRFP 167 Query: 60 RGWDDGRGIDTGRSILSIQGYATEDHAHGLPSRSTIVTDATINFYFDEIWVNSGTDIIKR 119 +G + +++ FY +D Sbjct: 168 QG---------SARPGQRVQAGLPNITGKFRAKAAAGEIPGGAFYGIGNIGGGSSDNS-- 216 Query: 120 GNTNDAGLPAPDYGTFKTYKQSVDGLGAAASETRPRNIAFNYIVRA 165 AP+Y + + A+ +P + ++A Sbjct: 217 ---------APNYEEIGFDASKSNLIYGASDTVQPAALTLLACIKA 253 >UniRef50_A9C0W8 Tail Collar domain protein n=1 Tax=Delftia acidovorans SPH-1 RepID=A9C0W8_DELAS Length = 180 Score = 80.1 bits (196), Expect = 3e-14, Method: Composition-based stats. 
Identities = 25/168 (14%), Positives = 48/168 (28%), Gaps = 20/168 (11%) Query: 12 GVPVPWPSATPPTGWLKCNGAAFSAEEYPELAKAYPT---------NKLPDLRGEFIRGW 62 G + + P W C G ++ L LP+L+G G Sbjct: 7 GEVRAFGFSFAPVNWAFCAGQTILLQQNQALFAVIGNRFGGNGTTNFMLPNLQGRAAMGA 66 Query: 63 DDGRGIDT-------GRSILSIQGYATEDHAHGLPSRSTIVTDATINFYFDEIWVNSGTD 115 G G+ G + ++ H+HG+ ++ T A + + G Sbjct: 67 GTGPGLSPRDIGETMGTATETLTIAQIPPHSHGISAQGAGATTAIPSGNL----LAQGMQ 122 Query: 116 IIKRGNTNDAGLPAPDYGTFKTYKQSVDGLGAAASETRPRNIAFNYIV 163 + + A + + G A +A N+ + Sbjct: 123 GVPPRGSARPTYAAGPATSAMAPTALLPVGGNQAHSNMQPVLALNFCI 170 >UniRef50_Q8PR97 Microcystin dependent protein n=1 Tax=Xanthomonas axonopodis pv. citri RepID=Q8PR97_XANAC Length = 183 Score = 79.8 bits (195), Expect = 4e-14, Method: Composition-based stats. Identities = 28/163 (17%), Positives = 42/163 (25%), Gaps = 22/163 (13%) Query: 12 GVPVPWPSATPPTGWLKCNGAAFSAEEYPELAKAYP---------TNKLPDLRGEFIRGW 62 G + + P GW C+G Y L T LPDLRG Sbjct: 6 GQIILFAGNYEPQGWAFCDGRQLQINTYMALYSLIGTTYGGDGRTTFNLPDLRGRVAISQ 65 Query: 63 DDGR-------------GIDTGRSILSIQGYATEDHAHGLPSRSTIVTDATINFYFDEIW 109 G G G +S+Q H H L + ++ + T + Sbjct: 66 GQGIARAPTPQLTARVLGQQFGTETVSLQLAEMPAHRHTLQAFNSPASSLTPTGQLPAVT 125 Query: 110 VNSGTDIIKRGNTNDAGLPAPDYGTFKTYKQSVDGLGAAASET 152 T + + S A++T Sbjct: 126 QGGNTGYLTPPAGSTPAASTLATNAVNVAGASQPHDNHMATQT 168 >UniRef50_C7PNC4 Tail Collar domain protein n=1 Tax=Chitinophaga pinensis DSM 2588 RepID=C7PNC4_CHIPD Length = 192 Score = 79.4 bits (194), Expect = 4e-14, Method: Composition-based stats. 
Identities = 31/178 (17%), Positives = 46/178 (25%), Gaps = 26/178 (14%) Query: 12 GVPVPWPSATPPTGWLKCNGAAFSAEEYPELAKAYP---------TNKLPDLRGEFIRGW 62 G + P W CNGA + +Y L T LPDLR Sbjct: 6 GEIRLFAGNFAPVNWNVCNGALLAISQYDALFSLIGTQYGGDGITTFALPDLRVRVPISM 65 Query: 63 DDGR----------GIDTGRSILSIQGYATEDHAHGLPSRSTIVTDA-TINFYFDEIWVN 111 G G +++ +H H L + + T +N N Sbjct: 66 GQISASGGTGNYVLGQAAGTPNITLLTSNIPNHTHPLVAVNATATTGDPVNNMLAVTNGN 125 Query: 112 SGTDIIKRGNTNDA------GLPAPDYGTFKTYKQSVDGLGAAASETRPRNIAFNYIV 163 + T + N G G A + + NYI+ Sbjct: 126 NNTGPTAYPDVNLYTTLPLPGGGTTIPNALMDPASISPTGGTQAHDNMMPYVTINYII 183 >UniRef50_Q55EP2 Putative uncharacterized protein n=1 Tax=Dictyostelium discoideum RepID=Q55EP2_DICDI Length = 166 Score = 79.4 bits (194), Expect = 4e-14, Method: Composition-based stats. Identities = 25/167 (14%), Positives = 41/167 (24%), Gaps = 47/167 (28%) Query: 7 SALPVGVPVPWPSATPPTGWLKCNGAAFSAEEYPELAKAYPT--------NKLPDLRGEF 58 +++ G + P GW C+GA + YPEL + PD RG+ Sbjct: 23 NSIQPGSVNIFTGIEIPVGWSLCDGAPLNKLTYPELYRQIGDAFGSSEHEFSKPDFRGKC 82 Query: 59 IRGWDDGRGIDTGRSILSIQGYATEDHAHGLPSRSTIVTDATINFYFDEIWVNSGTDIIK 118 G +G G+ +S H H + F + Sbjct: 83 PIGAGNGVGLTNHLLTVS----ELPSHDHPVIDPGHTWHSIGGGFSSGPHGSRGES---- 134 Query: 119 RGNTNDAGLPAPDYGTFKTYKQSVDGLGAAASETRPRNIAFNYIVRA 165 I N+I++ Sbjct: 135 -------------------------------HNIMQPYITINFIIKL 150 >UniRef50_C6MD19 Tail Collar domain protein n=2 Tax=Proteobacteria RepID=C6MD19_9PROT Length = 241 Score = 79.4 bits (194), Expect = 4e-14, Method: Composition-based stats. 
Identities = 25/159 (15%), Positives = 45/159 (28%), Gaps = 16/159 (10%) Query: 11 VGVPVPWPSATPPTGWLKCNGAAFSAEEYPELAKAYP---------TNKLPDLRGEFIRG 61 VG P GW +C+G +Y L T LPD+RG+ Sbjct: 35 VGEISYVAFNFAPQGWYQCDGQILPINQYQALFSLLGTNYGGDGTTTFALPDMRGKVPVH 94 Query: 62 WDDGR-------GIDTGRSILSIQGYATEDHAHGLPSRSTIVTDATINFYFDEIWVNSGT 114 G +G +++ H H + S + + Sbjct: 95 QGQHPGGSMFTLGQTSGAENVTLTLNNMPAHNHPATATSASTSALAPGGTATSTLKAVNS 154 Query: 115 DIIKRGNTNDAGLPAPDYGTFKTYKQSVDGLGAAASETR 153 D + ++ A + + + +A+ ET Sbjct: 155 DADIKTAAGNSLANAKGLNSAYSASAPNVSMSSASIETT 193 >UniRef50_B7JYU3 Tail Collar domain protein n=26 Tax=Bacteria RepID=B7JYU3_CYAP8 Length = 189 Score = 79.4 bits (194), Expect = 4e-14, Method: Composition-based stats. Identities = 29/174 (16%), Positives = 47/174 (27%), Gaps = 21/174 (12%) Query: 11 VGVPVPWPSATPPTGWLKCNGAAFSAEEYPELAKAYP---------TNKLPDLRGEFIRG 61 V + P + C+G + L T LPDLRG Sbjct: 6 VAEIKMFGGNFAPVNYAFCDGQLMPISQNSALFSLLGTTYGGNGISTFALPDLRGRVPMH 65 Query: 62 WDDGRGIDT-------GRSILSIQGYATEDHAHGLPSRSTIVTDATINFYFDEIWVNSGT 114 +G G+ G +++ H H S +T V A + + Sbjct: 66 PGNGPGLSPRVLGESDGSETVTLLSNNVPSHTHTASSTATSVMRANNTATDNALDPTGRG 125 Query: 115 DIIKRGNTNDAGLP-----APDYGTFKTYKQSVDGLGAAASETRPRNIAFNYIV 163 I + P A T S + G A + ++I+ Sbjct: 126 LGIADRDVYVNAAPNVDMIAGSVNTTVNTTISPNTGGNAPVSIMQPFLCVSFII 179 >UniRef50_B4D821 Tail Collar domain protein n=3 Tax=Bacteria RepID=B4D821_9BACT Length = 179 Score = 79.0 bits (193), Expect = 5e-14, Method: Composition-based stats. 
Identities = 26/173 (15%), Positives = 41/173 (23%), Gaps = 21/173 (12%) Query: 7 SALPVGVPVPWPSATPPTGWLKCNGAAFSAEEYPELAKAYPT---------NKLPDLRGE 57 S V +P PTGW C+G + L T LP+++G Sbjct: 2 SNPFVAEIRIFPFNFAPTGWAFCDGQILPLSQNTALFSLLGTTYGGDGKSNFALPNMQGN 61 Query: 58 FIRGWDDGR-------GIDTGRSILSIQGYATEDHAHGLPSRSTIVTDATINFYFDEIWV 110 G G G +S+ H HG+ R+ + Sbjct: 62 APMHPGQGPSLSLHDLGETGGSDTVSLLESEIPSHNHGMKVRNLAPPSVLPAPAPANAFG 121 Query: 111 NSGTDIIKRGNTNDAGLPAPDYGTFKTYKQSVDGLGAAASETRPRNIAFNYIV 163 S T + G + N+ + Sbjct: 122 RSNGGAAYATYTAGTS-----NIGAMDPRVIAPAGGDQPHNNLMPYLTLNFCI 169 >UniRef50_C2FWA1 Phage tail collar domain protein n=2 Tax=Sphingobacterium spiritivorum RepID=C2FWA1_9SPHI Length = 199 Score = 79.0 bits (193), Expect = 6e-14, Method: Composition-based stats. Identities = 27/183 (14%), Positives = 46/183 (25%), Gaps = 32/183 (17%) Query: 13 VPVPWPSATPPTGWLKCNGAAFSAEEYPELAKAYP---------TNKLPDLRGEFIRGWD 63 + P W CNG + L T +LPD RG G Sbjct: 7 EIRLFAGNFAPKYWALCNGQTLAINTNQALFSLLGTTYGGNGVTTFQLPDFRGRIPVGTG 66 Query: 64 DGR---------GIDTGRSILSIQGYATEDHAHGLPSRSTIVTDATINFYFDEIWVNSGT 114 G+ G G +++ H H + + + + ++ + Sbjct: 67 SGQTVGMGTITSGQKVGTESVTLVENNLPAHFHQVSISGKLPLTVSKSIADKSTVIDGLS 126 Query: 115 DIIKRGNTNDAGLPAPDYGTFKT--------------YKQSVDGLGAAASETRPRNIAFN 160 + A P Y + G A + R + N Sbjct: 127 LAAPARSAGRAKTPTLGYNSQTGSISLNPASIDLSGMTLTLAPIGGNAVHDNRQPALGLN 186 Query: 161 YIV 163 YI+ Sbjct: 187 YII 189 >UniRef50_Q2RUD9 Phage Tail Collar n=1 Tax=Rhodospirillum rubrum ATCC 11170 RepID=Q2RUD9_RHORT Length = 186 Score = 77.8 bits (190), Expect = 1e-13, Method: Composition-based stats. 
Identities = 30/166 (18%), Positives = 44/166 (26%), Gaps = 14/166 (8%) Query: 11 VGVPVPWPSATPPTGWLKCNGAAFSAEEYPELAKAYP---------TNKLPDLRGEFIR- 60 +G + PP GW +C+G A + L T LPDLRG Sbjct: 12 IGELRLFGFDYPPVGWAQCDGQALPIGQNQALYALLGIQFGGDPRTTFNLPDLRGRVALA 71 Query: 61 ---GWDDGRGIDTGRSILSIQGYATEDHAHGLPSRSTIVTDATINFYFDEIWVNSGTDII 117 G G+ G + E A + + D + I + Sbjct: 72 DSWGTPLPAGVTPGTHYATGDKGGAETVALTQATVPSHSHDIGASSTSGTIPNPANAYFA 131 Query: 118 KRGNTNDAGLPAPDYGTFKTYKQSVDGLGAAASETRPRNIAFNYIV 163 T SV G G ++ NY + Sbjct: 132 DAAGTLTTYCVGGTTVGLDPASVSVAGAG-QGHANSQPSLVLNYCI 176 >UniRef50_C6X0H2 Phage tail collar domain protein n=1 Tax=Flavobacteriaceae bacterium 3519-10 RepID=C6X0H2_FLAB3 Length = 193 Score = 77.1 bits (188), Expect = 2e-13, Method: Composition-based stats. Identities = 25/165 (15%), Positives = 42/165 (25%), Gaps = 27/165 (16%) Query: 11 VGVPVPWPSATPPTGWLKCNGAAFSAEEYPELAKAYP---------TNKLPDLRGEFIRG 61 VG + P P GW C+G+ S E L T +PD+RG + Sbjct: 26 VGQIMFVPYNFSPQGWHNCDGSLLSISENEVLFTLIGTTYGGDGQTTFAVPDMRGRVMID 85 Query: 62 WDDGR-------GIDTGRSILSIQGYATEDHAHGLPSRSTIVTDATINFYFDEIWVNSGT 114 G G +G + + H+H + + S T + + Sbjct: 86 DGQGNTLSSFTLGQMSGTETVQLTQAQMPAHSHTVNAVSGAGTSESPTSHLP-------- 137 Query: 115 DIIKRGNTNDAGLPAPDYGTFKTYKQSVDGLGAAASETRPRNIAF 159 D + G+ + Sbjct: 138 ---ANTGILDKEYSNQPLTSTMKMGMLSAAGGSQPHNNIQPYLTM 179 >UniRef50_Q4KAW2 Phage tail collar domain protein n=1 Tax=Pseudomonas fluorescens Pf-5 RepID=Q4KAW2_PSEF5 Length = 185 Score = 76.7 bits (187), Expect = 3e-13, Method: Composition-based stats. 
Identities = 25/171 (14%), Positives = 49/171 (28%), Gaps = 20/171 (11%) Query: 12 GVPVPWPSATPPTGWLKCNGAAFSAEEYPELAKAYP---------TNKLPDLRGEFIRGW 62 G +P P W C+G+ + YP L + LP+L+ G Sbjct: 6 GEIRLFPFNFAPDNWAVCDGSPLLVQNYPALYSVIGLTYGGTAGTSFNLPNLKSRVTIGT 65 Query: 63 DDGRGIDT---GRSILSIQGYATED----HAHGLPSRSTIVTDATINFYFDEIWVNSGTD 115 G + G+++ H H + ++ T ++ ++ Sbjct: 66 GQGANLTNRSLGQAVGGDTTTLLPAHFAPHTHHVLAKDGTDTTGALDLANGTAYLAQPRG 125 Query: 116 IIKRGNTNDAGLPAPDYGTFKTYKQSVDGLGAAASETR---PRNIAFNYIV 163 ++ N AP + + A TR + Y + Sbjct: 126 -VRLYNGTVPPTAAPVPSLHPSTVTTNGTEAAKTGSTRDVMQPFLTLRYCI 175 >UniRef50_C7PE74 Tail Collar domain protein n=3 Tax=Bacteria RepID=C7PE74_CHIPD Length = 196 Score = 75.9 bits (185), Expect = 4e-13, Method: Composition-based stats. Identities = 34/179 (18%), Positives = 47/179 (26%), Gaps = 29/179 (16%) Query: 14 PVPWPSATPPTGWLKCNGAAFSAEEYPELAKAYP---------TNKLPDLRGEFIRGWDD 64 + P GW C G S + L T LPDLRG G Sbjct: 8 IAIFGGNFNPRGWYFCQGQIMSIAQNTALFSLLGTTYGGNGQTTFALPDLRGRAPIGVGQ 67 Query: 65 GRGIDT-------GRSILSIQGYATEDHAHGLPSRSTIVTDATINFYFDEIWVNSGTDII 117 G G+ G ++ H H S V A ++ Sbjct: 68 GPGLQPYAWGQLGGSETHTLIITEMPAHNHTALVNSLTVNPAASTAAGTTNIPDATMVPA 127 Query: 118 KRGNTNDAGLPAP--------DYGTFKTYKQSVDGLGAAASETRP-----RNIAFNYIV 163 K N P + T K + A ++P +A NYI+ Sbjct: 128 KLPNIGSGPTAQPIKGYAVADNTTTLAPAKVTGSVTVGIAGGSQPFSIQNPYLAVNYII 186 >UniRef50_C9PG79 Putative phage tail protein n=1 Tax=Vibrio furnissii CIP 102972 RepID=C9PG79_VIBFU Length = 410 Score = 75.9 bits (185), Expect = 4e-13, Method: Composition-based stats. 
Identities = 24/162 (14%), Positives = 41/162 (25%), Gaps = 23/162 (14%) Query: 3 LGEGSALPVGVPVPWPSATPPTGWLKCNGAAFSAEEYPELAKAYPT---------NKLPD 53 S+ G + W S P + G Y LA A P +PD Sbjct: 253 WKPFSSKTPGETMAWDSELVPEHMIVAMGQQLPVTVYHSLAAAKPEWIDDTNPLVLNIPD 312 Query: 54 LRGEFIRGWDDGRGIDTGRSILSIQGYATEDHAHGLPSRSTIVTDATINFYFDEIWVNSG 113 RG F R D + A + + T + ++ + Sbjct: 313 RRGRFTRAADGSHWLA-----GQSHDDAIRNITGSFNASGTTGSASS---------TKTQ 358 Query: 114 TDIIKRGNTNDAGLPAPDYGTFKTYKQSVDGLGAAASETRPR 155 I ++ G + + E +P+ Sbjct: 359 GAIALSNTSSWPNYVNGQSGAGYNLLFDASNVVPTSEENQPK 400 >UniRef50_UPI0001BC923E Phage tail Collar n=1 Tax=Pseudomonas syringae pv. tabaci ATCC 11528 RepID=UPI0001BC923E Length = 196 Score = 75.5 bits (184), Expect = 5e-13, Method: Composition-based stats. Identities = 28/155 (18%), Positives = 45/155 (29%), Gaps = 18/155 (11%) Query: 12 GVPVPWPSATPPTGWLKCNGAAFSAEEYPELAKAYP---------TNKLPDLRGEFIRGW 62 G + + P GW++CNG + +Y L LP+L+G Sbjct: 6 GSIMTFGFPFAPAGWMQCNGQTLNISQYNALYALLGVIYGGNPSQNFMLPNLQGRVPINQ 65 Query: 63 DDGRGIDTGRSILSIQG--------YATEDHAHGLPSRSTIVTDATINFYFDEIWVNSGT 114 G + R I S+ G H H + + + T N + T Sbjct: 66 GTGVNLTN-RVIGSVSGVEKVTVAIANMPAHVHQMSTLTANTTITLANPAVTGATIAPTT 124 Query: 115 DIIKRGNTNDAGLPAPDYGTFKTYKQSVDGLGAAA 149 D G + A + V + A Sbjct: 125 DNAFIGASTSGPTSANIFSPNAGTAPVVQKGVSTA 159 >UniRef50_B0SX66 Tail Collar domain protein n=13 Tax=Bacteria RepID=B0SX66_CAUSK Length = 181 Score = 75.5 bits (184), Expect = 6e-13, Method: Composition-based stats. 
Identities = 26/165 (15%), Positives = 41/165 (24%), Gaps = 13/165 (7%) Query: 12 GVPVPWPSATPPTGWLKCNGAAFSAEEYPELAKAYP---------TNKLPDLRGEFIRGW 62 G P W +CNG S + L T LPDLR + Sbjct: 7 GELRIMSFNFAPKAWAQCNGQLLSINQNQALFSLLGTTYGGNGQTTFALPDLRTRVPAHF 66 Query: 63 DDG--RGIDTGRSILSIQGYATEDHAHGLPSRSTIVTDATINFYFDEIWVNSGTDIIKRG 120 +G G ++ H H L + + + G Sbjct: 67 GQSYVQGQVMGEYSHTLIQTEIPQHVHILQADAATAAANNTSSAVAGNSFGQSLSKASSG 126 Query: 121 NTNDAGLPAP--DYGTFKTYKQSVDGLGAAASETRPRNIAFNYIV 163 +T+ + T G+ E R +A + Sbjct: 127 STSAFNMYNTALTPSAPMTPAALAPAGGSQPHENRQPFLALTICI 171 >UniRef50_B4VMZ3 Phage Tail Collar Domain family n=1 Tax=Microcoleus chthonoplastes PCC 7420 RepID=B4VMZ3_9CYAN Length = 215 Score = 75.5 bits (184), Expect = 6e-13, Method: Composition-based stats. Identities = 30/175 (17%), Positives = 50/175 (28%), Gaps = 28/175 (16%) Query: 11 VGVPVPWPSATPPTGWLKCNGAAFSA--EEYPELAKAY------------PTNKLPDLRG 56 +G P GW C+G +E LA T ++PDLRG Sbjct: 38 IGQIAMVAFDFAPDGWYLCDGTLHDIISDENDILASILAGKYNQPGDPTQGTFRVPDLRG 97 Query: 57 EFIRGWDDGRG--------IDTGRSILSIQGYATEDHAHGLPSRSTIVTDATINFYFDEI 108 G + G G S Q T+ + I Sbjct: 98 RVPLGINPMAGNSDNDRNSYGLGDKSGSEQVELTQA------NLPEIQMKLKATNADSNE 151 Query: 109 WVNSGTDIIKRGNTNDAGLPAPDYGTFKTYKQSVDGLGAAASETRPRNIAFNYIV 163 SG+ ++ + ++ A ++ S+ A +A N+I+ Sbjct: 152 TSPSGSALLSKPRSSIYATGATNFVEMDCISSSIGSSENNAHNNLQPYLAINFII 206 >UniRef50_C3X3R8 Predicted protein n=4 Tax=Oxalobacter formigenes HOxBLS RepID=C3X3R8_OXAFO Length = 365 Score = 75.1 bits (183), Expect = 8e-13, Method: Composition-based stats. 
Identities = 29/168 (17%), Positives = 53/168 (31%), Gaps = 25/168 (14%) Query: 8 ALPVGVPVPWPSATPPTGWLKCNGAAFSAEEYPELAKAYP----------TNKLPDLRGE 57 +PVG PP G+LKC+GAA + YP+L A T LPD+ G Sbjct: 87 GVPVGSIDWLAVPEPPAGYLKCDGAAIGRDTYPDLFAAIGTTFGAGDGETTFNLPDMIGR 146 Query: 58 FIRGWDDGRGIDTGRSILSIQGYATEDHAHGLPSRSTIVTDATINFYFDEIWVNSGTDII 117 F G + + + + I ++ + W + + Sbjct: 147 FAEG---------SATPGIKKEAGLPNVSGVSAVEGCINKGSSTSSGPFTYWRENNLILN 197 Query: 118 KRGNTNDAGLPAPDYGTFKTYKQSVDGLGAAASETRPRNIAFNYIVRA 165 + D G + + + + +P + ++A Sbjct: 198 TSPS------NTHDLGGEIFSLSNGNPIYGNSDTVQPPALTLLPCIKA 239 >UniRef50_Q2SHC1 Microcystin-dependent protein n=1 Tax=Hahella chejuensis KCTC 2396 RepID=Q2SHC1_HAHCH Length = 171 Score = 74.8 bits (182), Expect = 1e-12, Method: Composition-based stats. Identities = 23/166 (13%), Positives = 42/166 (25%), Gaps = 22/166 (13%) Query: 11 VGVPVPWPSATPPTGWLKCNGAAFSAEEYPELAKAYP---------TNKLPDLRGEFIRG 61 +G PP W +C+G + L T LPDLRG Sbjct: 5 IGQISMMGFGFPPEDWAQCDGQIQQITQNQALYSLIGIYFGGNGTSTYALPDLRGRTPVH 64 Query: 62 WDDGR----GIDTGRSILSIQGYATEDHAHGLPSRSTIVTDATINFYFDEIWVNSGTDII 117 G+ G ++I H H + + S+ Sbjct: 65 LGASYSSQVGVTGGLEQVTITESTMGAHNHPVYASSSPADKGGPKSDRILAETPDIYCPA 124 Query: 118 KRGNTNDAGLPAPDYGTFKTYKQSVDGLGAAASETRPRNIAFNYIV 163 T ++ + G+ + ++ N+ + Sbjct: 125 DNLTTMNSQAIGLNGGSSGSV---------TPHANMQPSLTINFCI 161 >UniRef50_Q4KAW4 Putative uncharacterized protein n=1 Tax=Pseudomonas fluorescens Pf-5 RepID=Q4KAW4_PSEF5 Length = 181 Score = 74.0 bits (180), Expect = 2e-12, Method: Composition-based stats. 
Identities = 32/172 (18%), Positives = 40/172 (23%), Gaps = 26/172 (15%) Query: 12 GVPVPWPSATPPTGWLKCNGAAFSAEEYPELAKAYP---------TNKLPDLRGEFIRGW 62 G +P A P GWL C G Y LA T LPDLRG G Sbjct: 6 GEIRLFPWAWAPQGWLLCQGQILDVVNYTALASLLGDRYGGDGRTTFGLPDLRGRAALGE 65 Query: 63 DDGRGIDT-----------GRSILSIQGYATEDHAHGLPSRSTIVTDATINFYFDEIWVN 111 + G +++ H H T T I Sbjct: 66 NPVASTSPVLGVHELGSMDGAEWVALTLNNLPAHNHVANVAVTAGTGG----PAGNIPAI 121 Query: 112 SGTDIIKRGNTNDAGLPAPDYGTFKTYKQSVDGLGAAASETRPRNIAFNYIV 163 S T D T +I N+ + Sbjct: 122 SSTSKGAVSKPTYVAYADKDRVTINPTTVVTTLGYPLP--NMQPSIVGNFCI 171 >UniRef50_Q4UNP7 Microcystin dependent protein n=7 Tax=Proteobacteria RepID=Q4UNP7_XANC8 Length = 181 Score = 74.0 bits (180), Expect = 2e-12, Method: Composition-based stats. Identities = 25/172 (14%), Positives = 43/172 (25%), Gaps = 27/172 (15%) Query: 12 GVPVPWPSATPPTGWLKCNGAAFSAEEYPELAKAYP---------TNKLPDLRGEFIRGW 62 G + P + +CNG + L T LPD+RG G+ Sbjct: 7 GQIMMTGFVFAPKYFAQCNGQLLPVNQNQALFSLLGTRFGGNGSTTFALPDMRGRTPVGF 66 Query: 63 DDGR-----------GIDTGRSILSIQGYATEDHAHGLPSRSTIVTDATINFYFDEIWVN 111 G ++G +S+ H H L S + + + + Sbjct: 67 APSADPAWQPSPLPMGQNSGAENVSLLPDNLPAHNHSLEGSSAAGNNRSPSGRSFGTNAS 126 Query: 112 SGTDIIKRGNTNDAGLPAPDYGTFKTYKQSVDGLGAAASETRPRNIAFNYIV 163 + A AP G+ N+ + Sbjct: 127 TTGPAT-------ALYAAPGPLVAMNPATVAQAGGSQPHPNLQPYTTLNFCI 171 >UniRef50_B1J270 Tail Collar domain protein n=8 Tax=Bacteria RepID=B1J270_PSEPW Length = 195 Score = 74.0 bits (180), Expect = 2e-12, Method: Composition-based stats. 
Identities = 31/180 (17%), Positives = 56/180 (31%), Gaps = 28/180 (15%) Query: 12 GVPVPWPSATPPTGWLKCNGAAFSAEEYPELAKAYP---------TNKLPDLRGEFIRGW 62 G + P GW C G S + L T LPDLRG G+ Sbjct: 6 GEIKMFAGNFAPRGWAFCQGQLMSIAQNNALFALLGTTYGGDGKTTFALPDLRGRGPIGF 65 Query: 63 DDGRGI-------DTGRSILSIQGYATEDHAHGLPSRSTIVTDATINFYFDEIWVNSGTD 115 G G+ G + +++ +P+++ V + + +SG Sbjct: 66 GTGPGLADVVQGEAGGVNDVTLLQSNMPMQQAVIPAQTVSVAIPAVEGDANAAAPSSGNV 125 Query: 116 IIKRGNTNDAGLPAPDYGT------------FKTYKQSVDGLGAAASETRPRNIAFNYIV 163 + K +++ AG A Y + G + + + N+I+ Sbjct: 126 LAKSFDSSGAGAAADIYSSDVPNTSLKPFNVTVPQTSVNLGGASLPVSVQNPYLGMNFII 185 >UniRef50_Q11LT1 Microcystin-dependent protein-like n=1 Tax=Chelativorans sp. BNC1 RepID=Q11LT1_MESSB Length = 268 Score = 73.2 bits (178), Expect = 3e-12, Method: Composition-based stats. Identities = 34/192 (17%), Positives = 55/192 (28%), Gaps = 33/192 (17%) Query: 6 GSALPVGVPVPWPSATPPTGWLKCNGAAFSAEE-YPE-----------LAKAYPTNKLPD 53 G +P+G V + +T P GW C G A ++ YP + ++PD Sbjct: 78 GQLVPIGTIVDYALSTAPEGWTFCYGQALTSSTPYPLLRAALLAAGSPFGTSGSDPRVPD 137 Query: 54 LRGEFIRGWDDGRGIDTGR-------------------SILSIQGYATEDHAHGLPSRST 94 RG G D+ G R ++ H H + S Sbjct: 138 YRGRVGAGKDNMGGTSANRLTNQSGGVNGDVLGDTGGAETHTLSVGQMPSHNHSGSTGS- 196 Query: 95 IVTDATINFYFDEIWVNSGTDIIKRGNTNDAGLPAPDYGTFKTYKQSVDGLGAAASETRP 154 + T Y + SG + + + + G Sbjct: 197 -GGNHTHTMYVKNLSAGSGGNPVTGTPSGTIDSTYQSDPSGSHSHSIPSQGGNDPHNNVQ 255 Query: 155 RNIAFNYIVRAA 166 I N I++AA Sbjct: 256 PTIIVNKIIKAA 267 >UniRef50_Q8EKA9 Putative uncharacterized protein n=1 Tax=Shewanella oneidensis RepID=Q8EKA9_SHEON Length = 181 Score = 73.2 bits (178), Expect = 3e-12, Method: Composition-based stats. 
Identities = 24/169 (14%), Positives = 43/169 (25%), Gaps = 19/169 (11%) Query: 11 VGVPVPWPSATPPTGWLKCNGAAFSAEEYPELAKAYP---------TNKLPDLRGEFIRG 61 VG + P GW CNG A + ++ L LPD R G Sbjct: 6 VGEIRMMANTYAPYGWAYCNGQAIAIQQNAVLFSVIGIAFGGNGTTMFNLPDFRSAAPIG 65 Query: 62 WDDGRGIDT-------GRSILSIQGYATEDHAHGLPSRSTIVTDATINFYFDEIWVNSGT 114 G G+ G + +++ H H L ++ + + G Sbjct: 66 TGQGPGLTQVVIGEFKGEADVTLTYGTIPAHTHTLSGKTMLGDQSIPANNLYLAADGGGG 125 Query: 115 DIIKRGNTNDAGLPAPDYGTFKTYKQSVDGLGAAASETRPRNIAFNYIV 163 N T + ++ + + + Sbjct: 126 G---TENLFYMLPGTTTPDTSLSPNCIGSSGSGSSHPNAQPFLTLGFCI 171 >UniRef50_B9M3Z8 Tail Collar domain protein n=3 Tax=Proteobacteria RepID=B9M3Z8_GEOSF Length = 174 Score = 73.2 bits (178), Expect = 3e-12, Method: Composition-based stats. Identities = 23/164 (14%), Positives = 41/164 (25%), Gaps = 18/164 (10%) Query: 12 GVPVPWPSATPPTGWLKCNGAAFSAEEYPELAKAY---------PTNKLPDLRGEFIRGW 62 G + P GW C+G + L T LPD +G + Sbjct: 7 GEIRLFGFNFAPVGWATCDGQIMQITQNQALYALLGTTYGGNGTTTFNLPDFQGRTLLHA 66 Query: 63 DD--GRGIDTGRSILSI-QGYATEDHAHGLPSRSTIVTDATINFYFDEIWVNSGTDIIKR 119 + G G G +++ H H L + + + Sbjct: 67 NATYGEGKAGGVETVALATTSELPVHNHVLAANTGPGGSNVPQGNILAATQSPDNTKTAY 126 Query: 120 GNTNDAGLPAPDYGTFKTYKQSVDGLGAAASETRPRNIAFNYIV 163 + GT G+A ++ N+ + Sbjct: 127 ATAKATPVANLAGGTLSPA------GGSAGHNNVQPSLTVNFCI 164 >UniRef50_Q1QPI6 Phage Tail Collar n=2 Tax=Proteobacteria RepID=Q1QPI6_NITHX Length = 176 Score = 72.8 bits (177), Expect = 4e-12, Method: Composition-based stats. 
Identities = 25/169 (14%), Positives = 42/169 (24%), Gaps = 24/169 (14%) Query: 11 VGVPVPWPSATPPTGWLKCNGAAFSAEEYPELAKAYP---------TNKLPDLRGEFIRG 61 VG + + P GWL CNG+ +Y L T P+L G Sbjct: 6 VGEIRLFGFSRVPQGWLPCNGSLQPISQYEVLFSLVGTTYGGDGVTTFGTPNLSGRVPVH 65 Query: 62 WDDGRGIDT-------GRSILSIQGYATEDHAHGLPSRSTIVTDATINFYFDEIWVNSGT 114 G G+ G +++ H H + + T S Sbjct: 66 SGTGPGVSPRVIGEIGGSEKVTLLSAHMPYHDHPMVA--------TTGPANSSQITPSLE 117 Query: 115 DIIKRGNTNDAGLPAPDYGTFKTYKQSVDGLGAAASETRPRNIAFNYIV 163 G+T G + + + + + Sbjct: 118 LGTVAGDTMYTSDVKDVGGANTAPTSTSMAGRNIPHDNLMPTLPVQFCI 166 >UniRef50_C3X3W3 Predicted protein n=1 Tax=Oxalobacter formigenes HOxBLS RepID=C3X3W3_OXAFO Length = 315 Score = 72.8 bits (177), Expect = 4e-12, Method: Composition-based stats. Identities = 27/164 (16%), Positives = 47/164 (28%), Gaps = 21/164 (12%) Query: 3 LGEGSALPVGVPVPW--PSATPPTGWLKCNGAAFSAEEYPELAKAYPTNKLPDLRGEFIR 60 L + +P G+ + + P+GWL CNG N PDLR F+ Sbjct: 170 LDKSEGIPSGLIAMYSGAADHIPSGWLLCNG----------------ENGTPDLRDRFVV 213 Query: 61 GWDDGRGIDTGRSILSIQGYATEDHAHGLPSRSTIVTDATINFYFDEIWVNSGTDIIKRG 120 G + + A + + + + + Sbjct: 214 GAGKAYAV---YAKGGATTGAVSGQTGETTLTINQIPSHNHGVGYYISRSGNAGNGFQVE 270 Query: 121 NTNDAGLPAPDYGTFKTYKQSVDGLGAAASETRPRNIAFNYIVR 164 T D Y T + Q + + T P A +I++ Sbjct: 271 RTTDNFAFTYLYTTVQGGNQPHSHSLSGSVSTVPPYYALCFILK 314 >UniRef50_C1F4Q4 Phage tail collar domain protein n=1 Tax=Acidobacterium capsulatum ATCC 51196 RepID=C1F4Q4_ACIC5 Length = 211 Score = 72.8 bits (177), Expect = 4e-12, Method: Composition-based stats. 
Identities = 28/195 (14%), Positives = 44/195 (22%), Gaps = 43/195 (22%) Query: 12 GVPVPWPSATPPTGWLKCNGAAFSAEEYPELAKAYP---------TNKLPDLRGEFIRGW 62 G P W CNG S L T KLPD RG + Sbjct: 7 GEIRMVAFNFAPQYWAFCNGQTLSIASNNALFALLGTFYGGDGISTFKLPDFRGRMPLAF 66 Query: 63 DDGR-------GIDTGRSILSIQGYATEDHAHGLPSRSTIVTDATINFYFDEIWVNSGTD 115 G G + G+ + + H H + + G Sbjct: 67 GQGPNLPVYEIGENGGQPTMQLSQAQLPSHTHLAQFQPSGGGGTPTVSVTVNGSSAHGNS 126 Query: 116 IIKRGNT--------------NDAGLPAPDYGTFKTYKQSVDGLGAAASETR-------- 153 GN A G ++ G+ + Sbjct: 127 GSATGNYIAGMTDVNRSAGQLFVNNPEASTLGAIAGVSATISGVPSGGGTVTNAVAGQGQ 186 Query: 154 -----PRNIAFNYIV 163 P ++ N+++ Sbjct: 187 AFSIEPPYLSVNFVI 201 >UniRef50_B5RPA6 Uncharacterized conserved protein n=73 Tax=Borrelia RepID=B5RPA6_BORDL Length = 265 Score = 72.4 bits (176), Expect = 5e-12, Method: Composition-based stats. Identities = 27/138 (19%), Positives = 47/138 (34%), Gaps = 9/138 (6%) Query: 26 WLKCNGAAFSAEEYPELAKAYPTNKLPDLRGEFIRGWDDGRGIDTGRSILSIQGYATEDH 85 + +G + + Y + P L G F+R +D RS+ QGYA ++H Sbjct: 129 FCLPDGRSLPSNCYAT--RVLGITSAPSLSGRFLRQYDA----SNSRSLGDTQGYALKNH 182 Query: 86 AHGLPSRSTIVTDATINFYFDEIWVNSGTDIIKRGNTNDAGLPAPDYGTFKTYKQSVDGL 145 H + + N + + + G G Y Sbjct: 183 QHRIN-YERVNYGYGQNLFKEAYTNSKGFSDSWYWKDGSYGFVHNLKLGDPIYSHRFSRY 241 Query: 146 GA--AASETRPRNIAFNY 161 +SETRP+N+A+ + Sbjct: 242 TGYEGSSETRPKNLAYLW 259 >UniRef50_B2SIA5 Microcystin dependent protein n=4 Tax=Proteobacteria RepID=B2SIA5_XANOP Length = 175 Score = 72.4 bits (176), Expect = 5e-12, Method: Composition-based stats. 
Identities = 23/169 (13%), Positives = 40/169 (23%), Gaps = 25/169 (14%) Query: 11 VGVPVPWPSATPPTGWLKCNGAAFSAEEYPELAKAYP---------TNKLPDLRGEFIRG 61 +G + P GW C+G+ EY L T +PDLRG Sbjct: 6 IGEIRMFGFGRTPQGWQACDGSLLQISEYEPLYVLLGTAYGGNGTSTFAVPDLRGRLPIH 65 Query: 62 WDDGRGID-------TGRSILSIQGYATEDHAHGLPSRSTIVTDATINFYFDEIWVNSGT 114 G G+ G +++ + T Sbjct: 66 QGQGPGLSNYPLAQRAGTETVTLTELQMP---------AHTHTAQATTAAATTTAPAGLL 116 Query: 115 DIIKRGNTNDAGLPAPDYGTFKTYKQSVDGLGAAASETRPRNIAFNYIV 163 G+ A + + + G + + Y + Sbjct: 117 PAAVNGSQFYASDVTGATMLALSPQSTSFAGGNQPHDNVMPTLTVQYCI 165 >UniRef50_B1JGT8 Putative uncharacterized protein n=1 Tax=Yersinia pseudotuberculosis YPIII RepID=B1JGT8_YERPY Length = 472 Score = 72.1 bits (175), Expect = 6e-12, Method: Composition-based stats. Identities = 34/179 (18%), Positives = 52/179 (29%), Gaps = 35/179 (19%) Query: 20 ATPPTGWLKCNGAAFSAEEYPELAKAYPTNKL----------PDLRGEFIRGWDD----- 64 P GW +G +P+ A P LRG F G Sbjct: 151 TYIPEGWAPADGIILDRALWPDAWDAIQVGYSRVTDESWIRDPILRGCFSIGNGSTTFRI 210 Query: 65 -----------------GRGIDTGRSILSIQGYATEDHAHGLPSRSTIVTDATINFYFDE 107 G G ++ I IQG A + S + +A + Sbjct: 211 PDLNGKSEGSLGAAFLRGDGKNSFGEIGRIQGDAIRNITGDFGSLGGQINNAYGIVIGSK 270 Query: 108 IWVNSGTDIIKRGNTNDAGLPAPDYGTFKTYKQSVDGLGAAASETRPRNIAFNYIVRAA 166 V G + G A + P G+ + + A++ RP N YI++ A Sbjct: 271 NGVFVGHG--ENGRPTSANIGQPALGS-EFVAFDASRVVPTAADNRPVNATGCYIIKLA 326 >UniRef50_A8YDB4 Genome sequencing data, contig C291 n=2 Tax=Microcystis aeruginosa RepID=A8YDB4_MICAE Length = 166 Score = 72.1 bits (175), Expect = 6e-12, Method: Composition-based stats. 
Identities = 31/113 (27%), Positives = 48/113 (42%), Gaps = 25/113 (22%) Query: 4 GEGSALPVGVPVPWPSATPPTGWLKCNGAAFSAEEYPELAKAYPT---------NKLPDL 54 + +L +P+ +P A GW+ C+G + YPEL T +LPD Sbjct: 38 AQAESLNANIPITYPEAY---GWMLCDGRYLEIDAYPELFAVIGTLYGKQGDNKFRLPDY 94 Query: 55 RGEFIRGWDDGRGIDTGRS-------------ILSIQGYATEDHAHGLPSRST 94 RG F+RG D G G+D + I S+Q A + H H + ++ Sbjct: 95 RGLFMRGVDAGSGLDPDAAERIGPEGMGKSSGIGSLQCDALQQHQHDYNASNS 147 >UniRef50_Q73NL1 Tail fiber domain protein n=1 Tax=Treponema denticola RepID=Q73NL1_TREDE Length = 527 Score = 71.7 bits (174), Expect = 8e-12, Method: Composition-based stats. Identities = 30/172 (17%), Positives = 51/172 (29%), Gaps = 29/172 (16%) Query: 11 VGVPVPWPSATPPTGWLKCNGAAFSAEEYPELA------------------KAYPTNKLP 52 +G + + G+L NG +F E YPE ++ KLP Sbjct: 365 IGEVRYFTNKKYTYGYLYANGYSFIPELYPEFYQFWLENFGDRNKKNYLGYDSFGYPKLP 424 Query: 53 DLRGEFIRGWDDGRGIDTGRSILSIQGYATEDHAHGLPSRSTIVTDATINFYFDEIWVNS 112 DLRG +R DDG L QG A + + + + + Sbjct: 425 DLRGVALRAVDDGSDRGGAALALEFQGDAIRNLKGRV----------GVQGNDGYPNLTA 474 Query: 113 GTDIIKRGNTNDAGLPAPDYGTFKTYKQSVDGLGAAASETRPRNIAFNYIVR 164 G D+G + + A + R ++ ++ Sbjct: 475 GVFHTLDTGYIDSGTSQ-ASSYLRLLGFDASRVVPTAEDNRVKSYGVYPFIK 525 >UniRef50_C6DJW4 Tail Collar domain protein n=2 Tax=Pectobacterium carotovorum subsp. carotovorum RepID=C6DJW4_PECCP Length = 246 Score = 71.3 bits (173), Expect = 1e-11, Method: Composition-based stats. 
Identities = 27/160 (16%), Positives = 47/160 (29%), Gaps = 17/160 (10%) Query: 11 VGVPVPWPSATPPTGWLKCNGAAFSAEEYPELAKAY----------PTNKLPDLRGEFIR 60 +G ++ P G+L NG + +Y L LPD+RG Sbjct: 36 IGSICYMVTSYCPQGYLPANGQTVTINQYQALYALIGNIWGGSPQQGNFVLPDMRGRVPV 95 Query: 61 GWDDGRGIDT-------GRSILSIQGYATEDHAHGLPSRSTIVTDATINFYFDEIWVNSG 113 G G G+ G +++ H H S+ T + + + Sbjct: 96 GAGQGTGLANVTRGQVFGVENVALTTSNVAPHIHPATVASSGGVSGTASIAIPVVNGAAT 155 Query: 114 TDIIKRGNTNDAGLPAPDYGTFKTYKQSVDGLGAAASETR 153 T++ + P+ D + AA T Sbjct: 156 TNVPDNTTSLATTSPSFDLSSVGGVDSPAKIYSNAAPTTT 195 >UniRef50_B3FYL6 Gp17 n=1 Tax=Salmonella phage phiSG-JL2 RepID=B3FYL6_9CAUD Length = 658 Score = 71.3 bits (173), Expect = 1e-11, Method: Composition-based stats. Identities = 35/199 (17%), Positives = 58/199 (29%), Gaps = 34/199 (17%) Query: 1 VGLGEGSALPVGVPVPWPSATPPTGWLKCNGAAFSAEEYPELAKAYPTNKLPDLRGEFIR 60 + L +PVG PTGW++ G F YP LA+ +P+ + P + Sbjct: 268 LALQSTGNVPVGTVAMITHTKIPTGWVRA-GEDFDVNTYPALAELFPSGRTPSFDDRYPI 326 Query: 61 G----WDDGRGIDTGRSILSIQGY--------------------ATEDHAHGLP------ 90 G G+ ID S DH+HG Sbjct: 327 GNSTVLTPGQLIDQSVPAHSHTFDVPVNVSGATAAGGEYRARTSHEGDHSHGFSLPIQNN 386 Query: 91 --SRSTIVTDATINFYFDEIWVNSGTDIIKRGNTNDAGLPAPDYGTFKTYKQSVDGLGAA 148 + + + N + + + + + SV G Sbjct: 387 TGAYTGRLVGGGNNPNYPQDLRFNTGGGGAHSHEFYVPSHSHTLNASGRAAGSVSSSGIG 446 Query: 149 AS-ETRPRNIAFNYIVRAA 166 S RP + +I++AA Sbjct: 447 NSPYVRPYSTVVIFIIKAA 465 >UniRef50_A1AK23 Phage Tail Collar domain protein n=1 Tax=Pelobacter propionicus DSM 2379 RepID=A1AK23_PELPD Length = 209 Score = 71.3 bits (173), Expect = 1e-11, Method: Composition-based stats. 
Identities = 27/194 (13%), Positives = 50/194 (25%), Gaps = 42/194 (21%) Query: 12 GVPVPWPSATPPTGWLKCNGAAFSAEEYPELAKAYP---------TNKLPDLRGEFIRGW 62 G+ + + P G+ CNG + L + LPDLRG + Sbjct: 6 GMVMTFAYNWAPVGFAACNGQTTQIMQNQALYSLLGVAFGGNGSTSFNLPDLRGRTPVHF 65 Query: 63 DDGRGIDT-------GRSILSIQGYATEDHAHGLPSRSTIVTDATINFYF-------DEI 108 G + G+ ++ H H + ++ T + Sbjct: 66 GQGTNLTPRTFAAQFGQENGTLTVANLPPHNHAIGEKTAGQTVTATAAATVNAGDVQGSL 125 Query: 109 WVNSGTDIIKRGNTNDAGLPAPDYGTFKTYKQS-------------------VDGLGAAA 149 T +G +Y T K + + G A Sbjct: 126 NKPQNTAYWAKGWDAADSSVLSNYTTTKNVTMASDAVQVTVTPAFNASNLNVANTGGGGA 185 Query: 150 SETRPRNIAFNYIV 163 ++A N+ + Sbjct: 186 FPLAQPSLALNFCI 199 >UniRef50_UPI0001A44BB4 microcystin dependent protein n=1 Tax=Pectobacterium carotovorum subsp. brasiliensis PBR1692 RepID=UPI0001A44BB4 Length = 269 Score = 70.5 bits (171), Expect = 2e-11, Method: Composition-based stats. Identities = 33/165 (20%), Positives = 47/165 (28%), Gaps = 17/165 (10%) Query: 5 EGSALPVGVPVPWPSATPPTGWLKCNGAAFSAEEYPELAKAYP----------TNKLPDL 54 +G A +G ++ P+G+L G + S Y L LPDL Sbjct: 31 DGDAPYIGSVCYMVTSYCPSGYLPAAGQSVSISTYQALYALIGNIWGGSPQTNNFTLPDL 90 Query: 55 RGEFIRGWDDG-------RGIDTGRSILSIQGYATEDHAHGLPSRSTIVTDATINFYFDE 107 RG I G G RG G ++ H H T D + Sbjct: 91 RGRSIVGAGQGTGLSLIQRGQSLGAETATLSASNVAPHTHPTAQSLTTTFDVLVPATTGN 150 Query: 108 IWVNSGTDIIKRGNTNDAGLPAPDYGTFKTYKQSVDGLGAAASET 152 + V + I PA +V A + T Sbjct: 151 LTVGATLPIATTTPATTGTTPANGANFLTALSATVPVGAATQNAT 195 >UniRef50_C7PCL6 Putative uncharacterized protein n=1 Tax=Chitinophaga pinensis DSM 2588 RepID=C7PCL6_CHIPD Length = 439 Score = 70.1 bits (170), Expect = 2e-11, Method: Composition-based stats. 
Identities = 29/136 (21%), Positives = 42/136 (30%), Gaps = 29/136 (21%) Query: 35 SAEEYPELAKAYPTNKLPDLRGEFIRGWDDGRGIDTGRSI-------LSIQGYATEDHAH 87 SA+ Y T ++PDLRG F R D G ID R S Q + H H Sbjct: 324 SAKTYWGWGDGVNTLQVPDLRGYFPRWLDLGANIDADRVASSLQNKPGSAQSDEFKSHTH 383 Query: 88 GLPSRSTIVTDATINFYFDEIWVNSGTDIIKRGNTNDAGLPAPDYGTFKTYKQSVDGLGA 147 + ++ + + + G T Sbjct: 384 TWRAETSNDSLGGTGWV----------------------TSSSGNGGAGTNTLHAANDAT 421 Query: 148 AASETRPRNIAFNYIV 163 SETRP+NI ++ Sbjct: 422 GGSETRPKNIGELPLI 437 >UniRef50_C5BRC7 Phage tail collar domain protein n=1 Tax=Teredinibacter turnerae T7901 RepID=C5BRC7_TERTT Length = 195 Score = 70.1 bits (170), Expect = 2e-11, Method: Composition-based stats. Identities = 27/155 (17%), Positives = 37/155 (23%), Gaps = 16/155 (10%) Query: 11 VGVPVPWPSATPPTGWLKCNGAAFSAEEYPELAKAYPT---------NKLPDLRGEFIRG 61 +G + GW C G + ++ L T +PDLRG G Sbjct: 5 IGDIHLFGFNFGQEGWALCQGQLMAIQDNTALYSLIGTQYGGDGRSSFGIPDLRGRVPLG 64 Query: 62 WDDGR-------GIDTGRSILSIQGYATEDHAHGLPSRSTIVTDATINFYFDEIWVNSGT 114 + G D G IQ H+H S T Sbjct: 65 TGNPPGGSQWPMGRDAGAETCVIQESQMPTHSHPANFSSQSSLFGTTEPADLTTPETGAV 124 Query: 115 DIIKRGNTNDAGLPAPDYGTFKTYKQSVDGLGAAA 149 A P Y ++ GL Sbjct: 125 LANVVAGGTGADKPEKIYTVTTANPVTLGGLDVTG 159 >UniRef50_B6IWH6 Putative uncharacterized protein n=2 Tax=Bacteria RepID=B6IWH6_RHOCS Length = 206 Score = 70.1 bits (170), Expect = 3e-11, Method: Composition-based stats. 
Identities = 31/172 (18%), Positives = 49/172 (28%), Gaps = 26/172 (15%) Query: 11 VGVPVPWPSATPPTGWLKCNGAAFSAEEYPELAKAYP---------TNKLPDLRGEFIRG 61 +G +PW + P+ W C G + LPDLRG G Sbjct: 6 IGTIMPWAVSWAPSNWSLCMGQILPVNGNQAVFALIGATYGGNGSTNFALPDLRGRVPVG 65 Query: 62 WDDGR-------------GIDTGRSILSIQGYATEDHAHGLPSRSTIVTDATINFYFDEI 108 G G+ +++ H H + ++ Sbjct: 66 AGQFPGSGGIPPTTNRVIGQSGGQEQVNLTQSQMPVHTHAAQA---TGGGGSVTLSAYTG 122 Query: 109 WVNSGTDIIKRGNTNDAGLPAPDYGTFKTYKQSVDGLGAAASET-RPRNIAF 159 +S + + T AG D T K Y + AS T +P +I Sbjct: 123 PADSSAPAVGKYLTAAAGDFGGDAVTVKIYGPASGTAVPIASGTVQPPSITV 174 >UniRef50_A3YA17 Prophage MuSo2, tail fiber protein, putative n=1 Tax=Marinomonas sp. MED121 RepID=A3YA17_9GAMM Length = 341 Score = 69.7 bits (169), Expect = 3e-11, Method: Composition-based stats. Identities = 31/112 (27%), Positives = 41/112 (36%), Gaps = 19/112 (16%) Query: 47 PTNKLPDLRGEFIRGWDDGRGIDTGRSILSIQGYATEDHAHGLPSRSTIVTDATINFYFD 106 T LP + GEFIR +DDGRG+D GR S Q A + H H + Sbjct: 242 STFTLPIVGGEFIRMFDDGRGVDDGRVFGSFQEDAFQGHWHATAEGGDTLAGNGYL---- 297 Query: 107 EIWVNSGTDIIKRGNTNDAGLPAPDYGTFKTYKQSVDGLGAAASETRPRNIA 158 N + D +G+ A+ETR R+IA Sbjct: 298 ---------------ANSPSYSSMDNNARDAVTDGQNGVPRMANETRSRSIA 334 >UniRef50_C4ZIZ4 Tail Collar domain protein n=4 Tax=Proteobacteria RepID=C4ZIZ4_THASP Length = 172 Score = 69.7 bits (169), Expect = 3e-11, Method: Composition-based stats. 
Identities = 25/169 (14%), Positives = 38/169 (22%), Gaps = 28/169 (16%) Query: 11 VGVPVPWPSATPPTGWLKCNGAAFSAEEYPELAKAYPT---------NKLPDLRGEFIRG 61 +G + P GW +CNG + L T LP+L G G Sbjct: 6 IGEIRSFGFNFAPRGWAQCNGQLLPIAQNTALFSILGTMYGGDGRTNFALPNLSGAVAMG 65 Query: 62 WDDGRGIDT-------GRSILSIQGYATEDHAHGLPSRSTIVTDATINFYFDEIWVNSGT 114 G G+ G +++ H H + F + Sbjct: 66 SGQGSGLTPRSQGERGGSESVTLTLGEMPVHGHAANANPANGNQPGPGGNFWAQDLGGSK 125 Query: 115 DIIKRGNTNDAGLPAPDYGTFKTYKQSVDGLGAAASETRPRNIAFNYIV 163 D G A G N+ + Sbjct: 126 DYADAGTVAMAPGAIGIAGE------------GQPHSNVQPFQVLNFCI 162 >UniRef50_A1SXZ2 Phage Tail Collar domain protein n=2 Tax=Gammaproteobacteria RepID=A1SXZ2_PSYIN Length = 199 Score = 69.4 bits (168), Expect = 4e-11, Method: Composition-based stats. Identities = 23/169 (13%), Positives = 39/169 (23%), Gaps = 30/169 (17%) Query: 12 GVPVPWPSATPPTGWLKCNGAAFSAEEYPELAKAYP---------TNKLPDLRGEFIRGW 62 G P W C+G L T LP+LR + Sbjct: 34 GEIKWVGFNFAPRDWAFCDGQLLPIAHNTALFALIGTIYGGDGITTFALPELRSRVMIHK 93 Query: 63 DDGRGIDTGRSILS--------IQGYATEDHAHGLPSRSTIVTDATINFYFDEIWVNSGT 114 G G+ R+I + H H L S+ S Sbjct: 94 GRGAGLSN-RAIGQKAGEERVVLSPAELASHNHLLKGSSSSANSTLPQDGTPATLRRSRI 152 Query: 115 DIIKRGNTNDAGLPAPDYGTFKTYKQSVDGLGAAASETRPRNIAFNYIV 163 + + + G +++ ++ N I+ Sbjct: 153 YNAMLPDVDMDITALANAGGRQSHDNVA------------PSLTLNCII 189 >UniRef50_Q2S9I0 Microcystin-dependent protein n=3 Tax=Bacteria RepID=Q2S9I0_HAHCH Length = 176 Score = 69.4 bits (168), Expect = 4e-11, Method: Composition-based stats. 
Identities = 25/169 (14%), Positives = 43/169 (25%), Gaps = 24/169 (14%) Query: 11 VGVPVPWPSATPPTGWLKCNGAAFSAEEYPELAKAYP---------TNKLPDLRGEFIRG 61 + + PP W C+G +E L T LP+L+G Sbjct: 6 IAEIRIFGFNYPPRSWSFCSGQIIPIDENQALFALIGSIYGGDARVTMGLPNLQGRSPLH 65 Query: 62 WDDGRGID-------TGRSILSIQGYATEDHAHGLPSRSTIVTDATINFYFDEIWVNSGT 114 G G+ G + + H H L + T + Sbjct: 66 TGKGPGLTERQLADYGGLPEVELAAAQIPPHTHTLSAAKQAGTTSEPTGQLFAYQAGDVQ 125 Query: 115 DIIKRGNTNDAGLPAPDYGTFKTYKQSVDGLGAAASETRPRNIAFNYIV 163 K+ D +P + V E R + ++ + Sbjct: 126 PDYKQPPLGDLQAMSPGMLAYAGQSSMV--------ENRQPFLGLSFCI 166 >UniRef50_A5GA43 Phage Tail Collar domain protein n=1 Tax=Geobacter uraniireducens Rf4 RepID=A5GA43_GEOUR Length = 177 Score = 69.4 bits (168), Expect = 5e-11, Method: Composition-based stats. Identities = 24/169 (14%), Positives = 37/169 (21%), Gaps = 23/169 (13%) Query: 11 VGVPVPWPSATPPTGWLKCNGAAFSAEEYPELAKAYPT---------NKLPDLRGEFIRG 61 +G + P GW C G S E+ L LP+L+ G Sbjct: 6 IGEIRTFAGNFAPYGWFYCQGQRLSITEFQALYAVIGATYGGDGSTYFNLPNLQAYAPMG 65 Query: 62 WDDGRGIDT-------GRSILSIQGYATEDHAHGLPSRSTIVTDATINFYFDEIWVNSGT 114 G G+ G ++ H H + T Sbjct: 66 QGAGTGLTPRTLGHACGVPATTLIDNQMPPHTHAAQGTNATGTSNNPANRIW-------A 118 Query: 115 DIIKRGNTNDAGLPAPDYGTFKTYKQSVDGLGAAASETRPRNIAFNYIV 163 +I G G + N+I+ Sbjct: 119 KVISTPQIQPYGKTVASTPVAMKADALASAGGGSTHSNMQPYQGINFIM 167 >UniRef50_B5TK79 Tail collar protein n=2 Tax=root RepID=B5TK79_9VIRU Length = 364 Score = 68.6 bits (166), Expect = 7e-11, Method: Composition-based stats. 
Identities = 38/189 (20%), Positives = 65/189 (34%), Gaps = 31/189 (16%) Query: 3 LGEGSALPVGVPVPWPS-ATPPTGWLKCNGAAFSAEEYPEL------------------- 42 +G P+G P + P G+ NG E+P L Sbjct: 180 MGRFDNTPLGRPTFETTIQLSPGGYGALNGTVMKRAEWPWLWDHAQQSGMLGTEATREGN 239 Query: 43 ------AKAYPTNKLPDLRGEFIRGWDDGRGIDTGRSILSIQGYATEDHAHGLPSRSTIV 96 T + P+ RGEF+R D+GR +D+GR++ + Q H + Sbjct: 240 EGKWSSGDGALTFRAPEGRGEFLRILDEGRSVDSGRAMGTFQPGTVHSH-----ALGAQG 294 Query: 97 TDATINFYFDEIWVNSGTDIIKRGNTNDAGLPAPDYGTFKTYKQSVDGLGAAASETRPRN 156 A + + D + + D P + TY+ + ++RPRN Sbjct: 295 AGAVGSRWSDSLSTVGANTREEIKIIGDLVNGGPTFPAGTTYQMDTANTLLYSFKSRPRN 354 Query: 157 IAFNYIVRA 165 IA+ ++ Sbjct: 355 IAYPARIKL 363 >UniRef50_A6E584 Microcystin dependent protein, putative n=1 Tax=Roseovarius sp. TM1035 RepID=A6E584_9RHOB Length = 207 Score = 68.6 bits (166), Expect = 8e-11, Method: Composition-based stats. Identities = 24/168 (14%), Positives = 42/168 (25%), Gaps = 23/168 (13%) Query: 12 GVPVPWPSATPPTGWLKCNGAAFSAEEYPELAKAYP---------TNKLPDLRGEFIRGW 62 G + P GW + G + L T LPDLRG G Sbjct: 37 GDIMIVGFNFCPRGWSEAAGQLLPIAQNQALFSLLGTQFGGDGITTFALPDLRGRITVGQ 96 Query: 63 DDGRGIDT---GRSILS----IQGYATEDHAHGLPSRSTIVTDATINFYFDEIWVNSGTD 115 G G+ G+ S + H+H + + + G+ Sbjct: 97 GTGNGLTPRLAGQRFGSETKVMTEATMPQHSHTVQANNLDGDLPGPGNKLLAAAPTGGSG 156 Query: 116 IIKRGNTNDAGLPAPDYGTFKTYKQSVDGLGAAASETRPRNIAFNYIV 163 N+ + + + T +A + + Sbjct: 157 -------NETIYSQANPNVTMSAAMIAPAGASTPISTLDPTLALYHCI 197 >UniRef50_A6E583 Phage tail collar domain protein n=2 Tax=Roseovarius sp. TM1035 RepID=A6E583_9RHOB Length = 208 Score = 68.2 bits (165), Expect = 8e-11, Method: Composition-based stats. 
Identities = 26/134 (19%), Positives = 42/134 (31%), Gaps = 17/134 (12%) Query: 11 VGVPVPWP-SATPPTGWLKCNGAAFSAEEYPELAKAYPT---------NKLPDLRGEFIR 60 VG P PTGW + NG + E L T LPDLRG Sbjct: 36 VGEIAPMGIVNFCPTGWAETNGQLLAISENSALFALIGTTFGGDGNVSFGLPDLRGRIPV 95 Query: 61 GWDDGRGIDT-------GRSILSIQGYATEDHAHGLPSRSTIVTDATINFYFDEIWVNSG 113 G G G+ G+ +++ H+H + + ++ + G Sbjct: 96 GQGTGTGLSPRSWGQSSGQQTVTLTTNQLAAHSHAVNATNSDGNFPGPGGKILAAAPDGG 155 Query: 114 TDIIKRGNTNDAGL 127 + + A + Sbjct: 156 SGQETIYSDQPANV 169 >UniRef50_D0KG77 Tail Collar domain protein n=1 Tax=Pectobacterium wasabiae WPP163 RepID=D0KG77_PECWW Length = 270 Score = 68.2 bits (165), Expect = 1e-10, Method: Composition-based stats. Identities = 27/162 (16%), Positives = 43/162 (26%), Gaps = 17/162 (10%) Query: 6 GSALPVGVPVPWPSATPPTGWLKCNGAAFSAEEYPELAKAY----------PTNKLPDLR 55 G +G ++ P+G+L G + S Y L LPDLR Sbjct: 33 GDEPYIGSVCYMVTSYCPSGYLPAAGQSLSINTYQALYSLIGNLWGGSQQTGNFTLPDLR 92 Query: 56 GEFIRGWDDG-------RGIDTGRSILSIQGYATEDHAHGLPSRSTIVTDATINFYFDEI 108 G + G G RG G ++ H H T + + + Sbjct: 93 GRSLVGSGQGTGLSLITRGQSLGAETATLAASNIAPHTHPTTQSLTNTFNVLVPATTGNL 152 Query: 109 WVNSGTDIIKRGNTNDAGLPAPDYGTFKTYKQSVDGLGAAAS 150 V + + P +V A + Sbjct: 153 NVTAALPLAAATPATGGATPTAGANFLTAISATVPVGAATQN 194 >UniRef50_B0USC5 Phage Tail Collar domain protein n=1 Tax=Haemophilus somnus 2336 RepID=B0USC5_HAES2 Length = 652 Score = 67.8 bits (164), Expect = 1e-10, Method: Composition-based stats. 
Identities = 31/199 (15%), Positives = 59/199 (29%), Gaps = 35/199 (17%) Query: 2 GLGEGSALP------VGVPVPWPSATPPTGWLKCN--GAAFSAEEYPELAKAY-----PT 48 LG + LP VG+ + + P GW+ + E YPEL K Sbjct: 264 ALGNSNKLPDLRRSNVGMTAYFATDKIPEGWIAFDEIKEKVKKETYPELYKYLIEKYTSI 323 Query: 49 NKLPDLRGEFIRGWDDG---RGIDTGRSILSIQGYATEDHAHGLPSRSTIVTDATINFYF 105 + +P F+R +G + G I + + + + + F Sbjct: 324 DNVPKAEDRFLRNAHNGLKVGDVQLGSLIGTDSIDGNGAFSPYVKAIKNTYQETVDQVGF 383 Query: 106 DEIWVNSGTDIIKRGNTNDAGLPAPDYGTFKTYKQSVDGLGAAASE-------------- 151 D + + + N + + +GA +E Sbjct: 384 DPLLIGDIGNGRTAVNNEHSPDSGQPETKKQYNAGLSWSVGAGRNEDLNQPIQNNKSGNG 443 Query: 152 -----TRPRNIAFNYIVRA 165 TRP+++ ++A Sbjct: 444 HFVGVTRPKSLVLKLCIKA 462 >UniRef50_Q66BF2 Hypothetical phage protein n=1 Tax=Yersinia pseudotuberculosis RepID=Q66BF2_YERPS Length = 711 Score = 67.4 bits (163), Expect = 2e-10, Method: Composition-based stats. Identities = 34/180 (18%), Positives = 53/180 (29%), Gaps = 35/180 (19%) Query: 20 ATPPTGWLKCNGAAFSAEEYP-----ELAKAYP------------------------TNK 50 P GW +G S +P L+ YP T + Sbjct: 387 NYIPAGWAPADGQLLSRNLFPFALAEILSAKYPIVADDSWLFYKDQRSSFSVGDGSTTFR 446 Query: 51 LPDLRGE----FIRGWDDGRGIDTGRSILSIQGYATEDHAHGLPSRSTIVTDATINFYFD 106 +PDL G+ R + G G ++ + IQG A+ G + + Sbjct: 447 IPDLNGKSHDSMGRVFLGGDGKNSLGEMGRIQGDASRR-ITGTFGGIGGQLNVSYGLVIG 505 Query: 107 EIWVNSGTDIIKRGNTNDAGLPAPDYGTFKTYKQSVDGLGAAASETRPRNIAFNYIVRAA 166 E + G + + P G + A E RP N YI++ A Sbjct: 506 ETAGAFTRTGVATGRPVPSNIGEPALGELG-VSFDSALVNPTAIENRPINATGCYIIKLA 564 >UniRef50_D1BW55 Tail Collar domain protein n=1 Tax=Xylanimonas cellulosilytica DSM 15894 RepID=D1BW55_XYLCX Length = 443 Score = 67.1 bits (162), Expect = 2e-10, Method: Composition-based stats. 
Identities = 36/160 (22%), Positives = 53/160 (33%), Gaps = 25/160 (15%) Query: 6 GSALPVGVPVPWPSATPPTGWLKCNGAAFSAEEYPELAKAYP----------TNKLPDLR 55 +A PVG+ T P GWL+ NGAA S YP L Y T LP+ + Sbjct: 223 NAACPVGMEA--GFHTVPPGWLEHNGAAVSRTTYPALFAHYGTTYGAGDGSTTFNLPNAK 280 Query: 56 GEFIRGWDDGR------GIDTGRSILSIQGYATEDHAHGLPSRSTIVTDATINFYFDEIW 109 G G D + G G ++ H H + T + D Sbjct: 281 GRTPVGLDTAQAEFNAVGKTGGAKTHTLSTAEMPSHTHTSAA-------HTHSINHDHAA 333 Query: 110 VNSGTDIIKRGNTNDAGLPAPDYGTFKTYKQSVDGLGAAA 149 V S + ++ +G+ Y + S +G Sbjct: 334 VTSSSAGSHTHGSSTSGITDRAYFARGSAPASSATVGTNG 373 >UniRef50_C9QG11 Probable tail fiber protein n=1 Tax=Vibrio orientalis CIP 102891 RepID=C9QG11_VIBOR Length = 497 Score = 66.7 bits (161), Expect = 3e-10, Method: Composition-based stats. Identities = 33/154 (21%), Positives = 52/154 (33%), Gaps = 36/154 (23%) Query: 11 VGVPVPWPSATPPTGWLKCNGAAFSAEEYPELAKAYPTNKLPD------LRGEFIRGWDD 64 VG+P W + P + A Y LA+ YP D +R EF+R D Sbjct: 367 VGMPFYWLDTSAPEWAVLEINVDLPAVVYWRLARRYPALVSDDSINTGEIRAEFLRVLDL 426 Query: 65 GRGIDTGRSILSIQGYATEDHAHGLPSRSTIVTDATINFYFDEIWVNSGTDIIKRGNTND 124 GRGI+ + + + +H+H P+ Sbjct: 427 GRGINPAQGLNEFSDASVGEHSHRYPTGG------------------------------V 456 Query: 125 AGLPAPDYGTFKTYKQSVDGLGAAASETRPRNIA 158 AG+ +GT T + E +PR++A Sbjct: 457 AGIGPYLHGTSWTGGYATTEPFNQGQENKPRSVA 490 >UniRef50_B9K0L5 Putative uncharacterized protein n=1 Tax=Agrobacterium vitis S4 RepID=B9K0L5_AGRVS Length = 224 Score = 66.7 bits (161), Expect = 3e-10, Method: Composition-based stats. 
Identities = 26/162 (16%), Positives = 39/162 (24%), Gaps = 23/162 (14%) Query: 13 VPVPWPSATPPTGWLKCNGAAFSAEEYPELAKAY---------PTNKLPDLRGEFIRGWD 63 +P P GWL C G + +Y + LPDLRG+ G+ Sbjct: 7 TILPVGFNYAPDGWLMCWGQKLTINQYNAVYSLVSNFYGGDQQTYFNLPDLRGQMPIGYG 66 Query: 64 DGRGIDTGRSIL--------SIQGYATEDHAHGLPSRSTIVTDATINFYFDEIWVNSGTD 115 +I S+ H H T I Sbjct: 67 QRTPTSPNYAIGNKGGNDTVSLNSTQIPAHTHAAVFTPTGNATVNIPAQTGTQTATMKAS 126 Query: 116 IIKRGNTNDAGLPAPDYGTFKTYKQSVDGLGAAASETRPRNI 157 + P G+ + A+ T P + Sbjct: 127 PAAGTSQL------PTAGSALAGGNTAATRIYGAASTTPVTL 162 >UniRef50_A3Y8Q8 Putative uncharacterized protein n=1 Tax=Marinomonas sp. MED121 RepID=A3Y8Q8_9GAMM Length = 303 Score = 66.7 bits (161), Expect = 3e-10, Method: Composition-based stats. Identities = 31/120 (25%), Positives = 40/120 (33%), Gaps = 21/120 (17%) Query: 39 YPELAKAYPTNKLPDLRGEFIRGWDDGRGIDTGRSILSIQGYATEDHAHGLPSRSTIVTD 98 Y T LP + GEFIR +DDGRG+D GR Q T+ + S Sbjct: 198 YWGEGNGVTTFTLPIVGGEFIRMFDDGRGVDAGRGFADYQSDLTKIPNGVILRVSNFGNG 257 Query: 99 ATINFYFDEIWVNSGTDIIKRGNTNDAGLPAPDYGTFKTYKQSVDGLGAAASETRPRNIA 158 +G D+ N + + ETRPRNIA Sbjct: 258 DGSYDLSGTSTSANGNDLHTAPRGNSSEVYY---------------------ETRPRNIA 296 >UniRef50_C5AIB0 Phage Tail Collar n=1 Tax=Burkholderia glumae BGR1 RepID=C5AIB0_BURGB Length = 205 Score = 66.7 bits (161), Expect = 3e-10, Method: Composition-based stats. 
Identities = 27/189 (14%), Positives = 40/189 (21%), Gaps = 37/189 (19%) Query: 12 GVPVPWPSATPPTGWLKCNGAAFSAEEYPELAKAYP---------TNKLPDLRGEFIRGW 62 G P GW C G + + + L T LPD R G Sbjct: 7 GEIRMVAFDFAPAGWALCLGQSVTIAQNNALFALLGTAYGGTGVTTFNLPDFRSRSPVGV 66 Query: 63 DDG--------RGIDTGRSILSIQGYATEDHAHGLPSRSTIVTDATINFYFDEIWVN--- 111 G RG G +++ H H T + Sbjct: 67 GTGAPGLTPVTRGQQGGTETVTLTTNQLPTHTHVATVAGGGGTSTISISIPATTNTSAPQ 126 Query: 112 ---------SGTDIIKRGNTNDAGLPAPDYGTFKTYKQSVDGLGAAASET--------RP 154 T + A + S+T R Sbjct: 127 SAPANNMVLGPASSSGHSATIYSTAAANTNLLPFNASVTTAPPTVTNSQTGMGTPFPIRN 186 Query: 155 RNIAFNYIV 163 + N+I+ Sbjct: 187 PYLGVNFII 195 >UniRef50_A9DEL7 Tail fiber protein 2 n=1 Tax=Yersinia phage PY100 RepID=A9DEL7_9CAUD Length = 640 Score = 66.3 bits (160), Expect = 3e-10, Method: Composition-based stats. Identities = 39/202 (19%), Positives = 56/202 (27%), Gaps = 50/202 (24%) Query: 15 VPWPSA--TPPTGWLKCNGAAFSAEEYPELAKAYPTNKL-----------PDLRGEFIRG 61 V W S P G L +G +P L + ++ P LRG + G Sbjct: 63 VMWHSTQKHLPAGCLLSDGQEVDRATWPSLFEEIEAGRVPVVPEADWLANPKLRGSYTLG 122 Query: 62 --WDDGRGID-TGRSILSI------------------QGYATEDHAHGLPSRSTIVT--- 97 + R D GRS+ S+ Q A + H H + Sbjct: 123 DVVNTFRVPDYNGRSVGSLGRIFLGGDGQNAGLDGQIQESANKRHNHAITDNGHSHGVND 182 Query: 98 -DATINFYFDEIWVNSGTDIIKRGNTNDAGLPAPDYGTFKTYKQSVDGLGAAASET---- 152 + G I + A S G+ SE+ Sbjct: 183 AGHSHEKSAWVANPAGGGQIYRDPEVWITTNAADKVEVHYKTGVSTSGISLQESESGITL 242 Query: 153 --------RPRNIAFNYIVRAA 166 RP N+A YI+R A Sbjct: 243 AEDGEADARPSNVAGCYIIRGA 264 Score = 53.2 bits (126), Expect = 3e-06, Method: Composition-based stats. 
Identities = 30/193 (15%), Positives = 55/193 (28%), Gaps = 43/193 (22%) Query: 5 EGSALPVGVPVPWP-SATPPTGWLKCNGAAFSAEEYPEL--------------------- 42 +G+A VG P + P G + +G S E YP L Sbjct: 295 QGNAGYVGKVDWHPLRESVPHGRIPADGQLLSRELYPALWEAVRDRRVPVTTEELWNSDG 354 Query: 43 --------AKAYPTNKLPDLRGEFI----RGWDDGRGIDTGRSILSIQGYATEDHAHGLP 90 ++PD G+ G+ G G+++ IQG A + Sbjct: 355 KRRGCYTEGDGSTNFRVPDYNGKTSGSLGAGFLRGDGLNSLSESGMIQGDAIRNIKGYA- 413 Query: 91 SRSTIVTDATINFYFDEIWVNSGTDIIKRGNTNDAGLPAPDYGTFKTYKQSVDGLGAAAS 150 + Y + + +G G + + V A+ Sbjct: 414 --------GAVANYKALASSGALSLDADQGPMYVNGANTGVWAALRNMSIDVSKAVPTAA 465 Query: 151 ETRPRNIAFNYIV 163 + P N+ +++ Sbjct: 466 DNHPVNVTGCFVI 478 >UniRef50_B8CRW5 Phage Tail Collar n=1 Tax=Shewanella piezotolerans WP3 RepID=B8CRW5_SHEPW Length = 215 Score = 65.9 bits (159), Expect = 4e-10, Method: Composition-based stats. Identities = 25/161 (15%), Positives = 40/161 (24%), Gaps = 19/161 (11%) Query: 11 VGVPVPWPSATPPTGWLKCNGAAFSAEEYPELAKAYP---------TNKLPDLRGEFIRG 61 +G + P W C G + L T + DLRG + G Sbjct: 6 IGEIRAVGFSFAPRNWAICKGQLMEISQNASLFSLIGSNYGGDGRVTFGIADLRGRTVTG 65 Query: 62 WDDGRGIDT-------GRSILSIQGYATEDHAHGLPSRSTIVTDATINF---YFDEIWVN 111 G GR +++Q H H L + S T + Sbjct: 66 QGQPPGQANRVIGQLGGRQNITLQSAQLPPHNHPLNASSKEGTTSDPTNAVLATGSGSSK 125 Query: 112 SGTDIIKRGNTNDAGLPAPDYGTFKTYKQSVDGLGAAASET 152 I + P + + S + + T Sbjct: 126 VTVSIPAGIPVSGQIGSGPGASPLQNGQTSDSKTASGSVTT 166 Database: uniref50.fasta Posted date: Mar 8, 2010 10:38 AM Number of letters in database: 1,040,396,356 Number of sequences in database: 3,077,464 Lambda K H 0.308 0.130 0.383 Lambda K H 0.267 0.0404 0.140 Matrix: BLOSUM62 Gap Penalties: Existence: 11, Extension: 1 Number of Hits to DB: 1,004,906,962 Number of Sequences: 3077464 Number of extensions: 39785983 Number of successful extensions: 114616 Number of sequences better than 1.0e-01: 250 Number of HSP's better than 0.1 without gapping: 739 Number of HSP's successfully gapped in prelim test: 355 Number of HSP's that attempted gapping 
in prelim test: 111488
Number of HSP's gapped (non-prelim): 2148
length of query: 166
length of database: 1,040,396,356
effective HSP length: 119
effective length of query: 47
effective length of database: 674,178,140
effective search space: 31686372580
effective search space used: 31686372580
T: 11
A: 40
X1: 16 ( 7.1 bits)
X2: 38 (14.6 bits)
X3: 64 (24.7 bits)
S1: 42 (21.6 bits)
S2: 88 (38.5 bits)
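
The trailing statistics can be cross-checked with a little arithmetic. The sketch below is not part of the BLAST output. It assumes the usual Karlin-Altschul relations (effective lengths obtained by subtracting the effective HSP length, bit score S' = (lambda*S - ln K) / ln 2, and E = search space * 2^-S'), and it assumes that the first Lambda/K pair printed above is the ungapped set and the second is the gapped set, since the labels were lost in this copy. The function names are illustrative only.

```python
import math

def effective_search_space(query_len, db_len, num_seqs, hsp_len):
    """Return (effective query length, effective db length, search space)."""
    eff_query = query_len - hsp_len
    eff_db = db_len - num_seqs * hsp_len
    return eff_query, eff_db, eff_query * eff_db

def bit_score(raw_score, lam, k):
    """Convert a raw score to bits using Karlin-Altschul parameters."""
    return (lam * raw_score - math.log(k)) / math.log(2)

def expect(bits, search_space):
    """Expected number of chance hits with at least this bit score."""
    return search_space * 2.0 ** (-bits)

# Values copied from the statistics block of this report.
eff_q, eff_db, space = effective_search_space(166, 1_040_396_356, 3_077_464, 119)
print(eff_q, eff_db, space)

# S2 (raw 88) with the second Lambda/K pair, S1 (raw 42) with the first pair.
print(round(bit_score(88, 0.267, 0.0404), 1))
print(round(bit_score(42, 0.308, 0.130), 1))

# E-value for a 75.5-bit hit; the report rounds bits, so expect only
# agreement in order of magnitude with the listed 5e-13 / 6e-13.
print(expect(75.5, space))
```

Under these assumptions the computed lengths and search space match the figures printed in the report, and the two bit-score conversions reproduce the values shown next to S2 and S1.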