BLASTP 2.2.22 [Sep-27-2009] Reference: Altschul, Stephen F., Thomas L. Madden, Alejandro A. Schaffer, Jinghui Zhang, Zheng Zhang, Webb Miller, and David J. Lipman (1997), "Gapped BLAST and PSI-BLAST: a new generation of protein database search programs", Nucleic Acids Res. 25:3389-3402. Reference for compositional score matrix adjustment: Altschul, Stephen F., John C. Wootton, E. Michael Gertz, Richa Agarwala, Aleksandr Morgulis, Alejandro A. Schaffer, and Yi-Kuo Yu (2005) "Protein database searches using compositionally adjusted substitution matrices", FEBS J. 272:5101-5109. Reference for composition-based statistics starting in round 2: Schaffer, Alejandro A., L. Aravind, Thomas L. Madden, Sergei Shavirin, John L. Spouge, Yuri I. Wolf, Eugene V. Koonin, and Stephen F. Altschul (2001), "Improving the accuracy of PSI-BLAST protein database searches with composition-based statistics and other refinements", Nucleic Acids Res. 29:2994-3005. Query= batch____ (146 letters) Database: uniref50.fasta 3,077,464 sequences; 1,040,396,356 total letters Searching..................................................done Results from round 1 Score E Sequences producing significant alignments: (bits) Value UniRef50_P77656 Uncharacterized protein yfdK n=23 Tax=Enterobact... 299 2e-80 UniRef50_Q8KT34 Tail fiber assembly protein n=31 Tax=Enterobacte... 194 8e-49 UniRef50_A4W725 Phage tail assembly chaperone gp38 n=1 Tax=Enter... 153 2e-36 UniRef50_Q66W74 Putative phage tail fiber assembly protein n=1 T... 151 5e-36 UniRef50_C7BRV4 Phage tail fiber assembly protein n=5 Tax=Photor... 139 4e-32 UniRef50_D2TR89 Putative phage tail fibre assembly protein n=1 T... 131 5e-30 UniRef50_P09154 Uncharacterized protein ymfS n=2 Tax=Escherichia... 128 6e-29 UniRef50_Q8FJG5 Putative uncharacterized protein yfdK n=1 Tax=Es... 124 9e-28 UniRef50_C7BRK4 Tail fiber assembly protein n=7 Tax=Enterobacter... 124 1e-27 UniRef50_C7BRK2 Similar to probable tail fiber assembly protein ... 120 1e-26 UniRef50_B4TAQ6 Tail fiber assembly protein n=8 Tax=Salmonella e... 119 4e-26 UniRef50_B3XF69 Phage tail assembly chaperone gp38 n=2 Tax=Esche... 110 2e-23 UniRef50_C4V089 Bacteriophage tail fiber assembly protein n=1 Ta... 107 2e-22 UniRef50_Q2NQF0 Hypothetical phage protein n=1 Tax=Sodalis gloss... 105 5e-22 UniRef50_C4UND0 Tail assembly chaperone gp38 n=2 Tax=Yersinia ru... 92 5e-18 UniRef50_Q2NWF3 Putative uncharacterized protein n=1 Tax=Sodalis... 91 1e-17 UniRef50_B7UGJ5 Predicted tail fiber assembly protein n=1 Tax=Es... 90 3e-17 UniRef50_A4W7Q4 Phage tail assembly chaperone gp38 n=1 Tax=Enter... 89 5e-17 UniRef50_D2TYN1 Phage tail assembly protein n=1 Tax=Arsenophonus... 87 2e-16 UniRef50_UPI0001826513 hypothetical protein EcanA3_06430 n=1 Tax... 82 4e-15 UniRef50_C4U6G1 Phage tail assembly chaperone gp38 n=2 Tax=Enter... 81 1e-14 UniRef50_Q9T1R1 Probable tail fiber assembly protein n=8 Tax=roo... 79 5e-14 UniRef50_Q2NRZ4 Putative uncharacterized protein n=1 Tax=Sodalis... 77 1e-13 UniRef50_C9XYE1 Tail fiber assembly protein homolog from lambdoi... 75 8e-13 UniRef50_C4SF02 Tail assembly chaperone gp38 n=1 Tax=Yersinia mo... 74 1e-12 UniRef50_B1JGV6 Tail assembly chaperone gp38 n=2 Tax=Yersinia ps... 70 2e-11 UniRef50_B1JPG7 Tail assembly chaperone gp38 n=1 Tax=Yersinia ps... 70 2e-11 UniRef50_C4UND1 Conserved hypothetical phage tail fiber protein ... 66 4e-10 UniRef50_C4TT86 Conserved hypothetical phage tail fiber protein ... 65 9e-10 UniRef50_B7M8C3 Putative phage tail fiber assembly protein n=1 T... 63 2e-09 UniRef50_B2K2J2 Putative uncharacterized protein n=1 Tax=Yersini... 60 2e-08 UniRef50_A7ZYE1 Putative tail fiber assembly protein n=1 Tax=Esc... 60 3e-08 UniRef50_Q31HT5 Putative uncharacterized protein n=1 Tax=Thiomic... 59 5e-08 UniRef50_O22005 Probable tail fiber assembly protein n=4 Tax=roo... 55 5e-07 UniRef50_Q83M74 Putative phage tail fibre protein n=1 Tax=Shigel... 55 1e-06 UniRef50_B2K1B5 Putative uncharacterized protein n=6 Tax=Yersini... 54 1e-06 UniRef50_B1JS57 Tail assembly chaperone gp38 n=3 Tax=Yersinia Re... 53 2e-06 UniRef50_C6CP80 Tail assembly chaperone gp38 n=2 Tax=Dickeya Rep... 52 8e-06 UniRef50_Q47427 Tail fiber assembly protein homolog n=47 Tax=roo... 50 2e-05 UniRef50_B7UG05 Predicted tail fiber assembly protein n=1 Tax=Es... 49 4e-05 UniRef50_B7NSC3 Putative tail fiber assembly protein (Possibly p... 49 4e-05 UniRef50_D0ZBI3 Phage tail assembly chaperone gp38 n=1 Tax=Edwar... 48 8e-05 UniRef50_P77326 Putative tail fiber assembly protein homolog fro... 48 1e-04 UniRef50_UPI0001C3422A phage tail fiber assembly protein n=1 Tax... 47 2e-04 UniRef50_A4TJD5 Phage tail fiber assembly protein n=22 Tax=Yersi... 45 6e-04 UniRef50_B7NA68 Putative uncharacterized protein n=1 Tax=Escheri... 44 0.002 UniRef50_A4WEL2 Phage tail assembly chaperone gp38 n=1 Tax=Enter... 42 0.005 UniRef50_A9I964 Putative phage tail fibre protein n=1 Tax=Bordet... 42 0.008 UniRef50_P03740 Tail fiber assembly protein n=117 Tax=root RepID... 41 0.011 UniRef50_D2TR90 Putative phage tail fibre assembly protein n=1 T... 40 0.017 UniRef50_B2Q1Q3 Putative uncharacterized protein n=1 Tax=Provide... 40 0.021 UniRef50_C6C5D3 Tail assembly chaperone gp38 n=1 Tax=Dickeya dad... 39 0.036 UniRef50_D2U2G7 Phage tail fiber assembly protein n=1 Tax=Arseno... 39 0.051 UniRef50_A5A5F3 Tail fiber assembly n=4 Tax=Pseudomonas aerugino... 39 0.053 UniRef50_C7BTY5 Hypothetical phage protein n=1 Tax=Photorhabdus ... 39 0.071 >UniRef50_P77656 Uncharacterized protein yfdK n=23 Tax=Enterobacteriaceae RepID=YFDK_ECOLI Length = 146 Score = 299 bits (765), Expect = 2e-80, Method: Compositional matrix adjust. Identities = 146/146 (100%), Positives = 146/146 (100%) Query: 1 MNYIYSATTNSFYPLEMKEDYTQAGSWPDDAVEVDEQVYIEFSGLPPKGKIRIAGENGFP 60 MNYIYSATTNSFYPLEMKEDYTQAGSWPDDAVEVDEQVYIEFSGLPPKGKIRIAGENGFP Sbjct: 1 MNYIYSATTNSFYPLEMKEDYTQAGSWPDDAVEVDEQVYIEFSGLPPKGKIRIAGENGFP 60 Query: 61 AWSEIPPPTHEEQIAAAELKKQQLINQANDYMNSKQWAGKAAIGRLKGEELAQYNLWLDY 120 AWSEIPPPTHEEQIAAAELKKQQLINQANDYMNSKQWAGKAAIGRLKGEELAQYNLWLDY Sbjct: 61 AWSEIPPPTHEEQIAAAELKKQQLINQANDYMNSKQWAGKAAIGRLKGEELAQYNLWLDY 120 Query: 121 LDALELVDTSSAPDIEWPTPPAVQAR 146 LDALELVDTSSAPDIEWPTPPAVQAR Sbjct: 121 LDALELVDTSSAPDIEWPTPPAVQAR 146 >UniRef50_Q8KT34 Tail fiber assembly protein n=31 Tax=Enterobacteriaceae RepID=Q8KT34_ECOLX Length = 145 Score = 194 bits (493), Expect = 8e-49, Method: Compositional matrix adjust. Identities = 96/143 (67%), Positives = 110/143 (76%), Gaps = 1/143 (0%) Query: 4 IYSATTNSFYPLEMKEDYTQAGSWPDDAVEVDEQVYIEFS-GLPPKGKIRIAGENGFPAW 62 YS + N F +K+DY A SWPDDA+ V + VY EF+ PP KIR+AG+NG P W Sbjct: 2 FYSPSLNIFVNPALKDDYINANSWPDDALAVSDDVYNEFAINTPPYDKIRVAGKNGLPTW 61 Query: 63 SEIPPPTHEEQIAAAELKKQQLINQANDYMNSKQWAGKAAIGRLKGEELAQYNLWLDYLD 122 + IPPP+HEE I AE ++Q L+NQAN+YMNSKQW GKAAIGRLK EELA YNLWLDYLD Sbjct: 62 ALIPPPSHEELIQQAESERQLLLNQANEYMNSKQWPGKAAIGRLKDEELALYNLWLDYLD 121 Query: 123 ALELVDTSSAPDIEWPTPPAVQA 145 ALELVDTSSAPDIEWPTPP QA Sbjct: 122 ALELVDTSSAPDIEWPTPPVTQA 144 >UniRef50_A4W725 Phage tail assembly chaperone gp38 n=1 Tax=Enterobacter sp. 638 RepID=A4W725_ENT38 Length = 142 Score = 153 bits (386), Expect = 2e-36, Method: Compositional matrix adjust. Identities = 78/141 (55%), Positives = 96/141 (68%), Gaps = 2/141 (1%) Query: 2 NYIYSATTNSFYPLEMKEDYTQAGSWPDDAVEVDEQVYIEFSGLPPKGKIRIAGENGFPA 61 YI+S + FYP E + + G WP D VEV + +P +GK I +G+PA Sbjct: 3 KYIWSPSLAGFYPTEEQSIFEGLGGWPTDGVEVSASAHDALFPIP-EGKC-IGTVDGYPA 60 Query: 62 WSEIPPPTHEEQIAAAELKKQQLINQANDYMNSKQWAGKAAIGRLKGEELAQYNLWLDYL 121 W ++PPPTHEE +A AE++KQ I+ AN Y+NSKQW GKAA+GRLK E AQYNLWLDYL Sbjct: 61 WIDLPPPTHEEMVAQAEIEKQSRIDAANAYINSKQWPGKAAMGRLKDTEKAQYNLWLDYL 120 Query: 122 DALELVDTSSAPDIEWPTPPA 142 D LE VDTS+APDI WP PPA Sbjct: 121 DELEAVDTSTAPDITWPEPPA 141 >UniRef50_Q66W74 Putative phage tail fiber assembly protein n=1 Tax=Klebsiella pneumoniae RepID=Q66W74_KLEPN Length = 142 Score = 151 bits (382), Expect = 5e-36, Method: Compositional matrix adjust. Identities = 72/143 (50%), Positives = 97/143 (67%), Gaps = 1/143 (0%) Query: 1 MNYIYSATTNSFYPLEMKEDYTQAGSWPDDAVEVDEQVYIEFSGLPPKGKIRIAGENGFP 60 M I+S N F+P+ +K DY AGSWP D ++V VY+EF+ PP+GK+R +N P Sbjct: 1 MEIIFSPGQNKFFPVPLKTDYENAGSWPTDGIDVGYDVYLEFTANPPEGKVRGVVDN-MP 59 Query: 61 AWSEIPPPTHEEQIAAAELKKQQLINQANDYMNSKQWAGKAAIGRLKGEELAQYNLWLDY 120 AW + PT E+ + A +K+++LI +AN+Y+ KQWAGKA++GRL +E AQYN WLDY Sbjct: 60 AWVDKSSPTQEQLVTQAAVKQKRLITEANEYIGLKQWAGKASLGRLSDDERAQYNAWLDY 119 Query: 121 LDALELVDTSSAPDIEWPTPPAV 143 LD LE V APDI WPTPP + Sbjct: 120 LDELEAVKPEDAPDIIWPTPPVM 142 >UniRef50_C7BRV4 Phage tail fiber assembly protein n=5 Tax=Photorhabdus RepID=C7BRV4_PHOAA Length = 141 Score = 139 bits (349), Expect = 4e-32, Method: Compositional matrix adjust. Identities = 74/140 (52%), Positives = 96/140 (68%), Gaps = 3/140 (2%) Query: 3 YIYSATTNSFYPLEMKEDYTQAGSWPDDAVEVDEQVYIEFSG-LPPKGKIRIAGENGFPA 61 Y YSATTN+FYP+E K+DY AGS+P+DAVEVD+ V+IEF+G +PPKGK RIAG+NG P Sbjct: 2 YYYSATTNAFYPVEWKQDYINAGSFPNDAVEVDKSVFIEFAGSIPPKGKYRIAGKNGLPE 61 Query: 62 WSEIPPPTHEEQIAAAELKKQQLINQANDYMNSKQWAGKAAIGRLKGEELAQYNLWLDYL 121 W++IPPPT EE I+ AE +K Q I+ AN+ + A + I EE+ W Y Sbjct: 62 WADIPPPTKEELISIAESQKAQFISLANEKIMPLSDAEELDIAT--DEEMLLLKEWKKYR 119 Query: 122 DALELVDTSSAPDIEWPTPP 141 L VDTS+AP+I+WP P Sbjct: 120 VMLNRVDTSNAPEIDWPITP 139 >UniRef50_D2TR89 Putative phage tail fibre assembly protein n=1 Tax=Citrobacter rodentium ICC168 RepID=D2TR89_CITRO Length = 141 Score = 131 bits (330), Expect = 5e-30, Method: Compositional matrix adjust. Identities = 62/115 (53%), Positives = 86/115 (74%), Gaps = 1/115 (0%) Query: 28 PDDAVEVDEQVYIEFSGLP-PKGKIRIAGENGFPAWSEIPPPTHEEQIAAAELKKQQLIN 86 P +A+EV +Y EF+G+ P GK+ A ++G+P W + PPP+H+E IA AE +KQ+LI+ Sbjct: 22 PSNAIEVSADIYNEFAGVAWPDGKVLGADDSGYPTWIDAPPPSHDELIAQAEAEKQRLID 81 Query: 87 QANDYMNSKQWAGKAAIGRLKGEELAQYNLWLDYLDALELVDTSSAPDIEWPTPP 141 + N ++N +QW K A+GRL +E AQ+N WLDYLDA+ VDTS+APDIEWPTPP Sbjct: 82 ETNVWINGQQWPSKLALGRLSEDEKAQFNEWLDYLDAVSAVDTSTAPDIEWPTPP 136 >UniRef50_P09154 Uncharacterized protein ymfS n=2 Tax=Escherichia coli RepID=YMFS_ECOLI Length = 137 Score = 128 bits (321), Expect = 6e-29, Method: Compositional matrix adjust. Identities = 62/102 (60%), Positives = 77/102 (75%), Gaps = 6/102 (5%) Query: 51 IRIAGENGFPAWSEIPPP------THEEQIAAAELKKQQLINQANDYMNSKQWAGKAAIG 104 ++ E G +S PP TH++++A A +KQ LI+ A D++NS+QW GKAA+G Sbjct: 36 LKSQAEGGVIDFSVFPPSIKEVIRTHDDEVADANFQKQMLISDATDFINSRQWQGKAALG 95 Query: 105 RLKGEELAQYNLWLDYLDALELVDTSSAPDIEWPTPPAVQAR 146 RLK +EL QYNLWLDYL+ALELVDTSSAPDIEWPTPPAVQAR Sbjct: 96 RLKEDELKQYNLWLDYLEALELVDTSSAPDIEWPTPPAVQAR 137 >UniRef50_Q8FJG5 Putative uncharacterized protein yfdK n=1 Tax=Escherichia coli O6 RepID=Q8FJG5_ECOL6 Length = 158 Score = 124 bits (311), Expect = 9e-28, Method: Compositional matrix adjust. Identities = 59/139 (42%), Positives = 86/139 (61%) Query: 3 YIYSATTNSFYPLEMKEDYTQAGSWPDDAVEVDEQVYIEFSGLPPKGKIRIAGENGFPAW 62 +I+ F +K +Y + WP + V++ + EF PP+GKI A +NG PAW Sbjct: 15 FIWDKVNARFMAYILKNEYERNRMWPKEGVDISNETACEFMKQPPEGKILGADDNGMPAW 74 Query: 63 SEIPPPTHEEQIAAAELKKQQLINQANDYMNSKQWAGKAAIGRLKGEELAQYNLWLDYLD 122 ++PP T+ E +A A+ +KQ I +A +Y+N+KQW GKA +GRL EL YN+WLDY++ Sbjct: 75 IDMPPLTYTELVAKAKTEKQARIIEAVNYINNKQWQGKALLGRLNDTELKMYNIWLDYIE 134 Query: 123 ALELVDTSSAPDIEWPTPP 141 ALE +D S A D +PT P Sbjct: 135 ALEAIDPSKASDTAFPTKP 153 >UniRef50_C7BRK4 Tail fiber assembly protein n=7 Tax=Enterobacteriaceae RepID=C7BRK4_PHOAA Length = 144 Score = 124 bits (311), Expect = 1e-27, Method: Compositional matrix adjust. Identities = 59/140 (42%), Positives = 87/140 (62%), Gaps = 2/140 (1%) Query: 2 NYIYSATTNSFYPLEMKEDYTQAGSWPDDAVEVDEQVYIEFSGLPPKGKIRIAGENGFPA 61 NY++SA +FYPL +++DY +A SWP+D + V + ++ +FSG+PP GKI +GE+G P Sbjct: 5 NYVFSALNKAFYPLSLQQDYIEADSWPNDPISVTDDIFYKFSGMPPIGKILSSGEDGLPC 64 Query: 62 WSEIPPPTHEEQIAAAELKKQQLINQANDYMNSKQWAGKAAIGRLKGEELAQYNLWLDYL 121 W +IPPPT EE I+ AE ++ + I AN+ + A + I EEL W Y Sbjct: 65 WEDIPPPTKEELISIAEAQRSKFIFLANEKITPLADAVELDIA--TNEELLSLKAWKKYR 122 Query: 122 DALELVDTSSAPDIEWPTPP 141 L +DTS+AP+I+WP P Sbjct: 123 VMLNRIDTSTAPEIDWPIAP 142 >UniRef50_C7BRK2 Similar to probable tail fiber assembly protein n=1 Tax=Photorhabdus asymbiotica subsp. asymbiotica ATCC 43949 RepID=C7BRK2_PHOAA Length = 144 Score = 120 bits (302), Expect = 1e-26, Method: Compositional matrix adjust. Identities = 61/140 (43%), Positives = 85/140 (60%), Gaps = 2/140 (1%) Query: 2 NYIYSATTNSFYPLEMKEDYTQAGSWPDDAVEVDEQVYIEFSGLPPKGKIRIAGENGFPA 61 NY++SA +FYP+ +++DY AGSWP+D + V + ++ EFSG+PP GKI +GE+ P Sbjct: 5 NYVFSALNKAFYPISLQQDYIAAGSWPNDPLPVTDDIFNEFSGIPPAGKILSSGEDALPC 64 Query: 62 WSEIPPPTHEEQIAAAELKKQQLINQANDYMNSKQWAGKAAIGRLKGEELAQYNLWLDYL 121 W +IPPPT EE I AE +K Q I+ AN+ + A + I EE+ W Y Sbjct: 65 WEDIPPPTKEELIYIAENQKAQFISLANEKITPLSDAEELDIAT--DEEMLLLKEWKKYR 122 Query: 122 DALELVDTSSAPDIEWPTPP 141 L VDTS+AP I+WP P Sbjct: 123 VMLNRVDTSNAPKIDWPITP 142 >UniRef50_B4TAQ6 Tail fiber assembly protein n=8 Tax=Salmonella enterica subsp. enterica RepID=B4TAQ6_SALHS Length = 121 Score = 119 bits (297), Expect = 4e-26, Method: Compositional matrix adjust. Identities = 59/113 (52%), Positives = 80/113 (70%), Gaps = 1/113 (0%) Query: 34 VDEQVYIEFSGLPPKGKIRIAGENGFPAWSEIPPPTHEEQIAAAELKKQQLINQANDYMN 93 DE+V + PPKG + + NG AW IPPPT ++ I+AA +K++ I+QAN++MN Sbjct: 10 TDEEVDKYYMKTPPKG-MYLGSSNGRIAWVCIPPPTQDDLISAANQEKKKRIDQANEHMN 68 Query: 94 SKQWAGKAAIGRLKGEELAQYNLWLDYLDALELVDTSSAPDIEWPTPPAVQAR 146 S++W GKAA+GRL G+ELAQYNLWLDYLDAL+ VDTS A +I P A+ + Sbjct: 69 SRRWPGKAALGRLTGDELAQYNLWLDYLDALKAVDTSVAQNIACPIRKAIHIK 121 >UniRef50_B3XF69 Phage tail assembly chaperone gp38 n=2 Tax=Escherichia coli 101-1 RepID=B3XF69_ECOLX Length = 142 Score = 110 bits (274), Expect = 2e-23, Method: Compositional matrix adjust. Identities = 65/145 (44%), Positives = 87/145 (60%), Gaps = 5/145 (3%) Query: 3 YIYSATTNSFYPLEMKEDYTQAGSWPDDAVEVDEQVYIE-FSGLPPKGKIRIAGENGFPA 61 Y++SA N F+P+ KE + +G WPDD V V E+ + + F +PP+ +I NG PA Sbjct: 2 YVWSAKANGFFPISEKEKFEASGLWPDDGVIVSEEEHKKLFMDIPPRK--QIGTLNGKPA 59 Query: 62 WSEIPPPTHEEQIAAAELKKQQLINQANDYMNSKQWAGKAAIGRLKGEELAQYNLWLDYL 121 +IP PT +E IA AE+KK QL +A+ ++ +Q A A I EE + W Y Sbjct: 60 LIDIPQPTKKELIAIAEVKKSQLREKADSEISWRQDAVDADIA--TDEETSTLTEWKKYR 117 Query: 122 DALELVDTSSAPDIEWPTPPAVQAR 146 L VDTS+APDIEWPTPPAV AR Sbjct: 118 VLLMRVDTSTAPDIEWPTPPAVHAR 142 >UniRef50_C4V089 Bacteriophage tail fiber assembly protein n=1 Tax=Yersinia rohdei ATCC 43380 RepID=C4V089_YERRO Length = 139 Score = 107 bits (266), Expect = 2e-22, Method: Compositional matrix adjust. Identities = 60/141 (42%), Positives = 79/141 (56%), Gaps = 3/141 (2%) Query: 1 MNYIYSATTNSFYPLEMKEDYTQAGSWPDDAVEVDEQVYIEFSGLPPKGKIRIAGENGFP 60 M YSA NSFYP E++E Y AGSWPDDAVEV +++ EF P GK R A +G P Sbjct: 1 MKVFYSAIDNSFYPDELREQYVTAGSWPDDAVEVSNKLFQEFIT-APAGKERKADTDGMP 59 Query: 61 AWSEIPPPTHEEQIAAAELKKQQLINQANDYMNSKQWAGKAAIGRLKGEELAQYNLWLDY 120 W ++ PP+ E IA AE KK +L++QAN + Q A + +E+ W Y Sbjct: 60 RWVDVQPPSEIELIAQAEYKKTELMSQANSEIAPLQDA--VDLNMANADEVTALQTWKKY 117 Query: 121 LDALELVDTSSAPDIEWPTPP 141 L VD +AP+I+WP P Sbjct: 118 RVLLNRVDIDAAPEIDWPVAP 138 >UniRef50_Q2NQF0 Hypothetical phage protein n=1 Tax=Sodalis glossinidius str. 'morsitans' RepID=Q2NQF0_SODGM Length = 173 Score = 105 bits (262), Expect = 5e-22, Method: Compositional matrix adjust. Identities = 54/133 (40%), Positives = 75/133 (56%), Gaps = 3/133 (2%) Query: 2 NYIYSATTNSFYPLEMKEDYTQAGSWPDDAVEVDEQVYIEFSGLPPKGKIRIAGENGFPA 61 Y++S TT +FYPL K+ Y AGSWPDD VEV E ++++F PP GK+R NG+P Sbjct: 11 RYVFSGTTGAFYPLSRKQGYVDAGSWPDDGVEVKEDIFMKFQN-PPPGKLRGGDANGYPC 69 Query: 62 WSEIPPPTHEEQIAAAELKKQQLINQANDYMNSKQWAGKAAIGRLKGEELAQYNLWLDYL 121 W + PPPT E++ +A L K+ +++A + Q A +G E A W Y Sbjct: 70 WVDTPPPTLEDERRSAALTKKSQLDEAGRIIGPLQDA--VDLGMTTNTEKASLLTWKKYR 127 Query: 122 DALELVDTSSAPD 134 L VD S+APD Sbjct: 128 MLLNRVDISTAPD 140 >UniRef50_C4UND0 Tail assembly chaperone gp38 n=2 Tax=Yersinia ruckeri RepID=C4UND0_YERRU Length = 140 Score = 92.0 bits (227), Expect = 5e-18, Method: Compositional matrix adjust. Identities = 55/140 (39%), Positives = 76/140 (54%), Gaps = 3/140 (2%) Query: 2 NYIYSATTNSFYPLEMKEDYTQAGSWPDDAVEVDEQVYIEFSGLPPKGKIRIAGENGFPA 61 NY++SA F + Y AG D +++D+ +YIEF+G PP GK R NG PA Sbjct: 3 NYVWSAQNRVFLAEALLPSYDDAGWNLSDIIKIDDSIYIEFNGNPPVGKQR-GVINGMPA 61 Query: 62 WSEIPPPTHEEQIAAAELKKQQLINQANDYMNSKQWAGKAAIGRLKGEELAQYNLWLDYL 121 W ++PPPT E I++A +K +L + A+ + +Q A G E+A W Y Sbjct: 62 WVDLPPPTSGELISSANAEKSRLKSIADSGIEWRQDAVND--GSASDREIADLAAWRKYR 119 Query: 122 DALELVDTSSAPDIEWPTPP 141 AL +DTS APDIEWP P Sbjct: 120 VALMRIDTSKAPDIEWPLKP 139 >UniRef50_Q2NWF3 Putative uncharacterized protein n=1 Tax=Sodalis glossinidius str. 'morsitans' RepID=Q2NWF3_SODGM Length = 133 Score = 90.9 bits (224), Expect = 1e-17, Method: Compositional matrix adjust. Identities = 40/90 (44%), Positives = 58/90 (64%) Query: 3 YIYSATTNSFYPLEMKEDYTQAGSWPDDAVEVDEQVYIEFSGLPPKGKIRIAGENGFPAW 62 Y +SA T SF+P+ M DY +AGS PDD V+VDE + +F PP GK R A G+PAW Sbjct: 22 YKFSARTGSFFPVSMLNDYIKAGSLPDDLVDVDETTFWQFCASPPSGKQRGANAQGYPAW 81 Query: 63 SEIPPPTHEEQIAAAELKKQQLINQANDYM 92 ++PPPT EE + ++ K++L+++ M Sbjct: 82 IDVPPPTPEEARLSVDVTKRRLMDEVTRAM 111 >UniRef50_B7UGJ5 Predicted tail fiber assembly protein n=1 Tax=Escherichia coli O127:H6 str. E2348/69 RepID=B7UGJ5_ECO27 Length = 138 Score = 89.7 bits (221), Expect = 3e-17, Method: Compositional matrix adjust. Identities = 58/145 (40%), Positives = 80/145 (55%), Gaps = 8/145 (5%) Query: 1 MNYIYSATTNSFYPLEMKEDYTQAGSWPDDAVEVDEQVYIEFSGLPPKGKIRIAGENGFP 60 MN Y SFYP +K+ Y AGSWP++ +VD++ ++G+ P+GK A +NG P Sbjct: 1 MNKFYKG---SFYPEALKDVYISAGSWPENGADVDDETMAIYTGVAPEGKTLGADKNGNP 57 Query: 61 AWSEIPPPTHEEQIAAAELKKQQLINQANDYMNSKQWAGKAAIGRLKGEELAQYNLWLDY 120 AW +IPP + E+QI AE K+ L + A+ + +Q A A I EE A + W Y Sbjct: 58 AWIDIPPLSAEQQIIQAEQKRTVLRSMADKEIVWRQDAFDAEIA--TAEETAALSEWKKY 115 Query: 121 LDALELVDTSSAPDIEWPTPPAVQA 145 L VDTS+ WPTPP QA Sbjct: 116 RVLLMRVDTSNP---VWPTPPGEQA 137 >UniRef50_A4W7Q4 Phage tail assembly chaperone gp38 n=1 Tax=Enterobacter sp. 638 RepID=A4W7Q4_ENT38 Length = 140 Score = 88.6 bits (218), Expect = 5e-17, Method: Compositional matrix adjust. Identities = 51/146 (34%), Positives = 78/146 (53%), Gaps = 10/146 (6%) Query: 1 MNYIYSATTNSFYPLEMKEDYTQAGSWPDDAVEVDEQVYIEFSGLPPKGKIRIAGENGFP 60 M Y YSA TN+FYP + ++Y + G++PDDAV V E + ++S P GK+R+A + G P Sbjct: 1 MEYYYSAKTNAFYPDILIDEYKKHGTFPDDAVLVTEACFNQYSADPEPGKMRVADKKGMP 60 Query: 61 AWSEIPPPTHEEQIAAAELKKQQLINQANDYM----NSKQWAGKAAIGRLKGEELAQYNL 116 +W + P PT E + A +K L +QA+ + ++K++ + L +E Sbjct: 61 SWGDQPEPTREMMVYQASEQKNALRDQADKIIAPLKDAKEYGIITEVEDLVLKE------ 114 Query: 117 WLDYLDALELVDTSSAPDIEWPTPPA 142 W Y L VD + PDI WP P Sbjct: 115 WAIYRYNLSKVDVETYPDINWPVKPV 140 >UniRef50_D2TYN1 Phage tail assembly protein n=1 Tax=Arsenophonus nasoniae RepID=D2TYN1_9ENTR Length = 140 Score = 86.7 bits (213), Expect = 2e-16, Method: Compositional matrix adjust. Identities = 48/150 (32%), Positives = 79/150 (52%), Gaps = 22/150 (14%) Query: 3 YIYSATTNSFYPLEMKEDYTQAGSWPDDAVEVDEQVYIEFSGLP-PKGKIRIAGENGFPA 61 Y YS N FYP E+K+ Y +AGS+P D +EVD+ VY EF+ K+R++G++GFP Sbjct: 2 YFYSPKENLFYPNELKDIYIEAGSFPSDVIEVDDAVYFEFTKYDLSDNKVRVSGKDGFPK 61 Query: 62 WSEIPPPTHEEQIAAAELKKQQLINQANDYMN---------SKQWAGKAAIGRLKGEELA 112 W EE+I+ K+QLI + +N ++ W + +G + + + Sbjct: 62 W-------EEEKIS-----KRQLIEDTKEKINFLLKEVKNVTQIWQTQLTLGIITDSDKS 109 Query: 113 QYNLWLDYLDALELVDTSSAPDIEWPTPPA 142 + W+ Y L+ +D + DI WP+ P+ Sbjct: 110 KLTDWMIYAQKLQQIDLKNINDISWPSKPS 139 >UniRef50_UPI0001826513 hypothetical protein EcanA3_06430 n=1 Tax=Enterobacter cancerogenus ATCC 35316 RepID=UPI0001826513 Length = 148 Score = 82.4 bits (202), Expect = 4e-15, Method: Compositional matrix adjust. Identities = 54/140 (38%), Positives = 72/140 (51%), Gaps = 2/140 (1%) Query: 2 NYIYSATTNSFYPLEMKEDYTQAGSWPDDAVEVDEQVYIEFSGLPPKGKIRIAGENGFPA 61 Y YSA TN+FY + DY QAGS PDD E+ Q Y GK+ E+G P Sbjct: 10 TYFYSAETNAFYVSALMSDYDQAGSLPDDISEISNQWYEYLISGQATGKVITPDEHGKPV 69 Query: 62 WSEIPPPTHEEQIAAAELKKQQLINQANDYMNSKQWAGKAAIGRLKGEELAQYNLWLDYL 121 SE PPT +E AE +K +LI +A + + Q A + +G +EL+ + Y Sbjct: 70 LSEPEPPTPQELREIAEGEKSRLIREAGEAIAVLQDADE--LGMATDDELSALSRLKRYR 127 Query: 122 DALELVDTSSAPDIEWPTPP 141 L +D S+APDIEWP P Sbjct: 128 VILNRLDISTAPDIEWPEKP 147 >UniRef50_C4U6G1 Phage tail assembly chaperone gp38 n=2 Tax=Enterobacteriaceae RepID=C4U6G1_YERAL Length = 51 Score = 80.9 bits (198), Expect = 1e-14, Method: Compositional matrix adjust. Identities = 38/50 (76%), Positives = 42/50 (84%) Query: 92 MNSKQWAGKAAIGRLKGEELAQYNLWLDYLDALELVDTSSAPDIEWPTPP 141 MNSKQW GKAA+GRLK +E AQYN WLDYLD LE VDTS+APDI+WP P Sbjct: 1 MNSKQWPGKAAMGRLKDDEKAQYNAWLDYLDLLEEVDTSTAPDIDWPVAP 50 >UniRef50_Q9T1R1 Probable tail fiber assembly protein n=8 Tax=root RepID=TFA_BPAPS Length = 155 Score = 78.6 bits (192), Expect = 5e-14, Method: Compositional matrix adjust. Identities = 52/141 (36%), Positives = 70/141 (49%), Gaps = 3/141 (2%) Query: 2 NYIYSATTNSFYPLEMKEDYTQAGSWPDDAVEVDEQVYIEFSGLP-PKGKIRIAGENGFP 60 Y + +++ MK+DY +AGSW D A V VY EF+ P P GK E G P Sbjct: 16 TYYFGQRKLAWFAGSMKKDYIEAGSWDDKAKAVPYSVYREFALNPAPIGKTLGISEKGDP 75 Query: 61 AWSEIPPPTHEEQIAAAELKKQQLINQANDYMNSKQWAGKAAIGRLKGEELAQYNLWLDY 120 W +IPP T + I AE KK L+ A + ++ Q A + EE + W Y Sbjct: 76 IWVDIPPKTKHQLITEAEDKKSGLMQGAREVISPLQDAIDLEMA--TQEETQKLTAWKRY 133 Query: 121 LDALELVDTSSAPDIEWPTPP 141 L +DTS+APDI+WP P Sbjct: 134 RVLLNRLDTSNAPDIDWPKKP 154 >UniRef50_Q2NRZ4 Putative uncharacterized protein n=1 Tax=Sodalis glossinidius str. 'morsitans' RepID=Q2NRZ4_SODGM Length = 142 Score = 77.4 bits (189), Expect = 1e-13, Method: Compositional matrix adjust. Identities = 50/141 (35%), Positives = 67/141 (47%), Gaps = 3/141 (2%) Query: 2 NYIYSATTNSFYPLEMKEDYTQA-GSWPDDAVEVDEQVYIEFSGLPPKGKIRIAGENGFP 60 +Y YSATTN FY + MK Y + WP+DAV V ++Y K KI A +NG P Sbjct: 3 DYFYSATTNGFYHISMKSIYEDSDNGWPEDAVPVSNELYQALLEGQSKNKIIKANKNGMP 62 Query: 61 AWSEIPPPTHEEQIAAAELKKQQLINQANDYMNSKQWAGKAAIGRLKGEELAQYNLWLDY 120 + P PT E+ ++ A+ KK L+ QA + Q A + E +W Y Sbjct: 63 VLGDRPAPTEEQNLSMAQSKKSMLLEQATGKIIPLQDA--VDLNMATQVEETTLLMWKKY 120 Query: 121 LDALELVDTSSAPDIEWPTPP 141 L +D S A DI WP P Sbjct: 121 RVMLTRLDVSKATDIAWPQCP 141 >UniRef50_C9XYE1 Tail fiber assembly protein homolog from lambdoid prophage e14 n=1 Tax=Cronobacter turicensis RepID=C9XYE1_CROTZ Length = 200 Score = 74.7 bits (182), Expect = 8e-13, Method: Compositional matrix adjust. Identities = 44/112 (39%), Positives = 60/112 (53%), Gaps = 5/112 (4%) Query: 36 EQVYIEFSGLPPKGKIRIAGENGFPAWSEIPPPTHEEQIAAAELK-----KQQLINQAND 90 E V I G P G R A F W+ T+E+ AA+++ K L AN+ Sbjct: 89 EPVKITLPGDYPAGTTRAAPATRFDVWNGKAWVTNEDARRAADVENAKAMKSSLRGMANE 148 Query: 91 YMNSKQWAGKAAIGRLKGEELAQYNLWLDYLDALELVDTSSAPDIEWPTPPA 142 ++ +QW + +GRL +E A + WLDYL+AL VDTS APDI+WP PA Sbjct: 149 IISQQQWPSRLTLGRLNEQEQAAFTAWLDYLEALAAVDTSRAPDIQWPQLPA 200 >UniRef50_C4SF02 Tail assembly chaperone gp38 n=1 Tax=Yersinia mollaretii ATCC 43969 RepID=C4SF02_YERMO Length = 152 Score = 74.3 bits (181), Expect = 1e-12, Method: Compositional matrix adjust. Identities = 50/141 (35%), Positives = 69/141 (48%), Gaps = 3/141 (2%) Query: 1 MNYIYSATTNSFYPLEMKEDYTQAGSWPDDAVEVDEQVYIEFSGLPPKGKIRIAGENGFP 60 M +S T F L+M ED + + S+ D D + + PP GK + G P Sbjct: 14 MKIFFSPTILGFRTLDMVEDGSYSDSYGDFVELSDSERLNYWKQSPPCGKT-LGVATGRP 72 Query: 61 AWSEIPPPTHEEQIAAAELKKQQLINQANDYMNSKQWAGKAAIGRLKGEELAQYNLWLDY 120 AW ++PPPTHEE +A+A KK QL A+ + +Q A G +E+ W Y Sbjct: 73 AWVDLPPPTHEELVASAIAKKNQLKAAADSEIEWRQDA--VDDGSASEKEIVDLAAWRKY 130 Query: 121 LDALELVDTSSAPDIEWPTPP 141 AL +DTS AP +EWP P Sbjct: 131 RLALMRIDTSKAPGVEWPESP 151 >UniRef50_B1JGV6 Tail assembly chaperone gp38 n=2 Tax=Yersinia pseudotuberculosis YPIII RepID=B1JGV6_YERPY Length = 145 Score = 70.1 bits (170), Expect = 2e-11, Method: Compositional matrix adjust. Identities = 53/150 (35%), Positives = 74/150 (49%), Gaps = 15/150 (10%) Query: 1 MNYIYSATTNSFYPLEMKEDYTQAG-SWPDDAVEVDEQVYIEF-SGLPPKGKIRIAGENG 58 M +SAT F P E + D T +WP DAV + + +EF P GK+ + Sbjct: 1 MMIYFSATIGGFIPGEWRVDGTYTDETWPTDAVLLTDIESVEFWKRTAPSGKM-LGSVKY 59 Query: 59 FPAWSEIPPPTHEEQ-------IAAAELKKQQLINQANDYMNSKQWAGKAAIGRLKGEEL 111 P W ++P PT E +A A+LKK +LI+ A D + + + G+ K EL Sbjct: 60 RPVWVDLPTPTAVEVASQKAGFVAQAKLKKSKLISDARDRIEILK--DRIEAGQDKAAEL 117 Query: 112 AQYNLWLDYLDALELVDTSSAPDIEWPTPP 141 LW Y AL+ +D S+APDIEWP P Sbjct: 118 ---KLWKSYRIALDDIDVSAAPDIEWPVAP 144 >UniRef50_B1JPG7 Tail assembly chaperone gp38 n=1 Tax=Yersinia pseudotuberculosis YPIII RepID=B1JPG7_YERPY Length = 142 Score = 70.1 bits (170), Expect = 2e-11, Method: Compositional matrix adjust. Identities = 49/140 (35%), Positives = 70/140 (50%), Gaps = 3/140 (2%) Query: 5 YSATTNSFYPLEMKEDYT-QAGSWPDDAVEVDEQVYIEFSGLPPKGKIRIAGENGFPAWS 63 YSAT N F P E + D T +WP DAV + ++ ++ + P G + +G PAW Sbjct: 4 YSATLNGFIPAEWRFDGTYNINTWPGDAVLLSDKESDKYWKVTPAGGKVLGSVSGRPAWV 63 Query: 64 EIPPPTHEEQIAAAELKKQQLINQANDYMNSKQWAGKAAIGRLKGEELAQYNLWLDYLDA 123 +IPP T +E I A K+ A+ ++ +Q A A K E+++ W Y A Sbjct: 64 DIPPITIDELIYCAVQNKRVRKEVADSEIDWRQDAVDAEEASKK--EISELAAWKKYRVA 121 Query: 124 LELVDTSSAPDIEWPTPPAV 143 L +D S APDI WP P V Sbjct: 122 LMRIDISKAPDINWPESPNV 141 >UniRef50_C4UND1 Conserved hypothetical phage tail fiber protein n=2 Tax=Enterobacteriaceae RepID=C4UND1_YERRU Length = 165 Score = 65.9 bits (159), Expect = 4e-10, Method: Compositional matrix adjust. Identities = 36/94 (38%), Positives = 55/94 (58%), Gaps = 6/94 (6%) Query: 2 NYIYSATTNSFYPLEMKEDYTQAGSWP-DDAVEVDEQVYIEFSGLPPKGKIRIAGENGFP 60 YIYSA N+F+P++ +DY+ W DAVEV + V +EF G P GK+R AG +G P Sbjct: 3 TYIYSAKNNAFFPVDYLDDYSH---WDLSDAVEVSDGVAMEFMGGAPIGKVRAAGVDGHP 59 Query: 61 AWSEIPP--PTHEEQIAAAELKKQQLINQANDYM 92 W++ PP P ++++A+ + + A D M Sbjct: 60 CWTDKPPALPLSDDELASLARQYRDAFIVATDNM 93 >UniRef50_C4TT86 Conserved hypothetical phage tail fiber protein n=1 Tax=Yersinia kristensenii ATCC 33638 RepID=C4TT86_YERKR Length = 161 Score = 64.7 bits (156), Expect = 9e-10, Method: Compositional matrix adjust. Identities = 28/64 (43%), Positives = 41/64 (64%) Query: 3 YIYSATTNSFYPLEMKEDYTQAGSWPDDAVEVDEQVYIEFSGLPPKGKIRIAGENGFPAW 62 Y +SATT SFYP E+ + YT AG+ P D +E+ + +Y +F+ P GK+R A + G P W Sbjct: 2 YCFSATTLSFYPKELLDVYTDAGTLPSDLIEIGDDIYAQFAAQQPAGKMRGADKKGKPVW 61 Query: 63 SEIP 66 +P Sbjct: 62 VNVP 65 >UniRef50_B7M8C3 Putative phage tail fiber assembly protein n=1 Tax=Escherichia coli IAI1 RepID=B7M8C3_ECO8A Length = 146 Score = 63.2 bits (152), Expect = 2e-09, Method: Compositional matrix adjust. Identities = 40/108 (37%), Positives = 59/108 (54%), Gaps = 1/108 (0%) Query: 35 DEQVYIEFSGLPPKGKIRIAGENGFPAWSEIPPPTHEEQIAAAELKKQQLINQANDYMNS 94 D Q I S + + I A +G + P +H+EQ+A AE +KQ +I+ A ++ Sbjct: 37 DNQQLINISDISEQPGIGWAYSDGVFSAPLPPERSHDEQVADAEHQKQSMIDAAMVNISV 96 Query: 95 KQWAGKAAIGRLKGEELAQYNLWLDYLDALELVDTSSAPDIEWPTPPA 142 Q +A +L EE + N+ LDY+DA+ DTS+APDIEWP P Sbjct: 97 IQLKLQAG-RKLTQEETTRLNVVLDYIDAVTATDTSTAPDIEWPDEPC 143 >UniRef50_B2K2J2 Putative uncharacterized protein n=1 Tax=Yersinia pseudotuberculosis PB1/+ RepID=B2K2J2_YERPB Length = 149 Score = 60.5 bits (145), Expect = 2e-08, Method: Compositional matrix adjust. Identities = 46/147 (31%), Positives = 74/147 (50%), Gaps = 15/147 (10%) Query: 5 YSATTNSFYPLEMKEDYTQAGSWPDDAVEV-DEQVYIEFSGLPPKGKIRIAGENGFPAWS 63 +S + FYP M +D T P D +++ D + + +PP G++ + G P W Sbjct: 9 FSPANSMFYPQYMIDDGTFHADLPTDLIDITDAENTTYWRQMPPPGQV-LGVIKGRPGWV 67 Query: 64 EIPPPTHEEQ-------IAAAELKKQQLINQANDYMNSKQWAGKAAIGRLKGEELAQYNL 116 ++PPP+ + A A+ KK +LI A+D ++ + + +G+ K +EL L Sbjct: 68 DLPPPSAIDIAAKKAALTAQAKAKKTKLIGDASDEIDVLK--DRIELGQDKADEL---KL 122 Query: 117 WLDYLDALELVDTSSAPDIEWPTPPAV 143 W Y AL+ +D S APDI WP P V Sbjct: 123 WKSYRIALDDIDVS-APDINWPESPNV 148 >UniRef50_A7ZYE1 Putative tail fiber assembly protein n=1 Tax=Escherichia coli HS RepID=A7ZYE1_ECOHS Length = 137 Score = 59.7 bits (143), Expect = 3e-08, Method: Compositional matrix adjust. Identities = 42/113 (37%), Positives = 61/113 (53%), Gaps = 11/113 (9%) Query: 35 DEQVYIEFSGLPPKGKIR-IAGENGFPAWSEIPPPT----HEEQIAAAELKKQQLINQAN 89 D Q I S + + I + + GF A PPT H+E +A AE KKQ L++ A Sbjct: 29 DNQQLINISDISEQPGIGWVYSDGGFTA-----PPTQERSHDELVADAEQKKQSLLDAAM 83 Query: 90 DYMNSKQWAGKAAIGRLKGEELAQYNLWLDYLDALELVDTSSAPDIEWPTPPA 142 ++ Q +A +L EE + N+ LDY++A+ +DTS+APDI WP PA Sbjct: 84 ANISVIQLKLQAG-RKLTQEETTRLNVVLDYIEAVTAIDTSTAPDIIWPVFPA 135 >UniRef50_Q31HT5 Putative uncharacterized protein n=1 Tax=Thiomicrospira crunogena XCL-2 RepID=Q31HT5_THICR Length = 194 Score = 58.9 bits (141), Expect = 5e-08, Method: Compositional matrix adjust. Identities = 34/95 (35%), Positives = 54/95 (56%), Gaps = 4/95 (4%) Query: 1 MNYIYSATTNSFYPLEMKEDYTQAGSWPDDAVEV-DEQVYIEFSGLPPKGKIRIAGENGF 59 M YSAT N+F+ +K DY Q SWP DA+++ D +V P+GK A NG Sbjct: 1 MTIHYSATKNAFFDDALKSDYEQFNSWPSDAIKMTDAEVSTYHGKQSPQGKQLGADANGR 60 Query: 60 PAWSEIPPPTHEEQIAAAELKKQQLINQANDYMNS 94 P W ++PPP+ + AA K +Q+ ++A ++++ Sbjct: 61 PIWVDLPPPSLGDANAA---KSKQINDEAQKFIDA 92 >UniRef50_O22005 Probable tail fiber assembly protein n=4 Tax=root RepID=TFA_BPSF5 Length = 167 Score = 55.5 bits (132), Expect = 5e-07, Method: Compositional matrix adjust. Identities = 45/141 (31%), Positives = 59/141 (41%), Gaps = 10/141 (7%) Query: 1 MNYIYSATTNSFYPLEMKEDYTQAGSWPDDAVEVDEQVYIEFSGLPPKGKIRIAGENGFP 60 M+Y YSA+TN FY E + PDDAVE+ E + K+ GENG P Sbjct: 33 MSYFYSASTNGFYSTEF-----HGTNIPDDAVEISESEWETLINSQGVTKMITCGENGHP 87 Query: 61 AWSEIPPPTHEEQIAAAELKKQQLINQANDYMNSKQWAGKAAIGRLKGEELAQYNLWLDY 120 + P PT E + KK LI +A + + Q A +G +E W Y Sbjct: 88 VIVDRPSPTPERLALINDEKKSALIAEATNVIAPLQDA--VDLGMATDDETKLLLAWEKY 145 Query: 121 LDALELVDTSSAPDIEWPTPP 141 L VD + EWP P Sbjct: 146 RVLLMRVDIKNT---EWPKKP 163 >UniRef50_Q83M74 Putative phage tail fibre protein n=1 Tax=Shigella flexneri RepID=Q83M74_SHIFL Length = 144 Score = 54.7 bits (130), Expect = 1e-06, Method: Compositional matrix adjust. Identities = 44/141 (31%), Positives = 59/141 (41%), Gaps = 10/141 (7%) Query: 1 MNYIYSATTNSFYPLEMKEDYTQAGSWPDDAVEVDEQVYIEFSGLPPKGKIRIAGENGFP 60 M+Y YSA+TN FY E + PDDAVE+ E + K+ GENG P Sbjct: 12 MSYFYSASTNGFYSTEF-----HGTNIPDDAVEISESEWKTLINAQSVTKMITCGENGHP 66 Query: 61 AWSEIPPPTHEEQIAAAELKKQQLINQANDYMNSKQWAGKAAIGRLKGEELAQYNLWLDY 120 + P PT E+ + KK LI +A + + Q A +G +E W Y Sbjct: 67 VIVDRPSPTPEQLALINDEKKSALIAEATNVIAPLQDA--VDLGMATDDETKLLLAWKKY 124 Query: 121 LDALELVDTSSAPDIEWPTPP 141 L V+ EWP P Sbjct: 125 RVLLMRVNVVKP---EWPMHP 142 >UniRef50_B2K1B5 Putative uncharacterized protein n=6 Tax=Yersinia RepID=B2K1B5_YERPB Length = 159 Score = 54.3 bits (129), Expect = 1e-06, Method: Compositional matrix adjust. Identities = 31/92 (33%), Positives = 49/92 (53%), Gaps = 4/92 (4%) Query: 5 YSATTNSFYPLEMKEDYTQAGSWPDDAVEVDEQVYIEF-SGLPPKGKIRIAGENGFPAWS 63 +SATT FYP E KE+Y GSWPDDA+ + ++ ++ +P GK+ + G P W Sbjct: 4 FSATTGGFYPQEWKEEYLATGSWPDDALLLTKKEQTKYWKHVPATGKM-LGVMKGRPVWL 62 Query: 64 EIP--PPTHEEQIAAAELKKQQLINQANDYMN 93 +IP P H + +AA + + + D + Sbjct: 63 DIPPLPAPHGDTLAALARRHRDAFIKTTDSIT 94 >UniRef50_B1JS57 Tail assembly chaperone gp38 n=3 Tax=Yersinia RepID=B1JS57_YERPY Length = 139 Score = 53.1 bits (126), Expect = 2e-06, Method: Compositional matrix adjust. Identities = 43/142 (30%), Positives = 65/142 (45%), Gaps = 5/142 (3%) Query: 1 MNYIYSATTNSFYPLEMKEDYTQAGSWPDDAVEV-DEQVYIEFSGLPPKGKIRIAGENGF 59 M ++S +F P M D + + + D + V DE++ + P GKI + +G Sbjct: 1 MKALFSPKLITFIPENMVVDGSYSHNITDSLIAVTDEELATYWRQNSPDGKI-LGVVDGR 59 Query: 60 PAWSEIPPPTHEEQIAAAELKKQQLINQANDYMNSKQWAGKAAIGRLKGEELAQYNLWLD 119 P W +P P HEE + + KK QL A+ ++ +Q A A K E++ W Sbjct: 60 PIWVNLPLPLHEELVLGSSTKKSQLKADADSEIDWRQDAVDAEEANKK--EISALAAWRK 117 Query: 120 YLDALELVDTSSAPDIEWPTPP 141 Y AL +D S P I WP P Sbjct: 118 YRIALMRIDVSHMP-ITWPIKP 138 >UniRef50_C6CP80 Tail assembly chaperone gp38 n=2 Tax=Dickeya RepID=C6CP80_DICZE Length = 131 Score = 51.6 bits (122), Expect = 8e-06, Method: Compositional matrix adjust. Identities = 25/77 (32%), Positives = 42/77 (54%), Gaps = 2/77 (2%) Query: 66 PPPTHEEQIAAAELKKQQLINQANDYMNSKQWAGKAAIGRLKGEELAQYNLWLDYLDALE 125 P +E IA A L+K QL+++A + + W +G + +E ++ W +Y+ ++ Sbjct: 56 PEKRRDEYIADATLRKTQLLSEAQKMIAN--WQTDLMLGVISDDEKSRLVRWREYMKQVD 113 Query: 126 LVDTSSAPDIEWPTPPA 142 +D SAPDI WP PP Sbjct: 114 AIDAQSAPDITWPVPPT 130 >UniRef50_Q47427 Tail fiber assembly protein homolog n=47 Tax=root RepID=TFAB_ECOLX Length = 203 Score = 50.1 bits (118), Expect = 2e-05, Method: Compositional matrix adjust. Identities = 27/69 (39%), Positives = 39/69 (56%), Gaps = 1/69 (1%) Query: 70 HEEQIAAAELKKQQLINQANDYMNSKQWAGKAAIGRLKGEELAQYNLWLDYLDALELVDT 129 H + AAE K+Q LI+ A D ++ Q +A +L E Q N LDY+D L +D Sbjct: 128 HSAAVEAAETKRQSLIDTAMDSISLIQLKLRAG-RKLTQAETTQLNSVLDYIDELNAMDL 186 Query: 130 SSAPDIEWP 138 ++APD+ WP Sbjct: 187 TTAPDLNWP 195 >UniRef50_B7UG05 Predicted tail fiber assembly protein n=1 Tax=Escherichia coli O127:H6 str. E2348/69 RepID=B7UG05_ECO27 Length = 122 Score = 49.3 bits (116), Expect = 4e-05, Method: Compositional matrix adjust. Identities = 26/77 (33%), Positives = 45/77 (58%), Gaps = 3/77 (3%) Query: 66 PPPTHEEQIAAAELKKQQLINQANDYMNSKQWAGKAAIGRLKGEELAQYNLWLDYLDALE 125 PPPTHE+ I AAE ++Q+L++ A+ M W + +G + A+ + WL Y + ++ Sbjct: 46 PPPTHEQLIQAAENERQRLLSAADAIM--LDWRTELMLGEISDANRAKLSAWLLYKNQVK 103 Query: 126 LVDTSSAPD-IEWPTPP 141 VD ++ P+ + WP P Sbjct: 104 AVDVTTDPEHVNWPVIP 120 >UniRef50_B7NSC3 Putative tail fiber assembly protein (Possibly partial) n=2 Tax=Escherichia coli RepID=B7NSC3_ECO7I Length = 136 Score = 49.3 bits (116), Expect = 4e-05, Method: Compositional matrix adjust. Identities = 25/78 (32%), Positives = 40/78 (51%), Gaps = 2/78 (2%) Query: 66 PPPTHEEQIAAAELKKQQLINQANDYMNSKQWAGKAAIGRLKGEELAQYNLWLDYLDALE 125 PP TH+E + AE ++Q L++ AN + W +G + LW +Y+++L Sbjct: 59 PPKTHDELLREAENERQCLLDSANSLI--MNWQSDLLLGIISENNKGNLLLWKEYVNSLM 116 Query: 126 LVDTSSAPDIEWPTPPAV 143 VD S P+I WP P + Sbjct: 117 SVDLSLVPEITWPERPEI 134 >UniRef50_D0ZBI3 Phage tail assembly chaperone gp38 n=1 Tax=Edwardsiella tarda EIB202 RepID=D0ZBI3_EDWTE Length = 193 Score = 48.1 bits (113), Expect = 8e-05, Method: Compositional matrix adjust. Identities = 27/82 (32%), Positives = 42/82 (51%), Gaps = 2/82 (2%) Query: 61 AWSEIPPPTHEEQIAAAELKKQQLINQANDYMNSKQWAGKAAIGRLKGEELAQYNLWLDY 120 AW H + AE KK L+++A + + W + +G + + A W+ Y Sbjct: 113 AWVTDLNAQHAANVELAEQKKSLLLSEAQEKIG--LWQTELQLGMITDSDKAALITWMTY 170 Query: 121 LDALELVDTSSAPDIEWPTPPA 142 + A++ VDTS+APDI WP PA Sbjct: 171 IKAVQAVDTSAAPDIAWPPKPA 192 >UniRef50_P77326 Putative tail fiber assembly protein homolog from prophage CPS-53 n=30 Tax=Enterobacteriaceae RepID=TFAS_ECOLI Length = 114 Score = 47.8 bits (112), Expect = 1e-04, Method: Compositional matrix adjust. Identities = 27/72 (37%), Positives = 41/72 (56%), Gaps = 1/72 (1%) Query: 70 HEEQIAAAELKKQQLINQANDYMNSKQWAGKAAIGRLKGEELAQYNLWLDYLDALELVDT 129 H + AAE ++Q LI+ A ++ Q +A +L E ++ N LDY+DA+ DT Sbjct: 42 HSVAVDAAEAQRQSLIDTAMASISLIQLKLQAG-RKLMQAETSRLNTVLDYIDAVTATDT 100 Query: 130 SSAPDIEWPTPP 141 S+APD+ WP P Sbjct: 101 STAPDVIWPELP 112 >UniRef50_UPI0001C3422A phage tail fiber assembly protein n=1 Tax=Enterobacter cancerogenus ATCC 35316 RepID=UPI0001C3422A Length = 124 Score = 47.0 bits (110), Expect = 2e-04, Method: Compositional matrix adjust. Identities = 33/114 (28%), Positives = 49/114 (42%), Gaps = 2/114 (1%) Query: 28 PDDAVEVDEQVYIEFSGLPPKGKIRIAGENGFPAWSEIPPPTHEEQIAAAELKKQQLINQ 87 P D +++ + Y EF P + R W +I PP+ EE + A+ K QL++ Sbjct: 10 PSDLMKITDLEYEEFMVSPDRKTPRFNINRNCMEWVDIAPPSKEEAVQHADSLKAQLMDV 69 Query: 88 ANDYMNSKQWAGKAAIGRLKGEELAQYNLWLDYLDALELVDTSSAPDIEWPTPP 141 A + Q A + +E W Y L +D + APDIEWP P Sbjct: 70 ATQAILPLQDAVDLDMA--TDKETILLTEWKKYRVRLNRIDVNVAPDIEWPESP 121 >UniRef50_A4TJD5 Phage tail fiber assembly protein n=22 Tax=Yersinia RepID=A4TJD5_YERPP Length = 139 Score = 45.4 bits (106), Expect = 6e-04, Method: Compositional matrix adjust. Identities = 32/95 (33%), Positives = 48/95 (50%), Gaps = 12/95 (12%) Query: 54 AGENGFPAWSEIPPPT-------HEEQIAAAELKKQQLINQANDYMNSKQWAGKAAIGRL 106 GE W+E P+ E + A+LKK +LI+ A+D + + + +G+ Sbjct: 49 TGEWTGGVWAETSGPSTIDISAQKAEFVTQAKLKKSKLISDASDRIEILK--DRIELGQD 106 Query: 107 KGEELAQYNLWLDYLDALELVDTSSAPDIEWPTPP 141 + EL LW Y AL+ +D S+APDIEWP P Sbjct: 107 RAAEL---KLWKSYRIALDDIDVSAAPDIEWPLKP 138 >UniRef50_B7NA68 Putative uncharacterized protein n=1 Tax=Escherichia coli UMN026 RepID=B7NA68_ECOLU Length = 134 Score = 43.5 bits (101), Expect = 0.002, Method: Compositional matrix adjust. Identities = 31/82 (37%), Positives = 42/82 (51%), Gaps = 4/82 (4%) Query: 66 PPP--THEEQIAAAELKKQQLINQANDYMNSKQWAGKAAIGRLKGEELAQYNLWLDYLDA 123 PPP +H +A AEL+K L+ AN+ + Q A + +E W Y Sbjct: 54 PPPERSHNALVAEAELQKSALLTVANNAIAPLQDAVDLEMA--TDDEQTLLLAWKKYRVL 111 Query: 124 LELVDTSSAPDIEWPTPPAVQA 145 L VDTS+AP+IEWPT P +A Sbjct: 112 LNRVDTSAAPEIEWPTQPGERA 133 >UniRef50_A4WEL2 Phage tail assembly chaperone gp38 n=1 Tax=Enterobacter sp. 638 RepID=A4WEL2_ENT38 Length = 198 Score = 42.4 bits (98), Expect = 0.005, Method: Compositional matrix adjust. Identities = 22/65 (33%), Positives = 36/65 (55%), Gaps = 2/65 (3%) Query: 77 AELKKQQLINQANDYMNSKQWAGKAAIGRLKGEELAQYNLWLDYLDALELVDTSSAPDIE 136 A KK L+++A+ ++ W +G + E+ A W+ Y+ AL VD ++APDI+ Sbjct: 135 ARQKKAGLLSEAHSTIS--LWQTGLQLGIISDEDKASLITWMTYIQALNAVDVTAAPDID 192 Query: 137 WPTPP 141 WP P Sbjct: 193 WPLMP 197 >UniRef50_A9I964 Putative phage tail fibre protein n=1 Tax=Bordetella petrii DSM 12804 RepID=A9I964_BORPD Length = 155 Score = 41.6 bits (96), Expect = 0.008, Method: Compositional matrix adjust. Identities = 29/112 (25%), Positives = 55/112 (49%), Gaps = 11/112 (9%) Query: 4 IYSATTNSFYPLEMKEDYTQAGSWPDDAVEVDEQVYIEFSGLPPKGKIRIAGENGFPAWS 63 YS T FY +M + P DAVE+ +++Y + GK +A E+GFPA + Sbjct: 2 FYSVETGGFYSAKM-----HGKAMPADAVEITDELYSQLLAGQSDGKRIVADESGFPALA 56 Query: 64 EIPPPTHEE-QIAAAELKKQQLINQA-----NDYMNSKQWAGKAAIGRLKGE 109 + PPT + ++ + ++ + + A +D N+ +A + A+ + + E Sbjct: 57 DPLPPTPAQIEVQKVAVVQKHMDDAARALRYDDIANAVTYAEEPAVPKFQAE 108 >UniRef50_P03740 Tail fiber assembly protein n=117 Tax=root RepID=TFA_LAMBD Length = 194 Score = 41.2 bits (95), Expect = 0.011, Method: Compositional matrix adjust. Identities = 26/72 (36%), Positives = 37/72 (51%), Gaps = 2/72 (2%) Query: 73 QIAAAELKKQQLINQANDYMNSKQWAGKAAIGRLKGEELAQYNLWLDYLDALELVDTSSA 132 +I AE K+ L+ A++++ Q A I EE + W Y L VDTS+A Sbjct: 125 RIREAEETKKSLMQVASEHIAPLQDAADLEIA--TKEETSLLEAWKKYRVLLNRVDTSTA 182 Query: 133 PDIEWPTPPAVQ 144 PDIEWP P ++ Sbjct: 183 PDIEWPAVPVME 194 >UniRef50_D2TR90 Putative phage tail fibre assembly protein n=1 Tax=Citrobacter rodentium ICC168 RepID=D2TR90_CITRO Length = 199 Score = 40.4 bits (93), Expect = 0.017, Method: Compositional matrix adjust. Identities = 21/64 (32%), Positives = 34/64 (53%), Gaps = 2/64 (3%) Query: 78 ELKKQQLINQANDYMNSKQWAGKAAIGRLKGEELAQYNLWLDYLDALELVDTSSAPDIEW 137 E +K L+ A D ++ W + +G + ++ A WL Y+ L+ VDT ++PDI W Sbjct: 136 EHQKTALLAAAQDTISI--WQTELQLGIISDDDKASLISWLSYIKELQTVDTDASPDINW 193 Query: 138 PTPP 141 P P Sbjct: 194 PVAP 197 >UniRef50_B2Q1Q3 Putative uncharacterized protein n=1 Tax=Providencia stuartii ATCC 25827 RepID=B2Q1Q3_PROST Length = 116 Score = 40.0 bits (92), Expect = 0.021, Method: Compositional matrix adjust. Identities = 26/74 (35%), Positives = 36/74 (48%), Gaps = 2/74 (2%) Query: 66 PPPTHEEQIAAAELKKQQLINQANDYMNSKQWAGKAAIGRLKGEELAQYNLWLDYLDALE 125 PPPT E+ IA AE +KQ L+N+A + Q A +G EE Q +W +Y + Sbjct: 42 PPPTKEQLIAEAEYQKQALLNEATAAIAPLQDA--VDLGIATDEEREQLRVWKEYRVEVN 99 Query: 126 LVDTSSAPDIEWPT 139 VD + WP Sbjct: 100 RVDVGLGLCVNWPV 113 >UniRef50_C6C5D3 Tail assembly chaperone gp38 n=1 Tax=Dickeya dadantii Ech703 RepID=C6C5D3_DICDC Length = 208 Score = 39.3 bits (90), Expect = 0.036, Method: Compositional matrix adjust. Identities = 25/70 (35%), Positives = 36/70 (51%), Gaps = 2/70 (2%) Query: 74 IAAAELKKQQLINQANDYMNSKQWAGKAAIGRLKGEELAQYNLWLDYLDALELVDTSSAP 133 I AA+ ++++ AND +N +A +G EE + + W YL L VD AP Sbjct: 141 IQAAKAEQEKRRRSANDRLNELTYA--INLGIATPEEASALSSWQAYLVLLSRVDFGHAP 198 Query: 134 DIEWPTPPAV 143 DI WPT P + Sbjct: 199 DIVWPTEPGM 208 >UniRef50_D2U2G7 Phage tail fiber assembly protein n=1 Tax=Arsenophonus nasoniae RepID=D2U2G7_9ENTR Length = 177 Score = 38.9 bits (89), Expect = 0.051, Method: Compositional matrix adjust. Identities = 26/69 (37%), Positives = 34/69 (49%), Gaps = 2/69 (2%) Query: 73 QIAAAELKKQQLINQANDYMNSKQWAGKAAIGRLKGEELAQYNLWLDYLDALELVDTSSA 132 QI A KK QLIN+A +N + K +G + + W Y L +DTS A Sbjct: 110 QINTANEKKLQLINEAEQIINPLE--RKVRLGMGNDIDASTLREWEIYSVKLNDIDTSIA 167 Query: 133 PDIEWPTPP 141 PDI+WP P Sbjct: 168 PDIDWPEKP 176 >UniRef50_A5A5F3 Tail fiber assembly n=4 Tax=Pseudomonas aeruginosa RepID=A5A5F3_PSEAE Length = 152 Score = 38.9 bits (89), Expect = 0.053, Method: Compositional matrix adjust. Identities = 26/105 (24%), Positives = 43/105 (40%), Gaps = 11/105 (10%) Query: 3 YIYSATTNSFYPLEMKEDYTQAGSWPDDAVEVDEQVYIEFSGLPPKGKIRIAGENGFPAW 62 Y +S + +FYP ++E Y AG WP D V +++ + G+ + NG P Sbjct: 5 YYFSPSQVAFYPASLREVYEHAGCWPVDGEWVSAELHEQLMNEQAAGRAISSDVNGNPVA 64 Query: 63 SEIPPPTHEEQIAAAELKKQQLINQANDYMNSKQWAGKAAIGRLK 107 E PP L +QQ + +S+ A + R + Sbjct: 65 IERPP-----------LSRQQRSTHERRWRDSQLLATDGLVVRHR 98 >UniRef50_C7BTY5 Hypothetical phage protein n=1 Tax=Photorhabdus asymbiotica subsp. asymbiotica ATCC 43949 RepID=C7BTY5_PHOAA Length = 178 Score = 38.5 bits (88), Expect = 0.071, Method: Compositional matrix adjust. Identities = 26/73 (35%), Positives = 36/73 (49%), Gaps = 2/73 (2%) Query: 69 THEEQIAAAELKKQQLINQANDYMNSKQWAGKAAIGRLKGEELAQYNLWLDYLDALELVD 128 T EE I AE ++ QL+ +AN+ + Q A I EE W Y L +D Sbjct: 106 TKEELIRKAEYERVQLLVKANNIIVPLQDAIDLNIA--TEEEKNTLLKWKKYRIMLNRID 163 Query: 129 TSSAPDIEWPTPP 141 S+ P+I WP+PP Sbjct: 164 ISTTPEIVWPSPP 176 Searching..................................................done Results from round 2 Score E Sequences producing significant alignments: (bits) Value Sequences used in model and found again: UniRef50_P77656 Uncharacterized protein yfdK n=23 Tax=Enterobact... 209 2e-53 UniRef50_Q8KT34 Tail fiber assembly protein n=31 Tax=Enterobacte... 173 2e-42 UniRef50_C7BRV4 Phage tail fiber assembly protein n=5 Tax=Photor... 171 8e-42 UniRef50_Q66W74 Putative phage tail fiber assembly protein n=1 T... 166 1e-40 UniRef50_C7BRK2 Similar to probable tail fiber assembly protein ... 166 2e-40 UniRef50_C7BRK4 Tail fiber assembly protein n=7 Tax=Enterobacter... 166 3e-40 UniRef50_Q2NQF0 Hypothetical phage protein n=1 Tax=Sodalis gloss... 161 6e-39 UniRef50_A4W725 Phage tail assembly chaperone gp38 n=1 Tax=Enter... 159 2e-38 UniRef50_C4V089 Bacteriophage tail fiber assembly protein n=1 Ta... 159 3e-38 UniRef50_B3XF69 Phage tail assembly chaperone gp38 n=2 Tax=Esche... 159 3e-38 UniRef50_UPI0001826513 hypothetical protein EcanA3_06430 n=1 Tax... 157 1e-37 UniRef50_A4W7Q4 Phage tail assembly chaperone gp38 n=1 Tax=Enter... 153 1e-36 UniRef50_C4UND0 Tail assembly chaperone gp38 n=2 Tax=Yersinia ru... 149 3e-35 UniRef50_Q2NRZ4 Putative uncharacterized protein n=1 Tax=Sodalis... 149 3e-35 UniRef50_Q9T1R1 Probable tail fiber assembly protein n=8 Tax=roo... 147 9e-35 UniRef50_Q8FJG5 Putative uncharacterized protein yfdK n=1 Tax=Es... 146 2e-34 UniRef50_C4SF02 Tail assembly chaperone gp38 n=1 Tax=Yersinia mo... 145 5e-34 UniRef50_O22005 Probable tail fiber assembly protein n=4 Tax=roo... 144 1e-33 UniRef50_B1JPG7 Tail assembly chaperone gp38 n=1 Tax=Yersinia ps... 143 1e-33 UniRef50_D2TR89 Putative phage tail fibre assembly protein n=1 T... 139 2e-32 UniRef50_Q83M74 Putative phage tail fibre protein n=1 Tax=Shigel... 139 2e-32 UniRef50_B7UGJ5 Predicted tail fiber assembly protein n=1 Tax=Es... 137 1e-31 UniRef50_B1JS57 Tail assembly chaperone gp38 n=3 Tax=Yersinia Re... 127 1e-28 UniRef50_Q2NWF3 Putative uncharacterized protein n=1 Tax=Sodalis... 125 3e-28 UniRef50_B4TAQ6 Tail fiber assembly protein n=8 Tax=Salmonella e... 124 1e-27 UniRef50_UPI0001C3422A phage tail fiber assembly protein n=1 Tax... 122 4e-27 UniRef50_B1JGV6 Tail assembly chaperone gp38 n=2 Tax=Yersinia ps... 122 4e-27 UniRef50_D2TYN1 Phage tail assembly protein n=1 Tax=Arsenophonus... 122 4e-27 UniRef50_P09154 Uncharacterized protein ymfS n=2 Tax=Escherichia... 120 1e-26 UniRef50_B2K2J2 Putative uncharacterized protein n=1 Tax=Yersini... 114 7e-25 UniRef50_C9XYE1 Tail fiber assembly protein homolog from lambdoi... 107 1e-22 UniRef50_A7ZYE1 Putative tail fiber assembly protein n=1 Tax=Esc... 103 1e-21 UniRef50_B7M8C3 Putative phage tail fiber assembly protein n=1 T... 103 2e-21 UniRef50_Q31HT5 Putative uncharacterized protein n=1 Tax=Thiomic... 93 3e-18 UniRef50_B7NSC3 Putative tail fiber assembly protein (Possibly p... 89 3e-17 UniRef50_D0ZBI3 Phage tail assembly chaperone gp38 n=1 Tax=Edwar... 89 3e-17 UniRef50_C4UND1 Conserved hypothetical phage tail fiber protein ... 88 9e-17 UniRef50_B2K1B5 Putative uncharacterized protein n=6 Tax=Yersini... 85 7e-16 UniRef50_C4TT86 Conserved hypothetical phage tail fiber protein ... 83 2e-15 UniRef50_Q47427 Tail fiber assembly protein homolog n=47 Tax=roo... 82 4e-15 UniRef50_C6CP80 Tail assembly chaperone gp38 n=2 Tax=Dickeya Rep... 81 8e-15 UniRef50_B7UG05 Predicted tail fiber assembly protein n=1 Tax=Es... 80 2e-14 UniRef50_P77326 Putative tail fiber assembly protein homolog fro... 76 5e-13 UniRef50_A4TJD5 Phage tail fiber assembly protein n=22 Tax=Yersi... 74 1e-12 UniRef50_C4U6G1 Phage tail assembly chaperone gp38 n=2 Tax=Enter... 68 8e-11 Sequences not found previously or not previously below threshold: UniRef50_B6VLW5 Tail fiber assembly protein homolog from lambdoi... 96 5e-19 UniRef50_B7NA68 Putative uncharacterized protein n=1 Tax=Escheri... 94 1e-18 UniRef50_B4TRG5 Fels-2 prophage Tfa n=22 Tax=Enterobacteriaceae ... 93 2e-18 UniRef50_Q2NV40 Hypothetical phage protein n=2 Tax=Enterobacteri... 87 2e-16 UniRef50_P03740 Tail fiber assembly protein n=117 Tax=root RepID... 87 2e-16 UniRef50_B2Q1Q3 Putative uncharacterized protein n=1 Tax=Provide... 84 2e-15 UniRef50_C7BTY5 Hypothetical phage protein n=1 Tax=Photorhabdus ... 82 6e-15 UniRef50_B6XAD7 Putative uncharacterized protein n=2 Tax=Provide... 80 3e-14 UniRef50_UPI000197C594 tail assembly chaperone gp38 n=1 Tax=Prov... 80 3e-14 UniRef50_A8GLQ6 Tail assembly chaperone gp38 n=2 Tax=root RepID=... 79 5e-14 UniRef50_A4WEL2 Phage tail assembly chaperone gp38 n=1 Tax=Enter... 77 1e-13 UniRef50_Q7N2R7 Similar to phage tail fiber assembly protein n=1... 77 2e-13 UniRef50_A5A5F3 Tail fiber assembly n=4 Tax=Pseudomonas aerugino... 76 4e-13 UniRef50_C7BSP4 Putative tail fiber protein of prophage cp-933x ... 75 7e-13 UniRef50_O68721 Lambda tail fiber assembly protein G n=32 Tax=En... 74 1e-12 UniRef50_B4T1N8 Tail assembly chaperone gp38 n=3 Tax=Salmonella ... 74 1e-12 UniRef50_A4JWL9 Putative uncharacterized protein n=1 Tax=Burkhol... 74 1e-12 UniRef50_C4SNQ0 Putative uncharacterized protein n=4 Tax=Yersini... 74 1e-12 UniRef50_B2VJG1 Tail fiber assembly protein n=1 Tax=Erwinia tasm... 74 2e-12 UniRef50_P40784 Tail fiber assembly protein homolog from lambdoi... 73 3e-12 UniRef50_D2TR90 Putative phage tail fibre assembly protein n=1 T... 72 7e-12 UniRef50_A8GA30 Tail assembly chaperone gp38 n=2 Tax=Enterobacte... 71 9e-12 UniRef50_D0FT42 Phage tail assembly chaperone n=2 Tax=Erwinia py... 70 2e-11 UniRef50_D2U2G7 Phage tail fiber assembly protein n=1 Tax=Arseno... 70 2e-11 UniRef50_A7MLN9 Putative uncharacterized protein n=1 Tax=Cronoba... 69 3e-11 UniRef50_D1P8C4 Tail fiber assembly protein n=1 Tax=Providencia ... 69 7e-11 UniRef50_C6C6Z1 Tail assembly chaperone gp38 n=3 Tax=Dickeya Rep... 67 2e-10 UniRef50_Q7N1H8 Similarities with lambda tail fiber assembly pro... 67 2e-10 UniRef50_A1JMQ0 Phage tail fiber assembly protein n=5 Tax=Yersin... 66 3e-10 UniRef50_C6C5D3 Tail assembly chaperone gp38 n=1 Tax=Dickeya dad... 66 3e-10 UniRef50_C5AKX8 Putative uncharacterized protein n=1 Tax=Burkhol... 65 8e-10 UniRef50_Q7P0Y3 Probable tail fiber assembly protein n=1 Tax=Chr... 64 1e-09 UniRef50_C6CP83 Tail assembly chaperone gp38 n=1 Tax=Dickeya zea... 64 2e-09 UniRef50_Q7NAA2 Complete genome; segment 1/17 n=1 Tax=Photorhabd... 63 3e-09 UniRef50_B3G0V8 Putative uncharacterized protein n=1 Tax=Pseudom... 62 4e-09 UniRef50_Q3ZL13 Tail fiber assembly protein n=1 Tax=Escherichia ... 62 7e-09 UniRef50_Q32IA2 Hypothetical prophage protein n=1 Tax=Shigella d... 61 1e-08 UniRef50_Q7Y3Y8 Tail fiber assembly protein n=4 Tax=root RepID=Q... 61 1e-08 UniRef50_C4K4X4 Phage tail assembly chaperone n=2 Tax=Candidatus... 61 1e-08 UniRef50_A9DEL3 Tail fiber related protein n=1 Tax=Yersinia phag... 61 1e-08 UniRef50_A9I964 Putative phage tail fibre protein n=1 Tax=Bordet... 60 2e-08 UniRef50_B3HH41 Tail assembly chaperone gp38 n=3 Tax=Enterobacte... 58 8e-08 UniRef50_Q9B026 Probable tail fiber assembly protein n=1 Tax=Pha... 58 9e-08 UniRef50_C4KQ09 Putative uncharacterized protein n=9 Tax=Burkhol... 58 1e-07 UniRef50_C1D954 HsdM n=1 Tax=Laribacter hongkongensis HLHK9 RepI... 57 1e-07 UniRef50_Q4ZMK8 Putative uncharacterized protein n=4 Tax=Pseudom... 57 1e-07 UniRef50_P26699 Probable tail fiber assembly protein n=56 Tax=ro... 57 2e-07 UniRef50_B4T266 Caudovirales tail fibre assembly protein n=16 Ta... 57 2e-07 UniRef50_D0KLI7 Tail assembly chaperone gp38 n=2 Tax=Enterobacte... 56 3e-07 UniRef50_B4TML3 Caudovirales tail fibre assembly protein n=16 Ta... 56 3e-07 UniRef50_B4T2D9 Gp20 n=17 Tax=root RepID=B4T2D9_SALNS 55 5e-07 UniRef50_B1JPI1 Tail assembly chaperone gp38 n=3 Tax=Yersinia ps... 55 9e-07 UniRef50_D1UEY6 Putative uncharacterized protein n=2 Tax=Burkhol... 54 1e-06 UniRef50_B4TI69 Putative phage tail fiber assembly protein n=13 ... 54 2e-06 UniRef50_B5TK82 Conserved hypothetical phage protein n=1 Tax=Pse... 54 2e-06 UniRef50_B5S309 Tail fiber assembly protein homolog n=2 Tax=Rals... 53 3e-06 UniRef50_C6CP85 Putative phage tail fibre protein n=2 Tax=Dickey... 53 4e-06 UniRef50_C6DE09 Tail assembly chaperone gp38 n=5 Tax=Enterobacte... 52 4e-06 UniRef50_Q849T8 Eag0005 n=3 Tax=Haemophilus influenzae RepID=Q84... 52 5e-06 UniRef50_Q3KH46 Putative phage related protein n=2 Tax=root RepI... 52 7e-06 UniRef50_A7ZL71 Putative uncharacterized protein n=1 Tax=Escheri... 52 8e-06 UniRef50_Q1I688 Putative phage protein n=1 Tax=Pseudomonas entom... 51 1e-05 UniRef50_A4SL83 Phage tail fiber assembly protein n=1 Tax=Aeromo... 50 3e-05 UniRef50_B3I4G5 Tail fiber assembly protein n=7 Tax=Escherichia ... 50 3e-05 UniRef50_B6Z9I1 Putative phage tail fiber assembly protein n=1 T... 49 4e-05 UniRef50_Q9ZXK5 Orf21 n=2 Tax=root RepID=Q9ZXK5_9CAUD 49 4e-05 UniRef50_B1JB16 Putative uncharacterized protein n=1 Tax=Pseudom... 49 5e-05 UniRef50_Q1I679 Putative phage tail fiber assembly protein n=1 T... 49 5e-05 UniRef50_B3RGH1 Putative tail fiber assembly protein n=1 Tax=Esc... 48 1e-04 UniRef50_C0DSG5 Putative uncharacterized protein n=1 Tax=Eikenel... 48 1e-04 UniRef50_Q2S7G7 Putative uncharacterized protein n=1 Tax=Hahella... 48 1e-04 UniRef50_B0VK51 Putative uncharacterized protein 51 n=1 Tax=Azos... 47 2e-04 UniRef50_B2TWU3 Tail fiber assembly protein n=20 Tax=root RepID=... 47 2e-04 UniRef50_C4F418 Putative uncharacterized protein n=3 Tax=Haemoph... 47 2e-04 UniRef50_Q3KH77 Hypothetical phage related protein n=1 Tax=Pseud... 47 2e-04 UniRef50_B1JM08 Putative uncharacterized protein n=1 Tax=Yersini... 47 2e-04 UniRef50_Q3KH44 Putative phage tail assembly protein n=1 Tax=Pse... 47 2e-04 UniRef50_Q31Z52 Putative tail fiber assembly protein n=1 Tax=Shi... 47 3e-04 UniRef50_C8UR23 Putative uncharacterized protein n=1 Tax=Escheri... 45 5e-04 UniRef50_D1P3V4 Bacteriophage tail fiber assembly protein n=4 Ta... 45 7e-04 UniRef50_Q30VV8 Putative uncharacterized protein n=1 Tax=Desulfo... 45 0.001 UniRef50_Q9KW02 Tail fiber assembly protein n=2 Tax=Pseudomonas ... 44 0.001 UniRef50_C9R438 Putative tail fiber assembly protein n=3 Tax=roo... 44 0.002 UniRef50_C5BH15 Putative Tail fiber assembly protein-like protei... 44 0.002 UniRef50_UPI00016A4B8D hypothetical protein BthaT_33832 n=1 Tax=... 43 0.003 UniRef50_O52622 Putative uncharacterized protein n=1 Tax=Salmone... 43 0.003 UniRef50_Q87Y71 Tail fiber assembly domain protein n=1 Tax=Pseud... 43 0.004 UniRef50_Q126B3 Putative uncharacterized protein n=1 Tax=Polarom... 42 0.005 UniRef50_A5X9J5 Putative tail fiber assembly protein n=1 Tax=Aer... 42 0.008 UniRef50_B0USZ7 Putative uncharacterized protein n=5 Tax=Pasteur... 41 0.011 UniRef50_A9DEM0 Tail fiber assembly protein n=1 Tax=Yersinia pha... 39 0.039 UniRef50_Q7P173 Probable tail fiber assembly protein n=1 Tax=Chr... 39 0.045 UniRef50_Q72D32 Tail fiber assembly protein, putative n=5 Tax=De... 39 0.060 >UniRef50_P77656 Uncharacterized protein yfdK n=23 Tax=Enterobacteriaceae RepID=YFDK_ECOLI Length = 146 Score = 209 bits (532), Expect = 2e-53, Method: Composition-based stats. Identities = 146/146 (100%), Positives = 146/146 (100%) Query: 1 MNYIYSATTNSFYPLEMKEDYTQAGSWPDDAVEVDEQVYIEFSGLPPKGKIRIAGENGFP 60 MNYIYSATTNSFYPLEMKEDYTQAGSWPDDAVEVDEQVYIEFSGLPPKGKIRIAGENGFP Sbjct: 1 MNYIYSATTNSFYPLEMKEDYTQAGSWPDDAVEVDEQVYIEFSGLPPKGKIRIAGENGFP 60 Query: 61 AWSEIPPPTHEEQIAAAELKKQQLINQANDYMNSKQWAGKAAIGRLKGEELAQYNLWLDY 120 AWSEIPPPTHEEQIAAAELKKQQLINQANDYMNSKQWAGKAAIGRLKGEELAQYNLWLDY Sbjct: 61 AWSEIPPPTHEEQIAAAELKKQQLINQANDYMNSKQWAGKAAIGRLKGEELAQYNLWLDY 120 Query: 121 LDALELVDTSSAPDIEWPTPPAVQAR 146 LDALELVDTSSAPDIEWPTPPAVQAR Sbjct: 121 LDALELVDTSSAPDIEWPTPPAVQAR 146 >UniRef50_Q8KT34 Tail fiber assembly protein n=31 Tax=Enterobacteriaceae RepID=Q8KT34_ECOLX Length = 145 Score = 173 bits (438), Expect = 2e-42, Method: Composition-based stats. Identities = 96/143 (67%), Positives = 109/143 (76%), Gaps = 1/143 (0%) Query: 4 IYSATTNSFYPLEMKEDYTQAGSWPDDAVEVDEQVYIEF-SGLPPKGKIRIAGENGFPAW 62 YS + N F +K+DY A SWPDDA+ V + VY EF PP KIR+AG+NG P W Sbjct: 2 FYSPSLNIFVNPALKDDYINANSWPDDALAVSDDVYNEFAINTPPYDKIRVAGKNGLPTW 61 Query: 63 SEIPPPTHEEQIAAAELKKQQLINQANDYMNSKQWAGKAAIGRLKGEELAQYNLWLDYLD 122 + IPPP+HEE I AE ++Q L+NQAN+YMNSKQW GKAAIGRLK EELA YNLWLDYLD Sbjct: 62 ALIPPPSHEELIQQAESERQLLLNQANEYMNSKQWPGKAAIGRLKDEELALYNLWLDYLD 121 Query: 123 ALELVDTSSAPDIEWPTPPAVQA 145 ALELVDTSSAPDIEWPTPP QA Sbjct: 122 ALELVDTSSAPDIEWPTPPVTQA 144 >UniRef50_C7BRV4 Phage tail fiber assembly protein n=5 Tax=Photorhabdus RepID=C7BRV4_PHOAA Length = 141 Score = 171 bits (432), Expect = 8e-42, Method: Composition-based stats. Identities = 74/141 (52%), Positives = 96/141 (68%), Gaps = 3/141 (2%) Query: 2 NYIYSATTNSFYPLEMKEDYTQAGSWPDDAVEVDEQVYIEFSG-LPPKGKIRIAGENGFP 60 Y YSATTN+FYP+E K+DY AGS+P+DAVEVD+ V+IEF+G +PPKGK RIAG+NG P Sbjct: 1 MYYYSATTNAFYPVEWKQDYINAGSFPNDAVEVDKSVFIEFAGSIPPKGKYRIAGKNGLP 60 Query: 61 AWSEIPPPTHEEQIAAAELKKQQLINQANDYMNSKQWAGKAAIGRLKGEELAQYNLWLDY 120 W++IPPPT EE I+ AE +K Q I+ AN+ + A + I EE+ W Y Sbjct: 61 EWADIPPPTKEELISIAESQKAQFISLANEKIMPLSDAEELDIA--TDEEMLLLKEWKKY 118 Query: 121 LDALELVDTSSAPDIEWPTPP 141 L VDTS+AP+I+WP P Sbjct: 119 RVMLNRVDTSNAPEIDWPITP 139 >UniRef50_Q66W74 Putative phage tail fiber assembly protein n=1 Tax=Klebsiella pneumoniae RepID=Q66W74_KLEPN Length = 142 Score = 166 bits (421), Expect = 1e-40, Method: Composition-based stats. Identities = 72/143 (50%), Positives = 97/143 (67%), Gaps = 1/143 (0%) Query: 1 MNYIYSATTNSFYPLEMKEDYTQAGSWPDDAVEVDEQVYIEFSGLPPKGKIRIAGENGFP 60 M I+S N F+P+ +K DY AGSWP D ++V VY+EF+ PP+GK+R +N P Sbjct: 1 MEIIFSPGQNKFFPVPLKTDYENAGSWPTDGIDVGYDVYLEFTANPPEGKVRGVVDN-MP 59 Query: 61 AWSEIPPPTHEEQIAAAELKKQQLINQANDYMNSKQWAGKAAIGRLKGEELAQYNLWLDY 120 AW + PT E+ + A +K+++LI +AN+Y+ KQWAGKA++GRL +E AQYN WLDY Sbjct: 60 AWVDKSSPTQEQLVTQAAVKQKRLITEANEYIGLKQWAGKASLGRLSDDERAQYNAWLDY 119 Query: 121 LDALELVDTSSAPDIEWPTPPAV 143 LD LE V APDI WPTPP + Sbjct: 120 LDELEAVKPEDAPDIIWPTPPVM 142 >UniRef50_C7BRK2 Similar to probable tail fiber assembly protein n=1 Tax=Photorhabdus asymbiotica subsp. asymbiotica ATCC 43949 RepID=C7BRK2_PHOAA Length = 144 Score = 166 bits (420), Expect = 2e-40, Method: Composition-based stats. Identities = 61/140 (43%), Positives = 85/140 (60%), Gaps = 2/140 (1%) Query: 2 NYIYSATTNSFYPLEMKEDYTQAGSWPDDAVEVDEQVYIEFSGLPPKGKIRIAGENGFPA 61 NY++SA +FYP+ +++DY AGSWP+D + V + ++ EFSG+PP GKI +GE+ P Sbjct: 5 NYVFSALNKAFYPISLQQDYIAAGSWPNDPLPVTDDIFNEFSGIPPAGKILSSGEDALPC 64 Query: 62 WSEIPPPTHEEQIAAAELKKQQLINQANDYMNSKQWAGKAAIGRLKGEELAQYNLWLDYL 121 W +IPPPT EE I AE +K Q I+ AN+ + A + I EE+ W Y Sbjct: 65 WEDIPPPTKEELIYIAENQKAQFISLANEKITPLSDAEELDIA--TDEEMLLLKEWKKYR 122 Query: 122 DALELVDTSSAPDIEWPTPP 141 L VDTS+AP I+WP P Sbjct: 123 VMLNRVDTSNAPKIDWPITP 142 >UniRef50_C7BRK4 Tail fiber assembly protein n=7 Tax=Enterobacteriaceae RepID=C7BRK4_PHOAA Length = 144 Score = 166 bits (419), Expect = 3e-40, Method: Composition-based stats. Identities = 59/140 (42%), Positives = 87/140 (62%), Gaps = 2/140 (1%) Query: 2 NYIYSATTNSFYPLEMKEDYTQAGSWPDDAVEVDEQVYIEFSGLPPKGKIRIAGENGFPA 61 NY++SA +FYPL +++DY +A SWP+D + V + ++ +FSG+PP GKI +GE+G P Sbjct: 5 NYVFSALNKAFYPLSLQQDYIEADSWPNDPISVTDDIFYKFSGMPPIGKILSSGEDGLPC 64 Query: 62 WSEIPPPTHEEQIAAAELKKQQLINQANDYMNSKQWAGKAAIGRLKGEELAQYNLWLDYL 121 W +IPPPT EE I+ AE ++ + I AN+ + A + I EEL W Y Sbjct: 65 WEDIPPPTKEELISIAEAQRSKFIFLANEKITPLADAVELDIA--TNEELLSLKAWKKYR 122 Query: 122 DALELVDTSSAPDIEWPTPP 141 L +DTS+AP+I+WP P Sbjct: 123 VMLNRIDTSTAPEIDWPIAP 142 >UniRef50_Q2NQF0 Hypothetical phage protein n=1 Tax=Sodalis glossinidius str. 'morsitans' RepID=Q2NQF0_SODGM Length = 173 Score = 161 bits (407), Expect = 6e-39, Method: Composition-based stats. Identities = 54/133 (40%), Positives = 75/133 (56%), Gaps = 3/133 (2%) Query: 2 NYIYSATTNSFYPLEMKEDYTQAGSWPDDAVEVDEQVYIEFSGLPPKGKIRIAGENGFPA 61 Y++S TT +FYPL K+ Y AGSWPDD VEV E ++++F PP GK+R NG+P Sbjct: 11 RYVFSGTTGAFYPLSRKQGYVDAGSWPDDGVEVKEDIFMKF-QNPPPGKLRGGDANGYPC 69 Query: 62 WSEIPPPTHEEQIAAAELKKQQLINQANDYMNSKQWAGKAAIGRLKGEELAQYNLWLDYL 121 W + PPPT E++ +A L K+ +++A + Q A +G E A W Y Sbjct: 70 WVDTPPPTLEDERRSAALTKKSQLDEAGRIIGPLQDA--VDLGMTTNTEKASLLTWKKYR 127 Query: 122 DALELVDTSSAPD 134 L VD S+APD Sbjct: 128 MLLNRVDISTAPD 140 >UniRef50_A4W725 Phage tail assembly chaperone gp38 n=1 Tax=Enterobacter sp. 638 RepID=A4W725_ENT38 Length = 142 Score = 159 bits (403), Expect = 2e-38, Method: Composition-based stats. Identities = 77/141 (54%), Positives = 95/141 (67%), Gaps = 2/141 (1%) Query: 2 NYIYSATTNSFYPLEMKEDYTQAGSWPDDAVEVDEQVYIEFSGLPPKGKIRIAGENGFPA 61 YI+S + FYP E + + G WP D VEV + + P+GK +G+PA Sbjct: 3 KYIWSPSLAGFYPTEEQSIFEGLGGWPTDGVEVSASAHDALFPI-PEGKCIGTV-DGYPA 60 Query: 62 WSEIPPPTHEEQIAAAELKKQQLINQANDYMNSKQWAGKAAIGRLKGEELAQYNLWLDYL 121 W ++PPPTHEE +A AE++KQ I+ AN Y+NSKQW GKAA+GRLK E AQYNLWLDYL Sbjct: 61 WIDLPPPTHEEMVAQAEIEKQSRIDAANAYINSKQWPGKAAMGRLKDTEKAQYNLWLDYL 120 Query: 122 DALELVDTSSAPDIEWPTPPA 142 D LE VDTS+APDI WP PPA Sbjct: 121 DELEAVDTSTAPDITWPEPPA 141 >UniRef50_C4V089 Bacteriophage tail fiber assembly protein n=1 Tax=Yersinia rohdei ATCC 43380 RepID=C4V089_YERRO Length = 139 Score = 159 bits (402), Expect = 3e-38, Method: Composition-based stats. Identities = 60/142 (42%), Positives = 79/142 (55%), Gaps = 3/142 (2%) Query: 1 MNYIYSATTNSFYPLEMKEDYTQAGSWPDDAVEVDEQVYIEFSGLPPKGKIRIAGENGFP 60 M YSA NSFYP E++E Y AGSWPDDAVEV +++ EF P GK R A +G P Sbjct: 1 MKVFYSAIDNSFYPDELREQYVTAGSWPDDAVEVSNKLFQEFI-TAPAGKERKADTDGMP 59 Query: 61 AWSEIPPPTHEEQIAAAELKKQQLINQANDYMNSKQWAGKAAIGRLKGEELAQYNLWLDY 120 W ++ PP+ E IA AE KK +L++QAN + Q A + +E+ W Y Sbjct: 60 RWVDVQPPSEIELIAQAEYKKTELMSQANSEIAPLQDA--VDLNMANADEVTALQTWKKY 117 Query: 121 LDALELVDTSSAPDIEWPTPPA 142 L VD +AP+I+WP P Sbjct: 118 RVLLNRVDIDAAPEIDWPVAPE 139 >UniRef50_B3XF69 Phage tail assembly chaperone gp38 n=2 Tax=Escherichia coli 101-1 RepID=B3XF69_ECOLX Length = 142 Score = 159 bits (401), Expect = 3e-38, Method: Composition-based stats. Identities = 65/146 (44%), Positives = 87/146 (59%), Gaps = 5/146 (3%) Query: 2 NYIYSATTNSFYPLEMKEDYTQAGSWPDDAVEVDEQVYIE-FSGLPPKGKIRIAGENGFP 60 Y++SA N F+P+ KE + +G WPDD V V E+ + + F +PP+ +I NG P Sbjct: 1 MYVWSAKANGFFPISEKEKFEASGLWPDDGVIVSEEEHKKLFMDIPPRKQI--GTLNGKP 58 Query: 61 AWSEIPPPTHEEQIAAAELKKQQLINQANDYMNSKQWAGKAAIGRLKGEELAQYNLWLDY 120 A +IP PT +E IA AE+KK QL +A+ ++ +Q A A I EE + W Y Sbjct: 59 ALIDIPQPTKKELIAIAEVKKSQLREKADSEISWRQDAVDADIA--TDEETSTLTEWKKY 116 Query: 121 LDALELVDTSSAPDIEWPTPPAVQAR 146 L VDTS+APDIEWPTPPAV AR Sbjct: 117 RVLLMRVDTSTAPDIEWPTPPAVHAR 142 >UniRef50_UPI0001826513 hypothetical protein EcanA3_06430 n=1 Tax=Enterobacter cancerogenus ATCC 35316 RepID=UPI0001826513 Length = 148 Score = 157 bits (396), Expect = 1e-37, Method: Composition-based stats. Identities = 54/140 (38%), Positives = 71/140 (50%), Gaps = 2/140 (1%) Query: 2 NYIYSATTNSFYPLEMKEDYTQAGSWPDDAVEVDEQVYIEFSGLPPKGKIRIAGENGFPA 61 Y YSA TN+FY + DY QAGS PDD E+ Q Y GK+ E+G P Sbjct: 10 TYFYSAETNAFYVSALMSDYDQAGSLPDDISEISNQWYEYLISGQATGKVITPDEHGKPV 69 Query: 62 WSEIPPPTHEEQIAAAELKKQQLINQANDYMNSKQWAGKAAIGRLKGEELAQYNLWLDYL 121 SE PPT +E AE +K +LI +A + + Q A +G +EL+ + Y Sbjct: 70 LSEPEPPTPQELREIAEGEKSRLIREAGEAIAVLQDAD--ELGMATDDELSALSRLKRYR 127 Query: 122 DALELVDTSSAPDIEWPTPP 141 L +D S+APDIEWP P Sbjct: 128 VILNRLDISTAPDIEWPEKP 147 >UniRef50_A4W7Q4 Phage tail assembly chaperone gp38 n=1 Tax=Enterobacter sp. 638 RepID=A4W7Q4_ENT38 Length = 140 Score = 153 bits (387), Expect = 1e-36, Method: Composition-based stats. Identities = 51/142 (35%), Positives = 74/142 (52%), Gaps = 2/142 (1%) Query: 1 MNYIYSATTNSFYPLEMKEDYTQAGSWPDDAVEVDEQVYIEFSGLPPKGKIRIAGENGFP 60 M Y YSA TN+FYP + ++Y + G++PDDAV V E + ++S P GK+R+A + G P Sbjct: 1 MEYYYSAKTNAFYPDILIDEYKKHGTFPDDAVLVTEACFNQYSADPEPGKMRVADKKGMP 60 Query: 61 AWSEIPPPTHEEQIAAAELKKQQLINQANDYMNSKQWAGKAAIGRLKGEELAQYNLWLDY 120 +W + P PT E + A +K L +QA+ + + A G + E W Y Sbjct: 61 SWGDQPEPTREMMVYQASEQKNALRDQADKIIAPLKDAK--EYGIITEVEDLVLKEWAIY 118 Query: 121 LDALELVDTSSAPDIEWPTPPA 142 L VD + PDI WP P Sbjct: 119 RYNLSKVDVETYPDINWPVKPV 140 >UniRef50_C4UND0 Tail assembly chaperone gp38 n=2 Tax=Yersinia ruckeri RepID=C4UND0_YERRU Length = 140 Score = 149 bits (376), Expect = 3e-35, Method: Composition-based stats. Identities = 55/141 (39%), Positives = 76/141 (53%), Gaps = 3/141 (2%) Query: 2 NYIYSATTNSFYPLEMKEDYTQAGSWPDDAVEVDEQVYIEFSGLPPKGKIRIAGENGFPA 61 NY++SA F + Y AG D +++D+ +YIEF+G PP GK R NG PA Sbjct: 3 NYVWSAQNRVFLAEALLPSYDDAGWNLSDIIKIDDSIYIEFNGNPPVGKQR-GVINGMPA 61 Query: 62 WSEIPPPTHEEQIAAAELKKQQLINQANDYMNSKQWAGKAAIGRLKGEELAQYNLWLDYL 121 W ++PPPT E I++A +K +L + A+ + +Q A G E+A W Y Sbjct: 62 WVDLPPPTSGELISSANAEKSRLKSIADSGIEWRQDAVN--DGSASDREIADLAAWRKYR 119 Query: 122 DALELVDTSSAPDIEWPTPPA 142 AL +DTS APDIEWP P Sbjct: 120 VALMRIDTSKAPDIEWPLKPE 140 >UniRef50_Q2NRZ4 Putative uncharacterized protein n=1 Tax=Sodalis glossinidius str. 'morsitans' RepID=Q2NRZ4_SODGM Length = 142 Score = 149 bits (376), Expect = 3e-35, Method: Composition-based stats. Identities = 50/142 (35%), Positives = 68/142 (47%), Gaps = 3/142 (2%) Query: 2 NYIYSATTNSFYPLEMKEDYTQAGS-WPDDAVEVDEQVYIEFSGLPPKGKIRIAGENGFP 60 +Y YSATTN FY + MK Y + + WP+DAV V ++Y K KI A +NG P Sbjct: 3 DYFYSATTNGFYHISMKSIYEDSDNGWPEDAVPVSNELYQALLEGQSKNKIIKANKNGMP 62 Query: 61 AWSEIPPPTHEEQIAAAELKKQQLINQANDYMNSKQWAGKAAIGRLKGEELAQYNLWLDY 120 + P PT E+ ++ A+ KK L+ QA + Q A + E +W Y Sbjct: 63 VLGDRPAPTEEQNLSMAQSKKSMLLEQATGKIIPLQDA--VDLNMATQVEETTLLMWKKY 120 Query: 121 LDALELVDTSSAPDIEWPTPPA 142 L +D S A DI WP P Sbjct: 121 RVMLTRLDVSKATDIAWPQCPE 142 >UniRef50_Q9T1R1 Probable tail fiber assembly protein n=8 Tax=root RepID=TFA_BPAPS Length = 155 Score = 147 bits (371), Expect = 9e-35, Method: Composition-based stats. Identities = 52/142 (36%), Positives = 69/142 (48%), Gaps = 3/142 (2%) Query: 2 NYIYSATTNSFYPLEMKEDYTQAGSWPDDAVEVDEQVYIEFSGLP-PKGKIRIAGENGFP 60 Y + +++ MK+DY +AGSW D A V VY EF+ P P GK E G P Sbjct: 16 TYYFGQRKLAWFAGSMKKDYIEAGSWDDKAKAVPYSVYREFALNPAPIGKTLGISEKGDP 75 Query: 61 AWSEIPPPTHEEQIAAAELKKQQLINQANDYMNSKQWAGKAAIGRLKGEELAQYNLWLDY 120 W +IPP T + I AE KK L+ A + ++ Q A EE + W Y Sbjct: 76 IWVDIPPKTKHQLITEAEDKKSGLMQGAREVISPLQDAIDLE--MATQEETQKLTAWKRY 133 Query: 121 LDALELVDTSSAPDIEWPTPPA 142 L +DTS+APDI+WP P Sbjct: 134 RVLLNRLDTSNAPDIDWPKKPE 155 >UniRef50_Q8FJG5 Putative uncharacterized protein yfdK n=1 Tax=Escherichia coli O6 RepID=Q8FJG5_ECOL6 Length = 158 Score = 146 bits (369), Expect = 2e-34, Method: Composition-based stats. Identities = 59/139 (42%), Positives = 86/139 (61%) Query: 3 YIYSATTNSFYPLEMKEDYTQAGSWPDDAVEVDEQVYIEFSGLPPKGKIRIAGENGFPAW 62 +I+ F +K +Y + WP + V++ + EF PP+GKI A +NG PAW Sbjct: 15 FIWDKVNARFMAYILKNEYERNRMWPKEGVDISNETACEFMKQPPEGKILGADDNGMPAW 74 Query: 63 SEIPPPTHEEQIAAAELKKQQLINQANDYMNSKQWAGKAAIGRLKGEELAQYNLWLDYLD 122 ++PP T+ E +A A+ +KQ I +A +Y+N+KQW GKA +GRL EL YN+WLDY++ Sbjct: 75 IDMPPLTYTELVAKAKTEKQARIIEAVNYINNKQWQGKALLGRLNDTELKMYNIWLDYIE 134 Query: 123 ALELVDTSSAPDIEWPTPP 141 ALE +D S A D +PT P Sbjct: 135 ALEAIDPSKASDTAFPTKP 153 >UniRef50_C4SF02 Tail assembly chaperone gp38 n=1 Tax=Yersinia mollaretii ATCC 43969 RepID=C4SF02_YERMO Length = 152 Score = 145 bits (365), Expect = 5e-34, Method: Composition-based stats. Identities = 50/141 (35%), Positives = 69/141 (48%), Gaps = 3/141 (2%) Query: 1 MNYIYSATTNSFYPLEMKEDYTQAGSWPDDAVEVDEQVYIEFSGLPPKGKIRIAGENGFP 60 M +S T F L+M ED + + S+ D D + + PP GK + G P Sbjct: 14 MKIFFSPTILGFRTLDMVEDGSYSDSYGDFVELSDSERLNYWKQSPPCGKT-LGVATGRP 72 Query: 61 AWSEIPPPTHEEQIAAAELKKQQLINQANDYMNSKQWAGKAAIGRLKGEELAQYNLWLDY 120 AW ++PPPTHEE +A+A KK QL A+ + +Q A G +E+ W Y Sbjct: 73 AWVDLPPPTHEELVASAIAKKNQLKAAADSEIEWRQDA--VDDGSASEKEIVDLAAWRKY 130 Query: 121 LDALELVDTSSAPDIEWPTPP 141 AL +DTS AP +EWP P Sbjct: 131 RLALMRIDTSKAPGVEWPESP 151 >UniRef50_O22005 Probable tail fiber assembly protein n=4 Tax=root RepID=TFA_BPSF5 Length = 167 Score = 144 bits (362), Expect = 1e-33, Method: Composition-based stats. Identities = 45/142 (31%), Positives = 59/142 (41%), Gaps = 10/142 (7%) Query: 1 MNYIYSATTNSFYPLEMKEDYTQAGSWPDDAVEVDEQVYIEFSGLPPKGKIRIAGENGFP 60 M+Y YSA+TN FY E + PDDAVE+ E + K+ GENG P Sbjct: 33 MSYFYSASTNGFYSTEF-----HGTNIPDDAVEISESEWETLINSQGVTKMITCGENGHP 87 Query: 61 AWSEIPPPTHEEQIAAAELKKQQLINQANDYMNSKQWAGKAAIGRLKGEELAQYNLWLDY 120 + P PT E + KK LI +A + + Q A +G +E W Y Sbjct: 88 VIVDRPSPTPERLALINDEKKSALIAEATNVIAPLQDA--VDLGMATDDETKLLLAWEKY 145 Query: 121 LDALELVDTSSAPDIEWPTPPA 142 L VD + EWP P Sbjct: 146 RVLLMRVDIKN---TEWPKKPE 164 >UniRef50_B1JPG7 Tail assembly chaperone gp38 n=1 Tax=Yersinia pseudotuberculosis YPIII RepID=B1JPG7_YERPY Length = 142 Score = 143 bits (361), Expect = 1e-33, Method: Composition-based stats. Identities = 48/143 (33%), Positives = 70/143 (48%), Gaps = 3/143 (2%) Query: 2 NYIYSATTNSFYPLEMKEDYTQA-GSWPDDAVEVDEQVYIEFSGLPPKGKIRIAGENGFP 60 YSAT N F P E + D T +WP DAV + ++ ++ + P G + +G P Sbjct: 1 MVYYSATLNGFIPAEWRFDGTYNINTWPGDAVLLSDKESDKYWKVTPAGGKVLGSVSGRP 60 Query: 61 AWSEIPPPTHEEQIAAAELKKQQLINQANDYMNSKQWAGKAAIGRLKGEELAQYNLWLDY 120 AW +IPP T +E I A K+ A+ ++ +Q A A +E+++ W Y Sbjct: 61 AWVDIPPITIDELIYCAVQNKRVRKEVADSEIDWRQDAVDAE--EASKKEISELAAWKKY 118 Query: 121 LDALELVDTSSAPDIEWPTPPAV 143 AL +D S APDI WP P V Sbjct: 119 RVALMRIDISKAPDINWPESPNV 141 >UniRef50_D2TR89 Putative phage tail fibre assembly protein n=1 Tax=Citrobacter rodentium ICC168 RepID=D2TR89_CITRO Length = 141 Score = 139 bits (351), Expect = 2e-32, Method: Composition-based stats. Identities = 66/145 (45%), Positives = 92/145 (63%), Gaps = 8/145 (5%) Query: 2 NYIYSATTNSFYPLEMKEDYTQAGSWPDDAVEVDEQVYIEFSGLP-PKGKIRIAGENGFP 60 +S FY + P +A+EV +Y EF+G+ P GK+ A ++G+P Sbjct: 3 KIYFSQDPVGFY-------IEGVSAVPSNAIEVSADIYNEFAGVAWPDGKVLGADDSGYP 55 Query: 61 AWSEIPPPTHEEQIAAAELKKQQLINQANDYMNSKQWAGKAAIGRLKGEELAQYNLWLDY 120 W + PPP+H+E IA AE +KQ+LI++ N ++N +QW K A+GRL +E AQ+N WLDY Sbjct: 56 TWIDAPPPSHDELIAQAEAEKQRLIDETNVWINGQQWPSKLALGRLSEDEKAQFNEWLDY 115 Query: 121 LDALELVDTSSAPDIEWPTPPAVQA 145 LDA+ VDTS+APDIEWPTPP A Sbjct: 116 LDAVSAVDTSTAPDIEWPTPPEQPA 140 >UniRef50_Q83M74 Putative phage tail fibre protein n=1 Tax=Shigella flexneri RepID=Q83M74_SHIFL Length = 144 Score = 139 bits (351), Expect = 2e-32, Method: Composition-based stats. Identities = 44/142 (30%), Positives = 59/142 (41%), Gaps = 10/142 (7%) Query: 1 MNYIYSATTNSFYPLEMKEDYTQAGSWPDDAVEVDEQVYIEFSGLPPKGKIRIAGENGFP 60 M+Y YSA+TN FY E + PDDAVE+ E + K+ GENG P Sbjct: 12 MSYFYSASTNGFYSTEF-----HGTNIPDDAVEISESEWKTLINAQSVTKMITCGENGHP 66 Query: 61 AWSEIPPPTHEEQIAAAELKKQQLINQANDYMNSKQWAGKAAIGRLKGEELAQYNLWLDY 120 + P PT E+ + KK LI +A + + Q A +G +E W Y Sbjct: 67 VIVDRPSPTPEQLALINDEKKSALIAEATNVIAPLQDA--VDLGMATDDETKLLLAWKKY 124 Query: 121 LDALELVDTSSAPDIEWPTPPA 142 L V+ EWP P Sbjct: 125 RVLLMRVNVVKP---EWPMHPN 143 >UniRef50_B7UGJ5 Predicted tail fiber assembly protein n=1 Tax=Escherichia coli O127:H6 str. E2348/69 RepID=B7UGJ5_ECO27 Length = 138 Score = 137 bits (345), Expect = 1e-31, Method: Composition-based stats. Identities = 58/145 (40%), Positives = 80/145 (55%), Gaps = 8/145 (5%) Query: 1 MNYIYSATTNSFYPLEMKEDYTQAGSWPDDAVEVDEQVYIEFSGLPPKGKIRIAGENGFP 60 MN Y SFYP +K+ Y AGSWP++ +VD++ ++G+ P+GK A +NG P Sbjct: 1 MNKFY---KGSFYPEALKDVYISAGSWPENGADVDDETMAIYTGVAPEGKTLGADKNGNP 57 Query: 61 AWSEIPPPTHEEQIAAAELKKQQLINQANDYMNSKQWAGKAAIGRLKGEELAQYNLWLDY 120 AW +IPP + E+QI AE K+ L + A+ + +Q A A I EE A + W Y Sbjct: 58 AWIDIPPLSAEQQIIQAEQKRTVLRSMADKEIVWRQDAFDAEIA--TAEETAALSEWKKY 115 Query: 121 LDALELVDTSSAPDIEWPTPPAVQA 145 L VDTS+ WPTPP QA Sbjct: 116 RVLLMRVDTSNP---VWPTPPGEQA 137 >UniRef50_B1JS57 Tail assembly chaperone gp38 n=3 Tax=Yersinia RepID=B1JS57_YERPY Length = 139 Score = 127 bits (318), Expect = 1e-28, Method: Composition-based stats. Identities = 40/143 (27%), Positives = 63/143 (44%), Gaps = 5/143 (3%) Query: 1 MNYIYSATTNSFYPLEMKEDYTQAGSWPDDAVEVDEQVY-IEFSGLPPKGKIRIAGENGF 59 M ++S +F P M D + + + D + V ++ + P GKI + +G Sbjct: 1 MKALFSPKLITFIPENMVVDGSYSHNITDSLIAVTDEELATYWRQNSPDGKI-LGVVDGR 59 Query: 60 PAWSEIPPPTHEEQIAAAELKKQQLINQANDYMNSKQWAGKAAIGRLKGEELAQYNLWLD 119 P W +P P HEE + + KK QL A+ ++ +Q A A +E++ W Sbjct: 60 PIWVNLPLPLHEELVLGSSTKKSQLKADADSEIDWRQDAVDAE--EANKKEISALAAWRK 117 Query: 120 YLDALELVDTSSAPDIEWPTPPA 142 Y AL +D S P I WP P Sbjct: 118 YRIALMRIDVSHMP-ITWPIKPE 139 >UniRef50_Q2NWF3 Putative uncharacterized protein n=1 Tax=Sodalis glossinidius str. 'morsitans' RepID=Q2NWF3_SODGM Length = 133 Score = 125 bits (315), Expect = 3e-28, Method: Composition-based stats. Identities = 43/114 (37%), Positives = 62/114 (54%), Gaps = 2/114 (1%) Query: 3 YIYSATTNSFYPLEMKEDYTQAGSWPDDAVEVDEQVYIEFSGLPPKGKIRIAGENGFPAW 62 Y +SA T SF+P+ M DY +AGS PDD V+VDE + +F PP GK R A G+PAW Sbjct: 22 YKFSARTGSFFPVSMLNDYIKAGSLPDDLVDVDETTFWQFCASPPSGKQRGANAQGYPAW 81 Query: 63 SEIPPPTHEEQIAAAELKKQQLINQANDYMNSKQWAGKAAIGRLKGEELAQYNL 116 ++PPPT EE + ++ K++L+++ M + A E A Sbjct: 82 IDVPPPTPEEARLSVDVTKRRLMDEVTRAMAPLEDAVDLD--MATDAEKAALLA 133 >UniRef50_B4TAQ6 Tail fiber assembly protein n=8 Tax=Salmonella enterica subsp. enterica RepID=B4TAQ6_SALHS Length = 121 Score = 124 bits (310), Expect = 1e-27, Method: Composition-based stats. Identities = 61/122 (50%), Positives = 83/122 (68%), Gaps = 2/122 (1%) Query: 26 SWPDDA-VEVDEQVYIEFSGLPPKGKIRIAGENGFPAWSEIPPPTHEEQIAAAELKKQQL 84 W +D DE+V + PPKG + + NG AW IPPPT ++ I+AA +K++ Sbjct: 1 MWSNDFYALTDEEVDKYYMKTPPKG-MYLGSSNGRIAWVCIPPPTQDDLISAANQEKKKR 59 Query: 85 INQANDYMNSKQWAGKAAIGRLKGEELAQYNLWLDYLDALELVDTSSAPDIEWPTPPAVQ 144 I+QAN++MNS++W GKAA+GRL G+ELAQYNLWLDYLDAL+ VDTS A +I P A+ Sbjct: 60 IDQANEHMNSRRWPGKAALGRLTGDELAQYNLWLDYLDALKAVDTSVAQNIACPIRKAIH 119 Query: 145 AR 146 + Sbjct: 120 IK 121 >UniRef50_UPI0001C3422A phage tail fiber assembly protein n=1 Tax=Enterobacter cancerogenus ATCC 35316 RepID=UPI0001C3422A Length = 124 Score = 122 bits (306), Expect = 4e-27, Method: Composition-based stats. Identities = 33/114 (28%), Positives = 48/114 (42%), Gaps = 2/114 (1%) Query: 28 PDDAVEVDEQVYIEFSGLPPKGKIRIAGENGFPAWSEIPPPTHEEQIAAAELKKQQLINQ 87 P D +++ + Y EF P + R W +I PP+ EE + A+ K QL++ Sbjct: 10 PSDLMKITDLEYEEFMVSPDRKTPRFNINRNCMEWVDIAPPSKEEAVQHADSLKAQLMDV 69 Query: 88 ANDYMNSKQWAGKAAIGRLKGEELAQYNLWLDYLDALELVDTSSAPDIEWPTPP 141 A + Q A +E W Y L +D + APDIEWP P Sbjct: 70 ATQAILPLQDAVDLD--MATDKETILLTEWKKYRVRLNRIDVNVAPDIEWPESP 121 >UniRef50_B1JGV6 Tail assembly chaperone gp38 n=2 Tax=Yersinia pseudotuberculosis YPIII RepID=B1JGV6_YERPY Length = 145 Score = 122 bits (306), Expect = 4e-27, Method: Composition-based stats. Identities = 51/150 (34%), Positives = 74/150 (49%), Gaps = 15/150 (10%) Query: 1 MNYIYSATTNSFYPLEMKEDYTQAG-SWPDDAVEVDEQVYIEFSG-LPPKGKIRIAGENG 58 M +SAT F P E + D T +WP DAV + + +EF P GK+ + Sbjct: 1 MMIYFSATIGGFIPGEWRVDGTYTDETWPTDAVLLTDIESVEFWKRTAPSGKM-LGSVKY 59 Query: 59 FPAWSEIPPPTHEEQ-------IAAAELKKQQLINQANDYMNSKQWAGKAAIGRLKGEEL 111 P W ++P PT E +A A+LKK +LI+ A D + + +A ++ Sbjct: 60 RPVWVDLPTPTAVEVASQKAGFVAQAKLKKSKLISDARDRIEILKDRIEAG-----QDKA 114 Query: 112 AQYNLWLDYLDALELVDTSSAPDIEWPTPP 141 A+ LW Y AL+ +D S+APDIEWP P Sbjct: 115 AELKLWKSYRIALDDIDVSAAPDIEWPVAP 144 >UniRef50_D2TYN1 Phage tail assembly protein n=1 Tax=Arsenophonus nasoniae RepID=D2TYN1_9ENTR Length = 140 Score = 122 bits (305), Expect = 4e-27, Method: Composition-based stats. Identities = 48/151 (31%), Positives = 79/151 (52%), Gaps = 22/151 (14%) Query: 2 NYIYSATTNSFYPLEMKEDYTQAGSWPDDAVEVDEQVYIEFSGLP-PKGKIRIAGENGFP 60 Y YS N FYP E+K+ Y +AGS+P D +EVD+ VY EF+ K+R++G++GFP Sbjct: 1 MYFYSPKENLFYPNELKDIYIEAGSFPSDVIEVDDAVYFEFTKYDLSDNKVRVSGKDGFP 60 Query: 61 AWSEIPPPTHEEQIAAAELKKQQLINQANDYMN---------SKQWAGKAAIGRLKGEEL 111 W EE+I+ K+QLI + +N ++ W + +G + + Sbjct: 61 KW-------EEEKIS-----KRQLIEDTKEKINFLLKEVKNVTQIWQTQLTLGIITDSDK 108 Query: 112 AQYNLWLDYLDALELVDTSSAPDIEWPTPPA 142 ++ W+ Y L+ +D + DI WP+ P+ Sbjct: 109 SKLTDWMIYAQKLQQIDLKNINDISWPSKPS 139 >UniRef50_P09154 Uncharacterized protein ymfS n=2 Tax=Escherichia coli RepID=YMFS_ECOLI Length = 137 Score = 120 bits (301), Expect = 1e-26, Method: Composition-based stats. Identities = 69/152 (45%), Positives = 88/152 (57%), Gaps = 21/152 (13%) Query: 1 MNYIYSATTNSFYPLEMKEDYTQAGSWPDDAVEVDEQVYIEFSGLPPKGKIRIAGENGFP 60 M T F+ G P D+ E+ + + +G G Sbjct: 1 MKIYCCLNTVGFF-------MDGCGVIPPDSKEITAEHWQSLLKSQAEG--------GVI 45 Query: 61 AWSEIPP------PTHEEQIAAAELKKQQLINQANDYMNSKQWAGKAAIGRLKGEELAQY 114 +S PP TH++++A A +KQ LI+ A D++NS+QW GKAA+GRLK +EL QY Sbjct: 46 DFSVFPPSIKEVIRTHDDEVADANFQKQMLISDATDFINSRQWQGKAALGRLKEDELKQY 105 Query: 115 NLWLDYLDALELVDTSSAPDIEWPTPPAVQAR 146 NLWLDYL+ALELVDTSSAPDIEWPTPPAVQAR Sbjct: 106 NLWLDYLEALELVDTSSAPDIEWPTPPAVQAR 137 >UniRef50_B2K2J2 Putative uncharacterized protein n=1 Tax=Yersinia pseudotuberculosis PB1/+ RepID=B2K2J2_YERPB Length = 149 Score = 114 bits (286), Expect = 7e-25, Method: Composition-based stats. Identities = 42/146 (28%), Positives = 66/146 (45%), Gaps = 13/146 (8%) Query: 5 YSATTNSFYPLEMKEDYTQAGSWPDDAVEVDEQVYIEFSGLPPKGKIRIAGENGFPAWSE 64 +S + FYP M +D T P D +++ + + P + G P W + Sbjct: 9 FSPANSMFYPQYMIDDGTFHADLPTDLIDITDAENTTYWRQMPPPGQVLGVIKGRPGWVD 68 Query: 65 IPPPT-------HEEQIAAAELKKQQLINQANDYMNSKQWAGKAAIGRLKGEELAQYNLW 117 +PPP+ A A+ KK +LI A+D ++ + + +G+ K +E LW Sbjct: 69 LPPPSAIDIAAKKAALTAQAKAKKTKLIGDASDEIDVLKD--RIELGQDKADE---LKLW 123 Query: 118 LDYLDALELVDTSSAPDIEWPTPPAV 143 Y AL+ +D S APDI WP P V Sbjct: 124 KSYRIALDDIDVS-APDINWPESPNV 148 >UniRef50_C9XYE1 Tail fiber assembly protein homolog from lambdoid prophage e14 n=1 Tax=Cronobacter turicensis RepID=C9XYE1_CROTZ Length = 200 Score = 107 bits (268), Expect = 1e-22, Method: Composition-based stats. Identities = 44/112 (39%), Positives = 60/112 (53%), Gaps = 5/112 (4%) Query: 36 EQVYIEFSGLPPKGKIRIAGENGFPAWSEIPPPTHEEQIAAAELK-----KQQLINQAND 90 E V I G P G R A F W+ T+E+ AA+++ K L AN+ Sbjct: 89 EPVKITLPGDYPAGTTRAAPATRFDVWNGKAWVTNEDARRAADVENAKAMKSSLRGMANE 148 Query: 91 YMNSKQWAGKAAIGRLKGEELAQYNLWLDYLDALELVDTSSAPDIEWPTPPA 142 ++ +QW + +GRL +E A + WLDYL+AL VDTS APDI+WP PA Sbjct: 149 IISQQQWPSRLTLGRLNEQEQAAFTAWLDYLEALAAVDTSRAPDIQWPQLPA 200 >UniRef50_A7ZYE1 Putative tail fiber assembly protein n=1 Tax=Escherichia coli HS RepID=A7ZYE1_ECOHS Length = 137 Score = 103 bits (257), Expect = 1e-21, Method: Composition-based stats. Identities = 43/142 (30%), Positives = 65/142 (45%), Gaps = 7/142 (4%) Query: 1 MNYIYSATTNSFYPLEMKEDYTQAGSWPDDAVEVDEQVYIEFSGLPPKGKIRIAGENGFP 60 MN Y+ N + D + PD Q I S + + I +G Sbjct: 1 MNASYAVIENGMVVNVIVWDGEAEFTVPD------NQQLINISDISEQPGIGWVYSDGGF 54 Query: 61 AWSEIPPPTHEEQIAAAELKKQQLINQANDYMNSKQWAGKAAIGRLKGEELAQYNLWLDY 120 +H+E +A AE KKQ L++ A ++ Q +A +L EE + N+ LDY Sbjct: 55 TAPPTQERSHDELVADAEQKKQSLLDAAMANISVIQLKLQAG-RKLTQEETTRLNVVLDY 113 Query: 121 LDALELVDTSSAPDIEWPTPPA 142 ++A+ +DTS+APDI WP PA Sbjct: 114 IEAVTAIDTSTAPDIIWPVFPA 135 >UniRef50_B7M8C3 Putative phage tail fiber assembly protein n=1 Tax=Escherichia coli IAI1 RepID=B7M8C3_ECO8A Length = 146 Score = 103 bits (257), Expect = 2e-21, Method: Composition-based stats. Identities = 46/142 (32%), Positives = 68/142 (47%), Gaps = 7/142 (4%) Query: 1 MNYIYSATTNSFYPLEMKEDYTQAGSWPDDAVEVDEQVYIEFSGLPPKGKIRIAGENGFP 60 MN Y+ N + D + PD Q I S + + I A +G Sbjct: 9 MNASYAVIENGMVMNVIAWDGEAEFTVPD------NQQLINISDISEQPGIGWAYSDGVF 62 Query: 61 AWSEIPPPTHEEQIAAAELKKQQLINQANDYMNSKQWAGKAAIGRLKGEELAQYNLWLDY 120 + P +H+EQ+A AE +KQ +I+ A ++ Q +A +L EE + N+ LDY Sbjct: 63 SAPLPPERSHDEQVADAEHQKQSMIDAAMVNISVIQLKLQAG-RKLTQEETTRLNVVLDY 121 Query: 121 LDALELVDTSSAPDIEWPTPPA 142 +DA+ DTS+APDIEWP P Sbjct: 122 IDAVTATDTSTAPDIEWPDEPC 143 >UniRef50_B6VLW5 Tail fiber assembly protein homolog from lambdoid prophage dlp12 n=4 Tax=Enterobacteriaceae RepID=B6VLW5_PHOAA Length = 150 Score = 95.5 bits (236), Expect = 5e-19, Method: Composition-based stats. Identities = 37/161 (22%), Positives = 52/161 (32%), Gaps = 32/161 (19%) Query: 2 NYIYSATTNSFYPLEMKEDYTQAGSWPDDAVEVDEQVYIEFSGLPPKGKIRIAGENGFPA 61 +S +FY KE VE+ + + E +G ++ + G+P Sbjct: 1 MVYFSRKECAFYNEAYKE-----------CVEITAEKHNELLAGQSRGLSIVSNKEGYPV 49 Query: 62 WSEIPP-------------------PTHEEQIAAAELKKQQLINQANDYMNSKQWAGKAA 102 E P EQ AE KKQQL+ + + Q A Sbjct: 50 LIERAPSVYHKYDGEKWIISESDKIKLRREQQQQAEHKKQQLMLTVSKQIAPLQDAVDLE 109 Query: 103 IGRLKGEELAQYNLWLDYLDALELVDTSSAPDIEWPTPPAV 143 EE + Y L VD + PDI WP P V Sbjct: 110 --MASDEEKSLLAALKKYRVLLNRVDVNLVPDIHWPEKPRV 148 >UniRef50_B7NA68 Putative uncharacterized protein n=1 Tax=Escherichia coli UMN026 RepID=B7NA68_ECOLU Length = 134 Score = 94.0 bits (232), Expect = 1e-18, Method: Composition-based stats. Identities = 31/96 (32%), Positives = 43/96 (44%), Gaps = 2/96 (2%) Query: 50 KIRIAGENGFPAWSEIPPPTHEEQIAAAELKKQQLINQANDYMNSKQWAGKAAIGRLKGE 109 I + +G P +H +A AEL+K L+ AN+ + Q A + Sbjct: 40 GIGWSYSDGVFTAPPPPERSHNALVAEAELQKSALLTVANNAIAPLQDAVDLE--MATDD 97 Query: 110 ELAQYNLWLDYLDALELVDTSSAPDIEWPTPPAVQA 145 E W Y L VDTS+AP+IEWPT P +A Sbjct: 98 EQTLLLAWKKYRVLLNRVDTSAAPEIEWPTQPGERA 133 >UniRef50_B4TRG5 Fels-2 prophage Tfa n=22 Tax=Enterobacteriaceae RepID=B4TRG5_SALSV Length = 135 Score = 93.2 bits (230), Expect = 2e-18, Method: Composition-based stats. Identities = 40/141 (28%), Positives = 61/141 (43%), Gaps = 10/141 (7%) Query: 2 NYIYSATTNSFYPLEMKEDYTQAGSWPDDAVEVDEQVYIEFSGLPPKGKIRIAGENGFPA 61 Y YS F+ + D T++ + PDD + + ++ Y E GK I G P Sbjct: 3 EYYYSFKEKGFF---WQPD-TESDNSPDDLIPLTDEYYRELMQGQVDGK-YIEHRKGGPV 57 Query: 62 WSEIPPPTHEEQIAAAELKKQQLINQANDYMNSKQWAGKAAIGRLKGEELAQYNLWLDYL 121 E T EE +A AE +K +L+ +A + A K I EE+ + W Y Sbjct: 58 LVEHREYTPEELVAQAEARKAELLAEAESVIAPLARAVKLKIA--TDEEIKRLEAWELYS 115 Query: 122 DALELVDTSSAPDIEWPTPPA 142 + VDT++ +WP PA Sbjct: 116 VMVNRVDTANP---DWPEKPA 133 >UniRef50_Q31HT5 Putative uncharacterized protein n=1 Tax=Thiomicrospira crunogena XCL-2 RepID=Q31HT5_THICR Length = 194 Score = 92.8 bits (229), Expect = 3e-18, Method: Composition-based stats. Identities = 33/94 (35%), Positives = 53/94 (56%), Gaps = 4/94 (4%) Query: 1 MNYIYSATTNSFYPLEMKEDYTQAGSWPDDAVEVDE-QVYIEFSGLPPKGKIRIAGENGF 59 M YSAT N+F+ +K DY Q SWP DA+++ + +V P+GK A NG Sbjct: 1 MTIHYSATKNAFFDDALKSDYEQFNSWPSDAIKMTDAEVSTYHGKQSPQGKQLGADANGR 60 Query: 60 PAWSEIPPPTHEEQIAAAELKKQQLINQANDYMN 93 P W ++PPP+ + AA K +Q+ ++A +++ Sbjct: 61 PIWVDLPPPSLGDANAA---KSKQINDEAQKFID 91 >UniRef50_B7NSC3 Putative tail fiber assembly protein (Possibly partial) n=2 Tax=Escherichia coli RepID=B7NSC3_ECO7I Length = 136 Score = 89.4 bits (220), Expect = 3e-17, Method: Composition-based stats. Identities = 27/93 (29%), Positives = 44/93 (47%), Gaps = 2/93 (2%) Query: 51 IRIAGENGFPAWSEIPPPTHEEQIAAAELKKQQLINQANDYMNSKQWAGKAAIGRLKGEE 110 I + G + PP TH+E + AE ++Q L++ AN + W +G + Sbjct: 44 IGWFYDKGKLSSPTQPPKTHDELLREAENERQCLLDSANSLI--MNWQSDLLLGIISENN 101 Query: 111 LAQYNLWLDYLDALELVDTSSAPDIEWPTPPAV 143 LW +Y+++L VD S P+I WP P + Sbjct: 102 KGNLLLWKEYVNSLMSVDLSLVPEITWPERPEI 134 >UniRef50_D0ZBI3 Phage tail assembly chaperone gp38 n=1 Tax=Edwardsiella tarda EIB202 RepID=D0ZBI3_EDWTE Length = 193 Score = 89.4 bits (220), Expect = 3e-17, Method: Composition-based stats. Identities = 27/89 (30%), Positives = 42/89 (47%), Gaps = 2/89 (2%) Query: 54 AGENGFPAWSEIPPPTHEEQIAAAELKKQQLINQANDYMNSKQWAGKAAIGRLKGEELAQ 113 AW H + AE KK L+++A + + W + +G + + A Sbjct: 106 YDAWDGTAWVTDLNAQHAANVELAEQKKSLLLSEAQEKIGL--WQTELQLGMITDSDKAA 163 Query: 114 YNLWLDYLDALELVDTSSAPDIEWPTPPA 142 W+ Y+ A++ VDTS+APDI WP PA Sbjct: 164 LITWMTYIKAVQAVDTSAAPDIAWPPKPA 192 >UniRef50_C4UND1 Conserved hypothetical phage tail fiber protein n=2 Tax=Enterobacteriaceae RepID=C4UND1_YERRU Length = 165 Score = 87.8 bits (216), Expect = 9e-17, Method: Composition-based stats. Identities = 36/102 (35%), Positives = 57/102 (55%), Gaps = 6/102 (5%) Query: 2 NYIYSATTNSFYPLEMKEDYTQAGSWP-DDAVEVDEQVYIEFSGLPPKGKIRIAGENGFP 60 YIYSA N+F+P++ +DY+ W DAVEV + V +EF G P GK+R AG +G P Sbjct: 3 TYIYSAKNNAFFPVDYLDDYSH---WDLSDAVEVSDGVAMEFMGGAPIGKVRAAGVDGHP 59 Query: 61 AWSEIPP--PTHEEQIAAAELKKQQLINQANDYMNSKQWAGK 100 W++ PP P ++++A+ + + A D M ++ Sbjct: 60 CWTDKPPALPLSDDELASLARQYRDAFIVATDNMMVSDYSID 101 >UniRef50_Q2NV40 Hypothetical phage protein n=2 Tax=Enterobacteriaceae RepID=Q2NV40_SODGM Length = 203 Score = 87.1 bits (214), Expect = 2e-16, Method: Composition-based stats. Identities = 33/143 (23%), Positives = 47/143 (32%), Gaps = 14/143 (9%) Query: 12 FYPLEMKEDYTQAGSWPDDAVEVDE--------QVYIEFSGLPPKGKIRIAGENGFPAWS 63 F P MK +Y G ++ E+ + + NG Sbjct: 45 FQPETMKIEYETNGVIRSMGYDISGFCPEGCSVAEVSEWPEEAAANR-KWCFLNGQ---V 100 Query: 64 EIPPPTHEEQIAAAELKKQQLINQANDYMNSKQWAGKAAIGRLKGEELAQYNLWLDYLDA 123 T +E + A K+ + QA + Q A E W Y Sbjct: 101 VPRVYTADELMEQATHKRDYRLEQAAKIIAPLQDAVDLD--MATDAEKVTLLAWKKYRVL 158 Query: 124 LELVDTSSAPDIEWPTPPAVQAR 146 L +D SSAPDI+WP PP+ R Sbjct: 159 LNRLDISSAPDIDWPDPPSETNR 181 >UniRef50_P03740 Tail fiber assembly protein n=117 Tax=root RepID=TFA_LAMBD Length = 194 Score = 86.7 bits (213), Expect = 2e-16, Method: Composition-based stats. Identities = 35/117 (29%), Positives = 54/117 (46%), Gaps = 7/117 (5%) Query: 30 DAVEVDE--QVYIEFSGLPPKGKIRIAGENGFPAWSEIPPPTHEEQIAAAELKKQQLINQ 87 DA+ + E + F+ L P G+ + + AW + +I AE K+ L+ Sbjct: 83 DALFISELGPLPENFTWLSPGGEYQ---KWNGTAWVKDTEAEKLFRIREAEETKKSLMQV 139 Query: 88 ANDYMNSKQWAGKAAIGRLKGEELAQYNLWLDYLDALELVDTSSAPDIEWPTPPAVQ 144 A++++ Q A I EE + W Y L VDTS+APDIEWP P ++ Sbjct: 140 ASEHIAPLQDAADLEIA--TKEETSLLEAWKKYRVLLNRVDTSTAPDIEWPAVPVME 194 >UniRef50_B2K1B5 Putative uncharacterized protein n=6 Tax=Yersinia RepID=B2K1B5_YERPB Length = 159 Score = 84.7 bits (208), Expect = 7e-16, Method: Composition-based stats. Identities = 36/143 (25%), Positives = 58/143 (40%), Gaps = 8/143 (5%) Query: 2 NYIYSATTNSFYPLEMKEDYTQAGSWPDDAVEVDEQVYIEFSGLPPKGKIRIAGENGFPA 61 +SATT FYP E KE+Y GSWPDDA+ + ++ ++ P + G P Sbjct: 1 MIYFSATTGGFYPQEWKEEYLATGSWPDDALLLTKKEQTKYWKHVPATGKMLGVMKGRPV 60 Query: 62 WSEIP--PPTHEEQIAAAELKKQQLINQANDYMNSKQWAGKAAIGRLKGEELAQYNLWL- 118 W +IP P H + +AA + + + D + ++ L + A+ Sbjct: 61 WLDIPPLPAPHGDTLAALARRHRDAFIKTTDSITVIDYS--IDDSPLTDTQRAELTATRA 118 Query: 119 DYLDALELVDTSSAPDIEWPTPP 141 Y + P +E P P Sbjct: 119 AYRAWPT---VENWPRVELPELP 138 >UniRef50_B2Q1Q3 Putative uncharacterized protein n=1 Tax=Providencia stuartii ATCC 25827 RepID=B2Q1Q3_PROST Length = 116 Score = 83.6 bits (205), Expect = 2e-15, Method: Composition-based stats. Identities = 39/140 (27%), Positives = 53/140 (37%), Gaps = 26/140 (18%) Query: 1 MNYIYSATTNSFYPLEMKEDYTQAGSWPDDAVEVDEQVYIEFSGLPPKGKIRIAGENGFP 60 M Y Y N Y E D +Q DD V + EQ + + Sbjct: 1 MKY-YKDKNNEVYAYE--SDGSQDAFIADDLVLIAEQEALAITN---------------- 41 Query: 61 AWSEIPPPTHEEQIAAAELKKQQLINQANDYMNSKQWAGKAAIGRLKGEELAQYNLWLDY 120 PPPT E+ IA AE +KQ L+N+A + Q A +G EE Q +W +Y Sbjct: 42 -----PPPTKEQLIAEAEYQKQALLNEATAAIAPLQDA--VDLGIATDEEREQLRVWKEY 94 Query: 121 LDALELVDTSSAPDIEWPTP 140 + VD + WP Sbjct: 95 RVEVNRVDVGLGLCVNWPVS 114 >UniRef50_C4TT86 Conserved hypothetical phage tail fiber protein n=1 Tax=Yersinia kristensenii ATCC 33638 RepID=C4TT86_YERKR Length = 161 Score = 83.2 bits (204), Expect = 2e-15, Method: Composition-based stats. Identities = 41/142 (28%), Positives = 64/142 (45%), Gaps = 7/142 (4%) Query: 2 NYIYSATTNSFYPLEMKEDYTQAGSWPDDAVEVDEQVYIEFSGLPPKGKIRIAGENGFPA 61 Y +SATT SFYP E+ + YT AG+ P D +E+ + +Y +F+ P GK+R A + G P Sbjct: 1 MYCFSATTLSFYPKELLDVYTDAGTLPSDLIEIGDDIYAQFAAQQPAGKMRGADKKGKPV 60 Query: 62 WSEIPPPTHEEQIAAAELKK-QQLINQANDYMNSKQWAGKAAIGRLKGEELAQYNLWL-D 119 W +P P AA ++ + A D M ++ L + ++ Sbjct: 61 WVNVPAPVVTADAVAATARRYRDAFITATDAMTIIDYS--IDDKPLTDAQRSELMAIRAA 118 Query: 120 YLDALELVDTSSAPDIEWPTPP 141 Y ++ P IE P P Sbjct: 119 YRAWPT---LANWPLIELPELP 137 >UniRef50_Q47427 Tail fiber assembly protein homolog n=47 Tax=root RepID=TFAB_ECOLX Length = 203 Score = 82.4 bits (202), Expect = 4e-15, Method: Composition-based stats. Identities = 33/114 (28%), Positives = 49/114 (42%), Gaps = 6/114 (5%) Query: 32 VEVDEQVYIEFSGLPPKGKIRIA-----GENGFPAWSEIPPPTHEEQIAAAELKKQQLIN 86 ++ I G P+ IA + W H + AAE K+Q LI+ Sbjct: 85 IDTGNPEEITVLGDYPENTTTIAPLTPYDKWDGEKWVVDTEAQHSAAVEAAETKRQSLID 144 Query: 87 QANDYMNSKQWAGKAAIGRLKGEELAQYNLWLDYLDALELVDTSSAPDIEWPTP 140 A D ++ Q +A +L E Q N LDY+D L +D ++APD+ WP Sbjct: 145 TAMDSISLIQLKLRAG-RKLTQAETTQLNSVLDYIDELNAMDLTTAPDLNWPEK 197 >UniRef50_C7BTY5 Hypothetical phage protein n=1 Tax=Photorhabdus asymbiotica subsp. asymbiotica ATCC 43949 RepID=C7BTY5_PHOAA Length = 178 Score = 81.7 bits (200), Expect = 6e-15, Method: Composition-based stats. Identities = 25/74 (33%), Positives = 36/74 (48%), Gaps = 2/74 (2%) Query: 68 PTHEEQIAAAELKKQQLINQANDYMNSKQWAGKAAIGRLKGEELAQYNLWLDYLDALELV 127 T EE I AE ++ QL+ +AN+ + Q A + EE W Y L + Sbjct: 105 ATKEELIRKAEYERVQLLVKANNIIVPLQDA--IDLNIATEEEKNTLLKWKKYRIMLNRI 162 Query: 128 DTSSAPDIEWPTPP 141 D S+ P+I WP+PP Sbjct: 163 DISTTPEIVWPSPP 176 >UniRef50_C6CP80 Tail assembly chaperone gp38 n=2 Tax=Dickeya RepID=C6CP80_DICZE Length = 131 Score = 81.3 bits (199), Expect = 8e-15, Method: Composition-based stats. Identities = 33/143 (23%), Positives = 58/143 (40%), Gaps = 14/143 (9%) Query: 1 MNYIYSATTNSFYPLEMKEDYTQAGSW-PDDAVEVDEQVYIEFSGLPPKGKIRIAGENGF 59 M+ +Y+ N + D W P + +D + + +G Sbjct: 1 MSKVYAVIENGVVINTVVWDSDVGADWKPQNGALID--------ISSERVGVGYLYSDGV 52 Query: 60 PAWSEIPPPTHEEQIAAAELKKQQLINQANDYMNSKQWAGKAAIGRLKGEELAQYNLWLD 119 P +E IA A L+K QL+++A + W +G + +E ++ W + Sbjct: 53 FT---PPEKRRDEYIADATLRKTQLLSEAQKMIA--NWQTDLMLGVISDDEKSRLVRWRE 107 Query: 120 YLDALELVDTSSAPDIEWPTPPA 142 Y+ ++ +D SAPDI WP PP Sbjct: 108 YMKQVDAIDAQSAPDITWPVPPT 130 >UniRef50_B7UG05 Predicted tail fiber assembly protein n=1 Tax=Escherichia coli O127:H6 str. E2348/69 RepID=B7UG05_ECO27 Length = 122 Score = 80.1 bits (196), Expect = 2e-14, Method: Composition-based stats. Identities = 26/79 (32%), Positives = 46/79 (58%), Gaps = 3/79 (3%) Query: 66 PPPTHEEQIAAAELKKQQLINQANDYMNSKQWAGKAAIGRLKGEELAQYNLWLDYLDALE 125 PPPTHE+ I AAE ++Q+L++ A+ M W + +G + A+ + WL Y + ++ Sbjct: 46 PPPTHEQLIQAAENERQRLLSAADAIM--LDWRTELMLGEISDANRAKLSAWLLYKNQVK 103 Query: 126 LVDTSSAPD-IEWPTPPAV 143 VD ++ P+ + WP P + Sbjct: 104 AVDVTTDPEHVNWPVIPEL 122 >UniRef50_B6XAD7 Putative uncharacterized protein n=2 Tax=Providencia alcalifaciens DSM 30120 RepID=B6XAD7_9ENTR Length = 177 Score = 79.7 bits (195), Expect = 3e-14, Method: Composition-based stats. Identities = 24/80 (30%), Positives = 34/80 (42%), Gaps = 2/80 (2%) Query: 63 SEIPPPTHEEQIAAAELKKQQLINQANDYMNSKQWAGKAAIGRLKGEELAQYNLWLDYLD 122 E +A AE + L+N+A ++S Q A I E+ W Y Sbjct: 100 VLPYETPKETLVAKAENTLRLLLNEATIKIDSLQDAVDLDIA--TDAEIVSLKEWKKYRV 157 Query: 123 ALELVDTSSAPDIEWPTPPA 142 L VDTS+APD+ +P P Sbjct: 158 LLNRVDTSTAPDVSFPEKPE 177 >UniRef50_UPI000197C594 tail assembly chaperone gp38 n=1 Tax=Providencia rettgeri DSM 1131 RepID=UPI000197C594 Length = 169 Score = 79.7 bits (195), Expect = 3e-14, Method: Composition-based stats. Identities = 24/90 (26%), Positives = 36/90 (40%), Gaps = 5/90 (5%) Query: 52 RIAGENGFPAWSEIPPPTHEEQIAAAELKKQQLINQANDYMNSKQWAGKAAIGRLKGEEL 111 R +NG P+ EE + A +KQ L+ +A + Q A +G EE Sbjct: 84 RWIYKNG---AIMQYEPSLEESVFLASQQKQLLLEEATAAIAPLQDA--VDLGIATDEER 138 Query: 112 AQYNLWLDYLDALELVDTSSAPDIEWPTPP 141 Q W +Y + VD ++ WP P Sbjct: 139 VQLKAWKEYRVEVNRVDIGLGENVNWPVKP 168 >UniRef50_A8GLQ6 Tail assembly chaperone gp38 n=2 Tax=root RepID=A8GLQ6_SERP5 Length = 170 Score = 78.6 bits (192), Expect = 5e-14, Method: Composition-based stats. Identities = 25/80 (31%), Positives = 34/80 (42%), Gaps = 2/80 (2%) Query: 63 SEIPPPTHEEQIAAAELKKQQLINQANDYMNSKQWAGKAAIGRLKGEELAQYNLWLDYLD 122 P + E A A+ K +LI A ++ Q A I +ELA+ W+ Y Sbjct: 93 VVPRPYSQAELSAQAQQAKNKLIELATKAISPLQDAKDLDIA--TDDELAKLKEWMVYRV 150 Query: 123 ALELVDTSSAPDIEWPTPPA 142 L VDT P+I WP P Sbjct: 151 HLNRVDTGMTPNIVWPQSPE 170 >UniRef50_A4WEL2 Phage tail assembly chaperone gp38 n=1 Tax=Enterobacter sp. 638 RepID=A4WEL2_ENT38 Length = 198 Score = 77.4 bits (189), Expect = 1e-13, Method: Composition-based stats. Identities = 31/115 (26%), Positives = 50/115 (43%), Gaps = 7/115 (6%) Query: 33 EVDEQVYIEFSGLPPKGKIRIAGENGFPAWSEIPPPTHEEQIAAAE-----LKKQQLINQ 87 E + V I G + A F W+ T E + A+ KK L+++ Sbjct: 86 ETGQAVGITAPGAYAQNVTLSAPLTPFDRWNGQSWVTDLEALRQADESQARQKKAGLLSE 145 Query: 88 ANDYMNSKQWAGKAAIGRLKGEELAQYNLWLDYLDALELVDTSSAPDIEWPTPPA 142 A+ ++ W +G + E+ A W+ Y+ AL VD ++APDI+WP P Sbjct: 146 AHSTISL--WQTGLQLGIISDEDKASLITWMTYIQALNAVDVTAAPDIDWPLMPE 198 >UniRef50_Q7N2R7 Similar to phage tail fiber assembly protein n=1 Tax=Photorhabdus luminescens subsp. laumondii RepID=Q7N2R7_PHOLL Length = 170 Score = 76.7 bits (187), Expect = 2e-13, Method: Composition-based stats. Identities = 23/73 (31%), Positives = 34/73 (46%), Gaps = 2/73 (2%) Query: 69 THEEQIAAAELKKQQLINQANDYMNSKQWAGKAAIGRLKGEELAQYNLWLDYLDALELVD 128 T EEQ + +K + + AN+ + Q A +G +E A W Y +L +D Sbjct: 99 TLEEQRQQEKNQKSEKMIAANEVIQPLQDA--IDLGIATNKEKALLLEWKRYRVSLNRID 156 Query: 129 TSSAPDIEWPTPP 141 TS A +I WP P Sbjct: 157 TSLASEIIWPEQP 169 >UniRef50_A5A5F3 Tail fiber assembly n=4 Tax=Pseudomonas aeruginosa RepID=A5A5F3_PSEAE Length = 152 Score = 75.5 bits (184), Expect = 4e-13, Method: Composition-based stats. Identities = 32/140 (22%), Positives = 56/140 (40%), Gaps = 12/140 (8%) Query: 2 NYIYSATTNSFYPLEMKEDYTQAGSWPDDAVEVDEQVYIEFSGLPPKGKIRIAGENGFPA 61 Y +S + +FYP ++E Y AG WP D V +++ + G+ + NG P Sbjct: 4 EYYFSPSQVAFYPASLREVYEHAGCWPVDGEWVSAELHEQLMNEQAAGRAISSDVNGNPV 63 Query: 62 WSEIPPPTHEEQIAAAELKKQQLINQANDYMNSKQWAGKAAIGRLKGEELAQYNLWLDYL 121 E PP + +Q + E + + A D + + + G+ QY+ + Y Sbjct: 64 AIERPPLSR-QQRSTHERRWRDSQLLATDGLVVRH-RDQLETGKETTLLPVQYHELMSYR 121 Query: 122 DALELVDTSSAPDIEWPTPP 141 +L +WP P Sbjct: 122 ASLR----------DWPEEP 131 >UniRef50_P77326 Putative tail fiber assembly protein homolog from prophage CPS-53 n=30 Tax=Enterobacteriaceae RepID=TFAS_ECOLI Length = 114 Score = 75.5 bits (184), Expect = 5e-13, Method: Composition-based stats. Identities = 28/90 (31%), Positives = 43/90 (47%), Gaps = 1/90 (1%) Query: 54 AGENGFPAWSEIPPPTHEEQIAAAELKKQQLINQANDYMNSKQWAGKAAIGRLKGEELAQ 113 + W H + AAE ++Q LI+ A ++ Q +A +L E ++ Sbjct: 26 YDKWDGEKWVTDTEAQHSVAVDAAEAQRQSLIDTAMASISLIQLKLQAG-RKLMQAETSR 84 Query: 114 YNLWLDYLDALELVDTSSAPDIEWPTPPAV 143 N LDY+DA+ DTS+APD+ WP P Sbjct: 85 LNTVLDYIDAVTATDTSTAPDVIWPELPEE 114 >UniRef50_C7BSP4 Putative tail fiber protein of prophage cp-933x (Tail fiber assembl protein) n=1 Tax=Photorhabdus asymbiotica subsp. asymbiotica ATCC 43949 RepID=C7BSP4_PHOAA Length = 183 Score = 75.1 bits (183), Expect = 7e-13, Method: Composition-based stats. Identities = 24/71 (33%), Positives = 30/71 (42%), Gaps = 2/71 (2%) Query: 71 EEQIAAAELKKQQLINQANDYMNSKQWAGKAAIGRLKGEELAQYNLWLDYLDALELVDTS 130 EE E KK+QL+ +A + Q A I E W Y L +DTS Sbjct: 114 EEIKQGVESKKRQLMVEACTKIAPLQDAVDLDIA--TEAEKDALLAWKKYRVMLNRIDTS 171 Query: 131 SAPDIEWPTPP 141 A +IEWP P Sbjct: 172 QAYNIEWPEQP 182 >UniRef50_A4TJD5 Phage tail fiber assembly protein n=22 Tax=Yersinia RepID=A4TJD5_YERPP Length = 139 Score = 74.3 bits (181), Expect = 1e-12, Method: Composition-based stats. Identities = 38/141 (26%), Positives = 55/141 (39%), Gaps = 21/141 (14%) Query: 11 SFYPLEMKEDYTQAGSWPDDAVEVDE--QVYIEFSGLPPKGKIRIAGENGFPAWSEIPPP 68 FY + + Y W D V Y P G E W+E P Sbjct: 11 GFYIEDYIDGYLPKN-WTADLVGDGYYKAQYQNADIDPDTG------EWTGGVWAETSGP 63 Query: 69 T-------HEEQIAAAELKKQQLINQANDYMNSKQWAGKAAIGRLKGEELAQYNLWLDYL 121 + E + A+LKK +LI+ A+D + + + + A+ LW Y Sbjct: 64 STIDISAQKAEFVTQAKLKKSKLISDASDRIEILKDRIELG-----QDRAAELKLWKSYR 118 Query: 122 DALELVDTSSAPDIEWPTPPA 142 AL+ +D S+APDIEWP P Sbjct: 119 IALDDIDVSAAPDIEWPLKPE 139 >UniRef50_O68721 Lambda tail fiber assembly protein G n=32 Tax=Enterobacteriaceae RepID=O68721_YERPE Length = 202 Score = 74.3 bits (181), Expect = 1e-12, Method: Composition-based stats. Identities = 22/79 (27%), Positives = 29/79 (36%), Gaps = 2/79 (2%) Query: 61 AWSEIPPPTHEEQIAAAELKKQQLINQANDYMNSKQWAGKAAIGRLKGEELAQYNLWLDY 120 AW + + AE K L+ + ++ Q A EE Y Sbjct: 113 AWVKDEEAEKTALVGEAEQNKSVLMKNVSQQISLLQDAIDLD--MATDEEKETLVALKKY 170 Query: 121 LDALELVDTSSAPDIEWPT 139 L VDTS APDI+WP Sbjct: 171 RVLLNRVDTSLAPDIDWPI 189 >UniRef50_B4T1N8 Tail assembly chaperone gp38 n=3 Tax=Salmonella enterica subsp. enterica RepID=B4T1N8_SALNS Length = 171 Score = 74.3 bits (181), Expect = 1e-12, Method: Composition-based stats. Identities = 31/116 (26%), Positives = 45/116 (38%), Gaps = 15/116 (12%) Query: 30 DAVEVDEQVYIEFSGLPPKGKIRIAGENGFPAWSEIPPPTHEE--QIAAAELKKQQLINQ 87 D VE D Y + GK + I P T+ E +K + + Sbjct: 69 DIVESDSLPYDDII----SGKYQFVDNK-------IIPRTYNEVELTQITNAEKSKKLKL 117 Query: 88 ANDYMNSKQWAGKAAIGRLKGEELAQYNLWLDYLDALELVDTSSAPDIEWPTPPAV 143 AN+ + Q A +G EE+ + W Y + +DTS+ DI WP PP V Sbjct: 118 ANEKIRPLQDA--VDLGIATDEEIQKLGAWKRYRVEINRIDTSNLLDISWPLPPDV 171 >UniRef50_A4JWL9 Putative uncharacterized protein n=1 Tax=Burkholderia phage phiE255 RepID=A4JWL9_9CAUD Length = 116 Score = 74.0 bits (180), Expect = 1e-12, Method: Composition-based stats. Identities = 28/114 (24%), Positives = 50/114 (43%), Gaps = 3/114 (2%) Query: 32 VEVDEQVYIEFSGLPPKGKIRIAGENGFPAWSEIPPPTHEEQIAAAELKKQQLINQANDY 91 +E+ ++ + +GK ++G P + PPT E+ + + +L+ +A+ Sbjct: 4 IEITDEQWKMLLAGESQGKRMAVDDSGAPVLLDPLPPTVEQIVTGNTAARDRLLERASVA 63 Query: 92 MNSKQWAGKAAIGRLKGEELAQYNLWLDYLDALELVDTSSAPDIEWPTPPAVQA 145 + Q A +G E AQ W+ Y AL+ VD + D WP P + A Sbjct: 64 LTPLQTA--ITLGEATDGETAQARAWITYTRALKSVDL-TQRDPTWPEQPKIVA 114 >UniRef50_C4SNQ0 Putative uncharacterized protein n=4 Tax=Yersinia RepID=C4SNQ0_YERFR Length = 192 Score = 74.0 bits (180), Expect = 1e-12, Method: Composition-based stats. Identities = 29/112 (25%), Positives = 44/112 (39%), Gaps = 7/112 (6%) Query: 36 EQVYIEFSGLPPKGKIRIAGENGFPAWSEIPPPTHEEQ-----IAAAELKKQQLINQAND 90 E I G P+ +A + F W+ E +A A KK L+ +AN Sbjct: 83 EPQTINQLGSLPENTTLLAPSSSFDRWNGTKWVKDSEAEKQYYLAEARQKKSILLEEANT 142 Query: 91 YMNSKQWAGKAAIGRLKGEELAQYNLWLDYLDALELVDTSSAPDIEWPTPPA 142 + + + + + E + W Y L +D S+APDI WP PA Sbjct: 143 QIEILKDSIEFDMSTSTAE--TELVAWRKYRVQLNQLDISAAPDINWPKQPA 192 >UniRef50_B2VJG1 Tail fiber assembly protein n=1 Tax=Erwinia tasmaniensis RepID=B2VJG1_ERWT9 Length = 198 Score = 73.6 bits (179), Expect = 2e-12, Method: Composition-based stats. Identities = 31/115 (26%), Positives = 40/115 (34%), Gaps = 7/115 (6%) Query: 32 VEVDEQVYIEFSGLPPKGKIRIA-----GENGFPAWSEIPPPTHEEQIAAAELKKQQLIN 86 + + I G P + + W E IA A K LI Sbjct: 84 IRTGAEQQITVPGDYPADTTIYSPSTPYDKWNGERWVTDEAAKAEADIAEAAAAKAVLIK 143 Query: 87 QANDYMNSKQWAGKAAIGRLKGEELAQYNLWLDYLDALELVDTSSAPDIEWPTPP 141 A + Q A + EE ++Y+ W Y L VD S APDI WP PP Sbjct: 144 SAAAKIEPLQDAVQLD--MATDEEKSRYDAWRKYRVLLTRVDISQAPDINWPEPP 196 >UniRef50_P40784 Tail fiber assembly protein homolog from lambdoid prophage Fels-1 n=65 Tax=Enterobacteriaceae RepID=YCDD_SALTY Length = 191 Score = 72.8 bits (177), Expect = 3e-12, Method: Composition-based stats. Identities = 24/103 (23%), Positives = 39/103 (37%), Gaps = 6/103 (5%) Query: 40 IEFSGLPPKGKIRIAGENGFPAWSEIPPPTHEEQIAAAELKKQQLINQANDYMNSKQWAG 99 + + P G + AW ++ AE K +L+ A+ + Q A Sbjct: 95 ENVTSVSPGGGYKKWDSKAK-AWVNDEGAEVAARLREAEGTKSRLLQMASGKIAPLQDA- 152 Query: 100 KAAIGRLKGEELAQYNLWLDYLDALELVDTSSAPDIEWPTPPA 142 +G +E AQ + W Y + VDTS+ +WP P Sbjct: 153 -VDLGIATDDEKAQLDEWKKYRVLVNRVDTSNP---DWPEQPV 191 >UniRef50_D2TR90 Putative phage tail fibre assembly protein n=1 Tax=Citrobacter rodentium ICC168 RepID=D2TR90_CITRO Length = 199 Score = 71.7 bits (174), Expect = 7e-12, Method: Composition-based stats. Identities = 32/116 (27%), Positives = 49/116 (42%), Gaps = 7/116 (6%) Query: 32 VEVDEQVYIEFSGLPPKGKIRIAGENGFPAWSEIPPPTHEEQIAAA-----ELKKQQLIN 86 +E E V I G P G + + W T + AA E +K L+ Sbjct: 85 IENGEPVEITAPGDYPAGTTTLFPSTPYDEWDGEKWVTDTDAQHAADVAAAEHQKTALLA 144 Query: 87 QANDYMNSKQWAGKAAIGRLKGEELAQYNLWLDYLDALELVDTSSAPDIEWPTPPA 142 A D ++ W + +G + ++ A WL Y+ L+ VDT ++PDI WP P Sbjct: 145 AAQDTISI--WQTELQLGIISDDDKASLISWLSYIKELQTVDTDASPDINWPVAPV 198 >UniRef50_A8GA30 Tail assembly chaperone gp38 n=2 Tax=Enterobacteriaceae RepID=A8GA30_SERP5 Length = 184 Score = 71.3 bits (173), Expect = 9e-12, Method: Composition-based stats. Identities = 25/70 (35%), Positives = 30/70 (42%), Gaps = 2/70 (2%) Query: 72 EQIAAAELKKQQLINQANDYMNSKQWAGKAAIGRLKGEELAQYNLWLDYLDALELVDTSS 131 E A KKQ L QA+ + + A + EE Q W Y L VD Sbjct: 116 ELQQQAMNKKQDLSKQASLKIATLNDAVELE--MASEEEQKQLTAWKTYRVLLSRVDPGL 173 Query: 132 APDIEWPTPP 141 APDI+WP PP Sbjct: 174 APDIDWPQPP 183 >UniRef50_D0FT42 Phage tail assembly chaperone n=2 Tax=Erwinia pyrifoliae RepID=D0FT42_ERWPY Length = 186 Score = 70.5 bits (171), Expect = 2e-11, Method: Composition-based stats. Identities = 28/127 (22%), Positives = 48/127 (37%), Gaps = 8/127 (6%) Query: 22 TQAGSWPDDAVEVDEQVYIEFSGLP--PKGKIRIAGENGFPAWSEIPPPTHEEQIAAAEL 79 + WP DA + E + P + T EE A L Sbjct: 66 DASTLWPSDAC-IAEINKNQLPKDFELPINTDGWQFDGKKIV---PRAYTQEELQEKAGL 121 Query: 80 KKQQLINQANDYMNSKQWAGKAAIGRLKGEELAQYNLWLDYLDALELVDTSSAPDIEWPT 139 K+ L+ A+ + + A + I +E+ +W+ Y + +D ++AP+I+WP Sbjct: 122 IKENLLQLASVKIAPLRDAQELDIA--TDDEINALKIWMTYRVQINRIDITNAPNIKWPG 179 Query: 140 PPAVQAR 146 P V R Sbjct: 180 MPDVSRR 186 >UniRef50_D2U2G7 Phage tail fiber assembly protein n=1 Tax=Arsenophonus nasoniae RepID=D2U2G7_9ENTR Length = 177 Score = 70.1 bits (170), Expect = 2e-11, Method: Composition-based stats. Identities = 26/87 (29%), Positives = 36/87 (41%), Gaps = 2/87 (2%) Query: 55 GENGFPAWSEIPPPTHEEQIAAAELKKQQLINQANDYMNSKQWAGKAAIGRLKGEELAQY 114 + W QI A KK QLIN+A +N + + +G + + Sbjct: 92 DKWDGKKWVTDNQAVKAAQINTANEKKLQLINEAEQIINPLERKVRLGMG--NDIDASTL 149 Query: 115 NLWLDYLDALELVDTSSAPDIEWPTPP 141 W Y L +DTS APDI+WP P Sbjct: 150 REWEIYSVKLNDIDTSIAPDIDWPEKP 176 >UniRef50_A7MLN9 Putative uncharacterized protein n=1 Tax=Cronobacter sakazakii ATCC BAA-894 RepID=A7MLN9_ENTS8 Length = 193 Score = 69.3 bits (168), Expect = 3e-11, Method: Composition-based stats. Identities = 21/80 (26%), Positives = 34/80 (42%), Gaps = 2/80 (2%) Query: 62 WSEIPPPTHEEQIAAAELKKQQLINQANDYMNSKQWAGKAAIGRLKGEELAQYNLWLDYL 121 W + Q A +K L+N+A++ ++ Q A +E + + Sbjct: 116 WLPDDKALKDIQQKEAVERKTMLMNEASNEISLLQDAVDLD--MATEDETTRLLALKKFR 173 Query: 122 DALELVDTSSAPDIEWPTPP 141 L +DT+SAPDI WP P Sbjct: 174 VLLSRIDTTSAPDISWPIAP 193 >UniRef50_D1P8C4 Tail fiber assembly protein n=1 Tax=Providencia rustigianii DSM 4541 RepID=D1P8C4_9ENTR Length = 117 Score = 68.6 bits (166), Expect = 7e-11, Method: Composition-based stats. Identities = 24/77 (31%), Positives = 35/77 (45%), Gaps = 2/77 (2%) Query: 66 PPPTHEEQIAAAELKKQQLINQANDYMNSKQWAGKAAIGRLKGEELAQYNLWLDYLDALE 125 PP + E+ IA AE KKQ L+++A ++ + + +E W Y Sbjct: 43 PPVSKEQHIAEAEQKKQFLLDEAERHIAILERKVRLE--MATDDEKDLLTAWEIYSVKTA 100 Query: 126 LVDTSSAPDIEWPTPPA 142 DTS APDI+W P Sbjct: 101 DADTSKAPDIDWGVKPE 117 >UniRef50_C4U6G1 Phage tail assembly chaperone gp38 n=2 Tax=Enterobacteriaceae RepID=C4U6G1_YERAL Length = 51 Score = 68.2 bits (165), Expect = 8e-11, Method: Composition-based stats. Identities = 38/51 (74%), Positives = 42/51 (82%) Query: 92 MNSKQWAGKAAIGRLKGEELAQYNLWLDYLDALELVDTSSAPDIEWPTPPA 142 MNSKQW GKAA+GRLK +E AQYN WLDYLD LE VDTS+APDI+WP P Sbjct: 1 MNSKQWPGKAAMGRLKDDEKAQYNAWLDYLDLLEEVDTSTAPDIDWPVAPE 51 >UniRef50_C6C6Z1 Tail assembly chaperone gp38 n=3 Tax=Dickeya RepID=C6C6Z1_DICDC Length = 204 Score = 66.6 bits (161), Expect = 2e-10, Method: Composition-based stats. Identities = 22/89 (24%), Positives = 32/89 (35%), Gaps = 2/89 (2%) Query: 54 AGENGFPAWSEIPPPTHEEQIAAAELKKQQLINQANDYMNSKQWAGKAAIGRLKGEELAQ 113 W Q+ A+ + I +A + +A + +E + Sbjct: 118 YDVWQDDGWVTDAQAQKTAQVEVAKAQLADDIAEAEKQITVLHYA--VDLNIATEQETQR 175 Query: 114 YNLWLDYLDALELVDTSSAPDIEWPTPPA 142 W YL L VD S AP I+WPT PA Sbjct: 176 LADWKTYLVLLNRVDVSEAPSIDWPTIPA 204 >UniRef50_Q7N1H8 Similarities with lambda tail fiber assembly protein G n=1 Tax=Photorhabdus luminescens subsp. laumondii RepID=Q7N1H8_PHOLL Length = 258 Score = 66.6 bits (161), Expect = 2e-10, Method: Composition-based stats. Identities = 18/72 (25%), Positives = 24/72 (33%), Gaps = 2/72 (2%) Query: 62 WSEIPPPTHEEQIAAAELKKQQLINQANDYMNSKQWAGKAAIGRLKGEELAQYNLWLDYL 121 W Q AE +K QA+ + Q A I E + W Y Sbjct: 128 WVTDKSALKSHQTEQAEQQKIARQQQADAAIKPLQDAIDLDIA--TDAEKSALVEWKKYR 185 Query: 122 DALELVDTSSAP 133 + VD S+AP Sbjct: 186 VRVNRVDLSTAP 197 >UniRef50_A1JMQ0 Phage tail fiber assembly protein n=5 Tax=Yersinia RepID=A1JMQ0_YERE8 Length = 204 Score = 66.3 bits (160), Expect = 3e-10, Method: Composition-based stats. Identities = 30/108 (27%), Positives = 44/108 (40%), Gaps = 9/108 (8%) Query: 38 VYIEFSGLP-PKGKIRIA-----GENGFPAWSEIPPPTHEEQIAAAELKKQQLINQANDY 91 I F+ P PK K I AW I AA +K LINQ +++ Sbjct: 97 ESIIFALGPIPKNKTLIQPMHEFDIWTGTAWEVDQQALKARHITAAVQQKTALINQVSEH 156 Query: 92 MNSKQWAGKAAIGRLKGEELAQYNLWLDYLDALELVDTSSAPDIEWPT 139 +N A + ++ Q + Y AL +D ++APDI+WP Sbjct: 157 INILLDAIAID---NQQTDIQQLAAFKQYRVALMRIDPNTAPDIDWPE 201 >UniRef50_C6C5D3 Tail assembly chaperone gp38 n=1 Tax=Dickeya dadantii Ech703 RepID=C6C5D3_DICDC Length = 208 Score = 66.3 bits (160), Expect = 3e-10, Method: Composition-based stats. Identities = 26/89 (29%), Positives = 37/89 (41%), Gaps = 2/89 (2%) Query: 55 GENGFPAWSEIPPPTHEEQIAAAELKKQQLINQANDYMNSKQWAGKAAIGRLKGEELAQY 114 W I AA+ ++++ AND +N +A +G EE + Sbjct: 122 DVWRDGRWVTDDAAKINAAIQAAKAEQEKRRRSANDRLNELTYAIN--LGIATPEEASAL 179 Query: 115 NLWLDYLDALELVDTSSAPDIEWPTPPAV 143 + W YL L VD APDI WPT P + Sbjct: 180 SSWQAYLVLLSRVDFGHAPDIVWPTEPGM 208 >UniRef50_C5AKX8 Putative uncharacterized protein n=1 Tax=Burkholderia glumae BGR1 RepID=C5AKX8_BURGB Length = 142 Score = 64.7 bits (156), Expect = 8e-10, Method: Composition-based stats. Identities = 27/113 (23%), Positives = 45/113 (39%), Gaps = 5/113 (4%) Query: 32 VEVDEQVYIEFSGLPPKGKIRIAGENGFPAWSEIPPPTHEEQIAAAELKKQQLINQANDY 91 +++ E GK+ + P + PP+ ++ A + L+ +A+ Sbjct: 30 IKISEDEQAWLLEGAANGKVMAVDDKERPILLDPAPPSPDQIRARNTAYRDWLLERASVA 89 Query: 92 MNSKQWAGKAAIGRLKGEELAQYNLWLDYLDALELVDTSSA-PDIEWPTPPAV 143 + Q A +G E A W+ Y AL+ VD A PD WP PA+ Sbjct: 90 LTPLQTA--MLLGNATEAEKALARQWIVYARALKKVDLGVALPD--WPAAPAI 138 >UniRef50_Q7P0Y3 Probable tail fiber assembly protein n=1 Tax=Chromobacterium violaceum RepID=Q7P0Y3_CHRVO Length = 146 Score = 64.3 bits (155), Expect = 1e-09, Method: Composition-based stats. Identities = 26/76 (34%), Positives = 33/76 (43%), Gaps = 5/76 (6%) Query: 1 MNYIYSATTNSFYPLEMKEDYTQAGSWPDDAVEVDEQVYIEFSGLPPKGKIRIAGENGFP 60 M YSA T FY E A +P DAV+V+ VY GK+ A NG P Sbjct: 1 MTIFYSAGTGGFYDSE-----IHAEGYPADAVQVEASVYEALFRGQEAGKLIQADGNGCP 55 Query: 61 AWSEIPPPTHEEQIAA 76 + P + E+Q A Sbjct: 56 VLVDGPALSIEQQRQA 71 >UniRef50_C6CP83 Tail assembly chaperone gp38 n=1 Tax=Dickeya zeae Ech1591 RepID=C6CP83_DICZE Length = 206 Score = 63.6 bits (153), Expect = 2e-09, Method: Composition-based stats. Identities = 26/101 (25%), Positives = 37/101 (36%), Gaps = 5/101 (4%) Query: 42 FSGLPPKGKIRIAGENGFPAWSEIPPPTHEEQIAAAELKKQQLINQANDYMNSKQWAGKA 101 + PP G W E + AA + A+D + +A Sbjct: 110 LTLQPPAG---AFDRWDGEQWITDNEAYQESLLKAARQACETRRQTAHDRIRELTYAQ-- 164 Query: 102 AIGRLKGEELAQYNLWLDYLDALELVDTSSAPDIEWPTPPA 142 +G +E W YL L +D S PDI+WPTPP+ Sbjct: 165 ELGMATEQETQSLKDWKIYLVQLSRIDLSLLPDIDWPTPPS 205 >UniRef50_Q7NAA2 Complete genome; segment 1/17 n=1 Tax=Photorhabdus luminescens subsp. laumondii RepID=Q7NAA2_PHOLL Length = 113 Score = 62.8 bits (151), Expect = 3e-09, Method: Composition-based stats. Identities = 19/64 (29%), Positives = 28/64 (43%), Gaps = 3/64 (4%) Query: 79 LKKQQLINQANDYMNSKQWAGKAAIGRLKGEELAQYNLWLDYLDALELVDTSSAPDIEWP 138 +K L+ QA ++ Q A +G E+ Y AL +DT +A DI+WP Sbjct: 53 RQKYYLLEQATIKISPLQDA--IDLGIATDSEITMLMELKKYRVALNRMDT-TAKDIKWP 109 Query: 139 TPPA 142 P Sbjct: 110 EKPE 113 >UniRef50_B3G0V8 Putative uncharacterized protein n=1 Tax=Pseudomonas aeruginosa RepID=B3G0V8_PSEAE Length = 144 Score = 62.4 bits (150), Expect = 4e-09, Method: Composition-based stats. Identities = 27/140 (19%), Positives = 51/140 (36%), Gaps = 18/140 (12%) Query: 1 MNYIYSATTNSFYPLEMKEDYTQAGSWPDDAVEVDEQVYIEFSGLPPKGKIRIAGENGFP 60 M++ + +FY + D PDDAVE+ + + +GK AG++G P Sbjct: 1 MSHFFGTKPIAFYDTAINTD------IPDDAVEITADEHADLLAAQARGKRIAAGKDGRP 54 Query: 61 AWSEIPPPTHEEQIAAAELKKQQLINQANDYMNSKQWAGKAAIGRLKGEELAQYNLWLDY 120 + P PT +E + + + + + + + + + E Y+ Y Sbjct: 55 ILLDPPAPTRDELESFERIWRDARLRETDSLVARHRDEIETGEAPTLDTEK--YSALQAY 112 Query: 121 LDALELVDTSSAPDIEWPTP 140 AL +WP Sbjct: 113 RRALR----------DWPEA 122 >UniRef50_Q3ZL13 Tail fiber assembly protein n=1 Tax=Escherichia blattae RepID=Q3ZL13_ESCBL Length = 291 Score = 61.6 bits (148), Expect = 7e-09, Method: Composition-based stats. Identities = 23/81 (28%), Positives = 35/81 (43%), Gaps = 3/81 (3%) Query: 61 AWSEIPPPTHEEQIAAAELKKQQLINQANDYMNSKQWAGKAAIGRLKGEELAQYNLWLDY 120 W + E + A AE K L+ +A+ + +A + G+ EE A+ W Y Sbjct: 214 TWVKDISAESEYKQAQAEQHKASLLTEASQQIAVLSYAVDS--GQATEEESARLARWQVY 271 Query: 121 LDALELVDTSSAPDIEWPTPP 141 A+ DT + DI WP P Sbjct: 272 RLAVNRTDT-TLNDITWPEKP 291 >UniRef50_Q32IA2 Hypothetical prophage protein n=1 Tax=Shigella dysenteriae Sd197 RepID=Q32IA2_SHIDS Length = 109 Score = 61.3 bits (147), Expect = 1e-08, Method: Composition-based stats. Identities = 21/84 (25%), Positives = 42/84 (50%), Gaps = 8/84 (9%) Query: 68 PTHEEQIAAAELKKQQLINQANDYMNSKQWAGKAAIGRLKGEELAQYNLWLDYLDALELV 127 P + AE ++Q+L+N+A + + W + +G + ++ + W++Y+ A++ + Sbjct: 28 PVPVDYSKLAEKQRQRLLNEAKEITS--DWKTELELGTISDDDKVRLTQWMEYIKAVKAL 85 Query: 128 DTSSAPD------IEWPTPPAVQA 145 D S+A D I WP P A Sbjct: 86 DLSTATDEISFDAINWPERPDAAA 109 >UniRef50_Q7Y3Y8 Tail fiber assembly protein n=4 Tax=root RepID=Q7Y3Y8_9CAUD Length = 135 Score = 60.9 bits (146), Expect = 1e-08, Method: Composition-based stats. Identities = 27/101 (26%), Positives = 43/101 (42%), Gaps = 6/101 (5%) Query: 46 PPKGKIRIAGENGFPAW-SEIPPPTHEEQIAAAELKKQQLINQANDYMNSKQW---AGKA 101 P G AGENG W P +E I A K+ ++ A+D + + + Sbjct: 35 PEDGINYYAGENG--EWLVGPAPQVVQEMIIEATQKQIAALSYASDIIGAIADEIEGLED 92 Query: 102 AIGRLKGEELAQYNLWLDYLDALELVDTSSAPDIEWPTPPA 142 + + + W Y ++ +D S+AP+IEWP PP Sbjct: 93 SEEDVPDKLRTDLKAWKQYRVKVKNIDVSNAPNIEWPVPPE 133 >UniRef50_C4K4X4 Phage tail assembly chaperone n=2 Tax=Candidatus Hamiltonella defensa 5AT (Acyrthosiphon pisum) RepID=C4K4X4_HAMD5 Length = 178 Score = 60.9 bits (146), Expect = 1e-08, Method: Composition-based stats. Identities = 18/73 (24%), Positives = 32/73 (43%), Gaps = 3/73 (4%) Query: 69 THEEQIAAAELKKQQLINQANDYMNSKQWAGKAAIGRLKGEELAQYNLWLDYLDALELVD 128 T E+ A +K++ +++A+ + A +G +E+ W Y AL +D Sbjct: 107 TEAERTRQAAREKEKWMDRASKAIGPLADA--VELGIATEQEVQALKNWKAYRVALHRLD 164 Query: 129 TSSAPDIEWPTPP 141 A +I WP P Sbjct: 165 P-KAGEITWPEVP 176 >UniRef50_A9DEL3 Tail fiber related protein n=1 Tax=Yersinia phage PY100 RepID=A9DEL3_9CAUD Length = 185 Score = 60.9 bits (146), Expect = 1e-08, Method: Composition-based stats. Identities = 22/79 (27%), Positives = 34/79 (43%), Gaps = 3/79 (3%) Query: 63 SEIPPPTHEEQIAAAELKKQQLINQANDYMNSKQWAGKAAIGRLKGEELAQYNLWLDYLD 122 P + E I AA +K+QLI + + ++ + A +G L E + +Y Sbjct: 106 IVRQMPDNSEIIDAARERKRQLIEEVSLEIDVLKDAE--ELGDLTPREAQRLAALKNYRV 163 Query: 123 ALELVDTSSAPDIEWPTPP 141 L VD S +I WP P Sbjct: 164 ELMRVDISKDGEI-WPIKP 181 >UniRef50_A9I964 Putative phage tail fibre protein n=1 Tax=Bordetella petrii DSM 12804 RepID=A9I964_BORPD Length = 155 Score = 60.5 bits (145), Expect = 2e-08, Method: Composition-based stats. Identities = 31/122 (25%), Positives = 53/122 (43%), Gaps = 12/122 (9%) Query: 4 IYSATTNSFYPLEMKEDYTQAGSWPDDAVEVDEQVYIEFSGLPPKGKIRIAGENGFPAWS 63 YS T FY +M + P DAVE+ +++Y + GK +A E+GFPA + Sbjct: 2 FYSVETGGFYSAKM-----HGKAMPADAVEITDELYSQLLAGQSDGKRIVADESGFPALA 56 Query: 64 EIPPPTHEEQIAAAELKKQQLINQA------NDYMNSKQWAGKAAIGRLKGEELAQYNLW 117 + PPT + Q+ ++ A +D N+ +A + A+ + E + W Sbjct: 57 DPLPPTPAQIEVQKVAVVQKHMDDAARALRYDDIANAVTYAEEPAVPKF-QAEGQAFREW 115 Query: 118 LD 119 Sbjct: 116 RS 117 >UniRef50_B3HH41 Tail assembly chaperone gp38 n=3 Tax=Enterobacteriaceae RepID=B3HH41_ECOLX Length = 176 Score = 58.2 bits (139), Expect = 8e-08, Method: Composition-based stats. Identities = 20/81 (24%), Positives = 34/81 (41%), Gaps = 8/81 (9%) Query: 68 PTHEEQIAAAELKKQQLINQANDYMNSKQWAGKAAIGRLKGEELAQYNLWLDYLDALELV 127 P + AE ++ +L A ++ K+ +G + EE + +W Y L+ + Sbjct: 95 PVPVDYRQQAESERARLTAIAEREISDKK--TDLLLGIIGDEEKEKLTVWRIYAKLLQAM 152 Query: 128 DTSSAPD------IEWPTPPA 142 D S+ D IEWP P Sbjct: 153 DFSTITDKTSYNAIEWPVSPE 173 >UniRef50_Q9B026 Probable tail fiber assembly protein n=1 Tax=Phage GMSE-1 RepID=Q9B026_9VIRU Length = 147 Score = 58.2 bits (139), Expect = 9e-08, Method: Composition-based stats. Identities = 26/111 (23%), Positives = 39/111 (35%), Gaps = 8/111 (7%) Query: 31 AVEVDEQVYIEFSGLPPKGKIRIAGENGFPAWSEIPPPTHEEQIAAAELKKQQLINQAND 90 A++ + Y G GK A +K+Q A D Sbjct: 41 AIDASNEPYAMIGGRYEDGKFIPVPPPLPEPLPPEF------LREQAMGEKRQRDAAARD 94 Query: 91 YMNSKQWAGKAAIGRLKGEELAQYNLWLDYLDALELVDTSSAPDIEWPTPP 141 + ++ + + + G E + W Y L D S+APDI WPTPP Sbjct: 95 AIALLEYVIELDMQQ--GGEAKKLRAWKKYRVLLNRADISAAPDIYWPTPP 143 >UniRef50_C4KQ09 Putative uncharacterized protein n=9 Tax=Burkholderia pseudomallei RepID=C4KQ09_BURPS Length = 150 Score = 57.8 bits (138), Expect = 1e-07, Method: Composition-based stats. Identities = 27/111 (24%), Positives = 49/111 (44%), Gaps = 3/111 (2%) Query: 33 EVDEQVYIEFSGLPPKGKIRIAGENGFPAWSEIPPPTHEEQIAAAELKKQQLINQANDYM 92 E+ ++ + +GK +NG P + PPPT E+ I + + +L+ +A+ + Sbjct: 33 EITDEQWKMLLDGESRGKRMALDDNGVPVLLDPPPPTIEQIIVSNTAMRDRLLERASVAL 92 Query: 93 NSKQWAGKAAIGRLKGEELAQYNLWLDYLDALELVDTSSAPDIEWPTPPAV 143 Q A +G E Q W+ Y AL+ +D + + WP P + Sbjct: 93 TPLQTA--IMLGDATDSEAQQARAWIAYTRALKGIDLTRR-EPTWPEQPEM 140 >UniRef50_C1D954 HsdM n=1 Tax=Laribacter hongkongensis HLHK9 RepID=C1D954_LARHH Length = 283 Score = 57.4 bits (137), Expect = 1e-07, Method: Composition-based stats. Identities = 26/120 (21%), Positives = 41/120 (34%), Gaps = 18/120 (15%) Query: 4 IYSATTNSFYPLEMKEDYTQAGSWPDDAVEVDEQVYIEFSGLPPKGKIRIAGENGFPAWS 63 YSATT FY D+ + P DA E+ + + G+I A E G+P + Sbjct: 2 FYSATTCGFY------DHYSNNAIPADAGEITSEQHAALLAGQSDGRIITANEQGYPVLT 55 Query: 64 EIPPPTHEEQIAAA--------ELKKQQLINQANDY----MNSKQWAGKAAIGRLKGEEL 111 + PP T + A + ++ I A + Q + E Sbjct: 56 DPPPATLDTLRDCALLMLPAWEKAERTSGIEHAGQRWLTTSAALQDIRDVLLAGAVLGEQ 115 >UniRef50_Q4ZMK8 Putative uncharacterized protein n=4 Tax=Pseudomonas syringae group RepID=Q4ZMK8_PSEU2 Length = 188 Score = 57.4 bits (137), Expect = 1e-07, Method: Composition-based stats. Identities = 23/94 (24%), Positives = 35/94 (37%), Gaps = 4/94 (4%) Query: 51 IRIAGENGFPAWSEIPPPTHEEQIAAAELKKQQLINQANDYMNSKQWAGKAAIGRLKGEE 110 W+ +AA L + Q + +A + Q+A A +G E Sbjct: 92 PGKYYVWREGDWALDKEAQRVALASAALLVRDQRLQEAATRIVPLQYA--ADLGDATEAE 149 Query: 111 LAQYNLWLDYLDALELVD-TSSAP-DIEWPTPPA 142 A W Y L ++ S P IEWP+PP+ Sbjct: 150 KASLLEWKRYSVKLNRIEQFSDYPLQIEWPSPPS 183 >UniRef50_P26699 Probable tail fiber assembly protein n=56 Tax=root RepID=TFA_BPP2 Length = 175 Score = 56.6 bits (135), Expect = 2e-07, Method: Composition-based stats. Identities = 20/73 (27%), Positives = 34/73 (46%), Gaps = 5/73 (6%) Query: 69 THEEQIAAAELKKQQLINQANDYMNSKQWAGKAAIGRLKGEELAQYNLWLDYLDALELVD 128 T +EQ AE +K L+++A + + + A + + EE A+ W Y + VD Sbjct: 107 TADEQQQQAESQKAALLSEAENVIQPLERAVR--LNMATDEERARLESWERYSVLVSRVD 164 Query: 129 TSSAPDIEWPTPP 141 ++ EWP P Sbjct: 165 PANP---EWPEMP 174 >UniRef50_B4T266 Caudovirales tail fibre assembly protein n=16 Tax=Enterobacteriaceae RepID=B4T266_SALNS Length = 175 Score = 56.6 bits (135), Expect = 2e-07, Method: Composition-based stats. Identities = 20/76 (26%), Positives = 37/76 (48%), Gaps = 8/76 (10%) Query: 72 EQIAAAELKKQQLINQANDYMNSKQWAGKAAIGRLKGEELAQYNLWLDYLDALELVDTSS 131 + A AE +Q+L++ AN + W + A+G + ++ A W+ Y+ L+ +D + Sbjct: 101 DYQAKAETTRQKLLDGANSIIA--DWRTELALGEISDDDKATLTKWMSYIKGLKSLDLTG 158 Query: 132 APD------IEWPTPP 141 D I+WP P Sbjct: 159 ISDEATFNKIQWPALP 174 >UniRef50_D0KLI7 Tail assembly chaperone gp38 n=2 Tax=Enterobacteriaceae RepID=D0KLI7_PECWW Length = 209 Score = 56.2 bits (134), Expect = 3e-07, Method: Composition-based stats. Identities = 25/102 (24%), Positives = 35/102 (34%), Gaps = 8/102 (7%) Query: 47 PKGKIRIAGENGFPAWSEIPPPTHEEQIAAAELKK-----QQLINQANDYMNSKQWAGKA 101 P+ +A F W T+ E A K AN + +A Sbjct: 111 PENMTFLAPATEFDQWDGTTWVTNVEAQQLAATKNLQQELAARRATANSRITELSYA--V 168 Query: 102 AIGRLKGEELAQYNLWLDYLDALELVDTSSAPDIEWPTPPAV 143 + EE Q W YL AL +D +A + WP P+V Sbjct: 169 DLAIATDEEQEQLTQWKIYLVALSRIDL-TAVSVVWPEAPSV 209 >UniRef50_B4TML3 Caudovirales tail fibre assembly protein n=16 Tax=root RepID=B4TML3_SALSV Length = 191 Score = 56.2 bits (134), Expect = 3e-07, Method: Composition-based stats. Identities = 21/75 (28%), Positives = 33/75 (44%), Gaps = 5/75 (6%) Query: 69 THEEQIAAAELKKQQLINQANDYMNSKQWAGKAAIGRLKGEELAQYNLWLDYLDALELVD 128 + EE AE +K + +++A + A K I EE+ + W Y + VD Sbjct: 121 SPEELRKKAEAEKIRRLSEAESAIAPLARAVKLKIA--TDEEIKRLEAWELYSVMVNRVD 178 Query: 129 TSSAPDIEWPTPPAV 143 T+S +WP P V Sbjct: 179 TASP---DWPEVPDV 190 >UniRef50_B4T2D9 Gp20 n=17 Tax=root RepID=B4T2D9_SALNS Length = 184 Score = 55.5 bits (132), Expect = 5e-07, Method: Composition-based stats. Identities = 21/75 (28%), Positives = 33/75 (44%), Gaps = 5/75 (6%) Query: 69 THEEQIAAAELKKQQLINQANDYMNSKQWAGKAAIGRLKGEELAQYNLWLDYLDALELVD 128 + EE AE +K + + +A + A K I EE+ + + W Y + VD Sbjct: 114 SPEELRKKAEDEKVRRLAEAESAIAPLARAVKLKIA--TDEEIKRLDAWELYSVMVNRVD 171 Query: 129 TSSAPDIEWPTPPAV 143 T+S +WP P V Sbjct: 172 TASP---DWPEVPDV 183 >UniRef50_B1JPI1 Tail assembly chaperone gp38 n=3 Tax=Yersinia pseudotuberculosis RepID=B1JPI1_YERPY Length = 172 Score = 54.7 bits (130), Expect = 9e-07, Method: Composition-based stats. Identities = 17/75 (22%), Positives = 34/75 (45%), Gaps = 6/75 (8%) Query: 68 PTHEEQIAAAELKKQQLINQANDYMNSKQWAGKAAIGRLKGEELAQYNLWLDYLDALELV 127 P+ ++ AAAE ++++L++ + + + A G +E + Y AL + Sbjct: 104 PSVDDLTAAAEERRRELMSNVSVEIATLDD--IAQSGTGTEQEQERLAALKQYRIALMRL 161 Query: 128 DTSSAPDIEWPTPPA 142 D + +WP PA Sbjct: 162 DINE----QWPVLPA 172 >UniRef50_D1UEY6 Putative uncharacterized protein n=2 Tax=Burkholderia RepID=D1UEY6_9BURK Length = 147 Score = 54.3 bits (129), Expect = 1e-06, Method: Composition-based stats. Identities = 26/112 (23%), Positives = 45/112 (40%), Gaps = 3/112 (2%) Query: 31 AVEVDEQVYIEFSGLPPKGKIRIAGENGFPAWSEIPPPTHEEQIAAAELKKQQLINQAND 90 AV++ ++ + +GK NG PA + PPT +Q ++ + + Sbjct: 31 AVQITTALWRKLIDGQGQGKRIALDANGMPALFDPLPPTAAQQAELMRGRRDAALQATDW 90 Query: 91 YMNSKQWAGKAAIGRLKGEELAQYNLWLDYLDALELV-DTSSAPDIEWPTPP 141 + Q G E Q+ + L+Y AL + D + PDI+ P P Sbjct: 91 LVARHQDETLIGAGTTLTAE--QFVVLLNYRQALRELADAEAWPDIDLPAAP 140 >UniRef50_B4TI69 Putative phage tail fiber assembly protein n=13 Tax=Salmonella enterica subsp. enterica RepID=B4TI69_SALHS Length = 176 Score = 53.9 bits (128), Expect = 2e-06, Method: Composition-based stats. Identities = 17/76 (22%), Positives = 32/76 (42%), Gaps = 2/76 (2%) Query: 68 PTHEEQIAAAELKKQQLINQANDYMNSKQWAGKAAIGRLKGEELAQYNLWLDYLDALELV 127 P + +A A ++ + + +N A + + EL + + + L + Sbjct: 102 PVQVDYVALATAERDRRMASVTSKINQLMEAQDDS--DITDAELVELSDLREVRTKLRRL 159 Query: 128 DTSSAPDIEWPTPPAV 143 D + APDI+WP P V Sbjct: 160 DLTGAPDIDWPEVPDV 175 >UniRef50_B5TK82 Conserved hypothetical phage protein n=1 Tax=Pseudomonas phage DVM-2008 RepID=B5TK82_9VIRU Length = 143 Score = 53.5 bits (127), Expect = 2e-06, Method: Composition-based stats. Identities = 28/141 (19%), Positives = 48/141 (34%), Gaps = 21/141 (14%) Query: 1 MNYIYSATTNSFYPLEMKEDYTQAGSWPDDAVEVDEQVYIEFSGLPPKGKIRIAGENGFP 60 M Y+ +T FY +GS P+DA+++ ++ Y P K + G P Sbjct: 1 MLIYYAQSTGGFYNSI-----DHSGSLPEDAIKITDEEYRTLFAAPFLNKRIESDAKGRP 55 Query: 61 AWSEIPPPTHEEQIAAAELKK--QQLINQANDYMNSKQWAGKAAIGRLKGEELAQYNLWL 118 E+ + E +K + A D + ++ + G + QY Sbjct: 56 VLLEL---SVNELTVRMTNEKNWRDGSLTATDRLIAR-DRDEMDDGGGTTLDQTQYTQLQ 111 Query: 119 DYLDALELVDTSSAPDIEWPT 139 Y AL +WP Sbjct: 112 AYRRALR----------DWPQ 122 >UniRef50_B5S309 Tail fiber assembly protein homolog n=2 Tax=Ralstonia solanacearum RepID=B5S309_RALSO Length = 198 Score = 53.2 bits (126), Expect = 3e-06, Method: Composition-based stats. Identities = 20/81 (24%), Positives = 29/81 (35%), Gaps = 3/81 (3%) Query: 61 AWSEIPPPTHEEQIAAAELKKQQLINQANDYMNSKQWAGKAAIGRLKGEELAQYNLWLDY 120 AW A +++ + QA + Q A + E A+ W Y Sbjct: 119 AWQLDEALCASLHRADGLVERNIRMKQARRAIEPLQAA--VDLADATEAEAARLVAWRRY 176 Query: 121 LDALELVDTSSAPDIEWPTPP 141 L AL VD + P + WP P Sbjct: 177 LVALNRVDLDADP-VAWPVAP 196 >UniRef50_C6CP85 Putative phage tail fibre protein n=2 Tax=Dickeya zeae Ech1591 RepID=C6CP85_DICZE Length = 125 Score = 52.8 bits (125), Expect = 4e-06, Method: Composition-based stats. Identities = 29/142 (20%), Positives = 49/142 (34%), Gaps = 22/142 (15%) Query: 4 IYSATTNSFYPLEMKEDYTQAGSWPDDAVEVDEQVYIEFSGLPPKGKIRIAGENG-FPAW 62 YS +T+ FY E + PDDA+E+ + Y +G + I E+ P Sbjct: 2 FYSKSTSGFYSDE-----INGVNIPDDAIEIRDDYYQYLLDQQVRGNVIIFDESTKKPIA 56 Query: 63 SEIPPPTHEEQIAAAELKKQQLINQANDYMNSKQWAGKAAIGRLKGEELAQYNLWLDYLD 122 P + + A ++ L+ A+D+ W Y + Sbjct: 57 VTPVPLSDTQLAEDARRQRDNLL-TASDWTQVSDAPVDQQ-------------AWRTYRE 102 Query: 123 ALELVDTSS--APDIEWPTPPA 142 L V + +I WP+ P Sbjct: 103 ILRQVPEQAGFPLNIAWPSQPE 124 >UniRef50_C6DE09 Tail assembly chaperone gp38 n=5 Tax=Enterobacteriaceae RepID=C6DE09_PECCP Length = 206 Score = 52.4 bits (124), Expect = 4e-06, Method: Composition-based stats. Identities = 22/103 (21%), Positives = 39/103 (37%), Gaps = 8/103 (7%) Query: 47 PKGKIRIAGENGFPAWSEIPPPTHEEQIAAA--ELKKQQL---INQANDYMNSKQWAGKA 101 P + + + F W + T E A KK +L ++QA++ + A + Sbjct: 107 PANQTLLVPTSEFDKWEDGKWVTDLEAQRQALIANKKVELNTKLSQASERIQVLSDAVEL 166 Query: 102 AIGRLKGEELAQYNLWLDYLDALELVDTSSAPDIEWPTPPAVQ 144 + EE + W Y L VD +S ++ P P + Sbjct: 167 NLA--TEEEKNELKAWKTYRLQLSRVDVNSFEEVL-PNLPNLN 206 >UniRef50_Q849T8 Eag0005 n=3 Tax=Haemophilus influenzae RepID=Q849T8_HAEIN Length = 80 Score = 52.0 bits (123), Expect = 5e-06, Method: Composition-based stats. Identities = 23/76 (30%), Positives = 32/76 (42%), Gaps = 9/76 (11%) Query: 1 MNYIYSATTNSFYPLEMKEDYTQAGSWPDDAVEVDEQVYIEFSGLPPKGKIRIAGENGFP 60 M Y N F+ DY+ G P+ AVE+ ++ Y+E +GK IA G+P Sbjct: 4 MTMYY---KNGFF------DYSYGGFVPEGAVEISQETYLELLNGQAQGKQIIADNTGYP 54 Query: 61 AWSEIPPPTHEEQIAA 76 A E P E Sbjct: 55 ALMEPQPSAAHELNLD 70 >UniRef50_Q3KH46 Putative phage related protein n=2 Tax=root RepID=Q3KH46_PSEPF Length = 188 Score = 51.6 bits (122), Expect = 7e-06, Method: Composition-based stats. Identities = 19/88 (21%), Positives = 29/88 (32%), Gaps = 4/88 (4%) Query: 57 NGFPAWSEIPPPTHEEQIAAAELKKQQLINQANDYMNSKQWAGKAAIGRLKGEELAQYNL 116 AW ++ L+ +A + Q+A IG EE Q Sbjct: 98 WRGDAWQLDEQARLLSISQQMLEQRDTLLREAVLRIAPLQYAE--DIGDATHEEQMQLLE 155 Query: 117 WLDYLDALELVDTSS--APDIEWPTPPA 142 W Y L +D + +I WP+ P Sbjct: 156 WKLYSVELNRIDKQTGFPREITWPSLPG 183 >UniRef50_A7ZL71 Putative uncharacterized protein n=1 Tax=Escherichia coli E24377A RepID=A7ZL71_ECO24 Length = 179 Score = 51.6 bits (122), Expect = 8e-06, Method: Composition-based stats. Identities = 21/81 (25%), Positives = 34/81 (41%), Gaps = 8/81 (9%) Query: 69 THEEQIAAAELKKQQLINQANDYMNSKQWAGKAAIGRLKGEELAQYNLWLDYLDALELVD 128 T E A AE ++ LI+ A + ++ W + + + + WL Y+ AL+ +D Sbjct: 99 TAAEWQARAESQRSALISDAKERISL--WQSELLLDIITNYDKESLTEWLAYIKALQALD 156 Query: 129 TSSAPD------IEWPTPPAV 143 S D WP P V Sbjct: 157 LSGVTDEASYNATVWPDEPRV 177 >UniRef50_Q1I688 Putative phage protein n=1 Tax=Pseudomonas entomophila L48 RepID=Q1I688_PSEE4 Length = 143 Score = 50.9 bits (120), Expect = 1e-05, Method: Composition-based stats. Identities = 30/140 (21%), Positives = 50/140 (35%), Gaps = 21/140 (15%) Query: 2 NYIYSATTNSFYPLEMKEDYTQAGSWPDDAVEVDEQVYIEFSGLPPKGKIRIAGENGFPA 61 YSA FY ++ + PDDAVE+ +++ GK + NG P Sbjct: 1 MIFYSAQNQGFYDSKVLD-----RMRPDDAVEISAELHAVLMRGQAIGKQIVVESNGMPG 55 Query: 62 WSEIPPPTHEEQIAAAELK-KQQLINQANDYMNSKQWAGKAAIGRLKGEELAQYNLWLDY 120 E+ E AA E + +++ + + + + G Q++ L Y Sbjct: 56 LREL-----IENAAAVERGWRDRMLADSLKLRDRHRDQLELGGGAETNLSPEQFHALLTY 110 Query: 121 LDALELVDTSSAPDIEWPTP 140 L AL +WP Sbjct: 111 LQALR----------DWPQS 120 >UniRef50_A4SL83 Phage tail fiber assembly protein n=1 Tax=Aeromonas salmonicida subsp. salmonicida A449 RepID=A4SL83_AERS4 Length = 141 Score = 49.7 bits (117), Expect = 3e-05, Method: Composition-based stats. Identities = 21/89 (23%), Positives = 31/89 (34%), Gaps = 8/89 (8%) Query: 55 GENGFPAWSEIPPPTHEEQIAAAELKKQQLINQANDYMNSKQWAGKAAIGRLKGEELAQY 114 GE G PPT +Q A + + QA M + A +G + E + Sbjct: 58 GEFGDIEVITPSPPTETQQQARLNEE----LKQAATAMAPLKDAD--TLGIISDAERQRL 111 Query: 115 NLWLDYLDALELVDTSS--APDIEWPTPP 141 W Y L + S ++ WP P Sbjct: 112 TAWQRYRVTLYRLPQSDGWPTEVNWPEMP 140 >UniRef50_B3I4G5 Tail fiber assembly protein n=7 Tax=Escherichia RepID=B3I4G5_ECOLX Length = 101 Score = 49.7 bits (117), Expect = 3e-05, Method: Composition-based stats. Identities = 15/48 (31%), Positives = 20/48 (41%), Gaps = 2/48 (4%) Query: 85 INQANDYMNSKQWAGKAAIGRLKGEELAQYNLWLDYLDALELVDTSSA 132 I + ++Y+ Q A I EE + W Y L VDTS A Sbjct: 32 IQEFSEYIAPLQDAVDLEIA--TEEERSLLEAWNKYRVLLNRVDTSVA 77 >UniRef50_B6Z9I1 Putative phage tail fiber assembly protein n=1 Tax=Kluyvera phage Kvp1 RepID=B6Z9I1_9CAUD Length = 174 Score = 49.3 bits (116), Expect = 4e-05, Method: Composition-based stats. Identities = 20/65 (30%), Positives = 29/65 (44%), Gaps = 4/65 (6%) Query: 78 ELKKQQLINQANDYMNSKQWAGKAAIGRLKGEELAQYNLWLDYLDALELVDTSSAPDIEW 137 E +K +LI QA+ ++ Q+A +G E W Y A+ VD S P +W Sbjct: 113 ENQKARLIAQASTKID--QYADFIELGEEGLE--GILKAWRKYRLAVFKVDLSVLPFYDW 168 Query: 138 PTPPA 142 P P Sbjct: 169 PEKPE 173 >UniRef50_Q9ZXK5 Orf21 n=2 Tax=root RepID=Q9ZXK5_9CAUD Length = 148 Score = 49.3 bits (116), Expect = 4e-05, Method: Composition-based stats. Identities = 28/142 (19%), Positives = 51/142 (35%), Gaps = 22/142 (15%) Query: 2 NYIYSATTNSFYPLEMKEDYTQAGS--WPDDAVEVDEQVYIEFSGLPPKGKIRIAGENGF 59 + Y T FY E+ + A S WP + ++ Y +G + +AGE+G Sbjct: 1 MFFYCPKTGGFYSPEVHGEQMPAESELWP-----LTDEEYEALLDAQGQGLLIVAGEDGQ 55 Query: 60 PAWSEIPPPTHEEQIAAAELK-KQQLINQANDYMNSKQWAGKAAIGRLKGEELAQYNLWL 118 P + PPP +E +A E + + ++ + + + + L E+ Sbjct: 56 PV-ATPPPPLGDEALATIERDWRDRQLDDTDALVARHRDELEVGTTTLSTEQYQALQA-- 112 Query: 119 DYLDALELVDTSSAPDIEWPTP 140 Y L +WP Sbjct: 113 -YRRQLR----------DWPES 123 >UniRef50_B1JB16 Putative uncharacterized protein n=1 Tax=Pseudomonas putida W619 RepID=B1JB16_PSEPW Length = 149 Score = 48.9 bits (115), Expect = 5e-05, Method: Composition-based stats. Identities = 34/138 (24%), Positives = 55/138 (39%), Gaps = 19/138 (13%) Query: 3 YIYSATTNSFYPLEMKEDYTQAGSWPDDAVEVDEQVYIEFSGLPPKGKIRIAGENGFPAW 62 + S +T FY + A S P DAVE+ E + G++ + G++ P Sbjct: 5 FFASKSTRGFYSSD------SASSIPVDAVEITEAYRNQLLEGERAGRVIVWGDS-EPFL 57 Query: 63 SEIPPPTHEEQIAAAELKKQQLINQANDYMNSKQWAGKAAIGRLKGEELAQYNLWLDYLD 122 + PPPT EE A E + + + A D + ++ + IG AQ++ L+Y Sbjct: 58 EDPPPPTGEEL-AVVERRWRDMQLLATDGIVARH-RDERDIGGPTTLNTAQFSELLEYRQ 115 Query: 123 ALELVDTSSAPDIEWPTP 140 L WP Sbjct: 116 DLR----------NWPQA 123 >UniRef50_Q1I679 Putative phage tail fiber assembly protein n=1 Tax=Pseudomonas entomophila L48 RepID=Q1I679_PSEE4 Length = 126 Score = 48.9 bits (115), Expect = 5e-05, Method: Composition-based stats. Identities = 17/68 (25%), Positives = 26/68 (38%), Gaps = 4/68 (5%) Query: 77 AELKKQQLINQANDYMNSKQWAGKAAIGRLKGEELAQYNLWLDYLDALELV--DTSSAPD 134 A+ + +L A+ + Q A E+A+ W Y AL + D Sbjct: 61 AQAEALRLRAIADTAIAPLQDAVDLD--EASEAEVARLKEWRRYRVALNRLPEQPGYPAD 118 Query: 135 IEWPTPPA 142 I+WP PA Sbjct: 119 IDWPLAPA 126 >UniRef50_B3RGH1 Putative tail fiber assembly protein n=1 Tax=Escherichia phage rv5 RepID=B3RGH1_9CAUD Length = 194 Score = 47.8 bits (112), Expect = 1e-04, Method: Composition-based stats. Identities = 16/70 (22%), Positives = 26/70 (37%), Gaps = 1/70 (1%) Query: 74 IAAAELKKQQLINQANDYMNSKQWAGKAAIGRLKGEELAQYNLWLDYLDALELVDTSSAP 133 + AE + + A + + Q L ++ Y L +DTS AP Sbjct: 126 VQQAEGVIETELAWATARIGAYQDMIDLEY-DLTDDQKRNIRDLKMYRVKLLEIDTSKAP 184 Query: 134 DIEWPTPPAV 143 DI +P P + Sbjct: 185 DIFFPERPTL 194 >UniRef50_C0DSG5 Putative uncharacterized protein n=1 Tax=Eikenella corrodens ATCC 23834 RepID=C0DSG5_EIKCO Length = 215 Score = 47.8 bits (112), Expect = 1e-04, Method: Composition-based stats. Identities = 16/67 (23%), Positives = 30/67 (44%), Gaps = 6/67 (8%) Query: 1 MNYIYSATTNSFYPLEMKEDYTQAGSWPDDAVEVDEQVYIEFSGLPPKGKIRIAGENGFP 60 M YS T +FY + P+DAVE+ + + +G++ + G++G P Sbjct: 1 MTIYYSKTNQAFYDSSI------HSRLPEDAVEISHEQHAALLAGQSQGQVIMPGKDGKP 54 Query: 61 AWSEIPP 67 + + P Sbjct: 55 VLAPLAP 61 >UniRef50_Q2S7G7 Putative uncharacterized protein n=1 Tax=Hahella chejuensis KCTC 2396 RepID=Q2S7G7_HAHCH Length = 198 Score = 47.8 bits (112), Expect = 1e-04, Method: Composition-based stats. Identities = 17/61 (27%), Positives = 24/61 (39%), Gaps = 4/61 (6%) Query: 1 MNYIYSATTNSFYPLEMKEDYTQAGSWPDDAVEVDEQVYIEFSGLPPKGKIRIAGENGFP 60 M+Y YS +T FY + AG P D V+V E+ +G + G P Sbjct: 1 MHYFYSPSTRGFYLESLH----AAGGLPLDGVKVTEEERQALLDGQAQGLTIEINDQGRP 56 Query: 61 A 61 Sbjct: 57 V 57 >UniRef50_B0VK51 Putative uncharacterized protein 51 n=1 Tax=Azospirillum phage Cd RepID=B0VK51_9CAUD Length = 228 Score = 47.4 bits (111), Expect = 2e-04, Method: Composition-based stats. Identities = 23/100 (23%), Positives = 41/100 (41%), Gaps = 11/100 (11%) Query: 4 IYSATTNSFYPLEMKEDYTQAGSWPDDAVEVDEQVYIEFSGLPPKGKIRIAGENGFPAWS 63 +Y+A+T FY + P DAVE+ ++ Y G+ + G++G P + Sbjct: 2 LYAASTGGFYDRA-----IHGDTVPADAVEITDEEYAALFDGQSLGQRIVPGQDGRPTFY 56 Query: 64 EIPPPTHEEQIA---AAELKKQQLINQANDYMNSKQWAGK 100 PT ++ A AA ++Q +N +W Sbjct: 57 T---PTLDDTKADRKAAATARRQTERDRGVVVNGNRWHSD 93 >UniRef50_B2TWU3 Tail fiber assembly protein n=20 Tax=root RepID=B2TWU3_SHIB3 Length = 181 Score = 47.4 bits (111), Expect = 2e-04, Method: Composition-based stats. Identities = 17/77 (22%), Positives = 32/77 (41%), Gaps = 8/77 (10%) Query: 72 EQIAAAELKKQQLINQANDYMNSKQWAGKAAIGRLKGEELAQYNLWLDYLDALELVDTSS 131 + AE ++ L+ Q + +W +G + E+ + + Y +L+ +D S+ Sbjct: 104 DYRLKAEDERDALLAQVSARTG--EWEEDLLLGLISDEDREKLKAYRIYAKSLQAMDFSA 161 Query: 132 APD------IEWPTPPA 142 D IEWP P Sbjct: 162 ITDKSSYNAIEWPVSPE 178 >UniRef50_C4F418 Putative uncharacterized protein n=3 Tax=Haemophilus influenzae RepID=C4F418_HAEIN Length = 208 Score = 47.0 bits (110), Expect = 2e-04, Method: Composition-based stats. Identities = 18/108 (16%), Positives = 35/108 (32%), Gaps = 13/108 (12%) Query: 2 NYIYSATTNSFYPLEMKEDYTQAGSWPDDAVEVDEQVYIEFSGLPPKGKIRIAGENGFPA 61 Y Y + F ++ + P AV++ +Y +GK IA + G P Sbjct: 1 MYYYDSANKCFLSDDI-------HNIPAHAVQITNDLYSTLLNGQTQGKQIIADKTGNPI 53 Query: 62 WSEIPPPTHEEQI------AAAELKKQQLINQANDYMNSKQWAGKAAI 103 + P + + K+ L+ + + + A I Sbjct: 54 LIDPQPSAAHQLNLDTLTWEISAEKQTALLAETQTRLVANIDKHAAKI 101 >UniRef50_Q3KH77 Hypothetical phage related protein n=1 Tax=Pseudomonas fluorescens Pf0-1 RepID=Q3KH77_PSEPF Length = 146 Score = 47.0 bits (110), Expect = 2e-04, Method: Composition-based stats. Identities = 18/77 (23%), Positives = 28/77 (36%), Gaps = 2/77 (2%) Query: 67 PPTHEEQIAAAELKKQQLINQANDYMNSKQWAGKAAIGRLKGEELAQYNLWLDYLDALEL 126 PTHEE +A + Q+ + A ++ K +G + A + Y A+ Sbjct: 69 EPTHEEHLAINAARMQERFDVAALWLTFNPLQYKLDLGVATPADEAALLAYKQYFVAVSE 128 Query: 127 VD--TSSAPDIEWPTPP 141 V I WP P Sbjct: 129 VKKQPGYPATINWPVAP 145 >UniRef50_B1JM08 Putative uncharacterized protein n=1 Tax=Yersinia pseudotuberculosis YPIII RepID=B1JM08_YERPY Length = 116 Score = 47.0 bits (110), Expect = 2e-04, Method: Composition-based stats. Identities = 22/85 (25%), Positives = 33/85 (38%), Gaps = 6/85 (7%) Query: 58 GFPAWSEIPPPTHEEQIAAAELKKQQLINQANDYMNSKQWAGKAAIGRLKGEELAQYNLW 117 G P+ + P T + K Q + +A + + Q A + EE+ + LW Sbjct: 34 GRPSEWQPAPMTSSDARDI----KAQALTEAYIQVTALQAAVSTQLA--TPEEITELVLW 87 Query: 118 LDYLDALELVDTSSAPDIEWPTPPA 142 YL + V S DI WP P Sbjct: 88 QTYLVLMNRVVPDSPLDIVWPKKPE 112 >UniRef50_Q3KH44 Putative phage tail assembly protein n=1 Tax=Pseudomonas fluorescens Pf0-1 RepID=Q3KH44_PSEPF Length = 146 Score = 46.6 bits (109), Expect = 2e-04, Method: Composition-based stats. Identities = 33/155 (21%), Positives = 53/155 (34%), Gaps = 33/155 (21%) Query: 1 MNYIYSATTNSF------YPLEMKEDYTQAGSWPDDAVEVDEQVYIEFSGLPPKGKIRIA 54 M + A T F YP P+ AVE+ Y E GK+ A Sbjct: 1 MAIYFHAQTRGFELVDSPYPEP-----------PEGAVEITRAQYAELFAGQASGKVISA 49 Query: 55 GENGFPAWSEIPPPTHEEQIAAAELKKQQLINQANDYMNSKQWAGKAAIGRLKGEELAQY 114 +G P ++ P + +A+ E + + Q ++ + A + +G ++ Sbjct: 50 SASGQPVLND--PVISPQALASRERAWRDNVLQDTQWLVWR-DAEELEVGEGTTLRTEEF 106 Query: 115 NLWLDYLDALELVDTSSAPDIEW---PTPPAVQAR 146 L Y AL +W P P QAR Sbjct: 107 KQLLAYRQALR----------DWPNDPEFPDAQAR 131 >UniRef50_Q31Z52 Putative tail fiber assembly protein n=1 Tax=Shigella boydii Sb227 RepID=Q31Z52_SHIBS Length = 123 Score = 46.6 bits (109), Expect = 3e-04, Method: Composition-based stats. Identities = 17/80 (21%), Positives = 31/80 (38%), Gaps = 8/80 (10%) Query: 72 EQIAAAELKKQQLINQANDYMNSKQWAGKAAIGRLKGEELAQYNLWLDYLDALELVDTSS 131 + AE ++ L+ Q + +W +G + E+ + Y +L+ +D S+ Sbjct: 46 DYRLKAEDERDALLAQVSARTG--EWEEDLLLGLISDEDKEKLKACRIYAKSLQAMDFST 103 Query: 132 APD------IEWPTPPAVQA 145 D I WP P A Sbjct: 104 ITDKATYNAINWPERPDAAA 123 >UniRef50_C8UR23 Putative uncharacterized protein n=1 Tax=Escherichia coli O111:H- str. 11128 RepID=C8UR23_ECO1A Length = 78 Score = 45.5 bits (106), Expect = 5e-04, Method: Composition-based stats. Identities = 18/70 (25%), Positives = 31/70 (44%), Gaps = 3/70 (4%) Query: 73 QIAAAELKKQQLINQANDYMNSKQWAGKAAIGRLKGEELAQYNLWLDYLDALELVDTSSA 132 Q AE +KQ L+ D ++ W + +G + + + W+ Y +E DTS Sbjct: 12 QRQQAEKEKQSLLQLVRDK--TQLWDSQLRLGIISVQGKQKLTEWILYAQKVESTDTSIL 69 Query: 133 PDIEWPTPPA 142 P + +P P Sbjct: 70 P-VTFPEKPE 78 >UniRef50_D1P3V4 Bacteriophage tail fiber assembly protein n=4 Tax=Providencia rustigianii DSM 4541 RepID=D1P3V4_9ENTR Length = 165 Score = 45.1 bits (105), Expect = 7e-04, Method: Composition-based stats. Identities = 15/62 (24%), Positives = 30/62 (48%), Gaps = 4/62 (6%) Query: 74 IAAAELKKQQLINQANDYMNSKQWAGKAAIGRLKGEELAQYNLWLDYLDALELVDTSSAP 133 + +LKK+QL+N+ + ++ Q + +G EE+ + +Y +L S+ Sbjct: 101 RSQIDLKKKQLMNEVSSLIDPLQDSF--DMGVATSEEIEKLIALKEYRISLNR--ASTLH 156 Query: 134 DI 135 DI Sbjct: 157 DI 158 >UniRef50_Q30VV8 Putative uncharacterized protein n=1 Tax=Desulfovibrio desulfuricans subsp. desulfuricans str. G20 RepID=Q30VV8_DESDG Length = 194 Score = 44.7 bits (104), Expect = 0.001, Method: Composition-based stats. Identities = 20/91 (21%), Positives = 34/91 (37%), Gaps = 5/91 (5%) Query: 5 YSATTNSFYPLEMKEDYTQAGSWPDDAVEVDEQVYIEFSGLPPKGKIRIAGENGFPAWSE 64 YS +TN+FY + P DAV V + + +G+I ENG P Sbjct: 3 YSPSTNAFYHPA-----VHGHAIPADAVAVSPEEHATLLAAQARGQIIRPDENGCPVAVT 57 Query: 65 IPPPTHEEQIAAAELKKQQLINQANDYMNSK 95 P + K+ ++ + A + + Sbjct: 58 PAAPPAPTRAELYTAKQTEIRDGAESMLTAL 88 >UniRef50_Q9KW02 Tail fiber assembly protein n=2 Tax=Pseudomonas aeruginosa RepID=Q9KW02_PSEAE Length = 146 Score = 44.3 bits (103), Expect = 0.001, Method: Composition-based stats. Identities = 24/145 (16%), Positives = 47/145 (32%), Gaps = 11/145 (7%) Query: 2 NYIYSATTNSFYPLEMKEDYTQAGSWP--DDAVEVDEQVYIEFSGLPPKGKIRIAGENGF 59 + A T FY E P D+ +++ Y +GK + G Sbjct: 1 MIFFHAATGGFYSKE-----IHGSRMPLEDEMHPLEDAEYQALLRAQSEGKRIVTDHTGR 55 Query: 60 PAWSEIPPPTHEEQIAAAELKKQQLINQANDYMNSKQWAGKAAIGRLKGEELAQYNLWLD 119 P + P P + + + + + + + + + + +G+ Q Sbjct: 56 PICVDPPAPAKDILVQRERIWRDRQLQLTDGPLARHRD--EQDLGKTTTLSQEQLRELTL 113 Query: 120 YLDALELVDTSSA-PDIEW-PTPPA 142 Y L ++ PD+ P PPA Sbjct: 114 YRAVLRDWPIAAEFPDLNARPEPPA 138 >UniRef50_C9R438 Putative tail fiber assembly protein n=3 Tax=root RepID=C9R438_AGGAD Length = 203 Score = 43.9 bits (102), Expect = 0.002, Method: Composition-based stats. Identities = 26/106 (24%), Positives = 34/106 (32%), Gaps = 13/106 (12%) Query: 1 MNYIYSATTNSFYPLEMKEDYTQAGSWPDDAVEVDEQVYIEFSGLPPKGKIRIAGENGFP 60 M Y + FY E G P AVE+ EQ Y GK IA G P Sbjct: 1 MAIYY---KDGFYNDE------NGGYVPQGAVEITEQTYRTLLEGQSAGKQIIADSEGKP 51 Query: 61 AWSEIPPPTHEEQI----AAAELKKQQLINQANDYMNSKQWAGKAA 102 E P E +E K L+ + + +K + Sbjct: 52 ILVEPQPSHLHEFKNGKWIISEKNKTALLLEQRKTICAKINQLRDE 97 >UniRef50_C5BH15 Putative Tail fiber assembly protein-like protein n=1 Tax=Edwardsiella ictaluri 93-146 RepID=C5BH15_EDWI9 Length = 206 Score = 43.5 bits (101), Expect = 0.002, Method: Composition-based stats. Identities = 13/90 (14%), Positives = 22/90 (24%), Gaps = 2/90 (2%) Query: 54 AGENGFPAWSEIPPPTHEEQIAAAELKKQQLINQANDYMNSKQWAGKAAIGRL-KGEELA 112 W + A KK L+ A + Sbjct: 118 YDVWNGEKWVTDTRQQQQHITANHLRKKNALLETATQRIEILMDKISLTATDTPTQTIQE 177 Query: 113 QYNLWLDYLDALELVDTSSAPDIEWPTPPA 142 + W Y ++ + + P I+WP P Sbjct: 178 RLLAWRKYRAQVDDISADT-PHIDWPAMPE 206 >UniRef50_UPI00016A4B8D hypothetical protein BthaT_33832 n=1 Tax=Burkholderia thailandensis TXDOH RepID=UPI00016A4B8D Length = 147 Score = 43.1 bits (100), Expect = 0.003, Method: Composition-based stats. Identities = 30/114 (26%), Positives = 46/114 (40%), Gaps = 7/114 (6%) Query: 32 VEVDEQVYIEFSGLPPKGKIRIAGENGFPAWSEIPPPTHEEQIAAAELKKQQLINQANDY 91 VE+ + + GK E+G P E T + A + + A D+ Sbjct: 32 VEITPRQHAMLLEGAAAGKTVAVTEDGHPILLEPEKQTRAQLADAKRAARDAAL-VATDW 90 Query: 92 MNSK-QWAGKAAIG-RLKGEELAQYNLWLDYLDALELV-DTSSAPDIEWPTPPA 142 + S+ Q G L +E AQ L Y AL + D + P+++ PTPPA Sbjct: 91 LTSRHQDEKALGDGTTLMPDEYAQL---LRYRQALRNLGDATGWPNVDLPTPPA 141 >UniRef50_O52622 Putative uncharacterized protein n=1 Tax=Salmonella enterica subsp. enterica serovar Typhimurium RepID=O52622_SALTY Length = 94 Score = 42.8 bits (99), Expect = 0.003, Method: Composition-based stats. Identities = 17/63 (26%), Positives = 28/63 (44%), Gaps = 5/63 (7%) Query: 2 NYIYSATTNSFYPLEMKEDYTQAGSWPDDAVEVDEQVYIEFSGLPPKGKIRIAGENGFPA 61 +Y YS F+ + D T++ ++PDD + + ++ Y E GK I G P Sbjct: 3 DYYYSFKEKGFF---WQPD-TESDNYPDDLIPLTDEYYRELMQGQVDGK-YIEHRKGGPV 57 Query: 62 WSE 64 E Sbjct: 58 LVE 60 >UniRef50_Q87Y71 Tail fiber assembly domain protein n=1 Tax=Pseudomonas syringae pv. tomato RepID=Q87Y71_PSESM Length = 92 Score = 42.8 bits (99), Expect = 0.004, Method: Composition-based stats. Identities = 14/73 (19%), Positives = 26/73 (35%), Gaps = 4/73 (5%) Query: 72 EQIAAAELKKQQLINQANDYMNSKQWAGKAAIGRLKGEELAQYNLWLDYLDALELVDTSS 131 +++A + +L A+ + Q A +E+A W + AL + Sbjct: 22 QRLADVVTEIARLRKIADYTIAPLQDAVDID--DATADEVASLKAWKQFRVALNRIPAQP 79 Query: 132 --APDIEWPTPPA 142 I+WP P Sbjct: 80 GYYEVIDWPVMPT 92 >UniRef50_Q126B3 Putative uncharacterized protein n=1 Tax=Polaromonas sp. JS666 RepID=Q126B3_POLSJ Length = 223 Score = 42.4 bits (98), Expect = 0.005, Method: Composition-based stats. Identities = 16/53 (30%), Positives = 23/53 (43%) Query: 27 WPDDAVEVDEQVYIEFSGLPPKGKIRIAGENGFPAWSEIPPPTHEEQIAAAEL 79 P DAVE+ ++ + +GK A NG P P PT + AA+ Sbjct: 55 IPPDAVEITDEQHAALLEGQTQGKRIEADANGAPVLITPPAPTLDALKEAAQA 107 >UniRef50_A5X9J5 Putative tail fiber assembly protein n=1 Tax=Aeromonas phage phiO18P RepID=A5X9J5_9CAUD Length = 141 Score = 41.6 bits (96), Expect = 0.008, Method: Composition-based stats. Identities = 16/78 (20%), Positives = 29/78 (37%), Gaps = 6/78 (7%) Query: 68 PTHEEQIAAAELKKQ--QLINQANDYMNSKQWAGKAAIGRLKGEELAQYNLWLDYLDALE 125 E + AE +++ L +A ++ Q A + + +EL + Y AL Sbjct: 65 AQPEYLPSEAEQQRKLDALQAEATLHIAPLQDAKELKLA--TPQELDKLEALQRYRIALM 122 Query: 126 LVDTSS--APDIEWPTPP 141 + S + WP P Sbjct: 123 RLPQSEGWPSSVTWPEMP 140 >UniRef50_B0USZ7 Putative uncharacterized protein n=5 Tax=Pasteurellaceae RepID=B0USZ7_HAES2 Length = 208 Score = 41.2 bits (95), Expect = 0.011, Method: Composition-based stats. Identities = 20/105 (19%), Positives = 40/105 (38%), Gaps = 17/105 (16%) Query: 1 MNYIYSATTNSFYPLEMKEDYTQAGSWPDDAVEVDEQVYIEFSGLPPKGKIRIAGENGFP 60 M + + FY + + P AVE+ E +Y +GK I ENG+P Sbjct: 1 MKIYF---KDGFYMSHI------HKNIPQGAVEISEDLYRSLLVGQSEGKQIITDENGYP 51 Query: 61 AWSEIPPPT-----HEEQIAAAELKK---QQLINQANDYMNSKQW 97 ++ P + + + E + Q+ + + +N+ + Sbjct: 52 QLADPQPSPFHHIEKGQWVISPENQTAHLTQVRAEMREKINALRD 96 >UniRef50_A9DEM0 Tail fiber assembly protein n=1 Tax=Yersinia phage PY100 RepID=A9DEM0_9CAUD Length = 143 Score = 39.3 bits (90), Expect = 0.039, Method: Composition-based stats. Identities = 32/115 (27%), Positives = 51/115 (44%), Gaps = 15/115 (13%) Query: 39 YIEFSGLPPKGKIRIAGENGFPAWSEIPPPTH-EEQIAAAELKKQQLINQANDYMNSK-Q 96 +IE P+ AGENG W P ++++ AA+ +K +L+ +AN +N Q Sbjct: 32 WIEMHAPKPEAGDWYAGENG--EWMYGEAPEEIKDRVMAAKSQKMRLLTEANAMINMIEQ 89 Query: 97 WAGKAAIGRLKG------EELAQYNL----WLDYLDALELVDTSSAPDIEWPTPP 141 A + +K +E+ N W +Y L VD A +I+WP P Sbjct: 90 EAIETPELYVKQVYNPYTDEIDNVNEELEKWHNYRKELIRVDVE-AKEIKWPELP 143 >UniRef50_Q7P173 Probable tail fiber assembly protein n=1 Tax=Chromobacterium violaceum RepID=Q7P173_CHRVO Length = 192 Score = 38.9 bits (89), Expect = 0.045, Method: Composition-based stats. Identities = 22/93 (23%), Positives = 36/93 (38%), Gaps = 11/93 (11%) Query: 59 FPAWSEIPPPTHEEQIAAAELKK-----QQLINQANDYMNSKQWAGKAAIGRLKGEELAQ 113 FP W ++ +AA +K +Q + A + A +IG +EL + Sbjct: 102 FPTWDGKGWSINKTAQSAALAQKTAAELKQRLADAYAARRPLEDAE--SIGIATADELQK 159 Query: 114 YNLWLDYLDALELV-DTSSAPDI---EWPTPPA 142 W Y L + D + P + +WP PA Sbjct: 160 LAAWKRYCVDLSRLPDLAMWPRLVGADWPKQPA 192 >UniRef50_Q72D32 Tail fiber assembly protein, putative n=5 Tax=Desulfovibrio vulgaris RepID=Q72D32_DESVH Length = 148 Score = 38.5 bits (88), Expect = 0.060, Method: Composition-based stats. Identities = 13/61 (21%), Positives = 24/61 (39%), Gaps = 6/61 (9%) Query: 1 MNYIYSATTNSFYPLEMKEDYTQAGSWPDDAVEVDEQVYIEFSGLPPKGKIRIAGENGFP 60 M +S +T FY D + P DA+ + + +++ + +G I G P Sbjct: 1 MQRYHSPSTGGFY-----LDGVHSD-IPADAIPITDSEHVDLTDALAQGCIIKMDAEGRP 54 Query: 61 A 61 Sbjct: 55 C 55 Searching..................................................done Results from round 3 Score E Sequences producing significant alignments: (bits) Value Sequences used in model and found again: UniRef50_P77656 Uncharacterized protein yfdK n=23 Tax=Enterobact... 171 6e-42 UniRef50_C7BRV4 Phage tail fiber assembly protein n=5 Tax=Photor... 157 1e-37 UniRef50_C7BRK4 Tail fiber assembly protein n=7 Tax=Enterobacter... 151 5e-36 UniRef50_C7BRK2 Similar to probable tail fiber assembly protein ... 151 8e-36 UniRef50_UPI0001826513 hypothetical protein EcanA3_06430 n=1 Tax... 148 5e-35 UniRef50_B3XF69 Phage tail assembly chaperone gp38 n=2 Tax=Esche... 146 2e-34 UniRef50_Q2NRZ4 Putative uncharacterized protein n=1 Tax=Sodalis... 145 5e-34 UniRef50_C4V089 Bacteriophage tail fiber assembly protein n=1 Ta... 143 1e-33 UniRef50_Q8KT34 Tail fiber assembly protein n=31 Tax=Enterobacte... 142 3e-33 UniRef50_Q66W74 Putative phage tail fiber assembly protein n=1 T... 140 1e-32 UniRef50_A4W7Q4 Phage tail assembly chaperone gp38 n=1 Tax=Enter... 140 1e-32 UniRef50_Q2NQF0 Hypothetical phage protein n=1 Tax=Sodalis gloss... 140 2e-32 UniRef50_O22005 Probable tail fiber assembly protein n=4 Tax=roo... 140 2e-32 UniRef50_Q9T1R1 Probable tail fiber assembly protein n=8 Tax=roo... 137 1e-31 UniRef50_Q83M74 Putative phage tail fibre protein n=1 Tax=Shigel... 135 4e-31 UniRef50_B1JPG7 Tail assembly chaperone gp38 n=1 Tax=Yersinia ps... 135 4e-31 UniRef50_A4W725 Phage tail assembly chaperone gp38 n=1 Tax=Enter... 134 1e-30 UniRef50_D2TR89 Putative phage tail fibre assembly protein n=1 T... 132 4e-30 UniRef50_C4UND0 Tail assembly chaperone gp38 n=2 Tax=Yersinia ru... 129 2e-29 UniRef50_C4SF02 Tail assembly chaperone gp38 n=1 Tax=Yersinia mo... 127 1e-28 UniRef50_B7UGJ5 Predicted tail fiber assembly protein n=1 Tax=Es... 125 4e-28 UniRef50_P09154 Uncharacterized protein ymfS n=2 Tax=Escherichia... 124 7e-28 UniRef50_Q8FJG5 Putative uncharacterized protein yfdK n=1 Tax=Es... 121 1e-26 UniRef50_B1JGV6 Tail assembly chaperone gp38 n=2 Tax=Yersinia ps... 118 5e-26 UniRef50_B1JS57 Tail assembly chaperone gp38 n=3 Tax=Yersinia Re... 118 5e-26 UniRef50_D2TYN1 Phage tail assembly protein n=1 Tax=Arsenophonus... 117 9e-26 UniRef50_B6VLW5 Tail fiber assembly protein homolog from lambdoi... 114 7e-25 UniRef50_UPI0001C3422A phage tail fiber assembly protein n=1 Tax... 114 1e-24 UniRef50_Q2NV40 Hypothetical phage protein n=2 Tax=Enterobacteri... 112 5e-24 UniRef50_B2K2J2 Putative uncharacterized protein n=1 Tax=Yersini... 111 1e-23 UniRef50_B4TRG5 Fels-2 prophage Tfa n=22 Tax=Enterobacteriaceae ... 108 5e-23 UniRef50_A7ZYE1 Putative tail fiber assembly protein n=1 Tax=Esc... 105 6e-22 UniRef50_Q2NWF3 Putative uncharacterized protein n=1 Tax=Sodalis... 103 2e-21 UniRef50_B7M8C3 Putative phage tail fiber assembly protein n=1 T... 102 3e-21 UniRef50_P03740 Tail fiber assembly protein n=117 Tax=root RepID... 100 2e-20 UniRef50_A5A5F3 Tail fiber assembly n=4 Tax=Pseudomonas aerugino... 98 6e-20 UniRef50_C9XYE1 Tail fiber assembly protein homolog from lambdoi... 98 1e-19 UniRef50_B4TAQ6 Tail fiber assembly protein n=8 Tax=Salmonella e... 97 1e-19 UniRef50_D0FT42 Phage tail assembly chaperone n=2 Tax=Erwinia py... 95 7e-19 UniRef50_B7NA68 Putative uncharacterized protein n=1 Tax=Escheri... 94 9e-19 UniRef50_B7NSC3 Putative tail fiber assembly protein (Possibly p... 94 1e-18 UniRef50_B2Q1Q3 Putative uncharacterized protein n=1 Tax=Provide... 94 1e-18 UniRef50_O68721 Lambda tail fiber assembly protein G n=32 Tax=En... 93 2e-18 UniRef50_D0ZBI3 Phage tail assembly chaperone gp38 n=1 Tax=Edwar... 93 3e-18 UniRef50_C6CP80 Tail assembly chaperone gp38 n=2 Tax=Dickeya Rep... 93 4e-18 UniRef50_A4WEL2 Phage tail assembly chaperone gp38 n=1 Tax=Enter... 92 6e-18 UniRef50_B2VJG1 Tail fiber assembly protein n=1 Tax=Erwinia tasm... 91 9e-18 UniRef50_A4JWL9 Putative uncharacterized protein n=1 Tax=Burkhol... 91 1e-17 UniRef50_C4SNQ0 Putative uncharacterized protein n=4 Tax=Yersini... 91 1e-17 UniRef50_C4TT86 Conserved hypothetical phage tail fiber protein ... 90 2e-17 UniRef50_B3G0V8 Putative uncharacterized protein n=1 Tax=Pseudom... 90 2e-17 UniRef50_B2K1B5 Putative uncharacterized protein n=6 Tax=Yersini... 89 4e-17 UniRef50_C7BTY5 Hypothetical phage protein n=1 Tax=Photorhabdus ... 89 5e-17 UniRef50_B6XAD7 Putative uncharacterized protein n=2 Tax=Provide... 89 5e-17 UniRef50_Q47427 Tail fiber assembly protein homolog n=47 Tax=roo... 89 6e-17 UniRef50_B4T1N8 Tail assembly chaperone gp38 n=3 Tax=Salmonella ... 88 7e-17 UniRef50_P40784 Tail fiber assembly protein homolog from lambdoi... 88 7e-17 UniRef50_D2TR90 Putative phage tail fibre assembly protein n=1 T... 88 1e-16 UniRef50_A8GA30 Tail assembly chaperone gp38 n=2 Tax=Enterobacte... 88 1e-16 UniRef50_Q1I688 Putative phage protein n=1 Tax=Pseudomonas entom... 87 2e-16 UniRef50_A8GLQ6 Tail assembly chaperone gp38 n=2 Tax=root RepID=... 87 2e-16 UniRef50_UPI000197C594 tail assembly chaperone gp38 n=1 Tax=Prov... 87 2e-16 UniRef50_D2U2G7 Phage tail fiber assembly protein n=1 Tax=Arseno... 87 2e-16 UniRef50_A7MLN9 Putative uncharacterized protein n=1 Tax=Cronoba... 86 4e-16 UniRef50_Q9KW02 Tail fiber assembly protein n=2 Tax=Pseudomonas ... 85 5e-16 UniRef50_C6CP83 Tail assembly chaperone gp38 n=1 Tax=Dickeya zea... 85 7e-16 UniRef50_A4TJD5 Phage tail fiber assembly protein n=22 Tax=Yersi... 85 8e-16 UniRef50_C7BSP4 Putative tail fiber protein of prophage cp-933x ... 84 1e-15 UniRef50_B5TK82 Conserved hypothetical phage protein n=1 Tax=Pse... 84 2e-15 UniRef50_C6C6Z1 Tail assembly chaperone gp38 n=3 Tax=Dickeya Rep... 83 2e-15 UniRef50_Q7N1H8 Similarities with lambda tail fiber assembly pro... 82 6e-15 UniRef50_Q3ZL13 Tail fiber assembly protein n=1 Tax=Escherichia ... 81 8e-15 UniRef50_Q7N2R7 Similar to phage tail fiber assembly protein n=1... 81 1e-14 UniRef50_C6C5D3 Tail assembly chaperone gp38 n=1 Tax=Dickeya dad... 81 1e-14 UniRef50_C6CP85 Putative phage tail fibre protein n=2 Tax=Dickey... 80 2e-14 UniRef50_B7UG05 Predicted tail fiber assembly protein n=1 Tax=Es... 80 2e-14 UniRef50_C5AKX8 Putative uncharacterized protein n=1 Tax=Burkhol... 80 3e-14 UniRef50_Q31HT5 Putative uncharacterized protein n=1 Tax=Thiomic... 79 3e-14 UniRef50_C4KQ09 Putative uncharacterized protein n=9 Tax=Burkhol... 79 3e-14 UniRef50_A9I964 Putative phage tail fibre protein n=1 Tax=Bordet... 79 5e-14 UniRef50_C4UND1 Conserved hypothetical phage tail fiber protein ... 78 1e-13 UniRef50_A1JMQ0 Phage tail fiber assembly protein n=5 Tax=Yersin... 78 1e-13 UniRef50_B1JB16 Putative uncharacterized protein n=1 Tax=Pseudom... 78 1e-13 UniRef50_D1UEY6 Putative uncharacterized protein n=2 Tax=Burkhol... 76 2e-13 UniRef50_Q9ZXK5 Orf21 n=2 Tax=root RepID=Q9ZXK5_9CAUD 76 3e-13 UniRef50_Q7P0Y3 Probable tail fiber assembly protein n=1 Tax=Chr... 75 7e-13 UniRef50_Q3KH46 Putative phage related protein n=2 Tax=root RepI... 74 1e-12 UniRef50_P77326 Putative tail fiber assembly protein homolog fro... 74 1e-12 UniRef50_C1D954 HsdM n=1 Tax=Laribacter hongkongensis HLHK9 RepI... 74 1e-12 UniRef50_Q3KH44 Putative phage tail assembly protein n=1 Tax=Pse... 73 2e-12 UniRef50_Q4ZMK8 Putative uncharacterized protein n=4 Tax=Pseudom... 73 2e-12 UniRef50_D0KLI7 Tail assembly chaperone gp38 n=2 Tax=Enterobacte... 73 4e-12 UniRef50_Q9B026 Probable tail fiber assembly protein n=1 Tax=Pha... 73 4e-12 UniRef50_Q7Y3Y8 Tail fiber assembly protein n=4 Tax=root RepID=Q... 72 5e-12 UniRef50_Q7NAA2 Complete genome; segment 1/17 n=1 Tax=Photorhabd... 72 7e-12 UniRef50_D1P8C4 Tail fiber assembly protein n=1 Tax=Providencia ... 71 1e-11 UniRef50_C6DE09 Tail assembly chaperone gp38 n=5 Tax=Enterobacte... 71 2e-11 UniRef50_C4K4X4 Phage tail assembly chaperone n=2 Tax=Candidatus... 71 2e-11 UniRef50_B4TI69 Putative phage tail fiber assembly protein n=13 ... 70 2e-11 UniRef50_A9DEL3 Tail fiber related protein n=1 Tax=Yersinia phag... 70 2e-11 UniRef50_P26699 Probable tail fiber assembly protein n=56 Tax=ro... 69 3e-11 UniRef50_C5BH15 Putative Tail fiber assembly protein-like protei... 69 3e-11 UniRef50_B4TML3 Caudovirales tail fibre assembly protein n=16 Ta... 68 7e-11 UniRef50_B4T2D9 Gp20 n=17 Tax=root RepID=B4T2D9_SALNS 68 1e-10 UniRef50_Q32IA2 Hypothetical prophage protein n=1 Tax=Shigella d... 66 3e-10 UniRef50_B3HH41 Tail assembly chaperone gp38 n=3 Tax=Enterobacte... 66 4e-10 UniRef50_C9R438 Putative tail fiber assembly protein n=3 Tax=roo... 65 6e-10 UniRef50_B0VK51 Putative uncharacterized protein 51 n=1 Tax=Azos... 65 6e-10 UniRef50_B1JPI1 Tail assembly chaperone gp38 n=3 Tax=Yersinia ps... 64 9e-10 UniRef50_B5S309 Tail fiber assembly protein homolog n=2 Tax=Rals... 64 2e-09 UniRef50_B3RGH1 Putative tail fiber assembly protein n=1 Tax=Esc... 64 2e-09 UniRef50_B4T266 Caudovirales tail fibre assembly protein n=16 Ta... 61 8e-09 UniRef50_A7ZL71 Putative uncharacterized protein n=1 Tax=Escheri... 61 9e-09 UniRef50_C4F418 Putative uncharacterized protein n=3 Tax=Haemoph... 61 1e-08 UniRef50_Q31Z52 Putative tail fiber assembly protein n=1 Tax=Shi... 61 2e-08 UniRef50_C4U6G1 Phage tail assembly chaperone gp38 n=2 Tax=Enter... 61 2e-08 UniRef50_Q3KH77 Hypothetical phage related protein n=1 Tax=Pseud... 60 2e-08 UniRef50_Q849T8 Eag0005 n=3 Tax=Haemophilus influenzae RepID=Q84... 60 2e-08 UniRef50_B2TWU3 Tail fiber assembly protein n=20 Tax=root RepID=... 60 3e-08 UniRef50_A4SL83 Phage tail fiber assembly protein n=1 Tax=Aeromo... 59 4e-08 UniRef50_B6Z9I1 Putative phage tail fiber assembly protein n=1 T... 59 5e-08 UniRef50_C8UR23 Putative uncharacterized protein n=1 Tax=Escheri... 58 9e-08 UniRef50_B1JM08 Putative uncharacterized protein n=1 Tax=Yersini... 57 1e-07 UniRef50_Q1I679 Putative phage tail fiber assembly protein n=1 T... 56 4e-07 UniRef50_C0DSG5 Putative uncharacterized protein n=1 Tax=Eikenel... 55 7e-07 UniRef50_Q2S7G7 Putative uncharacterized protein n=1 Tax=Hahella... 54 2e-06 UniRef50_Q30VV8 Putative uncharacterized protein n=1 Tax=Desulfo... 54 2e-06 UniRef50_B3I4G5 Tail fiber assembly protein n=7 Tax=Escherichia ... 53 3e-06 UniRef50_D1P3V4 Bacteriophage tail fiber assembly protein n=4 Ta... 52 6e-06 Sequences not found previously or not previously below threshold: UniRef50_UPI00016A4B8D hypothetical protein BthaT_33832 n=1 Tax=... 68 1e-10 UniRef50_Q48C55 Prophage PSPPH06, putative tail fiber protein n=... 63 3e-09 UniRef50_Q126B3 Putative uncharacterized protein n=1 Tax=Polarom... 55 6e-07 UniRef50_O52622 Putative uncharacterized protein n=1 Tax=Salmone... 55 6e-07 UniRef50_A5X9J5 Putative tail fiber assembly protein n=1 Tax=Aer... 55 8e-07 UniRef50_Q7P173 Probable tail fiber assembly protein n=1 Tax=Chr... 52 5e-06 UniRef50_B0USZ7 Putative uncharacterized protein n=5 Tax=Pasteur... 51 1e-05 UniRef50_Q87Y71 Tail fiber assembly domain protein n=1 Tax=Pseud... 50 2e-05 UniRef50_Q0BEK6 Bacteriophage-acquired protein n=3 Tax=Burkholde... 49 4e-05 UniRef50_Q72D32 Tail fiber assembly protein, putative n=5 Tax=De... 48 8e-05 UniRef50_B3R3K0 Phage tail fiber assembly protein n=1 Tax=Cupria... 44 0.001 UniRef50_A3YA18 Putative uncharacterized protein n=1 Tax=Marinom... 41 0.008 UniRef50_A7FIT9 Putative uncharacterized protein n=5 Tax=Yersini... 41 0.012 UniRef50_A9DEM0 Tail fiber assembly protein n=1 Tax=Yersinia pha... 40 0.025 UniRef50_A8T0S3 Putative uncharacterized protein n=1 Tax=Vibrio ... 39 0.034 UniRef50_Q1I690 Putative phage protein n=1 Tax=Pseudomonas entom... 39 0.045 UniRef50_A5FI80 Putative uncharacterized protein n=1 Tax=Flavoba... 39 0.049 UniRef50_A4JDE2 Putative uncharacterized protein n=2 Tax=Burkhol... 39 0.051 UniRef50_C5A8Q2 Bacteriophage-acquired protein n=2 Tax=Burkholde... 39 0.059 UniRef50_A4JWI7 Putative uncharacterized protein n=4 Tax=Burkhol... 38 0.092 UniRef50_A1TPR3 Putative uncharacterized protein n=1 Tax=Acidovo... 38 0.099 >UniRef50_P77656 Uncharacterized protein yfdK n=23 Tax=Enterobacteriaceae RepID=YFDK_ECOLI Length = 146 Score = 171 bits (433), Expect = 6e-42, Method: Composition-based stats. Identities = 146/146 (100%), Positives = 146/146 (100%) Query: 1 MNYIYSATTNSFYPLEMKEDYTQAGSWPDDAVEVDEQVYIEFSGLPPKGKIRIAGENGFP 60 MNYIYSATTNSFYPLEMKEDYTQAGSWPDDAVEVDEQVYIEFSGLPPKGKIRIAGENGFP Sbjct: 1 MNYIYSATTNSFYPLEMKEDYTQAGSWPDDAVEVDEQVYIEFSGLPPKGKIRIAGENGFP 60 Query: 61 AWSEIPPPTHEEQIAAAELKKQQLINQANDYMNSKQWAGKAAIGRLKGEELAQYNLWLDY 120 AWSEIPPPTHEEQIAAAELKKQQLINQANDYMNSKQWAGKAAIGRLKGEELAQYNLWLDY Sbjct: 61 AWSEIPPPTHEEQIAAAELKKQQLINQANDYMNSKQWAGKAAIGRLKGEELAQYNLWLDY 120 Query: 121 LDALELVDTSSAPDIEWPTPPAVQAR 146 LDALELVDTSSAPDIEWPTPPAVQAR Sbjct: 121 LDALELVDTSSAPDIEWPTPPAVQAR 146 >UniRef50_C7BRV4 Phage tail fiber assembly protein n=5 Tax=Photorhabdus RepID=C7BRV4_PHOAA Length = 141 Score = 157 bits (396), Expect = 1e-37, Method: Composition-based stats. Identities = 74/141 (52%), Positives = 95/141 (67%), Gaps = 3/141 (2%) Query: 2 NYIYSATTNSFYPLEMKEDYTQAGSWPDDAVEVDEQVYIEFSGL-PPKGKIRIAGENGFP 60 Y YSATTN+FYP+E K+DY AGS+P+DAVEVD+ V+IEF+G PPKGK RIAG+NG P Sbjct: 1 MYYYSATTNAFYPVEWKQDYINAGSFPNDAVEVDKSVFIEFAGSIPPKGKYRIAGKNGLP 60 Query: 61 AWSEIPPPTHEEQIAAAELKKQQLINQANDYMNSKQWAGKAAIGRLKGEELAQYNLWLDY 120 W++IPPPT EE I+ AE +K Q I+ AN+ + A + I EE+ W Y Sbjct: 61 EWADIPPPTKEELISIAESQKAQFISLANEKIMPLSDAEELDI--ATDEEMLLLKEWKKY 118 Query: 121 LDALELVDTSSAPDIEWPTPP 141 L VDTS+AP+I+WP P Sbjct: 119 RVMLNRVDTSNAPEIDWPITP 139 >UniRef50_C7BRK4 Tail fiber assembly protein n=7 Tax=Enterobacteriaceae RepID=C7BRK4_PHOAA Length = 144 Score = 151 bits (382), Expect = 5e-36, Method: Composition-based stats. Identities = 59/140 (42%), Positives = 87/140 (62%), Gaps = 2/140 (1%) Query: 2 NYIYSATTNSFYPLEMKEDYTQAGSWPDDAVEVDEQVYIEFSGLPPKGKIRIAGENGFPA 61 NY++SA +FYPL +++DY +A SWP+D + V + ++ +FSG+PP GKI +GE+G P Sbjct: 5 NYVFSALNKAFYPLSLQQDYIEADSWPNDPISVTDDIFYKFSGMPPIGKILSSGEDGLPC 64 Query: 62 WSEIPPPTHEEQIAAAELKKQQLINQANDYMNSKQWAGKAAIGRLKGEELAQYNLWLDYL 121 W +IPPPT EE I+ AE ++ + I AN+ + A + I EEL W Y Sbjct: 65 WEDIPPPTKEELISIAEAQRSKFIFLANEKITPLADAVELDI--ATNEELLSLKAWKKYR 122 Query: 122 DALELVDTSSAPDIEWPTPP 141 L +DTS+AP+I+WP P Sbjct: 123 VMLNRIDTSTAPEIDWPIAP 142 >UniRef50_C7BRK2 Similar to probable tail fiber assembly protein n=1 Tax=Photorhabdus asymbiotica subsp. asymbiotica ATCC 43949 RepID=C7BRK2_PHOAA Length = 144 Score = 151 bits (380), Expect = 8e-36, Method: Composition-based stats. Identities = 61/140 (43%), Positives = 85/140 (60%), Gaps = 2/140 (1%) Query: 2 NYIYSATTNSFYPLEMKEDYTQAGSWPDDAVEVDEQVYIEFSGLPPKGKIRIAGENGFPA 61 NY++SA +FYP+ +++DY AGSWP+D + V + ++ EFSG+PP GKI +GE+ P Sbjct: 5 NYVFSALNKAFYPISLQQDYIAAGSWPNDPLPVTDDIFNEFSGIPPAGKILSSGEDALPC 64 Query: 62 WSEIPPPTHEEQIAAAELKKQQLINQANDYMNSKQWAGKAAIGRLKGEELAQYNLWLDYL 121 W +IPPPT EE I AE +K Q I+ AN+ + A + I EE+ W Y Sbjct: 65 WEDIPPPTKEELIYIAENQKAQFISLANEKITPLSDAEELDI--ATDEEMLLLKEWKKYR 122 Query: 122 DALELVDTSSAPDIEWPTPP 141 L VDTS+AP I+WP P Sbjct: 123 VMLNRVDTSNAPKIDWPITP 142 >UniRef50_UPI0001826513 hypothetical protein EcanA3_06430 n=1 Tax=Enterobacter cancerogenus ATCC 35316 RepID=UPI0001826513 Length = 148 Score = 148 bits (374), Expect = 5e-35, Method: Composition-based stats. Identities = 54/141 (38%), Positives = 72/141 (51%), Gaps = 2/141 (1%) Query: 2 NYIYSATTNSFYPLEMKEDYTQAGSWPDDAVEVDEQVYIEFSGLPPKGKIRIAGENGFPA 61 Y YSA TN+FY + DY QAGS PDD E+ Q Y GK+ E+G P Sbjct: 10 TYFYSAETNAFYVSALMSDYDQAGSLPDDISEISNQWYEYLISGQATGKVITPDEHGKPV 69 Query: 62 WSEIPPPTHEEQIAAAELKKQQLINQANDYMNSKQWAGKAAIGRLKGEELAQYNLWLDYL 121 SE PPT +E AE +K +LI +A + + Q A + +G +EL+ + Y Sbjct: 70 LSEPEPPTPQELREIAEGEKSRLIREAGEAIAVLQDADE--LGMATDDELSALSRLKRYR 127 Query: 122 DALELVDTSSAPDIEWPTPPA 142 L +D S+APDIEWP P Sbjct: 128 VILNRLDISTAPDIEWPEKPD 148 >UniRef50_B3XF69 Phage tail assembly chaperone gp38 n=2 Tax=Escherichia coli 101-1 RepID=B3XF69_ECOLX Length = 142 Score = 146 bits (367), Expect = 2e-34, Method: Composition-based stats. Identities = 65/146 (44%), Positives = 87/146 (59%), Gaps = 5/146 (3%) Query: 2 NYIYSATTNSFYPLEMKEDYTQAGSWPDDAVEVDEQVYIE-FSGLPPKGKIRIAGENGFP 60 Y++SA N F+P+ KE + +G WPDD V V E+ + + F +PP+ +I NG P Sbjct: 1 MYVWSAKANGFFPISEKEKFEASGLWPDDGVIVSEEEHKKLFMDIPPRKQIGT--LNGKP 58 Query: 61 AWSEIPPPTHEEQIAAAELKKQQLINQANDYMNSKQWAGKAAIGRLKGEELAQYNLWLDY 120 A +IP PT +E IA AE+KK QL +A+ ++ +Q A A I EE + W Y Sbjct: 59 ALIDIPQPTKKELIAIAEVKKSQLREKADSEISWRQDAVDADI--ATDEETSTLTEWKKY 116 Query: 121 LDALELVDTSSAPDIEWPTPPAVQAR 146 L VDTS+APDIEWPTPPAV AR Sbjct: 117 RVLLMRVDTSTAPDIEWPTPPAVHAR 142 >UniRef50_Q2NRZ4 Putative uncharacterized protein n=1 Tax=Sodalis glossinidius str. 'morsitans' RepID=Q2NRZ4_SODGM Length = 142 Score = 145 bits (365), Expect = 5e-34, Method: Composition-based stats. Identities = 50/141 (35%), Positives = 67/141 (47%), Gaps = 3/141 (2%) Query: 3 YIYSATTNSFYPLEMKEDYTQAGS-WPDDAVEVDEQVYIEFSGLPPKGKIRIAGENGFPA 61 Y YSATTN FY + MK Y + + WP+DAV V ++Y K KI A +NG P Sbjct: 4 YFYSATTNGFYHISMKSIYEDSDNGWPEDAVPVSNELYQALLEGQSKNKIIKANKNGMPV 63 Query: 62 WSEIPPPTHEEQIAAAELKKQQLINQANDYMNSKQWAGKAAIGRLKGEELAQYNLWLDYL 121 + P PT E+ ++ A+ KK L+ QA + Q A + E +W Y Sbjct: 64 LGDRPAPTEEQNLSMAQSKKSMLLEQATGKIIPLQDAVD--LNMATQVEETTLLMWKKYR 121 Query: 122 DALELVDTSSAPDIEWPTPPA 142 L +D S A DI WP P Sbjct: 122 VMLTRLDVSKATDIAWPQCPE 142 >UniRef50_C4V089 Bacteriophage tail fiber assembly protein n=1 Tax=Yersinia rohdei ATCC 43380 RepID=C4V089_YERRO Length = 139 Score = 143 bits (361), Expect = 1e-33, Method: Composition-based stats. Identities = 60/142 (42%), Positives = 79/142 (55%), Gaps = 3/142 (2%) Query: 1 MNYIYSATTNSFYPLEMKEDYTQAGSWPDDAVEVDEQVYIEFSGLPPKGKIRIAGENGFP 60 M YSA NSFYP E++E Y AGSWPDDAVEV +++ EF P GK R A +G P Sbjct: 1 MKVFYSAIDNSFYPDELREQYVTAGSWPDDAVEVSNKLFQEFI-TAPAGKERKADTDGMP 59 Query: 61 AWSEIPPPTHEEQIAAAELKKQQLINQANDYMNSKQWAGKAAIGRLKGEELAQYNLWLDY 120 W ++ PP+ E IA AE KK +L++QAN + Q A + +E+ W Y Sbjct: 60 RWVDVQPPSEIELIAQAEYKKTELMSQANSEIAPLQDAVDLNMANA--DEVTALQTWKKY 117 Query: 121 LDALELVDTSSAPDIEWPTPPA 142 L VD +AP+I+WP P Sbjct: 118 RVLLNRVDIDAAPEIDWPVAPE 139 >UniRef50_Q8KT34 Tail fiber assembly protein n=31 Tax=Enterobacteriaceae RepID=Q8KT34_ECOLX Length = 145 Score = 142 bits (358), Expect = 3e-33, Method: Composition-based stats. Identities = 96/143 (67%), Positives = 109/143 (76%), Gaps = 1/143 (0%) Query: 4 IYSATTNSFYPLEMKEDYTQAGSWPDDAVEVDEQVYIEF-SGLPPKGKIRIAGENGFPAW 62 YS + N F +K+DY A SWPDDA+ V + VY EF PP KIR+AG+NG P W Sbjct: 2 FYSPSLNIFVNPALKDDYINANSWPDDALAVSDDVYNEFAINTPPYDKIRVAGKNGLPTW 61 Query: 63 SEIPPPTHEEQIAAAELKKQQLINQANDYMNSKQWAGKAAIGRLKGEELAQYNLWLDYLD 122 + IPPP+HEE I AE ++Q L+NQAN+YMNSKQW GKAAIGRLK EELA YNLWLDYLD Sbjct: 62 ALIPPPSHEELIQQAESERQLLLNQANEYMNSKQWPGKAAIGRLKDEELALYNLWLDYLD 121 Query: 123 ALELVDTSSAPDIEWPTPPAVQA 145 ALELVDTSSAPDIEWPTPP QA Sbjct: 122 ALELVDTSSAPDIEWPTPPVTQA 144 >UniRef50_Q66W74 Putative phage tail fiber assembly protein n=1 Tax=Klebsiella pneumoniae RepID=Q66W74_KLEPN Length = 142 Score = 140 bits (353), Expect = 1e-32, Method: Composition-based stats. Identities = 72/143 (50%), Positives = 97/143 (67%), Gaps = 1/143 (0%) Query: 1 MNYIYSATTNSFYPLEMKEDYTQAGSWPDDAVEVDEQVYIEFSGLPPKGKIRIAGENGFP 60 M I+S N F+P+ +K DY AGSWP D ++V VY+EF+ PP+GK+R +N P Sbjct: 1 MEIIFSPGQNKFFPVPLKTDYENAGSWPTDGIDVGYDVYLEFTANPPEGKVRGVVDN-MP 59 Query: 61 AWSEIPPPTHEEQIAAAELKKQQLINQANDYMNSKQWAGKAAIGRLKGEELAQYNLWLDY 120 AW + PT E+ + A +K+++LI +AN+Y+ KQWAGKA++GRL +E AQYN WLDY Sbjct: 60 AWVDKSSPTQEQLVTQAAVKQKRLITEANEYIGLKQWAGKASLGRLSDDERAQYNAWLDY 119 Query: 121 LDALELVDTSSAPDIEWPTPPAV 143 LD LE V APDI WPTPP + Sbjct: 120 LDELEAVKPEDAPDIIWPTPPVM 142 >UniRef50_A4W7Q4 Phage tail assembly chaperone gp38 n=1 Tax=Enterobacter sp. 638 RepID=A4W7Q4_ENT38 Length = 140 Score = 140 bits (353), Expect = 1e-32, Method: Composition-based stats. Identities = 51/141 (36%), Positives = 75/141 (53%), Gaps = 2/141 (1%) Query: 1 MNYIYSATTNSFYPLEMKEDYTQAGSWPDDAVEVDEQVYIEFSGLPPKGKIRIAGENGFP 60 M Y YSA TN+FYP + ++Y + G++PDDAV V E + ++S P GK+R+A + G P Sbjct: 1 MEYYYSAKTNAFYPDILIDEYKKHGTFPDDAVLVTEACFNQYSADPEPGKMRVADKKGMP 60 Query: 61 AWSEIPPPTHEEQIAAAELKKQQLINQANDYMNSKQWAGKAAIGRLKGEELAQYNLWLDY 120 +W + P PT E + A +K L +QA+ + + A + G + E W Y Sbjct: 61 SWGDQPEPTREMMVYQASEQKNALRDQADKIIAPLKDAKE--YGIITEVEDLVLKEWAIY 118 Query: 121 LDALELVDTSSAPDIEWPTPP 141 L VD + PDI WP P Sbjct: 119 RYNLSKVDVETYPDINWPVKP 139 >UniRef50_Q2NQF0 Hypothetical phage protein n=1 Tax=Sodalis glossinidius str. 'morsitans' RepID=Q2NQF0_SODGM Length = 173 Score = 140 bits (352), Expect = 2e-32, Method: Composition-based stats. Identities = 54/132 (40%), Positives = 75/132 (56%), Gaps = 3/132 (2%) Query: 3 YIYSATTNSFYPLEMKEDYTQAGSWPDDAVEVDEQVYIEFSGLPPKGKIRIAGENGFPAW 62 Y++S TT +FYPL K+ Y AGSWPDD VEV E ++++F PP GK+R NG+P W Sbjct: 12 YVFSGTTGAFYPLSRKQGYVDAGSWPDDGVEVKEDIFMKF-QNPPPGKLRGGDANGYPCW 70 Query: 63 SEIPPPTHEEQIAAAELKKQQLINQANDYMNSKQWAGKAAIGRLKGEELAQYNLWLDYLD 122 + PPPT E++ +A L K+ +++A + Q A +G E A W Y Sbjct: 71 VDTPPPTLEDERRSAALTKKSQLDEAGRIIGPLQDAVD--LGMTTNTEKASLLTWKKYRM 128 Query: 123 ALELVDTSSAPD 134 L VD S+APD Sbjct: 129 LLNRVDISTAPD 140 >UniRef50_O22005 Probable tail fiber assembly protein n=4 Tax=root RepID=TFA_BPSF5 Length = 167 Score = 140 bits (352), Expect = 2e-32, Method: Composition-based stats. Identities = 45/142 (31%), Positives = 59/142 (41%), Gaps = 10/142 (7%) Query: 1 MNYIYSATTNSFYPLEMKEDYTQAGSWPDDAVEVDEQVYIEFSGLPPKGKIRIAGENGFP 60 M+Y YSA+TN FY E + PDDAVE+ E + K+ GENG P Sbjct: 33 MSYFYSASTNGFYSTEF-----HGTNIPDDAVEISESEWETLINSQGVTKMITCGENGHP 87 Query: 61 AWSEIPPPTHEEQIAAAELKKQQLINQANDYMNSKQWAGKAAIGRLKGEELAQYNLWLDY 120 + P PT E + KK LI +A + + Q A +G +E W Y Sbjct: 88 VIVDRPSPTPERLALINDEKKSALIAEATNVIAPLQDAVD--LGMATDDETKLLLAWEKY 145 Query: 121 LDALELVDTSSAPDIEWPTPPA 142 L VD + EWP P Sbjct: 146 RVLLMRVDIK---NTEWPKKPE 164 >UniRef50_Q9T1R1 Probable tail fiber assembly protein n=8 Tax=root RepID=TFA_BPAPS Length = 155 Score = 137 bits (345), Expect = 1e-31, Method: Composition-based stats. Identities = 52/142 (36%), Positives = 70/142 (49%), Gaps = 3/142 (2%) Query: 2 NYIYSATTNSFYPLEMKEDYTQAGSWPDDAVEVDEQVYIEFSGLP-PKGKIRIAGENGFP 60 Y + +++ MK+DY +AGSW D A V VY EF+ P P GK E G P Sbjct: 16 TYYFGQRKLAWFAGSMKKDYIEAGSWDDKAKAVPYSVYREFALNPAPIGKTLGISEKGDP 75 Query: 61 AWSEIPPPTHEEQIAAAELKKQQLINQANDYMNSKQWAGKAAIGRLKGEELAQYNLWLDY 120 W +IPP T + I AE KK L+ A + ++ Q A + EE + W Y Sbjct: 76 IWVDIPPKTKHQLITEAEDKKSGLMQGAREVISPLQDAIDLEM--ATQEETQKLTAWKRY 133 Query: 121 LDALELVDTSSAPDIEWPTPPA 142 L +DTS+APDI+WP P Sbjct: 134 RVLLNRLDTSNAPDIDWPKKPE 155 >UniRef50_Q83M74 Putative phage tail fibre protein n=1 Tax=Shigella flexneri RepID=Q83M74_SHIFL Length = 144 Score = 135 bits (340), Expect = 4e-31, Method: Composition-based stats. Identities = 44/142 (30%), Positives = 59/142 (41%), Gaps = 10/142 (7%) Query: 1 MNYIYSATTNSFYPLEMKEDYTQAGSWPDDAVEVDEQVYIEFSGLPPKGKIRIAGENGFP 60 M+Y YSA+TN FY E + PDDAVE+ E + K+ GENG P Sbjct: 12 MSYFYSASTNGFYSTEF-----HGTNIPDDAVEISESEWKTLINAQSVTKMITCGENGHP 66 Query: 61 AWSEIPPPTHEEQIAAAELKKQQLINQANDYMNSKQWAGKAAIGRLKGEELAQYNLWLDY 120 + P PT E+ + KK LI +A + + Q A +G +E W Y Sbjct: 67 VIVDRPSPTPEQLALINDEKKSALIAEATNVIAPLQDAVD--LGMATDDETKLLLAWKKY 124 Query: 121 LDALELVDTSSAPDIEWPTPPA 142 L V+ EWP P Sbjct: 125 RVLLMRVNVVKP---EWPMHPN 143 >UniRef50_B1JPG7 Tail assembly chaperone gp38 n=1 Tax=Yersinia pseudotuberculosis YPIII RepID=B1JPG7_YERPY Length = 142 Score = 135 bits (340), Expect = 4e-31, Method: Composition-based stats. Identities = 48/144 (33%), Positives = 70/144 (48%), Gaps = 3/144 (2%) Query: 2 NYIYSATTNSFYPLEMKEDYTQA-GSWPDDAVEVDEQVYIEFSGLPPKGKIRIAGENGFP 60 YSAT N F P E + D T +WP DAV + ++ ++ + P G + +G P Sbjct: 1 MVYYSATLNGFIPAEWRFDGTYNINTWPGDAVLLSDKESDKYWKVTPAGGKVLGSVSGRP 60 Query: 61 AWSEIPPPTHEEQIAAAELKKQQLINQANDYMNSKQWAGKAAIGRLKGEELAQYNLWLDY 120 AW +IPP T +E I A K+ A+ ++ +Q A A +E+++ W Y Sbjct: 61 AWVDIPPITIDELIYCAVQNKRVRKEVADSEIDWRQDAVDAE--EASKKEISELAAWKKY 118 Query: 121 LDALELVDTSSAPDIEWPTPPAVQ 144 AL +D S APDI WP P V Sbjct: 119 RVALMRIDISKAPDINWPESPNVA 142 >UniRef50_A4W725 Phage tail assembly chaperone gp38 n=1 Tax=Enterobacter sp. 638 RepID=A4W725_ENT38 Length = 142 Score = 134 bits (336), Expect = 1e-30, Method: Composition-based stats. Identities = 77/142 (54%), Positives = 95/142 (66%), Gaps = 2/142 (1%) Query: 2 NYIYSATTNSFYPLEMKEDYTQAGSWPDDAVEVDEQVYIEFSGLPPKGKIRIAGENGFPA 61 YI+S + FYP E + + G WP D VEV + + P+GK +G+PA Sbjct: 3 KYIWSPSLAGFYPTEEQSIFEGLGGWPTDGVEVSASAHDALFPI-PEGKCIGT-VDGYPA 60 Query: 62 WSEIPPPTHEEQIAAAELKKQQLINQANDYMNSKQWAGKAAIGRLKGEELAQYNLWLDYL 121 W ++PPPTHEE +A AE++KQ I+ AN Y+NSKQW GKAA+GRLK E AQYNLWLDYL Sbjct: 61 WIDLPPPTHEEMVAQAEIEKQSRIDAANAYINSKQWPGKAAMGRLKDTEKAQYNLWLDYL 120 Query: 122 DALELVDTSSAPDIEWPTPPAV 143 D LE VDTS+APDI WP PPA Sbjct: 121 DELEAVDTSTAPDITWPEPPAA 142 >UniRef50_D2TR89 Putative phage tail fibre assembly protein n=1 Tax=Citrobacter rodentium ICC168 RepID=D2TR89_CITRO Length = 141 Score = 132 bits (331), Expect = 4e-30, Method: Composition-based stats. Identities = 66/145 (45%), Positives = 93/145 (64%), Gaps = 8/145 (5%) Query: 2 NYIYSATTNSFYPLEMKEDYTQAGSWPDDAVEVDEQVYIEFSGLP-PKGKIRIAGENGFP 60 +S FY + + P +A+EV +Y EF+G+ P GK+ A ++G+P Sbjct: 3 KIYFSQDPVGFYIEGV-------SAVPSNAIEVSADIYNEFAGVAWPDGKVLGADDSGYP 55 Query: 61 AWSEIPPPTHEEQIAAAELKKQQLINQANDYMNSKQWAGKAAIGRLKGEELAQYNLWLDY 120 W + PPP+H+E IA AE +KQ+LI++ N ++N +QW K A+GRL +E AQ+N WLDY Sbjct: 56 TWIDAPPPSHDELIAQAEAEKQRLIDETNVWINGQQWPSKLALGRLSEDEKAQFNEWLDY 115 Query: 121 LDALELVDTSSAPDIEWPTPPAVQA 145 LDA+ VDTS+APDIEWPTPP A Sbjct: 116 LDAVSAVDTSTAPDIEWPTPPEQPA 140 >UniRef50_C4UND0 Tail assembly chaperone gp38 n=2 Tax=Yersinia ruckeri RepID=C4UND0_YERRU Length = 140 Score = 129 bits (324), Expect = 2e-29, Method: Composition-based stats. Identities = 55/141 (39%), Positives = 76/141 (53%), Gaps = 3/141 (2%) Query: 2 NYIYSATTNSFYPLEMKEDYTQAGSWPDDAVEVDEQVYIEFSGLPPKGKIRIAGENGFPA 61 NY++SA F + Y AG D +++D+ +YIEF+G PP GK R NG PA Sbjct: 3 NYVWSAQNRVFLAEALLPSYDDAGWNLSDIIKIDDSIYIEFNGNPPVGKQRGVI-NGMPA 61 Query: 62 WSEIPPPTHEEQIAAAELKKQQLINQANDYMNSKQWAGKAAIGRLKGEELAQYNLWLDYL 121 W ++PPPT E I++A +K +L + A+ + +Q A G E+A W Y Sbjct: 62 WVDLPPPTSGELISSANAEKSRLKSIADSGIEWRQDAVND--GSASDREIADLAAWRKYR 119 Query: 122 DALELVDTSSAPDIEWPTPPA 142 AL +DTS APDIEWP P Sbjct: 120 VALMRIDTSKAPDIEWPLKPE 140 >UniRef50_C4SF02 Tail assembly chaperone gp38 n=1 Tax=Yersinia mollaretii ATCC 43969 RepID=C4SF02_YERMO Length = 152 Score = 127 bits (319), Expect = 1e-28, Method: Composition-based stats. Identities = 48/141 (34%), Positives = 69/141 (48%), Gaps = 3/141 (2%) Query: 1 MNYIYSATTNSFYPLEMKEDYTQAGSWPDDAVEVDEQVYIEFSGLPPKGKIRIAGENGFP 60 M +S T F L+M ED + + S+ D VE+ + + + P + G P Sbjct: 14 MKIFFSPTILGFRTLDMVEDGSYSDSY-GDFVELSDSERLNYWKQSPPCGKTLGVATGRP 72 Query: 61 AWSEIPPPTHEEQIAAAELKKQQLINQANDYMNSKQWAGKAAIGRLKGEELAQYNLWLDY 120 AW ++PPPTHEE +A+A KK QL A+ + +Q A G +E+ W Y Sbjct: 73 AWVDLPPPTHEELVASAIAKKNQLKAAADSEIEWRQDAVDD--GSASEKEIVDLAAWRKY 130 Query: 121 LDALELVDTSSAPDIEWPTPP 141 AL +DTS AP +EWP P Sbjct: 131 RLALMRIDTSKAPGVEWPESP 151 >UniRef50_B7UGJ5 Predicted tail fiber assembly protein n=1 Tax=Escherichia coli O127:H6 str. E2348/69 RepID=B7UGJ5_ECO27 Length = 138 Score = 125 bits (314), Expect = 4e-28, Method: Composition-based stats. Identities = 58/145 (40%), Positives = 80/145 (55%), Gaps = 8/145 (5%) Query: 1 MNYIYSATTNSFYPLEMKEDYTQAGSWPDDAVEVDEQVYIEFSGLPPKGKIRIAGENGFP 60 MN Y SFYP +K+ Y AGSWP++ +VD++ ++G+ P+GK A +NG P Sbjct: 1 MNKFY---KGSFYPEALKDVYISAGSWPENGADVDDETMAIYTGVAPEGKTLGADKNGNP 57 Query: 61 AWSEIPPPTHEEQIAAAELKKQQLINQANDYMNSKQWAGKAAIGRLKGEELAQYNLWLDY 120 AW +IPP + E+QI AE K+ L + A+ + +Q A A I EE A + W Y Sbjct: 58 AWIDIPPLSAEQQIIQAEQKRTVLRSMADKEIVWRQDAFDAEI--ATAEETAALSEWKKY 115 Query: 121 LDALELVDTSSAPDIEWPTPPAVQA 145 L VDTS+ WPTPP QA Sbjct: 116 RVLLMRVDTSNP---VWPTPPGEQA 137 >UniRef50_P09154 Uncharacterized protein ymfS n=2 Tax=Escherichia coli RepID=YMFS_ECOLI Length = 137 Score = 124 bits (312), Expect = 7e-28, Method: Composition-based stats. Identities = 67/146 (45%), Positives = 88/146 (60%), Gaps = 9/146 (6%) Query: 1 MNYIYSATTNSFYPLEMKEDYTQAGSWPDDAVEVDEQVYIEFSGLPPKGKIRIAGENGFP 60 M T F+ G P D+ E+ + + +G + + FP Sbjct: 1 MKIYCCLNTVGFF-------MDGCGVIPPDSKEITAEHWQSLLKSQAEGGVI--DFSVFP 51 Query: 61 AWSEIPPPTHEEQIAAAELKKQQLINQANDYMNSKQWAGKAAIGRLKGEELAQYNLWLDY 120 + TH++++A A +KQ LI+ A D++NS+QW GKAA+GRLK +EL QYNLWLDY Sbjct: 52 PSIKEVIRTHDDEVADANFQKQMLISDATDFINSRQWQGKAALGRLKEDELKQYNLWLDY 111 Query: 121 LDALELVDTSSAPDIEWPTPPAVQAR 146 L+ALELVDTSSAPDIEWPTPPAVQAR Sbjct: 112 LEALELVDTSSAPDIEWPTPPAVQAR 137 >UniRef50_Q8FJG5 Putative uncharacterized protein yfdK n=1 Tax=Escherichia coli O6 RepID=Q8FJG5_ECOL6 Length = 158 Score = 121 bits (302), Expect = 1e-26, Method: Composition-based stats. Identities = 59/139 (42%), Positives = 86/139 (61%) Query: 3 YIYSATTNSFYPLEMKEDYTQAGSWPDDAVEVDEQVYIEFSGLPPKGKIRIAGENGFPAW 62 +I+ F +K +Y + WP + V++ + EF PP+GKI A +NG PAW Sbjct: 15 FIWDKVNARFMAYILKNEYERNRMWPKEGVDISNETACEFMKQPPEGKILGADDNGMPAW 74 Query: 63 SEIPPPTHEEQIAAAELKKQQLINQANDYMNSKQWAGKAAIGRLKGEELAQYNLWLDYLD 122 ++PP T+ E +A A+ +KQ I +A +Y+N+KQW GKA +GRL EL YN+WLDY++ Sbjct: 75 IDMPPLTYTELVAKAKTEKQARIIEAVNYINNKQWQGKALLGRLNDTELKMYNIWLDYIE 134 Query: 123 ALELVDTSSAPDIEWPTPP 141 ALE +D S A D +PT P Sbjct: 135 ALEAIDPSKASDTAFPTKP 153 >UniRef50_B1JGV6 Tail assembly chaperone gp38 n=2 Tax=Yersinia pseudotuberculosis YPIII RepID=B1JGV6_YERPY Length = 145 Score = 118 bits (296), Expect = 5e-26, Method: Composition-based stats. Identities = 48/149 (32%), Positives = 70/149 (46%), Gaps = 13/149 (8%) Query: 1 MNYIYSATTNSFYPLEMKEDYTQAG-SWPDDAVEVDEQVYIEFSGLPPKGKIRIAGENGF 59 M +SAT F P E + D T +WP DAV + + +EF + Sbjct: 1 MMIYFSATIGGFIPGEWRVDGTYTDETWPTDAVLLTDIESVEFWKRTAPSGKMLGSVKYR 60 Query: 60 PAWSEIPPPTHEE-------QIAAAELKKQQLINQANDYMNSKQWAGKAAIGRLKGEELA 112 P W ++P PT E +A A+LKK +LI+ A D + + +A ++ A Sbjct: 61 PVWVDLPTPTAVEVASQKAGFVAQAKLKKSKLISDARDRIEILKDRIEAG-----QDKAA 115 Query: 113 QYNLWLDYLDALELVDTSSAPDIEWPTPP 141 + LW Y AL+ +D S+APDIEWP P Sbjct: 116 ELKLWKSYRIALDDIDVSAAPDIEWPVAP 144 >UniRef50_B1JS57 Tail assembly chaperone gp38 n=3 Tax=Yersinia RepID=B1JS57_YERPY Length = 139 Score = 118 bits (296), Expect = 5e-26, Method: Composition-based stats. Identities = 36/142 (25%), Positives = 59/142 (41%), Gaps = 3/142 (2%) Query: 1 MNYIYSATTNSFYPLEMKEDYTQAGSWPDDAVEVDEQVYIEFSGLPPKGKIRIAGENGFP 60 M ++S +F P M D + + + D + V ++ + + +G P Sbjct: 1 MKALFSPKLITFIPENMVVDGSYSHNITDSLIAVTDEELATYWRQNSPDGKILGVVDGRP 60 Query: 61 AWSEIPPPTHEEQIAAAELKKQQLINQANDYMNSKQWAGKAAIGRLKGEELAQYNLWLDY 120 W +P P HEE + + KK QL A+ ++ +Q A A +E++ W Y Sbjct: 61 IWVNLPLPLHEELVLGSSTKKSQLKADADSEIDWRQDAVDAE--EANKKEISALAAWRKY 118 Query: 121 LDALELVDTSSAPDIEWPTPPA 142 AL +D S P I WP P Sbjct: 119 RIALMRIDVSHMP-ITWPIKPE 139 >UniRef50_D2TYN1 Phage tail assembly protein n=1 Tax=Arsenophonus nasoniae RepID=D2TYN1_9ENTR Length = 140 Score = 117 bits (293), Expect = 9e-26, Method: Composition-based stats. Identities = 44/142 (30%), Positives = 75/142 (52%), Gaps = 4/142 (2%) Query: 2 NYIYSATTNSFYPLEMKEDYTQAGSWPDDAVEVDEQVYIEFSG-LPPKGKIRIAGENGFP 60 Y YS N FYP E+K+ Y +AGS+P D +EVD+ VY EF+ K+R++G++GFP Sbjct: 1 MYFYSPKENLFYPNELKDIYIEAGSFPSDVIEVDDAVYFEFTKYDLSDNKVRVSGKDGFP 60 Query: 61 AWSEIPPPTHEEQIAAAELKKQQLINQANDYMNSKQWAGKAAIGRLKGEELAQYNLWLDY 120 W E + + I + K L+ + + + W + +G + + ++ W+ Y Sbjct: 61 KW-EEEKISKRQLIEDTKEKINFLLKEVKNVT--QIWQTQLTLGIITDSDKSKLTDWMIY 117 Query: 121 LDALELVDTSSAPDIEWPTPPA 142 L+ +D + DI WP+ P+ Sbjct: 118 AQKLQQIDLKNINDISWPSKPS 139 >UniRef50_B6VLW5 Tail fiber assembly protein homolog from lambdoid prophage dlp12 n=4 Tax=Enterobacteriaceae RepID=B6VLW5_PHOAA Length = 150 Score = 114 bits (286), Expect = 7e-25, Method: Composition-based stats. Identities = 37/161 (22%), Positives = 54/161 (33%), Gaps = 32/161 (19%) Query: 2 NYIYSATTNSFYPLEMKEDYTQAGSWPDDAVEVDEQVYIEFSGLPPKGKIRIAGENGFPA 61 +S +FY KE VE+ + + E +G ++ + G+P Sbjct: 1 MVYFSRKECAFYNEAYKE-----------CVEITAEKHNELLAGQSRGLSIVSNKEGYPV 49 Query: 62 WSEIPPPTHE-------------------EQIAAAELKKQQLINQANDYMNSKQWAGKAA 102 E P + EQ AE KKQQL+ + + Q A Sbjct: 50 LIERAPSVYHKYDGEKWIISESDKIKLRREQQQQAEHKKQQLMLTVSKQIAPLQDAVDLE 109 Query: 103 IGRLKGEELAQYNLWLDYLDALELVDTSSAPDIEWPTPPAV 143 + EE + Y L VD + PDI WP P V Sbjct: 110 M--ASDEEKSLLAALKKYRVLLNRVDVNLVPDIHWPEKPRV 148 >UniRef50_UPI0001C3422A phage tail fiber assembly protein n=1 Tax=Enterobacter cancerogenus ATCC 35316 RepID=UPI0001C3422A Length = 124 Score = 114 bits (284), Expect = 1e-24, Method: Composition-based stats. Identities = 33/115 (28%), Positives = 49/115 (42%), Gaps = 2/115 (1%) Query: 27 WPDDAVEVDEQVYIEFSGLPPKGKIRIAGENGFPAWSEIPPPTHEEQIAAAELKKQQLIN 86 P D +++ + Y EF P + R W +I PP+ EE + A+ K QL++ Sbjct: 9 VPSDLMKITDLEYEEFMVSPDRKTPRFNINRNCMEWVDIAPPSKEEAVQHADSLKAQLMD 68 Query: 87 QANDYMNSKQWAGKAAIGRLKGEELAQYNLWLDYLDALELVDTSSAPDIEWPTPP 141 A + Q A + +E W Y L +D + APDIEWP P Sbjct: 69 VATQAILPLQDAVDLDM--ATDKETILLTEWKKYRVRLNRIDVNVAPDIEWPESP 121 >UniRef50_Q2NV40 Hypothetical phage protein n=2 Tax=Enterobacteriaceae RepID=Q2NV40_SODGM Length = 203 Score = 112 bits (279), Expect = 5e-24, Method: Composition-based stats. Identities = 33/143 (23%), Positives = 48/143 (33%), Gaps = 14/143 (9%) Query: 12 FYPLEMKEDYTQAGSWPDDAVEVDE--------QVYIEFSGLPPKGKIRIAGENGFPAWS 63 F P MK +Y G ++ E+ + + NG Sbjct: 45 FQPETMKIEYETNGVIRSMGYDISGFCPEGCSVAEVSEWPEEAAANR-KWCFLNGQ---V 100 Query: 64 EIPPPTHEEQIAAAELKKQQLINQANDYMNSKQWAGKAAIGRLKGEELAQYNLWLDYLDA 123 T +E + A K+ + QA + Q A + E W Y Sbjct: 101 VPRVYTADELMEQATHKRDYRLEQAAKIIAPLQDAVDLDM--ATDAEKVTLLAWKKYRVL 158 Query: 124 LELVDTSSAPDIEWPTPPAVQAR 146 L +D SSAPDI+WP PP+ R Sbjct: 159 LNRLDISSAPDIDWPDPPSETNR 181 >UniRef50_B2K2J2 Putative uncharacterized protein n=1 Tax=Yersinia pseudotuberculosis PB1/+ RepID=B2K2J2_YERPB Length = 149 Score = 111 bits (276), Expect = 1e-23, Method: Composition-based stats. Identities = 39/150 (26%), Positives = 63/150 (42%), Gaps = 13/150 (8%) Query: 2 NYIYSATTNSFYPLEMKEDYTQAGSWPDDAVEVDEQVYIEFSGLPPKGKIRIAGENGFPA 61 +S + FYP M +D T P D +++ + + P + G P Sbjct: 6 KAKFSPANSMFYPQYMIDDGTFHADLPTDLIDITDAENTTYWRQMPPPGQVLGVIKGRPG 65 Query: 62 WSEIPPPT-------HEEQIAAAELKKQQLINQANDYMNSKQWAGKAAIGRLKGEELAQY 114 W ++PPP+ A A+ KK +LI A+D ++ + + ++ + Sbjct: 66 WVDLPPPSAIDIAAKKAALTAQAKAKKTKLIGDASDEIDVLKDRIELG-----QDKADEL 120 Query: 115 NLWLDYLDALELVDTSSAPDIEWPTPPAVQ 144 LW Y AL+ +D S APDI WP P V Sbjct: 121 KLWKSYRIALDDIDVS-APDINWPESPNVA 149 >UniRef50_B4TRG5 Fels-2 prophage Tfa n=22 Tax=Enterobacteriaceae RepID=B4TRG5_SALSV Length = 135 Score = 108 bits (270), Expect = 5e-23, Method: Composition-based stats. Identities = 39/141 (27%), Positives = 57/141 (40%), Gaps = 10/141 (7%) Query: 2 NYIYSATTNSFYPLEMKEDYTQAGSWPDDAVEVDEQVYIEFSGLPPKGKIRIAGENGFPA 61 Y YS F+ + D S PDD + + ++ Y E GK G P Sbjct: 3 EYYYSFKEKGFF---WQPDTESDNS-PDDLIPLTDEYYRELMQGQVDGKYI-EHRKGGPV 57 Query: 62 WSEIPPPTHEEQIAAAELKKQQLINQANDYMNSKQWAGKAAIGRLKGEELAQYNLWLDYL 121 E T EE +A AE +K +L+ +A + A K I EE+ + W Y Sbjct: 58 LVEHREYTPEELVAQAEARKAELLAEAESVIAPLARAVKLKI--ATDEEIKRLEAWELYS 115 Query: 122 DALELVDTSSAPDIEWPTPPA 142 + VDT++ +WP PA Sbjct: 116 VMVNRVDTANP---DWPEKPA 133 >UniRef50_A7ZYE1 Putative tail fiber assembly protein n=1 Tax=Escherichia coli HS RepID=A7ZYE1_ECOHS Length = 137 Score = 105 bits (261), Expect = 6e-22, Method: Composition-based stats. Identities = 43/142 (30%), Positives = 65/142 (45%), Gaps = 7/142 (4%) Query: 1 MNYIYSATTNSFYPLEMKEDYTQAGSWPDDAVEVDEQVYIEFSGLPPKGKIRIAGENGFP 60 MN Y+ N + D + PD Q I S + + I +G Sbjct: 1 MNASYAVIENGMVVNVIVWDGEAEFTVPD------NQQLINISDISEQPGIGWVYSDGGF 54 Query: 61 AWSEIPPPTHEEQIAAAELKKQQLINQANDYMNSKQWAGKAAIGRLKGEELAQYNLWLDY 120 +H+E +A AE KKQ L++ A ++ Q +A +L EE + N+ LDY Sbjct: 55 TAPPTQERSHDELVADAEQKKQSLLDAAMANISVIQLKLQAG-RKLTQEETTRLNVVLDY 113 Query: 121 LDALELVDTSSAPDIEWPTPPA 142 ++A+ +DTS+APDI WP PA Sbjct: 114 IEAVTAIDTSTAPDIIWPVFPA 135 >UniRef50_Q2NWF3 Putative uncharacterized protein n=1 Tax=Sodalis glossinidius str. 'morsitans' RepID=Q2NWF3_SODGM Length = 133 Score = 103 bits (257), Expect = 2e-21, Method: Composition-based stats. Identities = 43/114 (37%), Positives = 63/114 (55%), Gaps = 2/114 (1%) Query: 3 YIYSATTNSFYPLEMKEDYTQAGSWPDDAVEVDEQVYIEFSGLPPKGKIRIAGENGFPAW 62 Y +SA T SF+P+ M DY +AGS PDD V+VDE + +F PP GK R A G+PAW Sbjct: 22 YKFSARTGSFFPVSMLNDYIKAGSLPDDLVDVDETTFWQFCASPPSGKQRGANAQGYPAW 81 Query: 63 SEIPPPTHEEQIAAAELKKQQLINQANDYMNSKQWAGKAAIGRLKGEELAQYNL 116 ++PPPT EE + ++ K++L+++ M + A + E A Sbjct: 82 IDVPPPTPEEARLSVDVTKRRLMDEVTRAMAPLEDAVDLDM--ATDAEKAALLA 133 >UniRef50_B7M8C3 Putative phage tail fiber assembly protein n=1 Tax=Escherichia coli IAI1 RepID=B7M8C3_ECO8A Length = 146 Score = 102 bits (254), Expect = 3e-21, Method: Composition-based stats. Identities = 46/141 (32%), Positives = 68/141 (48%), Gaps = 7/141 (4%) Query: 1 MNYIYSATTNSFYPLEMKEDYTQAGSWPDDAVEVDEQVYIEFSGLPPKGKIRIAGENGFP 60 MN Y+ N + D + PD Q I S + + I A +G Sbjct: 9 MNASYAVIENGMVMNVIAWDGEAEFTVPD------NQQLINISDISEQPGIGWAYSDGVF 62 Query: 61 AWSEIPPPTHEEQIAAAELKKQQLINQANDYMNSKQWAGKAAIGRLKGEELAQYNLWLDY 120 + P +H+EQ+A AE +KQ +I+ A ++ Q +A +L EE + N+ LDY Sbjct: 63 SAPLPPERSHDEQVADAEHQKQSMIDAAMVNISVIQLKLQAG-RKLTQEETTRLNVVLDY 121 Query: 121 LDALELVDTSSAPDIEWPTPP 141 +DA+ DTS+APDIEWP P Sbjct: 122 IDAVTATDTSTAPDIEWPDEP 142 >UniRef50_P03740 Tail fiber assembly protein n=117 Tax=root RepID=TFA_LAMBD Length = 194 Score = 99.9 bits (247), Expect = 2e-20, Method: Composition-based stats. Identities = 35/117 (29%), Positives = 54/117 (46%), Gaps = 7/117 (5%) Query: 30 DAVEVDE--QVYIEFSGLPPKGKIRIAGENGFPAWSEIPPPTHEEQIAAAELKKQQLINQ 87 DA+ + E + F+ L P G+ + + AW + +I AE K+ L+ Sbjct: 83 DALFISELGPLPENFTWLSPGGEYQ---KWNGTAWVKDTEAEKLFRIREAEETKKSLMQV 139 Query: 88 ANDYMNSKQWAGKAAIGRLKGEELAQYNLWLDYLDALELVDTSSAPDIEWPTPPAVQ 144 A++++ Q A I EE + W Y L VDTS+APDIEWP P ++ Sbjct: 140 ASEHIAPLQDAADLEI--ATKEETSLLEAWKKYRVLLNRVDTSTAPDIEWPAVPVME 194 >UniRef50_A5A5F3 Tail fiber assembly n=4 Tax=Pseudomonas aeruginosa RepID=A5A5F3_PSEAE Length = 152 Score = 98.3 bits (243), Expect = 6e-20, Method: Composition-based stats. Identities = 28/140 (20%), Positives = 53/140 (37%), Gaps = 12/140 (8%) Query: 2 NYIYSATTNSFYPLEMKEDYTQAGSWPDDAVEVDEQVYIEFSGLPPKGKIRIAGENGFPA 61 Y +S + +FYP ++E Y AG WP D V +++ + G+ + NG P Sbjct: 4 EYYFSPSQVAFYPASLREVYEHAGCWPVDGEWVSAELHEQLMNEQAAGRAISSDVNGNPV 63 Query: 62 WSEIPPPTHEEQIAAAELKKQQLINQANDYMNSKQWAGKAAIGRLKGEELAQYNLWLDYL 121 E PP + +++ + + + + + G+ QY+ + Y Sbjct: 64 AIERPPLSRQQRSTHERRWRDSQLLATDGLVVRHRDQ--LETGKETTLLPVQYHELMSYR 121 Query: 122 DALELVDTSSAPDIEWPTPP 141 +L +WP P Sbjct: 122 ASLR----------DWPEEP 131 >UniRef50_C9XYE1 Tail fiber assembly protein homolog from lambdoid prophage e14 n=1 Tax=Cronobacter turicensis RepID=C9XYE1_CROTZ Length = 200 Score = 97.9 bits (242), Expect = 1e-19, Method: Composition-based stats. Identities = 44/114 (38%), Positives = 60/114 (52%), Gaps = 5/114 (4%) Query: 34 VDEQVYIEFSGLPPKGKIRIAGENGFPAWSEIPPPTHEEQIAAAELK-----KQQLINQA 88 E V I G P G R A F W+ T+E+ AA+++ K L A Sbjct: 87 TGEPVKITLPGDYPAGTTRAAPATRFDVWNGKAWVTNEDARRAADVENAKAMKSSLRGMA 146 Query: 89 NDYMNSKQWAGKAAIGRLKGEELAQYNLWLDYLDALELVDTSSAPDIEWPTPPA 142 N+ ++ +QW + +GRL +E A + WLDYL+AL VDTS APDI+WP PA Sbjct: 147 NEIISQQQWPSRLTLGRLNEQEQAAFTAWLDYLEALAAVDTSRAPDIQWPQLPA 200 >UniRef50_B4TAQ6 Tail fiber assembly protein n=8 Tax=Salmonella enterica subsp. enterica RepID=B4TAQ6_SALHS Length = 121 Score = 97.2 bits (240), Expect = 1e-19, Method: Composition-based stats. Identities = 55/121 (45%), Positives = 80/121 (66%) Query: 26 SWPDDAVEVDEQVYIEFSGLPPKGKIRIAGENGFPAWSEIPPPTHEEQIAAAELKKQQLI 85 W +D + ++ ++ P + + NG AW IPPPT ++ I+AA +K++ I Sbjct: 1 MWSNDFYALTDEEVDKYYMKTPPKGMYLGSSNGRIAWVCIPPPTQDDLISAANQEKKKRI 60 Query: 86 NQANDYMNSKQWAGKAAIGRLKGEELAQYNLWLDYLDALELVDTSSAPDIEWPTPPAVQA 145 +QAN++MNS++W GKAA+GRL G+ELAQYNLWLDYLDAL+ VDTS A +I P A+ Sbjct: 61 DQANEHMNSRRWPGKAALGRLTGDELAQYNLWLDYLDALKAVDTSVAQNIACPIRKAIHI 120 Query: 146 R 146 + Sbjct: 121 K 121 >UniRef50_D0FT42 Phage tail assembly chaperone n=2 Tax=Erwinia pyrifoliae RepID=D0FT42_ERWPY Length = 186 Score = 94.9 bits (234), Expect = 7e-19, Method: Composition-based stats. Identities = 28/127 (22%), Positives = 48/127 (37%), Gaps = 8/127 (6%) Query: 22 TQAGSWPDDAVEVDEQVYIEFSGLP--PKGKIRIAGENGFPAWSEIPPPTHEEQIAAAEL 79 + WP DA + E + P + T EE A L Sbjct: 66 DASTLWPSDAC-IAEINKNQLPKDFELPINTDGWQFDGKK---IVPRAYTQEELQEKAGL 121 Query: 80 KKQQLINQANDYMNSKQWAGKAAIGRLKGEELAQYNLWLDYLDALELVDTSSAPDIEWPT 139 K+ L+ A+ + + A + I +E+ +W+ Y + +D ++AP+I+WP Sbjct: 122 IKENLLQLASVKIAPLRDAQELDI--ATDDEINALKIWMTYRVQINRIDITNAPNIKWPG 179 Query: 140 PPAVQAR 146 P V R Sbjct: 180 MPDVSRR 186 >UniRef50_B7NA68 Putative uncharacterized protein n=1 Tax=Escherichia coli UMN026 RepID=B7NA68_ECOLU Length = 134 Score = 94.5 bits (233), Expect = 9e-19, Method: Composition-based stats. Identities = 36/136 (26%), Positives = 52/136 (38%), Gaps = 4/136 (2%) Query: 10 NSFYPLEMKEDYTQAGSWPDDAVEVDEQVYIEFSGLPPKGKIRIAGENGFPAWSEIPPPT 69 N Y + W D + + L I + +G P + Sbjct: 2 NDVYAVVDNNVVINVIIW--DGISEWKPEAGNLVPLNGDAGIGWSYSDGVFTAPPPPERS 59 Query: 70 HEEQIAAAELKKQQLINQANDYMNSKQWAGKAAIGRLKGEELAQYNLWLDYLDALELVDT 129 H +A AEL+K L+ AN+ + Q A + +E W Y L VDT Sbjct: 60 HNALVAEAELQKSALLTVANNAIAPLQDAVDLEM--ATDDEQTLLLAWKKYRVLLNRVDT 117 Query: 130 SSAPDIEWPTPPAVQA 145 S+AP+IEWPT P +A Sbjct: 118 SAAPEIEWPTQPGERA 133 >UniRef50_B7NSC3 Putative tail fiber assembly protein (Possibly partial) n=2 Tax=Escherichia coli RepID=B7NSC3_ECO7I Length = 136 Score = 94.1 bits (232), Expect = 1e-18, Method: Composition-based stats. Identities = 35/141 (24%), Positives = 57/141 (40%), Gaps = 16/141 (11%) Query: 5 YSATTNSFYPLEMKEDYTQAGSWPDDA--VEVDEQVYIEFSGLPPKGKIRIAGENGFPAW 62 Y+ N D A P +A + V + V +I + G + Sbjct: 8 YAVIENGVVTNIAVWDGESA-WQPTNALVIPVSDNV-----------RIGWFYDKGKLSS 55 Query: 63 SEIPPPTHEEQIAAAELKKQQLINQANDYMNSKQWAGKAAIGRLKGEELAQYNLWLDYLD 122 PP TH+E + AE ++Q L++ AN + W +G + LW +Y++ Sbjct: 56 PTQPPKTHDELLREAENERQCLLDSANSLI--MNWQSDLLLGIISENNKGNLLLWKEYVN 113 Query: 123 ALELVDTSSAPDIEWPTPPAV 143 +L VD S P+I WP P + Sbjct: 114 SLMSVDLSLVPEITWPERPEI 134 >UniRef50_B2Q1Q3 Putative uncharacterized protein n=1 Tax=Providencia stuartii ATCC 25827 RepID=B2Q1Q3_PROST Length = 116 Score = 93.7 bits (231), Expect = 1e-18, Method: Composition-based stats. Identities = 37/139 (26%), Positives = 51/139 (36%), Gaps = 25/139 (17%) Query: 2 NYIYSATTNSFYPLEMKEDYTQAGSWPDDAVEVDEQVYIEFSGLPPKGKIRIAGENGFPA 61 Y N Y E D +Q DD V + EQ + + Sbjct: 1 MKYYKDKNNEVYAYE--SDGSQDAFIADDLVLIAEQEALAITN----------------- 41 Query: 62 WSEIPPPTHEEQIAAAELKKQQLINQANDYMNSKQWAGKAAIGRLKGEELAQYNLWLDYL 121 PPPT E+ IA AE +KQ L+N+A + Q A +G EE Q +W +Y Sbjct: 42 ----PPPTKEQLIAEAEYQKQALLNEATAAIAPLQDAVD--LGIATDEEREQLRVWKEYR 95 Query: 122 DALELVDTSSAPDIEWPTP 140 + VD + WP Sbjct: 96 VEVNRVDVGLGLCVNWPVS 114 >UniRef50_O68721 Lambda tail fiber assembly protein G n=32 Tax=Enterobacteriaceae RepID=O68721_YERPE Length = 202 Score = 93.3 bits (230), Expect = 2e-18, Method: Composition-based stats. Identities = 29/112 (25%), Positives = 41/112 (36%), Gaps = 7/112 (6%) Query: 33 EVDEQVYIEFSGLPPKGKIRIAG-----ENGFPAWSEIPPPTHEEQIAAAELKKQQLINQ 87 E E V+I G P+ I+ + AW + + AE K L+ Sbjct: 80 ETGEPVFIAELGPLPENVTYISPNGEYQKWDGSAWVKDEEAEKTALVGEAEQNKSVLMKN 139 Query: 88 ANDYMNSKQWAGKAAIGRLKGEELAQYNLWLDYLDALELVDTSSAPDIEWPT 139 + ++ Q A + EE Y L VDTS APDI+WP Sbjct: 140 VSQQISLLQDAIDLDM--ATDEEKETLVALKKYRVLLNRVDTSLAPDIDWPI 189 >UniRef50_D0ZBI3 Phage tail assembly chaperone gp38 n=1 Tax=Edwardsiella tarda EIB202 RepID=D0ZBI3_EDWTE Length = 193 Score = 92.9 bits (229), Expect = 3e-18, Method: Composition-based stats. Identities = 27/90 (30%), Positives = 42/90 (46%), Gaps = 2/90 (2%) Query: 53 IAGENGFPAWSEIPPPTHEEQIAAAELKKQQLINQANDYMNSKQWAGKAAIGRLKGEELA 112 AW H + AE KK L+++A + + W + +G + + A Sbjct: 105 PYDAWDGTAWVTDLNAQHAANVELAEQKKSLLLSEAQEKIGL--WQTELQLGMITDSDKA 162 Query: 113 QYNLWLDYLDALELVDTSSAPDIEWPTPPA 142 W+ Y+ A++ VDTS+APDI WP PA Sbjct: 163 ALITWMTYIKAVQAVDTSAAPDIAWPPKPA 192 >UniRef50_C6CP80 Tail assembly chaperone gp38 n=2 Tax=Dickeya RepID=C6CP80_DICZE Length = 131 Score = 92.5 bits (228), Expect = 4e-18, Method: Composition-based stats. Identities = 33/144 (22%), Positives = 58/144 (40%), Gaps = 14/144 (9%) Query: 1 MNYIYSATTNSFYPLEMKEDYTQAGSW-PDDAVEVDEQVYIEFSGLPPKGKIRIAGENGF 59 M+ +Y+ N + D W P + +D + + +G Sbjct: 1 MSKVYAVIENGVVINTVVWDSDVGADWKPQNGALID--------ISSERVGVGYLYSDGV 52 Query: 60 PAWSEIPPPTHEEQIAAAELKKQQLINQANDYMNSKQWAGKAAIGRLKGEELAQYNLWLD 119 P +E IA A L+K QL+++A + W +G + +E ++ W + Sbjct: 53 F---TPPEKRRDEYIADATLRKTQLLSEAQKMIA--NWQTDLMLGVISDDEKSRLVRWRE 107 Query: 120 YLDALELVDTSSAPDIEWPTPPAV 143 Y+ ++ +D SAPDI WP PP Sbjct: 108 YMKQVDAIDAQSAPDITWPVPPTA 131 >UniRef50_A4WEL2 Phage tail assembly chaperone gp38 n=1 Tax=Enterobacter sp. 638 RepID=A4WEL2_ENT38 Length = 198 Score = 91.8 bits (226), Expect = 6e-18, Method: Composition-based stats. Identities = 28/115 (24%), Positives = 47/115 (40%), Gaps = 7/115 (6%) Query: 33 EVDEQVYIEFSGLPPKGKIRIA-----GENGFPAWSEIPPPTHEEQIAAAELKKQQLINQ 87 E + V I G + A +W + + A KK L+++ Sbjct: 86 ETGQAVGITAPGAYAQNVTLSAPLTPFDRWNGQSWVTDLEALRQADESQARQKKAGLLSE 145 Query: 88 ANDYMNSKQWAGKAAIGRLKGEELAQYNLWLDYLDALELVDTSSAPDIEWPTPPA 142 A+ ++ W +G + E+ A W+ Y+ AL VD ++APDI+WP P Sbjct: 146 AHSTISL--WQTGLQLGIISDEDKASLITWMTYIQALNAVDVTAAPDIDWPLMPE 198 >UniRef50_B2VJG1 Tail fiber assembly protein n=1 Tax=Erwinia tasmaniensis RepID=B2VJG1_ERWT9 Length = 198 Score = 91.4 bits (225), Expect = 9e-18, Method: Composition-based stats. Identities = 31/115 (26%), Positives = 40/115 (34%), Gaps = 7/115 (6%) Query: 32 VEVDEQVYIEFSGLPPKGKIRI-----AGENGFPAWSEIPPPTHEEQIAAAELKKQQLIN 86 + + I G P + W E IA A K LI Sbjct: 84 IRTGAEQQITVPGDYPADTTIYSPSTPYDKWNGERWVTDEAAKAEADIAEAAAAKAVLIK 143 Query: 87 QANDYMNSKQWAGKAAIGRLKGEELAQYNLWLDYLDALELVDTSSAPDIEWPTPP 141 A + Q A + + EE ++Y+ W Y L VD S APDI WP PP Sbjct: 144 SAAAKIEPLQDAVQLDM--ATDEEKSRYDAWRKYRVLLTRVDISQAPDINWPEPP 196 >UniRef50_A4JWL9 Putative uncharacterized protein n=1 Tax=Burkholderia phage phiE255 RepID=A4JWL9_9CAUD Length = 116 Score = 91.0 bits (224), Expect = 1e-17, Method: Composition-based stats. Identities = 28/114 (24%), Positives = 50/114 (43%), Gaps = 3/114 (2%) Query: 32 VEVDEQVYIEFSGLPPKGKIRIAGENGFPAWSEIPPPTHEEQIAAAELKKQQLINQANDY 91 +E+ ++ + +GK ++G P + PPT E+ + + +L+ +A+ Sbjct: 4 IEITDEQWKMLLAGESQGKRMAVDDSGAPVLLDPLPPTVEQIVTGNTAARDRLLERASVA 63 Query: 92 MNSKQWAGKAAIGRLKGEELAQYNLWLDYLDALELVDTSSAPDIEWPTPPAVQA 145 + Q A +G E AQ W+ Y AL+ VD + D WP P + A Sbjct: 64 LTPLQTA--ITLGEATDGETAQARAWITYTRALKSVDL-TQRDPTWPEQPKIVA 114 >UniRef50_C4SNQ0 Putative uncharacterized protein n=4 Tax=Yersinia RepID=C4SNQ0_YERFR Length = 192 Score = 90.6 bits (223), Expect = 1e-17, Method: Composition-based stats. Identities = 27/115 (23%), Positives = 43/115 (37%), Gaps = 7/115 (6%) Query: 33 EVDEQVYIEFSGLPPKGKIRIA-----GENGFPAWSEIPPPTHEEQIAAAELKKQQLINQ 87 + E I G P+ +A W + + +A A KK L+ + Sbjct: 80 QTGEPQTINQLGSLPENTTLLAPSSSFDRWNGTKWVKDSEAEKQYYLAEARQKKSILLEE 139 Query: 88 ANDYMNSKQWAGKAAIGRLKGEELAQYNLWLDYLDALELVDTSSAPDIEWPTPPA 142 AN + + + + + E + W Y L +D S+APDI WP PA Sbjct: 140 ANTQIEILKDSIEFDMSTSTAE--TELVAWRKYRVQLNQLDISAAPDINWPKQPA 192 >UniRef50_C4TT86 Conserved hypothetical phage tail fiber protein n=1 Tax=Yersinia kristensenii ATCC 33638 RepID=C4TT86_YERKR Length = 161 Score = 90.2 bits (222), Expect = 2e-17, Method: Composition-based stats. Identities = 41/142 (28%), Positives = 64/142 (45%), Gaps = 7/142 (4%) Query: 2 NYIYSATTNSFYPLEMKEDYTQAGSWPDDAVEVDEQVYIEFSGLPPKGKIRIAGENGFPA 61 Y +SATT SFYP E+ + YT AG+ P D +E+ + +Y +F+ P GK+R A + G P Sbjct: 1 MYCFSATTLSFYPKELLDVYTDAGTLPSDLIEIGDDIYAQFAAQQPAGKMRGADKKGKPV 60 Query: 62 WSEIPPPTHEEQIAAAELKK-QQLINQANDYMNSKQWAGKAAIGRLKGEELAQYNLWL-D 119 W +P P AA ++ + A D M ++ L + ++ Sbjct: 61 WVNVPAPVVTADAVAATARRYRDAFITATDAMTIIDYSIDDK--PLTDAQRSELMAIRAA 118 Query: 120 YLDALELVDTSSAPDIEWPTPP 141 Y ++ P IE P P Sbjct: 119 YRAWPT---LANWPLIELPELP 137 >UniRef50_B3G0V8 Putative uncharacterized protein n=1 Tax=Pseudomonas aeruginosa RepID=B3G0V8_PSEAE Length = 144 Score = 90.2 bits (222), Expect = 2e-17, Method: Composition-based stats. Identities = 27/140 (19%), Positives = 53/140 (37%), Gaps = 18/140 (12%) Query: 1 MNYIYSATTNSFYPLEMKEDYTQAGSWPDDAVEVDEQVYIEFSGLPPKGKIRIAGENGFP 60 M++ + +FY + D PDDAVE+ + + +GK AG++G P Sbjct: 1 MSHFFGTKPIAFYDTAINTD------IPDDAVEITADEHADLLAAQARGKRIAAGKDGRP 54 Query: 61 AWSEIPPPTHEEQIAAAELKKQQLINQANDYMNSKQWAGKAAIGRLKGEELAQYNLWLDY 120 + P PT +E + + + + + + + + + G + +Y+ Y Sbjct: 55 ILLDPPAPTRDELESFERIWRDARLRETDSLVARHRD--EIETGEAPTLDTEKYSALQAY 112 Query: 121 LDALELVDTSSAPDIEWPTP 140 AL +WP Sbjct: 113 RRALR----------DWPEA 122 >UniRef50_B2K1B5 Putative uncharacterized protein n=6 Tax=Yersinia RepID=B2K1B5_YERPB Length = 159 Score = 89.1 bits (219), Expect = 4e-17, Method: Composition-based stats. Identities = 36/143 (25%), Positives = 59/143 (41%), Gaps = 8/143 (5%) Query: 2 NYIYSATTNSFYPLEMKEDYTQAGSWPDDAVEVDEQVYIEFSGLPPKGKIRIAGENGFPA 61 +SATT FYP E KE+Y GSWPDDA+ + ++ ++ P + G P Sbjct: 1 MIYFSATTGGFYPQEWKEEYLATGSWPDDALLLTKKEQTKYWKHVPATGKMLGVMKGRPV 60 Query: 62 WSEIP--PPTHEEQIAAAELKKQQLINQANDYMNSKQWAGKAAIGRLKGEELAQYNLWL- 118 W +IP P H + +AA + + + D + ++ + L + A+ Sbjct: 61 WLDIPPLPAPHGDTLAALARRHRDAFIKTTDSITVIDYSIDDS--PLTDTQRAELTATRA 118 Query: 119 DYLDALELVDTSSAPDIEWPTPP 141 Y + P +E P P Sbjct: 119 AYRAWPT---VENWPRVELPELP 138 >UniRef50_C7BTY5 Hypothetical phage protein n=1 Tax=Photorhabdus asymbiotica subsp. asymbiotica ATCC 43949 RepID=C7BTY5_PHOAA Length = 178 Score = 88.7 bits (218), Expect = 5e-17, Method: Composition-based stats. Identities = 35/147 (23%), Positives = 52/147 (35%), Gaps = 14/147 (9%) Query: 5 YSATTNSFYPLEMKEDYTQAGSWPDDAVEVD----------EQVYIEFSGLPPKGKIRIA 54 + + N F +K Y G +V E E + Sbjct: 34 WYTSQNQFSTETLKIMYDANGLIRAITTDVSKFAPLGFSVAEINKNEVPKEFNEKTNYKW 93 Query: 55 GENGFPAWSEIPPPTHEEQIAAAELKKQQLINQANDYMNSKQWAGKAAIGRLKGEELAQY 114 +G + T EE I AE ++ QL+ +AN+ + Q A I EE Sbjct: 94 IFDGQKIF--PYVATKEELIRKAEYERVQLLVKANNIIVPLQDAIDLNI--ATEEEKNTL 149 Query: 115 NLWLDYLDALELVDTSSAPDIEWPTPP 141 W Y L +D S+ P+I WP+PP Sbjct: 150 LKWKKYRIMLNRIDISTTPEIVWPSPP 176 >UniRef50_B6XAD7 Putative uncharacterized protein n=2 Tax=Providencia alcalifaciens DSM 30120 RepID=B6XAD7_9ENTR Length = 177 Score = 88.7 bits (218), Expect = 5e-17, Method: Composition-based stats. Identities = 24/80 (30%), Positives = 34/80 (42%), Gaps = 2/80 (2%) Query: 63 SEIPPPTHEEQIAAAELKKQQLINQANDYMNSKQWAGKAAIGRLKGEELAQYNLWLDYLD 122 E +A AE + L+N+A ++S Q A I E+ W Y Sbjct: 100 VLPYETPKETLVAKAENTLRLLLNEATIKIDSLQDAVDLDI--ATDAEIVSLKEWKKYRV 157 Query: 123 ALELVDTSSAPDIEWPTPPA 142 L VDTS+APD+ +P P Sbjct: 158 LLNRVDTSTAPDVSFPEKPE 177 >UniRef50_Q47427 Tail fiber assembly protein homolog n=47 Tax=root RepID=TFAB_ECOLX Length = 203 Score = 88.7 bits (218), Expect = 6e-17, Method: Composition-based stats. Identities = 33/114 (28%), Positives = 49/114 (42%), Gaps = 6/114 (5%) Query: 32 VEVDEQVYIEFSGLPPKGKIRIA-----GENGFPAWSEIPPPTHEEQIAAAELKKQQLIN 86 ++ I G P+ IA + W H + AAE K+Q LI+ Sbjct: 85 IDTGNPEEITVLGDYPENTTTIAPLTPYDKWDGEKWVVDTEAQHSAAVEAAETKRQSLID 144 Query: 87 QANDYMNSKQWAGKAAIGRLKGEELAQYNLWLDYLDALELVDTSSAPDIEWPTP 140 A D ++ Q +A +L E Q N LDY+D L +D ++APD+ WP Sbjct: 145 TAMDSISLIQLKLRAG-RKLTQAETTQLNSVLDYIDELNAMDLTTAPDLNWPEK 197 >UniRef50_B4T1N8 Tail assembly chaperone gp38 n=3 Tax=Salmonella enterica subsp. enterica RepID=B4T1N8_SALNS Length = 171 Score = 88.3 bits (217), Expect = 7e-17, Method: Composition-based stats. Identities = 28/114 (24%), Positives = 41/114 (35%), Gaps = 11/114 (9%) Query: 30 DAVEVDEQVYIEFSGLPPKGKIRIAGENGFPAWSEIPPPTHEEQIAAAELKKQQLINQAN 89 D VE D Y + GK + E +K + + AN Sbjct: 69 DIVESDSLPYDDII----SGKYQFVDNK-----IIPRTYNEVELTQITNAEKSKKLKLAN 119 Query: 90 DYMNSKQWAGKAAIGRLKGEELAQYNLWLDYLDALELVDTSSAPDIEWPTPPAV 143 + + Q A +G EE+ + W Y + +DTS+ DI WP PP V Sbjct: 120 EKIRPLQDAVD--LGIATDEEIQKLGAWKRYRVEINRIDTSNLLDISWPLPPDV 171 >UniRef50_P40784 Tail fiber assembly protein homolog from lambdoid prophage Fels-1 n=65 Tax=Enterobacteriaceae RepID=YCDD_SALTY Length = 191 Score = 88.3 bits (217), Expect = 7e-17, Method: Composition-based stats. Identities = 24/105 (22%), Positives = 40/105 (38%), Gaps = 6/105 (5%) Query: 37 QVYIEFSGLPPKGKIRIAGENGFPAWSEIPPPTHEEQIAAAELKKQQLINQANDYMNSKQ 96 + + + P G + AW ++ AE K +L+ A+ + Q Sbjct: 92 PLPENVTSVSPGGGYKKWDSKAK-AWVNDEGAEVAARLREAEGTKSRLLQMASGKIAPLQ 150 Query: 97 WAGKAAIGRLKGEELAQYNLWLDYLDALELVDTSSAPDIEWPTPP 141 A +G +E AQ + W Y + VDTS+ +WP P Sbjct: 151 DAVD--LGIATDDEKAQLDEWKKYRVLVNRVDTSNP---DWPEQP 190 >UniRef50_D2TR90 Putative phage tail fibre assembly protein n=1 Tax=Citrobacter rodentium ICC168 RepID=D2TR90_CITRO Length = 199 Score = 87.5 bits (215), Expect = 1e-16, Method: Composition-based stats. Identities = 34/117 (29%), Positives = 50/117 (42%), Gaps = 7/117 (5%) Query: 32 VEVDEQVYIEFSGLPPKGKIRI-----AGENGFPAWSEIPPPTHEEQIAAAELKKQQLIN 86 +E E V I G P G + E W H +AAAE +K L+ Sbjct: 85 IENGEPVEITAPGDYPAGTTTLFPSTPYDEWDGEKWVTDTDAQHAADVAAAEHQKTALLA 144 Query: 87 QANDYMNSKQWAGKAAIGRLKGEELAQYNLWLDYLDALELVDTSSAPDIEWPTPPAV 143 A D ++ W + +G + ++ A WL Y+ L+ VDT ++PDI WP P Sbjct: 145 AAQDTIS--IWQTELQLGIISDDDKASLISWLSYIKELQTVDTDASPDINWPVAPVA 199 >UniRef50_A8GA30 Tail assembly chaperone gp38 n=2 Tax=Enterobacteriaceae RepID=A8GA30_SERP5 Length = 184 Score = 87.5 bits (215), Expect = 1e-16, Method: Composition-based stats. Identities = 27/121 (22%), Positives = 42/121 (34%), Gaps = 9/121 (7%) Query: 21 YTQAGSWPDDAVEVDEQVYIEFSGLPPKGKIRIAGENGFPAWSEIPPPTHEEQIAAAELK 80 + +G +P++ + + + + E A K Sbjct: 72 FDASGFFPEN---MSVAEIEQLPEHADIDGRWFFDGTQ----IKPRTYSVAELQQQAMNK 124 Query: 81 KQQLINQANDYMNSKQWAGKAAIGRLKGEELAQYNLWLDYLDALELVDTSSAPDIEWPTP 140 KQ L QA+ + + A + + EE Q W Y L VD APDI+WP P Sbjct: 125 KQDLSKQASLKIATLNDAVELEM--ASEEEQKQLTAWKTYRVLLSRVDPGLAPDIDWPQP 182 Query: 141 P 141 P Sbjct: 183 P 183 >UniRef50_Q1I688 Putative phage protein n=1 Tax=Pseudomonas entomophila L48 RepID=Q1I688_PSEE4 Length = 143 Score = 87.1 bits (214), Expect = 2e-16, Method: Composition-based stats. Identities = 32/143 (22%), Positives = 52/143 (36%), Gaps = 11/143 (7%) Query: 2 NYIYSATTNSFYPLEMKEDYTQAGSWPDDAVEVDEQVYIEFSGLPPKGKIRIAGENGFPA 61 YSA FY ++ + PDDAVE+ +++ GK + NG P Sbjct: 1 MIFYSAQNQGFYDSKVLD-----RMRPDDAVEISAELHAVLMRGQAIGKQIVVESNGMPG 55 Query: 62 WSEIPPPTHEEQIAAAELKKQQLINQANDYMNSKQWAGKAAIGRLKGEELAQYNLWLDYL 121 E+ E A + +++ + + + + G Q++ L YL Sbjct: 56 LRELI----ENAAAVERGWRDRMLADSLKLRDRHRDQLELGGGAETNLSPEQFHALLTYL 111 Query: 122 DALELVDTSSA-PDI-EWPTPPA 142 AL S A PD + P P Sbjct: 112 QALRDWPQSDAFPDAGKRPVAPD 134 >UniRef50_A8GLQ6 Tail assembly chaperone gp38 n=2 Tax=root RepID=A8GLQ6_SERP5 Length = 170 Score = 87.1 bits (214), Expect = 2e-16, Method: Composition-based stats. Identities = 28/99 (28%), Positives = 38/99 (38%), Gaps = 2/99 (2%) Query: 44 GLPPKGKIRIAGENGFPAWSEIPPPTHEEQIAAAELKKQQLINQANDYMNSKQWAGKAAI 103 P G + G P + E A A+ K +LI A ++ Q A I Sbjct: 74 DSLPDGCDILGGWVFDGKKVVPRPYSQAELSAQAQQAKNKLIELATKAISPLQDAKDLDI 133 Query: 104 GRLKGEELAQYNLWLDYLDALELVDTSSAPDIEWPTPPA 142 +ELA+ W+ Y L VDT P+I WP P Sbjct: 134 --ATDDELAKLKEWMVYRVHLNRVDTGMTPNIVWPQSPE 170 >UniRef50_UPI000197C594 tail assembly chaperone gp38 n=1 Tax=Providencia rettgeri DSM 1131 RepID=UPI000197C594 Length = 169 Score = 87.1 bits (214), Expect = 2e-16, Method: Composition-based stats. Identities = 24/90 (26%), Positives = 36/90 (40%), Gaps = 5/90 (5%) Query: 52 RIAGENGFPAWSEIPPPTHEEQIAAAELKKQQLINQANDYMNSKQWAGKAAIGRLKGEEL 111 R +NG P+ EE + A +KQ L+ +A + Q A +G EE Sbjct: 84 RWIYKNG---AIMQYEPSLEESVFLASQQKQLLLEEATAAIAPLQDAVD--LGIATDEER 138 Query: 112 AQYNLWLDYLDALELVDTSSAPDIEWPTPP 141 Q W +Y + VD ++ WP P Sbjct: 139 VQLKAWKEYRVEVNRVDIGLGENVNWPVKP 168 >UniRef50_D2U2G7 Phage tail fiber assembly protein n=1 Tax=Arsenophonus nasoniae RepID=D2U2G7_9ENTR Length = 177 Score = 87.1 bits (214), Expect = 2e-16, Method: Composition-based stats. Identities = 26/88 (29%), Positives = 36/88 (40%), Gaps = 2/88 (2%) Query: 54 AGENGFPAWSEIPPPTHEEQIAAAELKKQQLINQANDYMNSKQWAGKAAIGRLKGEELAQ 113 + W QI A KK QLIN+A +N + + +G + + Sbjct: 91 FDKWDGKKWVTDNQAVKAAQINTANEKKLQLINEAEQIINPLERKVRLGMG--NDIDAST 148 Query: 114 YNLWLDYLDALELVDTSSAPDIEWPTPP 141 W Y L +DTS APDI+WP P Sbjct: 149 LREWEIYSVKLNDIDTSIAPDIDWPEKP 176 >UniRef50_A7MLN9 Putative uncharacterized protein n=1 Tax=Cronobacter sakazakii ATCC BAA-894 RepID=A7MLN9_ENTS8 Length = 193 Score = 85.6 bits (210), Expect = 4e-16, Method: Composition-based stats. Identities = 23/93 (24%), Positives = 37/93 (39%), Gaps = 2/93 (2%) Query: 49 GKIRIAGENGFPAWSEIPPPTHEEQIAAAELKKQQLINQANDYMNSKQWAGKAAIGRLKG 108 G R W + Q A +K L+N+A++ ++ Q A + Sbjct: 103 GPPRAWEFWDGEKWLPDDKALKDIQQKEAVERKTMLMNEASNEISLLQDAVDLDM--ATE 160 Query: 109 EELAQYNLWLDYLDALELVDTSSAPDIEWPTPP 141 +E + + L +DT+SAPDI WP P Sbjct: 161 DETTRLLALKKFRVLLSRIDTTSAPDISWPIAP 193 >UniRef50_Q9KW02 Tail fiber assembly protein n=2 Tax=Pseudomonas aeruginosa RepID=Q9KW02_PSEAE Length = 146 Score = 85.2 bits (209), Expect = 5e-16, Method: Composition-based stats. Identities = 24/145 (16%), Positives = 47/145 (32%), Gaps = 11/145 (7%) Query: 2 NYIYSATTNSFYPLEMKEDYTQAGSWP--DDAVEVDEQVYIEFSGLPPKGKIRIAGENGF 59 + A T FY E P D+ +++ Y +GK + G Sbjct: 1 MIFFHAATGGFYSKE-----IHGSRMPLEDEMHPLEDAEYQALLRAQSEGKRIVTDHTGR 55 Query: 60 PAWSEIPPPTHEEQIAAAELKKQQLINQANDYMNSKQWAGKAAIGRLKGEELAQYNLWLD 119 P + P P + + + + + + + + + + +G+ Q Sbjct: 56 PICVDPPAPAKDILVQRERIWRDRQLQLTDGPLARHRD--EQDLGKTTTLSQEQLRELTL 113 Query: 120 YLDALELVDTSSA-PDIEW-PTPPA 142 Y L ++ PD+ P PPA Sbjct: 114 YRAVLRDWPIAAEFPDLNARPEPPA 138 >UniRef50_C6CP83 Tail assembly chaperone gp38 n=1 Tax=Dickeya zeae Ech1591 RepID=C6CP83_DICZE Length = 206 Score = 84.8 bits (208), Expect = 7e-16, Method: Composition-based stats. Identities = 26/101 (25%), Positives = 38/101 (37%), Gaps = 5/101 (4%) Query: 42 FSGLPPKGKIRIAGENGFPAWSEIPPPTHEEQIAAAELKKQQLINQANDYMNSKQWAGKA 101 + PP G W E + AA + A+D + +A + Sbjct: 110 LTLQPPAG---AFDRWDGEQWITDNEAYQESLLKAARQACETRRQTAHDRIRELTYAQE- 165 Query: 102 AIGRLKGEELAQYNLWLDYLDALELVDTSSAPDIEWPTPPA 142 +G +E W YL L +D S PDI+WPTPP+ Sbjct: 166 -LGMATEQETQSLKDWKIYLVQLSRIDLSLLPDIDWPTPPS 205 >UniRef50_A4TJD5 Phage tail fiber assembly protein n=22 Tax=Yersinia RepID=A4TJD5_YERPP Length = 139 Score = 84.8 bits (208), Expect = 8e-16, Method: Composition-based stats. Identities = 39/151 (25%), Positives = 59/151 (39%), Gaps = 21/151 (13%) Query: 1 MNYIYSATTNSFYPLEMKEDYTQAGSWPDDAVEVDE--QVYIEFSGLPPKGKIRIAGENG 58 M + ++ FY + + Y W D V Y P G E Sbjct: 1 MKKLNKLDSDGFYIEDYIDGYLPKN-WTADLVGDGYYKAQYQNADIDPDTG------EWT 53 Query: 59 FPAWSEIPPPT-------HEEQIAAAELKKQQLINQANDYMNSKQWAGKAAIGRLKGEEL 111 W+E P+ E + A+LKK +LI+ A+D + + + + Sbjct: 54 GGVWAETSGPSTIDISAQKAEFVTQAKLKKSKLISDASDRIEILKDRIELG-----QDRA 108 Query: 112 AQYNLWLDYLDALELVDTSSAPDIEWPTPPA 142 A+ LW Y AL+ +D S+APDIEWP P Sbjct: 109 AELKLWKSYRIALDDIDVSAAPDIEWPLKPE 139 >UniRef50_C7BSP4 Putative tail fiber protein of prophage cp-933x (Tail fiber assembl protein) n=1 Tax=Photorhabdus asymbiotica subsp. asymbiotica ATCC 43949 RepID=C7BSP4_PHOAA Length = 183 Score = 84.1 bits (206), Expect = 1e-15, Method: Composition-based stats. Identities = 24/71 (33%), Positives = 30/71 (42%), Gaps = 2/71 (2%) Query: 71 EEQIAAAELKKQQLINQANDYMNSKQWAGKAAIGRLKGEELAQYNLWLDYLDALELVDTS 130 EE E KK+QL+ +A + Q A I E W Y L +DTS Sbjct: 114 EEIKQGVESKKRQLMVEACTKIAPLQDAVDLDI--ATEAEKDALLAWKKYRVMLNRIDTS 171 Query: 131 SAPDIEWPTPP 141 A +IEWP P Sbjct: 172 QAYNIEWPEQP 182 >UniRef50_B5TK82 Conserved hypothetical phage protein n=1 Tax=Pseudomonas phage DVM-2008 RepID=B5TK82_9VIRU Length = 143 Score = 83.7 bits (205), Expect = 2e-15, Method: Composition-based stats. Identities = 24/139 (17%), Positives = 46/139 (33%), Gaps = 17/139 (12%) Query: 1 MNYIYSATTNSFYPLEMKEDYTQAGSWPDDAVEVDEQVYIEFSGLPPKGKIRIAGENGFP 60 M Y+ +T FY +GS P+DA+++ ++ Y P K + G P Sbjct: 1 MLIYYAQSTGGFYNSI-----DHSGSLPEDAIKITDEEYRTLFAAPFLNKRIESDAKGRP 55 Query: 61 AWSEIPPPTHEEQIAAAELKKQQLINQANDYMNSKQWAGKAAIGRLKGEELAQYNLWLDY 120 E+ ++ + + + + + + + G + QY Y Sbjct: 56 VLLELSVNELTVRMTNEKNWRDGSLTATDRLIA--RDRDEMDDGGGTTLDQTQYTQLQAY 113 Query: 121 LDALELVDTSSAPDIEWPT 139 AL +WP Sbjct: 114 RRALR----------DWPQ 122 >UniRef50_C6C6Z1 Tail assembly chaperone gp38 n=3 Tax=Dickeya RepID=C6C6Z1_DICDC Length = 204 Score = 83.3 bits (204), Expect = 2e-15, Method: Composition-based stats. Identities = 23/89 (25%), Positives = 32/89 (35%), Gaps = 2/89 (2%) Query: 54 AGENGFPAWSEIPPPTHEEQIAAAELKKQQLINQANDYMNSKQWAGKAAIGRLKGEELAQ 113 W Q+ A+ + I +A + +A I +E + Sbjct: 118 YDVWQDDGWVTDAQAQKTAQVEVAKAQLADDIAEAEKQITVLHYAVDLNI--ATEQETQR 175 Query: 114 YNLWLDYLDALELVDTSSAPDIEWPTPPA 142 W YL L VD S AP I+WPT PA Sbjct: 176 LADWKTYLVLLNRVDVSEAPSIDWPTIPA 204 >UniRef50_Q7N1H8 Similarities with lambda tail fiber assembly protein G n=1 Tax=Photorhabdus luminescens subsp. laumondii RepID=Q7N1H8_PHOLL Length = 258 Score = 81.8 bits (200), Expect = 6e-15, Method: Composition-based stats. Identities = 23/108 (21%), Positives = 34/108 (31%), Gaps = 7/108 (6%) Query: 31 AVEVDEQVYIEFSGLPPKGKIRIAGENGFP-----AWSEIPPPTHEEQIAAAELKKQQLI 85 A + + + I G P + + F W Q AE +K Sbjct: 92 ATDTGQPINITAIGALPDNLTLQSPQTPFDKRENKQWVTDKSALKSHQTEQAEQQKIARQ 151 Query: 86 NQANDYMNSKQWAGKAAIGRLKGEELAQYNLWLDYLDALELVDTSSAP 133 QA+ + Q A I E + W Y + VD S+AP Sbjct: 152 QQADAAIKPLQDAIDLDI--ATDAEKSALVEWKKYRVRVNRVDLSTAP 197 >UniRef50_Q3ZL13 Tail fiber assembly protein n=1 Tax=Escherichia blattae RepID=Q3ZL13_ESCBL Length = 291 Score = 81.4 bits (199), Expect = 8e-15, Method: Composition-based stats. Identities = 26/126 (20%), Positives = 41/126 (32%), Gaps = 4/126 (3%) Query: 17 MKEDYTQAGSW-PDDAVEVDEQVYIEFSGLPPKGKIRIAGENGFPAWSEIPPPTHEEQIA 75 +D W D + + + W + E + A Sbjct: 169 YIKDMRGLTVWNTTDKAPLTISELGPVPDGYTQLVPGEFDQWDGNTWVKDISAESEYKQA 228 Query: 76 AAELKKQQLINQANDYMNSKQWAGKAAIGRLKGEELAQYNLWLDYLDALELVDTSSAPDI 135 AE K L+ +A+ + +A G+ EE A+ W Y A+ DT + DI Sbjct: 229 QAEQHKASLLTEASQQIAVLSYAVD--SGQATEEESARLARWQVYRLAVNRTDT-TLNDI 285 Query: 136 EWPTPP 141 WP P Sbjct: 286 TWPEKP 291 >UniRef50_Q7N2R7 Similar to phage tail fiber assembly protein n=1 Tax=Photorhabdus luminescens subsp. laumondii RepID=Q7N2R7_PHOLL Length = 170 Score = 81.0 bits (198), Expect = 1e-14, Method: Composition-based stats. Identities = 31/136 (22%), Positives = 54/136 (39%), Gaps = 8/136 (5%) Query: 12 FYPLEMKEDYTQAGSWPDDAVEVDE--QVYIEFSGLPPK---GKIRIAGENGFPAW-SEI 65 F +K + + G + ++ +++ + L P+ +I I F Sbjct: 36 FLEETIKFTFNKDGVITSISKDISSLFPIHLSVAELAPEDVPTEIIITDGWFFDGEKIIK 95 Query: 66 PPPTHEEQIAAAELKKQQLINQANDYMNSKQWAGKAAIGRLKGEELAQYNLWLDYLDALE 125 T EEQ + +K + + AN+ + Q A +G +E A W Y +L Sbjct: 96 RIYTLEEQRQQEKNQKSEKMIAANEVIQPLQDAID--LGIATNKEKALLLEWKRYRVSLN 153 Query: 126 LVDTSSAPDIEWPTPP 141 +DTS A +I WP P Sbjct: 154 RIDTSLASEIIWPEQP 169 >UniRef50_C6C5D3 Tail assembly chaperone gp38 n=1 Tax=Dickeya dadantii Ech703 RepID=C6C5D3_DICDC Length = 208 Score = 80.6 bits (197), Expect = 1e-14, Method: Composition-based stats. Identities = 26/91 (28%), Positives = 37/91 (40%), Gaps = 2/91 (2%) Query: 53 IAGENGFPAWSEIPPPTHEEQIAAAELKKQQLINQANDYMNSKQWAGKAAIGRLKGEELA 112 W I AA+ ++++ AND +N +A +G EE + Sbjct: 120 AFDVWRDGRWVTDDAAKINAAIQAAKAEQEKRRRSANDRLNELTYAIN--LGIATPEEAS 177 Query: 113 QYNLWLDYLDALELVDTSSAPDIEWPTPPAV 143 + W YL L VD APDI WPT P + Sbjct: 178 ALSSWQAYLVLLSRVDFGHAPDIVWPTEPGM 208 >UniRef50_C6CP85 Putative phage tail fibre protein n=2 Tax=Dickeya zeae Ech1591 RepID=C6CP85_DICZE Length = 125 Score = 80.2 bits (196), Expect = 2e-14, Method: Composition-based stats. Identities = 29/142 (20%), Positives = 49/142 (34%), Gaps = 22/142 (15%) Query: 4 IYSATTNSFYPLEMKEDYTQAGSWPDDAVEVDEQVYIEFSGLPPKGKIRIAGENG-FPAW 62 YS +T+ FY E + PDDA+E+ + Y +G + I E+ P Sbjct: 2 FYSKSTSGFYSDE-----INGVNIPDDAIEIRDDYYQYLLDQQVRGNVIIFDESTKKPIA 56 Query: 63 SEIPPPTHEEQIAAAELKKQQLINQANDYMNSKQWAGKAAIGRLKGEELAQYNLWLDYLD 122 P + + A ++ L+ A+D+ W Y + Sbjct: 57 VTPVPLSDTQLAEDARRQRDNLL-TASDWTQVSDAPVDQQ-------------AWRTYRE 102 Query: 123 ALELVDTSS--APDIEWPTPPA 142 L V + +I WP+ P Sbjct: 103 ILRQVPEQAGFPLNIAWPSQPE 124 >UniRef50_B7UG05 Predicted tail fiber assembly protein n=1 Tax=Escherichia coli O127:H6 str. E2348/69 RepID=B7UG05_ECO27 Length = 122 Score = 79.8 bits (195), Expect = 2e-14, Method: Composition-based stats. Identities = 34/144 (23%), Positives = 58/144 (40%), Gaps = 26/144 (18%) Query: 1 MNYIYSATTNSFYPLEMKEDYTQAGSWPDDAVEVDEQVYIEFSGLPPKGKIRIAGENGFP 60 M Y N Y + TQ + V + +E Sbjct: 4 MMKYYKDENNVVYAYDAY--GTQDAFIKEGLVPITRSEAMEIIN---------------- 45 Query: 61 AWSEIPPPTHEEQIAAAELKKQQLINQANDYMNSKQWAGKAAIGRLKGEELAQYNLWLDY 120 PPPTHE+ I AAE ++Q+L++ A+ M W + +G + A+ + WL Y Sbjct: 46 -----PPPTHEQLIQAAENERQRLLSAADAIM--LDWRTELMLGEISDANRAKLSAWLLY 98 Query: 121 LDALELVDTSSAPD-IEWPTPPAV 143 + ++ VD ++ P+ + WP P + Sbjct: 99 KNQVKAVDVTTDPEHVNWPVIPEL 122 >UniRef50_C5AKX8 Putative uncharacterized protein n=1 Tax=Burkholderia glumae BGR1 RepID=C5AKX8_BURGB Length = 142 Score = 79.8 bits (195), Expect = 3e-14, Method: Composition-based stats. Identities = 31/144 (21%), Positives = 56/144 (38%), Gaps = 5/144 (3%) Query: 1 MNYIYSATTNSFYPLEMKEDYTQAGSWPDDAVEVDEQVYIEFSGLPPKGKIRIAGENGFP 60 M+ IY+A + +D + + P +++ E GK+ + P Sbjct: 1 MSVIYAAFDTQRHICAFGDDQSMPEALP--FIKISEDEQAWLLEGAANGKVMAVDDKERP 58 Query: 61 AWSEIPPPTHEEQIAAAELKKQQLINQANDYMNSKQWAGKAAIGRLKGEELAQYNLWLDY 120 + PP+ ++ A + L+ +A+ + Q A +G E A W+ Y Sbjct: 59 ILLDPAPPSPDQIRARNTAYRDWLLERASVALTPLQTA--MLLGNATEAEKALARQWIVY 116 Query: 121 LDALELVDTSSAPDIEWPTPPAVQ 144 AL+ VD A +WP PA+ Sbjct: 117 ARALKKVDLGVAL-PDWPAAPAIA 139 >UniRef50_Q31HT5 Putative uncharacterized protein n=1 Tax=Thiomicrospira crunogena XCL-2 RepID=Q31HT5_THICR Length = 194 Score = 79.4 bits (194), Expect = 3e-14, Method: Composition-based stats. Identities = 31/97 (31%), Positives = 52/97 (53%), Gaps = 4/97 (4%) Query: 1 MNYIYSATTNSFYPLEMKEDYTQAGSWPDDAVEVDEQVYIEF-SGLPPKGKIRIAGENGF 59 M YSAT N+F+ +K DY Q SWP DA+++ + + P+GK A NG Sbjct: 1 MTIHYSATKNAFFDDALKSDYEQFNSWPSDAIKMTDAEVSTYHGKQSPQGKQLGADANGR 60 Query: 60 PAWSEIPPPTHEEQIAAAELKKQQLINQANDYMNSKQ 96 P W ++PPP+ + A K +Q+ ++A ++++ Sbjct: 61 PIWVDLPPPSLGDANA---AKSKQINDEAQKFIDANM 94 >UniRef50_C4KQ09 Putative uncharacterized protein n=9 Tax=Burkholderia pseudomallei RepID=C4KQ09_BURPS Length = 150 Score = 79.4 bits (194), Expect = 3e-14, Method: Composition-based stats. Identities = 27/113 (23%), Positives = 49/113 (43%), Gaps = 3/113 (2%) Query: 33 EVDEQVYIEFSGLPPKGKIRIAGENGFPAWSEIPPPTHEEQIAAAELKKQQLINQANDYM 92 E+ ++ + +GK +NG P + PPPT E+ I + + +L+ +A+ + Sbjct: 33 EITDEQWKMLLDGESRGKRMALDDNGVPVLLDPPPPTIEQIIVSNTAMRDRLLERASVAL 92 Query: 93 NSKQWAGKAAIGRLKGEELAQYNLWLDYLDALELVDTSSAPDIEWPTPPAVQA 145 Q A +G E Q W+ Y AL+ +D + + WP P + Sbjct: 93 TPLQTA--IMLGDATDSEAQQARAWIAYTRALKGIDLTR-REPTWPEQPEMAR 142 >UniRef50_A9I964 Putative phage tail fibre protein n=1 Tax=Bordetella petrii DSM 12804 RepID=A9I964_BORPD Length = 155 Score = 78.7 bits (192), Expect = 5e-14, Method: Composition-based stats. Identities = 31/122 (25%), Positives = 53/122 (43%), Gaps = 12/122 (9%) Query: 4 IYSATTNSFYPLEMKEDYTQAGSWPDDAVEVDEQVYIEFSGLPPKGKIRIAGENGFPAWS 63 YS T FY +M + P DAVE+ +++Y + GK +A E+GFPA + Sbjct: 2 FYSVETGGFYSAKM-----HGKAMPADAVEITDELYSQLLAGQSDGKRIVADESGFPALA 56 Query: 64 EIPPPTHEEQIAAAELKKQQLINQA------NDYMNSKQWAGKAAIGRLKGEELAQYNLW 117 + PPT + Q+ ++ A +D N+ +A + A+ + E + W Sbjct: 57 DPLPPTPAQIEVQKVAVVQKHMDDAARALRYDDIANAVTYAEEPAVPKF-QAEGQAFREW 115 Query: 118 LD 119 Sbjct: 116 RS 117 >UniRef50_C4UND1 Conserved hypothetical phage tail fiber protein n=2 Tax=Enterobacteriaceae RepID=C4UND1_YERRU Length = 165 Score = 77.9 bits (190), Expect = 1e-13, Method: Composition-based stats. Identities = 41/141 (29%), Positives = 66/141 (46%), Gaps = 10/141 (7%) Query: 2 NYIYSATTNSFYPLEMKEDYTQAGSWPDDAVEVDEQVYIEFSGLPPKGKIRIAGENGFPA 61 YIYSA N+F+P++ +DY+ DAVEV + V +EF G P GK+R AG +G P Sbjct: 3 TYIYSAKNNAFFPVDYLDDYSHWDL--SDAVEVSDGVAMEFMGGAPIGKVRAAGVDGHPC 60 Query: 62 WSEIPP--PTHEEQIAAAELKKQQLINQANDYMNSKQWAGKAAIGRLKGEELAQYNLWL- 118 W++ PP P ++++A+ + + A D M ++ L + + + Sbjct: 61 WTDKPPALPLSDDELASLARQYRDAFIVATDNMMVSDYSIDDI--PLTSTQRTELTVTRA 118 Query: 119 DYLDALELVDTSSAPDIEWPT 139 Y + P IE P Sbjct: 119 AYRSWPT---LAGWPLIELPV 136 >UniRef50_A1JMQ0 Phage tail fiber assembly protein n=5 Tax=Yersinia RepID=A1JMQ0_YERE8 Length = 204 Score = 77.9 bits (190), Expect = 1e-13, Method: Composition-based stats. Identities = 30/113 (26%), Positives = 45/113 (39%), Gaps = 8/113 (7%) Query: 32 VEVDEQVYIEFSGLPPKGKIRI-----AGENGFPAWSEIPPPTHEEQIAAAELKKQQLIN 86 +E + I G PK K I AW I AA +K LIN Sbjct: 92 IETKYESIIFALGPIPKNKTLIQPMHEFDIWTGTAWEVDQQALKARHITAAVQQKTALIN 151 Query: 87 QANDYMNSKQWAGKAAIGRLKGEELAQYNLWLDYLDALELVDTSSAPDIEWPT 139 Q ++++N A + ++ Q + Y AL +D ++APDI+WP Sbjct: 152 QVSEHINILLDAIAID---NQQTDIQQLAAFKQYRVALMRIDPNTAPDIDWPE 201 >UniRef50_B1JB16 Putative uncharacterized protein n=1 Tax=Pseudomonas putida W619 RepID=B1JB16_PSEPW Length = 149 Score = 77.9 bits (190), Expect = 1e-13, Method: Composition-based stats. Identities = 30/138 (21%), Positives = 49/138 (35%), Gaps = 19/138 (13%) Query: 3 YIYSATTNSFYPLEMKEDYTQAGSWPDDAVEVDEQVYIEFSGLPPKGKIRIAGENGFPAW 62 + S +T FY A S P DAVE+ E + G++ + G++ P Sbjct: 5 FFASKSTRGFYSS------DSASSIPVDAVEITEAYRNQLLEGERAGRVIVWGDS-EPFL 57 Query: 63 SEIPPPTHEEQIAAAELKKQQLINQANDYMNSKQWAGKAAIGRLKGEELAQYNLWLDYLD 122 + PPPT EE + + + + + + IG AQ++ L+Y Sbjct: 58 EDPPPPTGEELAVVERRWRDMQLLATDGIVARHRD--ERDIGGPTTLNTAQFSELLEYRQ 115 Query: 123 ALELVDTSSAPDIEWPTP 140 L WP Sbjct: 116 DLR----------NWPQA 123 >UniRef50_D1UEY6 Putative uncharacterized protein n=2 Tax=Burkholderia RepID=D1UEY6_9BURK Length = 147 Score = 76.4 bits (186), Expect = 2e-13, Method: Composition-based stats. Identities = 26/113 (23%), Positives = 46/113 (40%), Gaps = 3/113 (2%) Query: 31 AVEVDEQVYIEFSGLPPKGKIRIAGENGFPAWSEIPPPTHEEQIAAAELKKQQLINQAND 90 AV++ ++ + +GK NG PA + PPT +Q ++ + + Sbjct: 31 AVQITTALWRKLIDGQGQGKRIALDANGMPALFDPLPPTAAQQAELMRGRRDAALQATDW 90 Query: 91 YMNSKQWAGKAAIGRLKGEELAQYNLWLDYLDALELV-DTSSAPDIEWPTPPA 142 + Q + IG Q+ + L+Y AL + D + PDI+ P P Sbjct: 91 LVARHQD--ETLIGAGTTLTAEQFVVLLNYRQALRELADAEAWPDIDLPAAPD 141 >UniRef50_Q9ZXK5 Orf21 n=2 Tax=root RepID=Q9ZXK5_9CAUD Length = 148 Score = 76.4 bits (186), Expect = 3e-13, Method: Composition-based stats. Identities = 25/141 (17%), Positives = 45/141 (31%), Gaps = 20/141 (14%) Query: 2 NYIYSATTNSFYPLEMKEDYTQAGS--WPDDAVEVDEQVYIEFSGLPPKGKIRIAGENGF 59 + Y T FY E+ + A S WP + ++ Y +G + +AGE+G Sbjct: 1 MFFYCPKTGGFYSPEVHGEQMPAESELWP-----LTDEEYEALLDAQGQGLLIVAGEDGQ 55 Query: 60 PAWSEIPPPTHEEQIAAAELKKQQLINQANDYMNSKQWAGKAAIGRLKGEELAQYNLWLD 119 P + PP E + + ++ + + + + QY Sbjct: 56 PVATPPPPLGDEALATIERDWRDRQLDDTDALVARHRDELEVG---TTTLSTEQYQALQA 112 Query: 120 YLDALELVDTSSAPDIEWPTP 140 Y L +WP Sbjct: 113 YRRQLR----------DWPES 123 >UniRef50_Q7P0Y3 Probable tail fiber assembly protein n=1 Tax=Chromobacterium violaceum RepID=Q7P0Y3_CHRVO Length = 146 Score = 75.2 bits (183), Expect = 7e-13, Method: Composition-based stats. Identities = 35/134 (26%), Positives = 52/134 (38%), Gaps = 12/134 (8%) Query: 1 MNYIYSATTNSFYPLEMKEDYTQAGSWPDDAVEVDEQVYIEFSGLPPKGKIRIAGENGFP 60 M YSA T FY E A +P DAV+V+ VY GK+ A NG P Sbjct: 1 MTIFYSAGTGGFYDSE-----IHAEGYPADAVQVEASVYEALFRGQEAGKLIQADGNGCP 55 Query: 61 AWSEIPPPTHEEQIAAAELKKQQLI--NQANDYMNSKQWAGKAAIGRLKGEELAQYNLWL 118 + P + E+Q A Q I +A+ + ++ +G + + Sbjct: 56 VLVDGPALSIEQQRQARIAHCQGEIGRLEADQHRAVREL-LTLMLGGAIPTDALRTEA-- 112 Query: 119 DYLDALELVDTSSA 132 L+ VDT+ A Sbjct: 113 --GQKLQQVDTAIA 124 >UniRef50_Q3KH46 Putative phage related protein n=2 Tax=root RepID=Q3KH46_PSEPF Length = 188 Score = 74.4 bits (181), Expect = 1e-12, Method: Composition-based stats. Identities = 24/113 (21%), Positives = 37/113 (32%), Gaps = 8/113 (7%) Query: 35 DEQVYIEFSGLPPKGKIRIAG----ENGFPAWSEIPPPTHEEQIAAAELKKQQLINQAND 90 +Q++ E LP + AW ++ L+ +A Sbjct: 72 GQQIWSELGELPDTLTTQPWPGEFHVWRGDAWQLDEQARLLSISQQMLEQRDTLLREAVL 131 Query: 91 YMNSKQWAGKAAIGRLKGEELAQYNLWLDYLDALELVD--TSSAPDIEWPTPP 141 + Q+A IG EE Q W Y L +D T +I WP+ P Sbjct: 132 RIAPLQYAED--IGDATHEEQMQLLEWKLYSVELNRIDKQTGFPREITWPSLP 182 >UniRef50_P77326 Putative tail fiber assembly protein homolog from prophage CPS-53 n=30 Tax=Enterobacteriaceae RepID=TFAS_ECOLI Length = 114 Score = 74.4 bits (181), Expect = 1e-12, Method: Composition-based stats. Identities = 28/91 (30%), Positives = 43/91 (47%), Gaps = 1/91 (1%) Query: 53 IAGENGFPAWSEIPPPTHEEQIAAAELKKQQLINQANDYMNSKQWAGKAAIGRLKGEELA 112 + W H + AAE ++Q LI+ A ++ Q +A +L E + Sbjct: 25 PYDKWDGEKWVTDTEAQHSVAVDAAEAQRQSLIDTAMASISLIQLKLQAG-RKLMQAETS 83 Query: 113 QYNLWLDYLDALELVDTSSAPDIEWPTPPAV 143 + N LDY+DA+ DTS+APD+ WP P Sbjct: 84 RLNTVLDYIDAVTATDTSTAPDVIWPELPEE 114 >UniRef50_C1D954 HsdM n=1 Tax=Laribacter hongkongensis HLHK9 RepID=C1D954_LARHH Length = 283 Score = 74.1 bits (180), Expect = 1e-12, Method: Composition-based stats. Identities = 26/120 (21%), Positives = 41/120 (34%), Gaps = 18/120 (15%) Query: 4 IYSATTNSFYPLEMKEDYTQAGSWPDDAVEVDEQVYIEFSGLPPKGKIRIAGENGFPAWS 63 YSATT FY D+ + P DA E+ + + G+I A E G+P + Sbjct: 2 FYSATTCGFY------DHYSNNAIPADAGEITSEQHAALLAGQSDGRIITANEQGYPVLT 55 Query: 64 EIPPPTHEEQIAAA--------ELKKQQLINQANDY----MNSKQWAGKAAIGRLKGEEL 111 + PP T + A + ++ I A + Q + E Sbjct: 56 DPPPATLDTLRDCALLMLPAWEKAERTSGIEHAGQRWLTTSAALQDIRDVLLAGAVLGEQ 115 >UniRef50_Q3KH44 Putative phage tail assembly protein n=1 Tax=Pseudomonas fluorescens Pf0-1 RepID=Q3KH44_PSEPF Length = 146 Score = 73.3 bits (178), Expect = 2e-12, Method: Composition-based stats. Identities = 30/146 (20%), Positives = 51/146 (34%), Gaps = 15/146 (10%) Query: 1 MNYIYSATTNSFYPLEMKEDYTQAGSWPDDAVEVDEQVYIEFSGLPPKGKIRIAGENGFP 60 M + A T F E+ + P+ AVE+ Y E GK+ A +G P Sbjct: 1 MAIYFHAQTRGF---ELVDSPYPEP--PEGAVEITRAQYAELFAGQASGKVISASASGQP 55 Query: 61 AWSEIPPPTHEEQIAAAELKKQQLINQANDYMNSKQWAGKAAIGRLKGEELAQYNLWLDY 120 ++ + +A+ E + + Q ++ + A + +G ++ L Y Sbjct: 56 VLNDP--VISPQALASRERAWRDNVLQDTQWLVW-RDAEELEVGEGTTLRTEEFKQLLAY 112 Query: 121 LDALELVDTSSAPDIEWPTPPAVQAR 146 AL P P QAR Sbjct: 113 RQALRDWPND-------PEFPDAQAR 131 >UniRef50_Q4ZMK8 Putative uncharacterized protein n=4 Tax=Pseudomonas syringae group RepID=Q4ZMK8_PSEU2 Length = 188 Score = 73.3 bits (178), Expect = 2e-12, Method: Composition-based stats. Identities = 21/94 (22%), Positives = 34/94 (36%), Gaps = 4/94 (4%) Query: 51 IRIAGENGFPAWSEIPPPTHEEQIAAAELKKQQLINQANDYMNSKQWAGKAAIGRLKGEE 110 W+ +AA L + Q + +A + Q+A A +G E Sbjct: 92 PGKYYVWREGDWALDKEAQRVALASAALLVRDQRLQEAATRIVPLQYA--ADLGDATEAE 149 Query: 111 LAQYNLWLDYLDALELVD--TSSAPDIEWPTPPA 142 A W Y L ++ + IEWP+PP+ Sbjct: 150 KASLLEWKRYSVKLNRIEQFSDYPLQIEWPSPPS 183 >UniRef50_D0KLI7 Tail assembly chaperone gp38 n=2 Tax=Enterobacteriaceae RepID=D0KLI7_PECWW Length = 209 Score = 72.5 bits (176), Expect = 4e-12, Method: Composition-based stats. Identities = 26/116 (22%), Positives = 39/116 (33%), Gaps = 8/116 (6%) Query: 33 EVDEQVYIEFSGLPPKGKIRIAGENGFPAWSEIPPPTHEEQIAAAE-----LKKQQLINQ 87 E + + G P+ +A F W T+ E A + Sbjct: 97 ETRQSQAVMQFGELPENMTFLAPATEFDQWDGTTWVTNVEAQQLAATKNLQQELAARRAT 156 Query: 88 ANDYMNSKQWAGKAAIGRLKGEELAQYNLWLDYLDALELVDTSSAPDIEWPTPPAV 143 AN + +A + EE Q W YL AL +D +A + WP P+V Sbjct: 157 ANSRITELSYAVD--LAIATDEEQEQLTQWKIYLVALSRIDL-TAVSVVWPEAPSV 209 >UniRef50_Q9B026 Probable tail fiber assembly protein n=1 Tax=Phage GMSE-1 RepID=Q9B026_9VIRU Length = 147 Score = 72.5 bits (176), Expect = 4e-12, Method: Composition-based stats. Identities = 27/113 (23%), Positives = 40/113 (35%), Gaps = 8/113 (7%) Query: 31 AVEVDEQVYIEFSGLPPKGKIRIAGENGFPAWSEIPPPTHEEQIAAAELKKQQLINQAND 90 A++ + Y G GK P E A +K+Q A D Sbjct: 41 AIDASNEPYAMIGGRYEDGKFIPVPP------PLPEPLPPEFLREQAMGEKRQRDAAARD 94 Query: 91 YMNSKQWAGKAAIGRLKGEELAQYNLWLDYLDALELVDTSSAPDIEWPTPPAV 143 + ++ + + + E + W Y L D S+APDI WPTPP Sbjct: 95 AIALLEYVIELDMQQG--GEAKKLRAWKKYRVLLNRADISAAPDIYWPTPPDE 145 >UniRef50_Q7Y3Y8 Tail fiber assembly protein n=4 Tax=root RepID=Q7Y3Y8_9CAUD Length = 135 Score = 72.1 bits (175), Expect = 5e-12, Method: Composition-based stats. Identities = 26/100 (26%), Positives = 42/100 (42%), Gaps = 4/100 (4%) Query: 46 PPKGKIRIAGENGFPAWSEIPPPTHEEQIAAAELKKQQLINQANDYMNSKQWAGK---AA 102 P G AGENG P +E I A K+ ++ A+D + + + + Sbjct: 35 PEDGINYYAGENG-EWLVGPAPQVVQEMIIEATQKQIAALSYASDIIGAIADEIEGLEDS 93 Query: 103 IGRLKGEELAQYNLWLDYLDALELVDTSSAPDIEWPTPPA 142 + + W Y ++ +D S+AP+IEWP PP Sbjct: 94 EEDVPDKLRTDLKAWKQYRVKVKNIDVSNAPNIEWPVPPE 133 >UniRef50_Q7NAA2 Complete genome; segment 1/17 n=1 Tax=Photorhabdus luminescens subsp. laumondii RepID=Q7NAA2_PHOLL Length = 113 Score = 71.7 bits (174), Expect = 7e-12, Method: Composition-based stats. Identities = 20/77 (25%), Positives = 31/77 (40%), Gaps = 3/77 (3%) Query: 66 PPPTHEEQIAAAELKKQQLINQANDYMNSKQWAGKAAIGRLKGEELAQYNLWLDYLDALE 125 E++ +K L+ QA ++ Q A +G E+ Y AL Sbjct: 40 QQAIIEQKNKTYHRQKYYLLEQATIKISPLQDAID--LGIATDSEITMLMELKKYRVALN 97 Query: 126 LVDTSSAPDIEWPTPPA 142 +DT +A DI+WP P Sbjct: 98 RMDT-TAKDIKWPEKPE 113 >UniRef50_D1P8C4 Tail fiber assembly protein n=1 Tax=Providencia rustigianii DSM 4541 RepID=D1P8C4_9ENTR Length = 117 Score = 70.6 bits (171), Expect = 1e-11, Method: Composition-based stats. Identities = 24/77 (31%), Positives = 36/77 (46%), Gaps = 2/77 (2%) Query: 66 PPPTHEEQIAAAELKKQQLINQANDYMNSKQWAGKAAIGRLKGEELAQYNLWLDYLDALE 125 PP + E+ IA AE KKQ L+++A ++ + + + +E W Y Sbjct: 43 PPVSKEQHIAEAEQKKQFLLDEAERHIAILERKVRLEM--ATDDEKDLLTAWEIYSVKTA 100 Query: 126 LVDTSSAPDIEWPTPPA 142 DTS APDI+W P Sbjct: 101 DADTSKAPDIDWGVKPE 117 >UniRef50_C6DE09 Tail assembly chaperone gp38 n=5 Tax=Enterobacteriaceae RepID=C6DE09_PECCP Length = 206 Score = 70.6 bits (171), Expect = 2e-11, Method: Composition-based stats. Identities = 25/117 (21%), Positives = 43/117 (36%), Gaps = 8/117 (6%) Query: 33 EVDEQVYIEFSGLPPKGKIRIAGENGFPAWSEIPPPTHEEQIAAA--ELKKQQL---INQ 87 E + I G P + + + F W + T E A KK +L ++Q Sbjct: 93 ETRQPQKITDIGELPANQTLLVPTSEFDKWEDGKWVTDLEAQRQALIANKKVELNTKLSQ 152 Query: 88 ANDYMNSKQWAGKAAIGRLKGEELAQYNLWLDYLDALELVDTSSAPDIEWPTPPAVQ 144 A++ + A + + EE + W Y L VD +S ++ P P + Sbjct: 153 ASERIQVLSDAVELNL--ATEEEKNELKAWKTYRLQLSRVDVNSFEEV-LPNLPNLN 206 >UniRef50_C4K4X4 Phage tail assembly chaperone n=2 Tax=Candidatus Hamiltonella defensa 5AT (Acyrthosiphon pisum) RepID=C4K4X4_HAMD5 Length = 178 Score = 70.6 bits (171), Expect = 2e-11, Method: Composition-based stats. Identities = 18/79 (22%), Positives = 33/79 (41%), Gaps = 3/79 (3%) Query: 63 SEIPPPTHEEQIAAAELKKQQLINQANDYMNSKQWAGKAAIGRLKGEELAQYNLWLDYLD 122 T E+ A +K++ +++A+ + A + +G +E+ W Y Sbjct: 101 VVRGTYTEAERTRQAAREKEKWMDRASKAIGPLADAVE--LGIATEQEVQALKNWKAYRV 158 Query: 123 ALELVDTSSAPDIEWPTPP 141 AL +D A +I WP P Sbjct: 159 ALHRLDP-KAGEITWPEVP 176 >UniRef50_B4TI69 Putative phage tail fiber assembly protein n=13 Tax=Salmonella enterica subsp. enterica RepID=B4TI69_SALHS Length = 176 Score = 70.2 bits (170), Expect = 2e-11, Method: Composition-based stats. Identities = 17/77 (22%), Positives = 32/77 (41%), Gaps = 2/77 (2%) Query: 68 PTHEEQIAAAELKKQQLINQANDYMNSKQWAGKAAIGRLKGEELAQYNLWLDYLDALELV 127 P + +A A ++ + + +N A + + EL + + + L + Sbjct: 102 PVQVDYVALATAERDRRMASVTSKINQLMEAQDDS--DITDAELVELSDLREVRTKLRRL 159 Query: 128 DTSSAPDIEWPTPPAVQ 144 D + APDI+WP P V Sbjct: 160 DLTGAPDIDWPEVPDVA 176 >UniRef50_A9DEL3 Tail fiber related protein n=1 Tax=Yersinia phage PY100 RepID=A9DEL3_9CAUD Length = 185 Score = 70.2 bits (170), Expect = 2e-11, Method: Composition-based stats. Identities = 24/104 (23%), Positives = 39/104 (37%), Gaps = 8/104 (7%) Query: 38 VYIEFSGLPPKGKIRIAGENGFPAWSEIPPPTHEEQIAAAELKKQQLINQANDYMNSKQW 97 G+ I+ P + E I AA +K+QLI + + ++ + Sbjct: 86 ETWSLPDDFELGQFVISD-----GAIVRQMPDNSEIIDAARERKRQLIEEVSLEIDVLKD 140 Query: 98 AGKAAIGRLKGEELAQYNLWLDYLDALELVDTSSAPDIEWPTPP 141 A + +G L E + +Y L VD S +I WP P Sbjct: 141 AEE--LGDLTPREAQRLAALKNYRVELMRVDISKDGEI-WPIKP 181 >UniRef50_P26699 Probable tail fiber assembly protein n=56 Tax=root RepID=TFA_BPP2 Length = 175 Score = 69.4 bits (168), Expect = 3e-11, Method: Composition-based stats. Identities = 30/137 (21%), Positives = 51/137 (37%), Gaps = 13/137 (9%) Query: 12 FYPLEMKEDYTQAG---SWPDDAVEVDEQVYIEFSGLPPKGKIRIAGENGF----PAWSE 64 F P +K Y + + DA + +P R A ++G Sbjct: 44 FQPDTIKIVYDENNIIVAITRDASTL-NPEGFSVVEVPDITSNRRADDSGKWMFKDGAVV 102 Query: 65 IPPPTHEEQIAAAELKKQQLINQANDYMNSKQWAGKAAIGRLKGEELAQYNLWLDYLDAL 124 T +EQ AE +K L+++A + + + A + + EE A+ W Y + Sbjct: 103 KRIYTADEQQQQAESQKAALLSEAENVIQPLERAVR--LNMATDEERARLESWERYSVLV 160 Query: 125 ELVDTSSAPDIEWPTPP 141 VD ++ EWP P Sbjct: 161 SRVDPANP---EWPEMP 174 >UniRef50_C5BH15 Putative Tail fiber assembly protein-like protein n=1 Tax=Edwardsiella ictaluri 93-146 RepID=C5BH15_EDWI9 Length = 206 Score = 69.4 bits (168), Expect = 3e-11, Method: Composition-based stats. Identities = 13/91 (14%), Positives = 22/91 (24%), Gaps = 2/91 (2%) Query: 53 IAGENGFPAWSEIPPPTHEEQIAAAELKKQQLINQANDYMNSKQWAGKAAIGRL-KGEEL 111 W + A KK L+ A + Sbjct: 117 PYDVWNGEKWVTDTRQQQQHITANHLRKKNALLETATQRIEILMDKISLTATDTPTQTIQ 176 Query: 112 AQYNLWLDYLDALELVDTSSAPDIEWPTPPA 142 + W Y ++ + + P I+WP P Sbjct: 177 ERLLAWRKYRAQVDDISADT-PHIDWPAMPE 206 >UniRef50_B4TML3 Caudovirales tail fibre assembly protein n=16 Tax=root RepID=B4TML3_SALSV Length = 191 Score = 68.3 bits (165), Expect = 7e-11, Method: Composition-based stats. Identities = 21/82 (25%), Positives = 33/82 (40%), Gaps = 5/82 (6%) Query: 63 SEIPPPTHEEQIAAAELKKQQLINQANDYMNSKQWAGKAAIGRLKGEELAQYNLWLDYLD 122 + EE AE +K + +++A + A K I EE+ + W Y Sbjct: 115 VVQRGYSPEELRKKAEAEKIRRLSEAESAIAPLARAVKLKI--ATDEEIKRLEAWELYSV 172 Query: 123 ALELVDTSSAPDIEWPTPPAVQ 144 + VDT+S +WP P V Sbjct: 173 MVNRVDTASP---DWPEVPDVA 191 >UniRef50_UPI00016A4B8D hypothetical protein BthaT_33832 n=1 Tax=Burkholderia thailandensis TXDOH RepID=UPI00016A4B8D Length = 147 Score = 67.9 bits (164), Expect = 1e-10, Method: Composition-based stats. Identities = 31/141 (21%), Positives = 48/141 (34%), Gaps = 10/141 (7%) Query: 6 SATTNSFYPLEMKEDYTQAGSWPDDAVEVDEQVYIEFSGLPPKGKIRIAGENGFPAWSEI 65 S SF K D VE+ + + GK E+G P E Sbjct: 13 SRRIRSFCDETSKPDGLA-------FVEITPRQHAMLLEGAAAGKTVAVTEDGHPILLEP 65 Query: 66 PPPTHEEQIAAAELKKQQLINQANDYMNSKQWAGKAAIGRLKGEELAQYNLWLDYLDALE 125 T + A + + + + Q + A+G +Y L Y AL Sbjct: 66 EKQTRAQLADAKRAARDAALVATDWLTSRHQD--EKALGDGTTLMPDEYAQLLRYRQALR 123 Query: 126 LV-DTSSAPDIEWPTPPAVQA 145 + D + P+++ PTPPA A Sbjct: 124 NLGDATGWPNVDLPTPPACVA 144 >UniRef50_B4T2D9 Gp20 n=17 Tax=root RepID=B4T2D9_SALNS Length = 184 Score = 67.5 bits (163), Expect = 1e-10, Method: Composition-based stats. Identities = 21/82 (25%), Positives = 33/82 (40%), Gaps = 5/82 (6%) Query: 63 SEIPPPTHEEQIAAAELKKQQLINQANDYMNSKQWAGKAAIGRLKGEELAQYNLWLDYLD 122 + EE AE +K + + +A + A K I EE+ + + W Y Sbjct: 108 VVQRVYSPEELRKKAEDEKVRRLAEAESAIAPLARAVKLKI--ATDEEIKRLDAWELYSV 165 Query: 123 ALELVDTSSAPDIEWPTPPAVQ 144 + VDT+S +WP P V Sbjct: 166 MVNRVDTASP---DWPEVPDVA 184 >UniRef50_Q32IA2 Hypothetical prophage protein n=1 Tax=Shigella dysenteriae Sd197 RepID=Q32IA2_SHIDS Length = 109 Score = 66.3 bits (160), Expect = 3e-10, Method: Composition-based stats. Identities = 21/84 (25%), Positives = 42/84 (50%), Gaps = 8/84 (9%) Query: 68 PTHEEQIAAAELKKQQLINQANDYMNSKQWAGKAAIGRLKGEELAQYNLWLDYLDALELV 127 P + AE ++Q+L+N+A + + W + +G + ++ + W++Y+ A++ + Sbjct: 28 PVPVDYSKLAEKQRQRLLNEAKEITS--DWKTELELGTISDDDKVRLTQWMEYIKAVKAL 85 Query: 128 DTSSAPD------IEWPTPPAVQA 145 D S+A D I WP P A Sbjct: 86 DLSTATDEISFDAINWPERPDAAA 109 >UniRef50_B3HH41 Tail assembly chaperone gp38 n=3 Tax=Enterobacteriaceae RepID=B3HH41_ECOLX Length = 176 Score = 66.0 bits (159), Expect = 4e-10, Method: Composition-based stats. Identities = 20/84 (23%), Positives = 34/84 (40%), Gaps = 8/84 (9%) Query: 68 PTHEEQIAAAELKKQQLINQANDYMNSKQWAGKAAIGRLKGEELAQYNLWLDYLDALELV 127 P + AE ++ +L A ++ K +G + EE + +W Y L+ + Sbjct: 95 PVPVDYRQQAESERARLTAIAEREISDK--KTDLLLGIIGDEEKEKLTVWRIYAKLLQAM 152 Query: 128 DTSSAPD------IEWPTPPAVQA 145 D S+ D IEWP P + Sbjct: 153 DFSTITDKTSYNAIEWPVSPEASS 176 >UniRef50_C9R438 Putative tail fiber assembly protein n=3 Tax=root RepID=C9R438_AGGAD Length = 203 Score = 65.2 bits (157), Expect = 6e-10, Method: Composition-based stats. Identities = 26/106 (24%), Positives = 34/106 (32%), Gaps = 13/106 (12%) Query: 1 MNYIYSATTNSFYPLEMKEDYTQAGSWPDDAVEVDEQVYIEFSGLPPKGKIRIAGENGFP 60 M Y + FY E G P AVE+ EQ Y GK IA G P Sbjct: 1 MAIYY---KDGFYNDE------NGGYVPQGAVEITEQTYRTLLEGQSAGKQIIADSEGKP 51 Query: 61 AWSEIPPPTHEEQIA----AAELKKQQLINQANDYMNSKQWAGKAA 102 E P E +E K L+ + + +K + Sbjct: 52 ILVEPQPSHLHEFKNGKWIISEKNKTALLLEQRKTICAKINQLRDE 97 >UniRef50_B0VK51 Putative uncharacterized protein 51 n=1 Tax=Azospirillum phage Cd RepID=B0VK51_9CAUD Length = 228 Score = 65.2 bits (157), Expect = 6e-10, Method: Composition-based stats. Identities = 21/114 (18%), Positives = 39/114 (34%), Gaps = 7/114 (6%) Query: 5 YSATTNSFYPLEMKEDYTQAGSWPDDAVEVDEQVYIEFSGLPPKGKIRIAGENGFPAWSE 64 Y+A+T FY + P DAVE+ ++ Y G+ + G++G P + Sbjct: 3 YAASTGGFYDRA-----IHGDTVPADAVEITDEEYAALFDGQSLGQRIVPGQDGRPTFYT 57 Query: 65 IPPPTHEEQIAAAELKKQQLINQANDYMNSKQWAGKAAIG--RLKGEELAQYNL 116 + AA ++Q +N +W +A+ Sbjct: 58 PTLDDTKADRKAAATARRQTERDRGVVVNGNRWHSDKGSADDIATAVAMARLQE 111 >UniRef50_B1JPI1 Tail assembly chaperone gp38 n=3 Tax=Yersinia pseudotuberculosis RepID=B1JPI1_YERPY Length = 172 Score = 64.4 bits (155), Expect = 9e-10, Method: Composition-based stats. Identities = 17/83 (20%), Positives = 35/83 (42%), Gaps = 6/83 (7%) Query: 60 PAWSEIPPPTHEEQIAAAELKKQQLINQANDYMNSKQWAGKAAIGRLKGEELAQYNLWLD 119 + P+ ++ AAAE ++++L++ + + + A G +E + Sbjct: 96 DGAVVMRIPSVDDLTAAAEERRRELMSNVSVEIATLDDI--AQSGTGTEQEQERLAALKQ 153 Query: 120 YLDALELVDTSSAPDIEWPTPPA 142 Y AL +D + +WP PA Sbjct: 154 YRIALMRLDINE----QWPVLPA 172 >UniRef50_B5S309 Tail fiber assembly protein homolog n=2 Tax=Ralstonia solanacearum RepID=B5S309_RALSO Length = 198 Score = 63.7 bits (153), Expect = 2e-09, Method: Composition-based stats. Identities = 21/94 (22%), Positives = 32/94 (34%), Gaps = 3/94 (3%) Query: 50 KIRIAGENGFPAWSEIPPPTHEEQIAAAELKKQQLINQANDYMNSKQWAGKAAIGRLKGE 109 + +A AW A +++ + QA + Q A + Sbjct: 108 RPTVAHLWDGAAWQLDEALCASLHRADGLVERNIRMKQARRAIEPLQAAVD--LADATEA 165 Query: 110 ELAQYNLWLDYLDALELVDTSSAPDIEWPTPPAV 143 E A+ W YL AL VD + P + WP P Sbjct: 166 EAARLVAWRRYLVALNRVDLDADP-VAWPVAPDA 198 >UniRef50_B3RGH1 Putative tail fiber assembly protein n=1 Tax=Escherichia phage rv5 RepID=B3RGH1_9CAUD Length = 194 Score = 63.7 bits (153), Expect = 2e-09, Method: Composition-based stats. Identities = 18/84 (21%), Positives = 28/84 (33%), Gaps = 2/84 (2%) Query: 61 AWSEIPPPTHEEQ-IAAAELKKQQLINQANDYMNSKQWAGKAAIGRLKGEELAQYNLWLD 119 T E + AE + + A + + Q L ++ Sbjct: 112 RLVGDEFVTDNEFFVQQAEGVIETELAWATARIGAYQDMIDLEY-DLTDDQKRNIRDLKM 170 Query: 120 YLDALELVDTSSAPDIEWPTPPAV 143 Y L +DTS APDI +P P + Sbjct: 171 YRVKLLEIDTSKAPDIFFPERPTL 194 >UniRef50_Q48C55 Prophage PSPPH06, putative tail fiber protein n=1 Tax=Pseudomonas syringae pv. phaseolicola 1448A RepID=Q48C55_PSE14 Length = 145 Score = 62.9 bits (151), Expect = 3e-09, Method: Composition-based stats. Identities = 20/121 (16%), Positives = 37/121 (30%), Gaps = 15/121 (12%) Query: 29 DDAVEVDEQVYIEFSGLPPKGKIRIAGENGFPAWSEIPPPTHEEQIAAAELKKQQLINQA 88 D +E+ + Y G ++G PA P+ E + + Q Sbjct: 27 SDLIEISIEHYQSLLEAQSNGMRIDLDDSGRPAAIAPFAPSIETLRENDRCWRDYQLKQT 86 Query: 89 NDYMNSKQWAGKAAIGRLKGEELAQYNLWLDYLDALELVDTSSAPDIEWPTP---PAVQA 145 + ++ + + G+ + Y Y AL +WP P + A Sbjct: 87 DGMVSRHRD--ELEAGQATTLSVEHYRALQAYRGALR----------DWPEHSSFPDISA 134 Query: 146 R 146 R Sbjct: 135 R 135 >UniRef50_B4T266 Caudovirales tail fibre assembly protein n=16 Tax=Enterobacteriaceae RepID=B4T266_SALNS Length = 175 Score = 61.3 bits (147), Expect = 8e-09, Method: Composition-based stats. Identities = 20/80 (25%), Positives = 37/80 (46%), Gaps = 8/80 (10%) Query: 68 PTHEEQIAAAELKKQQLINQANDYMNSKQWAGKAAIGRLKGEELAQYNLWLDYLDALELV 127 + A AE +Q+L++ AN + W + A+G + ++ A W+ Y+ L+ + Sbjct: 97 AVPVDYQAKAETTRQKLLDGANSIIA--DWRTELALGEISDDDKATLTKWMSYIKGLKSL 154 Query: 128 DTSSAPD------IEWPTPP 141 D + D I+WP P Sbjct: 155 DLTGISDEATFNKIQWPALP 174 >UniRef50_A7ZL71 Putative uncharacterized protein n=1 Tax=Escherichia coli E24377A RepID=A7ZL71_ECO24 Length = 179 Score = 61.3 bits (147), Expect = 9e-09, Method: Composition-based stats. Identities = 29/144 (20%), Positives = 48/144 (33%), Gaps = 12/144 (8%) Query: 12 FYPLEMKEDYTQAGSWPDDAVEVDEQVYIEFSGLPPKGKIRIAGENG----FPAWSEIPP 67 F K Y G + +V + S + A +G Sbjct: 38 FAEDTYKVAYDSDGIVRSISTDVSALCPVSLSVAEVESLPDGADIDGNWVFDGESVVART 97 Query: 68 PTHEEQIAAAELKKQQLINQANDYMNSKQWAGKAAIGRLKGEELAQYNLWLDYLDALELV 127 T E A AE ++ LI+ A + ++ W + + + + WL Y+ AL+ + Sbjct: 98 LTAAEWQARAESQRSALISDAKERISL--WQSELLLDIITNYDKESLTEWLAYIKALQAL 155 Query: 128 DTSSAPD------IEWPTPPAVQA 145 D S D WP P V + Sbjct: 156 DLSGVTDEASYNATVWPDEPRVTS 179 >UniRef50_C4F418 Putative uncharacterized protein n=3 Tax=Haemophilus influenzae RepID=C4F418_HAEIN Length = 208 Score = 61.3 bits (147), Expect = 1e-08, Method: Composition-based stats. Identities = 18/108 (16%), Positives = 35/108 (32%), Gaps = 13/108 (12%) Query: 2 NYIYSATTNSFYPLEMKEDYTQAGSWPDDAVEVDEQVYIEFSGLPPKGKIRIAGENGFPA 61 Y Y + F ++ + P AV++ +Y +GK IA + G P Sbjct: 1 MYYYDSANKCFLSDDI-------HNIPAHAVQITNDLYSTLLNGQTQGKQIIADKTGNPI 53 Query: 62 WSEIPPPTHEEQI------AAAELKKQQLINQANDYMNSKQWAGKAAI 103 + P + + K+ L+ + + + A I Sbjct: 54 LIDPQPSAAHQLNLDTLTWEISAEKQTALLAETQTRLVANIDKHAAKI 101 >UniRef50_Q31Z52 Putative tail fiber assembly protein n=1 Tax=Shigella boydii Sb227 RepID=Q31Z52_SHIBS Length = 123 Score = 60.6 bits (145), Expect = 2e-08, Method: Composition-based stats. Identities = 17/80 (21%), Positives = 31/80 (38%), Gaps = 8/80 (10%) Query: 72 EQIAAAELKKQQLINQANDYMNSKQWAGKAAIGRLKGEELAQYNLWLDYLDALELVDTSS 131 + AE ++ L+ Q + +W +G + E+ + Y +L+ +D S+ Sbjct: 46 DYRLKAEDERDALLAQVSARTG--EWEEDLLLGLISDEDKEKLKACRIYAKSLQAMDFST 103 Query: 132 APD------IEWPTPPAVQA 145 D I WP P A Sbjct: 104 ITDKATYNAINWPERPDAAA 123 >UniRef50_C4U6G1 Phage tail assembly chaperone gp38 n=2 Tax=Enterobacteriaceae RepID=C4U6G1_YERAL Length = 51 Score = 60.6 bits (145), Expect = 2e-08, Method: Composition-based stats. Identities = 38/51 (74%), Positives = 42/51 (82%) Query: 92 MNSKQWAGKAAIGRLKGEELAQYNLWLDYLDALELVDTSSAPDIEWPTPPA 142 MNSKQW GKAA+GRLK +E AQYN WLDYLD LE VDTS+APDI+WP P Sbjct: 1 MNSKQWPGKAAMGRLKDDEKAQYNAWLDYLDLLEEVDTSTAPDIDWPVAPE 51 >UniRef50_Q3KH77 Hypothetical phage related protein n=1 Tax=Pseudomonas fluorescens Pf0-1 RepID=Q3KH77_PSEPF Length = 146 Score = 60.2 bits (144), Expect = 2e-08, Method: Composition-based stats. Identities = 20/112 (17%), Positives = 35/112 (31%), Gaps = 2/112 (1%) Query: 32 VEVDEQVYIEFSGLPPKGKIRIAGENGFPAWSEIPPPTHEEQIAAAELKKQQLINQANDY 91 V + + + + +G PTHEE +A + Q+ + A + Sbjct: 34 VGIADAEWYDITGDTTAHVGWKVFFKLEEDGLLFVEPTHEEHLAINAARMQERFDVAALW 93 Query: 92 MNSKQWAGKAAIGRLKGEELAQYNLWLDYLDALELVD--TSSAPDIEWPTPP 141 + K +G + A + Y A+ V I WP P Sbjct: 94 LTFNPLQYKLDLGVATPADEAALLAYKQYFVAVSEVKKQPGYPATINWPVAP 145 >UniRef50_Q849T8 Eag0005 n=3 Tax=Haemophilus influenzae RepID=Q849T8_HAEIN Length = 80 Score = 59.8 bits (143), Expect = 2e-08, Method: Composition-based stats. Identities = 23/73 (31%), Positives = 32/73 (43%), Gaps = 9/73 (12%) Query: 1 MNYIYSATTNSFYPLEMKEDYTQAGSWPDDAVEVDEQVYIEFSGLPPKGKIRIAGENGFP 60 M Y N F+ DY+ G P+ AVE+ ++ Y+E +GK IA G+P Sbjct: 4 MTMYY---KNGFF------DYSYGGFVPEGAVEISQETYLELLNGQAQGKQIIADNTGYP 54 Query: 61 AWSEIPPPTHEEQ 73 A E P E Sbjct: 55 ALMEPQPSAAHEL 67 >UniRef50_B2TWU3 Tail fiber assembly protein n=20 Tax=root RepID=B2TWU3_SHIB3 Length = 181 Score = 59.8 bits (143), Expect = 3e-08, Method: Composition-based stats. Identities = 17/77 (22%), Positives = 32/77 (41%), Gaps = 8/77 (10%) Query: 72 EQIAAAELKKQQLINQANDYMNSKQWAGKAAIGRLKGEELAQYNLWLDYLDALELVDTSS 131 + AE ++ L+ Q + +W +G + E+ + + Y +L+ +D S+ Sbjct: 104 DYRLKAEDERDALLAQVSARTG--EWEEDLLLGLISDEDREKLKAYRIYAKSLQAMDFSA 161 Query: 132 APD------IEWPTPPA 142 D IEWP P Sbjct: 162 ITDKSSYNAIEWPVSPE 178 >UniRef50_A4SL83 Phage tail fiber assembly protein n=1 Tax=Aeromonas salmonicida subsp. salmonicida A449 RepID=A4SL83_AERS4 Length = 141 Score = 59.0 bits (141), Expect = 4e-08, Method: Composition-based stats. Identities = 21/91 (23%), Positives = 32/91 (35%), Gaps = 8/91 (8%) Query: 53 IAGENGFPAWSEIPPPTHEEQIAAAELKKQQLINQANDYMNSKQWAGKAAIGRLKGEELA 112 + GE G PPT +Q A + + QA M + A +G + E Sbjct: 56 VFGEFGDIEVITPSPPTETQQQARLNEE----LKQAATAMAPLKDAD--TLGIISDAERQ 109 Query: 113 QYNLWLDYLDALELVDTSS--APDIEWPTPP 141 + W Y L + S ++ WP P Sbjct: 110 RLTAWQRYRVTLYRLPQSDGWPTEVNWPEMP 140 >UniRef50_B6Z9I1 Putative phage tail fiber assembly protein n=1 Tax=Kluyvera phage Kvp1 RepID=B6Z9I1_9CAUD Length = 174 Score = 58.6 bits (140), Expect = 5e-08, Method: Composition-based stats. Identities = 20/66 (30%), Positives = 30/66 (45%), Gaps = 4/66 (6%) Query: 78 ELKKQQLINQANDYMNSKQWAGKAAIGRLKGEELAQYNLWLDYLDALELVDTSSAPDIEW 137 E +K +LI QA+ ++ Q+A +G E + W Y A+ VD S P +W Sbjct: 113 ENQKARLIAQASTKID--QYADFIELGEEGLEGI--LKAWRKYRLAVFKVDLSVLPFYDW 168 Query: 138 PTPPAV 143 P P Sbjct: 169 PEKPEA 174 >UniRef50_C8UR23 Putative uncharacterized protein n=1 Tax=Escherichia coli O111:H- str. 11128 RepID=C8UR23_ECO1A Length = 78 Score = 57.9 bits (138), Expect = 9e-08, Method: Composition-based stats. Identities = 18/80 (22%), Positives = 30/80 (37%), Gaps = 3/80 (3%) Query: 63 SEIPPPTHEEQIAAAELKKQQLINQANDYMNSKQWAGKAAIGRLKGEELAQYNLWLDYLD 122 Q AE +KQ L+ D + W + +G + + + W+ Y Sbjct: 2 VCKVTSQPVMQRQQAEKEKQSLLQLVRDKT--QLWDSQLRLGIISVQGKQKLTEWILYAQ 59 Query: 123 ALELVDTSSAPDIEWPTPPA 142 +E DTS P + +P P Sbjct: 60 KVESTDTSILP-VTFPEKPE 78 >UniRef50_B1JM08 Putative uncharacterized protein n=1 Tax=Yersinia pseudotuberculosis YPIII RepID=B1JM08_YERPY Length = 116 Score = 57.5 bits (137), Expect = 1e-07, Method: Composition-based stats. Identities = 22/85 (25%), Positives = 33/85 (38%), Gaps = 6/85 (7%) Query: 58 GFPAWSEIPPPTHEEQIAAAELKKQQLINQANDYMNSKQWAGKAAIGRLKGEELAQYNLW 117 G P+ + P T + K Q + +A + + Q A + EE+ + LW Sbjct: 34 GRPSEWQPAPMTSSDARDI----KAQALTEAYIQVTALQAAVSTQL--ATPEEITELVLW 87 Query: 118 LDYLDALELVDTSSAPDIEWPTPPA 142 YL + V S DI WP P Sbjct: 88 QTYLVLMNRVVPDSPLDIVWPKKPE 112 >UniRef50_Q1I679 Putative phage tail fiber assembly protein n=1 Tax=Pseudomonas entomophila L48 RepID=Q1I679_PSEE4 Length = 126 Score = 55.9 bits (133), Expect = 4e-07, Method: Composition-based stats. Identities = 17/68 (25%), Positives = 26/68 (38%), Gaps = 4/68 (5%) Query: 77 AELKKQQLINQANDYMNSKQWAGKAAIGRLKGEELAQYNLWLDYLDALELVD--TSSAPD 134 A+ + +L A+ + Q A E+A+ W Y AL + D Sbjct: 61 AQAEALRLRAIADTAIAPLQDAVDLD--EASEAEVARLKEWRRYRVALNRLPEQPGYPAD 118 Query: 135 IEWPTPPA 142 I+WP PA Sbjct: 119 IDWPLAPA 126 >UniRef50_Q126B3 Putative uncharacterized protein n=1 Tax=Polaromonas sp. JS666 RepID=Q126B3_POLSJ Length = 223 Score = 55.2 bits (131), Expect = 6e-07, Method: Composition-based stats. Identities = 23/113 (20%), Positives = 32/113 (28%), Gaps = 30/113 (26%) Query: 4 IYSATTNSFYPLEMK----------------EDYTQAGS--------------WPDDAVE 33 YS T FY ++ D S P DAVE Sbjct: 2 FYSKLTGGFYSADLHGSRTLVIADAAWVPPLIDGMPDPSAIAPTIEINNPDCKIPPDAVE 61 Query: 34 VDEQVYIEFSGLPPKGKIRIAGENGFPAWSEIPPPTHEEQIAAAELKKQQLIN 86 + ++ + +GK A NG P P PT + AA+ Sbjct: 62 ITDEQHAALLEGQTQGKRIEADANGAPVLITPPAPTLDALKEAAQAAIDAHFE 114 >UniRef50_O52622 Putative uncharacterized protein n=1 Tax=Salmonella enterica subsp. enterica serovar Typhimurium RepID=O52622_SALTY Length = 94 Score = 55.2 bits (131), Expect = 6e-07, Method: Composition-based stats. Identities = 17/90 (18%), Positives = 36/90 (40%), Gaps = 5/90 (5%) Query: 3 YIYSATTNSFYPLEMKEDYTQAGSWPDDAVEVDEQVYIEFSGLPPKGKIRIAGENGFPAW 62 Y YS F+ + D T++ ++PDD + + ++ Y E GK G P Sbjct: 4 YYYSFKEKGFF---WQPD-TESDNYPDDLIPLTDEYYRELMQGQVDGKYI-EHRKGGPVL 58 Query: 63 SEIPPPTHEEQIAAAELKKQQLINQANDYM 92 E + + + +K+ + + + + Sbjct: 59 VEHRNIRLKSWLHRLKPEKRNFLLRQSQLL 88 >UniRef50_C0DSG5 Putative uncharacterized protein n=1 Tax=Eikenella corrodens ATCC 23834 RepID=C0DSG5_EIKCO Length = 215 Score = 55.2 bits (131), Expect = 7e-07, Method: Composition-based stats. Identities = 16/67 (23%), Positives = 30/67 (44%), Gaps = 6/67 (8%) Query: 1 MNYIYSATTNSFYPLEMKEDYTQAGSWPDDAVEVDEQVYIEFSGLPPKGKIRIAGENGFP 60 M YS T +FY + P+DAVE+ + + +G++ + G++G P Sbjct: 1 MTIYYSKTNQAFYDSSI------HSRLPEDAVEISHEQHAALLAGQSQGQVIMPGKDGKP 54 Query: 61 AWSEIPP 67 + + P Sbjct: 55 VLAPLAP 61 >UniRef50_A5X9J5 Putative tail fiber assembly protein n=1 Tax=Aeromonas phage phiO18P RepID=A5X9J5_9CAUD Length = 141 Score = 54.8 bits (130), Expect = 8e-07, Method: Composition-based stats. Identities = 16/81 (19%), Positives = 28/81 (34%), Gaps = 6/81 (7%) Query: 65 IPPPTHEEQIAAAELKK--QQLINQANDYMNSKQWAGKAAIGRLKGEELAQYNLWLDYLD 122 E + AE ++ L +A ++ Q A + + +EL + Y Sbjct: 62 RTQAQPEYLPSEAEQQRKLDALQAEATLHIAPLQDAKELKL--ATPQELDKLEALQRYRI 119 Query: 123 ALELVDTSS--APDIEWPTPP 141 AL + S + WP P Sbjct: 120 ALMRLPQSEGWPSSVTWPEMP 140 >UniRef50_Q2S7G7 Putative uncharacterized protein n=1 Tax=Hahella chejuensis KCTC 2396 RepID=Q2S7G7_HAHCH Length = 198 Score = 53.6 bits (127), Expect = 2e-06, Method: Composition-based stats. Identities = 17/62 (27%), Positives = 24/62 (38%), Gaps = 4/62 (6%) Query: 1 MNYIYSATTNSFYPLEMKEDYTQAGSWPDDAVEVDEQVYIEFSGLPPKGKIRIAGENGFP 60 M+Y YS +T FY + AG P D V+V E+ +G + G P Sbjct: 1 MHYFYSPSTRGFYLESLH----AAGGLPLDGVKVTEEERQALLDGQAQGLTIEINDQGRP 56 Query: 61 AW 62 Sbjct: 57 VA 58 >UniRef50_Q30VV8 Putative uncharacterized protein n=1 Tax=Desulfovibrio desulfuricans subsp. desulfuricans str. G20 RepID=Q30VV8_DESDG Length = 194 Score = 53.6 bits (127), Expect = 2e-06, Method: Composition-based stats. Identities = 20/91 (21%), Positives = 35/91 (38%), Gaps = 5/91 (5%) Query: 5 YSATTNSFYPLEMKEDYTQAGSWPDDAVEVDEQVYIEFSGLPPKGKIRIAGENGFPAWSE 64 YS +TN+FY + + P DAV V + + +G+I ENG P Sbjct: 3 YSPSTNAFYHPAV-----HGHAIPADAVAVSPEEHATLLAAQARGQIIRPDENGCPVAVT 57 Query: 65 IPPPTHEEQIAAAELKKQQLINQANDYMNSK 95 P + K+ ++ + A + + Sbjct: 58 PAAPPAPTRAELYTAKQTEIRDGAESMLTAL 88 >UniRef50_B3I4G5 Tail fiber assembly protein n=7 Tax=Escherichia RepID=B3I4G5_ECOLX Length = 101 Score = 52.9 bits (125), Expect = 3e-06, Method: Composition-based stats. Identities = 15/48 (31%), Positives = 20/48 (41%), Gaps = 2/48 (4%) Query: 85 INQANDYMNSKQWAGKAAIGRLKGEELAQYNLWLDYLDALELVDTSSA 132 I + ++Y+ Q A I EE + W Y L VDTS A Sbjct: 32 IQEFSEYIAPLQDAVDLEI--ATEEERSLLEAWNKYRVLLNRVDTSVA 77 >UniRef50_Q7P173 Probable tail fiber assembly protein n=1 Tax=Chromobacterium violaceum RepID=Q7P173_CHRVO Length = 192 Score = 52.5 bits (124), Expect = 5e-06, Method: Composition-based stats. Identities = 18/93 (19%), Positives = 29/93 (31%), Gaps = 6/93 (6%) Query: 54 AGENGFPAWSEIPPPTHEEQIAAAELKKQQLINQANDYMNSKQWAGKAAIGRLKGEELAQ 113 WS + +Q + A + A +IG +EL + Sbjct: 102 FPTWDGKGWSINKTAQSAALAQKTAAELKQRLADAYAARRPLEDAE--SIGIATADELQK 159 Query: 114 YNLWLDYLDALELV-DTSSAPDI---EWPTPPA 142 W Y L + D + P + +WP PA Sbjct: 160 LAAWKRYCVDLSRLPDLAMWPRLVGADWPKQPA 192 >UniRef50_D1P3V4 Bacteriophage tail fiber assembly protein n=4 Tax=Providencia rustigianii DSM 4541 RepID=D1P3V4_9ENTR Length = 165 Score = 51.7 bits (122), Expect = 6e-06, Method: Composition-based stats. Identities = 15/62 (24%), Positives = 30/62 (48%), Gaps = 4/62 (6%) Query: 74 IAAAELKKQQLINQANDYMNSKQWAGKAAIGRLKGEELAQYNLWLDYLDALELVDTSSAP 133 + +LKK+QL+N+ + ++ Q + +G EE+ + +Y +L S+ Sbjct: 101 RSQIDLKKKQLMNEVSSLIDPLQDSFD--MGVATSEEIEKLIALKEYRISLNR--ASTLH 156 Query: 134 DI 135 DI Sbjct: 157 DI 158 >UniRef50_B0USZ7 Putative uncharacterized protein n=5 Tax=Pasteurellaceae RepID=B0USZ7_HAES2 Length = 208 Score = 50.9 bits (120), Expect = 1e-05, Method: Composition-based stats. Identities = 20/106 (18%), Positives = 41/106 (38%), Gaps = 17/106 (16%) Query: 1 MNYIYSATTNSFYPLEMKEDYTQAGSWPDDAVEVDEQVYIEFSGLPPKGKIRIAGENGFP 60 M + + FY + + + P AVE+ E +Y +GK I ENG+P Sbjct: 1 MKIYF---KDGFYMSHIHK------NIPQGAVEISEDLYRSLLVGQSEGKQIITDENGYP 51 Query: 61 AWSEIPPP-----THEEQIAAAELK---KQQLINQANDYMNSKQWA 98 ++ P + + + E + Q+ + + +N+ + Sbjct: 52 QLADPQPSPFHHIEKGQWVISPENQTAHLTQVRAEMREKINALRDE 97 >UniRef50_Q87Y71 Tail fiber assembly domain protein n=1 Tax=Pseudomonas syringae pv. tomato RepID=Q87Y71_PSESM Length = 92 Score = 50.2 bits (118), Expect = 2e-05, Method: Composition-based stats. Identities = 16/83 (19%), Positives = 27/83 (32%), Gaps = 12/83 (14%) Query: 69 THEEQIAAAELKK--------QQLINQANDYMNSKQWAGKAAIGRLKGEELAQYNLWLDY 120 T E + A ++ +L A+ + Q A +E+A W + Sbjct: 11 TKEMKQEQAAKQRLADVVTEIARLRKIADYTIAPLQDAVDID--DATADEVASLKAWKQF 68 Query: 121 LDALELVD--TSSAPDIEWPTPP 141 AL + I+WP P Sbjct: 69 RVALNRIPAQPGYYEVIDWPVMP 91 >UniRef50_Q0BEK6 Bacteriophage-acquired protein n=3 Tax=Burkholderia RepID=Q0BEK6_BURCM Length = 253 Score = 49.4 bits (116), Expect = 4e-05, Method: Composition-based stats. Identities = 26/90 (28%), Positives = 31/90 (34%), Gaps = 4/90 (4%) Query: 55 GENGFPAWSEIPPPTHEEQIAAAELKKQQLINQANDYMNSKQWAGKAAIGRLKGEELAQY 114 AW E AAA + +A K A A G L E A + Sbjct: 100 YVWRDGAWVVDEAIVAERVRAAAMSDFYARMEKARQQNLGKMDARAA--GLLSDVEEAMF 157 Query: 115 NLWLDYLDALEL-VDTSSAP-DIEWPTPPA 142 + W Y AL VD + P DI WP P Sbjct: 158 DAWAAYQVALVRVVDLPTFPNDIVWPDEPD 187 >UniRef50_Q72D32 Tail fiber assembly protein, putative n=5 Tax=Desulfovibrio vulgaris RepID=Q72D32_DESVH Length = 148 Score = 48.2 bits (113), Expect = 8e-05, Method: Composition-based stats. Identities = 14/72 (19%), Positives = 28/72 (38%), Gaps = 6/72 (8%) Query: 1 MNYIYSATTNSFYPLEMKEDYTQAGSWPDDAVEVDEQVYIEFSGLPPKGKIRIAGENGFP 60 M +S +T FY + D P DA+ + + +++ + +G I G P Sbjct: 1 MQRYHSPSTGGFYLDGVHSD------IPADAIPITDSEHVDLTDALAQGCIIKMDAEGRP 54 Query: 61 AWSEIPPPTHEE 72 + P + + Sbjct: 55 CVAPAPGQSLVD 66 >UniRef50_B3R3K0 Phage tail fiber assembly protein n=1 Tax=Cupriavidus taiwanensis RepID=B3R3K0_CUPTR Length = 291 Score = 44.0 bits (102), Expect = 0.001, Method: Composition-based stats. Identities = 22/99 (22%), Positives = 33/99 (33%), Gaps = 4/99 (4%) Query: 47 PKGKIRIAGENGFPAWSEIPPPTHEEQIAAAELKKQQLINQANDYMNSKQWAGKAAIGRL 106 P + W+ E AAA + +L+ A + K A A + Sbjct: 92 PDPRPSELHRWTAEGWTLDAALVAERTRAAAMAEFDRLMAIAREANAGKADAYAAGLLDA 151 Query: 107 KGEELAQYNLWLDYLDALELVDTSSAPDI--EWPTPPAV 143 G A + W Y L V S+ + +WP P V Sbjct: 152 VGA--ALFKAWSAYQLDLVRVVNSTDFPVVADWPEAPNV 188 >UniRef50_A3YA18 Putative uncharacterized protein n=1 Tax=Marinomonas sp. MED121 RepID=A3YA18_9GAMM Length = 167 Score = 41.3 bits (95), Expect = 0.008, Method: Composition-based stats. Identities = 15/112 (13%), Positives = 32/112 (28%), Gaps = 14/112 (12%) Query: 34 VDEQVYIEFSGLPPKGKIRIAGENGFPAWSEIPPPTHEEQIAAAELKKQQLINQANDYMN 93 + + + ++ I P++EE ++A + LIN+ + + Sbjct: 64 IGTEYFDSNGNQTKINRLGEVPPTDALFERPIITPSNEELESSARASRNTLINKVSIEIE 123 Query: 94 SKQWAGKAAIGRLKGEELAQYNLWLDYLDALELVDTSS--APDIEWPTPPAV 143 + W Y L + + I+W PA Sbjct: 124 RLTD------------NKKETENWRTYRQFLRDIPSQKGFPQSIDWGIAPAE 163 >UniRef50_A7FIT9 Putative uncharacterized protein n=5 Tax=Yersinia RepID=A7FIT9_YERP3 Length = 218 Score = 40.9 bits (94), Expect = 0.012, Method: Composition-based stats. Identities = 20/100 (20%), Positives = 34/100 (34%), Gaps = 6/100 (6%) Query: 43 SGLPPKGKIRIAGENGFPAWSEIPPPTHEEQIAAAELKKQQLINQANDYMNSKQWAGKAA 102 G P +A F W T ++++AA + + +A D M ++ Sbjct: 101 IGDYPADTTELAPTTTFDQWDGTRWVTDKDRVAAVARRYRDAFIEATDPMMVSDYSIDDM 160 Query: 103 IGRLKGEELAQYNLWLDYLDALELVDTS-SAPDIEWPTPP 141 L E+ + A + T + P IE P P Sbjct: 161 --PLTSEQRRELAET---RLAFKTWPTQENWPRIELPDIP 195 >UniRef50_A9DEM0 Tail fiber assembly protein n=1 Tax=Yersinia phage PY100 RepID=A9DEM0_9CAUD Length = 143 Score = 39.8 bits (91), Expect = 0.025, Method: Composition-based stats. Identities = 25/124 (20%), Positives = 43/124 (34%), Gaps = 13/124 (10%) Query: 30 DAVEVDEQVYIEFSGLPPKGKIRIAGENGFPAWSEIPPPTH-EEQIAAAELKKQQLINQA 88 D+V + PK + W P ++++ AA+ +K +L+ +A Sbjct: 21 DSVGIPSAWDGWIEMHAPKPEAGDWYAGENGEWMYGEAPEEIKDRVMAAKSQKMRLLTEA 80 Query: 89 NDYMNSKQWA--------GKAAIGRLKGEEL---AQYNLWLDYLDALELVDTSSAPDIEW 137 N +N + K E + W +Y L VD A +I+W Sbjct: 81 NAMINMIEQEAIETPELYVKQVYNPYTDEIDNVNEELEKWHNYRKELIRVDVE-AKEIKW 139 Query: 138 PTPP 141 P P Sbjct: 140 PELP 143 >UniRef50_A8T0S3 Putative uncharacterized protein n=1 Tax=Vibrio sp. AND4 RepID=A8T0S3_9VIBR Length = 108 Score = 39.4 bits (90), Expect = 0.034, Method: Composition-based stats. Identities = 15/98 (15%), Positives = 29/98 (29%), Gaps = 8/98 (8%) Query: 49 GKIRIAGENGFPAWSEIPPPTHEEQIAAAELKKQQL---INQANDYMNSKQWAGKAAIGR 105 G+ E+ E + +L I A + ++ + Sbjct: 12 GETLFNVPAELGTLIEMGFSQERAAEICLEAEHAELWEQILSARNARLAQTDFTQVGDAP 71 Query: 106 LKGEELAQYNLWLDYLDALELVD--TSSAPDIEWPTPP 141 + E+ + Y AL + S+ D+ WP P Sbjct: 72 ITPEKKVAFA---SYRQALRDLPQNFSNPNDVVWPEKP 106 >UniRef50_Q1I690 Putative phage protein n=1 Tax=Pseudomonas entomophila L48 RepID=Q1I690_PSEE4 Length = 203 Score = 39.0 bits (89), Expect = 0.045, Method: Composition-based stats. Identities = 25/138 (18%), Positives = 41/138 (29%), Gaps = 27/138 (19%) Query: 21 YTQAGSWPDDAVEVDEQVYIEFSGLPPKGKIRIAGENGFPAWSEIPPPTHEEQ------- 73 W A+ VD+ V + + P+G+ + + P E Sbjct: 69 GDGGPMW---ALLVDDTVVET-TDIDPQGRFHASMKWLECPADTQPGDVMVEGNFTTPEP 124 Query: 74 --IAAAELKKQQLINQANDYMNSKQWAGKAAIGRLKGEELAQYNLWLDYLDALELVDTSS 131 AE + A+ ++ S+ + +G QY LDY AL Sbjct: 125 VGRTIAERAWRDGQLHASQWLTSRH-REELDLGIQPSLTPTQYAELLDYRQALR------ 177 Query: 132 APDIEWPTP---PAVQAR 146 WP P + R Sbjct: 178 ----VWPQAELFPELARR 191 >UniRef50_A5FI80 Putative uncharacterized protein n=1 Tax=Flavobacterium johnsoniae UW101 RepID=A5FI80_FLAJ1 Length = 122 Score = 39.0 bits (89), Expect = 0.049, Method: Composition-based stats. Identities = 19/132 (14%), Positives = 49/132 (37%), Gaps = 25/132 (18%) Query: 11 SFYPLEMKEDYTQAGSWPDDAVEVDEQVYIEFSGLPPKGKIRIAGENGFPAWSEIPPPTH 70 FY + E + P +E+ E + + K + G++ + ++ + Sbjct: 14 GFYVEGIHE------NIPQPNIELTEDEWHQAL---SKNYKVVNGKHTYSSFIQNQ---- 60 Query: 71 EEQIAAAELKKQQLINQANDYMNSKQWAGKAAIGRLKGEELAQYNLWLDYLDALELV-DT 129 EE ++ + + L+ +++ + L ++ W +Y + L + + Sbjct: 61 EEILSNLRITRNLLLAESDW--------TQLEDSPLSEDKKD---EWKNYREELRDLTNL 109 Query: 130 SSAPDIEWPTPP 141 + +I WP P Sbjct: 110 DNLTNIIWPLKP 121 >UniRef50_A4JDE2 Putative uncharacterized protein n=2 Tax=Burkholderia vietnamiensis G4 RepID=A4JDE2_BURVG Length = 146 Score = 39.0 bits (89), Expect = 0.051, Method: Composition-based stats. Identities = 17/91 (18%), Positives = 26/91 (28%), Gaps = 8/91 (8%) Query: 62 WSEIPPPTHEEQIAAA------ELKKQQLINQANDYMNSKQWAGKAAIGRLKGEELAQYN 115 W+ +E A E + + + D + A A Sbjct: 52 WASDSVAQPDEADVEAFFRANEEAIRAEHVRLFRDMALRNTDSKTIAPPDAPDSIKALAV 111 Query: 116 LWLDYLDALELVDTSS--APDIEWPTPPAVQ 144 W Y +AL V ++ WP PP Q Sbjct: 112 AWAAYREALRNVPEQEGFPFEVTWPAPPDEQ 142 >UniRef50_C5A8Q2 Bacteriophage-acquired protein n=2 Tax=Burkholderia RepID=C5A8Q2_BURGB Length = 211 Score = 38.6 bits (88), Expect = 0.059, Method: Composition-based stats. Identities = 25/91 (27%), Positives = 32/91 (35%), Gaps = 8/91 (8%) Query: 56 ENGFPAWSEIPPPTHEEQIAAAELKKQQLINQANDYMNSKQWAGKAAIGRLKGEELAQYN 115 AW E+ AA + QL+ A + K A A G L E + + Sbjct: 101 AWVDGAWVVPDSVIEAEKRAARQQTFDQLMANAKKANDGKADAYAA--GLLDDEGIYYFK 158 Query: 116 LWLDYLDAL----ELVDTSSAPDIEWPTPPA 142 W Y AL D S+ I WP PA Sbjct: 159 AWSAYQMALVAAMNSTDASA--TITWPKTPA 187 >UniRef50_A4JWI7 Putative uncharacterized protein n=4 Tax=Burkholderia RepID=A4JWI7_BURVG Length = 146 Score = 38.2 bits (87), Expect = 0.092, Method: Composition-based stats. Identities = 20/89 (22%), Positives = 29/89 (32%), Gaps = 14/89 (15%) Query: 61 AWSEI--PPPTHEEQIAAAELKKQQLINQANDYMNSKQWAGKAAIGRLKGEELAQYNLWL 118 AW E+ +A K+ L+ Q + + AG A Sbjct: 67 AWWPDYADEYALAEETTSARQKRDDLLKQVDPMVERAADAGDADFE----------AALR 116 Query: 119 DYLDALELVDTSS--APDIEWPTPPAVQA 145 Y AL V + ++ WPT PA A Sbjct: 117 KYRQALRDVPAQAGFPFNVTWPTLPAKSA 145 >UniRef50_A1TPR3 Putative uncharacterized protein n=1 Tax=Acidovorax citrulli AAC00-1 RepID=A1TPR3_ACIAC Length = 146 Score = 37.8 bits (86), Expect = 0.099, Method: Composition-based stats. Identities = 23/97 (23%), Positives = 31/97 (31%), Gaps = 16/97 (16%) Query: 46 PPKGKIRIAGENGFPAWSEIPPPTHEEQIAAAELKKQQLINQANDYMNSKQWAGKAAIGR 105 PP A + G W E ++ +L+ A D+ A + + Sbjct: 62 PPWPGQGHAFDFGRLEWVPD----AAELWGYVRARRDELLR-ACDWRVLPDAPTPADMRQ 116 Query: 106 LKGEELAQYNLWLDYLDALELVDTSSAPD-IEWPTPP 141 WLDY AL V P IEWP P Sbjct: 117 ----------AWLDYRQALRDVTGQGDPRAIEWPVAP 143 Database: uniref50.fasta Posted date: Mar 8, 2010 10:38 AM Number of letters in database: 1,040,396,356 Number of sequences in database: 3,077,464 Lambda K H 0.308 0.124 0.376 Lambda K H 0.267 0.0378 0.140 Matrix: BLOSUM62 Gap Penalties: Existence: 11, Extension: 1 Number of Hits to DB: 867,607,719 Number of Sequences: 3077464 Number of extensions: 28929520 Number of successful extensions: 69559 Number of sequences better than 1.0e-01: 158 Number of HSP's better than 0.1 without gapping: 282 Number of HSP's successfully gapped in prelim test: 84 Number of HSP's that attempted gapping in prelim test: 68947 Number of HSP's gapped (non-prelim): 371 length of query: 146 length of database: 1,040,396,356 effective HSP length: 109 effective length of query: 37 effective length of database: 704,952,780 effective search space: 26083252860 effective search space used: 26083252860 T: 11 A: 40 X1: 16 ( 7.1 bits) X2: 38 (14.6 bits) X3: 64 (24.7 bits) S1: 42 (21.7 bits) S2: 87 (38.2 bits)