BLASTP 2.2.22 [Sep-27-2009]

Reference: Altschul, Stephen F., Thomas L. Madden, Alejandro A. Schaffer,
Jinghui Zhang, Zheng Zhang, Webb Miller, and David J. Lipman (1997),
"Gapped BLAST and PSI-BLAST: a new generation of protein database search
programs", Nucleic Acids Res. 25:3389-3402.

Reference for compositional score matrix adjustment: Altschul, Stephen F.,
John C. Wootton, E. Michael Gertz, Richa Agarwala, Aleksandr Morgulis,
Alejandro A. Schaffer, and Yi-Kuo Yu (2005) "Protein database searches
using compositionally adjusted substitution matrices", FEBS J. 272:5101-5109.

Reference for composition-based statistics starting in round 2:
Schaffer, Alejandro A., L. Aravind, Thomas L. Madden, Sergei Shavirin,
John L. Spouge, Yuri I. Wolf, Eugene V. Koonin, and Stephen F. Altschul (2001),
"Improving the accuracy of PSI-BLAST protein database searches with
composition-based statistics and other refinements", Nucleic Acids Res. 29:2994-3005.

Query= batch____
         (320 letters)

Database: uniref50.fasta
           3,077,464 sequences; 1,040,396,356 total letters

Searching..................................................done

Results from round 1

                                                                      Score    E
Sequences producing significant alignments:                          (bits) Value

UniRef50_Q3YZL1 Phage protein-related n=42 Tax=Enterobacteriacea...   538   e-152
UniRef50_B3X2T1 Side tail fiber protein n=1 Tax=Shigella dysente...   439   e-122
UniRef50_P76072 Side tail fiber protein homolog from lambdoid pr...   353   4e-96
UniRef50_B7L485 Putative tail fiber protein n=4 Tax=Escherichia ...   343   3e-93
UniRef50_B5FQX9 Side tail fiber protein n=2 Tax=Salmonella enter...   343   6e-93
UniRef50_B3XIH6 GpH n=7 Tax=Enterobacteriaceae RepID=B3XIH6_ECOLX     330   4e-89
UniRef50_P03764 Side tail fiber protein n=2 Tax=root RepID=STF_L...   319   9e-86
UniRef50_B5PP06 Side tail fiber protein n=2 Tax=Salmonella enter...   315   1e-84
UniRef50_Q99362 Protein 37 (Fragment) n=1 Tax=Enterobacteria pha...   291   2e-77
UniRef50_A4TT73 Phage tail protein n=30 Tax=Enterobacteriaceae R...   279   1e-73
UniRef50_B7LN99 Putative tail fiber protein n=2 Tax=Enterobacter...   241   2e-62
UniRef50_B7LKX7 Putative side tail phage protein n=2 Tax=Escheri...   232   1e-59
UniRef50_B4T041 Gp19 n=1 Tax=Salmonella enterica subsp. enterica...   227   3e-58
UniRef50_B3I2W7 Phage Tail Collar Domain family n=1 Tax=Escheric...   225   1e-57
UniRef50_Q3ZL14 Tail fiber protein n=2 Tax=Enterobacteriaceae Re...   194   2e-48
UniRef50_B6S308 ORF32 (Fragment) n=3 Tax=Enterobacteriaceae RepI...   182   2e-44
UniRef50_A9DEM1 Tail protein 2 n=1 Tax=Yersinia phage PY100 RepI...   171   3e-41
UniRef50_Q38190 Gp37, tip of tail fiber (Fragment) n=5 Tax=Enter...   148   3e-34
UniRef50_B3X4P8 Tail fiber n=3 Tax=Enterobacteriaceae RepID=B3X4...   139   1e-31
UniRef50_D0IJ09 Tail fiber protein H putative n=1 Tax=Vibrio sp....   127   6e-28
UniRef50_A3GRX3 Tail fiber like-protein n=1 Tax=Vibrio cholerae ...   106   9e-22
UniRef50_C5H7L2 Putative tail fiber protein GP37 n=3 Tax=unclass...   106   1e-21
UniRef50_Q6KGF6 Putative tail fiber protein GP37 n=2 Tax=unclass...    93   2e-17
UniRef50_C7BL21 Similarities with phage tail fiber protein n=1 T...    90   1e-16
UniRef50_D2U1K0 Phage tail protein n=1 Tax=Arsenophonus nasoniae...    89   3e-16
UniRef50_C6DE08 Tail Collar domain protein n=1 Tax=Pectobacteriu...    87   6e-16
UniRef50_Q7N2R3 Similar to tail fiber assembly protein from bact...    87   8e-16
UniRef50_Q7N2Q1 Similar to Sc/SvQ protein of Escherichia coli pl...    86   2e-15
UniRef50_A9Q1X5 Putative tail fiber protein n=1 Tax=Enterobacter...    85   3e-15
UniRef50_B2PZV1 Putative uncharacterized protein n=2 Tax=Gammapr...    85   4e-15
UniRef50_D2U2G6 Phage tail assembly protein n=1 Tax=Arsenophonus...    84   9e-15
UniRef50_C6CP88 Tail Collar domain protein n=5 Tax=Enterobacteri...    82   3e-14
UniRef50_D2TR91 Putative phage tail fibre protein n=1 Tax=Citrob...    82   4e-14
UniRef50_A4WEL3 Phage Tail Collar domain protein n=2 Tax=Enterob...    81   5e-14
UniRef50_C7BSQ1 Side tail fiber protein homolog from lambdoid pr...    79   2e-13
UniRef50_D1TPQ4 Phage tail collar domain protein n=1 Tax=Yersini...    79   2e-13
UniRef50_C6C5D4 Tail Collar domain protein n=1 Tax=Dickeya dadan...    79   3e-13
UniRef50_C6CGA0 Tail Collar domain protein n=5 Tax=Enterobacteri...    78   3e-13
UniRef50_Q7NAA0 Complete genome; segment 1/17 n=2 Tax=Photorhabd...    77   6e-13
UniRef50_A9R3H4 Tail collar domain protein n=20 Tax=Yersinia Rep...    77   7e-13
UniRef50_C6C5D2 Tail Collar domain protein n=2 Tax=Dickeya dadan...    77   8e-13
UniRef50_Q7N348 Similarities with tail fiber protein n=1 Tax=Pho...    77   1e-12
UniRef50_C4S5W0 Putative uncharacterized protein n=2 Tax=Yersini...    77   1e-12
UniRef50_Q6D3Y6 Probable bacteriophage variable tail fiber prote...    76   2e-12
UniRef50_Q7N5C0 Similarities with DNA inversion product n=5 Tax=...    75   3e-12
UniRef50_C4TTI1 Tail fiber protein n=2 Tax=Yersinia RepID=C4TTI1...    74   5e-12
UniRef50_C6CP84 Tail Collar domain protein n=2 Tax=Dickeya RepID...    74   8e-12
UniRef50_C7BSQ6 Putative tail fiber protein n=1 Tax=Photorhabdus...    74   8e-12
UniRef50_C6CGA4 Tail Collar domain protein n=7 Tax=Enterobacteri...    73   1e-11
UniRef50_Q93D60 PilV n=7 Tax=Escherichia coli RepID=Q93D60_ECOLX       72   2e-11
UniRef50_C5BH14 Tail fiber n=2 Tax=Enterobacteriaceae RepID=C5BH...    72   2e-11
UniRef50_C6DA10 Tail Collar domain protein n=7 Tax=Enterobacteri...    72   3e-11
UniRef50_Q9AHZ5 YdaB n=29 Tax=Enterobacteriaceae RepID=Q9AHZ5_PHOLU    71   4e-11
UniRef50_Q9MCR6 Tail fiber n=1 Tax=Enterobacteria phage HK97 Rep...    70   7e-11
UniRef50_P33227 Putative protein stfE (Fragment) n=60 Tax=Entero...    69   2e-10
UniRef50_D2BT14 Tail Collar domain protein n=1 Tax=Dickeya dadan...    69   3e-10
UniRef50_B3I8J5 DNA inversion product n=3 Tax=Enterobacteriaceae...    69   3e-10
UniRef50_UPI000190EC42 bacteriophage tail fiber protein n=1 Tax=...    69   3e-10
UniRef50_C6V0Q3 Phage tail fiber protein n=13 Tax=Enterobacteria...    68   4e-10
UniRef50_B7MJL6 Putative phage tail protein n=3 Tax=Escherichia ...    68   5e-10
UniRef50_B7NJP1 Putative side tail fiber protein homolog from la...    67   6e-10
UniRef50_B3I9S3 DNA inversion product n=5 Tax=Escherichia coli R...    67   6e-10
UniRef50_D0KLI8 Tail Collar domain protein n=1 Tax=Pectobacteriu...    67   8e-10
UniRef50_A3GUE7 Tail fiber protein H, putative (Fragment) n=1 Ta...    67   1e-09
UniRef50_C4UEH4 Variable tail fiber protein n=3 Tax=Yersinia Rep...    67   1e-09
UniRef50_A7FIU0 Tail collar domain protein n=1 Tax=Yersinia pseu...    66   2e-09
UniRef50_D2BYH6 Tail Collar domain protein n=1 Tax=Dickeya dadan...    65   4e-09
UniRef50_UPI0001B5347E putative variable tail fiber protein n=1 ...    64   6e-09
UniRef50_Q65WH4 Putative uncharacterized protein n=1 Tax=Mannhei...    64   6e-09
UniRef50_D0FSD9 Phage related-protein n=2 Tax=Erwinia pyrifoliae...    63   1e-08
UniRef50_C6CG98 Tail Collar domain protein n=1 Tax=Dickeya zeae ...    63   2e-08
UniRef50_Q6D2U8 Bacteriophage tail fiber protein n=1 Tax=Pectoba...    62   2e-08
UniRef50_C6C6Z0 Tail Collar domain protein n=1 Tax=Dickeya dadan...    62   3e-08
UniRef50_B7US81 Predicted tail fiber protein n=16 Tax=root RepID...    62   3e-08
UniRef50_UPI0001A44C27 bacteriophage tail fiber protein n=2 Tax=...    61   5e-08
UniRef50_C5J9F2 Putative uncharacterized protein (Fragment) n=1 ...    56   1e-06
UniRef50_D0KGE3 Tail Collar domain protein n=1 Tax=Pectobacteriu...    56   2e-06
UniRef50_Q7N047 Similarities with bacteriophage protein n=3 Tax=...    54   1e-05
UniRef50_B8DPV9 Tail Collar domain protein n=1 Tax=Desulfovibrio...    54   1e-05
UniRef50_Q7N6T1 Similarities with DNA inversion product n=2 Tax=...    52   3e-05
UniRef50_B5JYG6 Phage Tail Collar Domain family (Fragment) n=1 T...    52   4e-05
UniRef50_C5H7L3 Putative tail fiber protein n=1 Tax=Enterobacter...    51   4e-05
UniRef50_B6VKW8 Sc/svq protein n=2 Tax=Photorhabdus asymbiotica ...    51   5e-05
UniRef50_A9IRI0 Phage related protein n=9 Tax=Bartonella RepID=A...    50   7e-05
UniRef50_Q1I687 Putative phage variable tail fibre protein n=1 T...    50   1e-04
UniRef50_D0ZBI4 Putative tail fiber protein n=1 Tax=Edwardsiella...    49   2e-04
UniRef50_B1M1N8 Tail Collar domain protein n=1 Tax=Methylobacter...    48   4e-04
UniRef50_D0KGE1 Shufflon domain protein n=1 Tax=Pectobacterium w...    48   5e-04
UniRef50_B2SVF7 Phage-related protein n=3 Tax=Xanthomonas oryzae...    48   5e-04
UniRef50_Q0BEK5 Phage Tail Collar domain protein n=1 Tax=Burkhol...    48   6e-04
UniRef50_D0KGE5 Tail Collar domain protein n=4 Tax=Pectobacteriu...    47   9e-04
UniRef50_C6BT48 Tail Collar domain protein n=1 Tax=Desulfovibrio...    45   0.003
UniRef50_C5ABB4 Putative uncharacterized protein n=1 Tax=Burkhol...    44   0.009
UniRef50_Q4TVW2 Tail fiber protein n=4 Tax=Viruses RepID=Q4TVW2_...    44   0.011
UniRef50_A9IXL3 Phage-related protein n=6 Tax=Bartonella RepID=A...    41   0.044
UniRef50_B4EF34 Putative phage tail protein n=1 Tax=Burkholderia...    41   0.053
UniRef50_C3KCU2 Putative phage tail fiber-related protein n=1 Ta...    40   0.084

>UniRef50_Q3YZL1 Phage protein-related n=42 Tax=Enterobacteriaceae RepID=Q3YZL1_SHISS
          Length = 1029

 Score = 538 bits (1387), Expect = e-152, Method: Compositional matrix adjust.
Identities = 277/321 (86%), Positives = 292/321 (90%), Gaps = 3/321 (0%) Query: 3 ITALTDNTQGAAGLELYEVYNNGYPTAYGNIIHLKGMTAVGEGELLIGWSGTSGAHAPAF 62 + ALTDNTQGAAGLELYEVYNNGYPTAYGNIIHLKGMTAVGEGELLIGWSGTSGAHAPAF Sbjct: 709 VAALTDNTQGAAGLELYEVYNNGYPTAYGNIIHLKGMTAVGEGELLIGWSGTSGAHAPAF 768 Query: 63 IRSRRDTTDANWSPWAQLYTSAHPPAEFYPVGAPIPWPSDTVPSGYALMQGQTFDKSAYP 122 IRSRRDTTDANWSPWAQLYTSAHPPAEFYPVGAPIPWPSDTVPSGYALMQGQTFDKSAYP Sbjct: 769 IRSRRDTTDANWSPWAQLYTSAHPPAEFYPVGAPIPWPSDTVPSGYALMQGQTFDKSAYP 828 Query: 123 KLAVAYPSGVIPDMRGWTIKGKPASGRAVLSQEQDGIKSHTHSASASSTDLGTETTSSFD 182 KLA AYPSGVIPDMRGWTIKGKPASGRAVLSQEQDGIKSHTHSASASSTDLGT+TTSSFD Sbjct: 829 KLAAAYPSGVIPDMRGWTIKGKPASGRAVLSQEQDGIKSHTHSASASSTDLGTKTTSSFD 888 Query: 183 YGTKSTNNTGAHTHSISGTANSAGAHQHKSSGAF---GGTNTSIFPNGYTAISNLSAGIM 239 YGTKSTNNTGAHTHS+SG+ +SAGAHQH +G G T +FP G T +S + + Sbjct: 889 YGTKSTNNTGAHTHSLSGSTSSAGAHQHSQTGPRTNSGSQPTGMFPAGSTQVSGTNQVGI 948 Query: 240 STTSGSGQTRNAGKTSSDGAHTHSLSGTAASAGAHAHTVGIGAHTHSVAIGSHGHTITVN 299 S + SG ++ GK+SS+G HTHSLSGTAASAGAHAHTVGIGAHTHSVAIGSHGHTITVN Sbjct: 949 SGSLTSGTSQWVGKSSSEGNHTHSLSGTAASAGAHAHTVGIGAHTHSVAIGSHGHTITVN 1008 Query: 300 AAGNAENTVKNIAFNYIVRLA 320 AAGNAENTVKNIAFNYIVRLA Sbjct: 1009 AAGNAENTVKNIAFNYIVRLA 1029 >UniRef50_B3X2T1 Side tail fiber protein n=1 Tax=Shigella dysenteriae 1012 RepID=B3X2T1_SHIDY Length = 488 Score = 439 bits (1128), Expect = e-122, Method: Compositional matrix adjust. 
Identities = 233/241 (96%), Positives = 235/241 (97%), Gaps = 3/241 (1%) Query: 80 LYTSAHPPAEFYPVGAPIPWPSDTVPSGYALMQGQTFDKSAYPKLAVAYPSGVIPDMRGW 139 LYT P +FYP GAPIPWPSDTVPSGYALMQGQTFDKSAYPKLAVAYPSGVIPDMRGW Sbjct: 251 LYT---PSEQFYPPGAPIPWPSDTVPSGYALMQGQTFDKSAYPKLAVAYPSGVIPDMRGW 307 Query: 140 TIKGKPASGRAVLSQEQDGIKSHTHSASASSTDLGTETTSSFDYGTKSTNNTGAHTHSIS 199 TIKGKPASGRAVLSQEQDGIKSHTHSASASSTDLGT+TTSSFDYGTKSTNNTGAHTHSIS Sbjct: 308 TIKGKPASGRAVLSQEQDGIKSHTHSASASSTDLGTKTTSSFDYGTKSTNNTGAHTHSIS 367 Query: 200 GTANSAGAHQHKSSGAFGGTNTSIFPNGYTAISNLSAGIMSTTSGSGQTRNAGKTSSDGA 259 GTANSAGAHQHKSSGAFGGTNTSIFPNGYTAISNLSAGIMSTTSGSGQTRNAGKTSSDGA Sbjct: 368 GTANSAGAHQHKSSGAFGGTNTSIFPNGYTAISNLSAGIMSTTSGSGQTRNAGKTSSDGA 427 Query: 260 HTHSLSGTAASAGAHAHTVGIGAHTHSVAIGSHGHTITVNAAGNAENTVKNIAFNYIVRL 319 HTHSLSGTAASAGAHAHTVGIGAHTHSVAIGSHGHTITVNAAGNAENTVKNIAFNYIVRL Sbjct: 428 HTHSLSGTAASAGAHAHTVGIGAHTHSVAIGSHGHTITVNAAGNAENTVKNIAFNYIVRL 487 Query: 320 A 320 A Sbjct: 488 A 488 >UniRef50_P76072 Side tail fiber protein homolog from lambdoid prophage Rac n=23 Tax=root RepID=STFR_ECOLI Length = 1120 Score = 353 bits (906), Expect = 4e-96, Method: Compositional matrix adjust. 
Identities = 200/259 (77%), Positives = 208/259 (80%), Gaps = 14/259 (5%) Query: 62 FIRSRRDTTDANWSPWAQLYTSAHPPAEFYPVGAPIPWPSDTVPSGYALMQGQTFDKSAY 121 F RS RD WA++YTS + P E YPVGAPIPWPSDTVPSGYALMQGQ FDKSAY Sbjct: 876 FYRSSRDGYGFE-EDWAEVYTSKNLPPESYPVGAPIPWPSDTVPSGYALMQGQAFDKSAY 934 Query: 122 PKLAVAYPSGVIPDMRGWTIKGKPASGRAVLSQEQDGIKSHTHSASASSTDLGTETTSSF 181 PKLA AYPSGVIPDMRGWTIKGKPASGRAVLSQEQDGIKSHTHSASASSTDLGT+TTSSF Sbjct: 935 PKLAAAYPSGVIPDMRGWTIKGKPASGRAVLSQEQDGIKSHTHSASASSTDLGTKTTSSF 994 Query: 182 DYGTKSTNNTGAHTHSISGTANSAGAHQHKSSGAFGGTNTSIFPNGYTAISNLSAGIMST 241 DYGTKSTNNTGAHTHS+SG+ NSAGAH H N TA +N AG ST Sbjct: 995 DYGTKSTNNTGAHTHSVSGSTNSAGAHTHS------------LANVNTASANSGAGSAST 1042 Query: 242 TSGSGQTRNAGKTSSDGAHTHSLSGTAASAGAHAHTVGIGAHTHSVAIGSHGHTITVNAA 301 +N TSS GAHTHSLSGTAASAGAHAHTVGIGAHTHSVAIGSHGHTITVNAA Sbjct: 1043 RLSVVHNQNYA-TSSAGAHTHSLSGTAASAGAHAHTVGIGAHTHSVAIGSHGHTITVNAA 1101 Query: 302 GNAENTVKNIAFNYIVRLA 320 GNAENTVKNIAFNYIVRLA Sbjct: 1102 GNAENTVKNIAFNYIVRLA 1120 >UniRef50_B7L485 Putative tail fiber protein n=4 Tax=Escherichia coli RepID=B7L485_ECO55 Length = 1056 Score = 343 bits (881), Expect = 3e-93, Method: Compositional matrix adjust. 
Identities = 194/263 (73%), Positives = 211/263 (80%), Gaps = 5/263 (1%) Query: 62 FIRSRRDTTDANWSPWAQLYTSAHPPAEFYPVGAPIPWPSDTVPSGYALMQGQTFDKSAY 121 F RS RD WA++YTS + P E YPVGAPIPWPSDTVPSGYALMQGQTF+KSAY Sbjct: 795 FYRSSRDGYGFE-EDWAEVYTSKNLPPESYPVGAPIPWPSDTVPSGYALMQGQTFNKSAY 853 Query: 122 PKLAVAYPSGVIPDMRGWTIKGKPASGRAVLSQEQDGIKSHTHSASASSTDLGTETTSSF 181 PKLA AYPSGVIPDMRGWTIKGKPASGRAVLSQEQDGIKSHTHSASASSTDLGT+TTSSF Sbjct: 854 PKLAAAYPSGVIPDMRGWTIKGKPASGRAVLSQEQDGIKSHTHSASASSTDLGTKTTSSF 913 Query: 182 DYGTKSTNNTGAHTHSISGTANSAGAHQHKSSGAF---GGTNTSIFP-NGYTAISNLSAG 237 DYGTKSTNNTGAHTHS+SG+ SAG H H + + GG+ + + G+T + N Sbjct: 914 DYGTKSTNNTGAHTHSLSGSTGSAGVHTHGNGIRWPGGGGSALAFYDGGGFTYVQNSQYQ 973 Query: 238 IMSTTSGSGQTRNAGKTSSDGAHTHSLSGTAASAGAHAHTVGIGAHTHSVAIGSHGHTIT 297 + TS +T S GAHTHSLSGTAAS+GAHAHTVGIGAHTHSVAIGSHGHTIT Sbjct: 974 VSPGTSSYRSYYQRIQTQSAGAHTHSLSGTAASSGAHAHTVGIGAHTHSVAIGSHGHTIT 1033 Query: 298 VNAAGNAENTVKNIAFNYIVRLA 320 VNAAGNAENTVKNIAFNYIVRLA Sbjct: 1034 VNAAGNAENTVKNIAFNYIVRLA 1056 >UniRef50_B5FQX9 Side tail fiber protein n=2 Tax=Salmonella enterica subsp. enterica RepID=B5FQX9_SALDC Length = 569 Score = 343 bits (879), Expect = 6e-93, Method: Compositional matrix adjust. 
Identities = 189/320 (59%), Positives = 214/320 (66%), Gaps = 52/320 (16%) Query: 3 ITALTDNTQGA-AGLELYEVYNNGYPTAYGNIIHLKGMTAVGEGELLIGWSGTSGAHAPA 61 + ALT T+G+ +GL + EVYNNGYPT YGNI+ L G G+GE+LIGWSGT+GA APA Sbjct: 300 LPALTGATRGSDSGLIMGEVYNNGYPTQYGNILRLTG---TGDGEILIGWSGTNGAPAPA 356 Query: 62 FIRSRRDTTDANWSPWAQLYTSAHPPAEFYPVGAPIPWPSDTVPSGYALMQGQTFDKSAY 121 +IRS RDT DA WS WA LYTS +PP YPVGA I WPSD P+GYALMQGQ+FDKSAY Sbjct: 357 YIRSHRDTADAEWSEWAMLYTSLNPPPNSYPVGAAIAWPSDATPAGYALMQGQSFDKSAY 416 Query: 122 PKLAVAYPSGVIPDMRGWTIKGKPASGRAVLSQEQDGIKSHTHSASASSTDLGTETTSSF 181 P LA+AYPSG+IPDMRGWTIKGKP SGRAVLSQE DG KSH+HSA A TDLGT++TSSF Sbjct: 417 PLLAIAYPSGIIPDMRGWTIKGKPISGRAVLSQEMDGNKSHSHSARAQDTDLGTKSTSSF 476 Query: 182 DYGTKSTNNTGAHTHSISGTANSAGAHQHKSSGAFGGTN-TSIFPNGYTAISNLSAGIMS 240 DYGTKSTN TG HTH G NS +G +N TS P G Sbjct: 477 DYGTKSTNTTGNHTHQFGGYINS----------YWGDSNHTSFQPGG------------- 513 Query: 241 TTSGSGQTRNAGKTSSDGAHTHSLSGTAASAGAHAHTVGIGAHTHSVAIGSHGHTITVNA 300 GA T +AG HAHTV IG H H++ IG HGH + V+A Sbjct: 514 -----------------GAWTQ-------AAGDHAHTVYIGGHEHTMYIGPHGHVVIVDA 549 Query: 301 AGNAENTVKNIAFNYIVRLA 320 GNAE TVKNIAFNYIVRLA Sbjct: 550 DGNAETTVKNIAFNYIVRLA 569 >UniRef50_B3XIH6 GpH n=7 Tax=Enterobacteriaceae RepID=B3XIH6_ECOLX Length = 710 Score = 330 bits (846), Expect = 4e-89, Method: Compositional matrix adjust. 
Identities = 178/263 (67%), Positives = 195/263 (74%), Gaps = 29/263 (11%) Query: 61 AFIRSRRDTTDANWSPWAQLYTSAHPPAEFYPVGAPIPWPSDTVPSGYALMQGQTFDKSA 120 ++ RS+ T D W+ W P + +PVGA IPWPSD+VP+GYA+MQGQTFDK+ Sbjct: 474 SYTRSQYSTGD--WTAWT--------PQDSFPVGAAIPWPSDSVPTGYAVMQGQTFDKTT 523 Query: 121 YPKLAVAYPSGVIPDMRGWTIKGKPASGRAVLSQEQDGIKSHTHSASASSTDLGTETTSS 180 YP LA AYPSGV+PDMRGWTIKGKPASGR VLS EQDGIKSHTHSASAS+TDLGT+TTSS Sbjct: 524 YPLLAAAYPSGVLPDMRGWTIKGKPASGRDVLSLEQDGIKSHTHSASASNTDLGTKTTSS 583 Query: 181 FDYGTKSTNNTGAHTHSISGTANSAGAHQHK---SSGAFGGTNTSIFPNGYTAISNLSAG 237 FDYGTKSTNNTGAHTH++SGTANSAGAH H GG N Sbjct: 584 FDYGTKSTNNTGAHTHNVSGTANSAGAHTHTVPLRRPNSGGMNFDWLDG----------- 632 Query: 238 IMSTTSGSGQTRNAGKTSSDGAHTHSLSGTAASAGAHAHTVGIGAHTHSVAIGSHGHTIT 297 + SG G S GAHTHS+SGTA SAGAHAHTVGIGAHTHSVAIGSHGHTIT Sbjct: 633 -----ASSGTVVGNGTVPSSGAHTHSVSGTATSAGAHAHTVGIGAHTHSVAIGSHGHTIT 687 Query: 298 VNAAGNAENTVKNIAFNYIVRLA 320 VNAAGNAENTVKNIAFNYIVRLA Sbjct: 688 VNAAGNAENTVKNIAFNYIVRLA 710 >UniRef50_P03764 Side tail fiber protein n=2 Tax=root RepID=STF_LAMBD Length = 774 Score = 319 bits (817), Expect = 9e-86, Method: Compositional matrix adjust. 
Identities = 174/251 (69%), Positives = 188/251 (74%), Gaps = 27/251 (10%) Query: 91 YPVGAPIPWPSDTVPSGYALMQGQTFDKSAYPKLAVAYPSGVIPDMRGWTIKGKPASGRA 150 +P GAPIPWPSD VPSGY LMQGQ FDKSAYPKLAVAYPSGV+PDMRGWTIKGKPASGRA Sbjct: 530 FPAGAPIPWPSDIVPSGYVLMQGQAFDKSAYPKLAVAYPSGVLPDMRGWTIKGKPASGRA 589 Query: 151 VLSQEQDGIKSHTHSASASSTDLGTETTS----------SFDYGTKSTNNTGAHTHSISG 200 VLSQEQDGIKSHTHSASAS TDLGT+TTS SFDYGTKSTNNTGAH HS+SG Sbjct: 590 VLSQEQDGIKSHTHSASASGTDLGTKTTSSFDYGTKTTGSFDYGTKSTNNTGAHAHSLSG 649 Query: 201 TANSAGAHQHKSSGAFGGTNTSIFPNGYTAISNLSAGIMSTTSGSGQTRNA--GKTSSDG 258 + +AGAH H S + S + G I+ G +ST G+ A KT S G Sbjct: 650 STGAAGAHAHTSGLRMNSSGWSQY--GTATIT----GSLSTVKGTSTQGIAYLSKTDSQG 703 Query: 259 AHTHSLSGTAASAGAHAHTVGIGAHTHSV---------AIGSHGHTITVNAAGNAENTVK 309 +H+HSLSGTA SAGAHAHTVGIGAH H V +IGSHGHTITVNAAGNAENTVK Sbjct: 704 SHSHSLSGTAVSAGAHAHTVGIGAHQHPVVIGAHAHSFSIGSHGHTITVNAAGNAENTVK 763 Query: 310 NIAFNYIVRLA 320 NIAFNYIVRLA Sbjct: 764 NIAFNYIVRLA 774 >UniRef50_B5PP06 Side tail fiber protein n=2 Tax=Salmonella enterica subsp. enterica RepID=B5PP06_SALHA Length = 534 Score = 315 bits (808), Expect = 1e-84, Method: Compositional matrix adjust. 
Identities = 175/305 (57%), Positives = 203/305 (66%), Gaps = 52/305 (17%) Query: 3 ITALTDNTQGA-AGLELYEVYNNGYPTAYGNIIHLKGMTAVGEGELLIGWSGTSGAHAPA 61 + ALT T+G+ +GL + EVYNNGYPT YGNI+ L G G+GE+LIGWSGT+GA APA Sbjct: 156 LPALTGTTRGSDSGLIMGEVYNNGYPTQYGNILRLTG---TGDGEILIGWSGTNGAPAPA 212 Query: 62 FIRSRRDTTDANWSPWAQLYTSAHPPAEFYPVGAPIPWPSDTVPSGYALMQGQTFDKSAY 121 +IRS RDT DA WS WA LYT+ +PP + +PVGAPI WPSD P+GYALMQGQ+FDKSAY Sbjct: 213 YIRSHRDTADAEWSEWAMLYTTLNPPPDSHPVGAPIAWPSDATPAGYALMQGQSFDKSAY 272 Query: 122 PKLAVAYPSGVIPDMRGWTIKGKPASGRAVLSQEQDGIKSHTHSASASSTDLGTETTSSF 181 P LA+AYPSGVIPDMRGWTIKGKPASGRA+LSQE DG KSH+HSA A TDLGT+TTSSF Sbjct: 273 PLLAIAYPSGVIPDMRGWTIKGKPASGRAILSQEMDGNKSHSHSARAQDTDLGTKTTSSF 332 Query: 182 DYGTKSTNNTGAHTHSISGTANSAGAHQHKSSGAFGGTN-TSIFPNGYTAISNLSAGIMS 240 DYGTKSTN TG HT+ G NS +G +N TS P G Sbjct: 333 DYGTKSTNTTGNHTNQFGGYINS----------YWGDSNHTSFQPGG------------- 369 Query: 241 TTSGSGQTRNAGKTSSDGAHTHSLSGTAASAGAHAHTVGIGAHTHSVAIGSHGHTITVNA 300 GA T +AG HAHTV IG H H++ IG HGH + V+A Sbjct: 370 -----------------GAWTQ-------AAGDHAHTVYIGGHEHTMYIGPHGHVVIVDA 405 Query: 301 AGNAE 305 GNAE Sbjct: 406 DGNAE 410 >UniRef50_Q99362 Protein 37 (Fragment) n=1 Tax=Enterobacteria phage T4 RepID=Q99362_BPT4 Length = 382 Score = 291 bits (745), Expect = 2e-77, Method: Compositional matrix adjust. 
Identities = 166/252 (65%), Positives = 190/252 (75%), Gaps = 23/252 (9%) Query: 91 YPVGAPIPWPSDTVPSGYALMQGQTFDKSAYPKLAVAYPSGVIPDMRGWTIKGKPASGRA 150 YP+GAPIPWP+DT P+GYALM+GQTFD AYPKLA AYPSG IPDMRG TIKGKP SGRA Sbjct: 132 YPIGAPIPWPTDTPPNGYALMEGQTFDTRAYPKLAAAYPSGTIPDMRGQTIKGKP-SGRA 190 Query: 151 VLSQEQDGIKSHTHSASASSTDLGTETTSSFDYGTKST----------NNTGAHTHSISG 200 VLS E DG+KSHTH ASAS+TDLGT+TTSSFDYGTK+T N TG H H++SG Sbjct: 191 VLSTEADGVKSHTHGASASNTDLGTKTTSSFDYGTKTTSSFDYGTKTSNTTGNHNHTVSG 250 Query: 201 TANSAGAHQHKSSG--AFGGTNTSIFPNGYTAI-SNLSAGIMSTTSGSGQTRNAGKTSSD 257 T +SAGAHQH SG G +T+IFP+GY+ + +N ++ T GS GKTS+D Sbjct: 251 TTSSAGAHQHARSGPQLSNGISTNIFPDGYSDVGTNYNSKFSGTVIGSSVPCIIGKTSND 310 Query: 258 GAHTHSLSGTAASA---------GAHAHTVGIGAHTHSVAIGSHGHTITVNAAGNAENTV 308 GAHTH+ SGT ++ GAH HTVGIGAHTH+VAIGSHGHTITVNA GN ENTV Sbjct: 311 GAHTHTWSGTTSTTGNHAHTVGIGAHTHTVGIGAHTHTVAIGSHGHTITVNATGNTENTV 370 Query: 309 KNIAFNYIVRLA 320 KNIAFNYIVRLA Sbjct: 371 KNIAFNYIVRLA 382 >UniRef50_A4TT73 Phage tail protein n=30 Tax=Enterobacteriaceae RepID=A4TT73_YERPP Length = 962 Score = 279 bits (713), Expect = 1e-73, Method: Compositional matrix adjust. 
Identities = 161/262 (61%), Positives = 187/262 (71%), Gaps = 30/262 (11%) Query: 80 LYTSAHPPAEFYPVGAPIPWPSDTVPSGYALMQGQTFDKSAYPKLAVAYPSGVIPDMRGW 139 LY+S PP E YPVGAPIPWP+D PSG+A+MQGQTFDKS YPKLA AYPSGV+PDMRGW Sbjct: 710 LYSSVLPPPESYPVGAPIPWPNDVAPSGFAIMQGQTFDKSVYPKLAAAYPSGVLPDMRGW 769 Query: 140 TIKGKPASGRAVLSQEQDGIKSHTHSASASSTDL----------GTETTSSFDYGTKSTN 189 IKGKP S RAVLS EQDGIKSH H+A+ASSTDL GT+T+S FDYGTKS+N Sbjct: 770 MIKGKPTS-RAVLSLEQDGIKSHAHNAAASSTDLGTKPTTTFDYGTKTSSGFDYGTKSSN 828 Query: 190 NTGAHTHSISGTANSAGAHQHKSSGAFGGTNTSIFPNGYTAISNLSAGIMSTTSGSGQTR 249 +TGAH HS+SG+ +S+GAH H T + +P + + + G T + T Sbjct: 829 STGAHAHSLSGSTSSSGAHAHTV------TAHTQYPRSTDSRNQNAVGKQYNTQQT--TA 880 Query: 250 NAGK--TSSDGAHTHSLSGTAASAGAHAHTVGIGA---------HTHSVAIGSHGHTITV 298 NA TSS G H HS+SGTA SAGAHAHTVGIGA H+HSVAIG+H HTIT+ Sbjct: 881 NAFNVWTSSAGDHAHSISGTAVSAGAHAHTVGIGAHAHSLSIGSHSHSVAIGAHSHTITI 940 Query: 299 NAAGNAENTVKNIAFNYIVRLA 320 A GNAENTVKNIA+NYIVRLA Sbjct: 941 AACGNAENTVKNIAYNYIVRLA 962 >UniRef50_B7LN99 Putative tail fiber protein n=2 Tax=Enterobacteriaceae RepID=B7LN99_ESCF3 Length = 593 Score = 241 bits (616), Expect = 2e-62, Method: Compositional matrix adjust. 
Identities = 154/242 (63%), Positives = 170/242 (70%), Gaps = 31/242 (12%) Query: 81 YTSAHPPAEFYPVGAPIPWPSDTVPSGYALMQGQTFDKSAYPKLAVAYPSGVIPDMRGWT 140 + P +PVGAPI WPSD VP GYA+MQGQTFDK+AYP LA AYPSGVIPDMRGWT Sbjct: 381 FERGFEPVNAFPVGAPIAWPSDIVPEGYAIMQGQTFDKAAYPLLAAAYPSGVIPDMRGWT 440 Query: 141 IKGKPASGRAVLSQEQDGIKSHTHSASASSTDLGTETTSSFDYGTK--STNNTGAHTHSI 198 IKGKPASGRAVLSQEQDGIKSHTHSASASSTDLGT+TTSSFDYGTK ST N G Sbjct: 441 IKGKPASGRAVLSQEQDGIKSHTHSASASSTDLGTKTTSSFDYGTKTVSTFNHGTK---- 496 Query: 199 SGTANSAGAHQHKSSGAFGGTNTSIFPNGYTAISNLSAGIMSTTSGSGQTRNAGKTSSDG 258 T N+ GAH H G +GG ++ SG+ Q +SSDG Sbjct: 497 --TTNNTGAHTHTVGGRYGG-------------DSIGGKQRVQVSGTNQV-----SSSDG 536 Query: 259 AHTHSLSGTAASAGAHAHTVGIGAHTHSVAIGSHGHTITVNAAGNAENTVKNIAFNYIVR 318 AH H++ G H HTVGIGAH H+VA+G+HGHTITVNAAGNAENTVKNIAFNYIVR Sbjct: 537 AHAHTV-----DIGQHNHTVGIGAHAHTVALGAHGHTITVNAAGNAENTVKNIAFNYIVR 591 Query: 319 LA 320 LA Sbjct: 592 LA 593 >UniRef50_B7LKX7 Putative side tail phage protein n=2 Tax=Escherichia RepID=B7LKX7_ESCF3 Length = 567 Score = 232 bits (592), Expect = 1e-59, Method: Compositional matrix adjust. 
Identities = 124/243 (51%), Positives = 158/243 (65%), Gaps = 44/243 (18%) Query: 88 AEFYPVGAPIPWPSDTVPSGYALMQGQTFDKSAYPKLAVAYPSGVIPDMRGWTIKGKPAS 147 AE PVG PIPWPSD+VPSGYALM GQTF+K++YPKLA+AYPSGVIPDMRGW IKGKP+S Sbjct: 359 AESCPVGMPIPWPSDSVPSGYALMTGQTFNKTSYPKLAIAYPSGVIPDMRGWIIKGKPSS 418 Query: 148 GRAVLSQEQDGIKSHTHSASAS----------STDLGTETTSSFDYGTKSTNNTGAHTHS 197 GRA+LS E DG+KSH H+ S S STDLGT+TT+SF++G+++T+ +G HTH Sbjct: 419 GRAILSTELDGVKSHNHTGSISSTNLGTITSTSTDLGTKTTASFNHGSRNTSTSGEHTHR 478 Query: 198 ISGTANSAGAHQHKSSGAFGGTNTSIFPNGYTAISNLSAGIMSTTSGSGQTRNAGKTSSD 257 I + GA G S++ S + T S Sbjct: 479 IP------------TDGAEGKDGPSLW-----------------NSPNSDENYREPTESA 509 Query: 258 GAHTHSLSGTAASAGAHAHTVGIGAHTHSVAIGSHGHTITVNAAGNAENTVKNIAFNYIV 317 G+H HS+ + GAHAHT+ +G+HTH++ +G+H H+I +N GN ENTVKNIAFNYIV Sbjct: 510 GSHYHSI-----TIGAHAHTIALGSHTHNIVLGTHNHSIIINNTGNTENTVKNIAFNYIV 564 Query: 318 RLA 320 RLA Sbjct: 565 RLA 567 >UniRef50_B4T041 Gp19 n=1 Tax=Salmonella enterica subsp. enterica serovar Newport str. SL254 RepID=B4T041_SALNS Length = 580 Score = 227 bits (579), Expect = 3e-58, Method: Compositional matrix adjust. 
Identities = 123/234 (52%), Positives = 139/234 (59%), Gaps = 46/234 (19%) Query: 87 PAEFYPVGAPIPWPSDTVPSGYALMQGQTFDKSAYPKLAVAYPSGVIPDMRGWTIKGKPA 146 P P G P+PWPSDT+P+GYALMQGQ FDK+ YP LA+AYPSG IPDMRGWTIKGKP Sbjct: 393 PMMSCPPGVPLPWPSDTIPAGYALMQGQAFDKNVYPLLAIAYPSGTIPDMRGWTIKGKPV 452 Query: 147 SGRAVLSQEQDGIKSHTHSASASSTDLGTETTSSFDYGTKSTNNTGAHTHSISGTANSAG 206 SGRAVLSQE DG KSH+H A A TDLGT+ TSSFDYGTKS+N TG H HS GT Sbjct: 453 SGRAVLSQELDGNKSHSHGARALDTDLGTKGTSSFDYGTKSSNTTGGHNHSAGGT----- 507 Query: 207 AHQHKSSGAFGGTNTSIFPNGYTAISNLSAGIMSTTSGSGQTRNAGKTSSDGAHTHSLSG 266 +GG + G + + G Sbjct: 508 ---------YGG---------------------DSIGGKARVQRDGNDQ----------- 526 Query: 267 TAASAGAHAHTVGIGAHTHSVAIGSHGHTITVNAAGNAENTVKNIAFNYIVRLA 320 + G HAHT IG H H+V IG HGH + V+A GNAE TVKNIAFNYIVRLA Sbjct: 527 LTSWNGDHAHTTWIGPHDHTVYIGPHGHVVIVDADGNAETTVKNIAFNYIVRLA 580 >UniRef50_B3I2W7 Phage Tail Collar Domain family n=1 Tax=Escherichia coli E22 RepID=B3I2W7_ECOLX Length = 654 Score = 225 bits (574), Expect = 1e-57, Method: Compositional matrix adjust. 
Identities = 134/257 (52%), Positives = 153/257 (59%), Gaps = 49/257 (19%) Query: 74 WSPWAQLYTSAHPPAEFYPVGAPIPWPSDTVPSGYALMQGQTFDKSAYPKLAVAYPSGVI 133 W+PW P + YPVGAPIPWPSD P+GYALMQGQ FDK+ YP LA+AYP+G+I Sbjct: 437 WTPWM--------PEDSYPVGAPIPWPSDVTPTGYALMQGQPFDKAVYPLLAIAYPAGII 488 Query: 134 PDMRGWTIKGKPASGRAVLSQEQDGIKSHTHSASASSTDLGTETTSSFDYGT-------- 185 PDMRG TIKGKP +GRAVLS EQDG+ SHTH AS S TDLGT+ TSSFDYG+ Sbjct: 489 PDMRGQTIKGKP-NGRAVLSYEQDGVISHTHGASISDTDLGTKYTSSFDYGSKPTTSFDY 547 Query: 186 --KSTNNTGAHTHSISGTANSAGAHQHKSSGAFGGTNTSIFPNGYTAISNLSAGIMSTTS 243 KS+ G H H+ A SA + G G ++ SN+S Sbjct: 548 GNKSSTEGGWHAHNFRYCATSA---YRDTPGQGLGMHS----------SNVSWAAGDRIE 594 Query: 244 GSGQTRNAGKTSSDGAHTHSLSGTAASAGAHAHTVGIGAHTHSVAIGSHGHTITVNAAGN 303 GSG H H G H H VGIGAH H V +G HGHT TV+AAGN Sbjct: 595 GSGN------------HAH-----VTWIGPHDHWVGIGAHNHYVVMGYHGHTATVHAAGN 637 Query: 304 AENTVKNIAFNYIVRLA 320 AENTVKNIAFNYIVRLA Sbjct: 638 AENTVKNIAFNYIVRLA 654 >UniRef50_Q3ZL14 Tail fiber protein n=2 Tax=Enterobacteriaceae RepID=Q3ZL14_ESCBL Length = 289 Score = 194 bits (494), Expect = 2e-48, Method: Compositional matrix adjust. 
Identities = 116/242 (47%), Positives = 142/242 (58%), Gaps = 27/242 (11%) Query: 79 QLYTSAHPPAEFYPVGAPIPWPSDTVPSGYALMQGQTFDKSAYPKLAVAYPSGVIPDMRG 138 +L++S + P G + WP T P+G+ALM GQTFD +AYP+LA AYPSGVIPDMRG Sbjct: 75 RLFSSDY----MLPPGIALAWPGATAPTGFALMLGQTFDTTAYPRLAQAYPSGVIPDMRG 130 Query: 139 WTIKGKPASGRAVLSQEQDGIKSHTHSASASSTDLGTETTSSFDYGTKSTNNTGAHTHSI 198 TIK PASGR +LS E DG+KSH+HS S S+TDLGT T + D GTK T+ G H H Sbjct: 131 QTIKFLPASGRTLLSLEADGVKSHSHSGSISTTDLGTATAADTDLGTKQTSQDGLHNHVS 190 Query: 199 SGTANSAGAHQHKSSGAFGGTNTSIFPNGYTAISNLSAGIMSTTSGSGQTRNAGKTSSDG 258 N A +SS G NT G + + + + R +G S Sbjct: 191 DSRFNKLMA---RSSDIDGTNNT---------------GDVDSDNPESEHRVSGMNDSLW 232 Query: 259 AHTHSLSGTAASAGAHAHTVGIGAHTHSVAIGSHGHTITVNAAGNAENTVKNIAFNYIVR 318 A + A +G H HTV IG H HSV IG HGHT+T++ GN ENTVKNIAFN IVR Sbjct: 233 A-----ASVIADSGLHMHTVYIGPHAHSVYIGPHGHTVTISNFGNTENTVKNIAFNAIVR 287 Query: 319 LA 320 LA Sbjct: 288 LA 289 >UniRef50_B6S308 ORF32 (Fragment) n=3 Tax=Enterobacteriaceae RepID=B6S308_SALDU Length = 427 Score = 182 bits (461), Expect = 2e-44, Method: Compositional matrix adjust. Identities = 88/131 (67%), Positives = 103/131 (78%), Gaps = 4/131 (3%) Query: 3 ITALTDNTQGA-AGLELYEVYNNGYPTAYGNIIHLKGMTAVGEGELLIGWSGTSGAHAPA 61 + ALT T+G+ +GL + EVYNNGYPT YGNI+ L G G+GE+LIGWSGT+GA APA Sbjct: 300 LPALTGATRGSDSGLIMGEVYNNGYPTQYGNILRLTG---TGDGEILIGWSGTNGAPAPA 356 Query: 62 FIRSRRDTTDANWSPWAQLYTSAHPPAEFYPVGAPIPWPSDTVPSGYALMQGQTFDKSAY 121 +IRS RDT DA WS WA LYTS +PP YPVGA I WPSD P+GYALMQGQ+FDKSAY Sbjct: 357 YIRSHRDTADAEWSEWAMLYTSLNPPPNSYPVGAAIAWPSDATPAGYALMQGQSFDKSAY 416 Query: 122 PKLAVAYPSGV 132 P LA+AYPSG+ Sbjct: 417 PLLAIAYPSGI 427 >UniRef50_A9DEM1 Tail protein 2 n=1 Tax=Yersinia phage PY100 RepID=A9DEM1_9CAUD Length = 255 Score = 171 bits (433), Expect = 3e-41, Method: Compositional matrix adjust. 
Identities = 106/229 (46%), Positives = 129/229 (56%), Gaps = 45/229 (19%) Query: 92 PVGAPIPWPSDTVPSGYALMQGQTFDKSAYPKLAVAYPSGVIPDMRGWTIKGKPASGRAV 151 PVGAP+ WPSDT P G+ALM GQTFDK YP LA YPSGV+PDMRG IK KP GRAV Sbjct: 72 PVGAPLAWPSDTAPDGWALMIGQTFDKVKYPLLAKVYPSGVLPDMRGRVIKAKP-DGRAV 130 Query: 152 LSQEQDGIKSHTHSASASSTDLGTETTSSFDYGTKSTNNTGAHTHSISGTANSAGAHQHK 211 LS E+D +KSHTH+ A++ GT TS+FD+G K T G HTH G+ A +H Sbjct: 131 LSLEEDQVKSHTHTGKAATAG-GTRATSTFDHGNKRTTTNGNHTH---GSPQGA---RHG 183 Query: 212 SSGAFGGTNTSIFPNGYTAISNLSAGIMSTTSGSGQTRNAGKTSSDGAHTHSLSGTAASA 271 SG + TSG +T + + ++A Sbjct: 184 GSGQY-------------------------TSGDDETNSVFNWPA-----------TSAA 207 Query: 272 GAHAHTVGIGAHTHSVAIGSHGHTITVNAAGNAENTVKNIAFNYIVRLA 320 G H H V IG H H+V I +H HT+ ++A G ENTVKNIA NYIVRLA Sbjct: 208 GDHFHDVQIGPHNHNVDI-NHEHTLQIDATGGTENTVKNIAMNYIVRLA 255 >UniRef50_Q38190 Gp37, tip of tail fiber (Fragment) n=5 Tax=Enterobacteria phage T4 RepID=Q38190_BPT4 Length = 226 Score = 148 bits (373), Expect = 3e-34, Method: Compositional matrix adjust. 
Identities = 118/226 (52%), Positives = 140/226 (61%), Gaps = 46/226 (20%) Query: 140 TIKGKPASGRAVLSQEQDGIKSHTHSASASSTDLGTETTSSFDYGTKSTNNTGAHTHSIS 199 TIKGKP SGRAVLS E DG+K+H+HSASASSTDLGT+TTSSFDYGTK TN+TG HTHS S Sbjct: 1 TIKGKP-SGRAVLSAEADGVKAHSHSASASSTDLGTKTTSSFDYGTKGTNSTGGHTHSGS 59 Query: 200 GTANSAGAHQH--------------KSSGAF----GGTNTSIFPNGYTAIS--NLSAGIM 239 G+ ++ G H H SS A GG+NT+ N S SAG Sbjct: 60 GSTSTNGEHSHYIEAWNGTGVGGNKMSSYAISYRAGGSNTNAAGNHSHTFSFGTSSAGDH 119 Query: 240 STTSGSGQ-----------------------TRNAG--KTSSDGAHTHSLSGTAASAGAH 274 S + G G+ + AG T++ G H+H+ S +SAG H Sbjct: 120 SHSVGIGEHSHYIEAWNGTGVGGNKMSSYAISYRAGGSNTNAAGNHSHTFSFGTSSAGDH 179 Query: 275 AHTVGIGAHTHSVAIGSHGHTITVNAAGNAENTVKNIAFNYIVRLA 320 +H+VGIGAHTH+VAIGSHGHTITVN+ GN ENTVKNIAFNYIV LA Sbjct: 180 SHSVGIGAHTHTVAIGSHGHTITVNSTGNTENTVKNIAFNYIVALA 225 >UniRef50_B3X4P8 Tail fiber n=3 Tax=Enterobacteriaceae RepID=B3X4P8_SHIDY Length = 305 Score = 139 bits (350), Expect = 1e-31, Method: Compositional matrix adjust. Identities = 62/81 (76%), Positives = 71/81 (87%) Query: 3 ITALTDNTQGAAGLELYEVYNNGYPTAYGNIIHLKGMTAVGEGELLIGWSGTSGAHAPAF 62 +TAL+ QG AGL++YEVYNNGYPTAYGN++HLKG A GEGELLIGWSGTSGAHAP + Sbjct: 119 VTALSSTAQGNAGLQMYEVYNNGYPTAYGNVLHLKGAAASGEGELLIGWSGTSGAHAPVY 178 Query: 63 IRSRRDTTDANWSPWAQLYTS 83 IRSRRDTTDA WS WAQ++TS Sbjct: 179 IRSRRDTTDAVWSEWAQVFTS 199 >UniRef50_D0IJ09 Tail fiber protein H putative n=1 Tax=Vibrio sp. RC586 RepID=D0IJ09_9VIBR Length = 368 Score = 127 bits (319), Expect = 6e-28, Method: Compositional matrix adjust. 
Identities = 86/233 (36%), Positives = 111/233 (47%), Gaps = 66/233 (28%) Query: 88 AEFYPVGAPIPWPSDTVPSGYALMQGQTFDKSAYPKLAVAYPSGVIPDMRGWTIKGKPAS 147 A+ PVG P+PWPSD P G+A+ +GQ FDK A P+LA YP G++ D+RG + GK Sbjct: 200 AKICPVGVPLPWPSDIAPEGFAIHKGQAFDKVANPELAKLYPDGILKDLRGMAVVGK-KE 258 Query: 148 GRAVLSQEQDGIKSHTHSASASSTDLGTETTSSFDYGTKSTNNTGAHTHSI-SGTANSAG 206 G +LS E D +K H + S T SS D G+++TN TG H H +GT+N Sbjct: 259 GEIILSYEADQVKQHGYPNS---------TVSSTDLGSRNTNTTGNHAHGYPAGTSNG-- 307 Query: 207 AHQHKSSGAFGGTNTSIFPNGYTAISNLSAGIMSTTSGSGQTRNAGKTSSDGAHTHSLSG 266 PNG D AH Sbjct: 308 ------------------PNG--------------------------PYLDTAH------ 317 Query: 267 TAASAGAHAHTVGIGAHTHSVAIGSHGHTITVNAAGNAENTVKNIAFNYIVRL 319 AS G + +T G H HSVAIGSH H+I + G ENT+KNI FN+IVR+ Sbjct: 318 --ASYG-YRYTTTEGNHYHSVAIGSHAHSIAIALFGATENTIKNIKFNWIVRM 367 >UniRef50_A3GRX3 Tail fiber like-protein n=1 Tax=Vibrio cholerae NCTC 8457 RepID=A3GRX3_VIBCH Length = 182 Score = 106 bits (265), Expect = 9e-22, Method: Compositional matrix adjust. 
Identities = 73/233 (31%), Positives = 106/233 (45%), Gaps = 64/233 (27%) Query: 88 AEFYPVGAPIPWPSDTVPSGYALMQGQTFDKSAYPKLAVAYPSGVIPDMRGWTIKGKPAS 147 + +PVG IPW +D P G+ + +GQ FD + Y +LA +P+G+IPDMRG + GK Sbjct: 14 VKIFPVGGAIPWFTDVAPEGFGMFKGQAFDVNVYTELAKVFPNGIIPDMRGCGVIGKE-D 72 Query: 148 GRAVLSQEQDGIKSHTHSASASSTDLGTETTSSFDYGTKSTNNTGAHTHSISGTANSAGA 207 G AV + E+ +K+H H S T SS D G+K+T N G HTH A G+ Sbjct: 73 GEAVGAYEEGQVKNHGHPNS---------TVSSIDLGSKNTANGGNHTHFSGIAAFGGGS 123 Query: 208 HQHKSSGAFGGTNTSIFPNGYTAISNLSAGIMSTTSGSGQTRNAGKTSSDGAHTHSLSGT 267 H++++ +GSG N TS+ G H H Sbjct: 124 HRYQTD----------------------------VNGSGGNIN---TSAAGNHYH----- 147 Query: 268 AASAGAHAHTVGIGAHTHSVAIGSHGHTITVNAAGNAENTVKNIAFNYIVRLA 320 S+ +GSH H +T+ G +NT+ + N+IVRLA Sbjct: 148 ------------------SIPMGSHAHAVTIALFGALKNTINHRKINWIVRLA 182 >UniRef50_C5H7L2 Putative tail fiber protein GP37 n=3 Tax=unclassified Myoviridae RepID=C5H7L2_9CAUD Length = 391 Score = 106 bits (265), Expect = 1e-21, Method: Compositional matrix adjust. Identities = 74/142 (52%), Positives = 88/142 (61%), Gaps = 23/142 (16%) Query: 162 HTHSASAS--STDLGTETTSSFDYGTKSTNNTGAHTHSISGTANSAGAHQHK---SSGAF 216 HTHSAS S S D G+++TS+FDYGTK+TN+ GAHTH+ SGT ++AG H H+ Sbjct: 230 HTHSASVSISSFDYGSKSTSTFDYGTKTTNSAGAHTHTFSGTTSNAGNHNHRVPMRGNDR 289 Query: 217 GGTNTSIFPNGYTAISNLSAGIMSTTSGSGQTRNAGKTSSDGAHTHSLSGTAASAGAHAH 276 GGTN AI T S NA T GAHTHS SGT AS+GAH+H Sbjct: 290 GGTN---------AI---------TASADAGVGNAMYTDLAGAHTHSFSGTTASSGAHSH 331 Query: 277 TVGIGAHTHSVAIGSHGHTITV 298 TV IGAH+H+V IGSH HT TV Sbjct: 332 TVAIGAHSHTVNIGSHSHTGTV 353 >UniRef50_Q6KGF6 Putative tail fiber protein GP37 n=2 Tax=unclassified Myoviridae RepID=Q6KGF6_9CAUD Length = 782 Score = 92.8 bits (229), Expect = 2e-17, Method: Compositional matrix adjust. 
Identities = 77/207 (37%), Positives = 103/207 (49%), Gaps = 50/207 (24%) Query: 89 EFYPVGAPIPWPSDTVPSGYALMQGQTFDKSAYPKLAVAYPSGVIPDMRGWTIKGKPASG 148 + YPVG + S+ P+ +A P L Y + + G TI+ A+G Sbjct: 589 KIYPVGIVTWFNSNVNPN------------TALPGLTWTYLNNGV----GRTIRIAAANG 632 Query: 149 RAV--------LSQEQDGIKSHTHSASASSTDLGTETTSSFDYGTKSTNNTGAHTHSISG 200 V ++ + SHTHS SA TTSSFDYGTK+TN TGAHTHS+SG Sbjct: 633 SDVATTGGSDSVTLSVGNLPSHTHSFSA--------TTSSFDYGTKTTNTTGAHTHSVSG 684 Query: 201 TANSAGAHQHKSSGAFGGTNTSIFPNGYTAISNLSAGIMSTTSGSGQTRNAGKTSSDGAH 260 + N+ GAH H G +GG ++ SG+ Q +S G H Sbjct: 685 STNNTGAHTHTFGGRYGG-------------DSIGGKHRVHVSGTEQV-----SSVAGDH 726 Query: 261 THSLSGTAASAGAHAHTVGIGAHTHSV 287 +H++ GTAAS G HAHTVGIGAH+H+V Sbjct: 727 SHTVYGTAASNGNHAHTVGIGAHSHTV 753 >UniRef50_C7BL21 Similarities with phage tail fiber protein n=1 Tax=Photorhabdus asymbiotica subsp. asymbiotica ATCC 43949 RepID=C7BL21_PHOAA Length = 452 Score = 89.7 bits (221), Expect = 1e-16, Method: Compositional matrix adjust. Identities = 48/97 (49%), Positives = 57/97 (58%), Gaps = 5/97 (5%) Query: 74 WSPWAQLYTSAHPPAEFYPVGAPIPWPSDTVPSGYALMQGQTFDKSAYPKLAVAYPSGVI 133 W W +Y+SA P E +PVGAPIP+P P GY GQTFDKS YPKLA AYPSG + Sbjct: 286 WIGWDIVYSSAILPPEQHPVGAPIPYPHRYTPVGYLTCNGQTFDKSLYPKLAEAYPSGRV 345 Query: 134 PDMRGWTIKGKPAS-----GRAVLSQEQDGIKSHTHS 165 PD+RG I+G S GR S + K+H H Sbjct: 346 PDLRGEFIRGWDDSRGVDPGRVCGSWQDSDNKAHIHD 382 >UniRef50_D2U1K0 Phage tail protein n=1 Tax=Arsenophonus nasoniae RepID=D2U1K0_9ENTR Length = 366 Score = 88.6 bits (218), Expect = 3e-16, Method: Compositional matrix adjust. 
Identities = 40/78 (51%), Positives = 51/78 (65%), Gaps = 5/78 (6%) Query: 91 YPVGAPIPWPSDTVPSGYALMQGQTFDKSAYPKLAVAYPSGVIPDMRGWTIKGKPA---- 146 YPVGAPIPWP T P GY + G+ FDK PKL +AYPSG +PD+RG+ I+G A Sbjct: 218 YPVGAPIPWPQATPPKGYLICNGEPFDKVKCPKLLIAYPSGKLPDLRGYFIRGWDAGKGV 277 Query: 147 -SGRAVLSQEQDGIKSHT 163 GR V S ++D I++ T Sbjct: 278 DPGREVFSYQEDAIRNIT 295 >UniRef50_C6DE08 Tail Collar domain protein n=1 Tax=Pectobacterium carotovorum subsp. carotovorum PC1 RepID=C6DE08_PECCP Length = 682 Score = 87.4 bits (215), Expect = 6e-16, Method: Compositional matrix adjust. Identities = 48/113 (42%), Positives = 63/113 (55%), Gaps = 18/113 (15%) Query: 64 RSRRDTTDANWSPWAQLYTSAHPPA----------EFYPVGAPIPWPSDTVPSGYALMQG 113 RS RD + PWA++YT P E VG P+PWP T PSG+ G Sbjct: 497 RSSRDNSGFE-KPWARIYTDQDKPTAADIGALSLNEI--VGMPMPWPQTTAPSGWLKCNG 553 Query: 114 QTFDKSAYPKLAVAYPSGVIPDMRGWTIKGKPAS-----GRAVLSQEQDGIKS 161 QTFDK+ YPKLA YP+G++PD+RG I+G S GR +LS + D I++ Sbjct: 554 QTFDKNIYPKLAQIYPAGILPDLRGEFIRGWDDSRGVDTGRTLLSTQGDAIRN 606 >UniRef50_Q7N2R3 Similar to tail fiber assembly protein from bacteriophage n=1 Tax=Photorhabdus luminescens subsp. laumondii RepID=Q7N2R3_PHOLL Length = 233 Score = 87.0 bits (214), Expect = 8e-16, Method: Compositional matrix adjust. Identities = 40/78 (51%), Positives = 52/78 (66%), Gaps = 5/78 (6%) Query: 92 PVGAPIPWPSDTVPSGYALMQGQTFDKSAYPKLAVAYPSGVIPDMRGWTIKGKPAS---- 147 PVG P+P+PS P+GY GQ FDKS YP+LA+AYPSG++PD+RG I+G S Sbjct: 93 PVGVPLPYPSRYTPAGYLTCNGQAFDKSRYPQLAIAYPSGILPDLRGEFIRGWDDSRGVD 152 Query: 148 -GRAVLSQEQDGIKSHTH 164 GR +LS + GI+ H H Sbjct: 153 MGRGMLSWQPAGIQDHMH 170 >UniRef50_Q7N2Q1 Similar to Sc/SvQ protein of Escherichia coli plasmid p15B n=1 Tax=Photorhabdus luminescens subsp. laumondii RepID=Q7N2Q1_PHOLL Length = 478 Score = 85.5 bits (210), Expect = 2e-15, Method: Compositional matrix adjust. 
Identities = 39/76 (51%), Positives = 52/76 (68%), Gaps = 5/76 (6%) Query: 93 VGAPIPWPSDTVPSGYALMQGQTFDKSAYPKLAVAYPSGVIPDMRGWTIKG-----KPAS 147 VG+PIPWP VP+GY GQ+F+KS YP+LA+AYPSGV+PD+RG I+G Sbjct: 337 VGSPIPWPLPNVPAGYLACNGQSFNKSLYPQLAIAYPSGVLPDLRGEFIRGWDDGRGVDR 396 Query: 148 GRAVLSQEQDGIKSHT 163 GR VL+ + D I++ T Sbjct: 397 GRGVLTHQGDAIRNIT 412 >UniRef50_A9Q1X5 Putative tail fiber protein n=1 Tax=Enterobacteria phage phiEcoM-GJ1 RepID=A9Q1X5_9CAUD Length = 356 Score = 85.1 bits (209), Expect = 3e-15, Method: Compositional matrix adjust. Identities = 76/224 (33%), Positives = 119/224 (53%), Gaps = 14/224 (6%) Query: 62 FIRSRRDT-TDANWSPWAQLYTSAHPPAEFYPVGAPIPWPSDTVPSGYALMQGQTFDKSA 120 ++RS+ T +DAN+ W + + YP+G + + + T P+ G +++ Sbjct: 87 WVRSQNGTVSDANFDEWTEFVNMNNIYNAIYPIGIVVKFDNATNPNNN--FTGTVWEQII 144 Query: 121 YPKLAVAY--PSGVIPDMRGWTIKGKPASGRAV--LSQEQDGIKSHTHSASASSTDLGTE 176 ++A A P D + +I G + AV L G+++HTH ++ S + Sbjct: 145 DGRVARAATGPEAGTADGQIGSIAGSDTANIAVTNLPGHTHGMQNHTHGIASHSHTMAHT 204 Query: 177 TTSSFDYGTKSTNNTGAHTHSISGTANSAGAHQHKSSGAFGGTNTSIFPNGYTAISNLSA 236 T + D+G +++++GAHTHS+SGTA SAGAHQH F G + + T+ N+S Sbjct: 205 HTINHDHGAVTSSSSGAHTHSVSGTAASAGAHQHTEGSPFTG-DVNFGTTTSTSKDNISD 263 Query: 237 GIMSTTSGSGQTRNAGKTSSDGAHTHSLSGTAASAGAHAHTVGI 280 + S + TR TSS GAHTHS+SGTAASAGAH H+V + Sbjct: 264 WLYSPS-----TRYP-LTSSSGAHTHSVSGTAASAGAHTHSVDL 301 >UniRef50_B2PZV1 Putative uncharacterized protein n=2 Tax=Gammaproteobacteria RepID=B2PZV1_PROST Length = 526 Score = 84.7 bits (208), Expect = 4e-15, Method: Compositional matrix adjust. 
Identities = 40/78 (51%), Positives = 50/78 (64%), Gaps = 5/78 (6%) Query: 92 PVGAPIPWPSDTVPSGYALMQGQTFDKSAYPKLAVAYPSGVIPDMRGWTIKGKPA----- 146 PVGAPIPWP T PSGY + GQ F+K+ YP L AYPSG +PD+RG I+G A Sbjct: 377 PVGAPIPWPQATAPSGYLICNGQAFNKTTYPLLTKAYPSGKLPDLRGEFIRGLDAGRNID 436 Query: 147 SGRAVLSQEQDGIKSHTH 164 +GR VLS ++ + H H Sbjct: 437 NGRVVLSFQRCATEHHKH 454 >UniRef50_D2U2G6 Phage tail assembly protein n=1 Tax=Arsenophonus nasoniae RepID=D2U2G6_9ENTR Length = 580 Score = 83.6 bits (205), Expect = 9e-15, Method: Compositional matrix adjust. Identities = 40/78 (51%), Positives = 51/78 (65%), Gaps = 5/78 (6%) Query: 91 YPVGAPIPWPSDTVPSGYALMQGQTFDKSAYPKLAVAYPSGVIPDMRGWTIKG-----KP 145 YPVGAPIPWP T P+GY + G FDK+ YP+LA+AYPSG +P + G I+G K Sbjct: 431 YPVGAPIPWPQATPPNGYFVCDGNYFDKAKYPQLALAYPSGKLPLLYGEFIRGLDLGRKV 490 Query: 146 ASGRAVLSQEQDGIKSHT 163 GR VLS + D I++ T Sbjct: 491 DPGRTVLSNQGDAIRNIT 508 >UniRef50_C6CP88 Tail Collar domain protein n=5 Tax=Enterobacteriaceae RepID=C6CP88_DICZE Length = 485 Score = 81.6 bits (200), Expect = 3e-14, Method: Compositional matrix adjust. Identities = 44/99 (44%), Positives = 60/99 (60%), Gaps = 7/99 (7%) Query: 79 QLYTSAHPPAEFYPVGAPIPWPSDTVPSGYALMQGQTFDKSAYPKLAVAYPSGVIPDMRG 138 QL T+A AE G P+PWP VP+G+ GQ FDK+ YP+LA YPSGV+PD+RG Sbjct: 332 QLATTAWFAAEI--AGIPLPWPQAAVPTGWLKCNGQAFDKNRYPRLAQVYPSGVLPDLRG 389 Query: 139 WTIKGKP-----ASGRAVLSQEQDGIKSHTHSASASSTD 172 I+G SGR VLSQ++ + ++ SA ++D Sbjct: 390 EFIRGWDDGRGVDSGREVLSQQRGSLINYDGPDSAPTSD 428 >UniRef50_D2TR91 Putative phage tail fibre protein n=1 Tax=Citrobacter rodentium ICC168 RepID=D2TR91_CITRO Length = 279 Score = 81.6 bits (200), Expect = 4e-14, Method: Compositional matrix adjust. 
Identities = 43/103 (41%), Positives = 59/103 (57%), Gaps = 5/103 (4%) Query: 92 PVGAPIPWPSDTVPSGYALMQGQTFDKSAYPKLAVAYPSGVIPDMRGWTIKG-----KPA 146 PVG P+PWPS T P G+ G TF S YPKL +AYPSG +PD+RG I+G Sbjct: 144 PVGVPVPWPSATPPEGWLKCNGATFSSSLYPKLGLAYPSGKLPDLRGEFIRGWDDGRGAD 203 Query: 147 SGRAVLSQEQDGIKSHTHSASASSTDLGTETTSSFDYGTKSTN 189 +GR++LS + D +SH+H+ S + T+ +D T N Sbjct: 204 NGRSLLSSQGDAFRSHSHNFDRSWGLENFDATAGYDVVTADIN 246 >UniRef50_A4WEL3 Phage Tail Collar domain protein n=2 Tax=Enterobacteriaceae RepID=A4WEL3_ENT38 Length = 340 Score = 80.9 bits (198), Expect = 5e-14, Method: Compositional matrix adjust. Identities = 39/83 (46%), Positives = 51/83 (61%), Gaps = 5/83 (6%) Query: 90 FYPVGAPIPWPSDTVPSGYALMQGQTFDKSAYPKLAVAYPSGVIPDMRGWTIKG-----K 144 + PVG P+PWP T P G+ G FDK YPKLAVAYPSG++PD+RG I+G Sbjct: 189 YLPVGFPLPWPQATPPQGWLKCNGAPFDKVKYPKLAVAYPSGLLPDLRGEFIRGWDDGRG 248 Query: 145 PASGRAVLSQEQDGIKSHTHSAS 167 SGR L+ + D ++ T +AS Sbjct: 249 VDSGRVALTTQGDAVQKMTGAAS 271 >UniRef50_C7BSQ1 Side tail fiber protein homolog from lambdoid prophage e14 n=3 Tax=Photorhabdus RepID=C7BSQ1_PHOAA Length = 166 Score = 79.3 bits (194), Expect = 2e-13, Method: Compositional matrix adjust. Identities = 40/81 (49%), Positives = 48/81 (59%), Gaps = 5/81 (6%) Query: 89 EFYPVGAPIPWPSDTVPSGYALMQGQTFDKSAYPKLAVAYPSGVIPDMRGWTIKGKPAS- 147 E PVG P+PWP+D P G+ G FDK YPKLAVAYPSG +PD+RG I+G Sbjct: 7 EEIPVGIPLPWPTDIPPYGWVKCNGAIFDKYLYPKLAVAYPSGNLPDLRGEFIRGWDDGR 66 Query: 148 ----GRAVLSQEQDGIKSHTH 164 GR VLS + I H+H Sbjct: 67 GVDIGRYVLSTQLADIAPHSH 87 >UniRef50_D1TPQ4 Phage tail collar domain protein n=1 Tax=Yersinia pestis KIM D27 RepID=D1TPQ4_YERPE Length = 262 Score = 79.3 bits (194), Expect = 2e-13, Method: Compositional matrix adjust. 
Identities = 36/75 (48%), Positives = 49/75 (65%), Gaps = 5/75 (6%) Query: 92 PVGAPIPWPSDTVPSGYALMQGQTFDKSAYPKLAVAYPSGVIPDMRGWTIKG-----KPA 146 PVG P+PWP+ T P G+ G FDK YPKLA+AYPSG++PD+RG I+G Sbjct: 105 PVGVPLPWPTATPPEGWLKCNGAIFDKVKYPKLALAYPSGILPDLRGEFIRGWDDGLGVD 164 Query: 147 SGRAVLSQEQDGIKS 161 +GR +LS + D I++ Sbjct: 165 AGREILSIQGDAIRN 179 >UniRef50_C6C5D4 Tail Collar domain protein n=1 Tax=Dickeya dadantii Ech703 RepID=C6C5D4_DICDC Length = 557 Score = 78.6 bits (192), Expect = 3e-13, Method: Compositional matrix adjust. Identities = 42/95 (44%), Positives = 58/95 (61%), Gaps = 6/95 (6%) Query: 94 GAPIPWPSDTVPSGYALMQGQTFDKSAYPKLAVAYPSGVIPDMRGWTIKGKP-----ASG 148 G P+PWP T P+G+ GQ+FDK+ YPKL AYPSG +PD+RG I+G SG Sbjct: 425 GIPLPWPQATAPTGWLKCNGQSFDKTLYPKLTAAYPSGTLPDLRGEFIRGWDDGRGVDSG 484 Query: 149 RAVLS-QEQDGIKSHTHSASASSTDLGTETTSSFD 182 RAVLS Q+ I+ + S +A++T S+F+ Sbjct: 485 RAVLSVQDATWIQPNIESNTAATTIRIDNVDSTFN 519 >UniRef50_C6CGA0 Tail Collar domain protein n=5 Tax=Enterobacteriaceae RepID=C6CGA0_DICZE Length = 166 Score = 78.2 bits (191), Expect = 3e-13, Method: Compositional matrix adjust. Identities = 41/106 (38%), Positives = 62/106 (58%), Gaps = 5/106 (4%) Query: 93 VGAPIPWPSDTVPSGYALMQGQTFDKSAYPKLAVAYPSGVIPDMRGWTIKGKP-----AS 147 VG P+PWP T P+G+ GQ FDK+A+PKLA YPSGV+PD+RG I+G S Sbjct: 23 VGIPLPWPQATPPAGWLKCNGQAFDKNAFPKLAQVYPSGVLPDLRGEFIRGWDDGRGVDS 82 Query: 148 GRAVLSQEQDGIKSHTHSASASSTDLGTETTSSFDYGTKSTNNTGA 193 R +LS + D I++ T S + + +D G++++ + G+ Sbjct: 83 NRNLLSSQGDAIRNITGFVSGVYVGFDGYSGAFYDTGSRNSISPGS 128 >UniRef50_Q7NAA0 Complete genome; segment 1/17 n=2 Tax=Photorhabdus RepID=Q7NAA0_PHOLL Length = 351 Score = 77.4 bits (189), Expect = 6e-13, Method: Compositional matrix adjust. 
Identities = 41/83 (49%), Positives = 51/83 (61%), Gaps = 9/83 (10%) Query: 93 VGAPIPWPSDTVPSGYALMQGQTFDKSAYPKLAVAYPSGVIPDMRGWTIKG-----KPAS 147 VG P+PW T P+GY + GQ FDKS YPKL AYPSG +PD+RG I+G S Sbjct: 201 VGIPLPWSKPTAPAGYLICSGQQFDKSMYPKLGEAYPSGALPDLRGEFIRGWDNGRSIDS 260 Query: 148 GRAVLSQEQDGIK---SHTHSAS 167 GR +LS Q+ K +TH+AS Sbjct: 261 GREILSH-QNSTKLPNLYTHAAS 282 >UniRef50_A9R3H4 Tail collar domain protein n=20 Tax=Yersinia RepID=A9R3H4_YERPG Length = 259 Score = 77.0 bits (188), Expect = 7e-13, Method: Compositional matrix adjust. Identities = 35/74 (47%), Positives = 48/74 (64%), Gaps = 5/74 (6%) Query: 93 VGAPIPWPSDTVPSGYALMQGQTFDKSAYPKLAVAYPSGVIPDMRGWTIKG-----KPAS 147 VG P+PWP+ T P G+ G FDK YPKLA+AYPSG++PD+RG I+G + Sbjct: 106 VGVPLPWPTATPPEGWLKCNGAIFDKVKYPKLALAYPSGILPDLRGEFIRGWDDGLGVDA 165 Query: 148 GRAVLSQEQDGIKS 161 GR +LS + D I++ Sbjct: 166 GREILSIQGDAIRN 179 >UniRef50_C6C5D2 Tail Collar domain protein n=2 Tax=Dickeya dadantii RepID=C6C5D2_DICDC Length = 498 Score = 77.0 bits (188), Expect = 8e-13, Method: Compositional matrix adjust. Identities = 47/133 (35%), Positives = 65/133 (48%), Gaps = 21/133 (15%) Query: 93 VGAPIPWPSDTVPSGYALMQGQTFDKSAYPKLAVAYPSGVIPDMRGWTIKG-----KPAS 147 VG P+PWP T P+G+ GQ+FDK+ YPKLA YPSGV+PD+RG I+G + Sbjct: 335 VGIPLPWPQATAPTGWLKCNGQSFDKALYPKLATVYPSGVLPDLRGEFIRGWDDGRGVDA 394 Query: 148 GRAVLSQEQDGIKSHTHSASASSTDLGTETTSSFDYGTKSTNNTGAHTHSISGTANSAGA 207 GRA+L+ + + T L T DY +N G + A++A Sbjct: 395 GRAILTAQ-------------NPTYL---RTGMMDYNGSDVDNIGVYIGMGYAEADTAAK 438 Query: 208 HQHKSSGAFGGTN 220 +GAF N Sbjct: 439 SISAPAGAFRAPN 451 >UniRef50_Q7N348 Similarities with tail fiber protein n=1 Tax=Photorhabdus luminescens subsp. laumondii RepID=Q7N348_PHOLL Length = 440 Score = 76.6 bits (187), Expect = 1e-12, Method: Compositional matrix adjust. 
Identities = 39/84 (46%), Positives = 47/84 (55%), Gaps = 5/84 (5%) Query: 92 PVGAPIPWPSDTVPSGYALMQGQTFDKSAYPKLAVAYPSGVIPDMRGWTIKGKPAS---- 147 P G P+P+P P GY GQTFDKS YPKLA AYP+G +PD+RG I+G S Sbjct: 297 PAGVPMPYPHRYTPPGYLTCNGQTFDKSLYPKLAEAYPAGRVPDLRGEFIRGWDDSRGVD 356 Query: 148 -GRAVLSQEQDGIKSHTHSASASS 170 GR + + D I H H AS Sbjct: 357 PGRVCGTWQADCIPDHNHYKVASK 380 >UniRef50_C4S5W0 Putative uncharacterized protein n=2 Tax=Yersinia bercovieri ATCC 43970 RepID=C4S5W0_YERBE Length = 388 Score = 76.6 bits (187), Expect = 1e-12, Method: Compositional matrix adjust. Identities = 38/81 (46%), Positives = 50/81 (61%), Gaps = 5/81 (6%) Query: 88 AEFYPVGAPIPWPSDTVPSGYALMQGQTFDKSAYPKLAVAYPSGVIPDMRGWTIKG---- 143 AE +G PIP+P +VP GY G F YPKLA+ YPSGV+PDMRG I+G Sbjct: 238 AERELIGIPIPYPLPSVPVGYLKCNGAAFSTVTYPKLALKYPSGVLPDMRGNAIRGWDDG 297 Query: 144 -KPASGRAVLSQEQDGIKSHT 163 +GRA+LSQ+ D +++ T Sbjct: 298 RGVDAGRALLSQQLDALQNIT 318 >UniRef50_Q6D3Y6 Probable bacteriophage variable tail fiber protein H n=2 Tax=Pectobacterium atrosepticum RepID=Q6D3Y6_ERWCT Length = 536 Score = 75.9 bits (185), Expect = 2e-12, Method: Compositional matrix adjust. Identities = 43/94 (45%), Positives = 55/94 (58%), Gaps = 7/94 (7%) Query: 94 GAPIPWPSDTVPSGYALMQGQTFDKSAYPKLAVAYPSGVIPDMRGWTIKG-----KPASG 148 G P PWP T P+G+ GQ+FD SA+P LA AYPSGV+PD+RG I+G SG Sbjct: 388 GMPKPWPRATAPAGWLKCNGQSFDISAFPHLAAAYPSGVLPDLRGEFIRGWDDGRGVDSG 447 Query: 149 RAVLSQEQDGIKSHTHSA--SASSTDLGTETTSS 180 R++LS + D I++ SA S ET SS Sbjct: 448 RSLLSAQSDAIRNIVGEIWTSAVSQQFLGETLSS 481 >UniRef50_Q7N5C0 Similarities with DNA inversion product n=5 Tax=Photorhabdus RepID=Q7N5C0_PHOLL Length = 239 Score = 75.1 bits (183), Expect = 3e-12, Method: Compositional matrix adjust. 
Identities = 38/74 (51%), Positives = 44/74 (59%), Gaps = 5/74 (6%) Query: 92 PVGAPIPWPSDTVPSGYALMQGQTFDKSAYPKLAVAYPSGVIPDMRGWTIKG-----KPA 146 PVG+PIPWP PSGY G F +S YPKLA AYP G IPD+RG I+G Sbjct: 102 PVGSPIPWPLSHPPSGYFTCNGSAFSRSQYPKLAEAYPDGRIPDLRGEFIRGWDDGRGVD 161 Query: 147 SGRAVLSQEQDGIK 160 SGR +LS + D K Sbjct: 162 SGRVILSAQTDNTK 175 >UniRef50_C4TTI1 Tail fiber protein n=2 Tax=Yersinia RepID=C4TTI1_YERKR Length = 402 Score = 74.3 bits (181), Expect = 5e-12, Method: Compositional matrix adjust. Identities = 37/78 (47%), Positives = 47/78 (60%), Gaps = 7/78 (8%) Query: 93 VGAPIPWPSDTVPSGYALMQGQTFDKSAYPKLAVAYPSGVIPDMRGWTIKGKPASGRAV- 151 +G PIPWP P+GY G F+K+ YPKLA+AYPSGV+PD+RG I+G GR V Sbjct: 277 IGTPIPWPLTIAPAGYLKCNGAPFNKTQYPKLALAYPSGVLPDLRGEFIRGF-DDGRGVR 335 Query: 152 -----LSQEQDGIKSHTH 164 L + I+SH H Sbjct: 336 PNQPLLGWQGSEIQSHNH 353 >UniRef50_C6CP84 Tail Collar domain protein n=2 Tax=Dickeya RepID=C6CP84_DICZE Length = 646 Score = 73.9 bits (180), Expect = 8e-12, Method: Compositional matrix adjust. Identities = 34/77 (44%), Positives = 47/77 (61%), Gaps = 5/77 (6%) Query: 94 GAPIPWPSDTVPSGYALMQGQTFDKSAYPKLAVAYPSGVIPDMRGWTIKGKP-----ASG 148 G P+PWP T P+G+ GQ+FDK YP+LA YPSGV+PD+RG I+G + Sbjct: 503 GIPLPWPQATAPTGWLKCNGQSFDKKLYPRLAQVYPSGVLPDLRGEFIRGWDDGRGVDNN 562 Query: 149 RAVLSQEQDGIKSHTHS 165 R +LS + D I++ S Sbjct: 563 RGLLSSQGDTIRNIVAS 579 >UniRef50_C7BSQ6 Putative tail fiber protein n=1 Tax=Photorhabdus asymbiotica subsp. asymbiotica ATCC 43949 RepID=C7BSQ6_PHOAA Length = 318 Score = 73.6 bits (179), Expect = 8e-12, Method: Compositional matrix adjust. 
Identities = 34/78 (43%), Positives = 49/78 (62%), Gaps = 5/78 (6%) Query: 91 YPVGAPIPWPSDTVPSGYALMQGQTFDKSAYPKLAVAYPSGVIPDMRGWTIKGKPAS--- 147 PVG PIPWP+ P+G+ G FDKS +P+L AY SGV+PD+RG I+G +S Sbjct: 219 IPVGVPIPWPTAIPPTGWLQCNGAAFDKSKFPQLVAAYSSGVLPDLRGEFIRGWDSSRGV 278 Query: 148 --GRAVLSQEQDGIKSHT 163 R++LS + D +++ T Sbjct: 279 DTNRSILSTQIDTMQNIT 296 >UniRef50_C6CGA4 Tail Collar domain protein n=7 Tax=Enterobacteriaceae RepID=C6CGA4_DICZE Length = 401 Score = 73.2 bits (178), Expect = 1e-11, Method: Compositional matrix adjust. Identities = 30/51 (58%), Positives = 38/51 (74%) Query: 93 VGAPIPWPSDTVPSGYALMQGQTFDKSAYPKLAVAYPSGVIPDMRGWTIKG 143 VG P+PWP T P+G+ GQ FDK+A+PKLA AYP GV+PD+RG I+G Sbjct: 248 VGIPLPWPQATPPAGWLKCNGQAFDKNAFPKLAQAYPGGVLPDLRGEFIRG 298 >UniRef50_Q93D60 PilV n=7 Tax=Escherichia coli RepID=Q93D60_ECOLX Length = 456 Score = 72.4 bits (176), Expect = 2e-11, Method: Compositional matrix adjust. Identities = 30/49 (61%), Positives = 36/49 (73%) Query: 89 EFYPVGAPIPWPSDTVPSGYALMQGQTFDKSAYPKLAVAYPSGVIPDMR 137 E YPVG+PIPWPS T P GY +M GQ+F S YP+LA AYP +PD+R Sbjct: 336 ESYPVGSPIPWPSATPPQGYLVMNGQSFSCSRYPQLARAYPGCKLPDLR 384 >UniRef50_C5BH14 Tail fiber n=2 Tax=Enterobacteriaceae RepID=C5BH14_EDWI9 Length = 593 Score = 72.4 bits (176), Expect = 2e-11, Method: Compositional matrix adjust. Identities = 34/77 (44%), Positives = 50/77 (64%), Gaps = 5/77 (6%) Query: 92 PVGAPIPWPSDTVPSGYALMQGQTFDKSAYPKLAVAYPSGVIPDMRGWTIKGKPASG--- 148 PVG P PWP+ ++PSG+ GQ+F S+YP+LA AYP+G +PD+RG I+G G Sbjct: 451 PVGTPQPWPNTSIPSGWIKCAGQSFSTSSYPELAKAYPNGRLPDLRGEFIRGYDDYGGTD 510 Query: 149 --RAVLSQEQDGIKSHT 163 R +LS + D +++ T Sbjct: 511 SQRQILSWQGDAMRNIT 527 >UniRef50_C6DA10 Tail Collar domain protein n=7 Tax=Enterobacteriaceae RepID=C6DA10_PECCP Length = 689 Score = 72.0 bits (175), Expect = 3e-11, Method: Compositional matrix adjust. 
Identities = 44/118 (37%), Positives = 64/118 (54%), Gaps = 12/118 (10%) Query: 84 AHPPAEFYPVGAPIPWPSDTVPSGYALMQGQTFDKSAYPKLAVAYPSGVIPDMRGWTIKG 143 A P +E G P+P+P P+G+ GQ+FDKS YP LA YPSGV+PD+RG ++G Sbjct: 539 AMPASEL--AGIPLPFPGAVAPTGWLKCNGQSFDKSQYPILASRYPSGVLPDLRGEFVRG 596 Query: 144 -----KPASGRAVLSQEQDGIKSHTHSASASSTDLG-TETTSSFDYGTKSTNNTGAHT 195 + RA+LS + D I++ + + + TET FD + TGAH+ Sbjct: 597 WDDGRGADASRALLSAQGDAIRNIVGTIGQLNDRVNTTETAGVFD----ANKYTGAHS 650 >UniRef50_Q9AHZ5 YdaB n=29 Tax=Enterobacteriaceae RepID=Q9AHZ5_PHOLU Length = 296 Score = 71.2 bits (173), Expect = 4e-11, Method: Compositional matrix adjust. Identities = 50/141 (35%), Positives = 65/141 (46%), Gaps = 15/141 (10%) Query: 92 PVGAPIPWPSDTVPSGYALMQGQTFDKSAYPKLAVAYPSGVIPDMRGWTIKG-------K 144 PVG PIPWP+ P G+ G FDKS +P+LA AYPSG +PD+RG I+G Sbjct: 145 PVGTPIPWPTAIPPVGWLQCNGAVFDKSKFPELAKAYPSGYLPDLRGEFIRGWDNGRGVD 204 Query: 145 PASGRAVLSQEQDGIKSHTHSASASSTDLGTETTSSFDYGTKSTNNTGAHTHSISGTANS 204 P GR + + D I++ T S + D T YG N G T GT S Sbjct: 205 P--GRVCSTWQGDAIRNITGSFPGAIADNYHLATKEAFYGKI---NLGIAT---DGTTKS 256 Query: 205 AGAHQHKSSGAFGGTNTSIFP 225 H + FG + + P Sbjct: 257 KNIHNPDNPYGFGFDASRVVP 277 >UniRef50_Q9MCR6 Tail fiber n=1 Tax=Enterobacteria phage HK97 RepID=Q9MCR6_BPHK7 Length = 321 Score = 70.5 bits (171), Expect = 7e-11, Method: Compositional matrix adjust. Identities = 33/78 (42%), Positives = 46/78 (58%), Gaps = 5/78 (6%) Query: 91 YPVGAPIPWPSDTVPSGYALMQGQTFDKSAYPKLAVAYPSGVIPDMRGWTIKGKP----- 145 PVG P+PWPS T P+G+ G F YP+LA AYP+ +PD+RG I+G Sbjct: 167 LPVGVPVPWPSATPPTGWLKCNGAVFSAEEYPELAKAYPTNKLPDLRGEFIRGWDDGRGI 226 Query: 146 ASGRAVLSQEQDGIKSHT 163 +GR +LS + D I++ T Sbjct: 227 DAGREILSAQGDAIRNIT 244 >UniRef50_P33227 Putative protein stfE (Fragment) n=60 Tax=Enterobacteriaceae RepID=STFE_ECOLI Length = 166 Score = 68.9 bits (167), Expect = 2e-10, Method: Compositional matrix adjust. 
Identities = 37/97 (38%), Positives = 55/97 (56%), Gaps = 6/97 (6%) Query: 92 PVGAPIPWPSDTVPSGYALMQGQTFDKSAYPKLAVAYPSGVIPDMRGWTIKGKP-----A 146 PVG P+PWPS T P+G+ G F YP+LA AYP+ +PD+RG I+G Sbjct: 10 PVGVPVPWPSATPPTGWLKCNGAAFSAEEYPELAKAYPTNKLPDLRGEFIRGWDDGRGID 69 Query: 147 SGRAVLSQEQDGIKSHTHSASASSTDLGTETTSSFDY 183 +GR++LS + + H H + ST + T+ T +F + Sbjct: 70 TGRSILSIQGYATEDHAHGLPSRST-IVTDATINFYF 105 >UniRef50_D2BT14 Tail Collar domain protein n=1 Tax=Dickeya dadantii Ech586 RepID=D2BT14_DICD5 Length = 534 Score = 68.6 bits (166), Expect = 3e-10, Method: Compositional matrix adjust. Identities = 33/77 (42%), Positives = 48/77 (62%), Gaps = 5/77 (6%) Query: 93 VGAPIPWPSDTVPSGYALMQGQTFDKSAYPKLAVAYPSGVIPDMRGWTIKGKPAS----- 147 VG P+P+P T P G+ GQ+F+K+A+P LA YPSG +PD+RG I+G S Sbjct: 389 VGIPLPYPGATAPDGWLKCNGQSFNKAAFPLLAQRYPSGFLPDLRGEFIRGWDDSRGVDP 448 Query: 148 GRAVLSQEQDGIKSHTH 164 GR +LS ++ +H+H Sbjct: 449 GRGLLSFQESQNLTHSH 465 >UniRef50_B3I8J5 DNA inversion product n=3 Tax=Enterobacteriaceae RepID=B3I8J5_ECOLX Length = 263 Score = 68.6 bits (166), Expect = 3e-10, Method: Compositional matrix adjust. Identities = 32/67 (47%), Positives = 43/67 (64%), Gaps = 5/67 (7%) Query: 92 PVGAPIPWPSDTVPSGYALMQGQTFDKSAYPKLAVAYPSGVIPDMRGWTIKGKPAS---- 147 PVGAP+PWPS+T P+G+ G F YP+LA AYP+ +PD+RG I+G S Sbjct: 104 PVGAPVPWPSETPPTGWLKCNGAAFSAEEYPELAKAYPTNKLPDLRGEFIRGWDDSRGID 163 Query: 148 -GRAVLS 153 GR++LS Sbjct: 164 TGRSLLS 170 >UniRef50_UPI000190EC42 bacteriophage tail fiber protein n=1 Tax=Salmonella enterica subsp. enterica serovar Typhi str. E01-6750 RepID=UPI000190EC42 Length = 317 Score = 68.6 bits (166), Expect = 3e-10, Method: Compositional matrix adjust. 
Identities = 44/126 (34%), Positives = 68/126 (53%), Gaps = 9/126 (7%) Query: 92 PVGAPIPWPSDTVPSGYALMQGQTFDKSAYPKLAVAYPSGVIPDMRGWTIKGKP-----A 146 PVG P+PWPS T+P G+ G F YPKLA AYP+ +PD+RG I+G Sbjct: 172 PVGVPVPWPSATLPEGWLKCNGAAFSSEMYPKLAKAYPTNKLPDLRGEFIRGWDDGRGID 231 Query: 147 SGRAVLSQEQDGIKSHTHSASASSTDLGTETTSSFDYG-TKSTNNTGA-HTHSISGTANS 204 +GR +LS ++ I S + D+ + +++ + +G T S+N GA + A+S Sbjct: 232 AGREILSFQEGTIVSGFD--DNDTGDISSLSSTQYGFGDTLSSNQWGAINGKKWIFDASS 289 Query: 205 AGAHQH 210 GA ++ Sbjct: 290 KGAQKY 295 >UniRef50_C6V0Q3 Phage tail fiber protein n=13 Tax=Enterobacteriaceae RepID=C6V0Q3_ECO5T Length = 439 Score = 68.2 bits (165), Expect = 4e-10, Method: Compositional matrix adjust. Identities = 35/96 (36%), Positives = 50/96 (52%), Gaps = 5/96 (5%) Query: 91 YPVGAPIPWPSDTVPSGYALMQGQTFDKSAYPKLAVAYPSGVIPDMRGWTIKGKP----- 145 PVG P+PWPS T P+G+ G F YP+LA YP+ +PD+RG I+G Sbjct: 284 LPVGVPVPWPSATPPTGWLKCNGAAFSAEEYPELAKVYPTNKLPDLRGEFIRGWDDGRGV 343 Query: 146 ASGRAVLSQEQDGIKSHTHSASASSTDLGTETTSSF 181 +GR +L+ + I SH H ++ +T SF Sbjct: 344 DNGRGLLTLQDGAIVSHNHYWGIWTSRTNDQTLESF 379 >UniRef50_B7MJL6 Putative phage tail protein n=3 Tax=Escherichia RepID=B7MJL6_ECO45 Length = 247 Score = 67.8 bits (164), Expect = 5e-10, Method: Compositional matrix adjust. Identities = 45/122 (36%), Positives = 63/122 (51%), Gaps = 10/122 (8%) Query: 91 YPVGAPIPWPSDTVPSGYALMQGQTFDKSAYPKLAVAYPSGVIPDMRGWTIKG-----KP 145 PVG P+PWPS T P+G+ G F YP+LA AYP+ +PD+RG I+G Sbjct: 105 LPVGVPVPWPSATPPTGWLKCNGAAFSAEEYPELAKAYPTNKLPDLRGEFIRGWDDGRGV 164 Query: 146 ASGRAVLSQEQDGIKSHTHSASASSTDL---GTETTSSFDYGTKSTNNT--GAHTHSISG 200 S RAVLS ++ + + + S L G + T S G+ S+N T + S+SG Sbjct: 165 DSRRAVLSTQEPTVGTFYVELAIISGTLSGSGAKFTDSVGIGSTSSNITVSNGNDQSVSG 224 Query: 201 TA 202 T Sbjct: 225 TV 226 >UniRef50_B7NJP1 Putative side tail fiber protein homolog from lambdoid prophage n=3 Tax=Escherichia coli RepID=B7NJP1_ECO7I Length = 686 Score = 67.4 bits (163), Expect = 6e-10, Method: Compositional matrix adjust. 
Identities = 39/99 (39%), Positives = 54/99 (54%), Gaps = 7/99 (7%) Query: 92 PVGAPIPWPSDTVPSGYALMQGQTFDKSAYPKLAVAYPSGVIPDMRGWTIKG-----KPA 146 PVG P+PWP+ T P G+ G+ F K YP LA AYP+ +PD+RG I+G K Sbjct: 537 PVGVPVPWPTATPPEGWLKCDGRAFTKEQYPVLARAYPTLRLPDLRGEFIRGWDDGRKID 596 Query: 147 SGRAVLSQEQDGIKSHTHSASASSTDLGTETT-SSFDYG 184 GR +LS Q G H + S+ D+ + ++ DYG Sbjct: 597 EGRKLLSW-QKGTLVGGHDDNDSALDISYMSNGNNIDYG 634 >UniRef50_B3I9S3 DNA inversion product n=5 Tax=Escherichia coli RepID=B3I9S3_ECOLX Length = 546 Score = 67.4 bits (163), Expect = 6e-10, Method: Compositional matrix adjust. Identities = 33/79 (41%), Positives = 45/79 (56%), Gaps = 5/79 (6%) Query: 91 YPVGAPIPWPSDTVPSGYALMQGQTFDKSAYPKLAVAYPSGVIPDMRGWTIKGKP----- 145 PVG P+PWPS T P+G+ G F YP+LA AYP+ +PD+RG I+G Sbjct: 386 LPVGVPVPWPSATPPTGWLKCNGAAFSVEEYPELAKAYPTNKLPDLRGEFIRGWDDGRGI 445 Query: 146 ASGRAVLSQEQDGIKSHTH 164 +GRA+L+ + I H H Sbjct: 446 DTGRALLNWQPHTILDHAH 464 >UniRef50_D0KLI8 Tail Collar domain protein n=1 Tax=Pectobacterium wasabiae WPP163 RepID=D0KLI8_PECWW Length = 621 Score = 67.0 bits (162), Expect = 8e-10, Method: Compositional matrix adjust. Identities = 38/96 (39%), Positives = 53/96 (55%), Gaps = 7/96 (7%) Query: 84 AHPPAEFYPVGAPIPWPSDTVPSGYALMQGQTFDKSAYPKLAVAYPSGVIPDMRGWTIKG 143 A P AE VG P +P P+G+ GQ FD + YP LA YPSG +PD+RG ++G Sbjct: 463 AMPSAEL--VGMPQVFPGAVAPAGWLKCNGQQFDTAQYPILASRYPSGFLPDLRGEFVRG 520 Query: 144 KP-----ASGRAVLSQEQDGIKSHTHSASASSTDLG 174 +GRA+LS++ D I++ T + AS G Sbjct: 521 WDDERGVDAGRALLSEQGDAIRNITGTMRASDVPYG 556 >UniRef50_A3GUE7 Tail fiber protein H, putative (Fragment) n=1 Tax=Vibrio cholerae NCTC 8457 RepID=A3GUE7_VIBCH Length = 250 Score = 66.6 bits (161), Expect = 1e-09, Method: Compositional matrix adjust. 
Identities = 24/51 (47%), Positives = 35/51 (68%) Query: 88 AEFYPVGAPIPWPSDTVPSGYALMQGQTFDKSAYPKLAVAYPSGVIPDMRG 138 + +PVG IPW +D P G+ + +GQ FD + Y +LA +P+G+IPDMRG Sbjct: 200 VKIFPVGGAIPWFTDVAPEGFGMFKGQAFDVNVYTELAKVFPNGIIPDMRG 250 >UniRef50_C4UEH4 Variable tail fiber protein n=3 Tax=Yersinia RepID=C4UEH4_YERAL Length = 387 Score = 66.6 bits (161), Expect = 1e-09, Method: Compositional matrix adjust. Identities = 51/180 (28%), Positives = 78/180 (43%), Gaps = 14/180 (7%) Query: 9 NTQGAAGLELYEVYNNGYPTAYGN--IIHLKGMTAVGEGELLIGWSGTSGAHAPAFIRSR 66 NT A G+ Y P +G+ I H++ + + T+ H A I + Sbjct: 160 NTLAATGMYSVNQYAANIPEGFGDATIQHIQNDSLTAHQFIF----STNNTHTAAKI-AY 214 Query: 67 RDTTDANWSPWAQLYTSAHPPAEFYPVGAPIPWPSDTVPSGYALMQGQTFDKSAYPKLAV 126 R + W W + TS P+G P+P+P T P+GY G F YP LA Sbjct: 215 RLRSYGQWREWIDIVTSRSD--TLTPIGIPLPYPGTTPPAGYLKCNGAAFYPYRYPTLAT 272 Query: 127 AYPSGVIPDMRGWTIKGKP-----ASGRAVLSQEQDGIKSHTHSASASSTDLGTETTSSF 181 YP+ +PD+RG I+G + R +LS + D +++ T + S LG S+F Sbjct: 273 LYPTHKLPDLRGEFIRGFDDGRGIDTSRTLLSAQTDALQNITGGINGVSESLGIAAESNF 332 >UniRef50_A7FIU0 Tail collar domain protein n=1 Tax=Yersinia pseudotuberculosis IP 31758 RepID=A7FIU0_YERP3 Length = 402 Score = 65.9 bits (159), Expect = 2e-09, Method: Compositional matrix adjust. Identities = 36/88 (40%), Positives = 50/88 (56%), Gaps = 11/88 (12%) Query: 74 WSPWAQLYTSAHPPAEFYPVGAPIPWPSDTVPSGYALMQGQTFDKSAYPKLAVAYPSGVI 133 WS W Q+ P VG P+PWP+ PSG+ G TF+K+ +P+LA Y GV+ Sbjct: 247 WSDWIQIGNDVAP------VGIPMPWPAHIPPSGWLKCNGATFNKAQFPQLASVYTRGVL 300 Query: 134 PDMRGWTIK----GKPAS-GRAVLSQEQ 156 PD+RG I+ GK A GR +LS ++ Sbjct: 301 PDLRGEFIRGWDDGKLADPGRGLLSFQE 328 >UniRef50_D2BYH6 Tail Collar domain protein n=1 Tax=Dickeya dadantii Ech586 RepID=D2BYH6_DICD5 Length = 198 Score = 64.7 bits (156), Expect = 4e-09, Method: Compositional matrix adjust. 
Identities = 32/83 (38%), Positives = 44/83 (53%), Gaps = 5/83 (6%) Query: 93 VGAPIPWPSDTVPSGYALMQGQTFDKSAYPKLAVAYPSGVIPDMRGWTIKGKP-----AS 147 VG P WP P G+ GQ FDK+ YP+LA YP+G +PD+RG I+G + Sbjct: 59 VGIPQAWPLADAPEGWLKCNGQAFDKTKYPQLAKLYPAGTLPDLRGEFIRGWDDGRGVDT 118 Query: 148 GRAVLSQEQDGIKSHTHSASASS 170 R +LS + ++SH H S Sbjct: 119 NRQILSAQSGMLESHNHMMPVSD 141 >UniRef50_UPI0001B5347E putative variable tail fiber protein n=1 Tax=Shigella sp. D9 RepID=UPI0001B5347E Length = 550 Score = 64.3 bits (155), Expect = 6e-09, Method: Compositional matrix adjust. Identities = 31/68 (45%), Positives = 40/68 (58%), Gaps = 5/68 (7%) Query: 91 YPVGAPIPWPSDTVPSGYALMQGQTFDKSAYPKLAVAYPSGVIPDMRGWTIKGKPAS--- 147 PVG P+PWPS T P+G+ G F YPKLA YP+ +PD+RG I+G S Sbjct: 390 LPVGVPVPWPSATPPTGWLKCNGAAFSAEEYPKLAKVYPTNKLPDLRGEFIRGWDDSRGI 449 Query: 148 --GRAVLS 153 GR++LS Sbjct: 450 DTGRSLLS 457 >UniRef50_Q65WH4 Putative uncharacterized protein n=1 Tax=Mannheimia succiniciproducens MBEL55E RepID=Q65WH4_MANSM Length = 296 Score = 64.3 bits (155), Expect = 6e-09, Method: Compositional matrix adjust. Identities = 31/77 (40%), Positives = 42/77 (54%), Gaps = 5/77 (6%) Query: 93 VGAPIPWPSDTVPSGYALMQGQTFDKSAYPKLAVAYPSGVIPDMRGWTIKG-----KPAS 147 +G P P+P VP G GQTF + YP+LA YPSG +PD+RG I+G S Sbjct: 140 IGIPFPYPLSAVPDGCLAFNGQTFSTTTYPELAKKYPSGRLPDLRGEFIRGWDNGRGVDS 199 Query: 148 GRAVLSQEQDGIKSHTH 164 R +L + + +HTH Sbjct: 200 SRELLRSQGAELSAHTH 216 >UniRef50_D0FSD9 Phage related-protein n=2 Tax=Erwinia pyrifoliae RepID=D0FSD9_ERWPY Length = 311 Score = 62.8 bits (151), Expect = 1e-08, Method: Compositional matrix adjust. 
Identities = 70/248 (28%), Positives = 101/248 (40%), Gaps = 59/248 (23%) Query: 75 SPWAQLYTSAHPPAEFYPVGAPIPWPSDTVPSGYALMQGQTFD---KSAYPKLAVAYPSG 131 S W + +P YP+G + P+ L G T+ ++ +LA A S Sbjct: 113 SGWVEFKADVNPVDMLYPIGIVTWFAQKKDPN--KLFPGTTWKYIGENRTIRLASANGSD 170 Query: 132 VIPDMRGWTIKGKPASGRAVLSQEQDGIKSHTHSASASSTDLGTETTSSFDYGTKSTNNT 191 V+ T G + AV + G HT SA+ S D GT+ TS+FDYG K T+ Sbjct: 171 VM------TTGGSDSVTLAVGNIPAHG---HTFSANTGSFDYGTKGTSTFDYGNKVTDTQ 221 Query: 192 GAHTHSISGTANSAGAHQHKSSGAFGGTNTSIFPNGYTAISNLSAGIMSTTSGSGQTRNA 251 G+HTHS + + P G + + GI TT T Sbjct: 222 GSHTHSYN----------------------EVIPRGASGMD--IGGIWETTIRGSDT--- 254 Query: 252 GKTSSDGAHTHSLSGTAASAGAHAHTVGIGAHTHSVAIGSHGHTITVNAAGNAENTVKNI 311 ++AGAHAH V IGAH H+V IG+H H+++ A T N+ Sbjct: 255 -----------------STAGAHAHNVAIGAHGHTVEIGAHSHSVSGTTANTGAGTAINV 297 Query: 312 AFNYIVRL 319 N ++L Sbjct: 298 T-NAFIKL 304 >UniRef50_C6CG98 Tail Collar domain protein n=1 Tax=Dickeya zeae Ech1591 RepID=C6CG98_DICZE Length = 196 Score = 62.8 bits (151), Expect = 2e-08, Method: Compositional matrix adjust. Identities = 25/51 (49%), Positives = 33/51 (64%) Query: 93 VGAPIPWPSDTVPSGYALMQGQTFDKSAYPKLAVAYPSGVIPDMRGWTIKG 143 +G P PWP P G+ GQTFD + YP+LA YP+G +PD+RG I+G Sbjct: 59 IGIPQPWPLAEAPEGWLKCNGQTFDTAKYPQLAKLYPAGTLPDLRGEFIRG 109 >UniRef50_Q6D2U8 Bacteriophage tail fiber protein n=1 Tax=Pectobacterium atrosepticum RepID=Q6D2U8_ERWCT Length = 619 Score = 62.4 bits (150), Expect = 2e-08, Method: Compositional matrix adjust. 
Identities = 31/83 (37%), Positives = 47/83 (56%), Gaps = 7/83 (8%) Query: 84 AHPPAEFYPVGAPIPWPSDTVPSGYALMQGQTFDKSAYPKLAVAYPSGVIPDMRGWTIKG 143 A P +E G P+P+P P+GY GQ FD + +P LA YPSG +PD+RG ++G Sbjct: 457 ALPTSEL--AGIPLPFPGAVAPAGYLKCNGQQFDTAQFPVLASRYPSGFLPDLRGEFVRG 514 Query: 144 KP-----ASGRAVLSQEQDGIKS 161 + RA++S + D I++ Sbjct: 515 WDDGRGIDTVRALMSAQGDAIRN 537 >UniRef50_C6C6Z0 Tail Collar domain protein n=1 Tax=Dickeya dadantii Ech703 RepID=C6C6Z0_DICDC Length = 183 Score = 62.0 bits (149), Expect = 3e-08, Method: Compositional matrix adjust. Identities = 25/51 (49%), Positives = 32/51 (62%) Query: 93 VGAPIPWPSDTVPSGYALMQGQTFDKSAYPKLAVAYPSGVIPDMRGWTIKG 143 +G P PWP P G+ GQ FD + YP+LA YPSG +PD+RG I+G Sbjct: 46 IGIPQPWPLADAPEGWLKCNGQAFDTAKYPELAKCYPSGTLPDLRGEFIRG 96 >UniRef50_B7US81 Predicted tail fiber protein n=16 Tax=root RepID=B7US81_ECO27 Length = 521 Score = 62.0 bits (149), Expect = 3e-08, Method: Compositional matrix adjust. Identities = 32/73 (43%), Positives = 42/73 (57%), Gaps = 6/73 (8%) Query: 91 YPVGAPIPWPSDTVPSGYALMQGQTFDKSAYPKLAVAYPSGVIPDMRGWTIKG-----KP 145 PVG P+PW S T P+G+ G F YP+LA AYP+ +PD+RG I+G Sbjct: 386 LPVGVPVPWSSATPPTGWLKCNGAAFSSEMYPRLARAYPTNKLPDLRGEFIRGWDDGRGI 445 Query: 146 ASGRAVLSQEQDG 158 +GR +LS QDG Sbjct: 446 DAGRTLLSG-QDG 457 >UniRef50_UPI0001A44C27 bacteriophage tail fiber protein n=2 Tax=Pectobacterium carotovorum subsp. carotovorum WPP14 RepID=UPI0001A44C27 Length = 195 Score = 61.2 bits (147), Expect = 5e-08, Method: Compositional matrix adjust. 
Identities = 33/78 (42%), Positives = 41/78 (52%), Gaps = 5/78 (6%) Query: 93 VGAPIPWPSDTVPSGYALMQGQTFDKSAYPKLAVAYPSGVIPDMRGWTIKGKP-----AS 147 VG P P P T P G+ GQ+FD S YP LA YP G +PD+RG I+G + Sbjct: 88 VGIPQPCPLVTAPEGWLACAGQSFDTSRYPVLASRYPQGRLPDLRGEFIRGWDNGRGVDT 147 Query: 148 GRAVLSQEQDGIKSHTHS 165 GR LS + + HTH Sbjct: 148 GRGNLSSQSFSTEPHTHD 165 >UniRef50_C5J9F2 Putative uncharacterized protein (Fragment) n=1 Tax=Erwinia phage phiAT1 RepID=C5J9F2_9VIRU Length = 240 Score = 56.2 bits (134), Expect = 1e-06, Method: Compositional matrix adjust. Identities = 31/83 (37%), Positives = 42/83 (50%), Gaps = 8/83 (9%) Query: 89 EFYPVGAPIPWPSDTVPSGYALMQGQTFDKSAYPKLAVAYPSGVIPDMRGWTIKG----- 143 P+GA IPWP TVP G+ GQ F+ PKL V+PD RG ++G Sbjct: 151 RLVPIGAVIPWPGATVPDGWLECSGQVFNTGQNPKLYSVLGRNVVPDYRGLFLRGWAHGS 210 Query: 144 ---KPASGRAVLSQEQDGIKSHT 163 P +GRA+ S + D I++ T Sbjct: 211 DANDPDAGRALGSVQGDAIRNIT 233 >UniRef50_D0KGE3 Tail Collar domain protein n=1 Tax=Pectobacterium wasabiae WPP163 RepID=D0KGE3_PECWW Length = 144 Score = 55.8 bits (133), Expect = 2e-06, Method: Compositional matrix adjust. Identities = 30/78 (38%), Positives = 41/78 (52%), Gaps = 7/78 (8%) Query: 99 WPSDTVPSGYALMQGQTFDKSAYPKLAVAYPSGVIPDMRGWTIKG-------KPASGRAV 151 W + P G+ + GQ F+ S P LA YPS +PD RG+ +G P S R+V Sbjct: 2 WGTPVPPEGWLELNGQLFNPSGNPVLADLYPSSRVPDFRGYFPRGWDNGAGIDPDSSRSV 61 Query: 152 LSQEQDGIKSHTHSASAS 169 LS + D I SH H+ + S Sbjct: 62 LSYQDDEIISHKHAITMS 79 >UniRef50_Q7N047 Similarities with bacteriophage protein n=3 Tax=Photorhabdus RepID=Q7N047_PHOLL Length = 602 Score = 53.5 bits (127), Expect = 1e-05, Method: Compositional matrix adjust. 
Identities = 33/101 (32%), Positives = 53/101 (52%), Gaps = 6/101 (5%) Query: 92 PVGAPIPWPSDT-VPSGYALMQGQTFDKSAYPKLAVAYPSGVIPDMRGWTIKGKPA---- 146 P+GA I W S +P+GY +G+ F + YP+LA +P +PD RG +G Sbjct: 459 PIGATIEWHSTAPIPAGYEPNEGRAFRAADYPELAKIFPDLKLPDDRGLFKRGLDRGRGL 518 Query: 147 -SGRAVLSQEQDGIKSHTHSASASSTDLGTETTSSFDYGTK 186 SGR++ S + D I++ T S + + G+ + +F Y K Sbjct: 519 DSGRSLGSVQGDAIRNITGSLGKPTIESGSNASGAFSYQYK 559 >UniRef50_B8DPV9 Tail Collar domain protein n=1 Tax=Desulfovibrio vulgaris str. 'Miyazaki F' RepID=B8DPV9_DESVM Length = 530 Score = 53.5 bits (127), Expect = 1e-05, Method: Compositional matrix adjust. Identities = 32/90 (35%), Positives = 49/90 (54%), Gaps = 13/90 (14%) Query: 88 AEFYPVGAPIPWPSDTVPSGYALMQGQTFDKSAYPKLAVAYPSG-------VIPDMRGWT 140 A F P+GA + +P +TVP+G+ + GQ ++AYP L V Y +G +PD+RG Sbjct: 209 AAFVPIGAILDFPVNTVPTGFLVCAGQVVTRTAYPDL-VTYLTGGTVAVNATLPDLRGEF 267 Query: 141 IKGKPA-----SGRAVLSQEQDGIKSHTHS 165 +G +GR V S + D I++ T S Sbjct: 268 RRGADLGRGVDAGRVVGSAQGDAIRNITGS 297 >UniRef50_Q7N6T1 Similarities with DNA inversion product n=2 Tax=Photorhabdus luminescens subsp. laumondii RepID=Q7N6T1_PHOLL Length = 300 Score = 52.0 bits (123), Expect = 3e-05, Method: Compositional matrix adjust. Identities = 35/77 (45%), Positives = 44/77 (57%), Gaps = 5/77 (6%) Query: 92 PVGAPIPWPSDTVPSGYALMQGQTFDKSAYPKLAVAYPSGVIPDMRGWTIKG-----KPA 146 PVG+PIPWP P GY G F+K YPKLA AYP G +PD+RG I+G Sbjct: 153 PVGSPIPWPLPYPPVGYLTCNGSAFNKLQYPKLAEAYPDGRLPDLRGEFIRGWDDGRGVD 212 Query: 147 SGRAVLSQEQDGIKSHT 163 GR +LS + D ++ T Sbjct: 213 MGRTMLSWQGDAMQRMT 229 >UniRef50_B5JYG6 Phage Tail Collar Domain family (Fragment) n=1 Tax=gamma proteobacterium HTCC5015 RepID=B5JYG6_9GAMM Length = 400 Score = 51.6 bits (122), Expect = 4e-05, Method: Compositional matrix adjust. 
Identities = 62/226 (27%), Positives = 91/226 (40%), Gaps = 60/226 (26%) Query: 103 TVPSGYALMQGQTFDKSAYPKLAVAY----------PSGVIPDMRGWTIKGKPASGRAVL 152 T P G+ G ++ YP L A + +PD+R +G + R+V Sbjct: 223 TPPGGWLFCDGSEVSRTQYPALFTAIGTLWGDGDGSTTFNLPDLRNDFRRGC-SDTRSVG 281 Query: 153 SQEQDGIKSHTHSASASSTDLGTETTSSFDYGTKSTNNTGAHTHSISGTANSAGAHQHKS 212 E D IKSH+HSAS+ ++GAHTH G ++ +GAH+H+S Sbjct: 282 DSESDQIKSHSHSASSE--------------------DSGAHTHG--GRSSDSGAHKHRS 319 Query: 213 SGAFGGTNTSIFPNGYTAISNLSAGIMSTTSGSGQTRNAGKTSSDGAHTHSLSGTAASAG 272 +G +N S P G TSGSG R +G + D ++ +A Sbjct: 320 --GWGESNRSDAPFG-------------ATSGSGH-RGSGDSDWDNYLYYT-----DTAQ 358 Query: 273 AHAHTVGIGAHTHSVAIGSHGHTITVNAAGNAENTVKNIAFNYIVR 318 H H + I GSH H I + G E +N I+R Sbjct: 359 PHFHWLIINQ------AGSHSHPINIEPTGGDETRPRNKVLMPIIR 398 >UniRef50_C5H7L3 Putative tail fiber protein n=1 Tax=Enterobacteria phage WV8 RepID=C5H7L3_9CAUD Length = 848 Score = 51.2 bits (121), Expect = 4e-05, Method: Compositional matrix adjust. Identities = 66/212 (31%), Positives = 94/212 (44%), Gaps = 52/212 (24%) Query: 89 EFYPVGAPIPWPSDTVPSGYALMQGQTFDKSAYPKLAVAYPSGVIPDMRGWTIKGKPASG 148 + YPVG + S+ P+ +A P L Y + + G TI+ A+G Sbjct: 647 KIYPVGIVTWFNSNVNPN------------TALPGLTWTYLNNGV----GRTIRIAAANG 690 Query: 149 RAV--------LSQEQDGIKSHTHSASASSTDLGTETTSSFDYGTKSTNNTGAHTHSISG 200 V ++ + SHTHS SA TTSSFDYGTK+++ TG H H+ G Sbjct: 691 SDVATTGGSDSVTLSVGNLPSHTHSFSA--------TTSSFDYGTKTSSTTGNHNHN-RG 741 Query: 201 TANSAGAHQHKSSGAFGGTNTSIFPNGYTAISNLSAGIMSTTSGSGQTRNAGKTSSDGA- 259 T G+ + S A S F YTA +G S +G G ++G Sbjct: 742 TMEITGSFGYFRSDA------SSF---YTA-----SGAFYLGSQAGSKGYTGNNFTNGIP 787 Query: 260 ----HTHSLSGTAASAGAHAHTVGIGAHTHSV 287 + + SG + G H+HTVGIGAH+H+V Sbjct: 788 VNFNASRNWSGVTNTTGNHSHTVGIGAHSHTV 819 >UniRef50_B6VKW8 Sc/svq protein n=2 Tax=Photorhabdus asymbiotica subsp. asymbiotica ATCC 43949 RepID=B6VKW8_PHOAA Length = 316 Score = 51.2 bits (121), Expect = 5e-05, Method: Compositional matrix adjust. 
Identities = 33/70 (47%), Positives = 44/70 (62%), Gaps = 5/70 (7%) Query: 92 PVGAPIPWPSDTVPSGYALMQGQTFDKSAYPKLAVAYPSGVIPDMRGWTIKG-----KPA 146 PVG+PIPWP P GY G F++S YPKLA AYP+G +PD+RG I+G Sbjct: 181 PVGSPIPWPLPHPPFGYVTCNGSAFNRSQYPKLAEAYPNGRLPDLRGEFIRGWDDGRGAD 240 Query: 147 SGRAVLSQEQ 156 +GR +LS ++ Sbjct: 241 NGRKLLSWQE 250 >UniRef50_A9IRI0 Phage related protein n=9 Tax=Bartonella RepID=A9IRI0_BART1 Length = 324 Score = 50.4 bits (119), Expect = 7e-05, Method: Compositional matrix adjust. Identities = 50/194 (25%), Positives = 73/194 (37%), Gaps = 61/194 (31%) Query: 86 PPAEFYPVGAPIPWPSDTVPSGYALMQGQTFDKSAYPKLAVAY---------PSGVIPDM 136 P E +P G + +P+G+ L G + + YP+L A + +PD Sbjct: 156 PKIESFPAGFIATFAMRNIPNGWLLCDGTAYKREDYPQLFKAIGDKWGKNSDTTFKVPDF 215 Query: 137 RGWTIKGKP-----ASGRAVLSQEQDGIKSHTHSASASSTDLGTETTSSFDYGTKSTNNT 191 RG ++G + R ++QD IKSHTH +GT + Sbjct: 216 RGMFLRGFDDGRGLDNDRKFADEQQDSIKSHTH--------IGT------------VEES 255 Query: 192 GAHTHSISGTANSAGAHQHKSSGAFGGTNTSIFPNGYTAISNLSAGIMSTTSGSGQTRNA 251 GAH H+ ++K G G N PN YT + L Sbjct: 256 GAHVHNF----------EYKGVGWPTG-NIGRLPNYYTYNTTLK---------------- 288 Query: 252 GKTSSDGAHTHSLS 265 GKT S GAHTH ++ Sbjct: 289 GKTDSAGAHTHKIT 302 >UniRef50_Q1I687 Putative phage variable tail fibre protein n=1 Tax=Pseudomonas entomophila L48 RepID=Q1I687_PSEE4 Length = 898 Score = 50.1 bits (118), Expect = 1e-04, Method: Compositional matrix adjust. 
Identities = 54/181 (29%), Positives = 77/181 (42%), Gaps = 45/181 (24%) Query: 92 PVGAPIPWPSDTVPSGYALMQGQTFDKSAYPKLAVAYPSGV------------IPDMRGW 139 PVG +P+P TVP+G+ + G T + YP LA AY G +PD RG Sbjct: 385 PVGTMLPFPRGTVPAGFLEVDGSTQSAAVYPDLA-AYLGGAFNTGNEAAGFFRLPDTRGE 443 Query: 140 TIKG-----KPASGRAVLSQEQDGIKSHTHSASASSTDLGTETTSSFDYGTKSTNNTGAH 194 ++G SGRAV S + + K+HTH D+G G + Sbjct: 444 FLRGWDHGRGVDSGRAVGSTQGESFKAHTHK------DVGFIDNVGGGSGASAVTGATGD 497 Query: 195 THSISGTA------NSAGAHQHKSSGAFGGTNTSIFPNGYTAISNLSAGIMSTTSGSGQT 248 SI G A +A A++ + GA GG AI AG++S ++G +T Sbjct: 498 VTSIYGKAYGNSASATAKAYKESAPGALGG-----------AI----AGLISGSTGDSET 542 Query: 249 R 249 R Sbjct: 543 R 543 >UniRef50_D0ZBI4 Putative tail fiber protein n=1 Tax=Edwardsiella tarda EIB202 RepID=D0ZBI4_EDWTE Length = 718 Score = 48.9 bits (115), Expect = 2e-04, Method: Compositional matrix adjust. Identities = 29/85 (34%), Positives = 43/85 (50%), Gaps = 14/85 (16%) Query: 93 VGAPIPWPSDTVPSG--------YALMQGQTFDKSAYPKLAVAYPSGVIP-DMRGWTIKG 143 +G+ IPW + +P + GQ+FD +PKL YP +P DMRG+T +G Sbjct: 566 IGSLIPWALERMPQEIWPNCGMHFIPYMGQSFDPELFPKLHDVYPDNRLPTDMRGYTARG 625 Query: 144 KPAS-----GRAVLSQEQDGIKSHT 163 GRA+LS + D I++ T Sbjct: 626 WDNGRGIDIGRALLSYQDDAIQNIT 650 >UniRef50_B1M1N8 Tail Collar domain protein n=1 Tax=Methylobacterium radiotolerans JCM 2831 RepID=B1M1N8_METRJ Length = 414 Score = 48.1 bits (113), Expect = 4e-04, Method: Compositional matrix adjust. 
Identities = 73/278 (26%), Positives = 117/278 (42%), Gaps = 60/278 (21%) Query: 74 WSPWAQLYTSAHPPAEFYPVGAPIPWPSDTVPSGYALMQGQTFDKSAYPKLAVAYPSGV- 132 + P AQ+Y + P E GA + VPSG+ + G+ ++AY L +G Sbjct: 128 YDPVAQVYRTLSPTTE--QAGAIKAFAGPNVPSGWEICDGRAVSRTAYAALFATISTGWG 185 Query: 133 ---------IPDMRGWTIKG-KPASGRAVLSQEQDG-----------------IKSHTHS 165 +PD RG T+ G +GR + DG + SH H+ Sbjct: 186 NGDGFTTFNLPDARGRTLFGANRGTGRLTAAGGLDGSLGNMGGADQVVMLAPQMPSHIHT 245 Query: 166 ASASST---DLGTETTSSFDYGTKSTNNTGAHTHS----------ISGTANSAGAHQHKS 212 ++ S + + + D+G T G H HS GT +++G H H Sbjct: 246 STMSPAGFFEPEIQKAGAHDHG--GTKVGGDHAHSGTTGLSGTHTHGGTTDTSGDHAHVV 303 Query: 213 SGAFGGTNTSIFPNGYTAISNLSAGIMSTTSGSGQTRNAGK------TSSDGAHTHSLSG 266 +G +T PN ++ ++ G + G+GQT +G T G HTH+ S Sbjct: 304 QYGYGLVSTQT-PNNAQVVTGINLG----SQGNGQTTQSGPHQHTFTTGQGGNHTHAFS- 357 Query: 267 TAASAGAHAHTVGI-GAHTHSV-AIGSHGHTITVNAAG 302 G+HAH + + G HTH++ +H HT+ ++AAG Sbjct: 358 -TDPGGSHAHEIPVDGDHTHTIDPTPNHVHTLVIDAAG 394 >UniRef50_D0KGE1 Shufflon domain protein n=1 Tax=Pectobacterium wasabiae WPP163 RepID=D0KGE1_PECWW Length = 532 Score = 47.8 bits (112), Expect = 5e-04, Method: Compositional matrix adjust. Identities = 29/77 (37%), Positives = 40/77 (51%), Gaps = 14/77 (18%) Query: 94 GAPIPWPSDTVPSGYALMQGQTFDKSAYPKLAVAYPSGVIPDMRGWTIKG-------KPA 146 G P+P P G+ + GQ F+ S P LA YPS +PD RG+ +G P Sbjct: 399 GTPVP------PEGWLELNGQPFNPSGNPVLASLYPSSQVPDFRGYFPRGWDNGAGIDPD 452 Query: 147 SGRAVLSQEQDGIKSHT 163 S RA+LS + D I++ T Sbjct: 453 S-RAILSVQGDAIRNIT 468 >UniRef50_B2SVF7 Phage-related protein n=3 Tax=Xanthomonas oryzae pv. oryzae RepID=B2SVF7_XANOP Length = 501 Score = 47.8 bits (112), Expect = 5e-04, Method: Compositional matrix adjust. 
Identities = 75/295 (25%), Positives = 112/295 (37%), Gaps = 64/295 (21%) Query: 68 DTTDANWSPWAQLY-TSAHPPAEFYPVGAPIPWPSDTVPSGYALMQGQTFDKSAYPKLAV 126 D D W + + + P F G + S P+G + G ++ Y L Sbjct: 224 DMLDGRQGDWYRDFGNMLNVPQSFLLPGQIVVMASLYPPNGLLVCDGAEISRAKYAALFA 283 Query: 127 A----YPSGV------IPDMRGWTIKGKPASGRAVLSQEQDGIKSHTHSASASSTDLGTE 176 A Y +G +P ++ T+ ++ AV S + + SHTH ASA++ Sbjct: 284 AIGTVYGAGDGSTTFNVPKIKEGTVITHTSAATAVGSYDPGQVISHTHGASAAAVGDHAH 343 Query: 177 TTSSFDYGTKS----------------TNNTGAHTHSISGTANSAGAHQHKSSGAFGGTN 220 T+ G + T+ G H H G+ +++G HQH G Sbjct: 344 YTAINAAGNHAHGASAGAAGDHAHYAWTDAQGHHAHG--GSTSASGDHQHP------GVI 395 Query: 221 TSIFPNGYTAISNLSAGIMSTTSGSGQTRNAGKTSSDGAHTHSLSGTAASAGAHAHTV-- 278 S NGY G+ + G T + G H HS GT AG+H H + Sbjct: 396 PSASINGY--------GVYRERDNDAAPSD-GWTGAGGNHAHSF-GTDG-AGSHGHNISM 444 Query: 279 -GIGAHTHSVAI---------------GSHGHTITVNAAGNAENTVKNIAFNYIV 317 G+G HTH + I G+H HTITVNAAG +N + Y + Sbjct: 445 NGVGNHTHGIGIAEGGNHVHDVDHRGAGAHAHTITVNAAGGIDNLPAGLRMTYCI 499 >UniRef50_Q0BEK5 Phage Tail Collar domain protein n=1 Tax=Burkholderia ambifaria AMMD RepID=Q0BEK5_BURCM Length = 735 Score = 47.8 bits (112), Expect = 6e-04, Method: Compositional matrix adjust. 
Identities = 77/276 (27%), Positives = 109/276 (39%), Gaps = 81/276 (29%) Query: 79 QLYTSAHPPAEFYPVG-APIPWPSDTVP-SGYALMQGQTFDKSAYPKL-AVAYPSGVI-- 133 +L TSA A V I W + T P +G+ + G ++ YP L A A SG + Sbjct: 503 KLITSAWFAAAVADVQIGQIVWEARTAPRAGFLKLNGTELKRADYPLLWAYAQGSGALVA 562 Query: 134 -----------------------PDMRGWTIK----GKPASGRAVLSQEQDGI-KSHTHS 165 PD+RG I+ + + + QD + + H H Sbjct: 563 DADWGKGRHGCFSSGDGNTTFRLPDLRGEFIRCWDDARGTDAQRQIGSWQDSLNRLHAHG 622 Query: 166 ASASSTDLGTETTSSFDYGTKSTNNTGAHTHSISGTANSAGAHQHKSSGAFGGTNTSIFP 225 ASA++ +G + ++ T++ G H HSI N G H H A GG Sbjct: 623 ASAAA--VGDHSHGAW------TDSQGWHGHSI----NDPG-HDHGIPVASGG------- 662 Query: 226 NGYTAISNLSAGIMSTTSGSGQTRNAGKTSSDGAHTHSLSGTAASA---GAHAHTVGIGA 282 GY NL+ G G G R G SGT S GAH H VGIG Sbjct: 663 -GYIGEINLNGG------GRGDKRTTG------------SGTGISINGDGAHGHNVGIGG 703 Query: 283 HTHSVAIGSHGHTITVNAAGNAENTVKNIAFNYIVR 318 G+H HTI++ A G E+ +N+A ++R Sbjct: 704 ------AGAHSHTISIGADGGNESRPRNVALLVMIR 733 >UniRef50_D0KGE5 Tail Collar domain protein n=4 Tax=Pectobacterium RepID=D0KGE5_PECWW Length = 157 Score = 47.0 bits (110), Expect = 9e-04, Method: Compositional matrix adjust. Identities = 37/108 (34%), Positives = 54/108 (50%), Gaps = 17/108 (15%) Query: 94 GAPIPWPSDTVPSGYALMQGQTFDKSAYPKLAVAYPSGVIPDMRGWTIKG-------KPA 146 G P+P P G+ + GQ F+ S P LA YPS +PD RG+ +G P Sbjct: 24 GTPVP------PEGWLELNGQPFNPSGNPVLASLYPSSQVPDFRGYFPRGWDNGAGIDPD 77 Query: 147 SGRAVLSQEQDGIKSHTHSASA-SSTDLGTETTSSFDYGTKSTNNTGA 193 S RA+LS + D I++ T + S++ G SS YG +N+G+ Sbjct: 78 S-RAILSVQGDAIRNITGEFNPGGSSNWGKGVFSS--YGWPYPSNSGS 122 >UniRef50_C6BT48 Tail Collar domain protein n=1 Tax=Desulfovibrio salexigens DSM 2638 RepID=C6BT48_DESAD Length = 208 Score = 45.4 bits (106), Expect = 0.003, Method: Compositional matrix adjust. 
Identities = 28/79 (35%), Positives = 44/79 (55%), Gaps = 8/79 (10%) Query: 91 YPVGAPIPWPSDTVPSGYALMQGQTFDKSAYPKLAVAYPSGVIPDMRGWTIKGKPA---- 146 YP+GA + DT P G+ GQ+ + YP+LA + V PD+RG I+G + Sbjct: 61 YPIGAVAAYRGDTPPVGWLECNGQS--TTGYPELAAVVGANV-PDLRGEFIRGLDSGRGV 117 Query: 147 -SGRAVLSQEQDGIKSHTH 164 +GRA+ S + D ++ H+H Sbjct: 118 DAGRALGSAQADAMERHSH 136 >UniRef50_C5ABB4 Putative uncharacterized protein n=1 Tax=Burkholderia glumae BGR1 RepID=C5ABB4_BURGB Length = 670 Score = 43.5 bits (101), Expect = 0.009, Method: Compositional matrix adjust. Identities = 69/255 (27%), Positives = 95/255 (37%), Gaps = 75/255 (29%) Query: 106 SGYALMQGQTFDKSAYPKL-AVAYPSGV-------------------------IPDMRG- 138 +GY G + ++ YP L A A SG +PD+RG Sbjct: 447 AGYVKCDGSQYKRADYPALWAYAQASGALVSEAEYTDGRWGGFSTADGQTYFRVPDLRGE 506 Query: 139 ----WTI-KGKPASGRAVLSQEQDGIKSHTHSASASSTDLGTETTSSFDYGTKSTNNTGA 193 W+ +G GRA+ S + ++H H AS ++ GA Sbjct: 507 FLRCWSDGRGDVDPGRAIGSFQGGQNQAHAHGAS--------------------SDPDGA 546 Query: 194 HTHSISGTANSAGAHQHKSSGAFGGTNTSIFPNG-YTAISN---LSAGIMSTTSGSGQTR 249 H H A + GA H G GG NG ++ + L + S T GSG + Sbjct: 547 HVHD----AWTGGAGWHSHHGVTGGGGMHNHANGVFSRLLRPPYLGSLTGSDTDGSGNEQ 602 Query: 250 NAGKTSSDGAHTHSLSGTAASAGAHAH---TVGIGAHTHSVAIGS---HGHTITVNAAGN 303 G S A AG H H T G G H H+V IG+ H H I V A G Sbjct: 603 AVGGGDS---------ADIAWAGEHQHEFWTDGAGDHVHAVGIGNAGGHAHAIHVQADGG 653 Query: 304 AENTVKNIAFNYIVR 318 AE +N+A ++R Sbjct: 654 AEARPRNVALLAMIR 668 >UniRef50_Q4TVW2 Tail fiber protein n=4 Tax=Viruses RepID=Q4TVW2_9CAUD Length = 760 Score = 43.5 bits (101), Expect = 0.011, Method: Compositional matrix adjust. 
Identities = 32/86 (37%), Positives = 40/86 (46%), Gaps = 14/86 (16%) Query: 92 PVGAPIPWPSDTVPSGYALMQGQTFDKSAYPKLAVAYPSGVIPDMRGWTIK--------- 142 P+G+ P+ T P+GY G TF K YP L S +PDMRG +K Sbjct: 265 PIGSIFPFVK-TPPAGYLTCDGSTFSKDEYPDLYAYLGSTTLPDMRGRYLKMPSDLANIY 323 Query: 143 -GKPASGRAVLSQEQDGIKSHTHSAS 167 PA A+L D SHTH+AS Sbjct: 324 QKFPAIIPALL---HDVDISHTHTAS 346 >UniRef50_A9IXL3 Phage-related protein n=6 Tax=Bartonella RepID=A9IXL3_BART1 Length = 334 Score = 41.2 bits (95), Expect = 0.044, Method: Compositional matrix adjust. Identities = 31/131 (23%), Positives = 51/131 (38%), Gaps = 20/131 (15%) Query: 101 SDTVPSGYALMQGQTFDKSAYPKLAVAY----------PSGVIPDMRGWTIKGKPA---- 146 S+ +PSG+ L G+ + + Y L + +PD+RG ++G + Sbjct: 180 SEKIPSGWLLCDGKEYSRKNYANLFAVLGETWGKGDGKTTFNVPDLRGMFLRGLDSGKEI 239 Query: 147 -SGRAVLSQEQDGIKSHTHSASASST-----DLGTETTSSFDYGTKSTNNTGAHTHSISG 200 GR + S++++ KSHTH ST T Y + A + Sbjct: 240 DKGRLLGSRQEESFKSHTHEGKTDSTGKHQHSYPTIKNDILRYKREDYKGYVAVVYKTDT 299 Query: 201 TANSAGAHQHK 211 AG H+HK Sbjct: 300 LTEPAGEHEHK 310 >UniRef50_B4EF34 Putative phage tail protein n=1 Tax=Burkholderia cenocepacia J2315 RepID=B4EF34_BURCJ Length = 883 Score = 41.2 bits (95), Expect = 0.053, Method: Compositional matrix adjust. Identities = 26/65 (40%), Positives = 33/65 (50%), Gaps = 8/65 (12%) Query: 254 TSSDGAHTHSLSGTAASAGAHAHTVGIGAHTHSVAIGSHGHTITVNAAGNAENTVKNIAF 313 TS G H H + AG H H VGIGA G+H H ITVN G E+ +N+A Sbjct: 825 TSGAGGHNHEFN--TEGAGNHGHNVGIGA------AGNHSHAITVNGDGANESRPRNVAL 876 Query: 314 NYIVR 318 ++R Sbjct: 877 LAMIR 881 >UniRef50_C3KCU2 Putative phage tail fiber-related protein n=1 Tax=Pseudomonas fluorescens SBW25 RepID=C3KCU2_PSEFS Length = 658 Score = 40.4 bits (93), Expect = 0.084, Method: Compositional matrix adjust. 
Identities = 29/90 (32%), Positives = 41/90 (45%), Gaps = 16/90 (17%) Query: 92 PVGAPIPWPSDTVPSGYALMQGQTFDKSAYPKLA----VAYPSG-------VIPD----- 135 PVG+ + +P D VP G+ + G +AYP LA A+ G +P+ Sbjct: 181 PVGSMVAFPIDKVPVGFLEIDGSVKSATAYPDLAKFLGTAFNKGDEGAGNFRLPESRGEF 240 Query: 136 MRGWTIKGKPASGRAVLSQEQDGIKSHTHS 165 +RGW +GR S + D KSHTH Sbjct: 241 LRGWDHGRGVDAGRLAGSYQTDQFKSHTHE 270 Searching..................................................done Results from round 2 Score E Sequences producing significant alignments: (bits) Value Sequences used in model and found again: UniRef50_Q3YZL1 Phage protein-related n=42 Tax=Enterobacteriacea... 391 e-107 UniRef50_B5FQX9 Side tail fiber protein n=2 Tax=Salmonella enter... 328 1e-88 UniRef50_P76072 Side tail fiber protein homolog from lambdoid pr... 303 6e-81 UniRef50_B5PP06 Side tail fiber protein n=2 Tax=Salmonella enter... 295 2e-78 UniRef50_B7L485 Putative tail fiber protein n=4 Tax=Escherichia ... 290 5e-77 UniRef50_B3X2T1 Side tail fiber protein n=1 Tax=Shigella dysente... 283 5e-75 UniRef50_B3XIH6 GpH n=7 Tax=Enterobacteriaceae RepID=B3XIH6_ECOLX 272 1e-71 UniRef50_A4TT73 Phage tail protein n=30 Tax=Enterobacteriaceae R... 260 6e-68 UniRef50_P03764 Side tail fiber protein n=2 Tax=root RepID=STF_L... 252 8e-66 UniRef50_B4T041 Gp19 n=1 Tax=Salmonella enterica subsp. enterica... 227 4e-58 UniRef50_B3I2W7 Phage Tail Collar Domain family n=1 Tax=Escheric... 225 1e-57 UniRef50_Q99362 Protein 37 (Fragment) n=1 Tax=Enterobacteria pha... 225 2e-57 UniRef50_B7LN99 Putative tail fiber protein n=2 Tax=Enterobacter... 221 3e-56 UniRef50_B7LKX7 Putative side tail phage protein n=2 Tax=Escheri... 217 5e-55 UniRef50_Q3ZL14 Tail fiber protein n=2 Tax=Enterobacteriaceae Re... 215 2e-54 UniRef50_A9DEM1 Tail protein 2 n=1 Tax=Yersinia phage PY100 RepI... 191 3e-47 UniRef50_B6S308 ORF32 (Fragment) n=3 Tax=Enterobacteriaceae RepI... 185 2e-45 UniRef50_C4UEH4 Variable tail fiber protein n=3 Tax=Yersinia Rep... 
179 1e-43 UniRef50_D0IJ09 Tail fiber protein H putative n=1 Tax=Vibrio sp.... 153 7e-36 UniRef50_Q9AHZ5 YdaB n=29 Tax=Enterobacteriaceae RepID=Q9AHZ5_PHOLU 145 2e-33 UniRef50_A9Q1X5 Putative tail fiber protein n=1 Tax=Enterobacter... 137 5e-31 UniRef50_A3GRX3 Tail fiber like-protein n=1 Tax=Vibrio cholerae ... 136 1e-30 UniRef50_C6CGA0 Tail Collar domain protein n=5 Tax=Enterobacteri... 135 2e-30 UniRef50_UPI000190EC42 bacteriophage tail fiber protein n=1 Tax=... 130 5e-29 UniRef50_Q6KGF6 Putative tail fiber protein GP37 n=2 Tax=unclass... 129 1e-28 UniRef50_D2TR91 Putative phage tail fibre protein n=1 Tax=Citrob... 128 2e-28 UniRef50_C7BL21 Similarities with phage tail fiber protein n=1 T... 128 2e-28 UniRef50_D1TPQ4 Phage tail collar domain protein n=1 Tax=Yersini... 127 5e-28 UniRef50_B2SVF7 Phage-related protein n=3 Tax=Xanthomonas oryzae... 127 7e-28 UniRef50_C6V0Q3 Phage tail fiber protein n=13 Tax=Enterobacteria... 125 2e-27 UniRef50_C6CP88 Tail Collar domain protein n=5 Tax=Enterobacteri... 125 2e-27 UniRef50_C6C5D2 Tail Collar domain protein n=2 Tax=Dickeya dadan... 125 3e-27 UniRef50_C6DE08 Tail Collar domain protein n=1 Tax=Pectobacteriu... 123 8e-27 UniRef50_B7MJL6 Putative phage tail protein n=3 Tax=Escherichia ... 123 1e-26 UniRef50_A4WEL3 Phage Tail Collar domain protein n=2 Tax=Enterob... 123 1e-26 UniRef50_A9R3H4 Tail collar domain protein n=20 Tax=Yersinia Rep... 122 2e-26 UniRef50_B3I9S3 DNA inversion product n=5 Tax=Escherichia coli R... 121 2e-26 UniRef50_B3X4P8 Tail fiber n=3 Tax=Enterobacteriaceae RepID=B3X4... 121 3e-26 UniRef50_D2U1K0 Phage tail protein n=1 Tax=Arsenophonus nasoniae... 121 4e-26 UniRef50_A7FIU0 Tail collar domain protein n=1 Tax=Yersinia pseu... 118 2e-25 UniRef50_D2U2G6 Phage tail assembly protein n=1 Tax=Arsenophonus... 118 2e-25 UniRef50_Q7N2R3 Similar to tail fiber assembly protein from bact... 118 3e-25 UniRef50_C5H7L2 Putative tail fiber protein GP37 n=3 Tax=unclass... 
116 1e-24 UniRef50_C5BH14 Tail fiber n=2 Tax=Enterobacteriaceae RepID=C5BH... 116 1e-24 UniRef50_Q38190 Gp37, tip of tail fiber (Fragment) n=5 Tax=Enter... 116 1e-24 UniRef50_B5JYG6 Phage Tail Collar Domain family (Fragment) n=1 T... 115 2e-24 UniRef50_P33227 Putative protein stfE (Fragment) n=60 Tax=Entero... 115 2e-24 UniRef50_Q7N348 Similarities with tail fiber protein n=1 Tax=Pho... 115 2e-24 UniRef50_Q6D3Y6 Probable bacteriophage variable tail fiber prote... 115 3e-24 UniRef50_Q7N5C0 Similarities with DNA inversion product n=5 Tax=... 114 3e-24 UniRef50_C7BSQ6 Putative tail fiber protein n=1 Tax=Photorhabdus... 114 4e-24 UniRef50_Q7N2Q1 Similar to Sc/SvQ protein of Escherichia coli pl... 114 4e-24 UniRef50_C6C5D4 Tail Collar domain protein n=1 Tax=Dickeya dadan... 113 9e-24 UniRef50_B7US81 Predicted tail fiber protein n=16 Tax=root RepID... 113 1e-23 UniRef50_Q9MCR6 Tail fiber n=1 Tax=Enterobacteria phage HK97 Rep... 113 1e-23 UniRef50_UPI0001B5347E putative variable tail fiber protein n=1 ... 113 1e-23 UniRef50_B2PZV1 Putative uncharacterized protein n=2 Tax=Gammapr... 111 2e-23 UniRef50_C6CP84 Tail Collar domain protein n=2 Tax=Dickeya RepID... 111 2e-23 UniRef50_C6DA10 Tail Collar domain protein n=7 Tax=Enterobacteri... 111 3e-23 UniRef50_D2BT14 Tail Collar domain protein n=1 Tax=Dickeya dadan... 111 4e-23 UniRef50_B3I8J5 DNA inversion product n=3 Tax=Enterobacteriaceae... 111 5e-23 UniRef50_D2BYH6 Tail Collar domain protein n=1 Tax=Dickeya dadan... 110 6e-23 UniRef50_C7BSQ1 Side tail fiber protein homolog from lambdoid pr... 109 2e-22 UniRef50_C6CG98 Tail Collar domain protein n=1 Tax=Dickeya zeae ... 108 2e-22 UniRef50_C6CGA4 Tail Collar domain protein n=7 Tax=Enterobacteri... 108 3e-22 UniRef50_C6C6Z0 Tail Collar domain protein n=1 Tax=Dickeya dadan... 108 3e-22 UniRef50_C5J9F2 Putative uncharacterized protein (Fragment) n=1 ... 108 4e-22 UniRef50_D0KLI8 Tail Collar domain protein n=1 Tax=Pectobacteriu... 
107 5e-22 UniRef50_B7NJP1 Putative side tail fiber protein homolog from la... 106 9e-22 UniRef50_B1M1N8 Tail Collar domain protein n=1 Tax=Methylobacter... 105 2e-21 UniRef50_C4S5W0 Putative uncharacterized protein n=2 Tax=Yersini... 105 2e-21 UniRef50_D0FSD9 Phage related-protein n=2 Tax=Erwinia pyrifoliae... 105 3e-21 UniRef50_C5H7L3 Putative tail fiber protein n=1 Tax=Enterobacter... 104 3e-21 UniRef50_C4TTI1 Tail fiber protein n=2 Tax=Yersinia RepID=C4TTI1... 100 6e-20 UniRef50_Q65WH4 Putative uncharacterized protein n=1 Tax=Mannhei... 99 1e-19 UniRef50_Q6D2U8 Bacteriophage tail fiber protein n=1 Tax=Pectoba... 100 1e-19 UniRef50_Q0BEK5 Phage Tail Collar domain protein n=1 Tax=Burkhol... 99 2e-19 UniRef50_D0KGE1 Shufflon domain protein n=1 Tax=Pectobacterium w... 99 2e-19 UniRef50_Q7NAA0 Complete genome; segment 1/17 n=2 Tax=Photorhabd... 96 2e-18 UniRef50_UPI0001A44C27 bacteriophage tail fiber protein n=2 Tax=... 95 3e-18 UniRef50_A9IRI0 Phage related protein n=9 Tax=Bartonella RepID=A... 93 9e-18 UniRef50_Q7N047 Similarities with bacteriophage protein n=3 Tax=... 93 1e-17 UniRef50_Q7N6T1 Similarities with DNA inversion product n=2 Tax=... 92 2e-17 UniRef50_D0KGE5 Tail Collar domain protein n=4 Tax=Pectobacteriu... 91 4e-17 UniRef50_B6VKW8 Sc/svq protein n=2 Tax=Photorhabdus asymbiotica ... 90 1e-16 UniRef50_Q1I687 Putative phage variable tail fibre protein n=1 T... 88 5e-16 UniRef50_D0ZBI4 Putative tail fiber protein n=1 Tax=Edwardsiella... 86 2e-15 UniRef50_B8DPV9 Tail Collar domain protein n=1 Tax=Desulfovibrio... 84 5e-15 UniRef50_Q93D60 PilV n=7 Tax=Escherichia coli RepID=Q93D60_ECOLX 83 9e-15 UniRef50_D0KGE3 Tail Collar domain protein n=1 Tax=Pectobacteriu... 83 1e-14 UniRef50_A3GUE7 Tail fiber protein H, putative (Fragment) n=1 Ta... 79 2e-13 Sequences not found previously or not previously below threshold: UniRef50_C5ABB4 Putative uncharacterized protein n=1 Tax=Burkhol... 98 5e-19 UniRef50_Q2T5M0 Phage-related tail fiber protein n=19 Tax=root R... 
96 2e-18 UniRef50_A6VBH2 Putative tail fiber protein n=2 Tax=Pseudomonas ... 94 8e-18 UniRef50_B4EF34 Putative phage tail protein n=1 Tax=Burkholderia... 90 1e-16 UniRef50_UPI00016A4B89 phage-related tail fiber protein n=2 Tax=... 88 4e-16 UniRef50_B3R3K1 Bacteriophage large tail fiber protein n=1 Tax=C... 85 3e-15 UniRef50_B9BDD9 Bacteriophage protein n=3 Tax=Burkholderia RepID... 85 5e-15 UniRef50_C6ABW9 Phage tail collar protein n=1 Tax=Bartonella gra... 83 1e-14 UniRef50_C8PDQ5 Phage Tail Collar Domain protein n=1 Tax=Campylo... 80 9e-14 UniRef50_B2ZY49 Phage tail collar domain protein n=1 Tax=Ralston... 80 1e-13 UniRef50_C6BT48 Tail Collar domain protein n=1 Tax=Desulfovibrio... 80 1e-13 UniRef50_B2I5N0 Tail Collar domain protein n=13 Tax=Xylella fast... 80 1e-13 UniRef50_C3KCU2 Putative phage tail fiber-related protein n=1 Ta... 77 7e-13 UniRef50_A1VSH6 Phage Tail Collar domain protein n=1 Tax=Polarom... 76 1e-12 UniRef50_C3X912 Phage tail collar domain-containing protein n=1 ... 76 1e-12 UniRef50_Q4KHC6 Tail fibre protein, putative n=1 Tax=Pseudomonas... 74 6e-12 UniRef50_Q7N541 Similar to DNA inversion product and tail fiber ... 73 1e-11 UniRef50_B7UGJ3 Predicted tai fiber protein n=15 Tax=Escherichia... 72 3e-11 UniRef50_B2FIY3 Putative phage collar protein n=1 Tax=Stenotroph... 71 3e-11 UniRef50_A6E6G6 Phage Tail Collar n=1 Tax=Pedobacter sp. BAL39 R... 70 8e-11 UniRef50_D2MH12 Tail Collar domain protein n=1 Tax=Rhodopseudomo... 70 9e-11 UniRef50_C3X3G6 Putative uncharacterized protein n=1 Tax=Oxaloba... 70 1e-10 UniRef50_C3X8U2 Phage Tail Collar Domain containing protein n=1 ... 70 1e-10 UniRef50_B3HKW0 Phage Tail Collar Domain protein n=11 Tax=Entero... 70 2e-10 UniRef50_C4GFX3 Putative uncharacterized protein n=2 Tax=Kingell... 69 2e-10 UniRef50_Q3KH70 Putative phage tail fiber-related protein n=1 Ta... 68 4e-10 UniRef50_Q4TVW2 Tail fiber protein n=4 Tax=Viruses RepID=Q4TVW2_... 
68 4e-10 UniRef50_Q7P172 Probable bacteriophge tail fiber protein n=1 Tax... 68 6e-10 UniRef50_C3X8Y3 Putative uncharacterized protein n=1 Tax=Oxaloba... 67 7e-10 UniRef50_C5A8Q3 Phage-related tail fiber protein n=1 Tax=Burkhol... 67 7e-10 UniRef50_Q116W7 Phage Tail Collar n=1 Tax=Trichodesmium erythrae... 67 9e-10 UniRef50_A4PE45 Tail fiber protein gpH n=3 Tax=root RepID=A4PE45... 67 9e-10 UniRef50_C3X3W1 Predicted protein n=2 Tax=Oxalobacter formigenes... 67 1e-09 UniRef50_A9IXL3 Phage-related protein n=6 Tax=Bartonella RepID=A... 66 2e-09 UniRef50_B5TAB1 Gp47 n=2 Tax=root RepID=B5TAB1_9CAUD 66 2e-09 UniRef50_A0NQ95 Putative tail fiber-related protein n=1 Tax=Labr... 65 3e-09 UniRef50_UPI00019136B5 bacteriophage tail fiber protein n=7 Tax=... 64 7e-09 UniRef50_B3Z3L3 Phage minor structural protein n=3 Tax=Bacillus ... 63 1e-08 UniRef50_B0USC5 Phage Tail Collar domain protein n=1 Tax=Haemoph... 63 1e-08 UniRef50_Q7Y2B3 Gp12 Short tail fibers n=2 Tax=unclassified T4-l... 62 3e-08 UniRef50_B5S308 Phage tail collar protein n=2 Tax=Ralstonia sola... 62 3e-08 UniRef50_A9ITY4 Phage related protein n=6 Tax=Bartonella RepID=A... 61 5e-08 UniRef50_Q4C9U4 Phage Tail Collar n=1 Tax=Crocosphaera watsonii ... 61 6e-08 UniRef50_A9AVE2 Tail Collar domain protein n=1 Tax=Herpetosiphon... 61 6e-08 UniRef50_C2RWX3 Phage minor structural protein n=1 Tax=Bacillus ... 61 6e-08 UniRef50_A9ITX5 Phage-related protein n=6 Tax=Bartonella RepID=A... 61 7e-08 UniRef50_Q31Q92 Putative uncharacterized protein n=2 Tax=Synecho... 60 1e-07 UniRef50_Q7P176 Probable bacteriophge tail fiber protein n=1 Tax... 60 1e-07 UniRef50_C3X8I7 Putative uncharacterized protein n=1 Tax=Oxaloba... 59 2e-07 UniRef50_C3X1Y2 Tail fiber protein gpH n=1 Tax=Oxalobacter formi... 59 3e-07 UniRef50_B0UTN0 Phage Tail Collar domain protein n=1 Tax=Haemoph... 58 3e-07 UniRef50_P51735 Probable tail fiber protein n=27 Tax=root RepID=... 58 5e-07 UniRef50_C3X192 Predicted protein n=1 Tax=Oxalobacter formigenes... 
57 7e-07 UniRef50_C3X3R8 Predicted protein n=4 Tax=Oxalobacter formigenes... 57 1e-06 UniRef50_B8HZW5 Tail Collar domain protein n=2 Tax=Clostridium R... 57 1e-06 UniRef50_C3LHF1 Phage minor structural protein n=13 Tax=Bacteria... 57 1e-06 UniRef50_D1BW55 Tail Collar domain protein n=1 Tax=Xylanimonas c... 56 1e-06 UniRef50_D1NFN8 Apo-citrate lyase phosphoribosyl-dephospho-CoA t... 56 1e-06 UniRef50_A3YFP9 35 kDa protein-like n=1 Tax=Marinomonas sp. MED1... 56 2e-06 UniRef50_C3X8V5 Bacteriophage tail fiber protein n=2 Tax=Oxaloba... 56 2e-06 UniRef50_A9LZ37 Tail fibre protein, putative n=21 Tax=Neisseria ... 56 2e-06 UniRef50_C3X909 Predicted protein n=2 Tax=Oxalobacter formigenes... 55 3e-06 UniRef50_A1HR57 Putative uncharacterized protein n=1 Tax=Thermos... 55 4e-06 UniRef50_C6S6V6 Putative phage tail fibre protein n=1 Tax=Neisse... 55 4e-06 UniRef50_A7INV5 Tail Collar domain protein n=1 Tax=Xanthobacter ... 54 8e-06 UniRef50_C4VIX0 74kDa protein n=28 Tax=root RepID=C4VIX0_ENTFA 53 1e-05 UniRef50_C3X3K6 Predicted protein n=2 Tax=Oxalobacter formigenes... 53 1e-05 UniRef50_B8FJJ3 Tail Collar domain protein n=1 Tax=Desulfatibaci... 53 1e-05 UniRef50_B8QTW7 Putative tail fiber protein n=1 Tax=Erwinia phag... 53 1e-05 UniRef50_C3X971 Predicted protein n=6 Tax=Oxalobacter formigenes... 53 1e-05 UniRef50_B3FYL6 Gp17 n=1 Tax=Salmonella phage phiSG-JL2 RepID=B3... 53 2e-05 UniRef50_B6XJ97 Putative uncharacterized protein n=2 Tax=Enterob... 53 2e-05 UniRef50_UPI000180B6D6 PREDICTED: similar to glutamate receptor,... 52 3e-05 UniRef50_C3X8R9 Bacteriophage tail fiber protein n=6 Tax=Oxaloba... 51 4e-05 UniRef50_A4NHY2 Probable tail fiber protein n=1 Tax=Haemophilus ... 51 5e-05 UniRef50_C5B185 Putative uncharacterized protein n=1 Tax=Methylo... 51 6e-05 UniRef50_A4P195 Putative phage tail fibre protein (Fragment) n=1... 50 8e-05 UniRef50_C3XAA4 Putative uncharacterized protein n=1 Tax=Oxaloba... 50 9e-05 UniRef50_C9PG79 Putative phage tail protein n=1 Tax=Vibrio furni... 
50 1e-04 UniRef50_A8YDB4 Genome sequencing data, contig C291 n=2 Tax=Micr... 50 1e-04 UniRef50_Q727X4 Tail fiber protein, putative n=4 Tax=Desulfovibr... 50 2e-04 UniRef50_B8HZW4 Tail Collar domain protein n=2 Tax=Clostridium R... 49 2e-04 UniRef50_C0DSG4 Putative uncharacterized protein n=1 Tax=Eikenel... 48 3e-04 UniRef50_Q6J803 Pas28 n=1 Tax=Actinoplanes phage phiAsp2 RepID=Q... 48 3e-04 UniRef50_A2EHN1 Phage tail fiber repeat family protein n=27 Tax=... 48 4e-04 UniRef50_D0LMW0 Tail Collar domain protein n=1 Tax=Haliangium oc... 48 4e-04 UniRef50_C4MYW8 Gp12 Short tail fibers n=1 Tax=Enterobacteria ph... 48 5e-04 UniRef50_C2I7P2 Phage-related tail fiber protein n=1 Tax=Vibrio ... 47 7e-04 UniRef50_Q094A8 Phage Tail Collar Domain family n=1 Tax=Stigmate... 47 0.001 UniRef50_D2L4G0 Tail Collar domain protein n=1 Tax=Desulfovibrio... 47 0.001 UniRef50_B6IWH6 Putative uncharacterized protein n=2 Tax=Bacteri... 46 0.002 UniRef50_D2V5I7 Microcystin-dependent protein n=1 Tax=Naegleria ... 45 0.003 UniRef50_A0A7D3 Putative uncharacterized protein n=1 Tax=Microcy... 45 0.004 UniRef50_C3YB93 Putative uncharacterized protein n=1 Tax=Branchi... 45 0.004 UniRef50_C5RJD9 Tail Collar domain protein n=1 Tax=Clostridium c... 44 0.006 UniRef50_Q8PR98 Microcystin dependent protein n=1 Tax=Xanthomona... 44 0.007 UniRef50_P10930 Short tail fiber protein n=8 Tax=Myoviridae RepI... 44 0.007 UniRef50_Q4KAW4 Putative uncharacterized protein n=1 Tax=Pseudom... 44 0.007 UniRef50_Q58MY1 Predicted protein n=1 Tax=Prochlorococcus phage ... 44 0.008 UniRef50_D1ANH0 Putative uncharacterized protein n=1 Tax=Sebalde... 44 0.009 UniRef50_A9AVC5 Tail Collar domain protein n=1 Tax=Herpetosiphon... 43 0.011 UniRef50_B0MAM5 Putative uncharacterized protein n=2 Tax=Anaeros... 43 0.011 UniRef50_Q03314 Protein rhiB n=2 Tax=Rhizobium leguminosarum bv.... 43 0.011 UniRef50_C5RN01 Tail Collar domain protein n=1 Tax=Clostridium c... 
43 0.012 UniRef50_A6EAC0 Microcystin-dependent protein n=1 Tax=Pedobacter... 43 0.012 UniRef50_Q11LT1 Microcystin-dependent protein-like n=1 Tax=Chela... 43 0.016 UniRef50_Q7N6A5 Complete genome; segment 6/17 n=1 Tax=Photorhabd... 43 0.018 UniRef50_Q84CW8 Putative transmembrane protein n=1 Tax=unculture... 43 0.018 UniRef50_Q7N687 Complete genome; segment 6/17 n=1 Tax=Photorhabd... 42 0.022 UniRef50_B2W978 Putative uncharacterized protein n=2 Tax=Pleospo... 42 0.024 UniRef50_A8T9J8 Putative uncharacterized protein n=1 Tax=Vibrio ... 42 0.024 UniRef50_D1Y7E0 Collagen alpha 1 n=1 Tax=Pyramidobacter piscolen... 42 0.025 UniRef50_Q56BI6 Gp12 short tail fibers n=1 Tax=Enterobacteria ph... 42 0.026 UniRef50_A1TNG3 Phage Tail Collar domain protein n=9 Tax=Bacteri... 42 0.026 UniRef50_B8DLJ2 Tail fiber protein, putative n=3 Tax=Desulfovibr... 42 0.027 UniRef50_C7BIF9 Putative uncharacterized protein n=1 Tax=Photorh... 42 0.031 UniRef50_Q8GDJ7 Orf24 n=1 Tax=Photorhabdus luminescens RepID=Q8G... 42 0.031 UniRef50_C3X3W3 Predicted protein n=1 Tax=Oxalobacter formigenes... 42 0.034 UniRef50_C7BQB5 Putative uncharacterized protein n=1 Tax=Photorh... 41 0.040 UniRef50_Q2W7B1 Microcystin-dependent protein n=3 Tax=Proteobact... 41 0.042 UniRef50_B5ZGB2 Tail Collar domain protein n=4 Tax=Gluconacetoba... 41 0.042 UniRef50_Q2W7B2 Microcystin-dependent protein n=1 Tax=Magnetospi... 41 0.045 UniRef50_B6VNN2 Putative uncharacterized protein n=1 Tax=Photorh... 41 0.045 UniRef50_B3QRT1 Tail Collar domain protein n=1 Tax=Chloroherpeto... 41 0.057 UniRef50_A1TUY7 Phage Tail Collar domain protein n=4 Tax=Acidovo... 41 0.057 UniRef50_A2A761 Zinc finger protein 69 n=3 Tax=Mus musculus RepI... 41 0.078 UniRef50_B0MAY2 Putative uncharacterized protein n=1 Tax=Anaeros... 40 0.095 UniRef50_A5GA42 Phage Tail Collar domain protein n=2 Tax=Bacteri... 
40 0.096 >UniRef50_Q3YZL1 Phage protein-related n=42 Tax=Enterobacteriaceae RepID=Q3YZL1_SHISS Length = 1029 Score = 391 bits (1004), Expect = e-107, Method: Composition-based stats. Identities = 277/321 (86%), Positives = 292/321 (90%), Gaps = 3/321 (0%) Query: 3 ITALTDNTQGAAGLELYEVYNNGYPTAYGNIIHLKGMTAVGEGELLIGWSGTSGAHAPAF 62 + ALTDNTQGAAGLELYEVYNNGYPTAYGNIIHLKGMTAVGEGELLIGWSGTSGAHAPAF Sbjct: 709 VAALTDNTQGAAGLELYEVYNNGYPTAYGNIIHLKGMTAVGEGELLIGWSGTSGAHAPAF 768 Query: 63 IRSRRDTTDANWSPWAQLYTSAHPPAEFYPVGAPIPWPSDTVPSGYALMQGQTFDKSAYP 122 IRSRRDTTDANWSPWAQLYTSAHPPAEFYPVGAPIPWPSDTVPSGYALMQGQTFDKSAYP Sbjct: 769 IRSRRDTTDANWSPWAQLYTSAHPPAEFYPVGAPIPWPSDTVPSGYALMQGQTFDKSAYP 828 Query: 123 KLAVAYPSGVIPDMRGWTIKGKPASGRAVLSQEQDGIKSHTHSASASSTDLGTETTSSFD 182 KLA AYPSGVIPDMRGWTIKGKPASGRAVLSQEQDGIKSHTHSASASSTDLGT+TTSSFD Sbjct: 829 KLAAAYPSGVIPDMRGWTIKGKPASGRAVLSQEQDGIKSHTHSASASSTDLGTKTTSSFD 888 Query: 183 YGTKSTNNTGAHTHSISGTANSAGAHQHKSSGA---FGGTNTSIFPNGYTAISNLSAGIM 239 YGTKSTNNTGAHTHS+SG+ +SAGAHQH +G G T +FP G T +S + + Sbjct: 889 YGTKSTNNTGAHTHSLSGSTSSAGAHQHSQTGPRTNSGSQPTGMFPAGSTQVSGTNQVGI 948 Query: 240 STTSGSGQTRNAGKTSSDGAHTHSLSGTAASAGAHAHTVGIGAHTHSVAIGSHGHTITVN 299 S + SG ++ GK+SS+G HTHSLSGTAASAGAHAHTVGIGAHTHSVAIGSHGHTITVN Sbjct: 949 SGSLTSGTSQWVGKSSSEGNHTHSLSGTAASAGAHAHTVGIGAHTHSVAIGSHGHTITVN 1008 Query: 300 AAGNAENTVKNIAFNYIVRLA 320 AAGNAENTVKNIAFNYIVRLA Sbjct: 1009 AAGNAENTVKNIAFNYIVRLA 1029 >UniRef50_B5FQX9 Side tail fiber protein n=2 Tax=Salmonella enterica subsp. enterica RepID=B5FQX9_SALDC Length = 569 Score = 328 bits (841), Expect = 1e-88, Method: Composition-based stats. 
Identities = 187/319 (58%), Positives = 211/319 (66%), Gaps = 50/319 (15%) Query: 3 ITALTDNTQGA-AGLELYEVYNNGYPTAYGNIIHLKGMTAVGEGELLIGWSGTSGAHAPA 61 + ALT T+G+ +GL + EVYNNGYPT YGNI+ L G G+GE+LIGWSGT+GA APA Sbjct: 300 LPALTGATRGSDSGLIMGEVYNNGYPTQYGNILRLTG---TGDGEILIGWSGTNGAPAPA 356 Query: 62 FIRSRRDTTDANWSPWAQLYTSAHPPAEFYPVGAPIPWPSDTVPSGYALMQGQTFDKSAY 121 +IRS RDT DA WS WA LYTS +PP YPVGA I WPSD P+GYALMQGQ+FDKSAY Sbjct: 357 YIRSHRDTADAEWSEWAMLYTSLNPPPNSYPVGAAIAWPSDATPAGYALMQGQSFDKSAY 416 Query: 122 PKLAVAYPSGVIPDMRGWTIKGKPASGRAVLSQEQDGIKSHTHSASASSTDLGTETTSSF 181 P LA+AYPSG+IPDMRGWTIKGKP SGRAVLSQE DG KSH+HSA A TDLGT++TSSF Sbjct: 417 PLLAIAYPSGIIPDMRGWTIKGKPISGRAVLSQEMDGNKSHSHSARAQDTDLGTKSTSSF 476 Query: 182 DYGTKSTNNTGAHTHSISGTANSAGAHQHKSSGAFGGTNTSIFPNGYTAISNLSAGIMST 241 DYGTKSTN TG HTH G NS +TS P G Sbjct: 477 DYGTKSTNTTGNHTHQFGGYINS---------YWGDSNHTSFQPGG-------------- 513 Query: 242 TSGSGQTRNAGKTSSDGAHTHSLSGTAASAGAHAHTVGIGAHTHSVAIGSHGHTITVNAA 301 GA T +AG HAHTV IG H H++ IG HGH + V+A Sbjct: 514 ----------------GAWTQ-------AAGDHAHTVYIGGHEHTMYIGPHGHVVIVDAD 550 Query: 302 GNAENTVKNIAFNYIVRLA 320 GNAE TVKNIAFNYIVRLA Sbjct: 551 GNAETTVKNIAFNYIVRLA 569 >UniRef50_P76072 Side tail fiber protein homolog from lambdoid prophage Rac n=23 Tax=root RepID=STFR_ECOLI Length = 1120 Score = 303 bits (775), Expect = 6e-81, Method: Composition-based stats. 
Identities = 200/259 (77%), Positives = 208/259 (80%), Gaps = 14/259 (5%) Query: 62 FIRSRRDTTDANWSPWAQLYTSAHPPAEFYPVGAPIPWPSDTVPSGYALMQGQTFDKSAY 121 F RS RD WA++YTS + P E YPVGAPIPWPSDTVPSGYALMQGQ FDKSAY Sbjct: 876 FYRSSRDGYGFE-EDWAEVYTSKNLPPESYPVGAPIPWPSDTVPSGYALMQGQAFDKSAY 934 Query: 122 PKLAVAYPSGVIPDMRGWTIKGKPASGRAVLSQEQDGIKSHTHSASASSTDLGTETTSSF 181 PKLA AYPSGVIPDMRGWTIKGKPASGRAVLSQEQDGIKSHTHSASASSTDLGT+TTSSF Sbjct: 935 PKLAAAYPSGVIPDMRGWTIKGKPASGRAVLSQEQDGIKSHTHSASASSTDLGTKTTSSF 994 Query: 182 DYGTKSTNNTGAHTHSISGTANSAGAHQHKSSGAFGGTNTSIFPNGYTAISNLSAGIMST 241 DYGTKSTNNTGAHTHS+SG+ NSAGAH H N TA +N AG ST Sbjct: 995 DYGTKSTNNTGAHTHSVSGSTNSAGAHTHS------------LANVNTASANSGAGSAST 1042 Query: 242 TSGSGQTRNAGKTSSDGAHTHSLSGTAASAGAHAHTVGIGAHTHSVAIGSHGHTITVNAA 301 +N TSS GAHTHSLSGTAASAGAHAHTVGIGAHTHSVAIGSHGHTITVNAA Sbjct: 1043 RLSVVHNQNYA-TSSAGAHTHSLSGTAASAGAHAHTVGIGAHTHSVAIGSHGHTITVNAA 1101 Query: 302 GNAENTVKNIAFNYIVRLA 320 GNAENTVKNIAFNYIVRLA Sbjct: 1102 GNAENTVKNIAFNYIVRLA 1120 >UniRef50_B5PP06 Side tail fiber protein n=2 Tax=Salmonella enterica subsp. enterica RepID=B5PP06_SALHA Length = 534 Score = 295 bits (754), Expect = 2e-78, Method: Composition-based stats. 
Identities = 173/305 (56%), Positives = 200/305 (65%), Gaps = 50/305 (16%) Query: 3 ITALTDNTQGA-AGLELYEVYNNGYPTAYGNIIHLKGMTAVGEGELLIGWSGTSGAHAPA 61 + ALT T+G+ +GL + EVYNNGYPT YGNI+ L G G+GE+LIGWSGT+GA APA Sbjct: 156 LPALTGTTRGSDSGLIMGEVYNNGYPTQYGNILRLTG---TGDGEILIGWSGTNGAPAPA 212 Query: 62 FIRSRRDTTDANWSPWAQLYTSAHPPAEFYPVGAPIPWPSDTVPSGYALMQGQTFDKSAY 121 +IRS RDT DA WS WA LYT+ +PP + +PVGAPI WPSD P+GYALMQGQ+FDKSAY Sbjct: 213 YIRSHRDTADAEWSEWAMLYTTLNPPPDSHPVGAPIAWPSDATPAGYALMQGQSFDKSAY 272 Query: 122 PKLAVAYPSGVIPDMRGWTIKGKPASGRAVLSQEQDGIKSHTHSASASSTDLGTETTSSF 181 P LA+AYPSGVIPDMRGWTIKGKPASGRA+LSQE DG KSH+HSA A TDLGT+TTSSF Sbjct: 273 PLLAIAYPSGVIPDMRGWTIKGKPASGRAILSQEMDGNKSHSHSARAQDTDLGTKTTSSF 332 Query: 182 DYGTKSTNNTGAHTHSISGTANSAGAHQHKSSGAFGGTNTSIFPNGYTAISNLSAGIMST 241 DYGTKSTN TG HT+ G NS +TS P G Sbjct: 333 DYGTKSTNTTGNHTNQFGGYINS---------YWGDSNHTSFQPGG-------------- 369 Query: 242 TSGSGQTRNAGKTSSDGAHTHSLSGTAASAGAHAHTVGIGAHTHSVAIGSHGHTITVNAA 301 GA T +AG HAHTV IG H H++ IG HGH + V+A Sbjct: 370 ----------------GAWTQ-------AAGDHAHTVYIGGHEHTMYIGPHGHVVIVDAD 406 Query: 302 GNAEN 306 GNAE Sbjct: 407 GNAET 411 >UniRef50_B7L485 Putative tail fiber protein n=4 Tax=Escherichia coli RepID=B7L485_ECO55 Length = 1056 Score = 290 bits (741), Expect = 5e-77, Method: Composition-based stats. 
Identities = 194/263 (73%), Positives = 208/263 (79%), Gaps = 5/263 (1%) Query: 62 FIRSRRDTTDANWSPWAQLYTSAHPPAEFYPVGAPIPWPSDTVPSGYALMQGQTFDKSAY 121 F RS RD WA++YTS + P E YPVGAPIPWPSDTVPSGYALMQGQTF+KSAY Sbjct: 795 FYRSSRDGYGFE-EDWAEVYTSKNLPPESYPVGAPIPWPSDTVPSGYALMQGQTFNKSAY 853 Query: 122 PKLAVAYPSGVIPDMRGWTIKGKPASGRAVLSQEQDGIKSHTHSASASSTDLGTETTSSF 181 PKLA AYPSGVIPDMRGWTIKGKPASGRAVLSQEQDGIKSHTHSASASSTDLGT+TTSSF Sbjct: 854 PKLAAAYPSGVIPDMRGWTIKGKPASGRAVLSQEQDGIKSHTHSASASSTDLGTKTTSSF 913 Query: 182 DYGTKSTNNTGAHTHSISGTANSAGAHQHKSSGAFGGTNTSIFP----NGYTAISNLSAG 237 DYGTKSTNNTGAHTHS+SG+ SAG H H + + G S G+T + N Sbjct: 914 DYGTKSTNNTGAHTHSLSGSTGSAGVHTHGNGIRWPGGGGSALAFYDGGGFTYVQNSQYQ 973 Query: 238 IMSTTSGSGQTRNAGKTSSDGAHTHSLSGTAASAGAHAHTVGIGAHTHSVAIGSHGHTIT 297 + TS +T S GAHTHSLSGTAAS+GAHAHTVGIGAHTHSVAIGSHGHTIT Sbjct: 974 VSPGTSSYRSYYQRIQTQSAGAHTHSLSGTAASSGAHAHTVGIGAHTHSVAIGSHGHTIT 1033 Query: 298 VNAAGNAENTVKNIAFNYIVRLA 320 VNAAGNAENTVKNIAFNYIVRLA Sbjct: 1034 VNAAGNAENTVKNIAFNYIVRLA 1056 >UniRef50_B3X2T1 Side tail fiber protein n=1 Tax=Shigella dysenteriae 1012 RepID=B3X2T1_SHIDY Length = 488 Score = 283 bits (724), Expect = 5e-75, Method: Composition-based stats. 
Identities = 236/272 (86%), Positives = 240/272 (88%) Query: 49 IGWSGTSGAHAPAFIRSRRDTTDANWSPWAQLYTSAHPPAEFYPVGAPIPWPSDTVPSGY 108 I W TS A +Y+ P +FYP GAPIPWPSDTVPSGY Sbjct: 217 IQWDYTSNASVTIHTSPAYSANKPEGLTDGTVYSLYTPSEQFYPPGAPIPWPSDTVPSGY 276 Query: 109 ALMQGQTFDKSAYPKLAVAYPSGVIPDMRGWTIKGKPASGRAVLSQEQDGIKSHTHSASA 168 ALMQGQTFDKSAYPKLAVAYPSGVIPDMRGWTIKGKPASGRAVLSQEQDGIKSHTHSASA Sbjct: 277 ALMQGQTFDKSAYPKLAVAYPSGVIPDMRGWTIKGKPASGRAVLSQEQDGIKSHTHSASA 336 Query: 169 SSTDLGTETTSSFDYGTKSTNNTGAHTHSISGTANSAGAHQHKSSGAFGGTNTSIFPNGY 228 SSTDLGT+TTSSFDYGTKSTNNTGAHTHSISGTANSAGAHQHKSSGAFGGTNTSIFPNGY Sbjct: 337 SSTDLGTKTTSSFDYGTKSTNNTGAHTHSISGTANSAGAHQHKSSGAFGGTNTSIFPNGY 396 Query: 229 TAISNLSAGIMSTTSGSGQTRNAGKTSSDGAHTHSLSGTAASAGAHAHTVGIGAHTHSVA 288 TAISNLSAGIMSTTSGSGQTRNAGKTSSDGAHTHSLSGTAASAGAHAHTVGIGAHTHSVA Sbjct: 397 TAISNLSAGIMSTTSGSGQTRNAGKTSSDGAHTHSLSGTAASAGAHAHTVGIGAHTHSVA 456 Query: 289 IGSHGHTITVNAAGNAENTVKNIAFNYIVRLA 320 IGSHGHTITVNAAGNAENTVKNIAFNYIVRLA Sbjct: 457 IGSHGHTITVNAAGNAENTVKNIAFNYIVRLA 488 >UniRef50_B3XIH6 GpH n=7 Tax=Enterobacteriaceae RepID=B3XIH6_ECOLX Length = 710 Score = 272 bits (695), Expect = 1e-71, Method: Composition-based stats. 
Identities = 176/260 (67%), Positives = 193/260 (74%), Gaps = 23/260 (8%) Query: 61 AFIRSRRDTTDANWSPWAQLYTSAHPPAEFYPVGAPIPWPSDTVPSGYALMQGQTFDKSA 120 ++ RS+ T D W+ W P + +PVGA IPWPSD+VP+GYA+MQGQTFDK+ Sbjct: 474 SYTRSQYSTGD--WTAWT--------PQDSFPVGAAIPWPSDSVPTGYAVMQGQTFDKTT 523 Query: 121 YPKLAVAYPSGVIPDMRGWTIKGKPASGRAVLSQEQDGIKSHTHSASASSTDLGTETTSS 180 YP LA AYPSGV+PDMRGWTIKGKPASGR VLS EQDGIKSHTHSASAS+TDLGT+TTSS Sbjct: 524 YPLLAAAYPSGVLPDMRGWTIKGKPASGRDVLSLEQDGIKSHTHSASASNTDLGTKTTSS 583 Query: 181 FDYGTKSTNNTGAHTHSISGTANSAGAHQHKSSGAFGGTNTSIFPNGYTAISNLSAGIMS 240 FDYGTKSTNNTGAHTH++SGTANSAGAH H S Sbjct: 584 FDYGTKSTNNTGAHTHNVSGTANSAGAHTHTVPLRR-------------PNSGGMNFDWL 630 Query: 241 TTSGSGQTRNAGKTSSDGAHTHSLSGTAASAGAHAHTVGIGAHTHSVAIGSHGHTITVNA 300 + SG G S GAHTHS+SGTA SAGAHAHTVGIGAHTHSVAIGSHGHTITVNA Sbjct: 631 DGASSGTVVGNGTVPSSGAHTHSVSGTATSAGAHAHTVGIGAHTHSVAIGSHGHTITVNA 690 Query: 301 AGNAENTVKNIAFNYIVRLA 320 AGNAENTVKNIAFNYIVRLA Sbjct: 691 AGNAENTVKNIAFNYIVRLA 710 >UniRef50_A4TT73 Phage tail protein n=30 Tax=Enterobacteriaceae RepID=A4TT73_YERPP Length = 962 Score = 260 bits (663), Expect = 6e-68, Method: Composition-based stats. 
Identities = 162/284 (57%), Positives = 190/284 (66%), Gaps = 30/284 (10%) Query: 56 GAHAPAFIRSRRDTTDANWSPWAQLYTSAHPPAEFYPVGAPIPWPSDTVPSGYALMQGQT 115 G + T AN+ LY+S PP E YPVGAPIPWP+D PSG+A+MQGQT Sbjct: 690 GTPEYVATKPASSTNGANYI----LYSSVLPPPESYPVGAPIPWPNDVAPSGFAIMQGQT 745 Query: 116 FDKSAYPKLAVAYPSGVIPDMRGWTIKGKPASGRAVLSQEQDGIKSHTHSASASSTDL-- 173 FDKS YPKLA AYPSGV+PDMRGW IKGKP S RAVLS EQDGIKSH H+A+ASSTDL Sbjct: 746 FDKSVYPKLAAAYPSGVLPDMRGWMIKGKPTS-RAVLSLEQDGIKSHAHNAAASSTDLGT 804 Query: 174 --------GTETTSSFDYGTKSTNNTGAHTHSISGTANSAGAHQHKSSGAFGGTNTSIFP 225 GT+T+S FDYGTKS+N+TGAH HS+SG+ +S+GAH H T + +P Sbjct: 805 KPTTTFDYGTKTSSGFDYGTKSSNSTGAHAHSLSGSTSSSGAHAHTV------TAHTQYP 858 Query: 226 NGYTAISNLSAGIMSTTSGSGQTRNAGKTSSDGAHTHSLSGTAASAGAHAHTVGIGA--- 282 + + + G T + TSS G H HS+SGTA SAGAHAHTVGIGA Sbjct: 859 RSTDSRNQNAVGKQYNTQQTTANAFNVWTSSAGDHAHSISGTAVSAGAHAHTVGIGAHAH 918 Query: 283 ------HTHSVAIGSHGHTITVNAAGNAENTVKNIAFNYIVRLA 320 H+HSVAIG+H HTIT+ A GNAENTVKNIA+NYIVRLA Sbjct: 919 SLSIGSHSHSVAIGAHSHTITIAACGNAENTVKNIAYNYIVRLA 962 >UniRef50_P03764 Side tail fiber protein n=2 Tax=root RepID=STF_LAMBD Length = 774 Score = 252 bits (644), Expect = 8e-66, Method: Composition-based stats. 
Identities = 171/254 (67%), Positives = 184/254 (72%), Gaps = 27/254 (10%) Query: 88 AEFYPVGAPIPWPSDTVPSGYALMQGQTFDKSAYPKLAVAYPSGVIPDMRGWTIKGKPAS 147 +P GAPIPWPSD VPSGY LMQGQ FDKSAYPKLAVAYPSGV+PDMRGWTIKGKPAS Sbjct: 527 NSAFPAGAPIPWPSDIVPSGYVLMQGQAFDKSAYPKLAVAYPSGVLPDMRGWTIKGKPAS 586 Query: 148 GRAVLSQEQDGIKSHTHSASASSTDLGTETTS----------SFDYGTKSTNNTGAHTHS 197 GRAVLSQEQDGIKSHTHSASAS TDLGT+TTS SFDYGTKSTNNTGAH HS Sbjct: 587 GRAVLSQEQDGIKSHTHSASASGTDLGTKTTSSFDYGTKTTGSFDYGTKSTNNTGAHAHS 646 Query: 198 ISGTANSAGAHQHKSSGAFGGTNTSIFPNGYTAISNLSAGIMSTTSGSGQTR--NAGKTS 255 +SG+ +AGAH H S + S + G +ST G+ KT Sbjct: 647 LSGSTGAAGAHAHTSGLRMNSSGWSQYGTATIT------GSLSTVKGTSTQGIAYLSKTD 700 Query: 256 SDGAHTHSLSGTAASAGAHAHTVGIGAHTHSV---------AIGSHGHTITVNAAGNAEN 306 S G+H+HSLSGTA SAGAHAHTVGIGAH H V +IGSHGHTITVNAAGNAEN Sbjct: 701 SQGSHSHSLSGTAVSAGAHAHTVGIGAHQHPVVIGAHAHSFSIGSHGHTITVNAAGNAEN 760 Query: 307 TVKNIAFNYIVRLA 320 TVKNIAFNYIVRLA Sbjct: 761 TVKNIAFNYIVRLA 774 >UniRef50_B4T041 Gp19 n=1 Tax=Salmonella enterica subsp. enterica serovar Newport str. SL254 RepID=B4T041_SALNS Length = 580 Score = 227 bits (578), Expect = 4e-58, Method: Composition-based stats. 
Identities = 123/236 (52%), Positives = 139/236 (58%), Gaps = 46/236 (19%) Query: 85 HPPAEFYPVGAPIPWPSDTVPSGYALMQGQTFDKSAYPKLAVAYPSGVIPDMRGWTIKGK 144 P P G P+PWPSDT+P+GYALMQGQ FDK+ YP LA+AYPSG IPDMRGWTIKGK Sbjct: 391 WRPMMSCPPGVPLPWPSDTIPAGYALMQGQAFDKNVYPLLAIAYPSGTIPDMRGWTIKGK 450 Query: 145 PASGRAVLSQEQDGIKSHTHSASASSTDLGTETTSSFDYGTKSTNNTGAHTHSISGTANS 204 P SGRAVLSQE DG KSH+H A A TDLGT+ TSSFDYGTKS+N TG H HS GT Sbjct: 451 PVSGRAVLSQELDGNKSHSHGARALDTDLGTKGTSSFDYGTKSSNTTGGHNHSAGGT--- 507 Query: 205 AGAHQHKSSGAFGGTNTSIFPNGYTAISNLSAGIMSTTSGSGQTRNAGKTSSDGAHTHSL 264 +GG + G + + G Sbjct: 508 -----------YGGDSIG---------------------GKARVQRDGNDQ--------- 526 Query: 265 SGTAASAGAHAHTVGIGAHTHSVAIGSHGHTITVNAAGNAENTVKNIAFNYIVRLA 320 + G HAHT IG H H+V IG HGH + V+A GNAE TVKNIAFNYIVRLA Sbjct: 527 --LTSWNGDHAHTTWIGPHDHTVYIGPHGHVVIVDADGNAETTVKNIAFNYIVRLA 580 >UniRef50_B3I2W7 Phage Tail Collar Domain family n=1 Tax=Escherichia coli E22 RepID=B3I2W7_ECOLX Length = 654 Score = 225 bits (573), Expect = 1e-57, Method: Composition-based stats. 
Identities = 134/257 (52%), Positives = 153/257 (59%), Gaps = 49/257 (19%) Query: 74 WSPWAQLYTSAHPPAEFYPVGAPIPWPSDTVPSGYALMQGQTFDKSAYPKLAVAYPSGVI 133 W+PW P + YPVGAPIPWPSD P+GYALMQGQ FDK+ YP LA+AYP+G+I Sbjct: 437 WTPWM--------PEDSYPVGAPIPWPSDVTPTGYALMQGQPFDKAVYPLLAIAYPAGII 488 Query: 134 PDMRGWTIKGKPASGRAVLSQEQDGIKSHTHSASASSTDLGTETTSSFDYGT-------- 185 PDMRG TIKGKP +GRAVLS EQDG+ SHTH AS S TDLGT+ TSSFDYG+ Sbjct: 489 PDMRGQTIKGKP-NGRAVLSYEQDGVISHTHGASISDTDLGTKYTSSFDYGSKPTTSFDY 547 Query: 186 --KSTNNTGAHTHSISGTANSAGAHQHKSSGAFGGTNTSIFPNGYTAISNLSAGIMSTTS 243 KS+ G H H+ A SA + G G ++S N+S Sbjct: 548 GNKSSTEGGWHAHNFRYCATSA---YRDTPGQGLGMHSS----------NVSWAAGDRIE 594 Query: 244 GSGQTRNAGKTSSDGAHTHSLSGTAASAGAHAHTVGIGAHTHSVAIGSHGHTITVNAAGN 303 GS G H H G H H VGIGAH H V +G HGHT TV+AAGN Sbjct: 595 GS------------GNHAH-----VTWIGPHDHWVGIGAHNHYVVMGYHGHTATVHAAGN 637 Query: 304 AENTVKNIAFNYIVRLA 320 AENTVKNIAFNYIVRLA Sbjct: 638 AENTVKNIAFNYIVRLA 654 >UniRef50_Q99362 Protein 37 (Fragment) n=1 Tax=Enterobacteria phage T4 RepID=Q99362_BPT4 Length = 382 Score = 225 bits (572), Expect = 2e-57, Method: Composition-based stats. 
Identities = 166/255 (65%), Positives = 188/255 (73%), Gaps = 23/255 (9%) Query: 88 AEFYPVGAPIPWPSDTVPSGYALMQGQTFDKSAYPKLAVAYPSGVIPDMRGWTIKGKPAS 147 YP+GAPIPWP+DT P+GYALM+GQTFD AYPKLA AYPSG IPDMRG TIKGKP S Sbjct: 129 VSSYPIGAPIPWPTDTPPNGYALMEGQTFDTRAYPKLAAAYPSGTIPDMRGQTIKGKP-S 187 Query: 148 GRAVLSQEQDGIKSHTHSASASSTDLGTETTSSFDYGTKST----------NNTGAHTHS 197 GRAVLS E DG+KSHTH ASAS+TDLGT+TTSSFDYGTK+T N TG H H+ Sbjct: 188 GRAVLSTEADGVKSHTHGASASNTDLGTKTTSSFDYGTKTTSSFDYGTKTSNTTGNHNHT 247 Query: 198 ISGTANSAGAHQHKSSGA--FGGTNTSIFPNGYTAI-SNLSAGIMSTTSGSGQTRNAGKT 254 +SGT +SAGAHQH SG G +T+IFP+GY+ + +N ++ T GS GKT Sbjct: 248 VSGTTSSAGAHQHARSGPQLSNGISTNIFPDGYSDVGTNYNSKFSGTVIGSSVPCIIGKT 307 Query: 255 SSDGAHTHSLSGT---------AASAGAHAHTVGIGAHTHSVAIGSHGHTITVNAAGNAE 305 S+DGAHTH+ SGT GAH HTVGIGAHTH+VAIGSHGHTITVNA GN E Sbjct: 308 SNDGAHTHTWSGTTSTTGNHAHTVGIGAHTHTVGIGAHTHTVAIGSHGHTITVNATGNTE 367 Query: 306 NTVKNIAFNYIVRLA 320 NTVKNIAFNYIVRLA Sbjct: 368 NTVKNIAFNYIVRLA 382 >UniRef50_B7LN99 Putative tail fiber protein n=2 Tax=Enterobacteriaceae RepID=B7LN99_ESCF3 Length = 593 Score = 221 bits (562), Expect = 3e-56, Method: Composition-based stats. 
Identities = 155/259 (59%), Positives = 173/259 (66%), Gaps = 36/259 (13%) Query: 62 FIRSRRDTTDANWSPWAQLYTSAHPPAEFYPVGAPIPWPSDTVPSGYALMQGQTFDKSAY 121 + RS RD P +PVGAPI WPSD VP GYA+MQGQTFDK+AY Sbjct: 371 YYRSSRDGYGFE---------RGFEPVNAFPVGAPIAWPSDIVPEGYAIMQGQTFDKAAY 421 Query: 122 PKLAVAYPSGVIPDMRGWTIKGKPASGRAVLSQEQDGIKSHTHSASASSTDLGTETTSSF 181 P LA AYPSGVIPDMRGWTIKGKPASGRAVLSQEQDGIKSHTHSASASSTDLGT+TTSSF Sbjct: 422 PLLAAAYPSGVIPDMRGWTIKGKPASGRAVLSQEQDGIKSHTHSASASSTDLGTKTTSSF 481 Query: 182 DYGTKSTNNTGAHTHSISGTANSAGAHQHKSSGAFGGTNTSIFPNGYTAISNLSAGIMST 241 DYGTK+ + T T N+ GAH H G +GG ++ Sbjct: 482 DYGTKTVSTFNHGTK----TTNNTGAHTHTVGGRYGG-------------DSIGGKQRVQ 524 Query: 242 TSGSGQTRNAGKTSSDGAHTHSLSGTAASAGAHAHTVGIGAHTHSVAIGSHGHTITVNAA 301 SG+ Q +SSDGAH H++ G H HTVGIGAH H+VA+G+HGHTITVNAA Sbjct: 525 VSGTNQV-----SSSDGAHAHTV-----DIGQHNHTVGIGAHAHTVALGAHGHTITVNAA 574 Query: 302 GNAENTVKNIAFNYIVRLA 320 GNAENTVKNIAFNYIVRLA Sbjct: 575 GNAENTVKNIAFNYIVRLA 593 >UniRef50_B7LKX7 Putative side tail phage protein n=2 Tax=Escherichia RepID=B7LKX7_ESCF3 Length = 567 Score = 217 bits (552), Expect = 5e-55, Method: Composition-based stats. 
Identities = 124/248 (50%), Positives = 158/248 (63%), Gaps = 44/248 (17%) Query: 83 SAHPPAEFYPVGAPIPWPSDTVPSGYALMQGQTFDKSAYPKLAVAYPSGVIPDMRGWTIK 142 AE PVG PIPWPSD+VPSGYALM GQTF+K++YPKLA+AYPSGVIPDMRGW IK Sbjct: 354 DKAIAAESCPVGMPIPWPSDSVPSGYALMTGQTFNKTSYPKLAIAYPSGVIPDMRGWIIK 413 Query: 143 GKPASGRAVLSQEQDGIKSHTHSASAS----------STDLGTETTSSFDYGTKSTNNTG 192 GKP+SGRA+LS E DG+KSH H+ S S STDLGT+TT+SF++G+++T+ +G Sbjct: 414 GKPSSGRAILSTELDGVKSHNHTGSISSTNLGTITSTSTDLGTKTTASFNHGSRNTSTSG 473 Query: 193 AHTHSISGTANSAGAHQHKSSGAFGGTNTSIFPNGYTAISNLSAGIMSTTSGSGQTRNAG 252 HTH I + GA G S++ S + Sbjct: 474 EHTHRI------------PTDGAEGKDGPSLW-----------------NSPNSDENYRE 504 Query: 253 KTSSDGAHTHSLSGTAASAGAHAHTVGIGAHTHSVAIGSHGHTITVNAAGNAENTVKNIA 312 T S G+H HS+ + GAHAHT+ +G+HTH++ +G+H H+I +N GN ENTVKNIA Sbjct: 505 PTESAGSHYHSI-----TIGAHAHTIALGSHTHNIVLGTHNHSIIINNTGNTENTVKNIA 559 Query: 313 FNYIVRLA 320 FNYIVRLA Sbjct: 560 FNYIVRLA 567 >UniRef50_Q3ZL14 Tail fiber protein n=2 Tax=Enterobacteriaceae RepID=Q3ZL14_ESCBL Length = 289 Score = 215 bits (546), Expect = 2e-54, Method: Composition-based stats. 
Identities = 110/231 (47%), Positives = 132/231 (57%), Gaps = 23/231 (9%) Query: 90 FYPVGAPIPWPSDTVPSGYALMQGQTFDKSAYPKLAVAYPSGVIPDMRGWTIKGKPASGR 149 P G + WP T P+G+ALM GQTFD +AYP+LA AYPSGVIPDMRG TIK PASGR Sbjct: 82 MLPPGIALAWPGATAPTGFALMLGQTFDTTAYPRLAQAYPSGVIPDMRGQTIKFLPASGR 141 Query: 150 AVLSQEQDGIKSHTHSASASSTDLGTETTSSFDYGTKSTNNTGAHTHSISGTANSAGAHQ 209 +LS E DG+KSH+HS S S+TDLGT T + D GTK T+ G H H N A Sbjct: 142 TLLSLEADGVKSHSHSGSISTTDLGTATAADTDLGTKQTSQDGLHNHVSDSRFNKLMARS 201 Query: 210 HKSSGAFGGTNTSIFPNGYTAISNLSAGIMSTTSGSGQTRNAGKTSSDGAHTHSLSGTAA 269 G + G + + + + R +G S A + A Sbjct: 202 SDIDG------------------TNNTGDVDSDNPESEHRVSGMNDSLWA-----ASVIA 238 Query: 270 SAGAHAHTVGIGAHTHSVAIGSHGHTITVNAAGNAENTVKNIAFNYIVRLA 320 +G H HTV IG H HSV IG HGHT+T++ GN ENTVKNIAFN IVRLA Sbjct: 239 DSGLHMHTVYIGPHAHSVYIGPHGHTVTISNFGNTENTVKNIAFNAIVRLA 289 >UniRef50_A9DEM1 Tail protein 2 n=1 Tax=Yersinia phage PY100 RepID=A9DEM1_9CAUD Length = 255 Score = 191 bits (485), Expect = 3e-47, Method: Composition-based stats. 
Identities = 105/229 (45%), Positives = 127/229 (55%), Gaps = 45/229 (19%) Query: 92 PVGAPIPWPSDTVPSGYALMQGQTFDKSAYPKLAVAYPSGVIPDMRGWTIKGKPASGRAV 151 PVGAP+ WPSDT P G+ALM GQTFDK YP LA YPSGV+PDMRG IK KP GRAV Sbjct: 72 PVGAPLAWPSDTAPDGWALMIGQTFDKVKYPLLAKVYPSGVLPDMRGRVIKAKPD-GRAV 130 Query: 152 LSQEQDGIKSHTHSASASSTDLGTETTSSFDYGTKSTNNTGAHTHSISGTANSAGAHQHK 211 LS E+D +KSHTH+ A++ GT TS+FD+G K T G HTH S +H Sbjct: 131 LSLEEDQVKSHTHTGKAATAG-GTRATSTFDHGNKRTTTNGNHTHG------SPQGARHG 183 Query: 212 SSGAFGGTNTSIFPNGYTAISNLSAGIMSTTSGSGQTRNAGKTSSDGAHTHSLSGTAASA 271 SG + TSG +T + + ++A Sbjct: 184 GSGQY-------------------------TSGDDETNSVFNWPA-----------TSAA 207 Query: 272 GAHAHTVGIGAHTHSVAIGSHGHTITVNAAGNAENTVKNIAFNYIVRLA 320 G H H V IG H H+V I +H HT+ ++A G ENTVKNIA NYIVRLA Sbjct: 208 GDHFHDVQIGPHNHNVDI-NHEHTLQIDATGGTENTVKNIAMNYIVRLA 255 >UniRef50_B6S308 ORF32 (Fragment) n=3 Tax=Enterobacteriaceae RepID=B6S308_SALDU Length = 427 Score = 185 bits (469), Expect = 2e-45, Method: Composition-based stats. Identities = 88/131 (67%), Positives = 103/131 (78%), Gaps = 4/131 (3%) Query: 3 ITALTDNTQGA-AGLELYEVYNNGYPTAYGNIIHLKGMTAVGEGELLIGWSGTSGAHAPA 61 + ALT T+G+ +GL + EVYNNGYPT YGNI+ L G G+GE+LIGWSGT+GA APA Sbjct: 300 LPALTGATRGSDSGLIMGEVYNNGYPTQYGNILRLTG---TGDGEILIGWSGTNGAPAPA 356 Query: 62 FIRSRRDTTDANWSPWAQLYTSAHPPAEFYPVGAPIPWPSDTVPSGYALMQGQTFDKSAY 121 +IRS RDT DA WS WA LYTS +PP YPVGA I WPSD P+GYALMQGQ+FDKSAY Sbjct: 357 YIRSHRDTADAEWSEWAMLYTSLNPPPNSYPVGAAIAWPSDATPAGYALMQGQSFDKSAY 416 Query: 122 PKLAVAYPSGV 132 P LA+AYPSG+ Sbjct: 417 PLLAIAYPSGI 427 >UniRef50_C4UEH4 Variable tail fiber protein n=3 Tax=Yersinia RepID=C4UEH4_YERAL Length = 387 Score = 179 bits (454), Expect = 1e-43, Method: Composition-based stats. 
Identities = 54/212 (25%), Positives = 84/212 (39%), Gaps = 14/212 (6%) Query: 9 NTQGAAGLELYEVYNNGYPTAYGN--IIHLKGMTAVGEGELLIGWSGTSGAHAPAFIRSR 66 NT A G+ Y P +G+ I H++ + + T+ H A I + Sbjct: 160 NTLAATGMYSVNQYAANIPEGFGDATIQHIQNDSLTAHQFIF----STNNTHTAAKI-AY 214 Query: 67 RDTTDANWSPWAQLYTSAHPPAEFYPVGAPIPWPSDTVPSGYALMQGQTFDKSAYPKLAV 126 R + W W + TS P+G P+P+P T P+GY G F YP LA Sbjct: 215 RLRSYGQWREWIDIVTSRSDT--LTPIGIPLPYPGTTPPAGYLKCNGAAFYPYRYPTLAT 272 Query: 127 AYPSGVIPDMRGWTIKGKP-----ASGRAVLSQEQDGIKSHTHSASASSTDLGTETTSSF 181 YP+ +PD+RG I+G + R +LS + D +++ T + S LG S+F Sbjct: 273 LYPTHKLPDLRGEFIRGFDDGRGIDTSRTLLSAQTDALQNITGGINGVSESLGIAAESNF 332 Query: 182 DYGTKSTNNTGAHTHSISGTANSAGAHQHKSS 213 + G G+ +S Sbjct: 333 TGAFAKAESVGNDNTPHHTDITHCGSFDFDAS 364 >UniRef50_D0IJ09 Tail fiber protein H putative n=1 Tax=Vibrio sp. RC586 RepID=D0IJ09_9VIBR Length = 368 Score = 153 bits (386), Expect = 7e-36, Method: Composition-based stats. Identities = 77/233 (33%), Positives = 107/233 (45%), Gaps = 64/233 (27%) Query: 87 PAEFYPVGAPIPWPSDTVPSGYALMQGQTFDKSAYPKLAVAYPSGVIPDMRGWTIKGKPA 146 A+ PVG P+PWPSD P G+A+ +GQ FDK A P+LA YP G++ D+RG + GK Sbjct: 199 AAKICPVGVPLPWPSDIAPEGFAIHKGQAFDKVANPELAKLYPDGILKDLRGMAVVGK-K 257 Query: 147 SGRAVLSQEQDGIKSHTHSASASSTDLGTETTSSFDYGTKSTNNTGAHTHSISGTANSAG 206 G +LS E D +K H + S T SS D G+++TN TG H H ++ Sbjct: 258 EGEIILSYEADQVKQHGYPNS---------TVSSTDLGSRNTNTTGNHAHGYPAGTSN-- 306 Query: 207 AHQHKSSGAFGGTNTSIFPNGYTAISNLSAGIMSTTSGSGQTRNAGKTSSDGAHTHSLSG 266 +G + T + + Y T+++G H HS Sbjct: 307 ----GPNGPYLDTAHASYGYRY-------------------------TTTEGNHYHS--- 334 Query: 267 TAASAGAHAHTVGIGAHTHSVAIGSHGHTITVNAAGNAENTVKNIAFNYIVRL 319 VAIGSH H+I + G ENT+KNI FN+IVR+ Sbjct: 335 --------------------VAIGSHAHSIAIALFGATENTIKNIKFNWIVRM 367 >UniRef50_Q9AHZ5 YdaB n=29 Tax=Enterobacteriaceae RepID=Q9AHZ5_PHOLU Length = 296 Score = 145 bits (365), Expect = 2e-33, Method: Composition-based stats. 
Identities = 49/140 (35%), Positives = 64/140 (45%), Gaps = 11/140 (7%) Query: 91 YPVGAPIPWPSDTVPSGYALMQGQTFDKSAYPKLAVAYPSGVIPDMRGWTIKGKP----- 145 PVG PIPWP+ P G+ G FDKS +P+LA AYPSG +PD+RG I+G Sbjct: 144 IPVGTPIPWPTAIPPVGWLQCNGAVFDKSKFPELAKAYPSGYLPDLRGEFIRGWDNGRGV 203 Query: 146 ASGRAVLSQEQDGIKSHTHSASASSTDLGTETTSSFDYGTKSTNNTGAHTHSISGTANSA 205 GR + + D I++ T S + D T YG N G T GT S Sbjct: 204 DPGRVCSTWQGDAIRNITGSFPGAIADNYHLATKEAFYGKI---NLGIAT---DGTTKSK 257 Query: 206 GAHQHKSSGAFGGTNTSIFP 225 H + FG + + P Sbjct: 258 NIHNPDNPYGFGFDASRVVP 277 >UniRef50_A9Q1X5 Putative tail fiber protein n=1 Tax=Enterobacteria phage phiEcoM-GJ1 RepID=A9Q1X5_9CAUD Length = 356 Score = 137 bits (345), Expect = 5e-31, Method: Composition-based stats. Identities = 75/232 (32%), Positives = 121/232 (52%), Gaps = 14/232 (6%) Query: 54 TSGAHAPAFIRSRRDT-TDANWSPWAQLYTSAHPPAEFYPVGAPIPWPSDTVPSGYALMQ 112 T+ ++ ++RS+ T +DAN+ W + + YP+G + + + T P+ Sbjct: 79 TNFSNKRMWVRSQNGTVSDANFDEWTEFVNMNNIYNAIYPIGIVVKFDNATNPNNN--FT 136 Query: 113 GQTFDKSAYPKLAVAY--PSGVIPDMRGWTIKGKPASGRAV--LSQEQDGIKSHTHSASA 168 G +++ ++A A P D + +I G + AV L G+++HTH ++ Sbjct: 137 GTVWEQIIDGRVARAATGPEAGTADGQIGSIAGSDTANIAVTNLPGHTHGMQNHTHGIAS 196 Query: 169 SSTDLGTETTSSFDYGTKSTNNTGAHTHSISGTANSAGAHQHKSSGAFGGTNTSIFPNGY 228 S + T + D+G +++++GAHTHS+SGTA SAGAHQH F G + Sbjct: 197 HSHTMAHTHTINHDHGAVTSSSSGAHTHSVSGTAASAGAHQHTEGSPFTGD-VNFGTTTS 255 Query: 229 TAISNLSAGIMSTTSGSGQTRNAGKTSSDGAHTHSLSGTAASAGAHAHTVGI 280 T+ N+S + S ++ TSS GAHTHS+SGTAASAGAH H+V + Sbjct: 256 TSKDNISDWLYSPST------RYPLTSSSGAHTHSVSGTAASAGAHTHSVDL 301 >UniRef50_A3GRX3 Tail fiber like-protein n=1 Tax=Vibrio cholerae NCTC 8457 RepID=A3GRX3_VIBCH Length = 182 Score = 136 bits (341), Expect = 1e-30, Method: Composition-based stats. 
Identities = 71/235 (30%), Positives = 103/235 (43%), Gaps = 64/235 (27%) Query: 86 PPAEFYPVGAPIPWPSDTVPSGYALMQGQTFDKSAYPKLAVAYPSGVIPDMRGWTIKGKP 145 + +PVG IPW +D P G+ + +GQ FD + Y +LA +P+G+IPDMRG + GK Sbjct: 12 FAVKIFPVGGAIPWFTDVAPEGFGMFKGQAFDVNVYTELAKVFPNGIIPDMRGCGVIGKE 71 Query: 146 ASGRAVLSQEQDGIKSHTHSASASSTDLGTETTSSFDYGTKSTNNTGAHTHSISGTANSA 205 G AV + E+ +K+H H S T SS D G+K+T N G HTH A Sbjct: 72 D-GEAVGAYEEGQVKNHGHPNS---------TVSSIDLGSKNTANGGNHTHFSGIAAFGG 121 Query: 206 GAHQHKSSGAFGGTNTSIFPNGYTAISNLSAGIMSTTSGSGQTRNAGKTSSDGAHTHSLS 265 G+H++++ G N TS+ G H Sbjct: 122 GSHRYQTDVNGSGGNI-------------------------------NTSAAGNHY---- 146 Query: 266 GTAASAGAHAHTVGIGAHTHSVAIGSHGHTITVNAAGNAENTVKNIAFNYIVRLA 320 HS+ +GSH H +T+ G +NT+ + N+IVRLA Sbjct: 147 -------------------HSIPMGSHAHAVTIALFGALKNTINHRKINWIVRLA 182 >UniRef50_C6CGA0 Tail Collar domain protein n=5 Tax=Enterobacteriaceae RepID=C6CGA0_DICZE Length = 166 Score = 135 bits (339), Expect = 2e-30, Method: Composition-based stats. Identities = 41/106 (38%), Positives = 62/106 (58%), Gaps = 5/106 (4%) Query: 93 VGAPIPWPSDTVPSGYALMQGQTFDKSAYPKLAVAYPSGVIPDMRGWTIKGKP-----AS 147 VG P+PWP T P+G+ GQ FDK+A+PKLA YPSGV+PD+RG I+G S Sbjct: 23 VGIPLPWPQATPPAGWLKCNGQAFDKNAFPKLAQVYPSGVLPDLRGEFIRGWDDGRGVDS 82 Query: 148 GRAVLSQEQDGIKSHTHSASASSTDLGTETTSSFDYGTKSTNNTGA 193 R +LS + D I++ T S + + +D G++++ + G+ Sbjct: 83 NRNLLSSQGDAIRNITGFVSGVYVGFDGYSGAFYDTGSRNSISPGS 128 >UniRef50_UPI000190EC42 bacteriophage tail fiber protein n=1 Tax=Salmonella enterica subsp. enterica serovar Typhi str. E01-6750 RepID=UPI000190EC42 Length = 317 Score = 130 bits (327), Expect = 5e-29, Method: Composition-based stats. 
Identities = 44/128 (34%), Positives = 68/128 (53%), Gaps = 9/128 (7%) Query: 91 YPVGAPIPWPSDTVPSGYALMQGQTFDKSAYPKLAVAYPSGVIPDMRGWTIKGKP----- 145 PVG P+PWPS T+P G+ G F YPKLA AYP+ +PD+RG I+G Sbjct: 171 LPVGVPVPWPSATLPEGWLKCNGAAFSSEMYPKLAKAYPTNKLPDLRGEFIRGWDDGRGI 230 Query: 146 ASGRAVLSQEQDGIKSHTHSASASSTDLGTETTSSFDYG-TKSTNNTGA-HTHSISGTAN 203 +GR +LS ++ I S + D+ + +++ + +G T S+N GA + A+ Sbjct: 231 DAGREILSFQEGTIVSGFDDND--TGDISSLSSTQYGFGDTLSSNQWGAINGKKWIFDAS 288 Query: 204 SAGAHQHK 211 S GA ++ Sbjct: 289 SKGAQKYD 296 >UniRef50_Q6KGF6 Putative tail fiber protein GP37 n=2 Tax=unclassified Myoviridae RepID=Q6KGF6_9CAUD Length = 782 Score = 129 bits (324), Expect = 1e-28, Method: Composition-based stats. Identities = 83/292 (28%), Positives = 121/292 (41%), Gaps = 72/292 (24%) Query: 33 IIHLKGMTAVGEGELLIGWSGTSGAHAPA----------FIRSRRDTTDANWSPWAQLYT 82 + ++ G + + + G S + + RS RD + Q+YT Sbjct: 497 LYNVTGYSGGSTQLVFQMYQGASSTPSAQLKFNYRNGGFWYRSSRDGFGFE-EDFTQIYT 555 Query: 83 SAHPP---------------------------AEFYPVGAPIPWPSDTVPSGYALMQGQT 115 + P + YPVG + S+ P+ Sbjct: 556 EKYKPTPSAIGAYTKAETDQKIAEAISDSTDLNKIYPVGIVTWFNSNVNPN--------- 606 Query: 116 FDKSAYPKLAVAYPSGVIPDMRGWTIKGKPASGRAVLSQEQDGIKSHTHSASASSTDLGT 175 +A P L Y + + G TI+ A+G V + + + S T + Sbjct: 607 ---TALPGLTWTYLNNGV----GRTIRIAAANGSDVATTGGSDSVTLSVGNLPSHTHSFS 659 Query: 176 ETTSSFDYGTKSTNNTGAHTHSISGTANSAGAHQHKSSGAFGGTNTSIFPNGYTAISNLS 235 TTSSFDYGTK+TN TGAHTHS+SG+ N+ GAH H G +GG ++ Sbjct: 660 ATTSSFDYGTKTTNTTGAHTHSVSGSTNNTGAHTHTFGGRYGG-------------DSIG 706 Query: 236 AGIMSTTSGSGQTRNAGKTSSDGAHTHSLSGTAASAGAHAHTVGIGAHTHSV 287 SG+ Q +S G H+H++ GTAAS G HAHTVGIGAH+H+V Sbjct: 707 GKHRVHVSGTEQV-----SSVAGDHSHTVYGTAASNGNHAHTVGIGAHSHTV 753 >UniRef50_D2TR91 Putative phage tail fibre protein n=1 Tax=Citrobacter rodentium ICC168 RepID=D2TR91_CITRO Length = 279 Score = 128 bits (322), Expect = 2e-28, Method: Composition-based stats. 
Identities = 43/104 (41%), Positives = 59/104 (56%), Gaps = 5/104 (4%) Query: 91 YPVGAPIPWPSDTVPSGYALMQGQTFDKSAYPKLAVAYPSGVIPDMRGWTIKGKP----- 145 PVG P+PWPS T P G+ G TF S YPKL +AYPSG +PD+RG I+G Sbjct: 143 LPVGVPVPWPSATPPEGWLKCNGATFSSSLYPKLGLAYPSGKLPDLRGEFIRGWDDGRGA 202 Query: 146 ASGRAVLSQEQDGIKSHTHSASASSTDLGTETTSSFDYGTKSTN 189 +GR++LS + D +SH+H+ S + T+ +D T N Sbjct: 203 DNGRSLLSSQGDAFRSHSHNFDRSWGLENFDATAGYDVVTADIN 246 >UniRef50_C7BL21 Similarities with phage tail fiber protein n=1 Tax=Photorhabdus asymbiotica subsp. asymbiotica ATCC 43949 RepID=C7BL21_PHOAA Length = 452 Score = 128 bits (322), Expect = 2e-28, Method: Composition-based stats. Identities = 64/166 (38%), Positives = 75/166 (45%), Gaps = 10/166 (6%) Query: 58 HAPAFIRSRRDTTDANWSPWAQLYTSAHPPAEFYPVGAPIPWPSDTVPSGYALMQGQTFD 117 H IR + W W +Y+SA P E +PVGAPIP+P P GY GQTFD Sbjct: 271 HGEVVIRQSW-NSGKTWIGWDIVYSSAILPPEQHPVGAPIPYPHRYTPVGYLTCNGQTFD 329 Query: 118 KSAYPKLAVAYPSGVIPDMRGWTIKGKPAS-----GRAVLSQEQDGIKSHTHSAS---AS 169 KS YPKLA AYPSG +PD+RG I+G S GR S + K+H H Sbjct: 330 KSLYPKLAEAYPSGRVPDLRGEFIRGWDDSRGVDPGRVCGSWQDSDNKAHIHDDEFCYGG 389 Query: 170 STDLGTETTSSFDYGTKSTNNTGAHTHSISG-TANSAGAHQHKSSG 214 G T S T G + SG SAG H S G Sbjct: 390 GDAGGDSGTMSAFAKKYCTPKDGVNGRPTSGWLPASAGLHSLPSGG 435 >UniRef50_D1TPQ4 Phage tail collar domain protein n=1 Tax=Yersinia pestis KIM D27 RepID=D1TPQ4_YERPE Length = 262 Score = 127 bits (319), Expect = 5e-28, Method: Composition-based stats. Identities = 42/115 (36%), Positives = 60/115 (52%), Gaps = 6/115 (5%) Query: 91 YPVGAPIPWPSDTVPSGYALMQGQTFDKSAYPKLAVAYPSGVIPDMRGWTIKGKP----- 145 PVG P+PWP+ T P G+ G FDK YPKLA+AYPSG++PD+RG I+G Sbjct: 104 IPVGVPLPWPTATPPEGWLKCNGAIFDKVKYPKLALAYPSGILPDLRGEFIRGWDDGLGV 163 Query: 146 ASGRAVLSQEQDGIKSHTHSASASSTDLGTETTSSFDYGTKSTNNT-GAHTHSIS 199 +GR +LS + D I++ + + SS G T+ G++ S Sbjct: 164 DAGREILSIQGDAIRNISGGIQGRNEATSARLFSSNATGVFRTDGQFGSYAASAD 218 >UniRef50_B2SVF7 Phage-related protein n=3 Tax=Xanthomonas oryzae pv. 
oryzae RepID=B2SVF7_XANOP Length = 501 Score = 127 bits (318), Expect = 7e-28, Method: Composition-based stats. Identities = 62/279 (22%), Positives = 100/279 (35%), Gaps = 32/279 (11%) Query: 68 DTTDANWSPWAQLY-TSAHPPAEFYPVGAPIPWPSDTVPSGYALMQGQTFDKSAYPKLAV 126 D D W + + + P F G + S P+G + G ++ Y L Sbjct: 224 DMLDGRQGDWYRDFGNMLNVPQSFLLPGQIVVMASLYPPNGLLVCDGAEISRAKYAALFA 283 Query: 127 AYPS----------GVIPDMRGWTIKGKPASGRAVLSQEQDGIKSHTHSASASSTDLGTE 176 A + +P ++ T+ ++ AV S + + SHTH ASA++ Sbjct: 284 AIGTVYGAGDGSTTFNVPKIKEGTVITHTSAATAVGSYDPGQVISHTHGASAAAVGDHAH 343 Query: 177 TTSSFDYGTKSTNNTGAHTHSISGTANSAGAHQHKSSGAFGGTNTSIFPNGYTAISNLSA 236 T+ G + + + A + H G+ + P + S Sbjct: 344 YTAINAAGNHAHGASAGAAGDHAHYAWTDAQGHHAHGGSTSASGDHQHPGVIPSASINGY 403 Query: 237 GIMSTTSGSGQTRNAGKTSSDGAHTHSLSGTAASAGAHAHTV---GIGAHTHSVAI---- 289 G+ + G T + G H HS AG+H H + G+G HTH + I Sbjct: 404 GVYRERDNDAAPSD-GWTGAGGNHAHSF--GTDGAGSHGHNISMNGVGNHTHGIGIAEGG 460 Query: 290 -----------GSHGHTITVNAAGNAENTVKNIAFNYIV 317 G+H HTITVNAAG +N + Y + Sbjct: 461 NHVHDVDHRGAGAHAHTITVNAAGGIDNLPAGLRMTYCI 499 >UniRef50_C6V0Q3 Phage tail fiber protein n=13 Tax=Enterobacteriaceae RepID=C6V0Q3_ECO5T Length = 439 Score = 125 bits (314), Expect = 2e-27, Method: Composition-based stats. Identities = 35/96 (36%), Positives = 50/96 (52%), Gaps = 5/96 (5%) Query: 91 YPVGAPIPWPSDTVPSGYALMQGQTFDKSAYPKLAVAYPSGVIPDMRGWTIKGKP----- 145 PVG P+PWPS T P+G+ G F YP+LA YP+ +PD+RG I+G Sbjct: 284 LPVGVPVPWPSATPPTGWLKCNGAAFSAEEYPELAKVYPTNKLPDLRGEFIRGWDDGRGV 343 Query: 146 ASGRAVLSQEQDGIKSHTHSASASSTDLGTETTSSF 181 +GR +L+ + I SH H ++ +T SF Sbjct: 344 DNGRGLLTLQDGAIVSHNHYWGIWTSRTNDQTLESF 379 >UniRef50_C6CP88 Tail Collar domain protein n=5 Tax=Enterobacteriaceae RepID=C6CP88_DICZE Length = 485 Score = 125 bits (313), Expect = 2e-27, Method: Composition-based stats. 
Identities = 55/185 (29%), Positives = 80/185 (43%), Gaps = 31/185 (16%) Query: 26 YPTAYGNIIHL-KGMTAVGEGELLIGWSGTSGAHAPAFIRSRRDTTDANWSPWAQLYTSA 84 YP + + + K EG + + GA FIRS W W +L + + Sbjct: 253 YPVQFAGSLDVEKNTADSAEGCIQRYTTYGGGALPRMFIRSYN-AGKQVWGAWQELASLS 311 Query: 85 HPPAEFYP------------------------VGAPIPWPSDTVPSGYALMQGQTFDKSA 120 P P G P+PWP VP+G+ GQ FDK+ Sbjct: 312 SPTFTGTPTAPTAEAGSNTTQLATTAWFAAEIAGIPLPWPQAAVPTGWLKCNGQAFDKNR 371 Query: 121 YPKLAVAYPSGVIPDMRGWTIKGKP-----ASGRAVLSQEQDGIKSHTHSASASSTDLGT 175 YP+LA YPSGV+PD+RG I+G SGR VLSQ++ + ++ SA ++D Sbjct: 372 YPRLAQVYPSGVLPDLRGEFIRGWDDGRGVDSGREVLSQQRGSLINYDGPDSAPTSDSLR 431 Query: 176 ETTSS 180 + S+ Sbjct: 432 LSVSA 436 >UniRef50_C6C5D2 Tail Collar domain protein n=2 Tax=Dickeya dadantii RepID=C6C5D2_DICDC Length = 498 Score = 125 bits (312), Expect = 3e-27, Method: Composition-based stats. Identities = 47/164 (28%), Positives = 71/164 (43%), Gaps = 21/164 (12%) Query: 93 VGAPIPWPSDTVPSGYALMQGQTFDKSAYPKLAVAYPSGVIPDMRGWTIKGKP-----AS 147 VG P+PWP T P+G+ GQ+FDK+ YPKLA YPSGV+PD+RG I+G + Sbjct: 335 VGIPLPWPQATAPTGWLKCNGQSFDKALYPKLATVYPSGVLPDLRGEFIRGWDDGRGVDA 394 Query: 148 GRAVLSQEQDGIKSHTHSASASSTDLGTETTSSFDYGTKSTNNTGAHTHSISGTANSAGA 207 GRA+L+ + + T DY +N G + A++A Sbjct: 395 GRAILTAQ----------------NPTYLRTGMMDYNGSDVDNIGVYIGMGYAEADTAAK 438 Query: 208 HQHKSSGAFGGTNTSIFPNGYTAISNLSAGIMSTTSGSGQTRNA 251 +GAF N + + ++ +T S + Sbjct: 439 SISAPAGAFRAPNNIDLTEQASRDNGVNGTASNTVYASEGSVWV 482 >UniRef50_C6DE08 Tail Collar domain protein n=1 Tax=Pectobacterium carotovorum subsp. carotovorum PC1 RepID=C6DE08_PECCP Length = 682 Score = 123 bits (308), Expect = 8e-27, Method: Composition-based stats. 
Identities = 48/123 (39%), Positives = 64/123 (52%), Gaps = 14/123 (11%) Query: 57 AHAPAFIRSRRDTTDANWSPWAQLYTSAHPPAEF--------YPVGAPIPWPSDTVPSGY 108 A+ RS RD + PWA++YT P VG P+PWP T PSG+ Sbjct: 490 ANGGIKYRSSRDNSGFE-KPWARIYTDQDKPTAADIGALSLNEIVGMPMPWPQTTAPSGW 548 Query: 109 ALMQGQTFDKSAYPKLAVAYPSGVIPDMRGWTIKGKPAS-----GRAVLSQEQDGIKSHT 163 GQTFDK+ YPKLA YP+G++PD+RG I+G S GR +LS + D I++ Sbjct: 549 LKCNGQTFDKNIYPKLAQIYPAGILPDLRGEFIRGWDDSRGVDTGRTLLSTQGDAIRNIV 608 Query: 164 HSA 166 Sbjct: 609 GEI 611 >UniRef50_B7MJL6 Putative phage tail protein n=3 Tax=Escherichia RepID=B7MJL6_ECO45 Length = 247 Score = 123 bits (307), Expect = 1e-26, Method: Composition-based stats. Identities = 45/123 (36%), Positives = 63/123 (51%), Gaps = 10/123 (8%) Query: 91 YPVGAPIPWPSDTVPSGYALMQGQTFDKSAYPKLAVAYPSGVIPDMRGWTIKGKP----- 145 PVG P+PWPS T P+G+ G F YP+LA AYP+ +PD+RG I+G Sbjct: 105 LPVGVPVPWPSATPPTGWLKCNGAAFSAEEYPELAKAYPTNKLPDLRGEFIRGWDDGRGV 164 Query: 146 ASGRAVLSQEQDGIKSHTHSASASSTDL---GTETTSSFDYGTKSTNNT--GAHTHSISG 200 S RAVLS ++ + + + S L G + T S G+ S+N T + S+SG Sbjct: 165 DSRRAVLSTQEPTVGTFYVELAIISGTLSGSGAKFTDSVGIGSTSSNITVSNGNDQSVSG 224 Query: 201 TAN 203 T Sbjct: 225 TVA 227 >UniRef50_A4WEL3 Phage Tail Collar domain protein n=2 Tax=Enterobacteriaceae RepID=A4WEL3_ENT38 Length = 340 Score = 123 bits (307), Expect = 1e-26, Method: Composition-based stats. Identities = 42/98 (42%), Positives = 57/98 (58%), Gaps = 6/98 (6%) Query: 88 AEFYPVGAPIPWPSDTVPSGYALMQGQTFDKSAYPKLAVAYPSGVIPDMRGWTIKGKP-- 145 + PVG P+PWP T P G+ G FDK YPKLAVAYPSG++PD+RG I+G Sbjct: 187 DNYLPVGFPLPWPQATPPQGWLKCNGAPFDKVKYPKLAVAYPSGLLPDLRGEFIRGWDDG 246 Query: 146 ---ASGRAVLSQEQDGIKSHTHSASA-SSTDLGTETTS 179 SGR L+ + D ++ T +AS ++T +TS Sbjct: 247 RGVDSGRVALTTQGDAVQKMTGAASNGAATGFVNNSTS 284 >UniRef50_A9R3H4 Tail collar domain protein n=20 Tax=Yersinia RepID=A9R3H4_YERPG Length = 259 Score = 122 bits (305), Expect = 2e-26, Method: Composition-based stats. 
Identities = 41/113 (36%), Positives = 59/113 (52%), Gaps = 6/113 (5%) Query: 93 VGAPIPWPSDTVPSGYALMQGQTFDKSAYPKLAVAYPSGVIPDMRGWTIKGKP-----AS 147 VG P+PWP+ T P G+ G FDK YPKLA+AYPSG++PD+RG I+G + Sbjct: 106 VGVPLPWPTATPPEGWLKCNGAIFDKVKYPKLALAYPSGILPDLRGEFIRGWDDGLGVDA 165 Query: 148 GRAVLSQEQDGIKSHTHSASASSTDLGTETTSSFDYGTKSTNNT-GAHTHSIS 199 GR +LS + D I++ + + SS G T+ G++ S Sbjct: 166 GREILSIQGDAIRNISGGIQGRNEATSARLFSSNATGVFRTDGQFGSYAASAD 218 >UniRef50_B3I9S3 DNA inversion product n=5 Tax=Escherichia coli RepID=B3I9S3_ECOLX Length = 546 Score = 121 bits (304), Expect = 2e-26, Method: Composition-based stats. Identities = 34/87 (39%), Positives = 47/87 (54%), Gaps = 5/87 (5%) Query: 91 YPVGAPIPWPSDTVPSGYALMQGQTFDKSAYPKLAVAYPSGVIPDMRGWTIKGKP----- 145 PVG P+PWPS T P+G+ G F YP+LA AYP+ +PD+RG I+G Sbjct: 386 LPVGVPVPWPSATPPTGWLKCNGAAFSVEEYPELAKAYPTNKLPDLRGEFIRGWDDGRGI 445 Query: 146 ASGRAVLSQEQDGIKSHTHSASASSTD 172 +GRA+L+ + I H H + D Sbjct: 446 DTGRALLNWQPHTILDHAHYMELWTGD 472 >UniRef50_B3X4P8 Tail fiber n=3 Tax=Enterobacteriaceae RepID=B3X4P8_SHIDY Length = 305 Score = 121 bits (303), Expect = 3e-26, Method: Composition-based stats. Identities = 71/135 (52%), Positives = 85/135 (62%), Gaps = 11/135 (8%) Query: 3 ITALTDNTQGAAGLELYEVYNNGYPTAYGNIIHLKGMTAVGEGELLIGWSGTSGAHAPAF 62 +TAL+ QG AGL++YEVYNNGYPTAYGN++HLKG A GEGELLIGWSGTSGAHAP + Sbjct: 119 VTALSSTAQGNAGLQMYEVYNNGYPTAYGNVLHLKGAAASGEGELLIGWSGTSGAHAPVY 178 Query: 63 IRSRRDTTDANWSPWAQLYTSAHPPAEFYPVGAPIPWPSDTVPSGYALMQGQTFDKSAYP 122 IRSRRDTTDA WS WAQ++TS S T + G FD + Sbjct: 179 IRSRRDTTDAVWSEWAQVFTSKDSFNA----------ASATKLQTPRKINGTAFDGTRDI 228 Query: 123 KLAVAYPSGVIPDMR 137 ++ SG + D R Sbjct: 229 TISST-DSGAVRDFR 242 >UniRef50_D2U1K0 Phage tail protein n=1 Tax=Arsenophonus nasoniae RepID=D2U1K0_9ENTR Length = 366 Score = 121 bits (302), Expect = 4e-26, Method: Composition-based stats. 
Identities = 39/79 (49%), Positives = 50/79 (63%), Gaps = 5/79 (6%) Query: 91 YPVGAPIPWPSDTVPSGYALMQGQTFDKSAYPKLAVAYPSGVIPDMRGWTIKGKP----- 145 YPVGAPIPWP T P GY + G+ FDK PKL +AYPSG +PD+RG+ I+G Sbjct: 218 YPVGAPIPWPQATPPKGYLICNGEPFDKVKCPKLLIAYPSGKLPDLRGYFIRGWDAGKGV 277 Query: 146 ASGRAVLSQEQDGIKSHTH 164 GR V S ++D I++ T Sbjct: 278 DPGREVFSYQEDAIRNITG 296 >UniRef50_A7FIU0 Tail collar domain protein n=1 Tax=Yersinia pseudotuberculosis IP 31758 RepID=A7FIU0_YERP3 Length = 402 Score = 118 bits (296), Expect = 2e-25, Method: Composition-based stats. Identities = 46/207 (22%), Positives = 82/207 (39%), Gaps = 21/207 (10%) Query: 1 MNITALTDNTQG--AAGLELYEVYNNGYPTAYGNIIHLKGMTAVGEGELLIGWSGTSGAH 58 +N+ +L + G L+++ + YP + ++ + E ++G Sbjct: 176 INLNSLGQDALGIYVQALDVFATLDRNYPITIAGSLVVRPSAYGAQQEYTPFYTGRK--- 232 Query: 59 APAFIRSRRD--TTDANWSPWAQLYTSAHPPAEFYPVGAPIPWPSDTVPSGYALMQGQTF 116 ++R+ + WS W Q+ PVG P+PWP+ PSG+ G TF Sbjct: 233 ---YVRNLMGVWNGNGPWSDWIQIGND------VAPVGIPMPWPAHIPPSGWLKCNGATF 283 Query: 117 DKSAYPKLAVAYPSGVIPDMRGWTIKGKPAS-----GRAVLSQEQDGIKSHTHSASASST 171 +K+ +P+LA Y GV+PD+RG I+G GR +LS ++ + Sbjct: 284 NKAQFPQLASVYTRGVLPDLRGEFIRGWDDGKLADPGRGLLSFQEGTVVGGYDDNDTGDI 343 Query: 172 DLGTETTSSFDYGTKSTNNTGAHTHSI 198 +S F +T + Sbjct: 344 SSIGLYSSGFGDQLTNTQWVSINGKRW 370 >UniRef50_D2U2G6 Phage tail assembly protein n=1 Tax=Arsenophonus nasoniae RepID=D2U2G6_9ENTR Length = 580 Score = 118 bits (296), Expect = 2e-25, Method: Composition-based stats. 
Identities = 51/139 (36%), Positives = 69/139 (49%), Gaps = 14/139 (10%) Query: 31 GNIIHLKGMTAVGEGELLIGWSGTSGAHAPAFIRSRRDTTDANWSPWAQLYTSAHPPAEF 90 G+ L TA G+ + + A I S++D T A S + Sbjct: 380 GDGQKLGFETAPGDAY-FVYRDAKNNNKAVVTIPSKKDGTLALTSDVEAINN-------- 430 Query: 91 YPVGAPIPWPSDTVPSGYALMQGQTFDKSAYPKLAVAYPSGVIPDMRGWTIKGKP----- 145 YPVGAPIPWP T P+GY + G FDK+ YP+LA+AYPSG +P + G I+G Sbjct: 431 YPVGAPIPWPQATPPNGYFVCDGNYFDKAKYPQLALAYPSGKLPLLYGEFIRGLDLGRKV 490 Query: 146 ASGRAVLSQEQDGIKSHTH 164 GR VLS + D I++ T Sbjct: 491 DPGRTVLSNQGDAIRNITG 509 >UniRef50_Q7N2R3 Similar to tail fiber assembly protein from bacteriophage n=1 Tax=Photorhabdus luminescens subsp. laumondii RepID=Q7N2R3_PHOLL Length = 233 Score = 118 bits (294), Expect = 3e-25, Method: Composition-based stats. Identities = 46/120 (38%), Positives = 65/120 (54%), Gaps = 7/120 (5%) Query: 92 PVGAPIPWPSDTVPSGYALMQGQTFDKSAYPKLAVAYPSGVIPDMRGWTIKGKPAS---- 147 PVG P+P+PS P+GY GQ FDKS YP+LA+AYPSG++PD+RG I+G S Sbjct: 93 PVGVPLPYPSRYTPAGYLTCNGQAFDKSRYPQLAIAYPSGILPDLRGEFIRGWDDSRGVD 152 Query: 148 -GRAVLSQEQDGIKSHTHSASASSTDLGTETTSSFDYGTKSTNNTGAHTHSISGTANSAG 206 GR +LS + GI+ H H S + + N+T +T S+ ++ G Sbjct: 153 MGRGMLSWQPAGIQDHMHYKVISKQVVEDLVLAGNQSWGTEKNST--YTRSLDQNISTGG 210 >UniRef50_C5H7L2 Putative tail fiber protein GP37 n=3 Tax=unclassified Myoviridae RepID=C5H7L2_9CAUD Length = 391 Score = 116 bits (290), Expect = 1e-24, Method: Composition-based stats. 
Identities = 82/233 (35%), Positives = 106/233 (45%), Gaps = 53/233 (22%) Query: 88 AEFYPVGAPIPWPSDTVPSGYALMQGQTFDKSAYPKLAVAYPSGVIPDMRGWTIKGKPAS 147 YPVG + PS Y L G T++ + +G + G Sbjct: 154 QASYPVGTIHLSVNSANPSTYLLCGG-TWELVS----------------KGRALVGYDTD 196 Query: 148 GRAVLSQEQDG--------IKSHTHSA-------------SASSTDLGTETTSSFDYGTK 186 R V S + +HTHS S SS D G+++TS+FDYGTK Sbjct: 197 SRPVGSTFGSQTVALTNNNLPAHTHSIYLTGGGHTHSASVSISSFDYGSKSTSTFDYGTK 256 Query: 187 STNNTGAHTHSISGTANSAGAHQHKSSGAFGGTNTSIFPNGYTAISNLSAGIMSTTSGSG 246 +TN+ GAHTH+ SGT ++AG H H+ ++ T S Sbjct: 257 TTNSAGAHTHTFSGTTSNAGNHNHRVPMRG---------------NDRGGTNAITASADA 301 Query: 247 QTRNAGKTSSDGAHTHSLSGTAASAGAHAHTVGIGAHTHSVAIGSHGHTITVN 299 NA T GAHTHS SGT AS+GAH+HTV IGAH+H+V IGSH HT TV Sbjct: 302 GVGNAMYTDLAGAHTHSFSGTTASSGAHSHTVAIGAHSHTVNIGSHSHTGTVT 354 >UniRef50_C5BH14 Tail fiber n=2 Tax=Enterobacteriaceae RepID=C5BH14_EDWI9 Length = 593 Score = 116 bits (290), Expect = 1e-24, Method: Composition-based stats. Identities = 55/187 (29%), Positives = 84/187 (44%), Gaps = 25/187 (13%) Query: 23 NNGYPTAYGNIIHLKGMTAVGEGELLIGWSGTSGAHAP--------------AFIR--SR 66 N AY N + + G+ L S TS +AP A+ R S Sbjct: 362 ANAVRYAYENAVRPATTSQAGQVLLEDSVSSTSTTNAPTSSALKRTYDRANSAYDRANSA 421 Query: 67 RDTTDANWSPWAQLYTSAHPPAEFY----PVGAPIPWPSDTVPSGYALMQGQTFDKSAYP 122 D + +S +Y A+ + PVG P PWP+ ++PSG+ GQ+F S+YP Sbjct: 422 YDRASSAYSYAGSIYDKAYDAYDIARRAPPVGTPQPWPNTSIPSGWIKCAGQSFSTSSYP 481 Query: 123 KLAVAYPSGVIPDMRGWTIKGKPASG-----RAVLSQEQDGIKSHTHSASASSTDLGTET 177 +LA AYP+G +PD+RG I+G G R +LS + D +++ T + + T Sbjct: 482 ELAKAYPNGRLPDLRGEFIRGYDDYGGTDSQRQILSWQGDAMRNITGTFGVDDQTIEQVT 541 Query: 178 TSSFDYG 184 +YG Sbjct: 542 GVFREYG 548 >UniRef50_Q38190 Gp37, tip of tail fiber (Fragment) n=5 Tax=Enterobacteria phage T4 RepID=Q38190_BPT4 Length = 226 Score = 116 bits (289), Expect = 1e-24, Method: Composition-based stats. 
Identities = 114/226 (50%), Positives = 135/226 (59%), Gaps = 46/226 (20%) Query: 140 TIKGKPASGRAVLSQEQDGIKSHTHSASASSTDLGTETTSSFDYGTKSTNNTGAHTHSIS 199 TIKGKP SGRAVLS E DG+K+H+HSASASSTDLGT+TTSSFDYGTK TN+TG HTHS S Sbjct: 1 TIKGKP-SGRAVLSAEADGVKAHSHSASASSTDLGTKTTSSFDYGTKGTNSTGGHTHSGS 59 Query: 200 GTANSAGAHQH------------------KSSGAFGGTNTSIFPNGYTAIS--NLSAGIM 239 G+ ++ G H H S GG+NT+ N S SAG Sbjct: 60 GSTSTNGEHSHYIEAWNGTGVGGNKMSSYAISYRAGGSNTNAAGNHSHTFSFGTSSAGDH 119 Query: 240 STTSGSGQTR-------------------------NAGKTSSDGAHTHSLSGTAASAGAH 274 S + G G+ T++ G H+H+ S +SAG H Sbjct: 120 SHSVGIGEHSHYIEAWNGTGVGGNKMSSYAISYRAGGSNTNAAGNHSHTFSFGTSSAGDH 179 Query: 275 AHTVGIGAHTHSVAIGSHGHTITVNAAGNAENTVKNIAFNYIVRLA 320 +H+VGIGAHTH+VAIGSHGHTITVN+ GN ENTVKNIAFNYIV LA Sbjct: 180 SHSVGIGAHTHTVAIGSHGHTITVNSTGNTENTVKNIAFNYIVALA 225 >UniRef50_B5JYG6 Phage Tail Collar Domain family (Fragment) n=1 Tax=gamma proteobacterium HTCC5015 RepID=B5JYG6_9GAMM Length = 400 Score = 115 bits (288), Expect = 2e-24, Method: Composition-based stats. 
Identities = 60/237 (25%), Positives = 90/237 (37%), Gaps = 60/237 (25%) Query: 92 PVGAPIPWPSDTVPSGYALMQGQTFDKSAYPKLAVAYP----------SGVIPDMRGWTI 141 P G + T P G+ G ++ YP L A + +PD+R Sbjct: 212 PAGRTEDFAGTTPPGGWLFCDGSEVSRTQYPALFTAIGTLWGDGDGSTTFNLPDLRNDFR 271 Query: 142 KGKPASGRAVLSQEQDGIKSHTHSASASSTDLGTETTSSFDYGTKSTNNTGAHTHSISGT 201 +G + R+V E D IKSH+HSAS + ++GAHTH G Sbjct: 272 RGCSDT-RSVGDSESDQIKSHSHSAS--------------------SEDSGAHTH--GGR 308 Query: 202 ANSAGAHQHKSSGAFGGTNTSIFPNGYTAISNLSAGIMSTTSGSGQTRNAGKTSSDGAHT 261 ++ +GAH+H+S +G +N S P G T+ R +G + D Sbjct: 309 SSDSGAHKHRS--GWGESNRSDAPFGATS--------------GSGHRGSGDSDWDNYLY 352 Query: 262 HSLSGTAASAGAHAHTVGIGAHTHSVAIGSHGHTITVNAAGNAENTVKNIAFNYIVR 318 + +A H H + I GSH H I + G E +N I+R Sbjct: 353 Y-----TDTAQPHFHWLIINQ------AGSHSHPINIEPTGGDETRPRNKVLMPIIR 398 >UniRef50_P33227 Putative protein stfE (Fragment) n=60 Tax=Enterobacteriaceae RepID=STFE_ECOLI Length = 166 Score = 115 bits (287), Expect = 2e-24, Method: Composition-based stats. Identities = 42/121 (34%), Positives = 61/121 (50%), Gaps = 8/121 (6%) Query: 91 YPVGAPIPWPSDTVPSGYALMQGQTFDKSAYPKLAVAYPSGVIPDMRGWTIKGKP----- 145 PVG P+PWPS T P+G+ G F YP+LA AYP+ +PD+RG I+G Sbjct: 9 LPVGVPVPWPSATPPTGWLKCNGAAFSAEEYPELAKAYPTNKLPDLRGEFIRGWDDGRGI 68 Query: 146 ASGRAVLSQEQDGIKSHTHSASASSTDLGTETTSSFDYGTKSTNNTGAHTHSISGTANSA 205 +GR++LS + + H H + ST + T+ T +F + N+ G N A Sbjct: 69 DTGRSILSIQGYATEDHAHGLPSRST-IVTDATINFYFDEIWVNSGTDIIKR--GNTNDA 125 Query: 206 G 206 G Sbjct: 126 G 126 >UniRef50_Q7N348 Similarities with tail fiber protein n=1 Tax=Photorhabdus luminescens subsp. laumondii RepID=Q7N348_PHOLL Length = 440 Score = 115 bits (287), Expect = 2e-24, Method: Composition-based stats. 
Identities = 41/100 (41%), Positives = 51/100 (51%), Gaps = 6/100 (6%) Query: 91 YPVGAPIPWPSDTVPSGYALMQGQTFDKSAYPKLAVAYPSGVIPDMRGWTIKGKPAS--- 147 P G P+P+P P GY GQTFDKS YPKLA AYP+G +PD+RG I+G S Sbjct: 296 VPAGVPMPYPHRYTPPGYLTCNGQTFDKSLYPKLAEAYPAGRVPDLRGEFIRGWDDSRGV 355 Query: 148 --GRAVLSQEQDGIKSHTHSASASSTDLGTETTSSFDYGT 185 GR + + D I H H AS + + D G Sbjct: 356 DPGRVCGTWQADCIPDHNHYKVASKQLVEDLVLTG-DAGW 394 >UniRef50_Q6D3Y6 Probable bacteriophage variable tail fiber protein H n=2 Tax=Pectobacterium atrosepticum RepID=Q6D3Y6_ERWCT Length = 536 Score = 115 bits (286), Expect = 3e-24, Method: Composition-based stats. Identities = 43/95 (45%), Positives = 55/95 (57%), Gaps = 7/95 (7%) Query: 93 VGAPIPWPSDTVPSGYALMQGQTFDKSAYPKLAVAYPSGVIPDMRGWTIKGKP-----AS 147 G P PWP T P+G+ GQ+FD SA+P LA AYPSGV+PD+RG I+G S Sbjct: 387 AGMPKPWPRATAPAGWLKCNGQSFDISAFPHLAAAYPSGVLPDLRGEFIRGWDDGRGVDS 446 Query: 148 GRAVLSQEQDGIKSHTHSA--SASSTDLGTETTSS 180 GR++LS + D I++ SA S ET SS Sbjct: 447 GRSLLSAQSDAIRNIVGEIWTSAVSQQFLGETLSS 481 >UniRef50_Q7N5C0 Similarities with DNA inversion product n=5 Tax=Photorhabdus RepID=Q7N5C0_PHOLL Length = 239 Score = 114 bits (285), Expect = 3e-24, Method: Composition-based stats. Identities = 45/125 (36%), Positives = 53/125 (42%), Gaps = 19/125 (15%) Query: 89 EFYPVGAPIPWPSDTVPSGYALMQGQTFDKSAYPKLAVAYPSGVIPDMRGWTIKGKP--- 145 PVG+PIPWP PSGY G F +S YPKLA AYP G IPD+RG I+G Sbjct: 99 SSIPVGSPIPWPLSHPPSGYFTCNGSAFSRSQYPKLAEAYPDGRIPDLRGEFIRGWDDGR 158 Query: 146 --ASGRAVLSQEQDGIKSHTHSASASSTDLGTETTSSFD--------------YGTKSTN 189 SGR +LS + D K + + D T N Sbjct: 159 GVDSGRVILSAQTDNTKRIQLTKGLPDGQFLSSYQGPVDRYQFPLGRDVLESATVTSIAN 218 Query: 190 NTGAH 194 NTG H Sbjct: 219 NTGGH 223 >UniRef50_C7BSQ6 Putative tail fiber protein n=1 Tax=Photorhabdus asymbiotica subsp. asymbiotica ATCC 43949 RepID=C7BSQ6_PHOAA Length = 318 Score = 114 bits (285), Expect = 4e-24, Method: Composition-based stats. 
Identities = 33/95 (34%), Positives = 50/95 (52%), Gaps = 5/95 (5%) Query: 81 YTSAHPPAEFYPVGAPIPWPSDTVPSGYALMQGQTFDKSAYPKLAVAYPSGVIPDMRGWT 140 PVG PIPWP+ P+G+ G FDKS +P+L AY SGV+PD+RG Sbjct: 209 VNDLINTVNNIPVGVPIPWPTAIPPTGWLQCNGAAFDKSKFPQLVAAYSSGVLPDLRGEF 268 Query: 141 IKGKP-----ASGRAVLSQEQDGIKSHTHSASASS 170 I+G + R++LS + D +++ T + + Sbjct: 269 IRGWDSSRGVDTNRSILSTQIDTMQNITGKVDSHN 303 >UniRef50_Q7N2Q1 Similar to Sc/SvQ protein of Escherichia coli plasmid p15B n=1 Tax=Photorhabdus luminescens subsp. laumondii RepID=Q7N2Q1_PHOLL Length = 478 Score = 114 bits (285), Expect = 4e-24, Method: Composition-based stats. Identities = 47/139 (33%), Positives = 65/139 (46%), Gaps = 5/139 (3%) Query: 48 LIGWSGTSGAHAPAFIRSRRDTTDANWSPWAQLYTSAHPPAEFYPVGAPIPWPSDTVPSG 107 IG GT+G + + + T A VG+PIPWP VP+G Sbjct: 292 FIGIEGTAGNRLTIYANDENSNRKYTLATPEKSGTLATLDDINISVGSPIPWPLPNVPAG 351 Query: 108 YALMQGQTFDKSAYPKLAVAYPSGVIPDMRGWTIKGKPAS-----GRAVLSQEQDGIKSH 162 Y GQ+F+KS YP+LA+AYPSGV+PD+RG I+G GR VL+ + D I++ Sbjct: 352 YLACNGQSFNKSLYPQLAIAYPSGVLPDLRGEFIRGWDDGRGVDRGRGVLTHQGDAIRNI 411 Query: 163 THSASASSTDLGTETTSSF 181 T + F Sbjct: 412 TGYTPGTILRGNNSYGGCF 430 >UniRef50_C6C5D4 Tail Collar domain protein n=1 Tax=Dickeya dadantii Ech703 RepID=C6C5D4_DICDC Length = 557 Score = 113 bits (282), Expect = 9e-24, Method: Composition-based stats. 
Identities = 48/126 (38%), Positives = 69/126 (54%), Gaps = 9/126 (7%) Query: 81 YTSAHPPAEFYPVGAPIPWPSDTVPSGYALMQGQTFDKSAYPKLAVAYPSGVIPDMRGWT 140 TS AE G P+PWP T P+G+ GQ+FDK+ YPKL AYPSG +PD+RG Sbjct: 414 ITSGWFAAEI--AGIPLPWPQATAPTGWLKCNGQSFDKTLYPKLTAAYPSGTLPDLRGEF 471 Query: 141 IKGKPA-----SGRAVLSQE-QDGIKSHTHSASASSTDLGTETTSSFDYG-TKSTNNTGA 193 I+G SGRAVLS + I+ + S +A++T S+F+ + +N + Sbjct: 472 IRGWDDGRGVDSGRAVLSVQDATWIQPNIESNTAATTIRIDNVDSTFNTDEYSAVSNLPS 531 Query: 194 HTHSIS 199 + H+ S Sbjct: 532 YEHNGS 537 >UniRef50_B7US81 Predicted tail fiber protein n=16 Tax=root RepID=B7US81_ECO27 Length = 521 Score = 113 bits (281), Expect = 1e-23, Method: Composition-based stats. Identities = 39/121 (32%), Positives = 58/121 (47%), Gaps = 10/121 (8%) Query: 91 YPVGAPIPWPSDTVPSGYALMQGQTFDKSAYPKLAVAYPSGVIPDMRGWTIKGKP----- 145 PVG P+PW S T P+G+ G F YP+LA AYP+ +PD+RG I+G Sbjct: 386 LPVGVPVPWSSATPPTGWLKCNGAAFSSEMYPRLARAYPTNKLPDLRGEFIRGWDDGRGI 445 Query: 146 ASGRAVLSQEQDGIKSHTHSASASSTDLGTETTSSFDYGTKSTNNTGAHTHSISGTANSA 205 +GR +LS + SH D+G+ + + +Y +N G S +G + Sbjct: 446 DAGRTLLSGQDGTSFSHYGGN----FDIGSGHSIN-NYDQIVSNQPGFSRFSFAGPSRGD 500 Query: 206 G 206 G Sbjct: 501 G 501 >UniRef50_Q9MCR6 Tail fiber n=1 Tax=Enterobacteria phage HK97 RepID=Q9MCR6_BPHK7 Length = 321 Score = 113 bits (281), Expect = 1e-23, Method: Composition-based stats. Identities = 39/115 (33%), Positives = 55/115 (47%), Gaps = 10/115 (8%) Query: 91 YPVGAPIPWPSDTVPSGYALMQGQTFDKSAYPKLAVAYPSGVIPDMRGWTIKGKP----- 145 PVG P+PWPS T P+G+ G F YP+LA AYP+ +PD+RG I+G Sbjct: 167 LPVGVPVPWPSATPPTGWLKCNGAVFSAEEYPELAKAYPTNKLPDLRGEFIRGWDDGRGI 226 Query: 146 ASGRAVLSQEQDGIKSHTHSASASSTDLG-----TETTSSFDYGTKSTNNTGAHT 195 +GR +LS + D I++ T + T++ F K N G T Sbjct: 227 DAGREILSAQGDAIRNITGTFGDGETEVNASISFYRADGVFVTQKKLRNTIGNTT 281 >UniRef50_UPI0001B5347E putative variable tail fiber protein n=1 Tax=Shigella sp. D9 RepID=UPI0001B5347E Length = 550 Score = 113 bits (281), Expect = 1e-23, Method: Composition-based stats. 
Identities = 41/151 (27%), Positives = 67/151 (44%), Gaps = 14/151 (9%) Query: 91 YPVGAPIPWPSDTVPSGYALMQGQTFDKSAYPKLAVAYPSGVIPDMRGWTIKGKPAS--- 147 PVG P+PWPS T P+G+ G F YPKLA YP+ +PD+RG I+G S Sbjct: 390 LPVGVPVPWPSATPPTGWLKCNGAAFSAEEYPKLAKVYPTNKLPDLRGEFIRGWDDSRGI 449 Query: 148 --GRAVLSQEQD-----GIKSHTHSASASSTDLG---TETTSSFDYGTKSTNNTGAHTHS 197 GR++LS + ++ + ++ +G S G + G ++ Sbjct: 450 DTGRSLLSGQAATFIRTALQDYYGYDLNTNVKVGIAFATADSVITVGNPANPKAGNNSDY 509 Query: 198 ISGTANSA-GAHQHKSSGAFGGTNTSIFPNG 227 + +A+++ Q + F G S+ P Sbjct: 510 VPASADNSITGTQRTAEDNFTGAWISMRPRN 540 >UniRef50_B2PZV1 Putative uncharacterized protein n=2 Tax=Gammaproteobacteria RepID=B2PZV1_PROST Length = 526 Score = 111 bits (278), Expect = 2e-23, Method: Composition-based stats. Identities = 50/147 (34%), Positives = 69/147 (46%), Gaps = 16/147 (10%) Query: 91 YPVGAPIPWPSDTVPSGYALMQGQTFDKSAYPKLAVAYPSGVIPDMRGWTIKGKP----- 145 PVGAPIPWP T PSGY + GQ F+K+ YP L AYPSG +PD+RG I+G Sbjct: 376 CPVGAPIPWPQATAPSGYLICNGQAFNKTTYPLLTKAYPSGKLPDLRGEFIRGLDAGRNI 435 Query: 146 ASGRAVLSQEQDGIKSHTH-----SASASSTDLGTETTSSFDYGTKSTNNTGAHTHSISG 200 +GR VLS ++ + H H AS ++ G +T + G+ ST+ + Sbjct: 436 DNGRVVLSFQRCATEHHKHISGWGEASNANAIFG-KTVKNGYVGSASTDRD-----NYLF 489 Query: 201 TANSAGAHQHKSSGAFGGTNTSIFPNG 227 N Q + + G P Sbjct: 490 YTNDGSEFQGSNPNSTGIMANETRPRN 516 >UniRef50_C6CP84 Tail Collar domain protein n=2 Tax=Dickeya RepID=C6CP84_DICZE Length = 646 Score = 111 bits (278), Expect = 2e-23, Method: Composition-based stats. 
Identities = 38/95 (40%), Positives = 51/95 (53%), Gaps = 7/95 (7%) Query: 81 YTSAHPPAEFYPVGAPIPWPSDTVPSGYALMQGQTFDKSAYPKLAVAYPSGVIPDMRGWT 140 TS AE G P+PWP T P+G+ GQ+FDK YP+LA YPSGV+PD+RG Sbjct: 492 ITSGWFAAEL--AGIPLPWPQATAPTGWLKCNGQSFDKKLYPRLAQVYPSGVLPDLRGEF 549 Query: 141 IKGKP-----ASGRAVLSQEQDGIKSHTHSASASS 170 I+G + R +LS + D I++ S Sbjct: 550 IRGWDDGRGVDNNRGLLSSQGDTIRNIVASFVMDD 584 >UniRef50_C6DA10 Tail Collar domain protein n=7 Tax=Enterobacteriaceae RepID=C6DA10_PECCP Length = 689 Score = 111 bits (278), Expect = 3e-23, Method: Composition-based stats. Identities = 42/155 (27%), Positives = 62/155 (40%), Gaps = 9/155 (5%) Query: 64 RSRRDTTDANWSPWAQLYTSAHPPAEFYPV----GAPIPWPSDTVPSGYALMQGQTFDKS 119 R W P G P+P+P P+G+ GQ+FDKS Sbjct: 513 RVYIAAAGGAWRSVYHEGNLTPAAIGAMPASELAGIPLPFPGAVAPTGWLKCNGQSFDKS 572 Query: 120 AYPKLAVAYPSGVIPDMRGWTIKGKP-----ASGRAVLSQEQDGIKSHTHSASASSTDLG 174 YP LA YPSGV+PD+RG ++G + RA+LS + D I++ + + + Sbjct: 573 QYPILASRYPSGVLPDLRGEFVRGWDDGRGADASRALLSAQGDAIRNIVGTIGQLNDRVN 632 Query: 175 TETTSSFDYGTKSTNNTGAHTHSISGTANSAGAHQ 209 T T+ K T T G + A + Sbjct: 633 TTETAGVFDANKYTGAHSGLTGGNGGRIATFDASK 667 Score = 44.9 bits (104), Expect = 0.004, Method: Composition-based stats. 
Identities = 77/333 (23%), Positives = 109/333 (32%), Gaps = 57/333 (17%) Query: 21 VYNNGYPTAYGNIIHLKGMTAVGEGELLIGWSGTSGAHAPAFIRSRRDTTDANWSPWAQL 80 VY YP A +IH G A +G G G R+ RD S WA++ Sbjct: 381 VYRAQYPGAGQMLIHFHGAGASCPSLQFLGEYGNGGLS----YRTARDGMGFEHS-WAKI 435 Query: 81 YTSAHPPAEFYPVGAPIPWPSDTVPSGYALMQGQTFDKSAYPKLAVAYPSGVIPDMRGWT 140 YT+ P VGA +P + G + G P A GV+PD Sbjct: 436 YTTQFKPTA-ADVGA-LPIAGGALQGGIRIGAGN----IDLP--ARRAVVGVMPD----- 482 Query: 141 IKGKPASGRAVLSQEQDGIKSHTHSASAS---STDLGTETT------SSFDYGTKSTNNT 191 S R +LS D + S++ +TD S + G + Sbjct: 483 -----ESYRQMLSLSPDNTVVFGNPNSSAVIHTTDRVYIAAAGGAWRSVYHEGNLTPAAI 537 Query: 192 GAHTHS--------ISGTANSAGAHQHKSSGAFGGTNTSIFPNGYTAISNLSAGIMST-- 241 GA S G G + + + + G Sbjct: 538 GAMPASELAGIPLPFPGAVAPTGWLKCNGQSFDKSQYPILASRYPSGVLPDLRGEFVRGW 597 Query: 242 TSGSGQTRNAGKTSSDGAHTHSLSGT----------AASAGAHAHTVGIGAHTHSVAIGS 291 G G + S+ G ++ GT +AG GAH+ + G+ Sbjct: 598 DDGRGADASRALLSAQGDAIRNIVGTIGQLNDRVNTTETAGVFDANKYTGAHS-GLTGGN 656 Query: 292 HGHTITVNAAG----NAENTVKNIAFNYIVRLA 320 G T +A+ AEN +NIAFNYIVR A Sbjct: 657 GGRIATFDASKVVPTAAENRPRNIAFNYIVRAA 689 >UniRef50_D2BT14 Tail Collar domain protein n=1 Tax=Dickeya dadantii Ech586 RepID=D2BT14_DICD5 Length = 534 Score = 111 bits (276), Expect = 4e-23, Method: Composition-based stats. 
Identities = 54/201 (26%), Positives = 89/201 (44%), Gaps = 27/201 (13%) Query: 33 IIHLKGMTAVGEG-ELLIGWSGTSGAHAPAFIRSRRDTTDANWSPWAQLYTSAHPPAEFY 91 ++ G + +G ++ +GW+G+ +R + D T ++ W + + P Sbjct: 339 VVRAGGGNGMADGHQISLGWTGSG-------LRVQVDAT--SFDLWHKD--NVFPIHAAE 387 Query: 92 PVGAPIPWPSDTVPSGYALMQGQTFDKSAYPKLAVAYPSGVIPDMRGWTIKGKPAS---- 147 VG P+P+P T P G+ GQ+F+K+A+P LA YPSG +PD+RG I+G S Sbjct: 388 IVGIPLPYPGATAPDGWLKCNGQSFNKAAFPLLAQRYPSGFLPDLRGEFIRGWDDSRGVD 447 Query: 148 -GRAVLSQEQDGIKSHTHSASAS--------STDLGTETTSSFDYGTKSTNNT--GAHTH 196 GR +LS ++ +H+H + + FDY + N T H Sbjct: 448 PGRGLLSFQESQNLTHSHGVNDPGHSHPYNKYEGSVGSGLAGFDYDQDAWNATVYTGHVG 507 Query: 197 SISGTANSAGAHQHKSSGAFG 217 + A S G + AF Sbjct: 508 TGISIAASGGHEARPRNIAFN 528 Score = 48.4 bits (113), Expect = 3e-04, Method: Composition-based stats. Identities = 29/127 (22%), Positives = 48/127 (37%) Query: 194 HTHSISGTANSAGAHQHKSSGAFGGTNTSIFPNGYTAISNLSAGIMSTTSGSGQTRNAGK 253 + S + A A ++ S I + + G++S T + G Sbjct: 408 NGQSFNKAAFPLLAQRYPSGFLPDLRGEFIRGWDDSRGVDPGRGLLSFQESQNLTHSHGV 467 Query: 254 TSSDGAHTHSLSGTAASAGAHAHTVGIGAHTHSVAIGSHGHTITVNAAGNAENTVKNIAF 313 +H ++ + +G A +V G G I++ A+G E +NIAF Sbjct: 468 NDPGHSHPYNKYEGSVGSGLAGFDYDQDAWNATVYTGHVGTGISIAASGGHEARPRNIAF 527 Query: 314 NYIVRLA 320 NYIVR A Sbjct: 528 NYIVRAA 534 >UniRef50_B3I8J5 DNA inversion product n=3 Tax=Enterobacteriaceae RepID=B3I8J5_ECOLX Length = 263 Score = 111 bits (276), Expect = 5e-23, Method: Composition-based stats. 
Identities = 42/151 (27%), Positives = 70/151 (46%), Gaps = 14/151 (9%) Query: 91 YPVGAPIPWPSDTVPSGYALMQGQTFDKSAYPKLAVAYPSGVIPDMRGWTIKGKPAS--- 147 PVGAP+PWPS+T P+G+ G F YP+LA AYP+ +PD+RG I+G S Sbjct: 103 LPVGAPVPWPSETPPTGWLKCNGAAFSAEEYPELAKAYPTNKLPDLRGEFIRGWDDSRGI 162 Query: 148 --GRAVLSQEQD-----GIKSHTHSASASSTDLG---TETTSSFDYGTKSTNNTGAHTHS 197 GR++LS + ++ + ++ +G S G + G ++ Sbjct: 163 DTGRSLLSGQAATFIRTALQDYYGYDLNTNVKVGIAFATADSVITVGNPANPKAGNNSDY 222 Query: 198 ISGTANSA-GAHQHKSSGAFGGTNTSIFPNG 227 + +A+++ Q + F G S+ P Sbjct: 223 VPASADNSITGTQRTAEDNFTGAWISMRPRN 253 >UniRef50_D2BYH6 Tail Collar domain protein n=1 Tax=Dickeya dadantii Ech586 RepID=D2BYH6_DICD5 Length = 198 Score = 110 bits (275), Expect = 6e-23, Method: Composition-based stats. Identities = 32/83 (38%), Positives = 44/83 (53%), Gaps = 5/83 (6%) Query: 93 VGAPIPWPSDTVPSGYALMQGQTFDKSAYPKLAVAYPSGVIPDMRGWTIKGKP-----AS 147 VG P WP P G+ GQ FDK+ YP+LA YP+G +PD+RG I+G + Sbjct: 59 VGIPQAWPLADAPEGWLKCNGQAFDKTKYPQLAKLYPAGTLPDLRGEFIRGWDDGRGVDT 118 Query: 148 GRAVLSQEQDGIKSHTHSASASS 170 R +LS + ++SH H S Sbjct: 119 NRQILSAQSGMLESHNHMMPVSD 141 >UniRef50_C7BSQ1 Side tail fiber protein homolog from lambdoid prophage e14 n=3 Tax=Photorhabdus RepID=C7BSQ1_PHOAA Length = 166 Score = 109 bits (271), Expect = 2e-22, Method: Composition-based stats. 
Identities = 49/142 (34%), Positives = 63/142 (44%), Gaps = 9/142 (6%) Query: 89 EFYPVGAPIPWPSDTVPSGYALMQGQTFDKSAYPKLAVAYPSGVIPDMRGWTIKGKPAS- 147 E PVG P+PWP+D P G+ G FDK YPKLAVAYPSG +PD+RG I+G Sbjct: 7 EEIPVGIPLPWPTDIPPYGWVKCNGAIFDKYLYPKLAVAYPSGNLPDLRGEFIRGWDDGR 66 Query: 148 ----GRAVLSQEQDGIKSHTHSASASSTDLGTETTSSFDYGTKSTNNTGAHTHSISG-TA 202 GR VLS + I H+H ++ GT S + G Sbjct: 67 GVDIGRYVLSTQLADIAPHSHRIGRMWSNSNAGAEG---LGTPSRILNSVYQGVNYGIDT 123 Query: 203 NSAGAHQHKSSGAFGGTNTSIF 224 G SG FG + ++ Sbjct: 124 RGLGIAIGMGSGGFGYMDNAVA 145 >UniRef50_C6CG98 Tail Collar domain protein n=1 Tax=Dickeya zeae Ech1591 RepID=C6CG98_DICZE Length = 196 Score = 108 bits (270), Expect = 2e-22, Method: Composition-based stats. Identities = 34/134 (25%), Positives = 55/134 (41%), Gaps = 14/134 (10%) Query: 63 IRSRRDTTDANWSPWAQLYTSAHPPAEFYP---------VGAPIPWPSDTVPSGYALMQG 113 ++ + D N + + + YP +G P PWP P G+ G Sbjct: 20 YKNSQTHNDGNLHGCCRCHGEQQYAPDIYPASTDGLKELIGIPQPWPLAEAPEGWLKCNG 79 Query: 114 QTFDKSAYPKLAVAYPSGVIPDMRGWTIKGKP-----ASGRAVLSQEQDGIKSHTHSASA 168 QTFD + YP+LA YP+G +PD+RG I+G + R +LS + Sbjct: 80 QTFDTAKYPQLAKLYPAGTLPDLRGEFIRGWDDERGVDTDRKLLSAQAGTHILGDDGGYP 139 Query: 169 SSTDLGTETTSSFD 182 + +G + + D Sbjct: 140 TLNSIGNLSECNAD 153 >UniRef50_C6CGA4 Tail Collar domain protein n=7 Tax=Enterobacteriaceae RepID=C6CGA4_DICZE Length = 401 Score = 108 bits (269), Expect = 3e-22, Method: Composition-based stats. 
Identities = 42/148 (28%), Positives = 66/148 (44%), Gaps = 7/148 (4%) Query: 93 VGAPIPWPSDTVPSGYALMQGQTFDKSAYPKLAVAYPSGVIPDMRGWTIKGKPAS----- 147 VG P+PWP T P+G+ GQ FDK+A+PKLA AYP GV+PD+RG I+G Sbjct: 248 VGIPLPWPQATPPAGWLKCNGQAFDKNAFPKLAQAYPGGVLPDLRGEFIRGWDDGRGVDV 307 Query: 148 GRAVLSQEQDGIKSHTHSASASSTDLGTETTSSFDYGTKSTNNTGAHTHSISGTANSAGA 207 R +LS ++ + + S+ ++G ++ D + + A Sbjct: 308 ARELLSWQKGTLT--ISDPNLSAVNVGALIHANNDSANTYKSMGFDIVNKSDYAMLRAAI 365 Query: 208 HQHKSSGAFGGTNTSIFPNGYTAISNLS 235 + +N F G T N++ Sbjct: 366 NVETVGAQDLDSNGWQFGYGATRPRNIA 393 >UniRef50_C6C6Z0 Tail Collar domain protein n=1 Tax=Dickeya dadantii Ech703 RepID=C6C6Z0_DICDC Length = 183 Score = 108 bits (269), Expect = 3e-22, Method: Composition-based stats. Identities = 36/146 (24%), Positives = 57/146 (39%), Gaps = 13/146 (8%) Query: 93 VGAPIPWPSDTVPSGYALMQGQTFDKSAYPKLAVAYPSGVIPDMRGWTIKGKP-----AS 147 +G P PWP P G+ GQ FD + YP+LA YPSG +PD+RG I+G + Sbjct: 46 IGIPQPWPLADAPEGWLKCNGQAFDTAKYPELAKCYPSGTLPDLRGEFIRGWDDGRGVDT 105 Query: 148 GRAVLSQEQDGIKSHTHSASASSTDLGTETTSSFDYGTKSTNNTGAHTHSISGTANSAGA 207 R ++S + + + S +G T D + ++ SI + Sbjct: 106 SRELVSAQSGTYITGDSDSQPSVQGIGNITECHVD-------SPDSNARSIYWIPATKTD 158 Query: 208 HQHKSSGAFGGTNTSIFPNGYTAISN 233 + +G T Y + Sbjct: 159 -RLTGPTYWGVTRPRNISFNYIVKAG 183 >UniRef50_C5J9F2 Putative uncharacterized protein (Fragment) n=1 Tax=Erwinia phage phiAT1 RepID=C5J9F2_9VIRU Length = 240 Score = 108 bits (268), Expect = 4e-22, Method: Composition-based stats. 
Identities = 32/90 (35%), Positives = 43/90 (47%), Gaps = 8/90 (8%) Query: 88 AEFYPVGAPIPWPSDTVPSGYALMQGQTFDKSAYPKLAVAYPSGVIPDMRGWTIKGK--- 144 P+GA IPWP TVP G+ GQ F+ PKL V+PD RG ++G Sbjct: 150 PRLVPIGAVIPWPGATVPDGWLECSGQVFNTGQNPKLYSVLGRNVVPDYRGLFLRGWAHG 209 Query: 145 -----PASGRAVLSQEQDGIKSHTHSASAS 169 P +GRA+ S + D I++ T A Sbjct: 210 SDANDPDAGRALGSVQGDAIRNITGYFPAD 239 >UniRef50_D0KLI8 Tail Collar domain protein n=1 Tax=Pectobacterium wasabiae WPP163 RepID=D0KLI8_PECWW Length = 621 Score = 107 bits (267), Expect = 5e-22, Method: Composition-based stats. Identities = 38/97 (39%), Positives = 53/97 (54%), Gaps = 7/97 (7%) Query: 84 AHPPAEFYPVGAPIPWPSDTVPSGYALMQGQTFDKSAYPKLAVAYPSGVIPDMRGWTIKG 143 A P AE VG P +P P+G+ GQ FD + YP LA YPSG +PD+RG ++G Sbjct: 463 AMPSAEL--VGMPQVFPGAVAPAGWLKCNGQQFDTAQYPILASRYPSGFLPDLRGEFVRG 520 Query: 144 KP-----ASGRAVLSQEQDGIKSHTHSASASSTDLGT 175 +GRA+LS++ D I++ T + AS G Sbjct: 521 WDDERGVDAGRALLSEQGDAIRNITGTMRASDVPYGH 557 >UniRef50_B7NJP1 Putative side tail fiber protein homolog from lambdoid prophage n=3 Tax=Escherichia coli RepID=B7NJP1_ECO7I Length = 686 Score = 106 bits (264), Expect = 9e-22, Method: Composition-based stats. Identities = 36/101 (35%), Positives = 53/101 (52%), Gaps = 7/101 (6%) Query: 91 YPVGAPIPWPSDTVPSGYALMQGQTFDKSAYPKLAVAYPSGVIPDMRGWTIKGKP----- 145 PVG P+PWP+ T P G+ G+ F K YP LA AYP+ +PD+RG I+G Sbjct: 536 LPVGVPVPWPTATPPEGWLKCDGRAFTKEQYPVLARAYPTLRLPDLRGEFIRGWDDGRKI 595 Query: 146 ASGRAVLSQEQDGIKSHTHSASASSTDLGTETTS-SFDYGT 185 GR +LS ++ + H + S+ D+ + + DYG Sbjct: 596 DEGRKLLSWQKGTLV-GGHDDNDSALDISYMSNGNNIDYGG 635 >UniRef50_B1M1N8 Tail Collar domain protein n=1 Tax=Methylobacterium radiotolerans JCM 2831 RepID=B1M1N8_METRJ Length = 414 Score = 105 bits (261), Expect = 2e-21, Method: Composition-based stats. 
Identities = 71/276 (25%), Positives = 115/276 (41%), Gaps = 52/276 (18%) Query: 74 WSPWAQLYTSAHPPAEFYPVGAPIPWPSDTVPSGYALMQGQTFDKSAYPKLAVAYPSGV- 132 + P AQ+Y + P E GA + VPSG+ + G+ ++AY L +G Sbjct: 128 YDPVAQVYRTLSPTTE--QAGAIKAFAGPNVPSGWEICDGRAVSRTAYAALFATISTGWG 185 Query: 133 ---------IPDMRGWTIKG-KPASGRAVLSQEQDG-----------------IKSHTHS 165 +PD RG T+ G +GR + DG + SH H+ Sbjct: 186 NGDGFTTFNLPDARGRTLFGANRGTGRLTAAGGLDGSLGNMGGADQVVMLAPQMPSHIHT 245 Query: 166 ASASSTDL---GTETTSSFDYGTKSTNNTGAHTHSI----------SGTANSAGAHQHKS 212 ++ S + + D+G T G H HS GT +++G H H Sbjct: 246 STMSPAGFFEPEIQKAGAHDHG--GTKVGGDHAHSGTTGLSGTHTHGGTTDTSGDHAHVV 303 Query: 213 SGAFGGTNTSIFPNGYTAISNLSAGIMST--TSGSGQTRNAGKTSSDGAHTHSLSGTAAS 270 +G +T PN ++ ++ G T+ SG ++ T G HTH+ S Sbjct: 304 QYGYGLVSTQT-PNNAQVVTGINLGSQGNGQTTQSGPHQHTFTTGQGGNHTHAFS--TDP 360 Query: 271 AGAHAHTVGI-GAHTHSV-AIGSHGHTITVNAAGNA 304 G+HAH + + G HTH++ +H HT+ ++AAG+ Sbjct: 361 GGSHAHEIPVDGDHTHTIDPTPNHVHTLVIDAAGSG 396 >UniRef50_C4S5W0 Putative uncharacterized protein n=2 Tax=Yersinia bercovieri ATCC 43970 RepID=C4S5W0_YERBE Length = 388 Score = 105 bits (261), Expect = 2e-21, Method: Composition-based stats. Identities = 43/102 (42%), Positives = 57/102 (55%), Gaps = 7/102 (6%) Query: 88 AEFYPVGAPIPWPSDTVPSGYALMQGQTFDKSAYPKLAVAYPSGVIPDMRGWTIKGKP-- 145 AE +G PIP+P +VP GY G F YPKLA+ YPSGV+PDMRG I+G Sbjct: 238 AERELIGIPIPYPLPSVPVGYLKCNGAAFSTVTYPKLALKYPSGVLPDMRGNAIRGWDDG 297 Query: 146 ---ASGRAVLSQEQDGIKSHTHS--ASASSTDLGTETTSSFD 182 +GRA+LSQ+ D +++ T + S G TT +F Sbjct: 298 RGVDAGRALLSQQLDALQNITGNFYMGGSKQVAGVVTTGAFG 339 >UniRef50_D0FSD9 Phage related-protein n=2 Tax=Erwinia pyrifoliae RepID=D0FSD9_ERWPY Length = 311 Score = 105 bits (261), Expect = 3e-21, Method: Composition-based stats. 
Identities = 68/251 (27%), Positives = 101/251 (40%), Gaps = 59/251 (23%) Query: 72 ANWSPWAQLYTSAHPPAEFYPVGAPIPWPSDTVPSGYALMQGQTFD---KSAYPKLAVAY 128 S W + +P YP+G + P+ L G T+ ++ +LA A Sbjct: 110 GKGSGWVEFKADVNPVDMLYPIGIVTWFAQKKDPN--KLFPGTTWKYIGENRTIRLASAN 167 Query: 129 PSGVIPDMRGWTIKGKPASGRAVLSQEQDGIKSHTHSASASSTDLGTETTSSFDYGTKST 188 S V+ T G + AV + G HT SA+ S D GT+ TS+FDYG K T Sbjct: 168 GSDVM------TTGGSDSVTLAVGNIPAHG---HTFSANTGSFDYGTKGTSTFDYGNKVT 218 Query: 189 NNTGAHTHSISGTANSAGAHQHKSSGAFGGTNTSIFPNGYTAISNLSAGIMSTTSGSGQT 248 + G+HTHS + + P G + + GI TT Sbjct: 219 DTQGSHTHSYN----------------------EVIPRGASGMD--IGGIWETTIRGSD- 253 Query: 249 RNAGKTSSDGAHTHSLSGTAASAGAHAHTVGIGAHTHSVAIGSHGHTITVNAAGNAENTV 308 TS+ GAH H+++ IGAH H+V IG+H H+++ A T Sbjct: 254 -----TSTAGAHAHNVA--------------IGAHGHTVEIGAHSHSVSGTTANTGAGTA 294 Query: 309 KNIAFNYIVRL 319 N+ N ++L Sbjct: 295 INVT-NAFIKL 304 >UniRef50_C5H7L3 Putative tail fiber protein n=1 Tax=Enterobacteria phage WV8 RepID=C5H7L3_9CAUD Length = 848 Score = 104 bits (260), Expect = 3e-21, Method: Composition-based stats. Identities = 62/218 (28%), Positives = 91/218 (41%), Gaps = 42/218 (19%) Query: 78 AQLYTSAHPPAEFYPVGAPIPWPSDTVPSGYALMQGQTFDKSAYPKLAVAYPSGVIPDMR 137 A+ + + + YPVG + S+ P+ +A P L Y + + Sbjct: 636 AEAISDSTDLNKIYPVGIVTWFNSNVNPN------------TALPGLTWTYLNNGV---- 679 Query: 138 GWTIKGKPASGRAVLSQEQ--------DGIKSHTHSASASSTDLGTETTSSFDYGTKSTN 189 G TI+ A+G V + + SHTHS SA TTSSFDYGTK+++ Sbjct: 680 GRTIRIAAANGSDVATTGGSDSVTLSVGNLPSHTHSFSA--------TTSSFDYGTKTSS 731 Query: 190 NTGAHTHSISGTANSAGAHQHKSSGAFGGTNTSIFPNGYTAISNLSAGIMSTTSGSGQTR 249 TG H H+ GT G+ + S A YTA G + + G Sbjct: 732 TTGNHNHN-RGTMEITGSFGYFRSDASSF---------YTASGAFYLGSQAGSKGYTGNN 781 Query: 250 NAGKTSSDGAHTHSLSGTAASAGAHAHTVGIGAHTHSV 287 + + + SG + G H+HTVGIGAH+H+V Sbjct: 782 FTNGIPVNFNASRNWSGVTNTTGNHSHTVGIGAHSHTV 819 >UniRef50_C4TTI1 Tail fiber protein n=2 Tax=Yersinia RepID=C4TTI1_YERKR Length = 402 Score = 100 bits (249), Expect = 6e-20, Method: Composition-based stats. 
Identities = 45/111 (40%), Positives = 58/111 (52%), Gaps = 10/111 (9%) Query: 93 VGAPIPWPSDTVPSGYALMQGQTFDKSAYPKLAVAYPSGVIPDMRGWTIKGKPASGRAV- 151 +G PIPWP P+GY G F+K+ YPKLA+AYPSGV+PD+RG I+G GR V Sbjct: 277 IGTPIPWPLTIAPAGYLKCNGAPFNKTQYPKLALAYPSGVLPDLRGEFIRGFDD-GRGVR 335 Query: 152 -----LSQEQDGIKSHTHSASA--SSTDLGTETTSSF-DYGTKSTNNTGAH 194 L + I+SH H + G T + F STNN+G Sbjct: 336 PNQPLLGWQGSEIQSHNHGITNFEIRGVTGGPTNAWFPSTNGISTNNSGGD 386 >UniRef50_Q65WH4 Putative uncharacterized protein n=1 Tax=Mannheimia succiniciproducens MBEL55E RepID=Q65WH4_MANSM Length = 296 Score = 99 bits (247), Expect = 1e-19, Method: Composition-based stats. Identities = 36/122 (29%), Positives = 55/122 (45%), Gaps = 10/122 (8%) Query: 82 TSAHPPAEFYPVGAPIPWPSDTVPSGYALMQGQTFDKSAYPKLAVAYPSGVIPDMRGWTI 141 A + +G P P+P VP G GQTF + YP+LA YPSG +PD+RG I Sbjct: 129 NEAFSTLKNLLIGIPFPYPLSAVPDGCLAFNGQTFSTTTYPELAKKYPSGRLPDLRGEFI 188 Query: 142 KGKP-----ASGRAVLSQEQDGIKSHTHSASASSTDLGTETTSSFDYGTKSTNNTGAHTH 196 +G S R +L + + +HTH + + SS ++G K + + + Sbjct: 189 RGWDNGRGVDSSRELLRSQGAELSAHTHYVTVT-----RYANSSGEFGAKISTFSAINNS 243 Query: 197 SI 198 Sbjct: 244 GW 245 >UniRef50_Q6D2U8 Bacteriophage tail fiber protein n=1 Tax=Pectobacterium atrosepticum RepID=Q6D2U8_ERWCT Length = 619 Score = 99.6 bits (246), Expect = 1e-19, Method: Composition-based stats. Identities = 29/79 (36%), Positives = 44/79 (55%), Gaps = 5/79 (6%) Query: 93 VGAPIPWPSDTVPSGYALMQGQTFDKSAYPKLAVAYPSGVIPDMRGWTIKGKP-----AS 147 G P+P+P P+GY GQ FD + +P LA YPSG +PD+RG ++G + Sbjct: 464 AGIPLPFPGAVAPAGYLKCNGQQFDTAQFPVLASRYPSGFLPDLRGEFVRGWDDGRGIDT 523 Query: 148 GRAVLSQEQDGIKSHTHSA 166 RA++S + D I++ S Sbjct: 524 VRALMSAQGDAIRNIVGSL 542 >UniRef50_Q0BEK5 Phage Tail Collar domain protein n=1 Tax=Burkholderia ambifaria AMMD RepID=Q0BEK5_BURCM Length = 735 Score = 98.8 bits (244), Expect = 2e-19, Method: Composition-based stats. 
Identities = 75/276 (27%), Positives = 107/276 (38%), Gaps = 81/276 (29%) Query: 79 QLYTSAHPPAEFYPVGA-PIPWPSDTVP-SGYALMQGQTFDKSAYPKL-AVAYPSG---- 131 +L TSA A V I W + T P +G+ + G ++ YP L A A SG Sbjct: 503 KLITSAWFAAAVADVQIGQIVWEARTAPRAGFLKLNGTELKRADYPLLWAYAQGSGALVA 562 Query: 132 ---------------------VIPDMRGWTIKGKPASG-----RAVLSQEQDGIKSHTHS 165 +PD+RG I+ + R + S + + H H Sbjct: 563 DADWGKGRHGCFSSGDGNTTFRLPDLRGEFIRCWDDARGTDAQRQIGSWQDSLNRLHAHG 622 Query: 166 ASASSTDLGTETTSSFDYGTKSTNNTGAHTHSISGTANSAGAHQHKSSGAFGGTNTSIFP 225 ASA++ +G + ++ T++ G H HSI + H H A GG Sbjct: 623 ASAAA--VGDHSHGAW------TDSQGWHGHSI-----NDPGHDHGIPVASGG------- 662 Query: 226 NGYTAISNLSAGIMSTTSGSGQTRNAGKTSSDGAHTHSLSGTAASA---GAHAHTVGIGA 282 GY NL+ G G G R G SGT S GAH H VGIG Sbjct: 663 -GYIGEINLNGG------GRGDKRTTG------------SGTGISINGDGAHGHNVGIGG 703 Query: 283 HTHSVAIGSHGHTITVNAAGNAENTVKNIAFNYIVR 318 G+H HTI++ A G E+ +N+A ++R Sbjct: 704 ------AGAHSHTISIGADGGNESRPRNVALLVMIR 733 >UniRef50_D0KGE1 Shufflon domain protein n=1 Tax=Pectobacterium wasabiae WPP163 RepID=D0KGE1_PECWW Length = 532 Score = 98.8 bits (244), Expect = 2e-19, Method: Composition-based stats. Identities = 36/129 (27%), Positives = 57/129 (44%), Gaps = 9/129 (6%) Query: 84 AHPPAEFYPVGAPIPWPSDTVPSGYALMQGQTFDKSAYPKLAVAYPSGVIPDMRGWTIKG 143 A + G W + P G+ + GQ F+ S P LA YPS +PD RG+ +G Sbjct: 383 AWKSSSSIQPGTITMWGTPVPPEGWLELNGQPFNPSGNPVLASLYPSSQVPDFRGYFPRG 442 Query: 144 KPA------SGRAVLSQEQDGIKSHTHS-ASASSTDLGTETTSSFDYGTKSTNNTGAHTH 196 RA+LS + D I++ T S++ G SS YG +N+G+ Sbjct: 443 WDNGAGIDPDSRAILSVQGDAIRNITGEFNPGGSSNWGKGVFSS--YGWPYPSNSGSAND 500 Query: 197 SISGTANSA 205 + T +++ Sbjct: 501 ASIITFDAS 509 >UniRef50_C5ABB4 Putative uncharacterized protein n=1 Tax=Burkholderia glumae BGR1 RepID=C5ABB4_BURGB Length = 670 Score = 97.7 bits (241), Expect = 5e-19, Method: Composition-based stats. 
Identities = 60/274 (21%), Positives = 90/274 (32%), Gaps = 57/274 (20%) Query: 79 QLYTSAHPPAEF--YPVGAPIPWPSDTVPSGYALMQGQTFDKSAYPKL-AVAYPSG---- 131 + T+ A +G + + +GY G + ++ YP L A A SG Sbjct: 418 RFATTEWVTAAIGTASIGQIVMEARTSPRAGYVKCDGSQYKRADYPALWAYAQASGALVS 477 Query: 132 ---------------------VIPDMRGWTIK------GKPASGRAVLSQEQDGIKSHTH 164 +PD+RG ++ G GRA+ S + ++H H Sbjct: 478 EAEYTDGRWGGFSTADGQTYFRVPDLRGEFLRCWSDGRGDVDPGRAIGSFQGGQNQAHAH 537 Query: 165 SASASSTDLGTETTSSFDYGTKSTNNTGAHTHSISGTANSAGAHQHKSSGAFGGTNTSIF 224 AS+ + G HS G G H H + G + + Sbjct: 538 GASSDPDGAHVHDAWTGGAGW----------HSHHGVTGGGGMHNHAN-----GVFSRLL 582 Query: 225 PNGYTAISNLSAGIMSTTSGSGQTRNAGKTSSDGAHTHSLSGTAASAGAHAHTVGIGAHT 284 Y S S + ++ + G H H AG H H VGIG Sbjct: 583 RPPYLGSLTGSDTDGSGNEQAVGGGDSADIAWAGEHQHEF--WTDGAGDHVHAVGIGN-- 638 Query: 285 HSVAIGSHGHTITVNAAGNAENTVKNIAFNYIVR 318 G H H I V A G AE +N+A ++R Sbjct: 639 ----AGGHAHAIHVQADGGAEARPRNVALLAMIR 668 >UniRef50_Q2T5M0 Phage-related tail fiber protein n=19 Tax=root RepID=Q2T5M0_BURTA Length = 790 Score = 96.1 bits (237), Expect = 2e-18, Method: Composition-based stats. 
Identities = 72/292 (24%), Positives = 102/292 (34%), Gaps = 68/292 (23%) Query: 60 PAFIRSRRDTTDANWSPWAQLYTSAHPPAEFYPVGAPIPWPSDTVPSGYALMQGQTFDKS 119 PA RS R T + W + SA +G + P TV G+ G +++ Sbjct: 532 PAADRSTRAAT----TEWVRTVLSATT------IGQIVFEPRTTVRPGFLKANGVLVNRA 581 Query: 120 AYPKLAVAY---------------------------PSGVIPDMRGWTIK------GKPA 146 YP+L AY + +P++RG I+ G Sbjct: 582 DYPEL-WAYAQASGALVSDADWMKDRWGCFSTGDGATTFRLPELRGEFIRCWSDARGGVD 640 Query: 147 SGRAVLSQEQDGIKSHTHSASASSTDLGTETTSSFDYGTKSTNNTGAHTHSISGTANSAG 206 + R + + + D +H H A+AS T T+ G H H G N+ G Sbjct: 641 ATRQIGAFQGDQNHTHAHGAAASEAPDHVHTA--------WTDVQGWHGH--HGWTNAVG 690 Query: 207 AHQHKSSGAFGGTNTSIFPNGYTAISNLSAGIMSTTSGSGQTRNAGKTSSDGAHTHSLSG 266 HQH S +G P T + + G GS G TS G H H + Sbjct: 691 DHQHVSP--WGEHPQMYNPPWGTWGAANNRGA----EGSDNDNVYGMTSPAGNHNHEFN- 743 Query: 267 TAASAGAHAHTVGIGAHTHSVAIGSHGHTITVNAAGNAENTVKNIAFNYIVR 318 G H H VG G H HTI V G E +N+A ++R Sbjct: 744 -TEGNGNHGHAVG------IGGGGRHAHTIAVQPDGGDEARPRNVALLALIR 788 >UniRef50_Q7NAA0 Complete genome; segment 1/17 n=2 Tax=Photorhabdus RepID=Q7NAA0_PHOLL Length = 351 Score = 95.7 bits (236), Expect = 2e-18, Method: Composition-based stats. Identities = 35/68 (51%), Positives = 43/68 (63%), Gaps = 5/68 (7%) Query: 93 VGAPIPWPSDTVPSGYALMQGQTFDKSAYPKLAVAYPSGVIPDMRGWTIKGKP-----AS 147 VG P+PW T P+GY + GQ FDKS YPKL AYPSG +PD+RG I+G S Sbjct: 201 VGIPLPWSKPTAPAGYLICSGQQFDKSMYPKLGEAYPSGALPDLRGEFIRGWDNGRSIDS 260 Query: 148 GRAVLSQE 155 GR +LS + Sbjct: 261 GREILSHQ 268 >UniRef50_UPI0001A44C27 bacteriophage tail fiber protein n=2 Tax=Pectobacterium carotovorum subsp. carotovorum WPP14 RepID=UPI0001A44C27 Length = 195 Score = 95.4 bits (235), Expect = 3e-18, Method: Composition-based stats. 
Identities = 36/117 (30%), Positives = 49/117 (41%), Gaps = 12/117 (10%) Query: 71 DANWSPWAQLYTSAHPPAEFY-------PVGAPIPWPSDTVPSGYALMQGQTFDKSAYPK 123 + N W + + + + VG P P P T P G+ GQ+FD S YP Sbjct: 59 EGNGGRWRREFNTENLTPSSIGAIQGNELVGIPQPCPLVTAPEGWLACAGQSFDTSRYPV 118 Query: 124 LAVAYPSGVIPDMRGWTIKGKP-----ASGRAVLSQEQDGIKSHTHSASASSTDLGT 175 LA YP G +PD+RG I+G +GR LS + + HTH G Sbjct: 119 LASRYPQGRLPDLRGEFIRGWDNGRGVDTGRGNLSSQSFSTEPHTHDGGTLGLGSGA 175 >UniRef50_A6VBH2 Putative tail fiber protein n=2 Tax=Pseudomonas aeruginosa PA7 RepID=A6VBH2_PSEA7 Length = 654 Score = 93.8 bits (231), Expect = 8e-18, Method: Composition-based stats. Identities = 55/266 (20%), Positives = 96/266 (36%), Gaps = 70/266 (26%) Query: 69 TTDANWSPWAQLYTSAHP-PAEFYPVGAPIPWPSDTVPSGYALMQGQTFDKSAYPKLAVA 127 + + W+P+ +++ S + P P GA + + + P+GY G ++AY L Sbjct: 442 SGNYTWAPFLEIWHSGNLNPQAIVPAGAVVAFAMYSPPAGYLKANGAAVSRTAYAALFAT 501 Query: 128 ----YPSG------VIPDMRGWTIKGKPAS-----GRAVLSQEQDGIKSHTHSASASSTD 172 Y +G +PD RG ++ GR + + + +HTH AS Sbjct: 502 IGTYYGAGDGSTTFNLPDYRGEFLRALDDGRGLDLGRQLGTLQSSQNLAHTHGAS----- 556 Query: 173 LGTETTSSFDYGTKSTNNTGAHTHSISGTANSAGAHQHKSSGAFGGTNTSIFPNGYTAIS 232 ++ G HTH T A S + N + +G + Sbjct: 557 ---------------SSGNGGHTH----TVTGTAAAAGAHSHSIASVNATALVSGTRLAT 597 Query: 233 NLSAGIMSTTSGSGQTRNAGKTSSDGAHTHSLSGTAASAGAHAHTVGIGAHTHSVAIGSH 292 + ST T G HTH+++G AA G +H Sbjct: 598 LVGNASNST------------TDVAGDHTHAVTGVAALEG------------------TH 627 Query: 293 GHTITVNAAGNAENTVKNIAFNYIVR 318 HTI V ++G +E +N++ ++ Sbjct: 628 NHTIYVESSGGSEARPRNVSVLICIK 653 >UniRef50_A9IRI0 Phage related protein n=9 Tax=Bartonella RepID=A9IRI0_BART1 Length = 324 Score = 93.4 bits (230), Expect = 9e-18, Method: Composition-based stats. 
Identities = 50/207 (24%), Positives = 73/207 (35%), Gaps = 62/207 (29%) Query: 73 NWSPWAQLYTSAHPPAEFYPVGAPIPWPSDTVPSGYALMQGQTFDKSAYPKLAVAYP--- 129 W L P E +P G + +P+G+ L G + + YP+L A Sbjct: 144 EHEGWY-LLNPTPPKIESFPAGFIATFAMRNIPNGWLLCDGTAYKREDYPQLFKAIGDKW 202 Query: 130 ------SGVIPDMRGWTIKGKP-----ASGRAVLSQEQDGIKSHTHSASASSTDLGTETT 178 + +PD RG ++G + R ++QD IKSHTH + Sbjct: 203 GKNSDTTFKVPDFRGMFLRGFDDGRGLDNDRKFADEQQDSIKSHTHIGT----------- 251 Query: 179 SSFDYGTKSTNNTGAHTHSISGTANSAGAHQHKSSGAFGGTNTSIFPNGYTAISNLSAGI 238 +GAH H+ ++K G G N PN YT + L Sbjct: 252 ---------VEESGAHVHNF----------EYKGVGWPTG-NIGRLPNYYTYNTTLK--- 288 Query: 239 MSTTSGSGQTRNAGKTSSDGAHTHSLS 265 GKT S GAHTH ++ Sbjct: 289 -------------GKTDSAGAHTHKIT 302 >UniRef50_Q7N047 Similarities with bacteriophage protein n=3 Tax=Photorhabdus RepID=Q7N047_PHOLL Length = 602 Score = 93.4 bits (230), Expect = 1e-17, Method: Composition-based stats. Identities = 33/102 (32%), Positives = 52/102 (50%), Gaps = 6/102 (5%) Query: 91 YPVGAPIPWPSDTV-PSGYALMQGQTFDKSAYPKLAVAYPSGVIPDMRGWTIKGKP---- 145 P+GA I W S P+GY +G+ F + YP+LA +P +PD RG +G Sbjct: 458 VPIGATIEWHSTAPIPAGYEPNEGRAFRAADYPELAKIFPDLKLPDDRGLFKRGLDRGRG 517 Query: 146 -ASGRAVLSQEQDGIKSHTHSASASSTDLGTETTSSFDYGTK 186 SGR++ S + D I++ T S + + G+ + +F Y K Sbjct: 518 LDSGRSLGSVQGDAIRNITGSLGKPTIESGSNASGAFSYQYK 559 >UniRef50_Q7N6T1 Similarities with DNA inversion product n=2 Tax=Photorhabdus luminescens subsp. laumondii RepID=Q7N6T1_PHOLL Length = 300 Score = 92.3 bits (227), Expect = 2e-17, Method: Composition-based stats. 
Identities = 36/86 (41%), Positives = 46/86 (53%), Gaps = 5/86 (5%) Query: 91 YPVGAPIPWPSDTVPSGYALMQGQTFDKSAYPKLAVAYPSGVIPDMRGWTIKGKPAS--- 147 PVG+PIPWP P GY G F+K YPKLA AYP G +PD+RG I+G Sbjct: 152 IPVGSPIPWPLPYPPVGYLTCNGSAFNKLQYPKLAEAYPDGRLPDLRGEFIRGWDDGRGV 211 Query: 148 --GRAVLSQEQDGIKSHTHSASASST 171 GR +LS + D ++ T A + Sbjct: 212 DMGRTMLSWQGDAMQRMTGFLEAGNG 237 >UniRef50_D0KGE5 Tail Collar domain protein n=4 Tax=Pectobacterium RepID=D0KGE5_PECWW Length = 157 Score = 91.1 bits (224), Expect = 4e-17, Method: Composition-based stats. Identities = 36/129 (27%), Positives = 57/129 (44%), Gaps = 9/129 (6%) Query: 84 AHPPAEFYPVGAPIPWPSDTVPSGYALMQGQTFDKSAYPKLAVAYPSGVIPDMRGWTIKG 143 A + G W + P G+ + GQ F+ S P LA YPS +PD RG+ +G Sbjct: 8 AWKSSSSIQPGTITMWGTPVPPEGWLELNGQPFNPSGNPVLASLYPSSQVPDFRGYFPRG 67 Query: 144 KPA------SGRAVLSQEQDGIKSHTHS-ASASSTDLGTETTSSFDYGTKSTNNTGAHTH 196 RA+LS + D I++ T S++ G SS YG +N+G+ Sbjct: 68 WDNGAGIDPDSRAILSVQGDAIRNITGEFNPGGSSNWGKGVFSS--YGWPYPSNSGSAND 125 Query: 197 SISGTANSA 205 + T +++ Sbjct: 126 ASIITFDAS 134 >UniRef50_B4EF34 Putative phage tail protein n=1 Tax=Burkholderia cenocepacia J2315 RepID=B4EF34_BURCJ Length = 883 Score = 90.0 bits (221), Expect = 1e-16, Method: Composition-based stats. 
Identities = 56/252 (22%), Positives = 89/252 (35%), Gaps = 50/252 (19%) Query: 94 GAPIPWPSDTVPSGYALMQGQTFDKSAYPKLAVAY------------------------- 128 G + P T +G+ + G ++ YP L AY Sbjct: 653 GTVVFEPRTTARAGFLKLNGALLKRADYPAL-WAYAQASGALSTETDWAAGWSGTFSTGD 711 Query: 129 --PSGVIPDMRGWTIKGKPASGRAVLSQEQDGIKSHTHSASASSTDLGTETTSSFDYGTK 186 + IP++RG ++ + ++ ++ ++ A + Sbjct: 712 GTTTFRIPELRGEFVRCWDDTRGVDPNRGLGASQNFANAWHAHGASAAASGDHVH---SA 768 Query: 187 STNNTGAHTHSISGTANSAGAHQHKSSGAFGGTNTSIFPNGYTAISNLSAGIMSTTSGSG 246 T+ G H H G S G HQH + + G I P G + + + + G Sbjct: 769 WTDVQGWHGH--HGWTASVGDHQHVAPYSESG----IAPFGTHSTNQVGSHG-----GVD 817 Query: 247 QTRNAGKTSSDGAHTHSLSGTAASAGAHAHTVGIGAHTHSVAIGSHGHTITVNAAGNAEN 306 TS G H H + AG H H VGIGA G+H H ITVN G E+ Sbjct: 818 NDNPWAFTSGAGGHNHEFN--TEGAGNHGHNVGIGA------AGNHSHAITVNGDGANES 869 Query: 307 TVKNIAFNYIVR 318 +N+A ++R Sbjct: 870 RPRNVALLAMIR 881 >UniRef50_B6VKW8 Sc/svq protein n=2 Tax=Photorhabdus asymbiotica subsp. asymbiotica ATCC 43949 RepID=B6VKW8_PHOAA Length = 316 Score = 89.6 bits (220), Expect = 1e-16, Method: Composition-based stats. Identities = 41/173 (23%), Positives = 64/173 (36%), Gaps = 27/173 (15%) Query: 17 ELYEVYNNGYPTAYGNIIHLKGMTAVGEGELLIGWSGTSGAHAPAFIRSRRDTTDANWSP 76 E+ N + G + L + + A+ R+ D Sbjct: 127 EIVNSLRENINGKVPNSWRINGKALTEDINL-------NASDVGAYTRAEVDR------- 172 Query: 77 WAQLYTSAHPPAEFYPVGAPIPWPSDTVPSGYALMQGQTFDKSAYPKLAVAYPSGVIPDM 136 PVG+PIPWP P GY G F++S YPKLA AYP+G +PD+ Sbjct: 173 -------LIKKTSEIPVGSPIPWPLPHPPFGYVTCNGSAFNRSQYPKLAEAYPNGRLPDL 225 Query: 137 RGWTIKGKP-----ASGRAVLSQEQD-GIKSHTHSASASSTDLGTETTSSFDY 183 RG I+G +GR +LS ++ + + S + + + Sbjct: 226 RGEFIRGWDDGRGADNGRKLLSWQEGSALSEYLGSFTTGVAQNIHQRDGVTYH 278 >UniRef50_UPI00016A4B89 phage-related tail fiber protein n=2 Tax=Burkholderia thailandensis RepID=UPI00016A4B89 Length = 654 Score = 88.0 bits (216), Expect = 4e-16, Method: Composition-based stats. 
Identities = 56/274 (20%), Positives = 90/274 (32%), Gaps = 58/274 (21%) Query: 79 QLYTSAHPPAEFYP--VGAPIPWPSDTVPSGYALMQGQTFDKSAYPKL-AVAYPSG---- 131 ++ T+ E VG + +GY G ++ YP L A A SG Sbjct: 403 RVATTQWIAGELASAMVGQIVFEMRTAARAGYLKCNGALVKRADYPALWAYAQGSGALVA 462 Query: 132 ---------------------VIPDMRGWTIKGKPA-----SGRAVLSQEQDGIKSHTHS 165 IP++RG ++ + R + + + ++H H+ Sbjct: 463 EKDWMSGNFGCFSDGDGSATFRIPELRGEFLRCWDDGRGSDADRKIGTWQDSMNRTHGHA 522 Query: 166 ASASSTDLGTETTSSFDYG-TKSTNNTGAHTHSISGTANSAGAHQHKSSGAFGGTNTSIF 224 A A D+G T+N G H H N + + + Sbjct: 523 AGAD---------GVGDHGHNAWTDNQGWHGHHGWTGTN-------GNHNHNNDIFSRLL 566 Query: 225 PNGYTAISNLSAGIMSTTSGSGQTRNAGKTSSDGAHTHSLSGTAASAGAHAHTVGIGAHT 284 Y S S + + ++ G H H + AG HAH VG+ A Sbjct: 567 RPPYNGSLTGSDTAGSGSEQAVGGGDSADIRWAGDHNHEFN--TEGAGTHAHNVGVAAS- 623 Query: 285 HSVAIGSHGHTITVNAAGNAENTVKNIAFNYIVR 318 G+H H I V A G E +N+A ++R Sbjct: 624 -----GAHSHAIHVAADGGNEARPRNLAVLAMIR 652 >UniRef50_Q1I687 Putative phage variable tail fibre protein n=1 Tax=Pseudomonas entomophila L48 RepID=Q1I687_PSEE4 Length = 898 Score = 87.7 bits (215), Expect = 5e-16, Method: Composition-based stats. Identities = 53/186 (28%), Positives = 76/186 (40%), Gaps = 45/186 (24%) Query: 87 PAEFYPVGAPIPWPSDTVPSGYALMQGQTFDKSAYPKLAVAYPSG------------VIP 134 A PVG +P+P TVP+G+ + G T + YP LA AY G +P Sbjct: 380 TASALPVGTMLPFPRGTVPAGFLEVDGSTQSAAVYPDLA-AYLGGAFNTGNEAAGFFRLP 438 Query: 135 DMRGWTIKGKP-----ASGRAVLSQEQDGIKSHTHSASASSTDLGTETTSSFDYGTKSTN 189 D RG ++G SGRAV S + + K+HTH D+G G + Sbjct: 439 DTRGEFLRGWDHGRGVDSGRAVGSTQGESFKAHTHK------DVGFIDNVGGGSGASAVT 492 Query: 190 NTGAHTHSISGTA------NSAGAHQHKSSGAFGGTNTSIFPNGYTAISNLSAGIMSTTS 243 SI G A +A A++ + GA GG AG++S ++ Sbjct: 493 GATGDVTSIYGKAYGNSASATAKAYKESAPGALGGA---------------IAGLISGST 537 Query: 244 GSGQTR 249 G +TR Sbjct: 538 GDSETR 543 Score = 80.0 bits (195), Expect = 1e-13, Method: Composition-based stats. 
Identities = 52/271 (19%), Positives = 91/271 (33%), Gaps = 34/271 (12%) Query: 2 NITALTDNTQGAAGLELYEVYNNGYPTAYGN-----IIHLKGMTAVGEGELLIGWSGTSG 56 ++T++ G + + Y P A G I G + L + W Sbjct: 497 DVTSIYGKAYGNSASATAKAYKESAPGALGGAIAGLISGSTGDSETRPRNLAVMWC---- 552 Query: 57 AHAPAFIRSRRDTTDANWSPWAQLYTSAHPPAEFYPVGAPIPWPSDTVPSGYALMQGQTF 116 I++ + A L PVGA +P+P VP+GY + G Sbjct: 553 ------IKAWNAPVNQGQIDVAALVAELKALRSSTPVGAILPFPKAEVPAGYLELDGSLQ 606 Query: 117 DKSAYPKLA----VAYPSG-------VIPDMRGWTIKGKP-----ASGRAVLSQEQDGIK 160 + YP LA +Y +G +PD RG ++G GR + + + D I+ Sbjct: 607 SVATYPDLAAYLGASYNNGTEPAGYFRLPDYRGEFLRGWDHGRGVDPGRGMGTSQSDAIQ 666 Query: 161 SHTHSA---SASSTDLGTETTSSFDYGTKSTNNTGAHTHSISGTANSAGAHQHKSSGAFG 217 + T S + LG +S + T +T A+T + ++ +A + Sbjct: 667 NITGSIGLRGGAGVGLGVMGGASGAFSTVFGESTSANTITRDASSIAASDIARFDASKVV 726 Query: 218 GTNTSIFPNGYTAISNLSAGIMSTTSGSGQT 248 P + + + A G Sbjct: 727 RAAAETRPRNQSVMWCIKAWSTPVNQGQVDV 757 >UniRef50_D0ZBI4 Putative tail fiber protein n=1 Tax=Edwardsiella tarda EIB202 RepID=D0ZBI4_EDWTE Length = 718 Score = 86.1 bits (211), Expect = 2e-15, Method: Composition-based stats. 
Identities = 52/217 (23%), Positives = 80/217 (36%), Gaps = 31/217 (14%) Query: 2 NITALTDNTQGAAGLELYEVYNNGYPTAYGNIIHLKGMTAVGEGELLIGWSGTSGAHAPA 61 N A QG G+ + G P++ + G G G L + + SG + Sbjct: 475 NGGAFAGCNQG--GIYEVSI---GTPSSVADFPMKNGTYIYGYGVLYV--TSNSGTISQL 527 Query: 62 FIRSRRDTTDA--------NWSPWAQLYTSAHPPAEFYPVGAPIPWPSDTVPSG------ 107 +I S A N+ WA ++ +G+ IPW + +P Sbjct: 528 YI-SHNGQIAARIKWGDQPNFKSWAVYDPNSSFEYGCPLIGSLIPWALERMPQEIWPNCG 586 Query: 108 --YALMQGQTFDKSAYPKLAVAYPSGVIP-DMRGWTIKGKP-----ASGRAVLSQEQDGI 159 + GQ+FD +PKL YP +P DMRG+T +G GRA+LS + D I Sbjct: 587 MHFIPYMGQSFDPELFPKLHDVYPDNRLPTDMRGYTARGWDNGRGIDIGRALLSYQDDAI 646 Query: 160 KSHTHSAS-ASSTDLGTETTSSFDYGTKSTNNTGAHT 195 ++ T + +F N G T Sbjct: 647 QNITGQFGWMPFNGSSPVASGAFSVDKIGANVWGGGT 683 >UniRef50_B3R3K1 Bacteriophage large tail fiber protein n=1 Tax=Cupriavidus taiwanensis RepID=B3R3K1_CUPTR Length = 1045 Score = 85.3 bits (209), Expect = 3e-15, Method: Composition-based stats. Identities = 70/326 (21%), Positives = 111/326 (34%), Gaps = 74/326 (22%) Query: 28 TAYGNIIHLKGMTAVGEGELLIGWSGTSGAHAPAFIRSRRDTTDANWSPWAQ-LYTSAHP 86 TA G M GEL IG + T G+ A + ++ L T+A Sbjct: 757 TAPGTTARAVKMRLADNGELRIGNTATDGSGAKLQVTGYATADTPPAGDSSRKLATTAWV 816 Query: 87 PAEFY--PVGAPIPWPSDTVPSGYALMQGQTFDKSAYPKLAVAY---------------- 128 + VG I P T +G + G ++ YP+L AY Sbjct: 817 MSTLLTASVGQIIIEPRTTARAGCLKLNGALLKRADYPEL-WAYAQASGAIVTDAAWLAG 875 Query: 129 -----------PSGVIPDMRGWTIKGKPAS-----GRAVLSQEQDGIKSHTHSASASSTD 172 + IP+ RG ++ + GR + + K+H+H+ASA+ Sbjct: 876 SWGCFSHGDGNTTFRIPEYRGEYLRFWDDARGADAGRGIGVFQDSQNKTHSHAASATPV- 934 Query: 173 LGTETTSSFDYGTKSTNNTGAHTHSISGTANSAGAHQHKSSGAFGGTNTSIFPNGYTAIS 232 ++G T+ G H H ++ P + Sbjct: 935 ------GDHNHGA-WTDAQGWHGHGVN------------------------DPGHAHSFQ 963 Query: 233 NLSAGIMSTTSGSGQTRNAGKTSSDGAHTHSLSGTAASAGAHAHTVGIGAHTHSVAIGSH 292 + G + + + G + A G+HAH VG+G G+H Sbjct: 964 TWTGGGATGAGRVSGSYVTNADAWAGTSASYTGISIAGDGSHAHNVGVG------YAGNH 1017 Query: 293 GHTITVNAAGNAENTVKNIAFNYIVR 318 H ITVNA G AE V+NI+ ++R Sbjct: 
1018 SHAITVNADGGAEVRVRNISALAMIR 1043 >UniRef50_B9BDD9 Bacteriophage protein n=3 Tax=Burkholderia RepID=B9BDD9_9BURK Length = 536 Score = 84.6 bits (207), Expect = 5e-15, Method: Composition-based stats. Identities = 51/258 (19%), Positives = 87/258 (33%), Gaps = 56/258 (21%) Query: 93 VGAPIPWPSDTVPSGYALMQGQTFDKSAYPKLAVAY------------------------ 128 +G + P +V +G+ + G ++S YP L AY Sbjct: 301 IGTIVFEPRTSVRAGFLKLNGALVNRSDYPAL-WAYAQASGALVAESAWGQNNWGCFSTG 359 Query: 129 ---PSGVIPDMRGWTIKGKPA-----SGRAVLSQEQDGIKSHTHSASASSTDLGTETTSS 180 + +P++RG ++ S R + + + H H AS+++ T Sbjct: 360 DGATTFRLPELRGEFLRCWDDGRGADSARGIGTFQSFQNAWHAHGASSAAVGDHTH---- 415 Query: 181 FDYGTKSTNNTGAHTHSISGTANSAGAHQHKSSGAFGGTNTSIFPNGYTAISNLSAGIMS 240 GA T + + G + + Y S S Sbjct: 416 -----------GAWTDAQGWHGHHGWTGGGGGHNHNNGIFSRLLRPPYGGSLTGSDQAGS 464 Query: 241 TTSGSGQTRNAGKTSSDGAHTHSLSGTAASAGAHAHTVGIGAHTHSVAIGSHGHTITVNA 300 + + ++ + G H H + +G H+H VGIG G+H H ITVN Sbjct: 465 GSEQAVGAGDSADIAWSGDHAHEFN--TEGSGTHSHNVGIGG------AGAHAHAITVNG 516 Query: 301 AGNAENTVKNIAFNYIVR 318 G E +NIA ++R Sbjct: 517 DGGNEARPRNIAMLAMIR 534 >UniRef50_B8DPV9 Tail Collar domain protein n=1 Tax=Desulfovibrio vulgaris str. 'Miyazaki F' RepID=B8DPV9_DESVM Length = 530 Score = 84.2 bits (206), Expect = 5e-15, Method: Composition-based stats. 
Identities = 42/152 (27%), Positives = 70/152 (46%), Gaps = 15/152 (9%) Query: 88 AEFYPVGAPIPWPSDTVPSGYALMQGQTFDKSAYPKLAVAYPSGV-------IPDMRGWT 140 A F P+GA + +P +TVP+G+ + GQ ++AYP L V Y +G +PD+RG Sbjct: 209 AAFVPIGAILDFPVNTVPTGFLVCAGQVVTRTAYPDL-VTYLTGGTVAVNATLPDLRGEF 267 Query: 141 IKGKP-----ASGRAVLSQEQDGIKSHTHSASASSTDLGTETTSSFDYGTKSTNNT--GA 193 +G +GR V S + D I++ T S + ++ + ST N+ GA Sbjct: 268 RRGADLGRGVDAGRVVGSAQGDAIRNITGSLYNYIQNNASQENGALRTQVASTLNSPFGA 327 Query: 194 HTHSISGTANSAGAHQHKSSGAFGGTNTSIFP 225 T T + + Q ++ N ++ P Sbjct: 328 GTIMSWSTLSIDASRQVPTASENRPRNIAVVP 359 >UniRef50_Q93D60 PilV n=7 Tax=Escherichia coli RepID=Q93D60_ECOLX Length = 456 Score = 83.4 bits (204), Expect = 9e-15, Method: Composition-based stats. Identities = 30/52 (57%), Positives = 36/52 (69%) Query: 89 EFYPVGAPIPWPSDTVPSGYALMQGQTFDKSAYPKLAVAYPSGVIPDMRGWT 140 E YPVG+PIPWPS T P GY +M GQ+F S YP+LA AYP +PD+R Sbjct: 336 ESYPVGSPIPWPSATPPQGYLVMNGQSFSCSRYPQLARAYPGCKLPDLRRCF 387 >UniRef50_D0KGE3 Tail Collar domain protein n=1 Tax=Pectobacterium wasabiae WPP163 RepID=D0KGE3_PECWW Length = 144 Score = 83.0 bits (203), Expect = 1e-14, Method: Composition-based stats. Identities = 31/90 (34%), Positives = 43/90 (47%), Gaps = 7/90 (7%) Query: 99 WPSDTVPSGYALMQGQTFDKSAYPKLAVAYPSGVIPDMRGWTIKGKP-------ASGRAV 151 W + P G+ + GQ F+ S P LA YPS +PD RG+ +G S R+V Sbjct: 2 WGTPVPPEGWLELNGQLFNPSGNPVLADLYPSSRVPDFRGYFPRGWDNGAGIDPDSSRSV 61 Query: 152 LSQEQDGIKSHTHSASASSTDLGTETTSSF 181 LS + D I SH H+ + S G + F Sbjct: 62 LSYQDDEIISHKHAITMSHEHHGAADGAGF 91 >UniRef50_C6ABW9 Phage tail collar protein n=1 Tax=Bartonella grahamii as4aup RepID=C6ABW9_BARGA Length = 370 Score = 82.7 bits (202), Expect = 1e-14, Method: Composition-based stats. 
Identities = 29/113 (25%), Positives = 47/113 (41%), Gaps = 17/113 (15%) Query: 88 AEFYPVGAPIPWPSDTVPSGYALMQGQTFDKSAYPKLAV----AYPSG------VIPDMR 137 PVG I +P+ TVP G+ G +S Y +L Y +G +PD+R Sbjct: 218 NNSMPVGTVIYYPALTVPKGWLKANGALISRSDYAQLFAVIGTTYGAGDGKTTFRLPDLR 277 Query: 138 GWTIKGKPAS-----GRAVLSQEQDGIKSHTHSASASSTDLGTETTSSFDYGT 185 G ++G R + SQ+ D I++ T + + + +F YG Sbjct: 278 GEFLRGVDDERNIDPNRTIGSQQGDAIRNITGELNFDAK--AKAASGAFKYGG 328 >UniRef50_C8PDQ5 Phage Tail Collar Domain protein n=1 Tax=Campylobacter gracilis RM3268 RepID=C8PDQ5_9PROT Length = 391 Score = 80.3 bits (196), Expect = 9e-14, Method: Composition-based stats. Identities = 33/135 (24%), Positives = 49/135 (36%), Gaps = 15/135 (11%) Query: 70 TDANWSPWAQLYTSAHPPAEFYPVGAPIPWPSDTVPSGYALMQGQTFDKSAYPKLAVAYP 129 A+ + + + + + PVG I P G+ L G +SAY L A Sbjct: 213 GAADNDNYTRAVSFSLLSSTILPVGTIITSARTPAPDGFLLCNGAAISRSAYTDLFSAIG 272 Query: 130 ----------SGVIPDMRGWTIKGKP-----ASGRAVLSQEQDGIKSHTHSASASSTDLG 174 S IPD+RG I+G GRA+ S + D I++ T A Sbjct: 273 TAYGAGDGSSSFNIPDLRGEFIRGADNGRGVDGGRALGSAQGDAIRNITARAIGMGDRNS 332 Query: 175 TETTSSFDYGTKSTN 189 T YG + + Sbjct: 333 IPTLLGALYGIQKST 347 >UniRef50_B2ZY49 Phage tail collar domain protein n=1 Tax=Ralstonia phage RSL1 RepID=B2ZY49_9CAUD Length = 498 Score = 80.0 bits (195), Expect = 1e-13, Method: Composition-based stats. 
Identities = 49/254 (19%), Positives = 82/254 (32%), Gaps = 79/254 (31%) Query: 80 LYTSAHPPAEFYPVGAPIPWPSDTVPSGYALMQGQTFDKSAYPKLAVAYPS--------- 130 Y P + P G +P+ T+P+GY ++ + L + Sbjct: 308 FYDQILNPPQLVPPGTILPFAGTTIPAGYLACNAAAISRTGFASLYSVIGTTYGVGNGST 367 Query: 131 -GVIPDMRGWTIKGKP-----ASGRAVLSQEQDGIKSHTHSASASSTDLGTETTSSFDYG 184 +PD+RG ++G GR + + D +SH H+ S +D G Sbjct: 368 TFNLPDLRGVFVRGWDNGRGQDPGRVFGTYQGDAFRSHNHAVSDP-----GHAHGVYDPG 422 Query: 185 TKSTNNTGAHTHSISGTANSAGAHQHKSSGAFGGTNTSIFPNGYTAISNLSAGIMSTTSG 244 H+H+ + + + Sbjct: 423 ---------HSHTWT--------------------------------------LGTLRQS 435 Query: 245 SGQTRNAGKTSSDGAHTHSLSGTAASAGAHAHTVGIGAHTHSVAIGSHGHTITVNAAGNA 304 G T ++ G + T A+ G GIG + + IG+ VN G A Sbjct: 436 GGDTSCYVPSARYGGGEFQFTETTAAVG-----TGIGIYGNVTGIGT-----LVN--GGA 483 Query: 305 ENTVKNIAFNYIVR 318 E T KN+A NYI++ Sbjct: 484 ETTPKNVAMNYIIK 497 >UniRef50_C6BT48 Tail Collar domain protein n=1 Tax=Desulfovibrio salexigens DSM 2638 RepID=C6BT48_DESAD Length = 208 Score = 80.0 bits (195), Expect = 1e-13, Method: Composition-based stats. Identities = 39/143 (27%), Positives = 63/143 (44%), Gaps = 13/143 (9%) Query: 87 PAEFYPVGAPIPWPSDTVPSGYALMQGQTFDKSAYPKLAVAYPSGVIPDMRGWTIKGKP- 145 A YP+GA + DT P G+ GQ + YP+LA + +PD+RG I+G Sbjct: 57 AASDYPIGAVAAYRGDTPPVGWLECNGQ--STTGYPELAAVVGAN-VPDLRGEFIRGLDS 113 Query: 146 ----ASGRAVLSQEQDGIKSHTHSASASSTDLGTETTSSFDYGTKS-----TNNTGAHTH 196 +GRA+ S + D ++ H+H + + + + T S + + T N G+ Sbjct: 114 GRGVDAGRALGSAQADAMERHSHQTTITVSGRTSVTASPYHSAGAARSLVTTPNFGSPFG 173 Query: 197 SISGTANSAGAHQHKSSGAFGGT 219 S +A+ G SGA Sbjct: 174 GASFSASGTGTSTSVGSGAETRP 196 >UniRef50_B2I5N0 Tail Collar domain protein n=13 Tax=Xylella fastidiosa RepID=B2I5N0_XYLF2 Length = 414 Score = 79.6 bits (194), Expect = 1e-13, Method: Composition-based stats. 
Identities = 60/283 (21%), Positives = 97/283 (34%), Gaps = 27/283 (9%) Query: 60 PAFIRSRRDTTDANWSPWAQLYTSAHPPAEFYPVGAPIPWPSDTVPSGYALMQGQTFDKS 119 + T D N Q + Y G + G L G+ ++ Sbjct: 105 ASRYGLYLSTADNNTDTPLQNEPNRWTALSRYEPGQIVYTAGKRALPGTLLCDGRAVSRA 164 Query: 120 AYPKLAV----AYPSG------VIPDMRGWTIKGKPASGRAVLSQEQDGIKSHTHSASAS 169 YP+L +Y +G IP+ T+ A V + + SH H+A+A Sbjct: 165 MYPRLFEEINTSYGAGDGVSTFNIPNFLEGTVGVHTADPALVGTFTSGQVISHAHTATAE 224 Query: 170 STDLGTETTSSFDYG--TKSTNNTGAHTHSISGTANSAGAHQHKSSGAFGGTNTSIFPN- 226 + G T + A H ++ G HQH S ++ G + I + Sbjct: 225 EGGRHLHPVTVHPAGRHTHPASAAAAGNHLHQAWSDEQGLHQHTGSTSWDGDHAHILGSF 284 Query: 227 GYTAISNLSAGIMSTTSGSGQTRNAG------KTSSDGAHTHSLSGTAASAGAHAHTVGI 280 S G G T G T ++G H H++S A AG H H + + Sbjct: 285 RAIYASGRDMGFYEQNQGKVTTNVTGGHLHRFTTDANGKHAHNISMQA--AGFHVHDIAV 342 Query: 281 GA---HTHSV---AIGSHGHTITVNAAGNAENTVKNIAFNYIV 317 A H H+ + G HGHT++++ G N + + Sbjct: 343 TAEADHAHAATAESAGRHGHTVSIDRFGEHHNLPAGLRVMACL 385 >UniRef50_A3GUE7 Tail fiber protein H, putative (Fragment) n=1 Tax=Vibrio cholerae NCTC 8457 RepID=A3GUE7_VIBCH Length = 250 Score = 78.8 bits (192), Expect = 2e-13, Method: Composition-based stats. Identities = 25/63 (39%), Positives = 38/63 (60%), Gaps = 2/63 (3%) Query: 78 AQLYTSAH--PPAEFYPVGAPIPWPSDTVPSGYALMQGQTFDKSAYPKLAVAYPSGVIPD 135 +L + + +PVG IPW +D P G+ + +GQ FD + Y +LA +P+G+IPD Sbjct: 188 DRLVNNLWLKFAVKIFPVGGAIPWFTDVAPEGFGMFKGQAFDVNVYTELAKVFPNGIIPD 247 Query: 136 MRG 138 MRG Sbjct: 248 MRG 250 >UniRef50_C3KCU2 Putative phage tail fiber-related protein n=1 Tax=Pseudomonas fluorescens SBW25 RepID=C3KCU2_PSEFS Length = 658 Score = 77.3 bits (188), Expect = 7e-13, Method: Composition-based stats. 
Identities = 40/208 (19%), Positives = 68/208 (32%), Gaps = 32/208 (15%) Query: 26 YPTAYGNIIHLKGMTAVGEGELLIGWSGTSGAH-------------APAFIRSRRDTTDA 72 YP +Y ++ G G+ + TS A A++ A Sbjct: 105 YPESYKPVLATSG---SGKEFYIRSIFETSNAAIVTLLIDDTVVKATRAWVMDYLGRQLA 161 Query: 73 NWSPWAQLYTSAHPPAEFYPVGAPIPWPSDTVPSGYALMQGQTFDKSAYPKLAV------ 126 + + PVG+ + +P D VP G+ + G +AYP LA Sbjct: 162 EGTYTKAEIEMLIAQSSALPVGSMVAFPIDKVPVGFLEIDGSVKSATAYPDLAKFLGTAF 221 Query: 127 -----AYPSGVIPDMRGWTIKGKP-----ASGRAVLSQEQDGIKSHTHSASASSTDLGTE 176 + +P+ RG ++G +GR S + D KSHTH Sbjct: 222 NKGDEGAGNFRLPESRGEFLRGWDHGRGVDAGRLAGSYQTDQFKSHTHEYDTMQGGGAAN 281 Query: 177 TTSSFDYGTKSTNNTGAHTHSISGTANS 204 + S + + H +G + + Sbjct: 282 SVSDTIAAQSNATSQTGHITGGAGGSET 309 Score = 63.4 bits (152), Expect = 9e-09, Method: Composition-based stats. Identities = 28/101 (27%), Positives = 42/101 (41%), Gaps = 18/101 (17%) Query: 76 PWAQLYTSAHPPAEFYPVGAPIPWPSDTVPSGYALMQGQTFDKSAYPKLAVAY------- 128 A L + PVG+ IP+ VP GY + G + YP LA AY Sbjct: 333 DVAALVSELDVLKSAVPVGSIIPFLKAAVPPGYLELDGSVQSIATYPDLA-AYLGTTFNT 391 Query: 129 ---PSG--VIPDMRGWTIKGKP-----ASGRAVLSQEQDGI 159 P+G +P+ RG ++G +GR V S ++ + Sbjct: 392 GSEPAGYFRLPESRGEFLRGWDHGRGMDAGREVGSWQKGSM 432 >UniRef50_A1VSH6 Phage Tail Collar domain protein n=1 Tax=Polaromonas naphthalenivorans CJ2 RepID=A1VSH6_POLNA Length = 483 Score = 76.5 bits (186), Expect = 1e-12, Method: Composition-based stats. 
Identities = 46/229 (20%), Positives = 77/229 (33%), Gaps = 65/229 (28%) Query: 100 PSDTVPSGYALMQGQTFDKSAYPKLAVAYPS----------GVIPDMRGWTIKGKPASGR 149 T P G+ G ++AY L A + +PD+RG I+G GR Sbjct: 309 ARSTAPPGWLKANGAGISRTAYAALFAAIGTTFGVGDGFNTFNLPDLRGEFIRGWDD-GR 367 Query: 150 AVLSQEQDGIKSHTHSASASSTDLGTETTSSFDYGTKSTNNTGAHTHSISGTANSAGAHQ 209 V S G+ T +H H+ G+ ++AG H Sbjct: 368 GV--------------------------DGSRSLGSSQAGETASHGHT--GSTSAAGIHA 399 Query: 210 HKSSGAFGGTNTSIFPNGYTAISNLSAGIMSTTSGSGQTRNAGKTSSDGAHTHSLSGTAA 269 H + P ++ + G S+ +L T Sbjct: 400 HGVND----------PGHSHQVTQEGG------RNTSLAYQNGPNSAFRGEVSTLLETTR 443 Query: 270 SAGAHAHTVGIGAHTHSVAIGSHGHTITVNAAGNAENTVKNIAFNYIVR 318 +A GIG + G+H HT+T++A G +E +N+A +++ Sbjct: 444 NA------TGIGISEN----GNHSHTVTISATGGSETRPRNLALLAVIK 482 >UniRef50_C3X912 Phage tail collar domain-containing protein n=1 Tax=Oxalobacter formigenes OXCC13 RepID=C3X912_OXAFO Length = 436 Score = 76.5 bits (186), Expect = 1e-12, Method: Composition-based stats. Identities = 28/149 (18%), Positives = 54/149 (36%), Gaps = 19/149 (12%) Query: 70 TDANWSPWAQLYTSAHPPAEFYPVGAPIPWPSDTVPSGYALMQGQTFDKSAYPKLAVAYP 129 D+ W + + P G+ + + T P GY + G ++ Y +L A Sbjct: 264 YDSYLKKWVLQNPAKGIAIDSVPAGSVHYFATQTPPDGYLVANGALVSRTVYARLFSAIG 323 Query: 130 S----------GVIPDMRGWTIKGKP-----ASGRAVLSQEQDGIKSHTHSASASSTDLG 174 + +PD+RG ++G R + + D I++ + + + Sbjct: 324 TTFGEGDGGSTFQLPDLRGEFLRGWDAARNLDPERGFGTVQGDAIRNIIGTFGGNDQERR 383 Query: 175 TETTSSFDYGTKSTNNTGAHTHSISGTAN 203 + + GT + G T S +GT N Sbjct: 384 FLSGPFYYIGT----DGGGKTGSSNGTDN 408 >UniRef50_Q4KHC6 Tail fibre protein, putative n=1 Tax=Pseudomonas fluorescens Pf-5 RepID=Q4KHC6_PSEF5 Length = 369 Score = 74.2 bits (180), Expect = 6e-12, Method: Composition-based stats. 
Identities = 36/138 (26%), Positives = 58/138 (42%), Gaps = 18/138 (13%) Query: 91 YPVGAPIPWPSDTVPSGYALMQGQTFDKSAYPKLAV-----------AYPSGVIPDMRGW 139 PVGA +P+P TVP+G+ + G + YP LA + +P+ RG Sbjct: 113 LPVGAMVPFPKGTVPAGFLEVDGSVQSAATYPDLAAYLGTMFNTGGEGAGNFRLPESRGE 172 Query: 140 TIKGKP-----ASGRAVLSQEQDGIKSHTHSASASSTDLGTETTSSFDYGTKSTNNTGAH 194 ++G GRA+ S + + SH H + + GT T + +Y + TG Sbjct: 173 FLRGWDHGRGVDVGRALGSYQAHAVGSHQHPMNYWAWRDGTG-TGTHNYAKPWGD-TGIT 230 Query: 195 THSISGTANSAGAHQHKS 212 GT +AG + + Sbjct: 231 GVKDPGTGANAGDSETRP 248 >UniRef50_Q7N541 Similar to DNA inversion product and tail fiber protein from lambdoid prophage n=2 Tax=Photorhabdus RepID=Q7N541_PHOLL Length = 337 Score = 73.4 bits (178), Expect = 1e-11, Method: Composition-based stats. Identities = 55/226 (24%), Positives = 87/226 (38%), Gaps = 66/226 (29%) Query: 88 AEFYPVGAPIPWPSDTVPSGYALMQGQTFD---KSAYPKLAVAYPSGVIPDMRGWTIKGK 144 YPVG I + + P+ L G T++ ++ +LA A S ++ Sbjct: 163 NTQYPVGIVIWFAQNKNPN--VLFPGTTWEYIGENKTIRLASANGSDIL----------- 209 Query: 145 PASGRAVLSQEQDGIKSHTHSASASSTDLGTETTSSFDYGTKSTNNTGAHTHSISGTANS 204 G ++S + +H H+ TTS+FDYGTK+TN G H H + Sbjct: 210 STGGNDLISLTAAQMPAHNHTF--------FGTTSTFDYGTKTTNIAGEHYHDSGWGETT 261 Query: 205 AGAHQHKSSGAFGGTNTSIFPNGYTAISNLSAGIMSTTSGSGQTRNAGKTSSDGAHTHSL 264 G + H G N G S+D + Sbjct: 262 GGRYGHF---------------------------------DGSKNNQGSKSTDWNNA--- 285 Query: 265 SGTAASAGAHAHTVGIGAHTHSVAIGSHGHTITVNAAGNAENTVKN 310 ++ GAH+HTV IGAH H+++ G+ T ++ GNA ++ N Sbjct: 286 KFNTSTNGAHSHTVSIGAHNHTIS-GN-----TGDSGGNAAISITN 325 >UniRef50_B7UGJ3 Predicted tai fiber protein n=15 Tax=Escherichia coli RepID=B7UGJ3_ECO27 Length = 221 Score = 71.9 bits (174), Expect = 3e-11, Method: Composition-based stats. 
Identities = 26/88 (29%), Positives = 36/88 (40%), Gaps = 14/88 (15%) Query: 93 VGAPIPWPSDTVPSG---------YALMQGQTFDKSAYPKLAVAYPSGVIPDMRGWTIKG 143 +G P WPS +P + G F + YP LA +PS V+P+ RG I+ Sbjct: 79 IGVPFFWPSAAMPDTVIESWSGMVFLKFNGAKFSATDYPVLAKVFPSLVLPEARGDFIRI 138 Query: 144 KP-----ASGRAVLSQEQDGIKSHTHSA 166 SGRA+LS + S Sbjct: 139 WDDGRGADSGRALLSWQAATSLSQFGGN 166 >UniRef50_B2FIY3 Putative phage collar protein n=1 Tax=Stenotrophomonas maltophilia K279a RepID=B2FIY3_STRMK Length = 410 Score = 71.5 bits (173), Expect = 3e-11, Method: Composition-based stats. Identities = 26/117 (22%), Positives = 41/117 (35%), Gaps = 14/117 (11%) Query: 87 PAEFYPVGAPIPWPSDTVPSGYALMQGQTFDKSAYPKLAVAYP---------SGVIPDMR 137 P G +P+ P G+ G ++ Y L + +PD+R Sbjct: 250 SERLLPAGMVAHFPTGGPPPGWLRCNGADVSRTTYADLFAVIGTLFGSANDMTFRLPDLR 309 Query: 138 GWTIKGKP-----ASGRAVLSQEQDGIKSHTHSASASSTDLGTETTSSFDYGTKSTN 189 G ++G GRA+ S + + ASA G S D+G +TN Sbjct: 310 GEFVRGWDDGRGVDGGRALGSLQAATEVLSSWGASAGGLVSGQYQYSLADFGVHTTN 366 >UniRef50_A6E6G6 Phage Tail Collar n=1 Tax=Pedobacter sp. BAL39 RepID=A6E6G6_9SPHI Length = 731 Score = 70.3 bits (170), Expect = 8e-11, Method: Composition-based stats. Identities = 31/140 (22%), Positives = 51/140 (36%), Gaps = 13/140 (9%) Query: 79 QLYTSAHPPAEFYPVGAPIPWPSDTVPSGYALMQGQTFDKSAYPKLAVAY-PSGVIPDMR 137 +L +A +PVG + + + VP + L G+ D S YP L +PD+R Sbjct: 563 ELKRAAAATILDFPVGGIVAFYGEKVPDHWLLCDGKPVDHSLYPDLYRLLGGEKRLPDLR 622 Query: 138 GWTIKG-------KPASGRAVLSQEQDGIKSHTHSASASSTDLGTETTSSFDYGTKSTNN 190 G + G G LS D + H H A + + + + + Sbjct: 623 GRFLVGAGSKYSLGDMGGVDELSLNVDQMPQHDHQIKAVKSYESPFKEVNMGWAREESLR 682 Query: 191 TGAHTHSISGTANSAGAHQH 210 G + GT GA ++ Sbjct: 683 GG-----VYGTDRDNGADKY 697 >UniRef50_D2MH12 Tail Collar domain protein n=1 Tax=Rhodopseudomonas palustris DX-1 RepID=D2MH12_RHOPA Length = 346 Score = 70.3 bits (170), Expect = 9e-11, Method: Composition-based stats. 
Identities = 58/301 (19%), Positives = 82/301 (27%), Gaps = 54/301 (17%) Query: 24 NGYPTAYGNIIHLKGM----TAVGEGELLIGWSGTSGAHAPAFIRSRRDTTDANWSPWAQ 79 N +P A G I G A +G+ F R + T Sbjct: 95 NSFPNASGPITRSLGAGYGFAATADGDASGPAFSFGSEPGLGFYRKSQGT---------I 145 Query: 80 LYTSAHPPAEFYPVGAPIPWPSDTVPSGYALMQGQTFDKSAYPKLAVAYPSGVIPDMRGW 139 Y P G + + T P G+ GQ + L A G+ Sbjct: 146 AYPGTLRGIGSIPPGFILDFAGPTPPEGWLTCDGQLVSTVTFADLFAAI---------GY 196 Query: 140 TIKGKPASGRAVLSQEQDGIKSHTHSASASSTDLGTETTSSFDYGTKSTNNTGAHTHSIS 199 T G S Q + + D T + GT TN G H+HS S Sbjct: 197 TWGG---------SGGQFAVPNLVKRFRRHRGD----GTVAGGVGTLQTNQIGLHSHSAS 243 Query: 200 GTANSAGAHQHKS-SGAFGGTNTSIFPNGYTAISNLSAGIMSTTSGSGQTRNAGKTSSDG 258 A H S +N P + I + G ++D Sbjct: 244 MDAQGHHDHYLDLWSSGMNRSNPHSHPASGSGIGVSGGFDTGVYAPQGPLNGVSIGATDI 303 Query: 259 AHTHSLSGTAASAGAHAHTVGIGAHTHSVAIGSHGHTITVNAAGNAENTVKNIAFNYIVR 318 H H ++G A G H H ITV A G E + ++ Sbjct: 304 NHEHRVTGNTAGNGGHIHN------------------ITVAANGGNETRPDSATVMACIK 345 Query: 319 L 319 + Sbjct: 346 V 346 >UniRef50_C3X3G6 Putative uncharacterized protein n=1 Tax=Oxalobacter formigenes HOxBLS RepID=C3X3G6_OXAFO Length = 237 Score = 69.9 bits (169), Expect = 1e-10, Method: Composition-based stats. Identities = 24/92 (26%), Positives = 38/92 (41%), Gaps = 16/92 (17%) Query: 91 YPVGAPIPWPSDTVPSGYALMQGQTFDKSAYPKLAVAYPSG-----------VIPDMRGW 139 P G+ + S+T P G+ + G +AYP L A + +PD+RG Sbjct: 95 VPPGSVLYLCSETPPDGWLVADGSMLLVAAYPDLFAAIGTAFGSGDNGMTTFRLPDLRGE 154 Query: 140 TIKGKP-----ASGRAVLSQEQDGIKSHTHSA 166 I+ GR + S + D I++H H Sbjct: 155 FIRCLDKGRGLDDGRPLGSVQGDEIRNHNHGF 186 >UniRef50_C3X8U2 Phage Tail Collar Domain containing protein n=1 Tax=Oxalobacter formigenes OXCC13 RepID=C3X8U2_OXAFO Length = 266 Score = 69.6 bits (168), Expect = 1e-10, Method: Composition-based stats. 
 Identities = 22/89 (24%), Positives = 33/89 (37%), Gaps = 15/89 (16%)

Query: 91  YPVGAPIPWPSDTVPSGYALMQGQTFDKSAYPKLAVAYP----------SGVIPDMRGWT 140
           P G + P+GY G ++ YP L A + +PD+RG
Sbjct: 107 VPTGTIAFFAMTAPPAGYLKADGAIIQRTDYPALFTAIGTTFGEGDGTTTFTLPDLRGEF 166

Query: 141 IKGKP-----ASGRAVLSQEQDGIKSHTH 164
           I+G RA S + D I++ T
Sbjct: 167 IRGWDNGRNIDCERAFGSIQGDAIRNVTG 195

>UniRef50_B3HKW0 Phage Tail Collar Domain protein n=11 Tax=Enterobacteriaceae RepID=B3HKW0_ECOLX
          Length = 164

 Score = 69.6 bits (168), Expect = 2e-10, Method: Composition-based stats.
 Identities = 23/78 (29%), Positives = 35/78 (44%), Gaps = 14/78 (17%)

Query: 93  VGAPIPWPSDTVPSG---------YALMQGQTFDKSAYPKLAVAYPSGVIPDMRGWTIKG 143
           +G P WPS +P+ + G F + YP LA +PS V+P+ RG I+
Sbjct: 22  IGVPFFWPSAAMPNTVIDSWSGMVFLKFNGAKFSATDYPVLAKVFPSLVLPEARGDFIRI 81

Query: 144 KP-----ASGRAVLSQEQ 156
           GR +LS ++
Sbjct: 82  WDDGRGADGGRELLSWQE 99

>UniRef50_C4GFX3 Putative uncharacterized protein n=2 Tax=Kingella oralis ATCC 51147 RepID=C4GFX3_9NEIS
          Length = 310

 Score = 69.2 bits (167), Expect = 2e-10, Method: Composition-based stats.
 Identities = 21/116 (18%), Positives = 43/116 (37%), Gaps = 16/116 (13%)

Query: 83  SAHPPAEFYPVGAPIPWPSDTVPSGYALMQGQTFDKSAYPKLAVAYP----------SGV 132
           + + + + P G + +D P+G+ G ++ Y L A +
Sbjct: 146 TGYTANSYCPSGQIGLFATDYAPTGWLKANGAVLSRTVYTNLFAAIGTRFGAGDGHSTFN 205

Query: 133 IPDMRGWTIKGKP-----ASGRAVLSQEQDGIKSHTHSASASSTDLGTETTSSFDY 183
           +PD+RG + +GR + S + D I++ T G+ + +F +
Sbjct: 206 LPDLRGEFPRFWDDGRGVDAGRVLGSWQSDAIRNITAQM-YLYGQDGSSSQGAFGF 260

>UniRef50_Q3KH70 Putative phage tail fiber-related protein n=1 Tax=Pseudomonas fluorescens Pf0-1 RepID=Q3KH70_PSEPF
          Length = 817

 Score = 68.4 bits (165), Expect = 4e-10, Method: Composition-based stats.
 Identities = 50/272 (18%), Positives = 90/272 (33%), Gaps = 38/272 (13%)

Query: 33  IIHLKGMTAVGEGELLIGWSGTSGAHAPAFIRSRRDTTDANWSPWAQLYTSAHPPAEFY- 91
           ++ +G A EL IG T A R R W +
Sbjct: 443 VLRTEGDNA----ELSIGRVRTYNFGAATETRPRNIAVMWCIKAWNAPVNQGNIDVAALV 498

Query: 92  ----------PVGAPIPWPSDTVPSGYALMQGQTFDKSAYPKLAVAY------------P 129
           PVGA + +P+ VP G+ + G + S YP LA AY
Sbjct: 499 KEVSRLGSAVPVGAVMAFPTGIVPPGFLELNGSVQNTSTYPDLA-AYLGTTYNKGDEGAG 557

Query: 130 SGVIPDMRGWTIKGKP-----ASGRAVLSQEQDGIKSHTHSASASSTD-----LGTETTS 179
           + +P+ RG ++G +GR + + + + H H+ + +
Sbjct: 558 NFRLPESRGEFLRGWDHGRGVDAGRGIGTNQGQSMVDHYHTVLTADAGGVLNPIAGNLVG 617

Query: 180 SFDYGTKSTNNTGAHTHSISGTANSAGAHQHKSSGAFGGTNTSIFPNGYTAISNLSAGIM 239
           SF + GA + T++ G K N ++ + ++ G +
Sbjct: 618 SFTNLAPISKPAGAGVLGATLTSSIHGPAAEKGGTETRPRNLAVMWCIKAWNAPINQGNI 677

Query: 240 STTSGSGQTRNAGKTSSDGAHTHSLSGTAASA 271
           + + + A +T+ A + + T A A
Sbjct: 678 DIAALAVLAQQASETNQGTAKVATQAQTNAGA 709

 Score = 65.3 bits (157), Expect = 3e-09, Method: Composition-based stats.
 Identities = 27/103 (26%), Positives = 41/103 (39%), Gaps = 18/103 (17%)

Query: 87  PAEFYPVGAPIPWPSDTVPSGYALMQGQTFDKSAYPKLAVAYPSG------------VIP 134
           A PVG+ + +P D+ P G+ + + YP L AY G +P
Sbjct: 319 KASALPVGSIVAFPVDSPPPGFLELDNSVKSSATYPDL-SAYLGGKFNKGDEGVGNFRLP 377

Query: 135 DMRGWTIKGKP-----ASGRAVLSQEQDGIKSHTHSASASSTD 172
           + RG ++G GRA S + D +K+H H S
Sbjct: 378 EARGEFLRGWDHGRGVDGGRAQGSSQTDSLKAHYHLIPTGSGG 420

>UniRef50_Q4TVW2 Tail fiber protein n=4 Tax=Viruses RepID=Q4TVW2_9CAUD
          Length = 760

 Score = 68.0 bits (164), Expect = 4e-10, Method: Composition-based stats.
 Identities = 44/192 (22%), Positives = 70/192 (36%), Gaps = 30/192 (15%)

Query: 91  YPVGAPIPWPSDTVPSGYALMQGQTFDKSAYPKLAVAYPSGVIPDMRGWTIKGKPASGRA 150
           P+G+ P+ T P+GY G TF K YP L S +PDMRG +K
Sbjct: 264 VPIGSIFPF-VKTPPAGYLTCDGSTFSKDEYPDLYAYLGSTTLPDMRGRYLKMPSD---- 318

Query: 151 VLSQEQDGIKSHTHSASASSTDLGTETTSSFDYGTKSTNNTGAHTH-----SISGTANSA 205
           + + A L + S + T + AH H I G
Sbjct: 319 --------LANIYQKFPAIIPALLHDVDISHTH----TASQQAHAHDRGTMEIGGEFFVG 366

Query: 206 GAHQ-HKSSGAFGGTNTSIFPNGYTAISNLSAG-------IMSTTSGSGQTRNAGKTSSD 257
           H + ++GA+GG S P G ++G ++ + +G T + +
Sbjct: 367 SGHGLYIATGAYGGAFFSDSPGGADNNGGGASGGLNRRWVFRASRNWTGLTSYSAPAITV 426

Query: 258 GAHTHSLSGTAA 269
           A T+++ T
Sbjct: 427 NALTNAIRQTTN 438

>UniRef50_Q7P172 Probable bacteriophge tail fiber protein n=1 Tax=Chromobacterium violaceum RepID=Q7P172_CHRVO
          Length = 591

 Score = 67.6 bits (163), Expect = 6e-10, Method: Composition-based stats.
 Identities = 39/179 (21%), Positives = 57/179 (31%), Gaps = 26/179 (14%)

Query: 68  DTTDANWSPWAQLYTSAHPPAEFYPVGAPIPWPSDTVPSGYALMQGQTFDKSAYPKLAVA 127
           D T W+PW L E Y G + P G+ G + YP L A
Sbjct: 410 DGTTGRWNPWRTLIH------EDYLTGQVAFFAMSAPPLGWLKANGAAVSRKDYPSLFAA 463

Query: 128 ----YPSG------VIPDMRGWTIKGKP-----ASGRAVLSQEQDGI----KSHTHSASA 168
           Y +G +PD+RG ++G +GR + ++ + S T A
Sbjct: 464 LGTYYGAGDGSTTFNLPDLRGEFVRGWDDGRGVDNGRGFGTWQKGTLTFSDPSLTSPCVA 523

Query: 169 SSTDLGTETTSSF-DYGTKSTNNTGAHTHSISGTANSAGAHQHKSSGAFGGTNTSIFPN 226
           S T + D G + TAN S G G ++ N
Sbjct: 524 SLVHRNDNTVIGYLDLGADPVDKNKYDLGLSVSTANGVYLPDLDSGGWANGYGSTRPRN 582

>UniRef50_C3X8Y3 Putative uncharacterized protein n=1 Tax=Oxalobacter formigenes OXCC13 RepID=C3X8Y3_OXAFO
          Length = 270

 Score = 67.2 bits (162), Expect = 7e-10, Method: Composition-based stats.
 Identities = 22/93 (23%), Positives = 34/93 (36%), Gaps = 15/93 (16%)

Query: 89  EFYPVGAPIPWPSDTVPSGYALMQGQTFDKSAYPKLAVAYP----------SGVIPDMRG 138
           P G + + S P GY G + Y +L A + +PD+RG
Sbjct: 127 MAVPAGTVVYFCSHKAPYGYLKADGSAVGREEYKELFAAIGVYFGSGDGVSTFNLPDLRG 186

Query: 139 WTIKGKP-----ASGRAVLSQEQDGIKSHTHSA 166
           I+ +GR + + + D KSH H
Sbjct: 187 EFIRSLDNGRGVDAGRELGNVQMDEFKSHYHGF 219

>UniRef50_C5A8Q3 Phage-related tail fiber protein n=1 Tax=Burkholderia glumae BGR1 RepID=C5A8Q3_BURGB
          Length = 865

 Score = 67.2 bits (162), Expect = 7e-10, Method: Composition-based stats.
 Identities = 48/258 (18%), Positives = 79/258 (30%), Gaps = 71/258 (27%)

Query: 93  VGAPIPWPSDTVPSGYALMQGQTFDKSAYPKLAVAY------------------------ 128
           +G + P T +G+ G +++ YP L AY
Sbjct: 645 IGQIVFEPRTTTRAGFLKANGSLLERADYPAL-WAYAQASGALISDAAWWAGQSGCFSTG 703

Query: 129 ---PSGVIPDMRGWTIKGKPA-----SGRAVLSQEQDGIKSHTHSASASSTDLGTETTSS 180
           + IP++RG ++ + RA S + H+H AS
Sbjct: 704 TTGTNFRIPELRGEFLRCLDDGRGLDTSRAAGSLQLSQNAKHSHDAS------------- 750

Query: 181 FDYGTKSTNNTGAHTHSISGTANSAGAHQHKSSGAFGGTNTSIFPNGYTAISNLSAGIMS 240
           + G+HTH AG+H H +T + + +
Sbjct: 751 -------STVGGSHTHGAF--TTGAGSHNHAIDQQPHAHDTWLGSVQVSGVDR------- 794

Query: 241 TTSGSGQTRNAGKTSSDGAHTHSLSGTAASAGAHAHTVGIGAHTHSVAIGSHGHTITVNA 300
           G G G+ + + + G H H G + G H H I V
Sbjct: 795 ---GGGFGPYNGRVGEAWSDPANANIAILPTGDHVHGAG------TYPAGDHNHAIAVQP 845

Query: 301 AGNAENTVKNIAFNYIVR 318
           +G E +NIA ++R
Sbjct: 846 SGGDEARPRNIALLAMIR 863

>UniRef50_Q116W7 Phage Tail Collar n=1 Tax=Trichodesmium erythraeum IMS101 RepID=Q116W7_TRIEI
          Length = 671

 Score = 66.9 bits (161), Expect = 9e-10, Method: Composition-based stats.
 Identities = 32/119 (26%), Positives = 47/119 (39%), Gaps = 15/119 (12%)

Query: 90  FYPVGAPIPWPSDTVPSGYALMQGQTFDKSAYPKLAVAYPS-GVIPDMRGWTIKGKPA-- 146
           PVG +P+ T P G+ L GQ++D Y +L V+PD++G I G
Sbjct: 525 VVPVGTIVPYAGLTAPEGWLLCNGQSYDWEQYSELYKVLDEIKVLPDLKGRFIIGVGDKD 584

Query: 147 ---------SGRAVLSQEQDGIKSHTHSASASSTDL---GTETTSSFDYGTKSTNNTGA 193
           G + +D + SH HS L G TTS+ + N G+
Sbjct: 585 GYSYSLNAKGGEEKHTLTKDEMPSHDHSKGEYKFILKKDGKVTTSNNVNNSLREPNLGS 643

>UniRef50_A4PE45 Tail fiber protein gpH n=3 Tax=root RepID=A4PE45_9CAUD
          Length = 554

 Score = 66.9 bits (161), Expect = 9e-10, Method: Composition-based stats.
 Identities = 32/187 (17%), Positives = 60/187 (32%), Gaps = 33/187 (17%)

Query: 9   NTQGAAGLELYEVYNNGYPTAYGNIIHLKGMTAVGEGELLIGWSGTSGAHAPAFIRSRRD 68
           N A G Y N P+ +G ++ + +A + + + +R
Sbjct: 334 NALVAPGEYYYTSDNANAPSGHG-VLKVWRESAT---MVFQLVHSSDNE-----VFTRYR 384

Query: 69  TTDANWSPWAQLYTSAHPPAEFYPVGAPIPWPSDTVPSGYALMQGQTFDKSAYPKLAVAY 128
           + W+ W QL A G + T PSG+ G ++ Y L
Sbjct: 385 ASSGTWTAWRQLVGQA---------GLIGYFARSTAPSGWLKANGAAVSRTTYAALYAEI 435

Query: 129 P----------SGVIPDMRGWTIKGKP-----ASGRAVLSQEQDGIKSHTHSASASSTDL 173
           + +PD+RG ++G SGR + + + H +S ++
Sbjct: 436 GTTFGAGDGAATFNLPDLRGEFLRGWDDGRGVDSGRGIGTWQSGSPVVHDDVGGIASFNI 495

Query: 174 GTETTSS 180
           +
Sbjct: 496 TALGDGT 502

>UniRef50_C3X3W1 Predicted protein n=2 Tax=Oxalobacter formigenes HOxBLS RepID=C3X3W1_OXAFO
          Length = 365

 Score = 66.9 bits (161), Expect = 1e-09, Method: Composition-based stats.
 Identities = 34/160 (21%), Positives = 58/160 (36%), Gaps = 10/160 (6%)

Query: 73  NWSPWAQLYTSAHPPAEFYPVGAPIPWPSDTVPSGYALMQGQTFDKSAYPKLAVAYP--- 129
           W A+L AE P G I + P G+ G + SAYP+L
Sbjct: 201 RWEIMAELAGKLD-KAEKLPAGTIIAVGGNITPEGFLYCNGASLSPSAYPELCAVIGGTY 259

Query: 130 ------SGVIPDMRGWTIKGKPASGRAVLSQEQDGIKSHTHSASASSTDLGTETTSSFDY 183
           + +PD RG ++G +GR + + + + A A +T T + D
Sbjct: 260 GGDGLTTFNLPDFRGRWMQGNDTAGRVLAAGLPNVTGTIVSGAIAHATAYQTGAFYNIDV 319

Query: 184 GTKSTNNTGAHTHSISGTANSAGAHQHKSSGAFGGTNTSI 223
           G + G+ H +G S + +S + ++
Sbjct: 320 GAFGGYHAGSQNHYRAGFEASRSNPIYGASDTVRPPSITV 359

>UniRef50_A9IXL3 Phage-related protein n=6 Tax=Bartonella RepID=A9IXL3_BART1
          Length = 334

 Score = 65.7 bits (158), Expect = 2e-09, Method: Composition-based stats.
 Identities = 30/134 (22%), Positives = 49/134 (36%), Gaps = 23/134 (17%)

Query: 67  RDTTDANWSPW--------AQLYTSAHPPAEFYPVGAPIPWPSDTVPSGYALMQGQTFDK 118
           RD N W P E + G + S+ +PSG+ L G+ + +
Sbjct: 138 RDIAGKNADGWFLTNPTIKLPEIPPFPPLPESFSPGFIGTFASEKIPSGWLLCDGKEYSR 197

Query: 119 SAYPKLAVAYP----------SGVIPDMRGWTIKGKP-----ASGRAVLSQEQDGIKSHT 163
           Y L + +PD+RG ++G GR + S++++ KSHT
Sbjct: 198 KNYANLFAVLGETWGKGDGKTTFNVPDLRGMFLRGLDSGKEIDKGRLLGSRQEESFKSHT 257

Query: 164 HSASASSTDLGTET 177
           H ST +
Sbjct: 258 HEGKTDSTGKHQHS 271

>UniRef50_B5TAB1 Gp47 n=2 Tax=root RepID=B5TAB1_9CAUD
          Length = 325

 Score = 65.7 bits (158), Expect = 2e-09, Method: Composition-based stats.
 Identities = 36/194 (18%), Positives = 63/194 (32%), Gaps = 57/194 (29%)

Query: 132 VIPDMRGWTIKGKP-----ASGRAVLSQEQDGIKSHTHSASASSTDLGTETTSSFDYGTK 186
           +PD+RG ++ R + S + I+SH H+A
Sbjct: 183 RLPDVRGEGLRLWDNGRGVDQARTLGSWQGGAIESHGHAA-------------------- 222

Query: 187 STNNTGAHTHSISGTANSAGAHQHKSSGAFGGTNTSIFPNGYTAISNLSAGIMSTTSGSG 246
           N+G S G H H + + L +
Sbjct: 223 ---NSGDAGAVADRRTGSGGGHNHNNG----------------IFTRLLRAPYVGSITGS 263

Query: 247 QTRNAGKTSSDGAHTHSLSGTAASAGAHAHTVGIGAHTHSVA-IGSHGHTITVNAAGNAE 305
           T N+G + G G A +G H H + +G H H I+++A G E
Sbjct: 264 DTTNSGDEQAVG------------GGDSADIAAVGDHDHLIPGVGPHRHDISISATGGNE 311

Query: 306 NTVKNIAFNYIVRL 319
           ++N+A ++++
Sbjct: 312 TRMRNVAVAALIKI 325

>UniRef50_A0NQ95 Putative tail fiber-related protein n=1 Tax=Labrenzia aggregata IAM 12614 RepID=A0NQ95_9RHOB
          Length = 329

 Score = 65.3 bits (157), Expect = 3e-09, Method: Composition-based stats.
 Identities = 41/241 (17%), Positives = 72/241 (29%), Gaps = 73/241 (30%)

Query: 93  VGAPIPWPSDTVPSGYALMQGQTFDKSAYPKLAVAYPS----------GVIPDMRGWTIK 142
           G + T P G+ G ++AY L A + +PD+RG ++
Sbjct: 146 PGCVAYYAMSTAPDGWLKANGAEISRTAYADLFAAIGTIFGVGDGNSTFNLPDLRGEFLR 205

Query: 143 GKPAS-----GRAVLSQEQDGIKSHTHSASASSTDLGTETTSSFDYGTKSTNNTGAHTHS 197
           G + R + S + D SHTH
Sbjct: 206 GWDDARGVDGARVLGSSQSDQNASHTH--------------------------------- 232

Query: 198 ISGTANSAGAHQHKSSGAFGGTNTSIFPNGYTAISNLSAGIMSTTSGSGQTRNAGKTSSD 257
           S + H +G T Y +N G+ + + T +
Sbjct: 233 ----TGSTSSDSHSHTGTTNTTGNHTHNMAYEGGTNAGTGLAAPATSRSNTSPGPTVNYS 288

Query: 258 GAHTHSLSGTAASAGAHAHTVGIGAHTHSVAIGSHGHTITVNAAGNAENTVKNIAFNYIV 317
           G H+H+ S ++ S ++T +A+G +E +NIA +
Sbjct: 289 GNHSHTFSTSSDSHSH---------------------SVTTDASGGSEARPRNIALLACI 327

Query: 318 R 318
           +
Sbjct: 328 K 328

>UniRef50_UPI00019136B5 bacteriophage tail fiber protein n=7 Tax=Salmonella enterica subsp. enterica serovar Typhi RepID=UPI00019136B5
          Length = 137

 Score = 63.8 bits (153), Expect = 7e-09, Method: Composition-based stats.
 Identities = 20/51 (39%), Positives = 29/51 (56%), Gaps = 5/51 (9%)

Query: 120 AYPKLAVAYPSGVIPDMRGWTIKGKP-----ASGRAVLSQEQDGIKSHTHS 165
           YP LA AYP+ +PD+RG I+G +GRA+L + D ++H H
Sbjct: 1   MYPNLAKAYPTNKLPDLRGEFIRGWDDGRGVDAGRALLRLQDDSFEAHRHE 51

>UniRef50_B3Z3L3 Phage minor structural protein n=3 Tax=Bacillus cereus group RepID=B3Z3L3_BACCE
          Length = 679

 Score = 63.4 bits (152), Expect = 1e-08, Method: Composition-based stats.
 Identities = 38/159 (23%), Positives = 71/159 (44%), Gaps = 16/159 (10%)

Query: 151 VLSQEQDGIKSHTHSASASSTDLGTETTSSFDYGTKSTNNTGAHTHSISGTANSAGAHQH 210
           +L+ E + ++++ + + E+TS+ ST++ G + + +S G
Sbjct: 404 ILTYETEEFRAYSRATKGGGAIV--ESTSAGGAVVNSTSSGGG----VVNSTSSGGGSTQ 457

Query: 211 KSSGAFGGTNTSIFPNGYTAISNLSAGIMSTTSG---------SGQTRNAGKTSSDGAHT 261
           SS G T TS G + S G + +T+ SG N+ + + G H
Sbjct: 458 TSSSGGGSTQTSTSGGGGSFTSEAGGGAVPSTTQKSFAEMHLMSGVPENSIGSENWGNHL 517

Query: 262 HSLSGTAASAGAHAHTVGIGAHTHSVAIGSHGHTITVNA 300
           H + + +H+HTV + +H H V I +H H++T+ A
Sbjct: 518 HEIVINGDNF-SHSHTVTVPSHKHQVNIPAHSHSVTIPA 555

 Score = 59.9 bits (143), Expect = 1e-07, Method: Composition-based stats.
 Identities = 37/161 (22%), Positives = 65/161 (40%), Gaps = 11/161 (6%)

Query: 138 GWTIKGKPASGRAVLSQEQDGIKSHTHSASASSTDLGTETTSSFDYGTKSTNNTGAHTHS 197
           G I ++G AV++ S +++S+ G+ TSS G+ T+ +G
Sbjct: 422 GGAIVESTSAGGAVVN----STSSGGGVVNSTSSGGGSTQTSSSGGGSTQTSTSGG--GG 475

Query: 198 ISGTANSAGAHQHKSSGAFGGTNTSIFPNGYTAISNLSAGIMSTTSGSGQTRNAGKTSSD 257
           + GA + +F + + S + +G + T +
Sbjct: 476 SFTSEAGGGAVPSTTQKSFAEMHLMSGVPENSIGSENWGNHLHEIVINGDNFSHSHTVTV 535

Query: 258 GAHTHSLSGTAASAGAHAHTVGIGAHTHSVAIGSHGHTITV 298
           +H H ++ AH+H+V I AHTHSV I +H H I +
Sbjct: 536 PSHKHQVN-----IPAHSHSVTIPAHTHSVTIPNHTHEINI 571

 Score = 57.6 bits (137), Expect = 5e-07, Method: Composition-based stats.
 Identities = 38/165 (23%), Positives = 65/165 (39%), Gaps = 11/165 (6%)

Query: 138 GWTIKGKPASGRAVLSQEQDGIKSHTHSASASSTDLGTETTSSFDYGTKSTNNTGAHTHS 197
           G + +SG V++ S S SS+ G+ TS+ G T+ G +
Sbjct: 432 GGAVVNSTSSGGGVVN----STSSGGGSTQTSSSGGGSTQTSTSGGGGSFTSEAGG--GA 485

Query: 198 ISGTANSAGAHQHKSSGAFGGTNTSIFPNGYTAISNLSAGIMSTTSGSGQTRNAGKTSSD 257
           + T + A H SG + S + ++ S + + + +
Sbjct: 486 VPSTTQKSFAEMHLMSGVPENSIGSENWGNHLHEIVINGDNFSHSHTVTVPSHKHQVNIP 545

Query: 258 GAHTHSLSGTA----ASAGAHAHTVGIGAHTHSVAIGSHGHTITV 298
           AH+HS++ A + H H + I HTH + I +H HTIT+
Sbjct: 546 -AHSHSVTIPAHTHSVTIPNHTHEINIPNHTHEINIPNHTHTITL 589

>UniRef50_B0USC5 Phage Tail Collar domain protein n=1 Tax=Haemophilus somnus 2336 RepID=B0USC5_HAES2
          Length = 652

 Score = 63.0 bits (151), Expect = 1e-08, Method: Composition-based stats.
 Identities = 20/69 (28%), Positives = 30/69 (43%), Gaps = 8/69 (11%)

Query: 71  DANWSPWAQLYTSAHPPAEFYPVGAPIPWPSDT-VPSGYALMQGQTFDKSAYPKLAVAYP 129
           +WS W + PVG+ + +P P G+ G TF ++ YP L A
Sbjct: 213 GTSWSAWKDVGGDG------LPVGSVLAFPVAVQNPQGFLKCDGSTFGRTTYPDLYRALG 266

Query: 130 -SGVIPDMR 137
           S +PD+R
Sbjct: 267 NSNKLPDLR 275

>UniRef50_Q7Y2B3 Gp12 Short tail fibers n=2 Tax=unclassified T4-like viruses RepID=Q7Y2B3_9CAUD
          Length = 466

 Score = 61.9 bits (148), Expect = 3e-08, Method: Composition-based stats.
 Identities = 27/142 (19%), Positives = 48/142 (33%), Gaps = 16/142 (11%)

Query: 70  TDANWSPWAQLYTSAHPPAEFYPVGAPIPWPSDTVPSGYALMQGQTFDKSAYPKLAVAYP 129
           + S Q T P+G I + + + G++ +K+ YP+L A
Sbjct: 292 YKGSNSDGNQFVTKNELANHAMPIGGIILSGFNADRGDFLICNGRSLNKNQYPQLFSAIG 351

Query: 130 --------SGVIPDMRGWTIKGKP-----ASGRAVLSQEQDGIKSHTHSASASS---TDL 173
           + +PDMRG +G GR S ++D ++ T +
Sbjct: 352 YTFGGSGDNFNLPDMRGLVARGCDHGRNLDPGRRFGSYQEDAMQRITGKFPVADRWRGWY 411

Query: 174 GTETTSSFDYGTKSTNNTGAHT 195
           G T+ + + N G
Sbjct: 412 GGAFTAQRGQWSTNYKNGGGDD 433

>UniRef50_B5S308 Phage tail collar protein n=2 Tax=Ralstonia solanacearum RepID=B5S308_RALSO
          Length = 225

 Score = 61.9 bits (148), Expect = 3e-08, Method: Composition-based stats.
 Identities = 22/106 (20%), Positives = 41/106 (38%), Gaps = 20/106 (18%)

Query: 94  GAPIPWPSDTVPSGYALMQGQTFDKSAYPKLAVAYP----------SGVIPDMRGWTIKG 143
           G+ + T P+G+ G ++ Y +L + +P++R +G
Sbjct: 66  GSVAMFACKTPPAGWLKCNGAAVSRTTYERLFKLIGTTFGAGDGAATFNLPELRAEFPRG 125

Query: 144 KP-----ASGRAVLSQEQDGIKSHTHSASASSTDLGTETTSSFDYG 184
           SGRA S + + SH H T +G + ++ F +G
Sbjct: 126 WDDGRGVDSGRAFGSSQAQALSSHQH-----KTAVGFDGSNLFGWG 166

>UniRef50_A9ITY4 Phage related protein n=6 Tax=Bartonella RepID=A9ITY4_BART1
          Length = 376

 Score = 61.1 bits (146), Expect = 5e-08, Method: Composition-based stats.
 Identities = 43/271 (15%), Positives = 80/271 (29%), Gaps = 53/271 (19%)

Query: 70  TDANWSPWAQLYTSAHPPA--EFYPVGAPIPWPSDTVPSGYALMQGQTFDKSAYPKLAVA 127
           D + + W L + + + P G P+ + +P G+ L G+ + + Y L
Sbjct: 136 YDEDITGWQILNPTRGKVSFLKRLPSGLIGPFAMERLPDGWLLCDGRAYSRRTYRALFDG 195

Query: 128 YP----------SGVIPDMRGWTIKGKP-----ASGRAVLSQEQDGIKSHTHSA-----S 167
           + +PD RG ++G R+ SQ+ +K+H H +
Sbjct: 196 IGTTWGEGDGSTTFNVPDFRGMFLRGMDYERNLDPWRSFASQQGCSLKAHEHFIGPAFPN 255

Query: 168 ASSTDLGTETTSSFDYGTKSTNNTGAHTHSISGTANSAGAHQHKSSGAFGGTNTSIFPNG 227
           S+ + +SS T+ + G A + G+ P
Sbjct: 256 DHSSRKRRDVSSSQAPVTRRKRAIDEECLGLDGDALDKCNQEFD---QIAGSPQVEVPFW 312

Query: 228 YTAISNLSAGIMSTTSGSGQTRNAGKTSSDGAHTHSLSGTAASAGAHAHTVGIGAHTHSV 287
           +T + S +G T AH
Sbjct: 313 FTEKDKPARLPWFIRSPFANFLYYSTPIKEGVMT-----------AH------------- 348

Query: 288 AIGSHGHTITVNAAGNAENTVKNIAFNYIVR 318
           H H + + G E N++ Y ++
Sbjct: 349 ----HEHHLMAESVGGVETRPVNVSIVYGIK 375

>UniRef50_Q4C9U4 Phage Tail Collar n=1 Tax=Crocosphaera watsonii WH 8501 RepID=Q4C9U4_CROWT
          Length = 253

 Score = 61.1 bits (146), Expect = 6e-08, Method: Composition-based stats.
 Identities = 30/140 (21%), Positives = 45/140 (32%), Gaps = 27/140 (19%)

Query: 89  EFYPVGAPIPWPSDTVPSGYALMQGQTFDKSAYPKLAVAYPSG--------VIPDMRGWT 140
           P + + + P+G+ G +D S YP+L A G +PDMR +
Sbjct: 70  SIIPKSSIVVFGGAVAPNGWLFCDGTPYDPSTYPQLFSAIGYGFGQVGSLFRVPDMRDRS 129

Query: 141 IKGKPAS-------GRAVLSQEQDGIKS---------HTHSASASSTDLGTETTSSFDYG 184
           G S G A S D + + HTHS + + G
Sbjct: 130 PVGAGISFDRGTFGGSATTSLSVDNMPAHSHNVIDPGHTHSMNHGPGQHSAVALDYHNAG 189

Query: 185 T---KSTNNTGAHTHSISGT 201
           G H H+I +
Sbjct: 190 NGVDAYVPQWGGHAHTIYAS 209

>UniRef50_A9AVE2 Tail Collar domain protein n=1 Tax=Herpetosiphon aurantiacus ATCC 23779 RepID=A9AVE2_HERA2
          Length = 865

 Score = 60.7 bits (145), Expect = 6e-08, Method: Composition-based stats.
 Identities = 43/189 (22%), Positives = 60/189 (31%), Gaps = 32/189 (16%)

Query: 91  YPVGAPIPWPSDTVPSGYALMQGQTFDKSAYPKLAVAYPSGVIPDMRGWTIKGKPASGRA 150
           P G W VP G+A+ G+ + PD+R I G A+
Sbjct: 693 IPCGTIQMWSGMEVPEGWAICDGREAN------------GLRTPDLRNRFIVGAGAN--- 737

Query: 151 VLSQEQDGIKSHTHSASASSTDLGTETTSSFDYGTKSTNNTGAHTHSISGTANSAGAHQH 210
           S + S TT D + + HTH G+ N+AG H H
Sbjct: 738 ------------YDSGNLSVYGTNQGTTGGSDVVALTLDQMPRHTH--GGSTNAAGDHSH 783

Query: 211 KSSG--AFGGTNTSIFPNGYTAISNLSAGIMSTTSGSGQTRNAGKTSSDGAHTHSLSGTA 268
           G A G G T + G + + R T + G H+H L
Sbjct: 784 WVEGTDADGLAKRRRHHWGDTTVDMGFGGGRNADPNDERWRGRVNTDNAGTHSHGLM-IG 842

Query: 269 ASAGAHAHT 277
           G+ AH
Sbjct: 843 EVGGSQAHE 851

>UniRef50_C2RWX3 Phage minor structural protein n=1 Tax=Bacillus cereus BDRD-ST24 RepID=C2RWX3_BACCE
          Length = 695

 Score = 60.7 bits (145), Expect = 6e-08, Method: Composition-based stats.
 Identities = 35/147 (23%), Positives = 61/147 (41%), Gaps = 10/147 (6%)

Query: 161 SHTHSASASSTDLGTETTSSFDYGTKSTNNTGAHTHSISGTANSAGAHQHKSSGAFGGTN 220
           + A +ST G T S G + +TG +S + + G + SG GG+
Sbjct: 426 TGGGGAIIASTGAGGGTVQSTGGGGGTVQSTGGGGAQVSTSTSGGGVSKSTESG--GGST 483

Query: 221 TSIFPNGYTAISNLSAGIMSTTSGSGQTRNAGKTSSDGAHTHSLS--------GTAASAG 272
           + G + ++ + + SG N+ + + G H H + A +
Sbjct: 484 QTSGAGGGISTTSDHKTFLELSIMSGVPENSIGSENWGNHLHEIKIPGDYFTHNHAINLP 543

Query: 273 AHAHTVGIGAHTHSVAIGSHGHTITVN 299
           H H+V I H+H+ ++ SH H IT+N
Sbjct: 544 NHYHSVLIQPHSHNFSVPSHSHQITLN 570

>UniRef50_A9ITX5 Phage-related protein n=6 Tax=Bartonella RepID=A9ITX5_BART1
          Length = 333

 Score = 60.7 bits (145), Expect = 7e-08, Method: Composition-based stats.
 Identities = 27/132 (20%), Positives = 43/132 (32%), Gaps = 18/132 (13%)

Query: 75  SPWAQLYTSAHPPAE---FYPVGAPIPWPSDTVPSGYALMQGQTFDKSAYPKLAVAYP-- 129
           W + + P E YP G + VP + + G+ + + Y L
Sbjct: 139 DSWYLVNPTPMPREEESSLYPTGFIGTFGMRDVPKDWLICDGKAYLRRDYRDLFETIGTV 198

Query: 130 --------SGVIPDMRGWTIKGKP-----ASGRAVLSQEQDGIKSHTHSASASSTDLGTE 176
           + +PD RG ++G R S + D I+SH H S T
Sbjct: 199 WGEGDSVTTFNVPDFRGMFLRGVDGGSNLDPNRRFASVQTDLIQSHQHEGQTLSMPHFTS 258

Query: 177 TTSSFDYGTKST 188
           + +D T
Sbjct: 259 NENFWDGNTTEV 270

>UniRef50_Q31Q92 Putative uncharacterized protein n=2 Tax=Synechococcus elongatus RepID=Q31Q92_SYNE7
          Length = 387

 Score = 59.9 bits (143), Expect = 1e-07, Method: Composition-based stats.
 Identities = 27/114 (23%), Positives = 41/114 (35%), Gaps = 33/114 (28%)

Query: 86  PPAEFYPVGAPIPWPSDTVPSGYALMQGQTFDKSAYPKLAVAY----------------- 128
           A P G I +T P+GY G ++ Y +L AY
Sbjct: 231 IAALAVPAGVAIWVTGNTPPTGYIKANGALLSRTTYARL-WAYAQASGNIVSDAAWTGGA 289

Query: 129 ----------PSGVIPDMRGWTIKGK-----PASGRAVLSQEQDGIKSHTHSAS 167
           + +PD+RG I+G +GRA+ S + D +K+H H
Sbjct: 290 TGSYSTGDGSTTFRVPDLRGEFIRGWADGRSVDTGRAIGSTQADELKAHAHYLD 343

>UniRef50_Q7P176 Probable bacteriophge tail fiber protein n=1 Tax=Chromobacterium violaceum RepID=Q7P176_CHRVO
          Length = 435

 Score = 59.9 bits (143), Expect = 1e-07, Method: Composition-based stats.
 Identities = 25/132 (18%), Positives = 46/132 (34%), Gaps = 17/132 (12%)

Query: 80  LYTSAHPPAEFYPVGAPIPWPSDTVPSGYALMQGQTFDKSAYPKLAVA----YPSG---- 131
           + P G + P+G+ + G+T + YP L A Y +G
Sbjct: 272 DGATVTQVNAAAPAGMVAYFAMKDAPAGWLIADGRTVARKDYPALFAAIGGLYGNGDGST 331

Query: 132 --VIPDMRGWTIKGKP-----ASGRAVLSQEQDG--IKSHTHSASASSTDLGTETTSSFD 182
           +P++ G I+G +GRA+ S + + + + + D + S+
Sbjct: 332 TFGLPNLCGEFIRGWDNGRGVDTGRAIGSSQISTQLLVDNDGLQTVGAIDWSSNNLSALG 391

Query: 183 YGTKSTNNTGAH 194
           Y N H
Sbjct: 392 YEPAQANAANLH 403

>UniRef50_C3X8I7 Putative uncharacterized protein n=1 Tax=Oxalobacter formigenes OXCC13 RepID=C3X8I7_OXAFO
          Length = 369

 Score = 59.2 bits (141), Expect = 2e-07, Method: Composition-based stats.
 Identities = 43/223 (19%), Positives = 77/223 (34%), Gaps = 25/223 (11%)

Query: 91  YPVGAPIPWPSDTVPSGYALMQGQTFDKSAYPKLAVAYP----------SGVIPDMRGWT 140
           PVG+ + + T P+GY G + YP+L A + +PD+ G
Sbjct: 106 IPVGSIDYFATSTPPAGYLKADGSEVGRETYPELFTAIGTVFGEGNGDSTFNLPDLMGRF 165

Query: 141 IKGKPASGRAVLSQEQDGIKSHTHSASASSTDLGTETTSSFDYGTKSTNNTGA-HTHSIS 199
           +G G+ + + G+ H H + + + YG +T G +T S +
Sbjct: 166 AQGSTIVGQRIKA----GLPDHKHIEGFAGVNPNS------SYGVATTAPQGNINTQSGT 215

Query: 200 GTANSAGAHQHKSSGAFGGTNTSIFPNGYTAISNLSAGIMSTTSGSGQTRNAGKTSSDGA 259
           +N S G + ++ P T + + A +T G + + A
Sbjct: 216 SVSNHPYTSPASLSNPIYGASDTVQPPALTLLPCIKAFDAATGPGLIDVTGLSQEIALKA 275

Query: 260 HTHSLSGTAASAGAHAHTVGIGAHTHSVAIGSHGHTITVNAAG 302
           +L G G H V H + + + V+ G
Sbjct: 276 DKKNLFG----IGQTYHDVTHERQNHVIYTNTSSKPLFVSIYG 314

>UniRef50_C3X1Y2 Tail fiber protein gpH n=1 Tax=Oxalobacter formigenes HOxBLS RepID=C3X1Y2_OXAFO
          Length = 480

 Score = 58.8 bits (140), Expect = 3e-07, Method: Composition-based stats.
 Identities = 32/177 (18%), Positives = 63/177 (35%), Gaps = 20/177 (11%)

Query: 83  SAHPPAEFYPVGAPIPWPSDTVPSGYALMQGQTFDKSAYPKLAVAYP----------SGV 132
           P+G + T P+GY G ++ YP L A +
Sbjct: 181 MGWKYPSGVPIGTVEYFAMATPPAGYLKADGAAVGRATYPDLFAAIGTTFGAGDGETTFN 240

Query: 133 IPDMRGWTIKGKPASGRAVLSQEQDGIKSHT-----HSASASSTDLGTETTSSFDYGTKS 187
           +PDM G +G G + ++ G+ + T H S+ + G+ TS ++
Sbjct: 241 LPDMIGRFAEGSATPG----TVKEAGLPNITGEINGHFGSSVAFGTGSLFTSIGGSRYRA 296

Query: 188 T-NNTGAHTHSISGTANSAGAHQHKSSGAFGGTNTSIFPNGYTAISNLSAGIMSTTS 243
           T + TG + + S + + +S ++ P + ++ G++ T
Sbjct: 297 TPDGTGGEAFFAAFISASRSSPIYGNSDTVQPPALTLLPCIKAFDAAVNPGLIDVTE 353

>UniRef50_B0UTN0 Phage Tail Collar domain protein n=1 Tax=Haemophilus somnus 2336 RepID=B0UTN0_HAES2
          Length = 699

 Score = 58.4 bits (139), Expect = 3e-07, Method: Composition-based stats.
 Identities = 27/135 (20%), Positives = 48/135 (35%), Gaps = 18/135 (13%)

Query: 11  QGAAGLELYEVYNNGYPTAYGNIIHLKGMTAVGEGELLIGWSGTSGAHAPAFIRSRRDT- 69
           Q G + + Y + + HL E++ G T+ R DT
Sbjct: 272 QVKTGSDDVDNYKTDGHYYFASSQHLPDNNGAWHVEVVSGGQTTAVRQIA---RKANDTK 328

Query: 70  ------TDANWSPWAQLYTSAHPPAEFYPVGAPIPWPSD-TVPSGYALMQGQTFDKSAYP 122
           + + W+ W + P+GA + +P T P+G+ G T D+ YP
Sbjct: 329 VKTRFFSGSKWTEWKDIGGDG------VPLGAIVAFPKAITNPTGFLKCDGTTIDQRTYP 382

Query: 123 KLAVAYPS-GVIPDM 136
           L + +P++
Sbjct: 383 DLYRTLGNKNTLPNL 397

>UniRef50_P51735 Probable tail fiber protein n=27 Tax=root RepID=VPH_BPHP1
          Length = 925

 Score = 57.6 bits (137), Expect = 5e-07, Method: Composition-based stats.
 Identities = 35/141 (24%), Positives = 49/141 (34%), Gaps = 17/141 (12%)

Query: 6   LTDNTQGAAGLELYEVYNNGYPTAYGNIIHLKGMTAVGEGELLIGWSGTSGAHAPAFI-R 64
           L G +E + NGY T GN G GE I +A I R
Sbjct: 449 LAGYGIGNFKVEQGQGDANGYKTD-GNYYLASGQNLPENGEWHIEVVSGGATNAVRQIAR 507

Query: 65  SRRDT-------TDANWSPWAQLYTSAHPPAEFYPVGAPIPWPSD-TVPSGYALMQGQTF 116
           D +NWS W P+G+ + +P T P G+ G TF
Sbjct: 508 KANDNKIKTRFFNGSNWSEWKDAGGDG------VPIGSVVSFPRAVTNPVGFLKANGTTF 561

Query: 117 DKSAYPKLAVAYP-SGVIPDM 136
           ++ +P L S +PD+
Sbjct: 562 NQQTFPDLYRTLGDSNQLPDL 582

>UniRef50_C3X192 Predicted protein n=1 Tax=Oxalobacter formigenes HOxBLS RepID=C3X192_OXAFO
          Length = 361

 Score = 57.2 bits (136), Expect = 7e-07, Method: Composition-based stats.
 Identities = 30/146 (20%), Positives = 51/146 (34%), Gaps = 15/146 (10%)

Query: 91  YPVGAPIPWPSDTVPSGYALMQGQTFDKSAYPKLAVAYP----------SGVIPDMRGWT 140
           P+G + T P+GY G ++ YP L A + +PDM G
Sbjct: 80  VPIGTVEYFAMATPPAGYLKADGAAVGRATYPDLFAAIGTTFGAGDGETTFNLPDMIGQF 139

Query: 141 IKGKPASGRAVLSQEQDGIKSHTHSASA-SSTDLGTETTSSFDYGTKSTNNTGAHTHSIS 199
           +G G + ++ G+ + S S +S + S +NN S
Sbjct: 140 AEGSATPG----AVKEAGLPNIIGSISNVASGGANASSASGALSIAARSNNNMTPGSSAY 195

Query: 200 GTANSAGAHQHKSSGAFGGTNTSIFP 225
           G + + + +G +NT P
Sbjct: 196 GHTFALAINASDFNPIYGKSNTVQPP 221

>UniRef50_C3X3R8 Predicted protein n=4 Tax=Oxalobacter formigenes HOxBLS RepID=C3X3R8_OXAFO
          Length = 365

 Score = 56.8 bits (135), Expect = 1e-06, Method: Composition-based stats.
 Identities = 32/202 (15%), Positives = 65/202 (32%), Gaps = 21/202 (10%)

Query: 91  YPVGAPIPWPSDTVPSGYALMQGQTFDKSAYPKLAVAYP----------SGVIPDMRGWT 140
           PVG+ P+GY G + YP L A + +PDM G
Sbjct: 88  VPVGSIDWLAVPEPPAGYLKCDGAAIGRDTYPDLFAAIGTTFGAGDGETTFNLPDMIGRF 147

Query: 141 IKGKPASGRAVLSQEQDGIKSHTHSASASSTDLGTETTSSFDY-----GTKSTNNTGAHT 195
           +G G ++ G+ + + ++ +TSS + N + ++T
Sbjct: 148 AEGSATPGIK----KEAGLPNVSGVSAVEGCINKGSSTSSGPFTYWRENNLILNTSPSNT 203

Query: 196 HSISGTAN--SAGAHQHKSSGAFGGTNTSIFPNGYTAISNLSAGIMSTTSGSGQTRNAGK 253
           H + G S G + +S ++ P + +++G++ T + +
Sbjct: 204 HDLGGEIFSLSNGNPIYGNSDTVQPPALTLLPCIKAFDAAVNSGLIDITELANEVTGKAD 263

Query: 254 TSSDGAHTHSLSGTAASAGAHA 275
           + + H
Sbjct: 264 KTQVANLAMPSDTGISVTVPHT 285

>UniRef50_B8HZW5 Tail Collar domain protein n=2 Tax=Clostridium RepID=B8HZW5_CLOCE
          Length = 368

 Score = 56.8 bits (135), Expect = 1e-06, Method: Composition-based stats.
 Identities = 49/242 (20%), Positives = 79/242 (32%), Gaps = 76/242 (31%)

Query: 91  YPVGAPIPWPSDTV-----PSGYALMQGQTFDKSAYPKLAVA---------YPSGVIPDM 136
           +PVG IP+ SG+ G+ DK+ Y +L P+ IPD+
Sbjct: 5   FPVGMVIPFAGPLKEDQLKSSGWVPCDGRVLDKTQYSELFDVIGTKYGGDGIPNFNIPDL 64

Query: 137 RGWTIKGKPASGRAVLSQEQDGIKSHTHSASASSTDLGTETTSSFDYGTKSTNNTGAHTH 196
           RG ++ GR + D + AS S G T S +Y T N
Sbjct: 65  RGRFVRA-TDHGR---GYDPDAQRR---KASKSGGAAGDNTGSVQEYATAKPKN------ 111

Query: 197 SISGTANSAGAHQHKSSGAFGGTNTSIFPNGYTAISNLSAGIMSTTSGSGQTRNAGKTSS 256
           + N G H H ++ TS+
Sbjct: 112 --NFITNDKGNHNH---------------------------LVDHLPTDYWNAACAITSN 142

Query: 257 DGAHTHSLSGTAASAGAHAHTVGIGAHTHSVAIGSHGHTITVNAAGNAENTVKNIAFNYI 316
           +GA+ + T+ AG H+HT+ G G++E+ N+ +I
Sbjct: 143 EGANFPGRTATSGEAGQHSHTIVSG--------------------GDSESRPVNLYMYWI 182

Query: 317 VR 318
           ++
Sbjct: 183 IK 184

 Score = 48.0 bits (112), Expect = 4e-04, Method: Composition-based stats.
 Identities = 41/225 (18%), Positives = 79/225 (35%), Gaps = 33/225 (14%)

Query: 42  VGEGELLIGWSGTSGAHAPAFIRSRRDTTDANWSP------WAQLYTSAHPPAEFY-PVG 94
           EG G + TSG A + D+ P W +TS+ P G
Sbjct: 141 SNEGANFPGRTATSG-EAGQHSHTIVSGGDSESRPVNLYMYWIIKFTSSDYDESILLPAG 199

Query: 95  APIPWPSDTV-------PSGYALMQGQTFDKSAYPKLAVAYPS--------GVIPDMRGW 139
           + + + D+V +G+ G +++ + YP L + +PD+RG
Sbjct: 200 SIVSFAGDSVKKSNELIANGWLPCIGSSYEANKYPDLYENISNIYGGDQNKFNVPDLRGL 259

Query: 140 TIKGKPASGRAVLSQEQDGIKSHTHSASASSTDLGTETTSSFDYGTKSTNNTGAHTHSIS 199
           I+G ++ + E G+ H + + D T + ++ + + GAHTHS
Sbjct: 260 FIRGVNSN-----TSETPGV--HGATRVGQTEDYSTALPKTLNF---TLSTDGAHTHSAP 309

Query: 200 GTANSAGAHQHKSSGAFGGTNTSIFPNGYTAISNLSAGIMSTTSG 244
           + + ++ + ++ AG + T
Sbjct: 310 KLPQDKYIENYCAGHEVANFPSNQYTGNNGNHAHTIAGGDAETRP 354

>UniRef50_C3LHF1 Phage minor structural protein n=13 Tax=Bacteria RepID=C3LHF1_BACAC
          Length = 657

 Score = 56.8 bits (135), Expect = 1e-06, Method: Composition-based stats.
 Identities = 30/123 (24%), Positives = 50/123 (40%), Gaps = 9/123 (7%)

Query: 176 ETTSSFDYGTKSTNNTGAHTHSISGTANSAGAHQHKSSGAFGGTNTSIFPNGYTAISNLS 235
           + +SS K++++ G H H + G G P T+ S +
Sbjct: 454 QASSSGGGTVKASSSGGDHVHKMF----HGGGIVPAEPSTIGLYTAFSDPGRNTSASFYA 509

Query: 236 AGIMSTTSGSGQTRNAGKTSSDGAHTHSLSGTAASAGAHAHTVGIGAHTHSVAIGSHGHT 295
           G S+ G + N S HTHS++ H H++ I +TH ++I +H H
Sbjct: 510 KGTGSSFYTYGSSGNHTHDISIPNHTHSIN-----IPNHTHSISIPNYTHDISIPNHTHD 564

Query: 296 ITV 298
           IT+
Sbjct: 565 ITL 567

>UniRef50_D1BW55 Tail Collar domain protein n=1 Tax=Xylanimonas cellulosilytica DSM 15894 RepID=D1BW55_XYLCX
          Length = 443

 Score = 56.5 bits (134), Expect = 1e-06, Method: Composition-based stats.
 Identities = 48/252 (19%), Positives = 91/252 (36%), Gaps = 58/252 (23%)

Query: 88  AEFYPVGAPIPWPSDTVPSGYALMQGQTFDKSAYPKLAVAYPS----------GVIPDMR 137
           PVG + TVP G+ G ++ YP L Y + +P+ +
Sbjct: 223 NAACPVGMEAGFH--TVPPGWLEHNGAAVSRTTYPALFAHYGTTYGAGDGSTTFNLPNAK 280

Query: 138 GWTIKGKPAS-----------GRAVLSQEQDGIKSHTHSASASSTDLGTETTSSFDYGTK 186
           G T G + G + + SHTH+++A + + + D+
Sbjct: 281 GRTPVGLDTAQAEFNAVGKTGGAKTHTLSTAEMPSHTHTSAAHT------HSINHDHAAV 334

Query: 187 STNNTGAHTHSISGTANSAGAHQHKSSGAFGGTNTSIFPNGYTAISNLSAGIMSTTSGSG 246
           ++++ G+HTH + + + + G+ ++ ++ NG T
Sbjct: 335 TSSSAGSHTH--GSSTSGITDRAYFARGSAPASSATVGTNGVTP---------------- 376

Query: 247 QTRNAGKTSSDGAHTHSLSGTAASAGAHAHTVGIGAHTHSVAIGSHGHTITVNAAGNAEN 306
           G + +S T ASAGAH HTV + + + + + G T + + N
Sbjct: 377 -----------GPWDYVVSSTLASAGAHTHTVDLPSFSGTSGSTTPGATGSTGSGSAHNN 425

Query: 307 TVKNIAFNYIVR 318
           + + VR
Sbjct: 426 LPPYLVRRWCVR 437

>UniRef50_D1NFN8 Apo-citrate lyase phosphoribosyl-dephospho-CoA transferase (Fragment) n=1 Tax=Haemophilus influenzae HK1212 RepID=D1NFN8_HAEIN
          Length = 301

 Score = 56.1 bits (133), Expect = 1e-06, Method: Composition-based stats.
 Identities = 21/80 (26%), Positives = 33/80 (41%), Gaps = 8/80 (10%)

Query: 59  APAFIRSRRDTTDANWSPWAQLYTSAHPPAEFYPVGAPIPWPSD-TVPSGYALMQGQTFD 117
           + R + +WS W +L T P GA + +P T P G+ G TF+
Sbjct: 39  TDVYERHQTSYQTDSWSAWKKLNTDG------IPTGAVVSFPRAVTNPVGFLKANGSTFN 92

Query: 118 KSAYPKLAVAYP-SGVIPDM 136
           + +P L S +PD+
Sbjct: 93  QQTFPDLYRVLGNSNQLPDL 112

>UniRef50_A3YFP9 35 kDa protein-like n=1 Tax=Marinomonas sp. MED121 RepID=A3YFP9_9GAMM
          Length = 207

 Score = 55.7 bits (132), Expect = 2e-06, Method: Composition-based stats.
 Identities = 23/113 (20%), Positives = 40/113 (35%), Gaps = 39/113 (34%)

Query: 92  PVGAPI------------PWPSDTVPSGYALMQGQTFDKSAYPKLAVA----YPSGV--- 132
           PVG+ I P+ ++ + G + + + YP+L A Y
Sbjct: 33  PVGSVIAFAGEIRTSGDKPFETNLPMFNWLKCDGSSLEVAQYPELFSALGYRYGGSGQKF 92

Query: 133 -IPDMRGWTIKGKPAS----------GR---------AVLSQEQDGIKSHTHS 165
           +PD+RG ++G GR V S + ++SH H+
Sbjct: 93  NLPDLRGEFLRGVDVDSSNNKKASLEGRKGAANGGNHEVGSTQGFALQSHVHT 145

>UniRef50_C3X8V5 Bacteriophage tail fiber protein n=2 Tax=Oxalobacter formigenes OXCC13 RepID=C3X8V5_OXAFO
          Length = 480

 Score = 55.7 bits (132), Expect = 2e-06, Method: Composition-based stats.
 Identities = 35/173 (20%), Positives = 55/173 (31%), Gaps = 33/173 (19%)

Query: 86  PPAEFYPVGAPIPWPSDTVPSGYALMQGQTFDKSAYPKLAVAYPSG----------VIPD 135
           + PVG + + T P+GY G + YP L A + +PD
Sbjct: 191 IAKKGVPVGTIEYFATSTPPAGYLKADGAAVGRETYPDLFAAIGTAFGEGDGSTTFNLPD 250

Query: 136 MRGWTIKGKPASGRAVLSQEQDGIKSHTHSASASSTDLGTETTSSFDYGTKSTNNTGAHT 195
           + G +G G+ + + G+ + S +G +GA
Sbjct: 251 LIGRFAQGSDVPGQKL----EAGLPNAIGKLSG-------------FFGFTPVYKSGA-- 291

Query: 196 HSISGTANSAGAHQHKSSGAFGGTNTSIFPNGYTAISNLSAGIMSTTSGSGQT 248
           T SAG A GG +++ N + SN G T T
Sbjct: 292 ---LSTTGSAGVQFETIGVA-GGASSNKIINLDLSESNPIYGASDTVQPPALT 340

>UniRef50_A9LZ37 Tail fibre protein, putative n=21 Tax=Neisseria RepID=A9LZ37_NEIM0
          Length = 658

 Score = 55.7 bits (132), Expect = 2e-06, Method: Composition-based stats.
 Identities = 27/92 (29%), Positives = 41/92 (44%), Gaps = 11/92 (11%)

Query: 50  GWSGTSGAHA---PAFIRSRRDTTDANWSPWAQLYTSAHPPAEFYPVGAPIPWPSDT-VP 105
           GW G A + R + + + NWS W +L + PVGA + +P P
Sbjct: 212 GWCRQLGYPAYTSDVYERHQTSSANDNWSAWKKLNSDG------IPVGAIVSFPKAVRNP 265

Query: 106 SGYALMQGQTFDKSAYPKLAVAYP-SGVIPDM 136
           +GY G TF ++ +P L A S +PD+
Sbjct: 266 AGYLRADGTTFAQNTFPDLYRALGNSNRLPDL 297

 Score = 47.6 bits (111), Expect = 6e-04, Method: Composition-based stats.
 Identities = 40/210 (19%), Positives = 71/210 (33%), Gaps = 21/210 (10%)

Query: 82  TSAHPPAEFYPVGAPIPWPSDTVPSGYALMQG--QTFDKSAYPKLAVA----YPS-GVIP 134
           ++ P +G +PSD +P+G+ ++AYP+L Y S +P
Sbjct: 291 SNRLPDLSRTDIGITAWFPSDQIPTGWLAFDDIRTRVTETAYPELYRLLTGKYGSIQNVP 350

Query: 135 DMRGWTIKGKPASGRAVLSQEQDGIKSHTHSASASSTDLGTETTSSFDYGTKS------- 187
           I+ + AV ++++D IK H H + T ++ Y ++
Sbjct: 351 QAEDRFIR-NAGNSLAVGTKQEDEIKRHVHKVFSHWT--NHTDAAALGYEDRNERQRSAL 407

Query: 188 TNNTGAHTHSISGTANSAGAHQHKSSGAFGGTNTSIFPNGYTAISNLSAGIMSTTSGSGQ 247
           + + +G + + G + A L + G+
Sbjct: 408 VSTWTDENLNDNGFLTPRSDSKMATGGDENRPKALVLKLCIKAADTLGEAVFW-IKSHGE 466

Query: 248 TRNAGKTSSDGAHTHSLSGTAASAGAHAHT 277
           T NAG G +L A A H HT
Sbjct: 467 TINAGALD-AGTLAQNLQDKADRA--HTHT 493

>UniRef50_C3X909 Predicted protein n=2 Tax=Oxalobacter formigenes OXCC13 RepID=C3X909_OXAFO
          Length = 549

 Score = 55.3 bits (131), Expect = 3e-06, Method: Composition-based stats.
Identities = 25/126 (19%), Positives = 49/126 (38%), Gaps = 19/126 (15%) Query: 71 DANWSPWAQLYTSAHPP-----AEFYPVGAPIPWPSDTVPSGYALMQGQTFDKSAYPKLA 125 D + A ++ +P PVGA + T P+GY G ++ YP L Sbjct: 239 DLQFDEEALIWILQNPASGVVCPSGVPVGAIGYFAMQTPPAGYLKADGSAVSRATYPDLF 298 Query: 126 VAYP----------SGVIPDMRGWTIKGKPASGRAVLSQEQDGIKSHTHSASASSTDLGT 175 A + +PD+ +G G + + G+ + T S + ++++ G+ Sbjct: 299 GAIGTTFGEGDGSTTFNLPDLIDRFAQGNATPGLKI----EAGLPNITGSLTVTASNQGS 354 Query: 176 ETTSSF 181 + +F Sbjct: 355 AASGAF 360 >UniRef50_A1HR57 Putative uncharacterized protein n=1 Tax=Thermosinus carboxydivorans Nor1 RepID=A1HR57_9FIRM Length = 269 Score = 54.5 bits (129), Expect = 4e-06, Method: Composition-based stats. Identities = 46/206 (22%), Positives = 69/206 (33%), Gaps = 50/206 (24%) Query: 29 AYGNIIHLKGMTAVGEGELLIGWSGTSGAHAPAFIRSRRDTTDANWSPWAQLYTSAHPPA 88 A G+I + + E ++ +G SG P + D + W Sbjct: 57 AVGDICYSPNAPSYTRMECVV--AGRSGTTEPTWPTVGNMVVDGTVT-WI-----VDDVR 108 Query: 89 EFYPVGAPIPWPSDTVPSGYALMQGQTFDKSAYPKLAVAY-------------------- 128 + PVG + S GY G ++AYP+L AY Sbjct: 109 DGTPVGRIVAEISPICRPGYLKANGALVSRAAYPRL-WAYVQARGLVVPDTVWPANYWGC 167 Query: 129 -------PSGVIPDMRGWTIKGKPASGR-----AVLSQEQDGIKSHTHSASASS------ 170 + +PD+RG I+G A S + DGIKSH H + Sbjct: 168 FSTGDGSTTFRLPDLRGEFIRGGDDGRGVDGGRAFGSWQADGIKSHNHPYQSQPYLFVES 227 Query: 171 ---TDLGTETTSSFDYGTKSTNNTGA 193 D+ E TS+ + T T+N G Sbjct: 228 FDGGDVIAERTSTAKWVTHYTSNFGG 253 >UniRef50_C6S6V6 Putative phage tail fibre protein n=1 Tax=Neisseria meningitidis alpha14 RepID=C6S6V6_NEIML Length = 728 Score = 54.5 bits (129), Expect = 4e-06, Method: Composition-based stats. 
Identities = 27/92 (29%), Positives = 41/92 (44%), Gaps = 11/92 (11%) Query: 50 GWSGTSGAHA---PAFIRSRRDTTDANWSPWAQLYTSAHPPAEFYPVGAPIPWPSDT-VP 105 GW G A + R + + + NWS W +L + PVGA + +P P Sbjct: 282 GWCRQLGYPAYTSDVYERHQVSSANDNWSAWKKLNSDG------IPVGAIVSFPKAVRNP 335 Query: 106 SGYALMQGQTFDKSAYPKLAVAYP-SGVIPDM 136 +GY G TF ++ +P L A S +PD+ Sbjct: 336 AGYLRADGTTFAQNTFPDLYRALGNSNRLPDL 367 Score = 48.4 bits (113), Expect = 3e-04, Method: Composition-based stats. Identities = 41/208 (19%), Positives = 69/208 (33%), Gaps = 17/208 (8%) Query: 82 TSAHPPAEFYPVGAPIPWPSDTVPSGYALMQG--QTFDKSAYPKLAVA----YPS-GVIP 134 ++ P +G +PSD +P+G+ ++AYP+L Y S +P Sbjct: 361 SNRLPDLSRTDIGITAWFPSDQIPTGWLAFDDIRTRVTETAYPELYRLLTGKYGSIQNVP 420 Query: 135 DMRGWTIKGKPASGRAVLSQEQDGIKSHTHSASASSTDLGTETTSSFDYGTKS-----TN 189 I+ + AV ++++D IK HTH + T ++ G + + Sbjct: 421 QAEDRFIR-NAGNSLAVGTKQEDEIKRHTHKVFSHWTSHTDVAAVGYEDGNERQRSALVS 479 Query: 190 NTGAHTHSISGTANSAGAHQHKSSGAFGGTNTSIFPNGYTAISNLSAGIMSTTSGSGQTR 249 S +G + + G + A L + G+T Sbjct: 480 TWTDENLSDNGFLTPRLDSKMATGGDENRPKALVLKLCIKAADTLGEAVFW-IKSHGETV 538 Query: 250 NAGKTSSDGAHTHSLSGTAASAGAHAHT 277 NAG G L A H HT Sbjct: 539 NAGALD-AGTLEQGLQDKADR--DHTHT 563 >UniRef50_A7INV5 Tail Collar domain protein n=1 Tax=Xanthobacter autotrophicus Py2 RepID=A7INV5_XANP2 Length = 492 Score = 53.8 bits (127), Expect = 8e-06, Method: Composition-based stats. 
Identities = 29/139 (20%), Positives = 47/139 (33%), Gaps = 18/139 (12%) Query: 93 VGAPIPWPSDTVPSGYALMQGQTFDKSAYPKLAVAYP----------SGVIPDMRGWTIK 142 G WP+ T PSG + G T ++ Y L + +P+ G ++ Sbjct: 357 PGTIAMWPASTPPSGALVRNGATLSRTVYASLFAVIGTTFGAGDGATTFGVPNDLGIFVR 416 Query: 143 GKP-----ASGRAVLSQEQDGIKSHTHSASASSTDLGTETTSSFDYGTKSTNNTGAHTHS 197 G +GR S++ D KSH H+ S G T + + + +T S Sbjct: 417 GWDNGRGYDTGRVFGSEQADDNKSHDHARQTVS---GVFTAGGAGFALQDSGSTTQRVAS 473 Query: 198 ISGTANSAGAHQHKSSGAF 216 G + F Sbjct: 474 SGGAEARPKNRAYLPIIYF 492 >UniRef50_C4VIX0 74kDa protein n=28 Tax=root RepID=C4VIX0_ENTFA Length = 671 Score = 53.4 bits (126), Expect = 1e-05, Method: Composition-based stats. Identities = 30/119 (25%), Positives = 46/119 (38%), Gaps = 16/119 (13%) Query: 188 TNNTGAHTHSISGTANSAGAHQHKSSGAFGGTNTSIFPNGYTAISNLSAGIMSTTSGSGQ 247 TN G T+++ G+H H N P L AG + Sbjct: 467 TNTDGGSAQ----TSSANGSHDHLM------FNVIQGPPQTLPKITLRAGGGGEIYTEAR 516 Query: 248 TRNAGKTSSDGAHTHSLSGTAAS------AGAHAHTVGIGAHTHSVAIGSHGHTITVNA 300 S+ HTH+++ + S AH+H V I HTHS+++ SH H + + A Sbjct: 517 GGTFRTASAADNHTHTVNVPSHSHRFNIDIPAHSHVVSIPNHTHSISVPSHSHQVRIPA 575 >UniRef50_C3X3K6 Predicted protein n=2 Tax=Oxalobacter formigenes HOxBLS RepID=C3X3K6_OXAFO Length = 500 Score = 53.4 bits (126), Expect = 1e-05, Method: Composition-based stats. Identities = 24/126 (19%), Positives = 47/126 (37%), Gaps = 12/126 (9%) Query: 91 YPVGAPIPWPSDTVPSGYALMQGQTFDKSAYPKLAVAYP----------SGVIPDMRGWT 140 PVG + + + P+GY G + YP L A + +PDM G Sbjct: 213 IPVGTVVMFSASEAPAGYLKCDGAAVGRDTYPDLFAAIGTVFGAGDGETTFNLPDMIGRF 272 Query: 141 IKGKPASGRAVLSQEQD--GIKSHTHSASASSTDLGTETTSSFDYGTKSTNNTGAHTHSI 198 +G G + D G + ++ ++ + T++ + TN+ A++ + Sbjct: 273 AEGSLTPGTVKEAGLPDVTGTIRLSDNSQINAVEADKIATANGAFSRVRTNSPTAYSTAS 332 Query: 199 SGTANS 204 A + Sbjct: 333 VDVATT 338 >UniRef50_B8FJJ3 Tail Collar domain protein n=1 Tax=Desulfatibacillum alkenivorans AK-01 RepID=B8FJJ3_DESAA Length = 264 Score = 53.4 bits (126), Expect = 1e-05, Method: Composition-based stats. 
Identities = 24/117 (20%), Positives = 42/117 (35%), Gaps = 24/117 (20%) Query: 90 FYPVGAPIPWPSDTVPSGYALMQGQTFDKSAYPKLAVAY----------PSGVIPDMRGW 139 P G+ + + + PSG+ G ++ Y L + +PD+RG+ Sbjct: 114 INPTGSVVAFMGASAPSGWLECSGAAVSRTTYDNLFSVISTMYGVGDGSTTFNLPDLRGY 173 Query: 140 TIKGKPASGRAVLSQEQDGIKSHTHSASASSTDLGTETTSSFDYGTKSTNNTGAHTH 196 ++G S + S TD G T + GT+ + +HTH Sbjct: 174 FLRGWSHG-------------SGKDPDAGSRTDRGDGTCGDY-VGTRQEDEFASHTH 216 >UniRef50_B8QTW7 Putative tail fiber protein n=1 Tax=Erwinia phage phiEa21-4 RepID=B8QTW7_9CAUD Length = 357 Score = 53.0 bits (125), Expect = 1e-05, Method: Composition-based stats. Identities = 37/159 (23%), Positives = 55/159 (34%), Gaps = 11/159 (6%) Query: 126 VAYPSGVIPDMRGWTIKGKPASGRAVLSQEQDG---IKSHTHSASASSTDLGTETTSSFD 182 A PS IP + +G+ +S D + +S+ + + Sbjct: 169 SANPSTYIP------VGTWALTGQGRVSVGYDAGNSSRPAGTKFGSSTVTIDVANLPAHT 222 Query: 183 YGTKSTNNTGAHTHSISGTANSAGAHQHKSSGAFGGTNTSIFPNGYTAISNLSAGIMSTT 242 +G T G H+H SG+ AGAH H +SG G T + G Sbjct: 223 HG--VTVTGGNHSHGASGSTTGAGAHNHVASGNTGYAGDHNHTYTTTRQGGGNPGNHVGH 280 Query: 243 SGSGQTRNAGKTSSDGAHTHSLSGTAASAGAHAHTVGIG 281 + T G HTH +S + G HAH + I Sbjct: 281 GSNEIHYTNEATGVAGGHTHYVSLATNTVGDHAHGLNIN 319 >UniRef50_C3X971 Predicted protein n=6 Tax=Oxalobacter formigenes OXCC13 RepID=C3X971_OXAFO Length = 534 Score = 53.0 bits (125), Expect = 1e-05, Method: Composition-based stats. 
Identities = 30/179 (16%), Positives = 64/179 (35%), Gaps = 16/179 (8%) Query: 57 AHAPAFIRSRRDTTDANWSPWAQLYTSAHPPAEFYPVGAPIPWPSDTVPSGYALMQGQTF 116 A A ++ + D W Q P P+G + T P+GY G+ Sbjct: 217 AGAGYWLELQYDEALDKW--VLQNPAKGISPLNGVPIGTVEYFAMSTPPAGYLKADGRAV 274 Query: 117 DKSAYPKLAVAYP----------SGVIPDMRGWTIKGKPASGRAVLSQEQDGIKSHTHSA 166 + Y +L + +PD+ +G G+ + + G+ + Sbjct: 275 GRETYAELYSVIGTTFGEGDEQTTFNLPDLIDRFAQGSNTPGQKI----EAGLPNIEGVI 330 Query: 167 SASSTDLGTETTSSFDYGTKSTNNTGAHTHSISGTANSAGAHQHKSSGAFGGTNTSIFP 225 + S + L + + + + A+T ++ AN+ + +S+ +G ++T P Sbjct: 331 TNSGSILWAGNEDASGAFSLTGASPRANTATVGAGANTLSFNASQSNQIYGASDTVQPP 389 >UniRef50_B3FYL6 Gp17 n=1 Tax=Salmonella phage phiSG-JL2 RepID=B3FYL6_9CAUD Length = 658 Score = 53.0 bits (125), Expect = 2e-05, Method: Composition-based stats. Identities = 49/262 (18%), Positives = 82/262 (31%), Gaps = 56/262 (21%) Query: 50 GWSGTSGAHAPAFIRSRRDTTDANWSPWAQLYTSAHPPAEF---YPVGAPIPWPSDTVPS 106 G S T +++ A ++ A+ + +A + PVG +P+ Sbjct: 232 GQSATDASNSAAQAKASEVNAKASEVNAKRDADAALLALQSTGNVPVGTVAMITHTKIPT 291 Query: 107 GYALMQGQTFDKSAYPKLAVAYPSGVIPDMRGWTIKGKPASGRAVLSQEQDGIKSHTHSA 166 G+ G+ FD + YP LA +PSG P G VL+ + + Sbjct: 292 GWVR-AGEDFDVNTYPALAELFPSGRTPSFDDRYPIG----NSTVLT--PGQLIDQSVP- 343 Query: 167 SASSTDLGTETTSSFDYGTKSTNNTGAHTHSISGTANSAGAHQHKSSGAFGGTNTSIFPN 226 AH+H+ N +GA ++ + Sbjct: 344 --------------------------AHSHTFDVPVNVSGATAAGGEYRARTSHEGDHSH 377 Query: 227 GYTAISNLSAGIMSTTSGSGQTRNAGKTSSDGAHTHSLSGTAASAGAHAHTVGIGAHTHS 286 G++ + G + G T G GAH+H Sbjct: 378 GFSLPIQNNTGAYTGRLVGGGNNPNYPQDLRFN-----------------TGGGGAHSHE 420 Query: 287 VAIGSHGHTITVNAAGNAENTV 308 + SH HT+ NA+G A +V Sbjct: 421 FYVPSHSHTL--NASGRAAGSV 440 >UniRef50_B6XJ97 Putative uncharacterized protein n=2 Tax=Enterobacteriaceae RepID=B6XJ97_9ENTR Length = 432 Score = 52.6 bits (124), Expect = 2e-05, Method: Composition-based stats. 
Identities = 41/198 (20%), Positives = 65/198 (32%), Gaps = 49/198 (24%) Query: 88 AEFYPVGAPIPWPSDTVPSGYALMQGQTFDKSAYPKLAVAYPSGVIPDMRGWTIKGKPAS 147 YP+G + + + P+ + +P Y TI+ AS Sbjct: 248 DTIYPIGVVVWFAQNKNPN------------TLFPGTKWQY------IGENRTIRLAAAS 289 Query: 148 GRAVLSQEQDGIKSHTHSASASSTDLGTETTSSFDYGTKSTNNTGAHTHSISGTANSAGA 207 G VLS T D T + + H HS SGTA S+G Sbjct: 290 GANVLS------------------------TGGSDSITLNASQMPVHNHSFSGTATSSGG 325 Query: 208 HQH-------KSSGAFGGTNTSIFPNGYTAISNLSAGIMSTTSGSGQTRNAGKTSSDGAH 260 H H + A GG + G+ + + + ++ T + Sbjct: 326 HTHDKGTMNITGAFAIGGGKSEGQAPGFASGVFSKTTRTLKVNTASGVVDSSVTQINMNA 385 Query: 261 THSLSGTAASAGAHAHTV 278 + +G +S+GAH HTV Sbjct: 386 ASAWTGNTSSSGAHTHTV 403 >UniRef50_UPI000180B6D6 PREDICTED: similar to glutamate receptor, ionotropic, delta 2 n=1 Tax=Ciona intestinalis RepID=UPI000180B6D6 Length = 1235 Score = 51.8 bits (122), Expect = 3e-05, Method: Composition-based stats. Identities = 30/149 (20%), Positives = 51/149 (34%), Gaps = 4/149 (2%) Query: 162 HTHSASASSTDLGTETTSS-FDYGTKSTNNTGAHTHSISGTANSAGAHQHKSSGAFGGTN 220 H H + + + G TT++ +G +T H H + T + G + G Sbjct: 365 HGHMTTTTPSGHGHMTTTTPSGHGHMTTTTPLGHGHMTTTTPSGHGHMTTTTPSGHGHMT 424 Query: 221 TSIFPNGYTAISNLSAGIMSTTSGSGQTRNAGKTSSDGAHTHSLSGTAASAGAHAHTVGI 280 T+ + +G T+ + T++ H H + T + G H T Sbjct: 425 TTTPSGHGHMTTTTPSGHGHMTTTTPSGHGHMTTTTPSGHGHMTTTTPSGHG-HMTTTTP 483 Query: 281 GAHTHSVAIGS--HGHTITVNAAGNAENT 307 H H HGH T +G+ T Sbjct: 484 SGHGHMTTTTPSGHGHMTTTTPSGHGHMT 512 Score = 51.1 bits (120), Expect = 6e-05, Method: Composition-based stats. 
Identities = 29/150 (19%), Positives = 50/150 (33%), Gaps = 4/150 (2%) Query: 162 HTHSASASSTDLGTETTSS-FDYGTKSTNNTGAHTHSISGTANSAGAHQHKSSGAFGGTN 220 H H + + + G TT++ +G +T H H + T + G + G Sbjct: 376 HGHMTTTTPSGHGHMTTTTPLGHGHMTTTTPSGHGHMTTTTPSGHGHMTTTTPSGHGHMT 435 Query: 221 TSIFPNGYTAISNLSAGIMSTTSGSGQTRNAGKTSSDGAHTHSLSGTAASAGAHAHTVGI 280 T+ + +G T+ + T++ H H + T + G H T Sbjct: 436 TTTPSGHGHMTTTTPSGHGHMTTTTPSGHGHMTTTTPSGHGHMTTTTPSGHG-HMTTTTP 494 Query: 281 GAHTHSVAIGS--HGHTITVNAAGNAENTV 308 H H HGH T + + T Sbjct: 495 SGHGHMTTTTPSGHGHMTTTTPSSHGHMTA 524 Score = 50.3 bits (118), Expect = 1e-04, Method: Composition-based stats. Identities = 29/149 (19%), Positives = 48/149 (32%), Gaps = 4/149 (2%) Query: 162 HTHSASASSTDLGTETTSS-FDYGTKSTNNTGAHTHSISGTANSAGAHQHKSSGAFGGTN 220 H + + G TT++ +G +T H H + T G + G Sbjct: 354 GGHMTTTPPSGHGHMTTTTPSGHGHMTTTTPSGHGHMTTTTPLGHGHMTTTTPSGHGHMT 413 Query: 221 TSIFPNGYTAISNLSAGIMSTTSGSGQTRNAGKTSSDGAHTHSLSGTAASAGAHAHTVGI 280 T+ + +G T+ + T++ H H + T + G H T Sbjct: 414 TTTPSGHGHMTTTTPSGHGHMTTTTPSGHGHMTTTTPSGHGHMTTTTPSGHG-HMTTTTP 472 Query: 281 GAHTHSVAIGS--HGHTITVNAAGNAENT 307 H H HGH T +G+ T Sbjct: 473 SGHGHMTTTTPSGHGHMTTTTPSGHGHMT 501 Score = 49.5 bits (116), Expect = 2e-04, Method: Composition-based stats. Identities = 27/135 (20%), Positives = 49/135 (36%), Gaps = 3/135 (2%) Query: 162 HTHSASASSTDLGTETTSS-FDYGTKSTNNTGAHTHSISGTANSAGAHQHKSSGAFGGTN 220 H H + + + G TT++ +G +T H H + T + G + G Sbjct: 398 HGHMTTTTPSGHGHMTTTTPSGHGHMTTTTPSGHGHMTTTTPSGHGHMTTTTPSGHGHMT 457 Query: 221 TSIFPNGYTAISNLSAGIMSTTSGSGQTRNAGKTSSDGAHTHSLSGTAASAGAHAHTVGI 280 T+ + +G T+ + T++ H H + T + G H T Sbjct: 458 TTTPSGHGHMTTTTPSGHGHMTTTTPSGHGHMTTTTPSGHGHMTTTTPSGHG-HMTTTTP 516 Query: 281 GAHTH-SVAIGSHGH 294 +H H + A + GH Sbjct: 517 SSHGHMTAATPASGH 531 Score = 43.7 bits (101), Expect = 0.008, Method: Composition-based stats. 
Identities = 26/134 (19%), Positives = 42/134 (31%), Gaps = 4/134 (2%) Query: 177 TTSSFDYGTKSTNNT-GAHTHSISGTANSAGAHQHKSSGAFGGTNTSIFPNGYTAISNLS 235 T+ +G T H H + T + G + G T+ + Sbjct: 347 YTTQASHGGHMTTTPPSGHGHMTTTTPSGHGHMTTTTPSGHGHMTTTTPLGHGHMTTTTP 406 Query: 236 AGIMSTTSGSGQTRNAGKTSSDGAHTHSLSGTAASAGAHAHTVGIGAHTHSVAIGS--HG 293 +G T+ + T++ H H + T + G H T H H HG Sbjct: 407 SGHGHMTTTTPSGHGHMTTTTPSGHGHMTTTTPSGHG-HMTTTTPSGHGHMTTTTPSGHG 465 Query: 294 HTITVNAAGNAENT 307 H T +G+ T Sbjct: 466 HMTTTTPSGHGHMT 479 >UniRef50_C3X8R9 Bacteriophage tail fiber protein n=6 Tax=Oxalobacter formigenes OXCC13 RepID=C3X8R9_OXAFO Length = 398 Score = 51.5 bits (121), Expect = 4e-05, Method: Composition-based stats. Identities = 21/87 (24%), Positives = 33/87 (37%), Gaps = 14/87 (16%) Query: 92 PVGAPIPWPSDTVPSGYALMQGQTFDKSAYPKLAVAYP----------SGVIPDMRGWTI 141 P+G+ + +PSGY G + YP L A + +PD+ G Sbjct: 108 PIGSIDYFAMAALPSGYLKADGAEVGRETYPDLFAAIGTVFGEGNGETTFNLPDLIGRFP 167 Query: 142 KGKPASGRAVLSQEQDGIKSHTHSASA 168 +G G+ V Q G+ + T A Sbjct: 168 QGSARPGQRV----QAGLPNITGKFRA 190 >UniRef50_A4NHY2 Probable tail fiber protein n=1 Tax=Haemophilus influenzae PittAA RepID=A4NHY2_HAEIN Length = 556 Score = 51.1 bits (120), Expect = 5e-05, Method: Composition-based stats. Identities = 15/48 (31%), Positives = 24/48 (50%), Gaps = 2/48 (4%) Query: 91 YPVGAPIPWPSD-TVPSGYALMQGQTFDKSAYPKLAVAYP-SGVIPDM 136 P+GA + +P T P G+ G TF++ +P L S +PD+ Sbjct: 178 VPIGAVVSFPRAVTNPVGFLKANGTTFNQQTFPDLYRTLGNSNQLPDL 225 Score = 46.4 bits (108), Expect = 0.001, Method: Composition-based stats. 
Identities = 29/144 (20%), Positives = 51/144 (35%), Gaps = 15/144 (10%) Query: 93 VGAPIPWPSDTVPSGYALMQ--GQTFDKSAYPKLAV----AYPS-GVIPDMRGWTIKGKP 145 VG + D +P+G+ + YP+L Y S +P + ++ Sbjct: 230 VGMTAYFAVDNIPAGWIAFDEIATQVTEQRYPELYRHLIDKYGSINSVPKVADRFLR-NA 288 Query: 146 ASGRAVLSQEQDGIKSHTHSASASSTDLGTET-----TSSFDYGTKSTNNTGAHTHSISG 200 +G +V ++D +K H H ++ S FDY T + ++ T Sbjct: 289 GNGLSVGQIQEDDLKRHVHRVPIDYDSWFDDSSQGRNNSYFDYTTFAQSSDLWSTLGYDN 348 Query: 201 TANSAG--AHQHKSSGAFGGTNTS 222 G + + S A GG T Sbjct: 349 ADGDNGFVSPKDTSQMATGGDETR 372 >UniRef50_C5B185 Putative uncharacterized protein n=1 Tax=Methylobacterium extorquens AM1 RepID=C5B185_METEA Length = 449 Score = 50.7 bits (119), Expect = 6e-05, Method: Composition-based stats. Identities = 26/120 (21%), Positives = 46/120 (38%), Gaps = 19/120 (15%) Query: 92 PVGAPIPWPSDTVPSGYALMQGQTFDKSAYPKLAVAYPS----------GVIPDMRGWTI 141 P G + + P G+ G +S + L + +PD+RG+ + Sbjct: 308 PPGMISAYAGQSCPVGWVDATGLALLRSDFSALFAVIGTRWGAGDGSTTFNVPDLRGYFL 367 Query: 142 K-----GKPASGRAVLSQEQDGIKSHTHSASASSTDLGTETTSSFDY----GTKSTNNTG 192 + GR + S + + H H+ ++ G+ TT++F Y GT S TG Sbjct: 368 RMQDAGAGRDPGRDLGSAQAGSVGPHQHNVPVANATAGSGTTNNFVYPLAAGTSSVPTTG 427 >UniRef50_A4P195 Putative phage tail fibre protein (Fragment) n=1 Tax=Haemophilus influenzae 22.4-21 RepID=A4P195_HAEIN Length = 458 Score = 50.3 bits (118), Expect = 8e-05, Method: Composition-based stats. Identities = 21/77 (27%), Positives = 33/77 (42%), Gaps = 8/77 (10%) Query: 62 FIRSRRDTTDANWSPWAQLYTSAHPPAEFYPVGAPIPWPSD-TVPSGYALMQGQTFDKSA 120 + R + +WS W +L T P+GA + +P T P G+ G TF + Sbjct: 370 YERHQTSYQTDSWSAWKKLNTDG------IPIGAVVSFPRAVTNPVGFLRADGSTFSQQT 423 Query: 121 YPKLAVAYP-SGVIPDM 136 +P L S +PD+ Sbjct: 424 FPDLYRTLGNSNKLPDL 440 >UniRef50_C3XAA4 Putative uncharacterized protein n=1 Tax=Oxalobacter formigenes OXCC13 RepID=C3XAA4_OXAFO Length = 305 Score = 50.3 bits (118), Expect = 9e-05, Method: Composition-based stats. 
Identities = 28/149 (18%), Positives = 52/149 (34%), Gaps = 30/149 (20%) Query: 87 PAEFYPVGAPIPWPSDTVPSGYALMQGQTFDKSAYPKLAVAYP----------SGVIPDM 136 P+ PVG + T P+GY G + YP+L + +PD+ Sbjct: 10 PSSGVPVGTIEYFAMVTSPAGYLKANGAAVGRETYPELYATIGTTFGEGDGSSTFNLPDL 69 Query: 137 RGWTIKGKPASGRAVLSQEQDGIKSHTHSASASSTDLGTETTSSFDYGTKSTNNTGAHTH 196 +G G+ + + G+ H H+ + + G TG H Sbjct: 70 IDRFAQGSNTPGQKI----EAGLSDHNHTLPLALEETG----------------TGYAAH 109 Query: 197 SISGTANSAGAHQHKSSGAFGGTNTSIFP 225 + ++ + + S+ +G +NT P Sbjct: 110 GSNISSGTTVGYASASNPIYGASNTVQPP 138 >UniRef50_C9PG79 Putative phage tail protein n=1 Tax=Vibrio furnissii CIP 102972 RepID=C9PG79_VIBFU Length = 410 Score = 50.3 bits (118), Expect = 1e-04, Method: Composition-based stats. Identities = 32/137 (23%), Positives = 49/137 (35%), Gaps = 12/137 (8%) Query: 74 WSPWAQLYTSAHPPAEFYPVGAPIPWPSDTVPSGYALMQGQTFDKSAYPKLAVAYPS--- 130 WS + + P G + W S+ VP + GQ + Y LA A P Sbjct: 244 WSDTTKPF--YWKPFSSKTPGETMAWDSELVPEHMIVAMGQQLPVTVYHSLAAAKPEWID 301 Query: 131 ------GVIPDMRGWTIKGKPASGRAVLSQEQDGIKSHTHSASASSTDLGTETTSSFD-Y 183 IPD RG + S D I++ T S +AS T +T + Sbjct: 302 DTNPLVLNIPDRRGRFTRAADGSHWLAGQSHDDAIRNITGSFNASGTTGSASSTKTQGAI 361 Query: 184 GTKSTNNTGAHTHSISG 200 +T++ + + SG Sbjct: 362 ALSNTSSWPNYVNGQSG 378 >UniRef50_A8YDB4 Genome sequencing data, contig C291 n=2 Tax=Microcystis aeruginosa RepID=A8YDB4_MICAE Length = 166 Score = 49.9 bits (117), Expect = 1e-04, Method: Composition-based stats. 
Identities = 25/124 (20%), Positives = 43/124 (34%), Gaps = 30/124 (24%) Query: 82 TSAHPPAEFYPVGAPIPWPSDTVPSGYALMQGQTFDKSAYPKLAVAYPS---------GV 132 +S + AE PI +P G+ L G+ + AYP+L + Sbjct: 34 SSQNAQAESLNANIPITYPEAY---GWMLCDGRYLEIDAYPELFAVIGTLYGKQGDNKFR 90 Query: 133 IPDMRGWTIKGKPAS------------------GRAVLSQEQDGIKSHTHSASASSTDLG 174 +PD RG ++G A + S + D ++ H H +AS++ Sbjct: 91 LPDYRGLFMRGVDAGSGLDPDAAERIGPEGMGKSSGIGSLQCDALQQHQHDYNASNSHFE 150 Query: 175 TETT 178 Sbjct: 151 INRA 154 >UniRef50_Q727X4 Tail fiber protein, putative n=4 Tax=Desulfovibrio vulgaris RepID=Q727X4_DESVH Length = 296 Score = 49.5 bits (116), Expect = 2e-04, Method: Composition-based stats. Identities = 15/70 (21%), Positives = 29/70 (41%), Gaps = 5/70 (7%) Query: 132 VIPDMRGWTIKGKP-----ASGRAVLSQEQDGIKSHTHSASASSTDLGTETTSSFDYGTK 186 + D RG +G +GR + S + D I++ S + + + + +F T Sbjct: 195 RLQDRRGEFARGWDHGRGVDAGRVLGSAQGDAIRNIVGSMGSITAVVAGTASGAFTVTTP 254 Query: 187 STNNTGAHTH 196 S + G+ T Sbjct: 255 SNRSAGSSTG 264 >UniRef50_B8HZW4 Tail Collar domain protein n=2 Tax=Clostridium RepID=B8HZW4_CLOCE Length = 200 Score = 48.8 bits (114), Expect = 2e-04, Method: Composition-based stats. 
Identities = 34/185 (18%), Positives = 62/185 (33%), Gaps = 27/185 (14%) Query: 87 PAEFYPVGAPIPWPSDTVPS--------GYALMQGQTFDKSAYPKLAVAYPSG------- 131 E P+G+ I + + G+ + G + YP L A Sbjct: 3 STERMPIGSVISFAGEIKSEMVNRLYRMGWLICDGSKLKIAEYPDLFQAIGKAHGGDNTY 62 Query: 132 -VIPDMRGWTIKG--KPASGRAVLSQEQDGIKSHTHSASASSTDLGTETTSSFDYG---- 184 +PD + I+G + G + + T + +T + F G Sbjct: 63 FYLPDTQSKFIRGVNGDSVGES-GRLMDPDVAKRTFAKPGGNTGNNVGSYQDFATGLPKV 121 Query: 185 TKSTNNTGAHTHSI----SGTANSAGAHQHKSSGAFGGTNTSIFPNGYTAISNLSAGIMS 240 + +T+ G+HTHS+ G+ N+ + G G NT +G + + G Sbjct: 122 SLTTDFIGSHTHSLPHLPDGSHNAYAGSIGRDGGKEAGDNTRTGESGSHSHEIIGGGDPE 181 Query: 241 TTSGS 245 T + Sbjct: 182 TRPRN 186 >UniRef50_C0DSG4 Putative uncharacterized protein n=1 Tax=Eikenella corrodens ATCC 23834 RepID=C0DSG4_EIKCO Length = 436 Score = 48.4 bits (113), Expect = 3e-04, Method: Composition-based stats. Identities = 15/47 (31%), Positives = 22/47 (46%), Gaps = 1/47 (2%) Query: 91 YPVGAPIPWPSDTV-PSGYALMQGQTFDKSAYPKLAVAYPSGVIPDM 136 PVGA + +P P GY G TF ++ YP L +P++ Sbjct: 72 LPVGAVVGFPRAISSPEGYLKADGSTFAQATYPDLYRVLGGNKLPNL 118 Score = 46.8 bits (109), Expect = 0.001, Method: Composition-based stats. Identities = 26/122 (21%), Positives = 47/122 (38%), Gaps = 18/122 (14%) Query: 93 VGAPIPWPSDTVPSGYALMQ--GQTFDKSAYPKLAVA----YPS-GVIPDMRGWTIKGKP 145 VG +P + +P G+ +SAYP+L Y S +P I+ Sbjct: 123 VGMTAYFPIEAIPDGWIKYDEVATKVTQSAYPELYRLLVAQYGSIDAVPKAEDRFIR-NA 181 Query: 146 ASGRAVLSQEQDGIKSHTHSASASST----DLGTETTSSF------DYGTKSTNNTGAHT 195 + AV +Q+ D I++ T A + L T+ +F + +++ G Sbjct: 182 SGSLAVGTQQGDTIRNITGGIEALYSGYRYTLYTKADGAFTMDLDDGANSTFSSSKGDSD 241 Query: 196 HS 197 H+ Sbjct: 242 HN 243 >UniRef50_Q6J803 Pas28 n=1 Tax=Actinoplanes phage phiAsp2 RepID=Q6J803_9CAUD Length = 1291 Score = 48.4 bits (113), Expect = 3e-04, Method: Composition-based stats. 
Identities = 56/265 (21%), Positives = 89/265 (33%), Gaps = 28/265 (10%) Query: 58 HAPAFIRSR---RDTTDANWSPWAQLYTSAHPPAEFYPVGAPIPWPS--DTVPSGYALMQ 112 ++ R+R R+ D S W+ Y P G + WP ++P G+ Sbjct: 702 PCCSYYRARTIGREDGDLRISDWSDTYDPG------IPSGIIVMWPGTDASLPEGW---- 751 Query: 113 GQTFDKSAYPKLAVAYPSGVIPDMRGWTIKGKPASGRAVLSQEQDGIKSHTHSASASSTD 172 L YP GV D G A+ SH H+ + +++ Sbjct: 752 ------ERTTALDGRYPKGVPDDTTQPGTTGGAATHSHTTPGHTHDT-SHLHTVTGATS- 803 Query: 173 LGTETTSSFDYGTKSTNNTGAHTHSISGTANSAGAHQHKSSGAFGGTN-TSIFPNGYTAI 231 T T +S D +T HTH+ T ++ S G +N + + Sbjct: 804 AATGTFASSDGAVGTTVALNTHTHTRPSTNSATVVSGSASPGTNTASNDPARAEVIFMES 863 Query: 232 SNLSAGIMSTTSG-SGQTRNAGKTSSD-GAHTHSLSGTAASAGAHAHTVGIGAHTHSVAI 289 G+ + +G S G A +AG T G +H+ I Sbjct: 864 DGSPLGLPDGALALTLDVALSGWADSSLGNTGGRFIKGAPAAGDGGTTAGSSVASHTHDI 923 Query: 290 GSHGHTITVNAAGNAENTVKNIAFN 314 +HGHT T + G+ N + A N Sbjct: 924 DAHGHTGTSH--GHTSNPTNSFASN 946 >UniRef50_A2EHN1 Phage tail fiber repeat family protein n=27 Tax=Trichomonas vaginalis RepID=A2EHN1_TRIVA Length = 1034 Score = 48.4 bits (113), Expect = 4e-04, Method: Composition-based stats. Identities = 22/99 (22%), Positives = 32/99 (32%), Gaps = 10/99 (10%) Query: 177 TTSSFDYGTKST---NNTGAHTHSISGTANSAGAHQHKSSGAFGGTNTSIFPNGYTAISN 233 + SS D+G T N H+H S T ++ G H H + N Sbjct: 924 SNSSMDFGGSKTIRVENLPPHSHRFSATTSTNGEHSHS-------MTKRGYTNLAAGSDR 976 Query: 234 LSAGIMSTTSGSGQTRNAGKTSSDGAHTHSLSGTAASAG 272 T S G HTH+++G + G Sbjct: 977 QGMNRYDITDDPRDVSTGIFCGSAGNHTHTVAGITDATG 1015 >UniRef50_D0LMW0 Tail Collar domain protein n=1 Tax=Haliangium ochraceum DSM 14365 RepID=D0LMW0_HALO1 Length = 264 Score = 48.4 bits (113), Expect = 4e-04, Method: Composition-based stats. 
Identities = 29/136 (21%), Positives = 45/136 (33%), Gaps = 23/136 (16%) Query: 100 PSDTVPSGYALMQGQTFDKSAYPKLAVA----YPSG------VIPDMRGWTIKG------ 143 ++T P G+ G + YP+L A Y +G V+PD RG T+ G Sbjct: 119 AAETAPDGWLFCDGSPLIRDDYPELFAAIGETYGAGDGVNTFVLPDCRGRTLIGAGQGNG 178 Query: 144 -KPASGRAVLSQEQDG-----IKSHTHSASASSTDLGTETTSSFDYGTKSTNNTGAHTHS 197 V+ E+ + SHTH+ + L + GT +G Sbjct: 179 LSDRQRGDVVGAEEHTLTIPEMPSHTHAEHPGTGTLWFQVFER-GPGTWPNERSGNTLGQ 237 Query: 198 ISGTANSAGAHQHKSS 213 +G H Sbjct: 238 STGATGGNQPHNIMQP 253 >UniRef50_C4MYW8 Gp12 Short tail fibers n=1 Tax=Enterobacteria phage JSE RepID=C4MYW8_9CAUD Length = 467 Score = 48.0 bits (112), Expect = 5e-04, Method: Composition-based stats. Identities = 22/129 (17%), Positives = 42/129 (32%), Gaps = 17/129 (13%) Query: 94 GAPIPWPSDTVPSGYALMQGQTFDKSAYPKLAVAYP---------SGVIPDMRGWTIKGK 144 G + + + + + GQ +K YP L + +PDMRG +G Sbjct: 316 GIILTAFNSFDHAQFKICNGQWLNKHQYPVLFSRIGFTYGGDGGDNFALPDMRGLVARGC 375 Query: 145 P-----ASGRAVLSQEQDGIKSHTHSASASS---TDLGTETTSSFDYGTKSTNNTGAHTH 196 GR + + D ++ T + ++ G + + + N G Sbjct: 376 DHGRGLDPGRGFGTYQDDTMQHMTGNFPVANRWRGWTGGVFAITGGQWSTNYKNGGGDDW 435 Query: 197 SISGTANSA 205 +SA Sbjct: 436 GSIVNFDSA 444 >UniRef50_C2I7P2 Phage-related tail fiber protein n=1 Tax=Vibrio cholerae TM 11079-80 RepID=C2I7P2_VIBCH Length = 406 Score = 47.2 bits (110), Expect = 7e-04, Method: Composition-based stats. 
Identities = 29/140 (20%), Positives = 56/140 (40%), Gaps = 20/140 (14%) Query: 85 HPPAEFYPVGAPIPWPSDTVPSGYALMQGQT-FDKSAYPKLAVAYPSGVIPD------MR 137 P VG P W + P +A+M+ + Y +LA YP V D +R Sbjct: 266 WIPYTGDQVGMPFYWLDTSAPE-WAVMEINVNLPIAVYWRLARRYPQLVRDDYINTGEIR 324 Query: 138 GWTIKGKP-----ASGRAVLSQEQDGIKSHTHSASASSTDLGTETTSSFDYGTKSTNNTG 192 G ++ +GR++ S + D ++ HTH+ SA + + + G+ + Sbjct: 325 GEFLRVLDQGRGVDAGRSIQSYQDDELERHTHTFSAP-------FSITANTGSTGIIISA 377 Query: 193 AHTHSISGTANSAGAHQHKS 212 +H + + T + ++ Sbjct: 378 SHVPNWNTTYTGGNETRPRN 397 >UniRef50_Q094A8 Phage Tail Collar Domain family n=1 Tax=Stigmatella aurantiaca DW4/3-1 RepID=Q094A8_STIAU Length = 645 Score = 46.8 bits (109), Expect = 0.001, Method: Composition-based stats. Identities = 33/167 (19%), Positives = 52/167 (31%), Gaps = 25/167 (14%) Query: 90 FYPVGAPIPWPSDTVPSGYALMQGQTFDKSAYPKLAVAY----------PSG--VIPDMR 137 PVG I + + P G+ L G T K+AY L PSG +P + Sbjct: 478 LVPVGTIIAYGGSSAPEGWLLCDGSTKSKTAYADLFAVIGDTYKGSSAPPSGQFRLPSLM 537 Query: 138 GWTIKGKPASGRAVLSQEQDGIKSHTHSASASSTDLGTETTSSFDYGTKSTNNTGAHTHS 197 G S + + T T S N+ G H+HS Sbjct: 538 ARVPMGASVSS-----------PHNYPLGTMGGEFTHTLTISEMPVHDHYVNDPG-HSHS 585 Query: 198 ISGT-ANSAGAHQHKSSGAFGGTNTSIFPNGYTAISNLSAGIMSTTS 243 I+ T A +G + + G + + + G + + Sbjct: 586 ITTTNAEGSGDLRPNRDASKGHVDIPTNHVTTGVTLDTNGGGQAHNN 632 >UniRef50_D2L4G0 Tail Collar domain protein n=1 Tax=Desulfovibrio sp. FW1012B RepID=D2L4G0_9DELT Length = 319 Score = 46.8 bits (109), Expect = 0.001, Method: Composition-based stats. 
Identities = 38/146 (26%), Positives = 56/146 (38%), Gaps = 7/146 (4%) Query: 91 YPVGAPIPWPSDTVPSG---YALMQGQTF-DKSAYPKLAVAYPSGVIPDMRGWTIKGKPA 146 PVG IPWPS ++P+ + GQ S Y +L V S IP+ G ++G Sbjct: 41 IPVGTVIPWPSTSMPADATRWLECNGQAVPSGSQYDRLRVVLGSKPIPNYNGQFLRG-TT 99 Query: 147 SGRAVLSQEQDGIKSHTHSASASSTDLGTETTSSFDYGTKSTNNTGAHTHSISGTANSAG 206 V D ++H H A + + G ++ + T S S + AG Sbjct: 100 VSSEVGQTVADSTRAHDHLIDAHQHTVSGTASGQSYGGAIASVSISGSTSSQSYSGTIAG 159 Query: 207 AHQH--KSSGAFGGTNTSIFPNGYTA 230 H S A+GG G T+ Sbjct: 160 QHITGATSGQAYGGNIAGQHVTGSTS 185 >UniRef50_B6IWH6 Putative uncharacterized protein n=2 Tax=Bacteria RepID=B6IWH6_RHOCS Length = 206 Score = 45.7 bits (106), Expect = 0.002, Method: Composition-based stats. Identities = 34/181 (18%), Positives = 64/181 (35%), Gaps = 25/181 (13%) Query: 93 VGAPIPWPSDTVPSGYALMQGQTF----DKSAYPKLAVAYPSGV-----IPDMRGWTIKG 143 +G +PW PS ++L GQ +++ + + Y +PD+RG G Sbjct: 6 IGTIMPWAVSWAPSNWSLCMGQILPVNGNQAVFALIGATYGGNGSTNFALPDLRGRVPVG 65 Query: 144 K----------PASGRAVLSQEQDGIKSHTHSASASSTDLGTETTSSFDYGTKSTNNTGA 193 P + R + + T S + T ++ G + A Sbjct: 66 AGQFPGSGGIPPTTNRVIGQSGGQEQVNLTQS------QMPVHTHAAQATGGGGSVTLSA 119 Query: 194 HTHSISGTANSAGAHQHKSSGAFGGTNTSIFPNGYTAISNLSAGIMSTTSGSGQTRNAGK 253 +T +A + G + ++G FGG ++ G + + + + S NAG Sbjct: 120 YTGPADSSAPAVGKYLTAAAGDFGGDAVTVKIYGPASGTAVPIASGTVQPPSITVGNAGG 179 Query: 254 T 254 T Sbjct: 180 T 180 >UniRef50_D2V5I7 Microcystin-dependent protein n=1 Tax=Naegleria gruberi RepID=D2V5I7_NAEGR Length = 191 Score = 44.9 bits (104), Expect = 0.003, Method: Composition-based stats. 
Identities = 35/185 (18%), Positives = 54/185 (29%), Gaps = 35/185 (18%) Query: 90 FYPVGAPIPWPSDTVPSGYALMQGQTFDKS--AYPKLAVAYP----------SGVIPDMR 137 PVG + T+P+G+ L G T+ S Y +L S +PD+R Sbjct: 15 IIPVGIVNAFAGTTIPAGWLLCDGATYPNSHPDYIRLFQTIGNAYGSTGGPHSFNVPDLR 74 Query: 138 GWTIKGKPASGRAVLSQEQDGIKSHTHSASASSTDLGTETTSSFDYGTKSTNNTGAHTHS 197 G + G H A S+ L + + + +HS Sbjct: 75 GRAVVG------------------IGHGAGLSNRTLAQKVGEE----SHQLQISELPSHS 112 Query: 198 ISGTANSAGAHQHKSSGAFGGTNTSIFPNGYTAISNLSAGIMSTTSGSGQTRNAGKTSSD 257 SGT A + G + G+ +G+ T N Sbjct: 113 HSGTTGKANKQPYIIVHQSGPISDVFHTPGWCGGPATH-KDDDNFTGANHTHNFTTNEVG 171 Query: 258 GAHTH 262 G H Sbjct: 172 GNSAH 176 >UniRef50_A0A7D3 Putative uncharacterized protein n=1 Tax=Microcystis aeruginosa phage Ma-LMM01 RepID=A0A7D3_9CAUD Length = 335 Score = 44.9 bits (104), Expect = 0.004, Method: Composition-based stats. Identities = 11/49 (22%), Positives = 20/49 (40%), Gaps = 4/49 (8%) Query: 86 PPAEFYPVG-APIPWPSDTV---PSGYALMQGQTFDKSAYPKLAVAYPS 130 A P+G + WP P+ + + GQ ++ YP+L + Sbjct: 254 IAAAGAPIGSIIMWWPLIVTQQHPTNWLPLNGQEISRTQYPELFAVIGT 302 >UniRef50_C3YB93 Putative uncharacterized protein n=1 Tax=Branchiostoma floridae RepID=C3YB93_BRAFL Length = 749 Score = 44.9 bits (104), Expect = 0.004, Method: Composition-based stats. Identities = 34/138 (24%), Positives = 55/138 (39%), Gaps = 12/138 (8%) Query: 172 DLGTETTSSFDYGTKSTNNTGAHTHSISGTANSAGAHQHKSSGAFGGTNTSIFPNGYTAI 231 D G + +G +++ G +HS ++S G H + G NG Sbjct: 470 DKGVQGGQFHSHGD-QSHSHGDQSHSHGDQSHSHGDQSHSNGGQSHSHGGQSHSNGGQFH 528 Query: 232 SNLSAGIMSTTSGSGQTRNAGKTSSDGAHTHSLSGTAASAGAHAH----TVGIGAHTHSV 287 S+ G S + G + G++ S G +HS G + S G H+H GI + V Sbjct: 529 SH---GDQSHSHGGQSHSHGGQSHSHGDQSHSHGGQSHSHGGHSHDMNRNAGIASVAWMV 585 Query: 288 AIGSHGHT----ITVNAA 301 +G H +T+ AA Sbjct: 586 IMGDGLHNFADGVTIGAA 603 >UniRef50_C5RJD9 Tail Collar domain protein n=1 Tax=Clostridium cellulovorans 743B RepID=C5RJD9_CLOCL Length = 199 Score = 44.1 bits (102), Expect = 0.006, Method: Composition-based stats. 
Identities = 30/150 (20%), Positives = 52/150 (34%), Gaps = 12/150 (8%) Query: 96 PIPWPSDTVPSGYALMQGQTFDKSAYPKLAVAYP---------SGVIPDMRGWTIKGKPA 146 I WP + VP G+ +GQ + Y L + +PD+RG G Sbjct: 8 IILWPGNFVPRGWLACEGQELPINQYTALYSLLGTTYGGNGSTTFKLPDLRGRVPVGSGI 67 Query: 147 SGRAVLSQEQDGIKSHTHSASASSTDLGTETTSSFDYGTKSTNNTGAHTHSISGTANSAG 206 G + Q+ + + + + T +T+ T N G + GT N+ Sbjct: 68 CG-GINFQQGNSGGNFNVTLTQQQMPAHTHSTTV--TQGAVTVNGGIPFNGGEGTTNTPS 124 Query: 207 AHQHKSSGAFGGTNTSIFPNGYTAISNLSA 236 A + G G + N A +++ Sbjct: 125 ASSKLAVGITAGGDIPNIYNTSEATGSVTG 154 >UniRef50_Q8PR98 Microcystin dependent protein n=1 Tax=Xanthomonas axonopodis pv. citri RepID=Q8PR98_XANAC Length = 195 Score = 44.1 bits (102), Expect = 0.007, Method: Composition-based stats. Identities = 22/96 (22%), Positives = 36/96 (37%), Gaps = 10/96 (10%) Query: 93 VGAPIPWPSDTVPSGYALMQGQTFDKSAYPKLAVAYP---------SGVIPDMRGWTIKG 143 +G +P + P G+ GQT + Y L + +PD+RG + G Sbjct: 6 IGEVRAFPYNFAPEGWLDCMGQTVSINQYQALFGVIGFAYGGDKQTTFGLPDLRGRAVTG 65 Query: 144 KPASGRAVLSQEQDGIKSHTHSASASSTDLGTETTS 179 + G + + ++ A SST L T S Sbjct: 66 Q-GQGPGLSNYTIGQLQGTDSVALVSSTQLPAHTHS 100 >UniRef50_P10930 Short tail fiber protein n=8 Tax=Myoviridae RepID=VG12_BPT4 Length = 527 Score = 44.1 bits (102), Expect = 0.007, Method: Composition-based stats. 
Identities = 38/177 (21%), Positives = 61/177 (34%), Gaps = 18/177 (10%) Query: 83 SAHPPAEFYPVGAPIPWPSDTVPS-GYALMQGQTFDKSAYPKLAVAY-------PSG-VI 133 + + PVGA + W +D++PS + G T S P A PS + Sbjct: 333 TQNEIDRTIPVGAIMMWAADSLPSDAWRFCHGGTVSASDCPLYASRIGTRYGGNPSNPGL 392 Query: 134 PDMRGWTIKGKPASGRAVLSQEQDGIKSHTHSASASSTDLGTETTSSFDYGTKSTNNTGA 193 PDMRG ++G SGR + + LG T + G Sbjct: 393 PDMRGLFVRG---SGR----GSHLTNPNVNGNDQFGKPRLGVGCTGGY-VGEVQIQQMSY 444 Query: 194 HTHSIS-GTANSAGAHQHKSSGAFGGTNTSIFPNGYTAISNLSAGIMSTTSGSGQTR 249 H H+ G + GA + F GT + + + +N I + + + Sbjct: 445 HKHAGGFGEHDDLGAFGNTRRSNFVGTRKGLDWDNRSYFTNDGYEIDPESQRNSKYT 501 >UniRef50_Q4KAW4 Putative uncharacterized protein n=1 Tax=Pseudomonas fluorescens Pf-5 RepID=Q4KAW4_PSEF5 Length = 181 Score = 44.1 bits (102), Expect = 0.007, Method: Composition-based stats. Identities = 27/147 (18%), Positives = 45/147 (30%), Gaps = 10/147 (6%) Query: 99 WPSDTVPSGYALMQGQTFDKSAYPKLAVAYP---------SGVIPDMRGWTIKGKPASGR 149 +P P G+ L QGQ D Y LA + +PD+RG G+ Sbjct: 11 FPWAWAPQGWLLCQGQILDVVNYTALASLLGDRYGGDGRTTFGLPDLRGRAALGENPVAS 70 Query: 150 AVLSQEQDGIKSHTHSASASSTDLGTETTSSFDYGTKSTNNTGAHTHSISGTANSAGAHQ 209 + S A + L + T TG +I ++++ Sbjct: 71 TSPVLGVHELGSMDG-AEWVALTLNNLPAHNHVANVAVTAGTGGPAGNIPAISSTSKGAV 129 Query: 210 HKSSGAFGGTNTSIFPNGYTAISNLSA 236 K + + N T ++ L Sbjct: 130 SKPTYVAYADKDRVTINPTTVVTTLGY 156 >UniRef50_Q58MY1 Predicted protein n=1 Tax=Prochlorococcus phage P-SSM2 RepID=Q58MY1_BPPRM Length = 597 Score = 43.7 bits (101), Expect = 0.008, Method: Composition-based stats. 
Identities = 38/179 (21%), Positives = 67/179 (37%), Gaps = 8/179 (4%) Query: 91 YPVGAPIPW--PSDTVPSGYALMQGQTFDKSAYPKLAVAYPSGVIPDMRGWTIKGKPASG 148 P G + W + +PSG+ L G K + S D G + Sbjct: 356 IPAGVVVMWSGAQNAIPSGWVLCDGNNSSPDLRDKFVIGAGSNYAVDNTGGSADAVVVDH 415 Query: 149 RAVLSQEQDGIKSHTHSASASST------DLGTETTSSFDYGTKSTNNTGAHTHSISGTA 202 S G +HTHS SAS + G++T S T S + +G+H HS+S + Sbjct: 416 SHSASTSVSGAGAHTHSFSASDSHTHSFSGSGSDTFSGSGSHTHSFSGSGSHGHSLSLSD 475 Query: 203 NSAGAHQHKSSGAFGGTNTSIFPNGYTAISNLSAGIMSTTSGSGQTRNAGKTSSDGAHT 261 ++ + N+ + ++ + G + SGS + + + + G+ T Sbjct: 476 SAHQHTSAIPAQNQVAGNSGSQTIWGSVTNSPTWGATANVSGSADSASVSISGTTGSGT 534 >UniRef50_D1ANH0 Putative uncharacterized protein n=1 Tax=Sebaldella termitidis ATCC 33386 RepID=D1ANH0_SEBTE Length = 390 Score = 43.7 bits (101), Expect = 0.009, Method: Composition-based stats. Identities = 24/120 (20%), Positives = 39/120 (32%), Gaps = 6/120 (5%) Query: 158 GIKSHTHSASASSTDLGTETTSSFDYGTKSTNNTGAHTHSISGTANSAGAHQHKSSGAFG 217 G+ ++ S +A GT+TT T N +H H++ G ++ G H H + Sbjct: 247 GVDTNDVSFNAGEKIGGTQTT------TLGVGNLPSHNHNVQGATDAQGNHYHIVNDHSH 300 Query: 218 GTNTSIFPNGYTAISNLSAGIMSTTSGSGQTRNAGKTSSDGAHTHSLSGTAASAGAHAHT 277 +G + T T + TH + AG H H Sbjct: 301 YVPPHAHGLSVLRAKAGDSGGNGGNTAYNGTIVNYSTDATDLWTHGSAPATNWAGQHNHW 360 Score = 42.2 bits (97), Expect = 0.022, Method: Composition-based stats. Identities = 26/118 (22%), Positives = 42/118 (35%), Gaps = 11/118 (9%) Query: 200 GTANSAGAHQHKSSGAFGGTNTSIFPNGYTAISNLSAGIMSTTSGSGQTRNAGK-TSSDG 258 GT + + FGGT + + + G T+ + Sbjct: 216 GTIYTTVNKDFDPNVTFGGTWERYAKGRTLVGVDTNDVSFNAGEKIGGTQTTTLGVGNLP 275 Query: 259 AHTHSLSGTAASAGAHAHTVGIGAHTHSVAIGSHGHTITV------NAAGNAENTVKN 310 +H H++ G + G H H V H+H V H H ++V ++ GN NT N Sbjct: 276 SHNHNVQGATDAQGNHYHIV--NDHSHYVP--PHAHGLSVLRAKAGDSGGNGGNTAYN 329 >UniRef50_A9AVC5 Tail Collar domain protein n=1 Tax=Herpetosiphon aurantiacus ATCC 23779 RepID=A9AVC5_HERA2 Length = 934 Score = 43.4 bits (100), Expect = 0.011, Method: Composition-based stats. 
Identities = 34/191 (17%), Positives = 54/191 (28%), Gaps = 54/191 (28%) Query: 89 EFYPVGAPIPWPSDTV--PSGYALMQGQTFDKSAYPKLAVAYPSGVIPDMRGWTIKGKPA 146 P G W P G+ L GQ PD+R + G A Sbjct: 782 SSIPSGTINMWSGADNALPGGWLLCNGQ----------------NGTPDLRNRFVVGAGA 825 Query: 147 SGRAVLSQEQDGIKSHTHSASASSTDLGTETTSSFDYGTKSTNNTGAHTHSISGTANSAG 206 + TT D T + N +H H + + ++ G Sbjct: 826 A-------------------------YPVGTTGGADSVTLAVNQMPSHNH--AASTSNDG 858 Query: 207 AHQHKSSGAFGGTNTSIFPNGYTAISNLSAGIMSTTSGSGQTRNAGKTSSDGAHTHSLSG 266 H H ++ + G+ + + KT DG H+HS++ Sbjct: 859 QHNHT----LYFDTGGGGNGPGGDMAKTNDGLQKNVIAN----FSVKTDKDGNHSHSVT- 909 Query: 267 TAASAGAHAHT 277 + G AH Sbjct: 910 IQNNGGNQAHE 920 >UniRef50_B0MAM5 Putative uncharacterized protein n=2 Tax=Anaerostipes caccae DSM 14662 RepID=B0MAM5_9FIRM Length = 386 Score = 43.4 bits (100), Expect = 0.011, Method: Composition-based stats. Identities = 28/102 (27%), Positives = 43/102 (42%), Gaps = 8/102 (7%) Query: 184 GTKSTNNTGAHTHSISG-TANSAGAHQHKSSGAFGGTNTSIFPNGYTAISNLSAGIMSTT 242 G + AH HS++G + + G H H + A G ++ + S A ++ Sbjct: 266 GGSKNSVVVAHNHSVNGLSVSPVGDHTHTINSA-GNHRHGVYADVDCITSATGARDLACP 324 Query: 243 SGSGQTRNAGKTSSDGAHTHSLSGTAASAGAHAHTVGIGAHT 284 + T T G H+HS+SG G H HTV AH+ Sbjct: 325 EKNKDTLWESATPYAGNHSHSMSGK----GGHNHTVP--AHS 360 >UniRef50_Q03314 Protein rhiB n=2 Tax=Rhizobium leguminosarum bv. viciae RepID=RHIB_RHILV Length = 219 Score = 43.4 bits (100), Expect = 0.011, Method: Composition-based stats. 
Identities = 24/118 (20%), Positives = 39/118 (33%), Gaps = 31/118 (26%) Query: 106 SGYALMQGQTFDKSAYPKLAVAYP------------SGVIPDMRGWTIKGKPASG----- 148 G+ L G+ + YP+L IPD RG ++G A G Sbjct: 78 QGWMLCDGRYLRAAVYPELYAVLGGLYGERNSTADLEFRIPDYRGLFLRGFDAGGGMDPD 137 Query: 149 -------------RAVLSQEQDGIKSHTHSASASSTDLGTETTSSFDYGTKSTNNTGA 193 V S + D ++ H H +T G + + S+ +TG+ Sbjct: 138 AKRRLDPTGNNVANVVGSLQCDALQVHAHPYEI-TTPAGISQQGNAAGTSISSKSTGS 194 >UniRef50_C5RN01 Tail Collar domain protein n=1 Tax=Clostridium cellulovorans 743B RepID=C5RN01_CLOCL Length = 123 Score = 43.4 bits (100), Expect = 0.012, Method: Composition-based stats. Identities = 15/55 (27%), Positives = 21/55 (38%), Gaps = 10/55 (18%) Query: 100 PSDTVPSGYALMQGQTFDKSAYPKLAVA----YPSG------VIPDMRGWTIKGK 144 + T P G+ + G + Y L A Y +G +PDMRG G Sbjct: 68 ATTTAPQGWLICDGSAVSRETYANLYTAIGTTYGNGDGTTTFNLPDMRGRVPIGS 122 >UniRef50_A6EAC0 Microcystin-dependent protein n=1 Tax=Pedobacter sp. BAL39 RepID=A6EAC0_9SPHI Length = 185 Score = 43.4 bits (100), Expect = 0.012, Method: Composition-based stats. Identities = 31/180 (17%), Positives = 57/180 (31%), Gaps = 30/180 (16%) Query: 93 VGAPIPWPSDTVPSGYALMQGQTFDKSAYPKLAVAYP---------SGVIPDMRGWTIKG 143 +G P+ D +P G+ G T+ + Y L + +P+++G I G Sbjct: 5 IGEVRPFAFDWIPDGWLACNGATYPLAQYQALYSVIGTVYGGTLGQNFKVPNLQGEAIIG 64 Query: 144 KP------------ASGRAVLSQEQDGIKSHTHSASASSTDLGTETT--------SSFDY 183 G + I +H H + + G T ++F Y Sbjct: 65 AGQGPTTSAYTLAQTGGTEKAGLTVNQIPNHDHVFNGAIGATGFRTNTAGNTSYLTNFGY 124 Query: 184 GTK-STNNTGAHTHSISGTANSAGAHQHKSSGAFGGTNTSIFPNGYTAISNLSAGIMSTT 242 G +T T A + GT ++ + GG + + P + G + Sbjct: 125 GGAGATTFTSASGYVPPGTPDTLLNPSSVTQTGGGGAHENRQPYLAVTYAICFNGYYPSR 184 >UniRef50_Q11LT1 Microcystin-dependent protein-like n=1 Tax=Chelativorans sp. BNC1 RepID=Q11LT1_MESSB Length = 268 Score = 43.0 bits (99), Expect = 0.016, Method: Composition-based stats. 
Identities = 39/184 (21%), Positives = 66/184 (35%), Gaps = 12/184 (6%) Query: 81 YTSAHPPAEFYPVGAPIPWPSDTVPSGYALMQGQTFDKSAYPKLAVAYPSGVIPDMRGWT 140 T A + P+G + + T P G+ GQ S YP + + Sbjct: 71 VTDAATVGQLVPIGTIVDYALSTAPEGWTFCYGQALTSS------TPYPLLRAALLAAGS 124 Query: 141 IKGKPASGRAVLSQEQDGIKSHTHSASASSTDLGTETTSSFDYGTKSTNNTGAHTHSISG 200 G S V + + + +S + T + + G + GA TH++S Sbjct: 125 PFGTSGSDPRVPDYRG-RVGAGKDNMGGTSANRLTNQSGGVN-GDVLGDTGGAETHTLS- 181 Query: 201 TANSAGAHQHKSSGAFGGTNTSIFPNGYTAISNLSAGIMSTTSGSGQTRNAGKTSSDGAH 260 +H H S GG +T +S S G T + SG + ++ G+H Sbjct: 182 -VGQMPSHNHSGSTGSGGNHTHTM--YVKNLSAGSGGNPVTGTPSGTIDSTYQSDPSGSH 238 Query: 261 THSL 264 +HS+ Sbjct: 239 SHSI 242 >UniRef50_Q7N6A5 Complete genome; segment 6/17 n=1 Tax=Photorhabdus luminescens subsp. laumondii RepID=Q7N6A5_PHOLL Length = 405 Score = 42.6 bits (98), Expect = 0.018, Method: Composition-based stats. Identities = 38/197 (19%), Positives = 64/197 (32%), Gaps = 27/197 (13%) Query: 84 AHPPAEFYPVGAPIPWPSDTVPSGYALMQGQTFDKSAYPKLAVAYPSGVIPDMRGWTIKG 143 + G + + + PSG+A G + PD+R I Sbjct: 223 NIDANKLLSKGMIVMFSGSSAPSGWAFCDG----------------NNGTPDLRSRFIMC 266 Query: 144 KPASGRAVLSQEQDGIKSHTHSASASSTDLGTETTSSFDYGTKSTNNTGAHTHSISGTAN 203 G S+ S S S ++ TTS+ S NT T S Sbjct: 267 GETVSET-------GKSSNKASGSGSGKNVSRNTTSTAVSVNVSVLNTTL-TESQIPKHK 318 Query: 204 SAGAHQHKSSGAFGGTNTSIFPNGYTAISNLSAGIMSTTSGSGQTRNAGKTSSDGAHTHS 263 + + ++ F +T+I ++++ I TSG H H+ Sbjct: 319 HIESLPYYTTLGFAYDHTTIGATNNKIDNSVNGLIWKRTSGPDYHPYTSDIGGGQGHNHN 378 Query: 264 LSGTAASAGAHAHTVGI 280 S AS+ +H H+V + Sbjct: 379 AS---ASSPSHTHSVDV 392 >UniRef50_Q84CW8 Putative transmembrane protein n=1 Tax=uncultured bacterium RepID=Q84CW8_9BACT Length = 406 Score = 42.6 bits (98), Expect = 0.018, Method: Composition-based stats. 
Identities = 52/214 (24%), Positives = 87/214 (40%), Gaps = 19/214 (8%) Query: 102 DTVPSGYALMQGQTF--DKSAYPKL-AVAYPSGVIPDMRGWTIKGKPASGRAVLSQEQDG 158 P+G+ L GQT+ +S YP L AV SG+ M G +I + R ++ G Sbjct: 186 AAAPTGWLLF-GQTYLSGQSTYPALWAVLVASGLTSWMSGTSIVLPDLADRVLMDGGTLG 244 Query: 159 IKSHTHSASASSTDLGTETTSSFDYGTKSTNNTGAHTHSISGTANSAGAHQHKSSGAFGG 218 ++ + S+ +L S H H +A ++ H H S GG Sbjct: 245 ATGGANAVTLSTANLPAHDHSI------------DHNHGSVTSAGNSVNHTHTFSDTTGG 292 Query: 219 TNTSIFPNGYTAISNLSAGIMSTTSGSGQTRNAGKTSS-DGAHTHSLSGTA-ASAGAHAH 276 T + ++ A + + +G NA T + G HTH++SGT + AH H Sbjct: 293 TGEHNHNAWFVDVTGGGAASRAAPASTGSGTNAQITIAGGGDHTHTVSGTTGGDSVAHTH 352 Query: 277 TVGIGAHTHSVAIGSHGHTITVNAAGNAENTVKN 310 V + + G +T + G+ +N ++N Sbjct: 353 AVDLPNFAGTSGSVGSGTAVTTHP-GSPQNQLRN 385 >UniRef50_Q7N687 Complete genome; segment 6/17 n=1 Tax=Photorhabdus luminescens subsp. laumondii RepID=Q7N687_PHOLL Length = 343 Score = 42.2 bits (97), Expect = 0.022, Method: Composition-based stats. Identities = 35/199 (17%), Positives = 67/199 (33%), Gaps = 34/199 (17%) Query: 87 PAEFYPVGAPIPWPSDTVPSGYALMQGQTFDKSAYPKLAVAYPSGVIPDMRGWTIKGKPA 146 P P G + + +VP+G+ L G + P++ I G Sbjct: 161 PNTVLPRGMIVMFSGKSVPTGWTLCDG----------------NNGTPNLIDRFILGGNF 204 Query: 147 SG-----RAVLSQEQDGIKSHTHSASASSTDLGTETTSSFDYGTKSTNNTGAHTHSISGT 201 SG +S +D KS +++ ++ ++ +T+ + S H+H Sbjct: 205 SGIDGKSSTTVSGPKDS-KSFNFNSNEATLNINGKTSER----SLSIGQIPNHSHLSGIN 259 Query: 202 ANSAGAHQHKSSGAFGGTNTSIFPNGYTAISNLSAGIMSTTSGSGQTRNAGKTSSDGAHT 261 ++ +G T + N S + +SG + +S H Sbjct: 260 IDTN------IMAQYGATQIGKTDRAVASSKNTSERYLYYSSGILSSNGTIGQNSPETHD 313 Query: 262 HSLSGTAASAGAHAHTVGI 280 H ++ T + G H H I Sbjct: 314 HDINLT--NTGNHFHKNQI 330 >UniRef50_B2W978 Putative uncharacterized protein n=2 Tax=Pleosporineae RepID=B2W978_PYRTR Length = 694 Score = 42.2 bits (97), Expect = 0.024, Method: Composition-based stats. 
Identities = 27/127 (21%), Positives = 41/127 (32%), Gaps = 2/127 (1%) Query: 165 SASASSTDLGTETTSSFDYGTKSTNNTGAHTHSISGTANSAGAHQHKSSGAFGGTNTSIF 224 + L G T + +++SG+ N +Q SG N + Sbjct: 233 NFGNHYGALEFGMLGHMSSGAVETPHDNNLMNNMSGSVN--MYNQQVPSGYPDQNNAAAM 290 Query: 225 PNGYTAISNLSAGIMSTTSGSGQTRNAGKTSSDGAHTHSLSGTAASAGAHAHTVGIGAHT 284 G + + GS TS G+H H + G HA +G G T Sbjct: 291 AFGPNGLPGSEWQETQSRQGSMHVHTPNNTSGSGSHDHHPHRNDSLNGPHAFAIGQGPAT 350 Query: 285 HSVAIGS 291 HS A + Sbjct: 351 HSTASPA 357 >UniRef50_A8T9J8 Putative uncharacterized protein n=1 Tax=Vibrio sp. AND4 RepID=A8T9J8_9VIBR Length = 242 Score = 42.2 bits (97), Expect = 0.024, Method: Composition-based stats. Identities = 39/190 (20%), Positives = 65/190 (34%), Gaps = 38/190 (20%) Query: 43 GEGELLIGWSGTSGAHAPAFIRSRRDTTDANWSPWAQLYTSAHPPAEFYPVGAPIPWPS- 101 G G L I + TS A +++ + ++ PVG + W S Sbjct: 48 GSGNLKIDYEETSSQLAGTGLKNNSSKLNVDYDE---------LLNALIPVGTIVAWGST 98 Query: 102 -DTVPSGYALMQGQTFDKSAYPKLAVAYPSGVIPDMRGWTIKGKPASGRAVLSQEQDGIK 160 + P G+AL G T +PD+ G + G G +S D + Sbjct: 99 SNNPPKGWALCDGST---------------AGVPDLTGCFLMGNKTYGTNAVS---DNRR 140 Query: 161 SHTHSASASSTDLGTETTS----SFDY-----GTKSTNNTGAHTHSISGTANSAGAHQHK 211 S++ASS LG + S D+ S + G + H A + Sbjct: 141 ILGTSSNASSLVLGHKLNINQIPSHDHQMTIMQEHSKSKNGTYMHYYLAPATGNNNNWRG 200 Query: 212 SSGAFGGTNT 221 ++ + GG + Sbjct: 201 NTNSKGGNQS 210 >UniRef50_D1Y7E0 Collagen alpha 1 n=1 Tax=Pyramidobacter piscolens W5455 RepID=D1Y7E0_9BACT Length = 386 Score = 42.2 bits (97), Expect = 0.025, Method: Composition-based stats. 
Identities = 30/168 (17%), Positives = 48/168 (28%), Gaps = 8/168 (4%) Query: 91 YPVGAPIPWPSDTVPSGYALMQGQTFDKSAYPKLAVAYPSGVIPDMRGWTIKGKPASGRA 150 P+GA + + GY L G +D + YP LA + +PDM T P Sbjct: 111 VPIGATVMFKKGQQEPGYLLANGAPYDTAKYPYLADCLGAANLPDM-SHTPVLIPGWAWY 169 Query: 151 VLSQEQDGIKSHTHSASASSTDLGTETTSSFDYGTKSTNNTGAHTHSISGTANSAGAHQH 210 V + + ++ + + YG +++ + + G G Sbjct: 170 VKAYHRPNVR---GDLKRLQILVHQLPSDQAAYGAYNSSEDILNLYIPQGIQGIQGPMGP 226 Query: 211 KSSGAFGGTNTSIFPNGYTAISNLSAGIMSTTSGSGQTRNAGKTSSDG 258 G P G G G G T G Sbjct: 227 TGPAGPQGPQGEQGPRGIQGPQ----GEQGLRGPEGAQGVKGDTGEQG 270 >UniRef50_Q56BI6 Gp12 short tail fibers n=1 Tax=Enterobacteria phage RB43 RepID=Q56BI6_9CAUD Length = 463 Score = 42.2 bits (97), Expect = 0.026, Method: Composition-based stats. Identities = 37/188 (19%), Positives = 66/188 (35%), Gaps = 23/188 (12%) Query: 3 ITALTDNTQGAAGLELYEVYNNGYPTAYGNIIHLKGMTAVGEGELLIGWSGTSGAHAPAF 62 + A TD+T+ L+L + +G ++G ++ L G L + ++GA+ Sbjct: 230 VNAGTDDTKAVTPLKLANLKGSG--GSFG-LVKLSTEVNAG----LANTALSAGANVVPS 282 Query: 63 IRSRRDTTDANWSP---WAQLYTSAHPPAEFYPVGAPIPWPSDTVPSGYALMQGQTFDKS 119 R T A + A Y + P+G + ++ + G+ Sbjct: 283 NRDSAITGGALYQGSVAAANKYQTHSDIEASLPIGCMMMAAFNSDYGNLCIANGRGMYTY 342 Query: 120 AYPKLAV----AYPSGV----IPDMRGWTIKGKP-----ASGRAVLSQEQDGIKSHTHSA 166 YP+L Y +PDMRG +G GR + + ++SH H Sbjct: 343 EYPELFALIGYTYGGSGNIFNLPDMRGVVARGFDAGRGLDPGRGFGTYQHHEVQSHEHPL 402 Query: 167 SASSTDLG 174 G Sbjct: 403 QMIYQSGG 410 >UniRef50_A1TNG3 Phage Tail Collar domain protein n=9 Tax=Bacteria RepID=A1TNG3_ACIAC Length = 176 Score = 42.2 bits (97), Expect = 0.026, Method: Composition-based stats. 
Identities = 24/134 (17%), Positives = 47/134 (35%), Gaps = 21/134 (15%) Query: 94 GAPIPWPSDTVPSGYALMQGQ----TFDKSAYPKLAVAYPSGV-----IPDMRGWTIKGK 144 G + + P G+A QGQ + + + L Y +PD+RG G+ Sbjct: 8 GEISMFAGNFPPKGWAFCQGQILPIAQNSALFALLGTTYGGNGQTTFALPDLRGRVPLGQ 67 Query: 145 PA------------SGRAVLSQEQDGIKSHTHSASASSTDLGTETTSSFDYGTKSTNNTG 192 G+ ++ + + + HTH+ S S + + + +++ Sbjct: 68 GQGPGLQPYSQGQVGGQETVTLQGNQMPMHTHTTSVSVSSNAGNSAAPNGRYLAASDQRN 127 Query: 193 AHTHSISGTANSAG 206 SG + AG Sbjct: 128 DQYTDQSGNGSLAG 141 >UniRef50_B8DLJ2 Tail fiber protein, putative n=3 Tax=Desulfovibrio vulgaris RepID=B8DLJ2_DESVM Length = 505 Score = 42.2 bits (97), Expect = 0.027, Method: Composition-based stats. Identities = 24/110 (21%), Positives = 36/110 (32%), Gaps = 32/110 (29%) Query: 92 PVGAPIPWPSDTVPSGYALMQ-GQTFDKSAYPKL-AVAYPSGVI---------------- 133 P+G WP T P+G + G + AYP+L A+A SG I Sbjct: 214 PIGMTFWWPGTTPPAGSLAINDGPLLPREAYPQLWAMAQASGNIITEAAWQAQAAVQSSV 273 Query: 134 --------------PDMRGWTIKGKPASGRAVLSQEQDGIKSHTHSASAS 169 P +R + P+ GRAV + + + Sbjct: 274 GAFSSGDGATTFRCPRLRDFVRGANPSGGRAVGAWQAHATEGLFVPMDGD 323 >UniRef50_C7BIF9 Putative uncharacterized protein n=1 Tax=Photorhabdus asymbiotica subsp. asymbiotica ATCC 43949 RepID=C7BIF9_PHOAA Length = 286 Score = 41.8 bits (96), Expect = 0.031, Method: Composition-based stats. 
Identities = 33/203 (16%), Positives = 57/203 (28%), Gaps = 36/203 (17%) Query: 87 PAEFYPVGAPIPWPSDTV--PSGYALMQGQTFDKSAYPKLAVAYPSGVIPDMRGWTIKGK 144 + +P G + + P G+A G ++ +PD+R I Sbjct: 98 SDQIFPKGMIVMFSGSENEIPPGWAFCDGGEYNGI------------KVPDLRNRFIMCS 145 Query: 145 PASGRAVLSQ----EQDGIKSHTHSASASSTDLGTETTSSFDYGTKSTNNTGAHTHSISG 200 S K+ + + + + ++ T + H H Sbjct: 146 ETFAEKGESSKKANGDGNNKNFLKDTESITVSIDVKVENT----TLDISQIPKHNHIQG- 200 Query: 201 TANSAGAHQHKSSGAFGGTNTSIFPNGYTAISNLSAGIMSTTSGSGQTRNAGKTSSDG-- 258 + S FG + Y + S+ T S TS G Sbjct: 201 -------LPYHSDVGFGYPHVKWGKTPYRIDNTYSSSFWHTDKSSNNDDLHPNTSEVGEG 253 Query: 259 -AHTHSLSGTAASAGAHAHTVGI 280 H HS AS+ H+H V + Sbjct: 254 KGHNHS---ATASSSPHSHKVDV 273 >UniRef50_Q8GDJ7 Orf24 n=1 Tax=Photorhabdus luminescens RepID=Q8GDJ7_PHOLU Length = 434 Score = 41.8 bits (96), Expect = 0.031, Method: Composition-based stats. Identities = 35/215 (16%), Positives = 60/215 (27%), Gaps = 47/215 (21%) Query: 87 PAEFYPVGAPIPWPSDTVPSGYALMQGQTFDKSAYPKLAVAYPSGVIPDMRGWTIKGKPA 146 P + P G + + VP G+A G PD+R + G Sbjct: 255 PDKVLPRGMIVMFSGSVVPQGWAFCDGT----------------NGTPDLRDRFVSG--- 295 Query: 147 SGRAVLSQEQDGIKSHTHSASASSTDLGTETTSSFDYGTK-STNNTGAHTHSISGTANSA 205 + + G +F+ T N + + T + Sbjct: 296 -----------AWQLSDAGNTNDKRITGDNKNKAFNAQTTADKTNLSVNVQDTTLTIDQI 344 Query: 206 GAHQHKSSGAFGGTNTSIFPNGYTAISNLSAGIMSTTSGSGQTRNAGKTSSDGAHTHSLS 265 +H H T + + + L+ + + D +SL Sbjct: 345 PSHSHIEGMRMQITQAAEYGLVTQKANQLN-----------RYNLNNQIIHDSNEDYSLH 393 Query: 266 GTAASAGAHAHTVGIGAHTHSVAIGSHGHTITVNA 300 T G AH H SV +H HT++V+ Sbjct: 394 KTNEIGGGKAHN-----HQTSVNETAHQHTVSVSP 423 >UniRef50_C3X3W3 Predicted protein n=1 Tax=Oxalobacter formigenes HOxBLS RepID=C3X3W3_OXAFO Length = 315 Score = 41.8 bits (96), Expect = 0.034, Method: Composition-based stats. 
Identities = 28/132 (21%), Positives = 45/132 (34%), Gaps = 43/132 (32%) Query: 100 PSDTVPSGYALMQGQTFDKSAYPKLAVAYPSGVIPDMRGWTIKG---------KPASGRA 150 +D +PSG+ L G PD+R + G K + Sbjct: 187 AADHIPSGWLLCNG----------------ENGTPDLRDRFVVGAGKAYAVYAKGGATTG 230 Query: 151 VLSQEQD-------GIKSHTH--------SASASSTDLGTETTSSFDYGTKSTNNTGA-- 193 +S + I SH H S +A + TT +F + T G Sbjct: 231 AVSGQTGETTLTINQIPSHNHGVGYYISRSGNAGNGFQVERTTDNFAFTYLYTTVQGGNQ 290 Query: 194 -HTHSISGTANS 204 H+HS+SG+ ++ Sbjct: 291 PHSHSLSGSVST 302 >UniRef50_C7BQB5 Putative uncharacterized protein n=1 Tax=Photorhabdus asymbiotica subsp. asymbiotica ATCC 43949 RepID=C7BQB5_PHOAA Length = 406 Score = 41.4 bits (95), Expect = 0.040, Method: Composition-based stats. Identities = 29/190 (15%), Positives = 55/190 (28%), Gaps = 36/190 (18%) Query: 87 PAEFYPVGAPIPWPSDTVPSGYALMQGQTFDKSAYPKLAVAYPSGVIPDMRGWTIKGKPA 146 P + P G + + ++ P+G+A G + PD+R I Sbjct: 225 PNKVLPRGMIVMFSGNSAPTGWAFCDG----------------NSGTPDLRSRFIMCGE- 267 Query: 147 SGRAVLSQEQDGIKSHTHSASASSTDLGTETTSSFDYGTKSTNNTGAHTHSISGTANSAG 206 + + G S+ S S + + TTS+ + NT T + Sbjct: 268 ------TISETGKSSNKASGSGNGKNFSRNTTSTTVSVNVTVQNTTL-------TESQIP 314 Query: 207 AHQHKSSGAFGGTNTSIFPNGYTAISNLSAGIMSTTSGSGQTRNAGKTSSDGAHTHSLSG 266 H+H + + G+ + + + + S G H + Sbjct: 315 KHKHIEALPY------YNTLGFAYGNTPIGSTKYQINNTSSSMFFWHPSPTGNDYHPYTS 368 Query: 267 TAASAGAHAH 276 H H Sbjct: 369 EVGGGQGHNH 378 >UniRef50_Q2W7B1 Microcystin-dependent protein n=3 Tax=Proteobacteria RepID=Q2W7B1_MAGSA Length = 177 Score = 41.4 bits (95), Expect = 0.042, Method: Composition-based stats. 
Identities = 29/165 (17%), Positives = 55/165 (33%), Gaps = 32/165 (19%) Query: 99 WPSDTVPSGYALMQGQTFDKSAYPKLAV---------AYPSGVIPDMRGWTIKGKPASGR 149 +P + P+G+ G++ SA L A + +PD+RG TI G+ Sbjct: 12 FPLNWAPTGWLPCDGRSMQVSANAALFSLLGNQFGGDAKTTFFLPDLRGRTIMGQ----- 66 Query: 150 AVLSQEQDGIKSHTHSASASSTDLGTETTSSFDYGTKSTNNTGAHTHSISGT----ANSA 205 + T + T +T +H H + G A + Sbjct: 67 --------------GKNPVTGVSYVTGAYGGTESVTLTTAQLPSHQHQVVGDQTVGATNP 112 Query: 206 GAHQHKSSGAFGGTNTSIFPNGYTAISNLSAGIMSTTSGSGQTRN 250 + + + GT S++ +G + A + + G+ T Sbjct: 113 ADDNYLAVPIYNGTQKSLYNSGTKPVPLNPASVSTVGGGAAHTNT 157 >UniRef50_B5ZGB2 Tail Collar domain protein n=4 Tax=Gluconacetobacter diazotrophicus PAl 5 RepID=B5ZGB2_GLUDA Length = 300 Score = 41.4 bits (95), Expect = 0.042, Method: Composition-based stats. Identities = 32/176 (18%), Positives = 50/176 (28%), Gaps = 19/176 (10%) Query: 95 APIPWPSDTVPSGYALMQGQTFDKSAYPKLAVAYP----------SGVIPDMRGWTIKGK 144 + T P+G+ L GQ ++ Y L + +PD+RG G Sbjct: 106 TVADYAGATAPAGWMLCCGQAVSRATYAALFAVIGTTFGAGDGATTFGLPDLRGRVAAGV 165 Query: 145 PASGRA---VLSQEQDGIKSHTHSASASSTDLGTETTSSFDYGTKSTNNTGAHTHSISGT 201 + G +L+ GI A+ S + T + D G H H Sbjct: 166 DSMGGTAANLLTMAGAGINGVQLGAAGGSQMAPSHTHAVTDPGHAHAVTDPGHAHGPGSG 225 Query: 202 ANSAGAHQHKSSGAFGGTNTSIFPNGYTAISNLSAGIMSTTSGSGQTRNAGKTSSD 257 GG + T + TT + T G T + Sbjct: 226 TG------FVVPQGTGGEIVTFDGGSLTPEHATAQTTADTTGVTVDTATTGITLAA 275 >UniRef50_Q2W7B2 Microcystin-dependent protein n=1 Tax=Magnetospirillum magneticum AMB-1 RepID=Q2W7B2_MAGSA Length = 192 Score = 41.4 bits (95), Expect = 0.045, Method: Composition-based stats. 
Identities = 32/169 (18%), Positives = 49/169 (28%), Gaps = 6/169 (3%) Query: 94 GAPIPWPSDTVPSGYALMQGQTFDKSAYPKLAVAYPSGVIPDMRGWTIKGKPASGRAVLS 153 G I + P +A+ G S YP L + G T G P + Sbjct: 6 GQIILFSGSYAPVNWAVCDGHQLSVSQYPALFSLLGTQF--GGNGTTTFGLPDLRSRLAM 63 Query: 154 QEQDGIKSHTHSASASSTDLGTETTSSFDYGTKSTNNTGAHTHSISGTANSAGAHQHKSS 213 G S SA T G T + T + HTH T N++G + Sbjct: 64 GFGTGHVDPKASNSAPLTPYGFATNGGVETVTLTQAQIPPHTH----TLNASGDPVVSPN 119 Query: 214 GAFGGTNTSIFPNGYTAISNLSAGIMSTTSGSGQTRNAGKTSSDGAHTH 262 + G + + T Q + T++ + H Sbjct: 120 PSGGVPASFTDGTHVAYFDTPNPIPSGMTITPKQLGASMVTTAGASQPH 168 >UniRef50_B6VNN2 Putative uncharacterized protein n=1 Tax=Photorhabdus asymbiotica subsp. asymbiotica ATCC 43949 RepID=B6VNN2_PHOAA Length = 508 Score = 41.4 bits (95), Expect = 0.045, Method: Composition-based stats. Identities = 33/198 (16%), Positives = 62/198 (31%), Gaps = 29/198 (14%) Query: 84 AHPPAEFYPVGAPIPWPSDTVPSGYALMQGQTFDKSAYPKLAVAYPSGVIPDMRGWTI-K 142 A P P G + + T P+G+AL G + P++ I Sbjct: 318 AIDPNNVLPKGVIVMFSGSTAPTGWALCDG----------------NNGTPNLIDRFILG 361 Query: 143 GKPASGRAVLSQEQDGIKS---HTHSASASSTDLGTETTSSFDYGTKSTNNTGAHTHSIS 199 GK V + G K+ S+ ++ + +T S H H Sbjct: 362 GKGTDINGVSTNTASGTKNSKLFDFSSDEATLTIDGKTLGR----ALSLQQIPNHAHFSG 417 Query: 200 GTANSAGAHQHKSSGAFGGT-NTSIFPNGYTAISNLSAGIMSTTSGSGQTRNAGKTSSDG 258 ++ + + S + N S+G++ + + + G + Sbjct: 418 IIMDTEKVNYYGSKKITTNVWGVTTGDNTSVRYIYKSSGVLDSNNNVSNSTLGGNSLQTH 477 Query: 259 AHTHSLSGTAASAGAHAH 276 H ++GT G H+H Sbjct: 478 DHDIKITGT----GKHSH 491 >UniRef50_B3QRT1 Tail Collar domain protein n=1 Tax=Chloroherpeton thalassium ATCC 35110 RepID=B3QRT1_CHLT3 Length = 176 Score = 41.0 bits (94), Expect = 0.057, Method: Composition-based stats. 
Identities = 29/170 (17%), Positives = 50/170 (29%), Gaps = 28/170 (16%) Query: 93 VGAPIPWPSDTVPSGYALMQGQTF----DKSAYPKLAVAYP-----SGVIPDMRGWTIKG 143 +G + P G+A GQ +++ Y L Y + +PD+RG + G Sbjct: 7 IGEIRLFGFGWAPDGWAQCNGQLLLINENQALYSLLGTMYGGDARSTFGVPDLRGRAVIG 66 Query: 144 KPASGRAVLSQEQDGIKSHTHSASASSTDLGTETTSSFDYGTKSTNNTGAHTHSISGTAN 203 S + S + G E T T AH H++ Sbjct: 67 YGQSPKLSYSYQMSQ--------------WGGEETV-----TLGVAQIPAHNHTLIADGA 107 Query: 204 SAGAHQHKSSGAFGGTNTSIFPNGYTAISNLSAGIMSTTSGSGQTRNAGK 253 + +++ G ++ G + T GS N Sbjct: 108 TGTLLNPQNNYLAEGAFPGAAFYSADKSVAMNQGTIGNTGGSQPHENRSP 157 >UniRef50_A1TUY7 Phage Tail Collar domain protein n=4 Tax=Acidovorax RepID=A1TUY7_ACIAC Length = 204 Score = 41.0 bits (94), Expect = 0.057, Method: Composition-based stats. Identities = 14/60 (23%), Positives = 23/60 (38%), Gaps = 9/60 (15%) Query: 93 VGAPIPWPSDTVPSGYALMQGQTFDKSAYPKLAVAYPS---------GVIPDMRGWTIKG 143 +G + W + VP G+AL G + + P L + +PD+R G Sbjct: 6 IGTVLLWTAAFVPRGWALCDGSVLNITQNPALFAILGNRFGGDGRTTFQLPDLRNRVPMG 65 >UniRef50_A2A761 Zinc finger protein 69 n=3 Tax=Mus musculus RepID=A2A761_MOUSE Length = 587 Score = 40.7 bits (93), Expect = 0.078, Method: Composition-based stats. Identities = 26/101 (25%), Positives = 41/101 (40%), Gaps = 3/101 (2%) Query: 148 GRAVLSQEQDGIKSHTHSASASSTDLGTETTSSFDYGTKSTNNTGAHTHSISGTANSAGA 207 G A+LS + +G + + A T GT S D T T TG+H + GT + Sbjct: 33 GEALLSHDANGTQQ---ESLADGTTPGTPAAGSHDGATPGTTATGSHDEATPGTPAAGSH 89 Query: 208 HQHKSSGAFGGTNTSIFPNGYTAISNLSAGIMSTTSGSGQT 248 G++ P TA S+ +T +GS ++ Sbjct: 90 DGETPGIPAAGSHDGETPGTPTAGSHDGVTPGTTAAGSQES 130 >UniRef50_B0MAY2 Putative uncharacterized protein n=1 Tax=Anaerostipes caccae DSM 14662 RepID=B0MAY2_9FIRM Length = 582 Score = 40.3 bits (92), Expect = 0.095, Method: Composition-based stats. 
Identities = 26/122 (21%), Positives = 43/122 (35%), Gaps = 14/122 (11%) Query: 195 THSISGTANSAGAHQHKSSGAFGGTNTSIFPNGYTAISNLSAGIMSTTSGSGQTRNAGKT 254 SI + N+A + FGG+ + G + SG + Sbjct: 409 VGSIYISVNNAN-----PASFFGGSWVQFATGKTIVGVDTGQGEFNAVEKSGGHKELQ-- 461 Query: 255 SSDGAHTHSLSGTAASAGAHAHTVGIGAHTHSV-AIGSHGHTITVNAAGNAENTVKNIAF 313 +H H ++ S H HTV H H++ G+H H + +N + T N Sbjct: 462 ----SHAHGMNNHVHSLNNHTHTVP--NHVHTMQGAGNHYHYLGINKDAVQKGTSYNKPN 515 Query: 314 NY 315 N+ Sbjct: 516 NF 517 >UniRef50_A5GA42 Phage Tail Collar domain protein n=2 Tax=Bacteria RepID=A5GA42_GEOUR Length = 181 Score = 40.3 bits (92), Expect = 0.096, Method: Composition-based stats. Identities = 31/169 (18%), Positives = 59/169 (34%), Gaps = 25/169 (14%) Query: 99 WPSDTVPSGYALMQGQ----TFDKSAYPKLAVAYP-----SGVIPDMRGWT--------- 140 +P D P G+AL G +++ Y L + + +PD++G Sbjct: 12 FPFDFAPRGWALCNGALLPIVQNQALYSLLNTTFGGDGKTNFGLPDLQGRVPMPPGTNPV 71 Query: 141 ----IKGKPASGRAVLSQEQDGIKSHTHSASASSTDLGTETTSSFDYGT---KSTNNTGA 193 + G ++ I HTHSA A+S + + + G K+ + + Sbjct: 72 CGNIVAAGKKDGSETVTLTTSQIPPHTHSALANSINADFASPVTLAAGNIWAKADDPSSN 131 Query: 194 HTHSISGTANSAGAHQHKSSGAFGGTNTSIFPNGYTAISNLSAGIMSTT 242 ++ AN+ S+ GG + ++ P G+ T Sbjct: 132 PVNAYESGANAVMDQSALSTAGGGGAHNNMQPYQVVNYCIALMGLYPTR 180 Searching..................................................done Results from round 3 Score E Sequences producing significant alignments: (bits) Value Sequences used in model and found again: UniRef50_Q3YZL1 Phage protein-related n=42 Tax=Enterobacteriacea... 291 2e-77 UniRef50_B5FQX9 Side tail fiber protein n=2 Tax=Salmonella enter... 245 1e-63 UniRef50_P76072 Side tail fiber protein homolog from lambdoid pr... 239 9e-62 UniRef50_B7L485 Putative tail fiber protein n=4 Tax=Escherichia ... 224 3e-57 UniRef50_B5PP06 Side tail fiber protein n=2 Tax=Salmonella enter... 214 4e-54 UniRef50_B3X2T1 Side tail fiber protein n=1 Tax=Shigella dysente... 
211 2e-53
UniRef50_B3XIH6 GpH n=7 Tax=Enterobacteriaceae RepID=B3XIH6_ECOLX 209 1e-52
UniRef50_A4TT73 Phage tail protein n=30 Tax=Enterobacteriaceae R... 203 7e-51
UniRef50_P03764 Side tail fiber protein n=2 Tax=root RepID=STF_L... 194 4e-48
UniRef50_B3I2W7 Phage Tail Collar Domain family n=1 Tax=Escheric... 180 6e-44
UniRef50_B4T041 Gp19 n=1 Tax=Salmonella enterica subsp. enterica... 177 4e-43
UniRef50_B7LN99 Putative tail fiber protein n=2 Tax=Enterobacter... 175 1e-42
UniRef50_B7LKX7 Putative side tail phage protein n=2 Tax=Escheri... 171 2e-41
UniRef50_Q3ZL14 Tail fiber protein n=2 Tax=Enterobacteriaceae Re... 171 2e-41
UniRef50_Q99362 Protein 37 (Fragment) n=1 Tax=Enterobacteria pha... 160 5e-38
UniRef50_A9DEM1 Tail protein 2 n=1 Tax=Yersinia phage PY100 RepI... 154 4e-36
UniRef50_B6S308 ORF32 (Fragment) n=3 Tax=Enterobacteriaceae RepI... 144 5e-33
UniRef50_C4UEH4 Variable tail fiber protein n=3 Tax=Yersinia Rep... 142 2e-32
UniRef50_A7FIU0 Tail collar domain protein n=1 Tax=Yersinia pseu... 129 1e-28
UniRef50_Q9AHZ5 YdaB n=29 Tax=Enterobacteriaceae RepID=Q9AHZ5_PHOLU 126 1e-27
UniRef50_B2SVF7 Phage-related protein n=3 Tax=Xanthomonas oryzae... 124 5e-27
UniRef50_C5ABB4 Putative uncharacterized protein n=1 Tax=Burkhol... 123 6e-27
UniRef50_UPI00016A4B89 phage-related tail fiber protein n=2 Tax=... 118 3e-25
UniRef50_C6CP88 Tail Collar domain protein n=5 Tax=Enterobacteri... 117 5e-25
UniRef50_D1TPQ4 Phage tail collar domain protein n=1 Tax=Yersini... 116 1e-24
UniRef50_C7BL21 Similarities with phage tail fiber protein n=1 T... 116 1e-24
UniRef50_D0IJ09 Tail fiber protein H putative n=1 Tax=Vibrio sp.... 116 1e-24
UniRef50_D2U2G6 Phage tail assembly protein n=1 Tax=Arsenophonus... 116 1e-24
UniRef50_Q2T5M0 Phage-related tail fiber protein n=19 Tax=root R... 115 2e-24
UniRef50_B2I5N0 Tail Collar domain protein n=13 Tax=Xylella fast... 115 3e-24
UniRef50_Q7N2Q1 Similar to Sc/SvQ protein of Escherichia coli pl... 113 7e-24
UniRef50_C6CGA4 Tail Collar domain protein n=7 Tax=Enterobacteri... 113 7e-24
UniRef50_C6CGA0 Tail Collar domain protein n=5 Tax=Enterobacteri... 112 1e-23
UniRef50_UPI000190EC42 bacteriophage tail fiber protein n=1 Tax=... 112 2e-23
UniRef50_A6VBH2 Putative tail fiber protein n=2 Tax=Pseudomonas ... 111 3e-23
UniRef50_A4WEL3 Phage Tail Collar domain protein n=2 Tax=Enterob... 110 5e-23
UniRef50_C6V0Q3 Phage tail fiber protein n=13 Tax=Enterobacteria... 110 5e-23
UniRef50_C5BH14 Tail fiber n=2 Tax=Enterobacteriaceae RepID=C5BH... 110 5e-23
UniRef50_D2TR91 Putative phage tail fibre protein n=1 Tax=Citrob... 110 6e-23
UniRef50_D2BT14 Tail Collar domain protein n=1 Tax=Dickeya dadan... 110 8e-23
UniRef50_A9R3H4 Tail collar domain protein n=20 Tax=Yersinia Rep... 109 1e-22
UniRef50_A9ITY4 Phage related protein n=6 Tax=Bartonella RepID=A... 108 2e-22
UniRef50_B3X4P8 Tail fiber n=3 Tax=Enterobacteriaceae RepID=B3X4... 108 2e-22
UniRef50_D2MH12 Tail Collar domain protein n=1 Tax=Rhodopseudomo... 108 2e-22
UniRef50_B3I9S3 DNA inversion product n=5 Tax=Escherichia coli R... 108 2e-22
UniRef50_D2U1K0 Phage tail protein n=1 Tax=Arsenophonus nasoniae... 108 4e-22
UniRef50_Q6KGF6 Putative tail fiber protein GP37 n=2 Tax=unclass... 108 4e-22
UniRef50_C6DE08 Tail Collar domain protein n=1 Tax=Pectobacteriu... 107 5e-22
UniRef50_B2PZV1 Putative uncharacterized protein n=2 Tax=Gammapr... 107 5e-22
UniRef50_Q7N2R3 Similar to tail fiber assembly protein from bact... 107 8e-22
UniRef50_Q7N5C0 Similarities with DNA inversion product n=5 Tax=... 106 1e-21
UniRef50_C7BSQ6 Putative tail fiber protein n=1 Tax=Photorhabdus... 106 1e-21
UniRef50_Q7N348 Similarities with tail fiber protein n=1 Tax=Pho... 105 2e-21
UniRef50_B4EF34 Putative phage tail protein n=1 Tax=Burkholderia... 105 2e-21
UniRef50_B3R3K1 Bacteriophage large tail fiber protein n=1 Tax=C... 105 2e-21
UniRef50_C6C5D2 Tail Collar domain protein n=2 Tax=Dickeya dadan... 105 2e-21
UniRef50_C6CG98 Tail Collar domain protein n=1 Tax=Dickeya zeae ... 104 3e-21
UniRef50_B5JYG6 Phage Tail Collar Domain family (Fragment) n=1 T... 103 9e-21
UniRef50_C5H7L2 Putative tail fiber protein GP37 n=3 Tax=unclass... 103 1e-20
UniRef50_C6DA10 Tail Collar domain protein n=7 Tax=Enterobacteri... 103 1e-20
UniRef50_C7BSQ1 Side tail fiber protein homolog from lambdoid pr... 102 1e-20
UniRef50_B7US81 Predicted tail fiber protein n=16 Tax=root RepID... 102 1e-20
UniRef50_UPI0001B5347E putative variable tail fiber protein n=1 ... 102 1e-20
UniRef50_C6C6Z0 Tail Collar domain protein n=1 Tax=Dickeya dadan... 102 2e-20
UniRef50_B7MJL6 Putative phage tail protein n=3 Tax=Escherichia ... 101 3e-20
UniRef50_C5J9F2 Putative uncharacterized protein (Fragment) n=1 ... 101 3e-20
UniRef50_Q1I687 Putative phage variable tail fibre protein n=1 T... 101 4e-20
UniRef50_D2BYH6 Tail Collar domain protein n=1 Tax=Dickeya dadan... 101 4e-20
UniRef50_P33227 Putative protein stfE (Fragment) n=60 Tax=Entero... 100 6e-20
UniRef50_Q6D3Y6 Probable bacteriophage variable tail fiber prote... 100 9e-20
UniRef50_B3I8J5 DNA inversion product n=3 Tax=Enterobacteriaceae... 100 1e-19
UniRef50_D0ZBI4 Putative tail fiber protein n=1 Tax=Edwardsiella... 99 2e-19
UniRef50_B6VKW8 Sc/svq protein n=2 Tax=Photorhabdus asymbiotica ... 99 2e-19
UniRef50_Q0BEK5 Phage Tail Collar domain protein n=1 Tax=Burkhol... 99 2e-19
UniRef50_C3X8I7 Putative uncharacterized protein n=1 Tax=Oxaloba... 99 2e-19
UniRef50_A9Q1X5 Putative tail fiber protein n=1 Tax=Enterobacter... 99 3e-19
UniRef50_Q9MCR6 Tail fiber n=1 Tax=Enterobacteria phage HK97 Rep... 98 4e-19
UniRef50_A3GRX3 Tail fiber like-protein n=1 Tax=Vibrio cholerae ... 98 5e-19
UniRef50_Q3KH70 Putative phage tail fiber-related protein n=1 Ta... 98 5e-19
UniRef50_C4S5W0 Putative uncharacterized protein n=2 Tax=Yersini... 97 6e-19
UniRef50_C6CP84 Tail Collar domain protein n=2 Tax=Dickeya RepID... 97 1e-18
UniRef50_D0KLI8 Tail Collar domain protein n=1 Tax=Pectobacteriu... 96 1e-18
UniRef50_C6C5D4 Tail Collar domain protein n=1 Tax=Dickeya dadan... 95 2e-18
UniRef50_Q65WH4 Putative uncharacterized protein n=1 Tax=Mannhei... 95 4e-18
UniRef50_B7NJP1 Putative side tail fiber protein homolog from la... 94 8e-18
UniRef50_B9BDD9 Bacteriophage protein n=3 Tax=Burkholderia RepID... 92 3e-17
UniRef50_P51735 Probable tail fiber protein n=27 Tax=root RepID=... 92 3e-17
UniRef50_C8PDQ5 Phage Tail Collar Domain protein n=1 Tax=Campylo... 92 4e-17
UniRef50_C3KCU2 Putative phage tail fiber-related protein n=1 Ta... 91 5e-17
UniRef50_C3X3R8 Predicted protein n=4 Tax=Oxalobacter formigenes... 90 8e-17
UniRef50_D0KGE1 Shufflon domain protein n=1 Tax=Pectobacterium w... 90 1e-16
UniRef50_C3X912 Phage tail collar domain-containing protein n=1 ... 89 2e-16
UniRef50_C6ABW9 Phage tail collar protein n=1 Tax=Bartonella gra... 89 2e-16
UniRef50_C3X971 Predicted protein n=6 Tax=Oxalobacter formigenes... 89 2e-16
UniRef50_C5A8Q3 Phage-related tail fiber protein n=1 Tax=Burkhol... 89 2e-16
UniRef50_Q6D2U8 Bacteriophage tail fiber protein n=1 Tax=Pectoba... 89 2e-16
UniRef50_A9IRI0 Phage related protein n=9 Tax=Bartonella RepID=A... 89 3e-16
UniRef50_A1VSH6 Phage Tail Collar domain protein n=1 Tax=Polarom... 89 3e-16
UniRef50_A4PE45 Tail fiber protein gpH n=3 Tax=root RepID=A4PE45... 88 3e-16
UniRef50_C4TTI1 Tail fiber protein n=2 Tax=Yersinia RepID=C4TTI1... 87 7e-16
UniRef50_C5H7L3 Putative tail fiber protein n=1 Tax=Enterobacter... 87 7e-16
UniRef50_C3X3W1 Predicted protein n=2 Tax=Oxalobacter formigenes... 87 1e-15
UniRef50_Q7N6T1 Similarities with DNA inversion product n=2 Tax=... 86 1e-15
UniRef50_B0UTN0 Phage Tail Collar domain protein n=1 Tax=Haemoph... 85 2e-15
UniRef50_B2FIY3 Putative phage collar protein n=1 Tax=Stenotroph... 85 3e-15
UniRef50_B2ZY49 Phage tail collar domain protein n=1 Tax=Ralston... 84 5e-15
UniRef50_C3X1Y2 Tail fiber protein gpH n=1 Tax=Oxalobacter formi... 84 5e-15
UniRef50_C3X8V5 Bacteriophage tail fiber protein n=2 Tax=Oxaloba... 84 6e-15
UniRef50_C4GFX3 Putative uncharacterized protein n=2 Tax=Kingell... 84 6e-15
UniRef50_A9LZ37 Tail fibre protein, putative n=21 Tax=Neisseria ... 84 7e-15
UniRef50_A6E6G6 Phage Tail Collar n=1 Tax=Pedobacter sp. BAL39 R... 84 7e-15
UniRef50_UPI0001A44C27 bacteriophage tail fiber protein n=2 Tax=... 84 8e-15
UniRef50_Q7P172 Probable bacteriophge tail fiber protein n=1 Tax... 84 9e-15
UniRef50_Q7NAA0 Complete genome; segment 1/17 n=2 Tax=Photorhabd... 84 1e-14
UniRef50_B8DPV9 Tail Collar domain protein n=1 Tax=Desulfovibrio... 83 1e-14
UniRef50_C6S6V6 Putative phage tail fibre protein n=1 Tax=Neisse... 83 2e-14
UniRef50_B0USC5 Phage Tail Collar domain protein n=1 Tax=Haemoph... 83 2e-14
UniRef50_Q4TVW2 Tail fiber protein n=4 Tax=Viruses RepID=Q4TVW2_... 83 2e-14
UniRef50_A0NQ95 Putative tail fiber-related protein n=1 Tax=Labr... 82 2e-14
UniRef50_C3X8U2 Phage Tail Collar Domain containing protein n=1 ... 82 3e-14
UniRef50_UPI000180B6D6 PREDICTED: similar to glutamate receptor,... 81 5e-14
UniRef50_C3X8Y3 Putative uncharacterized protein n=1 Tax=Oxaloba... 81 6e-14
UniRef50_D0KGE5 Tail Collar domain protein n=4 Tax=Pectobacteriu... 81 6e-14
UniRef50_C3X3G6 Putative uncharacterized protein n=1 Tax=Oxaloba... 80 7e-14
UniRef50_B1M1N8 Tail Collar domain protein n=1 Tax=Methylobacter... 80 9e-14
UniRef50_B3Z3L3 Phage minor structural protein n=3 Tax=Bacillus ... 80 9e-14
UniRef50_Q7Y2B3 Gp12 Short tail fibers n=2 Tax=unclassified T4-l... 80 1e-13
UniRef50_C6BT48 Tail Collar domain protein n=1 Tax=Desulfovibrio... 79 3e-13
UniRef50_Q7N047 Similarities with bacteriophage protein n=3 Tax=... 79 3e-13
UniRef50_A9IXL3 Phage-related protein n=6 Tax=Bartonella RepID=A... 78 4e-13
UniRef50_Q4KHC6 Tail fibre protein, putative n=1 Tax=Pseudomonas... 77 7e-13
UniRef50_C3X192 Predicted protein n=1 Tax=Oxalobacter formigenes... 77 7e-13
UniRef50_B3FYL6 Gp17 n=1 Tax=Salmonella phage phiSG-JL2 RepID=B3... 77 8e-13
UniRef50_C2RWX3 Phage minor structural protein n=1 Tax=Bacillus ... 77 9e-13
UniRef50_Q93D60 PilV n=7 Tax=Escherichia coli RepID=Q93D60_ECOLX 77 9e-13
UniRef50_C3X909 Predicted protein n=2 Tax=Oxalobacter formigenes... 76 1e-12
UniRef50_C3X3K6 Predicted protein n=2 Tax=Oxalobacter formigenes... 75 2e-12
UniRef50_Q38190 Gp37, tip of tail fiber (Fragment) n=5 Tax=Enter... 75 3e-12
UniRef50_A9ITX5 Phage-related protein n=6 Tax=Bartonella RepID=A... 75 3e-12
UniRef50_Q6J803 Pas28 n=1 Tax=Actinoplanes phage phiAsp2 RepID=Q... 74 5e-12
UniRef50_Q116W7 Phage Tail Collar n=1 Tax=Trichodesmium erythrae... 74 7e-12
UniRef50_D0KGE3 Tail Collar domain protein n=1 Tax=Pectobacteriu... 74 8e-12
UniRef50_C3X8R9 Bacteriophage tail fiber protein n=6 Tax=Oxaloba... 73 8e-12
UniRef50_D1NFN8 Apo-citrate lyase phosphoribosyl-dephospho-CoA t... 73 8e-12
UniRef50_C3XAA4 Putative uncharacterized protein n=1 Tax=Oxaloba... 73 1e-11
UniRef50_D0FSD9 Phage related-protein n=2 Tax=Erwinia pyrifoliae... 72 3e-11
UniRef50_Q7P176 Probable bacteriophge tail fiber protein n=1 Tax... 72 3e-11
UniRef50_D1BW55 Tail Collar domain protein n=1 Tax=Xylanimonas c... 71 6e-11
UniRef50_B5S308 Phage tail collar protein n=2 Tax=Ralstonia sola... 70 8e-11
UniRef50_B5TAB1 Gp47 n=2 Tax=root RepID=B5TAB1_9CAUD 70 1e-10
UniRef50_C3LHF1 Phage minor structural protein n=13 Tax=Bacteria... 69 2e-10
UniRef50_C4VIX0 74kDa protein n=28 Tax=root RepID=C4VIX0_ENTFA 69 2e-10
UniRef50_B7UGJ3 Predicted tai fiber protein n=15 Tax=Escherichia... 69 3e-10
UniRef50_B3HKW0 Phage Tail Collar Domain protein n=11 Tax=Entero... 67 8e-10
UniRef50_A9AVE2 Tail Collar domain protein n=1 Tax=Herpetosiphon... 67 9e-10
UniRef50_B8HZW5 Tail Collar domain protein n=2 Tax=Clostridium R... 67 9e-10
UniRef50_B8HZW4 Tail Collar domain protein n=2 Tax=Clostridium R... 67 1e-09
UniRef50_A3GUE7 Tail fiber protein H, putative (Fragment) n=1 Ta... 66 1e-09
UniRef50_Q7N541 Similar to DNA inversion product and tail fiber ... 66 2e-09
UniRef50_Q4C9U4 Phage Tail Collar n=1 Tax=Crocosphaera watsonii ... 65 3e-09
UniRef50_C9PG79 Putative phage tail protein n=1 Tax=Vibrio furni... 65 4e-09
UniRef50_Q094A8 Phage Tail Collar Domain family n=1 Tax=Stigmate... 65 4e-09
UniRef50_C4MYW8 Gp12 Short tail fibers n=1 Tax=Enterobacteria ph... 65 5e-09
UniRef50_A1HR57 Putative uncharacterized protein n=1 Tax=Thermos... 64 5e-09
UniRef50_A4P195 Putative phage tail fibre protein (Fragment) n=1... 64 8e-09
UniRef50_A4NHY2 Probable tail fiber protein n=1 Tax=Haemophilus ... 63 9e-09
UniRef50_D0LMW0 Tail Collar domain protein n=1 Tax=Haliangium oc... 62 2e-08
UniRef50_A3YFP9 35 kDa protein-like n=1 Tax=Marinomonas sp. MED1... 61 4e-08
UniRef50_A7INV5 Tail Collar domain protein n=1 Tax=Xanthobacter ... 60 9e-08
UniRef50_B8QTW7 Putative tail fiber protein n=1 Tax=Erwinia phag... 59 2e-07
UniRef50_C2I7P2 Phage-related tail fiber protein n=1 Tax=Vibrio ... 58 5e-07
UniRef50_B8FJJ3 Tail Collar domain protein n=1 Tax=Desulfatibaci... 57 9e-07
UniRef50_Q31Q92 Putative uncharacterized protein n=2 Tax=Synecho... 56 1e-06
UniRef50_A8YDB4 Genome sequencing data, contig C291 n=2 Tax=Micr... 56 2e-06
UniRef50_B6XJ97 Putative uncharacterized protein n=2 Tax=Enterob... 55 2e-06
UniRef50_C0DSG4 Putative uncharacterized protein n=1 Tax=Eikenel... 55 3e-06
UniRef50_UPI00019136B5 bacteriophage tail fiber protein n=7 Tax=... 55 4e-06
UniRef50_C5B185 Putative uncharacterized protein n=1 Tax=Methylo... 54 5e-06
UniRef50_D2L4G0 Tail Collar domain protein n=1 Tax=Desulfovibrio... 48 6e-04
UniRef50_Q727X4 Tail fiber protein, putative n=4 Tax=Desulfovibr... 47 0.001

Sequences not found previously or not previously below threshold:

UniRef50_C9MDX4 Tail fiber protein (Fragment) n=5 Tax=Haemophilu... 52 4e-05
UniRef50_C5RN01 Tail Collar domain protein n=1 Tax=Clostridium c... 51 6e-05
UniRef50_D2V5I7 Microcystin-dependent protein n=1 Tax=Naegleria ... 50 1e-04
UniRef50_A0A7D3 Putative uncharacterized protein n=1 Tax=Microcy... 50 1e-04
UniRef50_C5RJD9 Tail Collar domain protein n=1 Tax=Clostridium c... 49 2e-04
UniRef50_Q2W7B2 Microcystin-dependent protein n=1 Tax=Magnetospi... 49 2e-04
UniRef50_B8DLJ2 Tail fiber protein, putative n=3 Tax=Desulfovibr... 49 2e-04
UniRef50_Q55EP2 Putative uncharacterized protein n=1 Tax=Dictyos... 48 3e-04
UniRef50_D1ANH0 Putative uncharacterized protein n=1 Tax=Sebalde... 48 5e-04
UniRef50_A1TUY7 Phage Tail Collar domain protein n=4 Tax=Acidovo... 48 5e-04
UniRef50_C7BIF9 Putative uncharacterized protein n=1 Tax=Photorh... 48 5e-04
UniRef50_A8T9J8 Putative uncharacterized protein n=1 Tax=Vibrio ... 47 7e-04
UniRef50_UPI000194E452 PREDICTED: similar to tau-tubulin kinase ... 47 7e-04
UniRef50_Q7N651 Complete genome; segment 6/17 n=4 Tax=Gammaprote... 47 8e-04
UniRef50_A9DEL7 Tail fiber protein 2 n=1 Tax=Yersinia phage PY10... 47 8e-04
UniRef50_Q58MY1 Predicted protein n=1 Tax=Prochlorococcus phage ... 47 0.001
UniRef50_D1Y7E0 Collagen alpha 1 n=1 Tax=Pyramidobacter piscolen... 47 0.001
UniRef50_B6VNN2 Putative uncharacterized protein n=1 Tax=Photorh... 47 0.001
UniRef50_Q56BI6 Gp12 short tail fibers n=1 Tax=Enterobacteria ph... 46 0.001
UniRef50_Q8PR98 Microcystin dependent protein n=1 Tax=Xanthomona... 46 0.002
UniRef50_B5ZGB2 Tail Collar domain protein n=4 Tax=Gluconacetoba... 46 0.002
UniRef50_A6EAB9 Microcystin-dependent protein n=1 Tax=Pedobacter... 46 0.002
UniRef50_Q4KAW4 Putative uncharacterized protein n=1 Tax=Pseudom... 46 0.002
UniRef50_UPI00016C488F hypothetical protein GobsU_00180 n=1 Tax=... 46 0.002
UniRef50_C3YB93 Putative uncharacterized protein n=1 Tax=Branchi... 46 0.002
UniRef50_Q11LT1 Microcystin-dependent protein-like n=1 Tax=Chela... 46 0.002
UniRef50_Q84CW8 Putative transmembrane protein n=1 Tax=unculture...
45 0.003 UniRef50_A1TNG3 Phage Tail Collar domain protein n=9 Tax=Bacteri... 45 0.003 UniRef50_Q7N687 Complete genome; segment 6/17 n=1 Tax=Photorhabd... 45 0.003 UniRef50_Q7N6A5 Complete genome; segment 6/17 n=1 Tax=Photorhabd... 45 0.003 UniRef50_Q03314 Protein rhiB n=2 Tax=Rhizobium leguminosarum bv.... 45 0.003 UniRef50_A6EAB8 Phage Tail Collar n=1 Tax=Pedobacter sp. BAL39 R... 45 0.005 UniRef50_UPI0001BC923E Phage tail Collar n=1 Tax=Pseudomonas syr... 44 0.006 UniRef50_A1SXZ3 Phage Tail Collar domain protein n=4 Tax=Bacteri... 44 0.006 UniRef50_Q8GDJ7 Orf24 n=1 Tax=Photorhabdus luminescens RepID=Q8G... 44 0.007 UniRef50_C1D6M7 Phage-related protein n=1 Tax=Laribacter hongkon... 44 0.007 UniRef50_Q8PR97 Microcystin dependent protein n=1 Tax=Xanthomona... 44 0.007 UniRef50_A6EAC0 Microcystin-dependent protein n=1 Tax=Pedobacter... 44 0.008 UniRef50_A6N211 Probable tail fiber protein n=1 Tax=Microbacteri... 44 0.008 UniRef50_Q2W7B1 Microcystin-dependent protein n=3 Tax=Proteobact... 44 0.009 UniRef50_B1J270 Tail Collar domain protein n=8 Tax=Bacteria RepI... 43 0.011 UniRef50_Q6J802 Pas29 n=1 Tax=Actinoplanes phage phiAsp2 RepID=Q... 43 0.013 UniRef50_C6X0H2 Phage tail collar domain protein n=1 Tax=Flavoba... 43 0.016 UniRef50_UPI000186F374 low-density lipoprotein receptor, putativ... 43 0.017 UniRef50_A9AVC5 Tail Collar domain protein n=1 Tax=Herpetosiphon... 43 0.017 UniRef50_B9M3Z7 Tail Collar domain protein n=1 Tax=Geobacter sp.... 43 0.019 UniRef50_C6X0H3 Microcystin dependent protein n=1 Tax=Flavobacte... 42 0.021 UniRef50_D1SWB1 Tail Collar domain protein n=1 Tax=Acidovorax av... 42 0.022 UniRef50_C5IHQ0 Gp58 n=1 Tax=Burkholderia phage BcepIL02 RepID=C... 42 0.028 UniRef50_B4D821 Tail Collar domain protein n=3 Tax=Bacteria RepI... 42 0.030 UniRef50_UPI0001A44BB4 microcystin dependent protein n=1 Tax=Pec... 42 0.038 UniRef50_B6IWH6 Putative uncharacterized protein n=2 Tax=Bacteri... 
42 0.039 UniRef50_B2JL06 Tail Collar domain protein n=2 Tax=Burkholderia ... 42 0.041 UniRef50_D2QTE9 Tail Collar domain protein n=1 Tax=Spirosoma lin... 42 0.041 UniRef50_A5GA41 Phage Tail Collar domain protein n=1 Tax=Geobact... 41 0.049 UniRef50_B9Z2I2 Tail Collar domain protein n=2 Tax=Chromobacteri... 41 0.055 UniRef50_C5JB62 Putative phage tail protein n=1 Tax=uncultured b... 41 0.063 UniRef50_C7BQB5 Putative uncharacterized protein n=1 Tax=Photorh... 41 0.065 UniRef50_C6C8S0 Tail Collar domain protein n=4 Tax=Dickeya RepID... 40 0.078 UniRef50_C2FWA0 Phage tail collar domain protein n=2 Tax=Sphingo... 40 0.078 >UniRef50_Q3YZL1 Phage protein-related n=42 Tax=Enterobacteriaceae RepID=Q3YZL1_SHISS Length = 1029 Score = 291 bits (744), Expect = 2e-77, Method: Composition-based stats. Identities = 277/321 (86%), Positives = 292/321 (90%), Gaps = 3/321 (0%) Query: 3 ITALTDNTQGAAGLELYEVYNNGYPTAYGNIIHLKGMTAVGEGELLIGWSGTSGAHAPAF 62 + ALTDNTQGAAGLELYEVYNNGYPTAYGNIIHLKGMTAVGEGELLIGWSGTSGAHAPAF Sbjct: 709 VAALTDNTQGAAGLELYEVYNNGYPTAYGNIIHLKGMTAVGEGELLIGWSGTSGAHAPAF 768 Query: 63 IRSRRDTTDANWSPWAQLYTSAHPPAEFYPVGAPIPWPSDTVPSGYALMQGQTFDKSAYP 122 IRSRRDTTDANWSPWAQLYTSAHPPAEFYPVGAPIPWPSDTVPSGYALMQGQTFDKSAYP Sbjct: 769 IRSRRDTTDANWSPWAQLYTSAHPPAEFYPVGAPIPWPSDTVPSGYALMQGQTFDKSAYP 828 Query: 123 KLAVAYPSGVIPDMRGWTIKGKPASGRAVLSQEQDGIKSHTHSASASSTDLGTETTSSFD 182 KLA AYPSGVIPDMRGWTIKGKPASGRAVLSQEQDGIKSHTHSASASSTDLGT+TTSSFD Sbjct: 829 KLAAAYPSGVIPDMRGWTIKGKPASGRAVLSQEQDGIKSHTHSASASSTDLGTKTTSSFD 888 Query: 183 YGTKSTNNTGAHTHSISGTANSAGAHQHKSSGA---FGGTNTSIFPNGYTAISNLSAGIM 239 YGTKSTNNTGAHTHS+SG+ +SAGAHQH +G G T +FP G T +S + + Sbjct: 889 YGTKSTNNTGAHTHSLSGSTSSAGAHQHSQTGPRTNSGSQPTGMFPAGSTQVSGTNQVGI 948 Query: 240 STTSGSGQTRNAGKTSSDGAHTHSLSGTAASAGAHAHTVGIGAHTHSVAIGSHGHTITVN 299 S + SG ++ GK+SS+G HTHSLSGTAASAGAHAHTVGIGAHTHSVAIGSHGHTITVN Sbjct: 949 SGSLTSGTSQWVGKSSSEGNHTHSLSGTAASAGAHAHTVGIGAHTHSVAIGSHGHTITVN 1008 Query: 300 AAGNAENTVKNIAFNYIVRLA 320 AAGNAENTVKNIAFNYIVRLA Sbjct: 1009 
AAGNAENTVKNIAFNYIVRLA 1029 >UniRef50_B5FQX9 Side tail fiber protein n=2 Tax=Salmonella enterica subsp. enterica RepID=B5FQX9_SALDC Length = 569 Score = 245 bits (625), Expect = 1e-63, Method: Composition-based stats. Identities = 182/319 (57%), Positives = 206/319 (64%), Gaps = 50/319 (15%) Query: 3 ITALTDNTQGA-AGLELYEVYNNGYPTAYGNIIHLKGMTAVGEGELLIGWSGTSGAHAPA 61 + ALT T+G+ +GL + EVYNNGYPT YGNI+ L G G+GE+LIGWSGT+GA APA Sbjct: 300 LPALTGATRGSDSGLIMGEVYNNGYPTQYGNILRLTG---TGDGEILIGWSGTNGAPAPA 356 Query: 62 FIRSRRDTTDANWSPWAQLYTSAHPPAEFYPVGAPIPWPSDTVPSGYALMQGQTFDKSAY 121 +IRS RDT DA WS WA LYTS +PP YPVGA I WPSD P+GYALMQGQ+FDKSAY Sbjct: 357 YIRSHRDTADAEWSEWAMLYTSLNPPPNSYPVGAAIAWPSDATPAGYALMQGQSFDKSAY 416 Query: 122 PKLAVAYPSGVIPDMRGWTIKGKPASGRAVLSQEQDGIKSHTHSASASSTDLGTETTSSF 181 P LA+AYPSG+IPDMRGWTIKGKP SGRAVLSQE DG KSH+HSA A TDLGT++TSSF Sbjct: 417 PLLAIAYPSGIIPDMRGWTIKGKPISGRAVLSQEMDGNKSHSHSARAQDTDLGTKSTSSF 476 Query: 182 DYGTKSTNNTGAHTHSISGTANSAGAHQHKSSGAFGGTNTSIFPNGYTAISNLSAGIMST 241 DYGTKSTN TG HTH G NS + Sbjct: 477 DYGTKSTNTTGNHTHQFGGYINSYW--------------------------------GDS 504 Query: 242 TSGSGQTRNAGKTSSDGAHTHSLSGTAASAGAHAHTVGIGAHTHSVAIGSHGHTITVNAA 301 S Q T + G HAHTV IG H H++ IG HGH + V+A Sbjct: 505 NHTSFQPGGGAWTQAAG--------------DHAHTVYIGGHEHTMYIGPHGHVVIVDAD 550 Query: 302 GNAENTVKNIAFNYIVRLA 320 GNAE TVKNIAFNYIVRLA Sbjct: 551 GNAETTVKNIAFNYIVRLA 569 >UniRef50_P76072 Side tail fiber protein homolog from lambdoid prophage Rac n=23 Tax=root RepID=STFR_ECOLI Length = 1120 Score = 239 bits (609), Expect = 9e-62, Method: Composition-based stats. 
Identities = 200/263 (76%), Positives = 209/263 (79%), Gaps = 14/263 (5%) Query: 58 HAPAFIRSRRDTTDANWSPWAQLYTSAHPPAEFYPVGAPIPWPSDTVPSGYALMQGQTFD 117 + F RS RD WA++YTS + P E YPVGAPIPWPSDTVPSGYALMQGQ FD Sbjct: 872 NGGLFYRSSRDGYGFE-EDWAEVYTSKNLPPESYPVGAPIPWPSDTVPSGYALMQGQAFD 930 Query: 118 KSAYPKLAVAYPSGVIPDMRGWTIKGKPASGRAVLSQEQDGIKSHTHSASASSTDLGTET 177 KSAYPKLA AYPSGVIPDMRGWTIKGKPASGRAVLSQEQDGIKSHTHSASASSTDLGT+T Sbjct: 931 KSAYPKLAAAYPSGVIPDMRGWTIKGKPASGRAVLSQEQDGIKSHTHSASASSTDLGTKT 990 Query: 178 TSSFDYGTKSTNNTGAHTHSISGTANSAGAHQHKSSGAFGGTNTSIFPNGYTAISNLSAG 237 TSSFDYGTKSTNNTGAHTHS+SG+ NSAGAH H N TA +N AG Sbjct: 991 TSSFDYGTKSTNNTGAHTHSVSGSTNSAGAHTHS------------LANVNTASANSGAG 1038 Query: 238 IMSTTSGSGQTRNAGKTSSDGAHTHSLSGTAASAGAHAHTVGIGAHTHSVAIGSHGHTIT 297 ST +N TSS GAHTHSLSGTAASAGAHAHTVGIGAHTHSVAIGSHGHTIT Sbjct: 1039 SASTRLSVVHNQNYA-TSSAGAHTHSLSGTAASAGAHAHTVGIGAHTHSVAIGSHGHTIT 1097 Query: 298 VNAAGNAENTVKNIAFNYIVRLA 320 VNAAGNAENTVKNIAFNYIVRLA Sbjct: 1098 VNAAGNAENTVKNIAFNYIVRLA 1120 >UniRef50_B7L485 Putative tail fiber protein n=4 Tax=Escherichia coli RepID=B7L485_ECO55 Length = 1056 Score = 224 bits (570), Expect = 3e-57, Method: Composition-based stats. 
Identities = 207/328 (63%), Positives = 225/328 (68%), Gaps = 11/328 (3%) Query: 2 NITALTDNTQGAAGLELYEVYNNGYPTAYGNIIHLKGMTAVGEGELLIGWSGTSGAHA-- 59 NI A + GA V N AY N+ + +G T A Sbjct: 731 NIGAFARRSTGAYADSDGAVPWNAESGAY-NVTRSGDSYILVNFYTGVGSCRTLQMKAHY 789 Query: 60 ---PAFIRSRRDTTDANWSPWAQLYTSAHPPAEFYPVGAPIPWPSDTVPSGYALMQGQTF 116 F RS RD WA++YTS + P E YPVGAPIPWPSDTVPSGYALMQGQTF Sbjct: 790 RNRGLFYRSSRDGYGFE-EDWAEVYTSKNLPPESYPVGAPIPWPSDTVPSGYALMQGQTF 848 Query: 117 DKSAYPKLAVAYPSGVIPDMRGWTIKGKPASGRAVLSQEQDGIKSHTHSASASSTDLGTE 176 +KSAYPKLA AYPSGVIPDMRGWTIKGKPASGRAVLSQEQDGIKSHTHSASASSTDLGT+ Sbjct: 849 NKSAYPKLAAAYPSGVIPDMRGWTIKGKPASGRAVLSQEQDGIKSHTHSASASSTDLGTK 908 Query: 177 TTSSFDYGTKSTNNTGAHTHSISGTANSAGAHQHKSSGAFGGTNTSIF----PNGYTAIS 232 TTSSFDYGTKSTNNTGAHTHS+SG+ SAG H H + + G S G+T + Sbjct: 909 TTSSFDYGTKSTNNTGAHTHSLSGSTGSAGVHTHGNGIRWPGGGGSALAFYDGGGFTYVQ 968 Query: 233 NLSAGIMSTTSGSGQTRNAGKTSSDGAHTHSLSGTAASAGAHAHTVGIGAHTHSVAIGSH 292 N + TS +T S GAHTHSLSGTAAS+GAHAHTVGIGAHTHSVAIGSH Sbjct: 969 NSQYQVSPGTSSYRSYYQRIQTQSAGAHTHSLSGTAASSGAHAHTVGIGAHTHSVAIGSH 1028 Query: 293 GHTITVNAAGNAENTVKNIAFNYIVRLA 320 GHTITVNAAGNAENTVKNIAFNYIVRLA Sbjct: 1029 GHTITVNAAGNAENTVKNIAFNYIVRLA 1056 >UniRef50_B5PP06 Side tail fiber protein n=2 Tax=Salmonella enterica subsp. enterica RepID=B5PP06_SALHA Length = 534 Score = 214 bits (543), Expect = 4e-54, Method: Composition-based stats. 
Identities = 168/305 (55%), Positives = 195/305 (63%), Gaps = 50/305 (16%) Query: 3 ITALTDNTQGA-AGLELYEVYNNGYPTAYGNIIHLKGMTAVGEGELLIGWSGTSGAHAPA 61 + ALT T+G+ +GL + EVYNNGYPT YGNI+ L G G+GE+LIGWSGT+GA APA Sbjct: 156 LPALTGTTRGSDSGLIMGEVYNNGYPTQYGNILRLTG---TGDGEILIGWSGTNGAPAPA 212 Query: 62 FIRSRRDTTDANWSPWAQLYTSAHPPAEFYPVGAPIPWPSDTVPSGYALMQGQTFDKSAY 121 +IRS RDT DA WS WA LYT+ +PP + +PVGAPI WPSD P+GYALMQGQ+FDKSAY Sbjct: 213 YIRSHRDTADAEWSEWAMLYTTLNPPPDSHPVGAPIAWPSDATPAGYALMQGQSFDKSAY 272 Query: 122 PKLAVAYPSGVIPDMRGWTIKGKPASGRAVLSQEQDGIKSHTHSASASSTDLGTETTSSF 181 P LA+AYPSGVIPDMRGWTIKGKPASGRA+LSQE DG KSH+HSA A TDLGT+TTSSF Sbjct: 273 PLLAIAYPSGVIPDMRGWTIKGKPASGRAILSQEMDGNKSHSHSARAQDTDLGTKTTSSF 332 Query: 182 DYGTKSTNNTGAHTHSISGTANSAGAHQHKSSGAFGGTNTSIFPNGYTAISNLSAGIMST 241 DYGTKSTN TG HT+ G NS + Sbjct: 333 DYGTKSTNTTGNHTNQFGGYINSYW--------------------------------GDS 360 Query: 242 TSGSGQTRNAGKTSSDGAHTHSLSGTAASAGAHAHTVGIGAHTHSVAIGSHGHTITVNAA 301 S Q T + G HAHTV IG H H++ IG HGH + V+A Sbjct: 361 NHTSFQPGGGAWTQAAG--------------DHAHTVYIGGHEHTMYIGPHGHVVIVDAD 406 Query: 302 GNAEN 306 GNAE Sbjct: 407 GNAET 411 >UniRef50_B3X2T1 Side tail fiber protein n=1 Tax=Shigella dysenteriae 1012 RepID=B3X2T1_SHIDY Length = 488 Score = 211 bits (537), Expect = 2e-53, Method: Composition-based stats. 
Identities = 240/289 (83%), Positives = 246/289 (85%), Gaps = 3/289 (1%) Query: 32 NIIHLKGMTAVGEGELLIGWSGTSGAHAPAFIRSRRDTTDANWSPWAQLYTSAHPPAEFY 91 +I G A G + I W TS A +Y+ P +FY Sbjct: 203 DIYVAIGNYATG---VNIQWDYTSNASVTIHTSPAYSANKPEGLTDGTVYSLYTPSEQFY 259 Query: 92 PVGAPIPWPSDTVPSGYALMQGQTFDKSAYPKLAVAYPSGVIPDMRGWTIKGKPASGRAV 151 P GAPIPWPSDTVPSGYALMQGQTFDKSAYPKLAVAYPSGVIPDMRGWTIKGKPASGRAV Sbjct: 260 PPGAPIPWPSDTVPSGYALMQGQTFDKSAYPKLAVAYPSGVIPDMRGWTIKGKPASGRAV 319 Query: 152 LSQEQDGIKSHTHSASASSTDLGTETTSSFDYGTKSTNNTGAHTHSISGTANSAGAHQHK 211 LSQEQDGIKSHTHSASASSTDLGT+TTSSFDYGTKSTNNTGAHTHSISGTANSAGAHQHK Sbjct: 320 LSQEQDGIKSHTHSASASSTDLGTKTTSSFDYGTKSTNNTGAHTHSISGTANSAGAHQHK 379 Query: 212 SSGAFGGTNTSIFPNGYTAISNLSAGIMSTTSGSGQTRNAGKTSSDGAHTHSLSGTAASA 271 SSGAFGGTNTSIFPNGYTAISNLSAGIMSTTSGSGQTRNAGKTSSDGAHTHSLSGTAASA Sbjct: 380 SSGAFGGTNTSIFPNGYTAISNLSAGIMSTTSGSGQTRNAGKTSSDGAHTHSLSGTAASA 439 Query: 272 GAHAHTVGIGAHTHSVAIGSHGHTITVNAAGNAENTVKNIAFNYIVRLA 320 GAHAHTVGIGAHTHSVAIGSHGHTITVNAAGNAENTVKNIAFNYIVRLA Sbjct: 440 GAHAHTVGIGAHTHSVAIGSHGHTITVNAAGNAENTVKNIAFNYIVRLA 488 >UniRef50_B3XIH6 GpH n=7 Tax=Enterobacteriaceae RepID=B3XIH6_ECOLX Length = 710 Score = 209 bits (531), Expect = 1e-52, Method: Composition-based stats. 
Identities = 179/305 (58%), Positives = 202/305 (66%), Gaps = 30/305 (9%) Query: 16 LELYEVYNNGYPTAYGNIIHLKGMTAVGEGELLIGWSGTSGAHAPAFIRSRRDTTDANWS 75 L Y YP + + + + + + ++ RS+ T D Sbjct: 436 LNAYTSAALKYPENLAGTLVVLKNAGITQIYYVY-------NTSRSYTRSQYSTGDW--- 485 Query: 76 PWAQLYTSAHPPAEFYPVGAPIPWPSDTVPSGYALMQGQTFDKSAYPKLAVAYPSGVIPD 135 +A P + +PVGA IPWPSD+VP+GYA+MQGQTFDK+ YP LA AYPSGV+PD Sbjct: 486 -------TAWTPQDSFPVGAAIPWPSDSVPTGYAVMQGQTFDKTTYPLLAAAYPSGVLPD 538 Query: 136 MRGWTIKGKPASGRAVLSQEQDGIKSHTHSASASSTDLGTETTSSFDYGTKSTNNTGAHT 195 MRGWTIKGKPASGR VLS EQDGIKSHTHSASAS+TDLGT+TTSSFDYGTKSTNNTGAHT Sbjct: 539 MRGWTIKGKPASGRDVLSLEQDGIKSHTHSASASNTDLGTKTTSSFDYGTKSTNNTGAHT 598 Query: 196 HSISGTANSAGAHQHKSSGAFGGTNTSIFPNGYTAISNLSAGIMSTTSGSGQTRNAGKTS 255 H++SGTANSAGAH H S + SG G Sbjct: 599 HNVSGTANSAGAHTHTVPLR-------------RPNSGGMNFDWLDGASSGTVVGNGTVP 645 Query: 256 SDGAHTHSLSGTAASAGAHAHTVGIGAHTHSVAIGSHGHTITVNAAGNAENTVKNIAFNY 315 S GAHTHS+SGTA SAGAHAHTVGIGAHTHSVAIGSHGHTITVNAAGNAENTVKNIAFNY Sbjct: 646 SSGAHTHSVSGTATSAGAHAHTVGIGAHTHSVAIGSHGHTITVNAAGNAENTVKNIAFNY 705 Query: 316 IVRLA 320 IVRLA Sbjct: 706 IVRLA 710 >UniRef50_A4TT73 Phage tail protein n=30 Tax=Enterobacteriaceae RepID=A4TT73_YERPP Length = 962 Score = 203 bits (515), Expect = 7e-51, Method: Composition-based stats. 
Identities = 151/278 (54%), Positives = 182/278 (65%), Gaps = 18/278 (6%) Query: 56 GAHAPAFIRSRRDTTDANWSPWAQLYTSAHPPAEFYPVGAPIPWPSDTVPSGYALMQGQT 115 G + T AN+ LY+S PP E YPVGAPIPWP+D PSG+A+MQGQT Sbjct: 690 GTPEYVATKPASSTNGANY----ILYSSVLPPPESYPVGAPIPWPNDVAPSGFAIMQGQT 745 Query: 116 FDKSAYPKLAVAYPSGVIPDMRGWTIKGKPASGRAVLSQEQDGIKSHTHSASASSTDLGT 175 FDKS YPKLA AYPSGV+PDMRGW IKGKP S RAVLS EQDGIKSH H+A+ASSTDLGT Sbjct: 746 FDKSVYPKLAAAYPSGVLPDMRGWMIKGKPTS-RAVLSLEQDGIKSHAHNAAASSTDLGT 804 Query: 176 ETTSSFDYGTKSTNNTGAHTHSISGTANSAGAHQHKSSGAFGGTNT----SIFPNGYTAI 231 + T++FDYGTK+++ T S + T A + +S + +T + +P + Sbjct: 805 KPTTTFDYGTKTSSGFDYGTKSSNSTGAHAHSLSGSTSSSGAHAHTVTAHTQYPRSTDSR 864 Query: 232 SNLSAGIMSTTSGSGQTRNAGKTSSDGAHTHSLSGTAASAGAHAHTVGIGA--------- 282 + + G T + TSS G H HS+SGTA SAGAHAHTVGIGA Sbjct: 865 NQNAVGKQYNTQQTTANAFNVWTSSAGDHAHSISGTAVSAGAHAHTVGIGAHAHSLSIGS 924 Query: 283 HTHSVAIGSHGHTITVNAAGNAENTVKNIAFNYIVRLA 320 H+HSVAIG+H HTIT+ A GNAENTVKNIA+NYIVRLA Sbjct: 925 HSHSVAIGAHSHTITIAACGNAENTVKNIAYNYIVRLA 962 >UniRef50_P03764 Side tail fiber protein n=2 Tax=root RepID=STF_LAMBD Length = 774 Score = 194 bits (491), Expect = 4e-48, Method: Composition-based stats. 
Identities = 167/252 (66%), Positives = 182/252 (72%), Gaps = 23/252 (9%) Query: 88 AEFYPVGAPIPWPSDTVPSGYALMQGQTFDKSAYPKLAVAYPSGVIPDMRGWTIKGKPAS 147 +P GAPIPWPSD VPSGY LMQGQ FDKSAYPKLAVAYPSGV+PDMRGWTIKGKPAS Sbjct: 527 NSAFPAGAPIPWPSDIVPSGYVLMQGQAFDKSAYPKLAVAYPSGVLPDMRGWTIKGKPAS 586 Query: 148 GRAVLSQEQDGIKSHTHSASASSTDLGTETTSSFDYGTKS----------TNNTGAHTHS 197 GRAVLSQEQDGIKSHTHSASAS TDLGT+TTSSFDYGTK+ TNNTGAH HS Sbjct: 587 GRAVLSQEQDGIKSHTHSASASGTDLGTKTTSSFDYGTKTTGSFDYGTKSTNNTGAHAHS 646 Query: 198 ISGTANSAGAHQHKSSGAFGGTNTSIFPNGYTAISNLSAGIMSTTSGSGQTRNAGKTSSD 257 +SG+ +AGAH H S + S Y + + + + KT S Sbjct: 647 LSGSTGAAGAHAHTSGLRMNSSGWSQ----YGTATITGSLSTVKGTSTQGIAYLSKTDSQ 702 Query: 258 GAHTHSLSGTAASAGAHAHTVG---------IGAHTHSVAIGSHGHTITVNAAGNAENTV 308 G+H+HSLSGTA SAGAHAHTVG IGAH HS +IGSHGHTITVNAAGNAENTV Sbjct: 703 GSHSHSLSGTAVSAGAHAHTVGIGAHQHPVVIGAHAHSFSIGSHGHTITVNAAGNAENTV 762 Query: 309 KNIAFNYIVRLA 320 KNIAFNYIVRLA Sbjct: 763 KNIAFNYIVRLA 774 >UniRef50_B3I2W7 Phage Tail Collar Domain family n=1 Tax=Escherichia coli E22 RepID=B3I2W7_ECOLX Length = 654 Score = 180 bits (455), Expect = 6e-44, Method: Composition-based stats. 
Identities = 136/309 (44%), Positives = 167/309 (54%), Gaps = 25/309 (8%) Query: 12 GAAGLELYEVYNNGYPTAYGNIIHLKGMTAVGEGELLIGWSGTSGAHAPAFIRSRRDTTD 71 L+ Y+V + A N G L++ GA R + Sbjct: 371 ANKNLDDYQVPGLYFQEANNNTSAAMNYPENSAGSLMVLR----GAGVTQVYRVYNSSRS 426 Query: 72 ANWSPWAQLYTSAHPPAEFYPVGAPIPWPSDTVPSGYALMQGQTFDKSAYPKLAVAYPSG 131 + S ++ L + P + YPVGAPIPWPSD P+GYALMQGQ FDK+ YP LA+AYP+G Sbjct: 427 YSRSKYSTLAWTPWMPEDSYPVGAPIPWPSDVTPTGYALMQGQPFDKAVYPLLAIAYPAG 486 Query: 132 VIPDMRGWTIKGKPASGRAVLSQEQDGIKSHTHSASASSTDLGTETTSSFDYGTKSTNNT 191 +IPDMRG TIKGKP +GRAVLS EQDG+ SHTH AS S TDLGT+ TSSFDYG+K T + Sbjct: 487 IIPDMRGQTIKGKP-NGRAVLSYEQDGVISHTHGASISDTDLGTKYTSSFDYGSKPTTSF 545 Query: 192 GAHTHSISGTANSAGAHQHKSSGAFGGTNTSIFPNGYTAISNLSAGIMSTTSGSGQTRNA 251 S + A ++ ++ A+ G+ +S Sbjct: 546 DYGNKSSTEGGWHAHNFRYCATSAYR--------------DTPGQGLGMHSSNVSWAAGD 591 Query: 252 GKTSSDGAHTHSLSGTAASAGAHAHTVGIGAHTHSVAIGSHGHTITVNAAGNAENTVKNI 311 S G H H G H H VGIGAH H V +G HGHT TV+AAGNAENTVKNI Sbjct: 592 RIEGS-GNHAH-----VTWIGPHDHWVGIGAHNHYVVMGYHGHTATVHAAGNAENTVKNI 645 Query: 312 AFNYIVRLA 320 AFNYIVRLA Sbjct: 646 AFNYIVRLA 654 >UniRef50_B4T041 Gp19 n=1 Tax=Salmonella enterica subsp. enterica serovar Newport str. SL254 RepID=B4T041_SALNS Length = 580 Score = 177 bits (449), Expect = 4e-43, Method: Composition-based stats. 
Identities = 128/264 (48%), Positives = 148/264 (56%), Gaps = 54/264 (20%) Query: 57 AHAPAFIRSRRDTTDANWSPWAQLYTSAHPPAEFYPVGAPIPWPSDTVPSGYALMQGQTF 116 + +I +R D T+ + W P P G P+PWPSDT+P+GYALMQGQ F Sbjct: 371 SIGNTWIGARWDATNGSGFTWR--------PMMSCPPGVPLPWPSDTIPAGYALMQGQAF 422 Query: 117 DKSAYPKLAVAYPSGVIPDMRGWTIKGKPASGRAVLSQEQDGIKSHTHSASASSTDLGTE 176 DK+ YP LA+AYPSG IPDMRGWTIKGKP SGRAVLSQE DG KSH+H A A TDLGT+ Sbjct: 423 DKNVYPLLAIAYPSGTIPDMRGWTIKGKPVSGRAVLSQELDGNKSHSHGARALDTDLGTK 482 Query: 177 TTSSFDYGTKSTNNTGAHTHSISGTANSAGAHQHKSSGAFGGTNTSIFPNGYTAISNLSA 236 TSSFDYGTKS+N TG H HS GT ++ Sbjct: 483 GTSSFDYGTKSSNTTGGHNHSAGGTYGG---------------------------DSIGG 515 Query: 237 GIMSTTSGSGQTRNAGKTSSDGAHTHSLSGTAASAGAHAHTVGIGAHTHSVAIGSHGHTI 296 G+ Q TS +G H AHT IG H H+V IG HGH + Sbjct: 516 KARVQRDGNDQL-----TSWNGDH--------------AHTTWIGPHDHTVYIGPHGHVV 556 Query: 297 TVNAAGNAENTVKNIAFNYIVRLA 320 V+A GNAE TVKNIAFNYIVRLA Sbjct: 557 IVDADGNAETTVKNIAFNYIVRLA 580 >UniRef50_B7LN99 Putative tail fiber protein n=2 Tax=Enterobacteriaceae RepID=B7LN99_ESCF3 Length = 593 Score = 175 bits (444), Expect = 1e-42, Method: Composition-based stats. 
Identities = 159/295 (53%), Positives = 178/295 (60%), Gaps = 36/295 (12%) Query: 26 YPTAYGNIIHLKGMTAVGEGELLIGWSGTSGAHAPAFIRSRRDTTDANWSPWAQLYTSAH 85 Y G ++ G G + + RS RD Sbjct: 335 YNVMDGGASYIVAHFFSGVGSCRSFQLRADYKNRGLYYRSSRDGYGFE---------RGF 385 Query: 86 PPAEFYPVGAPIPWPSDTVPSGYALMQGQTFDKSAYPKLAVAYPSGVIPDMRGWTIKGKP 145 P +PVGAPI WPSD VP GYA+MQGQTFDK+AYP LA AYPSGVIPDMRGWTIKGKP Sbjct: 386 EPVNAFPVGAPIAWPSDIVPEGYAIMQGQTFDKAAYPLLAAAYPSGVIPDMRGWTIKGKP 445 Query: 146 ASGRAVLSQEQDGIKSHTHSASASSTDLGTETTSSFDYGTKSTNNTGAHTHSISGTANSA 205 ASGRAVLSQEQDGIKSHTHSASASSTDLGT+TTSSFDYGTK+ + T T N+ Sbjct: 446 ASGRAVLSQEQDGIKSHTHSASASSTDLGTKTTSSFDYGTKTVSTFNHGTK----TTNNT 501 Query: 206 GAHQHKSSGAFGGTNTSIFPNGYTAISNLSAGIMSTTSGSGQTRNAGKTSSDGAHTHSLS 265 GAH H G +GG ++ SG+ Q +SSDGAH H Sbjct: 502 GAHTHTVGGRYGG-------------DSIGGKQRVQVSGTNQV-----SSSDGAHAH--- 540 Query: 266 GTAASAGAHAHTVGIGAHTHSVAIGSHGHTITVNAAGNAENTVKNIAFNYIVRLA 320 G H HTVGIGAH H+VA+G+HGHTITVNAAGNAENTVKNIAFNYIVRLA Sbjct: 541 --TVDIGQHNHTVGIGAHAHTVALGAHGHTITVNAAGNAENTVKNIAFNYIVRLA 593 >UniRef50_B7LKX7 Putative side tail phage protein n=2 Tax=Escherichia RepID=B7LKX7_ESCF3 Length = 567 Score = 171 bits (433), Expect = 2e-41, Method: Composition-based stats. 
Identities = 123/257 (47%), Positives = 160/257 (62%), Gaps = 24/257 (9%) Query: 64 RSRRDTTDANWSPWAQLYTSAHPPAEFYPVGAPIPWPSDTVPSGYALMQGQTFDKSAYPK 123 + TT+A + AE PVG PIPWPSD+VPSGYALM GQTF+K++YPK Sbjct: 335 TNSTSTTEAATPNAVKAAMDKAIAAESCPVGMPIPWPSDSVPSGYALMTGQTFNKTSYPK 394 Query: 124 LAVAYPSGVIPDMRGWTIKGKPASGRAVLSQEQDGIKSHTHSASASSTDLGTETTSSFDY 183 LA+AYPSGVIPDMRGW IKGKP+SGRA+LS E DG+KSH H+ S SST+LGT T++S D Sbjct: 395 LAIAYPSGVIPDMRGWIIKGKPSSGRAILSTELDGVKSHNHTGSISSTNLGTITSTSTDL 454 Query: 184 GTKSTNNTGAHTHSISGTANSAGAHQHKSSGAFGGTNTSIFPNGYTAISNLSAGIMSTTS 243 GTK+T + + + +++G H H+ T + G S Sbjct: 455 GTKTTASFNHGSRN----TSTSGEHTHRIP---------------TDGAEGKDGPSLWNS 495 Query: 244 GSGQTRNAGKTSSDGAHTHSLSGTAASAGAHAHTVGIGAHTHSVAIGSHGHTITVNAAGN 303 + T S G+H HS++ GAHAHT+ +G+HTH++ +G+H H+I +N GN Sbjct: 496 PNSDENYREPTESAGSHYHSIT-----IGAHAHTIALGSHTHNIVLGTHNHSIIINNTGN 550 Query: 304 AENTVKNIAFNYIVRLA 320 ENTVKNIAFNYIVRLA Sbjct: 551 TENTVKNIAFNYIVRLA 567 >UniRef50_Q3ZL14 Tail fiber protein n=2 Tax=Enterobacteriaceae RepID=Q3ZL14_ESCBL Length = 289 Score = 171 bits (433), Expect = 2e-41, Method: Composition-based stats. 
Identities = 110/230 (47%), Positives = 132/230 (57%), Gaps = 23/230 (10%) Query: 91 YPVGAPIPWPSDTVPSGYALMQGQTFDKSAYPKLAVAYPSGVIPDMRGWTIKGKPASGRA 150 P G + WP T P+G+ALM GQTFD +AYP+LA AYPSGVIPDMRG TIK PASGR Sbjct: 83 LPPGIALAWPGATAPTGFALMLGQTFDTTAYPRLAQAYPSGVIPDMRGQTIKFLPASGRT 142 Query: 151 VLSQEQDGIKSHTHSASASSTDLGTETTSSFDYGTKSTNNTGAHTHSISGTANSAGAHQH 210 +LS E DG+KSH+HS S S+TDLGT T + D GTK T+ G H H N A Sbjct: 143 LLSLEADGVKSHSHSGSISTTDLGTATAADTDLGTKQTSQDGLHNHVSDSRFNKLMARSS 202 Query: 211 KSSGAFGGTNTSIFPNGYTAISNLSAGIMSTTSGSGQTRNAGKTSSDGAHTHSLSGTAAS 270 G + G + + + + R +G S A + A Sbjct: 203 DIDG------------------TNNTGDVDSDNPESEHRVSGMNDSLWA-----ASVIAD 239 Query: 271 AGAHAHTVGIGAHTHSVAIGSHGHTITVNAAGNAENTVKNIAFNYIVRLA 320 +G H HTV IG H HSV IG HGHT+T++ GN ENTVKNIAFN IVRLA Sbjct: 240 SGLHMHTVYIGPHAHSVYIGPHGHTVTISNFGNTENTVKNIAFNAIVRLA 289 >UniRef50_Q99362 Protein 37 (Fragment) n=1 Tax=Enterobacteria phage T4 RepID=Q99362_BPT4 Length = 382 Score = 160 bits (404), Expect = 5e-38, Method: Composition-based stats. 
Identities = 165/254 (64%), Positives = 186/254 (73%), Gaps = 23/254 (9%) Query: 89 EFYPVGAPIPWPSDTVPSGYALMQGQTFDKSAYPKLAVAYPSGVIPDMRGWTIKGKPASG 148 YP+GAPIPWP+DT P+GYALM+GQTFD AYPKLA AYPSG IPDMRG TIKGKP SG Sbjct: 130 SSYPIGAPIPWPTDTPPNGYALMEGQTFDTRAYPKLAAAYPSGTIPDMRGQTIKGKP-SG 188 Query: 149 RAVLSQEQDGIKSHTHSASASSTDLGTETTSSFDYGTKST----------NNTGAHTHSI 198 RAVLS E DG+KSHTH ASAS+TDLGT+TTSSFDYGTK+T N TG H H++ Sbjct: 189 RAVLSTEADGVKSHTHGASASNTDLGTKTTSSFDYGTKTTSSFDYGTKTSNTTGNHNHTV 248 Query: 199 SGTANSAGAHQHKSSGA--FGGTNTSIFPNGYT-AISNLSAGIMSTTSGSGQTRNAGKTS 255 SGT +SAGAHQH SG G +T+IFP+GY+ +N ++ T GS GKTS Sbjct: 249 SGTTSSAGAHQHARSGPQLSNGISTNIFPDGYSDVGTNYNSKFSGTVIGSSVPCIIGKTS 308 Query: 256 SDGAHTHSLSG---------TAASAGAHAHTVGIGAHTHSVAIGSHGHTITVNAAGNAEN 306 +DGAHTH+ SG GAH HTVGIGAHTH+VAIGSHGHTITVNA GN EN Sbjct: 309 NDGAHTHTWSGTTSTTGNHAHTVGIGAHTHTVGIGAHTHTVAIGSHGHTITVNATGNTEN 368 Query: 307 TVKNIAFNYIVRLA 320 TVKNIAFNYIVRLA Sbjct: 369 TVKNIAFNYIVRLA 382 >UniRef50_A9DEM1 Tail protein 2 n=1 Tax=Yersinia phage PY100 RepID=A9DEM1_9CAUD Length = 255 Score = 154 bits (389), Expect = 4e-36, Method: Composition-based stats. 
Identities = 103/229 (44%), Positives = 123/229 (53%), Gaps = 45/229 (19%) Query: 92 PVGAPIPWPSDTVPSGYALMQGQTFDKSAYPKLAVAYPSGVIPDMRGWTIKGKPASGRAV 151 PVGAP+ WPSDT P G+ALM GQTFDK YP LA YPSGV+PDMRG IK KP GRAV Sbjct: 72 PVGAPLAWPSDTAPDGWALMIGQTFDKVKYPLLAKVYPSGVLPDMRGRVIKAKPD-GRAV 130 Query: 152 LSQEQDGIKSHTHSASASSTDLGTETTSSFDYGTKSTNNTGAHTHSISGTANSAGAHQHK 211 LS E+D +KSHTH+ A++ GT TS+FD+G K T G HTH A G+ Q+ Sbjct: 131 LSLEEDQVKSHTHTGKAATAG-GTRATSTFDHGNKRTTTNGNHTHGSPQGARHGGSGQYT 189 Query: 212 SSGAFGGTNTSIFPNGYTAISNLSAGIMSTTSGSGQTRNAGKTSSDGAHTHSLSGTAASA 271 S T+S ++A Sbjct: 190 SGDDE-------------------------------------TNSVFNWP-----ATSAA 207 Query: 272 GAHAHTVGIGAHTHSVAIGSHGHTITVNAAGNAENTVKNIAFNYIVRLA 320 G H H V IG H H+V I +H HT+ ++A G ENTVKNIA NYIVRLA Sbjct: 208 GDHFHDVQIGPHNHNVDI-NHEHTLQIDATGGTENTVKNIAMNYIVRLA 255 >UniRef50_B6S308 ORF32 (Fragment) n=3 Tax=Enterobacteriaceae RepID=B6S308_SALDU Length = 427 Score = 144 bits (362), Expect = 5e-33, Method: Composition-based stats. Identities = 88/131 (67%), Positives = 103/131 (78%), Gaps = 4/131 (3%) Query: 3 ITALTDNTQGA-AGLELYEVYNNGYPTAYGNIIHLKGMTAVGEGELLIGWSGTSGAHAPA 61 + ALT T+G+ +GL + EVYNNGYPT YGNI+ L G G+GE+LIGWSGT+GA APA Sbjct: 300 LPALTGATRGSDSGLIMGEVYNNGYPTQYGNILRLTG---TGDGEILIGWSGTNGAPAPA 356 Query: 62 FIRSRRDTTDANWSPWAQLYTSAHPPAEFYPVGAPIPWPSDTVPSGYALMQGQTFDKSAY 121 +IRS RDT DA WS WA LYTS +PP YPVGA I WPSD P+GYALMQGQ+FDKSAY Sbjct: 357 YIRSHRDTADAEWSEWAMLYTSLNPPPNSYPVGAAIAWPSDATPAGYALMQGQSFDKSAY 416 Query: 122 PKLAVAYPSGV 132 P LA+AYPSG+ Sbjct: 417 PLLAIAYPSGI 427 >UniRef50_C4UEH4 Variable tail fiber protein n=3 Tax=Yersinia RepID=C4UEH4_YERAL Length = 387 Score = 142 bits (357), Expect = 2e-32, Method: Composition-based stats. 
Identities = 56/233 (24%), Positives = 88/233 (37%), Gaps = 14/233 (6%) Query: 3 ITALTDNTQGAAGLELYEVYNNGYPTAYGN--IIHLKGMTAVGEGELLIGWSGTSGAHAP 60 I + NT A G+ Y P +G+ I H++ + + T+ H Sbjct: 154 IASAGINTLAATGMYSVNQYAANIPEGFGDATIQHIQNDSLTAHQFIF----STNNTHTA 209 Query: 61 AFIRSRRDTTDANWSPWAQLYTSAHPPAEFYPVGAPIPWPSDTVPSGYALMQGQTFDKSA 120 A I + R + W W + TS P+G P+P+P T P+GY G F Sbjct: 210 AKI-AYRLRSYGQWREWIDIVTSRSDTLT--PIGIPLPYPGTTPPAGYLKCNGAAFYPYR 266 Query: 121 YPKLAVAYPSGVIPDMRGWTIKGKP-----ASGRAVLSQEQDGIKSHTHSASASSTDLGT 175 YP LA YP+ +PD+RG I+G + R +LS + D +++ T + S LG Sbjct: 267 YPTLATLYPTHKLPDLRGEFIRGFDDGRGIDTSRTLLSAQTDALQNITGGINGVSESLGI 326 Query: 176 ETTSSFDYGTKSTNNTGAHTHSISGTANSAGAHQHKSSGAFGGTNTSIFPNGY 228 S+F + G G+ +S + N Sbjct: 327 AAESNFTGAFAKAESVGNDNTPHHTDITHCGSFDFDASRVVRTAAETRPRNIS 379 >UniRef50_A7FIU0 Tail collar domain protein n=1 Tax=Yersinia pseudotuberculosis IP 31758 RepID=A7FIU0_YERP3 Length = 402 Score = 129 bits (324), Expect = 1e-28, Method: Composition-based stats. Identities = 46/210 (21%), Positives = 82/210 (39%), Gaps = 21/210 (10%) Query: 1 MNITALTDNTQG--AAGLELYEVYNNGYPTAYGNIIHLKGMTAVGEGELLIGWSGTSGAH 58 +N+ +L + G L+++ + YP + ++ + E ++G Sbjct: 176 INLNSLGQDALGIYVQALDVFATLDRNYPITIAGSLVVRPSAYGAQQEYTPFYTGRK--- 232 Query: 59 APAFIRSRRD--TTDANWSPWAQLYTSAHPPAEFYPVGAPIPWPSDTVPSGYALMQGQTF 116 ++R+ + WS W Q+ PVG P+PWP+ PSG+ G TF Sbjct: 233 ---YVRNLMGVWNGNGPWSDWIQIGND------VAPVGIPMPWPAHIPPSGWLKCNGATF 283 Query: 117 DKSAYPKLAVAYPSGVIPDMRGWTIKGKP-----ASGRAVLSQEQDGIKSHTHSASASST 171 +K+ +P+LA Y GV+PD+RG I+G GR +LS ++ + Sbjct: 284 NKAQFPQLASVYTRGVLPDLRGEFIRGWDDGKLADPGRGLLSFQEGTVVGGYDDNDTGDI 343 Query: 172 DLGTETTSSFDYGTKSTNNTGAHTHSISGT 201 +S F +T + Sbjct: 344 SSIGLYSSGFGDQLTNTQWVSINGKRWITA 373 >UniRef50_Q9AHZ5 YdaB n=29 Tax=Enterobacteriaceae RepID=Q9AHZ5_PHOLU Length = 296 Score = 126 bits (315), Expect = 1e-27, Method: Composition-based stats. 
Identities = 46/140 (32%), Positives = 62/140 (44%), Gaps = 11/140 (7%) Query: 91 YPVGAPIPWPSDTVPSGYALMQGQTFDKSAYPKLAVAYPSGVIPDMRGWTIKGKP----- 145 PVG PIPWP+ P G+ G FDKS +P+LA AYPSG +PD+RG I+G Sbjct: 144 IPVGTPIPWPTAIPPVGWLQCNGAVFDKSKFPELAKAYPSGYLPDLRGEFIRGWDNGRGV 203 Query: 146 ASGRAVLSQEQDGIKSHTHSASASSTDLGTETTSSFDYGTKSTNNTGAHTHSISGTANSA 205 GR + + D I++ T S + D T YG + GT S Sbjct: 204 DPGRVCSTWQGDAIRNITGSFPGAIADNYHLATKEAFYGKINLGIAT------DGTTKSK 257 Query: 206 GAHQHKSSGAFGGTNTSIFP 225 H + FG + + P Sbjct: 258 NIHNPDNPYGFGFDASRVVP 277 >UniRef50_B2SVF7 Phage-related protein n=3 Tax=Xanthomonas oryzae pv. oryzae RepID=B2SVF7_XANOP Length = 501 Score = 124 bits (310), Expect = 5e-27, Method: Composition-based stats. Identities = 62/279 (22%), Positives = 100/279 (35%), Gaps = 32/279 (11%) Query: 68 DTTDANWSPWAQLY-TSAHPPAEFYPVGAPIPWPSDTVPSGYALMQGQTFDKSAYPKLAV 126 D D W + + + P F G + S P+G + G ++ Y L Sbjct: 224 DMLDGRQGDWYRDFGNMLNVPQSFLLPGQIVVMASLYPPNGLLVCDGAEISRAKYAALFA 283 Query: 127 AYPS----------GVIPDMRGWTIKGKPASGRAVLSQEQDGIKSHTHSASASSTDLGTE 176 A + +P ++ T+ ++ AV S + + SHTH ASA++ Sbjct: 284 AIGTVYGAGDGSTTFNVPKIKEGTVITHTSAATAVGSYDPGQVISHTHGASAAAVGDHAH 343 Query: 177 TTSSFDYGTKSTNNTGAHTHSISGTANSAGAHQHKSSGAFGGTNTSIFPNGYTAISNLSA 236 T+ G + + + A + H G+ + P + S Sbjct: 344 YTAINAAGNHAHGASAGAAGDHAHYAWTDAQGHHAHGGSTSASGDHQHPGVIPSASINGY 403 Query: 237 GIMSTTSGSGQTRNAGKTSSDGAHTHSLSGTAASAGAHAHTV---GIGAHTHSVAI---- 289 G+ + G T + G H HS AG+H H + G+G HTH + I Sbjct: 404 GVYRERDNDAAPSD-GWTGAGGNHAHSF--GTDGAGSHGHNISMNGVGNHTHGIGIAEGG 460 Query: 290 -----------GSHGHTITVNAAGNAENTVKNIAFNYIV 317 G+H HTITVNAAG +N + Y + Sbjct: 461 NHVHDVDHRGAGAHAHTITVNAAGGIDNLPAGLRMTYCI 499 >UniRef50_C5ABB4 Putative uncharacterized protein n=1 Tax=Burkholderia glumae BGR1 RepID=C5ABB4_BURGB Length = 670 Score = 123 bits (309), Expect = 6e-27, Method: Composition-based stats. 
Identities = 60/303 (19%), Positives = 92/303 (30%), Gaps = 57/303 (18%) Query: 51 WSGTSGAHAPAFIRSRRDTTDANWSPWAQLYTSAHPPAEF--YPVGAPIPWPSDTVPSGY 108 + T A F + + A + T+ A +G + + +GY Sbjct: 390 VTVTFAQEATYFQKPAAGPSPAPDDSSLRFATTEWVTAAIGTASIGQIVMEARTSPRAGY 449 Query: 109 ALMQGQTFDKSAYPKL--------------------------AVAYPSGVIPDMRGWTIK 142 G + ++ YP L A +PD+RG ++ Sbjct: 450 VKCDGSQYKRADYPALWAYAQASGALVSEAEYTDGRWGGFSTADGQTYFRVPDLRGEFLR 509 Query: 143 GKPA------SGRAVLSQEQDGIKSHTHSASASSTDLGTETTSSFDYGTKSTNNTGAHTH 196 GRA+ S + ++H H AS+ + G H Sbjct: 510 CWSDGRGDVDPGRAIGSFQGGQNQAHAHGASSDPDGAHVHDAWTGGAGW----------H 559 Query: 197 SISGTANSAGAHQHKSSGAFGGTNTSIFPNGYTAISNLSAGIMSTTSGSGQTRNAGKTSS 256 S G G H H G + + Y S S + ++ + Sbjct: 560 SHHGVTGGGGMHNHA-----NGVFSRLLRPPYLGSLTGSDTDGSGNEQAVGGGDSADIAW 614 Query: 257 DGAHTHSLSGTAASAGAHAHTVGIGAHTHSVAIGSHGHTITVNAAGNAENTVKNIAFNYI 316 G H H AG H H VGIG G H H I V A G AE +N+A + Sbjct: 615 AGEHQHEF--WTDGAGDHVHAVGIGN------AGGHAHAIHVQADGGAEARPRNVALLAM 666 Query: 317 VRL 319 +R Sbjct: 667 IRA 669 >UniRef50_UPI00016A4B89 phage-related tail fiber protein n=2 Tax=Burkholderia thailandensis RepID=UPI00016A4B89 Length = 654 Score = 118 bits (294), Expect = 3e-25, Method: Composition-based stats. 
Identities = 52/274 (18%), Positives = 86/274 (31%), Gaps = 56/274 (20%) Query: 79 QLYTSAHPPAEFYP--VGAPIPWPSDTVPSGYALMQGQTFDKSAYPKL-AVAYPSGVI-- 133 ++ T+ E VG + +GY G ++ YP L A A SG + Sbjct: 403 RVATTQWIAGELASAMVGQIVFEMRTAARAGYLKCNGALVKRADYPALWAYAQGSGALVA 462 Query: 134 -----------------------PDMRGWTIKGKPASG-----RAVLSQEQDGIKSHTHS 165 P++RG ++ R + + + ++H H+ Sbjct: 463 EKDWMSGNFGCFSDGDGSATFRIPELRGEFLRCWDDGRGSDADRKIGTWQDSMNRTHGHA 522 Query: 166 ASASSTDLGTETTSSFDYGTKSTNNTGAHTHSISGTANSAGAHQHKSSGAFGGTNTSIFP 225 A A T+N G H H + + + + Sbjct: 523 AGADGVGDHGH--------NAWTDNQGWHGH-------HGWTGTNGNHNHNNDIFSRLLR 567 Query: 226 NGYTAISNLSAGIMSTTSGSGQTRNAGKTSSDGAHTHSLSGTAASAGAHAHTVGIGAHTH 285 Y S S + + ++ G H H + AG HAH VG+ A Sbjct: 568 PPYNGSLTGSDTAGSGSEQAVGGGDSADIRWAGDHNHEFN--TEGAGTHAHNVGVAAS-- 623 Query: 286 SVAIGSHGHTITVNAAGNAENTVKNIAFNYIVRL 319 G+H H I V A G E +N+A ++R Sbjct: 624 ----GAHSHAIHVAADGGNEARPRNLAVLAMIRA 653 >UniRef50_C6CP88 Tail Collar domain protein n=5 Tax=Enterobacteriaceae RepID=C6CP88_DICZE Length = 485 Score = 117 bits (293), Expect = 5e-25, Method: Composition-based stats. 
Identities = 63/242 (26%), Positives = 98/242 (40%), Gaps = 37/242 (15%) Query: 1 MNITALTDNTQGA--AGLELYEVYNNGYPTAYGNIIHL-KGMTAVGEGELLIGWSGTSGA 57 +++ LT + G L YP + + + K EG + + GA Sbjct: 226 LDLNTLTGSRAGRFWQNLNAAATAALNYPVQFAGSLDVEKNTADSAEGCIQRYTTYGGGA 285 Query: 58 HAPAFIRSRRDTTDANWSPWAQLYTSAHPPAEFYP------------------------V 93 FIRS W W +L + + P P Sbjct: 286 LPRMFIRSYN-AGKQVWGAWQELASLSSPTFTGTPTAPTAEAGSNTTQLATTAWFAAEIA 344 Query: 94 GAPIPWPSDTVPSGYALMQGQTFDKSAYPKLAVAYPSGVIPDMRGWTIKGKP-----ASG 148 G P+PWP VP+G+ GQ FDK+ YP+LA YPSGV+PD+RG I+G SG Sbjct: 345 GIPLPWPQAAVPTGWLKCNGQAFDKNRYPRLAQVYPSGVLPDLRGEFIRGWDDGRGVDSG 404 Query: 149 RAVLSQEQDGIKSHTHSASASSTDLGTETTSSFDY----GTKSTNNTGAHTHSISGTANS 204 R VLSQ++ + ++ SA ++D + S+ ++ ++T T ++ Sbjct: 405 REVLSQQRGSLINYDGPDSAPTSDSLRLSVSAAQADAVSASEYAGVMLSYTAYNITTVSA 464 Query: 205 AG 206 AG Sbjct: 465 AG 466 >UniRef50_D1TPQ4 Phage tail collar domain protein n=1 Tax=Yersinia pestis KIM D27 RepID=D1TPQ4_YERPE Length = 262 Score = 116 bits (290), Expect = 1e-24, Method: Composition-based stats. Identities = 40/111 (36%), Positives = 56/111 (50%), Gaps = 5/111 (4%) Query: 89 EFYPVGAPIPWPSDTVPSGYALMQGQTFDKSAYPKLAVAYPSGVIPDMRGWTIKGKP--- 145 PVG P+PWP+ T P G+ G FDK YPKLA+AYPSG++PD+RG I+G Sbjct: 102 SAIPVGVPLPWPTATPPEGWLKCNGAIFDKVKYPKLALAYPSGILPDLRGEFIRGWDDGL 161 Query: 146 --ASGRAVLSQEQDGIKSHTHSASASSTDLGTETTSSFDYGTKSTNNTGAH 194 +GR +LS + D I++ + + SS G T+ Sbjct: 162 GVDAGREILSIQGDAIRNISGGIQGRNEATSARLFSSNATGVFRTDGQFGS 212 >UniRef50_C7BL21 Similarities with phage tail fiber protein n=1 Tax=Photorhabdus asymbiotica subsp. asymbiotica ATCC 43949 RepID=C7BL21_PHOAA Length = 452 Score = 116 bits (290), Expect = 1e-24, Method: Composition-based stats. 
Identities = 66/184 (35%), Positives = 79/184 (42%), Gaps = 11/184 (5%) Query: 47 LLIGWSGTSGAHAPAFIRSRRDTTDANWSPWAQLYTSAHPPAEFYPVGAPIPWPSDTVPS 106 ++I TS H IR + W W +Y+SA P E +PVGAPIP+P P Sbjct: 261 VVIYQKYTS-HHGEVVIRQSW-NSGKTWIGWDIVYSSAILPPEQHPVGAPIPYPHRYTPV 318 Query: 107 GYALMQGQTFDKSAYPKLAVAYPSGVIPDMRGWTIKGKP-----ASGRAVLSQEQDGIKS 161 GY GQTFDKS YPKLA AYPSG +PD+RG I+G GR S + K+ Sbjct: 319 GYLTCNGQTFDKSLYPKLAEAYPSGRVPDLRGEFIRGWDDSRGVDPGRVCGSWQDSDNKA 378 Query: 162 HTHSAS---ASSTDLGTETTSSFDYGTKSTNNTGAHTHSISGT-ANSAGAHQHKSSGAFG 217 H H G T S T G + SG SAG H S G Sbjct: 379 HIHDDEFCYGGGDAGGDSGTMSAFAKKYCTPKDGVNGRPTSGWLPASAGLHSLPSGGNEA 438 Query: 218 GTNT 221 Sbjct: 439 RPRN 442 >UniRef50_D0IJ09 Tail fiber protein H putative n=1 Tax=Vibrio sp. RC586 RepID=D0IJ09_9VIBR Length = 368 Score = 116 bits (289), Expect = 1e-24, Method: Composition-based stats. Identities = 77/243 (31%), Positives = 110/243 (45%), Gaps = 66/243 (27%) Query: 79 QLYTSAHP--PAEFYPVGAPIPWPSDTVPSGYALMQGQTFDKSAYPKLAVAYPSGVIPDM 136 +L ++ A+ PVG P+PWPSD P G+A+ +GQ FDK A P+LA YP G++ D+ Sbjct: 189 KLVSNLWLKLAAKICPVGVPLPWPSDIAPEGFAIHKGQAFDKVANPELAKLYPDGILKDL 248 Query: 137 RGWTIKGKPASGRAVLSQEQDGIKSHTHSASASSTDLGTETTSSFDYGTKSTNNTGAHTH 196 RG + GK G +LS E D +K H + S SS D G+++TN TG H H Sbjct: 249 RGMAVVGK-KEGEIILSYEADQVKQHGYPNST---------VSSTDLGSRNTNTTGNHAH 298 Query: 197 SISGTANSAGAHQHKSSGAFGGTNTSIFPNGYTAISNLSAGIMSTTSGSGQTRNAGKTSS 256 ++ +G + T + + Y T++ Sbjct: 299 GYPAGTSNG------PNGPYLDTAHASYGYRY-------------------------TTT 327 Query: 257 DGAHTHSLSGTAASAGAHAHTVGIGAHTHSVAIGSHGHTITVNAAGNAENTVKNIAFNYI 316 +G H HS VAIGSH H+I + G ENT+KNI FN+I Sbjct: 328 EGNHYHS-----------------------VAIGSHAHSIAIALFGATENTIKNIKFNWI 364 Query: 317 VRL 319 VR+ Sbjct: 365 VRM 367 >UniRef50_D2U2G6 Phage tail assembly protein n=1 Tax=Arsenophonus nasoniae RepID=D2U2G6_9ENTR Length = 580 Score = 116 bits (289), Expect = 1e-24, Method: Composition-based stats. 
Identities = 53/180 (29%), Positives = 79/180 (43%), Gaps = 14/180 (7%) Query: 31 GNIIHLKGMTAVGEGELLIGWSGTSGAHAPAFIRSRRDTTDANWSPWAQLYTSAHPPAEF 90 G+ L TA G+ + + A I S++D T A S + Sbjct: 380 GDGQKLGFETAPGDA-YFVYRDAKNNNKAVVTIPSKKDGTLALTSDVEAINN-------- 430 Query: 91 YPVGAPIPWPSDTVPSGYALMQGQTFDKSAYPKLAVAYPSGVIPDMRGWTIKGKP----- 145 YPVGAPIPWP T P+GY + G FDK+ YP+LA+AYPSG +P + G I+G Sbjct: 431 YPVGAPIPWPQATPPNGYFVCDGNYFDKAKYPQLALAYPSGKLPLLYGEFIRGLDLGRKV 490 Query: 146 ASGRAVLSQEQDGIKSHTHSASASSTDLGTETTSSFDYGTKSTNNTGAHTHSISGTANSA 205 GR VLS + D I++ T + + + + +N + + G + Sbjct: 491 DPGRTVLSNQGDAIRNITGRIGYARHGGTEPPVVNGEGVFRRDSNHNVNIANGRGDDWGS 550 >UniRef50_Q2T5M0 Phage-related tail fiber protein n=19 Tax=root RepID=Q2T5M0_BURTA Length = 790 Score = 115 bits (288), Expect = 2e-24, Method: Composition-based stats. Identities = 62/264 (23%), Positives = 90/264 (34%), Gaps = 58/264 (21%) Query: 89 EFYPVGAPIPWPSDTVPSGYALMQGQTFDKSAYPKLAVAY-------------------- 128 +G + P TV G+ G +++ YP+L AY Sbjct: 551 SATTIGQIVFEPRTTVRPGFLKANGVLVNRADYPEL-WAYAQASGALVSDADWMKDRWGC 609 Query: 129 -------PSGVIPDMRGWTIKGKPASG------RAVLSQEQDGIKSHTHSASASSTDLGT 175 + +P++RG I+ + R + + + D +H H A+AS Sbjct: 610 FSTGDGATTFRLPELRGEFIRCWSDARGGVDATRQIGAFQGDQNHTHAHGAAASEAPDHV 669 Query: 176 ETTSSFDYGTKSTNNTGAHTHSISGTANSAGAHQHKSSGAFGGTNTSIFPNGYTAISNLS 235 T T+ G H H G N+ G HQH S +G P T + + Sbjct: 670 H--------TAWTDVQGWHGH--HGWTNAVGDHQHVSP--WGEHPQMYNPPWGTWGAANN 717 Query: 236 AGIMSTTSGSGQTRNAGKTSSDGAHTHSLSGTAASAGAHAHTVGIGAHTHSVAIGSHGHT 295 G GS G TS G H H + G H H VG G H HT Sbjct: 718 RGA----EGSDNDNVYGMTSPAGNHNHEFN--TEGNGNHGHAVG------IGGGGRHAHT 765 Query: 296 ITVNAAGNAENTVKNIAFNYIVRL 319 I V G E +N+A ++R Sbjct: 766 IAVQPDGGDEARPRNVALLALIRA 789 >UniRef50_B2I5N0 Tail Collar domain protein n=13 Tax=Xylella fastidiosa RepID=B2I5N0_XYLF2 Length = 414 Score = 115 bits (286), Expect = 3e-24, Method: Composition-based stats. 
Identities = 57/283 (20%), Positives = 93/283 (32%), Gaps = 27/283 (9%) Query: 60 PAFIRSRRDTTDANWSPWAQLYTSAHPPAEFYPVGAPIPWPSDTVPSGYALMQGQTFDKS 119 + T D N Q + Y G + G L G+ ++ Sbjct: 105 ASRYGLYLSTADNNTDTPLQNEPNRWTALSRYEPGQIVYTAGKRALPGTLLCDGRAVSRA 164 Query: 120 AYPKLAVAY----------PSGVIPDMRGWTIKGKPASGRAVLSQEQDGIKSHTHSASAS 169 YP+L + IP+ T+ A V + + SH H+A+A Sbjct: 165 MYPRLFEEINTSYGAGDGVSTFNIPNFLEGTVGVHTADPALVGTFTSGQVISHAHTATAE 224 Query: 170 STDLGTETTSSFDYGTKSTNNT--GAHTHSISGTANSAGAHQHKSSGAFGGTNTSIFP-N 226 + G + + A H ++ G HQH S ++ G + I Sbjct: 225 EGGRHLHPVTVHPAGRHTHPASAAAAGNHLHQAWSDEQGLHQHTGSTSWDGDHAHILGSF 284 Query: 227 GYTAISNLSAGIMSTTSGSGQTRNAG------KTSSDGAHTHSLSGTAASAGAHAHTVGI 280 S G G T G T ++G H H++S A AG H H + + Sbjct: 285 RAIYASGRDMGFYEQNQGKVTTNVTGGHLHRFTTDANGKHAHNISMQA--AGFHVHDIAV 342 Query: 281 GA---HTHSV---AIGSHGHTITVNAAGNAENTVKNIAFNYIV 317 A H H+ + G HGHT++++ G N + + Sbjct: 343 TAEADHAHAATAESAGRHGHTVSIDRFGEHHNLPAGLRVMACL 385 >UniRef50_Q7N2Q1 Similar to Sc/SvQ protein of Escherichia coli plasmid p15B n=1 Tax=Photorhabdus luminescens subsp. laumondii RepID=Q7N2Q1_PHOLL Length = 478 Score = 113 bits (283), Expect = 7e-24, Method: Composition-based stats. 
Identities = 54/192 (28%), Positives = 83/192 (43%), Gaps = 12/192 (6%) Query: 1 MNITALTDNTQGAAGLELYEVYNNGYPTAYGNIIHLKGMTAVGEGEL------LIGWSGT 54 +N AL+ + AG ++ + + +A + + + E IG GT Sbjct: 240 INGKALSGDVSLNAG-DVGALPISSTLSAQTGTLRINNGSNWPNIEFRAANKHFIGIEGT 298 Query: 55 SGAHAPAFIRSRRDTTDANWSPWAQLYTSAHPPAEFYPVGAPIPWPSDTVPSGYALMQGQ 114 +G + + + T A VG+PIPWP VP+GY GQ Sbjct: 299 AGNRLTIYANDENSNRKYTLATPEKSGTLATLDDINISVGSPIPWPLPNVPAGYLACNGQ 358 Query: 115 TFDKSAYPKLAVAYPSGVIPDMRGWTIKGKPAS-----GRAVLSQEQDGIKSHTHSASAS 169 +F+KS YP+LA+AYPSGV+PD+RG I+G GR VL+ + D I++ T + Sbjct: 359 SFNKSLYPQLAIAYPSGVLPDLRGEFIRGWDDGRGVDRGRGVLTHQGDAIRNITGYTPGT 418 Query: 170 STDLGTETTSSF 181 F Sbjct: 419 ILRGNNSYGGCF 430 >UniRef50_C6CGA4 Tail Collar domain protein n=7 Tax=Enterobacteriaceae RepID=C6CGA4_DICZE Length = 401 Score = 113 bits (282), Expect = 7e-24, Method: Composition-based stats. Identities = 50/234 (21%), Positives = 88/234 (37%), Gaps = 7/234 (2%) Query: 7 TDNTQGAAGLELYEVYNNGYPTAYGNIIHLKGMTAVGEGELLIGWSGTSGAHAPAFIRSR 66 ++Q A + + P + + + + T+ A ++ + Sbjct: 162 YADSQLNAHVAAANPHPQYAPLSSPALTGVPTAPTAANSANSTQLATTAFVKNTALLKEQ 221 Query: 67 RDTTDANWSPWAQLYTSAHPPAEFYPVGAPIPWPSDTVPSGYALMQGQTFDKSAYPKLAV 126 AN S + + VG P+PWP T P+G+ GQ FDK+A+PKLA Sbjct: 222 NGADIANKSAFLANLGLSDTLKIADIVGIPLPWPQATPPAGWLKCNGQAFDKNAFPKLAQ 281 Query: 127 AYPSGVIPDMRGWTIKGKPASG-----RAVLSQEQDGIKSHTHSASASSTDLGTETTSSF 181 AYP GV+PD+RG I+G R +LS ++ + + S+ ++G ++ Sbjct: 282 AYPGGVLPDLRGEFIRGWDDGRGVDVARELLSWQKGTLT--ISDPNLSAVNVGALIHANN 339 Query: 182 DYGTKSTNNTGAHTHSISGTANSAGAHQHKSSGAFGGTNTSIFPNGYTAISNLS 235 D + + A + +N F G T N++ Sbjct: 340 DSANTYKSMGFDIVNKSDYAMLRAAINVETVGAQDLDSNGWQFGYGATRPRNIA 393 >UniRef50_C6CGA0 Tail Collar domain protein n=5 Tax=Enterobacteriaceae RepID=C6CGA0_DICZE Length = 166 Score = 112 bits (280), Expect = 1e-23, Method: Composition-based stats. 
Identities = 41/106 (38%), Positives = 62/106 (58%), Gaps = 5/106 (4%) Query: 93 VGAPIPWPSDTVPSGYALMQGQTFDKSAYPKLAVAYPSGVIPDMRGWTIKGKP-----AS 147 VG P+PWP T P+G+ GQ FDK+A+PKLA YPSGV+PD+RG I+G S Sbjct: 23 VGIPLPWPQATPPAGWLKCNGQAFDKNAFPKLAQVYPSGVLPDLRGEFIRGWDDGRGVDS 82 Query: 148 GRAVLSQEQDGIKSHTHSASASSTDLGTETTSSFDYGTKSTNNTGA 193 R +LS + D I++ T S + + +D G++++ + G+ Sbjct: 83 NRNLLSSQGDAIRNITGFVSGVYVGFDGYSGAFYDTGSRNSISPGS 128 >UniRef50_UPI000190EC42 bacteriophage tail fiber protein n=1 Tax=Salmonella enterica subsp. enterica serovar Typhi str. E01-6750 RepID=UPI000190EC42 Length = 317 Score = 112 bits (278), Expect = 2e-23, Method: Composition-based stats. Identities = 40/128 (31%), Positives = 56/128 (43%), Gaps = 5/128 (3%) Query: 89 EFYPVGAPIPWPSDTVPSGYALMQGQTFDKSAYPKLAVAYPSGVIPDMRGWTIKGKP--- 145 PVG P+PWPS T+P G+ G F YPKLA AYP+ +PD+RG I+G Sbjct: 169 SALPVGVPVPWPSATLPEGWLKCNGAAFSSEMYPKLAKAYPTNKLPDLRGEFIRGWDDGR 228 Query: 146 --ASGRAVLSQEQDGIKSHTHSASASSTDLGTETTSSFDYGTKSTNNTGAHTHSISGTAN 203 +GR +LS ++ I S + T F S + A+ Sbjct: 229 GIDAGREILSFQEGTIVSGFDDNDTGDISSLSSTQYGFGDTLSSNQWGAINGKKWIFDAS 288 Query: 204 SAGAHQHK 211 S GA ++ Sbjct: 289 SKGAQKYD 296 >UniRef50_A6VBH2 Putative tail fiber protein n=2 Tax=Pseudomonas aeruginosa PA7 RepID=A6VBH2_PSEA7 Length = 654 Score = 111 bits (277), Expect = 3e-23, Method: Composition-based stats. 
Identities = 58/311 (18%), Positives = 108/311 (34%), Gaps = 77/311 (24%) Query: 26 YPTAYGNIIHLKGMTAVGEGE--LLIGWSGTSGAHAPAFIRSRRDTTDANWSPWAQLYTS 83 P +G + A E E + G S + F + + W+P+ +++ S Sbjct: 402 VPEIFGMSYGVVATFAYSERETRASQLFFGQSPENKLMF-----RSGNYTWAPFLEIWHS 456 Query: 84 AHP-PAEFYPVGAPIPWPSDTVPSGYALMQGQTFDKSAYPKLAVAYP----------SGV 132 + P P GA + + + P+GY G ++AY L + Sbjct: 457 GNLNPQAIVPAGAVVAFAMYSPPAGYLKANGAAVSRTAYAALFATIGTYYGAGDGSTTFN 516 Query: 133 IPDMRGWTIKGKPAS-----GRAVLSQEQDGIKSHTHSASASSTDLGTETTSSFDYGTKS 187 +PD RG ++ GR + + + +HTH AS+S Sbjct: 517 LPDYRGEFLRALDDGRGLDLGRQLGTLQSSQNLAHTHGASSSG----------------- 559 Query: 188 TNNTGAHTHSISGTANSAGAHQHKSSGAFGGTNTSIFPNGYTAISNLSAGIMSTTSGSGQ 247 G HTH+++GTA + +++ + + + Sbjct: 560 ---NGGHTHTVTGTA----------------AAAGAHSHSIASVNATALVSGTRLATLVG 600 Query: 248 TRNAGKTSSDGAHTHSLSGTAASAGAHAHTVGIGAHTHSVAIGSHGHTITVNAAGNAENT 307 + T G HTH+++G AA G +H HTI V ++G +E Sbjct: 601 NASNSTTDVAGDHTHAVTGVAALEG------------------THNHTIYVESSGGSEAR 642 Query: 308 VKNIAFNYIVR 318 +N++ ++ Sbjct: 643 PRNVSVLICIK 653 >UniRef50_A4WEL3 Phage Tail Collar domain protein n=2 Tax=Enterobacteriaceae RepID=A4WEL3_ENT38 Length = 340 Score = 110 bits (275), Expect = 5e-23, Method: Composition-based stats. Identities = 46/136 (33%), Positives = 65/136 (47%), Gaps = 8/136 (5%) Query: 63 IRSRRDTTDANWSPWAQLYTSAHPPAEF---YPVGAPIPWPSDTVPSGYALMQGQTFDKS 119 ++ D+ A A + AE PVG P+PWP T P G+ G FDK Sbjct: 159 VKMYADSVLAAHVDAANPHPQYLKTAEIDNYLPVGFPLPWPQATPPQGWLKCNGAPFDKV 218 Query: 120 AYPKLAVAYPSGVIPDMRGWTIKGKP-----ASGRAVLSQEQDGIKSHTHSASASSTDLG 174 YPKLAVAYPSG++PD+RG I+G SGR L+ + D ++ T +AS + Sbjct: 219 KYPKLAVAYPSGLLPDLRGEFIRGWDDGRGVDSGRVALTTQGDAVQKMTGAASNGAATGF 278 Query: 175 TETTSSFDYGTKSTNN 190 ++S G + Sbjct: 279 VNNSTSRVSGVFKRGS 294 >UniRef50_C6V0Q3 Phage tail fiber protein n=13 Tax=Enterobacteriaceae RepID=C6V0Q3_ECO5T Length = 439 Score = 110 bits (275), Expect = 5e-23, Method: Composition-based stats. 
Identities = 36/102 (35%), Positives = 50/102 (49%), Gaps = 5/102 (4%) Query: 89 EFYPVGAPIPWPSDTVPSGYALMQGQTFDKSAYPKLAVAYPSGVIPDMRGWTIKGKPAS- 147 PVG P+PWPS T P+G+ G F YP+LA YP+ +PD+RG I+G Sbjct: 282 SALPVGVPVPWPSATPPTGWLKCNGAAFSAEEYPELAKVYPTNKLPDLRGEFIRGWDDGR 341 Query: 148 ----GRAVLSQEQDGIKSHTHSASASSTDLGTETTSSFDYGT 185 GR +L+ + I SH H ++ +T SF T Sbjct: 342 GVDNGRGLLTLQDGAIVSHNHYWGIWTSRTNDQTLESFTGTT 383 >UniRef50_C5BH14 Tail fiber n=2 Tax=Enterobacteriaceae RepID=C5BH14_EDWI9 Length = 593 Score = 110 bits (275), Expect = 5e-23, Method: Composition-based stats. Identities = 53/200 (26%), Positives = 82/200 (41%), Gaps = 27/200 (13%) Query: 23 NNGYPTAYGNIIHLKGMTAVGEGELLIGWSGTSGAHAP---------------------A 61 N AY N + + G+ L S TS +AP A Sbjct: 362 ANAVRYAYENAVRPATTSQAGQVLLEDSVSSTSTTNAPTSSALKRTYDRANSAYDRANSA 421 Query: 62 FIRSRRDTTDANWSPWAQLYTSAHPPAEFYPVGAPIPWPSDTVPSGYALMQGQTFDKSAY 121 + R+ S + + Y + PVG P PWP+ ++PSG+ GQ+F S+Y Sbjct: 422 YDRAS-SAYSYAGSIYDKAYDAYDIARRAPPVGTPQPWPNTSIPSGWIKCAGQSFSTSSY 480 Query: 122 PKLAVAYPSGVIPDMRGWTIKGKPASG-----RAVLSQEQDGIKSHTHSASASSTDLGTE 176 P+LA AYP+G +PD+RG I+G G R +LS + D +++ T + + Sbjct: 481 PELAKAYPNGRLPDLRGEFIRGYDDYGGTDSQRQILSWQGDAMRNITGTFGVDDQTIEQV 540 Query: 177 TTSSFDYGTKSTNNTGAHTH 196 T +YG S + Sbjct: 541 TGVFREYGRFSYDARSERNG 560 >UniRef50_D2TR91 Putative phage tail fibre protein n=1 Tax=Citrobacter rodentium ICC168 RepID=D2TR91_CITRO Length = 279 Score = 110 bits (275), Expect = 6e-23, Method: Composition-based stats. 
Identities = 50/171 (29%), Positives = 71/171 (41%), Gaps = 7/171 (4%) Query: 24 NGYPTAYGNIIHLKGMTAVGEGELLIGWSGTSGAHAPAFIRSRRDTTDANWSPWAQLYTS 83 G PTA + S + + + + P A + Sbjct: 78 TGTPTAPTPASSDNSKKLATTEFVARIISALTETVSGKLSQEQNGADIP--DPEAFVKNL 135 Query: 84 AHPPAEFYPVGAPIPWPSDTVPSGYALMQGQTFDKSAYPKLAVAYPSGVIPDMRGWTIKG 143 PVG P+PWPS T P G+ G TF S YPKL +AYPSG +PD+RG I+G Sbjct: 136 GLGEGSALPVGVPVPWPSATPPEGWLKCNGATFSSSLYPKLGLAYPSGKLPDLRGEFIRG 195 Query: 144 KPAS-----GRAVLSQEQDGIKSHTHSASASSTDLGTETTSSFDYGTKSTN 189 GR++LS + D +SH+H+ S + T+ +D T N Sbjct: 196 WDDGRGADNGRSLLSSQGDAFRSHSHNFDRSWGLENFDATAGYDVVTADIN 246 >UniRef50_D2BT14 Tail Collar domain protein n=1 Tax=Dickeya dadantii Ech586 RepID=D2BT14_DICD5 Length = 534 Score = 110 bits (274), Expect = 8e-23, Method: Composition-based stats. Identities = 54/201 (26%), Positives = 88/201 (43%), Gaps = 27/201 (13%) Query: 33 IIHLKGMTAVGEG-ELLIGWSGTSGAHAPAFIRSRRDTTDANWSPWAQLYTSAHPPAEFY 91 ++ G + +G ++ +GW+G+ +R + D T ++ W + AE Sbjct: 339 VVRAGGGNGMADGHQISLGWTGSG-------LRVQVDAT--SFDLWHKDNVFPIHAAEI- 388 Query: 92 PVGAPIPWPSDTVPSGYALMQGQTFDKSAYPKLAVAYPSGVIPDMRGWTIKGKP-----A 146 VG P+P+P T P G+ GQ+F+K+A+P LA YPSG +PD+RG I+G Sbjct: 389 -VGIPLPYPGATAPDGWLKCNGQSFNKAAFPLLAQRYPSGFLPDLRGEFIRGWDDSRGVD 447 Query: 147 SGRAVLSQEQDGIKSHTHSASASS--------TDLGTETTSSFDYGTKSTNNT--GAHTH 196 GR +LS ++ +H+H + + FDY + N T H Sbjct: 448 PGRGLLSFQESQNLTHSHGVNDPGHSHPYNKYEGSVGSGLAGFDYDQDAWNATVYTGHVG 507 Query: 197 SISGTANSAGAHQHKSSGAFG 217 + A S G + AF Sbjct: 508 TGISIAASGGHEARPRNIAFN 528 Score = 82.0 bits (200), Expect = 3e-14, Method: Composition-based stats. 
Identities = 38/205 (18%), Positives = 65/205 (31%), Gaps = 1/205 (0%) Query: 116 FDKSAYPKLAVAYPSGVIPDMRGWTIKGKPASGRAVLSQEQDGIKSHTHSASASSTDLGT 175 D+ ++ A + D ++ G SG V H + Sbjct: 331 IDQIDNDRVVRAGGGNGMADGHQISL-GWTGSGLRVQVDATSFDLWHKDNVFPIHAAEIV 389 Query: 176 ETTSSFDYGTKSTNNTGAHTHSISGTANSAGAHQHKSSGAFGGTNTSIFPNGYTAISNLS 235 + T + S + A A ++ S I + + Sbjct: 390 GIPLPYPGATAPDGWLKCNGQSFNKAAFPLLAQRYPSGFLPDLRGEFIRGWDDSRGVDPG 449 Query: 236 AGIMSTTSGSGQTRNAGKTSSDGAHTHSLSGTAASAGAHAHTVGIGAHTHSVAIGSHGHT 295 G++S T + G +H ++ + +G A +V G G Sbjct: 450 RGLLSFQESQNLTHSHGVNDPGHSHPYNKYEGSVGSGLAGFDYDQDAWNATVYTGHVGTG 509 Query: 296 ITVNAAGNAENTVKNIAFNYIVRLA 320 I++ A+G E +NIAFNYIVR A Sbjct: 510 ISIAASGGHEARPRNIAFNYIVRAA 534 >UniRef50_A9R3H4 Tail collar domain protein n=20 Tax=Yersinia RepID=A9R3H4_YERPG Length = 259 Score = 109 bits (272), Expect = 1e-22, Method: Composition-based stats. Identities = 39/111 (35%), Positives = 55/111 (49%), Gaps = 5/111 (4%) Query: 89 EFYPVGAPIPWPSDTVPSGYALMQGQTFDKSAYPKLAVAYPSGVIPDMRGWTIKGKP--- 145 VG P+PWP+ T P G+ G FDK YPKLA+AYPSG++PD+RG I+G Sbjct: 102 SAILVGVPLPWPTATPPEGWLKCNGAIFDKVKYPKLALAYPSGILPDLRGEFIRGWDDGL 161 Query: 146 --ASGRAVLSQEQDGIKSHTHSASASSTDLGTETTSSFDYGTKSTNNTGAH 194 +GR +LS + D I++ + + SS G T+ Sbjct: 162 GVDAGREILSIQGDAIRNISGGIQGRNEATSARLFSSNATGVFRTDGQFGS 212 >UniRef50_A9ITY4 Phage related protein n=6 Tax=Bartonella RepID=A9ITY4_BART1 Length = 376 Score = 108 bits (270), Expect = 2e-22, Method: Composition-based stats. 
Identities = 42/271 (15%), Positives = 79/271 (29%), Gaps = 53/271 (19%) Query: 70 TDANWSPWAQLYTSAHPPA--EFYPVGAPIPWPSDTVPSGYALMQGQTFDKSAYPKLAVA 127 D + + W L + + + P G P+ + +P G+ L G+ + + Y L Sbjct: 136 YDEDITGWQILNPTRGKVSFLKRLPSGLIGPFAMERLPDGWLLCDGRAYSRRTYRALFDG 195 Query: 128 YP----------SGVIPDMRGWTIKGKP-----ASGRAVLSQEQDGIKSHTHSASASSTD 172 + +PD RG ++G R+ SQ+ +K+H H + + Sbjct: 196 IGTTWGEGDGSTTFNVPDFRGMFLRGMDYERNLDPWRSFASQQGCSLKAHEHFIGPAFPN 255 Query: 173 LGTET-----TSSFDYGTKSTNNTGAHTHSISGTANSAGAHQHKSSGAFGGTNTSIFPNG 227 + +SS T+ + G A + G+ P Sbjct: 256 DHSSRKRRDVSSSQAPVTRRKRAIDEECLGLDGDALDKCNQEFD---QIAGSPQVEVPFW 312 Query: 228 YTAISNLSAGIMSTTSGSGQTRNAGKTSSDGAHTHSLSGTAASAGAHAHTVGIGAHTHSV 287 +T + S +G T AH Sbjct: 313 FTEKDKPARLPWFIRSPFANFLYYSTPIKEGVMT-----------AH------------- 348 Query: 288 AIGSHGHTITVNAAGNAENTVKNIAFNYIVR 318 H H + + G E N++ Y ++ Sbjct: 349 ----HEHHLMAESVGGVETRPVNVSIVYGIK 375 >UniRef50_B3X4P8 Tail fiber n=3 Tax=Enterobacteriaceae RepID=B3X4P8_SHIDY Length = 305 Score = 108 bits (270), Expect = 2e-22, Method: Composition-based stats. Identities = 71/135 (52%), Positives = 85/135 (62%), Gaps = 11/135 (8%) Query: 3 ITALTDNTQGAAGLELYEVYNNGYPTAYGNIIHLKGMTAVGEGELLIGWSGTSGAHAPAF 62 +TAL+ QG AGL++YEVYNNGYPTAYGN++HLKG A GEGELLIGWSGTSGAHAP + Sbjct: 119 VTALSSTAQGNAGLQMYEVYNNGYPTAYGNVLHLKGAAASGEGELLIGWSGTSGAHAPVY 178 Query: 63 IRSRRDTTDANWSPWAQLYTSAHPPAEFYPVGAPIPWPSDTVPSGYALMQGQTFDKSAYP 122 IRSRRDTTDA WS WAQ++TS S T + G FD + Sbjct: 179 IRSRRDTTDAVWSEWAQVFTSKDSFNA----------ASATKLQTPRKINGTAFDGTRDI 228 Query: 123 KLAVAYPSGVIPDMR 137 ++ SG + D R Sbjct: 229 TISST-DSGAVRDFR 242 >UniRef50_D2MH12 Tail Collar domain protein n=1 Tax=Rhodopseudomonas palustris DX-1 RepID=D2MH12_RHOPA Length = 346 Score = 108 bits (270), Expect = 2e-22, Method: Composition-based stats. 
Identities = 55/304 (18%), Positives = 78/304 (25%), Gaps = 54/304 (17%) Query: 21 VYNNGYPTAYGNIIHLKGM----TAVGEGELLIGWSGTSGAHAPAFIRSRRDTTDANWSP 76 N +P A G I G A +G+ F R + T Sbjct: 92 TLKNSFPNASGPITRSLGAGYGFAATADGDASGPAFSFGSEPGLGFYRKSQGT------- 144 Query: 77 WAQLYTSAHPPAEFYPVGAPIPWPSDTVPSGYALMQGQTFDKSAYPKLAVAYPSGVIPDM 136 Y P G + + T P G+ GQ + L A Sbjct: 145 --IAYPGTLRGIGSIPPGFILDFAGPTPPEGWLTCDGQLVSTVTFADLFAAIGYTW---- 198 Query: 137 RGWTIKGKPASGRAVLSQEQDGIKSHTHSASASSTDLGTETTSSFDYGTKSTNNTGAHTH 196 S Q + + D T + GT TN G H+H Sbjct: 199 --------------GGSGGQFAVPNLVKRFRRHRGD----GTVAGGVGTLQTNQIGLHSH 240 Query: 197 SISGTANSAGAHQHKS-SGAFGGTNTSIFPNGYTAISNLSAGIMSTTSGSGQTRNAGKTS 255 S S A H S +N P + I + G + Sbjct: 241 SASMDAQGHHDHYLDLWSSGMNRSNPHSHPASGSGIGVSGGFDTGVYAPQGPLNGVSIGA 300 Query: 256 SDGAHTHSLSGTAASAGAHAHTVGIGAHTHSVAIGSHGHTITVNAAGNAENTVKNIAFNY 315 +D H H ++G A G H H ITV A G E + Sbjct: 301 TDINHEHRVTGNTAGNGGHIHN------------------ITVAANGGNETRPDSATVMA 342 Query: 316 IVRL 319 +++ Sbjct: 343 CIKV 346 >UniRef50_B3I9S3 DNA inversion product n=5 Tax=Escherichia coli RepID=B3I9S3_ECOLX Length = 546 Score = 108 bits (269), Expect = 2e-22, Method: Composition-based stats. Identities = 34/89 (38%), Positives = 47/89 (52%), Gaps = 5/89 (5%) Query: 89 EFYPVGAPIPWPSDTVPSGYALMQGQTFDKSAYPKLAVAYPSGVIPDMRGWTIKGKP--- 145 PVG P+PWPS T P+G+ G F YP+LA AYP+ +PD+RG I+G Sbjct: 384 SALPVGVPVPWPSATPPTGWLKCNGAAFSVEEYPELAKAYPTNKLPDLRGEFIRGWDDGR 443 Query: 146 --ASGRAVLSQEQDGIKSHTHSASASSTD 172 +GRA+L+ + I H H + D Sbjct: 444 GIDTGRALLNWQPHTILDHAHYMELWTGD 472 >UniRef50_D2U1K0 Phage tail protein n=1 Tax=Arsenophonus nasoniae RepID=D2U1K0_9ENTR Length = 366 Score = 108 bits (268), Expect = 4e-22, Method: Composition-based stats. 
Identities = 39/87 (44%), Positives = 50/87 (57%), Gaps = 5/87 (5%) Query: 86 PPAEFYPVGAPIPWPSDTVPSGYALMQGQTFDKSAYPKLAVAYPSGVIPDMRGWTIKGKP 145 YPVGAPIPWP T P GY + G+ FDK PKL +AYPSG +PD+RG+ I+G Sbjct: 213 DAVNNYPVGAPIPWPQATPPKGYLICNGEPFDKVKCPKLLIAYPSGKLPDLRGYFIRGWD 272 Query: 146 -----ASGRAVLSQEQDGIKSHTHSAS 167 GR V S ++D I++ T Sbjct: 273 AGKGVDPGREVFSYQEDAIRNITGRIG 299 >UniRef50_Q6KGF6 Putative tail fiber protein GP37 n=2 Tax=unclassified Myoviridae RepID=Q6KGF6_9CAUD Length = 782 Score = 108 bits (268), Expect = 4e-22, Method: Composition-based stats. Identities = 86/311 (27%), Positives = 127/311 (40%), Gaps = 73/311 (23%) Query: 14 AGLELYEVYNNGYPTAYGNIIHLKGMTAVGEGELLIGWSGTSGAHAPA----------FI 63 G E+ + + G + ++ G + + + G S + + Sbjct: 479 TGTEVGDSDGIAWNAKTG-LYNVTGYSGGSTQLVFQMYQGASSTPSAQLKFNYRNGGFWY 537 Query: 64 RSRRDTTDANWSPWAQLYTSAHPP---------------------------AEFYPVGAP 96 RS RD + Q+YT + P + YPVG Sbjct: 538 RSSRDGFGFE-EDFTQIYTEKYKPTPSAIGAYTKAETDQKIAEAISDSTDLNKIYPVGIV 596 Query: 97 IPWPSDTVPSGYALMQGQTFDKSAYPKLAVAYPSGVIPDMRGWTIKGKPASGRAVLSQEQ 156 + S+ P+ +A P L Y + + G TI+ A+G V + Sbjct: 597 TWFNSNVNPN------------TALPGLTWTYLNNGV----GRTIRIAAANGSDVATTGG 640 Query: 157 DGIKSHTHSASASSTDLGTETTSSFDYGTKSTNNTGAHTHSISGTANSAGAHQHKSSGAF 216 + + S T + TTSSFDYGTK+TN TGAHTHS+SG+ N+ GAH H G + Sbjct: 641 SDSVTLSVGNLPSHTHSFSATTSSFDYGTKTTNTTGAHTHSVSGSTNNTGAHTHTFGGRY 700 Query: 217 GGTNTSIFPNGYTAISNLSAGIMSTTSGSGQTRNAGKTSSDGAHTHSLSGTAASAGAHAH 276 GG ++ SG+ Q +S G H+H++ GTAAS G HAH Sbjct: 701 GG-------------DSIGGKHRVHVSGTEQV-----SSVAGDHSHTVYGTAASNGNHAH 742 Query: 277 TVGIGAHTHSV 287 TVGIGAH+H+V Sbjct: 743 TVGIGAHSHTV 753 >UniRef50_C6DE08 Tail Collar domain protein n=1 Tax=Pectobacterium carotovorum subsp. carotovorum PC1 RepID=C6DE08_PECCP Length = 682 Score = 107 bits (266), Expect = 5e-22, Method: Composition-based stats. 
Identities = 49/135 (36%), Positives = 65/135 (48%), Gaps = 14/135 (10%) Query: 45 GELLIGWSGTSGAHAPAFIRSRRDTTDANWSPWAQLYTSAHPPAEFYP--------VGAP 96 G A+ RS RD + PWA++YT P VG P Sbjct: 478 GSASAAQFYFDFANGGIKYRSSRDNSGFE-KPWARIYTDQDKPTAADIGALSLNEIVGMP 536 Query: 97 IPWPSDTVPSGYALMQGQTFDKSAYPKLAVAYPSGVIPDMRGWTIKGKPAS-----GRAV 151 +PWP T PSG+ GQTFDK+ YPKLA YP+G++PD+RG I+G S GR + Sbjct: 537 MPWPQTTAPSGWLKCNGQTFDKNIYPKLAQIYPAGILPDLRGEFIRGWDDSRGVDTGRTL 596 Query: 152 LSQEQDGIKSHTHSA 166 LS + D I++ Sbjct: 597 LSTQGDAIRNIVGEI 611 >UniRef50_B2PZV1 Putative uncharacterized protein n=2 Tax=Gammaproteobacteria RepID=B2PZV1_PROST Length = 526 Score = 107 bits (266), Expect = 5e-22, Method: Composition-based stats. Identities = 58/234 (24%), Positives = 79/234 (33%), Gaps = 18/234 (7%) Query: 3 ITALTDNTQGAAGLE-LYEVYNNGYPTAYGNIIHLKGMTAVGEGELLIGWSGTSGAHAPA 61 I N G G + V + Y G K ++ S + Sbjct: 298 IPGNAGNPWGNNGSAHIINVRDGNYGFQIGRTTGNKNLS-------FRILSANVFSPPSV 350 Query: 62 FIRSRRDTTDANWSPWAQLYTSAHPPAEFYPVGAPIPWPSDTVPSGYALMQGQTFDKSAY 121 + T D N + L S PVGAPIPWP T PSGY + GQ F+K+ Y Sbjct: 351 LYSTGNTTKDHNGN----LKVSGSSELSDCPVGAPIPWPQATAPSGYLICNGQAFNKTTY 406 Query: 122 PKLAVAYPSGVIPDMRGWTIKGKP-----ASGRAVLSQEQDGIKSHTHSASASSTDLGTE 176 P L AYPSG +PD+RG I+G +GR VLS ++ + H H S Sbjct: 407 PLLTKAYPSGKLPDLRGEFIRGLDAGRNIDNGRVVLSFQRCATEHHKH-ISGWGEASNAN 465 Query: 177 TTSSFDYGTKSTNNTGAHTHSISGTANSAGAHQHKSSGAFGGTNTSIFPNGYTA 230 + + N Q + + G P Sbjct: 466 AIFGKTVKNGYVGSASTDRDNYLFYTNDGSEFQGSNPNSTGIMANETRPRNIAF 519 >UniRef50_Q7N2R3 Similar to tail fiber assembly protein from bacteriophage n=1 Tax=Photorhabdus luminescens subsp. laumondii RepID=Q7N2R3_PHOLL Length = 233 Score = 107 bits (265), Expect = 8e-22, Method: Composition-based stats. 
Identities = 46/127 (36%), Positives = 66/127 (51%), Gaps = 7/127 (5%) Query: 92 PVGAPIPWPSDTVPSGYALMQGQTFDKSAYPKLAVAYPSGVIPDMRGWTIKGKPAS---- 147 PVG P+P+PS P+GY GQ FDKS YP+LA+AYPSG++PD+RG I+G S Sbjct: 93 PVGVPLPYPSRYTPAGYLTCNGQAFDKSRYPQLAIAYPSGILPDLRGEFIRGWDDSRGVD 152 Query: 148 -GRAVLSQEQDGIKSHTHSASASSTDLGTETTSSFDYGTKSTNNTGAHTHSISGTANSAG 206 GR +LS + GI+ H H S + + N+T +T S+ ++ G Sbjct: 153 MGRGMLSWQPAGIQDHMHYKVISKQVVEDLVLAGNQSWGTEKNST--YTRSLDQNISTGG 210 Query: 207 AHQHKSS 213 + Sbjct: 211 VIGTTVN 217 >UniRef50_Q7N5C0 Similarities with DNA inversion product n=5 Tax=Photorhabdus RepID=Q7N5C0_PHOLL Length = 239 Score = 106 bits (264), Expect = 1e-21, Method: Composition-based stats. Identities = 43/129 (33%), Positives = 58/129 (44%), Gaps = 5/129 (3%) Query: 89 EFYPVGAPIPWPSDTVPSGYALMQGQTFDKSAYPKLAVAYPSGVIPDMRGWTIKGKP--- 145 PVG+PIPWP PSGY G F +S YPKLA AYP G IPD+RG I+G Sbjct: 99 SSIPVGSPIPWPLSHPPSGYFTCNGSAFSRSQYPKLAEAYPDGRIPDLRGEFIRGWDDGR 158 Query: 146 --ASGRAVLSQEQDGIKSHTHSASASSTDLGTETTSSFDYGTKSTNNTGAHTHSISGTAN 203 SGR +LS + D K + + D + +++ AN Sbjct: 159 GVDSGRVILSAQTDNTKRIQLTKGLPDGQFLSSYQGPVDRYQFPLGRDVLESATVTSIAN 218 Query: 204 SAGAHQHKS 212 + G H+ + Sbjct: 219 NTGGHETRP 227 >UniRef50_C7BSQ6 Putative tail fiber protein n=1 Tax=Photorhabdus asymbiotica subsp. asymbiotica ATCC 43949 RepID=C7BSQ6_PHOAA Length = 318 Score = 106 bits (263), Expect = 1e-21, Method: Composition-based stats. 
Identities = 43/177 (24%), Positives = 69/177 (38%), Gaps = 23/177 (12%) Query: 17 ELYEVYNNGYPTAYGNIIHLKGMTAVGEGELL---IGWSGTSGA-HAPAFIRSRRDTTDA 72 E+ N + G + L +G + GA A + + R Sbjct: 127 EIINSLREDLNNRVPNTRKVNGKELSTDINLSAVDVGALPSDGAVIAANKLATARTIAGV 186 Query: 73 NWSPWAQL--------------YTSAHPPAEFYPVGAPIPWPSDTVPSGYALMQGQTFDK 118 + A + PVG PIPWP+ P+G+ G FDK Sbjct: 187 AFDGTANINIPAGNVGAYTKAEVNDLINTVNNIPVGVPIPWPTAIPPTGWLQCNGAAFDK 246 Query: 119 SAYPKLAVAYPSGVIPDMRGWTIKGKP-----ASGRAVLSQEQDGIKSHTHSASASS 170 S +P+L AY SGV+PD+RG I+G + R++LS + D +++ T + + Sbjct: 247 SKFPQLVAAYSSGVLPDLRGEFIRGWDSSRGVDTNRSILSTQIDTMQNITGKVDSHN 303 >UniRef50_Q7N348 Similarities with tail fiber protein n=1 Tax=Photorhabdus luminescens subsp. laumondii RepID=Q7N348_PHOLL Length = 440 Score = 105 bits (262), Expect = 2e-21, Method: Composition-based stats. Identities = 44/131 (33%), Positives = 63/131 (48%), Gaps = 6/131 (4%) Query: 81 YTSAHPPAEFYPVGAPIPWPSDTVPSGYALMQGQTFDKSAYPKLAVAYPSGVIPDMRGWT 140 Y++ + P G P+P+P P GY GQTFDKS YPKLA AYP+G +PD+RG Sbjct: 286 YSNFDARYDNVPAGVPMPYPHRYTPPGYLTCNGQTFDKSLYPKLAEAYPAGRVPDLRGEF 345 Query: 141 IKGKPA-----SGRAVLSQEQDGIKSHTHSASASSTDLGTETTSSFDYGTKSTNNTGAHT 195 I+G GR + + D I H H AS + + D G +++ + T Sbjct: 346 IRGWDDSRGVDPGRVCGTWQADCIPDHNHYKVASKQLVEDLVL-TGDAGWYTSSGSSTRT 404 Query: 196 HSISGTANSAG 206 S+ + G Sbjct: 405 RSLDQNTYTGG 415 >UniRef50_B4EF34 Putative phage tail protein n=1 Tax=Burkholderia cenocepacia J2315 RepID=B4EF34_BURCJ Length = 883 Score = 105 bits (262), Expect = 2e-21, Method: Composition-based stats. 
Identities = 55/253 (21%), Positives = 86/253 (33%), Gaps = 50/253 (19%) Query: 94 GAPIPWPSDTVPSGYALMQGQTFDKSAYPKLAVAY------------------------- 128 G + P T +G+ + G ++ YP L AY Sbjct: 653 GTVVFEPRTTARAGFLKLNGALLKRADYPAL-WAYAQASGALSTETDWAAGWSGTFSTGD 711 Query: 129 --PSGVIPDMRGWTIKGKPASGRAVLSQEQDGIKSHTHSASASSTDLGTETTSSFDYGTK 186 + IP++RG ++ + ++ ++ ++ A T Sbjct: 712 GTTTFRIPELRGEFVRCWDDTRGVDPNRGLGASQNFANAWHAHGASAAASGDHVHSAWTD 771 Query: 187 STNNTGAHTHSISGTANSAGAHQHKSSGAFGGTNTSIFPNGYTAISNLSAGIMSTTSGSG 246 G H G S G HQH + + G I P G + + + + G Sbjct: 772 VQGWHGHH-----GWTASVGDHQHVAPYSESG----IAPFGTHSTNQVGSHG-----GVD 817 Query: 247 QTRNAGKTSSDGAHTHSLSGTAASAGAHAHTVGIGAHTHSVAIGSHGHTITVNAAGNAEN 306 TS G H H + AG H H VGIGA G+H H ITVN G E+ Sbjct: 818 NDNPWAFTSGAGGHNHEFN--TEGAGNHGHNVGIGA------AGNHSHAITVNGDGANES 869 Query: 307 TVKNIAFNYIVRL 319 +N+A ++R Sbjct: 870 RPRNVALLAMIRA 882 >UniRef50_B3R3K1 Bacteriophage large tail fiber protein n=1 Tax=Cupriavidus taiwanensis RepID=B3R3K1_CUPTR Length = 1045 Score = 105 bits (262), Expect = 2e-21, Method: Composition-based stats. 
Identities = 65/296 (21%), Positives = 97/296 (32%), Gaps = 12/296 (4%) Query: 28 TAYGNIIHLKGMTAVGEGELLIGWSGTSGAHAPAFIRSRRDTTDANWSPWAQ-LYTSAHP 86 TA G M GEL IG + T G+ A + ++ L T+A Sbjct: 757 TAPGTTARAVKMRLADNGELRIGNTATDGSGAKLQVTGYATADTPPAGDSSRKLATTAWV 816 Query: 87 PAEFY--PVGAPIPWPSDTVPSGYALMQGQTFDKSAYPKL-AVAYPSGVIPDMRGWTIKG 143 + VG I P T +G + G ++ YP+L A A SG I W Sbjct: 817 MSTLLTASVGQIIIEPRTTARAGCLKLNGALLKRADYPELWAYAQASGAIVTDAAWLAGS 876 Query: 144 KPASGRAVLSQEQDGIKSHTHSASASSTDLGTETTSSFDYGTKSTNNTGAHTHSISGTAN 203 + I + D G + H+H+ S T Sbjct: 877 WGCFSHGDGNT-TFRIPEYRGEYLRFWDDARGADAG-RGIGVFQDSQNKTHSHAASATPV 934 Query: 204 SAGAHQHKSSGAFGGTNTSIFPNGYTAISNLSAGIMSTTSGSGQTRNAGKTSSDGAHTHS 263 H + + P + + G + + + G Sbjct: 935 GDHNHGAWTDAQGWHGHGVNDPGHAHSFQTWTGGGATGAGRVSGSYVTNADAWAGTSASY 994 Query: 264 LSGTAASAGAHAHTVGIGAHTHSVAIGSHGHTITVNAAGNAENTVKNIAFNYIVRL 319 + A G+HAH VG+G G+H H ITVNA G AE V+NI+ ++R Sbjct: 995 TGISIAGDGSHAHNVGVG------YAGNHSHAITVNADGGAEVRVRNISALAMIRA 1044 >UniRef50_C6C5D2 Tail Collar domain protein n=2 Tax=Dickeya dadantii RepID=C6C5D2_DICDC Length = 498 Score = 105 bits (261), Expect = 2e-21, Method: Composition-based stats. Identities = 47/166 (28%), Positives = 71/166 (42%), Gaps = 21/166 (12%) Query: 93 VGAPIPWPSDTVPSGYALMQGQTFDKSAYPKLAVAYPSGVIPDMRGWTIKGKP-----AS 147 VG P+PWP T P+G+ GQ+FDK+ YPKLA YPSGV+PD+RG I+G + Sbjct: 335 VGIPLPWPQATAPTGWLKCNGQSFDKALYPKLATVYPSGVLPDLRGEFIRGWDDGRGVDA 394 Query: 148 GRAVLSQEQDGIKSHTHSASASSTDLGTETTSSFDYGTKSTNNTGAHTHSISGTANSAGA 207 GRA+L+ + + T DY +N G + A++A Sbjct: 395 GRAILTAQ----------------NPTYLRTGMMDYNGSDVDNIGVYIGMGYAEADTAAK 438 Query: 208 HQHKSSGAFGGTNTSIFPNGYTAISNLSAGIMSTTSGSGQTRNAGK 253 +GAF N + + ++ +T S + Sbjct: 439 SISAPAGAFRAPNNIDLTEQASRDNGVNGTASNTVYASEGSVWVST 484 >UniRef50_C6CG98 Tail Collar domain protein n=1 Tax=Dickeya zeae Ech1591 RepID=C6CG98_DICZE Length = 196 Score = 104 bits (259), Expect = 3e-21, Method: Composition-based stats. 
Identities = 34/134 (25%), Positives = 55/134 (41%), Gaps = 14/134 (10%) Query: 63 IRSRRDTTDANWSPWAQLYTSAHPPAEFYP---------VGAPIPWPSDTVPSGYALMQG 113 ++ + D N + + + YP +G P PWP P G+ G Sbjct: 20 YKNSQTHNDGNLHGCCRCHGEQQYAPDIYPASTDGLKELIGIPQPWPLAEAPEGWLKCNG 79 Query: 114 QTFDKSAYPKLAVAYPSGVIPDMRGWTIKGKP-----ASGRAVLSQEQDGIKSHTHSASA 168 QTFD + YP+LA YP+G +PD+RG I+G + R +LS + Sbjct: 80 QTFDTAKYPQLAKLYPAGTLPDLRGEFIRGWDDERGVDTDRKLLSAQAGTHILGDDGGYP 139 Query: 169 SSTDLGTETTSSFD 182 + +G + + D Sbjct: 140 TLNSIGNLSECNAD 153 >UniRef50_B5JYG6 Phage Tail Collar Domain family (Fragment) n=1 Tax=gamma proteobacterium HTCC5015 RepID=B5JYG6_9GAMM Length = 400 Score = 103 bits (256), Expect = 9e-21, Method: Composition-based stats. Identities = 60/238 (25%), Positives = 88/238 (36%), Gaps = 60/238 (25%) Query: 92 PVGAPIPWPSDTVPSGYALMQGQTFDKSAYPKLAVAYPS----------GVIPDMRGWTI 141 P G + T P G+ G ++ YP L A + +PD+R Sbjct: 212 PAGRTEDFAGTTPPGGWLFCDGSEVSRTQYPALFTAIGTLWGDGDGSTTFNLPDLRNDFR 271 Query: 142 KGKPASGRAVLSQEQDGIKSHTHSASASSTDLGTETTSSFDYGTKSTNNTGAHTHSISGT 201 +G + R+V E D IKSH+HSAS+ +GAHTH G Sbjct: 272 RGCSDT-RSVGDSESDQIKSHSHSASSED--------------------SGAHTHG--GR 308 Query: 202 ANSAGAHQHKSSGAFGGTNTSIFPNGYTAISNLSAGIMSTTSGSGQTRNAGKTSSDGAHT 261 ++ +GAH+H+S +G +N S P G T R +G + D Sbjct: 309 SSDSGAHKHRSG--WGESNRSDAPFGAT--------------SGSGHRGSGDSDWDNYLY 352 Query: 262 HSLSGTAASAGAHAHTVGIGAHTHSVAIGSHGHTITVNAAGNAENTVKNIAFNYIVRL 319 + +A H H + I GSH H I + G E +N I+R Sbjct: 353 Y-----TDTAQPHFHWLIINQ------AGSHSHPINIEPTGGDETRPRNKVLMPIIRA 399 >UniRef50_C5H7L2 Putative tail fiber protein GP37 n=3 Tax=unclassified Myoviridae RepID=C5H7L2_9CAUD Length = 391 Score = 103 bits (255), Expect = 1e-20, Method: Composition-based stats. 
Identities = 83/256 (32%), Positives = 119/256 (46%), Gaps = 22/256 (8%) Query: 57 AHAPAFIRSRRDTTDANWSPWAQLYTSAHPP-AEFYPVGAPIPWPSDTVPSGYALMQGQT 115 ++ + S DA+ A+ + + YPVG + PS Y L G T Sbjct: 122 SNVQNWTTSNLYNEDADKYATAKAVNNLYKAIQASYPVGTIHLSVNSANPSTYLLCGG-T 180 Query: 116 FDKSAYPKLAVAYPSGVIPDMRGWTIKGKPASGRAVLS-----QEQDGIKSHTHSASASS 170 ++ + + V Y + P + + + + + G +H+ S S SS Sbjct: 181 WELVSKGRALVGYDTDSRPVGSTFGSQTVALTNNNLPAHTHSIYLTGGGHTHSASVSISS 240 Query: 171 TDLGTETTSSFDYGTKSTNNTGAHTHSISGTANSAGAHQHKSSGAFGGTNTSIFPNGYTA 230 D G+++TS+FDYGTK+TN+ GAHTH+ SGT ++AG H H+ Sbjct: 241 FDYGSKSTSTFDYGTKTTNSAGAHTHTFSGTTSNAGNHNHRVPMRGNDR----------- 289 Query: 231 ISNLSAGIMSTTSGSGQTRNAGKTSSDGAHTHSLSGTAASAGAHAHTVGIGAHTHSVAIG 290 T S NA T GAHTHS SGT AS+GAH+HTV IGAH+H+V IG Sbjct: 290 ----GGTNAITASADAGVGNAMYTDLAGAHTHSFSGTTASSGAHSHTVAIGAHSHTVNIG 345 Query: 291 SHGHTITVNAAGNAEN 306 SH HT TV + + Sbjct: 346 SHSHTGTVTVSSSEHT 361 >UniRef50_C6DA10 Tail Collar domain protein n=7 Tax=Enterobacteriaceae RepID=C6DA10_PECCP Length = 689 Score = 103 bits (255), Expect = 1e-20, Method: Composition-based stats. Identities = 45/177 (25%), Positives = 66/177 (37%), Gaps = 9/177 (5%) Query: 54 TSGAHAPAFIRSRRDTTDANWSPWAQLYTSAHPPAEFYPV----GAPIPWPSDTVPSGYA 109 S A R W P G P+P+P P+G+ Sbjct: 503 NSSAVIHTTDRVYIAAAGGAWRSVYHEGNLTPAAIGAMPASELAGIPLPFPGAVAPTGWL 562 Query: 110 LMQGQTFDKSAYPKLAVAYPSGVIPDMRGWTIKGKP-----ASGRAVLSQEQDGIKSHTH 164 GQ+FDKS YP LA YPSGV+PD+RG ++G + RA+LS + D I++ Sbjct: 563 KCNGQSFDKSQYPILASRYPSGVLPDLRGEFVRGWDDGRGADASRALLSAQGDAIRNIVG 622 Query: 165 SASASSTDLGTETTSSFDYGTKSTNNTGAHTHSISGTANSAGAHQHKSSGAFGGTNT 221 + + + T T+ K T T G + A + + A Sbjct: 623 TIGQLNDRVNTTETAGVFDANKYTGAHSGLTGGNGGRIATFDASKVVPTAAENRPRN 679 >UniRef50_C7BSQ1 Side tail fiber protein homolog from lambdoid prophage e14 n=3 Tax=Photorhabdus RepID=C7BSQ1_PHOAA Length = 166 Score = 102 bits (254), Expect = 1e-20, Method: Composition-based stats. 
Identities = 49/153 (32%), Positives = 66/153 (43%), Gaps = 9/153 (5%) Query: 89 EFYPVGAPIPWPSDTVPSGYALMQGQTFDKSAYPKLAVAYPSGVIPDMRGWTIKGKPAS- 147 E PVG P+PWP+D P G+ G FDK YPKLAVAYPSG +PD+RG I+G Sbjct: 7 EEIPVGIPLPWPTDIPPYGWVKCNGAIFDKYLYPKLAVAYPSGNLPDLRGEFIRGWDDGR 66 Query: 148 ----GRAVLSQEQDGIKSHTHSASASSTDLGTETTSSFDYGTKSTNNTGAHTHSISG-TA 202 GR VLS + I H+H ++ + GT S + G Sbjct: 67 GVDIGRYVLSTQLADIAPHSHRIGRMWSNSNA---GAEGLGTPSRILNSVYQGVNYGIDT 123 Query: 203 NSAGAHQHKSSGAFGGTNTSIFPNGYTAISNLS 235 G SG FG + ++ + + Sbjct: 124 RGLGIAIGMGSGGFGYMDNAVAASTGIETRPRN 156 >UniRef50_B7US81 Predicted tail fiber protein n=16 Tax=root RepID=B7US81_ECO27 Length = 521 Score = 102 bits (254), Expect = 1e-20, Method: Composition-based stats. Identities = 38/125 (30%), Positives = 55/125 (44%), Gaps = 10/125 (8%) Query: 89 EFYPVGAPIPWPSDTVPSGYALMQGQTFDKSAYPKLAVAYPSGVIPDMRGWTIKGKP--- 145 PVG P+PW S T P+G+ G F YP+LA AYP+ +PD+RG I+G Sbjct: 384 SALPVGVPVPWSSATPPTGWLKCNGAAFSSEMYPRLARAYPTNKLPDLRGEFIRGWDDGR 443 Query: 146 --ASGRAVLSQEQDGIKSHTHSASASSTDLGTETTSSFDYGTKSTNNTGAHTHSISGTAN 203 +GR +LS + SH + S +Y +N G S +G + Sbjct: 444 GIDAGRTLLSGQDGTSFSHYGGNFDIGSG-----HSINNYDQIVSNQPGFSRFSFAGPSR 498 Query: 204 SAGAH 208 G + Sbjct: 499 GDGVN 503 >UniRef50_UPI0001B5347E putative variable tail fiber protein n=1 Tax=Shigella sp. D9 RepID=UPI0001B5347E Length = 550 Score = 102 bits (254), Expect = 1e-20, Method: Composition-based stats. 
Identities = 43/157 (27%), Positives = 67/157 (42%), Gaps = 14/157 (8%) Query: 89 EFYPVGAPIPWPSDTVPSGYALMQGQTFDKSAYPKLAVAYPSGVIPDMRGWTIKGKPAS- 147 PVG P+PWPS T P+G+ G F YPKLA YP+ +PD+RG I+G S Sbjct: 388 SALPVGVPVPWPSATPPTGWLKCNGAAFSAEEYPKLAKVYPTNKLPDLRGEFIRGWDDSR 447 Query: 148 ----GRAVLSQEQD-----GIKSHTHSASASSTDLG---TETTSSFDYGTKSTNNTGAHT 195 GR++LS + ++ + ++ +G S G + G ++ Sbjct: 448 GIDTGRSLLSGQAATFIRTALQDYYGYDLNTNVKVGIAFATADSVITVGNPANPKAGNNS 507 Query: 196 HSISGTA-NSAGAHQHKSSGAFGGTNTSIFPNGYTAI 231 + +A NS Q + F G S+ P + Sbjct: 508 DYVPASADNSITGTQRTAEDNFTGAWISMRPRNLSFN 544 >UniRef50_C6C6Z0 Tail Collar domain protein n=1 Tax=Dickeya dadantii Ech703 RepID=C6C6Z0_DICDC Length = 183 Score = 102 bits (253), Expect = 2e-20, Method: Composition-based stats. Identities = 36/146 (24%), Positives = 57/146 (39%), Gaps = 13/146 (8%) Query: 93 VGAPIPWPSDTVPSGYALMQGQTFDKSAYPKLAVAYPSGVIPDMRGWTIKGKP-----AS 147 +G P PWP P G+ GQ FD + YP+LA YPSG +PD+RG I+G + Sbjct: 46 IGIPQPWPLADAPEGWLKCNGQAFDTAKYPELAKCYPSGTLPDLRGEFIRGWDDGRGVDT 105 Query: 148 GRAVLSQEQDGIKSHTHSASASSTDLGTETTSSFDYGTKSTNNTGAHTHSISGTANSAGA 207 R ++S + + + S +G T D + ++ SI + Sbjct: 106 SRELVSAQSGTYITGDSDSQPSVQGIGNITECHVD-------SPDSNARSIYWIPATKTD 158 Query: 208 HQHKSSGAFGGTNTSIFPNGYTAISN 233 + +G T Y + Sbjct: 159 -RLTGPTYWGVTRPRNISFNYIVKAG 183 >UniRef50_B7MJL6 Putative phage tail protein n=3 Tax=Escherichia RepID=B7MJL6_ECO45 Length = 247 Score = 101 bits (251), Expect = 3e-20, Method: Composition-based stats. 
Identities = 42/139 (30%), Positives = 63/139 (45%), Gaps = 12/139 (8%) Query: 89 EFYPVGAPIPWPSDTVPSGYALMQGQTFDKSAYPKLAVAYPSGVIPDMRGWTIKGKP--- 145 PVG P+PWPS T P+G+ G F YP+LA AYP+ +PD+RG I+G Sbjct: 103 SALPVGVPVPWPSATPPTGWLKCNGAAFSAEEYPELAKAYPTNKLPDLRGEFIRGWDDGR 162 Query: 146 --ASGRAVLSQEQDGIKSHTHSASASSTDLGTETTSSFDYGTKSTNNTGAHTHSISGTAN 203 S RAVLS ++ + + + S L G K T++ G + S + T + Sbjct: 163 GVDSRRAVLSTQEPTVGTFYVELAIISGTLS-------GSGAKFTDSVGIGSTSSNITVS 215 Query: 204 SAGAHQHKSSGAFGGTNTS 222 + + A +T Sbjct: 216 NGNDQSVSGTVAVNPVDTR 234 >UniRef50_C5J9F2 Putative uncharacterized protein (Fragment) n=1 Tax=Erwinia phage phiAT1 RepID=C5J9F2_9VIRU Length = 240 Score = 101 bits (251), Expect = 3e-20, Method: Composition-based stats. Identities = 31/92 (33%), Positives = 42/92 (45%), Gaps = 8/92 (8%) Query: 85 HPPAEFYPVGAPIPWPSDTVPSGYALMQGQTFDKSAYPKLAVAYPSGVIPDMRGWTIKGK 144 P+GA IPWP TVP G+ GQ F+ PKL V+PD RG ++G Sbjct: 147 DLEPRLVPIGAVIPWPGATVPDGWLECSGQVFNTGQNPKLYSVLGRNVVPDYRGLFLRGW 206 Query: 145 P--------ASGRAVLSQEQDGIKSHTHSASA 168 +GRA+ S + D I++ T A Sbjct: 207 AHGSDANDPDAGRALGSVQGDAIRNITGYFPA 238 >UniRef50_Q1I687 Putative phage variable tail fibre protein n=1 Tax=Pseudomonas entomophila L48 RepID=Q1I687_PSEE4 Length = 898 Score = 101 bits (251), Expect = 4e-20, Method: Composition-based stats. 
Identities = 53/284 (18%), Positives = 97/284 (34%), Gaps = 41/284 (14%) Query: 2 NITALTDNTQGAAGLELYEVYNNGYPTAYGN-----IIHLKGMTAVGEGELLIGWSGTSG 56 ++T++ G + + Y P A G I G + L + W Sbjct: 497 DVTSIYGKAYGNSASATAKAYKESAPGALGGAIAGLISGSTGDSETRPRNLAVMWC---- 552 Query: 57 AHAPAFIRSRRDTTDANWSPWAQLYTSAHPPAEFYPVGAPIPWPSDTVPSGYALMQGQTF 116 I++ + A L PVGA +P+P VP+GY + G Sbjct: 553 ------IKAWNAPVNQGQIDVAALVAELKALRSSTPVGAILPFPKAEVPAGYLELDGSLQ 606 Query: 117 DKSAYPKLAVAYPS-----------GVIPDMRGWTIKGKP-----ASGRAVLSQEQDGIK 160 + YP LA + +PD RG ++G GR + + + D I+ Sbjct: 607 SVATYPDLAAYLGASYNNGTEPAGYFRLPDYRGEFLRGWDHGRGVDPGRGMGTSQSDAIQ 666 Query: 161 SHTHSA---SASSTDLGTETTSSFDYGTKSTNNTGAHTHSISGTANSAGAHQH------- 210 + T S + LG +S + T +T A+T + ++ +A Sbjct: 667 NITGSIGLRGGAGVGLGVMGGASGAFSTVFGESTSANTITRDASSIAASDIARFDASKVV 726 Query: 211 KSSGAFGGTNTSIFPNGYTAISNLSAGIMSTTSGSGQTRNAGKT 254 +++ N S+ + ++ G + + Q A +T Sbjct: 727 RAAAETRPRNQSVMWCIKAWSTPVNQGQVDVAALVSQVGPATET 770 Score = 80.4 bits (196), Expect = 7e-14, Method: Composition-based stats. Identities = 44/189 (23%), Positives = 72/189 (38%), Gaps = 18/189 (9%) Query: 87 PAEFYPVGAPIPWPSDTVPSGYALMQGQTFDKSAYPKLAVAYPS-----------GVIPD 135 A PVG +P+P TVP+G+ + G T + YP LA +PD Sbjct: 380 TASALPVGTMLPFPRGTVPAGFLEVDGSTQSAAVYPDLAAYLGGAFNTGNEAAGFFRLPD 439 Query: 136 MRGWTIKGKP-----ASGRAVLSQEQDGIKSHTHSASASSTDLGTETTSSFDYGTKSTNN 190 RG ++G SGRAV S + + K+HTH ++G + +S G Sbjct: 440 TRGEFLRGWDHGRGVDSGRAVGSTQGESFKAHTHKDVGFIDNVGGGSGASAVTGATGDVT 499 Query: 191 TGAHTHSISGTANSAGAHQHKSSGAFGGTNTSIFPN--GYTAISNLSAGIMSTTSGSGQT 248 + + + +A A++ + GA GG + G + + +M Sbjct: 500 SIYGKAYGNSASATAKAYKESAPGALGGAIAGLISGSTGDSETRPRNLAVMWCIKAWNAP 559 Query: 249 RNAGKTSSD 257 N G+ Sbjct: 560 VNQGQIDVA 568 >UniRef50_D2BYH6 Tail Collar domain protein n=1 Tax=Dickeya dadantii Ech586 RepID=D2BYH6_DICD5 Length = 198 Score = 101 bits (250), Expect = 4e-20, Method: Composition-based stats. 
Identities = 37/135 (27%), Positives = 53/135 (39%), Gaps = 8/135 (5%) Query: 93 VGAPIPWPSDTVPSGYALMQGQTFDKSAYPKLAVAYPSGVIPDMRGWTIKGKP-----AS 147 VG P WP P G+ GQ FDK+ YP+LA YP+G +PD+RG I+G + Sbjct: 59 VGIPQAWPLADAPEGWLKCNGQAFDKTKYPQLAKLYPAGTLPDLRGEFIRGWDDGRGVDT 118 Query: 148 GRAVLSQEQDGIKSHTHSASASSTDLGTETTSSFDYGTKSTNNTGAHTHSISGTANSAGA 207 R +LS + ++SH H S + + S SG + S Sbjct: 119 NRQILSAQSGMLESHNHMMPVSDPSKWNGAVYGY---ANDQPSANIEDFSQSGVSTSREL 175 Query: 208 HQHKSSGAFGGTNTS 222 N + Sbjct: 176 TSLTGGNETRPRNIA 190 >UniRef50_P33227 Putative protein stfE (Fragment) n=60 Tax=Enterobacteriaceae RepID=STFE_ECOLI Length = 166 Score = 100 bits (249), Expect = 6e-20, Method: Composition-based stats. Identities = 39/129 (30%), Positives = 61/129 (47%), Gaps = 6/129 (4%) Query: 89 EFYPVGAPIPWPSDTVPSGYALMQGQTFDKSAYPKLAVAYPSGVIPDMRGWTIKGKP--- 145 PVG P+PWPS T P+G+ G F YP+LA AYP+ +PD+RG I+G Sbjct: 7 SALPVGVPVPWPSATPPTGWLKCNGAAFSAEEYPELAKAYPTNKLPDLRGEFIRGWDDGR 66 Query: 146 --ASGRAVLSQEQDGIKSHTHSASASSTDLGTETTSSFDYGTKSTNNTGAHTHSISGTAN 203 +GR++LS + + H H + ST + T+ T +F + N+ + Sbjct: 67 GIDTGRSILSIQGYATEDHAHGLPSRSTIV-TDATINFYFDEIWVNSGTDIIKRGNTNDA 125 Query: 204 SAGAHQHKS 212 A + + Sbjct: 126 GLPAPDYGT 134 >UniRef50_Q6D3Y6 Probable bacteriophage variable tail fiber protein H n=2 Tax=Pectobacterium atrosepticum RepID=Q6D3Y6_ERWCT Length = 536 Score = 100 bits (247), Expect = 9e-20, Method: Composition-based stats. Identities = 37/88 (42%), Positives = 50/88 (56%), Gaps = 5/88 (5%) Query: 90 FYPVGAPIPWPSDTVPSGYALMQGQTFDKSAYPKLAVAYPSGVIPDMRGWTIKGKP---- 145 G P PWP T P+G+ GQ+FD SA+P LA AYPSGV+PD+RG I+G Sbjct: 384 ALTAGMPKPWPRATAPAGWLKCNGQSFDISAFPHLAAAYPSGVLPDLRGEFIRGWDDGRG 443 Query: 146 -ASGRAVLSQEQDGIKSHTHSASASSTD 172 SGR++LS + D I++ S+ Sbjct: 444 VDSGRSLLSAQSDAIRNIVGEIWTSAVS 471 >UniRef50_B3I8J5 DNA inversion product n=3 Tax=Enterobacteriaceae RepID=B3I8J5_ECOLX Length = 263 Score = 99.7 bits (246), Expect = 1e-19, Method: Composition-based stats. 
Identities = 46/176 (26%), Positives = 74/176 (42%), Gaps = 14/176 (7%) Query: 70 TDANWSPWAQLYTSAHPPAEFYPVGAPIPWPSDTVPSGYALMQGQTFDKSAYPKLAVAYP 129 +D + L PVGAP+PWPS+T P+G+ G F YP+LA AYP Sbjct: 82 SDGAAAISTALTNLGLGEGSALPVGAPVPWPSETPPTGWLKCNGAAFSAEEYPELAKAYP 141 Query: 130 SGVIPDMRGWTIKGKPAS-----GRAVLSQEQD-----GIKSHTHSASASSTDLG---TE 176 + +PD+RG I+G S GR++LS + ++ + ++ +G Sbjct: 142 TNKLPDLRGEFIRGWDDSRGIDTGRSLLSGQAATFIRTALQDYYGYDLNTNVKVGIAFAT 201 Query: 177 TTSSFDYGTKSTNNTGAHTHSISGTA-NSAGAHQHKSSGAFGGTNTSIFPNGYTAI 231 S G + G ++ + +A NS Q + F G S+ P + Sbjct: 202 ADSVITVGNPANPKAGNNSDYVPASADNSITGTQRTAEDNFTGAWISMRPRNLSFN 257 >UniRef50_D0ZBI4 Putative tail fiber protein n=1 Tax=Edwardsiella tarda EIB202 RepID=D0ZBI4_EDWTE Length = 718 Score = 99.3 bits (245), Expect = 2e-19, Method: Composition-based stats. Identities = 52/217 (23%), Positives = 80/217 (36%), Gaps = 31/217 (14%) Query: 2 NITALTDNTQGAAGLELYEVYNNGYPTAYGNIIHLKGMTAVGEGELLIGWSGTSGAHAPA 61 N A QG G+ + G P++ + G G G L + + SG + Sbjct: 475 NGGAFAGCNQG--GIYEVSI---GTPSSVADFPMKNGTYIYGYGVLYV--TSNSGTISQL 527 Query: 62 FIRSRRDTTDA--------NWSPWAQLYTSAHPPAEFYPVGAPIPWPSDTVPS------- 106 +I S A N+ WA ++ +G+ IPW + +P Sbjct: 528 YI-SHNGQIAARIKWGDQPNFKSWAVYDPNSSFEYGCPLIGSLIPWALERMPQEIWPNCG 586 Query: 107 -GYALMQGQTFDKSAYPKLAVAYPSGVIP-DMRGWTIKGKP-----ASGRAVLSQEQDGI 159 + GQ+FD +PKL YP +P DMRG+T +G GRA+LS + D I Sbjct: 587 MHFIPYMGQSFDPELFPKLHDVYPDNRLPTDMRGYTARGWDNGRGIDIGRALLSYQDDAI 646 Query: 160 KSHTHSAS-ASSTDLGTETTSSFDYGTKSTNNTGAHT 195 ++ T + +F N G T Sbjct: 647 QNITGQFGWMPFNGSSPVASGAFSVDKIGANVWGGGT 683 >UniRef50_B6VKW8 Sc/svq protein n=2 Tax=Photorhabdus asymbiotica subsp. asymbiotica ATCC 43949 RepID=B6VKW8_PHOAA Length = 316 Score = 98.9 bits (244), Expect = 2e-19, Method: Composition-based stats. 
Identities = 41/173 (23%), Positives = 63/173 (36%), Gaps = 27/173 (15%) Query: 17 ELYEVYNNGYPTAYGNIIHLKGMTAVGEGELLIGWSGTSGAHAPAFIRSRRDTTDANWSP 76 E+ N + G + L + + A+ R+ D Sbjct: 127 EIVNSLRENINGKVPNSWRINGKALTEDINL-------NASDVGAYTRAEVDR------- 172 Query: 77 WAQLYTSAHPPAEFYPVGAPIPWPSDTVPSGYALMQGQTFDKSAYPKLAVAYPSGVIPDM 136 PVG+PIPWP P GY G F++S YPKLA AYP+G +PD+ Sbjct: 173 -------LIKKTSEIPVGSPIPWPLPHPPFGYVTCNGSAFNRSQYPKLAEAYPNGRLPDL 225 Query: 137 RGWTIKGKPAS-----GRAVLSQEQD-GIKSHTHSASASSTDLGTETTSSFDY 183 RG I+G GR +LS ++ + + S + + + Sbjct: 226 RGEFIRGWDDGRGADNGRKLLSWQEGSALSEYLGSFTTGVAQNIHQRDGVTYH 278 >UniRef50_Q0BEK5 Phage Tail Collar domain protein n=1 Tax=Burkholderia ambifaria AMMD RepID=Q0BEK5_BURCM Length = 735 Score = 98.9 bits (244), Expect = 2e-19, Method: Composition-based stats. Identities = 58/279 (20%), Positives = 91/279 (32%), Gaps = 75/279 (26%) Query: 74 WSPWAQLYTSAHPPAEF--YPVGAPIPWPSDTVPSGYALMQGQTFDKSAYPKL-AVAYPS 130 +L TSA A +G + +G+ + G ++ YP L A A S Sbjct: 498 GDNSNKLITSAWFAAAVADVQIGQIVWEARTAPRAGFLKLNGTELKRADYPLLWAYAQGS 557 Query: 131 G-------------------------VIPDMRGWTIKGKPASG-----RAVLSQEQDGIK 160 G +PD+RG I+ + R + S + + Sbjct: 558 GALVADADWGKGRHGCFSSGDGNTTFRLPDLRGEFIRCWDDARGTDAQRQIGSWQDSLNR 617 Query: 161 SHTHSASASSTDLGTETTSSFDYGTKSTNNTGAHTHSISGTANSAGAHQHKSSGAFGGTN 220 H H ASA++ + T++ G H HSI+ + G G G N Sbjct: 618 LHAHGASAAAVGDHSH--------GAWTDSQGWHGHSINDPGHDHGIPVASGGGYIGEIN 669 Query: 221 TSIFPNGYTAISNLSAGIMSTTSGSGQTRNAGKTSSDGAHTHSLSGTAASAGAHAHTVGI 280 + G + G + GAH H VGI Sbjct: 670 LNGGGRGDKRTTGSGTG----------------------------ISINGDGAHGHNVGI 701 Query: 281 GAHTHSVAIGSHGHTITVNAAGNAENTVKNIAFNYIVRL 319 G G+H HTI++ A G E+ +N+A ++R Sbjct: 702 GG------AGAHSHTISIGADGGNESRPRNVALLVMIRA 734 >UniRef50_C3X8I7 Putative uncharacterized protein n=1 Tax=Oxalobacter formigenes OXCC13 RepID=C3X8I7_OXAFO Length = 369 Score = 98.9 bits (244), Expect = 2e-19, Method: Composition-based stats. 
Identities = 44/230 (19%), Positives = 76/230 (33%), Gaps = 23/230 (10%) Query: 83 SAHPPAEFYPVGAPIPWPSDTVPSGYALMQGQTFDKSAYPKLAVAYP----------SGV 132 A PVG+ + + T P+GY G + YP+L A + Sbjct: 98 EAEIAKRGIPVGSIDYFATSTPPAGYLKADGSEVGRETYPELFTAIGTVFGEGNGDSTFN 157 Query: 133 IPDMRGWTIKGKPASGRAVLSQEQDGIKSHTHSASASSTDLGTETTSSFDYGTKSTNNTG 192 +PD+ G +G G+ + + G+ H H G SS+ T + Sbjct: 158 LPDLMGRFAQGSTIVGQRI----KAGLPDHKH----IEGFAGVNPNSSYGVATTA-PQGN 208 Query: 193 AHTHSISGTANSAGAHQHKSSGAFGGTNTSIFPNGYTAISNLSAGIMSTTSGSGQTRNAG 252 +T S + +N S G + ++ P T + + A +T G Sbjct: 209 INTQSGTSVSNHPYTSPASLSNPIYGASDTVQPPALTLLPCIKAFDAATGPGLIDVTGLS 268 Query: 253 KTSSDGAHTHSLSGTAASAGAHAHTVGIGAHTHSVAIGSHGHTITVNAAG 302 + + A +L G G H V H + + + V+ G Sbjct: 269 QEIALKADKKNLFG----IGQTYHDVTHERQNHVIYTNTSSKPLFVSIYG 314 >UniRef50_A9Q1X5 Putative tail fiber protein n=1 Tax=Enterobacteria phage phiEcoM-GJ1 RepID=A9Q1X5_9CAUD Length = 356 Score = 98.5 bits (243), Expect = 3e-19, Method: Composition-based stats. Identities = 75/239 (31%), Positives = 122/239 (51%), Gaps = 14/239 (5%) Query: 49 IGWSGTSGAHAPAFIRSRRDT-TDANWSPWAQLYTSAHPPAEFYPVGAPIPWPSDTVPSG 107 + T+ ++ ++RS+ T +DAN+ W + + YP+G + + + T P+ Sbjct: 74 VMQRYTNFSNKRMWVRSQNGTVSDANFDEWTEFVNMNNIYNAIYPIGIVVKFDNATNPNN 133 Query: 108 YALMQGQTFDKSAYPKLAVAY--PSGVIPDMRGWTIKGKPASGRAV--LSQEQDGIKSHT 163 G +++ ++A A P D + +I G + AV L G+++HT Sbjct: 134 N--FTGTVWEQIIDGRVARAATGPEAGTADGQIGSIAGSDTANIAVTNLPGHTHGMQNHT 191 Query: 164 HSASASSTDLGTETTSSFDYGTKSTNNTGAHTHSISGTANSAGAHQHKSSGAFGGTNTSI 223 H ++ S + T + D+G +++++GAHTHS+SGTA SAGAHQH F G + Sbjct: 192 HGIASHSHTMAHTHTINHDHGAVTSSSSGAHTHSVSGTAASAGAHQHTEGSPFTGD-VNF 250 Query: 224 FPNGYTAISNLSAGIMSTTSGSGQTRNAGKTSSDGAHTHSLSGTAASAGAHAHTVGIGA 282 T+ N+S + S ++ TSS GAHTHS+SGTAASAGAH H+V + Sbjct: 251 GTTTSTSKDNISDWLYSPSTRYPL------TSSSGAHTHSVSGTAASAGAHTHSVDLPN 303 >UniRef50_Q9MCR6 Tail fiber n=1 Tax=Enterobacteria phage HK97 RepID=Q9MCR6_BPHK7 Length = 321 Score = 98.1 bits (242), Expect = 4e-19, Method: Composition-based stats. 
Identities = 44/180 (24%), Positives = 70/180 (38%), Gaps = 11/180 (6%) Query: 36 LKGMTAVGEGELLIGWSGTSGAHAPAFIRSRRDTTDANWSPWAQLYTSAHPPAEFYPVGA 95 + G G + G++GA A D + PVG Sbjct: 118 INGTAVTIPGIGKLAQKGSNGAVTVA------DGGTGATNAADARTNLGLGEGSALPVGV 171 Query: 96 PIPWPSDTVPSGYALMQGQTFDKSAYPKLAVAYPSGVIPDMRGWTIKGKP-----ASGRA 150 P+PWPS T P+G+ G F YP+LA AYP+ +PD+RG I+G +GR Sbjct: 172 PVPWPSATPPTGWLKCNGAVFSAEEYPELAKAYPTNKLPDLRGEFIRGWDDGRGIDAGRE 231 Query: 151 VLSQEQDGIKSHTHSASASSTDLGTETTSSFDYGTKSTNNTGAHTHSISGTANSAGAHQH 210 +LS + D I++ T + T++ + G T +T + + + Sbjct: 232 ILSAQGDAIRNITGTFGDGETEVNASISFYRADGVFVTQKKLRNTIGNTTIIADTPNNPY 291 >UniRef50_A3GRX3 Tail fiber like-protein n=1 Tax=Vibrio cholerae NCTC 8457 RepID=A3GRX3_VIBCH Length = 182 Score = 97.8 bits (241), Expect = 5e-19, Method: Composition-based stats. Identities = 69/244 (28%), Positives = 104/244 (42%), Gaps = 66/244 (27%) Query: 79 QLYTSAH--PPAEFYPVGAPIPWPSDTVPSGYALMQGQTFDKSAYPKLAVAYPSGVIPDM 136 +L + + +PVG IPW +D P G+ + +GQ FD + Y +LA +P+G+IPDM Sbjct: 3 RLVNNLWLKFAVKIFPVGGAIPWFTDVAPEGFGMFKGQAFDVNVYTELAKVFPNGIIPDM 62 Query: 137 RGWTIKGKPASGRAVLSQEQDGIKSHTHSASASSTDLGTETTSSFDYGTKSTNNTGAHTH 196 RG + GK G AV + E+ +K+H H S SS D G+K+T N G HTH Sbjct: 63 RGCGVIGKED-GEAVGAYEEGQVKNHGHPNST---------VSSIDLGSKNTANGGNHTH 112 Query: 197 SISGTANSAGAHQHKSSGAFGGTNTSIFPNGYTAISNLSAGIMSTTSGSGQTRNAGKTSS 256 A G+H++++ + TS+ Sbjct: 113 FSGIAAFGGGSHRYQTD-------------------------------VNGSGGNINTSA 141 Query: 257 DGAHTHSLSGTAASAGAHAHTVGIGAHTHSVAIGSHGHTITVNAAGNAENTVKNIAFNYI 316 G H HS + +GSH H +T+ G +NT+ + N+I Sbjct: 142 AGNHYHS-----------------------IPMGSHAHAVTIALFGALKNTINHRKINWI 178 Query: 317 VRLA 320 VRLA Sbjct: 179 VRLA 182 >UniRef50_Q3KH70 Putative phage tail fiber-related protein n=1 Tax=Pseudomonas fluorescens Pf0-1 RepID=Q3KH70_PSEPF Length = 817 Score = 97.8 bits (241), Expect = 5e-19, Method: Composition-based stats. 
Identities = 52/315 (16%), Positives = 96/315 (30%), Gaps = 50/315 (15%) Query: 7 TDNTQGAAGLELYEVYNNGYPTAYGNIIHLKGM------------------TAVGEGELL 48 QG++ + + + + PT G + T EL Sbjct: 395 GGRAQGSSQTDSLKAHYHLIPTGSGGGQAVDPNGEIPTVVLKDTAADWVLRTEGDNAELS 454 Query: 49 IGWSGTSGAHAPAFIRSRRDTTDANWSPWAQLYTSAHPPA-----------EFYPVGAPI 97 IG T A R R W + PVGA + Sbjct: 455 IGRVRTYNFGAATETRPRNIAVMWCIKAWNAPVNQGNIDVAALVKEVSRLGSAVPVGAVM 514 Query: 98 PWPSDTVPSGYALMQGQTFDKSAYPKLAVAYPS-----------GVIPDMRGWTIKGKP- 145 +P+ VP G+ + G + S YP LA + +P+ RG ++G Sbjct: 515 AFPTGIVPPGFLELNGSVQNTSTYPDLAAYLGTTYNKGDEGAGNFRLPESRGEFLRGWDH 574 Query: 146 ----ASGRAVLSQEQDGIKSHTHSASASSTD-----LGTETTSSFDYGTKSTNNTGAHTH 196 +GR + + + + H H+ + + SF + GA Sbjct: 575 GRGVDAGRGIGTNQGQSMVDHYHTVLTADAGGVLNPIAGNLVGSFTNLAPISKPAGAGVL 634 Query: 197 SISGTANSAGAHQHKSSGAFGGTNTSIFPNGYTAISNLSAGIMSTTSGSGQTRNAGKTSS 256 + T++ G K N ++ + ++ G + + + + A +T+ Sbjct: 635 GATLTSSIHGPAAEKGGTETRPRNLAVMWCIKAWNAPINQGNIDIAALAVLAQQASETNQ 694 Query: 257 DGAHTHSLSGTAASA 271 A + + T A A Sbjct: 695 GTAKVATQAQTNAGA 709 Score = 70.4 bits (170), Expect = 9e-11, Method: Composition-based stats. 
Identities = 39/263 (14%), Positives = 71/263 (26%), Gaps = 25/263 (9%) Query: 22 YNNGYPTAYGNIIHLKGMTAVGEGELLIGWSGTSGAHAPAFIRSRRDTTDANW----SPW 77 P G M A G + S T A + S Sbjct: 250 LTTNAPITLGTTALTFKMLAGRTGIAAGTYKSLSVDEYGRATAGSNPDTLAGFGIKDSYT 309 Query: 78 AQLYTSAHPPAEFYPVGAPIPWPSDTVPSGYALMQGQTFDKSAYPKLAVAY--------- 128 + A PVG+ + +P D+ P G+ + + YP L+ Sbjct: 310 KAEVEALIAKASALPVGSIVAFPVDSPPPGFLELDNSVKSSATYPDLSAYLGGKFNKGDE 369 Query: 129 --PSGVIPDMRGWTIKGKP-----ASGRAVLSQEQDGIKSHTHSASASSTDLGTETTSSF 181 + +P+ RG ++G GRA S + D +K+H H S + Sbjct: 370 GVGNFRLPEARGEFLRGWDHGRGVDGGRAQGSSQTDSLKAHYHLIPTGSGGGQAVDPNGE 429 Query: 182 DYGTKSTNNTGAHTHSISGTANSAGA-----HQHKSSGAFGGTNTSIFPNGYTAISNLSA 236 + G + ++ N ++ + ++ Sbjct: 430 IPTVVLKDTAADWVLRTEGDNAELSIGRVRTYNFGAATETRPRNIAVMWCIKAWNAPVNQ 489 Query: 237 GIMSTTSGSGQTRNAGKTSSDGA 259 G + + + G GA Sbjct: 490 GNIDVAALVKEVSRLGSAVPVGA 512 >UniRef50_C4S5W0 Putative uncharacterized protein n=2 Tax=Yersinia bercovieri ATCC 43970 RepID=C4S5W0_YERBE Length = 388 Score = 97.4 bits (240), Expect = 6e-19, Method: Composition-based stats. Identities = 48/168 (28%), Positives = 71/168 (42%), Gaps = 8/168 (4%) Query: 45 GELLIGWSGTSGAHAPAFIRSRRDTT---DANWSPWAQLYTSAHPPAEFYPVGAPIPWPS 101 G++ G + GA R D N AE +G PIP+P Sbjct: 192 GDVRGGRIISKGAVYAGEERVEGSAALIVDGNIQGTLWGGNLYAYLAERELIGIPIPYPL 251 Query: 102 DTVPSGYALMQGQTFDKSAYPKLAVAYPSGVIPDMRGWTIKGKP-----ASGRAVLSQEQ 156 +VP GY G F YPKLA+ YPSGV+PDMRG I+G +GRA+LSQ+ Sbjct: 252 PSVPVGYLKCNGAAFSTVTYPKLALKYPSGVLPDMRGNAIRGWDDGRGVDAGRALLSQQL 311 Query: 157 DGIKSHTHSASASSTDLGTETTSSFDYGTKSTNNTGAHTHSISGTANS 204 D +++ T + + ++ +G N + + +G Sbjct: 312 DALQNITGNFYMGGSKQVAGVVTTGAFGPMEVYNALGNQVTTAGNIGG 359 >UniRef50_C6CP84 Tail Collar domain protein n=2 Tax=Dickeya RepID=C6CP84_DICZE Length = 646 Score = 96.6 bits (238), Expect = 1e-18, Method: Composition-based stats. 
Identities = 42/138 (30%), Positives = 62/138 (44%), Gaps = 8/138 (5%) Query: 69 TTDANWSPWAQLYTSAHPPAEFYPVGAPIPWPSDTVPSGYALMQGQTFDKSAYPKLAVAY 128 T A + TS AE G P+PWP T P+G+ GQ+FDK YP+LA Y Sbjct: 480 TAPAAGDNSSTAITSGWFAAEL--AGIPLPWPQATAPTGWLKCNGQSFDKKLYPRLAQVY 537 Query: 129 PSGVIPDMRGWTIKGKPAS-----GRAVLSQEQDGIKSHTHSASASSTDLG-TETTSSFD 182 PSGV+PD+RG I+G R +LS + D I++ S + T + Sbjct: 538 PSGVLPDLRGEFIRGWDDGRGVDNNRGLLSSQGDTIRNIVASFVMDDQAVTINAPTGAMF 597 Query: 183 YGTKSTNNTGAHTHSISG 200 ++ + ++ G Sbjct: 598 PSSQIAYDANSNVGGTMG 615 >UniRef50_D0KLI8 Tail Collar domain protein n=1 Tax=Pectobacterium wasabiae WPP163 RepID=D0KLI8_PECWW Length = 621 Score = 96.2 bits (237), Expect = 1e-18, Method: Composition-based stats. Identities = 42/149 (28%), Positives = 61/149 (40%), Gaps = 8/149 (5%) Query: 88 AEFYPVGAPIPWPSDTVPSGYALMQGQTFDKSAYPKLAVAYPSGVIPDMRGWTIKGKP-- 145 VG P +P P+G+ GQ FD + YP LA YPSG +PD+RG ++G Sbjct: 465 PSAELVGMPQVFPGAVAPAGWLKCNGQQFDTAQYPILASRYPSGFLPDLRGEFVRGWDDE 524 Query: 146 ---ASGRAVLSQEQDGIKSHTHSASASSTDLGTETTSS---FDYGTKSTNNTGAHTHSIS 199 +GRA+LS++ D I++ T + AS G D + T S Sbjct: 525 RGVDAGRALLSEQGDAIRNITGTMRASDVPYGHTQFVDALKADGVFAPIAGDKSWTGDSS 584 Query: 200 GTANSAGAHQHKSSGAFGGTNTSIFPNGY 228 G A + +S N + N Sbjct: 585 GNAGNPWGVSFDTSRVVPTANENRPRNIA 613 >UniRef50_C6C5D4 Tail Collar domain protein n=1 Tax=Dickeya dadantii Ech703 RepID=C6C5D4_DICDC Length = 557 Score = 95.5 bits (235), Expect = 2e-18, Method: Composition-based stats. 
Identities = 50/138 (36%), Positives = 71/138 (51%), Gaps = 9/138 (6%) Query: 69 TTDANWSPWAQLYTSAHPPAEFYPVGAPIPWPSDTVPSGYALMQGQTFDKSAYPKLAVAY 128 T A TS AE G P+PWP T P+G+ GQ+FDK+ YPKL AY Sbjct: 402 TAPAAGDNSTSAITSGWFAAEI--AGIPLPWPQATAPTGWLKCNGQSFDKTLYPKLTAAY 459 Query: 129 PSGVIPDMRGWTIKGKP-----ASGRAVLSQEQDG-IKSHTHSASASSTDLGTETTSSFD 182 PSG +PD+RG I+G SGRAVLS + I+ + S +A++T S+F+ Sbjct: 460 PSGTLPDLRGEFIRGWDDGRGVDSGRAVLSVQDATWIQPNIESNTAATTIRIDNVDSTFN 519 Query: 183 YG-TKSTNNTGAHTHSIS 199 + +N ++ H+ S Sbjct: 520 TDEYSAVSNLPSYEHNGS 537 >UniRef50_Q65WH4 Putative uncharacterized protein n=1 Tax=Mannheimia succiniciproducens MBEL55E RepID=Q65WH4_MANSM Length = 296 Score = 94.7 bits (233), Expect = 4e-18, Method: Composition-based stats. Identities = 40/155 (25%), Positives = 60/155 (38%), Gaps = 5/155 (3%) Query: 78 AQLYTSAHPPAEFYPVGAPIPWPSDTVPSGYALMQGQTFDKSAYPKLAVAYPSGVIPDMR 137 + A + +G P P+P VP G GQTF + YP+LA YPSG +PD+R Sbjct: 125 KKATNEAFSTLKNLLIGIPFPYPLSAVPDGCLAFNGQTFSTTTYPELAKKYPSGRLPDLR 184 Query: 138 GWTIKGKP-----ASGRAVLSQEQDGIKSHTHSASASSTDLGTETTSSFDYGTKSTNNTG 192 G I+G S R +L + + +HTH + + + + + NN+G Sbjct: 185 GEFIRGWDNGRGVDSSRELLRSQGAELSAHTHYVTVTRYANSSGEFGAKISTFSAINNSG 244 Query: 193 AHTHSISGTANSAGAHQHKSSGAFGGTNTSIFPNG 227 G +A S N G Sbjct: 245 WLLSGADGLLLAANKSGEIVSEKNSVANLISNTGG 279 >UniRef50_B7NJP1 Putative side tail fiber protein homolog from lambdoid prophage n=3 Tax=Escherichia coli RepID=B7NJP1_ECO7I Length = 686 Score = 93.5 bits (230), Expect = 8e-18, Method: Composition-based stats. 
Identities = 36/145 (24%), Positives = 57/145 (39%), Gaps = 5/145 (3%) Query: 89 EFYPVGAPIPWPSDTVPSGYALMQGQTFDKSAYPKLAVAYPSGVIPDMRGWTIKGKPAS- 147 PVG P+PWP+ T P G+ G+ F K YP LA AYP+ +PD+RG I+G Sbjct: 534 SALPVGVPVPWPTATPPEGWLKCDGRAFTKEQYPVLARAYPTLRLPDLRGEFIRGWDDGR 593 Query: 148 ----GRAVLSQEQDGIKSHTHSASASSTDLGTETTSSFDYGTKSTNNTGAHTHSISGTAN 203 GR +LS ++ + ++ ++ DYG + + Sbjct: 594 KIDEGRKLLSWQKGTLVGGHDDNDSALDISYMSNGNNIDYGGDKVFAGNYRSDYLWYAVL 653 Query: 204 SAGAHQHKSSGAFGGTNTSIFPNGY 228 + K+ N + N Sbjct: 654 GGTNSRAKAELNGAFFNITRPRNIA 678 >UniRef50_B9BDD9 Bacteriophage protein n=3 Tax=Burkholderia RepID=B9BDD9_9BURK Length = 536 Score = 92.0 bits (226), Expect = 3e-17, Method: Composition-based stats. Identities = 58/345 (16%), Positives = 106/345 (30%), Gaps = 53/345 (15%) Query: 7 TDNTQGAAGLELYEVYNNGYPTAYGNIIHLKGMTAVGEGELLIGWSGTSGAHAPAFIRSR 66 N GA+ + Y + I + +G +G + Sbjct: 212 AGNGDGASASTTNVALRSWYGIGFAPTIDGMPVPRTEFSHWFDTRTGNTGFRGTLDVGGL 271 Query: 67 RDTTDA-NWSPWAQLYTSAHPPAEFYP--VGAPIPWPSDTVPSGYALMQGQTFDKSAYPK 123 + ++ T+ A +G + P +V +G+ + G ++S YP Sbjct: 272 ITAQTPPSGDASKRVPTTEWVVAAIASAGIGTIVFEPRTSVRAGFLKLNGALVNRSDYPA 331 Query: 124 LAVAY---------------------------PSGVIPDMRGWTIKGKPASGRAVLSQEQ 156 L AY + +P++RG ++ A ++ Sbjct: 332 L-WAYAQASGALVAESAWGQNNWGCFSTGDGATTFRLPELRGEFLRCWDDGRGADSARGI 390 Query: 157 DGIKSHTHSASASSTDLGTETTSSFDYGTKSTNNTGAHTHSISGTANSAGAHQHKS--SG 214 +S ++ A + T + G H + G ++ Sbjct: 391 GTFQSFQNAWHAHGASSAAVGDHTHGAWTDAQGWHGHHGWTGGGGGHNHNNGIFSRLLRP 450 Query: 215 AFGGTNTSIFPNGYTAISNLSAGIMSTTSGSGQTRNAGKTSSDGAHTHSLSGTAASAGAH 274 +GG+ T G + + AG + + SG + T G H+H++ AGAH Sbjct: 451 PYGGSLTGSDQAGSGSEQAVGAGDSADIAWSGDHAHEFNTEGSGTHSHNV--GIGGAGAH 508 Query: 275 AHTVGIGAHTHSVAIGSHGHTITVNAAGNAENTVKNIAFNYIVRL 319 A H ITVN G E +NIA ++R Sbjct: 509 A------------------HAITVNGDGGNEARPRNIAMLAMIRA 535 >UniRef50_P51735 Probable tail fiber protein n=27 Tax=root RepID=VPH_BPHP1 Length = 925 Score = 91.6 bits (225), Expect = 3e-17, Method: Composition-based stats. 
Identities = 52/333 (15%), Positives = 90/333 (27%), Gaps = 61/333 (18%) Query: 4 TALTDNTQGAAGLELYEVYNNGYPTAYGNIIHLKGMTAVGEGELLIGWSGTSGAHAPAFI 63 L G +E + NGY T GN G GE I +A I Sbjct: 447 NTLAGYGIGNFKVEQGQGDANGYKTD-GNYYLASGQNLPENGEWHIEVVSGGATNAVRQI 505 Query: 64 -RSRRDT-------TDANWSPWAQLYTSAHPPAEFYPVGAPIPWPSD-TVPSGYALMQGQ 114 R D +NWS W P+G+ + +P T P G+ G Sbjct: 506 ARKANDNKIKTRFFNGSNWSEWKDAGGDG------VPIGSVVSFPRAVTNPVGFLKANGT 559 Query: 115 TFDKSAYPKLAVAYP-SGVIPDMRGWTIKGKPASGRAVLSQEQDGIKSHTHSASASSTDL 173 TF++ +P L S +PD+ + V + + A + Sbjct: 560 TFNQQTFPDLYRTLGDSNQLPDL----------TRSDVGMTAYFAVDNIPSGWIAFDSIR 609 Query: 174 GTETTSSFD----YGTKSTNNTGAHTHSISGTANSAGAHQHKSSGAFGGTNTSIFPNGYT 229 T T ++ Y ++ + + G + + Sbjct: 610 STVTQQNYPELYQYLVDKYSSISNVPLAEDRFIRNTGNGLNIGQTQSDEIKKHVHRVRTH 669 Query: 230 AISNL-SAGIMSTTSGSGQTRNAGKTSSDGAHTHSLSGTAASAGAHAHTVGIGAHTHSVA 288 + S+ T +R T++D + + Sbjct: 670 WADSSDSSIFYDKTKTVIDSRLRTATTTDDNLSDNGFM---------------------- 707 Query: 289 IGSHGHTI--TVNAAGNAENTVKNIAFNYIVRL 319 H + T A G E K++ ++ Sbjct: 708 -----HPLLDTPMATGGDETRPKSLILKLCIKA 735 >UniRef50_C8PDQ5 Phage Tail Collar Domain protein n=1 Tax=Campylobacter gracilis RM3268 RepID=C8PDQ5_9PROT Length = 391 Score = 91.6 bits (225), Expect = 4e-17, Method: Composition-based stats. 
Identities = 39/177 (22%), Positives = 60/177 (33%), Gaps = 20/177 (11%) Query: 28 TAYGNIIHLKGMTAVGEGELLIGWSGTSGAHAPAFIRSRRDT-----TDANWSPWAQLYT 82 + N + L+ G+ + A I S+ + A+ + + + Sbjct: 166 NSLANTLVLRDANGDFAGKYVTAGHFKLTAPVQNNIFSKNNEILFRVGAADNDNYTRAVS 225 Query: 83 SAHPPAEFYPVGAPIPWPSDTVPSGYALMQGQTFDKSAYPKLAVAYP----------SGV 132 + + PVG I P G+ L G +SAY L A S Sbjct: 226 FSLLSSTILPVGTIITSARTPAPDGFLLCNGAAISRSAYTDLFSAIGTAYGAGDGSSSFN 285 Query: 133 IPDMRGWTIKGKP-----ASGRAVLSQEQDGIKSHTHSASASSTDLGTETTSSFDYG 184 IPD+RG I+G GRA+ S + D I++ T A T YG Sbjct: 286 IPDLRGEFIRGADNGRGVDGGRALGSAQGDAIRNITARAIGMGDRNSIPTLLGALYG 342 >UniRef50_C3KCU2 Putative phage tail fiber-related protein n=1 Tax=Pseudomonas fluorescens SBW25 RepID=C3KCU2_PSEFS Length = 658 Score = 91.2 bits (224), Expect = 5e-17, Method: Composition-based stats. Identities = 42/244 (17%), Positives = 74/244 (30%), Gaps = 44/244 (18%) Query: 26 YPTAYGNIIHLKGMTAVGEGELLIGWSGTSGAH-------------APAFIRSRRDTTDA 72 YP +Y ++ G G+ + TS A A++ A Sbjct: 105 YPESYKPVLATSG---SGKEFYIRSIFETSNAAIVTLLIDDTVVKATRAWVMDYLGRQLA 161 Query: 73 NWSPWAQLYTSAHPPAEFYPVGAPIPWPSDTVPSGYALMQGQTFDKSAYPKLAVAYPS-- 130 + + PVG+ + +P D VP G+ + G +AYP LA + Sbjct: 162 EGTYTKAEIEMLIAQSSALPVGSMVAFPIDKVPVGFLEIDGSVKSATAYPDLAKFLGTAF 221 Query: 131 ---------GVIPDMRGWTIKGKP-----ASGRAVLSQEQDGIKSHTHSASASSTDLGTE 176 +P+ RG ++G +GR S + D KSHTH Sbjct: 222 NKGDEGAGNFRLPESRGEFLRGWDHGRGVDAGRLAGSYQTDQFKSHTHEYDTMQGGGAAN 281 Query: 177 TTSSFDYGTKSTNNTGAHTHSISGTANSAGAHQHKSSGAFGGTNTSIFPNGYTAISNLSA 236 + S + + H +G N ++ + ++ Sbjct: 282 SVSDTIAAQSNATSQTGHITGGAG------------GSETRPRNLAVMWCIKAWNAPINQ 329 Query: 237 GIMS 240 G + Sbjct: 330 GNID 333 Score = 80.8 bits (197), Expect = 5e-14, Method: Composition-based stats. 
Identities = 39/272 (14%), Positives = 82/272 (30%), Gaps = 34/272 (12%) Query: 5 ALTDNTQGAAGLELYEVYNNGYPTAYGNIIHLKGMTAVGEGELLIGWSGTSGAHAPAFIR 64 + + + +N + G+I G + L + W I+ Sbjct: 273 TMQGGGAANSVSDTIAAQSNAT-SQTGHITGGAGGSETRPRNLAVMWC----------IK 321 Query: 65 SRRDTTDANWSPWAQLYTSAHPPAEFYPVGAPIPWPSDTVPSGYALMQGQTFDKSAYPKL 124 + + A L + PVG+ IP+ VP GY + G + YP L Sbjct: 322 AWNAPINQGNIDVAALVSELDVLKSAVPVGSIIPFLKAAVPPGYLELDGSVQSIATYPDL 381 Query: 125 AVAYPS-----------GVIPDMRGWTIKGKP-----ASGRAVLSQEQDGIKSHTHSASA 168 A + +P+ RG ++G +GR V S ++ + + + A Sbjct: 382 AAYLGTTFNTGSEPAGYFRLPESRGEFLRGWDHGRGMDAGREVGSWQKGSMVAVDTNIPA 441 Query: 169 SSTDLGTETT-------SSFDYGTKSTNNTGAHTHSISGTANSAGAHQHKSSGAFGGTNT 221 + T +D G + + + + G N Sbjct: 442 TQTIATNLVDAAAARMRGGYDSGDVGLYSGITLMGVNPQANVALPGNIEVTYGITRPNNL 501 Query: 222 SIFPNGYTAISNLSAGIMSTTSGSGQTRNAGK 253 ++ + ++ G + ++ + + N Sbjct: 502 AVMWCIKAWNAPINQGQIDISALALEVSNLAN 533 Score = 43.1 bits (99), Expect = 0.013, Method: Composition-based stats. Identities = 17/103 (16%), Positives = 36/103 (34%), Gaps = 4/103 (3%) Query: 217 GGTNTSIFPNGYTAISNLSAGIMSTTSGSGQTRNAGKTSSDGAHTHSLSGTAASAGAHAH 276 G F + L +G+ + +T +HTH A++ Sbjct: 224 GDEGAGNFRLPESRGEFLRGWDHGRGVDAGRLAGSYQTDQFKSHTHEYDTMQGGGAANSV 283 Query: 277 TVGIGAHTHSVAIGSHGHTITVNAAGNAENTVKNIAFNYIVRL 319 + I A +++ + H AG +E +N+A + ++ Sbjct: 284 SDTIAAQSNATSQTGH----ITGGAGGSETRPRNLAVMWCIKA 322 >UniRef50_C3X3R8 Predicted protein n=4 Tax=Oxalobacter formigenes HOxBLS RepID=C3X3R8_OXAFO Length = 365 Score = 90.4 bits (222), Expect = 8e-17, Method: Composition-based stats. 
Identities = 33/208 (15%), Positives = 66/208 (31%), Gaps = 21/208 (10%) Query: 85 HPPAEFYPVGAPIPWPSDTVPSGYALMQGQTFDKSAYPKLAVAYP----------SGVIP 134 PVG+ P+GY G + YP L A + +P Sbjct: 82 DIMKRGVPVGSIDWLAVPEPPAGYLKCDGAAIGRDTYPDLFAAIGTTFGAGDGETTFNLP 141 Query: 135 DMRGWTIKGKPASGRAVLSQEQDGIKSHTHSASASSTDLGTETTSSFDYGTKSTNN---- 190 DM G +G G ++ G+ + + ++ +TSS + NN Sbjct: 142 DMIGRFAEGSATPGIK----KEAGLPNVSGVSAVEGCINKGSSTSSGPFTYWRENNLILN 197 Query: 191 -TGAHTHSISGTAN--SAGAHQHKSSGAFGGTNTSIFPNGYTAISNLSAGIMSTTSGSGQ 247 + ++TH + G S G + +S ++ P + +++G++ T + + Sbjct: 198 TSPSNTHDLGGEIFSLSNGNPIYGNSDTVQPPALTLLPCIKAFDAAVNSGLIDITELANE 257 Query: 248 TRNAGKTSSDGAHTHSLSGTAASAGAHA 275 + + H Sbjct: 258 VTGKADKTQVANLAMPSDTGISVTVPHT 285 >UniRef50_D0KGE1 Shufflon domain protein n=1 Tax=Pectobacterium wasabiae WPP163 RepID=D0KGE1_PECWW Length = 532 Score = 89.7 bits (220), Expect = 1e-16, Method: Composition-based stats. Identities = 32/130 (24%), Positives = 55/130 (42%), Gaps = 7/130 (5%) Query: 82 TSAHPPAEFYPVGAPIPWPSDTVPSGYALMQGQTFDKSAYPKLAVAYPSGVIPDMRGWTI 141 + A + G W + P G+ + GQ F+ S P LA YPS +PD RG+ Sbjct: 381 SGAWKSSSSIQPGTITMWGTPVPPEGWLELNGQPFNPSGNPVLASLYPSSQVPDFRGYFP 440 Query: 142 KGKPAS------GRAVLSQEQDGIKSHTHSASASSTDLGTETTSSFDYGTKSTNNTGAHT 195 +G RA+LS + D I++ T + + + YG +N+G+ Sbjct: 441 RGWDNGAGIDPDSRAILSVQGDAIRNITGEFNPGGSSNWGKGV-FSSYGWPYPSNSGSAN 499 Query: 196 HSISGTANSA 205 + T +++ Sbjct: 500 DASIITFDAS 509 >UniRef50_C3X912 Phage tail collar domain-containing protein n=1 Tax=Oxalobacter formigenes OXCC13 RepID=C3X912_OXAFO Length = 436 Score = 89.3 bits (219), Expect = 2e-16, Method: Composition-based stats. 
Identities = 24/151 (15%), Positives = 54/151 (35%), Gaps = 15/151 (9%) Query: 70 TDANWSPWAQLYTSAHPPAEFYPVGAPIPWPSDTVPSGYALMQGQTFDKSAYPKLAVAYP 129 D+ W + + P G+ + + T P GY + G ++ Y +L A Sbjct: 264 YDSYLKKWVLQNPAKGIAIDSVPAGSVHYFATQTPPDGYLVANGALVSRTVYARLFSAIG 323 Query: 130 S----------GVIPDMRGWTIKGKP-----ASGRAVLSQEQDGIKSHTHSASASSTDLG 174 + +PD+RG ++G R + + D I++ + + + Sbjct: 324 TTFGEGDGGSTFQLPDLRGEFLRGWDAARNLDPERGFGTVQGDAIRNIIGTFGGNDQERR 383 Query: 175 TETTSSFDYGTKSTNNTGAHTHSISGTANSA 205 + + GT TG+ + + +++ Sbjct: 384 FLSGPFYYIGTDGGGKTGSSNGTDNFGFDAS 414 >UniRef50_C6ABW9 Phage tail collar protein n=1 Tax=Bartonella grahamii as4aup RepID=C6ABW9_BARGA Length = 370 Score = 89.3 bits (219), Expect = 2e-16, Method: Composition-based stats. Identities = 27/123 (21%), Positives = 48/123 (39%), Gaps = 17/123 (13%) Query: 78 AQLYTSAHPPAEFYPVGAPIPWPSDTVPSGYALMQGQTFDKSAYPKLAVAYPS------- 130 +++ PVG I +P+ TVP G+ G +S Y +L + Sbjct: 208 SEIDALLDALNNSMPVGTVIYYPALTVPKGWLKANGALISRSDYAQLFAVIGTTYGAGDG 267 Query: 131 ---GVIPDMRGWTIKGKPA-----SGRAVLSQEQDGIKSHTHSASASSTDLGTETTSSFD 182 +PD+RG ++G R + SQ+ D I++ T + + + +F Sbjct: 268 KTTFRLPDLRGEFLRGVDDERNIDPNRTIGSQQGDAIRNITGELNFDAKAKAA--SGAFK 325 Query: 183 YGT 185 YG Sbjct: 326 YGG 328 >UniRef50_C3X971 Predicted protein n=6 Tax=Oxalobacter formigenes OXCC13 RepID=C3X971_OXAFO Length = 534 Score = 88.9 bits (218), Expect = 2e-16, Method: Composition-based stats. 
Identities = 32/210 (15%), Positives = 72/210 (34%), Gaps = 22/210 (10%) Query: 57 AHAPAFIRSRRDTTDANWSPWAQLYTSAHPPAEFYPVGAPIPWPSDTVPSGYALMQGQTF 116 A A ++ + D W Q P P+G + T P+GY G+ Sbjct: 217 AGAGYWLELQYDEALDKW--VLQNPAKGISPLNGVPIGTVEYFAMSTPPAGYLKADGRAV 274 Query: 117 DKSAYPKLAVAYP----------SGVIPDMRGWTIKGKPASGRAVLSQEQDGIKSHTHSA 166 + Y +L + +PD+ +G G+ + + G+ + Sbjct: 275 GRETYAELYSVIGTTFGEGDEQTTFNLPDLIDRFAQGSNTPGQKI----EAGLPNIEGVI 330 Query: 167 SASSTDLGTETTSSFDYGTKSTNNTGAHTHSISGTANSAGAHQHKSSGAFGGTNTSIF-- 224 + S + L + + + + A+T ++ AN+ + +S+ +G ++T Sbjct: 331 TNSGSILWAGNEDASGAFSLTGASPRANTATVGAGANTLSFNASQSNQIYGASDTVQPPA 390 Query: 225 ----PNGYTAISNLSAGIMSTTSGSGQTRN 250 P + + G++ T + + Sbjct: 391 LTLLPCIKAFDAAANPGLIDITGLANEVDG 420 >UniRef50_C5A8Q3 Phage-related tail fiber protein n=1 Tax=Burkholderia glumae BGR1 RepID=C5A8Q3_BURGB Length = 865 Score = 88.9 bits (218), Expect = 2e-16, Method: Composition-based stats. Identities = 48/255 (18%), Positives = 78/255 (30%), Gaps = 11/255 (4%) Query: 68 DTTDANWSPWAQLYTSAH--PPAEFYPVGAPIPWPSDTVPSGYALMQGQTFDKSAYPKL- 124 T A + T+ +G + P T +G+ G +++ YP L Sbjct: 618 GPTPAAGDRSTRFATTEWVLSALSSSSIGQIVFEPRTTTRAGFLKANGSLLERADYPALW 677 Query: 125 AVAYPSGVIPDMRGWTIKGKPASGRAVLSQEQDGIKSHTHSASASSTDLGTETTSSFDYG 184 A A SG + W G+ + I D G +S G Sbjct: 678 AYAQASGALISDAAWWA-GQSGCFSTGTTGTNFRIPELRGEF-LRCLDDGRGLDTSRAAG 735 Query: 185 TKSTNNTGAHTHSISGTANSAGAHQHKSSGAFGGTNTSIFPNGYTAISNLSAGIMSTTSG 244 + + H+H S T + H ++GA + S + G Sbjct: 736 SLQLSQNAKHSHDASSTVGGSHTHGAFTTGAGSHNHAIDQQPHAHDTWLGSVQVSGVDRG 795 Query: 245 SGQTRNAGKTSSDGAHTHSLSGTAASAGAHAHTVGIGAHTHSVAIGSHGHTITVNAAGNA 304 G G+ + + + G H H G + G H H I V +G Sbjct: 796 GGFGPYNGRVGEAWSDPANANIAILPTGDHVHGAG------TYPAGDHNHAIAVQPSGGD 849 Query: 305 ENTVKNIAFNYIVRL 319 E +NIA ++R Sbjct: 850 EARPRNIALLAMIRA 864 >UniRef50_Q6D2U8 Bacteriophage tail fiber protein n=1 Tax=Pectobacterium atrosepticum RepID=Q6D2U8_ERWCT Length = 619 Score = 88.9 bits (218), Expect = 2e-16, Method: Composition-based stats. 
Identities = 31/111 (27%), Positives = 48/111 (43%), Gaps = 5/111 (4%) Query: 88 AEFYPVGAPIPWPSDTVPSGYALMQGQTFDKSAYPKLAVAYPSGVIPDMRGWTIKGKPAS 147 G P+P+P P+GY GQ FD + +P LA YPSG +PD+RG ++G Sbjct: 459 PTSELAGIPLPFPGAVAPAGYLKCNGQQFDTAQFPVLASRYPSGFLPDLRGEFVRGWDDG 518 Query: 148 G-----RAVLSQEQDGIKSHTHSASASSTDLGTETTSSFDYGTKSTNNTGA 193 RA++S + D I++ S ++ G + A Sbjct: 519 RGIDTVRALMSAQGDAIRNIVGSLFYGYDADVPVLNTNSSSGALYYEMSTA 569 >UniRef50_A9IRI0 Phage related protein n=9 Tax=Bartonella RepID=A9IRI0_BART1 Length = 324 Score = 88.5 bits (217), Expect = 3e-16, Method: Composition-based stats. Identities = 39/188 (20%), Positives = 64/188 (34%), Gaps = 19/188 (10%) Query: 60 PAFIRSRRDTTDANWSPWAQLYTSAHPPAEFYPVGAPIPWPSDTVPSGYALMQGQTFDKS 119 + L P E +P G + +P+G+ L G + + Sbjct: 130 GIYEMVYNSGVLIKEHEGWYLLNPTPPKIESFPAGFIATFAMRNIPNGWLLCDGTAYKRE 189 Query: 120 AYPKLAVAYP---------SGVIPDMRGWTIKGKPASG-----RAVLSQEQDGIKSHTHS 165 YP+L A + +PD RG ++G R ++QD IKSHTH Sbjct: 190 DYPQLFKAIGDKWGKNSDTTFKVPDFRGMFLRGFDDGRGLDNDRKFADEQQDSIKSHTHI 249 Query: 166 ASASSTDLGTETTSSFDYGTKSTNNTG-----AHTHSISGTANSAGAHQHKSSGAFGGTN 220 + + G + N + ++ G +SAGAH HK + + G Sbjct: 250 GTVEESGAHVHNFEYKGVGWPTGNIGRLPNYYTYNTTLKGKTDSAGAHTHKITLSHTGEA 309 Query: 221 TSIFPNGY 228 + N Sbjct: 310 ETRPVNTT 317 >UniRef50_A1VSH6 Phage Tail Collar domain protein n=1 Tax=Polaromonas naphthalenivorans CJ2 RepID=A1VSH6_POLNA Length = 483 Score = 88.5 bits (217), Expect = 3e-16, Method: Composition-based stats. 
Identities = 41/236 (17%), Positives = 75/236 (31%), Gaps = 65/236 (27%) Query: 93 VGAPIPWPSDTVPSGYALMQGQTFDKSAYPKLAVAYPS----------GVIPDMRGWTIK 142 G T P G+ G ++AY L A + +PD+RG I+ Sbjct: 302 PGHINYTARSTAPPGWLKANGAGISRTAYAALFAAIGTTFGVGDGFNTFNLPDLRGEFIR 361 Query: 143 GKPASGRAVLSQEQDGIKSHTHSASASSTDLGTETTSSFDYGTKSTNNTGAHTHSISGTA 202 G S+ ++ + +H +G+ Sbjct: 362 GWDDGRGVDGSRSLGSSQAGETA-----------------------------SHGHTGST 392 Query: 203 NSAGAHQHKSSGAFGGTNTSIFPNGYTAISNLSAGIMSTTSGSGQTRNAGKTSSDGAHTH 262 ++AG H H + P ++ + G S+ Sbjct: 393 SAAGIHAHGVND----------PGHSHQVTQEGG------RNTSLAYQNGPNSAFRGEVS 436 Query: 263 SLSGTAASAGAHAHTVGIGAHTHSVAIGSHGHTITVNAAGNAENTVKNIAFNYIVR 318 +L T +A GIG + G+H HT+T++A G +E +N+A +++ Sbjct: 437 TLLETTRNA------TGIGISEN----GNHSHTVTISATGGSETRPRNLALLAVIK 482 >UniRef50_A4PE45 Tail fiber protein gpH n=3 Tax=root RepID=A4PE45_9CAUD Length = 554 Score = 88.1 bits (216), Expect = 3e-16, Method: Composition-based stats. Identities = 32/198 (16%), Positives = 62/198 (31%), Gaps = 33/198 (16%) Query: 9 NTQGAAGLELYEVYNNGYPTAYGNIIHLKGMTAVGEGELLIGWSGTSGAHAPAFIRSRRD 68 N A G Y N P+ +G ++ + +A + + + +R Sbjct: 334 NALVAPGEYYYTSDNANAPSGHG-VLKVWRESAT---MVFQLVHSSDNE-----VFTRYR 384 Query: 69 TTDANWSPWAQLYTSAHPPAEFYPVGAPIPWPSDTVPSGYALMQGQTFDKSAYPKLAVAY 128 + W+ W QL A G + T PSG+ G ++ Y L Sbjct: 385 ASSGTWTAWRQLVGQA---------GLIGYFARSTAPSGWLKANGAAVSRTTYAALYAEI 435 Query: 129 PS----------GVIPDMRGWTIKGKP-----ASGRAVLSQEQDGIKSHTHSASASSTDL 173 + +PD+RG ++G SGR + + + H +S ++ Sbjct: 436 GTTFGAGDGAATFNLPDLRGEFLRGWDDGRGVDSGRGIGTWQSGSPVVHDDVGGIASFNI 495 Query: 174 GTETTSSFDYGTKSTNNT 191 + + + Sbjct: 496 TALGDGTNVAWSNIADPW 513 >UniRef50_C4TTI1 Tail fiber protein n=2 Tax=Yersinia RepID=C4TTI1_YERKR Length = 402 Score = 87.4 bits (214), Expect = 7e-16, Method: Composition-based stats. 
Identities = 40/128 (31%), Positives = 59/128 (46%), Gaps = 9/128 (7%) Query: 81 YTSAHPPAEFYPVGAPIPWPSDTVPSGYALMQGQTFDKSAYPKLAVAYPSGVIPDMRGWT 140 Y++ E +G PIPWP P+GY G F+K+ YPKLA+AYPSGV+PD+RG Sbjct: 265 YSNFDERYEPALIGTPIPWPLTIAPAGYLKCNGAPFNKTQYPKLALAYPSGVLPDLRGEF 324 Query: 141 IKGKPA-----SGRAVLSQEQDGIKSHTHSASASSTDLGTETTSSFDYGTKSTNNTGAHT 195 I+G + +L + I+SH H T+ + + G T Sbjct: 325 IRGFDDGRGVRPNQPLLGWQGSEIQSHNHGI----TNFEIRGVTGGPTNAWFPSTNGIST 380 Query: 196 HSISGTAN 203 ++ G Sbjct: 381 NNSGGDET 388 >UniRef50_C5H7L3 Putative tail fiber protein n=1 Tax=Enterobacteria phage WV8 RepID=C5H7L3_9CAUD Length = 848 Score = 87.0 bits (213), Expect = 7e-16, Method: Composition-based stats. Identities = 57/210 (27%), Positives = 88/210 (41%), Gaps = 26/210 (12%) Query: 78 AQLYTSAHPPAEFYPVGAPIPWPSDTVPSGYALMQGQTFDKSAYPKLAVAYPSGVIPDMR 137 A+ + + + YPVG + S+ P+ +A P L Y + + Sbjct: 636 AEAISDSTDLNKIYPVGIVTWFNSNVNPN------------TALPGLTWTYLNNGV---- 679 Query: 138 GWTIKGKPASGRAVLSQEQDGIKSHTHSASASSTDLGTETTSSFDYGTKSTNNTGAHTHS 197 G TI+ A+G V + + + S T + TTSSFDYGTK+++ TG H H+ Sbjct: 680 GRTIRIAAANGSDVATTGGSDSVTLSVGNLPSHTHSFSATTSSFDYGTKTSSTTGNHNHN 739 Query: 198 ISGTANSAGAHQHKSSGAFGGTNTSIFPNGYTAISNLSAGIMSTTSGSGQTRNAGKTSSD 257 GT G+ + S A YTA G + + G + Sbjct: 740 R-GTMEITGSFGYFRSDASSF---------YTASGAFYLGSQAGSKGYTGNNFTNGIPVN 789 Query: 258 GAHTHSLSGTAASAGAHAHTVGIGAHTHSV 287 + + SG + G H+HTVGIGAH+H+V Sbjct: 790 FNASRNWSGVTNTTGNHSHTVGIGAHSHTV 819 >UniRef50_C3X3W1 Predicted protein n=2 Tax=Oxalobacter formigenes HOxBLS RepID=C3X3W1_OXAFO Length = 365 Score = 86.6 bits (212), Expect = 1e-15, Method: Composition-based stats. 
Identities = 33/162 (20%), Positives = 57/162 (35%), Gaps = 10/162 (6%) Query: 72 ANWSPWAQLYTSAHPPAEFYPVGAPIPWPSDTVPSGYALMQGQTFDKSAYPKLAVAYP-- 129 W A+L E P G I + P G+ G + SAYP+L Sbjct: 200 DRWEIMAELAGKLDKA-EKLPAGTIIAVGGNITPEGFLYCNGASLSPSAYPELCAVIGGT 258 Query: 130 -------SGVIPDMRGWTIKGKPASGRAVLSQEQDGIKSHTHSASASSTDLGTETTSSFD 182 + +PD RG ++G +GR + + + + A A +T T + D Sbjct: 259 YGGDGLTTFNLPDFRGRWMQGNDTAGRVLAAGLPNVTGTIVSGAIAHATAYQTGAFYNID 318 Query: 183 YGTKSTNNTGAHTHSISGTANSAGAHQHKSSGAFGGTNTSIF 224 G + G+ H +G S + +S + ++ Sbjct: 319 VGAFGGYHAGSQNHYRAGFEASRSNPIYGASDTVRPPSITVR 360 >UniRef50_Q7N6T1 Similarities with DNA inversion product n=2 Tax=Photorhabdus luminescens subsp. laumondii RepID=Q7N6T1_PHOLL Length = 300 Score = 86.2 bits (211), Expect = 1e-15, Method: Composition-based stats. Identities = 36/112 (32%), Positives = 51/112 (45%), Gaps = 5/112 (4%) Query: 66 RRDTTDANWSPWAQLYTSAHPPAEFYPVGAPIPWPSDTVPSGYALMQGQTFDKSAYPKLA 125 + + + + + PVG+PIPWP P GY G F+K YPKLA Sbjct: 127 QEINSLREHTYTREEIDNRIKTVGEIPVGSPIPWPLPYPPVGYLTCNGSAFNKLQYPKLA 186 Query: 126 VAYPSGVIPDMRGWTIKGKPAS-----GRAVLSQEQDGIKSHTHSASASSTD 172 AYP G +PD+RG I+G GR +LS + D ++ T A + Sbjct: 187 EAYPDGRLPDLRGEFIRGWDDGRGVDMGRTMLSWQGDAMQRMTGFLEAGNGI 238 >UniRef50_B0UTN0 Phage Tail Collar domain protein n=1 Tax=Haemophilus somnus 2336 RepID=B0UTN0_HAES2 Length = 699 Score = 85.4 bits (209), Expect = 2e-15, Method: Composition-based stats. 
Identities = 29/148 (19%), Positives = 50/148 (33%), Gaps = 22/148 (14%) Query: 4 TALTDNT----QGAAGLELYEVYNNGYPTAYGNIIHLKGMTAVGEGELLIGWSGTSGAHA 59 T L Q G + + Y + + HL E++ G T+ Sbjct: 261 TTLAGYGITDFQVKTGSDDVDNYKTDGHYYFASSQHLPDNNGAWHVEVVSGGQTTA---V 317 Query: 60 PAFIRSRRDT-------TDANWSPWAQLYTSAHPPAEFYPVGAPIPWPSD-TVPSGYALM 111 R DT + + W+ W + P+GA + +P T P+G+ Sbjct: 318 RQIARKANDTKVKTRFFSGSKWTEWKDIGGDG------VPLGAIVAFPKAITNPTGFLKC 371 Query: 112 QGQTFDKSAYPKLAVAYPS-GVIPDMRG 138 G T D+ YP L + +P++ Sbjct: 372 DGTTIDQRTYPDLYRTLGNKNTLPNLTR 399 >UniRef50_B2FIY3 Putative phage collar protein n=1 Tax=Stenotrophomonas maltophilia K279a RepID=B2FIY3_STRMK Length = 410 Score = 85.1 bits (208), Expect = 3e-15, Method: Composition-based stats. Identities = 27/126 (21%), Positives = 45/126 (35%), Gaps = 14/126 (11%) Query: 90 FYPVGAPIPWPSDTVPSGYALMQGQTFDKSAYPKLAVAYPS---------GVIPDMRGWT 140 P G +P+ P G+ G ++ Y L + +PD+RG Sbjct: 253 LLPAGMVAHFPTGGPPPGWLRCNGADVSRTTYADLFAVIGTLFGSANDMTFRLPDLRGEF 312 Query: 141 IKGKP-----ASGRAVLSQEQDGIKSHTHSASASSTDLGTETTSSFDYGTKSTNNTGAHT 195 ++G GRA+ S + + ASA G S D+G +TN + Sbjct: 313 VRGWDDGRGVDGGRALGSLQAATEVLSSWGASAGGLVSGQYQYSLADFGVHTTNADSSRQ 372 Query: 196 HSISGT 201 + G+ Sbjct: 373 VNNVGS 378 >UniRef50_B2ZY49 Phage tail collar domain protein n=1 Tax=Ralstonia phage RSL1 RepID=B2ZY49_9CAUD Length = 498 Score = 84.3 bits (206), Expect = 5e-15, Method: Composition-based stats. 
Identities = 46/254 (18%), Positives = 80/254 (31%), Gaps = 79/254 (31%) Query: 80 LYTSAHPPAEFYPVGAPIPWPSDTVPSGYALMQGQTFDKSAYPKLAVAYP---------- 129 Y P + P G +P+ T+P+GY ++ + L Sbjct: 308 FYDQILNPPQLVPPGTILPFAGTTIPAGYLACNAAAISRTGFASLYSVIGTTYGVGNGST 367 Query: 130 SGVIPDMRGWTIKGKP-----ASGRAVLSQEQDGIKSHTHSASASSTDLGTETTSSFDYG 184 + +PD+RG ++G GR + + D +SH H+ Sbjct: 368 TFNLPDLRGVFVRGWDNGRGQDPGRVFGTYQGDAFRSHNHAV------------------ 409 Query: 185 TKSTNNTGAHTHSISGTANSAGAHQHKSSGAFGGTNTSIFPNGYTAISNLSAGIMSTTSG 244 + H H + G ++T + + S + S G Sbjct: 410 -----SDPGHAHGVY---------------DPGHSHTWTLGTLRQSGGDTSCYVPSARYG 449 Query: 245 SGQTRNAGKTSSDGAHTHSLSGTAASAGAHAHTVGIGAHTHSVAIGSHGHTITVNAAGNA 304 G+ + T++ G GIG + + IG+ G A Sbjct: 450 GGEFQFTETTAAVG-------------------TGIGIYGNVTGIGT-------LVNGGA 483 Query: 305 ENTVKNIAFNYIVR 318 E T KN+A NYI++ Sbjct: 484 ETTPKNVAMNYIIK 497 >UniRef50_C3X1Y2 Tail fiber protein gpH n=1 Tax=Oxalobacter formigenes HOxBLS RepID=C3X1Y2_OXAFO Length = 480 Score = 84.3 bits (206), Expect = 5e-15, Method: Composition-based stats. Identities = 32/216 (14%), Positives = 66/216 (30%), Gaps = 22/216 (10%) Query: 48 LIGWSGTSGAHAPAFIRSRRDTTDANWSPWAQLYTSAHPPAEFYPVGAPIPWPSDTVPSG 107 W + + + F +D S P+G + T P+G Sbjct: 148 SQAWVIDNFSKSKLFPGGAKDQVLTKLSD--NSGDMGWKYPSGVPIGTVEYFAMATPPAG 205 Query: 108 YALMQGQTFDKSAYPKLAVAYP----------SGVIPDMRGWTIKGKPASGRAVLSQEQD 157 Y G ++ YP L A + +PDM G +G G + ++ Sbjct: 206 YLKADGAAVGRATYPDLFAAIGTTFGAGDGETTFNLPDMIGRFAEGSATPG----TVKEA 261 Query: 158 GIKSHTHSASASSTDLGTETTSSFDYG------TKSTNNTGAHTHSISGTANSAGAHQHK 211 G+ + T + T S + + TG + + S + + Sbjct: 262 GLPNITGEINGHFGSSVAFGTGSLFTSIGGSRYRATPDGTGGEAFFAAFISASRSSPIYG 321 Query: 212 SSGAFGGTNTSIFPNGYTAISNLSAGIMSTTSGSGQ 247 +S ++ P + ++ G++ T + + Sbjct: 322 NSDTVQPPALTLLPCIKAFDAAVNPGLIDVTELANE 357 >UniRef50_C3X8V5 Bacteriophage tail fiber protein n=2 Tax=Oxalobacter formigenes OXCC13 RepID=C3X8V5_OXAFO Length = 480 Score = 84.3 bits (206), Expect = 6e-15, Method: Composition-based stats. 
Identities = 27/190 (14%), Positives = 58/190 (30%), Gaps = 13/190 (6%) Query: 86 PPAEFYPVGAPIPWPSDTVPSGYALMQGQTFDKSAYPKLAVAYP----------SGVIPD 135 + PVG + + T P+GY G + YP L A + +PD Sbjct: 191 IAKKGVPVGTIEYFATSTPPAGYLKADGAAVGRETYPDLFAAIGTAFGEGDGSTTFNLPD 250 Query: 136 MRGWTIKGKPASGRAVLSQEQDGIKSHTHSASASSTDLGTETTSSFDYGT---KSTNNTG 192 + G +G G+ + + + I + + +++ G G Sbjct: 251 LIGRFAQGSDVPGQKLEAGLPNAIGKLSGFFGFTPVYKSGALSTTGSAGVQFETIGVAGG 310 Query: 193 AHTHSISGTANSAGAHQHKSSGAFGGTNTSIFPNGYTAISNLSAGIMSTTSGSGQTRNAG 252 A ++ I S + +S ++ P + G++ T + + + Sbjct: 311 ASSNKIINLDLSESNPIYGASDTVQPPALTLLPCIKAFDAATDPGLIDITELAQEMADKT 370 Query: 253 KTSSDGAHTH 262 + Sbjct: 371 DKMTAANAAM 380 >UniRef50_C4GFX3 Putative uncharacterized protein n=2 Tax=Kingella oralis ATCC 51147 RepID=C4GFX3_9NEIS Length = 310 Score = 84.3 bits (206), Expect = 6e-15, Method: Composition-based stats. Identities = 33/191 (17%), Positives = 63/191 (32%), Gaps = 21/191 (10%) Query: 8 DNTQGAAGLELYEVYNNGYPTAYGNIIHLKGMTAVGEGELLIGWSGTSGAHAPAFIRSRR 67 T+ L E T G + + +G + T A + A ++ Sbjct: 76 QYTETEVASLLTESIREIIKTTIGETKASSQTAGIMKVLNTLGSTATDAALSAAQGKALN 135 Query: 68 DTTDANWSPWAQLYTSAHPPAEFYPVGAPIPWPSDTVPSGYALMQGQTFDKSAYPKLAVA 127 D A + + + P G + +D P+G+ G ++ Y L A Sbjct: 136 DAIAA-----LNALLTGYTANSYCPSGQIGLFATDYAPTGWLKANGAVLSRTVYTNLFAA 190 Query: 128 YP----------SGVIPDMRGWTIKGKP-----ASGRAVLSQEQDGIKSHTHSASASSTD 172 + +PD+RG + +GR + S + D I++ T D Sbjct: 191 IGTRFGAGDGHSTFNLPDLRGEFPRFWDDGRGVDAGRVLGSWQSDAIRNITAQMYLYGQD 250 Query: 173 LGTETTSSFDY 183 G+ + +F + Sbjct: 251 -GSSSQGAFGF 260 >UniRef50_A9LZ37 Tail fibre protein, putative n=21 Tax=Neisseria RepID=A9LZ37_NEIM0 Length = 658 Score = 83.9 bits (205), Expect = 7e-15, Method: Composition-based stats. 
Identities = 62/351 (17%), Positives = 100/351 (28%), Gaps = 85/351 (24%) Query: 4 TALTDNTQGAAGLELYEVYNN--------GYPTAYGNIIHLKGMTAVGEGEL---LIGWS 52 L+ G +E + N PTA G+ TA + GW Sbjct: 155 NTLSGYGIGNFKVETFRGDLNTLKTDGIYSLPTAVGSSNLPVENTACHIQVIAGTQPGWC 214 Query: 53 GTSGAHA---PAFIRSRRDTTDANWSPWAQLYTSAHPPAEFYPVGAPIPWPSDT-VPSGY 108 G A + R + + + NWS W +L + PVGA + +P P+GY Sbjct: 215 RQLGYPAYTSDVYERHQTSSANDNWSAWKKLNSDG------IPVGAIVSFPKAVRNPAGY 268 Query: 109 ALMQGQTFDKSAYPKLAVAYP-SGVIPDM------------RGWTIKGK----------- 144 G TF ++ +P L A S +PD+ G Sbjct: 269 LRADGTTFAQNTFPDLYRALGNSNRLPDLSRTDIGITAWFPSDQIPTGWLAFDDIRTRVT 328 Query: 145 -------------------------------PASGRAVLSQEQDGIKSHTHSASASSTDL 173 + AV ++++D IK H H + T+ Sbjct: 329 ETAYPELYRLLTGKYGSIQNVPQAEDRFIRNAGNSLAVGTKQEDEIKRHVHKVFSHWTNH 388 Query: 174 GTETTSSFDYGTKSTNN-----TGAHTHSISGTANSAGAHQHKSSGAFGGTNTSIFPNGY 228 ++ + + + +G + + G + Sbjct: 389 TDAAALGYEDRNERQRSALVSTWTDENLNDNGFLTPRSDSKMATGGDENRPKALVLKLCI 448 Query: 229 TAISNLSAGIMSTTSGSGQTRNAGKTSSDGAHTHSLSGTAASAGAHAHTVG 279 A L + G+T NAG G +L A A H HT Sbjct: 449 KAADTLGEAVFWI-KSHGETINAGALD-AGTLAQNLQDKADRA--HTHTAA 495 >UniRef50_A6E6G6 Phage Tail Collar n=1 Tax=Pedobacter sp. BAL39 RepID=A6E6G6_9SPHI Length = 731 Score = 83.9 bits (205), Expect = 7e-15, Method: Composition-based stats. Identities = 33/163 (20%), Positives = 56/163 (34%), Gaps = 9/163 (5%) Query: 79 QLYTSAHPPAEFYPVGAPIPWPSDTVPSGYALMQGQTFDKSAYPKLAVAY-PSGVIPDMR 137 +L +A +PVG + + + VP + L G+ D S YP L +PD+R Sbjct: 563 ELKRAAAATILDFPVGGIVAFYGEKVPDHWLLCDGKPVDHSLYPDLYRLLGGEKRLPDLR 622 Query: 138 GWTIKG-------KPASGRAVLSQEQDGIKSHTHSASASSTDLGTETTSSFDYGTKSTNN 190 G + G G LS D + H H A + + + + + Sbjct: 623 GRFLVGAGSKYSLGDMGGVDELSLNVDQMPQHDHQIKAVKSYESPFKEVNMGWAREESLR 682 Query: 191 TGAHTHSISGTANSAGAHQHKSSG-AFGGTNTSIFPNGYTAIS 232 G + A+ + S + GG Y A++ Sbjct: 683 GGVYGTDRDNGADKYFVTRSNSPVKSEGGGKAHENRPPYLAVN 725 >UniRef50_UPI0001A44C27 bacteriophage tail fiber protein n=2 Tax=Pectobacterium carotovorum subsp. 
carotovorum WPP14 RepID=UPI0001A44C27 Length = 195 Score = 83.9 bits (205), Expect = 8e-15, Method: Composition-based stats. Identities = 36/121 (29%), Positives = 49/121 (40%), Gaps = 12/121 (9%) Query: 72 ANWSPWAQLYTSAHPPAEFY-------PVGAPIPWPSDTVPSGYALMQGQTFDKSAYPKL 124 N W + + + + VG P P P T P G+ GQ+FD S YP L Sbjct: 60 GNGGRWRREFNTENLTPSSIGAIQGNELVGIPQPCPLVTAPEGWLACAGQSFDTSRYPVL 119 Query: 125 AVAYPSGVIPDMRGWTIKGKP-----ASGRAVLSQEQDGIKSHTHSASASSTDLGTETTS 179 A YP G +PD+RG I+G +GR LS + + HTH G + Sbjct: 120 ASRYPQGRLPDLRGEFIRGWDNGRGVDTGRGNLSSQSFSTEPHTHDGGTLGLGSGAPIYT 179 Query: 180 S 180 Sbjct: 180 G 180 >UniRef50_Q7P172 Probable bacteriophge tail fiber protein n=1 Tax=Chromobacterium violaceum RepID=Q7P172_CHRVO Length = 591 Score = 83.5 bits (204), Expect = 9e-15, Method: Composition-based stats. Identities = 40/207 (19%), Positives = 60/207 (28%), Gaps = 27/207 (13%) Query: 43 GEGELLIGWSGTSGAHAPAFIRSRRD-TTDANWSPWAQLYTSAHPPAEFYPVGAPIPWPS 101 + G + R+R D T W+PW L E Y G + Sbjct: 384 NNINYSCQLTADYGNGSMMRFRTRNDDGTTGRWNPWRTLIH------EDYLTGQVAFFAM 437 Query: 102 DTVPSGYALMQGQTFDKSAYPKLAVAYP----------SGVIPDMRGWTIKGKPAS---- 147 P G+ G + YP L A + +PD+RG ++G Sbjct: 438 SAPPLGWLKANGAAVSRKDYPSLFAALGTYYGAGDGSTTFNLPDLRGEFVRGWDDGRGVD 497 Query: 148 -GRAVLSQEQDGI----KSHTHSASASSTDLGTETTSSF-DYGTKSTNNTGAHTHSISGT 201 GR + ++ + S T AS T + D G + T Sbjct: 498 NGRGFGTWQKGTLTFSDPSLTSPCVASLVHRNDNTVIGYLDLGADPVDKNKYDLGLSVST 557 Query: 202 ANSAGAHQHKSSGAFGGTNTSIFPNGY 228 AN S G G ++ N Sbjct: 558 ANGVYLPDLDSGGWANGYGSTRPRNIA 584 >UniRef50_Q7NAA0 Complete genome; segment 1/17 n=2 Tax=Photorhabdus RepID=Q7NAA0_PHOLL Length = 351 Score = 83.5 bits (204), Expect = 1e-14, Method: Composition-based stats. 
Identities = 39/154 (25%), Positives = 62/154 (40%), Gaps = 6/154 (3%) Query: 84 AHPPAEFYPVGAPIPWPSDTVPSGYALMQGQTFDKSAYPKLAVAYPSGVIPDMRGWTIKG 143 + VG P+PW T P+GY + GQ FDKS YPKL AYPSG +PD+RG I+G Sbjct: 192 KILTEDDILVGIPLPWSKPTAPAGYLICSGQQFDKSMYPKLGEAYPSGALPDLRGEFIRG 251 Query: 144 KP-----ASGRAVLSQEQDG-IKSHTHSASASSTDLGTETTSSFDYGTKSTNNTGAHTHS 197 SGR +LS + + + A++ + L + + + Sbjct: 252 WDNGRSIDSGREILSHQNSTKLPNLYTHAASENIGLLVSPPINHFSSNYPSEIMASDFEE 311 Query: 198 ISGTANSAGAHQHKSSGAFGGTNTSIFPNGYTAI 231 + + +G+ + + P Sbjct: 312 AEFGSGQYFSTPLNPTGSVSLSTFRVRPRNIAFN 345 >UniRef50_B8DPV9 Tail Collar domain protein n=1 Tax=Desulfovibrio vulgaris str. 'Miyazaki F' RepID=B8DPV9_DESVM Length = 530 Score = 83.1 bits (203), Expect = 1e-14, Method: Composition-based stats. Identities = 48/212 (22%), Positives = 80/212 (37%), Gaps = 24/212 (11%) Query: 53 GTSGAHAPAFIRSRRDTTDANWSPWAQLYTSAHPP-----------AEFYPVGAPIPWPS 101 S + S DA + QL A F P+GA + +P Sbjct: 163 AASATASAVRAESSVSGLDAALTGVHQLREEVLQTVADARQDVLACAAFVPIGAILDFPV 222 Query: 102 DTVPSGYALMQGQTFDKSAYPKLAVAYPSG------VIPDMRGWTIKGKP-----ASGRA 150 +TVP+G+ + GQ ++AYP L G +PD+RG +G +GR Sbjct: 223 NTVPTGFLVCAGQVVTRTAYPDLVTYLTGGTVAVNATLPDLRGEFRRGADLGRGVDAGRV 282 Query: 151 VLSQEQDGIKSHTHSASASSTDLGTETTSSFDYGTKSTNNT--GAHTHSISGTANSAGAH 208 V S + D I++ T S + ++ + ST N+ GA T T + + Sbjct: 283 VGSAQGDAIRNITGSLYNYIQNNASQENGALRTQVASTLNSPFGAGTIMSWSTLSIDASR 342 Query: 209 QHKSSGAFGGTNTSIFPNGYTAISNLSAGIMS 240 Q ++ N ++ P + +SA + Sbjct: 343 QVPTASENRPRNIAVVPCIKAYHAPMSAAPVD 374 >UniRef50_C6S6V6 Putative phage tail fibre protein n=1 Tax=Neisseria meningitidis alpha14 RepID=C6S6V6_NEIML Length = 728 Score = 82.7 bits (202), Expect = 2e-14, Method: Composition-based stats. 
Identities = 64/351 (18%), Positives = 99/351 (28%), Gaps = 85/351 (24%) Query: 4 TALTDNTQGAAGLELYEVYNN--------GYPTAYGNIIHLKGMTAVGEGEL---LIGWS 52 L+ G +E + N PTA G+ TA + GW Sbjct: 225 NTLSGYGIGNFKVETFRGDLNTLKTDGIYSLPTAVGSSNLPVENTACHIQVIAGTQPGWC 284 Query: 53 GTSGAHA---PAFIRSRRDTTDANWSPWAQLYTSAHPPAEFYPVGAPIPWPSDT-VPSGY 108 G A + R + + + NWS W +L + PVGA + +P P+GY Sbjct: 285 RQLGYPAYTSDVYERHQVSSANDNWSAWKKLNSDG------IPVGAIVSFPKAVRNPAGY 338 Query: 109 ALMQGQTFDKSAYPKLAVAYP-SGVIPDM------------RGWTIKGK----------- 144 G TF ++ +P L A S +PD+ G Sbjct: 339 LRADGTTFAQNTFPDLYRALGNSNRLPDLSRTDIGITAWFPSDQIPTGWLAFDDIRTRVT 398 Query: 145 -------------------------------PASGRAVLSQEQDGIKSHTHSASASSTDL 173 + AV ++++D IK HTH + T Sbjct: 399 ETAYPELYRLLTGKYGSIQNVPQAEDRFIRNAGNSLAVGTKQEDEIKRHTHKVFSHWTSH 458 Query: 174 GTETTSSFDYGTKSTNN-----TGAHTHSISGTANSAGAHQHKSSGAFGGTNTSIFPNGY 228 ++ G + + S +G + + G + Sbjct: 459 TDVAAVGYEDGNERQRSALVSTWTDENLSDNGFLTPRLDSKMATGGDENRPKALVLKLCI 518 Query: 229 TAISNLSAGIMSTTSGSGQTRNAGKTSSDGAHTHSLSGTAASAGAHAHTVG 279 A L + G+T NAG G L A H HT Sbjct: 519 KAADTLGEAVFWI-KSHGETVNAGALD-AGTLEQGLQDKADR--DHTHTAA 565 >UniRef50_B0USC5 Phage Tail Collar domain protein n=1 Tax=Haemophilus somnus 2336 RepID=B0USC5_HAES2 Length = 652 Score = 82.7 bits (202), Expect = 2e-14, Method: Composition-based stats. Identities = 32/145 (22%), Positives = 46/145 (31%), Gaps = 17/145 (11%) Query: 4 TALTDNTQGAAGLELYEVYNNGYPTAYGNIIHLKGMTAVGEGELLIGWSGTSGAHAPAFI 63 T L G + N GN G G I +A I Sbjct: 139 TTLAGYGIGDFKVGTSTGDANDCKID-GNYYFASGQNLPSAGAWHIAVMSGGQTNAIRQI 197 Query: 64 -RSRRDT-------TDANWSPWAQLYTSAHPPAEFYPVGAPIPWPSDT-VPSGYALMQGQ 114 R ++ +WS W + PVG+ + +P P G+ G Sbjct: 198 ARKANESKVQTRYFNGTSWSAWKDVGGDG------LPVGSVLAFPVAVQNPQGFLKCDGS 251 Query: 115 TFDKSAYPKLAVAYP-SGVIPDMRG 138 TF ++ YP L A S +PD+R Sbjct: 252 TFGRTTYPDLYRALGNSNKLPDLRR 276 >UniRef50_Q4TVW2 Tail fiber protein n=4 Tax=Viruses RepID=Q4TVW2_9CAUD Length = 760 Score = 82.7 bits (202), Expect = 2e-14, Method: Composition-based stats. 
Identities = 44/188 (23%), Positives = 71/188 (37%), Gaps = 20/188 (10%) Query: 90 FYPVGAPIPWPSDTVPSGYALMQGQTFDKSAYPKLAVAYPSGVIPDMRGWTIKGKPASGR 149 P+G+ P+ T P+GY G TF K YP L S +PDMRG +K Sbjct: 263 AVPIGSIFPF-VKTPPAGYLTCDGSTFSKDEYPDLYAYLGSTTLPDMRGRYLKMPSDLAN 321 Query: 150 AVLSQEQDGIKSHTHSASASSTDLGTETTSSFDYGTKSTNNTGAHTHSISGTANSAGAHQ 209 + + I + H S T ++ + D GT I G H Sbjct: 322 -IYQKFPAIIPALLHDVDISHTHTASQQAHAHDRGTME----------IGGEFFVGSGHG 370 Query: 210 -HKSSGAFGGTNTSIFPNGYTAISNLSAGI-------MSTTSGSGQTRNAGKTSSDGAHT 261 + ++GA+GG S P G ++G ++ + +G T + + A T Sbjct: 371 LYIATGAYGGAFFSDSPGGADNNGGGASGGLNRRWVFRASRNWTGLTSYSAPAITVNALT 430 Query: 262 HSLSGTAA 269 +++ T Sbjct: 431 NAIRQTTN 438 >UniRef50_A0NQ95 Putative tail fiber-related protein n=1 Tax=Labrenzia aggregata IAM 12614 RepID=A0NQ95_9RHOB Length = 329 Score = 82.4 bits (201), Expect = 2e-14, Method: Composition-based stats. Identities = 41/241 (17%), Positives = 72/241 (29%), Gaps = 73/241 (30%) Query: 93 VGAPIPWPSDTVPSGYALMQGQTFDKSAYPKLAVAYPS----------GVIPDMRGWTIK 142 G + T P G+ G ++AY L A + +PD+RG ++ Sbjct: 146 PGCVAYYAMSTAPDGWLKANGAEISRTAYADLFAAIGTIFGVGDGNSTFNLPDLRGEFLR 205 Query: 143 GKPASG-----RAVLSQEQDGIKSHTHSASASSTDLGTETTSSFDYGTKSTNNTGAHTHS 197 G + R + S + D SHTH Sbjct: 206 GWDDARGVDGARVLGSSQSDQNASHTH--------------------------------- 232 Query: 198 ISGTANSAGAHQHKSSGAFGGTNTSIFPNGYTAISNLSAGIMSTTSGSGQTRNAGKTSSD 257 S + H +G T Y +N G+ + + T + Sbjct: 233 ----TGSTSSDSHSHTGTTNTTGNHTHNMAYEGGTNAGTGLAAPATSRSNTSPGPTVNYS 288 Query: 258 GAHTHSLSGTAASAGAHAHTVGIGAHTHSVAIGSHGHTITVNAAGNAENTVKNIAFNYIV 317 G H+H+ S ++ S ++T +A+G +E +NIA + Sbjct: 289 GNHSHTFSTSSDSHSH---------------------SVTTDASGGSEARPRNIALLACI 327 Query: 318 R 318 + Sbjct: 328 K 328 >UniRef50_C3X8U2 Phage Tail Collar Domain containing protein n=1 Tax=Oxalobacter formigenes OXCC13 RepID=C3X8U2_OXAFO Length = 266 Score = 81.6 bits (199), Expect = 3e-14, Method: Composition-based stats. 
Identities = 26/133 (19%), Positives = 41/133 (30%), Gaps = 17/133 (12%) Query: 86 PPAEFYPVGAPIPWPSDTVPSGYALMQGQTFDKSAYPKLAVAYP----------SGVIPD 135 P G + P+GY G ++ YP L A + +PD Sbjct: 102 IAKNGVPTGTIAFFAMTAPPAGYLKADGAIIQRTDYPALFTAIGTTFGEGDGTTTFTLPD 161 Query: 136 MRGWTIKGKP-----ASGRAVLSQEQDGIKSHTHSASASSTDLGT--ETTSSFDYGTKST 188 +RG I+G RA S + D I++ T + S T + Sbjct: 162 LRGEFIRGWDNGRNIDCERAFGSIQGDAIRNVTGQLRYAGPQNSDSVMNYQSALQWTSVS 221 Query: 189 NNTGAHTHSISGT 201 + S G+ Sbjct: 222 QKSPYSAQSSQGS 234 >UniRef50_UPI000180B6D6 PREDICTED: similar to glutamate receptor, ionotropic, delta 2 n=1 Tax=Ciona intestinalis RepID=UPI000180B6D6 Length = 1235 Score = 81.2 bits (198), Expect = 5e-14, Method: Composition-based stats. Identities = 30/149 (20%), Positives = 51/149 (34%), Gaps = 4/149 (2%) Query: 162 HTHSASASSTDLGTETTSS-FDYGTKSTNNTGAHTHSISGTANSAGAHQHKSSGAFGGTN 220 H H + + + G TT++ +G +T H H + T + G + G Sbjct: 365 HGHMTTTTPSGHGHMTTTTPSGHGHMTTTTPLGHGHMTTTTPSGHGHMTTTTPSGHGHMT 424 Query: 221 TSIFPNGYTAISNLSAGIMSTTSGSGQTRNAGKTSSDGAHTHSLSGTAASAGAHAHTVGI 280 T+ + +G T+ + T++ H H + T + G H T Sbjct: 425 TTTPSGHGHMTTTTPSGHGHMTTTTPSGHGHMTTTTPSGHGHMTTTTPSGHG-HMTTTTP 483 Query: 281 GAHTHSVAIGS--HGHTITVNAAGNAENT 307 H H HGH T +G+ T Sbjct: 484 SGHGHMTTTTPSGHGHMTTTTPSGHGHMT 512 Score = 79.3 bits (193), Expect = 2e-13, Method: Composition-based stats. Identities = 28/154 (18%), Positives = 50/154 (32%), Gaps = 5/154 (3%) Query: 156 QDGIKSHTHSASASSTDLGTETTSSFDYGTKSTNNTGAHTHSISGTANSAGAHQHKSSGA 215 D + + ++ + T S + T +T + H H + T + G + Sbjct: 340 ADQVIQVYTTQASHGGHMTTTPPSGHGHMTTTTPS--GHGHMTTTTPSGHGHMTTTTPLG 397 Query: 216 FGGTNTSIFPNGYTAISNLSAGIMSTTSGSGQTRNAGKTSSDGAHTHSLSGTAASAGAHA 275 G T+ + +G T+ + T++ H H + T + G H Sbjct: 398 HGHMTTTTPSGHGHMTTTTPSGHGHMTTTTPSGHGHMTTTTPSGHGHMTTTTPSGHG-HM 456 Query: 276 HTVGIGAHTHSVAIGS--HGHTITVNAAGNAENT 307 T H H HGH T +G+ T Sbjct: 457 TTTTPSGHGHMTTTTPSGHGHMTTTTPSGHGHMT 490 Score = 79.3 bits (193), Expect = 2e-13, Method: Composition-based stats. 
Identities = 29/150 (19%), Positives = 48/150 (32%), Gaps = 4/150 (2%) Query: 161 SHTHSASASSTDLGTETTSS-FDYGTKSTNNTGAHTHSISGTANSAGAHQHKSSGAFGGT 219 H + + G TT++ +G +T H H + T G + G Sbjct: 353 HGGHMTTTPPSGHGHMTTTTPSGHGHMTTTTPSGHGHMTTTTPLGHGHMTTTTPSGHGHM 412 Query: 220 NTSIFPNGYTAISNLSAGIMSTTSGSGQTRNAGKTSSDGAHTHSLSGTAASAGAHAHTVG 279 T+ + +G T+ + T++ H H + T + G H T Sbjct: 413 TTTTPSGHGHMTTTTPSGHGHMTTTTPSGHGHMTTTTPSGHGHMTTTTPSGHG-HMTTTT 471 Query: 280 IGAHTHSVAIGS--HGHTITVNAAGNAENT 307 H H HGH T +G+ T Sbjct: 472 PSGHGHMTTTTPSGHGHMTTTTPSGHGHMT 501 Score = 78.9 bits (192), Expect = 2e-13, Method: Composition-based stats. Identities = 29/149 (19%), Positives = 50/149 (33%), Gaps = 4/149 (2%) Query: 162 HTHSASASSTDLGTETTSS-FDYGTKSTNNTGAHTHSISGTANSAGAHQHKSSGAFGGTN 220 H H + + + G TT++ +G +T H H + T + G + G Sbjct: 376 HGHMTTTTPSGHGHMTTTTPLGHGHMTTTTPSGHGHMTTTTPSGHGHMTTTTPSGHGHMT 435 Query: 221 TSIFPNGYTAISNLSAGIMSTTSGSGQTRNAGKTSSDGAHTHSLSGTAASAGAHAHTVGI 280 T+ + +G T+ + T++ H H + T + G H T Sbjct: 436 TTTPSGHGHMTTTTPSGHGHMTTTTPSGHGHMTTTTPSGHGHMTTTTPSGHG-HMTTTTP 494 Query: 281 GAHTHSVAIGS--HGHTITVNAAGNAENT 307 H H HGH T + + T Sbjct: 495 SGHGHMTTTTPSGHGHMTTTTPSSHGHMT 523 Score = 75.0 bits (182), Expect = 3e-12, Method: Composition-based stats. Identities = 32/152 (21%), Positives = 51/152 (33%), Gaps = 7/152 (4%) Query: 162 HTHSASASSTDLGTETTSS-FDYGTKSTNNTGAHTHSISGTANSAGAHQHKSSGAFGGTN 220 H H + + G TT++ +G +T H H + T + G + G Sbjct: 387 HGHMTTTTPLGHGHMTTTTPSGHGHMTTTTPSGHGHMTTTTPSGHGHMTTTTPSGHGHMT 446 Query: 221 TSIFPNGYTAISNLSAGIMSTTSGSGQTRNAGKTSSDGAHTHSLSGTAASAGAHAHTVGI 280 T+ + +G T+ + T++ H H + T + G H T Sbjct: 447 TTTPSGHGHMTTTTPSGHGHMTTTTPSGHGHMTTTTPSGHGHMTTTTPSGHG-HMTTTTP 505 Query: 281 GAHTHSVAIG--SHGHTITVNAA---GNAENT 307 H H SHGH A G+ E T Sbjct: 506 SGHGHMTTTTPSSHGHMTAATPASGHGSHETT 537 Score = 75.0 bits (182), Expect = 3e-12, Method: Composition-based stats. 
Identities = 27/145 (18%), Positives = 46/145 (31%), Gaps = 3/145 (2%) Query: 165 SASASSTDLGTETTSSFDYGTKSTNNTGAHTHSISGTANSAGAHQHKSSGAFGGTNTSIF 224 + + + TT + G +T H H + T + G + G T+ Sbjct: 336 TIPNADQVIQVYTTQASHGGHMTTTPPSGHGHMTTTTPSGHGHMTTTTPSGHGHMTTTTP 395 Query: 225 PNGYTAISNLSAGIMSTTSGSGQTRNAGKTSSDGAHTHSLSGTAASAGAHAHTVGIGAHT 284 + +G T+ + T++ H H + T + G H T H Sbjct: 396 LGHGHMTTTTPSGHGHMTTTTPSGHGHMTTTTPSGHGHMTTTTPSGHG-HMTTTTPSGHG 454 Query: 285 HSVAIGS--HGHTITVNAAGNAENT 307 H HGH T +G+ T Sbjct: 455 HMTTTTPSGHGHMTTTTPSGHGHMT 479 Score = 40.4 bits (92), Expect = 0.085, Method: Composition-based stats. Identities = 20/101 (19%), Positives = 29/101 (28%), Gaps = 3/101 (2%) Query: 209 QHKSSGAFGGTNTSIFPNGYTAISNLSAGIMSTTSGSGQTRNAGKTSSDGAHTHSLSGTA 268 H + F G N S + +A + T + T+ H + Sbjct: 303 HHGNQMEFCGFNISQSLYHSLNNVSYAALLELLTIPNADQVIQVYTTQASHGGHMTTTPP 362 Query: 269 ASAGAHAHTVGIGAHTHSVAIGS--HGHTITVNAAGNAENT 307 + G H T H H HGH T G+ T Sbjct: 363 SGHG-HMTTTTPSGHGHMTTTTPSGHGHMTTTTPLGHGHMT 402 >UniRef50_C3X8Y3 Putative uncharacterized protein n=1 Tax=Oxalobacter formigenes OXCC13 RepID=C3X8Y3_OXAFO Length = 270 Score = 80.8 bits (197), Expect = 6e-14, Method: Composition-based stats. Identities = 22/94 (23%), Positives = 34/94 (36%), Gaps = 15/94 (15%) Query: 88 AEFYPVGAPIPWPSDTVPSGYALMQGQTFDKSAYPKLAVAYP----------SGVIPDMR 137 P G + + S P GY G + Y +L A + +PD+R Sbjct: 126 YMAVPAGTVVYFCSHKAPYGYLKADGSAVGREEYKELFAAIGVYFGSGDGVSTFNLPDLR 185 Query: 138 GWTIKGKP-----ASGRAVLSQEQDGIKSHTHSA 166 G I+ +GR + + + D KSH H Sbjct: 186 GEFIRSLDNGRGVDAGRELGNVQMDEFKSHYHGF 219 >UniRef50_D0KGE5 Tail Collar domain protein n=4 Tax=Pectobacterium RepID=D0KGE5_PECWW Length = 157 Score = 80.8 bits (197), Expect = 6e-14, Method: Composition-based stats. 
Identities = 33/142 (23%), Positives = 53/142 (37%), Gaps = 6/142 (4%) Query: 82 TSAHPPAEFYPVGAPIPWPSDTVPSGYALMQGQTFDKSAYPKLAVAYPSGVIPDMRGWTI 141 + A + G W + P G+ + GQ F+ S P LA YPS +PD RG+ Sbjct: 6 SGAWKSSSSIQPGTITMWGTPVPPEGWLELNGQPFNPSGNPVLASLYPSSQVPDFRGYFP 65 Query: 142 KGKPAS------GRAVLSQEQDGIKSHTHSASASSTDLGTETTSSFDYGTKSTNNTGAHT 195 +G RA+LS + D I++ T + + + S +N+ A+ Sbjct: 66 RGWDNGAGIDPDSRAILSVQGDAIRNITGEFNPGGSSNWGKGVFSSYGWPYPSNSGSAND 125 Query: 196 HSISGTANSAGAHQHKSSGAFG 217 SI S + Sbjct: 126 ASIITFDASRVVPTAAENRPTN 147 >UniRef50_C3X3G6 Putative uncharacterized protein n=1 Tax=Oxalobacter formigenes HOxBLS RepID=C3X3G6_OXAFO Length = 237 Score = 80.4 bits (196), Expect = 7e-14, Method: Composition-based stats. Identities = 26/133 (19%), Positives = 45/133 (33%), Gaps = 16/133 (12%) Query: 89 EFYPVGAPIPWPSDTVPSGYALMQGQTFDKSAYPKLAVAYP-----------SGVIPDMR 137 P G+ + S+T P G+ + G +AYP L A + +PD+R Sbjct: 93 NGVPPGSVLYLCSETPPDGWLVADGSMLLVAAYPDLFAAIGTAFGSGDNGMTTFRLPDLR 152 Query: 138 GWTIKGKP-----ASGRAVLSQEQDGIKSHTHSASASSTDLGTETTSSFDYGTKSTNNTG 192 G I+ GR + S + D I++H H S+ Sbjct: 153 GEFIRCLDKGRGLDDGRPLGSVQGDEIRNHNHGFLDIPKVQFGSGVYSWTPQVMEVAEHA 212 Query: 193 AHTHSISGTANSA 205 + +G + + Sbjct: 213 PIATTWTGGSETR 225 >UniRef50_B1M1N8 Tail Collar domain protein n=1 Tax=Methylobacterium radiotolerans JCM 2831 RepID=B1M1N8_METRJ Length = 414 Score = 80.0 bits (195), Expect = 9e-14, Method: Composition-based stats. 
Identities = 72/332 (21%), Positives = 121/332 (36%), Gaps = 49/332 (14%) Query: 32 NIIHLKGMTAVGEGELLIGWSGTSGAHAPAFIRSRRDTTDANWSPWAQLYTSAHPPAEFY 91 +L T + W G + + P AQ+Y + P E Sbjct: 86 TTTNLNPCTLAADNNAPKPWLRWDGTQFGPGDIGQNVVWSVVYDPVAQVYRTLSPTTE-- 143 Query: 92 PVGAPIPWPSDTVPSGYALMQGQTFDKSAYPKLAVAYPSGV----------IPDMRGWTI 141 GA + VPSG+ + G+ ++AY L +G +PD RG T+ Sbjct: 144 QAGAIKAFAGPNVPSGWEICDGRAVSRTAYAALFATISTGWGNGDGFTTFNLPDARGRTL 203 Query: 142 KGKP-ASGRAVLSQEQD-----------------GIKSHTHSASASSTDLG---TETTSS 180 G +GR + D + SH H+++ S + + Sbjct: 204 FGANRGTGRLTAAGGLDGSLGNMGGADQVVMLAPQMPSHIHTSTMSPAGFFEPEIQKAGA 263 Query: 181 FDYGTKSTNNTGAHTHSIS--------GTANSAGAHQHKSSGAFGGTNTSIFPN-GYTAI 231 D+G AH+ + GT +++G H H +G +T N Sbjct: 264 HDHGGTKVGGDHAHSGTTGLSGTHTHGGTTDTSGDHAHVVQYGYGLVSTQTPNNAQVVTG 323 Query: 232 SNLSAGIMSTTSGSGQTRNAGKTSSDGAHTHSLSGTAASAGAHAHTVGI-GAHTHSV-AI 289 NL + T+ SG ++ T G HTH+ S G+HAH + + G HTH++ Sbjct: 324 INLGSQGNGQTTQSGPHQHTFTTGQGGNHTHAFS--TDPGGSHAHEIPVDGDHTHTIDPT 381 Query: 290 GSHGHTITVNAAGNAE---NTVKNIAFNYIVR 318 +H HT+ ++AAG+ N + ++ Sbjct: 382 PNHVHTLVIDAAGSGAPHPNVPPGAVVIWAIK 413 >UniRef50_B3Z3L3 Phage minor structural protein n=3 Tax=Bacillus cereus group RepID=B3Z3L3_BACCE Length = 679 Score = 80.0 bits (195), Expect = 9e-14, Method: Composition-based stats. 
Identities = 31/167 (18%), Positives = 69/167 (41%), Gaps = 6/167 (3%) Query: 148 GRAVLSQEQDGIKSHTHSASASSTDLGTETTSSFDYGTKSTN----NTGAHTHSISGTAN 203 +L+ E + ++++ + + + + + S+ N+ + + T++ Sbjct: 401 NSLILTYETEEFRAYSRATKGGGAIVESTSAGGAVVNSTSSGGGVVNSTSSGGGSTQTSS 460 Query: 204 SAGAHQHKSSGAFGGTNTSIFPNGYTAISNLSAGIMSTTSGSGQTRNAGKTSSDGAHTHS 263 S G S+ GG+ TS G + + SG N+ + + G H H Sbjct: 461 SGGGSTQTSTSGGGGSFTSEAGGGAVPSTTQKSFA-EMHLMSGVPENSIGSENWGNHLHE 519 Query: 264 LSGTAASAGAHAHTVGIGAHTHSVAIGSHGHTITVNAAGNAENTVKN 310 + + +H+HTV + +H H V I +H H++T+ A ++ + Sbjct: 520 IVINGDNF-SHSHTVTVPSHKHQVNIPAHSHSVTIPAHTHSVTIPNH 565 >UniRef50_Q7Y2B3 Gp12 Short tail fibers n=2 Tax=unclassified T4-like viruses RepID=Q7Y2B3_9CAUD Length = 466 Score = 79.7 bits (194), Expect = 1e-13, Method: Composition-based stats. Identities = 28/152 (18%), Positives = 51/152 (33%), Gaps = 16/152 (10%) Query: 70 TDANWSPWAQLYTSAHPPAEFYPVGAPIPWPSDTVPSGYALMQGQTFDKSAYPKLAVAYP 129 + S Q T P+G I + + + G++ +K+ YP+L A Sbjct: 292 YKGSNSDGNQFVTKNELANHAMPIGGIILSGFNADRGDFLICNGRSLNKNQYPQLFSAIG 351 Query: 130 --------SGVIPDMRGWTIKGKP-----ASGRAVLSQEQDGIKSHTHSASASS---TDL 173 + +PDMRG +G GR S ++D ++ T + Sbjct: 352 YTFGGSGDNFNLPDMRGLVARGCDHGRNLDPGRRFGSYQEDAMQRITGKFPVADRWRGWY 411 Query: 174 GTETTSSFDYGTKSTNNTGAHTHSISGTANSA 205 G T+ + + N G + +S Sbjct: 412 GGAFTAQRGQWSTNYKNGGGDDWGTTVNFDSG 443 >UniRef50_C6BT48 Tail Collar domain protein n=1 Tax=Desulfovibrio salexigens DSM 2638 RepID=C6BT48_DESAD Length = 208 Score = 78.5 bits (191), Expect = 3e-13, Method: Composition-based stats. 
Identities = 39/148 (26%), Positives = 63/148 (42%), Gaps = 13/148 (8%) Query: 87 PAEFYPVGAPIPWPSDTVPSGYALMQGQTFDKSAYPKLAVAYPSGVIPDMRGWTIKGKP- 145 A YP+GA + DT P G+ GQ + YP+LA + +PD+RG I+G Sbjct: 57 AASDYPIGAVAAYRGDTPPVGWLECNGQ--STTGYPELAAVVGAN-VPDLRGEFIRGLDS 113 Query: 146 ----ASGRAVLSQEQDGIKSHTHSASASSTDLGTETTSSFDYGTK-----STNNTGAHTH 196 +GRA+ S + D ++ H+H + + + + T S + +T N G+ Sbjct: 114 GRGVDAGRALGSAQADAMERHSHQTTITVSGRTSVTASPYHSAGAARSLVTTPNFGSPFG 173 Query: 197 SISGTANSAGAHQHKSSGAFGGTNTSIF 224 S +A+ G SGA Sbjct: 174 GASFSASGTGTSTSVGSGAETRPRNVAL 201 >UniRef50_Q7N047 Similarities with bacteriophage protein n=3 Tax=Photorhabdus RepID=Q7N047_PHOLL Length = 602 Score = 78.5 bits (191), Expect = 3e-13, Method: Composition-based stats. Identities = 34/123 (27%), Positives = 55/123 (44%), Gaps = 6/123 (4%) Query: 90 FYPVGAPIPWPSDTV-PSGYALMQGQTFDKSAYPKLAVAYPSGVIPDMRGWTIKGKP--- 145 P+GA I W S P+GY +G+ F + YP+LA +P +PD RG +G Sbjct: 457 GVPIGATIEWHSTAPIPAGYEPNEGRAFRAADYPELAKIFPDLKLPDDRGLFKRGLDRGR 516 Query: 146 --ASGRAVLSQEQDGIKSHTHSASASSTDLGTETTSSFDYGTKSTNNTGAHTHSISGTAN 203 SGR++ S + D I++ T S + + G+ + +F Y K+ + Sbjct: 517 GLDSGRSLGSVQGDAIRNITGSLGKPTIESGSNASGAFSYQYKAGGRAAGAGGGVIAWTF 576 Query: 204 SAG 206 A Sbjct: 577 DAS 579 >UniRef50_A9IXL3 Phage-related protein n=6 Tax=Bartonella RepID=A9IXL3_BART1 Length = 334 Score = 78.1 bits (190), Expect = 4e-13, Method: Composition-based stats. Identities = 30/137 (21%), Positives = 51/137 (37%), Gaps = 23/137 (16%) Query: 67 RDTTDANWSPWAQLYTS--------AHPPAEFYPVGAPIPWPSDTVPSGYALMQGQTFDK 118 RD N W + P E + G + S+ +PSG+ L G+ + + Sbjct: 138 RDIAGKNADGWFLTNPTIKLPEIPPFPPLPESFSPGFIGTFASEKIPSGWLLCDGKEYSR 197 Query: 119 SAYPKLAVAYP----------SGVIPDMRGWTIKGKP-----ASGRAVLSQEQDGIKSHT 163 Y L + +PD+RG ++G GR + S++++ KSHT Sbjct: 198 KNYANLFAVLGETWGKGDGKTTFNVPDLRGMFLRGLDSGKEIDKGRLLGSRQEESFKSHT 257 Query: 164 HSASASSTDLGTETTSS 180 H ST + + Sbjct: 258 HEGKTDSTGKHQHSYPT 274 Score = 43.8 bits (101), Expect = 0.007, Method: Composition-based stats. 
Identities = 24/128 (18%), Positives = 35/128 (27%), Gaps = 23/128 (17%) Query: 212 SSGAFGGTNTSIFPNGYTAISNLSAGIMSTTSGSGQTRNAGKTSSDGAHTHSLSGTAASA 271 G + F L G+ + + S +HTH G S Sbjct: 208 GETWGKGDGKTTFNVPDLRGMFLRGLDSGKEIDKGRLLGSRQEESFKSHTHE--GKTDST 265 Query: 272 GAHAHT---------------------VGIGAHTHSVAIGSHGHTITVNAAGNAENTVKN 310 G H H+ V T + G H H + + G E N Sbjct: 266 GKHQHSYPTIKNDILRYKREDYKGYVAVVYKTDTLTEPAGEHEHKVLLQKTGGDETRPVN 325 Query: 311 IAFNYIVR 318 +A Y V+ Sbjct: 326 MAVVYAVK 333 >UniRef50_Q4KHC6 Tail fibre protein, putative n=1 Tax=Pseudomonas fluorescens Pf-5 RepID=Q4KHC6_PSEF5 Length = 369 Score = 77.3 bits (188), Expect = 7e-13, Method: Composition-based stats. Identities = 46/250 (18%), Positives = 82/250 (32%), Gaps = 35/250 (14%) Query: 68 DTTDANWSPWAQLYTSAHPPAEFYPVGAPIPWPSDTVPSGYALMQGQTFDKSAYPKLAVA 127 D + + + PVGA +P+P TVP+G+ + G + YP LA Sbjct: 90 DAYTKSVTYTKAEIEALLKNMSALPVGAMVPFPKGTVPAGFLEVDGSVQSAATYPDLAAY 149 Query: 128 YPS-----------GVIPDMRGWTIKGKP-----ASGRAVLSQEQDGIKSHTHSASASST 171 + +P+ RG ++G GRA+ S + + SH H + + Sbjct: 150 LGTMFNTGGEGAGNFRLPESRGEFLRGWDHGRGVDVGRALGSYQAHAVGSHQHPMNYWAW 209 Query: 172 DLGTETTSSFDYGTKSTNNTGAHTHSISGTANSAGAHQHKSSGAFGGTNTSIFPNGYTAI 231 GT T + K +TG GT +AG + N ++ Sbjct: 210 RDGTGTGTHNYA--KPWGDTGITGVKDPGTGANAGDSE------TRPRNLAVMWCIKAWN 261 Query: 232 SNLSAGIMSTTSGSGQTRNAGKTSSDGAHTHSLSGTAASAGAHAHTVGIGAHTHSVAIGS 291 + ++ G + + + + AS + +V + Sbjct: 262 APVNQGNIDISGLAANVSALETRPRGLGDGQAWQNVTAS-----------RVSGTVYTNT 310 Query: 292 HGHTITVNAA 301 G I V A+ Sbjct: 311 TGRPIQVQAS 320 >UniRef50_C3X192 Predicted protein n=1 Tax=Oxalobacter formigenes HOxBLS RepID=C3X192_OXAFO Length = 361 Score = 77.3 bits (188), Expect = 7e-13, Method: Composition-based stats. 
Identities = 33/187 (17%), Positives = 62/187 (33%), Gaps = 21/187 (11%) Query: 89 EFYPVGAPIPWPSDTVPSGYALMQGQTFDKSAYPKLAVAYP----------SGVIPDMRG 138 P+G + T P+GY G ++ YP L A + +PDM G Sbjct: 78 SGVPIGTVEYFAMATPPAGYLKADGAAVGRATYPDLFAAIGTTFGAGDGETTFNLPDMIG 137 Query: 139 WTIKGKPASGRAVLSQEQDGIKSHTHSAS-ASSTDLGTETTSSFDYGTKSTNNTGAHTHS 197 +G G + ++ G+ + S S +S + S +NN S Sbjct: 138 QFAEGSATPG----AVKEAGLPNIIGSISNVASGGANASSASGALSIAARSNNNMTPGSS 193 Query: 198 ISGTANSAGAHQHKSSGAFGGTNTSIF------PNGYTAISNLSAGIMSTTSGSGQTRNA 251 G + + + +G +NT P + ++ G++ T + + Sbjct: 194 AYGHTFALAINASDFNPIYGKSNTVQPPALTLLPCIKAFDAAVNPGLIDITELANEMAGK 253 Query: 252 GKTSSDG 258 +G Sbjct: 254 VDKVING 260 >UniRef50_B3FYL6 Gp17 n=1 Tax=Salmonella phage phiSG-JL2 RepID=B3FYL6_9CAUD Length = 658 Score = 77.0 bits (187), Expect = 8e-13, Method: Composition-based stats. Identities = 47/305 (15%), Positives = 92/305 (30%), Gaps = 49/305 (16%) Query: 23 NNGYPTAYGNIIHLKGMTAVG--EGELLIGWSGTSGAHAPAFIRSRRDTTDANWSPWAQL 80 + + A + G +A G S T +++ A ++ A+ + Sbjct: 203 SRTFAEAAQGSANSAGQSATNANNAMQSAGQSATDASNSAAQAKASEVNAKASEVNAKRD 262 Query: 81 YTSAHPPAEF---YPVGAPIPWPSDTVPSGYALMQGQTFDKSAYPKLAVAYPSGVIPDMR 137 +A + PVG +P+G+ G+ FD + YP LA +PSG P Sbjct: 263 ADAALLALQSTGNVPVGTVAMITHTKIPTGWVRA-GEDFDVNTYPALAELFPSGRTPSFD 321 Query: 138 GWTIKGKPASGRAVLSQEQDGIKSHTHSASASSTDLGTETTSSFDYGTKSTNNTGAHTHS 197 G G + AH+H+ Sbjct: 322 DRYPIGNST---------------------------------VLTPGQLIDQSVPAHSHT 348 Query: 198 ISGTANSAGAHQHKSSGAFGGTNTSIFPNGYTAISNLSAGIMSTTSGSGQTRNAGKTSSD 257 N +GA ++ +G++ + G + G Sbjct: 349 FDVPVNVSGATAAGGEYRARTSHEGDHSHGFSLPIQNNTGAYTGRLVGGGNNPNYPQDLR 408 Query: 258 GAHTHSLSGTAASAGAHAHTVGIGAHTHSVAI-GSHGHTITVNAAGNAE-NTVKNIAFNY 315 GAH+H + +H+H++ G +++ + GN+ + + Sbjct: 409 FN--------TGGGGAHSHEFYVPSHSHTLNASGRAAGSVSSSGIGNSPYVRPYSTVVIF 460 Query: 316 IVRLA 320 I++ A Sbjct: 461 IIKAA 465 >UniRef50_C2RWX3 Phage minor structural protein n=1 Tax=Bacillus cereus BDRD-ST24 RepID=C2RWX3_BACCE Length = 695 Score = 77.0 bits (187), Expect = 
9e-13, Method: Composition-based stats. Identities = 36/160 (22%), Positives = 63/160 (39%), Gaps = 10/160 (6%) Query: 148 GRAVLSQEQDGIKSHTHSASASSTDLGTETTSSFDYGTKSTNNTGAHTHSISGTANSAGA 207 G+A + A +ST G T S G + +TG +S + + G Sbjct: 413 GKATKGGGATVQSTGGGGAIIASTGAGGGTVQSTGGGGGTVQSTGGGGAQVSTSTSGGGV 472 Query: 208 HQHKSSGAFGGTNTSIFPNGYTAISNLSAGIMSTTSGSGQTRNAGKTSSDGAHTHSLSGT 267 + SG GG+ + G + ++ + + SG N+ + + G H H + Sbjct: 473 SKSTESG--GGSTQTSGAGGGISTTSDHKTFLELSIMSGVPENSIGSENWGNHLHEIKIP 530 Query: 268 AA--------SAGAHAHTVGIGAHTHSVAIGSHGHTITVN 299 + H H+V I H+H+ ++ SH H IT+N Sbjct: 531 GDYFTHNHAINLPNHYHSVLIQPHSHNFSVPSHSHQITLN 570 >UniRef50_Q93D60 PilV n=7 Tax=Escherichia coli RepID=Q93D60_ECOLX Length = 456 Score = 77.0 bits (187), Expect = 9e-13, Method: Composition-based stats. Identities = 41/110 (37%), Positives = 54/110 (49%), Gaps = 5/110 (4%) Query: 31 GNIIHLKGMTAVGEGELLIGWSGTSGAHAPAFIRSRRDTTDANWSPWAQLYTSAHPPAEF 90 G + G + GE L + + T+G R +T A S + E Sbjct: 283 GGTVRSDGRLSTGE-YLQLDKTATAGTKCAPDGLVGRTSTGAILS----CQSGMWAGFES 337 Query: 91 YPVGAPIPWPSDTVPSGYALMQGQTFDKSAYPKLAVAYPSGVIPDMRGWT 140 YPVG+PIPWPS T P GY +M GQ+F S YP+LA AYP +PD+R Sbjct: 338 YPVGSPIPWPSATPPQGYLVMNGQSFSCSRYPQLARAYPGCKLPDLRRCF 387 >UniRef50_C3X909 Predicted protein n=2 Tax=Oxalobacter formigenes OXCC13 RepID=C3X909_OXAFO Length = 549 Score = 76.2 bits (185), Expect = 1e-12, Method: Composition-based stats. 
Identities = 26/193 (13%), Positives = 60/193 (31%), Gaps = 23/193 (11%) Query: 88 AEFYPVGAPIPWPSDTVPSGYALMQGQTFDKSAYPKLAVAYP----------SGVIPDMR 137 PVGA + T P+GY G ++ YP L A + +PD+ Sbjct: 261 PSGVPVGAIGYFAMQTPPAGYLKADGSAVSRATYPDLFGAIGTTFGEGDGSTTFNLPDLI 320 Query: 138 GWTIKGKPASGRAVLSQEQDGIKSHTHSASASSTDLGTETTSSF---------DYGTKST 188 +G G + + G+ + T S + ++++ G+ + +F Sbjct: 321 DRFAQGNATPGLKI----EAGLPNITGSLTVTASNQGSAASGAFSRTQIGAVGGGLGGGQ 376 Query: 189 NNTGAHTHSISGTANSAGAHQHKSSGAFGGTNTSIFPNGYTAISNLSAGIMSTTSGSGQT 248 N+ ++ + + +S ++ P + + + + Sbjct: 377 YNSSGCGPNLYSFDSRVSNPIYGASNTVQPPALTLLPCIKAFDAYPAVQAGISAMNAIGA 436 Query: 249 RNAGKTSSDGAHT 261 ++ H Sbjct: 437 PTRAARATVEPHA 449 >UniRef50_C3X3K6 Predicted protein n=2 Tax=Oxalobacter formigenes HOxBLS RepID=C3X3K6_OXAFO Length = 500 Score = 75.4 bits (183), Expect = 2e-12, Method: Composition-based stats. Identities = 29/204 (14%), Positives = 64/204 (31%), Gaps = 23/204 (11%) Query: 86 PPAEFYPVGAPIPWPSDTVPSGYALMQGQTFDKSAYPKLAVAYP----------SGVIPD 135 PVG + + + P+GY G + YP L A + +PD Sbjct: 208 ASQRGIPVGTVVMFSASEAPAGYLKCDGAAVGRDTYPDLFAAIGTVFGAGDGETTFNLPD 267 Query: 136 MRGWTIKGKPASGRAVLSQEQD--GIKSHTHSASASSTDLGTETTSSFDYGTKSTNNTGA 193 M G +G G + D G + ++ ++ + T++ + TN+ A Sbjct: 268 MIGRFAEGSLTPGTVKEAGLPDVTGTIRLSDNSQINAVEADKIATANGAFSRVRTNSPTA 327 Query: 194 HTHSISGTANSAG-----------AHQHKSSGAFGGTNTSIFPNGYTAISNLSAGIMSTT 242 ++ + A + + +S ++ P + ++ G++ T Sbjct: 328 YSTASVDVATTNKYDRVDFSLASQNPLYGNSDTVQPPALTLLPCIKAFDAAVNPGLIDVT 387 Query: 243 SGSGQTRNAGKTSSDGAHTHSLSG 266 + + + S Sbjct: 388 ELANEVTTKTTPAQAANAAMPSST 411 >UniRef50_Q38190 Gp37, tip of tail fiber (Fragment) n=5 Tax=Enterobacteria phage T4 RepID=Q38190_BPT4 Length = 226 Score = 75.0 bits (182), Expect = 3e-12, Method: Composition-based stats. 
Identities = 62/157 (39%), Positives = 82/157 (52%), Gaps = 6/157 (3%) Query: 164 HSASASSTDLGTETTSSFDYGTKSTNNTGAHTHSISGTANSAGAHQHKSSGAFGGTNTSI 223 + + + + S+ G +TN G H+H+ S +SAG H H Sbjct: 75 WNGTGVGGNKMSSYAISYRAGGSNTNAAGNHSHTFSFGTSSAGDHSHSVGIGEHS----- 129 Query: 224 FPNGYTAISNLSAGIMSTTSGSGQTRNAGKTSSDGAHTHSLSGTAASAGAHAHTVGIGAH 283 G ++ T++ G H+H+ S +SAG H+H+VGIGAH Sbjct: 130 -HYIEAWNGTGVGGNKMSSYAISYRAGGSNTNAAGNHSHTFSFGTSSAGDHSHSVGIGAH 188 Query: 284 THSVAIGSHGHTITVNAAGNAENTVKNIAFNYIVRLA 320 TH+VAIGSHGHTITVN+ GN ENTVKNIAFNYIV LA Sbjct: 189 THTVAIGSHGHTITVNSTGNTENTVKNIAFNYIVALA 225 >UniRef50_A9ITX5 Phage-related protein n=6 Tax=Bartonella RepID=A9ITX5_BART1 Length = 333 Score = 75.0 bits (182), Expect = 3e-12, Method: Composition-based stats. Identities = 26/133 (19%), Positives = 44/133 (33%), Gaps = 18/133 (13%) Query: 72 ANWSPWAQLYTSAHP---PAEFYPVGAPIPWPSDTVPSGYALMQGQTFDKSAYPKLAVAY 128 + W + + P + YP G + VP + + G+ + + Y L Sbjct: 136 SRGDSWYLVNPTPMPREEESSLYPTGFIGTFGMRDVPKDWLICDGKAYLRRDYRDLFETI 195 Query: 129 P----------SGVIPDMRGWTIKGKP-----ASGRAVLSQEQDGIKSHTHSASASSTDL 173 + +PD RG ++G R S + D I+SH H S Sbjct: 196 GTVWGEGDSVTTFNVPDFRGMFLRGVDGGSNLDPNRRFASVQTDLIQSHQHEGQTLSMPH 255 Query: 174 GTETTSSFDYGTK 186 T + +D T Sbjct: 256 FTSNENFWDGNTT 268 >UniRef50_Q6J803 Pas28 n=1 Tax=Actinoplanes phage phiAsp2 RepID=Q6J803_9CAUD Length = 1291 Score = 74.3 bits (180), Expect = 5e-12, Method: Composition-based stats. 
Identities = 57/266 (21%), Positives = 83/266 (31%), Gaps = 28/266 (10%) Query: 57 AHAPAFIRSR---RDTTDANWSPWAQLYTSAHPPAEFYPVGAPIPWPSDTV--PSGYALM 111 ++ R+R R+ D S W+ Y P G + WP P G+ Sbjct: 701 EPCCSYYRARTIGREDGDLRISDWSDTYDPG------IPSGIIVMWPGTDASLPEGW--- 751 Query: 112 QGQTFDKSAYPKLAVAYPSGVIPDMRGWTIKGKPASGRAVLSQEQDGIKSHTHSASASST 171 L YP GV D G A+ A+S Sbjct: 752 -------ERTTALDGRYPKGVPDDTTQPGTTGGAATHSHTTPGHTHDTSHLHTVTGATSA 804 Query: 172 DLGTETTSSFDYGTKSTNNTGAHTHSISGTANSAGAHQHKSSGAFGGTN-TSIFPNGYTA 230 GT +S GT NT HTH+ T ++ S G +N + + Sbjct: 805 ATGTFASSDGAVGTTVALNT--HTHTRPSTNSATVVSGSASPGTNTASNDPARAEVIFME 862 Query: 231 ISNLSAGIMSTTSG-SGQTRNAGKTSSD-GAHTHSLSGTAASAGAHAHTVGIGAHTHSVA 288 G+ + +G S G A +AG T G +H+ Sbjct: 863 SDGSPLGLPDGALALTLDVALSGWADSSLGNTGGRFIKGAPAAGDGGTTAGSSVASHTHD 922 Query: 289 IGSHGHTITVNAAGNAENTVKNIAFN 314 I +HGHT T + G+ N + A N Sbjct: 923 IDAHGHTGTSH--GHTSNPTNSFASN 946 >UniRef50_Q116W7 Phage Tail Collar n=1 Tax=Trichodesmium erythraeum IMS101 RepID=Q116W7_TRIEI Length = 671 Score = 73.9 bits (179), Expect = 7e-12, Method: Composition-based stats. Identities = 32/119 (26%), Positives = 47/119 (39%), Gaps = 15/119 (12%) Query: 90 FYPVGAPIPWPSDTVPSGYALMQGQTFDKSAYPKLAVAYPS-GVIPDMRGWTIKGKPA-- 146 PVG +P+ T P G+ L GQ++D Y +L V+PD++G I G Sbjct: 525 VVPVGTIVPYAGLTAPEGWLLCNGQSYDWEQYSELYKVLDEIKVLPDLKGRFIIGVGDKD 584 Query: 147 ---------SGRAVLSQEQDGIKSHTHSASASSTDL---GTETTSSFDYGTKSTNNTGA 193 G + +D + SH HS L G TTS+ + N G+ Sbjct: 585 GYSYSLNAKGGEEKHTLTKDEMPSHDHSKGEYKFILKKDGKVTTSNNVNNSLREPNLGS 643 >UniRef50_D0KGE3 Tail Collar domain protein n=1 Tax=Pectobacterium wasabiae WPP163 RepID=D0KGE3_PECWW Length = 144 Score = 73.9 bits (179), Expect = 8e-12, Method: Composition-based stats. 
Identities = 36/134 (26%), Positives = 52/134 (38%), Gaps = 10/134 (7%) Query: 98 PWPSDTVPSGYALMQGQTFDKSAYPKLAVAYPSGVIPDMRGWTIKGKP-------ASGRA 150 W + P G+ + GQ F+ S P LA YPS +PD RG+ +G S R+ Sbjct: 1 MWGTPVPPEGWLELNGQLFNPSGNPVLADLYPSSRVPDFRGYFPRGWDNGAGIDPDSSRS 60 Query: 151 VLSQEQDGIKSHTHSASASSTDLGTETTSSFDYGTKSTNNTGAHTHSISGTA---NSAGA 207 VLS + D I SH H+ + S G + F S G+ + AG Sbjct: 61 VLSYQDDEIISHKHAITMSHEHHGAADGAGFPQTDASGPMIKHAETEPDGSFPERSGAGN 120 Query: 208 HQHKSSGAFGGTNT 221 G+ + Sbjct: 121 PMFSFGGSETRPHN 134 >UniRef50_C3X8R9 Bacteriophage tail fiber protein n=6 Tax=Oxalobacter formigenes OXCC13 RepID=C3X8R9_OXAFO Length = 398 Score = 73.5 bits (178), Expect = 8e-12, Method: Composition-based stats. Identities = 27/171 (15%), Positives = 52/171 (30%), Gaps = 18/171 (10%) Query: 92 PVGAPIPWPSDTVPSGYALMQGQTFDKSAYPKLAVAYP----------SGVIPDMRGWTI 141 P+G+ + +PSGY G + YP L A + +PD+ G Sbjct: 108 PIGSIDYFAMAALPSGYLKADGAEVGRETYPDLFAAIGTVFGEGNGETTFNLPDLIGRFP 167 Query: 142 KGKPASGRAVLSQEQDGIKSHTHSASASSTDLGTETTSSFDYGTKSTNNTGAHTHSIS-- 199 +G G+ V Q G+ + T A G +F + ++ Sbjct: 168 QGSARPGQRV----QAGLPNITGKFRA-KAAAGEIPGGAFYGIGNIGGGSSDNSAPNYEE 222 Query: 200 -GTANSAGAHQHKSSGAFGGTNTSIFPNGYTAISNLSAGIMSTTSGSGQTR 249 G S + +S ++ + + G++ + Sbjct: 223 IGFDASKSNLIYGASDTVQPAALTLLACIKAFDAASNPGLVDVAGLARDVH 273 >UniRef50_D1NFN8 Apo-citrate lyase phosphoribosyl-dephospho-CoA transferase (Fragment) n=1 Tax=Haemophilus influenzae HK1212 RepID=D1NFN8_HAEIN Length = 301 Score = 73.5 bits (178), Expect = 8e-12, Method: Composition-based stats. 
Identities = 25/108 (23%), Positives = 39/108 (36%), Gaps = 13/108 (12%) Query: 33 IIHLKGMTAVGEGELLIGWSGTSGAHAPAFIRSRRDTTDANWSPWAQLYTSAHPPAEFYP 92 I + G +L T + R + +WS W +L T P Sbjct: 18 IQVIAGGDGTWCRQLAYVAYSTD-----VYERHQTSYQTDSWSAWKKLNTDG------IP 66 Query: 93 VGAPIPWPSD-TVPSGYALMQGQTFDKSAYPKLAVAYP-SGVIPDMRG 138 GA + +P T P G+ G TF++ +P L S +PD+ Sbjct: 67 TGAVVSFPRAVTNPVGFLKANGSTFNQQTFPDLYRVLGNSNQLPDLTR 114 >UniRef50_C3XAA4 Putative uncharacterized protein n=1 Tax=Oxalobacter formigenes OXCC13 RepID=C3XAA4_OXAFO Length = 305 Score = 73.5 bits (178), Expect = 1e-11, Method: Composition-based stats. Identities = 31/174 (17%), Positives = 57/174 (32%), Gaps = 24/174 (13%) Query: 79 QLYTSAHPPAEFYPVGAPIPWPSDTVPSGYALMQGQTFDKSAYPKLAVAYP--------- 129 Q P+ PVG + T P+GY G + YP+L Sbjct: 2 QNPAKGITPSSGVPVGTIEYFAMVTSPAGYLKANGAAVGRETYPELYATIGTTFGEGDGS 61 Query: 130 -SGVIPDMRGWTIKGKPASGRAVLSQEQDGIKSHTHSASASSTDLGTETTSSFDYGTKST 188 + +PD+ +G G+ + + G+ H H+ + + GT Y + Sbjct: 62 STFNLPDLIDRFAQGSNTPGQKI----EAGLSDHNHTLPLALEETGT------GYAAHGS 111 Query: 189 NNTGAHTHSISGTANSAGAHQHKSSGAFGGTNTSIFPNGYTAISNLSAGIMSTT 242 N + T SA + +S ++ P + + G++ T Sbjct: 112 NISSGTTVGY----ASASNPIYGASNTVQPPALTLLPCIKAFDAATNPGLIDIT 161 >UniRef50_D0FSD9 Phage related-protein n=2 Tax=Erwinia pyrifoliae RepID=D0FSD9_ERWPY Length = 311 Score = 71.6 bits (173), Expect = 3e-11, Method: Composition-based stats. 
Identities = 48/248 (19%), Positives = 79/248 (31%), Gaps = 53/248 (21%) Query: 72 ANWSPWAQLYTSAHPPAEFYPVGAPIPWPSDTVPSGYALMQGQTFDKSAYPKLAVAYPSG 131 S W + +P YP+G + P+ +P Y Sbjct: 110 GKGSGWVEFKADVNPVDMLYPIGIVTWFAQKKDPN------------KLFPGTTWKY--- 154 Query: 132 VIPDMRGWTIKGKPASGRAVLSQEQDGIKSHTHSASASSTDLGTETTSSFDYGTKSTNNT 191 TI+ A+G V++ + + + T SFDYGTK Sbjct: 155 ---IGENRTIRLASANGSDVMTTGGSDSVTLAVGNIPAHGHTFSANTGSFDYGTK----- 206 Query: 192 GAHTHSISGTANSAGAHQHKSSGAFGGTNTSIFPNGYTAISNLSAGIMSTTSGSGQTRNA 251 + G + G+ + + P G + + + Sbjct: 207 -------GTSTFDYGNKVTDTQGSHTHSYNEVIPRGASGMDIGGIWETTIRGSD------ 253 Query: 252 GKTSSDGAHTHSLSGTAASAGAHAHTVGIGAHTHSVAIGSHGHTITVNAAGNAENTVKNI 311 ++AGAHAH V IGAH H+V IG+H H+++ A T N+ Sbjct: 254 ----------------TSTAGAHAHNVAIGAHGHTVEIGAHSHSVSGTTANTGAGTAINV 297 Query: 312 AFNYIVRL 319 N ++L Sbjct: 298 T-NAFIKL 304 >UniRef50_Q7P176 Probable bacteriophge tail fiber protein n=1 Tax=Chromobacterium violaceum RepID=Q7P176_CHRVO Length = 435 Score = 71.6 bits (173), Expect = 3e-11, Method: Composition-based stats. Identities = 37/219 (16%), Positives = 73/219 (33%), Gaps = 23/219 (10%) Query: 7 TDNTQGAAGLELYEVYNNGYPTAYGNIIHLKGMTAVGEGELLIGWSGTSGAHAPAFIRSR 66 T GA+ +L + N+ A G + L+ + A +G A + ++ Sbjct: 205 YGITDGASKTDLQKAINDLVAGAPGALNTLQELAAA------LGNDANYAASITKQLSNK 258 Query: 67 RDTTDANWSPWAQLYTSAHPPAEFYPVGAPIPWPSDTVPSGYALMQGQTFDKSAYPKLAV 126 D + P G + P+G+ + G+T + YP L Sbjct: 259 ADKATTLAGYGIADGATVTQVNAAAPAGMVAYFAMKDAPAGWLIADGRTVARKDYPALFA 318 Query: 127 AYP----------SGVIPDMRGWTIKGKP-----ASGRAVLSQEQDG--IKSHTHSASAS 169 A + +P++ G I+G +GRA+ S + + + + Sbjct: 319 AIGGLYGNGDGSTTFGLPNLCGEFIRGWDNGRGVDTGRAIGSSQISTQLLVDNDGLQTVG 378 Query: 170 STDLGTETTSSFDYGTKSTNNTGAHTHSISGTANSAGAH 208 + D + S+ Y N H + + +N A + Sbjct: 379 AIDWSSNNLSALGYEPAQANAANLHFINSTTISNPADSS 417 >UniRef50_D1BW55 Tail Collar domain protein n=1 Tax=Xylanimonas cellulosilytica DSM 15894 RepID=D1BW55_XYLCX Length = 443 Score = 70.8 bits (171), Expect = 6e-11, Method: Composition-based stats. 
Identities = 47/233 (20%), Positives = 80/233 (34%), Gaps = 18/233 (7%) Query: 88 AEFYPVGAPIPWPSDTVPSGYALMQGQTFDKSAYPKLAVAYPSGVIPDMRGWTIKGKPAS 147 PVG + TVP G+ G ++ YP L Y + Sbjct: 223 NAACPVGMEAGFH--TVPPGWLEHNGAAVSRTTYPALFAHYGTTY-----------GAGD 269 Query: 148 GRAVLSQEQDGIKSHTHSASASSTDLGTETTSSFDYGTKSTNNTGAHTHSISGTANS-AG 206 G + ++ +A + T T ST +HTH+ + +S Sbjct: 270 GSTTFNLPNAKGRTPVGLDTAQAEFNAVGKTGGAKTHTLSTAEMPSHTHTSAAHTHSINH 329 Query: 207 AHQHKSSGAFGGTNTSIFPNGYTAISNLSAGIMSTTSGSGQTRNAGKTSSDGAHTHSLSG 266 H +S + G +G T + + G +S + T + G + +S Sbjct: 330 DHAAVTSSSAGSHTHGSSTSGITDRAYFARGSAPASSATVGTNGV----TPGPWDYVVSS 385 Query: 267 TAASAGAHAHTVGIGAHTHSVAIGSHGHTITVNAAGNAENTVKNIAFNYIVRL 319 T ASAGAH HTV + + + + + G T + + N + + VR Sbjct: 386 TLASAGAHTHTVDLPSFSGTSGSTTPGATGSTGSGSAHNNLPPYLVRRWCVRA 438 >UniRef50_B5S308 Phage tail collar protein n=2 Tax=Ralstonia solanacearum RepID=B5S308_RALSO Length = 225 Score = 70.4 bits (170), Expect = 8e-11, Method: Composition-based stats. Identities = 22/144 (15%), Positives = 40/144 (27%), Gaps = 15/144 (10%) Query: 94 GAPIPWPSDTVPSGYALMQGQTFDKSAYPKLAVAYPS----------GVIPDMRGWTIKG 143 G+ + T P+G+ G ++ Y +L + +P++R +G Sbjct: 66 GSVAMFACKTPPAGWLKCNGAAVSRTTYERLFKLIGTTFGAGDGAATFNLPELRAEFPRG 125 Query: 144 KPA-----SGRAVLSQEQDGIKSHTHSASASSTDLGTETTSSFDYGTKSTNNTGAHTHSI 198 SGRA S + + SH H + + A + Sbjct: 126 WDDGRGVDSGRAFGSSQAQALSSHQHKTAVGFDGSNLFGWGDGSATPIFGSEVQAGVLRV 185 Query: 199 SGTANSAGAHQHKSSGAFGGTNTS 222 G +G S Sbjct: 186 VGAVTQSGGAARIGYTDVTPMGVS 209 >UniRef50_B5TAB1 Gp47 n=2 Tax=root RepID=B5TAB1_9CAUD Length = 325 Score = 70.0 bits (169), Expect = 1e-10, Method: Composition-based stats. 
Identities = 33/194 (17%), Positives = 64/194 (32%), Gaps = 55/194 (28%) Query: 131 GVIPDMRGWTIKGKP-----ASGRAVLSQEQDGIKSHTHSASASSTDLGTETTSSFDYGT 185 +PD+RG ++ R + S + I+SH H+A++ Sbjct: 182 FRLPDVRGEGLRLWDNGRGVDQARTLGSWQGGAIESHGHAANSGDAGA---------VAD 232 Query: 186 KSTNNTGAHTHSISGTANSAGAHQHKSSGAFGGTNTSIFPNGYTAISNLSAGIMSTTSGS 245 + T + G H H+ G T + Y S S + Sbjct: 233 RRTGSGGGHNHN-------------------NGIFTRLLRAPYVGSITGSDTTNSGDEQA 273 Query: 246 GQTRNAGKTSSDGAHTHSLSGTAASAGAHAHTVGIGAHTHSVAIGSHGHTITVNAAGNAE 305 ++ ++ G H H + G +G H H I+++A G E Sbjct: 274 VGGGDSADIAAVGDHDHLIPG----------------------VGPHRHDISISATGGNE 311 Query: 306 NTVKNIAFNYIVRL 319 ++N+A ++++ Sbjct: 312 TRMRNVAVAALIKI 325 >UniRef50_C3LHF1 Phage minor structural protein n=13 Tax=Bacteria RepID=C3LHF1_BACAC Length = 657 Score = 69.3 bits (167), Expect = 2e-10, Method: Composition-based stats. Identities = 30/128 (23%), Positives = 49/128 (38%), Gaps = 17/128 (13%) Query: 176 ETTSSFDYGTKSTNNTGAHTHSISGTANSAGAHQHKSSGAFGGTNTSIFPNGYTAISNLS 235 + +SS K++++ G H H G G P T+ S + Sbjct: 454 QASSSGGGTVKASSSGGDHVHK----MFHGGGIVPAEPSTIGLYTAFSDPGRNTSASFYA 509 Query: 236 AGIMSTTSGSGQTRNAGKTSSDGAHTHSLS----GTAASAGAHAHTVGIGAHTHSVAIGS 291 G S+ G S G HTH +S + + H H++ I +TH ++I + Sbjct: 510 KGTGSSFYTYG---------SSGNHTHDISIPNHTHSINIPNHTHSISIPNYTHDISIPN 560 Query: 292 HGHTITVN 299 H H IT+ Sbjct: 561 HTHDITLP 568 >UniRef50_C4VIX0 74kDa protein n=28 Tax=root RepID=C4VIX0_ENTFA Length = 671 Score = 68.9 bits (166), Expect = 2e-10, Method: Composition-based stats. 
Identities = 27/115 (23%), Positives = 44/115 (38%), Gaps = 12/115 (10%) Query: 192 GAHTHSISGTANSAGAHQHKSSGAFGGTNTSIFPNGYTAISNLSAGIMSTTSGSGQTRNA 251 + T+++ G+H H N P L AG + Sbjct: 467 TNTDGGSAQTSSANGSHDHLM------FNVIQGPPQTLPKITLRAGGGGEIYTEARGGTF 520 Query: 252 GKTSSDGAHTHSLSGTAAS------AGAHAHTVGIGAHTHSVAIGSHGHTITVNA 300 S+ HTH+++ + S AH+H V I HTHS+++ SH H + + A Sbjct: 521 RTASAADNHTHTVNVPSHSHRFNIDIPAHSHVVSIPNHTHSISVPSHSHQVRIPA 575 >UniRef50_B7UGJ3 Predicted tai fiber protein n=15 Tax=Escherichia coli RepID=B7UGJ3_ECO27 Length = 221 Score = 68.9 bits (166), Expect = 3e-10, Method: Composition-based stats. Identities = 26/102 (25%), Positives = 37/102 (36%), Gaps = 14/102 (13%) Query: 93 VGAPIPWPSDTVPSG---------YALMQGQTFDKSAYPKLAVAYPSGVIPDMRGWTIKG 143 +G P WPS +P + G F + YP LA +PS V+P+ RG I+ Sbjct: 79 IGVPFFWPSAAMPDTVIESWSGMVFLKFNGAKFSATDYPVLAKVFPSLVLPEARGDFIRI 138 Query: 144 KP-----ASGRAVLSQEQDGIKSHTHSASASSTDLGTETTSS 180 SGRA+LS + S + Sbjct: 139 WDDGRGADSGRALLSWQAATSLSQFGGNYPEGSGHAIADYDG 180 >UniRef50_B3HKW0 Phage Tail Collar Domain protein n=11 Tax=Enterobacteriaceae RepID=B3HKW0_ECOLX Length = 164 Score = 66.9 bits (161), Expect = 8e-10, Method: Composition-based stats. Identities = 24/102 (23%), Positives = 36/102 (35%), Gaps = 14/102 (13%) Query: 93 VGAPIPWPSDTVPSG---------YALMQGQTFDKSAYPKLAVAYPSGVIPDMRGWTIKG 143 +G P WPS +P+ + G F + YP LA +PS V+P+ RG I+ Sbjct: 22 IGVPFFWPSAAMPNTVIDSWSGMVFLKFNGAKFSATDYPVLAKVFPSLVLPEARGDFIRI 81 Query: 144 KP-----ASGRAVLSQEQDGIKSHTHSASASSTDLGTETTSS 180 GR +LS ++ S Sbjct: 82 WDDGRGADGGRELLSWQEATNFSQFAGNIGGGAGHAINFHDG 123 >UniRef50_A9AVE2 Tail Collar domain protein n=1 Tax=Herpetosiphon aurantiacus ATCC 23779 RepID=A9AVE2_HERA2 Length = 865 Score = 66.9 bits (161), Expect = 9e-10, Method: Composition-based stats. 
Identities = 43/229 (18%), Positives = 66/229 (28%), Gaps = 56/229 (24%) Query: 91 YPVGAPIPWPSDTVPSGYALMQGQTFDKSAYPKLAVAYPSGVIPDMRGWTIKGKPASGRA 150 P G W VP G+A+ G+ + PD+R I G A Sbjct: 693 IPCGTIQMWSGMEVPEGWAICDGREAN------------GLRTPDLRNRFIVGAGA---- 736 Query: 151 VLSQEQDGIKSHTHSASASSTDLGTETTSSFDYGTKSTNNTGAHTHSISGTANSAGAHQH 210 + S + S TT D + + HTH S A +H Sbjct: 737 -----------NYDSGNLSVYGTNQGTTGGSDVVALTLDQMPRHTHGGSTNAAGDHSHWV 785 Query: 211 KSSGAFGGTNTSIFPNGYTAISNLSAGIMSTTSGSGQTRNAGKTSSDGAHTHSLSGTAAS 270 + + A G G T + G + + R T + Sbjct: 786 EGTDADGLAKRRRHHWGDTTVDMGFGGGRNADPNDERWRGRVNTDN-------------- 831 Query: 271 AGAHAHTVGIGAHTHSVAIGSHGHTITVNAAGNAENTVKNIAFNYIVRL 319 AG H+H + IG V + EN A +I+++ Sbjct: 832 AGTHSHGLMIGE---------------VGGSQAHENRPPFYALAFIMKV 865 >UniRef50_B8HZW5 Tail Collar domain protein n=2 Tax=Clostridium RepID=B8HZW5_CLOCE Length = 368 Score = 66.9 bits (161), Expect = 9e-10, Method: Composition-based stats. Identities = 49/242 (20%), Positives = 79/242 (32%), Gaps = 76/242 (31%) Query: 91 YPVGAPIPWPSDTV-----PSGYALMQGQTFDKSAYPKLAVA---------YPSGVIPDM 136 +PVG IP+ SG+ G+ DK+ Y +L P+ IPD+ Sbjct: 5 FPVGMVIPFAGPLKEDQLKSSGWVPCDGRVLDKTQYSELFDVIGTKYGGDGIPNFNIPDL 64 Query: 137 RGWTIKGKPASGRAVLSQEQDGIKSHTHSASASSTDLGTETTSSFDYGTKSTNNTGAHTH 196 RG ++ GR + D + AS S G T S +Y T N Sbjct: 65 RGRFVR-ATDHGRG---YDPDAQRR---KASKSGGAAGDNTGSVQEYATAKPKN------ 111 Query: 197 SISGTANSAGAHQHKSSGAFGGTNTSIFPNGYTAISNLSAGIMSTTSGSGQTRNAGKTSS 256 + N G H H ++ TS+ Sbjct: 112 --NFITNDKGNHNH---------------------------LVDHLPTDYWNAACAITSN 142 Query: 257 DGAHTHSLSGTAASAGAHAHTVGIGAHTHSVAIGSHGHTITVNAAGNAENTVKNIAFNYI 316 +GA+ + T+ AG H+HT+ G G++E+ N+ +I Sbjct: 143 EGANFPGRTATSGEAGQHSHTIVSG--------------------GDSESRPVNLYMYWI 182 Query: 317 VR 318 ++ Sbjct: 183 IK 184 Score = 55.0 bits (130), Expect = 4e-06, Method: Composition-based stats. 
Identities = 47/309 (15%), Positives = 85/309 (27%), Gaps = 97/309 (31%) Query: 32 NIIHLKGMTAVGEGELLIGWSGTSGAHAPAFIRSRRDTTDANWSP------WAQLYTSAH 85 + + EG G + TSG A + D+ P W +TS+ Sbjct: 131 DYWNAACAITSNEGANFPGRTATSGE-AGQHSHTIVSGGDSESRPVNLYMYWIIKFTSSD 189 Query: 86 PPAEFY-PVGAPIPWPSDTVP-------SGYALMQGQTFDKSAYPKLAVAYPS------- 130 P G+ + + D+V +G+ G +++ + YP L + Sbjct: 190 YDESILLPAGSIVSFAGDSVKKSNELIANGWLPCIGSSYEANKYPDLYENISNIYGGDQN 249 Query: 131 -GVIPDMRGWTIKGKPASGRAVLSQEQDGIKSHTHSASASSTDLGTETTSSFDYGTKSTN 189 +PD+RG I+G ++ +T+ + + Sbjct: 250 KFNVPDLRGLFIRGVNSNTSETPGVHGAT----------RVGQTEDYSTALPKTLNFTLS 299 Query: 190 NTGAHTHSISGTANSAGAHQHKSSGAFGGTNTSIFPNGYTAISNLSAGIMSTTSGSGQTR 249 GAHTHS + + ++ + Sbjct: 300 TDGAHTHSAPKLPQDKYIENYCAGHEVANFPSNQY------------------------- 334 Query: 250 NAGKTSSDGAHTHSLSGTAASAGAHAHTVGIGAHTHSVAIGSHGHTITVNAAGNAENTVK 309 + G HAHT+ A G+AE Sbjct: 335 ------------------TGNNGNHAHTI---------------------AGGDAETRPV 355 Query: 310 NIAFNYIVR 318 NI +YI++ Sbjct: 356 NIYLDYIIK 364 >UniRef50_B8HZW4 Tail Collar domain protein n=2 Tax=Clostridium RepID=B8HZW4_CLOCE Length = 200 Score = 66.6 bits (160), Expect = 1e-09, Method: Composition-based stats. 
Identities = 34/185 (18%), Positives = 60/185 (32%), Gaps = 25/185 (13%) Query: 86 PPAEFYPVGAPIPWPSDTVPS--------GYALMQGQTFDKSAYPKLAVAYPS------- 130 E P+G+ I + + G+ + G + YP L A Sbjct: 2 ASTERMPIGSVISFAGEIKSEMVNRLYRMGWLICDGSKLKIAEYPDLFQAIGKAHGGDNT 61 Query: 131 -GVIPDMRGWTIKG-KPASGRAVLSQEQDGIKSHTHSASASSTDLGTETTSSFDYG---- 184 +PD + I+G S + T + +T + F G Sbjct: 62 YFYLPDTQSKFIRGVNGDSVGESGRLMDPDVAKRTFAKPGGNTGNNVGSYQDFATGLPKV 121 Query: 185 TKSTNNTGAHTHSI----SGTANSAGAHQHKSSGAFGGTNTSIFPNGYTAISNLSAGIMS 240 + +T+ G+HTHS+ G+ N+ + G G NT +G + + G Sbjct: 122 SLTTDFIGSHTHSLPHLPDGSHNAYAGSIGRDGGKEAGDNTRTGESGSHSHEIIGGGDPE 181 Query: 241 TTSGS 245 T + Sbjct: 182 TRPRN 186 >UniRef50_A3GUE7 Tail fiber protein H, putative (Fragment) n=1 Tax=Vibrio cholerae NCTC 8457 RepID=A3GUE7_VIBCH Length = 250 Score = 66.2 bits (159), Expect = 1e-09, Method: Composition-based stats. Identities = 25/62 (40%), Positives = 38/62 (61%), Gaps = 2/62 (3%) Query: 79 QLYTSAH--PPAEFYPVGAPIPWPSDTVPSGYALMQGQTFDKSAYPKLAVAYPSGVIPDM 136 +L + + +PVG IPW +D P G+ + +GQ FD + Y +LA +P+G+IPDM Sbjct: 189 RLVNNLWLKFAVKIFPVGGAIPWFTDVAPEGFGMFKGQAFDVNVYTELAKVFPNGIIPDM 248 Query: 137 RG 138 RG Sbjct: 249 RG 250 >UniRef50_Q7N541 Similar to DNA inversion product and tail fiber protein from lambdoid prophage n=2 Tax=Photorhabdus RepID=Q7N541_PHOLL Length = 337 Score = 66.2 bits (159), Expect = 2e-09, Method: Composition-based stats. 
Identities = 54/226 (23%), Positives = 85/226 (37%), Gaps = 66/226 (29%) Query: 88 AEFYPVGAPIPWPSDTVPSGYALMQGQTFD---KSAYPKLAVAYPSGVIPDMRGWTIKGK 144 YPVG I + + P+ L G T++ ++ +LA A S ++ Sbjct: 163 NTQYPVGIVIWFAQNKNPN--VLFPGTTWEYIGENKTIRLASANGSDIL----------- 209 Query: 145 PASGRAVLSQEQDGIKSHTHSASASSTDLGTETTSSFDYGTKSTNNTGAHTHSISGTANS 204 G ++S + +H H+ TTS+FDYGTK+TN G H H + Sbjct: 210 STGGNDLISLTAAQMPAHNHTF--------FGTTSTFDYGTKTTNIAGEHYHDSGWGETT 261 Query: 205 AGAHQHKSSGAFGGTNTSIFPNGYTAISNLSAGIMSTTSGSGQTRNAGKTSSDGAHTHSL 264 G + H G N G S+D + Sbjct: 262 GGRYGHF---------------------------------DGSKNNQGSKSTDWNNA--- 285 Query: 265 SGTAASAGAHAHTVGIGAHTHSVAIGSHGHTITVNAAGNAENTVKN 310 ++ GAH+HTV IGAH H+++ T ++ GNA ++ N Sbjct: 286 KFNTSTNGAHSHTVSIGAHNHTISGN------TGDSGGNAAISITN 325 >UniRef50_Q4C9U4 Phage Tail Collar n=1 Tax=Crocosphaera watsonii WH 8501 RepID=Q4C9U4_CROWT Length = 253 Score = 65.0 bits (156), Expect = 3e-09, Method: Composition-based stats. Identities = 42/214 (19%), Positives = 67/214 (31%), Gaps = 25/214 (11%) Query: 36 LKGMTAVGEGELLIGWSGTSGAHAPAFIRSRRDTTDANWSPWAQLYTSAHPPAEFYPVGA 95 L+ + + LLI + TSG I R + + P + Sbjct: 23 LRKSSTAPQVGLLIARAETSGGSISGQIEDLRPNVN------MLVQPLVDNVGSIIPKSS 76 Query: 96 PIPWPSDTVPSGYALMQGQTFDKSAYPKLAVAYPSGV--------IPDMRGWTIKGKPAS 147 + + P+G+ G +D S YP+L A G +PDMR + G S Sbjct: 77 IVVFGGAVAPNGWLFCDGTPYDPSTYPQLFSAIGYGFGQVGSLFRVPDMRDRSPVGAGIS 136 Query: 148 -------GRAVLSQEQDGIKSHTHSASASSTDLGTETTSSFDYGTKSTNNTGAHTHSISG 200 G A S D + +H+H+ D G + + G S H Sbjct: 137 FDRGTFGGSATTSLSVDNMPAHSHNV----IDPGHTHSMNHGPGQHSAVALDYHNAGNGV 192 Query: 201 TANSAGAHQHKSSGAFGGTNTSIFPNGYTAISNL 234 A H + G S+ G ++ Sbjct: 193 DAYVPQWGGHAHTIYASGVGISLENTGSGTPVSV 226 >UniRef50_C9PG79 Putative phage tail protein n=1 Tax=Vibrio furnissii CIP 102972 RepID=C9PG79_VIBFU Length = 410 Score = 65.0 bits (156), Expect = 4e-09, Method: Composition-based stats. 
Identities = 32/138 (23%), Positives = 49/138 (35%), Gaps = 12/138 (8%) Query: 74 WSPWAQLYTSAHPPAEFYPVGAPIPWPSDTVPSGYALMQGQTFDKSAYPKLAVAYPS--- 130 WS + + P G + W S+ VP + GQ + Y LA A P Sbjct: 244 WSDTTKPF--YWKPFSSKTPGETMAWDSELVPEHMIVAMGQQLPVTVYHSLAAAKPEWID 301 Query: 131 ------GVIPDMRGWTIKGKPASGRAVLSQEQDGIKSHTHSASASSTDLGTETTSSFDY- 183 IPD RG + S D I++ T S +AS T +T + Sbjct: 302 DTNPLVLNIPDRRGRFTRAADGSHWLAGQSHDDAIRNITGSFNASGTTGSASSTKTQGAI 361 Query: 184 GTKSTNNTGAHTHSISGT 201 +T++ + + SG Sbjct: 362 ALSNTSSWPNYVNGQSGA 379 >UniRef50_Q094A8 Phage Tail Collar Domain family n=1 Tax=Stigmatella aurantiaca DW4/3-1 RepID=Q094A8_STIAU Length = 645 Score = 65.0 bits (156), Expect = 4e-09, Method: Composition-based stats. Identities = 31/175 (17%), Positives = 50/175 (28%), Gaps = 25/175 (14%) Query: 90 FYPVGAPIPWPSDTVPSGYALMQGQTFDKSAYPKLAVAYPS------------GVIPDMR 137 PVG I + + P G+ L G T K+AY L +P + Sbjct: 478 LVPVGTIIAYGGSSAPEGWLLCDGSTKSKTAYADLFAVIGDTYKGSSAPPSGQFRLPSLM 537 Query: 138 GWTIKGKPASGRAVLSQEQDGIKSHTHSASASSTDLGTETTSSFDYGTKSTNNTGAHTHS 197 G S + + T T S N+ G H+HS Sbjct: 538 ARVPMGASVSS-----------PHNYPLGTMGGEFTHTLTISEMPVHDHYVNDPG-HSHS 585 Query: 198 ISGT-ANSAGAHQHKSSGAFGGTNTSIFPNGYTAISNLSAGIMSTTSGSGQTRNA 251 I+ T A +G + + G + + + G + + T Sbjct: 586 ITTTNAEGSGDLRPNRDASKGHVDIPTNHVTTGVTLDTNGGGQAHNNMQPYTTVN 640 >UniRef50_C4MYW8 Gp12 Short tail fibers n=1 Tax=Enterobacteria phage JSE RepID=C4MYW8_9CAUD Length = 467 Score = 64.6 bits (155), Expect = 5e-09, Method: Composition-based stats. 
Identities = 26/156 (16%), Positives = 47/156 (30%), Gaps = 18/156 (11%) Query: 68 DTTDANWSPWAQLYTSAHPPAEFYPVGAPIPWP-SDTVPSGYALMQGQTFDKSAYPKLAV 126 D + Q P+G I + + + + GQ +K YP L Sbjct: 289 DVYRHSGGDGNQFIIKNELDGLCMPIGGIILTAFNSFDHAQFKICNGQWLNKHQYPVLFS 348 Query: 127 AYP---------SGVIPDMRGWTIKGKP-----ASGRAVLSQEQDGIKSHTHSASASS-- 170 + +PDMRG +G GR + + D ++ T + ++ Sbjct: 349 RIGFTYGGDGGDNFALPDMRGLVARGCDHGRGLDPGRGFGTYQDDTMQHMTGNFPVANRW 408 Query: 171 -TDLGTETTSSFDYGTKSTNNTGAHTHSISGTANSA 205 G + + + N G +SA Sbjct: 409 RGWTGGVFAITGGQWSTNYKNGGGDDWGSIVNFDSA 444 >UniRef50_A1HR57 Putative uncharacterized protein n=1 Tax=Thermosinus carboxydivorans Nor1 RepID=A1HR57_9FIRM Length = 269 Score = 64.3 bits (154), Expect = 5e-09, Method: Composition-based stats. Identities = 47/221 (21%), Positives = 71/221 (32%), Gaps = 50/221 (22%) Query: 14 AGLELYEVYNNGYPTAYGNIIHLKGMTAVGEGELLIGWSGTSGAHAPAFIRSRRDTTDAN 73 + + A G+I + + E ++ +G SG P + D Sbjct: 42 TDVITAAAWQPNKTYAVGDICYSPNAPSYTRMECVV--AGRSGTTEPTWPTVGNMVVDGT 99 Query: 74 WSPWAQLYTSAHPPAEFYPVGAPIPWPSDTVPSGYALMQGQTFDKSAYPKLAVAY----- 128 + W + PVG + S GY G ++AYP+L AY Sbjct: 100 VT-WI-----VDDVRDGTPVGRIVAEISPICRPGYLKANGALVSRAAYPRL-WAYVQARG 152 Query: 129 ----------------------PSGVIPDMRGWTI-----KGKPASGRAVLSQEQDGIKS 161 + +PD+RG I GRA S + DGIKS Sbjct: 153 LVVPDTVWPANYWGCFSTGDGSTTFRLPDLRGEFIRGGDDGRGVDGGRAFGSWQADGIKS 212 Query: 162 HTHSASA---------SSTDLGTETTSSFDYGTKSTNNTGA 193 H H + D+ E TS+ + T T+N G Sbjct: 213 HNHPYQSQPYLFVESFDGGDVIAERTSTAKWVTHYTSNFGG 253 >UniRef50_A4P195 Putative phage tail fibre protein (Fragment) n=1 Tax=Haemophilus influenzae 22.4-21 RepID=A4P195_HAEIN Length = 458 Score = 63.9 bits (153), Expect = 8e-09, Method: Composition-based stats. 
Identities = 29/142 (20%), Positives = 50/142 (35%), Gaps = 12/142 (8%) Query: 1 MNITALTD--NTQGAAGLELYEVYNNGYPTAYGNIIHLKGMTAVGEGELLIGWSGTSGAH 58 + I + NT G+ + H++ + A G+G + Sbjct: 309 LKIQSFAGDINTLKIDGIYAITQASRSQNLPVSTSCHIQ-VIAGGDGHWCRQIAYI-AYS 366 Query: 59 APAFIRSRRDTTDANWSPWAQLYTSAHPPAEFYPVGAPIPWPSD-TVPSGYALMQGQTFD 117 + R + +WS W +L T P+GA + +P T P G+ G TF Sbjct: 367 TDMYERHQTSYQTDSWSAWKKLNTDG------IPIGAVVSFPRAVTNPVGFLRADGSTFS 420 Query: 118 KSAYPKLAVAYP-SGVIPDMRG 138 + +P L S +PD+ Sbjct: 421 QQTFPDLYRTLGNSNKLPDLTR 442 >UniRef50_A4NHY2 Probable tail fiber protein n=1 Tax=Haemophilus influenzae PittAA RepID=A4NHY2_HAEIN Length = 556 Score = 63.5 bits (152), Expect = 9e-09, Method: Composition-based stats. Identities = 40/279 (14%), Positives = 73/279 (26%), Gaps = 71/279 (25%) Query: 49 IGWSGTSGAHAPAFIRSRRDTTDANWSPWAQLYTS---AHPPAEFYPVGAPIPWPSD-TV 104 I + + A A + S + + + + P+GA + +P T Sbjct: 133 IATTRATDAVAGQTVLSHKINGTDKTKAATEFALNELNKELAGKGVPIGAVVSFPRAVTN 192 Query: 105 PSGYALMQGQTFDKSAYPKLAVAYP-SGVIPDMR------------GWTIKGK------- 144 P G+ G TF++ +P L S +PD+ G Sbjct: 193 PVGFLKANGTTFNQQTFPDLYRTLGNSNQLPDLTRSDVGMTAYFAVDNIPAGWIAFDEIA 252 Query: 145 -----------------------------------PASGRAVLSQEQDGIKSHTHSASAS 169 +G +V ++D +K H H Sbjct: 253 TQVTEQRYPELYRHLIDKYGSINSVPKVADRFLRNAGNGLSVGQIQEDDLKRHVHRVPID 312 Query: 170 STDLGT-----ETTSSFDYGTKSTNNTGAHTHSISGTANSAG------AHQHKSSGAFGG 218 S FDY T + ++ T G Q + G Sbjct: 313 YDSWFDDSSQGRNNSYFDYTTFAQSSDLWSTLGYDNADGDNGFVSPKDTSQMATGGDETR 372 Query: 219 TNTSIFPNGYTAISNLSAGIMSTTSGSGQTRNAGKTSSD 257 + + A+++ + G+ NAG + Sbjct: 373 PKSLVLKLCIKALNSFDDVVFWI-KSHGEVTNAGTLDAG 410 >UniRef50_D0LMW0 Tail Collar domain protein n=1 Tax=Haliangium ochraceum DSM 14365 RepID=D0LMW0_HALO1 Length = 264 Score = 62.3 bits (149), Expect = 2e-08, Method: Composition-based stats. 
Identities = 27/144 (18%), Positives = 42/144 (29%), Gaps = 23/144 (15%) Query: 93 VGAPIPWPSDTVPSGYALMQGQTFDKSAYPKLAVAYP----------SGVIPDMRGWTIK 142 G ++T P G+ G + YP+L A + V+PD RG T+ Sbjct: 112 AGTLALSAAETAPDGWLFCDGSPLIRDDYPELFAAIGETYGAGDGVNTFVLPDCRGRTLI 171 Query: 143 G------------KPASGRAVLSQEQDGIKSHTHSASASSTDLGTETTSSFDYGTKSTNN 190 G G + + SHTH+ + L + GT Sbjct: 172 GAGQGNGLSDRQRGDVVGAEEHTLTIPEMPSHTHAEHPGTGTLWFQVF-ERGPGTWPNER 230 Query: 191 TGAHTHSISGTANSAGAHQHKSSG 214 +G +G H Sbjct: 231 SGNTLGQSTGATGGNQPHNIMQPS 254 >UniRef50_A3YFP9 35 kDa protein-like n=1 Tax=Marinomonas sp. MED121 RepID=A3YFP9_9GAMM Length = 207 Score = 61.2 bits (146), Expect = 4e-08, Method: Composition-based stats. Identities = 20/126 (15%), Positives = 38/126 (30%), Gaps = 39/126 (30%) Query: 81 YTSAHPPAEFYPVGAPIPWPSDT------------VPSGYALMQGQTFDKSAYPKLAVAY 128 +T + PVG+ I + + + G + + + YP+L A Sbjct: 22 FTPPAIMGDAMPVGSVIAFAGEIRTSGDKPFETNLPMFNWLKCDGSSLEVAQYPELFSAL 81 Query: 129 P--------SGVIPDMRGWTIKGKPASGR-------------------AVLSQEQDGIKS 161 +PD+RG ++G V S + ++S Sbjct: 82 GYRYGGSGQKFNLPDLRGEFLRGVDVDSSNNKKASLEGRKGAANGGNHEVGSTQGFALQS 141 Query: 162 HTHSAS 167 H H+ Sbjct: 142 HVHTYQ 147 >UniRef50_A7INV5 Tail Collar domain protein n=1 Tax=Xanthobacter autotrophicus Py2 RepID=A7INV5_XANP2 Length = 492 Score = 60.4 bits (144), Expect = 9e-08, Method: Composition-based stats. 
Identities = 28/133 (21%), Positives = 46/133 (34%), Gaps = 18/133 (13%) Query: 93 VGAPIPWPSDTVPSGYALMQGQTFDKSAYPKLAVAYP----------SGVIPDMRGWTIK 142 G WP+ T PSG + G T ++ Y L + +P+ G ++ Sbjct: 357 PGTIAMWPASTPPSGALVRNGATLSRTVYASLFAVIGTTFGAGDGATTFGVPNDLGIFVR 416 Query: 143 GKP-----ASGRAVLSQEQDGIKSHTHSASASSTDLGTETTSSFDYGTKSTNNTGAHTHS 197 G +GR S++ D KSH H+ T G T + + + +T S Sbjct: 417 GWDNGRGYDTGRVFGSEQADDNKSHDHARQ---TVSGVFTAGGAGFALQDSGSTTQRVAS 473 Query: 198 ISGTANSAGAHQH 210 G + Sbjct: 474 SGGAEARPKNRAY 486 >UniRef50_B8QTW7 Putative tail fiber protein n=1 Tax=Erwinia phage phiEa21-4 RepID=B8QTW7_9CAUD Length = 357 Score = 59.2 bits (141), Expect = 2e-07, Method: Composition-based stats. Identities = 40/188 (21%), Positives = 62/188 (32%), Gaps = 5/188 (2%) Query: 120 AYPKLAVAYPSGVIPDMRGWTIKGKPASGRAVLSQEQDGIKSHTHSASASSTDLGTETTS 179 YP Y + +G+ +S D S + + + T + Sbjct: 157 MYPIGHRIYTDNSANPSTYIPVGTWALTGQGRVSVGYDAGNSSRPAGTKFGSSTVTIDVA 216 Query: 180 SFDYGTKSTN-NTGAHTHSISGTANSAGAHQHKSSGAFGGTNTSIFPNGYTAISNLSAGI 238 + T G H+H SG+ AGAH H +SG G T + G Sbjct: 217 NLPAHTHGVTVTGGNHSHGASGSTTGAGAHNHVASGNTGYAGDHNHTYTTTRQGGGNPGN 276 Query: 239 MSTTSGSGQTRNAGKTSSDGAHTHSLSGTAASAGAHAHTVGIGAHTHSVAIGSHGHTITV 298 + T G HTH +S + G HAH + I + A G+ + T Sbjct: 277 HVGHGSNEIHYTNEATGVAGGHTHYVSLATNTVGDHAHGLNININ----ASGNLSMSGTS 332 Query: 299 NAAGNAEN 306 N G+ + Sbjct: 333 NPTGSGQA 340 >UniRef50_C2I7P2 Phage-related tail fiber protein n=1 Tax=Vibrio cholerae TM 11079-80 RepID=C2I7P2_VIBCH Length = 406 Score = 57.7 bits (137), Expect = 5e-07, Method: Composition-based stats. 
Identities = 29/143 (20%), Positives = 56/143 (39%), Gaps = 20/143 (13%) Query: 82 TSAHPPAEFYPVGAPIPWPSDTVPSGYALMQGQT-FDKSAYPKLAVAYPSGVIPD----- 135 P VG P W + P +A+M+ + Y +LA YP V D Sbjct: 263 PFYWIPYTGDQVGMPFYWLDTSAPE-WAVMEINVNLPIAVYWRLARRYPQLVRDDYINTG 321 Query: 136 -MRGWTIKGKP-----ASGRAVLSQEQDGIKSHTHSASASSTDLGTETTSSFDYGTKSTN 189 +RG ++ +GR++ S + D ++ HTH+ SA + + + G+ Sbjct: 322 EIRGEFLRVLDQGRGVDAGRSIQSYQDDELERHTHTFSAP-------FSITANTGSTGII 374 Query: 190 NTGAHTHSISGTANSAGAHQHKS 212 + +H + + T + ++ Sbjct: 375 ISASHVPNWNTTYTGGNETRPRN 397 >UniRef50_B8FJJ3 Tail Collar domain protein n=1 Tax=Desulfatibacillum alkenivorans AK-01 RepID=B8FJJ3_DESAA Length = 264 Score = 56.9 bits (135), Expect = 9e-07, Method: Composition-based stats. Identities = 28/156 (17%), Positives = 49/156 (31%), Gaps = 26/156 (16%) Query: 92 PVGAPIPWPSDTVPSGYALMQGQTFDKSAYPKLAVAY----------PSGVIPDMRGWTI 141 P G+ + + + PSG+ G ++ Y L + +PD+RG+ + Sbjct: 116 PTGSVVAFMGASAPSGWLECSGAAVSRTTYDNLFSVISTMYGVGDGSTTFNLPDLRGYFL 175 Query: 142 KGKPASGRAVLSQEQDGIKSHTHSASASSTDLGTETTSSFDYGTKSTNNTGAHTHSISGT 201 +G S + S TD G T + GT+ + +HTH Sbjct: 176 RGWSHG-------------SGKDPDAGSRTDRGDGTCGDY-VGTRQEDEFASHTHYDDED 221 Query: 202 --ANSAGAHQHKSSGAFGGTNTSIFPNGYTAISNLS 235 G +S T N++ Sbjct: 222 LLTFDGGGPVGSNSSGMSAVLPGSVGGAETRPKNVA 257 >UniRef50_Q31Q92 Putative uncharacterized protein n=2 Tax=Synechococcus elongatus RepID=Q31Q92_SYNE7 Length = 387 Score = 56.2 bits (133), Expect = 1e-06, Method: Composition-based stats. 
Identities = 27/115 (23%), Positives = 41/115 (35%), Gaps = 33/115 (28%) Query: 86 PPAEFYPVGAPIPWPSDTVPSGYALMQGQTFDKSAYPKLAVAY----------------- 128 A P G I +T P+GY G ++ Y +L AY Sbjct: 231 IAALAVPAGVAIWVTGNTPPTGYIKANGALLSRTTYARL-WAYAQASGNIVSDAAWTGGA 289 Query: 129 ----------PSGVIPDMRGWTIKGKP-----ASGRAVLSQEQDGIKSHTHSASA 168 + +PD+RG I+G +GRA+ S + D +K+H H Sbjct: 290 TGSYSTGDGSTTFRVPDLRGEFIRGWADGRSVDTGRAIGSTQADELKAHAHYLDT 344 >UniRef50_A8YDB4 Genome sequencing data, contig C291 n=2 Tax=Microcystis aeruginosa RepID=A8YDB4_MICAE Length = 166 Score = 56.2 bits (133), Expect = 2e-06, Method: Composition-based stats. Identities = 25/119 (21%), Positives = 43/119 (36%), Gaps = 30/119 (25%) Query: 82 TSAHPPAEFYPVGAPIPWPSDTVPSGYALMQGQTFDKSAYPKLAVAYPS---------GV 132 +S + AE PI +P G+ L G+ + AYP+L + Sbjct: 34 SSQNAQAESLNANIPITYPEAY---GWMLCDGRYLEIDAYPELFAVIGTLYGKQGDNKFR 90 Query: 133 IPDMRGWTIKGKPAS------------------GRAVLSQEQDGIKSHTHSASASSTDL 173 +PD RG ++G A + S + D ++ H H +AS++ Sbjct: 91 LPDYRGLFMRGVDAGSGLDPDAAERIGPEGMGKSSGIGSLQCDALQQHQHDYNASNSHF 149 >UniRef50_B6XJ97 Putative uncharacterized protein n=2 Tax=Enterobacteriaceae RepID=B6XJ97_9ENTR Length = 432 Score = 55.4 bits (131), Expect = 2e-06, Method: Composition-based stats. 
Identities = 45/270 (16%), Positives = 87/270 (32%), Gaps = 47/270 (17%) Query: 19 YEVYNNGYPTAYGNIIHLKGMTAVGEGELLIGWSGTSGAHAPAFIRSRRDTTDANWSPW- 77 Y + + Y A G+ + G+ + T+G + I S T + + Sbjct: 173 YAMKGDSYTKAEGDGRYQSKGNYAPAGDYATNTALTNGLNTKLNISSIAQATGTSTTNVM 232 Query: 78 -----AQLYTSAHPPAEFYPVGAPIPWPSDTVPSGYALMQGQTF---DKSAYPKLAVAYP 129 +A YP+G + + + P+ L G + ++ +LA A Sbjct: 233 SQKAVTDALQNAVNLDTIYPIGVVVWFAQNKNPNT--LFPGTKWQYIGENRTIRLAAASG 290 Query: 130 SGVIPDMRGWTIKGKPASGRAVLSQEQDGIKSHTHSASASSTDLGTETTSSFDYGTKSTN 189 + V+ G ++ + H HS S ++T Sbjct: 291 ANVL-----------STGGSDSITLNASQMPVHNHSFSGTAT------------------ 321 Query: 190 NTGAHTHSISGTANSAGAHQHKSSGAFGGTNTSIFPNGYTAISNLSAGIMSTTSGSGQTR 249 ++G HTH G + A GG + G+ + + + Sbjct: 322 SSGGHTH-------DKGTMNITGAFAIGGGKSEGQAPGFASGVFSKTTRTLKVNTASGVV 374 Query: 250 NAGKTSSDGAHTHSLSGTAASAGAHAHTVG 279 ++ T + + +G +S+GAH HTV Sbjct: 375 DSSVTQINMNAASAWTGNTSSSGAHTHTVT 404 >UniRef50_C0DSG4 Putative uncharacterized protein n=1 Tax=Eikenella corrodens ATCC 23834 RepID=C0DSG4_EIKCO Length = 436 Score = 55.0 bits (130), Expect = 3e-06, Method: Composition-based stats. Identities = 15/51 (29%), Positives = 24/51 (47%), Gaps = 1/51 (1%) Query: 89 EFYPVGAPIPWPSD-TVPSGYALMQGQTFDKSAYPKLAVAYPSGVIPDMRG 138 + PVGA + +P + P GY G TF ++ YP L +P++ Sbjct: 70 KGLPVGAVVGFPRAISSPEGYLKADGSTFAQATYPDLYRVLGGNKLPNLTR 120 Score = 52.7 bits (124), Expect = 2e-05, Method: Composition-based stats. 
Identities = 28/127 (22%), Positives = 48/127 (37%), Gaps = 12/127 (9%) Query: 93 VGAPIPWPSDTVPSGYALMQ--GQTFDKSAYPKLA----VAYPS-GVIPDMRGWTIKGKP 145 VG +P + +P G+ +SAYP+L Y S +P I+ Sbjct: 123 VGMTAYFPIEAIPDGWIKYDEVATKVTQSAYPELYRLLVAQYGSIDAVPKAEDRFIR-NA 181 Query: 146 ASGRAVLSQEQDGIKSHTHSASASSTDLGTETTSSFDYGTKSTNNTGAHTHSISGTANSA 205 + AV +Q+ D I++ T A + + D + GA+ S ++S Sbjct: 182 SGSLAVGTQQGDTIRNITGGIEALYSGYRYTLYTKADGAFTMDLDDGAN----STFSSSK 237 Query: 206 GAHQHKS 212 G H + Sbjct: 238 GDSDHNN 244 >UniRef50_UPI00019136B5 bacteriophage tail fiber protein n=7 Tax=Salmonella enterica subsp. enterica serovar Typhi RepID=UPI00019136B5 Length = 137 Score = 55.0 bits (130), Expect = 4e-06, Method: Composition-based stats. Identities = 38/201 (18%), Positives = 62/201 (30%), Gaps = 69/201 (34%) Query: 120 AYPKLAVAYPSGVIPDMRGWTIKGKP-----ASGRAVLSQEQDGIKSHTHSASASSTDLG 174 YP LA AYP+ +PD+RG I+G +GRA+L + D ++H H Sbjct: 1 MYPNLAKAYPTNKLPDLRGEFIRGWDDGRGVDAGRALLRLQDDSFEAHRHE--------- 51 Query: 175 TETTSSFDYGTKSTNNTGAHTHSISGTANSAGAHQHKSSGAFGGTNTSIFPNGYTAISNL 234 + AG +++ ++ + T + Sbjct: 52 --------------------------SFFYAGISRNEIPLKNLPSSDEMLTLSSTTNALS 85 Query: 235 SAGIMSTTSGSGQTRNAGKTSSDGAHTHSLSGTAASAGAHAHTVGIGAHTHSVAIGSHGH 294 GI +T S G + + + +G + S Sbjct: 86 PDGIDATNSLIGNDDYNCLIEGNKNNKRTATGLSTSI----------------------- 122 Query: 295 TITVNAAGNAENTVKNIAFNY 315 G E +NIAFNY Sbjct: 123 ------VGATETRPRNIAFNY 137 >UniRef50_C5B185 Putative uncharacterized protein n=1 Tax=Methylobacterium extorquens AM1 RepID=C5B185_METEA Length = 449 Score = 54.2 bits (128), Expect = 5e-06, Method: Composition-based stats. 
Identities = 30/180 (16%), Positives = 60/180 (33%), Gaps = 23/180 (12%) Query: 47 LLIGWSGTSGAHAPAFIRSRRDTTDANWSPWAQLYTSAHPPAEFY--------PVGAPIP 98 L G S T ++ + R + + +A Y P G Sbjct: 255 LTFGGSSTLVEGPKLYVPNGRLAVASGDTAFAMYIGDGIWTLMGYQPITDSKSPPGMISA 314 Query: 99 WPSDTVPSGYALMQGQTFDKSAYPKLAVAYP----------SGVIPDMRGWTIKGKP--- 145 + + P G+ G +S + L + +PD+RG+ ++ + Sbjct: 315 YAGQSCPVGWVDATGLALLRSDFSALFAVIGTRWGAGDGSTTFNVPDLRGYFLRMQDAGA 374 Query: 146 --ASGRAVLSQEQDGIKSHTHSASASSTDLGTETTSSFDYGTKSTNNTGAHTHSISGTAN 203 GR + S + + H H+ ++ G+ TT++F Y + ++ T A Sbjct: 375 GRDPGRDLGSAQAGSVGPHQHNVPVANATAGSGTTNNFVYPLAAGTSSVPTTGQDPAPAG 434 >UniRef50_C9MDX4 Tail fiber protein (Fragment) n=5 Tax=Haemophilus influenzae RepID=C9MDX4_HAEIN Length = 478 Score = 51.5 bits (121), Expect = 4e-05, Method: Composition-based stats. Identities = 26/111 (23%), Positives = 34/111 (30%), Gaps = 16/111 (14%) Query: 4 TALTDNTQGAAGLELYEVYNNGYPTAYGNIIHLKGMTAVGEGELLIGWSGTSGAHAPAFI 63 L G +E + NGY T GN G G I +A I Sbjct: 374 NTLAGYGIGNFKVEQGQGDANGYKTD-GNYYLASGQNLPENGAWHIEVVSGGATNAVRQI 432 Query: 64 -RSRRDT-------TDANWSPWAQLYTSAHPPAEFYPVGAPIPWPSD-TVP 105 R D +NWS W + P+GA + +P T P Sbjct: 433 ARKANDNKIKTRFFNGSNWSEWKETGGDG------VPIGAVVSFPRAVTNP 477 >UniRef50_C5RN01 Tail Collar domain protein n=1 Tax=Clostridium cellulovorans 743B RepID=C5RN01_CLOCL Length = 123 Score = 50.8 bits (119), Expect = 6e-05, Method: Composition-based stats. Identities = 14/62 (22%), Positives = 20/62 (32%), Gaps = 10/62 (16%) Query: 93 VGAPIPWPSDTVPSGYALMQGQTFDKSAYPKLAVAYP----------SGVIPDMRGWTIK 142 G + T P G+ + G + Y L A + +PDMRG Sbjct: 61 PGKIDMTATTTAPQGWLICDGSAVSRETYANLYTAIGTTYGNGDGTTTFNLPDMRGRVPI 120 Query: 143 GK 144 G Sbjct: 121 GS 122 >UniRef50_D2V5I7 Microcystin-dependent protein n=1 Tax=Naegleria gruberi RepID=D2V5I7_NAEGR Length = 191 Score = 50.0 bits (117), Expect = 1e-04, Method: Composition-based stats. 
Identities = 28/161 (17%), Positives = 49/161 (30%), Gaps = 28/161 (17%) Query: 90 FYPVGAPIPWPSDTVPSGYALMQGQTFDKS--AYPKLAVAYP----------SGVIPDMR 137 PVG + T+P+G+ L G T+ S Y +L S +PD+R Sbjct: 15 IIPVGIVNAFAGTTIPAGWLLCDGATYPNSHPDYIRLFQTIGNAYGSTGGPHSFNVPDLR 74 Query: 138 GWTIKGKPAS------------GRAVLSQEQDGIKSHTHSASASSTD----LGTETTSSF 181 G + G G + + SH+HS + + + + Sbjct: 75 GRAVVGIGHGAGLSNRTLAQKVGEESHQLQISELPSHSHSGTTGKANKQPYIIVHQSGPI 134 Query: 182 DYGTKSTNNTGAHTHSISGTANSAGAHQHKSSGAFGGTNTS 222 + G + H H + G N++ Sbjct: 135 SDVFHTPGWCGGPATHKDDDNFTGANHTHNFTTNEVGGNSA 175 >UniRef50_A0A7D3 Putative uncharacterized protein n=1 Tax=Microcystis aeruginosa phage Ma-LMM01 RepID=A0A7D3_9CAUD Length = 335 Score = 50.0 bits (117), Expect = 1e-04, Method: Composition-based stats. Identities = 19/129 (14%), Positives = 44/129 (34%), Gaps = 5/129 (3%) Query: 9 NTQGAAGLELYEVYNNGYPTAYGNIIHLKGMTAVGEGELLIGWSGTSGAHAPAFIRSRRD 68 Q A N+ I+ L+ + ++ + + +P + R Sbjct: 176 TAQIATINATVSAINDRLNLEIPKIVALQNTVGSLQTQVTNLSNTKANIDSPFLTGNPRT 235 Query: 69 TTDANWSPWAQL-YTSAHPPAEFYPVGAPIPW----PSDTVPSGYALMQGQTFDKSAYPK 123 + A++ + + A P+G+ I W + P+ + + GQ ++ YP+ Sbjct: 236 VSPTTNDSIARVDWVNQKIAAAGAPIGSIIMWWPLIVTQQHPTNWLPLNGQEISRTQYPE 295 Query: 124 LAVAYPSGV 132 L + Sbjct: 296 LFAVIGTFY 304 >UniRef50_C5RJD9 Tail Collar domain protein n=1 Tax=Clostridium cellulovorans 743B RepID=C5RJD9_CLOCL Length = 199 Score = 49.2 bits (115), Expect = 2e-04, Method: Composition-based stats. 
Identities = 28/145 (19%), Positives = 49/145 (33%), Gaps = 20/145 (13%) Query: 95 APIPWPSDTVPSGYALMQGQTFDKSAYPKLAVAYP---------SGVIPDMRGWTIKGK- 144 I WP + VP G+ +GQ + Y L + +PD+RG G Sbjct: 7 QIILWPGNFVPRGWLACEGQELPINQYTALYSLLGTTYGGNGSTTFKLPDLRGRVPVGSG 66 Query: 145 ----------PASGRAVLSQEQDGIKSHTHSASASSTDLGTETTSSFDYGTKSTNNTGAH 194 + G ++ Q + +HTHS + + + F+ G +TN A Sbjct: 67 ICGGINFQQGNSGGNFNVTLTQQQMPAHTHSTTVTQGAVTVNGGIPFNGGEGTTNTPSAS 126 Query: 195 THSISGTANSAGAHQHKSSGAFGGT 219 + G ++ G+ Sbjct: 127 SKLAVGITAGGDIPNIYNTSEATGS 151 >UniRef50_Q2W7B2 Microcystin-dependent protein n=1 Tax=Magnetospirillum magneticum AMB-1 RepID=Q2W7B2_MAGSA Length = 192 Score = 49.2 bits (115), Expect = 2e-04, Method: Composition-based stats. Identities = 31/179 (17%), Positives = 50/179 (27%), Gaps = 24/179 (13%) Query: 94 GAPIPWPSDTVPSGYALMQGQTFDKSAYPKLAVAYP---------SGVIPDMRGWTIKGK 144 G I + P +A+ G S YP L + +PD+R G Sbjct: 6 GQIILFSGSYAPVNWAVCDGHQLSVSQYPALFSLLGTQFGGNGTTTFGLPDLRSRLAMGF 65 Query: 145 PASGRAVLSQEQDGIKSHTHSASASSTDLGTETTSSFDYGTKSTNNTGAHTHSISGTANS 204 + S SA T G T + T + HTH T N+ Sbjct: 66 GTGHVDPKA-----------SNSAPLTPYGFATNGGVETVTLTQAQIPPHTH----TLNA 110 Query: 205 AGAHQHKSSGAFGGTNTSIFPNGYTAISNLSAGIMSTTSGSGQTRNAGKTSSDGAHTHS 263 +G + + G + + T Q + T++ + H Sbjct: 111 SGDPVVSPNPSGGVPASFTDGTHVAYFDTPNPIPSGMTITPKQLGASMVTTAGASQPHE 169 >UniRef50_B8DLJ2 Tail fiber protein, putative n=3 Tax=Desulfovibrio vulgaris RepID=B8DLJ2_DESVM Length = 505 Score = 48.8 bits (114), Expect = 2e-04, Method: Composition-based stats. 
Identities = 40/244 (16%), Positives = 63/244 (25%), Gaps = 40/244 (16%) Query: 54 TSGAHAPAFIRSRRDTTDANWSPWAQLYTSAHPPAEF-----YPVGAPIPWPSDTVPSGY 108 + F+R D A T P+G WP T P+G Sbjct: 171 QATTEVVGFVRRATDEKAAEGVDTEDYVTPKQLADALKRVGGMPIGMTFWWPGTTPPAGS 230 Query: 109 ALMQ-GQTFDKSAYPKL-AVAYPSGVI------------------------------PDM 136 + G + AYP+L A+A SG I P + Sbjct: 231 LAINDGPLLPREAYPQLWAMAQASGNIITEAAWQAQAAVQSSVGAFSSGDGATTFRCPRL 290 Query: 137 RGWTIKGKPASGRAVLSQEQDGIKSHTHSASASSTDLGTETTSSFDYGTKSTNNTGAHTH 196 R + P+ GRAV + + + D T +G + + T Sbjct: 291 RDFVRGANPSGGRAVGAWQAHATE---GLFVPMDGDGETVIGVVPSWGPATHDLTSGPGT 347 Query: 197 SISGTANSAGAHQHKSSGAFGGTNTSIFPNGYTAISNLSAGIMSTTSGSGQTRNAGKTSS 256 + S +G + P +AG + + S Sbjct: 348 VNVTASTSGILTIPTGTGETLPRTVNWLPCIKAFDHVTNAGEVDIAALVATLAGKVDRSD 407 Query: 257 DGAH 260 H Sbjct: 408 WSQH 411 >UniRef50_Q55EP2 Putative uncharacterized protein n=1 Tax=Dictyostelium discoideum RepID=Q55EP2_DICDI Length = 166 Score = 48.5 bits (113), Expect = 3e-04, Method: Composition-based stats. Identities = 21/123 (17%), Positives = 36/123 (29%), Gaps = 11/123 (8%) Query: 80 LYTSAHPPAEFYPVGAPIPWPSDTVPSGYALMQGQTFDKSAYPKLAVAYP--------SG 131 + G+ + +P G++L G +K YP+L Sbjct: 14 VANLEKKIENSIQPGSVNIFTGIEIPVGWSLCDGAPLNKLTYPELYRQIGDAFGSSEHEF 73 Query: 132 VIPDMRGWTIKGKPAS-GRAVLSQEQDGIKSHTHS--ASASSTDLGTETTSSFDYGTKST 188 PD RG G G + SH H + SS +G++ Sbjct: 74 SKPDFRGKCPIGAGNGVGLTNHLLTVSELPSHDHPVIDPGHTWHSIGGGFSSGPHGSRGE 133 Query: 189 NNT 191 ++ Sbjct: 134 SHN 136 >UniRef50_D1ANH0 Putative uncharacterized protein n=1 Tax=Sebaldella termitidis ATCC 33386 RepID=D1ANH0_SEBTE Length = 390 Score = 48.1 bits (112), Expect = 5e-04, Method: Composition-based stats. 
Identities = 25/141 (17%), Positives = 40/141 (28%), Gaps = 2/141 (1%) Query: 160 KSHTHSASASSTDLGTETTSSFDYGTKSTNNTGAHTHSISGTANSAGAHQHKSSGAFGGT 219 ++ + + E T N +H H++ G ++ G H H + Sbjct: 243 RTLVGVDTNDVSFNAGEKIGGTQTTTLGVGNLPSHNHNVQGATDAQGNHYHIVNDHSHYV 302 Query: 220 NTSIFPNGYTAISNLSAGIMSTTSGSGQTRNAGKTSSDGAHTHSLSGTAASAGAHAH--T 277 +G + T T + TH + AG H H Sbjct: 303 PPHAHGLSVLRAKAGDSGGNGGNTAYNGTIVNYSTDATDLWTHGSAPATNWAGQHNHWFN 362 Query: 278 VGIGAHTHSVAIGSHGHTITV 298 V GA A + ITV Sbjct: 363 VTSGATGSGQAFSNLSPYITV 383 Score = 46.1 bits (107), Expect = 0.002, Method: Composition-based stats. Identities = 25/118 (21%), Positives = 41/118 (34%), Gaps = 11/118 (9%) Query: 200 GTANSAGAHQHKSSGAFGGTNTSIFPNGYTAISNLSAGIMSTTSGSGQTRNAG-KTSSDG 258 GT + + FGGT + + + G T+ + Sbjct: 216 GTIYTTVNKDFDPNVTFGGTWERYAKGRTLVGVDTNDVSFNAGEKIGGTQTTTLGVGNLP 275 Query: 259 AHTHSLSGTAASAGAHAHTVGIGAHTHSVAIGSHGHTITV------NAAGNAENTVKN 310 +H H++ G + G H H V HS + H H ++V ++ GN NT N Sbjct: 276 SHNHNVQGATDAQGNHYHIVN----DHSHYVPPHAHGLSVLRAKAGDSGGNGGNTAYN 329 >UniRef50_A1TUY7 Phage Tail Collar domain protein n=4 Tax=Acidovorax RepID=A1TUY7_ACIAC Length = 204 Score = 47.7 bits (111), Expect = 5e-04, Method: Composition-based stats. Identities = 14/75 (18%), Positives = 24/75 (32%), Gaps = 9/75 (12%) Query: 93 VGAPIPWPSDTVPSGYALMQGQTFDKSAYPKLAVAYP---------SGVIPDMRGWTIKG 143 +G + W + VP G+AL G + + P L + +PD+R G Sbjct: 6 IGTVLLWTAAFVPRGWALCDGSVLNITQNPALFAILGNRFGGDGRTTFQLPDLRNRVPMG 65 Query: 144 KPASGRAVLSQEQDG 158 + Sbjct: 66 LQTVDQPPGPTGAAS 80 >UniRef50_C7BIF9 Putative uncharacterized protein n=1 Tax=Photorhabdus asymbiotica subsp. asymbiotica ATCC 43949 RepID=C7BIF9_PHOAA Length = 286 Score = 47.7 bits (111), Expect = 5e-04, Method: Composition-based stats. 
Identities = 34/199 (17%), Positives = 57/199 (28%), Gaps = 28/199 (14%) Query: 87 PAEFYPVGAPIPWPSDTV--PSGYALMQGQTFDKSAYPKLAVAYPSGVIPDMRGWTIKGK 144 + +P G + + P G+A G ++ +PD+R I Sbjct: 98 SDQIFPKGMIVMFSGSENEIPPGWAFCDGGEYNGIK------------VPDLRNRFIMCS 145 Query: 145 PASGRAVLSQEQDGIKSHTHSASASSTDLGTETTSSFDYGTKSTNNTGAHTHSISGTANS 204 S ++ + + + + + T + H H Sbjct: 146 ETFAEKGESSKKANGDGNNKNFLKDTESITVSIDVKVENTTLDISQIPKHNH-------I 198 Query: 205 AGAHQHKSSGAFGGTNTSIFPNGYTAISNLSAGIMSTTSGSGQTRNAGKTSSDG---AHT 261 G H S FG + Y + S+ T S TS G H Sbjct: 199 QGLPYH-SDVGFGYPHVKWGKTPYRIDNTYSSSFWHTDKSSNNDDLHPNTSEVGEGKGHN 257 Query: 262 HSLSGTAASAGAHAHTVGI 280 HS AS+ H+H V + Sbjct: 258 HS---ATASSSPHSHKVDV 273 >UniRef50_D2L4G0 Tail Collar domain protein n=1 Tax=Desulfovibrio sp. FW1012B RepID=D2L4G0_9DELT Length = 319 Score = 47.7 bits (111), Expect = 6e-04, Method: Composition-based stats. Identities = 44/190 (23%), Positives = 63/190 (33%), Gaps = 10/190 (5%) Query: 81 YTSAHPPAEFYPVGAPIPWPSDTVPSG---YALMQGQTF-DKSAYPKLAVAYPSGVIPDM 136 PVG IPWPS ++P+ + GQ S Y +L V S IP+ Sbjct: 31 VPIPITSQGTIPVGTVIPWPSTSMPADATRWLECNGQAVPSGSQYDRLRVVLGSKPIPNY 90 Query: 137 RGWTIKGKPASGRAVLSQEQDGIKSHTHSASASSTDLGTETTSSFDYGTKSTNNTGAHTH 196 G ++G V D ++H H A + + G ++ + T Sbjct: 91 NGQFLRG-TTVSSEVGQTVADSTRAHDHLIDAHQHTVSGTASGQSYGGAIASVSISGSTS 149 Query: 197 SISGTANSAGAHQH--KSSGAFGGTNTSIFPNGYTAISNLSA---GIMSTTSGSGQTRNA 251 S S + AG H S A+GG G T+ ST + T Sbjct: 150 SQSYSGTIAGQHITGATSGQAYGGNIAGQHVTGSTSGQAYIYDIAAAGSTWWPATGTPGY 209 Query: 252 GKTSSDGAHT 261 T S H Sbjct: 210 IDTVSSITHY 219 >UniRef50_A8T9J8 Putative uncharacterized protein n=1 Tax=Vibrio sp. AND4 RepID=A8T9J8_9VIBR Length = 242 Score = 47.3 bits (110), Expect = 7e-04, Method: Composition-based stats. 
Identities = 28/176 (15%), Positives = 48/176 (27%), Gaps = 13/176 (7%) Query: 65 SRRDTTDANWSPWAQLYTSAHPPAEFYPVGAPIPWPSDT--VPSGYALMQGQTFDKSAYP 122 S+ T + PVG + W S + P G+AL G T + P Sbjct: 61 SQLAGTGLKNNSSKLNVDYDELLNALIPVGTIVAWGSTSNNPPKGWALCDGST---AGVP 117 Query: 123 KLA-------VAYPSGVIPDMRGWTIKGKPASGRAVL-SQEQDGIKSHTHSASASSTDLG 174 L Y + + D R AS + + I SH H + Sbjct: 118 DLTGCFLMGNKTYGTNAVSDNRRILGTSSNASSLVLGHKLNINQIPSHDHQMTIMQEHSK 177 Query: 175 TETTSSFDYGTKSTNNTGAHTHSISGTANSAGAHQHKSSGAFGGTNTSIFPNGYTA 230 ++ + Y + + + +H H + T + Sbjct: 178 SKNGTYMHYYLAPATGNNNNWRGNTNSKGGNQSHSHNVNVNALATKLKVAGQPKHY 233 >UniRef50_UPI000194E452 PREDICTED: similar to tau-tubulin kinase n=1 Tax=Taeniopygia guttata RepID=UPI000194E452 Length = 452 Score = 47.3 bits (110), Expect = 7e-04, Method: Composition-based stats. Identities = 21/109 (19%), Positives = 34/109 (31%), Gaps = 1/109 (0%) Query: 143 GKPASGRAVLSQEQDGIKSHTHSASASSTDLGTETTSSFDYGTKSTNNTGAHTHSISGTA 202 G +V S + SH H + S + G+ + G+H H G+ Sbjct: 27 GASDGHGSVGSDGHGSLGSHGHGSPGSHGHGSPGSHGHGSPGSHGHGSPGSHGHGSPGSH 86 Query: 203 NSAGAHQH-KSSGAFGGTNTSIFPNGYTAISNLSAGIMSTTSGSGQTRN 250 S G+ G + +G I S GI+ + S Sbjct: 87 GHGSPGILQSSPGSDGHGSPGSHGHGSPGILRGSPGILRGSPRSPLPGQ 135 >UniRef50_Q7N651 Complete genome; segment 6/17 n=4 Tax=Gammaproteobacteria RepID=Q7N651_PHOLL Length = 434 Score = 47.3 bits (110), Expect = 8e-04, Method: Composition-based stats. 
Identities = 27/183 (14%), Positives = 60/183 (32%), Gaps = 17/183 (9%) Query: 84 AHPPAEFYPVGAPIPWPSDTVPSGYALMQGQTFDKSAYPKLAVAYPSGVIPDMRGWTIKG 143 + P P G + + + P+G+A G + PD+R + Sbjct: 254 SIDPKTVLPKGMIVMFSGSSAPTGWAFCDG----------------NHGTPDLRSRFVMC 297 Query: 144 KPASGRAVLSQEQDGIKSHTHSASASSTDLGTETTSSFDYGTKSTNNTGAHTHSISGTAN 203 S + + + S ++T + + T + + H H Sbjct: 298 SETISETGKSSNKASGSGNGKNYSRNTTSTTVSVSVTVKNTTLTESQIPYHYHIGGMGYW 357 Query: 204 SAGAHQHKSS-GAFGGTNTSIFPNGYTAISNLSAGIMSTTSGSGQTRNAGKTSSDGAHTH 262 + ++ + + + + +N + ++ SG GQ N T+S +H H Sbjct: 358 TNKGMKYGTEYYSEYASYIRNDLDSVMQSANGARYAYTSPSGGGQGHNHPATASSPSHDH 417 Query: 263 SLS 265 S++ Sbjct: 418 SVN 420 >UniRef50_A9DEL7 Tail fiber protein 2 n=1 Tax=Yersinia phage PY100 RepID=A9DEL7_9CAUD Length = 640 Score = 47.3 bits (110), Expect = 8e-04, Method: Composition-based stats. Identities = 39/226 (17%), Positives = 69/226 (30%), Gaps = 26/226 (11%) Query: 97 IPWPSDTV--PSGYALMQGQTFDKSAYPKLAVAYPSGVIPDMRGWTIKGKPASGRAVLSQ 154 + W S P+G L GQ D++ +P L +G +P + A+ + S Sbjct: 63 VMWHSTQKHLPAGCLLSDGQEVDRATWPSLFEEIEAGRVPVVPEAD---WLANPKLRGSY 119 Query: 155 EQDGIKSHTHSASASSTDLGTETTSSFDYGTKSTNNTGAHTHSISGTANSAGAHQHKSSG 214 + + + +G+ ++ G I +AN H +G Sbjct: 120 TLGDVVNTFRVPDYNGRSVGSLGRIFLGGDGQNAGLDG----QIQESANKRHNHAITDNG 175 Query: 215 AFGGTNTSIFPNGYTAISNLSAGIMSTTSGSGQTRNAGKTSSDGAHTHSLSGTAASAGAH 274 G N + + +A AG H + Sbjct: 176 HSHGVNDAGHSHEKSAWVANPAGGGQIYRDPEVWITTNAADKVEVHYKT----------- 224 Query: 275 AHTVGIGAHTHSVAIGSHGHTITVNAAGNAENTVKNIAFNYIVRLA 320 G T +++ IT+ G A+ N+A YI+R A Sbjct: 225 ------GVSTSGISLQESESGITLAEDGEADARPSNVAGCYIIRGA 264 >UniRef50_Q58MY1 Predicted protein n=1 Tax=Prochlorococcus phage P-SSM2 RepID=Q58MY1_BPPRM Length = 597 Score = 46.9 bits (109), Expect = 0.001, Method: Composition-based stats. 
Identities = 34/183 (18%), Positives = 67/183 (36%), Gaps = 18/183 (9%) Query: 91 YPVGAPIPW--PSDTVPSGYALMQGQTFDKSAYPKLAVAYPSGVIPDMRGWTIKGKPASG 148 P G + W + +PSG+ L G + PD+R + G ++ Sbjct: 356 IPAGVVVMWSGAQNAIPSGWVLCDG----------------NNSSPDLRDKFVIGAGSNY 399 Query: 149 RAVLSQEQDGIKSHTHSASASSTDLGTETTSSFDYGTKSTNNTGAHTHSISGTANSAGAH 208 + HS SAS++ G + + S ++ + + S + + + + H Sbjct: 400 AVDNTGGSADAVVVDHSHSASTSVSGAGAHTHSFSASDSHTHSFSGSGSDTFSGSGSHTH 459 Query: 209 QHKSSGAFGGTNTSIFPNGYTAISNLSAGIMSTTSGSGQTRNAGKTSSDGAHTHSLSGTA 268 SG+ G + + + + ++ SGS + S T ++SG+A Sbjct: 460 SFSGSGSHGHSLSLSDSAHQHTSAIPAQNQVAGNSGSQTIWGSVTNSPTWGATANVSGSA 519 Query: 269 ASA 271 SA Sbjct: 520 DSA 522 >UniRef50_D1Y7E0 Collagen alpha 1 n=1 Tax=Pyramidobacter piscolens W5455 RepID=D1Y7E0_9BACT Length = 386 Score = 46.9 bits (109), Expect = 0.001, Method: Composition-based stats. Identities = 30/197 (15%), Positives = 52/197 (26%), Gaps = 12/197 (6%) Query: 64 RSRRDTTDANWSPWAQLYTSAHPPAEFYPVGAPIPWPSDTVPSGYALMQGQTFDKSAYPK 123 + + + + + P+GA + + GY L G +D + YP Sbjct: 84 KVTYNDFLGKFQIVKEYAAFLEDCSSGVPIGATVMFKKGQQEPGYLLANGAPYDTAKYPY 143 Query: 124 LAVAYPSGVIPDM--RGWTIKGKPASGRAVLSQEQDGIKSHTHSASASSTDLGTETTSSF 181 LA + +PDM I G V + + + + + Sbjct: 144 LADCLGAANLPDMSHTPVLIPGWA---WYVKAYHR---PNVRGDLKRLQILVHQLPSDQA 197 Query: 182 DYGTKSTNNTGAHTHSISGTANSAGAHQHKSSGAFGGTNTSIFPNGYTAISNLSAGIMST 241 YG +++ + + G G G P G G Sbjct: 198 AYGAYNSSEDILNLYIPQGIQGIQGPMGPTGPAGPQGPQGEQGPRGI----QGPQGEQGL 253 Query: 242 TSGSGQTRNAGKTSSDG 258 G G T G Sbjct: 254 RGPEGAQGVKGDTGEQG 270 >UniRef50_Q727X4 Tail fiber protein, putative n=4 Tax=Desulfovibrio vulgaris RepID=Q727X4_DESVH Length = 296 Score = 46.5 bits (108), Expect = 0.001, Method: Composition-based stats. 
Identities = 19/118 (16%), Positives = 39/118 (33%), Gaps = 13/118 (11%) Query: 101 SDTVPSGYALMQGQTFDKSAYPKLAVAYPSGVIPDMRGWTIKGKP-----ASGRAVLSQE 155 + T P+ + + + + D RG +G +GR + S + Sbjct: 172 NATAPA-WYRCNASGVRDATGDHI-------RLQDRRGEFARGWDHGRGVDAGRVLGSAQ 223 Query: 156 QDGIKSHTHSASASSTDLGTETTSSFDYGTKSTNNTGAHTHSISGTANSAGAHQHKSS 213 D I++ S + + + + +F T S + G+ T A +S Sbjct: 224 GDAIRNIVGSMGSITAVVAGTASGAFTVTTPSNRSAGSSTGPTCDFTFDASRVVPTAS 281 >UniRef50_B6VNN2 Putative uncharacterized protein n=1 Tax=Photorhabdus asymbiotica subsp. asymbiotica ATCC 43949 RepID=B6VNN2_PHOAA Length = 508 Score = 46.5 bits (108), Expect = 0.001, Method: Composition-based stats. Identities = 34/196 (17%), Positives = 56/196 (28%), Gaps = 25/196 (12%) Query: 84 AHPPAEFYPVGAPIPWPSDTVPSGYALMQGQTFDKSAYPKLAVAYPSGVIPDMRGWTIKG 143 A P P G + + T P+G+AL G + P++ I G Sbjct: 318 AIDPNNVLPKGVIVMFSGSTAPTGWALCDG----------------NNGTPNLIDRFILG 361 Query: 144 KPASGRAVLSQEQDGIKSHTHSASASSTDLGTETTSSFDYGTKSTNNTGAHTHSISGTAN 203 + +S ++ SS + S H H + Sbjct: 362 GKGTDINGVSTNTASGTKNSKLFDFSSDEATLTIDGKTLGRALSLQQIPNHAHFSGIIMD 421 Query: 204 SAGAHQHKSSGAFGGTNTSIFPNGYTAISNLSAGIMSTTSGSGQTRNAGKTSSDGAH--- 260 + +G + G T N S + +SG + N S+ G + Sbjct: 422 TEKV------NYYGSKKITTNVWGVTTGDNTSVRYIYKSSGVLDSNNNVSNSTLGGNSLQ 475 Query: 261 THSLSGTAASAGAHAH 276 TH G H+H Sbjct: 476 THDHDIKITGTGKHSH 491 >UniRef50_Q56BI6 Gp12 short tail fibers n=1 Tax=Enterobacteria phage RB43 RepID=Q56BI6_9CAUD Length = 463 Score = 46.1 bits (107), Expect = 0.001, Method: Composition-based stats. 
Identities = 19/112 (16%), Positives = 34/112 (30%), Gaps = 13/112 (11%) Query: 81 YTSAHPPAEFYPVGAPIPWPSDTVPSGYALMQGQTFDKSAYPKLAVAYPS--------GV 132 Y + P+G + ++ + G+ YP+L Sbjct: 304 YQTHSDIEASLPIGCMMMAAFNSDYGNLCIANGRGMYTYEYPELFALIGYTYGGSGNIFN 363 Query: 133 IPDMRGWTIKGKP-----ASGRAVLSQEQDGIKSHTHSASASSTDLGTETTS 179 +PDMRG +G GR + + ++SH H G + Sbjct: 364 LPDMRGVVARGFDAGRGLDPGRGFGTYQHHEVQSHEHPLQMIYQSGGNLPSW 415 >UniRef50_Q8PR98 Microcystin dependent protein n=1 Tax=Xanthomonas axonopodis pv. citri RepID=Q8PR98_XANAC Length = 195 Score = 46.1 bits (107), Expect = 0.002, Method: Composition-based stats. Identities = 30/185 (16%), Positives = 58/185 (31%), Gaps = 20/185 (10%) Query: 93 VGAPIPWPSDTVPSGYALMQGQTFDKSAYPKLAVAYP---------SGVIPDMRGWTIKG 143 +G +P + P G+ GQT + Y L + +PD+RG + G Sbjct: 6 IGEVRAFPYNFAPEGWLDCMGQTVSINQYQALFGVIGFAYGGDKQTTFGLPDLRGRAVTG 65 Query: 144 KPASGRAVLSQEQDGIKSHTHSASASSTDLGTETTSSFDYGTKSTNNTGAHTHSISGTAN 203 + G+ ++T + + +++ T S + G A Sbjct: 66 ---------QGQGPGLSNYTIGQLQGTDSVALVSSTQLPAHTHSITTMFLPPATAPGAAV 116 Query: 204 SAGAHQHKSSGAFGGTNTSIFPNGYTAISNLSAGIMSTTSGSGQTRNAGKTSSDGAHTHS 263 + + S T+ P Y A + + M S + + + AH + Sbjct: 117 NTPSSSSYLSRLLNP--TTSPPTSYKAYAPATTTPMVQLSPNALAPFPSGSQAVQAHENR 174 Query: 264 LSGTA 268 T Sbjct: 175 QPFTT 179 >UniRef50_B5ZGB2 Tail Collar domain protein n=4 Tax=Gluconacetobacter diazotrophicus PAl 5 RepID=B5ZGB2_GLUDA Length = 300 Score = 46.1 bits (107), Expect = 0.002, Method: Composition-based stats. 
Identities = 30/178 (16%), Positives = 52/178 (29%), Gaps = 13/178 (7%) Query: 95 APIPWPSDTVPSGYALMQGQTFDKSAYPKLAVAYP----------SGVIPDMRGWTIKGK 144 + T P+G+ L GQ ++ Y L + +PD+RG G Sbjct: 106 TVADYAGATAPAGWMLCCGQAVSRATYAALFAVIGTTFGAGDGATTFGLPDLRGRVAAGV 165 Query: 145 PASGRA---VLSQEQDGIKSHTHSASASSTDLGTETTSSFDYGTKSTNNTGAHTHSISGT 201 + G +L+ GI A+ S + T + D G H H Sbjct: 166 DSMGGTAANLLTMAGAGINGVQLGAAGGSQMAPSHTHAVTDPGHAHAVTDPGHAHGPGSG 225 Query: 202 ANSAGAHQHKSSGAFGGTNTSIFPNGYTAISNLSAGIMSTTSGSGQTRNAGKTSSDGA 259 + + + + G+ T+ +G T A T + Sbjct: 226 TGFVVPQGTGGEIVTFDGGSLTPEHATAQTTADTTGVTVDTATTGITLAAAGTGASQN 283 >UniRef50_A6EAB9 Microcystin-dependent protein n=1 Tax=Pedobacter sp. BAL39 RepID=A6EAB9_9SPHI Length = 198 Score = 46.1 bits (107), Expect = 0.002, Method: Composition-based stats. Identities = 27/173 (15%), Positives = 50/173 (28%), Gaps = 24/173 (13%) Query: 94 GAPIPWPSDTVPSGYALMQGQTFDKSAYPKLAVAYP---------SGVIPDMRGWTIKG- 143 G + P G+AL G S L + +PD+RG G Sbjct: 27 GEIRAFACSYAPEGWALCDGSLLPLSQNQALYSLLGTRFGGNGTTTFALPDLRGRVPVGT 86 Query: 144 -------------KPASGRAVLSQEQDGIKSHTHSASASSTDLGTETTSSFDYGTKSTNN 190 G ++ E + H H SA + LG+ + + + Sbjct: 87 GVRGASPAYTYTIGNNGGSETVALETATMPPHNHYVSAKNA-LGSVGLAGGILAIPNGGS 145 Query: 191 TGAHTHSISGTANSAGAHQHKSSGAFGGTNTSIFPNGYTAISNLSAGIMSTTS 243 T + ++ S A + + G ++++ P G+ Sbjct: 146 TQVNIYNTSAGATTTLNPDTVGNTGAGSPHSNMQPFQTINFCIAVLGLYPPRP 198 >UniRef50_Q4KAW4 Putative uncharacterized protein n=1 Tax=Pseudomonas fluorescens Pf-5 RepID=Q4KAW4_PSEF5 Length = 181 Score = 46.1 bits (107), Expect = 0.002, Method: Composition-based stats. 
Identities = 29/172 (16%), Positives = 51/172 (29%), Gaps = 10/172 (5%) Query: 94 GAPIPWPSDTVPSGYALMQGQTFDKSAYPKLAVAYP---------SGVIPDMRGWTIKGK 144 G +P P G+ L QGQ D Y LA + +PD+RG G+ Sbjct: 6 GEIRLFPWAWAPQGWLLCQGQILDVVNYTALASLLGDRYGGDGRTTFGLPDLRGRAALGE 65 Query: 145 PASGRAVLSQEQDGIKSHTHSASASSTDLGTETTSSFDYGTKSTNNTGAHTHSISGTANS 204 + S + + T L + T TG +I +++ Sbjct: 66 NPVASTSPVLGVHELGSMDGAEWVALT-LNNLPAHNHVANVAVTAGTGGPAGNIPAISST 124 Query: 205 AGAHQHKSSGAFGGTNTSIFPNGYTAISNLSAGIMSTTSGSGQTRNAGKTSS 256 + K + + N T ++ L + + T + Sbjct: 125 SKGAVSKPTYVAYADKDRVTINPTTVVTTLGYPLPNMQPSIVGNFCIAVTGT 176 >UniRef50_UPI00016C488F hypothetical protein GobsU_00180 n=1 Tax=Gemmata obscuriglobus UQM 2246 RepID=UPI00016C488F Length = 224 Score = 45.8 bits (106), Expect = 0.002, Method: Composition-based stats. Identities = 29/179 (16%), Positives = 49/179 (27%), Gaps = 22/179 (12%) Query: 92 PVGAPIPW--PSDTVPSGYALMQGQTFDKSAYPKLAVAYPSGVIPDMRGWTIKGKPASGR 149 P+G + W +VP G+ + G+ + + A +G P++ KG A Sbjct: 55 PIGTVVMWWGDRASVPPGWEVCDGKPVETNG------AILTGTKPNLVDRFPKGATAGRN 108 Query: 150 AVLSQEQDGIKSHTHSASASSTDLGTETTSSFDYGTKSTNNTGAHTHSISGTANSAGAHQ 209 V A A+ + + G H H I G+ Sbjct: 109 TVADL-----------AKAAGGSNNLPALKLANISGLAVGRNGEHEHRIPTF---DGSTS 154 Query: 210 HKSSGAFGGTNTSIFPNGYTAISNLSAGIMSTTSGSGQTRNAGKTSSDGAHTHSLSGTA 268 + N Y G +G + +DG T + A Sbjct: 155 TDRNSYVEIPNYRGRTYDYLNGPPTEKGGAHEHPLTGFVGDKNGKDADGGDTSGANQPA 213 >UniRef50_C3YB93 Putative uncharacterized protein n=1 Tax=Branchiostoma floridae RepID=C3YB93_BRAFL Length = 749 Score = 45.8 bits (106), Expect = 0.002, Method: Composition-based stats. 
Identities = 23/146 (15%), Positives = 43/146 (29%), Gaps = 1/146 (0%) Query: 154 QEQDGIKSHTHSASASSTDLGTETTSSFDYGTKSTNNTG-AHTHSISGTANSAGAHQHKS 212 + SH + + + S +G +S +N G +H+H +N H H Sbjct: 473 VQGGQFHSHGDQSHSHGDQSHSHGDQSHSHGDQSHSNGGQSHSHGGQSHSNGGQFHSHGD 532 Query: 213 SGAFGGTNTSIFPNGYTAISNLSAGIMSTTSGSGQTRNAGKTSSDGAHTHSLSGTAASAG 272 G + + + S + G + ++ A + Sbjct: 533 QSHSHGGQSHSHGGQSHSHGDQSHSHGGQSHSHGGHSHDMNRNAGIASVAWMVIMGDGLH 592 Query: 273 AHAHTVGIGAHTHSVAIGSHGHTITV 298 A V IGA + +I V Sbjct: 593 NFADGVTIGAAFATSLTTGLSTSIAV 618 Score = 45.0 bits (104), Expect = 0.003, Method: Composition-based stats. Identities = 28/110 (25%), Positives = 43/110 (39%), Gaps = 7/110 (6%) Query: 191 TGAHTHSISGTANSAGAHQHKSSGAFGGTNTSIFPNGYTAISNLSAGIMSTTSGSGQTRN 250 G HS ++S G H NG + S G S ++G + Sbjct: 474 QGGQFHSHGDQSHSHGDQSHSHGDQSHSHGDQSHSNGGQSHS---HGGQSHSNGGQFHSH 530 Query: 251 AGKTSSDGAHTHSLSGTAASAGAHAHTVGIGAHTHSVAIGSHGHTITVNA 300 ++ S G +HS G + S G +H+ G +H+H G H H + NA Sbjct: 531 GDQSHSHGGQSHSHGGQSHSHGDQSHSHGGQSHSH----GGHSHDMNRNA 576 Score = 42.3 bits (97), Expect = 0.025, Method: Composition-based stats. Identities = 24/115 (20%), Positives = 40/115 (34%), Gaps = 10/115 (8%) Query: 204 SAGAHQHKSSGAFGGTNTSIFPNGYTAISNLSAGIMSTTSGSGQTRNAGKTSSDGAHTHS 263 H H G + + + G S ++G + G++ S+G HS Sbjct: 475 GGQFHSHGDQSHSHGDQSHSHGDQSHSH-----GDQSHSNGGQSHSHGGQSHSNGGQFHS 529 Query: 264 LSGTAASAGAHAHTVGIGAHTHSVAIGSHGHTITVNAAGNAE---NTVKNIAFNY 315 + S G +H+ G +H+H SH H ++ G N IA Sbjct: 530 HGDQSHSHGGQSHSHGGQSHSHGDQ--SHSHGGQSHSHGGHSHDMNRNAGIASVA 582 >UniRef50_Q11LT1 Microcystin-dependent protein-like n=1 Tax=Chelativorans sp. BNC1 RepID=Q11LT1_MESSB Length = 268 Score = 45.8 bits (106), Expect = 0.002, Method: Composition-based stats. 
Identities = 34/190 (17%), Positives = 58/190 (30%), Gaps = 5/190 (2%) Query: 81 YTSAHPPAEFYPVGAPIPWPSDTVPSGYALMQGQTF-DKSAYPKLAVAYPSGVIPDMRGW 139 T A + P+G + + T P G+ GQ + YP L A + P Sbjct: 71 VTDAATVGQLVPIGTIVDYALSTAPEGWTFCYGQALTSSTPYPLLRAALLAAGSPFGTSG 130 Query: 140 T-IKGKPASGRAVLSQEQDGIKSHTHSASASSTDLGTETTSSFDYGTKSTNNTGAHTHSI 198 + + GR ++ G S + S G + T + + +H+ Sbjct: 131 SDPRVPDYRGRVGAGKDNMGGTSANRLTNQSGGVNGDVLGDTGGAETHTLSVGQMPSHNH 190 Query: 199 SGTANSAGAHQHKSSGAFGGTNTSIFPNGYTAISNLSAGIMSTTSGSGQTRNAGKTSSDG 258 SG+ S G H H + P T + + S + + S G Sbjct: 191 SGSTGSGGNHTHTMYVKNLSAGSGGNPVTGTPSGTIDSTYQS---DPSGSHSHSIPSQGG 247 Query: 259 AHTHSLSGTA 268 H+ Sbjct: 248 NDPHNNVQPT 257 >UniRef50_Q84CW8 Putative transmembrane protein n=1 Tax=uncultured bacterium RepID=Q84CW8_9BACT Length = 406 Score = 45.4 bits (105), Expect = 0.003, Method: Composition-based stats. Identities = 44/203 (21%), Positives = 74/203 (36%), Gaps = 16/203 (7%) Query: 102 DTVPSGYALMQGQTFD-KSAYPKLAVAY-PSGVIPDMRGWTIKGKPASGRAVLSQEQDGI 159 P+G+ L +S YP L SG+ M G +I + R ++ G Sbjct: 186 AAAPTGWLLFGQTYLSGQSTYPALWAVLVASGLTSWMSGTSIVLPDLADRVLMDGGTLGA 245 Query: 160 KSHTHSASASSTDLGTETTSSFDYGTKSTNNTGAHTHSISGTANSAGAHQHKSSGAFGGT 219 ++ + S+ +L S H H +A ++ H H S GGT Sbjct: 246 TGGANAVTLSTANLPAHDHSI------------DHNHGSVTSAGNSVNHTHTFSDTTGGT 293 Query: 220 NTSIFPNGYTAISNLSAGIMSTTSGSGQTRNAGKTSS-DGAHTHSLSGTAAS-AGAHAHT 277 + ++ A + + +G NA T + G HTH++SGT + AH H Sbjct: 294 GEHNHNAWFVDVTGGGAASRAAPASTGSGTNAQITIAGGGDHTHTVSGTTGGDSVAHTHA 353 Query: 278 VGIGAHTHSVAIGSHGHTITVNA 300 V + + G +T + Sbjct: 354 VDLPNFAGTSGSVGSGTAVTTHP 376 >UniRef50_A1TNG3 Phage Tail Collar domain protein n=9 Tax=Bacteria RepID=A1TNG3_ACIAC Length = 176 Score = 45.4 bits (105), Expect = 0.003, Method: Composition-based stats. 
Identities = 24/148 (16%), Positives = 47/148 (31%), Gaps = 21/148 (14%) Query: 94 GAPIPWPSDTVPSGYALMQGQTFDKSAYPKLAVAYP---------SGVIPDMRGWTIKG- 143 G + + P G+A QGQ + L + +PD+RG G Sbjct: 8 GEISMFAGNFPPKGWAFCQGQILPIAQNSALFALLGTTYGGNGQTTFALPDLRGRVPLGQ 67 Query: 144 -----------KPASGRAVLSQEQDGIKSHTHSASASSTDLGTETTSSFDYGTKSTNNTG 192 G+ ++ + + + HTH+ S S + + + +++ Sbjct: 68 GQGPGLQPYSQGQVGGQETVTLQGNQMPMHTHTTSVSVSSNAGNSAAPNGRYLAASDQRN 127 Query: 193 AHTHSISGTANSAGAHQHKSSGAFGGTN 220 SG + AG + + N Sbjct: 128 DQYTDQSGNGSLAGVTTGFAGNSLPHEN 155 >UniRef50_Q7N687 Complete genome; segment 6/17 n=1 Tax=Photorhabdus luminescens subsp. laumondii RepID=Q7N687_PHOLL Length = 343 Score = 45.4 bits (105), Expect = 0.003, Method: Composition-based stats. Identities = 28/191 (14%), Positives = 54/191 (28%), Gaps = 22/191 (11%) Query: 84 AHPPAEFYPVGAPIPWPSDTVPSGYALMQGQTFDKSAYPKLAVAYPSGVIPDMRGWTIKG 143 + P P G + + +VP+G+ L G + P++ I G Sbjct: 158 SIDPNTVLPRGMIVMFSGKSVPTGWTLCDG----------------NNGTPNLIDRFILG 201 Query: 144 KPASGRAVLSQEQDGIKSHTHSASASSTDLGTETTSSFDYGTKSTNNTGAHTHSISGTAN 203 SG S + S + +S + + S H+H + Sbjct: 202 GNFSGIDGKSSTTVSGPKDSKSFNFNSNEATLNINGKTSERSLSIGQIPNHSHLSGINID 261 Query: 204 S------AGAHQHKSSGAFGGTNTSIFPNGYTAISNLSAGIMSTTSGSGQTRNAGKTSSD 257 + K+ A + + Y + LS+ + + ++ Sbjct: 262 TNIMAQYGATQIGKTDRAVASSKNTSERYLYYSSGILSSNGTIGQNSPETHDHDINLTNT 321 Query: 258 GAHTHSLSGTA 268 G H H T Sbjct: 322 GNHFHKNQITT 332 >UniRef50_Q7N6A5 Complete genome; segment 6/17 n=1 Tax=Photorhabdus luminescens subsp. laumondii RepID=Q7N6A5_PHOLL Length = 405 Score = 45.4 bits (105), Expect = 0.003, Method: Composition-based stats. 
Identities = 32/195 (16%), Positives = 58/195 (29%), Gaps = 27/195 (13%) Query: 86 PPAEFYPVGAPIPWPSDTVPSGYALMQGQTFDKSAYPKLAVAYPSGVIPDMRGWTIKGKP 145 + G + + + PSG+A G + PD+R I Sbjct: 225 DANKLLSKGMIVMFSGSSAPSGWAFCDG----------------NNGTPDLRSRFIMCGE 268 Query: 146 ASGRAVLSQEQDGIKSHTHSASASSTDLGTETTSSFDYGTKSTNNTGAHTHSISGTANSA 205 S + + S ++T S T + + H H S + Sbjct: 269 TVSETGKSSNKASGSGSGKNVSRNTTSTAVSVNVSVLNTTLTESQIPKHKHIESLPYYTT 328 Query: 206 GAHQHKSSGAFGGTNTSIFPNGYTAISNLSAGIMSTTSGSGQTRNAGKTSSDGAHTHSLS 265 + +T+I ++++ I TSG H H+ S Sbjct: 329 LGFAYD--------HTTIGATNNKIDNSVNGLIWKRTSGPDYHPYTSDIGGGQGHNHNAS 380 Query: 266 GTAASAGAHAHTVGI 280 AS+ +H H+V + Sbjct: 381 ---ASSPSHTHSVDV 392 >UniRef50_Q03314 Protein rhiB n=2 Tax=Rhizobium leguminosarum bv. viciae RepID=RHIB_RHILV Length = 219 Score = 45.0 bits (104), Expect = 0.003, Method: Composition-based stats. Identities = 24/118 (20%), Positives = 39/118 (33%), Gaps = 31/118 (26%) Query: 106 SGYALMQGQTFDKSAYPKLAVAYP------------SGVIPDMRGWTIKGKPASG----- 148 G+ L G+ + YP+L IPD RG ++G A G Sbjct: 78 QGWMLCDGRYLRAAVYPELYAVLGGLYGERNSTADLEFRIPDYRGLFLRGFDAGGGMDPD 137 Query: 149 -------------RAVLSQEQDGIKSHTHSASASSTDLGTETTSSFDYGTKSTNNTGA 193 V S + D ++ H H +T G + + S+ +TG+ Sbjct: 138 AKRRLDPTGNNVANVVGSLQCDALQVHAHPYEI-TTPAGISQQGNAAGTSISSKSTGS 194 >UniRef50_A6EAB8 Phage Tail Collar n=1 Tax=Pedobacter sp. BAL39 RepID=A6EAB8_9SPHI Length = 183 Score = 44.6 bits (103), Expect = 0.005, Method: Composition-based stats. 
Identities = 16/153 (10%), Positives = 41/153 (26%), Gaps = 4/153 (2%) Query: 94 GAPIPWPSDTVPSGYALMQGQTF----DKSAYPKLAVAYPSGVIPDMRGWTIKGKPASGR 149 G + P + + G T +++ Y + Y S D + ++G+ G+ Sbjct: 7 GEIRAFAGTYAPVDWMMCNGATLTVQGNEALYSLIGSTYGSNGPTDFKVPDLRGRLTVGQ 66 Query: 150 AVLSQEQDGIKSHTHSASASSTDLGTETTSSFDYGTKSTNNTGAHTHSISGTANSAGAHQ 209 + + I A + + + + + + + + Sbjct: 67 GLGTGLTSRILGSVGGAETVALTEAQLPAHNHNLTVSTVTSPASVNAPSNTSYLGVVNSS 126 Query: 210 HKSSGAFGGTNTSIFPNGYTAISNLSAGIMSTT 242 + + N + LS S Sbjct: 127 AGAGVGYVPGNATGASVRALDTQVLSNTGGSQA 159 >UniRef50_UPI0001BC923E Phage tail Collar n=1 Tax=Pseudomonas syringae pv. tabaci ATCC 11528 RepID=UPI0001BC923E Length = 196 Score = 44.2 bits (102), Expect = 0.006, Method: Composition-based stats. Identities = 27/165 (16%), Positives = 52/165 (31%), Gaps = 13/165 (7%) Query: 94 GAPIPWPSDTVPSGYALMQGQTFDKSAYPKLAVAYP---------SGVIPDMRGWTIK-- 142 G+ + + P+G+ GQT + S Y L + ++P+++G Sbjct: 6 GSIMTFGFPFAPAGWMQCNGQTLNISQYNALYALLGVIYGGNPSQNFMLPNLQGRVPINQ 65 Query: 143 --GKPASGRAVLSQEQDGIKSHTHSASASSTDLGTETTSSFDYGTKSTNNTGAHTHSISG 200 G + R + S + + + + T++ + TGA + Sbjct: 66 GTGVNLTNRVIGSVSGVEKVTVAIANMPAHVHQMSTLTANTTITLANPAVTGATIAPTTD 125 Query: 201 TANSAGAHQHKSSGAFGGTNTSIFPNGYTAISNLSAGIMSTTSGS 245 A + +S N P +S G M GS Sbjct: 126 NAFIGASTSGPTSANIFSPNAGTAPVVQKGVSTAITGTMQPVGGS 170 >UniRef50_A1SXZ3 Phage Tail Collar domain protein n=4 Tax=Bacteria RepID=A1SXZ3_PSYIN Length = 195 Score = 44.2 bits (102), Expect = 0.006, Method: Composition-based stats. 
Identities = 25/132 (18%), Positives = 42/132 (31%), Gaps = 21/132 (15%) Query: 94 GAPIPWPSDTVPSGYALMQGQTFDKSAYPKLAVAYP---------SGVIPDMRGWTI--- 141 G P + P G+A GQ + + L + +PD RG + Sbjct: 31 GEIAWVPYNFAPRGWASCDGQLLPITQHNALFSLLGTVYGGDGRTTFALPDARGRVMIHE 90 Query: 142 ---------KGKPASGRAVLSQEQDGIKSHTHSASASSTDLGTETTSSFDYGTKSTNNTG 192 + G ++ + I SHTH ASS + + + S Sbjct: 91 GQGPGLTNRRLGDKWGEEQVTLQTSQIPSHTHRQQASSGSPSSTSPEENVLASPSRTQLY 150 Query: 193 AHTHSISGTANS 204 A I +A++ Sbjct: 151 ADDADIDMSADN 162 >UniRef50_Q8GDJ7 Orf24 n=1 Tax=Photorhabdus luminescens RepID=Q8GDJ7_PHOLU Length = 434 Score = 44.2 bits (102), Expect = 0.007, Method: Composition-based stats. Identities = 32/221 (14%), Positives = 63/221 (28%), Gaps = 33/221 (14%) Query: 65 SRRDTTDANWSPWAQLYTSAHPPAEFYPVGAPIPWPSDTVPSGYALMQGQTFDKSAYPKL 124 S R A + P + P G + + VP G+A G Sbjct: 233 SYRFRVKAGNGIKVDDKGVSIDPDKVLPRGMIVMFSGSVVPQGWAFCDG----------- 281 Query: 125 AVAYPSGVIPDMRGWTIKGK---PASGRAVLSQEQDGIKSHTHSASASSTDLGTETTSSF 181 + PD+R + G +G + K+ + +A +T T + + Sbjct: 282 -----TNGTPDLRDRFVSGAWQLSDAGNTNDKRITGDNKN--KAFNAQTTADKTNLSVNV 334 Query: 182 DYGTKSTNNTGAHTHSISGTANSAGAHQHKSSGAFGGTNTSIFPNGYTAISNLSAGIMST 241 T + + +H+H A ++ + ++ + Sbjct: 335 QDTTLTIDQIPSHSHIEGMRMQITQAAEYG-----------LVTQKANQLNRYNLNNQII 383 Query: 242 TSGSGQTRNAGKTSSDGAHTHSLSGTAASAGAHAHTVGIGA 282 + G H+ T+ + AH HTV + Sbjct: 384 HDSNEDYSLHKTNEIGGGKAHN-HQTSVNETAHQHTVSVSP 423 >UniRef50_C1D6M7 Phage-related protein n=1 Tax=Laribacter hongkongensis HLHK9 RepID=C1D6M7_LARHH Length = 257 Score = 44.2 bits (102), Expect = 0.007, Method: Composition-based stats. 
Identities = 34/166 (20%), Positives = 52/166 (31%), Gaps = 11/166 (6%) Query: 66 RRDTTDANWSPWAQLYTSAHPPAEFYPVGAPIPWPSDTVPSGYALMQGQTFDKSAYPKLA 125 D+ P + ++ G + DT G G ++AY L Sbjct: 78 SADSAGLEGHPASHFAKASDLAHPGSRPGRLVVTFRDTPEPGTLACNGAAVSRTAYAALF 137 Query: 126 VAYP----------SGVIPDMRGWTIKGKPASGRAVLSQEQDGIKSHTHSASASSTDLGT 175 A + +P++ A+G V S + SHTH+ +A ST Sbjct: 138 AAIGTKYGAGDGSSTFNLPNIPDGHA-LLAANGSVVGSLSVGEVISHTHTGTALSTGSEH 196 Query: 176 ETTSSFDYGTKSTNNTGAHTHSISGTANSAGAHQHKSSGAFGGTNT 221 SS + N +G+ AG H H S G T Sbjct: 197 IHFSSSAAYVTNGGNMAGGAGVGTGSQIPAGTHTHALSINATGATT 242 >UniRef50_Q8PR97 Microcystin dependent protein n=1 Tax=Xanthomonas axonopodis pv. citri RepID=Q8PR97_XANAC Length = 183 Score = 43.8 bits (101), Expect = 0.007, Method: Composition-based stats. Identities = 29/170 (17%), Positives = 52/170 (30%), Gaps = 31/170 (18%) Query: 94 GAPIPWPSDTVPSGYALMQGQTFDKSAYPKLAVAYP---------SGVIPDMRGWTIKGK 144 G I + + P G+A G+ + Y L + +PD+RG + Sbjct: 6 GQIILFAGNYEPQGWAFCDGRQLQINTYMALYSLIGTTYGGDGRTTFNLPDLRGRVAISQ 65 Query: 145 PAS------------------GRAVLSQEQDGIKSHTHSASASSTDLGTETTSSFDYGTK 186 G +S + + +H H+ A + +S G Sbjct: 66 GQGIARAPTPQLTARVLGQQFGTETVSLQLAEMPAHRHTLQA----FNSPASSLTPTGQL 121 Query: 187 STNNTGAHTHSISGTANSAGAHQHKSSGAFGGTNTSIFPNGYTAISNLSA 236 G +T ++ A S A ++ A S + + A LS Sbjct: 122 PAVTQGGNTGYLTPPAGSTPAASTLATNAVNVAGASQPHDNHMATQTLSY 171 >UniRef50_A6EAC0 Microcystin-dependent protein n=1 Tax=Pedobacter sp. BAL39 RepID=A6EAC0_9SPHI Length = 185 Score = 43.8 bits (101), Expect = 0.008, Method: Composition-based stats. 
Identities = 24/171 (14%), Positives = 51/171 (29%), Gaps = 12/171 (7%) Query: 93 VGAPIPWPSDTVPSGYALMQGQTFDKSAYPKLAVAYPSGVIPDMRGWTIKGKPASGRAVL 152 +G P+ D +P G+ G T+ + Y L + + + Sbjct: 5 IGEVRPFAFDWIPDGWLACNGATYPLAQYQALYSVIGTVYGGTLGQNFKVPNLQGEAIIG 64 Query: 153 SQEQDGIKSHTHSASASSTDLGTETTSSFDYGTKSTNNTGAHTHSISGTANSAGAHQHKS 212 + + ++T + + + G + N H H +G + G + + Sbjct: 65 AGQGPTTSAYTLAQTGGTEKAG-----------LTVNQIPNHDHVFNGAIGATGFRTNTA 113 Query: 213 SGAFGGTNTSIFPNGYTAISNLSAGIMSTTSGSGQTRNAGKTSSDGAHTHS 263 TN G T ++G + + + T + G H Sbjct: 114 GNTSYLTNFGYGGAGATTF-TSASGYVPPGTPDTLLNPSSVTQTGGGGAHE 163 >UniRef50_A6N211 Probable tail fiber protein n=1 Tax=Microbacterium phage Min1 RepID=A6N211_9CAUD Length = 250 Score = 43.8 bits (101), Expect = 0.008, Method: Composition-based stats. Identities = 11/43 (25%), Positives = 20/43 (46%) Query: 90 FYPVGAPIPWPSDTVPSGYALMQGQTFDKSAYPKLAVAYPSGV 132 PVG + W + P G+ L+ G+ ++A+P L + Sbjct: 135 SVPVGTVMMWLAGPAPDGWVLLDGRAVSRAAFPTLFTLIGTTF 177 >UniRef50_Q2W7B1 Microcystin-dependent protein n=3 Tax=Proteobacteria RepID=Q2W7B1_MAGSA Length = 177 Score = 43.8 bits (101), Expect = 0.009, Method: Composition-based stats. Identities = 30/172 (17%), Positives = 51/172 (29%), Gaps = 23/172 (13%) Query: 94 GAPIPWPSDTVPSGYALMQGQTFDKSAYPKLAVAYP---------SGVIPDMRGWTIKG- 143 G +P + P+G+ G++ SA L + +PD+RG TI G Sbjct: 7 GEIRLFPLNWAPTGWLPCDGRSMQVSANAALFSLLGNQFGGDAKTTFFLPDLRGRTIMGQ 66 Query: 144 ------------KPASGRAVLSQEQDGIKSHTHSASASSTDLGTETTSSFDYGTKSTNNT 191 G ++ + SH H +G + +Y N Sbjct: 67 GKNPVTGVSYVTGAYGGTESVTLTTAQLPSHQHQV-VGDQTVGATNPADDNYLAVPIYNG 125 Query: 192 GAHTHSISGTANSAGAHQHKSSGAFGGTNTSIFPNGYTAISNLSAGIMSTTS 243 + SGT S+ G +T+ P ++G Sbjct: 126 TQKSLYNSGTKPVPLNPASVSTVGGGAAHTNTQPYLALGYCICTSGYYPPRP 177 >UniRef50_B1J270 Tail Collar domain protein n=8 Tax=Bacteria RepID=B1J270_PSEPW Length = 195 Score = 43.5 bits (100), Expect = 0.011, Method: Composition-based stats. 
Identities = 31/177 (17%), Positives = 49/177 (27%), Gaps = 16/177 (9%) Query: 94 GAPIPWPSDTVPSGYALMQGQTFDKSAYPKLAVAYP---------SGVIPDMRGWTIKGK 144 G + + P G+A QGQ + L + +PD+RG G Sbjct: 6 GEIKMFAGNFAPRGWAFCQGQLMSIAQNNALFALLGTTYGGDGKTTFALPDLRGRGPIGF 65 Query: 145 PASGR--AVLSQEQDGIKSHTHSASASSTDLGTETTSSFDYGTKSTNNTGAHTHSISG-- 200 V+ E G+ T S + + SG Sbjct: 66 GTGPGLADVVQGEAGGVNDVTLLQSNMPMQQAVIPAQTVSVAIPAVEGDANAAAPSSGNV 125 Query: 201 ---TANSAGAHQHKSSGAFGGTNTSIFPNGYTAISNLSAGIMSTTSGSGQTRNAGKT 254 + +S+GA + NTS+ P T ++ S Q G Sbjct: 126 LAKSFDSSGAGAAADIYSSDVPNTSLKPFNVTVPQTSVNLGGASLPVSVQNPYLGMN 182 >UniRef50_Q6J802 Pas29 n=1 Tax=Actinoplanes phage phiAsp2 RepID=Q6J802_9CAUD Length = 936 Score = 43.1 bits (99), Expect = 0.013, Method: Composition-based stats. Identities = 42/210 (20%), Positives = 64/210 (30%), Gaps = 14/210 (6%) Query: 88 AEFYPVGAPIPWPSD--TVPSGYALMQGQTFDKSAYPKLAVAYPSGVIPDMRGWTIKGKP 145 + P + WP T+PSG+ + L YP G + G Sbjct: 371 PDTIPSDMILAWPGTVGTIPSGW----------TRVTALDGFYPRGSNGTGVPTGVTGGA 420 Query: 146 ASGRAVLSQEQDGIKSHTHSASASSTDLGTETTSSFDYGTKSTNNTGAHTHSISGTANSA 205 + I +H+HS S+ + TTS+ G HTH+ + S Sbjct: 421 TTHSHTTVNHVHTIGAHSHSVGGSTGSSNSNTTSARFNGASQAQADQPHTHTRPSSTGSR 480 Query: 206 GAHQHKS--SGAFGGTNTSIFPNGYTAISNLSAGIMSTTSGSGQTRNAGKTSSDGAHTHS 263 A G N + + S+ + T T ++D Sbjct: 481 AAQNSGGSAPGTNAANNIPLTRDVIWIASDGAQANYPTGILGWATEAVSGWTNDADSAGR 540 Query: 264 LSGTAASAGAHAHTVGIGAHTHSVAIGSHG 293 AA G G HTH+V +HG Sbjct: 541 FLKGAAGGGNGGANTGAATHTHAVNAHTHG 570 >UniRef50_C6X0H2 Phage tail collar domain protein n=1 Tax=Flavobacteriaceae bacterium 3519-10 RepID=C6X0H2_FLAB3 Length = 193 Score = 42.7 bits (98), Expect = 0.016, Method: Composition-based stats. 
Identities = 22/151 (14%), Positives = 45/151 (29%), Gaps = 10/151 (6%) Query: 93 VGAPIPWPSDTVPSGYALMQGQTFDKSAYPKLAVAYP---------SGVIPDMRGWTIKG 143 VG + P + P G+ G S L + +PDMRG + Sbjct: 26 VGQIMFVPYNFSPQGWHNCDGSLLSISENEVLFTLIGTTYGGDGQTTFAVPDMRGRVMI- 84 Query: 144 KPASGRAVLSQEQDGIKSHTHSASASSTDLGTETTSSFDYGTKSTNNTGAHTHSISGTAN 203 G + S + + T + G ++ + +H + +G + Sbjct: 85 DDGQGNTLSSFTLGQMSGTETVQLTQAQMPAHSHTVNAVSGAGTSESPTSHLPANTGILD 144 Query: 204 SAGAHQHKSSGAFGGTNTSIFPNGYTAISNL 234 ++Q +S G ++ + Sbjct: 145 KEYSNQPLTSTMKMGMLSAAGGSQPHNNIQP 175 >UniRef50_UPI000186F374 low-density lipoprotein receptor, putative n=1 Tax=Pediculus humanus corporis RepID=UPI000186F374 Length = 2887 Score = 42.7 bits (98), Expect = 0.017, Method: Composition-based stats. Identities = 21/171 (12%), Positives = 36/171 (21%), Gaps = 17/171 (9%) Query: 142 KGKPASGRAVLSQEQDGIKSHTHSASASSTDLGTETTSSFDYGTKSTNN------TGAHT 195 +G G SH H + + D+ T ++ H Sbjct: 421 QGTDEDNNGDHHMHDHGTGSHDHIHGQGTDEDNNGDHHMHDHSTMEHDHIHGQGINEDHN 480 Query: 196 HSISGTANSAGAHQHKSSGAFGGTNTSIFPNGYTAISNLSAGIMSTTSGSGQTRNAGKT- 254 + G H H N + + Sbjct: 481 GDHHMHDHGTGQHDHFQGQGMNEDNLGNGHVHDHGTDEHNHFHGQGMNEDNLGNGHVHVH 540 Query: 255 -SSDGAHTHSLSGTAASAGAH-AHTVGIGAHTHSVAIGS--------HGHT 295 + + H H + G H H + H + + H HT Sbjct: 541 GTDEHNHFHGQGMNEDNQGDHRMHYQSVDEQGHILVQETNEDNNGVQHSHT 591 >UniRef50_A9AVC5 Tail Collar domain protein n=1 Tax=Herpetosiphon aurantiacus ATCC 23779 RepID=A9AVC5_HERA2 Length = 934 Score = 42.7 bits (98), Expect = 0.017, Method: Composition-based stats. 
Identities = 35/190 (18%), Positives = 51/190 (26%), Gaps = 54/190 (28%) Query: 89 EFYPVGAPIPWPSDTV--PSGYALMQGQTFDKSAYPKLAVAYPSGVIPDMRGWTIKGKPA 146 P G W P G+ L GQ PD+R + G Sbjct: 782 SSIPSGTINMWSGADNALPGGWLLCNGQ----------------NGTPDLRNRFVVG--- 822 Query: 147 SGRAVLSQEQDGIKSHTHSASASSTDLGTETTSSFDYGTKSTNNTGAHTHSISGTANSAG 206 + TT D T + N +H H + + ++ G Sbjct: 823 ----------------------AGAAYPVGTTGGADSVTLAVNQMPSHNH--AASTSNDG 858 Query: 207 AHQHKSSGAFGGTNTSIFPNGYTAISNLSAGIMSTTSGSGQTRNAGKTSSDGAHTHSLSG 266 H H GG + L + + KT DG H+HS++ Sbjct: 859 QHNHTLYFDTGGGGNGPGGDMAKTNDGL--------QKNVIANFSVKTDKDGNHSHSVTI 910 Query: 267 TAASAGAHAH 276 G AH Sbjct: 911 QNNG-GNQAH 919 >UniRef50_B9M3Z7 Tail Collar domain protein n=1 Tax=Geobacter sp. FRC-32 RepID=B9M3Z7_GEOSF Length = 173 Score = 42.7 bits (98), Expect = 0.019, Method: Composition-based stats. Identities = 19/116 (16%), Positives = 36/116 (31%), Gaps = 21/116 (18%) Query: 93 VGAPIPWPSDTVPSGYALMQGQTFDKSAYPKLAVAYPS---------GVIPDMRGWTIK- 142 +G + + P +AL G T S Y L + +PD RG Sbjct: 6 IGEIRMFGGNFAPVDWALCDGSTLQISQYDVLYAVIGTYFGGDGITNFKLPDFRGRIPVH 65 Query: 143 ---GKPASGRAVL--------SQEQDGIKSHTHSASASSTDLGTETTSSFDYGTKS 187 G+ + R + + + I +H H S + ++ + + Sbjct: 66 MGTGQGLTPRGIGNAFGTEQETLQVAHIPAHNHVVSVGANATTAAPAGNYLGNSSN 121 >UniRef50_C6X0H3 Microcystin dependent protein n=1 Tax=Flavobacteriaceae bacterium 3519-10 RepID=C6X0H3_FLAB3 Length = 188 Score = 42.3 bits (97), Expect = 0.021, Method: Composition-based stats. 
Identities = 24/167 (14%), Positives = 50/167 (29%), Gaps = 29/167 (17%) Query: 94 GAPIPWPSDTVPSGYALMQGQTFDKSAYPKLAVAYP---------SGVIPDMRGWTIKGK 144 G P G+A GQ S L + +PDMRG + Sbjct: 42 GQIAFVAFTFAPKGWAECNGQLLPISQNTALFSLLGTTYGGNGQTTFALPDMRGRVL--- 98 Query: 145 PASGRAVLSQEQDGIKSHTHSASASSTDLGTETTSSFDYGTKSTNNTGAHTHSISGTANS 204 + + + +G+ ++ + + + T + H H+++ + Sbjct: 99 ------IHNGQGNGLSNYELGQTGGTENH-----------TLTIAEMPQHIHNVNAVSAE 141 Query: 205 AGAHQHKSSGAFGGTNTSIFPNGYTAISNLSAGIMSTTSGSGQTRNA 251 + + TA + ++ G++S GS N Sbjct: 142 GNQNVPTGNLPANTKALDKEYADSTANTTMNLGMISPAGGSQPHENR 188 >UniRef50_D1SWB1 Tail Collar domain protein n=1 Tax=Acidovorax avenae subsp. avenae ATCC 19860 RepID=D1SWB1_9BURK Length = 204 Score = 42.3 bits (97), Expect = 0.022, Method: Composition-based stats. Identities = 22/123 (17%), Positives = 39/123 (31%), Gaps = 10/123 (8%) Query: 94 GAPIPWPSDTVPSGYALMQGQTFDKSAYPKLAVAYP---------SGVIPDMRGWTIKGK 144 G I W + +P G+A+ G L + +PD+R G Sbjct: 6 GQIILWATPWIPRGWAICDGTLLSIQQNAALFSLIGTAYGGNGVSTFALPDLRNRVPVGS 65 Query: 145 -PASGRAVLSQEQDGIKSHTHSASASSTDLGTETTSSFDYGTKSTNNTGAHTHSISGTAN 203 A+ A+ S + + + +AS T ++ G S + A Sbjct: 66 QNAANVAITSGAATASTTASVTLAASQLPPHTHAVAAAGNGKLSVAIPANAGAAADTNAP 125 Query: 204 SAG 206 + G Sbjct: 126 ANG 128 >UniRef50_C5IHQ0 Gp58 n=1 Tax=Burkholderia phage BcepIL02 RepID=C5IHQ0_9CAUD Length = 316 Score = 41.9 bits (96), Expect = 0.028, Method: Composition-based stats. 
Identities = 40/260 (15%), Positives = 65/260 (25%), Gaps = 54/260 (20%) Query: 64 RSRRDTTDANWSPWAQLYTSAHPPAEFYPVGAPIPWPSDTVPS-GYALMQGQTFDKSAYP 122 S AN L + PVG W SD P GY + GQ Sbjct: 99 TSDVTPASANLDRTINLRSLYQILDLLEPVGTVKYWDSDDPPPPGYFVCNGQ-------- 150 Query: 123 KLAVAYPSGVIPDMRGWTIKGKPASGRAVLSQEQDGIKSHTHSASASSTDLGTETTSSFD 182 PD R I G AS + + + + + + D Sbjct: 151 --------NGTPDWRDRFIVGAGASYARRATGGANTV-------TLGPEHMPVHSHGVRD 195 Query: 183 YGTKSTNNTGAHTHSISGTANSAGAHQHKSSGAFGGTNTSIFPNGYTAISNLSAGIMSTT 242 G H H ++ G + + Y S S + Sbjct: 196 PGHAHGVADPGHNHYVN----------DPGHNHNNGIFSRLLRPPYPGSITGSDTAGSGS 245 Query: 243 SGSGQTRNAGKTSSDGAHTHSLSGTAASAGAHAHTVGIGAHTHSVAIGSHGHTITVNAAG 302 + ++ G + + ++I G I ++ AG Sbjct: 246 EQAVGGGDSADIVPSGT-----------------GIWLNGSGTGISIYGSGTGIWLDNAG 288 Query: 303 N---AENTVKNIAFNYIVRL 319 EN +A I ++ Sbjct: 289 GGQTHENRPPYVAIPIIRKM 308 >UniRef50_B4D821 Tail Collar domain protein n=3 Tax=Bacteria RepID=B4D821_9BACT Length = 179 Score = 41.9 bits (96), Expect = 0.030, Method: Composition-based stats. Identities = 22/147 (14%), Positives = 42/147 (28%), Gaps = 2/147 (1%) Query: 99 WPSDTVPSGYALMQGQTFDKSAYPKLAVAYPSGVIPDMRGWTIKGKPASGRAVLSQEQDG 158 +P + P+G+A GQ S L + D + + + Sbjct: 12 FPFNFAPTGWAFCDGQILPLSQNTALFSLLGTTYGGDGKSNFALPNMQGNAPMHPGQGPS 71 Query: 159 IKSHTHSASASSTDLGTETT--SSFDYGTKSTNNTGAHTHSISGTANSAGAHQHKSSGAF 216 + H + S + + S ++G K N AN+ G ++ A Sbjct: 72 LSLHDLGETGGSDTVSLLESEIPSHNHGMKVRNLAPPSVLPAPAPANAFGRSNGGAAYAT 131 Query: 217 GGTNTSIFPNGYTAISNLSAGIMSTTS 243 TS + + G + Sbjct: 132 YTAGTSNIGAMDPRVIAPAGGDQPHNN 158 >UniRef50_UPI0001A44BB4 microcystin dependent protein n=1 Tax=Pectobacterium carotovorum subsp. brasiliensis PBR1692 RepID=UPI0001A44BB4 Length = 269 Score = 41.5 bits (95), Expect = 0.038, Method: Composition-based stats. 
Identities = 37/184 (20%), Positives = 59/184 (32%), Gaps = 17/184 (9%) Query: 93 VGAPIPWPSDTVPSGYALMQGQTFDKSAYPKLAVAYP----------SGVIPDMRGWTIK 142 +G+ + PSGY GQ+ S Y L + +PD+RG +I Sbjct: 37 IGSVCYMVTSYCPSGYLPAAGQSVSISTYQALYALIGNIWGGSPQTNNFTLPDLRGRSIV 96 Query: 143 GKPASGRAVLSQEQDGIKSHTHSASASSTDLGTETTSSFDYGT-------KSTNNTGAHT 195 G L Q + + T + SAS+ T T+ T + N T T Sbjct: 97 GAGQGTGLSLIQRGQSLGAETATLSASNVAPHTHPTAQSLTTTFDVLVPATTGNLTVGAT 156 Query: 196 HSISGTANSAGAHQHKSSGAFGGTNTSIFPNGYTAISNLSAGIMSTTSGSGQTRNAGKTS 255 I+ T + + F ++ P G + G + T+ Sbjct: 157 LPIATTTPATTGTTPANGANFLTALSATVPVGAATQNATFKGPYQAAKPANTAYLIADTT 216 Query: 256 SDGA 259 GA Sbjct: 217 VSGA 220 >UniRef50_B6IWH6 Putative uncharacterized protein n=2 Tax=Bacteria RepID=B6IWH6_RHOCS Length = 206 Score = 41.5 bits (95), Expect = 0.039, Method: Composition-based stats. Identities = 33/180 (18%), Positives = 65/180 (36%), Gaps = 19/180 (10%) Query: 93 VGAPIPWPSDTVPSGYALMQGQTF----DKSAYPKLAVAYP-----SGVIPDMRGWTIKG 143 +G +PW PS ++L GQ +++ + + Y + +PD+RG G Sbjct: 6 IGTIMPWAVSWAPSNWSLCMGQILPVNGNQAVFALIGATYGGNGSTNFALPDLRGRVPVG 65 Query: 144 KPASGRAVLSQEQDGIKSHTHSASASSTDLG-------TETTSSFDYGTKSTNNTGAHTH 196 +G+ S + S + T ++ G + A+T Sbjct: 66 ---AGQFPGSGGIPPTTNRVIGQSGGQEQVNLTQSQMPVHTHAAQATGGGGSVTLSAYTG 122 Query: 197 SISGTANSAGAHQHKSSGAFGGTNTSIFPNGYTAISNLSAGIMSTTSGSGQTRNAGKTSS 256 +A + G + ++G FGG ++ G + + + + S NAG T+ Sbjct: 123 PADSSAPAVGKYLTAAAGDFGGDAVTVKIYGPASGTAVPIASGTVQPPSITVGNAGGTAP 182 >UniRef50_B2JL06 Tail Collar domain protein n=2 Tax=Burkholderia RepID=B2JL06_BURP8 Length = 261 Score = 41.5 bits (95), Expect = 0.041, Method: Composition-based stats. 
Identities = 30/198 (15%), Positives = 49/198 (24%), Gaps = 14/198 (7%) Query: 65 SRRDTTDANWSPWAQLYTSAHPPAEFYPVGAPIPW--PSDTVPSGYALMQGQTFDKSAYP 122 R D W L P G I W +P+G+A G + Sbjct: 57 KHRTNADDGWIDVGPLDDILGDFRSASPKGTIIMWWGDGTKIPTGWAKCDGTQGTPNL-- 114 Query: 123 KLAVAYPSGVIPDMRGWTIKGKPASGRAVLSQEQDGIKSHTHSASASSTDLGTETTSSFD 182 + +P G T G ++ + H H S G Sbjct: 115 -------TDKVPVCAGGTYASGATGGANTVTLSASQMPVHAHGISDPGHGHGVADGGHNH 167 Query: 183 YGTKSTNNTGAHTHSISGTANSAGAHQHKSSGAFGGT---NTSIFPNGYTAISNLSAGIM 239 + + + G S +G + N + G + AGI Sbjct: 168 SVWDNGHQHTLPNLGSVQAGSDNGGASSPVSTGYGSSRYQNPTDAGGGNHGNNASGAGIG 227 Query: 240 STTSGSGQTRNAGKTSSD 257 +G+G + Sbjct: 228 IYGNGTGIGIQNAGGGAA 245 >UniRef50_D2QTE9 Tail Collar domain protein n=1 Tax=Spirosoma linguale DSM 74 RepID=D2QTE9_9SPHI Length = 172 Score = 41.5 bits (95), Expect = 0.041, Method: Composition-based stats. Identities = 31/171 (18%), Positives = 49/171 (28%), Gaps = 25/171 (14%) Query: 93 VGAPIPWPSDTVPSGYALMQGQTFDKSAYPKLAVAY---------PSGVIPDMRGWTIK- 142 +G I + + GY GQ D S Y L + +PD+RG Sbjct: 5 IGQIILFAGNYEIRGYVFCNGQLLDISKYTALYSLLGTTYGGNGTTTFGLPDLRGRMPIH 64 Query: 143 -----GKPA------SGRAVLSQEQDGIKSHTHSASASSTDLGTETTSSFDYGTKSTNNT 191 GK + SG + D + +H H+ +A T+S G N Sbjct: 65 FGQEPGKRSYVLGQRSGSYETTLTVDNLPAHNHALNA----FSETGTASAPAGALLANTG 120 Query: 192 GAHTHSISGTANSAGAHQHKSSGAFGGTNTSIFPNGYTAISNLSAGIMSTT 242 T + + + G ++ P GI Sbjct: 121 LGDTEYLPDGTLVQMSTKAIGKTGNGRPVDTMPPYLALNYQIALEGIYPQR 171 >UniRef50_A5GA41 Phage Tail Collar domain protein n=1 Tax=Geobacter uraniireducens Rf4 RepID=A5GA41_GEOUR Length = 205 Score = 41.1 bits (94), Expect = 0.049, Method: Composition-based stats. 
Identities = 16/131 (12%), Positives = 34/131 (25%), Gaps = 11/131 (8%) Query: 94 GAPIPWPSDTVPSGYALMQGQTFDKSAYPKLAVAYPSGVIPDMRGWTIKGKPASGRAVLS 153 G + D P + G S Y L + D + Sbjct: 32 GEIRMFGGDYAPENWHFCDGTLLPISGYDALYSLIGTAYGGDGINNFALPDLRGRLPIGQ 91 Query: 154 QEQDGIKSHTHSASASSTDLGTETTSSFDYGTKSTNNTGAHTHSISGTANSAGAHQHKSS 213 + + +H + +G + T AHTH+++ + + ++ Sbjct: 92 GQGTDLTNHPVGEKNGTETVG-----------LTLAQTPAHTHTVNAASGTGTQPSPENG 140 Query: 214 GAFGGTNTSIF 224 + F Sbjct: 141 VWASLAAVNQF 151 >UniRef50_B9Z2I2 Tail Collar domain protein n=2 Tax=Chromobacterium group RepID=B9Z2I2_9NEIS Length = 195 Score = 41.1 bits (94), Expect = 0.055, Method: Composition-based stats. Identities = 29/168 (17%), Positives = 49/168 (29%), Gaps = 10/168 (5%) Query: 94 GAPIPWPSDTVPSGYALMQGQTFDKSAYPKLAVAYPSGVIPDMRGWTIKGKPASGRAVLS 153 G + + P +AL QGQT+ S Y L + G AS L Sbjct: 6 GTITVFGFNYAPQDWALCQGQTYQVSQYEALYSLLGTLY----------GGTASQNFKLP 55 Query: 154 QEQDGIKSHTHSASASSTDLGTETTSSFDYGTKSTNNTGAHTHSISGTANSAGAHQHKSS 213 + T A + + + HS + T+ + H + Sbjct: 56 NLTGRMPIGTGPAQPTYNLPAYNPGQFGGTQNVALSVANLPAHSHTATSMAVTFHAAGTP 115 Query: 214 GAFGGTNTSIFPNGYTAISNLSAGIMSTTSGSGQTRNAGKTSSDGAHT 261 +T N Y S G+ + S + + + S +T Sbjct: 116 TPATPASTPSASNPYLGASGGGTGLANIWSSALNNPVSVQGLSATGNT 163 >UniRef50_C5JB62 Putative phage tail protein n=1 Tax=uncultured bacterium RepID=C5JB62_9BACT Length = 260 Score = 40.8 bits (93), Expect = 0.063, Method: Composition-based stats. 
Identities = 32/186 (17%), Positives = 58/186 (31%), Gaps = 23/186 (12%) Query: 93 VGAPIPWPSDTVPSGYALMQGQTFDKSAYPKLAVAYP---------SGVIPDMRGWTIKG 143 +G + P GY QGQT S+Y L + + +PD+RG G Sbjct: 17 IGEICYFGMSYCPQGYLPAQGQTLAISSYQPLYSLFGTAYGGNGTSTFALPDLRGRMPVG 76 Query: 144 KP--------------ASGRAVLSQEQDGIKSHTHSASASSTDLGTETTSSFDYGTKSTN 189 + LS Q + HTH+A+ + T + T G ++ Sbjct: 77 TGQAPGMQNVNLAEQMGTQSVTLSTLQVPLPQHTHAAAFAPTTGQQQVTLPAIQGKGTSL 136 Query: 190 NTGAHTHSISGTANSAGAHQHKSSGAFGGTNTSIFPNGYTAISNLSAGIMSTTSGSGQTR 249 + ++ ++ + S + T I P + + + G Sbjct: 137 SGTGTVGVVAAAPDANSTNTPASGSNYSLTGAKITPGNLSGPYTTTQPGTGNAATVGNVA 196 Query: 250 NAGKTS 255 + S Sbjct: 197 VSVDAS 202 >UniRef50_C7BQB5 Putative uncharacterized protein n=1 Tax=Photorhabdus asymbiotica subsp. asymbiotica ATCC 43949 RepID=C7BQB5_PHOAA Length = 406 Score = 40.8 bits (93), Expect = 0.065, Method: Composition-based stats. Identities = 30/193 (15%), Positives = 51/193 (26%), Gaps = 36/193 (18%) Query: 84 AHPPAEFYPVGAPIPWPSDTVPSGYALMQGQTFDKSAYPKLAVAYPSGVIPDMRGWTIKG 143 + P + P G + + ++ P+G+A G + PD+R I Sbjct: 222 SIDPNKVLPRGMIVMFSGNSAPTGWAFCDG----------------NSGTPDLRSRFIMC 265 Query: 144 KPASGRAVLSQEQDGIKSHTHSASASSTDLGTETTSSFDYGTKSTNNTGAHTHSISGTAN 203 SS ++ + + T Sbjct: 266 GET----------------ISETGKSSNKASGSGNGKNFSRNTTSTTVSVNVTVQNTTLT 309 Query: 204 SAGAHQHKSSGAFGGTNTSIFPNGYTAISNLSAGIMSTTSGSGQTRNAGKTSSDGAHTHS 263 + +HK A NT F G T I + I +T+S + G H Sbjct: 310 ESQIPKHKHIEALPYYNTLGFAYGNTPIGSTKYQINNTSSSMFFWHPSPT----GNDYHP 365 Query: 264 LSGTAASAGAHAH 276 + H H Sbjct: 366 YTSEVGGGQGHNH 378 >UniRef50_C6C8S0 Tail Collar domain protein n=4 Tax=Dickeya RepID=C6C8S0_DICDC Length = 310 Score = 40.4 bits (92), Expect = 0.078, Method: Composition-based stats. 
Identities = 37/241 (15%), Positives = 64/241 (26%), Gaps = 33/241 (13%) Query: 93 VGAPIPWPSDTVPSG-YALMQGQTFDKSAYPKLAVAYPS---------GVIPDMRGWTIK 142 +G P+ Y G+T + S Y L + ++PD+RG T Sbjct: 37 IGGICYMAGTYCPADDYLPADGRTLNISDYQVLYAVIGTLYGGNASTNFMLPDLRGRTAI 96 Query: 143 G-----------KPASGRAVL------------SQEQDGIKSHTHSASASSTDLGTETTS 179 G P G+ + + + HTH A+ + + T Sbjct: 97 GAGPLNGSSPTYNPIPGQKIGQEVSNVVGTGSVTLTASQVPPHTHPATLTLNGVSGTTPV 156 Query: 180 SFDYGTKSTNNTGAHTHSISGTANSAGAHQHKSSGAFGGTNTSIFPNGYTAISNLSAGIM 239 + + ++ + S A+ K + + + S Sbjct: 157 ASGAVSLTSLSGSITNLPFSAVASLGVTGIAKIGSSTTTGRSVSLTDKALLTSVGGPSAQ 216 Query: 240 STTSGSGQTRNAGKTSSDGAHTHSLSGTAASAGAHAHTVGIGAHTHSVAIGSHGHTITVN 299 R G S A+ G + TV V + G TV Sbjct: 217 IYAPSGTNDRQVGPDGSVTGTASGTVSGTANGGQLSGTVSGNVSLPVVGAVTVGANATVP 276 Query: 300 A 300 A Sbjct: 277 A 277 >UniRef50_C2FWA0 Phage tail collar domain protein n=2 Tax=Sphingobacterium spiritivorum RepID=C2FWA0_9SPHI Length = 196 Score = 40.4 bits (92), Expect = 0.078, Method: Composition-based stats. 
Identities = 24/175 (13%), Positives = 52/175 (29%), Gaps = 21/175 (12%) Query: 99 WPSDTVPSGYALMQGQTFDKSAYPKLAVAYP---------SGVIPDMRGWTIKGKPAS-- 147 + + P+G+ L G+ + Y L + +PD+RG G Sbjct: 11 FAGNFAPAGWILCDGRLLSINNYQVLYTVIGTTYGGDGVNTFGVPDLRGRVPIGTGQGPG 70 Query: 148 ----------GRAVLSQEQDGIKSHTHSASASSTDLGTETTSSFDYGTKSTNNTGAHTHS 197 G ++ + HTH+A+ ++T++ S T TG+ Sbjct: 71 LTNVVLGQKIGTETVTLLPANLPVHTHTAAVNATNVPFAVKVSAAAATLHAAATGSQLGQ 130 Query: 198 ISGTANSAGAHQHKSSGAFGGTNTSIFPNGYTAISNLSAGIMSTTSGSGQTRNAG 252 + + + G ++ + + + + T N Sbjct: 131 PMTDSIPTLGYNAANPDKTMGDSSLNTSGLTVNTAMMGSSLPHENMQPFLTTNYI 185

  Database: uniref50.fasta
    Posted date: Mar 8, 2010 10:38 AM
  Number of letters in database: 1,040,396,356
  Number of sequences in database: 3,077,464

Lambda     K      H
   0.305    0.107    0.274

Lambda     K      H
   0.267   0.0328    0.140

Matrix: BLOSUM62
Gap Penalties: Existence: 11, Extension: 1
Number of Hits to DB: 1,501,737,288
Number of Sequences: 3077464
Number of extensions: 59061339
Number of successful extensions: 276527
Number of sequences better than 1.0e-01: 250
Number of HSP's better than 0.1 without gapping: 832
Number of HSP's successfully gapped in prelim test: 2110
Number of HSP's that attempted gapping in prelim test: 249454
Number of HSP's gapped (non-prelim): 18057
length of query: 320
length of database: 1,040,396,356
effective HSP length: 128
effective length of query: 192
effective length of database: 646,480,964
effective search space: 124124345088
effective search space used: 124124345088
T: 11
A: 40
X1: 16 ( 7.0 bits)
X2: 38 (14.6 bits)
X3: 64 (24.7 bits)
S1: 42 (21.7 bits)
S2: 92 (40.4 bits)
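The summary statistics that close this report can be cross-checked by hand. The sketch below is not part of the BLAST output; it assumes the usual relationships (effective length = raw length minus the effective HSP length, once per sequence, and the Karlin-Altschul conversion from raw score to bits) and that the second Lambda/K pair is the gapped one. The function names are hypothetical, chosen only for illustration.

```python
import math

# Values copied from the statistics block of this report.
QUERY_LEN = 320
DB_LETTERS = 1_040_396_356
DB_SEQS = 3_077_464
EFF_HSP_LEN = 128

# Lambda/K pairs as printed; the second pair is assumed to be the gapped one.
UNGAPPED = (0.305, 0.107)
GAPPED = (0.267, 0.0328)


def effective_lengths(query_len, db_letters, db_seqs, hsp_len):
    """Return (effective query length, effective database length)."""
    eff_query = query_len - hsp_len
    # The HSP length correction is applied once per database sequence.
    eff_db = db_letters - db_seqs * hsp_len
    return eff_query, eff_db


def bit_score(raw_score, lam, k):
    """Karlin-Altschul conversion: S' = (lambda * S - ln K) / ln 2."""
    return (lam * raw_score - math.log(k)) / math.log(2)


eff_query, eff_db = effective_lengths(QUERY_LEN, DB_LETTERS, DB_SEQS, EFF_HSP_LEN)
search_space = eff_query * eff_db

print(eff_query)      # 192
print(eff_db)         # 646480964
print(search_space)   # 124124345088

# S1 is reported with the ungapped parameters, S2 with the gapped ones.
print(round(bit_score(42, *UNGAPPED), 1))  # 21.7
print(round(bit_score(92, *GAPPED), 1))    # 40.4
```

The E-values printed for individual hits will not necessarily follow from these numbers alone, since the report states that composition-based statistics were applied to each alignment.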