BLASTP 2.2.22 [Sep-27-2009]

Reference: Altschul, Stephen F., Thomas L. Madden, Alejandro A. Schaffer, Jinghui Zhang, Zheng Zhang, Webb Miller, and David J. Lipman (1997), "Gapped BLAST and PSI-BLAST: a new generation of protein database search programs", Nucleic Acids Res. 25:3389-3402.

Reference for compositional score matrix adjustment: Altschul, Stephen F., John C. Wootton, E. Michael Gertz, Richa Agarwala, Aleksandr Morgulis, Alejandro A. Schaffer, and Yi-Kuo Yu (2005) "Protein database searches using compositionally adjusted substitution matrices", FEBS J. 272:5101-5109.

Reference for composition-based statistics starting in round 2: Schaffer, Alejandro A., L. Aravind, Thomas L. Madden, Sergei Shavirin, John L. Spouge, Yuri I. Wolf, Eugene V. Koonin, and Stephen F. Altschul (2001), "Improving the accuracy of PSI-BLAST protein database searches with composition-based statistics and other refinements", Nucleic Acids Res. 29:2994-3005.

Query= batch____
         (1120 letters)

Database: uniref50.fasta
           3,077,464 sequences; 1,040,396,356 total letters

Searching..................................................done

Results from round 1

                                                                 Score    E
Sequences producing significant alignments:                      (bits) Value

UniRef50_P76072 Side tail fiber protein homolog from lambdoid pr... 2240 0.0
UniRef50_B7L485 Putative tail fiber protein n=4 Tax=Escherichia ... 575 e-162
UniRef50_Q3YZL1 Phage protein-related n=42 Tax=Enterobacteriacea... 472 e-131
UniRef50_B7LN99 Putative tail fiber protein n=2 Tax=Enterobacter... 360 2e-97
UniRef50_B3XIH6 GpH n=7 Tax=Enterobacteriaceae RepID=B3XIH6_ECOLX 352 4e-95
UniRef50_B3X2T1 Side tail fiber protein n=1 Tax=Shigella dysente... 347 2e-93
UniRef50_P03764 Side tail fiber protein n=2 Tax=root RepID=STF_L... 337 2e-90
UniRef50_A8A0A4 L-shaped tail fiber protein n=12 Tax=root RepID=... 304 1e-80
UniRef50_A4TT73 Phage tail protein n=30 Tax=Enterobacteriaceae R... 302 5e-80
UniRef50_B5FQX9 Side tail fiber protein n=2 Tax=Salmonella enter...
277 2e-72 UniRef50_Q99362 Protein 37 (Fragment) n=1 Tax=Enterobacteria pha... 274 2e-71 UniRef50_B5YU13 Tail fiber protein n=140 Tax=root RepID=B5YU13_E... 265 6e-69 UniRef50_B7LKX7 Putative side tail phage protein n=2 Tax=Escheri... 254 1e-65 UniRef50_B5PP06 Side tail fiber protein n=2 Tax=Salmonella enter... 249 4e-64 UniRef50_C2DS71 Tail fiber protein n=10 Tax=Escherichia RepID=C2... 249 4e-64 UniRef50_B4T041 Gp19 n=1 Tax=Salmonella enterica subsp. enterica... 249 5e-64 UniRef50_C6UHV3 Predicted tail fiber protein n=22 Tax=Escherichi... 248 7e-64 UniRef50_B3I2W7 Phage Tail Collar Domain family n=1 Tax=Escheric... 246 4e-63 UniRef50_B4TP26 Side tail fiber protein n=43 Tax=Salmonella ente... 214 2e-53 UniRef50_C8U9W7 Probable tail fiber protein-like protein n=1 Tax... 212 6e-53 UniRef50_Q3ZL14 Tail fiber protein n=2 Tax=Enterobacteriaceae Re... 204 1e-50 UniRef50_A9DEM1 Tail protein 2 n=1 Tax=Yersinia phage PY100 RepI... 181 1e-43 UniRef50_Q6KGF6 Putative tail fiber protein GP37 n=2 Tax=unclass... 176 3e-42 UniRef50_Q38190 Gp37, tip of tail fiber (Fragment) n=5 Tax=Enter... 157 3e-36 UniRef50_D0ZBI4 Putative tail fiber protein n=1 Tax=Edwardsiella... 143 4e-32 UniRef50_D2TSH8 Phage tail fibre protein n=7 Tax=root RepID=D2TS... 134 3e-29 UniRef50_Q858V4 GpH n=9 Tax=root RepID=Q858V4_9CAUD 134 3e-29 UniRef50_D0IJ09 Tail fiber protein H putative n=1 Tax=Vibrio sp.... 133 3e-29 UniRef50_C4U3E2 Tail fiber protein (Fragment) n=1 Tax=Yersinia k... 126 5e-27 UniRef50_B7MWN9 Putative tail fiber protein (GpH) n=2 Tax=Escher... 120 4e-25 UniRef50_C5H7L2 Putative tail fiber protein GP37 n=3 Tax=unclass... 119 6e-25 UniRef50_C6DE08 Tail Collar domain protein n=1 Tax=Pectobacteriu... 118 1e-24 UniRef50_D2TJ16 Putative phage tail fibre protein n=1 Tax=Citrob... 117 3e-24 UniRef50_B6IAV4 Putative phage tail fiber protein n=1 Tax=Escher... 116 5e-24 UniRef50_A3GRX3 Tail fiber like-protein n=1 Tax=Vibrio cholerae ... 
115 1e-23 UniRef50_Q9LA62 ORF-401-like protein n=1 Tax=Enterobacterial pha... 113 5e-23 UniRef50_B7MW07 Putative tail fiber protein from prophage n=4 Ta... 111 1e-22 UniRef50_Q7Y3Z0 Tail fiber protein n=1 Tax=Yersinia phage PY54 R... 105 8e-21 UniRef50_A9Q1X5 Putative tail fiber protein n=1 Tax=Enterobacter... 82 1e-13 UniRef50_Q5GAE0 Putative uncharacterized protein n=3 Tax=Singapo... 82 2e-13 UniRef50_Q66BF2 Hypothetical phage protein n=1 Tax=Yersinia pseu... 75 1e-11 UniRef50_C9KJG4 Side tail fiber protein from lambdoid prophage R... 72 1e-10 UniRef50_B3YHG3 Tail fiber protein n=2 Tax=Salmonella enterica s... 71 2e-10 UniRef50_C3R3S9 Predicted protein n=1 Tax=Bacteroides sp. 2_2_4 ... 66 7e-09 UniRef50_C5DLU8 KLTH0G03696p n=1 Tax=Lachancea thermotolerans CB... 65 2e-08 UniRef50_D0Z3B3 Putative tail fiber protein n=1 Tax=Photobacteri... 65 2e-08 UniRef50_B5YYN6 Tail fiber protein n=60 Tax=root RepID=B5YYN6_ECO5E 62 2e-07 UniRef50_Q0I4L1 Putative uncharacterized protein n=1 Tax=Haemoph... 61 3e-07 UniRef50_C4Y8G3 Putative uncharacterized protein n=1 Tax=Clavisp... 60 3e-07 UniRef50_C5BA56 Putative uncharacterized protein n=1 Tax=Edwards... 60 4e-07 UniRef50_B2I4I4 Putative phage tail protein n=1 Tax=Enterobacter... 60 6e-07 UniRef50_A9WC61 Autotransporter-associated beta strand repeat pr... 59 1e-06 UniRef50_C3YC67 Putative uncharacterized protein n=2 Tax=Branchi... 57 4e-06 UniRef50_A9ITY7 Putative uncharacterized protein n=2 Tax=Bartone... 54 4e-05 UniRef50_Q0I488 Putative uncharacterized protein n=1 Tax=Haemoph... 54 5e-05 UniRef50_B0X1G5 Microtubule-associated protein futsch n=1 Tax=Cu... 51 3e-04 UniRef50_C3L3V8 Putative uncharacterized protein n=2 Tax=Candida... 49 0.001 UniRef50_P13390 L-shaped tail fiber protein n=4 Tax=Enterobacter... 48 0.002 UniRef50_B6K9X3 Putative uncharacterized protein n=1 Tax=Toxopla... 48 0.002 UniRef50_Q5ULL7 Orf97 n=1 Tax=Lactobacillus phage LP65 RepID=Q5U... 
47 0.005 UniRef50_Q9TYL3 Putative uncharacterized protein n=2 Tax=Caenorh... 47 0.005 UniRef50_B9Q2F9 Putative uncharacterized protein n=1 Tax=Toxopla... 45 0.011 >UniRef50_P76072 Side tail fiber protein homolog from lambdoid prophage Rac n=23 Tax=root RepID=STFR_ECOLI Length = 1120 Score = 2240 bits (5804), Expect = 0.0, Method: Compositional matrix adjust. Identities = 1120/1120 (100%), Positives = 1120/1120 (100%) Query: 1 MAVKISGVLKDGTGKPVQNCTIQLKAKRNSTTVVVNTLASENPDEAGRYSMDVEYGQYSV 60 MAVKISGVLKDGTGKPVQNCTIQLKAKRNSTTVVVNTLASENPDEAGRYSMDVEYGQYSV Sbjct: 1 MAVKISGVLKDGTGKPVQNCTIQLKAKRNSTTVVVNTLASENPDEAGRYSMDVEYGQYSV 60 Query: 61 ILLVEGFPPSHAGTITVYEDSQPGTLNDFLGAMTEDDARPEALRRFELMVEEVARNASAV 120 ILLVEGFPPSHAGTITVYEDSQPGTLNDFLGAMTEDDARPEALRRFELMVEEVARNASAV Sbjct: 61 ILLVEGFPPSHAGTITVYEDSQPGTLNDFLGAMTEDDARPEALRRFELMVEEVARNASAV 120 Query: 121 AQNTAAAKKSASDASTSAREAATHAADAADSARAASTSAGQAASSAQSASSSAGTASTKA 180 AQNTAAAKKSASDASTSAREAATHAADAADSARAASTSAGQAASSAQSASSSAGTASTKA Sbjct: 121 AQNTAAAKKSASDASTSAREAATHAADAADSARAASTSAGQAASSAQSASSSAGTASTKA 180 Query: 181 TEASKSAAAAESSKSAAATSAGAAKTSETNASASLQSAATSASTATTKASEAATSARDAA 240 TEASKSAAAAESSKSAAATSAGAAKTSETNASASLQSAATSASTATTKASEAATSARDAA Sbjct: 181 TEASKSAAAAESSKSAAATSAGAAKTSETNASASLQSAATSASTATTKASEAATSARDAA 240 Query: 241 ASKEAAKSSETNASSSASSAASSATAAGNSAKAAKTSETNARSSETAAGQSASAAAGSKT 300 ASKEAAKSSETNASSSASSAASSATAAGNSAKAAKTSETNARSSETAAGQSASAAAGSKT Sbjct: 241 ASKEAAKSSETNASSSASSAASSATAAGNSAKAAKTSETNARSSETAAGQSASAAAGSKT 300 Query: 301 AAASSASAASTSAGQASASATAAGKSAESAASSASTATTKAGEATEQASAAARSASAAKT 360 AAASSASAASTSAGQASASATAAGKSAESAASSASTATTKAGEATEQASAAARSASAAKT Sbjct: 301 AAASSASAASTSAGQASASATAAGKSAESAASSASTATTKAGEATEQASAAARSASAAKT 360 Query: 361 SETNAKASETSAESSKTAAASSASSAASSASSASASKDEATRQASAAKSSATTASTKATE 420 SETNAKASETSAESSKTAAASSASSAASSASSASASKDEATRQASAAKSSATTASTKATE Sbjct: 361 SETNAKASETSAESSKTAAASSASSAASSASSASASKDEATRQASAAKSSATTASTKATE 420 Query: 421 AAGSATAAAQSKSTAESAATRAETAAKRAEDIASAVALEDASTTKKGIVQLSSATNSTSE 480 
AAGSATAAAQSKSTAESAATRAETAAKRAEDIASAVALEDASTTKKGIVQLSSATNSTSE Sbjct: 421 AAGSATAAAQSKSTAESAATRAETAAKRAEDIASAVALEDASTTKKGIVQLSSATNSTSE 480 Query: 481 TLAATPKAVKSAYDNAEKRLQKDQNGADIPDKGCFLNNINAVSKTDFADKRGMRYVRVNA 540 TLAATPKAVKSAYDNAEKRLQKDQNGADIPDKGCFLNNINAVSKTDFADKRGMRYVRVNA Sbjct: 481 TLAATPKAVKSAYDNAEKRLQKDQNGADIPDKGCFLNNINAVSKTDFADKRGMRYVRVNA 540 Query: 541 PAGATSGKYYPVVVMRSAGSVSELASRVIITTATRTAGDPMNNCEFNGFVMPGGWTDRGR 600 PAGATSGKYYPVVVMRSAGSVSELASRVIITTATRTAGDPMNNCEFNGFVMPGGWTDRGR Sbjct: 541 PAGATSGKYYPVVVMRSAGSVSELASRVIITTATRTAGDPMNNCEFNGFVMPGGWTDRGR 600 Query: 601 YAYGMFWQYQNNERAIHSIMMSNKGDDLRSVFYVDGAAFPVFAFIEDGLSISAPGADLVV 660 YAYGMFWQYQNNERAIHSIMMSNKGDDLRSVFYVDGAAFPVFAFIEDGLSISAPGADLVV Sbjct: 601 YAYGMFWQYQNNERAIHSIMMSNKGDDLRSVFYVDGAAFPVFAFIEDGLSISAPGADLVV 660 Query: 661 NDTTYKFGATNPATECIAADVILDFKSGRGFYESHSLIVNDNLSCKKLFATDEIVARGGN 720 NDTTYKFGATNPATECIAADVILDFKSGRGFYESHSLIVNDNLSCKKLFATDEIVARGGN Sbjct: 661 NDTTYKFGATNPATECIAADVILDFKSGRGFYESHSLIVNDNLSCKKLFATDEIVARGGN 720 Query: 721 QIRMIGGEYGALWRNDGAKTYLLLTNQGDVYGGWNTLRPFAIDNATGELVIGTKLSASLN 780 QIRMIGGEYGALWRNDGAKTYLLLTNQGDVYGGWNTLRPFAIDNATGELVIGTKLSASLN Sbjct: 721 QIRMIGGEYGALWRNDGAKTYLLLTNQGDVYGGWNTLRPFAIDNATGELVIGTKLSASLN 780 Query: 781 GNALTATKLQTPRRVSGVEFDGSKDITLTAAHVAAFARRATDTYADADGGVPWNAESGAY 840 GNALTATKLQTPRRVSGVEFDGSKDITLTAAHVAAFARRATDTYADADGGVPWNAESGAY Sbjct: 781 GNALTATKLQTPRRVSGVEFDGSKDITLTAAHVAAFARRATDTYADADGGVPWNAESGAY 840 Query: 841 NVTRSGDSYILVNFYTGVGSCRTLQMKAHYRNGGLFYRSSRDGYGFEEDWAEVYTSKNLP 900 NVTRSGDSYILVNFYTGVGSCRTLQMKAHYRNGGLFYRSSRDGYGFEEDWAEVYTSKNLP Sbjct: 841 NVTRSGDSYILVNFYTGVGSCRTLQMKAHYRNGGLFYRSSRDGYGFEEDWAEVYTSKNLP 900 Query: 901 PESYPVGAPIPWPSDTVPSGYALMQGQAFDKSAYPKLAAAYPSGVIPDMRGWTIKGKPAS 960 PESYPVGAPIPWPSDTVPSGYALMQGQAFDKSAYPKLAAAYPSGVIPDMRGWTIKGKPAS Sbjct: 901 PESYPVGAPIPWPSDTVPSGYALMQGQAFDKSAYPKLAAAYPSGVIPDMRGWTIKGKPAS 960 Query: 961 GRAVLSQEQDGIKSHTHSASASSTDLGTKTTSSFDYGTKSTNNTGAHTHSVSGSTNSAGA 1020 GRAVLSQEQDGIKSHTHSASASSTDLGTKTTSSFDYGTKSTNNTGAHTHSVSGSTNSAGA Sbjct: 961 
GRAVLSQEQDGIKSHTHSASASSTDLGTKTTSSFDYGTKSTNNTGAHTHSVSGSTNSAGA 1020 Query: 1021 HTHSLANVNTASANSGAGSASTRLSVVHNQNYATSSAGAHTHSLSGTAASAGAHAHTVGI 1080 HTHSLANVNTASANSGAGSASTRLSVVHNQNYATSSAGAHTHSLSGTAASAGAHAHTVGI Sbjct: 1021 HTHSLANVNTASANSGAGSASTRLSVVHNQNYATSSAGAHTHSLSGTAASAGAHAHTVGI 1080 Query: 1081 GAHTHSVAIGSHGHTITVNAAGNAENTVKNIAFNYIVRLA 1120 GAHTHSVAIGSHGHTITVNAAGNAENTVKNIAFNYIVRLA Sbjct: 1081 GAHTHSVAIGSHGHTITVNAAGNAENTVKNIAFNYIVRLA 1120 >UniRef50_B7L485 Putative tail fiber protein n=4 Tax=Escherichia coli RepID=B7L485_ECO55 Length = 1056 Score = 575 bits (1483), Expect = e-162, Method: Compositional matrix adjust. Identities = 292/376 (77%), Positives = 310/376 (82%), Gaps = 25/376 (6%) Query: 766 TGELVIGTKLSASLNGNALTATKLQTPRRVSGVEFDGSKDITLTAAHVAAFARRATDTYA 825 +G L + ++ +L GNA TATKL +++GV+FDGS DI LT+ ++ AFARR+T YA Sbjct: 685 SGPLSVTDGITGALKGNADTATKLAAAPKINGVKFDGSADINLTSENIGAFARRSTGAYA 744 Query: 826 DADGGVPWNAESGAYNVTRSGDSYILVNFYTGVGSCRTLQMKAHYRNGGLFYRSSRDGYG 885 D+DG VPWNAESGAYNVTRSGDSYILVNFYTGVGSCRTLQMKAHYRN GLFYRSSRDGYG Sbjct: 745 DSDGAVPWNAESGAYNVTRSGDSYILVNFYTGVGSCRTLQMKAHYRNRGLFYRSSRDGYG 804 Query: 886 FEEDWAEVYTSKNLPPESYPVGAPIPWPSDTVPSGYALMQGQAFDKSAYPKLAAAYPSGV 945 FEEDWAEVYTSKNLPPESYPVGAPIPWPSDTVPSGYALMQGQ F+KSAYPKLAAAYPSGV Sbjct: 805 FEEDWAEVYTSKNLPPESYPVGAPIPWPSDTVPSGYALMQGQTFNKSAYPKLAAAYPSGV 864 Query: 946 IPDMRGWTIKGKPASGRAVLSQEQDGIKSHTHSASASSTDLGTKTTSSFDYGTKSTNNTG 1005 IPDMRGWTIKGKPASGRAVLSQEQDGIKSHTHSASASSTDLGTKTTSSFDYGTKSTNNTG Sbjct: 865 IPDMRGWTIKGKPASGRAVLSQEQDGIKSHTHSASASSTDLGTKTTSSFDYGTKSTNNTG 924 Query: 1006 AHTHSVSGSTNSAGAHTHSLANVNTASANSGAGSA-----STRLSVVHNQNY-------- 1052 AHTHS+SGST SAG HTH N G GSA + V N Y Sbjct: 925 AHTHSLSGSTGSAGVHTHG----NGIRWPGGGGSALAFYDGGGFTYVQNSQYQVSPGTSS 980 Query: 1053 --------ATSSAGAHTHSLSGTAASAGAHAHTVGIGAHTHSVAIGSHGHTITVNAAGNA 1104 T SAGAHTHSLSGTAAS+GAHAHTVGIGAHTHSVAIGSHGHTITVNAAGNA Sbjct: 981 YRSYYQRIQTQSAGAHTHSLSGTAASSGAHAHTVGIGAHTHSVAIGSHGHTITVNAAGNA 1040 Query: 1105 ENTVKNIAFNYIVRLA 1120 ENTVKNIAFNYIVRLA Sbjct: 
1041 ENTVKNIAFNYIVRLA 1056 Score = 405 bits (1040), Expect = e-111, Method: Compositional matrix adjust. Identities = 476/529 (89%), Positives = 494/529 (93%), Gaps = 6/529 (1%) Query: 1 MAVKISGVLKDGTGKPVQNCTIQLKAKRNSTTVVVNTLASENPDEAGRYSMDVEYGQYSV 60 M VKISGVLKDGTGKPVQNCTI LKA+R S+TVVVNT+ASENPDEAGRYSMDVE+GQYSV Sbjct: 1 MTVKISGVLKDGTGKPVQNCTIVLKARRTSSTVVVNTVASENPDEAGRYSMDVEHGQYSV 60 Query: 61 ILLVEGFPPSHAGTITVYEDSQPGTLNDFLGAMTEDDARPEALRRFELMVEEVARNASAV 120 LLVEGFPPSHAGTITVYE S+PGTLNDFLGAMTEDD RPEALRRFE MVEEV+RNASAV Sbjct: 61 TLLVEGFPPSHAGTITVYEGSRPGTLNDFLGAMTEDDVRPEALRRFEQMVEEVSRNASAV 120 Query: 121 AQNTAAAKKSASDASTSAREAATHAADAADSARAASTSAGQAASSAQSASSSAGTASTKA 180 AQNTAAAKKSASDAS SA EAATHA DAA SARAASTSAGQAASSAQSASSSAGTASTKA Sbjct: 121 AQNTAAAKKSASDASASASEAATHATDAAASARAASTSAGQAASSAQSASSSAGTASTKA 180 Query: 181 TEASKSAAAAESSKSAAATSAGAAKTSETNASASLQSAATSASTATTKASEAATSARDAA 240 EA+KSAAAAESSKSAAATSA AAKTSETNA+AS +SAATSASTATTKASEAATSAR AA Sbjct: 181 REAAKSAAAAESSKSAAATSASAAKTSETNAAASQKSAATSASTATTKASEAATSARGAA 240 Query: 241 ASKEAAKSSETNASSSASSAASSATAAGNSAKAAKTSETNARSSETAAGQSASAAAGSKT 300 ASKEAAKSSETNASSSASSAASSATAAGNSAKAAKTSETNARSSETAAGQSASAAAGSKT Sbjct: 241 ASKEAAKSSETNASSSASSAASSATAAGNSAKAAKTSETNARSSETAAGQSASAAAGSKT 300 Query: 301 AAASSASAASTSAGQASASATAAGKSAESAASSASTATTKAGEATEQASAAARSASAAKT 360 AAASSASAASTSAGQASASATAAGKSAESAASSASTATTKAG+ATEQASAAARSASAAKT Sbjct: 301 AAASSASAASTSAGQASASATAAGKSAESAASSASTATTKAGKATEQASAAARSASAAKT 360 Query: 361 SETNAKASETSAESSKTAAASSASSAASSASSASASKDEATRQASAAKSSATTASTKATE 420 SETNAKASETSAESSKTAAASSASSAASSASSASASKDEATRQASAAK SATTASTKATE Sbjct: 361 SETNAKASETSAESSKTAAASSASSAASSASSASASKDEATRQASAAKGSATTASTKATE 420 Query: 421 AAGSATAAAQSKSTAESAATRAETAAKRAEDIASAVALEDASTTKKGIVQLSSATNSTSE 480 AAGSATAAAQSKSTAESAATRAETAAKRAEDIASAVALEDASTTKKGIVQLSSATNSTSE Sbjct: 421 AAGSATAAAQSKSTAESAATRAETAAKRAEDIASAVALEDASTTKKGIVQLSSATNSTSE 480 Query: 481 TLAATPKAVKSAYDNAEKRLQKDQ--NG----ADIPDKGCFLNNINAVS 523 TLAATPKAVK+A DNA R+ ++ NG ADI + +N+V+ Sbjct: 481 
TLAATPKAVKAANDNANGRVPSNRKVNGKALTADITLTPKDIGTLNSVT 529 >UniRef50_Q3YZL1 Phage protein-related n=42 Tax=Enterobacteriaceae RepID=Q3YZL1_SHISS Length = 1029 Score = 472 bits (1214), Expect = e-131, Method: Compositional matrix adjust. Identities = 503/566 (88%), Positives = 509/566 (89%), Gaps = 44/566 (7%) Query: 1 MAVKISGVLKDGTGKPVQNCTIQLKAKRNSTTVVVNTLASENPDEAGRYSMDVEYGQYSV 60 MAVKISGVLKDGTGKPVQNCTIQLKAKRNSTTVVVNTLASENPDEAGRYSMDVEYGQYSV Sbjct: 3 MAVKISGVLKDGTGKPVQNCTIQLKAKRNSTTVVVNTLASENPDEAGRYSMDVEYGQYSV 62 Query: 61 ILLVEGFPPSHAGTITVYEDSQPGTLNDFLGAMTEDDARPEALRRFELMVEEVARNASAV 120 ILLVEGFPPSHAGTITVYEDS+PGTLNDFLGAMTEDDARPEALRRFELMVEEVARNASAV Sbjct: 63 ILLVEGFPPSHAGTITVYEDSRPGTLNDFLGAMTEDDARPEALRRFELMVEEVARNASAV 122 Query: 121 AQNTAAAKKSASDASTSAREAATHAADAADSARAASTSAGQAASSAQSASSSAGTASTKA 180 AQNTAAAKKSASDASTSAREAATHA DAADSARAASTSAGQAASSAQSASSSAGTASTKA Sbjct: 123 AQNTAAAKKSASDASTSAREAATHATDAADSARAASTSAGQAASSAQSASSSAGTASTKA 182 Query: 181 TEASKSAAAAESSKSAAATSAGAAKTSETNASASLQSAATSASTATTKASEAATSARDAA 240 TEASKSAAAAESSKSAAATSAGAAKT ETNA+AS QSAATSASTATTKASEAATSARDA+ Sbjct: 183 TEASKSAAAAESSKSAAATSAGAAKTLETNAAASQQSAATSASTATTKASEAATSARDAS 242 Query: 241 ASKEAAKSSETNASSSASSAASSATAAGNSAKAAKTSETNARSSETAAGQSASAAAGSKT 300 ASKEAAKSSETNASSSASSAASSATAA NSAKAAKTSETNARSSETAAGQSASAAAGSKT Sbjct: 243 ASKEAAKSSETNASSSASSAASSATAAANSAKAAKTSETNARSSETAAGQSASAAAGSKT 302 Query: 301 AAASSASAASTSAGQASASATAAGKSAESAASSASTATTKAGEATEQASAAARSASAAKT 360 AAASSASAASTSAGQASASATAAGKSAESAASSASTATTKAGEATEQA+AAARSASAAKT Sbjct: 303 AAASSASAASTSAGQASASATAAGKSAESAASSASTATTKAGEATEQATAAARSASAAKT 362 Query: 361 SETNAKASETSAESSKTAAASSASSAASSASSASASKDEATRQASAAKSSATTASTKATE 420 SETNAKASET AESSKTAAASSASSAASSASSASASKDEATRQASAAK SATTASTKATE Sbjct: 363 SETNAKASETRAESSKTAAASSASSAASSASSASASKDEATRQASAAKGSATTASTKATE 422 Query: 421 AAGSATAAAQSKSTAESAATRAETAAKRAEDIASAVALEDASTTKKGIVQLSSATNSTSE 480 AAGSATAAAQSKSTAESAATRAETAAKRAEDIASAVALEDASTTKKGIVQLSSATNSTSE Sbjct: 423 AAGSATAAAQSKSTAESAATRAETAAKRAEDIASAVALEDASTTKKGIVQLSSATNSTSE 
482 Query: 481 TLAATPKAVKSAY--------------------------------------------DNA 496 +LAATPKAVK+AY DNA Sbjct: 483 SLAATPKAVKAAYDLANGKYTAQDATTAQKGIIQLSSATNSTSETLAATPKAVKAANDNA 542 Query: 497 EKRLQKDQNGADIPDKGCFLNNINAV 522 EKRLQKDQNGADIP K F NI A Sbjct: 543 EKRLQKDQNGADIPGKDTFTKNIGAC 568 Score = 365 bits (938), Expect = 4e-99, Method: Compositional matrix adjust. Identities = 199/262 (75%), Positives = 211/262 (80%), Gaps = 17/262 (6%) Query: 876 FYRSSRDGYGFE-EDWAEVYTSKNLPPESYPVGAPIPWPSDTVPSGYALMQGQAFDKSAY 934 F RS RD WA++YTS + P E YPVGAPIPWPSDTVPSGYALMQGQ FDKSAY Sbjct: 768 FIRSRRDTTDANWSPWAQLYTSAHPPAEFYPVGAPIPWPSDTVPSGYALMQGQTFDKSAY 827 Query: 935 PKLAAAYPSGVIPDMRGWTIKGKPASGRAVLSQEQDGIKSHTHSASASSTDLGTKTTSSF 994 PKLAAAYPSGVIPDMRGWTIKGKPASGRAVLSQEQDGIKSHTHSASASSTDLGTKTTSSF Sbjct: 828 PKLAAAYPSGVIPDMRGWTIKGKPASGRAVLSQEQDGIKSHTHSASASSTDLGTKTTSSF 887 Query: 995 DYGTKSTNNTGAHTHSVSGSTNSAGAHTHSLANVNTASANSGAG---SASTRLSVVHNQN 1051 DYGTKSTNNTGAHTHS+SGST+SAGAH HS T S + G + ST++S + Sbjct: 888 DYGTKSTNNTGAHTHSLSGSTSSAGAHQHSQTGPRTNSGSQPTGMFPAGSTQVSGTNQVG 947 Query: 1052 YA-------------TSSAGAHTHSLSGTAASAGAHAHTVGIGAHTHSVAIGSHGHTITV 1098 + +SS G HTHSLSGTAASAGAHAHTVGIGAHTHSVAIGSHGHTITV Sbjct: 948 ISGSLTSGTSQWVGKSSSEGNHTHSLSGTAASAGAHAHTVGIGAHTHSVAIGSHGHTITV 1007 Query: 1099 NAAGNAENTVKNIAFNYIVRLA 1120 NAAGNAENTVKNIAFNYIVRLA Sbjct: 1008 NAAGNAENTVKNIAFNYIVRLA 1029 >UniRef50_B7LN99 Putative tail fiber protein n=2 Tax=Enterobacteriaceae RepID=B7LN99_ESCF3 Length = 593 Score = 360 bits (924), Expect = 2e-97, Method: Compositional matrix adjust. 
Identities = 216/361 (59%), Positives = 245/361 (67%), Gaps = 50/361 (13%) Query: 763 DNATGELVIGTKLSASLNGNALTATKLQTPRRVSGVEFDGSKDITLTAAHVAAFARRATD 822 DNA G + G K+ NG+AL S DI L+ V AF+ T Sbjct: 280 DNANGRVPSGRKV----NGHAL------------------SSDIKLSPEDVNAFSLGCTG 317 Query: 823 TYADADGGVPWNAESGAYNVTRSGDSYILVNFYTGVGSCRTLQMKAHYRNGGLFYRSSRD 882 Y +DGGVPWNA+SG YNV G SYI+ +F++GVGSCR+ Q++A Y+N GL+YRSSRD Sbjct: 318 QYPSSDGGVPWNAKSGLYNVMDGGASYIVAHFFSGVGSCRSFQLRADYKNRGLYYRSSRD 377 Query: 883 GYGFEEDWAEVYTSKNLPPESYPVGAPIPWPSDTVPSGYALMQGQAFDKSAYPKLAAAYP 942 GYGFE + P ++PVGAPI WPSD VP GYA+MQGQ FDK+AYP LAAAYP Sbjct: 378 GYGFERGFE--------PVNAFPVGAPIAWPSDIVPEGYAIMQGQTFDKAAYPLLAAAYP 429 Query: 943 SGVIPDMRGWTIKGKPASGRAVLSQEQDGIKSHTHSASASSTDLGTKTTSSFDYGTK--S 1000 SGVIPDMRGWTIKGKPASGRAVLSQEQDGIKSHTHSASASSTDLGTKTTSSFDYGTK S Sbjct: 430 SGVIPDMRGWTIKGKPASGRAVLSQEQDGIKSHTHSASASSTDLGTKTTSSFDYGTKTVS 489 Query: 1001 TNNTGAHTHSVSGSTNSAGAHTHSLANVNTASANSGAGSASTRLSV-VHNQNYATSSAGA 1059 T N G T TN+ GAHTH T G S + V V N +SS GA Sbjct: 490 TFNHGTKT------TNNTGAHTH------TVGGRYGGDSIGGKQRVQVSGTNQVSSSDGA 537 Query: 1060 HTHSLSGTAASAGAHAHTVGIGAHTHSVAIGSHGHTITVNAAGNAENTVKNIAFNYIVRL 1119 H H++ G H HTVGIGAH H+VA+G+HGHTITVNAAGNAENTVKNIAFNYIVRL Sbjct: 538 HAHTV-----DIGQHNHTVGIGAHAHTVALGAHGHTITVNAAGNAENTVKNIAFNYIVRL 592 Query: 1120 A 1120 A Sbjct: 593 A 593 Score = 65.1 bits (157), Expect = 1e-08, Method: Compositional matrix adjust. Identities = 41/69 (59%), Positives = 51/69 (73%), Gaps = 2/69 (2%) Query: 433 STAESAATRAETAAKRAEDIASA-VALEDASTTKKGIVQLSSATNSTSETLAATPKAVKS 491 ST+E+ A ++ A K A D+A +DA+T +KGIVQLSSATNS SETLAATPKAVK+ Sbjct: 219 STSETQAATSK-AVKTAYDLADGKYTAQDATTARKGIVQLSSATNSDSETLAATPKAVKA 277 Query: 492 AYDNAEKRL 500 A DNA R+ Sbjct: 278 ANDNANGRV 286 >UniRef50_B3XIH6 GpH n=7 Tax=Enterobacteriaceae RepID=B3XIH6_ECOLX Length = 710 Score = 352 bits (904), Expect = 4e-95, Method: Compositional matrix adjust. 
Identities = 174/223 (78%), Positives = 191/223 (85%), Gaps = 4/223 (1%) Query: 900 PPESYPVGAPIPWPSDTVPSGYALMQGQAFDKSAYPKLAAAYPSGVIPDMRGWTIKGKPA 959 P +S+PVGA IPWPSD+VP+GYA+MQGQ FDK+ YP LAAAYPSGV+PDMRGWTIKGKPA Sbjct: 490 PQDSFPVGAAIPWPSDSVPTGYAVMQGQTFDKTTYPLLAAAYPSGVLPDMRGWTIKGKPA 549 Query: 960 SGRAVLSQEQDGIKSHTHSASASSTDLGTKTTSSFDYGTKSTNNTGAHTHSVSGSTNSAG 1019 SGR VLS EQDGIKSHTHSASAS+TDLGTKTTSSFDYGTKSTNNTGAHTH+VSG+ NSAG Sbjct: 550 SGRDVLSLEQDGIKSHTHSASASNTDLGTKTTSSFDYGTKSTNNTGAHTHNVSGTANSAG 609 Query: 1020 AHTHS--LANVNTASANSGAGSASTRLSVVHNQNYATSSAGAHTHSLSGTAASAGAHAHT 1077 AHTH+ L N+ N ++ +VV N S+GAHTHS+SGTA SAGAHAHT Sbjct: 610 AHTHTVPLRRPNSGGMNFDWLDGASSGTVVGNG--TVPSSGAHTHSVSGTATSAGAHAHT 667 Query: 1078 VGIGAHTHSVAIGSHGHTITVNAAGNAENTVKNIAFNYIVRLA 1120 VGIGAHTHSVAIGSHGHTITVNAAGNAENTVKNIAFNYIVRLA Sbjct: 668 VGIGAHTHSVAIGSHGHTITVNAAGNAENTVKNIAFNYIVRLA 710 Score = 70.9 bits (172), Expect = 3e-10, Method: Compositional matrix adjust. Identities = 58/145 (40%), Positives = 81/145 (55%), Gaps = 6/145 (4%) Query: 384 SSAASSASSASASKDEATRQASAAKSSATTASTKATEAAGSATAAAQSKSTAESAATRAE 443 SSA +S S + A+ +A + A + TA T G + + ST+E+ A + Sbjct: 271 SSATNSTSESLAATPKAVKAAYELANGKYTAQDATTAQKGIVQLSNATNSTSETLAATPK 330 Query: 444 TAAKRAEDIASA-VALEDASTTKKGIVQLSSATNSTSETLAATPKAVKSAYDNAEKRLQK 502 A K A D+A+ +DA+T +KG+VQLSSATNSTSETLAATPKAVK+A DNA R+ Sbjct: 331 -AVKAAYDLANGKYTAQDATTARKGLVQLSSATNSTSETLAATPKAVKAANDNANGRVPS 389 Query: 503 DQNGADIPDKGCFLNNINAVSKTDF 527 + P N++N S+ F Sbjct: 390 GRKVNGKP----LTNDVNVTSQDIF 410 >UniRef50_B3X2T1 Side tail fiber protein n=1 Tax=Shigella dysenteriae 1012 RepID=B3X2T1_SHIDY Length = 488 Score = 347 bits (889), Expect = 2e-93, Method: Compositional matrix adjust. 
Identities = 190/241 (78%), Positives = 196/241 (81%), Gaps = 13/241 (5%) Query: 893 VYTSKNLPPESYPVGAPIPWPSDTVPSGYALMQGQAFDKSAYPKLAAAYPSGVIPDMRGW 952 VY+ + YP GAPIPWPSDTVPSGYALMQGQ FDKSAYPKLA AYPSGVIPDMRGW Sbjct: 248 VYSLYTPSEQFYPPGAPIPWPSDTVPSGYALMQGQTFDKSAYPKLAVAYPSGVIPDMRGW 307 Query: 953 TIKGKPASGRAVLSQEQDGIKSHTHSASASSTDLGTKTTSSFDYGTKSTNNTGAHTHSVS 1012 TIKGKPASGRAVLSQEQDGIKSHTHSASASSTDLGTKTTSSFDYGTKSTNNTGAHTHS+S Sbjct: 308 TIKGKPASGRAVLSQEQDGIKSHTHSASASSTDLGTKTTSSFDYGTKSTNNTGAHTHSIS 367 Query: 1013 GSTNSAGAHTHS------------LANVNTASANSGAGSASTRLSVVHNQNYA-TSSAGA 1059 G+ NSAGAH H N TA +N AG ST +N TSS GA Sbjct: 368 GTANSAGAHQHKSSGAFGGTNTSIFPNGYTAISNLSAGIMSTTSGSGQTRNAGKTSSDGA 427 Query: 1060 HTHSLSGTAASAGAHAHTVGIGAHTHSVAIGSHGHTITVNAAGNAENTVKNIAFNYIVRL 1119 HTHSLSGTAASAGAHAHTVGIGAHTHSVAIGSHGHTITVNAAGNAENTVKNIAFNYIVRL Sbjct: 428 HTHSLSGTAASAGAHAHTVGIGAHTHSVAIGSHGHTITVNAAGNAENTVKNIAFNYIVRL 487 Query: 1120 A 1120 A Sbjct: 488 A 488 Score = 69.3 bits (168), Expect = 9e-10, Method: Compositional matrix adjust. Identities = 36/44 (81%), Positives = 40/44 (90%) Query: 456 VALEDASTTKKGIVQLSSATNSTSETLAATPKAVKSAYDNAEKR 499 +ALEDASTTKKGIVQLSSATNSTSE+ AATPKAVK+AYD A + Sbjct: 1 MALEDASTTKKGIVQLSSATNSTSESQAATPKAVKAAYDLANGK 44 Score = 68.9 bits (167), Expect = 9e-10, Method: Compositional matrix adjust. Identities = 43/71 (60%), Positives = 53/71 (74%), Gaps = 2/71 (2%) Query: 431 SKSTAESAATRAETAAKRAEDIASA-VALEDASTTKKGIVQLSSATNSTSETLAATPKAV 489 + ST+ES A + A K A D+A+ +DA+T +KGIVQLSSATNSTSETLAATPKAV Sbjct: 20 TNSTSESQAATPK-AVKAAYDLANGKYTAQDATTAQKGIVQLSSATNSTSETLAATPKAV 78 Query: 490 KSAYDNAEKRL 500 K+A DNA R+ Sbjct: 79 KAANDNANGRV 89 >UniRef50_P03764 Side tail fiber protein n=2 Tax=root RepID=STF_LAMBD Length = 774 Score = 337 bits (864), Expect = 2e-90, Method: Compositional matrix adjust. 
Identities = 176/245 (71%), Positives = 187/245 (76%), Gaps = 28/245 (11%) Query: 904 YPVGAPIPWPSDTVPSGYALMQGQAFDKSAYPKLAAAYPSGVIPDMRGWTIKGKPASGRA 963 +P GAPIPWPSD VPSGY LMQGQAFDKSAYPKLA AYPSGV+PDMRGWTIKGKPASGRA Sbjct: 530 FPAGAPIPWPSDIVPSGYVLMQGQAFDKSAYPKLAVAYPSGVLPDMRGWTIKGKPASGRA 589 Query: 964 VLSQEQDGIKSHTHSASASSTDLGTKTTS----------SFDYGTKSTNNTGAHTHSVSG 1013 VLSQEQDGIKSHTHSASAS TDLGTKTTS SFDYGTKSTNNTGAH HS+SG Sbjct: 590 VLSQEQDGIKSHTHSASASGTDLGTKTTSSFDYGTKTTGSFDYGTKSTNNTGAHAHSLSG 649 Query: 1014 STNSAGAHTHSLANVNTASANSGAGSAST--RLSVVH---NQNYA----TSSAGAHTHSL 1064 ST +AGAH H+ +S S G+A+ LS V Q A T S G+H+HSL Sbjct: 650 STGAAGAHAHTSGLRMNSSGWSQYGTATITGSLSTVKGTSTQGIAYLSKTDSQGSHSHSL 709 Query: 1065 SGTAASAGAHAHTVGIGAHTHSV---------AIGSHGHTITVNAAGNAENTVKNIAFNY 1115 SGTA SAGAHAHTVGIGAH H V +IGSHGHTITVNAAGNAENTVKNIAFNY Sbjct: 710 SGTAVSAGAHAHTVGIGAHQHPVVIGAHAHSFSIGSHGHTITVNAAGNAENTVKNIAFNY 769 Query: 1116 IVRLA 1120 IVRLA Sbjct: 770 IVRLA 774 Score = 329 bits (844), Expect = 3e-88, Method: Compositional matrix adjust. 
Identities = 284/352 (80%), Positives = 302/352 (85%) Query: 1 MAVKISGVLKDGTGKPVQNCTIQLKAKRNSTTVVVNTLASENPDEAGRYSMDVEYGQYSV 60 MAVKISGVLKDGTGKPVQNCTIQLKA+RNSTTVVVNT+ SENPDEAGRYSMDVEYGQYSV Sbjct: 1 MAVKISGVLKDGTGKPVQNCTIQLKARRNSTTVVVNTVGSENPDEAGRYSMDVEYGQYSV 60 Query: 61 ILLVEGFPPSHAGTITVYEDSQPGTLNDFLGAMTEDDARPEALRRFELMVEEVARNASAV 120 IL V+GFPPSHAGTITVYEDSQPGTLNDFL AMTEDDARPE LRR ELMVEEVARNAS V Sbjct: 61 ILQVDGFPPSHAGTITVYEDSQPGTLNDFLCAMTEDDARPEVLRRLELMVEEVARNASVV 120 Query: 121 AQNTAAAKKSASDASTSAREAATHAADAADSARAASTSAGQAASSAQSASSSAGTASTKA 180 AQ+TA AKKSA DAS SA + A DA DSARAASTSAGQAASSAQ ASS A AS KA Sbjct: 121 AQSTADAKKSAGDASASAAQVAALVTDATDSARAASTSAGQAASSAQEASSGAEAASAKA 180 Query: 181 TEASKSAAAAESSKSAAATSAGAAKTSETNASASLQSAATSASTATTKASEAATSARDAA 240 TEA KSAAAAESSK+AAATSAGAAKTSETNA+AS QSAATSASTA TKASEAATSARDA Sbjct: 181 TEAEKSAAAAESSKNAAATSAGAAKTSETNAAASQQSAATSASTAATKASEAATSARDAV 240 Query: 241 ASKEAAKSSETNASSSASSAASSATAAGNSAKAAKTSETNARSSETAAGQSASAAAGSKT 300 ASKEAAKSSETNASSSA AASSATAA NSA+AAKTSETNARSSETAA +SASAAA +KT Sbjct: 241 ASKEAAKSSETNASSSAGRAASSATAAENSARAAKTSETNARSSETAAERSASAAADAKT 300 Query: 301 AAASSASAASTSAGQASASATAAGKSAESAASSASTATTKAGEATEQASAAA 352 AAA SAS AST A +A+ SA +A +S +A ++A A A A + ASA A Sbjct: 301 AAAGSASTASTKATEAAGSAVSASQSKSAAEAAAIRAKNSAKRAEDIASAVA 352 >UniRef50_A8A0A4 L-shaped tail fiber protein n=12 Tax=root RepID=A8A0A4_ECOHS Length = 1258 Score = 304 bits (778), Expect = 1e-80, Method: Compositional matrix adjust. 
Identities = 264/330 (80%), Positives = 281/330 (85%), Gaps = 28/330 (8%) Query: 1 MAVKISGVLKDGTGKPVQNCTIQLKAKRNSTTVVVNTLASENPDEAGRYSMDVEYGQYSV 60 MAV+ISGVLKDG GKP+QNCTIQLKA+RNSTTVVVNT+ASENPDEAGRYSMDVEYGQYSV Sbjct: 1 MAVRISGVLKDGAGKPIQNCTIQLKARRNSTTVVVNTVASENPDEAGRYSMDVEYGQYSV 60 Query: 61 ILLVEGFPPSHAGTITVYEDSQPGTLNDFLGAMTEDDARPEALRRFELMVEEVARNASAV 120 ILLVEGFPPSHAGTITVYEDSQPGTLNDFLGAMTEDD RPEALRRFELMVEEVARNASAV Sbjct: 61 ILLVEGFPPSHAGTITVYEDSQPGTLNDFLGAMTEDDVRPEALRRFELMVEEVARNASAV 120 Query: 121 AQNTAAAKKSASDASTSAREAATHAADAADSARAASTSAGQAASSAQSASSSAGTASTKA 180 AQNTAAAKKSASDA TSAREAATHA DAADSARAASTSAGQAASSAQSASSSAGTASTKA Sbjct: 121 AQNTAAAKKSASDARTSAREAATHATDAADSARAASTSAGQAASSAQSASSSAGTASTKA 180 Query: 181 TEASKSAAAAESSKSAAATSAGAAKTSETNASASLQSAATSASTATTKASEAATSARDAA 240 TEASKSAAAAESSKSAAATSAGAAKTSETNA+AS +SAATSASTATTKASEAATSARDA+ Sbjct: 181 TEASKSAAAAESSKSAAATSAGAAKTSETNAAASQKSAATSASTATTKASEAATSARDAS 240 Query: 241 ASK----------------------------EAAKSSETNASSSASSAASSATAAGNSAK 272 ASK +AAK+SE NA +SA +AA S TA+ NSA Sbjct: 241 ASKVAAKSSETSAASSAGSAASSATAAGNSAKAAKTSEMNADNSAQAAADSQTASANSAT 300 Query: 273 AAKTSETNARSSETAAGQSASAAAGSKTAA 302 AAK SETNA++SE+AA S + A S+ A Sbjct: 301 AAKKSETNAKNSESAAKVSETNAKASENKA 330 >UniRef50_A4TT73 Phage tail protein n=30 Tax=Enterobacteriaceae RepID=A4TT73_YERPP Length = 962 Score = 302 bits (774), Expect = 5e-80, Method: Compositional matrix adjust. 
Identities = 161/254 (63%), Positives = 191/254 (75%), Gaps = 27/254 (10%) Query: 893 VYTSKNLPPESYPVGAPIPWPSDTVPSGYALMQGQAFDKSAYPKLAAAYPSGVIPDMRGW 952 +Y+S PPESYPVGAPIPWP+D PSG+A+MQGQ FDKS YPKLAAAYPSGV+PDMRGW Sbjct: 710 LYSSVLPPPESYPVGAPIPWPNDVAPSGFAIMQGQTFDKSVYPKLAAAYPSGVLPDMRGW 769 Query: 953 TIKGKPASGRAVLSQEQDGIKSHTHSASASSTDL----------GTKTTSSFDYGTKSTN 1002 IKGKP S RAVLS EQDGIKSH H+A+ASSTDL GTKT+S FDYGTKS+N Sbjct: 770 MIKGKPTS-RAVLSLEQDGIKSHAHNAAASSTDLGTKPTTTFDYGTKTSSGFDYGTKSSN 828 Query: 1003 NTGAHTHSVSGSTNSAGAHTHSLA-------NVNTASANSGAGSASTRLSVVHNQNYATS 1055 +TGAH HS+SGST+S+GAH H++ + ++ + N+ +T+ + + N TS Sbjct: 829 STGAHAHSLSGSTSSSGAHAHTVTAHTQYPRSTDSRNQNAVGKQYNTQQTTANAFNVWTS 888 Query: 1056 SAGAHTHSLSGTAASAGAHAHTVGIGA---------HTHSVAIGSHGHTITVNAAGNAEN 1106 SAG H HS+SGTA SAGAHAHTVGIGA H+HSVAIG+H HTIT+ A GNAEN Sbjct: 889 SAGDHAHSISGTAVSAGAHAHTVGIGAHAHSLSIGSHSHSVAIGAHSHTITIAACGNAEN 948 Query: 1107 TVKNIAFNYIVRLA 1120 TVKNIA+NYIVRLA Sbjct: 949 TVKNIAYNYIVRLA 962 Score = 42.7 bits (99), Expect = 0.090, Method: Compositional matrix adjust. Identities = 62/175 (35%), Positives = 89/175 (50%), Gaps = 51/175 (29%) Query: 371 SAESSKTAAASSASSAASSASSASASKD----------------------------EATR 402 SA++SK AAA+S ++AA+S + A SKD E +R Sbjct: 263 SAKTSKEAAAASEAAAANSENEARTSKDTAVAAAAEASANATSADASRHDVDTNKAEVSR 322 Query: 403 ---QASAAKSSATTASTKATEAAGSATAAAQSK------STAESAATRAETA-------- 445 + AA+ S S +A AA +A A +K S +S A +A +A Sbjct: 323 MKDEVFAARDSTIQYSEEAKTAADTAAREAATKTSDQLLSAVKSEAEKANSASASAQGFA 382 Query: 446 --AKR----AEDIASAVALEDASTTKKGIVQLSSATNSTSETLAATPKAVKSAYD 494 AKR A++IA + DA+T+++G+VQLSSAT+S SETLA+TPKAVK+ D Sbjct: 383 DDAKRFRDEAQEIAEGSKVNDATTSQQGVVQLSSATDSESETLASTPKAVKTVMD 437 >UniRef50_B5FQX9 Side tail fiber protein n=2 Tax=Salmonella enterica subsp. enterica RepID=B5FQX9_SALDC Length = 569 Score = 277 bits (709), Expect = 2e-72, Method: Compositional matrix adjust. 
Identities = 144/246 (58%), Positives = 160/246 (65%), Gaps = 34/246 (13%) Query: 876 FYRSSRDGYGFE-EDWAEVYTSKNLPPESYPVGAPIPWPSDTVPSGYALMQGQAFDKSAY 934 + RS RD E +WA +YTS N PP SYPVGA I WPSD P+GYALMQGQ+FDKSAY Sbjct: 357 YIRSHRDTADAEWSEWAMLYTSLNPPPNSYPVGAAIAWPSDATPAGYALMQGQSFDKSAY 416 Query: 935 PKLAAAYPSGVIPDMRGWTIKGKPASGRAVLSQEQDGIKSHTHSASASSTDLGTKTTSSF 994 P LA AYPSG+IPDMRGWTIKGKP SGRAVLSQE DG KSH+HSA A TDLGTK+TSSF Sbjct: 417 PLLAIAYPSGIIPDMRGWTIKGKPISGRAVLSQEMDGNKSHSHSARAQDTDLGTKSTSSF 476 Query: 995 DYGTKSTNNTGAHTHSVSGSTNSAGAHTHSLANVNTASANSGAGSASTRLSVVHNQNYAT 1054 DYGTKSTN TG HTH G NS + N S G G+ + Sbjct: 477 DYGTKSTNTTGNHTHQFGGYINS------YWGDSNHTSFQPGGGAWT------------- 517 Query: 1055 SSAGAHTHSLSGTAASAGAHAHTVGIGAHTHSVAIGSHGHTITVNAAGNAENTVKNIAFN 1114 +AG HAHTV IG H H++ IG HGH + V+A GNAE TVKNIAFN Sbjct: 518 --------------QAAGDHAHTVYIGGHEHTMYIGPHGHVVIVDADGNAETTVKNIAFN 563 Query: 1115 YIVRLA 1120 YIVRLA Sbjct: 564 YIVRLA 569 >UniRef50_Q99362 Protein 37 (Fragment) n=1 Tax=Enterobacteria phage T4 RepID=Q99362_BPT4 Length = 382 Score = 274 bits (700), Expect = 2e-71, Method: Compositional matrix adjust. 
Identities = 159/257 (61%), Positives = 177/257 (68%), Gaps = 44/257 (17%) Query: 903 SYPVGAPIPWPSDTVPSGYALMQGQAFDKSAYPKLAAAYPSGVIPDMRGWTIKGKPASGR 962 SYP+GAPIPWP+DT P+GYALM+GQ FD AYPKLAAAYPSG IPDMRG TIKGKP SGR Sbjct: 131 SYPIGAPIPWPTDTPPNGYALMEGQTFDTRAYPKLAAAYPSGTIPDMRGQTIKGKP-SGR 189 Query: 963 AVLSQEQDGIKSHTHSASASSTDLGTKTTSSFDYGTKST----------NNTGAHTHSVS 1012 AVLS E DG+KSHTH ASAS+TDLGTKTTSSFDYGTK+T N TG H H+VS Sbjct: 190 AVLSTEADGVKSHTHGASASNTDLGTKTTSSFDYGTKTTSSFDYGTKTSNTTGNHNHTVS 249 Query: 1013 GSTNSAGAHTHS-----LAN---------------VNTASANSGAGSASTRLSVVHNQNY 1052 G+T+SAGAH H+ L+N N S SG S+ ++ Sbjct: 250 GTTSSAGAHQHARSGPQLSNGISTNIFPDGYSDVGTNYNSKFSGTVIGSSVPCIIG---- 305 Query: 1053 ATSSAGAHTHSLSGTA---------ASAGAHAHTVGIGAHTHSVAIGSHGHTITVNAAGN 1103 TS+ GAHTH+ SGT GAH HTVGIGAHTH+VAIGSHGHTITVNA GN Sbjct: 306 KTSNDGAHTHTWSGTTSTTGNHAHTVGIGAHTHTVGIGAHTHTVAIGSHGHTITVNATGN 365 Query: 1104 AENTVKNIAFNYIVRLA 1120 ENTVKNIAFNYIVRLA Sbjct: 366 TENTVKNIAFNYIVRLA 382 >UniRef50_B5YU13 Tail fiber protein n=140 Tax=root RepID=B5YU13_ECO5E Length = 451 Score = 265 bits (678), Expect = 6e-69, Method: Compositional matrix adjust. 
Identities = 157/247 (63%), Positives = 184/247 (74%), Gaps = 7/247 (2%) Query: 1 MAVKISGVLKDGTGKPVQNCTIQLKAKRNSTTVVVNTLASENPDEAGRYSMDVEYGQYSV 60 MAVKISGVLKDGTGKPV+NCTIQLKA+R S+TVVVNT+ASENPDEAGRYSMDVEYGQYSV Sbjct: 15 MAVKISGVLKDGTGKPVENCTIQLKARRTSSTVVVNTVASENPDEAGRYSMDVEYGQYSV 74 Query: 61 ILLVEGFPPSHAGTITVYEDSQPGTLNDFLGAMTEDDARPEALRRFELMVEEVARNASAV 120 ILLVEGFPPSHAGTITVYEDSQPGTLNDFLGAM+EDD RPEALRRFELMV Sbjct: 75 ILLVEGFPPSHAGTITVYEDSQPGTLNDFLGAMSEDDVRPEALRRFELMV-------EEA 127 Query: 121 AQNTAAAKKSASDASTSAREAATHAADAADSARAASTSAGQAASSAQSASSSAGTASTKA 180 A++ AKK+A +A TSAR A A+ A +SA A TSAG A+ SA+ A+ SA A Sbjct: 128 ARHAEEAKKNAGEAETSARNAGISASQAEESAANADTSAGDASESARQAAESAAAAKQSE 187 Query: 181 TEASKSAAAAESSKSAAATSAGAAKTSETNASASLQSAATSASTATTKASEAATSARDAA 240 +S SA+AA S ++ SA A+ S A ++ +AA A+TAT KA E+A SA+ A Sbjct: 188 EASSSSASAAAQKASESSQSAAEAELSRKTAESAAGNAARDATTATEKARESAESAQSAE 247 Query: 241 ASKEAAK 247 S+ AA+ Sbjct: 248 QSRIAAE 254 >UniRef50_B7LKX7 Putative side tail phage protein n=2 Tax=Escherichia RepID=B7LKX7_ESCF3 Length = 567 Score = 254 bits (649), Expect = 1e-65, Method: Compositional matrix adjust. 
Identities = 128/231 (55%), Positives = 161/231 (69%), Gaps = 35/231 (15%) Query: 902 ESYPVGAPIPWPSDTVPSGYALMQGQAFDKSAYPKLAAAYPSGVIPDMRGWTIKGKPASG 961 ES PVG PIPWPSD+VPSGYALM GQ F+K++YPKLA AYPSGVIPDMRGW IKGKP+SG Sbjct: 360 ESCPVGMPIPWPSDSVPSGYALMTGQTFNKTSYPKLAIAYPSGVIPDMRGWIIKGKPSSG 419 Query: 962 RAVLSQEQDGIKSHTHSASAS----------STDLGTKTTSSFDYGTKSTNNTGAHTHSV 1011 RA+LS E DG+KSH H+ S S STDLGTKTT+SF++G+++T+ +G HTH + Sbjct: 420 RAILSTELDGVKSHNHTGSISSTNLGTITSTSTDLGTKTTASFNHGSRNTSTSGEHTHRI 479 Query: 1012 SGSTNSAGAHTHSLANVNTASANSGAGSASTRLSVVHNQNY--ATSSAGAHTHSLSGTAA 1069 + + G SL N S NS ++NY T SAG+H HS+ Sbjct: 480 P-TDGAEGKDGPSLWN----SPNS-------------DENYREPTESAGSHYHSI----- 516 Query: 1070 SAGAHAHTVGIGAHTHSVAIGSHGHTITVNAAGNAENTVKNIAFNYIVRLA 1120 + GAHAHT+ +G+HTH++ +G+H H+I +N GN ENTVKNIAFNYIVRLA Sbjct: 517 TIGAHAHTIALGSHTHNIVLGTHNHSIIINNTGNTENTVKNIAFNYIVRLA 567 Score = 51.6 bits (122), Expect = 2e-04, Method: Compositional matrix adjust. Identities = 31/57 (54%), Positives = 36/57 (63%), Gaps = 5/57 (8%) Query: 460 DASTTKKGIVQLSSATNSTSETLAATPKAVKSAYDNAEKRLQKDQNGA-----DIPD 511 DA+ T+KG QLS+ATNS ET AATPKAVK+AYD A + N A IPD Sbjct: 208 DATLTQKGFTQLSNATNSDDETKAATPKAVKTAYDLANSKAATSHNHAWSQITGIPD 264 Score = 46.2 bits (108), Expect = 0.008, Method: Compositional matrix adjust. Identities = 23/39 (58%), Positives = 31/39 (79%) Query: 458 LEDASTTKKGIVQLSSATNSTSETLAATPKAVKSAYDNA 496 + D + T+KG+V+L++ATNSTS T AATP AVK+A D A Sbjct: 318 IPDGTLTQKGVVKLNNATNSTSTTEAATPNAVKAAMDKA 356 >UniRef50_B5PP06 Side tail fiber protein n=2 Tax=Salmonella enterica subsp. enterica RepID=B5PP06_SALHA Length = 534 Score = 249 bits (637), Expect = 4e-64, Method: Compositional matrix adjust. 
Identities = 130/231 (56%), Positives = 149/231 (64%), Gaps = 34/231 (14%) Query: 876 FYRSSRDGYGFE-EDWAEVYTSKNLPPESYPVGAPIPWPSDTVPSGYALMQGQAFDKSAY 934 + RS RD E +WA +YT+ N PP+S+PVGAPI WPSD P+GYALMQGQ+FDKSAY Sbjct: 213 YIRSHRDTADAEWSEWAMLYTTLNPPPDSHPVGAPIAWPSDATPAGYALMQGQSFDKSAY 272 Query: 935 PKLAAAYPSGVIPDMRGWTIKGKPASGRAVLSQEQDGIKSHTHSASASSTDLGTKTTSSF 994 P LA AYPSGVIPDMRGWTIKGKPASGRA+LSQE DG KSH+HSA A TDLGTKTTSSF Sbjct: 273 PLLAIAYPSGVIPDMRGWTIKGKPASGRAILSQEMDGNKSHSHSARAQDTDLGTKTTSSF 332 Query: 995 DYGTKSTNNTGAHTHSVSGSTNSAGAHTHSLANVNTASANSGAGSASTRLSVVHNQNYAT 1054 DYGTKSTN TG HT+ G NS + N S G G+ + Sbjct: 333 DYGTKSTNTTGNHTNQFGGYINS------YWGDSNHTSFQPGGGAWTQ------------ 374 Query: 1055 SSAGAHTHSLSGTAASAGAHAHTVGIGAHTHSVAIGSHGHTITVNAAGNAE 1105 +AG HAHTV IG H H++ IG HGH + V+A GNAE Sbjct: 375 ---------------AAGDHAHTVYIGGHEHTMYIGPHGHVVIVDADGNAE 410 >UniRef50_C2DS71 Tail fiber protein n=10 Tax=Escherichia RepID=C2DS71_ECOLX Length = 686 Score = 249 bits (636), Expect = 4e-64, Method: Compositional matrix adjust. 
Identities = 154/264 (58%), Positives = 189/264 (71%), Gaps = 21/264 (7%) Query: 1 MAVKISGVLKDGTGKPVQNCTIQLKAKRNSTTVVVNTLASENPDEAGRYSMDVEYGQYSV 60 MAV+ISGVLKDG GKP+QNCTIQLKAKRNSTTVVVNT+ASENPDEAGRYSMDVEYGQYSV Sbjct: 1 MAVQISGVLKDGAGKPIQNCTIQLKAKRNSTTVVVNTVASENPDEAGRYSMDVEYGQYSV 60 Query: 61 ILLVEGFPPSHAGTITVYEDSQPGTLNDFLGAMTEDDARPEALRRFELMVEEVARNASAV 120 ILLVEGFPPSHAG ITVYEDS+PGTLNDFLGA TEDD RPEAL RFE MVEEVARNA Sbjct: 61 ILLVEGFPPSHAGAITVYEDSKPGTLNDFLGAATEDDVRPEALYRFEKMVEEVARNAE-- 118 Query: 121 AQNTAAAKKSASDASTSAREAATHAADAADSARAASTSAGQAASSAQSASSSAGTASTKA 180 +AS ++ +A+++ T AA + ++A+ + T+AG +A +A S+ ++A A+T A Sbjct: 119 ---------AASQSAAAAKKSETAAASSRNAAKTSETNAGNSAKAAASSKTAAQNAATAA 169 Query: 181 TEASKSAAAAESSKSAAATSAGAAKTSETNASASLQSAATSASTATTKASEAATSARDAA 240 + +A A+E + SA + + S NA +SAA +A ATTKA EAA A A Sbjct: 170 ERSETNARASEEA------SADSEEASRRNA----ESAAENAGVATTKAREAAADATKAG 219 Query: 241 ASKEAAKSSETNASSSASSAASSA 264 K+ A S+ T A +A A +A Sbjct: 220 QKKDEALSAATRAEKAADRAEVAA 243 Score = 57.0 bits (136), Expect = 4e-06, Method: Compositional matrix adjust. Identities = 63/142 (44%), Positives = 82/142 (57%), Gaps = 8/142 (5%) Query: 325 KSAESAASSASTATTKAGEATEQASAAARSASAAKTSETNA------KASETSAESSKTA 378 K E A +A A+ A A + +AAA S +AAKTSETNA AS +A + Sbjct: 108 KMVEEVARNAEAASQSAAAAKKSETAAASSRNAAKTSETNAGNSAKAAASSKTAAQNAAT 167 Query: 379 AASSASSAASSASSASASKDEATRQ-ASAAKSSATTASTKATEAAGSATAAAQSKSTAES 437 AA + + A ++ ASA +EA+R+ A +A +A A+TKA EAA AT A Q K A S Sbjct: 168 AAERSETNARASEEASADSEEASRRNAESAAENAGVATTKAREAAADATKAGQKKDEALS 227 Query: 438 AATRAETAAKRAEDIASAVALE 459 AATRAE AA RAE +A+ V E Sbjct: 228 AATRAEKAADRAE-VAAEVTAE 248 Score = 54.7 bits (130), Expect = 2e-05, Method: Compositional matrix adjust. 
Identities = 56/114 (49%), Positives = 79/114 (69%), Gaps = 4/114 (3%) Query: 239 AAASKEAAKSSETNASSSASSAASSATAAGNSAKAAKTSETNARSSETAAGQSASAAAGS 298 AA+S+ AAK+SETNA +SA +AASS TAA N+A AA+ SETNAR+SE A+ S A+ + Sbjct: 134 AASSRNAAKTSETNAGNSAKAAASSKTAAQNAATAAERSETNARASEEASADSEEASRRN 193 Query: 299 KTAAASSASAASTSAGQASASATAAGKSAESAASSASTATTKAGEATEQASAAA 352 +AA +A A+T A +A+A AT AG+ + A S+A T+A +A ++A AA Sbjct: 194 AESAAENAGVATTKAREAAADATKAGQKKDEALSAA----TRAEKAADRAEVAA 243 >UniRef50_B4T041 Gp19 n=1 Tax=Salmonella enterica subsp. enterica serovar Newport str. SL254 RepID=B4T041_SALNS Length = 580 Score = 249 bits (636), Expect = 5e-64, Method: Compositional matrix adjust. Identities = 126/221 (57%), Positives = 139/221 (62%), Gaps = 33/221 (14%) Query: 900 PPESYPVGAPIPWPSDTVPSGYALMQGQAFDKSAYPKLAAAYPSGVIPDMRGWTIKGKPA 959 P S P G P+PWPSDT+P+GYALMQGQAFDK+ YP LA AYPSG IPDMRGWTIKGKP Sbjct: 393 PMMSCPPGVPLPWPSDTIPAGYALMQGQAFDKNVYPLLAIAYPSGTIPDMRGWTIKGKPV 452 Query: 960 SGRAVLSQEQDGIKSHTHSASASSTDLGTKTTSSFDYGTKSTNNTGAHTHSVSGSTNSAG 1019 SGRAVLSQE DG KSH+H A A TDLGTK TSSFDYGTKS+N TG H HS G+ Sbjct: 453 SGRAVLSQELDGNKSHSHGARALDTDLGTKGTSSFDYGTKSSNTTGGHNHSAGGT----- 507 Query: 1020 AHTHSLANVNTASANSGAGSASTRLSVVHNQNYATSSAGAHTHSLSGTAASAGAHAHTVG 1079 G S + V + N +S G HAHT Sbjct: 508 ---------------YGGDSIGGKARVQRDGNDQLTSWN-------------GDHAHTTW 539 Query: 1080 IGAHTHSVAIGSHGHTITVNAAGNAENTVKNIAFNYIVRLA 1120 IG H H+V IG HGH + V+A GNAE TVKNIAFNYIVRLA Sbjct: 540 IGPHDHTVYIGPHGHVVIVDADGNAETTVKNIAFNYIVRLA 580 Score = 70.1 bits (170), Expect = 5e-10, Method: Compositional matrix adjust. 
Identities = 40/66 (60%), Positives = 48/66 (72%), Gaps = 1/66 (1%) Query: 445 AAKRAEDIASA-VALEDASTTKKGIVQLSSATNSTSETLAATPKAVKSAYDNAEKRLQKD 503 A K A D+A A +DA+TT+KGIVQLSS T+S E AATPKAVK A DNA KRL K+ Sbjct: 196 AVKAAYDLADAKYTAQDATTTRKGIVQLSSVTDSNDENQAATPKAVKIAMDNANKRLAKE 255 Query: 504 QNGADI 509 +N AD+ Sbjct: 256 RNLADL 261 >UniRef50_C6UHV3 Predicted tail fiber protein n=22 Tax=Escherichia RepID=C6UHV3_ECOBR Length = 792 Score = 248 bits (634), Expect = 7e-64, Method: Compositional matrix adjust. Identities = 186/290 (64%), Positives = 213/290 (73%), Gaps = 14/290 (4%) Query: 2 AVKISGVLKDGTGKPVQNCTIQLKAKRNSTTVVVNTLASENPDEAGRYSMDVEYGQYSVI 61 AVKISGVLKDG GKP+QNCTIQLKAKRNSTTV+VNT+ASENPDEAGRYSMDVEYGQYSV Sbjct: 3 AVKISGVLKDGAGKPIQNCTIQLKAKRNSTTVLVNTVASENPDEAGRYSMDVEYGQYSVT 62 Query: 62 LLVEGFPPSHAGTITVYEDSQPGTLNDFLGAMTEDDARPEALRRFELMVEEVARNASAVA 121 LLVEGFPPSHAGTITVYE S+PGTLNDFLGAMTEDD PEALRRFE MVEE ARNA A + Sbjct: 63 LLVEGFPPSHAGTITVYEGSRPGTLNDFLGAMTEDDVMPEALRRFEAMVEEAARNAEAAS 122 Query: 122 QNTAAAKKSASDASTSAREAATHAADAADSARAASTSAGQAASSAQSASSSAGTASTKAT 181 Q+ AAAKKS + A++S A T AA+SA+AA+ S +A+SA +A S A T Sbjct: 123 QSAAAAKKSETAAASSKNAAKTSETHAANSAQAAAASQTASANSATAAKKSENNAKNSET 182 Query: 182 EASKSAAAAESSKSAAATSAGAAKTSETNASASLQSAATSASTATTKASEAATSARDAAA 241 A S A+SS++AA TS AKTSET A +S +AA S S A A+ AA SA AA Sbjct: 183 AAKTSETNAKSSQAAAKTSETNAKTSETAAKSSQAAAAESESAAAGSATSAAGSATAAAN 242 Query: 242 SKEAAKSSETNASSSASSAASSATAAGNSAKAAKTSETNARSSETAAGQS 291 S++AAK+SETNA SS + AAKTSETNA++SETAA S Sbjct: 243 SQKAAKTSETNAKSSQT--------------AAKTSETNAKASETAAKNS 278 Score = 52.0 bits (123), Expect = 1e-04, Method: Compositional matrix adjust. 
Identities = 82/192 (42%), Positives = 100/192 (52%), Gaps = 42/192 (21%) Query: 206 TSETNASASLQSAATSASTATTKASEAATSARDAAASKEAAKSSETNASSSASSAASSAT 265 TSET+A+ S Q+AA S + + A+ A S +A S+ AAK+SETNA SS + Sbjct: 144 TSETHAANSAQAAAASQTASANSATAAKKSENNAKNSETAAKTSETNAKSSQA------- 196 Query: 266 AAGNSAKAAKTSETNARSSET----------------------------AAGQSASAAAG 297 AAKTSETNA++SET AA S AA Sbjct: 197 -------AAKTSETNAKTSETAAKSSQAAAAESESAAAGSATSAAGSATAAANSQKAAKT 249 Query: 298 SKTAAASSASAASTSAGQASASATAAGKSAESAASSASTATTKAGEATEQASAAARSASA 357 S+T A SS +AA TS A AS TAA S ++AA S S A A A A+A+A S A Sbjct: 250 SETNAKSSQTAAKTSETNAKASETAAKNSQDAAAQSESAAAGSASAAASSATASANSQKA 309 Query: 358 AKTSETNAKASE 369 AKTSETNAKASE Sbjct: 310 AKTSETNAKASE 321 Score = 48.1 bits (113), Expect = 0.002, Method: Compositional matrix adjust. Identities = 72/142 (50%), Positives = 95/142 (66%), Gaps = 7/142 (4%) Query: 248 SSETNASSSASSAASSATAAGNSAKAAKTSETNARSSETAAGQSASAAAGSKTAAASSAS 307 +SET+A++SA +AA+S TA+ NSA AAK SE NA++SET AA S+T A SS + Sbjct: 144 TSETHAANSAQAAAASQTASANSATAAKKSENNAKNSET-------AAKTSETNAKSSQA 196 Query: 308 AASTSAGQASASATAAGKSAESAASSASTATTKAGEATEQASAAARSASAAKTSETNAKA 367 AA TS A S TAA S +AA S S A A A A+AAA S AAKTSETNAK+ Sbjct: 197 AAKTSETNAKTSETAAKSSQAAAAESESAAAGSATSAAGSATAAANSQKAAKTSETNAKS 256 Query: 368 SETSAESSKTAAASSASSAASS 389 S+T+A++S+T A +S ++A +S Sbjct: 257 SQTAAKTSETNAKASETAAKNS 278 >UniRef50_B3I2W7 Phage Tail Collar Domain family n=1 Tax=Escherichia coli E22 RepID=B3I2W7_ECOLX Length = 654 Score = 246 bits (627), Expect = 4e-63, Method: Compositional matrix adjust. 
Identities = 129/226 (57%), Positives = 149/226 (65%), Gaps = 16/226 (7%) Query: 899 LPPESYPVGAPIPWPSDTVPSGYALMQGQAFDKSAYPKLAAAYPSGVIPDMRGWTIKGKP 958 +P +SYPVGAPIPWPSD P+GYALMQGQ FDK+ YP LA AYP+G+IPDMRG TIKGKP Sbjct: 441 MPEDSYPVGAPIPWPSDVTPTGYALMQGQPFDKAVYPLLAIAYPAGIIPDMRGQTIKGKP 500 Query: 959 ASGRAVLSQEQDGIKSHTHSASASSTDLGTKTTSSFDYGTKSTNNTGAHTHSVSGSTNSA 1018 +GRAVLS EQDG+ SHTH AS S TDLGTK TSSFDYG+K T + + S+ Sbjct: 501 -NGRAVLSYEQDGVISHTHGASISDTDLGTKYTSSFDYGSKPTTSFDYG----NKSSTEG 555 Query: 1019 GAHTHSLANVNTA----SANSGAGSASTRLSVVHNQNYATSSAGAHTHSLSGTAASAGAH 1074 G H H+ T+ + G G S+ +S +G H H G H Sbjct: 556 GWHAHNFRYCATSAYRDTPGQGLGMHSSNVSWAAGDR--IEGSGNHAH-----VTWIGPH 608 Query: 1075 AHTVGIGAHTHSVAIGSHGHTITVNAAGNAENTVKNIAFNYIVRLA 1120 H VGIGAH H V +G HGHT TV+AAGNAENTVKNIAFNYIVRLA Sbjct: 609 DHWVGIGAHNHYVVMGYHGHTATVHAAGNAENTVKNIAFNYIVRLA 654 Score = 49.3 bits (116), Expect = 0.001, Method: Compositional matrix adjust. Identities = 24/41 (58%), Positives = 32/41 (78%) Query: 460 DASTTKKGIVQLSSATNSTSETLAATPKAVKSAYDNAEKRL 500 DAST++KG V+LSS TNS SE +A TPKA+K+ +NA R+ Sbjct: 299 DASTSEKGFVRLSSETNSDSEAMAVTPKALKAVNENANGRV 339 >UniRef50_B4TP26 Side tail fiber protein n=43 Tax=Salmonella enterica RepID=B4TP26_SALSV Length = 892 Score = 214 bits (545), Expect = 2e-53, Method: Compositional matrix adjust. 
Identities = 262/534 (49%), Positives = 321/534 (60%), Gaps = 15/534 (2%) Query: 1 MAVKISGVLKDGTGKPVQNCTIQLKAKRNSTTVVVNTLASENPDEAGRYSMDVEYGQYSV 60 M V ISGVLKD TG PVQNCTIQLKA R STTVVVNT+ASENPD+AGRYSMDVE GQY+V Sbjct: 1 MPVLISGVLKDATGTPVQNCTIQLKACRTSTTVVVNTVASENPDDAGRYSMDVEQGQYTV 60 Query: 61 ILLVEGFPPSHAGTITVYEDSQPGTLNDFLGAMTEDDARPEALRRFELMVEEVARNASAV 120 LLVEG+PPSHAG ITVY+DS+PGTLNDFLGAMTEDD RPEALRRFE MVEEVAR AS Sbjct: 61 TLLVEGYPPSHAGVITVYDDSKPGTLNDFLGAMTEDDVRPEALRRFEAMVEEVARQASEA 120 Query: 121 AQNTAAAKKSASDASTSAREAATHAADAADSARAASTSAGQAASSAQSASSSAGTASTKA 180 ++N +A +++ A TSA +AA A A ++A AA SA QAASSA SA SSAGTA+TKA Sbjct: 121 SRNATSAGQASEQAQTSAGQAAESATAAVNAAGAAEASATQAASSAASAESSAGTATTKA 180 Query: 181 TEASKSAAAAESSKSAAATSAGAAKTSETNASASLQSAATSASTATTKASEAATSARDAA 240 EAS SAA+A+++++AAA SA AAKTSE NA AS +A SA+ A A+ A TSA A Sbjct: 181 GEASASAASADTARTAAAASAAAAKTSEANADASRTAAGDSAAAAAASATAAQTSAARAG 240 Query: 241 ASKEAAKSSETNASSSASSAASSATAAGNSAKAAKTSETNARSSETAAGQSASAAAGSKT 300 AS+ AAK SET A+SSA A +SATAA S KAA S A+ SET A SAS AA S T Sbjct: 241 ASETAAKMSETQAASSAGDAGASATAAAASEKAAAASAAAAKISETNAATSASTAAASAT 300 Query: 301 AAASSASAASTSAGQASASATAAGKSAESAASSASTATTKAGEATEQASAAARSASAAKT 360 AA+SSAS AS A + SA+ A +S+ +A ++A+ A A A + A + ++ Sbjct: 301 AASSSASEASNHAAASDTSASLAAQSSTAAGAAATRAEDAAKRAEDIADVISLEDASLTK 360 Query: 361 SETNAKASETSAESSKTAAASSASSAASSASSASASKDEATRQASAAKSSATTASTKATE 420 +S T ++S AA A A D S A + T T + Sbjct: 361 KGIVKLSSATDSDSEALAATPKAVKTVMGEVQTKAPLD------SPAFTGTPTTPTPPDD 414 Query: 421 AAGSATA---------AAQSKSTAESAATRAETAAKRAEDIASAVALEDASTTKKGIVQL 471 A G TA AA S ES T E A D + A + + K+ + Sbjct: 415 AKGLQTANAEFVRKLIAALVGSVPESLDTLQELADALGNDPSFATTVLNKLAGKQPLDDT 474 Query: 472 SSATNSTSETLAATPKAVKSAYDNAEKRLQKDQNGADIPDKGCFLNNINAVSKT 525 +A + S ++ ++A L K QNG DI DK F I AV+ T Sbjct: 475 LTALSGKSVDGLIEYVGLRETINHAADALLKSQNGGDIQDKKQFARTIGAVTST 528 >UniRef50_C8U9W7 Probable tail fiber protein-like protein n=1 Tax=Escherichia coli O103:H2 str. 
12009 RepID=C8U9W7_ECO10 Length = 377 Score = 212 bits (540), Expect = 6e-53, Method: Compositional matrix adjust. Identities = 103/169 (60%), Positives = 118/169 (69%) Query: 530 KRGMRYVRVNAPAGATSGKYYPVVVMRSAGSVSELASRVIITTATRTAGDPMNNCEFNGF 589 K+GM+ NAP+ A GK+YPV+ RS GS ELASRVIITT + MNNCEFNG Sbjct: 4 KKGMQQYAFNAPSNAVGGKWYPVIFRRSTGSTGELASRVIITTTSAGGNYEMNNCEFNGM 63 Query: 590 VMPGGWTDRGRYAYGMFWQYQNNERAIHSIMMSNKGDDLRSVFYVDGAAFPVFAFIEDGL 649 VMPGGWTDRG YA G F YQ NERAIHSI+ S K DD+ SVFYV+G AFPV E+GL Sbjct: 64 VMPGGWTDRGSYAAGYFSTYQTNERAIHSIVTSLKEDDVCSVFYVEGRAFPVRVSAEEGL 123 Query: 650 SISAPGADLVVNDTTYKFGATNPATECIAADVILDFKSGRGFYESHSLI 698 ++ P D V TTYK+GATNPATE A ILDF +GRGFY SHS+ Sbjct: 124 TVIVPTQDYTVGQTTYKWGATNPATESTNAQAILDFNNGRGFYCSHSIF 172 >UniRef50_Q3ZL14 Tail fiber protein n=2 Tax=Enterobacteriaceae RepID=Q3ZL14_ESCBL Length = 289 Score = 204 bits (520), Expect = 1e-50, Method: Compositional matrix adjust. Identities = 110/217 (50%), Positives = 134/217 (61%), Gaps = 12/217 (5%) Query: 905 PVGAPIPWPSDTVPSGYALMQGQAFDKSAYPKLAAAYPSGVIPDMRGWTIKGKPASGRAV 964 P G + WP T P+G+ALM GQ FD +AYP+LA AYPSGVIPDMRG TIK PASGR + Sbjct: 84 PPGIALAWPGATAPTGFALMLGQTFDTTAYPRLAQAYPSGVIPDMRGQTIKFLPASGRTL 143 Query: 965 LSQEQDGIKSHTHSASASSTDLGTKTTSSFDYGTKSTNNTGAHTHSVSGSTNSAGAHTHS 1024 LS E DG+KSH+HS S S+TDLGT T + D GTK T+ G H H N A + Sbjct: 144 LSLEADGVKSHSHSGSISTTDLGTATAADTDLGTKQTSQDGLHNHVSDSRFNKLMARSSD 203 Query: 1025 LANV-NTASANSGAGSASTRLSVVHNQNYATSSAGAHTHSLSGTAASAGAHAHTVGIGAH 1083 + NT +S + R+S +++ +A S A +G H HTV IG H Sbjct: 204 IDGTNNTGDVDSDNPESEHRVSGMNDSLWAAS-----------VIADSGLHMHTVYIGPH 252 Query: 1084 THSVAIGSHGHTITVNAAGNAENTVKNIAFNYIVRLA 1120 HSV IG HGHT+T++ GN ENTVKNIAFN IVRLA Sbjct: 253 AHSVYIGPHGHTVTISNFGNTENTVKNIAFNAIVRLA 289 >UniRef50_A9DEM1 Tail protein 2 n=1 Tax=Yersinia phage PY100 RepID=A9DEM1_9CAUD Length = 255 Score = 181 bits (460), Expect = 1e-43, Method: Compositional matrix adjust. 
Identities = 102/216 (47%), Positives = 124/216 (57%), Gaps = 32/216 (14%) Query: 905 PVGAPIPWPSDTVPSGYALMQGQAFDKSAYPKLAAAYPSGVIPDMRGWTIKGKPASGRAV 964 PVGAP+ WPSDT P G+ALM GQ FDK YP LA YPSGV+PDMRG IK KP GRAV Sbjct: 72 PVGAPLAWPSDTAPDGWALMIGQTFDKVKYPLLAKVYPSGVLPDMRGRVIKAKP-DGRAV 130 Query: 965 LSQEQDGIKSHTHSASASSTDLGTKTTSSFDYGTKSTNNTGAHTHSVSGSTNSAGAHTHS 1024 LS E+D +KSHTH+ A++ GT+ TS+FD+G K T G HTH G+ ++ Sbjct: 131 LSLEEDQVKSHTHTGKAATAG-GTRATSTFDHGNKRTTTNGNHTHGSPQGARHGGSGQYT 189 Query: 1025 LANVNTASANSGAGSASTRLSVVHNQNYATSSAGAHTHSLSGTAASAGAHAHTVGIGAHT 1084 + T S N+ +SA AG H H V IG H Sbjct: 190 SGDDETNSV----------------FNWPATSA-------------AGDHFHDVQIGPHN 220 Query: 1085 HSVAIGSHGHTITVNAAGNAENTVKNIAFNYIVRLA 1120 H+V I +H HT+ ++A G ENTVKNIA NYIVRLA Sbjct: 221 HNVDI-NHEHTLQIDATGGTENTVKNIAMNYIVRLA 255 >UniRef50_Q6KGF6 Putative tail fiber protein GP37 n=2 Tax=unclassified Myoviridae RepID=Q6KGF6_9CAUD Length = 782 Score = 176 bits (447), Expect = 3e-42, Method: Compositional matrix adjust. Identities = 131/358 (36%), Positives = 180/358 (50%), Gaps = 71/358 (19%) Query: 768 ELVIGTKLSASLNGNALTATKLQTPRRVSGVEFDGSKDITLTAAHVAAFARRATDTYADA 827 +L++G SAS+N A ++Q P S ++ T + ++ A++ D+ Sbjct: 429 QLILGNS-SASINKTLTLAGQIQ-PSDFSNLDARYYTQSTANSRYMLAYSSGTGTEVGDS 486 Query: 828 DGGVPWNAESGAYNVT--RSGDSYILVNFYTGVGSCRTLQMKAHYRNGGLFYRSSRDGYG 885 DG + WNA++G YNVT G + ++ Y G S + Q+K +YRNGG +YRSSRDG+G Sbjct: 487 DG-IAWNAKTGLYNVTGYSGGSTQLVFQMYQGASSTPSAQLKFNYRNGGFWYRSSRDGFG 545 Query: 886 FEEDWAEVYTSKNLPPES---------------------------YPVGAPIPWPSDTVP 918 FEED+ ++YT K P S YPVG + S+ P Sbjct: 546 FEEDFTQIYTEKYKPTPSAIGAYTKAETDQKIAEAISDSTDLNKIYPVGIVTWFNSNVNP 605 Query: 919 SGYALMQGQAFDKSAYPKLAAAYPSGVIPDMRGWTIKGKPASGRAV--------LSQEQD 970 + +A P L Y + + G TI+ A+G V ++ Sbjct: 606 N------------TALPGLTWTYLNNGV----GRTIRIAAANGSDVATTGGSDSVTLSVG 649 Query: 971 GIKSHTHSASASSTDLGTKTTSSFDYGTKSTNNTGAHTHSVSGSTNSAGAHTHSLANVNT 1030 + SHTHS SA TTSSFDYGTK+TN TGAHTHSVSGSTN+ GAHTH+ Sbjct: 650 
NLPSHTHSFSA--------TTSSFDYGTKTTNTTGAHTHSVSGSTNNTGAHTHTFG---- 697 Query: 1031 ASANSGAGSASTRLSV-VHNQNYATSSAGAHTHSLSGTAASAGAHAHTVGIGAHTHSV 1087 G S + V V +S AG H+H++ GTAAS G HAHTVGIGAH+H+V Sbjct: 698 --GRYGGDSIGGKHRVHVSGTEQVSSVAGDHSHTVYGTAASNGNHAHTVGIGAHSHTV 753 >UniRef50_Q38190 Gp37, tip of tail fiber (Fragment) n=5 Tax=Enterobacteria phage T4 RepID=Q38190_BPT4 Length = 226 Score = 157 bits (396), Expect = 3e-36, Method: Compositional matrix adjust. Identities = 114/232 (49%), Positives = 134/232 (57%), Gaps = 71/232 (30%) Query: 953 TIKGKPASGRAVLSQEQDGIKSHTHSASASSTDLGTKTTSSFDYGTKSTNNTGAHTHSVS 1012 TIKGKP SGRAVLS E DG+K+H+HSASASSTDLGTKTTSSFDYGTK TN+TG HTHS S Sbjct: 1 TIKGKP-SGRAVLSAEADGVKAHSHSASASSTDLGTKTTSSFDYGTKGTNSTGGHTHSGS 59 Query: 1013 GSTNSAGAHTHSLANVNTASANSGAGSASTRLSVV------------------HNQNYAT 1054 GST++ G H+H + N G G ++S H ++ T Sbjct: 60 GSTSTNGEHSHYIEAWN------GTGVGGNKMSSYAISYRAGGSNTNAAGNHSHTFSFGT 113 Query: 1055 SSAGAHTHSL------------SGTA---------------------------------- 1068 SSAG H+HS+ +GT Sbjct: 114 SSAGDHSHSVGIGEHSHYIEAWNGTGVGGNKMSSYAISYRAGGSNTNAAGNHSHTFSFGT 173 Query: 1069 ASAGAHAHTVGIGAHTHSVAIGSHGHTITVNAAGNAENTVKNIAFNYIVRLA 1120 +SAG H+H+VGIGAHTH+VAIGSHGHTITVN+ GN ENTVKNIAFNYIV LA Sbjct: 174 SSAGDHSHSVGIGAHTHTVAIGSHGHTITVNSTGNTENTVKNIAFNYIVALA 225 >UniRef50_D0ZBI4 Putative tail fiber protein n=1 Tax=Edwardsiella tarda EIB202 RepID=D0ZBI4_EDWTE Length = 718 Score = 143 bits (361), Expect = 4e-32, Method: Compositional matrix adjust. 
Identities = 66/136 (48%), Positives = 92/136 (67%) Query: 4 KISGVLKDGTGKPVQNCTIQLKAKRNSTTVVVNTLASENPDEAGRYSMDVEYGQYSVILL 63 +I+G+LKDG GKP+ NC I LKA R S +V+V+T+AS++P EAG Y M E GQY V L Sbjct: 3 RITGILKDGMGKPITNCEIALKALRTSASVIVHTVASQSPGEAGLYDMAAEPGQYRVTLC 62 Query: 64 VEGFPPSHAGTITVYEDSQPGTLNDFLGAMTEDDARPEALRRFELMVEEVARNASAVAQN 123 V+G+PP + G I +Y DS GTLN FLG + D RP+ ++ FE+MV +V+ ++ V +N Sbjct: 63 VDGYPPEYVGDIQIYHDSPDGTLNYFLGLPVDGDLRPDVMKEFEIMVAKVSAQSAEVEKN 122 Query: 124 TAAAKKSASDASTSAR 139 AA +SA A S + Sbjct: 123 KDAAAESARSALNSQQ 138 Score = 52.0 bits (123), Expect = 1e-04, Method: Compositional matrix adjust. Identities = 30/85 (35%), Positives = 44/85 (51%), Gaps = 14/85 (16%) Query: 906 VGAPIPW-----PSDTVPSG---YALMQGQAFDKSAYPKLAAAYPSGVIP-DMRGWTIKG 956 +G+ IPW P + P+ + GQ+FD +PKL YP +P DMRG+T +G Sbjct: 566 IGSLIPWALERMPQEIWPNCGMHFIPYMGQSFDPELFPKLHDVYPDNRLPTDMRGYTARG 625 Query: 957 KPAS-----GRAVLSQEQDGIKSHT 976 GRA+LS + D I++ T Sbjct: 626 WDNGRGIDIGRALLSYQDDAIQNIT 650 >UniRef50_D2TSH8 Phage tail fibre protein n=7 Tax=root RepID=D2TSH8_CITRO Length = 617 Score = 134 bits (336), Expect = 3e-29, Method: Compositional matrix adjust. Identities = 62/126 (49%), Positives = 86/126 (68%) Query: 775 LSASLNGNALTATKLQTPRRVSGVEFDGSKDITLTAAHVAAFARRATDTYADADGGVPWN 834 LS L+GNA TATKL+T R+++GV FDGS DI+++A +V AFA R T + D V WN Sbjct: 413 LSGELSGNAATATKLKTARKIAGVGFDGSSDISISAKNVNAFALRQTGNTVNGDTSVGWN 472 Query: 835 AESGAYNVTRSGDSYILVNFYTGVGSCRTLQMKAHYRNGGLFYRSSRDGYGFEEDWAEVY 894 +SGAYN G S ++++F GSC +Q + +Y+NGG+ YRS+RDGYGFE W++ Y Sbjct: 473 WDSGAYNALIGGASALILHFNINAGSCPAVQFRVNYKNGGISYRSARDGYGFELGWSDFY 532 Query: 895 TSKNLP 900 T+ P Sbjct: 533 TTTRKP 538 Score = 58.9 bits (141), Expect = 1e-06, Method: Compositional matrix adjust. 
Identities = 35/56 (62%), Positives = 42/56 (75%), Gaps = 1/56 (1%) Query: 445 AAKRAEDIASA-VALEDASTTKKGIVQLSSATNSTSETLAATPKAVKSAYDNAEKR 499 A K A D+A+A +DA+T +KGIVQLSSATNSTSETLAAT KAVK+ D K+ Sbjct: 199 AVKAAYDLANAKYTAQDATTAQKGIVQLSSATNSTSETLAATSKAVKAVMDETNKK 254 >UniRef50_Q858V4 GpH n=9 Tax=root RepID=Q858V4_9CAUD Length = 913 Score = 134 bits (336), Expect = 3e-29, Method: Compositional matrix adjust. Identities = 67/127 (52%), Positives = 87/127 (68%), Gaps = 2/127 (1%) Query: 775 LSASLNGNALTATKLQTPRRVSGVEFDGSKDITLTAAHVAAFARRAT-DTYADADGGVPW 833 LS L+GNA TATKL+T R+++ V FDG+ DI LT ++ AFA T DT A+ D V W Sbjct: 637 LSGELSGNAATATKLKTARKINNVSFDGTSDINLTPKNIGAFASGKTGDTVAN-DKAVGW 695 Query: 834 NAESGAYNVTRSGDSYILVNFYTGVGSCRTLQMKAHYRNGGLFYRSSRDGYGFEEDWAEV 893 N SGAYN T G S ++++F G GSC Q + +Y+NGG+FYRS+RDGYGFE DW+E Sbjct: 696 NWSSGAYNATTGGASTLILHFNIGEGSCPAAQFRVNYKNGGIFYRSARDGYGFEADWSEF 755 Query: 894 YTSKNLP 900 YT+ P Sbjct: 756 YTTTRKP 762 Score = 90.1 bits (222), Expect = 4e-16, Method: Compositional matrix adjust. Identities = 49/74 (66%), Positives = 54/74 (72%), Gaps = 1/74 (1%) Query: 460 DASTTKKGIVQLSSATNSTSETLAATPKAVKSAYDNAEKRLQKDQNGADIPDKGCFLNNI 519 DA+ T+KG QLSSATNSTSE LAATPKAVK+A DNA RL K+QNGADI DK FL+NI Sbjct: 168 DATLTEKGFTQLSSATNSTSEKLAATPKAVKAANDNANSRLAKNQNGADIQDKSAFLDNI 227 Query: 520 NAVSKTDFADKRGM 533 S T F GM Sbjct: 228 GVTSLT-FMKHNGM 240 >UniRef50_D0IJ09 Tail fiber protein H putative n=1 Tax=Vibrio sp. RC586 RepID=D0IJ09_9VIBR Length = 368 Score = 133 bits (335), Expect = 3e-29, Method: Compositional matrix adjust. 
Identities = 82/215 (38%), Positives = 107/215 (49%), Gaps = 51/215 (23%) Query: 905 PVGAPIPWPSDTVPSGYALMQGQAFDKSAYPKLAAAYPSGVIPDMRGWTIKGKPASGRAV 964 PVG P+PWPSD P G+A+ +GQAFDK A P+LA YP G++ D+RG + GK G + Sbjct: 204 PVGVPLPWPSDIAPEGFAIHKGQAFDKVANPELAKLYPDGILKDLRGMAVVGK-KEGEII 262 Query: 965 LSQEQDGIKSHTHSASASSTDLGTKTTSSFDYGTKSTNNTGAHTHSVSGSTNSAGAHTHS 1024 LS E D +K H + S T SS D G+++TN TG H H T S G + Sbjct: 263 LSYEADQVKQHGYPNS---------TVSSTDLGSRNTNTTGNHAHGYPAGT-SNGPNGPY 312 Query: 1025 LANVNTASANSGAGSASTRLSVVHNQNYATSSAGAHTHSLSGTAASAGAHAHTVGIGAHT 1084 L +TA A+ G +T G H H+V IG+H Sbjct: 313 L---DTAHASYGYRYTTTE----------------------------GNHYHSVAIGSHA 341 Query: 1085 HSVAIGSHGHTITVNAAGNAENTVKNIAFNYIVRL 1119 HS+AI G T ENT+KNI FN+IVR+ Sbjct: 342 HSIAIALFGAT---------ENTIKNIKFNWIVRM 367 >UniRef50_C4U3E2 Tail fiber protein (Fragment) n=1 Tax=Yersinia kristensenii ATCC 33638 RepID=C4U3E2_YERKR Length = 430 Score = 126 bits (317), Expect = 5e-27, Method: Compositional matrix adjust. Identities = 76/187 (40%), Positives = 111/187 (59%), Gaps = 2/187 (1%) Query: 1 MAVKISGVLKDGTGKPVQNCTIQLKAKRNSTTVVVNTLASENPDEAGRYSMDVEYGQYSV 60 M+VKISG L DG G P+ C I LKA+ N+ VV+ T+A+ G YS + + G+Y V Sbjct: 1 MSVKISGALIDGAGIPMSGCQIILKARVNTAEVVMRTIATITTGRNGEYSFEAQVGRYCV 60 Query: 61 ILLVEGFPPSHA-GTITVYEDSQPGTLNDFLGAMTEDDARPEALRRFELMVEEVARNASA 119 L G+ + G ITVY+DS+PGTLNDFL A+ E D +P+ ++RFE +V + ++A Sbjct: 61 YLR-HGWSNEYCVGDITVYDDSKPGTLNDFLIALDEGDLKPDVVKRFEELVAQAQQSADM 119 Query: 120 VAQNTAAAKKSASDASTSAREAATHAADAADSARAASTSAGQAASSAQSASSSAGTASTK 179 A++ +S D + +A +AADA SA AA+ S A S + A+SSA +A+ Sbjct: 120 AAESAQQVSQSVQDVTKVKDDAKRYAADAQTSATAAAESQSTATESEKRAASSAHSATQS 179 Query: 180 ATEASKS 186 A A +S Sbjct: 180 AQNAQES 186 >UniRef50_B7MWN9 Putative tail fiber protein (GpH) n=2 Tax=Escherichia coli RepID=B7MWN9_ECO81 Length = 701 Score = 120 bits (300), Expect = 4e-25, Method: Compositional matrix adjust. 
Identities = 102/254 (40%), Positives = 137/254 (53%), Gaps = 33/254 (12%) Query: 318 ASATAAG----KSAESAASSASTATTKA---------GEATEQASAAARSASAAKTSETN 364 AS TA G SA ++AS AT KA G+ T Q + AR +S TN Sbjct: 169 ASLTAKGFTQLSSATNSASETLAATPKAVKAAYDLANGKYTAQDATTARKGLVQLSSATN 228 Query: 365 AKASETSAESSKTAAASSASSAASSASSASASKDEATRQASAAKSSATT--ASTKATEAA 422 + + +A AA ++ +A A+ ++ + +SA S + T A+ KA + A Sbjct: 229 STSETLAATPKAVKAAYDLANGKYTAQDATTARKGLVQLSSATNSDSETLAATPKAVKVA 288 Query: 423 ---GSATAAAQSKSTAE------SAATRAET--------AAKRAEDIASA-VALEDASTT 464 + AQ +TA S+AT +++ A K A D+A+ +DA+T Sbjct: 289 YDLANGKYTAQDATTARKGLVQLSSATNSDSETLAATPKAVKVAYDLANGKYTAQDATTA 348 Query: 465 KKGIVQLSSATNSTSETLAATPKAVKSAYDNAEKRLQKDQNGADIPDKGCFLNNINAVSK 524 +KG+VQLSSATNS SETLAATPKAVKSAYDNAEKRLQKDQNGADIPDK FL NI A + Sbjct: 349 RKGLVQLSSATNSDSETLAATPKAVKSAYDNAEKRLQKDQNGADIPDKRLFLRNIGATNS 408 Query: 525 TDFADKRGMRYVRV 538 T + G + R+ Sbjct: 409 TTMSFSGGTGWFRL 422 >UniRef50_C5H7L2 Putative tail fiber protein GP37 n=3 Tax=unclassified Myoviridae RepID=C5H7L2_9CAUD Length = 391 Score = 119 bits (299), Expect = 6e-25, Method: Compositional matrix adjust. Identities = 74/135 (54%), Positives = 91/135 (67%), Gaps = 22/135 (16%) Query: 975 HTHSASAS--STDLGTKTTSSFDYGTKSTNNTGAHTHSVSGSTNSAGAHTHSL------- 1025 HTHSAS S S D G+K+TS+FDYGTK+TN+ GAHTH+ SG+T++AG H H + Sbjct: 230 HTHSASVSISSFDYGSKSTSTFDYGTKTTNSAGAHTHTFSGTTSNAGNHNHRVPMRGNDR 289 Query: 1026 --ANVNTASANSGAGSASTRLSVVHNQNYATSSAGAHTHSLSGTAASAGAHAHTVGIGAH 1083 N TASA++G G+A T AGAHTHS SGT AS+GAH+HTV IGAH Sbjct: 290 GGTNAITASADAGVGNA-----------MYTDLAGAHTHSFSGTTASSGAHSHTVAIGAH 338 Query: 1084 THSVAIGSHGHTITV 1098 +H+V IGSH HT TV Sbjct: 339 SHTVNIGSHSHTGTV 353 >UniRef50_C6DE08 Tail Collar domain protein n=1 Tax=Pectobacterium carotovorum subsp. carotovorum PC1 RepID=C6DE08_PECCP Length = 682 Score = 118 bits (296), Expect = 1e-24, Method: Compositional matrix adjust. 
Identities = 64/164 (39%), Positives = 91/164 (55%), Gaps = 15/164 (9%) Query: 824 YADADGGVPWNAESGAYNVTRSGDSYILVNFYTGVGSCRTLQMKAHYRNGGLFYRSSRDG 883 Y D+D + W++ +GAY S ++ + GS Q + NGG+ YRSSRD Sbjct: 445 YQDSD--LAWDSPTGAYLKDNGTHSSLIWHMGLNAGSASAAQFYFDFANGGIKYRSSRDN 502 Query: 884 YGFEEDWAEVYTSKNLPPE--------SYPVGAPIPWPSDTVPSGYALMQGQAFDKSAYP 935 GFE+ WA +YT ++ P + VG P+PWP T PSG+ GQ FDK+ YP Sbjct: 503 SGFEKPWARIYTDQDKPTAADIGALSLNEIVGMPMPWPQTTAPSGWLKCNGQTFDKNIYP 562 Query: 936 KLAAAYPSGVIPDMRGWTIKGKPAS-----GRAVLSQEQDGIKS 974 KLA YP+G++PD+RG I+G S GR +LS + D I++ Sbjct: 563 KLAQIYPAGILPDLRGEFIRGWDDSRGVDTGRTLLSTQGDAIRN 606 >UniRef50_D2TJ16 Putative phage tail fibre protein n=1 Tax=Citrobacter rodentium ICC168 RepID=D2TJ16_CITRO Length = 538 Score = 117 bits (293), Expect = 3e-24, Method: Compositional matrix adjust. Identities = 79/250 (31%), Positives = 129/250 (51%) Query: 1 MAVKISGVLKDGTGKPVQNCTIQLKAKRNSTTVVVNTLASENPDEAGRYSMDVEYGQYSV 60 M+V ISG L +G G P+ C I L A N++ VV + A D AG+Y+ + + G+Y+V Sbjct: 1 MSVLISGALINGAGVPMAGCKIYLDALVNTSEVVTESFAVIETDAAGQYAFEAQKGKYTV 60 Query: 61 ILLVEGFPPSHAGTITVYEDSQPGTLNDFLGAMTEDDARPEALRRFELMVEEVARNASAV 120 + + P G I+VY+DS+PGTLNDFL A+ E D +P+ ++RFE MV + ++A A Sbjct: 61 HIKQKNGPKCCVGDISVYDDSKPGTLNDFLTALDEGDLKPDVVKRFEEMVAQAQQSAEAA 120 Query: 121 AQNTAAAKKSASDASTSAREAATHAADAADSARAASTSAGQAASSAQSASSSAGTASTKA 180 A++ A + +DA + T A + + A + + A AG Sbjct: 121 AESEQQAGQHVADAQQIKSDCETLADNVQQNTNAVEENTQRVEQLASEVGLHAGQVQQGV 180 Query: 181 TEASKSAAAAESSKSAAATSAGAAKTSETNASASLQSAATSASTATTKASEAATSARDAA 240 + + A+ + +A SA +K S NA+ S Q+A A A + A+DAA Sbjct: 181 QNVTDAVKKAQQAAKNSADSATDSKNSADNAALSEQNAQKHAQKAEQHEQQTKQYAQDAA 240 Query: 241 ASKEAAKSSE 250 + E+A++++ Sbjct: 241 TAAESAENAK 250 >UniRef50_B6IAV4 Putative phage tail fiber protein n=1 Tax=Escherichia coli SE11 RepID=B6IAV4_ECOSE Length = 590 Score = 116 bits (290), Expect = 5e-24, Method: Compositional matrix adjust. 
Identities = 104/339 (30%), Positives = 161/339 (47%), Gaps = 39/339 (11%) Query: 1 MAVKISGVLKDGTGKPVQNCTIQLKAKRNSTTVVVNTLASENPDEAGRYSMDVEYGQYSV 60 M+V I G L DG G P+ C I LK++ N++ VV+ T A G YS + G+Y V Sbjct: 1 MSVLIYGALTDGAGIPMSGCHIILKSRVNTSEVVMRTEADVVTGNNGEYSFEARTGKYRV 60 Query: 61 ILLVEGFPPSHA-GTITVYEDSQPGTLNDFLGAMTEDDARPEALRRFELMVEEVARNASA 119 L +G+ + G I VY+D++PGTLNDFL A E D +P+ ++RFE MV Sbjct: 61 -YLKQGWRDEYCVGDIAVYDDAKPGTLNDFLTAPDEGDLKPDVVKRFERMV--------- 110 Query: 120 VAQNTAAAKKSASDASTSAREAATHAADAADSARAASTSAGQAASSAQSASSSAGTASTK 179 A A++SA A+ S ++A H AD A + + A Sbjct: 111 -----AQAQQSAESAAESEQQAGQHVAD--------------AQKIKEDCQTLADNVQLN 151 Query: 180 ATEASKSAAAAESSKSAAATSAGAAKTSETNASASLQSAATSASTATTKASEAATSARDA 239 AT ++ E + +AG + + + +++ A +A + + A E+ +A +A Sbjct: 152 ATAVAEDKQHVEHLAAEVEQNAGQMQQGVQSVTDAVKQAQQAADDSASSAEESKNNADNA 211 Query: 240 AASKEAAKSSETNASSSASSAASSA-TAAGNSAKAAK-------TSETNARSSETAAGQS 291 A S+++AKS NA+ SA +A S A AGN+ + A+ + R +E A Q Sbjct: 212 ARSEQSAKSHADNAARSAQNAKSHADNVAGNTLQTAQDVTATAAARDDAERFAENAR-QD 270 Query: 292 ASAAAGSKTAAASSASAASTSAGQASASATAAGKSAESA 330 A+A A + A A +A SA + SA A A +A Sbjct: 271 ATATACDRKATAEDVKSAGESAASSEQSARVAAGYARAA 309 >UniRef50_A3GRX3 Tail fiber like-protein n=1 Tax=Vibrio cholerae NCTC 8457 RepID=A3GRX3_VIBCH Length = 182 Score = 115 bits (287), Expect = 1e-23, Method: Compositional matrix adjust. 
Identities = 75/217 (34%), Positives = 106/217 (48%), Gaps = 51/217 (23%) Query: 904 YPVGAPIPWPSDTVPSGYALMQGQAFDKSAYPKLAAAYPSGVIPDMRGWTIKGKPASGRA 963 +PVG IPW +D P G+ + +GQAFD + Y +LA +P+G+IPDMRG + GK G A Sbjct: 17 FPVGGAIPWFTDVAPEGFGMFKGQAFDVNVYTELAKVFPNGIIPDMRGCGVIGK-EDGEA 75 Query: 964 VLSQEQDGIKSHTHSASASSTDLGTKTTSSFDYGTKSTNNTGAHTHSVSGSTNSAGAHTH 1023 V + E+ +K+H H S T SS D G+K+T N G HTH + G+H + Sbjct: 76 VGAYEEGQVKNHGHPNS---------TVSSIDLGSKNTANGGNHTHFSGIAAFGGGSHRY 126 Query: 1024 SLANVNTASANSGAGSASTRLSVVHNQNYATSSAGAHTHSLSGTAASAGAHAHTVGIGAH 1083 +VN G+G N TS+AG H Sbjct: 127 Q-TDVN------GSGG-----------NINTSAAGNH----------------------- 145 Query: 1084 THSVAIGSHGHTITVNAAGNAENTVKNIAFNYIVRLA 1120 HS+ +GSH H +T+ G +NT+ + N+IVRLA Sbjct: 146 YHSIPMGSHAHAVTIALFGALKNTINHRKINWIVRLA 182 >UniRef50_Q9LA62 ORF-401-like protein n=1 Tax=Enterobacterial phage P-EibA RepID=Q9LA62_9CAUD Length = 479 Score = 113 bits (282), Expect = 5e-23, Method: Compositional matrix adjust. Identities = 76/162 (46%), Positives = 98/162 (60%), Gaps = 4/162 (2%) Query: 1 MAVKISGVLKDGTGKPVQNCTIQLKAKRNSTTVVVNTLASENPDEAGRYSMDVEYGQYSV 60 M V+ISGVLKDGTGKPV CTI+LKA+R + TV+V T+A P+E G YS DVE G Y V Sbjct: 1 MTVRISGVLKDGTGKPVPGCTIELKARRTTETVIVTTVAQGQPEETGSYSFDVEPGWYRV 60 Query: 61 ILLVEGFPPSHAGTITVYEDSQPGTLNDFLGAMTEDDAR--PEALRRFELMVEEVARNAS 118 L EG+ PS+ G I V DS+PGTLN FL M +D+A+ P+AL E + E+ + A Sbjct: 61 TLNTEGYAPSYVGDILVKADSEPGTLNKFL--MEQDEAQYYPKALAELEAVAAEILKRAE 118 Query: 119 AVAQNTAAAKKSASDASTSAREAATHAADAADSARAASTSAG 160 A A + AKK A +A A E A A+ + + G Sbjct: 119 ASAASAEEAKKRAENARGPAGEKGDTGPQGATGAKGPAGATG 160 Score = 51.6 bits (122), Expect = 2e-04, Method: Compositional matrix adjust. 
Identities = 32/57 (56%), Positives = 37/57 (64%), Gaps = 1/57 (1%) Query: 435 AESAATRAETAAKRAEDIASAVA-LEDASTTKKGIVQLSSATNSTSETLAATPKAVK 490 A A R ET A +VA + DAST++KG+VQLSS TNS ET AATPKAVK Sbjct: 231 AGPAGPRGETGPAGPAGPAGSVASVPDASTSQKGVVQLSSDTNSDDETKAATPKAVK 287 >UniRef50_B7MW07 Putative tail fiber protein from prophage n=4 Tax=Escherichia coli ED1a RepID=B7MW07_ECO81 Length = 520 Score = 111 bits (278), Expect = 1e-22, Method: Compositional matrix adjust. Identities = 76/172 (44%), Positives = 98/172 (56%), Gaps = 4/172 (2%) Query: 1 MAVKISGVLKDGTGKPVQNCTIQLKAKRNSTTVVVNTLASENPDEAGRYSMDVEYGQYSV 60 M V+ISGVLKDGTGKPV CTI+LKA+R + TV+V T+A P E G YS DVE G Y V Sbjct: 1 MTVRISGVLKDGTGKPVPGCTIELKARRTTETVIVTTVAQGQPGETGSYSFDVEPGWYRV 60 Query: 61 ILLVEGFPPSHAGTITVYEDSQPGTLNDFLGAMTEDDAR--PEALRRFELMVEEVARNAS 118 L EG+ PS+ G I V DS+PGTLN FL M +D+A+ P+AL E + E+ + A Sbjct: 61 TLNTEGYAPSYVGDILVKADSEPGTLNKFL--MEQDEAQYYPKALAELEAVAAEILKRAE 118 Query: 119 AVAQNTAAAKKSASDASTSAREAATHAADAADSARAASTSAGQAASSAQSAS 170 A A + AKK A +A A E A A+ + + G + Sbjct: 119 ASAASAEEAKKRAENARGPAGEKGDTGPQGATGAQGPAGATGAVGPKGEPGP 170 Score = 52.8 bits (125), Expect = 9e-05, Method: Compositional matrix adjust. Identities = 32/58 (55%), Positives = 38/58 (65%), Gaps = 1/58 (1%) Query: 435 AESAATRAETAAKRAEDIASAVA-LEDASTTKKGIVQLSSATNSTSETLAATPKAVKS 491 A A R ET A +VA + DAST++KG+VQLSS TNS ET AATPKAVK+ Sbjct: 232 AGPAGPRGETGPAGPAGPAGSVASVPDASTSQKGVVQLSSDTNSDDETKAATPKAVKA 289 >UniRef50_Q7Y3Z0 Tail fiber protein n=1 Tax=Yersinia phage PY54 RepID=Q7Y3Z0_9CAUD Length = 690 Score = 105 bits (263), Expect = 8e-21, Method: Compositional matrix adjust. 
Identities = 111/358 (31%), Positives = 178/358 (49%), Gaps = 13/358 (3%) Query: 1 MAVKISGVLKDGTGKPVQNCTIQLKAKRNSTTVVVNTLASENPDEAGRYSMDVEYGQYSV 60 M++KISG+L TG+P + I L+A + S TV+ ++ G YS++VE G+Y V Sbjct: 1 MSIKISGILPGPTGEPAAHIGITLRAIKTSLTVITTLESNSITGTDGAYSLNVEPGKYDV 60 Query: 61 ILLVEGFPPSHAGTITVYEDSQPGTLNDFLGAMTEDDARPEALRRFELMVEEVARNASAV 120 +L V+G GTI VY DS PGTLN+FL A+ E+D PE + + E + E R A Sbjct: 61 LLWVDGINARRVGTINVYSDSLPGTLNNFLTALREEDGTPEIILQLEQLRAEAVRAALEA 120 Query: 121 AQNTAAAKKSASDASTSAREAATHAADAADSARAASTSAGQAAS-SAQSASSSAGTASTK 179 ++ A + A A ++A AA A+ +A +AA A++A S+ T + + Sbjct: 121 KESKNEATQQAGIAISAADNAAQETAELIKAAVKEDADRAEAARYGAETAQSTVNTLAAE 180 Query: 180 A----TEASKSAAAAESSKSAAATSAGAAKTSETNASASLQSAATSASTATTKASEAATS 235 +E + A+AA +S + AA+S+ ++ S + + +S +AA S +A A +A S Sbjct: 181 VARHHSEVGQLASAASNSAAEAASSSNSSAQSASESESSKNAAALSEQSALAGAEDAGNS 240 Query: 236 ARDAAASKEAAKSSETNASSSASSAASSATAAGNSAKAAKTSETNARSSETAAGQSASAA 295 A AA K AAK A A+ A +SA + S A + N + S+T + + Sbjct: 241 ATAAAGDKTAAKGFRDEAEEFAARAKASAESIDVS---ALEEQINQKVSQTEFDDTIADK 297 Query: 296 AGSKTAAASSASAASTSAGQASASATAAGKSAESAASSASTATTKAGEATEQASAAAR 353 A ++ A+ AST + AGK + + + AG+AT+Q A + Sbjct: 298 ASNQALTDGLATKASTQ----QLTDGLAGK-LDKIGGTLTGPLILAGDATDQKGAVTK 350 >UniRef50_A9Q1X5 Putative tail fiber protein n=1 Tax=Enterobacteria phage phiEcoM-GJ1 RepID=A9Q1X5_9CAUD Length = 356 Score = 82.0 bits (201), Expect = 1e-13, Method: Compositional matrix adjust. 
Identities = 75/233 (32%), Positives = 119/233 (51%), Gaps = 18/233 (7%) Query: 862 RTLQMKAHYRNGGLFYRSSRDGY---GFEEDWAEVYTSKNLPPESYPVGAPIPWPSDTVP 918 R +Q ++ N ++ RS F+E W E N+ YP+G + + + T P Sbjct: 73 RVMQRYTNFSNKRMWVRSQNGTVSDANFDE-WTEFVNMNNIYNAIYPIGIVVKFDNATNP 131 Query: 919 SGYALMQGQAFDKSAYPKLA--AAYPSGVIPDMRGWTIKGKPASGRAV--LSQEQDGIKS 974 + G +++ ++A A P D + +I G + AV L G+++ Sbjct: 132 NNN--FTGTVWEQIIDGRVARAATGPEAGTADGQIGSIAGSDTANIAVTNLPGHTHGMQN 189 Query: 975 HTHSASASSTDLGTKTTSSFDYGTKSTNNTGAHTHSVSGSTNSAGAHTHSLANVNTASAN 1034 HTH ++ S + T + D+G +++++GAHTHSVSG+ SAGAH H+ + T N Sbjct: 190 HTHGIASHSHTMAHTHTINHDHGAVTSSSSGAHTHSVSGTAASAGAHQHTEGSPFTGDVN 249 Query: 1035 SGAGSASTRLSVVHNQNYA-------TSSAGAHTHSLSGTAASAGAHAHTVGI 1080 G + ST + + Y+ TSS+GAHTHS+SGTAASAGAH H+V + Sbjct: 250 FGT-TTSTSKDNISDWLYSPSTRYPLTSSSGAHTHSVSGTAASAGAHTHSVDL 301 >UniRef50_Q5GAE0 Putative uncharacterized protein n=3 Tax=Singapore grouper iridovirus RepID=Q5GAE0_9VIRU Length = 1137 Score = 81.6 bits (200), Expect = 2e-13, Method: Compositional matrix adjust. 
Identities = 121/325 (37%), Positives = 168/325 (51%), Gaps = 14/325 (4%) Query: 127 AKKSASDASTSAREAATHAADAADSARAASTSAGQAASSAQSASSSAGTASTKATEASKS 186 A + A+DAS+ A EA A DA+ A A A +A+S A+ ASS A A KATEAS Sbjct: 444 ADQKATDASSKAEEADQKATDASSKAEEADQKATEASSKAEEASSKAEEADQKATEASSK 503 Query: 187 AAAAESSKSAAATSAGAAKTSETNASASLQSAATSASTATTKASEAATSARDAAASKEAA 246 A A S A++ A A T A A++ A A KA+EA++ A +A++ E A Sbjct: 504 AEEASSKAEEASSKAEEADQKATEADQKATEASSKAEEADQKATEASSKAEEASSKAEEA 563 Query: 247 KSSETNASSSASSAASSATAAGNSAKAAKTSETNARSSETAAGQSASAAAGSKTAAASSA 306 T A A+ A+S A A A A + A S A Q A T A A Sbjct: 564 DQKATEADQKATEASSKAEEADQKATEASSKAEEASSKAEEADQKA-------TEADQKA 616 Query: 307 SAASTSAGQASASATAAGKSAESAASSASTATTKAGEATEQASAAARSASAA--KTSETN 364 + AS+ A +A AT A AE A+S A A KA EA ++A+ A + A+ A K E + Sbjct: 617 TEASSKAEEADQKATEASSKAEEASSKAEEADQKATEADQKATEADQKATEASSKAEEAD 676 Query: 365 AKASETSAESSKTAAASSASSAASSASSASASKDEATRQASAAKSSATTASTKATEAAGS 424 KA+E A+ T A+S A A A+ AS+ +EA ++A+ A S A AS+KA EA+ Sbjct: 677 QKATE--ADQKATEASSKAEEADQKATEASSKAEEADQKATEASSKAEEASSKAEEASSK 734 Query: 425 ATAAAQSKSTAESAATRAETAAKRA 449 A A+ S AE A+++AE A ++A Sbjct: 735 AEEAS---SKAEEASSKAEEADQKA 756 Score = 81.6 bits (200), Expect = 2e-13, Method: Compositional matrix adjust. 
Identities = 121/330 (36%), Positives = 171/330 (51%), Gaps = 11/330 (3%) Query: 131 ASDASTSAREAATHAADAADSARAASTSAGQAASSAQSASSSAGTASTKATEASKSAAAA 190 A +A A EA++ A +A+ A A A +A+S A+ ASS A AS+KA EA + A A Sbjct: 469 AEEADQKATEASSKAEEASSKAEEADQKATEASSKAEEASSKAEEASSKAEEADQKATEA 528 Query: 191 ESSKSAAATSAGAAKTSETNASASLQSAATSASTATTKASEAATSARDAAASKEAAKSSE 250 + + A++ A A T AS+ + A++ A A KA+EA A +A++ E A Sbjct: 529 DQKATEASSKAEEADQKATEASSKAEEASSKAEEADQKATEADQKATEASSKAEEADQKA 588 Query: 251 TNASSSASSAASSATAAGNSAKAAKTSETNARSSETAAGQSASAAAGSKTAAASSASAAS 310 T ASS A A+S A A A A T A S A Q A+ A+ A+S A A Sbjct: 589 TEASSKAEEASSKAEEADQKATEADQKATEASSKAEEADQKATEASSKAEEASSKAEEAD 648 Query: 311 TSAGQASASATAAGKSAESAASSASTATTKAGEATEQASAAARSASAA--KTSETNAKAS 368 A +A AT A + A A+S A A KA EA ++A+ A+ A A K +E ++KA Sbjct: 649 QKATEADQKATEADQKATEASSKAEEADQKATEADQKATEASSKAEEADQKATEASSKAE 708 Query: 369 E-----TSAESSKTAAASSASSAASSASSASASKDEATRQASAAKSSATTASTKATEAAG 423 E T A S A+S A A+S A AS+ +EA+ +A A AT AS+KA EA+ Sbjct: 709 EADQKATEASSKAEEASSKAEEASSKAEEASSKAEEASSKAEEADQKATEASSKAEEASS 768 Query: 424 SATAAAQ----SKSTAESAATRAETAAKRA 449 A A Q + S AE A+++AE A ++A Sbjct: 769 KAEEADQKATEASSKAEEASSKAEEADQKA 798 Score = 78.6 bits (192), Expect = 1e-12, Method: Compositional matrix adjust. 
Identities = 127/343 (37%), Positives = 182/343 (53%), Gaps = 26/343 (7%) Query: 131 ASDASTSAREAATHAADAADSARAASTSAGQAASSAQSASSSAGTASTKATEASKSAAAA 190 A +AS+ A EA A +A A AS+ A +A A ASS A AS+KA EA + A A Sbjct: 511 AEEASSKAEEADQKATEADQKATEASSKAEEADQKATEASSKAEEASSKAEEADQKATEA 570 Query: 191 ESSKSAAATSAGAAKTSETNASASLQSAATSASTATTKASEAATSARDAAASKEAAKSSE 250 + + A++ A A T AS+ + A++ A A KA+EA A +A++ E A Sbjct: 571 DQKATEASSKAEEADQKATEASSKAEEASSKAEEADQKATEADQKATEASSKAEEADQKA 630 Query: 251 TNASSSASSAASSATAAGNSAKAA--KTSETNARSSETA-----AGQSASAAAGSKTAAA 303 T ASS A A+S A A A A K +E + +++E + A Q A+ A T A+ Sbjct: 631 TEASSKAEEASSKAEEADQKATEADQKATEADQKATEASSKAEEADQKATEADQKATEAS 690 Query: 304 SSASAASTSAGQASASATAAGKSAESAASSASTATTKAGEAT---EQASAAARSASAAKT 360 S A A A +AS+ A A + A A+S A A++KA EA+ E+AS+ A AS+ K Sbjct: 691 SKAEEADQKATEASSKAEEADQKATEASSKAEEASSKAEEASSKAEEASSKAEEASS-KA 749 Query: 361 SETNAKASETSAESSKTAAASS--------ASSAASSASSASASKDEATRQASAAKSSAT 412 E + KA+E S SK ASS A+ A+S A AS+ +EA ++A+ A S A Sbjct: 750 EEADQKATEAS---SKAEEASSKAEEADQKATEASSKAEEASSKAEEADQKATEASSKAE 806 Query: 413 TASTKATEAAGSATAAAQ----SKSTAESAATRAETAAKRAED 451 A KATEA+ A A Q + S AE A+++AE A+ +AE+ Sbjct: 807 EADQKATEASSKAEEADQKATEASSKAEEASSKAEEASSKAEE 849 Score = 74.3 bits (181), Expect = 2e-11, Method: Compositional matrix adjust. 
Identities = 121/341 (35%), Positives = 172/341 (50%), Gaps = 7/341 (2%) Query: 111 EEVARNASAVAQNTAAAKKSASDASTSAREAATHAADAADSARAASTSAGQAASSAQSAS 170 EE + A+ Q A A +A A EA++ A +A+ A A A +A A AS Sbjct: 519 EEADQKATEADQKATEASSKAEEADQKATEASSKAEEASSKAEEADQKATEADQKATEAS 578 Query: 171 SSAGTASTKATEASKSAAAAESSKSAAATSAGAAKTSETNASASLQSAATSASTATTKAS 230 S A A KATEAS A A S A A A T AS+ + A A+ A++KA Sbjct: 579 SKAEEADQKATEASSKAEEASSKAEEADQKATEADQKATEASSKAEEADQKATEASSKAE 638 Query: 231 EAATSARDAAASKEAAKSSETNASSSASSAASSATAAGNSAKAAKTSETNARSSETAAGQ 290 EA++ A +A A T A A+ A+S A A A A T A S A Q Sbjct: 639 EASSKAEEADQKATEADQKATEADQKATEASSKAEEADQKATEADQKATEASSKAEEADQ 698 Query: 291 SASAAAGSKTAAASSASAASTSAGQASASATAAGKSAESAASSASTATTKAGEATEQASA 350 A+ A+ A A+ AS+ A +AS+ A A AE A+S A A++KA EA ++A+ Sbjct: 699 KATEASSKAEEADQKATEASSKAEEASSKAEEASSKAEEASSKAEEASSKAEEADQKATE 758 Query: 351 AARSASAA--KTSETNAKASETSAESSKTAAASSASSAASSASSASASKDEATRQASAAK 408 A+ A A K E + KA+E S+++ + A+S A A A+ AS+ +EA ++A+ A Sbjct: 759 ASSKAEEASSKAEEADQKATEASSKAEE--ASSKAEEADQKATEASSKAEEADQKATEAS 816 Query: 409 SSATTASTKATEAAGSATAAAQSKSTAESAATRAETAAKRA 449 S A A KATEA+ A A+ S AE A+++AE A ++A Sbjct: 817 SKAEEADQKATEASSKAEEAS---SKAEEASSKAEEADQKA 854 Score = 72.4 bits (176), Expect = 1e-10, Method: Compositional matrix adjust. 
Identities = 117/319 (36%), Positives = 162/319 (50%), Gaps = 24/319 (7%) Query: 131 ASDASTSAREAATHAADAADSARAASTSAGQAASSAQSASSSAGTASTKATEASKSAAAA 190 A +A A EA A +A+ A A A +A+S A+ ASS A A KATEA + A A Sbjct: 602 AEEADQKATEADQKATEASSKAEEADQKATEASSKAEEASSKAEEADQKATEADQKATEA 661 Query: 191 ESSKSAAATSAGAAKTSETNASASLQSAATSASTATTKASEAATSARDAAASKEAAKSSE 250 + + A++ A A T A A++ A A KA+EA++ A E A Sbjct: 662 DQKATEASSKAEEADQKATEADQKATEASSKAEEADQKATEASSKA-------EEADQKA 714 Query: 251 TNASSSASSAASSATAAGNSAKAAKTSETNARSSETAAGQSASAAAGSKTAAASSASAAS 310 T ASS A A+S A A + A+ A + A S A Q A T A+S A AS Sbjct: 715 TEASSKAEEASSKAEEASSKAEEASSKAEEASSKAEEADQKA-------TEASSKAEEAS 767 Query: 311 TSAGQASASATAAGKSAESAASSASTATTKAGEATEQASAAARSASAAKTSETNAKASET 370 + A +A AT A AE A+S A A KA EA+ +A A + K +E ++KA E Sbjct: 768 SKAEEADQKATEASSKAEEASSKAEEADQKATEASSKAEEADQ-----KATEASSKAEE- 821 Query: 371 SAESSKTAAASSASSAASSASSASASKDEATRQASAAKSSATTASTKATEAAGSATAAAQ 430 A+ T A+S A A+S A AS+ +EA ++A+ A S A AS+KA EA AT A+ Sbjct: 822 -ADQKATEASSKAEEASSKAEEASSKAEEADQKATEASSKAEEASSKAEEADQKATEAS- 879 Query: 431 SKSTAESAATRAETAAKRA 449 S AE A+++AE A ++A Sbjct: 880 --SKAEEASSKAEEADQKA 896 Score = 72.0 bits (175), Expect = 1e-10, Method: Compositional matrix adjust. 
Identities = 123/348 (35%), Positives = 172/348 (49%), Gaps = 25/348 (7%) Query: 111 EEVARNASAVAQNTAAAKKSASDASTSAREAATHAADAADSARAASTSAGQAASSAQSAS 170 EE + A+ + A + A+DAS+ A EA A DA+ A A A A+S A+ A Sbjct: 414 EEADQKATEASSKAEEADQKATDASSKAEEADQKATDASSKAEEADQKATDASSKAEEAD 473 Query: 171 SSAGTASTKATEASKSAAAAESSKSAAATSAGAAKTSETNASASLQSAATSASTATTKAS 230 A AS+KA EAS A A+ + A++ A AS+ + A++ A A KA+ Sbjct: 474 QKATEASSKAEEASSKAEEADQKATEASSKA-------EEASSKAEEASSKAEEADQKAT 526 Query: 231 EAATSARDAAASKEAAKSSETNASSSASSAASSATAAGNSAKAAKTSETNARSSETAAGQ 290 EA A +A++ E A T ASS A A+S A A A A T A S A Q Sbjct: 527 EADQKATEASSKAEEADQKATEASSKAEEASSKAEEADQKATEADQKATEASSKAEEADQ 586 Query: 291 SASAAAGSKTAAASSASAASTSAGQASASATAAGKSAESAASSASTATTKAGEATEQASA 350 A T A+S A AS+ A +A AT A + A A+S A A KA EA+ +A Sbjct: 587 KA-------TEASSKAEEASSKAEEADQKATEADQKATEASSKAEEADQKATEASSKAEE 639 Query: 351 AARSASAAKTSETNAKASETSAESSKTAAASSASSAASSASSASASKDEATRQASAAKSS 410 A+ +K E + KA+E A+ T A A+ A+S A A EA ++A+ A S Sbjct: 640 AS-----SKAEEADQKATE--ADQKATEADQKATEASSKAEEADQKATEADQKATEASSK 692 Query: 411 ATTASTKATEAAGSATAAAQ----SKSTAESAATRAETAAKRAEDIAS 454 A A KATEA+ A A Q + S AE A+++AE A+ +AE+ +S Sbjct: 693 AEEADQKATEASSKAEEADQKATEASSKAEEASSKAEEASSKAEEASS 740 Score = 72.0 bits (175), Expect = 1e-10, Method: Compositional matrix adjust. 
Identities = 103/290 (35%), Positives = 149/290 (51%), Gaps = 14/290 (4%) Query: 131 ASDASTSAREAATHAADAADSARAASTSAGQAASSAQSASSSAGTASTKATEASKSAAAA 190 A +AS+ A EA A +A A A A +A+S A+ A A A KATEAS A A Sbjct: 637 AEEASSKAEEADQKATEADQKATEADQKATEASSKAEEADQKATEADQKATEASSKAEEA 696 Query: 191 ESSKSAAATSAGAAKTSETNASASLQSAATSASTATTKASEAATSARDAAASKEAAKSSE 250 + + A++ A A T AS+ + A++ A A++KA EA++ A +A++ E A Sbjct: 697 DQKATEASSKAEEADQKATEASSKAEEASSKAEEASSKAEEASSKAEEASSKAEEADQKA 756 Query: 251 TNASSSASSAASSATAAGNSAKAAKTSETNARSSETAAGQSASAAAGSKTAAASSASAAS 310 T ASS A A+S A A A A + A S A Q A+ A+ A A+ AS Sbjct: 757 TEASSKAEEASSKAEEADQKATEASSKAEEASSKAEEADQKATEASSKAEEADQKATEAS 816 Query: 311 TSAGQASASATAAGKSAESAASSASTATTKAGEATEQASAAARSASAAKTSETNAKASET 370 + A +A AT A AE A+S A A++KA EA ++A+ A+ +K E ++KA E Sbjct: 817 SKAEEADQKATEASSKAEEASSKAEEASSKAEEADQKATEAS-----SKAEEASSKAEE- 870 Query: 371 SAESSKTAAASSASSAASSASSASASKDEATRQASAAKSSATTASTKATE 420 A A+ A+S A AS+ +EA ++A+ A AT AS+KA E Sbjct: 871 --------ADQKATEASSKAEEASSKAEEADQKATEADQKATEASSKAEE 912 Score = 68.6 bits (166), Expect = 1e-09, Method: Compositional matrix adjust. 
Identities = 112/302 (37%), Positives = 157/302 (51%), Gaps = 11/302 (3%) Query: 157 TSAGQAASSAQSASSSAGTASTKATEASKSAAAAESSKSAAATSAGAAKTSETNASASLQ 216 T A +A A ASS A A KATEAS A A+ + A++ A A T+AS+ + Sbjct: 397 TGATEADQKATEASSKAEEADQKATEASSKAEEADQKATDASSKAEEADQKATDASSKAE 456 Query: 217 SA---ATSAST----ATTKASEAATSARDAAASKEAAKSSETNASSSASSAASSATAAGN 269 A AT AS+ A KA+EA++ A +A++ E A T ASS A A+S A A + Sbjct: 457 EADQKATDASSKAEEADQKATEASSKAEEASSKAEEADQKATEASSKAEEASSKAEEASS 516 Query: 270 SAKAAKTSETNARSSETAAGQSASAAAGSKTAAASSASAASTSAGQASASATAAGKSAES 329 A+ A T A T A A A T A+S A AS+ A +A AT A + A Sbjct: 517 KAEEADQKATEADQKATEASSKAEEADQKATEASSKAEEASSKAEEADQKATEADQKATE 576 Query: 330 AASSASTATTKAGEATEQASAAARSASAA--KTSETNAKASETSAESSKTAAASSASSAA 387 A+S A A KA EA+ +A A+ A A K +E + KA+E S+++ + A A+ A+ Sbjct: 577 ASSKAEEADQKATEASSKAEEASSKAEEADQKATEADQKATEASSKAEE--ADQKATEAS 634 Query: 388 SSASSASASKDEATRQASAAKSSATTASTKATEAAGSATAAAQSKSTAESAATRAETAAK 447 S A AS+ +EA ++A+ A AT A KATEA+ A A Q + A+ AT A + A+ Sbjct: 635 SKAEEASSKAEEADQKATEADQKATEADQKATEASSKAEEADQKATEADQKATEASSKAE 694 Query: 448 RA 449 A Sbjct: 695 EA 696 Score = 58.5 bits (140), Expect = 1e-06, Method: Compositional matrix adjust. 
 Identities = 84/232 (36%), Positives = 121/232 (52%)

Query: 127 AKKSASDASTSAREAATHAADAADSARAASTSAGQAASSAQSASSSAGTASTKATEASKS 186
           A + A++A   A EA++ A +A   A  AS+ A +A   A  ASS A  AS+KA EAS
Sbjct: 675 ADQKATEADQKATEASSKAEEADQKATEASSKAEEADQKATEASSKAEEASSKAEEASSK 734

Query: 187 AAAAESSKSAAATSAGAAKTSETNASASLQSAATSASTATTKASEAATSARDAAASKEAA 246
           A  A S    A++ A  A    T AS+  + A++ A  A  KA+EA++ A +A++  E A
Sbjct: 735 AEEASSKAEEASSKAEEADQKATEASSKAEEASSKAEEADQKATEASSKAEEASSKAEEA 794

Query: 247 KSSETNASSSASSAASSATAAGNSAKAAKTSETNARSSETAAGQSASAAAGSKTAAASSA 306
               T ASS A  A   AT A + A+ A    T A S    A   A  A+     A   A
Sbjct: 795 DQKATEASSKAEEADQKATEASSKAEEADQKATEASSKAEEASSKAEEASSKAEEADQKA 854

Query: 307 SAASTSAGQASASATAAGKSAESAASSASTATTKAGEATEQASAAARSASAA 358
           + AS+ A +AS+ A  A + A  A+S A  A++KA EA ++A+ A + A+ A
Sbjct: 855 TEASSKAEEASSKAEEADQKATEASSKAEEASSKAEEADQKATEADQKATEA 906

>UniRef50_Q66BF2 Hypothetical phage protein n=1 Tax=Yersinia pseudotuberculosis RepID=Q66BF2_YERPS
          Length = 711

 Score = 75.5 bits (184), Expect = 1e-11, Method: Compositional matrix adjust.
Identities = 99/364 (27%), Positives = 160/364 (43%), Gaps = 23/364 (6%) Query: 1 MAVKISGVLKDGTGKPVQNCTIQLKAKRNSTTVVVNTLASENPDEAGRYSMDVEYGQYSV 60 M+V +SG++ + G+PV N I L A NS TV+ A+ D G Y + +E G YS+ Sbjct: 1 MSVTVSGIMINPVGEPVVNAQITLTAVTNSLTVLNAFSATVRTDGVGTYRIQLEEGSYSI 60 Query: 61 ILLVEGFPPSHAGTITVYEDSQPGTLNDFLG-AMTEDDARPEALRRFELMVEEVARNASA 119 + G + G +T+ + P TLN L + E + P+ + F + ++VA + + Sbjct: 61 TVAANGRSFVY-GAVTLDNTTGPSTLNQLLKQQIMESELTPDVILYFRQIQQQVANDLAT 119 Query: 120 VAQNTAAAKKSASDASTSAREAATHAADAADSARAASTSAGQAASSAQSASSSAGTASTK 179 + +A +A A S EA +A D +++ A QA SA +++ S A+ Sbjct: 120 IKVLEISATDAAESAGHSRDEAMLYAKDLSEALATAKGYRDQAGISADASALSQQEAAIS 179 Query: 180 ATEASKSAAAAESSKSAAAT-----------------SAGAAKTSETNASASLQSAATSA 222 T A SA +A S+ A + + A +T+E +++ A A Sbjct: 180 ETSAKASADSALLSEQNALSYRDSAQSAAATAADDASTLAAERTAE-KIKLQVKTDADRA 238 Query: 223 STATTKASEAATSARDAAASKEAAKSSETNASSSASSAASSATAAGNSAKAAKTSETNAR 282 A + + +S D A + T A+ +A + AT A NS A ++ A Sbjct: 239 EAARIASEQIKSSVDDTAQTVAQQHGETTQAAIAARDSEVKATTAANS--AVQSEALAAI 296 Query: 283 SSETAAGQSASAAAGSKTAAASSASAASTSAGQASASATAAGKSAESAASSASTATTKAG 342 S+ETA Q+A + K AA A A QA ASA + G + + AG Sbjct: 297 SAETAR-QNAGISTVDKNAAKGFRDEAEGFAQQAHASAESVGDVMPKTGGAFTGPVELAG 355 Query: 343 EATE 346 +ATE Sbjct: 356 DATE 359 >UniRef50_C9KJG4 Side tail fiber protein from lambdoid prophage Rac n=1 Tax=Mitsuokella multacida DSM 20544 RepID=C9KJG4_9FIRM Length = 932 Score = 72.4 bits (176), Expect = 1e-10, Method: Compositional matrix adjust. 
Identities = 134/389 (34%), Positives = 195/389 (50%), Gaps = 25/389 (6%) Query: 97 DARPEALRRFELMVEEVARNASAVAQNTAAAKKSASDA-STSAREAATHAADAADSARAA 155 D E LR + +++V + + A K+S +D +T +++ A A A A Sbjct: 40 DVIAEDLRWLKENIDDVKDTS-----DLEAIKQSVTDMYNTMKNDSSFGEATAKAQAEEA 94 Query: 156 STSAGQAASSAQSASSSAGTASTKATEASKSAAAAESSKSAAATSAGAAKTSETNASASL 215 A A SA +A + +TK+TE + + A +S A + KT E + S S Sbjct: 95 KKQAQAALESATNAKTYYDDITTKSTEVNNTIAEIKSYIEKAEALNESNKTLEQSISDSA 154 Query: 216 QSAATSASTATTKASEAATSARDAAASKEAAKSSETNASSSASSAASSATAAGNSAKAAK 275 A A +A + A+ AATS +A AS+ AK+SETNA S ++AA S + A A Sbjct: 155 TVATNKAKSAASSATNAATSETNAKASETKAKASETNAKVSETNAAKSESNAKAHMDATA 214 Query: 276 TSETNARSSETAAGQSASAAAGSKTAAASSASAASTSAGQASASATAAGKSAESAASSAS 335 TSE+NA++SET A S +A+ S+T A +S + A + +S SA A AES+ S S Sbjct: 215 TSESNAKTSETNAKASQAASKTSETNAKTSETNAKQYSINSSNSADLAKAWAESSDSPDS 274 Query: 336 TATTKAGEATEQAS------------AAARSASAAKTSETNAKASE-------TSAESSK 376 T + Q+S +A S + AKTSETNAK SE T++ SS Sbjct: 275 VNDTDSTTGKTQSSKTWAIYSKDRAISAFTSETHAKTSETNAKTSETNAANSATNSASSA 334 Query: 377 TAAASSASSAASSASSASASKDEATRQASAAKSSATTASTKATEAAGSATAAAQSKSTAE 436 TA+A+SA AA+SA++A S+ A AS AK+S T A T T A S T AA S+ + Sbjct: 335 TASANSAEEAATSATNAKTSETNAATSASNAKTSETNAKTSETNAKASETNAATSEGNTK 394 Query: 437 SAATRAETAAKRAEDIASAVALEDASTTK 465 +A+ A + A+ I S V + A K Sbjct: 395 GYMEKAQVAYESAKAIQSVVDVAKADAEK 423 Score = 64.7 bits (156), Expect = 2e-08, Method: Compositional matrix adjust. 
Identities = 113/323 (34%), Positives = 163/323 (50%), Gaps = 28/323 (8%) Query: 86 LNDFLGAMTEDDARPEALRRFELMVEEVARNASAVAQNTAAAKKSASDASTSAREAATHA 145 + D M D + EA + + EE + A A ++ AK D +T + E Sbjct: 69 VTDMYNTMKNDSSFGEATAKAQ--AEEAKKQAQAALESATNAKTYYDDITTKSTEVNNTI 126 Query: 146 ADAADSARAASTSAGQAASSAQSASSSAGTASTKATEASKSAAAAESSKSAAATSAGAAK 205 A+ A + QS S SA A+ KA A+ SA A +S++ A S AK Sbjct: 127 AEIKSYIEKAEALNESNKTLEQSISDSATVATNKAKSAASSATNAATSETNAKASETKAK 186 Query: 206 TSETNASASLQSAATSASTAT-------TKASEAATSARDAAASKEAAKSSETNASSSAS 258 SETNA S +AA S S A T S A TS +A AS+ A+K+SETNA +S + Sbjct: 187 ASETNAKVSETNAAKSESNAKAHMDATATSESNAKTSETNAKASQAASKTSETNAKTSET 246 Query: 259 SAASSATAAGNSAKAAK-----------TSETNARSSETAAGQS--------ASAAAGSK 299 +A + + NSA AK ++T++ + +T + ++ A +A S+ Sbjct: 247 NAKQYSINSSNSADLAKAWAESSDSPDSVNDTDSTTGKTQSSKTWAIYSKDRAISAFTSE 306 Query: 300 TAAASSASAASTSAGQASASATAAGKSAESAASSASTATTKAGEATEQASAAARSASAAK 359 T A +S + A TS A+ SAT + SA ++A+SA A T A A + AA SAS AK Sbjct: 307 THAKTSETNAKTSETNAANSATNSASSATASANSAEEAATSATNAKTSETNAATSASNAK 366 Query: 360 TSETNAKASETSAESSKTAAASS 382 TSETNAK SET+A++S+T AA+S Sbjct: 367 TSETNAKTSETNAKASETNAATS 389 >UniRef50_B3YHG3 Tail fiber protein n=2 Tax=Salmonella enterica subsp. enterica serovar Kentucky RepID=B3YHG3_SALET Length = 573 Score = 71.2 bits (173), Expect = 2e-10, Method: Compositional matrix adjust. 
Identities = 116/357 (32%), Positives = 165/357 (46%), Gaps = 37/357 (10%) Query: 1 MAVKISGVLKDGTGKPVQNCTIQLKAKRNSTTVVVNTLASENPDEAGRYSMDVEYGQYSV 60 M++ +SG+LK G + I L A S ++ AS + G Y M+V G YS+ Sbjct: 1 MSILVSGILKSPAGAIIAGAQITLTALTTSPDLLAGVSASAVTSDTGYYGMNVLPGVYSL 60 Query: 61 ILLVEGFPPSHAGTITVYEDSQPGTLNDFLGA-MTEDDARPEALRRFELMVEEVARNASA 119 + V G + G+ + TLN L + E E L F + VA + Sbjct: 61 TVAVNGKSQVY-GSFRLDGTETTVTLNMVLRRNLVEVSIPDELLVDFRQIQNNVADDLET 119 Query: 120 VAQNTAAAK-------KSASDASTSAREAATHAADAADSARAASTSAGQAASSAQSASSS 172 + Q A ++A+DA SA AA ADAADS + A Q A + Q A + Sbjct: 120 IRQLELRASGSADNAVRTAADAKASAESAARSEADAADSEKKAE----QFARNLQDAVAK 175 Query: 173 AGTASTKATEASKSAAAAESSKSAAATSAGAAKTSETNASASLQSAATSASTATTKASEA 232 A +++ +AA + + AT+A +A ++ A+ + A AS A Sbjct: 176 -------AGDSASAAALSAAGAGEQATAAKSAALEAADSKAATEKA----------ASNA 218 Query: 233 ATSARDAAASKEAAKSSETNASSSASSAASSATAAGNSAKAAKTSETNARSSETAAGQSA 292 A S ++AA S AA++SE +SAA SAT A S KAA E + + ET AGQSA Sbjct: 219 ALSEKNAADSALAARTSE-------NSAADSATKADASEKAAVLYEQTSSTHETNAGQSA 271 Query: 293 SAAAGSKTAAASSASAASTSAGQASASATAAGKSAESAASSASTATTKAGEATEQAS 349 + AA S T AA SA A SA +A+ AT A A +A +A+ ATT E Q S Sbjct: 272 ADAALSATKAADSALNAGKSATEAAGYATDAQTQAGNAKRAATDATTAKDEIVRQIS 328 Score = 45.1 bits (105), Expect = 0.016, Method: Compositional matrix adjust. 
Identities = 79/202 (39%), Positives = 103/202 (50%) Query: 244 EAAKSSETNASSSASSAASSATAAGNSAKAAKTSETNARSSETAAGQSASAAAGSKTAAA 303 E + E AS SA +A +A A SA++A SE +A SE A Q A + A Sbjct: 118 ETIRQLELRASGSADNAVRTAADAKASAESAARSEADAADSEKKAEQFARNLQDAVAKAG 177 Query: 304 SSASAASTSAGQASASATAAGKSAESAASSASTATTKAGEATEQASAAARSASAAKTSET 363 SASAA+ SA A ATAA +A AA S + A A AA SA AA+TSE Sbjct: 178 DSASAAALSAAGAGEQATAAKSAALEAADSKAATEKAASNAALSEKNAADSALAARTSEN 237 Query: 364 NAKASETSAESSKTAAASSASSAASSASSASASKDEATRQASAAKSSATTASTKATEAAG 423 +A S T A++S+ AA ++++ ++A S +A A+ A SA A ATEAAG Sbjct: 238 SAADSATKADASEKAAVLYEQTSSTHETNAGQSAADAALSATKAADSALNAGKSATEAAG 297 Query: 424 SATAAAQSKSTAESAATRAETA 445 AT A A+ AAT A TA Sbjct: 298 YATDAQTQAGNAKRAATDATTA 319 >UniRef50_C3R3S9 Predicted protein n=1 Tax=Bacteroides sp. 2_2_4 RepID=C3R3S9_9BACE Length = 1039 Score = 66.2 bits (160), Expect = 7e-09, Method: Compositional matrix adjust. Identities = 109/355 (30%), Positives = 164/355 (46%), Gaps = 39/355 (10%) Query: 56 GQYSVILLVEGFPPSHAGTITVYEDSQPGTLNDFLGAMTEDDARPEALRRFELMVEEVAR 115 G Y + ++G H+G E++ ND G +T+ + FE+ +++ Sbjct: 411 GLYGTNVYLKGTFVLHSGK--KIEEAIDDVKNDLNGRITDVETN------FEIREGQISS 462 Query: 116 NASAVAQNTAAAKKSASDAS---TSAREAATHAADAADSARAASTSAGQAASSA----QS 168 V + AK+S ++AS TSA +A +A+ +A A+ A+T+AG+ S Sbjct: 463 KIKEVNIAVSNAKQSETNASGSATSAGVSANNASKSATDAQGAATNAGKILEEVTLKESS 522 Query: 169 ASSSAGTASTKATEASKSAAAAESSKSAAATSAGAAKTSETNASASLQSAATSASTATTK 228 + +AG STK TE +K A T+ TNA S SA+ SA TA+ K Sbjct: 523 VTQTAGEISTKVTEVNKKVT--------------EANTAATNAKNSATSASGSAGTASGK 568 Query: 229 ASEAATSARDAAASKEAAKSSETNASSSASSAASSATAAGNSAKAAKTSETNARSSETAA 288 A EAA SA +A S + A + + SS + AGN + +E + E Sbjct: 569 AGEAANSAANAKQSADNAAKVLEDVTLKESSITQT---AGNI--TLQVTEVTKKVVEANT 623 Query: 289 GQSASAAAG-----SKTAAASSASAASTSAGQASASATAAGKSAESAASSASTATTKAGE 343 + ++ S T A +SAS AST AG+AS SAT A SA+SAA+ + + K Sbjct: 624 AATTASTKAAEASTSATNAKNSASTASTKAGEASTSATNAKNSADSAAAKLTVVSQKESS 683 Query: 344 
ATEQASAAARSASAAKTSETNAKASETSAESSKTAAASSASSAASSASSASASKD 398 + AS+ T A S T+A + AASSA++AA SA+ A A D Sbjct: 684 INQTASSITLQVKEVTTKANEAANSATTAATKAGEAASSATNAAKSATDAKALLD 738 Score = 61.2 bits (147), Expect = 2e-07, Method: Compositional matrix adjust. Identities = 80/246 (32%), Positives = 114/246 (46%), Gaps = 57/246 (23%) Query: 273 AAKTSETNARSSETAAGQSASAAAGSKTAAASSASAASTSAGQASASATAAGKSAESAAS 332 ++K E N S A QS + A+GS T+A SA+ AS SA A +AT AGK E Sbjct: 461 SSKIKEVNIAVSN--AKQSETNASGSATSAGVSANNASKSATDAQGAATNAGKILEEVTL 518 Query: 333 SASTATTKAGEATEQASAAARSASAAKTSETNAKASETSAESSKTAAASSASSAASSASS 392 S+ T AGE + + + + + A T+ TNAK S TSA S A+ A AA+SA++ Sbjct: 519 KESSVTQTAGEISTKVTEVNKKVTEANTAATNAKNSATSASGSAGTASGKAGEAANSAAN 578 Query: 393 ASASKD----------------------------EATRQ--------------------- 403 A S D E T++ Sbjct: 579 AKQSADNAAKVLEDVTLKESSITQTAGNITLQVTEVTKKVVEANTAATTASTKAAEASTS 638 Query: 404 ASAAKSSATTASTKATEAAGSATAAAQSKSTAESAATRAETAAKRAEDI---ASAVALED 460 A+ AK+SA+TASTKA EA+ SAT A K++A+SAA + +++ I AS++ L+ Sbjct: 639 ATNAKNSASTASTKAGEASTSATNA---KNSADSAAAKLTVVSQKESSINQTASSITLQV 695 Query: 461 ASTTKK 466 T K Sbjct: 696 KEVTTK 701 >UniRef50_C5DLU8 KLTH0G03696p n=1 Tax=Lachancea thermotolerans CBS 6340 RepID=C5DLU8_LACTC Length = 2085 Score = 65.1 bits (157), Expect = 2e-08, Method: Compositional matrix adjust. 
Identities = 95/318 (29%), Positives = 165/318 (51%), Gaps = 23/318 (7%) Query: 111 EEVARNASAVAQNTAAAKKSASDASTSAREAATHAADAADSARAASTSAGQAASSAQSAS 170 +E + A A AQ T ++ S+ +TS +E++T A +A S + TS+ +S Q +S Sbjct: 248 QESSTEAGASAQLTEESQTSSPTRTTSGQESSTEAGASAQSTEESQTSSPTRTTSGQESS 307 Query: 171 SSAGTASTKATEASKSAAAAESSKSAAATSAGAAKTSETNASASLQSAATSASTATTKAS 230 TEA SA + E S++++ + + S T A AS QS S +++ T+ + Sbjct: 308 ----------TEAGASAQSTEESQTSSPIGITSGQESSTEAGASAQSTEESQTSSPTRTT 357 Query: 231 EAATSARDAAASKEAAKSSETNASSSASSAASSATAAGNSAKAAKTSETNARSSETAAGQ 290 S+ +A AS ++ + S+T++ +S S+T AG SA++ + S+T++ T+ GQ Sbjct: 358 SGQESSTEAGASAQSTEESQTSSPIGITSGQESSTEAGASAQSTEESQTSSPIGITS-GQ 416 Query: 291 SASAAAG---SKTAAASSASAASTSAGQASASATAAGKSAESAASSASTATTKAGEATEQ 347 +S AG T + ++S T++GQ S+ T AG SA+S S +++ T+ E Sbjct: 417 ESSTEAGASAQSTEESQTSSPTRTTSGQESS--TEAGASAQSTEESQTSSPTRTTSGQES 474 Query: 348 ASAAARSASAAKTSETNAKASETSAESSKTAAASSASSAASSASSASASKDEATRQASAA 407 ++ A SA + + S+T++ TS + S T A + A S S +S+ TR S Sbjct: 475 STEAGASAQSTEESQTSSPIGITSGQESSTEAGARAQSTEESQTSS------PTRTTSGQ 528 Query: 408 KSSATTASTKATEAAGSA 425 +SS T A T + A G++ Sbjct: 529 ESS-TKAGTSSQSALGTS 545 Score = 56.2 bits (134), Expect = 8e-06, Method: Compositional matrix adjust. 
Identities = 77/259 (29%), Positives = 136/259 (52%), Gaps = 20/259 (7%) Query: 180 ATEASKSAAAAESSKSAAATSAGAAKTSETNASASLQSAATSASTATTKASEAATSARDA 239 +TEA SA E S++++ T + + S T A AS QS S +++ T+ + S+ +A Sbjct: 251 STEAGASAQLTEESQTSSPTRTTSGQESSTEAGASAQSTEESQTSSPTRTTSGQESSTEA 310 Query: 240 AASKEAAKSSETNASSSASSAASSATAAGNSAKAAKTSETNARSSETAAGQSASAAAG-- 297 AS ++ + S+T++ +S S+T AG SA++ + S+T++ + T +GQ +S AG Sbjct: 311 GASAQSTEESQTSSPIGITSGQESSTEAGASAQSTEESQTSS-PTRTTSGQESSTEAGAS 369 Query: 298 -SKTAAASSASAASTSAGQASASATAAGKSAESAASSASTATTKAGEATEQASAAARSAS 356 T + ++S ++GQ S+ T AG SA+S S +++ E ++ A SA Sbjct: 370 AQSTEESQTSSPIGITSGQESS--TEAGASAQSTEESQTSSPIGITSGQESSTEAGASAQ 427 Query: 357 AAKTSETNAKASETSAESSKTAAASSASSAASSASSASASKDEATRQASAAKSSATTAST 416 + + S+T++ TS + S T A +SA S S +S+ TR S +SS Sbjct: 428 STEESQTSSPTRTTSGQESSTEAGASAQSTEESQTSS------PTRTTSGQESS------ 475 Query: 417 KATEAAGSATAAAQSKSTA 435 TEA SA + +S++++ Sbjct: 476 --TEAGASAQSTEESQTSS 492 Score = 44.3 bits (103), Expect = 0.030, Method: Compositional matrix adjust. 
Identities = 96/339 (28%), Positives = 171/339 (50%), Gaps = 39/339 (11%) Query: 143 THAADAADSARAASTSAGQAASSAQSASSSAGTASTKATEASKSAAAAESSKSAAATSAG 202 T ADA+ S+R +TSA SS+ + S S T++T+++E + + + AESS Sbjct: 145 TRTADAS-SSRPDTTSA---LSSSPTKSISGDTSTTRSSETASAESGAESSP-------- 192 Query: 203 AAKTSETNASASLQSAA---TSASTATTKASEAATSARDAAASKEAAKSSETNASSSASS 259 +SE +S QS+ T ST T S A T S ++ + S+T++ +S Sbjct: 193 -LFSSEILSSMIPQSSGHINTQTSTQPTTRSSAETDV-----STQSTEESQTSSPIGITS 246 Query: 260 AASSATAAGNSAKAAKTSETNARSSETAAGQSASAAAG---SKTAAASSASAASTSAGQA 316 S+T AG SA+ + S+T++ + T +GQ +S AG T + ++S T++GQ Sbjct: 247 GQESSTEAGASAQLTEESQTSS-PTRTTSGQESSTEAGASAQSTEESQTSSPTRTTSGQE 305 Query: 317 SAS-ATAAGKSAESAASSASTATTKAGEATEQASAAARSASAAKT-----------SETN 364 S++ A A+ +S E + +S+ T E++ +A A+A+S ++T S T Sbjct: 306 SSTEAGASAQSTEESQTSSPIGITSGQESSTEAGASAQSTEESQTSSPTRTTSGQESSTE 365 Query: 365 AKASETSAESSKTAAASSASSAASSASSASASKDEATRQASAAKSSATTASTKATEAAGS 424 A AS S E S+T++ +S S++ A AS +++ T+ +TEA S Sbjct: 366 AGASAQSTEESQTSSPIGITSGQESSTEAGASAQSTEESQTSSPIGITSGQESSTEAGAS 425 Query: 425 ATAAAQSKSTAESAATRAETAAKRAEDIASAVALEDAST 463 A + +S++++ + T + ++ A ASA + E++ T Sbjct: 426 AQSTEESQTSSPTRTTSGQESSTEAG--ASAQSTEESQT 462 >UniRef50_D0Z3B3 Putative tail fiber protein n=1 Tax=Photobacterium damselae subsp. damselae CIP 102761 RepID=D0Z3B3_LISDA Length = 1008 Score = 65.1 bits (157), Expect = 2e-08, Method: Compositional matrix adjust. 
Identities = 114/380 (30%), Positives = 177/380 (46%), Gaps = 45/380 (11%) Query: 87 NDFLGAMTEDDARPEALR----RFELMVEE---VARNASAVAQNTAAAKKSASDASTSAR 139 +D +GA+ E E L+ +F+ ++E + + S +A+ K+ +AR Sbjct: 248 DDVMGALDETHQSLEWLKFMQVQFDTKLQEMTLIDEHLSILAREIEKNKQKVEQLRLNAR 307 Query: 140 EAATHAADAADSARAASTSAGQAASSAQSASSSAGTASTKATEASKSAAAAESS---KSA 196 E+A +AA +A A + A + A+S + A + +EA KS +A+ S K+ Sbjct: 308 ESAGNAATSALRAEHEANRA-EVAASIDIVREAEKQAKSSKSEADKSLSASLVSVNAKNV 366 Query: 197 AATSAGAAKTSETNASASLQSAATSASTATTKASEAATSARDAAASKEAAKSSETNASSS 256 A A AK SE +A+ S Q+A ++ TA KE AK S ASS+ Sbjct: 367 AVAKANEAKQSELSATTSAQNAEQNSLTA-----------------KEQAKLSTEKASSA 409 Query: 257 ASSAASSATA----------AGNSAKAAKTSETNARSSETAAGQSASAAAGSKTAAASSA 306 A SA +S ++ A ++ A K S A S AG+SA+ A AA+ A Sbjct: 410 AISAKNSKSSEKSALEAASEAALNSTATKQSANLASSHAVTAGESANTAEQKADDAANQA 469 Query: 307 SAASTSAGQASASATAAGKSAESAASSASTATTKAGEATEQASAAARSASAAKTSETNAK 366 S A+ AG A ++A A+ S AA+S A+ +A A+ A AA A A + Sbjct: 470 SIATQQAGIAKSNADASLNSQTLAANSVELASNQAKLASNSAKVAAEKAMIAIN-----Q 524 Query: 367 ASETSAESSKTAAASSASSAASSASSASASKDEATRQASAAKSSATTASTKATEAAGSAT 426 S E++K+ +A AA SA + S + +A A +A ++T A EA+ S Sbjct: 525 VSLAQQEAAKSRV--NAGGAAQSAKDSERSSQTSIAKADVAAKNAGLSATHAIEASQSFI 582 Query: 427 AAAQSKSTAESAATRAETAA 446 A ++ +A SAA RAETA Sbjct: 583 RALSAEESAISAAKRAETAV 602 Score = 49.3 bits (116), Expect = 0.001, Method: Compositional matrix adjust. 
Identities = 83/266 (31%), Positives = 137/266 (51%), Gaps = 12/266 (4%) Query: 121 AQNTAAAKKSASDASTSAREAATHAADAADS-ARAASTSAGQAASSAQSASSSAGTASTK 179 A+ A + KS +D S SA + +A + A + A A S A +SAQ+A ++ TA + Sbjct: 339 AEKQAKSSKSEADKSLSASLVSVNAKNVAVAKANEAKQSELSATTSAQNAEQNSLTAKEQ 398 Query: 180 A---TEASKSAA-AAESSKSAAATSAGAAKTSETNASASLQSAATSASTATT---KASEA 232 A TE + SAA +A++SKS+ ++ AA + N++A+ QSA ++S A T A+ A Sbjct: 399 AKLSTEKASSAAISAKNSKSSEKSALEAASEAALNSTATKQSANLASSHAVTAGESANTA 458 Query: 233 ATSARDAAASKEAAKSSETNASSSASSAASSATAAGNSAKAAKTSETNARSSETAAGQSA 292 A DAA A A S+A ++ +S T A NS + A A +S A + A Sbjct: 459 EQKADDAANQASIATQQAGIAKSNADASLNSQTLAANSVELASNQAKLASNSAKVAAEKA 518 Query: 293 SAAAGSKTAAASSASAASTSAGQASASATAAGKSAESAASSASTATTKAG----EATEQA 348 A + A A+ + +AG A+ SA + +S++++ + A A AG A E + Sbjct: 519 MIAINQVSLAQQEAAKSRVNAGGAAQSAKDSERSSQTSIAKADVAAKNAGLSATHAIEAS 578 Query: 349 SAAARSASAAKTSETNAKASETSAES 374 + R+ SA +++ + AK +ET+ S Sbjct: 579 QSFIRALSAEESAISAAKRAETAVAS 604 >UniRef50_B5YYN6 Tail fiber protein n=60 Tax=root RepID=B5YYN6_ECO5E Length = 645 Score = 61.6 bits (148), Expect = 2e-07, Method: Compositional matrix adjust. 
Identities = 133/495 (26%), Positives = 190/495 (38%), Gaps = 57/495 (11%) Query: 1 MAVKISGVLKDGTGKPVQNCTIQLKAKRNSTTVVVNTLASENPDEAGRYSMDVEYGQYSV 60 M+V +SG LK G+ + I L A S + T AS E G Y M ++ G+Y+V Sbjct: 1 MSVVVSGTLKSPDGEAISGANITLTALTVSPDALSGTSASAVTREGGYYGMTMDPGEYAV 60 Query: 61 ILLVEGFPPSHAGTITVYEDSQPGTLNDFL-GAMTEDDARPEALRRFELMVEEVARNASA 119 + V+G + G + + TLN L ++ E E L F ++ N Sbjct: 61 SVTVKGKTVVY-GRVRIEGTESTVTLNMLLRRSLVEVSIPGELLTDF----RQIQNN--- 112 Query: 120 VAQNTAAAKKSASDASTSAREAATHAADAADSARAASTSAGQAASSAQSASSSAGTASTK 179 VA + A ++ D +T T A + +SA A++ SA +A +A S ++ AG +T Sbjct: 113 VADDLATIRRLNEDTATK----NTQATQSKESAAASAKSASDSAKTATSRAAEAGQKATD 168 Query: 180 ATEASKSAAAAESSKSAAATSAGAAKTSETNASASLQSAATSASTATTKASEAATSARDA 239 ATEA+ + A T+AG A+ S T A S ++A A KA + A AR Sbjct: 169 ATEAA----------TRAVTAAGNAEESSTRAGESEKAAGADAE----KARQHAEKAR-- 212 Query: 240 AASKEAAKSSETNASSSASSAASSATAAGNSAKAAKTSETNARSSETAAGQSASAAAGSK 299 A SA A AA SA+ A+ NAR G++ G K Sbjct: 213 ------------LAQESAGEILKRAEAATVSAEEARRMAENARGPRGPQGET-----GPK 255 Query: 300 TAAASSASAASTSAGQASASATAAGKSAESAASSASTATTKAGEATEQASAAARSASAAK 359 A G+ + A A GE EQ + K Sbjct: 256 GDVGPKGETGPVG---PQGPAGPKGERGDVGAQGAVGPAGPRGEKGEQGERGPQGIPGLK 312 Query: 360 TSETNAKASETSAESSKTAAASSASSAASSASSASASKDEATRQASAAKSSATTASTKAT 419 +T + + + + EA Q + T Sbjct: 313 -GDTGERGPKGDQGDMGPKGEKGDPGGPAGPQGPKGERGEAGPQGPMG-ARGERGETGPR 370 Query: 420 EAAGSATAAAQSKSTAESAATRAETAAKRAEDIASAVALEDASTTKKGIVQLSSATNSTS 479 G A + T R E SA + DA+T +KGIVQLSSAT+S Sbjct: 371 GEPGPAGPRGERGETGPQGP-RGEPGP-----AGSAANVADATTAQKGIVQLSSATDSDD 424 Query: 480 ETLAATPKAVKSAYD 494 ET AATPKAVK+A D Sbjct: 425 ETKAATPKAVKAAMD 439 >UniRef50_Q0I4L1 Putative uncharacterized protein n=1 Tax=Haemophilus somnus 129PT RepID=Q0I4L1_HAES1 Length = 762 Score = 60.8 bits (146), Expect = 3e-07, Method: Compositional matrix adjust. 
Identities = 87/266 (32%), Positives = 131/266 (49%), Gaps = 28/266 (10%) Query: 237 RDAAASKEAAKSSETNASSSASSAASSATAAGNSAKAAKTSET----NARSSETAAGQSA 292 R A S E A+ ++T AS S A+ AT A A+ AKT T A+ ++T A QSA Sbjct: 104 RLAIQSNEQAQQAKTEASQS----ATQATKANRQAQQAKTEATEANRQAQQAKTEASQSA 159 Query: 293 SAAAGSKTAAASSASAASTSAGQASASATAAGKSAESAASSASTATTKAGEATEQASAAA 352 + A +KT A+ SA+ A+ + QA + T A +S A+ A A T+A ++ QA+ Sbjct: 160 TQAQQAKTEASQSATQATKANRQAQQAKTEASQS----ATQAQQAKTEASQSATQATEVN 215 Query: 353 RSASAAKTSETNAKASETSAESSKTAAASSASSAASSASSASASKDEATRQASAAKSSAT 412 R A AKT T A A+ +KT A ++ A SS + S +AT+ + AK A+ Sbjct: 216 RQAQQAKTEATEAN---RQAQQAKTKAENAERIATSSIKTVQQSAIQATQAENLAKKWAS 272 Query: 413 TASTKATEAAGSATAAAQSKSTAESAATRAETAAKRAEDIASAVALEDASTTKKGIVQLS 472 + + + K +A A AE A++ + A+A +D V +S Sbjct: 273 NPQNQIVQ---------EDKYSAYHYALEAEKYAEKVK--ATAEGRKDWQFIDN--VPIS 319 Query: 473 SATNSTSETLAATPKAVKSAYDNAEK 498 N +SET A+ +VK+AYD AEK Sbjct: 320 KDVNDSSETNLASAFSVKTAYDRAEK 345 Score = 58.5 bits (140), Expect = 1e-06, Method: Compositional matrix adjust. 
Identities = 95/301 (31%), Positives = 139/301 (46%), Gaps = 30/301 (9%) Query: 150 DSARAASTSAGQAASSAQSASSSAGTASTKATEASKSAAAAESSKSAAATSAGAAKTSET 209 + A+ A T A Q+A+ A A+ A A T+ATEA++ A A++ S +AT A AKT + Sbjct: 111 EQAQQAKTEASQSATQATKANRQAQQAKTEATEANRQAQQAKTEASQSATQAQQAKTEAS 170 Query: 210 NASASLQSAATSASTATTKASEAATSARDAAASKEAAKSSETNASSSASSAASSATAAGN 269 ++ A A A T+AS++AT A+ A + AS +A+ AT Sbjct: 171 QSATQATKANRQAQQAKTEASQSATQAQQ--------------AKTEASQSATQATEVNR 216 Query: 270 SAKAAKTSETNARSSETAAGQSASAAAGSKTAAASSASAASTSAGQASASATAAGKSA-- 327 A+ AKT T A A Q+ + A ++ A SS SA QA+ + A K A Sbjct: 217 QAQQAKTEATEANRQ---AQQAKTKAENAERIATSSIKTVQQSAIQATQAENLAKKWASN 273 Query: 328 ---ESAASSASTATTKAGEATEQASAAARSASAAKTSE--TNAKASETSAESSKT--AAA 380 + +A A EA + A +A K + N S+ +SS+T A+A Sbjct: 274 PQNQIVQEDKYSAYHYALEAEKYAEKVKATAEGRKDWQFIDNVPISKDVNDSSETNLASA 333 Query: 381 SSASSAASSASSASASKDEATRQASAAKSSATTASTKATEAAGSAT----AAAQSKSTAE 436 S +A A A +A RQA AK+ AT A+ +A +A AT A Q+K AE Sbjct: 334 FSVKTAYDRAEKAKTEATKANRQAQQAKTEATEANRQAQQAKTEATEAKRQAQQAKRLAE 393 Query: 437 S 437 S Sbjct: 394 S 394 Score = 50.4 bits (119), Expect = 4e-04, Method: Compositional matrix adjust. 
Identities = 87/298 (29%), Positives = 133/298 (44%), Gaps = 31/298 (10%) Query: 93 MTEDDARPEALRRFELMVEEVARNASAVAQNTAAAKKSASDASTSAREAATHAADAADSA 152 +T+++A E R + E A+ A A +A A+ A+ A++A T A +A A Sbjct: 92 VTDNNATTEQNTRLAIQSNEQAQQAKTEASQSAT---QATKANRQAQQAKTEATEANRQA 148 Query: 153 RAASTSAGQAASSAQSASSSAGTASTKATEASKSAAAAESSKSAAATSAGAAKTSETNAS 212 + A T A Q+A+ AQ A + A ++T+AT+A++ A A++ S +AT A AKT + ++ Sbjct: 149 QQAKTEASQSATQAQQAKTEASQSATQATKANRQAQQAKTEASQSATQAQQAKTEASQSA 208 Query: 213 ASLQSAATSASTATTKASEAATSARDAAASKEAAKSSETNASSSASSAASSATAAGNSAK 272 A A T+A+EA A+ A E A+ T++ + +A AT A N AK Sbjct: 209 TQATEVNRQAQQAKTEATEANRQAQQAKTKAENAERIATSSIKTVQQSAIQATQAENLAK 268 Query: 273 AAKTSETN-------------ARSSETAAGQSASAAAGSKTAAASSASAASTSAGQASAS 319 ++ N A +E A + + A G K S +S + Sbjct: 269 KWASNPQNQIVQEDKYSAYHYALEAEKYAEKVKATAEGRKDWQFIDNVPISKDVNDSSET 328 Query: 320 ATAAGKSAESA---ASSASTATTKAG--------EATEQASAAARSASAAKTSETNAK 366 A+ S ++A A A T TKA EATE R A AKT T AK Sbjct: 329 NLASAFSVKTAYDRAEKAKTEATKANRQAQQAKTEATEAN----RQAQQAKTEATEAK 382 >UniRef50_C4Y8G3 Putative uncharacterized protein n=1 Tax=Clavispora lusitaniae ATCC 42720 RepID=C4Y8G3_CLAL4 Length = 653 Score = 60.5 bits (145), Expect = 3e-07, Method: Compositional matrix adjust. 
Identities = 66/249 (26%), Positives = 136/249 (54%), Gaps = 4/249 (1%) Query: 156 STSAGQAASSAQSASSSAGTASTKATEASKSAAAAESSKSAAATSAGAAKTSETNASASL 215 S+ Q +S ++S + + ST AT + S +++ S ++ +T A ++K SET S Sbjct: 392 SSKPSQTSSKPNTSSKPSTSESTDATSSKPSETSSKPSTTSESTDATSSKPSETTTQPSQ 451 Query: 216 QSAATSASTATTKASEAATSARDAAASKEAAKSSETNASSSASSAASSATAAGNSAKAAK 275 S+ S +T+ + + S + S+ ++K S+T++ S +S+ S T++ S ++K Sbjct: 452 TSSKPSETTSQPSQTSSKPSETTSQPSQTSSKPSQTSSKPSETSSKPSETSSKPSETSSK 511 Query: 276 TSETNARSSETAAGQSASAAAGSKTAAASSASAASTSAGQASASATAAGKSAESAASSAS 335 SET+++ SET++ S +++ S+T++ S +++ S + S T++ S S S S Sbjct: 512 PSETSSKPSETSSKPSQTSSKPSETSSKPSETSSKPSETSSKPSETSSKPSQTS--SKPS 569 Query: 336 TATTKAGEATEQAS--AAARSASAAKTSETNAKASETSAESSKTAAASSASSAASSASSA 393 ++K E + + S ++ S +++K SET++K SET+++ T+ S +S S+ S Sbjct: 570 ETSSKPSETSSKPSQTSSKPSETSSKPSETSSKPSETTSQPYTTSQPSETTSQPSTTSQP 629 Query: 394 SASKDEATR 402 ++ T+ Sbjct: 630 DTTQPSTTK 638 Score = 55.1 bits (131), Expect = 2e-05, Method: Compositional matrix adjust. 
Identities = 87/308 (28%), Positives = 155/308 (50%), Gaps = 26/308 (8%) Query: 131 ASDASTSAREAATHAADAADSARAASTSAGQAASSAQSASSSAGTASTKATEASKSAAAA 190 S++ TS E ++ A S ST +A+ SS S+K +SK + Sbjct: 353 PSNSVTSTTEVPGFSSSAVPSKSTDSTDVSSSATQPSETSSKPSQTSSKPNTSSKPS--- 409 Query: 191 ESSKSAAATSAGAAKTSETNASASLQSAATSASTATTKASEAATSARDAAASKEAAKSSE 250 ++ +T A ++K SET++ S S +T A+++ K SE T S+ ++K SE Sbjct: 410 ----TSESTDATSSKPSETSSKPSTTSESTDATSS--KPSETTT-----QPSQTSSKPSE 458 Query: 251 TNASSSASSAASSATAAGNSAKAAKTSETNARSSETAAGQSASAAAGSKTAAASSASAAS 310 T + S +S+ S T + S ++K S+T+++ SET++ S +++ S+T++ S +++ Sbjct: 459 TTSQPSQTSSKPSETTSQPSQTSSKPSQTSSKPSETSSKPSETSSKPSETSSKPSETSSK 518 Query: 311 TSAGQASASATAAGKSAESAASSASTATTKAGEATEQASAAARSASAAKTSETNAKASET 370 S + S T++ S S S S ++K E +++ S +++K S+T++K SET Sbjct: 519 PSETSSKPSQTSSKPSETS--SKPSETSSKPSE-----TSSKPSETSSKPSQTSSKPSET 571 Query: 371 SAE----SSKTAAASSASSAASSASSASASKD-EATRQASAAKSSATTASTKATEAAGSA 425 S++ SSK + SS S SS S ++SK E T Q + T S +T + Sbjct: 572 SSKPSETSSKPSQTSSKPSETSSKPSETSSKPSETTSQPYTTSQPSETTSQPSTTSQPDT 631 Query: 426 TAAAQSKS 433 T + +KS Sbjct: 632 TQPSTTKS 639 Score = 54.7 bits (130), Expect = 2e-05, Method: Compositional matrix adjust. 
Identities = 73/272 (26%), Positives = 145/272 (53%), Gaps = 13/272 (4%) Query: 203 AAKTSETNASASLQSAATSASTATTKASEAATSARDAAASKEAAKSSETNASSSASSAAS 262 A + SET++ S S+ +T++ ++ +T A + S+ ++K S T+ S+ A+S+ Sbjct: 385 ATQPSETSSKPSQTSSK--PNTSSKPSTSESTDATSSKPSETSSKPSTTSESTDATSSKP 442 Query: 263 SATAAGNSAKAAKTSETNARSSETAAGQSASAAAGSKTAAASSASAASTSAGQASASATA 322 S T S ++K SET ++ S+T++ S + + S+T++ S +++ S + S T+ Sbjct: 443 SETTTQPSQTSSKPSETTSQPSQTSSKPSETTSQPSQTSSKPSQTSSKPSETSSKPSETS 502 Query: 323 AGKSAESAASSASTATTKAGEATEQAS--AAARSASAAKTSETNAKASETSAESSKTAAA 380 + S S S S ++K E + + S ++ S +++K SET++K SETS++ S+T++ Sbjct: 503 SKPSETS--SKPSETSSKPSETSSKPSQTSSKPSETSSKPSETSSKPSETSSKPSETSSK 560 Query: 381 SSASSAASSASSASASKDEATRQASAAKSSATTASTKATEAAGSATAAAQSKSTAESAAT 440 S +S+ S +S+ S+ + +++K S T++ T + S T +Q +T++ + T Sbjct: 561 PSQTSSKPSETSSKPSETSSKPSQTSSKPSETSSKPSETSSKPSET-TSQPYTTSQPSET 619 Query: 441 RAETAAKRAEDIASAVALEDASTTKKGIVQLS 472 ++ + D STTK G Q S Sbjct: 620 TSQPSTTSQPDTT------QPSTTKSGFPQPS 645 Score = 45.1 bits (105), Expect = 0.016, Method: Compositional matrix adjust. 
Identities = 54/219 (24%), Positives = 119/219 (54%), Gaps = 9/219 (4%) Query: 130 SASDASTSAREAATHAADAADSARAASTSAGQAASSAQSASSSAGTASTKATEASKSAAA 189 S+ ++TS AT + + + + + TS+ + +++Q + +S+ + T + + S+ Sbjct: 425 SSKPSTTSESTDATSSKPSETTTQPSQTSSKPSETTSQPSQTSSKPSETTSQPSQTSSKP 484 Query: 190 AESSKSAAATSAGAAKTSETNASASLQSAATSASTATTKASEAATSARDAAASKEAAKSS 249 +++S + TS+ ++TS + S + + TS+ + T + + TS++ S+ ++K S Sbjct: 485 SQTSSKPSETSSKPSETSSKPSETSSKPSETSSKPSETSSKPSQTSSK---PSETSSKPS 541 Query: 250 ETNASSSASSAASSATAAGNSAKAAKTSETNARSSETAAGQSASAAAGSKTAAASS--AS 307 ET++ S +S+ S T++ S ++K SET+++ SET++ S +++ S+T++ S +S Sbjct: 542 ETSSKPSETSSKPSETSSKPSQTSSKPSETSSKPSETSSKPSQTSSKPSETSSKPSETSS 601 Query: 308 AASTSAGQ----ASASATAAGKSAESAASSASTATTKAG 342 S + Q + S T + S S + +TTK+G Sbjct: 602 KPSETTSQPYTTSQPSETTSQPSTTSQPDTTQPSTTKSG 640 >UniRef50_C5BA56 Putative uncharacterized protein n=1 Tax=Edwardsiella ictaluri 93-146 RepID=C5BA56_EDWI9 Length = 743 Score = 60.1 bits (144), Expect = 4e-07, Method: Compositional matrix adjust. Identities = 65/204 (31%), Positives = 95/204 (46%), Gaps = 2/204 (0%) Query: 5 ISGVLKDGTGKPVQNCTIQLKAKRNSTTVVVNTLASENPDEAGRYSMDVEYGQYSVILLV 64 ISGVL D +GK V I L A NS V+ S E G+YS+ +E G YS+ + Sbjct: 2 ISGVLLDPSGKAVSGAQITLTAIANSMQVLRGFTCSVMTAENGQYSVRLEEGNYSISVAH 61 Query: 65 EGFPPSHAGTITVYEDSQPGTLNDFL-GAMTEDDARPEALRRFELMVEEVARNASAVAQN 123 +G + G +T+ EDS P +LN L + E + PE + F + +VA + + + Sbjct: 62 QGRNFVY-GAVTLTEDSAPSSLNALLHQQVMEQEVTPEVILYFRQIQHKVADDVVIMQRL 120 Query: 124 TAAAKKSASDASTSAREAATHAADAADSARAASTSAGQAASSAQSASSSAGTASTKATEA 183 + ++A A S R A AA S R A+ A +A+ A+ A TA A Sbjct: 121 QHDSSQAARAAQESQRHAQASKVAAAGSVRQAAAHRLAAGQAAEMAADYAQTAQDSQRHA 180 Query: 184 SKSAAAAESSKSAAATSAGAAKTS 207 +S AA S+ A AA+ S Sbjct: 181 QRSEMAAAESEQRTADHRLAAEQS 204 >UniRef50_B2I4I4 Putative phage tail protein n=1 Tax=Enterobacteria phage EPS7 RepID=B2I4I4_9CAUD Length = 807 Score = 59.7 bits (143), Expect = 6e-07, Method: Compositional matrix adjust. 
Identities = 142/398 (35%), Positives = 202/398 (50%), Gaps = 91/398 (22%) Query: 133 DASTSAREAATHAAD----AADSARAASTSAGQAASSAQSASSSAGT------ASTKATE 182 D+ +A+E+ T+A D AA A ++ TSA Q+A+SA A AG AS + E Sbjct: 60 DSEIAAKESETNAKDSENLAAIYANSSETSATQSAASATEAERQAGLSKDSADASATSAE 119 Query: 183 ASK--------SAAAAESSKSAAATSAGAAKTSETNASASLQSAATSASTAT-------- 226 SK +A AE S+ A + A+ ++T A+ S Q+AATSA+ +T Sbjct: 120 ESKGFRDSAELAAQNAEQSRLLAEQAKKDAEAAKTAAATSEQNAATSATESTNQAIAAAG 179 Query: 227 --TKASEAATSARDAAASKEAAKSSETNASSSA--------------------------- 257 T+A E AT+A+D S+ AAK+SE NA +S Sbjct: 180 SATEAGEYATTAKD---SEIAAKTSELNAKNSENESAISAEASEASASQSAISASQSAAS 236 Query: 258 ---------------SSAASSATAAGNSAKAAKTSETNARSSETAAGQSASAAAGSKTAA 302 ++A S+ AA S AKTSETNA++SET A A+AA S+T A Sbjct: 237 ATKAAESSAAAKISETTAIESSAAAKTSEINAKTSETNAKTSETNAAAYAAAAKTSETNA 296 Query: 303 ASSASAASTSAGQASASATAAGKSAESAASSASTATTKAGEATEQASAAARSASAAKTSE 362 A SA++AS S G AE+ A+ AST+ A AA S + KTSE Sbjct: 297 ADSAASASDSKGFRD--------EAEAFAAQASTS----------ALAAKNSETNTKTSE 338 Query: 363 TNAKASETSAESSKTAAASSASSAASSASSASASKDEATRQASAAKSSATTASTKATEAA 422 N+KASE +A+ ++ +A+ SA++A + ++ +DEA + A ++ATTA+ KA EAA Sbjct: 339 INSKASEDAAKLAQQSASGSANTATQAMTTTKGYRDEAEVFKNTATTAATTATDKALEAA 398 Query: 423 GSATAAAQSKSTAESAATRAETAAKRAEDIASAVALED 460 GSAT A + + A SAA RAETAA AE + A +D Sbjct: 399 GSATIAGEKATNATSAADRAETAAASAEQVMQASLKKD 436 >UniRef50_A9WC61 Autotransporter-associated beta strand repeat protein n=2 Tax=Chloroflexus RepID=A9WC61_CHLAA Length = 1320 Score = 58.9 bits (141), Expect = 1e-06, Method: Compositional matrix adjust. 
Identities = 109/397 (27%), Positives = 192/397 (48%), Gaps = 15/397 (3%) Query: 117 ASAVAQNTAAAKKSASDASTSAREAATHAADAADSARAASTSAGQAASSAQSASSSAGTA 176 AS + TA+A + + T A+T + +A ++ S SA + + + SA++SA Sbjct: 626 ASVTPEPTASATPEPTASVTPEPTASTTPSPSATASVTPSPSATASVTPSPSATASATPE 685 Query: 177 STKATEASKSAAAAESSKSAAATSAGAAKTSETNASASLQSAATSASTATTKASEAATSA 236 T +T S SA A+ + + A+T+ + T+ T S S ++AT TA+T S +AT++ Sbjct: 686 PTASTTPSPSATASTTPEPTASTTPSPSATASTTPSPSATASATPEPTASTTPSPSATAS 745 Query: 237 RDAAASKEAA--KSSETNASSSASSAASSATAAGNSAKAAKTSETNARSSETA---AGQS 291 + S A+ S AS + S +A+++ SA A+ T E A ++ + A + Sbjct: 746 VTPSPSATASTTPSPSATASVTPSPSATASMTPSPSATASATPEPTASTTPSPSATASVT 805 Query: 292 ASAAAGSKTAAASSASAASTSAGQASASATAAGKSAESAASSASTATTKAGEATEQASA- 350 S +A + T + SA+A++T + A+ASAT + + S + +TT + AT A+ Sbjct: 806 PSPSATASTTPSPSATASTTPSPSATASATPSPSATASVTPEPTASTTPSPSATASATPE 865 Query: 351 ------AARSASAAKTSETNAKASETSAESSKTAAASSASSAASSASSASASKDEATRQA 404 + SA+A+ T +A AS T + S+ + S S AS+ S SA+ + Sbjct: 866 PTASVTPSPSATASVTPSPSATASVTPSPSATASTTPSPSVTASTTPSPSATASTTPSPS 925 Query: 405 SAAKSSATTASTKATEAAGSATAA--AQSKSTAESAATRAETAAKRAEDIASAVALEDAS 462 + A ++ + ++T +T + SATA+ +TA + + + TA+ E AS A+ Sbjct: 926 ATASTTPSPSATASTTPSPSATASTTPSPSATASTTPSPSATASATPEPTASVTPSPSAT 985 Query: 463 -TTKKGIVQLSSATNSTSETLAATPKAVKSAYDNAEK 498 +T +S T S S T + TP +A E Sbjct: 986 ASTTPSPSATASVTPSPSATASVTPSPSATASTTPEP 1022 Score = 55.8 bits (133), Expect = 1e-05, Method: Compositional matrix adjust. 
Identities = 99/354 (27%), Positives = 180/354 (50%), Gaps = 22/354 (6%) Query: 109 MVEEVARNASAVAQNTAAAKKSASDASTSAREAATHAADAADSARAASTSAGQAASSAQS 168 M + ASA + T AST+ +AT + + SA AST+ +A+++ + Sbjct: 776 MTPSPSATASATPEPT---------ASTTPSPSATASVTPSPSAT-ASTTPSPSATASTT 825 Query: 169 ASSSAGTASTKATEASKSAAAAESSKSAAATSAGAAKTSETNASASLQSAATSASTATTK 228 S SA ++T + A+ S ++ + + SA A+ T E AS + +AT++ T + Sbjct: 826 PSPSATASATPSPSATASVTPEPTASTTPSPSATASATPEPTASVTPSPSATASVTPSPS 885 Query: 229 ASEAATSARDAAASKEAAKSSETNASSSASSAASSATAAGNSAKAAKTSETNARSSETAA 288 A+ + T + A AS + S AS++ S +A+++T SA A+ T +A +S T Sbjct: 886 ATASVTPSPSATASTTPSPS--VTASTTPSPSATASTTPSPSATASTTPSPSATASTT-- 941 Query: 289 GQSASAAAGSKTAAASSASAASTSAGQASASATAAGKSAESAASSASTATTKAGEATEQA 348 S +A + T + SA+A++T + A+ASAT ++ + + SA+ +TT + AT Sbjct: 942 ---PSPSATASTTPSPSATASTTPSPSATASATPEPTASVTPSPSATASTTPSPSATASV 998 Query: 349 SAAARSASAAKTSETNAKASETSAESSKTAAASSASSAASSASSASASKDEATRQASAAK 408 + + SA+A+ T +A AS T ++ T + SA+++ + + SA+AS T + +A Sbjct: 999 T-PSPSATASVTPSPSATASTTPEPTASTTPSPSATASTTPSPSATAS----TTPSPSAT 1053 Query: 409 SSATTASTKATEAAGSATAAAQSKSTAESAATRAETAAKRAEDIASAVALEDAS 462 +S T T + + SATA+ +A ++ T TA+ A+A A + + Sbjct: 1054 ASTTPEPTASVTPSPSATASVTPSPSATASTTPEPTASTTPSPSATASATPEPT 1107 Score = 51.6 bits (122), Expect = 2e-04, Method: Compositional matrix adjust. 
Identities = 109/387 (28%), Positives = 193/387 (49%), Gaps = 11/387 (2%) Query: 117 ASAVAQNTAAAKKSASDASTSAREAATHAADAADSARAASTSAGQAASSAQSASSSAGTA 176 ASA + TA+ S S A+ S T + + SA A++T + A +SA +++ T Sbjct: 680 ASATPEPTASTTPSPS-ATASTTPEPTASTTPSPSATASTTPSPSATASATPEPTASTTP 738 Query: 177 STKATEASKSAAAAESSKSAAATSAGAAKTSETNASASLQSAATSASTATTKASEAATSA 236 S AT AS + + + ++ + + SA A+ T +A+AS+ + ++ ++AT + + + T + Sbjct: 739 SPSAT-ASVTPSPSATASTTPSPSATASVTPSPSATASMTPSPSATASATPEPTASTTPS 797 Query: 237 RDAAASKEAAKSSETNASSSASSAASSATAAGNSAKAAKTSETNARSSETAAGQSASAAA 296 A AS + S+ AS++ S +A+++T SA A+ T +A +S T +++ + Sbjct: 798 PSATASVTPSPSA--TASTTPSPSATASTTPSPSATASATPSPSATASVTPEPTASTTPS 855 Query: 297 GSKTAAASSASAAS-TSAGQASASATAAGKSAESAASSAS-TATTKAGEATEQASAAARS 354 S TA+A+ AS T + A+AS T + + S S S TA+T + ++ + S Sbjct: 856 PSATASATPEPTASVTPSPSATASVTPSPSATASVTPSPSATASTTPSPSVTASTTPSPS 915 Query: 355 ASAAKTSETNAKASETSAESSKTAAASSASSAASSASSASASKDEATRQASAAKSSATTA 414 A+A+ T +A AS T + S+ + S S+ AS+ S SA+ +T + +A +SAT Sbjct: 916 ATASTTPSPSATASTTPSPSATASTTPSPSATASTTPSPSATA--STTPSPSATASATPE 973 Query: 415 STKATEAAGSATAAAQSKSTAESAATRAETAAKRAEDIASAVA---LEDASTTKKGIVQL 471 T + + SATA+ +A ++ T + +A SA A E ++T Sbjct: 974 PTASVTPSPSATASTTPSPSATASVTPSPSATASVTPSPSATASTTPEPTASTTPSPSAT 1033 Query: 472 SSATNSTSETLAATPKAVKSAYDNAEK 498 +S T S S T + TP +A E Sbjct: 1034 ASTTPSPSATASTTPSPSATASTTPEP 1060 Score = 50.1 bits (118), Expect = 6e-04, Method: Compositional matrix adjust. 
Identities = 102/395 (25%), Positives = 190/395 (48%), Gaps = 28/395 (7%) Query: 117 ASAVAQNTAAAKKSASDASTSAREAATHAADAADSARAASTSAGQAASSAQSASSSAGTA 176 ASA + TA+ + AST+ +AT + + SA A+ T + A +SA +++ T Sbjct: 634 ASATPEPTASVTPEPT-ASTTPSPSATASVTPSPSATASVTPSPSATASATPEPTASTTP 692 Query: 177 STKATEASKSAAAAESSKSAAATSAGAAKTSETNASASLQSAATSAS------------- 223 S AT ++ A ++ S +AT++ S T ++ +A+T+ S Sbjct: 693 SPSATASTTPEPTASTTPSPSATASTTPSPSATASATPEPTASTTPSPSATASVTPSPSA 752 Query: 224 ----------TATTKASEAATSARDAAASKEAAKSSETNASSSASSAASSATAAGNSAKA 273 TA+ S +AT++ + S A+ + E AS++ S +A+++ SA A Sbjct: 753 TASTTPSPSATASVTPSPSATASMTPSPSATASATPEPTASTTPSPSATASVTPSPSATA 812 Query: 274 AKTSETNARSSETAAGQSASAAAGSKTAAAS---SASAASTSAGQASASATAAGKSAESA 330 + T +A +S T + + ++A S +A AS +A++T + A+ASAT ++ + Sbjct: 813 STTPSPSATASTTPSPSATASATPSPSATASVTPEPTASTTPSPSATASATPEPTASVTP 872 Query: 331 ASSASTATTKAGEATEQASAAARSASAAKTSETNAKASETSAESSKTAAASSASSAASSA 390 + SA+ + T + AT + + SA+A+ T + AS T + S+ + S S+ AS+ Sbjct: 873 SPSATASVTPSPSATASVT-PSPSATASTTPSPSVTASTTPSPSATASTTPSPSATASTT 931 Query: 391 SSASASKDEATRQASAAKSSATTASTKATEAAGSATAAAQSKSTAESAATRAETAAKRAE 450 S SA+ ++ A ++ + ++T +T + SATA+A + TA + + TA+ Sbjct: 932 PSPSATASTTPSPSATASTTPSPSATASTTPSPSATASATPEPTASVTPSPSATASTTPS 991 Query: 451 DIASAVALEDASTTKKGIVQLSSATNSTSETLAAT 485 A+A S T S+ ++T E A+T Sbjct: 992 PSATASVTPSPSATASVTPSPSATASTTPEPTAST 1026 >UniRef50_C3YC67 Putative uncharacterized protein n=2 Tax=Branchiostoma floridae RepID=C3YC67_BRAFL Length = 1134 Score = 57.0 bits (136), Expect = 4e-06, Method: Compositional matrix adjust. 
Identities = 81/269 (30%), Positives = 143/269 (53%), Gaps = 16/269 (5%) Query: 179 KATEASKSAAAAESSK--SAAATSAGAAKTSETNASASLQSAATSASTATTKASEAATSA 236 +ATEAS + A + + A+A S A+ + T ASA +A+T T + S A++ Sbjct: 46 QATEASAESTTASTPQVTEASAESTTASTSQVTEASAESTTASTPQVTEASAESTTASTP 105 Query: 237 RDAAASKEAAKSSETNASSSASSAASSATAAGNSAKAAKTSETNARSSETAAGQSASAAA 296 R AS E+ S + +++ + +++T+ A A T+ + + +E +A + ++ Sbjct: 106 RATEASAESTTVSTPQVTEASAESTTASTSQATEASAESTTASTPQVTEASAESTTASTP 165 Query: 297 GSKTAAASSASAASTSAGQASASATAAGK------SAESAASSASTATTKAGEATEQASA 350 + A+A S +A++ A +ASA +T A SAES +S T + E+T ++ Sbjct: 166 QATEASAESTTASTPQATEASAESTTASTPQVTEASAESTTASTPQVTEASAESTTASTP 225 Query: 351 AARSASAAKTSETNAKASETSAESSKTAAASSASSAASSASSASASKDEATRQASAAKSS 410 A ASA T+ + + +E SAES+ TA+ A+ A++ +++AS T Q + A + Sbjct: 226 QATEASAESTTASTPQVTEASAEST-TASTPRATEASAESTTAS------TPQVTEASAE 278 Query: 411 ATTAST-KATEAAGSATAAAQSKSTAESA 438 +TTAST +ATEA+ +T A+ + T SA Sbjct: 279 STTASTSQATEASAESTTASTPQVTEASA 307 Score = 46.6 bits (109), Expect = 0.005, Method: Compositional matrix adjust. 
Identities = 81/264 (30%), Positives = 139/264 (52%), Gaps = 13/264 (4%) Query: 152 ARAASTSAGQAASSAQSASSSAGTAST-KATEASKSAAAAESSKSAAATSAGAAKTSETN 210 A A ST+A + S AS+ + TAST + TEAS + A + + AT A A T+ + Sbjct: 65 ASAESTTA--STSQVTEASAESTTASTPQVTEASAESTTASTPR---ATEASAESTTVST 119 Query: 211 ASASLQSAATSASTATTKASEAATSARDAAASKEAAKSSETNASSSASSAASSATAAGNS 270 + +++A S + +T++A+EA SA AS + ++++++ A+ A+A + Sbjct: 120 PQVT-EASAESTTASTSQATEA--SAESTTASTPQVTEASAESTTASTPQATEASAESTT 176 Query: 271 AKAAKTSETNARSSETAAGQSASAAAGSKTAAASSASAASTSAGQASASATAAGKSAESA 330 A + +E +A S+ + Q A+A S TA+ + AS + AS + A SAES Sbjct: 177 ASTPQATEASAESTTASTPQVTEASAESTTASTPQVTEASAESTTAS-TPQATEASAEST 235 Query: 331 ASSASTATTKAGEATEQASAAARSASAAKTSETNAKASETSAESSKTAAASSASSAASSA 390 +S T + E+T ++ A ASA T+ + + +E SAES+ TA+ S A+ A SA Sbjct: 236 TASTPQVTEASAESTTASTPRATEASAESTTASTPQVTEASAEST-TASTSQATEA--SA 292 Query: 391 SSASASKDEATRQASAAKSSATTA 414 S +AS + T ++ + +S T+ Sbjct: 293 ESTTASTPQVTEASAESHNSVYTS 316 Score = 44.3 bits (103), Expect = 0.026, Method: Compositional matrix adjust. 
Identities = 79/270 (29%), Positives = 141/270 (52%), Gaps = 23/270 (8%) Query: 122 QNTAAAKKSASDASTSAREAATHAADAADSARAASTSAGQAASSAQ--SASSSAGTAST- 178 Q T A+ +S + ++ EA+ + A+ S +++ AS+ Q AS+ + TAST Sbjct: 46 QATEASAESTTASTPQVTEASAESTTASTSQVTEASAESTTASTPQVTEASAESTTASTP 105 Query: 179 KATEASKSAAAAESSKSAAATSAGAAKTSETNASASLQSAATSASTATTKASEAATSARD 238 +ATEAS A ++ S + +A+++ + S + +++A S + +T + +EA SA Sbjct: 106 RATEAS----AESTTVSTPQVTEASAESTTASTSQATEASAESTTASTPQVTEA--SAES 159 Query: 239 AAASKEAAKSSETNASSSASSAASSATAAGNSAKAAKTSETNARSSETAAGQSASAAAGS 298 AS A + ++++++ A+ A+A +A + +E +A S+ + Q A+A S Sbjct: 160 TTASTPQATEASAESTTASTPQATEASAESTTASTPQVTEASAESTTASTPQVTEASAES 219 Query: 299 KTAA---ASSASAASTSAG-----QASASATAAGK------SAESAASSASTATTKAGEA 344 TA+ A+ ASA ST+A +ASA +T A SAES +S T + E+ Sbjct: 220 TTASTPQATEASAESTTASTPQVTEASAESTTASTPRATEASAESTTASTPQVTEASAES 279 Query: 345 TEQASAAARSASAAKTSETNAKASETSAES 374 T +++ A ASA T+ + + +E SAES Sbjct: 280 TTASTSQATEASAESTTASTPQVTEASAES 309 >UniRef50_A9ITY7 Putative uncharacterized protein n=2 Tax=Bartonella RepID=A9ITY7_BART1 Length = 1077 Score = 53.9 bits (128), Expect = 4e-05, Method: Compositional matrix adjust. 
Identities = 122/429 (28%), Positives = 192/429 (44%), Gaps = 91/429 (21%) Query: 102 ALRRFELMVEEVARNASAVAQNTAAAKKSASDASTSAREAATHAADAADSARAASTSAGQ 161 AL+RFE V++V+ NA ++ ++A A E+ T A A +AR AS +A + Sbjct: 153 ALQRFE-EVKQVSENAVNIS----------TEAKRLADESKTIATRAEQTAREASQTATE 201 Query: 162 AASSAQSASSSAGTASTKATEASKSAAAAESSKSAAATSAGAAKTSETNASASLQSAATS 221 A A ++ T AT+AS A A+ + A A AK ++S+ + Sbjct: 202 TTQVAAKAVATCHEVKTVATQASLKADGAKQTADDAKDIAEKAKELSEGTTSSITELTKT 261 Query: 222 ASTATTKASEAATSARDAAASKEAAKSSETNASSSASSAASSATAAGNSAKAAKTSETNA 281 S T +A T RDA K+ A++S+T + + +AA + TA+ ++ ++T+ + + Sbjct: 262 TSQVQTAVEKALTDLRDA---KQIAEASKTLSEEAKQTAADALTASKDAKSQSETAVSRS 318 Query: 282 RSSETAAGQSASAAAGSKTAAASS---ASAASTSAGQASASATAAGKSAESAASSASTAT 338 ++ + QS A K + AS A AA T QAS +A+ A AE+A S+A +AT Sbjct: 319 EEAKALSEQSKGACDEFKASVASVEKVAEAAKTGVEQASQTASEAKGIAETAKSTADSAT 378 Query: 339 TKAGEAT-----------------EQASAAARSAS--------------AAKTSE----- 362 KA +A EQA A R A KT E Sbjct: 379 AKAEQAQQEASEASRLASEAKVVAEQALQADRQAVREGSESTKSLVEAVQKKTEEAERVA 438 Query: 363 --------------TNAKASETSAESSKTAAASSASSA--------------------AS 388 T AK + T+A + TAA AS A AS Sbjct: 439 QDSKRVCEETKQLATEAKNASTNALTEATAAKEKASRALTTVNDVKNISEEVKGLAEKAS 498 Query: 389 SASS-ASASKDEATRQASAAKSSATTASTKATEAAGSATAAAQSKSTAESAATRAETAAK 447 AS+ A + D+A R+A++AK++A AS+ AT+A GSA Q+++ +E A T A+T+ Sbjct: 499 RASTEAQKTSDQALREATSAKTTADMASSTATDAKGSAE---QAQTVSEEAKTLAQTSKN 555 Query: 448 RAEDIASAV 456 ++I + Sbjct: 556 ACDEIKQTI 564 Score = 43.9 bits (102), Expect = 0.038, Method: Compositional matrix adjust. 
Identities = 105/352 (29%), Positives = 151/352 (42%), Gaps = 55/352 (15%) Query: 126 AAKKSASDASTSAREA---ATHAADAADSARAASTSAGQAASSAQSASSSAGTASTKATE 182 AAK AS +A EA A A ADSA A + A Q AS A +S A + +A + Sbjct: 348 AAKTGVEQASQTASEAKGIAETAKSTADSATAKAEQAQQEASEASRLASEAKVVAEQALQ 407 Query: 183 ASKSAAAAESSKSAAATSAGAAKTSE-------------------TNASASLQSAATSAS 223 A + A S + + A KT E T A + +A T A+ Sbjct: 408 ADRQAVREGSESTKSLVEAVQKKTEEAERVAQDSKRVCEETKQLATEAKNASTNALTEAT 467 Query: 224 TATTKASEAATSARDAAASKEAAK-------SSETNASSSASSAASSATAAGNSAKAAKT 276 A KAS A T+ D E K + T A ++ A AT+A +A A + Sbjct: 468 AAKEKASRALTTVNDVKNISEEVKGLAEKASRASTEAQKTSDQALREATSAKTTADMASS 527 Query: 277 SETNARSSETAAGQSASAAAGSKTAAASSASAA----------STSAGQASASATAAGKS 326 + T+A+ S A Q+ + + +KT A +S +A + A A ++AT A + Sbjct: 528 TATDAKGS---AEQAQTVSEEAKTLAQTSKNACDEIKQTIGDVKSVAENALSTATTAKQK 584 Query: 327 AESAASSASTATTKAGEATEQASAAARSASAAKTSETNAKASETSAESSKTAAASS---A 383 + + S + TK+GEA A A R AS S+ +AE +K AAS A Sbjct: 585 GDEISQQISESFTKSGEAKTLAEEAKRLAS----------TSQETAEEAKVKAASVERIA 634 Query: 384 SSAASSASSASASKDEATRQASAAKSSATTASTKATEAAGSATAAAQSKSTA 435 + A +ASS+ + +EA +AS AKS A A A A A A + TA Sbjct: 635 TEANQTASSSKSVSEEAKEEASKAKSIALEAKNTADSATAKAEQAKEETETA 686 >UniRef50_Q0I488 Putative uncharacterized protein n=1 Tax=Haemophilus somnus 129PT RepID=Q0I488_HAES1 Length = 2906 Score = 53.5 bits (127), Expect = 5e-05, Method: Compositional matrix adjust. 
Identities = 111/341 (32%), Positives = 153/341 (44%), Gaps = 40/341 (11%) Query: 134 ASTSAREAATHAADAADSARAASTSAGQAASSAQSASSSAGTASTKATEASKSAAAAESS 193 A T A E AT A A A AA A QA + A A + A A KA +A A A+ Sbjct: 1911 AKTKAEEFATKAEQAKGEAEAAKLGAEQAQTVAVDAKNKALEAQGKAEQAQNKAEEAQGK 1970 Query: 194 KSAAATSAGAAKTSETNASASLQSAATSASTATTKASEAATSARDAAASKEAAKSSETNA 253 AA A AA+ A ++A A A KA +A + A A AAK A Sbjct: 1971 AEAAKDEAVAAQQGAVTAKNQAETARDGAVDAKNKAEQAKSQAETFATQANAAKQDAVTA 2030 Query: 254 SSSASSAASSATAAGNSAKAAKTSETNARS-SETAAGQSASAAAGSKTAAASSASAASTS 312 + A SA + A AA +A AK NA++ +ET A Q+ +A G A A Sbjct: 2031 KNQAESARNEANAAKTAALDAKQGAENAKNQAETFATQANTAKQG--------ALEAKDK 2082 Query: 313 AGQASASATAAGKSAESAASSASTATTKAGEATEQASAAARSASAAK------------- 359 A QA A A AA A++A + A A KA EA +A AA +A + Sbjct: 2083 AEQAKADAVAAKLGADAAQTLAVNAKDKALEAQGKAEAAQAAAQNSASQAQTAQNKAEQA 2142 Query: 360 ------------TSETNAKASETSAESSKTAAASSASSAASSASSASASK---DEATRQA 404 T++T A+A+ A ++K A + ++A + + A A+K ++A QA Sbjct: 2143 QAAAVAAQQGADTAKTQAEAARNEAVTAKNQAEDAKTAALEAQNKAEAAKLGAEQAKAQA 2202 Query: 405 SAAKSSATTASTKATEAAGSATAAAQSKSTAESAATRAETA 445 AK+ A +A EA + TAA S AE+A T+AETA Sbjct: 2203 DVAKNQAESAR---DEAVAAQTAAQGLASQAEAAKTQAETA 2240 Score = 52.4 bits (124), Expect = 9e-05, Method: Compositional matrix adjust. 
Identities = 95/292 (32%), Positives = 129/292 (44%), Gaps = 20/292 (6%) Query: 133 DASTSAREAATHAADAADSARAASTSAGQAASSAQSASSSAGTASTKATEASKSAAAAES 192 +A A +A + A A A AA +A +A + A +A A A +A A A AA+ Sbjct: 1833 EAKNKAEQAKSQAETFATQANAAKDAALEAQAGANNAKQEAEAARYEAVMAKDDAVAAKR 1892 Query: 193 SKSAAATSAGAAKTSETNASASLQSAATSASTATTKASEAATSARDAAASKEAAKSSETN 252 AA T+A Q +A+ A A TKA E AT A A EAAK Sbjct: 1893 GAEAAQTAA--------------QGSASQAEAAKTKAEEFATKAEQAKGEAEAAKLGAEQ 1938 Query: 253 ASSSASSAASSATAAGNSAKAAKTSETNARSSETAAGQSASAAAGSKTAAASSASAASTS 312 A + A A + A A A+ A+ A+ AA A AA A + A A Sbjct: 1939 AQTVAVDAKNKALEAQGKAEQAQNKAEEAQGKAEAAKDEAVAAQQGAVTAKNQAETARDG 1998 Query: 313 AGQASASATAAGKSAESAASSASTATTKAGEATEQASAAARSASAAKTSETNAKASETSA 372 A A A A AE+ A+ A+ A A A QA +A A+AAKT+ +AK A Sbjct: 1999 AVDAKNKAEQAKSQAETFATQANAAKQDAVTAKNQAESARNEANAAKTAALDAK---QGA 2055 Query: 373 ESSKTAAASSASSAASSASSASASKDEATR---QASAAKSSATTASTKATEA 421 E++K A + A+ A ++ A +KD+A + A AAK A A T A A Sbjct: 2056 ENAKNQAETFATQANTAKQGALEAKDKAEQAKADAVAAKLGADAAQTLAVNA 2107 Score = 52.4 bits (124), Expect = 1e-04, Method: Compositional matrix adjust. 
Identities = 118/356 (33%), Positives = 158/356 (44%), Gaps = 52/356 (14%) Query: 112 EVARNASAVAQNTAAAKKSASDASTSAREAATHAADAADSARAASTSAGQAASSAQSASS 171 E ARN + A+N A DA T+A EA A A A A A A + A+SA Sbjct: 1748 EAARNEAVTAKN------QAEDAKTAALEAQNKAEAAKLGAEQAKAQADVAKNQAESARD 1801 Query: 172 SAGTASTKATEASKSAAAAESSKSAAATSAGAAKTSETNASASLQSAATSASTATTKASE 231 A A T+A E + A A+++ A A AK A + ++ AT A+ A A E Sbjct: 1802 EAVVAKTQAEEFATKAQTAQAAAVNAQQGAVEAKNKAEQAKSQAETFATQANAAKDAALE 1861 Query: 232 AATSARDAAASKEAAKSSETNASSSASSAASSATAAGNSAKAAKTSETNARSSETAAGQS 291 A A +A EAA+ A A A AA A+AA+T Sbjct: 1862 AQAGANNAKQEAEAARYE-------AVMAKDDAVAAKRGAEAAQT--------------- 1899 Query: 292 ASAAAGSKTAAASSASAA-------STSAGQASASATAAGKSAESAASSASTATTKAGEA 344 AA GS AS A AA +T A QA A AA AE A + A A KA EA Sbjct: 1900 --AAQGS----ASQAEAAKTKAEEFATKAEQAKGEAEAAKLGAEQAQTVAVDAKNKALEA 1953 Query: 345 T---EQASAAARSASA-AKTSETNAKASETSAESSKTAAASSASSAASSASSASASKDEA 400 EQA A A A+ ++ A A++ A ++K A ++ A + + A +K +A Sbjct: 1954 QGKAEQAQNKAEEAQGKAEAAKDEAVAAQQGAVTAKNQAETARDGAVDAKNKAEQAKSQA 2013 Query: 401 T---RQASAAKSSATTASTKA----TEAAGSATAAAQSKSTAESAATRAETAAKRA 449 QA+AAK A TA +A EA + TAA +K AE+A +AET A +A Sbjct: 2014 ETFATQANAAKQDAVTAKNQAESARNEANAAKTAALDAKQGAENAKNQAETFATQA 2069 Score = 47.4 bits (111), Expect = 0.003, Method: Compositional matrix adjust. 
Identities = 112/331 (33%), Positives = 151/331 (45%), Gaps = 10/331 (3%) Query: 127 AKKSASDASTSAREAATHAADAADSARAASTSAGQAASSAQSASSSAGTASTKATEASKS 186 A+ A++A A A A A D A AA A A ++AQ ++S A A TKA E + Sbjct: 1862 AQAGANNAKQEAEAARYEAVMAKDDAVAAKRGAEAAQTAAQGSASQAEAAKTKAEEFATK 1921 Query: 187 AAAAESSKSAAATSAGAAKTSETNASASLQSAATSASTATTKASEAATSARDAAASKEAA 246 A A+ AA A A+T +A A A A KA EA A A AA Sbjct: 1922 AEQAKGEAEAAKLGAEQAQTVAVDAKNKALEAQGKAEQAQNKAEEAQGKAEAAKDEAVAA 1981 Query: 247 KSSETNASSSASSAASSATAAGNSAKAAKTSETNARSSETAAGQSASAAAGSKTAAASSA 306 + A + A +A A A N A+ AK+ + AA Q A A +A + A Sbjct: 1982 QQGAVTAKNQAETARDGAVDAKNKAEQAKSQAETFATQANAAKQDAVTAKNQAESARNEA 2041 Query: 307 SAASTSAGQASASATAAGKSAESAASSASTATTKAGEATEQAS-------AAARSASAAK 359 +AA T+A A A A AE+ A+ A+TA A EA ++A AA A AA+ Sbjct: 2042 NAAKTAALDAKQGAENAKNQAETFATQANTAKQGALEAKDKAEQAKADAVAAKLGADAAQ 2101 Query: 360 TSETNAKASETSAESSKTAAASSASSAASSASSASASKDEATRQASAAKSSATTASTKAT 419 T NAK A+ AA ++A ++AS A +A ++A A AA+ A TA T+A Sbjct: 2102 TLAVNAKDKALEAQGKAEAAQAAAQNSASQAQTAQNKAEQAQAAAVAAQQGADTAKTQAE 2161 Query: 420 EAAGSATAAAQSKSTAESAATRAETAAKRAE 450 A A A K+ AE A T A A +AE Sbjct: 2162 AARNEAVTA---KNQAEDAKTAALEAQNKAE 2189 Score = 44.7 bits (104), Expect = 0.022, Method: Compositional matrix adjust. 
Identities = 94/292 (32%), Positives = 128/292 (43%), Gaps = 11/292 (3%) Query: 162 AASSAQSASSSAGTASTKATEASKSAAAAESSKSAAATSAGAAKTSETNASASLQSAATS 221 A + A++A + A TA +A +A +A A++ AA A AK A +SA Sbjct: 1743 AKTQAEAARNEAVTAKNQAEDAKTAALEAQNKAEAAKLGAEQAKAQADVAKNQAESARDE 1802 Query: 222 ASTATTKASEAATSARDAAASKEAAKSSETNASSSASSAASSATAAGNSAKAAKTSETNA 281 A A T+A E AT A+ A A+ A+ A + A A S A A AAK + A Sbjct: 1803 AVVAKTQAEEFATKAQTAQAAAVNAQQGAVEAKNKAEQAKSQAETFATQANAAKDAALEA 1862 Query: 282 RSSETAAGQSASAAAGSKTAAASSASAASTSAGQASASATAAGKSAESAASSASTATTKA 341 ++ A Q A AA A A AA A AA +A+ +AS A A TKA Sbjct: 1863 QAGANNAKQEAEAARYEAVMAKDDAVAAKRGA-------EAAQTAAQGSASQAEAAKTKA 1915 Query: 342 GEATEQASAAARSASAAKTSETNAKASETSAESSKTAAASSASSAASSASSASASKDEAT 401 E +A A A AAK A+ A++ A A A + A A + A Sbjct: 1916 EEFATKAEQAKGEAEAAKLGAEQAQTVAVDAKNKALEAQGKAEQAQNKAEEAQGKAEAAK 1975 Query: 402 RQASAAKSSATTASTKATEAAGSAT----AAAQSKSTAESAATRAETAAKRA 449 +A AA+ A TA +A A A A Q+KS AE+ AT+A A + A Sbjct: 1976 DEAVAAQQGAVTAKNQAETARDGAVDAKNKAEQAKSQAETFATQANAAKQDA 2027 Score = 44.7 bits (104), Expect = 0.022, Method: Compositional matrix adjust. 
Identities = 111/371 (29%), Positives = 176/371 (47%), Gaps = 40/371 (10%) Query: 112 EVARNASAVAQNTAAAKKSASDASTSAREAATHAADAADSARA----ASTSAGQAASSAQ 167 E A+N + AQ A A A D + +A++ A A + A++AR A A QA S A+ Sbjct: 1958 EQAQNKAEEAQGKAEA---AKDEAVAAQQGAVTAKNQAETARDGAVDAKNKAEQAKSQAE 2014 Query: 168 SASSSAGTASTKATEASKSAAAAESSKSAAATSAGAAKTSETNASASLQSAATSASTATT 227 + ++ A A A A A +A + +AA T+A AK NA ++ AT A+TA Sbjct: 2015 TFATQANAAKQDAVTAKNQAESARNEANAAKTAALDAKQGAENAKNQAETFATQANTAKQ 2074 Query: 228 KASEAATSAR----DAAASK---EAAKSSETNASSSA---------------SSAASSAT 265 A EA A DA A+K +AA++ NA A +SA+ + T Sbjct: 2075 GALEAKDKAEQAKADAVAAKLGADAAQTLAVNAKDKALEAQGKAEAAQAAAQNSASQAQT 2134 Query: 266 AAGNSAKAAKTSETNARSSETAAGQSASAAAGSKTAAASSASAASTSAGQASASATAAGK 325 A + +A + + ++TA Q A AA A + A A T+A +A A AA Sbjct: 2135 AQNKAEQAQAAAVAAQQGADTAKTQ-AEAARNEAVTAKNQAEDAKTAALEAQNKAEAAKL 2193 Query: 326 SAESAASSASTATTKAGEATEQASAAARSASAAKTSETNAKASETSAES-------SKTA 378 AE A + A A +A A ++A AA +AA+ + A+A++T AE+ +K Sbjct: 2194 GAEQAKAQADVAKNQAESARDEAVAAQ---TAAQGLASQAEAAKTQAETARDGAVDAKNK 2250 Query: 379 AASSASSAASSASSASASKDEATRQASAAKSSATTASTKATEAAGSATAAAQSKSTAESA 438 A + S A + A+ A+A+KD A + A ++ A + +A + A+ ++ AE Sbjct: 2251 AEQAKSQAETFATQANAAKDAALEAQAGANAAQQGAESAKDDAVAAQKASEDARDKAEGF 2310 Query: 439 ATRAETAAKRA 449 AT+A+ A +A Sbjct: 2311 ATKADDAKTKA 2321 >UniRef50_B0X1G5 Microtubule-associated protein futsch n=1 Tax=Culex quinquefasciatus RepID=B0X1G5_CULQU Length = 4575 Score = 50.8 bits (120), Expect = 3e-04, Method: Compositional matrix adjust. 
Identities = 109/360 (30%), Positives = 167/360 (46%), Gaps = 55/360 (15%) Query: 111 EEVARNASAVAQ--NTAAAKKSASDASTSAREAATHAADAADSARAASTSAGQAAS---- 164 EE +R ASA +Q AA++K ++ +H ++ A S ++A+ + +S Sbjct: 2437 EESSRPASAASQASEKAASEKPDEKEASRPESVVSHVSEKAASEKSATLEKPEESSRPTS 2496 Query: 165 SAQSASSSAGTASTKATEASK--SAAAAESSKSAAATSAGAAKTSETNASASLQSAATSA 222 +A AS +A + A EAS+ SAA+ S K+A+ SA K E AS Sbjct: 2497 AASQASETAPSEKQDAKEASRPESAASHVSEKAASEKSAMLEKPEEKEAS--------RP 2548 Query: 223 STATTKASEAATSARDAAASK---EAAKSSETNASSSASSAASSATAAGNSAKAAKTSET 279 ++AT++ SE ATS +D A K + A SE+ AS + AAS ++K A E+ Sbjct: 2549 ASATSQVSEKATSEKDTPAEKTDGKEASRSESVASHVSEKAASDKAEEKPASKEASRPES 2608 Query: 280 NA-RSSETAAGQSASAAAGSKTAAASSASAASTSAGQASASATAAGKSAESAASSASTAT 338 A SE AA A SK +AS + A +E AAS S Sbjct: 2609 VASHVSEKAASDKAEEKPASK---------------EASRPESVASHVSEKAASEKSATL 2653 Query: 339 TKAGEATEQASAAARSASAAKTSETNAKASETSAESSKTAAASSASSAASSASSASASK- 397 K E++ ASAA++++ A + + +AK + S +AAS S A+S S + K Sbjct: 2654 EKPEESSRPASAASQASEKAPSEKPDAKEA-----SRPDSAASHVSEKAASEKSMTLDKP 2708 Query: 398 --DEATRQASAA---------KSSATTASTKATEAAGSATAAAQSKSTAESAATRAETAA 446 EA+R ASAA + SAT E++ ++A +Q+ E A+R E+AA Sbjct: 2709 EEKEASRPASAASHVSEKAASEKSATLEKPDDKESSRPSSALSQAD---EKEASRPESAA 2765 >UniRef50_C3L3V8 Putative uncharacterized protein n=2 Tax=Candidatus Amoebophilus asiaticus 5a2 RepID=C3L3V8_AMOA5 Length = 891 Score = 48.5 bits (114), Expect = 0.001, Method: Compositional matrix adjust. 
Identities = 103/321 (32%), Positives = 154/321 (47%), Gaps = 26/321 (8%) Query: 135 STSAREAATHAADAADSARAASTSAGQAASSAQSASSSAGTASTKATEASKSAAAAESSK 194 ST+++EA T A+D A A+ + A Q A +A+ +++A KA EA++ A + Sbjct: 432 STASKEAKT-ASDEAIKAQEVTEKAQQQAEAARDQANTANDKVIKAQEATEKAQQQAKAA 490 Query: 195 SAA-ATSAGAAKTSETNASASLQSAATSASTATTKASEAATSARDAAASKEAAKSSETNA 253 +T++G AKT+ A + Q A + A T ASE A +AR+ EA K Sbjct: 491 KEQASTASGDAKTASDEAKKAQQQAKAARDQANT-ASEEAKAARN-----EAEK-----V 539 Query: 254 SSSASSAASSATAAGNSAKAAKTSETNARSSETAAGQSASAAAGSKTAAASSASAASTSA 313 A +A A A AKA AR+ A Q A AA A+ A AS A Sbjct: 540 QQQAEAARDQANTASEEAKA-------ARNEAEKAQQQAEAARDQSNTASGDAKTASDEA 592 Query: 314 GQASASATAAGKSAESAASSASTATTKAGEATEQASAAARSASAAKTSETNAKASETSAE 373 +A A AA A +A+ A +A +A +QA AA A+ A SE KA E + + Sbjct: 593 KKAQQQAEAARDQANTASEETKAARNEAEKAQQQAEAARDQANTA--SEEAIKAQEATEK 650 Query: 374 SSKTAA--ASSASSAASSASSASASKDEATRQASAAKSSATTASTKATEAAGSATAAAQS 431 ++K A A +A++AA+ A++AS A +A A+ +A +TKA + AQ Sbjct: 651 ATKQAKDDAETATNAATQANTASEEAKTARNEAIEAQQAAEKEATKAMKQVEQIKKKAQE 710 Query: 432 KSTAESAAT--RAETAAKRAE 450 K+ + A + ETA K+AE Sbjct: 711 KAQQKQAKKLAKEETARKKAE 731 >UniRef50_P13390 L-shaped tail fiber protein n=4 Tax=Enterobacteria phage T5 RepID=VLTF_BPT5 Length = 1396 Score = 48.1 bits (113), Expect = 0.002, Method: Compositional matrix adjust. 
Identities = 70/178 (39%), Positives = 94/178 (52%), Gaps = 21/178 (11%) Query: 217 SAATSASTATTKASEAATSARDAAASKEAAKSSETNASSSASSAASSATAA--------- 267 + S S ++ A++ ++ + AS AAK SE NA S +A S A Sbjct: 28 TVVLSNSISSITAADVTSAIESSKASGPAAKQSEINAKQSELNAKDSENEAEISATSSQQ 87 Query: 268 ------------GNSAKAAKTSETNARSSETAAGQSASAAAGSKTAAASSASAASTSAGQ 315 NSAKAAKTSETNA +S+ AA S + AA S ++A+S A+AA SA Sbjct: 88 SATQSASSATASANSAKAAKTSETNANNSKNAAKTSETNAASSASSASSFATAAENSARA 147 Query: 316 ASASATAAGKSAESAASSASTATTKAGEATEQASAAARSASAAKTSETNAKASETSAE 373 A S T AG SA++A +S + A A A + A +S +AAKTSETNAK SE A+ Sbjct: 148 AKTSETNAGNSAQAADASKTAAANSATAAKTSETNAKKSETAAKTSETNAKTSENKAK 205 Score = 46.2 bits (108), Expect = 0.007, Method: Compositional matrix adjust. Identities = 56/106 (52%), Positives = 69/106 (65%) Query: 205 KTSETNASASLQSAATSASTATTKASEAATSARDAAASKEAAKSSETNASSSASSAASSA 264 KTSETNA+ S +A TS + A + AS A++ A A S AAK+SETNA +SA +A +S Sbjct: 107 KTSETNANNSKNAAKTSETNAASSASSASSFATAAENSARAAKTSETNAGNSAQAADASK 166 Query: 265 TAAGNSAKAAKTSETNARSSETAAGQSASAAAGSKTAAASSASAAS 310 TAA NSA AAKTSETNA+ SETAA S + A S+ A AS Sbjct: 167 TAAANSATAAKTSETNAKKSETAAKTSETNAKTSENKAKEYLDMAS 212 Score = 44.7 bits (104), Expect = 0.020, Method: Compositional matrix adjust. 
Identities = 87/180 (48%), Positives = 110/180 (61%), Gaps = 19/180 (10%) Query: 178 TKATEASK-SAAAAESSKSAAATSAGAAKTSETNASASLQSAATSASTATTKASEAA--- 233 T A E+SK S AA+ S+ A S AK SE A S S+ SA+ + + A+ +A Sbjct: 44 TSAIESSKASGPAAKQSEINAKQSELNAKDSENEAEISATSSQQSATQSASSATASANSA 103 Query: 234 ----TSARDAAASKEAAKSSETNASSSASSAASSATAAGNSAKAAKTSETNARSSETAAG 289 TS +A SK AAK+SETNA+SSASSA+S ATAA NSA+AAKTSETN AG Sbjct: 104 KAAKTSETNANNSKNAAKTSETNAASSASSASSFATAAENSARAAKTSETN-------AG 156 Query: 290 QSASAAAGSKTAAASSASAASTSAGQASASATAAGKSAESAASSASTATTKAGEATEQAS 349 SA AA SKTAAA+SA+AA TS A S TA A+++ ++A T+ KA E + AS Sbjct: 157 NSAQAADASKTAAANSATAAKTSETNAKKSETA----AKTSETNAKTSENKAKEYLDMAS 212 >UniRef50_B6K9X3 Putative uncharacterized protein n=1 Tax=Toxoplasma gondii ME49 RepID=B6K9X3_TOXGO Length = 705 Score = 47.8 bits (112), Expect = 0.002, Method: Compositional matrix adjust. Identities = 84/281 (29%), Positives = 137/281 (48%), Gaps = 26/281 (9%) Query: 227 TKASEAATSARDAAASKEAAKSSETNASSSASSAASSATAAGNSAKAAKTSETNARSSET 286 T A E A + A A++SSET A +A + A S + +E NA+ + Sbjct: 379 TPAEENAEEPKQAEEQANASQSSETPAEENAEEPKQAEEQANASQSSQTPAEENAQEPKQ 438 Query: 287 AAGQSASAAAGSKTAAASSASAASTSAGQASASATA---------AGKSAESAASSASTA 337 A Q A+A+ S+T A +A + QA+AS ++ K AE A+++ ++ Sbjct: 439 AEEQ-ANASQSSETPAEENAEEPKQAEEQANASQSSETPAEENAEEPKQAEEQANASQSS 497 Query: 338 TTKAGEATEQASAAARSASAAKTSETNAKASETSAESSKTAAASSASSAASSASSASASK 397 T A E TE+ A A+A+++SET A+ + + + A +S SSA + +A K Sbjct: 498 ETPAEENTEEPKQAEERANASQSSETPAEENAQEPKQGEEQANASQSSATPAEENAEEPK 557 Query: 398 DEATRQASAAKSSATTASTKATEAAGSATAAAQSKSTAESAATRAETAA---KRAEDIAS 454 +A QA+A++SS T A A E A + ++ ++S+ T AE A K+AE+ A+ Sbjct: 558 -QAEEQANASQSSETPAEENAQE----PKQAEERENASQSSETPAEENAQEPKQAEEQAN 612 Query: 455 A------VALEDASTTKKGIVQ--LSSATNSTSETLAATPK 487 A A E+A K+G Q S ++ + +E A PK Sbjct: 613 ASQSSETPAEENAEEPKQGEEQANASQSSETPAEENAQVPK 653 >UniRef50_Q5ULL7 Orf97 n=1 Tax=Lactobacillus phage LP65 RepID=Q5ULL7_9CAUD Length = 2112 Score = 47.0 bits (110), Expect = 
0.005, Method: Compositional matrix adjust. Identities = 71/215 (33%), Positives = 109/215 (50%), Gaps = 17/215 (7%) Query: 148 AADSARAASTSAGQAASSAQSASSSAGTASTKATEASKSAAAAESSKSAAATSAGAAKTS 207 A DSA + +A QAAS A A S+A + A K+ + A +KSA + A Sbjct: 722 AGDSATRVANNASQAASGAVIAGSTA------SVNADKAVSVANQAKSAGDNATRVAN-- 773 Query: 208 ETNASASLQSAATSASTATTKASEAATSARDAAASKEAAKSSETNASSSASSAASSATAA 267 NAS + A + STAT A++A+T A +A ++ + A S NA+S A+SA S+A Sbjct: 774 --NASQAASGAILAGSTATVIANKASTVANEAKSAGDNATSVANNATSVANSAKSTA--- 828 Query: 268 GNSAKAAKTSETNARSSETAAGQSASAAAGSKTAAASSASAASTSAGQASASATAAGKSA 327 +S A SE +S+ QS + A S+ A S ++A + Q++A+A A +A Sbjct: 829 -DSTYAYANSEIAVQSTAITKAQSTADNAFSQAQAVGSQASAEIAV-QSNATAKAQ-STA 885 Query: 328 ESAASSASTATTKAGEATEQASAAARSASAAKTSE 362 ++A S A+TA G+ T QA + S +E Sbjct: 886 DNAFSKATTAIDN-GKVTSQAVTDLKDGSKLTIAE 919 Score = 45.1 bits (105), Expect = 0.015, Method: Compositional matrix adjust. Identities = 58/205 (28%), Positives = 106/205 (51%), Gaps = 13/205 (6%) Query: 259 SAASSATAAGNSAKAAKTSETNARSSETAAGQSASAAAGSKTAAASSASAASTSAGQASA 318 +A SAT N+A A + A S+ + A + A +A +A+ + +A QA++ Sbjct: 721 TAGDSATRVANNASQAASGAVIAGSTASVNADKAVSVANQAKSAGDNATRVANNASQAAS 780 Query: 319 SATAAGKSAESAASSASTATTKAGEATEQASAAARSASAAKTSETNAKASETSAESSKTA 378 A AG +A A+ AST +A A + A++ A +A++ A +++++A+S+ Sbjct: 781 GAILAGSTATVIANKASTVANEAKSAGDNATSVANNATSV------ANSAKSTADSTYAY 834 Query: 379 AASSASSAASSASSASASKDEATRQASAAKSSATTASTKATEAAGSATAAAQSKSTAESA 438 A S + +++ + A ++ D A QA A S A + E A + A A+++STA++A Sbjct: 835 ANSEIAVQSTAITKAQSTADNAFSQAQAVGSQA------SAEIAVQSNATAKAQSTADNA 888 Query: 439 ATRAETAAKRAEDIASAVA-LEDAS 462 ++A TA + + AV L+D S Sbjct: 889 FSKATTAIDNGKVTSQAVTDLKDGS 913 >UniRef50_Q9TYL3 Putative uncharacterized protein n=2 Tax=Caenorhabditis elegans RepID=Q9TYL3_CAEEL Length = 605 Score = 46.6 bits (109), Expect = 0.005, Method: Compositional matrix adjust. 
Identities = 104/269 (38%), Positives = 148/269 (55%), Gaps = 16/269 (5%) Query: 177 STKATEASKSAAAAESSKSAAATSAGAAKTSETNASASLQSAATSASTATTKA--SEAAT 234 ST +T A S A+ + S A T+AG T+ T A S S A STATT A S A+T Sbjct: 87 STASTAAGGSTASTAAGGSTATTAAGG-STATTAAGGSTASTAAGGSTATTAAGGSTAST 145 Query: 235 SARDAAASKEAAKSSETNASSSASSAASSATAAGNSAKAAKTSETNARSSETAAGQSASA 294 +A + AS A S+ +S+A+ +++ TAAG S A T+ + +S A G +A+ Sbjct: 146 AAGGSTASTAAGGST----ASTAAGGSTATTAAGGST--ATTAAGGSTASTAAGGSTATT 199 Query: 295 AAGSKTAA-ASSASAASTSAGQASASATAAGKSAESAASSASTATTKAGEATEQASAAAR 353 AAG TA+ A+ S AST+AG ++A+ TAAG S + A+ STA+T AG +T +A Sbjct: 200 AAGGSTASTAAGGSTASTAAGGSTAT-TAAGGSTATTAAGGSTASTAAGGSTASTAAGGS 258 Query: 354 SASAAKTSETNAKASETSAESSKTAAASSASSAASSASSASASKDEATRQASAAKSSATT 413 +A+ A T A+ S S TAA S ++ A+ S+AS + +T +A S+ATT Sbjct: 259 TATTAAGGSTATTAAGGSTAS--TAAGGSTATTAAGGSTASTAAGGSTASTAAGGSTATT 316 Query: 414 A---STKATEAAGSATAAAQSKSTAESAA 439 A ST T A GS + A STA +AA Sbjct: 317 AAGGSTATTAAGGSTASTAAGGSTASTAA 345 Score = 45.1 bits (105), Expect = 0.015, Method: Compositional matrix adjust. 
Identities = 111/285 (38%), Positives = 161/285 (56%), Gaps = 19/285 (6%) Query: 156 STSAGQAASSAQSASSSAGTA---STKATEASKSAAAAESSKSAAATSAGAAKTSETNAS 212 ST+AG + +S + S+A TA ST T A S A + S A+T+AG T+ T A Sbjct: 81 STAAGGSTASTAAGGSTASTAAGGSTATTAAGGSTATTAAGGSTASTAAGG-STATTAAG 139 Query: 213 ASLQSAATSASTATTKA--SEAATSARDAAASKEAAKSSETNAS--SSASSAA---SSAT 265 S S A STA+T A S A+T+A + A+ A S+ T A+ S+AS+AA ++ T Sbjct: 140 GSTASTAAGGSTASTAAGGSTASTAAGGSTATTAAGGSTATTAAGGSTASTAAGGSTATT 199 Query: 266 AAGNSAKAAKTSETNARSSETAAGQSASAAAGSKTAA-ASSASAASTSAGQASASATAAG 324 AAG S A T+ + +S A G +A+ AAG TA A+ S AST+AG ++AS TAAG Sbjct: 200 AAGGST--ASTAAGGSTASTAAGGSTATTAAGGSTATTAAGGSTASTAAGGSTAS-TAAG 256 Query: 325 KSAESAASSASTATTKAGEATEQASAAARSASAAKTSETNAKASETSAESSKTAAASSAS 384 S + A+ STATT AG +T +A +A+ A T + A+ S S TAA S + Sbjct: 257 GSTATTAAGGSTATTAAGGSTASTAAGGSTATTAAGGSTASTAAGGSTAS--TAAGGSTA 314 Query: 385 SAASSASSASASKDEATRQASAAKSSATTAS--TKATEAAGSATA 427 + A+ S+A+ + +T +A S+A+TA+ + AT AAG +TA Sbjct: 315 TTAAGGSTATTAAGGSTASTAAGGSTASTAAGGSTATTAAGGSTA 359 >UniRef50_B9Q2F9 Putative uncharacterized protein n=1 Tax=Toxoplasma gondii GT1 RepID=B9Q2F9_TOXGO Length = 972 Score = 45.4 bits (106), Expect = 0.011, Method: Compositional matrix adjust. 
Identities = 99/350 (28%), Positives = 171/350 (48%), Gaps = 36/350 (10%) Query: 159 AGQAASSAQSASSSAGTASTKATEASKSAAAAESSKSAAATSAGAAKTSETNASASLQSA 218 A + A+++QS+ + A + + +A A A++SS++ A +A K +E A+AS Sbjct: 362 AEEQANASQSSETPAEENAQEPKQAEDQANASQSSETPAEENAEEPKQAEEQANASQ--- 418 Query: 219 ATSASTATTKASEAATSARDAAASKEAAKSSETNASSSASSAASSATAAGNSAKAAKTSE 278 ++ T A E A + A A++SSET A +A + A S + +E Sbjct: 419 -----SSETPAEENAQEPKQAEDQANASQSSETPAEENAEEPKQAEEQANASQSSETPAE 473 Query: 279 TNARSSETAAGQSASAAAGSKTAAASSASAASTSAGQASASATA---------AGKSAES 329 NA+ + A Q A+A+ S+T A +A + QA+AS ++ K AE Sbjct: 474 ENAQEPKQAEEQ-ANASQSSETPAEENAQEPKQTEEQANASQSSETPAEENAQEPKQAEE 532 Query: 330 AASSASTATTKAGEATEQASAAARSASAAKTSETNAKASETSAESSKTAAASSASSAASS 389 A+++ ++ T A E ++ A A+A+++SET A E + E +T ++AS ++ + Sbjct: 533 QANASQSSETPAEENAQEPKQAEEQANASQSSET--PAEENAQEPKQTEEQANASQSSET 590 Query: 390 ASSASASK-DEATRQASAAKSSATTASTKATE---AAGSATAAAQSKSTAESAATRAETA 445 + +A + +A QA+A++SSAT A A E A A A+ S++ AE A Sbjct: 591 PAEENAEEPKQAEEQANASQSSATPAEENAEEPKQAEEQANASQSSETPAEENAQE---- 646 Query: 446 AKRAEDIASA------VALEDASTTKKGIVQ--LSSATNSTSETLAATPK 487 K+AE+ A+A A E+A K+ Q S ++ + +E A PK Sbjct: 647 PKQAEEQANASQSSETPAEENAEEPKQAEDQANASQSSETPAEENAEEPK 696 Searching..................................................done Results from round 2 Score E Sequences producing significant alignments: (bits) Value Sequences used in model and found again: UniRef50_P76072 Side tail fiber protein homolog from lambdoid pr... 1112 0.0 UniRef50_B7L485 Putative tail fiber protein n=4 Tax=Escherichia ... 417 e-114 UniRef50_B7LN99 Putative tail fiber protein n=2 Tax=Enterobacter... 384 e-104 UniRef50_C8U9W7 Probable tail fiber protein-like protein n=1 Tax... 315 5e-84 UniRef50_B5FQX9 Side tail fiber protein n=2 Tax=Salmonella enter... 312 6e-83 UniRef50_B4TP26 Side tail fiber protein n=43 Tax=Salmonella ente... 306 3e-81 UniRef50_B3I2W7 Phage Tail Collar Domain family n=1 Tax=Escheric... 
289 6e-76 UniRef50_Q3YZL1 Phage protein-related n=42 Tax=Enterobacteriacea... 288 9e-76 UniRef50_B3XIH6 GpH n=7 Tax=Enterobacteriaceae RepID=B3XIH6_ECOLX 285 8e-75 UniRef50_Q6KGF6 Putative tail fiber protein GP37 n=2 Tax=unclass... 284 2e-74 UniRef50_B4T041 Gp19 n=1 Tax=Salmonella enterica subsp. enterica... 272 6e-71 UniRef50_B5PP06 Side tail fiber protein n=2 Tax=Salmonella enter... 268 1e-69 UniRef50_Q3ZL14 Tail fiber protein n=2 Tax=Enterobacteriaceae Re... 267 2e-69 UniRef50_A4TT73 Phage tail protein n=30 Tax=Enterobacteriaceae R... 259 6e-67 UniRef50_Q99362 Protein 37 (Fragment) n=1 Tax=Enterobacteria pha... 257 2e-66 UniRef50_B3X2T1 Side tail fiber protein n=1 Tax=Shigella dysente... 256 3e-66 UniRef50_P03764 Side tail fiber protein n=2 Tax=root RepID=STF_L... 254 1e-65 UniRef50_A8A0A4 L-shaped tail fiber protein n=12 Tax=root RepID=... 251 1e-64 UniRef50_B7LKX7 Putative side tail phage protein n=2 Tax=Escheri... 248 8e-64 UniRef50_C6UHV3 Predicted tail fiber protein n=22 Tax=Escherichi... 239 4e-61 UniRef50_A9DEM1 Tail protein 2 n=1 Tax=Yersinia phage PY100 RepI... 235 6e-60 UniRef50_Q66BF2 Hypothetical phage protein n=1 Tax=Yersinia pseu... 232 5e-59 UniRef50_B5YU13 Tail fiber protein n=140 Tax=root RepID=B5YU13_E... 229 5e-58 UniRef50_B5YYN6 Tail fiber protein n=60 Tax=root RepID=B5YYN6_ECO5E 229 5e-58 UniRef50_B6IAV4 Putative phage tail fiber protein n=1 Tax=Escher... 227 2e-57 UniRef50_D2TJ16 Putative phage tail fibre protein n=1 Tax=Citrob... 225 5e-57 UniRef50_C2DS71 Tail fiber protein n=10 Tax=Escherichia RepID=C2... 224 2e-56 UniRef50_Q7Y3Z0 Tail fiber protein n=1 Tax=Yersinia phage PY54 R... 212 5e-53 UniRef50_D0IJ09 Tail fiber protein H putative n=1 Tax=Vibrio sp.... 197 2e-48 UniRef50_A3GRX3 Tail fiber like-protein n=1 Tax=Vibrio cholerae ... 197 3e-48 UniRef50_C4U3E2 Tail fiber protein (Fragment) n=1 Tax=Yersinia k... 195 8e-48 UniRef50_B3YHG3 Tail fiber protein n=2 Tax=Salmonella enterica s... 
190 2e-46 UniRef50_C5BA56 Putative uncharacterized protein n=1 Tax=Edwards... 188 1e-45 UniRef50_A9Q1X5 Putative tail fiber protein n=1 Tax=Enterobacter... 188 1e-45 UniRef50_Q858V4 GpH n=9 Tax=root RepID=Q858V4_9CAUD 187 2e-45 UniRef50_C6DE08 Tail Collar domain protein n=1 Tax=Pectobacteriu... 186 4e-45 UniRef50_D0ZBI4 Putative tail fiber protein n=1 Tax=Edwardsiella... 178 9e-43 UniRef50_C3R3S9 Predicted protein n=1 Tax=Bacteroides sp. 2_2_4 ... 172 6e-41 UniRef50_C9KJG4 Side tail fiber protein from lambdoid prophage R... 172 7e-41 UniRef50_D2TSH8 Phage tail fibre protein n=7 Tax=root RepID=D2TS... 161 1e-37 UniRef50_B7MW07 Putative tail fiber protein from prophage n=4 Ta... 160 4e-37 UniRef50_Q0I488 Putative uncharacterized protein n=1 Tax=Haemoph... 148 1e-33 UniRef50_Q5GAE0 Putative uncharacterized protein n=3 Tax=Singapo... 147 2e-33 UniRef50_Q9LA62 ORF-401-like protein n=1 Tax=Enterobacterial pha... 145 1e-32 UniRef50_B7MWN9 Putative tail fiber protein (GpH) n=2 Tax=Escher... 144 2e-32 UniRef50_Q38190 Gp37, tip of tail fiber (Fragment) n=5 Tax=Enter... 141 1e-31 UniRef50_C4Y8G3 Putative uncharacterized protein n=1 Tax=Clavisp... 139 5e-31 UniRef50_C5H7L2 Putative tail fiber protein GP37 n=3 Tax=unclass... 138 1e-30 UniRef50_C5DLU8 KLTH0G03696p n=1 Tax=Lachancea thermotolerans CB... 138 1e-30 UniRef50_Q0I4L1 Putative uncharacterized protein n=1 Tax=Haemoph... 130 3e-28 UniRef50_A9ITY7 Putative uncharacterized protein n=2 Tax=Bartone... 130 4e-28 UniRef50_B2I4I4 Putative phage tail protein n=1 Tax=Enterobacter... 126 3e-27 UniRef50_D0Z3B3 Putative tail fiber protein n=1 Tax=Photobacteri... 115 2e-23 UniRef50_C3L3V8 Putative uncharacterized protein n=2 Tax=Candida... 102 1e-19 UniRef50_B0X1G5 Microtubule-associated protein futsch n=1 Tax=Cu... 90 4e-16 Sequences not found previously or not previously below threshold: UniRef50_C5H7L3 Putative tail fiber protein n=1 Tax=Enterobacter... 125 1e-26 UniRef50_C9XHA4 Phage variable tail-fibre protein n=1 Tax=Salmon... 
123 5e-26 UniRef50_Q32D03 Putative uncharacterized protein n=2 Tax=root Re... 122 8e-26 UniRef50_UPI0001760AF3 PREDICTED: similar to nahoda CG12781-PA n... 121 2e-25 UniRef50_UPI0001826514 putative tail fiber protein (GpH) n=2 Tax... 121 2e-25 UniRef50_Q7N0S6 Similarities with prophage tail fiber protein n=... 120 2e-25 UniRef50_D2BT14 Tail Collar domain protein n=1 Tax=Dickeya dadan... 108 2e-21 UniRef50_C7BL21 Similarities with phage tail fiber protein n=1 T... 108 2e-21 UniRef50_D1RZD4 Putative uncharacterized protein n=1 Tax=Serrati... 108 2e-21 UniRef50_B6W6V0 Putative uncharacterized protein n=2 Tax=Anaeroc... 106 4e-21 UniRef50_D0FSD9 Phage related-protein n=2 Tax=Erwinia pyrifoliae... 106 5e-21 UniRef50_C6CGA0 Tail Collar domain protein n=5 Tax=Enterobacteri... 105 1e-20 UniRef50_B6S308 ORF32 (Fragment) n=3 Tax=Enterobacteriaceae RepI... 105 1e-20 UniRef50_A9IY35 Putative uncharacterized protein n=1 Tax=Bartone... 104 2e-20 UniRef50_Q19CF5 Gp36 small distal tail fiber subunit n=1 Tax=Aer... 102 6e-20 UniRef50_D1TPQ4 Phage tail collar domain protein n=1 Tax=Yersini... 101 2e-19 UniRef50_D2U1K0 Phage tail protein n=1 Tax=Arsenophonus nasoniae... 101 2e-19 UniRef50_A4WEL3 Phage Tail Collar domain protein n=2 Tax=Enterob... 99 4e-19 UniRef50_UPI00016C0A2F hypothetical protein Epulo_08463 n=1 Tax=... 100 6e-19 UniRef50_C6AD76 Putative uncharacterized protein n=1 Tax=Bartone... 99 9e-19 UniRef50_C5RZI7 Putative uncharacterized protein n=1 Tax=Actinob... 98 1e-18 UniRef50_Q6H236 Paternally-expressed gene 3 protein n=10 Tax=Eut... 98 2e-18 UniRef50_Q7N348 Similarities with tail fiber protein n=1 Tax=Pho... 97 3e-18 UniRef50_A7ZN97 Tail fiber family protein n=2 Tax=Escherichia co... 97 4e-18 UniRef50_B2PZV1 Putative uncharacterized protein n=2 Tax=Gammapr... 95 1e-17 UniRef50_Q9WXA5 Tail fiber n=2 Tax=Pectobacterium carotovorum Re... 94 3e-17 UniRef50_D2TR91 Putative phage tail fibre protein n=1 Tax=Citrob... 
93 4e-17 UniRef50_C2BQ43 Putative uncharacterized protein n=2 Tax=Coryneb... 93 6e-17 UniRef50_C6DA10 Tail Collar domain protein n=7 Tax=Enterobacteri... 92 9e-17 UniRef50_C6LLQ7 Glycogen synthase n=6 Tax=Clostridiales RepID=C6... 92 1e-16 UniRef50_C6CP88 Tail Collar domain protein n=5 Tax=Enterobacteri... 92 1e-16 UniRef50_Q7WYN2 Cellulosomal scaffoldin anchoring protein C n=1 ... 92 2e-16 UniRef50_B0MLL9 Putative uncharacterized protein n=1 Tax=Eubacte... 88 2e-15 UniRef50_C4TT85 Gp19 n=1 Tax=Yersinia kristensenii ATCC 33638 Re... 88 3e-15 UniRef50_Q3A2X7 Ribonuclease E n=3 Tax=Bacteria RepID=Q3A2X7_PELCD 88 3e-15 UniRef50_B7NJP1 Putative side tail fiber protein homolog from la... 87 3e-15 UniRef50_A6C5K2 Putative uncharacterized protein n=1 Tax=Plancto... 87 3e-15 UniRef50_C4TTI1 Tail fiber protein n=2 Tax=Yersinia RepID=C4TTI1... 86 8e-15 UniRef50_C5BH14 Tail fiber n=2 Tax=Enterobacteriaceae RepID=C5BH... 86 1e-14 UniRef50_B2SVF7 Phage-related protein n=3 Tax=Xanthomonas oryzae... 85 2e-14 UniRef50_O94854 Uncharacterized protein KIAA0754 n=6 Tax=Euarcho... 85 2e-14 UniRef50_A6VBH2 Putative tail fiber protein n=2 Tax=Pseudomonas ... 84 2e-14 UniRef50_Q7N2Q1 Similar to Sc/SvQ protein of Escherichia coli pl... 84 3e-14 UniRef50_UPI0001B553A8 Transglycosylase domain protein n=1 Tax=S... 84 3e-14 UniRef50_C6C5D4 Tail Collar domain protein n=1 Tax=Dickeya dadan... 83 7e-14 UniRef50_B5JW06 EleCtron transport complex, rnfabcdge type, c su... 83 7e-14 UniRef50_A9EEQ2 Amylopullulanase n=9 Tax=cellular organisms RepI... 83 8e-14 UniRef50_UPI00016BFB6A hypothetical protein Epulo_10362 n=1 Tax=... 81 2e-13 UniRef50_B6KAY2 Putative uncharacterized protein n=3 Tax=Toxopla... 81 2e-13 UniRef50_C5KW02 Calcium-binding tyrosine phosphorylation-regulat... 80 4e-13 UniRef50_P13390 L-shaped tail fiber protein n=4 Tax=Enterobacter... 80 5e-13 UniRef50_Q4L3P2 Similar toputative cell-surface adhesin SdrF n=4... 
80 7e-13 UniRef50_Q1HTS1 S1L n=1 Tax=Squirrel poxvirus RepID=Q1HTS1_9POXV 79 1e-12 UniRef50_B5DVG8 GA26604 n=2 Tax=pseudoobscura subgroup RepID=B5D... 79 1e-12 UniRef50_A9IXX1 Phage-related protein n=4 Tax=Bartonella RepID=A... 78 1e-12 UniRef50_B3XNY6 LPXTG-motif cell wall anchor domain protein n=1 ... 77 4e-12 UniRef50_C2CVN0 Putative uncharacterized protein n=1 Tax=Gardner... 77 4e-12 UniRef50_UPI0001C37D90 hypothetical protein RflaF_18851 n=1 Tax=... 76 7e-12 UniRef50_Q4QLS0 IgA-specific serine endopeptidase n=6 Tax=Haemop... 76 1e-11 UniRef50_Q7Q4S4 AGAP000893-PA n=1 Tax=Anopheles gambiae RepID=Q7... 75 1e-11 UniRef50_C4KLT7 Hep_Hag family protein n=45 Tax=Proteobacteria R... 75 1e-11 UniRef50_C4Y9G0 Putative uncharacterized protein n=1 Tax=Clavisp... 75 2e-11 UniRef50_C8PMD0 Putative antigen protein n=1 Tax=Treponema vince... 74 3e-11 UniRef50_C5DNW3 ZYRO0A12012p n=1 Tax=Zygosaccharomyces rouxii Re... 74 3e-11 UniRef50_Q4PI38 Putative uncharacterized protein n=1 Tax=Ustilag... 74 4e-11 UniRef50_A5KAV0 Merozoite surface protein 3 gamma (MSP3g), putat... 73 4e-11 UniRef50_C2CM00 Putative uncharacterized protein n=2 Tax=Coryneb... 73 5e-11 UniRef50_B6KQD3 SRS domain-containing protein n=2 Tax=Toxoplasma... 73 6e-11 UniRef50_P12036 Neurofilament heavy polypeptide n=78 Tax=root Re... 73 6e-11 UniRef50_B4NI35 GK12979 n=1 Tax=Drosophila willistoni RepID=B4NI... 73 7e-11 UniRef50_B3LWZ3 GF16318 n=1 Tax=Drosophila ananassae RepID=B3LWZ... 73 8e-11 UniRef50_B3L6P2 Merozoite surface protein 3, putative n=2 Tax=Pl... 72 1e-10 UniRef50_Q1QX63 Ribonuclease E n=5 Tax=Gammaproteobacteria RepID... 72 1e-10 UniRef50_B1VIS6 DNA polymerase III, gamma and tau subunits n=10 ... 72 1e-10 UniRef50_B5H2D1 Putative uncharacterized protein n=1 Tax=Strepto... 72 1e-10 UniRef50_C4KTI1 Cell divisionftsk/spoiiie n=95 Tax=cellular orga... 72 1e-10 UniRef50_UPI00015B5167 PREDICTED: similar to ENSANGP00000017739 ... 71 2e-10 UniRef50_A5KAV7 Merozoite surface protein 3 alpha (MSP3a), putat... 
71 2e-10 UniRef50_C7YRX8 Predicted protein n=4 Tax=Eukaryota RepID=C7YRX8... 71 3e-10 UniRef50_C7XZV3 Predicted protein n=3 Tax=Lactobacillus jensenii... 71 3e-10 UniRef50_Q4KTW7 Merozoite surface protein 3 alpha n=159 Tax=Plas... 71 3e-10 UniRef50_D2RBM3 Putative uncharacterized protein n=1 Tax=Gardner... 71 3e-10 UniRef50_UPI000023F701 hypothetical protein FG10084.1 n=1 Tax=Gi... 70 4e-10 UniRef50_C0QBA1 Putative plasmin-sensitive surface protein (Pls ... 70 4e-10 UniRef50_B8CF96 Predicted protein n=2 Tax=Thalassiosira pseudona... 70 5e-10 UniRef50_A5KAV8 Merozoite surface protein 3 (MSP3), putative n=2... 70 5e-10 UniRef50_A2ET23 Putative uncharacterized protein n=1 Tax=Trichom... 69 1e-09 UniRef50_C2EAZ6 Allergen V5/Tpx-1 family protein n=1 Tax=Lactoba... 69 1e-09 UniRef50_C2CRT4 Possible phage tail fiber protein n=1 Tax=Coryne... 68 2e-09 UniRef50_B0X1W5 Putative uncharacterized protein n=1 Tax=Culex q... 67 3e-09 UniRef50_B4L608 GI16285 n=1 Tax=Drosophila mojavensis RepID=B4L6... 67 4e-09 UniRef50_Q0V3K3 Putative uncharacterized protein n=1 Tax=Phaeosp... 66 6e-09 UniRef50_A4N1T0 Immunoglobin A1 protease n=1 Tax=Haemophilus inf... 66 9e-09 UniRef50_Q75A72 ADR046Cp n=1 Tax=Eremothecium gossypii RepID=Q75... 66 1e-08 UniRef50_A5IUV0 LPXTG-motif cell wall anchor domain n=71 Tax=Sta... 66 1e-08 UniRef50_B2W3G7 Predicted protein n=1 Tax=Pyrenophora tritici-re... 65 1e-08 UniRef50_B3NH03 GG13891 n=1 Tax=Drosophila erecta RepID=B3NH03_D... 65 1e-08 UniRef50_B3I282 Side tail fiber protein n=1 Tax=Escherichia coli... 65 2e-08 UniRef50_Q2HAR4 Putative uncharacterized protein n=1 Tax=Chaetom... 65 2e-08 UniRef50_B4N0R4 GK24431 n=1 Tax=Drosophila willistoni RepID=B4N0... 65 2e-08 UniRef50_D1ZPA3 Whole genome shotgun sequence assembly, scaffold... 65 2e-08 UniRef50_C5PDM6 Putative uncharacterized protein n=2 Tax=Coccidi... 64 3e-08 UniRef50_A1C839 PT repeat family protein n=1 Tax=Aspergillus cla... 
64 4e-08 UniRef50_A7AF35 Putative uncharacterized protein n=1 Tax=Parabac... 64 4e-08 UniRef50_B1YVB3 YadA domain protein n=6 Tax=Burkholderia RepID=B... 63 5e-08 UniRef50_B1N0K0 Putative mucus binding protein n=3 Tax=Lactobaci... 63 6e-08 UniRef50_C8VCU8 Putative uncharacterized protein n=2 Tax=Emerice... 63 7e-08 UniRef50_C2CTF5 Putative uncharacterized protein n=1 Tax=Gardner... 63 9e-08 UniRef50_A4A060 Putative uncharacterized protein n=1 Tax=Blastop... 63 9e-08 UniRef50_C7JGL3 Chromosome segregation protein SMC n=9 Tax=Alpha... 62 1e-07 UniRef50_B3M4N1 GF24494 n=6 Tax=Eukaryota RepID=B3M4N1_DROAN 61 2e-07 UniRef50_D0MQM2 Putative uncharacterized protein n=1 Tax=Phytoph... 61 3e-07 UniRef50_C1YUA7 Outer membrane protein/peptidoglycan-associated ... 61 3e-07 UniRef50_Q6CPZ4 KLLA0E01035p n=2 Tax=Saccharomycetaceae RepID=Q6... 60 4e-07 UniRef50_Q9L2C3 Large Ala/Glu-rich protein n=17 Tax=Streptomyces... 60 4e-07 UniRef50_Q1D010 Putative uncharacterized protein n=2 Tax=cellula... 60 6e-07 UniRef50_C5QRU2 Triblock protein copolymer TR8T n=1 Tax=Staphylo... 60 6e-07 UniRef50_Q08SC3 Adventurous gliding protein Z n=1 Tax=Stigmatell... 60 8e-07 UniRef50_Q9NDI9 Merozoite surface protein 3g n=1 Tax=Plasmodium ... 59 9e-07 UniRef50_A4AGZ2 Large Ala/Glu-rich protein n=1 Tax=marine actino... 59 1e-06 UniRef50_A7TJN9 Putative uncharacterized protein n=3 Tax=cellula... 58 2e-06 UniRef50_Q5TVN3 AGAP010846-PA (Fragment) n=1 Tax=Anopheles gambi... 58 2e-06 UniRef50_B8M9D3 PE repeat family protein n=3 Tax=Trichocomaceae ... 58 2e-06 UniRef50_B6Q6N2 PT repeat family protein n=1 Tax=Penicillium mar... 58 2e-06 UniRef50_P12027 Polysialoglycoprotein n=2 Tax=Oncorhynchus RepID... 58 2e-06 UniRef50_C0CXN6 Putative uncharacterized protein (Fragment) n=1 ... 58 2e-06 UniRef50_C6MUP3 Chromosome segregation ATPase-like protein n=1 T... 58 3e-06 UniRef50_UPI00016A9E96 hypothetical protein BoklC_19619 n=2 Tax=... 58 3e-06 UniRef50_A7A5H3 Putative uncharacterized protein n=1 Tax=Bifidob... 
58 3e-06 UniRef50_C5M4U8 Predicted protein n=1 Tax=Candida tropicalis MYA... 57 3e-06 UniRef50_A4ED70 Putative uncharacterized protein n=2 Tax=Bacteri... 57 4e-06 UniRef50_UPI00017F7AFB YALI0E22572p n=1 Tax=Yarrowia lipolytica ... 57 4e-06 UniRef50_Q2UB42 Predicted protein n=2 Tax=Aspergillus RepID=Q2UB... 57 5e-06 UniRef50_UPI00017935A3 PREDICTED: similar to neurofilament, heav... 57 5e-06 UniRef50_Q1D823 Adventurous-gliding motility protein Z n=1 Tax=M... 56 6e-06 UniRef50_A7IYB1 Gp40 n=1 Tax=Corynebacterium phage P1201 RepID=A... 56 7e-06 UniRef50_Q7SBR0 Putative uncharacterized protein n=1 Tax=Neurosp... 56 8e-06 UniRef50_C0MB36 Putative cell surface-anchored protein n=3 Tax=S... 56 9e-06 UniRef50_A7SNK2 Predicted protein n=2 Tax=Nematostella vectensis... 56 1e-05 UniRef50_C2CUY9 Putative uncharacterized protein n=1 Tax=Gardner... 56 1e-05 UniRef50_A8WT52 Putative uncharacterized protein n=2 Tax=Caenorh... 56 1e-05 UniRef50_Q9MBI3 Gp21, tail fiber protein n=1 Tax=Corynebacterium... 56 1e-05 UniRef50_C7NGP9 Fe-S oxidoreductase n=2 Tax=Actinomycetales RepI... 55 2e-05 UniRef50_B6HJ47 Pc21g19350 protein n=1 Tax=Penicillium chrysogen... 55 2e-05 UniRef50_C1E6C0 Predicted protein n=1 Tax=Micromonas sp. RCC299 ... 55 2e-05 UniRef50_Q2TY93 Predicted protein n=2 Tax=Aspergillus RepID=Q2TY... 55 2e-05 UniRef50_C0ZX71 Putative uncharacterized protein n=1 Tax=Rhodoco... 55 2e-05 UniRef50_A3YYY9 Putative exonuclease SbcC n=1 Tax=Synechococcus ... 55 2e-05 UniRef50_Q9TYL3 Putative uncharacterized protein n=2 Tax=Caenorh... 54 3e-05 UniRef50_C5M5Z1 Predicted protein n=2 Tax=Saccharomycetales RepI... 54 3e-05 UniRef50_C1FF73 Predicted protein n=1 Tax=Micromonas sp. RCC299 ... 53 5e-05 UniRef50_C7Z475 Putative uncharacterized protein n=1 Tax=Nectria... 53 5e-05 UniRef50_A1BA44 OmpA/MotB domain protein n=1 Tax=Paracoccus deni... 53 6e-05 UniRef50_UPI0001925BA0 PREDICTED: similar to mucin 2 n=5 Tax=Hyd... 
53 6e-05 UniRef50_C7YSP6 Predicted protein n=1 Tax=Nectria haematococca m... 53 6e-05 UniRef50_UPI0000DB6B60 PREDICTED: hypothetical protein n=2 Tax=A... 53 6e-05 UniRef50_Q6ZBP4 Os08g0490700 protein n=2 Tax=Oryza sativa RepID=... 53 9e-05 UniRef50_A7A6B3 Putative uncharacterized protein n=1 Tax=Bifidob... 52 1e-04 UniRef50_A6W5H3 Metal dependent phosphohydrolase n=1 Tax=Kineoco... 52 1e-04 UniRef50_A8HPQ1 Predicted protein n=1 Tax=Chlamydomonas reinhard... 52 2e-04 UniRef50_C1E1Z7 Predicted protein n=1 Tax=Micromonas sp. RCC299 ... 52 2e-04 UniRef50_UPI0001A5E657 PREDICTED: hypothetical protein n=8 Tax=H... 51 2e-04 UniRef50_B5EEN8 Flagellar hook-length control protein n=2 Tax=Ge... 51 2e-04 UniRef50_A0NYB8 Possible OmpA family member n=1 Tax=Labrenzia ag... 51 2e-04 UniRef50_A4R2V7 Predicted protein n=1 Tax=Magnaporthe grisea Rep... 51 3e-04 UniRef50_A4F6R6 Putative uncharacterized protein n=1 Tax=Sacchar... 51 3e-04 UniRef50_B2B4U0 Predicted CDS Pa_2_2450 n=3 Tax=cellular organis... 51 3e-04 UniRef50_B2WBM9 Putative uncharacterized protein n=1 Tax=Pyrenop... 50 4e-04 UniRef50_B6IFW9 Putative uncharacterized protein n=1 Tax=Caenorh... 50 4e-04 UniRef50_B0W468 Papilin n=4 Tax=Coelomata RepID=B0W468_CULQU 50 4e-04 UniRef50_Q9RSJ1 Putative uncharacterized protein n=1 Tax=Deinoco... 50 4e-04 UniRef50_B1VNN7 Putative uncharacterized protein n=1 Tax=Strepto... 50 7e-04 UniRef50_C8RQF5 Putative uncharacterized protein n=1 Tax=Coryneb... 50 8e-04 UniRef50_A8IBV4 Predicted protein n=1 Tax=Chlamydomonas reinhard... 49 0.001 UniRef50_Q0CSU4 Predicted protein n=1 Tax=Aspergillus terreus NI... 49 0.001 UniRef50_C7TJM8 Putative uncharacterized protein n=3 Tax=Lactoba... 49 0.001 UniRef50_C1E2Y3 Predicted protein n=1 Tax=Micromonas sp. RCC299 ... 48 0.002 UniRef50_A9FYY9 Protein kinase n=2 Tax=Sorangium cellulosum 'So ... 48 0.002 UniRef50_B7GRB7 Allergen V5/Tpx-1 family protein n=1 Tax=Bifidob... 48 0.002 UniRef50_A1C962 Putative uncharacterized protein n=1 Tax=Aspergi... 
48 0.002 UniRef50_B8JIY2 Nucleolar and coiled-body phosphoprotein 1-like ... 48 0.002 UniRef50_A8HZI1 Predicted protein n=1 Tax=Chlamydomonas reinhard... 48 0.002 UniRef50_C1MMF6 Predicted protein n=1 Tax=Micromonas pusilla CCM... 48 0.003 UniRef50_Q9W1J0 Transporter n=17 Tax=Endopterygota RepID=Q9W1J0_... 48 0.003 UniRef50_C0XU23 Possible gp21, tail fiber protein n=1 Tax=Coryne... 48 0.003 UniRef50_Q1IXI7 Putative uncharacterized protein n=1 Tax=Deinoco... 48 0.003 UniRef50_B2AE55 Predicted CDS Pa_4_2900 n=1 Tax=Podospora anseri... 47 0.003 UniRef50_A4X773 Putative uncharacterized protein n=2 Tax=Salinis... 47 0.004 UniRef50_A8LW37 Putative uncharacterized protein n=2 Tax=Salinis... 46 0.006 UniRef50_B4MT61 GK20099 n=1 Tax=Drosophila willistoni RepID=B4MT... 46 0.010 UniRef50_UPI0000F2B26E PREDICTED: similar to PKA phosphorylated ... 46 0.012 UniRef50_D0DXJ3 Putative uncharacterized protein (Fragment) n=1 ... 45 0.014 UniRef50_Q5YPQ7 Putative transporter n=2 Tax=Actinomycetales Rep... 45 0.014 UniRef50_UPI0001AF0361 Phage-related protein, tail component n=1... 45 0.018 UniRef50_A9WSY5 Putative uncharacterized protein n=1 Tax=Renibac... 45 0.019 >UniRef50_P76072 Side tail fiber protein homolog from lambdoid prophage Rac n=23 Tax=root RepID=STFR_ECOLI Length = 1120 Score = 1112 bits (2875), Expect = 0.0, Method: Composition-based stats. 
Identities = 1120/1120 (100%), Positives = 1120/1120 (100%) Query: 1 MAVKISGVLKDGTGKPVQNCTIQLKAKRNSTTVVVNTLASENPDEAGRYSMDVEYGQYSV 60 MAVKISGVLKDGTGKPVQNCTIQLKAKRNSTTVVVNTLASENPDEAGRYSMDVEYGQYSV Sbjct: 1 MAVKISGVLKDGTGKPVQNCTIQLKAKRNSTTVVVNTLASENPDEAGRYSMDVEYGQYSV 60 Query: 61 ILLVEGFPPSHAGTITVYEDSQPGTLNDFLGAMTEDDARPEALRRFELMVEEVARNASAV 120 ILLVEGFPPSHAGTITVYEDSQPGTLNDFLGAMTEDDARPEALRRFELMVEEVARNASAV Sbjct: 61 ILLVEGFPPSHAGTITVYEDSQPGTLNDFLGAMTEDDARPEALRRFELMVEEVARNASAV 120 Query: 121 AQNTAAAKKSASDASTSAREAATHAADAADSARAASTSAGQAASSAQSASSSAGTASTKA 180 AQNTAAAKKSASDASTSAREAATHAADAADSARAASTSAGQAASSAQSASSSAGTASTKA Sbjct: 121 AQNTAAAKKSASDASTSAREAATHAADAADSARAASTSAGQAASSAQSASSSAGTASTKA 180 Query: 181 TEASKSAAAAESSKSAAATSAGAAKTSETNASASLQSAATSASTATTKASEAATSARDAA 240 TEASKSAAAAESSKSAAATSAGAAKTSETNASASLQSAATSASTATTKASEAATSARDAA Sbjct: 181 TEASKSAAAAESSKSAAATSAGAAKTSETNASASLQSAATSASTATTKASEAATSARDAA 240 Query: 241 ASKEAAKSSETNASSSASSAASSATAAGNSAKAAKTSETNARSSETAAGQSASAAAGSKT 300 ASKEAAKSSETNASSSASSAASSATAAGNSAKAAKTSETNARSSETAAGQSASAAAGSKT Sbjct: 241 ASKEAAKSSETNASSSASSAASSATAAGNSAKAAKTSETNARSSETAAGQSASAAAGSKT 300 Query: 301 AAASSASAASTSAGQASASATAAGKSAESAASSASTATTKAGEATEQASAAARSASAAKT 360 AAASSASAASTSAGQASASATAAGKSAESAASSASTATTKAGEATEQASAAARSASAAKT Sbjct: 301 AAASSASAASTSAGQASASATAAGKSAESAASSASTATTKAGEATEQASAAARSASAAKT 360 Query: 361 SETNAKASETSAESSKTAAASSASSAASSASSASASKDEATRQASAAKSSATTASTKATE 420 SETNAKASETSAESSKTAAASSASSAASSASSASASKDEATRQASAAKSSATTASTKATE Sbjct: 361 SETNAKASETSAESSKTAAASSASSAASSASSASASKDEATRQASAAKSSATTASTKATE 420 Query: 421 AAGSATAAAQSKSTAESAATRAETAAKRAEDIASAVALEDASTTKKGIVQLSSATNSTSE 480 AAGSATAAAQSKSTAESAATRAETAAKRAEDIASAVALEDASTTKKGIVQLSSATNSTSE Sbjct: 421 AAGSATAAAQSKSTAESAATRAETAAKRAEDIASAVALEDASTTKKGIVQLSSATNSTSE 480 Query: 481 TLAATPKAVKSAYDNAEKRLQKDQNGADIPDKGCFLNNINAVSKTDFADKRGMRYVRVNA 540 TLAATPKAVKSAYDNAEKRLQKDQNGADIPDKGCFLNNINAVSKTDFADKRGMRYVRVNA Sbjct: 481 TLAATPKAVKSAYDNAEKRLQKDQNGADIPDKGCFLNNINAVSKTDFADKRGMRYVRVNA 540 Query: 541 
PAGATSGKYYPVVVMRSAGSVSELASRVIITTATRTAGDPMNNCEFNGFVMPGGWTDRGR 600 PAGATSGKYYPVVVMRSAGSVSELASRVIITTATRTAGDPMNNCEFNGFVMPGGWTDRGR Sbjct: 541 PAGATSGKYYPVVVMRSAGSVSELASRVIITTATRTAGDPMNNCEFNGFVMPGGWTDRGR 600 Query: 601 YAYGMFWQYQNNERAIHSIMMSNKGDDLRSVFYVDGAAFPVFAFIEDGLSISAPGADLVV 660 YAYGMFWQYQNNERAIHSIMMSNKGDDLRSVFYVDGAAFPVFAFIEDGLSISAPGADLVV Sbjct: 601 YAYGMFWQYQNNERAIHSIMMSNKGDDLRSVFYVDGAAFPVFAFIEDGLSISAPGADLVV 660 Query: 661 NDTTYKFGATNPATECIAADVILDFKSGRGFYESHSLIVNDNLSCKKLFATDEIVARGGN 720 NDTTYKFGATNPATECIAADVILDFKSGRGFYESHSLIVNDNLSCKKLFATDEIVARGGN Sbjct: 661 NDTTYKFGATNPATECIAADVILDFKSGRGFYESHSLIVNDNLSCKKLFATDEIVARGGN 720 Query: 721 QIRMIGGEYGALWRNDGAKTYLLLTNQGDVYGGWNTLRPFAIDNATGELVIGTKLSASLN 780 QIRMIGGEYGALWRNDGAKTYLLLTNQGDVYGGWNTLRPFAIDNATGELVIGTKLSASLN Sbjct: 721 QIRMIGGEYGALWRNDGAKTYLLLTNQGDVYGGWNTLRPFAIDNATGELVIGTKLSASLN 780 Query: 781 GNALTATKLQTPRRVSGVEFDGSKDITLTAAHVAAFARRATDTYADADGGVPWNAESGAY 840 GNALTATKLQTPRRVSGVEFDGSKDITLTAAHVAAFARRATDTYADADGGVPWNAESGAY Sbjct: 781 GNALTATKLQTPRRVSGVEFDGSKDITLTAAHVAAFARRATDTYADADGGVPWNAESGAY 840 Query: 841 NVTRSGDSYILVNFYTGVGSCRTLQMKAHYRNGGLFYRSSRDGYGFEEDWAEVYTSKNLP 900 NVTRSGDSYILVNFYTGVGSCRTLQMKAHYRNGGLFYRSSRDGYGFEEDWAEVYTSKNLP Sbjct: 841 NVTRSGDSYILVNFYTGVGSCRTLQMKAHYRNGGLFYRSSRDGYGFEEDWAEVYTSKNLP 900 Query: 901 PESYPVGAPIPWPSDTVPSGYALMQGQAFDKSAYPKLAAAYPSGVIPDMRGWTIKGKPAS 960 PESYPVGAPIPWPSDTVPSGYALMQGQAFDKSAYPKLAAAYPSGVIPDMRGWTIKGKPAS Sbjct: 901 PESYPVGAPIPWPSDTVPSGYALMQGQAFDKSAYPKLAAAYPSGVIPDMRGWTIKGKPAS 960 Query: 961 GRAVLSQEQDGIKSHTHSASASSTDLGTKTTSSFDYGTKSTNNTGAHTHSVSGSTNSAGA 1020 GRAVLSQEQDGIKSHTHSASASSTDLGTKTTSSFDYGTKSTNNTGAHTHSVSGSTNSAGA Sbjct: 961 GRAVLSQEQDGIKSHTHSASASSTDLGTKTTSSFDYGTKSTNNTGAHTHSVSGSTNSAGA 1020 Query: 1021 HTHSLANVNTASANSGAGSASTRLSVVHNQNYATSSAGAHTHSLSGTAASAGAHAHTVGI 1080 HTHSLANVNTASANSGAGSASTRLSVVHNQNYATSSAGAHTHSLSGTAASAGAHAHTVGI Sbjct: 1021 HTHSLANVNTASANSGAGSASTRLSVVHNQNYATSSAGAHTHSLSGTAASAGAHAHTVGI 1080 Query: 1081 GAHTHSVAIGSHGHTITVNAAGNAENTVKNIAFNYIVRLA 1120 
GAHTHSVAIGSHGHTITVNAAGNAENTVKNIAFNYIVRLA Sbjct: 1081 GAHTHSVAIGSHGHTITVNAAGNAENTVKNIAFNYIVRLA 1120 >UniRef50_B7L485 Putative tail fiber protein n=4 Tax=Escherichia coli RepID=B7L485_ECO55 Length = 1056 Score = 417 bits (1071), Expect = e-114, Method: Composition-based stats. Identities = 290/372 (77%), Positives = 308/372 (82%), Gaps = 17/372 (4%) Query: 766 TGELVIGTKLSASLNGNALTATKLQTPRRVSGVEFDGSKDITLTAAHVAAFARRATDTYA 825 +G L + ++ +L GNA TATKL +++GV+FDGS DI LT+ ++ AFARR+T YA Sbjct: 685 SGPLSVTDGITGALKGNADTATKLAAAPKINGVKFDGSADINLTSENIGAFARRSTGAYA 744 Query: 826 DADGGVPWNAESGAYNVTRSGDSYILVNFYTGVGSCRTLQMKAHYRNGGLFYRSSRDGYG 885 D+DG VPWNAESGAYNVTRSGDSYILVNFYTGVGSCRTLQMKAHYRN GLFYRSSRDGYG Sbjct: 745 DSDGAVPWNAESGAYNVTRSGDSYILVNFYTGVGSCRTLQMKAHYRNRGLFYRSSRDGYG 804 Query: 886 FEEDWAEVYTSKNLPPESYPVGAPIPWPSDTVPSGYALMQGQAFDKSAYPKLAAAYPSGV 945 FEEDWAEVYTSKNLPPESYPVGAPIPWPSDTVPSGYALMQGQ F+KSAYPKLAAAYPSGV Sbjct: 805 FEEDWAEVYTSKNLPPESYPVGAPIPWPSDTVPSGYALMQGQTFNKSAYPKLAAAYPSGV 864 Query: 946 IPDMRGWTIKGKPASGRAVLSQEQDGIKSHTHSASASSTDLGTKTTSSFDYGTKSTNNTG 1005 IPDMRGWTIKGKPASGRAVLSQEQDGIKSHTHSASASSTDLGTKTTSSFDYGTKSTNNTG Sbjct: 865 IPDMRGWTIKGKPASGRAVLSQEQDGIKSHTHSASASSTDLGTKTTSSFDYGTKSTNNTG 924 Query: 1006 AHTHSVSGSTNSAGAHTHSLANVNTASANS------GAG-----------SASTRLSVVH 1048 AHTHS+SGST SAG HTH S G G S T + Sbjct: 925 AHTHSLSGSTGSAGVHTHGNGIRWPGGGGSALAFYDGGGFTYVQNSQYQVSPGTSSYRSY 984 Query: 1049 NQNYATSSAGAHTHSLSGTAASAGAHAHTVGIGAHTHSVAIGSHGHTITVNAAGNAENTV 1108 Q T SAGAHTHSLSGTAAS+GAHAHTVGIGAHTHSVAIGSHGHTITVNAAGNAENTV Sbjct: 985 YQRIQTQSAGAHTHSLSGTAASSGAHAHTVGIGAHTHSVAIGSHGHTITVNAAGNAENTV 1044 Query: 1109 KNIAFNYIVRLA 1120 KNIAFNYIVRLA Sbjct: 1045 KNIAFNYIVRLA 1056 Score = 300 bits (768), Expect = 2e-79, Method: Composition-based stats. 
Identities = 471/541 (87%), Positives = 492/541 (90%), Gaps = 3/541 (0%) Query: 1 MAVKISGVLKDGTGKPVQNCTIQLKAKRNSTTVVVNTLASENPDEAGRYSMDVEYGQYSV 60 M VKISGVLKDGTGKPVQNCTI LKA+R S+TVVVNT+ASENPDEAGRYSMDVE+GQYSV Sbjct: 1 MTVKISGVLKDGTGKPVQNCTIVLKARRTSSTVVVNTVASENPDEAGRYSMDVEHGQYSV 60 Query: 61 ILLVEGFPPSHAGTITVYEDSQPGTLNDFLGAMTEDDARPEALRRFELMVEEVARNASAV 120 LLVEGFPPSHAGTITVYE S+PGTLNDFLGAMTEDD RPEALRRFE MVEEV+RNASAV Sbjct: 61 TLLVEGFPPSHAGTITVYEGSRPGTLNDFLGAMTEDDVRPEALRRFEQMVEEVSRNASAV 120 Query: 121 AQNTAAAKKSASDASTSAREAATHAADAADSARAASTSAGQAASSAQSASSSAGTASTKA 180 AQNTAAAKKSASDAS SA EAATHA DAA SARAASTSAGQAASSAQSASSSAGTASTKA Sbjct: 121 AQNTAAAKKSASDASASASEAATHATDAAASARAASTSAGQAASSAQSASSSAGTASTKA 180 Query: 181 TEASKSAAAAESSKSAAATSAGAAKTSETNASASLQSAATSASTATTKASEAATSARDAA 240 EA+KSAAAAESSKSAAATSA AAKTSETNA+AS +SAATSASTATTKASEAATSAR AA Sbjct: 181 REAAKSAAAAESSKSAAATSASAAKTSETNAAASQKSAATSASTATTKASEAATSARGAA 240 Query: 241 ASKEAAKSSETNASSSASSAASSATAAGNSAKAAKTSETNARSSETAAGQSASAAAGSKT 300 ASKEAAKSSETNASSSASSAASSATAAGNSAKAAKTSETNARSSETAAGQSASAAAGSKT Sbjct: 241 ASKEAAKSSETNASSSASSAASSATAAGNSAKAAKTSETNARSSETAAGQSASAAAGSKT 300 Query: 301 AAASSASAASTSAGQASASATAAGKSAESAASSASTATTKAGEATEQASAAARSASAAKT 360 AAASSASAASTSAGQASASATAAGKSAESAASSASTATTKAG+ATEQASAAARSASAAKT Sbjct: 301 AAASSASAASTSAGQASASATAAGKSAESAASSASTATTKAGKATEQASAAARSASAAKT 360 Query: 361 SETNAKASETSAESSKTAAASSASSAASSASSASASKDEATRQASAAKSSATTASTKATE 420 SETNAKASETSAESSKTAAASSASSAASSASSASASKDEATRQASAAK SATTASTKATE Sbjct: 361 SETNAKASETSAESSKTAAASSASSAASSASSASASKDEATRQASAAKGSATTASTKATE 420 Query: 421 AAGSATAAAQSKSTAESAATRAETAAKRAEDIASAVALEDASTTKKGIVQLSSATNSTSE 480 AAGSATAAAQSKSTAESAATRAETAAKRAEDIASAVALEDASTTKKGIVQLSSATNSTSE Sbjct: 421 AAGSATAAAQSKSTAESAATRAETAAKRAEDIASAVALEDASTTKKGIVQLSSATNSTSE 480 Query: 481 TLAATPKAVKSAYDNAEKRLQKDQNGADIPDKGCFL---NNINAVSKTDFADKRGMRYVR 537 TLAATPKAVK+A DNA R+ ++ +I ++ + G + + Sbjct: 481 TLAATPKAVKAANDNANGRVPSNRKVNGKALTADITLTPKDIGTLNSVTISFSGGAGWFK 540 Query: 538 V 538 + Sbjct: 541 L 541 Score = 
53.0 bits (125), Expect = 6e-05, Method: Composition-based stats. Identities = 123/340 (36%), Positives = 183/340 (53%), Gaps = 4/340 (1%) Query: 138 AREAATHAADAADSARAASTSAGQAASSAQSASSSAGTASTKATEASKSAAAAESSKSAA 197 AA AA SA A+T A +AA+SA+ A++S A + T AS SA++A SS +AA Sbjct: 208 ETNAAASQKSAATSASTATTKASEAATSARGAAASKEAAKSSETNASSSASSAASSATAA 267 Query: 198 ATSAGAAKTSETNASASLQSAATSASTATTKASEAATSARDAAASKEAAKSSETNASSSA 257 SA AAKTSETNA +S +A SAS A + AA+SA A+ S A +S T A SA Sbjct: 268 GNSAKAAKTSETNARSSETAAGQSASAAAGSKTAAASSASAASTSAGQASASATAAGKSA 327 Query: 258 SSAASSATAAGNSAKAAKTSETNARSSETAAGQSASAAAGSKTAAASSASAASTSAGQAS 317 SAASSA+ A A A + A S +AA S + A S+T+A SS +AA++SA A+ Sbjct: 328 ESAASSASTATTKAGKATEQASAAARSASAAKTSETNAKASETSAESSKTAAASSASSAA 387 Query: 318 ASATAAGKSAESAASSASTATTKAGEATEQASAAARSASAAKTSETNAKASETSAESSKT 377 +SA++A S + A AS A A A+ +A+ AA SA+AA S++ A+++ T AE++ Sbjct: 388 SSASSASASKDEATRQASAAKGSATTASTKATEAAGSATAAAQSKSTAESAATRAETAAK 447 Query: 378 AAASSASSAASSASSASASKDEATRQASAAKSSATT--ASTKATEAAGSATAAAQSKSTA 435 A AS A + AS +K + +SA S++ T A+ KA +AA + Sbjct: 448 RAEDIAS--AVALEDASTTKKGIVQLSSATNSTSETLAATPKAVKAANDNANGRVPSNRK 505 Query: 436 ESAATRAETAAKRAEDIASAVALEDASTTKKGIVQLSSAT 475 + +DI + ++ + + G +L++ T Sbjct: 506 VNGKALTADITLTPKDIGTLNSVTISFSGGAGWFKLATVT 545 >UniRef50_B7LN99 Putative tail fiber protein n=2 Tax=Enterobacteriaceae RepID=B7LN99_ESCF3 Length = 593 Score = 384 bits (986), Expect = e-104, Method: Composition-based stats. 
Identities = 212/361 (58%), Positives = 246/361 (68%), Gaps = 46/361 (12%) Query: 761 AIDNATGELVIGTKLSASLNGNALTATKLQTPRRVSGVEFDGSKDITLTAAHVAAFARRA 820 A DNA G + G K+ NG+AL S DI L+ V AF+ Sbjct: 278 ANDNANGRVPSGRKV----NGHAL------------------SSDIKLSPEDVNAFSLGC 315 Query: 821 TDTYADADGGVPWNAESGAYNVTRSGDSYILVNFYTGVGSCRTLQMKAHYRNGGLFYRSS 880 T Y +DGGVPWNA+SG YNV G SYI+ +F++GVGSCR+ Q++A Y+N GL+YRSS Sbjct: 316 TGQYPSSDGGVPWNAKSGLYNVMDGGASYIVAHFFSGVGSCRSFQLRADYKNRGLYYRSS 375 Query: 881 RDGYGFEEDWAEVYTSKNLPPESYPVGAPIPWPSDTVPSGYALMQGQAFDKSAYPKLAAA 940 RDGYGFE + P ++PVGAPI WPSD VP GYA+MQGQ FDK+AYP LAAA Sbjct: 376 RDGYGFERGFE--------PVNAFPVGAPIAWPSDIVPEGYAIMQGQTFDKAAYPLLAAA 427 Query: 941 YPSGVIPDMRGWTIKGKPASGRAVLSQEQDGIKSHTHSASASSTDLGTKTTSSFDYGTKS 1000 YPSGVIPDMRGWTIKGKPASGRAVLSQEQDGIKSHTHSASASSTDLGTKTTSSFDYGTK+ Sbjct: 428 YPSGVIPDMRGWTIKGKPASGRAVLSQEQDGIKSHTHSASASSTDLGTKTTSSFDYGTKT 487 Query: 1001 TNNTGAHTHSVSGSTNSAGAHTHSLANVNTASANSGAGSASTRLSV-VHNQNYATSSAGA 1059 + T +TN+ GAHTH++ G S + V V N +SS GA Sbjct: 488 VSTFNHGT----KTTNNTGAHTHTVG------GRYGGDSIGGKQRVQVSGTNQVSSSDGA 537 Query: 1060 HTHSLSGTAASAGAHAHTVGIGAHTHSVAIGSHGHTITVNAAGNAENTVKNIAFNYIVRL 1119 H H++ G H HTVGIGAH H+VA+G+HGHTITVNAAGNAENTVKNIAFNYIVRL Sbjct: 538 HAHTV-----DIGQHNHTVGIGAHAHTVALGAHGHTITVNAAGNAENTVKNIAFNYIVRL 592 Query: 1120 A 1120 A Sbjct: 593 A 593 Score = 76.5 bits (186), Expect = 5e-12, Method: Composition-based stats. 
Identities = 46/138 (33%), Positives = 63/138 (45%), Gaps = 8/138 (5%) Query: 416 TKATEAAGSATAAAQSKSTAESAATRAETAAKRAEDIASAVALEDASTTKKGIVQLSSAT 475 T+ G + ST+E+ A ++ + +DA+T +KGIVQLSSAT Sbjct: 202 DATTKQKGLVQLNSAVNSTSETQAATSKAVKTAYDLADGKYTAQDATTARKGIVQLSSAT 261 Query: 476 NSTSETLAATPKAVKSAYDNAEKRLQKDQNGADIPDKGCFLNNINAVSKTDFADKRG-MR 534 NS SETLAATPKAVK+A DNA R+ + ++I + A G Sbjct: 262 NSDSETLAATPKAVKAANDNANGRVPSGRKVNG----HALSSDIKLSPEDVNAFSLGCTG 317 Query: 535 YVRVN---APAGATSGKY 549 + P A SG Y Sbjct: 318 QYPSSDGGVPWNAKSGLY 335 Score = 50.3 bits (118), Expect = 4e-04, Method: Composition-based stats. Identities = 36/104 (34%), Positives = 55/104 (52%) Query: 445 AAKRAEDIASAVALEDASTTKKGIVQLSSATNSTSETLAATPKAVKSAYDNAEKRLQKDQ 504 A ++ ++ + DA+T +KG+VQL+SA NSTSET AAT KAVK+AYD A+ + Sbjct: 187 GAGAQKESRESLGVLDATTKQKGLVQLNSAVNSTSETQAATSKAVKTAYDLADGKYTAQD 246 Query: 505 NGADIPDKGCFLNNINAVSKTDFADKRGMRYVRVNAPAGATSGK 548 + N+ S+T A + ++ NA SG+ Sbjct: 247 ATTARKGIVQLSSATNSDSETLAATPKAVKAANDNANGRVPSGR 290 >UniRef50_C8U9W7 Probable tail fiber protein-like protein n=1 Tax=Escherichia coli O103:H2 str. 12009 RepID=C8U9W7_ECO10 Length = 377 Score = 315 bits (807), Expect = 5e-84, Method: Composition-based stats. Identities = 103/172 (59%), Positives = 118/172 (68%) Query: 527 FADKRGMRYVRVNAPAGATSGKYYPVVVMRSAGSVSELASRVIITTATRTAGDPMNNCEF 586 K+GM+ NAP+ A GK+YPV+ RS GS ELASRVIITT + MNNCEF Sbjct: 1 MDKKKGMQQYAFNAPSNAVGGKWYPVIFRRSTGSTGELASRVIITTTSAGGNYEMNNCEF 60 Query: 587 NGFVMPGGWTDRGRYAYGMFWQYQNNERAIHSIMMSNKGDDLRSVFYVDGAAFPVFAFIE 646 NG VMPGGWTDRG YA G F YQ NERAIHSI+ S K DD+ SVFYV+G AFPV E Sbjct: 61 NGMVMPGGWTDRGSYAAGYFSTYQTNERAIHSIVTSLKEDDVCSVFYVEGRAFPVRVSAE 120 Query: 647 DGLSISAPGADLVVNDTTYKFGATNPATECIAADVILDFKSGRGFYESHSLI 698 +GL++ P D V TTYK+GATNPATE A ILDF +GRGFY SHS+ Sbjct: 121 EGLTVIVPTQDYTVGQTTYKWGATNPATESTNAQAILDFNNGRGFYCSHSIF 172 >UniRef50_B5FQX9 Side tail fiber protein n=2 Tax=Salmonella enterica subsp. 
enterica RepID=B5FQX9_SALDC Length = 569 Score = 312 bits (798), Expect = 6e-83, Method: Composition-based stats. Identities = 171/406 (42%), Positives = 207/406 (50%), Gaps = 64/406 (15%) Query: 736 DGAKTYLLLTNQGDVYGGWNTLR-----PFAIDNATGE--LVIGTKLSASLNGNALTATK 788 D KTY + N ++Y G L+ D A G L + +K + N A + Sbjct: 207 DNTKTYFSVLNPLEIYLGSRYLQKDQNLSDVPDKAKGRSSLEVYSKTESDENYMAKSQCG 266 Query: 789 LQTPRRVSGVEFDGSKDITLTAAHVAAFARRA-----TDTYADADGGV--------PWNA 835 P + V+ G+ + TA A R T +D G+ + Sbjct: 267 ADIPNKPLFVQNIGALPASGTAVAANRLASRGALPALTGATRGSDSGLIMGEVYNNGYPT 326 Query: 836 ESGAYNVTRSGDSYILVNFYTGVGSCRTLQMKAHYRNGGLFYRSSRDGYGFE-EDWAEVY 894 + G ++ ++G + RS RD E +WA +Y Sbjct: 327 QYGNILRLTGTGDGEILIGWSGTNGAPAPA----------YIRSHRDTADAEWSEWAMLY 376 Query: 895 TSKNLPPESYPVGAPIPWPSDTVPSGYALMQGQAFDKSAYPKLAAAYPSGVIPDMRGWTI 954 TS N PP SYPVGA I WPSD P+GYALMQGQ+FDKSAYP LA AYPSG+IPDMRGWTI Sbjct: 377 TSLNPPPNSYPVGAAIAWPSDATPAGYALMQGQSFDKSAYPLLAIAYPSGIIPDMRGWTI 436 Query: 955 KGKPASGRAVLSQEQDGIKSHTHSASASSTDLGTKTTSSFDYGTKSTNNTGAHTHSVSGS 1014 KGKP SGRAVLSQE DG KSH+HSA A TDLGTK+TSSFDYGTKSTN TG HTH G Sbjct: 437 KGKPISGRAVLSQEMDGNKSHSHSARAQDTDLGTKSTSSFDYGTKSTNTTGNHTHQFGGY 496 Query: 1015 TNSAGAHTHSLANVNTASANSGAGSASTRLSVVHNQNYATSSAGAHTHSLSGTAASAGAH 1074 NS + N S G G+ + +AG H Sbjct: 497 INS------YWGDSNHTSFQPGGGAWTQ---------------------------AAGDH 523 Query: 1075 AHTVGIGAHTHSVAIGSHGHTITVNAAGNAENTVKNIAFNYIVRLA 1120 AHTV IG H H++ IG HGH + V+A GNAE TVKNIAFNYIVRLA Sbjct: 524 AHTVYIGGHEHTMYIGPHGHVVIVDADGNAETTVKNIAFNYIVRLA 569 >UniRef50_B4TP26 Side tail fiber protein n=43 Tax=Salmonella enterica RepID=B4TP26_SALSV Length = 892 Score = 306 bits (784), Expect = 3e-81, Method: Composition-based stats. 
Identities = 262/547 (47%), Positives = 325/547 (59%), Gaps = 16/547 (2%) Query: 1 MAVKISGVLKDGTGKPVQNCTIQLKAKRNSTTVVVNTLASENPDEAGRYSMDVEYGQYSV 60 M V ISGVLKD TG PVQNCTIQLKA R STTVVVNT+ASENPD+AGRYSMDVE GQY+V Sbjct: 1 MPVLISGVLKDATGTPVQNCTIQLKACRTSTTVVVNTVASENPDDAGRYSMDVEQGQYTV 60 Query: 61 ILLVEGFPPSHAGTITVYEDSQPGTLNDFLGAMTEDDARPEALRRFELMVEEVARNASAV 120 LLVEG+PPSHAG ITVY+DS+PGTLNDFLGAMTEDD RPEALRRFE MVEEVAR AS Sbjct: 61 TLLVEGYPPSHAGVITVYDDSKPGTLNDFLGAMTEDDVRPEALRRFEAMVEEVARQASEA 120 Query: 121 AQNTAAAKKSASDASTSAREAATHAADAADSARAASTSAGQAASSAQSASSSAGTASTKA 180 ++N +A +++ A TSA +AA A A ++A AA SA QAASSA SA SSAGTA+TKA Sbjct: 121 SRNATSAGQASEQAQTSAGQAAESATAAVNAAGAAEASATQAASSAASAESSAGTATTKA 180 Query: 181 TEASKSAAAAESSKSAAATSAGAAKTSETNASASLQSAATSASTATTKASEAATSARDAA 240 EAS SAA+A+++++AAA SA AAKTSE NA AS +A SA+ A A+ A TSA A Sbjct: 181 GEASASAASADTARTAAAASAAAAKTSEANADASRTAAGDSAAAAAASATAAQTSAARAG 240 Query: 241 ASKEAAKSSETNASSSASSAASSATAAGNSAKAAKTSETNARSSETAAGQSASAAAGSKT 300 AS+ AAK SET A+SSA A +SATAA S KAA S A+ SET A SAS AA S T Sbjct: 241 ASETAAKMSETQAASSAGDAGASATAAAASEKAAAASAAAAKISETNAATSASTAAASAT 300 Query: 301 AAASSASAASTSAGQASASATAAGKSAESAASSASTATTKAGEATEQASAAARSASAAKT 360 AA+SSAS AS A + SA+ A +S+ +A ++A+ A A A + A + ++ Sbjct: 301 AASSSASEASNHAAASDTSASLAAQSSTAAGAAATRAEDAAKRAEDIADVISLEDASLTK 360 Query: 361 SETNAKASETSAESSKTAAASSASSAASSASSASASKDEATRQASAAKSSATTASTKATE 420 +S T ++S AA A A D S A + T T + Sbjct: 361 KGIVKLSSATDSDSEALAATPKAVKTVMGEVQTKAPLD------SPAFTGTPTTPTPPDD 414 Query: 421 AAGSATA---------AAQSKSTAESAATRAETAAKRAEDIASAVALEDASTTKKGIVQL 471 A G TA AA S ES T E A D + A + + K+ + Sbjct: 415 AKGLQTANAEFVRKLIAALVGSVPESLDTLQELADALGNDPSFATTVLNKLAGKQPLDDT 474 Query: 472 SSATNSTSETLAATPKAVKSAYDNAEKRLQKDQNGADIPDKGCFLNNINAVSKTDFADKR 531 +A + S ++ ++A L K QNG DI DK F I AV+ T + Sbjct: 475 LTALSGKSVDGLIEYVGLRETINHAADALLKSQNGGDIQDKKQFARTIGAVTSTTISL-G 533 Query: 532 GMRYVRV 538 + ++ Sbjct: 534 ESGWFKI 540 >UniRef50_B3I2W7 Phage Tail Collar Domain family n=1 Tax=Escherichia coli 
E22 RepID=B3I2W7_ECOLX Length = 654 Score = 289 bits (738), Expect = 6e-76, Method: Composition-based stats. Identities = 152/363 (41%), Positives = 188/363 (51%), Gaps = 46/363 (12%) Query: 763 DNATGELVIGTKLSASLNGNALTATKLQTPRRVSGVEFDGSKDITLTAAHVAAFAR-RAT 821 +NA G + K+ NG+AL T R + FDG ++ + Sbjct: 333 ENANGRVPASRKV----NGHALNGDINVTSRDI----FDGQVIAIGANKNLDDYQVPGLY 384 Query: 822 DTYADADGGVPWNAESGAYNVTRSGDSYILVNFYTGVGSCRTLQMKAHYRNGGLFYRSSR 881 A+ + N S S +++ G G + ++ N Y S+ Sbjct: 385 FQEANNNTSAAMNYPE------NSAGSLMVLR---GAGVTQVYRVY----NSSRSYSRSK 431 Query: 882 DGYGFEEDWAEVYTSKNLPPESYPVGAPIPWPSDTVPSGYALMQGQAFDKSAYPKLAAAY 941 W +P +SYPVGAPIPWPSD P+GYALMQGQ FDK+ YP LA AY Sbjct: 432 YSTLAWTPW--------MPEDSYPVGAPIPWPSDVTPTGYALMQGQPFDKAVYPLLAIAY 483 Query: 942 PSGVIPDMRGWTIKGKPASGRAVLSQEQDGIKSHTHSASASSTDLGTKTTSSFDYGTKST 1001 P+G+IPDMRG TIKGKP +GRAVLS EQDG+ SHTH AS S TDLGTK TSSFDYG+K T Sbjct: 484 PAGIIPDMRGQTIKGKP-NGRAVLSYEQDGVISHTHGASISDTDLGTKYTSSFDYGSKPT 542 Query: 1002 NNTGAHTHSVSGSTNSAGAHTHSLANVNTA----SANSGAGSASTRLSVVHNQNYATSSA 1057 + S+ G H H+ T+ + G G S+ +S + Sbjct: 543 TSFDYGN----KSSTEGGWHAHNFRYCATSAYRDTPGQGLGMHSSNVSWAAGDRI--EGS 596 Query: 1058 GAHTHSLSGTAASAGAHAHTVGIGAHTHSVAIGSHGHTITVNAAGNAENTVKNIAFNYIV 1117 G H H G H H VGIGAH H V +G HGHT TV+AAGNAENTVKNIAFNYIV Sbjct: 597 GNHAH-----VTWIGPHDHWVGIGAHNHYVVMGYHGHTATVHAAGNAENTVKNIAFNYIV 651 Query: 1118 RLA 1120 RLA Sbjct: 652 RLA 654 Score = 63.8 bits (153), Expect = 4e-08, Method: Composition-based stats. 
Identities = 33/107 (30%), Positives = 48/107 (44%), Gaps = 4/107 (3%) Query: 421 AAGSATAAAQSKSTAESAATRAETAAKRAEDIASAVALEDASTTKKGIVQLSSATNSTSE 480 A S AA + A+ + + DAST++KG V+LSS TNS SE Sbjct: 260 AVASIDAAGNITDLRPKGTLNDQAASDALKKHEQSRNHPDASTSEKGFVRLSSETNSDSE 319 Query: 481 TLAATPKAVKSAYDNAEKRLQKDQNGADIPDKGCFLNNINAVSKTDF 527 +A TPKA+K+ +NA R+ + +IN S+ F Sbjct: 320 AMAVTPKALKAVNENANGRVPASRKVNG----HALNGDINVTSRDIF 362 >UniRef50_Q3YZL1 Phage protein-related n=42 Tax=Enterobacteriaceae RepID=Q3YZL1_SHISS Length = 1029 Score = 288 bits (736), Expect = 9e-76, Method: Composition-based stats. Identities = 503/566 (88%), Positives = 508/566 (89%), Gaps = 44/566 (7%) Query: 1 MAVKISGVLKDGTGKPVQNCTIQLKAKRNSTTVVVNTLASENPDEAGRYSMDVEYGQYSV 60 MAVKISGVLKDGTGKPVQNCTIQLKAKRNSTTVVVNTLASENPDEAGRYSMDVEYGQYSV Sbjct: 3 MAVKISGVLKDGTGKPVQNCTIQLKAKRNSTTVVVNTLASENPDEAGRYSMDVEYGQYSV 62 Query: 61 ILLVEGFPPSHAGTITVYEDSQPGTLNDFLGAMTEDDARPEALRRFELMVEEVARNASAV 120 ILLVEGFPPSHAGTITVYEDS+PGTLNDFLGAMTEDDARPEALRRFELMVEEVARNASAV Sbjct: 63 ILLVEGFPPSHAGTITVYEDSRPGTLNDFLGAMTEDDARPEALRRFELMVEEVARNASAV 122 Query: 121 AQNTAAAKKSASDASTSAREAATHAADAADSARAASTSAGQAASSAQSASSSAGTASTKA 180 AQNTAAAKKSASDASTSAREAATHA DAADSARAASTSAGQAASSAQSASSSAGTASTKA Sbjct: 123 AQNTAAAKKSASDASTSAREAATHATDAADSARAASTSAGQAASSAQSASSSAGTASTKA 182 Query: 181 TEASKSAAAAESSKSAAATSAGAAKTSETNASASLQSAATSASTATTKASEAATSARDAA 240 TEASKSAAAAESSKSAAATSAGAAKT ETNA+AS QSAATSASTATTKASEAATSARDA+ Sbjct: 183 TEASKSAAAAESSKSAAATSAGAAKTLETNAAASQQSAATSASTATTKASEAATSARDAS 242 Query: 241 ASKEAAKSSETNASSSASSAASSATAAGNSAKAAKTSETNARSSETAAGQSASAAAGSKT 300 ASKEAAKSSETNASSSASSAASSATAA NSAKAAKTSETNARSSETAAGQSASAAAGSKT Sbjct: 243 ASKEAAKSSETNASSSASSAASSATAAANSAKAAKTSETNARSSETAAGQSASAAAGSKT 302 Query: 301 AAASSASAASTSAGQASASATAAGKSAESAASSASTATTKAGEATEQASAAARSASAAKT 360 AAASSASAASTSAGQASASATAAGKSAESAASSASTATTKAGEATEQA+AAARSASAAKT Sbjct: 303 AAASSASAASTSAGQASASATAAGKSAESAASSASTATTKAGEATEQATAAARSASAAKT 362 Query: 361 
SETNAKASETSAESSKTAAASSASSAASSASSASASKDEATRQASAAKSSATTASTKATE 420 SETNAKASET AESSKTAAASSASSAASSASSASASKDEATRQASAAK SATTASTKATE Sbjct: 363 SETNAKASETRAESSKTAAASSASSAASSASSASASKDEATRQASAAKGSATTASTKATE 422 Query: 421 AAGSATAAAQSKSTAESAATRAETAAKRAEDIASAVALEDASTTKKGIV----------- 469 AAGSATAAAQSKSTAESAATRAETAAKRAEDIASAVALEDASTTKKGIV Sbjct: 423 AAGSATAAAQSKSTAESAATRAETAAKRAEDIASAVALEDASTTKKGIVQLSSATNSTSE 482 Query: 470 ---------------------------------QLSSATNSTSETLAATPKAVKSAYDNA 496 QLSSATNSTSETLAATPKAVK+A DNA Sbjct: 483 SLAATPKAVKAAYDLANGKYTAQDATTAQKGIIQLSSATNSTSETLAATPKAVKAANDNA 542 Query: 497 EKRLQKDQNGADIPDKGCFLNNINAV 522 EKRLQKDQNGADIP K F NI A Sbjct: 543 EKRLQKDQNGADIPGKDTFTKNIGAC 568 Score = 277 bits (707), Expect = 2e-72, Method: Composition-based stats. Identities = 198/262 (75%), Positives = 206/262 (78%), Gaps = 17/262 (6%) Query: 876 FYRSSRDGYGFE-EDWAEVYTSKNLPPESYPVGAPIPWPSDTVPSGYALMQGQAFDKSAY 934 F RS RD WA++YTS + P E YPVGAPIPWPSDTVPSGYALMQGQ FDKSAY Sbjct: 768 FIRSRRDTTDANWSPWAQLYTSAHPPAEFYPVGAPIPWPSDTVPSGYALMQGQTFDKSAY 827 Query: 935 PKLAAAYPSGVIPDMRGWTIKGKPASGRAVLSQEQDGIKSHTHSASASSTDLGTKTTSSF 994 PKLAAAYPSGVIPDMRGWTIKGKPASGRAVLSQEQDGIKSHTHSASASSTDLGTKTTSSF Sbjct: 828 PKLAAAYPSGVIPDMRGWTIKGKPASGRAVLSQEQDGIKSHTHSASASSTDLGTKTTSSF 887 Query: 995 DYGTKSTNNTGAHTHSVSGSTNSAGAHTHSLANVNTASANSGAGSASTRLSVVHNQNY-- 1052 DYGTKSTNNTGAHTHS+SGST+SAGAH HS T S + G + V N Sbjct: 888 DYGTKSTNNTGAHTHSLSGSTSSAGAHQHSQTGPRTNSGSQPTGMFPAGSTQVSGTNQVG 947 Query: 1053 --------------ATSSAGAHTHSLSGTAASAGAHAHTVGIGAHTHSVAIGSHGHTITV 1098 +SS G HTHSLSGTAASAGAHAHTVGIGAHTHSVAIGSHGHTITV Sbjct: 948 ISGSLTSGTSQWVGKSSSEGNHTHSLSGTAASAGAHAHTVGIGAHTHSVAIGSHGHTITV 1007 Query: 1099 NAAGNAENTVKNIAFNYIVRLA 1120 NAAGNAENTVKNIAFNYIVRLA Sbjct: 1008 NAAGNAENTVKNIAFNYIVRLA 1029 >UniRef50_B3XIH6 GpH n=7 Tax=Enterobacteriaceae RepID=B3XIH6_ECOLX Length = 710 Score = 285 bits (728), Expect = 8e-75, Method: Composition-based stats. 
Identities = 190/360 (52%), Positives = 220/360 (61%), Gaps = 28/360 (7%) Query: 761 AIDNATGELVIGTKLSASLNGNALTATKLQTPRRVSGVEFDGSKDITLTAAHVAAFARRA 820 A DNA G + G K++ + + +G + + L + Sbjct: 379 ANDNANGRVPSGRKVNG---KPLTNDVNVTSQDIFNGQSINIGANQNLDNYKTPGLYHQP 435 Query: 821 TDTYADADGGVPWNAESGAYNVTRSGDSYILVNFYTGVGSCRTLQMKAHYRNGGLFYRSS 880 + Y A P N + +G + Q+ Y + RS Sbjct: 436 LNAYTSAALKYPENLAGTLVVLKNAGIT----------------QIYYVYNTSRSYTRSQ 479 Query: 881 RDGYGFEEDWAEVYTSKNLPPESYPVGAPIPWPSDTVPSGYALMQGQAFDKSAYPKLAAA 940 + W P +S+PVGA IPWPSD+VP+GYA+MQGQ FDK+ YP LAAA Sbjct: 480 YSTGDWTA-WT--------PQDSFPVGAAIPWPSDSVPTGYAVMQGQTFDKTTYPLLAAA 530 Query: 941 YPSGVIPDMRGWTIKGKPASGRAVLSQEQDGIKSHTHSASASSTDLGTKTTSSFDYGTKS 1000 YPSGV+PDMRGWTIKGKPASGR VLS EQDGIKSHTHSASAS+TDLGTKTTSSFDYGTKS Sbjct: 531 YPSGVLPDMRGWTIKGKPASGRDVLSLEQDGIKSHTHSASASNTDLGTKTTSSFDYGTKS 590 Query: 1001 TNNTGAHTHSVSGSTNSAGAHTHSLANVNTASANSGAGSASTRLSVVHNQNYATSSAGAH 1060 TNNTGAHTH+VSG+ NSAGAHTH++ S S N S+GAH Sbjct: 591 TNNTGAHTHNVSGTANSAGAHTHTVPLRRPNSGGMNFDWLDGASSGTVVGNGTVPSSGAH 650 Query: 1061 THSLSGTAASAGAHAHTVGIGAHTHSVAIGSHGHTITVNAAGNAENTVKNIAFNYIVRLA 1120 THS+SGTA SAGAHAHTVGIGAHTHSVAIGSHGHTITVNAAGNAENTVKNIAFNYIVRLA Sbjct: 651 THSVSGTATSAGAHAHTVGIGAHTHSVAIGSHGHTITVNAAGNAENTVKNIAFNYIVRLA 710 Score = 100 bits (249), Expect = 3e-19, Method: Composition-based stats. 
Identities = 72/251 (28%), Positives = 110/251 (43%), Gaps = 6/251 (2%) Query: 279 TNARSSETAAGQSASAAAGSKTAAASSASAASTSAGQASASATAAGKSAESAASSASTAT 338 +R A+ + S + S + A+T +A A GK AS+ Sbjct: 164 EQSRRHPDASLTAKGFVQLSSATNSVSETQAATPKAVKAAYDLANGKYTAQDASTTRKGL 223 Query: 339 TKAGEATEQASAAARSASAAKTSETNAKASETSAESSKTAAAS--SASSAASSASSASAS 396 + AT S + A + + + +A+ + TA SSA +S S + A+ Sbjct: 224 VQLSSATNSDSETLAATPKAVKAAYDLANGKYTAQDATTARKGLIQLSSATNSTSESLAA 283 Query: 397 KDEATRQASAAKSSATTASTKATEAAGSATAAAQSKSTAESAATRAETAAKRAEDIASAV 456 +A + A + TA T G + + ST+E+ A + + Sbjct: 284 TPKAVKAAYELANGKYTAQDATTAQKGIVQLSNATNSTSETLAATPKAVKAAYDLANGKY 343 Query: 457 ALEDASTTKKGIVQLSSATNSTSETLAATPKAVKSAYDNAEKRLQKDQNGADIPDKGCFL 516 +DA+T +KG+VQLSSATNSTSETLAATPKAVK+A DNA R+ + P Sbjct: 344 TAQDATTARKGLVQLSSATNSTSETLAATPKAVKAANDNANGRVPSGRKVNGKP----LT 399 Query: 517 NNINAVSKTDF 527 N++N S+ F Sbjct: 400 NDVNVTSQDIF 410 Score = 80.7 bits (197), Expect = 3e-13, Method: Composition-based stats. Identities = 79/294 (26%), Positives = 120/294 (40%), Gaps = 11/294 (3%) Query: 255 SSASSAASSATAAGNSAKAAKTSETNARSSETAAGQSASAAAGSKTAAASSASAASTSAG 314 ++A S + + +T T + ++ A + + + S Sbjct: 109 NTAESYKPAVAEGSGRTQTFRTILTVSSTATVALTVDNTMVMATVDYVDNKLKEHEQSRR 168 Query: 315 QASASATAAGKSAESAASSASTATTKAGEATEQASAAARSASAAKTSETNAKASETSAES 374 AS TA G S+A+++ + T AA A A N K + A S Sbjct: 169 HPDASLTAKGFVQLSSATNSVSETQ----------AATPKAVKAAYDLANGKYTAQDA-S 217 Query: 375 SKTAAASSASSAASSASSASASKDEATRQASAAKSSATTASTKATEAAGSATAAAQSKST 434 + SSA +S S A+ +A + A + TA T G ++ + ST Sbjct: 218 TTRKGLVQLSSATNSDSETLAATPKAVKAAYDLANGKYTAQDATTARKGLIQLSSATNST 277 Query: 435 AESAATRAETAAKRAEDIASAVALEDASTTKKGIVQLSSATNSTSETLAATPKAVKSAYD 494 +ES A + E +DA+T +KGIVQLS+ATNSTSETLAATPKAVK+AYD Sbjct: 278 SESLAATPKAVKAAYELANGKYTAQDATTAQKGIVQLSNATNSTSETLAATPKAVKAAYD 337 Query: 495 NAEKRLQKDQNGADIPDKGCFLNNINAVSKTDFADKRGMRYVRVNAPAGATSGK 548 A + + N+ S+T A + ++ NA SG+ Sbjct: 338 LANGKYTAQDATTARKGLVQLSSATNSTSETLAATPKAVKAANDNANGRVPSGR 391 >UniRef50_Q6KGF6 Putative tail 
fiber protein GP37 n=2 Tax=unclassified Myoviridae RepID=Q6KGF6_9CAUD Length = 782 Score = 284 bits (725), Expect = 2e-74, Method: Composition-based stats. Identities = 131/358 (36%), Positives = 180/358 (50%), Gaps = 71/358 (19%) Query: 768 ELVIGTKLSASLNGNALTATKLQTPRRVSGVEFDGSKDITLTAAHVAAFARRATDTYADA 827 +L++G SAS+N A ++Q P S ++ T + ++ A++ D+ Sbjct: 429 QLILGNS-SASINKTLTLAGQIQ-PSDFSNLDARYYTQSTANSRYMLAYSSGTGTEVGDS 486 Query: 828 DGGVPWNAESGAYNVT--RSGDSYILVNFYTGVGSCRTLQMKAHYRNGGLFYRSSRDGYG 885 DG + WNA++G YNVT G + ++ Y G S + Q+K +YRNGG +YRSSRDG+G Sbjct: 487 DG-IAWNAKTGLYNVTGYSGGSTQLVFQMYQGASSTPSAQLKFNYRNGGFWYRSSRDGFG 545 Query: 886 FEEDWAEVYTSKNLPPES---------------------------YPVGAPIPWPSDTVP 918 FEED+ ++YT K P S YPVG + S+ P Sbjct: 546 FEEDFTQIYTEKYKPTPSAIGAYTKAETDQKIAEAISDSTDLNKIYPVGIVTWFNSNVNP 605 Query: 919 SGYALMQGQAFDKSAYPKLAAAYPSGVIPDMRGWTIKGKPASGRAV--------LSQEQD 970 + +A P L Y + + G TI+ A+G V ++ Sbjct: 606 N------------TALPGLTWTYLNNGV----GRTIRIAAANGSDVATTGGSDSVTLSVG 649 Query: 971 GIKSHTHSASASSTDLGTKTTSSFDYGTKSTNNTGAHTHSVSGSTNSAGAHTHSLANVNT 1030 + SHTHS SA TTSSFDYGTK+TN TGAHTHSVSGSTN+ GAHTH+ Sbjct: 650 NLPSHTHSFSA--------TTSSFDYGTKTTNTTGAHTHSVSGSTNNTGAHTHTFG---- 697 Query: 1031 ASANSGAGSASTRLSV-VHNQNYATSSAGAHTHSLSGTAASAGAHAHTVGIGAHTHSV 1087 G S + V V +S AG H+H++ GTAAS G HAHTVGIGAH+H+V Sbjct: 698 --GRYGGDSIGGKHRVHVSGTEQVSSVAGDHSHTVYGTAASNGNHAHTVGIGAHSHTV 753 >UniRef50_B4T041 Gp19 n=1 Tax=Salmonella enterica subsp. enterica serovar Newport str. SL254 RepID=B4T041_SALNS Length = 580 Score = 272 bits (694), Expect = 6e-71, Method: Composition-based stats. 
Identities = 126/221 (57%), Positives = 139/221 (62%), Gaps = 33/221 (14%) Query: 900 PPESYPVGAPIPWPSDTVPSGYALMQGQAFDKSAYPKLAAAYPSGVIPDMRGWTIKGKPA 959 P S P G P+PWPSDT+P+GYALMQGQAFDK+ YP LA AYPSG IPDMRGWTIKGKP Sbjct: 393 PMMSCPPGVPLPWPSDTIPAGYALMQGQAFDKNVYPLLAIAYPSGTIPDMRGWTIKGKPV 452 Query: 960 SGRAVLSQEQDGIKSHTHSASASSTDLGTKTTSSFDYGTKSTNNTGAHTHSVSGSTNSAG 1019 SGRAVLSQE DG KSH+H A A TDLGTK TSSFDYGTKS+N TG H HS G+ Sbjct: 453 SGRAVLSQELDGNKSHSHGARALDTDLGTKGTSSFDYGTKSSNTTGGHNHSAGGT----- 507 Query: 1020 AHTHSLANVNTASANSGAGSASTRLSVVHNQNYATSSAGAHTHSLSGTAASAGAHAHTVG 1079 G S + V + N +S G HAHT Sbjct: 508 ---------------YGGDSIGGKARVQRDGNDQLTS-------------WNGDHAHTTW 539 Query: 1080 IGAHTHSVAIGSHGHTITVNAAGNAENTVKNIAFNYIVRLA 1120 IG H H+V IG HGH + V+A GNAE TVKNIAFNYIVRLA Sbjct: 540 IGPHDHTVYIGPHGHVVIVDADGNAETTVKNIAFNYIVRLA 580 Score = 76.5 bits (186), Expect = 5e-12, Method: Composition-based stats. Identities = 38/113 (33%), Positives = 57/113 (50%) Query: 415 STKATEAAGSATAAAQSKSTAESAATRAETAAKRAEDIASAVALEDASTTKKGIVQLSSA 474 + G + + S +E+ A + + + +DA+TT+KGIVQLSS Sbjct: 167 PDATLKEKGFTQLSNATDSESETLAATPKAVKAAYDLADAKYTAQDATTTRKGIVQLSSV 226 Query: 475 TNSTSETLAATPKAVKSAYDNAEKRLQKDQNGADIPDKGCFLNNINAVSKTDF 527 T+S E AATPKAVK A DNA KRL K++N AD+ + ++ + Sbjct: 227 TDSNDENQAATPKAVKIAMDNANKRLAKERNLADLTNIQQARQSLQLGNSATL 279 >UniRef50_B5PP06 Side tail fiber protein n=2 Tax=Salmonella enterica subsp. enterica RepID=B5PP06_SALHA Length = 534 Score = 268 bits (684), Expect = 1e-69, Method: Composition-based stats. 
Identities = 130/232 (56%), Positives = 149/232 (64%), Gaps = 34/232 (14%) Query: 876 FYRSSRDGYGFE-EDWAEVYTSKNLPPESYPVGAPIPWPSDTVPSGYALMQGQAFDKSAY 934 + RS RD E +WA +YT+ N PP+S+PVGAPI WPSD P+GYALMQGQ+FDKSAY Sbjct: 213 YIRSHRDTADAEWSEWAMLYTTLNPPPDSHPVGAPIAWPSDATPAGYALMQGQSFDKSAY 272 Query: 935 PKLAAAYPSGVIPDMRGWTIKGKPASGRAVLSQEQDGIKSHTHSASASSTDLGTKTTSSF 994 P LA AYPSGVIPDMRGWTIKGKPASGRA+LSQE DG KSH+HSA A TDLGTKTTSSF Sbjct: 273 PLLAIAYPSGVIPDMRGWTIKGKPASGRAILSQEMDGNKSHSHSARAQDTDLGTKTTSSF 332 Query: 995 DYGTKSTNNTGAHTHSVSGSTNSAGAHTHSLANVNTASANSGAGSASTRLSVVHNQNYAT 1054 DYGTKSTN TG HT+ G NS + N S G G+ + Sbjct: 333 DYGTKSTNTTGNHTNQFGGYINS------YWGDSNHTSFQPGGGAWTQ------------ 374 Query: 1055 SSAGAHTHSLSGTAASAGAHAHTVGIGAHTHSVAIGSHGHTITVNAAGNAEN 1106 +AG HAHTV IG H H++ IG HGH + V+A GNAE Sbjct: 375 ---------------AAGDHAHTVYIGGHEHTMYIGPHGHVVIVDADGNAET 411 Score = 83.0 bits (203), Expect = 7e-14, Method: Composition-based stats. Identities = 40/165 (24%), Positives = 60/165 (36%), Gaps = 5/165 (3%) Query: 388 SSASSASASKDEATRQASAAKSSATTASTK---ATEAAGSATAAAQSKSTAESAATRAET 444 A D + T + AT A A A S E+ T E Sbjct: 1 MGEVQTKAPLDSPALTGTPTAPMPETTAAGIEIATAAFVVAKVAQLVGSAPEALDTLQEL 60 Query: 445 AAKRAEDIASAVALEDASTTKKGIVQLSSATNSTSETLAATPKAVKSAYDNAEKRLQKDQ 504 A D A+ + + K+ + + +A + S +++ ++A L K Q Sbjct: 61 ADALGNDPNFAITVLNKLAGKQPLDETLTALSGKSADGFIEYISLRETINHAADALHKSQ 120 Query: 505 NGADIPDKGCFLNNINAV--SKTDFADKRGMRYVRVNAPAGATSG 547 NG DIP+K F+ NI A+ S T A R + A G T G Sbjct: 121 NGGDIPEKPLFVQNIGALPASGTAVAANRLASRGALPALTGTTRG 165 >UniRef50_Q3ZL14 Tail fiber protein n=2 Tax=Enterobacteriaceae RepID=Q3ZL14_ESCBL Length = 289 Score = 267 bits (682), Expect = 2e-69, Method: Composition-based stats. 
Identities = 110/217 (50%), Positives = 134/217 (61%), Gaps = 12/217 (5%) Query: 905 PVGAPIPWPSDTVPSGYALMQGQAFDKSAYPKLAAAYPSGVIPDMRGWTIKGKPASGRAV 964 P G + WP T P+G+ALM GQ FD +AYP+LA AYPSGVIPDMRG TIK PASGR + Sbjct: 84 PPGIALAWPGATAPTGFALMLGQTFDTTAYPRLAQAYPSGVIPDMRGQTIKFLPASGRTL 143 Query: 965 LSQEQDGIKSHTHSASASSTDLGTKTTSSFDYGTKSTNNTGAHTHSVSGSTNSAGAHTHS 1024 LS E DG+KSH+HS S S+TDLGT T + D GTK T+ G H H N A + Sbjct: 144 LSLEADGVKSHSHSGSISTTDLGTATAADTDLGTKQTSQDGLHNHVSDSRFNKLMARSSD 203 Query: 1025 LANV-NTASANSGAGSASTRLSVVHNQNYATSSAGAHTHSLSGTAASAGAHAHTVGIGAH 1083 + NT +S + R+S +++ +A S A +G H HTV IG H Sbjct: 204 IDGTNNTGDVDSDNPESEHRVSGMNDSLWAAS-----------VIADSGLHMHTVYIGPH 252 Query: 1084 THSVAIGSHGHTITVNAAGNAENTVKNIAFNYIVRLA 1120 HSV IG HGHT+T++ GN ENTVKNIAFN IVRLA Sbjct: 253 AHSVYIGPHGHTVTISNFGNTENTVKNIAFNAIVRLA 289 >UniRef50_A4TT73 Phage tail protein n=30 Tax=Enterobacteriaceae RepID=A4TT73_YERPP Length = 962 Score = 259 bits (660), Expect = 6e-67, Method: Composition-based stats. Identities = 161/254 (63%), Positives = 191/254 (75%), Gaps = 27/254 (10%) Query: 893 VYTSKNLPPESYPVGAPIPWPSDTVPSGYALMQGQAFDKSAYPKLAAAYPSGVIPDMRGW 952 +Y+S PPESYPVGAPIPWP+D PSG+A+MQGQ FDKS YPKLAAAYPSGV+PDMRGW Sbjct: 710 LYSSVLPPPESYPVGAPIPWPNDVAPSGFAIMQGQTFDKSVYPKLAAAYPSGVLPDMRGW 769 Query: 953 TIKGKPASGRAVLSQEQDGIKSHTHSASASSTDL----------GTKTTSSFDYGTKSTN 1002 IKGKP S RAVLS EQDGIKSH H+A+ASSTDL GTKT+S FDYGTKS+N Sbjct: 770 MIKGKPTS-RAVLSLEQDGIKSHAHNAAASSTDLGTKPTTTFDYGTKTSSGFDYGTKSSN 828 Query: 1003 NTGAHTHSVSGSTNSAGAHTHSLA-------NVNTASANSGAGSASTRLSVVHNQNYATS 1055 +TGAH HS+SGST+S+GAH H++ + ++ + N+ +T+ + + N TS Sbjct: 829 STGAHAHSLSGSTSSSGAHAHTVTAHTQYPRSTDSRNQNAVGKQYNTQQTTANAFNVWTS 888 Query: 1056 SAGAHTHSLSGTAASAGAHAHTVGIGA---------HTHSVAIGSHGHTITVNAAGNAEN 1106 SAG H HS+SGTA SAGAHAHTVGIGA H+HSVAIG+H HTIT+ A GNAEN Sbjct: 889 SAGDHAHSISGTAVSAGAHAHTVGIGAHAHSLSIGSHSHSVAIGAHSHTITIAACGNAEN 948 Query: 1107 TVKNIAFNYIVRLA 1120 TVKNIA+NYIVRLA Sbjct: 949 TVKNIAYNYIVRLA 962 Score = 115 bits (288), Expect = 
8e-24, Method: Composition-based stats. Identities = 117/472 (24%), Positives = 201/472 (42%), Gaps = 12/472 (2%) Query: 79 EDSQPGTLNDFLGAMTEDDARPEALRRFELMVEEVARNASAVAQNTAAAKKSASDASTSA 138 ED T+ + E +A + + A A+ AA +S+S+A A Sbjct: 118 EDGTEVTVKSLTQIVDEHNANQKWYTDNADAINAAGEKAREAAERALAAAQSSSEARAKA 177 Query: 139 REAATHAADAADSARAASTSAGQAASSAQSASSSAGTASTKATEASKSAAAAESSKSAAA 198 EAA +A A++ AA SA + +S A+ SA ++ A+ A S + +S++ AA Sbjct: 178 DEAAQSSASASEYKTAAELSAAASKASEHGAAESAASSKASASAAKTSEDNSAASETNAA 237 Query: 199 TSAGAAKTSETNASASLQSAATSASTATTKASEAATSARDAAASKEAAKSSETNASSSAS 258 S AA S ++++ S A A +A T AA S AA S+ A++S+ A ++A+ Sbjct: 238 ESKAAAALSASSSANSASEALQYAESAKTSKEAAAASEAAAANSENEARTSKDTAVAAAA 297 Query: 259 SAASSATAAGNSAKAAKT--SETNARSSETAAGQSASAAAGSKTAAASSASAASTSAGQA 316 A+++AT+A S T +E + E A + ++ + A+ +A + + Sbjct: 298 EASANATSADASRHDVDTNKAEVSRMKDEVFAARDSTIQYSEEAKTAADTAAREAATKTS 357 Query: 317 SASATAAGKSAESAASSASTATTKAGEATEQASAAARSASAAK-TSETNAKASETSAESS 375 +A AE A S++++A A +A A A +K T ++ S+ Sbjct: 358 DQLLSAVKSEAEKANSASASAQGFADDAKRFRDEAQEIAEGSKVNDATTSQQGVVQLSSA 417 Query: 376 KTAAASSASSAASSASSASASKDEATRQASAAKSSATTASTKATEAAG---------SAT 426 + + + +S + + + S A S A TA T A AAG ++ Sbjct: 418 TDSESETLASTPKAVKTVMDAVALKAPIDSPALSGAPTAPTPAITAAGREIATAAFVASK 477 Query: 427 AAAQSKSTAESAATRAETAAKRAEDIASAVALEDASTTKKGIVQLSSATNSTSETLAATP 486 A S E+ T E AA D A + + K+ + +A + S Sbjct: 478 VAQLVGSAPEALDTLNELAAALGNDPNFATTITNMLARKQPLDGTLTALSGRSPQGVIDY 537 Query: 487 KAVKSAYDNAEKRLQKDQNGADIPDKGCFLNNINAVSKTDFADKRGMRYVRV 538 + + + A +QK QNGADIPDK F+ NI AVS + + + ++ Sbjct: 538 LGLLNTVNLAAGSIQKSQNGADIPDKRLFVKNIGAVSSARISFVKESGWYKL 589 >UniRef50_Q99362 Protein 37 (Fragment) n=1 Tax=Enterobacteria phage T4 RepID=Q99362_BPT4 Length = 382 Score = 257 bits (656), Expect = 2e-66, Method: Composition-based stats. 
Identities = 159/278 (57%), Positives = 180/278 (64%), Gaps = 44/278 (15%) Query: 882 DGYGFEEDWAEVYTSKNLPPESYPVGAPIPWPSDTVPSGYALMQGQAFDKSAYPKLAAAY 941 F + + + K SYP+GAPIPWP+DT P+GYALM+GQ FD AYPKLAAAY Sbjct: 110 GSGNFANLNSTIESLKTDIVSSYPIGAPIPWPTDTPPNGYALMEGQTFDTRAYPKLAAAY 169 Query: 942 PSGVIPDMRGWTIKGKPASGRAVLSQEQDGIKSHTHSASASSTDLGTKTTSSFDYGTKST 1001 PSG IPDMRG TIKGKP SGRAVLS E DG+KSHTH ASAS+TDLGTKTTSSFDYGTK+T Sbjct: 170 PSGTIPDMRGQTIKGKP-SGRAVLSTEADGVKSHTHGASASNTDLGTKTTSSFDYGTKTT 228 Query: 1002 ----------NNTGAHTHSVSGSTNSAGAHTHSLANV--------------------NTA 1031 N TG H H+VSG+T+SAGAH H+ + N Sbjct: 229 SSFDYGTKTSNTTGNHNHTVSGTTSSAGAHQHARSGPQLSNGISTNIFPDGYSDVGTNYN 288 Query: 1032 SANSGAGSASTRLSVVHNQNYATSSAGAHTHSLSGT---------AASAGAHAHTVGIGA 1082 S SG S+ ++ TS+ GAHTH+ SGT GAH HTVGIGA Sbjct: 289 SKFSGTVIGSSVPCIIG----KTSNDGAHTHTWSGTTSTTGNHAHTVGIGAHTHTVGIGA 344 Query: 1083 HTHSVAIGSHGHTITVNAAGNAENTVKNIAFNYIVRLA 1120 HTH+VAIGSHGHTITVNA GN ENTVKNIAFNYIVRLA Sbjct: 345 HTHTVAIGSHGHTITVNATGNTENTVKNIAFNYIVRLA 382 >UniRef50_B3X2T1 Side tail fiber protein n=1 Tax=Shigella dysenteriae 1012 RepID=B3X2T1_SHIDY Length = 488 Score = 256 bits (654), Expect = 3e-66, Method: Composition-based stats. 
Identities = 186/241 (77%), Positives = 194/241 (80%), Gaps = 13/241 (5%) Query: 893 VYTSKNLPPESYPVGAPIPWPSDTVPSGYALMQGQAFDKSAYPKLAAAYPSGVIPDMRGW 952 VY+ + YP GAPIPWPSDTVPSGYALMQGQ FDKSAYPKLA AYPSGVIPDMRGW Sbjct: 248 VYSLYTPSEQFYPPGAPIPWPSDTVPSGYALMQGQTFDKSAYPKLAVAYPSGVIPDMRGW 307 Query: 953 TIKGKPASGRAVLSQEQDGIKSHTHSASASSTDLGTKTTSSFDYGTKSTNNTGAHTHSVS 1012 TIKGKPASGRAVLSQEQDGIKSHTHSASASSTDLGTKTTSSFDYGTKSTNNTGAHTHS+S Sbjct: 308 TIKGKPASGRAVLSQEQDGIKSHTHSASASSTDLGTKTTSSFDYGTKSTNNTGAHTHSIS 367 Query: 1013 GSTNSAGAHTHS----LANVNTASANSG---------AGSASTRLSVVHNQNYATSSAGA 1059 G+ NSAGAH H NT+ +G ++T S TSS GA Sbjct: 368 GTANSAGAHQHKSSGAFGGTNTSIFPNGYTAISNLSAGIMSTTSGSGQTRNAGKTSSDGA 427 Query: 1060 HTHSLSGTAASAGAHAHTVGIGAHTHSVAIGSHGHTITVNAAGNAENTVKNIAFNYIVRL 1119 HTHSLSGTAASAGAHAHTVGIGAHTHSVAIGSHGHTITVNAAGNAENTVKNIAFNYIVRL Sbjct: 428 HTHSLSGTAASAGAHAHTVGIGAHTHSVAIGSHGHTITVNAAGNAENTVKNIAFNYIVRL 487 Query: 1120 A 1120 A Sbjct: 488 A 488 Score = 74.6 bits (181), Expect = 2e-11, Method: Composition-based stats. Identities = 43/127 (33%), Positives = 63/127 (49%), Gaps = 3/127 (2%) Query: 415 STKATEAAGSATAAAQSKSTAESAATRAETAAKRAEDIASAVALEDASTTKKGIVQLSSA 474 +T G ++ + ST+ES A + + +DA+T +KGIVQLSSA Sbjct: 4 EDASTTKKGIVQLSSATNSTSESQAATPKAVKAAYDLANGKYTAQDATTAQKGIVQLSSA 63 Query: 475 TNSTSETLAATPKAVKSAYDNAEKRLQKDQNGADIPDKGCFL---NNINAVSKTDFADKR 531 TNSTSETLAATPKAVK+A DNA R+ + +I ++ T + Sbjct: 64 TNSTSETLAATPKAVKAANDNANGRVPSARKVNGKALSADITLTPKDIGTLNSTTMSFSG 123 Query: 532 GMRYVRV 538 G + ++ Sbjct: 124 GAGWFKL 130 Score = 45.3 bits (105), Expect = 0.012, Method: Composition-based stats. 
Identities = 42/85 (49%), Positives = 52/85 (61%) Query: 456 VALEDASTTKKGIVQLSSATNSTSETLAATPKAVKSAYDNAEKRLQKDQNGADIPDKGCF 515 +ALEDASTTKKGIVQLSSATNSTSE+ AATPKAVK+AYD A + Sbjct: 1 MALEDASTTKKGIVQLSSATNSTSESQAATPKAVKAAYDLANGKYTAQDATTAQKGIVQL 60 Query: 516 LNNINAVSKTDFADKRGMRYVRVNA 540 + N+ S+T A + ++ NA Sbjct: 61 SSATNSTSETLAATPKAVKAANDNA 85 >UniRef50_P03764 Side tail fiber protein n=2 Tax=root RepID=STF_LAMBD Length = 774 Score = 254 bits (649), Expect = 1e-65, Method: Composition-based stats. Identities = 316/522 (60%), Positives = 356/522 (68%), Gaps = 6/522 (1%) Query: 1 MAVKISGVLKDGTGKPVQNCTIQLKAKRNSTTVVVNTLASENPDEAGRYSMDVEYGQYSV 60 MAVKISGVLKDGTGKPVQNCTIQLKA+RNSTTVVVNT+ SENPDEAGRYSMDVEYGQYSV Sbjct: 1 MAVKISGVLKDGTGKPVQNCTIQLKARRNSTTVVVNTVGSENPDEAGRYSMDVEYGQYSV 60 Query: 61 ILLVEGFPPSHAGTITVYEDSQPGTLNDFLGAMTEDDARPEALRRFELMVEEVARNASAV 120 IL V+GFPPSHAGTITVYEDSQPGTLNDFL AMTEDDARPE LRR ELMVEEVARNAS V Sbjct: 61 ILQVDGFPPSHAGTITVYEDSQPGTLNDFLCAMTEDDARPEVLRRLELMVEEVARNASVV 120 Query: 121 AQNTAAAKKSASDASTSAREAATHAADAADSARAASTSAGQAASSAQSASSSAGTASTKA 180 AQ+TA AKKSA DAS SA + A DA DSARAASTSAGQAASSAQ ASS A AS KA Sbjct: 121 AQSTADAKKSAGDASASAAQVAALVTDATDSARAASTSAGQAASSAQEASSGAEAASAKA 180 Query: 181 TEASKSAAAAESSKSAAATSAGAAKTSETNASASLQSAATSASTATTKASEAATSARDAA 240 TEA KSAAAAESSK+AAATSAGAAKTSETNA+AS QSAATSASTA TKASEAATSARDA Sbjct: 181 TEAEKSAAAAESSKNAAATSAGAAKTSETNAAASQQSAATSASTAATKASEAATSARDAV 240 Query: 241 ASKEAAKSSETNASSSASSAASSATAAGNSAKAAKTSETNARSSETAAGQSASAAAGSKT 300 ASKEAAKSSETNASSSA AASSATAA NSA+AAKTSETNARSSETAA +SASAAA +KT Sbjct: 241 ASKEAAKSSETNASSSAGRAASSATAAENSARAAKTSETNARSSETAAERSASAAADAKT 300 Query: 301 AAASSASAASTSAGQASASATAAGKSAESAASSASTATTKAGEATEQASAAARSASAAKT 360 AAA SAS AST A +A+ SA +A +S +A ++A A A A + ASA A + Sbjct: 301 AAAGSASTASTKATEAAGSAVSASQSKSAAEAAAIRAKNSAKRAEDIASAVALEDADTTR 360 Query: 361 SETNAKASETSAESSKTAAASSASSAASSASSASASKDEATRQASAAKSSA---TTASTK 417 +S T++ S AA A ++ A D + +A T + Sbjct: 361 
KGIVQLSSATNSTSETLAATPKAVKVVMDETNRKAPLDSPALTGTPTAPTALRGTNNTQI 420 Query: 418 ATEAAGSATAAAQSKSTAESAATRAETAAKRAEDIASAVALEDASTTKKGIVQLSSATNS 477 A A A A ++ ++ T E AA D A + +A K+ +A Sbjct: 421 ANTAFVLAAIADVIDASPDALNTLNELAAALGNDPDFATTMTNALAGKQPKNATLTALAG 480 Query: 478 TSETLAATPKAVKSAYDNAEKRLQKDQNGADIPDKGCFLNNI 519 S P A ++A + Q G DI K + + Sbjct: 481 LSTAKNKLPY---FAENDAASLTELTQVGRDILAKNSVADVL 519 Score = 242 bits (616), Expect = 9e-62, Method: Composition-based stats. Identities = 171/248 (68%), Positives = 183/248 (73%), Gaps = 28/248 (11%) Query: 901 PESYPVGAPIPWPSDTVPSGYALMQGQAFDKSAYPKLAAAYPSGVIPDMRGWTIKGKPAS 960 ++P GAPIPWPSD VPSGY LMQGQAFDKSAYPKLA AYPSGV+PDMRGWTIKGKPAS Sbjct: 527 NSAFPAGAPIPWPSDIVPSGYVLMQGQAFDKSAYPKLAVAYPSGVLPDMRGWTIKGKPAS 586 Query: 961 GRAVLSQEQDGIKSHTHSASASSTDLGTKTTS----------SFDYGTKSTNNTGAHTHS 1010 GRAVLSQEQDGIKSHTHSASAS TDLGTKTTS SFDYGTKSTNNTGAH HS Sbjct: 587 GRAVLSQEQDGIKSHTHSASASGTDLGTKTTSSFDYGTKTTGSFDYGTKSTNNTGAHAHS 646 Query: 1011 VSGSTNSAGAHTHSLANVNTASANSGAGSAS---------TRLSVVHNQNYATSSAGAHT 1061 +SGST +AGAH H+ +S S G+A+ + T S G+H+ Sbjct: 647 LSGSTGAAGAHAHTSGLRMNSSGWSQYGTATITGSLSTVKGTSTQGIAYLSKTDSQGSHS 706 Query: 1062 HSLSGTAASAGAHAHTVGIGAHTHSVA---------IGSHGHTITVNAAGNAENTVKNIA 1112 HSLSGTA SAGAHAHTVGIGAH H V IGSHGHTITVNAAGNAENTVKNIA Sbjct: 707 HSLSGTAVSAGAHAHTVGIGAHQHPVVIGAHAHSFSIGSHGHTITVNAAGNAENTVKNIA 766 Query: 1113 FNYIVRLA 1120 FNYIVRLA Sbjct: 767 FNYIVRLA 774 >UniRef50_A8A0A4 L-shaped tail fiber protein n=12 Tax=root RepID=A8A0A4_ECOHS Length = 1258 Score = 251 bits (640), Expect = 1e-64, Method: Composition-based stats. 
Identities = 283/342 (82%), Positives = 298/342 (87%), Gaps = 7/342 (2%) Query: 1 MAVKISGVLKDGTGKPVQNCTIQLKAKRNSTTVVVNTLASENPDEAGRYSMDVEYGQYSV 60 MAV+ISGVLKDG GKP+QNCTIQLKA+RNSTTVVVNT+ASENPDEAGRYSMDVEYGQYSV Sbjct: 1 MAVRISGVLKDGAGKPIQNCTIQLKARRNSTTVVVNTVASENPDEAGRYSMDVEYGQYSV 60 Query: 61 ILLVEGFPPSHAGTITVYEDSQPGTLNDFLGAMTEDDARPEALRRFELMVEEVARNASAV 120 ILLVEGFPPSHAGTITVYEDSQPGTLNDFLGAMTEDD RPEALRRFELMVEEVARNASAV Sbjct: 61 ILLVEGFPPSHAGTITVYEDSQPGTLNDFLGAMTEDDVRPEALRRFELMVEEVARNASAV 120 Query: 121 AQNTAAAKKSASDASTSAREAATHAADAADSARAASTSAGQAASSAQSASSSAGTASTKA 180 AQNTAAAKKSASDA TSAREAATHA DAADSARAASTSAGQAASSAQSASSSAGTASTKA Sbjct: 121 AQNTAAAKKSASDARTSAREAATHATDAADSARAASTSAGQAASSAQSASSSAGTASTKA 180 Query: 181 TEASKSAAAAESSKSAAATSAGAAKTSETNASASLQSAATSASTATTKASEAATSARDAA 240 TEASKSAAAAESSKSAAATSAGAAKTSETNA+AS +SAATSASTATTKASEAATSARDA+ Sbjct: 181 TEASKSAAAAESSKSAAATSAGAAKTSETNAAASQKSAATSASTATTKASEAATSARDAS 240 Query: 241 ASKEAAKSSETNASSSASSAASSATAAGNSAKAAKTSETNARSSETAAGQSA-------S 293 ASK AAKSSET+A+SSA SAASSATAAGNSAKAAKTSE NA +S AA S + Sbjct: 241 ASKVAAKSSETSAASSAGSAASSATAAGNSAKAAKTSEMNADNSAQAAADSQTASANSAT 300 Query: 294 AAAGSKTAAASSASAASTSAGQASASATAAGKSAESAASSAS 335 AA S+T A +S SAA S A AS A + + S Sbjct: 301 AAKKSETNAKNSESAAKVSETNAKASENKAKEYLDKVGGLVS 342 >UniRef50_B7LKX7 Putative side tail phage protein n=2 Tax=Escherichia RepID=B7LKX7_ESCF3 Length = 567 Score = 248 bits (633), Expect = 8e-64, Method: Composition-based stats. 
Identities = 125/235 (53%), Positives = 158/235 (67%), Gaps = 31/235 (13%) Query: 896 SKNLPPESYPVGAPIPWPSDTVPSGYALMQGQAFDKSAYPKLAAAYPSGVIPDMRGWTIK 955 K + ES PVG PIPWPSD+VPSGYALM GQ F+K++YPKLA AYPSGVIPDMRGW IK Sbjct: 354 DKAIAAESCPVGMPIPWPSDSVPSGYALMTGQTFNKTSYPKLAIAYPSGVIPDMRGWIIK 413 Query: 956 GKPASGRAVLSQEQDGIKSHTHSASAS----------STDLGTKTTSSFDYGTKSTNNTG 1005 GKP+SGRA+LS E DG+KSH H+ S S STDLGTKTT+SF++G+++T+ +G Sbjct: 414 GKPSSGRAILSTELDGVKSHNHTGSISSTNLGTITSTSTDLGTKTTASFNHGSRNTSTSG 473 Query: 1006 AHTHSVSGSTNSAGAHTHSLANVNTASANSGAGSASTRLSVVHNQNYATSSAGAHTHSLS 1065 HTH + + + G SL N + N T SAG+H HS+ Sbjct: 474 EHTHRIP-TDGAEGKDGPSLWNSPNSDENY---------------REPTESAGSHYHSI- 516 Query: 1066 GTAASAGAHAHTVGIGAHTHSVAIGSHGHTITVNAAGNAENTVKNIAFNYIVRLA 1120 + GAHAHT+ +G+HTH++ +G+H H+I +N GN ENTVKNIAFNYIVRLA Sbjct: 517 ----TIGAHAHTIALGSHTHNIVLGTHNHSIIINNTGNTENTVKNIAFNYIVRLA 567 Score = 56.8 bits (135), Expect = 5e-06, Method: Composition-based stats. Identities = 32/77 (41%), Positives = 39/77 (50%), Gaps = 5/77 (6%) Query: 440 TRAETAAKRAEDIASAVALEDASTTKKGIVQLSSATNSTSETLAATPKAVKSAYDNAEKR 499 E + + DA+ T+KG QLS+ATNS ET AATPKAVK+AYD A + Sbjct: 188 ATREYVDDALKTHQQSRNHPDATLTQKGFTQLSNATNSDDETKAATPKAVKTAYDLANSK 247 Query: 500 LQKDQNGA-----DIPD 511 N A IPD Sbjct: 248 AATSHNHAWSQITGIPD 264 Score = 45.7 bits (106), Expect = 0.010, Method: Composition-based stats. 
Identities = 35/145 (24%), Positives = 56/145 (38%) Query: 352 ARSASAAKTSETNAKASETSAESSKTAAASSASSAASSASSASASKDEATRQASAAKSSA 411 + ++ TN+ +A A +++ ++ S A Sbjct: 212 TQKGFTQLSNATNSDDETKAATPKAVKTAYDLANSKAATSHNHAWSQITGIPDGTLTQKG 271 Query: 412 TTASTKATEAAGSATAAAQSKSTAESAATRAETAAKRAEDIASAVALEDASTTKKGIVQL 471 T + + AA S A A + + D + T+KG+V+L Sbjct: 272 VVKLNNTTNSTSTTEAATPSAVKAAMDKAIAAAPSSHTHAWGQITGIPDGTLTQKGVVKL 331 Query: 472 SSATNSTSETLAATPKAVKSAYDNA 496 ++ATNSTS T AATP AVK+A D A Sbjct: 332 NNATNSTSTTEAATPNAVKAAMDKA 356 >UniRef50_C6UHV3 Predicted tail fiber protein n=22 Tax=Escherichia RepID=C6UHV3_ECOBR Length = 792 Score = 239 bits (610), Expect = 4e-61, Method: Composition-based stats. Identities = 217/352 (61%), Positives = 253/352 (71%) Query: 2 AVKISGVLKDGTGKPVQNCTIQLKAKRNSTTVVVNTLASENPDEAGRYSMDVEYGQYSVI 61 AVKISGVLKDG GKP+QNCTIQLKAKRNSTTV+VNT+ASENPDEAGRYSMDVEYGQYSV Sbjct: 3 AVKISGVLKDGAGKPIQNCTIQLKAKRNSTTVLVNTVASENPDEAGRYSMDVEYGQYSVT 62 Query: 62 LLVEGFPPSHAGTITVYEDSQPGTLNDFLGAMTEDDARPEALRRFELMVEEVARNASAVA 121 LLVEGFPPSHAGTITVYE S+PGTLNDFLGAMTEDD PEALRRFE MVEE ARNA A + Sbjct: 63 LLVEGFPPSHAGTITVYEGSRPGTLNDFLGAMTEDDVMPEALRRFEAMVEEAARNAEAAS 122 Query: 122 QNTAAAKKSASDASTSAREAATHAADAADSARAASTSAGQAASSAQSASSSAGTASTKAT 181 Q+ AAAKKS + A++S A T AA+SA+AA+ S +A+SA +A S A T Sbjct: 123 QSAAAAKKSETAAASSKNAAKTSETHAANSAQAAAASQTASANSATAAKKSENNAKNSET 182 Query: 182 EASKSAAAAESSKSAAATSAGAAKTSETNASASLQSAATSASTATTKASEAATSARDAAA 241 A S A+SS++AA TS AKTSET A +S +AA S S A A+ AA SA AA Sbjct: 183 AAKTSETNAKSSQAAAKTSETNAKTSETAAKSSQAAAAESESAAAGSATSAAGSATAAAN 242 Query: 242 SKEAAKSSETNASSSASSAASSATAAGNSAKAAKTSETNARSSETAAGQSASAAAGSKTA 301 S++AAK+SETNA SS ++A +S T A S AAK S+ A SE+AA SASAAA S TA Sbjct: 243 SQKAAKTSETNAKSSQTAAKTSETNAKASETAAKNSQDAAAQSESAAAGSASAAASSATA 302 Query: 302 AASSASAASTSAGQASASATAAGKSAESAASSASTATTKAGEATEQASAAAR 353 +A+S AA TS A AS TAA SA+++A+S + A A E AS AA Sbjct: 303 SANSQKAAKTSETNAKASETAAANSAKASAASQTAAKASEDAAREYASQAAE 354 Score = 80.7 bits (197), Expect = 3e-13, 
Method: Composition-based stats. Identities = 99/224 (44%), Positives = 130/224 (58%), Gaps = 14/224 (6%) Query: 194 KSAAATSAGAAKTSETNASASLQSAATSASTATTKASEAATSARDAAASKEAAKSSETNA 253 ++AAA+S AAKTSET+A+ S Q+AA S + + A+ A S +A S+ AAK+SETNA Sbjct: 132 ETAAASSKNAAKTSETHAANSAQAAAASQTASANSATAAKKSENNAKNSETAAKTSETNA 191 Query: 254 SSSASSAASSATAAGNSAK--------------AAKTSETNARSSETAAGQSASAAAGSK 299 SS ++A +S T A S AA S T+A S TAA S AA S+ Sbjct: 192 KSSQAAAKTSETNAKTSETAAKSSQAAAAESESAAAGSATSAAGSATAAANSQKAAKTSE 251 Query: 300 TAAASSASAASTSAGQASASATAAGKSAESAASSASTATTKAGEATEQASAAARSASAAK 359 T A SS +AA TS A AS TAA S ++AA S S A A A A+A+A S AAK Sbjct: 252 TNAKSSQTAAKTSETNAKASETAAKNSQDAAAQSESAAAGSASAAASSATASANSQKAAK 311 Query: 360 TSETNAKASETSAESSKTAAASSASSAASSASSASASKDEATRQ 403 TSETNAKASET+A +S A+A+S ++A +S +A +A Sbjct: 312 TSETNAKASETAAANSAKASAASQTAAKASEDAAREYASQAAEP 355 Score = 75.0 bits (182), Expect = 2e-11, Method: Composition-based stats. Identities = 98/234 (41%), Positives = 138/234 (58%) Query: 236 ARDAAASKEAAKSSETNASSSASSAASSATAAGNSAKAAKTSETNARSSETAAGQSASAA 295 AA+SK AAK+SET+A++SA +AA+S TA+ NSA AAK SE NA++SETAA S + A Sbjct: 132 ETAAASSKNAAKTSETHAANSAQAAAASQTASANSATAAKKSENNAKNSETAAKTSETNA 191 Query: 296 AGSKTAAASSASAASTSAGQASASATAAGKSAESAASSASTATTKAGEATEQASAAARSA 355 S+ AA +S + A TS A +S AA +S +AA SA++A A A AA S Sbjct: 192 KSSQAAAKTSETNAKTSETAAKSSQAAAAESESAAAGSATSAAGSATAAANSQKAAKTSE 251 Query: 356 SAAKTSETNAKASETSAESSKTAAASSASSAASSASSASASKDEATRQASAAKSSATTAS 415 + AK+S+T AK SET+A++S+TAA +S +AA S S+A+ S A A+A+ +S A Sbjct: 252 TNAKSSQTAAKTSETNAKASETAAKNSQDAAAQSESAAAGSASAAASSATASANSQKAAK 311 Query: 416 TKATEAAGSATAAAQSKSTAESAATRAETAAKRAEDIASAVALEDASTTKKGIV 469 T T A S TAAA S + ++ T A+ + A + AS A + Sbjct: 312 TSETNAKASETAAANSAKASAASQTAAKASEDAAREYASQAAEPYKQVLQPLPD 365 Score = 54.5 bits (129), Expect = 2e-05, Method: Composition-based stats. 
Identities = 77/190 (40%), Positives = 106/190 (55%) Query: 285 ETAAGQSASAAAGSKTAAASSASAASTSAGQASASATAAGKSAESAASSASTATTKAGEA 344 ETAA S +AA S+T AA+SA AA+ S ++ SATAA KS +A +S + A T A Sbjct: 132 ETAAASSKNAAKTSETHAANSAQAAAASQTASANSATAAKKSENNAKNSETAAKTSETNA 191 Query: 345 TEQASAAARSASAAKTSETNAKASETSAESSKTAAASSASSAASSASSASASKDEATRQA 404 +AA S + AKTSET AK+S+ +A S++AAA SA+SAA SA++A+ S+ A Sbjct: 192 KSSQAAAKTSETNAKTSETAAKSSQAAAAESESAAAGSATSAAGSATAAANSQKAAKTSE 251 Query: 405 SAAKSSATTASTKATEAAGSATAAAQSKSTAESAATRAETAAKRAEDIASAVALEDASTT 464 + AKSS T A T T A S TAA S+ A + + A +A A A+A A + Sbjct: 252 TNAKSSQTAAKTSETNAKASETAAKNSQDAAAQSESAAAGSASAAASSATASANSQKAAK 311 Query: 465 KKGIVQLSSA 474 +S Sbjct: 312 TSETNAKASE 321 >UniRef50_A9DEM1 Tail protein 2 n=1 Tax=Yersinia phage PY100 RepID=A9DEM1_9CAUD Length = 255 Score = 235 bits (600), Expect = 6e-60, Method: Composition-based stats. Identities = 102/216 (47%), Positives = 124/216 (57%), Gaps = 32/216 (14%) Query: 905 PVGAPIPWPSDTVPSGYALMQGQAFDKSAYPKLAAAYPSGVIPDMRGWTIKGKPASGRAV 964 PVGAP+ WPSDT P G+ALM GQ FDK YP LA YPSGV+PDMRG IK KP GRAV Sbjct: 72 PVGAPLAWPSDTAPDGWALMIGQTFDKVKYPLLAKVYPSGVLPDMRGRVIKAKP-DGRAV 130 Query: 965 LSQEQDGIKSHTHSASASSTDLGTKTTSSFDYGTKSTNNTGAHTHSVSGSTNSAGAHTHS 1024 LS E+D +KSHTH+ A++ GT+ TS+FD+G K T G HTH G+ ++ Sbjct: 131 LSLEEDQVKSHTHTGKAATAG-GTRATSTFDHGNKRTTTNGNHTHGSPQGARHGGSGQYT 189 Query: 1025 LANVNTASANSGAGSASTRLSVVHNQNYATSSAGAHTHSLSGTAASAGAHAHTVGIGAHT 1084 + T S N+ +SA AG H H V IG H Sbjct: 190 SGDDETNSV----------------FNWPATSA-------------AGDHFHDVQIGPHN 220 Query: 1085 HSVAIGSHGHTITVNAAGNAENTVKNIAFNYIVRLA 1120 H+V I +H HT+ ++A G ENTVKNIA NYIVRLA Sbjct: 221 HNVDI-NHEHTLQIDATGGTENTVKNIAMNYIVRLA 255 >UniRef50_Q66BF2 Hypothetical phage protein n=1 Tax=Yersinia pseudotuberculosis RepID=Q66BF2_YERPS Length = 711 Score = 232 bits (592), Expect = 5e-59, Method: Composition-based stats. 
Identities = 100/379 (26%), Positives = 157/379 (41%), Gaps = 21/379 (5%) Query: 1 MAVKISGVLKDGTGKPVQNCTIQLKAKRNSTTVVVNTLASENPDEAGRYSMDVEYGQYSV 60 M+V +SG++ + G+PV N I L A NS TV+ A+ D G Y + +E G YS+ Sbjct: 1 MSVTVSGIMINPVGEPVVNAQITLTAVTNSLTVLNAFSATVRTDGVGTYRIQLEEGSYSI 60 Query: 61 ILLVEGFPPSHAGTITVYEDSQPGTLNDFL-GAMTEDDARPEALRRFELMVEEVARNASA 119 + G + G +T+ + P TLN L + E + P+ + F + ++VA + + Sbjct: 61 TVAANGRSFVY-GAVTLDNTTGPSTLNQLLKQQIMESELTPDVILYFRQIQQQVANDLAT 119 Query: 120 VAQNTAAAKKSASDASTSAREAATHAADAADSARAASTSAGQAASSAQSASSSAGTASTK 179 + +A +A A S EA +A D +++ A QA SA +++ S A+ Sbjct: 120 IKVLEISATDAAESAGHSRDEAMLYAKDLSEALATAKGYRDQAGISADASALSQQEAAIS 179 Query: 180 ATEASKSAAAAESSKSAAAT----------------SAGAAKTSETNASASLQSAATSAS 223 T A SA +A S+ A + S AA+ + +++ A A Sbjct: 180 ETSAKASADSALLSEQNALSYRDSAQSAAATAADDASTLAAERTAEKIKLQVKTDADRAE 239 Query: 224 TATTKASEAATSARDAAASKEAAKSSETNASSSASSAASSATAAGNSAKAAKTSETNARS 283 A + + +S D A + T A+ +A + AT A NS A SE A Sbjct: 240 AARIASEQIKSSVDDTAQTVAQQHGETTQAAIAARDSEVKATTAANS---AVQSEALAAI 296 Query: 284 SETAAGQSASAAAGSKTAAASSASAASTSAGQASASATAAGKSAESAASSASTATTKAGE 343 S A Q+A + K AA A A QA ASA + G + + AG+ Sbjct: 297 SAETARQNAGISTVDKNAAKGFRDEAEGFAQQAHASAESVGDVMPKTGGAFTGPVELAGD 356 Query: 344 ATEQASAAARSASAAKTSE 362 ATE E Sbjct: 357 ATEPLEPVTFQQFERTGGE 375 Score = 45.7 bits (106), Expect = 0.011, Method: Composition-based stats. 
Identities = 53/201 (26%), Positives = 82/201 (40%), Gaps = 4/201 (1%) Query: 308 AASTSAGQASASATAAGKSAESAASSASTATTKAGEATEQASAAARSASAAKTSETNAKA 367 +A+ +A A S A A+ + + +TA +A A A+A S A SET+AKA Sbjct: 126 SATDAAESAGHSRDEAMLYAKDLSEALATAKGYRDQAGISADASALSQQEAAISETSAKA 185 Query: 368 SETSAESSKTAAASSASSAASSASSASASKDEATRQASAAKSSATTASTKATEAAGSATA 427 S SA S+ A S +A SA++ +A A+ T A A + A Sbjct: 186 SADSALLSEQNALSY-RDSAQSAAATAADDASTLAAERTAEKIKLQVKTDADRAEAARIA 244 Query: 428 AAQSKSTAESAATR--AETAAKRAEDIASAVALEDASTTKKGIVQLSSATNSTSETLAAT 485 + Q KS+ + A + IA+ + A+T VQ S A + S A Sbjct: 245 SEQIKSSVDDTAQTVAQQHGETTQAAIAARDSEVKATTAANSAVQ-SEALAAISAETARQ 303 Query: 486 PKAVKSAYDNAEKRLQKDQNG 506 + + NA K + + G Sbjct: 304 NAGISTVDKNAAKGFRDEAEG 324 >UniRef50_B5YU13 Tail fiber protein n=140 Tax=root RepID=B5YU13_ECO5E Length = 451 Score = 229 bits (583), Expect = 5e-58, Method: Composition-based stats. Identities = 172/384 (44%), Positives = 213/384 (55%), Gaps = 23/384 (5%) Query: 1 MAVKISGVLKDGTGKPVQNCTIQLKAKRNSTTVVVNTLASENPDEAGRYSMDVEYGQYSV 60 MAVKISGVLKDGTGKPV+NCTIQLKA+R S+TVVVNT+ASENPDEAGRYSMDVEYGQYSV Sbjct: 15 MAVKISGVLKDGTGKPVENCTIQLKARRTSSTVVVNTVASENPDEAGRYSMDVEYGQYSV 74 Query: 61 ILLVEGFPPSHAGTITVYEDSQPGTLNDFLGAMTEDDARPEALRRFELMVEEVARNASAV 120 ILLVEGFPPSHAGTITVYEDSQPGTLNDFLGAM+EDD RPEALRRFELMV Sbjct: 75 ILLVEGFPPSHAGTITVYEDSQPGTLNDFLGAMSEDDVRPEALRRFELMV-------EEA 127 Query: 121 AQNTAAAKKSASDASTSAREAATHAADAADSARAASTSAGQAASSAQSASSSAGTASTKA 180 A++ AKK+A +A TSAR A A+ A +SA A TSAG A+ SA+ A+ SA A Sbjct: 128 ARHAEEAKKNAGEAETSARNAGISASQAEESAANADTSAGDASESARQAAESAAAAKQSE 187 Query: 181 TEASKSAAAAESSKSAAATSAGAAKTSETNASASLQSAATSASTATTKASEAATSARDAA 240 +S SA+AA S ++ SA A+ S A ++ +AA A+TAT KA E+A SA+ A Sbjct: 188 EASSSSASAAAQKASESSQSAAEAELSRKTAESAAGNAARDATTATEKARESAESAQSAE 247 Query: 241 ASKEAAKSSETNASSSASSAASSATAAGNSAKAAKTSETNARSSETAAGQSASAAAGSKT 300 S+ AA+ + + + G + + + G + A A + Sbjct: 248 QSRIAAE-------EAVNRIPTVVGPPGPKGEPGPAGPQGPKGDKGERGDTGPAGATGER 300 Query: 301 
AAASSASAASTSAGQASASATAAGKSAESAASSASTATTKAGEATEQASAAARSASAAKT 360 A A A G E + + G + AA + Sbjct: 301 GPAGDAGPAGPQ--------GPKGDRGERGETGLTGNAGPQGPKGDTG-AAGPAGPQGPK 351 Query: 361 SETNAKASETSAESSKTAAASSAS 384 ET A + + Sbjct: 352 GETGAAGPVGATGPQGPKGDPGET 375 >UniRef50_B5YYN6 Tail fiber protein n=60 Tax=root RepID=B5YYN6_ECO5E Length = 645 Score = 229 bits (583), Expect = 5e-58, Method: Composition-based stats. Identities = 128/524 (24%), Positives = 186/524 (35%), Gaps = 60/524 (11%) Query: 1 MAVKISGVLKDGTGKPVQNCTIQLKAKRNSTTVVVNTLASENPDEAGRYSMDVEYGQYSV 60 M+V +SG LK G+ + I L A S + T AS E G Y M ++ G+Y+V Sbjct: 1 MSVVVSGTLKSPDGEAISGANITLTALTVSPDALSGTSASAVTREGGYYGMTMDPGEYAV 60 Query: 61 ILLVEGFPPSHAGTITVYEDSQPGTLNDFLGA-MTEDDARPEALRRFELMVEEVARNASA 119 + V+G + G + + TLN L + E E L F + VA + + Sbjct: 61 SVTVKGKTVVY-GRVRIEGTESTVTLNMLLRRSLVEVSIPGELLTDFRQIQNNVADDLAT 119 Query: 120 VAQNTAAAKKSASDASTSAREAATHAADAADSARAASTSAGQAASSAQSASSSAGTASTK 179 + + + A+ S AA A A+DSA+ A++ A +A A A+ +A Sbjct: 120 IRRLNEDTATKNTQATQSKESAAASAKSASDSAKTATSRAAEAGQKATDATEAA------ 173 Query: 180 ATEASKSAAAAESSKSAAATSAGAAKTSETNASASLQSAATSASTATTKASEAATSARDA 239 + A T+AG A+ S T A S ++A A A A +A Sbjct: 174 ---------------TRAVTAAGNAEESSTRAGESEKAAGADAEKARQHAEKAR------ 212 Query: 240 AASKEAAKSSETNASSSASSAASSATAAGNSAKAAKTSETNARSSETAAGQSASAAAGSK 299 A SA A AA SA+ A+ NAR G++ G K Sbjct: 213 ------------LAQESAGEILKRAEAATVSAEEARRMAENARGPRGPQGET-----GPK 255 Query: 300 TAAASSASAASTSAGQASASATAAGKSAESAASSASTATTKAGEATEQASAAARSASAAK 359 A G+ + A A GE EQ + K Sbjct: 256 GDVGPKGETGPVG---PQGPAGPKGERGDVGAQGAVGPAGPRGEKGEQGERGPQGIPGLK 312 Query: 360 TSETNAKASETSAESSKTAAASSASSAASSASSASASKDEATRQASAAKSSATTASTKAT 419 +T + + + + EA Q T Sbjct: 313 -GDTGERGPKGDQGDMGPKGEKGDPGGPAGPQGPKGERGEAGPQGPMGA-RGERGETGPR 370 Query: 420 EAAGSATAAAQSKSTAESAATRAETAAKRAEDIASAVALEDASTTKKGIVQLSSATNSTS 479 G A + T R E SA + DA+T +KGIVQLSSAT+S Sbjct: 371 GEPGPAGPRGERGETGPQGP-RGEPGPA-----GSAANVADATTAQKGIVQLSSATDSDD 424 Query: 480 
ETLAATPKAVKSAYDNAE---KRLQKDQNGADIPDKGCFLNNIN 520 ET AATPKAVK+A D A + ++ G +P + Sbjct: 425 ETKAATPKAVKAAMDVANEAKTKAEEAAAGGGVPGPKGDKGDTG 468 >UniRef50_B6IAV4 Putative phage tail fiber protein n=1 Tax=Escherichia coli SE11 RepID=B6IAV4_ECOSE Length = 590 Score = 227 bits (577), Expect = 2e-57, Method: Composition-based stats. Identities = 107/376 (28%), Positives = 168/376 (44%), Gaps = 17/376 (4%) Query: 1 MAVKISGVLKDGTGKPVQNCTIQLKAKRNSTTVVVNTLASENPDEAGRYSMDVEYGQYSV 60 M+V I G L DG G P+ C I LK++ N++ VV+ T A G YS + G+Y V Sbjct: 1 MSVLIYGALTDGAGIPMSGCHIILKSRVNTSEVVMRTEADVVTGNNGEYSFEARTGKYRV 60 Query: 61 ILLVEGFPPSHA-GTITVYEDSQPGTLNDFLGAMTEDDARPEALRRFELMVEEVARNASA 119 L +G+ + G I VY+D++PGTLNDFL A E D +P+ ++RFE MV + ++A + Sbjct: 61 YLK-QGWRDEYCVGDIAVYDDAKPGTLNDFLTAPDEGDLKPDVVKRFERMVAQAQQSAES 119 Query: 120 VAQNTAAAKKSASDASTSAREAATHAADAADSARAASTSAGQAASSAQSASSSAGTASTK 179 A++ A + +DA + T A + +A A + A +AG Sbjct: 120 AAESEQQAGQHVADAQKIKEDCQTLADNVQLNATAVAEDKQHVEHLAAEVEQNAGQMQQG 179 Query: 180 ATEASKSAAAAESSKSAAATSAGAAKTSETNASASLQSAATSASTATTKASEAATSARDA 239 + + A+ + +A+SA +K + NA+ S QSA + A +A Sbjct: 180 VQSVTDAVKQAQQAADDSASSAEESKNNADNAARSEQSA--------------KSHADNA 225 Query: 240 AASKEAAKSSETNASSSASSAASSATAAGNSAKAAKTSETNARSSETAAGQSASAAAGSK 299 A S + AKS N + + A TA + A+ NAR TA A A Sbjct: 226 ARSAQNAKSHADNVAGNTLQTAQDVTATAAARDDAERFAENARQDATATACDRKATAEDV 285 Query: 300 TAAASSASAASTSAGQASASATAAGKSAESAASSASTATTKAGEATEQASAAARSASAAK 359 +A SA+++ SA A+ A AA ++ + +G +E A AA A Sbjct: 286 KSAGESAASSEQSARVAAGYARAAEQAKNDIDVLLANTLKTSGNLSEIA-AAGEQAQQES 344 Query: 360 TSETNAKASETSAESS 375 K++ T + Sbjct: 345 RDNLGLKSAATMEPQA 360 >UniRef50_D2TJ16 Putative phage tail fibre protein n=1 Tax=Citrobacter rodentium ICC168 RepID=D2TJ16_CITRO Length = 538 Score = 225 bits (574), Expect = 5e-57, Method: Composition-based stats. 
Identities = 79/255 (30%), Positives = 129/255 (50%) Query: 1 MAVKISGVLKDGTGKPVQNCTIQLKAKRNSTTVVVNTLASENPDEAGRYSMDVEYGQYSV 60 M+V ISG L +G G P+ C I L A N++ VV + A D AG+Y+ + + G+Y+V Sbjct: 1 MSVLISGALINGAGVPMAGCKIYLDALVNTSEVVTESFAVIETDAAGQYAFEAQKGKYTV 60 Query: 61 ILLVEGFPPSHAGTITVYEDSQPGTLNDFLGAMTEDDARPEALRRFELMVEEVARNASAV 120 + + P G I+VY+DS+PGTLNDFL A+ E D +P+ ++RFE MV + ++A A Sbjct: 61 HIKQKNGPKCCVGDISVYDDSKPGTLNDFLTALDEGDLKPDVVKRFEEMVAQAQQSAEAA 120 Query: 121 AQNTAAAKKSASDASTSAREAATHAADAADSARAASTSAGQAASSAQSASSSAGTASTKA 180 A++ A + +DA + T A + + A + + A AG Sbjct: 121 AESEQQAGQHVADAQQIKSDCETLADNVQQNTNAVEENTQRVEQLASEVGLHAGQVQQGV 180 Query: 181 TEASKSAAAAESSKSAAATSAGAAKTSETNASASLQSAATSASTATTKASEAATSARDAA 240 + + A+ + +A SA +K S NA+ S Q+A A A + A+DAA Sbjct: 181 QNVTDAVKKAQQAAKNSADSATDSKNSADNAALSEQNAQKHAQKAEQHEQQTKQYAQDAA 240 Query: 241 ASKEAAKSSETNASS 255 + E+A++++ Sbjct: 241 TAAESAENAKGEIDE 255 >UniRef50_C2DS71 Tail fiber protein n=10 Tax=Escherichia RepID=C2DS71_ECOLX Length = 686 Score = 224 bits (569), Expect = 2e-56, Method: Composition-based stats. 
Identities = 148/252 (58%), Positives = 176/252 (69%) Query: 1 MAVKISGVLKDGTGKPVQNCTIQLKAKRNSTTVVVNTLASENPDEAGRYSMDVEYGQYSV 60 MAV+ISGVLKDG GKP+QNCTIQLKAKRNSTTVVVNT+ASENPDEAGRYSMDVEYGQYSV Sbjct: 1 MAVQISGVLKDGAGKPIQNCTIQLKAKRNSTTVVVNTVASENPDEAGRYSMDVEYGQYSV 60 Query: 61 ILLVEGFPPSHAGTITVYEDSQPGTLNDFLGAMTEDDARPEALRRFELMVEEVARNASAV 120 ILLVEGFPPSHAG ITVYEDS+PGTLNDFLGA TEDD RPEAL RFE MVEEVARNA A Sbjct: 61 ILLVEGFPPSHAGAITVYEDSKPGTLNDFLGAATEDDVRPEALYRFEKMVEEVARNAEAA 120 Query: 121 AQNTAAAKKSASDASTSAREAATHAADAADSARAASTSAGQAASSAQSASSSAGTASTKA 180 +Q+ AAAKKS + A++S A T +A +SA+AA++S A ++A +A S A Sbjct: 121 SQSAAAAKKSETAAASSRNAAKTSETNAGNSAKAAASSKTAAQNAATAAERSETNARASE 180 Query: 181 TEASKSAAAAESSKSAAATSAGAAKTSETNASASLQSAATSASTATTKASEAATSARDAA 240 ++ S A+ + +AA +AG A T A+A A A + A+ A +A A Sbjct: 181 EASADSEEASRRNAESAAENAGVATTKAREAAADATKAGQKKDEALSAATRAEKAADRAE 240 Query: 241 ASKEAAKSSETN 252 + E N Sbjct: 241 VAAEVTAEPYAN 252 Score = 76.9 bits (187), Expect = 5e-12, Method: Composition-based stats. Identities = 61/139 (43%), Positives = 86/139 (61%) Query: 215 LQSAATSASTATTKASEAATSARDAAASKEAAKSSETNASSSASSAASSATAAGNSAKAA 274 ++ A +A A+ A+ A S AA+S+ AAK+SETNA +SA +AASS TAA N+A AA Sbjct: 110 VEEVARNAEAASQSAAAAKKSETAAASSRNAAKTSETNAGNSAKAAASSKTAAQNAATAA 169 Query: 275 KTSETNARSSETAAGQSASAAAGSKTAAASSASAASTSAGQASASATAAGKSAESAASSA 334 + SETNAR+SE A+ S A+ + +AA +A A+T A +A+A AT AG+ + A S+A Sbjct: 170 ERSETNARASEEASADSEEASRRNAESAAENAGVATTKAREAAADATKAGQKKDEALSAA 229 Query: 335 STATTKAGEATEQASAAAR 353 + A A A A A Sbjct: 230 TRAEKAADRAEVAAEVTAE 248 Score = 67.6 bits (163), Expect = 3e-09, Method: Composition-based stats. 
Identities = 73/224 (32%), Positives = 101/224 (45%), Gaps = 17/224 (7%) Query: 325 KSAESAASSASTATTKAGEATEQASAAARSASAAKTSETNA------KASETSAESSKTA 378 K E A +A A+ A A + +AAA S +AAKTSETNA AS +A + Sbjct: 108 KMVEEVARNAEAASQSAAAAKKSETAAASSRNAAKTSETNAGNSAKAAASSKTAAQNAAT 167 Query: 379 AASSASSAASSASSASASKDEATRQ-ASAAKSSATTASTKATEAAGSATAAAQSKSTAES 437 AA + + A ++ ASA +EA+R+ A +A +A A+TKA EAA AT A Q K A S Sbjct: 168 AAERSETNARASEEASADSEEASRRNAESAAENAGVATTKAREAAADATKAGQKKDEALS 227 Query: 438 AATRAETAAKRAEDIASAVALEDASTTKKGIVQLSSATNSTSETLAATPKAVKSAYDNAE 497 AATRAE AA RAE A A A+ +S +P K A + Sbjct: 228 AATRAEKAADRAEVAAEVTAEPYANIVPPLPDVWIPFNDSLDMIAGFSPGYKKIAIGDDV 287 Query: 498 KRLQKDQNGADIPDKGCFLNNINAVSKTDFADKRGMRYVRVNAP 541 ++ D+ F A + T ++ +N P Sbjct: 288 VQVASDKQVN-------FSR---ASTATYINKSGELKTAEINEP 321 Score = 53.4 bits (126), Expect = 4e-05, Method: Composition-based stats. Identities = 51/137 (37%), Positives = 76/137 (55%), Gaps = 2/137 (1%) Query: 174 GTASTKATEASKSAAAAESSKSAAATSAGAAKTSETNASASLQSAATSASTATTKASEAA 233 + A AS+SAAAA+ S++AAA+S AAKTSETNA S ++AA+S + A A+ A Sbjct: 111 EEVARNAEAASQSAAAAKKSETAAASSRNAAKTSETNAGNSAKAAASSKTAAQNAATAAE 170 Query: 234 TSARDAAASKEAAKSSETNASSSASSAASSATAAGNSAKAAKTSETNA--RSSETAAGQS 291 S +A AS+EA+ SE + +A SAA +A A A+ A T A + E + + Sbjct: 171 RSETNARASEEASADSEEASRRNAESAAENAGVATTKAREAAADATKAGQKKDEALSAAT 230 Query: 292 ASAAAGSKTAAASSASA 308 + A + A+ +A Sbjct: 231 RAEKAADRAEVAAEVTA 247 >UniRef50_Q7Y3Z0 Tail fiber protein n=1 Tax=Yersinia phage PY54 RepID=Q7Y3Z0_9CAUD Length = 690 Score = 212 bits (540), Expect = 5e-53, Method: Composition-based stats. 
Identities = 111/359 (30%), Positives = 178/359 (49%), Gaps = 13/359 (3%) Query: 1 MAVKISGVLKDGTGKPVQNCTIQLKAKRNSTTVVVNTLASENPDEAGRYSMDVEYGQYSV 60 M++KISG+L TG+P + I L+A + S TV+ ++ G YS++VE G+Y V Sbjct: 1 MSIKISGILPGPTGEPAAHIGITLRAIKTSLTVITTLESNSITGTDGAYSLNVEPGKYDV 60 Query: 61 ILLVEGFPPSHAGTITVYEDSQPGTLNDFLGAMTEDDARPEALRRFELMVEEVARNASAV 120 +L V+G GTI VY DS PGTLN+FL A+ E+D PE + + E + E R A Sbjct: 61 LLWVDGINARRVGTINVYSDSLPGTLNNFLTALREEDGTPEIILQLEQLRAEAVRAALEA 120 Query: 121 AQNTAAAKKSASDASTSAREAATHAADAADSARAASTSAGQAAS-SAQSASSSAGTASTK 179 ++ A + A A ++A AA A+ +A +AA A++A S+ T + + Sbjct: 121 KESKNEATQQAGIAISAADNAAQETAELIKAAVKEDADRAEAARYGAETAQSTVNTLAAE 180 Query: 180 A----TEASKSAAAAESSKSAAATSAGAAKTSETNASASLQSAATSASTATTKASEAATS 235 +E + A+AA +S + AA+S+ ++ S + + +S +AA S +A A +A S Sbjct: 181 VARHHSEVGQLASAASNSAAEAASSSNSSAQSASESESSKNAAALSEQSALAGAEDAGNS 240 Query: 236 ARDAAASKEAAKSSETNASSSASSAASSATAAGNSAKAAKTSETNARSSETAAGQSASAA 295 A AA K AAK A A+ A +SA + SA + N + S+T + + Sbjct: 241 ATAAAGDKTAAKGFRDEAEEFAARAKASAESIDVSALE---EQINQKVSQTEFDDTIADK 297 Query: 296 AGSKTAAASSASAASTSAGQASASATAAGKSAESAASSASTATTKAGEATEQASAAARS 354 A ++ A+ AST + AGK + + + AG+AT+Q A + Sbjct: 298 ASNQALTDGLATKASTQ----QLTDGLAGK-LDKIGGTLTGPLILAGDATDQKGAVTKQ 351 Score = 59.2 bits (141), Expect = 9e-07, Method: Composition-based stats. 
Identities = 67/253 (26%), Positives = 103/253 (40%), Gaps = 13/253 (5%) Query: 191 ESSKSAAATSAGAAKTSETNASASLQSAATSASTATTKASE-----AATSARDAAASKEA 245 E ++ A +A AK S+ A+ A ++A A + +E A A A++ Sbjct: 107 EQLRAEAVRAALEAKESKNEATQQAGIAISAADNAAQETAELIKAAVKEDADRAEAARYG 166 Query: 246 AKSSETNASSSASSAASSATAAGNSAKAAKTSETNARSSETAAGQSASAAAGSKTAAASS 305 A+++++ ++ A+ A + G A AA S A SS ++ QSAS + SK AAA S Sbjct: 167 AETAQSTVNTLAAEVARHHSEVGQLASAASNSAAEAASSSNSSAQSASESESSKNAAALS 226 Query: 306 ASAASTSAGQASASATAAGKSAESAASSASTATTKAGEATEQASAAARSASAAKTSETNA 365 +A A A SATAA +A A A A A + SA + N Sbjct: 227 EQSALAGAEDAGNSATAAAGDKTAAKGFRDEAEEFAARAKASAESIDVSALE---EQINQ 283 Query: 366 KASETSAESSKTAAASSASSAASSASSASASKDEATRQASAAKSSATTAST-----KATE 420 K S+T + + AS+ + A+ AS + K T AT+ Sbjct: 284 KVSQTEFDDTIADKASNQALTDGLATKASTQQLTDGLAGKLDKIGGTLTGPLILAGDATD 343 Query: 421 AAGSATAAAQSKS 433 G+ T KS Sbjct: 344 QKGAVTKQQLDKS 356 >UniRef50_D0IJ09 Tail fiber protein H putative n=1 Tax=Vibrio sp. RC586 RepID=D0IJ09_9VIBR Length = 368 Score = 197 bits (500), Expect = 2e-48, Method: Composition-based stats. 
Identities = 81/217 (37%), Positives = 106/217 (48%), Gaps = 51/217 (23%) Query: 903 SYPVGAPIPWPSDTVPSGYALMQGQAFDKSAYPKLAAAYPSGVIPDMRGWTIKGKPASGR 962 PVG P+PWPSD P G+A+ +GQAFDK A P+LA YP G++ D+RG + GK G Sbjct: 202 ICPVGVPLPWPSDIAPEGFAIHKGQAFDKVANPELAKLYPDGILKDLRGMAVVGK-KEGE 260 Query: 963 AVLSQEQDGIKSHTHSASASSTDLGTKTTSSFDYGTKSTNNTGAHTHSVSGSTNSAGAHT 1022 +LS E D +K H + S T SS D G+++TN TG H H T S G + Sbjct: 261 IILSYEADQVKQHGYPNS---------TVSSTDLGSRNTNTTGNHAHGYPAGT-SNGPNG 310 Query: 1023 HSLANVNTASANSGAGSASTRLSVVHNQNYATSSAGAHTHSLSGTAASAGAHAHTVGIGA 1082 L +TA A+ G +T G H H+V IG+ Sbjct: 311 PYL---DTAHASYGYRYTTTE----------------------------GNHYHSVAIGS 339 Query: 1083 HTHSVAIGSHGHTITVNAAGNAENTVKNIAFNYIVRL 1119 H HS+AI G ENT+KNI FN+IVR+ Sbjct: 340 HAHSIAI---------ALFGATENTIKNIKFNWIVRM 367 >UniRef50_A3GRX3 Tail fiber like-protein n=1 Tax=Vibrio cholerae NCTC 8457 RepID=A3GRX3_VIBCH Length = 182 Score = 197 bits (499), Expect = 3e-48, Method: Composition-based stats. Identities = 71/218 (32%), Positives = 100/218 (45%), Gaps = 51/218 (23%) Query: 903 SYPVGAPIPWPSDTVPSGYALMQGQAFDKSAYPKLAAAYPSGVIPDMRGWTIKGKPASGR 962 +PVG IPW +D P G+ + +GQAFD + Y +LA +P+G+IPDMRG + GK G Sbjct: 16 IFPVGGAIPWFTDVAPEGFGMFKGQAFDVNVYTELAKVFPNGIIPDMRGCGVIGK-EDGE 74 Query: 963 AVLSQEQDGIKSHTHSASASSTDLGTKTTSSFDYGTKSTNNTGAHTHSVSGSTNSAGAHT 1022 AV + E+ +K+H H S T SS D G+K+T N G HTH + G+H Sbjct: 75 AVGAYEEGQVKNHGHPNS---------TVSSIDLGSKNTANGGNHTHFSGIAAFGGGSHR 125 Query: 1023 HSLANVNTASANSGAGSASTRLSVVHNQNYATSSAGAHTHSLSGTAASAGAHAHTVGIGA 1082 + N TS+AG H H Sbjct: 126 YQTD------------------VNGSGGNINTSAAGNHYH-------------------- 147 Query: 1083 HTHSVAIGSHGHTITVNAAGNAENTVKNIAFNYIVRLA 1120 S+ +GSH H +T+ G +NT+ + N+IVRLA Sbjct: 148 ---SIPMGSHAHAVTIALFGALKNTINHRKINWIVRLA 182 >UniRef50_C4U3E2 Tail fiber protein (Fragment) n=1 Tax=Yersinia kristensenii ATCC 33638 RepID=C4U3E2_YERKR Length = 430 Score = 195 bits (495), Expect = 8e-48, Method: Composition-based stats. 
Identities = 109/401 (27%), Positives = 177/401 (44%), Gaps = 2/401 (0%) Query: 1 MAVKISGVLKDGTGKPVQNCTIQLKAKRNSTTVVVNTLASENPDEAGRYSMDVEYGQYSV 60 M+VKISG L DG G P+ C I LKA+ N+ VV+ T+A+ G YS + + G+Y V Sbjct: 1 MSVKISGALIDGAGIPMSGCQIILKARVNTAEVVMRTIATITTGRNGEYSFEAQVGRYCV 60 Query: 61 ILLVEGFPPSHA-GTITVYEDSQPGTLNDFLGAMTEDDARPEALRRFELMVEEVARNASA 119 L G+ + G ITVY+DS+PGTLNDFL A+ E D +P+ ++RFE +V + ++A Sbjct: 61 YLR-HGWSNEYCVGDITVYDDSKPGTLNDFLIALDEGDLKPDVVKRFEELVAQAQQSADM 119 Query: 120 VAQNTAAAKKSASDASTSAREAATHAADAADSARAASTSAGQAASSAQSASSSAGTASTK 179 A++ +S D + +A +AADA SA AA+ S A S + A+SSA +A+ Sbjct: 120 AAESAQQVSQSVQDVTKVKDDAKRYAADAQTSATAAAESQSTATESEKRAASSAHSATQS 179 Query: 180 ATEASKSAAAAESSKSAAATSAGAAKTSETNASASLQSAATSASTATTKASEAATSARDA 239 A A +S AA+ + A + N + +Q+ A S A T A T A Sbjct: 180 AQNAQESKEAAQQAAQNAQNCRNEVEEVANNLANEVQTKAPLDSPALTGTPTAPTPDISA 239 Query: 240 AASKEAAKSSETNASSSASSAASSATAAGNSAKAAKTSETNARSSETAAGQSASAAAGSK 299 + A A S+ ++ A N A ++ N ++ T A + Sbjct: 240 TGGEVATAEFVKQAVSALVDSSPEALDTLNELAEALGNDPNFATTMTNALAGKQPLNPAL 299 Query: 300 TAAASSASAASTSAGQASASATAAGKSAESAASSASTATTKAGEATEQASAAARSASAAK 359 T+ + +A + A ++ + + + + A + + S Sbjct: 300 TSLSGLVTAENKLAYFSNKNVMSLANLSAVGRVIIGQNSKSEVLEYLGALKSTNNLSEIA 359 Query: 360 TSETNAKASETSAESSKTAAASSASSAASSASSASASKDEA 400 T+ T+A+ AA + S + + A Sbjct: 360 TAGTDAQQQTRQHLGLGDAATMNVQSDIHDRTEGRLALPGA 400 >UniRef50_B3YHG3 Tail fiber protein n=2 Tax=Salmonella enterica subsp. enterica serovar Kentucky RepID=B3YHG3_SALET Length = 573 Score = 190 bits (482), Expect = 2e-46, Method: Composition-based stats. 
Identities = 134/476 (28%), Positives = 191/476 (40%), Gaps = 30/476 (6%) Query: 1 MAVKISGVLKDGTGKPVQNCTIQLKAKRNSTTVVVNTLASENPDEAGRYSMDVEYGQYSV 60 M++ +SG+LK G + I L A S ++ AS + G Y M+V G YS+ Sbjct: 1 MSILVSGILKSPAGAIIAGAQITLTALTTSPDLLAGVSASAVTSDTGYYGMNVLPGVYSL 60 Query: 61 ILLVEGFPPSHAGTITVYEDSQPGTLNDFLGA-MTEDDARPEALRRFELMVEEVARNASA 119 + V G + G+ + TLN L + E E L F + VA + Sbjct: 61 TVAVNGKSQVY-GSFRLDGTETTVTLNMVLRRNLVEVSIPDELLVDFRQIQNNVADDLET 119 Query: 120 VAQNTAAAKKSASDASTSAREAATHAADAADSARAASTSAGQAASSAQSASSSAGTASTK 179 + Q A SA A AADA SA +A A S A+ S A Sbjct: 120 IRQLELRAS-------GSADNAVRTAADAKASAESA-------ARSEADAADSEKKAEQF 165 Query: 180 ATEASKSAAAAESSKSAAATSAGAAKTSETNASASLQSAATSASTATTKASEAATSARDA 239 A + A A S SAAA SA A T A ++ AA S + AS AA S ++A Sbjct: 166 ARNLQDAVAKAGDSASAAALSAAGAGEQATAAKSAALEAADSKAATEKAASNAALSEKNA 225 Query: 240 AASKEAAKSSETNASSSASSAASSATAAGNSAKAAKTSETNARSSETAAGQSASAAAGSK 299 A S AA++SE +SAA SAT A S KAA E + + ET AGQSA+ AA S Sbjct: 226 ADSALAARTSE-------NSAADSATKADASEKAAVLYEQTSSTHETNAGQSAADAALSA 278 Query: 300 TAAASSASAASTSAGQASASATAAGKSAESAASSASTATTKAGEATEQASAAARSASAAK 359 T AA SA A SA +A+ AT A A +A +A+ ATT E Q S S + Sbjct: 279 TKAADSALNAGKSATEAAGYATDAQTQAGNAKRAATDATTAKDEIVRQISGFDEHVSQQE 338 Query: 360 TSETNAKASETSAESSKTAAASSASSAASSASSASASKDEATRQASAAKSSATTASTKAT 419 T T ++ A + A +A +A A+ ++A + + T Sbjct: 339 TVIT------AKGQTLVDQARTEAINAGQAAQHAAQVLEDAINASIKGEKGDTGEQGPQG 392 Query: 420 EAAGSATAAAQSKSTAESA-ATRAETAAKRAEDIASAVALEDASTTKKGIVQLSSA 474 + + + +T K + A + + + S+ Sbjct: 393 IPGPAGPPGPKGDKGDAGYRGLKGDTGLKGEKGDTGPSAYDIWKSQQPDGTDTSTV 448 >UniRef50_C5BA56 Putative uncharacterized protein n=1 Tax=Edwardsiella ictaluri 93-146 RepID=C5BA56_EDWI9 Length = 743 Score = 188 bits (477), Expect = 1e-45, Method: Composition-based stats. 
Identities = 124/543 (22%), Positives = 204/543 (37%), Gaps = 16/543 (2%) Query: 5 ISGVLKDGTGKPVQNCTIQLKAKRNSTTVVVNTLASENPDEAGRYSMDVEYGQYSVILLV 64 ISGVL D +GK V I L A NS V+ S E G+YS+ +E G YS+ + Sbjct: 2 ISGVLLDPSGKAVSGAQITLTAIANSMQVLRGFTCSVMTAENGQYSVRLEEGNYSISVAH 61 Query: 65 EGFPPSHAGTITVYEDSQPGTLNDFL-GAMTEDDARPEALRRFELMVEEVARNASAVAQN 123 +G + G +T+ EDS P +LN L + E + PE + F + +VA + + + Sbjct: 62 QGRNFVY-GAVTLTEDSAPSSLNALLHQQVMEQEVTPEVILYFRQIQHKVADDVVIMQRL 120 Query: 124 TAAAKKSASDASTSAREAATHAADAADSARAASTSAGQAASSAQSASSSAGTASTKATEA 183 + ++A A S R A AA S R A+ A +A+ A+ A TA A Sbjct: 121 QHDSSQAARAAQESQRHAQASKVAAAGSVRQAAAHRLAAGQAAEMAADYAQTAQDSQRHA 180 Query: 184 SKSAAAAESSKSAAATSAGAAKTSETNASASLQSAATSASTATTKASEAATSARDAAASK 243 +S AA S+ A AA+ S A+ A + + A +K Sbjct: 181 QRSEMAAAESEQRTADHRLAAEQSAEVAAVHAAEEAAARVAEVVH-----NDSERADVAK 235 Query: 244 EAAKSSETNASSSASSAASSATAAGNSAKAAKTSETNARSSETAAGQSASAAAGSKTAAA 303 + A+S+ NA A A A A AA S+ + + A +A AA + Sbjct: 236 KQAESAARNADQYAQQAGIKAQNAQA---AADVSQEAIQVTRQARDDTARYAAEVQGYTQ 292 Query: 304 SSASAASTSAGQASASATAAGKSAESAASSASTATTKAGEATEQASAAARSASAAKTSET 363 + +A A A+ A + +A+ A R+ A Sbjct: 293 QAVLSAQQIKEDTDTGLLVAQDIAQQVAVVNMAVSRVVEDASYVEQAVIRTLDKAPVDSP 352 Query: 364 NAKASETSAESSKTAAASSASSAASSASSASASKDEATRQASAAKSSATTASTKATEAAG 423 + + +A ++AA + S + + + A+ +A Sbjct: 353 VFTGTPQAPTPDGSAVGQEIATAAFVLAQVSKLINSSPAAMDTLQELASALGNDPEFSAT 412 Query: 424 SATAAAQSKSTAESAATRAETAAKRAEDIASAVALEDASTTKKGIVQLSSATNSTSETLA 483 Q ++ A + + + + A+ +++G+VQLS +T STS + A Sbjct: 413 VMNLIGQKLDKLQNGADIPDKSRFLQN-----IGVVSATISRRGVVQLSDSTESTSISEA 467 Query: 484 ATPKAVKSAYDNAEKRLQKDQNGADIPDKGCFLNNINAVSKTDFADKRGMRYVRVNAPAG 543 AT A++ Y A R+ + + + A + T A KR A G Sbjct: 468 ATANALRRTYQYAT-RIATTNQIGQVQLEDSVSSTSTARAPTCSALKRTYDEATRRASTG 526 Query: 544 ATS 546 Sbjct: 527 QAG 529 >UniRef50_A9Q1X5 Putative tail fiber protein n=1 Tax=Enterobacteria phage phiEcoM-GJ1 RepID=A9Q1X5_9CAUD Length = 356 Score = 188 bits (476), Expect = 1e-45, Method: Composition-based stats. 
Identities = 75/238 (31%), Positives = 119/238 (50%), Gaps = 18/238 (7%) Query: 857 GVGSCRTLQMKAHYRNGGLFYRSSRDGY---GFEEDWAEVYTSKNLPPESYPVGAPIPWP 913 R +Q ++ N ++ RS F+E W E N+ YP+G + + Sbjct: 68 NANVGRVMQRYTNFSNKRMWVRSQNGTVSDANFDE-WTEFVNMNNIYNAIYPIGIVVKFD 126 Query: 914 SDTVPSGYALMQGQAFDKSAYPKLA--AAYPSGVIPDMRGWTIKGKPASGRAV--LSQEQ 969 + T P+ G +++ ++A A P D + +I G + AV L Sbjct: 127 NATNPNNN--FTGTVWEQIIDGRVARAATGPEAGTADGQIGSIAGSDTANIAVTNLPGHT 184 Query: 970 DGIKSHTHSASASSTDLGTKTTSSFDYGTKSTNNTGAHTHSVSGSTNSAGAHTHSLANVN 1029 G+++HTH ++ S + T + D+G +++++GAHTHSVSG+ SAGAH H+ + Sbjct: 185 HGMQNHTHGIASHSHTMAHTHTINHDHGAVTSSSSGAHTHSVSGTAASAGAHQHTEGSPF 244 Query: 1030 TASANSGAGSASTRLSVVHNQNYA-------TSSAGAHTHSLSGTAASAGAHAHTVGI 1080 T N G + ST + + Y+ TSS+GAHTHS+SGTAASAGAH H+V + Sbjct: 245 TGDVNFGT-TTSTSKDNISDWLYSPSTRYPLTSSSGAHTHSVSGTAASAGAHTHSVDL 301 >UniRef50_Q858V4 GpH n=9 Tax=root RepID=Q858V4_9CAUD Length = 913 Score = 187 bits (475), Expect = 2e-45, Method: Composition-based stats. Identities = 77/180 (42%), Positives = 102/180 (56%), Gaps = 11/180 (6%) Query: 732 LWRNDGAKT-YLLLTNQGDVYG-----GWNTLRPFAIDNAT---GELVIGTKLSASLNGN 782 +WR+ KT Y T + +YG +N R ID A + LS L+GN Sbjct: 587 VWRSTSNKTNYRFFTVR--LYGNPGERSFNIRRLPIIDEAQTWEAKQTFSAGLSGELSGN 644 Query: 783 ALTATKLQTPRRVSGVEFDGSKDITLTAAHVAAFARRATDTYADADGGVPWNAESGAYNV 842 A TATKL+T R+++ V FDG+ DI LT ++ AFA T D V WN SGAYN Sbjct: 645 AATATKLKTARKINNVSFDGTSDINLTPKNIGAFASGKTGDTVANDKAVGWNWSSGAYNA 704 Query: 843 TRSGDSYILVNFYTGVGSCRTLQMKAHYRNGGLFYRSSRDGYGFEEDWAEVYTSKNLPPE 902 T G S ++++F G GSC Q + +Y+NGG+FYRS+RDGYGFE DW+E YT+ P Sbjct: 705 TTGGASTLILHFNIGEGSCPAAQFRVNYKNGGIFYRSARDGYGFEADWSEFYTTTRKPTA 764 Score = 98.4 bits (243), Expect = 1e-18, Method: Composition-based stats. 
Identities = 56/139 (40%), Positives = 70/139 (50%), Gaps = 1/139 (0%) Query: 395 ASKDEATRQASAAKSSATTASTKATEAAGSATAAAQSKSTAESAATRAETAAKRAEDIAS 454 A + A + A T S A+ + A + + + + Sbjct: 103 AVSNMAESYKPELAEGSGRAQTCRMVIILSNVASVELSIDASTVMATQDYVDDKIAEHEQ 162 Query: 455 AVALEDASTTKKGIVQLSSATNSTSETLAATPKAVKSAYDNAEKRLQKDQNGADIPDKGC 514 + DA+ T+KG QLSSATNSTSE LAATPKAVK+A DNA RL K+QNGADI DK Sbjct: 163 SRRHPDATLTEKGFTQLSSATNSTSEKLAATPKAVKAANDNANSRLAKNQNGADIQDKSA 222 Query: 515 FLNNINAVSKTDFADKRGM 533 FL+NI S T F GM Sbjct: 223 FLDNIGVTSLT-FMKHNGM 240 >UniRef50_C6DE08 Tail Collar domain protein n=1 Tax=Pectobacterium carotovorum subsp. carotovorum PC1 RepID=C6DE08_PECCP Length = 682 Score = 186 bits (472), Expect = 4e-45, Method: Composition-based stats. Identities = 69/206 (33%), Positives = 105/206 (50%), Gaps = 22/206 (10%) Query: 783 ALTATKLQTPRRVSGVEFDGSKDITLTAAHVAAFARRATDTYADADGGVPWNAESGAYNV 842 AL+++++ T V + + +V + Y D+D + W++ +GAY Sbjct: 411 ALSSSRVPTAADVGAI-----TKTDADSHYVHQGSSGVI--YQDSD--LAWDSPTGAYLK 461 Query: 843 TRSGDSYILVNFYTGVGSCRTLQMKAHYRNGGLFYRSSRDGYGFEEDWAEVYTSKNLPPE 902 S ++ + GS Q + NGG+ YRSSRD GFE+ WA +YT ++ P Sbjct: 462 DNGTHSSLIWHMGLNAGSASAAQFYFDFANGGIKYRSSRDNSGFEKPWARIYTDQDKPTA 521 Query: 903 --------SYPVGAPIPWPSDTVPSGYALMQGQAFDKSAYPKLAAAYPSGVIPDMRGWTI 954 + VG P+PWP T PSG+ GQ FDK+ YPKLA YP+G++PD+RG I Sbjct: 522 ADIGALSLNEIVGMPMPWPQTTAPSGWLKCNGQTFDKNIYPKLAQIYPAGILPDLRGEFI 581 Query: 955 KGKPAS-----GRAVLSQEQDGIKSH 975 +G S GR +LS + D I++ Sbjct: 582 RGWDDSRGVDTGRTLLSTQGDAIRNI 607 Score = 48.0 bits (112), Expect = 0.002, Method: Composition-based stats. 
Identities = 33/131 (25%), Positives = 50/131 (38%), Gaps = 16/131 (12%) Query: 419 TEAAGSATAAAQSKSTAESAATRAETAAKRAEDIASAVALEDASTTKKGIVQLSSATNST 478 +A A+ K E + T A A+ + + T AT+S Sbjct: 33 RQAKELASRTRYLKKEQEKTGSDLATHAAAADPHTQYAPKANPTFTGTPKAPT-PATDSN 91 Query: 479 SETLAATPKAVKSAYDNAEKRLQKDQNGADIPDKGCFLNNINAVSKTDFADKRGMRYVRV 538 S+ +A T V+S +L KDQNGADIPD+ F N+ + R Sbjct: 92 SQQIATTAF-VRSV---GATKLAKDQNGADIPDRELFNRNLGSS-----------RAYSS 136 Query: 539 NAPAGATSGKY 549 + P G + G + Sbjct: 137 SIPIGGSDGLW 147 >UniRef50_D0ZBI4 Putative tail fiber protein n=1 Tax=Edwardsiella tarda EIB202 RepID=D0ZBI4_EDWTE Length = 718 Score = 178 bits (451), Expect = 9e-43, Method: Composition-based stats. Identities = 181/443 (40%), Positives = 240/443 (54%), Gaps = 9/443 (2%) Query: 4 KISGVLKDGTGKPVQNCTIQLKAKRNSTTVVVNTLASENPDEAGRYSMDVEYGQYSVILL 63 +I+G+LKDG GKP+ NC I LKA R S +V+V+T+AS++P EAG Y M E GQY V L Sbjct: 3 RITGILKDGMGKPITNCEIALKALRTSASVIVHTVASQSPGEAGLYDMAAEPGQYRVTLC 62 Query: 64 VEGFPPSHAGTITVYEDSQPGTLNDFLGAMTEDDARPEALRRFELMVEEVARNASAVAQN 123 V+G+PP + G I +Y DS GTLN FLG + D RP+ ++ FE+MV +V+ ++ V +N Sbjct: 63 VDGYPPEYVGDIQIYHDSPDGTLNYFLGLPVDGDLRPDVMKEFEIMVAKVSAQSAEVEKN 122 Query: 124 TAAAKKSASDASTSAREAATHAADAADSARAASTSAGQAASSAQSASSSAGTASTKATEA 183 AA +SA A S + A + + AA+SA AA S A +S Q A+S A +A A Sbjct: 123 KDAAAESARSALNSQQSAHSSESAAAESAAAALASQNAAKASEQLAASGAQSAQASQQAA 182 Query: 184 SKSAAAAESSKSAAATSAGAAKTSETNASASLQSAATSASTATTKASEAATSARDAAASK 243 S +AA S +AA S AAK SE A++S +A S +A S AA SA A AS+ Sbjct: 183 KASESAAADSAAAALASQNAAKESEQAAASSALAAQASQQSAHGSESAAAESAAAALASQ 242 Query: 244 EAAKSSETNASSSASSAASSATAAGNSAKAAKTSE------TNARSSETAAGQSASAAAG 297 AAK+SE A+SSA +AA+ A A A A E A SS T A S AAG Sbjct: 243 NAAKASELAATSSAETAANDAAAKAAQATEATLKEAVRADADRAASSATEAHSSTEQAAG 302 Query: 298 SKTAAASSASAASTSAGQASASATAAGKSAESAASSASTATTKAGEATEQASAAARSASA 357 S ++A +S AA+ SA QA+ A S +AA SAS+A +A+ ASAAA SASA Sbjct: 303 SASSAHNSQMAAAQSASQAAGLADKVKASEAAAAESASSAAQSVSQASSSASAAAGSASA 362 Query: 358 
AKTSETNAKASETSAESSKTAA---ASSASSAASSASSASASKDEATRQASAAKSSATTA 414 AK+SET A S +AE S +A A S + S AA Sbjct: 363 AKSSETAAAGSALAAEGSAQSAKVEADRISGGLDTKQDKSELLGAIAALQDAANKIVVLT 422 Query: 415 STKATEAAGSATAAAQSKSTAES 437 + EAA +T A S + Sbjct: 423 GPSSVEAADLSTFAKSLLSKTDQ 445 Score = 86.9 bits (213), Expect = 4e-15, Method: Composition-based stats. Identities = 60/246 (24%), Positives = 98/246 (39%), Gaps = 35/246 (14%) Query: 761 AIDNATGELVIGTKLSASLNGNALTATK--LQTPRRVSGVEFDGSKDITLTAAHVAAFAR 818 A+ +A ++V+ T S+ + T K L + S +E G K+ A + A+++ Sbjct: 410 ALQDAANKIVVLTGPSSVEAADLSTFAKSLLSKTDQDSAIECLGLKETVTLAGN--AWSK 467 Query: 819 RATDTYADADGGVPWNAESGAYNVTRSGDSYILVN-------------FYTGVGSCRTLQ 865 + + N + G Y V+ S + Y S Q Sbjct: 468 KYIGRLNNGGAFAGCN-QGGIYEVSIGTPSSVADFPMKNGTYIYGYGVLYVTSNSGTISQ 526 Query: 866 MKAHYRNGGLFYRSSRDGYGFEEDWAEVYTSKNLPPESYP-VGAPIPW-----PSDTVPS 919 + + NG + R + WA VY + P +G+ IPW P + P+ Sbjct: 527 LYISH-NGQIAARIKWGDQPNFKSWA-VYDPNSSFEYGCPLIGSLIPWALERMPQEIWPN 584 Query: 920 G---YALMQGQAFDKSAYPKLAAAYPSGVIP-DMRGWTIKGKPAS-----GRAVLSQEQD 970 + GQ+FD +PKL YP +P DMRG+T +G GRA+LS + D Sbjct: 585 CGMHFIPYMGQSFDPELFPKLHDVYPDNRLPTDMRGYTARGWDNGRGIDIGRALLSYQDD 644 Query: 971 GIKSHT 976 I++ T Sbjct: 645 AIQNIT 650 >UniRef50_C3R3S9 Predicted protein n=1 Tax=Bacteroides sp. 2_2_4 RepID=C3R3S9_9BACE Length = 1039 Score = 172 bits (436), Expect = 6e-41, Method: Composition-based stats. 
Identities = 118/441 (26%), Positives = 181/441 (41%), Gaps = 21/441 (4%) Query: 56 GQYSVILLVEGFPPSHAGTITVYEDSQPGTLNDFLGAMTEDDARPEALRRFELMVEEVAR 115 G Y + ++G H+G E++ ND G +T+ + FE+ +++ Sbjct: 411 GLYGTNVYLKGTFVLHSGK--KIEEAIDDVKNDLNGRITDVETN------FEIREGQISS 462 Query: 116 NASAVAQNTAAAKKSASDASTSAREAATHAADAADSARAASTSAGQAASSAQSASSSAGT 175 V + AK+S ++AS SA A A +A+ SA A +A A + + + Sbjct: 463 KIKEVNIAVSNAKQSETNASGSATSAGVSANNASKSATDAQGAATNAGKILEEVTLKESS 522 Query: 176 ASTKATEASKSAAAAESSKSAAATSAGAAKTSETNASASLQSAATSASTATTKASEAATS 235 + A E S A T+ TNA S SA+ SA TA+ KA EAA S Sbjct: 523 VTQTAGEISTKVT-------EVNKKVTEANTAATNAKNSATSASGSAGTASGKAGEAANS 575 Query: 236 ARDAAASKEAAKSSETNASSSASSAASSATAAGNSAKAAKTSETNARSSETAAGQSASAA 295 A +A S + A + + SS +A A ++ T A A+ A Sbjct: 576 AANAKQSADNAAKVLEDVTLKESSITQTAGNITLQVTEVTKKVVEANTAATTASTKAAEA 635 Query: 296 AGSKTAAASSASAASTSAGQASASATAAGKSAESAASSASTATTKAGEATEQASAAARSA 355 + S T A +SAS AST AG+AS SAT A SA+SAA+ + + K + AS+ Sbjct: 636 STSATNAKNSASTASTKAGEASTSATNAKNSADSAAAKLTVVSQKESSINQTASSITLQV 695 Query: 356 SAAKTSETNAKASETSAESSKTAAASSASSAASSASSASASKDEATRQASAAKSSATTAS 415 T A S T+A + AASSA++AA SA+ A A D K T Sbjct: 696 KEVTTKANEAANSATTAATKAGEAASSATNAAKSATDAKALLDNVD-----GKYVTKTVY 750 Query: 416 TKATEAAGSATAAAQSKSTAESAATRAETAAKRAEDIASAVALEDASTTKKGIVQLSSAT 475 + + K+ + +R A A +AL+ + ++ + S T Sbjct: 751 DSEIKVLSDSINLKVEKTDFNALGSRVAAAEASISAQAGQIALKASQSSVNDLTGRMS-T 809 Query: 476 NSTSETLAATPKAVKSAYDNA 496 +S T A ++K A Sbjct: 810 AESSITQNAEQISLKVTSTEA 830 >UniRef50_C9KJG4 Side tail fiber protein from lambdoid prophage Rac n=1 Tax=Mitsuokella multacida DSM 20544 RepID=C9KJG4_9FIRM Length = 932 Score = 172 bits (435), Expect = 7e-41, Method: Composition-based stats. 
Identities = 134/389 (34%), Positives = 195/389 (50%), Gaps = 25/389 (6%) Query: 97 DARPEALRRFELMVEEVARNASAVAQNTAAAKKSASDA-STSAREAATHAADAADSARAA 155 D E LR + +++V + + A K+S +D +T +++ A A A A Sbjct: 40 DVIAEDLRWLKENIDDVKDTS-----DLEAIKQSVTDMYNTMKNDSSFGEATAKAQAEEA 94 Query: 156 STSAGQAASSAQSASSSAGTASTKATEASKSAAAAESSKSAAATSAGAAKTSETNASASL 215 A A SA +A + +TK+TE + + A +S A + KT E + S S Sbjct: 95 KKQAQAALESATNAKTYYDDITTKSTEVNNTIAEIKSYIEKAEALNESNKTLEQSISDSA 154 Query: 216 QSAATSASTATTKASEAATSARDAAASKEAAKSSETNASSSASSAASSATAAGNSAKAAK 275 A A +A + A+ AATS +A AS+ AK+SETNA S ++AA S + A A Sbjct: 155 TVATNKAKSAASSATNAATSETNAKASETKAKASETNAKVSETNAAKSESNAKAHMDATA 214 Query: 276 TSETNARSSETAAGQSASAAAGSKTAAASSASAASTSAGQASASATAAGKSAESAASSAS 335 TSE+NA++SET A S +A+ S+T A +S + A + +S SA A AES+ S S Sbjct: 215 TSESNAKTSETNAKASQAASKTSETNAKTSETNAKQYSINSSNSADLAKAWAESSDSPDS 274 Query: 336 TATTKAGEATEQAS------------AAARSASAAKTSETNAKASE-------TSAESSK 376 T + Q+S +A S + AKTSETNAK SE T++ SS Sbjct: 275 VNDTDSTTGKTQSSKTWAIYSKDRAISAFTSETHAKTSETNAKTSETNAANSATNSASSA 334 Query: 377 TAAASSASSAASSASSASASKDEATRQASAAKSSATTASTKATEAAGSATAAAQSKSTAE 436 TA+A+SA AA+SA++A S+ A AS AK+S T A T T A S T AA S+ + Sbjct: 335 TASANSAEEAATSATNAKTSETNAATSASNAKTSETNAKTSETNAKASETNAATSEGNTK 394 Query: 437 SAATRAETAAKRAEDIASAVALEDASTTK 465 +A+ A + A+ I S V + A K Sbjct: 395 GYMEKAQVAYESAKAIQSVVDVAKADAEK 423 Score = 98.8 bits (244), Expect = 1e-18, Method: Composition-based stats. 
Identities = 100/381 (26%), Positives = 180/381 (47%), Gaps = 7/381 (1%) Query: 24 LKAKRNSTTVVVNTLASENPDEAGRYSMDVEYGQYSVILLVEGFPP--SHAGTITVYEDS 81 L+A + S T + NT+ +++ E + +E ++ IT Sbjct: 62 LEAIKQSVTDMYNTMKNDSSFGEATAKAQAEEAKKQAQAALESATNAKTYYDDITTKSTE 121 Query: 82 QPGTLNDFLGAMTEDDARPEALRRFELMVEEVARNASAVAQNTAAAKKSASDASTSAREA 141 T+ + + + +A E+ + E + + A A+ A++ A++ +A+ + T+A+ + Sbjct: 122 VNNTIAEIKSYIEKAEALNESNKTLEQSISDSATVATNKAKSAASSATNAATSETNAKAS 181 Query: 142 ATHAADAADSARAASTSAGQAASSAQSASSSAGTASTKATEASKSAAAAESSKSAAATSA 201 T A + +A+ + T+A A S +A + +T + A S A++S++A+ TS Sbjct: 182 ETKAKASETNAKVSETNA---AKSESNAKAHMDATATSESNAKTSETNAKASQAASKTSE 238 Query: 202 GAAKTSETNASASLQSAATSASTATTKASEAATSARDAAASKEAAKSSETNASSSASSAA 261 AKTSETNA +++ SA A A + + D+ ++ ++ + A + Sbjct: 239 TNAKTSETNAKQYSINSSNSADLAKAWAESS--DSPDSVNDTDSTTGKTQSSKTWAIYSK 296 Query: 262 SSATAAGNSAKAAKTSETNARSSETAAGQSASAAAGSKTAAASSASAASTSAGQASASAT 321 A +A S AKTSETNA++SET A SA+ +A S TA+A+SA A+TSA A S T Sbjct: 297 DRAISAFTSETHAKTSETNAKTSETNAANSATNSASSATASANSAEEAATSATNAKTSET 356 Query: 322 AAGKSAESAASSASTATTKAGEATEQASAAARSASAAKTSETNAKASETSAESSKTAAAS 381 A SA +A +S + A T A + AA S K A+ + SA++ ++ Sbjct: 357 NAATSASNAKTSETNAKTSETNAKASETNAATSEGNTKGYMEKAQVAYESAKAIQSVVDV 416 Query: 382 SASSAASSASSASASKDEATR 402 + + A + A +D + Sbjct: 417 AKADAEKCVADVEAVRDSLAK 437 Score = 88.4 bits (217), Expect = 1e-15, Method: Composition-based stats. 
Identities = 79/311 (25%), Positives = 132/311 (42%), Gaps = 3/311 (0%) Query: 199 TSAGAAKTSETNASASLQSAATSASTATTKASEAATSARDAAASKEAAKSSETNASSSAS 258 T + E A A + A A A A+ A T D ++ S Sbjct: 75 TMKNDSSFGEATAKAQAEEAKKQAQAALESATNAKTYYDDITTKSTEVNNTIAEIKSYIE 134 Query: 259 SAASSATAAGNSAKAAKTSETNARSSETAAGQSASAAAGSKTAAASSASAASTSAGQASA 318 A + + ++ S T A + +A SA+ AA S+T A +S + A S A Sbjct: 135 KAEALNESNKTLEQSISDSATVATNKAKSAASSATNAATSETNAKASETKAKASETNAKV 194 Query: 319 SATAAGKSAESAASSASTATTKAGEATEQASAAARSASAAKTSETNAKASETSAESSKTA 378 S T A KS +A + T A + A S +A+KTSETNAK SET+A+ Sbjct: 195 SETNAAKSESNAKAHMDATATSESNAKTSETNAKASQAASKTSETNAKTSETNAKQYSIN 254 Query: 379 AASSASSAASSASSASASKDEATRQASAAKSSATTASTKATEAAGSATAAAQSKSTAESA 438 +++SA A + A S+ ++ + ++ T A + A +A S++ A+++ Sbjct: 255 SSNSADLAKAWAESS--DSPDSVNDTDSTTGKTQSSKTWAIYSKDRAISAFTSETHAKTS 312 Query: 439 ATRAETAAKRAEDIASAVALEDASTTKKGIVQLSSATNST-SETLAATPKAVKSAYDNAE 497 T A+T+ A + A+ A ++ +SATN+ SET AAT + + Sbjct: 313 ETNAKTSETNAANSATNSASSATASANSAEEAATSATNAKTSETNAATSASNAKTSETNA 372 Query: 498 KRLQKDQNGAD 508 K + + ++ Sbjct: 373 KTSETNAKASE 383 Score = 68.4 bits (165), Expect = 1e-09, Method: Composition-based stats. 
Identities = 66/270 (24%), Positives = 115/270 (42%), Gaps = 4/270 (1%) Query: 235 SARDAAASKEAAKSSETNASSSASSAASSATAAGNSAKAAKTSETNARSSETAAGQSASA 294 +D + + +S ++ + ++ A A+ AK A S T A Sbjct: 55 DVKDTSDLEAIKQSVTDMYNTMKNDSSFGEATAKAQAEEAKKQAQAALESATNAKTYYDD 114 Query: 295 AAGSKTAAASSASAASTSAGQASASATAAGKSAESAASSASTATTKAGEATEQASAAARS 354 T ++ + + +A A + +S + SA+ AT KA A A+ AA S Sbjct: 115 ITTKSTEVNNTIAEIKSYIEKAEALNESNKTLEQSISDSATVATNKAKSAASSATNAATS 174 Query: 355 ASAAKTSETNAKASETSAESSKTAAASSASSAASSASSASASKDEATRQASAAKSSATTA 414 + AK SET AKASET+A+ S+T AA S S+A + + + S+ A + AK+S + Sbjct: 175 ETNAKASETKAKASETNAKVSETNAAKSESNAKAHMDATATSESNAKTSETNAKASQAAS 234 Query: 415 STKATEAAGSATAAAQSKSTAESAATRAETAAKRAEDIASAVALEDASTTKKGIVQLSSA 474 T T A S T A Q + ++A A+ A+ ++ S + + + G Q S Sbjct: 235 KTSETNAKTSETNAKQYSINSSNSADLAKAWAESSDSPDSV----NDTDSTTGKTQSSKT 290 Query: 475 TNSTSETLAATPKAVKSAYDNAEKRLQKDQ 504 S+ A + ++ +E + + Sbjct: 291 WAIYSKDRAISAFTSETHAKTSETNAKTSE 320 >UniRef50_D2TSH8 Phage tail fibre protein n=7 Tax=root RepID=D2TSH8_CITRO Length = 617 Score = 161 bits (407), Expect = 1e-37, Method: Composition-based stats. Identities = 63/137 (45%), Positives = 87/137 (63%) Query: 766 TGELVIGTKLSASLNGNALTATKLQTPRRVSGVEFDGSKDITLTAAHVAAFARRATDTYA 825 T LS L+GNA TATKL+T R+++GV FDGS DI+++A +V AFA R T Sbjct: 404 TNRQTFSGGLSGELSGNAATATKLKTARKIAGVGFDGSSDISISAKNVNAFALRQTGNTV 463 Query: 826 DADGGVPWNAESGAYNVTRSGDSYILVNFYTGVGSCRTLQMKAHYRNGGLFYRSSRDGYG 885 + D V WN +SGAYN G S ++++F GSC +Q + +Y+NGG+ YRS+RDGYG Sbjct: 464 NGDTSVGWNWDSGAYNALIGGASALILHFNINAGSCPAVQFRVNYKNGGISYRSARDGYG 523 Query: 886 FEEDWAEVYTSKNLPPE 902 FE W++ YT+ P Sbjct: 524 FELGWSDFYTTTRKPSA 540 Score = 59.2 bits (141), Expect = 9e-07, Method: Composition-based stats. 
Identities = 40/149 (26%), Positives = 70/149 (46%), Gaps = 2/149 (1%) Query: 355 ASAAKTSETNAKASETSAESSKT--AAASSASSAASSASSASASKDEATRQASAAKSSAT 412 + A++ + A++ +T +S+A+ A + ++ + + + Sbjct: 108 GNTAESYKPTVAEGSGRAQTFRTILTVSSTATVALTVDNTMVMATVDYVDDKLKEHEQSR 167 Query: 413 TASTKATEAAGSATAAAQSKSTAESAATRAETAAKRAEDIASAVALEDASTTKKGIVQLS 472 + A G ++ + S +E+ A + + + +DA+T +KGIVQLS Sbjct: 168 RHPDASLTAKGFVQLSSATNSVSETQAATPKAVKAAYDLANAKYTAQDATTAQKGIVQLS 227 Query: 473 SATNSTSETLAATPKAVKSAYDNAEKRLQ 501 SATNSTSETLAAT KAVK+ D K+ Sbjct: 228 SATNSTSETLAATSKAVKAVMDETNKKAP 256 Score = 52.6 bits (124), Expect = 9e-05, Method: Composition-based stats. Identities = 41/136 (30%), Positives = 56/136 (41%) Query: 399 EATRQASAAKSSATTASTKATEAAGSATAAAQSKSTAESAATRAETAAKRAEDIASAVAL 458 A + A T T S+TA + + ++ + Sbjct: 110 TAESYKPTVAEGSGRAQTFRTILTVSSTATVALTVDNTMVMATVDYVDDKLKEHEQSRRH 169 Query: 459 EDASTTKKGIVQLSSATNSTSETLAATPKAVKSAYDNAEKRLQKDQNGADIPDKGCFLNN 518 DAS T KG VQLSSATNS SET AATPKAVK+AYD A + + Sbjct: 170 PDASLTAKGFVQLSSATNSVSETQAATPKAVKAAYDLANAKYTAQDATTAQKGIVQLSSA 229 Query: 519 INAVSKTDFADKRGMR 534 N+ S+T A + ++ Sbjct: 230 TNSTSETLAATSKAVK 245 >UniRef50_B7MW07 Putative tail fiber protein from prophage n=4 Tax=Escherichia coli ED1a RepID=B7MW07_ECO81 Length = 520 Score = 160 bits (403), Expect = 4e-37, Method: Composition-based stats. 
Identities = 102/429 (23%), Positives = 145/429 (33%), Gaps = 4/429 (0%) Query: 1 MAVKISGVLKDGTGKPVQNCTIQLKAKRNSTTVVVNTLASENPDEAGRYSMDVEYGQYSV 60 M V+ISGVLKDGTGKPV CTI+LKA+R + TV+V T+A P E G YS DVE G Y V Sbjct: 1 MTVRISGVLKDGTGKPVPGCTIELKARRTTETVIVTTVAQGQPGETGSYSFDVEPGWYRV 60 Query: 61 ILLVEGFPPSHAGTITVYEDSQPGTLNDFLGAMTEDDARPEALRRFELMVEEVARNASAV 120 L EG+ PS+ G I V DS+PGTLN FL E P+AL E + E+ + A A Sbjct: 61 TLNTEGYAPSYVGDILVKADSEPGTLNKFLMEQDEAQYYPKALAELEAVAAEILKRAEAS 120 Query: 121 AQNTAAAKKSASDASTSAREAATHAADAADSARAASTSAGQAASSAQSASSSAGTASTKA 180 A + AKK A +A A E A A+ + + G + + Sbjct: 121 AASAEEAKKRAENARGPAGEKGDTGPQGATGAQGPAGATGAVGPKGEPGPKGERGETGPQ 180 Query: 181 TEASKSAAAAESSKSAAATSAGAAKTSETNASASLQSAATSASTATTKASEAATSARDAA 240 T + Q+ A + Sbjct: 181 GPKGDKGDPGGPPGPKGDTGPRGEAGPPGPQGPAGQTGPKGDKGEPGATGPAGPAGPRGE 240 Query: 241 ASKEAAKSSETNASSSASSAASSATAAGNSAKAAKTSETNARSSETAAGQSASAAAGSKT 300 + +S ++ S S+ ET A + + A A Sbjct: 241 TGPAGPAGPAGSVASVPDASTSQKGVVQLSSDTNSDDETKAATPKAVKAVMAEVQAAKTK 300 Query: 301 AAASSASAASTSAGQASASATAAGKSAESAASSASTATTKAGEATEQASAAARSASAAKT 360 A ++ AA A G + + A G+A A A + +A Sbjct: 301 AEEAATRAAVPGPKGDRGEPGAPGAVGPAGPRGPAGAAGPKGDAG-PAGPAGKDGTAGAE 359 Query: 361 SETNAKASETSAESSKTAAASSA--SSAASSASSASASKDEATRQASAAKSSATTASTKA 418 + + + + + + E RQ T T Sbjct: 360 GKAGPAGPRGERGPAGAQGVPGPVGPAGPAGKTGPRGLQGETGRQGPTGPQG-PTGETGP 418 Query: 419 TEAAGSATA 427 G Sbjct: 419 QGPQGPTGR 427 Score = 56.1 bits (133), Expect = 7e-06, Method: Composition-based stats. 
Identities = 40/184 (21%), Positives = 59/184 (32%), Gaps = 20/184 (10%) Query: 310 STSAGQASASATAAGKSAESAASSASTATTKAGEATEQASAAARSASAAKTSETNAKASE 369 A A A G + A+ A G + + ET + + Sbjct: 128 KKRAENARGPAGEKGDTGPQGATGAQGPAGATGAVGPKGEPGPKGER----GETGPQGPK 183 Query: 370 TSAESSKTAAASSASSAASSASSASASKDEATRQASAAKSSATTASTKATEAAGSATAAA 429 + + + A + AT AG A Sbjct: 184 GDKGDPGGPPGPKGDTGPRGEAGPPGPQGPAGQTGP----KGDKGEPGATGPAGPAGPRG 239 Query: 430 QSKSTAESAATRAETAAKRAEDIASAVALEDASTTKKGIVQLSSATNSTSETLAATPKAV 489 ++ + S ++ DAST++KG+VQLSS TNS ET AATPKAV Sbjct: 240 ETGPAGPAGPA------------GSVASVPDASTSQKGVVQLSSDTNSDDETKAATPKAV 287 Query: 490 KSAY 493 K+ Sbjct: 288 KAVM 291 >UniRef50_Q0I488 Putative uncharacterized protein n=1 Tax=Haemophilus somnus 129PT RepID=Q0I488_HAES1 Length = 2906 Score = 148 bits (372), Expect = 1e-33, Method: Composition-based stats. Identities = 119/425 (28%), Positives = 169/425 (39%), Gaps = 33/425 (7%) Query: 113 VARNASAVAQNTAAAKKSASDASTSAREAATHAADAADSARAASTSAGQAASSAQSASSS 172 R A A + A A T A E AT A A A AA A QA + A A + Sbjct: 1890 AKRGAEAAQTAAQGSASQAEAAKTKAEEFATKAEQAKGEAEAAKLGAEQAQTVAVDAKNK 1949 Query: 173 AGTASTKATEASKSAAAAESSKSAAATSAGAAKTSETNASASLQSAATSASTATTKASEA 232 A A KA +A A A+ AA A AA+ A ++A A A KA +A Sbjct: 1950 ALEAQGKAEQAQNKAEEAQGKAEAAKDEAVAAQQGAVTAKNQAETARDGAVDAKNKAEQA 2009 Query: 233 ATSARDAAASKEAAKSSETNASSSASSAASSATAAGNSAKAAKTSETNARSSETAAGQSA 292 + A A AAK A + A SA + A AA +A AK NA++ A Sbjct: 2010 KSQAETFATQANAAKQDAVTAKNQAESARNEANAAKTAALDAKQGAENAKN-------QA 2062 Query: 293 SAAAGSKTAAASSASAASTSAGQASASATAAGKSAESAASSASTATTKAGEATEQASAAA 352 A A A A A QA A A AA A++A + A A KA EA +A AA Sbjct: 2063 ETFATQANTAKQGALEAKDKAEQAKADAVAAKLGADAAQTLAVNAKDKALEAQGKAEAAQ 2122 Query: 353 RSASAAK-------------------------TSETNAKASETSAESSKTAAASSASSAA 387 +A + T++T A+A+ A ++K A + ++A Sbjct: 2123 AAAQNSASQAQTAQNKAEQAQAAAVAAQQGADTAKTQAEAARNEAVTAKNQAEDAKTAAL 2182 Query: 388 SSASSASASKDEATRQASAAKSSATTASTKATEAAGSATAAAQSKSTAESAATRAETAAK 447 + + A A+K A + + A + A + EA + TAA S 
AE+A T+AETA Sbjct: 2183 EAQNKAEAAKLGAEQAKAQADVAKNQAESARDEAVAAQTAAQGLASQAEAAKTQAETARD 2242 Query: 448 RAEDIASAVALEDASTTKKGIVQLSSATNSTSETLAATPKAVKSAYDNAEKRLQKDQNGA 507 A D + + Q ++A ++ E A A + A + + + Sbjct: 2243 GAVDAKNKAEQAKSQAETFA-TQANAAKDAALEAQAGANAAQQGAESAKDDAVAAQKASE 2301 Query: 508 DIPDK 512 D DK Sbjct: 2302 DARDK 2306 Score = 123 bits (307), Expect = 5e-26, Method: Composition-based stats. Identities = 118/447 (26%), Positives = 169/447 (37%), Gaps = 17/447 (3%) Query: 115 RNASAVAQNTAAAKKSASDASTSAREAATHAADAADSARAASTSAGQAASSAQSASSSAG 174 A AK+ A DA A +A A A D A AA T A A+ A +A A Sbjct: 1605 NQAVLAQAGAEQAKQDALDAQGKAEQAKQGADAAKDEAVAAKTQAETFATQANTAKQGAL 1664 Query: 175 TASTKATEASKSAAAAESSKSAAATSAGAAKTSETNASASLQ--------------SAAT 220 A KA +A A AA+ A T A AK A + +A Sbjct: 1665 EAKDKAEQAKGEAEAAKLGAEQAQTVAVDAKNKALEAQGKAEAAQAAAQNSASQAQTAQN 1724 Query: 221 SASTATTKASEAATSARDAAASKEAAKSSETNASSSASSAASSATAAGNSAKAAKTSETN 280 A A A A A A EAA++ A + A A ++A A N A+AAK Sbjct: 1725 KAEQAQVAAVAAQQGADTAKTQAEAARNEAVTAKNQAEDAKTAALEAQNKAEAAKLGAEQ 1784 Query: 281 ARSSETAAGQSASAAAGSKTAAASSASAASTSAGQASASATAAGKSAESAASSASTATTK 340 A++ A A +A A + A +T A A A+A A + A A + A A ++ Sbjct: 1785 AKAQADVAKNQAESARDEAVVAKTQAEEFATKAQTAQAAAVNAQQGAVEAKNKAEQAKSQ 1844 Query: 341 AGEATEQASAAARSASAAKTSETNAKASETSAESSKTAAASSASSAASSASSASASKDEA 400 A QA+AA +A A+ NAK +A A A +A A +A + + Sbjct: 1845 AETFATQANAAKDAALEAQAGANNAKQEAEAARYEAVMAKDDAVAAKRGAEAAQTAAQGS 1904 Query: 401 TRQASAAKSSATTASTKATEAAGSATAAAQSKSTAESAATRAETAAKRAEDIASAVALED 460 QA AAK+ A +TKA +A G A AA A++ A A+ K E A ++ Sbjct: 1905 ASQAEAAKTKAEEFATKAEQAKGEAEAAKLGAEQAQTVAVDAK--NKALEAQGKAEQAQN 1962 Query: 461 ASTTKKGIVQLSSATNSTSETLAATPKAVKSAYDNAEKRLQKDQNGADIPDKGCFLNNIN 520 + +G + + ++ A T K + K++ F N Sbjct: 1963 KAEEAQGKAEAAKDEAVAAQQGAVTAKNQAETARDGAVD-AKNKAEQAKSQAETFATQAN 2021 Query: 521 AVSKTDFADKRGMRYVRVNAPAGATSG 547 A + K R A A T+ Sbjct: 2022 AAKQDAVTAKNQAESARNEANAAKTAA 2048 Score = 116 bits (289), Expect = 7e-24, Method: Composition-based stats. 
Identities = 120/433 (27%), Positives = 160/433 (36%), Gaps = 49/433 (11%) Query: 109 MVEEVARNASAVAQNTAAAKKSASDASTSAREAATHAADAADSARAASTSAGQAASSAQS 168 E A A Q AK A DA T+A +A A +A + A +T A A A Sbjct: 1466 QAESFATQADTAKQGAETAKNQAEDAKTAALDAKQGAENAKNQAETFATQANTAKQGALE 1525 Query: 169 ASSSAGTASTKATEASKSAAAAESSKSAAATSAGAAKTSETNASASLQSAATSASTATTK 228 A A A A A A AA++ A A A+ A A+ Q++A+ A TA K Sbjct: 1526 AKDKAEQAKADAVAAKLGADAAQTLAVNAKDKALEAQGKAEAAQAAAQNSASQAQTAQNK 1585 Query: 229 ASEAATSARDAAASKEAAKSSETNASSSASSAASSATAAGNSAKAAKTSETNARSSETAA 288 A A A A A +AAK+ A + A A A A A+ AK A+ AA Sbjct: 1586 AEAAKLGAEQAKAQADAAKNQAVLAQAGAEQAKQDALDAQGKAEQAKQGADAAKDEAVAA 1645 Query: 289 GQSASAAAGSKTAAASSASAASTSAGQASASATAAGKSAESAASSASTATTKAGE----- 343 A A A A A A QA A AA AE A + A A KA E Sbjct: 1646 KTQAETFATQANTAKQGALEAKDKAEQAKGEAEAAKLGAEQAQTVAVDAKNKALEAQGKA 1705 Query: 344 -------------------------------------ATEQASAAARSASAAKTSETNAK 366 A QA AA A AK +AK Sbjct: 1706 EAAQAAAQNSASQAQTAQNKAEQAQVAAVAAQQGADTAKTQAEAARNEAVTAKNQAEDAK 1765 Query: 367 ASETSAESSKTAAASSASSAASSASSASASKDEATRQASAAKSSATTASTKATEAAGSAT 426 ++ A + A +A A A A D A QA +A+ A A T+A E A A Sbjct: 1766 -------TAALEAQNKAEAAKLGAEQAKAQADVAKNQAESARDEAVVAKTQAEEFATKAQ 1818 Query: 427 AAAQSKSTAESAATRAETAAKRAEDIASAVALEDASTTKKGIVQLSSATNSTSETLAATP 486 A + A+ A A+ A++A+ A A + + + + A N+ E AA Sbjct: 1819 TAQAAAVNAQQGAVEAKNKAEQAKSQAETFATQANAAKDAALEAQAGANNAKQEAEAARY 1878 Query: 487 KAVKSAYDNAEKR 499 +AV + D + Sbjct: 1879 EAVMAKDDAVAAK 1891 >UniRef50_Q5GAE0 Putative uncharacterized protein n=3 Tax=Singapore grouper iridovirus RepID=Q5GAE0_9VIRU Length = 1137 Score = 147 bits (371), Expect = 2e-33, Method: Composition-based stats. 
Identities = 126/380 (33%), Positives = 173/380 (45%) Query: 109 MVEEVARNASAVAQNTAAAKKSASDASTSAREAATHAADAADSARAASTSAGQAASSAQS 168 EE + A+ + A + A+DAS+ A EA A DA+ A A A +A+S A+ Sbjct: 426 KAEEADQKATDASSKAEEADQKATDASSKAEEADQKATDASSKAEEADQKATEASSKAEE 485 Query: 169 ASSSAGTASTKATEASKSAAAAESSKSAAATSAGAAKTSETNASASLQSAATSASTATTK 228 ASS A A KATEAS A A S A++ A A T A A++ A A K Sbjct: 486 ASSKAEEADQKATEASSKAEEASSKAEEASSKAEEADQKATEADQKATEASSKAEEADQK 545 Query: 229 ASEAATSARDAAASKEAAKSSETNASSSASSAASSATAAGNSAKAAKTSETNARSSETAA 288 A+EA++ A +A++ E A T A A+ A+S A A A A + A S A Sbjct: 546 ATEASSKAEEASSKAEEADQKATEADQKATEASSKAEEADQKATEASSKAEEASSKAEEA 605 Query: 289 GQSASAAAGSKTAAASSASAASTSAGQASASATAAGKSAESAASSASTATTKAGEATEQA 348 Q A+ A T A+S A A A +AS+ A A AE A A+ A KA EA ++A Sbjct: 606 DQKATEADQKATEASSKAEEADQKATEASSKAEEASSKAEEADQKATEADQKATEADQKA 665 Query: 349 SAAARSASAAKTSETNAKASETSAESSKTAAASSASSAASSASSASASKDEATRQASAAK 408 + A+ A A T A T A S A A+ A+S A A EA+ +A A Sbjct: 666 TEASSKAEEADQKATEADQKATEASSKAEEADQKATEASSKAEEADQKATEASSKAEEAS 725 Query: 409 SSATTASTKATEAAGSATAAAQSKSTAESAATRAETAAKRAEDIASAVALEDASTTKKGI 468 S A AS+KA EA+ A A+ A+ AT A + A+ A A + + K Sbjct: 726 SKAEEASSKAEEASSKAEEASSKAEEADQKATEASSKAEEASSKAEEADQKATEASSKAE 785 Query: 469 VQLSSATNSTSETLAATPKA 488 S A + + A+ KA Sbjct: 786 EASSKAEEADQKATEASSKA 805 Score = 146 bits (368), Expect = 5e-33, Method: Composition-based stats. 
Identities = 125/387 (32%), Positives = 178/387 (45%) Query: 107 ELMVEEVARNASAVAQNTAAAKKSASDASTSAREAATHAADAADSARAASTSAGQAASSA 166 + E + A + A + A++AS+ A EA++ A +A+ A A A +A A Sbjct: 473 DQKATEASSKAEEASSKAEEADQKATEASSKAEEASSKAEEASSKAEEADQKATEADQKA 532 Query: 167 QSASSSAGTASTKATEASKSAAAAESSKSAAATSAGAAKTSETNASASLQSAATSASTAT 226 ASS A A KATEAS A A S A A A T AS+ + A A+ A+ Sbjct: 533 TEASSKAEEADQKATEASSKAEEASSKAEEADQKATEADQKATEASSKAEEADQKATEAS 592 Query: 227 TKASEAATSARDAAASKEAAKSSETNASSSASSAASSATAAGNSAKAAKTSETNARSSET 286 +KA EA++ A +A A T ASS A A AT A + A+ A + A T Sbjct: 593 SKAEEASSKAEEADQKATEADQKATEASSKAEEADQKATEASSKAEEASSKAEEADQKAT 652 Query: 287 AAGQSASAAAGSKTAAASSASAASTSAGQASASATAAGKSAESAASSASTATTKAGEATE 346 A Q A+ A T A+S A A A +A AT A AE A A+ A++KA EA + Sbjct: 653 EADQKATEADQKATEASSKAEEADQKATEADQKATEASSKAEEADQKATEASSKAEEADQ 712 Query: 347 QASAAARSASAAKTSETNAKASETSAESSKTAAASSASSAASSASSASASKDEATRQASA 406 +A+ A+ A A + A + A S A+S A A A+ AS+ +EA+ +A Sbjct: 713 KATEASSKAEEASSKAEEASSKAEEASSKAEEASSKAEEADQKATEASSKAEEASSKAEE 772 Query: 407 AKSSATTASTKATEAAGSATAAAQSKSTAESAATRAETAAKRAEDIASAVALEDASTTKK 466 A AT AS+KA EA+ A A Q + A S A A+ A A A + + K Sbjct: 773 ADQKATEASSKAEEASSKAEEADQKATEASSKAEEADQKATEASSKAEEADQKATEASSK 832 Query: 467 GIVQLSSATNSTSETLAATPKAVKSAY 493 S A ++S+ A KA +++ Sbjct: 833 AEEASSKAEEASSKAEEADQKATEASS 859 Score = 145 bits (366), Expect = 8e-33, Method: Composition-based stats. 
Identities = 125/383 (32%), Positives = 175/383 (45%) Query: 109 MVEEVARNASAVAQNTAAAKKSASDASTSAREAATHAADAADSARAASTSAGQAASSAQS 168 EE + A+ Q A A +A A EA++ A +A+ A A A +A A Sbjct: 517 KAEEADQKATEADQKATEASSKAEEADQKATEASSKAEEASSKAEEADQKATEADQKATE 576 Query: 169 ASSSAGTASTKATEASKSAAAAESSKSAAATSAGAAKTSETNASASLQSAATSASTATTK 228 ASS A A KATEAS A A S A A A T AS+ + A A+ A++K Sbjct: 577 ASSKAEEADQKATEASSKAEEASSKAEEADQKATEADQKATEASSKAEEADQKATEASSK 636 Query: 229 ASEAATSARDAAASKEAAKSSETNASSSASSAASSATAAGNSAKAAKTSETNARSSETAA 288 A EA++ A +A A T A A+ A+S A A A A T A S A Sbjct: 637 AEEASSKAEEADQKATEADQKATEADQKATEASSKAEEADQKATEADQKATEASSKAEEA 696 Query: 289 GQSASAAAGSKTAAASSASAASTSAGQASASATAAGKSAESAASSASTATTKAGEATEQA 348 Q A+ A+ A A+ AS+ A +AS+ A A AE A+S A A++KA EA ++A Sbjct: 697 DQKATEASSKAEEADQKATEASSKAEEASSKAEEASSKAEEASSKAEEASSKAEEADQKA 756 Query: 349 SAAARSASAAKTSETNAKASETSAESSKTAAASSASSAASSASSASASKDEATRQASAAK 408 + A+ A A + A T A S A+S A A A+ AS+ +EA ++A+ A Sbjct: 757 TEASSKAEEASSKAEEADQKATEASSKAEEASSKAEEADQKATEASSKAEEADQKATEAS 816 Query: 409 SSATTASTKATEAAGSATAAAQSKSTAESAATRAETAAKRAEDIASAVALEDASTTKKGI 468 S A A KATEA+ A A+ A S A A+ A A A + + +K Sbjct: 817 SKAEEADQKATEASSKAEEASSKAEEASSKAEEADQKATEASSKAEEASSKAEEADQKAT 876 Query: 469 VQLSSATNSTSETLAATPKAVKS 491 S A ++S+ A KA ++ Sbjct: 877 EASSKAEEASSKAEEADQKATEA 899 Score = 141 bits (355), Expect = 1e-31, Method: Composition-based stats. 
Identities = 124/404 (30%), Positives = 178/404 (44%) Query: 107 ELMVEEVARNASAVAQNTAAAKKSASDASTSAREAATHAADAADSARAASTSAGQAASSA 166 + E + A+ + A + A++AS+ A EA++ A +A A A A +A+S A Sbjct: 564 DQKATEADQKATEASSKAEEADQKATEASSKAEEASSKAEEADQKATEADQKATEASSKA 623 Query: 167 QSASSSAGTASTKATEASKSAAAAESSKSAAATSAGAAKTSETNASASLQSAATSASTAT 226 + A A AS+KA EAS A A+ + A A A T AS+ + A A+ A Sbjct: 624 EEADQKATEASSKAEEASSKAEEADQKATEADQKATEADQKATEASSKAEEADQKATEAD 683 Query: 227 TKASEAATSARDAAASKEAAKSSETNASSSASSAASSATAAGNSAKAAKTSETNARSSET 286 KA+EA++ A +A A S A A+ A+S A A + A+ A + A S Sbjct: 684 QKATEASSKAEEADQKATEASSKAEEADQKATEASSKAEEASSKAEEASSKAEEASSKAE 743 Query: 287 AAGQSASAAAGSKTAAASSASAASTSAGQASASATAAGKSAESAASSASTATTKAGEATE 346 A A A T A+S A AS+ A +A AT A AE A+S A A KA EA+ Sbjct: 744 EASSKAEEADQKATEASSKAEEASSKAEEADQKATEASSKAEEASSKAEEADQKATEASS 803 Query: 347 QASAAARSASAAKTSETNAKASETSAESSKTAAASSASSAASSASSASASKDEATRQASA 406 +A A + A+ A + A T A S A+S A A+S A A EA+ +A Sbjct: 804 KAEEADQKATEASSKAEEADQKATEASSKAEEASSKAEEASSKAEEADQKATEASSKAEE 863 Query: 407 AKSSATTASTKATEAAGSATAAAQSKSTAESAATRAETAAKRAEDIASAVALEDASTTKK 466 A S A A KATEA+ A A+ A+ AT A+ A A A V T Sbjct: 864 ASSKAEEADQKATEASSKAEEASSKAEEADQKATEADQKATEASSKAEEVDKRLTKTEND 923 Query: 467 GIVQLSSATNSTSETLAATPKAVKSAYDNAEKRLQKDQNGADIP 510 + A+++ A + K+ + D+N A I Sbjct: 924 AAWAYTEASSAAVAAKTADDASKKAIAQTETNKNYIDENSAKIT 967 Score = 118 bits (295), Expect = 1e-24, Method: Composition-based stats. 
Identities = 106/332 (31%), Positives = 157/332 (47%) Query: 107 ELMVEEVARNASAVAQNTAAAKKSASDASTSAREAATHAADAADSARAASTSAGQAASSA 166 + E + A+ + A + A++A A EA++ A +A A AS+ A +A A Sbjct: 655 DQKATEADQKATEASSKAEEADQKATEADQKATEASSKAEEADQKATEASSKAEEADQKA 714 Query: 167 QSASSSAGTASTKATEASKSAAAAESSKSAAATSAGAAKTSETNASASLQSAATSASTAT 226 ASS A AS+KA EAS A A S A++ A A T AS+ + A++ A A Sbjct: 715 TEASSKAEEASSKAEEASSKAEEASSKAEEASSKAEEADQKATEASSKAEEASSKAEEAD 774 Query: 227 TKASEAATSARDAAASKEAAKSSETNASSSASSAASSATAAGNSAKAAKTSETNARSSET 286 KA+EA++ A +A++ E A T ASS A A AT A + A+ A T A S Sbjct: 775 QKATEASSKAEEASSKAEEADQKATEASSKAEEADQKATEASSKAEEADQKATEASSKAE 834 Query: 287 AAGQSASAAAGSKTAAASSASAASTSAGQASASATAAGKSAESAASSASTATTKAGEATE 346 A A A+ A A+ AS+ A +AS+ A A + A A+S A A++KA EA + Sbjct: 835 EASSKAEEASSKAEEADQKATEASSKAEEASSKAEEADQKATEASSKAEEASSKAEEADQ 894 Query: 347 QASAAARSASAAKTSETNAKASETSAESSKTAAASSASSAASSASSASASKDEATRQASA 406 +A+ A + A+ A + T E+ A + ASSAA +A +A + +A Q Sbjct: 895 KATEADQKATEASSKAEEVDKRLTKTENDAAWAYTEASSAAVAAKTADDASKKAIAQTET 954 Query: 407 AKSSATTASTKATEAAGSATAAAQSKSTAESA 438 K+ S K T A K+ E+A Sbjct: 955 NKNYIDENSAKITRLDPKIDALENFKTETETA 986 Score = 92.3 bits (227), Expect = 1e-16, Method: Composition-based stats. 
Identities = 76/284 (26%), Positives = 123/284 (43%) Query: 109 MVEEVARNASAVAQNTAAAKKSASDASTSAREAATHAADAADSARAASTSAGQAASSAQS 168 EE + A + A A +A A EA++ A +A+ A A A +A+S A+ Sbjct: 727 KAEEASSKAEEASSKAEEASSKAEEADQKATEASSKAEEASSKAEEADQKATEASSKAEE 786 Query: 169 ASSSAGTASTKATEASKSAAAAESSKSAAATSAGAAKTSETNASASLQSAATSASTATTK 228 ASS A A KATEAS A A+ + A++ A A T AS+ + A++ A A++K Sbjct: 787 ASSKAEEADQKATEASSKAEEADQKATEASSKAEEADQKATEASSKAEEASSKAEEASSK 846 Query: 229 ASEAATSARDAAASKEAAKSSETNASSSASSAASSATAAGNSAKAAKTSETNARSSETAA 288 A EA A +A++ E A S A A+ A+S A A + A+ A T A T A Sbjct: 847 AEEADQKATEASSKAEEASSKAEEADQKATEASSKAEEASSKAEEADQKATEADQKATEA 906 Query: 289 GQSASAAAGSKTAAASSASAASTSAGQASASATAAGKSAESAASSASTATTKAGEATEQA 348 A T + A+ A T A A+ +A A +++ A + T E + + Sbjct: 907 SSKAEEVDKRLTKTENDAAWAYTEASSAAVAAKTADDASKKAIAQTETNKNYIDENSAKI 966 Query: 349 SAAARSASAAKTSETNAKASETSAESSKTAAASSASSAASSASS 392 + A + +T + + + + + S +S + Sbjct: 967 TRLDPKIDALENFKTETETALAAIVADGDSITGGFRSEVTSFTK 1010 Score = 80.7 bits (197), Expect = 3e-13, Method: Composition-based stats. 
Identities = 66/257 (25%), Positives = 106/257 (41%) Query: 107 ELMVEEVARNASAVAQNTAAAKKSASDASTSAREAATHAADAADSARAASTSAGQAASSA 166 + E + A + A + A++AS+ A EA++ A +A A AS+ A +A A Sbjct: 753 DQKATEASSKAEEASSKAEEADQKATEASSKAEEASSKAEEADQKATEASSKAEEADQKA 812 Query: 167 QSASSSAGTASTKATEASKSAAAAESSKSAAATSAGAAKTSETNASASLQSAATSASTAT 226 ASS A A KATEAS A A S A++ A A T AS+ + A++ A A Sbjct: 813 TEASSKAEEADQKATEASSKAEEASSKAEEASSKAEEADQKATEASSKAEEASSKAEEAD 872 Query: 227 TKASEAATSARDAAASKEAAKSSETNASSSASSAASSATAAGNSAKAAKTSETNARSSET 286 KA+EA++ A +A++ E A T A A+ A+S A + A + + Sbjct: 873 QKATEASSKAEEASSKAEEADQKATEADQKATEASSKAEEVDKRLTKTENDAAWAYTEAS 932 Query: 287 AAGQSASAAAGSKTAAASSASAASTSAGQASASATAAGKSAESAASSASTATTKAGEATE 346 +A +A A + A + + SA T ++ + + T Sbjct: 933 SAAVAAKTADDASKKAIAQTETNKNYIDENSAKITRLDPKIDALENFKTETETALAAIVA 992 Query: 347 QASAAARSASAAKTSET 363 + + TS T Sbjct: 993 DGDSITGGFRSEVTSFT 1009 Score = 56.8 bits (135), Expect = 5e-06, Method: Composition-based stats. Identities = 84/393 (21%), Positives = 155/393 (39%), Gaps = 12/393 (3%) Query: 107 ELMVEEVARNASAVAQNTAAAKKSASDASTSAREAATHAADAADSARAASTSAGQAASSA 166 + + + + ++ + ++S + + A + A+ A + A Sbjct: 78 RAVSDAAYKALVGLEKSVSETRRSMVNIEEKTSDLTDVAKETLGKAQEAVAESQDAVVKM 137 Query: 167 QSASSSAGTASTKATEASKSAAAAESSKSAAATSAGAAKTSETNASASLQSAATSASTAT 226 + T + + A + AA+ S A+ + A + SE + + S + + Sbjct: 138 GDLTEKVKTLAVQPNVAEVANEAAKKSVEASQLANEALENSEEAVKKADDALCASENASR 197 Query: 227 TKASEAATSARDAAASKEAAKSSETNASSSASSAASSATAAGNSAKAAKTSETNARSSET 286 + A + + + + EAAKS+E A+ A A SSA A ++A A+ A + Sbjct: 198 SSAIASQKTEKALETAAEAAKSAEV-AALMAKIATSSANAVKDTADEAREKAEAANLAAD 256 Query: 287 AAGQSASAAAGSKTAAASSASAASTSAGQASASATAAGKSAESAASSASTATTKAGEATE 346 +A + A + AG A A A A AG+ A A AS A A + ++ Sbjct: 257 SAFKKADSVAGKAEEAEKKAVEAVAKADYVVGKIEEAGQRAYEADKKASDAIILASDVSK 316 Query: 347 QASAAARSASAAKTSETNAKASETSAESSKTAAASSASSAASSASSASASKDEATRQASA 406 + + A + A + +A A +A A + A S +A+ ++A+ +A A Sbjct: 317 KVESVADGVNNALDASNDASAKADAANRKAEEAFAKADSVTEKIDAAAKKAEDASEKAVA 
376 Query: 407 AKSSA-----------TTASTKATEAAGSATAAAQSKSTAESAATRAETAAKRAEDIASA 455 A ++A T T ATEA AT A+ A+ AT A + A+ A+ A+ Sbjct: 377 AAAAANDKAQTVLDMIQTVGTGATEADQKATEASSKAEEADQKATEASSKAEEADQKATD 436 Query: 456 VALEDASTTKKGIVQLSSATNSTSETLAATPKA 488 + + +K S A + + A+ KA Sbjct: 437 ASSKAEEADQKATDASSKAEEADQKATDASSKA 469 >UniRef50_Q9LA62 ORF-401-like protein n=1 Tax=Enterobacterial phage P-EibA RepID=Q9LA62_9CAUD Length = 479 Score = 145 bits (365), Expect = 1e-32, Method: Composition-based stats. Identities = 102/395 (25%), Positives = 149/395 (37%), Gaps = 3/395 (0%) Query: 1 MAVKISGVLKDGTGKPVQNCTIQLKAKRNSTTVVVNTLASENPDEAGRYSMDVEYGQYSV 60 M V+ISGVLKDGTGKPV CTI+LKA+R + TV+V T+A P+E G YS DVE G Y V Sbjct: 1 MTVRISGVLKDGTGKPVPGCTIELKARRTTETVIVTTVAQGQPEETGSYSFDVEPGWYRV 60 Query: 61 ILLVEGFPPSHAGTITVYEDSQPGTLNDFLGAMTEDDARPEALRRFELMVEEVARNASAV 120 L EG+ PS+ G I V DS+PGTLN FL E P+AL E + E+ + A A Sbjct: 61 TLNTEGYAPSYVGDILVKADSEPGTLNKFLMEQDEAQYYPKALAELEAVAAEILKRAEAS 120 Query: 121 AQNTAAAKKSASDASTSAREAATHAADAADSARAASTSAGQAASSAQSASSSA--GTAST 178 A + AKK A +A A E A A+ + + G + T Sbjct: 121 AASAEEAKKRAENARGPAGEKGDTGPQGATGAKGPAGATGAVGPKGEPGPKGERGETGPQ 180 Query: 179 KATEASKSAAAAESSKSAAATSAGAAKTSETNASASLQSAATSASTATTKASEAATSARD 238 K A + A + AT A A Sbjct: 181 GPKGDKGDPGGPPGPKGDTGPRGEAGPRPQGPAGQTGPKGDKGEPGATGPAGPAGPRGET 240 Query: 239 AAASKEAAKSSETNASSSASSAASSATAAGNSAKAAKTSETNARSSETAAGQSASAAAGS 298 A S + +++S + ++ +T ++ + + +A + Sbjct: 241 GPAGPAGPAGSVASVPDASTSQKGVVQLSSDTNSDDETKAATPKAVKAVMAEVQAAKTKA 300 Query: 299 KTAAASSASAASTSAGQASASATAAGKSAESAASSASTATTKAGEATEQASAAARSASAA 358 + AA +A + A G + + A+ A G ++ A A Sbjct: 301 EEAATRAAVPGPKGDRGEPGAPGAVGPAGPQGPAGAAGAKGAPGPKGDKGEPGATG-PAG 359 Query: 359 KTSETNAKASETSAESSKTAAASSASSAASSASSA 393 +T A A + S A + Sbjct: 360 PQGKTGPAGPRGPAGPQGAAGRNGNVSTEKYAVGS 394 Score = 57.2 bits (136), Expect = 4e-06, Method: Composition-based stats. 
Identities = 37/184 (20%), Positives = 55/184 (29%), Gaps = 21/184 (11%) Query: 310 STSAGQASASATAAGKSAESAASSASTATTKAGEATEQASAAARSASAAKTSETNAKASE 369 A A A G + A+ A G + + + Sbjct: 128 KKRAENARGPAGEKGDTGPQGATGAKGPAGATGAVGPKGEPGPKGERGETGPQGPKGDKG 187 Query: 370 TSAESSKTAAASSASSAASSASSASASKDEATRQASAAKSSATTASTKATEAAGSATAAA 429 + + + + AT AG A Sbjct: 188 DPGGPPGPKGDTGPRG--EAGPRPQGPAGQTGPKGD-------KGEPGATGPAGPAGPRG 238 Query: 430 QSKSTAESAATRAETAAKRAEDIASAVALEDASTTKKGIVQLSSATNSTSETLAATPKAV 489 ++ + S ++ DAST++KG+VQLSS TNS ET AATPKAV Sbjct: 239 ETGPAGPAGPA------------GSVASVPDASTSQKGVVQLSSDTNSDDETKAATPKAV 286 Query: 490 KSAY 493 K+ Sbjct: 287 KAVM 290 >UniRef50_B7MWN9 Putative tail fiber protein (GpH) n=2 Tax=Escherichia coli RepID=B7MWN9_ECO81 Length = 701 Score = 144 bits (363), Expect = 2e-32, Method: Composition-based stats. Identities = 94/319 (29%), Positives = 138/319 (43%), Gaps = 33/319 (10%) Query: 253 ASSSASSAASSATAAGNSAKAAKTSETNARSSETAAGQSASAAAGSKTAAASSASAASTS 312 ++ A S + ++ + + + + ++ + S Sbjct: 104 VANMAESYKPALAEGSGRSQTCRMVIIVSSVASVELTIDTTTVMATQDYVDDKIAEHEQS 163 Query: 313 AGQASASATAAG----KSAESAASSASTATTKA---------GEATEQASAAARSASAAK 359 AS TA G SA ++AS AT KA G+ T Q + AR Sbjct: 164 RRHPDASLTAKGFTQLSSATNSASETLAATPKAVKAAYDLANGKYTAQDATTARKGLVQL 223 Query: 360 TSETNAKASETSAESSKTAAASSASSAASSASSASASK--------------------DE 399 +S TN+ + +A AA ++ +A A+ ++ + Sbjct: 224 SSATNSTSETLAATPKAVKAAYDLANGKYTAQDATTARKGLVQLSSATNSDSETLAATPK 283 Query: 400 ATRQASAAKSSATTASTKATEAAGSATAAAQSKSTAESAATRAETAAKRAEDIASAVALE 459 A + A + TA T G ++ + S +E+ A + + + Sbjct: 284 AVKVAYDLANGKYTAQDATTARKGLVQLSSATNSDSETLAATPKAVKVAYDLANGKYTAQ 343 Query: 460 DASTTKKGIVQLSSATNSTSETLAATPKAVKSAYDNAEKRLQKDQNGADIPDKGCFLNNI 519 DA+T +KG+VQLSSATNS SETLAATPKAVKSAYDNAEKRLQKDQNGADIPDK FL NI Sbjct: 344 DATTARKGLVQLSSATNSDSETLAATPKAVKSAYDNAEKRLQKDQNGADIPDKRLFLRNI 403 Query: 520 NAVSKTDFADKRGMRYVRV 538 A + T + G + R+ Sbjct: 404 GATNSTTMSFSGGTGWFRL 422 Score = 52.6 bits (124), Expect = 1e-04, Method: 
Composition-based stats. Identities = 45/155 (29%), Positives = 63/155 (40%), Gaps = 5/155 (3%) Query: 395 ASKDEATRQASAAKSSATTASTKATEAAGSATAAAQSKSTAESAATRAETAAKRAEDIAS 454 A + A A + + T S+ A+ + + + + + Sbjct: 103 AVANMAESYKPALAEGSGRSQTCRMVIIVSSVASVELTIDTTTVMATQDYVDDKIAEHEQ 162 Query: 455 AVALEDASTTKKGIVQLSSATNSTSETLAATPKAVKSAYDNAEKRLQKDQNGADIPDKGC 514 + DAS T KG QLSSATNS SETLAATPKAVK+AYD A + Sbjct: 163 SRRHPDASLTAKGFTQLSSATNSASETLAATPKAVKAAYDLANGKYTAQDATTARKGLVQ 222 Query: 515 FLNNINAVSKTDFADKRGMRYVRVNAPAGATSGKY 549 + N+ S+T A + V A +GKY Sbjct: 223 LSSATNSTSETLAATPK-----AVKAAYDLANGKY 252 >UniRef50_Q38190 Gp37, tip of tail fiber (Fragment) n=5 Tax=Enterobacteria phage T4 RepID=Q38190_BPT4 Length = 226 Score = 141 bits (356), Expect = 1e-31, Method: Composition-based stats. Identities = 111/226 (49%), Positives = 132/226 (58%), Gaps = 59/226 (26%) Query: 953 TIKGKPASGRAVLSQEQDGIKSHTHSASASSTDLGTKTTSSFDYGTKSTNNTGAHTHSVS 1012 TIKGKP SGRAVLS E DG+K+H+HSASASSTDLGTKTTSSFDYGTK TN+TG HTHS S Sbjct: 1 TIKGKP-SGRAVLSAEADGVKAHSHSASASSTDLGTKTTSSFDYGTKGTNSTGGHTHSGS 59 Query: 1013 GSTNSAGAHTHSLANVNTASA------------NSGAGSASTRLSVVHNQNYATSSAGAH 1060 GST++ G H+H + N +G + + + H ++ TSSAG H Sbjct: 60 GSTSTNGEHSHYIEAWNGTGVGGNKMSSYAISYRAGGSNTNAAGNHSHTFSFGTSSAGDH 119 Query: 1061 THSL----------------------------------------------SGTAASAGAH 1074 +HS+ S +SAG H Sbjct: 120 SHSVGIGEHSHYIEAWNGTGVGGNKMSSYAISYRAGGSNTNAAGNHSHTFSFGTSSAGDH 179 Query: 1075 AHTVGIGAHTHSVAIGSHGHTITVNAAGNAENTVKNIAFNYIVRLA 1120 +H+VGIGAHTH+VAIGSHGHTITVN+ GN ENTVKNIAFNYIV LA Sbjct: 180 SHSVGIGAHTHTVAIGSHGHTITVNSTGNTENTVKNIAFNYIVALA 225 >UniRef50_C4Y8G3 Putative uncharacterized protein n=1 Tax=Clavispora lusitaniae ATCC 42720 RepID=C4Y8G3_CLAL4 Length = 653 Score = 139 bits (350), Expect = 5e-31, Method: Composition-based stats. 
Identities = 69/310 (22%), Positives = 153/310 (49%), Gaps = 2/310 (0%) Query: 128 KKSASDASTSAREAATHAADAADSARAASTSAGQAASSAQSASSSAGTASTKATEASKSA 187 + ++D + + +++ S + A S + S+ +++T+ +E S Sbjct: 336 ETKSTDIPIKPSDVPSIPSNSVTSTTEVPGFSSSAVPSKSTDSTDVSSSATQPSETSSKP 395 Query: 188 AAAESSKSAAATSAGAAKTSETNASASLQSAATSASTATTKASEAATSARDAAASKEAAK 247 + S + ++ + + T T++ S S+ S ++ +T A+ + S S+ ++K Sbjct: 396 SQTSSKPNTSSKPSTSESTDATSSKPSETSSKPSTTSESTDATSSKPSETTTQPSQTSSK 455 Query: 248 SSETNASSSASSAASSATAAGNSAKAAKTSETNARSSETAAGQSASAAAGSKTAAASSAS 307 SET + S +S+ S T + S ++K S+T+++ SET++ S +++ S+T++ S + Sbjct: 456 PSETTSQPSQTSSKPSETTSQPSQTSSKPSQTSSKPSETSSKPSETSSKPSETSSKPSET 515 Query: 308 AASTSAGQASASATAAGKSAESAASSASTATTKAGEATEQASAAARSASAAKTSETNAKA 367 ++ S + S T++ S S+ S +++ + +++ S +++K SET++K Sbjct: 516 SSKPSETSSKPSQTSSKPSETSSKPSETSSKPSETSSKPSETSSKPSQTSSKPSETSSKP 575 Query: 368 SETSAESSKTAAASSASSAASSASSASASKDEATRQASAAKSSATTASTKATEAAGSATA 427 SETS++ S+T++ S +S+ S +S+ S E T Q + T S +T + T Sbjct: 576 SETSSKPSQTSSKPSETSSKPSETSSKPS--ETTSQPYTTSQPSETTSQPSTTSQPDTTQ 633 Query: 428 AAQSKSTAES 437 + +KS Sbjct: 634 PSTTKSGFPQ 643 Score = 131 bits (328), Expect = 2e-28, Method: Composition-based stats. 
Identities = 70/317 (22%), Positives = 156/317 (49%), Gaps = 7/317 (2%) Query: 156 STSAGQAASSAQSASSSAGTASTKATEASKSAAAAESSKSAAATSAGAAKTSETNASASL 215 T + S + T TE +++A SKS +T ++ T + S+ Sbjct: 336 ETKSTDIPIKPSDVPSIPSNSVTSTTEVPGFSSSAVPSKSTDSTDVSSSATQPSETSSKP 395 Query: 216 QSAATSASTATTKASEAATSARDAAASKEAAKSSETNASSSASSAASSATAAGNSAKAAK 275 ++ +T++ ++ +T A + S+ ++K S T+ S+ A+S+ S T S ++K Sbjct: 396 SQTSSKPNTSSKPSTSESTDATSSKPSETSSKPSTTSESTDATSSKPSETTTQPSQTSSK 455 Query: 276 TSETNARSSETAAGQSASAAAGSKTAAASSASAASTSAGQASASATAAGKSAESAASSAS 335 SET ++ S+T++ S + + S+T++ S +++ S + S T++ S S+ S + Sbjct: 456 PSETTSQPSQTSSKPSETTSQPSQTSSKPSQTSSKPSETSSKPSETSSKPSETSSKPSET 515 Query: 336 TATTKAGEATEQASAAARSASAAKTSETNAKASETSAESSKTAAASSASSAASSASSASA 395 ++ + +++ S +++K SET++K SETS++ S+T++ S +S+ S +S+ Sbjct: 516 SSKPSETSSKPSQTSSKPSETSSKPSETSSKPSETSSKPSETSSKPSQTSSKPSETSSK- 574 Query: 396 SKDEATRQASAAKSSATTASTKATEAAGSATAAAQSKSTAESAATRAETAAKRAEDIASA 455 + +++ + + + T+S + ++ + +Q +T++ + T ++ + D Sbjct: 575 PSETSSKPSQTSSKPSETSSKPSETSSKPSETTSQPYTTSQPSETTSQPSTTSQPDTT-- 632 Query: 456 VALEDASTTKKGIVQLS 472 STTK G Q S Sbjct: 633 ----QPSTTKSGFPQPS 645 Score = 119 bits (297), Expect = 7e-25, Method: Composition-based stats. 
Identities = 58/292 (19%), Positives = 131/292 (44%), Gaps = 36/292 (12%) Query: 113 VARNASAVAQNTAAAKKSASDASTSAREAATHAADAADSARAASTSAGQAASSAQSASSS 172 + + ++ + + K + + S E+ T A + S ++ S ++ A S+ S Sbjct: 385 ATQPSETSSKPSQTSSKPNTSSKPSTSES-TDATSSKPSETSSKPSTTSESTDATSSKPS 443 Query: 173 AGTASTKATEASKSAAAAESSKSAAATSAGAAKTSETNASASLQSAATSASTATTKASEA 232 T T + S ++ S++++ S ++ S+T++ S S+ S +++ + + Sbjct: 444 ETTTQPSQTSSKPSETTSQPSQTSSKPSETTSQPSQTSSKPSQTSSKPSETSSKPSETSS 503 Query: 233 ATSARDAAASKEAAKSSETNASSSASSAASSATAAGNSAKAAKTSETNARSSETAAGQSA 292 S + S+ ++K SET++ S +S+ S T++ S ++K SET+++ SET++ S Sbjct: 504 KPSETSSKPSETSSKPSETSSKPSQTSSKPSETSSKPSETSSKPSETSSKPSETSSKPSQ 563 Query: 293 SAAAGSKTAAASSASAASTSAGQASASATAAGKSAESAASSASTATTKAGEATEQASAAA 352 +++ S+T++ S +++ S + S T++ S Sbjct: 564 TSSKPSETSSKPSETSSKPSQTSSKPSETSSKPS-------------------------- 597 Query: 353 RSASAAKTSETNAKASETSAESSKTAAASSASSAASSASSASASKDEATRQA 404 ET++K SET+++ T+ S +S S+ S ++ T+ Sbjct: 598 ---------ETSSKPSETTSQPYTTSQPSETTSQPSTTSQPDTTQPSTTKSG 640 >UniRef50_C5H7L2 Putative tail fiber protein GP37 n=3 Tax=unclassified Myoviridae RepID=C5H7L2_9CAUD Length = 391 Score = 138 bits (348), Expect = 1e-30, Method: Composition-based stats. 
Identities = 79/217 (36%), Positives = 107/217 (49%), Gaps = 24/217 (11%) Query: 895 TSKNLPPESYPVGAPIPWPSDTVPSGYALMQGQAFDKSAYPKLAAAYPSGVIPDMRGWTI 954 SYPVG + PS Y L G ++ + + Y + P Sbjct: 148 NLYKAIQASYPVGTIHLSVNSANPSTYLLCGG-TWELVSKGRALVGYDTDSRPVG----- 201 Query: 955 KGKPASGRAVLSQEQDGIKSHTHSA-------------SASSTDLGTKTTSSFDYGTKST 1001 G ++ + + +HTHS S SS D G+K+TS+FDYGTK+T Sbjct: 202 ---STFGSQTVALTNNNLPAHTHSIYLTGGGHTHSASVSISSFDYGSKSTSTFDYGTKTT 258 Query: 1002 NNTGAHTHSVSGSTNSAGAHTHSLANVNTASANSGAGSASTRLSVVHNQNYATSSAGAHT 1061 N+ GAHTH+ SG+T++AG H H + + G + + T AGAHT Sbjct: 259 NSAGAHTHTFSGTTSNAGNHNHRV--PMRGNDRGGTNAITASADAGVGNAMYTDLAGAHT 316 Query: 1062 HSLSGTAASAGAHAHTVGIGAHTHSVAIGSHGHTITV 1098 HS SGT AS+GAH+HTV IGAH+H+V IGSH HT TV Sbjct: 317 HSFSGTTASSGAHSHTVAIGAHSHTVNIGSHSHTGTV 353 >UniRef50_C5DLU8 KLTH0G03696p n=1 Tax=Lachancea thermotolerans CBS 6340 RepID=C5DLU8_LACTC Length = 2085 Score = 138 bits (347), Expect = 1e-30, Method: Composition-based stats. Identities = 95/391 (24%), Positives = 187/391 (47%), Gaps = 17/391 (4%) Query: 111 EEVARNASAVAQNTAAAKKSASDASTSAREAATHAADAADSARAASTSAGQAASSAQSAS 170 +E + A A AQ T ++ S+ +TS +E++T A +A S + TS+ +S Q +S Sbjct: 248 QESSTEAGASAQLTEESQTSSPTRTTSGQESSTEAGASAQSTEESQTSSPTRTTSGQESS 307 Query: 171 SSAGTASTKATEASKSAAAAESSKSAAATSAGAAKTSETNASASLQSAATSASTATTKAS 230 TEA SA + E S++++ + + S T A AS QS S +++ T+ + Sbjct: 308 ----------TEAGASAQSTEESQTSSPIGITSGQESSTEAGASAQSTEESQTSSPTRTT 357 Query: 231 EAATSARDAAASKEAAKSSETNASSSASSAASSATAAGNSAKAAKTSETNARSSETAAGQ 290 S+ +A AS ++ + S+T++ +S S+T AG SA++ + S+T++ T+ + Sbjct: 358 SGQESSTEAGASAQSTEESQTSSPIGITSGQESSTEAGASAQSTEESQTSSPIGITSGQE 417 Query: 291 SASAAAGSKTAAASSASAASTSAGQASASATAAGKSAESAASSASTATTKAGEATEQASA 350 S++ A S + S +++ T S+T AG SA+S S +++ T+ E ++ Sbjct: 418 SSTEAGASAQSTEESQTSSPTRTTSGQESSTEAGASAQSTEESQTSSPTRTTSGQESSTE 477 Query: 351 AARSASAAKTSETNAKASETSAESSKTAAASSASSAASSASSA---SASKDEATRQASAA 407 A SA + + S+T++ TS + S T A + A S S +S+ + S E++ +A + Sbjct: 478 
AGASAQSTEESQTSSPIGITSGQESSTEAGARAQSTEESQTSSPTRTTSGQESSTKAGTS 537 Query: 408 KSSATTASTKATEAAGSATAAAQSKSTAESAATRAETAAKRAEDIASAVALEDASTTKKG 467 SA S +T + A + S+S+ + E+ + +S T++ Sbjct: 538 SQSALGTSVHSTTGSPLGGANSYSQSSYSVPRSSDESYPLSQYNPSSTSIF----TSEPY 593 Query: 468 IVQLSSATNSTSETLAATPKAVKSAYDNAEK 498 ++ ++ ET+ ++ + + Sbjct: 594 GTAATTLSDGEPETITSSFISARPTSTVPSP 624 Score = 108 bits (268), Expect = 2e-21, Method: Composition-based stats. Identities = 82/399 (20%), Positives = 174/399 (43%), Gaps = 21/399 (5%) Query: 111 EEVARNASAVAQNTAAAKKSASDASTSAREAATHAADAADSARAASTSAGQAASSAQSAS 170 +E + A A AQ+T ++ S+ TS +E++T A +A S + TS+ +S Q +S Sbjct: 304 QESSTEAGASAQSTEESQTSSPIGITSGQESSTEAGASAQSTEESQTSSPTRTTSGQESS 363 Query: 171 SSAGTASTK------------------ATEASKSAAAAESSKSAAATSAGAAKTSETNAS 212 + AG ++ +TEA SA + E S++++ + + S T A Sbjct: 364 TEAGASAQSTEESQTSSPIGITSGQESSTEAGASAQSTEESQTSSPIGITSGQESSTEAG 423 Query: 213 ASLQSAATSASTATTKASEAATSARDAAASKEAAKSSETNASSSASSAASSATAAGNSAK 272 AS QS S +++ T+ + S+ +A AS ++ + S+T++ + +S S+T AG SA+ Sbjct: 424 ASAQSTEESQTSSPTRTTSGQESSTEAGASAQSTEESQTSSPTRTTSGQESSTEAGASAQ 483 Query: 273 AAKTSETNARSSETAAGQSASAAAGSKTAAASSASAASTSAGQASASATAAGKSAESAAS 332 + + S+T++ T+ +S++ A + S +++ T S+T AG S++SA Sbjct: 484 STEESQTSSPIGITSGQESSTEAGARAQSTEESQTSSPTRTTSGQESSTKAGTSSQSALG 543 Query: 333 SASTATTKAGEATEQASAAARSASAAKTSETNAKASETSAESSKTAAASSASSAASSASS 392 ++ +TT + + + +S + ++ + S T+ +S ++ + Sbjct: 544 TSVHSTTGSPLGGANSYS---QSSYSVPRSSDESYPLSQYNPSSTSIFTSEPYGTAATTL 600 Query: 393 ASASKDEATRQASAAKSSATTASTKATEAAGSATAAAQSKSTAESAATRAETAAKRAEDI 452 + + T +A+ ++T S T S T + S ++A+ ++ + E Sbjct: 601 SDGEPETITSSFISARPTSTVPSPSITTFISSTTTGSPLPSGTFNSASTSQVVTRSPEGS 660 Query: 453 ASAVALEDASTTKKGIVQLSSATNSTSETLAATPKAVKS 491 + ++ + S T+ Sbjct: 661 SIPETSSSSTMSSTSQPSRLSFRTEDMTTMLGFSSGTSE 699 Score = 77.3 bits (188), Expect = 3e-12, Method: Composition-based stats. 
Identities = 67/361 (18%), Positives = 143/361 (39%), Gaps = 16/361 (4%) Query: 111 EEVARNASAVAQNTAAAKKSASDASTSAREAATHAADAADSARAASTSAGQAASSAQSAS 170 +E + A A AQ+T ++ S+ +TS +E++T A +A S + TS+ +S Q +S Sbjct: 416 QESSTEAGASAQSTEESQTSSPTRTTSGQESSTEAGASAQSTEESQTSSPTRTTSGQESS 475 Query: 171 SSAGTASTKATEASKSAAAAESSKSAAATSAGAAKTSETNASASLQSAATSASTATTKAS 230 TEA SA + E S++++ + + S T A A QS S +++ T+ + Sbjct: 476 ----------TEAGASAQSTEESQTSSPIGITSGQESSTEAGARAQSTEESQTSSPTRTT 525 Query: 231 EAATSARDAAASKEAAKS----SETNASSSASSAASSATAAGNSAKAAKTSETNARSSET 286 S+ A S ++A S T + +++ S ++ + + + S T Sbjct: 526 SGQESSTKAGTSSQSALGTSVHSTTGSPLGGANSYSQSSYSVPRSSDESYPLSQYNPSST 585 Query: 287 AAGQSASAAAGSKTAAASSASAASTSAGQASASATAAGKSAESAASSASTATTKAGEATE 346 + S + T + ++S A ++T S + SS +T + Sbjct: 586 SIFTSEPYGTAATTLSDGEPETITSSFISARPTSTVPSPSITTFISSTTTGSPLPSGTFN 645 Query: 347 QASAAARSASAAKTSETNAKASETSAESSKTAAASSASSAASSASSASASKDEATRQASA 406 AS ++ + +S+ S T+ S S ++ + S+ Sbjct: 646 SAS--TSQVVTRSPEGSSIPETSSSSTMSSTSQPSRLSFRTEDMTTMLGFSSGTSEHLSS 703 Query: 407 AKSSATTASTKATEAAGSATAAAQSKSTAESAATRAETAAKRAEDIASAVALEDASTTKK 466 ++++ + ++ G++T++ +++ S A + ++ A T Sbjct: 704 SRTAGVPTDSGSSSNPGASTSSPNDQTSINSQPVTATSPQPSYTSLSPTYESRSAQTMTS 763 Query: 467 G 467 G Sbjct: 764 G 764 Score = 52.6 bits (124), Expect = 8e-05, Method: Composition-based stats. 
Identities = 84/449 (18%), Positives = 159/449 (35%), Gaps = 52/449 (11%) Query: 111 EEVARNASAVAQNTAAAKKSASDASTSAREAATHAADAADSARAASTSAGQAASSAQSAS 170 +E + A A AQ+T ++ S+ +TS +E++T A +A S + TS+ +S Q +S Sbjct: 444 QESSTEAGASAQSTEESQTSSPTRTTSGQESSTEAGASAQSTEESQTSSPIGITSGQESS 503 Query: 171 SSAGTASTKATEASKSAAAAESSKSAAATSAGAAKTSETNASASLQSAAT---------- 220 + AG + E+ S+ +S ++T AG + S S + + Sbjct: 504 TEAGARAQSTEESQTSSPTRTTSGQESSTKAGTSSQSALGTSVHSTTGSPLGGANSYSQS 563 Query: 221 ------------SASTATTKASEAATSARDAAASKEAAKSSETNASSSASSAASSATAAG 268 S ++ TS A+ + +SS SA ++T Sbjct: 564 SYSVPRSSDESYPLSQYNPSSTSIFTSEPYGTAATTLSDGEPETITSSFISARPTSTVPS 623 Query: 269 NSAKAAKTSETNARSSETAAGQSASAAAGSKTAAASSASAASTSAGQASASATAAGKSAE 328 S +S T + SAS + + S+ ++S+ S+++ + S Sbjct: 624 PSITTFISSTTTGSPLPSGTFNSASTSQVVTRSPEGSSIPETSSSSTMSSTSQPSRLSFR 683 Query: 329 S--------AASSASTATTKAGEATEQASAAARSASAAKTSETNAKASETSAESSKTAAA 380 + +S S + + A + + S A TS N + S S + T+ Sbjct: 684 TEDMTTMLGFSSGTSEHLSSSRTAGVPTDSGSSSNPGASTSSPNDQTSINSQPVTATSPQ 743 Query: 381 SSASS-----AASSASSASASKDEATRQASAAKSSATTASTKATEAAGSATAAAQSKSTA 435 S +S + SA + ++ + S + +++ T +Q S Sbjct: 744 PSYTSLSPTYESRSAQTMTSGALTSVPGEVLPSSLSASSAPLTETPVPGRTDFSQVTSLY 803 Query: 436 ESAATRAETAAKRAEDIASAVALEDASTTKKGIVQLS--------------SATNST--- 478 S E + + S+ A E +T+ + + SAT S Sbjct: 804 SSVMHSNENPGRSGNLVTSSAAPEVGTTSSLSFITSAYSEISVSSSGPAAISATPSRVPG 863 Query: 479 SETLAATPKAVKSAYDNAEKRLQKDQNGA 507 S + + TP + + QK Q A Sbjct: 864 SISSSQTPSPSEPVVLMTSQSSQKSQTTA 892 >UniRef50_Q0I4L1 Putative uncharacterized protein n=1 Tax=Haemophilus somnus 129PT RepID=Q0I4L1_HAES1 Length = 762 Score = 130 bits (327), Expect = 3e-28, Method: Composition-based stats. 
Identities = 88/295 (29%), Positives = 138/295 (46%), Gaps = 20/295 (6%) Query: 205 KTSETNASASLQSAATSASTATTKASEAATSARDAAASKEAAKSSETNASSSASSAASSA 264 KT+E + A+ A T + + R A S E A+ ++T AS SA+ A + Sbjct: 72 KTAEQSVYAARDKVTELAQQVTDNNATTEQNTRLAIQSNEQAQQAKTEASQSATQATKAN 131 Query: 265 TAAGNSAKAAKTSETNARSSETAAGQSASAAAGSKTAAASSASAASTSAGQASASATAAG 324 A + A + A+ ++T A QSA+ A +KT A+ SA+ A+ + QA + T A Sbjct: 132 RQAQQAKTEATEANRQAQQAKTEASQSATQAQQAKTEASQSATQATKANRQAQQAKTEAS 191 Query: 325 KSAESAASSASTATTKAGEATEQASAAARSASAAKTSETNAKASETSAESSKTAAASSAS 384 +SA A A T+A ++ QA+ R A AKT T A A+ +KT A ++ Sbjct: 192 QSAT----QAQQAKTEASQSATQATEVNRQAQQAKTEATEAN---RQAQQAKTKAENAER 244 Query: 385 SAASSASSASASKDEATRQASAAKSSATTASTKATEAAGSATAAAQSKSTAESAATRAET 444 A SS + S +AT+ + AK A+ + + + K +A A AE Sbjct: 245 IATSSIKTVQQSAIQATQAENLAKKWASNPQNQIVQ---------EDKYSAYHYALEAEK 295 Query: 445 AAKRAEDIASAVALEDASTTKKGIVQLSSATNSTSETLAATPKAVKSAYDNAEKR 499 A++ + A+A +D V +S N +SET A+ +VK+AYD AEK Sbjct: 296 YAEKVK--ATAEGRKDWQFIDN--VPISKDVNDSSETNLASAFSVKTAYDRAEKA 346 Score = 128 bits (321), Expect = 1e-27, Method: Composition-based stats. 
Identities = 82/337 (24%), Positives = 140/337 (41%), Gaps = 6/337 (1%) Query: 109 MVEEVARNASAVAQNTAAAKKSASDASTSAREAATHAADAADSARAASTSAGQAASSAQS 168 +++ ++ A + +D + + + A + + A+ A T A Q+A+ A Sbjct: 70 LLKTAEQSVYAARDKVTELAQQVTDNNATTEQNTRLAIQSNEQAQQAKTEASQSATQATK 129 Query: 169 ASSSAGTASTKATEASKSAAAAESSKSAAATSAGAAKTSETNASASLQSAATSASTATTK 228 A+ A A T+ATEA++ A A++ S +AT A AKT + ++ A A A T+ Sbjct: 130 ANRQAQQAKTEATEANRQAQQAKTEASQSATQAQQAKTEASQSATQATKANRQAQQAKTE 189 Query: 229 ASEAATSARDAAASKEAAKSSETNASSSASSAASSATAAGNSAKAAKTSETNARSSETAA 288 AS++AT A+ A + + T + A A + AT A A+ AKT NA T++ Sbjct: 190 ASQSATQAQQAKTEASQSATQATEVNRQAQQAKTEATEANRQAQQAKTKAENAERIATSS 249 Query: 289 GQSASAAAGSKTAAASSAS--AASTSAGQASASATAAGKSAESAASSASTATTKAGEATE 346 ++ +A T A + A A++ +A A A A A + Sbjct: 250 IKTVQQSAIQATQAENLAKKWASNPQNQIVQEDKYSAYHYALEAEKYAEKVKATAEGRKD 309 Query: 347 ----QASAAARSASAAKTSETNAKASETSAESSKTAAASSASSAASSASSASASKDEATR 402 ++ + + + + S +A A + A+ A A A EA R Sbjct: 310 WQFIDNVPISKDVNDSSETNLASAFSVKTAYDRAEKAKTEATKANRQAQQAKTEATEANR 369 Query: 403 QASAAKSSATTASTKATEAAGSATAAAQSKSTAESAA 439 QA AK+ AT A +A +A A + K+T Sbjct: 370 QAQQAKTEATEAKRQAQQAKRLAESKQSPKTTLAGYG 406 Score = 106 bits (265), Expect = 4e-21, Method: Composition-based stats. 
Identities = 77/320 (24%), Positives = 136/320 (42%), Gaps = 19/320 (5%) Query: 86 LNDFLGAMTEDDARPEALRRFELMVEEVARNASAVA-----------QNTAAAKKSASDA 134 + + +T+++A E R + E A+ A A + AK A++A Sbjct: 85 VTELAQQVTDNNATTEQNTRLAIQSNEQAQQAKTEASQSATQATKANRQAQQAKTEATEA 144 Query: 135 STSAREAATHAADAADSARAASTSAGQAASSAQSASSSAGTASTKATEASKSAAAAESSK 194 + A++A T A+ +A A+ A T A Q+A+ A A+ A A T+A++++ A A++ Sbjct: 145 NRQAQQAKTEASQSATQAQQAKTEASQSATQATKANRQAQQAKTEASQSATQAQQAKTEA 204 Query: 195 SAAATSAGA-------AKTSETNASASLQSAATSASTATTKASEAATSARDAAASKEAAK 247 S +AT A AKT T A+ Q A T A A A+ + + + +A A+ Sbjct: 205 SQSATQATEVNRQAQQAKTEATEANRQAQQAKTKAENAERIATSSIKTVQQSAIQATQAE 264 Query: 248 SSETNASSSASSAASSATAAGNSAKAAKTSETNARSSETAAGQSASAAAGSKTAAASSAS 307 + +S+ + A + + + TA G+ + + Sbjct: 265 NLAKKWASNPQNQIVQEDKYSAYHYALEAEKYAEKVKATAEGRKDWQFIDNVPISKDVND 324 Query: 308 AASTSAGQASASATAAGKSAESAASSASTATTKAGEATEQASAAARSASAAKTSETNAKA 367 ++ T+ A + T A AE A + A+ A +A +A +A+ A R A AKT T AK Sbjct: 325 SSETNLASAFSVKT-AYDRAEKAKTEATKANRQAQQAKTEATEANRQAQQAKTEATEAKR 383 Query: 368 SETSAESSKTAAASSASSAA 387 A+ + S ++ A Sbjct: 384 QAQQAKRLAESKQSPKTTLA 403 Score = 96.5 bits (238), Expect = 5e-18, Method: Composition-based stats. 
Identities = 84/347 (24%), Positives = 141/347 (40%), Gaps = 23/347 (6%) Query: 156 STSAGQAASSAQSASSSAGTASTKATEASKSAAAAESSKSAAATSAGAAKTSETNASASL 215 T+ ++ + A + ++ A S A + A S T A+ + Sbjct: 72 KTAEQSVYAARDKVTELAQQVTDNNATTEQNTRLAIQSNEQAQQAKTEASQSATQATKAN 131 Query: 216 QSAATSASTATTKASEAATSARDAAASKEAAKSSETNASSSASSAASSATAAGNSAKAAK 275 + A + + AT +A + +A+ S A+ ++T AS SA+ A + A + A Sbjct: 132 RQAQQAKTEATEANRQAQQAKTEASQSATQAQQAKTEASQSATQATKANRQAQQAKTEAS 191 Query: 276 TSETNARSSETAAGQSASAAAGSKTAAASSASAASTSAGQASASATAAGKSAESAASSAS 335 S T A+ ++T A QSA+ A A + + A+ + QA + T A + A SS Sbjct: 192 QSATQAQQAKTEASQSATQATEVNRQAQQAKTEATEANRQAQQAKTKAENAERIATSSIK 251 Query: 336 TATTKAGEATEQASAAARSASAAKTSETN---------AKASETSAESSKTAAA------ 380 T A +AT+ + A + AS + A +E AE K A Sbjct: 252 TVQQSAIQATQAENLAKKWASNPQNQIVQEDKYSAYHYALEAEKYAEKVKATAEGRKDWQ 311 Query: 381 -------SSASSAASSASSASA-SKDEATRQASAAKSSATTASTKATEAAGSATAAAQSK 432 S + +S + ASA S A +A AK+ AT A+ +A +A AT A + Sbjct: 312 FIDNVPISKDVNDSSETNLASAFSVKTAYDRAEKAKTEATKANRQAQQAKTEATEANRQA 371 Query: 433 STAESAATRAETAAKRAEDIASAVALEDASTTKKGIVQLSSATNSTS 479 A++ AT A+ A++A+ +A + + GI T S Sbjct: 372 QQAKTEATEAKRQAQQAKRLAESKQSPKTTLAGYGITDFQVKTGSGD 418 >UniRef50_A9ITY7 Putative uncharacterized protein n=2 Tax=Bartonella RepID=A9ITY7_BART1 Length = 1077 Score = 130 bits (325), Expect = 4e-28, Method: Composition-based stats. 
Identities = 100/421 (23%), Positives = 180/421 (42%), Gaps = 34/421 (8%) Query: 110 VEEVARNASAVAQNTAAAKKSASDASTSAREAATHAADAADSARAASTSAGQAASSAQSA 169 + + V Q + A +++A A E+ T A A +AR AS +A + A A Sbjct: 150 LATALQRFEEVKQVSENAVNISTEAKRLADESKTIATRAEQTAREASQTATETTQVAAKA 209 Query: 170 SSSAGTASTKATEASKSAAAAESSKSAAATSAGAAKTSETNASASLQSAATSASTATTKA 229 ++ T AT+AS A A+ + A A AK ++S+ + S T Sbjct: 210 VATCHEVKTVATQASLKADGAKQTADDAKDIAEKAKELSEGTTSSITELTKTTSQVQTAV 269 Query: 230 SEAATSARDAAASKEAAKSSETNASSSASSAASSATAAGNSAKAAKTSETNARSSETAAG 289 +A T RDA K+ A++S+T + + +AA + TA+ ++ ++T+ + + ++ + Sbjct: 270 EKALTDLRDA---KQIAEASKTLSEEAKQTAADALTASKDAKSQSETAVSRSEEAKALSE 326 Query: 290 QSASAAAGSKTAAASS---ASAASTSAGQASASATAAGKSAESAASSASTATTKAGEAT- 345 QS A K + AS A AA T QAS +A+ A AE+A S+A +AT KA +A Sbjct: 327 QSKGACDEFKASVASVEKVAEAAKTGVEQASQTASEAKGIAETAKSTADSATAKAEQAQQ 386 Query: 346 ----------------EQASAAARSASAAKTSETNAKASETSAESSKTAAASSASSAASS 389 EQA A R A + T + A A Sbjct: 387 EASEASRLASEAKVVAEQALQADRQAVREGSESTKSLVE---------AVQKKTEEAERV 437 Query: 390 ASSASASKDEATRQASAAKSSATTASTKATEAAGSATAAAQSKSTAESAATRAETAAKRA 449 A + +E + A+ AK+++T A T+AT A A+ A + + ++ + + A++A Sbjct: 438 AQDSKRVCEETKQLATEAKNASTNALTEATAAKEKASRALTTVNDVKNISEEVKGLAEKA 497 Query: 450 EDIASAVALEDASTTKKGIVQLSSA--TNSTSETLAATPKAVKSAYDNAEKRLQKDQNGA 507 ++ ++ ++A +ST+ + + ++ + A+ Q +N Sbjct: 498 SRASTEAQKTSDQALREATSAKTTADMASSTATDAKGSAEQAQTVSEEAKTLAQTSKNAC 557 Query: 508 D 508 D Sbjct: 558 D 558 Score = 129 bits (324), Expect = 6e-28, Method: Composition-based stats. 
Identities = 99/402 (24%), Positives = 156/402 (38%), Gaps = 18/402 (4%) Query: 105 RFELMVEEVARNASAVAQNTAAAKKSASDASTSAREAATHAADAADSARAASTSAGQAAS 164 F+ V V + A A A ++AS+A A A + A A A A A +A+ Sbjct: 334 EFKASVASVEKVAEAAKTGVEQASQTASEAKGIAETAKSTADSATAKAEQAQQEASEASR 393 Query: 165 SAQSASSSAGTASTK---------------ATEASKSAAAAESSKSAAATSAGAAKTSET 209 A A A A K AE + K T Sbjct: 394 LASEAKVVAEQALQADRQAVREGSESTKSLVEAVQKKTEEAERVAQDSKRVCEETKQLAT 453 Query: 210 NASASLQSAATSASTATTKASEAATSARDAAASKEAAKSSETNASSSASSAASSATAAGN 269 A + +A T A+ A KAS A T+ D E K AS +++ A ++ A Sbjct: 454 EAKNASTNALTEATAAKEKASRALTTVNDVKNISEEVKGLAEKASRASTEAQKTSDQALR 513 Query: 270 SAKAAKTSETNARSSETAAGQSASAAAGSKTAAASSASAASTSAGQASASATAAGKSAES 329 A +AKT+ A S+ T A SA A A + A + + + + AE+ Sbjct: 514 EATSAKTTADMASSTATDAKGSAEQAQTVSEEAKTLAQTSKNACDEIKQTIGDVKSVAEN 573 Query: 330 AASSASTATTKAGEATEQASAAARSASAAKTSETNAKASETSAESSKTAAASSASSAASS 389 A S+A+TA K E ++Q S + + AKT AK ++++ + A A+S Sbjct: 574 ALSTATTAKQKGDEISQQISESFTKSGEAKTLAEEAKRLASTSQETAEEAKVKAASVERI 633 Query: 390 ASSASASKDEATRQASAAKSSATTASTKATEAAGSATAAAQSKSTAESAATRAETAAKRA 449 A+ A+ + + + AK A+ A + A EA +A +A + AE A ETA + Sbjct: 634 ATEANQTASSSKSVSEEAKEEASKAKSIALEAKNTADSAT---AKAEQAKEETETAKQGL 690 Query: 450 EDIASAVALEDASTTKKGIVQLSSATNSTSETLAATPKAVKS 491 ++ S++ A+T + + A T KS Sbjct: 691 GEVKSSLDAVTATTNSASTAASEAKVLAEEAKSALTEVQEKS 732 Score = 99.2 bits (245), Expect = 9e-19, Method: Composition-based stats. 
Identities = 78/361 (21%), Positives = 137/361 (37%), Gaps = 1/361 (0%) Query: 144 HAADAADSARAASTSAGQAASSAQSASSSAGTASTKATEASKSAAAAESSKSAAATSAGA 203 +A + A A A A+ A + A ++AT+ ++ AA ++ S A +A Sbjct: 51 QVVNARQESEKAMARANGAQQVAEEAKRVSEQALSEATKTGEAVTAATTTASLAKDTADT 110 Query: 204 AKTSETNASASLQSAATSASTATTKASEAATSARDAAASKEAAKSSETNASSSASSAASS 263 AK A + +A A A+ S A + +A + Sbjct: 111 AKGLAEEAKNASDAAKHMAEETKAAVDRASGEINGTKGSLATALQRFEEVKQVSENAVNI 170 Query: 264 ATAAGNSAKAAKTSETNARSSETAAGQSASAAAGSKTAAASSASAASTSAGQASASATAA 323 +T A A +KT T A + A Q+A+ A ++ T A QAS A A Sbjct: 171 STEAKRLADESKTIATRAEQTAREASQTATETTQVAAKAVATCHEVKTVATQASLKADGA 230 Query: 324 GKSAESAASSASTATTKAGEATEQASAAARSASAAKTSETNAKASETSAESSKTAAASSA 383 ++A+ A A A + T + ++ S +T+ A A+ A+ + + Sbjct: 231 KQTADDAKDIAEKAKELSEGTTSSITELTKTTSQVQTAVEKALTDLRDAKQIAEASKTLS 290 Query: 384 SSAASSASSASASKDEATRQASAAKSSATTASTKATEAAGSATAAAQSKSTAESAATRAE 443 A +A+ A + +A Q+ A S + A + ++ G+ S ++ E A A+ Sbjct: 291 EEAKQTAADALTASKDAKSQSETAVSRSEEAKALSEQSKGACDEFKASVASVEKVAEAAK 350 Query: 444 TAAKRAEDIASAVALEDASTTKKGIVQLSSATNSTSETLAATPKAVKSAYDNAEKRLQKD 503 T ++A AS + + A + E A+ A A AE+ LQ D Sbjct: 351 TGVEQASQTASEAKGIAETAKSTADSATAKAEQAQQEASEASRLAS-EAKVVAEQALQAD 409 Query: 504 Q 504 + Sbjct: 410 R 410 >UniRef50_B2I4I4 Putative phage tail protein n=1 Tax=Enterobacteria phage EPS7 RepID=B2I4I4_9CAUD Length = 807 Score = 126 bits (317), Expect = 3e-27, Method: Composition-based stats. 
Identities = 129/377 (34%), Positives = 193/377 (51%), Gaps = 24/377 (6%) Query: 113 VARNASAVAQNTAAAKKSASDASTSAREAATHAADAADSARAASTSAGQAASSAQSASSS 172 + + + A A+ + TSA ++A A +A A + SA +A+SA+ + Sbjct: 65 AKESETNAKDSENLAAIYANSSETSATQSAASATEAERQAGLSKDSADASATSAEESKGF 124 Query: 173 AGTASTKATEASKSAAAAESSKSAAATSAGAAKTSETNASASLQSAATSASTATTKASEA 232 +A A A +S AE +K A + AA TSE NA+ S + A A A+EA Sbjct: 125 RDSAELAAQNAEQSRLLAEQAKKDAEAAKTAAATSEQNAATSATESTNQAIAAAGSATEA 184 Query: 233 ATSARDAAASKEAAKSSETNASSSASSAASSATAAGNSAKAAKTSETNARSSETAAGQSA 292 A A S+ AAK+SE NA +S + +A SA A+ SA + S + + +S T A +S+ Sbjct: 185 GEYATTAKDSEIAAKTSELNAKNSENESAISAEASEASASQSAISASQSAASATKAAESS 244 Query: 293 SAAAGSKTAAASSASAASTSAGQASASATAAGKSAESAASSASTATTK------------ 340 +AA S+T A S++AA TS A S T A S +AA+ A+ A T Sbjct: 245 AAAKISETTAIESSAAAKTSEINAKTSETNAKTSETNAAAYAAAAKTSETNAADSAASAS 304 Query: 341 ------------AGEATEQASAAARSASAAKTSETNAKASETSAESSKTAAASSASSAAS 388 A +A+ A AA S + KTSE N+KASE +A+ ++ +A+ SA++A Sbjct: 305 DSKGFRDEAEAFAAQASTSALAAKNSETNTKTSEINSKASEDAAKLAQQSASGSANTATQ 364 Query: 389 SASSASASKDEATRQASAAKSSATTASTKATEAAGSATAAAQSKSTAESAATRAETAAKR 448 + ++ +DEA + A ++ATTA+ KA EAAGSAT A + + A SAA RAETAA Sbjct: 365 AMTTTKGYRDEAEVFKNTATTAATTATDKALEAAGSATIAGEKATNATSAADRAETAAAS 424 Query: 449 AEDIASAVALEDASTTK 465 AE + A +D + Sbjct: 425 AEQVMQASLKKDQNLND 441 Score = 106 bits (265), Expect = 4e-21, Method: Composition-based stats. 
Identities = 139/448 (31%), Positives = 208/448 (46%), Gaps = 30/448 (6%) Query: 118 SAVAQNTAAAKKSASDASTSAREAATHAADAADSARAASTSAGQAASSAQSASSSAGTAS 177 S + + + ++ SA A A +S A S AA A S+ +SA ++ Sbjct: 35 SISSITASELTGAVEASAASAAAAKDSEIAAKESETNAKDSENLAAIYANSSETSATQSA 94 Query: 178 TKATEASKSAAAAESSKSAAATSAGAAKTSETNASASLQSAATSASTATTKASEAATSAR 237 ATEA + A ++ S A+ATSA +K +A + Q+A S A +A + Sbjct: 95 ASATEAERQAGLSKDSADASATSAEESKGFRDSAELAAQNAEQSRLLAEQAKKDAEAAKT 154 Query: 238 DAAASKEAAKSSETNASSSASSAASSATAAGNSAKAAKTSETNARSSETAAGQSASAAAG 297 AA S++ A +S T +++ A +AA SAT AG A AK SE A++SE A S + +A Sbjct: 155 AAATSEQNAATSATESTNQAIAAAGSATEAGEYATTAKDSEIAAKTSELNAKNSENESAI 214 Query: 298 SKTAAASSASAASTSAGQASASATAAGKSAESAASSASTATTKAGEATEQASAAARSASA 357 S A+ +SAS ++ SA Q++ASAT A +S+ +A S +TA E ++AA S Sbjct: 215 SAEASEASASQSAISASQSAASATKAAESSAAAKISETTAI-------ESSAAAKTSEIN 267 Query: 358 AKTSETNAKASETSAESSKTAAASSASSAASSAS----------SASASKDEATRQASAA 407 AKTSETNAK SET+A + AA +S ++AA SA+ A A +A+ A AA Sbjct: 268 AKTSETNAKTSETNAAAYAAAAKTSETNAADSAASASDSKGFRDEAEAFAAQASTSALAA 327 Query: 408 KSSATTASTKATEAAGS-------ATAAAQSKSTAESAATRAETAAKRAEDIASAVALED 460 K+S T T + S +A+ S +TA A T + AE + Sbjct: 328 KNSETNTKTSEINSKASEDAAKLAQQSASGSANTATQAMTTTKGYRDEAEVFKNTATTAA 387 Query: 461 ASTTKKGIVQLSSATNSTSETLAATP------KAVKSAYDNAEKRLQKDQNGADIPDKGC 514 + T K + SAT + + AT A SA + L+KDQN D+ +K Sbjct: 388 TTATDKALEAAGSATIAGEKATNATSAADRAETAAASAEQVMQASLKKDQNLNDLANKDL 447 Query: 515 FLNNINAVSKTDFADKRGMRYVRVNAPA 542 + + D+ Y PA Sbjct: 448 AREALKVEAVNSVKDQYAGAYNSFRNPA 475 Score = 76.9 bits (187), Expect = 5e-12, Method: Composition-based stats. 
Identities = 97/338 (28%), Positives = 148/338 (43%), Gaps = 22/338 (6%) Query: 199 TSAGAAKTSETNASASLQSAATSASTATTKASEAATSARDAAASKEAAKSSETNASSSAS 258 T+ A+K + ++ +AS T +A SA A S+ AAK SETNA S + Sbjct: 18 TTTTASKYPKYTVVLGTSISSITASELTGAVEASAASAAAAKDSEIAAKESETNAKDSEN 77 Query: 259 SAASSATAAGNSAKAAKTSETNARSSETAAGQSASAAAGSKTAAASSASAASTSAGQASA 318 AA A ++ SA + S T A + SA A+A S + +A +A A Sbjct: 78 LAAIYANSSETSATQSAASATEAERQAGLSKDSADASATSAEESKGFRDSAELAAQNAEQ 137 Query: 319 ---------------------SATAAGKSAESAASSASTATTKAGEATEQASAAARSASA 357 S A SA + + A A A EA E A+ A S A Sbjct: 138 SRLLAEQAKKDAEAAKTAAATSEQNAATSATESTNQAIAAAGSATEAGEYATTAKDSEIA 197 Query: 358 AKTSETNAKASETSAESSKTAAASSASSAASSASSASASKDEATRQASAAKSSATTASTK 417 AKTSE NAK SE + S A+ +SAS +A SAS ++AS +A ++AAK S TTA Sbjct: 198 AKTSELNAKNSENESAISAEASEASASQSAISASQSAASATKAAESSAAAKISETTAIES 257 Query: 418 ATEAAGSATAAAQSKSTAESAATRAETAAKRAE-DIASAVALEDASTTKKGIVQLSSATN 476 + A S A S++ A+++ T A A A+ +A +++ KG + A Sbjct: 258 SAAAKTSEINAKTSETNAKTSETNAAAYAAAAKTSETNAADSAASASDSKGFRDEAEAFA 317 Query: 477 STSETLAATPKAVKSAYDNAEKRLQKDQNGADIPDKGC 514 + + T A K ++ +E + ++ A + + Sbjct: 318 AQASTSALAAKNSETNTKTSEINSKASEDAAKLAQQSA 355 >UniRef50_C5H7L3 Putative tail fiber protein n=1 Tax=Enterobacteria phage WV8 RepID=C5H7L3_9CAUD Length = 848 Score = 125 bits (312), Expect = 1e-26, Method: Composition-based stats. 
Identities = 63/212 (29%), Positives = 93/212 (43%), Gaps = 35/212 (16%) Query: 887 EEDWAEVYTSKNLPPESYPVGAPIPWPSDTVPSGYALMQGQAFDKSAYPKLAAAYPSGVI 946 ++ AE + + YPVG + S+ P+ +A P L Y + + Sbjct: 632 DQKIAEAISDSTDLNKIYPVGIVTWFNSNVNPN------------TALPGLTWTYLNNGV 679 Query: 947 PDMRGWTIKGKPASGRAV--------LSQEQDGIKSHTHSASASSTDLGTKTTSSFDYGT 998 G TI+ A+G V ++ + SHTHS SA TTSSFDYGT Sbjct: 680 ----GRTIRIAAANGSDVATTGGSDSVTLSVGNLPSHTHSFSA--------TTSSFDYGT 727 Query: 999 KSTNNTGAHTHSVSGS--TNSAGAHTHSLANVNTASANSGAGS-ASTRLSVVHNQNYATS 1055 K+++ TG H H+ T S G ++ TAS GS A ++ +N Sbjct: 728 KTSSTTGNHNHNRGTMEITGSFGYFRSDASSFYTASGAFYLGSQAGSKGYTGNNFTNGIP 787 Query: 1056 SAGAHTHSLSGTAASAGAHAHTVGIGAHTHSV 1087 + + SG + G H+HTVGIGAH+H+V Sbjct: 788 VNFNASRNWSGVTNTTGNHSHTVGIGAHSHTV 819 >UniRef50_C9XHA4 Phage variable tail-fibre protein n=1 Tax=Salmonella enterica subsp. enterica serovar Typhimurium str. D23580 RepID=C9XHA4_SALTD Length = 697 Score = 123 bits (307), Expect = 5e-26, Method: Composition-based stats. Identities = 59/177 (33%), Positives = 89/177 (50%) Query: 362 ETNAKASETSAESSKTAAASSASSAASSASSASASKDEATRQASAAKSSATTASTKATEA 421 + K+ + K + S+A S S A+ +A + ++ TA T Sbjct: 158 AEHEKSRRHPDATLKEKGFTQLSNATDSESETLAATPKAVKTVYDLANAKYTAQDATTTR 217 Query: 422 AGSATAAAQSKSTAESAATRAETAAKRAEDIASAVALEDASTTKKGIVQLSSATNSTSET 481 G + + S +E+ A + + + +DA+T +KGI+QLS+AT+STSET Sbjct: 218 KGIVQLSNATDSVSETLAATPKAVKVAYDLANAKYTAQDATTARKGIIQLSNATDSTSET 277 Query: 482 LAATPKAVKSAYDNAEKRLQKDQNGADIPDKGCFLNNINAVSKTDFADKRGMRYVRV 538 LAATPKAVK+A DNA RL K+ NG DIPDK F I AV+ T+ + ++ Sbjct: 278 LAATPKAVKTAMDNANGRLAKNSNGGDIPDKKQFARTIGAVTSTNITFNDASGWYKI 334 >UniRef50_Q32D03 Putative uncharacterized protein n=2 Tax=root RepID=Q32D03_SHIDS Length = 90 Score = 122 bits (306), Expect = 8e-26, Method: Composition-based stats. 
Identities = 39/90 (43%), Positives = 51/90 (56%) Query: 1 MAVKISGVLKDGTGKPVQNCTIQLKAKRNSTTVVVNTLASENPDEAGRYSMDVEYGQYSV 60 M+V ISG L DG G P+ I LK++ N+ VV++T+A G Y G+Y V Sbjct: 1 MSVVISGALTDGAGIPMSGYHIILKSRVNTPEVVMHTVADVMTGNDGEYCFHARTGKYGV 60 Query: 61 ILLVEGFPPSHAGTITVYEDSQPGTLNDFL 90 L + + G I VYEDS+PGTLNDFL Sbjct: 61 YLKQDWRNEYNVGDIAVYEDSKPGTLNDFL 90 >UniRef50_UPI0001760AF3 PREDICTED: similar to nahoda CG12781-PA n=1 Tax=Danio rerio RepID=UPI0001760AF3 Length = 566 Score = 121 bits (303), Expect = 2e-25, Method: Composition-based stats. Identities = 73/443 (16%), Positives = 140/443 (31%), Gaps = 14/443 (3%) Query: 113 VARNASAVAQNTAAAKKSASDASTSAREAATHAADAADSARAASTSAG---QAASSAQSA 169 V ++ S+ ST ++ A D+A + +++ A A Sbjct: 44 VDSKEVKAPDTQNSSDLSSPIDSTLSQSAEVLGDDSAPAVEGKNSNKSVDVPTAPIANKD 103 Query: 170 SSSAGTASTKATEASKSAAAAESSKSAAATSAGAAKTSETNASASLQSAATSASTATTKA 229 + K +++ + + K +T+ + + Sbjct: 104 TEPQDNDPKKQKNNTENLKSDTGKQKNDLKLETDPKQPKTDLNQLETEPKQLETELKQPE 163 Query: 230 SEAATSARDAAASKEAAKSSETNASSSASSAASSATAAGNSAKAAKTSETNARSSETAAG 289 +E ++ K K SET S + S T S K SET + SET Sbjct: 164 TEPKQLETESKQPKTEPKQSETEPKQSETEPKQSETEPKQSETEPKQSETEPKQSETEPK 223 Query: 290 QSASAAAGSKTAAASSASAASTSAGQASASATAAGKSAESAASSASTATTKAGEATEQAS 349 QS + S+T S + S + S T +S S + E + + Sbjct: 224 QSETEPKQSETEPKQSETEPKQSETEPKQSETEPKQSETEPKQSDTEPKQSETEPKQSET 283 Query: 350 AAARSASAAKTSETNAKASETSAESSKTAAASSASSAASSASSASASKDEATRQ-ASAAK 408 +S + K SET K SET + S+T S + S + S+ E +Q + K Sbjct: 284 EPKQSETEPKQSETEPKQSETEPKQSETEPKQSETEPKQSETEPKQSETEPKQQLETEPK 343 Query: 409 SSATTASTKATEAAGSATAAAQSKSTAESAATRAETAAKRAEDIASAVALEDASTTKKGI 468 T S + + ++ + + + + D + Sbjct: 344 QQLETESKQPVTENKQLETENKQQNNDPVQ--KGDDSVQPKNDHKQQDNDPEHLKPSPAA 401 Query: 469 VQLSSATNSTSETLAATPKAVKSAYDNAEKRLQKDQNGADIPDKGCFLNNINAVSKTDFA 528 Q + T +E+ P ++ D + ++ D+ N++ Sbjct: 402 PQPQNFTEYIAESTFLGPDLGFGDDTGNDEDDDGDDDDDEVVDENDVSNDVEV------- 454 Query: 529 DKRGMRYVRVNAPAGATSGKYYP 551 K G + +++P + K YP Sbjct: 455 -KGGSSWDDISSPEDSQGKKNYP 476 
>UniRef50_UPI0001826514 putative tail fiber protein (GpH) n=2 Tax=Enterobacter cancerogenus ATCC 35316 RepID=UPI0001826514 Length = 719 Score = 121 bits (302), Expect = 2e-25, Method: Composition-based stats. Identities = 114/431 (26%), Positives = 177/431 (41%), Gaps = 43/431 (9%) Query: 362 ETNAKASETSAESSKTAAASSASSAASSASSASASKDEATRQASAAKSSATTASTKATEA 421 + ++ + + SSA SAS + A+ +A + A TA T Sbjct: 158 AEHEQSRRHPDATLTAKGFTQLSSATDSASESVAATPKAVKAAYDLAKGKYTAQDATTAQ 217 Query: 422 AGSATAAAQSKSTAESAATRAETAAKRAEDIASAVALEDASTTKKGIVQLSSATNSTSET 481 G ++ + ST+ES A + + +DA+T +KGIVQLSSAT+S SE Sbjct: 218 KGIVQLSSATDSTSESVAATPKAVKAAYDLAKGKYTAQDATTAQKGIVQLSSATDSASEA 277 Query: 482 LAATPKAVKSAYDNAEKRLQKDQN------GADIPDKGCFL---NNINAVSKTDFADKRG 532 LAATPKAVK+A DNA R+ + ADI + + + D Sbjct: 278 LAATPKAVKAANDNANGRVPSGRKINGRALSADISITAQDIFNGQTVGIGNAEDLNAYTT 337 Query: 533 MRYVRVNAPAGATSGKYYPVVVMRSAGSVSELASRVIITTATRTAGDPMNNCEFNGFVMP 592 A A A +GK YP + AGS+ E+ IT R N+ + + Sbjct: 338 PGLYYQPANAQAQTGKNYPEAM---AGSL-EVYKHAGITQVYRVYN---NSRSYIRTLYS 390 Query: 593 GGWTDRGRYAYGMFWQYQNNERAIHSIMMSNKGDDLRSVFYVDGAAFPVFAFIEDGLSIS 652 G W+ A+ + N A + G + V+G + +G Sbjct: 391 GTWS-----AWAKQYDAANKPTAGEVGALPVTGGTVTGNATVNGT-----LSVGNGRRFE 440 Query: 653 APGAD-LVVNDTTYKFGATNPATECIAADVILDFKSGRGFYESHSLIVNDNLS------- 704 + N + +G + T +L+FK G++ + ++S Sbjct: 441 ISSQNSSTANGSLLLWGNADRPT-------VLEFKDATGYHFYSQRNKDGSVSFSFNGVS 493 Query: 705 --CKKLFATDEIVARGGNQIRMIGGEYGALWRNDGAKTYLLLTNQGDVYGGWNTLRPFAI 762 + ++ E V+R N R+ G YGA WRNDG YLL+TN GD G +N+LRPF + Sbjct: 494 SFAGGITSSGEFVSRSANGFRIAYGSYGAFWRNDGGSLYLLVTNSGDSLGTFNSLRPFTV 553 Query: 763 DNATGELVIGT 773 ATG++ + Sbjct: 554 SLATGDITMNK 564 Score = 43.7 bits (101), Expect = 0.039, Method: Composition-based stats. 
Identities = 31/100 (31%), Positives = 45/100 (45%) Query: 400 ATRQASAAKSSATTASTKATEAAGSATAAAQSKSTAESAATRAETAAKRAEDIASAVALE 459 A A + A T S + + + + + + Sbjct: 108 AESYKPALAEGSGRAQTVRMVIMVSDIESVELTIDTSMVMATQDYVDDKLAEHEQSRRHP 167 Query: 460 DASTTKKGIVQLSSATNSTSETLAATPKAVKSAYDNAEKR 499 DA+ T KG QLSSAT+S SE++AATPKAVK+AYD A+ + Sbjct: 168 DATLTAKGFTQLSSATDSASESVAATPKAVKAAYDLAKGK 207 >UniRef50_Q7N0S6 Similarities with prophage tail fiber protein n=3 Tax=Photorhabdus RepID=Q7N0S6_PHOLL Length = 617 Score = 120 bits (301), Expect = 2e-25, Method: Composition-based stats. Identities = 64/179 (35%), Positives = 88/179 (49%), Gaps = 28/179 (15%) Query: 763 DNATGELVIGTKLSASLNGNA------------LTATKLQTPRRVSGVEFDGSKDITLTA 810 DNA L + + NA L + R+++G G DI+L A Sbjct: 130 DNANNRLAKNQNGADIPDKNAFVKNLGLAETANLAKNAVPNSRKINGKALTG--DISLNA 187 Query: 811 AHVAAFARRATDTYADADGGVPWNAESGAYNVTRSG-DSYILVNFYTGVGSCRTLQMKAH 869 V AF T ++ VPWNA +G Y++ R G DS + +F GVGSC Q+K Sbjct: 188 GDVGAFRLGLTGNNTVSN-QVPWNANTGLYDLLRPGIDSQHIAHFNNGVGSCPAFQLKVQ 246 Query: 870 YRNGGLFYRSSRDGYGFEEDWAEVYTSKNLPPES------------YPVGAPIPWPSDT 916 Y+N G+ YRS+RD YGFEEDW ++YT+KN P + Y V P+PW +DT Sbjct: 247 YKNSGIAYRSARDNYGFEEDWTDIYTTKNKPTAADVGAFRLGLAGGYSVNNPVPWNADT 305 Score = 110 bits (275), Expect = 3e-22, Method: Composition-based stats. Identities = 51/119 (42%), Positives = 68/119 (57%), Gaps = 6/119 (5%) Query: 790 QTPRRVSGVEFD----GSKDITLTAAHVAAFARRATDTYADADGGVPWNAESGAYNVTRS 845 ++ R G E D + TAA V AF Y + VPWNA++G Y++ R Sbjct: 255 RSARDNYGFEEDWTDIYTTKNKPTAADVGAFRLGLAGGY-SVNNPVPWNADTGLYDLLRP 313 Query: 846 G-DSYILVNFYTGVGSCRTLQMKAHYRNGGLFYRSSRDGYGFEEDWAEVYTSKNLPPES 903 G DS + +F G GSC Q+K YRNGG+ YRS+RD YGFEEDW ++YT+KN P + Sbjct: 314 GIDSQHIAHFNNGAGSCPAFQLKVQYRNGGIAYRSARDNYGFEEDWTDIYTTKNKPTPA 372 Score = 63.4 bits (152), Expect = 4e-08, Method: Composition-based stats. 
Identities = 38/102 (37%), Positives = 51/102 (50%), Gaps = 1/102 (0%) Query: 441 RAETAAKRAEDIASAVALEDASTTKKGIVQLSSATNSTSETLAATPKAVKSAYDNAEKRL 500 + T +A + A + AS T+KGIVQL + S +LA T K V DNA RL Sbjct: 78 KLTTQLNKALEKKIATEIPSASLTQKGIVQL-TDKTGNSNSLAVTQKLVSDVNDNANNRL 136 Query: 501 QKDQNGADIPDKGCFLNNINAVSKTDFADKRGMRYVRVNAPA 542 K+QNGADIPDK F+ N+ + A ++N A Sbjct: 137 AKNQNGADIPDKNAFVKNLGLAETANLAKNAVPNSRKINGKA 178 >UniRef50_D0Z3B3 Putative tail fiber protein n=1 Tax=Photobacterium damselae subsp. damselae CIP 102761 RepID=D0Z3B3_LISDA Length = 1008 Score = 115 bits (286), Expect = 2e-23, Method: Composition-based stats. Identities = 130/603 (21%), Positives = 225/603 (37%), Gaps = 80/603 (13%) Query: 3 VKISGVLKDGTGKPVQNCTIQLKAKRNSTTVVVNTLASENPDEAGRYSMDVEYGQYSVIL 62 + + G+L D + + IQ+ + S +V+ + D G YS + G Y +I Sbjct: 1 MLVKGILSDAADQCIPKGIIQIVSINTSESVLEGSTVWIKADNEGHYSFTLLPGSY-LIY 59 Query: 63 LVEGFP--PSHAGTITVYEDSQPGTLNDFLGAMTEDDARPEALRRFELMVEEVARNASAV 120 G + G V +D+ G+LN +G T P +++ + R+A Sbjct: 60 AQSGRQNDVVYLGETIVTDDTPDGSLNSIVGITT--PVLPPQVQQAVNAANKATRSAEDA 117 Query: 121 AQNTAAAKKSASDASTSAREAATHAADAADSARAASTSAGQAASSAQSASSSAGTASTKA 180 + A + S + T + +++ A + ++ Q + Sbjct: 118 NDKYQDLIELAKTVTNSINDLVTTVNHVENLSQSVEGYALASGNALQGSLRIKAEVGAVN 177 Query: 181 TEASKSAAAAESSKSAAATSAGAAKTSETNASASLQSAATSASTATTKASEAATSARDAA 240 S + A+T + AS+S Q A A++A A+ AA ++ Sbjct: 178 QSVRLVKDEILSIQKNIIHLKSDAETFSSKASSSAQKAQKQANSAVLSANNAAADSQKTF 237 Query: 241 ASKEA----------------------------------------------AKSSETNAS 254 A+ E N Sbjct: 238 RLMTVVEKYRDDVMGALDETHQSLEWLKFMQVQFDTKLQEMTLIDEHLSILAREIEKNKQ 297 Query: 255 SSASSAASSATAAGNSAKAAKTSETNARSSETAAG-----QSASAAAGSKTAAASSASA- 308 ++ +AGN+A +A +E A +E AA ++ A SK+ A S SA Sbjct: 298 KVEQLRLNARESAGNAATSALRAEHEANRAEVAASIDIVREAEKQAKSSKSEADKSLSAS 357 Query: 309 ---------ASTSAGQASASATAAGKSAESAASSASTATTKAGEATEQASAAARSASAAK 359 A A +A S +A SA++A ++ TA +A +TE+AS+AA SA +K Sbjct: 358 LVSVNAKNVAVAKANEAKQSELSATTSAQNAEQNSLTAKEQAKLSTEKASSAAISAKNSK 417 Query: 360 
TSE-------TNAKASETSAESSKTAAASSASSAASSASSASASKDEATRQASAAKSSAT 412 +SE + A + T+ + S A+S A +A SA++A D+A QAS A A Sbjct: 418 SSEKSALEAASEAALNSTATKQSANLASSHAVTAGESANTAEQKADDAANQASIATQQAG 477 Query: 413 TASTKATEAAGSATAAAQ----SKSTAESAATRAETAAKRAEDIASAVALEDASTTKKGI 468 A + A + S T AA + + A+ A+ A+ AA++A + V+L K + Sbjct: 478 IAKSNADASLNSQTLAANSVELASNQAKLASNSAKVAAEKAMIAINQVSLAQQEAAKSRV 537 Query: 469 VQLSSATNSTSETLAATPKAVKSAYDNAEKRLQKDQNGADIPDKGCFLNNINAVSKTDFA 528 +A ++ ++ K+ D A K A I F+ ++A A Sbjct: 538 NAGGAAQSAKDSERSSQTSIAKA--DVAAKNAGLSATHA-IEASQSFIRALSAEESAISA 594 Query: 529 DKR 531 KR Sbjct: 595 AKR 597 Score = 78.0 bits (190), Expect = 2e-12, Method: Composition-based stats. Identities = 67/230 (29%), Positives = 103/230 (44%), Gaps = 12/230 (5%) Query: 107 ELMVEEVARNASAVAQNTAAAKKSASDASTSAREAATHAADAADSARAASTSAGQAASSA 166 + + AS+ A + +K S A +A EAA ++ SA AS+ A A SA Sbjct: 396 KEQAKLSTEKASSAAISAKNSKSSEKSALEAASEAALNSTATKQSANLASSHAVTAGESA 455 Query: 167 QSASSSAGTASTKATEASKSAAAAESSKSAAATSAGAAKTSETNASASLQSAATSASTAT 226 +A A A+ +A+ A++ A A+S+ A+ S A S AS + A+ SA A Sbjct: 456 NTAEQKADDAANQASIATQQAGIAKSNADASLNSQTLAANSVELASNQAKLASNSAKVAA 515 Query: 227 TKA-----------SEAATSARDAAASKEAAKSSETNASSSASSAASSATAAGNSAKAAK 275 KA EAA S +A + ++AK SE ++ +S + A +A AG SA A Sbjct: 516 EKAMIAINQVSLAQQEAAKSRVNAGGAAQSAKDSERSSQTSIAKADVAAKNAGLSATHA- 574 Query: 276 TSETNARSSETAAGQSASAAAGSKTAAASSASAASTSAGQASASATAAGK 325 + + +A +SA +AA A +S S A G S A + Sbjct: 575 IEASQSFIRALSAEESAISAAKRAETAVASLSGAMIEQGGVDLSKGVAPQ 624 >UniRef50_D2BT14 Tail Collar domain protein n=1 Tax=Dickeya dadantii Ech586 RepID=D2BT14_DICD5 Length = 534 Score = 108 bits (269), Expect = 2e-21, Method: Composition-based stats. 
Identities = 70/275 (25%), Positives = 101/275 (36%), Gaps = 84/275 (30%) Query: 851 LVNFYTGVGSCRTLQMKAHYRNGGLFYRSSRDGYGFEEDWAEVYTSKNLPPESYPVGAPI 910 +V G G Q+ + GL R D F+ W + + + VG P+ Sbjct: 339 VVRAGGGNGMADGHQISLGWTGSGL--RVQVDATSFDL-WHK--DNVFPIHAAEIVGIPL 393 Query: 911 PWPSDTVPSGYALMQGQAFDKSAYPKLAAAYPSGVIPDMRGWTIKGKPAS-----GRAVL 965 P+P T P G+ GQ+F+K+A+P LA YPSG +PD+RG I+G S GR +L Sbjct: 394 PYPGATAPDGWLKCNGQSFNKAAFPLLAQRYPSGFLPDLRGEFIRGWDDSRGVDPGRGLL 453 Query: 966 SQEQDGIKSHTHSASASSTDLGTKTTSSFDYGTKSTNNTGAHTHSVSGSTNSAGAHTHSL 1025 S ++ +H+H N H+H + S G Sbjct: 454 SFQESQNLTHSHGV-----------------------NDPGHSHPYNKYEGSVG------ 484 Query: 1026 ANVNTASANSGAGSASTRLSVVHNQNYATSSAGAHTHSLSGTAASAGAHAHTVGIGAHTH 1085 S + +Y + A TV G H Sbjct: 485 -------------------SGLAGFDYDQDAWNA-----------------TVYTG---H 505 Query: 1086 SVAIGSHGHTITVNAAGNAENTVKNIAFNYIVRLA 1120 G I++ A+G E +NIAFNYIVR A Sbjct: 506 V------GTGISIAASGGHEARPRNIAFNYIVRAA 534 >UniRef50_C7BL21 Similarities with phage tail fiber protein n=1 Tax=Photorhabdus asymbiotica subsp. asymbiotica ATCC 43949 RepID=C7BL21_PHOAA Length = 452 Score = 108 bits (268), Expect = 2e-21, Method: Composition-based stats. 
Identities = 71/242 (29%), Positives = 101/242 (41%), Gaps = 30/242 (12%) Query: 763 DNATGELVIGTKLSASLNGNALTAT------------KLQTPRRVSGVEFDGSKDITLTA 810 DNA L + + NA + + R+++G G D++L+A Sbjct: 143 DNANSRLAKNQNGADIPDKNAFVKNLGLSETVNKANNAVPSSRKINGKALSG--DVSLSA 200 Query: 811 AHVAAFARRATDTYADADGGVPWNAESGAYNVTRSGDSYI--------LVNFYTGVGSCR 862 V A + T ++ + SG V + + + F Sbjct: 201 GDVGAISVNPLSTLTESKKFQNF-LSSGFILVNVPNTATVNDFPFPTRVYGFGILEVRSS 259 Query: 863 TLQMKAHYRN--GGLFYRSSRDGYGFEEDWAEVYTSKNLPPESYPVGAPIPWPSDTVPSG 920 + + Y + G + R S + W VY+S LPPE +PVGAPIP+P P G Sbjct: 260 GVVIYQKYTSHHGEVVIRQSWNSGKTWIGWDIVYSSAILPPEQHPVGAPIPYPHRYTPVG 319 Query: 921 YALMQGQAFDKSAYPKLAAAYPSGVIPDMRGWTIKGKPAS-----GRAVLSQEQDGIKSH 975 Y GQ FDKS YPKLA AYPSG +PD+RG I+G S GR S + K+H Sbjct: 320 YLTCNGQTFDKSLYPKLAEAYPSGRVPDLRGEFIRGWDDSRGVDPGRVCGSWQDSDNKAH 379 Query: 976 TH 977 H Sbjct: 380 IH 381 Score = 54.9 bits (130), Expect = 2e-05, Method: Composition-based stats. Identities = 34/128 (26%), Positives = 53/128 (41%), Gaps = 7/128 (5%) Query: 446 AKRAEDIASAVALEDASTTKKGIVQLSSATNSTSETLAATPKAVKSAYDNAEKRLQKDQN 505 ++ D + + + L + + AT A+K+ DNA RL K+QN Sbjct: 95 REQQPDHSQSAWKPMSDFIGAPKSVLDALNTKQDKGDYATNSALKTVNDNANSRLAKNQN 154 Query: 506 GADIPDKGCFLNNINAVSKTDFADKRGMRYVRVNAPAGATSGKYYPVVVMRSAGSVSELA 565 GADIPDK F+ N+ + A+ ++N A SG V SAG V ++ Sbjct: 155 GADIPDKNAFVKNLGLSETVNKANNAVPSSRKING--KALSGD-----VSLSAGDVGAIS 207 Query: 566 SRVIITTA 573 + T Sbjct: 208 VNPLSTLT 215 >UniRef50_D1RZD4 Putative uncharacterized protein n=1 Tax=Serratia odorifera 4Rx13 RepID=D1RZD4_SEROD Length = 759 Score = 108 bits (268), Expect = 2e-21, Method: Composition-based stats. 
Identities = 44/109 (40%), Positives = 58/109 (53%) Query: 1 MAVKISGVLKDGTGKPVQNCTIQLKAKRNSTTVVVNTLASENPDEAGRYSMDVEYGQYSV 60 MAV ISG L G P TI L A + S+ VV +S G YS+ VE G ++V Sbjct: 1 MAVLISGKLIGPNGDPRPGVTIMLTAVKTSSAVVHLAPSSSTTGADGSYSLSVEVGTHNV 60 Query: 61 ILLVEGFPPSHAGTITVYEDSQPGTLNDFLGAMTEDDARPEALRRFELM 109 ++ G P G ITVY DS+PGTLNDFL + +D+ P + + M Sbjct: 61 MIEAYGRPFEKVGQITVYSDSKPGTLNDFLTSPGQDELTPAIVAIVDDM 109 >UniRef50_B6W6V0 Putative uncharacterized protein n=2 Tax=Anaerococcus hydrogenalis DSM 7454 RepID=B6W6V0_9FIRM Length = 684 Score = 106 bits (265), Expect = 4e-21, Method: Composition-based stats. Identities = 101/421 (23%), Positives = 170/421 (40%), Gaps = 31/421 (7%) Query: 88 DFLGAMTEDDARPEALR-RFELMVEEVARNASAVAQNT---AAAKKSASDASTSAREAAT 143 D + T++D + E L+ E + +++ + + + AKKSA + + ++ + Sbjct: 289 DLVTVKTDNDNQYEDLKKEIEELKKQIKEKEAVIKRLQSELEEAKKSAQKSKDALEKSKS 348 Query: 144 HAADAADSARAASTSAGQAASSAQSASSSAGTASTKATEASKSAAAAESSKSAAATSAGA 203 A A + A AA A +AA + ++A A A +A EA K A+ ++ K A Sbjct: 349 EAQAANEKANAAQEKAKKAAENLEAAQKDAEEAKRQAEEAKKDASLSKQEKKEAEEKVIE 408 Query: 204 AKTSETNASASLQSAATSASTATTKASEAATSARDAAASKEAAKSSETNASSSASSAASS 263 A+T E A + A +DA E AK N +A+ A + Sbjct: 409 AQTKEKAARLEAEKAK-----------------KDAERKIEEAKDQANNLVEAANQAKEN 451 Query: 264 ATAAGNSAKAAKTSETNARSSETAAGQSASAAAGSKTAAASSASAASTSAGQASASATAA 323 A A K AK SE A+ +S A+ K A A T QA+ Sbjct: 452 AEKA---EKLAKESEQAAKDKVKELEKSNQASQAEKEKAQKDLEKAQTDLEQANKEKEKV 508 Query: 324 GKSAESAASSASTATTKAGEATEQASAAARSASAAKTSETNAKASETSAESSKTAAASSA 383 K A+ A + A A +A E + ++ + A+ + A + A Sbjct: 509 EKEAKDAKAQAQEAKESLEKAQENIENLKKEKENLTKEKSEIENQLAEAKEAAKKAQAEA 568 Query: 384 SSAASSASSASASKDEATRQASAAKSSATTASTKATEAAGSATAAAQSKSTAESAATRAE 443 S A A+ A S +EA ++A A+ A A T+A EA T + + K+ A++ A AE Sbjct: 569 SEANKKANKAQTSAEEANKKAQQAQEEANKAKTEAEEAKADKTKSEEEKAKAQAKAEEAE 628 Query: 444 TAAKRA-EDIASAVALEDASTTKKGIVQLSSATNSTSETLAATPKAVKSAYDNAEKRLQK 502 ++A E+IA A S + Q ++ T +E A + V K++++ Sbjct: 629 
---RKAQEEIAKAKEEAQKSVKE---AQTAAETAKEAEVKAKEAQKVAEENLENLKKIKR 682 Query: 503 D 503 Sbjct: 683 G 683 >UniRef50_D0FSD9 Phage related-protein n=2 Tax=Erwinia pyrifoliae RepID=D0FSD9_ERWPY Length = 311 Score = 106 bits (264), Expect = 5e-21, Method: Composition-based stats. Identities = 60/229 (26%), Positives = 84/229 (36%), Gaps = 47/229 (20%) Query: 888 EDWAEVYTSKNLPPESYPVGAPIPWPSDTVPSGYALMQGQAFDKSAYPKLAAAYPSGVIP 947 W E N YP+G + P+ L G + + + Sbjct: 113 SGWVEFKADVNPVDMLYPIGIVTWFAQKKDPN--KLFPGTTWKYIGENRTIRLASANGSD 170 Query: 948 DMRGWTIKGKPASGRAVLSQEQDGIKSHTHSASAS--STDLGTKTTSSFDYGTKSTNNTG 1005 M G ++ I +H H+ SA+ S D GTK TS+FDYG K T+ G Sbjct: 171 VM--------TTGGSDSVTLAVGNIPAHGHTFSANTGSFDYGTKGTSTFDYGNKVTDTQG 222 Query: 1006 AHTHSVSGSTNSAGAHTHSLANVNTASANSGAGSASTRLSVVHNQNYATSSAGAHTHSLS 1065 +HTHS ++ AS G T + Sbjct: 223 SHTHS------------YNEVIPRGASGMDIGGIWETTIRGSDT---------------- 254 Query: 1066 GTAASAGAHAHTVGIGAHTHSVAIGSHGHTITVNAAGNAENTVKNIAFN 1114 ++AGAHAH V IGAH H+V IG+H H++ +G NT A N Sbjct: 255 ---STAGAHAHNVAIGAHGHTVEIGAHSHSV----SGTTANTGAGTAIN 296 >UniRef50_C6CGA0 Tail Collar domain protein n=5 Tax=Enterobacteriaceae RepID=C6CGA0_DICZE Length = 166 Score = 105 bits (262), Expect = 1e-20, Method: Composition-based stats. Identities = 42/106 (39%), Positives = 63/106 (59%), Gaps = 5/106 (4%) Query: 906 VGAPIPWPSDTVPSGYALMQGQAFDKSAYPKLAAAYPSGVIPDMRGWTIKGKP-----AS 960 VG P+PWP T P+G+ GQAFDK+A+PKLA YPSGV+PD+RG I+G S Sbjct: 23 VGIPLPWPQATPPAGWLKCNGQAFDKNAFPKLAQVYPSGVLPDLRGEFIRGWDDGRGVDS 82 Query: 961 GRAVLSQEQDGIKSHTHSASASSTDLGTKTTSSFDYGTKSTNNTGA 1006 R +LS + D I++ T S + + +D G++++ + G+ Sbjct: 83 NRNLLSSQGDAIRNITGFVSGVYVGFDGYSGAFYDTGSRNSISPGS 128 >UniRef50_B6S308 ORF32 (Fragment) n=3 Tax=Enterobacteriaceae RepID=B6S308_SALDU Length = 427 Score = 105 bits (261), Expect = 1e-20, Method: Composition-based stats. 
Identities = 74/231 (32%), Positives = 100/231 (43%), Gaps = 31/231 (13%) Query: 736 DGAKTYLLLTNQGDVYGGWNTLR-----PFAIDNATGE--LVIGTKLSASLNGNALTATK 788 D KTY + N ++Y G L+ D A G L + +K + N A + Sbjct: 207 DNTKTYFSVLNPLEIYLGSRYLQKDQNLSDVPDKAKGRSSLEVYSKTESDENYMAKSQCG 266 Query: 789 LQTPRRVSGVEFDGSKDITLTAAHVAAFARRA-----TDTYADADGGV--------PWNA 835 P + V+ G+ + TA A R T +D G+ + Sbjct: 267 ADIPNKPLFVQNIGALPASGTAVAANRLASRGALPALTGATRGSDSGLIMGEVYNNGYPT 326 Query: 836 ESGAYNVTRSGDSYILVNFYTGVGSCRTLQMKAHYRNGGLFYRSSRDGYGFE-EDWAEVY 894 + G ++ ++G + RS RD E +WA +Y Sbjct: 327 QYGNILRLTGTGDGEILIGWSGTNGAPAPA----------YIRSHRDTADAEWSEWAMLY 376 Query: 895 TSKNLPPESYPVGAPIPWPSDTVPSGYALMQGQAFDKSAYPKLAAAYPSGV 945 TS N PP SYPVGA I WPSD P+GYALMQGQ+FDKSAYP LA AYPSG+ Sbjct: 377 TSLNPPPNSYPVGAAIAWPSDATPAGYALMQGQSFDKSAYPLLAIAYPSGI 427 >UniRef50_A9IY35 Putative uncharacterized protein n=1 Tax=Bartonella tribocorum CIP 105476 RepID=A9IY35_BART1 Length = 1347 Score = 104 bits (258), Expect = 2e-20, Method: Composition-based stats. 
Identities = 92/394 (23%), Positives = 156/394 (39%), Gaps = 11/394 (2%) Query: 82 QPGTLNDFLGAMTEDDARPEALRRFELMVEEVARNASAVAQNTAAAKKSASDASTSAREA 141 P +++L + + E + + ++V ++ AS + T A + +A + +A Sbjct: 3 IPDHSHEYLIPVATQEEIREGISQDTVVVPKLLGTASLYSYETFAPLEQVVNARQESEKA 62 Query: 142 ATHAADAADSARAASTSAGQAASSAQSASSSAGTASTKATEASKSAAAAESSKSAAATSA 201 A A A + + QA S +A TA T + EA + AA ++ S A +A Sbjct: 63 MARANGAQQVAEESKRVSEQALSEVTKTVQTATTALTTSHEAKEEVKAATTTASLAKDTA 122 Query: 202 GAAKTSETNASASLQSAATSASTATTKASEAATSARDAAASKEAAKSSETNASSSASSAA 261 AK A + +A A A A+ S + A S + +A Sbjct: 123 DTAKGLAEEAKNASDAAKHMAEEAKAAVDRASGEINSTKGSLDTALQSFEEVKQVSENAV 182 Query: 262 SSATAAGNSAKAAKTSETNARSSETAAGQSASAAAGSKTAA-------ASSASAASTSAG 314 + +T A A +K T A + A Q+A+ A + A AS A Sbjct: 183 NISTEAKRLADESKIIATRAEQTAREASQTANETTQVSATAVATCHEVKTVAMQASLKAA 242 Query: 315 QASASATAAGKSAESAASSASTATTKAGEATEQASAAARSASAAKTSETNAKASETSAES 374 A +A A AE A + AT E T+ S +S A T AK A+ Sbjct: 243 GAKQTADDAKDMAEKAKGLSERATDSVTELTKTVSQVEKSVETALTEVREAKEKSEEAKG 302 Query: 375 SKTAAASSASSAASSASSASASKDEATRQASAAKSSATTASTKATEAAGSATAAA----Q 430 + A +AS A + +A ++AT + A+ ++ A ++A A +A +A+ Sbjct: 303 AGEQALQTASEAKGISETAKGLSEQATTVSREAQKTSEQALSEARAAKSTADSASNTAMD 362 Query: 431 SKSTAESAATRAETAAKRAEDIASAVALEDASTT 464 +K +AE A T +E A K +D +A + + Sbjct: 363 AKGSAEEAKTLSEEAKKLVQDNKNAFDESNKTLE 396 >UniRef50_Q19CF5 Gp36 small distal tail fiber subunit n=1 Tax=Aeromonas phage 25 RepID=Q19CF5_9CAUD Length = 1305 Score = 102 bits (254), Expect = 6e-20, Method: Composition-based stats. 
Identities = 72/273 (26%), Positives = 110/273 (40%), Gaps = 20/273 (7%) Query: 394 SASKDEATRQASAAKSSATTASTKATEAAGSATAAAQSKSTAESAATRAETAAKRAEDIA 453 SA A R+ T A A A +A +A++ ++ + T T + + A Sbjct: 307 SAGTKLANRKIYTEADKPTPAEIGALAAGANAVSASKLQTARTISLTGGATGSVSFDGSA 366 Query: 454 SAVALEDASTTKKGIVQLSSATNSTSETLAATPKAVKSAYDNAEKRLQKDQNGADIPDKG 513 +A + T T +TPK + + AE ++ Sbjct: 367 NASIAVTITNNSHTHSDYVKKTGDTMSGNLSTPKVLLTDAQGAEANSVTRRD-------- 418 Query: 514 CFLNNINAVSKTDFADKRGMRYVRVNAPAGATSGKYYPVVVMRSAGSVSELASRVIITTA 573 V T A +G++ + + PAGA + Y PV+ + S S V I T+ Sbjct: 419 -------FVESTVTAAGKGVKDLTITLPAGAPATGYIPVMFRTNGTS----DSFVFIDTS 467 Query: 574 TRTAGDPMNNCEFNGFVMPGGWTDRGRYAYGMFWQYQNNERAIHSIMMSNKGDDLRSVFY 633 PMNNC F G V GW+D YAYG F Y +ERAIHSI D V Y Sbjct: 468 FNVGEHPMNNCSFIGNVRASGWSDGRSYAYGKFTIYSTSERAIHSIHAPF-EDSFAYVVY 526 Query: 634 VDGAAFPVFAFIEDGLSISAPGADLVVNDTTYK 666 V+ AFP+ ++ G +++A D+ + +K Sbjct: 527 VETRAFPITVRVDIGTTVTAHATDVTYGTSVFK 559 >UniRef50_C3L3V8 Putative uncharacterized protein n=2 Tax=Candidatus Amoebophilus asiaticus 5a2 RepID=C3L3V8_AMOA5 Length = 891 Score = 102 bits (253), Expect = 1e-19, Method: Composition-based stats. 
Identities = 77/322 (23%), Positives = 118/322 (36%), Gaps = 3/322 (0%) Query: 113 VARNASAVAQNTAAAKKSASDASTSAREAATHAADAADSARAASTSAGQAASSAQSASSS 172 + +A + A++ A A A A+ A A + A A A A +AS Sbjct: 497 ASGDAKTASDEAKKAQQQAKAARDQANTASEEAKAARNEAEKVQQQAEAARDQANTASEE 556 Query: 173 AGTASTKATEASKSAAAAESSKSAAATSAGAAKTSETNASASLQSAATSASTATTKASEA 232 A A +A +A + A AA + A+ A A A ++A A+TA+ + A Sbjct: 557 AKAARNEAEKAQQQAEAARDQSNTASGDAKTASDEAKKAQQQAEAARDQANTASEETKAA 616 Query: 233 ATSARDAAASKEAAKSSETNASSSASSAASSATAAGNSAKAAKTSETNARSSETAAGQSA 292 A A EAA+ AS A A + A K AK A ++ T A ++ Sbjct: 617 RNEAEKAQQQAEAARDQANTASEEAIKAQEATEKAT---KQAKDDAETATNAATQANTAS 673 Query: 293 SAAAGSKTAAASSASAASTSAGQASASATAAGKSAESAASSASTATTKAGEATEQASAAA 352 A ++ A + AA A +A K A+ A E + + Sbjct: 674 EEAKTARNEAIEAQQAAEKEATKAMKQVEQIKKKAQEKAQQKQAKKLAKEETARKKAEQE 733 Query: 353 RSASAAKTSETNAKASETSAESSKTAAASSASSAASSASSASASKDEATRQASAAKSSAT 412 K ++ AK E + + +K A A A A D A + + A A Sbjct: 734 AIEEDKKQADLVAKVKEEAIKVAKEAVKKQVEDATEQAKEAKKQADLAIKAKAGAIEEAE 793 Query: 413 TASTKATEAAGSATAAAQSKST 434 A+T+A A AT AA + Sbjct: 794 KAATQAKVHAEIATNAAAEQVN 815 Score = 51.5 bits (121), Expect = 2e-04, Method: Composition-based stats. 
Identities = 51/240 (21%), Positives = 90/240 (37%), Gaps = 1/240 (0%) Query: 275 KTSETNARSSETAAGQSASAAAGSKTAAASSASAASTSAGQASASATAAGKSAESAASSA 334 K A+S + K + QA A AA A +A+ A Sbjct: 218 KEIRQAAKSHTDTSSIEQKIEGLQKAIDKVKEELTEFISAQAQQQAKAAKDQASTASGDA 277 Query: 335 STATTKAGEATEQASAAARSASAAKTSETNAKASETSAESSKTAAASSASSAASSASSAS 394 TA+ +A +A +QA AA A+ A A+ A+ A A++A+ A +A Sbjct: 278 KTASDEAEKAQQQAEAARDQANTASEEAKAARNEAEKAQQQAETARDQANTASKEAKTAR 337 Query: 395 ASKDEATRQASAAKSSATTASTKATEAAGSATAAAQSKSTAESAATRAETAAKRAEDIAS 454 A A+ + A +A A A++ TA A +A+ ++A+ A Sbjct: 338 NEAINAQEAIEKAQQQVKSNVEAANKAVEQANTASKEAKTASDEAIKAQEVTEKAQQQAE 397 Query: 455 AVALEDASTTKKGIVQLSSATNSTSETLAATPKAVKSAYDNAEKRLQKDQNGADIPDKGC 514 A A + A+T +++ AT + A + +A A+ + ++ +K Sbjct: 398 A-ARDQANTANDKVIKAQEATEKAQQQAKAAKEQASTASKEAKTASDEAIKAQEVTEKAQ 456 >UniRef50_D1TPQ4 Phage tail collar domain protein n=1 Tax=Yersinia pestis KIM D27 RepID=D1TPQ4_YERPE Length = 262 Score = 101 bits (250), Expect = 2e-19, Method: Composition-based stats. Identities = 36/77 (46%), Positives = 49/77 (63%), Gaps = 5/77 (6%) Query: 905 PVGAPIPWPSDTVPSGYALMQGQAFDKSAYPKLAAAYPSGVIPDMRGWTIKGKP-----A 959 PVG P+PWP+ T P G+ G FDK YPKLA AYPSG++PD+RG I+G Sbjct: 105 PVGVPLPWPTATPPEGWLKCNGAIFDKVKYPKLALAYPSGILPDLRGEFIRGWDDGLGVD 164 Query: 960 SGRAVLSQEQDGIKSHT 976 +GR +LS + D I++ + Sbjct: 165 AGREILSIQGDAIRNIS 181 >UniRef50_D2U1K0 Phage tail protein n=1 Tax=Arsenophonus nasoniae RepID=D2U1K0_9ENTR Length = 366 Score = 101 bits (250), Expect = 2e-19, Method: Composition-based stats. 
Identities = 40/80 (50%), Positives = 51/80 (63%), Gaps = 5/80 (6%) Query: 902 ESYPVGAPIPWPSDTVPSGYALMQGQAFDKSAYPKLAAAYPSGVIPDMRGWTIKGKPAS- 960 +YPVGAPIPWP T P GY + G+ FDK PKL AYPSG +PD+RG+ I+G A Sbjct: 216 NNYPVGAPIPWPQATPPKGYLICNGEPFDKVKCPKLLIAYPSGKLPDLRGYFIRGWDAGK 275 Query: 961 ----GRAVLSQEQDGIKSHT 976 GR V S ++D I++ T Sbjct: 276 GVDPGREVFSYQEDAIRNIT 295 >UniRef50_A4WEL3 Phage Tail Collar domain protein n=2 Tax=Enterobacteriaceae RepID=A4WEL3_ENT38 Length = 340 Score = 99 bits (247), Expect = 4e-19, Method: Composition-based stats. Identities = 77/289 (26%), Positives = 111/289 (38%), Gaps = 41/289 (14%) Query: 738 AKTYLLLTNQG-------DVYGGWNTLRPFAIDNATGELVIGTKLSASLNGNALTATKLQ 790 AK + +LTNQG G L A+ + G L L K Sbjct: 3 AKYFAILTNQGAAKLANATALGTKLNLTQLAVGDGNGFLPTPDPAQTRL-----INQKRI 57 Query: 791 TPRRVSGVEFDGSKDI---TLTAAHVAAFARRATDTYADADG---GVPWNAESGAYNVTR 844 P + V+ + S I + + F R Y D DG V E+ + Sbjct: 58 APLNMLSVDPNNSSQIIAEQIIPENEGGFWIREIGLY-DDDGVLIAVANCPETYKPQLQE 116 Query: 845 SGDSYILVNFYTGVGSCRTLQMKAHYRNGGLFYRSSRDG------YGFEEDWAEVYTSKN 898 + V + + +K + L R DG + A + N Sbjct: 117 GSGRTQTIRMILIVSATSAITLKID-PSVVLATRRFVDGKVTEVKMYADSVLAAHVDAAN 175 Query: 899 LPP--------ESY-PVGAPIPWPSDTVPSGYALMQGQAFDKSAYPKLAAAYPSGVIPDM 949 P ++Y PVG P+PWP T P G+ G FDK YPKLA AYPSG++PD+ Sbjct: 176 PHPQYLKTAEIDNYLPVGFPLPWPQATPPQGWLKCNGAPFDKVKYPKLAVAYPSGLLPDL 235 Query: 950 RGWTIKGKP-----ASGRAVLSQEQDGIKSHTHSAS-ASSTDLGTKTTS 992 RG I+G SGR L+ + D ++ T +AS ++T +TS Sbjct: 236 RGEFIRGWDDGRGVDSGRVALTTQGDAVQKMTGAASNGAATGFVNNSTS 284 >UniRef50_UPI00016C0A2F hypothetical protein Epulo_08463 n=1 Tax=Epulopiscium sp. 'N.t. morphotype B' RepID=UPI00016C0A2F Length = 926 Score = 99.6 bits (246), Expect = 6e-19, Method: Composition-based stats. 
Identities = 126/463 (27%), Positives = 199/463 (42%) Query: 111 EEVARNASAVAQNTAAAKKSASDASTSAREAATHAADAADSARAASTSAGQAASSAQSAS 170 +V A+ V K S +D A +A + DA A A +S A SS A Sbjct: 328 ADVKYMAANVKSTAVDVKSSKADVKYMAADAKSSKVDAKYMAADAKSSRADAKSSKVDAK 387 Query: 171 SSAGTASTKATEASKSAAAAESSKSAAATSAGAAKTSETNASASLQSAATSASTATTKAS 230 SS A + A S A A+S+ + A +S AK++ NA + A +S + A + A+ Sbjct: 388 SSRADAKSTAANVKSSKADAKSTAADAKSSRANAKSTAANAKSMAADAKSSRADAKSMAA 447 Query: 231 EAATSARDAAASKEAAKSSETNASSSASSAASSATAAGNSAKAAKTSETNARSSETAAGQ 290 +A ++A DA + AKS+ +A+ S + A S A ++A AK++ NA+SS A Sbjct: 448 DAKSTAVDAKSMAADAKSTAADATFSKADATFSRADAKSTAVDAKSTAANAKSSRADAKY 507 Query: 291 SASAAAGSKTAAASSASAASTSAGQASASATAAGKSAESAASSASTATTKAGEATEQASA 350 SA+ A + A SS + A +S A + A A + A S AS A + + A+ Sbjct: 508 SAADAKSTAANAKSSKADAKSSRADAKSMAVDAKSTRADAKSMASDAKSSRADVKYMAAD 567 Query: 351 AARSASAAKTSETNAKASETSAESSKTAAASSASSAASSASSASASKDEATRQASAAKSS 410 A S + AK++ N K+S A+SS+ A S + A S+A+ A ++ +A + AKS+ Sbjct: 568 ATFSKADAKSTAANVKSSRADAKSSRADATFSKADAKSTAADAKSTAVDAKSSRADAKST 627 Query: 411 ATTASTKATEAAGSATAAAQSKSTAESAATRAETAAKRAEDIASAVALEDASTTKKGIVQ 470 A A + A +A S A SK+ A S A A+++ A+ A+ V A Sbjct: 628 AADAXSTAADAKSSRADATFSKADARSTAADAKSSTADAKSTAANVKSSKADAKSTAADA 687 Query: 471 LSSATNSTSETLAATPKAVKSAYDNAEKRLQKDQNGADIPDKGCFLNNINAVSKTDFADK 530 S A ++ S AT + A + K + D N+ + Sbjct: 688 KSMAADAKSSRADATFSKADAKSSKANAKSSKADAKSMAADAKSMAANVKSSKADVKYMA 747 Query: 531 RGMRYVRVNAPAGATSGKYYPVVVMRSAGSVSELASRVIITTA 573 + NA + A + K V S A+ T A Sbjct: 748 ADAKSTAANAKSTAVNVKSTAVDAKSSKADAKSTAADAKSTAA 790 Score = 96.1 bits (237), Expect = 7e-18, Method: Composition-based stats. 
Identities = 115/388 (29%), Positives = 179/388 (46%) Query: 112 EVARNASAVAQNTAAAKKSASDASTSAREAATHAADAADSARAASTSAGQAASSAQSASS 171 + A AK S +DA A +A AADA +A A + A A S A A S Sbjct: 119 DAKSTAVDAKSMAVDAKSSRADAKYMAADAKYMAADAKSTAANAKSMAADAKSMAVDAKS 178 Query: 172 SAGTASTKATEASKSAAAAESSKSAAATSAGAAKTSETNASASLQSAATSASTATTKASE 231 +A A + A +AA +SSK+ A +S AK+S+ +A + A +S + A A++ Sbjct: 179 TAVDAKSSKANAKSTAANVKSSKADAKSSRANAKSSKADAKSMAVDAKSSRADAKYTAAD 238 Query: 232 AATSARDAAASKEAAKSSETNASSSASSAASSATAAGNSAKAAKTSETNARSSETAAGQS 291 A ++A +A ++ AKS+ N SS ++A S A +S K NA+S+ A S Sbjct: 239 AKSTAANAKSTPATAKSTPANVKSSRANAKFSKADAKSSRADVKYMAANAKSTAANAKSS 298 Query: 292 ASAAAGSKTAAASSASAASTSAGQASASATAAGKSAESAASSASTATTKAGEATEQASAA 351 + A SK A S + A +S A +S A + S+A + + A+ A Sbjct: 299 RADATFSKADATFSKANAKSSKADAKSSKADVKYMAANVKSTAVDVKSSKADVKYMAADA 358 Query: 352 ARSASAAKTSETNAKASETSAESSKTAAASSASSAASSASSASASKDEATRQASAAKSSA 411 S AK +AK+S A+SSK A SS + A S+A++ +SK +A A+ AKSS Sbjct: 359 KSSKVDAKYMAADAKSSRADAKSSKVDAKSSRADAKSTAANVKSSKADAKSTAADAKSSR 418 Query: 412 TTASTKATEAAGSATAAAQSKSTAESAATRAETAAKRAEDIASAVALEDASTTKKGIVQL 471 A + A A A A S++ A+S A A++ A A+ +A+ A T Sbjct: 419 ANAKSTAANAKSMAADAKSSRADAKSMAADAKSTAVDAKSMAADAKSTAADATFSKADAT 478 Query: 472 SSATNSTSETLAATPKAVKSAYDNAEKR 499 S ++ S + A A + A+ + Sbjct: 479 FSRADAKSTAVDAKSTAANAKSSRADAK 506 Score = 95.0 bits (234), Expect = 1e-17, Method: Composition-based stats. 
Identities = 113/436 (25%), Positives = 191/436 (43%) Query: 113 VARNASAVAQNTAAAKKSASDASTSAREAATHAADAADSARAASTSAGQAASSAQSASSS 172 + V + A AK S +DA +S + AA+A +A A +S A S A+ S Sbjct: 253 AKSTPANVKSSRANAKFSKADAKSSRADVKYMAANAKSTAANAKSSRADATFSKADATFS 312 Query: 173 AGTASTKATEASKSAAAAESSKSAAATSAGAAKTSETNASASLQSAATSASTATTKASEA 232 A + +A S A + + ++A K+S+ + A +S A A++A Sbjct: 313 KANAKSSKADAKSSKADVKYMAANVKSTAVDVKSSKADVKYMAADAKSSKVDAKYMAADA 372 Query: 233 ATSARDAAASKEAAKSSETNASSSASSAASSATAAGNSAKAAKTSETNARSSETAAGQSA 292 +S DA +SK AKSS +A S+A++ SS A ++A AK+S NA+S+ A A Sbjct: 373 KSSRADAKSSKVDAKSSRADAKSTAANVKSSKADAKSTAADAKSSRANAKSTAANAKSMA 432 Query: 293 SAAAGSKTAAASSASAASTSAGQASASATAAGKSAESAASSASTATTKAGEATEQASAAA 352 + A S+ A S A+ A ++A A + A A +A A S + AT +A A A Sbjct: 433 ADAKSSRADAKSMAADAKSTAVDAKSMAADAKSTAADATFSKADATFSRADAKSTAVDAK 492 Query: 353 RSASAAKTSETNAKASETSAESSKTAAASSASSAASSASSASASKDEATRQASAAKSSAT 412 +A+ AK+S +AK S A+S+ A SS + A SS + A + +A + AKS A+ Sbjct: 493 STAANAKSSRADAKYSAADAKSTAANAKSSKADAKSSRADAKSMAVDAKSTRADAKSMAS 552 Query: 413 TASTKATEAAGSATAAAQSKSTAESAATRAETAAKRAEDIASAVALEDASTTKKGIVQLS 472 A + + A A SK+ A+S A +++ A+ + A S Sbjct: 553 DAKSSRADVKYMAADATFSKADAKSTAANVKSSRADAKSSRADATFSKADAKSTAADAKS 612 Query: 473 SATNSTSETLAATPKAVKSAYDNAEKRLQKDQNGADIPDKGCFLNNINAVSKTDFADKRG 532 +A ++ S A A + A+ + + D + + + + Sbjct: 613 TAVDAKSSRADAKSTAADAXSTAADAKSSRADATFSKADARSTAADAKSSTADAKSTAAN 672 Query: 533 MRYVRVNAPAGATSGK 548 ++ + +A + A K Sbjct: 673 VKSSKADAKSTAADAK 688 Score = 93.4 bits (230), Expect = 5e-17, Method: Composition-based stats. 
Identities = 113/440 (25%), Positives = 194/440 (44%) Query: 109 MVEEVARNASAVAQNTAAAKKSASDASTSAREAATHAADAADSARAASTSAGQAASSAQS 168 M + + + A AK +A+DA ++A A + AA+A S A ++A A S+A + Sbjct: 18 MAADAKSTRANAKSSAANAKSTAADAKSTAANAKSTAANAKSSRANAKSTAADAKSTAAN 77 Query: 169 ASSSAGTASTKATEASKSAAAAESSKSAAATSAGAAKTSETNASASLQSAATSASTATTK 228 SS A + A +A S A A+S+ + A ++A AK+S+ +A ++ A + A A + Sbjct: 78 VKSSKADAKSTAADAKSSKADAKSTAADAKSTAVDAKSSKGDAKSTAVDAKSMAVDAKSS 137 Query: 229 ASEAATSARDAAASKEAAKSSETNASSSASSAASSATAAGNSAKAAKTSETNARSSETAA 288 ++A A DA AKS+ NA S A+ A S A A ++A AK+S+ NA+S+ Sbjct: 138 RADAKYMAADAKYMAADAKSTAANAKSMAADAKSMAVDAKSTAVDAKSSKANAKSTAANV 197 Query: 289 GQSASAAAGSKTAAASSASAASTSAGQASASATAAGKSAESAASSASTATTKAGEATEQA 348 S + A S+ A SS + A + A A +S A +A A S+A+ A + A Sbjct: 198 KSSKADAKSSRANAKSSKADAKSMAVDAKSSRADAKYTAADAKSTAANAKSTPATAKSTP 257 Query: 349 SAAARSASAAKTSETNAKASETSAESSKTAAASSASSAASSASSASASKDEATRQASAAK 408 + S + AK S+ +AK+S + A S+A++A SS + A+ SK +AT + AK Sbjct: 258 ANVKSSRANAKFSKADAKSSRADVKYMAANAKSTAANAKSSRADATFSKADATFSKANAK 317 Query: 409 SSATTASTKATEAAGSATAAAQSKSTAESAATRAETAAKRAEDIASAVALEDASTTKKGI 468 SS A + + A + +S+ + A A+ A Sbjct: 318 SSKADAKSSKADVKYMAANVKSTAVDVKSSKADVKYMAADAKSSKVDAKYMAADAKSSRA 377 Query: 469 VQLSSATNSTSETLAATPKAVKSAYDNAEKRLQKDQNGADIPDKGCFLNNINAVSKTDFA 528 SS ++ S A A A+ + + + N +++ + Sbjct: 378 DAKSSKVDAKSSRADAKSTAANVKSSKADAKSTAADAKSSRANAKSTAANAKSMAADAKS 437 Query: 529 DKRGMRYVRVNAPAGATSGK 548 + + + +A + A K Sbjct: 438 SRADAKSMAADAKSTAVDAK 457 Score = 92.7 bits (228), Expect = 8e-17, Method: Composition-based stats. 
Identities = 115/447 (25%), Positives = 203/447 (45%) Query: 120 VAQNTAAAKKSASDASTSAREAATHAADAADSARAASTSAGQAASSAQSASSSAGTASTK 179 + + A AK A++ A +A + A+A SA A ++A A S+A +A S+A A + Sbjct: 1 MKSSRADAKYMAANVKYMAADAKSTRANAKSSAANAKSTAADAKSTAANAKSTAANAKSS 60 Query: 180 ATEASKSAAAAESSKSAAATSAGAAKTSETNASASLQSAATSASTATTKASEAATSARDA 239 A +AA A+S+ + +S AK++ +A +S A ++A+ A + A +A +S DA Sbjct: 61 RANAKSTAADAKSTAANVKSSKADAKSTAADAKSSKADAKSTAADAKSTAVDAKSSKGDA 120 Query: 240 AASKEAAKSSETNASSSASSAASSATAAGNSAKAAKTSETNARSSETAAGQSASAAAGSK 299 ++ AKS +A SS + A A A A AK++ NA+S A A A + Sbjct: 121 KSTAVDAKSMAVDAKSSRADAKYMAADAKYMAADAKSTAANAKSMAADAKSMAVDAKSTA 180 Query: 300 TAAASSASAASTSAGQASASATAAGKSAESAASSASTATTKAGEATEQASAAARSASAAK 359 A SS + A ++A +S A S +A SS + A + A +A + A +A+ AK Sbjct: 181 VDAKSSKANAKSTAANVKSSKADAKSSRANAKSSKADAKSMAVDAKSSRADAKYTAADAK 240 Query: 360 TSETNAKASETSAESSKTAAASSASSAASSASSASASKDEATRQASAAKSSATTASTKAT 419 ++ NAK++ +A+S+ SS ++A S + A +S+ + A+ AKS+A A + Sbjct: 241 STAANAKSTPATAKSTPANVKSSRANAKFSKADAKSSRADVKYMAANAKSTAANAKSSRA 300 Query: 420 EAAGSATAAAQSKSTAESAATRAETAAKRAEDIASAVALEDASTTKKGIVQLSSATNSTS 479 +A S A SK+ A+S+ A+++ + +A+ V A ++ S Sbjct: 301 DATFSKADATFSKANAKSSKADAKSSKADVKYMAANVKSTAVDVKSSKADVKYMAADAKS 360 Query: 480 ETLAATPKAVKSAYDNAEKRLQKDQNGADIPDKGCFLNNINAVSKTDFADKRGMRYVRVN 539 + A A + A+ + K + D N+ + + + R N Sbjct: 361 SKVDAKYMAADAKSSRADAKSSKVDAKSSRADAKSTAANVKSSKADAKSTAADAKSSRAN 420 Query: 540 APAGATSGKYYPVVVMRSAGSVSELAS 566 A + A + K S +A+ Sbjct: 421 AKSTAANAKSMAADAKSSRADAKSMAA 447 Score = 90.0 bits (221), Expect = 6e-16, Method: Composition-based stats. 
Identities = 108/381 (28%), Positives = 190/381 (49%), Gaps = 2/381 (0%) Query: 111 EEVARNASAVAQNTAAAKKSASDASTSAREAATHAADAADSARAASTSAGQAASSAQSAS 170 + AS + A K A+DA+ S +A + AA+ S A +S A S A Sbjct: 545 ADAKSMASDAKSSRADVKYMAADATFSKADAKSTAANVKSSRADAKSSRADATFSKADAK 604 Query: 171 SSAGTASTKATEASKSAAAAESSKSAAATSAGAAKTSETNASASLQSAATSASTATTKAS 230 S+A A + A +A S A A+S+ + A ++A AK+S +A+ S A ++A+ A + + Sbjct: 605 STAADAKSTAVDAKSSRADAKSTAADAXSTAADAKSSRADATFSKADARSTAADAKSSTA 664 Query: 231 EAATSARDAAASKEAAKSSETNASSSASSAASSATAAGNSAKAAKTSETNARSSETAAGQ 290 +A ++A + +SK AKS+ +A S A+ A SS A S AK+S+ NA+SS+ A Sbjct: 665 DAKSTAANVKSSKADAKSTAADAKSMAADAKSSRADATFSKADAKSSKANAKSSKADAKS 724 Query: 291 SASAAAGSKTAAASSASAASTSAGQASASATAAGKSAESAASSASTATTKAGEATEQASA 350 A+ A SS + A A ++A A +A + S+A A + +A A+ Sbjct: 725 MAADAKSMAANVKSSKADVKYMAADAKSTAANAKSTAVNVKSTAVDAKSSKADAKSTAAD 784 Query: 351 AARSASAAKTSETNAKASETSAESSKTAAASSASSAASSASSASASKDEATRQASAAKSS 410 A +A+ K+S+ NAK+S A+SS+ A S+A +A SS A ++ +A ++S A + Sbjct: 785 AKSTAADVKSSKANAKSSRADAKSSRADAKSTAVNAKSSKGDAKSTAVDA--KSSRADAK 842 Query: 411 ATTASTKATEAAGSATAAAQSKSTAESAATRAETAAKRAEDIASAVALEDASTTKKGIVQ 470 +T A+ K+T A + + A A +++A+ A+ A + + K + Sbjct: 843 STAANVKSTAANATFSKADVKYMAANVKSSKADVKYMAADAKYMATNAKSSRVDVKYMAA 902 Query: 471 LSSATNSTSETLAATPKAVKS 491 + ++ +++ AA K+ + Sbjct: 903 DAKSSKGDAKSTAANAKSTAA 923 Score = 81.9 bits (200), Expect = 1e-13, Method: Composition-based stats. 
Identities = 100/340 (29%), Positives = 166/340 (48%) Query: 111 EEVARNASAVAQNTAAAKKSASDASTSAREAATHAADAADSARAASTSAGQAASSAQSAS 170 + + + + A AK +A+DA ++A +A + ADA +A A ++A A SS A+ Sbjct: 587 ADAKSSRADATFSKADAKSTAADAKSTAVDAKSSRADAKSTAADAXSTAADAKSSRADAT 646 Query: 171 SSAGTASTKATEASKSAAAAESSKSAAATSAGAAKTSETNASASLQSAATSASTATTKAS 230 S A + A +A S A A+S+ + +S AK++ +A + A +S + AT + Sbjct: 647 FSKADARSTAADAKSSTADAKSTAANVKSSKADAKSTAADAKSMAADAKSSRADATFSKA 706 Query: 231 EAATSARDAAASKEAAKSSETNASSSASSAASSATAAGNSAKAAKTSETNARSSETAAGQ 290 +A +S +A +SK AKS +A S A++ SS A AK++ NA+S+ Sbjct: 707 DAKSSKANAKSSKADAKSMAADAKSMAANVKSSKADVKYMAADAKSTAANAKSTAVNVKS 766 Query: 291 SASAAAGSKTAAASSASAASTSAGQASASATAAGKSAESAASSASTATTKAGEATEQASA 350 +A A SK A S+A+ A ++A +S A S A SS + A + A A Sbjct: 767 TAVDAKSSKADAKSTAADAKSTAADVKSSKANAKSSRADAKSSRADAKSTAVNAKSSKGD 826 Query: 351 AARSASAAKTSETNAKASETSAESSKTAAASSASSAASSASSASASKDEATRQASAAKSS 410 A +A AK+S +AK++ + +S+ A S + A++ +SK + A+ AK Sbjct: 827 AKSTAVDAKSSRADAKSTAANVKSTAANATFSKADVKYMAANVKSSKADVKYMAADAKYM 886 Query: 411 ATTASTKATEAAGSATAAAQSKSTAESAATRAETAAKRAE 450 AT A + + A A SK A+S A A++ A A+ Sbjct: 887 ATNAKSSRVDVKYMAADAKSSKGDAKSTAANAKSTAANAK 926 Score = 52.2 bits (123), Expect = 1e-04, Method: Composition-based stats. 
Identities = 52/181 (28%), Positives = 77/181 (42%) Query: 109 MVEEVARNASAVAQNTAAAKKSASDASTSAREAATHAADAADSARAASTSAGQAASSAQS 168 M + A+ K +A DA +S +A + AADA +A +S A SS Sbjct: 746 MAADAKSTAANAKSTAVNVKSTAVDAKSSKADAKSTAADAKSTAADVKSSKANAKSSRAD 805 Query: 169 ASSSAGTASTKATEASKSAAAAESSKSAAATSAGAAKTSETNASASLQSAATSASTATTK 228 A SS A + A A S A+S+ A +S AK++ N ++ +A S + Sbjct: 806 AKSSRADAKSTAVNAKSSKGDAKSTAVDAKSSRADAKSTAANVKSTAANATFSKADVKYM 865 Query: 229 ASEAATSARDAAASKEAAKSSETNASSSASSAASSATAAGNSAKAAKTSETNARSSETAA 288 A+ +S D AK TNA SS A A +S AK++ NA+S+ A Sbjct: 866 AANVKSSKADVKYMAADAKYMATNAKSSRVDVKYMAADAKSSKGDAKSTAANAKSTAANA 925 Query: 289 G 289 Sbjct: 926 K 926 >UniRef50_C6AD76 Putative uncharacterized protein n=1 Tax=Bartonella grahamii as4aup RepID=C6AD76_BARGA Length = 1370 Score = 98.8 bits (244), Expect = 9e-19, Method: Composition-based stats. Identities = 89/382 (23%), Positives = 137/382 (35%), Gaps = 2/382 (0%) Query: 113 VARNASAVAQNTAAAKKSASDASTSAREAATHAADAADSARAASTSAGQAASSAQSASSS 172 V + +Q + + A T AREA A DA + ++ A + Sbjct: 174 VTTAVTESSQRVTQVQGTVETALTEAREAKATAGDAKTLSEQTKSAFDDAKQVMGEVKNV 233 Query: 173 AGTASTKATEASKSAAAAESSKSAAATSAGAAKTSETNASASLQSAATSASTATTKASEA 232 A TA+ + +A +A A+ + A++ AG AK + A A + S A A Sbjct: 234 AETAAASSGQALTTAKEAKMTAETASSVAGDAKETALRALADVNDLKQSVDHVKNTAEGA 293 Query: 233 ATSARDAAASKEAAKSSETNASSSASSAASSATAAGNSAKAAKTSETNARSSETAAGQSA 292 +A + T A + A A A +A+T NA+ +A +A Sbjct: 294 KRTAEGVEQKALTIEDVATQAKTKAGEAKQLVEEVKMRAHSAETLANNAKRLADSAKDTA 353 Query: 293 SAAAGSKTAAASSASAASTSAGQASASATAAGKSAESAASSASTATTKAGEATE--QASA 350 A A + A ++A A A A K A+ A +AS A T A EA + + S Sbjct: 354 VEAKNKAEDAHVLSQDAKSTAETAEEKAARAEKKADDATVTASEAKTMAQEAKQALETST 413 Query: 351 AARSASAAKTSETNAKASETSAESSKTAAASSASSAASSASSASASKDEATRQASAAKSS 410 A S A + AK A + + ++A A S AT A AK++ Sbjct: 414 ATEDVSEALSKAAEAKKVADQAATLANESKTTAVEAKGLVEEVKQSVATATETAEDAKNT 473 Query: 411 ATTASTKATEAAGSATAAAQSKSTAESAATRAETAAKRAEDIASAVALEDASTTKKGIVQ 470 A A+ A A + A+ A+ A A A+ A + T Sbjct: 474 
ARKATVDIISIKAKALTAESVANEAKGASESAVRVANTAKTTAEEAKSAADTATATAHQA 533 Query: 471 LSSATNSTSETLAATPKAVKSA 492 SA +T A A +A Sbjct: 534 QQSAEEATKLASEAKAVAESAA 555 Score = 54.9 bits (130), Expect = 2e-05, Method: Composition-based stats. Identities = 55/222 (24%), Positives = 90/222 (40%), Gaps = 7/222 (3%) Query: 280 NARSSETAAGQSASAAAGSKTAAASSASAASTSAGQASASATAAGKSAESAASSASTATT 339 NAR A A+ A + A + A + + A+ + A +A SA ++AS A T Sbjct: 54 NARKEAEKAMARANGAQQTADQAKTIADSVKELSDAAATTIAQAAGTAASAVTTASEAKT 113 Query: 340 KAGEATEQASAAARSASAAKTSETNAKASETSAESSKTAAASSASSAASSASSASASKDE 399 +A A EQA+ A +++AAK + +KAS A+ AA A A A + D Sbjct: 114 RAETALEQANTALTTSNAAKQTAEISKASSEDAKMKSEAAERLAQEAKRIAEDTKGAADR 173 Query: 400 ATRQASAAKS-------SATTASTKATEAAGSATAAAQSKSTAESAATRAETAAKRAEDI 452 T + + + TA T+A EA +A A +SA A+ +++ Sbjct: 174 VTTAVTESSQRVTQVQGTVETALTEAREAKATAGDAKTLSEQTKSAFDDAKQVMGEVKNV 233 Query: 453 ASAVALEDASTTKKGIVQLSSATNSTSETLAATPKAVKSAYD 494 A A +A ++S A A+++ D Sbjct: 234 AETAAASSGQALTTAKEAKMTAETASSVAGDAKETALRALAD 275 >UniRef50_C5RZI7 Putative uncharacterized protein n=1 Tax=Actinobacillus minor NM305 RepID=C5RZI7_9PAST Length = 1493 Score = 98.4 bits (243), Expect = 1e-18, Method: Composition-based stats. 
Identities = 95/386 (24%), Positives = 157/386 (40%) Query: 107 ELMVEEVARNASAVAQNTAAAKKSASDASTSAREAATHAADAADSARAASTSAGQAASSA 166 + + A+ + + A A +++ A SA +A ++ A T A A A Sbjct: 743 KEVANNANALATTMITDVAKAVRNSESAIDSADKAYANSVTAMAQTTTFVTRASNAEKVA 802 Query: 167 QSASSSAGTASTKATEASKSAAAAESSKSAAATSAGAAKTSETNASASLQSAATSASTAT 226 Q A A +A AT+A +A A A A A S ++ S ++ +TAT Sbjct: 803 QEAKDLATSADVNATKAIDTANTASLKAEQAYGKADALTQSVSDLSTTVGGLEGKVNTAT 862 Query: 227 TKASEAATSARDAAASKEAAKSSETNASSSASSAASSATAAGNSAKAAKTSETNARSSET 286 KA A+T A A+ A + A++ A A ++A A A A + TNA++ Sbjct: 863 AKAESASTIANQASTKANEASTKADTATTKAEQALTTANAVNTKADNAVSLATNAQTQAQ 922 Query: 287 AAGQSASAAAGSKTAAASSASAASTSAGQASASATAAGKSAESAASSASTATTKAGEATE 346 +A Q+A A + ++A A A + + A A A A+ A T A ++ Sbjct: 923 SAQQTAQEALTRVSDVQTTAETAKIIAERTESQIGAISDVANKADEKATQAVTLAQQSAT 982 Query: 347 QASAAARSASAAKTSETNAKASETSAESSKTAAASSASSAASSASSASASKDEATRQASA 406 A A + A+ A T + A A + A ++A A+S+A+SAS D A A+ Sbjct: 983 VADNANKLATDAYTQSSTAIVKADEASAKADTALTTAQKASSTANSASVKADNAISIANV 1042 Query: 407 AKSSATTASTKATEAAGSATAAAQSKSTAESAATRAETAAKRAEDIASAVALEDASTTKK 466 A A A EA+ AT A + + + + A +A+ A+ A Sbjct: 1043 ANVKADETKVIAQEASDRATNAEKVALSVTEEVGKVKALATQADTTANKALTASALAQTN 1102 Query: 467 GIVQLSSATNSTSETLAATPKAVKSA 492 L+ A+N+ A A+ + Sbjct: 1103 SSQALTLASNANDNAKEAQNVAMNAT 1128 Score = 74.2 bits (180), Expect = 3e-11, Method: Composition-based stats. 
Identities = 82/382 (21%), Positives = 130/382 (34%), Gaps = 28/382 (7%) Query: 132 SDASTSAREAATHAADAADSARAASTSAGQAASSAQSASSSAGTASTKATEASKSAAAAE 191 A + A + + + A A A A + A A Sbjct: 691 EQALAKSDLVMAKATELETQNANMLDEISAISQVVGANKVLAENAVDTANLAKEVANNAN 750 Query: 192 SSKSAAATSAGAAKTSETNASASLQSAATSASTATTKASEAATSARDAAASKEAAKSSET 251 + + T A + +A S A ++ TA + + T A +A + AK T Sbjct: 751 ALATTMITDVAKAVRNSESAIDSADKAYANSVTAMAQTTTFVTRASNAEKVAQEAKDLAT 810 Query: 252 NASSSASSAASSATAAGNSAKAAKTSETNARSSETAAGQSASAAAGSKTAAASSASAAST 311 +A +A+ A +A A A+ A S + + G A + A +AST Sbjct: 811 SADVNATKAIDTANTASLKAEQAYGKADALTQSVSDLSTTVGGLEGKVNTATAKAESAST 870 Query: 312 SAGQASASATAAGKSAESAASSASTATTKAGEATEQA----------------------- 348 A QAS A A A++A + A A T A +A Sbjct: 871 IANQASTKANEASTKADTATTKAEQALTTANAVNTKADNAVSLATNAQTQAQSAQQTAQE 930 Query: 349 -----SAAARSASAAKTSETNAKASETSAESSKTAAASSASSAASSASSASASKDEATRQ 403 S +A AK ++ + A A+ A + A ++ D A + Sbjct: 931 ALTRVSDVQTTAETAKIIAERTESQIGAISDVANKADEKATQAVTLAQQSATVADNANKL 990 Query: 404 ASAAKSSATTASTKATEAAGSATAAAQSKSTAESAATRAETAAKRAEDIASAVALEDAST 463 A+ A + ++TA KA EA+ A A + A S A A A A IA+ ++ T Sbjct: 991 ATDAYTQSSTAIVKADEASAKADTALTTAQKASSTANSASVKADNAISIANVANVKADET 1050 Query: 464 TKKGIVQLSSATNSTSETLAAT 485 ATN+ L+ T Sbjct: 1051 KVIAQEASDRATNAEKVALSVT 1072 Score = 59.2 bits (141), Expect = 9e-07, Method: Composition-based stats. 
Identities = 61/294 (20%), Positives = 109/294 (37%), Gaps = 7/294 (2%) Query: 107 ELMVEEVARNASAVAQNTAAAKKSASDASTSAREAATHAADAADSARAASTSAGQAASSA 166 E + V V A +A+ A T++ A T+++ A A A+ +A +A + A Sbjct: 1065 EKVALSVTEEVGKVKALATQADTTANKALTASALAQTNSSQALTLASNANDNAKEAQNVA 1124 Query: 167 QSASSSAGTASTKATEASKSAAAAESSKSAAATSAGAAKTSETNASASLQSAATSASTAT 226 +A++ A AST A EA A A A A A S A+ +A + A TA Sbjct: 1125 MNATTIATKASTDANEAKALATTASQDALTATEQANLATASAEQATNLATTANSKADTAL 1184 Query: 227 TKASEAA----TSARDAAASKEAAKSSETNASSSASSAASSATAAGNSAKAAKTSETNAR 282 +SEA ++ ++ + E AKS+ T A+ + A A + + A+ T R Sbjct: 1185 ATSSEAKVLANSAVDTSSKANETAKSAMTQATVAFDKAE--AMSNQVNLNTAEIEATKTR 1242 Query: 283 SSETAAGQSASAAAGSKTAAASSASAASTSAGQASASATAAGKSAESAASSASTATTKAG 342 + T A ++ + + A A + + A+ + Sbjct: 1243 LASTDAVVESNTKSIHDIEFTVNVLDNQAVKYNADKKAVTLQSNGNKGTVIENVASGEVS 1302 Query: 343 EATEQASAAAR-SASAAKTSETNAKASETSAESSKTAAASSASSAASSASSASA 395 + QA A+ A+ K + + T A+ A + ++ Sbjct: 1303 ATSTQAVNGAQLHATEQKVEKIVEEQLIPVQADVATVKTEVATVKAKATKNSQR 1356 >UniRef50_Q6H236 Paternally-expressed gene 3 protein n=10 Tax=Eutheria RepID=PEG3_BOVIN Length = 2387 Score = 97.7 bits (241), Expect = 2e-18, Method: Composition-based stats. 
Identities = 66/403 (16%), Positives = 125/403 (31%), Gaps = 8/403 (1%) Query: 110 VEEVARNASAVAQNTAAAKKSASDASTSAREAATHAADAADSARAASTSAGQAASSAQSA 169 EE A+ V A+ + S S A T D+A A + + + Sbjct: 1232 TEEPAQTNYTVESAEASYTEEPSQTSCIEEPAQTSYTDSAADTSCTEEPAQTSCTEEPAQ 1291 Query: 170 SSSAGTASTKATEASKSAAAAESSKSAAATSAGAAKTSETNASASLQSAATSASTATTKA 229 +S + + + + + + + A+TS T A A T+ T Sbjct: 1292 TSYTQEPAQTSCTEEPAQTSCTEEPAQTSYTQEPAQTSCTEEPAQTSYTQEPAQTSCT-- 1349 Query: 230 SEAATSARDAAASKEAAKSSETNASSSASSAASSATAAGNSAKAAKTSETNARSSETAAG 289 E A ++ ++ + S + A +S T + + + E A Sbjct: 1350 EEPAQTSYTEEPAQTSYTEEPAQTSYTQEPAQTSCTEEPAQTSYTEEPAQTSYTEEPAQT 1409 Query: 290 QSASAAAGSKTAAASSASAASTSAGQASASATAAGKSAESAASSASTATTKAGE--ATEQ 347 A + + ++ + Q S + A S + S A A A E Sbjct: 1410 SYTQEPAQTSYTEEPAQTSYTEEPAQTSYAQEPAQTSYAEEPAQTSYAEEPAQTSYAEEP 1469 Query: 348 ASAAARSASAAKTSETNAKASETSAESSKTAAASSASSAASSASSASASKDEATRQASAA 407 A + A + + E ++T+ A + + A S E Q S A Sbjct: 1470 AQTSYTQEPAQTNYTEEPAEASYTEEPAQTSYAEEPAQTSYPEEPAQTSYAEEPAQTSYA 1529 Query: 408 KSSATTASTKATEAAGSATAAAQSKSTAESAATRAETAAKRAEDIASAVALEDASTTKKG 467 + A T E + + T+ A A+T+ S ++ + Sbjct: 1530 E---EPAQTSYPEEPAQTSYTEEPAQTSY-AKEPAQTSYPEEPAQTSYAEEPAQTSYAEE 1585 Query: 468 IVQLSSATNSTSETLAATPKAVKSAYDNAEKRLQKDQNGADIP 510 Q S A + + P + + ++K+Q D+P Sbjct: 1586 PAQTSYAEEPAQTSYSEEPAQTRYTGNELRSDMRKNQLRPDMP 1628 Score = 84.2 bits (206), Expect = 3e-14, Method: Composition-based stats. 
Identities = 65/388 (16%), Positives = 131/388 (33%), Gaps = 24/388 (6%) Query: 110 VEEVARNASAVAQNTAAAKKSASDASTSAREAATHAADAADSARAASTSAGQAASSAQSA 169 + V R+ S+ T+ +S S A T A+ A A A + + + Sbjct: 918 LRSVLRSLSSTDPQTSYQGQSV-QMSYPQEAAQTSYAELAAQTSYAEEPAQTSYAVEPAQ 976 Query: 170 SSSAGTASTKATEASKSAAAAESSKSAAATSAGAAKTSETNASASLQSAATSASTATTKA 229 +S A + + + + A+ + + A+TS TN +A A A T+ T+A Sbjct: 977 TSYAEEPAQTSYTEAPAEASYTEEPAQTSCIEEPAQTSYTNPAAETSYAEEPAQTSYTEA 1036 Query: 230 SEAATSARDAAASKEAAKSSETNASSSASSAASSATAAGNSAKAAKTSETNARSSETAAG 289 A+ + A + + ++T+ ++ A+ + + A S A + E A Sbjct: 1037 PAEASYTEEPAQTSCIEEPAQTSYTNPAAETSYTEEPAQTSYTEA--PAEASGIEEPAQT 1094 Query: 290 QSASAAAGSKTAAASSASAASTSAGQASASATAAGKSAESAASSASTATTKAGEATEQAS 349 +A S ++ Q S + +A +S + + E A Sbjct: 1095 NYTEESAEVSYTEEPSQTSCIEEPAQTSYTD-------PAAETSYTEEPAQTSYTQEPAQ 1147 Query: 350 AAARSASAAKTSETNAKASETSAESSKTAAASSASSAASSASSASASKDEATRQASAAKS 409 + A + + + E ++T+ + A+ + A S E Sbjct: 1148 TSCTEEPAQTSCTEEPAQTSYTQEPAQTSYTKEPAEASYTEEPAQTSCIE---------- 1197 Query: 410 SATTASTKATEAAGSATAAAQSKSTAESAATRAETAAKRAEDIASAVALEDASTTKKGIV 469 A T T+ + A+ + T+ + AET+ E + +E A + Sbjct: 1198 --EPAQTNYTKESAKASYTEEPAQTSYTDPA-AETSYT-EEPAQTNYTVESAEASYTEEP 1253 Query: 470 QLSSATNSTSETLAATPKAVKSAYDNAE 497 +S ++T A S + Sbjct: 1254 SQTSCIEEPAQTSYTDSAADTSCTEEPA 1281 Score = 61.1 bits (146), Expect = 2e-07, Method: Composition-based stats. 
Identities = 51/291 (17%), Positives = 95/291 (32%), Gaps = 10/291 (3%) Query: 214 SLQSAATSASTATTKASEAATSARDAAASKEAAKSSETNASSSASSAASSATAAGNSAKA 273 S + A K S + R S + + S + A + A+ Sbjct: 896 SSELAEHQKIHNRKKLSGSKNYLRSVLRSLSSTDPQTSYQGQSVQMSYPQEAAQTSYAEL 955 Query: 274 AKTSETNARSSETAAGQSASAAAGSKTAAASSASAASTSAGQASASATAAGKS--AESAA 331 A + ++T+ + + ++ A +S + A +AS + A S E A Sbjct: 956 AAQTSYAEEPAQTSYAVEPAQTSYAEEPAQTSYTEAP---AEASYTEEPAQTSCIEEPAQ 1012 Query: 332 SSASTATTKAGEATEQASAAARSASAAKTSETNAKASETSAESSKTAAASSASSAASSAS 391 +S + + A E A + A A + + E ++T+ + A+ + + Sbjct: 1013 TSYTNPAAETSYAEEPAQTSYTEAPAEASYTEEPAQTSCIEEPAQTSYTNPAAETSYTEE 1072 Query: 392 SASASKDEATRQASAAKSSATTASTKATEAAGSATAAAQSKSTAESAATR-----AETAA 446 A S EA +AS + A T T+ + +Q+ E A T AET+ Sbjct: 1073 PAQTSYTEAPAEASGIEEPAQTNYTEESAEVSYTEEPSQTSCIEEPAQTSYTDPAAETSY 1132 Query: 447 KRAEDIASAVALEDASTTKKGIVQLSSATNSTSETLAATPKAVKSAYDNAE 497 S ++ + Q S + P + AE Sbjct: 1133 TEEPAQTSYTQEPAQTSCTEEPAQTSCTEEPAQTSYTQEPAQTSYTKEPAE 1183 >UniRef50_Q7N348 Similarities with tail fiber protein n=1 Tax=Photorhabdus luminescens subsp. laumondii RepID=Q7N348_PHOLL Length = 440 Score = 97.3 bits (240), Expect = 3e-18, Method: Composition-based stats. Identities = 47/123 (38%), Positives = 63/123 (51%), Gaps = 12/123 (9%) Query: 871 RNGGLFYRSSRDGYGFEEDWA------EVYTSKNLPPESYPVGAPIPWPSDTVPSGYALM 924 +NG LFY S R E ++ Y++ + ++ P G P+P+P P GY Sbjct: 258 KNGCLFY-SHRLSSNNVELFSTGKIIPSDYSNFDARYDNVPAGVPMPYPHRYTPPGYLTC 316 Query: 925 QGQAFDKSAYPKLAAAYPSGVIPDMRGWTIKGKPAS-----GRAVLSQEQDGIKSHTHSA 979 GQ FDKS YPKLA AYP+G +PD+RG I+G S GR + + D I H H Sbjct: 317 NGQTFDKSLYPKLAEAYPAGRVPDLRGEFIRGWDDSRGVDPGRVCGTWQADCIPDHNHYK 376 Query: 980 SAS 982 AS Sbjct: 377 VAS 379 >UniRef50_A7ZN97 Tail fiber family protein n=2 Tax=Escherichia coli RepID=A7ZN97_ECO24 Length = 336 Score = 96.9 bits (239), Expect = 4e-18, Method: Composition-based stats. 
Identities = 39/110 (35%), Positives = 56/110 (50%), Gaps = 6/110 (5%) Query: 1 MAVKISGVLKDGTGKPVQNCTIQLKAKRNSTTVVVNTLASENPDEAGRYSMDVEYGQYSV 60 MA+ ISGV +G G+PV + L A+ S+ VV+ T+ + AG Y D+ G Y V Sbjct: 1 MAI-ISGVYANGVGEPVVGVQLVLTARVTSSRVVMTTVVEQETGAAGEYKFDMNPGVYVV 59 Query: 61 ILLVEGFPPSHAGTITVYEDSQPGTLNDFLGAMTEDDARPEALRRFELMV 110 ++ G I V DS GTLND+L + D+ P AL + +V Sbjct: 60 TA-----SAAYLGVINVNPDSVDGTLNDYLTNFSADELTPAALAEIQELV 104 >UniRef50_B2PZV1 Putative uncharacterized protein n=2 Tax=Gammaproteobacteria RepID=B2PZV1_PROST Length = 526 Score = 95.4 bits (235), Expect = 1e-17, Method: Composition-based stats. Identities = 64/239 (26%), Positives = 93/239 (38%), Gaps = 18/239 (7%) Query: 809 TAAHVAAFARRATDTYADADGGVPW--NAESGAYNVTRSGDSYILVNFYTGVG-SCRTLQ 865 + + A + G PW N + NV + + S R L Sbjct: 282 SIENNTATTLGCGFYAIPGNAGNPWGNNGSAHIINVRDGNYGFQIGRTTGNKNLSFRILS 341 Query: 866 MKAHYRNGGLFYRSSRDGYGFEEDWAEVYTSKNLPPESYPVGAPIPWPSDTVPSGYALMQ 925 + + Y + + S + PVGAPIPWP T PSGY + Sbjct: 342 ANV-FSPPSVLYSTGNTTKDHN---GNLKVSGSSELSDCPVGAPIPWPQATAPSGYLICN 397 Query: 926 GQAFDKSAYPKLAAAYPSGVIPDMRGWTIKGKPA-----SGRAVLSQEQDGIKSHTH--- 977 GQAF+K+ YP L AYPSG +PD+RG I+G A +GR VLS ++ + H H Sbjct: 398 GQAFNKTTYPLLTKAYPSGKLPDLRGEFIRGLDAGRNIDNGRVVLSFQRCATEHHKHISG 457 Query: 978 --SASASSTDLGTKTTSSFDYGTKSTNNTGAHTHSVSGSTNSAGAHTHSLANVNTASAN 1034 AS ++ G KT + G+ ST+ ++ GS + N Sbjct: 458 WGEASNANAIFG-KTVKNGYVGSASTDRDNYLFYTNDGSEFQGSNPNSTGIMANETRPR 515 >UniRef50_Q9WXA5 Tail fiber n=2 Tax=Pectobacterium carotovorum RepID=Q9WXA5_ERWCA Length = 667 Score = 94.2 bits (232), Expect = 3e-17, Method: Composition-based stats. 
Identities = 35/113 (30%), Positives = 53/113 (46%), Gaps = 11/113 (9%) Query: 801 DGSKDITLTAAHVAAFAR----------RATDTYADADGGVPWNAESGAYNVTRSGDSYI 850 D TAA + A A+ ++ D + WN+ +GAY GD+ + Sbjct: 413 DYDTANKPTAADIGAIAKTDADNNYVRQGSSGVVYKND-DLAWNSPTGAYLKDNGGDASL 471 Query: 851 LVNFYTGVGSCRTLQMKAHYRNGGLFYRSSRDGYGFEEDWAEVYTSKNLPPES 903 + + GS Q +Y NGGL YRSSRD GFE+ WA +Y+ ++ P + Sbjct: 472 IWHIGLNTGSTSAAQFHFNYANGGLKYRSSRDSLGFEKPWARIYSDQDKPTAA 524 Score = 44.1 bits (102), Expect = 0.034, Method: Composition-based stats. Identities = 32/131 (24%), Positives = 49/131 (37%), Gaps = 16/131 (12%) Query: 419 TEAAGSATAAAQSKSTAESAATRAETAAKRAEDIASAVALEDASTTKKGIVQLSSATNST 478 +A A K E + T A A+ + + T AT++ Sbjct: 33 RQAKELANRTRYLKKEQEKTGSDLATHAAAADPHTQYAPKANPTFTGTPKAPT-PATDNN 91 Query: 479 SETLAATPKAVKSAYDNAEKRLQKDQNGADIPDKGCFLNNINAVSKTDFADKRGMRYVRV 538 S+ +A T VKS A +L KDQNGADI D+ N+ + R Sbjct: 92 SQQVATTAF-VKSV---AATKLAKDQNGADIQDRELLNRNLGSS-----------RAYSS 136 Query: 539 NAPAGATSGKY 549 + P G ++G + Sbjct: 137 SIPIGGSAGSW 147 Score = 43.4 bits (100), Expect = 0.056, Method: Composition-based stats. Identities = 42/189 (22%), Positives = 65/189 (34%), Gaps = 23/189 (12%) Query: 765 ATGELVIGTKLSASLNGNALTATKLQTPRRVSGVEFDGSKDITLTAAHV--------AAF 816 A L + + G A TKL R+++GV FDG+ DI + + + A Sbjct: 285 ANIALTPANIGALPVAGTAAAETKLAVARKIAGVAFDGTADIDVNSQGIFSASLSIGNAV 344 Query: 817 ARRATDTYADADGGVPWNAESGA-YNVTRSGDSYILVNFYTGVGSCRTLQMKAHYRNGGL 875 T V A SG Y ++G +L + Q+ Y N Sbjct: 345 DLNTYTTPGLYHQAVNAQAASGKNYPEAQAGSLEVLKHAGI-------TQVYRIYNNSRC 397 Query: 876 FYRSSRDGYGFEEDWAEVYTSKNLPPESYPVGAPIPWPSDTVPSGYALMQGQAFDKSAYP 935 + R+ G W Y + N P + +GA +D + QG + Sbjct: 398 YKRTQY--SGAWSAWVLDYDTANKPTAA-DIGAIAKTDAD----NNYVRQGSSGVVYKND 450 Query: 936 KLAAAYPSG 944 LA P+G Sbjct: 451 DLAWNSPTG 459 Score = 42.6 bits (98), Expect = 0.087, Method: Composition-based stats. 
Identities = 23/34 (67%), Positives = 29/34 (85%) Query: 781 GNALTATKLQTPRRVSGVEFDGSKDITLTAAHVA 814 G A+ ATKL TPR+++GVEFDGSKDITLT A++ Sbjct: 533 GTAVAATKLATPRKINGVEFDGSKDITLTPANLG 566 >UniRef50_D2TR91 Putative phage tail fibre protein n=1 Tax=Citrobacter rodentium ICC168 RepID=D2TR91_CITRO Length = 279 Score = 93.4 bits (230), Expect = 4e-17, Method: Composition-based stats. Identities = 43/121 (35%), Positives = 61/121 (50%), Gaps = 5/121 (4%) Query: 880 SRDGYGFEEDWAEVYTSKNLPPESYPVGAPIPWPSDTVPSGYALMQGQAFDKSAYPKLAA 939 ++G + A V + PVG P+PWPS T P G+ G F S YPKL Sbjct: 119 EQNGADIPDPEAFVKNLGLGEGSALPVGVPVPWPSATPPEGWLKCNGATFSSSLYPKLGL 178 Query: 940 AYPSGVIPDMRGWTIKGKP-----ASGRAVLSQEQDGIKSHTHSASASSTDLGTKTTSSF 994 AYPSG +PD+RG I+G +GR++LS + D +SH+H+ S T+ + Sbjct: 179 AYPSGKLPDLRGEFIRGWDDGRGADNGRSLLSSQGDAFRSHSHNFDRSWGLENFDATAGY 238 Query: 995 D 995 D Sbjct: 239 D 239 >UniRef50_C2BQ43 Putative uncharacterized protein n=2 Tax=Corynebacterium RepID=C2BQ43_9CORY Length = 1274 Score = 93.1 bits (229), Expect = 6e-17, Method: Composition-based stats. 
Identities = 67/419 (15%), Positives = 145/419 (34%), Gaps = 25/419 (5%) Query: 110 VEEVARNASAVAQNTAAAKKSASDASTSAREAATHAADAADSARAASTSAGQAASSAQSA 169 V+++ R+ + + + +A + +A + + ++ S A Sbjct: 25 VKQILRSLLGKKKPGRSNEPRTPVYQPVQPKAGSTSAPQTPNKKDSAPQKSANTESTTGA 84 Query: 170 SSSAGTASTKATEASKSAA--AAESSKSAAATSAGAAKTSETNASASLQSAATSASTATT 227 S+ + A S + A+ + + T A AA+ SE+ + + A A Sbjct: 85 QSTEPKGKPQDRTAKDSESKNQAKQAPARKPTPAKAAQASESKPATAKPKDAKPAVKPAA 144 Query: 228 KASEAATSARDAAASKEAAKSSETNASSSASSAASSATAAGNSAKAAKTSETNARSSETA 287 K + ++ + A+ AK + ++S +S+A + AAK + A+ ++ Sbjct: 145 KGTTRPSTEKPQASKATPAKPQQPKSTSPSSAAKAPKAKPAAQESAAKPTA--AQQNKAE 202 Query: 288 AGQSASAAAGSKTAAASS----------------ASAASTSAGQASASATAAGKSAESAA 331 + A+ + A AS ++T + + + + S + AA Sbjct: 203 SSPKAAVRKQQDSPAQDKNPQRKAHEPVQRPRRGASFSTTPSKSSPKTPSTRSNSPKPAA 262 Query: 332 SSASTATTK-AGEATEQASAAARSASAAKTSETNAKASETSAESSKTAAASSASSAASSA 390 + + K A + +QA + +++ A + S + +K + A+S ++ Sbjct: 263 KPVTQPSQKGAVKPAQQAQPTTQKPKQTPKPSSSSSAKKASPQPAKPQKSQPANSKPATG 322 Query: 391 SSASASKDEATRQASAAKSSATTASTKATEAAGSATAAAQSKSTAESAATRAETAAKRAE 450 + D R + + + A + + K +A ETA ++A Sbjct: 323 QQQAQKPDSPKRVPANSAKKQPQPQPAPKKNAQPSKPSQPQKGQG--SARTPETAKQKAG 380 Query: 451 DIASAVALEDASTTKKGIVQ--LSSATNSTSETLAATPKAVKSAYDNAEKRLQKDQNGA 507 ASA + S T + + A K+ D + K Q GA Sbjct: 381 PQASAPSAPKPKKGTTAPTPKVASPKTAADASQQKARSTQNKTTQDGNKVVQPKSQQGA 439 Score = 44.9 bits (104), Expect = 0.017, Method: Composition-based stats. 
Identities = 30/167 (17%), Positives = 61/167 (36%), Gaps = 3/167 (1%) Query: 362 ETNAKASETSAESSKTAAASSASSAASSASSASASKDEATRQASAAKSSATTASTKATEA 421 + A ++ +K +A S+ S + A +++ + Q AK S + K A Sbjct: 53 QPKAGSTSAPQTPNKKDSAPQKSANTESTTGAQSTEPKGKPQDRTAKDSESKNQAKQAPA 112 Query: 422 AGSATAAAQSKSTAESAATRAETAAKRAEDIASAVALE---DASTTKKGIVQLSSATNST 478 A A S ++ A + + A + A +K + +++ Sbjct: 113 RKPTPAKAAQASESKPATAKPKDAKPAVKPAAKGTTRPSTEKPQASKATPAKPQQPKSTS 172 Query: 479 SETLAATPKAVKSAYDNAEKRLQKDQNGADIPDKGCFLNNINAVSKT 525 + A PKA +A ++A K QN A+ K ++ ++ Sbjct: 173 PSSAAKAPKAKPAAQESAAKPTAAQQNKAESSPKAAVRKQQDSPAQD 219 >UniRef50_C6DA10 Tail Collar domain protein n=7 Tax=Enterobacteriaceae RepID=C6DA10_PECCP Length = 689 Score = 92.3 bits (227), Expect = 9e-17, Method: Composition-based stats. Identities = 85/354 (24%), Positives = 134/354 (37%), Gaps = 36/354 (10%) Query: 797 GVEFDGSKDITLTAAHVAAFARRATDTYADADGGVPWNAESGAYNVTRSGDSYILVNFYT 856 V DG + +A + AF T T + + V W+A SG Y G +L++F+ Sbjct: 342 NVTLDGLTN-KPSATDIGAFPLGFTGTVNNDE--VAWDANSGVYRAQYPGAGQMLIHFHG 398 Query: 857 GVGSCRTLQMKAHYRNGGLFYRSSRDGYGFEEDWAEVYTSKNLPPESYPVGAPIPWPSDT 916 SC +LQ Y NGGL YR++RDG GFE WA++YT++ P + VGA +P Sbjct: 399 AGASCPSLQFLGEYGNGGLSYRTARDGMGFEHSWAKIYTTQFKPTAA-DVGA-LPIAGGA 456 Query: 917 VPSGYALMQGQ-----------AFDKSAYPKLAAAYPSGVIPDMRGWTIKGKPASGRAVL 965 + G + G +Y ++ + P + + + R + Sbjct: 457 LQGGIRIGAGNIDLPARRAVVGVMPDESYRQMLSLSPDNTVVFGNPNSSAVIHTTDRVYI 516 Query: 966 SQEQDGIKSHTHSASASSTDLGTKTTSS-------FDYGTKSTNNTGAHTHSVSGSTNSA 1018 + +S H + + +G S F T + S S Sbjct: 517 AAAGGAWRSVYHEGNLTPAAIGAMPASELAGIPLPFPGAVAPTGWLKCNGQSFDKSQYPI 576 Query: 1019 GAHTH------SLANVNTASANSGAGSASTRL--SVVHNQNYATSSAGAHTHSLSGTAAS 1070 A + L + G G+ ++R S + + T + Sbjct: 577 LASRYPSGVLPDLRGEFVRGWDDGRGADASRALLSAQGDAIRNIVGTIGQLNDRVNTTET 636 Query: 1071 AGAHAHTVGIGAHTHSVAIGSHGHTITVNAAG----NAENTVKNIAFNYIVRLA 1120 AG GAH+ + G+ G T +A+ AEN +NIAFNYIVR A Sbjct: 637 AGVFDANKYTGAHS-GLTGGNGGRIATFDASKVVPTAAENRPRNIAFNYIVRAA 689 Score = 42.6 bits (98), Expect = 
0.099, Method: Composition-based stats. Identities = 57/325 (17%), Positives = 95/325 (29%), Gaps = 20/325 (6%) Query: 311 TSAGQASASATAAGKSAESAASSASTATTKAGEATEQASAAARSASAAKTSETNAKASET 370 + G++ + AA A S A T A A + A A A+ Sbjct: 50 ETTGKSLQTHLAASDPHSQYAPKNSPALTGTPTAPTTAQTTNNTQIATTAFVKTAIAALI 109 Query: 371 SAESSKTAAASSASSAASSASSASASKDEATRQASAAKSSATTASTKATEAAGSATAAAQ 430 + + ++A + S + A ++ A +A Sbjct: 110 NGSPAALDTLQELANALGNDPHFSTTILNAIADVKTDAANKLNAHASVLDAHPRYAPKDS 169 Query: 431 SKSTAESAATRAETAAKRAEDIASAVALEDASTTKKGIVQLSSATNSTSETLAATPKAVK 490 T A A + + + +A + G + L P Sbjct: 170 PALTGTPTAPTAASNSNNTQLATTAFVKAAVAALVNGSPAALDTLQELAAALGNDPNFST 229 Query: 491 SAYDNAEKRLQKDQNGADIPDKGCFLNNI----NAVSKTDFADKRGMRYVRV-------- 538 + + +L KDQNGADI DK FL+ + V+ FA RV Sbjct: 230 TILNALAGKLAKDQNGADISDKAKFLSTLVYRGELVNHGTFAACNREGVYRVALSDGNTV 289 Query: 539 -NAPAGATSGKYYP---VVVMRSAGSVSEL---ASRVIITTATRTAGDPMN-NCEFNGFV 590 + P Y + V G++S++ + T + N +G Sbjct: 290 TDMPRNIRGEILYSYGFLFVNEIGGAISQMYLPHRGPVATRQNWDGSYSLGWNVTLDGLT 349 Query: 591 MPGGWTDRGRYAYGMFWQYQNNERA 615 TD G + G N+E A Sbjct: 350 NKPSATDIGAFPLGFTGTVNNDEVA 374 >UniRef50_C6LLQ7 Glycogen synthase n=6 Tax=Clostridiales RepID=C6LLQ7_9FIRM Length = 811 Score = 92.3 bits (227), Expect = 1e-16, Method: Composition-based stats. 
Identities = 55/325 (16%), Positives = 100/325 (30%), Gaps = 4/325 (1%) Query: 142 ATHAADAADSARAASTSAGQAASSAQSASSSAGTASTKATEASKSAAAAESSKSAAATSA 201 A A+ +S A + S A+ A+ + A E + A ++ A A Sbjct: 2 ARKKAETVNSKEAPAESIAAEAAKAEPVKKAKEEPVKAAEEVKEVKAKEVKAEEPKAEPA 61 Query: 202 GAAKTSETNASASLQSAATSASTATTKASEAATSARDAA-ASKEAAKSSETNASSSASSA 260 K A+ + A A KA E T A + AK+ A + + Sbjct: 62 KEEKPKAKTTRAAKTAKTEKA--APVKAEEVKTEPVKAEEPKAKPAKAEAVKAEPAKAEE 119 Query: 261 ASSATAAGNSAKAAKTSETNARSSETAAGQSASAAAGSKTAAASSASAASTSAGQASASA 320 + + AKA A+ A + + A ++ A + + + Sbjct: 120 PKAESVKAEEAKAEPAKTEEAKVEPAKAEEPKAEPAKAEKPKAKTTRTTKATKTAKAVKE 179 Query: 321 TAAGKSAESAASSASTATTKAGEATEQASAAARSASAAKTSETNAKASETSAESSKTAAA 380 A + + + + + A E + A+ + + A +K AA Sbjct: 180 EKAAEPEKKTGKTEAAKSEAAKEEPVKEEKPKAKATKTTRTRKSKAAPAAEEVKAKAPAA 239 Query: 381 SSASSAASSASSASASKDEATRQASAAKSSATTASTKATEAAGSATAAAQSKSTAE-SAA 439 + +A+ + T AK+ A TEA A++ E Sbjct: 240 EETKAEKPAAAPVAEEAKAETPVVEEAKTEAPAMEEAKTEAPAVEEVKAETPVVEEAKTE 299 Query: 440 TRAETAAKRAEDIASAVALEDASTT 464 A K E +A V E+ + Sbjct: 300 APAAEEVKAEEPVAEEVKAEEPTYE 324 >UniRef50_C6CP88 Tail Collar domain protein n=5 Tax=Enterobacteriaceae RepID=C6CP88_DICZE Length = 485 Score = 91.9 bits (226), Expect = 1e-16, Method: Composition-based stats. 
Identities = 51/171 (29%), Positives = 73/171 (42%), Gaps = 32/171 (18%) Query: 847 DSYILVNFYTGVGSCRTLQMKAHYRNG---GLFYRSSRDGYGFEEDWAEVYTSKNLPPES 903 + V T + +Q Y G +F RS G W E+ + + Sbjct: 258 AGSLDVEKNTADSAEGCIQRYTTYGGGALPRMFIRSYNAGKQVWGAWQELASLSSPTFTG 317 Query: 904 YP------------------------VGAPIPWPSDTVPSGYALMQGQAFDKSAYPKLAA 939 P G P+PWP VP+G+ GQAFDK+ YP+LA Sbjct: 318 TPTAPTAEAGSNTTQLATTAWFAAEIAGIPLPWPQAAVPTGWLKCNGQAFDKNRYPRLAQ 377 Query: 940 AYPSGVIPDMRGWTIKGKP-----ASGRAVLSQEQDGIKSHTHSASASSTD 985 YPSGV+PD+RG I+G SGR VLSQ++ + ++ SA ++D Sbjct: 378 VYPSGVLPDLRGEFIRGWDDGRGVDSGREVLSQQRGSLINYDGPDSAPTSD 428 >UniRef50_Q7WYN2 Cellulosomal scaffoldin anchoring protein C n=1 Tax=Acetivibrio cellulolyticus RepID=Q7WYN2_9FIRM Length = 1237 Score = 91.5 bits (225), Expect = 2e-16, Method: Composition-based stats. Identities = 78/414 (18%), Positives = 153/414 (36%), Gaps = 17/414 (4%) Query: 113 VARNASAVAQNTAAAKKSASDASTSAREAATHAADAADSARAASTSAGQAASSAQSASSS 172 V +A+A T +AK + + ++T+ A + A+ + Q+A ++ S++ Sbjct: 531 VTPSATATPTPTQSAKPTVTPSATATPTPTQSAMPTVTPSATATPTPTQSAMPTETPSAT 590 Query: 173 AGTASTKAT--EASKSAAAAESSKSAAATSAGAAKTSETNASASLQSAATSASTATTKAS 230 A T++ + SA A + +A + + T+ + S T ++TAT + Sbjct: 591 ATPTPTQSAMPTVTPSATATPTPTQSAIPTVTPSATATPTPTQSAMPTVTPSATATPTPT 650 Query: 231 EAATSARDAAASKE------AAKSSETNASSSASSAASSATAAGNSAKAAKTSETNARSS 284 ++A +A+ A + +A+++ + S+ SA A T +A + Sbjct: 651 QSAKPTVTPSATATPTPTQSAMPTVTPSATATPTPTQSAMPTVTPSATATPTPTQSAMPT 710 Query: 285 ETAAGQSASAAAGSK-TAAASSASAASTSAGQASASATAAGKSAESAASSASTATTKAGE 343 T + + S SA+A T A + T + + + SA T + Sbjct: 711 VTPSTTATPTPTQSAMPTVTPSATATPTPTQSAMPTVTPSATATPTPTQSAKPTVTPSAT 770 Query: 344 ATEQASAAA-----RSASAAKTSETNAKASETSAESSKTAAASSA--SSAASSASSASAS 396 AT + +A SA+A T +A + T + ++ SA + S+ + + + Sbjct: 771 ATPTPTQSAMPTVTPSATATPTPTQSAMPTVTPSATATPTPTQSAKPTVTPSATVTPTPT 830 Query: 397 KDEATRQASAAKSSATTASTKATEAAGSATAAAQSKSTAESAATRAETAAKRAEDIASAV 456 + +A ++ T + SATA +A T + TA A Sbjct: 831 
QSAMPTVTPSATATPTPTQSAKPTVTPSATATPTPTQSAMPTVTPSATATPTPTQSAKPT 890 Query: 457 ALEDASTTKKGIVQLSSATNSTSETLAATPKAVKSAYDNAEKRLQKDQNGADIP 510 A+ T Q + T + S T TP + + + IP Sbjct: 891 ETPSATAT-PTPTQSAMPTETPSATATPTPTQSEMPTETPSATATPTPTQSAIP 943 Score = 88.8 bits (218), Expect = 1e-15, Method: Composition-based stats. Identities = 63/376 (16%), Positives = 134/376 (35%), Gaps = 9/376 (2%) Query: 125 AAAKKSASDASTSAREAATHAADAADSARAASTSAGQAASSAQSASSSAGTASTKATEAS 184 +AK + + ++T+ A + A+ + Q+A + S++A T++ + Sbjct: 507 QSAKPTVTPSATATPTPTQSAKPTVTPSATATPTPTQSAKPTVTPSATATPTPTQSAMPT 566 Query: 185 KSAAAAESSKSAAATSAGAAKTSETNASASLQSAATSASTATTKASEAATSARDAAASKE 244 + +A + + ++ + + + T +AT + ++ S Sbjct: 567 VTPSATATPTPTQSAMPTETPSATATPTPTQSAMPTVTPSATATPTPTQSAIPTVTPSAT 626 Query: 245 AAKSSETNASSSASSAASSATAAGNSAKAAKTSETNARSSETAAGQSASAAAGSKTAAAS 304 A + +A + + +A++ SAK T A + T QSA A Sbjct: 627 ATPTPTQSAMPTVTPSATATPTPTQSAKPTVTPSATATPTPT---QSAMPTVTPSATATP 683 Query: 305 SASAASTSAGQASASATAAGKSAESAASSASTATTKAGEATEQASAAARSASAAKTSETN 364 + + ++ SA+AT + + ST T + + SA+A T + Sbjct: 684 TPTQSAMPTVTPSATATPTPTQSAMPTVTPSTTATPTPTQSAMPT-VTPSATATPTPTQS 742 Query: 365 AKASETSAESSKTAAASSA--SSAASSASSASASKDEATRQASAAKSSATTASTKATEAA 422 A + T + ++ SA + S+ ++ + ++ +A ++ T + Sbjct: 743 AMPTVTPSATATPTPTQSAKPTVTPSATATPTPTQSAMPTVTPSATATPTPTQSAMPTVT 802 Query: 423 GSATAAAQSKSTAESAATRAETAAKRAEDIASAVALEDASTTKKGIVQLSSATNSTSETL 482 SATA +A+ T + T A A+ T SA + + + Sbjct: 803 PSATATPTPTQSAKPTVTPSATVTPTPTQSAMPTVTPSATAT---PTPTQSAKPTVTPSA 859 Query: 483 AATPKAVKSAYDNAEK 498 ATP +SA Sbjct: 860 TATPTPTQSAMPTVTP 875 Score = 87.7 bits (215), Expect = 2e-15, Method: Composition-based stats. 
Identities = 69/365 (18%), Positives = 140/365 (38%), Gaps = 14/365 (3%) Query: 116 NASAVAQNTAAAKKSASDASTSAREAATHAADAADSARAASTSAGQAASSAQSASSSAGT 175 +A+A T +A + + ++T+ A + A+ + Q+A + S++A Sbjct: 588 SATATPTPTQSAMPTVTPSATATPTPTQSAIPTVTPSATATPTPTQSAMPTVTPSATATP 647 Query: 176 ASTKAT--EASKSAAAAESSKSAAATSAGAAKTSETNASASLQSAATSASTATTKASEAA 233 T++ + SA A + +A + + T+ + S T ++TAT +++A Sbjct: 648 TPTQSAKPTVTPSATATPTPTQSAMPTVTPSATATPTPTQSAMPTVTPSATATPTPTQSA 707 Query: 234 ------TSARDAAASKEAAKSSETNASSSASSAASSATAAGNSAKAAKTSETNARSSETA 287 ++ ++ A + +A+++ + S+ SA A T +A+ + T Sbjct: 708 MPTVTPSTTATPTPTQSAMPTVTPSATATPTPTQSAMPTVTPSATATPTPTQSAKPTVTP 767 Query: 288 -AGQSASAAAGSKTAAASSASAASTSAGQASASATAAGKSAESAASSASTATTKAGEATE 346 A + + + SA+A T A + T + + + SA T + T Sbjct: 768 SATATPTPTQSAMPTVTPSATATPTPTQSAMPTVTPSATATPTPTQSAKPTVTPSATVT- 826 Query: 347 QASAAARSASAAKTSETNAKASETSAESSKTAAASSASSAASSASSASASKDEATRQASA 406 + SA T +A A+ T +S+K SA++ + SA + + Sbjct: 827 ----PTPTQSAMPTVTPSATATPTPTQSAKPTVTPSATATPTPTQSAMPTVTPSATATPT 882 Query: 407 AKSSATTASTKATEAAGSATAAAQSKSTAESAATRAETAAKRAEDIASAVALEDASTTKK 466 SA T + A + T +A T + AT T ++ + SA A + + Sbjct: 883 PTQSAKPTETPSATATPTPTQSAMPTETPSATATPTPTQSEMPTETPSATATPTPTQSAI 942 Query: 467 GIVQL 471 V Sbjct: 943 PTVTP 947 Score = 84.6 bits (207), Expect = 2e-14, Method: Composition-based stats. 
Identities = 73/386 (18%), Positives = 150/386 (38%), Gaps = 15/386 (3%) Query: 113 VARNASAVAQNTAAAKKSASDASTSAREAATHAADAADSARAASTSAGQAASSAQSASSS 172 V +A+A T +A + + ++T+ A + A+ + Q+A + S++ Sbjct: 567 VTPSATATPTPTQSAMPTETPSATATPTPTQSAMPTVTPSATATPTPTQSAIPTVTPSAT 626 Query: 173 AGTASTKAT--EASKSAAAAESSKSAAATSAGAAKTSETNASASLQSAATSASTATTKAS 230 A T++ + SA A + +A + + T+ + S T ++TAT + Sbjct: 627 ATPTPTQSAMPTVTPSATATPTPTQSAKPTVTPSATATPTPTQSAMPTVTPSATATPTPT 686 Query: 231 EAA-----TSARDAAASKEAAKSSETNASSSASSAASSAT-AAGNSAKAAKTSETNARSS 284 ++A SA ++A + T ++++ + SA SA A T +A + Sbjct: 687 QSAMPTVTPSATATPTPTQSAMPTVTPSTTATPTPTQSAMPTVTPSATATPTPTQSAMPT 746 Query: 285 ETA-AGQSASAAAGSKTAAASSASAASTSAGQASASATAAGKSAESAASSASTATTKAGE 343 T A + + +K SA+A T A + T + + + SA T + Sbjct: 747 VTPSATATPTPTQSAKPTVTPSATATPTPTQSAMPTVTPSATATPTPTQSAMPTVTPSAT 806 Query: 344 ATEQASAAA-----RSASAAKTSETNAKASETSAESSKTAAASSASSAASSASSASASKD 398 AT + +A SA+ T +A + T + ++ SA + +++A+ + Sbjct: 807 ATPTPTQSAKPTVTPSATVTPTPTQSAMPTVTPSATATPTPTQSAKPTVTPSATATPTPT 866 Query: 399 EATRQASAAKSSATTASTKATEAAGSATAAAQSKSTAESAATRAETAAKRAEDIASAVAL 458 ++ ++AT T++ + + +A A T + T +A S + Sbjct: 867 QSAMPTVTPSATATPTPTQSAKPTETPSATATPTPTQSAMPTETPSATATPTPTQSEMPT 926 Query: 459 EDASTT-KKGIVQLSSATNSTSETLA 483 E S T Q + T + T + Sbjct: 927 ETPSATATPTPTQSAIPTVTPEVTTS 952 Score = 75.0 bits (182), Expect = 2e-11, Method: Composition-based stats. 
Identities = 52/322 (16%), Positives = 112/322 (34%), Gaps = 5/322 (1%) Query: 177 STKATEASKSAAAAESSKSAAATSAGAAKTSETNASASLQSAATSASTATTKASEAATSA 236 T A+ + + ++ + + ++ + + + T +AT + ++ Sbjct: 469 PTPTQSATPTVTPSATATPTQSATPTVTPSATATTTPTQSAKPTVTPSATATPTPTQSAK 528 Query: 237 RDAAASKEAAKSSETNASSSASSAASSATAAGNSAKAAKTSETNARSSETAAGQSASAAA 296 S A + +A + + +A++ SA T A + T QSA Sbjct: 529 PTVTPSATATPTPTQSAKPTVTPSATATPTPTQSAMPTVTPSATATPTPT---QSAMPTE 585 Query: 297 GSKTAAASSASAASTSAGQASASATAAGKSAESAASSASTATTKAGEATEQASAAARSAS 356 A + + ++ SA+AT + + S T + + SA+ Sbjct: 586 TPSATATPTPTQSAMPTVTPSATATPTPTQSAIPTVTPSATATPTPTQSAMPT-VTPSAT 644 Query: 357 AAKTSETNAKASETSAESSKTAAASSASSAASSASSASASKDEATRQASAAKSSATTAST 416 A T +AK + T + ++ SA + +++A+ + ++ ++AT T Sbjct: 645 ATPTPTQSAKPTVTPSATATPTPTQSAMPTVTPSATATPTPTQSAMPTVTPSATATPTPT 704 Query: 417 KATEAAGSATAAAQSKSTAESAATRAETAAKRAEDIASAVALEDASTTKKGIVQLSSATN 476 ++ + + A T + T +A SA+ S T SA Sbjct: 705 QSAMPTVTPSTTATPTPTQSAMPTVTPSATATPTPTQSAMPTVTPSATATP-TPTQSAKP 763 Query: 477 STSETLAATPKAVKSAYDNAEK 498 + + + ATP +SA Sbjct: 764 TVTPSATATPTPTQSAMPTVTP 785 Score = 68.4 bits (165), Expect = 2e-09, Method: Composition-based stats. 
Identities = 60/388 (15%), Positives = 126/388 (32%), Gaps = 15/388 (3%) Query: 108 LMVEEVARNASAVAQNTAAAKKSASDASTSAREAATHAADAADSARAASTSA-GQAASSA 166 + + SA+ T + A +A + + SA SA Sbjct: 638 TVTPSATATPTPTQSAKPTVTPSATATPTPTQSAMPTVTPSATATPTPTQSAMPTVTPSA 697 Query: 167 QSASSSAGTASTKATEASKSAAAAESSK-----SAAATSAGAAKTSETNASASLQSAATS 221 + + +A T ++ + S +A + +++ + S + T Sbjct: 698 TATPTPTQSAMPTVTPSTTATPTPTQSAMPTVTPSATATPTPTQSAMPTVTPSATATPTP 757 Query: 222 ASTATTKASEAATSARDAAASKEAAKSSETNASSSASSAA-----SSATAAGNSAKAAKT 276 +A + +AT+ S + A+ + + +A SATA ++AK Sbjct: 758 TQSAKPTVTPSATATPTPTQSAMPTVTPSATATPTPTQSAMPTVTPSATATPTPTQSAKP 817 Query: 277 SETNARSSETAAGQSASAAAGSKTAAASSASAASTSAGQASASATAAGKSAESAASSAST 336 + T + + QSA A + + ++ SA+AT + + S Sbjct: 818 TVTPSATVTPTPTQSAMPTVTPSATATPTPTQSAKPTVTPSATATPTPTQSAMPTVTPSA 877 Query: 337 ATTKAGEATEQASAAARSASAAKTSETNAKASETSAESSKTAAASSASSAASSASSASAS 396 T + + + SA+A T +A +ET + ++ + S + SA+A+ Sbjct: 878 TATPTPTQSAKPTE-TPSATATPTPTQSAMPTETPSATATP--TPTQSEMPTETPSATAT 934 Query: 397 KDEATRQASAAKSSATTASTKATEAAGSATAAAQSKSTAESAATRAETAAKRAEDIASAV 456 T+ A + T S T + + ST + + A + + Sbjct: 935 -PTPTQSAIPTVTPEVTTSATPTPTPTGSVTPGVTTSTTPTPTQTVKPTATTVNEGPGVI 993 Query: 457 ALEDASTTKKGIVQLSSATNSTSETLAA 484 + I + +T + A Sbjct: 994 PGGNPDVNPSPITTPTPKPTATPTSGAV 1021 Score = 60.7 bits (145), Expect = 4e-07, Method: Composition-based stats. 
Identities = 52/317 (16%), Positives = 107/317 (33%), Gaps = 7/317 (2%) Query: 113 VARNASAVAQNTAAAKKSASDASTSAREAATHAADAADSARAASTSAGQAASSAQSASSS 172 V +A+A T +A + + ++T+ A + A+ + Q+A + S++ Sbjct: 729 VTPSATATPTPTQSAMPTVTPSATATPTPTQSAKPTVTPSATATPTPTQSAMPTVTPSAT 788 Query: 173 AGTASTKAT--EASKSAAAAESSKSAAATSAGAAKTSETNASASLQSAATSASTATTKAS 230 A T++ + SA A + +A + + T + S T ++TAT + Sbjct: 789 ATPTPTQSAMPTVTPSATATPTPTQSAKPTVTPSATVTPTPTQSAMPTVTPSATATPTPT 848 Query: 231 EAATSARDAAASKEAAKSSETNASSSASSAASSATAAGNSAKAAKTSETNARSSETAAGQ 290 ++A S A + +A + + +A++ SAK +T A + T Q Sbjct: 849 QSAKP--TVTPSATATPTPTQSAMPTVTPSATATPTPTQSAKPTETPSATATPTPT---Q 903 Query: 291 SASAAAGSKTAAASSASAASTSAGQASASATAAGKSAESAASSASTATTKAGEATEQASA 350 SA A + + + SA+AT + + T+ T S Sbjct: 904 SAMPTETPSATATPTPTQSEMPTETPSATATPTPTQSAIPTVTPEVTTSATPTPTPTGSV 963 Query: 351 AARSASAAKTSETNAKASETSAESSKTAAASSASSAASSASSASASKDEATRQASAAKSS 410 ++ + T + + + + + + + S A Sbjct: 964 TPGVTTSTTPTPTQTVKPTATTVNEGPGVIPGGNPDVNPSPITTPTPKPTATPTSGAVVE 1023 Query: 411 ATTASTKATEAAGSATA 427 T+ S TA Sbjct: 1024 PTSEVPGPDGLPLSHTA 1040 >UniRef50_B0X1G5 Microtubule-associated protein futsch n=1 Tax=Culex quinquefasciatus RepID=B0X1G5_CULQU Length = 4575 Score = 90.4 bits (222), Expect = 4e-16, Method: Composition-based stats. 
Identities = 83/421 (19%), Positives = 155/421 (36%), Gaps = 12/421 (2%) Query: 109 MVEEVARNASAVAQNTAAAKKSASDASTSAREAATHAADAADSARAASTSAGQAASSAQS 168 E VA + S A + A +K AS ++ A+H ++ A S +A A + AS +S Sbjct: 2577 RSESVASHVSEKAASDKAEEKPASKEASRPESVASHVSEKAASDKAEEKPASKEASRPES 2636 Query: 169 ASSSAGTASTKATEAS-KSAAAAESSKSAAATSAGAAKTSETNASASLQSAATSASTATT 227 +S + A+ + + SAA+ ++ A + + +A A+ +A + Sbjct: 2637 VASHVSEKAASEKSATLEKPEESSRPASAASQASEKAPSEKPDAKE-----ASRPDSAAS 2691 Query: 228 KASEAATSARDAAASKEAAKSSETNASSSASSAASSATAAGNSAKAAKTSETNARSSETA 287 SE A S + K K + AS+++ + +A+ + + E++ SS + Sbjct: 2692 HVSEKAASEKSMTLDKPEEKEASRPASAASHVSEKAASEKSATLEKPDDKESSRPSSALS 2751 Query: 288 AGQSASAAAGSKTAAASSASAASTSAGQASASATAAGKSAES----AASSASTATTKAGE 343 A+ A+ S S + + E+ A A E Sbjct: 2752 QADEKEASRPESAASHVSEKVTSEGKPEDKPVSRPESMVGETKPSDKAEDKPDGKQDAKE 2811 Query: 344 ATEQASAAARSASAAKTSETNAKASETSAESSKTAAASSASSAASSASSASASKDEATRQ 403 A+ S+A+ + A + +T K A S + AS S A+S S A K E T Sbjct: 2812 ASRPESSASHVSEKAASDKTEDKQDAKEA-SRPESVASHVSEKAASDKSPLADKAEETSA 2870 Query: 404 ASAAKSSATTAS-TKATEAAGSATAAAQSKSTAESAATRAETAAKRAEDIASAVALEDAS 462 + A + AS A+ +T + + ++ + + + K + A A+ Sbjct: 2871 SKEASRPESVASHVSEKAASEKSTQLEKPEESSRPVSAASHVSEKAVSEKADAMETSRPE 2930 Query: 463 TTKKGIVQLSSATNSTSETLAATPKAVKSAYDNAEKRLQKDQNGADIPDKGCFLNNINAV 522 + + + SA T + A +EK + + PD +A Sbjct: 2931 SVASHVSEKESAEVQIKTEDKETSRPASVASHVSEKAASEKSATLEKPDDKEPSRPESAA 2990 Query: 523 S 523 S Sbjct: 2991 S 2991 >UniRef50_B0MLL9 Putative uncharacterized protein n=1 Tax=Eubacterium siraeum DSM 15702 RepID=B0MLL9_9FIRM Length = 1114 Score = 88.4 bits (217), Expect = 2e-15, Method: Composition-based stats. 
Identities = 68/402 (16%), Positives = 132/402 (32%), Gaps = 23/402 (5%) Query: 78 YEDSQPGTLNDFLGAMTEDDARPEALRRFELMVEEVARNASAVAQNTAAAKKSASDASTS 137 E ND + EA++ FE + E+ A ++++ + Sbjct: 361 IETEIDDEQNDTATEEAVSEPVEEAVKAFEEIAEQ-----EETADIETESEQNDTATEEP 415 Query: 138 AREAATHAADAADSARAASTSAGQAASSAQSASSSAGTASTKATEASKSAAAAESSKSAA 197 E A A + +A + +A + A+E + A A + Sbjct: 416 VSEPVEDAVKAFEEIAEQEETADSKTEIDSEQNDTAVEEA--ASEPVEDAVKALEEIAEQ 473 Query: 198 ATSAGAAKTSETNASASLQSAATSASTATTKASEAATSARDAAASKEAAKSSETNASSSA 257 +A + SE N A+ ++ + A E A A S+ +K ++T A Sbjct: 474 EETADSEAESEQNDIAAEEAVSEPVEDAVKAFEEIAEQEETA-DSETESKQNDTAAEEPI 532 Query: 258 SSAASSATAAGNSAKAAKTSETNARSSETAAGQSASAAAGSKTAAASSASAASTSAGQAS 317 S A K E A ETA ++ + + TA AAS A Sbjct: 533 SE---------PVEDAVKAFEEIAEQEETADSETEIDSEQNDTAT---EEAASEPVEDAV 580 Query: 318 ASATAAGKSAESAASSASTATTKAGEATEQASAAARSASAAKTSETNAKASETSAESSKT 377 + + E+A S A + ATE+A + + E + +E+ Sbjct: 581 KAFEEIAEQEETADSEAVIDDEQNDTATEEAVSEPVEDAVKAFEEIAEQEETADSETEID 640 Query: 378 AAASSASSAASSASSASASKDEATRQASAAKSSATTASTKATEAAGSATAAAQSKSTAES 437 + + ++ + S +A + + + +A + K+ E A ++ S S Sbjct: 641 S-EQNDTATEEAVSEPVEDAVKAFEEIAEQEETADSEE-KSLEEQFHAIMEEETASEPVS 698 Query: 438 AATRAETAAKRAEDIASAVALEDASTTKKGIVQLSSATNSTS 479 + AE+ ++ + L + G + S + Sbjct: 699 SIRTAESVEDALKEFGDFIGL-SETADSTGSIGSSEDISDDD 739 Score = 86.9 bits (213), Expect = 4e-15, Method: Composition-based stats. 
Identities = 79/447 (17%), Positives = 151/447 (33%), Gaps = 23/447 (5%) Query: 79 EDSQPGTLNDFLGAMTEDDARPEALRRFELMVEEVARNASAVAQNTAAAKKSASDASTSA 138 E ND + +A++ FE E+A T + A+ Sbjct: 244 ETEIDSEQNDIATEEAVSEPVEDAVKAFE----EIAEQKETADIETEIDSEQNETAT--- 296 Query: 139 REAATHAADAADSARAASTSAGQAASSAQSASSSAGTASTKATEASKSAAAAESSKSAAA 198 EA + + A A + A S + + +E + A A + Sbjct: 297 EEAVSEPVEDAVKAFEEIAQQDETADSEAESEQNETATEEAVSEPVEDAVKAFEEIAEQE 356 Query: 199 TSAGAAKT--SETNASASLQSAATSASTATTKASEAATSARDAAASKEAAKSSETNASSS 256 +A E N +A+ ++ + A E A A E+ ++ Sbjct: 357 ETADIETEIDDEQNDTATEEAVSEPVEEAVKAFEEIAEQEETADIETESEQNDTATEEPV 416 Query: 257 ASSAASSATAAGNSAKAAKTSETNARSSETAAGQSASAAAGSKTAAASSASAASTSAGQA 316 + A + A+ ET +E + Q+ +A + + A A + Sbjct: 417 SEPVED---AVKAFEEIAEQEETADSKTEIDSEQNDTAVEEAASEPVEDAVKALEEIAEQ 473 Query: 317 SASATAAGKSAESAASSASTATTKAGEATEQASAAARSASAAKTSETNAKASETSAESSK 376 +A + +S ++ ++ + +A + A A SET +K ++T+AE Sbjct: 474 EETADSEAESEQNDIAAEEAVSEPVEDAVKAFEEIAEQEETA-DSETESKQNDTAAEEPI 532 Query: 377 TAAASSASSAASSASSASASKDEATRQASAAKSSATT--ASTKATEAAGSATAAAQSKST 434 + A A + + D T S +AT AS +A + A+ + T Sbjct: 533 SEPVEDAVKAFEEIAEQEETADSETEIDSEQNDTATEEAASEPVEDAVKAFEEIAEQEET 592 Query: 435 AESAATRAETAAKRAEDIASAVALEDA-----STTKKGIVQLSSATNSTSETLAATPKAV 489 A+S A + A + A + +EDA ++ S + + AT +AV Sbjct: 593 ADSEAVIDDEQNDTATEEAVSEPVEDAVKAFEEIAEQEETADSETEIDSEQNDTATEEAV 652 Query: 490 KSAYDNAEKR---LQKDQNGADIPDKG 513 ++A K + + + AD +K Sbjct: 653 SEPVEDAVKAFEEIAEQEETADSEEKS 679 Score = 77.3 bits (188), Expect = 4e-12, Method: Composition-based stats. 
Identities = 62/364 (17%), Positives = 113/364 (31%), Gaps = 17/364 (4%) Query: 107 ELMVEEVARNASAVAQNTAAAKKSASDASTSAREAATHAADAADSARAASTSAGQAASSA 166 E V E +A + A + + S + ++ + ++A A +A Sbjct: 413 EEPVSEPVEDAVKAFEEIAE--QEETADSKTEIDSEQNDTAVEEAASEPVEDAVKALEEI 470 Query: 167 QSASSSAGTASTKATEASKSAAAAESSKSAAATSAGAAKTSETNASASLQSAATSASTAT 226 +A + + + A A + E A S + + + A Sbjct: 471 AEQEETADSEAESEQNDIAAEEAVSEPVEDAVKAFEEIAEQEETA-DSETESKQNDTAAE 529 Query: 227 TKASEAATSARDA---AASKEAAKSSETNASSSASSAASSATAAGNSAKAAKTSETNARS 283 SE A A A +E SET S + A+ A+ A K E A Sbjct: 530 EPISEPVEDAVKAFEEIAEQEETADSETEIDSEQNDTATEEAASEPVEDAVKAFEEIAEQ 589 Query: 284 SETAAGQSASAAAGSKTAAASSASAASTSAGQASASATAAGKSAESAASSASTATTKAGE 343 ETA ++ + TA A S A + + E+A S + + Sbjct: 590 EETADSEAVIDDEQNDTAT---EEAVSEPVEDAVKAFEEIAEQEETADSETEIDSEQNDT 646 Query: 344 ATEQASAAARSASAAKTSETNAKASETSAESSKTAAASSASSAASSASSASASKDEATRQ 403 ATE+A + + E + +E A +AS +S A Sbjct: 647 ATEEAVSEPVEDAVKAFEEIAEQEETADSEEKSLEEQFHAIMEEETASEPVSSIRTAESV 706 Query: 404 ASAAKSSAT----TASTKATEAAGSATAAAQSKSTAESA----ATRAETAAKRAEDIASA 455 A K + + +T + GS+ + S +T ++ ++ ++ A Sbjct: 707 EDALKEFGDFIGLSETADSTGSIGSSEDISDDDLEQTSGIFEESTLSDELSETPDEPIDA 766 Query: 456 VALE 459 Sbjct: 767 DEKP 770 Score = 65.3 bits (157), Expect = 1e-08, Method: Composition-based stats. 
Identities = 72/419 (17%), Positives = 141/419 (33%), Gaps = 16/419 (3%) Query: 95 EDDARPEAL-RRFELMVEEVARNASAVAQNTAAAKKSASDASTSAREAATHAADAADSAR 153 E + PE F E + + + + E + A+ Sbjct: 87 ETETEPEISPVNFTFTQSEDDDDTGVSFERVFDHFNRLESSVGNEGEDVGESGKKAEDTA 146 Query: 154 AASTSAGQAASSAQSASSSAGTASTKATE----ASKSAAAAESSKSAAATSAGAAKTSET 209 + SA A + ++ AG E AS A A + + + E Sbjct: 147 EEALSAVTATAVEENFDEQAGMPKQMLLENGIMASAPIANAVQELAEMPSVIQDNEEIED 206 Query: 210 NASASLQSAATSASTATTKASEAATSARDAAASKEAAKSSETNASSSASSAASSATAAGN 269 +S ++ E A A + A +E SET S + A+ + Sbjct: 207 VRLPVDESIFD---LYASEPVEDAVKAFEEIAEQEETADSETEIDSEQNDIATEEAVSEP 263 Query: 270 SAKAAKTSETNARSSETAAGQSASAAAGSKTAAASSASAASTSAGQA----SASATAAGK 325 A K E A ETA ++ + ++TA + S A +A + A Sbjct: 264 VEDAVKAFEEIAEQKETADIETEIDSEQNETATEEAVSEPVEDAVKAFEEIAQQDETADS 323 Query: 326 SAESAASSASTATTKAGEATEQASAAARSASAAKTSETNAKASETSAESSKTAAASSASS 385 AES + +T + + A A +T++ + + +++ A S Sbjct: 324 EAESEQNETATEEAVSEPVEDAVKAFEEIAEQEETADIETEIDDEQNDTATEEAVSEPVE 383 Query: 386 AASSASSASASKDEATRQASAAKSSATTASTKATEAAGSATAA-AQSKSTAESAATRAET 444 A A A ++E + ++ + T +E A A + E+A ++ E Sbjct: 384 EAVKAFEEIAEQEETADIETESEQNDTATEEPVSEPVEDAVKAFEEIAEQEETADSKTEI 443 Query: 445 AAKR---AEDIASAVALEDASTTKKGIVQLSSATNSTSETLAATPKAVKSAYDNAEKRL 500 +++ A + A++ +EDA + I + +S +E+ A ++ + E + Sbjct: 444 DSEQNDTAVEEAASEPVEDAVKALEEIAEQEETADSEAESEQNDIAAEEAVSEPVEDAV 502 Score = 60.7 bits (145), Expect = 3e-07, Method: Composition-based stats. 
Identities = 48/286 (16%), Positives = 93/286 (32%), Gaps = 14/286 (4%) Query: 101 EALRRFELMVEEVARNASAVAQNTAAAKKSASDASTSAREAATHAADAADSARAASTSAG 160 +A++ FE + E+ A + +K++ + A E A A + +A Sbjct: 500 DAVKAFEEIAEQ-----EETADSETESKQNDTAAEEPISEPVEDAVKAFEEIAEQEETAD 554 Query: 161 QAASSAQSASSSAGTASTKATEASKSAAAAESSKSAAATSAGAAKTSETNASASLQSAAT 220 + +A + E + A A +T+++ A + T Sbjct: 555 SETEIDSEQNDTATEEAAS--------EPVEDAVKAFEEIAEQEETADSEAVIDDEQNDT 606 Query: 221 SASTATTKASEAATSARDAAASKEAAKSSETNASSSASSAASSATAAGNSAKAAKTSETN 280 + A ++ E A A + A +E SET S + A+ + A K E Sbjct: 607 ATEEAVSEPVEDAVKAFEEIAEQEETADSETEIDSEQNDTATEEAVSEPVEDAVKAFEEI 666 Query: 281 ARSSETAAGQSASAAAGS-KTAAASSASAASTSAGQASASATAAGKSAESAASSASTATT 339 A ETA + S +AS +S A + A + + S + +T Sbjct: 667 AEQEETADSEEKSLEEQFHAIMEEETASEPVSSIRTAESVEDALKEFGDFIGLSETADST 726 Query: 340 KAGEATEQASAAARSASAAKTSETNAKASETSAESSKTAAASSASS 385 + ++E S ++ E+ + A + Sbjct: 727 GSIGSSEDISDDDLEQTSGIFEESTLSDELSETPDEPIDADEKPRT 772 >UniRef50_C4TT85 Gp19 n=1 Tax=Yersinia kristensenii ATCC 33638 RepID=C4TT85_YERKR Length = 732 Score = 87.7 bits (215), Expect = 3e-15, Method: Composition-based stats. 
Identities = 90/374 (24%), Positives = 147/374 (39%), Gaps = 24/374 (6%) Query: 383 ASSAASSASSASASKDEATRQASAAKSSATTASTKATEAAGSATAAAQSKSTAESAATRA 442 AS A+ + + T A A + G ++ + S +E+ A Sbjct: 263 ASIDAAGNITDLRPQGSLTDDALAKHEKSRNHPDGTLAEKGFVKLSSATDSNSETLAATP 322 Query: 443 ETAAKRAEDIASAVAL-------EDASTTKKGIVQLSSATNSTSETLAATPKAVKSAYDN 495 + + ++++ L D + KG VQL+SAT+STSETLAATPKAVK A DN Sbjct: 323 KAVKAVMDSASNSLDLHEKSRDHPDGTLLYKGFVQLASATDSTSETLAATPKAVKIAMDN 382 Query: 496 AEKRLQKDQNGADIPDKGCFLNNINAVSKTDFADKRGMRYVRVNAPAGATSGKYYPVVVM 555 A RL KD+NGADIP+ F N+ + V + + + Sbjct: 383 ANARLAKDRNGADIPNVPLFRQNLALKGAALVDIGKTAGTVAAGDDSRIVNA------LP 436 Query: 556 RSAGS-VSELASRVIITTATRTAGDPMN----NCEFNGFVMPGGWTDRGRYAYGMFWQYQ 610 ++ G+ LA I+ G N NG + G T G +A + + + Sbjct: 437 KTGGTVTGWLAVTGILDGPIGPGGYKSNILVGTAGGNGAITNAGGTGFGLHASNVIYFWN 496 Query: 611 NNERAIHSIM-MSNKGDDLRSVFYVDGAAFPVFAFIEDGLSISAPGADLVVNDTTYKFGA 669 +N SI + S+ GAA + + E + +D + K+ Sbjct: 497 DNSGYAMSISPTILSVNRPISILSGAGAALSIKSQNEGDVCYIMSVSDTA---SKSKWYI 553 Query: 670 TNPATECIAADVILDFKSGRGFYESHSLIVNDNLSCKKLFATDEIVARGGNQIRMIGGEY 729 N + D++ + K+G + ++ S KL + +++A GG G+ Sbjct: 554 GNTQENNTSFDLV-NSKAGVALKINDTISTTAPFSASKLTSAGDVIAGGGATTYQANGDI 612 Query: 730 -GALWRNDGAKTYL 742 GA W N TY+ Sbjct: 613 KGAAWANGLLSTYI 626 >UniRef50_Q3A2X7 Ribonuclease E n=3 Tax=Bacteria RepID=Q3A2X7_PELCD Length = 926 Score = 87.7 bits (215), Expect = 3e-15, Method: Composition-based stats. 
Identities = 67/388 (17%), Positives = 123/388 (31%), Gaps = 13/388 (3%) Query: 130 SASDASTSAREAATHAADAADSARAASTSAGQAASSAQSASSSAGTASTKATEASKSAAA 189 S+ ST E+ A A + A + + + +A ST+ATE ++ A Sbjct: 546 PVSEPSTLTTESEEQQPAATQDAAESQPMATDTEVAREVSEKTAEPVSTEATEPAEKTAE 605 Query: 190 AESSKSAAATSAGAAKTSETNASASLQSAATSAS---TATTKASEAATSARDAAASKEAA 246 S+ ++ + + A +A A A EA Sbjct: 606 PASAAEEQPQKPSRPRSRSGRRKPATKKTEAKPEAKPEAKPEAKPEAKPEAKPEAKPEAK 665 Query: 247 KSSETNASSSASSAA--SSATAAGNSAK-AAKTSETNARSSETAAGQSASAAAGSKTAAA 303 ++ A A A + A AK AK E A +K A Sbjct: 666 PEAKPEAKPEAKPEAKPEAKPEAKPEAKPEAKPEAKPEAKPEAKPEAKPEAKPEAKPEAK 725 Query: 304 SSASAASTSAGQASASATAAGKSAESAASSASTATTKAGEATEQASAAARSASAAKTSET 363 A + + A A ++ A A E+ Q+ A + KT+ Sbjct: 726 PEAKPEAKPEAKPEAKPEAKPEAKPEAKPEAKPKAKPKVESEAQSEAMPEAKP--KTTRR 783 Query: 364 NAKASETSAESSKTAAASSASSAASSASSASASKDEATRQASAAKSSATTASTKATEAAG 423 S T ++ A S+ S+ + + S ++ AK + +TK T Sbjct: 784 APSRSRTKTKTKTETAESNTSATGTETDTPSEKDEKPAETKPPAKRARARRTTKPTAKTA 843 Query: 424 SATAAAQSKSTAESAATRAETAAKRAEDIASAVALEDASTTKKGIVQLSSATNSTSETLA 483 + + S + + A ETAA + A + + +S+ ++++ A Sbjct: 844 TQASDDVSPAEKDETAKATETAAPKRRTRARKTTAKTVKSGDD-----ASSVSASNAEEA 898 Query: 484 ATPKAVKSAYDNAEKRLQKDQNGADIPD 511 K + A K +++ + D Sbjct: 899 EKKKPTRRPRKPAAKAVEESKKAETSED 926 >UniRef50_B7NJP1 Putative side tail fiber protein homolog from lambdoid prophage n=3 Tax=Escherichia coli RepID=B7NJP1_ECO7I Length = 686 Score = 87.3 bits (214), Expect = 3e-15, Method: Composition-based stats. 
Identities = 90/427 (21%), Positives = 150/427 (35%), Gaps = 12/427 (2%) Query: 90 LGAMTEDDARPEALRRFELMVEEVARNASAVAQNTAAAKKSASDASTSAREAATHAADAA 149 +G + E + E + + E + + EAA A+ Sbjct: 1 MGNLNETEKWEENIYQLETSDPVLGGADGISNRAPRQLANRTKWLKKKTEEAAQSLAEHV 60 Query: 150 DSARAASTSAGQAASSAQSASSSAGTASTKATEASKSAAAAESSKSAAATSAGAAKTSET 209 S + + S+++++ + + AT AA + + A S + T Sbjct: 61 RSRNHPDATLTAKGFTQLSSATNSTSETLAAT-PKAVKAAYDLAAGKAPASHTHPWSQIT 119 Query: 210 NASASLQSAATSASTATTKASEAATSARDAAASKEAAKSSETNASSSASSAASSATAAGN 269 A+ +A + ++ S++ T A A K A + ++ ++ + Sbjct: 120 GVPAASLTAKGTVQLSSATDSQSETEAATPKAVKIAYDLARGKYTAQDATTTRKGIVQLS 179 Query: 270 SAKAAKTSETNARSSETAAGQSASAAAGSKTAAASSASAASTSAGQASASATAAGKSAES 329 SA + A A +A + + A +A T SA Sbjct: 180 SAINNTSETLAATPKAVKAAYDLAAGKAPASHTHPWSQITGVPAASLTAKGTVQLSSATD 239 Query: 330 AASSASTATTKA---------GEATEQASAAARSASAAKTSETNAKASETSAESSKTAAA 380 + S AT KA G+ T Q + R +S N + +A AA Sbjct: 240 SQSETEAATPKAVKIAYDLARGKYTAQDATTTRKGIVQLSSAINNTSETLAATPKAVKAA 299 Query: 381 SSASSAASSASSASASKDEATRQASAAKSSATTASTKATEAAGSATAAAQSKSTAESAAT 440 ++ + AS A++ + T + AT+ + S T AA K+ + Sbjct: 300 YDLAAGKAPASHTHPWSQITGVPAASLTAKGTVQLSSATD-SQSETEAATPKAVKAAYDL 358 Query: 441 RAETAAKRAEDI-ASAVALEDASTTKKGIVQLSSATNSTSETLAATPKAVKSAYDNAEKR 499 A A + + AS T KG VQLSSAT+S SET AATPKAVK+AYD A + Sbjct: 359 AAGKAPVSHTHPWSQITGVPAASLTAKGTVQLSSATDSQSETEAATPKAVKAAYDLAAGK 418 Query: 500 LQKDQNG 506 Sbjct: 419 APVSHTH 425 Score = 83.0 bits (203), Expect = 6e-14, Method: Composition-based stats. 
Identities = 88/395 (22%), Positives = 149/395 (37%), Gaps = 9/395 (2%) Query: 110 VEEVARNASAVAQNTAAAKKSASDASTSAREAATHAADAADSARAASTSAGQAASSAQSA 169 + + A +A A A + + + AA+ A +A+ S + ++ A Sbjct: 92 TPKAVKAAYDLAAGKAPASHTHPWSQITGVPAASLTAKGTVQLSSATDSQSETEAATPKA 151 Query: 170 SSSAGTASTKATEASKSAAAAESSKSAAATSAGAAKTSETNASASLQSAATSASTATTKA 229 A + A + + ++ ++T A + +A A Sbjct: 152 VKIAYDLARGKYTAQDATTTRKGIVQLSSAINNTSETLAATPKAVKAAYDLAAGKAPASH 211 Query: 230 SEAATSARDAAASKEAAKSSETNASSSASSAASSATAAGNSAKAAKTSETNARSSETAAG 289 + + A+ AK T SSA+ + S AA KA K + AR TA Sbjct: 212 THPWSQITGVPAASLTAKG--TVQLSSATDSQSETEAATP--KAVKIAYDLARGKYTAQD 267 Query: 290 QSASAAAGSKTAAASSASAASTSAGQASASATAAGKSAESAASSASTATTKAGEATEQAS 349 + + + + SA ++ A+ A A + + AS + A+ Sbjct: 268 ATTTRKGIVQLS---SAINNTSETLAATPKAVKAAYDLAAGKAPASHTHPWSQITGVPAA 324 Query: 350 AAARSASAAKTSETNAKASETSAESSKTAAASSASSAASSASSASASKDEATRQASAAKS 409 + + +S T++++ +A AA ++ + S A++ + Sbjct: 325 SLTAKGTVQLSSATDSQSETEAATPKAVKAAYDLAAGKAPVSHTHPWSQITGVPAASLTA 384 Query: 410 SATTASTKATEAAGSATAAAQSKSTAESAATRAETAAKRAEDI-ASAVALEDASTTKKGI 468 T + AT+ + S T AA K+ + A A + + AS T KG Sbjct: 385 KGTVQLSSATD-SQSETEAATPKAVKAAYDLAAGKAPVSHTHPWSQITGVPAASLTAKGT 443 Query: 469 VQLSSATNSTSETLAATPKAVKSAYDNAEKRLQKD 503 VQLSSA NSTSE LAATPKAVK+AYD A + D Sbjct: 444 VQLSSAINSTSEILAATPKAVKAAYDLANGKQPAD 478 Score = 70.7 bits (171), Expect = 3e-10, Method: Composition-based stats. 
Identities = 67/252 (26%), Positives = 95/252 (37%), Gaps = 13/252 (5%) Query: 296 AGSKTAAASSASAASTSAGQASASATAAG-KSAESAASSASTATTKAGEATEQASAAARS 354 AA S + S A+ TA G SA +S S +A + A A Sbjct: 46 KKKTEEAAQSLAEHVRSRNHPDATLTAKGFTQLSSATNSTSETLAATPKAVKAAYDLAAG 105 Query: 355 ASAAKTSETNAKASETSAESSKTAAASSASSAASSASSASASKDEATRQASAAKSSATTA 414 + A + ++ + A S SSA S S A+ +A + A TA Sbjct: 106 KAPASHTHPWSQITGVPAASLTAKGTVQLSSATDSQSETEAATPKAVKIAYDLARGKYTA 165 Query: 415 STKATEAAGSATAAAQSKSTAESAATRAETAAKRAEDIA------------SAVALEDAS 462 T G ++ +T+E+ A + + A + AS Sbjct: 166 QDATTTRKGIVQLSSAINNTSETLAATPKAVKAAYDLAAGKAPASHTHPWSQITGVPAAS 225 Query: 463 TTKKGIVQLSSATNSTSETLAATPKAVKSAYDNAEKRLQKDQNGADIPDKGCFLNNINAV 522 T KG VQLSSAT+S SET AATPKAVK AYD A + + IN Sbjct: 226 LTAKGTVQLSSATDSQSETEAATPKAVKIAYDLARGKYTAQDATTTRKGIVQLSSAINNT 285 Query: 523 SKTDFADKRGMR 534 S+T A + ++ Sbjct: 286 SETLAATPKAVK 297 Score = 61.9 bits (148), Expect = 1e-07, Method: Composition-based stats. Identities = 31/73 (42%), Positives = 43/73 (58%), Gaps = 5/73 (6%) Query: 905 PVGAPIPWPSDTVPSGYALMQGQAFDKSAYPKLAAAYPSGVIPDMRGWTIKGKP-----A 959 PVG P+PWP+ T P G+ G+AF K YP LA AYP+ +PD+RG I+G Sbjct: 537 PVGVPVPWPTATPPEGWLKCDGRAFTKEQYPVLARAYPTLRLPDLRGEFIRGWDDGRKID 596 Query: 960 SGRAVLSQEQDGI 972 GR +LS ++ + Sbjct: 597 EGRKLLSWQKGTL 609 >UniRef50_A6C5K2 Putative uncharacterized protein n=1 Tax=Planctomyces maris DSM 8797 RepID=A6C5K2_9PLAN Length = 1354 Score = 87.3 bits (214), Expect = 3e-15, Method: Composition-based stats. 
Identities = 74/443 (16%), Positives = 152/443 (34%), Gaps = 14/443 (3%) Query: 86 LNDFLGAMTE--DDARPEALRRFELMVEEVARNASAVAQNTAAAKKSASDASTSAREAAT 143 L+ L E D AR +++ + +V + ++ Q A ++ A A+ T Sbjct: 766 LDQLLERRPELVDAARNYQMKQLDSLVRQTSQLVEPQTQLAEAIQEQAPVAAGRPAAPQT 825 Query: 144 HAADA-ADSARAASTSAGQAASSAQSASSSAGTASTKATEASK-SAAAAESSKSAAATSA 201 A AD A A ++ A++ AG AS A E+S+ +A K A A A Sbjct: 826 AEPPAEADQPATAPGENQAADNNTPPAATPAGQASPAAAESSRNAATTVADGKPATAAQA 885 Query: 202 GAAKTSETNASASLQSAATSASTATTKASEA-ATSARDAAASKEAAKSSETNASSSASSA 260 +K S A A + A + + + A +A + ++ + A+ Sbjct: 886 AESKMSNEPARARAEPAKEESVRDLQRQQQMLAEAATNFVLETAQQFGPDSEPTRRATQL 945 Query: 261 ASSATAAGNSAKAAKTSETNARSSETAAGQSASAAAGSKTAAASSASAASTSAGQASASA 320 A + A +A A + E + ++ + A + ++ A + A+ Sbjct: 946 AEESIKAQQAADAGRLHEASQAANRASKKADEIMQAHQEQNKENTERDAFLEQAERMANL 1005 Query: 321 TAAGKSAESAASSASTATTKAGEATEQASAAARSASAAKTSETNAK-ASETSAESSKTAA 379 + AS++ + +A + T+QA A A + + SET+ K +E ++ Sbjct: 1006 QQSQAEKMEQASTSESKRQEALQNTQQALARQTQALSKELSETSRKLETEPIGLKKESQQ 1065 Query: 380 ASSASSAASSASSASASKDEATRQASAAKSSATTASTKATEAAGSATAAAQSKSTAESAA 439 A +A A E + + A+++ + A SK A+ + Sbjct: 1066 ADRTRKKTETAGQAMEKAVEGLQDENLAQAAEQAQQAAQALQEAARQAQQASKQKAKESP 1125 Query: 440 TRAETAAKRAEDIASAVALED--------ASTTKKGIVQLSSATNSTSETLAATPKAVKS 491 + + + + + + + + Q + ++ + Sbjct: 1126 VPEKVGNQVTDAAQQLREAQKQLEQSPEFKNQSDQTMAQQQQSPPGDAKENSQQSSPADQ 1185 Query: 492 AYDNAEKRLQKDQNGADIPDKGC 514 E K Q ++ DK Sbjct: 1186 KEKATESGDAKAQANSEAGDKSQ 1208 >UniRef50_C4TTI1 Tail fiber protein n=2 Tax=Yersinia RepID=C4TTI1_YERKR Length = 402 Score = 85.7 bits (210), Expect = 8e-15, Method: Composition-based stats. 
Identities = 57/208 (27%), Positives = 89/208 (42%), Gaps = 31/208 (14%) Query: 795 VSGVEFDGSKDITLTAAHVAAFAR---------RATDTYADADGGVPWNAE---SGAYNV 842 ++G FDG+ +I++ AA V A T +A+ G + + S + + Sbjct: 154 IAGKSFDGTANISIGAADVGALPLLGGTLSGPLEITGVHAEPLGPNGYKSNIKTSASGAI 213 Query: 843 TRSGDSYI-------LVNFYTGVGSCRTLQMKAHYRNGGLFYRSSRDGYGFEEDWAEVYT 895 T +G + I + + G +L + N + S G ++ E Y Sbjct: 214 TNAGGTGIGLNADKGIYFWNDNTGYAMSLSLTKLSVNRAITALDSTITPGDYSNFDERYE 273 Query: 896 SKNLPPESYPVGAPIPWPSDTVPSGYALMQGQAFDKSAYPKLAAAYPSGVIPDMRGWTIK 955 +G PIPWP P+GY G F+K+ YPKLA AYPSGV+PD+RG I+ Sbjct: 274 PAL-------IGTPIPWPLTIAPAGYLKCNGAPFNKTQYPKLALAYPSGVLPDLRGEFIR 326 Query: 956 GKPAS-----GRAVLSQEQDGIKSHTHS 978 G + +L + I+SH H Sbjct: 327 GFDDGRGVRPNQPLLGWQGSEIQSHNHG 354 >UniRef50_C5BH14 Tail fiber n=2 Tax=Enterobacteriaceae RepID=C5BH14_EDWI9 Length = 593 Score = 85.7 bits (210), Expect = 1e-14, Method: Composition-based stats. Identities = 43/153 (28%), Positives = 72/153 (47%), Gaps = 11/153 (7%) Query: 856 TGVGSCRTLQMKAHYRNGGL------FYRSSRDGYGFEEDWAEVYTSKNLPPESYPVGAP 909 T + L+ N + R+S + + Y + ++ + PVG P Sbjct: 396 TNAPTSSALKRTYDRANSAYDRANSAYDRASSAYSYAGSIYDKAYDAYDIARRAPPVGTP 455 Query: 910 IPWPSDTVPSGYALMQGQAFDKSAYPKLAAAYPSGVIPDMRGWTIKGKPASG-----RAV 964 PWP+ ++PSG+ GQ+F S+YP+LA AYP+G +PD+RG I+G G R + Sbjct: 456 QPWPNTSIPSGWIKCAGQSFSTSSYPELAKAYPNGRLPDLRGEFIRGYDDYGGTDSQRQI 515 Query: 965 LSQEQDGIKSHTHSASASSTDLGTKTTSSFDYG 997 LS + D +++ T + + T +YG Sbjct: 516 LSWQGDAMRNITGTFGVDDQTIEQVTGVFREYG 548 >UniRef50_B2SVF7 Phage-related protein n=3 Tax=Xanthomonas oryzae pv. oryzae RepID=B2SVF7_XANOP Length = 501 Score = 85.0 bits (208), Expect = 2e-14, Method: Composition-based stats. 
Identities = 71/265 (26%), Positives = 107/265 (40%), Gaps = 47/265 (17%) Query: 885 GFEEDW-AEVYTSKNLPPESYPVGAPIPWPSDTVPSGYALMQGQAFDKSAYPKLAAA--- 940 G + DW + N+P G + S P+G + G ++ Y L AA Sbjct: 228 GRQGDWYRDFGNMLNVPQSFLLPGQIVVMASLYPPNGLLVCDGAEISRAKYAALFAAIGT 287 Query: 941 -YPSG------VIPDMRGWTIKGKPASGRAVLSQEQDGIKSHTHSASASSTDLGTKTTSS 993 Y +G +P ++ T+ ++ AV S + + SHTH ASA++ T+ Sbjct: 288 VYGAGDGSTTFNVPKIKEGTVITHTSAATAVGSYDPGQVISHTHGASAAAVGDHAHYTAI 347 Query: 994 FDYGTKS----------------TNNTGAHTHSVSGSTNSAGAHTHSLANVNTASANSGA 1037 G + T+ G H H GST+++G H H V +++ +G Sbjct: 348 NAAGNHAHGASAGAAGDHAHYAWTDAQGHHAH--GGSTSASGDHQH--PGVIPSASINGY 403 Query: 1038 GSASTRLSVVHNQNYATSSAGAHTHSLSGTAASA----------GAHAHTVGI---GAHT 1084 G R + + T + G H HS A + G H H +GI G H Sbjct: 404 GVYRERDNDAAPSDGWTGAGGNHAHSFGTDGAGSHGHNISMNGVGNHTHGIGIAEGGNHV 463 Query: 1085 HSVA---IGSHGHTITVNAAGNAEN 1106 H V G+H HTITVNAAG +N Sbjct: 464 HDVDHRGAGAHAHTITVNAAGGIDN 488 >UniRef50_O94854 Uncharacterized protein KIAA0754 n=6 Tax=Euarchontoglires RepID=K0754_HUMAN Length = 1291 Score = 84.6 bits (207), Expect = 2e-14, Method: Composition-based stats. 
Identities = 89/409 (21%), Positives = 154/409 (37%), Gaps = 14/409 (3%) Query: 93 MTEDDARPEALRRFELMVEEVARNASAVAQNTAAAKKSASDASTSAREAATHAADAADSA 152 +TE+D PE V +A + + + ++ A+ E T A A + Sbjct: 828 ITEEDGTPEGPVTPATTVHAPEEPDTAAVRVSTPEEPASPAAAVPTPEEPTSPAAAVPTP 887 Query: 153 RAASTSAGQAASSAQSASSSAGTASTKATEASKSAAAAESSKSAAATSAGAAKTSETNAS 212 TS A + +S A T S +AA + + +A T+ + Sbjct: 888 EEP-TSPAAAVPPPEEPTSPAAAVPTPEEPTSPAAAVPTPEEPTSPAAAVPTPEEPTSPA 946 Query: 213 ASLQSAATSASTATTKAS-EAATSARDAAASKEAAKSSETNASSSASSAASSATAAGNSA 271 A++ + S A + E TS A + E S + A+ +A Sbjct: 947 AAVPTPEEPTSPAAAVPTPEEPTSPAAAVPTPEEPASPAAAVPTPEEPASPAAAVPTPEE 1006 Query: 272 KAAKTSETNARSSETAAGQSASAAAGSKTAAASSASAASTSAGQASASATAAGKSAESAA 331 A + E +A SA+ A + +AS A+A T A AS +A A E + Sbjct: 1007 PA--FPAPAVPTPEESA--SAAVAVPTPEESASPAAAVPTPAESASFAAVVATLE-EPTS 1061 Query: 332 SSASTATTKAGEAT-EQASAAARSASAAKTSETNAKASETSAESSKTAAASSASSAASSA 390 +AS T A AT E+ ++ A S ++ + A A E + AAA +S+ Sbjct: 1062 PAASVPTPAAMVATLEEFTSPAASVPTSEEPASLAAAVSNPEEPTSPAAAVPTLEEPTSS 1121 Query: 391 SSASASKDEATRQASAAKSSATTASTKA-----TEAAGSATAAAQSKSTAESAATRAETA 445 ++A + +E + A++ + AS A E A A A + A AA+ Sbjct: 1122 AAAVLTPEELSSPAASVPTPEEPASPAAAVSNLEEPASPAAAVPTPEVAAIPAASVPTP- 1180 Query: 446 AKRAEDIASAVALEDASTTKKGIVQLSSATNSTSETLAATPKAVKSAYD 494 A A+ +E+ S + +S+ T+S + TP +++ Sbjct: 1181 EVPAIPAAAVPPMEEVSPIGVPFLGVSAHTDSVPISEEGTPVLEEASST 1229 >UniRef50_A6VBH2 Putative tail fiber protein n=2 Tax=Pseudomonas aeruginosa PA7 RepID=A6VBH2_PSEA7 Length = 654 Score = 84.2 bits (206), Expect = 2e-14, Method: Composition-based stats. 
Identities = 88/349 (25%), Positives = 138/349 (39%), Gaps = 56/349 (16%) Query: 762 IDNATGELVIGTKLSASLNGNALTATKLQTPRRVSGV----EFDGSKDITLTAAH---VA 814 I NA + + A A +L+ + + + +D+ A V Sbjct: 311 IPNAISDDPALDRGDVLATTKATRAVQLRAAQDLDDLAKSLGTAARRDVGTDAGDLLEVG 370 Query: 815 AFARRATDTYADA-----DGGVPWNAESGAYNVTRSGDSY-ILVNFYTGVGSCRTLQMKA 868 AF D+ A + V + Y G SY ++ F R Q+ Sbjct: 371 AFGWGTNDSPVAASVNIYESSVTKFTPATEYVPEIFGMSYGVVATFAYSERETRASQLFF 430 Query: 869 HYR-NGGLFYRSSRDGYGFEEDWAEVYTSKNL-PPESYPVGAPIPWPSDTVPSGYALMQG 926 L +RS + + E++ S NL P P GA + + + P+GY G Sbjct: 431 GQSPENKLMFRSGNYTW---APFLEIWHSGNLNPQAIVPAGAVVAFAMYSPPAGYLKANG 487 Query: 927 QAFDKSAYPKLAAA----YPSG------VIPDMRGWTIKGKPAS-----GRAVLSQEQDG 971 A ++AY L A Y +G +PD RG ++ GR + + + Sbjct: 488 AAVSRTAYAALFATIGTYYGAGDGSTTFNLPDYRGEFLRALDDGRGLDLGRQLGTLQSSQ 547 Query: 972 IKSHTHSASASSTDLGTKTTSSFDYGTKSTNNTGAHTHSVSGSTNSAGAHTHSLANVNTA 1031 +HTH AS+S G HTH+V+G+ +AGAH+HS+A+VN Sbjct: 548 NLAHTHGASSSG--------------------NGGHTHTVTGTAAAAGAHSHSIASVNAT 587 Query: 1032 SANSGAGSASTRLSVVHNQNYATSSAGAHTHSLSGTAASAGAHAHTVGI 1080 + SG A+ V + N T AG HTH+++G AA G H HT+ + Sbjct: 588 ALVSGTRLAT---LVGNASNSTTDVAGDHTHAVTGVAALEGTHNHTIYV 633 >UniRef50_Q7N2Q1 Similar to Sc/SvQ protein of Escherichia coli plasmid p15B n=1 Tax=Photorhabdus luminescens subsp. laumondii RepID=Q7N2Q1_PHOLL Length = 478 Score = 84.2 bits (206), Expect = 3e-14, Method: Composition-based stats. 
Identities = 64/236 (27%), Positives = 97/236 (41%), Gaps = 37/236 (15%) Query: 761 AIDNATGELVIGTKLSASLNGNA------------LTATKLQTPRRVSGVEFDGSKDITL 808 A DNA L + N + L + + R+++G G D++L Sbjct: 194 ATDNANSRLAKNQNGADIPNKSEFIKNLGLTETVELAKSAVPNSRKINGKALSG--DVSL 251 Query: 809 TAAHVAAFARRATDTYADADGGV--PWNAESGAYNVTRSGDSYILVNFYTGVGSC-RTLQ 865 A V A +T + + N + + +F G+ L Sbjct: 252 NAGDVGALPISSTLSAQTGTLRINNGSNWPNIEFRAANK-------HFIGIEGTAGNRLT 304 Query: 866 MKAHYRNGGLFYRSSRDGYGFEEDWAEVYTSKNLPPESYPVGAPIPWPSDTVPSGYALMQ 925 + A+ N Y + + T L + VG+PIPWP VP+GY Sbjct: 305 IYANDENSNRKYTLATP--------EKSGTLATLDDINISVGSPIPWPLPNVPAGYLACN 356 Query: 926 GQAFDKSAYPKLAAAYPSGVIPDMRGWTIKGKPAS-----GRAVLSQEQDGIKSHT 976 GQ+F+KS YP+LA AYPSGV+PD+RG I+G GR VL+ + D I++ T Sbjct: 357 GQSFNKSLYPQLAIAYPSGVLPDLRGEFIRGWDDGRGVDRGRGVLTHQGDAIRNIT 412 Score = 72.6 bits (176), Expect = 8e-11, Method: Composition-based stats. Identities = 67/197 (34%), Positives = 89/197 (45%), Gaps = 30/197 (15%) Query: 408 KSSATTASTKATEAAGSATAAAQSKSTAESAATRAETAAKRAEDIASAVALEDASTTKKG 467 K + + T+ S +A + + AAT R + + DAS T+KG Sbjct: 89 KITTEIPAASLTQKGISQLNSATNSDREDQAATPKAVHDVRKIAESKLSGVSDASLTQKG 148 Query: 468 IVQLSSATNSTSETLAATPKAVKSAY---------------------DNAEKRLQKDQNG 506 IVQLSSATNST+ETLAATPKAVK AY DNA RL K+QNG Sbjct: 149 IVQLSSATNSTNETLAATPKAVKGAYDFANTANVAAKNAHDEANRATDNANSRLAKNQNG 208 Query: 507 ADIPDKGCFLNNINAVSKTDFADKRGMRYVRVNAPAGATSGKYYPVVVMRSAGSVSELAS 566 ADIP+K F+ N+ + A ++N GK V +AG V L Sbjct: 209 ADIPNKSEFIKNLGLTETVELAKSAVPNSRKIN-------GKALSGDVSLNAGDVGALP- 260 Query: 567 RVIITTATRTAGDPMNN 583 + T + +T +NN Sbjct: 261 -ISSTLSAQTGTLRINN 276 >UniRef50_UPI0001B553A8 Transglycosylase domain protein n=1 Tax=Streptomyces sp. SPB78 RepID=UPI0001B553A8 Length = 1544 Score = 84.2 bits (206), Expect = 3e-14, Method: Composition-based stats. 
Identities = 78/406 (19%), Positives = 141/406 (34%), Gaps = 20/406 (4%) Query: 102 ALRRFELMVEEVARNASAVAQNTAAAKKSASDASTSAREAATHAADAADSARAASTSAGQ 161 A+++ + + +A + T KS A+ A + A+ A + R A+ S Q Sbjct: 390 AVKKVSEVFQAQKASADEATRATDNGAKSNVQAAQRALQMASAQDSLAQAHRQAARSIAQ 449 Query: 162 AASSAQSASSSAGTASTKA----TEASKSAAAAESSKSAAATSAGAAKTSETNASASLQS 217 A + A+ +A A +A T+ +++ AE+S + +A A+ S +A + Sbjct: 450 ANRQVEDATRAAADAQQRAADQRTQGARAIERAEASLADSARGVLRAEQSVADAQRDAKQ 509 Query: 218 AATSASTATTKASEAATSARDAAASKEAAKSSETNASSSASSAASSATA----------- 266 A + A A++ + D + + T A A + + A Sbjct: 510 AQLDLTAARKTAAQQLQALDDQLRDGQLGQRDATLRVQEAQLALNKSMADPRSTQLERDR 569 Query: 267 AGNSAKAAKTSETNARSSETAAGQSASAAAGSKTAAASSASAASTSAGQASASATAAGKS 326 A A S + S +SA A + A +A QA + AG+S Sbjct: 570 AQLQLDQATQSLKEQQQSYKDLQKSAEAQRRAGVEGAEVVLSAQERLTQAQRQSADAGQS 629 Query: 327 AESAASSASTATTKAGEATEQASA----AARSASAAKTSETNAKASETSAESSKTAAASS 382 A + +TA +A QAS AA+ A+ T+A + A++ + Sbjct: 630 LADAQRAQTTAARDLADAQRQASRDQVDAAQRVKDAQRGVTDAVQAVADAQTDGARQVEA 689 Query: 383 ASSAASSASSASASKDEATRQASAAKSSATTASTKATEAAGSATAAAQSKSTAESAATRA 442 A SA + ++ A TK A + A + + T S A + Sbjct: 690 AERGVQSARLSGIDTTVKAATSTDKYREALDKLTKPQRALYDSVAGLKERFTDWSDALQP 749 Query: 443 ETAAKRAEDI-ASAVALEDASTTKKGIVQLSSATNSTSETLAATPK 487 E + A+ L + +G + S + P Sbjct: 750 EVLPLMTRGVRAAGKTLPAFTPIVEGATRAVSRLFDAASKNLKKPF 795 >UniRef50_C6C5D4 Tail Collar domain protein n=1 Tax=Dickeya dadantii Ech703 RepID=C6C5D4_DICDC Length = 557 Score = 82.7 bits (202), Expect = 7e-14, Method: Composition-based stats. 
Identities = 42/100 (42%), Positives = 59/100 (59%), Gaps = 6/100 (6%) Query: 902 ESYPVGAPIPWPSDTVPSGYALMQGQAFDKSAYPKLAAAYPSGVIPDMRGWTIKGKP--- 958 + G P+PWP T P+G+ GQ+FDK+ YPKL AAYPSG +PD+RG I+G Sbjct: 420 AAEIAGIPLPWPQATAPTGWLKCNGQSFDKTLYPKLTAAYPSGTLPDLRGEFIRGWDDGR 479 Query: 959 --ASGRAVLSQEQDG-IKSHTHSASASSTDLGTKTTSSFD 995 SGRAVLS + I+ + S +A++T S+F+ Sbjct: 480 GVDSGRAVLSVQDATWIQPNIESNTAATTIRIDNVDSTFN 519 >UniRef50_B5JW06 EleCtron transport complex, rnfabcdge type, c subunit n=2 Tax=Gammaproteobacteria RepID=B5JW06_9GAMM Length = 792 Score = 82.7 bits (202), Expect = 7e-14, Method: Composition-based stats. Identities = 80/376 (21%), Positives = 154/376 (40%), Gaps = 11/376 (2%) Query: 69 PSHAGTITVYEDSQPGTLNDFLGAMTEDDARPEALRRFELMVEEVARNASAVAQNTAAAK 128 PSH + Y ++ N + + A+ R E + E A A+ + Sbjct: 411 PSHIPLVQYYRFAKTEIRNAEVEKRKAEHAKERHDFRLERLEREQQEKAEAMRRKKEEIA 470 Query: 129 KSASDASTSAREAATHAADAADSARAASTSAGQAASSAQSA-SSSAGTASTKATEASKSA 187 + +DA+ +A ++ A +A A + A +A + Q A + + A+ A +A+ +A Sbjct: 471 RKKADANKAA--SSDKPEQDAIAAAVARSEARKAEKAKQRAGTRTDNNAANDAKQAAIAA 528 Query: 188 AAAESSKSAAATSAGAAKTSETNASASLQSAATSASTATTKASEAATSARDAAASKEAAK 247 A A + + + + ++ +A + + A AS+ SA DA S A Sbjct: 529 AKARAKANTEGSDSAKSQAEAAKQAAIEAAKKRAQQQAQPSASD--KSADDARKSAIEAA 586 Query: 248 SSETNASSSA-SSAASSATAAGNSAKAAKTSETNARSSETAAGQSASAAAGSKTAAASSA 306 + A SA ++ A+ N+ +AAK +SS A + S A K AA ++A Sbjct: 587 KARAQAKQSANQRDSAGASNKQNAIEAAKQRALAKQSSAQEAPEGESKATDPKQAAIAAA 646 Query: 307 SAASTSAGQASASATAAGKSAESAASSASTATTKAGEATEQASAAARSASAAKTSETNAK 366 A + + Q A+ TAA A A+ K +Q+ A+ + K + Sbjct: 647 KARALAKQQGEAANTAAESQAGDDPKKAAIEAAKKRALEKQSKASETDTNEEKPVDPKKA 706 Query: 367 ASETSAESSKTAAASSASSAASSASSASASKDEATRQASAAKSSATTASTKATEAAGSAT 426 A E + + + + S +++ + +++ + EA + + AK EA+ ++ Sbjct: 707 AIEAAKQRALAKQSGSNTASDDAPANSKQAAIEAAKTRALAKQ-----QDNGDEASEPSS 761 Query: 427 AAAQSKSTAESAATRA 442 A ++ E+A RA Sbjct: 762 ATDPKQAAIEAAKQRA 777 Score = 68.4 bits (165), Expect = 2e-09, Method: Composition-based stats. 
Identities = 84/352 (23%), Positives = 139/352 (39%), Gaps = 19/352 (5%) Query: 161 QAASSAQSASSSAGTASTKATEASKSAAAAESSKSAAATSAGAAKTSETNASASLQSAAT 220 + A++ + K +A+K+A++ + + A +A A++ A + Q A T Sbjct: 454 EQQEKAEAMRRKKEEIARKKADANKAASSDKPEQD--AIAAAVARSEARKAEKAKQRAGT 511 Query: 221 SASTATTKASEAATSARDAAASKEAAKSSETNA-SSSASSAASSATAAGNSAKAAKTSET 279 +A DA + AA + A + + SA S A AA +A A Sbjct: 512 RTDN---------NAANDAKQAAIAAAKARAKANTEGSDSAKSQAEAAKQAAIEAAKKRA 562 Query: 280 NARSSETAAGQSASAAAGSKTAAASSASAASTSAGQA-SASATAAGKSAESAASSASTAT 338 ++ +A+ +SA A S AA + + A SA Q SA A+ + E+A A Sbjct: 563 QQQAQPSASDKSADDARKSAIEAAKARAQAKQSANQRDSAGASNKQNAIEAAKQRALAKQ 622 Query: 339 TKAGEATEQASAAARSASAAKTSETNAKASETSAESSKTAAASSASSAASSASSASASKD 398 + A EA E S A AA + ++ E++ TAA S A A+ +A K Sbjct: 623 SSAQEAPEGESKATDPKQAAIAAAKARALAKQQGEAANTAAESQAGDDPKKAAIEAAKKR 682 Query: 399 EATRQ--ASAAKSSATTASTKATEAAGSATAAAQSKSTAESAATRAETAAKRAEDIASAV 456 +Q AS ++ A +A A +K + + A+ A + I +A Sbjct: 683 ALEKQSKASETDTNEEKPVDPKKAAIEAAKQRALAKQSGSNTASDDAPANSKQAAIEAAK 742 Query: 457 ALEDASTTKKGIVQLSSATNSTSETLAATPKAVKSAYDNAEKRLQKDQNGAD 508 A G + S +++T AA A + A A K Q G+D Sbjct: 743 TRALAKQQDNG-DEASEPSSATDPKQAAIEAAKQRALARARD---KSQQGSD 790 >UniRef50_A9EEQ2 Amylopullulanase n=9 Tax=cellular organisms RepID=A9EEQ2_LACPL Length = 2057 Score = 82.7 bits (202), Expect = 8e-14, Method: Composition-based stats. 
Identities = 83/402 (20%), Positives = 164/402 (40%), Gaps = 9/402 (2%) Query: 183 ASKSAAAAESSKSAAATSAGAAKTSETNASASLQSAATSASTATTKASEAATSARDAAAS 242 A+ + + + + AA S AA +++T S + ++ +A++ T S ++ DAA S Sbjct: 36 AATANSTDVVNSTDAANSTDAANSTDTANSTDVANSTDAANSTDTANSTDVANSTDAANS 95 Query: 243 KEAAKSSETNASSSASSAASSATAAGNSAKAAKTSETNARSSETAAGQSASAAAGSKTAA 302 + A S++ S+ A+++ +A A ++ A T N+ + + S A + T A Sbjct: 96 TDTANSTDVANSTDAANSTDTADTANSTDAANSTDTANSTDVANSTDTANSTDAANSTDA 155 Query: 303 ASSASAASTSAGQASASATAAGKSAESAASSASTATTKAGEATEQASAAARSASAAKTSE 362 A+S A+++ A++T A S ++A S+ + +T +T+ A++ + S + Sbjct: 156 ANSTDTANSTDT---ANSTDAANSTDAANSTDTANSTGTANSTDAANSTDTANSTDTANS 212 Query: 363 TNAKASETSAESSKTAAASSASSAASSASSASASKDEATRQASAAKSSATTASTKATEAA 422 T+ S +A S+ TA ++ +++ +A+S + T ++ A +S TA++ T + Sbjct: 213 TDTANSTDAANSTDTANSTDTANSTDTANSTDTANSTDTANSTDAANSTDTANSTDTANS 272 Query: 423 GSATAAAQSKSTAESAATRAETAAKRAEDIASAVALEDASTTKKGIVQLSSATNSTSETL 482 + +S S + A E A + D S T G +++T + Sbjct: 273 TDTANSTESSSEYATQALSDEKNATQNNDFTSFDKKWAYEGTDLGFNYSTTSTTFKIWSP 332 Query: 483 AATPKAVKSAYDNAEKRLQKDQ-----NGADIPDKGCFLNNINAVSKTDFADKRGMRY-V 536 AT + S N + G N I + T ++ GM Y Sbjct: 333 TATSVQLISYGTNTNPTAAQVSAKAMTRGTSATPTNHATNTIGVWTLTVPGNQNGMVYAY 392 Query: 537 RVNAPAGATSGKYYPVVVMRSAGSVSELASRVIITTATRTAG 578 ++ G S S SVS + T+ Sbjct: 393 KLTFADGTVSDYAGSTYGTLSTSSVSNTTNDPYSIATTQGGN 434 >UniRef50_UPI00016BFB6A hypothetical protein Epulo_10362 n=1 Tax=Epulopiscium sp. 'N.t. morphotype B' RepID=UPI00016BFB6A Length = 1888 Score = 81.5 bits (199), Expect = 2e-13, Method: Composition-based stats. 
Identities = 77/386 (19%), Positives = 157/386 (40%), Gaps = 4/386 (1%) Query: 107 ELMVEEVARNASAVAQNTAAAKKSASDASTSAREAATHAADAADSARAASTSAGQAASSA 166 E++ E + V+ A A ++ ++ + T A + AD++ S + + Sbjct: 630 EIVSAEKTDASETVSAEKADASETV--SAEKPDASETEAGEKADASETVSAGKAATSET- 686 Query: 167 QSASSSAGTASTKATEASKSAAAAESSKSAAATSAGAAKTSETNASASLQSAATSASTAT 226 SA +A + + A +A+ S + + T + + SA A+ + S Sbjct: 687 TSAGKAATSETVSAGKAATSETVSAEKAVTSETVSVEKAATSETVSAEKADASETVSAEK 746 Query: 227 TKASEAATSARDAAASKEAAKSSETNASSSASSAASSATAAGNSAKAAKTSET-NARSSE 285 T ASE + + A+ +A+ ++ + + SA +SAT+ SA+ A TSET +A + Sbjct: 747 TDASETVSGEKTDASETVSAEKADASETVSAEKTVASATSETVSAEKAVTSETVSAEKAA 806 Query: 286 TAAGQSASAAAGSKTAAASSASAASTSAGQASASATAAGKSAESAASSASTATTKAGEAT 345 T+ SA S+T +A A+ T +G+ + ++ A+ + S A E Sbjct: 807 TSETVSAEKTDASETVSAEKTDASETVSGEKTDASETVSGEKTDASETVSAEKPDASETV 866 Query: 346 EQASAAARSASAAKTSETNAKASETSAESSKTAAASSASSAASSASSASASKDEATRQAS 405 A + + +E + S ++S+T +A + AA+S + + D + ++ Sbjct: 867 SGEKAVTSETESGEKAEASETVSGEKTDASETTSAENGEKAATSETVSVEKADASETVSA 926 Query: 406 AAKSSATTASTKATEAAGSATAAAQSKSTAESAATRAETAAKRAEDIASAVALEDASTTK 465 ++ T S + A+G++ + K A +T A + S + + T Sbjct: 927 EKAVTSETVSGEKAVASGTSETVSAEKPDASETEVGEKTVVSEASETVSGEKTDASETVS 986 Query: 466 KGIVQLSSATNSTSETLAATPKAVKS 491 S ++ + T A K+ Sbjct: 987 AEKADASETVSAEKADASETVSAEKA 1012 Score = 80.0 bits (195), Expect = 5e-13, Method: Composition-based stats. 
Identities = 86/421 (20%), Positives = 168/421 (39%), Gaps = 4/421 (0%) Query: 107 ELMVEEVARNASAVAQNTAAAKKSASDASTSAREAATHAADAADSARAASTSAGQAASSA 166 E + E + V+ A A ++ S A E + A +A +A SA Sbjct: 459 ETVSGEKTDASETVSAGKADASETVSAEKADASETVSAGKADASETVSAEKAATSETVSA 518 Query: 167 QSASSSAGTASTKATEASKSAAAAESSKSAAATSAGAAKTSETNASASLQSAATSASTAT 226 A +S ++ KA + +A + + + T++ + S A+ + S Sbjct: 519 GKADASETVSAEKAVTSETVSAEKTVASATSETTSAEKPDASETVSGEKTDASETTSGEK 578 Query: 227 TKASEAATSARDAAASKEAAKSSETNASSSASSAASSATAAGNSAKAAKTSETNARSSET 286 T ASE ++ + A+ +A+ ++ + + SA A +SAT+ SA A SE S Sbjct: 579 TDASETVSAEKADASETVSAEKADASETVSAEKAVASATSETVSAGKADASEI---VSAE 635 Query: 287 AAGQSASAAAGSKTAAASSASAASTSAGQASASATAAGKSAESAASSASTATTKAGEATE 346 S + +A A+ + ++ ++ + A ++ + ++ S T+ AT Sbjct: 636 KTDASETVSAEKADASETVSAEKPDASETEAGEKADASETVSAGKAATSETTSAGKAATS 695 Query: 347 QASAAARSASAAKTSETNAKASETSAESSKTAAASSASSAASSASSASASKDEATRQASA 406 + +A ++A++ S A SET + + + ++ A ++ + SA K +A+ S Sbjct: 696 ETVSAGKAATSETVSAEKAVTSETVSVEKAATSETVSAEKADASETVSAEKTDASETVSG 755 Query: 407 AKSSATTASTKATEAAGSATAAAQSKSTAESAATRAETAAKRAEDIAS-AVALEDASTTK 465 K+ A+ + A +A ++ ++A S AE A A A E S K Sbjct: 756 EKTDASETVSAEKADASETVSAEKTVASATSETVSAEKAVTSETVSAEKAATSETVSAEK 815 Query: 466 KGIVQLSSATNSTSETLAATPKAVKSAYDNAEKRLQKDQNGADIPDKGCFLNNINAVSKT 525 + SA + + + K S + EK + A+ PD ++ AV+ Sbjct: 816 TDASETVSAEKTDASETVSGEKTDASETVSGEKTDASETVSAEKPDASETVSGEKAVTSE 875 Query: 526 D 526 Sbjct: 876 T 876 Score = 78.0 bits (190), Expect = 2e-12, Method: Composition-based stats. 
Identities = 79/395 (20%), Positives = 156/395 (39%), Gaps = 4/395 (1%) Query: 113 VARNASAVAQNTAAAKKSASDASTSAREAATHAADAADSARAASTSAGQAASSAQSASSS 172 A N A + + + A + T + E A + + AS ++ ++ AS + Sbjct: 900 SAENGEKAATSETVSVEKADASETVSAEKAVTSETVSGEKAVASGTSETVSAEKPDASET 959 Query: 173 AGTASTKATEASKSAAAAESSKSA---AATSAGAAKTSETNASASLQSAATSASTATT-K 228 T +EAS++ + ++ S A + + S A AS +A A + T Sbjct: 960 EVGEKTVVSEASETVSGEKTDASETVSAEKADASETVSAEKADASETVSAEKADASETVS 1019 Query: 229 ASEAATSARDAAASKEAAKSSETNASSSASSAASSATAAGNSAKAAKTSETNARSSETAA 288 A +AATS +A A+ +SET ++ ++ + + ++++ + +A + +A Sbjct: 1020 AGKAATSETVSAEKAVASATSETVSAEKTDASETVSAEKADASETVSAEKADASETVSAG 1079 Query: 289 GQSASAAAGSKTAAASSASAASTSAGQASASATAAGKSAESAASSASTATTKAGEATEQA 348 + S + AS + +A + SA S +A A + T + E T+ + Sbjct: 1080 KAATSETVSVEKTDASETVSGEKTATSETVSAGKTDASETVSAEKADASETVSAEKTDAS 1139 Query: 349 SAAARSASAAKTSETNAKASETSAESSKTAAASSASSAASSASSASASKDEATRQASAAK 408 + +A + + KA+ + S + AAAS SA + +S + S ++A + Sbjct: 1140 ETVSAGKAATSETTSAGKAATSETVSVEKAAASETVSAEKAVTSETVSAEKAATSEPVSG 1199 Query: 409 SSATTASTKATEAAGSATAAAQSKSTAESAATRAETAAKRAEDIASAVALEDASTTKKGI 468 + T + E ++ + K+ + +T A VA E S K Sbjct: 1200 EKTDASETVSAEKTATSETVSAGKAATSETVSGEKTDASETVSAEKTVASETVSVEKTDA 1259 Query: 469 VQLSSATNSTSETLAATPKAVKSAYDNAEKRLQKD 503 + SA + + + KA S +AEK + Sbjct: 1260 SETISAEKADASETVSAEKADASETTSAEKTVASA 1294 Score = 77.6 bits (189), Expect = 2e-12, Method: Composition-based stats. 
Identities = 83/385 (21%), Positives = 160/385 (41%), Gaps = 6/385 (1%) Query: 112 EVARNASAVAQNTAAAKKSASDASTSAREAATHAADAADSARAASTSAGQAASSAQSASS 171 E A T +A K+A+ + S +AA +A+ A + T + + A++++ S Sbjct: 1140 ETVSAGKAATSETTSAGKAATSETVSVEKAAASETVSAEKAVTSETVSAEKAATSEPVSG 1199 Query: 172 SAGTAST--KATEASKSAAAAESSKSAAATSAGAAKTSETNASASLQSAATSASTATTKA 229 AS A + + S + + + T +G + SA A+ + S T A Sbjct: 1200 EKTDASETVSAEKTATSETVSAGKAATSETVSGEKTDASETVSAEKTVASETVSVEKTDA 1259 Query: 230 SEAATSARDAAA---SKEAAKSSETNASSSASSAASSATAAGNSAKAAKT-SETNARSSE 285 SE ++ + A+ S E A +SET ++ ++A+S T +G A++T S A +SE Sbjct: 1260 SETISAEKADASETVSAEKADASETTSAEKTVASATSETVSGEKTDASETVSAEKADASE 1319 Query: 286 TAAGQSASAAAGSKTAAASSASAASTSAGQASASATAAGKSAESAASSASTATTKAGEAT 345 T + + A+ +++ S ASA+ A+ + S T A E Sbjct: 1320 TVSAEKTDASETVSVEKVATSETVSAEKTVASAAPETVSAEKTDASETVSGEKTDASETV 1379 Query: 346 EQASAAARSASAAKTSETNAKASETSAESSKTAAASSASSAASSASSASASKDEATRQAS 405 AA +A+ ++T+ S A++S+T +A ++A+S + ++ D + + Sbjct: 1380 SGEKAATSETVSAEKADTSETVSAGKADASETVSAEKPVASATSETVSAEKADASETVSV 1439 Query: 406 AAKSSATTASTKATEAAGSATAAAQSKSTAESAATRAETAAKRAEDIASAVALEDASTTK 465 ++ T S + T A+ ++ + K+ A + + A V E S K Sbjct: 1440 EKTDASETVSAEKTVASATSETVSAEKADASETVSGEKADASETVSAGKTVTSETVSAGK 1499 Query: 466 KGIVQLSSATNSTSETLAATPKAVK 490 + SA + + + T K Sbjct: 1500 AAASETVSAEKTVASATSETVSGEK 1524 Score = 75.3 bits (183), Expect = 1e-11, Method: Composition-based stats. 
Identities = 73/416 (17%), Positives = 169/416 (40%), Gaps = 5/416 (1%) Query: 118 SAVAQNTAAAKKSASDASTSAREAATHAADAADSARAASTSAGQAASSAQSASSSAGTAS 177 + + ++ A+TS + A + + + ++ + AS ++ T S Sbjct: 891 EKTDASETTSAENGEKAATSETVSVEKADASETVSAEKAVTSETVSGEKAVASGTSETVS 950 Query: 178 TKATEASKSAAAAESSKSAAATSAGAAKTSETNASASLQSAATSASTATTKASEAATSAR 237 + +AS++ ++ S A+ + KT + ++ ++ A+ +A + SA Sbjct: 951 AEKPDASETEVGEKTVVSEASETVSGEKTDASETVSAEKADASETVSAEKADASETVSAE 1010 Query: 238 DAAASKE-AAKSSETNASSSASSAASSATAAGNSAKAAKTSET-NARSSETAAGQSASAA 295 A AS+ +A + T+ + SA A +SAT+ SA+ SET +A ++ + SA A Sbjct: 1011 KADASETVSAGKAATSETVSAEKAVASATSETVSAEKTDASETVSAEKADASETVSAEKA 1070 Query: 296 AGSKTAAASSASAASTSAGQASASATAAGKSAESAASSASTATTKAGEATEQASAAARSA 355 S+T +A A+ + T + + + ++ + + + S T A E A A Sbjct: 1071 DASETVSAGKAATSETVSVEKTDASETVSGEKTATSETVSAGKTDASETVSAEKADASET 1130 Query: 356 SAAKTSETNAKASETSAESSKTAAASSASSAASSASSASASKDEATRQASAAKSSATTAS 415 +A+ ++ + S A +S+T +A A+++ + + +A+ + + + + + + Sbjct: 1131 VSAEKTDASETVSAGKAATSETTSAGKAATSETVSVEKAAASETVSAEKAVTSETVSAEK 1190 Query: 416 TKATEAAGSATAAAQSKSTAESAATRAETAAKRAEDIASAVALEDASTTKKGIVQLSSAT 475 +E A +AE AT +A +A A++ + T V Sbjct: 1191 AATSEPVSGEKTDASETVSAEKTATSETVSAGKA---ATSETVSGEKTDASETVSAEKTV 1247 Query: 476 NSTSETLAATPKAVKSAYDNAEKRLQKDQNGADIPDKGCFLNNINAVSKTDFADKR 531 S + ++ T + + + A+ AD + + + + + ++ Sbjct: 1248 ASETVSVEKTDASETISAEKADASETVSAEKADASETTSAEKTVASATSETVSGEK 1303 Score = 63.0 bits (151), Expect = 7e-08, Method: Composition-based stats. 
Identities = 59/331 (17%), Positives = 127/331 (38%) Query: 161 QAASSAQSASSSAGTASTKATEASKSAAAAESSKSAAATSAGAAKTSETNASASLQSAAT 220 + A +++ +T AA + T+ + E ++ SA Sbjct: 370 ASVEKAATSNIVPVEKTTTLNIIPVEKAAILNIVPEEKTATSETVSGEKTDASETVSAEK 429 Query: 221 SASTATTKASEAATSARDAAASKEAAKSSETNASSSASSAASSATAAGNSAKAAKTSETN 280 + ++ T + ATS +A A+ +SET + ++ + + ++++ + + Sbjct: 430 ADASETVSVEKVATSETVSAEKTVASATSETVSGEKTDASETVSAGKADASETVSAEKAD 489 Query: 281 ARSSETAAGQSASAAAGSKTAAASSASAASTSAGQASASATAAGKSAESAASSASTATTK 340 A + +A AS ++ AA S +A + + SA A S +A + T Sbjct: 490 ASETVSAGKADASETVSAEKAATSETVSAGKADASETVSAEKAVTSETVSAEKTVASATS 549 Query: 341 AGEATEQASAAARSASAAKTSETNAKASETSAESSKTAAASSASSAASSASSASASKDEA 400 + E+ A+ + + +T A + +A + AS S+ + ++ A Sbjct: 550 ETTSAEKPDASETVSGEKTDASETTSGEKTDASETVSAEKADASETVSAEKADASETVSA 609 Query: 401 TRQASAAKSSATTASTKATEAAGSATAAAQSKSTAESAATRAETAAKRAEDIASAVALED 460 + ++A S +A SA S++ + A +ET + D + A E Sbjct: 610 EKAVASATSETVSAGKADASEIVSAEKTDASETVSAEKADASETVSAEKPDASETEAGEK 669 Query: 461 ASTTKKGIVQLSSATNSTSETLAATPKAVKS 491 A ++ ++ + +TS AAT + V + Sbjct: 670 ADASETVSAGKAATSETTSAGKAATSETVSA 700 Score = 59.9 bits (143), Expect = 5e-07, Method: Composition-based stats. 
Identities = 50/292 (17%), Positives = 108/292 (36%) Query: 213 ASLQSAATSASTATTKASEAATSARDAAASKEAAKSSETNASSSASSAASSATAAGNSAK 272 + + + S T ASE ++ + A+ + + T+ + SA +SAT+ S + Sbjct: 405 EEKTATSETVSGEKTDASETVSAEKADASETVSVEKVATSETVSAEKTVASATSETVSGE 464 Query: 273 AAKTSETNARSSETAAGQSASAAAGSKTAAASSASAASTSAGQASASATAAGKSAESAAS 332 SET + A+ ++ A + ++ + AS + A+ + + ++ AS Sbjct: 465 KTDASETVSAGKADASETVSAEKADASETVSAGKADASETVSAEKAATSETVSAGKADAS 524 Query: 333 SASTATTKAGEATEQASAAARSASAAKTSETNAKASETSAESSKTAAASSASSAASSASS 392 +A T A SA++ TS ASET + A+ +++ ++ + Sbjct: 525 ETVSAEKAVTSETVSAEKTVASATSETTSAEKPDASETVSGEKTDASETTSGEKTDASET 584 Query: 393 ASASKDEATRQASAAKSSATTASTKATEAAGSATAAAQSKSTAESAATRAETAAKRAEDI 452 SA K +A+ SA K+ A+ + A + + + S AE Sbjct: 585 VSAEKADASETVSAEKADASETVSAEKAVASATSETVSAGKADASEIVSAEKTDASETVS 644 Query: 453 ASAVALEDASTTKKGIVQLSSATNSTSETLAATPKAVKSAYDNAEKRLQKDQ 504 A + + +K + A + + ++ + + + Sbjct: 645 AEKADASETVSAEKPDASETEAGEKADASETVSAGKAATSETTSAGKAATSE 696 >UniRef50_B6KAY2 Putative uncharacterized protein n=3 Tax=Toxoplasma gondii RepID=B6KAY2_TOXGO Length = 1112 Score = 81.1 bits (198), Expect = 2e-13, Method: Composition-based stats. 
Identities = 67/388 (17%), Positives = 127/388 (32%), Gaps = 5/388 (1%) Query: 66 GFPPSHAGTITVYEDSQPGTLNDFLGAMTEDDARPEALRRFELMVEEVARNASAVAQNTA 125 GF + G I +P FL + E +A+ ++ V S + Sbjct: 730 GFLKTSLGKIPAPATHKP----RFLNLLDETTRPDDAVHGEPEVLRIVRNRMSIMQNVIV 785 Query: 126 AAKKSASDASTSAREAATHAADAADSARAASTSAGQAASSAQSASSSAGTASTKATEASK 185 A + A ++ E A+D + + S++ +S A A Sbjct: 786 TAAEPAPESQAKREEGQASASDGSAPEAGGDRAKPSDGSTSLKSSQGRERTVQAADSAGV 845 Query: 186 SAAAAESSKSAAATSAGAAKTSETNASASLQSAATSASTATTKASEAATSARDAAASKEA 245 + + A + ++ A A+ A+ A+T +AA S + Sbjct: 846 PVSGSGRLGGKAKEAVSGRESGRGTTLAPPSRASAKATNASTGGKKAAESKAAGKSKAAE 905 Query: 246 AKSSETNASSSASSAASSATAAGNSAKAAKTSETNARSSETAAGQSASAAAGSKTAAASS 305 A + + + + S ATA G ++K K A ++ + S + +K+AA+ + Sbjct: 906 ASTRSLQSPRPEAKSKSGATAEGKTSKGGKQGGAPAAGKQSKSA-SPTTKQPNKSAASPN 964 Query: 306 ASAASTSAGQASASATAAGKSAESAASSASTATTKAGEATEQASAAARSASAAKTSETNA 365 A SA A AA +A+ A A + + ASA A A + Sbjct: 965 AKLDPKSAAGRKGDAKAAAPAAKGDAKQAGFSPRPKENSRTAASADATKQKQASPAGKQP 1024 Query: 366 KASETSAESSKTAAASSASSAASSASSASASKDEATRQASAAKSSATTASTKATEAAGSA 425 + + + ++T + S+ S + + Q A + S A + A A Sbjct: 1025 AKAANAGKIAETKVTAPPPVKKSAGVSPKGAAAKPAAQKDAGPPKKDSKSGTAEKKAAPA 1084 Query: 426 TAAAQSKSTAESAATRAETAAKRAEDIA 453 + + + A A Sbjct: 1085 KPVESAPKKEATGKALKKAAPSGPPKKA 1112 >UniRef50_C5KW02 Calcium-binding tyrosine phosphorylation-regulated protein, putative n=4 Tax=Perkinsus marinus ATCC 50983 RepID=C5KW02_9ALVE Length = 597 Score = 80.3 bits (196), Expect = 4e-13, Method: Composition-based stats. 
Identities = 50/314 (15%), Positives = 80/314 (25%) Query: 128 KKSASDASTSAREAATHAADAADSARAASTSAGQAASSAQSASSSAGTASTKATEASKSA 187 + AS ++ + +A +A A +A A A + TEA Sbjct: 137 SEDASSSNATTTDAPVGTTEAPVETTEAPVETTEAPLETTEAPVETTEAPVETTEAPIET 196 Query: 188 AAAESSKSAAATSAGAAKTSETNASASLQSAATSASTATTKASEAATSARDAAASKEAAK 247 A + A A T A + + +E + Sbjct: 197 TEAPIETTEAPIETTEAPIETTEAPVETTEVPVETTEVPVETTEVPVETTEVPVETTEVP 256 Query: 248 SSETNASSSASSAASSATAAGNSAKAAKTSETNARSSETAAGQSASAAAGSKTAAASSAS 307 T + T A T T A + T + Sbjct: 257 VETTEVPVETTEVPVETTEAPIETTEVPIETTEVPIETTEAPVETTGKPVGTTEVPVDTT 316 Query: 308 AASTSAGQASASATAAGKSAESAASSASTATTKAGEATEQASAAARSASAAKTSETNAKA 367 A +A T A A + E + A + T Sbjct: 317 EAPVDTTEAPVDTTEAPVDTTEAPVETTEVPVDTTEVPVDTTEAPVDTTEVPVETTEVPG 376 Query: 368 SETSAESSKTAAASSASSAASSASSASASKDEATRQASAAKSSATTASTKATEAAGSATA 427 T + KT + A + + A A EA + + A + T A + TEA T Sbjct: 377 ETTESPVEKTEGPVDTTEAPAETTEAPAETTEAPAETTEAPAETTEAPAETTEAPAETTE 436 Query: 428 AAQSKSTAESAATR 441 A + A + T Sbjct: 437 APAETTEAPADTTE 450 Score = 71.9 bits (174), Expect = 1e-10, Method: Composition-based stats. 
Identities = 48/319 (15%), Positives = 79/319 (24%) Query: 141 AATHAADAADSARAASTSAGQAASSAQSASSSAGTASTKATEASKSAAAAESSKSAAATS 200 + A+ + + A +A A A + TEA A + A Sbjct: 136 CSEDASSSNATTTDAPVGTTEAPVETTEAPVETTEAPLETTEAPVETTEAPVETTEAPIE 195 Query: 201 AGAAKTSETNASASLQSAATSASTATTKASEAATSARDAAASKEAAKSSETNASSSASSA 260 A T A A + A + +E + T + Sbjct: 196 TTEAPIETTEAPIETTEAPIETTEAPVETTEVPVETTEVPVETTEVPVETTEVPVETTEV 255 Query: 261 ASSATAAGNSAKAAKTSETNARSSETAAGQSASAAAGSKTAAASSASAASTSAGQASASA 320 T T A T + T A + + Sbjct: 256 PVETTEVPVETTEVPVETTEAPIETTEVPIETTEVPIETTEAPVETTGKPVGTTEVPVDT 315 Query: 321 TAAGKSAESAASSASTATTKAGEATEQASAAARSASAAKTSETNAKASETSAESSKTAAA 380 T A A + A EA + + + T A T T Sbjct: 316 TEAPVDTTEAPVDTTEAPVDTTEAPVETTEVPVDTTEVPVDTTEAPVDTTEVPVETTEVP 375 Query: 381 SSASSAASSASSASASKDEATRQASAAKSSATTASTKATEAAGSATAAAQSKSTAESAAT 440 + + + EA + + A + T A + TEA T A + A + T Sbjct: 376 GETTESPVEKTEGPVDTTEAPAETTEAPAETTEAPAETTEAPAETTEAPAETTEAPAETT 435 Query: 441 RAETAAKRAEDIASAVALE 459 A A + V ++ Sbjct: 436 EAPAETTEAPADTTEVCMK 454 Score = 69.9 bits (169), Expect = 5e-10, Method: Composition-based stats. 
Identities = 52/328 (15%), Positives = 82/328 (25%), Gaps = 4/328 (1%) Query: 153 RAASTSAGQAASSAQSASSSAGTASTKATEASKSAAAAESSKSAAATSAGAAKTSETNAS 212 + A S++++ A TEA A + A A T A Sbjct: 127 EGCDGDPYTCSEDASSSNATTTDAPVGTTEAPVETTEAPVETTEAPLETTEAPVETTEAP 186 Query: 213 ASLQSAATSASTATTKASEAATSARDAAASKEAAKSSETNASSSASSAASSATAAGNSAK 272 A + A + +EA +A A T + T Sbjct: 187 VETTEAPIETTEAPIETTEAPIETTEAPIETTEAPVETTEVPVETTEVPVETTEVPVETT 246 Query: 273 AAKTSETNARSSETAAGQSASAAAGSKTAAASSASAASTSAGQASASATAAGKSAESAAS 332 T T + T A + + T A Sbjct: 247 EVPVETTEVPVETTEVPVETTEVPVETTEAPIETTEVPIETTEVPIETTEAPVETTGKPV 306 Query: 333 SASTATTKAGEATEQASAAARSASAAKTSETNAKASETSAESSKTAAASSASSAASSASS 392 + EA + A + A T A T T + A + Sbjct: 307 GTTEVPVDTTEAPVDTTEAPVDTTEAPVDTTEAPVETTEVPVDTTEVPVDTTEAPVDTTE 366 Query: 393 ASASKDEATRQASAAKSSATTASTKATEAAGSATAAAQSKSTAESAATRAETAAKRAEDI 452 E + + + T TEA T A +++T A T A Sbjct: 367 VPVETTEVPGETTESPVEKTEGPVDTTEAPAETTEAP-AETTEAPAETTEAPAETTE--- 422 Query: 453 ASAVALEDASTTKKGIVQLSSATNSTSE 480 A A E + T + + + A T+E Sbjct: 423 APAETTEAPAETTEAPAETTEAPADTTE 450 Score = 64.9 bits (156), Expect = 2e-08, Method: Composition-based stats. 
Identities = 60/387 (15%), Positives = 97/387 (25%), Gaps = 18/387 (4%) Query: 92 AMTEDDARPEALRRFELMVEEVARNASAVAQNTAAAKKSASDASTSAREAATHAADAADS 151 A E P + EV + V T ++ E + Sbjct: 206 APIETTEAPIETTEAPVETTEVPVETTEVPVETTEVPVETTEVPVETTEVPVETTEVPVE 265 Query: 152 ARAASTSAGQAASSAQSASSSAGTASTKATEASKSAAAAESSKSAAATSAGAAKTSETNA 211 +A + TEA + A T A Sbjct: 266 TTEVPVETTEAPIETTEVPIETTEVPIETTEAPVETTGKPVGTTEVPVDTTEAPVDTTEA 325 Query: 212 SASLQSAATSASTATTKASEAATSARDAAASKEAAKSSETNASSSASSAASSATAAGNSA 271 A + A + +E + A T + T + Sbjct: 326 PVDTTEAPVDTTEAPVETTEVPVDTTEVPVDTTEAPVDTTEVPVETTEVPGETTESPVEK 385 Query: 272 KAAKTSETNARSSETAAGQSASAAAGSKTAAASSASAASTSAGQASASATAAGKSAESAA 331 T A + T A + A T A + + A +A A T A A Sbjct: 386 TEGPVDTTEAPAETTEAPAETTEAPAETTEAPAETTEAPAETTEAPAETTEAPAETTEAP 445 Query: 332 S------------------SASTATTKAGEATEQASAAARSASAAKTSETNAKASETSAE 373 + S S + K A A S + S+ + E Sbjct: 446 ADTTEVCMKICNEFGPCRDSDSGSYCKTWLAPSVCFGIADSGESLCYSDVDDCEGEPIEC 505 Query: 374 SSKTAAASSASSAASSASSASASKDEATRQASAAKSSATTASTKATEAAGSATAAAQSKS 433 S T A + ++ AS+ + A + T + + A T A + A A + Sbjct: 506 SDTTEAPAVETTEASAVKTTEAPAVKTTEAPAVETTEAPAVETTEAPAVETTEAPAVETT 565 Query: 434 TAESAATRAETAAKRAEDIASAVALED 460 A + T A + ED A + Sbjct: 566 EAPAVETTEAPAVETTEDSAVESTVPS 592 >UniRef50_P13390 L-shaped tail fiber protein n=4 Tax=Enterobacteria phage T5 RepID=VLTF_BPT5 Length = 1396 Score = 80.0 bits (195), Expect = 5e-13, Method: Composition-based stats. 
Identities = 78/205 (38%), Positives = 113/205 (55%) Query: 177 STKATEASKSAAAAESSKSAAATSAGAAKTSETNASASLQSAATSASTATTKASEAATSA 236 T S A++ K S + + + +++++S+ S A A S Sbjct: 9 QQMVTMDQNSITASKYPKYTVVLSNSISSITAADVTSAIESSKASGPAAKQSEINAKQSE 68 Query: 237 RDAAASKEAAKSSETNASSSASSAASSATAAGNSAKAAKTSETNARSSETAAGQSASAAA 296 +A S+ A+ S T++ SA+ +ASSATA+ NSAKAAKTSETNA +S+ AA S + AA Sbjct: 69 LNAKDSENEAEISATSSQQSATQSASSATASANSAKAAKTSETNANNSKNAAKTSETNAA 128 Query: 297 GSKTAAASSASAASTSAGQASASATAAGKSAESAASSASTATTKAGEATEQASAAARSAS 356 S ++A+S A+AA SA A S T AG SA++A +S + A A A + A +S + Sbjct: 129 SSASSASSFATAAENSARAAKTSETNAGNSAQAADASKTAAANSATAAKTSETNAKKSET 188 Query: 357 AAKTSETNAKASETSAESSKTAAAS 381 AAKTSETNAK SE A+ A+ Sbjct: 189 AAKTSETNAKTSENKAKEYLDMASE 213 Score = 76.9 bits (187), Expect = 4e-12, Method: Composition-based stats. Identities = 86/225 (38%), Positives = 115/225 (51%) Query: 106 FELMVEEVARNASAVAQNTAAAKKSASDASTSAREAATHAADAADSARAASTSAGQAASS 165 + MV + +A S S +S +A + + + S AA S A S Sbjct: 8 LQQMVTMDQNSITASKYPKYTVVLSNSISSITAADVTSAIESSKASGPAAKQSEINAKQS 67 Query: 166 AQSASSSAGTASTKATEASKSAAAAESSKSAAATSAGAAKTSETNASASLQSAATSASTA 225 +A S A AT + +SA + SS +A+A SA AAKTSETNA+ S +A TS + A Sbjct: 68 ELNAKDSENEAEISATSSQQSATQSASSATASANSAKAAKTSETNANNSKNAAKTSETNA 127 Query: 226 TTKASEAATSARDAAASKEAAKSSETNASSSASSAASSATAAGNSAKAAKTSETNARSSE 285 + AS A++ A A S AAK+SETNA +SA +A +S TAA NSA AAKTSETNA+ SE Sbjct: 128 ASSASSASSFATAAENSARAAKTSETNAGNSAQAADASKTAAANSATAAKTSETNAKKSE 187 Query: 286 TAAGQSASAAAGSKTAAASSASAASTSAGQASASATAAGKSAESA 330 TAA S + A S+ A AS + G + S Sbjct: 188 TAAKTSETNAKTSENKAKEYLDMASELVSPVTQYDWPVGTNNNSV 232 Score = 74.2 bits (180), Expect = 3e-11, Method: Composition-based stats. 
Identities = 75/185 (40%), Positives = 103/185 (55%) Query: 162 AASSAQSASSSAGTASTKATEASKSAAAAESSKSAAATSAGAAKTSETNASASLQSAATS 221 ++ +A +S+ ++ A +S A+ S+ A S A+ S T++ S +A+S Sbjct: 36 SSITAADVTSAIESSKASGPAAKQSEINAKQSELNAKDSENEAEISATSSQQSATQSASS 95 Query: 222 ASTATTKASEAATSARDAAASKEAAKSSETNASSSASSAASSATAAGNSAKAAKTSETNA 281 A+ + A A TS +A SK AAK+SETNA+SSASSA+S ATAA NSA+AAKTSETNA Sbjct: 96 ATASANSAKAAKTSETNANNSKNAAKTSETNAASSASSASSFATAAENSARAAKTSETNA 155 Query: 282 RSSETAAGQSASAAAGSKTAAASSASAASTSAGQASASATAAGKSAESAASSASTATTKA 341 +S AA S +AAA S TAA +S + A S A S T A S A A+ Sbjct: 156 GNSAQAADASKTAAANSATAAKTSETNAKKSETAAKTSETNAKTSENKAKEYLDMASELV 215 Query: 342 GEATE 346 T+ Sbjct: 216 SPVTQ 220 Score = 68.8 bits (166), Expect = 1e-09, Method: Composition-based stats. Identities = 69/216 (31%), Positives = 107/216 (49%) Query: 251 TNASSSASSAASSATAAGNSAKAAKTSETNARSSETAAGQSASAAAGSKTAAASSASAAS 310 T +S +++ S + + + S+ ++ S AA S+ A S A Sbjct: 13 TMDQNSITASKYPKYTVVLSNSISSITAADVTSAIESSKASGPAAKQSEINAKQSELNAK 72 Query: 311 TSAGQASASATAAGKSAESAASSASTATTKAGEATEQASAAARSASAAKTSETNAKASET 370 S +A SAT++ +SA +ASSA+ + A A + A S +AAKTSETNA +S + Sbjct: 73 DSENEAEISATSSQQSATQSASSATASANSAKAAKTSETNANNSKNAAKTSETNAASSAS 132 Query: 371 SAESSKTAAASSASSAASSASSASASKDEATRQASAAKSSATTASTKATEAAGSATAAAQ 430 SA S TAA +SA +A +S ++A S A +AA +SAT A T T A S TAA Sbjct: 133 SASSFATAAENSARAAKTSETNAGNSAQAADASKTAAANSATAAKTSETNAKKSETAAKT 192 Query: 431 SKSTAESAATRAETAAKRAEDIASAVALEDASTTKK 466 S++ A+++ +A+ A ++ S V D Sbjct: 193 SETNAKTSENKAKEYLDMASELVSPVTQYDWPVGTN 228 Score = 66.9 bits (161), Expect = 4e-09, Method: Composition-based stats. 
Identities = 72/189 (38%), Positives = 107/189 (56%) Query: 242 SKEAAKSSETNASSSASSAASSATAAGNSAKAAKTSETNARSSETAAGQSASAAAGSKTA 301 S + + + +S+ S+ +S AA S AK SE NA+ SE A SA+++ S T Sbjct: 32 SNSISSITAADVTSAIESSKASGPAAKQSEINAKQSELNAKDSENEAEISATSSQQSATQ 91 Query: 302 AASSASAASTSAGQASASATAAGKSAESAASSASTATTKAGEATEQASAAARSASAAKTS 361 +ASSA+A++ SA A S T A S +A +S + A + A A+ A+AA SA AAKTS Sbjct: 92 SASSATASANSAKAAKTSETNANNSKNAAKTSETNAASSASSASSFATAAENSARAAKTS 151 Query: 362 ETNAKASETSAESSKTAAASSASSAASSASSASASKDEATRQASAAKSSATTASTKATEA 421 ETNA S +A++SKTAAA+SA++A +S ++A S+ A + AK+S A A Sbjct: 152 ETNAGNSAQAADASKTAAANSATAAKTSETNAKKSETAAKTSETNAKTSENKAKEYLDMA 211 Query: 422 AGSATAAAQ 430 + + Q Sbjct: 212 SELVSPVTQ 220 >UniRef50_Q4L3P2 Similar toputative cell-surface adhesin SdrF n=4 Tax=Staphylococcus RepID=Q4L3P2_STAHJ Length = 1855 Score = 79.6 bits (194), Expect = 7e-13, Method: Composition-based stats. Identities = 71/358 (19%), Positives = 128/358 (35%), Gaps = 17/358 (4%) Query: 141 AATHAADAADSARAASTSAGQAASSAQSASSSAGTASTKATEASKSAAAAESSKSAAATS 200 A A AA+ + + SAQS+S +A T +A+ E + +T+ Sbjct: 41 AGHDVAKAAEDTKTEEGVTSNSDESAQSSSETANTGVETTEQATAEQPTTEEKATEESTT 100 Query: 201 AGAAKTSETNASASLQSAATSASTATTKASEAATSARDAAASKEAAKSSETNASSSASSA 260 + T A+ +S ST E+ T + ++A + S T S+ A Sbjct: 101 ----EQPSTEEKATEESTTEQPSTEEKATEESTT--EQPSTEEKATEESTTEQPSTEEKA 154 Query: 261 ASSATAAGNS--AKAAKTSETNARSSETAAGQSASAAAGSKTAAASSASAASTSAGQAS- 317 + +T S KA+K S T S+E A + ++ S AS S + + Sbjct: 155 SKESTTEQPSTEEKASKESTTEQPSTEEKASKESTTEQPSTEEKASKESTTEQPSTEEKV 214 Query: 318 -------ASATAAGKSAESAASSASTATTKAGEATEQASAAARSASAAKTSETNAKASET 370 S+ S ES ST E+T + S+ AS T+E + +T Sbjct: 215 TEESTTEQSSIEEKASKESTTEQPSTEEKVTEESTTEQSSTEEKASKESTTEQPSTEEKT 274 Query: 371 SAESSKTAAASSASSAASSASSASASKDEATRQASAAKSSATTASTKATEAAGSATAAAQ 430 S E++ + ++ S + S + + + +A S A ++ A+E A A Q Sbjct: 275 SEENTTERSTVIDKDSSQSVQNISQNLGVSNEETISALSDAGVNTSNASEVDAIA-ALIQ 333 Query: 431 SKSTAESAATRAETAAKRAEDIASAVALEDASTTKKGIVQLSSATNSTSETLAATPKA 488 T + + 
+ + K + ++ S+ + A A Sbjct: 334 KDYQNNKNENPVATFSNTSSQATTNNNTRTRTLANKIRIAAAAVAEDNSKIIDADALA 391 >UniRef50_Q1HTS1 S1L n=1 Tax=Squirrel poxvirus RepID=Q1HTS1_9POXV Length = 1258 Score = 78.8 bits (192), Expect = 1e-12, Method: Composition-based stats. Identities = 80/477 (16%), Positives = 154/477 (32%), Gaps = 27/477 (5%) Query: 115 RNASAVAQNTAAAKKSASDASTSAREAATHAADAADSARAASTSAGQAASSAQSASSSAG 174 + A+ + ++ A++A T A A A A + T A A A Sbjct: 553 QQAAKTDKRLRDLEQRATEAETQAARAEARAEAAEAKSAELETQASDAEDRADELQQKTE 612 Query: 175 TASTKATEASKSAAAAESSKSAAATSAGAAKTSETNASASLQSAATSASTATTKASEAAT 234 +ATEA K AA A A + + T A KA E+ Sbjct: 613 ELEKRATEAEKDAARARERVKVAEAKSAELEEKATEAEDRADELEAQVDGLKRKADESEQ 672 Query: 235 SARDAAASKEAAKS----------------SETNASSSASSAASSATAAGNSAKAAKTSE 278 A +A A++ + + + S+ A A+T E Sbjct: 673 RALEAEKDAARARALTEVAEAKAEEFEEKAAAAEDRAEELESKSAVLEAQVEKLEARTDE 732 Query: 279 TNARSSE-TAAGQSASAAAGSKTAAASSASAASTSAGQASASATAAGKSAESAASSASTA 337 +A+ +E + + A T A S + + +A+A + E + Sbjct: 733 LDAQVTELETEKRDLTQKAEELTRKADQLSEQTRDLEEKAAAADERKRYLEKLNEALEK- 791 Query: 338 TTKAGEATEQASAAARSASAAKTSETNAKASETSAESSKTAAASSASSAASSASSASASK 397 KA E ++ ++ + A+ +A+ A AS ++ Sbjct: 792 --KAVECEDRTRELSQKTQGLEEKAAAAETRAEDLAKKLSASEEKARDLERGASRSAEKI 849 Query: 398 DEATRQASAAKSSATTASTKATEAAGSATAAAQSKSTAESAATRAETAAKRAEDIASAVA 457 Q S K A T+A Q E A E + E A + Sbjct: 850 SNLETQNSDLKEKANNLETQAAALEKKTQDLEQKNQDLEKKADDLEQKTQELEKKAEDLK 909 Query: 458 LEDASTTKKG--IVQLSSATNSTSETLAATPKAVKSAYDNAEKRLQK-DQNGADIPDKGC 514 ++ KK + Q + +E L +A + + E+R ++ ++ ++ DKG Sbjct: 910 QKNQDLEKKADDLEQKTQELEKKAEALETDNQAAQQKTEALEERNRELEKTAKELEDKGA 969 Query: 515 FLNN----INAVSKTDFADKRGMRYVRVNAPAGATSGKYYPVVVMRSAGSVSELASR 567 L N + +++ + + + A + + + V + + ++ E A + Sbjct: 970 LLQNQLATMGELTRDLEQRNKSLEDRALTAESKSAEAEKRNVDLEKKNQTLHERAEK 1026 >UniRef50_B5DVG8 GA26604 n=2 Tax=pseudoobscura subgroup RepID=B5DVG8_DROPS Length = 2855 Score = 78.8 bits (192), Expect = 1e-12, Method: Composition-based stats. 
Identities = 57/412 (13%), Positives = 123/412 (29%), Gaps = 2/412 (0%) Query: 110 VEEVARNASAVAQNTAAAKKSASDASTSAREAATHAADAADSARAASTSAGQAASSAQSA 169 V + + + D + +A + +D A S ++ S Sbjct: 1578 VPASDEDKTPDSTEKTMLGAEPEDETATAVPSVGDTSDEEAPATTDVPSKDESEQKPSSV 1637 Query: 170 SSSAGTASTKATEASKSAAAAESSKSAAATSAGAAKTSETNASASLQSAATSASTATTKA 229 + S + T + AA E S + T+ + + +T A+ S + S S Sbjct: 1638 PAEIEPESDRTTTPAPVDAAKEESDEQSTTTPSSDERKDTTAAPSDEK-TPSVSVTPEAE 1696 Query: 230 SEAATSARDAAASKEAAKSSETNASSSASSAASSATAAGNSAKAAKTSETNARSSETAAG 289 + + A A S E + + + A+ S + SSE AG Sbjct: 1697 HDEKSDATAAPVSDEDRTEKPVDEKEPLPAGEDVEKESDIETSTARPSAATSPSSEEEAG 1756 Query: 290 QSASAAAGSKTAAASSASAASTSAGQASASATAAGKSAESAASSASTATTKAGEATEQAS 349 +++ + + A A T++ + AA + + +++ E Sbjct: 1757 DDSASTDKTPSQEAEEKPEAPTTSSEEEDDTAAAATTTPAPSAADKVPDVP-KLPQETPE 1815 Query: 350 AAARSASAAKTSETNAKASETSAESSKTAAASSASSAASSASSASASKDEATRQASAAKS 409 S++ T + + S + + A S+ S+ S + EA + + + Sbjct: 1816 DVLPSSTETSTEQERESTAAPSLDDKEPAVTSAPSADIDSDITTIGPVSEADEKPTEEEK 1875 Query: 410 SATTASTKATEAAGSATAAAQSKSTAESAATRAETAAKRAEDIASAVALEDASTTKKGIV 469 K E + + + E+ A K ++ A+ + Sbjct: 1876 PIEEQKPKEDEKPTEGEKPTEEEQSTEAPAKLPTPEDKIGSQPEDKISATTAAPEEGSTE 1935 Query: 470 QLSSATNSTSETLAATPKAVKSAYDNAEKRLQKDQNGADIPDKGCFLNNINA 521 ST + + + + + D+P + I A Sbjct: 1936 ASDEIVPSTDSQADEETEDKQPSSTAQTPGEKTPETSTDLPSEDADTVTIGA 1987 Score = 73.8 bits (179), Expect = 4e-11, Method: Composition-based stats. 
Identities = 71/427 (16%), Positives = 128/427 (29%), Gaps = 26/427 (6%) Query: 78 YEDSQPGTLNDFLGAMTEDDARPEALRRFELMVEEVARNASAVAQNTAAAKKSASDASTS 137 + + T LGA ED+ + EE +++ + K S+ A Sbjct: 1583 EDKTPDSTEKTMLGAEPEDETATAVPSVGDTSDEEAPATTDVPSKDESEQKPSSVPAEIE 1642 Query: 138 AREAATHAADAADSARAASTSAGQAASSAQSASSSAGTASTKATEASKSAAAAESSKSAA 197 T D+A+ S S+ + S + T + AE + + Sbjct: 1643 PESDRTTTPAPVDAAKEESDEQSTTTPSSDERKDTTAAPSDEKTPSVSVTPEAEHDEKSD 1702 Query: 198 ATSA----------------------GAAKTSETNASASLQSAATSASTATTKASEAATS 235 AT+A K S+ S + SAATS S+ ++A++ Sbjct: 1703 ATAAPVSDEDRTEKPVDEKEPLPAGEDVEKESDIETSTARPSAATSPSSEEEAGDDSAST 1762 Query: 236 ARDAAASKEAAKSSETNASSSASSAASSATAAGNSAKAAKTSETNARSSETAAGQSASAA 295 + + E + T +S A++AT + A K + ET S+ Sbjct: 1763 DKTPSQEAEEKPEAPTTSSEEEDDTAAAATTTPAPSAADKVPDVPKLPQETPEDVLPSST 1822 Query: 296 AGSKTAAASSASAASTSAGQASASATAAG--KSAESAASSASTATTKAGEATEQASAAAR 353 S S +A S + + ++ + S + S A K E + Sbjct: 1823 ETSTEQERESTAAPSLDDKEPAVTSAPSADIDSDITTIGPVSEADEKPTEEEKPIEEQKP 1882 Query: 354 SASAAKTSETNAKASETSAESSKTAAASSASSAASSASSASASKDEATRQASAAKSSATT 413 T K +E + A + S S A + + ++S Sbjct: 1883 KEDEKPTE--GEKPTEEEQSTEAPAKLPTPEDKIGSQPEDKISATTAAPEEGSTEASDEI 1940 Query: 414 ASTKATEAAGSATAAAQSKSTAESAATRAETAAKRAEDIASAVALEDASTTKKGIVQLSS 473 + ++A S + ET+ + A V + + V Sbjct: 1941 VPSTDSQADEETEDKQPSSTAQTPGEKTPETSTDLPSEDADTVTIGAPFAEEDASVPSDE 2000 Query: 474 ATNSTSE 480 STS Sbjct: 2001 KKPSTSA 2007 Score = 64.9 bits (156), Expect = 2e-08, Method: Composition-based stats. 
Identities = 62/409 (15%), Positives = 120/409 (29%), Gaps = 6/409 (1%) Query: 108 LMVEEVARNASAVAQNTAAAKKSASDASTSAREAATHAADAADSARAASTSAGQAASSAQ 167 +E A A + K+ D++T A A D A +A A+ + Sbjct: 2106 KQADEELTPAVTPAAPETSDKEPTVDSTTVAP-AKEEEEDLAATATPATGIKETSEKEPS 2164 Query: 168 SASSSAGTASTKATEASKSAAAAESSKSAAATSAGAAKTSETNASASLQSAATSASTATT 227 S++ + + SA A +K + T + A++AT Sbjct: 2165 VDSTTVAPVKEDDQDLTASATPATDAKEPSEKEPS---VDSTTVVPDKEDDEDLAASATP 2221 Query: 228 KASEAATSARDAAASKEAAKSSETNASSSASSAASSATAAGNSAKAAKTSETNARSSETA 287 S ++ + + A+S + ++ A+ S SE S T+ Sbjct: 2222 ATDAKEPSEKEPSVDVQEAESGTKPPTDASDEQPIDEAASSTSPPTVDESEQPTTSPATS 2281 Query: 288 AGQSASAAAGSKTAAASSASAASTSAGQASASATAAGKSAESAASSASTATTKAGEATEQ 347 AS AG K S + + + S +A T Q Sbjct: 2282 VKDEASTPAGDKVPEDELDVTTIASPAVSDVKQDDTKDTTATTVSPLIDEKEEAVTPTAQ 2341 Query: 348 ASAAARSASAAKTSETNAKASETSAESSKTAAASSASSAASSASSASASKDEATRQASAA 407 + + + + A+ +T A + AA + S+A+ E Sbjct: 2342 DDSKTPIDVSTVSPTSEAEHDQTEAPLDSSTAAPAIPETDSTAAPVDVPSAEEIDTKPME 2401 Query: 408 KSSATTASTKATEAAGSATAAAQSKSTAESAATRAET--AAKRAEDIASAVALEDASTTK 465 + TA+ + T A S+ + + E+ + + TT Sbjct: 2402 DVMSQTAAPAKEDDQTPVTVAPVDHDVEPSSEASEQPPVSEDVGEESTEPASSDAEDTTD 2461 Query: 466 KGIVQLSSATNSTSETLAATPKAVKSAYDNAEKRLQKDQNGADIPDKGC 514 + + + T ATP + + ++P+K Sbjct: 2462 QSPAGAKLKPPTATTTSEATPSEAAVTEADIVPETASPELEKEVPEKST 2510 >UniRef50_A9IXX1 Phage-related protein n=4 Tax=Bartonella RepID=A9IXX1_BART1 Length = 1136 Score = 78.4 bits (191), Expect = 1e-12, Method: Composition-based stats. 
Identities = 76/395 (19%), Positives = 156/395 (39%), Gaps = 9/395 (2%) Query: 113 VARNASAVAQNTAAAKKSASDASTSAREAATHAADAADSARAASTSAGQAASSAQSASSS 172 ++ S A + A + A+ + A+ + AS + ++ +A++S Sbjct: 229 ASQPTSTTANTSTHASQPIPAAANTTTHASQPIPATDHTTTHASHPTPASQPTSTTANTS 288 Query: 173 AGTASTKATEASKSAAAAESSKSAAATSAGAAKTSETNASASLQSAATSASTATTKASEA 232 + +T A+ S A AAA + A S A+ + A+ A + A Sbjct: 289 THASQPTSTTANTS-THASQPTPAAAQAPTHASHSAPAAATTSTHASQPIPAAAQAPTHA 347 Query: 233 ATSARDAAASKEAAKSSETNASSSASSAASSATAAGNSAKAAKTSETNARSSETAAGQ-- 290 + SA AA + A + +++A+ A + N++ A + ++ T A Q Sbjct: 348 SHSAPAAANTTTHASHPTSTTANTATHAPQPTSTTANTSTHASQPTSTTANTSTHASQPT 407 Query: 291 -----SASAAAGSKTAAASSASAASTSAGQASASATAAGKSAESAASSASTATTKAGEAT 345 +++ A+ AAA++++ AS A+ +AT A + A++++ A+ Sbjct: 408 STTATTSTHASHPTPAAATTSTHASHPTPAAANTATHASHPTSTTATTSTHASQPTSTTA 467 Query: 346 EQASAAARSASA-AKTSETNAKASETSAESSKTAAASSASSAASSASSASASKDEATRQA 404 ++ A++ A A A T+ + + +A ++ T A+ SA + + + A A + Sbjct: 468 NTSTHASQPAPATANTATHAPQPAPAAAHTTSTHASHSAPATDHTTTHAPQPTPAAAQAP 527 Query: 405 SAAKSSATTASTKATEAAGSATAAAQSKSTAESAATRAETAAKRAEDIASAVALEDASTT 464 + A + T A+ A AAA + + A A A A A A A + Sbjct: 528 THASQPIPATAQAPTHASHPAPAAANTSTHASHPAPAAANTTTHAPQPAPAAANTSTHAS 587 Query: 465 KKGIVQLSSATNSTSETLAATPKAVKSAYDNAEKR 499 ++ T+++ T AA + ++ Sbjct: 588 HPAPAAANTTTHASHPTPAAANTSTHASQPIPATA 622 >UniRef50_B3XNY6 LPXTG-motif cell wall anchor domain protein n=1 Tax=Lactobacillus reuteri 100-23 RepID=B3XNY6_LACRE Length = 2129 Score = 77.3 bits (188), Expect = 4e-12, Method: Composition-based stats. 
Identities = 86/392 (21%), Positives = 169/392 (43%), Gaps = 10/392 (2%) Query: 113 VARNASAVAQNTAAAKKSASDASTSAREAATHAADAADSARAASTSAGQAASSAQSASSS 172 A NA A+ ++A+++ A A + + SA A SA ++ Sbjct: 108 SASNAVDKAKLLNPTSDDIANANSALTSAQGTVESAKNQLQNDQNSASAAKDDLASAQAT 167 Query: 173 AGTASTKATEASKSAAAAESSKSAAATSAGAAKTSETNASASLQSAATSASTATTKASEA 232 A A++ A SAA+A + ++A + A+ +++ + L A+++ A EA Sbjct: 168 ANNATSAQQSAQASAASASDALNSATKALSDAQNAKSASQDQLNQASSAIDQAQKAYDEA 227 Query: 233 ATSARDAAASKEAA-------KSSETNASSSASSAASSATAAGNSAKAAKTSETNARSSE 285 A +A D S+ +A ++ T+ + +A+++ ++A + A+T+ A ++ Sbjct: 228 AKNATDLTDSQASAVKNLADANANATSVDKAVETASNAVSSANVANDQAQTAFDQASQAQ 287 Query: 286 TAAGQSASAAAGSKTAAASSASAASTSAGQASASATAAGKSAESAASSASTATTKAGEAT 345 + A QSA A +A ++ + A QAS++ A ++ + A + A A A + + Sbjct: 288 SQAAQSAQQAHDDLNSAKTAQAGAQIKLNQASSATAQASEAVKDAQARADQANADAAKQS 347 Query: 346 EQASAAARSASAAKTSETNAKASETSAESSKTAAASSASSAASSASSASASKDEATRQAS 405 + + A ++ A + +A+ + A + A + A S+ SA + +A + Sbjct: 348 QALTDANKTFKQASDTANDAQTAMDKANQNVQVATEQLNHAKSAKQSADQALVDAQAKLK 407 Query: 406 AAKSSATTASTKATEAAGSATAAAQSKSTAESAATRAETAAKRAEDIASAVALEDASTTK 465 A +A A K + +AT Q+ + A+ A T A+ A K A+ + + A Sbjct: 408 QANETAANAQAKLDQLVQNATGDNQAVADAKKALTTAQNAVKDAQAKLDSANIAVADANT 467 Query: 466 K---GIVQLSSATNSTSETLAATPKAVKSAYD 494 K L+SA + AA K K+ D Sbjct: 468 KQSNAQDALTSAQHDAEARQAALTKTQKALND 499 Score = 74.6 bits (181), Expect = 2e-11, Method: Composition-based stats. 
Identities = 86/419 (20%), Positives = 164/419 (39%), Gaps = 7/419 (1%) Query: 107 ELMVEEVARNASAVAQNTAAAKKSASDASTSAREAATHAADAADSARAASTSAGQAASSA 166 + ++++ +NA+ Q A AKK+ + A + ++A A + A+T A + Sbjct: 417 QAKLDQLVQNATGDNQAVADAKKALTTAQNAVKDAQAKLDSANIAVADANTKQSNAQDAL 476 Query: 167 QSASSSAGTASTKATEASKSAAAAESSKSAAATSAGAAKTSETNASASLQSAATSASTAT 226 SA A T+ K+ A++ AA A+ + A+ +L +A ++ A Sbjct: 477 TSAQHDAEARQAALTKTQKALNDAQAQLQAAQQLQDRAEQAVKTANTNLANANSALKAAQ 536 Query: 227 TKASEAATSARDAAASKEAAKSSETNASSSASSAASSATAAGNSAKAAKTSETNARSSET 286 T + A ++ A ++ +AK++ET A+ + +SA + T A N+ A+K + NA + Sbjct: 537 TTQANAQSALDQANSALSSAKATETQANQALASATQAQTNANNALNASKQTLKNANDALA 596 Query: 287 AAGQSASAAAGSKTAAASSASAASTSAGQASASATAAGKSAESAASSASTATTKAGEATE 346 A +AA + T A + A + A ++ T A +A S A + +A + Sbjct: 597 DAKAKQAAANDALTKAQQTVKDADANLASAKSALTDATNKLTNAKS-AVASAQQALDQAN 655 Query: 347 QASAAARSASAAKTSETNAKASETSAESSKTAAASSASSAASSASSASASKDEATRQASA 406 Q A + A+ T + ++ +++ AA A A A+ ++ ++QA + Sbjct: 656 QDIATKTAEVASATKANDDAQADLQTKTTAMQAAQKAV------QDAQATYNQLSKQADS 709 Query: 407 AKSSATTASTKATEAAGSATAAAQSKSTAESAATRAETAAKRAEDIASAVALEDASTTKK 466 ++S + + AA K A + A AE + A Sbjct: 710 LQASIDSYVDNTQIKVPAGMKAAYDKMVARAQAVAAEGGNPLTDSEYIADRKAYMDLAST 769 Query: 467 GIVQLSSATNSTSETLAATPKAVKSAYDNAEKRLQKDQNGADIPDKGCFLNNINAVSKT 525 G+ +N+ + K L DQ +NN+ A T Sbjct: 770 GMDLNKFTSNTLDKQNVIARKGTSEVRSEFFNNLTYDQRLELAQYAVDMINNVRAAYGT 828 >UniRef50_C2CVN0 Putative uncharacterized protein n=1 Tax=Gardnerella vaginalis ATCC 14019 RepID=C2CVN0_GARVA Length = 3104 Score = 76.9 bits (187), Expect = 4e-12, Method: Composition-based stats. 
Identities = 71/420 (16%), Positives = 134/420 (31%), Gaps = 32/420 (7%) Query: 125 AAAKKSASDASTSAREAATHAADAADSARAASTSAGQAASSAQSASSSAGTASTKATEAS 184 A ++ SA+E + D+ A A S+ S T +S Sbjct: 135 ATFDRTTRQMVGSAKEYSVKNTDSNSIAEGVENQETVAGDSSVGVKDSGEYFVTNVDSSS 194 Query: 185 KSAAA-----------AESSKSAAATSAGAAKTSETNASASLQSAATSASTATTKASEAA 233 S + A S+ S A SA T ++S + A S + ++ Sbjct: 195 PSNRSFDTNNFEDRSAASSADSNADNSATKTATKGKQDTSSSEIAKPVKSAKKSDTTQTK 254 Query: 234 TSARDAAASKEAAKSSETNASSSASSAASSATAAGNSAKAAKTSETNA----RSSETAAG 289 + A + + A+ ++NAS A A + + ++S T Sbjct: 255 DEKTNPAETGKVAEKPKSNASVPAHKTTEKPAVAKPKVENSVKPADTKPADTKASYTKPA 314 Query: 290 QSASAAAGSKTAAASSASAASTSAGQASASATAAGKSAESAASSASTATTKAGEATEQAS 349 + +A ++ + ASA + + + A A SA++ + T+ + Q S Sbjct: 315 AAPAAKQSAQPETSQPASAKTVAKTKPEAKAEPNQNSAQTEHAENKQPATE-QNTSAQPS 373 Query: 350 AAARSASAAKTSETNAKASETSAESSKTAAASSASSAASSASSASASKDEATRQASAAKS 409 + + E K + ++ SS ++++ A + + S Sbjct: 374 STHEENANTVNDENKPKVRSRRSVDESANGSNKGSSVTPASNAPETLSASAPSAGTESSS 433 Query: 410 SATTASTKATEAAGSATAAAQSKSTAESAATRAETAAKRAEDIASAVALEDASTTKKGIV 469 A ++ + E G+A ++ + T TA ++ AV + K Sbjct: 434 GAGNTASPSAEGNGNAQGGQGAQGQSAENPTAQGTAQGSHDNAQGAVTGTGVTGKDKSAT 493 Query: 470 QLSSAT-NSTSETLAATPKA---------------VKSAYDNAEKRLQKDQNGADIPDKG 513 +S T S T A T +K DN E + + N DK Sbjct: 494 PSASDTTPSPDSTPAGTQNTGTTTNNAATDPNSQVIKPNTDNPEAKSTEQANEQVQTDKK 553 Score = 50.7 bits (119), Expect = 3e-04, Method: Composition-based stats. 
Identities = 73/436 (16%), Positives = 137/436 (31%), Gaps = 15/436 (3%) Query: 148 AADSARAASTSAGQA-ASSAQSASSSAGTASTKATEASKSAAAAESSKSAAATSAGAAKT 206 + SA+ S S + A S +S +A TA ++ + SA+ +A A ++ + Sbjct: 17 SIKSAKHKSVSGDASNAKSVKSVKKNAKTAKRSSSSTATSASGHRLGVAAVALASTLSMV 76 Query: 207 SETNASASLQSAATSASTATTKASEAATSARDAAASKEAAKSSETNASSSASSAASSATA 266 A+ + S + + S A TS + A ++ Sbjct: 77 LPGAAALARTSFNPDSFDPSYVNSVAKTSGKGAGNGNGKGDGKGDGNGLYKHGLLNTGAT 136 Query: 267 AGNSAKAAKTSETNARSSETAAGQSASAAAGSKTAAASSASAASTSAGQASASATAAGKS 326 + + S T + A +T A S+ S + ++ S Sbjct: 137 FDRTTRQMVGSAKEYSVKNTDSNSIAEGVENQETVAGDSSVGVKDSGEYFVTNVDSSSPS 196 Query: 327 AESAASSASTATTKAGEATEQASAAARSASAAKTSETNAKASETSAESSKTAAASSASSA 386 S + A+ S A SA+ T +S A+ K+A S + Sbjct: 197 NRSFD---TNNFEDRSAASSADSNADNSATKTATKGKQDTSSSEIAKPVKSAKKSDTTQT 253 Query: 387 ASSASSASASKDEATRQASAAKSSATTASTKATEAAGSATAAAQSKSTAESAATRAETAA 446 ++ + + A + S A A + K A +++ + A T+ Sbjct: 254 KDEKTNPAETGKVAEKPKSNASVPAHKTTEKPAVAKPKV------ENSVKPADTKPADTK 307 Query: 447 KRAEDIASAVALEDASTTKKGIVQLSSATNSTSETLAATPKAVKSAYDNAEKRLQKDQNG 506 A+A A + ++ + + T A P + ++AE + + Sbjct: 308 ASYTKPAAAPAAKQSAQPETSQPASAKTVAKTKPEAKAEPNQNSAQTEHAENKQPATEQN 367 Query: 507 ADIPDKGCFLNNINAVSKTDFADKRGMRYV-----RVNAPAGATSGKYYPVVVMRSAGSV 561 N N V+ + R R V N + T P + SA S Sbjct: 368 TSAQPSSTHEENANTVNDENKPKVRSRRSVDESANGSNKGSSVTPASNAPETLSASAPSA 427 Query: 562 SELASRVIITTATRTA 577 +S TA+ +A Sbjct: 428 GTESSSGAGNTASPSA 443 >UniRef50_UPI0001C37D90 hypothetical protein RflaF_18851 n=1 Tax=Ruminococcus flavefaciens FD-1 RepID=UPI0001C37D90 Length = 828 Score = 76.1 bits (185), Expect = 7e-12, Method: Composition-based stats. 
Identities = 76/408 (18%), Positives = 135/408 (33%), Gaps = 21/408 (5%) Query: 131 ASDASTSAREAATHAADAADSARAASTSAGQAASSAQSASSSAGTASTKATEASKSAAAA 190 SDA + A +A + + + + AS +S A + +K A Sbjct: 98 DSDAGKGQKAPEAAPAAVEKAADSDAALKEMFSIKPEQASQKPAESSMAAIDLTKPAENK 157 Query: 191 ESSKSAAATSAGAAKTSETNASASLQ-SAATSASTATTKASEAATSARDAAASKEAAKSS 249 + A +A K + A + Q A + + KA + A ++ A+++A + Sbjct: 158 AVELAKAPQTAEQPKEEKAAAEKAEQPKPAENKAVELVKAPQTAEQPKEEKAAEKAEQPK 217 Query: 250 ETNASSSASSAASSATAAGNSAKAAKTSETNARSSETAAGQSASAAAGSKTAAASSASAA 309 T + + A KAA + +E A + A A ++ A+ Sbjct: 218 PTENKAVELAKAPQTAEQPKEEKAAAEKAEQPKPAENKAVELAKAPQTTEQPKEEKAAEN 277 Query: 310 STSAGQASASATAAGKSAESA-ASSASTATTKAGEATEQASAAARSASAAKTSETNAKAS 368 + + A A KA +A ++A A K + A+ Sbjct: 278 DAAEANSKRPHKKGKTVKVKARAVHVKAVEAKAEKAIDKADEKKPVEPAKKPEQPKEMAA 337 Query: 369 ETSAESSKTAAASSASSAASSASSASA---------------SKDEATRQASAAKSSATT 413 ++ K AA + + AA + +A + + E T+ A A Sbjct: 338 IEPPKAEKDAAKPAENKAAEQSKAAESKPVELAKKPEQPKEMAAIEPTKAEKEAAKPAEN 397 Query: 414 --ASTKATEAAGSATAAAQSKSTAESAATRAETAAKRAEDIASAVALEDASTTKKGIVQL 471 A + + A + + E AA A K A + A A E + +K V+ Sbjct: 398 KAAEQSKADESKPVEPAKKPEQPKEMAAIEPSKAEKEAANPAENKAAEQSKADEKKPVEP 457 Query: 472 SSATNSTSETLAAT-PKAVKSAYDNAEKRLQKDQNGADIPDKGCFLNN 518 + E A PKA K A AE + +Q+ AD + Sbjct: 458 AKMPEQPKEMAAIEPPKAEKEAAKPAENKAA-EQSKADESKPVQLAKD 504 >UniRef50_Q4QLS0 IgA-specific serine endopeptidase n=6 Tax=Haemophilus influenzae RepID=Q4QLS0_HAEI8 Length = 1794 Score = 75.7 bits (184), Expect = 1e-11, Method: Composition-based stats. 
Identities = 69/469 (14%), Positives = 151/469 (32%), Gaps = 47/469 (10%) Query: 99 RPEALRRFELMVEEVARNASAVAQNTAAAKKSASDASTSAREAATHAADAADSARAASTS 158 PE RR + + A+ + + + + + + A A + A + Sbjct: 1020 NPEVERRNQTVDTPSIATANNMQADVPSVSNNHEETARVEAPIPLPAPPAPATGSAMANE 1079 Query: 159 AGQAASSAQSASSSAGTAST--KATEASKSAAAAESSKSAAATSAGAAKTSETNASASLQ 216 + + + T +T +E A+ S + S K +E + S Sbjct: 1080 QPETRPAETVQPTMEDTNTTHPSGSEPQADTTQADDPNSESVPSETIEKVAENSPQESET 1139 Query: 217 SAATSASTATTKASEAATSARDAAASKEAAKSSETNASSSASSAASSATAAGNSAKAAKT 276 A + AT ++ A++A + EA + A + + + + ++ T Sbjct: 1140 VA-KNEQKATETTAQNDEVAKEAKPTVEANTQTNELAQNGSETEETQEAETARQSEINST 1198 Query: 277 SETNARSSETAAGQSASAAAGS-----------KTAAASSASAASTSAGQASASATAAGK 325 ET T + + A + + A Q + A+ + Sbjct: 1199 EETVVEDDPTISEPKSRPRRSISSSSNNINLAGTEDTAKVETEKTQEAPQVAFQASPKQE 1258 Query: 326 SAESAASS----ASTATTKAGEATEQASAA------------------ARSASAAKTSET 363 E A + + T+QA A + +A +T+ + Sbjct: 1259 EPEMAKQQEQPKTVQSQAQPETTTQQAEPARENVSTVNNVKEAQPQAKPTTVAAKETTAS 1318 Query: 364 NAKASETSAESSKTAAASSASSAASSASSASASKDEATRQASAAKS---SATTAST---- 416 N++ ET+ + A + + S + + + + A + + A+T Sbjct: 1319 NSEQKETAQPVANPKTAENKAENPQSTETTDENIHQPEAHTAVASTEVVTPENATTPIKP 1378 Query: 417 ---KATEAAGSATAAAQSKSTAESAATRAETAAKRAEDIASAVALEDASTTKKGIVQLSS 473 K TEA T + T + ++ A++ + T + +V S Sbjct: 1379 VENKTTEAEQPVTETTTVSTENPVVKNPENTTPATTQSTVNSEAVQSETATTEAVVSQSK 1438 Query: 474 ATNSTSETLAATP-KAVKSAYDNAEKRLQKDQNGADIPDKGCFLNNINA 521 T++ T+A+T V ++ + R ++ + A + L+ NA Sbjct: 1439 VTSAEETTVASTQETTVDNSGSTPQPRSRRTRRSAQNSYEPVELHTENA 1487 >UniRef50_Q7Q4S4 AGAP000893-PA n=1 Tax=Anopheles gambiae RepID=Q7Q4S4_ANOGA Length = 2727 Score = 75.3 bits (183), Expect = 1e-11, Method: Composition-based stats. 
Identities = 57/398 (14%), Positives = 116/398 (29%), Gaps = 13/398 (3%) Query: 91 GAMTEDDARPEALRRFELMVEEVARNASAVAQNTAAAKKSASDASTSAREAATHAADAAD 150 G EDDA+PE ++ R + VA K+ A+ ST + + + A D Sbjct: 1104 GQRGEDDAKPEDTTSGVETDQQDKRVTTVVADEVEDDKEQATTVSTISSDKQQEVSKADD 1163 Query: 151 SARAASTSAGQAASSAQSASSSAGTASTKATEASKSAAAAESSKSAAATSAGAAKTSETN 210 +S+ A S + A T A + + S+ +E + Sbjct: 1164 EVTTSSSVAEDTPVSVEQDEKLEQPAPTTVRTVEADEDTAVAQEQDKPQSSDD--QAEDD 1221 Query: 211 ASASLQSAATSASTATTKASEAATSARDAAASKEAAKSSETNASSSASSAASSATAAGNS 270 S + ++ + E A E A E + + + + T + Sbjct: 1222 DEPSSTTVRADKPSSVEQDEEGPAQEATTARVTETATEQEESVTEFETKRKTIQTTVAPT 1281 Query: 271 AKAAKTSETNARSSETAAGQSASAAAGSKTAAASSASAASTSAGQASASATAAGKSAESA 330 + + + ++ + A S AA + ES Sbjct: 1282 TVSEQQDDL-------EKTETTTQPAASAAAADELQEEEHQEVKLTDEPELNEDDAQEST 1334 Query: 331 ASSASTATTKAGEATEQASAAARSASAAKTSETNAKASETSAESSKTAAASSASSAASSA 390 A++ S T++ +T+ + ++ + + S+ +S+ + + +A Sbjct: 1335 AATTSRPTSQEEASTQYDHVPDSTHKPSEAHDDEQPGTTVSSVTSEKESDVPVAVTVQAA 1394 Query: 391 SSASASKDEATRQASAAKSSATTASTKATEAAGSATAAAQSKSTAESAATRAETAAKRAE 450 +D + + A S TT+S +A + E + A Sbjct: 1395 MEQEDDEDVSKTTEATAAHSTTTSSNAVDDAIIYRVDEEEDNKPIVKPTLEEEVSTSHAV 1454 Query: 451 DIASAVALEDASTTKKGIVQLSSATNSTSETLAATPKA 488 K V T + +P Sbjct: 1455 QPTHDEVKPVEDAAKPQAVD----AQETDDDNLISPVG 1488 >UniRef50_C4KLT7 Hep_Hag family protein n=45 Tax=Proteobacteria RepID=C4KLT7_BURPS Length = 1185 Score = 75.3 bits (183), Expect = 1e-11, Method: Composition-based stats. 
Identities = 68/378 (17%), Positives = 187/378 (49%), Gaps = 4/378 (1%) Query: 115 RNASAVAQNTAAAKKSASDASTSAREAATHAADAADSARAASTSAGQAAS----SAQSAS 170 NASA +N+ A ++ + +++ T++ + D++ A+ T+A + + ++ Sbjct: 489 TNASATGENSTATGTDSTASGSNSTANGTNSTASGDNSTASGTNASASGENSTATGTDST 548 Query: 171 SSAGTASTKATEASKSAAAAESSKSAAATSAGAAKTSETNASASLQSAATSASTATTKAS 230 +S ++ T ++ S + +S + A+ + + + T+++AS ++ + + +T Sbjct: 549 ASGSNSTANGTNSTASGDNSTASGTNASATGENSTATGTDSTASGSNSTANGTNSTASGD 608 Query: 231 EAATSARDAAASKEAAKSSETNASSSASSAASSATAAGNSAKAAKTSETNARSSETAAGQ 290 + S +A+A+ E + ++ T++++S S++ ++ T + S + S TNA ++ + Sbjct: 609 NSTASGTNASATGENSTATGTDSTASGSNSTANGTNSTASGDNSTASGTNASATGENSTA 668 Query: 291 SASAAAGSKTAAASSASAASTSAGQASASATAAGKSAESAASSASTATTKAGEATEQASA 350 + + + S + + ++ + ++ S ++AS T A + E++ ++ + +T +T + Sbjct: 669 TGTDSTASGSNSTANGTNSTASGDNSTASGTNASATGENSTATGTDSTASGSNSTANGAN 728 Query: 351 AARSASAAKTSETNAKASETSAESSKTAAASSASSAASSASSASASKDEATRQASAAKSS 410 + S + S TNA A+ ++ ++ TA+ +S S++ ++ ++++AS + +T + A ++ Sbjct: 729 STASGDNSTASGTNASATGENSTATGTASTASGSNSTANGTNSTASGNNSTASGTNASAT 788 Query: 411 ATTASTKATEAAGSATAAAQSKSTAESAATRAETAAKRAEDIASAVALEDASTTKKGIVQ 470 ++ T++A S T + + + + ++ + + A ++T G Sbjct: 789 GENSTATGTDSAASGTNSTANGTNSTASGDNSTASGTNASATGENSTATGTASTASGSNS 848 Query: 471 LSSATNSTSETLAATPKA 488 ++ NST+ AT Sbjct: 849 TANGANSTASGAGATATG 866 Score = 70.3 bits (170), Expect = 4e-10, Method: Composition-based stats. 
Identities = 79/440 (17%), Positives = 195/440 (44%), Gaps = 5/440 (1%) Query: 112 EVARNASAVAQNTAAAKKSASDASTSAREAATHAADAADSARAASTSAGQAASSAQSASS 171 N + + + S ++AS S + D+ S ++ + + +S ++++ Sbjct: 427 NSTANGTNSTASGDNSTASGTNASASGENSTATGTDSTASGSNSTANGTNSTASGDNSTA 486 Query: 172 SAGTASTKATEASKSAAAAESSKSAAATSAGAAKTSETNASASLQSAATSASTATTKASE 231 S AS ++ + + +S S + + + S N++AS +A+ S +T ++ Sbjct: 487 SGTNASATGENSTATGTDSTASGSNSTANGTNSTASGDNSTASGTNASASGENSTATGTD 546 Query: 232 AATSARDAAASKEAAKSSETNASSSASSAASS---ATAAGNSAKAAKTSETNARSSETAA 288 + S ++ A+ + +S N+++S ++A+++ +TA G + A+ ++ T ++ TA+ Sbjct: 547 STASGSNSTANGTNSTASGDNSTASGTNASATGENSTATGTDSTASGSNSTANGTNSTAS 606 Query: 289 GQSASAAAGSKTAAASSASAASTSAGQASASATAAGKSAESAASSASTATTKAGEATEQA 348 G +++A+ + +A +++A T + + +++TA G ++ ++ +++ + T A E + Sbjct: 607 GDNSTASGTNASATGENSTATGTDSTASGSNSTANGTNSTASGDNSTASGTNASATGENS 666 Query: 349 SAAARSASAAKTSETNAKASETSAESSKTAAASSASSAA--SSASSASASKDEATRQASA 406 +A ++A+ ++ T + T++ + TA+ ++AS+ S+A+ ++ + A+ Sbjct: 667 TATGTDSTASGSNSTANGTNSTASGDNSTASGTNASATGENSTATGTDSTASGSNSTANG 726 Query: 407 AKSSATTASTKATEAAGSATAAAQSKSTAESAATRAETAAKRAEDIASAVALEDASTTKK 466 A S+A+ ++ A+ SAT + + S A+ + + A AS + T Sbjct: 727 ANSTASGDNSTASGTNASATGENSTATGTASTASGSNSTANGTNSTASGNNSTASGTNAS 786 Query: 467 GIVQLSSATNSTSETLAATPKAVKSAYDNAEKRLQKDQNGADIPDKGCFLNNINAVSKTD 526 + S+AT + S A + + A + + + Sbjct: 787 ATGENSTATGTDSAASGTNSTANGTNSTASGDNSTASGTNASATGENSTATGTASTASGS 846 Query: 527 FADKRGMRYVRVNAPAGATS 546 + G A A AT Sbjct: 847 NSTANGANSTASGAGATATG 866 Score = 55.7 bits (132), Expect = 1e-05, Method: Composition-based stats. 
Identities = 79/389 (20%), Positives = 183/389 (47%), Gaps = 6/389 (1%) Query: 115 RNASAVAQNTAAAKKSASDASTSAREAATHAADAADSARAASTSAGQAASSAQSASSSAG 174 +++A N+ A +++ + ++ + T+A+ +++ A T + + S++ + +++ Sbjct: 629 TDSTASGSNSTANGTNSTASGDNSTASGTNASATGENSTATGTDSTASGSNSTANGTNST 688 Query: 175 TASTKATEASKSAAAAESSKSAAATSAGAAKTSETNASASLQSAATSASTATTKASEAA- 233 + +T + +A+A + +A T + A+ ++ T A+ ++ +++ + T AS Sbjct: 689 ASGDNSTASGTNASATGENSTATGTDSTASGSNSTANGANSTASGDNSTASGTNASATGE 748 Query: 234 -TSARDAAA----SKEAAKSSETNASSSASSAASSATAAGNSAKAAKTSETNARSSETAA 288 ++A A+ S A + + AS + S+A+ + +A A +++ A + + A Sbjct: 749 NSTATGTASTASGSNSTANGTNSTASGNNSTASGTNASATGENSTATGTDSAASGTNSTA 808 Query: 289 GQSASAAAGSKTAAASSASAASTSAGQASASATAAGKSAESAASSASTATTKAGEATEQA 348 + S A+G + A+ + ++A+ A+ +A+ A S +A + STA+ AT + Sbjct: 809 NGTNSTASGDNSTASGTNASATGENSTATGTASTASGSNSTANGANSTASGAGATATGEN 868 Query: 349 SAAARSASAAKTSETNAKASETSAESSKTAAASSASSAASSASSASASKDEATRQASAAK 408 +AA + + A + +A + ++A + A+ S+A + S+AS + AT + +AA Sbjct: 869 AAATGAGATATGNNASASGTSSTAGGANAIASGENSTANGANSTASGAGATATGENAAAT 928 Query: 409 SSATTASTKATEAAGSATAAAQSKSTAESAATRAETAAKRAEDIASAVALEDASTTKKGI 468 + TA+ A+G+++ A + + A + A A A S+ E A+ G Sbjct: 929 GAGATATGNNASASGTSSTAGGANAIASGENSTANGANSTASGNGSSAFGESAAAAGDGS 988 Query: 469 VQLSSATNSTSETLAATPKAVKSAYDNAE 497 L S ++ AT ++ N+ Sbjct: 989 TALGSNAVASGVGSVATGAGSVASGANSS 1017 >UniRef50_C4Y9G0 Putative uncharacterized protein n=1 Tax=Clavispora lusitaniae ATCC 42720 RepID=C4Y9G0_CLAL4 Length = 1008 Score = 75.0 bits (182), Expect = 2e-11, Method: Composition-based stats. 
Identities = 73/405 (18%), Positives = 147/405 (36%), Gaps = 7/405 (1%) Query: 108 LMVEEVARNASAVAQNTAAAKKSASDASTSAREAATHAADAADSARAASTSAGQAASSAQ 167 ++ A + +Q + ++A S + + A + + + S + + +A Sbjct: 462 QTAPSGSQTAPSGSQTVPSGSQTAPSGSQTVPSGSQTAPSGSQTVPSGSQTVPSGSQTAP 521 Query: 168 SASSSAGTASTKATEASKSAAAAESSKSAAATSAGAAKTSETNASASLQSAATSASTATT 227 S S +A + S S++A + + + + +A + S+T S S + + S + + Sbjct: 522 SGSQTAPSGSQTVPSGSQTAPSGSQTAPSGSQTAPSG--SQTVPSGSQTAPSGSQTVPSG 579 Query: 228 KASEAATSARDAAASKEAAKSSETNASSSASSAASSATAAGNSAKAAKTSETNARSSETA 287 + + S + S+ A S+T S S ++ + S TA S S+T S+TA Sbjct: 580 SQTVPSGSQTAPSGSQTAPSGSQTVPSGSQTAPSGSQTAPSGSQTVPSGSQTAPSGSQTA 639 Query: 288 AGQSASAAAGSKTAAASSASAASTSAGQASASATAAGKSAESAASSASTATTKAGEATEQ 347 S + +GS+TA + S + S S S S T S + + S + + + Sbjct: 640 PSGSQTVPSGSQTAPSGSQTVPSGSQTAPSGSQTVPSGSQTAPSGSQTVPSGSQTAPSGS 699 Query: 348 ASAAARSASAAKTSETNAKASETSAESSKTAAASSASSAASSASSASASKDEATRQASAA 407 + + S +A S+T S+T A S S + + + + + T + Sbjct: 700 QTVPSGSQTAPSGSQTVPSGSQT-APSGSQTVPSGSQTYPHTTTLTVPTGSSHTIPGTFT 758 Query: 408 KSSATTASTKATEAAGSATAAAQSKSTAESAATRAETAAKRAEDIASAVALEDASTTKKG 467 S + T +GS + A +S + + Sbjct: 759 PSECSDTCTIIVVPSGSQSIPGTPVVPTTPAIPSQPLTTITVPTCSSYTETGTFTPSDCA 818 Query: 468 IVQLSSATNSTSETL----AATPKAVKSAYDNAEKRLQKDQNGAD 508 V + TS + ATP ++ + E + QN + Sbjct: 819 EVSTCTVIVITSGSAFTESTATPTGSQTELTHTEGTSTEAQNTSG 863 >UniRef50_C8PMD0 Putative antigen protein n=1 Tax=Treponema vincentii ATCC 35580 RepID=C8PMD0_9SPIO Length = 592 Score = 74.2 bits (180), Expect = 3e-11, Method: Composition-based stats. 
Identities = 64/264 (24%), Positives = 104/264 (39%), Gaps = 10/264 (3%) Query: 71 HAGTITVYEDSQPGTLNDFLGAMTEDDARPEALRRFELMVEEVARNASAVAQNTAAAKKS 130 ++GT++ + S N + M E D + A R + M++ R +SA A++ Sbjct: 190 YSGTLSTIDTSSLSDKN-VVDKMREQDDKDIATR--KDMIDLKERESSAARDRANVAQQD 246 Query: 131 ASDASTSAREAATHAADAADSARAASTSAGQAASSAQSASSSAGTASTKATEASKSAAAA 190 A A +A AA A + +A Q+ A+ A A A +A ++ K AAAA Sbjct: 247 ADAARQNAGTRQNEAAAVQREADKSKAAAAQSKQDAEKARKDAEDAKKQAAQSEKDAAAA 306 Query: 191 ESSKSAAA---TSAGAAKTSETNASASLQSAATSASTATTKASEAATSARDAAASKEAAK 247 +A A A+ + Q A+ A KA+EA S ++AAA ++ A+ Sbjct: 307 RQQAQRNPNDRKAAEEAAQKRQEAARNRQEASDKDKAAAEKANEAKKSDQEAAAKQKEAE 366 Query: 248 SSETNASSSASSAASSATAAGNSAKAAKTSETNARSSETAAGQSASAAAGSKTAAASSAS 307 + A +A A A + + A E A+S +A + Sbjct: 367 EKQKAADEAAKQAQDKEREAASEKQFADAKEQEAQSDRK----DVAADTRKIIEEKRAER 422 Query: 308 AASTSAGQASASATAAGKSAESAA 331 A A ASA A K +S + Sbjct: 423 KAQDEAAFASALPGAMLKVVDSGS 446 >UniRef50_C5DNW3 ZYRO0A12012p n=1 Tax=Zygosaccharomyces rouxii RepID=C5DNW3_ZYGRO Length = 645 Score = 73.8 bits (179), Expect = 3e-11, Method: Composition-based stats. 
Identities = 55/348 (15%), Positives = 105/348 (30%), Gaps = 5/348 (1%) Query: 166 AQSASSSAGTASTKATEASKSAAAAESSKSAAATSAGAAKTSETNASASLQSAATSASTA 225 A A + + + ++ + Q Sbjct: 284 ADKAETMGEKKEEEPKKEAEEPKIEAEEPKKEVEEPKKETEEPKTEGYTKQEEEIKEQQT 343 Query: 226 TTKASEAATSARDAAASK--EAAKSSETNASSSASSAASSATAAGNSAKAAKTSETNARS 283 + SEA ++ EAA + T A + + A + A+ ETN + Sbjct: 344 EKEVSEAKEDVNTDKLNEAGEAATETTTEVKEDAPQGDQNVSEAKKEEEPAQVPETNEKV 403 Query: 284 SETAAGQSASAAAGSKTAAASSASAASTSAGQASASATAAGKSAE---SAASSASTATTK 340 E A + + +K AS A A + + +A A + + A A+ Sbjct: 404 VEEAKPEVDATIQANKEPEASQAGAQAEAVSSEAAEPKEASTEEKLDVTEAKEATKPEET 463 Query: 341 AGEATEQASAAARSASAAKTSETNAKASETSAESSKTAAASSASSAASSASSASASKDEA 400 E E S A A + +A ++ ++TA + ++ A+ A KD + Sbjct: 464 KEETIEAKSGTQEEAPEAVEEKPDASDAKPEEAKNETADEKTGANEAAELKPDEAKKDTS 523 Query: 401 TRQASAAKSSATTASTKATEAAGSATAAAQSKSTAESAATRAETAAKRAEDIASAVALED 460 + + T A + +A +A +S A E+ A +++ E Sbjct: 524 EVKPDEVEEKTTEAKPEEVKAETAAVQPEESNQEESKAEVTPESNADESKESPKESQPET 583 Query: 461 ASTTKKGIVQLSSATNSTSETLAATPKAVKSAYDNAEKRLQKDQNGAD 508 A T + +E+ ATP+ G+ Sbjct: 584 AETKPSETQDNLETSQPKTESQEATPEPSSQENTQTPTPKPTPSAGSG 631 >UniRef50_Q4PI38 Putative uncharacterized protein n=1 Tax=Ustilago maydis RepID=Q4PI38_USTMA Length = 1014 Score = 73.8 bits (179), Expect = 4e-11, Method: Composition-based stats. 
Identities = 85/440 (19%), Positives = 155/440 (35%), Gaps = 18/440 (4%) Query: 81 SQPGTLNDFLGAMTEDDARPEALRRFELMVEEVARNASAVAQNTAAAKKSASDASTSARE 140 S+P +LN G + AL R +V+R S A A K T A Sbjct: 200 SRPSSLNRLSGRREPNLLHSPALSR------DVSRERSEAASPAPAHLKP-----TGASY 248 Query: 141 AATHAADAADSARAASTSAGQAASSAQSASSSAGTASTKATEASKSAAAAESSKSAAATS 200 AT A AD A S + + + + + +SS + A + Sbjct: 249 DATTKARRADRAITDSPQRTSLNRTNYKSVELEEDGQLPSDDDQPEKKSQQSSPAIAGSE 308 Query: 201 AGAAKTSETNASASLQSAATSASTATTKASEAATSARDAAASKEAAKSSETNASSSASSA 260 A + K S+ L A + S + AT R A S A A+ + S Sbjct: 309 AASRKPSKLTTRQRLAKATKAGSRLLAASERRATPERWTAPSATQASPPSKLATPAKVST 368 Query: 261 ASSATAAGNSAKAAKTSETNARSSETAAGQSASAAAGSKTAAASSASAASTSAGQASASA 320 + S+ + +A ++ + + S AA S + A+ + + ++ + + Sbjct: 369 SESSGSNPLAASLQRSPSKTNPEETSPSASSRRAAPASSSYRANEEAGSRSTCQERNRRP 428 Query: 321 TAAGKSAESAASSASTATTKAGEA---TEQASAAARSASAAKTSETNAKASETSAESSKT 377 +AA +A + A + + +A RS S++ +AK S S ++S+ Sbjct: 429 SAASTAATNLAREKPLTAAEKSSGALLSRRADEPTRSGSSSTPKTRSAKDSMGSNKTSEP 488 Query: 378 AAASSASSAASSASSASASKDEATRQASAAKSSATTASTKATEAAGSATAAAQSKSTAES 437 AA+ + +S S A +E A +A T SA A SK+ A S Sbjct: 489 TAAAGDRRSQTSESKTKADSNERKVSTKQAAQAAVQVVTMKKAKPSSACEARSSKAAASS 548 Query: 438 AATRAETAAKRAEDIASAVALEDASTTKKGIVQLSSATNSTSETLAATPKAVKSAYDNAE 497 ++ E + + A++ ++ + ST+ + + ++ D A+ Sbjct: 549 RPSK----RSEPESCEQERSAKAAASKREQPPADKRVSTSTTSSSSTKANGSQTQIDRAD 604 Query: 498 KRLQKDQNGADIPDKGCFLN 517 +K + D F Sbjct: 605 PLPRKRRRTDDFSSSDSFAR 624 >UniRef50_A5KAV0 Merozoite surface protein 3 gamma (MSP3g), putative n=1 Tax=Plasmodium vivax RepID=A5KAV0_PLAVI Length = 845 Score = 73.4 bits (178), Expect = 4e-11, Method: Composition-based stats. 
Identities = 90/380 (23%), Positives = 144/380 (37%), Gaps = 44/380 (11%) Query: 116 NASAVAQNTAAAKKSASDASTSAREAATHAADAADSARAASTSAGQAASSAQSASSSAGT 175 N + Q+TA A+KSAS AS +EAA + A A +++ Sbjct: 253 NVTEAVQSTAEAEKSASGASAKVKEAAHNVAKKLMDAVQKLEKVSTELPKDNEQAATIDN 312 Query: 176 ASTKATEASKSAAAAE--------------------SSKSAAATSAGAAKTSETNASASL 215 + TEA K A ++ A + A+ N Sbjct: 313 VNEVVTEAVKEKEKAMISAEVAKAEAANAEAQLAKIEAERAKYEANKIAEEYTDNVKGEA 372 Query: 216 QSAATSASTATTKASEAATSARDAAASKEAAKSSETNASSSASSAASSATAAGNSAKAA- 274 + A A+ A++KA+EA+ +A+ A+ ++ N + A A +A A + A Sbjct: 373 KKAEEKANEASSKATEASNNAKGASGEEKQTNPQAANVKAKAGEAIKAAKEAKKAKTEAY 432 Query: 275 ----KTSETNARSSETAAGQSASAAAGSKTAAASSASAASTSAGQASASATAAGKSAESA 330 T A+ + A Q A A T AA A A A +A A++A Sbjct: 433 IALCVTKTLVAKENAKKAEQEAKNAKDKATKAAKEAEEAKKQAEKAEKITETVKNEAKTA 492 Query: 331 ASSASTATTKAGEATEQASAAARSASAAKTSETNAKASETSAESSKTAAASSASSAASSA 390 + A+T +A A A AK + +A K + + Sbjct: 493 TDEEAKASTGKKDAEINAGYVDEEVYAVNIEFEIAKEAAKTAAQHK-----ALEILDKAE 547 Query: 391 SSASASKDEATRQASAAKSSATTASTKATEAAGSATAAAQSKSTAESAATRAETAAKRAE 450 +A + + AT +A A A TA TKATEA E+AA +A+ A+++A+ Sbjct: 548 KNAEIAAENATAKAQEATKKAETAKTKATEA--------------ETAAKKAQDASEKAK 593 Query: 451 DIASAVALEDASTTKKGIVQ 470 IA+ V + AST + + Q Sbjct: 594 AIAADVLAQKASTEAQSLKQ 613 Score = 53.4 bits (126), Expect = 5e-05, Method: Composition-based stats. 
Identities = 50/311 (16%), Positives = 103/311 (33%), Gaps = 9/311 (2%) Query: 118 SAVAQNTAAAKKSASDASTSAREAATHAADAADSARAASTSAGQAASSAQSASSSAGTAS 177 V A + AST ++A +A + A + A +A++A+ Sbjct: 483 ETVKNEAKTATDEEAKASTGKKDAEINAGYVDEEVYAVNIEFEIAKEAAKTAAQHKALEI 542 Query: 178 TKATE--ASKSAAAAESSKSAAATSAGAAKTSETNASASLQSAATSASTATTKASE--AA 233 E A +A A + A A AKT T A + + A ++ A A++ A Sbjct: 543 LDKAEKNAEIAAENATAKAQEATKKAETAKTKATEAETAAKKAQDASEKAKAIAADVLAQ 602 Query: 234 TSARDAAASKEAAKSSETNASSSASSAASSATAAGNSAKAAKTSETNARSSETAAGQSAS 293 ++ +A + K+ A+ N S + A A + A ++ + S++ A+ Sbjct: 603 KASTEAQSLKQEAEKLAENIKKSNVTDEEKAKADEAAKAAKDAADQASASAK-----KAN 657 Query: 294 AAAGSKTAAASSASAASTSAGQASASATAAGKSAESAASSASTATTKAGEATEQASAAAR 353 A + T A + + A A A A ++ K + EQ + Sbjct: 658 DAKIAATNAQVVVTLQTKKAESAKAEDAAKEAMKARDKAAFELLKIKKQDVLEQVDVSPS 717 Query: 354 SASAAKTSETNAKASETSAESSKTAAASSASSAASSASSASASKDEATRQASAAKSSATT 413 + + ++ A + ++E ++ + S+ Sbjct: 718 GSDNLNDVDEQVALEVGEQQNETEDAEPQEAEEGDEEDEEDTEEEEIQDESDHTEESSAK 777 Query: 414 ASTKATEAAGS 424 + + + G Sbjct: 778 QAAQQEKQQGE 788 Score = 48.4 bits (113), Expect = 0.002, Method: Composition-based stats. 
Identities = 82/435 (18%), Positives = 145/435 (33%), Gaps = 39/435 (8%) Query: 113 VARNASAVAQNTAA-AKKSASDASTSAREAATH--AADAADSARAASTSAGQAASSAQSA 169 VA A A N A +K A + + + A +A + A +A +A+ A Sbjct: 156 VADKAKETAMNKAKKSKDEAEKIAGVNTNSMAYLYAGNAVTAEIEAKAEKEKAKKAAEIA 215 Query: 170 SSSAGTASTKATEASKSAAAAESSKSAAATSAGAAKTSETNASASLQSAATSASTATTKA 229 + K A + S + + T A S A SAS A+ K Sbjct: 216 KHAVDAYELKKEAEKAQEEAEAAKTEIEKLSKVYKEGNVTEAVQSTAEAEKSASGASAKV 275 Query: 230 SEAA--------------TSARDAAASKEAAKSSETNASSSASSAASSATAAGNSAKAAK 275 EAA ++ N + + A A SA+ AK Sbjct: 276 KEAAHNVAKKLMDAVQKLEKVSTELPKDNEQAATIDNVNEVVTEAVKEKEKAMISAEVAK 335 Query: 276 TSETNARS------SETAAGQSASAAAGSKTAAASSASAASTSAGQASASATAAGKSAES 329 NA + +E A ++ A A A A +AS+ AT A +A+ Sbjct: 336 AEAANAEAQLAKIEAERAKYEANKIAEEYTDNVKGEAKKAEEKANEASSKATEASNNAKG 395 Query: 330 AASSASTATTKAGEATEQASAAARSA----------------SAAKTSETNAKASETSAE 373 A+ +A +A A ++A + ++ NAK +E A+ Sbjct: 396 ASGEEKQTNPQAANVKAKAGEAIKAAKEAKKAKTEAYIALCVTKTLVAKENAKKAEQEAK 455 Query: 374 SSKTAAASSASSAASSASSASASKDEATRQASAAKSSATTASTKATEAAGSATAAAQSKS 433 ++K A +A A + A ++ + AK++ + +T + A Sbjct: 456 NAKDKATKAAKEAEEAKKQAEKAEKITETVKNEAKTATDEEAKASTGKKDAEINAGYVDE 515 Query: 434 TAESAATRAETAAKRAEDIASAVALEDASTTKKGIVQLSSATNSTSETLAATPKAVKSAY 493 + E A + A+ A ALE +K + + ++ + K+ Sbjct: 516 EVYAVNIEFEIAKEAAKTAAQHKALEILDKAEKNAEIAAENATAKAQEATKKAETAKTKA 575 Query: 494 DNAEKRLQKDQNGAD 508 AE +K Q+ ++ Sbjct: 576 TEAETAAKKAQDASE 590 >UniRef50_C2CM00 Putative uncharacterized protein n=2 Tax=Corynebacterium RepID=C2CM00_CORST Length = 1354 Score = 73.4 bits (178), Expect = 5e-11, Method: Composition-based stats. 
Identities = 71/414 (17%), Positives = 140/414 (33%), Gaps = 24/414 (5%) Query: 112 EVARNASAVAQNTAAAKKSASDASTS---AREAATHAADAADSARAASTSAGQAASSAQS 168 E NAS A A A KSA ++++ + A D +++A+ A +A +++ Sbjct: 247 EPVENASKPAAPKAPADKSAKQSTSTIKGTKNAEKSEKDGSNAAKPADKTASDQKNASVD 306 Query: 169 ASSSAGTASTKATEASKSAAAAESSKSAAATSAGAAKTSETNASASLQSAATSASTATTK 228 A T+ S+K + A K + ++ +SA ++ Sbjct: 307 HPKQAKHEPTQQNSTKPVVKDTPSAKLPPRPNNNAQKQQPAKDAQQPKNGNSSAGNKQSQ 366 Query: 229 ASEA-ATSARDAAASKEAAKSSETNASSSASSAASSATAAGNSAKAAKTSE-TNARSSET 286 S+A A ++ + NA S +++A AAG +A K ++ Sbjct: 367 NSKAPKAPADKSSKPSTSTIEGTKNAEKSEKDGSNAAKAAGKNADGKKKPRLIYPPVTKQ 426 Query: 287 AAGQSASAAAGSKTAAASSASAASTSAGQASASATAAGKSAESAASSASTATTKAGEATE 346 Q+ A+ K A + Q++ T A ++ T A +A Sbjct: 427 QVTQTQKASVAPKPQNAQAPKTQQPKTQQSAQPNTGAKPDTSKQSTKPETPKQGASKAGP 486 Query: 347 QASAAARSASAAKTSETNAKASETSAESSKTAAASSAS-----------------SAASS 389 A+ AA + +A ++ A+ + A + S + Sbjct: 487 TAAGAAAAGISATKAKAEAQNPPAKQHPKQPNAGKNKQPEGKPANAAQASQTAQASQPAG 546 Query: 390 ASSASASKDEATRQASAAKSSATTASTKATEAAGSATAAAQSKSTAESAATRAETAAKRA 449 A A Q+ A ++ + + + A A + S A+ T A+ Sbjct: 547 KPQAQAQPKAQAPQSQAKQTQPQAQAQQPQKPRPEAEQANRPASAAKPGTTAKAAGAEAT 606 Query: 450 EDIASAVALEDASTTKKGIVQLSSATNSTSETLAATPKAVKSAYDNAEKRLQKD 503 + A+ A E + + + S + + S+ PK + A + + Q Sbjct: 607 ANSANHAATEQEIA--RALARRSQSVGTPSQQQLKAPKPPREAQKHPNRSAQTS 658 >UniRef50_B6KQD3 SRS domain-containing protein n=2 Tax=Toxoplasma gondii RepID=B6KQD3_TOXGO Length = 854 Score = 73.0 bits (177), Expect = 6e-11, Method: Composition-based stats. 
Identities = 55/365 (15%), Positives = 102/365 (27%), Gaps = 7/365 (1%) Query: 116 NASAVAQNTAAAKKSASDASTSAREAATHAADAADSARAASTSAGQAASSAQSASSSAGT 175 A+ + + A + A DAA + QA+ A AS + G Sbjct: 274 EAAELPRALRAGVSPRALADLRPNARGAEKKDAAPKKQLTEEEPPQASGKAAGASGANGA 333 Query: 176 ASTKATE-----ASKSAAAAESSKSAAATSAGAAKTSETNASASLQSAATSASTATTKAS 230 ATE K AAA + ++ + A Sbjct: 334 PRENATEKERKLPEKPAAAVARGEDSSLEGKAPEEAGALEKGDEAIEGKKRPDVAALSER 393 Query: 231 EAATSARDAAASKEAAKSSETNASSSASSAASSATAAGNSAKAAKTSETNARSSETAAGQ 290 S + + +S +SA S + A++ A + E +A SE G Sbjct: 394 AERKSEPIRTPGRASPPASSPALRTSAPSDTALASSGPPEALPLEKEEIDAPRSEGKEGL 453 Query: 291 SASAAAGSKTAAASSASAASTSAGQASASATAAGKSAESAASSASTATTKAGEATEQASA 350 A + + + + + A A E+ + T + Sbjct: 454 KTELATPAGQVKPTGGPSQPEKPLETATVAPPAKTQPETTTVAPPAKTQPETTTVAPPAK 513 Query: 351 AARSASAAKTSETNAKASETSAESSKTAAASSASSAASSASSASASKDEATRQASAAKSS 410 + + T A +KT ++ + + + + + Sbjct: 514 TQPETTTVAPPAKTQPETTTVAPPAKTQPETATVPPPAKTQPETTTAPPPAKTQPETTTV 573 Query: 411 ATTASTKA--TEAAGSATAAAQSKSTAESAATRAETAAKRAEDIASAVALEDASTTKKGI 468 A A T+ T A A ++ + A A T+ ETA T+K + Sbjct: 574 APPAKTQPETTTVAPPAKTQPETTTVAPPAKTQPETATVAPPAKTQPGTTTAPPHTEKQV 633 Query: 469 VQLSS 473 ++ Sbjct: 634 EPTAA 638 >UniRef50_P12036 Neurofilament heavy polypeptide n=78 Tax=root RepID=NFH_HUMAN Length = 1026 Score = 73.0 bits (177), Expect = 6e-11, Method: Composition-based stats. 
Identities = 60/362 (16%), Positives = 104/362 (28%), Gaps = 20/362 (5%) Query: 107 ELMVEEVARNASAVAQNTAAAKKSASDASTSAREAATHAADAADSARAASTSAGQAASSA 166 E E A++ KS A + + A A +A + S +A S Sbjct: 645 EAKSPEKAKSPEKAKSPEKEEAKSPEKAKSPVKAEAKSPEKAKSPVKAEAKSPEKAKSPV 704 Query: 167 QSASSSAGTASTKATEASKSAAAAESSKSAAATSAGAAKTSETNASASLQSAATSA---- 222 + + S A + E +KS A+S A + AK+ + S + A + Sbjct: 705 KEEAKSPEKAKSPVKEEAKSPEKAKSPVKEEAKTPEKAKSPVKEEAKSPEKAKSPEKAKT 764 Query: 223 -----STATTKASEAATSARDAAASKEAAKSSETNASSSASSAASSATAAGNSAKAAKTS 277 A T A E A S D K + E S + + A + K Sbjct: 765 LDVKSPEAKTPAKEEARSPADKFPEKAKSPVKEEVKSPEKAKSPLKEDAKAPEKEIPKKE 824 Query: 278 ETNARSSETAAGQSASAAAGSKTAAASSASAASTSAGQASASATAAGKS---AESAASSA 334 E + E Q K A A A + + + A K Sbjct: 825 EVKSPVKEEEKPQEVKVKEPPKKAEEEKAPATPKTEEKKDSKKEEAPKKEAPKPKVEEKK 884 Query: 335 STATTKAGEAT--------EQASAAARSASAAKTSETNAKASETSAESSKTAAASSASSA 386 A K E+ E A + ++ ++ + A Sbjct: 885 EPAVEKPKESKVEAKKEEAEDKKKVPTPEKEAPAKVEVKEDAKPKEKTEVAKKEPDDAKA 944 Query: 387 ASSASSASASKDEATRQASAAKSSATTASTKATEAAGSATAAAQSKSTAESAATRAETAA 446 + A + ++ + + + TEA SK ++ A +AE ++ Sbjct: 945 KEPSKPAEKKEAAPEKKDTKEEKAKKPEEKPKTEAKAKEDDKTLSKEPSKPKAEKAEKSS 1004 Query: 447 KR 448 Sbjct: 1005 ST 1006 >UniRef50_B4NI35 GK12979 n=1 Tax=Drosophila willistoni RepID=B4NI35_DROWI Length = 2758 Score = 73.0 bits (177), Expect = 7e-11, Method: Composition-based stats. 
Identities = 76/468 (16%), Positives = 141/468 (30%), Gaps = 36/468 (7%) Query: 107 ELMVEEVARNASAVAQNTAAAKKSASDASTSAREAATHAADAADSARAASTSAGQAASSA 166 E+ E + + + A K D T E H A SA + + Sbjct: 1922 EISKGEQEATTTIPDEESEAGDKKPVDQETPKEE---HEATTVPSAEESDEERDDDKKTV 1978 Query: 167 QSASSSAGTASTKATEASKSAAAAESSKSAAATSAGAAKTSETNASASLQSAATSASTAT 226 + + +T +S + K + T S + +S + Sbjct: 1979 EKEPAKEEQEATTVRSPEESDEERDDDKKTVEKEPSKDEQEATTVSTAEESDKEPSDDKK 2038 Query: 227 T------KASEAATSARDAAASKEAAKSSETNASSSASSAASSATAAGNSAKAAKTSETN 280 + K + AT+A A S+E + S AT + ++ + + + Sbjct: 2039 SVQPEISKGEQEATTAIPAEESEEKPAGDKKPVKKEPSKEEQEATTVSPAEESDEERDDD 2098 Query: 281 ARSSET-AAGQSASAAAGSKTAAASSASAASTSAGQASASATAAGKSAESAASS----AS 335 +++E + Q A T + + S + S Sbjct: 2099 TKTNEKKPSEQEPEATTVPSTEESDKKPTDDKKTSEKEPSKDEQEPTTVQPGQEVHHVPS 2158 Query: 336 TATTKAGEATEQASAAARSASAAKTSETNAKASETSAESSKTAAASSASSAASSASSASA 395 + E EQ + +A S S S+ + S+ ++ + + + +S+++A Sbjct: 2159 DEKEPSEEGQEQETTSAPSPSVPIASDDDDSDSDDQKPTTVSPIDDKKEAGVTESSTSAA 2218 Query: 396 SKDEATRQASAAKSSATTASTKATEAAGSATAAAQSKSTAESAATRAETAAKRAEDIASA 455 ++ A TT S A E + + E K +D+ S Sbjct: 2219 PSEKDEESKDGAPIVPTTESPVAIETGSTVAPFDTPSAE--------EFDKKPMDDVMSQ 2270 Query: 456 VALEDASTTKKGIVQLSSATNSTSETLAATPKAVKSAYDNAEKRLQKDQNGADIPDKGCF 515 + + VQ +AT+S SE TP V D + + Q D G Sbjct: 2271 TVAPHSGEEEYPTVQPEAATSS-SEDSEKTPVTVAPVDDAIKVTSKPSQQPQPSEDVGEE 2329 Query: 516 LNNINAVSKTDFADKRGMRYVRVNAPAGATSGKYYPVVVMRSAGSVSE 563 V D +K A GK+ P V ++ + E Sbjct: 2330 STESGDVESEDTTEK-------------APGGKHRPPVQEQTTDKIPE 2364 >UniRef50_B3LWZ3 GF16318 n=1 Tax=Drosophila ananassae RepID=B3LWZ3_DROAN Length = 2634 Score = 72.6 bits (176), Expect = 8e-11, Method: Composition-based stats. 
Identities = 70/442 (15%), Positives = 136/442 (30%), Gaps = 12/442 (2%) Query: 78 YEDSQPGTLNDFLGAMTEDDARPEALRRF-ELMVEEVARNASAVAQNTAAAKKSASDA-S 135 D QPG A D+ P+ ++ A AVA + + + + Sbjct: 1726 ESDEQPGEKLPSSTAQAPDEKIPDVSTELPAEESDKATTVAPAVASSEDESAPTDEKIPA 1785 Query: 136 TSAREAATH-AADAADSARAASTSAGQAASSAQSASSSAGTASTKATEASKSAAAAESSK 194 T + EA T A A T + + A E + A + Sbjct: 1786 TPSEEAGTADATTVPPQAAEDDTKKLAGEEPSSTEKIPATEDKKPEDEVLEEKPDAATQV 1845 Query: 195 SAAATSAGAAKTSETNASASLQSAATSASTATTKASEAATSARDAAASKEAAKSSETNAS 254 + A ++ T+ A ++ + + A ++ +E T + + + + Sbjct: 1846 PELSEDAVSS-TAAPVAGEEVEKEKDATTAAPSQEAEYPTEVPKPTEKEPSLGEEDIESG 1904 Query: 255 SSASSAASSATAAGNSAKAAKTSETNARSSETAAGQS--ASAAAGSKTAAASSASAASTS 312 + S+ S S++ A TS S T A + A+ + A + S Sbjct: 1905 TKPSTEESDEAPIDESSEEASTSAPVQEESSTVAEEKIDATTVPSPAKPTSEQDEAITVS 1964 Query: 313 AGQASASATAAGKSAESAASSASTATTKAGEATEQASAAARSASAAKTSETNAKASETSA 372 A +++ A+ E Q + A+ + A T S Sbjct: 1965 PVSDEKVEIPAQDDSKTTIDVATEIPVSQEEEVSQTTPASTVSDAETTPAPVEVPSAVEI 2024 Query: 373 ESSKTAAASSASSAASSASSASASKDEATRQASAAKSSATTASTKATEAAGSATAAAQSK 432 E+ S + A A+ A+ E Q + T + A A + Sbjct: 2025 ETKPMDEVMSQTVAPHGATDDEAASTEEADQTPVTVAPQDAEKTPISVAPQDADKTPVTV 2084 Query: 433 STAESAATRAETAAKRAEDIASAVALEDASTTKKGIVQ------LSSATNSTSETLAATP 486 + E T A K E I + EDA V ++ T ++ + AT Sbjct: 2085 APQEDEKTPATAVPKEDEKIPATETPEDADKIPATSVPQEDEKIPATETPEDADKIPATI 2144 Query: 487 KAVKSAYDNAEKRLQKDQNGAD 508 + + + + ++ ++ + Sbjct: 2145 APIVPSVEPSSEKPSISEDVGE 2166 Score = 54.5 bits (129), Expect = 2e-05, Method: Composition-based stats. 
Identities = 49/364 (13%), Positives = 106/364 (29%), Gaps = 6/364 (1%) Query: 153 RAASTSAGQAASSAQSASSSAGTASTKATEASKSAAAAESSKSAAATSAGAAKTSETNAS 212 + S + + ++++ + + + K + + +SA + + Sbjct: 1071 KHEEESQPTLSPQLEESTTAIADKTPEGADEEKKPEEEKEQEKEKPSSAASPAPEVEEEA 1130 Query: 213 ASLQSAATSASTATTKASEAATSARDAAASKEAAKSSETNASSSASSAASSATAAGNSAK 272 T E + + S + ++ + K Sbjct: 1131 EKPIDKEHKPVDEKTPEDEIVAATEQPIKKPTEEEISTPAEKETEPEEPTTPVSIEGEKK 1190 Query: 273 AAKTSETNARSSETAAGQSASAAAGSKTAAASSASAASTSAGQASASATAAGKSAESAAS 332 A+ S + +E G+ A GS T A +S +A ++ E A+ Sbjct: 1191 PAEESAEEEKPAED--GKPALEDEGSATTTAPEKETPESSTELPTADVDKKPEAEEEPAT 1248 Query: 333 SASTATTKAGEATEQASAAARSASAAKTSETNAKASETSAESSKTAAASSASSAASSASS 392 + T ++ + + + A+ T A+S + + S+ Sbjct: 1249 ATQTPKIESPTSVPAGEETEDLDKFTTVPPSEVSDEKEPADDEDTVPATSVPTQSEDIST 1308 Query: 393 ASASKDEATRQASAAKSSATTASTKATEAAGSATAAAQSKSTAESAATRAETAAKRAEDI 452 + T S A+ A +TEA + +T ET+ + A Sbjct: 1309 SRPIVPLPTLAPSQAEK--EPAEESSTEAEIEEKPTKEESATEPITKGEMETSEETATPS 1366 Query: 453 ASAVALEDASTTKKGIVQLSSATNSTSETLAATPKAVKSAYDNAEKRLQKDQNGADIPDK 512 S +ED T + + + E TP K+ + E+ D+P Sbjct: 1367 VSENEVEDEEGTAAPVPAPAPGSTDAEEDK--TPSTEKAQPTSEEEAATDAPAAGDLPKL 1424 Query: 513 GCFL 516 + Sbjct: 1425 PQDI 1428 Score = 50.7 bits (119), Expect = 3e-04, Method: Composition-based stats. 
Identities = 63/407 (15%), Positives = 135/407 (33%), Gaps = 10/407 (2%) Query: 114 ARNASAVAQNTAAAKKSASDASTSAREAATHAADAADSARAASTSAGQAASSAQSASSSA 173 + A+ + + S + ++ E+A + S + S Sbjct: 377 SSTAAIASDSRIDVPSSTEETKETSTESAEEDVAKIVTTPEPEGSGEEETSKPAHVPEKE 436 Query: 174 GTASTKATEASKSAAAAESSKSAAATSAGAAKTSETNASASLQSAATSASTATTKASEAA 233 T ++ + A A + A + + E + S++S ++ K + A Sbjct: 437 VTEDELIKVSTSAPAKASPGEETATEPSAEEEEEEAKPTPSVESTEEEKEPSSEKPTSAE 496 Query: 234 TSARDAAASKEAAKSSETNASSSASSAASSATAAGNSAKAAKTSETNARSSETAAGQSAS 293 + + +A + E + + S+ + +K + T S E ++++ Sbjct: 497 EGSGEQEEGDKATPAPEESQEDELAKPTSTPAGIDEKEEESKPTSTPEGSGEEEKEETST 556 Query: 294 AAA--GSKTAAASSASAASTSAGQASASATAAGKSAESAASSASTATTKAGEATEQASAA 351 +A G + + A + + AA + E S+ TA A + +E A Sbjct: 557 SAPPTGDEKEEEVKITPAPEEEEEEAKPTPAAEEEKEDGDSAKPTAIPPASDDSEAAKPT 616 Query: 352 ARSASAAKTSETNAKASETSAESSKTAAASSASSAASSASSASASKDEATRQASAAKSS- 410 S A+ E + + T + + + +S + ++ +DE ++S A + Sbjct: 617 ESSEEASGEGEEDIVKATTPSGEVSSEGGEEEPAKETSPAPEASGEDEKEEESSTAAPAD 676 Query: 411 ---ATTASTKATEAAGSATAAAQSKSTAESAATRAETAAKRAEDIASAVALEDASTTKKG 467 A T T A E AT + +AE + T E+ A K+ Sbjct: 677 GVEAETVQTPADEKDIIATPSPTEDVSAEEEIVKVTTPTSLGEEPAEEST----ELPKED 732 Query: 468 IVQLSSATNSTSETLAATPKAVKSAYDNAEKRLQKDQNGADIPDKGC 514 V+ S+A + + + A +A R+ + A Sbjct: 733 DVESSTAAPAIASSTEGIQDASIETSTSAPARIGGQEEEASTASTPE 779 >UniRef50_B3L6P2 Merozoite surface protein 3, putative n=2 Tax=Plasmodium knowlesi strain H RepID=B3L6P2_PLAKH Length = 961 Score = 72.3 bits (175), Expect = 1e-10, Method: Composition-based stats. 
Identities = 72/354 (20%), Positives = 118/354 (33%), Gaps = 23/354 (6%) Query: 116 NASAVAQNTAAAKKSASDASTSAREAATHAADAADSARAASTSAGQAASSAQSASSSAGT 175 N A + + +A++ T A ++A +A A +A A+ + + Sbjct: 288 RLETAKNNIEQALEKIEEELKNAKQKTTKATESARAATEA-IETIEAMEVARKSENKGEN 346 Query: 176 ASTKATEASKSAAAAESSKSAAATSAGAAKTSETNASASL--------QSAATSASTATT 227 A+ T AE + AA + + + ++ A + + AA A+ A Sbjct: 347 ATINETNVKNIEQHAEEAMEAAKKNTESKEQAQILAGIAKIIVALEEAKIAAKEAALAKN 406 Query: 228 KASEAATSARDAAASKEAAKSSETNASSSASSAASSATAAGNSAKAAKTSETNARSSETA 287 A E A A A KEAA T A++ N K K + +E Sbjct: 407 LADELARMATSANVPKEAATGKATQANNKVKEIEKLLEEIKNIEKIEKEKRPLEKVNEAK 466 Query: 288 AG-QSASAAAGSKTAAASSASAASTSAG--QASASATAAGKSAESAASSASTATTKAGEA 344 Q A+ A A A A + A A AT + A+ + A A A A Sbjct: 467 EKVQKATEAKDIAKKAKEEAEIAVSVARVHVAKQEATRLARVADEEKNKAKEAAKTAENA 526 Query: 345 TEQASAAARSASAAKTSETNAKASETSAESSKTAAASSASSAASSASS-----------A 393 ++A + + K++E E++K + A +A Sbjct: 527 KKEAEKVSGNFKNVAKIVEAVKSAEAEVETAKKEEEKAEMEAIEAAQEAYTLEYLLLMVT 586 Query: 394 SASKDEATRQASAAKSSATTASTKATEAAGSATAAAQSKSTAESAATRAETAAK 447 A+ + Q KS+ A KA +A A AQ A A A+ Sbjct: 587 EAANEAEAAQGENIKSTLEKAQEKALQAVEQANHKAQETFNFTQKAIDASKKAQ 640 >UniRef50_Q1QX63 Ribonuclease E n=5 Tax=Gammaproteobacteria RepID=Q1QX63_CHRSD Length = 1175 Score = 72.3 bits (175), Expect = 1e-10, Method: Composition-based stats. 
Identities = 66/283 (23%), Positives = 117/283 (41%), Gaps = 13/283 (4%) Query: 114 ARNASAVAQNTAAAKKSASDASTSAREAATHAADAADSARAASTSAGQAASSAQSASSSA 173 +RNAS A+ + +A TSA A AA+A R A+ +A A S+ A ++A Sbjct: 882 SRNASKPARTASETADAAPQGKTSAD-APETAAEAPSQPREAAAAAQPADSATSEAPATA 940 Query: 174 GT---ASTKATEASKSAAAAESSKSAAATSAGAAKTSETNASASLQSAATSASTATTKAS 230 AS A A ++A +E AT + A ET+AS + + + + A++ A+ Sbjct: 941 NADTRASAPAPSADDASALSEPKAPRDATPSAEAPADETSASDAPRGEVSDSQPASSAAA 1000 Query: 231 EAATSA-RDAAASKEAAKSSETNASSSASSAASSATAAGNSAKAAKTSETNARSSETAAG 289 + ATS RD + A ++ A+++ +++++ A A+ T+ + T A Sbjct: 1001 DEATSTGRDTVEQEAATPVTKDEAATAPATSSADADVDTRQDAASSTATPQDDTLATPAS 1060 Query: 290 QSASAAAGSKTAAASSASAASTSAGQASASATAAGKSAESAASSAS------TATTKAGE 343 Q + S A A A + +A AS+S A + A + T + E Sbjct: 1061 Q--AEQKASTDAPAPQAEPTTPAAAAASSSDEQAPAEVSESQPQAEPKAPSDASATSSEE 1118 Query: 344 ATEQASAAARSASAAKTSETNAKASETSAESSKTAAASSASSA 386 + A + A T + + + SA + A + Sbjct: 1119 KASTETPAPQPEQTASTEASEPQQATASAPRRRRRAHNDPREQ 1161 >UniRef50_B1VIS6 DNA polymerase III, gamma and tau subunits n=10 Tax=Corynebacterineae RepID=B1VIS6_CORU7 Length = 1102 Score = 72.3 bits (175), Expect = 1e-10, Method: Composition-based stats. 
Identities = 80/386 (20%), Positives = 136/386 (35%), Gaps = 11/386 (2%) Query: 91 GAMTEDDARPEALRRFELMVEEVARNASAVAQNTAAAKKSASDASTSAREAATHAADAAD 150 GA ED A +A E+M A + Q + A + ++A+ AAD + Sbjct: 572 GAQDEDPAIAQAREAREIMDRRRAAAHANAGQVSQQAGAGQAPQPSAAQAPQQAAADQSQ 631 Query: 151 SARAASTSAGQAASS-------AQSASSSAGTASTKATEASKSAAAAESSKSAAATSAGA 203 A AA+ S Q+ +S +G A + TE +S A +A + Sbjct: 632 RAGAAAQSLSQSQASTEVSGVEPSDQQPQSGNAQAENTE-QRSVQPTGQPAEAPVEAAQS 690 Query: 204 AKTSETNASASLQSAATSASTATTKASEAATSARDAAASKEAAKSSETNASSSASSAASS 263 ++A+ A +A + R+ S A T A+ ++ Sbjct: 691 TDAPADTSAAATAPAGDAAQVDEQAWQKILEKVREHDLSAWIAGRDATLAAEQPTAGQIV 750 Query: 264 ATAAGNSAKAAKTSETNARSSETAAGQSASAAAGSKTAAASSASAASTSAGQASASATAA 323 A + S+ NA AA + AA + + + A A +A Sbjct: 751 VEHATGALAKFVNSDRNAPIYSKAAEEVLGAAHTVLATVGGNNPGKAQAGSAEEAPAESA 810 Query: 324 GKSAESAASSASTATTKAGEATEQA-SAAARSASAAKTSETNAKASETSAESSKTAAASS 382 ++ ++ ++A+ A ++ A S A + N + T+ SS AA Sbjct: 811 TPTSTTSGNNATGAEASHSTVSQPAVSQAEPGQVGPDHAAANQAPAVTAEPSSANAAGGH 870 Query: 383 ASSAASSASSASASKDEATRQAS-AAKSSATTASTKATEAAGSATAAAQSKSTAESAATR 441 A S A+ + A A + A A A T+AT+ A A +A + + E A Sbjct: 871 APSEAAGSGQALGEGTSAGQDAPGEAAGFGQAAPTQATQPAQPAQPSAGASAL-ERARRM 929 Query: 442 AETAAKRAEDIASAVALEDASTTKKG 467 AE +A +A A + K Sbjct: 930 AEASAAQATSPTPAPTRPSHTEEPKP 955 >UniRef50_B5H2D1 Putative uncharacterized protein n=1 Tax=Streptomyces clavuligerus ATCC 27064 RepID=B5H2D1_STRCL Length = 1075 Score = 72.3 bits (175), Expect = 1e-10, Method: Composition-based stats. 
Identities = 60/399 (15%), Positives = 115/399 (28%), Gaps = 5/399 (1%) Query: 80 DSQPGTLNDFLGAMTEDDARPEALRRFELMVEEVARNASAVAQNTAAAKKSASDASTSAR 139 D +D A E+D + +AL +E A + Q A AK+ A + Sbjct: 433 DDANKAKDDANKAKIENDNKVKALEEEFRRKQEEAEAEARRRQAEADAKQ-AEAERKQEQ 491 Query: 140 EAATHAADAADSARAASTSAGQAASSAQSASSSAGTASTKATEASKSAAAAESSKSAAAT 199 + A A++ R +A + A +A +K A A Sbjct: 492 KEKEAEAKQAEAERKRDEKEREAEAKQAEAERKQEEKQAEAERKRDEKEREAGTKQAEAE 551 Query: 200 SAGAAKTSETNASASLQSAATSASTATTKAS---EAATSARDAAASKEAAKSSETNASSS 256 K +E A A A +A + A + A+ +T + Sbjct: 552 RKQEEKQAEQEARQERLQAEQEAKQDRLQAEAERKQAEQEAKQEQKEREAEERQTLLMNQ 611 Query: 257 ASSAASSATAAGNSAKAAKTSETNARSSETAAGQSASAAAGSKTAAASSASAASTSAGQA 316 A +A + ++ + E A Q+ + + A A Sbjct: 612 ARIDQERQRREQERKQAEQEAKQGEKEREADAKQAEAERKQEEKEAEQERKQAEKEKEAE 671 Query: 317 SASATAAGKSAESAASSASTATTKAGEATEQASAAARSASAAKTSETNAKASETSAESSK 376 + A A + E A + K EA ++ A + + + + + ++ Sbjct: 672 AKQAEAERERDEKQAEQEAKQEQKEKEAEQKRIRTEAEYEAKQAEAERKQEEKQAEQEAR 731 Query: 377 TAAASSASSAASSASSASASKDEATRQASAAKSSATTASTKATEAAGSATAAAQSKSTAE 436 + A A A + +A +A + A +A + + + Sbjct: 732 QERLQAEQEARQDRLQAEADQRQAEAEARREQQQAEQERKQAEAEKRAERQMRELGMDSG 791 Query: 437 SAATRAETAAKRAEDIASAVALEDASTTKKGIVQLSSAT 475 S +R + + L + SS T Sbjct: 792 SG-SRGTDGPRLPGGDSGTTTLNPDGSVTMDYPDGSSRT 829 >UniRef50_C4KTI1 Cell divisionftsk/spoiiie n=95 Tax=cellular organisms RepID=C4KTI1_BURPS Length = 1863 Score = 71.9 bits (174), Expect = 1e-10, Method: Composition-based stats. 
Identities = 90/443 (20%), Positives = 168/443 (37%), Gaps = 17/443 (3%) Query: 114 ARNASAVAQNTAAAKKSASDASTSAREAATHAADAADSAR--AASTSAGQAASSAQSASS 171 + A A A A+ +A D + R AT AA +A R + S A S+ + Sbjct: 364 STAAVAALGKRAQARPAAPDPRFAPRRPATQAAVSAARNRPMTFTPSRQTAGSTPPQPAP 423 Query: 172 SAGTASTKATEASKSAAAAESSKSAAATSAGAAKTSETNASASLQSAATSASTATTKASE 231 A TA+ A A K A A + A A+ AS + AS A A Sbjct: 424 RAQTAAPTAETARKRAPANPARAPLYAWHEKPAERIAPAASVHETLRSIEASAAQWTALA 483 Query: 232 AATSARDAAASKEAAKSSETNASSSASSAASSATAAGNSAKAAKTSETNARSSETAA--- 288 ATS + + ++ S A+++A+ + A SA+ A ++ + S+ET A Sbjct: 484 GATSTAATPVTARESMAAPAAPSGGAAASAAPDSHAPTSAETAAPNDHASTSAETVAPDG 543 Query: 289 -GQSASAAAGSKTAAASSASAASTSAGQASASATAAGKSAESAASSASTATTKAGEATEQ 347 +++ A A +SA A+ +++ TAA ++ + A + E Sbjct: 544 HAPTSAETAAPDGHAPTSAETAAPDGHAPTSAETAAPDGHAPTSAETAAPDGHAPTSAET 603 Query: 348 ASAAARSASAAKTSETNAKASETSAESSKTAAASSASSAASSASS---ASASKDEATRQA 404 A+ ++++A+T+ ++ A TSAE++ +++ ++A + ++ + A Sbjct: 604 AAPNDHASTSAETAAPDSHAP-TSAETAAPDGHHASTITEATAPNGHVSATVETSAVAAP 662 Query: 405 SAAKSSATTASTKATEAAGSATAAAQSKSTAESAATRAETAAKRAEDIASAVALEDASTT 464 + +A + A AA + T++SAA A A A++ A + Sbjct: 663 AGITQAAPPIAADICPAGEHVIAAVEPACTSDSAAIGAAAIAHAEAGAAASTAETASPIG 722 Query: 465 KKGIVQLSSATNSTSETLAATPKAVKSAYDNAEKRLQKDQNGADIPDKGCFLNNINAVSK 524 + S + T++T P ++ + N A + Sbjct: 723 VDTHIAPSREADRTAQTAPTAPSPAEATPHVDAPHALD-------VAARALVGNTAATAH 775 Query: 525 TDFADKRGMRYVRVNAPAGATSG 547 A + +PA +TSG Sbjct: 776 GAAAVDGSAQRADTASPAASTSG 798 >UniRef50_UPI00015B5167 PREDICTED: similar to ENSANGP00000017739 n=1 Tax=Nasonia vitripennis RepID=UPI00015B5167 Length = 2721 Score = 71.5 bits (173), Expect = 2e-10, Method: Composition-based stats. 
Identities = 58/417 (13%), Positives = 127/417 (30%), Gaps = 8/417 (1%) Query: 97 DARPEALRRFELMVEEVARNASAVAQNTAAAKKSASDASTSAREAATHAADAADSARAAS 156 +PEA + + + + A ++DA+ + + + Sbjct: 1593 SVQPEADNQMSDHHTSTSSDDNVTPTTQKVAVDESTDATEKTPSKDEAEPEMMTTTPSVK 1652 Query: 157 TSAGQAASSAQSASSSAGTASTKATEASKSAAAAESSKSAAATSAGAAKTSETNASASLQ 216 + +S + T S E S S +S S A + E + + Sbjct: 1653 SEPESTPEEPESGKENIPTLSPVGEEISSSVVPESTSTSTEAEMSVKGAEEEPSTPEDDK 1712 Query: 217 SAATSASTATTKASEAATSARDAAASKEAAKSSETNASSSASSAASSATAAGNSAKAAKT 276 S +T + + + + + ++S T A+ S A +++ + Sbjct: 1713 SKVDEEKLSTVTSETPESEEEGVSETTSSIQTSSTEATEEDKSKLPEEEIATPASEDTVS 1772 Query: 277 SETNARSSETAAGQSASAAAGSKTAAASSASAASTSAGQASASATAAGKSAE--SAASSA 334 +++ R+ E S S + S S +T+A + + T + KS E + Sbjct: 1773 AQSTERTDE-----SPKPEDDSAGSVEPSPSPEATNAPEEDVTTTPSSKSTELEVEKAEP 1827 Query: 335 STATTKAGEATEQASAAARSASAAKTSETNAKASETSAESSKTAAASSASSAASSASSAS 394 + + + A A T + + + +S +S +S S Sbjct: 1828 TEPAQEVETTEKPKDTAQPDEEATPTGAPLEPVDAETQKPISESGEASTTS-PASPESEL 1886 Query: 395 ASKDEATRQASAAKSSATTASTKATEAAGSATAAAQSKSTAESAATRAETAAKRAEDIAS 454 S D + + + E +T A+ + S+ A A ++ Sbjct: 1887 TSADTSESATPSVEEQTEKVQEPEIEKEPISTEASLASSSETPAEPSATKSSIPESSTEL 1946 Query: 455 AVALEDASTTKKGIVQLSSATNSTSETLAATPKAVKSAYDNAEKRLQKDQNGADIPD 511 + + + SET AA + + + + + G D P+ Sbjct: 1947 DSETQKPEVAEVEPEVTVPSVEEASETSAAVTEPTSTGEEAKKPEDTTESVGTDKPE 2003 Score = 58.8 bits (140), Expect = 1e-06, Method: Composition-based stats. 
Identities = 56/427 (13%), Positives = 135/427 (31%), Gaps = 12/427 (2%) Query: 79 EDSQPGTLNDFLGAMTEDDARPEALRRFELMVEEVARNASAVAQNTAAAKKSASDASTSA 138 E + GT + DD + E V ++ ++ A S + Sbjct: 2044 EPAVTGTSEEATSKAPSDDVTAPHEPELTSVTESVTEQEQTSSEASSTASPSDESTPEAK 2103 Query: 139 REAATHAADAADSARAASTSAGQAASSAQSASSSAGTASTKATEASKSAAAAESSKSAAA 198 + + ++ +A S T+ + ++ + + Sbjct: 2104 P-VDENKPEETPIEAQPEPDTTESGEAATVKSIEQPEVDTEMEKTTEKPEEKQPEEEKPE 2162 Query: 199 TSAGAAKTSETNASASLQSAATSASTATTKASEAATSARDAAASKEAAKSSETNASSSAS 258 + E + K ++ ++ ++ + T ++ S + Sbjct: 2163 EKIPEEEKLEEQTPEEEKPEEQKPEEEKAKQDTTESTDEATGEAQTVSEETITLSTPSEA 2222 Query: 259 SAASSATAAGNSAKAAKTSETNARSSETAAG--------QSASAAAGSKTAAASSASAAS 310 + S + + T++G ++ + A AA + AS + Sbjct: 2223 GESDVKEKPTESLIETEKETSEPSVELTSSGTIDSKIGVETEGSTAAEDQAAVTEASTEA 2282 Query: 311 TSAGQASASATAAGKSAESAASSASTATTKAGEATEQASAAARSASAAKTSETNAKASET 370 + ++S+ +A + + + + +T+ A ++ + S E + T Sbjct: 2283 SKVDESSSPISAEPTDEDQKSPALAETSTETPVAKDKIDEPEEATSEKPDEEIPSHEELT 2342 Query: 371 SAESSKTAAASSASSAASSASSASASKDEATRQASAAKSSATTASTKATEAAGSATAAAQ 430 + E +K A ++ + + S +S SS + TEAA S T++++ Sbjct: 2343 TVEPTKVEAYPDEETSHAPSESPDKELMSTVFDSSEETSSESEKKDLTTEAAISETSSSE 2402 Query: 431 SKSTAESAATRAETAAKRAEDIASAVALEDASTTKKGIVQLSSATNSTSETLAATPKAVK 490 + + E A A + DA ++ + +S T AA P + Sbjct: 2403 PTVSEKPDEKEEEPTATAASE---EPTSSDAGSSTSEVPTTASTDAEDQSTKAAVPADEQ 2459 Query: 491 SAYDNAE 497 S+ + +E Sbjct: 2460 SSVEPSE 2466 >UniRef50_A5KAV7 Merozoite surface protein 3 alpha (MSP3a), putative n=2 Tax=Plasmodium vivax RepID=A5KAV7_PLAVI Length = 907 Score = 71.5 bits (173), Expect = 2e-10, Method: Composition-based stats. 
Identities = 98/471 (20%), Positives = 156/471 (33%), Gaps = 49/471 (10%) Query: 97 DARPEALRRFELMVEEVARNASAVAQNTAAAKKSASDASTSAR----------------- 139 DA + +V + A K ++ +T + Sbjct: 136 DAINKVKEYASSKESQVKKKVEEAKSAADEATKGSTKENTEQKAKAAEAALGEAQNAKVQ 195 Query: 140 -EAATHAADAADSARAASTSAGQAASSAQSASSSAGTASTKATEASKSAAAAESSKSAAA 198 E A D A A A +A AQ A+ A A K A A S +S A Sbjct: 196 MEKAAAIVDEVVKAMEAEKEAQKAKEEAQKANEEAQNAKNKFVMIKPKQAGAGSPESKAE 255 Query: 199 TSAGAAKTSETNASASLQSAATSASTATTKASEAATSARDAAASKEAAKSSETNASSSAS 258 A A T A + A+ A+ A KA+EAA A + A K+ E + Sbjct: 256 QEAKIANDKTTEAEQKAEQASAKATDAAQKATEAAQKAIEVAKKATQEKTGEEKEIEQEN 315 Query: 259 SAASSATAAGNSAKAAKTSETNARSSETAAGQSASAAAGSKTAAASSASAASTSAGQASA 318 A+ A + + A A A SA A A A+ Sbjct: 316 VTKIKEIASNAVKDAKDAKKAKREAQIKAEIVKLELAKEEAKKAVESAKKAKDEAAAAAK 375 Query: 319 SATAAGKSAESAASSASTATTKAGEATEQASAAARSASAAKTSETNAKASETSAESSKTA 378 ++ +A +A AA A A TKA + S +A + ++ +A ++ A Sbjct: 376 TSKSAKLAATRAAKKAEEAETKANLLENKNKQGPISLAATAEAAEKEASTAVAAVATAEA 435 Query: 379 AAS----------------------SASSAASSASSASASKDEATRQASAAK--SSATTA 414 A + A +A+ A ++ EA AK + A Sbjct: 436 AEKAKTEEVEKKEAEAEEKIKTLIQKVAKAIKAANQAKKAQIEAEIAVEVAKIEEHSEVA 495 Query: 415 STKATEAAGSATAAAQSKSTAESAATRAETAAKRAEDIA----SAVALEDASTTKKGIVQ 470 + EA + A Q+ S A+ A T+ E AAK AE + + +E A+ +K + Sbjct: 496 QKEVEEAEKANAKAKQAASEAQEAKTQTEKAAKAAEMVKAKDLAKTEVEIATKAEKEVAD 555 Query: 471 LSSATNSTSETLAATPKAVKSAYD---NAEKRLQKDQNGADIPDKGCFLNN 518 + S A+K + NAEK+ K+ + + D + Sbjct: 556 AKMEADEESSEAVEKAHAIKMQLEVAINAEKKAAKEVDITKLEDAKKEAQD 606 Score = 44.9 bits (104), Expect = 0.020, Method: Composition-based stats. 
Identities = 86/470 (18%), Positives = 136/470 (28%), Gaps = 87/470 (18%) Query: 112 EVARNASAVAQNTAAAKKSASDASTSAREAATHAADAADSARAASTSAGQ---------- 161 + + A T A++ A AS A +AA A +AA A + A Q Sbjct: 253 KAEQEAKIANDKTTEAEQKAEQASAKATDAAQKATEAAQKAIEVAKKATQEKTGEEKEIE 312 Query: 162 ------AASSAQSASSSAGTASTKATEASKSAA-----AAESSKSAAATSAGAAKTSETN 210 A +A A A EA A A+ A SA AK Sbjct: 313 QENVTKIKEIASNAVKDAKDAKKAKREAQIKAEIVKLELAKEEAKKAVESAKKAKDEAAA 372 Query: 211 ASASLQSAATSASTATTKASEAATSARDAAASKEAAKSSETNAS---------------- 254 A+ + +SA +A+ A KA EA T A + S + Sbjct: 373 AAKTSKSAKLAATRAAKKAEEAETKANLLENKNKQGPISLAATAEAAEKEASTAVAAVAT 432 Query: 255 ---------------------------SSASSAASSATAAGNSAKAAKTSETNARSSE-- 285 + A +A A + A+ + A+ E Sbjct: 433 AEAAEKAKTEEVEKKEAEAEEKIKTLIQKVAKAIKAANQAKKAQIEAEIAVEVAKIEEHS 492 Query: 286 ------TAAGQSASAAAGSKTAAASSASAASTSAGQ----------ASASATAAGKSAES 329 + A+A A + A A + A + A A K+ + Sbjct: 493 EVAQKEVEEAEKANAKAKQAASEAQEAKTQTEKAAKAAEMVKAKDLAKTEVEIATKAEKE 552 Query: 330 AASSASTATTKAGEATEQASAAARSASAAKTSETNAKASET-----SAESSKTAAASSAS 384 A + A ++ EA E+A A A +E A A+ A +A Sbjct: 553 VADAKMEADEESSEAVEKAHAIKMQLEVAINAEKKAAKEVDITKLEDAKKEAQDATKNAK 612 Query: 385 SAASSASSASASKDEATRQASAAKSSATTASTKATEAAGSATAAAQSKSTAESAATRAET 444 S AS ++ A + A+ A A A Q++ AA E Sbjct: 613 ELLSIVQQASMLAIAKSQDALNSLEKVKKAAEVAKAKLAKAEAKNQAEEAKAIAAKADEL 672 Query: 445 AAKRAEDIASAVALEDASTTKKGIVQLSSATNSTSETLAATPKAVKSAYD 494 + A E A+ ++ + T A K ++ + Sbjct: 673 SKATKASTAEKTKAEAAAKEAAAQATKAADEAGKAATAADNTKKAATSAE 722 >UniRef50_C7YRX8 Predicted protein n=4 Tax=Eukaryota RepID=C7YRX8_NECH7 Length = 4743 Score = 71.1 bits (172), Expect = 3e-10, Method: Composition-based stats. 
Identities = 69/408 (16%), Positives = 133/408 (32%), Gaps = 8/408 (1%) Query: 109 MVEEVARNASAVAQNTAAAKKSASDASTSAREAATHAADAADSARAASTSAGQAASSAQS 168 + E + A T D AA A+ + + + Sbjct: 872 KLAEADDKPTEEAPPTEEKPAETDDKPVEEAPAAEEKPAEAEKPAEEAPPTEEKPAETDD 931 Query: 169 ASSSAGTASTKATEASKSAAAAESSKSAAATSAGAAKTSETNASASLQSAATSASTATTK 228 S + AA KSA +A A + + A Sbjct: 932 KPSEEANPVEAEKPVEEPTPAASEEKSAEEVNAIEADGETSESDDKPTEAVKPVEAEDKP 991 Query: 229 ASEAATSARDAAASKEA--AKSSETNASSSASSAASSATAAGNSAKAAKTSETNARSSET 286 A EAA + D ++EA A+S + S+ + A +++ SA++ + +A+++E Sbjct: 992 AEEAAPAETDDKPTEEAEPAESEKPVEESTPAEANDTSSEEAKSAESEEKPAEDAKATEA 1051 Query: 287 AAGQSASAAAGSKTAAASSASAA---STSAGQASASATAAGKSAESAASSASTATTKAGE 343 A +S A A + + + SA+ ++ + T + E Sbjct: 1052 DASESDDKPAEGVKPAEPEETPSAEVEPAEENKSANDIPETEAKSTEEMKPEKETEQQPE 1111 Query: 344 ATEQASAAARS---ASAAKTSETNAKASETSAESSKTAAASSASSAASSASSASASKDEA 400 ATE +AA +AKT E S + +++ + +A + +A +E Sbjct: 1112 ATEPPAAAETQKAEEESAKTEEPVEPEQTESGDKAESEEPKAGDNATTDGETAGDDLEEQ 1171 Query: 401 TRQASAAKSSATTASTKATEAAGSATAAAQSKSTAESAATRAETAAKRAEDIASAVALED 460 ++ ++ ++ AST +A A TA+ E A + A ++ Sbjct: 1172 AKETASPEAKDEVASTAGGKATEEEAPIASDDDTADKEPPATEDHPSEAAEADDGKAAQE 1231 Query: 461 ASTTKKGIVQLSSATNSTSETLAATPKAVKSAYDNAEKRLQKDQNGAD 508 A + V+ ++ + E+ AE K D Sbjct: 1232 AKAAEDEKVESAAKPSEDDESTDGEAAGETEKAGTAEAEEPKTTEKPD 1279 Score = 70.7 bits (171), Expect = 3e-10, Method: Composition-based stats. 
Identities = 82/457 (17%), Positives = 161/457 (35%), Gaps = 42/457 (9%) Query: 96 DDARPEALRRFELMVEEVARNASAVAQNTAAAKKSASDASTSAREAATHAADAADSARAA 155 DD EA++ E E+ +A A+ + A A + + A+A D++ Sbjct: 975 DDKPTEAVKPVEA--EDKPAEEAAPAETDDKPTEEAEPAESEKPVEESTPAEANDTSSEE 1032 Query: 156 STSAGQAASSAQSASSSAGTASTKATEASKSAAAAESSKSAAATSAGAAKTS------ET 209 + SA A+ A ++ AS + ++ AE ++ +A A + ET Sbjct: 1033 AKSAESEEKPAEDAKATEADASESDDKPAEGVKPAEPEETPSAEVEPAEENKSANDIPET 1092 Query: 210 NASASLQSAATSASTATTKASEAATSARDAAASKEAAKSSETNASSSASSA-ASSATAAG 268 A ++ + + +A+E +A A +E+AK+ E S + + Sbjct: 1093 EAKSTEEMKPEKETEQQPEATEPPAAAETQKAEEESAKTEEPVEPEQTESGDKAESEEPK 1152 Query: 269 NSAKAAKTSETNARSSETAAGQSASAAAGSKTAAASSASAASTSAGQASASATAAGKSAE 328 A ET E A ++AS A + AS A A + A + +A+ Sbjct: 1153 AGDNATTDGETAGDDLEEQAKETASPEAKDE-----VASTAGGKATEEEAPIASDDDTAD 1207 Query: 329 SAASSASTATTKAGEATEQASAAARSASAAKTSETNAKASETSAESSKTAAASSASSAAS 388 + ++A EA + +A A+ + E+ AK SE + AA + + + Sbjct: 1208 KEPPATEDHPSEAAEADDGKAAQEAKAAEDEKVESAAKPSEDDESTDGEAAGETEKAGTA 1267 Query: 389 SASSASASKD---------------------------EATRQASAAKSSATTASTKATEA 421 A ++ E + K A A++ E Sbjct: 1268 EAEEPKTTEKPDETSDSESEAESEGKVATAEAKEPEVEVSAPDDDEKKGADAAASAEAET 1327 Query: 422 AGSATAAAQSKSTAESAATRAETAAKRAEDIASAVALEDASTTKKGIVQLSSATNSTSET 481 +A AQ++ T + A + + + A V ++ + + +S S++ Sbjct: 1328 NAAAETPAQTEDTDKEAPETSAESVPEPKPAADDVKPKEELDANAASTEAEKSGDSDSDS 1387 Query: 482 LAATPKAVKSAYDNAEKRLQKDQNGADIPDKGCFLNN 518 K V A ++ K+ +++PD + Sbjct: 1388 DNEDDKPVAEATPESKPEAPKE-TPSEVPDSSKATED 1423 Score = 65.3 bits (157), Expect = 1e-08, Method: Composition-based stats. 
Identities = 65/427 (15%), Positives = 145/427 (33%), Gaps = 24/427 (5%) Query: 111 EEVARNASAVAQNTAAAKKSASDASTSAREAATHAADAADSARAASTSAGQAASSAQSAS 170 E + A+ TA+ + AST+ +A A A A A+ Sbjct: 1162 ETAGDDLEEQAKETASPEAKDEVASTAGGKATEEEAPIASDDDTADKEPPATEDHPSEAA 1221 Query: 171 SSAGTASTKATEASKSAAAAESSKSAAATSAGAAKTSETNASASLQSAATSASTATTKAS 230 + + + +A++ ++K + + + + A A +T + Sbjct: 1222 EADDGKAAQEAKAAEDEKVESAAKPSEDDESTDGEAAGETEKAGTAEAEEPKTTEKPDET 1281 Query: 231 EAATSARDAAASKEAAKSSETNASSSA----------SSAASSATAAGNSAKAAKTSETN 280 + S ++ A++ E SA ++A++ A + A+T +T+ Sbjct: 1282 SDSESEAESEGKVATAEAKEPEVEVSAPDDDEKKGADAAASAEAETNAAAETPAQTEDTD 1341 Query: 281 ARSSETAAGQSASAAAGS---KTAAASSASAASTSAGQASASATAAGKSAESAASSASTA 337 + ET+A + K A+AAST A ++ S + + + + A+ Sbjct: 1342 KEAPETSAESVPEPKPAADDVKPKEELDANAASTEAEKSGDSDSDSDNEDDKPVAEAT-P 1400 Query: 338 TTKAGEATEQASAAARSASAAKTSETNAKASETSAESSKTAAASSASSAASSASSASASK 397 +K E S S+ A + SE+ E S ++ + + + +A Sbjct: 1401 ESKPEAPKETPSEVPDSSKATEDSESADDDVEKSQPPEASSKVEPEEATPETPAETTAPA 1460 Query: 398 DEATRQASAAKS---------SATTASTKATEAAGSATAAAQSKSTAESAATRAETAAKR 448 ++ + A A + A + +A A+ + + Sbjct: 1461 PSPDSESDSESDSESDASEDSPDADKEPAAAAAKEESAAPTEEAKPEPTAEETAKPSETK 1520 Query: 449 AEDIASAVALEDASTTKKGIVQLSSATNSTSETLAAT-PKAVKSAYDNAEKRLQKDQNGA 507 ++ A +++K+ + + + S ++ PKAV+ A + ++ A Sbjct: 1521 PKEEAEGEPAATETSSKEPDSEKAQDSESEDDSDDEEAPKAVEEKATPAAEAEKETPAEA 1580 Query: 508 DIPDKGC 514 + PD Sbjct: 1581 EKPDADQ 1587 Score = 61.1 bits (146), Expect = 3e-07, Method: Composition-based stats. 
Identities = 66/390 (16%), Positives = 134/390 (34%), Gaps = 6/390 (1%) Query: 99 RPEALRRFELMVEEVARNASAVAQNTAAAKKSASDASTSAREAATHAADAADSARAASTS 158 P + E + + + A K + +A + +DS Sbjct: 1333 TPAQTEDTDKEAPETSAESVPEPKPAADDVKPKEELDANAASTEAEKSGDSDSDSDNEDD 1392 Query: 159 AGQAASSAQSASSSAGTASTKATEASKSAAAAESSKSAAATSAGAAKTSETNASASLQSA 218 A ++ +S + ++ ++SK+ +ES+ S +S+ Sbjct: 1393 KPVAEATPESKPEAPKETPSEVPDSSKATEDSESADDDVEKSQPPEASSKVEPE-EATPE 1451 Query: 219 ATSASTATTKASEAATSARDAAASKEAAKSSETNASSSASSAASSATAAGNSAKAAKTSE 278 + +TA + ++ + + + S + S + + +A++A + A AK T+E Sbjct: 1452 TPAETTAPAPSPDSESDSESDSESDASEDSPDADKEPAAAAAKEESAAPTEEAKPEPTAE 1511 Query: 279 TNARSSETAAGQSASAAAGSKTAAASSASAASTSAGQASASATAAGKSAESAASSASTAT 338 A+ SET + A + SS S A + + + + A A +T Sbjct: 1512 ETAKPSETKPKEEAEGEPAATET--SSKEPDSEKAQDSESEDDSDDEEAPKAVEEKATPA 1569 Query: 339 TKAGEATEQASAAARSASAAKTSETNAKASETSAESSKTAAASSASSAASSASSASASKD 398 +A E A A A AK S+++ ++ E + A+ AS A + A K Sbjct: 1570 AEA-EKETPAEAEKPDADQAKNSDSDDESDEEEPPQAGDEKATPASEA-EKETPAEVEKP 1627 Query: 399 EATRQASAAKSSATTASTKATEAAGSATAAAQSKS-TAESAATRAETAAKRAEDIASAVA 457 + + + + + A AT+ A+ ++ E A+ D S Sbjct: 1628 DVEKDQDSDSDNESDDEDPPKAADDKATSEAEKEAPAPEVGKPDAQNDDDSDSDDDSDDG 1687 Query: 458 LEDASTTKKGIVQLSSATNSTSETLAATPK 487 + K + + T + A T K Sbjct: 1688 DASKAGDDKAVSEAEKPTPAPDVDKADTEK 1717 Score = 60.3 bits (144), Expect = 4e-07, Method: Composition-based stats. 
Identities = 47/355 (13%), Positives = 109/355 (30%), Gaps = 11/355 (3%) Query: 112 EVARNASAVAQNTAAAKKSASDASTSAREAATHAADAADSARAASTSAGQAASSAQSASS 171 E A + + AA +++ + E A + + A A T A + A+ + Sbjct: 170 EPAESEEKAEKTPAAEEETPAAEEAPKEEDKDTAKE--EDAPAEETPATEETPVAEPEET 227 Query: 172 SAGTASTKATEASKSAAAAESSKSAAATSAGAAKTSETNASASLQSAATSASTATTKASE 231 ++ + + A+ E + + + T A+ + ++ K E Sbjct: 228 PDKEETSTDEASKEETASKEVPTEETPAEDTSKEEASTEAAPTEEA---------PKEEE 278 Query: 232 AATSARDAAASKEAAKSSETNASSSASSAASSATAAGNSAKAAKTSETNARSSETAAGQS 291 + + + +++ + + A N KA ET A + A + Sbjct: 279 SKEGEASTEETAAKDEVPAPEEAAAEDDSTAKAETPANEEKAPAAEETPAEPEGSEAPKE 338 Query: 292 ASAAAGSKTAAASSASAASTSAGQASASATAAGKSAESAASSASTATTKAGEATEQASAA 351 +A +A++ K + + + ++ Sbjct: 339 EAAENDDGGGEDETAASVEKKPLTKKEKKKKREKEKKEKEKKEKEEKERKEKEKKEKEKK 398 Query: 352 ARSASAAKTSETNAKASETSAESSKTAAASSASSAASSASSASASKDEATRQASAAKSSA 411 + + + + + E A + + S + +S + +AS A+ S Sbjct: 399 EKEKKEKEKEKEKKEKEKKEKEKKAAAKSEKSHHHKSKHAHSSKDAKSSEDKASVAEGSE 458 Query: 412 TTASTKATEAAGSATAAAQSKSTAESAATRAETAAKRAEDIASAVALEDASTTKK 466 T + A AA+S + + A + A + A + AS +K Sbjct: 459 TAERSAPAADAKEEDKAAESDAETKPAGEVSSPADEAAPADDATPDEVKASEEEK 513 Score = 53.8 bits (127), Expect = 4e-05, Method: Composition-based stats. 
Identities = 68/417 (16%), Positives = 138/417 (33%), Gaps = 27/417 (6%) Query: 92 AMTEDDARPEALRRFELMVEEVARNASAVAQNTAAAKKSASDASTSAREAATHAADAADS 151 A D +PE A +AV + + ++ ++A + AA Sbjct: 1888 APQASDEKPEDTATIVEEEPAPAAETTAVDKEPESPEEKPAEAKSVEEPAAEEKPIEEAP 1947 Query: 152 ARAASTSAGQAASSAQSASSSAGTASTKATEASKSAAAAESSKSAAATSAGAAKTSETNA 211 A A A + A++ + S A A K+ E + +A A ++ + A+ + T Sbjct: 1948 AAAEEKPAEEPAAAEEKPSEEAPAAEEKSVEEAPTADRAPVAEKEPEPAEQPAEEAPTAE 2007 Query: 212 SASLQSAATSASTATTKASEAATSARDAAASKEAAKSSETNASSSASSAASSATAAGNSA 271 + +++ A A+ TSA + A + + + ++ +AA+ Sbjct: 2008 APAVEEKLAEEPAAGAAAAAEETSAEEQPAEEAPSVDEKLAEETATEAAAADKEEPAEEK 2067 Query: 272 KAAKTSETNARSSETAAGQSASAAAGSKTAAASSASAASTSAGQASASATAAGKSAESAA 331 A + A + A + + A ++T A + AA + A+ A + A + Sbjct: 2068 PAEEPVVEAAAEEKPAEEKPSEEAPTAETPTAEESDAAEKKPAEEKATEEPAAEVAATEK 2127 Query: 332 SSASTATTKAGEATEQASAAARSASAAKTSETNAKASETSAESSKTAAASSASSAASSAS 391 EA+E A+ AA A+ A+E K + A+ A + Sbjct: 2128 EPEPAEDKPVEEASEPATEAASEEKPAEEKPAEEPAAEEKQAEEKPTEEAPAAEPAEKPT 2187 Query: 392 SASASKDEATRQASAAKSSATTASTKATEAAGSATAA----------------------- 428 S ++ A + + +A A+ + A Sbjct: 2188 EESPAETTAEEKPTEEAPAAEVAAEEKPAEEAPTAEAVTEEKPAEEAPAAEKAEPEEAPS 2247 Query: 429 ----AQSKSTAESAATRAETAAKRAEDIASAVALEDASTTKKGIVQLSSATNSTSET 481 + ++T +A T E A ++ A A E+ ++ + +S Sbjct: 2248 TDDKPEEQATPAAAETTEEAPAAESKAEEEAKAAEEEPADEEPAAEAKPEEDSKPAD 2304 >UniRef50_C7XZV3 Predicted protein n=3 Tax=Lactobacillus jensenii RepID=C7XZV3_9LACO Length = 1195 Score = 70.7 bits (171), Expect = 3e-10, Method: Composition-based stats. 
Identities = 87/456 (19%), Positives = 168/456 (36%), Gaps = 21/456 (4%) Query: 111 EEVARNASAVAQNTAAAKKSASDASTSAREAATHAADAADSARAASTSAGQAASSAQSAS 170 + A NA+ A A + + A S EA H A A S A ++A A+ Sbjct: 710 ADQATNAAKSA--METANDNLNAAKASQSEAGDHLTAAQTVLDNAQRSLSGAVTTANDAA 767 Query: 171 SSAGTASTKATEASKSAAAAESSKSAAATSAGAAKTSETNASASLQSAATSASTATTKAS 230 + A A E+ +A +A++ A+ + A+T+ T A +L A +TA + Sbjct: 768 RTLSEAEKTAAESETNAKSAQTQAQKASKAQADAQTALTTAQKTLDHANDEMATANAALT 827 Query: 231 EAATSARDAAASKEAAKSSETNASSSASSAASSATAAGNSAKAAKTSETNARSSETAAGQ 290 A DA+ + AK++ N + + + A ++ +A+ + + AA Q Sbjct: 828 AANAKLADASRTLNDAKTALANYDAKHAQDKVALKEAQDAHQASIQASAITGFALNAARQ 887 Query: 291 SASAAAGSKTAAASSASAAS----------------TSAGQASASATAAGKSAESAASSA 334 A + TA + A S +A + + A + A+ Sbjct: 888 FFEQADTALTANKKVVAKAQATIDNMNAQINAVNKAVSDAKARLESNKSVHDAYAIATVN 947 Query: 335 STATTKAGEATEQASAAARSASAAKTSETNAKASETSAESSKTAAASSASSAASSASSAS 394 + KA E + AAA+ A ++ A+ ++ A S S + A + Sbjct: 948 NQDAQKALEKAQADLAAAKQVLAEAQAKLTEAKKSEEAKKAEEAKPSEDSKKSEEAKPSE 1007 Query: 395 ASKDEATRQASAAKSSATTASTKATEAAGSATAAAQSKSTAESAATRAETAAKRAEDIAS 454 SK ++S + TK +E + + A S+ + +S + +K++E+ Sbjct: 1008 DSKKSDDTKSSEDSKKSD--DTKPSEDSKKSEEAKPSEDSKKSEEAKPSEDSKKSEEAKP 1065 Query: 455 AVALEDASTTKKGIVQLSSATNSTSETLAATPKAVKSAYDNAEKRLQKDQNGADIPDKGC 514 + + + TK S +SE A + VK+ + A+ + + + P + Sbjct: 1066 SEDAKKSDDTKPSEDSKKSDDTKSSED-AKKSEEVKATEEPADIKAPEVKPVVIAPAQEV 1124 Query: 515 FLNNINAVSKTDFADKRGMRYVRVNAPAGATSGKYY 550 + TD + V P + ++Y Sbjct: 1125 ITSQKKVAVSTDKLTSAALTQRTVKMPIKSRVEEHY 1160 Score = 65.7 bits (158), Expect = 9e-09, Method: Composition-based stats. 
Identities = 87/437 (19%), Positives = 157/437 (35%), Gaps = 30/437 (6%) Query: 121 AQNTAAAKKSASDASTSAREAATHAADAADSARAASTSAGQAASSAQSASSSAGTASTKA 180 ++ +D + A + AA A +++ +A + A + Sbjct: 662 NEDINNETAKVTDLNAQLTVANDKMKALTSALTAAQAVAQKSSDDKANADQATNAAKSAM 721 Query: 181 TEASKSAAAAESSKSAAATSAGAAKTSETNASASLQSAATSASTATTKASEAATSARDAA 240 A+ + AA++S+S A AA+T NA SL A T+A+ A SEA + AA Sbjct: 722 ETANDNLNAAKASQSEAGDHLTAAQTVLDNAQRSLSGAVTTANDAARTLSEA---EKTAA 778 Query: 241 ASKEAAKSSETNASSSASSAASSATAAGNSAKA--------------------AKTSETN 280 S+ AKS++T A ++ + A + TA + K + Sbjct: 779 ESETNAKSAQTQAQKASKAQADAQTALTTAQKTLDHANDEMATANAALTAANAKLADASR 838 Query: 281 ARSSETAAGQSASAAAGSKTAAASSASAASTSAGQASASATAAGKSAESAASSASTATTK 340 + A + A A A A ++ QASA A +A A TA T Sbjct: 839 TLNDAKTALANYDAKHAQDKVALKEAQDAHQASIQASAITGFALNAARQFFEQADTALTA 898 Query: 341 AGEATEQASAAARSASAAKTSETNAKASETSAESSKTAAASSASSAASSASSA-----SA 395 + +A A + +A + A + + S + + + A + A A Sbjct: 899 NKKVVAKAQATIDNMNAQINAVNKAVSDAKARLESNKSVHDAYAIATVNNQDAQKALEKA 958 Query: 396 SKDEATRQASAAKSSATTASTKATEAAGSATAAAQSKSTAESAATRAETAAKRAEDIASA 455 D A + A++ A K +E A A A S+ + +S + +K+++D S+ Sbjct: 959 QADLAAAKQVLAEAQAKLTEAKKSEEAKKAEEAKPSEDSKKSEEAKPSEDSKKSDDTKSS 1018 Query: 456 VALEDASTTKKGIVQLSSATNSTSET--LAATPKAVKSAYDNAEKRLQKDQNGADIPDKG 513 + + TK S SE + K + + + E + +D +D Sbjct: 1019 EDSKKSDDTKPSEDSKKSEEAKPSEDSKKSEEAKPSEDSKKSEEAKPSEDAKKSDDTKPS 1078 Query: 514 CFLNNINAVSKTDFADK 530 + ++ A K Sbjct: 1079 EDSKKSDDTKSSEDAKK 1095 >UniRef50_Q4KTW7 Merozoite surface protein 3 alpha n=159 Tax=Plasmodium vivax RepID=Q4KTW7_PLAVI Length = 859 Score = 70.7 bits (171), Expect = 3e-10, Method: Composition-based stats. 
Identities = 83/389 (21%), Positives = 138/389 (35%), Gaps = 23/389 (5%) Query: 115 RNASAVAQNTAAAKKSASDASTSAREAATHAADAADSARAASTSAGQAASSAQSASSSAG 174 N V A ++ A + EA A A A A +A+ A A A Sbjct: 177 NNLENVKSQVKIADEALKKAKSKKNEAEIAAELVK--AVVAKEEAQKASDEAHKAYDKAQ 234 Query: 175 TASTKATEASKSAAAAESSKSAAATSAGAAKTSETNASASLQSAATSASTATTKASEAAT 234 A TKA +AS A A ++ A+ + + +T + NA + A A T +A Sbjct: 235 EAYTKAQKASDEAQKAHANVQQASKTKRSGETLKNNAETAANKAKEEVGKAETVVKDAKN 294 Query: 235 SARDAAASKEAAKSSETNASSSASSAASSATAAGNSAKAAKTSETNARSSETAAGQSASA 294 A + AK T+A ++A A A A+ AK A+ + A + Sbjct: 295 -----AKDLDEAKQKATDAETAAKDAKKEQVKAEIVAEVAK-----AKVPKEEADAAQKK 344 Query: 295 AAGSKTAAASSASAASTSAGQASASATAAGKSAESAASSASTATTKAGEATEQASAAARS 354 A +K A + Q A K A AS A+ A T+AG+ ++A +++ Sbjct: 345 AEEAKKIVDKIAQDSKVPEAQREA------KLATQTASKATEAATEAGKKAQEAEESSKE 398 Query: 355 ASAAKTSETNAKASETSAESSKTAAASSASSAASSASSASASKDEATRQASAAKSSATTA 414 A + K +AE + A ++ + A A A K +A A Sbjct: 399 AEEKAETSDAVKGKADAAEKAAGEAKKASIETEIAIEVAKAEVLNA-----EVKKTAQEA 453 Query: 415 STKATEAAGSATAAAQSKSTAESAATRAETAAKRAEDIASAVALEDASTTKKGIVQLSSA 474 ATEA A A + A++ +AE + + + E+ + + A Sbjct: 454 EKDATEAKEQAEKAKAAAEEAKTHGEKAEKVGESTKAHSDEAQQENKNAKDASEEAENRA 513 Query: 475 TNSTSETLAATPKAVKSAYDNAEKRLQKD 503 ++ E A ++ + D Sbjct: 514 VDALEEAYAVEAHLARTKNAAESAKSATD 542 Score = 66.9 bits (161), Expect = 5e-09, Method: Composition-based stats. 
Identities = 77/359 (21%), Positives = 126/359 (35%), Gaps = 8/359 (2%) Query: 107 ELMVEEVARNASAVAQNTAAAKKSASDASTSAREAATHAADAADSARAASTSAGQAASSA 166 + ++ + A A A AS A++A + A+ + R+ T A ++A Sbjct: 216 KEEAQKASDEAHKAYDKAQEAYTKAQKASDEAQKAHANVQQASKTKRSGETLKNNAETAA 275 Query: 167 QSASSSAGTASTKATEASKSAA--AAESSKSAAATSAGAAKTSETNASASLQSAATSAST 224 A G A T +A + A+ + A T+A AK + A + A Sbjct: 276 NKAKEEVGKAETVVKDAKNAKDLDEAKQKATDAETAAKDAKKEQVKAEIVAEVAKAKVPK 335 Query: 225 ATTKASEAATSARDAAASKEAAKSSETNASSSASSAASSATAAGNSAKAAKTSETNARSS 284 A++ K A S A A A +A+ A +A A A S Sbjct: 336 EEADAAQKKAEEAKKIVDKIAQDSKVPEAQREAKLATQTASKATEAATEAGKKAQEAEES 395 Query: 285 ETAAGQSASAAAGSKTAAASSASAASTSAGQASASATA-----AGKSAESAASSASTATT 339 A + A + K A ++ AA + + + A A +A A Sbjct: 396 SKEAEEKAETSDAVKGKADAAEKAAGEAKKASIETEIAIEVAKAEVLNAEVKKTAQEAEK 455 Query: 340 KAGEATEQASAAARSASAAKTSETNAKASETSAESSKTAAASSASSAASSASSASASKDE 399 A EA EQA A +A AKT A+ S ++ A +A ++ A + Sbjct: 456 DATEAKEQAEKAKAAAEEAKTHGEKAEKVGESTKAHSDEAQQENKNAKDASEEAENRAVD 515 Query: 400 ATRQASAAKSSATTASTKATEAAGSATAAAQSKSTAESAATRAETAAKRAEDIASAVAL 458 A +A A ++ A A SAT ++ + E A A A ++ A + Sbjct: 516 ALEEAYAVEAHLARTKNAAESAK-SATDMSELEKAKEEAIDAANIAHQKWLKATQAATI 573 >UniRef50_D2RBM3 Putative uncharacterized protein n=1 Tax=Gardnerella vaginalis 409-05 RepID=D2RBM3_GARVA Length = 656 Score = 70.7 bits (171), Expect = 3e-10, Method: Composition-based stats. 
Identities = 75/393 (19%), Positives = 144/393 (36%), Gaps = 14/393 (3%) Query: 97 DARPEALRRFELMVEEVARNASAVAQNTAAAKKSASDASTSAREAATHAADAADSARAAS 156 +A + + E V+RNA A A A A+ A++AA+ A+ AA + + Sbjct: 154 NALTDKVSHLIEQNETVSRNAQNAADAANNATSIAQQAAQQAKDAASEASQAAQNTQNVI 213 Query: 157 TSAGQAASSAQSASSSAGTASTKA--------TEASKSAAAAESSKSAAATSAGAAKTSE 208 + A + A ++ +A A+ +A +AA A S A A +A + Sbjct: 214 SHATEVAQQCDASKQTADQAAKRADDAVSGLKQTVQNAAADAASKVQQAVERANSATQAV 273 Query: 209 TNASASLQSAATSASTATTKASEAATSARDAAASKEAAKSSETNASSSASSAASSATAAG 268 ++A T E A+ A + ++ A+ + + + A Sbjct: 274 DAVREKTEAANKQTETDLAALREEVVKAQRAGFTASSSAQKCDEAAQAYRNVSGEVAQAK 333 Query: 269 NSAKAAKTSETNARSSETAAGQSASAAAGSKTAAASSASAASTSAGQASASATAAGKSAE 328 +++ A + A + + + + A ++ A SA K Sbjct: 334 QTSEQAVEAANQALHTAQESAAAVAQAQSVLDQVKDASETAKRVV-----SAVEELKQTN 388 Query: 329 SAASSASTATTKAGEATEQASAAARSASAAKTSETNAKASETSAESSK-TAAASSASSAA 387 +AA A+ A A+ A +A++ S A + + + + Sbjct: 389 NAALEATRTANAQASAAADAAGKANNATSTANSAAQAANDAAGKVTQTLQESETRFKAVE 448 Query: 388 SSASSASASKDEATRQASAAKSSATTASTKATEAAGSATAAAQSKSTAESAATRAETAAK 447 +A+ A + A A AA+S+A A +KA +AAGSA A + ++A A + Sbjct: 449 QAANDAKSVAGTANSTAEAARSTAEQAQSKANDAAGSAQRAQNTANSAIEATDNNKNRID 508 Query: 448 RAEDIASAVALEDASTTKKGIVQLSSATNSTSE 480 E S++ ++ K +A+ + S Sbjct: 509 SMESDVSSLKNSCSAAQSKANDAAQTASKAQSV 541 >UniRef50_UPI000023F701 hypothetical protein FG10084.1 n=1 Tax=Gibberella zeae PH-1 RepID=UPI000023F701 Length = 4221 Score = 70.3 bits (170), Expect = 4e-10, Method: Composition-based stats. 
Identities = 65/440 (14%), Positives = 131/440 (29%), Gaps = 22/440 (5%) Query: 93 MTEDDARPEALRRFELMVE-------------------EVARNASAVAQNTAAAKKSASD 133 M E PE E E + A AA+K+ + Sbjct: 1043 MMEPMIEPEVPVEPTTEPEVSEKASNEPTSEPKTAAKPEADNPTTESATGPEAAEKALPN 1102 Query: 134 ASTSAREAATHAADAADSARAASTSAGQAASSAQSASSSAGTASTKATEASKSAAAAESS 193 ST+ +E T + S +T + +S + A + ++ +E+ Sbjct: 1103 ESTTEQEPETTVDSESKSVTGPATEPEATEKAIESVAEPEDAAKPETVVKPETVIDSEAP 1162 Query: 194 KSAAATSAGAAKTSETNASASLQSAATSASTATTKASEAATSARDAAASKEAAKSSETNA 253 + T+E + ++ S + + A+EA + + E SE Sbjct: 1163 TESEVVETAVEPTAEPEIAPEPETVIDSEAPSDESAAEATPEPKVVEKAIEPVSESEAAV 1222 Query: 254 SSSASSAASSATAAGNSAKAAKTSETNARSSETAAGQSASAAAGSKTAAASSASAASTSA 313 + A+ S + + + + +A A + + + A A Sbjct: 1223 ENEAARPESDDSVVQPTTEPEVAEKVAVDEPALEPELETAAEAKASVDSKPAVEPAVEPA 1282 Query: 314 GQASASATAAGKSAESAASSASTATTKAGEATEQASAAARSASAAKTSETNAKASETSAE 373 + S A K+AE + E A+ + ++E A+ A+ Sbjct: 1283 VEPSTELEVAEKAAEQDYEPEVVEKAASDELPSALETASDQPATDPSAEPTAEPVLKPAD 1342 Query: 374 SSKTAAASSAS-SAASSASSASASKDEATRQASA--AKSSATTASTKATEAAGSATAAAQ 430 SK + +S + E A A+ A+ + A E A A Sbjct: 1343 DSKPGTEPEVDLNKPASEQALDEPLAELKPAADTEAAEKVASDEPSPAPETASDQPAVKP 1402 Query: 431 SKSTAESAATRAETAAKRAEDIASAVALEDASTTKKGIVQLSSATNSTSETLAATPKAVK 490 + E+ + + R E + + I + S + + + Sbjct: 1403 TAVEPETDPDKQASEQARDESSTEIKLAAEPEVDEPAIETTAEKAASDEPSPTPETASDQ 1462 Query: 491 SAYDNAEKRLQKDQNGADIP 510 A + A K + ++ +D P Sbjct: 1463 PATEPAVKSAPESESVSDKP 1482 Score = 67.6 bits (163), Expect = 3e-09, Method: Composition-based stats. 
Identities = 53/404 (13%), Positives = 112/404 (27%), Gaps = 12/404 (2%) Query: 81 SQPGTLNDFLGAMTEDDARPEALRRFELMVEEVARNASAVAQNTAAAKKSASDASTS--- 137 + P T + E A PE + E +V+ S + + + + Sbjct: 1122 TGPATEPEATEKAIESVAEPEDAAKPETVVKPETVIDSEAPTESEVVETAVEPTAEPEIA 1181 Query: 138 --AREAATHAADAADSARAASTSAGQAASSAQSASSSAGTASTKATEASKSAAAAESSKS 195 A + +SA A+ + + S S +A + + + Sbjct: 1182 PEPETVIDSEAPSDESAAEATPEPKVVEKAIEPVSESEAAVENEAARPESDDSVVQPTTE 1241 Query: 196 AAATSAGAAKTSETNASASLQSAATSASTATTKASEAATSARDAAASKEAAKSSETNASS 255 A + A ++ + A A + + E A+ A+ Sbjct: 1242 PEVAEKVAVDEPALEPELETAAEAKASVDSKPAVEPAVEPAVEPSTELEVAE----KAAE 1297 Query: 256 SASSAASSATAAGNSAKAAKTSETNARSSETAAGQSASAAAGSKTAAASSASAASTSAGQ 315 AA + +A + ++ +++ +A +A + Sbjct: 1298 QDYEPEVVEKAASDELPSALETASDQPATDPSAEPTAEPVLKPADDSKPGTEPEVDLNKP 1357 Query: 316 ASASATAAGKSAESAASSASTATTKAGEATEQASAAARSASAAKTSETNAKASETSAESS 375 AS A + A+ A A + A A A K + + S Sbjct: 1358 ASEQALDEPLAELKPAADTEAAEKVASDEPSPAPETASDQPAVKPTAVEPETDPDKQASE 1417 Query: 376 KTAAASSASSAASSASSASASKDEATRQ---ASAAKSSATTASTKATEAAGSATAAAQSK 432 + SS ++ E T + + + TAS + +A Sbjct: 1418 QARDESSTEIKLAAEPEVDEPAIETTAEKAASDEPSPTPETASDQPATEPAVKSAPESES 1477 Query: 433 STAESAATRAETAAKRAEDIASAVALEDASTTKKGIVQLSSATN 476 + + A +E++ K + S + T+ G+ Sbjct: 1478 VSDKPPAVESESSTKSEPTVESVKETPEVVKTESGVEPEQVVVP 1521 Score = 63.4 bits (152), Expect = 4e-08, Method: Composition-based stats. 
Identities = 61/379 (16%), Positives = 111/379 (29%), Gaps = 10/379 (2%) Query: 92 AMTEDDARPEALRRFELMVEEVARNASAVAQNTAAAKKSASDASTSAREAATHAADAADS 151 A ED PEA EE+ ++ A + + A + D Sbjct: 2056 ASDEDQTAPEAAPSTSEKSEEILVADVDPGESEAPSAEPDGAAEVDDKHGGATPPDGVLE 2115 Query: 152 ARAASTSAGQAASSAQSASSSAGTA----STKATEASKSAAAAESSKSAAATSAGAAKTS 207 A S ++ + A + ++ G + T E + A + A ++ Sbjct: 2116 QTVAEVSDNESDTPAPAPATEDGESKEKDKTAEQEPTIEPPATDEKPEETAQASNDEPEP 2175 Query: 208 ETNASASLQSAATSASTATTKASEAATSARDAAASKEAAKSSETNASSSASSAASSATAA 267 S + A T K + +++A S+ + + A + Sbjct: 2176 PVAPSDEPKEAETPEVALAEKPTVVGDDSKEAEVSEAEPVVEVPVQDNPPEAEAEAEVEP 2235 Query: 268 GNSAKAAKTSETNARSSETAAGQSASAAAGSKTAAASSASAASTSAGQA-SASATAAGKS 326 A ++ + A + +A + +A +A + S AG+ Sbjct: 2236 VKEASVSEEKLAEEEQTAQAQEEPQQGSAPADNTPEEEEKSAKDEHIEAPTTSDDKAGED 2295 Query: 327 AESAASSASTATTKAGEATEQASAAARSASAAKTSETNAKASETSAESSKTAAASSASSA 386 +SA ++A E + AA+ S T+ + AE + A Sbjct: 2296 VKSAEEPPASAPIAEDELNQSPPTAAQEESKETTTAEETTEATPVAEPVSASEEPPAGDT 2355 Query: 387 ASSASSASASKDEATRQASAA-----KSSATTASTKATEAAGSATAAAQSKSTAESAATR 441 S+ A A A+ A S E A K +SA Sbjct: 2356 VSTDDKAVDDPKNAEDVATPAVQEPCSPEGEGESKPVDETPAPEPEDATVKPIDDSADAP 2415 Query: 442 AETAAKRAEDIASAVALED 460 A K+ ++ A ED Sbjct: 2416 PAEAEKKEDEPVKATTKED 2434 Score = 61.5 bits (147), Expect = 2e-07, Method: Composition-based stats. 
Identities = 75/483 (15%), Positives = 155/483 (32%), Gaps = 48/483 (9%) Query: 82 QPGTLNDFLGAMTEDDARPEALRR---------FELMVEEVARN------ASAVAQNTAA 126 +P T + E D PE + + E ++ A + A V + Sbjct: 1284 EPSTELEVAEKAAEQDYEPEVVEKAASDELPSALETASDQPATDPSAEPTAEPVLKPADD 1343 Query: 127 AKKSASDASTSAREAATHAADAADSARAASTSAGQAASSAQSASSSAGTASTKATEASKS 186 +K + A+ A D + + A A S A ++ + Sbjct: 1344 SKPGTEPEVDLNKPASEQALDEPLAELKPAADTEAAEKVASDEPSPAPETASDQPAVKPT 1403 Query: 187 A---------AAAESSKSAAATSAGAAKTSETNASASLQSAATSASTATTKASEAATSAR 237 A A+E ++ ++T A E + A +A +AS + E A+ Sbjct: 1404 AVEPETDPDKQASEQARDESSTEIKLAAEPEVDEPAIETTAEKAASDEPSPTPETASDQP 1463 Query: 238 DAAASKEAAKSSETNASSSASSAASSATAAGNSAKAAKTSETNARS-------------- 283 + ++A SE+ + + + S+T + + ++ K + ++ Sbjct: 1464 ATEPAVKSAPESESVSDKPPAVESESSTKSEPTVESVKETPEVVKTESGVEPEQVVVPKD 1523 Query: 284 -------SETAAGQSASAAAGSKTAAASSASAASTSAGQASASATAAGKSAESAASSAST 336 E+ + + +G + + A+ S + + GKS E A S+ S Sbjct: 1524 DVSSEDEDESEFESESESESGVEPDNGAPEEVANNKDIDNSDTESLEGKSTERATSADSE 1583 Query: 337 ATTKAGEATEQASAAARSASAAKTSETNAKASETSAESSKTAAASSASSAASSASSASAS 396 + E ++ A S K E + +AS AE+ A++ + S A+ Sbjct: 1584 VDSAKPTIREMSNEAKEEPSPLKKKEDSQEAS---AETEAPASSVEPENNDVEVSEPKAA 1640 Query: 397 KDEATRQASAAKSSATTASTKATEAAGSATAAAQSKSTAESAATRAETAAKRAEDIASAV 456 E A S + A++++ + + S+ + + + + A+ Sbjct: 1641 ATEGKDNTVVANMSESEATSESEDEDEDEDDDSDSEHQPAESDKATDALPAKIAEEEPAL 1700 Query: 457 ALEDASTTKKGIVQLSSATNSTSETLAATPKAVKSAYDNAEKRLQKDQNGADIPDKGCFL 516 A +D + K S T ST E A + D + Sbjct: 1701 AEKDTLDSAKEDDSDSPITESTDEKKDKPSPPETPAEKANAPATPSHEEAEDSTETVELA 1760 Query: 517 NNI 519 + + Sbjct: 1761 DTV 1763 Score = 54.1 bits (128), Expect = 3e-05, Method: Composition-based stats. 
Identities = 44/383 (11%), Positives = 101/383 (26%), Gaps = 8/383 (2%) Query: 112 EVARNASAVAQNTAAAKKSASDASTSAREAATHAADAADSARAASTSAGQAASSAQSASS 171 + + + + A A+ S+ D+ S EA + D A ++ Sbjct: 1843 KTSHAPDTLDTDEANAQVSSEDSKGSVIEAEDAVQEPLDGGAQDKPMPDNEAPKEPESND 1902 Query: 172 SAGTASTKATEASKSAAAAESSKSAAATSAGAAKTSETNASASLQSAATSASTATTKASE 231 + A+ A+ + + + + + + + A AT Sbjct: 1903 KNSPHTDAEETQEPVASGADDKEPEESPLISTKEKAALDEPIAEEEAKQPIDDATGHDE- 1961 Query: 232 AATSARDAAASKEAAKSSETNASSSASSAASSATAAGNSAKAAKTSETNARSSETAAGQS 291 K A + ++ + TA +++ E ++ Sbjct: 1962 -KPETESPPLDKPANDEPALDDKDLGATRPAQQTAEDVEMCDEDSADEAPAVDEK--QET 2018 Query: 292 ASAAAGSKTAAASSASAASTSAGQASASATAAGKSAESAASSASTATTKAGEATEQASAA 351 SK + + A +AE++ + + + Sbjct: 2019 PGDPVPSKPEEVEDTTLEEKPVVETLADDKDIQDAAEASDEDQTAPEAAPSTSEKSEEIL 2078 Query: 352 ARSASAAKTSETNAKA-SETSAESSKTAAASSASSAASSASSASASKDEATRQASAAKSS 410 ++ +A+ + A + + S ++ + A A + Sbjct: 2079 VADVDPGESEAPSAEPDGAAEVDDKHGGATPPDGVLEQTVAEVSDNESDTPAPAPATEDG 2138 Query: 411 ATTASTKATEAAGSATAAAQSKSTAESAAT---RAETAAKRAEDIASAVALEDASTTKKG 467 + K E + A + E+A E +++ A E A K Sbjct: 2139 ESKEKDKTAEQEPTIEPPATDEKPEETAQASNDEPEPPVAPSDEPKEAETPEVALAEKPT 2198 Query: 468 IVQLSSATNSTSETLAATPKAVK 490 +V S SE V+ Sbjct: 2199 VVGDDSKEAEVSEAEPVVEVPVQ 2221 >UniRef50_C0QBA1 Putative plasmin-sensitive surface protein (Pls family protein) n=1 Tax=Desulfobacterium autotrophicum HRM2 RepID=C0QBA1_DESAH Length = 567 Score = 70.3 bits (170), Expect = 4e-10, Method: Composition-based stats. 
Identities = 84/393 (21%), Positives = 143/393 (36%), Gaps = 12/393 (3%) Query: 127 AKKSASDASTSAREAATHAADAADSARAASTSAGQAASSAQSASSSAGTASTKATEASKS 186 + K ++ T+ ++ T + + + A ++ ++ + K E + Sbjct: 20 SDKKMTNNETAKKQV-TKKTPSKTTDKKAEKQQPAEEAAKKAKEQRLAEEAAKKAEEKRL 78 Query: 187 AAAAESSKSAAATSAGAAKTSETNASASLQSAATSASTATTKASEAATSARDAAASKEAA 246 A A + AAK +E AA A A + A + A Sbjct: 79 ADEAAKKAEEKRLADEAAKKAEE--KRLADEAAKKAEEKRLADEAAKKAEEKRLADEAAK 136 Query: 247 KSSETNASSSASSAASSATAAGNSAKAAKTSETNARSSETAAGQSASAAAGSKTAAASSA 306 K+ E + A+ A A +AK A+ ++++ A + + A K A Sbjct: 137 KAEEKRLADEAAKKAEEKRLADEAAKKAEEKRLADQAAKKAEEKRLADEAAKKAEEKRLA 196 Query: 307 SAASTSAGQASASATAAGKSAESAASSASTATTKAGEATEQASAAARSASAAKTSETNAK 366 A+ A + + AA K+ E A A KA E AA ++ E K Sbjct: 197 DEAAKKAEEKRLADEAAKKAEEK--RLADEAAKKAEEKRLADEAAKKAEEKRLADEAAKK 254 Query: 367 ASETS-AESSKTAAASS--ASSAASSASSASASKDEATRQASAAKSSATTASTKATEAAG 423 A E A+ + A A AA A DEA ++A K A A+ KA E Sbjct: 255 AEEKRLADEAAKKAEEKRLADEAAKKAEE-KRLADEAAKKAEE-KRLADEAAKKAEEKRL 312 Query: 424 SATAAAQSKSTAESAATRAETAAKRAEDIASAVALEDASTTKKGIVQLSSATNSTSETLA 483 + AA ++ AE AK AE AS V ++ +T+ +++ + L Sbjct: 313 ADEAAKRAIEIKRVIREIAEQEAKEAESTASQVQKQN-NTSDAKPKPMAAILAGIAFVLL 371 Query: 484 ATPKAVKSAYDNAEKRLQKDQNGADIPDKGCFL 516 + AV ++ +N+++ K N KG F Sbjct: 372 IS-LAVGASINNSKRFYIKQTNAGVEIWKGQFS 403 Score = 69.9 bits (169), Expect = 5e-10, Method: Composition-based stats. 
Identities = 63/291 (21%), Positives = 100/291 (34%), Gaps = 10/291 (3%) Query: 108 LMVEEVARNASAVAQNTAAAKKSASDASTSAREAATHAADAADSARAASTSAGQAASSAQ 167 + +E A+ A AAKK+ A EAA A + + AA + + A Sbjct: 77 RLADEAAKKAEEKRLADEAAKKAEEK--RLADEAAKKAEEKRLADEAAKKA--EEKRLAD 132 Query: 168 SASSSAGTASTKATEASKSAAAAESSKSAAATSAGAAKTSETNASASLQSAATSASTATT 227 A+ A A K+ + AA A + ++ A + + A A Sbjct: 133 EAAKKAEEKRLADEAAKKAEE--KRLADEAAKKAEEKRLADQAAKKAEE--KRLADEAAK 188 Query: 228 KASEAATSARDAAASKEAAKSSETNASSSASSAASSATAAGNSAKAAKTSETNARSSETA 287 KA E + D AA K K A+ A + AA + + E ++ E Sbjct: 189 KAEEKRLA--DEAAKKAEEKRLADEAAKKAEEKRLADEAAKKAEEKRLADEAAKKAEEKR 246 Query: 288 AGQSASAAAGSKTAAASSASAASTSAGQASASATAAGKSAESAASSASTATTKAGEATEQ 347 A+ A K A +A A A+ A K A+ + A EA ++ Sbjct: 247 LADEAAKKAEEKRLADEAAKKAEEKRLADEAAKKAEEKRLADEAAKKAEEKRLADEAAKK 306 Query: 348 ASAAARSASAAKTSETNAKASETSAESSKTAAASSASSAASSASSASASKD 398 A + AAK + + AE A S+AS +++ A Sbjct: 307 AEEKRLADEAAKRAIEIKRVIREIAEQEAKEAESTASQVQKQNNTSDAKPK 357 Score = 69.6 bits (168), Expect = 7e-10, Method: Composition-based stats. 
Identities = 64/360 (17%), Positives = 107/360 (29%), Gaps = 11/360 (3%) Query: 107 ELMVEEVARNASAVAQNTAAAKKSASDASTSAREAATHAADAADSARAASTSAGQAASSA 166 + M + + ++ A EAA + + A Sbjct: 22 KKMTNNETAKKQVTKKTPSKTTDKKAEKQQPAEEAA--KKAKEQRLAEEAAKKAEEKRLA 79 Query: 167 QSASSSAGT-----ASTKATEASKSAAAAESSKSAAATSAGAAKTSETNASASLQSAATS 221 A+ A + K E + A A + AAK +E AA Sbjct: 80 DEAAKKAEEKRLADEAAKKAEEKRLADEAAKKAEEKRLADEAAKKAEE--KRLADEAAKK 137 Query: 222 ASTATTKASEAATSARDAAASKEAAKSSETNASSSASSAASSATAAGNSAKAAKTSETNA 281 A A + A + A K+ E + A+ A A +AK A+ Sbjct: 138 AEEKRLADEAAKKAEEKRLADEAAKKAEEKRLADQAAKKAEEKRLADEAAKKAEEKRLAD 197 Query: 282 RSSETAAGQSASAAAGSKTAAASSASAASTSAGQASASATAAGKSAESAASSASTATTKA 341 +++ A + + A K A A+ A + + AA K+ E A A KA Sbjct: 198 EAAKKAEEKRLADEAAKKAEEKRLADEAAKKAEEKRLADEAAKKAEEK--RLADEAAKKA 255 Query: 342 GEATEQASAAARSASAAKTSETNAKASETSAESSKTAAASSASSAASSASSASASKDEAT 401 E AA ++ E KA E A A +A A + Sbjct: 256 EEKRLADEAAKKAEEKRLADEAAKKAEEKRLADEAAKKAEEKRLADEAAKKAEEKRLADE 315 Query: 402 RQASAAKSSATTASTKATEAAGSATAAAQSKSTAESAATRAETAAKRAEDIASAVALEDA 461 A + EA + + A+Q + ++ + + A IA + + A Sbjct: 316 AAKRAIEIKRVIREIAEQEAKEAESTASQVQKQNNTSDAKPKPMAAILAGIAFVLLISLA 375 >UniRef50_B8CF96 Predicted protein n=2 Tax=Thalassiosira pseudonana RepID=B8CF96_THAPS Length = 2232 Score = 70.3 bits (170), Expect = 5e-10, Method: Composition-based stats. 
Identities = 77/444 (17%), Positives = 143/444 (32%), Gaps = 33/444 (7%) Query: 102 ALRRFELMVEEVARNASAVAQNTAAAKKSASDASTSAREAATHAADAADSARAASTSAGQ 161 + +F + E A A + KS A + ++ A +S Sbjct: 1554 LVSKFNQIDPEEHAQALARVEQMKEECKSMKGLKDKAEKDSSSAKGLVAQLNKEISSQKA 1613 Query: 162 AASSAQSASSSAGTASTKATEASKSAAAAESSKSAAATSAGAAKTSETNASASL-----Q 216 + + ++A K ++A++ + + +++ AA A A+ +NA S + Sbjct: 1614 SMDAFRTALEKTKAEKEKLSKAAQLNDSKKVAEAQAALKASKAELDASNARLSNFKDMLR 1673 Query: 217 SAATSASTATTKASEAATSARDAAASKEAAKSSETNASSSASSAASSATAAGNSAKAAKT 276 +A +A ++ + K A SS + SAT A + Sbjct: 1674 KMQVGLKSAKDAEDKAKSTEKALQKEITELKHKLAVAESSQTKGGESATEAKVEDQTPVV 1733 Query: 277 SETNARSSETAAGQSA-------------------SAAAGSKTAAASSASAASTSAGQ-A 316 + ++ET A A +K A A+ A+ + Sbjct: 1734 EQPKTVATETPAVPEKKIETTTTTTTTASDVIKPDETADATKAAPVVDATEATPLVEEKP 1793 Query: 317 SASATAAGKSAESAASSASTATTKAGEATEQASAAARSASAAKTSETNAKASETSAESSK 376 S A ++ +++ + AT K G+ + A +A + +S K S T ++ Sbjct: 1794 SEVAASSEMKVDASEKAPVAATKKKGKKRKAAVPQKNAAPSDSSSAPPQKKSATQDAAAS 1853 Query: 377 TAAASSASSAASSASSASASKDEATRQASAAKSSA------TTASTKATEAAGSATAAAQ 430 A+ + A+ S S + +K T AST A A + AA Sbjct: 1854 AKASGEKTGASESIKSTTHTKQAPVPPKKKIVLKKKAAAKETPASTPA--AVTNIAPAAT 1911 Query: 431 SKSTAESAATRAETAAKRAEDIASAVALEDASTTKKGIVQLSSATNSTSETLAATPKAVK 490 +K+TA S A A + KK + T + E K Sbjct: 1912 AKTTAASTKEEEMKLKMLLLKKRKAEAAQRLQVAKKKKETEAIETAAAVEESEKISKESA 1971 Query: 491 SAYDNAEKRLQKDQNGADIPDKGC 514 + +A + K+ D P Sbjct: 1972 PSITSASEDTPKETESNDKPVAKA 1995 Score = 68.0 bits (164), Expect = 2e-09, Method: Composition-based stats. 
Identities = 84/447 (18%), Positives = 143/447 (31%), Gaps = 14/447 (3%) Query: 107 ELMVEEVARNASAVAQNTAAA-KKSASDASTSAREAATHAADAADSARAASTSAGQAASS 165 E+ ++ + +A A A K+ S A+ A AA A A A A Sbjct: 1607 EISSQKASMDAFRTALEKTKAEKEKLSKAAQLNDSKKVAEAQAALKASKAELDASNA--R 1664 Query: 166 AQSASSSAGTASTKATEASKSAAAAESSKSAAATSAGAAKTSETNASASLQSAATSASTA 225 + A + A+S++ A K A +S SA+ A Sbjct: 1665 LSNFKDMLRKMQVGLKSAKDAEDKAKSTEKALQKEITELKHKLAVAESSQTKGGESATEA 1724 Query: 226 -----TTKASEAATSARDAAASKEAAKSSETNASSSASSAASSATAAGNSAKAAKTSETN 280 T + T A + A E + T +++AS A + A T Sbjct: 1725 KVEDQTPVVEQPKTVATETPAVPEKKIETTTTTTTTASDVIKPDETADATKAAPVVDATE 1784 Query: 281 ARSSETAAGQSASAAAGSKTAAASSASAASTSAGQASASATAAGKSAESAASSASTATTK 340 A +A++ K A+ A +A + A ++AA S S++ Sbjct: 1785 ATPLVEEKPSEVAASSEMKVDASEK---APVAATKKKGKKRKAAVPQKNAAPSDSSSAPP 1841 Query: 341 AGEATEQASAAARSASAAKTSETNAKASETSAESSKTAAASSASSAASSASSASASKDEA 400 ++ Q +AA+ AS KT + + S T + + A++ Sbjct: 1842 QKKSATQDAAASAKASGEKTGASESIKSTTHTKQAPVP-PKKKIVLKKKAAAKETPASTP 1900 Query: 401 TRQASAAKSSATTASTKATEAAGSATAAAQSKSTAESAATRAETAAKRAEDIASAVALED 460 + A ++ + +T+ K AA R + A K+ E A A Sbjct: 1901 AAVTNIAPAATAKTTAASTKEEEMKLKMLLLKKRKAEAAQRLQVAKKKKETEAIETAAAV 1960 Query: 461 ASTTKKGIVQLSSATNSTSETLAATPKAVKSAYDNAEKRLQKDQNGADIPDKGCFLNNIN 520 + K S T S SE ++ A L + +P+K + Sbjct: 1961 EESEKISKESAPSIT-SASEDTPKETESNDKPVAKAVPALSSPRPLPPVPEKDEPKEEMR 2019 Query: 521 AVSKTDFADKRGMRYV-RVNAPAGATS 546 + DK+ V + AGAT Sbjct: 2020 EPTTAAVDDKKDTAAVGSSTSVAGATG 2046 >UniRef50_A5KAV8 Merozoite surface protein 3 (MSP3), putative n=2 Tax=Plasmodium vivax RepID=A5KAV8_PLAVI Length = 1243 Score = 70.3 bits (170), Expect = 5e-10, Method: Composition-based stats. 
Identities = 76/372 (20%), Positives = 122/372 (32%), Gaps = 28/372 (7%) Query: 117 ASAVAQNTAAAKKSASDASTSAREAATHAADAADSARAASTSAGQAASSAQSASSSAGTA 176 + A++ + EAA ++ A A+ A + A+ AQ A+ A A Sbjct: 80 SGQSEDAIVKAQQEDGEVEGQQDEAALQ-SEDEKEAENAAEEAQKFATQAQGAAEQAAQA 138 Query: 177 STKATEASKSAAAAESSKSAAATSAGAAKTSETNASASLQSAATSASTATTKASEAATSA 236 + A + +K A A AK N S +A A A KA E A Sbjct: 139 AQAAQDEAKKITENTEKIEEAVKQATDAKEEAENESREANNAKEEADAAARKAKENKEDA 198 Query: 237 RDAAASKEAAKSSETNASSSASSAASSATAA----------GNSAKAAKTSETNARSSET 286 + +AA A++ A +A A A +AK A+ +E E Sbjct: 199 VNQKKIAQAALERAKTAATKAQTAKGKAEKALETTKAEVAKELAAKEAREAEKTRAVEEA 258 Query: 287 AAGQSASAAAGSKTAAASSA---------------SAASTSAGQASASATAAGKSAESAA 331 + A+ + + +A AT A + AE+ + Sbjct: 259 QQIAKQAEEQLKTATKATQEAAQAAQAAQDEAKKITENTEKIEEAVKQATDAKEEAENES 318 Query: 332 SSASTATTKAGEATEQASAAARSASAAKTSETNAKASETSAESSKTAAASSASSAASSAS 391 A+ A +A A +A A K +A T+A ++ A A A Sbjct: 319 REANNAKEEADAAARKAKENKEDAVNQKKIAQSALDKATNAATNAQKAKEKAEIALERTK 378 Query: 392 S--ASASKDEATRQASAAKSSATTASTKATEAAGSATAAAQSKSTAESAATRAETAAKRA 449 + + + +A AA+ A S K T A A + AE A +A+ A Sbjct: 379 AEVSKELAKKEVLEAEAAQKEAKDISDKMTIANKPVNKANLASKRAEEALEKAKKHVATA 438 Query: 450 EDIASAVALEDA 461 E +A Sbjct: 439 ESATEEAKGANA 450 >UniRef50_A2ET23 Putative uncharacterized protein n=1 Tax=Trichomonas vaginalis RepID=A2ET23_TRIVA Length = 2722 Score = 68.8 bits (166), Expect = 1e-09, Method: Composition-based stats. 
Identities = 45/417 (10%), Positives = 108/417 (25%), Gaps = 2/417 (0%) Query: 94 TEDDARPEALRRFELMVEEVARNASAVAQNTAAAKKSASDASTSAREAATHAADAADSAR 153 TE A PE + ++E V ++ A AK+ + E ++ A A Sbjct: 2058 TEQQAAPEVSEEEQFLIELVKDFITSEAAVFERAKQESQHVVKLEEEIISNEKRAQMEAE 2117 Query: 154 AASTSAGQAASSAQSASSSAGTASTKATEASKSAAAAESSKSAAATSAGAAKTS--ETNA 211 A +A+ + + ++ + + + E Sbjct: 2118 RAEKKRIAEQKAAEKRAKKEQKKIQQLKRLEEAVKKEIENSNNKTPKNTKEEQRKHEEEL 2177 Query: 212 SASLQSAATSASTATTKASEAATSARDAAASKEAAKSSETNASSSASSAASSATAAGNSA 271 A + A + + R+ A + A + +E +A A N Sbjct: 2178 KAQQEKEAEKEFIKSLMDQQKENELREKEAKEAAQRKAEEEVRKAAEEERKQRENAENFK 2237 Query: 272 KAAKTSETNARSSETAAGQSASAAAGSKTAAASSASAASTSAGQASASATAAGKSAESAA 331 K + + + + A A+ + + +T Sbjct: 2238 KREEKRLAKEAAEAKKKQREEKRKEEERKRAEEEKKKAAEKKAEQANVSTGKVNKKAEQR 2297 Query: 332 SSASTATTKAGEATEQASAAARSASAAKTSETNAKASETSAESSKTAAASSASSAASSAS 391 + E A A + AKT + +A + + + ++ + Sbjct: 2298 RIEEERKREQEEKMYAALERAMAMQEAKTRDIDAADLQGMNDDAAAEDDDDIVVVSAKSE 2357 Query: 392 SASASKDEATRQASAAKSSATTASTKATEAAGSATAAAQSKSTAESAATRAETAAKRAED 451 A + +Q + + + ++ + + + Sbjct: 2358 PKKAEAAKPAQQKKTQQKAEEDDDDVVVVPTKAQQNKSEVAAKPAEQKKAEQAKPAEQKK 2417 Query: 452 IASAVALEDASTTKKGIVQLSSATNSTSETLAATPKAVKSAYDNAEKRLQKDQNGAD 508 + A E + + + +A + AE+ +Q A+ Sbjct: 2418 VEQAKPAEQKKVEQAKPAEQKKVEQAKPAEQKKAEQAKPAEQKKAEQAKPAEQKKAE 2474 >UniRef50_C2EAZ6 Allergen V5/Tpx-1 family protein n=1 Tax=Lactobacillus ruminis ATCC 25644 RepID=C2EAZ6_9LACO Length = 1053 Score = 68.8 bits (166), Expect = 1e-09, Method: Composition-based stats. 
Identities = 68/355 (19%), Positives = 120/355 (33%), Gaps = 34/355 (9%) Query: 151 SARAASTSAGQAASSAQSASSSAGTASTKATEASKSAAAAESSKSAAAT--SAGAAKTSE 208 S +A++T+ +A A + A A + A+ ++A++ AA T S AK + Sbjct: 655 SVQASNTALSEAQQDLARAKAKADQAKRELDAAAAKLSSAQTENQAAQTDYSNAVAKQTA 714 Query: 209 TNASASLQSAATSASTATTKASEAATSA------------RDAAASKEAAKSSETNASSS 256 TNA + AS+ AS+A + ++ EAAK+++ A Sbjct: 715 TNAVFKTAQSRLDASSKKLSASKAKNTELKRELDNLTKVQKEQEQKVEAAKNAKAAAELD 774 Query: 257 ASSAASSATAAGNSAKAAKTSETNARSSETAAGQSASAAAGSKTAAASSASAASTSAGQA 316 + + A N+ A+ T A++++ ++ A + AA A + Sbjct: 775 LAKSKQEAETKKNALTKAQAELTAAKTAQKLTAENLKQATANVEAAGQKVEQAVAGLAEI 834 Query: 317 SASATAA-----------------GKSAESAASSASTATTKAGEATEQASAAARSASAAK 359 A A K+A S S A A + EQ +A A Sbjct: 835 KAEIKQAETELQKIAYEIANFDALKKAATSKVEETSRALEAAKQKLEQKTAILEEKQAGL 894 Query: 360 TSETNAKASETSAESSKTAAASSASSAASSASSASASKDEATRQASAAKSSATTASTKAT 419 + A SA + + + A + K + A +A Sbjct: 895 KQAERNLEQAKDNLDFANKVLAEKQQALESAQNKLDALENANKLLEETKKNLENAKQEAA 954 Query: 420 EAAGSATAAAQSKSTAESAATRAETAAKRAEDIASAVALEDASTTKKGIVQLSSA 474 + AA + + + E A K D A A + + K + +L SA Sbjct: 955 NKQQAYEAALKKYNASLQE---LENAKKVLADAKQASAQANRKLSTKQLEELVSA 1006 Score = 60.7 bits (145), Expect = 3e-07, Method: Composition-based stats. 
Identities = 77/383 (20%), Positives = 130/383 (33%), Gaps = 14/383 (3%) Query: 142 ATHAADAADSARAASTSAGQAASSAQSASSS--AGTASTKATEASKSAAA--AESSKSAA 197 T ++ +A T+A + +S A + + KA A SA E+ Sbjct: 17 GTGTVNSIANADTIETTAKNETTIKNDVKTSKQAASETQKANAALNSAQQDFNETQAKND 76 Query: 198 ATSAGAAK-TSETNASASLQSAATSASTATTKASEAATSARDAAASKEAAKSSETNASSS 256 A A K SE N +A+ A + ++ A + A +A S Sbjct: 77 ADKADLNKVESEKNQAAAAVKDAQEKYDHQKRNEASSNQISSAKGDVDNASKELNDAQSK 136 Query: 257 ASSAASSATAAGNSAKAAKTSETNARSSETAAGQSASAAAGSKTAAASSASAASTSAGQA 316 A SA N+ A+ + A+S + A A A +K A A + Sbjct: 137 ADSAKKDQADKENAKDQAQKEQDQAKSDKDKAQADADQAQANKDQAQKENDALNKGQLND 196 Query: 317 SASATAAGKSAESAASSASTATTKAGEATEQASAAARSASAAKTSETNAKASETSAESSK 376 A SA + A T +A + + A + A + A+ + A+ Sbjct: 197 KKDELNAEISARK--DEQTKAETALDKANQAKNDADKQKEDAVSENKTAQGNLNDAQQKY 254 Query: 377 TAAASSASSAASSASSASASKDEATRQASAAKSSATTASTKATEAAGSATAAAQSKSTAE 436 AA +A + +A + D A++ A + +T+ A A+ +S + Sbjct: 255 DAAKKDFDAAGQNQQNAQSKADSASQAYKDALAKLEGLTTENLN-KELVKAKAELESAQK 313 Query: 437 SAATRAETAAKRAEDIASAVALEDASTTKKGIVQLSSATNSTSETLAATPKAVKSAYDNA 496 K + A + K Q + A + E + A A K AYD A Sbjct: 314 DYEKTLSEYKKLVSEKG------QADSKAKEAKQKADAAKAAFEKIQAEYAAKKKAYDAA 367 Query: 497 EKRLQKDQNGADIPDKGCFLNNI 519 + ++ QN D KG N+ Sbjct: 368 VEEKKQAQNKLDSLKKGTLKINV 390 >UniRef50_C2CRT4 Possible phage tail fiber protein n=1 Tax=Corynebacterium striatum ATCC 6940 RepID=C2CRT4_CORST Length = 577 Score = 68.0 bits (164), Expect = 2e-09, Method: Composition-based stats. 
Identities = 65/230 (28%), Positives = 105/230 (45%), Gaps = 6/230 (2%) Query: 64 VEGFPPSHAGTITVYEDSQPGTLNDFLGAMTEDDARPEALRRFELMVEEVARNASAVAQN 123 V G +ITV T+ G E + R E + A A+ + Sbjct: 139 VRGPAGVGLKSITVEGSELVVTVTSEAGTYVMTRIPLEDVVRAEADQAVASVQA-AIKAD 197 Query: 124 TAAAKKSASDASTSAREAATHAADAADSARAASTSAGQAASSAQSASSSAGTASTKATEA 183 A +SA+ A S AA A+DA ++A + T A A+ +++A +A+ T A Sbjct: 198 VDKAVESAAAAKESEDVAAKSASDAQNAA--SETVATAQAAIKDDVAAAAKSATDAQTAA 255 Query: 184 SKSAAAAESSKSAAATSAGAAKTSETNASASLQSAATSASTATTKASEAATSARDAAASK 243 SK+ +A+S+ A +A A+ ++ ++++ + A +A+T A++ A AA Sbjct: 256 SKTVESAQSAIRADVDAAAASASAAQASASTATTQAETATTQAGTATK---QADVVAALA 312 Query: 244 EAAKSSETNASSSASSAASSATAAGNSAKAAKTSETNARSSETAAGQSAS 293 + AK+SETNA +S ++A SAT A N A AKT A T S Sbjct: 313 KNAKTSETNAKTSETNAGKSATTATNEANRAKTEADRAAQKATETANSIE 362 Score = 62.6 bits (150), Expect = 9e-08, Method: Composition-based stats. Identities = 66/212 (31%), Positives = 93/212 (43%), Gaps = 29/212 (13%) Query: 131 ASDASTSAREAATHAADAADSARAASTSAGQAASSAQSASSSAGTASTKATEASKSAAAA 190 + T EA T+ A QA +S Q+A +A +SAAAA Sbjct: 155 SELVVTVTSEAGTYVMTRIPLEDVVRAEADQAVASVQAA------IKADVDKAVESAAAA 208 Query: 191 ESSKSAAATSAGAAKTSETNASASLQSA-----------ATSASTATTKASEAATSARDA 239 + S+ AA SA A+ + + A+ Q+A AT A TA +K E+A SA A Sbjct: 209 KESEDVAAKSASDAQNAASETVATAQAAIKDDVAAAAKSATDAQTAASKTVESAQSAIRA 268 Query: 240 ------------AASKEAAKSSETNASSSASSAASSATAAGNSAKAAKTSETNARSSETA 287 AS A + A++ A +A A AK AKTSETNA++SET Sbjct: 269 DVDAAAASASAAQASASTATTQAETATTQAGTATKQADVVAALAKNAKTSETNAKTSETN 328 Query: 288 AGQSASAAAGSKTAAASSASAASTSAGQASAS 319 AG+SA+ A A + A A+ A + + S Sbjct: 329 AGKSATTATNEANRAKTEADRAAQKATETANS 360 >UniRef50_B0X1W5 Putative uncharacterized protein n=1 Tax=Culex quinquefasciatus RepID=B0X1W5_CULQU Length = 2930 Score = 67.2 bits (162), Expect = 3e-09, Method: Composition-based stats. 
Identities = 56/397 (14%), Positives = 107/397 (26%), Gaps = 9/397 (2%) Query: 78 YEDSQPGTLNDFLGAMTE------DDARPEALRRFELMVEEVARNASAVAQNTAAAKKSA 131 E+ +P D + TE ++ +P E+ A +A + K + Sbjct: 2244 DEEEKPAVEADEVQKPTESAVEADEEQKPTEAAVEADEEEKPAVDAVEPVEADEEQKPTE 2303 Query: 132 SDASTSAREAATHAADAADSARAASTSAGQAASSAQSASSSAGTASTKATEASKSAAAAE 191 S + T AADA A SA A A Sbjct: 2304 SAVEADEEQKPTPAADAEKPVEADEEQKP--TESAVEADEEEKPAVDAVEPVEADEEQKP 2361 Query: 192 SSKSAAATSAGAAKTSETNASASLQSAATSASTATTKASEAATSARDAAASKEAAKSSET 251 + + A + + + A +A E A DA EA + + Sbjct: 2362 TESAVEADEEQKPTPAADAEKPAESEEEQKPTEAAVEADEEEKPAVDAVEPVEADEEQKP 2421 Query: 252 NASSSASSAASSATAAGNSAKAAKTSETNARSSETAAGQSASAAAGSKTAAASSASAAST 311 S+ + A+ S+ + E Q + +A + + A+ Sbjct: 2422 TESAVEADEEEQKPTPVADAQEPVESDEEVKPVEADEVQKPTESAV-EADEQQKPTEAAV 2480 Query: 312 SAGQASASATAAGKSAESAASSASTATTKAGEATEQASAAARSASAAKTSETNAKASETS 371 A + A + + A+ + + + Sbjct: 2481 EADEEEKPAEDVPTRVGEEEEEEQKPVAADEKPETTDAPASDEKTEDEATTVAPAVVADE 2540 Query: 372 AESSKTAAASSASSAASSASSASASKDEATRQASAAKSSATTASTKATEAAGSATAAAQS 431 + + + + +S E + +A ++ A+ T+A A Q Sbjct: 2541 VKETDDTGKPAVETVPTSQDDVEDESGEHDEKPAAEATTVAAAADATTQAPVVADEKEQE 2600 Query: 432 KSTAESAATRAETAAKRAEDIASAVALEDASTTKKGI 468 + E A A +A E A+TT + Sbjct: 2601 PAVEEEVPATTAKAEVTAAPTEAAKDEEAATTTTAPV 2637 >UniRef50_B4L608 GI16285 n=1 Tax=Drosophila mojavensis RepID=B4L608_DROMO Length = 1174 Score = 67.2 bits (162), Expect = 4e-09, Method: Composition-based stats. 
Identities = 59/396 (14%), Positives = 156/396 (39%), Gaps = 4/396 (1%) Query: 133 DASTSAREAATHAADAADSARAASTSAGQAASSAQSASSSAGTASTKATEASKSAAAAES 192 + + A D++ ++T AA ++ + +++T+ S A+++S Sbjct: 729 QSLSRAAAVDQEPVDSSPQETDSTTYQTYAAEDKNASDPYSQETPSESTDTSTPDASSDS 788 Query: 193 SKSAAATSAGAAKTSETNASASLQSAATSASTATTKASEAATSARDAAASKEAAKSSETN 252 + S A + S T ++T T A+E ++ + + + + Sbjct: 789 TSEPQVQSLSRAAAVDQEPVDSSPQ-ETDSTTYQTYAAEDKNASDPYSQETPSESTDTST 847 Query: 253 ASSSASSAASSATAAGNSAKAAKTSETNARSSETAAGQSASAAAGSKTAAASSASAASTS 312 +S+ S + + + A A ++ ET + + AA K A+ + ++ Sbjct: 848 PDASSDSTSEPQVLSLSRAAAVDQEPVDSSPQETDSTTYQTYAAEDKNASDPYSQETTSE 907 Query: 313 AGQASASATAAGKSAESAASSASTATTKAGEATEQASAAARSASAAKTSETNAKASETSA 372 A ++S S + + +++S + + ++A ++ ++ + + T +T A + ++ Sbjct: 908 ASESSTSDSLSDTTSDSTSEPQVQSLSRAAAIDQEPVDSSPQETDSTTYQTYAAEDKNAS 967 Query: 373 ESSKTAAASSASSAA---SSASSASASKDEATRQASAAKSSATTASTKATEAAGSATAAA 429 + S ++ + +S+ S S + ++ +A+A +S + T++ T AA Sbjct: 968 DPYSQETPSDSTDTSTPDASSDSTSEPQVQSLSRAAAVDQEPVDSSPQETDSTTYQTYAA 1027 Query: 430 QSKSTAESAATRAETAAKRAEDIASAVALEDASTTKKGIVQLSSATNSTSETLAATPKAV 489 + K+ ++ + + A + S+ ST++ + LS A E + +P+ Sbjct: 1028 EDKNASDPYSQETTSEASESSTSDSSSDTTSDSTSEPQVQSLSKAAAVDQEPVDTSPQGS 1087 Query: 490 KSAYDNAEKRLQKDQNGADIPDKGCFLNNINAVSKT 525 + A D ++ + N++ + Sbjct: 1088 EDANDATLDSTTASEDLTIANVDDKQVRNVDEATAN 1123 >UniRef50_Q0V3K3 Putative uncharacterized protein n=1 Tax=Phaeosphaeria nodorum RepID=Q0V3K3_PHANO Length = 2441 Score = 66.5 bits (160), Expect = 6e-09, Method: Composition-based stats. 
Identities = 62/421 (14%), Positives = 131/421 (31%), Gaps = 9/421 (2%) Query: 93 MTEDDARPEALRRFELMVEEVARNASAVAQNTAAAKKSASDASTSAREAATHAADAADSA 152 + ED++ PE + A VA A+ + A AT + AD Sbjct: 451 LMEDNSEPEDKADVADAADAEATPVEDVAPAEEASPTDETAAEAQEDADATTEGEKADGD 510 Query: 153 RAASTSAGQAASSAQSASSSAGTASTKATEASKSAAAAESSKSAAATSAGAAKTSETNAS 212 + +A+ A+ A + ++K AA + S + Sbjct: 511 DTSKEEPSPSAAEELPAAEPAVAEEAEPKNDEVKEETPTNAKEPAADISSEPVPSPEDVI 570 Query: 213 ASLQSAATSASTATTKASEAATSARDAAASKEAAKSSETNASSSASSAASSATAAGNSAK 272 AA A+ + S+A A+ ++ + E + + ++ TA+ + A+ Sbjct: 571 PPETDAAAEATAEASSDSDAVDVAKAEDNKEDEQTADEPVTGAEEARPEATETASVDIAE 630 Query: 273 AAKTSETNARSSETAAGQSASAAAGSKTAAASSASAASTSAGQASASATAAGKSAESAAS 332 A+ + + + + A + S + + Sbjct: 631 DAEAPAVEVVPESESVPIAEAPSKPEDEAVSEDQEEEQPKEALTEESVIEVAATEPACDE 690 Query: 333 SASTATTKAGEATEQASAAARSASAAKTSETNAKASETSAESSKTAAASSASSAASSASS 392 +++ + E T + SA + + E + + + ++ +A ++ Sbjct: 691 ASTEESGIPTEPTSEEPPTEESAVESPSDEVSIEGPVVNEPPAEESAKEEPV--TEEPTT 748 Query: 393 ASASKDEATRQASAAKSSATTASTKATEAAGSATAAAQSKSTAESA-----ATRAETAAK 447 AS DE ++ S AT S A + + + + K E A E AA+ Sbjct: 749 DEASNDEPPKEESLEVELATDES--AVDPSAQEMVSGEPKVQGEEAQDVASEATPEPAAE 806 Query: 448 RAEDIASAVALEDASTTKKGIVQLSSATNSTSETLAATPKAVKSAYDNAEKRLQKDQNGA 507 + E + + + Q S+ + + A A +E + A Sbjct: 807 SQSATSLEEDSEQSDVADQPVEQDSADAPAATSLAAPDKSANLEENIGSEDASAETTPSA 866 Query: 508 D 508 D Sbjct: 867 D 867 >UniRef50_A4N1T0 Immunoglobin A1 protease n=1 Tax=Haemophilus influenzae R3021 RepID=A4N1T0_HAEIN Length = 1550 Score = 66.1 bits (159), Expect = 9e-09, Method: Composition-based stats. 
Identities = 83/576 (14%), Positives = 185/576 (32%), Gaps = 55/576 (9%) Query: 11 DGTGKPVQNCTIQLKAKRNSTTVVVNTLASENPDEAG--RYSMDVEYGQYSVILLVEGFP 68 D TG+P +N + L N+T +N N + G +Y + G+Y + + Sbjct: 778 DKTGEPTKN-ELTLFDASNATRNNLNVSLVGNTVDLGAWKYKLRNVNGRYDL------YN 830 Query: 69 PSHAGTITVYEDSQPGTLNDFLGAMTEDDARPEALRRFE----------------LMVEE 112 P + + T N+ + ++ E + R E E Sbjct: 831 PEVEKRNQTVDTTNITTPNNIQADVPSVPSQNEEIARVEAPVPPPAPATPSETTKTEAEN 890 Query: 113 VARNASAVAQNTAAAKKSASDASTSAREAATHAADAADSARAASTSAGQAASSAQSASSS 172 + + V +N A ++ + A EA ++ + + A + + + + Sbjct: 891 SPQKSETVEKNEQDATETTAQNREVAEEAKSNVEANTQTNKVAQSGSETEETQTTETKET 950 Query: 173 AGTASTKATEASK--SAAAAESSKSAAATSAGAAKTSETNASASLQS------------- 217 A + EA + S + + ++ A + K ET Q Sbjct: 951 AKVEEDEIQEAPQMTSETSPKQAEPAPEEVSTDTKVEETQVQPQTQPTTVTAEDTTTPNG 1010 Query: 218 --AATSASTATTKASEAATSARDAAASKEAAKSSETNASSSASSAASSATAAGNSAKAAK 275 A + + T A + +++ A + + + +A + K Sbjct: 1011 KPAEETQPSEKTNAESVTSVSQNQAEKTVSQSTKDKIVVEKEETAKVEKEKTQEAPKVTS 1070 Query: 276 TSETNARSSETAAGQSASAAAGSKTAAASSASAASTSAGQASASATAAGKS-AESAASSA 334 SET Q+A + T + A Q SAT + + A+ +S+ Sbjct: 1071 QVSPKQEQSETVQPQTALESENVPTVKNAEEVQAQ---LQTQPSATVSTEQPAKETSSNV 1127 Query: 335 STATTKAGEATEQASAAARSASAAKTSETNAKASETSAESSKTAAASSASSAASSASSAS 394 T++ + S N++ S + ++ + +SA + ++++ Sbjct: 1128 EQPVTESTTVNTRNSVVENP-QNTTQPAVNSENSTPKSRRKRSVSQPQETSAEETTATST 1186 Query: 395 ASKDEATRQASAAKSSATTAS-TKATEAAGSATAAAQSKSTAESAATRAETAAKRAEDIA 453 A ++ S + T A E ++T ++S+ + + Sbjct: 1187 NETTVADNSRRRSRRSVSQPQETSAEETTVTSTEKTTVADNSKSSKPNRRSRRSVRSEPT 1246 Query: 454 SAVALEDASTTKKGIVQLSSATNSTSETLAATPKAVKSAYDNAEKRLQKDQNGADIPDKG 513 A+ + + V LS+ T ST+ + K+ + Q+ + + Sbjct: 1247 ------VANGSDRSAVALSNLT-STNTNAVISDARAKAQFVALNVGKAVSQHISQLEMNN 1299 Query: 514 CFLNNINAVSKTDFADKRGMRYVRVNAPAGATSGKY 549 NI + + + +Y R ++ + T + Sbjct: 1300 EGQYNIWVSNTSMNENYSSSQYRRFSSKSTQTQLGW 1335 >UniRef50_Q75A72 ADR046Cp n=1 Tax=Eremothecium gossypii RepID=Q75A72_ASHGO Length = 812 Score = 65.7 bits (158), Expect = 1e-08, 
Method: Composition-based stats. Identities = 75/351 (21%), Positives = 137/351 (39%), Gaps = 21/351 (5%) Query: 120 VAQNTAAAKKSASDASTSAREAATHAADAADSARAASTSAGQAASSAQSASSSAGTASTK 179 ++ +AK S A+ SA+E+A A+ A + AA+ S + G+A Sbjct: 118 AEHSSESAKDDKSAAALSAKESAGIASPAPSGDDNKTAKPSNAATGDASTAPYWGSAVPG 177 Query: 180 ATEASKSAAAAESSKSAAA-------TSAGAAKTSETNASASLQSAATSASTATTKASEA 232 A+ +A + +AA SA T +S + + SA T + Sbjct: 178 YESAAPEYESAAPAYESAAPEGPSPLPSASELTTCGAKSSTVVDAGGDSAKTEVRGPTVG 237 Query: 233 ATSARDAAASKEAAKSSETNASSSASSAASSATAAGNSAKAAKTSETNARSSETAAGQSA 292 + A +A ++ AA ++ T A+ SA++ T AG++ + + T A ++ + S Sbjct: 238 GSDASEAKSTCGAATATTTTAAGSATAGLDLKTEAGSACEPSATVSAIAFGNQQSPAGSV 297 Query: 293 SAAAGSKTAAASSASA---ASTSAGQASASATAAGKSAESAASSASTATTKAGEATEQAS 349 + ++T A+SA ++ A AS S SA+ S + +KA AS Sbjct: 298 LTSTANETEGANSAHTDAVSTIKADDASGSIETKLHSADVPVQSPALDGSKADSIPGTAS 357 Query: 350 A---------AARSASAAKTSETNAKASETSAESSKTAAASSASSAASSASSASASKDEA 400 +A +A +E++ S + + + + + S ASS + A Sbjct: 358 PGNVSPVKIGGNTAAGSAVATESDVVVSTDAGSGASSPGKTELEAGVSGASSPLRTSPLA 417 Query: 401 TRQASAAK--SSATTASTKATEAAGSATAAAQSKSTAESAATRAETAAKRA 449 + A + A T +A TA++ TA + T + K A Sbjct: 418 RSATNGALPINEAVLPPTGGKDAQHCGTASSSKVGTAPISTTSEAPSGKAA 468 >UniRef50_A5IUV0 LPXTG-motif cell wall anchor domain n=71 Tax=Staphylococcus aureus RepID=A5IUV0_STAA9 Length = 2481 Score = 65.7 bits (158), Expect = 1e-08, Method: Composition-based stats. 
Identities = 85/457 (18%), Positives = 151/457 (33%), Gaps = 26/457 (5%) Query: 110 VEEVARNASAVAQNTAAAKKSASDASTSAREAATHAADAADSARAASTSAGQAASSAQSA 169 V N A AA SDA + A+ A ++ ++T QAA Sbjct: 954 VTTAKDNGIAAINQVQAATTKKSDAKAEIAQKASERKTAIEAMNDSTTEEQQAAKDKVDQ 1013 Query: 170 SSSAGTASTKATEASKSAAAAESSKSAAATSAG-------AAKTSETNASASLQSAATSA 222 + A A+ A+++ A + AAK + + + ++A + Sbjct: 1014 AVVTANADIDNATANTDVDNAKTTNEATIAAITPDANVKPAAKQAIADKVQAQETAIDAN 1073 Query: 223 STATTKASEAATSARDAAASKEAAK----------SSETNASSSASSAASSATAAGNSAK 272 + +TT+ EAA + A + NA + A AT ++AK Sbjct: 1074 NGSTTEEKEAAKQQVQTEKTAADAAIDAAHSNVEVEAAKNAEIAKIEAIQPATTTKDNAK 1133 Query: 273 AAKTSETNARSSETAAGQSASAAA----GSKTAAASSASAASTSAGQASASATAAGKSAE 328 A ++ N R + A Q +A + A + + ++ A + A + E Sbjct: 1134 QAIATKANERKTAIAQTQDITAEEIAAANADVDNAVTQANSNIEAANSQNDVDQAKTTGE 1193 Query: 329 SAASSASTATTKAGEATEQASAAARSASAAKTSETNAKASETSAESSKTAAASSASSAAS 388 ++ + K A + +A + + +A E A ++ + ++ A Sbjct: 1194 TSIDQVTPTVNKKATARNEITAILNNKLQEIQATPDATDEEKQAADAEANTENGKANQAI 1253 Query: 389 SASSASASKDEATRQASAAKSSATT---ASTKATEAAGSATAAAQSKSTAESAATRAETA 445 SA++ +A DEA A AA ++ T A + A + + AT E Sbjct: 1254 SAATTNAQVDEAKANAEAAINAVTPKVVKKQAAKDEIDQLQATQTNVINNDQNATNEEKE 1313 Query: 446 AKRAEDIASAVALED--ASTTKKGIVQLSSATNSTSETLAATPKAVKSAYDNAEKRLQKD 503 A + + ++ + T V + S AVKS N + Sbjct: 1314 AAIQQLATAVTDAKNNITAATDDNGVDTAKDAGKNSIQSTQPATAVKSNAKNEVDQAVTT 1373 Query: 504 QNGADIPDKGCFLNNINAVSKTDFADKRGMRYVRVNA 540 QN A G NA K +NA Sbjct: 1374 QNQAIDNTTGATTEEKNAAKDLVLKAKEKAYQDILNA 1410 >UniRef50_B2W3G7 Predicted protein n=1 Tax=Pyrenophora tritici-repentis Pt-1C-BFP RepID=B2W3G7_PYRTR Length = 1367 Score = 65.3 bits (157), Expect = 1e-08, Method: Composition-based stats. 
Identities = 85/411 (20%), Positives = 162/411 (39%), Gaps = 17/411 (4%) Query: 114 ARNASAVAQNTAAAKKSASDASTSAREAATHAADAADSARAASTSAGQAASSAQSASSSA 173 + + A+ + S + + +EA A + G+ ++++++ Sbjct: 616 STVSKLEAEIESTKGSSDEQTTAAKKEAEELRETVAKLEAELEAAKGEVTKVSEASNAKV 675 Query: 174 GTASTKATEASKSAAAAESSKSAAATS-AGAAKTSETNASASLQSAATSASTATTKASEA 232 EA S AA ES +AA + A++ S T + S + + TK SE Sbjct: 676 TELEGSLKEAQDSLAAKESELAAAKEEVSKASEASATKVAELEGSLKEAQESLATKESEL 735 Query: 233 ATSARDAAASKEAAKSSETNASSSASSAASSATAAGNSAKAAKTSETNARSSETAAGQSA 292 A + +A + E++K + + +S A +S + ++ A+ T +S+ Sbjct: 736 AAAKEEATKAVESSKGDAEGLQKTIADLEASLKEAKDSLSSKESELEAAKGDVTKVTESS 795 Query: 293 SA-----------AAGSKTAAASSASAASTSAGQASASATAAGKSAESAASSASTATTKA 341 S+ A S TA S AA + A +A S+ + S S + +A Sbjct: 796 SSKIAELEASLKEAQESITAHKSELEAAKSEAKKAVESSKGDAEGLRSTISELEASLKEA 855 Query: 342 GE---ATEQASAAARSASAAKTSETNAKASETSAESSKTAAASSASSAASSASSASASKD 398 + A E AA++ + T + +K +E A + +A + A SA A K Sbjct: 856 KDGLAAKESELEAAKADVSQATESSGSKIAELEASLKAAQDSLAAKESELGAKSAEAGKV 915 Query: 399 EATRQA-SAAKSSATTASTKATEAAGSATAAAQSKSTAESAATRAETAAKRAEDIASAVA 457 QA + AKS A+T +AA ++ ++ Q + T ETA + ++ A A A Sbjct: 916 TELEQALATAKSDLEAATTAKEDAAKASESSTQEAEGLRNKITELETAV-KEKEAALAEA 974 Query: 458 LEDASTTKKGIVQLSSATNSTSETLAATPKAVKSAYDNAEKRLQKDQNGAD 508 + K G + ++ +L T +++ + LQ Q+ ++ Sbjct: 975 TQATEAAKGGANESAAKIAELEASLKETTSKLEAKETEHSESLQAAQSSSN 1025 Score = 65.3 bits (157), Expect = 1e-08, Method: Composition-based stats. 
Identities = 89/467 (19%), Positives = 173/467 (37%), Gaps = 35/467 (7%) Query: 73 GTITVYEDSQPGTLNDFLGAMTEDDARPEALR-RFELMVEEVARNASAVAQNTAAAKKSA 131 G +T +S + + ++ E A + E E + + + + + Sbjct: 786 GDVTKVTESSSSKIAELEASLKEAQESITAHKSELEAAKSEAKKAVESSKGDAEGLRSTI 845 Query: 132 SDASTSAREAATHAADAADSARAASTSAGQAASSAQ------SASSSAGTASTKATEASK 185 S+ S +EA A AA QA S+ AS A S A E+ Sbjct: 846 SELEASLKEAKDGLAAKESELEAAKADVSQATESSGSKIAELEASLKAAQDSLAAKESEL 905 Query: 186 SAAAAESSK----SAAATSAGAAKTSETNASASLQSAATSASTATTKASEAATSARDAAA 241 A +AE+ K A +A + + T A A+ S++ T A Sbjct: 906 GAKSAEAGKVTELEQALATAKSDLEAATTAKEDAAKASESSTQEAEGLRNKITELETAVK 965 Query: 242 SKEAAKSS-------------ETNASSSASSAASSATAAGNSAKAAKTSETNARSSETAA 288 KEAA + E+ A + A+ T + AK + SE+ + ++ Sbjct: 966 EKEAALAEATQATEAAKGGANESAAKIAELEASLKETTSKLEAKETEHSESLQAAQSSSN 1025 Query: 289 GQSASAAAGSKTAAASSASAASTSAGQASASA--TAAGKSAESAASSASTATTKAGEATE 346 + A+ A A ++ A A A+ + + A ++A+S+ + + K A E Sbjct: 1026 DKIAALEKELADAKAEASKVAELEAKLAAKESEHSEALQTAQSSGNEKVASLEKDLAAAE 1085 Query: 347 QASAAARSASAAKTSE---------TNAKASETSAESSKTAAASSASSAASSASSASASK 397 Q+ + A A E + + ++ A S+ S+A + + Sbjct: 1086 QSLEETKKAKEAVDEELKATKEAEAEAKETAAKASTLESELAELKISTEKSTAEKSELEE 1145 Query: 398 DEATRQASAAKSSATTASTKATEAAGSATAAAQSKSTAESAATRAETAAKRAEDIASAVA 457 + + A+ + KA E+ +A A ++T S A+ E A ++ + + + Sbjct: 1146 KLKASETKVTELEASVEAAKAKESELTALQAKLDEATQASEASIKELEAAKSGETEAKTS 1205 Query: 458 LEDASTTKKGIVQLSSATNSTSETLAATPKAVKSAYDNAEKRLQKDQ 504 LE T K + ++A S +E + K+ ++A+++L+K + Sbjct: 1206 LETLQATLKEQEEKTTALQSEAEAAKKAQEEAKAELESAKEQLEKAK 1252 Score = 57.2 bits (136), Expect = 4e-06, Method: Composition-based stats. 
Identities = 54/315 (17%), Positives = 98/315 (31%), Gaps = 12/315 (3%) Query: 101 EALRRFELMVEEVARNASAVAQNTAAAKKSASDASTSAREAATHAADAADSARAASTSAG 160 + + E + + AS VA+ A S+ S + + A + + S +A Sbjct: 1026 DKIAALEKELADAKAEASKVAELEAKLAAKESEHSEALQTAQSSGNEKVASLEKDLAAAE 1085 Query: 161 QAASSAQSASSSAGTASTKATEASKSAAAAESSKSAAATSAGAAKTSETNASASLQSAAT 220 Q+ + A + EA A + S + K S ++A Sbjct: 1086 QSLEETKKAKEAVDEELKATKEAEAEAKETAAKASTLESELAELKISTEKSTAEKSELEE 1145 Query: 221 SASTATTKASEAATSARDAAASK------EAAKSSETNASSSASSAASSATAAGNSAKAA 274 + TK +E S A A + +A T AS + + AA + A Sbjct: 1146 KLKASETKVTELEASVEAAKAKESELTALQAKLDEATQASEA---SIKELEAAKSGETEA 1202 Query: 275 KTSETNARSSETAAGQSASAAAGSKTAAASSASAASTSAGQASASATAAGKSAESAASSA 334 KTS +++ + +A AA + A A A E + Sbjct: 1203 KTSLETLQATLKEQEEKTTALQSEAEAAKKAQEEAKAELESAKEQLEKAKADQEELKKTN 1262 Query: 335 STATTKAGEATEQAS---AAARSASAAKTSETNAKASETSAESSKTAAASSASSAASSAS 391 + KA + E + A ++A +ET + A + +A + Sbjct: 1263 AELLEKASKIVEVPATEVPAEKAAEVPVEKAIEEPVAETKEPEAPVDAPAVDGAAETETK 1322 Query: 392 SASASKDEATRQASA 406 A + E + Sbjct: 1323 DAEEPESEPQTPTTP 1337 >UniRef50_B3NH03 GG13891 n=1 Tax=Drosophila erecta RepID=B3NH03_DROER Length = 1010 Score = 65.3 bits (157), Expect = 1e-08, Method: Composition-based stats. 
Identities = 53/418 (12%), Positives = 156/418 (37%), Gaps = 4/418 (0%) Query: 94 TEDDARPEALRRFELMVEEVARNASAVAQNTAAAKKSASDASTSAREAATHAA--DAADS 151 TE E+ ++ ++T + ++ + ++++ + ++ Sbjct: 328 TESPLPTESSTEVTDQSSSTESLPNSTQESTTESPLPTESSTEVSDQSSSTESLPNSTQE 387 Query: 152 ARAASTSAGQAASSAQSASSSAGTASTKATEASKSAAAAESSKSAAATSAGAAKTSETNA 211 + S ++A+ SSS + E++ + S + + + ++ + Sbjct: 388 STTESPLPTESATEVTDQSSSTESIPDSTQESTSESPLPTESSTEVTDQSSSTESLPDST 447 Query: 212 SASLQSAATSASTATTKASEAATSARDAAASKEAAKSSETNASSSASSAASSATAAGNSA 271 S + ++ T+ ++ ++S S + + + + S++ +++ + Sbjct: 448 QESTTESPLPTESS-TEVTDQSSSTESLPDSTQESTTESPLPTESSTEVTDQSSSTESIP 506 Query: 272 KAAKTSETNARSSETAAGQSASAAAGSKTAAASSASAASTSAGQASASATAAGKSAESAA 331 + T T + + + S + S ++T + + S+T + S Sbjct: 507 DSTTEESTTESPLPTESSTEVTDQSSSTESLPDSTQESTTESPLPTESSTEVTDQSSSTE 566 Query: 332 SSASTATTKAGEATEQASAAARSASAAKTSETNAKASETSAESSKTAAASSASSAASSAS 391 S + T ++G +TE + S A S + + E+S + ++ S++ S Sbjct: 567 SIPDSTTQESGLSTESSLTTESSTGATNESSSTEASEESSVSTEGPSSTESSTGVPEEPS 626 Query: 392 SASASKDEATRQASAAKSSATTASTKATEAAGSATAAAQSKSTAESAATRAETAAKRAED 451 S + +T++ + S + +ST + ++ + ++S +T S + + ++ ++ Sbjct: 627 PTEPSPNTSTQELPSFTSPSFESSTVESTSSSENPSTSESSTTENSENSSSTQSSPQSST 686 Query: 452 IASAVALEDASTTKKGIVQLSSATNSTSETLAATPKAVKSAYDNAEKRLQKDQ-NGAD 508 S + E+ ST++ + +S + T + ++ + NG D Sbjct: 687 EESITSSENPSTSESSPSTPGNGGDSGISGSSTTESPYTTDIPSSSEASPSTPGNGGD 744 >UniRef50_B3I282 Side tail fiber protein n=1 Tax=Escherichia coli E22 RepID=B3I282_ECOLX Length = 945 Score = 64.9 bits (156), Expect = 2e-08, Method: Composition-based stats. 
Identities = 44/175 (25%), Positives = 66/175 (37%) Query: 103 LRRFELMVEEVARNASAVAQNTAAAKKSASDASTSAREAATHAADAADSARAASTSAGQA 162 ++RFE MV + ++A A A++ A + +DA + T A + +A A + Q Sbjct: 2 VKRFEEMVAQAQQSAEAAAESEQQAGQHVADAQQIKSDCETLADNVQQNAEAVTEDKKQV 61 Query: 163 ASSAQSASSSAGTASTKATEASKSAAAAESSKSAAATSAGAAKTSETNASASLQSAATSA 222 A A SA+ A A +A K A + AAT G AK S A+ S QSA Sbjct: 62 AQLASSATQDAARAEQAVKDADKIVQKAVDKLADAATLTGEAKASAEAAAKSEQSAKQHR 121 Query: 223 STATTKASEAATSARDAAASKEAAKSSETNASSSASSAASSATAAGNSAKAAKTS 277 A + + S+T+ S +A A K Sbjct: 122 DEAQRIVDDLKGTNASTTQKGLVQLCSDTDNDSEELAATPKAVKTVMDETKTKAP 176 >UniRef50_Q2HAR4 Putative uncharacterized protein n=1 Tax=Chaetomium globosum RepID=Q2HAR4_CHAGB Length = 2795 Score = 64.9 bits (156), Expect = 2e-08, Method: Composition-based stats. Identities = 71/426 (16%), Positives = 144/426 (33%), Gaps = 10/426 (2%) Query: 93 MTEDDARPE-----ALRRFELMVEEVARNASAVAQNTAAAKKSASDASTSAREAATHAAD 147 +TED P+ A + E + A ++ A A + S + +A + Sbjct: 331 LTEDTGAPDAESAGAQKPLEADEPTPEPKSEAEPESEAEALQENSQKNPAAEPEVAKTEE 390 Query: 148 AADSARAASTSAGQAASSAQSASSSAGTASTKATEASKSAAAAESSKSAAATSAGAAKTS 207 A + A + A + + + + A + A +K Sbjct: 391 ATGGDEKEQEARSTAPEPTVEETLVAEVETVDSARSEQPAGTESEPVAEAGNEPAESKRE 450 Query: 208 ETNASASLQSAATSASTATTKASEAATSARDAAASKEAAKSSETNASSSASSAASSATAA 267 ++ + + AA T + A A + + +ET + + +S ++ T + Sbjct: 451 PSSEADEDELAADVKETLAPELVAEAQEEPTPEAVESLSPDAETAVNQAPASESTQDTDS 510 Query: 268 GNSAKAAKTSETNARSSETAAGQSASAAAGSKTAAASSASAASTSAGQASASATAAGKSA 327 +++++ E + A S A + + ++A A A A ++ Sbjct: 511 KDNSESVTEIEEKLAAEPEFAEDSKDEPAAAIVTSDNAAEDAPAPEEAAEADEKPTEEAT 570 Query: 328 ESAASSASTATTKAGEATEQASAAARSASAAKTSETNAKASETSAESSKTAAASSASSAA 387 E+ + + AGE A + S+ ET +AS + A + +++ Sbjct: 571 EATSVEETVKEEPAGENRPVFGAISAEKSSEPEPETKDEASAETETVEPAIEAKTETASD 630 Query: 388 SSASSASASKDEATRQASAAKSSATT----ASTKATEAAGSATAAAQSKSTAESAATRAE 443 S +D T + K+ TT + E + A A ++ +E A + Sbjct: 631 PDTSVDITVEDIPTEKVDELKAGDTTKVLAPESAVVEVSAVADPAPAGEAKSEPEAPVSA 690 Query: 444 
TAAKRAEDIASAV-ALEDASTTKKGIVQLSSATNSTSETLAATPKAVKSAYDNAEKRLQK 502 + E IA D +K ++ + ++A P K+ D AEK + Sbjct: 691 ARDTKVEGIAETEKPEADEELAEKPGDRVEEEGDQEIRSVAEAPTESKAESDTAEKPAET 750 Query: 503 DQNGAD 508 D AD Sbjct: 751 DSAQAD 756 Score = 47.6 bits (111), Expect = 0.003, Method: Composition-based stats. Identities = 57/382 (14%), Positives = 97/382 (25%), Gaps = 13/382 (3%) Query: 134 ASTSAREAATHAADAADSARAASTSAGQAASSAQSASSSAGTASTKATEASKSAAAAESS 193 A +A E + D R T Q S S + + + A A E Sbjct: 1310 AGQAAAECISQPTDTDSDLRLPDTHDTQDKSLMTSTVNDDTPDTPEQLNAGNMAGENEHG 1369 Query: 194 KSAAATSAGAAKTSETNASASLQSAATSASTATTKASEAATSARDAAASKEAAKSSETNA 253 + A S + A + A+ A + A E ++ +A Sbjct: 1370 TETKMLCSDAGGGQAQTPEGSGLEGTAPSEQAVSAATVTAEPPLNHVAKTEGQVTTARDA 1429 Query: 254 SSSASSAASSA--------TAAGNSAKAAKTSETNARSSETAAGQSASAAAGSKTAAASS 305 A TA+ + K E S + + Sbjct: 1430 DEVDLPVQDQAQPVLGITVTASHQETEQVKPDEETRFSEAGITEVTPVSTIEDAAPILGV 1489 Query: 306 ASAASTSAGQASASAT--AAGKSAESAASSASTATTKAGEAT-EQASAAARSASAAKTSE 362 +AA G + T S+ +A T A E EQA ++ A SE Sbjct: 1490 GTAAPEGVGAEQQAQTIIPTRSSSRAAEVKPQPVLTLAEEPEVEQAREVPQAKQPAAASE 1549 Query: 363 TNAKASETSAESSKTAAASSASSAASSASSASASKDEATRQASAAKSSATTASTKATEAA 422 T A S+ A+ + S E + + + A A + Sbjct: 1550 EIIALETTDDVLDSDKAEGHESAGATRVNQVEDSAQE--QPLPLHEETREDAQVSAETSV 1607 Query: 423 GSATAAAQSKSTAESAATRAETAAKRAEDIASAVALEDASTTKKGIVQLSSATNSTSETL 482 A ++ S E AA + ++ + V + Sbjct: 1608 VEQKIAPEAASIDELAAATEPEPTFEDKATVLEPEVDQREISMARPVLTALPELDDELAG 1667 Query: 483 AATPKAVKSAYDNAEKRLQKDQ 504 + + E + Q Sbjct: 1668 PVPDIPAAESSETKEAEAESTQ 1689 >UniRef50_B4N0R4 GK24431 n=1 Tax=Drosophila willistoni RepID=B4N0R4_DROWI Length = 872 Score = 64.5 bits (155), Expect = 2e-08, Method: Composition-based stats. 
Identities = 65/450 (14%), Positives = 136/450 (30%), Gaps = 19/450 (4%) Query: 87 NDFLGAMTEDDARPEALRRFELM-----VEEVARNASAVAQNTAAAKKSASDASTSAREA 141 + LG +DD PE + F+ + ++ + + Q T ++ ++ Sbjct: 128 DQTLGGPNDDDEVPELVGDFDEVAKVEAEQKDVKKLNEEKQTTGKQEQKEKKQDNKKKQE 187 Query: 142 ATHAADAADSARAASTSAGQAASSAQSASSSAGTASTKATEASKSAAA---------AES 192 + + + A+ G+AA + + A ++ + + Sbjct: 188 NQNKEKSNKNTIDATKPNGEAAKNKNQSKPEAKNKDQGKSQTAPTNQNIGIESQTKPQPE 247 Query: 193 SKSAAATSAGAAKTSETNASASLQSAATSASTATTKASEAATSARDAAASKEAAKSSETN 252 + + +K ++T A EA S +A+ + ++K Sbjct: 248 ASKNKKDNKEKSKAAQTQGKPQSNVAKQPPKVDPKIGPEAVQSQAKSASQEASSKDLIQP 307 Query: 253 ASSSASSAASSATAAGNSAKAAKTSETNARSSETAAGQSASAAAGSKTAAASSASAASTS 312 S + + + K A+ + A + +++AA AA Sbjct: 308 TPEVTKSQKGNESTPPPAKGQEKPPAEPAKPQPQPSLVDQQAKSSTESAANKGNQAAPAV 367 Query: 313 AGQASASATAAGKSAE---SAASSASTATTKAGEATEQASAAARSASAAKTSETNAKASE 369 + + T E S +T + E A S AA+T T A Sbjct: 368 IKDTAKNVTERAGIVEPPKQVQSQPATIKEEPKITQETAKVQVESQQAAQTIPTTETAKI 427 Query: 370 TSAESSKTAAASSASSAASSASSASASKDEATRQASAAKSSAT-TASTKATEAAGSATAA 428 S AA + S A+ E R A K+ T + +T + A Sbjct: 428 EVKSKSPPAAVKENTKNEGKGKSPPANIKEPIRPADIQKTPVTGKEAPASTVPSQPAVDD 487 Query: 429 AQSKSTAESAATRAETAAKRAEDIASAVALED-ASTTKKGIVQLSSATNSTSETLAATPK 487 Q+K T + A + +T+ A A + A + + + + A Sbjct: 488 KQTKITPDLAKAQVDTSPPTAAVKQQATPSPEAAKNIQMPPTTEAKPSAEPVKIQAPQQS 547 Query: 488 AVKSAYDNAEKRLQKDQNGADIPDKGCFLN 517 + + + + + DI + + Sbjct: 548 QPTATVEQVKPNVAPTKTETDIKPQPTVVK 577 Score = 59.5 bits (142), Expect = 7e-07, Method: Composition-based stats. 
Identities = 62/449 (13%), Positives = 122/449 (27%), Gaps = 14/449 (3%) Query: 79 EDSQPGTLNDFLGAMTEDDARPEALRRFELMVEEVARNASAVAQNTAAAKKSASDASTSA 138 SQ N +G ++ +PEA + + E+ + + AK+ Sbjct: 225 GKSQTAPTNQNIGIESQTKPQPEASKNKKDNKEKSKAAQTQGKPQSNVAKQPPKVDPKIG 284 Query: 139 REAATHAADAADSAR------AASTSAGQAASSAQSASSSA-GTASTKATEASKSAAAAE 191 EA A +A + ++ +S A G A A + Sbjct: 285 PEAVQSQAKSASQEASSKDLIQPTPEVTKSQKGNESTPPPAKGQEKPPAEPAKPQPQPSL 344 Query: 192 SSKSAAATSAGAAKTSETNASASLQSAATSASTATTKASEAATSARDAAASKEAAKSSET 251 + A +++ AA A A ++ A + + A KE K ++ Sbjct: 345 VDQQAKSSTESAANKGNQAAPAVIKDTAKNVTERAGIVEPPKQVQSQPATIKEEPKITQE 404 Query: 252 NASSSASSAASSATAAGNSAKAAKTSETNARSSETAAGQSASAAAGSKTAAASSASAAST 311 A S ++ T + + ++ ++ A Sbjct: 405 TAKVQVESQQAAQTIPTTETAKIEVKSKSPPAAVKENTKNEGKGKSPPANIKEPIRPADI 464 Query: 312 SAGQASASATAAGKSAESAASSASTATTKAGEATEQASAAARSASAAKTSETNAKASETS 371 + A A A Q + +A+ + + + +A++ Sbjct: 465 QKTPVTGKEAPASTVPSQPAVDDKQTKITPDLAKAQVDTSPPTAAVKQQATPSPEAAKNI 524 Query: 372 AESSKTAAASSAS---SAASSASSASASKDEATRQASAAKSSATTASTKATEAAGSATAA 428 T A SA A S +A+ ++ + K+ S + Sbjct: 525 QMPPTTEAKPSAEPVKIQAPQQSQPTATVEQVKPNVAPTKTETDIKPQPTVVKEESKLSP 584 Query: 429 AQSKSTAESAATRAETAAKRAE---DIASAVALEDASTTKKGIVQLSSATNSTSETLAAT 485 +K ++ AA + + SA A K + S T+S + Sbjct: 585 EPAKIESQPAAVKEQPNPIGEPAKNQNKSASPPAIAKKEVKPDSKPSKTTSSPPPSPQI- 643 Query: 486 PKAVKSAYDNAEKRLQKDQNGADIPDKGC 514 +K Q A I Sbjct: 644 ASPIKEETQTPTAGPPPAQTLAQIVATPT 672 >UniRef50_D1ZPA3 Whole genome shotgun sequence assembly, scaffold_70 n=1 Tax=Sordaria macrospora RepID=D1ZPA3_SORMA Length = 1409 Score = 64.5 bits (155), Expect = 2e-08, Method: Composition-based stats. 
Identities = 72/439 (16%), Positives = 132/439 (30%), Gaps = 10/439 (2%) Query: 118 SAVAQNTAAAKKSASDASTSAREAATHAADAADSARAASTSAGQAASSAQSASSSAGTAS 177 ++ A + A+ T D Q + ++ + Sbjct: 515 KETSEKPAHEDDTKDKATQETAHEDTQTDDEVTQQTTQDEITQQTTEEESTQQTTQEDVA 574 Query: 178 TKATEASKSAAAAESSKSAAATSAGAAKTSETNASASLQSAATSAS-------TATTKAS 230 + EA + E + K + S + A + Sbjct: 575 EEVHEAVATKDNVEVKEDTTPQK-TDLKEEAHPIAESKEEATEKEDVEEHTVHETAQEEH 633 Query: 231 EAATSARDAAASKEAAKSSETNASSSASSAASSATAAGNSAKAAKTSETNARSSETA-AG 289 SA + AS++A + + +A++ + A + A + ETA A Sbjct: 634 PTKESAPEEGASEKAHEEETLAKTYAAAAHEALEAEASQESTEAAQEAEAPPTEETAPAV 693 Query: 290 QSASAAAGSKTAAASSASAASTSAGQASA-SATAAGKSAESAASSASTATTKAGEATEQA 348 ++A A S+ A+ A+ + S A A K+A S+ T E A Sbjct: 694 ETAPVAEDSEANKAAVATEGAPSKEIAPAVDVAPVEKAAPVEDSAPVEKVTSVSEEAAPA 753 Query: 349 SAAARSASAAKTSETNAKASETSAESSKTAAASSASSAASSASSASASKDEATRQASAAK 408 + A +A +E A+ET+ T A+ S+ + +A S++ + +A++ + Sbjct: 754 AEATPAAETVTVAEETTPATETAHVEESTGASEDTSAKPAPVEAAPVSEELSGEKAASTE 813 Query: 409 SSATTASTKATEAAGSATAAAQSKSTAESAATRAETAAKRAEDIASAVALEDASTTKKGI 468 +A E A A + A A AE+ A E T Sbjct: 814 ETAPVIHAAPVETAAVEEATPVEDAEPVQATYAEAVKATDAEEPAPVKETEPVEETTLDK 873 Query: 469 VQLSSATNSTSETLAATPKAVKSAYDNAEKRLQKDQNGADIPDKGCFLNNINAVSKTDFA 528 + +E A P V +A R + F V+ + Sbjct: 874 AAPAEEATPVAEDAAEQPGPVANAAKEILGRSEPVTESQSGAVTPKFARTAAEVADSAAL 933 Query: 529 DKRGMRYVRVNAPAGATSG 547 G RV+ +G Sbjct: 934 LNEGTPEDRVSDEEAGKTG 952 Score = 56.1 bits (133), Expect = 8e-06, Method: Composition-based stats. 
Identities = 54/391 (13%), Positives = 116/391 (29%), Gaps = 13/391 (3%) Query: 107 ELMVEEVARNASAVAQNTAAAKKSASDASTSAREAATHAADAADSARAASTSAGQAASSA 166 E+ + + V + D + +T A AA A ++ + Sbjct: 146 EVEASQEKETVTTVTAVSETGGPPEHDVVLKVVDVSTTDAPAAKDAAHSNADVKDNTPAK 205 Query: 167 QSASSSAGTASTKATEASKSAAAAESSKSAAATSAGAAKTSETNASASLQSAATSASTAT 226 ++S A T + ++ A + + S ++E+N + S + A ++ Sbjct: 206 DESTSEAHTETKVHAPSTTEATLKDEALSEEEDKTKTKDSTESNVAPIEVSTSAEAPSSP 265 Query: 227 TKASEAATSARDAAASKEAAKSSETNASSSASSAASSATAAGNSAKAAKTSETNARSSET 286 S+ +A + + A E ++ S +A A+ +A Sbjct: 266 ETPSKEEETAPEETKVETATVVEEISSKPEDSQHQKAAAKEEVPARKTAPEAESATKESP 325 Query: 287 AAGQSASAAAGSKTAAASSASAASTSAGQASASATAAGKSAESAASSASTATTKAGEATE 346 AA + SA + Q + T A E A + + E Sbjct: 326 AAEKETSAEPTVENEETKIEEPTVVDVEQKKEAETDAESPVEKADVKKESTSPVETEEEA 385 Query: 347 QASAAARSASAAKTSETNAKASETSAESSKTAAASSASSAASSASSA----SASKDEATR 402 + SA+ + A+A+ +K + + + + S SKD+ T Sbjct: 386 TPAEDTVSAAPEAETTPEAEATLKEETPAKDTESKEVNQSKDLSHDTPAPESVSKDQETV 445 Query: 403 QA---------SAAKSSATTASTKATEAAGSATAAAQSKSTAESAATRAETAAKRAEDIA 453 A A++ + ++ A + + K +++ T E ++ Sbjct: 446 AAGVTVSEKDLPEAETVSKDETSLAETEKEESAPTHEDKVQKDTSNTTEEQPTEKETAPE 505 Query: 454 SAVALEDASTTKKGIVQLSSATNSTSETLAA 484 E + T + A Sbjct: 506 EEATEEKITKETSEKPAHEDDTKDKATQETA 536 >UniRef50_C5PDM6 Putative uncharacterized protein n=2 Tax=Coccidioides RepID=C5PDM6_COCP7 Length = 1433 Score = 64.2 bits (154), Expect = 3e-08, Method: Composition-based stats. 
Identities = 75/469 (15%), Positives = 139/469 (29%), Gaps = 27/469 (5%) Query: 50 SMDVEYGQYSVILLVEGFPPSHAGTITVYEDSQPGTLNDFLGAMTEDDARPEALRRFELM 109 S ++ G+Y T E G +N+ L T D++ E L+ + Sbjct: 58 SFELAPGKYQYKFRAGSGDAWFCDTDVPTEVDNDGNMNNVLLVETASDSKKEGLQAGQSK 117 Query: 110 VEEVARNASAVAQNTAAAKKSASD---------ASTSAREAATHAADAADSARAASTSAG 160 + NA V + +K+A+D + A + A+ A+ A + Sbjct: 118 SDNTTENAETVVASDETTEKAAADKESNGVNGVVAEKADPESKEASVPAEEAPEKAEKES 177 Query: 161 QAASSAQSASSSAGTASTKATEASKSAAAAESSKSAAATSAGAAKTSETNASASLQSAAT 220 ++ A K+T + A K+ + +Q A T Sbjct: 178 TGGEDVAEQPTTVSEAIAKSTPSDLKPADDSQPKANIKDEDVSVNGGPVEKKLEVQEAET 237 Query: 221 SASTATTKASEAAT--SARDAAASKEAAKSS-----ETNASSSASSAASSATAAGNSAKA 273 + KA E ++ S+ D +S A+S E + + A +++ Sbjct: 238 ESPQPEKKAPEVSSPDSSADPISSATEAESPGQAEPEPSEPKAEEPKAPVEPEIQPTSEV 297 Query: 274 AKTSETNARSSETAAGQSASAAAGSKTAAASSASAASTSAGQASASATAAGKSAESAASS 333 K +E E+ A + A A S T A +S++ + + Sbjct: 298 EKQAEATEPKDESPAVLNPEAPQQLTETAKDLESRPQTPASMSSSTRSLNPAAPAFVPGK 357 Query: 334 ASTATTKAGEATEQASAAARSASAAKTSETNAKASETSAESSKTAAASSASSAASSASSA 393 + + T+ A+ S E A + S+ ++S Sbjct: 358 FTPVSVPNPAETDDAAPKDDKESQPPAPEEPKVAEKVPEASTAGDEVDKDKEEVVASSKP 417 Query: 394 SASKDEATRQASAAKSSATTASTKATEAAGSATAAAQSKS-----TAESAATRAETAAKR 448 + T + A A +A A+ + A + E A + Sbjct: 418 EEDVAQTTTTTAEETKETPAAEQPAMDAVKEVKKPAEPAAPEQTEEPAEEAPKVEKAPEV 477 Query: 449 AEDIASAVALEDASTT------KKGIVQLSSATNSTSETLAATPKAVKS 491 A+ T K + S T + SE LA A + Sbjct: 478 ETAAAAVSTDPIPEATAEAVTEDKEEEKNVSTTEAPSEPLAENVTASAT 526 >UniRef50_A1C839 PT repeat family protein n=1 Tax=Aspergillus clavatus RepID=A1C839_ASPCL Length = 1885 Score = 63.8 bits (153), Expect = 4e-08, Method: Composition-based stats. 
Identities = 59/402 (14%), Positives = 122/402 (30%), Gaps = 10/402 (2%) Query: 107 ELMVEEVARNASAVAQNTAAAKKSASDASTSAREAATHAADAADSARAASTSAGQAASSA 166 E + T ++ S + + A + S ++ SS Sbjct: 85 EAVKHAADDEGVVNNILTVSSSPSTTATPVVRGDKAVKEESETPAVNGVSDKVEESESSK 144 Query: 167 QSASSSAGTASTKATEASKSAAAAESSKSAAATSAGAA--KTSETNASASLQSAATSAST 224 + +S A T A E K + +++ + +A + E+ S + A + Sbjct: 145 EPEVTSTSKAETVADEEQKQSEVPSTTEKDLPAESESAVKEEKESVPEPSKDTDAKDDAK 204 Query: 225 ATTKASEAATSARDAAASKEAAKSSETNASSSASSAASSATAAGNSAKAAKTSETNARSS 284 A A + S E ++ + + + TA +A K A+ + Sbjct: 205 AEPATESTAQPETNGTESTEQPSETKNDTPEAEKAETVKETAVEAPVEAVKDEAPKAQET 264 Query: 285 ETAAGQSASAAAGSK----TAAASSASAASTSAGQASASATAAGKSAESAASSASTATTK 340 + ++ A + AS + S + AT + S A + Sbjct: 265 PDTSAEAERATVEEEVNIGDKKKQEASESVVSVAEEKPEATKDAEEPASTAEKSFAEVAA 324 Query: 341 AGEATEQASAA--ARSASAAKTSETNAKASETSAESSKTAAASSASSAASSASSASASKD 398 A TE AS A A S+A S T A +E + A A + + Sbjct: 325 AEPVTEDASVAECAPEESSATESATEAVLTEAPVTEQPSEITEEAKPAVQESLE-QKESE 383 Query: 399 EATRQASAAKSSATTASTKATEAAGSATAAAQSKSTAESAATRAETAAK-RAEDIASAVA 457 A + +++ + EA + +A ++ + E + +++ Sbjct: 384 SAEAEQPTQETAEEKTIIEPNEAPTAEVSANEAVVEEPAKEITPEVVIEDSKKELDGVAP 443 Query: 458 LEDASTTKKGIVQLSSATNSTSETLAATPKAVKSAYDNAEKR 499 D +K ++ A + +A + A + Sbjct: 444 AVDEKPAEKVEETIAVAKEEPAADESAKEPVSEEASNQETPA 485 >UniRef50_A7AF35 Putative uncharacterized protein n=1 Tax=Parabacteroides merdae ATCC 43184 RepID=A7AF35_9PORP Length = 511 Score = 63.8 bits (153), Expect = 4e-08, Method: Composition-based stats. 
Identities = 69/220 (31%), Positives = 104/220 (47%) Query: 101 EALRRFELMVEEVARNASAVAQNTAAAKKSASDASTSAREAATHAADAADSARAASTSAG 160 E + + + N + + AA K+A+DA+ A AA A +A A AA+ A Sbjct: 161 EQMSQIKEEANTAISNVNTAKVSAEAATKAANDAAALANAAAGQATQSAGDADAATKLAV 220 Query: 161 QAASSAQSASSSAGTASTKATEASKSAAAAESSKSAAATSAGAAKTSETNASASLQSAAT 220 A + A+ + A TA+ A A+ SA A+ A A A +A+ +A Sbjct: 221 AATALAEEKAGIANTAAENADTAAASANMAKEEADKATVEANIAAGKANDAAGKADTATL 280 Query: 221 SASTATTKASEAATSARDAAASKEAAKSSETNASSSASSAASSATAAGNSAKAAKTSETN 280 +A+TAT KA+EAA+SA AA + AA + +SA +A SAT A +A AK + Sbjct: 281 NANTATDKANEAASSATTAAENANAAVERADDTIASAETATKSATDAALAANTAKENADK 340 Query: 281 ARSSETAAGQSASAAAGSKTAAASSASAASTSAGQASASA 320 A ++ AA A+ AG AA +A+ A A+ A Sbjct: 341 AANTAKAAATLANEKAGLADTAALAANTAKEDTIVATGKA 380 >UniRef50_B1YVB3 YadA domain protein n=6 Tax=Burkholderia RepID=B1YVB3_BURA4 Length = 737 Score = 63.4 bits (152), Expect = 5e-08, Method: Composition-based stats. Identities = 96/392 (24%), Positives = 170/392 (43%), Gaps = 4/392 (1%) Query: 114 ARNASAVAQNTAAAKKSASDASTSAREAATHAADAADSARAASTSAGQAASSAQSASSSA 173 + A+ + +A+ A + A ++ AA A A ++A A S A SS+A Sbjct: 179 SSTATGADSEASGKDSTANGARSKATGDSSTAAGADSQASGKDSTANGARSKATGDSSTA 238 Query: 174 GTASTKATEASKSAAAAESSKSAAATSAGAAKTSETNASASLQSAATSASTATTKASEAA 233 A ++A+ +A A S + +++A A + + ++ A + A+ ++ A+ A Sbjct: 239 AGADSEASGKDSTANGARSKATGDSSTAAGADSEASGKDSTANGARSKATGDSSTAAGAD 298 Query: 234 TSARDAAASKEAAKSSETNASSSASSAASSATAAGNSAKAAKTSETNARSSETAAGQSAS 293 + A ++ A+S T SS+ + A S A+ ++A A++ T S+ A AS Sbjct: 299 SEASGKDSTANGARSKATGESSTVAGADSEASGKDSTANGARSKATGDSSTAAGADSQAS 358 Query: 294 AAAGSKTAAASSASAASTSAGQASASATAAGKSAESAASSASTATTKAGEATEQASAAAR 353 + A S A+ S++A A + A+ +A A S A+ ++ A A QAS Sbjct: 359 GKDSAANGARSKATGDSSTAAGADSQASGKDSTANGARSKATGDSSTAAGADSQASGKDS 418 Query: 354 SASAAKTSETNAKASETSAESSKTAAA--SSASSAASSASSASASKDEATRQASAAKSSA 411 +A+ A++ T S T+A + A+ S+A+ A S+A+ S++ A QAS S+A Sbjct: 419 TANGARSKATGE--SSTAAGADAQASGRDSTANGARSTATGESSTAAGADSQASGRDSTA 476 
Query: 412 TTASTKATEAAGSATAAAQSKSTAESAATRAETAAKRAEDIASAVALEDASTTKKGIVQL 471 A +KAT + +A A S +S A A + A + E + Sbjct: 477 NGARSKATGESSTAGGADSVASGKDSTANGARSKATGDSSTVAGADSEASGKDSTANGAR 536 Query: 472 SSATNSTSETLAATPKAVKSAYDNAEKRLQKD 503 S AT +S A +A R + Sbjct: 537 SKATGDSSTAAGADSEASGKDSTATGARARAS 568 >UniRef50_B1N0K0 Putative mucus binding protein n=3 Tax=Lactobacillales RepID=B1N0K0_LEUCK Length = 1977 Score = 63.0 bits (151), Expect = 6e-08, Method: Composition-based stats. Identities = 47/242 (19%), Positives = 120/242 (49%) Query: 182 EASKSAAAAESSKSAAATSAGAAKTSETNASASLQSAATSASTATTKASEAATSARDAAA 241 A +A + A T++ ++ ++ S S ++ + T+ +S +AT+A A Sbjct: 52 SADVAATSTSGIAVRAETTSASSSSAVKADSTSANNSIAVKAETTSASSNSATNAETTGA 111 Query: 242 SKEAAKSSETNASSSASSAASSATAAGNSAKAAKTSETNARSSETAAGQSASAAAGSKTA 301 S +A ++ET ++SS S+ + T A +++ + + +S T A + +++ + A Sbjct: 112 SSNSATNAETTSASSISATNAETTGASSNSATNAETTGASSNSATNAETTGASSNSATNA 171 Query: 302 AASSASAASTSAGQASASATAAGKSAESAASSASTATTKAGEATEQASAAARSASAAKTS 361 + AS+ S + + + +++ + +AE+ +S+ +AT SA +++A S Sbjct: 172 ETTGASSNSATNAETTGASSISATNAETTGASSISATNAETTGASSNSATNADSTSANNS 231 Query: 362 ETNAKASETSAESSKTAAASSASSAASSASSASASKDEATRQASAAKSSATTASTKATEA 421 + +++ SS A ++++S+ S+A++ + S ++ + S++++++ KA Sbjct: 232 SAVKAETASASSSSAVNAETTSASSISAANAETTSASSSSAANAETTSASSSSAVKADAI 291 Query: 422 AG 423 + Sbjct: 292 SA 293 Score = 62.6 bits (150), Expect = 8e-08, Method: Composition-based stats. 
Identities = 68/266 (25%), Positives = 138/266 (51%), Gaps = 3/266 (1%) Query: 203 AAKTSETNASASLQSAATSASTATTKASEAATSARDAAASKEAAKSSETNASSSASSAAS 262 +A + T+ S A T+++++++ +TSA ++ A K S+ +N++++A + + Sbjct: 52 SADVAATSTSGIAVRAETTSASSSSAVKADSTSANNSIAVKAETTSASSNSATNAETTGA 111 Query: 263 SATAAGNSAKAAKTSETNARSSETAAGQSASAAAGSKTAAASSASAASTSAGQASASATA 322 S+ +A N+ + +S + + T A +++ A + A+++SA+ A T+ G +S SAT Sbjct: 112 SSNSATNAETTSASSISATNAETTGASSNSATNAETTGASSNSATNAETT-GASSNSATN 170 Query: 323 AGKSAESAASSASTATTKAG--EATEQASAAARSASAAKTSETNAKASETSAESSKTAAA 380 A + S+ S+ + TT A AT + A S SA T A ++ + S +A Sbjct: 171 AETTGASSNSATNAETTGASSISATNAETTGASSISATNAETTGASSNSATNADSTSANN 230 Query: 381 SSASSAASSASSASASKDEATRQASAAKSSATTASTKATEAAGSATAAAQSKSTAESAAT 440 SSA A ++++S+S++ + T AS+ ++ ++ ++ +A +A + S S+A A Sbjct: 231 SSAVKAETASASSSSAVNAETTSASSISAANAETTSASSSSAANAETTSASSSSAVKADA 290 Query: 441 RAETAAKRAEDIASAVALEDASTTKK 466 + + IA+ V+ DA T K Sbjct: 291 ISAGGNQIVAQIATNVSKNDAIVTTK 316 Score = 60.7 bits (145), Expect = 3e-07, Method: Composition-based stats. 
Identities = 64/258 (24%), Positives = 138/258 (53%), Gaps = 2/258 (0%) Query: 154 AASTSAGQAASSAQSASSSAGTASTKATEASKSAAAAESSKSAAATSAGAAKTSETNASA 213 AA++++G A + +++SS+ +T A+ S A + SA++ SA A+T+ ++++ Sbjct: 56 AATSTSGIAVRAETTSASSSSAVKADSTSANNSIAVKAETTSASSNSATNAETTGASSNS 115 Query: 214 SLQSAATSASTATTKASEAATSARDAAASKEAAKSSETNASSSASSAASSATAAGNSAKA 273 + + TSAS+ + +E ++ ++A + E +S +A+++ ++ ASS + +A+ Sbjct: 116 ATNAETTSASSISATNAETTGASSNSATNAETTGASSNSATNAETTGASS--NSATNAET 173 Query: 274 AKTSETNARSSETAAGQSASAAAGSKTAAASSASAASTSAGQASASATAAGKSAESAASS 333 S +A ++ET S SA T A+S ++ + + G +S SAT A ++ + +S+ Sbjct: 174 TGASSNSATNAETTGASSISATNAETTGASSISATNAETTGASSNSATNADSTSANNSSA 233 Query: 334 ASTATTKAGEATEQASAAARSASAAKTSETNAKASETSAESSKTAAASSASSAASSASSA 393 T A ++ + ++S + + AS +SA +++T +ASS+S+ + A SA Sbjct: 234 VKAETASASSSSAVNAETTSASSISAANAETTSASSSSAANAETTSASSSSAVKADAISA 293 Query: 394 SASKDEATRQASAAKSSA 411 ++ A + +K+ A Sbjct: 294 GGNQIVAQIATNVSKNDA 311 Score = 58.8 bits (140), Expect = 1e-06, Method: Composition-based stats. 
Identities = 69/253 (27%), Positives = 130/253 (51%), Gaps = 6/253 (2%) Query: 162 AASSAQSASSSAGTASTKATEASKSAAAAESSKSAAATSAGAAKTSETNASASLQSAATS 221 S SA +A + S A A ++A++ S+ A +TSA + + +++ ++AT+ Sbjct: 46 VTSRPVSADVAATSTSGIAVRAETTSASSSSAVKADSTSANNSIAVKAETTSASSNSATN 105 Query: 222 ASTATTKASEAATSARDAAASKEAAKSSETNASSSASSAASSATAAGNSAKAAKT---SE 278 A T ++ A + +A+S A + T ASS++++ A + A+ NSA A+T S Sbjct: 106 AETTGASSNSATNAETTSASSISATNAETTGASSNSATNAETTGASSNSATNAETTGASS 165 Query: 279 TNARSSETAAGQSASAAAGSKTAAASSASAASTSAGQASASATAAGKSAESAASSASTAT 338 +A ++ET S SA T A+S ++ + + G +S SAT A + ASS S Sbjct: 166 NSATNAETTGASSNSATNAETTGASSISATNAETTGASSISATNAE---TTGASSNSATN 222 Query: 339 TKAGEATEQASAAARSASAAKTSETNAKASETSAESSKTAAASSASSAASSASSASASKD 398 + A ++ A +ASA+ +S NA+ + S+ S+ A +SASS++++ + +++ Sbjct: 223 ADSTSANNSSAVKAETASASSSSAVNAETTSASSISAANAETTSASSSSAANAETTSASS 282 Query: 399 EATRQASAAKSSA 411 + +A A + Sbjct: 283 SSAVKADAISAGG 295 >UniRef50_C8VCU8 Putative uncharacterized protein n=2 Tax=Emericella nidulans RepID=C8VCU8_EMENI Length = 1592 Score = 63.0 bits (151), Expect = 7e-08, Method: Composition-based stats. 
Identities = 78/437 (17%), Positives = 143/437 (32%), Gaps = 16/437 (3%) Query: 94 TEDDARPEALRRFELMVEEVARNASAVAQNT--AAAKKSASDASTSAREAATHAADAADS 151 E +A++ E E A A A + + A T E A A Sbjct: 636 EEVATATDAVKSVETTTVEPAVEAEAATEKAKVEESTTVDEVAETEVAETAKEVASEEPK 695 Query: 152 ARAASTSAGQAASSAQSASSSAGTASTKATEASKSAAAAESSKSAAATSAGAAKTSETNA 211 A A+ +++ + + + + + + + A A T Sbjct: 696 TEEPVAVAEAVDEPAKEVANTEPSEAAVPENPAPTEEPEKGATNEEPKPAEAVAEPMTEP 755 Query: 212 SASL---------QSAATSASTATTKASEAATSARDAAASKEAAKSSETNASSSASSAAS 262 + AA ++ A A + +A A+++ A S E A ++A S Sbjct: 756 ANVAVETEESKEATEAAAESAAEPAVAETAVENVSEAPAAEKEAVSEEPKAEEPIATAES 815 Query: 263 SATAAGNSAKAAKTSETNARSSETAAGQSASAAAGSKTAA-ASSASAASTSAGQASASAT 321 + ++ S + A +A A+ A SS A + +A A T Sbjct: 816 PEVPGKETVVEESAPDSVTESKDAPAEVAAEASITEVPAVLKSSEEQADKAVAEAPADTT 875 Query: 322 AAGKSAESAASSASTATTKAGEATEQASAAARSASAAKTSETNAKASETSAESSKTAA-- 379 K ESA +T T A ATE A+ + A++ + E AA Sbjct: 876 PTAKPVESATQEPATETADAPSATEPATTESPKEPASEAPTEVPTVETVATEEVTEAAHD 935 Query: 380 --ASSASSAASSASSASASKDEATRQASAAKSSATTASTKATEAAGSATAAAQSKSTAES 437 A AS + + S + A A+ A + + ++ Sbjct: 936 KPAEEQPETVDGASGKTVDAEVPGETQSTTTAEAIAAAPIEKFATEETVSKIPVEGVSKE 995 Query: 438 AATRAETAAKRAEDIASAVALEDASTTKKGIVQLSSATNSTSETLAATPKAVKSAYDNAE 497 T E ++ ++ A+ E + + + A + +E+ ATP+A K+ Y AE Sbjct: 996 ETTVGEPGTEKPDEAAAPEVSEVEVAEESKAPETTPAEIAPAESTNATPEASKAEYAPAE 1055 Query: 498 KRLQKDQNGADIPDKGC 514 ++ +PD+ Sbjct: 1056 TAEEEPPVEKQLPDETA 1072 Score = 50.7 bits (119), Expect = 3e-04, Method: Composition-based stats. 
Identities = 57/357 (15%), Positives = 110/357 (30%), Gaps = 6/357 (1%) Query: 107 ELMVEEVARNASAVAQNTAAAKKSASDASTSAREAATHAADAADSARAASTSAGQAASSA 166 E E+ A+ A++S + +T A A + +A A A + + A Sbjct: 1001 EPGTEKPDEAAAPEVSEVEVAEESKAPETTPAEIAPAESTNATPEASKAEYAPAETAEEE 1060 Query: 167 QSASSSAGTASTKATEASKSAAAAESSKSAAATSAGAAKTSETNASASLQSAATSASTAT 226 E + A+ E ++T+ T T+A A+ + +A+ T Sbjct: 1061 PPV------EKQLPDETAAPASVEEQPAGESSTTDAVNTTELTSADAASEPVPEAATDDT 1114 Query: 227 TKASEAATSARDAAASKEAAKSSETNASSSASSAASSATAAGNSAKAAKTSETNARSSET 286 T + T+ A A+ A + + A++ + + + +E+ Sbjct: 1115 TATAPVVTAEPAAEAATSAPEEQKEEAAAQEPDSKPVDKDSTAVEATPAETLKEEPVTES 1174 Query: 287 AAGQSASAAAGSKTAAASSASAASTSAGQASASATAAGKSAESAASSASTATTKAGEATE 346 +A A S A+ + + A + + A+T A E Sbjct: 1175 VTENAAEADKVEPEVVEPSKDASPVEVIEEPVAEAATETTVAEENTPATTENEPASIKEE 1234 Query: 347 QASAAARSASAAKTSETNAKASETSAESSKTAAASSASSAASSASSASASKDEATRQASA 406 A A E A+ +A + + A++ + + S A Sbjct: 1235 VTEPAEPVIDEAAVEEPVAEKPAANAPAIEAGASTVPQETSEEFEAPSKPAAPLDEAAIV 1294 Query: 407 AKSSATTASTKATEAAGSATAAAQSKSTAESAATRAETAAKRAEDIASAVALEDAST 463 + + K E + S S+ E E E S V ++DA Sbjct: 1295 EAPAVASEEQKLDEVSASEIRPETSQQAPEETTPVPEKEPTAPEPAESKVEVKDAEV 1351 Score = 48.0 bits (112), Expect = 0.002, Method: Composition-based stats. 
Identities = 64/392 (16%), Positives = 119/392 (30%) Query: 96 DDARPEALRRFELMVEEVARNASAVAQNTAAAKKSASDASTSAREAATHAADAADSARAA 155 +A PEA + E Q AS A E++T A +A Sbjct: 1040 TNATPEASKAEYAPAETAEEEPPVEKQLPDETAAPASVEEQPAGESSTTDAVNTTELTSA 1099 Query: 156 STSAGQAASSAQSASSSAGTASTKATEASKSAAAAESSKSAAATSAGAAKTSETNASASL 215 ++ +A +++ T A + +A E K AA +K + +++A Sbjct: 1100 DAASEPVPEAATDDTTATAPVVTAEPAAEAATSAPEEQKEEAAAQEPDSKPVDKDSTAVE 1159 Query: 216 QSAATSASTATTKASEAATSARDAAASKEAAKSSETNASSSASSAASSATAAGNSAKAAK 275 + A + S +A E + S+ + + A + Sbjct: 1160 ATPAETLKEEPVTESVTENAAEADKVEPEVVEPSKDASPVEVIEEPVAEAATETTVAEEN 1219 Query: 276 TSETNARSSETAAGQSASAAAGSKTAAASSASAASTSAGQASASATAAGKSAESAASSAS 335 T T + + A AA A A A A A + + Sbjct: 1220 TPATTENEPASIKEEVTEPAEPVIDEAAVEEPVAEKPAANAPAIEAGASTVPQETSEEFE 1279 Query: 336 TATTKAGEATEQASAAARSASAAKTSETNAKASETSAESSKTAAASSASSAASSASSASA 395 + A E A A + ++ + ASE E+S+ A + ++ Sbjct: 1280 APSKPAAPLDEAAIVEAPAVASEEQKLDEVSASEIRPETSQQAPEETTPVPEKEPTAPEP 1339 Query: 396 SKDEATRQASAAKSSATTASTKATEAAGSATAAAQSKSTAESAATRAETAAKRAEDIASA 455 ++ + + + + A+T+ T A T+ A + + + S Sbjct: 1340 AESKVEVKDAEVHPVESVATTEETTAETPNTSVAGASQEVPAQVEEEKVFKDEDSFPTSE 1399 Query: 456 VALEDASTTKKGIVQLSSATNSTSETLAATPK 487 V A+ V +A + E AA P Sbjct: 1400 VIAGGAAAAAAAAVVAGAAAVAHKEEPAAAPV 1431 >UniRef50_C2CTF5 Putative uncharacterized protein n=1 Tax=Gardnerella vaginalis ATCC 14019 RepID=C2CTF5_GARVA Length = 885 Score = 62.6 bits (150), Expect = 9e-08, Method: Composition-based stats. 
Identities = 58/405 (14%), Positives = 121/405 (29%), Gaps = 11/405 (2%) Query: 108 LMVEEVARNASAVAQNTAAAKKSASDASTSAREAATHAADAADSARAASTSAGQAASSAQ 167 V + + + + TS +A + + +A + Sbjct: 91 QTVNSPVTGDADAGKQEKENPGTVQGSGTSKNNTPANA-EILQKVEESKNLQKEAQNKHN 149 Query: 168 SASSSAGTASTKATEASKSAAAAESSKSAAATSAGAAKTSETNASASLQSAATSASTATT 227 A+ + A +A + ++ A K+ A S + ++ +A + Sbjct: 150 EANKAVEEAKQEAEDTARKINEAVEKKNNAENSIAETNKKIETETQNIANAEQRIEEKSK 209 Query: 228 KASEAATSARDAAASKEAAKSSETNASSSASSAASSATAAGNSAKAAKTSETNARSSETA 287 EA + +K+ A S ++ A A + + N +++ Sbjct: 210 AVKEAEKDKKVTDEAKQNADQVNKIRSEEKAAKDELAEDAKDDNEETAEDLANEEEAKSK 269 Query: 288 AGQSASAAAGSKTAAASSASAASTSAGQASASATAAGKSAESAASSASTATT-----KAG 342 + A + A A Q A+A E+ TA A Sbjct: 270 LEAELAEAKQKSESTKVEAEKLEKEAQQKEAAAEGVDNKIENITKQIKTAEENLKTLDAD 329 Query: 343 EATEQASAAARSASAAKTSETNAKASETSAESSKTAAASSASSAASSASSASASKDEATR 402 + + ++ K A+ + + + T + + + + Sbjct: 330 NDQNKINQLKEKVASTKKQAEEAEKTAAQKQLAYTKQKEKVAEYERELQKLESDRSKLGD 389 Query: 403 QASAAKSSATTASTKATEAAGSATAAAQSKSTAESAATRAETAAKRAEDIASAVALEDAS 462 + K T A TK TEA Q K+ A +A +E + A+ +A D S Sbjct: 390 PSHIEK--LTNAQTKKTEAQDIEE---QLKNEASTAKENSEKLSTEAKKAEEELAKADKS 444 Query: 463 TTKKGIVQLSSATNSTSETLAATPKAVKSAYDNAEKRLQKDQNGA 507 + + ++ S S +L K A A ++ + + A Sbjct: 445 SQIRDLLDTISELTSEINSLTRISDNAKDARKAANEKKSESNSLA 489 Score = 53.0 bits (125), Expect = 7e-05, Method: Composition-based stats. 
Identities = 72/447 (16%), Positives = 147/447 (32%), Gaps = 13/447 (2%) Query: 93 MTEDDARPEALRRFELMVEEVARNASAVAQNTAAAKKSASDASTSAREAATHAA--DAAD 150 +T + + E ++ A A ++ + K A + A +++ D Sbjct: 396 LTNAQTKKTEAQDIEEQLKNEASTAKENSEKLSTEAKKAEEELAKADKSSQIRDLLDTIS 455 Query: 151 SARAASTSAGQAASSAQSASSSAGTASTKATEASKSAAAAESSKSAAATSAGAAKTSETN 210 + S + + +A+ A +A +++ +K + ES + + K S+ + Sbjct: 456 ELTSEINSLTRISDNAKDARKAANEKKSESNSLAKKVSDKESELAELENTIKELKESKQD 515 Query: 211 ASASLQSAATSASTATTKASEAATSARDAAASKEAAKSSETNASSSA--SSAASSATAAG 268 A L A+ A A +A +AA S A ++EA K+ + A +A A Sbjct: 516 AEKDLNVASRDAEKAKEEAEKAAQSL--LARTEEAQKAEKKVQDEKANFENAKIDKETAE 573 Query: 269 NSAKAAKTSETNARSSETAAGQSASAAAGSKTAAASSASAASTSAGQASASATAAGKSAE 328 + K + A + A A + A T+ A S E Sbjct: 574 KNLNQLKEDLKRIDKEISEAKTNVEATKQLAETAKQKLNDAKTNVEATKAKILELKASIE 633 Query: 329 SAASSASTATTKAGEATEQA-----SAAARSASAAKTSETNAKASETSAESSKTAAASSA 383 + + + E A A A S A E S ++K A S Sbjct: 634 ETKRTLISKYAASKTNIETAFDLMIKEVANRAKELDNSTNTLNAVENSLTAAKQKAEESL 693 Query: 384 SSAASSASSASASKDEATRQASAAKSSATTASTKATEAAGSATAAAQSKSTAESAATRAE 443 + + ++ + A A + S A + +A Q+ + Sbjct: 694 NPQVQKPAPSTPAPAPAPSTPQNAPQNNAAPSAPQASAPSVSQSAPQAAPKHSAVPANQT 753 Query: 444 TAAKRAEDIASAVALEDASTTKKGIVQLSSATNSTSETLAATPKAVKSAYDNAEKRLQKD 503 + A+ A+ +E+A+ ++ + T + + A T + + K D Sbjct: 754 ESTHTAK--AAEELVENATAKSPATAEIPARTPRAAASAAPTASSANTTKSEESKNTSND 811 Query: 504 QNGADIPDKGCFLNNINAVSKTDFADK 530 + + N ++ S+ + +DK Sbjct: 812 SAKDENQKESEESKNEDSKSEDNSSDK 838 Score = 51.1 bits (120), Expect = 3e-04, Method: Composition-based stats. 
Identities = 57/297 (19%), Positives = 112/297 (37%), Gaps = 6/297 (2%) Query: 101 EALRRFELMVEEVARNASAVAQNTAAAKKSASDASTSAREAATHAADAADSARAASTSAG 160 E L+R + + E N A Q AK+ +DA T+ AT A A T Sbjct: 581 EDLKRIDKEISEAKTNVEATKQLAETAKQKLNDAKTNVE--ATKAKILELKASIEETKRT 638 Query: 161 QAASSAQSASSSAGTASTKATEASKSA---AAAESSKSAAATSAGAAKTSETNA-SASLQ 216 + A S ++ E + A + ++ +A S AAK + + +Q Sbjct: 639 LISKYAASKTNIETAFDLMIKEVANRAKELDNSTNTLNAVENSLTAAKQKAEESLNPQVQ 698 Query: 217 SAATSASTATTKASEAATSARDAAASKEAAKSSETNASSSASSAASSATAAGNSAKAAKT 276 A S S + ++ AA S+ + + S+ +A + N ++ T Sbjct: 699 KPAPSTPAPAPAPSTPQNAPQNNAAPSAPQASAPSVSQSAPQAAPKHSAVPANQTESTHT 758 Query: 277 SETNARSSETAAGQSASAAAGSKTAAASSASAASTSAGQASASATAAGKSAESAASSAST 336 ++ E A +S + A ++ASAA T++ + + + ++ +A + Sbjct: 759 AKAAEELVENATAKSPATAEIPARTPRAAASAAPTASSANTTKSEESKNTSNDSAKDENQ 818 Query: 337 ATTKAGEATEQASAAARSASAAKTSETNAKASETSAESSKTAAASSASSAASSASSA 393 ++ + + S S +A+ S N ++ AA + + +A +A A Sbjct: 819 KESEESKNEDSKSEDNSSDKSAQASGANNGEHAADNSTTLIAAIAGSITAGIAAIGA 875 >UniRef50_A4A060 Putative uncharacterized protein n=1 Tax=Blastopirellula marina DSM 3645 RepID=A4A060_9PLAN Length = 1330 Score = 62.6 bits (150), Expect = 9e-08, Method: Composition-based stats. 
Identities = 76/409 (18%), Positives = 134/409 (32%), Gaps = 25/409 (6%) Query: 127 AKKSASDASTSAREAATHAADAADS-ARAASTSAGQAASSAQSASSSAGTASTKATEASK 185 A+ A+ S A DAA A A +A Q A S + +A Sbjct: 853 AQGEPEGAAQSQAAADKSLGDAAQKLASAIDQAAQQLAQDLGKQSPQLDQLAQQAGAVDP 912 Query: 186 SAAAAESSKSAAATSAGAAK--TSETNASASLQSAATSASTATTKASEAATSARDAAASK 243 +AA+A A+ + +++ AS + +A +A +AA + Sbjct: 913 NAASALRQAEQASQAGAEMNSPQADSPASPAPGEMQQAADSAQRSLQQAAAALTAREQQL 972 Query: 244 EAAKSSETNASSSASSAASSATAAGNSAKAAKTSETNARSSETAAGQSASAAAGSKTAAA 303 KS ++ A ++ A A + + E A GQ+ S + + AAA Sbjct: 973 ARDKSIAEALAALAQQQQNARDEIDRQADALAENMPSPPQGEAAEGQTTSPISPAAAAAA 1032 Query: 304 SSASAASTSAGQASASATAAGKSAESAASSASTATTKAGEATEQ----------ASAAAR 353 + AS +A + + A+ + EA Q A A+ Sbjct: 1033 QQLANASRQFAEAQTATGQGAQQISGQQEVANQPLREGLEAASQLPLPEQSLAPAPPASD 1092 Query: 354 SASAAKTSETNAKASETSAESSKTAAASSASSAAS-----SASSASASKDEATRQASAAK 408 A E N+ A E ++ A+ + + +AS +A A Sbjct: 1093 LAMGETGGEMNSSAQSPGGEGNQPASNNQPLGQPAQGQGPGEQTASGDASQAMPGAGPDL 1152 Query: 409 SSATTA-STKATEAAGSATAAAQSKSTAESAATRAETAAKRAEDIASAVALEDASTTKKG 467 ++ T + ATA A + A A +A AA +A Sbjct: 1153 GPTSSELGTGLVPNSPQATADAIAGGQAVQQALQAMPAAPQAGMPGQVAGAPRPGVGDGP 1212 Query: 468 IVQLSSATNSTSETLAATPKAVKSAYDNAEKRLQK------DQNGADIP 510 Q + +++ T P+ ++ A + +Q+GAD+P Sbjct: 1213 TAQAPAPAAASAATPGQQPQPGAASSVAATGGTSQGGEQTQNQDGADMP 1261 Score = 56.1 bits (133), Expect = 8e-06, Method: Composition-based stats. 
Identities = 77/388 (19%), Positives = 149/388 (38%), Gaps = 16/388 (4%) Query: 109 MVEEVARNASAVAQNTAAAKKSASDASTSAREAATHAADAADSARAASTSAGQAASSAQS 168 V EV + + A SA+D + + A + A + AA + Sbjct: 565 QVAEVLKKSKPAIAAAAKDLASAADPKNADKATPQLQKAAEAAQSAGESLKQAAAELRKE 624 Query: 169 ASSSAGTASTKATEASKSAAAAESSKSAAATSAGAAKTSETNASASLQSAATSASTATTK 228 A +A + + +AA+ + + +A + E +A + AA A K Sbjct: 625 AGKAAQELAEMTGQQLAQTSAAKEAVKQSLDNAASGDNLEQLQAAQEKIAA--AQMEQMK 682 Query: 229 ASEAATSARDAAASKEAAKSSETNASSSASSAASSATAAGNSAKAAKTSETNARSSETAA 288 A + +A ++E K ++ ++ A++A S A + AA + + +E A Sbjct: 683 AEGKSAAAAAQQLAQEIGKLAQMQQAADAAAAQLSQGKANSPVDAAAQQQQVSDEAEGLA 742 Query: 289 GQSASAAAGS------KTAAASSASAASTSAGQASASATAAGKSAESAASSASTATTKAG 342 Q+ A A + AAA+ + + A+A AA + A + + A K+ Sbjct: 743 QQTDGAIAEALQNAEKAAAAAAKETLSGDPNKAAAAREEAAAQLASAQRKAQQMAAEKSA 802 Query: 343 EATEQASAAARSASAAKTSETNAKASETSAESSKTAAASSASSAASSASSASASKDEATR 402 E Q AAA++ +A +E A++ + E++ A ++ + A++ + A + A + Sbjct: 803 EPAGQTDAAAQAKAADLAAEAAQLAADAAPEAAAAAQQAADAGEAAAKNLAQGEPEGAAQ 862 Query: 403 QASAAKSSATTASTKATEA--AGSATAAAQSKSTAESAATRAETAAKRAEDIASAVALED 460 +AA S A+ K A + A + A+ A + ASA+ + Sbjct: 863 SQAAADKSLGDAAQKLASAIDQAAQQLAQDLGKQSPQLDQLAQQAGAVDPNAASALRQAE 922 Query: 461 ASTTKKGIVQLSSATNSTSETLAATPKA 488 ++ Q + NS A+P Sbjct: 923 QAS------QAGAEMNSPQADSPASPAP 944 Score = 55.7 bits (132), Expect = 1e-05, Method: Composition-based stats. 
Identities = 82/433 (18%), Positives = 152/433 (35%), Gaps = 32/433 (7%) Query: 102 ALRRFELMVEEVARNASAVAQNTAAAKKSASDASTSAREAATHAADAADSARAASTSAGQ 161 A + E M E A+A Q K A + AA + A+S A+ Q Sbjct: 674 AAAQMEQMKAEGKSAAAAAQQLAQEIGKLAQMQQAADAAAAQLSQGKANSPVDAAAQQQQ 733 Query: 162 AASSAQSASSSAGTASTKATEASKSAAAAESSKSAAATSAGAAKTSETNASASLQSAATS 221 + A+ + A +A + ++ AAAA + ++ + AA E A+ + + Sbjct: 734 VSDEAEGLAQQTDGAIAEALQNAEKAAAAAAKETLSGDPNKAAAAREEAAAQLASAQRKA 793 Query: 222 ASTATTKASEAATSARDAAASKEAAKSSETNASSSASSAASSATAAGNSAKAAKTSETNA 281 A K++E A AA +K A ++E ++ ++ ++A A + ++ A Sbjct: 794 QQMAAEKSAEPAGQTDAAAQAKAADLAAEAAQLAADAAPEAAAAAQQAADAGEAAAKNLA 853 Query: 282 RSSETAAGQSASAAAGSKTAAASSASAASTSAGQA-SASATAAGKSAESAASSASTATTK 340 + A QS +AA S AA ++A A Q + + A A Sbjct: 854 QGEPEGAAQSQAAADKSLGDAAQKLASAIDQAAQQLAQDLGKQSPQLDQLAQQAGAVDPN 913 Query: 341 AGEATEQASAAARSASAAKTSETNAKASETSA--ESSKTAAASS---------------- 382 A A QA A+++ + + + ++ AS + + +A S Sbjct: 914 AASALRQAEQASQAGAEMNSPQADSPASPAPGEMQQAADSAQRSLQQAAAALTAREQQLA 973 Query: 383 -----ASSAASSASSASASKDEATRQASAAKSSATTASTKATEAAGSATAAAQSKSTAES 437 A + A+ A ++DE RQA A + + + A G T+ + A + Sbjct: 974 RDKSIAEALAALAQQQQNARDEIDRQADALAENMP-SPPQGEAAEGQTTSPISPAAAAAA 1032 Query: 438 AATRAETAAKRAEDIASAVALEDASTTKKGIVQL-------SSATNSTSETLAATPKAVK 490 + A+ + S ++ Q +S ++LA P A Sbjct: 1033 QQLANASRQFAEAQTATGQGAQQISGQQEVANQPLREGLEAASQLPLPEQSLAPAPPASD 1092 Query: 491 SAYDNAEKRLQKD 503 A + Sbjct: 1093 LAMGETGGEMNSS 1105 >UniRef50_C7JGL3 Chromosome segregation protein SMC n=9 Tax=Alphaproteobacteria RepID=C7JGL3_ACEP3 Length = 1515 Score = 61.9 bits (148), Expect = 1e-07, Method: Composition-based stats. 
Identities = 87/473 (18%), Positives = 164/473 (34%), Gaps = 34/473 (7%) Query: 8 VLKDGTGKPVQNCTIQLKAKRNSTTVVVNTLA--SENPDEAGRYSMDVE--YGQYSVILL 63 +LKD + P+ + LK+ + + LA PD S+ + GQ V Sbjct: 731 LLKDASAAPLPGKAVALKSVITAPPELNRVLAYTGVVPDGTDGASLQAQLLPGQCLV--- 787 Query: 64 VEGFPPSHAGTITVYEDSQPGTLNDFLGAMTEDDARPEALRRFELMVEEVARNASAVAQN 123 S AG + ++ F E D+ L + ++ E + + + Q+ Sbjct: 788 ------SRAGDLWRWDG--------FYTRAGEPDSSARRLAQ-RRILRETSARIAEMEQH 832 Query: 124 TAAAKKSASDASTS--AREAATHAADAADSARAASTSAGQAASSAQSASSSAGTASTKAT 181 A++ A A T+ A E S S + S ++ A A Sbjct: 833 VPQAEEKAVAARTNVQAGEKQAQEQRVERSKLEQSLQKARTQESELERQHTSFRARLDAL 892 Query: 182 EASKSAAAAESSKSAAATSAGAAKTSETNASASLQSA---ATSASTATTKASE-AATSAR 237 + A A +++ +A +A + + Q A A TA KA + T+ + Sbjct: 893 RPQQERALAAKAEAESALAAATTAQQAVPPAQNFQQALATAREQHTAAQKAEQDCRTALK 952 Query: 238 DAAASKEAAKSSETNASSSASSAASSATAAGNSAKAAKTSETNARSSETAAGQSASAAAG 297 A + + + +T + ++A + + + ++ Q ++A Sbjct: 953 LAEQTFQRVQQKQTQTENQHTAATTRLETLAPERLRLRQNLEAEEANILELEQRLTSAQT 1012 Query: 298 SKTAAASSASAASTSAGQASASATAAGKSAESAASSASTATTKAGEATEQASAAARSASA 357 + A A++ A + QA + A + A +A +T + + EQA +A Sbjct: 1013 -ENATAAALKDAQDNLEQAQRAFQMASSAFAQAEQAAQASTQQQQKMQEQALTLRSRIAA 1071 Query: 358 AKTSETNAKASETSAESSKTAAAS-SASSAASSASSASASK----DEATRQASAAKSSAT 412 + + A+ TAA A +AA+ A + + S ++ Sbjct: 1072 LTPRLEELQQEQQDAQDKLTAATQTEAQTAAALPQDAEETLAHLHTQRAALTSQLDATRE 1131 Query: 413 TASTKATEAAGSATAAAQSKSTAESAATRAETAAKRAEDIASAVALEDASTTK 465 +T EA+ T + E + RA TA +E+ A V TK Sbjct: 1132 LRATLQAEASTLETRLTSLVAAEEEWSQRAATANAESENAAQRVETARNEHTK 1184 Score = 48.8 bits (114), Expect = 0.001, Method: Composition-based stats. 
Identities = 43/233 (18%), Positives = 82/233 (35%), Gaps = 2/233 (0%) Query: 276 TSETNARSSETAAGQSASAAAGSKTAAASSASAASTSAGQASASATAAGKSAESAASSAS 335 +E R++E+ ++ A + ++ SA E A + Sbjct: 181 EAELKLRATESNLTRAEDRRQQLSDRLDGLAEQSRDASRYRELSAALREAETELLAVLHA 240 Query: 336 TATTKAGEATEQASAAARSASAAKTSETNAKASETSAESSKTAAASSASSAASSASSASA 395 A A + A+ A ++ + + + +A +E A A A +A ++ Sbjct: 241 RARLAVERAIDNAARARKALTEHEEAAESAVVAEFEANKVLPGAREKADAARTALERCRV 300 Query: 396 SKDEATRQASAAKSSATTAST--KATEAAGSATAAAQSKSTAESAATRAETAAKRAEDIA 453 + R+ A + A A+ K EA A +TA +AE A A + Sbjct: 301 LAEGVAREEERAATQANDAAERLKQHEADADAAKTRLDDATATLERLKAEKAETEAAIAS 360 Query: 454 SAVALEDASTTKKGIVQLSSATNSTSETLAATPKAVKSAYDNAEKRLQKDQNG 506 +A T ++ + + + T L A ++A+D A + L QN Sbjct: 361 LPTRTAEAETKQQTLAKELAETEQKLAQLTAELNTARAAHDRAAENLTAAQNH 413 >UniRef50_B3M4N1 GF24494 n=6 Tax=Eukaryota RepID=B3M4N1_DROAN Length = 1254 Score = 61.5 bits (147), Expect = 2e-07, Method: Composition-based stats. Identities = 57/402 (14%), Positives = 128/402 (31%), Gaps = 8/402 (1%) Query: 78 YEDSQPGTLNDFLGAMTEDDARPEALRRFELMVEEVARNASAVAQNTAAAKKSASDASTS 137 S P + + + PE + + ++ + SA D +T Sbjct: 134 ESTSSPPKETTTVDPIESTTSPPEESTIGDPEESTTSAPDDTTTEDPDESTTSAPDETT- 192 Query: 138 AREAATHAADAADSARAASTSAGQAASSAQSASSSAGTASTKATEASKSAAAAESSKSAA 197 A + A D A ++ ++ + ++T A E + + ES+ SA Sbjct: 193 AEDPDESTTSAPDETTAVDPDESTTSAPEETTTEDPDDSTTFAPEETTTEDPDESTTSAP 252 Query: 198 ATSAGAAKTSETNASASLQSAATSASTATTKASEAATSARDAAASKEAAKSSETNASSSA 257 + T + + + + T+ ++ T D + + +++ + S Sbjct: 253 EETTTEDPNESTTSPSEESTTGEPDESTTSAPNDTTTEDPDESTTSAPDETTAEDPDEST 312 Query: 258 SSAASSATAAGNSAKAAKTSETNARSSETAAGQSASAAAGSKTAAASSASAASTSAGQAS 317 +SA T + TA AS + A ++TSA + + Sbjct: 313 TSAPEETTTEDPDESTTSAPDET-----TAEDPDASTTSAPDGTTAEDPDESTTSAPEET 367 Query: 318 ASATAAGKSAESAASSASTATTKAGEATEQASAAARSASAAKTSETNAKASETSAESSKT 377 + + ++A +T T T+ + + + S T Sbjct: 368 TTEDP--DESTTSAPEETTTEDPDESTTSAPDETTAEDPDESTTSAPEETTTEDPDESTT 425 Query: 378 
AAASSASSAASSASSASASKDEATRQASAAKSSATTASTKATEAAGSATAAAQSKSTAES 437 +A ++ S+ SA D T + +SA +T + +A ++ + Sbjct: 426 SAPEETTTEDPDESTTSAPNDTTTEDPDDSTTSAPDETTAEDPDESTTSAPEETTTEDTD 485 Query: 438 AATRAETAAKRAEDIASAVALEDASTTKKGIVQLSSATNSTS 479 +T + AED + TT + + +++ + Sbjct: 486 ESTTSAPDETTAEDPDESTTSAPEETTTEDPDESTTSAPDET 527 >UniRef50_D0MQM2 Putative uncharacterized protein n=1 Tax=Phytophthora infestans T30-4 RepID=D0MQM2_PHYIN Length = 728 Score = 61.1 bits (146), Expect = 3e-07, Method: Composition-based stats. Identities = 62/328 (18%), Positives = 128/328 (39%), Gaps = 3/328 (0%) Query: 113 VARNASAVAQNTAAAKKSASDASTSAREAATHAADAADSARAASTSAGQAASSAQSASSS 172 ++ A + +A + A+TS ++A + A+ S A + A + + S Sbjct: 209 ANSALASAAGSGSAEAPTKQVAATSTEDSALGSDVASGSGGAPIKKKKKKAIAIALSDGS 268 Query: 173 AGTASTKATEASKSAAAAESSKSAAATSAGAAKTSETNASASLQSAATSASTATTKASEA 232 A A + S A + K+AA + A ++ T+AS S + TA + Sbjct: 269 ADAELASADASGSSDAPIKKKKAAAPSEDSDAASTLTDASGSAEPPINKKKTAVASDEGS 328 Query: 233 ATSARDAAASKEAAKSSETNASSSASSAASSATAAGNSAKAAKTSETNARSSETAAGQSA 292 A + + + S + A++ ++ S S + S+ET + Sbjct: 329 ALESGEDSTRASGDASGSSGTPIKKKKKATAVASSDGSVDPELASAEASGSAETPTKKKK 388 Query: 293 SAAAGSKT---AAASSASAASTSAGQASASATAAGKSAESAASSASTATTKAGEATEQAS 349 +AA+ + +A +AS + ++G A + S SA +T + A++ AS Sbjct: 389 AAASSKDSGGGSAEDAASTLTDTSGSAEPPTKKEKTAVASDDGSAIGSTEDSALASDDAS 448 Query: 350 AAARSASAAKTSETNAKASETSAESSKTAAASSASSAASSASSASASKDEATRQASAAKS 409 +A + + K + T++ S + ++ A+ +ASS S+ +S ++T + Sbjct: 449 GSAEAVTNKKPAVTSSDGSVADSAATSLEASLAASSEGSADKPTQSSAVDSTAGSGPGSD 508 Query: 410 SATTASTKATEAAGSATAAAQSKSTAES 437 + A A A +++ S Sbjct: 509 KGSAAEAPAGSGVAQVAPPAGTEAPGMS 536 Score = 59.5 bits (142), Expect = 8e-07, Method: Composition-based stats. 
Identities = 65/363 (17%), Positives = 124/363 (34%), Gaps = 5/363 (1%) Query: 169 ASSSAGTASTKATEASKSAAAAESSKSAAATSAGAAKTSETNASASLQSAATSASTATTK 228 + A++ A+ S +A +K AATS + AS S + Sbjct: 202 SGEPVTPANSALASAAGSGSAEAPTKQVAATSTEDSALGSDVASGSGGAPIKKKKKKAIA 261 Query: 229 ASEAATSARDAAASKEAAKSSETNASSSASSAASSATAAGNSAKAAKTSETNARSSETAA 288 + + SA AS +A+ SS+ ++A S + A ++ A S + + A Sbjct: 262 IALSDGSADAELASADASGSSDAPIKKKKAAAPSEDSDAASTLTDASGSAEPPINKKKTA 321 Query: 289 GQSASAAAGSKTAAASSASAASTSAGQASASATAAGKSAESAASSASTATTKAGEATEQA 348 S GS + ++ AS A +S + K A + ASS + + A Sbjct: 322 VASDE---GSALESGEDSTRASGDASGSSGTPIKKKKKATAVASSDGSVDPELASAEASG 378 Query: 349 SAAARSASAAKTSETNAKASETSAESSKTAAASSASSAASSASSASASKDEATRQASAAK 408 SA + + + SAE + + ++ SA + A + Sbjct: 379 SAETPTKKKKAAASSKDSGG-GSAEDAASTLTDTSGSAEPPTKKEKTAVASDDGSAIGST 437 Query: 409 SSATTASTKATEAAGSATAAAQSKSTAESAATRAETAAKRAEDIASAVALEDASTTKKGI 468 + AS A+ +A + T + ++++ + + + A AS+ D T + Sbjct: 438 EDSALASDDASGSAEAVTNKKPAVTSSDGSVADSAATSLEASLAASSEGSADKPTQSSAV 497 Query: 469 VQLSSATNSTSETLAATPKAVKSAYDNAEKRLQKDQNGADIPDKGCFLNNINAVSKTDFA 528 + + + + AA S + G + D + DF+ Sbjct: 498 DSTAGSGPGSDKGSAAEA-PAGSGVAQVAPPAGTEAPGMSVKDSIVLSESFGGPHGRDFS 556 Query: 529 DKR 531 DK Sbjct: 557 DKN 559 >UniRef50_C1YUA7 Outer membrane protein/peptidoglycan-associated (Lipo)protein n=1 Tax=Nocardiopsis dassonvillei subsp. dassonvillei DSM 43111 RepID=C1YUA7_NOCDA Length = 1124 Score = 61.1 bits (146), Expect = 3e-07, Method: Composition-based stats. 
Identities = 57/393 (14%), Positives = 127/393 (32%), Gaps = 11/393 (2%) Query: 96 DDARPEALRRFELMVEEVARNASAVAQNTAAAKKSASDASTSAREAATHAADAADSARAA 155 +A P+ RF EV+ + + + + +++T +A ++ A S +A Sbjct: 711 SEAEPDDTARFGA-ASEVSTRPDTRSDGSVIGRPAVEESTTRPSSSAPSVSEEAVSGSSA 769 Query: 156 STSAGQAASSAQSASSSAGTASTKATEASKSAAAAESSKSAAATSAGAAKTSE-TNASAS 214 A + A ++ A S A + + T G ++++ T ++ + Sbjct: 770 DAEP--AGTGADRSAPEPSAAEEAEPRPSAFEQLAAEEQVSRETVTGPSESARPTASAPA 827 Query: 215 LQSAATSASTATTKASEAATSARDAAASKEAAKSSETNASSSASSAASSATAAGNSAKAA 274 + AA S ASE + A + + + S ++ S+ A SAT + + Sbjct: 828 PEKAAAEESAPEQSASERSAREEGAPVASPSERPSAEQYAAELSAPAPSATREQRAPEET 887 Query: 275 KTSETNARSSETAAGQSASAAAGSKTAA-------ASSASAASTSAGQASASATAAGKSA 327 + R + +A + + K A ++ A + A + Sbjct: 888 ASEPAEPRENASAQVTAPARTEQPKENAPAEPPAQRPTSEEAVPARPAAERREPEPEPAR 947 Query: 328 ESAASSASTATTKAGEATEQASAAARSASAAKTSETNAKASETSAESSKTAAASSASSAA 387 + ++ + +A ++ A + A ET+ A + Sbjct: 948 PAPGRPSTGTVPSVVASPARAQRPSQQTRTAPAQQETASEEETAPVRKAAKRAVRSRLPK 1007 Query: 388 SSASSASASKDEATRQASAAKSSATTASTKATEAAGSATAAAQSKSTAESAATRAETAAK 447 S ++++ + + +A A A A + S AE +RAE + Sbjct: 1008 PSGTASAPAAQASPSAPAARSEQLPLPDQPARPERARARAQNAAASGAEPLTSRAERMER 1067 Query: 448 RAEDIASAVALEDASTTKKGIVQLSSATNSTSE 480 R + + + + S+ Sbjct: 1068 RTRAERRNATAPAGAVAEHPVPDTEGGESDGSQ 1100 >UniRef50_Q6CPZ4 KLLA0E01035p n=2 Tax=Saccharomycetaceae RepID=Q6CPZ4_KLULA Length = 1878 Score = 60.3 bits (144), Expect = 4e-07, Method: Composition-based stats. 
Identities = 63/357 (17%), Positives = 156/357 (43%), Gaps = 6/357 (1%) Query: 135 STSAREAATHAADAADSARAASTSAGQAASSAQSASSSAGTASTKATEASKSAAAAESSK 194 ST + AT S A + S+ A++S ++ +TEA+ S A + S Sbjct: 803 STESSTEATSTESFTSSTTTADPQEQTSTESSTEATTSDVISTESSTEATTSEATSTESS 862 Query: 195 SAAATSAGAAKTSETNASASLQSAATSASTATTKASEAATSARDAAASKEAAKSSETNAS 254 + A TS + S T A+++ +++ + + + +S + + +SS + Sbjct: 863 TEATTSDVISTESSTEATSTESFTSSTTTADPQEQTSTESSTEAITSEATSTESSTEAIT 922 Query: 255 SSASSAASSATAAGNSAKAAKTSETNARSSETAAG------QSASAAAGSKTAAASSASA 308 S A+S +++T + A ++ + T + + T + + + + S T +++ A++ Sbjct: 923 SEATSTEATSTESSTEATTSEATSTESSTEATTSDVISTESSTEATSTESSTESSTEATS 982 Query: 309 ASTSAGQASASATAAGKSAESAASSASTATTKAGEATEQASAAARSASAAKTSETNAKAS 368 +S ++ + S E+ ++ + T++T + EQ S + + + + + ++ Sbjct: 983 TESSTEATTSDVISTESSTEATSTESFTSSTTTADPQEQTSTESSTEATTSEATSTESST 1042 Query: 369 ETSAESSKTAAASSASSAASSASSASASKDEATRQASAAKSSATTASTKATEAAGSATAA 428 E + + + +S+ +++ S++S++ + D + ++ + + ATT+ +TE++ A + Sbjct: 1043 EATTSEATSTESSTEATSTESSTSSTTTADPQEQTSTESSTEATTSEATSTESSTEAITS 1102 Query: 429 AQSKSTAESAATRAETAAKRAEDIASAVALEDASTTKKGIVQLSSATNSTSETLAAT 485 + ST S + A + + S + T++ S T A T Sbjct: 1103 SDVTSTESSTEATSTEATSTESSTEAITSEATTSEATSTESSTEATTSTESSTEATT 1159 Score = 56.8 bits (135), Expect = 5e-06, Method: Composition-based stats. 
Identities = 77/346 (22%), Positives = 165/346 (47%), Gaps = 9/346 (2%) Query: 116 NASAVAQNTAAAKKSASDASTSAREAATHAADAADSARAASTSAGQAASSAQSASSSAGT 175 ++ + + S S +T+ + T + ++ + S + SS +A+ T Sbjct: 614 TSTESPTEATSTESSTSSTTTADPQEQTSTESSTEATTSDVISTESSTSSTTTANPQEQT 673 Query: 176 ASTKATEASKSAAAAESSKSAAATSAGAAKTSETNA------SASLQSAATSASTATTKA 229 ++ +TEA+ S A + S + A TS + S T A ++S +A T+T + Sbjct: 674 STESSTEATTSEATSTESSTEATTSDVISTESSTEATSTESFTSSTTTADPQEQTSTESS 733 Query: 230 SEAATSARDAAASKEAAKSSETNASSSASSAASSATAAGNSAKAAKTSETNARSSETAAG 289 +EA TS + S A +SE ++ S++ A S+ ++ ++ A +T+ SS A Sbjct: 734 TEAITSEATSTESSTEATTSEATSTESSTEATSTESSTSSTTTADPQEQTSTESSTEATT 793 Query: 290 QSASAAAGSKTAAASSA---SAASTSAGQASASATAAGKSAESAASSASTATTKAGEATE 346 A+ + T +++ A + ++S A + +S+ A +S +T + EAT Sbjct: 794 SEATTSDVISTESSTEATSTESFTSSTTTADPQEQTSTESSTEATTSDVISTESSTEATT 853 Query: 347 QASAAARSASAAKTSETNAKASETSAESSKTAAASSASSAASSASSASASKDEATRQASA 406 + + S++ A TS+ + S T A S+++ +S+ ++ +S +S + T +A++ Sbjct: 854 SEATSTESSTEATTSDVISTESSTEATSTESFTSSTTTADPQEQTSTESSTEAITSEATS 913 Query: 407 AKSSATTASTKATEAAGSATAAAQSKSTAESAATRAETAAKRAEDI 452 +SS +++AT ++T ++ +T+E+ +T + T A ++ I Sbjct: 914 TESSTEAITSEATSTEATSTESSTEATTSEATSTESSTEATTSDVI 959 Score = 54.9 bits (130), Expect = 2e-05, Method: Composition-based stats. 
Identities = 63/315 (20%), Positives = 151/315 (47%), Gaps = 2/315 (0%) Query: 94 TEDDARPEALRRFELMVEEVARNASAVAQNTAAAKKSASDASTSAREAATHAADAADSAR 153 T ++ EA + E + + + +S+++A+TS + + +A S Sbjct: 641 TSTESSTEATTSDVISTESSTSSTTTANPQEQTSTESSTEATTSEATSTESSTEATTSDV 700 Query: 154 AASTSAGQA--ASSAQSASSSAGTASTKATEASKSAAAAESSKSAAATSAGAAKTSETNA 211 ++ S+ +A S S++++A +TE+S A +E++ + ++T A ++ + T + Sbjct: 701 ISTESSTEATSTESFTSSTTTADPQEQTSTESSTEAITSEATSTESSTEATTSEATSTES 760 Query: 212 SASLQSAATSASTATTKASEAATSARDAAASKEAAKSSETNASSSASSAASSATAAGNSA 271 S S +S S+ TT + TS + + + ++ S+ +S+ A+S + +S Sbjct: 761 STEATSTESSTSSTTTADPQEQTSTESSTEATTSEATTSDVISTESSTEATSTESFTSST 820 Query: 272 KAAKTSETNARSSETAAGQSASAAAGSKTAAASSASAASTSAGQASASATAAGKSAESAA 331 A E + S T A S + S T A +S + ++ S+ +A+ S + +S+ A Sbjct: 821 TTADPQEQTSTESSTEATTSDVISTESSTEATTSEATSTESSTEATTSDVISTESSTEAT 880 Query: 332 SSASTATTKAGEATEQASAAARSASAAKTSETNAKASETSAESSKTAAASSASSAASSAS 391 S+ S ++ ++ ++ S A + T+ ++S + S T+ ++++ +++ A+ Sbjct: 881 STESFTSSTTTADPQEQTSTESSTEAITSEATSTESSTEAITSEATSTEATSTESSTEAT 940 Query: 392 SASASKDEATRQASA 406 ++ A+ E++ +A+ Sbjct: 941 TSEATSTESSTEATT 955 Score = 53.4 bits (126), Expect = 5e-05, Method: Composition-based stats. 
Identities = 92/451 (20%), Positives = 184/451 (40%), Gaps = 33/451 (7%) Query: 99 RPEALRRFELMVEEVARNASAVAQ-NTAAAKKSASDASTSAREAATHAADAADSARAAST 157 EA+ +++ E + A++ +T A + A + + + + + + Sbjct: 515 STEAITSSDVISTESSTEATSTDVTSTEAITSDVTSAEVPFDGSLSFSRTSVPEEQTPTE 574 Query: 158 SAGQAASSAQSASSSAGTASTKATEASKSAAAAESSKSAAATSAGAAKTS---------- 207 S+ +A +S+ S+ +TEA+ S A + S + A TS + S Sbjct: 575 SSTEAITSSDVTSTE------SSTEATTSEATSTESSTEATTSDVTSTESPTEATSTESS 628 Query: 208 ---ETNASASLQSAATSASTATTK-----------ASEAATSARDAAASKEAAKSSETNA 253 T A Q++ S++ ATT + A + + S A +SE + Sbjct: 629 TSSTTTADPQEQTSTESSTEATTSDVISTESSTSSTTTANPQEQTSTESSTEATTSEATS 688 Query: 254 SSSASSAASSATAAGNSAKAAKTSETNARSSETAAGQSASAAAGSKTAAASSASAASTSA 313 + S++ A +S + S+ A ++E+ S+ TA Q ++ S A S A++ +S Sbjct: 689 TESSTEATTSDVISTESSTEATSTESFTSSTTTADPQEQTSTESSTEAITSEATSTESST 748 Query: 314 GQASASATAAGKSAESAASSASTATTKAGEATEQASAAARSASAAKTSETNAKASETSAE 373 ++ AT+ S E+ ++ +ST++T + EQ S + + + + T+ S S+ Sbjct: 749 EATTSEATSTESSTEATSTESSTSSTTTADPQEQTSTESSTEATTSEATTSDVISTESST 808 Query: 374 SS--KTAAASSASSAASSASSASASKDEATRQASAAKSSATTASTKATEAAGSATAAAQS 431 + + SS ++A +++ S EAT + S+T A+T + S+T A S Sbjct: 809 EATSTESFTSSTTTADPQEQTSTESSTEATTSDVISTESSTEATTSEATSTESSTEATTS 868 Query: 432 KSTAESAATRAETAAKRAEDIASAVALEDASTTKKGIVQLSSATNSTSETLAATPKAVKS 491 + ++T A + +A E ST S AT++ S T A T +A + Sbjct: 869 DVISTESSTEATSTESFTSSTTTADPQEQTSTESSTEAITSEATSTESSTEAITSEATST 928 Query: 492 AYDNAEKRLQKDQNGADIPDKGCFLNNINAV 522 + E + + A + + + Sbjct: 929 EATSTESSTEATTSEATSTESSTEATTSDVI 959 Score = 50.3 bits (118), Expect = 4e-04, Method: Composition-based stats. 
Identities = 76/348 (21%), Positives = 159/348 (45%), Gaps = 2/348 (0%) Query: 113 VARNASAVAQNTAAAKKSASDASTSAREAATHAADAADSARAASTSAGQAASSAQSASSS 172 ++ +S A +T + S + A + + + +A S ++ S+ +A +S A+S+ Sbjct: 871 ISTESSTEATSTESFTSSTTTADPQEQTSTESSTEAITSEATSTESSTEAITS--EATST 928 Query: 173 AGTASTKATEASKSAAAAESSKSAAATSAGAAKTSETNASASLQSAATSASTATTKASEA 232 T++ +TEA+ S A + S + A TS + S T A+++ S +S +T++S Sbjct: 929 EATSTESSTEATTSEATSTESSTEATTSDVISTESSTEATSTESSTESSTEATSTESSTE 988 Query: 233 ATSARDAAASKEAAKSSETNASSSASSAASSATAAGNSAKAAKTSETNARSSETAAGQSA 292 AT++ + +S + +SS ++A + S+ A TSE + S T A S Sbjct: 989 ATTSDVISTESSTEATSTESFTSSTTTADPQEQTSTESSTEATTSEATSTESSTEATTSE 1048 Query: 293 SAAAGSKTAAASSASAASTSAGQASASATAAGKSAESAASSASTATTKAGEATEQASAAA 352 + + S T A S+ S+ S++ T+ S E+ S A++ + T + Sbjct: 1049 ATSTESSTEATSTESSTSSTTTADPQEQTSTESSTEATTSEATSTESSTEAITSSDVTST 1108 Query: 353 RSASAAKTSETNAKASETSAESSKTAAASSASSAASSASSASASKDEATRQASAAKSSAT 412 S++ A ++E + S T A +S+ + + S+ +S+ ++ S + + ++ Sbjct: 1109 ESSTEATSTEATSTESSTEAITSEATTSEATSTESSTEATTSTESSTEATTSDVISTESS 1168 Query: 413 TASTKATEAAGSATAAAQSKSTAESAATRAETAAKRAEDIASAVALED 460 T S+ + S+T A S + ++T A T+ + + ++ D Sbjct: 1169 TESSTEATSTESSTEATTSDVISTESSTEATTSEATSTESSTEATTSD 1216 Score = 48.0 bits (112), Expect = 0.002, Method: Composition-based stats. 
Identities = 72/365 (19%), Positives = 164/365 (44%), Gaps = 9/365 (2%) Query: 137 SAREAATHAADAADSARAASTSAGQAASSAQSASSSAGTASTKATEASKSAAAAESSKSA 196 + +T ++ A ++ A ST + A ++ +S+ + +TEA+ + ++ E+ S Sbjct: 1073 PQEQTSTESSTEATTSEATSTESSTEAITSSDVTSTESSTEATSTEATSTESSTEAITSE 1132 Query: 197 AATSAGAAKTSETNASASLQSAATSAST---------ATTKASEAATSARDAAASKEAAK 247 A TS + S T A+ S +S+ + ++ ++ + + S+ +A S + Sbjct: 1133 ATTSEATSTESSTEATTSTESSTEATTSDVISTESSTESSTEATSTESSTEATTSDVIST 1192 Query: 248 SSETNASSSASSAASSATAAGNSAKAAKTSETNARSSETAAGQSASAAAGSKTAAASSAS 307 S T A++S +++ S+T A S + S T + + T+ S A + SS Sbjct: 1193 ESSTEATTSEATSTESSTEATTSDVISTESSTESSTEATSTESSTEATTSDVISTESSTE 1252 Query: 308 AASTSAGQASASATAAGKSAESAASSASTATTKAGEATEQASAAARSASAAKTSETNAKA 367 A ST + S++ + +S+ A +S +T + EA + + +S TS + + Sbjct: 1253 ATSTESSTESSTEATSTESSTEATTSDVISTESSTEAITSSDVTSTESSTEATSTESFTS 1312 Query: 368 SETSAESSKTAAASSASSAASSASSASASKDEATRQASAAKSSATTASTKATEAAGSATA 427 S T+A+ + + S+S S+ SS + T ++S ++S + + + Sbjct: 1313 STTTADPQEQTSTESSSEVTSTGSSTVITVSSTTILEPEERTSTESSSEVTSTGSSTVLT 1372 Query: 428 AAQSKSTAESAATRAETAAKRAEDIASAVALEDASTTKKGIVQLSSATNSTSETLAATPK 487 A+ + T E++++ +S V + ++T + + S+ +++ + + + + Sbjct: 1373 ASSTTILEPEERTSTESSSEVTSTGSSTVIIASSTTILEPEERTSTESSTEATSTESPIE 1432 Query: 488 AVKSA 492 A KS+ Sbjct: 1433 ATKSS 1437 Score = 44.1 bits (102), Expect = 0.028, Method: Composition-based stats. 
Identities = 74/384 (19%), Positives = 174/384 (45%), Gaps = 8/384 (2%) Query: 109 MVEEVARNASAVAQNTAAAKKSASDASTSAREAATHAADAADSARAAST--SAGQAASSA 166 + +++ + A + +S+++A TS+ +T ++ A S A ST S S A Sbjct: 1074 QEQTSTESSTEATTSEATSTESSTEAITSSDVTSTESSTEATSTEATSTESSTEAITSEA 1133 Query: 167 QSASSSAGTASTKATEASKSAAAAESSKSAAATSAGAAKTSETNASASLQSAATSASTAT 226 ++ +++ +ST+AT +++S+ A +S + S+ + T T+ +S ++ + +T Sbjct: 1134 TTSEATSTESSTEATTSTESSTEATTSDVISTESSTESSTEATSTESSTEATTSD-VIST 1192 Query: 227 TKASEAATSARDAAASKEAAKSSETNASSSASSAASSATAAGNSAKAAKTSETNARSSET 286 ++EA TS + S A +S+ ++ S++ +++ AT+ +S +A + + SS Sbjct: 1193 ESSTEATTSEATSTESSTEATTSDVISTESSTESSTEATSTESSTEATTSDVISTESSTE 1252 Query: 287 AAGQSASAAAGSKTAAASSASAASTSAGQASASATAAGKSAESAASSASTATTKAGEATE 346 A +S + ++ + S++ A+TS ++ S+T A S++ ++ +ST T T Sbjct: 1253 ATSTESSTESSTEATSTESSTEATTSDVISTESSTEAITSSDVTSTESSTEATSTESFTS 1312 Query: 347 QASAAARSAS-----AAKTSETNAKASETSAESSKTAAASSASSAASSASSASASKDEAT 401 + A +++ + T + T + ++ S+ +SS +++ S T Sbjct: 1313 STTTADPQEQTSTESSSEVTSTGSSTVITVSSTTILEPEERTSTESSSEVTSTGSSTVLT 1372 Query: 402 RQASAAKSSATTASTKATEAAGSATAAAQSKSTAESAATRAETAAKRAEDIASAVALEDA 461 ++ ST+++ S ++ +++ + E + + A++ Sbjct: 1373 ASSTTILEPEERTSTESSSEVTSTGSSTVIIASSTTILEPEERTSTESSTEATSTESPIE 1432 Query: 462 STTKKGIVQLSSATNSTSETLAAT 485 +T I+ S+ ST E T Sbjct: 1433 ATKSSDIISTQSSNTSTEEDNYYT 1456 >UniRef50_Q9L2C3 Large Ala/Glu-rich protein n=17 Tax=Streptomyces RepID=Q9L2C3_STRCO Length = 1326 Score = 60.3 bits (144), Expect = 4e-07, Method: Composition-based stats. 
Identities = 82/396 (20%), Positives = 129/396 (32%), Gaps = 20/396 (5%) Query: 119 AVAQNTAAAKKSASDASTSAREAATHA-ADAADSARAASTSAGQAASSAQSASSSAGTAS 177 Q ++ + + SAR A A+A A A + S+A+ A Sbjct: 726 EADQERERVREQSEELLASARNRVEEAQAEAVRLVEEADRRATEMVSAAEQ-----HAAQ 780 Query: 178 TKATEASKSAAAAESSKSAAATSAGAAKTSETNASASLQSAATSASTATTKASEAATSAR 237 + + A A E + + AA+ + T A A +ASE A R Sbjct: 781 VRESVAGLHEQAQEEITGLRSAAEHAAERTRTEAQEEADRVRADAYAERERASEDAGRLR 840 Query: 238 DAAASKEAAKSSETNASSSASSAASSATAAGNSAKAAKTSETNARSSETAAGQSASAAAG 297 A + A + + S + + + S A+ T A + A QSAS Sbjct: 841 REAQEETEAAKALAERTVSEAITEADRIRSDVSEH-AQRVRTEASDAIAEAEQSASRTRA 899 Query: 298 SKTAAASSASAASTSAGQASASATAAGKSAESAASSASTATTKAGEATEQASAAARSASA 357 A+ S +A QA T A AE + + A T A A +A A Sbjct: 900 DAREDANR--IRSDAATQADTLITEARSEAERLTTE-TAAETDRIRTQTLAEAERVTAEA 956 Query: 358 AKTSETNAKASETSAESSKTAAAS-----SASSAASSASSASASKDEATRQASAAKSSAT 412 A SE + T AE +T + A + A + S + EA R + A + Sbjct: 957 ASESERVRTEAATEAERLRTETIAEADRVRAEAGARAEQLVSDATGEAERLRAEAADTVG 1016 Query: 413 TASTKATEAAGSATAAAQSKSTAESAATRAETAAKRAEDIASAVALEDASTTKKGIVQLS 472 +A A A + + T A E+ + K+ Sbjct: 1017 SAQQHAERLRTEADRVRREAAAEAERVTTA-----AREEAERTLDEARKDANKRRSEAAE 1071 Query: 473 SATNSTSETLAATPKAVKSAYDNAEKRLQKDQNGAD 508 +ET A K + A A+K ++ AD Sbjct: 1072 QVDTLITETAAEADKLLTEAQQQAQKTTADAESQAD 1107 >UniRef50_Q1D010 Putative uncharacterized protein n=2 Tax=cellular organisms RepID=Q1D010_MYXXD Length = 2138 Score = 59.9 bits (143), Expect = 6e-07, Method: Composition-based stats. 
Identities = 85/431 (19%), Positives = 159/431 (36%), Gaps = 28/431 (6%) Query: 99 RPEALRRFELMVEEVARNASAVAQNTAAAKKSASDASTSAREAATHAADAADSARAASTS 158 +PE E E + A A + A+ + A A ++ Sbjct: 1194 QPEWATSAEQAHTEAQAQWTETAAQAALGSEQTEWAANTPEAAQADWASSSTETAPGEIQ 1253 Query: 159 AGQAASSAQSASSSAGTASTKATEASKSAAAAESSKSAAATSAGAAKTSETNASA----- 213 A A A+ + TA ++ +A+ + A A +++ S T A Sbjct: 1254 AEWAEPIAEDGQAQWATAEAPPSDGWNTASTEAAPVEVQAEWAESSEASPTEVQADGSTP 1313 Query: 214 -----------------SLQSAATSASTATTKASEAATSARDAAASKEAAKSSETNASSS 256 A +A A T+A + A A+ +A T + + Sbjct: 1314 TLDAAPVEVQAEWDDPAQQDGATLTAEPAATEAQAEWAAPEIADATTDAQAGWTTPGADT 1373 Query: 257 ASSAASSATAAGNSAKAAKTSETNARSSETAAGQSASAAAGSKTAAASSASAASTSAGQA 316 A S ++++ G + +E A ++ETAA + ++ A A+ + +AGQ Sbjct: 1374 AQSEWTASSENGWATAEEVQTEWTAPATETAAQPEWATTPAAEAARTDWAAPGTENAGQP 1433 Query: 317 SASATAAGKSAESAASSASTATTKAGEATEQASAAARSASAAKTSETNAKASETSAESSK 376 AT A ++ + A+ E QA A +A AA +SE + A+E + Sbjct: 1434 E-WATPATEAVPGEGAQPEWASPATEEV--QAEWATPAAEAAASSEWASPAAEGAQAEWA 1490 Query: 377 TAAASSASSAASSASSASASKDEATRQASAAKSSATTASTKATEAAGS-ATAAAQSKSTA 435 T A +A+S+ ++ + + E A+ +S+ AS E G AT A ++ +++ Sbjct: 1491 TPATETAASSEWASPATEEVQAEWATPAAETAASSEWASPAVEEVQGEWATPATEAAASS 1550 Query: 436 ESAATRAETAAKRAEDIASAVALEDASTTKKGIVQLSSATNSTSETLA--ATPKAVKSAY 493 + A+ AE A A+ A + Q A+ + E A+P ++ Sbjct: 1551 QWASPAAEGAQSEWATPATEAAQAGWAAAPVEAAQPEWASPAAEEAQGEWASPATEEAQG 1610 Query: 494 DNAEKRLQKDQ 504 + A ++ Q Sbjct: 1611 EWANPAAEEAQ 1621 >UniRef50_C5QRU2 Triblock protein copolymer TR8T n=1 Tax=Staphylococcus epidermidis M23864:W1 RepID=C5QRU2_STAEP Length = 752 Score = 59.5 bits (142), Expect = 6e-07, Method: Composition-based stats. 
Identities = 69/407 (16%), Positives = 120/407 (29%), Gaps = 18/407 (4%) Query: 125 AAAKKSASDASTSAREAATHAADAADSARAASTSAGQAASSAQSASSSAGTASTKATEAS 184 A+++A + +A+ A +A + A Q A++ Q ++A A TK E Sbjct: 122 NQAEQNAGTPQNNQADASVTPAQSAGQQGQGTPGADQNATTGQQTGTAAQGADTKPGENG 181 Query: 185 K-SAAAAESSKSAAATSAGAAKTSETNASASLQSAATSASTATTKASEA------ATSAR 237 +A + A T T A + +A T AS+ Sbjct: 182 AGTAVTPAPTTGNNNQQDQAGATGGTPAKPDTTAGQDTAKPGNTDASQTPGNNQQGQDTT 241 Query: 238 DAAASKEAAKSSETNASSSASSAASSATAAGNSAKAAKTSETNARSSETAAGQSASAAAG 297 A ASK A + TNA + + N+ A T+ T A + T AGQ A+ Sbjct: 242 QAGASKPDASTPNTNAEAKPTPGTQGQQTDQNNQAGAGTTVTPAPENNTQAGQGATGTPA 301 Query: 298 SKTAAAS----SASAASTSAGQASASATAAGKSAESAASSASTATTKAGEATEQASAAAR 353 T A +A + A + T G + ++ + A Sbjct: 302 QGTEAKPGNDQNAGTGQQTGTPAQGTETKPGNDQQGQGATGTGTPENTKPGENGAGTTVT 361 Query: 354 SASAAKTSETNAKASETSAESSKTAAASSASSAASSASSASASKDEATRQASAAKSSATT 413 +T E + +A + + + + + T + A T Sbjct: 362 PTPGENAGQTKPNGQENQEKPGAPETDQNAGTGQQTGTPDQNNAGDTTVKPPTAPDQGTE 421 Query: 414 ASTKATEAAGSATAAAQSKSTAESAATRAETAAKRAEDIASAVALEDASTTKKGIVQLSS 473 A T+ AT Q+ + A + + + T Sbjct: 422 AK-PDTQTKPDATTPDQNGTGAPGQDAGTKPGTTPDQTPGNTEQQGQGGTQTGTTTPQPG 480 Query: 474 ATNSTSETLAATPKAVKSAYDNAEK------RLQKDQNGADIPDKGC 514 T + A + +N + + DQ G PD Sbjct: 481 TTPDQNNQAGAGTTVTPAPENNTQPGQGTETKPGTDQQGQGAPDNTK 527 >UniRef50_Q08SC3 Adventurous gliding protein Z n=1 Tax=Stigmatella aurantiaca DW4/3-1 RepID=Q08SC3_STIAU Length = 1402 Score = 59.5 bits (142), Expect = 8e-07, Method: Composition-based stats. 
Identities = 77/451 (17%), Positives = 139/451 (30%), Gaps = 50/451 (11%) Query: 101 EALRRFELMVEEVARNASAVAQNTAAAKKSASDASTSAREAATHAAD---------AADS 151 E +R+ + E+ + Q S A A H A A + Sbjct: 359 ERDQRYAELEGEIQALQERLQQTEQERDTSVRALEQRALAAEDHGAQSDAEIERLKAERA 418 Query: 152 ARAASTSAGQAASSAQSASSSAGTASTKATEASKSAAAAESSKSAAATSAGAAKTSETNA 211 A A + A A A + K + ++ + A A + Sbjct: 419 ALEARLNQQIAELEADVARTIGERDQLKQEKDAQEEELTQQLNERDAKLATLERELADTI 478 Query: 212 SASLQSAATSASTATTKASEAAT------SARDAAASKEAAKSSETNASSSASSAASSAT 265 + + A ST + + A +EA +SE A A A + Sbjct: 479 ARNEHHEAELNSTIQQHLERIGELEGEVEALKAHLADREAELTSELEALGQAKDALETDL 538 Query: 266 AAGNSAKAAKTSETNARSS---ETAAGQSASAAAGSKTAAASSASAASTSAGQASASATA 322 A + AR ET A + ++ + A A + + S T Sbjct: 539 TGQLEASQRTGEQLQARIVSLDETIAQRDSTIEGLQTDVSDRDAKIADLTGNLEATSQTL 598 Query: 323 AGKSAESAASSASTATTKAGEATEQASAAARSASAAKTSETNAKA--------------- 367 A A A + + +TT+ + + A+ +T ET + Sbjct: 599 AETQATLATTEETLSTTRGELEATSQTLSETQATLTRTEETLSTTRGELEATSQTLSETQ 658 Query: 368 ---SETSAESSKTAAASSASSAASSASSASASKDEATRQAS----------AAKSSATTA 414 + T S T A+S S + + ++ E T ++ AK+ AT A Sbjct: 659 TTLATTEETLSTTRGELEATSQTLSETQTTLARTEETLTSTRGELEATSEALAKTQATLA 718 Query: 415 STKATEAAG----SATAAAQSKSTAESAATRAETAAKRAEDIASAVALEDASTTKKGIVQ 470 +T+ T + AT+ S++ A T + R E A++ L T + + Sbjct: 719 TTEETLSTTRGELEATSQTLSETQTTLATTEETLSTTRGELEATSETLAKTQATLQQTLA 778 Query: 471 LSSATNSTSETLAATPKAVKSAYDNAEKRLQ 501 + T + + L+ +S + L Sbjct: 779 ELAQTTTIRDELSVELDDARSTLEYTRSELA 809 Score = 49.1 bits (115), Expect = 0.001, Method: Composition-based stats. 
Identities = 53/315 (16%), Positives = 98/315 (31%), Gaps = 11/315 (3%) Query: 74 TITVYEDSQPGTLNDFLGAMTEDDARPEALRR-FELMVEEVAR-NASAVAQNTAAAKKSA 131 ++ TL E + E L R E++ + + AQ A +++ Sbjct: 1012 DLSGKSQELSDTLRKLSSVTQEKQRQTEVLTREVTAKTEQIKQLESKLEAQTAEAKRQTD 1071 Query: 132 SDASTSAREAATHAADAADSARAASTSAGQAASSAQSASSSAGTASTKATEASKSAAAAE 191 + A+ +SA ++ A S AG + Sbjct: 1072 TLQQQVAQLGGELEGVRKESAEQLRAASNAQAKLTSERDSLAGQLQQSEARLQQQNQTQT 1131 Query: 192 SSKSAAATSAGA----AKTSETNASASLQSAATSASTATTKASEAATSARDAAASKEAAK 247 + ++ A +A +E + Q A+ A K EA T A + + Sbjct: 1132 NERAEAKRAADELTRKLAAAEARITQLTQEGQQRATEADAKLKEAQTQLTTRARKIQELE 1191 Query: 248 SSETNASSSASSAASSATAAGNSAKAAKTSETNARSSETAAGQSASAAAGSKT----AAA 303 + N+ S+ + A +A AK +ET A+ + + A K A Sbjct: 1192 LAVENSVSTKAR-LEKELTAKATAAEAKANETTAKLATLQRERKELEAKQLKELEDLNAK 1250 Query: 304 SSASAASTSAGQASASATAAGKSAESAASSASTATTKAGEATEQASAAARSASAAKTSET 363 A A +A E + + A ++ + AA A+A ++ Sbjct: 1251 QKAELERRDAIKAQEVTRLQQSVQEKSKALKVAELELARYKSKSPAPAAAPAAAKPGAKP 1310 Query: 364 NAKASETSAESSKTA 378 A +E + KT Sbjct: 1311 AAVGAEDEEAAVKTQ 1325 >UniRef50_Q9NDI9 Merozoite surface protein 3g n=1 Tax=Plasmodium vivax RepID=Q9NDI9_PLAVI Length = 969 Score = 59.2 bits (141), Expect = 9e-07, Method: Composition-based stats. 
Identities = 73/413 (17%), Positives = 133/413 (32%), Gaps = 18/413 (4%) Query: 112 EVARNASAVAQNTAAAKKSASDASTSAREAATHAADAADSARAASTSAGQAASSA----- 166 V A+ + A A+ AST A A+T A + A +A +A Sbjct: 376 NVTEEANKAKVASTKASTEATKASTEATNASTEATKPSSKAANVKKKTDEAIKAAKEAKK 435 Query: 167 -----------QSASSSAGTASTKATEASKSAAAAESSKSAAATSAGAAKTSETNASASL 215 A ++ A A A K+ A AE+ A+ + A+ + T A+ Sbjct: 436 AKTEAYIALFVTKAMAAKEKAKKSAEAADKAKAQAEAVNGASEKTKKDAEHAATKANEKK 495 Query: 216 QSAATSASTATTKASEAATSARDAAASKEAAKSSETNASSSASSAASSATAAGNSAKAAK 275 T+A A A + A ++E K + A S + A+ Sbjct: 496 THTETAADAAKKNAEVKVEEEDNVAKNEEKMKKKVDDVIEKVLEALKSEEDTYQAQIQAE 555 Query: 276 TSETNARSSETAAGQSASAAAGSKTAAASSASAASTSAGQASASATAAGKSAESAASSAS 335 + A E + K + +A + A + ++ + Sbjct: 556 IAVQVANVEEACEKAKTAEQEAKKAKDEAVKAAKEAEEAKKQAEKAEKITKTATEEANKA 615 Query: 336 TATTKAGEATEQASAAARSASAAKTSETNAKASETSAESSKTAAASSASSAASSASSASA 395 +Q + + N + A + A +A Sbjct: 616 KEEEAKASEAKQEAETKAGDVDEEVYAVNVEFESVKAAAKAAAHHKVPEILDKEKKNAEN 675 Query: 396 SKDEATRQASAAKSSATTASTKATEAAGSATAAAQSKSTAESAAT--RAETAAKRAEDIA 453 + +A+ +A+ AK++A TA+ KATEA +A A ++ A++ A AE A+ A+ + Sbjct: 676 AAKKASAKATEAKTTAETATKKATEAKTAAGNAQKASENAKAIAADVLAEKASTEAQSLK 735 Query: 454 SAVALEDASTTKKGIVQLSSATNSTSETLAATPKAVKSAYDNAEKRLQKDQNG 506 A K + A + AA ++ ++ K G Sbjct: 736 EEAKKLAADIKKSNVTNEEKAKRDKAANDAAHQASLSASKAKEAKTAATQAKG 788 Score = 48.0 bits (112), Expect = 0.002, Method: Composition-based stats. 
Identities = 74/365 (20%), Positives = 143/365 (39%), Gaps = 15/365 (4%) Query: 100 PEALRRFELMVEEVARNASAVAQNTAAAKKSASDASTSAREAATHAADAADSARAASTSA 159 +A+ + E EE+ ++ + + + K+ A++ A++ A + + A+A + SA Sbjct: 292 TDAVEKLEKASEELLKD-NYLRDTVNSLKEGATEEQKKAKKEEEKAKISEEVAKAEAASA 350 Query: 160 GQAASSAQSASSSAGTASTKATEASKSAAAAESSKSAAATSAGAAKTSETNASASLQSAA 219 Q A ++ + + + A+T A T A+ + A Sbjct: 351 KQFAKIEAERANYEAN-KIAENHPNTNVTEEANKAKVASTKA------STEATKASTEAT 403 Query: 220 TSASTATTKASEAA-----TSARDAAASKEAAKSSETNASSSASSAASSATAAGNSAKAA 274 +++ AT +S+AA T AA + +E + + A ++ A SA+AA Sbjct: 404 NASTEATKPSSKAANVKKKTDEAIKAAKEAKKAKTEAYIALFVTKAMAAKEKAKKSAEAA 463 Query: 275 KTSETNARSSETAAGQSASAAAGSKTAAASSASAASTSAGQASASATAAGKSAESAASSA 334 ++ A + A+ ++ A + T A + T+A A +A + ++ A + Sbjct: 464 DKAKAQAEAVNGASEKTKKDAEHAATKANEKKTHTETAADAAKKNAEVKVEEEDNVAKNE 523 Query: 335 STATTKAGEATEQASAAARSASAAKTSETNAKASETSAESSKTAAASSASSAASSASSAS 394 K + E+ A +S T + +A ++ A A +A A A Sbjct: 524 EKMKKKVDDVIEKVLEALKSEED--TYQAQIQAEIAVQVANVEEACEKAKTAEQEAKKAK 581 Query: 395 ASKDEATRQASAAKSSATTASTKATEAAGSATAAAQSKSTAESAATRAETAAKRAEDIAS 454 +A ++A AK A A A A A + ++ A A AET A ++ Sbjct: 582 DEAVKAAKEAEEAKKQAEKAEKITKTATEEANKAKEEEAKASEAKQEAETKAGDVDEEVY 641 Query: 455 AVALE 459 AV +E Sbjct: 642 AVNVE 646 Score = 46.4 bits (108), Expect = 0.006, Method: Composition-based stats. 
Identities = 67/436 (15%), Positives = 143/436 (32%), Gaps = 3/436 (0%) Query: 76 TVYEDSQPGTLNDFLGAMTEDDARPEALRRFELMVEEVARNASAVAQNTAAAKKSASDAS 135 T E + + + E+D + + + V++V ++ ++ A Sbjct: 496 THTETAADAAKKNAEVKVEEEDNVAKNEEKMKKKVDDVIEKVLEALKSEEDTYQAQIQAE 555 Query: 136 TSAREAA-THAADAADSARAASTSAGQAASSAQSASSSAGTASTKATEASKSAAAAESSK 194 + + A A + A +A + A A A + A + KA + +K+A + Sbjct: 556 IAVQVANVEEACEKAKTAEQEAKKAKDEAVKAAKEAEEAKKQAEKAEKITKTATEEANKA 615 Query: 195 SAAATSAGAAKTSETNASASLQSAATSASTATTKASEAATSARDAAASKEAAKSSETNAS 254 A AK + + + + AA +A + K + + Sbjct: 616 KEEEAKASEAKQEAETKAGDVDEEVYAVNVEFESVKAAAKAAAHHKVPEILDKEKKNAEN 675 Query: 255 SSASSAASSATAAGNSAKAAKTSETNARSSETAAGQSASAAAGSKTAAASSASAASTSAG 314 ++ ++A + A + A K + ++ A S +A A + A AS + S Sbjct: 676 AAKKASAKATEAKTTAETATKKATEAKTAAGNAQKASENAKAIAADVLAEKASTEAQSLK 735 Query: 315 QASASATAAGKSAESAASSASTATTKAGEATEQASAAARSASAAKTSETNAKASETSAES 374 + + A K + + A +A QAS +A A AKT+ T AK + Sbjct: 736 EEAKKLAADIKKSNVTNEEKAKRDKAANDAAHQASLSASKAKEAKTAATQAKGEVALEKK 795 Query: 375 SKTAAASSASSAASSASSASASKD--EATRQASAAKSSATTASTKATEAAGSATAAAQSK 432 + +A + ++ + + A+ + + +Q + + + + + Sbjct: 796 KEESAKAVEAAKEAMKARDKAAFELLKLKKQDVLEQVDVSPSGNNNLNDVDEQVSLEVGE 855 Query: 433 STAESAATRAETAAKRAEDIASAVALEDASTTKKGIVQLSSATNSTSETLAATPKAVKSA 492 ES + + E+ E + + T + K+ K Sbjct: 856 QENESDDAPPQETEEVTEEGDEEDEEEMEEDEIQDESGHTEETPTEQAAEQEKSKSEKVL 915 Query: 493 YDNAEKRLQKDQNGAD 508 D L Q+ D Sbjct: 916 NDEEAHNLLAQQHKED 931 >UniRef50_A4AGZ2 Large Ala/Glu-rich protein n=1 Tax=marine actinobacterium PHSC20C1 RepID=A4AGZ2_9ACTN Length = 779 Score = 58.8 bits (140), Expect = 1e-06, Method: Composition-based stats. 
Identities = 92/429 (21%), Positives = 167/429 (38%), Gaps = 26/429 (6%) Query: 98 ARPEALRRFELMVEEVARNASAVAQNTAAAKKSASDASTSAREAATHAADAADSARAAST 157 R +A E ++ + R A+A++Q + ++ S + + EAA ADA AR A+T Sbjct: 110 LRDDATSESERILADANREAAALSQ--TSRSEADSLIARARDEAAQLTADA---AREAAT 164 Query: 158 SAGQAASSAQSASSSAGTASTKATEASKSAAAAESSKSAAATSAGAAKTSETNASASLQS 217 G A+ A +SA + ++ A + +A + A L Sbjct: 165 IRGAVATEAADVRTSAKREAAALRAETERAMTELRATTATDVAEAREAAEALARDAELGR 224 Query: 218 AATSASTATTKA---SEAATSARDAAASKEAAKSSETNASSSASSAASSATAAGNSAKAA 274 A A + + + D A E A++ S++A + ++ T A ++ A Sbjct: 225 ATFEAENTKQREDLDRDIEQARTDIAREIETARAELEAESTTARESLANETKAAHTKLAN 284 Query: 275 KTSETNARSSETAAGQSASAAAGSKTAAASSASAASTSAGQASASATAAGKSAESAASSA 334 +T A + +A A ++ AA +A A T A +A+ + AA +E A Sbjct: 285 ETKA--AHTKLANETAAAQAKQKAELDAAYAALEAETQATRAATESEAARIRSELEAEVL 342 Query: 335 STATTKAGEATEQAS----AAARSASAAKTSETNAKASETSAESSKTAAASSASSAASSA 390 +T A E TE + + +T+ +A+ S+ AA ++ +A Sbjct: 343 TTRAELAAELTETRELWEVEERDARAQLAADDTSTRAALEDEVSTTRAAVEHEVTSTKAA 402 Query: 391 ------SSASASKDEATRQASAAKSSATTASTKATEAAGSATAAAQSKSTAESAATRAET 444 ++ +A + E +A + TT + + +ATAA + T A A Sbjct: 403 LEQEVTTTKTALEQEVKVTKAALEQEVTTTRSNLEQEVATATAALNDEVTTTRAELEAHV 462 Query: 445 AAKRAEDIAS------AVALEDASTTKKGIVQLSSATNSTSETLAATPKAVKSAYDNAEK 498 R+E A A+ E AST K ++SS + + +T ++ NA Sbjct: 463 KVTRSELAADVAAQKKALDREVASTKKALAKEVSSTRTALENDVESTRAGLEEEIANARS 522 Query: 499 RLQKDQNGA 507 K+ + A Sbjct: 523 EFDKESSTA 531 >UniRef50_A7TJN9 Putative uncharacterized protein n=3 Tax=cellular organisms RepID=A7TJN9_VANPO Length = 1256 Score = 58.4 bits (139), Expect = 2e-06, Method: Composition-based stats. 
Identities = 70/387 (18%), Positives = 156/387 (40%), Gaps = 7/387 (1%) Query: 107 ELMVEEVARNASAVAQNTAAAKKSASDASTSAREAATHAADAADSARAASTSAGQAASSA 166 E+ E A + A+ + ST A + + ++ A+ ++SA Sbjct: 339 EIESTESASATLTSESSIVASTNATEIESTEPASATLTSESSIVASTNATEIESTESASA 398 Query: 167 QSASSSAGTASTKATE--ASKSAAAAESSKSAAATSAGAAKTSETNASASLQSAATSAST 224 S S+ AST ATE +++SA+A +S+S+ S A + T +++ ++ +S Sbjct: 399 TLTSESSIVASTNATEIESTESASATLTSESSIVASTNATEIESTEPASATLTSESSIVA 458 Query: 225 ATTKASEAATSARDAAASKEAAKSSETNASSSASSAASSATAAGNSAKAAKTSETNARSS 284 +T +T + A + E++ + TNA+ S+ +SAT S+ A T+ T Sbjct: 459 STNATEIESTESASATLTSESSIVASTNATEIESTEPASATLTSESSIVASTNAT----- 513 Query: 285 ETAAGQSASAAAGSKTAAASSASAASTSAGQASASATAAGKSAESAASSASTATTKAGEA 344 E + +SASA S++ + + + ST Q S S ++ ++ ++ ++S+ T Sbjct: 514 EIESTESASATLTSESNSFNHTTVGSTETTQTSESVISSLYTSINSTLASSSTTGALSTQ 573 Query: 345 TEQASAAARSASAAKTSETNAKASETSAESSKTAAASSASSAASSASSASASKDEATRQA 404 + SA S++ + + T+ E S ++ + S+ + +++ +++ Sbjct: 574 SVSESAIYTSSNTTIPQTSEPVTTITTTECSGDVCSTVTITEPCSSETVTSTHEQSVTTI 633 Query: 405 SAAKSSATTASTKATEAAGSATAAAQSKSTAESAATRAETAAKRAEDIASAVALEDASTT 464 + + S ST S+ + + + T E + + + T Sbjct: 634 TTTECSNDVCSTVTITEPCSSETVTSTHEQSVTTITTTECSNDVCSTVTITEPCSSETVT 693 Query: 465 KKGIVQLSSATNSTSETLAATPKAVKS 491 +++ T + + + Sbjct: 694 ATHEQSVTTITTTECSNDVCSTVTITE 720 Score = 56.1 bits (133), Expect = 8e-06, Method: Composition-based stats. 
Identities = 70/345 (20%), Positives = 144/345 (41%), Gaps = 2/345 (0%) Query: 143 THAADAADSARAASTSAGQAASSAQSASSSAGTASTKATEASKSAAAAESSKSAAATSAG 202 T + S A + ++AS+ ++ SS ++ S A+A + ++ ++ Sbjct: 326 TSESSIVASTNATEIESTESASATLTSESSIVASTNATEIESTEPASATLTSESSIVAST 385 Query: 203 AAKTSETNASASLQSAATSASTATTKASEAATSARDAAA-SKEAAKSSETNASSSASSAA 261 A E+ SAS + S+ A+T A+E ++ +A + E++ + TNA+ S+ Sbjct: 386 NATEIESTESASATLTSESSIVASTNATEIESTESASATLTSESSIVASTNATEIESTEP 445 Query: 262 SSATAAGNSAKAAKTSETNARSSETAAGQ-SASAAAGSKTAAASSASAASTSAGQASASA 320 +SAT S+ A T+ T S+E+A+ ++ ++ + T A S SA S S+ Sbjct: 446 ASATLTSESSIVASTNATEIESTESASATLTSESSIVASTNATEIESTEPASATLTSESS 505 Query: 321 TAAGKSAESAASSASTATTKAGEATEQASAAARSASAAKTSETNAKASETSAESSKTAAA 380 A +A S+ S + T E+ S +TSE+ + TS S+ +++ Sbjct: 506 IVASTNATEIESTESASATLTSESNSFNHTTVGSTETTQTSESVISSLYTSINSTLASSS 565 Query: 381 SSASSAASSASSASASKDEATRQASAAKSSATTASTKATEAAGSATAAAQSKSTAESAAT 440 ++ + + S S ++ T ++ T +T+ + S + S+ +T Sbjct: 566 TTGALSTQSVSESAIYTSSNTTIPQTSEPVTTITTTECSGDVCSTVTITEPCSSETVTST 625 Query: 441 RAETAAKRAEDIASAVALEDASTTKKGIVQLSSATNSTSETLAAT 485 ++ S + T+ + ++T+ S T T Sbjct: 626 HEQSVTTITTTECSNDVCSTVTITEPCSSETVTSTHEQSVTTITT 670 Score = 48.4 bits (113), Expect = 0.002, Method: Composition-based stats. 
Identities = 68/387 (17%), Positives = 149/387 (38%), Gaps = 9/387 (2%) Query: 107 ELMVEEVARNASAVAQNTAAAKKSASDASTSAREAATHAADAADSARAASTSAGQAASSA 166 E+ E A + A+ + ST + A + + ++ A+ ++SA Sbjct: 364 EIESTEPASATLTSESSIVASTNATEIESTESASATLTSESSIVASTNATEIESTESASA 423 Query: 167 QSASSSAGTASTKATE--ASKSAAAAESSKSAAATSAGAAKTSETNASASLQSAATSAST 224 S S+ AST ATE +++ A+A +S+S+ S A + T ++++ ++ +S Sbjct: 424 TLTSESSIVASTNATEIESTEPASATLTSESSIVASTNATEIESTESASATLTSESSIVA 483 Query: 225 ATTKASEAATSARDAAASKEAAKSSETNASSSASSAASSATAAGNSAKAAKTSETNARSS 284 +T +T A + E++ + TNA+ S+ ++SAT S T+ + ++ Sbjct: 484 STNATEIESTEPASATLTSESSIVASTNATEIESTESASATLTSESNSFNHTTVGSTETT 543 Query: 285 ETAAGQSASAAAGSKTAAASSASAASTSAGQASASATAAGKSAESAASSASTATTKAGE- 343 +T+ +S + ASS++ + S S SA + +S T E Sbjct: 544 QTSESVISSLYTSINSTLASSSTTGALSTQSVSESAIYTSSNTTIPQTSEPVTTITTTEC 603 Query: 344 -----ATEQASAAARSASAAKTSETNAKASETSAESSKTAAASSASSAASSASSASASKD 398 +T + S + T E + T+ S+ + + + S+ + +++ + Sbjct: 604 SGDVCSTVTITEPCSSETVTSTHEQSVTTITTTECSNDVCSTVTITE-PCSSETVTSTHE 662 Query: 399 EATRQASAAKSSATTASTKATEAAGSATAAAQSKSTAESAATRAETAAKRAEDIASAVAL 458 ++ + + S ST S+ + + + T E + + Sbjct: 663 QSVTTITTTECSNDVCSTVTITEPCSSETVTATHEQSVTTITTTECSNDVCSTVTITEPC 722 Query: 459 EDASTTKKGIVQLSSATNSTSETLAAT 485 + T + + T S + +T Sbjct: 723 SSETVTTQEQSITTYTTTECSNDVCST 749 Score = 48.0 bits (112), Expect = 0.002, Method: Composition-based stats. 
Identities = 59/255 (23%), Positives = 114/255 (44%), Gaps = 6/255 (2%) Query: 242 SKEAAKSSETNASSSASSAASSATAAGNSAKAAKTSETNARSSE-TAAGQSASAAAGSKT 300 + E++ + TNA+ S+ ++SAT S+ A T+ T S+E +A ++ ++ + T Sbjct: 326 TSESSIVASTNATEIESTESASATLTSESSIVASTNATEIESTEPASATLTSESSIVAST 385 Query: 301 AAASSASAASTSAGQASASATAAGKSAESAASSASTATTKAGEATEQASAAARSASAAKT 360 A S S SA S S+ A +A S+ S + T E++ AS A + T Sbjct: 386 NATEIESTESASATLTSESSIVASTNATEIESTESASATLTSESSIVASTNATEIES--T 443 Query: 361 SETNAKASETSAESSKTAAASSASSAASSASSASASKDEATRQASAAKSSATTASTKATE 420 +A + S+ + T A S+ ++SA+ S S A+ A+ +S+ ++T +E Sbjct: 444 EPASATLTSESSIVASTNATEIESTESASATLTSESSIVASTNATEIESTEPASATLTSE 503 Query: 421 AAGSATAAAQSKSTAESAATRAETAAKRAEDI---ASAVALEDASTTKKGIVQLSSATNS 477 ++ A+ A + ESA+ + + ++ S ++S S Sbjct: 504 SSIVASTNATEIESTESASATLTSESNSFNHTTVGSTETTQTSESVISSLYTSINSTLAS 563 Query: 478 TSETLAATPKAVKSA 492 +S T A + ++V + Sbjct: 564 SSTTGALSTQSVSES 578 >UniRef50_Q5TVN3 AGAP010846-PA (Fragment) n=1 Tax=Anopheles gambiae RepID=Q5TVN3_ANOGA Length = 738 Score = 58.0 bits (138), Expect = 2e-06, Method: Composition-based stats. 
Identities = 61/399 (15%), Positives = 126/399 (31%), Gaps = 19/399 (4%) Query: 111 EEVARNASAVAQNTAAAKKSASDASTSAREAATHAADAADSARAASTSAGQAASSAQSAS 170 E+ A+ + + + KS + S S + + D+ + + + + + + Q Sbjct: 224 EKPEDEAAEESDDPKQSPKSGKEPSKSQPKKGSDKPDSKNKKPSEDSKSSEDPENQQDDV 283 Query: 171 SSAGTASTKATEASKSAAAAESSKSAAATSAGAA---------KTSETNASASLQSAATS 221 SS A E+ + +S K + + KTSE + S+ Sbjct: 284 SSEKPEDEAAEESDDPKQSLKSGKEPSKSQPKKGSDKPDSKNKKTSEDSKSSEDPENQQD 343 Query: 222 ASTATTKASEAATSARDAAASKEAAKSSETNASSSASSAASSATAAGNSAKAAKTSETNA 281 + EAA + D S ++AK + S S + ++ + Sbjct: 344 DVNSEKPEDEAAEESDDPKQSSKSAKEPSKSQLKKESDKPDSKNK-------KPSEDSKS 396 Query: 282 RSSETAAGQSASAAAGSKTAAASSASAASTSAGQASASATAAGKSAESAASSASTATTKA 341 S+ S + S+ + ++ + A + + Sbjct: 397 SEDPENQQDDVSSENPEDKNKKPSEDSKSSEGPEIEQDDVSSENPEDKVAEESDDPKQSS 456 Query: 342 GEATEQASAAARSASAAKTSETNAKASETSAESSKTAAASSASSAASSASSASASKDEAT 401 A E + + + S S+ N K SE S S S+ A+ D+ Sbjct: 457 KSAKEPSKSQPKKESEKPDSK-NKKPSEDSKSSEDPENQQDDVSSEKPEDEAAEESDDPK 515 Query: 402 RQASAAKSSATTASTKATEAAGSATAAAQSKSTAESAATRAE--TAAKRAEDIASAVALE 459 + +AK S+ + K +E S + + ++++ ED A+ + + Sbjct: 516 QSPKSAKESSKSQPKKGSEKPDIKNKKPSKDSKSSEDPENQQDDVSSEKPEDEAAEESDD 575 Query: 460 DASTTKKGIVQLSSATNSTSETLAATPKAVKSAYDNAEK 498 + K G S SE K + ++E Sbjct: 576 PKQSPKSGKEPSKSQPKKGSEKPDIKNKNPSADSKSSED 614 Score = 57.2 bits (136), Expect = 3e-06, Method: Composition-based stats. 
Identities = 40/371 (10%), Positives = 116/371 (31%), Gaps = 4/371 (1%) Query: 111 EEVARNASAVAQNTAAAKKSASDASTSAREAATHAADAADSARAASTSAGQ--AASSAQS 168 E + + + KS + E+ + + + ++ + Sbjct: 45 AEESDDPKQSPKTGKEPSKSQPKKGSEKPESKHKKSSEDSKSSEDPENQQDDLSSEKPED 104 Query: 169 ASSSAGTASTKATEASKSAAAAESSKSAAATSAGAAKTSETNASASLQSAATSASTATTK 228 ++ ++ +++K + ++ K + + K SE + S+ ++ Sbjct: 105 KAAEESDDPKQSPKSAKEPSKSQPKKGSEKPDSKNKKPSEDSKSSEDPENQQDDVSSEKP 164 Query: 229 ASEAATSARDAAASKEAAKSSETNASSSASSAASSATAAGNSAKAAKTSETNARSSETAA 288 EAA + D S ++AK + + S + S + + + + + ++ Sbjct: 165 EDEAAEESDDPKQSSKSAKEP-SKSQPKKGSDKPDSKNKKPSEDSKSSEDPENQQDDVSS 223 Query: 289 GQSASAAAGSKTAAASSASAASTSAGQASASATAAGKSAESAASSASTATTKAGEATEQA 348 + AA S + + + S S S ++ + Sbjct: 224 EKPEDEAAEESDDPKQSPKSGKEPSKSQPKKGSDKPDSKNKKPSEDSKSSEDPENQQDDV 283 Query: 349 SAAARSASAAKTSETNAKASETSAESSKTAAASSASSAASSASSASASKDEATRQASAAK 408 S+ AA+ S+ ++ ++ E SK+ + S S + + Sbjct: 284 SSEKPEDEAAEESDDPKQSLKSGKEPSKSQPKKGSDKPDSKNKKTSEDSKSSEDPENQQD 343 Query: 409 S-SATTASTKATEAAGSATAAAQSKSTAESAATRAETAAKRAEDIASAVALEDASTTKKG 467 ++ +A E + +++S + + E+ +++ + + + + Sbjct: 344 DVNSEKPEDEAAEESDDPKQSSKSAKEPSKSQLKKESDKPDSKNKKPSEDSKSSEDPENQ 403 Query: 468 IVQLSSATNST 478 +SS Sbjct: 404 QDDVSSENPED 414 Score = 56.5 bits (134), Expect = 6e-06, Method: Composition-based stats. 
Identities = 43/368 (11%), Positives = 119/368 (32%), Gaps = 11/368 (2%) Query: 107 ELMVEEVARNASAVAQNTAAAKKSASDASTSAREAATHAADAADSARAASTSAGQAASSA 166 + +++ + + + + KS+ D + ++ + + + + + + Sbjct: 373 KSQLKKESDKPDSKNKKPSEDSKSSEDPENQQDDVSSENPEDKNKKPSEDSKSSEGPEIE 432 Query: 167 QSASSSAGTASTKATEASKSAAAAESSKSAAATSAGAA---------KTSETNASASLQS 217 Q SS A E+ +++S+K + + K SE + S+ Sbjct: 433 QDDVSSENPEDKVAEESDDPKQSSKSAKEPSKSQPKKESEKPDSKNKKPSEDSKSSEDPE 492 Query: 218 AATSASTATTKASEAATSARDAAASKEAAKSSETNASSSASSAASSATAAGNSAKAAKTS 277 ++ EAA + D S ++AK S + + S S + + Sbjct: 493 NQQDDVSSEKPEDEAAEESDDPKQSPKSAKES-SKSQPKKGSEKPDIKNKKPSKDSKSSE 551 Query: 278 ETNARSSETAAGQSASAAAGSKTAAASSASAASTSAGQASASATAAGKSAESAASSASTA 337 + + + ++ + AA S + + + S+ S + Sbjct: 552 DPENQQDDVSSEKPEDEAAEESDDPKQSPKSGKEPSKSQPKKGSEKPDIKNKNPSADSKS 611 Query: 338 TTKAGEATEQASAAARSASAAKTSETNAKASETSAESSKTAAASSASSAASSASSASASK 397 + + S+ AA+ S+ ++ +++ E SK+ + S S Sbjct: 612 SEDPENQQDDVSSEKPEDEAAEESDDPKQSPKSAKEPSKSQLKKGSEKPDSKNKRPSEDS 671 Query: 398 DEATRQASAAKSSATTASTKATEAAGSATAAAQSKSTAESAATRAETAAKRAEDIASAVA 457 + + + ++ + S K+ E + ++ + +++ + +A Sbjct: 672 KSSEDPEN-QQDDVSSEKPEDEAGEESDDPKQSPKTGKEPSKSQPKKGSEKPDIKLAART 730 Query: 458 LEDASTTK 465 S+ Sbjct: 731 QPAESSLT 738 Score = 49.1 bits (115), Expect = 9e-04, Method: Composition-based stats. 
Identities = 48/327 (14%), Positives = 109/327 (33%), Gaps = 5/327 (1%) Query: 189 AAESSKSAAATSAGAAKTSETNASASLQSAATSASTATTKASEAATSARDAAASKEAAKS 248 ++ K++ K SE + S+ ++ EAA + D S + K Sbjct: 1 EVQAKKASQKPDGKHKKPSEDSKSSEDPENQQDDVSSEKPKDEAAEESDDPKQSPKTGKE 60 Query: 249 SETNASSSASSAASSATAAGNSAKAAKTSETNARSSETAAGQSASAAAGSKTAAASSASA 308 + + S + +S + + + + + ++ + AA S + Sbjct: 61 P-SKSQPKKGSEKPESKHKKSSEDSKSSEDPENQQDDLSSEKPEDKAAEESDDPKQSPKS 119 Query: 309 ASTSAGQASASATAAGKSAESAASSASTATTKAGEATEQASAAARSASAAKTSETNAKAS 368 A + + S S S ++ + S+ AA+ S+ ++S Sbjct: 120 AKEPSKSQPKKGSEKPDSKNKKPSEDSKSSEDPENQQDDVSSEKPEDEAAEESDDPKQSS 179 Query: 369 ETSAESSKTAAASSASSAASSASSASASKDEATRQASAAKSSATTASTKATEAAGSATAA 428 +++ E SK+ + S S + + + ++ + A S Sbjct: 180 KSAKEPSKSQPKKGSDKPDSKNKKPSEDSKSSEDPEN-QQDDVSSEKPEDEAAEESDDPK 238 Query: 429 AQSKSTAESAATRAETAAKRAEDIASAVALEDASTTKKGIVQLSSATNSTSETLAATPK- 487 KS E + ++ + + + D + ED+ +++ Q ++ E AA Sbjct: 239 QSPKSGKEPSKSQPKKGSDKP-DSKNKKPSEDSKSSEDPENQQDDVSSEKPEDEAAEESD 297 Query: 488 -AVKSAYDNAEKRLQKDQNGADIPDKG 513 +S E + + G+D PD Sbjct: 298 DPKQSLKSGKEPSKSQPKKGSDKPDSK 324 >UniRef50_B8M9D3 PE repeat family protein n=3 Tax=Trichocomaceae RepID=B8M9D3_TALSN Length = 1304 Score = 58.0 bits (138), Expect = 2e-06, Method: Composition-based stats. 
Identities = 45/366 (12%), Positives = 95/366 (25%), Gaps = 9/366 (2%) Query: 124 TAAAKKSASDASTSAREAATHAADAADSARAASTSAGQAASSAQSASSSAGTASTKATEA 183 + + + + S ++A D + + + + A E Sbjct: 180 SEESSPTPEEPSEEKQKAEAAPTDEKSAEEQPPEDQPKEENQPDEVVVEEPAPTDTAAEN 239 Query: 184 SKSAAAAESSKSAAATSAGAAKTSETNASASLQSAATSASTATTKASEAATSARDAAASK 243 + + A+E+ + AA S T A+ +S + +T + K Sbjct: 240 TAADGASETPSAEAADSTNPEATESEPANEEQESPEPPKNESTGGNPKKNKKKDKKNKKK 299 Query: 244 EAAKSSETNASSSASSAASSATAAGNSAKAAKTSETNARSSETAAGQSASAAAGSKTAAA 303 A E A + A E++ A G + Sbjct: 300 NAPSEPEPEAPAIPEG------PPAEEATKVDLPVEEKPVEESSNPTEAWEGFGGGKKSK 353 Query: 304 SSASAASTSAGQASASATAAGKSAESAASSASTATTKAGEATEQAS---AAARSASAAKT 360 QA + + ++ + TEQAS + A+ Sbjct: 354 KKNGKKGKKKSQAQDTPAEEAPQEVVSTEDSAAVDPPTADTTEQASTEDTPTPAEPVAEQ 413 Query: 361 SETNAKASETSAESSKTAAASSASSAASSASSASASKDEATRQASAAKSSATTASTKATE 420 T + + + + + EA +K A +AT+ Sbjct: 414 PATEEPTGGEAVAEAPVVEEPVVENLPAEEEKGDGPEQEAPPVDETSKELHEAAPAEATQ 473 Query: 421 AAGSATAAAQSKSTAESAATRAETAAKRAEDIASAVALEDASTTKKGIVQLSSATNSTSE 480 + ++ E++ E A A + + S+ N +++ Sbjct: 474 TTEEGEEPSSEEAVPEASQEEPAAEDSTPETSAETPAETVKDAKEDASTEEPSSANVSTD 533 Query: 481 TLAATP 486 TP Sbjct: 534 DTTVTP 539 Score = 53.0 bits (125), Expect = 7e-05, Method: Composition-based stats. 
Identities = 44/424 (10%), Positives = 110/424 (25%), Gaps = 10/424 (2%) Query: 130 SASDASTSAREAATHAADAADSARAASTSAGQAASSAQSASSSAGTASTKATEASKSAAA 189 ++ + E + ++ + + A + S + A+ +A + Sbjct: 18 EPQASADANNEGDAEQTTEENQSKPSEETPAPANDNPPSEAKDEEPATQTEEKAEGEESG 77 Query: 190 AESSKSAAATSAGAAKTSETNASASLQSAATSASTATTKASEAATSARDAAASKEAAKSS 249 + + + +T A T A T ++EA T+ A A Sbjct: 78 GDVTANGDSTEVTDAPAESVEKEGGENDTPTDAPAEETASTEAVTTEDVAPAENTPRTED 137 Query: 250 ETNASSSASSAASSATAAGNSAKAAKTSETNARSSETAAGQSASAAAGSKTAAASSASAA 309 + SS S S A E E A ++ ++ + + A Sbjct: 138 TPAVEDTPSSEDGSPEENYPSTDDAPQVEDATPIEENPANGTSEESSPTPEEPSEEKQKA 197 Query: 310 STSAGQASASATAAGKSAESAASSASTATTKAGEATEQASAAARSASAAKTSETNAKASE 369 + ++ + + + T+ A+ + ASE Sbjct: 198 EAAPTDEKSAEEQPPEDQPKEENQPDEVVVEEPAPTDTAAENTAADG----------ASE 247 Query: 370 TSAESSKTAAASSASSAASSASSASASKDEATRQASAAKSSATTASTKATEAAGSATAAA 429 T + + + A+ + + + + K + + Sbjct: 248 TPSAEAADSTNPEATESEPANEEQESPEPPKNESTGGNPKKNKKKDKKNKKKNAPSEPEP 307 Query: 430 QSKSTAESAATRAETAAKRAEDIASAVALEDASTTKKGIVQLSSATNSTSETLAATPKAV 489 ++ + E T + + + +G + + +A Sbjct: 308 EAPAIPEGPPAEEATKVDLPVEEKPVEESSNPTEAWEGFGGGKKSKKKNGKKGKKKSQAQ 367 Query: 490 KSAYDNAEKRLQKDQNGADIPDKGCFLNNINAVSKTDFADKRGMRYVRVNAPAGATSGKY 549 + + A + + ++ A + + T + P G + Sbjct: 368 DTPAEEAPQEVVSTEDSAAVDPPTADTTEQASTEDTPTPAEPVAEQPATEEPTGGEAVAE 427 Query: 550 YPVV 553 PVV Sbjct: 428 APVV 431 Score = 45.7 bits (106), Expect = 0.011, Method: Composition-based stats. 
Identities = 62/406 (15%), Positives = 129/406 (31%), Gaps = 7/406 (1%) Query: 107 ELMVEEVARNASAVAQNTAAAKKSASDASTSAREAATHAADAADSARAASTSAGQAASSA 166 + + E A + +A++ ++A A + +AA + A S Sbjct: 593 DSIQAEKEAEAKQAEDDAEEEASAAAEPVWGEGDSAQIDAHDDEETKAAEEAVVPEAESE 652 Query: 167 QSASSSAGTASTKATEASKSAAAAESSKSAAATSAGAAKTSETNASASLQSAATSASTAT 226 +++ T++TE A TS A +T+E + + A +A+ Sbjct: 653 AREAAAEVEEKTESTEEESPATEEPQPDPLEETSTDAPETNEESVTPKEAEAVEAAADDQ 712 Query: 227 TKASEAATSARDAAASKEAAKSSETNASSSASSAASSATAAGNSAKAAKTSETNARSSET 286 T A +A +E ++E S A+S+ + AK E A T Sbjct: 713 TIEEPAKE---EAQPVEEETPTAEDEMVSEEPEASSAEVDTADEELKAKIPEETATEDAT 769 Query: 287 AAGQSASAAAGSKTAAASSASAASTSAGQASASATAAGKSAESAASSASTATTKAGEATE 346 A ++ A A + + A + K Sbjct: 770 PEEVPAEDTPQEESVIDE---ATPEPAPEETIVADDPKDELAEEPAPEEEVAAKETGEEP 826 Query: 347 QASAAARSASAAKTSETNAKASETSAESSKTAAASSASSAASSASSASASKDEATRQASA 406 A AS ++ + + + E + A++ A A+ A+ + A Sbjct: 827 VEKPATEPASEELPADDKPEETSQAEEPAVEEASAPEPVAEEPAAEEVAAAEPVEESPPA 886 Query: 407 AKSSATTASTKATEAAGSATAAAQSKSTAESAATRAETAAKRAEDIASAVALEDASTTKK 466 A+++ + + + A E+ A A + +A + + + Sbjct: 887 AEAAKDEPAEEVVAEVAADETLAAEAPAEEAPAEEAPAPVEVTVPEEAAEQVAEGEPVEA 946 Query: 467 GIVQLSSATNSTSETLAATPKAVKSAYDNAEKRLQKDQNGADIPDK 512 + ++A + SE + P A ++ + + ADI D+ Sbjct: 947 PAAEGATA-ETPSEPVNEEPLAKEAPVEEPAPEATEPTIEADIIDR 991 >UniRef50_B6Q6N2 PT repeat family protein n=1 Tax=Penicillium marneffei ATCC 18224 RepID=B6Q6N2_PENMQ Length = 2150 Score = 58.0 bits (138), Expect = 2e-06, Method: Composition-based stats. 
Identities = 69/404 (17%), Positives = 127/404 (31%), Gaps = 4/404 (0%) Query: 101 EALRRFELMVEEVARNASAVAQNTAAAKKSASDASTSAREAATHAADAADSARAASTSAG 160 EA EE A+ A + A +S S A T A + Sbjct: 517 EAPVEISDSAEEPAKEAVTEEADAVPASESDIKVEPSTEVAETPVEQVETEAVTEAADEP 576 Query: 161 QAASSAQSASSSAGTASTKATEASKSAAAAESSKSAAATSAGAAKTSETNASASLQSAAT 220 + + S T A +S+ E + + + S ++ + Sbjct: 577 VSVPAEVSEVPIPETTEVTEETAPESSKLEEEPTTEPTAEIEEVAEEKEPVTESAETPSE 636 Query: 221 SASTATTKASEAATSARDAAASKEAAKSSETNASSSASSAASSATAAGNSAKAAKTSETN 280 + T +E AT+ +S ++ +S S A A+ Sbjct: 637 TVPATTEDTAEEATNDESPKEVDAHEESVTAEPETATTSPEVSEEQTTALAAVAEPEHLA 696 Query: 281 ARSSETAAGQSASAAAGSKTAAASSASAASTSAGQASASATAAGKSAESAASSASTATTK 340 S Q A A +T A+ ++T +A+A A + + E ++ + Sbjct: 697 VEESSPEPAQVAEEKAEDRTEASEVPVNSTT---EAAAIAEESVTTVEPVDAAEKPEESV 753 Query: 341 AGEATEQASAAARSASAAKTSETNAKASETSAESSKTAAASSASSAASSASSASASKDEA 400 EA A + T T + +++ A S+A SAA S S Sbjct: 754 EEEAAAPVEVTEEPAVESATEPTEPTEATEEPAAAEPIAESTAESAAESVSEPVVEPAAE 813 Query: 401 TRQASAAKSSATTASTKATE-AAGSATAAAQSKSTAESAATRAETAAKRAEDIASAVALE 459 S +S A S TE A + +A+ + + + T AE + AVA Sbjct: 814 PVAESVEESVAEAVSEPVTETAVDPTSESAEPAAESVAEVAAEPTTETAAEPVVEAVAES 873 Query: 460 DASTTKKGIVQLSSATNSTSETLAATPKAVKSAYDNAEKRLQKD 503 + V+ + + + +A P A + + ++ + Sbjct: 874 VVEPISEPAVEPVAEPVAEAVAESAEPTAEPAVTEEEKEAAPSE 917 Score = 49.9 bits (117), Expect = 5e-04, Method: Composition-based stats. 
Identities = 48/389 (12%), Positives = 109/389 (28%), Gaps = 2/389 (0%) Query: 114 ARNASAVAQNTAAAKKSASDASTSAREAATHAADAADSARAASTSAGQAASSAQSASSSA 173 A A + AA + +++ +AD +++ + +A +S + Sbjct: 902 AEPAVTEEEKEAAPSEVVAESDAVHSHDENTSADISEATKTTVEDGQPSAEKEESLPTEE 961 Query: 174 GTASTKATEASKSAAAAESSKSAAATSAGAAKTSETNASASLQSAATSASTATTKASEAA 233 AS E++ A+ + ++ +E ++ ++ + A+ + E Sbjct: 962 APASEPVEESATPEASENEPSAEVVEETPVSEPAEAPVASLKEAEPVQEAAASDEVVEPV 1021 Query: 234 TSARDAAASKEAAKSSETNASSSASSAASSATAAGNSAKAAKTSETNARSSETAAGQSAS 293 A++ + ++ + +T + T+E ++ + A Sbjct: 1022 VVEEAVASTVNEPEPVTEDSEVQPAEEVDPSTVKEELPEEPVTAEEPKAATTETVAEEAI 1081 Query: 294 AAAGSKTAAASSASAASTSAGQASASATAAGKSAESAASSASTATTKAGEATEQASAAAR 353 A +A A + S S T + A A Sbjct: 1082 ETETDAIAEEPAAEDAEEEEEEEEEEEIETTTEPAVDTSEESKEETADEQPITDAVEATV 1141 Query: 354 SASAAKTSETNAKASETSAESSKTAAASSASSAASSASSASASKDEATRQASAAKSSATT 413 +A + S A T ++ S A + S A AK+ Sbjct: 1142 VENAVEESTIEEPAEATEKTPTEVVGESPVEPAQEALSKAVEETPAEPTVEEPAKTVEAV 1201 Query: 414 ASTK--ATEAAGSATAAAQSKSTAESAATRAETAAKRAEDIASAVALEDASTTKKGIVQL 471 T + E TA ++ E + E + + + + Sbjct: 1202 TETPVESVEEQPVDTAEQVAEPVEEVSVEETAAEPATEEPTQIVEETPLEADVETAVEPV 1261 Query: 472 SSATNSTSETLAATPKAVKSAYDNAEKRL 500 + E + +SA + K + Sbjct: 1262 VAEVAEPVEKTSVEVTPTQSADEEVAKTI 1290 Score = 44.9 bits (104), Expect = 0.017, Method: Composition-based stats. 
Identities = 57/441 (12%), Positives = 113/441 (25%), Gaps = 21/441 (4%) Query: 35 VNTLASENPDEAGRYSMDVEYGQYSVILLVEGFP-------PSHAGTITVYEDSQPGTLN 87 V T A+E +E +M E + V +V+ P P+ +G + E T Sbjct: 1422 VVTEATEAVEETPVEAMPAEPAEEQVTEVVQETPVEAIQETPTESGEVQAKEAVAEPTTE 1481 Query: 88 DFLGAMTEDDARPEALRRFELMVEEVARNASAVAQNTAAAKKSASDASTSAREAATHAAD 147 + + + V E A ++ D + + E Sbjct: 1482 SVEDEIRPAEETVVEVVEETPAVPESEAPAEPAQIVEETPVEADVDTAAAPVEEEVAETV 1541 Query: 148 AADSARAASTSAGQAASSAQSA---SSSAGTASTKATEASKSAAAAESSKSAAATSAGAA 204 + A+ + A+ + AA+ + + Sbjct: 1542 EETQVEPVEEKPAEPAAEETDKDVKEAPEAEAAEANEQVENVEDAAQEPAVEVSDATKEQ 1601 Query: 205 KTSETNASASLQSAATSASTATTKASEAATSARDAAASKEAAKSSETNASSSASSAASSA 264 T T+A A + + E+A + + +T + + S Sbjct: 1602 PTQSTDAPVEEPEAVIEEAKPEEEVVESANAVEEPLTKAVEEAPVQTEEIPAPVAETPSE 1661 Query: 265 TAAGNSAKAAKTSETNARSSETAAGQSASAAAGSKTAAASSASAASTSAGQASASATAAG 324 + A + +A+ A +T + A +A Sbjct: 1662 AHPDSHKITEVAVAAGAAAVAGLGVFAATKTASDET--------SKEQASEAKDVFIEIV 1713 Query: 325 KSAESAASSASTATTKAGEATEQASAAARSASAAKTSETNAKASETSAESSKTAAASSAS 384 K S+ ++ E E A+A + + + E A A+ Sbjct: 1714 KEVAPVESTDVEPSSLPEETAEPIEASAAKEVETEAAPVEEQPIEVLAAKEVETEAAPVE 1773 Query: 385 SAASSASSASASKDEATR---QASAAKSSATTASTKATEAAGSATAAAQSKSTAESAATR 441 A + + EA Q A ++ + A A + E+A Sbjct: 1774 EQPVEALATKDVETEAAPVEEQPVEALATKEVETEAAPVEEQPIEVLAAKEVETEAAPVE 1833 Query: 442 AETAAKRAEDIASAVALEDAS 462 + A A Sbjct: 1834 EQPVEALATKDVETEAAPVEE 1854 >UniRef50_P12027 Polysialoglycoprotein n=2 Tax=Oncorhynchus RepID=PSGP_ONCMY Length = 542 Score = 58.0 bits (138), Expect = 2e-06, Method: Composition-based stats. 
Identities = 80/399 (20%), Positives = 156/399 (39%), Gaps = 5/399 (1%) Query: 115 RNASAVAQNTAAAKKSASDASTSAREAATHAADAADSARAASTSAGQAASSAQSASSSAG 174 N + A+DA +S + ++D A ++ AA+ +G A+S + S Sbjct: 75 TNEVESQASPNHGSSPANDALSSEEKLRRVSSDDAATSEAATGPSGDDATSEAATGPSGD 134 Query: 175 TASTKATEASKSAAAAESSKSAAATSAGAAKTSETNASASLQSAATSASTATTKASEAAT 234 A+++A A + + + ++ + + S A + + SEAAT Sbjct: 135 DATSEAATGPSGDDATSEAATGPSGDDATSEAATGPSGDDATSEAATGPSGDDATSEAAT 194 Query: 235 SAR-DAAASKEAAKSSETNASSSASSAASSATAAGNSAKAAKTSETNARSSETAAGQSAS 293 D A S+ A S +A+S A++ S A +A + + ++ +G A+ Sbjct: 195 GPSGDDATSEAATGPSGDDATSEAATGPSGDDATSEAATGPSGDDATSEAATGPSGDDAT 254 Query: 294 AAAGSKTAAASSASAASTSAGQASASATAAGKSAESAASSASTATTKAGEATEQASAAAR 353 + A + + + S A+T A++ AA + A+S + +AT +A+ Sbjct: 255 SEAATGPSGDDATSEAATGPSGDDATSEAATGPSGDDATSEAATGPSGDDATSEAATGPS 314 Query: 354 SASAAKTSETNAKASETSAESSKTAAASSASSAASSASSASASKDEATRQASAAKSSATT 413 A + T + ++E++ + A+S A++ S + EA S +++ Sbjct: 315 GDDATSEAATGPSGDDATSEAATGPSGDDATSEAATGPSGDDATSEAATGPSGDDATSEA 374 Query: 414 ASTKATEAAGSATAAAQSKSTAESAATRAETAAKRAEDIASAVALEDAS----TTKKGIV 469 A+ + + A S A S A S A + + A+ + +DA+ T G Sbjct: 375 ATGPSGDDATSEAATGPSGDDATSEAATGPSGDDATSEAATGPSGDDATSEAATGPSGDD 434 Query: 470 QLSSATNSTSETLAATPKAVKSAYDNAEKRLQKDQNGAD 508 S A S A + A + D+A +G D Sbjct: 435 ATSEAATGPSGDDATSEAATGPSGDDATSEAATGPSGDD 473 Score = 56.8 bits (135), Expect = 5e-06, Method: Composition-based stats. 
Identities = 79/393 (20%), Positives = 154/393 (39%), Gaps = 5/393 (1%) Query: 121 AQNTAAAKKSASDASTSAREAATHAADAADSARAASTSAGQAASSAQSASSSAGTASTKA 180 A ++ + S + EAAT + ++ AA+ +G A+S + S A+++A Sbjct: 94 ALSSEEKLRRVSSDDAATSEAATGPSGDDATSEAATGPSGDDATSEAATGPSGDDATSEA 153 Query: 181 TEASKSAAAAESSKSAAATSAGAAKTSETNASASLQSAATSASTATTKASEAATSAR-DA 239 A + + + ++ + + S A + + SEAAT D Sbjct: 154 ATGPSGDDATSEAATGPSGDDATSEAATGPSGDDATSEAATGPSGDDATSEAATGPSGDD 213 Query: 240 AASKEAAKSSETNASSSASSAASSATAAGNSAKAAKTSETNARSSETAAGQSASAAAGSK 299 A S+ A S +A+S A++ S A +A + + ++ +G A++ A + Sbjct: 214 ATSEAATGPSGDDATSEAATGPSGDDATSEAATGPSGDDATSEAATGPSGDDATSEAATG 273 Query: 300 TAAASSASAASTSAGQASASATAAGKSAESAASSASTATTKAGEATEQASAAARSASAAK 359 + + S A+T A++ AA + A+S + +AT +A+ A Sbjct: 274 PSGDDATSEAATGPSGDDATSEAATGPSGDDATSEAATGPSGDDATSEAATGPSGDDATS 333 Query: 360 TSETNAKASETSAESSKTAAASSASSAASSASSASASKDEATRQASAAKSSATTASTKAT 419 + T + ++E++ + A+S A++ S + EA S +++ A+ + Sbjct: 334 EAATGPSGDDATSEAATGPSGDDATSEAATGPSGDDATSEAATGPSGDDATSEAATGPSG 393 Query: 420 EAAGSATAAAQSKSTAESAATRAETAAKRAEDIASAVALEDAS----TTKKGIVQLSSAT 475 + A S A S A S A + + A+ + +DA+ T G S A Sbjct: 394 DDATSEAATGPSGDDATSEAATGPSGDDATSEAATGPSGDDATSEAATGPSGDDATSEAA 453 Query: 476 NSTSETLAATPKAVKSAYDNAEKRLQKDQNGAD 508 S A + A + D+A +G D Sbjct: 454 TGPSGDDATSEAATGPSGDDATSEAATGPSGDD 486 Score = 56.1 bits (133), Expect = 8e-06, Method: Composition-based stats. 
Identities = 84/414 (20%), Positives = 159/414 (38%), Gaps = 6/414 (1%) Query: 101 EALRRFELMVEEVARNASAVAQNTAAAKKSASDASTSAREAATHAADAADSARAASTSAG 160 E LRR + A+ + + A ++ + + A A D+ A+T Sbjct: 99 EKLRRVSSDDAATSEAATGPSGDDATSEAATGPSGDDATSEAATGPSGDDATSEAATGPS 158 Query: 161 QAASSAQSASSSAGTASTKATEASKSAAAAESSKSAAATSAGAAKTSETNAS-ASLQSAA 219 +++++A+ +G +T S A S + + A + T S S A Sbjct: 159 GDDATSEAATGPSGDDATSEAATGPSGDDATSEAATGPSGDDATSEAATGPSGDDATSEA 218 Query: 220 TSASTATTKASEAATSAR-DAAASKEAAKSSETNASSSASSAASSATAAGNSAKAAKTSE 278 + + SEAAT D A S+ A S +A+S A++ S A +A + Sbjct: 219 ATGPSGDDATSEAATGPSGDDATSEAATGPSGDDATSEAATGPSGDDATSEAATGPSGDD 278 Query: 279 TNARSSETAAGQSASAAAGSKTAAASSASAASTSAGQASASATAAGKSAESAASSASTAT 338 + ++ +G A++ A + + + S A+T A++ AA + A+S + Sbjct: 279 ATSEAATGPSGDDATSEAATGPSGDDATSEAATGPSGDDATSEAATGPSGDDATSEAATG 338 Query: 339 TKAGEATEQASAAARSASAAKTSETNAKASETSAESSKTAAASSASSAASSASSASASKD 398 +AT +A+ A + T + ++E++ + A+S A++ S + Sbjct: 339 PSGDDATSEAATGPSGDDATSEAATGPSGDDATSEAATGPSGDDATSEAATGPSGDDATS 398 Query: 399 EATRQASAAKSSATTASTKATEAAGSATAAAQSKSTAESAATRAETAAKRAEDIASAVAL 458 EA S +++ A+ + + A S A S A S A + + A+ + Sbjct: 399 EAATGPSGDDATSEAATGPSGDDATSEAATGPSGDDATSEAATGPSGDDATSEAATGPSG 458 Query: 459 EDAS----TTKKGIVQLSSATNSTSETLAATPKAVKSAYDNAEKRLQKDQNGAD 508 +DA+ T G S A S A + A + D+A +G D Sbjct: 459 DDATSEAATGPSGDDATSEAATGPSGDDATSEAATGPSGDDATSEAATGPSGDD 512 Score = 55.7 bits (132), Expect = 1e-05, Method: Composition-based stats. 
Identities = 78/380 (20%), Positives = 150/380 (39%), Gaps = 5/380 (1%) Query: 137 SAREAATHAADAADSARAASTSAGQAASSAQSASSSAGTASTKATEASKSAAAAESSKSA 196 + EAAT + ++ AA+ +G A+S + S A+++A A + + Sbjct: 162 ATSEAATGPSGDDATSEAATGPSGDDATSEAATGPSGDDATSEAATGPSGDDATSEAATG 221 Query: 197 AATSAGAAKTSETNASASLQSAATSASTATTKASEAATSAR-DAAASKEAAKSSETNASS 255 + ++ + + S A + + SEAAT D A S+ A S +A+S Sbjct: 222 PSGDDATSEAATGPSGDDATSEAATGPSGDDATSEAATGPSGDDATSEAATGPSGDDATS 281 Query: 256 SASSAASSATAAGNSAKAAKTSETNARSSETAAGQSASAAAGSKTAAASSASAASTSAGQ 315 A++ S A +A + + ++ +G A++ A + + + S A+T Sbjct: 282 EAATGPSGDDATSEAATGPSGDDATSEAATGPSGDDATSEAATGPSGDDATSEAATGPSG 341 Query: 316 ASASATAAGKSAESAASSASTATTKAGEATEQASAAARSASAAKTSETNAKASETSAESS 375 A++ AA + A+S + +AT +A+ A + T + ++E++ Sbjct: 342 DDATSEAATGPSGDDATSEAATGPSGDDATSEAATGPSGDDATSEAATGPSGDDATSEAA 401 Query: 376 KTAAASSASSAASSASSASASKDEATRQASAAKSSATTASTKATEAAGSATAAAQSKSTA 435 + A+S A++ S + EA S +++ A+ + + A S A S A Sbjct: 402 TGPSGDDATSEAATGPSGDDATSEAATGPSGDDATSEAATGPSGDDATSEAATGPSGDDA 461 Query: 436 ESAATRAETAAKRAEDIASAVALEDAS----TTKKGIVQLSSATNSTSETLAATPKAVKS 491 S A + + A+ + +DA+ T G S A S A + A Sbjct: 462 TSEAATGPSGDDATSEAATGPSGDDATSEAATGPSGDDATSEAATGPSGDDATSEAATGP 521 Query: 492 AYDNAEKRLQKDQNGADIPD 511 + D+A +G D D Sbjct: 522 SGDDATSEAATGPSGDDAMD 541 Score = 53.0 bits (125), Expect = 7e-05, Method: Composition-based stats. 
Identities = 73/356 (20%), Positives = 134/356 (37%), Gaps = 5/356 (1%) Query: 158 SAGQAASSAQSASSSAGTASTKATEASKSAAAAESSKSAAATSAGAAKTSETNASASLQS 217 A S S + A + E A+ S A + K ++ + S Sbjct: 53 HALALLRSIGSDAKQAREEYLETNEVESQASPNHGSSPANDALSSEEKLRRVSSDDAATS 112 Query: 218 AATSASTATTKASEAATSAR-DAAASKEAAKSSETNASSSASSAASSATAAGNSAKAAKT 276 A + + SEAAT D A S+ A S +A+S A++ S A +A Sbjct: 113 EAATGPSGDDATSEAATGPSGDDATSEAATGPSGDDATSEAATGPSGDDATSEAATGPSG 172 Query: 277 SETNARSSETAAGQSASAAAGSKTAAASSASAASTSAGQASASATAAGKSAESAASSAST 336 + + ++ +G A++ A + + + S A+T A++ AA + A+S + Sbjct: 173 DDATSEAATGPSGDDATSEAATGPSGDDATSEAATGPSGDDATSEAATGPSGDDATSEAA 232 Query: 337 ATTKAGEATEQASAAARSASAAKTSETNAKASETSAESSKTAAASSASSAASSASSASAS 396 +AT +A+ A + T + ++E++ + A+S A++ S + Sbjct: 233 TGPSGDDATSEAATGPSGDDATSEAATGPSGDDATSEAATGPSGDDATSEAATGPSGDDA 292 Query: 397 KDEATRQASAAKSSATTASTKATEAAGSATAAAQSKSTAESAATRAETAAKRAEDIASAV 456 EA S +++ A+ + + A S A S A S A + + A+ Sbjct: 293 TSEAATGPSGDDATSEAATGPSGDDATSEAATGPSGDDATSEAATGPSGDDATSEAATGP 352 Query: 457 ALEDAS----TTKKGIVQLSSATNSTSETLAATPKAVKSAYDNAEKRLQKDQNGAD 508 + +DA+ T G S A S A + A + D+A +G D Sbjct: 353 SGDDATSEAATGPSGDDATSEAATGPSGDDATSEAATGPSGDDATSEAATGPSGDD 408 Score = 52.2 bits (123), Expect = 1e-04, Method: Composition-based stats. 
Identities = 81/405 (20%), Positives = 153/405 (37%), Gaps = 8/405 (1%) Query: 112 EVARNASAVAQNTAAAKKSASDASTSAREAATHAADAADSARAASTSAGQAASSAQSASS 171 + S + + S A + + T+ ++ +S SS Sbjct: 30 QKQDQVSLQRRLGELSSNDVSIVHALALLRSIGSDAKQAREEYLETNEVESQASPNHGSS 89 Query: 172 SAGTASTKATEASKSAAAAESSKSAAATSAGAAKTSETNASASLQSAATSASTATT---K 228 A A + + + ++ ++ AA +G TSE S A + A+T + Sbjct: 90 PANDALSSEEKLRRVSSDDAATSEAATGPSGDDATSEAATGPSGDDATSEAATGPSGDDA 149 Query: 229 ASEAATSAR-DAAASKEAAKSSETNASSSASSAASSATAAGNSAKAAKTSETNARSSETA 287 SEAAT D A S+ A S +A+S A++ S A +A + + ++ Sbjct: 150 TSEAATGPSGDDATSEAATGPSGDDATSEAATGPSGDDATSEAATGPSGDDATSEAATGP 209 Query: 288 AGQSASAAAGSKTAAASSASAASTSAGQASASATAAGKSAESAASSASTATTKAGEATEQ 347 +G A++ A + + + S A+T A++ AA + A+S + +AT + Sbjct: 210 SGDDATSEAATGPSGDDATSEAATGPSGDDATSEAATGPSGDDATSEAATGPSGDDATSE 269 Query: 348 ASAAARSASAAKTSETNAKASETSAESSKTAAASSASSAASSASSASASKDEATRQASAA 407 A+ A + T + ++E++ + A+S A++ S + EA S Sbjct: 270 AATGPSGDDATSEAATGPSGDDATSEAATGPSGDDATSEAATGPSGDDATSEAATGPSGD 329 Query: 408 KSSATTASTKATEAAGSATAAAQSKSTAESAATRAETAAKRAEDIASAVALEDAS----T 463 +++ A+ + + A S A S A S A + + A+ + +DA+ T Sbjct: 330 DATSEAATGPSGDDATSEAATGPSGDDATSEAATGPSGDDATSEAATGPSGDDATSEAAT 389 Query: 464 TKKGIVQLSSATNSTSETLAATPKAVKSAYDNAEKRLQKDQNGAD 508 G S A S A + A + D+A +G D Sbjct: 390 GPSGDDATSEAATGPSGDDATSEAATGPSGDDATSEAATGPSGDD 434 Score = 51.8 bits (122), Expect = 1e-04, Method: Composition-based stats. 
Identities = 73/383 (19%), Positives = 145/383 (37%), Gaps = 5/383 (1%) Query: 131 ASDASTSAREAATHAADAADSARAASTSAGQAASSAQSASSSAGTASTKATEASKSAAAA 190 A A E + A+ + ++ + ++ SS A+++A A Sbjct: 65 AKQAREEYLETNEVESQASPNHGSSPANDALSSEEKLRRVSSDDAATSEAATGPSGDDAT 124 Query: 191 ESSKSAAATSAGAAKTSETNASASLQSAATSASTATTKASEAATSAR-DAAASKEAAKSS 249 + + + ++ + + S A + + SEAAT D A S+ A S Sbjct: 125 SEAATGPSGDDATSEAATGPSGDDATSEAATGPSGDDATSEAATGPSGDDATSEAATGPS 184 Query: 250 ETNASSSASSAASSATAAGNSAKAAKTSETNARSSETAAGQSASAAAGSKTAAASSASAA 309 +A+S A++ S A +A + + ++ +G A++ A + + + S A Sbjct: 185 GDDATSEAATGPSGDDATSEAATGPSGDDATSEAATGPSGDDATSEAATGPSGDDATSEA 244 Query: 310 STSAGQASASATAAGKSAESAASSASTATTKAGEATEQASAAARSASAAKTSETNAKASE 369 +T A++ AA + A+S + +AT +A+ A + T + Sbjct: 245 ATGPSGDDATSEAATGPSGDDATSEAATGPSGDDATSEAATGPSGDDATSEAATGPSGDD 304 Query: 370 TSAESSKTAAASSASSAASSASSASASKDEATRQASAAKSSATTASTKATEAAGSATAAA 429 ++E++ + A+S A++ S + EA S +++ A+ + + A S A Sbjct: 305 ATSEAATGPSGDDATSEAATGPSGDDATSEAATGPSGDDATSEAATGPSGDDATSEAATG 364 Query: 430 QSKSTAESAATRAETAAKRAEDIASAVALEDAS----TTKKGIVQLSSATNSTSETLAAT 485 S A S A + + A+ + +DA+ T G S A S A + Sbjct: 365 PSGDDATSEAATGPSGDDATSEAATGPSGDDATSEAATGPSGDDATSEAATGPSGDDATS 424 Query: 486 PKAVKSAYDNAEKRLQKDQNGAD 508 A + D+A +G D Sbjct: 425 EAATGPSGDDATSEAATGPSGDD 447 Score = 51.5 bits (121), Expect = 2e-04, Method: Composition-based stats. 
Identities = 72/393 (18%), Positives = 149/393 (37%), Gaps = 5/393 (1%) Query: 121 AQNTAAAKKSASDASTSAREAATHAADAADSARAASTSAGQAASSAQSASSSAGTASTKA 180 +Q S+ + + A A + + + + S + + + Sbjct: 29 SQKQDQVSLQRRLGELSSNDVSIVHALALLRSIGSDAKQAREEYLETNEVESQASPNHGS 88 Query: 181 TEASKSAAAAESSKSAAATSAGAAKTSETNASASLQSAATSASTATTKASEAATSAR-DA 239 + A+ + ++ E + ++ A ++ + + S A + + SEAAT D Sbjct: 89 SPANDALSSEEKLRRVSSDDAATSEAATGPSGDDATSEAATGPSGDDATSEAATGPSGDD 148 Query: 240 AASKEAAKSSETNASSSASSAASSATAAGNSAKAAKTSETNARSSETAAGQSASAAAGSK 299 A S+ A S +A+S A++ S A +A + + ++ +G A++ A + Sbjct: 149 ATSEAATGPSGDDATSEAATGPSGDDATSEAATGPSGDDATSEAATGPSGDDATSEAATG 208 Query: 300 TAAASSASAASTSAGQASASATAAGKSAESAASSASTATTKAGEATEQASAAARSASAAK 359 + + S A+T A++ AA + A+S + +AT +A+ A Sbjct: 209 PSGDDATSEAATGPSGDDATSEAATGPSGDDATSEAATGPSGDDATSEAATGPSGDDATS 268 Query: 360 TSETNAKASETSAESSKTAAASSASSAASSASSASASKDEATRQASAAKSSATTASTKAT 419 + T + ++E++ + A+S A++ S + EA S +++ A+ + Sbjct: 269 EAATGPSGDDATSEAATGPSGDDATSEAATGPSGDDATSEAATGPSGDDATSEAATGPSG 328 Query: 420 EAAGSATAAAQSKSTAESAATRAETAAKRAEDIASAVALEDAS----TTKKGIVQLSSAT 475 + A S A S A S A + + A+ + +DA+ T G S A Sbjct: 329 DDATSEAATGPSGDDATSEAATGPSGDDATSEAATGPSGDDATSEAATGPSGDDATSEAA 388 Query: 476 NSTSETLAATPKAVKSAYDNAEKRLQKDQNGAD 508 S A + A + D+A +G D Sbjct: 389 TGPSGDDATSEAATGPSGDDATSEAATGPSGDD 421 >UniRef50_C0CXN6 Putative uncharacterized protein (Fragment) n=1 Tax=Clostridium asparagiforme DSM 15981 RepID=C0CXN6_9CLOT Length = 1351 Score = 58.0 bits (138), Expect = 2e-06, Method: Composition-based stats. 
Identities = 65/376 (17%), Positives = 111/376 (29%), Gaps = 31/376 (8%) Query: 108 LMVEEVARNASAVAQNTAAAKKSASDASTSAREAATHAAD-AADSARAASTSAGQAASSA 166 ++ ++ ++ + +TA + + +D S E T A D A A AA+ Sbjct: 60 IIPDQPKQDINEATSSTADKENTNTDQSNQTNENTTPAEDQGAGDAAQDEIDKENAAAPD 119 Query: 167 QSASSSAGTASTKATEASKSAAAAESSKSAAATSAGAAKTSETNASASLQSAATSASTAT 226 Q+ + A + + A++ A A ++ A+ T+ A Sbjct: 120 QNVTDGENNAQAPQEDVAAPDGEAQTP--DAGNQGENAPEADAADQAAEPEGDTAEPNAA 177 Query: 227 TKASEAATSARDAAASKEAAKSSETNASSSASSAASSATAAGNSAKAAKTSETNARSSET 286 + EAA D A +EA E NA AS A N Sbjct: 178 DEPEEAAEPEGDTAEPEEA-AEPEDNAEPIASVIRHYAPVV----------ADNENGEAA 226 Query: 287 AAGQSASAAAGSKTAAASSASAASTSAGQASASATAAGKSAESAASSASTATTKAGEATE 346 A S +A A A + A A +A + A+ T + Sbjct: 227 PADDSQKEQVADVPDSADQAEAPNEDAAVEEPKAEEPKADEPAADAPAAGDTVTTPDGNT 286 Query: 347 QASAAARSASAAKTSETNAKASETSAESSKTAAASSASSAASSASSASASKDEATRQASA 406 A E + TA + + A++ + + D Q Sbjct: 287 ADGAVN-------------TPDENGTGDAATAPDENGTGDAATTPDENGTGDVTAPQEPT 333 Query: 407 AKSSATTASTKAT-EAAGSATAAAQSKSTAESAAT---RAETAAKRAEDIASAVALEDAS 462 ++ A T T A E + A + A E+ + E Sbjct: 334 DETPAPTPDTPAAGEDQNNGQPAGTPDANQPQAPEEELPQESVVTIIPPATPSEGTEQKP 393 Query: 463 TTKKGIVQLSSATNST 478 +K +S+AT S Sbjct: 394 EEQKPEDNVSAATTSD 409 >UniRef50_C6MUP3 Chromosome segregation ATPase-like protein n=1 Tax=Geobacter sp. M18 RepID=C6MUP3_9DELT Length = 747 Score = 57.6 bits (137), Expect = 3e-06, Method: Composition-based stats. 
Identities = 58/369 (15%), Positives = 111/369 (30%), Gaps = 5/369 (1%) Query: 110 VEEVARNASAVAQNTAAAKKSASDASTSAREAATHAADAADSARAASTSAGQAASSAQSA 169 + A A Q+ + A A + A A R A +AA Sbjct: 176 QPDPAEQAELARQDAERKRLEALKAEQERQAAEEAKRQKAAEKREMERRAAEAAKVKAEQ 235 Query: 170 SSSAGTASTKATEASKSAAAAESSKSAAATSAGAAKTSE--TNASASLQSAATSASTATT 227 A + + A++ A + + + A+ A + Q Sbjct: 236 EQVAKEKAAQELLAAERARQEKLALERERMAEAKAEQVRRVAEAKRAEQELQERREAEME 295 Query: 228 KASEAATSARDAAASKEAAKSSETNASSSASSAASSATAAGNSAKAAKTSETNARSSETA 287 + + A A +E + + A++ A +A+ A+ +E AR E Sbjct: 296 RIAAQRAQAEKQAREREQLAAEKAEQEREAAAERERAEQERKAAQRAE-AERVAREREQL 354 Query: 288 AGQSASAAAGSKTAAASSASAASTSAGQASASATAAGKSAESAASSAST--ATTKAGEAT 345 A + A A A +A AA ++ + A KA + Sbjct: 355 AAEKAEQRRQEAAEKKRQAEQALQEKRKAEQERRAARRAEAERLAREREQLAAEKAEKLR 414 Query: 346 EQASAAARSASAAKTSETNAKASETSAESSKTAAASSASSAASSASSASASKDEATRQAS 405 A R A A+ +A+ ++ + A++ + A R+ + Sbjct: 415 LAAVEKKRKAEQLLQERLKAEQERKAAQRAQQERLALEREMAAAKKAEQQRVAAAERKRA 474 Query: 406 AAKSSATTASTKATEAAGSATAAAQSKSTAESAATRAETAAKRAEDIASAVALEDASTTK 465 + + AA A A ++ AA +AE A + A + + + Sbjct: 475 EQELQERKRAEADRLAALRAQAEQLARERERLAAEKAELARRAAALERANAEKIRVAAIQ 534 Query: 466 KGIVQLSSA 474 K + + A Sbjct: 535 KPSLPAAGA 543 >UniRef50_UPI00016A9E96 hypothetical protein BoklC_19619 n=2 Tax=Burkholderia oklahomensis RepID=UPI00016A9E96 Length = 469 Score = 57.6 bits (137), Expect = 3e-06, Method: Composition-based stats. 
Identities = 62/303 (20%), Positives = 110/303 (36%), Gaps = 2/303 (0%) Query: 135 STSAREAATHAADAADSARAASTSAGQAASSAQSASSSAGTASTKATEAS--KSAAAAES 192 + + T A+ A SARA + +A A S A ++ A + +SAA A Sbjct: 31 AAAREAKPTAKAENARSARATARGGERAGEQADLLSGFAEDSAAHAQREADGRSAADAAG 90 Query: 193 SKSAAATSAGAAKTSETNASASLQSAATSASTATTKASEAATSARDAAASKEAAKSSETN 252 +++ A SAG + + N+ S + A S T + A A + Sbjct: 91 TRTQDAASAGESVSVPANSVESASADAGSGRRDTPAVQVFEPATAQAGIDDAPAAGRTDS 150 Query: 253 ASSSASSAASSATAAGNSAKAAKTSETNARSSETAAGQSASAAAGSKTAAASSASAASTS 312 SS+ A S AA A S+ +A+ ETA GQ+A + ++ Sbjct: 151 RRSSSRRANSKKQAAQRGAHRGSASQGDAKRDETARGQTAQVQGEAAHDPTPEHPTRQSA 210 Query: 313 AGQASASATAAGKSAESAASSASTATTKAGEATEQASAAARSASAAKTSETNAKASETSA 372 A +A ++ A + + ST E S A A+A +++T A Sbjct: 211 ADEARSTEAPAAPDTANIEPTDSTRADPNQANPEPPSPVATRAAAVPERRDTGASADTPA 270 Query: 373 ESSKTAAASSASSAASSASSASASKDEATRQASAAKSSATTASTKATEAAGSATAAAQSK 432 + + +A+ AA+ +S+ S + + + + T +A A + Sbjct: 271 LAKTPRTSGNATPAAALSSAPSGTGGGNVAGVPTQSTPSAASPTSFPASASPARERQTAA 330 Query: 433 STA 435 Sbjct: 331 PQT 333 >UniRef50_A7A5H3 Putative uncharacterized protein n=1 Tax=Bifidobacterium adolescentis L2-32 RepID=A7A5H3_BIFAD Length = 1748 Score = 57.6 bits (137), Expect = 3e-06, Method: Composition-based stats. 
Identities = 75/247 (30%), Positives = 116/247 (46%), Gaps = 8/247 (3%) Query: 111 EEVARNASAVAQNTAAAKKSASDASTSAREAATHAADAADSARAASTSAGQAASSAQSAS 170 +V RNA +AQ + +A+ + +A A +AA +A+ A +A A +A++A Sbjct: 839 ADVERNADEIAQAKSDIADNAAKTT----DAKKTAENAAAAAKNAQGTADTATGAAKTAQ 894 Query: 171 SSAGTASTKATEASKSAAAAESSKSAAATSAGAAKTSETNASASLQSAATSASTATTKAS 230 +A A T A A+ +A A+S+ AA T+A +AK + NA +A SA A + AS Sbjct: 895 DTANAAQTAAKSATTTAGQAKSAADAAQTAAESAKKTAGNAETLANTANESAKAAKSDAS 954 Query: 231 EAATSARDAAASKEAAKSSETNASSSASSAASSATAAGNSAKAAKTSETNARSSETAAGQ 290 A T A +A + A S T A ++A SAA SAT A +A+ A T+ A Sbjct: 955 TAKTDAANAKTTAANASSVATQAKATADSAAQSATDAATAARKANTAAAAAAGVANGKAD 1014 Query: 291 ---SASAAAGSKTAAASSASAASTSAGQASASATAA-GKSAESAASSASTATTKAGEATE 346 +A A S A++ + A +A + AA+ A+ A KA +A + Sbjct: 1015 VLIQGTAPATSMRKASTLWIDTTNGANTPKRWNGSAWVAVTDKAATDAANAAVKANDAAK 1074 Query: 347 QASAAAR 353 A A A Sbjct: 1075 TAQATAD 1081 Score = 54.1 bits (128), Expect = 3e-05, Method: Composition-based stats. Identities = 74/245 (30%), Positives = 126/245 (51%), Gaps = 5/245 (2%) Query: 155 ASTSAGQAASSAQSASSSAGTASTKATEASKSAAAAESSKSAAATSAGAAKTSETNASAS 214 +A + A + + +A + A +AAAA++++ A T+ GAAKT++ A Sbjct: 841 VERNADEIAQAKSDIADNAAKTTDAKKTAENAAAAAKNAQGTADTATGAAKTAQDTA--- 897 Query: 215 LQSAATSASTATTKASEAATSARDAAASKEAAKSSETNASSSASSAASSATAAGNSAKAA 274 +A T+A +ATT A +A ++A A + E+AK + NA + A++A SA AA + A A Sbjct: 898 -NAAQTAAKSATTTAGQAKSAADAAQTAAESAKKTAGNAETLANTANESAKAAKSDASTA 956 Query: 275 KTSETNARSSETAAGQSASAAAGSKTAAASSASAASTSAGQASASATAAGKSAESAASSA 334 KT NA+++ A A+ A + +AA SA+ A+T+A +A+ +A AA A A Sbjct: 957 KTDAANAKTTAANASSVATQAKATADSAAQSATDAATAARKANTAAAAAAGVANGKADVL 1016 Query: 335 STATTKAGEATEQASAAARSASAAKTSET-NAKASETSAESSKTAAASSASSAASSASSA 393 T A + ++ + + A T + N A + + T AA++A A +A +A Sbjct: 1017 IQGTAPATSMRKASTLWIDTTNGANTPKRWNGSAWVAVTDKAATDAANAAVKANDAAKTA 1076 Query: 394 SASKD 398 A+ D Sbjct: 1077 QATAD 1081 Score = 45.3 bits (105), Expect = 0.014, Method: Composition-based stats. 
Identities = 74/394 (18%), Positives = 137/394 (34%), Gaps = 15/394 (3%) Query: 113 VARNASAVAQNTAAAKKSASDASTSAREAATHAADAA---DSARAASTSAGQAASSAQSA 169 +A + +A A + + A S E D A + + S QA + Sbjct: 593 AVDHAGNKSDWSAIATVTVASAV-SPDEVKQIQKDLADNQTALKDNSAKLTQAQKDIAAN 651 Query: 170 SSSAGTASTKATEASKSAAAAESSKSAAATSAGAAKTSETNASASLQSAATSASTATTKA 229 + S + A A +S+ A + + A +Q+ ++ A+ Sbjct: 652 QQAQAATSKELESAKADIKANQSAIGTANATLKDNTSKIAQAQKDIQANKSNLDAASKTL 711 Query: 230 SEAATSARDAAASKEAAKSSETNASSSASSAASSATAAGNSAKAAKTSETNARSSETAAG 289 ++A T A KS T A+ S A SA A A + + + Sbjct: 712 AQAKTDLTQAQKDIAQTKSDLTTANGEISKAKKSAAQAYAEAHSKNHTFRGPDMPKDNLI 771 Query: 290 QSASAAAGSKTAAASSASAASTSAGQAS-----------ASATAAGKSAESAASSASTAT 338 K ++ + A + + S + Sbjct: 772 VGDLWLKTQKYWTRWRGEKNNSPSLLADFYTYWLGTPNASPSVLVPLSDRVIDTLVWDGA 831 Query: 339 TKAGEATEQASAAARSASAAKTSETNAKASETSAESSKTAAASSASSAASSASSASASKD 398 A + AK+ + A T A+ + AA++A +A +A +A+ + Sbjct: 832 AWNHMGYADVERNADEIAQAKSDIADNAAKTTDAKKTAENAAAAAKNAQGTADTATGAAK 891 Query: 399 EATRQASAAKSSATTASTKATEAAGSATAAAQSKSTAESAATRAETAAKRAEDIASAVAL 458 A A+AA+++A +A+T A +A +A AA + +A+ A AET A A + A A Sbjct: 892 TAQDTANAAQTAAKSATTTAGQAKSAADAAQTAAESAKKTAGNAETLANTANESAKAAKS 951 Query: 459 EDASTTKKGIVQLSSATNSTSETLAATPKAVKSA 492 + ++ ++A N++S A A +A Sbjct: 952 DASTAKTDAANAKTTAANASSVATQAKATADSAA 985 >UniRef50_C5M4U8 Predicted protein n=1 Tax=Candida tropicalis MYA-3404 RepID=C5M4U8_CANTT Length = 1689 Score = 57.2 bits (136), Expect = 3e-06, Method: Composition-based stats. 
Identities = 99/416 (23%), Positives = 205/416 (49%), Gaps = 13/416 (3%) Query: 112 EVARNASAVAQNTAAAKKSASDASTSAREAATHAADAADSARAASTSAGQAASSAQSASS 171 E +A++ A + + + SA+ ++ A E+ A A A ++ SA ASSA AS Sbjct: 575 ESDASATSGASDASESDASATSGASDASESNASATSGASDASESNASATSGASSASDASE 634 Query: 172 SAGTASTKATEASKSAAAAESSKSAAATS--------AGAAKTSETNASASLQSAATSAS 223 S +A++ A++AS+S A+A S S+A+ + +GA+ S+ + S + ++ S++ Sbjct: 635 SDASATSGASDASESNASATSGVSSASDAGESDASATSGASSASDASESNASATSGVSSA 694 Query: 224 TATTKASEAATSARDAAASKEAAKSSETNASSSASSAASSATA-----AGNSAKAAKTSE 278 + +++ +ATS +A+ + +S T+ +S AS + +SAT+ + S A + Sbjct: 695 SDASESDASATSGVSSASDAGESDASATSGASDASESNASATSGVSSASDASESDASATS 754 Query: 279 TNARSSETAAGQSASAAAGSKTAAASSASAASTSAGQASASATAAGKSAESAASSASTAT 338 + +SE+ A ++ ++ S + +++++ + S+ ++ + A+ S S+AS A + Sbjct: 755 GASDASESNASATSGVSSASDASESNASATSGVSSASDASESNASATSGVSSASDAGESD 814 Query: 339 TKAGEATEQASAAARSASAAKTSETNAKASETSAESSKTAAASSASSAASSASSASASKD 398 A AS + SA++ +S ++A S+ SA S + A+ S +SA S SSAS + + Sbjct: 815 ASATSGASDASESDASATSGVSSASDASESDASATSGASDASESNASATSGVSSASDASE 874 Query: 399 EATRQASAAKSSATTASTKATEAAGSATAAAQSKSTAESAATRAETAAKRAEDIASAVAL 458 S S++ + + A+ +G+++A+ S+S A + + + + + ++ + Sbjct: 875 SNASATSGVSSASDASESNASATSGASSASDASESDASATSGVSSASDASESNASATSGV 934 Query: 459 EDASTTKKGIVQLSSATNSTSETLAATPKAVKSAYDNAEKRLQKDQNGADIPDKGC 514 AS + +S + SE+ A+ V SA D +E + D Sbjct: 935 SSASDASESDASATSGASDASESDASATSGVSSASDASESNASATSGVSSASDASE 990 Score = 54.5 bits (129), Expect = 3e-05, Method: Composition-based stats. 
Identities = 96/391 (24%), Positives = 200/391 (51%), Gaps = 3/391 (0%) Query: 112 EVARNASAVAQNTAAAKKSASDASTSAREAATHAADAADSARAASTSAGQAASSAQSASS 171 A +A + + AS+++ SA + A+DA++S +A++ A A+ S SA+S Sbjct: 709 SSASDAGESDASATSGASDASESNASATSGVSSASDASESDASATSGASDASESNASATS 768 Query: 172 SAGTASTKATEASKSAAAAESSKSAAATSAGAAKTSETNASASLQSAATSASTATTKASE 231 +AS A+E++ SA + SS S A+ S +A + ++AS + +S A++ S A+ + Sbjct: 769 GVSSAS-DASESNASATSGVSSASDASESNASATSGVSSASDAGESDASATSGASDASES 827 Query: 232 --AATSARDAAASKEAAKSSETNASSSASSAASSATAAGNSAKAAKTSETNARSSETAAG 289 +ATS +A+ + +S T+ +S AS + +SAT+ +SA A S +A S ++A Sbjct: 828 DASATSGVSSASDASESDASATSGASDASESNASATSGVSSASDASESNASATSGVSSAS 887 Query: 290 QSASAAAGSKTAAASSASAASTSAGQASASATAAGKSAESAASSASTATTKAGEATEQAS 349 ++ + A + + A+S++ A+ + A S ++A+ S +A++++ ++ ++ ++ Sbjct: 888 DASESNASATSGASSASDASESDASATSGVSSASDASESNASATSGVSSASDASESDASA 947 Query: 350 AAARSASAAKTSETNAKASETSAESSKTAAASSASSAASSASSASASKDEATRQASAAKS 409 + S ++ + + S S S A+A+S S+AS AS ++AS AS A Sbjct: 948 TSGASDASESDASATSGVSSASDASESNASATSGVSSASDASESNASATSGVSSASDASE 1007 Query: 410 SATTASTKATEAAGSATAAAQSKSTAESAATRAETAAKRAEDIASAVALEDASTTKKGIV 469 S +A++ A+ A+ ++ + A + S SA+ +E+ A +SA +++ + V Sbjct: 1008 SNASATSGASSASDASESNASATSGVSSASDASESNASATSGASSASDASESNASATSGV 1067 Query: 470 QLSSATNSTSETLAATPKAVKSAYDNAEKRL 500 +S + +S + + + +A + Sbjct: 1068 SSASDASESSASATSGASDASESNASATSGV 1098 Score = 54.5 bits (129), Expect = 3e-05, Method: Composition-based stats. 
Identities = 76/325 (23%), Positives = 175/325 (53%), Gaps = 5/325 (1%) Query: 116 NASAVAQNTAAAKKSASDASTSAREAATHAADAADSARAASTSAGQAASSAQSASSSAGT 175 + + + A+ +++ + S+ A + +A S + ++ + +A+S S++S A Sbjct: 688 TSGVSSASDASESDASATSGVSSASDAGESDASATSGASDASESNASATSGVSSASDASE 747 Query: 176 ASTKATEASKSAAAAESSKSAAATSAGAAKTSETNASASLQSAATSASTATTKASEAATS 235 + AT + A+ + +S ++ +SA A S +A++ + SA+ AS + A+ +S Sbjct: 748 SDASATSGASDASESNASATSGVSSASDASESNASATSGVSSASD-ASESNASATSGVSS 806 Query: 236 ARDAAASKEAAKSSETNASSSASSAASSATAAGNSAKAAKTSETNARSSETAAGQSASAA 295 A DA S +A S ++AS S +SA S ++A +++++ ++ + A + + + S Sbjct: 807 ASDAGESDASATSGASDASESDASATSGVSSASDASESDASATSGASDASESNASATSGV 866 Query: 296 AGSKTAAASSASAASTSAGQASASATAAGKSAESAASSASTATTKAGEATEQASAAARSA 355 + + A+ S+ASA S + + AS + A ++ ++++S ++ + + + +++ A + Sbjct: 867 SSASDASESNASATSGVSSASDASESNASATSGASSASDASESDASATSGVSSASDASES 926 Query: 356 SAAKTSETNAKASETSAESSKTAAASSASSAASSASSASASKDEATRQASAAKSSATTAS 415 +A+ TS ++ + + +++S T+ AS AS + +SA+S +S +A S + +SAT+ Sbjct: 927 NASATSGVSSASDASESDASATSGASDASESDASATSGVSSASDA----SESNASATSGV 982 Query: 416 TKATEAAGSATAAAQSKSTAESAAT 440 + A++A+ S +A S+A A+ Sbjct: 983 SSASDASESNASATSGVSSASDASE 1007 Score = 49.5 bits (116), Expect = 8e-04, Method: Composition-based stats. 
Identities = 92/422 (21%), Positives = 194/422 (45%), Gaps = 19/422 (4%) Query: 112 EVARNASAVAQNTAAAKKSASDASTSAREAATHAADAADSARAASTSAGQAASSAQSASS 171 V ++ A A + ++ S AS ++ + +D + ++ +S S + S S+S+ Sbjct: 470 NVLESSDASATSDVSSASDISSASDASDSDPSSTSDVSSASDNSSASDISSTSEIASSSN 529 Query: 172 SAGTASTKATEASKSAAAAESSKSAAATSAGAAKTSETNASASLQSAATSASTATTKASE 231 ++ + + + + A+ +S +++A+ A + S T+ ++ + SA++ + ASE Sbjct: 530 TSSASDISTSSDASESNASATSGASSASDARESNASATSGASDASESDASATSGASDASE 589 Query: 232 AATSARDAAASKEAAKSSETNASSSASSAASSATAAGNSAKAAKTSETNARSSETAAGQS 291 + SA A+ + +S T+ +S AS + +SAT+ +SA A S+ +A S + A +S Sbjct: 590 SDASATSGASDASESNASATSGASDASESNASATSGASSASDASESDASATSGASDASES 649 Query: 292 ASAAAGSKTAAASSAS-------------------AASTSAGQASASATAAGKSAESAAS 332 ++A ++A+ + A++TS +++ A+ + SA S S Sbjct: 650 NASATSGVSSASDAGESDASATSGASSASDASESNASATSGVSSASDASESDASATSGVS 709 Query: 333 SASTATTKAGEATEQASAAARSASAAKTSETNAKASETSAESSKTAAASSASSAASSASS 392 SAS A AT AS A+ S ++A + ++A + S S+ + A+ ++ S AS+ S Sbjct: 710 SASDAGESDASATSGASDASESNASATSGVSSASDASESDASATSGASDASESNASATSG 769 Query: 393 ASASKDEATRQASAAKSSATTASTKATEAAGSATAAAQSKSTAESAATRAETAAKRAEDI 452 S++ D + ASA ++ + + A+ ++ ++ S + A+ + + D Sbjct: 770 VSSASDASESNASATSGVSSASDASESNASATSGVSSASDAGESDASATSGASDASESDA 829 Query: 453 ASAVALEDASTTKKGIVQLSSATNSTSETLAATPKAVKSAYDNAEKRLQKDQNGADIPDK 512 ++ + AS + +S + SE+ A+ V SA D +E + D Sbjct: 830 SATSGVSSASDASESDASATSGASDASESNASATSGVSSASDASESNASATSGVSSASDA 889 Query: 513 GC 514 Sbjct: 890 SE 891 >UniRef50_A4ED70 Putative uncharacterized protein n=2 Tax=Bacteria RepID=A4ED70_9ACTN Length = 1116 Score = 57.2 bits (136), Expect = 4e-06, Method: Composition-based stats. 
Identities = 82/369 (22%), Positives = 148/369 (40%), Gaps = 15/369 (4%) Query: 148 AADSARAASTSAGQAASSAQSASSSAGTASTKATEASKSAAAAESSKSAAATSAGAAKTS 207 A + AA+ +A +S ++ A T + + A + +A+ T A +A + Sbjct: 29 ARLTVTAATMAAMTGSSLISPFTAFAQTGDGGTQHPAVMSPIAAHAGAASGTGAKSAAQA 88 Query: 208 ETNASASLQSAATSASTATTKASEAATSARDAAASKEAAKSSETNASSSASSAASSAT-- 265 + ++ A A EAA +AA++++ AK+S +A S+ ++A +A Sbjct: 89 IADLQKAVDEAKAKEDAAKASYDEAAGPYNEAASARDQAKASYDSAVSAGTAADRAAMDE 148 Query: 266 ------AAGNSAKAAKTSETNARSSETAAGQSASAAAGSKTAAASSASAASTSAGQASAS 319 ++A AA A++ A AS + +A +A A + +A A Sbjct: 149 YARQVAEGKDAADAAGKDLEQAKAGLADAKADASEKDEAYQSALKAAQDAKDALDKAKAD 208 Query: 320 ATAAGKSAESAASSASTATTKAGEATEQASAAARSASAAKTSETNAKASETSAESSKTAA 379 A +A A SAA A A + + A A + A S+ A S + + AA Sbjct: 209 AVSATPEAISAAEQAVRDAQAAVDRAQAGLANANATLADAQSKLVAAQSAKDSADAVLAA 268 Query: 380 ASSASSAASSASSASASKDEATRQASAAKSSATT------ASTKATEAAGSATAAAQSKS 433 A AA + ++A+++ E + AA + T+ A K +A + AA +S Sbjct: 269 AQQNKDAADAKAAAASAAYEKAKADLAAAEAGTSGPEYDAAKQKVADAEATLAAARAVQS 328 Query: 434 TAESAATRAETAAKRAEDIASAVALEDASTTKKGIVQLSSATNSTSETLAATPKAVKSAY 493 ES + ++AA A+ + A S ++ V +S N L A + +A Sbjct: 329 QCESELEQVQSAAATAQAELND-AQASLSAKQQAAVDAASGVNDAQSALDAANSDLDAAK 387 Query: 494 DNAEKRLQK 502 + K Sbjct: 388 QANADAIAK 396 Score = 52.2 bits (123), Expect = 1e-04, Method: Composition-based stats. 
Identities = 68/326 (20%), Positives = 122/326 (37%), Gaps = 10/326 (3%) Query: 107 ELMVEEVARNASAVAQNTAAAKKSASDASTSAREAATHAADAADSARAASTSAGQAASSA 166 + V + R A + AA ++ +DA T +AA DAA++A + S Q A Sbjct: 727 KQGVADAQRAVDATKADVAAKQQGVTDAQT-ELDAAKSDLDAANAAVDTAKSTVQQKQVA 785 Query: 167 QSASSSAGTASTKATEASKSAAAAE-SSKSAAATSAGAAKTSETNASASLQSAATSASTA 225 A+++A T + +++K+ AA+ A +A +L +A + A Sbjct: 786 FDAANAAVTTAQSKLDSAKADTAAKQQGVDDANADLAKFFQDVADAKKALDTAKSVHDAA 845 Query: 226 TTKASEAATSARDAAASKEAAKSSETNASSSASSAASSATAAGNSAKAAKTSETNARSSE 285 E AT A +A + +A + +A + A + ++T +A Sbjct: 846 AADQVEKATVLAAAEQKADATARALADAQRAVDAAKADIGVAADRLTGSQTDLDDA---- 901 Query: 286 TAAGQSASAAAGSKTAAASSASAASTSAGQASASATAAGKSAESAASSASTATTKAGEAT 345 QS A + A A +A AA +A++ +A + + A +A Sbjct: 902 ----QSNLDILTGLAAKLAEAQQREQDAVKAVNDTKAALDAAKADTIAAESLVSAAEQAK 957 Query: 346 EQASAAARSASAAKTSETNAKASETSAESSKTAAASSASSAASSASSASASKDEATRQAS 405 QA A ++ A + +A+ + A ++A A + + A A DE Sbjct: 958 AQADAKLSKLNSIDAGAAIASGHDVNADDALNALFAAAVEARAKVAPAKAILDEKQAAVD 1017 Query: 406 AAKSSATTASTKATEAAGSATAAAQS 431 +S A +A AA Q Sbjct: 1018 GLQSGYDAALAAYEQAKSDRIAAEQK 1043 Score = 48.8 bits (114), Expect = 0.001, Method: Composition-based stats. 
Identities = 63/252 (25%), Positives = 105/252 (41%), Gaps = 2/252 (0%) Query: 94 TEDDARPEALRRFELMVEEVARNASAVAQNTAAAKKSASDASTSAREAATHAADAADSAR 153 A PEA+ E V + A A + +DA S AA A D+AD+ Sbjct: 208 DAVSATPEAISAAEQAVRDAQAAVDRAQAGLANANATLADAQ-SKLVAAQSAKDSADAVL 266 Query: 154 AASTSAGQAASSAQSASSSAGTASTKATEASKSAAAAESSKSAAATSAGAAKTSETNASA 213 AA+ AA + +A+S+A + A+++ + AA A+ + A A Sbjct: 267 AAAQQNKDAADAKAAAASAAYEKAKADLAAAEAGTSGPEY-DAAKQKVADAEATLAAARA 325 Query: 214 SLQSAATSASTATTKASEAATSARDAAASKEAAKSSETNASSSASSAASSATAAGNSAKA 273 + + A+ A DA AS A + + +A+S + A S+ AA + A Sbjct: 326 VQSQCESELEQVQSAAATAQAELNDAQASLSAKQQAAVDAASGVNDAQSALDAANSDLDA 385 Query: 274 AKTSETNARSSETAAGQSASAAAGSKTAAASSASAASTSAGQASASATAAGKSAESAASS 333 AK + +A + AA Q+ A +K AA + A T+ A A+ TAA + + A + Sbjct: 386 AKQANADAIAKLDAAKQAVKDAESAKAAADVELANAKTAKDTADAAVTAAQQKVDEAQAK 445 Query: 334 ASTATTKAGEAT 345 +A + + Sbjct: 446 LDSADAQLKQGA 457 >UniRef50_UPI00017F7AFB YALI0E22572p n=1 Tax=Yarrowia lipolytica RepID=UPI00017F7AFB Length = 1601 Score = 56.8 bits (135), Expect = 4e-06, Method: Composition-based stats. 
Identities = 71/430 (16%), Positives = 154/430 (35%), Gaps = 16/430 (3%) Query: 79 EDSQPGTLNDFLGAMTEDDARPEALRRFELMVEEVARNASAVAQNTAAAKKSASDASTSA 138 + P T ++ ++ ++ PE+ E + S+ + S + TS Sbjct: 465 NEPIPETTSEVPSSVETVESTPESTTEASTESVEPSSTESSTDPVPESTTVSVTSDPTSE 524 Query: 139 REAATHAADAADSARAASTSAGQAAS--SAQSASSSAGTASTKATEASKSAAAAESSK-- 194 T ++D + +ST S + S A A+ S+S+ S Sbjct: 525 PFPETTSSDPEIAQNDSSTVGPSTTEEYSVEPTPDSTSEAPQSASGTSESSTEYTSEAVT 584 Query: 195 ----SAAATSAGAAKTSETNASASLQSAATSASTATTKASEAATSARDAAASKEAAKSSE 250 ++ T+ + TSE + SL ++ S ++E+ + A + + S Sbjct: 585 PFPTPSSETTFINSTTSEPGTTESLSTSELSTEATVEPSTESTSEAVTPLPTPSSETSFI 644 Query: 251 TNASSSASSAASSATAAGNSAK-AAKTSETNARSSE------TAAGQSASAAAGSKTAAA 303 + +S ++ S++ A +++TS N+ +SE T + S+T+ Sbjct: 645 NSTTSELGASESTSETATPLPTPSSETSFINSTTSELGSTGSTPETVTPLPTPSSETSFI 704 Query: 304 SSASAASTSAGQASASATAAGKSAESAASSASTATTKAGEATEQASAAARSASAAKTSET 363 +S ++ ++ S +AT + + ST + + + +++TS Sbjct: 705 NSTTSELGASESTSETATPLPTPSSETSFINSTTSELGSTGSTSEAVTPLPTPSSETSFI 764 Query: 364 NAKASETSAESSKTAAASSASSAASSASSASASKDEATRQASAAKSSATTASTKATEAAG 423 N+ SE S + A +S SS ++ S A + +AT T ++E Sbjct: 765 NSTTSELGTTESTSEAVTSLP-TPSSETTFINSTTSALGTTESTSETATPLPTSSSETDF 823 Query: 424 SATAAAQSKSTAESAATRAETAAKRAEDIASAVALEDASTTKKGIVQLSSATNSTSETLA 483 + + ++ T + + ++ + D S T G ++ + S T Sbjct: 824 ANSTTSEPVITTSPPTSESPVTTDSPSTASNESVILDGSLTSVGFPNSTTVVSQNSTTEL 883 Query: 484 ATPKAVKSAY 493 + A Sbjct: 884 ESFTESSVAS 893 >UniRef50_Q2UB42 Predicted protein n=2 Tax=Aspergillus RepID=Q2UB42_ASPOR Length = 1429 Score = 56.8 bits (135), Expect = 5e-06, Method: Composition-based stats. 
Identities = 65/443 (14%), Positives = 129/443 (29%), Gaps = 15/443 (3%) Query: 77 VYEDSQPGTLNDFLGAMTEDDARPEALRRFELMVEEVARNASAVAQNTAAAKKSASDAST 136 + P T + +A+ EA ++ E + A AV + A + ++ T Sbjct: 252 KKAEEVPATSTQEEAPVVNGEAKVEAEKQPEESQPQKATEKVAV-KPEEPAVEMSTPEVT 310 Query: 137 SAREAATHAADAADSARAASTSAGQAASSAQSASSSAGTASTKATEASKSAAAAESSKSA 196 + A + +A + A A +A +TK + S Sbjct: 311 AEETPAQESNTEKPTATETPAAETTTEQPATEAQPTAEEETTKEPATEPAETKQAEVVSE 370 Query: 197 AATSAGAAKTSETNASASLQSAATSASTATTKASEAATSARDAAASKEAAKSSETNASSS 256 A A+ + S ++ + + A++E K T A Sbjct: 371 EPAKEEPTPEEPKEAPATEELVKES-VKEEAAPEQSKETVSEKPAAEEPVKEETTEAVK- 428 Query: 257 ASSAASSATAAGNSAKAAKTSETNARSSETAAGQSASAAAGSKTAAASSASAA-STSAGQ 315 +SAT + + A E A +E + + S + A + Sbjct: 429 ----ETSATEKLDESDKAPVQEETA--AEESQETTKEPVKEEIAPGKSEETPAIKAPVAE 482 Query: 316 ASASATAAGKSAESAASSASTATTKAGEATEQASAAARSASAAKTSETNAKASETSAESS 375 A + A +A A + ET+A+ A Sbjct: 483 EPAVEEPVEEKAAPEEPKDISAADPAVVEAPVKETVKEEGVSEAPKETSAEEPVKEAVKE 542 Query: 376 KTAAASSASSAASSASSASASKDEATRQASAAKSSATTASTKATEAAGSATAAAQSKSTA 435 + + AA + A E ++ + + A+ K + A Sbjct: 543 EPVPEKTEQPAAPEPAVAEEPAKEPVKEEPVPEKTEEPAAPKESVKEIVKEEAVSEAPKE 602 Query: 436 ESAATRAETAAKRAEDIASAVALEDAS----TTKKGIVQLSSATNSTSETLAATPKAVKS 491 SA A + ++ + +A ++ LS+A E++A P +KS Sbjct: 603 TSAEEPATNDTAAQDKPSTEETVVEAVKEVPVVEETKETLSTAAPDAQESVAQEPV-IKS 661 Query: 492 AYDNAEKRLQKDQNGADIPDKGC 514 + ++GA+ + Sbjct: 662 SATEDAPSEPTKESGAEKAVEEQ 684 Score = 53.8 bits (127), Expect = 4e-05, Method: Composition-based stats. 
Identities = 72/400 (18%), Positives = 110/400 (27%), Gaps = 13/400 (3%) Query: 101 EALRRFELMVEEVARNASAVAQNTAAAKKSASDASTSAREAATHAADAADSARAASTSAG 160 E+++ V + A K + A + A+ Sbjct: 962 ESVKELAAEEAAVQEPTPVEQPSEEPAVKESP--VEEPPTVEGSVAAEPSTEEPAAEQPA 1019 Query: 161 QAASSAQSASSSAGTASTKATEASKSAAAAESSKSAAATSAGAAKTSETNASASLQSAAT 220 + A ++ A+ +A E + A K S + ET A S+ Sbjct: 1020 KEAELVSEVLTTEEPATKQAAEPEPAEQPAAEEKP-VDDSTEKPASEETVAEDSVPEPTE 1078 Query: 221 SASTATTKASEAATSARDAAASKEAAKSSETNASSSASSAASS--ATAAGNSAKAAKTSE 278 A A T A+E + S E A SA ASS S+KA E Sbjct: 1079 KAVVAETPATEEVVEKATTEEPAKEPVSEEPVADKSAEQPASSDVVEETAESSKAVVPEE 1138 Query: 279 TNARSSETAAGQSASAAAGSKTAAASSASAASTSAGQASASATAAGKSAESAASSASTAT 338 E A QS T + A+ + + A ++ A Sbjct: 1139 VTLNEVEPAVTQSHEEKPEEVTELPKEETVAAEPVAVETPAEQKAPETEPEVVPKAEAEE 1198 Query: 339 TKAGEATEQASAAARSASAAKTSETNAKASETSAESSKTAAASSASSAASSASSASASKD 398 + E AS +A A E S K S ++ A+K+ Sbjct: 1199 AVIAQKEEPASEPVAAAPAEAEKEEPVSVSSGEPPVEK-------ESTDTAEVVPEATKE 1251 Query: 399 EATRQASAAKSSATT-ASTKATEAAGSATAAAQSKSTAESAATRAETAAKRAEDIASAVA 457 +A Q +SA A A SA AAA + ETA+ Sbjct: 1252 DAPEQDGKNNASAELIGVGAAAAVAASAAAAAAGVAALSHTDKEPETASAEESQPTKGED 1311 Query: 458 LEDASTTKKGIVQLSSATNSTSETLAATPKAVKSAYDNAE 497 + + A SE A +S + Sbjct: 1312 KAASLAPESQPSASKHAPLPPSENPEADINEPRSGSEEPA 1351 >UniRef50_UPI00017935A3 PREDICTED: similar to neurofilament, heavy polypeptide isoform 1 n=2 Tax=Acyrthosiphon pisum RepID=UPI00017935A3 Length = 431 Score = 56.8 bits (135), Expect = 5e-06, Method: Composition-based stats. 
Identities = 75/407 (18%), Positives = 141/407 (34%), Gaps = 9/407 (2%) Query: 110 VEEVARNASAVAQNTAAAKKSASDASTSAREAATHAADAADSARAASTSAGQAASSAQSA 169 V EVA AV K+ + T+A + A +DS + + + Sbjct: 23 VGEVAPAVPAVPSEAVPQKQVEAKPETNAASPVSDAKPESDSKPVDAEVKPTVSEVKAES 82 Query: 170 SSSAGTASTKATEASKSAAAAESSKSAAATSAGAAKTSETNASASLQSAATSASTATTKA 229 ++A A+ +S +A E +A A + A Sbjct: 83 EQKPSGEPKPESDAKPVVASESKPESDPKPAAVVESKPENDAVAPETNNDAKPENAAAPV 142 Query: 230 SEAATSARDAAASKEAAKSSETNASSSASSAASSATAAGNSAKAAKTSETNARSSETAAG 289 SE + A ++ A++ S AS + AA +++ N ++ Sbjct: 143 SENKPATDAKAETELIAQAKPE--SKPASDLKAEPEAAKPNSEVPVALPLNPTETKATQQ 200 Query: 290 QSASAAAGSKTAAASSASAASTSAGQASASATAAGKSAESAASSASTATTKAGEATEQAS 349 + AA+ A A+ A + + AA +AE A S S +T+ A E + Sbjct: 201 SVETNQVEQAAPAAAQADPAAAPAADPAPAPAAAPVAAEEAKLSESAPSTENKAAEEPSK 260 Query: 350 AAARSASAAKTSETNAKASETSAESSKTAAASSASSAASSASSASASKDEATRQASAAKS 409 A + +++ A ++E S+T + + + +S SA A A+ Sbjct: 261 PAEQQ-----SAKPVEDAVPAASEISETKVSPAVPAVPEVPASPSAPAVADPVSAPEAEK 315 Query: 410 SATTASTKATEAAGSATAAAQSKSTAESAATRAETAAKRAEDIASAVALEDASTTKKGIV 469 +A A + + A ++ +S A + K E+ A + A +K Sbjct: 316 NAEPAKAANSAEPAVQSEAKPAEDIQKSGAVVSAENPKPVEEQKPAEVAKPAEQSKS--E 373 Query: 470 QLSSATNSTSETLAATPKAVKSAYDNAEKRLQKDQNGADIPDKGCFL 516 + A T ++ A PK +SA D +++ ++ A K Sbjct: 374 APAEAPKPTEQSAAEEPKKPESANDEKKEQHSVNKRDATKEKKPTDS 420 >UniRef50_Q1D823 Adventurous-gliding motility protein Z n=1 Tax=Myxococcus xanthus DK 1622 RepID=AGLZ_MYXXD Length = 1395 Score = 56.5 bits (134), Expect = 6e-06, Method: Composition-based stats. 
Identities = 65/328 (19%), Positives = 103/328 (31%), Gaps = 24/328 (7%) Query: 23 QLKAKRNSTTVVVNTLASENPDEAGRYSMDVEYGQYSVILLVEGFPPSHAGTITVYEDSQ 82 +L + T + TLA A +E +++ + +T Sbjct: 961 ELGTRVAELTQLTATLAQTENTRA-----HLEERLHTLTEESQRREELLQNDLTQKGTEL 1015 Query: 83 PGTLNDFLGAMTEDDARPEALRRFELMVEEVARNASAVAQNTAAAKKSASDASTSAREAA 142 TL E +R+ E++ EVA + A + A++A A Sbjct: 1016 SDTLRKL------THVTQEKMRQAEVLNREVATRTEQLKAMEAKLQTQATEARRQAEGLG 1069 Query: 143 THAA---------DAADSARAASTSAGQAASSAQSASSSAGTASTKATEASKSAAAAESS 193 A + R A AA +A G A ++ A+ + Sbjct: 1070 QQITGLNEQLEQGRKALAGREDQLRAAGAAQQKLTAERD-GLAGQLQQAEARLQQQAQQA 1128 Query: 194 KSAAATSAGAAKTSETNASASLQSAATSASTATTKASEAATSARDAAA--SKEAAK-SSE 250 A + AA + + Q A A T+A+EA A+D S A K Sbjct: 1129 NQERADAKRAADELAAKLAKTEQRITQFAQDAQTQATEADARAKDLQGQLSARAKKIQDL 1188 Query: 251 TNASSSASSAASSATAAGNSAKAAKTSETNARSSETAAGQSASAAAGSKTAAASSASAAS 310 A +A A S A N+ AA S+ + S+ AA Q ++ A AA Sbjct: 1189 ELAVENAQGAKSRAEKELNAKVAAAESKAHEASTRLAAAQKERKDLEARHAKEQEDLAAK 1248 Query: 311 TSAGQASASATAAGKSAESAASSASTAT 338 A A A + A S + Sbjct: 1249 QKAELERRDAIKAQEVARLQQSVQEKSK 1276 Score = 54.1 bits (128), Expect = 3e-05, Method: Composition-based stats. 
Identities = 68/416 (16%), Positives = 124/416 (29%), Gaps = 22/416 (5%) Query: 81 SQPGTLNDFLGAMTEDDA-RPEALRRFELMVEEVARNASAVAQNTA--AAKKSASDASTS 137 G + + + + R +R E + + A++ A +A S Sbjct: 402 ELDGEIQALQERLQQTEQERDTTVRGLEARAARAEEHGTQADAEIHRLNAERDALEAKLS 461 Query: 138 AREAATHAADAADSARAASTSAGQAASSAQ-SASSSAGTASTKATEASKSAAAAESSKSA 196 + A A A + A A+ + A E S A + + Sbjct: 462 QQVADLEADLARTMGERDQLRLDKDAQEAELTQRIEERDAKLGTLERELSETIARNEHTE 521 Query: 197 AATSAGAAKT--------SETNASASLQSAATSASTATTKA-SEAATSARDAAASKEAAK 247 A +A + E A + + TA +A +A + A Sbjct: 522 AELNANIQQQLERIGELEGEVEAVKTHLEDRENELTAELQALGQAKDELETDLNDRLQAL 581 Query: 248 SSETNASSSA-SSAASSATAAGNSAKAAKTSETNARSSETAAGQSASAAAGSKTAAASSA 306 S +A + S +A +A T + A +S+ Q +T SA Sbjct: 582 SQAKDALEADLSRQLEELRSAKAELEADLTGQIQALTSQLEETQRQLD-DSQRTGEQLSA 640 Query: 307 SAASTSAGQASASATAAGKSAESAASSASTATTKAGEATEQASAAARSASAAKTSETNAK 366 A + +T + AA + + A + A+T + A Sbjct: 641 RVAQLEDTVSQRESTIESLQGDVAARDQRISELSGDLEATSQTLAQTQQTLAQTEQQLAD 700 Query: 367 ASETSAESSKTAAASSASSAASSASSASASKDEATRQASAAKSSATT-------ASTKAT 419 T A + A + A+S + + A + + A++ A T+ T Sbjct: 701 TQNTLASTEGALAETRGELDATSQTLQQTQQTLAQTEGALAETRGELDATSQTLAQTQQT 760 Query: 420 EAAGSATAAAQSKSTAESAATRAETAAKRAEDIASAVALEDASTTKKGIVQLSSAT 475 A A + A + T AET + + A +G +Q +S T Sbjct: 761 LAQTEQQLADTQNTLASTEGTLAETRGELEATSQTLQQTHAALEDTRGALQETSDT 816 Score = 44.9 bits (104), Expect = 0.016, Method: Composition-based stats. 
Identities = 71/406 (17%), Positives = 127/406 (31%), Gaps = 18/406 (4%) Query: 104 RRFELMVEEVARNASAVAQNTAAAKKSASDA-STSAREAATHAADAADSARAASTSAGQA 162 R EL + A A AA+ SA TS R+ +A + A Sbjct: 852 LRSELSETQGNYEAERAAHEKLAAESSAHIGDLTSERDGLRSELEATSQTLEQTHGQLAA 911 Query: 163 ASSAQSASSSAGTASTKATEASKSAAAAESSKSAAATSAGAAKTSETNASASLQSAATSA 222 A + A S KA ++++ + +++ A + T + A + Sbjct: 912 TRDALAREQHAHQESRKAAASTQTTLEGQLAEARAHGEDLGEHLTLTKHELGTRVAELTQ 971 Query: 223 STATTKASEAATSARDAAASKEAAKSSETNASSSASSAASSATA-AGNSAKAAKTSETNA 281 TAT +E T A + S+ + T + K ++ Sbjct: 972 LTATLAQTEN-TRAHLEERLHTLTEESQRREELLQNDLTQKGTELSDTLRKLTHVTQEKM 1030 Query: 282 RSSETAAGQSASAAAGSKTAAASSASAASTSAGQASASATAAGKSAESAASSASTATTKA 341 R +E + A+ K A + A+ + QA + + KA Sbjct: 1031 RQAEVLNREVATRTEQLKAMEAKLQTQATEARRQAEG-----LGQQITGLNEQLEQGRKA 1085 Query: 342 GEATEQASAAARSASAAKTSETNAKASETSAESSKTAAASSASSAASSASSASASKDEAT 401 E AA +A T+E + A + ++ + ++ + A + DE Sbjct: 1086 LAGREDQLRAAGAAQQKLTAERDGLAGQLQQAEARLQQQAQQAN--QERADAKRAADELA 1143 Query: 402 RQASAAKSS----ATTASTKATEAAGSATAAAQSKSTAESAATRAETAAKRAEDIAS--- 454 + + + A A T+ATEA A S E A + A+ S Sbjct: 1144 AKLAKTEQRITQFAQDAQTQATEADARAKDLQGQLSARAKKIQDLELAVENAQGAKSRAE 1203 Query: 455 -AVALEDASTTKKGIVQLSSATNSTSETLAATPKAVKSAYDNAEKR 499 + + A+ K + + E + K D A K+ Sbjct: 1204 KELNAKVAAAESKAHEASTRLAAAQKERKDLEARHAKEQEDLAAKQ 1249 >UniRef50_A7IYB1 Gp40 n=1 Tax=Corynebacterium phage P1201 RepID=A7IYB1_9CAUD Length = 571 Score = 56.1 bits (133), Expect = 7e-06, Method: Composition-based stats. 
Identities = 86/366 (23%), Positives = 140/366 (38%), Gaps = 17/366 (4%) Query: 197 AATSAGAAKTSETNASASLQSAATSASTATTKASEAATSARDAAASKEAAKSSETNASSS 256 AA SA AAKTSETNA S +A TS +TA T A+ A S AA SK AA +S TNA +S Sbjct: 147 AAQSASAAKTSETNAKTSATNAKTSETTAKTSATNAKDSETAAARSKTAAATSATNAKTS 206 Query: 257 ASSAASSATAAGNSAKAAKTSETNARSSETAAGQSASAAAGSKTAA-----------ASS 305 ++AA+SAT A NSA A+ S T+A +S + A A A A Sbjct: 207 ETNAATSATNAANSATASANSATDAANSANSITDGAEVATSKAAEAAAAADRAEQAMAGK 266 Query: 306 ASAASTSAGQASASATAAGKSAESAASSASTATTKAGEATEQASAAARSASAAKTSETNA 365 A A + K A+ +A A +A + S S + Sbjct: 267 ADLIGGKVPTAQLPEISLTKPFSVASRTALLALDVQEGDVGIITAGSDKGSYILGSGPSK 326 Query: 366 KASETSAESSKTAAASSASSAASSASSASASKDEATRQA--SAAKSSATTASTK----AT 419 S + A + + + SA+ A + A + ST Sbjct: 327 VFSSWIPLAVSADAPVQSVNGQTGTVVLSAANVGAAPTSHTHTASQISGLPSTDGGYNTR 386 Query: 420 EAAGSATAAAQSKSTAESAATRAETAAKRAEDIASAVALEDASTTKKGIVQLSSATNSTS 479 +A A++ + + + ++ A + S + +T ++G S+ +S Sbjct: 387 DATDPASSYPAGVDVSLNNVNKGWSSVIGAASLPSLGSYVVVTTVRQGFYNESTWQYLSS 446 Query: 480 ETLAATPKAVKSAYDNAEKRLQKDQNGADIPDKGCFLNNINAVSKTDFADKRGMRYVRVN 539 TL + V+ ++A L++ + + + V+++ + V Sbjct: 447 CTLPGSSIYVRKWQNDAWTTLRRLTDDGHTHTSAQISDATSLVTESTVVRRDSAGDFYVK 506 Query: 540 APAGAT 545 P +T Sbjct: 507 TPTAST 512 Score = 53.8 bits (127), Expect = 4e-05, Method: Composition-based stats. 
Identities = 80/300 (26%), Positives = 120/300 (40%), Gaps = 5/300 (1%) Query: 197 AATSAGAAKTSETNASASLQSAATSASTATTKASEAATSARDAAASKEAAKSSETNASSS 256 A A T A+ S +A TS + A T A+ A TS A S AK SET A+ S Sbjct: 133 TAELIEAFDTIRIGAAQSASAAKTSETNAKTSATNAKTSETTAKTSATNAKDSETAAARS 192 Query: 257 ASSAASSATAAGNSAKAAKTSETNARSSETAAGQSASAAAGSKTAAASSASAASTSAGQA 316 ++AA+SAT A S A TS TNA +S TA+ SA+ AA S + A A++ A +A Sbjct: 193 KTAAATSATNAKTSETNAATSATNAANSATASANSATDAANSANSITDGAEVATSKAAEA 252 Query: 317 SASATAA----GKSAESAASSASTATTKAGEATEQASAAARSASAAKTSETNAKASETSA 372 +A+A A A+ TA T+ S A+R+A A + T+ Sbjct: 253 AAAADRAEQAMAGKADLIGGKVPTAQLPEISLTKPFSVASRTALLALDVQEGDVGIITAG 312 Query: 373 ESSKTAAASSASSAASSASSASASKDEATRQASAAKSSATTASTKATEAAGSATAAAQSK 432 + S S S+ A +A Q+ ++ S AA ++ S+ Sbjct: 313 SDKGSYILGSGPSKVFSSWIPLAVSADAPVQSVNGQTGTVVLSAANVGAAPTSHTHTASQ 372 Query: 433 STAESAATRAETAAKRAEDIASAVALEDASTTKKGIVQLSSATNSTSETLAATPKAVKSA 492 + + T + A D AS+ + SS + S + V + Sbjct: 373 ISGLPS-TDGGYNTRDATDPASSYPAGVDVSLNNVNKGWSSVIGAASLPSLGSYVVVTTV 431 >UniRef50_Q7SBR0 Putative uncharacterized protein n=1 Tax=Neurospora crassa RepID=Q7SBR0_NEUCR Length = 1353 Score = 56.1 bits (133), Expect = 8e-06, Method: Composition-based stats. 
Identities = 64/366 (17%), Positives = 114/366 (31%), Gaps = 9/366 (2%) Query: 107 ELMVEEVARNASAVAQNTAAAKKSASDASTSAREAATHAADAADSARAASTSAGQAASSA 166 E E A ++T D T T+ A Sbjct: 458 EEPAHEDDTRDKATQEDTVHQDTQTDDEVTQQTTQEDITQQTTAEESTQQTTQEDVTEDA 517 Query: 167 QSASSSAGTASTK------ATEASKSAAAAESSKSAAATSAGAAKTSETNASASLQSAAT 220 +++ K T+ E+SK T A+ + +A + Sbjct: 518 PEVAATKDEVDVKDDTTPHETDLKDETHPMEASKEEEPTQKEDAEEHAVDETAQEEQPTK 577 Query: 221 SASTATTKASEAATSARDAAASKEAAKSSETNASSSASSAASSATAAGNSAKAAKTSETN 280 ++ T A+ ATS A +E A ET+A + A+ A + A + E Sbjct: 578 ESAPEETTATGEATSEEVA---REKADEDETHAKTYAAVAHEALEAETSPEVTEAAPEAE 634 Query: 281 ARSSETAAGQSASAAAGSKTAAASSASAASTSAGQASASATAAGKSAESAASSASTATTK 340 A +E A + +A A + A A + + A +E S +T + + Sbjct: 635 APQAEETAPATEAAPAEETSPAVDVAPVEKAAPVEDDVPVEKATSLSEENVSVDATNSAE 694 Query: 341 AGEATEQASAAARSASAAKTSETNAKASETSAESSKTAAASSASSAASSASSASASKDEA 400 +E + A+ A + + + + + AA S +S + Sbjct: 695 EAAPSEIPADVVEEAALAAEATPAEETTPVTEATRVEEAAPVEESTHASEDAPVKEAPLE 754 Query: 401 TRQASAAKSSATTASTKATEAAGSATAAAQSKSTAESAATRAETAAKRAEDIASAVALED 460 S + AST+ T A A A + + A AE ++A A E+ Sbjct: 755 AAPVSEELPAEKAASTEETAPALDAAPAETAAVEEVTPAKEAEPVQATYAEVAQATDAEE 814 Query: 461 ASTTKK 466 + +K Sbjct: 815 PAHVEK 820 Score = 53.8 bits (127), Expect = 4e-05, Method: Composition-based stats. 
Identities = 65/397 (16%), Positives = 121/397 (30%), Gaps = 22/397 (5%) Query: 110 VEEVARNASAVAQNTAAAKKSASDASTSAREAATHAADAADSARAASTSAGQAASSAQSA 169 + + A A A+ + +A A A A + S + Sbjct: 569 TAQEEQPTKESAPEETTATGEATSEEVAREKADEDETHAKTYAAVAHEALEAETSPEVTE 628 Query: 170 SSSAGTASTKATEASKSAAAAESSKSAAATSAGAAKTSETNASASLQSAATSAS-----T 224 ++ A A + AA S A A K + ++ A + + Sbjct: 629 AAPEAEAPQAEETAPATEAAPAEETSPAVDVAPVEKAAPVEDDVPVEKATSLSEENVSVD 688 Query: 225 ATTKASEAATSARDA----------AASKEAAKSSETNASSSASSA-ASSATAAGNSAKA 273 AT A EAA S A A+ + T A+ +A +T A A Sbjct: 689 ATNSAEEAAPSEIPADVVEEAALAAEATPAEETTPVTEATRVEEAAPVEESTHASEDAPV 748 Query: 274 AKTSETNARSSETAAGQSASAAAGSKTAAASSASAASTSAGQASASATAAGKSAESAASS 333 + A SE + A++ +TA A A+ A T+A + A A + A Sbjct: 749 KEAPLEAAPVSEELPAEKAASTE--ETAPALDAAPAETAAVEEVTPAKEAEPVQATYAEV 806 Query: 334 ASTATTKAG---EATEQASAAARSASAAKTSETNAKASETSAESSKTAAASSASSAASSA 390 A + E TE A +A + + ++++ ++A+ + Sbjct: 807 AQATDAEEPAHVEKTEPVEEATVDETAPTEEAATVEEEAANVPAAQSEPVANAAKEIFAR 866 Query: 391 SSASASKDEATRQASAAKSSATTASTKATEAAGSATAAAQSKSTAESAATRAETAAKRAE 450 S + A+++A A + A G+ + ++ R Sbjct: 867 SEPVTESQSGAATPTFARTAAEVADSAALLDEGTPEDRVSDEEAGKTGFRRLSATPITEV 926 Query: 451 DIASAVALEDASTTKKGIVQL-SSATNSTSETLAATP 486 +A + A + S + +E + P Sbjct: 927 ADTAAEVADSAKYLDNEATETDKSEIPTPAEDGSHNP 963 >UniRef50_C0MB36 Putative cell surface-anchored protein n=3 Tax=Streptococcus equi RepID=C0MB36_STRE4 Length = 673 Score = 56.1 bits (133), Expect = 9e-06, Method: Composition-based stats. 
Identities = 60/401 (14%), Positives = 115/401 (28%), Gaps = 10/401 (2%) Query: 119 AVAQNTAAAKKSASDASTSAREAATHAADAADSARAASTSAGQAASSAQSASSSAGTAST 178 + AK A A++A A A + A T A + A++A A A+ Sbjct: 203 TAEETAKQAKTDKERAEAEAKKAKEEAKTAEGKVKQAETEKRNAEAKARTAEEEAKQATA 262 Query: 179 KATEASKSAAAAESSKSAAATSAGAAKTSETNASASLQSAATSASTATTKA---SEAATS 235 +A A A+ A +A + + Q A A+ A Sbjct: 263 DKEKAETEAKKAKEEAKTAKEAAHQEQEKAKQLEQANQQANQRANLAEKSKKDLETQKEK 322 Query: 236 ARDAAASKEAAKSSETNASSSASSAASSATAAGNSAKAAKTSETNARSSETAAGQSASAA 295 AK+ A SA+ +K + + + A Sbjct: 323 LEQEIKEATEAKN---KAEQKLKDLQDSASQGSELSKQLLKEKEELTTKLQELQKQAEEK 379 Query: 296 AGSKTAAASSASAASTSAGQASASATAAGKSAESAASSASTATTKAGEATEQ-ASAAARS 354 A ++GQ + + +A EQ Sbjct: 380 TTEIEKLKQELEANKQNSGQLGQQEQKLQEQLNKVQKELKQKEMELKQAQEQLKQEQKPH 439 Query: 355 ASAAKTSETNAKASETSAESSKTAAASSASSAASSASSASASKDEATRQASAAKSSATTA 414 + + A+ +E + + S+ ++ A S+ +A + S A+ T A Sbjct: 440 EGGGDSDASKARITELEKQVQTLTKEKADLSSTLESTKAQLSETQA--RLSEAQKQLTAA 497 Query: 415 STKATEAAGSATAAA-QSKSTAESAATRAETAAKRAEDIASAVALEDASTTKKGIVQLSS 473 K T TA Q ++ ++ + + + K + + A +K + Sbjct: 498 QEKLTTLEAEKTALQHQVETISKQLSETRDLSEKEKAALQEQINKLKAEIEQKTKEIEAL 557 Query: 474 ATNSTSETLAATPKAVKSAYDNAEKRLQKDQNGADIPDKGC 514 S PK K+ + + + + P+K Sbjct: 558 KQGMQSHQGQEKPKDPKTPETPKDPKTPEKNDQPQAPEKRS 598 >UniRef50_A7SNK2 Predicted protein n=2 Tax=Nematostella vectensis RepID=A7SNK2_NEMVE Length = 571 Score = 55.7 bits (132), Expect = 1e-05, Method: Composition-based stats. 
Identities = 51/359 (14%), Positives = 108/359 (30%), Gaps = 6/359 (1%) Query: 126 AAKKSASDASTSAREAATHAADAADSARAASTSAGQAASSAQSASSSAGTASTKATEASK 185 A++ + TS +AA A+ + + +A + + S + + +A + Sbjct: 147 QARRKTKTSKTSKAQAAKQASRKTKTRQLQDQAARPSRKTKTSKQQDQDKQAARPRQARQ 206 Query: 186 SAAAAESSKSAAATSAGAAKTSETNASASL------QSAATSASTATTKASEAATSARDA 239 ++ + AA+ + S Q+A ++ TK + + A Sbjct: 207 ASHKTNKLQDQNKQDKQAARPTSRRIKTSKLQDQDKQAARPIEASHKTKRRKPQDQDKQA 266 Query: 240 AASKEAAKSSETNASSSASSAASSATAAGNSAKAAKTSETNARSSETAAGQSASAAAGSK 299 A ++A++ ++T + + S A + + Q+A S+ Sbjct: 267 ARPRQASRKTKTMRQARRKQQDKQDASNKTSKTQATRQASRKTKTRQLRDQAARTRQESR 326 Query: 300 TAAASSASAASTSAGQASASATAAGKSAESAASSASTATTKAGEATEQASAAARSASAAK 359 S A+ A+ K A A S QAS +++ +K Sbjct: 327 KTKTSKTKASKQPDQDKQAARPRQAKQARQAGRKTSKPQDHTARQARQASRKTKTSKTSK 386 Query: 360 TSETNAKASETSAESSKTAAASSASSAASSASSASASKDEATRQASAAKSSATTASTKAT 419 TS+ + + + + AS S + K+S T Sbjct: 387 TSKPQDQDKQDKRDKREKKDKQGASCKTSKPQDQDKQAARPRQARRKTKTSKTQDQAAKP 446 Query: 420 EAAGSATAAAQSKSTAESAATRAETAAKRAEDIASAVALEDASTTKKGIVQLSSATNST 478 A T + AA + + S ++ + + +S S Sbjct: 447 RQASPKTKPQDQDKQHKQAARVRQARQASCKTKTSKPQDQNKQAARPKQAREASRKTSK 505 Score = 52.6 bits (124), Expect = 8e-05, Method: Composition-based stats. 
Identities = 50/366 (13%), Positives = 117/366 (31%), Gaps = 3/366 (0%) Query: 120 VAQNTAAAKKSASDASTSAREAATHAADAADSARAASTSAGQAASSAQSASSSAGTASTK 179 + A + + TS + A A + + TS Q A Sbjct: 89 TSNPQDQAARPSCKTKTSKPQDKQGARQACRKTKTSKTSKPQDQDKQDKQDKQAARPRQA 148 Query: 180 ATEASKSAAAAESSKSAAATSAGAAKTSETNASAS-LQSAATSASTATTKASEAATSARD 238 + S + + A+ + + A S + A Sbjct: 149 RRKTKTSKTSKAQAAKQASRKTKTRQLQDQAARPSRKTKTSKQQDQDKQAARPRQARQAS 208 Query: 239 AAASKEAAKSSETNASSSASSAASSATAAGNSAKAAKTSETNARSSETAAGQSASAAAGS 298 +K ++ + ++ +S + + K A + ++ Q A Sbjct: 209 HKTNKLQDQNKQDKQAARPTSRRIKTSKLQDQDKQAARPIEASHKTKRRKPQDQDKQAAR 268 Query: 299 KTAAASSASAASTSAGQASASATAAGKSAESAASSASTATTKAGEATEQASAAARSASAA 358 A+ + + A+ K++++ A+ ++ TK + +QA+ + + Sbjct: 269 PRQASRKTKTMRQARRKQQDKQDASNKTSKTQATRQASRKTKTRQLRDQAARTRQESRKT 328 Query: 359 KTSETNA-KASETSAESSKTAAASSASSAASSASSASA-SKDEATRQASAAKSSATTAST 416 KTS+T A K + ++++ A A A S + +A + + K+S T+ ++ Sbjct: 329 KTSKTKASKQPDQDKQAARPRQAKQARQAGRKTSKPQDHTARQARQASRKTKTSKTSKTS 388 Query: 417 KATEAAGSATAAAQSKSTAESAATRAETAAKRAEDIASAVALEDASTTKKGIVQLSSATN 476 K + + K + A+ + + + A + T K Q + Sbjct: 389 KPQDQDKQDKRDKREKKDKQGASCKTSKPQDQDKQAARPRQARRKTKTSKTQDQAAKPRQ 448 Query: 477 STSETL 482 ++ +T Sbjct: 449 ASPKTK 454 >UniRef50_C2CUY9 Putative uncharacterized protein n=1 Tax=Gardnerella vaginalis ATCC 14019 RepID=C2CUY9_GARVA Length = 1493 Score = 55.7 bits (132), Expect = 1e-05, Method: Composition-based stats. 
Identities = 66/402 (16%), Positives = 123/402 (30%), Gaps = 6/402 (1%) Query: 110 VEEVARNASAVAQNTAAAKKSASDASTSAREAATHAADAADSARAASTSAGQAASSAQSA 169 + + +T A ++ ++ + E + A A + + A + + A +A Sbjct: 67 TNQGGNSGQNQPGSTPAGQQVGGNSEGTKPEEKKNEA-AEQAIKDAKSKSEDAQKDVTTA 125 Query: 170 SSSAGTASTKATEASKSAAAAESSKSAAATSAGAAKTSETNASASLQSAATSASTATTKA 229 + A+ KATEA +A A++ +AA S A +T A +L +A + A Sbjct: 126 DNEVKQANEKATEAKNTAEEAQNGLNAANKSKDKADKEKTEADNNLNAANNEVNAKEKVA 185 Query: 230 SEAATSARDAAASKEAAKSSETNASSSASSAASSATAAGNSAKAAKTSETNARSSETAAG 289 EAA + + ++ +T A + A + AA K A+ + T+ ++ A Sbjct: 186 EEAAGDVAAKDKALKDEQAKQTAAEEAKKKAENEQAAAEEEKKTAENNVTDKNTALEKAK 245 Query: 290 QSASAAAGSKTAAASSASAASTSAGQASASATAAGKSAESAASSASTATTKAGEATEQAS 349 + + K A A+ A + A ++ Sbjct: 246 GTLESKQDEKNDYDKKLKEAQEKLENEKKEQARLDDIADRAKTKQEQADLAVENNKKEQQ 305 Query: 350 AAARSASAAKTSETNAKASETSAESSKTAAASSASSAASSASSASASKDEATRQASAAKS 409 A T + + A A A A Sbjct: 306 EKQNKIDALNNPTTTTNEEVERLKREASEAEQKAKDAKEDEDKKDKEAKNKQNDLDNAGK 365 Query: 410 SATTASTKATEAAGSATAAAQSKSTAESAATRAETAAKRAEDIASAVALEDASTTKKGIV 469 + +A + A+ S E A A+ AA A AS I Sbjct: 366 NLNSAEDALEK-----NPDAKKLSDTEKAVKDAKNAADEAAKNASTTTANFDGFLDYVIK 420 Query: 470 QLSSATNSTSETLAATPKAVKSAYDNAEKRLQKDQNGADIPD 511 ++T+ + L A + K + + K D + Sbjct: 421 TYKNSTSKEDKALLADAQRAKQILNGERVEIMKQVQNGDKTE 462 Score = 50.3 bits (118), Expect = 4e-04, Method: Composition-based stats. 
Identities = 47/308 (15%), Positives = 103/308 (33%), Gaps = 7/308 (2%) Query: 96 DDARPEALRRFELMVEEVARNASAVAQNTAAAKKSASDASTSAREAATHAADAADSARAA 155 D A+ +A + + E+ + + Q A+++ DA + A A A + Sbjct: 997 DAAKNDAQLQKNKIDGELTQAVTQAKQKVDQAQQAVQDAIKAKDSAVEALAAANKNVTEL 1056 Query: 156 STSAGQAASSAQSASSSAGTASTKATEASKSAAAAESSKSAAATSAGAAKTSETNASASL 215 + A++ Q + A TK +A A +++A T AK + + ++ Sbjct: 1057 TEKISGFAAAIQKHDNEIKAAKTKVADAQGYLDTATQKQTSAETELKNAKQTAKEKAEAV 1116 Query: 216 QSAATS-------ASTATTKASEAATSARDAAASKEAAKSSETNASSSASSAASSATAAG 268 ++A + S A +A + A ++ AAK+ T A + S + AA Sbjct: 1117 KTAEQNVEKAIAAISAAKESVKQAKITVASAQSALAAAKTKVTQAQTKVESYKTKFAAAV 1176 Query: 269 NSAKAAKTSETNARSSETAAGQSASAAAGSKTAAASSASAASTSAGQASASATAAGKSAE 328 + A + + A + + S++ ++ S S + E Sbjct: 1177 AALPEQVRVTYQANFAAVTGNFTTMLATLNSCSTTLSSATSTLQKSAESLSKAVPEPAPE 1236 Query: 329 SAASSASTATTKAGEATEQASAAARSASAAKTSETNAKASETSAESSKTAAASSASSAAS 388 A + E + + + S T ++ S A+ Sbjct: 1237 PPDPVPPAPVPPAPVPVPTPPTPPVVPAPPVVPEHHNEGGSNTNTGSGTQEQNTESGNAN 1296 Query: 389 SASSASAS 396 + ++++ Sbjct: 1297 QNTGSTSA 1304 >UniRef50_A8WT52 Putative uncharacterized protein n=2 Tax=Caenorhabditis briggsae RepID=A8WT52_CAEBR Length = 4574 Score = 55.7 bits (132), Expect = 1e-05, Method: Composition-based stats. 
Identities = 59/388 (15%), Positives = 131/388 (33%), Gaps = 2/388 (0%) Query: 123 NTAAAKKSASDASTSAREAATHAADAADSARAASTSAGQAASSAQSASSSAGTASTKATE 182 + + + ++AS ++ + + S T A ++ +A+ +A T ++ + Sbjct: 3485 SAEESATAVTEASGEEPAVSSTSVPSELSKDDQVTEASGEETTTAAATEAAVTEASGEED 3544 Query: 183 ASKSAAAAESSKSAAATSAGAAKTSETNASASLQSAATSASTATTKASEAATSARDAAAS 242 + +AA + S + S + T AS + + + T+AS TSA ++A + Sbjct: 3545 TTAAAATEPAVSSTSVPSELSKDDQVTEASGEESTTTAATEASVTEASGEETSAEESATA 3604 Query: 243 KEAAKSSETNASSSASSAASSATAAGNSAKAAKTSETNARSSETAAGQSASAAAGSKTAA 302 A E SS++ + S A + + A S A + A Sbjct: 3605 VTEASGEEPAVSSTSVPSELSKDDQVTEASGEEITTAAATESTANEASGEEATTTAAAAE 3664 Query: 303 ASSASAASTSAGQASASATAAGKSAESAASSASTATTKAGEATEQASAAARSASAAKTSE 362 A A+ A A+A+ A +++ +++ST+ +Q + A+ + Sbjct: 3665 AVVTEASGEEATTAAAAKAAVTEASGEEPAASSTSVPSELPKDDQVTEASGEEITTAAAT 3724 Query: 363 TNAKASETSAESSKTAAASSASSAASSASSASASKDEATRQASAAKSSATTASTKATEAA 422 + E++ + S ++ + S+ + E+T A + + ++E + Sbjct: 3725 EATVTEASGEEATTVKTVVNVSVEVNNTTETSSPETESTTPEGPAFVTGSEIEIPSSEES 3784 Query: 423 GSATA--AAQSKSTAESAATRAETAAKRAEDIASAVALEDASTTKKGIVQLSSATNSTSE 480 G+ T + T + + T + A E +A + E Sbjct: 3785 GTTTTHDPSIPVITPKPSVTSTDLATDSVESTTAATEKQAEKKIIGDHNAEKEDAEKKEE 3844 Query: 481 TLAATPKAVKSAYDNAEKRLQKDQNGAD 508 E + D Sbjct: 3845 EEDLPSFVTDEGVSTEEPSTATSSSVTD 3872 >UniRef50_Q9MBI3 Gp21, tail fiber protein n=1 Tax=Corynebacterium phage BFK20 RepID=Q9MBI3_9CAUD Length = 285 Score = 55.7 bits (132), Expect = 1e-05, Method: Composition-based stats. 
Identities = 63/158 (39%), Positives = 87/158 (55%) Query: 143 THAADAADSARAASTSAGQAASSAQSASSSAGTASTKATEASKSAAAAESSKSAAATSAG 202 T+ A+ AS A QAA A+ + G T +A S A++S++A TS Sbjct: 96 TYTPPVVGEAQKASREAVQAADRAKGIADKFGDVDTAIAQAKTSENNAKASETADKTSQT 155 Query: 203 AAKTSETNASASLQSAATSASTATTKASEAATSARDAAASKEAAKSSETNASSSASSAAS 262 AAKTSETNA S +A SA+ A+T A+ A TS +A AS AA +S TNA +S ++A + Sbjct: 156 AAKTSETNAKTSETNAGASATAASTSATNAKTSETNAGASATAASTSATNAKTSQTAAKT 215 Query: 263 SATAAGNSAKAAKTSETNARSSETAAGQSASAAAGSKT 300 S T A S A S T A +S T+A SA++ S+T Sbjct: 216 SETNAKTSETNAGASATAAANSATSASNSAASIKTSET 253 Score = 55.3 bits (131), Expect = 1e-05, Method: Composition-based stats. Identities = 68/158 (43%), Positives = 93/158 (58%) Query: 157 TSAGQAASSAQSASSSAGTASTKATEASKSAAAAESSKSAAATSAGAAKTSETNASASLQ 216 T AQ AS A A+ +A + +++ + A TS AK SET S Sbjct: 96 TYTPPVVGEAQKASREAVQAADRAKGIADKFGDVDTAIAQAKTSENNAKASETADKTSQT 155 Query: 217 SAATSASTATTKASEAATSARDAAASKEAAKSSETNASSSASSAASSATAAGNSAKAAKT 276 +A TS + A T + A SA A+ S AK+SETNA +SA++A++SAT A S AAKT Sbjct: 156 AAKTSETNAKTSETNAGASATAASTSATNAKTSETNAGASATAASTSATNAKTSQTAAKT 215 Query: 277 SETNARSSETAAGQSASAAAGSKTAAASSASAASTSAG 314 SETNA++SET AG SA+AAA S T+A++SA++ TS Sbjct: 216 SETNAKTSETNAGASATAAANSATSASNSAASIKTSET 253 Score = 55.3 bits (131), Expect = 1e-05, Method: Composition-based stats. 
Identities = 78/182 (42%), Positives = 101/182 (55%) Query: 167 QSASSSAGTASTKATEASKSAAAAESSKSAAATSAGAAKTSETNASASLQSAATSASTAT 226 A AS +A +A+ A T+ AKTSE NA AS + TS + A Sbjct: 99 PPVVGEAQKASREAVQAADRAKGIADKFGDVDTAIAQAKTSENNAKASETADKTSQTAAK 158 Query: 227 TKASEAATSARDAAASKEAAKSSETNASSSASSAASSATAAGNSAKAAKTSETNARSSET 286 T + A TS +A AS AA +S TNA +S ++A +SATAA SA AKTS+T A++SET Sbjct: 159 TSETNAKTSETNAGASATAASTSATNAKTSETNAGASATAASTSATNAKTSQTAAKTSET 218 Query: 287 AAGQSASAAAGSKTAAASSASAASTSAGQASASATAAGKSAESAASSASTATTKAGEATE 346 A S + A S TAAA+SA++AS SA S T AG SA +AA+SA+ A A A Sbjct: 219 NAKTSETNAGASATAAANSATSASNSAASIKTSETNAGASATAAANSATAAGLAADRAAV 278 Query: 347 QA 348 Q Sbjct: 279 QP 280 Score = 53.0 bits (125), Expect = 7e-05, Method: Composition-based stats. Identities = 59/149 (39%), Positives = 86/149 (57%) Query: 138 AREAATHAADAADSARAASTSAGQAASSAQSASSSAGTASTKATEASKSAAAAESSKSAA 197 A++A+ A AAD A+ + G ++ A +S A T S AA++S++ A Sbjct: 105 AQKASREAVQAADRAKGIADKFGDVDTAIAQAKTSENNAKASETADKTSQTAAKTSETNA 164 Query: 198 ATSAGAAKTSETNASASLQSAATSASTATTKASEAATSARDAAASKEAAKSSETNASSSA 257 TS A S T AS S +A TS + A A+ A+TSA +A S+ AAK+SETNA +S Sbjct: 165 KTSETNAGASATAASTSATNAKTSETNAGASATAASTSATNAKTSQTAAKTSETNAKTSE 224 Query: 258 SSAASSATAAGNSAKAAKTSETNARSSET 286 ++A +SATAA NSA +A S + ++SET Sbjct: 225 TNAGASATAAANSATSASNSAASIKTSET 253 Score = 52.2 bits (123), Expect = 1e-04, Method: Composition-based stats. 
Identities = 70/178 (39%), Positives = 98/178 (55%) Query: 242 SKEAAKSSETNASSSASSAASSATAAGNSAKAAKTSETNARSSETAAGQSASAAAGSKTA 301 + A A+ A A + AKTSE NA++SETA S +AA S+T Sbjct: 104 EAQKASREAVQAADRAKGIADKFGDVDTAIAQAKTSENNAKASETADKTSQTAAKTSETN 163 Query: 302 AASSASAASTSAGQASASATAAGKSAESAASSASTATTKAGEATEQASAAARSASAAKTS 361 A +S + A SA AS SAT A S +A +SA+ A+T A A +AA S + AKTS Sbjct: 164 AKTSETNAGASATAASTSATNAKTSETNAGASATAASTSATNAKTSQTAAKTSETNAKTS 223 Query: 362 ETNAKASETSAESSKTAAASSASSAASSASSASASKDEATRQASAAKSSATTASTKAT 419 ETNA AS T+A +S T+A++SA+S +S ++A AS A A+AA +A A+ + Sbjct: 224 ETNAGASATAAANSATSASNSAASIKTSETNAGASATAAANSATAAGLAADRAAVQPR 281 Score = 52.2 bits (123), Expect = 1e-04, Method: Composition-based stats. Identities = 65/150 (43%), Positives = 86/150 (57%) Query: 214 SLQSAATSASTATTKASEAATSARDAAASKEAAKSSETNASSSASSAASSATAAGNSAKA 273 Q A+ A A +A A D + AK+SE NA +S ++ +S TAA S Sbjct: 104 EAQKASREAVQAADRAKGIADKFGDVDTAIAQAKTSENNAKASETADKTSQTAAKTSETN 163 Query: 274 AKTSETNARSSETAAGQSASAAAGSKTAAASSASAASTSAGQASASATAAGKSAESAASS 333 AKTSETNA +S TAA SA+ A S+T A +SA+AASTSA A S TAA S +A +S Sbjct: 164 AKTSETNAGASATAASTSATNAKTSETNAGASATAASTSATNAKTSQTAAKTSETNAKTS 223 Query: 334 ASTATTKAGEATEQASAAARSASAAKTSET 363 + A A A A++A+ SA++ KTSET Sbjct: 224 ETNAGASATAAANSATSASNSAASIKTSET 253 Score = 51.5 bits (121), Expect = 2e-04, Method: Composition-based stats. 
Identities = 61/149 (40%), Positives = 85/149 (57%) Query: 201 AGAAKTSETNASASLQSAATSASTATTKASEAATSARDAAASKEAAKSSETNASSSASSA 260 A A A+ + A T ++A TS +A AS+ A K+S+T A +S ++A Sbjct: 105 AQKASREAVQAADRAKGIADKFGDVDTAIAQAKTSENNAKASETADKTSQTAAKTSETNA 164 Query: 261 ASSATAAGNSAKAAKTSETNARSSETAAGQSASAAAGSKTAAASSASAASTSAGQASASA 320 +S T AG SA AA TS TNA++SET AG SA+AA+ S T A +S +AA TS A S Sbjct: 165 KTSETNAGASATAASTSATNAKTSETNAGASATAASTSATNAKTSQTAAKTSETNAKTSE 224 Query: 321 TAAGKSAESAASSASTATTKAGEATEQAS 349 T AG SA +AA+SA++A+ A + Sbjct: 225 TNAGASATAAANSATSASNSAASIKTSET 253 Score = 50.7 bits (119), Expect = 3e-04, Method: Composition-based stats. Identities = 63/149 (42%), Positives = 86/149 (57%) Query: 222 ASTATTKASEAATSARDAAASKEAAKSSETNASSSASSAASSATAAGNSAKAAKTSETNA 281 A A+ +A +AA A+ A ++ A +S ++A +S TA S AAKTSETNA Sbjct: 105 AQKASREAVQAADRAKGIADKFGDVDTAIAQAKTSENNAKASETADKTSQTAAKTSETNA 164 Query: 282 RSSETAAGQSASAAAGSKTAAASSASAASTSAGQASASATAAGKSAESAASSASTATTKA 341 ++SET AG SA+AA+ S T A +S + A SA AS SAT A S +A +S + A T Sbjct: 165 KTSETNAGASATAASTSATNAKTSETNAGASATAASTSATNAKTSQTAAKTSETNAKTSE 224 Query: 342 GEATEQASAAARSASAAKTSETNAKASET 370 A A+AAA SA++A S + K SET Sbjct: 225 TNAGASATAAANSATSASNSAASIKTSET 253 Score = 47.6 bits (111), Expect = 0.003, Method: Composition-based stats. Identities = 66/150 (44%), Positives = 87/150 (58%) Query: 130 SASDASTSAREAATHAADAADSARAASTSAGQAASSAQSASSSAGTASTKATEASKSAAA 189 A AS A +AA A AD T+ QA +S +A +S T T A S Sbjct: 104 EAQKASREAVQAADRAKGIADKFGDVDTAIAQAKTSENNAKASETADKTSQTAAKTSETN 163 Query: 190 AESSKSAAATSAGAAKTSETNASASLQSAATSASTATTKASEAATSARDAAASKEAAKSS 249 A++S++ A SA AA TS TNA S +A SA+ A+T A+ A TS A S+ AK+S Sbjct: 164 AKTSETNAGASATAASTSATNAKTSETNAGASATAASTSATNAKTSQTAAKTSETNAKTS 223 Query: 250 ETNASSSASSAASSATAAGNSAKAAKTSET 279 ETNA +SA++AA+SAT+A NSA + KTSET Sbjct: 224 ETNAGASATAAANSATSASNSAASIKTSET 253 Score = 46.4 bits (108), Expect = 0.007, Method: Composition-based stats. 
Identities = 68/179 (37%), Positives = 101/179 (56%), Gaps = 3/179 (1%) Query: 229 ASEAATSARDAAASKEAAKSSET---NASSSASSAASSATAAGNSAKAAKTSETNARSSE 285 EA ++R+A + + AK + ++ + A +S A S A KTS+T A++SE Sbjct: 102 VGEAQKASREAVQAADRAKGIADKFGDVDTAIAQAKTSENNAKASETADKTSQTAAKTSE 161 Query: 286 TAAGQSASAAAGSKTAAASSASAASTSAGQASASATAAGKSAESAASSASTATTKAGEAT 345 T A S + A S TAA++SA+ A TS A ASATAA SA +A +S + A T A Sbjct: 162 TNAKTSETNAGASATAASTSATNAKTSETNAGASATAASTSATNAKTSQTAAKTSETNAK 221 Query: 346 EQASAAARSASAAKTSETNAKASETSAESSKTAAASSASSAASSASSASASKDEATRQA 404 + A SA+AA S T+A S S ++S+T A +SA++AA+SA++A + D A Q Sbjct: 222 TSETNAGASATAAANSATSASNSAASIKTSETNAGASATAAANSATAAGLAADRAAVQP 280 Score = 46.1 bits (107), Expect = 0.009, Method: Composition-based stats. Identities = 46/121 (38%), Positives = 74/121 (61%) Query: 325 KSAESAASSASTATTKAGEATEQASAAARSASAAKTSETNAKASETSAESSKTAAASSAS 384 A+ A+ A A +A ++ + + AKTSE NAKASET+ ++S+TAA +S + Sbjct: 103 GEAQKASREAVQAADRAKGIADKFGDVDTAIAQAKTSENNAKASETADKTSQTAAKTSET 162 Query: 385 SAASSASSASASKDEATRQASAAKSSATTASTKATEAAGSATAAAQSKSTAESAATRAET 444 +A +S ++A AS A+ A+ AK+S T A AT A+ SAT A S++ A+++ T A+T Sbjct: 163 NAKTSETNAGASATAASTSATNAKTSETNAGASATAASTSATNAKTSQTAAKTSETNAKT 222 Query: 445 A 445 + Sbjct: 223 S 223 Score = 44.9 bits (104), Expect = 0.019, Method: Composition-based stats. 
Identities = 69/211 (32%), Positives = 95/211 (45%), Gaps = 10/211 (4%) Query: 46 AGRYSMDVEYGQYSVILLVEGFPPSHAGTITVYEDSQPGTLNDFLGAMTEDDAR--PEAL 103 G+ ++ +E G +V L P A T +E P T L ++ E D P + Sbjct: 48 NGKATLTLEPGPVTVQFL-----PIGAAGKTKFEGVVPDTGPVTLRSVIEGDFTYTPPVV 102 Query: 104 RRFELMVEEV---ARNASAVAQNTAAAKKSASDASTSAREAATHAADAADSARAASTSAG 160 + E A A +A + + A TS A S AA TS Sbjct: 103 GEAQKASREAVQAADRAKGIADKFGDVDTAIAQAKTSENNAKASETADKTSQTAAKTSET 162 Query: 161 QAASSAQSASSSAGTASTKATEASKSAAAAESSKSAAATSAGAAKTSETNASASLQSAAT 220 A +S +A +SA AST AT A S A +S +AA+TSA AKTS+T A S +A T Sbjct: 163 NAKTSETNAGASATAASTSATNAKTSETNAGASATAASTSATNAKTSQTAAKTSETNAKT 222 Query: 221 SASTATTKASEAATSARDAAASKEAAKSSET 251 S + A A+ AA SA A+ S + K+SET Sbjct: 223 SETNAGASATAAANSATSASNSAASIKTSET 253 Score = 44.1 bits (102), Expect = 0.034, Method: Composition-based stats. Identities = 57/165 (34%), Positives = 81/165 (49%), Gaps = 7/165 (4%) Query: 171 SSAGTASTKATEASKSAAAAESSKSAAATSAGAAKTSETNASASLQSAATSASTATTKAS 230 + +A +AS+ A A A G T+ A S +A S + T + Sbjct: 96 TYTPPVVGEAQKASREAVQAADRAKGIADKFGDVDTAIAQAKTSENNAKASETADKTSQT 155 Query: 231 EAATSARDAAASKEAAKSSETNASSSASSAASSATAAGNSAKAAKTSETNARSSETAAGQ 290 A TS + AK+SETNA +SA++A++SAT A S A S T A +S T A Sbjct: 156 AAKTSETN-------AKTSETNAGASATAASTSATNAKTSETNAGASATAASTSATNAKT 208 Query: 291 SASAAAGSKTAAASSASAASTSAGQASASATAAGKSAESAASSAS 335 S +AA S+T A +S + A SA A+ SAT+A SA S +S + Sbjct: 209 SQTAAKTSETNAKTSETNAGASATAAANSATSASNSAASIKTSET 253 Score = 43.7 bits (101), Expect = 0.040, Method: Composition-based stats. 
Identities = 48/133 (36%), Positives = 68/133 (51%), Gaps = 3/133 (2%) Query: 296 AGSKTAAASSASAASTSAGQASASATAAGKSAESAASSAST---ATTKAGEATEQASAAA 352 G T A ++ +A +A A A+ + A T A +A Sbjct: 92 EGDFTYTPPVVGEAQKASREAVQAADRAKGIADKFGDVDTAIAQAKTSENNAKASETADK 151 Query: 353 RSASAAKTSETNAKASETSAESSKTAAASSASSAASSASSASASKDEATRQASAAKSSAT 412 S +AAKTSETNAK SET+A +S TAA++SA++A +S ++A AS A+ A+ AK+S T Sbjct: 152 TSQTAAKTSETNAKTSETNAGASATAASTSATNAKTSETNAGASATAASTSATNAKTSQT 211 Query: 413 TASTKATEAAGSA 425 A T T A S Sbjct: 212 AAKTSETNAKTSE 224 >UniRef50_C7NGP9 Fe-S oxidoreductase n=2 Tax=Actinomycetales RepID=C7NGP9_KYTSD Length = 1388 Score = 54.9 bits (130), Expect = 2e-05, Method: Composition-based stats. Identities = 83/424 (19%), Positives = 153/424 (36%), Gaps = 20/424 (4%) Query: 92 AMTEDDARPE---------ALRRFELMVEEVARNASAVAQNTAAAKKSASDASTSAREAA 142 + EDD PE A+ + E A AV + AA S + + A Sbjct: 847 RVDEDDRSPEGETTVDEQGAVGQVEEDRAAGEEPADAVDEPAAAGAPSDDTTNAADTAGA 906 Query: 143 THAADAADSARAASTSAGQAASSAQSASSSAGTASTKATEASKSAAAAESSKSAAATSAG 202 T AA +A AAS +A QA + + + + + + + + A S+ A +A Sbjct: 907 TTGDAAAGAASAASGTAPQAFTGDRIGADTPTRFGWASQDGAPTGGTAASASQTAPAAAA 966 Query: 203 AAKTSETNASASLQSAATSASTATTKASEAATSARDAAASKEAAKSSETNASSSASS--- 259 + N +A+ + A + + A + A S A ++ T A + Sbjct: 967 DSDADSDNTAAAETTTAAATGGTADGTTPEAEAPAPAGPSAGTAPAAFTGQRIGADTPTR 1026 Query: 260 ---AASSATAAGNSAKAAKTSETNARSSETAAGQSASAAAGSKTAAASSASAASTSAGQA 316 A AT A S A T A S AAG + + +A + + + G A Sbjct: 1027 FGWATDQATGA-PSTSQAAAPATEATVSPGAAGATGAGSAPAGHSGHRIGGESPRRFGWA 1085 Query: 317 SASATAAGKSAESAASSASTATTKAGEATEQASAAARSASAAKTSETNAKASETSAESSK 376 + + +SA AA++ T A AA + + +A T +T+ + + A + Sbjct: 1086 TGTGGEQAESATPAAATQPEPTVTAATQASPEPAAEQPSESAVTEQTSTTVTTSQATPAD 1145 Query: 377 TAAASSASSAASSASSASASKDEATRQASAAKSSATTASTKATEAAGSATAAAQSKSTAE 436 AA +++ +++ + S + ++ + ATE S + AA +++ Sbjct: 1146 DAAPAASGPISTAPAGHSGDRIGLDSPQRFGWATGRGSDQDATEGEASTSQAAVTETAGP 1205 Query: 437 SAATRAETAAKRAEDIASAVALEDASTTKKGIVQLSSATNSTSETLAATPKAVKSAYDNA 496 S +T 
T A +S +T + S S + + A + Sbjct: 1206 STSTEEPTPTVSAPADSSV----QRTTATAPQGHTGTTIGSDSASRFGWAQGGAQAAETE 1261 Query: 497 EKRL 500 +R Sbjct: 1262 VERA 1265 >UniRef50_B6HJ47 Pc21g19350 protein n=1 Tax=Penicillium chrysogenum Wisconsin 54-1255 RepID=B6HJ47_PENCW Length = 1702 Score = 54.9 bits (130), Expect = 2e-05, Method: Composition-based stats. Identities = 58/361 (16%), Positives = 117/361 (32%), Gaps = 5/361 (1%) Query: 110 VEEVARNASAVAQNTAAAKKSASDAST-SAREAATHAADAADSARAASTSAGQAASSAQS 168 VE++ A ++ T ++ A + E A + A + + + A + Sbjct: 282 VEDIVEPAKEESKTTEVVPETTEAAKEETPAETAVEESKAVEDIVEPAKEESKTAEAVPE 341 Query: 169 ASSSAGTASTKATEASKSAAAAESSKSAAATSAGAAKTSETNASASLQSAATSASTATTK 228 A T + + + + A A +++ + T+A A Sbjct: 342 AVVEEKTTEVVPETTEAAKEETPAEAAVEESKAVEAVPEVAEVVPAVEESKTAAEVAQ-A 400 Query: 229 ASEAATSARDA--AASKEAAKSSETNASSSASSAASSATAAGNSAKAAKTSETNARSSET 286 +EAAT + A A +E +A A + A+ + A Sbjct: 401 TTEAATEEKTTEAAPEAAAESKAEEAVPETAQEAPAVEEPKTEVAQPTTEEKPAAEEKVV 460 Query: 287 AAGQSASAAAGSKTAAASSASAASTSAGQASASATAAGKSAESAASSASTATTKAGEATE 346 A A +T A + A AT + + A+ A E Sbjct: 461 EAAPVTVAEPVEETPAVEDSKAEVAEPTTEEKPATEEKVVGVVSETPAAEEAKTAEAIPE 520 Query: 347 QASAAARSASAAKTSETNAKASETSAESSKTAAASSASSAASSASSASASKDEATRQASA 406 A +A + ++T E + ++S A A + SA+++++ + Sbjct: 521 VAQETPAVEAAVEETKTEEAVPEVAQKTSAAE-EPKAEVAEQTTEEKSATEEKSATEEKV 579 Query: 407 AKSSATTASTKATEAAGSATAAAQSKSTAESAATRAETAAKRAEDIASAVALEDASTTKK 466 A++ + T + + + A + AQ E+AA ++TA A E+ + Sbjct: 580 AEAVSETPAAEEAKTAEAIPEIAQETPAVEAAAEESKTAEVVEPSAEEKPATEEKAVEAA 639 Query: 467 G 467 Sbjct: 640 P 640 Score = 51.1 bits (120), Expect = 3e-04, Method: Composition-based stats. 
Identities = 65/381 (17%), Positives = 119/381 (31%), Gaps = 15/381 (3%) Query: 135 STSAREAATHAADAADSARAASTSAGQAASSAQSASSSAGTASTKATEASKSAAAAESSK 194 T A EAA ++ + A + + E S + + Sbjct: 524 ETPAVEAAVEETKTEEAVPEVAQKTSAAEEPKAEVAEQTTEEKSATEEKSATEEKVAEAV 583 Query: 195 SAAATSAGAAKTSETNASASLQSA-ATSASTATTKASEAATSARDAAASKEAAKSSETNA 253 S + A A A +A + T A SA + A++E A + Sbjct: 584 SETPAAEEAKTAEAIPEIAQETPAVEAAAEESKT-AEVVEPSAEEKPATEEKAVEAAPET 642 Query: 254 SSSASSAASSATAAGNSAKAAKTSETNARSSETAAGQSASAAAGSKTAAASSASAASTSA 313 + + A AA + A + A A+ AA + + Sbjct: 643 VAEPVVEEAPAAAAAVVEEKTTEVVPEATEAAKAPVAEAAVEESKAVEAAPEIVQETPAV 702 Query: 314 GQASASATAAGKSAESAASSASTATTKAGEATEQASAAARSASAAKTSETNAKASETSAE 373 G++ A + E + +T K EA + A +E + T Sbjct: 703 GESKVEEVAEPTTEEKPTTETATEEAKPAEAV-SETVAEPVVEETPAAEAAMEEKTTDVV 761 Query: 374 SSKTAAASSASSAASSASSASASKDE-----ATRQASAAKSSATTASTKATEAAGSATAA 428 T AA + AA +A S +E A ++ + +A T T AT A Sbjct: 762 PEATEAAKEETPAAEAAVEESKPAEEPMTEKAVQEPVMEAKAENSADTTTTPVPEIATQA 821 Query: 429 AQSKSTAESAATRAETAAKRAEDIASAVALEDASTTKKGIVQLSSATNSTSETLAATPKA 488 + T E++ + E ++ +++A E +T + + ET P Sbjct: 822 TAVEPTDEASEPKEEVISEPTKEVAEEAPAEHTATEELAAEPV-------KETEVVEPVP 874 Query: 489 VKSAYDNAEKRLQKDQNGADI 509 ++A E +++ A + Sbjct: 875 EQTAEAGKESAVEEPAKAAAV 895 Score = 51.1 bits (120), Expect = 3e-04, Method: Composition-based stats. 
Identities = 62/406 (15%), Positives = 121/406 (29%), Gaps = 7/406 (1%) Query: 111 EEVARNASAVAQNTAAAKKSASDAS------TSAREAATHAADAADSARAASTSAGQAAS 164 EE + A K A A+ A E T +AA+ A T + A Sbjct: 97 EESPAEPAVEESAPVEASKEAEPAAEPVVEEKPAAEPTTAEPEAAEKAAEPVTEESKTAE 156 Query: 165 SAQSASSSAGTASTKATEASKSAAAAESSKSAAATSAGAAKTSETNASASLQSAATSAST 224 + + S E +K A + A + A + A + T Sbjct: 157 AVPETVAEPVEESKAVEEVAKPATEESKTVEAVPETVAEPVEESKAAEEVAKPATEESKT 216 Query: 225 ATTKASEAATSARDAAASKEAAKSSETNASSSASSAASSATAAGNSAKAAKTSETNARSS 284 A + A++E + E ++ S T A + + Sbjct: 217 EVVPDVAAEPIVEETPAAEEIKTAEEVAPAAEEKSVEEKTTEAVPETAEPAAEKPAEAAV 276 Query: 285 ETAAGQSASAAAGSKTAAASSASAASTSAGQASASATAAGKSAESAASSASTATTKAGEA 344 E + + + + +T A + A A + +++ A ++ A Sbjct: 277 EESKAVEDIVEPAKEESKTTEVVPETTEAAKEETPAETAVEESKAVEDIVEPAKEESKTA 336 Query: 345 TEQASAAARSASAAKTSETNAKASETSAESSKTAAASSASSAASSASSASASKDEATRQA 404 A + ET A E + + + + + A A ++ T Sbjct: 337 EAVPEAVVEEKTTEVVPETTEAAKEETPAEAAVEESKAVEAVPEVAEVVPAVEESKTAAE 396 Query: 405 SAAKSSATTASTKATEAAGSATAAAQSKSTAESAATRAETAAKRAEDIASAVALEDASTT 464 A ++ K TEAA A A ++++ A A + ++A E + Sbjct: 397 VAQATTEAATEEKTTEAAPEAAAESKAEEAVPETAQEAPAVEEPKTEVAQPTTEEKPAAE 456 Query: 465 KKGI-VQLSSATNSTSETLAATPKAVKSAYDNAEKRLQKDQNGADI 509 +K + + ET A + A E++ ++ + Sbjct: 457 EKVVEAAPVTVAEPVEETPAVEDSKAEVAEPTTEEKPATEEKVVGV 502 Score = 43.7 bits (101), Expect = 0.040, Method: Composition-based stats. 
Identities = 60/372 (16%), Positives = 122/372 (32%), Gaps = 7/372 (1%) Query: 128 KKSASDASTSAREAATHAADAADSARAASTSAGQAASSAQSASSSAGTASTKATEASKSA 187 + + +S E T AD ++S S + TA+ K Sbjct: 17 TDAPVEPISSIDEKPTAEADLSESTATLVEPDSTQQSKDSTKPEVPMTAAEKKKAKKAKK 76 Query: 188 AAAESSKSAAATSAGAAKTSETNASASLQSAATSASTATTKASEAATSARDAAASKEAAK 247 + +S ++ + A KT E + + +A +AS+ A A + ++ A Sbjct: 77 KQQKKQESVSSIALEADKTPEESPAEPAV-----EESAPVEASKEAEPAAEPVVEEKPAA 131 Query: 248 SSETNASSSASSAASSATAAGNSAKAAKTSETNARSSETAAGQSASAAAGSKTAAASSAS 307 T +A AA T +A+A + A + A A + Sbjct: 132 EPTTAEPEAAEKAAEPVTEESKTAEAVPETVAEPVEESKAVEEVAKPATEESKTVEAVPE 191 Query: 308 AASTSAGQASASATAAGKSAESAASSASTATTKAGEATEQASAAARSASAAKTSETNAKA 367 + ++ A+ A + E + + E +A + K+ Sbjct: 192 TVAEPVEESKAAEEVAKPATEESKTEVVPDVAAEPIVEETPAAEEIKTAEEVAPAAEEKS 251 Query: 368 SETSAESSKTAAASSASSAASSASSASASKDEATRQASAAKSSATTASTKATEAAGSATA 427 E + A A+ + A+ + E + + +S T + TEAA T Sbjct: 252 VEEKTTEAVPETAEPAAEKPAEAAVEESKAVEDIVEPAKEESKTTEVVPETTEAAKEETP 311 Query: 428 AAQSKSTAESAATRAETAAKRAEDIASAVALEDASTTKKGIVQLSSAT--NSTSETLAAT 485 A + +++ E A + ++ + T + + + + A + +E Sbjct: 312 AETAVEESKAVEDIVEPAKEESKTAEAVPEAVVEEKTTEVVPETTEAAKEETPAEAAVEE 371 Query: 486 PKAVKSAYDNAE 497 KAV++ + AE Sbjct: 372 SKAVEAVPEVAE 383 >UniRef50_C1E6C0 Predicted protein n=1 Tax=Micromonas sp. RCC299 RepID=C1E6C0_9CHLO Length = 2355 Score = 54.9 bits (130), Expect = 2e-05, Method: Composition-based stats. 
Identities = 61/278 (21%), Positives = 99/278 (35%), Gaps = 7/278 (2%) Query: 166 AQSASSSAGTASTKATEASKSAAAAESSKSAAATSAGAAKTSETNASASLQSAATSASTA 225 A++A SSA A+ E A A+ + A A + A + + Sbjct: 1395 AKAAESSAKDAAENLKE--LDALRADLEMFNSRVIALTASEASLKAELEEANKKGADLEK 1452 Query: 226 TTKASEAATSARDAAASKEAAKSSETNASSSASSAASSATAAGNSAKAAKTSETNARSSE 285 K EAA A +A+ +EAA + A +S A A ++ + + A Sbjct: 1453 VQKELEAAKKAVEASKKEEAALKKKLEAIEKTTSTADQEKGAKLASLETEIEKLRAELEA 1512 Query: 286 TAAGQSASAAAGSKTAAASSASAASTSAGQASASATAAGKSAESAASSASTATTKAGEAT 345 A A +A A + + A A A + S A A T++ +A Sbjct: 1513 AN-----EATKKQVANAVEAAKKAKSETDKLKAEADKAKADMDKLKSDAKNAKTESDKAK 1567 Query: 346 EQASAAARSASAAKTSETNAKASETSAESSKTAAASSASSAASSASSASASKDEATRQAS 405 +A A AK KA A++ A S A +A + + A A D+A Sbjct: 1568 AEADKAKADMDKAKAEADKLKADMDKAKAEADKAKSDAKNAKTESDKAKAEADKAKADMD 1627 Query: 406 AAKSSATTASTKATEAAGSATAAAQSKSTAESAATRAE 443 AK+ A +A A ++ S A ++ A+ Sbjct: 1628 KAKAEADKLKADMDKAKTEHKAELEAASAAAASTASAK 1665 >UniRef50_Q2TY93 Predicted protein n=2 Tax=Aspergillus RepID=Q2TY93_ASPOR Length = 628 Score = 54.5 bits (129), Expect = 2e-05, Method: Composition-based stats. 
Identities = 76/441 (17%), Positives = 139/441 (31%), Gaps = 24/441 (5%) Query: 116 NASAVAQNTAAAKKSASDASTSAREAATHAADAADSARAASTSAGQAASSAQSASSSAGT 175 ++A +T A++ DA A T + + + + + Sbjct: 79 PSAAPPASTEEAREILQDAVNKAETGPTDKPELVEGEAIGAVEPSVTDEPLKLTPETLP- 137 Query: 176 ASTKATEASKSAAAAESSKSAAATSAGAAKTSETNASASL---QSAATSASTATTKASEA 232 +T TE S E+ + A S A + A + A A+ Sbjct: 138 -ATTGTETSLPERPKETVTDTSDVHAKRPYESSLTAKDDELPHKIAKVDDTVAAAGATSG 196 Query: 233 ATSARDAAASKEAAKSSETNASSSASS---AASSATAAGNSAKAAKTSETNARSSETAAG 289 + + A K + + A + + + + +AA Sbjct: 197 HVAETGSLARKPGVPAETGSVQPQIVPGLGADPNDNVSTVLNVPGLSPIEEKSTVPSAAE 256 Query: 290 QSASAAAGSKTAAASSASAASTSAG--QASASATAAGKSAESAASSASTATTKAGEATEQ 347 SA A + T + A T A A+A AT K +E + + T T+ A + Sbjct: 257 ASAVPAPNTSTEPKGAEDIAPTGAPTSDANADATEKSKDSEISKAEEETVATEPKGAEDI 316 Query: 348 ASAAARSASAAKTSETNAKASETSAESSKTAAASSASSAASSASSASASKDEATRQASAA 407 A A +++ K +E S A+ ++A + AS A A+A Sbjct: 317 APTGAPTSAGNVDVAEQPKDTELSKAEETVASEPPTTAAPAVASEPEAPSTAELTTAAAP 376 Query: 408 KSSATTASTKATEAAGSATAAAQSKSTAESAATRAETAAKRAEDIASAVALEDASTTKKG 467 T + + A + + +K+ + SA T TA A V +S Sbjct: 377 SEVTGTETVSPSGATAPSATDSTTKAASASADTGTGTATSAPAPPAEPVLKVPSSIHDGP 436 Query: 468 IVQLSSATNSTSETLAATPKAVKSAYDNAEKRLQKDQNGADIPDKGCFLNNINAVSKTDF 527 +++SA KSA D A ++++ GA + I +K Sbjct: 437 DSKVTSAN--------------KSAADQAVSNVEQEAEGASANTQAPVTETIPEETKPAA 482 Query: 528 ADKRGMRYVRVNAPAGATSGK 548 + G V+ P + + Sbjct: 483 EVQEGTGAVKNQVPENQPTAR 503 >UniRef50_C0ZX71 Putative uncharacterized protein n=1 Tax=Rhodococcus erythropolis PR4 RepID=C0ZX71_RHOE4 Length = 655 Score = 54.5 bits (129), Expect = 2e-05, Method: Composition-based stats. 
Identities = 66/355 (18%), Positives = 106/355 (29%), Gaps = 36/355 (10%) Query: 179 KATEASKSAAAAESSKSAAATSAGAAKTSETNASASLQSAATSASTATTKASEAATSARD 238 A +A A S A A + A SA A+ AA +A Sbjct: 114 DTKAARDTAVTAAESVGDIADDVAQITEDRAAAELARDEARESAEEASESRGLAAVAAVS 173 Query: 239 AAASKEAAKSSETNASSSASSAASSATAAGNSAKAAKTSETNARSSETAAGQSAS----- 293 A S A ET A+ AS+A+SSATAA S AA E NA ++ A Q+A+ Sbjct: 174 AGDSSTLAGQHETAAAGHASNASSSATAADQSESAAAQHEVNAETAADLAVQTAAGIQDV 233 Query: 294 ----------------------AAAGSKTAAASSASAASTSAGQASASATAAGKSAESAA 331 + T A A Q S E+ A Sbjct: 234 AADAAQVAADRQAVESAASAVATDRQTVTDARGVVVTAKDDVEQVKLDVHQVKSSIENTA 293 Query: 332 SSASTAT--------TKAGEATEQASAAARSASAAKTSETNA-KASETSAESSKTAAASS 382 + T+ + + + A A A+ + S + A + Sbjct: 294 TLVDQTLTQYGAQFVTERELSQQAVTDATTQAQRAEDAAEGIVAGSVLDNAVTTPKIADA 353 Query: 383 ASSAASSASSASASKDEATRQASAAKSSATTASTKATEAAGSATAAAQSKSTAESAATRA 442 A + + ++ + AS D+A + T A T + A + A Sbjct: 354 AVTKSKTSPTVQASLDKADGAVQEGDTRLTNARTPTAHSHAVADVTGLQGALDGKAPASH 413 Query: 443 ETAAKRAEDIASAVALEDASTTKKGIVQLSSATNSTSETLAATPKAVKSAYDNAE 497 + + + + +A + K + + A P V + A Sbjct: 414 SHSESQVTGLTAKLAALQPLSEKGQANGYAPLDATGKLAAAYQPSYVDDVLEYAN 468 >UniRef50_A3YYY9 Putative exonuclease SbcC n=1 Tax=Synechococcus sp. WH 5701 RepID=A3YYY9_9SYNE Length = 1002 Score = 54.5 bits (129), Expect = 2e-05, Method: Composition-based stats. 
Identities = 66/385 (17%), Positives = 135/385 (35%), Gaps = 16/385 (4%) Query: 90 LGAMTEDDARPEALRRFELMVEEVARNASAVAQNTAAAKKSASDASTSAREAATHAADAA 149 L +TE +A+ EA+ F + R + + + A+++A A ++ +A Sbjct: 286 LTRLTELEAKQEAVADFRQRLSRAER--AEALRPSVDAEQAARVALSTLEAGIKAELQSA 343 Query: 150 DSARAASTSAGQAASSAQSASSSAGTASTKATEASKSAAAAESSKSAAATSAGAAKTSET 209 AR + + + + + A + A ++ ++ A AG A+ Sbjct: 344 IRARDTADALPVSLVLLDLTALPSPEELGSARNDLAARRAELTALASQALEAGQAQARAA 403 Query: 210 NASASLQSAATSASTATTKASEAATSARDAAASKEAAKSSETNASSSASSAASSATAAGN 269 A+A ++A S+++ + + A A+ A S+ +A + A Sbjct: 404 AAAARARAADLRLSSSSAARESRQQARQAAGAAFTKACSARDQLDGLQRAAQDAHDCARA 463 Query: 270 SAKAAKTSETNARSSETAAGQSASAAAGSKTAAASSASAASTSAGQASASATAAGKSAES 329 A + + A A + A +A A +A AG Sbjct: 464 VA------AITPALHQEKIAIATKAKADGRLNQAQAALNDQRRRQIAGMAARLAGGLVPE 517 Query: 330 AASSASTATTKAGEATEQASAAARSASAAKTSETNAKASETSAESSKTAAA--------S 381 A + A A + A A SE +A + + A Sbjct: 518 APCPVCGSAHHPAPAQPSQDAISEEAITAAESELSAATTAAQQAAVAVEKAQGERKAVLE 577 Query: 382 SASSAASSASSASASKDEATRQASAAKSSATTASTKATEAAGSATAAAQSKSTAESAATR 441 A +AAS +A + +A++ +AA++ A T ++ E + +++ +SA+T Sbjct: 578 RAGTAASDPVAADTAARQASQALTAAQALAETVASLEQEISNHERELEALQASIQSASTE 637 Query: 442 AETAAKRAEDIASAVALEDASTTKK 466 A+ A D + + T++ Sbjct: 638 LAMQARAATDETNRAQALSTAITRE 662 Score = 49.9 bits (117), Expect = 6e-04, Method: Composition-based stats. 
Identities = 77/325 (23%), Positives = 125/325 (38%), Gaps = 10/325 (3%) Query: 141 AATHAADAADSARAASTSAGQAASSAQSASSSAGTASTKATE-ASKSAAAAESSKSAAAT 199 +A H A A S A S A AA S SA+++A + A E A A AA+ Sbjct: 525 SAHHPAPAQPSQDAISEEAITAAESELSAATTAAQQAAVAVEKAQGERKAVLERAGTAAS 584 Query: 200 SAGAAKTSETNASASLQSAATSASTATTKASEAATSARDAAASKEAAKSSETNASSSASS 259 AA T+ AS +L +A A T + E + R+ A + + +S+ T + A Sbjct: 585 DPVAADTAARQASQALTAAQALAETVASLEQEISNHERELEALQASIQSASTELAMQA-R 643 Query: 260 AASSATAAGNSAKAAKTSETNARSSETAAGQSASAAAGSKTAAAS---SASAASTSAGQA 316 AA+ T + A T E A +S + A A+ ++++AST QA Sbjct: 644 AATDETNRAQALSTAITRELGEGVDPKQALKSIEPLEAALKALAARCHASASASTRLEQA 703 Query: 317 SASATAAGKSAESAASSASTATTKAGEATEQASAAARSASAAKTSETNAKASETSAESSK 376 S+ +E A A A K + A+ A +T + SA+ + Sbjct: 704 SSRLARDLAGSEFADGQAVVAALKPETMRQ---LWAQRIKAFETEVIELRGLLASADLAD 760 Query: 377 TAAASSASSAASSASSASASKDEATRQASAAKSSATTASTKATEAAGSATAAAQSKSTAE 436 ++AA A A+ + A + + A T + T S A + S E Sbjct: 761 LPQERPDTAAALEAVLAADAARTAAVERHSEARGAQTEIQRLTSEHSSG--AGELASRRE 818 Query: 437 SAATRAETAAKRAEDIASAVALEDA 461 A + A + + A ++L+ Sbjct: 819 QAQLFSAVADRCSGRTAPFISLQRW 843 >UniRef50_Q9TYL3 Putative uncharacterized protein n=2 Tax=Caenorhabditis elegans RepID=Q9TYL3_CAEEL Length = 605 Score = 54.1 bits (128), Expect = 3e-05, Method: Composition-based stats. 
Identities = 89/373 (23%), Positives = 164/373 (43%), Gaps = 4/373 (1%) Query: 173 AGTASTKATEASKSAAAAESSKSAAATSAGAAKTSETNASASLQSAATSASTATTKASEA 232 AG S + ++ + S+ S + T++ AA S + +A +A+T+A +T+ + Sbjct: 17 AGALSQDTSTSTVTTTTVASTSSGSTTASTAAGGSTSTTAAGGSTASTAAGGSTSTTAAG 76 Query: 233 ATSARDAAASKEAAKSSETNASSSASSAASSATAAGNSAKAAKTSETNARSSETAAGQSA 292 ++ AA A+ ++ + +S+A+ +++ TAAG S + A ++ + + Sbjct: 77 GSTVSTAAGGSTASTAAGGSTASTAAGGSTATTAAGGSTATTAAGGSTASTAAGGSTATT 136 Query: 293 SAAAGSKTAAASSASAASTSAGQASASATAAGKSAESAASSASTATTKAGEATEQASAAA 352 +A + + AA ++A++ A S ++TAAG S + A+ STATT AG +T +A Sbjct: 137 AAGGSTASTAAGGSTAST--AAGGSTASTAAGGSTATTAAGGSTATTAAGGSTASTAAGG 194 Query: 353 RSASAAKTSETNAKASETSAESSKTAAASSASSAASSASSASASKDEATRQASAAKSSAT 412 +A+ A T + A+ S S+ +++ ++A S ++ +A A+ A + +S Sbjct: 195 STATTAAGGSTASTAAGGSTASTAAGGSTATTAAGGSTATTAAGGSTASTAAGGSTASTA 254 Query: 413 TASTKATEAAGSATAAAQSKSTAESAATRAETAAKRA--EDIASAVALEDASTTKKGIVQ 470 + AT AAG +TA + + S A TA A ++A AST G Sbjct: 255 AGGSTATTAAGGSTATTAAGGSTASTAAGGSTATTAAGGSTASTAAGGSTASTAAGGSTA 314 Query: 471 LSSATNSTSETLAATPKAVKSAYDNAEKRLQKDQNGADIPDKGCFLNNINAVSKTDFADK 530 ++A ST+ T A A +A + + S A Sbjct: 315 TTAAGGSTATTAAGGSTASTAAGGSTASTAAGGSTATTAAGGSTASTGVTVASSPAPAAA 374 Query: 531 RGMRYVRVNAPAG 543 Y R N P+G Sbjct: 375 CEPGYKRFNRPSG 387 >UniRef50_C5M5Z1 Predicted protein n=2 Tax=Saccharomycetales RepID=C5M5Z1_CANTT Length = 1628 Score = 54.1 bits (128), Expect = 3e-05, Method: Composition-based stats. 
Identities = 67/417 (16%), Positives = 153/417 (36%), Gaps = 3/417 (0%) Query: 110 VEEVARNASAVAQNTAAAKKSASDASTSAREAATHAADAADSARAASTSAGQAASSAQSA 169 + A++ Q A SA ++ + A + ++S Q SSAQ Sbjct: 136 TSSAQQQATSSVQQPQQATSSAQPQQATSSAQQQATSSAQQPQQETTSSIQQQQSSAQQP 195 Query: 170 SSSAGTASTKATEASKSAAAAESSKSAAATSAGAAKTSETNASASLQSAATSASTATTKA 229 ++SA ++ A + + + A S+ + ETN+S + +S + Sbjct: 196 ATSANQEPATSSVQQPQQATSSTQQLQEAASSIHQQEQETNSSPTATPVVSSIQQPEQET 255 Query: 230 SEA-ATSARDAAASKEAAKSSETNASSSASSAASSATAAGNSAKAAKTSETNARSSETAA 288 + + AT+A ++ E + ++SA + ++ + + T +S Sbjct: 256 TPSPATTAAISSVQPEQELQEQEKETASAPVTTPATSSVQPEQETTSSPATTPATSSVQP 315 Query: 289 GQSASAAAGSKTAAASSASAASTSAGQASASATAAGKSAESAASSASTATTKAGEATEQA 348 Q + + + A +S T+ A+ AT++ + E + + T ++E A Sbjct: 316 EQETTPSPATTPATSSVQPEQETTPSPATTPATSSVQQPE--DETTPSPATTPATSSETA 373 Query: 349 SAAARSASAAKTSETNAKASETSAESSKTAAASSASSAASSASSASASKDEATRQASAAK 408 + + T T+ SS + +S++ + ++ +S + D + S Sbjct: 374 VRPSDTQLEDTTVSAEEPQPTTTHTSSTFSTSSTSVTEKTTVTSGTTLPDVTSETTSTRD 433 Query: 409 SSATTASTKATEAAGSATAAAQSKSTAESAATRAETAAKRAEDIASAVALEDASTTKKGI 468 ++ T+ T T + T+ +++ + AT T+ A ++ + T + Sbjct: 434 TATETSPTLETPSVQDVTSQTPVDTSSPTPATSDVTSPTPATSDVTSQTPVTSGVTSQTP 493 Query: 469 VQLSSATNSTSETLAATPKAVKSAYDNAEKRLQKDQNGADIPDKGCFLNNINAVSKT 525 V SS T +TS+T + + F++++ A + + Sbjct: 494 VDTSSPTPATSDTTDTVTPVTSELDTTTSQDAETVSENNTSSPDSNFVSSVGAETSS 550 >UniRef50_C1FF73 Predicted protein n=1 Tax=Micromonas sp. RCC299 RepID=C1FF73_9CHLO Length = 369 Score = 53.4 bits (126), Expect = 5e-05, Method: Composition-based stats. 
Identities = 61/280 (21%), Positives = 112/280 (40%), Gaps = 3/280 (1%) Query: 112 EVARNASAVAQNTAAAKKSASDASTSAREAATHAADAADSARAASTSAGQAASSAQSASS 171 E + + Q A A ++ ++ + +AA A+A A A A A ++A+++ Sbjct: 71 EARKAEAVAKQAKAEATRAREESDAAKVQAAEDVANAKAIAADAKKEAELALNAAKTSRL 130 Query: 172 SAGTASTKATEASKSAAAAESSKSAAATSAGAAKTSETNASASLQSAATSASTATTKASE 231 A A+ KA +A + A + + S A A A +A L + + S + Sbjct: 131 DAAAAAKKAMDAQRELARVKDAASRAEKLATAKMQGADRKTAQLMAKSASLEHEKKSLAA 190 Query: 232 AATSARDAAASKEAAKSSETNASSSASSAASSATAAGNSAKAAKTSETNARSSETAAGQS 291 + S EAA +S T+A +S+ ++ S AA A + + NA+ A + Sbjct: 191 EHAETKATLLSVEAALASATDALASSEASNKSIRAADAQKYAKEVAALNAKYDAKTAQLA 250 Query: 292 ASAAAGSKTAAASSASAASTSAGQASASATAAGKSA---ESAASSASTATTKAGEATEQA 348 A+ A+ A + A A A + + A + A + ++ +A +A Sbjct: 251 VELASAKAANVAAFELANTLRAQHAKECARHSAELAKVKDVAKKALDAVKSREADAERRA 310 Query: 349 SAAARSASAAKTSETNAKASETSAESSKTAAASSASSAAS 388 A K + A+ S +AE + A +S +SA Sbjct: 311 ERLHEEVMALKVAAAEARESLAAAEGNVLDAKNSRASAVR 350 >UniRef50_C7Z475 Putative uncharacterized protein n=1 Tax=Nectria haematococca mpVI 77-13-4 RepID=C7Z475_NECH7 Length = 868 Score = 53.4 bits (126), Expect = 5e-05, Method: Composition-based stats. 
Identities = 73/306 (23%), Positives = 138/306 (45%), Gaps = 3/306 (0%) Query: 181 TEASKSAAAAESSKSAAATSAGAAKTSETNASASLQSAAT-SASTATTKASEAATSARDA 239 E S +++ A SA T S S AT SAS + T+ SA Sbjct: 242 REPVSSPTTVATTQEATTQSATETGTETATESGSATETATGSASESATETETGTESATGT 301 Query: 240 AASKEAAKSSETNASSSASSAASSATAAGNSAKAAKTSETNARSSETAAGQSASAAAGSK 299 + E +SET ++ +++ + TA + + + T + + + + SA Sbjct: 302 ETASETETASETETATESATETGTETATESGSATESATGTESATETASETATESATETES 361 Query: 300 TAAASSASAASTSAGQASASATAAGKSAESAASSASTATTKAGEATEQASAAARSASAAK 359 T ++ S+ T AS +AT + ES + + + T +TE A+ +A + +A Sbjct: 362 TTETATESSTETGTETASETATESATETESTTGTETASETGTESSTETATESASTTESAS 421 Query: 360 TSETNAKASETSAESSKTAAASSASSAASSASSASASKDEATRQASAAKSSATTASTKAT 419 +E+ ++ T+ +S T AS++ S +AS + + AT A+ + S+ TAS AT Sbjct: 422 ATESTTESGSTTEGASTTETASASGSTTETASETATETETATETATESGSTTETAS--AT 479 Query: 420 EAAGSATAAAQSKSTAESAATRAETAAKRAEDIASAVALEDASTTKKGIVQLSSATNSTS 479 E+A + A+++ ST+E+A+ A T+ + +A S + ++ + ST+ Sbjct: 480 ESASTTETASETASTSETASESASTSTEAGSTTETASTTASESASTSTESGSTTESASTT 539 Query: 480 ETLAAT 485 E+ + T Sbjct: 540 ESASTT 545 >UniRef50_A1BA44 OmpA/MotB domain protein n=1 Tax=Paracoccus denitrificans PD1222 RepID=A1BA44_PARDP Length = 711 Score = 53.4 bits (126), Expect = 6e-05, Method: Composition-based stats. 
Identities = 46/303 (15%), Positives = 88/303 (29%), Gaps = 18/303 (5%) Query: 110 VEEVARNASAVAQNTAAAKKSASDASTSAREAATHAADAADSARAASTSAGQAASSAQSA 169 +E + A + + +A T A E + A A Q +A Sbjct: 109 QQEPQPRSEAAPEEQETSPPAAKPEVTRAEEPQAKPPVVEAAEPPAKPQAAQPDQAADQD 168 Query: 170 SSSAGTASTKATEASKSAAAAESSKSAAATSAGAAKTSETNASASLQSAATSASTATTKA 229 A T + A A+ ++ A A A A Sbjct: 169 GLPAPKPDTPDAVTTTDPAPAQKPQADA-------------ADPQGSRPAAQPEAAPDAE 215 Query: 230 SEAATSARDAAASKEAAKSSETNASSSASSAASSATAAGNSAKAAKTSETNARSSETAAG 289 ++A T A E + T A + A+ + A+ +A ++ET Sbjct: 216 TQATTETTPAEPRPEQSDEPPTAAPDAQQDDAAHQAPGAEPPQPAEQQADDAPATETPDA 275 Query: 290 QSASAAAGSKTAAASSASAASTSAGQASASATAAGKSAESAASSASTATTKAGEATEQAS 349 Q + + A+ + + A + E A +A EQA+ Sbjct: 276 QELERRLQEQ---SQEAAPQPDQSDEQRAEDGQRPDAQELEGRLQEQADGQAQPEAEQAT 332 Query: 350 AAARSASAAKTSETNAKASETSAESSKTAAASSASSAASSA--SSASASKDEATRQASAA 407 A+ + +A+E +A ++ A ++ + + + A A Sbjct: 333 EVQAPEVQAQPNAVAQRAAEEAAPAAAAALSAGEEQEEGQGELNEVQITDENARSSAEEF 392 Query: 408 KSS 410 +S Sbjct: 393 ATS 395 Score = 51.1 bits (120), Expect = 2e-04, Method: Composition-based stats. 
Identities = 50/317 (15%), Positives = 99/317 (31%), Gaps = 16/317 (5%) Query: 153 RAASTSAGQAASSAQSASSSAGTASTKATEASKSAAAAESSKSAAATSAGAAKTSETNA- 211 + + A +++ +A T+A E E+++ A A + Sbjct: 110 QEPQPRSEAAPEEQETSPPAAKPEVTRAEEPQAKPPVVEAAEPPAKPQAAQPDQAADQDG 169 Query: 212 -------SASLQSAATSASTATTKASEAATSARDAAASKEAAKSSETNASSSASSAASSA 264 + + A +A A AA EAA +ET A++ + A Sbjct: 170 LPAPKPDTPDAVTTTDPAPAQKPQADAADPQGSRPAAQPEAAPDAETQATTETTPAEPRP 229 Query: 265 TAAGNSAKAAKTSETNARSSETAAGQSASAAAGSKTAA----ASSASAASTSAGQASASA 320 + AA ++ + + + + A A A + S A Sbjct: 230 EQSDEPPTAAPDAQQDDAAHQAPGAEPPQPAEQQADDAPATETPDAQELERRLQEQSQEA 289 Query: 321 TAAGKSAESAASSASTATTKAGEATEQASAAARSASAAKTSETNAKASETSAESSKTAAA 380 ++ + A A T +A E A+ + A Sbjct: 290 APQPDQSDEQRAEDGQRPDAQELEGRLQEQADGQAQPEAEQATEVQAPEVQAQPNAVAQR 349 Query: 381 SSASSAASSASSASASKDEATRQA--SAAKSSATTASTKATEAAGS--ATAAAQSKSTAE 436 ++ +A ++A++ SA +++ Q + + + A + A E A S Q + A Sbjct: 350 AAEEAAPAAAAALSAGEEQEEGQGELNEVQITDENARSSAEEFATSVAQGLQQQQPAEAP 409 Query: 437 SAATRAETAAKRAEDIA 453 R + +DIA Sbjct: 410 QEEARRDDGDDTVKDIA 426 >UniRef50_UPI0001925BA0 PREDICTED: similar to mucin 2 n=5 Tax=Hydra magnipapillata RepID=UPI0001925BA0 Length = 3408 Score = 53.0 bits (125), Expect = 6e-05, Method: Composition-based stats. 
Identities = 64/385 (16%), Positives = 137/385 (35%), Gaps = 6/385 (1%) Query: 113 VARNASAVAQNTAAAKKSASDASTSAREAATHAADAADSARAASTSAGQAASSAQSASSS 172 V S V +T A + + AAT + S ST+ + S+ Sbjct: 2113 VETTPSEVQTSTNTASPIEETTTIVSTAAATETVETTPSEVQTSTNTASQIEETTTIVST 2172 Query: 173 AGTASTKATEASKSAAAAESSKSAAATSAGAAKTSETNASASLQSAATSASTATTKASEA 232 A T T S+ ++ T+ + ++ T + S +++ + E Sbjct: 2173 AAATDTVETTPSEVQTTTNTASPIEETTTIVSTSAATETVKTTPSEVQTSTNTASAIEET 2232 Query: 233 ATSARDAAASKEAAKSSETNASSSASSAASSATAAGNSAKAAKTSETNARSSETAAGQSA 292 T AAA + + +S+ +++ T S AA +T + + Sbjct: 2233 TTIVSTAAAIETVETTPSEVQTSTNTASQIEETTTIVSTTAAI--DTVKTTPSEVQTSTN 2290 Query: 293 SAAAGSKTAAASSASAASTSAGQASASATAAGKSAESAASSASTATTKAGEATEQASAAA 352 +A+ +T S SAA+ + + +A + + +T A T + + + Sbjct: 2291 TASPIEETTTIVSTSAATETVETTPTEVQTSTNTASQIEKTTTIVSTAAATDTVETTPSE 2350 Query: 353 RSASAAKTSETNAKASETSAESSKTAAASSASSAASSASSASASKDEATRQASAAKSSAT 412 S S+ + S ++ ++ S +S ++AS ++ T ++AA + Sbjct: 2351 VQTSTNTASQIEETTTIVSTAAATETVETTPSEVQTSTNTASQIEETTTIVSTAAATDTV 2410 Query: 413 TASTKATEAAGSATAAAQSKSTAESAATRAETAAKRAEDIASAVALEDASTTKKGIVQLS 472 + + + + + +T S + +T ++ + IV S Sbjct: 2411 ETTPSEVQRTTNTASPIEETTTIVSTSAATDTVETTPSEVQTTTNTASPIEETTTIVSTS 2470 Query: 473 SATNSTSETLAATPKAVKSAYDNAE 497 +A SET+ TP V+++ + A Sbjct: 2471 AA----SETVETTPSEVQTSTNTAS 2491 Score = 49.9 bits (117), Expect = 6e-04, Method: Composition-based stats. 
Identities = 69/386 (17%), Positives = 140/386 (36%), Gaps = 8/386 (2%) Query: 113 VARNASAVAQNTAAAKKSASDASTSAREAATHAADAADSARAASTSAGQAASSAQSASSS 172 V S V T A + + AAT + S ST+ + S+ Sbjct: 595 VETTPSEVEITTNTASPIEEKTTIVSTSAATETVETTPSEVQTSTNTASQIEETTTIVST 654 Query: 173 AGTASTKATEASKSAAAAESSKSAAATSAGAAKTSETNASASLQSAATSASTATTKASEA 232 A T T S+ ++ T+ ++ T + S ++ + E Sbjct: 655 AAATDTFETTPSEVQTTTNTASPIEETTTIVRTSAATEPVETNPSKVQKSTNTASAIEET 714 Query: 233 ATSARDAAASKE-AAKSSETNASSSASSAASSATAAGNSAKAAKTSETNARSSETAAGQS 291 T AAA++ SE +++ +S+ T +++ A +T ET +T+A Sbjct: 715 TTIVITAAATETVETTPSEVQITTNTASSIEETTTIVSTSAATETIETTPSEVQTSA--- 771 Query: 292 ASAAAGSKTAAASSASAASTSAGQASASATAAGKSAESAASSASTATTKAGEATEQASAA 351 +A+A +T S +AA+ + + + +A + + +T A T + + + Sbjct: 772 NAASAIEETTTIVSTAAATETVETTPSEVQTSTNTASQIEETTTIVSTTAAIDTVKTTPS 831 Query: 352 ARSASAAKTSETNAKASETSAESSKTAAASSASSAASSASSASASKDEATRQASAAKSSA 411 S S + S + ++ S +S ++AS ++ T ++AA Sbjct: 832 EVQTSTNTASPIEETTTIVSTSEATETVETTPSEVQTSTNTASQIEETTTIVSTAAAIDT 891 Query: 412 TTASTKATEAAGSATAAAQSKSTAESAATRAETAAKRAEDIASAVALEDASTTKKGIVQL 471 + + + + + +T S + +T D+ + IV Sbjct: 892 VETTPSEVQTTTNTASPIEETTTIVSTSAATDTVETTPSDVQTTTNTASPIEETTTIVST 951 Query: 472 SSATNSTSETLAATPKAVKSAYDNAE 497 S+A SET+ TP V+++ + A Sbjct: 952 SAA----SETVETTPSEVQTSTNTAS 973 Score = 49.9 bits (117), Expect = 6e-04, Method: Composition-based stats. 
Identities = 65/424 (15%), Positives = 142/424 (33%), Gaps = 11/424 (2%) Query: 81 SQPGTLNDFLGAMTEDDARPEALRRFELMVEEVARNASAVAQNTAAAKKSASDASTSARE 140 S ++ +E E V+ + + T ++ S + S E Sbjct: 818 STTAAIDTVKTTPSEVQTSTNTASPIEETTTIVSTSEATETVETTPSEVQTSTNTASQIE 877 Query: 141 AATHAADAADSARAASTSAGQAASSAQSASSSAGTASTKATEASKSAAAAESS----KSA 196 T A + T+ + ++ +AS T + +T A+ S + Sbjct: 878 ETTTIVSTAAAIDTVETTPSEVQTTTNTASPIEETTTIVSTSAATDTVETTPSDVQTTTN 937 Query: 197 AATSAGAAKTSETNASASLQSAATSASTATTKASEAATSARDAAASKEAAKSSETNASSS 256 A+ T + ++AS T + T+ + +A S AA + S Sbjct: 938 TASPIEETTTIVSTSAASETVETTPSEVQTSTNTASAIEETTTIVSTAAATETVETTPSK 997 Query: 257 ASSAASSA---TAAGNSAKAAKTSETNARSSETAAGQSASAAAGSKTAAASSASAASTSA 313 ++ ++A A +ET + + +A+ K S +AA + Sbjct: 998 VQTSTNTASPIEETTTIVSTAAATETVETTPSEVQTSTNTASPIKKITTIVSTTAARGTV 1057 Query: 314 GQASASATAAGKSAESAASSASTATTKAGEATEQASAAARSASAAKTSETNAKASETSAE 373 + +A + + +T A T + + + S S + S Sbjct: 1058 ETTPTEVQTSTNTASQIEETTTIVSTTAAIDTVKTTPSEVQTSTNTASPIEETTTIVSTS 1117 Query: 374 SSKTAAASSASSAASSASSASASKDEATRQASAAKSSATTASTKATEAAGSATAAAQSKS 433 + ++ S +S ++AS ++ T ++AA + + + + + + + + Sbjct: 1118 EATETVETTPSEVQTSTNTASPIEETTTIVSTAAATDTVETTPSEVQTSTNTASPIEETT 1177 Query: 434 TAESAATRAETAAKRAEDIASAVALEDASTTKKGIVQLSSATNSTSETLAATPKAVKSAY 493 T S + +T ++ + IV S+A SET+ TP V+++ Sbjct: 1178 TIVSTSAATDTVETTPSEVQTTTNTASPIEETTTIVSTSAA----SETVETTPSEVQTST 1233 Query: 494 DNAE 497 + A Sbjct: 1234 NTAS 1237 >UniRef50_C7YSP6 Predicted protein n=1 Tax=Nectria haematococca mpVI 77-13-4 RepID=C7YSP6_NECH7 Length = 491 Score = 53.0 bits (125), Expect = 6e-05, Method: Composition-based stats. 
Identities = 72/415 (17%), Positives = 114/415 (27%), Gaps = 16/415 (3%) Query: 85 TLNDFLGAMTEDDARPEAL----RRFELMVE-----EVARNASAVAQNTAAAKKSASDAS 135 T++D L ++E A+ EAL E + E A A A + A + A Sbjct: 41 TVHDLLKHLSEAAAQIEALEKAKADLEAQLATPAASEAAEEAPAAEEEKAVEETPAEPEK 100 Query: 136 TSAREAATHAADAADSARAASTSAGQAASSAQSASSSAGTASTKATEASKSAAAAESSKS 195 + A + A + + +A+ ++ E K A + Sbjct: 101 VAEEVPAAAEPEEEKPAEEEKAAPVEEVPAAEPEKAAEEEKPAAEAEVEKPVEAEPEKPA 160 Query: 196 AAATSAGAAKTSETNASASLQSAATSASTATTKASEAATSARDAAASKEAAKSSETNASS 255 A +A K +E A + A A + A+ Sbjct: 161 EEAPAAEEEKVAEPEKVAEPEEKPAEVEPEKVVEEAPAAPAEEKPVEAPVAEEKPAPVEE 220 Query: 256 SASSAASSATAAGNSAKAAKTSETNARSSETAAGQSASAAAGSKTAAASSASAASTSAGQ 315 + A A +A K +E + AA + A + A A Sbjct: 221 EKPAPAEEKAAEPEKVEAEKPTEEEKPTEAPAAEEKPVEAPVVEEKPAPVEEKAV--EAP 278 Query: 316 ASASATAAGKSAESAASSASTATTKAGEATEQASAAARSASAAKTSETNAKASETSAESS 375 A+ A A A K E + A A A E A ET A Sbjct: 279 AAEVEKPAEPEAAKPVEETPAAEEKPVEEEKAAEKVAEEEKPAPVEEKAAPVEETPAAEP 338 Query: 376 KTAAASSASSAASSASSASASKDEATRQASAAKSSATTASTKATEAAGSATAAAQSKSTA 435 + A + A + +A + A + + T A E + K Sbjct: 339 EKPAEEKPAEAEKPVEAPAAEEKPAPVEEKVVEEKPVEVETPAVEEKAVEAPVVEEKPAP 398 Query: 436 --ESAATRAETAAKRAEDIASAVALEDASTTKKGIVQLSSATNSTSETLAATPKA 488 E AA ET E A E + ++ +E A P Sbjct: 399 VEEKAAPVEETPVAEPEKPAEEKPAEVEKVVE---APVAEEKPVEAEPTKAEPVP 450 Score = 51.8 bits (122), Expect = 1e-04, Method: Composition-based stats. 
Identities = 65/408 (15%), Positives = 110/408 (26%), Gaps = 1/408 (0%) Query: 107 ELMVEEVARNASAVAQNTAAAKKSASDASTSAREAATHAADAADSARAASTSAGQAASSA 166 E V + Q A K ++A T + AA A A A Sbjct: 12 EADVASAQKYGDVAQQAIALLSKFKTEAKTVHDLLKHLSEAAAQIEALEKAKADLEAQLA 71 Query: 167 QSASSSAGTASTKATEASKSAAAAESSKSAAATSAGAAKTSETNASASLQSAATSASTAT 226 A+S A + A E + A AA+ E + ++A A Sbjct: 72 TPAASEAAEEAPAAEEEKAVEETPAEPEKVAEEVPAAAEPEEEKPAEEEKAAPVEEVPAA 131 Query: 227 TKASEAATSARDAAASKEAAKSSETNASSSASSAASSATAAGNSAKAAKTSETNARSSET 286 A A A E +E + + AA A A + E Sbjct: 132 EPEKAAEEEKPAAEAEVEKPVEAEPEKPAEEAPAAEEEKVAEPEKVAEPEEKPAEVEPEK 191 Query: 287 AAGQSASAAAGSKTAAASSASAASTSAGQASASATAAGKSAESAASSASTATTKAGEATE 346 ++ +A A K A A + + A A K+AE A T + Sbjct: 192 VVEEAPAAPAEEKPVEAPVAEE-KPAPVEEEKPAPAEEKAAEPEKVEAEKPTEEEKPTEA 250 Query: 347 QASAAARSASAAKTSETNAKASETSAESSKTAAASSASSAASSASSASASKDEATRQASA 406 A+ + + + + + AA A++++ + A Sbjct: 251 PAAEEKPVEAPVVEEKPAPVEEKAVEAPAAEVEKPAEPEAAKPVEETPAAEEKPVEEEKA 310 Query: 407 AKSSATTASTKATEAAGSATAAAQSKSTAESAATRAETAAKRAEDIASAVALEDASTTKK 466 A+ A E + + + A + A K E A+ Sbjct: 311 AEKVAEEEKPAPVEEKAAPVEETPAAEPEKPAEEKPAEAEKPVEAPAAEEKPAPVEEKVV 370 Query: 467 GIVQLSSATNSTSETLAATPKAVKSAYDNAEKRLQKDQNGADIPDKGC 514 + T + E P + EK ++ P+K Sbjct: 371 EEKPVEVETPAVEEKAVEAPVVEEKPAPVEEKAAPVEETPVAEPEKPA 418 >UniRef50_UPI0000DB6B60 PREDICTED: hypothetical protein n=2 Tax=Apis mellifera RepID=UPI0000DB6B60 Length = 1633 Score = 53.0 bits (125), Expect = 6e-05, Method: Composition-based stats. 
Identities = 70/416 (16%), Positives = 144/416 (34%), Gaps = 17/416 (4%) Query: 93 MTEDDARPEALRRFELMVEEVARNASAVAQNTAAAKKSASDASTSAREAATHAADAADSA 152 + E + + ++ E+ E+ + + K +S Sbjct: 1192 IKEPEIKEPEIKEPEIKEPEIKEPEIKEPEIKESEIKEIEVSSQKPEIKELLKEPEIKET 1251 Query: 153 RAASTSAGQAASSAQSASSSAGTASTKATEASKSAAAAESSKSAAATSAGAAKTSETNAS 212 + A +++ ++ AA A +++S A T A K ++T S Sbjct: 1252 EIKEPEIEKVAEVSENKVVETAAIASATAAVVAGAAGAVAAQSKAKTKALGTKPTKTTTS 1311 Query: 213 ASLQSAATSASTATTKASEAATSARDAAASKEAAKSSETNASSSASSAASSATAAGNSAK 272 + S T+ +K + T AAA K+ + ++ + +S S+ ++ S Sbjct: 1312 KPTPT--RSTPTSPSKTVSSTTRTSTAAAMKKPSTTTPSRPKDLDASKKSTISSTTTSKS 1369 Query: 273 AAKTSETNARSSETAAGQSASAAAGSKTAAASSASAASTSAGQASASATAAGKSAESAAS 332 + S + +S TA ++ + + + AS AS TA G S S Sbjct: 1370 SVAKSASKTTTSTTATKSTSKTSVSTTSKPKPVASTASKPITVTDKKPTANGDVKSSNKS 1429 Query: 333 SASTATTKAGEATEQASAAARSASAAKTSETNAKASETSAESSKTAAASSASSAASSASS 392 + A +++ +AS A + KTS T A T+A ++++A+ ++ A Sbjct: 1430 AT------ATKSSSKASPLATKTTLVKTSSTRASTGTTTAPKVRSSSANKLNTTA----- 1478 Query: 393 ASASKDEATRQASAAKSSATTASTKATEAAGSATAAAQSKSTAESAATRAETAAKRAEDI 452 K +T ++ A + TA + T + + + T + + Sbjct: 1479 ----KASSTTISTTASNRPKTAPSSGTASKPRMSLNKLPAIDKQVKETANKQISMGRTST 1534 Query: 453 ASAVALEDASTTKKGIVQLSSATNSTSETLAATPKAVKSAYDNAEKRLQKDQNGAD 508 ++ S T + A+ +T T A+P + K + G D Sbjct: 1535 PASKTTSRLSMTSSSTTTIKRASLTTKTTTVASPTKKSTPVSKTSKTSLSGKTGGD 1590 >UniRef50_Q6ZBP4 Os08g0490700 protein n=2 Tax=Oryza sativa RepID=Q6ZBP4_ORYSJ Length = 420 Score = 52.6 bits (124), Expect = 9e-05, Method: Composition-based stats. 
Identities = 82/376 (21%), Positives = 141/376 (37%), Gaps = 11/376 (2%) Query: 123 NTAAAKKSASDASTSAREAATHAADAADSARAASTSAGQAASSAQSASSSAGTASTKATE 182 A K +DAS A+++ AD + + + + A+ S AG A T AT Sbjct: 23 LALAVKDYPADASAVAKKSPASKADTPTTGKESVAGKTDVVTVAK--KSPAGKADTSATY 80 Query: 183 ASKSAAAAESSKSAAATSAGAAKTSETNASASLQSAATSASTATTKASEAATSARDAAAS 242 +AA A++ + A A T T+ ++ A + + A +A TSA Sbjct: 81 KEYAAAKADAVTVTKKSPAAKADTPTTSKESAAGKANAATVAKKSPAGKADTSATATGKE 140 Query: 243 KEAAKSSETNASSSASSAASSATAAGNSAKAAKTSETNARSSETAAGQSASAAAGSKTAA 302 AAK+ + + +A + A G + A + T A+ S +A K AA Sbjct: 141 YAAAKADAITVTKKSPAAKADMPATGKESIAKVDAATVAKES--------TAGKTGKKAA 192 Query: 303 ASSASAASTSAGQASASATAAGKSAESAASSASTATTKAGEATEQASAAARSASAAKTSE 362 A + + + +A A+ A A A + A+ +A A A Sbjct: 193 AKEFTMSGKTNTEADAATVAKKSLAGKAGTPATGKEYAVTKADAATVAKKSPADKTGKES 252 Query: 363 TNAKASETSAESSKTAA-ASSASSAASSASSASASKDEATRQASAAKSSATTASTKATEA 421 AKA + TA +A S +S S + AT++++A K+ TA+ ++T + Sbjct: 253 VVAKADTATVTKKSTAGKTGKKVAAKESPASHKTSMEAATKKSTANKTGTETAAKESTVS 312 Query: 422 AGSATAAAQSKSTAESAATRAETAAKRAEDIASAVALEDASTTKKGIVQLSSATNSTSET 481 + T A +STA + A A+ + S K ++ + T++ Sbjct: 313 GKTDTETAAKESTAPGKTDTTAAVKESTAGKGDAPAMAEKSAAGKAEASAAAKESPTNKA 372 Query: 482 LAATPKAVKSAYDNAE 497 AA Y Sbjct: 373 DAAAAGPTSGGYQYVN 388 >UniRef50_A7A6B3 Putative uncharacterized protein n=1 Tax=Bifidobacterium adolescentis L2-32 RepID=A7A6B3_BIFAD Length = 1651 Score = 52.2 bits (123), Expect = 1e-04, Method: Composition-based stats. 
Identities = 50/151 (33%), Positives = 78/151 (51%) Query: 115 RNASAVAQNTAAAKKSASDASTSAREAATHAADAADSARAASTSAGQAASSAQSASSSAG 174 + A A AK +D + +A A +AA +A+ A +A A +A++A +A Sbjct: 791 ADVEKNANEIAQAKSDIADNAAKTTDAKKAAENAAAAAKTAQGTADTANGAAKTAQDTAN 850 Query: 175 TASTKATEASKSAAAAESSKSAAATSAGAAKTSETNASASLQSAATSASTATTKASEAAT 234 A T A A+ +A A+ + +AA T+A +AK + NA +A SA +A + A+ A T Sbjct: 851 AAQTAAKSATATAGQAKDAANAAQTAAESAKKTAGNAETLANTANESAKSAKSDAASAKT 910 Query: 235 SARDAAASKEAAKSSETNASSSASSAASSAT 265 A +A + A S T A ++A SAA SAT Sbjct: 911 DAANAKTTAANASSVATQAKATADSAAQSAT 941 Score = 48.4 bits (113), Expect = 0.002, Method: Composition-based stats. Identities = 48/153 (31%), Positives = 83/153 (54%), Gaps = 4/153 (2%) Query: 155 ASTSAGQAASSAQSASSSAGTASTKATEASKSAAAAESSKSAAATSAGAAKTSETNASAS 214 +A + A + + +A + A +AAAA++++ A T+ GAAKT++ A Sbjct: 793 VEKNANEIAQAKSDIADNAAKTTDAKKAAENAAAAAKTAQGTADTANGAAKTAQDTA--- 849 Query: 215 LQSAATSASTATTKASEAATSARDAAASKEAAKSSETNASSSASSAASSATAAGNSAKAA 274 +A T+A +AT A +A +A A + E+AK + NA + A++A SA +A + A +A Sbjct: 850 -NAAQTAAKSATATAGQAKDAANAAQTAAESAKKTAGNAETLANTANESAKSAKSDAASA 908 Query: 275 KTSETNARSSETAAGQSASAAAGSKTAAASSAS 307 KT NA+++ A A+ A + +AA SA+ Sbjct: 909 KTDAANAKTTAANASSVATQAKATADSAAQSAT 941 >UniRef50_A6W5H3 Metal dependent phosphohydrolase n=1 Tax=Kineococcus radiotolerans SRS30216 RepID=A6W5H3_KINRD Length = 736 Score = 52.2 bits (123), Expect = 1e-04, Method: Composition-based stats. 
Identities = 56/291 (19%), Positives = 110/291 (37%), Gaps = 3/291 (1%) Query: 215 LQSAATSASTATTKASEAATSARDAAASKEAAKSSETNA-SSSASSAASSATAAGNSAKA 273 Q+ +T+AS A T + A + AA A ++ A + +A+ A+ A+ A + A Sbjct: 340 AQALSTAASVAQTAGGQVAAAVLSAAVGVGAGSAAVAVASTPAAAVEAAPASTASPAEVA 399 Query: 274 AKTSETNARSSETAAGQSASAAAGSKTAAASSASAASTSAGQASASATAAGKSAESAASS 333 A T + + S +++ ++ SSAS+++ S+ A AA + S+ Sbjct: 400 AGTPRAGSATPAPRTPSSPGSSSEGSSSPGSSASSSAASSTLPVFPAAAAPSPSASSIPV 459 Query: 334 ASTATTKAGEATEQASAAARSASAAKTSETNAKASETSAESSKTAAASSASSAASSASSA 393 S+ S + T + + T +++ AA +S ++A +A A Sbjct: 460 LPGPLPLTAGPEASPSSLVPSTTPPARGTTGPRPAATPGAATRPAAPASRTTAP-TAGGA 518 Query: 394 SASKDEATRQASAAKSSATTASTKATEAAG-SATAAAQSKSTAESAATRAETAAKRAEDI 452 SA K + A+ + +T A++ K + ++ + T A + + Sbjct: 519 SAVKPSTGAPKPKPSTGASKPKPSTGASKPKPSTGASKPKPSTGASKPKPSTGASKPKPS 578 Query: 453 ASAVALEDASTTKKGIVQLSSATNSTSETLAATPKAVKSAYDNAEKRLQKD 503 A + ++ T K + SE PK + A E Sbjct: 579 TGASKPKPSTGTPKPKPSTGTPKPKPSEPAGPKPKPSEPAGPKPEPSEPAG 629 >UniRef50_A8HPQ1 Predicted protein n=1 Tax=Chlamydomonas reinhardtii RepID=A8HPQ1_CHLRE Length = 2449 Score = 51.8 bits (122), Expect = 2e-04, Method: Composition-based stats. 
Identities = 80/388 (20%), Positives = 143/388 (36%), Gaps = 24/388 (6%) Query: 134 ASTSAREAATHAADAADSARAASTSAGQAASSAQSASSSAGTASTKATEASKSAAAAESS 193 A+ ++ T A + + AA+ A + +A A +A + A+ Sbjct: 6 ATEASASGLTGAKRRSGAVAAAAPGGVAARAGGGGGPPAAKRAKQEAGTPRAAEASKAQP 65 Query: 194 KSAAATSAGAAKTSETNASASLQSAATSASTATTKASEAA------TSARDAAASKEAAK 247 K+AAA +AG+++ +A+A+ A A +T AA T+ A SK+ A Sbjct: 66 KAAAAAAAGSSRRPPRSAAAAAAVADADAKPESTAPKSAAKSKGKTTAEDAGATSKQPAP 125 Query: 248 SSETNASSSAS-------------SAASSATAAGNSAKAAKTSETNARSSETAAGQSASA 294 +T A ++ + +A ++ K K A ++E A+G++A Sbjct: 126 KGKTTAKAAENDADEDKAADAAQATAVAAKKDDAAGGKKGKAVAKKAGAAEAASGEAAGD 185 Query: 295 AAGSKTAAASSASAASTSAGQASAS---ATAAGKSAESAASSASTATTKAGEATEQASAA 351 G TA A AA A A A A ++ + A A K G A + +AA Sbjct: 186 KNGKATAKAGKGEAAGVKAEAAEPQYSGAADAKRAGGAKGKGAQQAKKKDGAADAETAAA 245 Query: 352 ARSASAAKTSETNAKASETSAESSKTAAASSASSAASSASSASASKDEATRQASAAKSSA 411 S + E + +++ + K+EA S A ++ Sbjct: 246 TTSEAEEVKPEPGEPKPQGKRKAAAASGKKGKGHGQEEVVEPKEEKEEAVAGPSTAAAAD 305 Query: 412 TTASTKATEAAGSATAAAQSKSTAESAATRAETAAKRAEDIASAVALEDASTTKKGIVQL 471 + A+ ++TAAA K A R AA + A + + AS + Sbjct: 306 SQATDG--GKPAASTAAASEKQAASGKGKRGAAAADAKKPAARTPSRKTASASAASKDNA 363 Query: 472 SSATNSTSETLAATPKAVKSAYDNAEKR 499 ++A ++ T A ++AE + Sbjct: 364 AAAGSNKEATSPGEGVKTAEATEDAEMK 391 >UniRef50_C1E1Z7 Predicted protein n=1 Tax=Micromonas sp. RCC299 RepID=C1E1Z7_9CHLO Length = 767 Score = 51.8 bits (122), Expect = 2e-04, Method: Composition-based stats. 
Identities = 74/468 (15%), Positives = 144/468 (30%), Gaps = 38/468 (8%) Query: 91 GAMTEDDARPEALRRFELMVEEVARNASAVAQNTAAAKKSASDASTSAREAATHAADAAD 150 GA +DDA + + + + A N K DA+ R+ A + Sbjct: 178 GASEDDDAAAKRAKSGDDDSPSATKKLFADDTNALEEAKQLEDANAKLRD---ELKAAKE 234 Query: 151 SARAASTSAGQAASSAQSASSSAGTASTKATEASKSAAAAESSKSAAATSAGAAKTSETN 210 + + + SA++ + A A + K+ A ++ + + + AA ++ Sbjct: 235 TIEERDRTIAEHIESAETTKAQYEEALAAAAKEHKAKLAELTADAKSHDANVAAMSASCE 294 Query: 211 ASASLQSAATSASTATTKASEAATSARDAAASKEAAKSSETNASSSASSAASSATAAGNS 270 A+ + +A A E A A+ + + +A++AA+ A G+ Sbjct: 295 AAIKSKEEMQAALDRAVAAREGAEKNAAETAAALEEVKAALKDAEAAAAAAAEKAAEGSK 354 Query: 271 AKAAKTSETNARSSETAAGQSASAAAGSKTAAASSASAASTSAGQASASATAAGKSAESA 330 A+ + A+ E + AA + A+ A A A+ Sbjct: 355 ELEAENASLRAKLEEERTTTAEKAAEIREAREKLEATEA---------DLAVAKNDADQL 405 Query: 331 ASSASTATTKAGEATEQASAAARSASAAKTSETNAKASETSAESSKTAAASSASSAASSA 390 A + A A ++A A A+ + SE A+ E A A + A + Sbjct: 406 AIKLAEAKASLAAAVKRADEADTCAAPLRASENEARRVCARKEEEAKAMADRLAVARMAH 465 Query: 391 SSASASKDE-------------------------ATRQASAAKSSATTASTKATEAAGSA 425 A A+ E + A A A AG+ Sbjct: 466 KQAMAALKEVGLRLEYDGTGAAAAEADAEATEAEPNTEPPTMDDGAEDAPMDADGVAGAT 525 Query: 426 TAAAQSKSTAESAATRA-ETAAKRAEDIASAVALEDASTTKKGIVQLSSATNSTSETLAA 484 AA T ++ ET + + V ++ + S+ + SE Sbjct: 526 QAAPGDFETEDAIDEEPRETENEGDRIVGMDVLYASVPASEVPASDVPSSMDDGSEDEPV 585 Query: 485 TPKAVKSAYDNAEKRLQKDQNGADIPDKGCFLNNINAVSKTDFADKRG 532 + + + A + +++ A ++ A G Sbjct: 586 PDTELAAPAEGASPVAAMAPPSPREQPESAARHSVGAATQEAAAVLGG 633 >UniRef50_UPI0001A5E657 PREDICTED: hypothetical protein n=8 Tax=Homo sapiens RepID=UPI0001A5E657 Length = 1704 Score = 51.5 bits (121), Expect = 2e-04, Method: Composition-based stats. 
Identities = 58/372 (15%), Positives = 135/372 (36%) Query: 114 ARNASAVAQNTAAAKKSASDASTSAREAATHAADAADSARAASTSAGQAASSAQSASSSA 173 A + + + + S + + + + S + + +A S + +S A Sbjct: 32 ASETTTASTAGSETTTPSPTGSQTTIVSISGSEITTTSTAGSENTTVSSAGSGTTTASMA 91 Query: 174 GTASTKATEASKSAAAAESSKSAAATSAGAAKTSETNASASLQSAATSASTATTKASEAA 233 G+ +T +T S++ + + SA ++T+ + ++S + ++A + TT S Sbjct: 92 GSETTVSTAGSETTTVSITGTETTMVSAMGSETTTNSTTSSETTVTSTAGSETTTVSTVG 151 Query: 234 TSARDAAASKEAAKSSETNASSSASSAASSATAAGNSAKAAKTSETNARSSETAAGQSAS 293 + A + ++ T S + + + S ++T+ + SET + Sbjct: 152 SETTTAYTADSETTAASTTGSEMTTVFTAGSETITPSTAGSETTTVSTAGSETTTVSTTG 211 Query: 294 AAAGSKTAAASSASAASTSAGQASASATAAGKSAESAASSASTATTKAGEATEQASAAAR 353 + + + A S +AAST + + +TA ++ S A S +TA + T A Sbjct: 212 SETTTASTAHSETTAASTMGSETTKVSTAGSETTVSTAGSETTAASTEDSETNTAFTEDS 271 Query: 354 SASAAKTSETNAKASETSAESSKTAAASSASSAASSASSASASKDEATRQASAAKSSATT 413 + A T+ A+ T+ A+ + + +S +K +A + Sbjct: 272 KTTTASTTGFETTAASTTGSEPTMASTMGSETTMASTIGPETTKVSTASSEVTTVFAAGS 331 Query: 414 ASTKATEAAGSATAAAQSKSTAESAATRAETAAKRAEDIASAVALEDASTTKKGIVQLSS 473 + +A+ T + + S +A+ + + + + S Sbjct: 332 ETIRASTVGSETTTVSTTGSETTTASIMGSETSTDSTTGSETTTASTEGSETTTASTEGS 391 Query: 474 ATNSTSETLAAT 485 + S T + T Sbjct: 392 EATTVSTTGSET 403 Score = 51.5 bits (121), Expect = 2e-04, Method: Composition-based stats. 
Identities = 53/348 (15%), Positives = 127/348 (36%) Query: 125 AAAKKSASDASTSAREAATHAADAADSARAASTSAGQAASSAQSASSSAGTASTKATEAS 184 + ++ S + + T + S + T+A S + S+AG+ +T +T S Sbjct: 192 SETTTVSTAGSETTTVSTTGSETTTASTAHSETTAASTMGSETTKVSTAGSETTVSTAGS 251 Query: 185 KSAAAAESSKSAAATSAGAAKTSETNASASLQSAATSASTATTKASEAATSARDAAASKE 244 ++ AA+ +KT+ + + +AA++ + T AS + A+ Sbjct: 252 ETTAASTEDSETNTAFTEDSKTTTASTTGFETTAASTTGSEPTMASTMGSETTMASTIGP 311 Query: 245 AAKSSETNASSSASSAASSATAAGNSAKAAKTSETNARSSETAAGQSASAAAGSKTAAAS 304 T +S + A+ + S ++T+ + SET + + + S Sbjct: 312 ETTKVSTASSEVTTVFAAGSETIRASTVGSETTTVSTTGSETTTASIMGSETSTDSTTGS 371 Query: 305 SASAASTSAGQASASATAAGKSAESAASSASTATTKAGEATEQASAAARSASAAKTSETN 364 + AST + + ++T ++ + + + T T ++ + S A ++ Sbjct: 372 ETTTASTEGSETTTASTEGSEATTVSTTGSETTTVSITDSETTTTCTEGSEMTAVSTTVF 431 Query: 365 AKASETSAESSKTAAASSASSAASSASSASASKDEATRQASAAKSSATTASTKATEAAGS 424 + ++ S T A++S S ++++ S + T + + T + T G Sbjct: 432 ETTTASTEGSEITIASTSDSETTTASTEGSETTTVTTAGSETKTAYTTGSETTTASNTGL 491 Query: 425 ATAAAQSKSTAESAATRAETAAKRAEDIASAVALEDASTTKKGIVQLS 472 T + + + A+ + S + ++ V + Sbjct: 492 ETTTVFTIGSDTTTASTEGSETTAVSATGSEMTTVSTEGSENTTVSTT 539 Score = 48.8 bits (114), Expect = 0.001, Method: Composition-based stats. 
Identities = 66/391 (16%), Positives = 139/391 (35%), Gaps = 1/391 (0%) Query: 142 ATHAADAADSARAASTSAGQAASSAQSASSSAGTASTKATEASKSAAAAESSKSAAATSA 201 + A +A + +T+A S + A +T +T S++ + + S Sbjct: 1 MSSETTVAPAAGSNTTTASTTGSETTTILIKASETTTASTAGSETTTPSPTGSQTTIVSI 60 Query: 202 GAAKTSETNASASLQSAATSASTATTKASEAATSARDAAASKEAAKSSETNASSSASSAA 261 ++ + T+ + S + +SA + TT AS A + + A E S T ++ SA Sbjct: 61 SGSEITTTSTAGSENTTVSSAGSGTTTASMAGSETTVSTAGSETTTVSITGTETTMVSAM 120 Query: 262 SSATAAGNSAKAAKTSETNARSSETAAGQSASAAAGSKTAAASSASAASTSAGQASASAT 321 S T ++ + T + A SET + + + A S +AAST+ + + T Sbjct: 121 GSETTTNSTTSSETTVTSTA-GSETTTVSTVGSETTTAYTADSETTAASTTGSEMTTVFT 179 Query: 322 AAGKSAESAASSASTATTKAGEATEQASAAARSASAAKTSETNAKASETSAESSKTAAAS 381 A ++ + + + T T + + S + ++ + + ++ S T ++ Sbjct: 180 AGSETITPSTAGSETTTVSTAGSETTTVSTTGSETTTASTAHSETTAASTMGSETTKVST 239 Query: 382 SASSAASSASSASASKDEATRQASAAKSSATTASTKATEAAGSATAAAQSKSTAESAATR 441 + S S + + + + + + +T A+ TAA+ + S A+T Sbjct: 240 AGSETTVSTAGSETTAASTEDSETNTAFTEDSKTTTASTTGFETTAASTTGSEPTMASTM 299 Query: 442 AETAAKRAEDIASAVALEDASTTKKGIVQLSSATNSTSETLAATPKAVKSAYDNAEKRLQ 501 + + AS+ + S T S + T + + + Sbjct: 300 GSETTMASTIGPETTKVSTASSEVTTVFAAGSETIRASTVGSETTTVSTTGSETTTASIM 359 Query: 502 KDQNGADIPDKGCFLNNINAVSKTDFADKRG 532 + D S+T A G Sbjct: 360 GSETSTDSTTGSETTTASTEGSETTTASTEG 390 Score = 46.1 bits (107), Expect = 0.008, Method: Composition-based stats. 
Identities = 54/376 (14%), Positives = 127/376 (33%), Gaps = 6/376 (1%) Query: 108 LMVEEVARNASAVAQNTAAAKKSASDASTSAREAATHAADAADSARAASTSAGQAASSAQ 167 V + + S D+ T+ A + +T+A S Sbjct: 235 TKVSTAGSETTVSTAGSETTAASTEDSETNTAFTEDSKTTTASTTGFETTAASTTGSEPT 294 Query: 168 SAS------SSAGTASTKATEASKSAAAAESSKSAAATSAGAAKTSETNASASLQSAATS 221 AS + A T + T+ S +++ + +A + + A+ + S + T+ Sbjct: 295 MASTMGSETTMASTIGPETTKVSTASSEVTTVFAAGSETIRASTVGSETTTVSTTGSETT 354 Query: 222 ASTATTKASEAATSARDAAASKEAAKSSETNASSSASSAASSATAAGNSAKAAKTSETNA 281 ++ + ++ + S T AS+ S A + +T + + T Sbjct: 355 TASIMGSETSTDSTTGSETTTASTEGSETTTASTEGSEATTVSTTGSETTTVSITDSETT 414 Query: 282 RSSETAAGQSASAAAGSKTAAASSASAASTSAGQASASATAAGKSAESAASSASTATTKA 341 + + +A + +T AS+ + T A + + T A + + + Sbjct: 415 TTCTEGSEMTAVSTTVFETTTASTEGSEITIASTSDSETTTASTEGSETTTVTTAGSETK 474 Query: 342 GEATEQASAAARSASAAKTSETNAKASETSAESSKTAAASSASSAASSASSASASKDEAT 401 T + S + +T+ S+T+ S++ + ++ S+ S ++ S E T Sbjct: 475 TAYTTGSETTTASNTGLETTTVFTIGSDTTTASTEGSETTAVSATGSEMTTVSTEGSENT 534 Query: 402 RQASAAKSSATTASTKATEAAGSATAAAQSKSTAESAATRAETAAKRAEDIASAVALEDA 461 ++ + T ++T S + + + A T ++ AS E Sbjct: 535 TVSTTGSETTTVSTTGLETTTTSTEGSEMTTVSTTGAETTTDSTEGSGTTAASTAGSETT 594 Query: 462 STTKKGIVQLSSATNS 477 + + +++T Sbjct: 595 TVSTADSENTTASTAD 610 >UniRef50_B5EEN8 Flagellar hook-length control protein n=2 Tax=Geobacter RepID=B5EEN8_GEOBB Length = 660 Score = 51.5 bits (121), Expect = 2e-04, Method: Composition-based stats. 
Identities = 60/368 (16%), Positives = 105/368 (28%), Gaps = 10/368 (2%) Query: 126 AAKKSASDASTSAREAATHAADAADSARAASTSAGQAASSAQSASSSAGTASTKATEASK 185 A KS A A A AA A + + A++ + + + Sbjct: 113 QALKSDLAAKPEQGTAPAGAETAAAEVATAEVATAEVATAGKILPEMETVTTDGERDEEA 172 Query: 186 SAAAAESSKSAAATSAGAAKTSETNASASLQSAATSASTATTKASEAATSARDAAASKEA 245 AA + +A AA A+A Q A EAA R A A Sbjct: 173 PVQAASPKPAGQEMAAAAATQPAPKAAAVEQGAVPRGLEVAAGKVEAARERRGTEAQVAA 232 Query: 246 AKSSETNASSSASSAASSATAAGNSAKAAKTSETNARSSETAAGQSASAAAGSKTAAASS 305 ++ T A A+ A A E Q ++ + A Sbjct: 233 GANALTQLDELQQKALERQAVERPEAQIAAAQAEKAVLPEAGQPQQPASTKLGQKA---- 288 Query: 306 ASAASTSAGQASASATAAGKSAESAASSASTATTKAGEATEQASAAARSASAAKTSETNA 365 A A A AA + + G A A + + Sbjct: 289 -----EQAELPQQKGEAVASGATEAARMEESNPAQPGAKGAAAGFVATPVNGSVRESLTQ 343 Query: 366 KASETSAESSKTAAASSASSAASSASSASASKDEATRQASAAKSSATTASTKATEAAGSA 425 A E + ++ A+ A+ A+ + + + A AK+ + A A Sbjct: 344 TAPEAAQQAIHADEAADANKEQQKAAGNPGTAEVTAQAAPKAKAQGEVIHPQQQGATPEA 403 Query: 426 TAAAQSKSTAESAATR-AETAAKRAEDIASAVALEDASTTKKGIVQLSSATNSTSETLAA 484 +++T +A R + ++ + A L+ + + A Sbjct: 404 PRPEAAQATERTAQRRDLQHGDEKHVPVQGADNAGQPEAASAAAKDLTGTKGAPVVSAAI 463 Query: 485 TPKAVKSA 492 TP+ ++ A Sbjct: 464 TPEQLRGA 471 >UniRef50_A0NYB8 Possible OmpA family member n=1 Tax=Labrenzia aggregata IAM 12614 RepID=A0NYB8_9RHOB Length = 628 Score = 51.1 bits (120), Expect = 2e-04, Method: Composition-based stats. 
Identities = 49/279 (17%), Positives = 92/279 (32%), Gaps = 7/279 (2%) Query: 189 AAESSKSAAATSAGAAKTSETNASASLQSAATSASTATTKASEAATSARDAAASKEAAKS 248 + ++ A A + +A S S A A ++ +AA + + Sbjct: 14 SPSRAQETAVDCAANPDQEACRTEEAAPAAEASTSEAP-----AEPASEEAAPEAPSEAT 68 Query: 249 SETNASSSASSAASSATAAGNSAKAAKTSETNARSSETAAGQSASAAAGSKTAAASSASA 308 E ++ + + A + A+ +E A E A AA +TA A Sbjct: 69 QEAAPEAAPEAGQDAEPAEAIETQEAQPAEKPAAPVEEATPAEKPAAPVEETAPAEKPQN 128 Query: 309 ASTSAGQASASATAAGKSAESAASSASTATTKAGEATEQASAAARSASAAKTSETNAKAS 368 + + +A+ A ++ A+ A + S A +A + A+ Sbjct: 129 TEETTTEETAAPEAKPEAPVEEAAPAEEPAVEPSSEQTNTSPEAGAAEQPAAEQPADAAA 188 Query: 369 ETSAESSKTAAASSASSAASSASSASASKDEATRQASAAKSSATTASTKATEAAGSATAA 428 T A ++T+ S + A + A+ + A ++ A+ A A A Sbjct: 189 STEAGQAETS--ESPETQADAPKPATEEQPADAAAAEEPAAAEEPAAASAEPAPEEAAPQ 246 Query: 429 AQSKSTAESAATRAETAAKRAEDIASAVALEDASTTKKG 467 S +AET A A A ++ T + G Sbjct: 247 QTDAEQKTSDPAQAETLAAPDAAPVEATAADETVTEEPG 285 >UniRef50_A4R2V7 Predicted protein n=1 Tax=Magnaporthe grisea RepID=A4R2V7_MAGGR Length = 547 Score = 51.1 bits (120), Expect = 3e-04, Method: Composition-based stats. Identities = 38/159 (23%), Positives = 70/159 (44%), Gaps = 11/159 (6%) Query: 191 ESSKSAAATSAGAAKTSETNASASLQSAATSASTATTKASEAATSARDAAASKEAAKSSE 250 A A A+ S A + A A A+ +A +A AR A+ S AA+ S Sbjct: 186 SQELQRAQDQARQAQESARQAGEQARQNADQARQASEQARQAQDQARRASDSASAAQRSA 245 Query: 251 TNASSSASSAASSATAAGNSAKAAKTSETNARSSETAA-----------GQSASAAAGSK 299 +A S A ++ SS+ +A ++ A + + A S+ + Q+AS + ++ Sbjct: 246 DSAVSIAQASISSSASAQIASNLASATASAAASAASIIAAARSSANQLMDQAASEVSVAR 305 Query: 300 TAAASSASAASTSAGQASASATAAGKSAESAASSASTAT 338 A S+ + AST QA +A + ++A + + ++ Sbjct: 306 AEATSARAEASTQISQAQGAAVSVTQAALAVVGTFIGSS 344 >UniRef50_A4F6R6 Putative uncharacterized protein n=1 Tax=Saccharopolyspora erythraea NRRL 2338 RepID=A4F6R6_SACEN Length = 1264 Score = 51.1 bits (120), Expect = 3e-04, Method: Composition-based stats. 
Identities = 68/362 (18%), Positives = 133/362 (36%), Gaps = 10/362 (2%) Query: 107 ELMVEEVARNASAVAQNTAAAKKSASDASTSAREAATHAADAADSARAASTSAGQAASSA 166 + E+ +A++ +A ++ ++ +A T A + AD+A + T+ A Sbjct: 254 DASEEQPTESAASEDAGSAEDGEADESSAGEREDAQTAAGEDADAADQSVTADDSTADET 313 Query: 167 QSASSSAGTASTKATE--ASKSAAAAESSKSAAATSAGAAKTSETNASASLQSAATSAST 224 ++A AG + T +T+ A + + S A +E+ A + S A Sbjct: 314 EAADDEAGESETSSTDTVALGTVQPESDEDGETSESGTAEWEAESEAGETEASETGPAED 373 Query: 225 ATTKASEAATSARDAAASKEAAKSSETNASSSASSAASSATAAGNSAKAAKTSETNARSS 284 +++A A + A + A S+A+ +SAT + + +++SE + + Sbjct: 374 ESSEAEVADEAEAAEEAEAADTAAEPETADSAAAE-TTSATESDSPEPESESSEPESDEA 432 Query: 285 ETAAGQSASAAAGSKTAAASSASAASTSAGQASASATAAGKSAESAASSASTATTKAGEA 344 ET A + AA A + +A A + AE+ A+ TA A Sbjct: 433 ET-------AEPTPDSEAAEDEPAQAEAAPSPEEPEAVADEDAETEAAGTETAAEAGTAA 485 Query: 345 TEQASAAARSASAAKTSETNAKASETSAESSKTAAASSASSAASSASSASASKDEATRQA 404 A +E + AE ++ +A +A A +A +A Sbjct: 486 ERDADEWPTDEHERVEAEPETREPVEQAEEAEESAEPAAEEPAQAAVEEPGLTAPGEPEA 545 Query: 405 SAAKSSATTASTKATEAAGSATAAAQSKSTAESAATRAETAAKRAEDIASAVALEDASTT 464 + A+ TA A + Q+ +E AA + A + + +A T Sbjct: 546 APAEPEQATAPEAEQAAPAEPEQSEQAPEESEQAADQQAPAVEPEQQRPAAPPERSVEQT 605 Query: 465 KK 466 ++ Sbjct: 606 QQ 607 >UniRef50_B2B4U0 Predicted CDS Pa_2_2450 n=3 Tax=cellular organisms RepID=B2B4U0_PODAN Length = 2782 Score = 50.7 bits (119), Expect = 3e-04, Method: Composition-based stats. 
Identities = 59/362 (16%), Positives = 106/362 (29%), Gaps = 4/362 (1%) Query: 107 ELMVEEVARNASAVAQNTAAAKKSASDASTSAREAATHAADAADSARAASTSAGQAASSA 166 + + A S K + AA + A++A Sbjct: 150 DSLAPAAAEPTSEAPATEEQPKVKEVKERSLEALAAESKQNEEPRTEEKVEETSAPAAAA 209 Query: 167 QSASSSAGTASTKATEASKSAAAAESSKSAAATSAGAAKTSETNASASLQSAATSASTAT 226 + + T + + + A E + T A ++ AA+S Sbjct: 210 EPTPVAEVTGTKETEAPAAEVATPEVKEEEKPTEDDAPAAEAAAPVEKVEDAASSTPAPE 269 Query: 227 TKASEAATSARDAAASKEAAKSSETNASSSASSAASSATAAGNS---AKAAKTSETNARS 283 +A+E A S A + A + ++ S + A A +E A Sbjct: 270 PEAAEEAASTPAADPTPAEAAAEVKEEPAAESKSEDVAEAPAAETVVEAEKSFAEVAAPD 329 Query: 284 SETAAGQSASAAAGSKTAAASSASAASTSAGQASASATAAGKSAESAASSASTATTKAGE 343 A A A + A A K+ E +A A Sbjct: 330 VTEGAAPVVIDLKSEPEAEKPQEETKEVGAFKVEEPANEAEKTEEESAPVADATPEPTPV 389 Query: 344 ATEQASAAARSASAAKTSETNAKASETSAESSKTAAASSASSAASSASSASASKDEATRQ 403 + + A + KT+E + + + E T A + + A+ E + Sbjct: 390 EQSTPAEDVKPAESEKTAEPEVEPTPETKEEPATQAKEEPVAEPVPEAKEEAAAAEPVSE 449 Query: 404 ASAAKSSATTASTKATEAAGSATAAAQSKSTAESAATRAETAAKRAEDIASAVALEDAST 463 A + A + + +A E A +SK + + ETA + E+ V E A Sbjct: 450 AKEEPAVAESVA-EAKEEPTVAELVTESKDEPVAEVAKEETAVETKEEATPDVKEEVAPE 508 Query: 464 TK 465 T+ Sbjct: 509 TE 510 >UniRef50_B2WBM9 Putative uncharacterized protein n=1 Tax=Pyrenophora tritici-repentis Pt-1C-BFP RepID=B2WBM9_PYRTR Length = 699 Score = 50.3 bits (118), Expect = 4e-04, Method: Composition-based stats. 
Identities = 74/434 (17%), Positives = 136/434 (31%), Gaps = 28/434 (6%) Query: 96 DDARPEALRRFELMVEEVARNASAVAQNTAAAKKSASDASTSAREAATHAADAA-----D 150 +D +P ++ +SA AQ K + A A DAA Sbjct: 237 EDDKPAVDGAAAEHEAAISDESSAAAQPAENPDKPVTGAQEEATLKPAAPTDAAAVSGET 296 Query: 151 SARAASTSAGQAASSAQSASSSAGTASTKATEASKSAAAAESSKSAAATSAGAA------ 204 + A S+ + S + + GT +TK E K A + + + A SA + Sbjct: 297 ATLAPPASSTETPKPTASKAKTNGTPATKKQEIKKPATISTTKAAKAPISAAKSPLPKAA 356 Query: 205 ----------KTSETNASASLQSAATSASTATTKASEAATSARDAAASKEAAKSSETNAS 254 + S A S A + AS+ A A ++ A Sbjct: 357 PKTPPKPKAATPAAPAPKPSPSKVARPVSQAKSTASKEPVKAPVTKAPALKTPAAAAPAK 416 Query: 255 SSASSAASSATAAGNSAKAAKTSETNARSSETAAGQSASAAAGSKTAAASSASAASTSAG 314 ++ ++ A+ A + + ++ + +A TA S + Sbjct: 417 KTSRTSLRPPVASSAPAPTSSAAAKPKAAAPAPENKKPAAPKPIATAPRKRHVCTSPTGF 476 Query: 315 QASASATAAGKSAESAASSASTATTKAGEATEQASAAARSASAAKTSETNAKASETSAES 374 + A + + +A TA + A EQA+ + + T KAS S Sbjct: 477 KKPAPKSPTRPVGLPSRLTAPTAASAAKHGEEQATKKPATTTRPATKIAAPKASRPSVAP 536 Query: 375 SKTAAASSASSAASSASSA-------SASKDEATRQASAAKSSATTASTKATEAAGSATA 427 + T A S S+ASSA + A + + K A A Sbjct: 537 TATTATKRPESRTSTASSAPKGGFLERMMRPTAASSSKTHEKPHEKPQDKPASPPRKAPA 596 Query: 428 AAQSKSTAESAATRAETAAKRAEDIASAVALEDASTTKKGIVQLSSATNSTSETLAATPK 487 ++ + + E AAK + + + E + + + + T+E Sbjct: 597 GSKPSTLQKGKKKVEEVAAKVKDAVTNGHDDEHKTEDEHKSEGEAVKESDTTEQAIEGAT 656 Query: 488 AVKSAYDNAEKRLQ 501 A ++ + A+ Sbjct: 657 AEETPQEPAKAATP 670 >UniRef50_B6IFW9 Putative uncharacterized protein n=1 Tax=Caenorhabditis briggsae RepID=B6IFW9_CAEBR Length = 806 Score = 50.3 bits (118), Expect = 4e-04, Method: Composition-based stats. 
Identities = 73/404 (18%), Positives = 144/404 (35%), Gaps = 17/404 (4%) Query: 131 ASDASTSAREAATHAADAADSARAASTSAGQAASSAQSASSSAGTASTKATEASKSAAAA 190 DA + A HA+DAAD+A+ A ++A + + + + + + A + Sbjct: 162 NEDAKENMDAAKEHASDAADTAKEGVKDTASAINNAFESLT--ESVADDSAHKTGVAFSD 219 Query: 191 ESSKSAAATSAGAAKTSETNASASLQSAATSASTATTKASEAATSARDAAASKEAAKSSE 250 + K A +AK + ++A+ ++ ++A A ++ + D A ++ ++ E Sbjct: 220 GAKKEEAKEHVESAKDAASDAAIEVKHDVKDTASAINNAFDSMKN-DDVAPQQKEEEAKE 278 Query: 251 TNASSSASSAASSATAAGNSAKAAKTSETNARSSETAAGQSASAAAGSKTAAA------S 304 + S+ AG++ + K +NA S+ T S + + T Sbjct: 279 EVKEEAKGYLESAKETAGDAVETVKEQASNAGSAITNVISSLAESITGDTTHKTGEAFND 338 Query: 305 SASAASTSAGQASASATAAGKSAESAASSASTATTKAGEA-----TEQASAAARSASAAK 359 + A+ + A A + A +A S+ S T E+ T + + + A Sbjct: 339 AKKEAAEFSESVKGHAETAQEHASNAGSTLSNVLTSLKESIVGDTTHKTGEVLQDETEAL 398 Query: 360 TSETNAKASETSAESSKTAAASSASS--AASSASSASASKDEATRQASAAKSSATTASTK 417 S + K + + A + S A A K+ A +AK A +K Sbjct: 399 QSNVDEKYTAVNVAPLGDAGKEKVEEIMHGDAPSYAEAVKEHLDETADSAKEKAEETGSK 458 Query: 418 ATEAAGSATAAAQSKSTAESAATRAETAAKRAEDIASAVALEDASTTKKGIVQLSSATNS 477 ++A S + +T ++ E A + + SA + S TNS Sbjct: 459 VSDALTSVAQSISDDTTHKTGGALNEAADTAKDTLESAKNKAS-DVVDSVKERASEGTNS 517 Query: 478 TSETLAATPKAVKSAYDNAEKRLQKDQNGADIPDKGCFLNNINA 521 E +A K + ++ +D P + A Sbjct: 518 VEEQFVFGKEAAKEKAEEVVDTAKEAVASSDTPFTDDIAHQTGA 561 >UniRef50_B0W468 Papilin n=4 Tax=Coelomata RepID=B0W468_CULQU Length = 2472 Score = 50.3 bits (118), Expect = 4e-04, Method: Composition-based stats. 
Identities = 76/401 (18%), Positives = 173/401 (43%), Gaps = 20/401 (4%) Query: 77 VYEDSQPGTLNDFLGAMTEDDARPEALRRFELMVEEV--ARNASAVAQNTAAAKKSASDA 134 + +S GT ++ D+ FE +V + + S V + D Sbjct: 674 IDMESSQGTTE---SSLETDEMMLSDATGFETGATDVDTSTDVSTVEGSGMDIDIRVDDL 730 Query: 135 STSAREAATHAADAADSARAASTSAGQAASSAQSASSSAGTASTKATEASKSAAAAESSK 194 S +A+T A ++ S ST A S+ + SS T S+ ++ S A++AE S Sbjct: 731 EGSGDDASTIADSSSSSVEVFSTDADSTGSTPDALSSIKETESSASSSDSTDASSAEGSS 790 Query: 195 SAAATSAGAAKTSE-------TNASASLQSAATSASTATTKASEAATSARDAAASKEAAK 247 + +AT + A + T AS+S++S+++ ++ +S ATS ++ + Sbjct: 791 TQSATESSALSSESSISAEATTEASSSVESSSSPLDASSDSSSTDATSESSSSDVTSESS 850 Query: 248 SSETNASSSASSAASSATAAGNSAKAAKTSETNARSSETAAGQSASAAAGSKTAAASSAS 307 +S+ SS S + +++ S+ + + ++ +S A+ +S A S S++ Sbjct: 851 TSDATTESSTSESTDASSTTDASSLSDTSESSSTSASSEASTDGSSTDASSADEITESST 910 Query: 308 AASTSAGQASASATAAGKSAESAASSASTATTKAGEATEQASAAARSASAAKTSETNAK- 366 S S+ +S+ + ++ES+ A +++ + +AT+ + S + +++S T+A Sbjct: 911 DVSESSSDSSSLDVSTESASESSTVDAESSSDASTDATDTSDVTESSDATSESSGTDATE 970 Query: 367 ASETSAESSKTAAASSASSAASSASSASASKDEATRQASAAKSSATTASTKATEAAGSAT 426 ++++ + T A + S+ SS++ A +++ + + ++ S T AS +TE++ Sbjct: 971 ETQSTGPTESTVAEETVSTVESSSTEAVSTESSVSDASESSTESVTGASDMSTESSTDVE 1030 Query: 427 AA-------AQSKSTAESAATRAETAAKRAEDIASAVALED 460 ++ + S + + + + Sbjct: 1031 SSTFDIWQRGGDDDESSSTPYTLTSIIAKEQKPSKCKPRPK 1071 >UniRef50_Q9RSJ1 Putative uncharacterized protein n=1 Tax=Deinococcus radiodurans RepID=Q9RSJ1_DEIRA Length = 528 Score = 50.3 bits (118), Expect = 4e-04, Method: Composition-based stats. 
Identities = 35/207 (16%), Positives = 59/207 (28%) Query: 115 RNASAVAQNTAAAKKSASDASTSAREAATHAADAADSARAASTSAGQAASSAQSASSSAG 174 RNA + + T A AA + A +A ++A Sbjct: 67 RNAVSTIAQADQLRPQIEALRTEVGTVQGELRAARTEREAARSEAQKAGQEREAARQELA 126 Query: 175 TASTKATEASKSAAAAESSKSAAATSAGAAKTSETNASASLQSAATSASTATTKASEAAT 234 A A + A T A Q++ + + + Sbjct: 127 AARQNLASAQQEQARLTKQAQDLQTRLKTLAEQRRQLEAQAQASREKLQASQKQLQASED 186 Query: 235 SARDAAASKEAAKSSETNASSSASSAASSATAAGNSAKAAKTSETNARSSETAAGQSASA 294 A + K A A +A + A AA + + A+++ AA A+ Sbjct: 187 RATQLDSQVLDLKLRSAQAEQEAQNAQTRANAAQARTEELQRRAAAAQATAQAAQTRAAQ 246 Query: 295 AAGSKTAAASSASAASTSAGQASASAT 321 A+ A++ A A QA A Sbjct: 247 ASQKAQQASARAEQVREQARQAQRRAE 273 >UniRef50_B1VNN7 Putative uncharacterized protein n=1 Tax=Streptomyces griseus subsp. griseus NBRC 13350 RepID=B1VNN7_STRGG Length = 2431 Score = 49.5 bits (116), Expect = 7e-04, Method: Composition-based stats. Identities = 107/379 (28%), Positives = 172/379 (45%), Gaps = 8/379 (2%) Query: 110 VEEVARNASAVAQNTAAAKKSASDASTSAREAATHAADAADSARAASTSAGQAASSAQSA 169 +A+ + A A+ A+ A A A AD+AR A + Sbjct: 334 QNAARTKVAALVVSAKAQAAKAATAAQKAATAQQEAWAVADAARTPRGRGLMYAQQSVQV 393 Query: 170 SSSAGTASTKATEASKSAAAAESSKSAAATSAGAAKTSETNASASL---QSAATSASTAT 226 + ++ A+ A +A+++A AA ++ A A + A ++++A ++ +A +AS A Sbjct: 394 ARASAAATAAAAKATETALAAANATIADADALLAKAQTDSHAISTEFRRVAAEEAASQAK 453 Query: 227 TKASEAATSARDAAASKEAAKSSETNASSSASSAASSATAAGNSAKAAKTSETNARSSET 286 A A +A+ AAAS + AK + T A A +AT A + A+ + NA +S Sbjct: 454 AAADSAEANAQGAAASAKRAKDARTTAEQKRDKAEDAATTAASERAKAQAEKANAVASRA 513 Query: 287 AAG---QSASAAAGSKTAAASSASAASTSAGQASASATAAGKSAESAASSASTATTKA-- 341 A + A A ++A +A T+A ASA ATA K A A +A TA KA Sbjct: 514 EAAIEREKAQKAQQRAATEQTTAKSADTAAETASADATAKRKLATEKAKAAQTAREKAVA 573 Query: 342 GEATEQASAAARSASAAKTSETNAKASETSAESSKTAAASSASSAASSASSASASKDEAT 401 +QA+AA +A A + A+ ++ TAA ++AS AA ++++A + D+AT Sbjct: 574 ATQAKQATAARAAALEAAATAAAGTAAAAETRAAATAARTAASQAAQASTAAQTAADQAT 633 Query: 402 
RQASAAKSSATTASTKATEAAGSATAAAQSKSTAESAATRAETAAKRAEDIASAVALEDA 461 A A+S+ATTA A A +A A + T+ AA+ A AA A D + AL Sbjct: 634 TAAVGARSAATTAEGAARRAEANAGKAWSAYRTSLGAASSAHAAAAVALDASQDAALRAE 693 Query: 462 STTKKGIVQLSSATNSTSE 480 + A + E Sbjct: 694 NAATASQNATKLAEKAEQE 712 Score = 42.6 bits (98), Expect = 0.095, Method: Composition-based stats. Identities = 64/286 (22%), Positives = 117/286 (40%), Gaps = 2/286 (0%) Query: 88 DFLGAMTEDDARPEALRRFELMVEEVARNASAVAQNTAAAKKSASDASTSAR-EAATHAA 146 D + A+ + + L + + A + A A+ +AR + A Sbjct: 287 DMIEAIRQAWIVEQILTWRKYWQDAAANGIDMSDKPDQAFYDKATADQNAARTKVAALVV 346 Query: 147 DAADSARAASTSAGQAASSAQSASSSAGTASTKATEASKSAAAAESSKSAAATSAGAAKT 206 A A A+T+A +AA++ Q A + A A T A + + A+ +A AA Sbjct: 347 SAKAQAAKAATAAQKAATAQQEAWAVADAARTPRGRGLMYAQQSVQ-VARASAAATAAAA 405 Query: 207 SETNASASLQSAATSASTATTKASEAATSARDAAASKEAAKSSETNASSSASSAASSATA 266 T + + +A + + A ++ + A + AA+ + + A ++A SA ++A Sbjct: 406 KATETALAAANATIADADALLAKAQTDSHAISTEFRRVAAEEAASQAKAAADSAEANAQG 465 Query: 267 AGNSAKAAKTSETNARSSETAAGQSASAAAGSKTAAASSASAASTSAGQASASATAAGKS 326 A SAK AK + T A A +A+ AA + A + + A S +A+ A K+ Sbjct: 466 AAASAKRAKDARTTAEQKRDKAEDAATTAASERAKAQAEKANAVASRAEAAIEREKAQKA 525 Query: 327 AESAASSASTATTKAGEATEQASAAARSASAAKTSETNAKASETSA 372 + AA+ +TA + A ++ A A A+ + A Sbjct: 526 QQRAATEQTTAKSADTAAETASADATAKRKLATEKAKAAQTAREKA 571 >UniRef50_C8RQF5 Putative uncharacterized protein n=1 Tax=Corynebacterium jeikeium ATCC 43734 RepID=C8RQF5_CORJE Length = 486 Score = 49.5 bits (116), Expect = 8e-04, Method: Composition-based stats. 
Identities = 107/383 (27%), Positives = 156/383 (40%), Gaps = 19/383 (4%) Query: 95 EDDARPEALRRFELMVEEVARNASAVAQNTAAAKKSASDASTSAREAATHA--------- 145 E DA E R F ++E + + + +AK A A SA A A Sbjct: 99 ERDALDELRRDFGAWIDEARASLATADSSAKSAKAEADKAKGSADAAGKSATAAAGSASA 158 Query: 146 -----ADAADSARAASTSAGQAASSAQSASSSAGTASTKATEASKSAAAAESSKSAAATS 200 +A SA AA SA A S ASS A +A + + A+ SA AA S AA S Sbjct: 159 AKISADNAGKSASAADGSATAADKSRSDASSFANSAKSASESATASAQAATESAGAAKDS 218 Query: 201 AGAAKTSETNASASLQSAATSASTATTKASEAATSARDAAASKEAAKSSETNASSSASSA 260 A +A +S +NA +S ++A A A + A A +SA A S +AAK + A SAS+A Sbjct: 219 ANSAVSSASNAYSSAKAAKADADRAKSSADSAGSSASAAKGSADAAKGAADAAGKSASAA 278 Query: 261 ASSATAAGNSAKAAKTSETNARSSETAAGQSASAAAGSKTAAASSASAASTSAGQASASA 320 ASSA+AA A A TS TNA S TAA A AA ++ + + + + + Sbjct: 279 ASSASAAKTDAGKAATSATNADKSATAAKADADRAANIASSTSWDGDKLTVNGKTSPSLT 338 Query: 321 TAAGKSAESAASSASTATTKAGEATEQASAAARSASAAKTSETNAKASETSAESSKTAAA 380 G + G E S A T + ET S Sbjct: 339 GPQGVKGDRGPRGYKGDQGDPGPQGEPGDKGDPGESGATTWGAISGKPETYPPESHKHTL 398 Query: 381 SSASSAASSASSASA-----SKDEATRQASAAKSSATTASTKATEAAGSATAAAQSKSTA 435 + ++A +S +SA+ S+D A R A +SA A+TK + S++ + Sbjct: 399 ADVTNAPNSHTSAATGNTLVSRDSAGRAQFATPTSAGHAATKGYVDSMSSSLTEIMDTHL 458 Query: 436 ESAATRAETAAKRAEDIASAVAL 458 A A+ ++ + Sbjct: 459 TKGAFELRVASSAPPSGTASNTI 481 >UniRef50_A8IBV4 Predicted protein n=1 Tax=Chlamydomonas reinhardtii RepID=A8IBV4_CHLRE Length = 1084 Score = 49.1 bits (115), Expect = 0.001, Method: Composition-based stats. 
Identities = 61/360 (16%), Positives = 117/360 (32%), Gaps = 10/360 (2%) Query: 105 RFELMVEEVARNASAVAQNTAAAKKSASDASTSAREAATHAADAADSARAASTSAGQAAS 164 R + + +A + AVA + + ++ + A+ + A+ Sbjct: 292 RLKQVEAALAARSDAVAASERGLESRGAEVQRKEIDVEAATVRLAEVEARVEAEQAELAT 351 Query: 165 SAQSASSSAGTASTKATEASKSAA-----AAESSKSAAATSAGAAKTSETNASASLQSAA 219 ++ A A +A A + A A +K +E +A A Sbjct: 352 KREAFELEAAEARAEAARVEALKRGLDGERAAVEAARAEVQAARSKLAEESAKLKADREA 411 Query: 220 TSASTATTKASEAATSARDAAASKEAAKSSETNASSSASSAASSATAAGNSAKAAKTSET 279 + A ARD AA +++ A++ A++ ++A + E Sbjct: 412 AESVLAEAAKRTKELDARDGAAKAAMEAANQAAAAAKAAAEEAAAAKKTVEERETALLEA 471 Query: 280 NARSSETAAGQSASAAAGSKTAAASSASAASTSAGQASASATAAGKSAESAASSASTATT 339 A + A AA S + A A A +A+ + + A A Sbjct: 472 QASVKKAQEQAQARAAETSAASEACVARKHDLDAREAALAERESK-----VAVEAKAVEA 526 Query: 340 KAGEATEQASAAARSASAAKTSETNAKASETSAESSKTAAASSASSAASSASSASASKDE 399 + E +A AR + K E + + +T AA AA A A +D Sbjct: 527 RKAEVGREAEDVARREARLKPQEKELSGRAVTLAARETGAAEVTKEAAKRLKDAEAREDA 586 Query: 400 ATRQASAAKSSATTASTKATEAAGSATAAAQSKSTAESAATRAETAAKRAEDIASAVALE 459 ++ A A ++ ++ +T+ A ++ + K+ E A+ A E Sbjct: 587 LAKREEALGKREEALKAAEAALAEREKSSKEATTTSGKATSKKDNDLKQREAAATKRAAE 646 Score = 46.4 bits (108), Expect = 0.007, Method: Composition-based stats. 
Identities = 69/346 (19%), Positives = 117/346 (33%), Gaps = 2/346 (0%) Query: 116 NASAVAQNTAAAKKSASDASTSAREAATHAADAADSARAASTSAGQAASSAQSASSSAGT 175 + A A A+ A+ + + A A A + A + A+ ++ A Sbjct: 378 DGERAAVEAARAEVQAARSKLAEESAKLKADREAAESVLAEAAKRTKELDARDGAAKAAM 437 Query: 176 ASTKATEASKSAAAAESSKSAAATSAGAAKTSETNASASLQSAATSASTATTKASEAATS 235 + A+ AAA E++ + E AS A A T A+ A Sbjct: 438 EAANQAAAAAKAAAEEAAAAKKTVEERETALLEAQASVKKAQEQAQARAAETSAASEACV 497 Query: 236 ARDAAASKEAAKSSETNASSSASSAASSATAA--GNSAKAAKTSETNARSSETAAGQSAS 293 AR A +E + + + A A A G A+ E + E A Sbjct: 498 ARKHDLDAREAALAERESKVAVEAKAVEARKAEVGREAEDVARREARLKPQEKELSGRAV 557 Query: 294 AAAGSKTAAASSASAASTSAGQASASATAAGKSAESAASSASTATTKAGEATEQASAAAR 353 A +T AA A+ A A A K E+ E+ ++ Sbjct: 558 TLAARETGAAEVTKEAAKRLKDAEAREDALAKREEALGKREEALKAAEAALAEREKSSKE 617 Query: 354 SASAAKTSETNAKASETSAESSKTAAASSASSAASSASSASASKDEATRQASAAKSSATT 413 + + + + + E++ T A+ + A ++ A E A+AA AT Sbjct: 618 ATTTSGKATSKKDNDLKQREAAATKRAAELEAQAKELAAREAKAAEREAAAAAAAEEATF 677 Query: 414 ASTKATEAAGSATAAAQSKSTAESAATRAETAAKRAEDIASAVALE 459 +A EA A AA+ + AA AE K+++ A+ + Sbjct: 678 KQQQAGEAVRQAGEAAKRAAETAKAAADAEGRVKKSDAEIQAMQKQ 723 >UniRef50_Q0CSU4 Predicted protein n=1 Tax=Aspergillus terreus NIH2624 RepID=Q0CSU4_ASPTN Length = 1181 Score = 49.1 bits (115), Expect = 0.001, Method: Composition-based stats. 
Identities = 59/393 (15%), Positives = 117/393 (29%), Gaps = 12/393 (3%) Query: 122 QNTAAAKKSASDASTSAREAATHAADAADSARAASTSAGQAASSAQSASSSAGTASTKAT 181 +N +A + A S + +A AS A S+ S+ + ++S A Sbjct: 30 ENNPSADPAPESAVASGENGPPDPDNNPPAAEEASPPADNPEESSTSSPAD-TSSSPDAE 88 Query: 182 EASKSAAAAESSKSAAATSAGAAKTSETNASASLQSAATSASTATTKASEAATSARDAAA 241 +S++ A A + +K NA+ +A S+ + Sbjct: 89 GSSEATTAPPDPAETPADAPAESKDGGENAAPETDDSAQKPEDENPDKSDEPAAEEQTTE 148 Query: 242 SKEAAKSSETNASSSASSAASSATAAGNSAKAAKTSETNARSSETAAGQSASAAAGSKTA 301 + + + + + A + A + A A +A Sbjct: 149 NPAETPAETPAETPAENPAETPAENTEPAQTPDAGDSETAVDHTEAPSNEDDFPECDPSA 208 Query: 302 AASSASAASTSAGQASASATAAGKSAESAASSASTATTKAGEATEQASAAARSASAAKTS 361 A A + +A + + AG + A S + Sbjct: 209 EIDKVEAEKAEAERQAAEDAKVEEEEAAKEKELIDVDNGAGTSDATPDAVTDS-PQQDSD 267 Query: 362 ETNAKASETSAESSKTAAASSASSAASSASSASASKDEATRQASAAKSSATTASTKATEA 421 A+ET+ E+S + + + A + A + + T KA E Sbjct: 268 VAGEPAAETNDETSA-SDEQGTVGGDAPPEAPEAPEAPEAPAAEVSANDVTAEEPKAAET 326 Query: 422 AGSATAAAQSKSTAESAATRAETAAKRAEDIASAVALEDASTTKKGIVQLSSATNSTSET 481 S + A+ +S + A+ A+ S T + + S T + E Sbjct: 327 QESDSPLAEPES-----PAAEDPGAEPADPQPSPPEAAAEDTPETPPEETSKETVNDKED 381 Query: 482 LAATPKAVKSAYDNAEKRLQKDQNGADIPDKGC 514 P+ +A + E Q+ N ++ +K Sbjct: 382 ----PEPAPAANTSGEDEPQQADNPSNDEEKDQ 410 >UniRef50_C7TJM8 Putative uncharacterized protein n=3 Tax=Lactobacillus rhamnosus RepID=C7TJM8_LACRL Length = 3390 Score = 48.8 bits (114), Expect = 0.001, Method: Composition-based stats. 
Identities = 88/312 (28%), Positives = 145/312 (46%), Gaps = 3/312 (0%) Query: 124 TAAAKKSASDASTSAREAATHAADAADSARAASTSAGQAASSAQSASSSAGTASTKATEA 183 AS A+ A E A A DAA A + + A AAS A SA++ A + + A Sbjct: 1758 AQNLNNQASAAADIASEYADKANDAAGKALSEANKADSAASQAVSANNRAKQIADTVSAA 1817 Query: 184 SKSAAAAESSKSAAATSAGAAKTSETNASASLQSAATSASTATTKASEAATSARDAAASK 243 + A++ S S +A SA + + + + A++AS A+ A+SA Sbjct: 1818 NDQASSKASQASESALSASVVAQEASATANNASAIASAASDTAKSANAIASSAASRFPGN 1877 Query: 244 EAAKSSETNASSSASSAASSATAAGNSAKAAKTSETNARSSETAAGQSASAAAGSKTAAA 303 ++ S A+ + A+S AT A +A +A + +S S +A+ + A Sbjct: 1878 DSLASLAKTANDATIQASSYATQASATAGSAVSLAKV--TSSANLAASKAASQANSAIVA 1935 Query: 304 SSASAASTSAGQASASATAAGKSAESAASSASTATTKAGEATEQASAAARSASAAKTSET 363 + S AST A QAS A A +A++A+S+A A + A +A QA+ A+ +A +K Sbjct: 1936 GNMSQASTFANQASNFAKIASSAADAASSTADDAMSAALQAKGQAAIASSAADDSKRLAG 1995 Query: 364 NAKASETSAESSKTAAASSASSAASSASSASASKDEATRQASAAKSSATTASTKATEAAG 423 N S T AA A A+ + ++A K + ++A + A A+ +A+++A Sbjct: 1996 NIATLGDRLVSDATKAADRA-KASGDIAESAAVKYPSDTAITSANNVANQAADEASDSAD 2054 Query: 424 SATAAAQSKSTA 435 SA +AAQS Sbjct: 2055 SAKSAAQSGDIP 2066 >UniRef50_C1E2Y3 Predicted protein n=1 Tax=Micromonas sp. RCC299 RepID=C1E2Y3_9CHLO Length = 3044 Score = 48.4 bits (113), Expect = 0.002, Method: Composition-based stats. 
Identities = 89/396 (22%), Positives = 137/396 (34%), Gaps = 22/396 (5%) Query: 80 DSQPGTLNDFLGAMTED-DARPEALRRFELMVEEVARNASAVAQNTAAAKKSASDASTSA 138 D PG D A E +A P + + +V R A +A + A D Sbjct: 600 DEGPGGGLDLDKAAKERLEATPMLASQATKLAADVERAADLLASRLCETRDKADDLRRRC 659 Query: 139 REAATHAADAADSARAASTSAGQAASSAQSASSSAGTASTKATEASKSAAAAESSKSAAA 198 AAT A++ R A + + +S A+ A +++ A S+ A Sbjct: 660 D-AATEFEHGAEALRDAYVKSTKECERLAKELASLTEANELARQSATDADDLASAAMEAR 718 Query: 199 TSAGAAKTSETNASASLQSAATSASTATTKASEAATSARDAAASKEAAKSSETNASSSAS 258 A A + + S A + + A AA +A +E +++A Sbjct: 719 DRAETAAKAARENAESTVERAEALRARAEARAAKAELLASAAREAQAEAEAEAAQATAAL 778 Query: 259 SAASSATAAGNSAKAAKTSETNARSSETAAGQSA--------SAAAGSKTAAASSASAAS 310 AA A+ +A A T E A+ + AA S+ ++ +S ++A Sbjct: 779 DAARRREASAAAAAARATDEIVAKEAAAAAAVSSAEELTAIVTSLKTQVATLKTSLTSAE 838 Query: 311 TSAGQASASATAAGKSAESAASSASTATTKAGEATEQASAAARSASAAKTSETNAKASET 370 T A A A AA E A A A T A +A +A A SE + Sbjct: 839 TKAKAAEGDANAARDECERALRRAEDAETAAATYKIRADSAD--ADVEVKSERIRELRAA 896 Query: 371 SAESSKTAAASSASSAA-SSASSASASKDEATRQASAAKSSATTASTKATEAAGSATAAA 429 A++ T A + A + A K A R+ + A A + K E+ AA Sbjct: 897 VADALGTRADDEDTVAKLNEELLAIGEKYAAAREEAKANKDALADAVKRVESLEDEIAA- 955 Query: 430 QSKSTAESAATRAETAAKRAEDIASAVALEDASTTK 465 A AE+A A D A A T+ Sbjct: 956 --------ARLLAESATDDAVDAAGEYFAAYAEETQ 983 >UniRef50_A9FYY9 Protein kinase n=2 Tax=Sorangium cellulosum 'So ce 56' RepID=A9FYY9_SORC5 Length = 1219 Score = 48.4 bits (113), Expect = 0.002, Method: Composition-based stats. 
Identities = 66/402 (16%), Positives = 135/402 (33%), Gaps = 13/402 (3%) Query: 115 RNASAVAQNTAAAKKSASDASTSAREAATHAADAADSARAASTSAGQAASSAQSASSSAG 174 A A A + DA + +R + + +A A + Sbjct: 436 EPAGAAASEPPEGDEDDEDALSHSRPTPVRNTPTLVAELTRDSRPTAIGGAAFMAQIAFA 495 Query: 175 TASTKATEASKSAAAAESSKSAAATSAGAAKTSETNA-------SASLQSAATSASTATT 227 + +A A+ + ++ + T GAA ++ AS ++ + Sbjct: 496 QEAARADAAAAADDSSVPEDARPTTVGGAALIAQAAFAREAAGGGASSETDEPRPPRESW 555 Query: 228 KASEAATSARDAAASKEAAKSSETNASSSASSAASSATAAGNSAKAAKTSETNARSSETA 287 + +AA + + AA +E A T+ + ++ + A A + + + + Sbjct: 556 QLLDAAAPSPEPAAPQEPALPVSTDPAPGDAARSVEAPVATAATGLSSLRLSLDDLLSSP 615 Query: 288 AGQSASAAAGSKTAAASSASAASTSAGQASASATAAGKSAESAASSASTATTKAGEATEQ 347 S + S +S A + A ASS + +A Sbjct: 616 GQPERRPVEPSAKRPSPSGVEEPSSPEAEKAPSKPPSSPAAERASSKPPSAERASSKPPS 675 Query: 348 ASAAARSASAAKTSETNAKASETSAESSKTAAASSASSAASSASSASASKDEATRQASAA 407 A A+ + E + A S++ + A A S S+ +A + + +A Sbjct: 676 AERASSKPPSTSAVEPASVAKPASSKPPSSPEAEQAPSKPLSSPAAERASSKPPSSPAAE 735 Query: 408 KSSATTASTKATEAAGSATAAAQSKSTAESAATRA------ETAAKRAEDIASAVALEDA 461 ++S+ S+ A E A S ++ + AE A+++ E A+ + + A Sbjct: 736 RASSKPPSSPAAERASSKPPSSPAAPAAERASSKPPSTSAVEPASVAKPASSKPPSSSVA 795 Query: 462 STTKKGIVQLSSATNSTSETLAATPKAVKSAYDNAEKRLQKD 503 ++ S+A +S SE L+A+ S A + Sbjct: 796 ESSPSKPPSSSAAESSPSEPLSASAAEPASVAKPASSKPPSG 837 >UniRef50_B7GRB7 Allergen V5/Tpx-1 family protein n=1 Tax=Bifidobacterium longum subsp. infantis ATCC 15697 RepID=B7GRB7_BIFLI Length = 973 Score = 48.0 bits (112), Expect = 0.002, Method: Composition-based stats. 
Identities = 77/363 (21%), Positives = 142/363 (39%), Gaps = 13/363 (3%) Query: 71 HAGTITVYEDSQPGTLNDFLGAMTEDDARPEALRRFELMVEEVARNASAVAQNTAAAKKS 130 + +T + DS ++ A+++ + + + A A+ A + Sbjct: 566 YLADLTAWRDSLTNADANWQTALSKSKQAAQDASDAAEALAAAQQAARKAAEEAQQAAQK 625 Query: 131 ASDASTSAREAATHAADAADSARAASTSAGQAASSAQSASSSAGTASTKATEASKSAAAA 190 A D +A EA A D A A A A + A Q+A + A +A+ ++T+ ++S AAA Sbjct: 626 AKDLQAAADEA-QKAYDEAVKANADKAKALEEARKDQTAKNEAYSAAQQSTKEARSGAAA 684 Query: 191 ESSKSAAATSAGAAKTSETNASAS--------LQSAATSASTATTKASEAATSARDAAAS 242 ++ AA +A + A+ + Q+AA + + A +A T A A Sbjct: 685 ATAAQTAAQTAVDKANTAVAAAQAKIDAADKLAQTAAKNKTDAEAAIRQANTDKTKAEAD 744 Query: 243 KEAAKSSETNASSSASSAASSATAAGNSAKAAKTSETNARSSETAAGQSASAAAGSKTAA 302 AK+++ A + +A + T + +AA A + A + + A A Sbjct: 745 LADAKAAKAEAEKAKQAALEAKTVSDAKVEAAGKQVKAADEAIANAKAAYAKAKDDLDTA 804 Query: 303 ASSASAASTSAGQASASATAAGKSAESAASSASTATTKAGEATEQASAAARSASAAKTSE 362 A + + A ++ + A + + A K EA + AA ++ AK Sbjct: 805 TGKLDDARNTIKRLQN----AEENLKKANAKLTEAQAKLDEANKAKDAADKAYEKAKADY 860 Query: 363 TNAKASETSAESSKTAAASSASSAASSASSASASKDEATRQASAAKSSATTASTKATEAA 422 A + +++ A + + A A + + EA +QA AK A KA +A Sbjct: 861 DAKLADKQASDKELANAKQAEAEAKKKAEEEARKQQEAQKQADQAKQDAEAKKQKAEQAR 920 Query: 423 GSA 425 A Sbjct: 921 KQA 923 >UniRef50_A1C962 Putative uncharacterized protein n=1 Tax=Aspergillus clavatus RepID=A1C962_ASPCL Length = 1061 Score = 48.0 bits (112), Expect = 0.002, Method: Composition-based stats. 
Identities = 59/399 (14%), Positives = 143/399 (35%), Gaps = 18/399 (4%) Query: 98 ARPEALRRFEL--MVEEVARNASAVAQNTAAAKKSASDASTSAREAATHAADAADSARAA 155 R E L+ + M E++ R+ + A + T+ +A + + + + Sbjct: 20 VRAEDLQELKPQFMPEQLKRHIDYGTTSLADIPTAGEAQLTARSDANSFVSYFNNLLQNG 79 Query: 156 STSAGQAASSAQSASSSAGTASTKA----------TEASKSAAAAESSKSAAATSAGAAK 205 S A +A A A + + + + + TSA + Sbjct: 80 SGGAATSAPVASEAPVLVVPITISVDANGVTHTITGAPTTAPTPTAAPEPVVVTSATSTT 139 Query: 206 TSETNASASLQSAATSASTATTKASEA-ATSARDAAASKEAAKSSETNASSSASSAASSA 264 +++ + +SAS+ +S A S ++ + E+ S ASSS+ + Sbjct: 140 EPPAPPTSAAANTGSSASSQDPTSSGANQPSPKNVDPTSESQTSPAALASSSSPEPSPKN 199 Query: 265 TAAGNSAKAAKTSETNARSSETAAGQSASAAAGSKTAAASSASAASTSAGQASASATAAG 324 + ++ + + ++ S E ++ S + T +S+ A + S A Sbjct: 200 VDPTSESQTSPAAPVSSSSPEPSSTPEQSPPKEAPTTTTPPVEQSSSEAQSPATSEKAQT 259 Query: 325 KSAESAASSASTATTKAGEATEQASAAARSASAAKTSETNA-KASETSAESSKTAAASSA 383 +A S+ + + + + +A+ + + +NA A+ + S + +S Sbjct: 260 PTASSSEQAQTPTVSSSDQASSEGKQTTPDHTPTPVVSSNAPTATSGNIISDVVSGVNSV 319 Query: 384 SSAASSASSASASKDEATRQASAAKSSATTASTKATEAAGSATAAAQSKSTAE----SAA 439 S SS+++ A+ + + T T + + T A+Q++++ A+ Sbjct: 320 VSGILETSSSTSVTPTASSPVTEESKTPEATPTGTTPSEATPTKASQTETSQPETRDPAS 379 Query: 440 TRAETAAKRAEDIASAVALEDASTTKKGIVQLSSATNST 478 + + +S A + + G++ ++ S Sbjct: 380 STPLPGSSTDAVPSSIEATPTPTPSTSGLIDSLTSVISE 418 >UniRef50_B8JIY2 Nucleolar and coiled-body phosphoprotein 1-like n=5 Tax=Eukaryota RepID=B8JIY2_DANRE Length = 1001 Score = 48.0 bits (112), Expect = 0.002, Method: Composition-based stats. 
Identities = 72/388 (18%), Positives = 159/388 (40%), Gaps = 2/388 (0%) Query: 128 KKSASDASTSAREAATHAADAADSARAASTSAGQAASSAQSASSSAGTASTKATEASKSA 187 KK + + A AA + +S+S+ ++S + A T +K T A + Sbjct: 542 KKPVTTPVSKPAPAKPPAAKTTNKPAESSSSSSDSSSDEEPKKKPATTPVSKPTPAKPTP 601 Query: 188 AAAESSKSAAATSAGAAKTSETNASASLQSAATSASTATTKASEAATSARDAAASKEAAK 247 ++K A ++S ++ + + + A+ + K + AA A ++ S ++ Sbjct: 602 TVKTTNKQAESSSDSSSDEEDKPQKKAATTPASKPVATSAKTTPAAKPAESSSDSDSSSD 661 Query: 248 SSETNASSSASSAASSATAAGNSAKAAKTSETNARSSETAAGQSASAAAGSKTAAASSAS 307 +++ + + A + A +S+++ ET A ++A AA + T + ++AS Sbjct: 662 EEAQKKTTTPVPSKPATPAKPAAKAAESSSDSSDSEDETPAAKTAKPAAATATKSPAAAS 721 Query: 308 AASTSAGQASASATAAGKSAESAASSASTATTKAGEATEQASAAARSASAAKTSETNAKA 367 SA + +A ++++ + S + T A + A + A +S ++ + Sbjct: 722 KPPVSASKPAAESSSSDSDSSSEDEKQAKTTVTPKAAPKPAVTPVNAKKAESSSSESSDS 781 Query: 368 SETSAESSKTAAASSASSAASSASSASASKDEATRQASAAKSSATTASTKATEAAGSATA 427 S++ E+ KT A ++ + +++ ++ + + SS+ ++S + + AT Sbjct: 782 SDSETEAKKTPAKPVVANGKPVTKTPTSAAKTPAKKTAESSSSSDSSSDEEEPSKTPATK 841 Query: 428 AAQSKSTAESAATRA--ETAAKRAEDIASAVALEDASTTKKGIVQLSSATNSTSETLAAT 485 + T + T +T + A A A +S + +ATN T AT Sbjct: 842 TPPATKTPPATKTPPATKTPPATKKQAAEAKAESSSSEESSEEEETPAATNGTQSAKKAT 901 Query: 486 PKAVKSAYDNAEKRLQKDQNGADIPDKG 513 PK K A + + Q ++ P + Sbjct: 902 PKNKKIATSTPQTFPRTKQKDSNAPFRR 929 >UniRef50_A8HZI1 Predicted protein n=1 Tax=Chlamydomonas reinhardtii RepID=A8HZI1_CHLRE Length = 1041 Score = 48.0 bits (112), Expect = 0.002, Method: Composition-based stats. 
Identities = 83/410 (20%), Positives = 133/410 (32%), Gaps = 18/410 (4%) Query: 68 PPSHAGTITVYEDSQPGTLNDFLGAMTEDDARPEAL---RRFELMVEEVARNASAVAQNT 124 H G+ + P + + + + + E L +RF+ + Sbjct: 495 NFVHFGSKPPEAKAAPVAKRPAVDLLDDFEEKLENLPIAKRFKP--AQPVAKPPQQLLKA 552 Query: 125 AAAKKSASDASTSAREAATHAADAADSARAASTSAGQAASSAQSASSSAGTASTKATEAS 184 +A+ S +AA D R S +A + +A + + + + Sbjct: 553 PKPSAAAAPPPRSDTQAAPAIRDEQQGPRQPSLQRQPSAKANNNAGAGSVLPLIRTGSGT 612 Query: 185 KSAAAAESSKSAAATSAGAAKTSETNASASLQSAATSASTATTKASEAATSARDAAASKE 244 + A ++ AA+ A TS +A + + + T A+ AA A + A E Sbjct: 613 AAGTAPAAAAPAASKPQKAGNTSSAKPPLQSGAATGAGTGSRTDAAPAAGIAAKSGADLE 672 Query: 245 AAKSSETNASSSASSAASSATAAGNSAKAAKTSETNARSSETAAGQSASAAAGSKTAAAS 304 A + S A + S AA + A ++ A G SA AA A Sbjct: 673 AGPLKKKPRPSGADGSGPSQAAAAAAGIDDARRGVAAPQAKAAVGSSAKAA-QPAEQA-- 729 Query: 305 SASAASTSAGQASASATAAGKSAESAASSASTA--TTKAGEATEQASAAARSASAAKTSE 362 A A A A K +AA +A A K EA + A S + Sbjct: 730 ----ARPRASLPLPGAVPAEKREGAAAGAAGEAQKANKPAEAAQPPKAPRASDPG-PSRS 784 Query: 363 TNAKASETSAESSKTAAASSASSAASSASSASASKDEATRQASAAKSSATTASTKATEAA 422 T KA + +A S S+ + A A++A A A A +A+ A A Sbjct: 785 TKEKAKKDAAPSGAQKERSADAPARPLATTAPRPLPGA---AKGAAGREPSAAAAAAPAV 841 Query: 423 GSATAAAQSKSTAESAATRAETAAKRAEDIASAVALEDASTTKKGIVQLS 472 S A + + A AA D+A G Q S Sbjct: 842 PSPAPAPALAAATTAVGPLAVVAAAPHLDLALPSNGPTNRVAWPGTGQTS 891 >UniRef50_C1MMF6 Predicted protein n=1 Tax=Micromonas pusilla CCMP1545 RepID=C1MMF6_9CHLO Length = 1781 Score = 47.6 bits (111), Expect = 0.003, Method: Composition-based stats. 
Identities = 70/331 (21%), Positives = 119/331 (35%), Gaps = 3/331 (0%) Query: 119 AVAQNTAAAKKSASDASTSAREAATHAADAADSARAASTSAGQAASSAQSASSSAGTAST 178 A A+ A ++ ++A+ +A A + SA +SA + A A++ Sbjct: 644 ATAEKRCAVVQNVANAAKERSDADVSAMREKLADAEKRCSAAAERASAADVAEKALAAAS 703 Query: 179 KATEASKSAAAAESSKSAAATSAGAAKTSETNASASLQSAATSASTATTKASEAATSARD 238 A A+ A K A G + A + + A +EAA R Sbjct: 704 DAAAAATEKAEWNMKKLKKAIEKGKGYEQRVEVLETELKRAKADAEARDAEAEAAKKTRA 763 Query: 239 AAASKEAAKSSETNASSSASSAASSATAAGNSAKAAKTSETNARSSETA-AGQSASAAAG 297 AAK++ + ++ A ++ + +E+N+++SE A A A A Sbjct: 764 KETRDAAAKTASAEERADGLRERVASLEAELESQQSAAAESNSKASEAALASARARVAEL 823 Query: 298 SKTAAASSASAASTSAGQASASATAAGKSAESAASSASTATTKAGEATEQASAAARSASA 357 S A ++ A A+A+A AA + E + A KA A +A AA+ A Sbjct: 824 SDEVARATRDADRREREAAAATAAAAESATELERAREFRAKAKAKIADYEAKVAAKDAEC 883 Query: 358 AKTSE--TNAKASETSAESSKTAAASSASSAASSASSASASKDEATRQASAAKSSATTAS 415 AK + +KA ++ A A AA + + + + K TA Sbjct: 884 AKLAGDVEASKAELAASAEKLAVAEKRAKVAADVSRAIETKSTALETEVATLKEKLETAR 943 Query: 416 TKATEAAGSATAAAQSKSTAESAATRAETAA 446 A + AA + + A R A Sbjct: 944 RDVDRAREAVVAANEVAGARVAKAQRDADAR 974 >UniRef50_Q9W1J0 Transporter n=17 Tax=Endopterygota RepID=Q9W1J0_DROME Length = 1201 Score = 47.6 bits (111), Expect = 0.003, Method: Composition-based stats. 
Identities = 59/389 (15%), Positives = 142/389 (36%), Gaps = 21/389 (5%) Query: 93 MTEDDARPE--ALRRFELMVEEVARNASAVAQNTAAAKKSASDASTSAREAATHAADAAD 150 + E++A P+ A ++ + +V + + T + A + + +A + Sbjct: 665 LIEEEASPKQAANKKDRQLTLDVDVDVECLPLATVGKRPGAPKSMGKQGSSGNLQTNAKN 724 Query: 151 SA-RAASTSAGQAASSAQSASSSAGTASTKATEASKSAAAAESSKSAAATSAGAAKTSET 209 ++ +T + + +A+ K A ++ E + + G K+++ Sbjct: 725 GTHKSPTTEISPSKQPLIPSEKDNKSANIKVLAAGTQSSGRECTTMSTFAGGGEKKSTKV 784 Query: 210 NASASLQSAATSASTATTKASEAATSARDAAASKEAAKSSET---NASSSASSAASSATA 266 N + SAA +++ + +A + + A ++ ++ ++AA+ Sbjct: 785 NLATGATSAAAPLASSNIATAATVAAATKSPPAAGVAPTTLVGIAATGTTVAAAANGKAT 844 Query: 267 AGNSAKAAKTSETNARSSETAAGQSASAAAGSKTAAASSASAASTSAGQASASATAAGKS 326 + + + T A ++ A+ +A+A + A++ + ++ + S + A Sbjct: 845 LKTTISSGAVAATAAGGNKAASVAAATATVAAGKTPAAAMALNVAASVKVSQTDKTAPLK 904 Query: 327 AESAASSASTATTKAGEATEQASAAARSASAA----KTSETNAKASETSAESSKTAAASS 382 +AS A+ +AS A A+AA + T A+ +A K A Sbjct: 905 TTISASVATAP------GGTKASTAPSVATAAVVSVSSGATVPAAASKAAAPLKATAPPP 958 Query: 383 ASSAASSASSASAS-----KDEATRQASAAKSSATTASTKATEAAGSATAAAQSKSTAES 437 + +A ++ +A ++ A +SA+ A T + + KS A + Sbjct: 959 SLAAVNATQAAKSTPVAAASALVASAPPAKSTSASVAKTNTNAGSKPTATLSTCKSDAPA 1018 Query: 438 AATRAETAAKRAEDIASAVALEDASTTKK 466 AA + A + + +K Sbjct: 1019 AAASVGVTNGKVTPSGGATSTAQPAVNQK 1047 >UniRef50_C0XU23 Possible gp21, tail fiber protein n=1 Tax=Corynebacterium lipophiloflavum DSM 44291 RepID=C0XU23_9CORY Length = 458 Score = 47.6 bits (111), Expect = 0.003, Method: Composition-based stats. 
Identities = 90/393 (22%), Positives = 165/393 (41%), Gaps = 10/393 (2%) Query: 26 AKRNSTTVVVNTLASENPDEAGRYSMDVEYGQYSVILLVEGFPPSHAGTITVYEDSQPGT 85 A V +T A E +G + +V+ G V L +GF + A +TV E + P + Sbjct: 28 AATTVDGRVKSTRAVEINVSSGPVTKNVDPGALMVQLQCDGFSDTQARQVTVPE-AGPVS 86 Query: 86 LNDFLGAMTEDDARPEALRRFELMVEEVARNASAVAQNTAAAKKSASDASTSAREAATHA 145 L D L + E D P + + E A +A+ A++ A S + + + +AA A Sbjct: 87 LRDLL--LNEFDYAPAVVSTVARLT-ERAEDATERAEDIADTFGSLNGVTKAVSDAAASA 143 Query: 146 ADAADSARAASTSAGQAASSAQSASSSAGTASTKATEASKSAAAAESSKSAAATSAGAAK 205 A +SA A+++A A S A+SSA TA+ ++ +++ T+ G Sbjct: 144 QAARESAETATSAASSANESGSVAASSASTATQAEQIIAEYRQTVIAARDETLTARGETV 203 Query: 206 TSETNASASLQSAATSASTATTKASEAATSARDAAASKEAAKSSETNASSSASSAASSAT 265 T+ NA+AS +AA SA TA + A + A+ S AA +S A+ A A A Sbjct: 204 TAAGNAAASESNAADSAMTAAGSQAGAQAAEVAASDSAAAAAASAAAATVEADRAKKEAD 263 Query: 266 AAGNSAKAAKTSETNARSSETAAGQSASAAA----GSKTAAASSASAASTSAGQASASAT 321 AG +A+ + + + +E AG + ++ +SA AA T+A A Sbjct: 264 RAGQAAEESSVKAVSDKVNEILAGAPEAYDTLLEIATELERGASAEAALTAAIAGKAPLD 323 Query: 322 AAGKSAESAASSASTATTKAGEATEQASAAAR--SASAAKTSETNAKASETSAESSKTAA 379 + + A+ A S + + K E + + ++ + + Sbjct: 324 HIHTAQDVGAAPAFHEHNARDVWDASGSTVQQVLDSHELKVREISGIKTSLDSKVNTSDV 383 Query: 380 ASSASSAASSASSASASKDEATRQASAAKSSAT 412 +S+ ++ + +A+ + + A+ A ++ Sbjct: 384 SSTLTNGSVVRRTATGTINVASPTDPAEAATKG 416 >UniRef50_Q1IXI7 Putative uncharacterized protein n=1 Tax=Deinococcus geothermalis DSM 11300 RepID=Q1IXI7_DEIGD Length = 636 Score = 47.6 bits (111), Expect = 0.003, Method: Composition-based stats. 
Identities = 61/381 (16%), Positives = 126/381 (33%), Gaps = 7/381 (1%) Query: 127 AKKSASDASTSAREAATHAADAADSARAASTSAGQAASSAQSASSSAGTASTKATEASKS 186 A D + E T A + +S + S+ Q ++ + ++ Sbjct: 50 ALTPPPDLKRAPLEVVTLAPPTPATPPVSSPVRERVTSTPQPPATPRTPPAAAPKPTRQA 109 Query: 187 AAAAESSKSAAATSAGAAKTSETNASASLQSAATSASTATTKASEAATSARDAAASKEAA 246 A +S+ + A ++ AA S S + SA A + + + +++SA +AA Sbjct: 110 QTAPKSTPTPAPSTPAAAARSS--PSPAQTSAPAPAPESASPPTASSSSAEPLRPLPQAA 167 Query: 247 KSSETNASSSASSAASSATAAGNSAKAAKTSETNARSSETAAGQSASAAAGSKTAAASSA 306 + + + +A A S A + + + SA A A A Sbjct: 168 QQVKVESVEPRGAAPFPLPQADASVPAREQEKPVPETPAVPVRTENSAPEAQAPAEAPPA 227 Query: 307 SAASTSAGQASASATAAGKSAESAASSASTATTKAGEATEQASAAARSASAAKTSETNAK 366 + + TA + ++A++S + A A + + + T+A Sbjct: 228 EPVTATPAPDPEPVTALPRREDAASASTAETAAPARAEATATEEPAPARGNSPAASTSAD 287 Query: 367 ASETSAESSKTAAASSASSAASSASSASASKDEATRQASAAKSSATTASTKATEAAGSAT 426 + + +++ A S +A+ + + A + + T+ +AA A Sbjct: 288 ETAVTRIPARSPAEPQGSV----PEAATGPRGTLPPSTAEAVAGREGSVTRTPQAAVPAA 343 Query: 427 AAAQSKSTAESAATRAETAAKRAEDIASAVALEDASTTKKGIVQLSSATNSTSETLAATP 486 +T + A AETAA ++ V + + + +++ T T Sbjct: 344 PTPSRDTTPDQ-APVAETAAPTPTRSSAPVTAVPETAPAAEVSWPAPTPGTSAGTGEGTV 402 Query: 487 KAVKSAYDNAEKRLQKDQNGA 507 V QN A Sbjct: 403 SPVARTGSVPGDARPGPQNEA 423 >UniRef50_B2AE55 Predicted CDS Pa_4_2900 n=1 Tax=Podospora anserina RepID=B2AE55_PODAN Length = 796 Score = 47.2 bits (110), Expect = 0.003, Method: Composition-based stats. 
Identities = 70/392 (17%), Positives = 122/392 (31%), Gaps = 31/392 (7%) Query: 106 FELMVEEVARNASAVAQNTAAAKKSASDASTSA--REAATHAADAADSARAASTSAGQAA 163 E V E+ +++ A A + A E S + G Sbjct: 263 LEGKVPEIVKHSQEEAHVEPEASAIEEEVKEKAAVEEELLQKVPEVPSTSEGTAGVGTDK 322 Query: 164 SSAQSASSSAGTASTKATEASKSAAAAESSKSAAATSAGAAKTSETNASASLQSAATSAS 223 S + + A + AA ++++A + A A+ A AS Sbjct: 323 SENDKSVAETVAAVAATAGTALLGAAVVAAQNAGEVATEVAHKVSDVAADYAAKAPEVAS 382 Query: 224 TATTKASEAATSARD---------AAASKEAAKSSETNASSSASSAA------------- 261 KA+E A+ A + E A + A+ A+ AA Sbjct: 383 DVAQKATEVASDAAQKASEAAANVTTQATETATEATQKATEVATDAATNLPDSVKEILPE 442 Query: 262 ------SSATAAGNSAKAAKTSETNARSSETAAGQSASAAAGSKTAAASSASAASTSAGQ 315 + A AK + E+ A +S A S AAA++A+ +A + Sbjct: 443 SVQQTITEAQQQAVEAKQEEIVESLAPDVPAPVKESLKEALESPEAAANTAAVEEKAAVE 502 Query: 316 ASASATAAG-KSAESAASSASTATTKAGEATEQASAAARSASAAKTSETNAKASETSAES 374 A S E +A+ A E+ + AA +A + ++ A + Sbjct: 503 AELLKEVKPADSIEESAAKAQAYIKAEEAKAEEEAKAAAAAKIEEETKVVADVVPVEVKE 562 Query: 375 SKTAAASSASSAASSASSASASKDEATRQASAAKSSATTASTKATEAAGSATAAAQSKST 434 S A S +A+++A+ + EA A + AA + Sbjct: 563 SLEKAGESPEAASNAAAVEDKREVEAELLEEVKAVEPEPLKIDAPKPVEETKAADTAVVV 622 Query: 435 AESAATRAETAAKRAEDIASAVALEDASTTKK 466 AA A A +AV +E + ++ Sbjct: 623 EPPAAAEEVKAVDAAPAADTAVVVEPPAAAEE 654 >UniRef50_A4X773 Putative uncharacterized protein n=2 Tax=Salinispora RepID=A4X773_SALTO Length = 375 Score = 47.2 bits (110), Expect = 0.004, Method: Composition-based stats. 
Identities = 63/309 (20%), Positives = 101/309 (32%), Gaps = 10/309 (3%) Query: 101 EALRRFELMVEEVARNASA----------VAQNTAAAKKSASDASTSAREAATHAADAAD 150 EA+ R + +V + A +AQ A A + A T EA + DAA Sbjct: 66 EAVDRLDQIVASLTEALHAQLSPVGVEQQLAQARADAANQVAAAQTDRDEARRASEDAAA 125 Query: 151 SARAASTSAGQAASSAQSASSSAGTASTKATEASKSAAAAESSKSAAATSAGAAKTSETN 210 + A A A + +SA + A A +AT A A A ++ A + A+ + Sbjct: 126 ATALAREQARNATAEQESARTEAARAVAEATAAVDRADQARQARDQARKARDQARQEASA 185 Query: 211 ASASLQSAATSASTATTKASEAATSARDAAASKEAAKSSETNASSSASSAASSATAAGNS 270 A A L A T+ + A + A AA + AA + Sbjct: 186 AQALLGQAQQDRDAVRTQLHHLGAELTAQRQQAADLAAERDAARAEAQRAAQAEAAATDQ 245 Query: 271 AKAAKTSETNARSSETAAGQSASAAAGSKTAAASSASAASTSAGQASASATAAGKSAESA 330 A+ A A A+AA + A + A QA A A A + Sbjct: 246 ARGHHADAQRAHQDTQRAQAQAAAATQERDRAVADRDRAEAQRQQAVGDAERAHADAATL 305 Query: 331 ASSASTATTKAGEATEQASAAARSASAAKTSETNAKASETSAESSKTAAASSASSAASSA 390 A + + A + +AA + A++ A A + A++ Sbjct: 306 AEHHESLRGQLTVAQQDITAAQGQLAELTARLRAAESDRDEAHRRVAQLADQVAQLAAAL 365 Query: 391 SSASASKDE 399 + A Sbjct: 366 AHREAPTPT 374 >UniRef50_A8LW37 Putative uncharacterized protein n=2 Tax=Salinispora RepID=A8LW37_SALAI Length = 1426 Score = 46.4 bits (108), Expect = 0.006, Method: Composition-based stats. Identities = 56/182 (30%), Positives = 75/182 (41%) Query: 99 RPEALRRFELMVEEVARNASAVAQNTAAAKKSASDASTSAREAATHAADAADSARAASTS 158 P + A + A + A + A+ A A +A A A Sbjct: 821 PPGLHKFLTEGRYAAAERDQSAAAHLAVVASLVERINEVAQTATQDAMNAQAVAAEARDD 880 Query: 159 AGQAASSAQSASSSAGTASTKATEASKSAAAAESSKSAAATSAGAAKTSETNASASLQSA 218 A AA A A+ SA A+T A +A++ A A S + A + AKT+ T A S +SA Sbjct: 881 AASAADYANQAAQSAQAAATYAAQAAQYANQASKSVAEAEAAVQTAKTAATQAVDSARSA 940 Query: 219 ATSASTATTKASEAATSARDAAASKEAAKSSETNASSSASSAASSATAAGNSAKAAKTSE 278 SAS A A SAR+A A +AA S A SA AA +A A K A+ E Sbjct: 941 IRSASWAILSHERAVQSAREAQAFAQAAYDSAIAAGKSAEEAAKAAEDARREYKLAEGRE 1000 Query: 279 TN 280 Sbjct: 1001 VA 1002 Score = 45.3 bits (105), Expect = 0.014, Method: Composition-based stats. 
Identities = 70/331 (21%), Positives = 116/331 (35%), Gaps = 19/331 (5%) Query: 146 ADAADSARAASTSAGQAASSAQSASSSAGTA-----STKATEASKSAAAAESSKSAAATS 200 DAA +A+ AS +A A +AQ+A +A A + + + + A + AA + Sbjct: 544 GDAATAAQRASEAANSATIAAQAAVETASQAIEVYDAAREADTERLAVFQDERVEAAHQA 603 Query: 201 AGAAKTSETNASASLQSAATSASTATTKASEAATSARDAAASKEAAKSSETNASSSASSA 260 A ++ A+ A + +EA A + AA+ AA+ N ++ + Sbjct: 604 AAQYDEAQAAATWDALQAEQRDAETDQLIAEALNPATETAAAVTAARKVAMNLVHASGTW 663 Query: 261 ASSATAAGNSAKAAKTSETNARSSETAAGQSASAAAGSKTAAASSASAASTSAGQASASA 320 A A A+ E AAGQ + A + +TS A A Sbjct: 664 TRQAAQAALGGSDAQVMEFVRTGIAEAAGQD------DRIVVGELAISENTSLRDA---A 714 Query: 321 TAAGKSAESAASSASTATTKAGEATEQASAAARSASAAKTSETNAKASETSAESSKTAAA 380 AA + +++ S G + + +AAK S A + S Sbjct: 715 LAALEGSDAEVSQFLATQDYPGRYIQDRLKVNQIMAAAKDSGDTHLAQKAQEALSNGDGQ 774 Query: 381 SSASSAASSASSASASKDEATRQASAAKSSATTASTKATEAAGSATAAAQSKSTAES--- 437 + AS +A+A A + + A + A + K E Sbjct: 775 VLRTFIASGQHTAAAISQRIQVNQILASAESGPEVKAAAQIALTGPPPGLHKFLTEGRYA 834 Query: 438 AATRAETAAKRAEDIASAVAL--EDASTTKK 466 AA R ++AA +AS V E A T + Sbjct: 835 AAERDQSAAAHLAVVASLVERINEVAQTATQ 865 >UniRef50_B4MT61 GK20099 n=1 Tax=Drosophila willistoni RepID=B4MT61_DROWI Length = 713 Score = 45.7 bits (106), Expect = 0.010, Method: Composition-based stats. 
Identities = 60/419 (14%), Positives = 133/419 (31%), Gaps = 18/419 (4%) Query: 88 DFLGAMTEDDARPEALRRFELMVEEVARNASAVAQNTAAA-----------KKSASDAST 136 D L ++ D E E+ E + N +A +N++ A +++ D Sbjct: 153 DSLDSVVNGDLSLEVNIDIEMENENLFNNGNAGEENSSGAGAGRTAVMSPCRRTLPDEGE 212 Query: 137 SAREAATHAADAADSARAASTSAGQAASSAQSASSSAGTASTKATEASKSAAAAESSKSA 196 + D R + +A +A A +A+++ +K + + Sbjct: 213 LPKAKKAKLDVNDDEERGSDATAAASAGDASAATAAGNDVKSKNGNDNNDDEVDARPRQE 272 Query: 197 AATSAGAAKTSE--TNASASLQSAATSASTATTKASEAATSARDAAASKEAAKSSETNAS 254 A + E + + A ++ T+ DA + E K S Sbjct: 273 LPEIAEPDEEVEGGDELKLAGKQLKEHVDKEEIPAPDSPTAEEDAPKADEDVKQSNDEDD 332 Query: 255 SSASSAASSATAAGNSAKAAKTSETNARSSETAAGQSASAAAGSKTAAASSASAASTSAG 314 + ++ A + + E +A ++ ++ + +A Sbjct: 333 QEKKTKSNDTEAKSPPDQEKIIEKQEIEEDEKSAKPEGEQDQQDESKNKNTKLVVAAAAA 392 Query: 315 QASASATAAGKSAESAASSASTATTKAGEATEQASAAARSASAAKTSETNAKASETSAES 374 A+A A ++ + + + +Q +A AS AK + +A + Sbjct: 393 AAAAKEDAPSAEQVQKETTKTKTEEEIIKTADQEKKSAVDASEAKPVGEEKPVAAAAAVA 452 Query: 375 SKTAAASSASSAASSASSASASKDEATRQASAAKSSATTASTKATEAAGSATAAAQSKST 434 + + ++A S+A+S +A+ + K + + Sbjct: 453 ATRDKPNEEATAKSAAASDAATAATNIESKKNITEKTESPQKKVVDNDEKVGEKPGEEEK 512 Query: 435 AESAATRAETAAKRAEDIA--SAVALEDASTTKKGIVQLSSATNS---TSETLAATPKA 488 + A A E+ A E + K + +T++ +E LAATP++ Sbjct: 513 TKEEAATDAATAAVTEESAKKQKEDDEQKTAIKAASDDGAKSTDTEDKPTEVLAATPQS 571 >UniRef50_UPI0000F2B26E PREDICTED: similar to PKA phosphorylated calcium and CABYR-binding protein n=1 Tax=Monodelphis domestica RepID=UPI0000F2B26E Length = 819 Score = 45.7 bits (106), Expect = 0.012, Method: Composition-based stats. 
Identities = 46/242 (19%), Positives = 86/242 (35%), Gaps = 8/242 (3%) Query: 188 AAAESSKSAAATSAGAAKTSETNASASLQSAATSASTATTKASEAATSARDAAASKE--- 244 ++ + G+ K S S++S AT ++ + A + A + + + ASK Sbjct: 183 TGMSEAEFLVLSKPGSKKDSVCQPEDSVESKATPRTSRESSAGKTAPAPKSSTASKTFSP 242 Query: 245 --AAKSSETNASSSASSAASSATAAGNSAKAAKTSETNARSSETAAGQSASAAAGSKTAA 302 +AK S+ A++ SATA+ + +A+ S AA S+ S + Sbjct: 243 PGSAKVFSPQGSTEAATPPKSATASKTFSPPGSAKIISAQGSTEAAR--ISSTQASAKGS 300 Query: 303 ASSASAASTSAGQASASATAAGKSAESAASSASTATTKAGEATEQASAAARSASAAKTSE 362 + + S S +S + A S E+A S+S A+ +A +A+ S SA + + Sbjct: 301 KTVSLQKSVSRAFSSPGSVKAQGSTEAAKISSSQASHRASKASAI-EPPEESGSAIRKTS 359 Query: 363 TNAKASETSAESSKTAAASSASSAASSASSASASKDEATRQASAAKSSATTASTKATEAA 422 S + + + + A +A K E Sbjct: 360 IVKLESFDKKAPIDLESPALETELPPEEEAPLEDVKVEASPLEEAFLEVESAPEKEAEVV 419 Query: 423 GS 424 Sbjct: 420 EP 421 >UniRef50_D0DXJ3 Putative uncharacterized protein (Fragment) n=1 Tax=Lactobacillus jensenii 115-3-CHN RepID=D0DXJ3_9LACO Length = 502 Score = 45.3 bits (105), Expect = 0.014, Method: Composition-based stats. 
Identities = 81/393 (20%), Positives = 146/393 (37%), Gaps = 12/393 (3%) Query: 117 ASAVAQNTAAAKKSASDASTSAREAATHAADAADSARAASTSAGQAASSAQSASSSAGTA 176 A N +A +S D++T+A +A A D + + A +A S Q S+ A Sbjct: 51 AGQTVLNNDSATQSEVDSATTAINSAKSALDGETTDKNALETAVNDQSDVQKTSAYY-NA 109 Query: 177 STKATEASKSAAAAESSKSAAATSAGAAKTSETNASASLQSAATSASTATTKASEAATSA 236 S +A A +A + ++ + S T+A + +SA +T T A E A + Sbjct: 110 SDDKKQAYDDAVSAGQTVLNDDSATQSEVDSATSAIDNAKSALDGQATDKT-ALETAVND 168 Query: 237 RDAAASKEAAKSSETNASSSASSAASSATAAGNSAKAAKTSETNARSSETAAGQSASAAA 296 ++ A ++ + + A ++ N+ A ++ +A S+ A + A Sbjct: 169 QNDVQKTSAYYNASDDKKQAYDDAVAAGQTVLNNDSATQSEVDSATSAINNAKSALDGQA 228 Query: 297 GSKTA-----AASSASAASTSAGQASASATAAGKSAESAASSASTATTKAGEATEQASAA 351 KTA S ++ + AS A A SA + + A+ Sbjct: 229 TDKTALETAVNEQSTVESTPAYYNASDDKKQAYDDAVSAGQKVLNNDSATQSEADSATTT 288 Query: 352 ARSASAAKTSETNAKASETSA--ESSKTAAASSASSAASSASSASASKDEATRQASAAKS 409 SA AA ET K + +A + S S+ +A+ A A + S Sbjct: 289 INSAKAALDGETTDKRALETAVNDQSNVQKTSAYYNASDEKKQAYDDAVAAGQTVLNNDS 348 Query: 410 SATTASTKATEAAGSATAAAQSKSTAESAATRAETAAKRAEDIASAVALED--ASTTKKG 467 + + AT A +A +A ++T +SA A + ++ D Sbjct: 349 ATQSEVDSATTAINNAKSALDGETTDKSALETAVNDQSDVQKTSAYYNASDDKKQAYDDA 408 Query: 468 IVQLSSATNSTSETLAATPKAVKSAYDNAEKRL 500 + + N+ S T + + +A DNA+ L Sbjct: 409 VSAGQTVLNNDSATQS-EVDSATTAIDNAKAAL 440 >UniRef50_Q5YPQ7 Putative transporter n=2 Tax=Actinomycetales RepID=Q5YPQ7_NOCFA Length = 1367 Score = 45.3 bits (105), Expect = 0.014, Method: Composition-based stats. 
Identities = 66/400 (16%), Positives = 135/400 (33%), Gaps = 14/400 (3%) Query: 108 LMVEEVARNASAVAQNTAAAKKSASDASTSAREAATHAADAADSARAASTSAGQAASSAQ 167 A + + +A+ ++++ + S +A + + Q S Sbjct: 656 RKSASPGEPAGRSSASASASASASAERTRSGGPRNGNAGQPNRATGTDTEHTRQTTRSTG 715 Query: 168 SASSSAGTASTKATEASKSAAAAESSKSAAATSAGAAKTSETNASASLQSAATSASTATT 227 SA T + T A + AAES+ + ++ + A +S T+ + T Sbjct: 716 RTRESAQT-TGSRTTAGDATRAAESATANKPSTPTRSTNGTRTARSSSNPTRTTPAGDPT 774 Query: 228 KASEAATSARDAAASKEAAKSSETNASSSASSAASSATAAGNSAKAAKTSETNARSSETA 287 A A +AR + + AA+ A ++ +A A + + + + A + T Sbjct: 775 AAD--AGAARSRSGTAPAARPDTRTAPTTTPAARPGDGTATTTTRTTRPGDGTATTPTTR 832 Query: 288 AGQSASAAAGSKTAAASSASAASTSAGQASASATAAGKSAESAASSASTATTKAGEATEQ 347 G + ++T + SA +A T +G A A A T A Sbjct: 833 PGDGTATTPTTRTDDGPAVEPGEPSAATTAARTTRSGDGA--APRPTRQADTAAD----- 885 Query: 348 ASAAARSASAAKTSETNAKASETSAESSKTAAASSASSAASSASSASASKDEATRQASAA 407 A +A+ T + + ++ T + +A S + + S A + A Sbjct: 886 ARRPGDTATTGGAPTTRQTNATDAQRTAGTGSGHAADSTPTGSGSTPAESAGTGPAGATA 945 Query: 408 KSSATTASTKATEAAGSAT--AAAQSKSTAESAATRAETAAKRAEDIASAVALEDASTTK 465 +S +T+ +T A A ++ T + ++A T Sbjct: 946 ESDSTSDGNGGVGQRSRSTGGNGATGGRHARTSGTATDRPRATVGGPSAADRKSARMTGD 1005 Query: 466 KGI--VQLSSATNSTSETLAATPKAVKSAYDNAEKRLQKD 503 +G Q ++A + + P A ++ + R + D Sbjct: 1006 RGAREPQKTAAPENRGNSGRFAPSAARAESATEQPRAETD 1045 Score = 44.5 bits (103), Expect = 0.021, Method: Composition-based stats. 
Identities = 46/165 (27%), Positives = 78/165 (47%), Gaps = 3/165 (1%) Query: 130 SASDASTSAREAATHAADAADSARAASTSAGQAASSAQSASSSAGTASTKATEASKSAAA 189 + +T+A T + R +ST+A +AA+SA+ ++AGT + AT A +A A Sbjct: 1202 ESPAVATTAPTTDTKRTRTRRTTRGSSTAAAKAAASAEPVPATAGTPADSATPAETTAPA 1261 Query: 190 AESSKSAAA---TSAGAAKTSETNASASLQSAATSASTATTKASEAATSARDAAASKEAA 246 + + A S+ + K S + A ++ +A + AT A + +A+ AA Sbjct: 1262 PQPAPEQPAAPAKSSASTKPSPSTAKSTPTKSAPTKPAATKSAPTKSAAAKSTTPKATAA 1321 Query: 247 KSSETNASSSASSAASSATAAGNSAKAAKTSETNARSSETAAGQS 291 KS+ ++ S+ A SAT A +S T T ARS +T + Sbjct: 1322 KSTARKRPAAKSTGAESATGAKSSTAQKSTRSTRARSDDTGSAPE 1366 >UniRef50_UPI0001AF0361 Phage-related protein, tail component n=1 Tax=Acinetobacter baumannii AB900 RepID=UPI0001AF0361 Length = 3727 Score = 44.9 bits (104), Expect = 0.018, Method: Composition-based stats. Identities = 82/370 (22%), Positives = 151/370 (40%), Gaps = 16/370 (4%) Query: 100 PEALRRFELMVEEVARNASAVAQNTAAAKKSASDASTSAREAATHAADAADSARAASTSA 159 P+ + + A+ + + + A A A T A +A +A A T+A Sbjct: 852 PDQVLDLISGQIGESDLATELQGKIENSVTVSEAAKIVADNAQTAATNAQTAATDAKTAA 911 Query: 160 GQAASSAQSASSSAGTASTKATEASKSAAAAESSKSAAATSAGAAKT----------SET 209 +A +A SA + A +A A+EAS +AA A+++ A T+A AKT + + Sbjct: 912 SEAQKAASSAQTQASSAQQIASEASATAANAKNAADQAVTAANQAKTAADNAATTAATAS 971 Query: 210 NASASLQSAATSASTATTKASEAATSARDAAASKEAAKS-SETNASSSASSAASSATAAG 268 + ++ Q+ A A+ A +K + T++ + K A ++ + T A S + + T+ Sbjct: 972 STASKAQTTANDAAAAASKVASDLTTSTNQLNKKIADETDARTVAISKLNDGLTIETSQR 1031 Query: 269 NSAKAAKTSETNARSSETAAGQSASAAAGSKTAAASSASAASTSAGQASASATAAGKSAE 328 + AA S S T S+ + A +SA+ + S+ + + + Sbjct: 1032 KTEDAALLSNIETYKSSTNGTLSSLQTQINTNATNTSANTSKISSLDSRLTTNEGKTADA 1091 Query: 329 SAASSASTATTKAGEATEQASAAARSASAAKTSETNAKASETSAESSKTAAASSASSAAS 388 A++ + T A ++A+A A S +A K+ ++ K A AS Sbjct: 1092 INAAATAQQTA--SSAVDKANAVANSVTALKSELSSGKGINNIVAPFSDPQELPALGGAS 1149 Query: 389 SASSASASKDEATRQASAAKSSATTASTKATEAAGSATAAAQSKSTAESAATRAETAAKR 448 A D A R+ A + TA+ + A A S+ + A RA T + Sbjct: 1150 
RTV---ALVDSALRRNGKAYKVSFTAAAHYVYFGTAQAALAPSQMAMQVEAGRAYTFSVW 1206 Query: 449 AEDIASAVAL 458 + +++AV Sbjct: 1207 LKALSTAVPS 1216 >UniRef50_A9WSY5 Putative uncharacterized protein n=1 Tax=Renibacterium salmoninarum ATCC 33209 RepID=A9WSY5_RENSM Length = 697 Score = 44.9 bits (104), Expect = 0.019, Method: Composition-based stats. Identities = 36/119 (30%), Positives = 54/119 (45%) Query: 133 DASTSAREAATHAADAADSARAASTSAGQAASSAQSASSSAGTASTKATEASKSAAAAES 192 DA EA A A A A+ + QA A+ AS+SA A+ A++A A A Sbjct: 563 DAIRFKDEARRSADAAKVDADQAAEYSRQAQKPAEDASASAKLANDAASQARSDAREAAG 622 Query: 193 SKSAAATSAGAAKTSETNASASLQSAATSASTATTKASEAATSARDAAASKEAAKSSET 251 + A SA +A+ S AS + + A SA A + AA ++ +A A+ + E Sbjct: 623 LATQAEVSAASARRSAAIASEAAKQAEASAVQAGKDSDAAAAASTEAWAAVAQKRQFED 681 Searching..................................................done Results from round 3 Score E Sequences producing significant alignments: (bits) Value Sequences used in model and found again: UniRef50_P76072 Side tail fiber protein homolog from lambdoid pr... 718 0.0 UniRef50_B7L485 Putative tail fiber protein n=4 Tax=Escherichia ... 413 e-113 UniRef50_B5FQX9 Side tail fiber protein n=2 Tax=Salmonella enter... 328 8e-88 UniRef50_B7LN99 Putative tail fiber protein n=2 Tax=Enterobacter... 309 4e-82 UniRef50_B3I2W7 Phage Tail Collar Domain family n=1 Tax=Escheric... 271 9e-71 UniRef50_B3XIH6 GpH n=7 Tax=Enterobacteriaceae RepID=B3XIH6_ECOLX 243 2e-62 UniRef50_Q3ZL14 Tail fiber protein n=2 Tax=Enterobacteriaceae Re... 235 8e-60 UniRef50_B5PP06 Side tail fiber protein n=2 Tax=Salmonella enter... 233 3e-59 UniRef50_B4T041 Gp19 n=1 Tax=Salmonella enterica subsp. enterica... 230 3e-58 UniRef50_C8U9W7 Probable tail fiber protein-like protein n=1 Tax... 228 1e-57 UniRef50_Q99362 Protein 37 (Fragment) n=1 Tax=Enterobacteria pha... 223 3e-56 UniRef50_C4TT85 Gp19 n=1 Tax=Yersinia kristensenii ATCC 33638 Re... 218 1e-54 UniRef50_UPI0001826514 putative tail fiber protein (GpH) n=2 Tax... 
216 3e-54 UniRef50_Q3YZL1 Phage protein-related n=42 Tax=Enterobacteriacea... 208 8e-52 UniRef50_C6DA10 Tail Collar domain protein n=7 Tax=Enterobacteri... 205 9e-51 UniRef50_B7LKX7 Putative side tail phage protein n=2 Tax=Escheri... 204 1e-50 UniRef50_Q6KGF6 Putative tail fiber protein GP37 n=2 Tax=unclass... 204 1e-50 UniRef50_A9DEM1 Tail protein 2 n=1 Tax=Yersinia phage PY100 RepI... 202 7e-50 UniRef50_B3X2T1 Side tail fiber protein n=1 Tax=Shigella dysente... 199 3e-49 UniRef50_B4TP26 Side tail fiber protein n=43 Tax=Salmonella ente... 199 7e-49 UniRef50_C7BL21 Similarities with phage tail fiber protein n=1 T... 196 3e-48 UniRef50_A4TT73 Phage tail protein n=30 Tax=Enterobacteriaceae R... 196 5e-48 UniRef50_C6DE08 Tail Collar domain protein n=1 Tax=Pectobacteriu... 188 1e-45 UniRef50_B6IAV4 Putative phage tail fiber protein n=1 Tax=Escher... 186 5e-45 UniRef50_A8A0A4 L-shaped tail fiber protein n=12 Tax=root RepID=... 184 1e-44 UniRef50_A3GRX3 Tail fiber like-protein n=1 Tax=Vibrio cholerae ... 180 3e-43 UniRef50_A4WEL3 Phage Tail Collar domain protein n=2 Tax=Enterob... 179 4e-43 UniRef50_C6UHV3 Predicted tail fiber protein n=22 Tax=Escherichi... 176 5e-42 UniRef50_D2TJ16 Putative phage tail fibre protein n=1 Tax=Citrob... 174 2e-41 UniRef50_B2PZV1 Putative uncharacterized protein n=2 Tax=Gammapr... 173 4e-41 UniRef50_C2DS71 Tail fiber protein n=10 Tax=Escherichia RepID=C2... 172 7e-41 UniRef50_D2BT14 Tail Collar domain protein n=1 Tax=Dickeya dadan... 172 7e-41 UniRef50_D0IJ09 Tail fiber protein H putative n=1 Tax=Vibrio sp.... 171 1e-40 UniRef50_P03764 Side tail fiber protein n=2 Tax=root RepID=STF_L... 168 2e-39 UniRef50_Q66BF2 Hypothetical phage protein n=1 Tax=Yersinia pseu... 163 3e-38 UniRef50_Q858V4 GpH n=9 Tax=root RepID=Q858V4_9CAUD 163 3e-38 UniRef50_C4U3E2 Tail fiber protein (Fragment) n=1 Tax=Yersinia k... 158 9e-37 UniRef50_A6VBH2 Putative tail fiber protein n=2 Tax=Pseudomonas ... 
158 1e-36 UniRef50_B6S308 ORF32 (Fragment) n=3 Tax=Enterobacteriaceae RepI... 155 8e-36 UniRef50_Q19CF5 Gp36 small distal tail fiber subunit n=1 Tax=Aer... 153 3e-35 UniRef50_B5YU13 Tail fiber protein n=140 Tax=root RepID=B5YU13_E... 151 1e-34 UniRef50_C5H7L2 Putative tail fiber protein GP37 n=3 Tax=unclass... 150 3e-34 UniRef50_C5BA56 Putative uncharacterized protein n=1 Tax=Edwards... 149 5e-34 UniRef50_A9Q1X5 Putative tail fiber protein n=1 Tax=Enterobacter... 146 3e-33 UniRef50_B2SVF7 Phage-related protein n=3 Tax=Xanthomonas oryzae... 143 4e-32 UniRef50_Q7N348 Similarities with tail fiber protein n=1 Tax=Pho... 143 5e-32 UniRef50_C4TTI1 Tail fiber protein n=2 Tax=Yersinia RepID=C4TTI1... 142 8e-32 UniRef50_Q7N2Q1 Similar to Sc/SvQ protein of Escherichia coli pl... 141 1e-31 UniRef50_Q7Y3Z0 Tail fiber protein n=1 Tax=Yersinia phage PY54 R... 141 2e-31 UniRef50_C6CGA0 Tail Collar domain protein n=5 Tax=Enterobacteri... 140 4e-31 UniRef50_D0ZBI4 Putative tail fiber protein n=1 Tax=Edwardsiella... 136 5e-30 UniRef50_Q7N0S6 Similarities with prophage tail fiber protein n=... 134 2e-29 UniRef50_D0FSD9 Phage related-protein n=2 Tax=Erwinia pyrifoliae... 128 1e-27 UniRef50_D1TPQ4 Phage tail collar domain protein n=1 Tax=Yersini... 127 2e-27 UniRef50_D0Z3B3 Putative tail fiber protein n=1 Tax=Photobacteri... 126 5e-27 UniRef50_C5H7L3 Putative tail fiber protein n=1 Tax=Enterobacter... 126 6e-27 UniRef50_B7MW07 Putative tail fiber protein from prophage n=4 Ta... 126 7e-27 UniRef50_A4N1T0 Immunoglobin A1 protease n=1 Tax=Haemophilus inf... 125 8e-27 UniRef50_C5BH14 Tail fiber n=2 Tax=Enterobacteriaceae RepID=C5BH... 124 1e-26 UniRef50_B5YYN6 Tail fiber protein n=60 Tax=root RepID=B5YYN6_ECO5E 122 1e-25 UniRef50_B3YHG3 Tail fiber protein n=2 Tax=Salmonella enterica s... 120 4e-25 UniRef50_D2TR91 Putative phage tail fibre protein n=1 Tax=Citrob... 119 5e-25 UniRef50_Q9LA62 ORF-401-like protein n=1 Tax=Enterobacterial pha... 
117 3e-24 UniRef50_D2U1K0 Phage tail protein n=1 Tax=Arsenophonus nasoniae... 117 3e-24 UniRef50_Q32D03 Putative uncharacterized protein n=2 Tax=root Re... 117 4e-24 UniRef50_D2TSH8 Phage tail fibre protein n=7 Tax=root RepID=D2TS... 117 4e-24 UniRef50_Q38190 Gp37, tip of tail fiber (Fragment) n=5 Tax=Enter... 112 6e-23 UniRef50_C6CP88 Tail Collar domain protein n=5 Tax=Enterobacteri... 112 1e-22 UniRef50_B7MWN9 Putative tail fiber protein (GpH) n=2 Tax=Escher... 112 1e-22 UniRef50_C9XHA4 Phage variable tail-fibre protein n=1 Tax=Salmon... 102 1e-19 UniRef50_D1RZD4 Putative uncharacterized protein n=1 Tax=Serrati... 101 1e-19 UniRef50_C6C5D4 Tail Collar domain protein n=1 Tax=Dickeya dadan... 93 6e-17 UniRef50_A7ZN97 Tail fiber family protein n=2 Tax=Escherichia co... 93 7e-17 UniRef50_C4KLT7 Hep_Hag family protein n=45 Tax=Proteobacteria R... 87 5e-15 UniRef50_C5PDM6 Putative uncharacterized protein n=2 Tax=Coccidi... 85 1e-14 UniRef50_C7JGL3 Chromosome segregation protein SMC n=9 Tax=Alpha... 82 1e-13 UniRef50_Q2UB42 Predicted protein n=2 Tax=Aspergillus RepID=Q2UB... 82 2e-13 UniRef50_C8VCU8 Putative uncharacterized protein n=2 Tax=Emerice... 82 2e-13 UniRef50_Q7Q4S4 AGAP000893-PA n=1 Tax=Anopheles gambiae RepID=Q7... 81 2e-13 UniRef50_Q2HAR4 Putative uncharacterized protein n=1 Tax=Chaetom... 76 9e-12 UniRef50_B7NJP1 Putative side tail fiber protein homolog from la... 76 1e-11 UniRef50_Q9WXA5 Tail fiber n=2 Tax=Pectobacterium carotovorum Re... 76 1e-11 UniRef50_Q7SBR0 Putative uncharacterized protein n=1 Tax=Neurosp... 68 2e-09 UniRef50_C9KJG4 Side tail fiber protein from lambdoid prophage R... 67 5e-09 UniRef50_B6Q6N2 PT repeat family protein n=1 Tax=Penicillium mar... 63 5e-08 UniRef50_UPI00015B5167 PREDICTED: similar to ENSANGP00000017739 ... 56 9e-06 Sequences not found previously or not previously below threshold: UniRef50_P45386 Immunoglobulin A1 protease translocator n=45 Tax... 
132 7e-29 UniRef50_C7BSQ1 Side tail fiber protein homolog from lambdoid pr... 131 2e-28 UniRef50_C5ABB4 Putative uncharacterized protein n=1 Tax=Burkhol... 126 5e-27 UniRef50_Q7N2R3 Similar to tail fiber assembly protein from bact... 124 3e-26 UniRef50_A7FIU0 Tail collar domain protein n=1 Tax=Yersinia pseu... 123 4e-26 UniRef50_UPI00016A4B89 phage-related tail fiber protein n=2 Tax=... 118 1e-24 UniRef50_Q9AHZ5 YdaB n=29 Tax=Enterobacteriaceae RepID=Q9AHZ5_PHOLU 118 2e-24 UniRef50_A9R3H4 Tail collar domain protein n=20 Tax=Yersinia Rep... 114 2e-23 UniRef50_B4EF34 Putative phage tail protein n=1 Tax=Burkholderia... 113 3e-23 UniRef50_B1LMZ0 Putative phage tail fiber protein n=4 Tax=Entero... 109 5e-22 UniRef50_D2U2G6 Phage tail assembly protein n=1 Tax=Arsenophonus... 108 1e-21 UniRef50_C4S5W0 Putative uncharacterized protein n=2 Tax=Yersini... 108 1e-21 UniRef50_C4UEH4 Variable tail fiber protein n=3 Tax=Yersinia Rep... 107 4e-21 UniRef50_C7BSQ6 Putative tail fiber protein n=1 Tax=Photorhabdus... 105 1e-20 UniRef50_C6V0Q3 Phage tail fiber protein n=13 Tax=Enterobacteria... 105 1e-20 UniRef50_Q7NAA0 Complete genome; segment 1/17 n=2 Tax=Photorhabd... 104 2e-20 UniRef50_D2BYH6 Tail Collar domain protein n=1 Tax=Dickeya dadan... 104 2e-20 UniRef50_Q2T5M0 Phage-related tail fiber protein n=19 Tax=root R... 104 3e-20 UniRef50_P33227 Putative protein stfE (Fragment) n=60 Tax=Entero... 102 6e-20 UniRef50_Q7N5C0 Similarities with DNA inversion product n=5 Tax=... 102 6e-20 UniRef50_C6CG98 Tail Collar domain protein n=1 Tax=Dickeya zeae ... 102 7e-20 UniRef50_Q06852 Cell surface glycoprotein 1 n=8 Tax=cellular org... 100 3e-19 UniRef50_C6C6Z0 Tail Collar domain protein n=1 Tax=Dickeya dadan... 99 6e-19 UniRef50_B5Q8N4 Tail protein n=6 Tax=root RepID=B5Q8N4_SALVI 99 1e-18 UniRef50_B2ZY49 Phage tail collar domain protein n=1 Tax=Ralston... 99 1e-18 UniRef50_C6CGA4 Tail Collar domain protein n=7 Tax=Enterobacteri... 
99 1e-18 UniRef50_UPI000190EC42 bacteriophage tail fiber protein n=1 Tax=... 96 7e-18 UniRef50_Q6D3Y6 Probable bacteriophage variable tail fiber prote... 96 9e-18 UniRef50_Q7N047 Similarities with bacteriophage protein n=3 Tax=... 96 1e-17 UniRef50_P18771 Large tail fiber protein p34 n=5 Tax=T4-like vir... 96 1e-17 UniRef50_C5J9F2 Putative uncharacterized protein (Fragment) n=1 ... 95 2e-17 UniRef50_Q9MCR6 Tail fiber n=1 Tax=Enterobacteria phage HK97 Rep... 94 3e-17 UniRef50_Q0BEK5 Phage Tail Collar domain protein n=1 Tax=Burkhol... 94 4e-17 UniRef50_UPI0001A44C27 bacteriophage tail fiber protein n=2 Tax=... 93 5e-17 UniRef50_C5A8Q3 Phage-related tail fiber protein n=1 Tax=Burkhol... 93 7e-17 UniRef50_B2I5N0 Tail Collar domain protein n=13 Tax=Xylella fast... 92 1e-16 UniRef50_Q7N541 Similar to DNA inversion product and tail fiber ... 92 1e-16 UniRef50_B3I9S3 DNA inversion product n=5 Tax=Escherichia coli R... 91 2e-16 UniRef50_B7MJL6 Putative phage tail protein n=3 Tax=Escherichia ... 91 2e-16 UniRef50_C6CP84 Tail Collar domain protein n=2 Tax=Dickeya RepID... 91 3e-16 UniRef50_B7US81 Predicted tail fiber protein n=16 Tax=root RepID... 90 4e-16 UniRef50_Q7P172 Probable bacteriophge tail fiber protein n=1 Tax... 88 2e-15 UniRef50_D0KLI8 Tail Collar domain protein n=1 Tax=Pectobacteriu... 86 8e-15 UniRef50_D0KGE3 Tail Collar domain protein n=1 Tax=Pectobacteriu... 85 2e-14 UniRef50_B3I8J5 DNA inversion product n=3 Tax=Enterobacteriaceae... 84 4e-14 UniRef50_C3KCU2 Putative phage tail fiber-related protein n=1 Ta... 83 5e-14 UniRef50_Q0C8E2 Predicted protein n=1 Tax=Aspergillus terreus NI... 83 5e-14 UniRef50_Q92954 Proteoglycan 4 C-terminal part n=17 Tax=Eutheria... 83 1e-13 UniRef50_Q7N6T1 Similarities with DNA inversion product n=2 Tax=... 83 1e-13 UniRef50_B5JYG6 Phage Tail Collar Domain family (Fragment) n=1 T... 82 1e-13 UniRef50_C6C5D2 Tail Collar domain protein n=2 Tax=Dickeya dadan... 
82 1e-13 UniRef50_B6VKW8 Sc/svq protein n=2 Tax=Photorhabdus asymbiotica ... 82 2e-13 UniRef50_UPI0001B5347E putative variable tail fiber protein n=1 ... 81 3e-13 UniRef50_Q93D60 PilV n=7 Tax=Escherichia coli RepID=Q93D60_ECOLX 81 4e-13 UniRef50_Q65WH4 Putative uncharacterized protein n=1 Tax=Mannhei... 80 6e-13 UniRef50_D2MH12 Tail Collar domain protein n=1 Tax=Rhodopseudomo... 79 7e-13 UniRef50_Q37842 Tail fiber protein H n=1 Tax=Enterobacteria phag... 78 2e-12 UniRef50_Q6D2U8 Bacteriophage tail fiber protein n=1 Tax=Pectoba... 78 2e-12 UniRef50_D0KGE1 Shufflon domain protein n=1 Tax=Pectobacterium w... 77 3e-12 UniRef50_B9BDD9 Bacteriophage protein n=3 Tax=Burkholderia RepID... 77 3e-12 UniRef50_D0KGE5 Tail Collar domain protein n=4 Tax=Pectobacteriu... 77 5e-12 UniRef50_UPI000180CCCC PREDICTED: similar to zymogen granule mem... 76 6e-12 UniRef50_A1Z8Q2 CG13185, isoform B n=4 Tax=Drosophila RepID=A1Z8... 76 7e-12 UniRef50_Q179R7 Putative uncharacterized protein n=1 Tax=Aedes a... 76 9e-12 UniRef50_C8PDQ5 Phage Tail Collar Domain protein n=1 Tax=Campylo... 76 1e-11 UniRef50_A8DYB0 CG13185, isoform C n=9 Tax=Drosophila RepID=A8DY... 74 3e-11 UniRef50_Q5Z1P8 Putative uncharacterized protein n=1 Tax=Nocardi... 73 5e-11 UniRef50_B3R3K1 Bacteriophage large tail fiber protein n=1 Tax=C... 72 1e-10 UniRef50_Q1XI26 PvLEA1 protein n=1 Tax=Polypedilum vanderplanki ... 71 2e-10 UniRef50_A1VSH6 Phage Tail Collar domain protein n=1 Tax=Polarom... 71 3e-10 UniRef50_Q29NV9 GA17619 n=5 Tax=Eukaryota RepID=Q29NV9_DROPS 71 3e-10 UniRef50_A7T1V7 Predicted protein n=4 Tax=cellular organisms Rep... 69 7e-10 UniRef50_Q4ABH1 Muscle-specific protein 300, isoform D n=33 Tax=... 69 1e-09 UniRef50_Q4WP03 Chromatin modification-related protein vid21 n=1... 68 2e-09 UniRef50_Q0A8Q8 Ribonuclease E n=8 Tax=Gammaproteobacteria RepID... 68 2e-09 UniRef50_UPI000186F3DC Titin, putative n=1 Tax=Pediculus humanus... 67 3e-09 UniRef50_B4PV00 GE23539 n=2 Tax=melanogaster subgroup RepID=B4PV... 
67 5e-09 UniRef50_B2VW07 Predicted protein n=1 Tax=Pyrenophora tritici-re... 66 8e-09 UniRef50_A0YKP6 Hemolysin-type calcium-binding toxin n=1 Tax=Lyn... 66 9e-09 UniRef50_D2HNP5 Putative uncharacterized protein (Fragment) n=1 ... 65 1e-08 UniRef50_B8QTW7 Putative tail fiber protein n=1 Tax=Erwinia phag... 65 2e-08 UniRef50_A0NQ95 Putative tail fiber-related protein n=1 Tax=Labr... 64 3e-08 UniRef50_C6BT48 Tail Collar domain protein n=1 Tax=Desulfovibrio... 64 4e-08 UniRef50_Q1I687 Putative phage variable tail fibre protein n=1 T... 64 4e-08 UniRef50_C1F9K0 Putative uncharacterized protein n=1 Tax=Acidoba... 64 5e-08 UniRef50_B8LYA4 Uro-adherence factor A, putative n=1 Tax=Talarom... 63 5e-08 UniRef50_Q9XW25 Protein Y18D10A.1, partially confirmed by transc... 63 6e-08 UniRef50_B1M1N8 Tail Collar domain protein n=1 Tax=Methylobacter... 62 1e-07 UniRef50_D0WHF5 Putative uncharacterized protein n=1 Tax=Slackia... 62 1e-07 UniRef50_B8ZQ26 Choline-binding surface protein A n=10 Tax=Strep... 62 1e-07 UniRef50_UPI000023F160 hypothetical protein FG00031.1 n=1 Tax=Gi... 61 3e-07 UniRef50_B7CDC6 Putative uncharacterized protein n=1 Tax=Eubacte... 61 4e-07 UniRef50_B4VM02 Putative uncharacterized protein n=3 Tax=Microco... 60 4e-07 UniRef50_A5K4N9 Dynein heavy chain, putative n=2 Tax=Plasmodium ... 60 7e-07 UniRef50_C9S6D8 Putative uncharacterized protein n=1 Tax=Vertici... 59 1e-06 UniRef50_B4VJR6 Putative uncharacterized protein n=1 Tax=Microco... 58 2e-06 UniRef50_Q9NU22 Midasin n=31 Tax=Coelomata RepID=MDN1_HUMAN 58 2e-06 UniRef50_UPI000194DC45 PREDICTED: similar to mucin 16 n=1 Tax=Ta... 58 2e-06 UniRef50_B9PU60 Putative uncharacterized protein n=3 Tax=Toxopla... 58 2e-06 UniRef50_A9RX33 Predicted protein n=1 Tax=Physcomitrella patens ... 58 2e-06 UniRef50_C7YJJ9 Putative uncharacterized protein n=1 Tax=Nectria... 58 2e-06 UniRef50_Q9HR92 Halobacterial transducer protein 6 n=3 Tax=Halob... 58 3e-06 UniRef50_A6SPK9 Putative uncharacterized protein n=1 Tax=Botryot... 
58 3e-06 UniRef50_UPI0000F2DCD9 PREDICTED: similar to MICAL-like 2 n=1 Ta... 57 3e-06 UniRef50_B9CN32 Putative uncharacterized protein n=1 Tax=Atopobi... 57 4e-06 UniRef50_D2R8K8 Putative uncharacterized protein n=1 Tax=Pirellu... 56 7e-06 UniRef50_D2PNZ5 Putative uncharacterized protein n=1 Tax=Kribbel... 56 7e-06 UniRef50_B9LNH9 Putative uncharacterized protein n=1 Tax=Halorub... 56 8e-06 UniRef50_B6XJ97 Putative uncharacterized protein n=2 Tax=Enterob... 56 1e-05 UniRef50_B0WIK0 Putative uncharacterized protein n=1 Tax=Culex q... 56 1e-05 UniRef50_UPI00016C0C40 TPR repeat n=1 Tax=Epulopiscium sp. 'N.t.... 56 1e-05 UniRef50_Q84CW8 Putative transmembrane protein n=1 Tax=unculture... 55 2e-05 UniRef50_C1CP01 Pneumococcal surface protein A n=75 Tax=Streptoc... 54 2e-05 UniRef50_Q6ZZ82 Eukaryotic initiation factor 4G n=2 Tax=Echinace... 54 2e-05 UniRef50_Q58MY1 Predicted protein n=1 Tax=Prochlorococcus phage ... 54 3e-05 UniRef50_A8N2G9 Putative uncharacterized protein n=1 Tax=Coprino... 54 3e-05 UniRef50_UPI00006A011C mucin 16 (MUC16), mRNA n=3 Tax=Xenopus (S... 54 4e-05 UniRef50_UPI00006CB316 hypothetical protein TTHERM_00456860 n=1 ... 53 5e-05 UniRef50_C3ZPV4 Putative uncharacterized protein n=1 Tax=Branchi... 53 7e-05 UniRef50_Q2Y2L9 Major surface glycoprotein G n=13 Tax=Avian meta... 53 8e-05 UniRef50_Q1JCN4 Extracellular matrix binding protein n=6 Tax=Str... 52 1e-04 UniRef50_B4CVU3 Putative uncharacterized protein n=1 Tax=Chthoni... 52 1e-04 UniRef50_UPI00016C0209 S-layer domain protein n=1 Tax=Epulopisci... 52 1e-04 UniRef50_Q4SE75 Chromosome undetermined SCAF14625, whole genome ... 52 1e-04 UniRef50_Q9I7U4 Titin n=39 Tax=Eukaryota RepID=TITIN_DROME 52 1e-04 UniRef50_C1FYY4 Predicted protein n=3 Tax=Paracoccidioides brasi... 52 1e-04 UniRef50_C0FZM7 Putative uncharacterized protein n=1 Tax=Rosebur... 51 2e-04 UniRef50_C5VC26 Secreted cell wall-associated hydrolase n=2 Tax=... 51 2e-04 UniRef50_D2H4Z1 Putative uncharacterized protein n=1 Tax=Ailurop... 
51 2e-04 UniRef50_UPI000023CDE0 hypothetical protein FG09994.1 n=1 Tax=Gi... 51 3e-04 UniRef50_Q04HY6 Choline binding protein A n=135 Tax=Streptococcu... 51 3e-04 UniRef50_A8XJP6 Putative uncharacterized protein n=2 Tax=Caenorh... 51 4e-04 UniRef50_UPI000023F3CB hypothetical protein FG08587.1 n=1 Tax=Gi... 51 4e-04 UniRef50_C4UDV3 Putative uncharacterized protein n=1 Tax=Yersini... 50 4e-04 UniRef50_B6HHF2 Pc20g14690 protein n=1 Tax=Penicillium chrysogen... 50 5e-04 UniRef50_C9SNW3 Putative uncharacterized protein n=1 Tax=Vertici... 50 5e-04 UniRef50_C3Z0S4 Putative uncharacterized protein n=1 Tax=Branchi... 50 5e-04 UniRef50_A7E639 Putative uncharacterized protein n=1 Tax=Sclerot... 50 6e-04 UniRef50_B2INK9 Choline binding protein A n=25 Tax=Streptococcus... 50 6e-04 UniRef50_C6WQE4 YD repeat protein n=1 Tax=Actinosynnema mirum DS... 50 6e-04 UniRef50_D0N8F2 Mucin-like protein n=1 Tax=Phytophthora infestan... 49 8e-04 UniRef50_Q8VQ55 Pneumococcal surface protein A (Fragment) n=21 T... 49 8e-04 UniRef50_A7BAA1 Putative uncharacterized protein n=1 Tax=Actinom... 49 8e-04 UniRef50_Q4P4T5 Putative uncharacterized protein n=1 Tax=Ustilag... 49 0.001 UniRef50_B6K087 GYF domain-containing protein n=1 Tax=Schizosacc... 49 0.001 UniRef50_B6H2C8 Pc13g04790 protein n=1 Tax=Penicillium chrysogen... 49 0.001 UniRef50_A4R522 Putative uncharacterized protein n=2 Tax=Eukaryo... 48 0.002 UniRef50_Q0CYT2 Putative uncharacterized protein n=1 Tax=Aspergi... 48 0.002 UniRef50_UPI00017F7AFB YALI0E22572p n=1 Tax=Yarrowia lipolytica ... 48 0.002 UniRef50_C7QJN3 Hedgehog/intein hint domain protein n=1 Tax=Cate... 48 0.002 UniRef50_C9SUW3 Putative uncharacterized protein n=1 Tax=Vertici... 48 0.003 UniRef50_A3X9B4 Putative uncharacterized protein n=1 Tax=Roseoba... 47 0.003 UniRef50_Q9KW53 Tail fiber n=9 Tax=Enterobacteriaceae RepID=Q9KW... 47 0.003 UniRef50_UPI00015FF553 UPI00015FF553 related cluster n=2 Tax=Dro... 
47 0.004 UniRef50_Q6C7I9 YALI0E00484p n=1 Tax=Yarrowia lipolytica RepID=Q... 47 0.004 UniRef50_B7PES8 Secreted mucin MUC17, putative (Fragment) n=1 Ta... 47 0.004 UniRef50_C2HLK2 Surface protein n=5 Tax=Lactobacillales RepID=C2... 47 0.005 UniRef50_Q6W4X9 Mucin-6 n=10 Tax=Catarrhini RepID=MUC6_HUMAN 46 0.007 UniRef50_C0WA14 Putative uncharacterized protein n=1 Tax=Acidami... 46 0.008 UniRef50_A8I7A5 Histone methyltransferase n=2 Tax=Eukaryota RepI... 46 0.009 UniRef50_D0NLX0 Putative uncharacterized protein n=1 Tax=Phytoph... 46 0.009 UniRef50_B1Z702 Putative uncharacterized protein n=7 Tax=Alphapr... 46 0.010 UniRef50_Q6CGV5 YALI0A15796p n=1 Tax=Yarrowia lipolytica RepID=Q... 46 0.011 UniRef50_C1CZC2 Putative uncharacterized protein n=1 Tax=Deinoco... 46 0.011 >UniRef50_P76072 Side tail fiber protein homolog from lambdoid prophage Rac n=23 Tax=root RepID=STFR_ECOLI Length = 1120 Score = 718 bits (1851), Expect = 0.0, Method: Composition-based stats. Identities = 1120/1120 (100%), Positives = 1120/1120 (100%) Query: 1 MAVKISGVLKDGTGKPVQNCTIQLKAKRNSTTVVVNTLASENPDEAGRYSMDVEYGQYSV 60 MAVKISGVLKDGTGKPVQNCTIQLKAKRNSTTVVVNTLASENPDEAGRYSMDVEYGQYSV Sbjct: 1 MAVKISGVLKDGTGKPVQNCTIQLKAKRNSTTVVVNTLASENPDEAGRYSMDVEYGQYSV 60 Query: 61 ILLVEGFPPSHAGTITVYEDSQPGTLNDFLGAMTEDDARPEALRRFELMVEEVARNASAV 120 ILLVEGFPPSHAGTITVYEDSQPGTLNDFLGAMTEDDARPEALRRFELMVEEVARNASAV Sbjct: 61 ILLVEGFPPSHAGTITVYEDSQPGTLNDFLGAMTEDDARPEALRRFELMVEEVARNASAV 120 Query: 121 AQNTAAAKKSASDASTSAREAATHAADAADSARAASTSAGQAASSAQSASSSAGTASTKA 180 AQNTAAAKKSASDASTSAREAATHAADAADSARAASTSAGQAASSAQSASSSAGTASTKA Sbjct: 121 AQNTAAAKKSASDASTSAREAATHAADAADSARAASTSAGQAASSAQSASSSAGTASTKA 180 Query: 181 TEASKSAAAAESSKSAAATSAGAAKTSETNASASLQSAATSASTATTKASEAATSARDAA 240 TEASKSAAAAESSKSAAATSAGAAKTSETNASASLQSAATSASTATTKASEAATSARDAA Sbjct: 181 TEASKSAAAAESSKSAAATSAGAAKTSETNASASLQSAATSASTATTKASEAATSARDAA 240 Query: 241 ASKEAAKSSETNASSSASSAASSATAAGNSAKAAKTSETNARSSETAAGQSASAAAGSKT 300 
ASKEAAKSSETNASSSASSAASSATAAGNSAKAAKTSETNARSSETAAGQSASAAAGSKT Sbjct: 241 ASKEAAKSSETNASSSASSAASSATAAGNSAKAAKTSETNARSSETAAGQSASAAAGSKT 300 Query: 301 AAASSASAASTSAGQASASATAAGKSAESAASSASTATTKAGEATEQASAAARSASAAKT 360 AAASSASAASTSAGQASASATAAGKSAESAASSASTATTKAGEATEQASAAARSASAAKT Sbjct: 301 AAASSASAASTSAGQASASATAAGKSAESAASSASTATTKAGEATEQASAAARSASAAKT 360 Query: 361 SETNAKASETSAESSKTAAASSASSAASSASSASASKDEATRQASAAKSSATTASTKATE 420 SETNAKASETSAESSKTAAASSASSAASSASSASASKDEATRQASAAKSSATTASTKATE Sbjct: 361 SETNAKASETSAESSKTAAASSASSAASSASSASASKDEATRQASAAKSSATTASTKATE 420 Query: 421 AAGSATAAAQSKSTAESAATRAETAAKRAEDIASAVALEDASTTKKGIVQLSSATNSTSE 480 AAGSATAAAQSKSTAESAATRAETAAKRAEDIASAVALEDASTTKKGIVQLSSATNSTSE Sbjct: 421 AAGSATAAAQSKSTAESAATRAETAAKRAEDIASAVALEDASTTKKGIVQLSSATNSTSE 480 Query: 481 TLAATPKAVKSAYDNAEKRLQKDQNGADIPDKGCFLNNINAVSKTDFADKRGMRYVRVNA 540 TLAATPKAVKSAYDNAEKRLQKDQNGADIPDKGCFLNNINAVSKTDFADKRGMRYVRVNA Sbjct: 481 TLAATPKAVKSAYDNAEKRLQKDQNGADIPDKGCFLNNINAVSKTDFADKRGMRYVRVNA 540 Query: 541 PAGATSGKYYPVVVMRSAGSVSELASRVIITTATRTAGDPMNNCEFNGFVMPGGWTDRGR 600 PAGATSGKYYPVVVMRSAGSVSELASRVIITTATRTAGDPMNNCEFNGFVMPGGWTDRGR Sbjct: 541 PAGATSGKYYPVVVMRSAGSVSELASRVIITTATRTAGDPMNNCEFNGFVMPGGWTDRGR 600 Query: 601 YAYGMFWQYQNNERAIHSIMMSNKGDDLRSVFYVDGAAFPVFAFIEDGLSISAPGADLVV 660 YAYGMFWQYQNNERAIHSIMMSNKGDDLRSVFYVDGAAFPVFAFIEDGLSISAPGADLVV Sbjct: 601 YAYGMFWQYQNNERAIHSIMMSNKGDDLRSVFYVDGAAFPVFAFIEDGLSISAPGADLVV 660 Query: 661 NDTTYKFGATNPATECIAADVILDFKSGRGFYESHSLIVNDNLSCKKLFATDEIVARGGN 720 NDTTYKFGATNPATECIAADVILDFKSGRGFYESHSLIVNDNLSCKKLFATDEIVARGGN Sbjct: 661 NDTTYKFGATNPATECIAADVILDFKSGRGFYESHSLIVNDNLSCKKLFATDEIVARGGN 720 Query: 721 QIRMIGGEYGALWRNDGAKTYLLLTNQGDVYGGWNTLRPFAIDNATGELVIGTKLSASLN 780 QIRMIGGEYGALWRNDGAKTYLLLTNQGDVYGGWNTLRPFAIDNATGELVIGTKLSASLN Sbjct: 721 QIRMIGGEYGALWRNDGAKTYLLLTNQGDVYGGWNTLRPFAIDNATGELVIGTKLSASLN 780 Query: 781 GNALTATKLQTPRRVSGVEFDGSKDITLTAAHVAAFARRATDTYADADGGVPWNAESGAY 840 GNALTATKLQTPRRVSGVEFDGSKDITLTAAHVAAFARRATDTYADADGGVPWNAESGAY Sbjct: 781 
GNALTATKLQTPRRVSGVEFDGSKDITLTAAHVAAFARRATDTYADADGGVPWNAESGAY 840 Query: 841 NVTRSGDSYILVNFYTGVGSCRTLQMKAHYRNGGLFYRSSRDGYGFEEDWAEVYTSKNLP 900 NVTRSGDSYILVNFYTGVGSCRTLQMKAHYRNGGLFYRSSRDGYGFEEDWAEVYTSKNLP Sbjct: 841 NVTRSGDSYILVNFYTGVGSCRTLQMKAHYRNGGLFYRSSRDGYGFEEDWAEVYTSKNLP 900 Query: 901 PESYPVGAPIPWPSDTVPSGYALMQGQAFDKSAYPKLAAAYPSGVIPDMRGWTIKGKPAS 960 PESYPVGAPIPWPSDTVPSGYALMQGQAFDKSAYPKLAAAYPSGVIPDMRGWTIKGKPAS Sbjct: 901 PESYPVGAPIPWPSDTVPSGYALMQGQAFDKSAYPKLAAAYPSGVIPDMRGWTIKGKPAS 960 Query: 961 GRAVLSQEQDGIKSHTHSASASSTDLGTKTTSSFDYGTKSTNNTGAHTHSVSGSTNSAGA 1020 GRAVLSQEQDGIKSHTHSASASSTDLGTKTTSSFDYGTKSTNNTGAHTHSVSGSTNSAGA Sbjct: 961 GRAVLSQEQDGIKSHTHSASASSTDLGTKTTSSFDYGTKSTNNTGAHTHSVSGSTNSAGA 1020 Query: 1021 HTHSLANVNTASANSGAGSASTRLSVVHNQNYATSSAGAHTHSLSGTAASAGAHAHTVGI 1080 HTHSLANVNTASANSGAGSASTRLSVVHNQNYATSSAGAHTHSLSGTAASAGAHAHTVGI Sbjct: 1021 HTHSLANVNTASANSGAGSASTRLSVVHNQNYATSSAGAHTHSLSGTAASAGAHAHTVGI 1080 Query: 1081 GAHTHSVAIGSHGHTITVNAAGNAENTVKNIAFNYIVRLA 1120 GAHTHSVAIGSHGHTITVNAAGNAENTVKNIAFNYIVRLA Sbjct: 1081 GAHTHSVAIGSHGHTITVNAAGNAENTVKNIAFNYIVRLA 1120 >UniRef50_B7L485 Putative tail fiber protein n=4 Tax=Escherichia coli RepID=B7L485_ECO55 Length = 1056 Score = 413 bits (1061), Expect = e-113, Method: Composition-based stats. 
Identities = 702/1140 (61%), Positives = 777/1140 (68%), Gaps = 104/1140 (9%) Query: 1 MAVKISGVLKDGTGKPVQNCTIQLKAKRNSTTVVVNTLASENPDEAGRYSMDVEYGQYSV 60 M VKISGVLKDGTGKPVQNCTI LKA+R S+TVVVNT+ASENPDEAGRYSMDVE+GQYSV Sbjct: 1 MTVKISGVLKDGTGKPVQNCTIVLKARRTSSTVVVNTVASENPDEAGRYSMDVEHGQYSV 60 Query: 61 ILLVEGFPPSHAGTITVYEDSQPGTLNDFLGAMTEDDARPEALRRFELMVEEVARNASAV 120 LLVEGFPPSHAGTITVYE S+PGTLNDFLGAMTEDD RPEALRRFE MVEEV+RNASAV Sbjct: 61 TLLVEGFPPSHAGTITVYEGSRPGTLNDFLGAMTEDDVRPEALRRFEQMVEEVSRNASAV 120 Query: 121 AQNTAAAKKSASDASTSAREAATHAADAADSARAASTSAGQAASSAQSASSSAGTASTKA 180 AQNTAAAKKSASDAS SA EAATHA DAA SARAASTSAGQAASSAQSASSSAGTASTKA Sbjct: 121 AQNTAAAKKSASDASASASEAATHATDAAASARAASTSAGQAASSAQSASSSAGTASTKA 180 Query: 181 TEASKSAAAAESSKSAAATSAGAAKTSETNASASLQSAATSASTATTKASEAATSARDAA 240 EA+KSAAAAESSKSAAATSA AAKTSETNA+AS +SAATSASTATTKASEAATSAR AA Sbjct: 181 REAAKSAAAAESSKSAAATSASAAKTSETNAAASQKSAATSASTATTKASEAATSARGAA 240 Query: 241 ASKEAAKSSETNASSSASSAASSATAAGNSAKAAKTSETNARSSETAAGQSASAAAGSKT 300 ASKEAAKSSETNASSSASSAASSATAAGNSAKAAKTSETNARSSETAAGQSASAAAGSKT Sbjct: 241 ASKEAAKSSETNASSSASSAASSATAAGNSAKAAKTSETNARSSETAAGQSASAAAGSKT 300 Query: 301 AAASSASAASTSAGQASASATAAGKSAESAASSASTATTKAGEATEQASAAARSASAAKT 360 AAASSASAASTSAGQASASATAAGKSAESAASSASTATTKAG+ATEQASAAARSASAAKT Sbjct: 301 AAASSASAASTSAGQASASATAAGKSAESAASSASTATTKAGKATEQASAAARSASAAKT 360 Query: 361 SETNAKASETSAESSKTAAASSASSAASSASSASASKDEATRQASAAKSSATTASTKATE 420 SETNAKASETSAESSKTAAASSASSAASSASSASASKDEATRQASAAK Sbjct: 361 SETNAKASETSAESSKTAAASSASSAASSASSASASKDEATRQASAAK------------ 408 Query: 421 AAGSATAAAQSKSTAESAATRAETAAKRAEDIASAVALEDASTTKKGIVQLSSATNSTSE 480 SA +T G ++ + ST+E Sbjct: 409 --------------------------------GSATTASTKATEAAGSATAAAQSKSTAE 436 Query: 481 TLAATPKAVKSAYDNAEKRLQKDQNGADIPDKGCFLNNINAVSKTDFADKRGMRYVRVNA 540 + A + ++ + + + N+ S+T A + ++ NA Sbjct: 437 SAATRAETAAKRAEDIASAVALEDASTTKKGIVQLSSATNSTSETLAATPKAVKAANDNA 496 Query: 541 PAGATSGKYYPVVVMRSAGSVSELASRVIITTATRTAGDPMNNCEFNGFVMPGGWTDRGR 600 S V ++ G + + G Sbjct: 
497 NGRVPS--NRKVNGKALTADITLTPKD-------------------IGTLNSVTISFSGG 535 Query: 601 YAYGMFWQYQNNERAIHSIMMSNKGDDLRSVFYVDGAAFPVFAFIEDGLSISAPGA---D 657 + + + + G + +G GA Sbjct: 536 AGWFKLATVTMPQASSIVYIALIGGAGYNVGSPQQAGISELVLRAGNGKPKGITGALWKR 595 Query: 658 LVVNDTTYKFGATNPATECIAADVILDFKSGRGFYESHSLIVNDNLSCKKLFATDEIVAR 717 V T + + T+ T I + I ++ + + + + ++ ++ + + Sbjct: 596 TAVGLTNFAWINTSGDTYDIYVE-IGNYATSVNIHWDCTTNASVSIYTSPTYSASKPSSV 654 Query: 718 GGNQIRMIGGEYGALWRNDGAKTYLLLTNQGDVYGGWNTLRPFAIDNATGELVIGTKLSA 777 G + + + +D P +G L + ++ Sbjct: 655 TGGVVYTMYSSHQKPTPSDIGAL------------------PTTGGTISGPLSVTDGITG 696 Query: 778 SLNGNALTATKLQTPRRVSGVEFDGSKDITLTAAHVAAFARRATDTYADADGGVPWNAES 837 +L GNA TATKL +++GV+FDGS DI LT+ ++ AFARR+T YAD+DG VPWNAES Sbjct: 697 ALKGNADTATKLAAAPKINGVKFDGSADINLTSENIGAFARRSTGAYADSDGAVPWNAES 756 Query: 838 GAYNVTRSGDSYILVNFYTGVGSCRTLQMKAHYRNGGLFYRSSRDGYGFEEDWAEVYTSK 897 GAYNVTRSGDSYILVNFYTGVGSCRTLQMKAHYRN GLFYRSSRDGYGFEEDWAEVYTSK Sbjct: 757 GAYNVTRSGDSYILVNFYTGVGSCRTLQMKAHYRNRGLFYRSSRDGYGFEEDWAEVYTSK 816 Query: 898 NLPPESYPVGAPIPWPSDTVPSGYALMQGQAFDKSAYPKLAAAYPSGVIPDMRGWTIKGK 957 NLPPESYPVGAPIPWPSDTVPSGYALMQGQ F+KSAYPKLAAAYPSGVIPDMRGWTIKGK Sbjct: 817 NLPPESYPVGAPIPWPSDTVPSGYALMQGQTFNKSAYPKLAAAYPSGVIPDMRGWTIKGK 876 Query: 958 PASGRAVLSQEQDGIKSHTHSASASSTDLGTKTTSSFDYGTKSTNNTGAHTHSVSGSTNS 1017 PASGRAVLSQEQDGIKSHTHSASASSTDLGTKTTSSFDYGTKSTNNTGAHTHS+SGST S Sbjct: 877 PASGRAVLSQEQDGIKSHTHSASASSTDLGTKTTSSFDYGTKSTNNTGAHTHSLSGSTGS 936 Query: 1018 AGAHTHSLANVNTASANSGAG-----------------SASTRLSVVHNQNYATSSAGAH 1060 AG HTH S S T + Q T SAGAH Sbjct: 937 AGVHTHGNGIRWPGGGGSALAFYDGGGFTYVQNSQYQVSPGTSSYRSYYQRIQTQSAGAH 996 Query: 1061 THSLSGTAASAGAHAHTVGIGAHTHSVAIGSHGHTITVNAAGNAENTVKNIAFNYIVRLA 1120 THSLSGTAAS+GAHAHTVGIGAHTHSVAIGSHGHTITVNAAGNAENTVKNIAFNYIVRLA Sbjct: 997 THSLSGTAASSGAHAHTVGIGAHTHSVAIGSHGHTITVNAAGNAENTVKNIAFNYIVRLA 1056 >UniRef50_B5FQX9 Side tail fiber protein n=2 Tax=Salmonella enterica subsp. 
enterica RepID=B5FQX9_SALDC Length = 569 Score = 328 bits (840), Expect = 8e-88, Method: Composition-based stats. Identities = 170/406 (41%), Positives = 205/406 (50%), Gaps = 64/406 (15%) Query: 736 DGAKTYLLLTNQGDVYGGWNTLR-----PFAIDNATGE--LVIGTKLSASLNGNALTATK 788 D KTY + N ++Y G L+ D A G L + +K + N A + Sbjct: 207 DNTKTYFSVLNPLEIYLGSRYLQKDQNLSDVPDKAKGRSSLEVYSKTESDENYMAKSQCG 266 Query: 789 LQTPRRVSGVEFDGSKDITLTAAHVAAFARRA-----TDTYADADGG--------VPWNA 835 P + V+ G+ + TA A R T +D G + Sbjct: 267 ADIPNKPLFVQNIGALPASGTAVAANRLASRGALPALTGATRGSDSGLIMGEVYNNGYPT 326 Query: 836 ESGAYNVTRSGDSYILVNFYTGVGSCRTLQMKAHYRNGGLFYRSSRDGYGF-EEDWAEVY 894 + G ++ ++G + RS RD +WA +Y Sbjct: 327 QYGNILRLTGTGDGEILIGWSGTNGAPAPA----------YIRSHRDTADAEWSEWAMLY 376 Query: 895 TSKNLPPESYPVGAPIPWPSDTVPSGYALMQGQAFDKSAYPKLAAAYPSGVIPDMRGWTI 954 TS N PP SYPVGA I WPSD P+GYALMQGQ+FDKSAYP LA AYPSG+IPDMRGWTI Sbjct: 377 TSLNPPPNSYPVGAAIAWPSDATPAGYALMQGQSFDKSAYPLLAIAYPSGIIPDMRGWTI 436 Query: 955 KGKPASGRAVLSQEQDGIKSHTHSASASSTDLGTKTTSSFDYGTKSTNNTGAHTHSVSGS 1014 KGKP SGRAVLSQE DG KSH+HSA A TDLGTK+TSSFDYGTKSTN TG HTH G Sbjct: 437 KGKPISGRAVLSQEMDGNKSHSHSARAQDTDLGTKSTSSFDYGTKSTNTTGNHTHQFGGY 496 Query: 1015 TNSAGAHTHSLANVNTASANSGAGSASTRLSVVHNQNYATSSAGAHTHSLSGTAASAGAH 1074 NS + N S G G+ + +AG H Sbjct: 497 INS------YWGDSNHTSFQPGGGAWTQ---------------------------AAGDH 523 Query: 1075 AHTVGIGAHTHSVAIGSHGHTITVNAAGNAENTVKNIAFNYIVRLA 1120 AHTV IG H H++ IG HGH + V+A GNAE TVKNIAFNYIVRLA Sbjct: 524 AHTVYIGGHEHTMYIGPHGHVVIVDADGNAETTVKNIAFNYIVRLA 569 >UniRef50_B7LN99 Putative tail fiber protein n=2 Tax=Enterobacteriaceae RepID=B7LN99_ESCF3 Length = 593 Score = 309 bits (790), Expect = 4e-82, Method: Composition-based stats. 
Identities = 205/352 (58%), Positives = 242/352 (68%), Gaps = 26/352 (7%) Query: 770 VIGTKLSASLNGNALTATKLQTPRRVSGVEFDGSKDITLTAAHVAAFARRATDTYADADG 829 + A N ++ + R+V+G S DI L+ V AF+ T Y +DG Sbjct: 267 TLAATPKAVKAANDNANGRVPSGRKVNGHAL--SSDIKLSPEDVNAFSLGCTGQYPSSDG 324 Query: 830 GVPWNAESGAYNVTRSGDSYILVNFYTGVGSCRTLQMKAHYRNGGLFYRSSRDGYGFEED 889 GVPWNA+SG YNV G SYI+ +F++GVGSCR+ Q++A Y+N GL+YRSSRDGYGFE Sbjct: 325 GVPWNAKSGLYNVMDGGASYIVAHFFSGVGSCRSFQLRADYKNRGLYYRSSRDGYGFERG 384 Query: 890 WAEVYTSKNLPPESYPVGAPIPWPSDTVPSGYALMQGQAFDKSAYPKLAAAYPSGVIPDM 949 + P ++PVGAPI WPSD VP GYA+MQGQ FDK+AYP LAAAYPSGVIPDM Sbjct: 385 FE--------PVNAFPVGAPIAWPSDIVPEGYAIMQGQTFDKAAYPLLAAAYPSGVIPDM 436 Query: 950 RGWTIKGKPASGRAVLSQEQDGIKSHTHSASASSTDLGTKTTSSFDYGTKSTNNTGAHTH 1009 RGWTIKGKPASGRAVLSQEQDGIKSHTHSASASSTDLGTKTTSSFDYGTK+ + T Sbjct: 437 RGWTIKGKPASGRAVLSQEQDGIKSHTHSASASSTDLGTKTTSSFDYGTKTVSTFNHGT- 495 Query: 1010 SVSGSTNSAGAHTHSLANVNTASANSGAGSASTRLSVV-HNQNYATSSAGAHTHSLSGTA 1068 +TN+ GAHTH++ G S + V N +SS GAH H++ Sbjct: 496 ---KTTNNTGAHTHTVGG------RYGGDSIGGKQRVQVSGTNQVSSSDGAHAHTV---- 542 Query: 1069 ASAGAHAHTVGIGAHTHSVAIGSHGHTITVNAAGNAENTVKNIAFNYIVRLA 1120 G H HTVGIGAH H+VA+G+HGHTITVNAAGNAENTVKNIAFNYIVRLA Sbjct: 543 -DIGQHNHTVGIGAHAHTVALGAHGHTITVNAAGNAENTVKNIAFNYIVRLA 593 >UniRef50_B3I2W7 Phage Tail Collar Domain family n=1 Tax=Escherichia coli E22 RepID=B3I2W7_ECOLX Length = 654 Score = 271 bits (692), Expect = 9e-71, Method: Composition-based stats. 
Identities = 152/365 (41%), Positives = 188/365 (51%), Gaps = 46/365 (12%) Query: 761 AIDNATGELVIGTKLSASLNGNALTATKLQTPRRVSGVEFDGSKDITLTAAHVAAFAR-R 819 +NA G + K+ NG+AL T R + FDG ++ + Sbjct: 331 VNENANGRVPASRKV----NGHALNGDINVTSRDI----FDGQVIAIGANKNLDDYQVPG 382 Query: 820 ATDTYADADGGVPWNAESGAYNVTRSGDSYILVNFYTGVGSCRTLQMKAHYRNGGLFYRS 879 A+ + N S S +++ G G + ++ N Y Sbjct: 383 LYFQEANNNTSAAMNYP------ENSAGSLMVLR---GAGVTQVYRVY----NSSRSYSR 429 Query: 880 SRDGYGFEEDWAEVYTSKNLPPESYPVGAPIPWPSDTVPSGYALMQGQAFDKSAYPKLAA 939 S+ W +P +SYPVGAPIPWPSD P+GYALMQGQ FDK+ YP LA Sbjct: 430 SKYSTLAWTPW--------MPEDSYPVGAPIPWPSDVTPTGYALMQGQPFDKAVYPLLAI 481 Query: 940 AYPSGVIPDMRGWTIKGKPASGRAVLSQEQDGIKSHTHSASASSTDLGTKTTSSFDYGTK 999 AYP+G+IPDMRG TIKGKP +GRAVLS EQDG+ SHTH AS S TDLGTK TSSFDYG+K Sbjct: 482 AYPAGIIPDMRGQTIKGKP-NGRAVLSYEQDGVISHTHGASISDTDLGTKYTSSFDYGSK 540 Query: 1000 STNNTGAHTHSVSGSTNSAGAHTHSLANVNTASANS----GAGSASTRLSVVHNQNYATS 1055 T + S+ G H H+ T++ G G S+ +S Sbjct: 541 PTTSFDYGN----KSSTEGGWHAHNFRYCATSAYRDTPGQGLGMHSSNVSWAAGDR--IE 594 Query: 1056 SAGAHTHSLSGTAASAGAHAHTVGIGAHTHSVAIGSHGHTITVNAAGNAENTVKNIAFNY 1115 +G H H G H H VGIGAH H V +G HGHT TV+AAGNAENTVKNIAFNY Sbjct: 595 GSGNHAH-----VTWIGPHDHWVGIGAHNHYVVMGYHGHTATVHAAGNAENTVKNIAFNY 649 Query: 1116 IVRLA 1120 IVRLA Sbjct: 650 IVRLA 654 >UniRef50_B3XIH6 GpH n=7 Tax=Enterobacteriaceae RepID=B3XIH6_ECOLX Length = 710 Score = 243 bits (620), Expect = 2e-62, Method: Composition-based stats. 
Identities = 191/360 (53%), Positives = 220/360 (61%), Gaps = 28/360 (7%) Query: 761 AIDNATGELVIGTKLSASLNGNALTATKLQTPRRVSGVEFDGSKDITLTAAHVAAFARRA 820 A DNA G + G K++ + + +G + + L + Sbjct: 379 ANDNANGRVPSGRKVNG---KPLTNDVNVTSQDIFNGQSINIGANQNLDNYKTPGLYHQP 435 Query: 821 TDTYADADGGVPWNAESGAYNVTRSGDSYILVNFYTGVGSCRTLQMKAHYRNGGLFYRSS 880 + Y A P N + +G + Q+ Y + RS Sbjct: 436 LNAYTSAALKYPENLAGTLVVLKNAGIT----------------QIYYVYNTSRSYTRSQ 479 Query: 881 RDGYGFEEDWAEVYTSKNLPPESYPVGAPIPWPSDTVPSGYALMQGQAFDKSAYPKLAAA 940 G W P +S+PVGA IPWPSD+VP+GYA+MQGQ FDK+ YP LAAA Sbjct: 480 -YSTGDWTAWT--------PQDSFPVGAAIPWPSDSVPTGYAVMQGQTFDKTTYPLLAAA 530 Query: 941 YPSGVIPDMRGWTIKGKPASGRAVLSQEQDGIKSHTHSASASSTDLGTKTTSSFDYGTKS 1000 YPSGV+PDMRGWTIKGKPASGR VLS EQDGIKSHTHSASAS+TDLGTKTTSSFDYGTKS Sbjct: 531 YPSGVLPDMRGWTIKGKPASGRDVLSLEQDGIKSHTHSASASNTDLGTKTTSSFDYGTKS 590 Query: 1001 TNNTGAHTHSVSGSTNSAGAHTHSLANVNTASANSGAGSASTRLSVVHNQNYATSSAGAH 1060 TNNTGAHTH+VSG+ NSAGAHTH++ S S N S+GAH Sbjct: 591 TNNTGAHTHNVSGTANSAGAHTHTVPLRRPNSGGMNFDWLDGASSGTVVGNGTVPSSGAH 650 Query: 1061 THSLSGTAASAGAHAHTVGIGAHTHSVAIGSHGHTITVNAAGNAENTVKNIAFNYIVRLA 1120 THS+SGTA SAGAHAHTVGIGAHTHSVAIGSHGHTITVNAAGNAENTVKNIAFNYIVRLA Sbjct: 651 THSVSGTATSAGAHAHTVGIGAHTHSVAIGSHGHTITVNAAGNAENTVKNIAFNYIVRLA 710 >UniRef50_Q3ZL14 Tail fiber protein n=2 Tax=Enterobacteriaceae RepID=Q3ZL14_ESCBL Length = 289 Score = 235 bits (598), Expect = 8e-60, Method: Composition-based stats. 
Identities = 110/224 (49%), Positives = 134/224 (59%), Gaps = 12/224 (5%) Query: 898 NLPPESYPVGAPIPWPSDTVPSGYALMQGQAFDKSAYPKLAAAYPSGVIPDMRGWTIKGK 957 P G + WP T P+G+ALM GQ FD +AYP+LA AYPSGVIPDMRG TIK Sbjct: 77 FSSDYMLPPGIALAWPGATAPTGFALMLGQTFDTTAYPRLAQAYPSGVIPDMRGQTIKFL 136 Query: 958 PASGRAVLSQEQDGIKSHTHSASASSTDLGTKTTSSFDYGTKSTNNTGAHTHSVSGSTNS 1017 PASGR +LS E DG+KSH+HS S S+TDLGT T + D GTK T+ G H H N Sbjct: 137 PASGRTLLSLEADGVKSHSHSGSISTTDLGTATAADTDLGTKQTSQDGLHNHVSDSRFNK 196 Query: 1018 AGAHTHSLANV-NTASANSGAGSASTRLSVVHNQNYATSSAGAHTHSLSGTAASAGAHAH 1076 A + + NT +S + R+S +++ +A S A +G H H Sbjct: 197 LMARSSDIDGTNNTGDVDSDNPESEHRVSGMNDSLWAAS-----------VIADSGLHMH 245 Query: 1077 TVGIGAHTHSVAIGSHGHTITVNAAGNAENTVKNIAFNYIVRLA 1120 TV IG H HSV IG HGHT+T++ GN ENTVKNIAFN IVRLA Sbjct: 246 TVYIGPHAHSVYIGPHGHTVTISNFGNTENTVKNIAFNAIVRLA 289 >UniRef50_B5PP06 Side tail fiber protein n=2 Tax=Salmonella enterica subsp. enterica RepID=B5PP06_SALHA Length = 534 Score = 233 bits (594), Expect = 3e-59, Method: Composition-based stats. 
Identities = 142/337 (42%), Positives = 173/337 (51%), Gaps = 57/337 (16%) Query: 784 LTATKLQTPRRVSGVEFDGSKDITLTAAHVAAFARRA-----TDTYADADGG-------- 830 + P + V+ G+ + TA A R T T +D G Sbjct: 118 KSQNGGDIPEKPLFVQNIGALPASGTAVAANRLASRGALPALTGTTRGSDSGLIMGEVYN 177 Query: 831 VPWNAESGAYNVTRSGDSYILVNFYTGVGSCRTLQMKAHYRNGGLFYRSSRDGYGF-EED 889 + + G ++ ++G + RS RD + Sbjct: 178 NGYPTQYGNILRLTGTGDGEILIGWSGTNGAPAPA----------YIRSHRDTADAEWSE 227 Query: 890 WAEVYTSKNLPPESYPVGAPIPWPSDTVPSGYALMQGQAFDKSAYPKLAAAYPSGVIPDM 949 WA +YT+ N PP+S+PVGAPI WPSD P+GYALMQGQ+FDKSAYP LA AYPSGVIPDM Sbjct: 228 WAMLYTTLNPPPDSHPVGAPIAWPSDATPAGYALMQGQSFDKSAYPLLAIAYPSGVIPDM 287 Query: 950 RGWTIKGKPASGRAVLSQEQDGIKSHTHSASASSTDLGTKTTSSFDYGTKSTNNTGAHTH 1009 RGWTIKGKPASGRA+LSQE DG KSH+HSA A TDLGTKTTSSFDYGTKSTN TG HT+ Sbjct: 288 RGWTIKGKPASGRAILSQEMDGNKSHSHSARAQDTDLGTKTTSSFDYGTKSTNTTGNHTN 347 Query: 1010 SVSGSTNSAGAHTHSLANVNTASANSGAGSASTRLSVVHNQNYATSSAGAHTHSLSGTAA 1069 G NS + N S G G+ + Sbjct: 348 QFGGYINS------YWGDSNHTSFQPGGGAWTQ--------------------------- 374 Query: 1070 SAGAHAHTVGIGAHTHSVAIGSHGHTITVNAAGNAEN 1106 +AG HAHTV IG H H++ IG HGH + V+A GNAE Sbjct: 375 AAGDHAHTVYIGGHEHTMYIGPHGHVVIVDADGNAET 411 >UniRef50_B4T041 Gp19 n=1 Tax=Salmonella enterica subsp. enterica serovar Newport str. SL254 RepID=B4T041_SALNS Length = 580 Score = 230 bits (584), Expect = 3e-58, Method: Composition-based stats. 
Identities = 142/359 (39%), Positives = 172/359 (47%), Gaps = 42/359 (11%) Query: 762 IDNATGELVIGTKLSASLNGNALTATKLQTPRRVSGVEFDGSKDITLTAAHVAAFARRAT 821 I A L +G + ++ T R ++ + I L A V + Sbjct: 264 IQQARQSLQLGNSATLNVGTTPDTVAAGDDARIITTKKAIDDTQIGLGAQPV--MWVSSA 321 Query: 822 DTYADADGGVPWNAESGAYNVTRSGDSYILVNFYTGVGSCRTLQMKAHYRNGGLFYRSSR 881 D + G A + + Y+ + + G + + Sbjct: 322 DDLSSLPSGARRFASNKVPATILPVNDYVFLEVIAKRDCVDGCAVLITDSIGNTWIGARW 381 Query: 882 DGYGFEEDWAEVYTSKNLPPESYPVGAPIPWPSDTVPSGYALMQGQAFDKSAYPKLAAAY 941 D + P S P G P+PWPSDT+P+GYALMQGQAFDK+ YP LA AY Sbjct: 382 DATN-GSGFTWR------PMMSCPPGVPLPWPSDTIPAGYALMQGQAFDKNVYPLLAIAY 434 Query: 942 PSGVIPDMRGWTIKGKPASGRAVLSQEQDGIKSHTHSASASSTDLGTKTTSSFDYGTKST 1001 PSG IPDMRGWTIKGKP SGRAVLSQE DG KSH+H A A TDLGTK TSSFDYGTKS+ Sbjct: 435 PSGTIPDMRGWTIKGKPVSGRAVLSQELDGNKSHSHGARALDTDLGTKGTSSFDYGTKSS 494 Query: 1002 NNTGAHTHSVSGSTNSAGAHTHSLANVNTASANSGAGSASTRLSVVHNQNYATSSAGAHT 1061 N TG H HS G+ G S + V + N +S Sbjct: 495 NTTGGHNHSAGGT--------------------YGGDSIGGKARVQRDGNDQLTS----- 529 Query: 1062 HSLSGTAASAGAHAHTVGIGAHTHSVAIGSHGHTITVNAAGNAENTVKNIAFNYIVRLA 1120 G HAHT IG H H+V IG HGH + V+A GNAE TVKNIAFNYIVRLA Sbjct: 530 --------WNGDHAHTTWIGPHDHTVYIGPHGHVVIVDADGNAETTVKNIAFNYIVRLA 580 >UniRef50_C8U9W7 Probable tail fiber protein-like protein n=1 Tax=Escherichia coli O103:H2 str. 12009 RepID=C8U9W7_ECO10 Length = 377 Score = 228 bits (579), Expect = 1e-57, Method: Composition-based stats. 
Identities = 109/204 (53%), Positives = 128/204 (62%), Gaps = 1/204 (0%) Query: 527 FADKRGMRYVRVNAPAGATSGKYYPVVVMRSAGSVSELASRVIITTATRTAGDPMNNCEF 586 K+GM+ NAP+ A GK+YPV+ RS GS ELASRVIITT + MNNCEF Sbjct: 1 MDKKKGMQQYAFNAPSNAVGGKWYPVIFRRSTGSTGELASRVIITTTSAGGNYEMNNCEF 60 Query: 587 NGFVMPGGWTDRGRYAYGMFWQYQNNERAIHSIMMSNKGDDLRSVFYVDGAAFPVFAFIE 646 NG VMPGGWTDRG YA G F YQ NERAIHSI+ S K DD+ SVFYV+G AFPV E Sbjct: 61 NGMVMPGGWTDRGSYAAGYFSTYQTNERAIHSIVTSLKEDDVCSVFYVEGRAFPVRVSAE 120 Query: 647 DGLSISAPGADLVVNDTTYKFGATNPATECIAADVILDFKSGRGFYESHSLIVNDNLSCK 706 +GL++ P D V TTYK+GATNPATE A ILDF +GRGFY SHS+ + + Sbjct: 121 EGLTVIVPTQDYTVGQTTYKWGATNPATESTNAQAILDFNNGRGFYCSHSIFGINAIFSG 180 Query: 707 KLFATDEIVARGGNQIRMIGGEYG 730 L A G N I + + G Sbjct: 181 NL-GIGTANALGSNSIVLGDNDTG 203 >UniRef50_Q99362 Protein 37 (Fragment) n=1 Tax=Enterobacteria phage T4 RepID=Q99362_BPT4 Length = 382 Score = 223 bits (567), Expect = 3e-56, Method: Composition-based stats. Identities = 158/274 (57%), Positives = 178/274 (64%), Gaps = 36/274 (13%) Query: 882 DGYGFEEDWAEVYTSKNLPPESYPVGAPIPWPSDTVPSGYALMQGQAFDKSAYPKLAAAY 941 F + + + K SYP+GAPIPWP+DT P+GYALM+GQ FD AYPKLAAAY Sbjct: 110 GSGNFANLNSTIESLKTDIVSSYPIGAPIPWPTDTPPNGYALMEGQTFDTRAYPKLAAAY 169 Query: 942 PSGVIPDMRGWTIKGKPASGRAVLSQEQDGIKSHTHSASASSTDLGTKTTSSFDYGTKST 1001 PSG IPDMRG TIKGKP SGRAVLS E DG+KSHTH ASAS+TDLGTKTTSSFDYGTK+T Sbjct: 170 PSGTIPDMRGQTIKGKP-SGRAVLSTEADGVKSHTHGASASNTDLGTKTTSSFDYGTKTT 228 Query: 1002 ----------NNTGAHTHSVSGSTNSAGAHTHSLANVNTASANSGAGSASTRLSVVHNQN 1051 N TG H H+VSG+T+SAGAH H+ + ++ S V N N Sbjct: 229 SSFDYGTKTSNTTGNHNHTVSGTTSSAGAHQHARSGPQLSNGISTNIFPDGYSDVGTNYN 288 Query: 1052 ----------------YATSSAGAHTHSLSGT---------AASAGAHAHTVGIGAHTHS 1086 TS+ GAHTH+ SGT GAH HTVGIGAHTH+ Sbjct: 289 SKFSGTVIGSSVPCIIGKTSNDGAHTHTWSGTTSTTGNHAHTVGIGAHTHTVGIGAHTHT 348 Query: 1087 VAIGSHGHTITVNAAGNAENTVKNIAFNYIVRLA 1120 VAIGSHGHTITVNA GN ENTVKNIAFNYIVRLA Sbjct: 349 VAIGSHGHTITVNATGNTENTVKNIAFNYIVRLA 382 >UniRef50_C4TT85 Gp19 n=1 Tax=Yersinia 
kristensenii ATCC 33638 RepID=C4TT85_YERKR Length = 732 Score = 218 bits (554), Expect = 1e-54, Method: Composition-based stats. Identities = 89/380 (23%), Positives = 143/380 (37%), Gaps = 24/380 (6%) Query: 396 SKDEATRQASAAKSSATTASTKATEAAGSATAAAQSKSTAESAATRAETAAKRAEDIASA 455 + T A A + G ++ + S +E+ A + + +++ Sbjct: 276 PQGSLTDDALAKHEKSRNHPDGTLAEKGFVKLSSATDSNSETLAATPKAVKAVMDSASNS 335 Query: 456 -------VALEDASTTKKGIVQLSSATNSTSETLAATPKAVKSAYDNAEKRLQKDQNGAD 508 D + KG VQL+SAT+STSETLAATPKAVK A DNA RL KD+NGAD Sbjct: 336 LDLHEKSRDHPDGTLLYKGFVQLASATDSTSETLAATPKAVKIAMDNANARLAKDRNGAD 395 Query: 509 IPDKGCFLNNINAVSKTDFADKRGMRYVRVNAPAGATSGKYYPVVVMRSAGSV-SELASR 567 IP+ F N+ + V + + + ++ G+V LA Sbjct: 396 IPNVPLFRQNLALKGAALVDIGKTAGTVAAGDDSRIVNA------LPKTGGTVTGWLAVT 449 Query: 568 VIITTATRTAGDPMN----NCEFNGFVMPGGWTDRGRYAYGMFWQYQNNERAIHSIM-MS 622 I+ G N NG + G T G +A + + + +N SI Sbjct: 450 GILDGPIGPGGYKSNILVGTAGGNGAITNAGGTGFGLHASNVIYFWNDNSGYAMSISPTI 509 Query: 623 NKGDDLRSVFYVDGAAFPVFAFIEDGLSISAPGADLVVNDTTYKFGATNPATECIAADVI 682 + S+ GAA + + E + +D + K+ N + D++ Sbjct: 510 LSVNRPISILSGAGAALSIKSQNEGDVCYIMSVSDTA---SKSKWYIGNTQENNTSFDLV 566 Query: 683 LDFKSGRGFYESHSLIVNDNLSCKKLFATDEIVARGGNQIRMIGGEY-GALWRNDGAKTY 741 + K+G + ++ S KL + +++A GG G+ GA W N TY Sbjct: 567 -NSKAGVALKINDTISTTAPFSASKLTSAGDVIAGGGATTYQANGDIKGAAWANGLLSTY 625 Query: 742 LLLTNQGDVYGGWNTLRPFA 761 + G R Sbjct: 626 INAMRSTSALSGNGWWRDPV 645 >UniRef50_UPI0001826514 putative tail fiber protein (GpH) n=2 Tax=Enterobacter cancerogenus ATCC 35316 RepID=UPI0001826514 Length = 719 Score = 216 bits (549), Expect = 3e-54, Method: Composition-based stats. 
Identities = 110/441 (24%), Positives = 171/441 (38%), Gaps = 43/441 (9%) Query: 352 ARSASAAKTSETNAKASETSAESSKTAAASSASSAASSASSASASKDEATRQASAAKSSA 411 A + ++ + + SSA SAS + A+ +A + A Sbjct: 148 ATQDYVDDKLAEHEQSRRHPDATLTAKGFTQLSSATDSASESVAATPKAVKAAYDLAKGK 207 Query: 412 TTASTKATEAAGSATAAAQSKSTAESAATRAETAAKRAEDIASAVALEDASTTKKGIVQL 471 TA T G ++ + ST+ES A + + +DA+T +KGIVQL Sbjct: 208 YTAQDATTAQKGIVQLSSATDSTSESVAATPKAVKAAYDLAKGKYTAQDATTAQKGIVQL 267 Query: 472 SSATNSTSETLAATPKAVKSAYDNAEKRLQKDQNGADIPDKGCFL---------NNINAV 522 SSAT+S SE LAATPKAVK+A DNA R+ + + Sbjct: 268 SSATDSASEALAATPKAVKAANDNANGRVPSGRKINGRALSADISITAQDIFNGQTVGIG 327 Query: 523 SKTDFADKRGMRYVRVNAPAGATSGKYYPVVVMRSAGSVSELASRVIITTATRTAGDPMN 582 + D A A A +GK YP + S E+ IT R N Sbjct: 328 NAEDLNAYTTPGLYYQPANAQAQTGKNYPEAMAGSL----EVYKHAGITQVYRVYN---N 380 Query: 583 NCEFNGFVMPGGWTDRGRYAYGMFWQYQNNERAIHSIMMSNKGDDLRSVFYVDGAAFPVF 642 + + + G W+ A+ + N A + G + V+G Sbjct: 381 SRSYIRTLYSGTWS-----AWAKQYDAANKPTAGEVGALPVTGGTVTGNATVNGT----- 430 Query: 643 AFIEDGLSISAPGADL-VVNDTTYKFGATNPATECIAADVILDFKSGRGFYESHSLIVND 701 + +G + N + +G + T +L+FK G++ + Sbjct: 431 LSVGNGRRFEISSQNSSTANGSLLLWGNADRPT-------VLEFKDATGYHFYSQRNKDG 483 Query: 702 NLS---------CKKLFATDEIVARGGNQIRMIGGEYGALWRNDGAKTYLLLTNQGDVYG 752 ++S + ++ E V+R N R+ G YGA WRNDG YLL+TN GD G Sbjct: 484 SVSFSFNGVSSFAGGITSSGEFVSRSANGFRIAYGSYGAFWRNDGGSLYLLVTNSGDSLG 543 Query: 753 GWNTLRPFAIDNATGELVIGT 773 +N+LRPF + ATG++ + Sbjct: 544 TFNSLRPFTVSLATGDITMNK 564 >UniRef50_Q3YZL1 Phage protein-related n=42 Tax=Enterobacteriaceae RepID=Q3YZL1_SHISS Length = 1029 Score = 208 bits (529), Expect = 8e-52, Method: Composition-based stats. 
Identities = 353/933 (37%), Positives = 443/933 (47%), Gaps = 37/933 (3%) Query: 209 TNASASLQSAATSASTATTKASEAATSARDAAASKEAAKSSETNASSSASSAASSATAAG 268 + + + A + + A AS+A+TSAR+AA A S AS+SA AASSA +A Sbjct: 113 EEVARNASAVAQNTAAAKKSASDASTSAREAATHATDAADSARAASTSAGQAASSAQSAS 172 Query: 269 NSAKAAKTSETNARSSETAAGQSASAAAGSKTAAASSASAASTSAGQASASATAAGKSAE 328 +SA A T T A S AA S SAAA S AA + + A+ S A+ SA+ A A Sbjct: 173 SSAGTASTKATEASKSAAAAESSKSAAATSAGAAKTLETNAAASQQSAATSASTATTKAS 232 Query: 329 SAASSASTATTKAGEATEQASAAARSASAAKTSETNAKASETSAESSKTAAASSASSAAS 388 AA+SA A+ A + A+ SAS+A +S T A S +A++S+T A SS ++A Sbjct: 233 EAATSARDASASKEAAKSSETNASSSASSAASSATAAANSAKAAKTSETNARSSETAAGQ 292 Query: 389 SASSASASKDEATRQASAAKSSATTASTKATEAAGSATAAAQSKSTAESAATRAETAAKR 448 SAS+A+ SK A ASAA +SA AS AT A SA +AA S STA + A A A Sbjct: 293 SASAAAGSKTAAASSASAASTSAGQASASATAAGKSAESAASSASTATTKAGEATEQATA 352 Query: 449 AEDIASAVALEDASTTKKGIVQLSSATNSTSETLAATPKAVKSAYDNAEKRLQKDQNGAD 508 A ASA + + SS T + S +A A ++ E Q Sbjct: 353 AARSASAAKTSETNAKASETRAESSKTAAASSASSAASSASSASASKDEATRQASAAKGS 412 Query: 509 IPDKGCFLNNINAVSKTDFADKRGMRYVRVNAPAGATSGKYYPVVVMRSAGSVSELASRV 568 + K A A + V S + Sbjct: 413 ATTASTKATEAAGSATAAAQSKSTAESAATRAETAAKRAEDIASAVALEDAS---TTKKG 469 Query: 569 IITTATRTAGDPMNNCEFNGFVMPGGWTDRGRYAYGMFWQYQNNERAIHSIMMSNKGDDL 628 I+ ++ T + V G+Y Q + S S Sbjct: 470 IVQLSSATNSTSESLAATPKAVKAAYDLANGKYTAQDATTAQKGIIQLSSATNSTSETLA 529 Query: 629 RSVFYVDGAAFPVFAFIEDGLS-ISAPGADL---VVNDTTYKFGATNPATECIAADVILD 684 + V A ++ + PG D + G+ + T ++ Sbjct: 530 ATPKAVKAANDNAEKRLQKDQNGADIPGKDTFTKNIGACRAFGGSVSTTTGNWTTAQFIE 589 Query: 685 FKSGRGFYESHSLIVNDNLSCKKLFATDEIVARGGNQIRMIGGEYGALWRNDGAKTYLLL 744 + +G + + + S ++I+ G + G + A T + Sbjct: 590 WLDSQGAFNHPYWMCKGSWSYG----NNKIITDTGCGNIHLAGAVIEVMGIKSAMTIRIT 645 Query: 745 TNQGDVYGGWNTLRPFAIDNATGELVIGTKLSASLNGNALTATKLQTPRRVSGVEFDGSK 804 T GG + I++ T + S N T + + G+ Sbjct: 646 TPTTSSGGGTTNAQFTYINHGTDYSPGWRRDYNSRNK--------PTASEIGALPSGGTA 697 Query: 805 
DITLTAAHVAAFARRATDTYADADGGVPWNAESGAYNVTRSGDSYILVNFYTGVGSCRTL 864 ++ A A TD A G + + Y ++ G G Sbjct: 698 VSSVNLASKGRVA-ALTDNTQGAAGLELYEVYNNGYPTAYGNIIHLKGMTAVGEGELLIG 756 Query: 865 QMKAHYRNGGLFYRSSRDGYGF-EEDWAEVYTSKNLPPESYPVGAPIPWPSDTVPSGYAL 923 + F RS RD WA++YTS + P E YPVGAPIPWPSDTVPSGYAL Sbjct: 757 WSGTSGAHAPAFIRSRRDTTDANWSPWAQLYTSAHPPAEFYPVGAPIPWPSDTVPSGYAL 816 Query: 924 MQGQAFDKSAYPKLAAAYPSGVIPDMRGWTIKGKPASGRAVLSQEQDGIKSHTHSASASS 983 MQGQ FDKSAYPKLAAAYPSGVIPDMRGWTIKGKPASGRAVLSQEQDGIKSHTHSASASS Sbjct: 817 MQGQTFDKSAYPKLAAAYPSGVIPDMRGWTIKGKPASGRAVLSQEQDGIKSHTHSASASS 876 Query: 984 TDLGTKTTSSFDYGTKSTNNTGAHTHSVSGSTNSAGAHTHSLANVNTASANSGAGSASTR 1043 TDLGTKTTSSFDYGTKSTNNTGAHTHS+SGST+SAGAH HS T S + G Sbjct: 877 TDLGTKTTSSFDYGTKSTNNTGAHTHSLSGSTSSAGAHQHSQTGPRTNSGSQPTGMFPAG 936 Query: 1044 LSVVHNQNY----------------ATSSAGAHTHSLSGTAASAGAHAHTVGIGAHTHSV 1087 + V N +SS G HTHSLSGTAASAGAHAHTVGIGAHTHSV Sbjct: 937 STQVSGTNQVGISGSLTSGTSQWVGKSSSEGNHTHSLSGTAASAGAHAHTVGIGAHTHSV 996 Query: 1088 AIGSHGHTITVNAAGNAENTVKNIAFNYIVRLA 1120 AIGSHGHTITVNAAGNAENTVKNIAFNYIVRLA Sbjct: 997 AIGSHGHTITVNAAGNAENTVKNIAFNYIVRLA 1029 Score = 182 bits (461), Expect = 7e-44, Method: Composition-based stats. 
Identities = 493/578 (85%), Positives = 513/578 (88%) Query: 1 MAVKISGVLKDGTGKPVQNCTIQLKAKRNSTTVVVNTLASENPDEAGRYSMDVEYGQYSV 60 MAVKISGVLKDGTGKPVQNCTIQLKAKRNSTTVVVNTLASENPDEAGRYSMDVEYGQYSV Sbjct: 3 MAVKISGVLKDGTGKPVQNCTIQLKAKRNSTTVVVNTLASENPDEAGRYSMDVEYGQYSV 62 Query: 61 ILLVEGFPPSHAGTITVYEDSQPGTLNDFLGAMTEDDARPEALRRFELMVEEVARNASAV 120 ILLVEGFPPSHAGTITVYEDS+PGTLNDFLGAMTEDDARPEALRRFELMVEEVARNASAV Sbjct: 63 ILLVEGFPPSHAGTITVYEDSRPGTLNDFLGAMTEDDARPEALRRFELMVEEVARNASAV 122 Query: 121 AQNTAAAKKSASDASTSAREAATHAADAADSARAASTSAGQAASSAQSASSSAGTASTKA 180 AQNTAAAKKSASDASTSAREAATHA DAADSARAASTSAGQAASSAQSASSSAGTASTKA Sbjct: 123 AQNTAAAKKSASDASTSAREAATHATDAADSARAASTSAGQAASSAQSASSSAGTASTKA 182 Query: 181 TEASKSAAAAESSKSAAATSAGAAKTSETNASASLQSAATSASTATTKASEAATSARDAA 240 TEASKSAAAAESSKSAAATSAGAAKT ETNA+AS QSAATSASTATTKASEAATSARDA+ Sbjct: 183 TEASKSAAAAESSKSAAATSAGAAKTLETNAAASQQSAATSASTATTKASEAATSARDAS 242 Query: 241 ASKEAAKSSETNASSSASSAASSATAAGNSAKAAKTSETNARSSETAAGQSASAAAGSKT 300 ASKEAAKSSETNASSSASSAASSATAA NSAKAAKTSETNARSSETAAGQSASAAAGSKT Sbjct: 243 ASKEAAKSSETNASSSASSAASSATAAANSAKAAKTSETNARSSETAAGQSASAAAGSKT 302 Query: 301 AAASSASAASTSAGQASASATAAGKSAESAASSASTATTKAGEATEQASAAARSASAAKT 360 AAASSASAASTSAGQASASATAAGKSAESAASSASTATTKAGEATEQA+AAARSASAAKT Sbjct: 303 AAASSASAASTSAGQASASATAAGKSAESAASSASTATTKAGEATEQATAAARSASAAKT 362 Query: 361 SETNAKASETSAESSKTAAASSASSAASSASSASASKDEATRQASAAKSSATTASTKATE 420 SETNAKASET AESSKTAAASSASSAASSASSASASKDEATRQASAAK SATTASTKATE Sbjct: 363 SETNAKASETRAESSKTAAASSASSAASSASSASASKDEATRQASAAKGSATTASTKATE 422 Query: 421 AAGSATAAAQSKSTAESAATRAETAAKRAEDIASAVALEDASTTKKGIVQLSSATNSTSE 480 AAGSATAAAQSKSTAESAATRAETAAKRAEDIASAVALEDASTTKKGIVQLSSATNSTSE Sbjct: 423 AAGSATAAAQSKSTAESAATRAETAAKRAEDIASAVALEDASTTKKGIVQLSSATNSTSE 482 Query: 481 TLAATPKAVKSAYDNAEKRLQKDQNGADIPDKGCFLNNINAVSKTDFADKRGMRYVRVNA 540 +LAATPKAVK+AYD A + + N+ S+T A + ++ NA Sbjct: 483 SLAATPKAVKAAYDLANGKYTAQDATTAQKGIIQLSSATNSTSETLAATPKAVKAANDNA 542 Query: 541 PAGATSGKYYPVVVMRSAGSVSELASRVIITTATRTAG 578 + + + 
+ + A R + + T G Sbjct: 543 EKRLQKDQNGADIPGKDTFTKNIGACRAFGGSVSTTTG 580 >UniRef50_C6DA10 Tail Collar domain protein n=7 Tax=Enterobacteriaceae RepID=C6DA10_PECCP Length = 689 Score = 205 bits (520), Expect = 9e-51, Method: Composition-based stats. Identities = 97/445 (21%), Positives = 158/445 (35%), Gaps = 41/445 (9%) Query: 706 KKLFATDEIVARGGNQIRMIGGEYGALWRNDGAKTYLLLTNQGDVYGGWNTLRPFAIDNA 765 L E+V G G Y + T + +G++ + L I A Sbjct: 256 STLVYRGELVNHGTFAACNREGVYRVALSDGNTVTDMPRNIRGEILYSYGFLFVNEIGGA 315 Query: 766 TGELVIGTKLSASLNGNALTATKLQTPRRVSGVEFDGSKDITLTAAHVAAFARRATDTYA 825 ++ + + A + V DG + +A + AF T T Sbjct: 316 ISQMYLPHRGPV-----ATRQNWDGSYSLGWNVTLDGLTN-KPSATDIGAFPLGFTGTVN 369 Query: 826 DADGGVPWNAESGAYNVTRSGDSYILVNFYTGVGSCRTLQMKAHYRNGGLFYRSSRDGYG 885 + + V W+A SG Y G +L++F+ SC +LQ Y NGGL YR++RDG G Sbjct: 370 NDE--VAWDANSGVYRAQYPGAGQMLIHFHGAGASCPSLQFLGEYGNGGLSYRTARDGMG 427 Query: 886 FEEDWAEVYTSKNLPPESYPVGAPIPWPSDTVPSGYALMQGQ-----------AFDKSAY 934 FE WA++YT++ P + VGA +P + G + G +Y Sbjct: 428 FEHSWAKIYTTQFKPTAA-DVGA-LPIAGGALQGGIRIGAGNIDLPARRAVVGVMPDESY 485 Query: 935 PKLAAAYPSGVIPDMRGWTIKGKPASGRAVLSQEQDGIKSHTHSASASSTDLGTKTTSS- 993 ++ + P + + + R ++ +S H + + +G S Sbjct: 486 RQMLSLSPDNTVVFGNPNSSAVIHTTDRVYIAAAGGAWRSVYHEGNLTPAAIGAMPASEL 545 Query: 994 ------FDYGTKSTNNTGAHTHSVSGSTNSAGAHTH------SLANVNTASANSGAGSAS 1041 F T + S S A + L + G G+ + Sbjct: 546 AGIPLPFPGAVAPTGWLKCNGQSFDKSQYPILASRYPSGVLPDLRGEFVRGWDDGRGADA 605 Query: 1042 TRL--SVVHNQNYATSSAGAHTHSLSGTAASAGAHAHTVGIGAHTHSVAIGSHGHTITVN 1099 +R S + + T +AG GAH+ + G+ G T + Sbjct: 606 SRALLSAQGDAIRNIVGTIGQLNDRVNTTETAGVFDANKYTGAHS-GLTGGNGGRIATFD 664 Query: 1100 AAG----NAENTVKNIAFNYIVRLA 1120 A+ AEN +NIAFNYIVR A Sbjct: 665 ASKVVPTAAENRPRNIAFNYIVRAA 689 >UniRef50_B7LKX7 Putative side tail phage protein n=2 Tax=Escherichia RepID=B7LKX7_ESCF3 Length = 567 Score = 204 bits (519), Expect = 1e-50, Method: Composition-based stats. 
Identities = 145/398 (36%), Positives = 199/398 (50%), Gaps = 32/398 (8%) Query: 733 WRNDGAKTYLLLTNQGDVYGGWNTLRPFAIDNATGELVIGTKLSASLNGNALTATKLQTP 792 + +D KT+ N D + + + A L +K T Sbjct: 192 YVDDALKTHQQSRNHPDATLTQKGFTQLSNATNSDDETKAATPKAVKTAYDLANSKAATS 251 Query: 793 RRVSGVEFDGSKDITLTAAHVAAFARRATDTYADADGGVPWNAESGAYNVTRSGDSYILV 852 + + G D TLT V T++ + + P ++ + S Sbjct: 252 HNHAWSQITGIPDGTLTQKGVVKLNN-TTNSTSTTEAATPSAVKAAMDKAIAAAPSSHTH 310 Query: 853 NFYTGVGSCRTLQMKAHYRNGGLFYRSSRDGYGFEEDWAEVYTSKNLPPESYPVGAPIPW 912 + G + S+ + + K + ES PVG PIPW Sbjct: 311 AWGQITGIPDGTLTQKGVVKLNNATNSTSTTEAATPNAVKAAMDKAIAAESCPVGMPIPW 370 Query: 913 PSDTVPSGYALMQGQAFDKSAYPKLAAAYPSGVIPDMRGWTIKGKPASGRAVLSQEQDGI 972 PSD+VPSGYALM GQ F+K++YPKLA AYPSGVIPDMRGW IKGKP+SGRA+LS E DG+ Sbjct: 371 PSDSVPSGYALMTGQTFNKTSYPKLAIAYPSGVIPDMRGWIIKGKPSSGRAILSTELDGV 430 Query: 973 KSHTHSASAS----------STDLGTKTTSSFDYGTKSTNNTGAHTHSVSGSTNSAGAHT 1022 KSH H+ S S STDLGTKTT+SF++G+++T+ +G HTH + + + G Sbjct: 431 KSHNHTGSISSTNLGTITSTSTDLGTKTTASFNHGSRNTSTSGEHTHRIP-TDGAEGKDG 489 Query: 1023 HSLANVNTASANSGAGSASTRLSVVHNQNYATSSAGAHTHSLSGTAASAGAHAHTVGIGA 1082 SL N + N T SAG+H HS+ + GAHAHT+ +G+ Sbjct: 490 PSLWNSPNSDENY---------------REPTESAGSHYHSI-----TIGAHAHTIALGS 529 Query: 1083 HTHSVAIGSHGHTITVNAAGNAENTVKNIAFNYIVRLA 1120 HTH++ +G+H H+I +N GN ENTVKNIAFNYIVRLA Sbjct: 530 HTHNIVLGTHNHSIIINNTGNTENTVKNIAFNYIVRLA 567 >UniRef50_Q6KGF6 Putative tail fiber protein GP37 n=2 Tax=unclassified Myoviridae RepID=Q6KGF6_9CAUD Length = 782 Score = 204 bits (518), Expect = 1e-50, Method: Composition-based stats. 
Identities = 165/608 (27%), Positives = 250/608 (41%), Gaps = 113/608 (18%) Query: 552 VVVMRSAGSVSELASRVIITTATRTAGDPMN-NCEFNGFVMPGGWTDRGRYAYGMFWQYQ 610 V + ++ +L + + T + +N + G V P +D F Q Sbjct: 187 VGISKNGSDAVQLYNNKYDSALTIASNISVNKSLAITGQVQP---SDFSNLDARYFTQTV 243 Query: 611 NNERAIHSIMMSN--KGDDLRSVFYVDGAAFPVFAFIEDGLSISAPGADLVVNDTTYKFG 668 N+R +N + + + + S+ G D T ++ Sbjct: 244 ANQRFAQLAANNNFTGTNTFSRNLTIISDSAALRLKNATSSSLFVQGVDS---QNTNRWY 300 Query: 669 ATNPATECIAADVILDFKSGRGFYE-SHSLIVNDNLSCKKLFATDEI------------- 714 N + A+ ++ ++ G + + VN N + Sbjct: 301 VGN--GDNTASVLLHNYVHGSNIRLDNGYISVNQNFRITGQVQPSDFSNIDSRYIPAATL 358 Query: 715 --VAR-------GGNQIRMIGGEYGALWRNDG--------AKTYLLLTNQGDVYGGWNTL 757 +AR G Q + GE G + +N K ++ G N+ Sbjct: 359 STIARTNAQNTFNGAQTVVSDGE-GLVIKNSTQNRPLYIRGKDAANVSRWWLGVGDPNST 417 Query: 758 RPFAIDNATGELVIGTKLSASLNGNALTATKLQTPRRVSGVEFDGSKDITLTAAHVAAFA 817 ++ +G +I SAS+N A ++Q P S ++ T + ++ A++ Sbjct: 418 DVALNNSFSGTQLILGNSSASINKTLTLAGQIQ-PSDFSNLDARYYTQSTANSRYMLAYS 476 Query: 818 RRATDTYADADGGVPWNAESGAYNVT--RSGDSYILVNFYTGVGSCRTLQMKAHYRNGGL 875 D+DG + WNA++G YNVT G + ++ Y G S + Q+K +YRNGG Sbjct: 477 SGTGTEVGDSDG-IAWNAKTGLYNVTGYSGGSTQLVFQMYQGASSTPSAQLKFNYRNGGF 535 Query: 876 FYRSSRDGYGFEEDWAEVYTSKNLPPES---------------------------YPVGA 908 +YRSSRDG+GFEED+ ++YT K P S YPVG Sbjct: 536 WYRSSRDGFGFEEDFTQIYTEKYKPTPSAIGAYTKAETDQKIAEAISDSTDLNKIYPVGI 595 Query: 909 PIPWPSDTVPSGYALMQGQAFDKSAYPKLAAAYPSGVIPDMRGWTIKGKPASGRAV---- 964 + S+ P+ +A P L Y + + G TI+ A+G V Sbjct: 596 VTWFNSNVNPN------------TALPGLTWTYLNNGV----GRTIRIAAANGSDVATTG 639 Query: 965 ----LSQEQDGIKSHTHSASASSTDLGTKTTSSFDYGTKSTNNTGAHTHSVSGSTNSAGA 1020 ++ + SHTHS SA TTSSFDYGTK+TN TGAHTHSVSGSTN+ GA Sbjct: 640 GSDSVTLSVGNLPSHTHSFSA--------TTSSFDYGTKTTNTTGAHTHSVSGSTNNTGA 691 Query: 1021 HTHSLANVNTASANSGAGSASTRLSV-VHNQNYATSSAGAHTHSLSGTAASAGAHAHTVG 1079 HTH+ G S + V V +S AG H+H++ GTAAS G HAHTVG Sbjct: 692 HTHTFGG------RYGGDSIGGKHRVHVSGTEQVSSVAGDHSHTVYGTAASNGNHAHTVG 745 Query: 1080 IGAHTHSV 1087 IGAH+H+V Sbjct: 
746 IGAHSHTV 753 >UniRef50_A9DEM1 Tail protein 2 n=1 Tax=Yersinia phage PY100 RepID=A9DEM1_9CAUD Length = 255 Score = 202 bits (512), Expect = 7e-50, Method: Composition-based stats. Identities = 102/219 (46%), Positives = 125/219 (57%), Gaps = 32/219 (14%) Query: 902 ESYPVGAPIPWPSDTVPSGYALMQGQAFDKSAYPKLAAAYPSGVIPDMRGWTIKGKPASG 961 + PVGAP+ WPSDT P G+ALM GQ FDK YP LA YPSGV+PDMRG IK KP G Sbjct: 69 NATPVGAPLAWPSDTAPDGWALMIGQTFDKVKYPLLAKVYPSGVLPDMRGRVIKAKPD-G 127 Query: 962 RAVLSQEQDGIKSHTHSASASSTDLGTKTTSSFDYGTKSTNNTGAHTHSVSGSTNSAGAH 1021 RAVLS E+D +KSHTH+ A++ GT+ TS+FD+G K T G HTH G+ Sbjct: 128 RAVLSLEEDQVKSHTHTGKAATAG-GTRATSTFDHGNKRTTTNGNHTHGSPQGARHGGSG 186 Query: 1022 THSLANVNTASANSGAGSASTRLSVVHNQNYATSSAGAHTHSLSGTAASAGAHAHTVGIG 1081 ++ + T S N+ +SA AG H H V IG Sbjct: 187 QYTSGDDETNSV----------------FNWPATSA-------------AGDHFHDVQIG 217 Query: 1082 AHTHSVAIGSHGHTITVNAAGNAENTVKNIAFNYIVRLA 1120 H H+V I +H HT+ ++A G ENTVKNIA NYIVRLA Sbjct: 218 PHNHNVDI-NHEHTLQIDATGGTENTVKNIAMNYIVRLA 255 >UniRef50_B3X2T1 Side tail fiber protein n=1 Tax=Shigella dysenteriae 1012 RepID=B3X2T1_SHIDY Length = 488 Score = 199 bits (506), Expect = 3e-49, Method: Composition-based stats. 
Identities = 207/405 (51%), Positives = 235/405 (58%), Gaps = 19/405 (4%) Query: 735 NDGAKTYLLLTNQGDVYGGWNTLRPFAIDNATGELVIGTKLSASLNGNALTATKLQTPRR 794 N + G TL P I + + + +T + + Sbjct: 84 NANGRVPSARKVNGKALSADITLTPKDIGTLNSTTMSFSGGAGWFKLATVTMPQASSVVS 143 Query: 795 VSGVEFDGSKDITLTAAHVAAFARRATDTYADADGGVPWNAESGAYN----VTRSGDSYI 850 ++ + G + A ++ RA + G W S + V SGD+Y Sbjct: 144 ITLIGGAGFNVGSPQQAGISELVLRAGNGNPKGITGALWQRTSTGFTNFAWVNTSGDTYD 203 Query: 851 LVNFYTGVGSCRTLQMKAHYRNGGLFYRSSRDGYGFEEDWAE--VYTSKNLPPESYPVGA 908 + + +Q + S E + VY+ + YP GA Sbjct: 204 IYVAIGNYATGVNIQWDYTSNASVTIHTSPAYSANKPEGLTDGTVYSLYTPSEQFYPPGA 263 Query: 909 PIPWPSDTVPSGYALMQGQAFDKSAYPKLAAAYPSGVIPDMRGWTIKGKPASGRAVLSQE 968 PIPWPSDTVPSGYALMQGQ FDKSAYPKLA AYPSGVIPDMRGWTIKGKPASGRAVLSQE Sbjct: 264 PIPWPSDTVPSGYALMQGQTFDKSAYPKLAVAYPSGVIPDMRGWTIKGKPASGRAVLSQE 323 Query: 969 QDGIKSHTHSASASSTDLGTKTTSSFDYGTKSTNNTGAHTHSVSGSTNSAGAHTHSLANV 1028 QDGIKSHTHSASASSTDLGTKTTSSFDYGTKSTNNTGAHTHS+SG+ NSAGAH H + Sbjct: 324 QDGIKSHTHSASASSTDLGTKTTSSFDYGTKSTNNTGAHTHSISGTANSAGAHQHKSSGA 383 Query: 1029 NTA-------------SANSGAGSASTRLSVVHNQNYATSSAGAHTHSLSGTAASAGAHA 1075 S S ++T S TSS GAHTHSLSGTAASAGAHA Sbjct: 384 FGGTNTSIFPNGYTAISNLSAGIMSTTSGSGQTRNAGKTSSDGAHTHSLSGTAASAGAHA 443 Query: 1076 HTVGIGAHTHSVAIGSHGHTITVNAAGNAENTVKNIAFNYIVRLA 1120 HTVGIGAHTHSVAIGSHGHTITVNAAGNAENTVKNIAFNYIVRLA Sbjct: 444 HTVGIGAHTHSVAIGSHGHTITVNAAGNAENTVKNIAFNYIVRLA 488 >UniRef50_B4TP26 Side tail fiber protein n=43 Tax=Salmonella enterica RepID=B4TP26_SALSV Length = 892 Score = 199 bits (504), Expect = 7e-49, Method: Composition-based stats. 
Identities = 255/541 (47%), Positives = 319/541 (58%), Gaps = 4/541 (0%) Query: 1 MAVKISGVLKDGTGKPVQNCTIQLKAKRNSTTVVVNTLASENPDEAGRYSMDVEYGQYSV 60 M V ISGVLKD TG PVQNCTIQLKA R STTVVVNT+ASENPD+AGRYSMDVE GQY+V Sbjct: 1 MPVLISGVLKDATGTPVQNCTIQLKACRTSTTVVVNTVASENPDDAGRYSMDVEQGQYTV 60 Query: 61 ILLVEGFPPSHAGTITVYEDSQPGTLNDFLGAMTEDDARPEALRRFELMVEEVARNASAV 120 LLVEG+PPSHAG ITVY+DS+PGTLNDFLGAMTEDD RPEALRRFE MVEEVAR AS Sbjct: 61 TLLVEGYPPSHAGVITVYDDSKPGTLNDFLGAMTEDDVRPEALRRFEAMVEEVARQASEA 120 Query: 121 AQNTAAAKKSASDASTSAREAATHAADAADSARAASTSAGQAASSAQSASSSAGTASTKA 180 ++N +A +++ A TSA +AA A A ++A AA SA QAASSA SA SSAGTA+TKA Sbjct: 121 SRNATSAGQASEQAQTSAGQAAESATAAVNAAGAAEASATQAASSAASAESSAGTATTKA 180 Query: 181 TEASKSAAAAESSKSAAATSAGAAKTSETNASASLQSAATSASTATTKASEAATSARDAA 240 EAS SAA+A+++++AAA SA AAKTSE NA AS +A SA+ A A+ A TSA A Sbjct: 181 GEASASAASADTARTAAAASAAAAKTSEANADASRTAAGDSAAAAAASATAAQTSAARAG 240 Query: 241 ASKEAAKSSETNASSSASSAASSATAAGNSAKAAKTSETNARSSETAAGQSASAAAGSKT 300 AS+ AAK SET A+SSA A +SATAA S KAA S A+ SET A SAS AA S T Sbjct: 241 ASETAAKMSETQAASSAGDAGASATAAAASEKAAAASAAAAKISETNAATSASTAAASAT 300 Query: 301 AAASSASAASTSAGQASASATAAGKSAESAASSASTATTKAGEATEQASAAARSASAAKT 360 AA+SSAS AS A + SA+ A +S+ +A ++A+ A A A + A + ++ Sbjct: 301 AASSSASEASNHAAASDTSASLAAQSSTAAGAAATRAEDAAKRAEDIADVISLEDASLTK 360 Query: 361 SETNAKASETSAESSKTAAASSASSAASSASSASASKDEATRQASAAKSSATTASTKATE 420 +S T ++S AA A A D + + + Sbjct: 361 KGIVKLSSATDSDSEALAATPKAVKTVMGEVQTKAPLDSPAFTGTPTTPTPPDDAKGLQT 420 Query: 421 AAGS---ATAAAQSKSTAESAATRAETAAKRAEDIASAVALEDASTTKKGIVQLSSATNS 477 A AA S ES T E A D + A + + K+ + +A + Sbjct: 421 ANAEFVRKLIAALVGSVPESLDTLQELADALGNDPSFATTVLNKLAGKQPLDDTLTALSG 480 Query: 478 TSETLAATPKAVKSAYDNAEKRLQKDQNGADIPDKGCFLNNINAVSKTDFADKRGMRYVR 537 S ++ ++A L K QNG DI DK F I AV+ T + + + Sbjct: 481 KSVDGLIEYVGLRETINHAADALLKSQNGGDIQDKKQFARTIGAVTSTTISL-GESGWFK 539 Query: 538 V 538 + Sbjct: 540 I 540 >UniRef50_C7BL21 Similarities with phage tail fiber protein n=1 Tax=Photorhabdus asymbiotica subsp. 
asymbiotica ATCC 43949 RepID=C7BL21_PHOAA Length = 452 Score = 196 bits (498), Expect = 3e-48, Method: Composition-based stats. Identities = 71/244 (29%), Positives = 101/244 (41%), Gaps = 30/244 (12%) Query: 761 AIDNATGELVIGTKLSASLNGNALTAT------------KLQTPRRVSGVEFDGSKDITL 808 DNA L + + NA + + R+++G G D++L Sbjct: 141 VNDNANSRLAKNQNGADIPDKNAFVKNLGLSETVNKANNAVPSSRKINGKALSG--DVSL 198 Query: 809 TAAHVAAFARRATDTYADADGGVPWNAESGAYNVTRSGDSYI--------LVNFYTGVGS 860 +A V A + T ++ + SG V + + + F Sbjct: 199 SAGDVGAISVNPLSTLTESKKFQNF-LSSGFILVNVPNTATVNDFPFPTRVYGFGILEVR 257 Query: 861 CRTLQMKAHYRN--GGLFYRSSRDGYGFEEDWAEVYTSKNLPPESYPVGAPIPWPSDTVP 918 + + Y + G + R S + W VY+S LPPE +PVGAPIP+P P Sbjct: 258 SSGVVIYQKYTSHHGEVVIRQSWNSGKTWIGWDIVYSSAILPPEQHPVGAPIPYPHRYTP 317 Query: 919 SGYALMQGQAFDKSAYPKLAAAYPSGVIPDMRGWTIKGKPAS-----GRAVLSQEQDGIK 973 GY GQ FDKS YPKLA AYPSG +PD+RG I+G S GR S + K Sbjct: 318 VGYLTCNGQTFDKSLYPKLAEAYPSGRVPDLRGEFIRGWDDSRGVDPGRVCGSWQDSDNK 377 Query: 974 SHTH 977 +H H Sbjct: 378 AHIH 381 Score = 76.0 bits (184), Expect = 9e-12, Method: Composition-based stats. Identities = 33/131 (25%), Positives = 53/131 (40%), Gaps = 7/131 (5%) Query: 443 ETAAKRAEDIASAVALEDASTTKKGIVQLSSATNSTSETLAATPKAVKSAYDNAEKRLQK 502 + ++ D + + + L + + AT A+K+ DNA RL K Sbjct: 92 KENREQQPDHSQSAWKPMSDFIGAPKSVLDALNTKQDKGDYATNSALKTVNDNANSRLAK 151 Query: 503 DQNGADIPDKGCFLNNINAVSKTDFADKRGMRYVRVNAPAGATSGKYYPVVVMRSAGSVS 562 +QNGADIPDK F+ N+ + A+ ++N GK V SAG V Sbjct: 152 NQNGADIPDKNAFVKNLGLSETVNKANNAVPSSRKIN-------GKALSGDVSLSAGDVG 204 Query: 563 ELASRVIITTA 573 ++ + T Sbjct: 205 AISVNPLSTLT 215 >UniRef50_A4TT73 Phage tail protein n=30 Tax=Enterobacteriaceae RepID=A4TT73_YERPP Length = 962 Score = 196 bits (496), Expect = 5e-48, Method: Composition-based stats. 
Identities = 263/887 (29%), Positives = 392/887 (44%), Gaps = 63/887 (7%) Query: 265 TAAGNSAKAAKTSETNARSSETAAGQSASAAAGSKTAAASSASAASTSAGQASASATAAG 324 TA + S T +A T A + +AA A +A+ A AA Sbjct: 108 TAVDEVTLEREDGTEVTVKSLTQIVDEHNANQKWYTDNADAINAAGEKAREAAERALAAA 167 Query: 325 KSAESAASSASTATTKAGEATEQASAAARSASAAKTSETNAKASETSAESSKTAAASSAS 384 +S+ A + A A + A+E +AA SA+A+K SE A S S+++S +AA +S Sbjct: 168 QSSSEARAKADEAAQSSASASEYKTAAELSAAASKASEHGAAESAASSKASASAAKTSED 227 Query: 385 SAASSASSASASKDEATRQASAAKSSATTASTKATEAAGSATAAAQSKSTAESAATRAET 444 ++A+S ++A+ SK A AS++ +SA+ A A A S AAA S++ A ++ A T Sbjct: 228 NSAASETNAAESKAAAALSASSSANSASEALQYAESAKTSKEAAAASEAAAANSENEART 287 Query: 445 AAKRAEDIASAVALEDA--STTKKGIVQLSSATNSTSETLAATPKAVKSAYDNAEKRLQK 502 + A A+ + ++ + + + + + A + + A+ Sbjct: 288 SKDTAVAAAAEASANATSADASRHDVDTNKAEVSRMKDEVFAARDSTIQYSEEAKTAADT 347 Query: 503 DQNGADIPDKGCFLNNINAVSKTDFADKRGMRYVRVNAPAGATSGKYYPVVVMRSAGSVS 562 A L+ + + ++ + + +A + + + S Sbjct: 348 AAREAATKTSDQLLSAVKSEAEKANSASASAQGFADDAKRFRDEAQEIAEGSKVNDATTS 407 Query: 563 ELASRVIITTATRTAGDPMNNCEFNGFVMPGGWTDRGRYAYGMFWQYQNNERAIHSIMMS 622 + + + + + + VM D + + + ++ Sbjct: 408 QQGVVQLSSATDSESETLASTPKAVKTVM-----DAVALKAPIDSPALSGAPTAPTPAIT 462 Query: 623 NKGDDLRSVFYVDGAAFPVFAFIEDGLSISAPGADLVVNDTTYKFGATNPATECIAADVI 682 G ++ + +V + + L A + ND + TN D Sbjct: 463 AAGREIATAAFVASKVAQLVGSAPEALDTLNELAAALGNDPNFATTITNMLARKQPLDGT 522 Query: 683 LDFKSGR---GFYESHSLIVNDNLSCKKLFATDEIVARGGNQIRMIGGEYGALWRNDGAK 739 L SGR G + L+ NL+ + + R+ GA+ + Sbjct: 523 LTALSGRSPQGVIDYLGLLNTVNLAAGSIQKSQN--GADIPDKRLFVKNIGAV-----SS 575 Query: 740 TYLLLTNQGDVYGGWNTLRPFAIDNATGELVIGTKLSASLNGNALTATKLQTPRRVSGVE 799 + + Y P A L+ G +A L A + + + V Sbjct: 576 ARISFVKESGWYKLATVTMPQGASTALITLIGGAGYNAGLYDQAAISEIVLRSGNWNPVG 635 Query: 800 FDGSKDITLTAAHVAAFARRATDTYADADGGVPWNAESGAYNVTRSGDSYILVNFYTGVG 859 + A + D V S ++ Sbjct: 636 ITATLWQRSPAGAQGVAWINTSGDVYD-------------IYVNVGQYSIDVIALSDCTN 682 Query: 860 SCRTLQMKAHYRNGGLFYRSSRDGYGFEEDWAEVYTSKNLPPESYPVGAPIPWPSDTVPS 919 + + G Y 
+++ +Y+S PPESYPVGAPIPWP+D PS Sbjct: 683 NASIVLF------GTPEYVATKPASSTNGANYILYSSVLPPPESYPVGAPIPWPNDVAPS 736 Query: 920 GYALMQGQAFDKSAYPKLAAAYPSGVIPDMRGWTIKGKPASGRAVLSQEQDGIKSHTHSA 979 G+A+MQGQ FDKS YPKLAAAYPSGV+PDMRGW IKGKP S RAVLS EQDGIKSH H+A Sbjct: 737 GFAIMQGQTFDKSVYPKLAAAYPSGVLPDMRGWMIKGKPTS-RAVLSLEQDGIKSHAHNA 795 Query: 980 SASSTDLGTKTT----------SSFDYGTKSTNNTGAHTHSVSGSTNSAGAHTHSL---- 1025 +ASSTDLGTK T S FDYGTKS+N+TGAH HS+SGST+S+GAH H++ Sbjct: 796 AASSTDLGTKPTTTFDYGTKTSSGFDYGTKSSNSTGAHAHSLSGSTSSSGAHAHTVTAHT 855 Query: 1026 ---ANVNTASANSGAGSASTRLSVVHNQNYATSSAGAHTHSLSGTAASAGAHAHTVGIGA 1082 + ++ + N+ +T+ + + N TSSAG H HS+SGTA SAGAHAHTVGIGA Sbjct: 856 QYPRSTDSRNQNAVGKQYNTQQTTANAFNVWTSSAGDHAHSISGTAVSAGAHAHTVGIGA 915 Query: 1083 ---------HTHSVAIGSHGHTITVNAAGNAENTVKNIAFNYIVRLA 1120 H+HSVAIG+H HTIT+ A GNAENTVKNIA+NYIVRLA Sbjct: 916 HAHSLSIGSHSHSVAIGAHSHTITIAACGNAENTVKNIAYNYIVRLA 962 Score = 88.3 bits (216), Expect = 2e-15, Method: Composition-based stats. Identities = 121/505 (23%), Positives = 209/505 (41%), Gaps = 12/505 (2%) Query: 79 EDSQPGTLNDFLGAMTEDDARPEALRRFELMVEEVARNASAVAQNTAAAKKSASDASTSA 138 ED T+ + E +A + + A A+ AA +S+S+A A Sbjct: 118 EDGTEVTVKSLTQIVDEHNANQKWYTDNADAINAAGEKAREAAERALAAAQSSSEARAKA 177 Query: 139 REAATHAADAADSARAASTSAGQAASSAQSASSSAGTASTKATEASKSAAAAESSKSAAA 198 EAA +A A++ AA SA + +S A+ SA ++ A+ A S + +S++ AA Sbjct: 178 DEAAQSSASASEYKTAAELSAAASKASEHGAAESAASSKASASAAKTSEDNSAASETNAA 237 Query: 199 TSAGAAKTSETNASASLQSAATSASTATTKASEAATSARDAAASKEAAKSSETNASSSAS 258 S AA S ++++ S A A +A T AA S AA S+ A++S+ A ++A+ Sbjct: 238 ESKAAAALSASSSANSASEALQYAESAKTSKEAAAASEAAAANSENEARTSKDTAVAAAA 297 Query: 259 SAASSATAAGNSAKAAKTSETNAR--SSETAAGQSASAAAGSKTAAASSASAASTSAGQA 316 A+++AT+A S T++ E A + ++ + A+ +A + + Sbjct: 298 EASANATSADASRHDVDTNKAEVSRMKDEVFAARDSTIQYSEEAKTAADTAAREAATKTS 357 Query: 317 SASATAAGKSAESAASSASTATTKAGEATEQASAAARSASAAK-TSETNAKASETSAESS 375 +A AE A S++++A A +A A A +K T ++ S+ Sbjct: 358 DQLLSAVKSEAEKANSASASAQGFADDAKRFRDEAQEIAEGSKVNDATTSQQGVVQLSSA 417 
Query: 376 KTAAASSASSAASSASSASASKDEATRQASAAKSSATTASTKATEAAGSATA-------- 427 + + + +S + + + S A S A TA T A AAG A Sbjct: 418 TDSESETLASTPKAVKTVMDAVALKAPIDSPALSGAPTAPTPAITAAGREIATAAFVASK 477 Query: 428 -AAQSKSTAESAATRAETAAKRAEDIASAVALEDASTTKKGIVQLSSATNSTSETLAATP 486 A S E+ T E AA D A + + K+ + +A + S Sbjct: 478 VAQLVGSAPEALDTLNELAAALGNDPNFATTITNMLARKQPLDGTLTALSGRSPQGVIDY 537 Query: 487 KAVKSAYDNAEKRLQKDQNGADIPDKGCFLNNINAVSKTDFADKRGMRYVRVNAPAGATS 546 + + + A +QK QNGADIPDK F+ NI AVS + + + ++ Sbjct: 538 LGLLNTVNLAAGSIQKSQNGADIPDKRLFVKNIGAVSSARISFVKESGWYKLATVTMPQG 597 Query: 547 GKYYPVVVMRSAGSVSELASRVIIT 571 + ++ AG + L + I+ Sbjct: 598 ASTALITLIGGAGYNAGLYDQAAIS 622 >UniRef50_C6DE08 Tail Collar domain protein n=1 Tax=Pectobacterium carotovorum subsp. carotovorum PC1 RepID=C6DE08_PECCP Length = 682 Score = 188 bits (475), Expect = 1e-45, Method: Composition-based stats. Identities = 131/625 (20%), Positives = 210/625 (33%), Gaps = 98/625 (15%) Query: 420 EAAGSATAAAQSKSTAESAATRAETAAKRAEDIASAVALEDASTTKKGIVQLSSATNSTS 479 +A A+ K E + T A A+ + + T + AT+S S Sbjct: 34 QAKELASRTRYLKKEQEKTGSDLATHAAAADPHTQYAPKANPTFT-GTPKAPTPATDSNS 92 Query: 480 ETLAATPKAVKSAYDNAEKRLQKDQNGADIPDKGCFLNNINAV---SKTDFADKRGMRYV 536 + +A T +L KDQNGADIPD+ F N+ + S + + Sbjct: 93 QQIATTAFVRSVGAT----KLAKDQNGADIPDRELFNRNLGSSRAYSSSIPIGGSDGLWT 148 Query: 537 RVNAPAGATSGKYYPVVVMRSAGSVSELASRVIITTATRTAGDPM----------NNCEF 586 S + GS S I + T P+ N+ Sbjct: 149 TAEFIGWLESQGAFVHAYWVCRGS--WSYSHNKIISDTECGQIPLAGSVVEVMGQNDATT 206 Query: 587 NGFVMPGGWTDRGRYAYGMFWQY-----------------QNNERAIHSIMMSNKGDDLR 629 P + + Y +N A + K Sbjct: 207 IRITTPSTTPAGFSDSANAQFTYIYNGVDYSPGWRRDYNTKNKPTAADIGALPEKAVAQA 266 Query: 630 SVFYVDGAAFPVFAFIEDGLSISAPGADLVVNDTTYKFGATNPATECIAADVILDFKSGR 689 + +G+ D N T +AA + K+G Sbjct: 267 AAKLATPRTI-------NGVPF-----DGSANIALTPANLGLTETVNLAAGALEKAKNGA 314 Query: 690 GFY----ESHSLIVNDNLSCKKLFATDEIVARGGNQIRMIGGEYGALWRNDGAKTYLLLT 745 ++ + LS FA + G+Y + ND + Sbjct: 315 
DISDKTAFYSNVTLRGTLSDGMTFANCD-----------KAGDY-VVAINDPNTVSDMPL 362 Query: 746 NQGDVYGGWNTLRPFAIDNATGELVIGTKLS-----------ASLNGNALTATKLQTPRR 794 +G G+ L F N G+ I + AL+++++ T Sbjct: 363 YKGQKLYGYGVLHVFQHGNFVGQEYINHSGDYAWRQKWDDGINTPWVVALSSSRVPTAAD 422 Query: 795 VSGVEFDGSKDITLTAAHVAAFARRATDTYADADGGVPWNAESGAYNVTRSGDSYILVNF 854 V + + +V + D + W++ +GAY S ++ + Sbjct: 423 VGAI-----TKTDADSHYVHQGSSGVIYQ----DSDLAWDSPTGAYLKDNGTHSSLIWHM 473 Query: 855 YTGVGSCRTLQMKAHYRNGGLFYRSSRDGYGFEEDWAEVYTSKNLPPE--------SYPV 906 GS Q + NGG+ YRSSRD GFE+ WA +YT ++ P + V Sbjct: 474 GLNAGSASAAQFYFDFANGGIKYRSSRDNSGFEKPWARIYTDQDKPTAADIGALSLNEIV 533 Query: 907 GAPIPWPSDTVPSGYALMQGQAFDKSAYPKLAAAYPSGVIPDMRGWTIKGKPAS-----G 961 G P+PWP T PSG+ GQ FDK+ YPKLA YP+G++PD+RG I+G S G Sbjct: 534 GMPMPWPQTTAPSGWLKCNGQTFDKNIYPKLAQIYPAGILPDLRGEFIRGWDDSRGVDTG 593 Query: 962 RAVLSQEQDGIKSHTHSASASSTDL 986 R +LS + D I++ ++ + Sbjct: 594 RTLLSTQGDAIRNIVGEIWTTAANY 618 Score = 47.4 bits (110), Expect = 0.003, Method: Composition-based stats. Identities = 81/450 (18%), Positives = 134/450 (29%), Gaps = 105/450 (23%) Query: 764 NATGELVIGTKLSASLNGNALTATKLQTPRRVSGVEFDGSKDITLTAAHVA--------- 814 N + + A A KL TPR ++GV FDGS +I LT A++ Sbjct: 245 NTKNKPTAADIGALPEKAVAQAAAKLATPRTINGVPFDGSANIALTPANLGLTETVNLAA 304 Query: 815 ----------------AFARRATDTYADADG-GVPWNAESGAYNVTRSGDSYI------- 850 AF T +DG ++G Y V + + + Sbjct: 305 GALEKAKNGADISDKTAFYSNVTLRGTLSDGMTFANCDKAGDYVVAINDPNTVSDMPLYK 364 Query: 851 --------LVNFYTGVGSCRTLQMKAHYRNGGLFYRSSRDGYGFEEDWAEVYTSKNLPPE 902 +++ + Q ++ G +R D G W +S +P Sbjct: 365 GQKLYGYGVLHVFQHGNFVG--QEYINHS-GDYAWRQKWDD-GINTPWVVALSSSRVPTA 420 Query: 903 SYPVGAPIPWPSDTVPSGYALMQGQAFDKSAYPKLAAAYPSGVIPDMRG---------WT 953 + VGA +D+ + + QG + LA P+G G Sbjct: 421 A-DVGAITKTDADS----HYVHQGSSGVIYQDSDLAWDSPTGAYLKDNGTHSSLIWHMGL 475 Query: 954 IKGKPASGRAVLSQEQDGIKS----------------HTHSASASSTDLGTKTTSS---- 993 G ++ + GIK +T ++ D+G + + Sbjct: 476 NAGSASAAQFYFDFANGGIKYRSSRDNSGFEKPWARIYTDQDKPTAADIGALSLNEIVGM 535 Query: 994 
---FDYGTKSTNNTGAHTHSVSGSTNSAGAHTH------SLANVNTASANSGAGSASTRL 1044 + T + + + + A + L + G + R Sbjct: 536 PMPWPQTTAPSGWLKCNGQTFDKNIYPKLAQIYPAGILPDLRGEFIRGWDDSRGVDTGRT 595 Query: 1045 --SVVHNQNYATSS----AGAHTHSLSGTAASAGAHA----HTVGIGAHTHSVAIGSHGH 1094 S + A+ L S GA TVG A S Sbjct: 596 LLSTQGDAIRNIVGEIWTTAANYQFLGENLLSNGAFELFKEFTVGAIP---DAAGNSCPS 652 Query: 1095 TITVNAAG----NAENTVKNIAFNYIVRLA 1120 + +A+ +EN +NIAFNYIVR A Sbjct: 653 RMKFDASRIVPTASENRPRNIAFNYIVRAA 682 >UniRef50_B6IAV4 Putative phage tail fiber protein n=1 Tax=Escherichia coli SE11 RepID=B6IAV4_ECOSE Length = 590 Score = 186 bits (470), Expect = 5e-45, Method: Composition-based stats. Identities = 103/374 (27%), Positives = 167/374 (44%), Gaps = 14/374 (3%) Query: 1 MAVKISGVLKDGTGKPVQNCTIQLKAKRNSTTVVVNTLASENPDEAGRYSMDVEYGQYSV 60 M+V I G L DG G P+ C I LK++ N++ VV+ T A G YS + G+Y V Sbjct: 1 MSVLIYGALTDGAGIPMSGCHIILKSRVNTSEVVMRTEADVVTGNNGEYSFEARTGKYRV 60 Query: 61 ILLVEGFPPSHAGTITVYEDSQPGTLNDFLGAMTEDDARPEALRRFELMVEEVARNASAV 120 L G I VY+D++PGTLNDFL A E D +P+ ++RFE MV + ++A + Sbjct: 61 YLKQGWRDEYCVGDIAVYDDAKPGTLNDFLTAPDEGDLKPDVVKRFERMVAQAQQSAESA 120 Query: 121 AQNTAAAKKSASDASTSAREAATHAADAADSARAASTSAGQAASSAQSASSSAGTASTKA 180 A++ A + +DA + T A + +A A + A +AG Sbjct: 121 AESEQQAGQHVADAQKIKEDCQTLADNVQLNATAVAEDKQHVEHLAAEVEQNAGQMQQGV 180 Query: 181 TEASKSAAAAESSKSAAATSAGAAKTSETNASASLQSAATSASTATTKASEAATSARDAA 240 + + A+ + +A+SA +K + NA+ S QSA + A +AA Sbjct: 181 QSVTDAVKQAQQAADDSASSAEESKNNADNAARSEQSA--------------KSHADNAA 226 Query: 241 ASKEAAKSSETNASSSASSAASSATAAGNSAKAAKTSETNARSSETAAGQSASAAAGSKT 300 S + AKS N + + A TA + A+ NAR TA A A Sbjct: 227 RSAQNAKSHADNVAGNTLQTAQDVTATAAARDDAERFAENARQDATATACDRKATAEDVK 286 Query: 301 AAASSASAASTSAGQASASATAAGKSAESAASSASTATTKAGEATEQASAAARSASAAKT 360 +A SA+++ SA A+ A AA ++ + +G +E A+A ++ ++ Sbjct: 287 SAGESAASSEQSARVAAGYARAAEQAKNDIDVLLANTLKTSGNLSEIAAAGEQAQQESRD 346 Query: 361 SETNAKASETSAES 374 + A+ ++ Sbjct: 347 NLGLKSAATMEPQA 360 >UniRef50_A8A0A4 L-shaped tail fiber protein n=12 Tax=root 
RepID=A8A0A4_ECOHS Length = 1258 Score = 184 bits (467), Expect = 1e-44, Method: Composition-based stats. Identities = 283/332 (85%), Positives = 296/332 (89%) Query: 1 MAVKISGVLKDGTGKPVQNCTIQLKAKRNSTTVVVNTLASENPDEAGRYSMDVEYGQYSV 60 MAV+ISGVLKDG GKP+QNCTIQLKA+RNSTTVVVNT+ASENPDEAGRYSMDVEYGQYSV Sbjct: 1 MAVRISGVLKDGAGKPIQNCTIQLKARRNSTTVVVNTVASENPDEAGRYSMDVEYGQYSV 60 Query: 61 ILLVEGFPPSHAGTITVYEDSQPGTLNDFLGAMTEDDARPEALRRFELMVEEVARNASAV 120 ILLVEGFPPSHAGTITVYEDSQPGTLNDFLGAMTEDD RPEALRRFELMVEEVARNASAV Sbjct: 61 ILLVEGFPPSHAGTITVYEDSQPGTLNDFLGAMTEDDVRPEALRRFELMVEEVARNASAV 120 Query: 121 AQNTAAAKKSASDASTSAREAATHAADAADSARAASTSAGQAASSAQSASSSAGTASTKA 180 AQNTAAAKKSASDA TSAREAATHA DAADSARAASTSAGQAASSAQSASSSAGTASTKA Sbjct: 121 AQNTAAAKKSASDARTSAREAATHATDAADSARAASTSAGQAASSAQSASSSAGTASTKA 180 Query: 181 TEASKSAAAAESSKSAAATSAGAAKTSETNASASLQSAATSASTATTKASEAATSARDAA 240 TEASKSAAAAESSKSAAATSAGAAKTSETNA+AS +SAATSASTATTKASEAATSARDA+ Sbjct: 181 TEASKSAAAAESSKSAAATSAGAAKTSETNAAASQKSAATSASTATTKASEAATSARDAS 240 Query: 241 ASKEAAKSSETNASSSASSAASSATAAGNSAKAAKTSETNARSSETAAGQSASAAAGSKT 300 ASK AAKSSET+A+SSA SAASSATAAGNSAKAAKTSE NA +S AA S +A+A S T Sbjct: 241 ASKVAAKSSETSAASSAGSAASSATAAGNSAKAAKTSEMNADNSAQAAADSQTASANSAT 300 Query: 301 AAASSASAASTSAGQASASATAAGKSAESAAS 332 AA S + A S A S T A S A Sbjct: 301 AAKKSETNAKNSESAAKVSETNAKASENKAKE 332 >UniRef50_A3GRX3 Tail fiber like-protein n=1 Tax=Vibrio cholerae NCTC 8457 RepID=A3GRX3_VIBCH Length = 182 Score = 180 bits (455), Expect = 3e-43, Method: Composition-based stats. 
Identities = 71/219 (32%), Positives = 101/219 (46%), Gaps = 51/219 (23%) Query: 902 ESYPVGAPIPWPSDTVPSGYALMQGQAFDKSAYPKLAAAYPSGVIPDMRGWTIKGKPASG 961 + +PVG IPW +D P G+ + +GQAFD + Y +LA +P+G+IPDMRG + GK G Sbjct: 15 KIFPVGGAIPWFTDVAPEGFGMFKGQAFDVNVYTELAKVFPNGIIPDMRGCGVIGKED-G 73 Query: 962 RAVLSQEQDGIKSHTHSASASSTDLGTKTTSSFDYGTKSTNNTGAHTHSVSGSTNSAGAH 1021 AV + E+ +K+H H S T SS D G+K+T N G HTH + G+H Sbjct: 74 EAVGAYEEGQVKNHGHPNS---------TVSSIDLGSKNTANGGNHTHFSGIAAFGGGSH 124 Query: 1022 THSLANVNTASANSGAGSASTRLSVVHNQNYATSSAGAHTHSLSGTAASAGAHAHTVGIG 1081 + N TS+AG H HS Sbjct: 125 RY------------------QTDVNGSGGNINTSAAGNHYHS------------------ 148 Query: 1082 AHTHSVAIGSHGHTITVNAAGNAENTVKNIAFNYIVRLA 1120 + +GSH H +T+ G +NT+ + N+IVRLA Sbjct: 149 -----IPMGSHAHAVTIALFGALKNTINHRKINWIVRLA 182 >UniRef50_A4WEL3 Phage Tail Collar domain protein n=2 Tax=Enterobacteriaceae RepID=A4WEL3_ENT38 Length = 340 Score = 179 bits (454), Expect = 4e-43, Method: Composition-based stats. Identities = 74/288 (25%), Positives = 107/288 (37%), Gaps = 39/288 (13%) Query: 738 AKTYLLLTNQG-------DVYGGWNTLRPFAIDNATGELVIGTKLSASLNGNALTATKLQ 790 AK + +LTNQG G L A+ + G L L K Sbjct: 3 AKYFAILTNQGAAKLANATALGTKLNLTQLAVGDGNGFLPTPDPAQTR-----LINQKRI 57 Query: 791 TPRRVSGVEFDGSKDI---TLTAAHVAAFARRATDTYADADG--GVPWNAESGAYNVTRS 845 P + V+ + S I + + F R Y D V E+ + Sbjct: 58 APLNMLSVDPNNSSQIIAEQIIPENEGGFWIREIGLYDDDGVLIAVANCPETYKPQLQEG 117 Query: 846 GDSYILVNFYTGVGSCRTLQMKAHYRNGGLFYRSSRDG------YGFEEDWAEVYTSKNL 899 + V + + +K + L R DG + A + N Sbjct: 118 SGRTQTIRMILIVSATSAITLKID-PSVVLATRRFVDGKVTEVKMYADSVLAAHVDAANP 176 Query: 900 PPESY---------PVGAPIPWPSDTVPSGYALMQGQAFDKSAYPKLAAAYPSGVIPDMR 950 P+ PVG P+PWP T P G+ G FDK YPKLA AYPSG++PD+R Sbjct: 177 HPQYLKTAEIDNYLPVGFPLPWPQATPPQGWLKCNGAPFDKVKYPKLAVAYPSGLLPDLR 236 Query: 951 GWTIKGKP-----ASGRAVLSQEQDGIKSHTHSAS-ASSTDLGTKTTS 992 G I+G SGR L+ + D ++ T +AS ++T +TS Sbjct: 237 GEFIRGWDDGRGVDSGRVALTTQGDAVQKMTGAASNGAATGFVNNSTS 284 Score = 52.8 bits (124), Expect = 7e-05, Method: 
Composition-based stats. Identities = 33/196 (16%), Positives = 60/196 (30%), Gaps = 25/196 (12%) Query: 945 VIPDMRGWTIKGKPASGRAVLSQEQDGIKSHTHSASASSTDLGTKTTSSFDY----GTKS 1000 D + +K S VL+ D ++ H + ++ F T Sbjct: 150 RFVDGKVTEVKMYADS---VLAAHVDA--ANPHPQYLKTAEIDNYLPVGFPLPWPQATPP 204 Query: 1001 TNNTGAHTHSVSGSTNSAGAHTH------SLANVNTASANSGAGSASTRLSVVHNQNYAT 1054 + A + L + G G S R+++ + Sbjct: 205 QGWLKCNGAPFDKVKYPKLAVAYPSGLLPDLRGEFIRGWDDGRGVDSGRVALTTQGDAVQ 264 Query: 1055 SSAGAHTHSLSG------TAASAGAHAHTVGIGAHTHSVAIGSHGHTITVNAA----GNA 1104 GA ++ + T+ +G I +T + G + +++ Sbjct: 265 KMTGAASNGAATGFVNNSTSRVSGVFKRGSVIYPNTSAQNADYQGVDLVFDSSLMVRSAE 324 Query: 1105 ENTVKNIAFNYIVRLA 1120 E +NIAFNYIVR A Sbjct: 325 ETRPRNIAFNYIVRAA 340 >UniRef50_C6UHV3 Predicted tail fiber protein n=22 Tax=Escherichia RepID=C6UHV3_ECOBR Length = 792 Score = 176 bits (445), Expect = 5e-42, Method: Composition-based stats. Identities = 217/363 (59%), Positives = 254/363 (69%) Query: 2 AVKISGVLKDGTGKPVQNCTIQLKAKRNSTTVVVNTLASENPDEAGRYSMDVEYGQYSVI 61 AVKISGVLKDG GKP+QNCTIQLKAKRNSTTV+VNT+ASENPDEAGRYSMDVEYGQYSV Sbjct: 3 AVKISGVLKDGAGKPIQNCTIQLKAKRNSTTVLVNTVASENPDEAGRYSMDVEYGQYSVT 62 Query: 62 LLVEGFPPSHAGTITVYEDSQPGTLNDFLGAMTEDDARPEALRRFELMVEEVARNASAVA 121 LLVEGFPPSHAGTITVYE S+PGTLNDFLGAMTEDD PEALRRFE MVEE ARNA A + Sbjct: 63 LLVEGFPPSHAGTITVYEGSRPGTLNDFLGAMTEDDVMPEALRRFEAMVEEAARNAEAAS 122 Query: 122 QNTAAAKKSASDASTSAREAATHAADAADSARAASTSAGQAASSAQSASSSAGTASTKAT 181 Q+ AAAKKS + A++S A T AA+SA+AA+ S +A+SA +A S A T Sbjct: 123 QSAAAAKKSETAAASSKNAAKTSETHAANSAQAAAASQTASANSATAAKKSENNAKNSET 182 Query: 182 EASKSAAAAESSKSAAATSAGAAKTSETNASASLQSAATSASTATTKASEAATSARDAAA 241 A S A+SS++AA TS AKTSET A +S +AA S S A A+ AA SA AA Sbjct: 183 AAKTSETNAKSSQAAAKTSETNAKTSETAAKSSQAAAAESESAAAGSATSAAGSATAAAN 242 Query: 242 SKEAAKSSETNASSSASSAASSATAAGNSAKAAKTSETNARSSETAAGQSASAAAGSKTA 301 S++AAK+SETNA SS ++A +S T A S AAK S+ A SE+AA SASAAA S TA Sbjct: 243 SQKAAKTSETNAKSSQTAAKTSETNAKASETAAKNSQDAAAQSESAAAGSASAAASSATA 302 Query: 302 
AASSASAASTSAGQASASATAAGKSAESAASSASTATTKAGEATEQASAAARSASAAKTS 361 +A+S AA TS A AS TAA SA+++A+S + A A E AS AA Sbjct: 303 SANSQKAAKTSETNAKASETAAANSAKASAASQTAAKASEDAAREYASQAAEPYKQVLQP 362 Query: 362 ETN 364 + Sbjct: 363 LPD 365 >UniRef50_D2TJ16 Putative phage tail fibre protein n=1 Tax=Citrobacter rodentium ICC168 RepID=D2TJ16_CITRO Length = 538 Score = 174 bits (440), Expect = 2e-41, Method: Composition-based stats. Identities = 79/255 (30%), Positives = 129/255 (50%) Query: 1 MAVKISGVLKDGTGKPVQNCTIQLKAKRNSTTVVVNTLASENPDEAGRYSMDVEYGQYSV 60 M+V ISG L +G G P+ C I L A N++ VV + A D AG+Y+ + + G+Y+V Sbjct: 1 MSVLISGALINGAGVPMAGCKIYLDALVNTSEVVTESFAVIETDAAGQYAFEAQKGKYTV 60 Query: 61 ILLVEGFPPSHAGTITVYEDSQPGTLNDFLGAMTEDDARPEALRRFELMVEEVARNASAV 120 + + P G I+VY+DS+PGTLNDFL A+ E D +P+ ++RFE MV + ++A A Sbjct: 61 HIKQKNGPKCCVGDISVYDDSKPGTLNDFLTALDEGDLKPDVVKRFEEMVAQAQQSAEAA 120 Query: 121 AQNTAAAKKSASDASTSAREAATHAADAADSARAASTSAGQAASSAQSASSSAGTASTKA 180 A++ A + +DA + T A + + A + + A AG Sbjct: 121 AESEQQAGQHVADAQQIKSDCETLADNVQQNTNAVEENTQRVEQLASEVGLHAGQVQQGV 180 Query: 181 TEASKSAAAAESSKSAAATSAGAAKTSETNASASLQSAATSASTATTKASEAATSARDAA 240 + + A+ + +A SA +K S NA+ S Q+A A A + A+DAA Sbjct: 181 QNVTDAVKKAQQAAKNSADSATDSKNSADNAALSEQNAQKHAQKAEQHEQQTKQYAQDAA 240 Query: 241 ASKEAAKSSETNASS 255 + E+A++++ Sbjct: 241 TAAESAENAKGEIDE 255 >UniRef50_B2PZV1 Putative uncharacterized protein n=2 Tax=Gammaproteobacteria RepID=B2PZV1_PROST Length = 526 Score = 173 bits (436), Expect = 4e-41, Method: Composition-based stats. 
Identities = 64/272 (23%), Positives = 102/272 (37%), Gaps = 20/272 (7%) Query: 780 NGNALTATKLQTPRRVSGVEFDGSKDITLT----AAHVAAFARRATDTYADADGGVPW-- 833 + +A + + P+R G + ++ + A + G PW Sbjct: 249 DKDAKNVSVVTIPKRSGTAMLVGDYGVGVSLPQSIENNTATTLGCGFYAIPGNAGNPWGN 308 Query: 834 NAESGAYNVTRSGDSYILVNFYTGVGSCRTLQMKAHYRNGGLFYRSSRDGYGFEEDWAEV 893 N + NV + + + + + Y + + Sbjct: 309 NGSAHIINVRDGNYGFQIGRTTGNKNLSFRILSANVFSPPSVLYSTGNTTKDHNG---NL 365 Query: 894 YTSKNLPPESYPVGAPIPWPSDTVPSGYALMQGQAFDKSAYPKLAAAYPSGVIPDMRGWT 953 S + PVGAPIPWP T PSGY + GQAF+K+ YP L AYPSG +PD+RG Sbjct: 366 KVSGSSELSDCPVGAPIPWPQATAPSGYLICNGQAFNKTTYPLLTKAYPSGKLPDLRGEF 425 Query: 954 IKGKP-----ASGRAVLSQEQDGIKSHTH-----SASASSTDLGTKTTSSFDYGTKSTNN 1003 I+G +GR VLS ++ + H H AS ++ G KT + G+ ST+ Sbjct: 426 IRGLDAGRNIDNGRVVLSFQRCATEHHKHISGWGEASNANAIFG-KTVKNGYVGSASTDR 484 Query: 1004 TGAHTHSVSGSTNSAGAHTHSLANVNTASANS 1035 ++ GS + N + Sbjct: 485 DNYLFYTNDGSEFQGSNPNSTGIMANETRPRN 516 >UniRef50_C2DS71 Tail fiber protein n=10 Tax=Escherichia RepID=C2DS71_ECOLX Length = 686 Score = 172 bits (435), Expect = 7e-41, Method: Composition-based stats. 
Identities = 148/252 (58%), Positives = 176/252 (69%) Query: 1 MAVKISGVLKDGTGKPVQNCTIQLKAKRNSTTVVVNTLASENPDEAGRYSMDVEYGQYSV 60 MAV+ISGVLKDG GKP+QNCTIQLKAKRNSTTVVVNT+ASENPDEAGRYSMDVEYGQYSV Sbjct: 1 MAVQISGVLKDGAGKPIQNCTIQLKAKRNSTTVVVNTVASENPDEAGRYSMDVEYGQYSV 60 Query: 61 ILLVEGFPPSHAGTITVYEDSQPGTLNDFLGAMTEDDARPEALRRFELMVEEVARNASAV 120 ILLVEGFPPSHAG ITVYEDS+PGTLNDFLGA TEDD RPEAL RFE MVEEVARNA A Sbjct: 61 ILLVEGFPPSHAGAITVYEDSKPGTLNDFLGAATEDDVRPEALYRFEKMVEEVARNAEAA 120 Query: 121 AQNTAAAKKSASDASTSAREAATHAADAADSARAASTSAGQAASSAQSASSSAGTASTKA 180 +Q+ AAAKKS + A++S A T +A +SA+AA++S A ++A +A S A Sbjct: 121 SQSAAAAKKSETAAASSRNAAKTSETNAGNSAKAAASSKTAAQNAATAAERSETNARASE 180 Query: 181 TEASKSAAAAESSKSAAATSAGAAKTSETNASASLQSAATSASTATTKASEAATSARDAA 240 ++ S A+ + +AA +AG A T A+A A A + A+ A +A A Sbjct: 181 EASADSEEASRRNAESAAENAGVATTKAREAAADATKAGQKKDEALSAATRAEKAADRAE 240 Query: 241 ASKEAAKSSETN 252 + E N Sbjct: 241 VAAEVTAEPYAN 252 >UniRef50_D2BT14 Tail Collar domain protein n=1 Tax=Dickeya dadantii Ech586 RepID=D2BT14_DICD5 Length = 534 Score = 172 bits (434), Expect = 7e-41, Method: Composition-based stats. 
Identities = 70/275 (25%), Positives = 101/275 (36%), Gaps = 84/275 (30%) Query: 851 LVNFYTGVGSCRTLQMKAHYRNGGLFYRSSRDGYGFEEDWAEVYTSKNLPPESYPVGAPI 910 +V G G Q+ + GL R D F+ W + + + VG P+ Sbjct: 339 VVRAGGGNGMADGHQISLGWTGSGL--RVQVDATSFD-LWHK--DNVFPIHAAEIVGIPL 393 Query: 911 PWPSDTVPSGYALMQGQAFDKSAYPKLAAAYPSGVIPDMRGWTIKGKPAS-----GRAVL 965 P+P T P G+ GQ+F+K+A+P LA YPSG +PD+RG I+G S GR +L Sbjct: 394 PYPGATAPDGWLKCNGQSFNKAAFPLLAQRYPSGFLPDLRGEFIRGWDDSRGVDPGRGLL 453 Query: 966 SQEQDGIKSHTHSASASSTDLGTKTTSSFDYGTKSTNNTGAHTHSVSGSTNSAGAHTHSL 1025 S ++ +H+H N H+H + S G Sbjct: 454 SFQESQNLTHSHGV-----------------------NDPGHSHPYNKYEGSVG------ 484 Query: 1026 ANVNTASANSGAGSASTRLSVVHNQNYATSSAGAHTHSLSGTAASAGAHAHTVGIGAHTH 1085 S + +Y + A TV G H Sbjct: 485 -------------------SGLAGFDYDQDAWNA-----------------TVYTG-HV- 506 Query: 1086 SVAIGSHGHTITVNAAGNAENTVKNIAFNYIVRLA 1120 G I++ A+G E +NIAFNYIVR A Sbjct: 507 -------GTGISIAASGGHEARPRNIAFNYIVRAA 534 >UniRef50_D0IJ09 Tail fiber protein H putative n=1 Tax=Vibrio sp. RC586 RepID=D0IJ09_9VIBR Length = 368 Score = 171 bits (432), Expect = 1e-40, Method: Composition-based stats. 
Identities = 78/218 (35%), Positives = 107/218 (49%), Gaps = 51/218 (23%) Query: 902 ESYPVGAPIPWPSDTVPSGYALMQGQAFDKSAYPKLAAAYPSGVIPDMRGWTIKGKPASG 961 + PVG P+PWPSD P G+A+ +GQAFDK A P+LA YP G++ D+RG + GK G Sbjct: 201 KICPVGVPLPWPSDIAPEGFAIHKGQAFDKVANPELAKLYPDGILKDLRGMAVVGK-KEG 259 Query: 962 RAVLSQEQDGIKSHTHSASASSTDLGTKTTSSFDYGTKSTNNTGAHTHSVSGSTNSAGAH 1021 +LS E D +K H + S T SS D G+++TN TG H H T++ Sbjct: 260 EIILSYEADQVKQHGYPNS---------TVSSTDLGSRNTNTTGNHAHGYPAGTSNG--- 307 Query: 1022 THSLANVNTASANSGAGSASTRLSVVHNQNYATSSAGAHTHSLSGTAASAGAHAHTVGIG 1081 + ++TA A+ G +T G H H+V IG Sbjct: 308 -PNGPYLDTAHASYGYRYTTTE----------------------------GNHYHSVAIG 338 Query: 1082 AHTHSVAIGSHGHTITVNAAGNAENTVKNIAFNYIVRL 1119 +H HS+AI G ENT+KNI FN+IVR+ Sbjct: 339 SHAHSIAI---------ALFGATENTIKNIKFNWIVRM 367 >UniRef50_P03764 Side tail fiber protein n=2 Tax=root RepID=STF_LAMBD Length = 774 Score = 168 bits (423), Expect = 2e-39, Method: Composition-based stats. Identities = 170/253 (67%), Positives = 183/253 (72%), Gaps = 28/253 (11%) Query: 896 SKNLPPESYPVGAPIPWPSDTVPSGYALMQGQAFDKSAYPKLAAAYPSGVIPDMRGWTIK 955 ++P GAPIPWPSD VPSGY LMQGQAFDKSAYPKLA AYPSGV+PDMRGWTIK Sbjct: 522 LGAGENSAFPAGAPIPWPSDIVPSGYVLMQGQAFDKSAYPKLAVAYPSGVLPDMRGWTIK 581 Query: 956 GKPASGRAVLSQEQDGIKSHTHSASASSTDLGTKTTSSFDYGTKS----------TNNTG 1005 GKPASGRAVLSQEQDGIKSHTHSASAS TDLGTKTTSSFDYGTK+ TNNTG Sbjct: 582 GKPASGRAVLSQEQDGIKSHTHSASASGTDLGTKTTSSFDYGTKTTGSFDYGTKSTNNTG 641 Query: 1006 AHTHSVSGSTNSAGAHTHSLANVNTASANSGAGSAS---------TRLSVVHNQNYATSS 1056 AH HS+SGST +AGAH H+ +S S G+A+ + T S Sbjct: 642 AHAHSLSGSTGAAGAHAHTSGLRMNSSGWSQYGTATITGSLSTVKGTSTQGIAYLSKTDS 701 Query: 1057 AGAHTHSLSGTAASAGAHAHTVGIGAHTHSVA---------IGSHGHTITVNAAGNAENT 1107 G+H+HSLSGTA SAGAHAHTVGIGAH H V IGSHGHTITVNAAGNAENT Sbjct: 702 QGSHSHSLSGTAVSAGAHAHTVGIGAHQHPVVIGAHAHSFSIGSHGHTITVNAAGNAENT 761 Query: 1108 VKNIAFNYIVRLA 1120 VKNIAFNYIVRLA Sbjct: 762 VKNIAFNYIVRLA 774 Score = 157 bits (396), Expect = 2e-36, Method: Composition-based stats. 
Identities = 315/546 (57%), Positives = 361/546 (66%), Gaps = 3/546 (0%) Query: 1 MAVKISGVLKDGTGKPVQNCTIQLKAKRNSTTVVVNTLASENPDEAGRYSMDVEYGQYSV 60 MAVKISGVLKDGTGKPVQNCTIQLKA+RNSTTVVVNT+ SENPDEAGRYSMDVEYGQYSV Sbjct: 1 MAVKISGVLKDGTGKPVQNCTIQLKARRNSTTVVVNTVGSENPDEAGRYSMDVEYGQYSV 60 Query: 61 ILLVEGFPPSHAGTITVYEDSQPGTLNDFLGAMTEDDARPEALRRFELMVEEVARNASAV 120 IL V+GFPPSHAGTITVYEDSQPGTLNDFL AMTEDDARPE LRR ELMVEEVARNAS V Sbjct: 61 ILQVDGFPPSHAGTITVYEDSQPGTLNDFLCAMTEDDARPEVLRRLELMVEEVARNASVV 120 Query: 121 AQNTAAAKKSASDASTSAREAATHAADAADSARAASTSAGQAASSAQSASSSAGTASTKA 180 AQ+TA AKKSA DAS SA + A DA DSARAASTSAGQAASSAQ ASS A AS KA Sbjct: 121 AQSTADAKKSAGDASASAAQVAALVTDATDSARAASTSAGQAASSAQEASSGAEAASAKA 180 Query: 181 TEASKSAAAAESSKSAAATSAGAAKTSETNASASLQSAATSASTATTKASEAATSARDAA 240 TEA KSAAAAESSK+AAATSAGAAKTSETNA+AS QSAATSASTA TKASEAATSARDA Sbjct: 181 TEAEKSAAAAESSKNAAATSAGAAKTSETNAAASQQSAATSASTAATKASEAATSARDAV 240 Query: 241 ASKEAAKSSETNASSSASSAASSATAAGNSAKAAKTSETNARSSETAAGQSASAAAGSKT 300 ASKEAAKSSETNASSSA AASSATAA NSA+AAKTSETNARSSETAA +SASAAA +KT Sbjct: 241 ASKEAAKSSETNASSSAGRAASSATAAENSARAAKTSETNARSSETAAERSASAAADAKT 300 Query: 301 AAASSASAASTSAGQASASATAAGKSAESAASSASTATTKAGEATEQASAAARSASAAKT 360 AAA SAS AST A +A+ SA +A +S +A ++A A A A + ASA A + Sbjct: 301 AAAGSASTASTKATEAAGSAVSASQSKSAAEAAAIRAKNSAKRAEDIASAVALEDADTTR 360 Query: 361 SETNAKASETSAESSKTAAASSASSAASSASSASASKDEATRQASAAKSSA---TTASTK 417 +S T++ S AA A ++ A D + +A T + Sbjct: 361 KGIVQLSSATNSTSETLAATPKAVKVVMDETNRKAPLDSPALTGTPTAPTALRGTNNTQI 420 Query: 418 ATEAAGSATAAAQSKSTAESAATRAETAAKRAEDIASAVALEDASTTKKGIVQLSSATNS 477 A A A A ++ ++ T E AA D A + +A K+ +A Sbjct: 421 ANTAFVLAAIADVIDASPDALNTLNELAAALGNDPDFATTMTNALAGKQPKNATLTALAG 480 Query: 478 TSETLAATPKAVKSAYDNAEKRLQKDQNGADIPDKGCFLNNINAVSKTDFADKRGMRYVR 537 S P ++ + + Q ++ L + A + F + + Sbjct: 481 LSTAKNKLPYFAENDAASLTELTQVGRDILAKNSVADVLEYLGAGENSAFPAGAPIPWPS 540 Query: 538 VNAPAG 543 P+G Sbjct: 541 DIVPSG 546 >UniRef50_Q66BF2 Hypothetical phage protein n=1 Tax=Yersinia 
pseudotuberculosis RepID=Q66BF2_YERPS Length = 711 Score = 163 bits (412), Expect = 3e-38, Method: Composition-based stats. Identities = 98/376 (26%), Positives = 154/376 (40%), Gaps = 21/376 (5%) Query: 1 MAVKISGVLKDGTGKPVQNCTIQLKAKRNSTTVVVNTLASENPDEAGRYSMDVEYGQYSV 60 M+V +SG++ + G+PV N I L A NS TV+ A+ D G Y + +E G YS+ Sbjct: 1 MSVTVSGIMINPVGEPVVNAQITLTAVTNSLTVLNAFSATVRTDGVGTYRIQLEEGSYSI 60 Query: 61 ILLVEGFPPSHAGTITVYEDSQPGTLND-FLGAMTEDDARPEALRRFELMVEEVARNASA 119 + G + G +T+ + P TLN + E + P+ + F + ++VA + + Sbjct: 61 TVAANGRSFVY-GAVTLDNTTGPSTLNQLLKQQIMESELTPDVILYFRQIQQQVANDLAT 119 Query: 120 VAQNTAAAKKSASDASTSAREAATHAADAADSARAASTSAGQAASSAQSASSSAGTASTK 179 + +A +A A S EA +A D +++ A QA SA +++ S A+ Sbjct: 120 IKVLEISATDAAESAGHSRDEAMLYAKDLSEALATAKGYRDQAGISADASALSQQEAAIS 179 Query: 180 ATEASKSAAAAESSKSAA----------------ATSAGAAKTSETNASASLQSAATSAS 223 T A SA +A S+ A S AA+ + +++ A A Sbjct: 180 ETSAKASADSALLSEQNALSYRDSAQSAAATAADDASTLAAERTAEKIKLQVKTDADRAE 239 Query: 224 TATTKASEAATSARDAAASKEAAKSSETNASSSASSAASSATAAGNSAKAAKTSETNARS 283 A + + +S D A + T A+ +A + AT A NS A SE A Sbjct: 240 AARIASEQIKSSVDDTAQTVAQQHGETTQAAIAARDSEVKATTAANS---AVQSEALAAI 296 Query: 284 SETAAGQSASAAAGSKTAAASSASAASTSAGQASASATAAGKSAESAASSASTATTKAGE 343 S A Q+A + K AA A A QA ASA + G + + AG+ Sbjct: 297 SAETARQNAGISTVDKNAAKGFRDEAEGFAQQAHASAESVGDVMPKTGGAFTGPVELAGD 356 Query: 344 ATEQASAAARSASAAK 359 ATE Sbjct: 357 ATEPLEPVTFQQFERT 372 >UniRef50_Q858V4 GpH n=9 Tax=root RepID=Q858V4_9CAUD Length = 913 Score = 163 bits (412), Expect = 3e-38, Method: Composition-based stats. 
Identities = 80/186 (43%), Positives = 105/186 (56%), Gaps = 12/186 (6%) Query: 732 LWRNDGAKT-YLLLTNQGDVYG-----GWNTLRPFAIDNAT---GELVIGTKLSASLNGN 782 +WR+ KT Y T + +YG +N R ID A + LS L+GN Sbjct: 587 VWRSTSNKTNYRFFTVR--LYGNPGERSFNIRRLPIIDEAQTWEAKQTFSAGLSGELSGN 644 Query: 783 ALTATKLQTPRRVSGVEFDGSKDITLTAAHVAAFARRATDTYADADGGVPWNAESGAYNV 842 A TATKL+T R+++ V FDG+ DI LT ++ AFA T D V WN SGAYN Sbjct: 645 AATATKLKTARKINNVSFDGTSDINLTPKNIGAFASGKTGDTVANDKAVGWNWSSGAYNA 704 Query: 843 TRSGDSYILVNFYTGVGSCRTLQMKAHYRNGGLFYRSSRDGYGFEEDWAEVYTSKNLPPE 902 T G S ++++F G GSC Q + +Y+NGG+FYRS+RDGYGFE DW+E YT+ P Sbjct: 705 TTGGASTLILHFNIGEGSCPAAQFRVNYKNGGIFYRSARDGYGFEADWSEFYTTTRKPTA 764 Query: 903 SYPVGA 908 VGA Sbjct: 765 G-DVGA 769 Score = 87.5 bits (214), Expect = 2e-15, Method: Composition-based stats. Identities = 60/174 (34%), Positives = 77/174 (44%), Gaps = 1/174 (0%) Query: 395 ASKDEATRQASAAKSSATTASTKATEAAGSATAAAQSKSTAESAATRAETAAKRAEDIAS 454 A + A + A T S A+ + A + + + + Sbjct: 103 AVSNMAESYKPELAEGSGRAQTCRMVIILSNVASVELSIDASTVMATQDYVDDKIAEHEQ 162 Query: 455 AVALEDASTTKKGIVQLSSATNSTSETLAATPKAVKSAYDNAEKRLQKDQNGADIPDKGC 514 + DA+ T+KG QLSSATNSTSE LAATPKAVK+A DNA RL K+QNGADI DK Sbjct: 163 SRRHPDATLTEKGFTQLSSATNSTSEKLAATPKAVKAANDNANSRLAKNQNGADIQDKSA 222 Query: 515 FLNNINAVSKTDFADKRGMRYVRVNAPAGATSGKYYPVVVMRSAGSVSELASRV 568 FL+NI S T F GM N + KY S + + Sbjct: 223 FLDNIGVTSLT-FMKHNGMIPTTDNLDSYGPEEKYLGTWSCPSQSTAKPESGYP 275 >UniRef50_C4U3E2 Tail fiber protein (Fragment) n=1 Tax=Yersinia kristensenii ATCC 33638 RepID=C4U3E2_YERKR Length = 430 Score = 158 bits (399), Expect = 9e-37, Method: Composition-based stats. 
Identities = 108/400 (27%), Positives = 174/400 (43%) Query: 1 MAVKISGVLKDGTGKPVQNCTIQLKAKRNSTTVVVNTLASENPDEAGRYSMDVEYGQYSV 60 M+VKISG L DG G P+ C I LKA+ N+ VV+ T+A+ G YS + + G+Y V Sbjct: 1 MSVKISGALIDGAGIPMSGCQIILKARVNTAEVVMRTIATITTGRNGEYSFEAQVGRYCV 60 Query: 61 ILLVEGFPPSHAGTITVYEDSQPGTLNDFLGAMTEDDARPEALRRFELMVEEVARNASAV 120 L G ITVY+DS+PGTLNDFL A+ E D +P+ ++RFE +V + ++A Sbjct: 61 YLRHGWSNEYCVGDITVYDDSKPGTLNDFLIALDEGDLKPDVVKRFEELVAQAQQSADMA 120 Query: 121 AQNTAAAKKSASDASTSAREAATHAADAADSARAASTSAGQAASSAQSASSSAGTASTKA 180 A++ +S D + +A +AADA SA AA+ S A S + A+SSA +A+ A Sbjct: 121 AESAQQVSQSVQDVTKVKDDAKRYAADAQTSATAAAESQSTATESEKRAASSAHSATQSA 180 Query: 181 TEASKSAAAAESSKSAAATSAGAAKTSETNASASLQSAATSASTATTKASEAATSARDAA 240 A +S AA+ + A + N + +Q+ A S A T A T A Sbjct: 181 QNAQESKEAAQQAAQNAQNCRNEVEEVANNLANEVQTKAPLDSPALTGTPTAPTPDISAT 240 Query: 241 ASKEAAKSSETNASSSASSAASSATAAGNSAKAAKTSETNARSSETAAGQSASAAAGSKT 300 + A A S+ ++ A N A ++ N ++ T A + T Sbjct: 241 GGEVATAEFVKQAVSALVDSSPEALDTLNELAEALGNDPNFATTMTNALAGKQPLNPALT 300 Query: 301 AAASSASAASTSAGQASASATAAGKSAESAASSASTATTKAGEATEQASAAARSASAAKT 360 + + +A + A ++ + + + + A + + S T Sbjct: 301 SLSGLVTAENKLAYFSNKNVMSLANLSAVGRVIIGQNSKSEVLEYLGALKSTNNLSEIAT 360 Query: 361 SETNAKASETSAESSKTAAASSASSAASSASSASASKDEA 400 + T+A+ AA + S + + A Sbjct: 361 AGTDAQQQTRQHLGLGDAATMNVQSDIHDRTEGRLALPGA 400 >UniRef50_A6VBH2 Putative tail fiber protein n=2 Tax=Pseudomonas aeruginosa PA7 RepID=A6VBH2_PSEA7 Length = 654 Score = 158 bits (398), Expect = 1e-36, Method: Composition-based stats. 
Identities = 92/390 (23%), Positives = 147/390 (37%), Gaps = 74/390 (18%) Query: 753 GWNTLRPFAIDNATGELVIGTKLSASLNGNALTATKLQTPRRVSGVE----FDGSKDITL 808 + I NA + + A A +L+ + + + +D+ Sbjct: 302 TKTDVGLDKIPNAISDDPALDRGDVLATTKATRAVQLRAAQDLDDLAKSLGTAARRDVGT 361 Query: 809 TAAH---VAAFARRATDTYADA-----DGGVPWNAESGAYNVTRSGDSY-ILVNFYTGVG 859 A V AF D+ A + V + Y G SY ++ F Sbjct: 362 DAGDLLEVGAFGWGTNDSPVAASVNIYESSVTKFTPATEYVPEIFGMSYGVVATFAYSER 421 Query: 860 SCRTLQMKAHYRN-GGLFYRSSRDGYGFEEDWAEVYTSKNL-PPESYPVGAPIPWPSDTV 917 R Q+ L +RS + E++ S NL P P GA + + + Sbjct: 422 ETRASQLFFGQSPENKLMFRSGNYT---WAPFLEIWHSGNLNPQAIVPAGAVVAFAMYSP 478 Query: 918 PSGYALMQGQAFDKSAYPKLAAA----YPSG------VIPDMRGWTIKGKPAS-----GR 962 P+GY G A ++AY L A Y +G +PD RG ++ GR Sbjct: 479 PAGYLKANGAAVSRTAYAALFATIGTYYGAGDGSTTFNLPDYRGEFLRALDDGRGLDLGR 538 Query: 963 AVLSQEQDGIKSHTHSASASSTDLGTKTTSSFDYGTKSTNNTGAHTHSVSGSTNSAGAHT 1022 + + + +HTH AS+S G HTH+V+G+ +AGAH+ Sbjct: 539 QLGTLQSSQNLAHTHGASSSG--------------------NGGHTHTVTGTAAAAGAHS 578 Query: 1023 HSLANVNTASANSGAGSASTRLSVVHNQNYATSSAGAHTHSLSGTAASAGAHAHTVGIGA 1082 HS+A+VN + SG A+ V + N T AG HTH+++G AA G H HT+ Sbjct: 579 HSIASVNATALVSGTRLAT---LVGNASNSTTDVAGDHTHAVTGVAALEGTHNHTIY--- 632 Query: 1083 HTHSVAIGSHGHTITVNAAGNAENTVKNIA 1112 V ++G +E +N++ Sbjct: 633 ---------------VESSGGSEARPRNVS 647 >UniRef50_B6S308 ORF32 (Fragment) n=3 Tax=Enterobacteriaceae RepID=B6S308_SALDU Length = 427 Score = 155 bits (391), Expect = 8e-36, Method: Composition-based stats. 
Identities = 73/231 (31%), Positives = 98/231 (42%), Gaps = 31/231 (13%) Query: 736 DGAKTYLLLTNQGDVYGGWNTLR-----PFAIDNATGE--LVIGTKLSASLNGNALTATK 788 D KTY + N ++Y G L+ D A G L + +K + N A + Sbjct: 207 DNTKTYFSVLNPLEIYLGSRYLQKDQNLSDVPDKAKGRSSLEVYSKTESDENYMAKSQCG 266 Query: 789 LQTPRRVSGVEFDGSKDITLTAAHVAAFARRA-----TDTYADADGG--------VPWNA 835 P + V+ G+ + TA A R T +D G + Sbjct: 267 ADIPNKPLFVQNIGALPASGTAVAANRLASRGALPALTGATRGSDSGLIMGEVYNNGYPT 326 Query: 836 ESGAYNVTRSGDSYILVNFYTGVGSCRTLQMKAHYRNGGLFYRSSRDGYGF-EEDWAEVY 894 + G ++ ++G + RS RD +WA +Y Sbjct: 327 QYGNILRLTGTGDGEILIGWSGTNGAPAPA----------YIRSHRDTADAEWSEWAMLY 376 Query: 895 TSKNLPPESYPVGAPIPWPSDTVPSGYALMQGQAFDKSAYPKLAAAYPSGV 945 TS N PP SYPVGA I WPSD P+GYALMQGQ+FDKSAYP LA AYPSG+ Sbjct: 377 TSLNPPPNSYPVGAAIAWPSDATPAGYALMQGQSFDKSAYPLLAIAYPSGI 427 >UniRef50_Q19CF5 Gp36 small distal tail fiber subunit n=1 Tax=Aeromonas phage 25 RepID=Q19CF5_9CAUD Length = 1305 Score = 153 bits (386), Expect = 3e-35, Method: Composition-based stats. Identities = 103/442 (23%), Positives = 159/442 (35%), Gaps = 88/442 (19%) Query: 394 SASKDEATRQASAAKSSATTASTKATEAAGSATAAAQSKSTAESAATRAETAAKRAEDIA 453 SA A R+ T A A A +A +A++ ++ + T T + + A Sbjct: 307 SAGTKLANRKIYTEADKPTPAEIGALAAGANAVSASKLQTARTISLTGGATGSVSFDGSA 366 Query: 454 SAVALEDASTTKKGIVQLSSATNSTSETLAATPKAVKSAYDNAEKRLQKDQNGADIPDKG 513 +A + T T +TPK + + AE ++ Sbjct: 367 NASIAVTITNNSHTHSDYVKKTGDTMSGNLSTPKVLLTDAQGAEANSVTRRD-------- 418 Query: 514 CFLNNINAVSKTDFADKRGMRYVRVNAPAGATSGKYYPVVVMRSAGSVSELASRVIITTA 573 V T A +G++ + + PAGA + Y PV+ + S S V I T+ Sbjct: 419 -------FVESTVTAAGKGVKDLTITLPAGAPATGYIPVMFRTNGTS----DSFVFIDTS 467 Query: 574 TRTAGDPMNNCEFNGFVMPGGWTDRGRYAYGMFWQYQNNERAIHSIMMSNKGDDLRSVFY 633 PMNNC F G V GW+D YAYG F Y +ERAIHSI D V Y Sbjct: 468 FNVGEHPMNNCSFIGNVRASGWSDGRSYAYGKFTIYSTSERAIHSIHAPF-EDSFAYVVY 526 Query: 634 VDGAAFPVFAFIEDGLSISAPGADLVVNDTTYKFGATNPATECIAADVILDFKSGRGFYE 693 V+ AFP+ ++ G +++A D+ + +K + V+ +F G Y Sbjct: 527 
VETRAFPITVRVDIGTTVTAHATDVTYGTSVFK--VNGQVSGNTKTIVLGNFDKDSGTY- 583 Query: 694 SHSLIVNDNLSCKKLFATDEIVARGGNQIRMIGGEYGALWRNDGAKTYLLLTNQGDVYGG 753 N + Y N Sbjct: 584 -----------------------------------------NGSNRVYDTGYNP------ 596 Query: 754 WNTLRPFAIDNATGELVIGTKLSASLNGNALTATKLQTPRRVSGVEFDGSKDITLTAAHV 813 A G L + NGNA+TA++LQT R ++GV FDG+ +I+++A +V Sbjct: 597 --------TPEAVGALPV--------NGNAVTASRLQTARLIAGVPFDGTSNISISATNV 640 Query: 814 AAFARRATDTYADADGGVPWNA 835 A T + V + Sbjct: 641 GA--VNKTGDAMTGNLSVAADY 660 >UniRef50_B5YU13 Tail fiber protein n=140 Tax=root RepID=B5YU13_ECO5E Length = 451 Score = 151 bits (381), Expect = 1e-34, Method: Composition-based stats. Identities = 168/369 (45%), Positives = 210/369 (56%), Gaps = 7/369 (1%) Query: 1 MAVKISGVLKDGTGKPVQNCTIQLKAKRNSTTVVVNTLASENPDEAGRYSMDVEYGQYSV 60 MAVKISGVLKDGTGKPV+NCTIQLKA+R S+TVVVNT+ASENPDEAGRYSMDVEYGQYSV Sbjct: 15 MAVKISGVLKDGTGKPVENCTIQLKARRTSSTVVVNTVASENPDEAGRYSMDVEYGQYSV 74 Query: 61 ILLVEGFPPSHAGTITVYEDSQPGTLNDFLGAMTEDDARPEALRRFELMVEEVARNASAV 120 ILLVEGFPPSHAGTITVYEDSQPGTLNDFLGAM+EDD RPEALRRFELM Sbjct: 75 ILLVEGFPPSHAGTITVYEDSQPGTLNDFLGAMSEDDVRPEALRRFELM-------VEEA 127 Query: 121 AQNTAAAKKSASDASTSAREAATHAADAADSARAASTSAGQAASSAQSASSSAGTASTKA 180 A++ AKK+A +A TSAR A A+ A +SA A TSAG A+ SA+ A+ SA A Sbjct: 128 ARHAEEAKKNAGEAETSARNAGISASQAEESAANADTSAGDASESARQAAESAAAAKQSE 187 Query: 181 TEASKSAAAAESSKSAAATSAGAAKTSETNASASLQSAATSASTATTKASEAATSARDAA 240 +S SA+AA S ++ SA A+ S A ++ +AA A+TAT KA E+A SA+ A Sbjct: 188 EASSSSASAAAQKASESSQSAAEAELSRKTAESAAGNAARDATTATEKARESAESAQSAE 247 Query: 241 ASKEAAKSSETNASSSASSAASSATAAGNSAKAAKTSETNARSSETAAGQSASAAAGSKT 300 S+ AA+ + + + K + + A AG Sbjct: 248 QSRIAAEEAVNRIPTVVGPPGPKGEPGPAGPQGPKGDKGERGDTGPAGATGERGPAGDAG 307 Query: 301 AAASSASAASTSAGQASASATAAGKSAESAASSASTATTKAGEATEQASAAARSASAAKT 360 A + AG + A+ G E +A A+ + Sbjct: 308 PAGPQGPKGDRGERGETGLTGNAGPQGPKGDTGAAGPAGPQGPKGETGAAGPVGATGPQG 367 Query: 361 SETNAKASE 369 + + ++ Sbjct: 368 PKGDPGETQ 376 >UniRef50_C5H7L2 Putative tail fiber 
protein GP37 n=3 Tax=unclassified Myoviridae RepID=C5H7L2_9CAUD Length = 391 Score = 150 bits (377), Expect = 3e-34, Method: Composition-based stats. Identities = 79/220 (35%), Positives = 108/220 (49%), Gaps = 24/220 (10%) Query: 895 TSKNLPPESYPVGAPIPWPSDTVPSGYALMQGQAFDKSAYPKLAAAYPSGVIPDMRGWTI 954 SYPVG + PS Y L G ++ + + Y + P Sbjct: 148 NLYKAIQASYPVGTIHLSVNSANPSTYLLCGG-TWELVSKGRALVGYDTDSRPVG----- 201 Query: 955 KGKPASGRAVLSQEQDGIKSHTHSA-------------SASSTDLGTKTTSSFDYGTKST 1001 G ++ + + +HTHS S SS D G+K+TS+FDYGTK+T Sbjct: 202 ---STFGSQTVALTNNNLPAHTHSIYLTGGGHTHSASVSISSFDYGSKSTSTFDYGTKTT 258 Query: 1002 NNTGAHTHSVSGSTNSAGAHTHSLANVNTASANSGAGSASTRLSVVHNQNYATSSAGAHT 1061 N+ GAHTH+ SG+T++AG H H + + G + + T AGAHT Sbjct: 259 NSAGAHTHTFSGTTSNAGNHNHRV--PMRGNDRGGTNAITASADAGVGNAMYTDLAGAHT 316 Query: 1062 HSLSGTAASAGAHAHTVGIGAHTHSVAIGSHGHTITVNAA 1101 HS SGT AS+GAH+HTV IGAH+H+V IGSH HT TV + Sbjct: 317 HSFSGTTASSGAHSHTVAIGAHSHTVNIGSHSHTGTVTVS 356 >UniRef50_C5BA56 Putative uncharacterized protein n=1 Tax=Edwardsiella ictaluri 93-146 RepID=C5BA56_EDWI9 Length = 743 Score = 149 bits (376), Expect = 5e-34, Method: Composition-based stats. 
Identities = 120/544 (22%), Positives = 201/544 (36%), Gaps = 16/544 (2%) Query: 4 KISGVLKDGTGKPVQNCTIQLKAKRNSTTVVVNTLASENPDEAGRYSMDVEYGQYSVILL 63 ISGVL D +GK V I L A NS V+ S E G+YS+ +E G YS+ + Sbjct: 1 MISGVLLDPSGKAVSGAQITLTAIANSMQVLRGFTCSVMTAENGQYSVRLEEGNYSISVA 60 Query: 64 VEGFPPSHAGTITVYEDSQPGTLNDFL-GAMTEDDARPEALRRFELMVEEVARNASAVAQ 122 +G + G +T+ EDS P +LN L + E + PE + F + +VA + + + Sbjct: 61 HQGRNFVY-GAVTLTEDSAPSSLNALLHQQVMEQEVTPEVILYFRQIQHKVADDVVIMQR 119 Query: 123 NTAAAKKSASDASTSAREAATHAADAADSARAASTSAGQAASSAQSASSSAGTASTKATE 182 + ++A A S R A AA S R A+ A +A+ A+ A TA Sbjct: 120 LQHDSSQAARAAQESQRHAQASKVAAAGSVRQAAAHRLAAGQAAEMAADYAQTAQDSQRH 179 Query: 183 ASKSAAAAESSKSAAATSAGAAKTSETNASASLQSAATSASTATTKASEAATSARDAAAS 242 A +S AA S+ A AA+ S A+ A + Sbjct: 180 AQRSEMAAAESEQRTADHRLAAEQSAEVAAVHAAEEAAARVAEVVHNDSERADVAK---- 235 Query: 243 KEAAKSSETNASSSASSAASSATAAGNSAKAAKTSETNARSSETAAGQSASAAAGSKTAA 302 K +E+ A ++ A + A N+ AA S+ + + A +A AA + Sbjct: 236 ----KQAESAARNADQYAQQAGIKAQNAQAAADVSQEAIQVTRQARDDTARYAAEVQGYT 291 Query: 303 ASSASAASTSAGQASASATAAGKSAESAASSASTATTKAGEATEQASAAARSASAAKTSE 362 + +A A A+ A + +A+ A R+ A Sbjct: 292 QQAVLSAQQIKEDTDTGLLVAQDIAQQVAVVNMAVSRVVEDASYVEQAVIRTLDKAPVDS 351 Query: 363 TNAKASETSAESSKTAAASSASSAASSASSASASKDEATRQASAAKSSATTASTKATEAA 422 + + +A ++AA + S + + + A+ +A Sbjct: 352 PVFTGTPQAPTPDGSAVGQEIATAAFVLAQVSKLINSSPAAMDTLQELASALGNDPEFSA 411 Query: 423 GSATAAAQSKSTAESAATRAETAAKRAEDIASAVALEDASTTKKGIVQLSSATNSTSETL 482 Q ++ A + + + + A+ +++G+VQLS +T STS + Sbjct: 412 TVMNLIGQKLDKLQNGADIPDKSRFL-----QNIGVVSATISRRGVVQLSDSTESTSISE 466 Query: 483 AATPKAVKSAYDNAEKRLQKDQNGADIPDKGCFLNNINAVSKTDFADKRGMRYVRVNAPA 542 AAT A++ Y A R+ + + + A + T A KR A Sbjct: 467 AATANALRRTYQYA-TRIATTNQIGQVQLEDSVSSTSTARAPTCSALKRTYDEATRRAST 525 Query: 543 GATS 546 G Sbjct: 526 GQAG 529 >UniRef50_A9Q1X5 Putative tail fiber protein n=1 Tax=Enterobacteria phage phiEcoM-GJ1 RepID=A9Q1X5_9CAUD Length = 356 Score = 146 bits (368), Expect = 3e-33, Method: Composition-based stats. 
Identities = 73/237 (30%), Positives = 118/237 (49%), Gaps = 16/237 (6%) Query: 857 GVGSCRTLQMKAHYRNGGLFYRSSRDGYG--FEEDWAEVYTSKNLPPESYPVGAPIPWPS 914 R +Q ++ N ++ RS ++W E N+ YP+G + + + Sbjct: 68 NANVGRVMQRYTNFSNKRMWVRSQNGTVSDANFDEWTEFVNMNNIYNAIYPIGIVVKFDN 127 Query: 915 DTVPSGYALMQGQAFDKSAYPKLAAAY--PSGVIPDMRGWTIKGKPASGRAV--LSQEQD 970 T P+ G +++ ++A A P D + +I G + AV L Sbjct: 128 ATNPNNN--FTGTVWEQIIDGRVARAATGPEAGTADGQIGSIAGSDTANIAVTNLPGHTH 185 Query: 971 GIKSHTHSASASSTDLGTKTTSSFDYGTKSTNNTGAHTHSVSGSTNSAGAHTHSLANVNT 1030 G+++HTH ++ S + T + D+G +++++GAHTHSVSG+ SAGAH H+ + T Sbjct: 186 GMQNHTHGIASHSHTMAHTHTINHDHGAVTSSSSGAHTHSVSGTAASAGAHQHTEGSPFT 245 Query: 1031 ASANSGAGSASTRLSVVHNQNYA-------TSSAGAHTHSLSGTAASAGAHAHTVGI 1080 N G + ST + + Y+ TSS+GAHTHS+SGTAASAGAH H+V + Sbjct: 246 GDVNFGT-TTSTSKDNISDWLYSPSTRYPLTSSSGAHTHSVSGTAASAGAHTHSVDL 301 >UniRef50_B2SVF7 Phage-related protein n=3 Tax=Xanthomonas oryzae pv. oryzae RepID=B2SVF7_XANOP Length = 501 Score = 143 bits (359), Expect = 4e-32, Method: Composition-based stats. 
Identities = 69/268 (25%), Positives = 106/268 (39%), Gaps = 46/268 (17%) Query: 890 WAEVYTSKNLPPESYPVGAPIPWPSDTVPSGYALMQGQAFDKSAYPKLAAA----YPSG- 944 + + N+P G + S P+G + G ++ Y L AA Y +G Sbjct: 234 YRDFGNMLNVPQSFLLPGQIVVMASLYPPNGLLVCDGAEISRAKYAALFAAIGTVYGAGD 293 Query: 945 -----VIPDMRGWTIKGKPASGRAVLSQEQDGIKSHTHSASASSTDLGTKTTSSFDYGTK 999 +P ++ T+ ++ AV S + + SHTH ASA++ T+ G Sbjct: 294 GSTTFNVPKIKEGTVITHTSAATAVGSYDPGQVISHTHGASAAAVGDHAHYTAINAAGNH 353 Query: 1000 S----------------TNNTGAHTHSVSGSTNSAGAHTHSLANVNTASANSGAGSASTR 1043 + T+ G H H GST+++G H H V +++ +G G R Sbjct: 354 AHGASAGAAGDHAHYAWTDAQGHHAH--GGSTSASGDHQH--PGVIPSASINGYGVYRER 409 Query: 1044 LSVVHNQNYATSSAGAHTHSLSGTAASA----------GAHAHTVGI---GAHTHSVA-- 1088 + + T + G H HS A + G H H +GI G H H V Sbjct: 410 DNDAAPSDGWTGAGGNHAHSFGTDGAGSHGHNISMNGVGNHTHGIGIAEGGNHVHDVDHR 469 Query: 1089 -IGSHGHTITVNAAGNAENTVKNIAFNY 1115 G+H HTITVNAAG +N + Y Sbjct: 470 GAGAHAHTITVNAAGGIDNLPAGLRMTY 497 >UniRef50_Q7N348 Similarities with tail fiber protein n=1 Tax=Photorhabdus luminescens subsp. laumondii RepID=Q7N348_PHOLL Length = 440 Score = 143 bits (359), Expect = 5e-32, Method: Composition-based stats. 
Identities = 63/265 (23%), Positives = 103/265 (38%), Gaps = 28/265 (10%) Query: 768 ELVIGTKLSASLNGNALTATK--LQTPRRVSGVEFDGSKDITLTAAHVAAFARRATDTYA 825 L + N + ++T + + S +V + + Sbjct: 166 RLSKSQNGADIPNKSEFIKNLGLVETVNKANNAVSSISGGTIKAGLNVQ--SVLSVGISQ 223 Query: 826 DADGGVPWNAESGAYNVTRSGDSYILVNFYTGVGSCRTLQMKAHYRNGGLFYRSSRDGYG 885 + + + N + DS I + + + +NG LFY S R Sbjct: 224 NKNLRISSN---------ETADSQINLIVWGNSNRKTIFE--CGDKNGCLFY-SHRLSSN 271 Query: 886 FEEDWA------EVYTSKNLPPESYPVGAPIPWPSDTVPSGYALMQGQAFDKSAYPKLAA 939 E ++ Y++ + ++ P G P+P+P P GY GQ FDKS YPKLA Sbjct: 272 NVELFSTGKIIPSDYSNFDARYDNVPAGVPMPYPHRYTPPGYLTCNGQTFDKSLYPKLAE 331 Query: 940 AYPSGVIPDMRGWTIKGKPAS-----GRAVLSQEQDGIKSHTHSASASSTDLGTKTTSSF 994 AYP+G +PD+RG I+G S GR + + D I H H AS L + Sbjct: 332 AYPAGRVPDLRGEFIRGWDDSRGVDPGRVCGTWQADCIPDHNHYKVASKQ-LVEDLVLTG 390 Query: 995 DYGTKSTNNTGAHTHSVSGSTNSAG 1019 D G +++ + T S+ +T + G Sbjct: 391 DAGWYTSSGSSTRTRSLDQNTYTGG 415 >UniRef50_C4TTI1 Tail fiber protein n=2 Tax=Yersinia RepID=C4TTI1_YERKR Length = 402 Score = 142 bits (357), Expect = 8e-32, Method: Composition-based stats. 
Identities = 72/280 (25%), Positives = 113/280 (40%), Gaps = 35/280 (12%) Query: 758 RPFAIDNATGELVIGTKLSASLNGNALTATKLQTPRRVSGVEFDGSKDITLTAAHVAAFA 817 D A + + + NG A ATKL T R ++G FDG+ +I++ AA V A Sbjct: 117 TQNLNDVANKATALANLGALAANGTAAAATKLATARTIAGKSFDGTANISIGAADVGALP 176 Query: 818 R---------RATDTYADADGGVPWNAE---SGAYNVTRSGDSYI-------LVNFYTGV 858 T +A+ G + + S + +T +G + I + + Sbjct: 177 LLGGTLSGPLEITGVHAEPLGPNGYKSNIKTSASGAITNAGGTGIGLNADKGIYFWNDNT 236 Query: 859 GSCRTLQMKAHYRNGGLFYRSSRDGYGFEEDWAEVYTSKNLPPESYPVGAPIPWPSDTVP 918 G +L + N + S G ++ E Y +G PIPWP P Sbjct: 237 GYAMSLSLTKLSVNRAITALDSTITPGDYSNFDERYEPAL-------IGTPIPWPLTIAP 289 Query: 919 SGYALMQGQAFDKSAYPKLAAAYPSGVIPDMRGWTIKGKPAS-----GRAVLSQEQDGIK 973 +GY G F+K+ YPKLA AYPSGV+PD+RG I+G + +L + I+ Sbjct: 290 AGYLKCNGAPFNKTQYPKLALAYPSGVLPDLRGEFIRGFDDGRGVRPNQPLLGWQGSEIQ 349 Query: 974 SHTHSASASSTDLGTKTTSSFDYGTKSTNNTGAHTHSVSG 1013 SH H T+ + + + G T++ G Sbjct: 350 SHNHGI----TNFEIRGVTGGPTNAWFPSTNGISTNNSGG 385 Score = 42.8 bits (98), Expect = 0.081, Method: Composition-based stats. Identities = 21/121 (17%), Positives = 35/121 (28%), Gaps = 7/121 (5%) Query: 999 KSTNNTGAHTHSVSGSTNSAGAHTHSLANVNTASANSGAGSASTRLSVVHNQNYATSSAG 1058 + + + A + + G R + + Sbjct: 288 APAGYLKCNGAPFNKTQYPKLALAYPSGVLPDLRGEFIRGFDDGRGVRPNQPLLGWQGSE 347 Query: 1059 AHTHSLSGTAASAGAHAHTVGIGAHTHSVAIGSHGHTITVNAAGNAENTVKNIAFNYIVR 1118 +H+ T V G + A + I+ N +G E +NIAFNYIVR Sbjct: 348 IQSHNHGITNFEI----RGVTGGP---TNAWFPSTNGISTNNSGGDETRPRNIAFNYIVR 400 Query: 1119 L 1119 Sbjct: 401 A 401 >UniRef50_Q7N2Q1 Similar to Sc/SvQ protein of Escherichia coli plasmid p15B n=1 Tax=Photorhabdus luminescens subsp. laumondii RepID=Q7N2Q1_PHOLL Length = 478 Score = 141 bits (355), Expect = 1e-31, Method: Composition-based stats. 
Identities = 64/237 (27%), Positives = 97/237 (40%), Gaps = 37/237 (15%) Query: 761 AIDNATGELVIGTKLSASLNGNA------------LTATKLQTPRRVSGVEFDGSKDITL 808 A DNA L + N + L + + R+++G G D++L Sbjct: 194 ATDNANSRLAKNQNGADIPNKSEFIKNLGLTETVELAKSAVPNSRKINGKALSG--DVSL 251 Query: 809 TAAHVAAFARRATDTYADADGGV--PWNAESGAYNVTRSGDSYILVNFYTGVGSC-RTLQ 865 A V A +T + + N + + +F G+ L Sbjct: 252 NAGDVGALPISSTLSAQTGTLRINNGSNWPNIEFRAANK-------HFIGIEGTAGNRLT 304 Query: 866 MKAHYRNGGLFYRSSRDGYGFEEDWAEVYTSKNLPPESYPVGAPIPWPSDTVPSGYALMQ 925 + A+ N Y + + T L + VG+PIPWP VP+GY Sbjct: 305 IYANDENSNRKYTLATP--------EKSGTLATLDDINISVGSPIPWPLPNVPAGYLACN 356 Query: 926 GQAFDKSAYPKLAAAYPSGVIPDMRGWTIKGKPAS-----GRAVLSQEQDGIKSHTH 977 GQ+F+KS YP+LA AYPSGV+PD+RG I+G GR VL+ + D I++ T Sbjct: 357 GQSFNKSLYPQLAIAYPSGVLPDLRGEFIRGWDDGRGVDRGRGVLTHQGDAIRNITG 413 >UniRef50_Q7Y3Z0 Tail fiber protein n=1 Tax=Yersinia phage PY54 RepID=Q7Y3Z0_9CAUD Length = 690 Score = 141 bits (354), Expect = 2e-31, Method: Composition-based stats. Identities = 106/360 (29%), Positives = 169/360 (46%), Gaps = 13/360 (3%) Query: 1 MAVKISGVLKDGTGKPVQNCTIQLKAKRNSTTVVVNTLASENPDEAGRYSMDVEYGQYSV 60 M++KISG+L TG+P + I L+A + S TV+ ++ G YS++VE G+Y V Sbjct: 1 MSIKISGILPGPTGEPAAHIGITLRAIKTSLTVITTLESNSITGTDGAYSLNVEPGKYDV 60 Query: 61 ILLVEGFPPSHAGTITVYEDSQPGTLNDFLGAMTEDDARPEALRRFELMVEEVARNASAV 120 +L V+G GTI VY DS PGTLN+FL A+ E+D PE + + E + E R A Sbjct: 61 LLWVDGINARRVGTINVYSDSLPGTLNNFLTALREEDGTPEIILQLEQLRAEAVRAALEA 120 Query: 121 AQNTAAAKKSASDASTSAREAATHAADAADSARAASTSAGQAASSAQSASSS-----AGT 175 ++ A + A A ++A AA A+ +A +AA + S A Sbjct: 121 KESKNEATQQAGIAISAADNAAQETAELIKAAVKEDADRAEAARYGAETAQSTVNTLAAE 180 Query: 176 ASTKATEASKSAAAAESSKSAAATSAGAAKTSETNASASLQSAATSASTATTKASEAATS 235 + +E + A+AA +S + AA+S+ ++ S + + +S +AA S +A A +A S Sbjct: 181 VARHHSEVGQLASAASNSAAEAASSSNSSAQSASESESSKNAAALSEQSALAGAEDAGNS 240 Query: 236 ARDAAASKEAAKSSETNASSSASSAASSATAAGNSAKAAKTSETNARSSETAAGQSASAA 295 A AA K AAK A A+ A +SA + S A + N + S+T + + Sbjct: 241 
ATAAAGDKTAAKGFRDEAEEFAARAKASAESIDVS---ALEEQINQKVSQTEFDDTIADK 297 Query: 296 AGSKTAAASSASAASTSAGQASASATAAGKSAESAASSASTATTKAGEATEQASAAARSA 355 A ++ A+ AST + + + AG+AT+Q A + Sbjct: 298 ASNQALTDGLATKAST-----QQLTDGLAGKLDKIGGTLTGPLILAGDATDQKGAVTKQQ 352 >UniRef50_C6CGA0 Tail Collar domain protein n=5 Tax=Enterobacteriaceae RepID=C6CGA0_DICZE Length = 166 Score = 140 bits (351), Expect = 4e-31, Method: Composition-based stats. Identities = 42/106 (39%), Positives = 63/106 (59%), Gaps = 5/106 (4%) Query: 906 VGAPIPWPSDTVPSGYALMQGQAFDKSAYPKLAAAYPSGVIPDMRGWTIKGKP-----AS 960 VG P+PWP T P+G+ GQAFDK+A+PKLA YPSGV+PD+RG I+G S Sbjct: 23 VGIPLPWPQATPPAGWLKCNGQAFDKNAFPKLAQVYPSGVLPDLRGEFIRGWDDGRGVDS 82 Query: 961 GRAVLSQEQDGIKSHTHSASASSTDLGTKTTSSFDYGTKSTNNTGA 1006 R +LS + D I++ T S + + +D G++++ + G+ Sbjct: 83 NRNLLSSQGDAIRNITGFVSGVYVGFDGYSGAFYDTGSRNSISPGS 128 Score = 61.3 bits (146), Expect = 2e-07, Method: Composition-based stats. Identities = 27/143 (18%), Positives = 43/143 (30%), Gaps = 23/143 (16%) Query: 995 DYGTKSTNNTGAHTHSVSGSTNSAGAHTH------SLANVNTASANSGAGSASTRLSVVH 1048 T + + + A + L + G G S R + Sbjct: 30 PQATPPAGWLKCNGQAFDKNAFPKLAQVYPSGVLPDLRGEFIRGWDDGRGVDSNRNLLSS 89 Query: 1049 NQNYATSSAGAHTHSLSGTAASAGAHAHTVGIGAHTHSVAIGSHGHTIT-------VNAA 1101 + + G + G +GA T G+ +I + +A+ Sbjct: 90 QGDAIRNITGFVSGVYVGFDGYSGAFYDT---GSRN---SISPGSTIVAQLNDDFAFDAS 143 Query: 1102 G----NAENTVKNIAFNYIVRLA 1120 EN +NIAFNYIVR A Sbjct: 144 RVVPTANENRPRNIAFNYIVRAA 166 >UniRef50_D0ZBI4 Putative tail fiber protein n=1 Tax=Edwardsiella tarda EIB202 RepID=D0ZBI4_EDWTE Length = 718 Score = 136 bits (341), Expect = 5e-30, Method: Composition-based stats. 
Identities = 61/263 (23%), Positives = 101/263 (38%), Gaps = 35/263 (13%) Query: 761 AIDNATGELVIGTKLSASLNGNALTATK--LQTPRRVSGVEFDGSKDITLTAAHVAAFAR 818 A+ +A ++V+ T S+ + T K L + S +E G K+ A + A+++ Sbjct: 410 ALQDAANKIVVLTGPSSVEAADLSTFAKSLLSKTDQDSAIECLGLKETVTLAGN--AWSK 467 Query: 819 RATDTYADADGGVPWNAESGAYNVTRSGDSYILVN-------------FYTGVGSCRTLQ 865 + + N + G Y V+ S + Y S Q Sbjct: 468 KYIGRLNNGGAFAGCN-QGGIYEVSIGTPSSVADFPMKNGTYIYGYGVLYVTSNSGTISQ 526 Query: 866 MKAHYRNGGLFYRSSRDGYGFEEDWAEVYTSKNLPPESYP-VGAPIPW-----PSDTVPS 919 + + NG + R + WA VY + P +G+ IPW P + P+ Sbjct: 527 LYISH-NGQIAARIKWGDQPNFKSWA-VYDPNSSFEYGCPLIGSLIPWALERMPQEIWPN 584 Query: 920 G---YALMQGQAFDKSAYPKLAAAYPSGVIP-DMRGWTIKGKPAS-----GRAVLSQEQD 970 + GQ+FD +PKL YP +P DMRG+T +G GRA+LS + D Sbjct: 585 CGMHFIPYMGQSFDPELFPKLHDVYPDNRLPTDMRGYTARGWDNGRGIDIGRALLSYQDD 644 Query: 971 GIKSHTHSASASSTDLGTKTTSS 993 I++ T + + S Sbjct: 645 AIQNITGQFGWMPFNGSSPVASG 667 Score = 131 bits (329), Expect = 1e-28, Method: Composition-based stats. Identities = 179/446 (40%), Positives = 241/446 (54%), Gaps = 9/446 (2%) Query: 4 KISGVLKDGTGKPVQNCTIQLKAKRNSTTVVVNTLASENPDEAGRYSMDVEYGQYSVILL 63 +I+G+LKDG GKP+ NC I LKA R S +V+V+T+AS++P EAG Y M E GQY V L Sbjct: 3 RITGILKDGMGKPITNCEIALKALRTSASVIVHTVASQSPGEAGLYDMAAEPGQYRVTLC 62 Query: 64 VEGFPPSHAGTITVYEDSQPGTLNDFLGAMTEDDARPEALRRFELMVEEVARNASAVAQN 123 V+G+PP + G I +Y DS GTLN FLG + D RP+ ++ FE+MV +V+ ++ V +N Sbjct: 63 VDGYPPEYVGDIQIYHDSPDGTLNYFLGLPVDGDLRPDVMKEFEIMVAKVSAQSAEVEKN 122 Query: 124 TAAAKKSASDASTSAREAATHAADAADSARAASTSAGQAASSAQSASSSAGTASTKATEA 183 AA +SA A S + A + + AA+SA AA S A +S Q A+S A +A A Sbjct: 123 KDAAAESARSALNSQQSAHSSESAAAESAAAALASQNAAKASEQLAASGAQSAQASQQAA 182 Query: 184 SKSAAAAESSKSAAATSAGAAKTSETNASASLQSAATSASTATTKASEAATSARDAAASK 243 S +AA S +AA S AAK SE A++S +A S +A S AA SA A AS+ Sbjct: 183 KASESAAADSAAAALASQNAAKESEQAAASSALAAQASQQSAHGSESAAAESAAAALASQ 242 Query: 244 EAAKSSETNASSSASSAASSATAAGNS------AKAAKTSETNARSSETAAGQSASAAAG 297 AAK+SE A+SSA +AA+ A A +A + A SS T A S AAG 
Sbjct: 243 NAAKASELAATSSAETAANDAAAKAAQATEATLKEAVRADADRAASSATEAHSSTEQAAG 302 Query: 298 SKTAAASSASAASTSAGQASASATAAGKSAESAASSASTATTKAGEATEQASAAARSASA 357 S ++A +S AA+ SA QA+ A S +AA SAS+A +A+ ASAAA SASA Sbjct: 303 SASSAHNSQMAAAQSASQAAGLADKVKASEAAAAESASSAAQSVSQASSSASAAAGSASA 362 Query: 358 AKTSETNAKASETSAESSKTAA---ASSASSAASSASSASASKDEATRQASAAKSSATTA 414 AK+SET A S +AE S +A A S + S AA Sbjct: 363 AKSSETAAAGSALAAEGSAQSAKVEADRISGGLDTKQDKSELLGAIAALQDAANKIVVLT 422 Query: 415 STKATEAAGSATAAAQSKSTAESAAT 440 + EAA +T A S + + Sbjct: 423 GPSSVEAADLSTFAKSLLSKTDQDSA 448 >UniRef50_Q7N0S6 Similarities with prophage tail fiber protein n=3 Tax=Photorhabdus RepID=Q7N0S6_PHOLL Length = 617 Score = 134 bits (337), Expect = 2e-29, Method: Composition-based stats. Identities = 66/197 (33%), Positives = 90/197 (45%), Gaps = 28/197 (14%) Query: 745 TNQGDVYGGWNTLRPFAIDNATGELVIGTKLSASLNGNA------------LTATKLQTP 792 T + L DNA L + + NA L + Sbjct: 112 TGNSNSLAVTQKLVSDVNDNANNRLAKNQNGADIPDKNAFVKNLGLAETANLAKNAVPNS 171 Query: 793 RRVSGVEFDGSKDITLTAAHVAAFARRATDTYADADGGVPWNAESGAYNVTRSG-DSYIL 851 R+++G G DI+L A V AF T ++ VPWNA +G Y++ R G DS + Sbjct: 172 RKINGKALTG--DISLNAGDVGAFRLGLTGNNTVSN-QVPWNANTGLYDLLRPGIDSQHI 228 Query: 852 VNFYTGVGSCRTLQMKAHYRNGGLFYRSSRDGYGFEEDWAEVYTSKNLPPE--------- 902 +F GVGSC Q+K Y+N G+ YRS+RD YGFEEDW ++YT+KN P Sbjct: 229 AHFNNGVGSCPAFQLKVQYKNSGIAYRSARDNYGFEEDWTDIYTTKNKPTAADVGAFRLG 288 Query: 903 ---SYPVGAPIPWPSDT 916 Y V P+PW +DT Sbjct: 289 LAGGYSVNNPVPWNADT 305 Score = 92.1 bits (226), Expect = 1e-16, Method: Composition-based stats. 
Identities = 75/264 (28%), Positives = 105/264 (39%), Gaps = 31/264 (11%) Query: 802 GSKDITLTAAHVAAFARRATDTYADADGGVPWNAESGAYNVTRSG-DSYILVNFYTGVGS 860 + TAA V AF Y+ + VPWNA++G Y++ R G DS + +F G GS Sbjct: 271 YTTKNKPTAADVGAFRLGLAGGYSVNN-PVPWNADTGLYDLLRPGIDSQHIAHFNNGAGS 329 Query: 861 CRTLQMKAHYRNGGLFYRSSRDGYGFEEDWAEVYTSKNLPPESYPVGAPIPWPSDTVPSG 920 C Q+K YRNGG+ YRS+RD YGFEEDW ++YT+KN P + +GA Sbjct: 330 CPAFQLKVQYRNGGIAYRSARDNYGFEEDWTDIYTTKNKPTPA-DIGA------------ 376 Query: 921 YALMQGQAFDKSAYPKLAAAYPSGVIPDMRGWTIKGKPASGRA---------VLSQEQDG 971 YA +G F + Y A I D+ W IK P G A + G Sbjct: 377 YAKSEGSEFIQPKYINQA------NISDLTAW-IKSLPQGGHAFRFSGNDSGIGYAWSGG 429 Query: 972 IKSHTHSASASSTDLGTKTTSSFDYGTKSTNNTGAHTHSVSGSTNSAGAHTHSLANVNTA 1031 + H A SF +G NT + S +++ Sbjct: 430 YITRMHDIWAGFVAHYDYAGISFIHGNDGGGNTKVSRLWTDKNARSDANGILRVSSPVVD 489 Query: 1032 SANSGAGSASTRLSVVHNQNYATS 1055 G ++ V + T Sbjct: 490 IHPDGTYELTSEAEGVTVKRIDTG 513 Score = 66.3 bits (159), Expect = 7e-09, Method: Composition-based stats. Identities = 42/121 (34%), Positives = 56/121 (46%), Gaps = 8/121 (6%) Query: 443 ETAAKRAEDIASAVALEDASTTKKGIVQLSSATNSTSETLAATPKAVKSAYDNAEKRLQK 502 T +A + A + AS T+KGIVQL + S +LA T K V DNA RL K Sbjct: 80 TTQLNKALEKKIATEIPSASLTQKGIVQL-TDKTGNSNSLAVTQKLVSDVNDNANNRLAK 138 Query: 503 DQNGADIPDKGCFLNNINAVSKTDFADKRGMRYVRVNAPAGATSGKYYPVVVMRSAGSVS 562 +QNGADIPDK F+ N+ + A ++N GK + +AG V Sbjct: 139 NQNGADIPDKNAFVKNLGLAETANLAKNAVPNSRKIN-------GKALTGDISLNAGDVG 191 Query: 563 E 563 Sbjct: 192 A 192 >UniRef50_P45386 Immunoglobulin A1 protease translocator n=45 Tax=Proteobacteria RepID=IGA4_HAEIN Length = 1849 Score = 132 bits (331), Expect = 7e-29, Method: Composition-based stats. 
Identities = 77/539 (14%), Positives = 145/539 (26%), Gaps = 22/539 (4%) Query: 11 DGTGKPVQNCTIQLKAKRNSTTVVVNTLASENPDEAG-RYSMDVEYGQYSVILLVEGFPP 69 D TG+P N A + + TLA+ + D +Y + G+Y + + P Sbjct: 943 DKTGEPNHNELTLFDASNATRNNLEVTLANGSVDRGAWKYKLRNVNGRYDL------YNP 996 Query: 70 SHAGTITVYEDSQPGTLNDFLGAMTEDDARPEALRRFELMVEEVARNASAVAQNTAAAKK 129 + + T ND + E + R E V A + + + Sbjct: 997 EVEKRNQTVDTTNITTPNDIQADAPSAQSNNEEIARVETPVPPPAPATESAIASEQPETR 1056 Query: 130 SASDASTSAREAATHAADAADSARAASTSAGQAASSAQSASSSAGTASTKATEASKSAAA 189 A A + E T + +T S + + ++ A E A Sbjct: 1057 PAETAQPAMEETNTANSTETAPKSDTATQTENPNSESVPSETTEKVAENPPQENETVAKN 1116 Query: 190 AESSKSAAATSAGAAKTSETNASASLQSAATSASTATTKASEAATSARDAAASKEAAKSS 249 + + + AK A T+ +EA S ++ A S Sbjct: 1117 EQEATEPTPQNGEVAKE------------DQPTVEANTQTNEATQSEGKTEETQTAETKS 1164 Query: 250 ETNASSSASSAASSATAAGNSAKAAKTSETNARSSETAAGQSAS--AAAGSKTAAASSAS 307 E S + S T + ++ + ET Q A + A + Sbjct: 1165 EPTESVTVSENQPEKTVSQSTEDKVVVEKEEKAKVETEETQKAPQVTSKEPPKQAEPAPE 1224 Query: 308 AASTSAGQASASATAAGKSAESAASSASTATTKAGEATEQASAAARSASAAKTSETNAKA 367 T A A + AA+ ++ +K E T+Q S + N Sbjct: 1225 EVPTDTNAEEAQALQQTQPTTVAAAETTSPNSKPAEETQQPSEKTNAEPVTPVVSENTAT 1284 Query: 368 SETSAESSKTAAASSASSAASSASSASASKDEATRQA-SAAKSSATTASTKATEAAGSAT 426 T E + AS S +++ + + K A A Sbjct: 1285 QPTETEETAKVEKEKTQEVPQVASQESPKQEQPAAKPQAQTKPQAEPARENVLTTKNVGE 1344 Query: 427 AAAQSKSTAESAATRAETAAKRAEDIASAVALEDASTTKKGIVQLSSATNSTSETLAATP 486 Q++ +S A A+ + T+ +S+ ++ + Sbjct: 1345 PQPQAQPQTQSTAVPTTGETAANSKPAAKPQAQAKPQTEPARENVSTVNTKEPQSQTSAT 1404 Query: 487 KAVKSAYDNAEKRLQKDQNGADIPDKGCFLNNINAVSKTDFADKRGMRYVRVNAPAGAT 545 + + +++ I A + R Sbjct: 1405 VSTEQPAKETSSNVEQPAPENSINTGSATTMTETAEKSDKPQMETVTENDRQPEANTVA 1463 >UniRef50_C7BSQ1 Side tail fiber protein homolog from lambdoid prophage e14 n=3 Tax=Photorhabdus RepID=C7BSQ1_PHOAA Length = 166 Score = 131 bits (327), Expect = 2e-28, Method: Composition-based stats. 
Identities = 65/233 (27%), Positives = 82/233 (35%), Gaps = 75/233 (32%) Query: 896 SKNLPPESYPVGAPIPWPSDTVPSGYALMQGQAFDKSAYPKLAAAYPSGVIPDMRGWTIK 955 E PVG P+PWP+D P G+ G FDK YPKLA AYPSG +PD+RG I+ Sbjct: 1 MSISILEEIPVGIPLPWPTDIPPYGWVKCNGAIFDKYLYPKLAVAYPSGNLPDLRGEFIR 60 Query: 956 GKPAS-----GRAVLSQEQDGIKSHTHSASASSTDLGTKTTSSFDYGTKSTNNTGAHTHS 1010 G GR VLS + I H+H Sbjct: 61 GWDDGRGVDIGRYVLSTQLADIAPHSHRIGRMW--------------------------- 93 Query: 1011 VSGSTNSAGAHTHSLANVNTASANSGAGSASTRLSVVHNQNYATSSAGAHTHSLSGTAAS 1070 S ++AGA + S G NY + G Sbjct: 94 ---SNSNAGAEGLGTPSRILNSVYQGV-------------NYGIDTRGL----------- 126 Query: 1071 AGAHAHTVGIGAHTHSVAIGSHGHTI---TVNAAGNAENTVKNIAFNYIVRLA 1120 + IG +GS G V A+ E +N+AFNYIVR A Sbjct: 127 ------GIAIG-------MGSGGFGYMDNAVAASTGIETRPRNVAFNYIVRAA 166 >UniRef50_D0FSD9 Phage related-protein n=2 Tax=Erwinia pyrifoliae RepID=D0FSD9_ERWPY Length = 311 Score = 128 bits (321), Expect = 1e-27, Method: Composition-based stats. Identities = 63/261 (24%), Positives = 91/261 (34%), Gaps = 47/261 (18%) Query: 856 TGVGSCRTLQMKAHYRNGGLFYRSSRDGYGFEEDWAEVYTSKNLPPESYPVGAPIPWPSD 915 G Q + ++ + G W E N YP+G + Sbjct: 81 DGKPYAIRSQAYYGKKLWQSKIGNNNEEPGKGSGWVEFKADVNPVDMLYPIGIVTWFAQK 140 Query: 916 TVPSGYALMQGQAFDKSAYPKLAAAYPSGVIPDMRGWTIKGKPASGRAVLSQEQDGIKSH 975 P+ L G + + + M G ++ I +H Sbjct: 141 KDPN--KLFPGTTWKYIGENRTIRLASANGSDVM--------TTGGSDSVTLAVGNIPAH 190 Query: 976 THSASAS--STDLGTKTTSSFDYGTKSTNNTGAHTHSVSGSTNSAGAHTHSLANVNTASA 1033 H+ SA+ S D GTK TS+FDYG K T+ G+HTHS ++ AS Sbjct: 191 GHTFSANTGSFDYGTKGTSTFDYGNKVTDTQGSHTHS------------YNEVIPRGASG 238 Query: 1034 NSGAGSASTRLSVVHNQNYATSSAGAHTHSLSGTAASAGAHAHTVGIGAHTHSVAIGSHG 1093 G T + ++AGAHAH V IGAH H+V IG+H Sbjct: 239 MDIGGIWETTIRGS-------------------DTSTAGAHAHNVAIGAHGHTVEIGAHS 279 Query: 1094 HTITVNAAGNAENTVKNIAFN 1114 H++ +G NT A N Sbjct: 280 HSV----SGTTANTGAGTAIN 296 >UniRef50_D1TPQ4 Phage tail collar domain protein n=1 Tax=Yersinia pestis KIM D27 RepID=D1TPQ4_YERPE Length = 262 Score = 127 bits (319), Expect = 
2e-27, Method: Composition-based stats. Identities = 53/206 (25%), Positives = 78/206 (37%), Gaps = 17/206 (8%) Query: 811 AHVAAFARRATDTYADAD--GGVP-------WNAESGAYNVTRSGDSYILVNFYTGVGSC 861 + + + D + GGVP W VT D + ++ Sbjct: 5 GDIPNTRADSNGEFTDGNVAGGVPPTLLPAEWFNTIQRELVTVVQDGGLTLDPNDDTQVL 64 Query: 862 RTLQMKAHYRNGGLFYRSSRDGYGFEEDWAEVYTSKNLPPESYPVGAPIPWPSDTVPSGY 921 L+ L S G + + PVG P+PWP+ T P G+ Sbjct: 65 AALKKLFLQSGNNL---SEIKDAGPTAITQTLANLGLGEGSAIPVGVPLPWPTATPPEGW 121 Query: 922 ALMQGQAFDKSAYPKLAAAYPSGVIPDMRGWTIKGKPAS-----GRAVLSQEQDGIKSHT 976 G FDK YPKLA AYPSG++PD+RG I+G GR +LS + D I++ + Sbjct: 122 LKCNGAIFDKVKYPKLALAYPSGILPDLRGEFIRGWDDGLGVDAGREILSIQGDAIRNIS 181 Query: 977 HSASASSTDLGTKTTSSFDYGTKSTN 1002 + + SS G T+ Sbjct: 182 GGIQGRNEATSARLFSSNATGVFRTD 207 >UniRef50_C5ABB4 Putative uncharacterized protein n=1 Tax=Burkholderia glumae BGR1 RepID=C5ABB4_BURGB Length = 670 Score = 126 bits (315), Expect = 5e-27, Method: Composition-based stats. Identities = 105/514 (20%), Positives = 165/514 (32%), Gaps = 108/514 (21%) Query: 695 HSLIVNDNLSCKKLFATDEIVARGGNQIRMIGGEYGALWRNDGAKTYLLLTNQGDVYGGW 754 L V + + + + GG R G YGA RNDG+ YLL T +GD G Sbjct: 175 SRLQVAGTVRADGVVS--DTPDAGGAHFRARYGGYGAYLRNDGSSVYLLSTKKGDPSGQP 232 Query: 755 NTLRPFAIDNATGELVIGTKLSA-SLNGNALTATKLQTPR----------RVSGVEFDGS 803 N RPFA + TG + I ++ GNA ++ + G + Sbjct: 233 NDYRPFAWNLGTGYVTIDAGGHGTAIGGNATIVGDVRIGHGLDEGHARIGPLDGYFYSNK 292 Query: 804 KDITLTAAHVAAFARRATDTYADADGGVPWNAESGAYNVTRSGD---------SYILVNF 854 + + V +F A D DG + W++ + G + + Sbjct: 293 TSVGWWSPTVGSFQYVAQDRTFRVDGYLVWHSGNMTPLDANKGGVIGGNVTFAAGQRLFL 352 Query: 855 YTGVGSCRTLQM-----------------KAHYRNGGLFYRSSRDGYGFEEDWA------ 891 G + +L NG + +++ F++ A Sbjct: 353 DEGSAAFPSLAFVNDGVPDTGFYHARDGVFGVTCNGEVTVTFAQEATYFQKPAAGPSPAP 412 Query: 892 EVYTSKNLPPESY-------PVGAPIPWPSDTVPSGYALMQGQAFDKSAYPKLAAAYPSG 944 + + + E +G + + +GY G + ++ YP L AY Sbjct: 413 DDSSLRFATTEWVTAAIGTASIGQIVMEARTSPRAGYVKCDGSQYKRADYPAL-WAYAQA 471 Query: 945 
---------------------------VIPDMRGWTIKGKPAS------GRAVLSQEQDG 971 +PD+RG ++ GRA+ S + Sbjct: 472 SGALVSEAEYTDGRWGGFSTADGQTYFRVPDLRGEFLRCWSDGRGDVDPGRAIGSFQGGQ 531 Query: 972 IKSHTHSASASSTDLGTKTTSSFDYGTKSTNNTGAHTHSVSGSTNSAGAHTHSLANVNTA 1031 ++H H AS+ GA HS G T G H H+ + Sbjct: 532 NQAHAHGASSDPDGAH----------VHDAWTGGAGWHSHHGVTGGGGMHNHANGVFSRL 581 Query: 1032 SANSGAGSASTRLSVVHNQNYATSSAGAHTHSLSGTAASAGAHAH---TVGIGAHTHSVA 1088 GS + + A S A AG H H T G G H H+V Sbjct: 582 LRPPYLGSLTGSDTDGSGNEQAVGGGD------SADIAWAGEHQHEFWTDGAGDHVHAVG 635 Query: 1089 IGS---HGHTITVNAAGNAENTVKNIAFNYIVRL 1119 IG+ H H I V A G AE +N+A ++R Sbjct: 636 IGNAGGHAHAIHVQADGGAEARPRNVALLAMIRA 669 >UniRef50_D0Z3B3 Putative tail fiber protein n=1 Tax=Photobacterium damselae subsp. damselae CIP 102761 RepID=D0Z3B3_LISDA Length = 1008 Score = 126 bits (315), Expect = 5e-27, Method: Composition-based stats. Identities = 104/577 (18%), Positives = 196/577 (33%), Gaps = 11/577 (1%) Query: 3 VKISGVLKDGTGKPVQNCTIQLKAKRNSTTVVVNTLASENPDEAGRYSMDVEYGQYSVIL 62 + + G+L D + + IQ+ + S +V+ + D G YS + G Y + Sbjct: 1 MLVKGILSDAADQCIPKGIIQIVSINTSESVLEGSTVWIKADNEGHYSFTLLPGSYLIYA 60 Query: 63 LVEGFP-PSHAGTITVYEDSQPGTLNDFLGAMTEDDARPEALRRFELMVEEVARNASAVA 121 + G V +D+ G+LN +G T P +++ + R+A Sbjct: 61 QSGRQNDVVYLGETIVTDDTPDGSLNSIVGITT--PVLPPQVQQAVNAANKATRSAEDAN 118 Query: 122 QNTAAAKKSASDASTSAREAATHAADAADSARAASTSAGQAASSAQSASSSAGTASTKAT 181 + A + S + T + +++ A + ++ Q + Sbjct: 119 DKYQDLIELAKTVTNSINDLVTTVNHVENLSQSVEGYALASGNALQGSLRIKAEVGAVNQ 178 Query: 182 EASKSAAAAESSKSAAATSAGAAKTSETNASASLQSAATSASTATTKASEAATSARDAAA 241 S + A+T + AS+S Q A A++A A+ AA ++ Sbjct: 179 SVRLVKDEILSIQKNIIHLKSDAETFSSKASSSAQKAQKQANSAVLSANNAAADSQKTFR 238 Query: 242 SKEAAKSSETNASSSASSAASSATAAGNSAKAAKTSETNARSSETAAGQSASAAAGSKTA 301 + + + S T + A +K Sbjct: 239 LMTVVEKYRDDVMGALDETHQSLEWLKFMQVQFDTKLQEMTLIDEHLSILAREIEKNKQK 298 Query: 302 AASSASAASTSAGQASASATAAGKSAESAASSASTATTKAGEATEQASAAARSASAAKTS 361 A SAG A+ SA A A A +AS + EA +QA ++ A + ++ Sbjct: 299 
VEQLRLNARESAGNAATSALRAEHEANRAEVAASIDIVR--EAEKQAKSSKSEADKSLSA 356 Query: 362 ETNAKASETSAESSKTAAASSASSAASSASSASASKDEATRQASAAKSSATTASTKATEA 421 + ++ A + A S SA +SA +A + A QA + A++A+ A + Sbjct: 357 SLVSVNAKNVAVAKANEAKQSELSATTSAQNAEQNSLTAKEQAKLSTEKASSAAISAKNS 416 Query: 422 AGSATAAAQSKSTAESAATRAETAAKRAEDIASAVALEDASTTKKGIVQLSSATNSTSET 481 S +A ++ S A +T + +A A A + +K + A+ +T + Sbjct: 417 KSSEKSALEAASEAALNSTATKQSANLASSHAVTAGESANTAEQKADDAANQASIATQQA 476 Query: 482 LAATPKA-----VKSAYDNAEKRLQKDQNGADIPDKGCFLNNINAVSKTDFADKRGMRYV 536 A A ++ N+ + A K + A+++ A + + Sbjct: 477 GIAKSNADASLNSQTLAANSVELASNQAKLASNSAKVAAEKAMIAINQVSLAQQEAAKS- 535 Query: 537 RVNAPAGATSGKYYPVVVMRSAGSVSELASRVIITTA 573 RVNA A S K S A ++ Sbjct: 536 RVNAGGAAQSAKDSERSSQTSIAKADVAAKNAGLSAT 572 >UniRef50_C5H7L3 Putative tail fiber protein n=1 Tax=Enterobacteria phage WV8 RepID=C5H7L3_9CAUD Length = 848 Score = 126 bits (314), Expect = 6e-27, Method: Composition-based stats. Identities = 76/346 (21%), Positives = 126/346 (36%), Gaps = 47/346 (13%) Query: 753 GWNTLRPFAIDNATGELVIGTKLSASLNGNALTATKLQTPRRVSGVEFDGSKDITLTAAH 812 G+N P L LN A ++ R + G + + Sbjct: 510 GFNINTPNQAGKCEIVLRTSNNNPKGLNVVAWRTSENTIVRDI------GYVNTSGDTYD 563 Query: 813 VAAFARRATDTYADADGGVPWNAESGAYNVTRSGDSYILVNFYTGVGSCRTLQMKAHYRN 872 + A TY ++ ++ + + + ++ G+ + + Sbjct: 564 IYYLA----GTYQNSTTTRVQSSSNASVQLFEVPQTFDDAPQGIVKGT--IAKYYTSLQK 617 Query: 873 GGLFYRSSRDGYGFEEDWAEVYTSKNLPPESYPVGAPIPWPSDTVPSGYALMQGQAFDKS 932 + ++ AE + + YPVG + S+ P+ + Sbjct: 618 PTPSDIGAYTKAETDQKIAEAISDSTDLNKIYPVGIVTWFNSNVNPN------------T 665 Query: 933 AYPKLAAAYPSGVIPDMRGWTIKGKPASGRAV--------LSQEQDGIKSHTHSASASST 984 A P L Y + + G TI+ A+G V ++ + SHTHS SA Sbjct: 666 ALPGLTWTYLNNGV----GRTIRIAAANGSDVATTGGSDSVTLSVGNLPSHTHSFSA--- 718 Query: 985 DLGTKTTSSFDYGTKSTNNTGAHTHSVSGS--TNSAGAHTHSLANVNTASANSGAGS-AS 1041 TTSSFDYGTK+++ TG H H+ T S G ++ TAS GS A Sbjct: 719 -----TTSSFDYGTKTSSTTGNHNHNRGTMEITGSFGYFRSDASSFYTASGAFYLGSQAG 773 Query: 1042 TRLSVVHNQNYATSSAGAHTHSLSGTAASAGAHAHTVGIGAHTHSV 1087 ++ 
+N + + SG + G H+HTVGIGAH+H+V Sbjct: 774 SKGYTGNNFTNGIPVNFNASRNWSGVTNTTGNHSHTVGIGAHSHTV 819 >UniRef50_B7MW07 Putative tail fiber protein from prophage n=4 Tax=Escherichia coli ED1a RepID=B7MW07_ECO81 Length = 520 Score = 126 bits (314), Expect = 7e-27, Method: Composition-based stats. Identities = 100/419 (23%), Positives = 144/419 (34%), Gaps = 3/419 (0%) Query: 1 MAVKISGVLKDGTGKPVQNCTIQLKAKRNSTTVVVNTLASENPDEAGRYSMDVEYGQYSV 60 M V+ISGVLKDGTGKPV CTI+LKA+R + TV+V T+A P E G YS DVE G Y V Sbjct: 1 MTVRISGVLKDGTGKPVPGCTIELKARRTTETVIVTTVAQGQPGETGSYSFDVEPGWYRV 60 Query: 61 ILLVEGFPPSHAGTITVYEDSQPGTLNDFLGAMTEDDARPEALRRFELMVEEVARNASAV 120 L EG+ PS+ G I V DS+PGTLN FL E P+AL E + E+ + A A Sbjct: 61 TLNTEGYAPSYVGDILVKADSEPGTLNKFLMEQDEAQYYPKALAELEAVAAEILKRAEAS 120 Query: 121 AQNTAAAKKSASDASTSAREAATHAADAADSARAASTSAGQAASSAQSASSSAGTASTKA 180 A + AKK A +A A E A A+ + + G + + Sbjct: 121 AASAEEAKKRAENARGPAGEKGDTGPQGATGAQGPAGATGAVGPKGEPGPKGERGETGPQ 180 Query: 181 TEASKSAAAAESSKSAAATSAGAAKTSETNASASLQSAATSASTATTKASEAATSARDAA 240 T + Q+ A + Sbjct: 181 GPKGDKGDPGGPPGPKGDTGPRGEAGPPGPQGPAGQTGPKGDKGEPGATGPAGPAGPRGE 240 Query: 241 ASKEAAKSSETNASSSASSAASSATAAGNSAKAAKTSETNARSSETAAGQSASAAAGSKT 300 + +S ++ S S+ ET A + + A A Sbjct: 241 TGPAGPAGPAGSVASVPDASTSQKGVVQLSSDTNSDDETKAATPKAVKAVMAEVQAAKTK 300 Query: 301 AAASSASAASTSAGQASASATAAGKSAESAASSASTATTKAGEATEQASAAARSASAAKT 360 A ++ AA A G + + A G+A A A + +A Sbjct: 301 AEEAATRAAVPGPKGDRGEPGAPGAVGPAGPRGPAGAAGPKGDAG-PAGPAGKDGTAGAE 359 Query: 361 SETNAKASETSAESSKTAAASSA--SSAASSASSASASKDEATRQASAAKSSATTASTK 417 + + + + + + E RQ T + Sbjct: 360 GKAGPAGPRGERGPAGAQGVPGPVGPAGPAGKTGPRGLQGETGRQGPTGPQGPTGETGP 418 >UniRef50_A4N1T0 Immunoglobin A1 protease n=1 Tax=Haemophilus influenzae R3021 RepID=A4N1T0_HAEIN Length = 1550 Score = 125 bits (313), Expect = 8e-27, Method: Composition-based stats. 
Identities = 71/565 (12%), Positives = 175/565 (30%), Gaps = 33/565 (5%) Query: 11 DGTGKPVQNCTIQLKAKRNSTTVVVNTLASENPDEAG--RYSMDVEYGQYSVILLVEGFP 68 D TG+P +N + L N+T +N N + G +Y + G+Y + + Sbjct: 778 DKTGEPTKN-ELTLFDASNATRNNLNVSLVGNTVDLGAWKYKLRNVNGRYDL------YN 830 Query: 69 PSHAGTITVYEDSQPGTLNDFLGAMTEDDARPEALRRFE--LMVEEVARNASAVAQNTAA 126 P + + T N+ + ++ E + R E + A + Sbjct: 831 PEVEKRNQTVDTTNITTPNNIQADVPSVPSQNEEIARVEAPVPPPAPATPSETTKTEAEN 890 Query: 127 AKKSASDASTSAREAATHAADAADSARAASTSAGQAASSAQSASSSAGTASTKATEASKS 186 + + + + ++A A + A A ++ + + A S + T T+ TE ++ Sbjct: 891 SPQKSETVEKNEQDATETTAQNREVAEEAKSNVEANTQTNKVAQSGSETEETQTTETKET 950 Query: 187 AAAAESSKSAAATSAGAAKTSETNASASLQSAATSASTATTKASEAATSARDAAASKEAA 246 A E A + + S T + T+ + Sbjct: 951 AKVEEDEIQEAPQMTSETSPKQAEPAPEEVSTDTKVEETQVQPQTQPTTVTAEDTTTPNG 1010 Query: 247 KSSETNASSSASSAASSATAAGNSAKAAKTSETNARSSETAAGQSASAAAGSKTAAASSA 306 K +E S ++A S + + N A+ + T + + ++ A ++ Sbjct: 1011 KPAEETQPSEKTNAESVTSVSQNQAEKTVSQSTKDKIVVEKEETAKVEKEKTQEAPKVTS 1070 Query: 307 SAASTSAGQASASATAAGKSAESAASSASTATTKAGEATEQASAAARSASAAKTSETNAK 366 + + A +S + + A+ + + +S Sbjct: 1071 QVSPKQEQSETVQPQTALESENVPTVKNAEEVQAQLQTQPSATVSTEQPAKETSSNVEQP 1130 Query: 367 ASETSAESSKTAAASSASSAASSASSASASKDEATRQASAAKSSATTASTK--------- 417 +E++ +++ + + + A ++ S ++ R+ S ++ T+A Sbjct: 1131 VTESTTVNTRNSVVENPQNTTQPAVNSENSTPKSRRKRSVSQPQETSAEETTATSTNETT 1190 Query: 418 -------------ATEAAGSATAAAQSKSTAESAATRAETAAKRAEDIASAVALEDASTT 464 + SA + + + A ++++ S + + Sbjct: 1191 VADNSRRRSRRSVSQPQETSAEETTVTSTEKTTVADNSKSSKPNRRSRRSVRSEPTVANG 1250 Query: 465 KKGIVQLSSATNSTSETLAATPKAVKSAYDNAEKRLQKDQNGADIPDKGCFLNNINAVSK 524 S ST+ + K+ + Q+ + + NI + Sbjct: 1251 SDRSAVALSNLTSTNTNAVISDARAKAQFVALNVGKAVSQHISQLEMNNEGQYNIWVSNT 1310 Query: 525 TDFADKRGMRYVRVNAPAGATSGKY 549 + + +Y R ++ + T + Sbjct: 1311 SMNENYSSSQYRRFSSKSTQTQLGW 1335 >UniRef50_C5BH14 Tail fiber n=2 Tax=Enterobacteriaceae RepID=C5BH14_EDWI9 Length = 593 Score = 124 bits (311), Expect = 1e-26, Method: Composition-based stats. 
Identities = 54/257 (21%), Positives = 97/257 (37%), Gaps = 22/257 (8%) Query: 759 PFAIDNATGELVIGTKLSASLNGNALTATKLQTPRRVSGVEFDGSKDITLTAAH------ 812 P A + G + + + A T ++ + S+ + Sbjct: 296 PTATLDEVGFVRLNDSTDSDSTTQAATPNAVKRTYEEATRAASTSQAGQVLLEDSISSYS 355 Query: 813 -VAAFARRATDTYADADGGVPWNAESGAYNVTRSGDSYILVNFYTGVGSCRTLQMKAHYR 871 A A + +++G + S S T + L+ Sbjct: 356 TTNAATANAVRYAYENAVRPATTSQAGQVLLEDSVSSTST----TNAPTSSALKRTYDRA 411 Query: 872 NGGL------FYRSSRDGYGFEEDWAEVYTSKNLPPESYPVGAPIPWPSDTVPSGYALMQ 925 N + R+S + + Y + ++ + PVG P PWP+ ++PSG+ Sbjct: 412 NSAYDRANSAYDRASSAYSYAGSIYDKAYDAYDIARRAPPVGTPQPWPNTSIPSGWIKCA 471 Query: 926 GQAFDKSAYPKLAAAYPSGVIPDMRGWTIKGKPASG-----RAVLSQEQDGIKSHTHSAS 980 GQ+F S+YP+LA AYP+G +PD+RG I+G G R +LS + D +++ T + Sbjct: 472 GQSFSTSSYPELAKAYPNGRLPDLRGEFIRGYDDYGGTDSQRQILSWQGDAMRNITGTFG 531 Query: 981 ASSTDLGTKTTSSFDYG 997 + T +YG Sbjct: 532 VDDQTIEQVTGVFREYG 548 >UniRef50_Q7N2R3 Similar to tail fiber assembly protein from bacteriophage n=1 Tax=Photorhabdus luminescens subsp. laumondii RepID=Q7N2R3_PHOLL Length = 233 Score = 124 bits (309), Expect = 3e-26, Method: Composition-based stats. Identities = 47/149 (31%), Positives = 71/149 (47%), Gaps = 11/149 (7%) Query: 896 SKNLPPESYPVGAPIPWPSDTVPSGYALMQGQAFDKSAYPKLAAAYPSGVIPDMRGWTIK 955 + + PVG P+P+PS P+GY GQAFDKS YP+LA AYPSG++PD+RG I+ Sbjct: 84 LATIDDITAPVGVPLPYPSRYTPAGYLTCNGQAFDKSRYPQLAIAYPSGILPDLRGEFIR 143 Query: 956 GKPAS-----GRAVLSQEQDGIKSHTHSASASSTDLGTKTTSSFDYGTKSTNNTGAHTH- 1009 G S GR +LS + GI+ H H S + + G +S T+ Sbjct: 144 GWDDSRGVDMGRGMLSWQPAGIQDHMHYKVISKQVVEDLVLA----GNQSWGTEKNSTYT 199 Query: 1010 -SVSGSTNSAGAHTHSLANVNTASANSGA 1037 S+ + ++ G ++ + Sbjct: 200 RSLDQNISTGGVIGTTVNETRPRNIAFNY 228 >UniRef50_A7FIU0 Tail collar domain protein n=1 Tax=Yersinia pseudotuberculosis IP 31758 RepID=A7FIU0_YERP3 Length = 402 Score = 123 bits (307), Expect = 4e-26, Method: Composition-based stats. 
Identities = 62/292 (21%), Positives = 104/292 (35%), Gaps = 46/292 (15%) Query: 728 EYGALWRNDGAKTYLLLTNQ------GDVYGGWNTLRPFAIDNATGELVIGTKLSASLNG 781 + G +WR + A TYL N L + A V + + + G Sbjct: 101 DIG-VWRKENANTYLQTANHFSEIAAAGPAAVAQALTNLGLKEAAKRDVGTSNDTVASGG 159 Query: 782 NALTATKLQTPRRVSGVEFDGSKD-ITLTAAHVAAFARRATDTYADADGGVPWNAESGAY 840 + R V+ + D I L + + A D + N Sbjct: 160 D---------SRIVNAFQLRSVLDEINLNS--LGQDALGIYVQALDVFATLDRNYPITI- 207 Query: 841 NVTRSGDSYILVNFYTGVGSCRTLQMKAHYRNGGLFYRS---SRDGYGFEEDWAEVYTSK 897 ++V + Q + G + R+ +G G DW ++ Sbjct: 208 ------AGSLVVR----PSAYGAQQEYTPFYTGRKYVRNLMGVWNGNGPWSDWIQIGNDV 257 Query: 898 NLPPESYPVGAPIPWPSDTVPSGYALMQGQAFDKSAYPKLAAAYPSGVIPDMRGWTIKGK 957 PVG P+PWP+ PSG+ G F+K+ +P+LA+ Y GV+PD+RG I+G Sbjct: 258 A------PVGIPMPWPAHIPPSGWLKCNGATFNKAQFPQLASVYTRGVLPDLRGEFIRGW 311 Query: 958 PAS-----GRAVLSQEQDGIKSHTHSASASSTDLGTKTTSSFDYGTKSTNNT 1004 GR +LS ++ + + D+ + S +G + TN Sbjct: 312 DDGKLADPGRGLLSFQEGTV--VGGYDDNDTGDISSIGLYSSGFGDQLTNTQ 361 >UniRef50_B5YYN6 Tail fiber protein n=60 Tax=root RepID=B5YYN6_ECO5E Length = 645 Score = 122 bits (304), Expect = 1e-25, Method: Composition-based stats. 
Identities = 135/596 (22%), Positives = 194/596 (32%), Gaps = 58/596 (9%) Query: 1 MAVKISGVLKDGTGKPVQNCTIQLKAKRNSTTVVVNTLASENPDEAGRYSMDVEYGQYSV 60 M+V +SG LK G+ + I L A S + T AS E G Y M ++ G+Y+V Sbjct: 1 MSVVVSGTLKSPDGEAISGANITLTALTVSPDALSGTSASAVTREGGYYGMTMDPGEYAV 60 Query: 61 ILLVEGFPPSHAGTITVYEDSQPGTLNDFLGA-MTEDDARPEALRRFELMVEEVARNASA 119 + V+G + G + + TLN L + E E L F + VA + + Sbjct: 61 SVTVKGKTVVY-GRVRIEGTESTVTLNMLLRRSLVEVSIPGELLTDFRQIQNNVADDLAT 119 Query: 120 VAQNTAAAKKSASDASTSAREAATHAADAADSARAASTSAGQAASSAQSASSSAGTASTK 179 + + + A+ S AA A A+DSA+ A++ A +A A Sbjct: 120 IRRLNEDTATKNTQATQSKESAAASAKSASDSAKTATSRAAEAGQKATD----------- 168 Query: 180 ATEASKSAAAAESSKSAAATSAGAAKTSETNASASLQSAATSASTATTKASEAATSARDA 239 A + + A T+AG A+ S T A S ++A A A A +A Sbjct: 169 ----------ATEAATRAVTAAGNAEESSTRAGESEKAAGADAEKARQHAEKAR------ 212 Query: 240 AASKEAAKSSETNASSSASSAASSATAAGNSAKAAKTSETNARSSETAAGQSASAAAGSK 299 A SA A AA SA+ A+ NAR G++ Sbjct: 213 ------------LAQESAGEILKRAEAATVSAEEARRMAENARGPRGPQGETG------- 253 Query: 300 TAAASSASAASTSAGQASASATAAGKSAESAASSASTATTKAGEATEQASAAARSASAAK 359 T A G+ + A A GE EQ + K Sbjct: 254 -PKGDVGPKGETGPVGPQGPAGPKGERGDVGAQGAVGPAGPRGEKGEQGERGPQGIPGLK 312 Query: 360 TSETNAKASETSAESSKTAAASSASSAASSASSASASKDEATRQASAAKSSATTASTKAT 419 +T + + + + EA Q T Sbjct: 313 -GDTGERGPKGDQGDMGPKGEKGDPGGPAGPQGPKGERGEAGPQGPMGAR-GERGETGPR 370 Query: 420 EAAGSATAAAQSKSTAESAATRAETAAKRAEDIASAVALEDASTTKKGIVQLSSATNSTS 479 G A + T SA + DA+T +KGIVQLSSAT+S Sbjct: 371 GEPGPAGPRGERGETGPQGP------RGEPGPAGSAANVADATTAQKGIVQLSSATDSDD 424 Query: 480 ETLAATPKAVKSAYDNAEKRLQKDQNGADIPDKGCFLNNINAVSKTDFADKRG-MRYVRV 538 ET AATPKAVK+A D A + K + A + A +G Sbjct: 425 ETKAATPKAVKAAMDVANEAKTKAEEAAAGGGVPGPKGDKGDTGPAGPAGPKGDKGERGD 484 Query: 539 NAPAGATSGKYYPVVVMRSAGSVSELASRVIITTATRTAGDPMNNCEFNGFVMPGG 594 P GAT + + + T P G P G Sbjct: 485 TGPVGATGERGPAGDAGPAGPQGPKGDRGERGETGLTGNAGPQGPKGDTGAAGPAG 540 >UniRef50_B3YHG3 Tail fiber protein n=2 Tax=Salmonella enterica subsp. 
enterica serovar Kentucky RepID=B3YHG3_SALET Length = 573 Score = 120 bits (299), Expect = 4e-25, Method: Composition-based stats. Identities = 123/410 (30%), Positives = 171/410 (41%), Gaps = 23/410 (5%) Query: 1 MAVKISGVLKDGTGKPVQNCTIQLKAKRNSTTVVVNTLASENPDEAGRYSMDVEYGQYSV 60 M++ +SG+LK G + I L A S ++ AS + G Y M+V G YS+ Sbjct: 1 MSILVSGILKSPAGAIIAGAQITLTALTTSPDLLAGVSASAVTSDTGYYGMNVLPGVYSL 60 Query: 61 ILLVEGFPPSHAGTITVYEDSQPGTLNDFLGA-MTEDDARPEALRRFELMVEEVARNASA 119 + V G + G+ + TLN L + E E L F + VA + Sbjct: 61 TVAVNGKSQVY-GSFRLDGTETTVTLNMVLRRNLVEVSIPDELLVDFRQIQNNVADDLET 119 Query: 120 VAQNTAAAKKSASDASTSAREAATHAADAADSARAASTSAGQAASSAQSASSSAGTASTK 179 + Q A A +A +A A SA AA S A+ S A Sbjct: 120 IRQLELRAS--------------GSADNAVRTAADAKASAESAARSEADAADSEKKAEQF 165 Query: 180 ATEASKSAAAAESSKSAAATSAGAAKTSETNASASLQSAATSASTATTKASEAATSARDA 239 A + A A S SAAA SA A T A ++ AA S + AS AA S ++A Sbjct: 166 ARNLQDAVAKAGDSASAAALSAAGAGEQATAAKSAALEAADSKAATEKAASNAALSEKNA 225 Query: 240 AASKEAAKSSETNASSSASSAASSATAAGNSAKAAKTSETNARSSETAAGQSASAAAGSK 299 A S AA++SE +SAA SAT A S KAA E + + ET AGQSA+ AA S Sbjct: 226 ADSALAARTSE-------NSAADSATKADASEKAAVLYEQTSSTHETNAGQSAADAALSA 278 Query: 300 TAAASSASAASTSAGQASASATAAGKSAESAASSASTATTKAGEATEQASAAARSASAAK 359 T AA SA A SA +A+ AT A A +A +A+ ATT E Q S S + Sbjct: 279 TKAADSALNAGKSATEAAGYATDAQTQAGNAKRAATDATTAKDEIVRQISGFDEHVSQQE 338 Query: 360 TSETNAKASETSAESSKTAAASSASSAASSASSASASKDEATRQASAAKS 409 T T + ++ A A+ A+ + + + + Sbjct: 339 TVITAKGQTLVDQARTEAINAGQAAQHAAQVLEDAINASIKGEKGDTGEQ 388 >UniRef50_D2TR91 Putative phage tail fibre protein n=1 Tax=Citrobacter rodentium ICC168 RepID=D2TR91_CITRO Length = 279 Score = 119 bits (298), Expect = 5e-25, Method: Composition-based stats. 
Identities = 51/171 (29%), Positives = 71/171 (41%), Gaps = 7/171 (4%) Query: 837 SGAYNVTRSGDSYILVNFYTGVGSCRTLQMKAHYRNGGLFYRSSRDGYGFEEDWAEVYTS 896 +G S T R + +G L ++G + A V Sbjct: 78 TGTPTAPTPASSDNSKKLATTEFVARIISALTETVSGKL--SQEQNGADIPDPEAFVKNL 135 Query: 897 KNLPPESYPVGAPIPWPSDTVPSGYALMQGQAFDKSAYPKLAAAYPSGVIPDMRGWTIKG 956 + PVG P+PWPS T P G+ G F S YPKL AYPSG +PD+RG I+G Sbjct: 136 GLGEGSALPVGVPVPWPSATPPEGWLKCNGATFSSSLYPKLGLAYPSGKLPDLRGEFIRG 195 Query: 957 KPAS-----GRAVLSQEQDGIKSHTHSASASSTDLGTKTTSSFDYGTKSTN 1002 GR++LS + D +SH+H+ S T+ +D T N Sbjct: 196 WDDGRGADNGRSLLSSQGDAFRSHSHNFDRSWGLENFDATAGYDVVTADIN 246 Score = 47.8 bits (111), Expect = 0.002, Method: Composition-based stats. Identities = 29/134 (21%), Positives = 44/134 (32%), Gaps = 14/134 (10%) Query: 995 DYGTKSTNNTGAHTHSVSGSTNSAGAHTH------SLANVNTASANSGAGSASTRL--SV 1046 T + + S S + L + G G+ + R S Sbjct: 152 PSATPPEGWLKCNGATFSSSLYPKLGLAYPSGKLPDLRGEFIRGWDDGRGADNGRSLLSS 211 Query: 1047 VHNQNYATSSAGAHTHSLSGTAASAGAHAHTVGIGAHTHSVAIGSHGHTITVNAAGNAEN 1106 + + S + L A+AG T I + T++V G +E Sbjct: 212 QGDAFRSHSHNFDRSWGLENFDATAGYDVVTADING---KIVNQPTRSTVSV---GGSET 265 Query: 1107 TVKNIAFNYIVRLA 1120 +NIAFNYIVR A Sbjct: 266 RPRNIAFNYIVRAA 279 Score = 43.2 bits (99), Expect = 0.054, Method: Composition-based stats. 
Identities = 32/183 (17%), Positives = 58/183 (31%), Gaps = 14/183 (7%) Query: 421 AAGSATAAAQSKSTAESAATRAETAAKRAEDIASAVALEDASTTKKGIVQLSSATNSTSE 480 A A+ K E + + E + T + A++ S+ Sbjct: 35 AKQLASRTLYLKQQVEQGVSDLADHIAADDPHTQYAPKESPTFT-GTPTAPTPASSDNSK 93 Query: 481 TLAATPKAVK---SAYDNAEKRLQKDQNGADIPDKGCFLNNINAVSKTDFADKRGMRYVR 537 LA T + + + +L ++QNGADIPD F+ N+ + + + Sbjct: 94 KLATTEFVARIISALTETVSGKLSQEQNGADIPDPEAFVKNLGLGEGSALPVGVPVPW-- 151 Query: 538 VNAPAGATSGKYYPVVVMRSAGSVSELASRVIITTATRTAGDPMNNCEFNGFVMPGGWTD 597 P+ + + G+ + + A + P EF G D Sbjct: 152 ---PSATPPEGWL-----KCNGATFSSSLYPKLGLAYPSGKLPDLRGEFIRGWDDGRGAD 203 Query: 598 RGR 600 GR Sbjct: 204 NGR 206 >UniRef50_UPI00016A4B89 phage-related tail fiber protein n=2 Tax=Burkholderia thailandensis RepID=UPI00016A4B89 Length = 654 Score = 118 bits (295), Expect = 1e-24, Method: Composition-based stats. Identities = 104/579 (17%), Positives = 169/579 (29%), Gaps = 115/579 (19%) Query: 630 SVFYVDGAAFP-VFAFIEDGLSISAPGADLVVNDTTYKFGATNPATECIAADVILDFKSG 688 V + P V + G+ A G ++ Sbjct: 101 GVLQLASQGLPAVVLSNDGGVRAGA------AGPVVLVTGNAERVRVTKEGRALVGTADD 154 Query: 689 RGFYESHSLIVNDNLSCKKLFATDEIVARGGNQIRMIGGEYGALWRNDGAKTYLLLTNQG 748 G +L V N S + G Q+R G +YGA RND YL+ T +G Sbjct: 155 NG---RDALQVRGNASASNGVIAGGLDRGDGGQLRACGNQYGAFIRNDDVAVYLMSTKKG 211 Query: 749 DVYGGWNTLRPFAIDNATGELVIGTKLSASLNGNALTATKLQTPRRVSGVEFDGSKDITL 808 D G W+ RP + +G+++I S ++ G T R + + Sbjct: 212 DPLGSWSDWRPLSWSLESGKVLIDGNGSGAVFGGTADFTGDLAVGRNTSEGHMRLGPVDG 271 Query: 809 ---TAAHVAAFARRATDTYA--------DADGGVPWNAESGAYNVTRSGDS--------- 848 + + +Y DG W+ S G + Sbjct: 272 YFYASKQSIGWWSPTIGSYQYIFADRTFRVDGKAVWHEGSLTPLDLNRGGTLKGQLTLDP 331 Query: 849 YILVNFYTGVGSCRTLQM-----------------KAHYRNGGLFYRSSRDGYGFEED-W 890 + G + +L A NG + DG F++ + Sbjct: 332 GARIMLAEGSAAFPSLTFANDGAPDTGLFHIADGHFAVSSNGAQTVHFAPDGTYFDKPAF 391 Query: 891 AEVYTSKNLPPE------------SYPVGAPIPWPSDTVPSGYALMQGQAFDKSAYPKLA 938 + + S VG + +GY G ++ YP L Sbjct: 392 
GGHPNAGDRSNRVATTQWIAGELASAMVGQIVFEMRTAARAGYLKCNGALVKRADYPAL- 450 Query: 939 AAYPSG---------------------------VIPDMRGWTIKGKPAS-----GRAVLS 966 AY G IP++RG ++ R + + Sbjct: 451 WAYAQGSGALVAEKDWMSGNFGCFSDGDGSATFRIPELRGEFLRCWDDGRGSDADRKIGT 510 Query: 967 QEQDGIKSHTHSASASSTDLGTKTTSSFDYGTKSTNNTGAHTHSVSGSTNSAGAHTHSLA 1026 + ++H H+A A T+N G H H TN H + + Sbjct: 511 WQDSMNRTHGHAAGADGVGDHGH--------NAWTDNQGWHGHHGWTGTNGNHNHNNDIF 562 Query: 1027 NVNTASANSG------AGSASTRLSVVHNQNYATSSAGAHTHSLSGTAASAGAHAHTVGI 1080 + +G + + +V + AG H H + AG HAH VG+ Sbjct: 563 SRLLRPPYNGSLTGSDTAGSGSEQAVGGGDSADIRWAGDHNHEFN--TEGAGTHAHNVGV 620 Query: 1081 GAHTHSVAIGSHGHTITVNAAGNAENTVKNIAFNYIVRL 1119 A G+H H I V A G E +N+A ++R Sbjct: 621 AAS------GAHSHAIHVAADGGNEARPRNLAVLAMIRA 653 >UniRef50_Q9AHZ5 YdaB n=29 Tax=Enterobacteriaceae RepID=Q9AHZ5_PHOLU Length = 296 Score = 118 bits (294), Expect = 2e-24, Method: Composition-based stats. Identities = 43/141 (30%), Positives = 58/141 (41%), Gaps = 11/141 (7%) Query: 895 TSKNLPPESYPVGAPIPWPSDTVPSGYALMQGQAFDKSAYPKLAAAYPSGVIPDMRGWTI 954 + PVG PIPWP+ P G+ G FDKS +P+LA AYPSG +PD+RG I Sbjct: 135 DINSSKTNDIPVGTPIPWPTAIPPVGWLQCNGAVFDKSKFPELAKAYPSGYLPDLRGEFI 194 Query: 955 KGKPAS-----GRAVLSQEQDGIKSHTHSASASSTDLGTKTTSSFDYGTKSTNNTGAHTH 1009 +G GR + + D I++ T S + D T YG + Sbjct: 195 RGWDNGRGVDPGRVCSTWQGDAIRNITGSFPGAIADNYHLATKEAFYGKINLGIA----- 249 Query: 1010 SVSGSTNSAGAHTHSLANVNT 1030 G+T S H Sbjct: 250 -TDGTTKSKNIHNPDNPYGFG 269 >UniRef50_Q9LA62 ORF-401-like protein n=1 Tax=Enterobacterial phage P-EibA RepID=Q9LA62_9CAUD Length = 479 Score = 117 bits (291), Expect = 3e-24, Method: Composition-based stats. 
Identities = 102/395 (25%), Positives = 149/395 (37%), Gaps = 3/395 (0%) Query: 1 MAVKISGVLKDGTGKPVQNCTIQLKAKRNSTTVVVNTLASENPDEAGRYSMDVEYGQYSV 60 M V+ISGVLKDGTGKPV CTI+LKA+R + TV+V T+A P+E G YS DVE G Y V Sbjct: 1 MTVRISGVLKDGTGKPVPGCTIELKARRTTETVIVTTVAQGQPEETGSYSFDVEPGWYRV 60 Query: 61 ILLVEGFPPSHAGTITVYEDSQPGTLNDFLGAMTEDDARPEALRRFELMVEEVARNASAV 120 L EG+ PS+ G I V DS+PGTLN FL E P+AL E + E+ + A A Sbjct: 61 TLNTEGYAPSYVGDILVKADSEPGTLNKFLMEQDEAQYYPKALAELEAVAAEILKRAEAS 120 Query: 121 AQNTAAAKKSASDASTSAREAATHAADAADSARAASTSAGQAASSAQSASSSA--GTAST 178 A + AKK A +A A E A A+ + + G + T Sbjct: 121 AASAEEAKKRAENARGPAGEKGDTGPQGATGAKGPAGATGAVGPKGEPGPKGERGETGPQ 180 Query: 179 KATEASKSAAAAESSKSAAATSAGAAKTSETNASASLQSAATSASTATTKASEAATSARD 238 K A + A + AT A A Sbjct: 181 GPKGDKGDPGGPPGPKGDTGPRGEAGPRPQGPAGQTGPKGDKGEPGATGPAGPAGPRGET 240 Query: 239 AAASKEAAKSSETNASSSASSAASSATAAGNSAKAAKTSETNARSSETAAGQSASAAAGS 298 A S + +++S + ++ +T ++ + + +A + Sbjct: 241 GPAGPAGPAGSVASVPDASTSQKGVVQLSSDTNSDDETKAATPKAVKAVMAEVQAAKTKA 300 Query: 299 KTAAASSASAASTSAGQASASATAAGKSAESAASSASTATTKAGEATEQASAAARSASAA 358 + AA +A + A G + + A+ A G ++ A A Sbjct: 301 EEAATRAAVPGPKGDRGEPGAPGAVGPAGPQGPAGAAGAKGAPGPKGDKGEPGAT-GPAG 359 Query: 359 KTSETNAKASETSAESSKTAAASSASSAASSASSA 393 +T A A + S A + Sbjct: 360 PQGKTGPAGPRGPAGPQGAAGRNGNVSTEKYAVGS 394 >UniRef50_D2U1K0 Phage tail protein n=1 Tax=Arsenophonus nasoniae RepID=D2U1K0_9ENTR Length = 366 Score = 117 bits (291), Expect = 3e-24, Method: Composition-based stats. 
Identities = 44/107 (41%), Positives = 58/107 (54%), Gaps = 6/107 (5%) Query: 902 ESYPVGAPIPWPSDTVPSGYALMQGQAFDKSAYPKLAAAYPSGVIPDMRGWTIKGKPAS- 960 +YPVGAPIPWP T P GY + G+ FDK PKL AYPSG +PD+RG+ I+G A Sbjct: 216 NNYPVGAPIPWPQATPPKGYLICNGEPFDKVKCPKLLIAYPSGKLPDLRGYFIRGWDAGK 275 Query: 961 ----GRAVLSQEQDGIKSHTHSASASSTDLGTKTTSSFDYGTKSTNN 1003 GR V S ++D I++ T + G + S D T+ Sbjct: 276 GVDPGREVFSYQEDAIRNITGRIGFARRG-GAEPPVSADGAFVITDW 321 >UniRef50_Q32D03 Putative uncharacterized protein n=2 Tax=root RepID=Q32D03_SHIDS Length = 90 Score = 117 bits (291), Expect = 4e-24, Method: Composition-based stats. Identities = 38/89 (42%), Positives = 50/89 (56%) Query: 1 MAVKISGVLKDGTGKPVQNCTIQLKAKRNSTTVVVNTLASENPDEAGRYSMDVEYGQYSV 60 M+V ISG L DG G P+ I LK++ N+ VV++T+A G Y G+Y V Sbjct: 1 MSVVISGALTDGAGIPMSGYHIILKSRVNTPEVVMHTVADVMTGNDGEYCFHARTGKYGV 60 Query: 61 ILLVEGFPPSHAGTITVYEDSQPGTLNDF 89 L + + G I VYEDS+PGTLNDF Sbjct: 61 YLKQDWRNEYNVGDIAVYEDSKPGTLNDF 89 >UniRef50_D2TSH8 Phage tail fibre protein n=7 Tax=root RepID=D2TSH8_CITRO Length = 617 Score = 117 bits (291), Expect = 4e-24, Method: Composition-based stats. Identities = 63/138 (45%), Positives = 87/138 (63%) Query: 766 TGELVIGTKLSASLNGNALTATKLQTPRRVSGVEFDGSKDITLTAAHVAAFARRATDTYA 825 T LS L+GNA TATKL+T R+++GV FDGS DI+++A +V AFA R T Sbjct: 404 TNRQTFSGGLSGELSGNAATATKLKTARKIAGVGFDGSSDISISAKNVNAFALRQTGNTV 463 Query: 826 DADGGVPWNAESGAYNVTRSGDSYILVNFYTGVGSCRTLQMKAHYRNGGLFYRSSRDGYG 885 + D V WN +SGAYN G S ++++F GSC +Q + +Y+NGG+ YRS+RDGYG Sbjct: 464 NGDTSVGWNWDSGAYNALIGGASALILHFNINAGSCPAVQFRVNYKNGGISYRSARDGYG 523 Query: 886 FEEDWAEVYTSKNLPPES 903 FE W++ YT+ P Sbjct: 524 FELGWSDFYTTTRKPSAG 541 >UniRef50_A9R3H4 Tail collar domain protein n=20 Tax=Yersinia RepID=A9R3H4_YERPG Length = 259 Score = 114 bits (284), Expect = 2e-23, Method: Composition-based stats. 
Identities = 39/117 (33%), Positives = 56/117 (47%), Gaps = 5/117 (4%) Query: 891 AEVYTSKNLPPESYPVGAPIPWPSDTVPSGYALMQGQAFDKSAYPKLAAAYPSGVIPDMR 950 + + VG P+PWP+ T P G+ G FDK YPKLA AYPSG++PD+R Sbjct: 91 QTLANLGLGEGSAILVGVPLPWPTATPPEGWLKCNGAIFDKVKYPKLALAYPSGILPDLR 150 Query: 951 GWTIKGKPAS-----GRAVLSQEQDGIKSHTHSASASSTDLGTKTTSSFDYGTKSTN 1002 G I+G GR +LS + D I++ + + + SS G T+ Sbjct: 151 GEFIRGWDDGLGVDAGREILSIQGDAIRNISGGIQGRNEATSARLFSSNATGVFRTD 207 Score = 48.6 bits (113), Expect = 0.002, Method: Composition-based stats. Identities = 25/147 (17%), Positives = 42/147 (28%), Gaps = 21/147 (14%) Query: 995 DYGTKSTNNTGAHTHSVSGSTNSAGAHTH------SLANVNTASANSGAGSASTRL--SV 1046 T + A + L + G G + R S+ Sbjct: 113 PTATPPEGWLKCNGAIFDKVKYPKLALAYPSGILPDLRGEFIRGWDDGLGVDAGREILSI 172 Query: 1047 VHNQNYATSSA----GAHTHSLSGTAASAGAHAHTVGIGAHTHSVAIGSHGH-----TIT 1097 + S T + ++ + G G++ S + + Sbjct: 173 QGDAIRNISGGIQGRNEATSARLFSSNATGVFRTDGQFGSYAASADVAVGVTDDRLAELF 232 Query: 1098 VNAAG----NAENTVKNIAFNYIVRLA 1120 +A+ EN +NIAFNYIVR A Sbjct: 233 FDASRSVPTANENRPRNIAFNYIVRAA 259 >UniRef50_B4EF34 Putative phage tail protein n=1 Tax=Burkholderia cenocepacia J2315 RepID=B4EF34_BURCJ Length = 883 Score = 113 bits (282), Expect = 3e-23, Method: Composition-based stats. 
Identities = 108/512 (21%), Positives = 155/512 (30%), Gaps = 111/512 (21%) Query: 699 VNDNLSCKKLFATDEIVARG----GNQIRMIGGEYGALWRNDGAKTYLLLTNQGDVYGGW 754 N + + +R G R I G YGA RNDG YLL TN+GD G W Sbjct: 391 RNAIQAAGNATFNGGLTSRAMDVNGAHFRAIFGGYGAFLRNDGTNVYLLSTNKGDPEGQW 450 Query: 755 NTLRPFAIDNATGELVIGTKLSASLNGN------ALTATKLQTPRRVSGVEFDGSKDITL 808 N RP + TG + I + S + G L+ Q + DG Sbjct: 451 NDFRPVTWNLETGRVTIDDRGSGTAIGGNTTVRGELSVGTGQAQGSIRVGPVDGFLYSNG 510 Query: 809 TAA-----HVAAFARRATDTYADADGGVPWNAESGAYNVTRSGDSYILV-------NFYT 856 AF D DG W++ + + G + Y Sbjct: 511 DGYGWWTPTKGAFQYYIADRTFRIDGNPVWHSGNLSPLDRTKGGTMSGDLWFDPGKRIYL 570 Query: 857 GVGSCRTLQMKAHYRNGGLFYRSSRDGYGFEEDWAEVY---------TSKNLPPESYPV- 906 GS + + F V T + P Sbjct: 571 SEGSAQAPSLTFTNDGAPDTGLYHTGDGMFGVTCNSVSQVSFTPGGTTFQTPVQGQTPPA 630 Query: 907 ----------------------GAPIPWPSDTVPSGYALMQGQAFDKSAYPKLAAAYPSG 944 G + P T +G+ + G ++ YP L AY Sbjct: 631 GDRSTRLATTEWVVAAIASAAIGTVVFEPRTTARAGFLKLNGALLKRADYPAL-WAYAQA 689 Query: 945 ---------------------------VIPDMRGWTIKGKPAS-----GRAVLSQEQDGI 972 IP++RG ++ + R + + + Sbjct: 690 SGALSTETDWAAGWSGTFSTGDGTTTFRIPELRGEFVRCWDDTRGVDPNRGLGASQN--- 746 Query: 973 KSHTHSASASSTDLGTKTTSSFDY-GTKSTNNTGAHTHSVSGSTNSAGAHTHSLANVNTA 1031 + + G +S D+ + T+ G H H G T S G H H + Sbjct: 747 ------FANAWHAHGASAAASGDHVHSAWTDVQGWHGHH--GWTASVGDHQHVAPYSESG 798 Query: 1032 SANSGAGSASTRLSVVHNQNYA----TSSAGAHTHSLSGTAASAGAHAHTVGIGAHTHSV 1087 A G S + S N TS AG H H + AG H H VGIGA Sbjct: 799 IAPFGTHSTNQVGSHGGVDNDNPWAFTSGAGGHNHEFN--TEGAGNHGHNVGIGA----- 851 Query: 1088 AIGSHGHTITVNAAGNAENTVKNIAFNYIVRL 1119 G+H H ITVN G E+ +N+A ++R Sbjct: 852 -AGNHSHAITVNGDGANESRPRNVALLAMIRA 882 >UniRef50_Q38190 Gp37, tip of tail fiber (Fragment) n=5 Tax=Enterobacteria phage T4 RepID=Q38190_BPT4 Length = 226 Score = 112 bits (280), Expect = 6e-23, Method: Composition-based stats. 
Identities = 111/226 (49%), Positives = 131/226 (57%), Gaps = 59/226 (26%) Query: 953 TIKGKPASGRAVLSQEQDGIKSHTHSASASSTDLGTKTTSSFDYGTKSTNNTGAHTHSVS 1012 TIKGKP SGRAVLS E DG+K+H+HSASASSTDLGTKTTSSFDYGTK TN+TG HTHS S Sbjct: 1 TIKGKP-SGRAVLSAEADGVKAHSHSASASSTDLGTKTTSSFDYGTKGTNSTGGHTHSGS 59 Query: 1013 GSTNSAGAHTHSLANVNTASA------------NSGAGSASTRLSVVHNQNYATSSAGAH 1060 GST++ G H+H + N +G + + + H ++ TSSAG H Sbjct: 60 GSTSTNGEHSHYIEAWNGTGVGGNKMSSYAISYRAGGSNTNAAGNHSHTFSFGTSSAGDH 119 Query: 1061 THS----------------------------------------------LSGTAASAGAH 1074 +HS S +SAG H Sbjct: 120 SHSVGIGEHSHYIEAWNGTGVGGNKMSSYAISYRAGGSNTNAAGNHSHTFSFGTSSAGDH 179 Query: 1075 AHTVGIGAHTHSVAIGSHGHTITVNAAGNAENTVKNIAFNYIVRLA 1120 +H+VGIGAHTH+VAIGSHGHTITVN+ GN ENTVKNIAFNYIV LA Sbjct: 180 SHSVGIGAHTHTVAIGSHGHTITVNSTGNTENTVKNIAFNYIVALA 225 >UniRef50_C6CP88 Tail Collar domain protein n=5 Tax=Enterobacteriaceae RepID=C6CP88_DICZE Length = 485 Score = 112 bits (278), Expect = 1e-22, Method: Composition-based stats. Identities = 52/178 (29%), Positives = 75/178 (42%), Gaps = 32/178 (17%) Query: 847 DSYILVNFYTGVGSCRTLQMKAHYRNG---GLFYRSSRDGYGFEEDWAEVYTSKNLPPES 903 + V T + +Q Y G +F RS G W E+ + + Sbjct: 258 AGSLDVEKNTADSAEGCIQRYTTYGGGALPRMFIRSYNAGKQVWGAWQELASLSSPTFTG 317 Query: 904 YP------------------------VGAPIPWPSDTVPSGYALMQGQAFDKSAYPKLAA 939 P G P+PWP VP+G+ GQAFDK+ YP+LA Sbjct: 318 TPTAPTAEAGSNTTQLATTAWFAAEIAGIPLPWPQAAVPTGWLKCNGQAFDKNRYPRLAQ 377 Query: 940 AYPSGVIPDMRGWTIKGKP-----ASGRAVLSQEQDGIKSHTHSASASSTDLGTKTTS 992 YPSGV+PD+RG I+G SGR VLSQ++ + ++ SA ++D + S Sbjct: 378 VYPSGVLPDLRGEFIRGWDDGRGVDSGREVLSQQRGSLINYDGPDSAPTSDSLRLSVS 435 Score = 47.8 bits (111), Expect = 0.003, Method: Composition-based stats. 
Identities = 39/209 (18%), Positives = 66/209 (31%), Gaps = 31/209 (14%) Query: 940 AYPSGVIPDMRGWTIKGKPASGRAVLSQEQDGI---KSHTHSASASSTDLGTKTTS---- 992 Y G +P M I+ A + + ++ + T + +A + + G+ TT Sbjct: 280 TYGGGALPRM---FIRSYNAGKQVWGAWQELASLSSPTFTGTPTAPTAEAGSNTTQLATT 336 Query: 993 ------------SFDYGTKSTNNTGAHTHSVSGSTNSAGAHTH------SLANVNTASAN 1034 + T + + + A + L + Sbjct: 337 AWFAAEIAGIPLPWPQAAVPTGWLKCNGQAFDKNRYPRLAQVYPSGVLPDLRGEFIRGWD 396 Query: 1035 SGAGSASTRL--SVVHNQNYATSSAGAHTHSLSGTAASAGAHAHTVGIGAHTHSVAIGSH 1092 G G S R S + S S + + A A V + + + Sbjct: 397 DGRGVDSGREVLSQQRGSLINYDGPDSAPTSDSLRLSVSAAQADAVSASEYAGVMLSYTA 456 Query: 1093 GHTITVNAAGNA-ENTVKNIAFNYIVRLA 1120 + TV+AAG +NIAFNYIVR A Sbjct: 457 YNITTVSAAGYVGATRPRNIAFNYIVRAA 485 >UniRef50_B7MWN9 Putative tail fiber protein (GpH) n=2 Tax=Escherichia coli RepID=B7MWN9_ECO81 Length = 701 Score = 112 bits (278), Expect = 1e-22, Method: Composition-based stats. Identities = 101/354 (28%), Positives = 152/354 (42%), Gaps = 5/354 (1%) Query: 253 ASSSASSAASSATAAGNSAKAAKTSETNARSSETAAGQSASAAAGSKTAAASSASAASTS 312 AS + ++ A + +R A+ + S ++S + A+T Sbjct: 135 ASVELTIDTTTVMATQDYVDDKIAEHEQSRRHPDASLTAKGFTQLSSATNSASETLAATP 194 Query: 313 AGQASASATAAGKSAESAASSASTATTKAGEATEQASAAARSASAAKTSETNAKASETSA 372 +A A GK A++A + AT S + A + + + +A Sbjct: 195 KAVKAAYDLANGKYTAQDATTARKGLVQLSSATNSTSETLAATPKAVKAAYDLANGKYTA 254 Query: 373 ESSKTAAAS--SASSAASSASSASASKDEATRQASAAKSSATTASTKATEAAGSATAAAQ 430 + + TA SSA +S S A+ +A + A + TA T G ++ Sbjct: 255 QDATTARKGLVQLSSATNSDSETLAATPKAVKVAYDLANGKYTAQDATTARKGLVQLSSA 314 Query: 431 SKSTAESAATRAETAAKRAEDIASAVALEDASTTKKGIVQLSSATNSTSETLAATPKAVK 490 + S +E+ A + + +DA+T +KG+VQLSSATNS SETLAATPKAVK Sbjct: 315 TNSDSETLAATPKAVKVAYDLANGKYTAQDATTARKGLVQLSSATNSDSETLAATPKAVK 374 Query: 491 SAYDNAEKRLQKDQNGADIPDKGCFLNNINAVSKTDFADKRGMRYVRVNAPAGATSGKYY 550 SAYDNAEKRLQKDQNGADIPDK FL NI A + T + G + R+ + Sbjct: 375 SAYDNAEKRLQKDQNGADIPDKRLFLRNIGATNSTTMSFSGGTGWFRLATVTMPQASSVV 434 Query: 551 
PVVVMRSAG-SVSELASRVIITTATRTAGDPMNNCEFNGFVMPGGWTDRGRYAY 603 + ++ AG +V+ I R N G + +A+ Sbjct: 435 YISLIGGAGYNVNSPMQAGISELVLRAGN--GNPKGLTGALWRRTSVGFTNFAW 486 >UniRef50_B1LMZ0 Putative phage tail fiber protein n=4 Tax=Enterobacteriaceae RepID=B1LMZ0_ECOSM Length = 673 Score = 109 bits (272), Expect = 5e-22, Method: Composition-based stats. Identities = 99/540 (18%), Positives = 158/540 (29%), Gaps = 61/540 (11%) Query: 395 ASKDEATRQASAAKSSATTASTKATEAAGSATAAAQSKSTAESAATRAETAAKRAEDIAS 454 A + A A + + T S+ A+ + + + + Sbjct: 103 AVANMAESYKPALAEGSGRSQTCRMVIIVSSVASVDLTIDTTTVMATQDYVDDKIAEHEQ 162 Query: 455 AVALEDASTTKKGIVQLSSATNSTSETLAATPKAVKSAYDNAEKRLQKDQN----GADIP 510 + DAS T KG QLSSATNSTSETLAATPKAVK+ D K+ + P Sbjct: 163 SRRHPDASLTAKGFTQLSSATNSTSETLAATPKAVKTVMDETNKKAPLNSPALTGTPTTP 222 Query: 511 DKGCFLNNINAVSKT-----DFADKRGMRYVRVNAPAGATSGKYYPVVVMR-SAGSVSEL 564 NN + A A + P + + Sbjct: 223 TARQGTNNTQIANTAFVMAAIAALVDSSPDALNTLNELAAALGNDPNFATTMTNALAGKQ 282 Query: 565 ASRVIITTATRTAGDPMNNCEFNGFVMP--GGWTDRGRYAYGMFWQYQNNERAIHSIMMS 622 +T A F G + T GR I + + Sbjct: 283 PKDATLTALAGLATAADRFPYFIGNDVASLATLTKVGRDILAKSTVAA----VIEYLGLQ 338 Query: 623 NKGDDLRSVFYVDGAAFPVFAFIEDGLSISAPGADLVVNDTTYKFGATNPATECIAADVI 682 + + +G E+ + N K G N A + + Sbjct: 339 ETVNRAGNAVQKNGDTLSGGLTFEND-----SILAWIRNTDWAKIGFKNDADGDTDSFMW 393 Query: 683 LDF-KSGRGFYESHSLIVNDNLSCKKL-----------FATDEIVARGGNQIRMIGGEYG 730 + +G +++ S L E++++ N +R+ G YG Sbjct: 394 FETGDNGNEYFKWRSRQSTTTKDLMNLKWDALYVLVNAIVNGEVISKSANGLRIAYGNYG 453 Query: 731 ALWRNDGAKTYLLLTNQGDVYGGWNTLRPFAIDNATGELVIGTK----LSASLNGNALTA 786 RNDG+ TY +LTN GD G +N LRP I+NATG + +G + A+ + Sbjct: 454 FFIRNDGSNTYFMLTNSGDNMGTYNGLRPLWINNATGAVSMGRGLNVSGETLSDRFAINS 513 Query: 787 TKLQTPRRVSGVEFDGSKDITLTAAH-------------VAAFARRATDTYADADGGVPW 833 + + G + +A + + Y + Sbjct: 514 SNGMWIQMRDNNAIFGKNIVNTDSAQALLRQNHADRKFMIGGLGNKQFGIYMINNSRTAN 573 Query: 834 NAESGAYNVTRSG---DSYILVNFYTGVGSCRTLQMKAHYRNGGLFYRSSRDGYGFEEDW 890 + AY S ++ Y S Y G + W Sbjct: 574 
GTDGQAYMDNNGNWLCGSQVIPGNYGNFDS--------RYVKDVRLGSQQYYGVNNWQTW 625 >UniRef50_D2U2G6 Phage tail assembly protein n=1 Tax=Arsenophonus nasoniae RepID=D2U2G6_9ENTR Length = 580 Score = 108 bits (269), Expect = 1e-21, Method: Composition-based stats. Identities = 57/230 (24%), Positives = 85/230 (36%), Gaps = 38/230 (16%) Query: 842 VTRSGDSYILVNFYTG------VGSCRTLQMKAHYRNGG---LFYRSSRDGYGFEEDWAE 892 + + + + G + +N + S +DG E Sbjct: 367 RSNGNYTSLTLIKGDGQKLGFETAPGDAYFVYRDAKNNNKAVVTIPSKKDGTLALTSDVE 426 Query: 893 VYTSKNLPPESYPVGAPIPWPSDTVPSGYALMQGQAFDKSAYPKLAAAYPSGVIPDMRGW 952 +YPVGAPIPWP T P+GY + G FDK+ YP+LA AYPSG +P + G Sbjct: 427 AIN-------NYPVGAPIPWPQATPPNGYFVCDGNYFDKAKYPQLALAYPSGKLPLLYGE 479 Query: 953 TIKGKP-----ASGRAVLSQEQDGIKSHTHSASASSTDLGTKTTSSFDYGTKSTNNTGAH 1007 I+G GR VLS + D I++ T + GT+ G Sbjct: 480 FIRGLDLGRKVDPGRTVLSNQGDAIRNITGRIGYARHG-----------GTEPPVVNG-- 526 Query: 1008 THSVSGSTNSAGAHTHSLANVNTASANSGAGSASTRLSVVHNQNYATSSA 1057 G H ++AN S ++R+ N+N + A Sbjct: 527 ----EGVFRRDSNHNVNIANGRGDDWGSVMSFNASRVVPTANENRPRNVA 572 >UniRef50_C4S5W0 Putative uncharacterized protein n=2 Tax=Yersinia bercovieri ATCC 43970 RepID=C4S5W0_YERBE Length = 388 Score = 108 bits (268), Expect = 1e-21, Method: Composition-based stats. 
Identities = 74/253 (29%), Positives = 103/253 (40%), Gaps = 23/253 (9%) Query: 779 LNGNALTATKLQTPRRVSGVEFDGSKDITLTAAHVAAFARRATDTYADADGGVPWNAESG 838 ++G A +ATKL T R++ GV+FDG+KDITL + + G G Sbjct: 119 IDGTAASATKLATARKIGGVDFDGTKDITLPFINTTDANVQLAGNL----GAQGNITVQG 174 Query: 839 AYNVTRSGDSYILVNFYTGVGSCRTLQMKAHYRNGGLFYRSS---RDGYGFEEDWAEVYT 895 + + + + V R + A Y S+ DG W Sbjct: 175 SLIAKTNIAADGRCDIVGDVRGGRIISKGAVYAGEERVEGSAALIVDGNIQGTLWG--GN 232 Query: 896 SKNLPPESYPVGAPIPWPSDTVPSGYALMQGQAFDKSAYPKLAAAYPSGVIPDMRGWTIK 955 E +G PIP+P +VP GY G AF YPKLA YPSGV+PDMRG I+ Sbjct: 233 LYAYLAERELIGIPIPYPLPSVPVGYLKCNGAAFSTVTYPKLALKYPSGVLPDMRGNAIR 292 Query: 956 GKPAS-----GRAVLSQEQDGIKSHTHSA--SASSTDLGTKTTSSF-------DYGTKST 1001 G GRA+LSQ+ D +++ T + S G TT +F G + T Sbjct: 293 GWDDGRGVDAGRALLSQQLDALQNITGNFYMGGSKQVAGVVTTGAFGPMEVYNALGNQVT 352 Query: 1002 NNTGAHTHSVSGS 1014 + S Sbjct: 353 TAGNIGGITFDAS 365 Score = 44.4 bits (102), Expect = 0.024, Method: Composition-based stats. Identities = 25/140 (17%), Positives = 44/140 (31%), Gaps = 15/140 (10%) Query: 994 FDYGTKSTNNTGAHTHSVSGSTNSAGAHTH------SLANVNTASANSGAGSASTRL--S 1045 + + + + S T A + + + G G + R S Sbjct: 249 YPLPSVPVGYLKCNGAAFSTVTYPKLALKYPSGVLPDMRGNAIRGWDDGRGVDAGRALLS 308 Query: 1046 VVHNQNYATSSAG--AHTHSLSGTAASAGAHAHTVGIGAHTHSVAIGSHGHTITVNAAG- 1102 + + + ++G + V G+ G IT +A+ Sbjct: 309 QQLDALQNITGNFYMGGSKQVAGVVTTGAFGPMEVYNALGNQVTTAGNIG-GITFDASRV 367 Query: 1103 ---NAENTVKNIAFNYIVRL 1119 AE ++NIAFNYIVR Sbjct: 368 SRTAAETRMRNIAFNYIVRA 387 >UniRef50_C4UEH4 Variable tail fiber protein n=3 Tax=Yersinia RepID=C4UEH4_YERAL Length = 387 Score = 107 bits (265), Expect = 4e-21, Method: Composition-based stats. 
Identities = 64/302 (21%), Positives = 100/302 (33%), Gaps = 36/302 (11%) Query: 738 AKTYLLLTNQGDVYGGWNTLRPFAIDNATG----------ELVIGTKLSASLNGNALTAT 787 AK + L N D Y P A+ A + + N Sbjct: 55 AKGFTRLNNAIDSYIETEAATPKAVQKAVSAAVALMSRHLDTPYPHPQYLLASQNLYELI 114 Query: 788 KLQTPRRVSGVEFDGSKDITLTAAH----VAAFARR---ATDTYADADGGVPWNAESGAY 840 + R+ + ++++ A V AF + Y Sbjct: 115 DKKAARKNLQLGSAATRNV-GNAQDELMAVGAFGWGRSCIIASAGINTLAATGMYSVNQY 173 Query: 841 NVTRSGDSYILVNFYTGVGSCRTLQMKAHYRN----GGLFYRSSRDGYGFEEDWAEVYTS 896 + S Q N + YR YG +W ++ TS Sbjct: 174 AANIPEGFGDATIQHIQNDSLTAHQFIFSTNNTHTAAKIAYRLR--SYGQWREWIDIVTS 231 Query: 897 KNLPPESYPVGAPIPWPSDTVPSGYALMQGQAFDKSAYPKLAAAYPSGVIPDMRGWTIKG 956 ++ P+G P+P+P T P+GY G AF YP LA YP+ +PD+RG I+G Sbjct: 232 RS--DTLTPIGIPLPYPGTTPPAGYLKCNGAAFYPYRYPTLATLYPTHKLPDLRGEFIRG 289 Query: 957 KPAS-----GRAVLSQEQDGIKSHTHSASASSTDLGTKTTSSFDYG-----TKSTNNTGA 1006 R +LS + D +++ T + S LG S+F + +NT Sbjct: 290 FDDGRGIDTSRTLLSAQTDALQNITGGINGVSESLGIAAESNFTGAFAKAESVGNDNTPH 349 Query: 1007 HT 1008 HT Sbjct: 350 HT 351 >UniRef50_C7BSQ6 Putative tail fiber protein n=1 Tax=Photorhabdus asymbiotica subsp. asymbiotica ATCC 43949 RepID=C7BSQ6_PHOAA Length = 318 Score = 105 bits (260), Expect = 1e-20, Method: Composition-based stats. 
Identities = 54/203 (26%), Positives = 84/203 (41%), Gaps = 41/203 (20%) Query: 786 ATKLQTPRRVSGVEFDGSKDITLTAAHVAAFARRATDTYADADGGVPWNAESGAYNVTRS 845 ++ R+V+G E S DI L+A V A +DG V + Sbjct: 137 NNRVPNTRKVNGKEL--STDINLSAVDVGALP---------SDGAV------------IA 173 Query: 846 GDSYILVNFYTGVGSCRTLQMKAHYRNGGLFYRSSRDGYGFEEDWAEVYTSKNLPPESYP 905 + GV T + N G + ++ + P Sbjct: 174 ANKLATARTIAGVAFDGTANINIPAGNVGAYTKAE-------------VNDLINTVNNIP 220 Query: 906 VGAPIPWPSDTVPSGYALMQGQAFDKSAYPKLAAAYPSGVIPDMRGWTIKGKP-----AS 960 VG PIPWP+ P+G+ G AFDKS +P+L AAY SGV+PD+RG I+G + Sbjct: 221 VGVPIPWPTAIPPTGWLQCNGAAFDKSKFPQLVAAYSSGVLPDLRGEFIRGWDSSRGVDT 280 Query: 961 GRAVLSQEQDGIKSHTHSASASS 983 R++LS + D +++ T + + Sbjct: 281 NRSILSTQIDTMQNITGKVDSHN 303 >UniRef50_C6V0Q3 Phage tail fiber protein n=13 Tax=Enterobacteriaceae RepID=C6V0Q3_ECO5T Length = 439 Score = 105 bits (260), Expect = 1e-20, Method: Composition-based stats. Identities = 37/108 (34%), Positives = 52/108 (48%), Gaps = 5/108 (4%) Query: 896 SKNLPPESYPVGAPIPWPSDTVPSGYALMQGQAFDKSAYPKLAAAYPSGVIPDMRGWTIK 955 + PVG P+PWPS T P+G+ G AF YP+LA YP+ +PD+RG I+ Sbjct: 276 LGLGEGSALPVGVPVPWPSATPPTGWLKCNGAAFSAEEYPELAKVYPTNKLPDLRGEFIR 335 Query: 956 GKPAS-----GRAVLSQEQDGIKSHTHSASASSTDLGTKTTSSFDYGT 998 G GR +L+ + I SH H ++ +T SF T Sbjct: 336 GWDDGRGVDNGRGLLTLQDGAIVSHNHYWGIWTSRTNDQTLESFTGTT 383 Score = 91.0 bits (223), Expect = 3e-16, Method: Composition-based stats. 
Identities = 74/442 (16%), Positives = 118/442 (26%), Gaps = 66/442 (14%) Query: 739 KTYLLLTNQG-------DVYGGWNTLRPFAIDNATGELVIGTKLSASLNGNALTATKLQT 791 K Y +LTNQG + G L A+ +A G L L K Sbjct: 4 KYYAILTNQGAARLANATMLGSKLNLTQMAVGDANGVLPTPDPAQTK-----LINQKRIA 58 Query: 792 PRRVSGVEFDGSKDI---TLTAAHVAAFARRATDTYADADG--GVPWNAESGAYNVTRSG 846 P + V+ + I + + F R Y D V E+ + Sbjct: 59 PLNLLSVDPNNQSQIIAEQIIPENEGGFWIREIGLYDDEGVLIAVANCPETYKPQLQEGS 118 Query: 847 DSYILVNFYTGVGSCRTLQMKAHYRNGGLFYRSSRDGY------GFEEDWAEVYTSKNLP 900 + V + + +K + L R D ++ +++ Sbjct: 119 GRTQTIRMILVVTNTEAITLKID-PSVVLATRKYVDDKVLELRLYVDDQMRNHIAAQDPH 177 Query: 901 PESYPVGAPIPWPSDTVPSGYALMQGQAFDKSAYPKLAAAYPSGVIPDMR---------- 950 + P P+ A + + + A PD Sbjct: 178 TQYAQKHNPTFTGEPKAPTPAAGNNTTRIATTEFVQTAITALINGAPDTLDTLKEIAAAI 237 Query: 951 GWTIKGKPASGRAVLSQEQ-DGIKSHTHSASASS--TDLGTKTTSSFDYG--------TK 999 K A+ ++ D +H + LG S+ G T Sbjct: 238 NNDPKFSTTINNALSGKQPLDETLTHLSGKDVAGLLAYLGLGEGSALPVGVPVPWPSATP 297 Query: 1000 STNNTGAHTHSVSGSTNSAGAHTH------SLANVNTASANSGAGSASTRL--------- 1044 T + + S A + L + G G + R Sbjct: 298 PTGWLKCNGAAFSAEEYPELAKVYPTNKLPDLRGEFIRGWDDGRGVDNGRGLLTLQDGAI 357 Query: 1045 -SVVHNQNYATSSAGAHT-HSLSGTAASAGAHAHTVGIGAHTHSVAIGSHGHTITVNA-- 1100 S H TS T S +GT + I + + + V A Sbjct: 358 VSHNHYWGIWTSRTNDQTLESFTGTTILKQITPLSPAINFDNYPIPNPAITEGGVVAATT 417 Query: 1101 --AGNAENTVKNIAFNYIVRLA 1120 AG E +N+AFNYIVR A Sbjct: 418 KPAGANETRPRNVAFNYIVRAA 439 >UniRef50_Q7NAA0 Complete genome; segment 1/17 n=2 Tax=Photorhabdus RepID=Q7NAA0_PHOLL Length = 351 Score = 104 bits (258), Expect = 2e-20, Method: Composition-based stats. 
Identities = 72/330 (21%), Positives = 116/330 (35%), Gaps = 32/330 (9%) Query: 718 GGNQIRMIGGEYGALWRNDGAKTYLLLTNQGDVYGGWNTLRPFAIDNATGELVIGTKLSA 777 GN GG Y +L G Y+ + G + A G + Sbjct: 22 SGNHNIKTGGNYSSLSLTKGDGRYVFIETTSHEEGSFC---------AIGHRESNSSNIN 72 Query: 778 SLNGNALTATKLQTPRRVSGVEFDGS-KDITLTAAHVAAFARRATDTYADAD----GGVP 832 ++ + S E D + + + Y D + G Sbjct: 73 VVHLPRKSGYIAIANEHYSKSESDSRFIQLNTDTKTSGYILVKTANYYDDPNSRHLGRSG 132 Query: 833 WNAESGAYNVTRSGDSYILVNFYTGVGSCRTLQMKAHYRNGGLFYRSSRDGYGFEEDWAE 892 + +G N+ + ++ + GG S Y F+E Sbjct: 133 FLRPNGIDNL-----GALAIHIAHPDVDSPQHARGLSFGYGGYSEAFSVSTYAFDESGNF 187 Query: 893 VYTSKNLPPESYPVGAPIPWPSDTVPSGYALMQGQAFDKSAYPKLAAAYPSGVIPDMRGW 952 K L + VG P+PW T P+GY + GQ FDKS YPKL AYPSG +PD+RG Sbjct: 188 RGKRKILTEDDILVGIPLPWSKPTAPAGYLICSGQQFDKSMYPKLGEAYPSGALPDLRGE 247 Query: 953 TIKGKP-----ASGRAVLSQEQDG-IKS-HTHSASASSTDLGTKTTSSFDYGTKSTNNTG 1005 I+G SGR +LS + + + +TH+AS + L + + F ++N Sbjct: 248 FIRGWDNGRSIDSGREILSHQNSTKLPNLYTHAASENIGLLVSPPINHF------SSNYP 301 Query: 1006 AHTHSVSGSTNSAGAHTHSLANVNTASANS 1035 + + G+ + +N + S Sbjct: 302 SEIMASDFEEAEFGSGQYFSTPLNPTGSVS 331 >UniRef50_D2BYH6 Tail Collar domain protein n=1 Tax=Dickeya dadantii Ech586 RepID=D2BYH6_DICD5 Length = 198 Score = 104 bits (258), Expect = 2e-20, Method: Composition-based stats. Identities = 36/102 (35%), Positives = 50/102 (49%), Gaps = 7/102 (6%) Query: 906 VGAPIPWPSDTVPSGYALMQGQAFDKSAYPKLAAAYPSGVIPDMRGWTIKGKPAS----- 960 VG P WP P G+ GQAFDK+ YP+LA YP+G +PD+RG I+G Sbjct: 59 VGIPQAWPLADAPEGWLKCNGQAFDKTKYPQLAKLYPAGTLPDLRGEFIRGWDDGRGVDT 118 Query: 961 GRAVLSQEQDGIKSHTH--SASASSTDLGTKTTSSFDYGTKS 1000 R +LS + ++SH H S S G + D + + Sbjct: 119 NRQILSAQSGMLESHNHMMPVSDPSKWNGAVYGYANDQPSAN 160 >UniRef50_Q2T5M0 Phage-related tail fiber protein n=19 Tax=root RepID=Q2T5M0_BURTA Length = 790 Score = 104 bits (257), Expect = 3e-20, Method: Composition-based stats. 
Identities = 109/514 (21%), Positives = 171/514 (33%), Gaps = 108/514 (21%) Query: 696 SLIVNDNLSCKKLFATDEIVARG-GNQIRMIGGEYGALWRNDGAKTYLLLTNQGDVYGGW 754 +L V + + A I A G G Q R + YGA RNDG Y L T +G GG+ Sbjct: 294 ALQVRGGVDASEGVAARAIDAGGAGGQFRAVYDGYGAFIRNDGRSVYFLSTPKGAPDGGF 353 Query: 755 NTLRPFAIDNATGELVIGTKLSASLNGNAL-TATKLQTPRRVS-GVEFDGSKDITLTAAH 812 N RPF+ +TG++++ + ++ G A+ A L+ R+ S G G D L A Sbjct: 354 NDYRPFSWSLSTGQVIVDGSGAGTVFGGAVDVARDLEVGRQASEGHIKLGPVDGYLYANP 413 Query: 813 VAAFARRATDTYADA---------DGGVPW---------NAESGAYNVTRSGDSYILVNF 854 V+ + DG + W ++ G S + Sbjct: 414 VSTGWWSPAGSSYQYIFADHTFRIDGRMAWHEGNLDPLDKSKGGMLAGDVSFAPGKRLVL 473 Query: 855 YTGVGSCRTLQMKAHYRNGGLFYRSSRDGYGFEED-------------WAEVYTSKNLPP 901 G + +L Y ++ +G + + + T P Sbjct: 474 AEGSPAAPSLTFANDGAPDTGLYHAADGEFGVTCNGRAVVRFSPALVAFEQPVTVPTPPA 533 Query: 902 ESYP-----------------VGAPIPWPSDTVPSGYALMQGQAFDKSAYPKLAAAYPSG 944 +G + P TV G+ G +++ YP+L AY Sbjct: 534 ADRSTRAATTEWVRTVLSATTIGQIVFEPRTTVRPGFLKANGVLVNRADYPEL-WAYAQA 592 Query: 945 ---------------------------VIPDMRGWTIKGKPAS------GRAVLSQEQDG 971 +P++RG I+ + R + + + D Sbjct: 593 SGALVSDADWMKDRWGCFSTGDGATTFRLPELRGEFIRCWSDARGGVDATRQIGAFQGDQ 652 Query: 972 IKSHTHSASASSTDLGTKTTSSFDYGTKSTNNTGAHTHSVSGSTNSAGAHTHSLANVNTA 1031 +H H A+AS T T+ G H H G TN+ G H H + Sbjct: 653 NHTHAHGAAASEAPDHVHT--------AWTDVQGWHGHH--GWTNAVGDHQH--VSPWGE 700 Query: 1032 SANSGAGSASTRLSVVHNQNYATSSAGAHTHSLSGTAASAGAHAH---TVGIGAHTHSVA 1088 T + G+ ++ G + AG H H T G G H H+V Sbjct: 701 HPQMYNPPWGTW-----GAANNRGAEGSDNDNVYGMTSPAGNHNHEFNTEGNGNHGHAVG 755 Query: 1089 IGS---HGHTITVNAAGNAENTVKNIAFNYIVRL 1119 IG H HTI V G E +N+A ++R Sbjct: 756 IGGGGRHAHTIAVQPDGGDEARPRNVALLALIRA 789 >UniRef50_P33227 Putative protein stfE (Fragment) n=60 Tax=Enterobacteriaceae RepID=STFE_ECOLI Length = 166 Score = 102 bits (254), Expect = 6e-20, Method: Composition-based stats. 
Identities = 36/98 (36%), Positives = 53/98 (54%), Gaps = 5/98 (5%) Query: 902 ESYPVGAPIPWPSDTVPSGYALMQGQAFDKSAYPKLAAAYPSGVIPDMRGWTIKGKPAS- 960 + PVG P+PWPS T P+G+ G AF YP+LA AYP+ +PD+RG I+G Sbjct: 7 SALPVGVPVPWPSATPPTGWLKCNGAAFSAEEYPELAKAYPTNKLPDLRGEFIRGWDDGR 66 Query: 961 ----GRAVLSQEQDGIKSHTHSASASSTDLGTKTTSSF 994 GR++LS + + H H + ST + T + + Sbjct: 67 GIDTGRSILSIQGYATEDHAHGLPSRSTIVTDATINFY 104 Score = 46.3 bits (107), Expect = 0.007, Method: Composition-based stats. Identities = 28/149 (18%), Positives = 46/149 (30%), Gaps = 23/149 (15%) Query: 995 DYGTKSTNNTGAHTHSVSGSTNSAGAHTH------SLANVNTASANSGAGSASTRL-SVV 1047 T T + + S A + L + G G + R + Sbjct: 18 PSATPPTGWLKCNGAAFSAEEYPELAKAYPTNKLPDLRGEFIRGWDDGRGIDTGRSILSI 77 Query: 1048 HNQNYATSSAGAHTHSLSGTAASAGAHAHTVGIGAHTHSVAIGSHGHTI----------- 1096 + G + S T A+ + + + + T + G+ Sbjct: 78 QGYATEDHAHGLPSRSTIVTDATINFYFDEIWVNSGTDIIKRGNTNDAGLPAPDYGTFKT 137 Query: 1097 ---TVNAAGNA--ENTVKNIAFNYIVRLA 1120 +V+ G A E +NIAFNYIVR A Sbjct: 138 YKQSVDGLGAAASETRPRNIAFNYIVRAA 166 >UniRef50_Q7N5C0 Similarities with DNA inversion product n=5 Tax=Photorhabdus RepID=Q7N5C0_PHOLL Length = 239 Score = 102 bits (254), Expect = 6e-20, Method: Composition-based stats. 
Identities = 57/202 (28%), Positives = 77/202 (38%), Gaps = 18/202 (8%) Query: 828 DGGVPWNAESGAYN---VTRSGDSYILVNFYTGVGSCRTLQMKAHYRNGGLFYRSSRDGY 884 G P N + N S S ++ NF L +G + ++ Sbjct: 32 TGFPPENVSTHVLNKVLRQSSTISSVVANFIATQSGEDVL------DDGDI----AKLTV 81 Query: 885 GFEEDWAEVYTSKNLPPESYPVGAPIPWPSDTVPSGYALMQGQAFDKSAYPKLAAAYPSG 944 F + K S PVG+PIPWP PSGY G AF +S YPKLA AYP G Sbjct: 82 RFNRALDKALEQKISGISSIPVGSPIPWPLSHPPSGYFTCNGSAFSRSQYPKLAEAYPDG 141 Query: 945 VIPDMRGWTIKGKP-----ASGRAVLSQEQDGIKSHTHSASASSTDLGTKTTSSFDYGTK 999 IPD+RG I+G SGR +LS + D K + + D Sbjct: 142 RIPDLRGEFIRGWDDGRGVDSGRVILSAQTDNTKRIQLTKGLPDGQFLSSYQGPVDRYQF 201 Query: 1000 STNNTGAHTHSVSGSTNSAGAH 1021 + +V+ N+ G H Sbjct: 202 PLGRDVLESATVTSIANNTGGH 223 >UniRef50_C6CG98 Tail Collar domain protein n=1 Tax=Dickeya zeae Ech1591 RepID=C6CG98_DICZE Length = 196 Score = 102 bits (253), Expect = 7e-20, Method: Composition-based stats. Identities = 29/89 (32%), Positives = 43/89 (48%), Gaps = 5/89 (5%) Query: 887 EEDWAEVYTSKNLPPESYPVGAPIPWPSDTVPSGYALMQGQAFDKSAYPKLAAAYPSGVI 946 E+ +A + +G P PWP P G+ GQ FD + YP+LA YP+G + Sbjct: 40 EQQYAPDIYPASTDGLKELIGIPQPWPLAEAPEGWLKCNGQTFDTAKYPQLAKLYPAGTL 99 Query: 947 PDMRGWTIKGKP-----ASGRAVLSQEQD 970 PD+RG I+G + R +LS + Sbjct: 100 PDLRGEFIRGWDDERGVDTDRKLLSAQAG 128 >UniRef50_C9XHA4 Phage variable tail-fibre protein n=1 Tax=Salmonella enterica subsp. enterica serovar Typhimurium str. D23580 RepID=C9XHA4_SALTD Length = 697 Score = 102 bits (252), Expect = 1e-19, Method: Composition-based stats. 
Identities = 77/329 (23%), Positives = 123/329 (37%), Gaps = 19/329 (5%) Query: 352 ARSASAAKTSETNAKASETSAESSKTAAASSASSAASSASSASASKDEATRQASAAKSSA 411 A + K+ + K + S+A S S A+ +A + ++ Sbjct: 148 ATQDYVDDRLAEHEKSRRHPDATLKEKGFTQLSNATDSESETLAATPKAVKTVYDLANAK 207 Query: 412 TTASTKATEAAGSATAAAQSKSTAESAATRAETAAKRAEDIASAVALEDASTTKKGIVQL 471 TA T G + + S +E+ A + + + +DA+T +KGI+QL Sbjct: 208 YTAQDATTTRKGIVQLSNATDSVSETLAATPKAVKVAYDLANAKYTAQDATTARKGIIQL 267 Query: 472 SSATNSTSETLAATPKAVKSAYDNAEKRLQKDQNGADIPDKGCFLNNINAVSKTDFADKR 531 S+AT+STSETLAATPKAVK+A DNA RL K+ NG DIPDK F I AV+ T+ Sbjct: 268 SNATDSTSETLAATPKAVKTAMDNANGRLAKNSNGGDIPDKKQFARTIGAVTSTNITFND 327 Query: 532 GMRYVRVNAPAGATSGKYYPVVVMRSAG-SVSELASRVIITTATRTAGDPMNNCEFNGFV 590 + ++ + + + AG + I R + Sbjct: 328 ASGWYKIATVVMPQATSTAVIKLYGGAGFNAGSPEQAAISELVLRAGN------GSPVGI 381 Query: 591 MPGGWTDRGRYAYGMFWQYQNNERAIHSIMM--------SNKGDDLRSVFYVDGAAFPVF 642 W R A + N + I + D V + P + Sbjct: 382 TATLW--RRSPAAANEVAWVNTSGDTYDIYINIGQYAYWLIAQYDYTGNANVTLHSTPEY 439 Query: 643 AFIEDGLSISAPGADLVVNDTTYKFGATN 671 + ++ G S G + + K A + Sbjct: 440 SSVQPGNS--TSGQTYTIYSSLMKPTAGD 466 >UniRef50_D1RZD4 Putative uncharacterized protein n=1 Tax=Serratia odorifera 4Rx13 RepID=D1RZD4_SEROD Length = 759 Score = 101 bits (251), Expect = 1e-19, Method: Composition-based stats. 
Identities = 54/159 (33%), Positives = 69/159 (43%) Query: 1 MAVKISGVLKDGTGKPVQNCTIQLKAKRNSTTVVVNTLASENPDEAGRYSMDVEYGQYSV 60 MAV ISG L G P TI L A + S+ VV +S G YS+ VE G ++V Sbjct: 1 MAVLISGKLIGPNGDPRPGVTIMLTAVKTSSAVVHLAPSSSTTGADGSYSLSVEVGTHNV 60 Query: 61 ILLVEGFPPSHAGTITVYEDSQPGTLNDFLGAMTEDDARPEALRRFELMVEEVARNASAV 120 ++ G P G ITVY DS+PGTLNDFL + +D+ P + + M A A Sbjct: 61 MIEAYGRPFEKVGQITVYSDSKPGTLNDFLTSPGQDELTPAIVAIVDDMRAAAAVYAQQA 120 Query: 121 AQNTAAAKKSASDASTSAREAATHAADAADSARAASTSA 159 + AK SA A T A T Sbjct: 121 REARDDAKASADAAQAGIDAYPTLTKAQEAVNSGAETRE 159 >UniRef50_Q06852 Cell surface glycoprotein 1 n=8 Tax=cellular organisms RepID=SLAP1_CLOTH Length = 2313 Score = 100 bits (249), Expect = 3e-19, Method: Composition-based stats. Identities = 31/380 (8%), Positives = 83/380 (21%), Gaps = 5/380 (1%) Query: 74 TITVYEDSQPGTLNDFLGAMTEDDARPEALRRFELMVEEVARNASAVAQNTAAAKKSASD 133 D + E+ + +E + + + Sbjct: 1637 DEPTPSDEPTPSDEPTPSETPEEPIPTDTPSDEPTPSDEPTPSDEPTPSDEPTPSDEPTP 1696 Query: 134 ASTSAREAATHAADAADSARAASTSAGQAASSAQSASSSAGTASTKATEASKSAAAAESS 193 + T T + T + + S + S T S E + ++ Sbjct: 1697 SETPEEPIPTDTPSDEPTPSDEPTPSDEPTPSDEPTPSDEPTPSETPEEPIPTDTPSDEP 1756 Query: 194 KSAAATSAGAAKTSETNASAS-----LQSAATSASTATTKASEAATSARDAAASKEAAKS 248 + + T + S ++ T T + + + Sbjct: 1757 TPSDEPTPSDEPTPSDEPTPSDEPTPSETPEEPIPTDTPSDEPTPSDEPTPSDEPTPSDE 1816 Query: 249 SETNASSSASSAASSATAAGNSAKAAKTSETNARSSETAAGQSASAAAGSKTAAASSASA 308 + + S + S+ S E + + + Sbjct: 1817 PTPSDEPTPSETPEEPIPTDTPSDEPTPSDEPTPSDEPTPSDEPTPSDEPTPSETPEEPI 1876 Query: 309 ASTSAGQASASATAAGKSAESAASSASTATTKAGEATEQASAAARSASAAKTSETNAKAS 368 + + + S E S T + + + + + + ++ Sbjct: 1877 PTDTPSDEPTPSDEPTPSDEPTPSDEPTPSDEPTPSETPEEPIPTDTPSDEPTPSDEPTP 1936 Query: 369 ETSAESSKTAAASSASSAASSASSASASKDEATRQASAAKSSATTASTKATEAAGSATAA 428 S S + + + + T + + + T + E S Sbjct: 1937 SDEPTPSDEPTPSDEPTPSDEPTPSDEPTPSETPEEPIPTDTPSDEPTPSDEPTPSDEPT 1996 Query: 429 AQSKSTAESAATRAETAAKR 448 + T T ++ Sbjct: 1997 PSDEPTPSDEPTPSDEPTPS 2016 >UniRef50_C6C6Z0 Tail Collar 
domain protein n=1 Tax=Dickeya dadantii Ech703 RepID=C6C6Z0_DICDC Length = 183 Score = 99.5 bits (245), Expect = 6e-19, Method: Composition-based stats. Identities = 28/70 (40%), Positives = 38/70 (54%), Gaps = 5/70 (7%) Query: 906 VGAPIPWPSDTVPSGYALMQGQAFDKSAYPKLAAAYPSGVIPDMRGWTIKGKPAS----- 960 +G P PWP P G+ GQAFD + YP+LA YPSG +PD+RG I+G Sbjct: 46 IGIPQPWPLADAPEGWLKCNGQAFDTAKYPELAKCYPSGTLPDLRGEFIRGWDDGRGVDT 105 Query: 961 GRAVLSQEQD 970 R ++S + Sbjct: 106 SRELVSAQSG 115 >UniRef50_B5Q8N4 Tail protein n=6 Tax=root RepID=B5Q8N4_SALVI Length = 718 Score = 99.1 bits (244), Expect = 1e-18, Method: Composition-based stats. Identities = 74/391 (18%), Positives = 122/391 (31%), Gaps = 99/391 (25%) Query: 436 ESAATRAETAAKRAEDIASAVALEDASTTKKGIVQLSSATNSTSETLAATPKAVKSAYDN 495 + A+ + DA+T +KG VQLSS TNS SE+LAATPKAVK+A DN Sbjct: 275 PKGTLNEQQASDALRKHEQSRNHPDATTREKGFVQLSSDTNSESESLAATPKAVKTAMDN 334 Query: 496 AEKRLQKDQNGADIPDKGCFLNNINAVSKTDFADKRGMRYVRVNAPA-----GATSGKYY 550 A RL+K+ NG+DIPDK F+ I A D A G + + Sbjct: 335 ANGRLEKNSNGSDIPDKDLFVRRIGAARAFDGAVIIGGDDNPWTTAEFIVWLESQGAFNH 394 Query: 551 PVVVMRSA---------------------------GSVSELASRVIITTATRTAGDPMNN 583 P + R G + RV TT T G Sbjct: 395 PYWMCRGTWSYALNKVITDTGCGNICLAGAVIEVMGVRGAMTIRVTTTTTTSGYGIASAQ 454 Query: 584 CEFNGFVMPGGWTDRGRYAYGMFWQYQNNERAIHSIMMSNKGDDLRS------------- 630 + D + + N A + G + Sbjct: 455 FTYINN------GDGYSPGWRRDFNTINKPTAGDVGALPITGGRINGALGIGTDNALGGN 508 Query: 631 ---------------------------VFYVDGAAFPVF--------AFIEDGLSISAPG 655 V Y+D + + +G +++ Sbjct: 509 SIVFGDNDTGFKWHSDGVLGIYANNAQVGYIDNSGLHMLADIRATGVVRAGNGKTLTLSS 568 Query: 656 -ADLVVNDTTYKFGATNPATECIAADVILDFKSGRGFYESHSLIVNDNLS--CKKLFATD 712 + +N +G T +++ +G++ + ++S + + Sbjct: 569 GNNSALNAGLSLWGGGERPT-------VIELSDEQGWHLYSQRNTDGSISFTVNGIVYCN 621 Query: 713 EIVARGGNQIRMIGGE-YGALWRNDGAKTYL 742 + G I G+ +G++W N T++ Sbjct: 622 ALNIGGA--IYQNNGDIFGSVWGNGWLSTWI 650 >UniRef50_B2ZY49 Phage tail collar domain protein n=1 Tax=Ralstonia phage RSL1 RepID=B2ZY49_9CAUD Length = 498 
Score = 98.7 bits (243), Expect = 1e-18, Method: Composition-based stats. Identities = 81/389 (20%), Positives = 128/389 (32%), Gaps = 87/389 (22%) Query: 759 PFAIDNATGELVIGTKLSASLNGNALTATKLQTP-RRVSGVEFDGS-------KDITLTA 810 P A EL + S NA + + TP RR G + + + Sbjct: 167 PLTPLLAIHELSAYGEASLLEQNNASSWANVGTPYRRYLGSRAGNLNATTFPVVNASTSW 226 Query: 811 AHVAAFARRATDTYADADGGVPWNAESGAYNVTRSGDSYILVNFY---TGVGSCRTLQM- 866 +AA D G + A+ +V S F T G+ + + Sbjct: 227 VEIAASRLNTQDLTVGNRGFLLQTADGYFRSVNNVQTSGSNYRFNLNVTNDGTYNNVPLP 286 Query: 867 -KAHYRNGGLFYRSSRDGYGFEEDWAEVYTSKNLPPESYPVGAPIPWPSDTVPSGYALMQ 925 + + YR +G Y PP+ P G +P+ T+P+GY Sbjct: 287 NTPQVGSNVILYRCDVEGLNL------FYDQILNPPQLVPPGTILPFAGTTIPAGYLACN 340 Query: 926 GQAFDKSAYPKLAA----AYPSG------VIPDMRGWTIKGKPAS-----GRAVLSQEQD 970 A ++ + L + Y G +PD+RG ++G GR + + D Sbjct: 341 AAAISRTGFASLYSVIGTTYGVGNGSTTFNLPDLRGVFVRGWDNGRGQDPGRVFGTYQGD 400 Query: 971 GIKSHTHSASASSTDLGTKTTSSFDYGTKSTNNTGAHTHSVS-GSTNSAGAHTHSLANVN 1029 +SH H+ S G H+H+ + G+ +G T + Sbjct: 401 AFRSHNHAVSDPGHAHGVY--------------DPGHSHTWTLGTLRQSGGDT----SCY 442 Query: 1030 TASANSGAGSASTRLSVVHNQNYATSSAGAHTHSLSGTAASAGAHAHTVGIGAHTHSVAI 1089 SA G G + T A+ G GIG + + I Sbjct: 443 VPSARYGGGEF----------------------QFTETTAAVGT-----GIGIYGNVTGI 475 Query: 1090 GSHGHTITVNAAGNAENTVKNIAFNYIVR 1118 G+ VN G AE T KN+A NYI++ Sbjct: 476 GT-----LVN--GGAETTPKNVAMNYIIK 497 >UniRef50_C6CGA4 Tail Collar domain protein n=7 Tax=Enterobacteriaceae RepID=C6CGA4_DICZE Length = 401 Score = 98.7 bits (243), Expect = 1e-18, Method: Composition-based stats. 
Identities = 63/300 (21%), Positives = 93/300 (31%), Gaps = 73/300 (24%) Query: 828 DGGVPWNAESGAYNVTRSGDSYILVNFYTGVGSCRTLQMKAHYRNGGLFYRSSRDGYGFE 887 + V Y S + T S + Q+ ++G Sbjct: 168 NAHVAAANPHPQYAPLSSPALTGVPTAPTAANSANSTQLATTAFVKNTALLKEQNGADIA 227 Query: 888 EDWAEVYTSKNLPP--ESYPVGAPIPWPSDTVPSGYALMQGQAFDKSAYPKLAAAYPSGV 945 A + + VG P+PWP T P+G+ GQAFDK+A+PKLA AYP GV Sbjct: 228 NKSAFLANLGLSDTLKIADIVGIPLPWPQATPPAGWLKCNGQAFDKNAFPKLAQAYPGGV 287 Query: 946 IPDMRGWTIKGKPAS-----GRAVLSQEQDGIKSHTHSASASSTDLGTKTTSSFDYGTKS 1000 +PD+RG I+G R +LS ++ + Sbjct: 288 LPDLRGEFIRGWDDGRGVDVARELLSWQKGTL---------------------------- 319 Query: 1001 TNNTGAHTHSVSGSTNSAGAHTHSLANVNTASANSGAGSASTRLSVVHNQNYATSSAGAH 1060 T + S + GA H+ + + G + + + GA Sbjct: 320 TISDPNL------SAVNVGALIHANNDSANTYKSMGFDIVNKSDYAMLRAAINVETVGAQ 373 Query: 1061 THSLSGTAASAGAHAHTVGIGAHTHSVAIGSHGHTITVNAAGNAENTVKNIAFNYIVRLA 1120 +G GA +NIAFNYIVR A Sbjct: 374 DLDSNGWQFGYGA--------------------------------TRPRNIAFNYIVRAA 401 >UniRef50_UPI000190EC42 bacteriophage tail fiber protein n=1 Tax=Salmonella enterica subsp. enterica serovar Typhi str. E01-6750 RepID=UPI000190EC42 Length = 317 Score = 96.4 bits (237), Expect = 7e-18, Method: Composition-based stats. 
Identities = 44/160 (27%), Positives = 63/160 (39%), Gaps = 5/160 (3%) Query: 881 RDGYGFEEDWAEVYTSKNLPPESYPVGAPIPWPSDTVPSGYALMQGQAFDKSAYPKLAAA 940 ++G E V + PVG P+PWPS T+P G+ G AF YPKLA A Sbjct: 148 QNGADIPEPALFVKNLGLGEGSALPVGVPVPWPSATLPEGWLKCNGAAFSSEMYPKLAKA 207 Query: 941 YPSGVIPDMRGWTIKGKPAS-----GRAVLSQEQDGIKSHTHSASASSTDLGTKTTSSFD 995 YP+ +PD+RG I+G GR +LS ++ I S + T F Sbjct: 208 YPTNKLPDLRGEFIRGWDDGRGIDAGREILSFQEGTIVSGFDDNDTGDISSLSSTQYGFG 267 Query: 996 YGTKSTNNTGAHTHSVSGSTNSAGAHTHSLANVNTASANS 1035 S + +S GA + +A + Sbjct: 268 DTLSSNQWGAINGKKWIFDASSKGAQKYDWWAYVSARPRN 307 >UniRef50_Q6D3Y6 Probable bacteriophage variable tail fiber protein H n=2 Tax=Pectobacterium atrosepticum RepID=Q6D3Y6_ERWCT Length = 536 Score = 96.0 bits (236), Expect = 9e-18, Method: Composition-based stats. Identities = 62/260 (23%), Positives = 91/260 (35%), Gaps = 23/260 (8%) Query: 753 GWNTLRPFAIDNATGELVIGTKLSASLNGNALTATKLQTPRRVSGVEFDGSKDITLTAAH 812 +TL+ A A + + P + + G+ + Sbjct: 226 TLDTLKKLAAAVGNNPDFHSMIGDAIDSKLSKAQNGADIPDKNQFISNLGAVSLATFQII 285 Query: 813 VAAFARRATDTYADADGGVPWNAESGAYNVTRSGDSYILVNFYTGVGSCRTLQMKAHYRN 872 ++ +DA P AY + S V F + +R+ Sbjct: 286 TG---KKLLSADSDALTIKPVVDGQRAYVQFNNAASTEAVAFVGLRYGANEQALTFEHRS 342 Query: 873 GGLFYRSSRDGYGFEE------------DWAEVYTSKNLPPESYPVGAPIPWPSDTVPSG 920 G R F T + G P PWP T P+G Sbjct: 343 GA-SIRIDNGAINFYGLPHGTTAPTGTNTTQLASTEFVQNELALTAGMPKPWPRATAPAG 401 Query: 921 YALMQGQAFDKSAYPKLAAAYPSGVIPDMRGWTIKGKP-----ASGRAVLSQEQDGIKSH 975 + GQ+FD SA+P LAAAYPSGV+PD+RG I+G SGR++LS + D I++ Sbjct: 402 WLKCNGQSFDISAFPHLAAAYPSGVLPDLRGEFIRGWDDGRGVDSGRSLLSAQSDAIRNI 461 Query: 976 THSA--SASSTDLGTKTTSS 993 SA S +T SS Sbjct: 462 VGEIWTSAVSQQFLGETLSS 481 Score = 50.1 bits (117), Expect = 4e-04, Method: Composition-based stats. 
Identities = 32/142 (22%), Positives = 45/142 (31%), Gaps = 19/142 (13%) Query: 997 GTKSTNNTGAHTHSVSGSTNSAGAHTH------SLANVNTASANSGAGSASTRL--SVVH 1048 T + S S A + L + G G S R S Sbjct: 396 ATAPAGWLKCNGQSFDISAFPHLAAAYPSGVLPDLRGEFIRGWDDGRGVDSGRSLLSAQS 455 Query: 1049 NQNYATSSAGAHT----HSLSGTAASAGAHA--HTVGIGAHTHSVAIGSHGHTITVNAAG 1102 + + L T +S+G + +GA A S + +A+ Sbjct: 456 DAIRNIVGEIWTSAVSQQFLGETLSSSGVFELLYEFAVGA-IPDAAGNSCPSRMRFDASR 514 Query: 1103 N----AENTVKNIAFNYIVRLA 1120 AEN +NIAFNYIVR A Sbjct: 515 AVPTAAENRPRNIAFNYIVRAA 536 >UniRef50_Q7N047 Similarities with bacteriophage protein n=3 Tax=Photorhabdus RepID=Q7N047_PHOLL Length = 602 Score = 95.6 bits (235), Expect = 1e-17, Method: Composition-based stats. Identities = 54/252 (21%), Positives = 88/252 (34%), Gaps = 40/252 (15%) Query: 763 DNATGELVIGTKLSASLNGNALTAT------------KLQTPRRVSGVEFDGSKDITLTA 810 +NA L + N N + RR++G S DI+L A Sbjct: 327 ENADSRLAKNQNGADIPNKNEFVKNIGLVETVEQAMNAVPNDRRINGKSL--SSDISLMA 384 Query: 811 AHVAAFARRATDTYADADGGVPWNAESGAYNVTRSGDSYILVNFYTGVGSCRTLQMKAHY 870 + V A+ + T + + + G Q+ Sbjct: 385 SDVGAYHQTTYMTELPGRHYSGPFSCG----RENGWAKGVSIGVGGDTG-----QIWID- 434 Query: 871 RNGGLFYRSSRDGYGFEEDWAEVYTSKNLPPESYPVGAPIPWPSDTV-PSGYALMQGQAF 929 + L R E+ P+GA I W S P+GY +G+AF Sbjct: 435 ADAQLHTRFLNANGAVEQ----------KITVGVPIGATIEWHSTAPIPAGYEPNEGRAF 484 Query: 930 DKSAYPKLAAAYPSGVIPDMRGWTIKGKP-----ASGRAVLSQEQDGIKSHTHSASASST 984 + YP+LA +P +PD RG +G SGR++ S + D I++ T S + Sbjct: 485 RAADYPELAKIFPDLKLPDDRGLFKRGLDRGRGLDSGRSLGSVQGDAIRNITGSLGKPTI 544 Query: 985 DLGTKTTSSFDY 996 + G+ + +F Y Sbjct: 545 ESGSNASGAFSY 556 >UniRef50_P18771 Large tail fiber protein p34 n=5 Tax=T4-like viruses RepID=VG34_BPT4 Length = 1289 Score = 95.6 bits (235), Expect = 1e-17, Method: Composition-based stats. 
Identities = 118/877 (13%), Positives = 234/877 (26%), Gaps = 77/877 (8%) Query: 86 LNDFLGAMTEDDARPEALRRFELMVEEVARNASAVAQNTAAAKKSASDASTSAREAATHA 145 + D G + R + + + +A T A + S A T Sbjct: 412 IEDSDGKYWVVQQNVPTVERVDSLNDSTRARLGVIALATQAQANVDLENSPQKELAITPE 471 Query: 146 ADAADSARAASTSAGQAASSAQSASSSAGTASTKATEASKSAAAAESSKSAAATSAGAAK 205 A +A + A++AQ ++ + + + ++ A A Sbjct: 472 TLANRTATETRRGIARIATTAQVNQNTTFSFADDII-ITPKKLNERTATETRRGVAEIAT 530 Query: 206 TSETNASASLQS----AATSASTATTKASEAATSARDAAASKEAAKSSETNASSSASSAA 261 ETNA + A + S T A A+ +++ + ++ Sbjct: 531 QQETNAGTDDTTIITPKKLQARQGSESLSGIVTFVSTAGATPASSRELNGTNVYNKNTDN 590 Query: 262 SSATAAGNSAKAAKTSETNARSSETAAGQSASAAAGSKTAA--ASSASAASTSAGQASAS 319 + A ++ A + A + A TS Sbjct: 591 LVVSPKALDQYKATPTQQGAVILAVESEVIAGQSQQGWANAVVTPETLHKKTSTDGRIGL 650 Query: 320 ATAAGKSAESAASSASTATTKAGEATEQASAAARSASAAKTSETNAKASETSAESSKTAA 379 A +S + + + A T +A+ + + T + + S+ Sbjct: 651 IEIATQSEVNTGTDYTRAVTPKTLNDRRATESLSGIAEIATQVEFDAGVDDTRISTPLKI 710 Query: 380 ASSASSAASSASSASASKDEATRQASAAKSSATTASTKATEAAGSATAAAQSKSTAESAA 439 + +S ++ A + E+ + A+ AT + T ++ Sbjct: 711 KTRFNSTDRTSVVALSGLVESGTLWDHYTLNILEANETQRGTLRVATQVEAAAGTLDNVL 770 Query: 440 TRAETAAKRAEDIASAVALEDASTTKKGIVQLSSATNSTSETLAATPKAVKSAYDNAEKR 499 + A ++ A+ ++ ++ S A A + Sbjct: 771 ITPKKLLGTKSTEAQEGVIKVATQSETVTGTSANTAVSPKNLKWIAQSEPTWAATTAIRG 830 Query: 500 LQKDQNGADIPDKGCFLNNINAVSKTDFADKRGMRYVRVNAPAGATSGKYYPVVVMRSAG 559 K +G+ + + + + Y A K G Sbjct: 831 FVKTSSGSITFVGNDTVGSTQDLELYEKNSYAVSPYELNRVLANYLPLKAKAADTNLLDG 890 Query: 560 -SVSELASRVIITTATRTAGDPMNNCEFNGFVMPGGWTDRGRYAYGMFWQYQNNERAIHS 618 S+ R I T + V G A + +N Sbjct: 891 LDSSQFIRRDIAQTVNGSLTLTQQTNLSAPLVSSSTGEFGGSLAANRTFTIRNT------ 944 Query: 619 IMMSNKGDDLRSVFYVDGAAFPVFAFIEDGLSISAPGADLVVNDTTYKFGATNPATECIA 678 S+ + G A GA+ + + +G Sbjct: 945 -------GAPTSIVFEKGPA---------------SGANPAQSMSIRVWGNQFGGGSDTT 982 Query: 679 ADVILDFKSGRGFYESHSLIVNDNLS--------------------------CKKLFATD 712 + + + + N++ + + A Sbjct: 983 
RSTVFEVGDDTSHHFYSQRNKDGNIAFNINGTVMPININASGLMNVNGTATFGRSVTANG 1042 Query: 713 EIVARGGNQIRMIGGEYGALWRNDGAKTYLLLTNQGDVYGGWNTLRPFAIDNATGELVIG 772 E +++ N R I G+YG RND + TY LLT GD GG+N LRP I+N +G++ IG Sbjct: 1043 EFISKSANAFRAINGDYGFFIRNDASNTYFLLTAAGDQTGGFNGLRPLLINNQSGQITIG 1102 Query: 773 TKLSASLNGNALTATKLQTPRRVSGVEFDGSKDITLTAAHVAAFARRATDTYADADGGVP 832 L + + R S + F + A + Sbjct: 1103 EGLIIAKGVTINSGGLTVNSRIRSQGTKTSDLYTRAPTSDTVGFWSIDINDSATYNQFPG 1162 Query: 833 W-------NAESGAYNVTRS----GDSYILVNFYTGVGSCRTLQMKAHYRNGG--LFYRS 879 + N +G + R + T + + R+ Sbjct: 1163 YFKMVEKTNEVTGLPYLERGEEVKSPGTLTQFGNTLDSLYQDWITYPTTPEARTTRWTRT 1222 Query: 880 SRDGYGFEEDWAEVYTSKNLPPESYPVGAPIPWPSDT 916 + + +V+ N P S +GA +P + T Sbjct: 1223 WQKTKNSWSSFVQVFDGGNPPQPS-DIGA-LPSDNAT 1257 >UniRef50_C5J9F2 Putative uncharacterized protein (Fragment) n=1 Tax=Erwinia phage phiAT1 RepID=C5J9F2_9VIRU Length = 240 Score = 94.8 bits (233), Expect = 2e-17, Method: Composition-based stats. Identities = 34/93 (36%), Positives = 47/93 (50%), Gaps = 8/93 (8%) Query: 898 NLPPESYPVGAPIPWPSDTVPSGYALMQGQAFDKSAYPKLAAAYPSGVIPDMRGWTIKGK 957 +L P P+GA IPWP TVP G+ GQ F+ PKL + V+PD RG ++G Sbjct: 147 DLEPRLVPIGAVIPWPGATVPDGWLECSGQVFNTGQNPKLYSVLGRNVVPDYRGLFLRGW 206 Query: 958 --------PASGRAVLSQEQDGIKSHTHSASAS 982 P +GRA+ S + D I++ T A Sbjct: 207 AHGSDANDPDAGRALGSVQGDAIRNITGYFPAD 239 >UniRef50_Q9MCR6 Tail fiber n=1 Tax=Enterobacteria phage HK97 RepID=Q9MCR6_BPHK7 Length = 321 Score = 94.1 bits (231), Expect = 3e-17, Method: Composition-based stats. 
Identities = 40/152 (26%), Positives = 62/152 (40%), Gaps = 6/152 (3%) Query: 842 VTRSGDSYILVNFYTGVGSCRTLQMKAHYRNGGLFYRSSRDGYGFEEDWAEVYTSKNLPP 901 + S + I V + + K + + G G Sbjct: 106 ASTSANQSITVTINGTAVTIPGIG-KLAQKGSNGAVTVADGGTGATNAADARTNLGLGEG 164 Query: 902 ESYPVGAPIPWPSDTVPSGYALMQGQAFDKSAYPKLAAAYPSGVIPDMRGWTIKGKPAS- 960 + PVG P+PWPS T P+G+ G F YP+LA AYP+ +PD+RG I+G Sbjct: 165 SALPVGVPVPWPSATPPTGWLKCNGAVFSAEEYPELAKAYPTNKLPDLRGEFIRGWDDGR 224 Query: 961 ----GRAVLSQEQDGIKSHTHSASASSTDLGT 988 GR +LS + D I++ T + T++ Sbjct: 225 GIDAGREILSAQGDAIRNITGTFGDGETEVNA 256 >UniRef50_Q0BEK5 Phage Tail Collar domain protein n=1 Tax=Burkholderia ambifaria AMMD RepID=Q0BEK5_BURCM Length = 735 Score = 93.7 bits (230), Expect = 4e-17, Method: Composition-based stats. Identities = 95/628 (15%), Positives = 173/628 (27%), Gaps = 107/628 (17%) Query: 544 ATSGKYYPVVVMRSAGSVSELASRVIITTATRTAGDPMNNCEF----NGFVMPGGWTDRG 599 G++ + + V + R++G N G + Sbjct: 162 TPEGRWLIGGAGDNGQDTVRVQGSVGVIGGIRSSGYDANGAGGQFRAVGTDYGVMLRNDN 221 Query: 600 RYAYGMFWQYQNNERAIHS---IMMSNKGDDLRSVFYVDGAAFPVFAFIEDGLSISAPGA 656 + A+ + + + S +R G F +A + + G Sbjct: 222 KSAWLLSTNKGDPNGTYNDYRPFAWSLDTGAVRIDGTGCGTIFGGYARFAGNVESAGVGY 281 Query: 657 DLVVNDTTYKFGATNPATECIAADVILDFKSG-RGFYESHSLIVNDNLSCKKLFATDEIV 715 +G+ D++L + D++S + A D+ Sbjct: 282 FGGGAGKAKDYGSGVTLGANTGGDIVLKAAGADKDMKLWDIQSNTDSMS---IRAVDDDW 338 Query: 716 ARGGNQIRMIGGEYGALWRNDGAKTYLLLTNQGDVYG---------GWNTLRPFAIDNAT 766 G +R+ + KT L+ N G V + N++ Sbjct: 339 TIGIPAVRITRANL-----STSIKTLELVPNTGRVLSAGAWDDGRSSFQNKGSLKSSNSS 393 Query: 767 GELVIGTKLSASLNGNALTATKLQTPRRVSGVEFDGSKDITLTAAHVAAFARRATDTYAD 826 G LV A LT T ++ V +FA R + Sbjct: 394 GALVASNGGGAGQTSIQLTREGAPTDQKTWEV----------IQGADGSFAVRTVNDTYT 443 Query: 827 ADGGVPWNAESGAYNVTRSGDSYILVNFYTGVGSCRTLQMKAHYRNGGLFYRSSRDGYGF 886 Y + + VG G + S G Sbjct: 444 NSQAAINVTRGSTYAL--GTLQLMPQGGRVVVGKVGDDGSTQLQVGGMVTAVSPPAGDNS 501 Query: 887 EEDWAEVYTSKNLPPESYPVGAPIPWPSDTVPSGYALMQGQAFDKSAYPKLAAAYPSG-- 944 + + +G + +G+ + G ++ YP L AY G Sbjct: 
502 NKLITSAW--FAAAVADVQIGQIVWEARTAPRAGFLKLNGTELKRADYP-LLWAYAQGSG 558 Query: 945 -------------------------VIPDMRGWTIKGKPAS-----GRAVLSQEQDGIKS 974 +PD+RG I+ + R + S + + Sbjct: 559 ALVADADWGKGRHGCFSSGDGNTTFRLPDLRGEFIRCWDDARGTDAQRQIGSWQDSLNRL 618 Query: 975 HTHSASASSTDLGTKTTSSFDYGTKSTNNTGAHTHSVSGSTNSAGAHTHSLANVNTASAN 1034 H H ASA++ + T++ G H HS++ H + Sbjct: 619 HAHGASAAAVGDHSHG--------AWTDSQGWHGHSIN-------DPGHDHGIPVASGGG 663 Query: 1035 SGAGSASTRLSVVHNQNYATSSAGAHTHSLSGTAASAGAHAHTVGIG---AHTHSVAIGS 1091 L+ + T+ +G + GAH H VGIG AH+H+++IG Sbjct: 664 YIGEI---NLNGGGRGDKRTTGSGT-----GISINGDGAHGHNVGIGGAGAHSHTISIG- 714 Query: 1092 HGHTITVNAAGNAENTVKNIAFNYIVRL 1119 A G E+ +N+A ++R Sbjct: 715 --------ADGGNESRPRNVALLVMIRA 734 >UniRef50_UPI0001A44C27 bacteriophage tail fiber protein n=2 Tax=Pectobacterium carotovorum subsp. carotovorum WPP14 RepID=UPI0001A44C27 Length = 195 Score = 93.3 bits (229), Expect = 5e-17, Method: Composition-based stats. Identities = 38/128 (29%), Positives = 51/128 (39%), Gaps = 12/128 (9%) Query: 866 MKAHYRNGGLFYRSSRDGYGFEEDWAEVYTSKNLPPESYPVGAPIPWPSDTVPSGYALMQ 925 + NGG + R + + VG P P P T P G+ Sbjct: 55 IYVDEGNGGRWRREFNTENLTPSSIGAI-------QGNELVGIPQPCPLVTAPEGWLACA 107 Query: 926 GQAFDKSAYPKLAAAYPSGVIPDMRGWTIKGKP-----ASGRAVLSQEQDGIKSHTHSAS 980 GQ+FD S YP LA+ YP G +PD+RG I+G +GR LS + + HTH Sbjct: 108 GQSFDTSRYPVLASRYPQGRLPDLRGEFIRGWDNGRGVDTGRGNLSSQSFSTEPHTHDGG 167 Query: 981 ASSTDLGT 988 G Sbjct: 168 TLGLGSGA 175 >UniRef50_C6C5D4 Tail Collar domain protein n=1 Tax=Dickeya dadantii Ech703 RepID=C6C5D4_DICDC Length = 557 Score = 92.9 bits (228), Expect = 6e-17, Method: Composition-based stats. 
Identities = 55/242 (22%), Positives = 86/242 (35%), Gaps = 24/242 (9%) Query: 765 ATGELVIGTKLSASLNGNALTATKLQTPRRVSGVEFDGSKDITLTAAHVAAFARRATDT- 823 + + ++ A + K+ TP V+G + A ++ + Sbjct: 291 GGNSVALIDSGDITIAPKAGQSLKITTPVTVNGDGNQILLQQSTKAPSSGSYIKGLNSDG 350 Query: 824 ----YADADGGVPWNAESGAYNVTRSGDSYILVNFYTGVGSCRTLQMKAHYRNGGLFYRS 879 W G + + V G G + A Sbjct: 351 VFLWTVGRSESNGWLELGGKGSNRIALADSGDVTIAPGTGKTTKITSTATLTAPAAGD-- 408 Query: 880 SRDGYGFEEDWAEVYTSKNLPPESYPVGAPIPWPSDTVPSGYALMQGQAFDKSAYPKLAA 939 W + G P+PWP T P+G+ GQ+FDK+ YPKL A Sbjct: 409 -NSTSAITSGW----------FAAEIAGIPLPWPQATAPTGWLKCNGQSFDKTLYPKLTA 457 Query: 940 AYPSGVIPDMRGWTIKGKP-----ASGRAVLSQEQDGI-KSHTHSASASSTDLGTKTTSS 993 AYPSG +PD+RG I+G SGRAVLS + + + S +A++T S+ Sbjct: 458 AYPSGTLPDLRGEFIRGWDDGRGVDSGRAVLSVQDATWIQPNIESNTAATTIRIDNVDST 517 Query: 994 FD 995 F+ Sbjct: 518 FN 519 >UniRef50_A7ZN97 Tail fiber family protein n=2 Tax=Escherichia coli RepID=A7ZN97_ECO24 Length = 336 Score = 92.9 bits (228), Expect = 7e-17, Method: Composition-based stats. Identities = 38/108 (35%), Positives = 54/108 (50%), Gaps = 6/108 (5%) Query: 1 MAVKISGVLKDGTGKPVQNCTIQLKAKRNSTTVVVNTLASENPDEAGRYSMDVEYGQYSV 60 MA+ ISGV +G G+PV + L A+ S+ VV+ T+ + AG Y D+ G Y V Sbjct: 1 MAI-ISGVYANGVGEPVVGVQLVLTARVTSSRVVMTTVVEQETGAAGEYKFDMNPGVYVV 59 Query: 61 ILLVEGFPPSHAGTITVYEDSQPGTLNDFLGAMTEDDARPEALRRFEL 108 ++ G I V DS GTLND+L + D+ P AL + Sbjct: 60 TA-----SAAYLGVINVNPDSVDGTLNDYLTNFSADELTPAALAEIQE 102 >UniRef50_C5A8Q3 Phage-related tail fiber protein n=1 Tax=Burkholderia glumae BGR1 RepID=C5A8Q3_BURGB Length = 865 Score = 92.5 bits (227), Expect = 7e-17, Method: Composition-based stats. 
Identities = 138/820 (16%), Positives = 224/820 (27%), Gaps = 146/820 (17%) Query: 391 SSASASKDEATRQASAAKSSATTASTKATEAAGSATAAAQSKSTAESAATRAETAAKRAE 450 A + A + + A G T +T T + A Sbjct: 100 QDAPIMEKSPAAILLLAADAVFSTIDAAQLEFGPTTFLNPPATTERQGVVELATVDEVAA 159 Query: 451 DIASAVALEDASTTKKGIVQLSSATNSTSETLAATPKAVKSAYDNAEKRLQKDQNGA--D 508 + A+ K+ + S + + P AV+++ D ++ Sbjct: 160 GADATRAVTPLGAAKRYMP--------ISGGIFSGPIAVQASVDTNNAQVVVSPAVGKLG 211 Query: 509 IPDKGCFLNNINAVSKTDFADKRGMRYVRVNAPAGATSGKYYPVVVMRSAGSVSELASRV 568 K F D + R + VR G +Y + R + A+ Sbjct: 212 AESKVRFSAAFGDPKSNDTS-PRVVASVRAGFNGGVWGSEYLDFHLNRVPNDARDDANMD 270 Query: 569 IITTATRTAGDPMNNCEFNGFVMPGGWTD---RGRYAYGMFWQYQNNERAIHSIMMSNKG 625 G G T G + + +A + +SN Sbjct: 271 RALRLAAGG------RAVIGATQDDGGTALQVGGSVRANNYGIGDQSSKASGLVGISNGT 324 Query: 626 DDLRSVFYVDGA--AFPVFAFIEDGLSISAPGADLVVNDTTYKFGATNPATECIAADVIL 683 + FY + + + D G T+ A + Sbjct: 325 NGPNVGFYGRESVGGGAITLSTGGKERLRLK------GDGKLLIG-TDTPDASAAVLQVA 377 Query: 684 DFKSGRGFYESHSLIVNDNLS-CKKLFATDEIVARGGNQIRMIGGEYGALWRNDGAKTYL 742 S G S+I N + + + +T + GG R G YGA RNDG+ YL Sbjct: 378 GAASVTG-----SIIANGTSAFARGVQSTG--LDDGGGSFRATNGGYGAFLRNDGSHMYL 430 Query: 743 LLTNQGDVYGGWNTLRPFAIDNATGELVIGTKLSASLNG-------NALTATKLQTPRRV 795 L T +GD G +N RPFA D +G + I + G +A Sbjct: 431 LSTKKGDPLGSFNDYRPFAWDLDSGYVTIDYNGHGASVGGALSIAISASIGRGRDQGVIW 490 Query: 796 SGVEFDGSKDI-----------------------------------TLTAAHVAAFARRA 820 G DG LT A A Sbjct: 491 LGPRNDGYLYSTADSCGWWYGAAGSVQYVFGDHSLRVDNQPVWHAGNLTPLDRNAGGWMA 550 Query: 821 TDTYADADGGV----PWNAESGAYNVTRSGDSYILVNFYTGV-GSCRTLQMKAHYRNGGL 875 +D + + L + G G Q + G Sbjct: 551 SDVHFAPGARLVLSEGSPGAPSLTFENDGAPDTGLYHVADGQFGVTCNSQPIVGFAPDGT 610 Query: 876 FYRSSR--DGYGFEEDWAEVYTS--KNLPPESYPVGAPIPWPSDTVPSGYALMQGQAFDK 931 + S+ + T+ S +G + P T +G+ G ++ Sbjct: 611 RFTSNVFGPTPAAGDRSTRFATTEWVLSALSSSSIGQIVFEPRTTTRAGFLKANGSLLER 670 Query: 932 SAYPKLAAAYPSG---------------------------VIPDMRGWTIKGKPAS---- 960 + YP L AY IP++RG ++ Sbjct: 671 
ADYPAL-WAYAQASGALISDAAWWAGQSGCFSTGTTGTNFRIPELRGEFLRCLDDGRGLD 729 Query: 961 -GRAVLSQEQDGIKSHTHSASASSTDLGTKTTSSFDYGTKSTNNTGAHTHSVSGSTNSAG 1019 RA S + H+H AS++ S +G +T G+H H++ Sbjct: 730 TSRAAGSLQLSQNAKHSHDASSTVGG-------SHTHGAFTTGA-GSHNHAIDQ---QPH 778 Query: 1020 AHTHSLANVNTASANSGAGSASTRLSVVHNQNYATSSAGAHTHSLSGTAASAGAHAHTVG 1079 AH L +V + + G G V + ++ A G H H G Sbjct: 779 AHDTWLGSVQVSGVDRGGGFGPYNGRVGEAWSDPANANIA--------ILPTGDHVHGAG 830 Query: 1080 IGAHTHSVAIGSHGHTITVNAAGNAENTVKNIAFNYIVRL 1119 + G H H I V +G E +NIA ++R Sbjct: 831 ------TYPAGDHNHAIAVQPSGGDEARPRNIALLAMIRA 864 >UniRef50_B2I5N0 Tail Collar domain protein n=13 Tax=Xylella fastidiosa RepID=B2I5N0_XYLF2 Length = 414 Score = 92.1 bits (226), Expect = 1e-16, Method: Composition-based stats. Identities = 54/248 (21%), Positives = 90/248 (36%), Gaps = 44/248 (17%) Query: 904 YPVGAPIPWPSDTVPSGYALMQGQAFDKSAYPKLAA----AYPSG------VIPDMRGWT 953 Y G + G L G+A ++ YP+L +Y +G IP+ T Sbjct: 136 YEPGQIVYTAGKRALPGTLLCDGRAVSRAMYPRLFEEINTSYGAGDGVSTFNIPNFLEGT 195 Query: 954 IKGKPASGRAVLSQEQDGIKSHTHSASASSTDLGTKTTSSFDYGTKS------------- 1000 + A V + + SH H+A+A + G + Sbjct: 196 VGVHTADPALVGTFTSGQVISHAHTATAEEGGRHLHPVTVHPAGRHTHPASAAAAGNHLH 255 Query: 1001 ---TNNTGAHTHSVSGSTNSAGAHTH--------SLANVNTASANSGAGSASTRLSVVHN 1049 ++ G H H+ GST+ G H H + + G +T ++ H Sbjct: 256 QAWSDEQGLHQHT--GSTSWDGDHAHILGSFRAIYASGRDMGFYEQNQGKVTTNVTGGHL 313 Query: 1050 QNYATSSAGAHTHSLSGTAASAGAHAHTVGIGA---HTHSVA---IGSHGHTITVNAAGN 1103 + T + G H H++S A AG H H + + A H H+ G HGHT++++ G Sbjct: 314 HRFTTDANGKHAHNISMQA--AGFHVHDIAVTAEADHAHAATAESAGRHGHTVSIDRFGE 371 Query: 1104 AENTVKNI 1111 N + Sbjct: 372 HHNLPAGL 379 >UniRef50_Q7N541 Similar to DNA inversion product and tail fiber protein from lambdoid prophage n=2 Tax=Photorhabdus RepID=Q7N541_PHOLL Length = 337 Score = 92.1 bits (226), Expect = 1e-16, Method: Composition-based stats. 
Identities = 59/337 (17%), Positives = 107/337 (31%), Gaps = 55/337 (16%) Query: 779 LNGNALTATKLQTPRRVSGVEFDGSKDITLTAAHVAAFARRATDTYADADGGVPWNAESG 838 + + L Q+ S + + A+ T + + S Sbjct: 39 ITTHLLNKVLRQSSTISSVIADFIATQSGNDVLDDGDIAKLITQLNRALEQKITTKVPSA 98 Query: 839 AYNVTRSGDSYILVNFYTGVGSCRTLQMKAHYRNGGLFYRSS--RDGYGFEEDWAEVYTS 896 + L + + Q N R ++G + + Sbjct: 99 SLTQQ---GVIQLTDQTGESNALAATQKLVSDVNQNANNRLEKNQNGADIPDKNTFLKNL 155 Query: 897 K---NLPPESYPVGAPIPWPSDTVPSGYALMQGQAFDKSAYPKLAAAYPSGVIPDMRGWT 953 + YPVG I + + P+ L G ++ K + + Sbjct: 156 GLIETIINTQYPVGIVIWFAQNKNPN--VLFPGTTWEYIGENKTIRLASANGSDIL---- 209 Query: 954 IKGKPASGRAVLSQEQDGIKSHTHSASASSTDLGTKTTSSFDYGTKSTNNTGAHTHSVSG 1013 G ++S + +H H+ TTS+FDYGTK+TN G H H Sbjct: 210 ----STGGNDLISLTAAQMPAHNHTF--------FGTTSTFDYGTKTTNIAGEHYHDSGW 257 Query: 1014 STNSAGAHTHSLANVNTASANSGAGSASTRLSVVHNQNYATSSAGAHTHSLSGTAASAGA 1073 + G + H + N ++ + +N + TS + GA Sbjct: 258 GETTGGRYGHFDGSKNNQG---------SKSTDWNNAKFNTS--------------TNGA 294 Query: 1074 HAHTVGIGAHTHSVAIGSHGHTITVNAAGNAENTVKN 1110 H+HTV IGAH H+++ + ++ GNA ++ N Sbjct: 295 HSHTVSIGAHNHTISGNTG------DSGGNAAISITN 325 Score = 44.0 bits (101), Expect = 0.037, Method: Composition-based stats. Identities = 33/81 (40%), Positives = 44/81 (54%), Gaps = 1/81 (1%) Query: 444 TAAKRAEDIASAVALEDASTTKKGIVQLSSATNSTSETLAATPKAVKSAYDNAEKRLQKD 503 T RA + + AS T++G++QL + S LAAT K V NA RL+K+ Sbjct: 81 TQLNRALEQKITTKVPSASLTQQGVIQL-TDQTGESNALAATQKLVSDVNQNANNRLEKN 139 Query: 504 QNGADIPDKGCFLNNINAVSK 524 QNGADIPDK FL N+ + Sbjct: 140 QNGADIPDKNTFLKNLGLIET 160 >UniRef50_B3I9S3 DNA inversion product n=5 Tax=Escherichia coli RepID=B3I9S3_ECOLX Length = 546 Score = 91.4 bits (224), Expect = 2e-16, Method: Composition-based stats. 
Identities = 43/153 (28%), Positives = 63/153 (41%), Gaps = 16/153 (10%) Query: 896 SKNLPPESYPVGAPIPWPSDTVPSGYALMQGQAFDKSAYPKLAAAYPSGVIPDMRGWTIK 955 + PVG P+PWPS T P+G+ G AF YP+LA AYP+ +PD+RG I+ Sbjct: 378 LGLGEGSALPVGVPVPWPSATPPTGWLKCNGAAFSVEEYPELAKAYPTNKLPDLRGEFIR 437 Query: 956 GKPAS-----GRAVLSQEQDGIKSHTH-----------SASASSTDLGTKTTSSFDYGTK 999 G GRA+L+ + I H H + SA + D G Sbjct: 438 GWDDGRGIDTGRALLNWQPHTILDHAHYMELWTGDGLAAGSAREGVNPGILATYGDGGIV 497 Query: 1000 STNNTGAHTHSVSGSTNSAGAHTHSLANVNTAS 1032 T+ G S + +S + + N + Sbjct: 498 KTDEPGLKVPSSLRAISSRSVKRYGEISENVGT 530 >UniRef50_B7MJL6 Putative phage tail protein n=3 Tax=Escherichia RepID=B7MJL6_ECO45 Length = 247 Score = 91.0 bits (223), Expect = 2e-16, Method: Composition-based stats. Identities = 46/134 (34%), Positives = 66/134 (49%), Gaps = 10/134 (7%) Query: 891 AEVYTSKNLPPESYPVGAPIPWPSDTVPSGYALMQGQAFDKSAYPKLAAAYPSGVIPDMR 950 + + PVG P+PWPS T P+G+ G AF YP+LA AYP+ +PD+R Sbjct: 92 TALENLGLGEGSALPVGVPVPWPSATPPTGWLKCNGAAFSAEEYPELAKAYPTNKLPDLR 151 Query: 951 GWTIKGKP-----ASGRAVLSQEQDGIKSHTHS---ASASSTDLGTKTTSSFDYGTKSTN 1002 G I+G S RAVLS ++ + + S + + G K T S G+ S+N Sbjct: 152 GEFIRGWDDGRGVDSRRAVLSTQEPTVGTFYVELAIISGTLSGSGAKFTDSVGIGSTSSN 211 Query: 1003 NT--GAHTHSVSGS 1014 T + SVSG+ Sbjct: 212 ITVSNGNDQSVSGT 225 Score = 50.9 bits (119), Expect = 3e-04, Method: Composition-based stats. 
Identities = 32/135 (23%), Positives = 50/135 (37%), Gaps = 10/135 (7%) Query: 995 DYGTKSTNNTGAHTHSVSGSTNSAGAHTH------SLANVNTASANSGAGSASTRLSVVH 1048 T T + + S A + L + G G S R V+ Sbjct: 114 PSATPPTGWLKCNGAAFSAEEYPELAKAYPTNKLPDLRGEFIRGWDDGRGVDSRRA-VLS 172 Query: 1049 NQNYATSSAGAHTHSLSGTAASAGA-HAHTVGIGAHTHSVAI--GSHGHTITVNAAGNAE 1105 Q + +SGT + +GA +VGIG+ + ++ + G+ A + Sbjct: 173 TQEPTVGTFYVELAIISGTLSGSGAKFTDSVGIGSTSSNITVSNGNDQSVSGTVAVNPVD 232 Query: 1106 NTVKNIAFNYIVRLA 1120 +NIAFNYIVR A Sbjct: 233 TRPRNIAFNYIVRAA 247 >UniRef50_C6CP84 Tail Collar domain protein n=2 Tax=Dickeya RepID=C6CP84_DICZE Length = 646 Score = 91.0 bits (223), Expect = 3e-16, Method: Composition-based stats. Identities = 50/265 (18%), Positives = 79/265 (29%), Gaps = 28/265 (10%) Query: 725 IGGEYGALWRNDGAKTYLLLTNQGDVYGGWNTLRPFAIDNATGELVIGTKLSASLNGNAL 784 + G + + LT Y F + + ++ A Sbjct: 334 ANADVGVTLQTNLGARIAQLT-----YAAAGANTQFYSYKGANSIKLADGGDITITPKAG 388 Query: 785 TATKLQTPRRVSGVEFDGSKDITLTAAHVAAF-----ARRATDTYADADGGVPWNAESGA 839 +L P R + A ++ + W G Sbjct: 389 QTLELNGPIRALADGNQINLQQKTAAPTSGSYIRATNSDGVFLWIVGRSESNGWLELGGK 448 Query: 840 YNVTRSGDSYILVNFYTGVGSCRTLQMKAHYRNGGLFYRSSRDGYGFEEDWAEVYTSKNL 899 + + + G G + SS Sbjct: 449 GSNRIALADSGDITIAPGTGKTTKITSTVTLTAPAAGDNSSTAITSGW------------ 496 Query: 900 PPESYPVGAPIPWPSDTVPSGYALMQGQAFDKSAYPKLAAAYPSGVIPDMRGWTIKGKPA 959 + G P+PWP T P+G+ GQ+FDK YP+LA YPSGV+PD+RG I+G Sbjct: 497 -FAAELAGIPLPWPQATAPTGWLKCNGQSFDKKLYPRLAQVYPSGVLPDLRGEFIRGWDD 555 Query: 960 S-----GRAVLSQEQDGIKSHTHSA 979 R +LS + D I++ S Sbjct: 556 GRGVDNNRGLLSSQGDTIRNIVASF 580 Score = 59.0 bits (140), Expect = 9e-07, Method: Composition-based stats. 
Identities = 43/226 (19%), Positives = 67/226 (29%), Gaps = 19/226 (8%) Query: 907 GAPIPWPSDTVPSGYALMQGQAFDKSAYPKLAAAYPSGVIPDMRGWTIKGKPASGRAVLS 966 G + + +G+ + G+ ++ A A I G T K L+ Sbjct: 428 GVFLWIVGRSESNGWLELGGKGSNRIA----LADSGDITIAPGTGKTTKITST---VTLT 480 Query: 967 QEQDGIKSHTHSASASSTDLGTKTTSSFDYGTKSTNNTGAHTHSVSGSTNSAGAHTH--- 1023 G S T S + T T + S A + Sbjct: 481 APAAGDNSSTAITSGWFAAELAGIPLPWPQATAPTGWLKCNGQSFDKKLYPRLAQVYPSG 540 Query: 1024 ---SLANVNTASANSGAGSASTRL--SVVHNQNYATSSAGAHTHSLSGTAASAGAHAHTV 1078 L + G G + R S + ++ A GA + Sbjct: 541 VLPDLRGEFIRGWDDGRGVDNNRGLLSSQGDTIRNIVASFVMDDQAVTINAPTGAMFPSS 600 Query: 1079 GIGAHTHSVAIGSHGHTITVNAAG----NAENTVKNIAFNYIVRLA 1120 I +S G+ G + +A+ EN +NIAFNYIVR A Sbjct: 601 QIAYDANSNVGGTMGFNVVFDASRVVPTANENRPRNIAFNYIVRAA 646 >UniRef50_B7US81 Predicted tail fiber protein n=16 Tax=root RepID=B7US81_ECO27 Length = 521 Score = 90.2 bits (221), Expect = 4e-16, Method: Composition-based stats. Identities = 42/141 (29%), Positives = 62/141 (43%), Gaps = 10/141 (7%) Query: 896 SKNLPPESYPVGAPIPWPSDTVPSGYALMQGQAFDKSAYPKLAAAYPSGVIPDMRGWTIK 955 + PVG P+PW S T P+G+ G AF YP+LA AYP+ +PD+RG I+ Sbjct: 378 LGLGEGSALPVGVPVPWSSATPPTGWLKCNGAAFSSEMYPRLARAYPTNKLPDLRGEFIR 437 Query: 956 GKPAS-----GRAVLSQEQDGIKSHTHSASASSTDLGTKTTSSFDYGTKSTNNTGAHTHS 1010 G GR +LS + SH D+G+ S +Y +N G S Sbjct: 438 GWDDGRGIDAGRTLLSGQDGTSFSHYGGN----FDIGSGH-SINNYDQIVSNQPGFSRFS 492 Query: 1011 VSGSTNSAGAHTHSLANVNTA 1031 +G + G + ++ N Sbjct: 493 FAGPSRGDGVNYVTIRPRNIT 513 >UniRef50_Q7P172 Probable bacteriophge tail fiber protein n=1 Tax=Chromobacterium violaceum RepID=Q7P172_CHRVO Length = 591 Score = 87.9 bits (215), Expect = 2e-15, Method: Composition-based stats. 
Identities = 50/244 (20%), Positives = 82/244 (33%), Gaps = 40/244 (16%) Query: 823 TYADADGGVPWNAESGAYNVTRSGDSYILVNFYTGVGSCRTLQMKAHYRNGGLF-YRSSR 881 + A + G + SG Y + + N + Q+ A Y NG + +R+ Sbjct: 355 SVAAGEMGFNYGTSSGIYGPFIAFGGLVDNNINY------SCQLTADYGNGSMMRFRTRN 408 Query: 882 D--GYGFEEDWAEVYTSKNLPPESYPVGAPIPWPSDTVPSGYALMQGQAFDKSAYPKLAA 939 D G W L E Y G + P G+ G A + YP L A Sbjct: 409 DDGTTGRWNPWR------TLIHEDYLTGQVAFFAMSAPPLGWLKANGAAVSRKDYPSLFA 462 Query: 940 A----YPSG------VIPDMRGWTIKGKPAS-----GRAVLSQEQDGI----KSHTHSAS 980 A Y +G +PD+RG ++G GR + ++ + S T Sbjct: 463 ALGTYYGAGDGSTTFNLPDLRGEFVRGWDDGRGVDNGRGFGTWQKGTLTFSDPSLTSPCV 522 Query: 981 ASSTDLGTKTTSSF-DYGTKSTNNTGAHTHSVSGSTNSAGAHTHSLANVNTASANSGAGS 1039 AS T + D G + + + A+ L ++++ +G GS Sbjct: 523 ASLVHRNDNTVIGYLDLGADPVDKNK-----YDLGLSVSTANGVYLPDLDSGGWANGYGS 577 Query: 1040 ASTR 1043 R Sbjct: 578 TRPR 581 >UniRef50_C4KLT7 Hep_Hag family protein n=45 Tax=Proteobacteria RepID=C4KLT7_BURPS Length = 1185 Score = 86.7 bits (212), Expect = 5e-15, Method: Composition-based stats. 
Identities = 88/582 (15%), Positives = 217/582 (37%), Gaps = 6/582 (1%) Query: 147 DAADSARAASTSAGQAASSAQSASSSAGTASTKATEASKSAAAAESSKSAAATSAGAAKT 206 + SA + S + +S +A++S ++ T ++ S + +S + A+ S + Sbjct: 399 QGSVSANTGTASGDNSTASGDNATASGTNSTANGTNSTASGDNSTASGTNASASGENSTA 458 Query: 207 SETNASASLQSAATSASTATTKASEAATSARDAAASKEAAKSSETNASSSASSAASSATA 266 + T+++AS ++ + + +T + S +A+A+ E + ++ T++++S S++ ++ T Sbjct: 459 TGTDSTASGSNSTANGTNSTASGDNSTASGTNASATGENSTATGTDSTASGSNSTANGTN 518 Query: 267 AGNSAKAAKTSETNARSSETAAGQSASAAAGSKTAAASSASAASTSAGQASASATAAGKS 326 + S + S TNA +S + + + + S + + ++ + ++ S ++AS T A + Sbjct: 519 STASGDNSTASGTNASASGENSTATGTDSTASGSNSTANGTNSTASGDNSTASGTNASAT 578 Query: 327 AESAASSASTATTKAGEATEQASAAARSASAAKTSETNAKASETSAESSKTAAASSASSA 386 E++ ++ + +T +T + + S + S TNA A+ ++ ++ T + +S S++ Sbjct: 579 GENSTATGTDSTASGSNSTANGTNSTASGDNSTASGTNASATGENSTATGTDSTASGSNS 638 Query: 387 ASSASSASASKDEATRQASAAKSSATTASTKATEAAGSATAAAQSKSTAESAATRAETAA 446 ++ ++++AS D +T + A ++ ++ T++ S + + + + + ++ + + Sbjct: 639 TANGTNSTASGDNSTASGTNASATGENSTATGTDSTASGSNSTANGTNSTASGDNSTASG 698 Query: 447 KRAEDIASAVALEDASTTKKGIVQLSSATNSTSETLAATPKAVKSAYDNAEKRLQKDQNG 506 A +T G ++ NST+ +T ++ + Sbjct: 699 TNASATGENSTATGTDSTASGSNSTANGANSTASGDNSTASGTNASATGENSTATGTAST 758 Query: 507 ADIPDKGCFLNNINAVSKTDFADKRGMRYVRVNAPAGATSGKYYPVVVMRSAGSVSELAS 566 A N N+ + + + G + AT +A + AS Sbjct: 759 AS--GSNSTANGTNSTASGNNSTASGTNASATGENSTATGTDSAASGTNSTANGTNSTAS 816 Query: 567 RVIITTATRTAGDPMNNCEFNGFVMPGGWTDRGRYAYGMFWQYQNNERAIHSIMMSNKGD 626 T + A N G ++ + + G Sbjct: 817 GDNSTASGTNASATGENSTATGTASTASGSNSTANGANSTASGAGATATGENAAATGAGA 876 Query: 627 DLRSVFYVDGAAFPVFAFIEDGLSISAPGADLVVNDTTYKFGATNPATECIAADVILDFK 686 + A+ + G + A G + N A Sbjct: 877 T----ATGNNASASGTSSTAGGANAIASGENSTANGANSTASGAGATATGENAAATGAGA 932 Query: 687 SGRGFYESHSLIVNDNLSCKKLFATDEIVARGGNQIRMIGGE 728 + G S S + + + + A G N G Sbjct: 933 TATGNNASASGTSSTAGGANAIASGENSTANGANSTASGNGS 974 Score = 47.1 bits (109), Expect = 0.004, Method: Composition-based stats. 
Identities = 113/824 (13%), Positives = 239/824 (29%), Gaps = 18/824 (2%) Query: 280 NARSSETAAGQSASAAAGSKTAAASSASAASTSAGQASASATAAGKSAESAASSASTATT 339 A + T A A+ S + AA ++AAS A +++ A G A + S+++ T Sbjct: 223 AAGVNATDAVNVGQLASLSTSTAAGLSTAASGVASLSTSLLGAVGDLASLSTSASTGLAT 282 Query: 340 KAGEATEQASAAARSASAAKTSETNAKASETSAESSKTAAASSASSAASSASSASASKDE 399 +++ +A + T+ + +T+ + S S S+ + Sbjct: 283 ADSGIASLSTSLLGTADNVTSLSTSLSTVNANLAGLQTSVDNVVSYDDPSKSAITLGGAG 342 Query: 400 ATRQASAAKSSATTASTKATEAAGSATAAAQSKSTAESAATRAETAAKRAEDIASAVALE 459 T +A + +T+A + + ++ + + ++ Sbjct: 343 VTTPVLLTNVAAGKIAATSTDAVNGSQLYTLQQEFSQQYDLLTSQVSSLSTSVSGLQGSV 402 Query: 460 DASTTKKGIVQLSSATNSTSETLAATPKAVKSAYDNAEKRLQKDQNGADIPDKGCFLNNI 519 A+T S+A+ + A + + A + Sbjct: 403 SANTGTASGDN-STASGDNATASGTNSTANGTNSTASGDNSTASGTNASASGENSTATGT 461 Query: 520 NAVSKTDFADKRGMRYVRVNAPAGATSGKYYPVVVMRSAGSVSELASRVIITTATRTAGD 579 ++ + + G + A+ +A AS T + Sbjct: 462 DSTASGSNSTANGTNSTASGDNSTASGTNASATGENSTATGTDSTASGSNSTANGTNSTA 521 Query: 580 PMNNCEFNGFVMPGGWTDRGRYAYGMFWQYQNNERAIHSIMMSNKGDDLRSVFYVDGAAF 639 +N +G + A G + + + GD+ S A+ Sbjct: 522 SGDNSTASGT--NASASGENSTATGTDSTASGSNSTANGTNSTASGDN--STASGTNASA 577 Query: 640 PVFAFIEDGLSISAPGADLVVNDTTYKFGATNPATECIAADVILDFKSGRGFYESHSLIV 699 G +A G++ N T N A + + G + S Sbjct: 578 TGENSTATGTDSTASGSNSTANGTNSTASGDNSTASGTNASATGENSTATGTDSTASGSN 637 Query: 700 NDNLSCKKLFATDEIVARGGNQIRMIGGEYGALWRNDGAKTYLLLTNQGDVYGGWNTLRP 759 + + D A G N + + + G N+ Sbjct: 638 STANGTNSTASGDNSTASGTNASATGENSTATGTDSTASGSNSTANGTNSTASGDNSTAS 697 Query: 760 FAIDNATGELVIGTKLSASLNGNALTATKLQTPRRVSGVEFDGSKDITLTAAHVAAFARR 819 +ATGE T ++ +G+ TA + G + + T + A Sbjct: 698 GTNASATGENSTATGTDSTASGSNSTANGANSTASGDNSTASG-TNASATGENSTATGTA 756 Query: 820 ATDTYADADGGVPWNAESGAYNVTRSGDSYILVNFYTGVGSCRTLQMKAHYRNGGLFYRS 879 +T + +++ + SG + ++ T G+ NG S Sbjct: 757 STASGSNSTANGTNSTASGNNSTASGTNASATGENSTATGTDSAASGTNSTANGTNSTAS 816 Query: 880 SRDGYGFEEDWAEVYTSKNLPPESYPVGAPIPWPSDTVPSGYALMQGQAFDKSAYPKLAA 939 + + + + + T + G S A Sbjct: 817 
GDNSTASGTNASATGENSTATGT-----------ASTASGSNSTANGANSTASGAGATAT 865 Query: 940 AYPSGVIPDMRGWTIKGKPASGRAVLSQEQDGIKSHTHSASASSTDLGTKTTSSFDYGTK 999 + T ASG + + + I S +++A+ + + G Sbjct: 866 GENAAATGAGATATGNNASASGTSSTAGGANAIAS-GENSTANGANSTASGAGATATGEN 924 Query: 1000 STNNTGAHTHSVSGSTNSAGAHTHSLANVNTASANSGAGSASTRLSVVHNQNYATSSAGA 1059 + T + + ++ S + T AN + NS A A++ S + + S+A A Sbjct: 925 AAATGAGATATGNNASASGTSSTAGGANAIASGENSTANGANSTASGNGSSAFGESAAAA 984 Query: 1060 HTHSLSGTAASAGAHAHTVGIGAHTHSVAIGSHGHTITVNAAGN 1103 S + + + + +V GA + + S + NA G Sbjct: 985 GDGSTALGSNAVASGVGSVATGAGSVASGANSSAYGTGSNATGA 1028 >UniRef50_D0KLI8 Tail Collar domain protein n=1 Tax=Pectobacterium wasabiae WPP163 RepID=D0KLI8_PECWW Length = 621 Score = 86.0 bits (210), Expect = 8e-15, Method: Composition-based stats. Identities = 35/94 (37%), Positives = 52/94 (55%), Gaps = 5/94 (5%) Query: 901 PESYPVGAPIPWPSDTVPSGYALMQGQAFDKSAYPKLAAAYPSGVIPDMRGWTIKGKP-- 958 P + VG P +P P+G+ GQ FD + YP LA+ YPSG +PD+RG ++G Sbjct: 465 PSAELVGMPQVFPGAVAPAGWLKCNGQQFDTAQYPILASRYPSGFLPDLRGEFVRGWDDE 524 Query: 959 ---ASGRAVLSQEQDGIKSHTHSASASSTDLGTK 989 +GRA+LS++ D I++ T + AS G Sbjct: 525 RGVDAGRALLSEQGDAIRNITGTMRASDVPYGHT 558 Score = 57.1 bits (135), Expect = 4e-06, Method: Composition-based stats. 
Identities = 69/418 (16%), Positives = 110/418 (26%), Gaps = 62/418 (14%) Query: 756 TLRPFAIDNATGELVIGTKLSASLNGNALTATKLQTPRRVSGVEFDGSKDITLTAAHVAA 815 TL+ A T L+A A P + + G D+ Sbjct: 213 TLQELANALGNDPNFSTTILNALAGKLAKDQNGADIPDKSRFRQNVGLGDVWNNQNLPGP 272 Query: 816 FARRAT-DTYADADGGVPWNAESGAYNVTRSGDSYILVNFYTGVGSCRTLQMKAHYRNGG 874 F D D P + Y +V G + +Q+ Y Sbjct: 273 FGASINSGIIHDCDLIAPGTTALCSGTAKNGPGFYAIVFSAYGHTNQYVIQVARAYGGAE 332 Query: 875 LFYRSSRDGYGFEEDWAEVYTSKNLPPESYPVG--------------APIPWPSDTVPSG 920 R+ +W+ +T NL P + A + P D Sbjct: 333 FATRARNGDVNQWTEWSRSWTLSNLNPMTTDTEQTITGGKTIRRNNEALLLKPQDIGLGS 392 Query: 921 YALMQGQAFDKSAYPKLAAAYPSGVIPDMRGWTIKGKPASGRAVLSQEQDGIKSHT---- 976 Y L + Y Y S + D+ T+ + +L Q D T Sbjct: 393 YILARDADDSDRWY----LGYSSANVKDL---TLHNYASRSSIILQQAGDCAVHATRLKI 445 Query: 977 ------HSASASSTDLGTKTTSS-------FDYGTKSTNNTGAHTHSVSGSTNSAGAHTH 1023 H S + +G ++ F + + A + Sbjct: 446 NEHEVWHKGSLTPAAIGAMPSAELVGMPQVFPGAVAPAGWLKCNGQQFDTAQYPILASRY 505 Query: 1024 SLANVNTASANSGAGSASTRLSVVHNQNYATSSAGAHTHSLSGTAASAGA---HAHTVGI 1080 + G R S G +++GT ++ H V Sbjct: 506 PSGFLPDLRGEFVRGWDDERGVDAGRAL--LSEQGDAIRNITGTMRASDVPYGHTQFVDA 563 Query: 1081 GAHTHSVAIGSHGHTITVNAAGNA------------------ENTVKNIAFNYIVRLA 1120 A + + T +++GNA EN +NIAFNYIVR A Sbjct: 564 LKADGVFAPIAGDKSWTGDSSGNAGNPWGVSFDTSRVVPTANENRPRNIAFNYIVRAA 621 Score = 45.5 bits (105), Expect = 0.011, Method: Composition-based stats. 
Identities = 50/286 (17%), Positives = 82/286 (28%), Gaps = 1/286 (0%) Query: 278 ETNARSSETAAGQSASAAAGSKTAAASSASAASTSAGQASASATAAGKSAESAASSASTA 337 ET+ G S A + A T+ AA A S A Sbjct: 18 ETSDPVVAGPGGISNRQAEQLASRTAYLKKMQETTGDSLQKHL-AASDPHSQYAPKNSPA 76 Query: 338 TTKAGEATEQASAAARSASAAKTSETNAKASETSAESSKTAAASSASSAASSASSASASK 397 T A A A + A +A A+ + + ++A + S + Sbjct: 77 LTGTPTAPTTAQTANNTQIATTAFVKSAIAALINGSPAALDTLQELANALGNDPHFSTTI 136 Query: 398 DEATRQASAAKSSATTASTKATEAAGSATAAAQSKSTAESAATRAETAAKRAEDIASAVA 457 A ++ A +A A T A A +++ + +A Sbjct: 137 LNAITDVKTDATNKLNAHASILDAHPQYAPKASPAFTGTPTAPTAASSSNDTQLATTAFV 196 Query: 458 LEDASTTKKGIVQLSSATNSTSETLAATPKAVKSAYDNAEKRLQKDQNGADIPDKGCFLN 517 + G + L P + + +L KDQNGADIPDK F Sbjct: 197 KAAVAALVNGSPTALDTLQELANALGNDPNFSTTILNALAGKLAKDQNGADIPDKSRFRQ 256 Query: 518 NINAVSKTDFADKRGMRYVRVNAPAGATSGKYYPVVVMRSAGSVSE 563 N+ + + G +N+ P +G+ Sbjct: 257 NVGLGDVWNNQNLPGPFGASINSGIIHDCDLIAPGTTALCSGTAKN 302 >UniRef50_C5PDM6 Putative uncharacterized protein n=2 Tax=Coccidioides RepID=C5PDM6_COCP7 Length = 1433 Score = 85.2 bits (208), Expect = 1e-14, Method: Composition-based stats. 
Identities = 70/523 (13%), Positives = 158/523 (30%), Gaps = 1/523 (0%) Query: 50 SMDVEYGQYSVILLVEGFPPSHAGTITVYEDSQPGTLNDFLGAMTEDDARPEALRRFELM 109 S ++ G+Y T E G +N+ L T D++ E L+ + Sbjct: 58 SFELAPGKYQYKFRAGSGDAWFCDTDVPTEVDNDGNMNNVLLVETASDSKKEGLQAGQSK 117 Query: 110 VEEVARNASAVAQNTAAAKKSASDASTSAREAATHAADAADSARAASTSAGQAASSAQSA 169 + NA V + +K+A+D ++ +S A+ + + + + Sbjct: 118 SDNTTENAETVVASDETTEKAAADKESNGVNGVVAEKADPESKEASVPAEEAPEKAEKES 177 Query: 170 SSSAGTASTKATEASKSAAAAESSKSAAATSAGAAKTSETNASASLQSAATSASTATTKA 229 + A T + A + S A S A + + S + + Sbjct: 178 TGGEDVAEQPTTVSEAIAKSTPSDLKPADDSQPKANIKDEDVSVNGGPVEKKLEVQEAET 237 Query: 230 SEAATSARDAAASKEAAKSSETNASSSASSAASSATAAGNSAKAAKTSETNARSSETAAG 289 + S + + ++++ A S + + T+ Sbjct: 238 ESPQPEKKAPEVSSPDSSADPISSATEAESPGQAEPEPSEPKAEEPKAPVEPEIQPTSEV 297 Query: 290 QSASAAAGSKTAAASSASAASTSAGQASASATAAGKSAESAASSASTATTKAGEATEQAS 349 + + A K + + + + +A + ++ SS++ + A A Sbjct: 298 EKQAEATEPKDESPAVLNPEAPQQLTETAKDLESRPQTPASMSSSTRSLNPAAPAFVPGK 357 Query: 350 AAARSASAAKTSETNAKASETSAESSKTAAASSASSAASSASSASASKDEATRQASAAKS 409 S ++ A + ++ A ++++ + +++K Sbjct: 358 FTPVSVPNPAETDDAAPKDDKESQPPAPEEPKVAEKVPEASTAGDEVDKDKEEVVASSKP 417 Query: 410 SATTASTKATEAAGSATAAAQSKSTAESAATRAETAAKRAEDIASAVALEDASTTKKGIV 469 A T T A + A + ++ + A A + A E K V Sbjct: 418 EEDVAQTTTTTAEETKETPAAEQPAMDAVKEVKKPAEPAAPEQTEEPAEEAPKVEKAPEV 477 Query: 470 QLSSATNSTSETLAATPKAVKSAYDNAEKRLQKDQNGADIPDKGCFLNNINAVSKTDFAD 529 + ++A ST AT +AV + EK + + ++ + + V + D Sbjct: 478 ETAAAAVSTDPIPEATAEAV-TEDKEEEKNVSTTEAPSEPLAENVTASATETVEEPAQKD 536 Query: 530 KRGMRYVRVNAPAGATSGKYYPVVVMRSAGSVSELASRVIITT 572 + + + PV + A V +T Sbjct: 537 DEVVPSSKTEVEEKEVQQQEKPVDEVPVAEDSPAEPEPVKAST 579 >UniRef50_D0KGE3 Tail Collar domain protein n=1 Tax=Pectobacterium wasabiae WPP163 RepID=D0KGE3_PECWW Length = 144 Score = 84.8 bits (207), Expect = 2e-14, Method: Composition-based stats. 
Identities = 38/134 (28%), Positives = 54/134 (40%), Gaps = 18/134 (13%) Query: 912 WPSDTVPSGYALMQGQAFDKSAYPKLAAAYPSGVIPDMRGWTIKGKP-------ASGRAV 964 W + P G+ + GQ F+ S P LA YPS +PD RG+ +G S R+V Sbjct: 2 WGTPVPPEGWLELNGQLFNPSGNPVLADLYPSSRVPDFRGYFPRGWDNGAGIDPDSSRSV 61 Query: 965 LSQEQDGIKSHTHSASASSTDLGTKTTSSFDYGTKSTNNTG---AHTHSVSGSTNS---- 1017 LS + D I SH H+ + S G + F T+ +G H + + Sbjct: 62 LSYQDDEIISHKHAITMSHEHHGAADGAGFP----QTDASGPMIKHAETEPDGSFPERSG 117 Query: 1018 AGAHTHSLANVNTA 1031 AG S T Sbjct: 118 AGNPMFSFGGSETR 131 >UniRef50_B3I8J5 DNA inversion product n=3 Tax=Enterobacteriaceae RepID=B3I8J5_ECOLX Length = 263 Score = 83.7 bits (204), Expect = 4e-14, Method: Composition-based stats. Identities = 44/167 (26%), Positives = 70/167 (41%), Gaps = 14/167 (8%) Query: 883 GYGFEEDWAEVYTSKNLPPESYPVGAPIPWPSDTVPSGYALMQGQAFDKSAYPKLAAAYP 942 G + + PVGAP+PWPS+T P+G+ G AF YP+LA AYP Sbjct: 82 SDGAAAISTALTNLGLGEGSALPVGAPVPWPSETPPTGWLKCNGAAFSAEEYPELAKAYP 141 Query: 943 SGVIPDMRGWTIKGKPAS-----GRAVLSQEQD-----GIKSHTHSASASSTDLG---TK 989 + +PD+RG I+G S GR++LS + ++ + ++ +G Sbjct: 142 TNKLPDLRGEFIRGWDDSRGIDTGRSLLSGQAATFIRTALQDYYGYDLNTNVKVGIAFAT 201 Query: 990 TTSSFDYGTKSTNNTGAHTHSVSGST-NSAGAHTHSLANVNTASANS 1035 S G + G ++ V S NS + + T + S Sbjct: 202 ADSVITVGNPANPKAGNNSDYVPASADNSITGTQRTAEDNFTGAWIS 248 >UniRef50_C3KCU2 Putative phage tail fiber-related protein n=1 Tax=Pseudomonas fluorescens SBW25 RepID=C3KCU2_PSEFS Length = 658 Score = 83.3 bits (203), Expect = 5e-14, Method: Composition-based stats. 
Identities = 54/310 (17%), Positives = 94/310 (30%), Gaps = 33/310 (10%) Query: 738 AKTYLLLTNQGDVYGGWNTLRPFAIDNATGELVIGTKLSASLNGNALTAT-KLQTPRRVS 796 A Y LLT+ G Y + + ++ +G A N +A K + R Sbjct: 2 ADYYTLLTDAGIAYET--ACKAAGVPIKLAQISVGDGNGAVYNPDASAKALKREVWRGPL 59 Query: 797 GVEFDGSKDIT------LTAAHVAAFARRATDTYADADGGVP---WNAESGAYNVTRSGD 847 F K+ + V + R + D + T Sbjct: 60 NALFQDEKNANWLMAEVTIPSDVGGWYVREAGLWTDTGILYAIVKYPESYKPVLATSGSG 119 Query: 848 SYILVNFYTGVGSCRTLQMKAHY---RNGGLFYRSSRDGYGFEEDWAEV-YTSKNLPPES 903 + + + + + + E + + + Sbjct: 120 KEFYIRSIFETSNAAIVTLLIDDTVVKATRAWVMDYLGRQLAEGTYTKAEIEMLIAQSSA 179 Query: 904 YPVGAPIPWPSDTVPSGYALMQGQAFDKSAYPKLAA-----------AYPSGVIPDMRGW 952 PVG+ + +P D VP G+ + G +AYP LA + +P+ RG Sbjct: 180 LPVGSMVAFPIDKVPVGFLEIDGSVKSATAYPDLAKFLGTAFNKGDEGAGNFRLPESRGE 239 Query: 953 TIKGKP-----ASGRAVLSQEQDGIKSHTHSASASSTDLGTKTTS-SFDYGTKSTNNTGA 1006 ++G +GR S + D KSHTH + S + + +T+ TG Sbjct: 240 FLRGWDHGRGVDAGRLAGSYQTDQFKSHTHEYDTMQGGGAANSVSDTIAAQSNATSQTGH 299 Query: 1007 HTHSVSGSTN 1016 T GS Sbjct: 300 ITGGAGGSET 309 Score = 60.2 bits (143), Expect = 4e-07, Method: Composition-based stats. Identities = 38/183 (20%), Positives = 69/183 (37%), Gaps = 23/183 (12%) Query: 821 TDTYADADGGVPWNAESGAYNVTRSGDSYILVNFYTGVGSCRTLQMKAHYRNGGLFYRSS 880 T Y GG N+ S S + + G G T ++ + Sbjct: 268 THEYDTMQGGGAANSVSDTIAAQ-SNATSQTGHITGGAGGSETRPRNL----AVMWCIKA 322 Query: 881 RDGYGFEEDW--AEVYTSKNLPPESYPVGAPIPWPSDTVPSGYALMQGQAFDKSAYPKLA 938 + + + A + + ++ + PVG+ IP+ VP GY + G + YP LA Sbjct: 323 WNAPINQGNIDVAALVSELDVLKSAVPVGSIIPFLKAAVPPGYLELDGSVQSIATYPDLA 382 Query: 939 A---------AYPSG--VIPDMRGWTIKGKP-----ASGRAVLSQEQDGIKSHTHSASAS 982 A + P+G +P+ RG ++G +GR V S ++ + + + A+ Sbjct: 383 AYLGTTFNTGSEPAGYFRLPESRGEFLRGWDHGRGMDAGREVGSWQKGSMVAVDTNIPAT 442 Query: 983 STD 985 T Sbjct: 443 QTI 445 >UniRef50_Q0C8E2 Predicted protein n=1 Tax=Aspergillus terreus NIH2624 RepID=Q0C8E2_ASPTN Length = 1383 Score = 83.3 bits (203), Expect = 5e-14, Method: Composition-based stats. 
Identities = 50/499 (10%), Positives = 121/499 (24%), Gaps = 3/499 (0%) Query: 51 MDVEYGQYSVILLVEGFPPSHAGTITVYEDSQPGTLNDFLGAMTEDDARPEALRRFELMV 110 DV GQY + G +++ L + + P ++ V Sbjct: 58 FDVAPGQYQYRFREGPNGAWFHDETVKTVPGEDGLIHNILHVEDKSEPVP--VKEEPAPV 115 Query: 111 EEVARNASAVAQNTAAAKKSASDASTSAREAATHAADAADSARAASTSAGQAASSAQSAS 170 ++ A N + + + S ++ + + + + +A Sbjct: 116 KDDAENHTETNGVSEETTAPVQETSAGEKKDDSKPVEPEPVSNGVESEVNGVEPAAPEPE 175 Query: 171 SSAGTASTKATEASKSAAAAESSKSAAATSAGAAKTSETNASASLQSAATSASTATTKAS 230 A +A ++ A+ ++E S + A + A T Sbjct: 176 QENSAAPAPEADAVTQKPEEPQPETKLEEVPAASTSTEMAKSETTPQATSEAQADATTEK 235 Query: 231 EAATSARDAAASKEAAKSSETNASSSASSAASSATAAGNSAKAAKTSETNARSSETAAGQ 290 + + + T + + A A K ET + E + Sbjct: 236 LEESQPEKGTEEPTKTEEASTASPETEQKQIEVTPEANPVATEEKAEETQ-PAKEVTEPE 294 Query: 291 SASAAAGSKTAAASSASAASTSAGQASASATAAGKSAESAASSASTATTKAGEATEQASA 350 + S + T A S + T ++ ++ + Sbjct: 295 TKSEQTPATTEPEQEEKQAEVSPEVETPVETKDKPEESQLEAAEEPVESEPVVEERSETV 354 Query: 351 AARSASAAKTSETNAKASETSAESSKTAAASSASSAASSASSASASKDEATRQASAAKSS 410 + A ++ S+ + + + Sbjct: 355 PETAEKATSAETETSEVDSAEPAKEAVKEEKSSQEPVTEEKPEPVPETAEEAAPVETATE 414 Query: 411 ATTASTKATEAAGSATAAAQSKSTAESAATRAETAAKRAEDIASAVALEDASTTKKGIVQ 470 + TE A +K A E E A +E + + + + Sbjct: 415 EPVPAEPVTEEPAKEIATEVAKEEASQEPVVEEPKESTQEASAIEEPMETPAAETQAVKE 474 Query: 471 LSSATNSTSETLAATPKAVKSAYDNAEKRLQKDQNGADIPDKGCFLNNINAVSKTDFADK 530 + A S A + + K +D+ + ++ + A + Sbjct: 475 DAEAAKEESVQEAVKEETSQEPVVEESKESTQDEPVVEEANEAPVAETSAVEEAAEAAKE 534 Query: 531 RGMRYVRVNAPAGATSGKY 549 ++ + + Sbjct: 535 ESVQEAVKEESSQGPVVEE 553 >UniRef50_Q92954 Proteoglycan 4 C-terminal part n=17 Tax=Eutheria RepID=PRG4_HUMAN Length = 1404 Score = 82.5 bits (201), Expect = 1e-13, Method: Composition-based stats. 
Identities = 49/493 (9%), Positives = 108/493 (21%), Gaps = 9/493 (1%) Query: 74 TITVYEDSQPGTLNDFLGAMTEDDARPEALRRFELMVEEVARNASAVAQNTAAAKKSASD 133 + + + + E + AL + + ++ S Sbjct: 315 KTSAKDLAPTSKVLAKPTPKAETTTKGPALTTPKEPTPTTPKEPASTTPKEPTPTTIKSA 374 Query: 134 ASTSAREAATHAADAADSARAASTSAGQAASSAQSASSSAGTASTKATEASKSAAAAESS 193 +T A T A + + + A + +TK + + +A + Sbjct: 375 PTTPKEPAPTTTKSAPTTPKEPAP--TTTKEPAPTTPKEPAPTTTKEPAPTTTKSAPTTP 432 Query: 194 KSAAATSAGAAKTSETNASASLQSAATSASTATTKASEAATSARDAAASKEAAKSSETNA 253 K A T+ + A + +T A A + Sbjct: 433 KEPAPTTPKKPAPTTPKEPAPTTPKEPTPTTPKEPAPTTKEPAPTTPKEPAPTAPKKPAP 492 Query: 254 SSSASSAASSATAAGNSAKAAKTSETNARSSETAAGQSASAAAGSKTAAASSASAASTSA 313 ++ A ++ + + T + T + + SA Sbjct: 493 TTPKEPAPTTPKEPAPTTTKEPSPTTPKEPAPTTTKSAPTTTKEPAPTTTKSAPTTPKEP 552 Query: 314 GQASASATAAGKSAESAASSASTATTKAGEATEQASAAARSASAAKTSETNAKASETSAE 373 + A E A ++ + + + + K Sbjct: 553 SPTTTKEPAPTTPKEPAPTTPKKPAPTTPKEPAPTTPKEPAPTTTKKPAPTTPKEPAPTT 612 Query: 374 SSKTAAASSASSAASSASSASASKDEATRQASAAKSSATTASTKATEAAGSATAAAQSKS 433 +TA + ++ + + E + + + TT + Sbjct: 613 PKETAPTTPKKLTPTTPEKLAPTTPEKPAPTTPEELAPTTPEEPTPTTPEEPAPTTPKAA 672 Query: 434 TAESAATRAETAAKRAEDIASAVALEDASTTKKGIVQLSSATNSTSETLAATPKAVKSAY 493 + A T K +A + E TPK Sbjct: 673 APNTPKEPAPTTPKEPAPTTPKEPAPTTPKETAPTTPKGTAPTTLKEPAPTTPKKPAPKE 732 Query: 494 DNAEKRLQKDQNGADIPDKGCFLNNINAVSKTDFADKRGMRYVRVNAPAGATSGKYYPVV 553 + D P T + PA T P Sbjct: 733 LAPTTTKEPTSTTCDKPAPTT-------PKGTAPTTPKEPAPTTPKEPAPTTPKGTAPTT 785 Query: 554 VMRSAGSVSELAS 566 + A + + + Sbjct: 786 LKEPAPTTPKKPA 798 Score = 75.6 bits (183), Expect = 1e-11, Method: Composition-based stats. 
Identities = 58/448 (12%), Positives = 114/448 (25%), Gaps = 7/448 (1%) Query: 77 VYEDSQPGTLNDFLGAMTEDDARPEALRRFELMVEEVARNASAVAQNTAAAKKSASDAST 136 T T + P + + + + T + + + Sbjct: 383 PTTTKSAPTTPKEPAPTTTKEPAPTTPKEPAPTTTKEPAPTTTKSAPTTPKEPAPTTPKK 442 Query: 137 SAREAATHAADAADSARAASTSAGQAASSAQSASSSAGTASTKATEASKSAAAAESSKSA 196 A A +T A ++ + A ++ + A + E + + Sbjct: 443 PAPTTPKEPAPTTPKEPTPTTPKEPAPTTKEPAPTTPKEPAPTAPKKPAPTTPKEPAPTT 502 Query: 197 AATSAGAAKTSETNASASLQSAATSASTATTKASEAATSARDAAASKEAAKSSETNASSS 256 A + + + T+ S TT A T+ + A + + + T + Sbjct: 503 PKEPAPTTTKEPSPTTPKEPAPTTTKSAPTTTKEPAPTTTKSAPTTPKEPSPTTTKEPAP 562 Query: 257 ASSAASSATAAGNSAKAAKTSETNARSSETAAGQSASAAAGSKTAAASSASAASTSAGQA 316 + + T A E A + A + A + + Sbjct: 563 TTPKEPAPTTPKKPAPTTPKEPAPTTPKEPAPTTTKKPAPTTPKEPAPTTPKETAPTTPK 622 Query: 317 SASATAAGKSAESAASSASTATTKAGEATEQASAAARSASAAKTSETNAKASETSAESSK 376 + T K A + + T + T + + A A T E + Sbjct: 623 KLTPTTPEKLAPTTPEKPAPTTPEELAPTTPEEPTPTTPEEPAPTTPKAAAPNTPKEPAP 682 Query: 377 TAAASSASSAASSASSASASKDEATRQASAA----KSSATTASTKATE---AAGSATAAA 429 T A + + + + T A K A T K A + Sbjct: 683 TTPKEPAPTTPKEPAPTTPKETAPTTPKGTAPTTLKEPAPTTPKKPAPKELAPTTTKEPT 742 Query: 430 QSKSTAESAATRAETAAKRAEDIASAVALEDASTTKKGIVQLSSATNSTSETLAATPKAV 489 + + T TA ++ A E A TT KG + + + PK + Sbjct: 743 STTCDKPAPTTPKGTAPTTPKEPAPTTPKEPAPTTPKGTAPTTLKEPAPTTPKKPAPKEL 802 Query: 490 KSAYDNAEKRLQKDQNGADIPDKGCFLN 517 D+ P + Sbjct: 803 APTTTKGPTSTTSDKPAPTTPKETAPTT 830 >UniRef50_Q7N6T1 Similarities with DNA inversion product n=2 Tax=Photorhabdus luminescens subsp. laumondii RepID=Q7N6T1_PHOLL Length = 300 Score = 82.5 bits (201), Expect = 1e-13, Method: Composition-based stats. 
Identities = 37/91 (40%), Positives = 48/91 (52%), Gaps = 5/91 (5%) Query: 892 EVYTSKNLPPESYPVGAPIPWPSDTVPSGYALMQGQAFDKSAYPKLAAAYPSGVIPDMRG 951 E ++ PVG+PIPWP P GY G AF+K YPKLA AYP G +PD+RG Sbjct: 140 EEIDNRIKTVGEIPVGSPIPWPLPYPPVGYLTCNGSAFNKLQYPKLAEAYPDGRLPDLRG 199 Query: 952 WTIKGKPAS-----GRAVLSQEQDGIKSHTH 977 I+G GR +LS + D ++ T Sbjct: 200 EFIRGWDDGRGVDMGRTMLSWQGDAMQRMTG 230 >UniRef50_B5JYG6 Phage Tail Collar Domain family (Fragment) n=1 Tax=gamma proteobacterium HTCC5015 RepID=B5JYG6_9GAMM Length = 400 Score = 82.1 bits (200), Expect = 1e-13, Method: Composition-based stats. Identities = 52/225 (23%), Positives = 80/225 (35%), Gaps = 47/225 (20%) Query: 905 PVGAPIPWPSDTVPSGYALMQGQAFDKSAYPKLAAAYPSG----------VIPDMRGWTI 954 P G + T P G+ G ++ YP L A + +PD+R Sbjct: 212 PAGRTEDFAGTTPPGGWLFCDGSEVSRTQYPALFTAIGTLWGDGDGSTTFNLPDLRNDFR 271 Query: 955 KGKPASGRAVLSQEQDGIKSHTHSASASSTDLGTKTTSSFDYGTKSTNNTGAHTHSVSGS 1014 +G + R+V E D IKSH+HSAS+ +GAHTH G Sbjct: 272 RGCSDT-RSVGDSESDQIKSHSHSASSED--------------------SGAHTH--GGR 308 Query: 1015 TNSAGAHTHSLANVNTASANSGAGSASTRLSVVHNQNYATSSAGAHTHSLSGTAASAGAH 1074 ++ +GAH H + +++ G+ S + + +A H Sbjct: 309 SSDSGAHKHRSGWGESNRSDAPFGATSGSGHRGSGDSDW--------DNYLYYTDTAQPH 360 Query: 1075 AHTVGIGAHTHSVAIGSHGHTITVNAAGNAENTVKNIAFNYIVRL 1119 H + I GSH H I + G E +N I+R Sbjct: 361 FHWLIIN------QAGSHSHPINIEPTGGDETRPRNKVLMPIIRA 399 >UniRef50_C7JGL3 Chromosome segregation protein SMC n=9 Tax=Alphaproteobacteria RepID=C7JGL3_ACEP3 Length = 1515 Score = 82.1 bits (200), Expect = 1e-13, Method: Composition-based stats. 
Identities = 83/572 (14%), Positives = 178/572 (31%), Gaps = 31/572 (5%) Query: 8 VLKDGTGKPVQNCTIQLKAKRNSTTVVVNTLA--SENPDEAGRYSMDVE--YGQYSVILL 63 +LKD + P+ + LK+ + + LA PD S+ + GQ V Sbjct: 731 LLKDASAAPLPGKAVALKSVITAPPELNRVLAYTGVVPDGTDGASLQAQLLPGQCLV--- 787 Query: 64 VEGFPPSHAGTITVYEDSQPGTLNDFLGAMTEDDARPEALRRFELMVEEVARNASAVAQN 123 S AG + ++ F E D+ L + ++ E + + + Q+ Sbjct: 788 ------SRAGDLWRWDG--------FYTRAGEPDSSARRLAQ-RRILRETSARIAEMEQH 832 Query: 124 TAAAKKSASDAST--SAREAATHAADAADSARAASTSAGQAASSAQSASSSAGTASTKAT 181 A++ A A T A E S S + S ++ A A Sbjct: 833 VPQAEEKAVAARTNVQAGEKQAQEQRVERSKLEQSLQKARTQESELERQHTSFRARLDAL 892 Query: 182 EASKSAAAAESSKSAAATSAGAAK----TSETNASASLQSAATSASTATTKASEAATSAR 237 + A A +++ +A +A N +L +A + A + T+ + Sbjct: 893 RPQQERALAAKAEAESALAAATTAQQAVPPAQNFQQALATAREQHTAAQKAEQDCRTALK 952 Query: 238 DAAASKEAAKSSETNASSSASSAASSATAAGNSAKAAKTSETNARSSETAAGQSASAAAG 297 A + + + +T + ++A + + + ++ Q ++A Sbjct: 953 LAEQTFQRVQQKQTQTENQHTAATTRLETLAPERLRLRQNLEAEEANILELEQRLTSAQT 1012 Query: 298 SKTAAASSASAASTSAGQASASATAAGKSAESAASSASTATTKAGEATEQASAAARSASA 357 AA + A + QA + A + A +A +T + + EQA +A Sbjct: 1013 ENATAA-ALKDAQDNLEQAQRAFQMASSAFAQAEQAAQASTQQQQKMQEQALTLRSRIAA 1071 Query: 358 AKTSETNAKASETSAESSKTAAASSASSAASSASSASASKDEATRQASAAKSSATTASTK 417 + + A+ TAA + + A++ + AA +S A+ + Sbjct: 1072 LTPRLEELQQEQQDAQDKLTAATQTEAQTAAALPQDAEETLAHLHTQRAALTSQLDATRE 1131 Query: 418 ATEAAGSATAAAQSKSTAESAATRAETAAKRAEDIASAVALEDASTTKKGIVQLSS--AT 475 + + +++ T+ AA + + S A + T + ++S A Sbjct: 1132 LRATLQAEASTLETRLTSLVAAEEEWSQRAATANAESENAAQRVETARNEHTKVSELPAE 1191 Query: 476 NSTSETLAATPKAVKSAYDNAEKRLQKDQNGADIPDKGCFLNNINAVSKTDFADKRGMRY 535 + + +++ + A ++ + Sbjct: 1192 AQRQKQQTLSALEEAEEAYAQADKIRAEAESALNAANEQRRRTEAELNTARENLLKADAK 1251 Query: 536 VRVNAPAGATSGKYYPVVVMRSAGSVSELASR 567 P + G ++E A Sbjct: 1252 SEQAQAILDQLLADTPTPPRQPTGDLTEAAES 1283 >UniRef50_C6C5D2 Tail Collar domain protein n=2 Tax=Dickeya dadantii RepID=C6C5D2_DICDC Length = 498 Score = 81.7 bits (199), Expect = 1e-13, Method: 
Composition-based stats. Identities = 46/156 (29%), Positives = 73/156 (46%), Gaps = 13/156 (8%) Query: 906 VGAPIPWPSDTVPSGYALMQGQAFDKSAYPKLAAAYPSGVIPDMRGWTIKGKPAS----- 960 VG P+PWP T P+G+ GQ+FDK+ YPKLA YPSGV+PD+RG I+G Sbjct: 335 VGIPLPWPQATAPTGWLKCNGQSFDKALYPKLATVYPSGVLPDLRGEFIRGWDDGRGVDA 394 Query: 961 GRAVLSQEQ-----DGIKSHTHSASASSTDLGTKTTSSFDYGTKSTNNTGAHT---HSVS 1012 GRA+L+ + G+ + S + + D KS + +++ Sbjct: 395 GRAILTAQNPTYLRTGMMDYNGSDVDNIGVYIGMGYAEADTAAKSISAPAGAFRAPNNID 454 Query: 1013 GSTNSAGAHTHSLANVNTASANSGAGSASTRLSVVH 1048 + ++ + + NT A+ G+ STR + Sbjct: 455 LTEQASRDNGVNGTASNTVYASEGSVWVSTRPRNIA 490 >UniRef50_Q2UB42 Predicted protein n=2 Tax=Aspergillus RepID=Q2UB42_ASPOR Length = 1429 Score = 81.7 bits (199), Expect = 2e-13, Method: Composition-based stats. Identities = 78/525 (14%), Positives = 142/525 (27%), Gaps = 8/525 (1%) Query: 51 MDVEYGQYSVILLVEGFPPSHAGTITVYEDSQPGTLNDFLGAMTEDDARPEALRRFELMV 110 + GQY G +N++L + + A Sbjct: 64 FKLAPGQYQYRFREGATGSWFHDESVKNASGTEGLVNNYLTVKSAAEQETPAQNGASEKA 123 Query: 111 EEVARNASAVAQNTAAAKKSASDASTSAREAATHAADAADSARAASTSAGQAASSAQSAS 170 EE + A + S A H A AD + A Q Sbjct: 124 EEATKEPETDASTEGGPVTNGVHKSNGVNGVAEHEAPVADESPADDQKTEQVPEKPVETD 183 Query: 171 SSAGTASTK-------ATEASKSAAAAESSKSAAATSAGAAKTSETNASASLQSAATSAS 223 + A T + E + + S + A + S+ + A S Sbjct: 184 AKAETEAEPKEALSNGTKEPEVNGTEPAAQTSESKDEPVAEEKSDPVVERPEEKVAESQP 243 Query: 224 TATTKASEAATSARDAAASKEAAKSSETNASSSASSAASSATAAGNSAKAAKTSETNARS 283 A + +E A +++E A A A + + K A E A Sbjct: 244 VAEKEGTEKKAEEVPATSTQEEAPVVNGEAKVEAEKQPEESQPQKATEKVAVKPEEPAVE 303 Query: 284 SETAAGQSASAAAGSKTAAASSASAASTSAGQASASATAAGKSAESAASSASTATTKAGE 343 T + A +A+ + AT A +AE + + Sbjct: 304 MSTPEVTAEETPAQESNTEKPTATETPAAETTTEQPATEAQPTAEEETTKEPATEPAETK 363 Query: 344 ATEQASAAARSASAAKTSETNAKASETSAESS-KTAAASSASSAASSASSASASKDEATR 402 E S A A+E + S K AA S S A+ + Sbjct: 364 QAEVVSEEPAKEEPTPEEPKEAPATEELVKESVKEEAAPEQSKETVSEKPAAEEPVKEET 423 Query: 403 QASAAKSSATTASTKATEAAGSATAAAQSKSTAESAATRAETAAKRAEDIASAVALEDAS 462 + ++SAT 
++ +A AA+ + E A ++E+ + A Sbjct: 424 TEAVKETSATEKLDESDKAPVQEETAAEESQETTKEPVKEEIAPGKSEETPAIKAPVAEE 483 Query: 463 TTKKGIVQLSSATNSTSETLAATPKAVKSAYDNAEKRLQKDQNGADIPDKGCFLNNINAV 522 + V+ +A + AA P V++ K + + + + Sbjct: 484 PAVEEPVEEKAAPEEPKDISAADPAVVEAPVKETVKEEGVSEAPKETSAEEPVKEAVKEE 543 Query: 523 SKTDFADKRGMRYVRVNAPAGATSGKYYPVVVMRSAGSVSELASR 567 + ++ V K PV + + + + Sbjct: 544 PVPEKTEQPAAPEPAVAEEPAKEPVKEEPVPEKTEEPAAPKESVK 588 >UniRef50_B6VKW8 Sc/svq protein n=2 Tax=Photorhabdus asymbiotica subsp. asymbiotica ATCC 43949 RepID=B6VKW8_PHOAA Length = 316 Score = 81.7 bits (199), Expect = 2e-13, Method: Composition-based stats. Identities = 55/205 (26%), Positives = 76/205 (37%), Gaps = 38/205 (18%) Query: 825 ADADGGVPWNAESGAYNVTRSGDSYILVNFYTGVGSCRTLQMKAHYRNGGLFYRSSRDGY 884 + V N R + + N + G T + + + G + R+ D Sbjct: 114 NNDTLAVTQKLVQEIVNSLRENINGKVPNSWRINGKALTEDINLNASDVGAYTRAEVD-- 171 Query: 885 GFEEDWAEVYTSKNLPPESYPVGAPIPWPSDTVPSGYALMQGQAFDKSAYPKLAAAYPSG 944 PVG+PIPWP P GY G AF++S YPKLA AYP+G Sbjct: 172 -----------RLIKKTSEIPVGSPIPWPLPHPPFGYVTCNGSAFNRSQYPKLAEAYPNG 220 Query: 945 VIPDMRGWTIKGKPAS-----GRAVLSQEQDGIKSHTHSASASSTDLGTKTTSSFDYGTK 999 +PD+RG I+G GR +LS ++ SA S LG+ TT G Sbjct: 221 RLPDLRGEFIRGWDDGRGADNGRKLLSWQEG---------SALSEYLGSFTT-----GVA 266 Query: 1000 STNNTGAHTHSVSGSTNSAGAHTHS 1024 + H G T H Sbjct: 267 Q------NIHQRDGVTYHDKDHKRY 285 >UniRef50_C8VCU8 Putative uncharacterized protein n=2 Tax=Emericella nidulans RepID=C8VCU8_EMENI Length = 1592 Score = 81.7 bits (199), Expect = 2e-13, Method: Composition-based stats. 
Identities = 75/473 (15%), Positives = 156/473 (32%) Query: 72 AGTITVYEDSQPGTLNDFLGAMTEDDARPEALRRFELMVEEVARNASAVAQNTAAAKKSA 131 AG + E + + T + A + VEE A K Sbjct: 630 AGDLKGEEVATATDAVKSVETTTVEPAVEAEAATEKAKVEESTTVDEVAETEVAETAKEV 689 Query: 132 SDASTSAREAATHAADAADSARAASTSAGQAASSAQSASSSAGTASTKATEASKSAAAAE 191 + E A + A+ + + A+ ++ + + E K A A Sbjct: 690 ASEEPKTEEPVAVAEAVDEPAKEVANTEPSEAAVPENPAPTEEPEKGATNEEPKPAEAVA 749 Query: 192 SSKSAAATSAGAAKTSETNASASLQSAATSASTATTKASEAATSARDAAASKEAAKSSET 251 + A A + S+ A+ +SAA A T + + A + A E K+ E Sbjct: 750 EPMTEPANVAVETEESKEATEAAAESAAEPAVAETAVENVSEAPAAEKEAVSEEPKAEEP 809 Query: 252 NASSSASSAASSATAAGNSAKAAKTSETNARSSETAAGQSASAAAGSKTAAASSASAAST 311 A++ + T SA + T +A + A A K++ + A + Sbjct: 810 IATAESPEVPGKETVVEESAPDSVTESKDAPAEVAAEASITEVPAVLKSSEEQADKAVAE 869 Query: 312 SAGQASASATAAGKSAESAASSASTATTKAGEATEQASAAARSASAAKTSETNAKASETS 371 + + +A + + A+ + A + AT ++ S + + A+E Sbjct: 870 APADTTPTAKPVESATQEPATETADAPSATEPATTESPKEPASEAPTEVPTVETVATEEV 929 Query: 372 AESSKTAAASSASSAASSASSASASKDEATRQASAAKSSATTASTKATEAAGSATAAAQS 431 E++ A AS + + S + A A+ A + Sbjct: 930 TEAAHDKPAEEQPETVDGASGKTVDAEVPGETQSTTTAEAIAAAPIEKFATEETVSKIPV 989 Query: 432 KSTAESAATRAETAAKRAEDIASAVALEDASTTKKGIVQLSSATNSTSETLAATPKAVKS 491 + ++ T E ++ ++ A+ E + + + A + +E+ ATP+A K+ Sbjct: 990 EGVSKEETTVGEPGTEKPDEAAAPEVSEVEVAEESKAPETTPAEIAPAESTNATPEASKA 1049 Query: 492 AYDNAEKRLQKDQNGADIPDKGCFLNNINAVSKTDFADKRGMRYVRVNAPAGA 544 Y AE ++ +PD+ ++ + + + + + A Sbjct: 1050 EYAPAETAEEEPPVEKQLPDETAAPASVEEQPAGESSTTDAVNTTELTSADAA 1102 >UniRef50_Q7Q4S4 AGAP000893-PA n=1 Tax=Anopheles gambiae RepID=Q7Q4S4_ANOGA Length = 2727 Score = 81.0 bits (197), Expect = 2e-13, Method: Composition-based stats. 
Identities = 55/390 (14%), Positives = 113/390 (28%), Gaps = 9/390 (2%) Query: 89 FLGAMTEDDARPEALRRFELMVEEVARNASAVAQNTAAAKKSASDASTSAREAATHAADA 148 G EDDA+PE ++ R + VA K+ A+ ST + + + A Sbjct: 1102 EEGQRGEDDAKPEDTTSGVETDQQDKRVTTVVADEVEDDKEQATTVSTISSDKQQEVSKA 1161 Query: 149 ADSARAASTSAGQAASSAQSASSSAGTASTKATEASKSAAAAESSKSAAATSAGAAKTSE 208 D +S+ A S + A T A + + S+ +E Sbjct: 1162 DDEVTTSSSVAEDTPVSVEQDEKLEQPAPTTVRTVEADEDTAVAQEQDKPQSSDD--QAE 1219 Query: 209 TNASASLQSAATSASTATTKASEAATSARDAAASKEAAKSSETNASSSASSAASSATAAG 268 + S + ++ + E A E A E + + + + T Sbjct: 1220 DDDEPSSTTVRADKPSSVEQDEEGPAQEATTARVTETATEQEESVTEFETKRKTIQTTVA 1279 Query: 269 NSAKAAKTSETNARSSETAAGQSASAAAGSKTAAASSASAASTSAGQASASATAAGKSAE 328 + + + + ++ + A S AA + E Sbjct: 1280 PTTVSEQQDDL-------EKTETTTQPAASAAAADELQEEEHQEVKLTDEPELNEDDAQE 1332 Query: 329 SAASSASTATTKAGEATEQASAAARSASAAKTSETNAKASETSAESSKTAAASSASSAAS 388 S A++ S T++ +T+ + ++ + + S+ +S+ + + Sbjct: 1333 STAATTSRPTSQEEASTQYDHVPDSTHKPSEAHDDEQPGTTVSSVTSEKESDVPVAVTVQ 1392 Query: 389 SASSASASKDEATRQASAAKSSATTASTKATEAAGSATAAAQSKSTAESAATRAETAAKR 448 +A +D + + A S TT+S +A + E + Sbjct: 1393 AAMEQEDDEDVSKTTEATAAHSTTTSSNAVDDAIIYRVDEEEDNKPIVKPTLEEEVSTSH 1452 Query: 449 AEDIASAVALEDASTTKKGIVQLSSATNST 478 A K V + Sbjct: 1453 AVQPTHDEVKPVEDAAKPQAVDAQETDDDN 1482 >UniRef50_UPI0001B5347E putative variable tail fiber protein n=1 Tax=Shigella sp. D9 RepID=UPI0001B5347E Length = 550 Score = 80.6 bits (196), Expect = 3e-13, Method: Composition-based stats. Identities = 32/79 (40%), Positives = 43/79 (54%), Gaps = 5/79 (6%) Query: 896 SKNLPPESYPVGAPIPWPSDTVPSGYALMQGQAFDKSAYPKLAAAYPSGVIPDMRGWTIK 955 + PVG P+PWPS T P+G+ G AF YPKLA YP+ +PD+RG I+ Sbjct: 382 LGLGDGSALPVGVPVPWPSATPPTGWLKCNGAAFSAEEYPKLAKVYPTNKLPDLRGEFIR 441 Query: 956 GKPAS-----GRAVLSQEQ 969 G S GR++LS + Sbjct: 442 GWDDSRGIDTGRSLLSGQA 460 >UniRef50_Q93D60 PilV n=7 Tax=Escherichia coli RepID=Q93D60_ECOLX Length = 456 Score = 80.6 bits (196), Expect = 4e-13, Method: Composition-based stats. 
Identities = 46/160 (28%), Positives = 60/160 (37%), Gaps = 28/160 (17%) Query: 822 DTYADADGGVPWNAESGAYNVTRSGD----SYILVNFYTGVGSCRTLQMKAHYRNGGLFY 877 DG + G N+T G + G G T ++K +GG Sbjct: 228 GDITSEDGWLITRNNKGLMNITHGGGFSMTDSQWIRAVNGKGITTTGEIKGGKVSGGTVR 287 Query: 878 RSSRDGYGFEEDWAEVYTSKNL------------------------PPESYPVGAPIPWP 913 R G + T+ ESYPVG+PIPWP Sbjct: 288 SDGRLSTGEYLQLDKTATAGTKCAPDGLVGRTSTGAILSCQSGMWAGFESYPVGSPIPWP 347 Query: 914 SDTVPSGYALMQGQAFDKSAYPKLAAAYPSGVIPDMRGWT 953 S T P GY +M GQ+F S YP+LA AYP +PD+R Sbjct: 348 SATPPQGYLVMNGQSFSCSRYPQLARAYPGCKLPDLRRCF 387 >UniRef50_Q65WH4 Putative uncharacterized protein n=1 Tax=Mannheimia succiniciproducens MBEL55E RepID=Q65WH4_MANSM Length = 296 Score = 79.8 bits (194), Expect = 6e-13, Method: Composition-based stats. Identities = 34/113 (30%), Positives = 52/113 (46%), Gaps = 10/113 (8%) Query: 895 TSKNLPPESYPVGAPIPWPSDTVPSGYALMQGQAFDKSAYPKLAAAYPSGVIPDMRGWTI 954 ++ +G P P+P VP G GQ F + YP+LA YPSG +PD+RG I Sbjct: 129 NEAFSTLKNLLIGIPFPYPLSAVPDGCLAFNGQTFSTTTYPELAKKYPSGRLPDLRGEFI 188 Query: 955 KGKP-----ASGRAVLSQEQDGIKSHTHSASASSTDLGTKTTSSFDYGTKSTN 1002 +G S R +L + + +HTH + + SS ++G K + Sbjct: 189 RGWDNGRGVDSSRELLRSQGAELSAHTHYVTVT-----RYANSSGEFGAKIST 236 >UniRef50_D2MH12 Tail Collar domain protein n=1 Tax=Rhodopseudomonas palustris DX-1 RepID=D2MH12_RHOPA Length = 346 Score = 79.4 bits (193), Expect = 7e-13, Method: Composition-based stats. 
Identities = 51/291 (17%), Positives = 81/291 (27%), Gaps = 57/291 (19%) Query: 828 DGGVPWNAESGAYNVTRSGDSYILVNFYTGVGSCRTLQMKAHYRNGGLFYRSSRDGYGFE 887 + SG + T G G FYR S+ + Sbjct: 92 TLKNSFPNASGPITRSLGAGYG---FAATADGDASGPAFSFGSEPGLGFYRKSQGTIAYP 148 Query: 888 EDWAEVYTSKNLPPESYPVGAPIPWPSDTVPSGYALMQGQAFDKSAYPKLAAA-----YP 942 + + P G + + T P G+ GQ + L AA Sbjct: 149 GTLRGIGS--------IPPGFILDFAGPTPPEGWLTCDGQLVSTVTFADLFAAIGYTWGG 200 Query: 943 SGV---IPDMRGWTIKGKPASGRA--VLSQEQDGIKSHTHSASASSTDLGTKTTSSFDYG 997 SG +P++ + + A V + + + I H+HSAS + D Sbjct: 201 SGGQFAVPNLVKRFRRHRGDGTVAGGVGTLQTNQIGLHSHSASMDAQGHHDHY---LDLW 257 Query: 998 TKSTNNTGAHTHSVSGSTNSAGAHTHSLANVNTASANSGAGSASTRLSVVHNQNYATSSA 1057 + N + H+H + + S G + + + AT Sbjct: 258 SSGMNRSNPHSH-------------PASGSGIGVSGGFDTGVYAPQGPLNGVSIGATD-- 302 Query: 1058 GAHTHSLSGTAASAGAHAHTVGIGAHTHSVAIGSHGHTITVNAAGNAENTV 1108 H H ++G A G H H ITV A G E Sbjct: 303 INHEHRVTGNTAGNGGHIHN------------------ITVAANGGNETRP 335 >UniRef50_Q37842 Tail fiber protein H n=1 Tax=Enterobacteria phage 186 RepID=Q37842_BP186 Length = 462 Score = 78.3 bits (190), Expect = 2e-12, Method: Composition-based stats. 
Identities = 61/237 (25%), Positives = 91/237 (38%), Gaps = 17/237 (7%) Query: 391 SSASASKDEATRQASAAKSSATTASTKATEAAGSATAAAQSKSTAESAATRAETAAKRAE 450 S+ + E A + + G ++ + S +E A + + Sbjct: 143 STMVMATQEYVDDRIAEHEKSRRHPDATLKEKGFTQLSSATDSASEVLAATPKAVKAAYD 202 Query: 451 DIASAVALEDASTTKKGIVQLSSATNSTSETLAATPKAVKSAYDNAEKRLQKDQNGADIP 510 + +DAST +KGIV+LSSA +STSE AATPKAVK A DNA RL KD+NGADIP Sbjct: 203 LANAKYTAQDASTAQKGIVRLSSAADSTSEAEAATPKAVKIAMDNANARLAKDRNGADIP 262 Query: 511 DKGCFLNNINAVSKTDFADKRGMRY-VRVNAPAGATSGKYYPVVVMRSAGSVSELASRVI 569 + F+ NI D A + +N ++ GS+ Sbjct: 263 NPPLFVQNIGLKPTVDKAANAVDKNGDTMNGNLTLKGDYRLSFIIQNEDGSIRA------ 316 Query: 570 ITTATRTAGDPMNNCEFNGFVMPGGWTDRGRYAYGMFWQYQNNERAIHSIMMSNKGD 626 + +G + G G + +G Q+ +H GD Sbjct: 317 ---------YIFKDKGGDGIRISNGDDGGGDFVFGKNGQFYC-PDIMHVGNTIVWGD 363 >UniRef50_Q6D2U8 Bacteriophage tail fiber protein n=1 Tax=Pectobacterium atrosepticum RepID=Q6D2U8_ERWCT Length = 619 Score = 77.9 bits (189), Expect = 2e-12, Method: Composition-based stats. Identities = 31/83 (37%), Positives = 46/83 (55%), Gaps = 5/83 (6%) Query: 901 PESYPVGAPIPWPSDTVPSGYALMQGQAFDKSAYPKLAAAYPSGVIPDMRGWTIKGKPAS 960 P S G P+P+P P+GY GQ FD + +P LA+ YPSG +PD+RG ++G Sbjct: 459 PTSELAGIPLPFPGAVAPAGYLKCNGQQFDTAQFPVLASRYPSGFLPDLRGEFVRGWDDG 518 Query: 961 G-----RAVLSQEQDGIKSHTHS 978 RA++S + D I++ S Sbjct: 519 RGIDTVRALMSAQGDAIRNIVGS 541 Score = 47.1 bits (109), Expect = 0.004, Method: Composition-based stats. 
Identities = 46/244 (18%), Positives = 71/244 (29%), Gaps = 1/244 (0%) Query: 278 ETNARSSETAAGQSASAAAGSKTAAASSASAASTSAGQASASATAAGKSAESAASSASTA 337 ET+ G S A + A T+ AA A S A Sbjct: 18 ETSDPVVAGPGGISNRQAEQLASRTAYLKKMQETTGESLQKH-IAASDPHSQYAPKNSPA 76 Query: 338 TTKAGEATEQASAAARSASAAKTSETNAKASETSAESSKTAAASSASSAASSASSASASK 397 T A A A + A +A A+ + + ++A + S + Sbjct: 77 LTGTPTAPTTAQTANNTQIATTAFVKSAIAALINGSPAALDTLQELANALGNDPHFSTTI 136 Query: 398 DEATRQASAAKSSATTASTKATEAAGSATAAAQSKSTAESAATRAETAAKRAEDIASAVA 457 A ++ A +A A T A A + + + +A Sbjct: 137 LNAIADVKTDSTNKLNAHASILDAHPQYAPKASPALTGTPTAPTAASGSNDMQLATTAFV 196 Query: 458 LEDASTTKKGIVQLSSATNSTSETLAATPKAVKSAYDNAEKRLQKDQNGADIPDKGCFLN 517 + G + L P + + +L K+QNGADIPDK F Sbjct: 197 KTAVAALVNGSPAALDTLQELANALGNDPNFSTTVLNALAGKLAKNQNGADIPDKSQFRQ 256 Query: 518 NINA 521 NI Sbjct: 257 NIGL 260 >UniRef50_D0KGE1 Shufflon domain protein n=1 Tax=Pectobacterium wasabiae WPP163 RepID=D0KGE1_PECWW Length = 532 Score = 77.5 bits (188), Expect = 3e-12, Method: Composition-based stats. Identities = 32/118 (27%), Positives = 53/118 (44%), Gaps = 9/118 (7%) Query: 906 VGAPIPWPSDTVPSGYALMQGQAFDKSAYPKLAAAYPSGVIPDMRGWTIKGKPASG---- 961 G W + P G+ + GQ F+ S P LA+ YPS +PD RG+ +G Sbjct: 392 PGTITMWGTPVPPEGWLELNGQPFNPSGNPVLASLYPSSQVPDFRGYFPRGWDNGAGIDP 451 Query: 962 --RAVLSQEQDGIKSHTHSAS---ASSTDLGTKTTSSFDYGTKSTNNTGAHTHSVSGS 1014 RA+LS + D I++ T + +S+ G ++ + Y + S + A + S Sbjct: 452 DSRAILSVQGDAIRNITGEFNPGGSSNWGKGVFSSYGWPYPSNSGSANDASIITFDAS 509 >UniRef50_B9BDD9 Bacteriophage protein n=3 Tax=Burkholderia RepID=B9BDD9_9BURK Length = 536 Score = 77.1 bits (187), Expect = 3e-12, Method: Composition-based stats. 
Identities = 76/428 (17%), Positives = 132/428 (30%), Gaps = 82/428 (19%) Query: 751 YGGWNTLRPFAIDNATGELVIGTKLSASLNGNALTATKLQTPRRVSGVEFDGSKDITLTA 810 +G L P A + G + + T + +A A + ++ L+ Sbjct: 131 FGPATFLNPPATTDRKGVVELATTEEVAAGTDATRAVT------PATLKPRLDAKANLSG 184 Query: 811 AH-VAAFARRATDTYADADGGVPWNAESGAYNVTRSGDSYILVNFYTGVGSCRTL----- 864 A + R A A GG +G + + + + + + G+G T+ Sbjct: 185 ADFTGRISTRDVLHLASAPGGTGAILSAGNGDGASASTTNVALRSWYGIGFAPTIDGMPV 244 Query: 865 --QMKAHYRNGGLFYRSSRDGYGFEEDWAEVYTSKNLPPESYP-------------VGAP 909 +H+ + R + P +G Sbjct: 245 PRTEFSHWFDTRTGNTGFRGTLDVGGLITAQTPPSGDASKRVPTTEWVVAAIASAGIGTI 304 Query: 910 IPWPSDTVPSGYALMQGQAFDKSAYPKLAAAYPSG------------------------- 944 + P +V +G+ + G ++S YP L AY Sbjct: 305 VFEPRTSVRAGFLKLNGALVNRSDYPAL-WAYAQASGALVAESAWGQNNWGCFSTGDGAT 363 Query: 945 --VIPDMRGWTIKGKPA-----SGRAVLSQEQDGIKSHTHSASASSTDLGTKTTSSFDYG 997 +P++RG ++ S R + + + H H AS+++ T Sbjct: 364 TFRLPELRGEFLRCWDDGRGADSARGIGTFQSFQNAWHAHGASSAAVGDHTHGA------ 417 Query: 998 TKSTNNTGAHTHSVSGSTNSAGAHTHSLANVNTASANSGAGSASTRLSVVHNQNYATSSA 1057 T A G + + + G + T + + A Sbjct: 418 -----WTDAQGWHGHHGWTGGGGGHNHNNGIFSRLLRPPYGGSLTGSDQAGSGSEQAVGA 472 Query: 1058 GAHTHSLSGTAASAGAHAH---TVGIGAHTHSVAIG---SHGHTITVNAAGNAENTVKNI 1111 G S A +G HAH T G G H+H+V IG +H H ITVN G E +NI Sbjct: 473 GD-----SADIAWSGDHAHEFNTEGSGTHSHNVGIGGAGAHAHAITVNGDGGNEARPRNI 527 Query: 1112 AFNYIVRL 1119 A ++R Sbjct: 528 AMLAMIRA 535 >UniRef50_D0KGE5 Tail Collar domain protein n=4 Tax=Pectobacterium RepID=D0KGE5_PECWW Length = 157 Score = 76.7 bits (186), Expect = 5e-12, Method: Composition-based stats. 
Identities = 30/108 (27%), Positives = 47/108 (43%), Gaps = 6/108 (5%) Query: 906 VGAPIPWPSDTVPSGYALMQGQAFDKSAYPKLAAAYPSGVIPDMRGWTIKGKPASG---- 961 G W + P G+ + GQ F+ S P LA+ YPS +PD RG+ +G Sbjct: 17 PGTITMWGTPVPPEGWLELNGQPFNPSGNPVLASLYPSSQVPDFRGYFPRGWDNGAGIDP 76 Query: 962 --RAVLSQEQDGIKSHTHSASASSTDLGTKTTSSFDYGTKSTNNTGAH 1007 RA+LS + D I++ T + + K S +N+ A+ Sbjct: 77 DSRAILSVQGDAIRNITGEFNPGGSSNWGKGVFSSYGWPYPSNSGSAN 124 >UniRef50_UPI000180CCCC PREDICTED: similar to zymogen granule membrane glycoprotein 2 n=1 Tax=Ciona intestinalis RepID=UPI000180CCCC Length = 1639 Score = 76.3 bits (185), Expect = 6e-12, Method: Composition-based stats. Identities = 77/494 (15%), Positives = 175/494 (35%), Gaps = 7/494 (1%) Query: 107 ELMVEEVARNASAVAQNTAAAKKSASDASTSAREAATHAADAADSARAASTSAGQAASSA 166 E + + T ++ S A+T++ A A S S + ++++ Sbjct: 705 ESVAATTSSTTGESVATTTSSTTDESVAATTSSTADESVATTTSSTTDESVATTTSSTTD 764 Query: 167 QSASSSAGTASTKATEASKSAAAAESSKSAAATSAGAAKTSETNASASLQSAATSASTAT 226 +S + + + ++ S+ +GA TS T + + +++ + Sbjct: 765 ESQQQQQQRLLQQMNLLQQQHESVATTTSSTTDESGAPTTSSTTDESVATTTSSTTDESV 824 Query: 227 TKASEAATSARDAAASKEAAKSSETNASSSASSAASSATAAGNSAKAAKTSETNARSSET 286 + + T A + S +SS + + + T + + ++ + T++ + E+ Sbjct: 825 ATTTSSTTDESVATTTSSTTDESVATTTSSTTDESVATTTSSTTNESVVATTTSSTTDES 884 Query: 287 AAGQSASAAAGSKTAAASSASAASTSAGQASASATAAGKSAESAASSASTATTKAGEATE 346 A ++S S SSA+ S + +S + + + SA + TT + Sbjct: 885 VATTTSSTTDESVATTTSSATDESVATTTSSTTDESVATTTSSATDESVATTTSSATDES 944 Query: 347 QASAAARSASAAKTSETNAKASETSAESSKTAAASSASSAASSASSASASKDEATRQASA 406 A+ + + + + T++ E+ A ++ + S ++ SSA+ + + Sbjct: 945 VATTTSSTTDESVATTTSSATDESVATTTSSTTDESVATTTSSATD-ESVATTTSSATDE 1003 Query: 407 AKSSATTASTKATEAAGSATAAAQSKSTAESAATRAETAAKRAEDIASAVALEDASTTKK 466 + ++ T+++T + A +++A +S +T S+AT A + +VA +STT + Sbjct: 1004 SVATTTSSATDESVATTTSSATDESVATTTSSATDESVATTTSSATDESVATTTSSTTDE 1063 Query: 467 GIVQLSSATNSTSETLAATPKAVKSAYDNAEK----RLQKDQNGADIPDKGCFLNNINAV 522 + +S+T S + +S + K + V Sbjct: 1064 
SVATTTSSTTDESVATKVSSSTTQSTSKTDKPIIRGGSGKTTTDSTNKTDSTPTKTKPGV 1123 Query: 523 SKTDFADKRGMRYVRVNAPAGATSGKYYPVVVMRSA--GSVSELASRVIITTATRTAGDP 580 + + + + + T + V VI TTA G Sbjct: 1124 NDSPPTTHATKQPSADASFSTTTPFSPLEPTEPLTTTYPVVVTGDVEVIPTTAVNQIGSG 1183 Query: 581 MNNCEFNGFVMPGG 594 NG GG Sbjct: 1184 EGPTSDNGTTSNGG 1197 >UniRef50_A1Z8Q2 CG13185, isoform B n=4 Tax=Drosophila RepID=A1Z8Q2_DROME Length = 5303 Score = 76.3 bits (185), Expect = 7e-12, Method: Composition-based stats. Identities = 42/347 (12%), Positives = 98/347 (28%), Gaps = 15/347 (4%) Query: 72 AGTITVYEDSQPGTLNDFLGAM-TEDDARPEALRRFELMVEEVARNASAVAQNTAAAKKS 130 G ED + G + EDD++PE + + ++ Sbjct: 4671 EGEEATPEDEKDEAETQKRGELEDEDDSKPED------------SPEDSKEEKEEKREEK 4718 Query: 131 ASDASTSAREAATHAADAADSARAASTSAGQAASSAQSASSSAGTASTKATEASKSAAAA 190 + S S +A+ + S+SA Q + T K Sbjct: 4719 PEEHSQSKDKASKEENVQSMPETDQSSSADQVQQPQDPDIKQDQKLDEQETGEEKDGVGQ 4778 Query: 191 ESSKSAAATSAGAAKTSETNASASLQSAATSASTATTKASEAATSARDAAASKEAAKSSE 250 + + G A+T ET + ++ + + S +A +K + Sbjct: 4779 AENDADDGGHQGVAETQETVSQEDRKNERQTQEKRKQGRTNEERSLGEAEQNKLKQLKTI 4838 Query: 251 TNASSSASSAASSATAAGNSAKAAKTSETNARSSETAAGQSASAAAGSKTAAASSASAAS 310 S S + + + + + + + A ++ Sbjct: 4839 DQLKDSKESDDAEQEKPEPMDQTEAEEYQHVKEPKNSDKTTLDNATEEQSKKIQHQEDEP 4898 Query: 311 TSAGQASASATAAGKSAESAASSASTATTKAGEATEQASAAARSASAAKTSETNAKASET 370 + + A AE A E+ +S +KT ++ + Sbjct: 4899 PNEEEIEAENVDELMEAEEPAVDPEDDAELEQLGAEKTE--QKSDKPSKTEKSKEQLETP 4956 Query: 371 SAESSKTAAASSASSAASSASSASASKDEATRQASAAKSSATTASTK 417 + + + SS ++A ++ + ++S A+ ++ + Sbjct: 4957 EGMEIEGEVVLTMTVPRSSETTAHSNSEILLDKSSHAEDLSSAEQIE 5003 >UniRef50_Q179R7 Putative uncharacterized protein n=1 Tax=Aedes aegypti RepID=Q179R7_AEDAE Length = 3217 Score = 76.0 bits (184), Expect = 9e-12, Method: Composition-based stats. 
Identities = 39/460 (8%), Positives = 100/460 (21%), Gaps = 1/460 (0%) Query: 81 SQPGTLNDFLGAMTEDDARPEALRRFELMVEEVARNASAVAQNTAAAKKSASDASTSARE 140 + D EDD +PE + + + + + ++ + + R Sbjct: 2411 ASIAPEQDEEQKPVEDDKKPELDEADKEPTDVPVEHDAEEQKPAVEPVEADEEEPAATRI 2470 Query: 141 AATHAADAADSARAASTSAGQAASSAQSASSSAGTASTKATEASKSAAAAESSKSAAATS 200 A + A A + + +SA + +T A+ + +A Sbjct: 2471 PEVEADEEEKPAVAVESDEEEKPASATESDEQTP-VTTMASAMDVVEEDEKVKPMPSADE 2529 Query: 201 AGAAKTSETNASASLQSAATSASTATTKASEAATSARDAAASKEAAKSSETNASSSASSA 260 + + A + + +T A T+ + + + E + A Sbjct: 2530 TEKDEMTPVEADEAEKPVSTDAPAVETEKPAVSMDEEEEEEKPIESDEEEHKPAVEPVQA 2589 Query: 261 ASSATAAGNSAKAAKTSETNARSSETAAGQSASAAAGSKTAAASSASAASTSAGQASASA 320 + + ++ A + + AA Sbjct: 2590 DDEEEEKPAQEPVDAEHDEEEKPAQEPVEADEEVAVTTASPAADEMKPEVEEEKPTLYKE 2649 Query: 321 TAAGKSAESAASSASTATTKAGEATEQASAAARSASAAKTSETNAKASETSAESSKTAAA 380 + ++ + A + T A + + E K Sbjct: 2650 EEPATETSVKDDEQDKPIDVESDEEDKPAEATTVEQEQQQPTTMATPEKEADEEQKPLMP 2709 Query: 381 SSASSAASSASSASASKDEATRQASAAKSSATTASTKATEAAGSATAAAQSKSTAESAAT 440 + A A + A + A + A + + Sbjct: 2710 VESDEEDKPAPEADDQTKPSEADEEAKATEAPIKEADEEDKPAEVQPTADEEQEVKPTPA 2769 Query: 441 RAETAAKRAEDIASAVALEDASTTKKGIVQLSSATNSTSETLAATPKAVKSAYDNAEKRL 500 E ++ ++ + + P + + Sbjct: 2770 SDEVQTPELDEKIEEEPVKSDDAAPATTAATLADDSEKESDETGMPTVDEVPQTPVQPTE 2829 Query: 501 QKDQNGADIPDKGCFLNNINAVSKTDFADKRGMRYVRVNA 540 ++ + V + V A Sbjct: 2830 SDEEQHEEATPAATPAEEPKPVEADEEQKPEMAEPVTTAA 2869 >UniRef50_Q2HAR4 Putative uncharacterized protein n=1 Tax=Chaetomium globosum RepID=Q2HAR4_CHAGB Length = 2795 Score = 76.0 bits (184), Expect = 9e-12, Method: Composition-based stats. 
Identities = 65/468 (13%), Positives = 133/468 (28%), Gaps = 10/468 (2%) Query: 83 PGTLNDFLGAMTEDDARPEALRRFELMVEEVARNASAVAQNTAAAKKSASDASTSAREAA 142 + +TED P+A E ++ + S + A Sbjct: 321 TAHVAAATDKLTEDTGAPDAESAGAQKPLEADEPTPEPKSEAEPESEAEALQENSQKNPA 380 Query: 143 THAADAADSARA-----ASTSAGQAASSAQSASSSAGTASTKATEASKSAAAAESSKSAA 197 A + A + A + + + + A + A Sbjct: 381 AEPEVAKTEEATGGDEKEQEARSTAPEPTVEETLVAEVETVDSARSEQPAGTESEPVAEA 440 Query: 198 ATSAGAAKTSETNASASLQSAATSASTATTKASEAATSARDAAASKEAAKSSETNASSSA 257 +K ++ + + AA T + A A + + +ET + + Sbjct: 441 GNEPAESKREPSSEADEDELAADVKETLAPELVAEAQEEPTPEAVESLSPDAETAVNQAP 500 Query: 258 SSAASSATAAGNSAKAAKTSETNARSSETAAGQSASAAAGSKTAAASSASAASTSAGQAS 317 +S ++ T + +++++ E + A S A + + ++A A A Sbjct: 501 ASESTQDTDSKDNSESVTEIEEKLAAEPEFAEDSKDEPAAAIVTSDNAAEDAPAPEEAAE 560 Query: 318 ASATAAGKSAESAASSASTATTKAGE-----ATEQASAAARSASAAKTSETNAKASETSA 372 A ++ E+ + + AGE A ++ K + + A Sbjct: 561 ADEKPTEEATEATSVEETVKEEPAGENRPVFGAISAEKSSEPEPETKDEASAETETVEPA 620 Query: 373 ESSKTAAASSASSAASSASSASASKDEATRQASAAKSSATTASTKATEAAGSATAAAQSK 432 +KT AS ++ ++ +A S +A + A A Sbjct: 621 IEAKTETASDPDTSVDITVEDIPTEKVDELKAGDTTKVLAPESAVVEVSAVADPAPAGEA 680 Query: 433 STAESAATRAETAAKRAEDIASAVALEDASTTKKGIVQLSSATNSTSETLAATPKAVKSA 492 + A A K + D +K ++ + ++A P K+ Sbjct: 681 KSEPEAPVSAARDTKVEGIAETEKPEADEELAEKPGDRVEEEGDQEIRSVAEAPTESKAE 740 Query: 493 YDNAEKRLQKDQNGADIPDKGCFLNNINAVSKTDFADKRGMRYVRVNA 540 D AEK + D AD + + A T N Sbjct: 741 SDTAEKPAETDSAQADSAEAAKPDVEVEATPATGEETANSEEDDNGNE 788 >UniRef50_C8PDQ5 Phage Tail Collar Domain protein n=1 Tax=Campylobacter gracilis RM3268 RepID=C8PDQ5_9PROT Length = 391 Score = 75.6 bits (183), Expect = 1e-11, Method: Composition-based stats. 
Identities = 45/225 (20%), Positives = 75/225 (33%), Gaps = 26/225 (11%) Query: 774 KLSASLNGNALTATKLQTPRRVSGVEFDGSKDITLTAAHVAAFARRATDTYADADGGVPW 833 K+ A + N + T + S + + L AA + + A Sbjct: 101 KIEALIKSNLIDDTAPKATATYSSEKIGELLQLKLNKTDQAADSAKLGGVVASDFMKKSE 160 Query: 834 NAESGAYNVT-------RSGDSYILVNFYTGVGSCRTLQMKAHYRNGGLFYRSSRDGYGF 886 + + + V + +Q +N + +R G Sbjct: 161 YSPTSNSLANTLVLRDANGDFAGKYVTAGHFKLTAP-VQNNIFSKNNEILFRV---GAAD 216 Query: 887 EEDWAEVYTSKNLPPESYPVGAPIPWPSDTVPSGYALMQGQAFDKSAYPKLAA----AYP 942 +++ + L PVG I P G+ L G A +SAY L + AY Sbjct: 217 NDNYTRAVSFSLLSSTILPVGTIITSARTPAPDGFLLCNGAAISRSAYTDLFSAIGTAYG 276 Query: 943 SG------VIPDMRGWTIKGKP-----ASGRAVLSQEQDGIKSHT 976 +G IPD+RG I+G GRA+ S + D I++ T Sbjct: 277 AGDGSSSFNIPDLRGEFIRGADNGRGVDGGRALGSAQGDAIRNIT 321 >UniRef50_B7NJP1 Putative side tail fiber protein homolog from lambdoid prophage n=3 Tax=Escherichia coli RepID=B7NJP1_ECO7I Length = 686 Score = 75.6 bits (183), Expect = 1e-11, Method: Composition-based stats. Identities = 119/620 (19%), Positives = 195/620 (31%), Gaps = 37/620 (5%) Query: 384 SSAASSASSASASKDEATRQASAAKSSATTASTKATEAAGSATAAAQSKSTAESAATRAE 443 A+ + A + A ++ + + S T AA K+ + A Sbjct: 46 KKKTEEAAQSLAEHVRSRNHPDATLTAKGFTQLSSATNSTSETLAATPKAVKAAYDLAAG 105 Query: 444 TAAKRAEDI-ASAVALEDASTTKKGIVQLSSATNSTSETLAATPKAVKSAYDNAEKRLQK 502 A + + AS T KG VQLSSAT+S SET AATPKAVK AYD A + Sbjct: 106 KAPASHTHPWSQITGVPAASLTAKGTVQLSSATDSQSETEAATPKAVKIAYDLARGKYTA 165 Query: 503 DQNGADIPDKGCFLNNINAVSKTDFADKRGMRYVRVNAPAGATSGKYYPVVVMRSAGSVS 562 + IN S+T A + ++ A A + +P + + S Sbjct: 166 QDATTTRKGIVQLSSAINNTSETLAATPKAVKAAYDLAAGKAPASHTHPWSQITGVPAAS 225 Query: 563 ELASRVIITTATRTAGDPMNNCEFNGFVMPGGWTDRGRYAYGMFWQYQNNERAIHSIMMS 622 A +++ + + AY + + A + Sbjct: 226 LTAK----------GTVQLSSATDSQSETEAATPKAVKIAYDLARGKYTAQDATTTRKGI 275 Query: 623 NKGDDLRSVFYVDGAAFPVFAFIEDGLSISAPGADLVVNDTTYKFGATNPATECIAADVI 682 + + AA P L+ A + PA A + Sbjct: 276 VQLSSAINNTSETLAATPKAVKAAYDLAAGKAPASHTHPWSQI---TGVPAASLTAKGTV 332 Query: 683 
LDFKSGRGFYESHSLIVNDNLSCKKLFATDEIVARGGNQIRMIGGEYGALWRNDGAKTYL 742 + E+ + + L A V+ ++ G +L + Sbjct: 333 QLSSATDSQSETEAATPKAVKAAYDLAAGKAPVSHTHPWSQITGVPAASLTAKGTVQLSS 392 Query: 743 LLTNQGDVYGGWNTLRPFAIDNATGELVIGTKLSASLNGNALTATKLQTPRRVSGVEFDG 802 +Q + A D A G+ + T + + Sbjct: 393 ATDSQSETEAATPKAVKAAYDLAAGKAPVSH------THPWSQITGVPAASLTAKGTVQL 446 Query: 803 SKDITLTAAHVAAFARRATDTYADADGGVPWNAESGAYNVTRSGDSYILVNFYTGVGSCR 862 S I T+ +AA + Y A+G P +A A + + + Sbjct: 447 SSAINSTSEILAATPKAVKAAYDLANGKQPADATLTALAGLATAADRLPYFTGADRAALA 506 Query: 863 TLQMKAHYRNGGLFYRSSRDGYGFEEDWAEVYTSKNLPPESYPVGAPIPWPSDTVPSGYA 922 TL R + + PVG P+PWP+ T P G+ Sbjct: 507 TLTA------------IGRAIIAKGSIKDVLNYLGLGEGSALPVGVPVPWPTATPPEGWL 554 Query: 923 LMQGQAFDKSAYPKLAAAYPSGVIPDMRGWTIKGKPAS-----GRAVLSQEQDGIKSHTH 977 G+AF K YP LA AYP+ +PD+RG I+G GR +LS ++ + Sbjct: 555 KCDGRAFTKEQYPVLARAYPTLRLPDLRGEFIRGWDDGRKIDEGRKLLSWQKGTLVGGHD 614 Query: 978 SASASSTDLGTKTTSSFDYG 997 ++ ++ DYG Sbjct: 615 DNDSALDISYMSNGNNIDYG 634 >UniRef50_Q9WXA5 Tail fiber n=2 Tax=Pectobacterium carotovorum RepID=Q9WXA5_ERWCA Length = 667 Score = 75.6 bits (183), Expect = 1e-11, Method: Composition-based stats. Identities = 35/114 (30%), Positives = 53/114 (46%), Gaps = 11/114 (9%) Query: 800 FDGSKDITLTAAHVAAFAR----------RATDTYADADGGVPWNAESGAYNVTRSGDSY 849 D TAA + A A+ ++ D + WN+ +GAY GD+ Sbjct: 412 LDYDTANKPTAADIGAIAKTDADNNYVRQGSSGVVYKND-DLAWNSPTGAYLKDNGGDAS 470 Query: 850 ILVNFYTGVGSCRTLQMKAHYRNGGLFYRSSRDGYGFEEDWAEVYTSKNLPPES 903 ++ + GS Q +Y NGGL YRSSRD GFE+ WA +Y+ ++ P + Sbjct: 471 LIWHIGLNTGSTSAAQFHFNYANGGLKYRSSRDSLGFEKPWARIYSDQDKPTAA 524 Score = 46.7 bits (108), Expect = 0.006, Method: Composition-based stats. 
Identities = 89/552 (16%), Positives = 158/552 (28%), Gaps = 15/552 (2%) Query: 275 KTSETNARSSETAAGQSASAAAGSKTAAASSASAASTSAGQASASATAAGKSAESAASSA 334 +TS+ + + + A A + +A+A + A A + Sbjct: 18 ETSDPVVGGPDGVSNRQAKELANRTRYLKKEQEKTGSDLATHAAAADPHTQYAPKANPTF 77 Query: 335 STATTKAGEATEQASAAARSASAAKTSETNAKASETSAESSKTAAASSASSAASSASSAS 394 + AT+ S + + K+ A + + + + + +S A S+S Sbjct: 78 TGTPKAPTPATDNNSQQVATTAFVKSVAATKLAKDQNGADIQDRELLNRNLGSSRAYSSS 137 Query: 395 ASKDEATRQASAAKSSATTASTKATEAAGSATAAAQSKSTAESAATRAETAAKRAEDIAS 454 + + A+ S A A A + S + + + A + Sbjct: 138 IPIGGSAGSWTTAEFIGWLESQAAFVHAYWACRGSWSYTHNKIISDTECGQIPLAGSVVE 197 Query: 455 AVALEDASTTKKGIVQLSSATNSTSETLAATPKAVKSAY------DNAEKRLQKDQNGAD 508 + DA+T + + A S S T Y D K + Sbjct: 198 VMGQHDATTIRITTPSTTPAGLSDSANAQFTYIYNGVDYSPGWRRDYNTKNKPTAADVGA 257 Query: 509 IPDKGCFLNNINAVSKTDFADKRGMRYVRVNAPAGATSGKYYPVVVMRSAGSVSELASRV 568 +P+K + N + PV +A + +A ++ Sbjct: 258 LPEKAVAQAAAKLATPRTI--NGVPFDGTANIALTPANIGALPVAGTAAAETKLAVARKI 315 Query: 569 IITTATRTAGDPMNNCE-FNGFVMPGGWTDRGRYAYGMFWQYQNNERAIHSIMMSNKGDD 627 TA +N+ F+ + G D Y + N +A Sbjct: 316 AGVAFDGTADIDVNSQGIFSASLSIGNAVDLNTYTTPGLYHQAVNAQAASGKNYPEAQAG 375 Query: 628 LRSVFYVDGAAFPVFAFIEDGLSISAPGADLVVNDTTYKFGATNPATECIAADVILDFKS 687 V G + + + + N T + Sbjct: 376 SLEVLKHAGITQVYRIYN-NSRCYKRTQYSGAWSAWVLDYDTANKPTAADIGAIAKTDAD 434 Query: 688 GRGFYESHSLIVNDNLSCKKLFATDEIVARGGNQIRMIGGEYGALWRNDGAKTYLLLTNQ 747 + S +V N T + G +I A+ + N Sbjct: 435 NNYVRQGSSGVVYKNDDLAWNSPTGAYLKDNGGDASLIWHIGLNTGSTSAAQFHFNYANG 494 Query: 748 GDVYGGWNTLRPFAIDNA-----TGELVIGTKLSASLNGNALTATKLQTPRRVSGVEFDG 802 G Y F A + + G A+ ATKL TPR+++GVEFDG Sbjct: 495 GLKYRSSRDSLGFEKPWARIYSDQDKPTAADIGALPAAGTAVAATKLATPRKINGVEFDG 554 Query: 803 SKDITLTAAHVA 814 SKDITLT A++ Sbjct: 555 SKDITLTPANLG 566 >UniRef50_A8DYB0 CG13185, isoform C n=9 Tax=Drosophila RepID=A8DYB0_DROME Length = 5547 Score = 74.0 bits (179), Expect = 3e-11, Method: Composition-based stats. 
Identities = 41/356 (11%), Positives = 97/356 (27%), Gaps = 12/356 (3%) Query: 72 AGTITVYEDSQPGTLNDFLGAM-TEDDARPEALRRFELMVEEVARNASAVA-QNTAAAKK 129 G ED + G + EDD++PE +E R + Sbjct: 4894 EGEEATPEDEKDEAETQKRGELEDEDDSKPEDSPEDSKEEKEEKREEKPEEHSQSKDKAS 4953 Query: 130 SASDASTSAREAATHAADAADSARAASTSAGQAASSAQSASSSAGTASTKA--------T 181 + + + +AD + Q ++ G + T Sbjct: 4954 KEENVQSMPETDQSSSADQVQQPQDPDIKQDQKLDEQETGEEKDGVGQAENDVSFSFHCT 5013 Query: 182 EASKSAAAAESSKSAAATSAGAAKTSETNASASLQSAATSASTATTKASEAATSARDAAA 241 S++ G A+T ET + ++ + + S +A Sbjct: 5014 LYIYIILIYPISQADDGGHQGVAETQETVSQEDRKNERQTQEKRKQGRTNEERSLGEAEQ 5073 Query: 242 SKEAAKSSETNASSSASSAASSATAAGNSAKAAKTSETNARSSETAAGQSASAAAGSKTA 301 +K + S S + + + + + + + A ++ Sbjct: 5074 NKLKQLKTIDQLKDSKESDDAEQEKPEPMDQTEAEEYQHVKEPKNSDKTTLDNATEEQSK 5133 Query: 302 AASSASAASTSAGQASASATAAGKSAESAASSASTATTKAGEATEQASAAARSASAAKTS 361 + + A AE A E+ +S +KT Sbjct: 5134 KIQHQEDEPPNEEEIEAENVDELMEAEEPAVDPEDDAELEQLGAEKTE--QKSDKPSKTE 5191 Query: 362 ETNAKASETSAESSKTAAASSASSAASSASSASASKDEATRQASAAKSSATTASTK 417 ++ + + + + SS ++A ++ + ++S A+ ++ + Sbjct: 5192 KSKEQLETPEGMEIEGEVVLTMTVPRSSETTAHSNSEILLDKSSHAEDLSSAEQIE 5247 >UniRef50_Q5Z1P8 Putative uncharacterized protein n=1 Tax=Nocardia farcinica RepID=Q5Z1P8_NOCFA Length = 2348 Score = 73.3 bits (177), Expect = 5e-11, Method: Composition-based stats. 
Identities = 63/403 (15%), Positives = 121/403 (30%) Query: 81 SQPGTLNDFLGAMTEDDARPEALRRFELMVEEVARNASAVAQNTAAAKKSASDASTSARE 140 PG + G T D P A Q+T A S A Sbjct: 415 EAPGQTDPASGTRTADSTAPGATAPAAAAAPGTESAGGPTPQSTQPAGSVGDVPSAQADA 474 Query: 141 AATHAADAADSARAASTSAGQAASSAQSASSSAGTASTKATEASKSAAAAESSKSAAATS 200 T A A A+ A+ + S + A T S + +++ Sbjct: 475 GPTTEAAPTPDADTATDGGSPPAADTRRESDTGDEAPGHQTADSTTPTPRDTAADTPTGE 534 Query: 201 AGAAKTSETNASASLQSAATSASTATTKASEAATSARDAAASKEAAKSSETNASSSASSA 260 A + + + + T + ++ A T AA + A + + A Sbjct: 535 ARDSTGTPGQGERAGVTDRNPLDTRASASTAATTGTPGAATAGGTALPNGALPGAGIPVA 594 Query: 261 ASSATAAGNSAKAAKTSETNARSSETAAGQSASAAAGSKTAAASSASAASTSAGQASASA 320 ++ A + SS TA S G+ T+ +S ++++ + + Sbjct: 595 TAAPGAGAPTPGPGTAPVGGTSSSGTATPPGTSTQPGTSTSTNTSTQPGTSTSPGTATPS 654 Query: 321 TAAGKSAESAASSASTATTKAGEATEQASAAARSASAAKTSETNAKASETSAESSKTAAA 380 + SAA+ + +T G+ A+ A + + A S A Sbjct: 655 GTPTPAGTSAATGTANQSTAPGDGAPSATVAQPQPTPTADARPPASPGTAPGASIAPTHA 714 Query: 381 SSASSAASSASSASASKDEATRQASAAKSSATTASTKATEAAGSATAAAQSKSTAESAAT 440 S A+ +++ SA+ A + + A +A A + + A + Sbjct: 715 SPATVDGHLSAAPSAATHAAPATVDSHAPGTPGTAPTADGRVDAAPGATSAPAAGPPAGS 774 Query: 441 RAETAAKRAEDIASAVALEDASTTKKGIVQLSSATNSTSETLA 483 ET+ +A A + +V + + + A Sbjct: 775 ITETSPAAPASAGAARAGIVDGSVAPPVVAVQARSTGVDVPAA 817 >UniRef50_B3R3K1 Bacteriophage large tail fiber protein n=1 Tax=Cupriavidus taiwanensis RepID=B3R3K1_CUPTR Length = 1045 Score = 71.7 bits (173), Expect = 1e-10, Method: Composition-based stats. 
Identities = 137/935 (14%), Positives = 256/935 (27%), Gaps = 94/935 (10%) Query: 249 SETNASSSASSAASSATAAGNSAKAAKTSETNARSSETAAGQSASAAAGSKTAAASSASA 308 + + T A ++ SS TA + + + + Sbjct: 140 PPATTTVKGVVELADNTETQIGTDATRSVTPAGLSSRTATDTRTGLVELATNSETQTGTD 199 Query: 309 ASTSAGQASASATAAGKSAESAASSASTATTKAGEATEQASAAARSASAAKTSETNAKAS 368 A+ S A S+ A ++ A+ G +A A + + K + Sbjct: 200 ATRSVTPAGLSSRTATETRTGLLEIATQTEVDQGTDDARAVTAKKLLARLKLLGFGGVGT 259 Query: 369 ETSAESSKTAAASS-----ASSAASSASSASASKDEATRQASAAKSSATTASTKATEAAG 423 E + A +A + + +A + S+ Sbjct: 260 EGAITDMDDATVPGGLLYVPGAAINVPLNLGPGVALHRPYGTAGFQLFSPYSSDRIVFRR 319 Query: 424 SATAAAQSKSTAESAATRAETAAKRAEDIASAVALEDASTTKKGIVQLSSATNSTSETLA 483 + A Q+ + A + A++T+ + ++ S+ TL Sbjct: 320 RTSNAWQAWKELALLDSPTFVGTPLVPTAAKGTTTKQAASTEFVMAAIADLVASSPGTLD 379 Query: 484 ATPKAVKSAYDNAEKRLQKDQNGADIPDKGCFLNNINAVSKTDFADKRGMRYVRVNAPAG 543 + ++ ++ + DK + I + A + Sbjct: 380 TLKELAEALGNDPNFATTVTTQLGNKFDKAGGV--IGLQTSGWPAINAPTARIIDGGNTQ 437 Query: 544 ATSGKYYPVVVMRSAGSVSELASRVIITTATRTAGDPMNNCEFNGFVMPGGWTDRGRYAY 603 A +G + + V + A A + + F+ PG + + ++ Sbjct: 438 AANGPSGGLAIESYGPGVQLIDRSAGARNAQLMADGSILSVSFDTTAAPGSYAYQFLFSA 497 Query: 604 GMFWQYQN--NERAIHSIMMSNKGDDLRSVFYVDGAAFPVFAFIEDGLSISAPGADLVVN 661 + + A + + G + + F A S P Sbjct: 498 DGYMAVGGPLSSAAAIRMGLPIPGGAVSQHGIYNNVEFNETATSTGATYSSIPRVKDAAF 557 Query: 662 DTTYKFGATNPATECIAADVILDFKSGRGFYESHSLIVNDNLSCKKLFATDEIVARGGNQ 721 G P A+ + ++ + I+ + ++ A + G+ Sbjct: 558 TMASLVGFFAPTPVVGASATVTEYAGLMANDVTLPNILRKIGARLRMGAGPDKWNLYGDG 617 Query: 722 IRMIGGEYGALWRNDGAKTYLLLTNQGDVYGGWNTLRPFAIDNATGELVIGTKLSASLNG 781 G L A LL G R D A G + + G Sbjct: 618 T-ASNHLAGKLLLGTTADNGNLLQVAGSASIATRLYRGLDADLAVG-ATSIAGIQNAATG 675 Query: 782 NALTATKLQTPRRVSGVEFDGSKDITLTAAHVAAFARRATDTYAD--ADGGVPWNAESGA 839 N + + K +T A + + GV N + Sbjct: 676 NDSYINTARFSNDTQPAGINIGKSRGVTVTTQGAVLSGDPLGHVNFCGSDGVGMNVAAQV 735 Query: 840 YNVTRSGDSY------ILVNFYTGVGSCRTLQMKAHYRNGGLFYRSSRDGYGFEEDWAEV 893 V + + + R ++M+ ++ DG G + Sbjct: 736 
EVVASENYTTTARGAHMDFRTTAPGTTARAVKMRLADNGELRIGNTATDGSGAKLQVTGY 795 Query: 894 YTSKNLPPES-----------------YPVGAPIPWPSDTVPSGYALMQGQAFDKSAYPK 936 T+ P VG I P T +G + G ++ YP+ Sbjct: 796 ATADTPPAGDSSRKLATTAWVMSTLLTASVGQIIIEPRTTARAGCLKLNGALLKRADYPE 855 Query: 937 LAAAYPSG---------------------------VIPDMRGWTIKGKPAS-----GRAV 964 L AY IP+ RG ++ + GR + Sbjct: 856 L-WAYAQASGAIVTDAAWLAGSWGCFSHGDGNTTFRIPEYRGEYLRFWDDARGADAGRGI 914 Query: 965 LSQEQDGIKSHTHSASASSTDLGTKTTSSFDYGTKSTNNTGAHTHSVSGSTNSAGAHTHS 1024 + K+H+H+ASA+ T+ G H H V + H HS Sbjct: 915 GVFQDSQNKTHSHAASATPVGDHNHG--------AWTDAQGWHGHGV-----NDPGHAHS 961 Query: 1025 LANVNTASANSGAGSASTRLSVVHNQNYATSSAGAHTHSLSGTAASAGAHAHTVGIGAHT 1084 A + + ++ AG + A G+HAH VG+G Sbjct: 962 FQTWTGGGATGAGRVSGSYVTNADAW------AGTSASYTGISIAGDGSHAHNVGVG--- 1012 Query: 1085 HSVAIGSHGHTITVNAAGNAENTVKNIAFNYIVRL 1119 G+H H ITVNA G AE V+NI+ ++R Sbjct: 1013 ---YAGNHSHAITVNADGGAEVRVRNISALAMIRA 1044 >UniRef50_Q1XI26 PvLEA1 protein n=1 Tax=Polypedilum vanderplanki RepID=Q1XI26_9DIPT Length = 742 Score = 71.3 bits (172), Expect = 2e-10, Method: Composition-based stats. 
Identities = 46/421 (10%), Positives = 109/421 (25%), Gaps = 25/421 (5%) Query: 10 KDGTGKPVQNCTIQLKAKRNSTTVVVNTLASENPDEAGRYSMDVEYGQYSVILLVEGFPP 69 KD G+ ++N ++ + + + D+ G+ Sbjct: 234 KDAAGEKMENAKEKIIQVKEAAKDKIGHAVDVTTDKLGQAKDATAEKLVQAKDATAEKLG 293 Query: 70 SHAGTITVY------EDSQPGTLNDFLGAMTEDDARPEALRRFELMVEEV---------- 113 +A +T E ++ ++ + D E L + + Sbjct: 294 -YAKDVTAEKLGLAAEKTKETLVDAKDTIVEAKDTTKEKLGHAADVTADKLGHAKDVTAD 352 Query: 114 --ARNASAVAQNTAAAKKSASDASTSAREAATHAADAADSARAASTSAGQAASSAQSASS 171 + A + AK + D A++ A + + + Sbjct: 353 KLGQAAEKTKETLVDAKDATKDKLVQAKDVTADKLGHAKDVTKDKLAQAADKTKETLVET 412 Query: 172 SAGTASTKATEASKSAAAAESSKSAAATSAGAAKT-SETNASASLQSAATSASTATTKAS 230 TA A K+ +K A G AK + + + + A Sbjct: 413 KDKTADKLGQAADKTKEKLVEAKDVTADKLGHAKDVTADKLGRAAEKTKETLVDAKDTTK 472 Query: 231 EAATSARDAAASKEAAKSSETNASSSASSAASSATAAGNSAKAAKTSETNARSSETAAGQ 290 + A+D A K + +T + + + K + +++ G Sbjct: 473 DKLAYAKDVTADKLNYAADKTKEKLVDAKDTTKDKLGYAADKTKEKLADAKDTTKDKFGD 532 Query: 291 SASAAAGSKTAAASSASAASTSAGQ-----ASASATAAGKSAESAASSASTATTKAGEAT 345 + A A + A + A+A G + ++ A K E Sbjct: 533 AKEATKDKYEDAKQKMAETKDKAKEKFFEAKDATADKLGNAKDATKDKLGYAADKTKEKY 592 Query: 346 EQASAAARSASAAKTSETNAKASETSAESSKTAAASSASSAASSASSASASKDEATRQAS 405 ++A A + + ++ + + + + + + Sbjct: 593 DEAKDATKDKLGYAKDKLVETKDAAKDKTKEKYEEAKDKFGQARDVTKERWDETKDAAKN 652 Query: 406 A 406 Sbjct: 653 K 653 >UniRef50_A1VSH6 Phage Tail Collar domain protein n=1 Tax=Polaromonas naphthalenivorans CJ2 RepID=A1VSH6_POLNA Length = 483 Score = 70.9 bits (171), Expect = 3e-10, Method: Composition-based stats. 
Identities = 48/217 (22%), Positives = 78/217 (35%), Gaps = 52/217 (23%) Query: 906 VGAPIPWPSDTVPSGYALMQGQAFDKSAYPKLAAA----------YPSGVIPDMRGWTIK 955 G T P G+ G ++AY L AA + + +PD+RG I+ Sbjct: 302 PGHINYTARSTAPPGWLKANGAGISRTAYAALFAAIGTTFGVGDGFNTFNLPDLRGEFIR 361 Query: 956 GKPASGRAVLSQEQDGIKSHTHSASASSTDLGTKTTSSFDYGTKSTNNTGAHTHSVSGST 1015 G GR V S G+ T +H H+ GST Sbjct: 362 GWDD-GRGV--------------------------DGSRSLGSSQAGETASHGHT--GST 392 Query: 1016 NSAGAHTHSLANVNTASANSGAGSASTRLSVVHNQNYATSSAGAHTHSLSGTAASAGAHA 1075 ++AG H H + + + + G +T L+ + N A + + A G Sbjct: 393 SAAGIHAHGVNDPGHSHQVTQEGGRNTSLAYQNGPNSAFRGEVSTLLETTRNATGIGISE 452 Query: 1076 HTVGIGAHTHSVAIGSHGHTITVNAAGNAENTVKNIA 1112 + G+H HT+T++A G +E +N+A Sbjct: 453 N-------------GNHSHTVTISATGGSETRPRNLA 476 >UniRef50_Q29NV9 GA17619 n=5 Tax=Eukaryota RepID=Q29NV9_DROPS Length = 5605 Score = 70.6 bits (170), Expect = 3e-10, Method: Composition-based stats. Identities = 64/483 (13%), Positives = 141/483 (29%), Gaps = 36/483 (7%) Query: 86 LNDFLGAMTEDDARPEALRRFELMVEEVARNASAVAQNTAAAKKSASDASTSAREAATHA 145 ++ + ED+ PEA V + A +A++ + +A + E A Sbjct: 1477 VSKIPRSANEDEKDPEAGDETVDSVPDSAGDAASTPSRRDRKRSTARSRRNANSEDGGSA 1536 Query: 146 ADAADSARAASTSAGQAASSAQSASSSAGTASTKATEASKSAAAAESSKSAAATSAGAAK 205 S A + + S + S + S ++ K Sbjct: 1537 RRNRGSLSAKALKKRRNRGRIVPESDGEDDTMDRTPPPSPPPDSELDSNKRRSSRNTQRK 1596 Query: 206 TSETN-------------ASASLQSAATSASTATTKASEAATSARDAAASKEAAKSSETN 252 + ++ ++ S++ A S A + A A + + Sbjct: 1597 KYVDDVMLRFSDDENSLLVASPVKKDKKSSAAANAANSNAGSDVEKAEPQSGAEGEAGNS 1656 Query: 253 ASSSASSAASSATAAGNSAKAAKTSETNARSSETAAGQSASAAAGSKTAAA--------- 303 A + + ++ A ++ ++ A + + SA+A + Sbjct: 1657 AGEGEKPSLAMDESSQLEASSSTSAAAAAAAEKERQSSGESASAAMSSKPNYVYINTGDE 1716 Query: 304 -------------SSASAASTSAGQASASATAAGKSAESAASSASTATTKAGEATEQASA 350 + + K+A+S S T+ TE + Sbjct: 1717 DSMVVQLVLAMRMGKRELIPEVPKEKTPEPKVEDKAADSEDSKEKNKETEDKPKTEVETE 1776 Query: 351 AARSASAAKTSETNAKASETSAESSKTAAASSASSAASSASSASASKDEATRQASAAKSS 410 A + +K E K S+E + S S +++ + + 
Sbjct: 1777 AESKDAESKLEEKKPKPETESSEMEVDDTKEEPLEKEETEKSEEESSEKSDEEKMEVDET 1836 Query: 411 ATTASTKATEA-AGSATAAAQSKSTAESAATRAETAAKRAEDIASAVALEDASTTKKGIV 469 T + K+ E A +K+ E AAT A ++ + + +++ T+ K V Sbjct: 1837 TETTAVKSPEETKSEAEDEEAAKTDEEEAATVAPEEEEKESETENEDEAKESETSDKMEV 1896 Query: 470 QLSSATNSTSETLAATPKAVKSAYDNAEKRLQKDQNGADIPDKGCFLNNINAVSKTDFAD 529 + ++ + S AA ++ + + +A ++ A Sbjct: 1897 EENAGDSPKSPKPAAAATEEETVDTKTSVKSPSSPKPEGTEETKATTEKESAEKESAAAP 1956 Query: 530 KRG 532 K G Sbjct: 1957 KEG 1959 >UniRef50_A7T1V7 Predicted protein n=4 Tax=cellular organisms RepID=A7T1V7_NEMVE Length = 2040 Score = 69.4 bits (167), Expect = 7e-10, Method: Composition-based stats. Identities = 65/581 (11%), Positives = 139/581 (23%), Gaps = 20/581 (3%) Query: 31 TTVVVNTLASENPDEAGRYSMDVEYGQYSVILLVEGFPPSHAGTITVYEDSQPGTLNDFL 90 +T V NT+A P G V+ G G P S+ N Sbjct: 626 STRVPNTIAPGLPGTEGSEG-TVQPGN-QPGSQPNGQPASNQSGNQPGSQPGSQPGNPPG 683 Query: 91 GAMTEDDARPEALRRFELM------VEEVARNASAVAQNTAAAKKSASDASTSAREAATH 144 + ++ N A + + AS A + + Sbjct: 684 SQPGSQPGSQPESQPGNQPGSQPNGRKQAQINQEASQETNREINQKASQMVKRANQPGSQ 743 Query: 145 AADAADSARAASTSAGQAASSAQSASSSAGTASTKATEASKSAAAAESSKSAAATSAGAA 204 + + + A+ S S ++ A S+ + Sbjct: 744 PESQPGNQPGSQPNGQAGANQPGSQPGSQPGNQPGNQPNGQAGANQPGSQPESQPGNQPG 803 Query: 205 KTSETNASASLQSAATSASTATTKASEAATSARDAAASKEAAKSSETNASSSASSAASSA 264 A A S ++ S + A S + + + + Sbjct: 804 SQPNGQAG-----ANQPGSQPGSQPGNQPGSQPNGQAGANQPGSQPGSQPGNQPGSQPNG 858 Query: 265 TAAGNSAKAAKTSETNARSSETAAGQSASAAAGSKTAAASSASAASTSAGQASASATAAG 324 A N + S+ + GQ+ + GS+ + S GQA A+ + Sbjct: 859 QAGANQPGSQPGSQPGNQPGSQPNGQAGANQPGSQPGSQPGNQPGSQPNGQAGANQPGSQ 918 Query: 325 KSAESAASSASTATTKAGEATEQASAAARSASAAKTSETNAKASETSAESSKTAAASSAS 384 ++ S +AG + ++ + + + + + Sbjct: 919 PGSQPGNQPGSQPNGQAGANQPGSQPGSQPGNQPGSQPNGQAGANQPGSQPGSQPGNQPG 978 Query: 385 SAASSASSASASKDEATRQASAAKSSATTASTKATEAAGSATAAAQSKSTAESAATRAET 444 S + + A+ + Q S A + + +Q + ES Sbjct: 979 SQPNGQAGANQPGSQPGSQPGNQPGSQPNGQAGANQPG--SQPGSQPGNQPESQPNGQAG 1036 Query: 445 
AAKRAEDIASAVALEDASTTKKGIVQLSSATNSTSETLAATPKAVKSAYDNAEKRLQKDQ 504 A + S + + Q S N + + + ++ Sbjct: 1037 ANQPGSQPGSQPGNQ---PGSQPGSQPGSQPNGQAGANQPGSQPGSQPGSQPNGQAGANR 1093 Query: 505 NGADIPDKGCFLNNINAVSKTDFADKRGMRYVRVNAPAGATSGKYYPVVVMRSAGSVSEL 564 G+ + + ++ G + + + S + Sbjct: 1094 PGS--QPGSQPGRQPGSQPGSQPGNQPGSQPGNQPESQPGSQIGNQKGIQSGSQPGIQPN 1151 Query: 565 ASRVIITTATRTAGDPMNNCEFNGFVMPGGWTDRGRYAYGM 605 + ++ P N P G G Sbjct: 1152 GQPGVNQPGSQPGNQPGNQPGGQAGPAPAGTQSGSSNQIGY 1192 >UniRef50_Q4ABH1 Muscle-specific protein 300, isoform D n=33 Tax=cellular organisms RepID=Q4ABH1_DROME Length = 12345 Score = 68.6 bits (165), Expect = 1e-09, Method: Composition-based stats. Identities = 73/640 (11%), Positives = 159/640 (24%), Gaps = 36/640 (5%) Query: 31 TTVVVNTLASENPDEAGRYSMDVEYGQYSVILLVEGFPPSHAGTITVYEDSQPGTLNDFL 90 VV T P D + ++ I E T DF Sbjct: 9030 PVVVATTSPVHVPTADVVEPKDSSPTSTTAAVVDVEAVVEDINEIWPLEHHLKPTNIDFS 9089 Query: 91 GAMTEDDARPEALRRFELMVEEVARNASAVAQNTAAAKKSASDASTS-AREAATHAADAA 149 + E A E + + + S + ++ + + Sbjct: 9090 QHVEELAAPAAVTAE-----TEASMPVEEIWPTSPETGNSLTLEQYEFEPQSPHEESTKS 9144 Query: 150 DSARAASTSAGQAASSAQSASSSAGTASTKATEASKSAAAAESSKSAAATSAGAAKTSET 209 D + T A + ++ TK T S+ A + Sbjct: 9145 DLVKPQETEPQVVAETKPEGITTGSITITKTTTTITSSTEVPEETLVQNVPADEQQPPAN 9204 Query: 210 NASASLQSAATSASTATTKASEAATSARDAAASKEAAKSSET----NASSSASSAASSAT 265 +QS + T E +++ A+ +++ E + S+ + Sbjct: 9205 KIKTDIQSFLEAEQTLAAALKEQSSTPTGASVAEDVQTQPEEIVLEERTVEISTIKTEEN 9264 Query: 266 AAGNSAKAAKTSETNARSSETAAGQSASAAAGSKTAAASSASAASTSAGQASASATAAGK 325 S + + A +T AA + Sbjct: 9265 QQEPVIVEEVKSLPVEPEPVEPELEEVAIAIVEQTEEKPEEPVIEKQPASGPIDLRAATQ 9324 Query: 326 SAESAASSASTATTKAGEATEQASAAARSASAAKTSETNAKASETSAESSKTAAASSASS 385 S ++ASTA K + + + + + E +A + + + Sbjct: 9325 LFISGEAAASTAPQKTFQISAPSLEDNGAGVLKVVLGKESTNEEDTAAPTTGKVSMTIIE 9384 Query: 386 AASSASSASASKDEATRQASAAKSSATTASTKATEAAGSATAAAQSKSTAESAATRAETA 445 A++ ++ + + + ++ + +A + S + + Sbjct: 9385 TAAAPAADAKRRRKKKKRRDTKHEEELEQEQETEPEPVAAVKEPEVSSDVPVSPEDSPRD 9444 
Query: 446 AKRAEDIASAVALEDASTTK---------KGIVQLSSATNSTSETLAATPKAVKSAYDNA 496 R E I D S+ + +V S + T P V Sbjct: 9445 TVRHESIVEISPDSDLSSIEIDTKVKIVEDAVVSSPSESPRTPMVELVIPTEVVELALVE 9504 Query: 497 EKRLQKDQNGADIPDKGCFLNNINAVSKTD--------FADKRGMR-YVRVNAPAGATSG 547 ++ Q +K +I +V + A + + T Sbjct: 9505 DEEQQTTPRIPSPTEKSEVEQDIKSVQTSPQHQPKLDETAVQTSLEVQPDNQENESQTLI 9564 Query: 548 KYYPVVVMRSAGSVSELASRVIITTA---TRTAGDPMNNCEFNGFVMPGGWTDRGRYAYG 604 ++ E + V I+T T +G P E + ++ Sbjct: 9565 VEITETEAQTTPRSEEQSVAVEISTTEIQTDVSGQPAETVEISSQTTVTTTIEKEL---- 9620 Query: 605 MFWQYQNNERAIHSIMMSNKGDDLRSVFYVDGAAFPVFAF 644 +++ RA + ++ + PV Sbjct: 9621 -QTTPKDSPRAPEAGSSDVVESLVQDLVKDMTTDLPVRTS 9659 >UniRef50_Q7SBR0 Putative uncharacterized protein n=1 Tax=Neurospora crassa RepID=Q7SBR0_NEUCR Length = 1353 Score = 68.2 bits (164), Expect = 2e-09, Method: Composition-based stats. Identities = 81/471 (17%), Positives = 153/471 (32%), Gaps = 2/471 (0%) Query: 74 TITVYEDSQPGTLNDFLGAMTEDDARPEALRRFELMVEEVARNASAVAQNTAAAKKSASD 133 IT ++ T +TED A + + ++ + + + T + S + Sbjct: 494 DITQQTTAEESTQQTTQEDVTEDAPEVAATKDEVDVKDDTTPHETDLKDETHPMEASKEE 553 Query: 134 ASTSAREAATHAADAADSARAASTSAGQAASSAQSASSSAGTASTKATEASKSAAAAESS 193 T +A HA D + + ++A ++S A KA E A + Sbjct: 554 EPTQKEDAEEHAVDETAQEEQPTKESAPEETTATGEATSEEVAREKADEDETHAKTYAAV 613 Query: 194 KSAAATSAGAAKTSETNASASLQSAATSASTATTKASEAATSARDAAASKEAAKSSETNA 253 A + + + +E A A +A +E + A D A ++AA + Sbjct: 614 AHEALEAETSPEVTEAAPEAEAPQAEETAPATEAAPAEETSPAVDVAPVEKAAPVEDDVP 673 Query: 254 SSSASSAASS--ATAAGNSAKAAKTSETNARSSETAAGQSASAAAGSKTAAASSASAAST 311 A+S + + A NSA+ A SE A E AA + + A T + Sbjct: 674 VEKATSLSEENVSVDATNSAEEAAPSEIPADVVEEAALAAEATPAEETTPVTEATRVEEA 733 Query: 312 SAGQASASATAAGKSAESAASSASTATTKAGEATEQASAAARSASAAKTSETNAKASETS 371 + + S A+ E+ +A + E A + AA + + Sbjct: 734 APVEESTHASEDAPVKEAPLEAAPVSEELPAEKAASTEETAPALDAAPAETAAVEEVTPA 793 Query: 372 AESSKTAAASSASSAASSASSASASKDEATRQASAAKSSATTASTKATEAAGSATAAAQS 431 E+ A + + A+ A + + + + +A T E + AAQS Sbjct: 794 
KEAEPVQATYAEVAQATDAEEPAHVEKTEPVEEATVDETAPTEEAATVEEEAANVPAAQS 853 Query: 432 KSTAESAATRAETAAKRAEDIASAVALEDASTTKKGIVQLSSATNSTSETLAATPKAVKS 491 + A +A + E + A A T + + T E + +A K+ Sbjct: 854 EPVANAAKEIFARSEPVTESQSGAATPTFARTAAEVADSAALLDEGTPEDRVSDEEAGKT 913 Query: 492 AYDNAEKRLQKDQNGADIPDKGCFLNNINAVSKTDFADKRGMRYVRVNAPA 542 + + N ++TD ++ + PA Sbjct: 914 GFRRLSATPITEVADTAAEVADSAKYLDNEATETDKSEIPTPAEDGSHNPA 964 >UniRef50_Q4WP03 Chromatin modification-related protein vid21 n=10 Tax=Trichocomaceae RepID=VID21_ASPFU Length = 1467 Score = 67.9 bits (163), Expect = 2e-09, Method: Composition-based stats. Identities = 42/379 (11%), Positives = 100/379 (26%), Gaps = 4/379 (1%) Query: 50 SMDVEYGQYSVILLVEGFPPSHAGTITVYEDSQPGTLNDFLGAMTEDDARPEALRRFELM 109 S + GQ + ++ S + + P + Sbjct: 122 SFQQQEGQTAQVIEQRLVEDSLKKDDARIHQQPALATTSEIPQVANSPTTPAQPDQLPQK 181 Query: 110 VEEVARNASAVAQNTAAAKKSASDASTSAREAATHAADAADSARAASTSAGQAASSAQSA 169 + + + + S S A + A + ++ ++ S A Sbjct: 182 ASFAVPDTPPTSTSHESVDASVSPALKGLPPSEVAPPRAVSNQLPSAQRKPESRPSLVLA 241 Query: 170 SSSAGTASTKATEASK--SAAAAESSKSAAATSAGAAKTSETNASASLQS--AATSASTA 225 S + A+ A + A + S + A + S + Sbjct: 242 QPSEDQPLSPASSAGPYSNNTPAPVAVSPDTSPAEEVTEGADEVALSPKRVGPVQLQPGL 301 Query: 226 TTKASEAATSARDAAASKEAAKSSETNASSSASSAASSATAAGNSAKAAKTSETNARSSE 285 + A + ++ A +S++ + +S+ S+ + + +++ ++ Sbjct: 302 VPSTPDEQLQLEAAQSLQQNALASKSIGDVTTASSLSNEVIKEDVGPTPSAAADSSKETQ 361 Query: 286 TAAGQSASAAAGSKTAAASSASAASTSAGQASASATAAGKSAESAASSASTATTKAGEAT 345 S A + A S Q+ T + AE +++T K A Sbjct: 362 DQTSTSVEAPEPKRPDGVVVAPEESQPPAQSIQEETQSQVGAEVKVVASTTPAGKKPTAA 421 Query: 346 EQASAAARSASAAKTSETNAKASETSAESSKTAAASSASSAASSASSASASKDEATRQAS 405 A + +S S + ++ A + + + Sbjct: 422 AVLPAQPERMTTRVSSGAIRHKSVSEILGETPKPSAVQPEKAHAIEKPADMVRAPASASP 481 Query: 406 AAKSSATTASTKATEAAGS 424 + + KA E S Sbjct: 482 ESAAKMRLKDRKAREKERS 500 >UniRef50_Q0A8Q8 Ribonuclease E n=8 Tax=Gammaproteobacteria RepID=Q0A8Q8_ALHEH Length = 1073 Score = 67.9 bits (163), Expect = 2e-09, Method: Composition-based stats. 
Identities = 63/455 (13%), Positives = 109/455 (23%), Gaps = 1/455 (0%) Query: 97 DARPEALRRFELMVEEVARNASAVAQNTAAAKKSASDASTSAREAATHAADAADSARAAS 156 + P A R + N ++ A +A+ A Sbjct: 600 EKAPAASGRRGSDQKGATNGRRGGGSNGGKGGRTRQKARPEDGRSASQPEATGKPESDAR 659 Query: 157 TSAGQAASSAQSASSSAGTASTKATEASKSAAAAESSKSAAATSAGAAKTSETNASASLQ 216 A + A A + S + + +AA +T +A Sbjct: 660 RDADETAEGAGRSGRSRRGRRGGRRRRRSGSTGGGQPEESAARQDQKGQTPPAAEAADDA 719 Query: 217 SAATSASTATTKASEAATSARDAAASKEAAKSSETNASSSASSAASSATAAGNSAKAAKT 276 +A A A AAA K AA + + + + Sbjct: 720 PRDRAAGKAEAAAERPRE-GDRAAAEKPAASEKVKADKPALPTITEEEIQGSATPQLKDW 778 Query: 277 SETNARSSETAAGQSASAAAGSKTAAASSASAASTSAGQASASATAAGKSAESAASSAST 336 ++E A+ +SA A S TAA S + + A A+ A +++ Sbjct: 779 PPRGQVATEAASPESAPADDESGTAAGSPKATQAQRPATADTEASDTKADAPASSGETPA 838 Query: 337 ATTKAGEATEQASAAARSASAAKTSETNAKASETSAESSKTAAASSASSAASSASSASAS 396 A + E + S S + + S Sbjct: 839 DQPTGAPPAAGGDGAKAPTAEKAAGEKPGARRKRSKPSVQPGTPPVLPQDIGPDEPSVRS 898 Query: 397 KDEATRQASAAKSSATTASTKATEAAGSATAAAQSKSTAESAATRAETAAKRAEDIASAV 456 A+ + + + A A T A A A S Sbjct: 899 GRPRISAAAPTEGNEADTAGSADVAPREPTPANTGTDQPPKPAPEPAPAKAPEPAPGSDS 958 Query: 457 ALEDASTTKKGIVQLSSATNSTSETLAATPKAVKSAYDNAEKRLQKDQNGADIPDKGCFL 516 A ++ +A + + ATP + E D PD Sbjct: 959 EPAAADQDAAPAEEVPNAADDGAAGPQATPWLEHEPESSREAIGAHSAGTPDAPDAQKAP 1018 Query: 517 NNINAVSKTDFADKRGMRYVRVNAPAGATSGKYYP 551 + + T + A S + P Sbjct: 1019 ADTADAADTAGDEGGDQTQETDEASRKGGSRRKGP 1053 >UniRef50_UPI000186F3DC Titin, putative n=1 Tax=Pediculus humanus corporis RepID=UPI000186F3DC Length = 10733 Score = 67.5 bits (162), Expect = 3e-09, Method: Composition-based stats. 
Identities = 57/514 (11%), Positives = 124/514 (24%), Gaps = 10/514 (1%) Query: 67 FPPSHAGTITVYEDSQPGTLNDFLGAMTEDDARPEALRRFELMVEEVARNASAVAQNTAA 126 P S + V D +P + + EDD +PE + +++ + +N + Sbjct: 6738 KPESESPVEQVKPDEKPEIVEE---KPKEDDKKPEEPVKKLKTLKKPDEKITPETENFES 6794 Query: 127 AKKSASDASTSAREAATHAADAADSARAASTSAGQAASSAQSASSSAGTASTKATEASKS 186 K + ++ + + ++ T + K Sbjct: 6795 LKSKLKPVKDTRETETEVVQKPNETLDDEKKKLKKVPETNETEPKEDSPLKTDTHKKPKQ 6854 Query: 187 AAAAESSKSAAATSAGAAKTSETNASASLQSAATSASTATTKASEAATSARDAAASKEAA 246 ++++ T T + T TK + S K+ Sbjct: 6855 KPNETTNENELETIKLKPTTQPKPKKPKEDDESQQPDTEKTKHTHTKLSRDTTQTHKQRT 6914 Query: 247 KSSETNASSSASSAASSATAAGNSAKAAKTSETNARSSETAAGQSASAAAGSKTAAASSA 306 ++ E + SS + + S S + S T + K + Sbjct: 6915 ETIENELNVLKSSNDTLSKTEIISEMGESVSLSVDVSEITPKESTDKIEDKEKLSKPDEL 6974 Query: 307 SAASTSAGQASASATAAGKSAESAASSASTATTKAGEATEQASAAARSASAAKTSETNAK 366 + + K E + A E + K Sbjct: 6975 KEKPEKILKKPEEKPKDEDDKKDKPKVVKKKPAKKEEEEKVPEVAGDKPEEKPKYEQDKK 7034 Query: 367 ASETSAESSKTAAASSASSAASSASSASASKDEATRQASAAKSSATTASTKATEAAGSAT 426 AE T S K + TK E Sbjct: 7035 DKSEVAEKKFTKDEEKVPEVVDVKSEEKPEKPKDEEDKKGKPKEIKKKPTKKEEEKVPEV 7094 Query: 427 AAAQSKSTAESAATRAETAAKRAEDIASAVALEDASTTKKGIVQLSSATNSTSETLAATP 486 A + E +++ + E ++ + ++ + Sbjct: 7095 AGDKPDEKPEEKPEKSKDEQDKKEKPTLVEKKPTKKEEEEKVTEIEELKSEEKPEEKLEK 7154 Query: 487 KAVKSAYDNAEKRLQKDQNGADIPDKGCFLNNINAVSKTDFADKRGMRYVRVNAPAGATS 546 + K + +K K ++ + I + +K + + + P A Sbjct: 7155 TSEKPKDEGDKKDKPKLGKKKPTQNEDEKIPEITSRNKPEEKLQE-----ILEKPKDADD 7209 Query: 547 GKYYPVVVMRSAGSVSELASRVIITTATRTAGDP 580 + P + + + I T T+ P Sbjct: 7210 KQDTPETLKEK--PTKQADEKGIEVTETQPGEKP 7241 >UniRef50_C9KJG4 Side tail fiber protein from lambdoid prophage Rac n=1 Tax=Mitsuokella multacida DSM 20544 RepID=C9KJG4_9FIRM Length = 932 Score = 66.7 bits (160), Expect = 5e-09, Method: Composition-based stats. 
Identities = 164/896 (18%), Positives = 304/896 (33%), Gaps = 55/896 (6%) Query: 92 AMTEDDARPEALRRFELMVEEVARNASAVAQNTAAAKKSASDASTSAREAATHAADAADS 151 + ++ + + ++E + + A + +T +++ A A Sbjct: 31 SAPDNSGIYDVIAEDLRWLKENIDDVKDTSDLEAIKQSVTDMYNTMKNDSSFGEATAKAQ 90 Query: 152 ARAASTSAGQAASSAQSASSSAGTASTKATEASKSAAAAESSKSAAATSAGAAKTSETNA 211 A A A A SA +A + +TK+TE + + A +S A + KT E + Sbjct: 91 AEEAKKQAQAALESATNAKTYYDDITTKSTEVNNTIAEIKSYIEKAEALNESNKTLEQSI 150 Query: 212 SASLQSAATSASTATTKASEAATSARDAAASKEAAKSSETNASSSASSAASSATAAGNSA 271 S S A A +A + A+ AATS +A AS+ AK+SETNA S ++AA S + A Sbjct: 151 SDSATVATNKAKSAASSATNAATSETNAKASETKAKASETNAKVSETNAAKSESNAKAHM 210 Query: 272 KAAKTSETNARSSETAAGQSASAAAGSKTAAASSASAASTSAGQASASATAAGKSAESAA 331 A +SE+ A S + A S+ A+ +S + A TS A + + SA+ A Sbjct: 211 DAT-------ATSESNAKTSETNAKASQAASKTSETNAKTSETNAKQYSINSSNSADLAK 263 Query: 332 SSASTATTKAGEATEQASAAARSASAAKTSETNAKASETSAESSKTAAASSASSAASSAS 391 + A ++ + ++ + ++KT +K SA +S+T A +S ++A +S + Sbjct: 264 AWAESSDSP--DSVNDTDSTTGKTQSSKTWAIYSKDRAISAFTSETHAKTSETNAKTSET 321 Query: 392 SASASKDEATRQASAAKSSATTASTKATEAAGSATAAAQSKSTAESAATRAETAAKRAED 451 +A+ S + A+A+ +SA A+T AT A S T AA S S A+++ T A+T+ A+ Sbjct: 322 NAANSATNSASSATASANSAEEAATSATNAKTSETNAATSASNAKTSETNAKTSETNAK- 380 Query: 452 IASAVALEDASTTKKGIVQLSSATNSTSETLAATPKAVKSAYDNAEKRLQKDQNGADIPD 511 + +A+T++ E+ A V A +AEK + + D Sbjct: 381 ----ASETNAATSEGNTKGYMEKAQVAYESAKAIQSVVDVAKADAEKCVADVEAVRDSLA 436 Query: 512 KGCFLNNINAVSKTDFADKRGMRYVRVNAPAGATSGKYYPVVVMRSAGSVSELASRVIIT 571 K + + +D + Y A T G ++ + L V ++ Sbjct: 437 K-MMTYQGSVDNYSDLPANPQVGYSYNVKNADKTHGVNAGDNLVWNGTDWDNLGGTVDMS 495 Query: 572 TATRTAGDPMNNCEFNGFVMPGGWTDRGRYAYGMFWQYQNNERAIHSIMMSNKGDDLRSV 631 D N V +T + N I + + GD Sbjct: 496 LFAELGKDVRFNA-----VTATTFTGDLKGTADKATN-DKNGADIAATYLKKTGDTATGK 549 Query: 632 FYVDGAAFPVFAFIEDGLSISAPGADLVVNDTTYKFGATNPATECIAADVILDFKSGRGF 691 + ++ + G + KF D++ Sbjct: 550 ITFNSTDLNALPEVKKTSDDNMSGIRFS---SKSKFLGAVGKRLSNGEDLL--------- 597 Query: 692 
YESHSLIVNDNLSCKKLFATDEIVARGGNQIRMIGGEYGALWRNDGAKTYLLLTNQGDVY 751 NL + +R N GA T Sbjct: 598 ----------NLRSDNTTTDIVLDSRNYNNFAPTNTGSGASGTWGINIAGNAATATKLAT 647 Query: 752 GGWNTLRPFAIDNATGELVIGTKLSASLNGNALTATKLQTPRRVSGVEFDGSKDITLTAA 811 L A +AT + ++A+++ +A Q + + + I+ A Sbjct: 648 ARTIALSGNASGSATFDGSGNATINATVSESAHATKATQDTNGRAFTDTNAYMHISYLAN 707 Query: 812 HVAAFARRATDTYADADGGVPWNAESGAYNVTRSGDSYILVNFYTGVGSCRTLQMKAHYR 871 + T Y + +T + Y + + + Sbjct: 708 GTDFNDVKTTGIYYCTQDTYTNRPHNSWGILTVYSIGT-VKQEYRPDNAAVYYTREYNNS 766 Query: 872 NGGLFYRSSRDGYGFEEDWAEVYTSKNLPPESYPVGAPIPWPSDTVPSGYALMQGQAFDK 931 N + + + + + Y ++ + YPVG+ T P L +++ Sbjct: 767 NWTAWSKVAASTADNADTVDD-YHVSDIISKIYPVGSIYMSMVATNPHD--LFGVGTWER 823 Query: 932 SAYPKLAAAYPSGVIPDMRGWTIKGKPASGRAVLSQEQDGIKSHTHSASASSTDLG 987 + ++ K G A + D + H H+ D G Sbjct: 824 ISQGRMLLGADDS--------AYKAGATGGEATHTLTVDEMPRHFHNYDLYVGDYG 871 >UniRef50_B4PV00 GE23539 n=2 Tax=melanogaster subgroup RepID=B4PV00_DROYA Length = 2641 Score = 66.7 bits (160), Expect = 5e-09, Method: Composition-based stats. 
Identities = 63/507 (12%), Positives = 130/507 (25%), Gaps = 12/507 (2%) Query: 55 YGQYSVILLVEGFPPSHAGTITVYEDSQPGTLNDFLGAMTEDDARPEALRRFELMVEEVA 114 G+Y+ T ++ F +ED+ +P ++ + Sbjct: 1223 EGEYAGPTEQPEASTPAQFADTAEKEVDDKLATTFAPISSEDELKPADEKKPTDEAQIPV 1282 Query: 115 RNASAVAQNTAAAKKSASDASTSAREAATHAADAADSARAASTSAGQAASSAQSASSSAG 174 A ++ A + +A + S Sbjct: 1283 AEIPASTAEPESSTPELPAADLDKKPEEDSTKATEAPESDKVPEVPTSAPAEDEIEESDK 1342 Query: 175 TASTKATEASKSAAAAESSKSAAATSAGAAKTSETNASASLQSAATSASTATTKASEAAT 234 + + S S + + + SE S + A Sbjct: 1343 FTTVAPPKTSASDETEPAGEEDLVPATFEPVESEFEVSTKKPAVQGPPLPTLAPAQPEKK 1402 Query: 235 SARDAAASKEAAKSSETNASSSASSAASSAT----AAGNSAKAAKTSETNARSSETAAGQ 290 + +++ + + +S +S + A +A E ++ T Sbjct: 1403 PVDEEDSTEAEISTEPSAEVEKETSGETSESDSKIDAPTTAAPVSAEEEEDKTPSTDKTV 1462 Query: 291 SASAAAGSKTAAASSASAASTSA--------GQASASATAAGKSAESAASSASTATTKAG 342 A + A A+ +A TAA E + Sbjct: 1463 EAEEKVTTVAPVAGDDEEANLPKLPQDIFEEELPAAVTTAAPSKVEDEQKPVDDGEKQFE 1522 Query: 343 EATEQASAAARSASAAKTSETNAKASETSAESSKTAAASSASSAASSASSASASKDEATR 402 + + ++++A+ T+A +K AA ++ A S S + E Sbjct: 1523 DGKKPIDEETSTSASAENEIEPESDRATTAAPTKEEAAEPSTGAPESDESKETPESEVAT 1582 Query: 403 QASAAKSSATTASTKATEAAGSATAAAQSKSTAESAATRAETAAKRAEDIASAVALEDAS 462 + A T+S E + +A T ET A ED ++ A + Sbjct: 1583 TVAPAGEKIPTSSITPDEEGTATSAPVAKPDEDVEKETSTETPASSEEDEDTSTAQTPSQ 1642 Query: 463 TTKKGIVQLSSATNSTSETLAATPKAVKSAYDNAEKRLQKDQNGADIPDKGCFLNNINAV 522 +K + ET+ AT S + L +D Sbjct: 1643 VPEKKPEAPAQTPEEEEETVGATTAPTTSDEVPPIQGLPEDVLAEIPQPSTETGIKQQDE 1702 Query: 523 SKTDFADKRGMRYVRVNAPAGATSGKY 549 + + + + + A A + Sbjct: 1703 TTGAPSVEPSVTEIDQEATTVAPIAEK 1729 >UniRef50_B2VW07 Predicted protein n=1 Tax=Pyrenophora tritici-repentis Pt-1C-BFP RepID=B2VW07_PYRTR Length = 2856 Score = 65.9 bits (158), Expect = 8e-09, Method: Composition-based stats. 
Identities = 69/502 (13%), Positives = 147/502 (29%), Gaps = 3/502 (0%) Query: 77 VYEDSQPGTLNDFLGAMTEDDARPEALRRFELMVEEVARNASAVAQNTAAAKKSASDAST 136 P + +A P A A + + + Sbjct: 687 TPAGEPPVEETPAVEEAPAIEATPATEEAPSAETTSAAEETPAPEDSPVDEPTPVFEETP 746 Query: 137 SAREAATHAADAADSARAASTSAGQAASSAQSASSSAGTASTKATEASKSAAAAESSKSA 196 + +A AA++ TS + + A A+ S T A+ A ++ A ++ + Sbjct: 747 APDDAPFAETPAAETPAPEETSTPEKDTPAAEATPSDETPVADASTAEETPAVEAAAVAE 806 Query: 197 AATSAGAAKTSETNASASLQSAATSASTATTKASEAATSARDAAASKEAAKSSETNASSS 256 ++A A ++E +A + + +A A+ +A +T + + Sbjct: 807 DTSAADATPSAEEAPAAEVPLVVEETQATKETPITEEPATEEAPAADDAPALDKTPTAET 866 Query: 257 ASSAASSATAAGNSAKAAKTSETNARSSETAAGQSASAAAGSKTAAASSASAASTSAGQA 316 ++ + A + + + + E A AA + + + Sbjct: 867 SAPEETHVADAIPTVEETSVVDEAPVAEEVHDDSEAIAATEMPAVDEALLPEPEVAVEEE 926 Query: 317 SASATAAGKSAESAASSASTATTKAGEATEQASAAARSASAAKTSETNAKASETSAESSK 376 + + A+++ + + + A E+A+ AA + A+ S ++ + S Sbjct: 927 KIPPLDEIPATDPASAAEESTSVQGSTAAEEATPAAEATEKARVSVDDSLPEQESTSEES 986 Query: 377 TAAASSASSAASSASSASASKDEATRQASAAKSSATTASTKATEAAGSATAAAQSKSTAE 436 + + S D +T A A E A + Sbjct: 987 PVVEEQSPAPVPSPEDNEEVSDSSTEPAQDLPVVEEPAVESQPEEAHPVVTEEPVAEESG 1046 Query: 437 SAATRAETAAKRAEDIASAVALEDASTTKKGIVQLSSATNSTSETL---AATPKAVKSAY 493 + A +A + + +++D TT T T+E ATP A Sbjct: 1047 EVSIPAAESAPVDASSSESESVKDPLTTDATTEAQVEDTMETTEPAVIDDATPATSGEAD 1106 Query: 494 DNAEKRLQKDQNGADIPDKGCFLNNINAVSKTDFADKRGMRYVRVNAPAGATSGKYYPVV 553 + ++ D P + A + D + + PVV Sbjct: 1107 SAIDTVDDLTESPIDAPATEVVADVAEAAADPVLVDDHVVEEASSEDVVADVTQSELPVV 1166 Query: 554 VMRSAGSVSELASRVIITTATR 575 V + + V T+A Sbjct: 1167 VDNLVDTEASNEDGVHDTSAAP 1188 >UniRef50_A0YKP6 Hemolysin-type calcium-binding toxin n=1 Tax=Lyngbya sp. PCC 8106 RepID=A0YKP6_9CYAN Length = 681 Score = 65.9 bits (158), Expect = 9e-09, Method: Composition-based stats. 
Identities = 39/328 (11%), Positives = 82/328 (25%), Gaps = 1/328 (0%) Query: 203 AAKTSETNASASLQSAATSASTATTKASEAATSARDAAASKEAAKSSETNASSSASSAAS 262 + +T+ S S + + ++A ++E + S S Sbjct: 12 ESSEEDTSLPESQTSESPELEDEEDSPLLDPQPEIAESGESDSASNAEDLSESEPRSPVE 71 Query: 263 SATAAGNSAKAAKTSETNARSSETAAGQSASAAAGSKTAAASSASAASTSAGQASASATA 322 S+ A+ + S++ + S T + S + + + Sbjct: 72 SSEDESKETPASSDPQIIQPESKSGSNSDVDVPPESNTEPETRTEVEPESPDENTPPPQS 131 Query: 323 AGKSAESAASSASTATTKAGEATEQASAAARSAS-AAKTSETNAKASETSAESSKTAAAS 381 + + + E E + A SE + + ES +T A+S Sbjct: 132 QTPQSPNLEDEQDSPLNPQPEIDETEEDSPPQAEVEPPESEPRSPVESSEDESKETPASS 191 Query: 382 SASSAASSASSASASKDEATRQASAAKSSATTASTKATEAAGSATAAAQSKSTAESAATR 441 + S S S E +++ + T ++ + + +S Sbjct: 192 DPQIIQPESKSGSNSNVEIPPESNTEPETRTEVEPESPDENTPPPQSQTPQSPNLEDEQD 251 Query: 442 AETAAKRAEDIASAVALEDASTTKKGIVQLSSATNSTSETLAATPKAVKSAYDNAEKRLQ 501 + + D + A + S E E + Sbjct: 252 SPLNPQPEIDETEEDSPPQAEVEPPESEPRTPVEPSEDEPKETPTTTDPQTTPEPESKSG 311 Query: 502 KDQNGADIPDKGCFLNNINAVSKTDFAD 529 D N P V D Sbjct: 312 SDSNVDVPPQSNTEPETRPEVEPEPPDD 339 >UniRef50_D2HNP5 Putative uncharacterized protein (Fragment) n=1 Tax=Ailuropoda melanoleuca RepID=D2HNP5_AILME Length = 1476 Score = 65.2 bits (156), Expect = 1e-08, Method: Composition-based stats. 
Identities = 49/431 (11%), Positives = 95/431 (22%), Gaps = 4/431 (0%) Query: 81 SQPGTLNDFLGAMTEDDARPEALRRFELMVEEVARNASAVAQNTAAAKKSASDASTSARE 140 + TED PE E+A + A + + A Sbjct: 677 ETTTEALEHSEPATEDPEHPEPATEAPE-CPELATEDPEFPEPATEALEHSEPAMEDPEH 735 Query: 141 AATHAADAADSARAASTSAGQAASSAQSASSSAGTASTKATEASKSAAAAESSKSAAATS 200 A +T A + A A + A T+A E + A A + Sbjct: 736 P--EPVTEAPKWPEPATEAPECLEPATEAPEHSEPA-TEAPELPELTTEASECPELATEA 792 Query: 201 AGAAKTSETNASASLQSAATSASTATTKASEAATSARDAAASKEAAKSSETNASSSASSA 260 + + S + S A S A+ A Sbjct: 793 PEHPEPATEALEHSEPAMEAPELPELIMESPEVPEPATEAPECPEPVSEAPECLEPATEA 852 Query: 261 ASSATAAGNSAKAAKTSETNARSSETAAGQSASAAAGSKTAAASSASAASTSAGQASASA 320 A + + T T A + + + + A + A Sbjct: 853 PECPEPATEALEQPATEAPERPEPATEAPERPEPVSEAPERPEPVSEAPERPEPVSEAPE 912 Query: 321 TAAGKSAESAASSASTATTKAGEATEQASAAARSASAAKTSETNAKASETSAESSKTAAA 380 S + + E +A A+ A A + Sbjct: 913 RPEPVSEAPERPEPVSEAPERPEPVLEAPECPEPATGPPRPAVEATDPHKPASIGEEVEE 972 Query: 381 SSASSAASSASSASASKDEATRQASAAKSSATTASTKATEAAGSATAAAQSKSTAESAAT 440 + + A + K + + A + A ++ Sbjct: 973 GLLAPELGTCPGACVCAGDGAETHLPQKEAPGGENQGAGGLESAPQGARKAPGDCGPEVH 1032 Query: 441 RAETAAKRAEDIASAVALEDASTTKKGIVQLSSATNSTSETLAATPKAVKSAYDNAEKRL 500 A S +A+ + V ++S S + + ++ Sbjct: 1033 PAACPEVGHAWPQSPAEEGEANPEPRSPVAVASEAGLGSCSEFPPRSVPRPGRRCPKEPG 1092 Query: 501 QKDQNGADIPD 511 + P+ Sbjct: 1093 PTSPAPSQQPE 1103 >UniRef50_B8QTW7 Putative tail fiber protein n=1 Tax=Erwinia phage phiEa21-4 RepID=B8QTW7_9CAUD Length = 357 Score = 65.2 bits (156), Expect = 2e-08, Method: Composition-based stats. 
Identities = 57/312 (18%), Positives = 101/312 (32%), Gaps = 53/312 (16%) Query: 809 TAAHVAAFARRATDTYADADGGVPWNAESGAYNVTRSGDSYILVNFYTGVGSCRTLQMKA 868 +V + D N ++ + +N Y+ +Q Sbjct: 50 NPHNVTKAQVGLGNVTNDEQLVKAQN----LGDLPNVEMAQQNLNVYSKEAVDSVVQAHI 105 Query: 869 HYRNGGLFYRSSRDGYGFEEDWAE-----------------VYTSKNLPPESYPVGAPIP 911 + +N S+ G G E++ V + YP+G I Sbjct: 106 NDKNNPHNTTKSQVGLGNVENYTVALSYTDASANKYVTAKVVNDLYKMIQGMYPIGHRIY 165 Query: 912 WPSDTVPSGYALMQGQAFDKSAYPKLAAAYPSGVIPDMRGWTIKGKPASGRAVLSQEQDG 971 + PS Y + + + +++ Y +G G G + ++ + Sbjct: 166 TDNSANPSTYIPVG--TWALTGQGRVSVGYDAGNSSRPAG------TKFGSSTVTIDVAN 217 Query: 972 IKSHTHSASASSTDLGTKTTSSFDYGTKSTNNTGAHTHSVSGSTNSAGAHTHSLANVNTA 1031 + +HTH T G H+H SGST AGAH H + Sbjct: 218 LPAHTHGV---------------------TVTGGNHSHGASGSTTGAGAHNHVASGNTGY 256 Query: 1032 SANSGAGSASTRLSVVHNQNYATSSAGAHTHSLSGTAASAGAHAHTVGIGAHTHSVAIGS 1091 + + +TR + N+ + H + AG H H V + T++V + Sbjct: 257 AGDHNHTYTTTRQGGGNPGNHVGHGS-NEIHYTNEATGVAGGHTHYVSLA--TNTVGDHA 313 Query: 1092 HGHTITVNAAGN 1103 HG I +NA+GN Sbjct: 314 HGLNININASGN 325 >UniRef50_A0NQ95 Putative tail fiber-related protein n=1 Tax=Labrenzia aggregata IAM 12614 RepID=A0NQ95_9RHOB Length = 329 Score = 64.0 bits (153), Expect = 3e-08, Method: Composition-based stats. 
Identities = 47/246 (19%), Positives = 79/246 (32%), Gaps = 60/246 (24%) Query: 882 DGYGFEEDWAEVYTSKNLPPESYPVGAPIPWPSDTVPSGYALMQGQAFDKSAYPKLAAAY 941 +G A + G + T P G+ G ++AY L AA Sbjct: 122 NGLDANNVQAALEKLTTKVTNGVAPGCVAYYAMSTAPDGWLKANGAEISRTAYADLFAAI 181 Query: 942 PS----------GVIPDMRGWTIKGKPA-----SGRAVLSQEQDGIKSHTHSASASSTDL 986 + +PD+RG ++G R + S + D SHTH+ S SS Sbjct: 182 GTIFGVGDGNSTFNLPDLRGEFLRGWDDARGVDGARVLGSSQSDQNASHTHTGSTSSDSH 241 Query: 987 GTKTTSSFDYGTKSTNNTGAHTHSVSGSTNSAGAHTHSLANVNTASANSGAGSASTRLSV 1046 T++ T++ +G A + +++ Sbjct: 242 SHTGTTNTTGNHTHNMAYEGGTNAGTGLAAPATSRSNTSPGP------------------ 283 Query: 1047 VHNQNYATSSAGAHTHSLSGTAASAGAHAHTVGIGAHTHSVAIGSHGHTITVNAAGNAEN 1106 T +G H+HT S + SH H++T +A+G +E Sbjct: 284 --------------------TVNYSGNHSHTF-------STSSDSHSHSVTTDASGGSEA 316 Query: 1107 TVKNIA 1112 +NIA Sbjct: 317 RPRNIA 322 >UniRef50_C6BT48 Tail Collar domain protein n=1 Tax=Desulfovibrio salexigens DSM 2638 RepID=C6BT48_DESAD Length = 208 Score = 64.0 bits (153), Expect = 4e-08, Method: Composition-based stats. Identities = 35/125 (28%), Positives = 59/125 (47%), Gaps = 10/125 (8%) Query: 902 ESYPVGAPIPWPSDTVPSGYALMQGQAFDKSAYPKLAAAYPSGVIPDMRGWTIKGKP--- 958 YP+GA + DT P G+ GQ+ + YP+LAA + +PD+RG I+G Sbjct: 59 SDYPIGAVAAYRGDTPPVGWLECNGQS--TTGYPELAAVVGAN-VPDLRGEFIRGLDSGR 115 Query: 959 --ASGRAVLSQEQDGIKSHTHSASASSTDLGTKTTSSF--DYGTKSTNNTGAHTHSVSGS 1014 +GRA+ S + D ++ H+H + + + + T S + +S T G+ Sbjct: 116 GVDAGRALGSAQADAMERHSHQTTITVSGRTSVTASPYHSAGAARSLVTTPNFGSPFGGA 175 Query: 1015 TNSAG 1019 + SA Sbjct: 176 SFSAS 180 >UniRef50_Q1I687 Putative phage variable tail fibre protein n=1 Tax=Pseudomonas entomophila L48 RepID=Q1I687_PSEE4 Length = 898 Score = 64.0 bits (153), Expect = 4e-08, Method: Composition-based stats. 
Identities = 41/178 (23%), Positives = 66/178 (37%), Gaps = 21/178 (11%) Query: 902 ESYPVGAPIPWPSDTVPSGYALMQGQAFDKSAYPKLAAAYPSG------------VIPDM 949 + PVG +P+P TVP+G+ + G + YP LA AY G +PD Sbjct: 382 SALPVGTMLPFPRGTVPAGFLEVDGSTQSAAVYPDLA-AYLGGAFNTGNEAAGFFRLPDT 440 Query: 950 RGWTIKGKP-----ASGRAVLSQEQDGIKSHTH---SASASSTDLGTKTTSSFDYGTKST 1001 RG ++G SGRAV S + + K+HTH + + + G ++ Sbjct: 441 RGEFLRGWDHGRGVDSGRAVGSTQGESFKAHTHKDVGFIDNVGGGSGASAVTGATGDVTS 500 Query: 1002 NNTGAHTHSVSGSTNSAGAHTHSLANVNTASANSGAGSASTRLSVVHNQNYATSSAGA 1059 A+ +S S + + A SG+ S + + A Sbjct: 501 IYGKAYGNSASATAKAYKESAPGALGGAIAGLISGSTGDSETRPRNLAVMWCIKAWNA 558 Score = 63.2 bits (151), Expect = 6e-08, Method: Composition-based stats. Identities = 53/239 (22%), Positives = 89/239 (37%), Gaps = 27/239 (11%) Query: 801 DGSKDITLTAAHVAAFARRATDTYADADGGVPWNAESGAYNVTRSGDSYILVNFYTGVGS 860 G+ +T V + +A A A + GA G L++ TG Sbjct: 486 SGASAVTGATGDVTSIYGKAYGNSASATAKAYKESAPGAL----GGAIAGLISGSTGDSE 541 Query: 861 CRTLQMKAHYRNGGLFYRSSRDGYGFEEDWAEVYTSKNLPPESYPVGAPIPWPSDTVPSG 920 R + + ++ AE+ ++ PVGA +P+P VP+G Sbjct: 542 TRPRNLAVMWCIKAWNAPVNQGQIDVAALVAELKALRSST----PVGAILPFPKAEVPAG 597 Query: 921 YALMQGQAFDKSAYPKLAA---------AYPSG--VIPDMRGWTIKGKP-----ASGRAV 964 Y + G + YP LAA P+G +PD RG ++G GR + Sbjct: 598 YLELDGSLQSVATYPDLAAYLGASYNNGTEPAGYFRLPDYRGEFLRGWDHGRGVDPGRGM 657 Query: 965 LSQEQDGIKSHTHSA---SASSTDLGTKTTSSFDYGTKSTNNTGAHTHSVSGSTNSAGA 1020 + + D I++ T S + LG +S + T +T A+T + S+ +A Sbjct: 658 GTSQSDAIQNITGSIGLRGGAGVGLGVMGGASGAFSTVFGESTSANTITRDASSIAASD 716 >UniRef50_C1F9K0 Putative uncharacterized protein n=1 Tax=Acidobacterium capsulatum ATCC 51196 RepID=C1F9K0_ACIC5 Length = 1272 Score = 63.6 bits (152), Expect = 5e-08, Method: Composition-based stats. 
Identities = 70/456 (15%), Positives = 132/456 (28%), Gaps = 27/456 (5%) Query: 5 ISGVLKDGTGKPVQNCTIQLKAKRNSTTVVVNTLASENPDEAGRYSM-DVEYGQYSVILL 63 ++G + DG G PV++ + ++A + T D G + + + G Y V + Sbjct: 17 MAGRILDGDGAPVEHARLMIEALGS------GTTYDLGTDPNGNFLLPQLAPGAYKVAVE 70 Query: 64 VEGFPPSHAGTITVYEDSQPGTLNDFLGAMTEDDARPEALRRFELMVEEVARNAS----- 118 GF + V Q + + RP A + + ++ Sbjct: 71 APGFAAWTLPDVEVSAGQQRELSPRLESLLAKKQPRPAAGPQGSKSASQRVSESALTLSM 130 Query: 119 -------------AVAQNTAAAKKSASDASTSAREAATHAADAADSARAASTSAGQAASS 165 A A + A AR A AA + +A + Sbjct: 131 LQRQGGQRSLHVREPALAEPLALQEPEAAPALARVAHAPAAASPLQIYTRLRNAQTVHAR 190 Query: 166 AQSASSSAGTASTKATEASKSAAAAESSKSAAATSAGAAKTSETNASASLQSAATSASTA 225 + +T A + A + + + S+ + + + + SAS Sbjct: 191 VSVTTPQGLVEATIAEKPVVPAGTTDWAAFEPGRQQVSEPASQPASRPAGKRVSESASQP 250 Query: 226 TTKASEAATSARDAAASKEAAKSSETNASSSASSAASSATAAGNSAKAAKTSETNARSSE 285 A + A A + A+ +A S + + ++ A ++ Sbjct: 251 GGGFPGAVRMREQSDAIAHANPHANHGATQAAESDSLDVPPVPPLNEHMTVEDSTAEATA 310 Query: 286 TAAGQSASAAAGSKTAAASSASAASTSAGQASASATAAGKSAESAASSASTATTKAGEAT 345 +S + + +A++ A A QA AA + S T Sbjct: 311 ADDHESTATRSAPAKTSANTLELAEAFAPQAPERGEQDQSDIRRAAVTNSDVATDGDPNR 370 Query: 346 EQASAAARSASAAKTSETNAKASETSAESSKTAAASSASSAASSASSASASKDEATRQAS 405 S SA + + +A + S T + S + + R A+ Sbjct: 371 AAHSTGEDDVSAFEPAGEDAIEVSRRSRSDSTDSFEVRSDHPVDRVHGQSFIE--DRDAA 428 Query: 406 AAKSSATTASTKATEAAGSATAAAQSKSTAESAATR 441 +A T T T A T + AA Sbjct: 429 WGAENAGTTLTTQTAAGAFVTTPYKPADRRLQAALE 464 >UniRef50_B6Q6N2 PT repeat family protein n=1 Tax=Penicillium marneffei ATCC 18224 RepID=B6Q6N2_PENMQ Length = 2150 Score = 63.2 bits (151), Expect = 5e-08, Method: Composition-based stats. 
Identities = 56/501 (11%), Positives = 118/501 (23%), Gaps = 2/501 (0%) Query: 52 DVEYGQYSVILLVEGFPPSHAGTITVYEDSQPGTLNDFLGAMTEDDARPEALRRFELMVE 111 E D T +D +P A + L E Sbjct: 901 TAEPAVTEEEKEAAPSEVVAESDAVHSHDENTSADISEATKTTVEDGQPSAEKEESLPTE 960 Query: 112 EVARNASAVAQNTAAAKKSASDASTSAREAATHAADAADSARAASTSAGQAASSAQSASS 171 E + T A ++ A + A+A ++ + +AA+S + Sbjct: 961 EAPASEPVEESATPEASENEPSAEVVEETPVSEPAEAPVASLKEAEPVQEAAASDEVVEP 1020 Query: 172 SAGTASTKATEASKSAAAAESSKSAAATSAGAAKTSETNASASLQSAATSASTATTKASE 231 + +T +S A + E +A+T T Sbjct: 1021 VVVEEAVASTVNEPEPVTEDSEVQPAEEVDPSTVKEELPEEPVTAEEPKAATTETVAEEA 1080 Query: 232 AATSARDAAASKEAAKSSETNASSSASSAASSATAAGNSAKAAKTSETNARSSETAAGQS 291 T A A + E ++ A ++++ +K + + A + Sbjct: 1081 IETETDAIAEEPAAEDAEEEEEEEEEEEIETTTEPAVDTSEESKEETADEQPITDAVEAT 1140 Query: 292 ASAAAGSKTAAASSASAASTSAGQASASATAAGKSAESAASSASTATTKAGEATEQASAA 351 A ++ A A + + + + + T E + A Sbjct: 1141 VVENAVEESTIEEPAEATEKTPTEVVGESPVEPAQEALSKAVEETPAEPTVEEPAKTVEA 1200 Query: 352 ARSASAAKTSETNAKASETSAESSKTAAASSASSAASSASSASASKDEATRQASAAKSSA 411 E +E AE + + ++ ++ ++ Sbjct: 1201 VTETPVESVEEQPVDTAEQVAEPVEEVSVEETAAEPATEEPTQIVEETPLEADVETAVEP 1260 Query: 412 TTASTKATEAAGSATAAAQSKSTAESAATRAETAAKRAEDIASAVALEDASTTKKGIVQL 471 A S + E A T E AA+ + + + ++ Sbjct: 1261 VVAEVAEPVEKTSVEVTPTQSADEEVAKTIEEPAAEEPVQAMEET--PVETIAEPTVEEI 1318 Query: 472 SSATNSTSETLAATPKAVKSAYDNAEKRLQKDQNGADIPDKGCFLNNINAVSKTDFADKR 531 + TP A + E+ ++ I + A Sbjct: 1319 AEPVKEILVEATPTPSADEEVAKTIEEPAAEETPVQTIAEPAVEEIAKPVKHSLIKAMPT 1378 Query: 532 GMRYVRVNAPAGATSGKYYPV 552 + V Sbjct: 1379 EPAEEESTKIVEEAPAEAVSV 1399 Score = 58.2 bits (138), Expect = 2e-06, Method: Composition-based stats. 
Identities = 43/419 (10%), Positives = 108/419 (25%), Gaps = 1/419 (0%) Query: 88 DFLGAMTEDDARPEALRRFELMVEEVARNASAVAQNTAAAKKSASDASTSAREAATHAAD 147 E A P A E + + A ++ A + + + T A Sbjct: 877 PISEPAVEPVAEPVAEAVAESAEPTAEPAVTEEEKEAAPSEVVAESDAVHSHDENTSADI 936 Query: 148 AADSARAASTSAGQAASSAQSASSSAGTASTKATEASKSAAAAESSKSAAATSAGAAKTS 207 + + A + A + A+ A+ E S + + Sbjct: 937 SEATKTTVEDGQPSAEKEESLPTEEAPASEPVEESATPEASENEPSAEVVEETPVSEPAE 996 Query: 208 ETNAS-ASLQSAATSASTATTKASEAATSARDAAASKEAAKSSETNASSSASSAASSATA 266 AS + +A++ A + ++ + ++ + S+ Sbjct: 997 APVASLKEAEPVQEAAASDEVVEPVVVEEAVASTVNEPEPVTEDSEVQPAEEVDPSTVKE 1056 Query: 267 AGNSAKAAKTSETNARSSETAAGQSASAAAGSKTAAASSASAASTSAGQASASATAAGKS 326 A + A + A+ + + T + Sbjct: 1057 ELPEEPVTAEEPKAATTETVAEEAIETETDAIAEEPAAEDAEEEEEEEEEEEIETTTEPA 1116 Query: 327 AESAASSASTATTKAGEATEQASAAARSASAAKTSETNAKASETSAESSKTAAASSASSA 386 +++ S + + +A T E A+A+E + + + Sbjct: 1117 VDTSEESKEETADEQPITDAVEATVVENAVEESTIEEPAEATEKTPTEVVGESPVEPAQE 1176 Query: 387 ASSASSASASKDEATRQASAAKSSATTASTKATEAAGSATAAAQSKSTAESAATRAETAA 446 A S + + + + + T ++ E TA ++ E + Sbjct: 1177 ALSKAVEETPAEPTVEEPAKTVEAVTETPVESVEEQPVDTAEQVAEPVEEVSVEETAAEP 1236 Query: 447 KRAEDIASAVALEDASTTKKGIVQLSSATNSTSETLAATPKAVKSAYDNAEKRLQKDQN 505 E + + + + + E + +SA + K +++ Sbjct: 1237 ATEEPTQIVEETPLEADVETAVEPVVAEVAEPVEKTSVEVTPTQSADEEVAKTIEEPAA 1295 >UniRef50_B8LYA4 Uro-adherence factor A, putative n=1 Tax=Talaromyces stipitatus ATCC 10500 RepID=B8LYA4_TALSN Length = 1959 Score = 63.2 bits (151), Expect = 5e-08, Method: Composition-based stats. 
Identities = 58/514 (11%), Positives = 125/514 (24%) Query: 15 KPVQNCTIQLKAKRNSTTVVVNTLASENPDEAGRYSMDVEYGQYSVILLVEGFPPSHAGT 74 PV+ + + ASE E + + E +V Sbjct: 315 APVEEAKEDATEVAEESELTTTAEASEQAAEPHTVTTEAEPAAKTVPEPEFQTEKEAETD 374 Query: 75 ITVYEDSQPGTLNDFLGAMTEDDARPEALRRFELMVEEVARNASAVAQNTAAAKKSASDA 134 E T T+ + + A + Sbjct: 375 SVEAEQVPAATTTAEAVEDTKPQTEATEAEVPVALESAPESAPEPETAAKEEVEAPAHEP 434 Query: 135 STSAREAATHAADAADSARAASTSAGQAASSAQSASSSAGTASTKATEASKSAAAAESSK 194 + +E T A A + ++ + + AAAE Sbjct: 435 ADEVKEDTTEKAVEEKPAETEVSEVVESVKESATEGQPEEAEPEPRETFPHEPAAAEMKD 494 Query: 195 SAAATSAGAAKTSETNASASLQSAATSASTATTKASEAATSARDAAASKEAAKSSETNAS 254 ++ + +A + +A+ T A ++ S Sbjct: 495 TSVVEQPKEIEAEPVEVAADVVEPTVTATEVPEAVPAETTVETAAKEAEAVPASGSDVKI 554 Query: 255 SSASSAASSATAAGNSAKAAKTSETNARSSETAAGQSASAAAGSKTAAASSASAASTSAG 314 + ++ A A T + E A A A S A + + Sbjct: 555 VPPAEEETTVAPAETEPAAVTTETKTPEAVEEPASAPAEAPEVSTPETPEVAEETAPESS 614 Query: 315 QASASATAAGKSAESAASSASTATTKAGEATEQASAAARSASAAKTSETNAKASETSAES 374 Q A A A ++ + A + T+ + E + Sbjct: 615 QVEDGTATESPIAVEEAVEEKEAVAESAVTEPETVATTTEDTTEATTSNEEPSKEINTHE 674 Query: 375 SKTAAASSASSAASSASSASASKDEATRQASAAKSSATTASTKATEAAGSATAAAQSKST 434 A A +A+ + +++ + A + A + + TT + E + + + Sbjct: 675 EPAEAQPEAVTASPEVAESTSVESSAEKVEEAHELADTTPAIAEPEQVAAEESTPEPVQA 734 Query: 435 AESAATRAETAAKRAEDIASAVALEDASTTKKGIVQLSSATNSTSETLAATPKAVKSAYD 494 AE AA+ + + Q+ S T ++++ + Sbjct: 735 AEEKTDEETVAAEAPIESTEPETAAIDEQPVAAVEQIDDVAKSEEPVETETADSIEATEE 794 Query: 495 NAEKRLQKDQNGADIPDKGCFLNNINAVSKTDFA 528 A + + +P+ + Sbjct: 795 PAAAPAESESTPVSVPEPAVESVEELTTEPVAES 828 >UniRef50_Q9XW25 Protein Y18D10A.1, partially confirmed by transcript evidence n=1 Tax=Caenorhabditis elegans RepID=Q9XW25_CAEEL Length = 1634 Score = 63.2 bits (151), Expect = 6e-08, Method: Composition-based stats. 
Identities = 42/420 (10%), Positives = 105/420 (25%), Gaps = 2/420 (0%) Query: 51 MDVEYGQYSVILLVEGFPPSHAGTITVYEDSQPGTLNDFLGAMTEDDARPEALRRFELMV 110 M +E +V G P S + R + Sbjct: 531 MSMEPAAAAVTPAPRGRPRSRSAAKVSENTEPLSEAPSAPVKRGRGRPRSRSTMSITEDS 590 Query: 111 EEVARNASAVAQNTAAAKKSASDASTSAREAATHAADAADSARAASTSAGQAASSAQSAS 170 E + +A A + + ++ S T + + Sbjct: 591 EPSTSSTAAKRSKRAESDEEEEQDLKLTNKSPEKPK--KPSKTTEETVGDVLKKRLRDTA 648 Query: 171 SSAGTASTKATEASKSAAAAESSKSAAATSAGAAKTSETNASASLQSAATSASTATTKAS 230 + T ++ A TS+ K ++ S + Sbjct: 649 KTTATVIHTPGPPLRTRKMERMRAPTAVTSSKKEKPKNAGSADSSINEEEHEDETMILEE 708 Query: 231 EAATSARDAAASKEAAKSSETNASSSASSAASSATAAGNSAKAAKTSETNARSSETAAGQ 290 + + + + ++ + ++ + K + A+ + Sbjct: 709 QTLDLPQQTSQQEPRISCGSELLDEQFDASEEHSGTVPSAPELTKNPAPPVPEASEASAE 768 Query: 291 SASAAAGSKTAAASSASAASTSAGQASASATAAGKSAESAASSASTATTKAGEATEQASA 350 + + + A + + A ++ + + S + +A +A +S Sbjct: 769 PPKIDIPEQATPILALALALPTVSPTALEPPKAQENPTAELPTTSEISGRAPQALPTSSQ 828 Query: 351 AARSASAAKTSETNAKASETSAESSKTAAASSASSAASSASSASASKDEATRQASAAKSS 410 ++ +A + + S + ++ + S SS + + QA S Sbjct: 829 TPPTSGSAAPPVDDLLSEILSGAKTTKTRKAAPPAVQKSISSTTQQAPPTSVQAPPTSCS 888 Query: 411 ATTASTKATEAAGSATAAAQSKSTAESAATRAETAAKRAEDIASAVALEDASTTKKGIVQ 470 A S ++ T + + + A I+ + T K Q Sbjct: 889 AAPPVDDLLSEILSGAKTTKTTKTTKMPPVDQKKISSEAPPISDSAPTSVHQQTPKSPKQ 948 >UniRef50_B1M1N8 Tail Collar domain protein n=1 Tax=Methylobacterium radiotolerans JCM 2831 RepID=B1M1N8_METRJ Length = 414 Score = 62.1 bits (148), Expect = 1e-07, Method: Composition-based stats. 
Identities = 75/356 (21%), Positives = 118/356 (33%), Gaps = 73/356 (20%) Query: 815 AFARRATDTYADADGGVPWNAESGAYNVTRSGDSYILVNFYTGVGSCRTLQMKAHYRNGG 874 A A G + + ++ + F T + + A Sbjct: 44 ALWLVDNSGVNQAFGSDAYTVITQQGVSAKAAGRAHTLKFRTTTTNLNPCTLAADNNAPK 103 Query: 875 LFYRSSRDGYGFEED-------WAEVYTSK-----NLPPESYPVGAPIPWPSDTVPSGYA 922 + R DG F W+ VY L P + GA + VPSG+ Sbjct: 104 PWLR--WDGTQFGPGDIGQNVVWSVVYDPVAQVYRTLSPTTEQAGAIKAFAGPNVPSGWE 161 Query: 923 LMQGQAFDKSAYPKLAAAYPSG----------VIPDMRGWTIKGKPASGRAVLSQEQD-- 970 + G+A ++AY L A +G +PD RG T+ G G L+ Sbjct: 162 ICDGRAVSRTAYAALFATISTGWGNGDGFTTFNLPDARGRTLFG-ANRGTGRLTAAGGLD 220 Query: 971 -----------------GIKSHTHSASASSTDL---GTKTTSSFDYG----------TKS 1000 + SH H+++ S + + D+G + + Sbjct: 221 GSLGNMGGADQVVMLAPQMPSHIHTSTMSPAGFFEPEIQKAGAHDHGGTKVGGDHAHSGT 280 Query: 1001 TNNTGA------------HTHSVSGSTNSAGAHTHSLANVNTASANSGAGSASTRLSVVH 1048 T +G H H V T + A V T G+ T S H Sbjct: 281 TGLSGTHTHGGTTDTSGDHAHVVQYGYGLVSTQTPNNAQVVTGINLGSQGNGQTTQSGPH 340 Query: 1049 NQNYATSSAGAHTHSLSGTAASAGAHAHTVGI-GAHTHSVA-IGSHGHTITVNAAG 1102 + T G HTH+ S G+HAH + + G HTH++ +H HT+ ++AAG Sbjct: 341 QHTFTTGQGGNHTHAFSTDPG--GSHAHEIPVDGDHTHTIDPTPNHVHTLVIDAAG 394 >UniRef50_D0WHF5 Putative uncharacterized protein n=1 Tax=Slackia exigua ATCC 700122 RepID=D0WHF5_9ACTN Length = 1065 Score = 62.1 bits (148), Expect = 1e-07, Method: Composition-based stats. 
Identities = 77/458 (16%), Positives = 130/458 (28%), Gaps = 42/458 (9%) Query: 13 TGKPVQNCTIQLKAKRNSTTVVVNTLASENPDEAGRYSMDVEYGQYSVILLVEGFPPSHA 72 TGKPV RN +V T + P Sbjct: 299 TGKPVAGAAESYDIARN---LVHVTDVASIP----------------------------- 326 Query: 73 GTITVYEDSQPGTLNDFLGAMTEDDARPEALRRFELMVEEVARNASAVAQNTAAAKKSAS 132 T E +L D D A P E A S S Sbjct: 327 ---TPSERESDESLQDARAEDAADTAVPTEASSGAAAAEPDADAESESTDKAVGGMPEPS 383 Query: 133 DASTSAREAATHAADAADSARAASTSAGQAASSAQSASSSAGTASTKATEASKSAAAAES 192 ++ +A A+ + A A A + + A S G ++ A A Sbjct: 384 ESLAQPVSSALTASPHKEPAEVPEWYAKATARARKEAEESTGKV-ERSRYAEIPVVPAAE 442 Query: 193 SKSAAATSAGAAKTSETNASASLQSAATSASTATTKASEAATSARDAAASKEAAKSSETN 252 S + A+ + A + T S S A + A + A + D A + S + Sbjct: 443 SPAVASDAIHAEEPCTTPVSDSTDDAPETPGQAPSDAVSSRAEETDIAPAGTDEMDSRAD 502 Query: 253 ASSSASSAASSATAAGNSAKAAKTSETNARSSETAAGQSASAAAGSKTAAASSASAASTS 312 + + + A + A E A S ++ S K AS+S Sbjct: 503 SEGAGEQEIHAEETAIDERAADVREEDTAVSEAADRDETVSPDETDKPDEPVEEGQASSS 562 Query: 313 AGQASASATA------AGKSAESAASSASTATTKAGEATEQASAAARSASAAKTSETNAK 366 +A A A+ SA + A A ++ + A S ++ A Sbjct: 563 DDSEAAEEEPLPVDPLARTQKLPASVSAERSAELADRAKKERVSVAYDDSLRPAADLGAT 622 Query: 367 ASETSAESSKTAAASSASSAASSASSASASKDEATRQASAAKSSATTASTKATEAAGSAT 426 + ++ +A ++ S + + E + A AS + + +A Sbjct: 623 SIMNPVAAAASAQDNAMHVFEIPRISPAPTAAEPLSSFDDLRQRAPLASATESVSKAAAK 682 Query: 427 AAAQSKSTAESAATRAETAAKRAEDIASAVALEDASTT 464 + E + A+ AK + A A A+ Sbjct: 683 DLLSTLPPIEDPKSSADATAKIEINRAGTFAAASATGA 720 >UniRef50_B8ZQ26 Choline-binding surface protein A n=10 Tax=Streptococcus pneumoniae RepID=B8ZQ26_STRPJ Length = 874 Score = 62.1 bits (148), Expect = 1e-07, Method: Composition-based stats. 
Identities = 78/706 (11%), Positives = 177/706 (25%), Gaps = 44/706 (6%) Query: 97 DARPEALRRFELMVEEVARNASAVAQNTAAAKKSASDASTSAREAATHAADAADSA---- 152 ++ +V + + AK+S + + +A A + Sbjct: 168 PTNTYKTLELDIAESDVEVKKAELELVKEEAKESRDEKKINQAKAKVENKKAEATRLKNI 227 Query: 153 RAASTSAGQAASSAQSASSSAGTASTKATEASKSAAAAESSKSAAATSAGAAKTSETNAS 212 + A +A A + A A+++ ++ + A + A S ++ Sbjct: 228 KTDREKAEEAKRRADAKLQEANVATSEQDKSKRRAKREVLGELATPDKKENDAKSSDSSV 287 Query: 213 ASLQSAATSASTATTKASEAATSARDAAASKEAAKSSETNASSSASSAASSATAAGNSAK 272 + S A +++ + N ++ A + Sbjct: 288 GEETLTSPSLKPEKKVAEAEKKVEEAKKKAEDQKEEDRRNYPTNTYKTLELEIAESDVEV 347 Query: 273 AAKTSETNARSSETAAGQSASAAAGSKTAAASSASAASTSAGQASASATAAGKSAESAAS 332 E ++ + + A +K + + + A A + A++ Sbjct: 348 KKAELELVKEEAKESRDEKKINQAKAKVENKKAEATRLKNIKTDREKAEEAKRRADAKLQ 407 Query: 333 SASTATTKAGEATEQASAAARSASAAKTSETNAKASETSAESSKTAAASSASSAASSASS 392 A+ AT++ ++ +A A + N S S+ +T + S A + Sbjct: 408 EANVATSEQDKSKRRAKREVLGELATPDKKENDAKSSDSSVGEETLTSPSLKPEKKVAEA 467 Query: 393 ASASKDEATRQASAAKSSATTASTKATEAAGSATAAAQSKSTAESAATRAETAAKRAEDI 452 ++ + + T + A + + E A + + Sbjct: 468 EKKVEEAKKKAEDQKEEDRRNYPTNTYKTLELEIAESDVEVKKAELELVKEEAKESRNEE 527 Query: 453 ASAVALEDASTTKKGIVQLSSATNSTSETLAATPKAVKSAYDNAEKRLQKDQNGADIPDK 512 + K +L + + K + D +++ + A P Sbjct: 528 KIKQVKAKVESKKAEATRLENIKTDRKKAEEEEAKRRAAEEDKVKEKPAEQPQPAPAPQP 587 Query: 513 GCFLNNINAVSKTDFADKRGMRY-VRVNAPAGATSGKYYP----VVVMRSAGSVSELASR 567 + + PA + + Y R A + Sbjct: 588 EKPTEEPENPAPAPAPKPENPAEKPKAEKPADQQAEEDYARRSEEEYNRLTQQQPPKAEK 647 Query: 568 VIITTATRTAGDPMNNCEFNGFVMPGGWTDRGRYAYGMFWQYQNNERAIHSIMMSNKGDD 627 + +T N + F G G W Y N+ A+ + + N G Sbjct: 648 PAQPSTPKTGWKQENGMWY--FYNTDGSMATGWLQNNGSWYYLNSNGAMATGWLQNNGS- 704 Query: 628 LRSVFYVDGAAFPVFAFIEDGLSISAPGADLVVNDTTYKFGATNPATECIAADVILDFKS 687 +G+ + + Y + Sbjct: 705 -WYYLNANGSMATGWLQNNGSWYYLNANGSMATGWLQY---------------------N 742 Query: 688 GRGFYESHSLIVNDNLSCKKLFATDEIVARGGNQIRMIGGEYGALWRNDGAKTYLLLTNQ 747 G +Y + AT + G G+ W + Y L N Sbjct: 743 
GSWYYLN----------ANGDMATGWLQNNGSWYYLNANGDMATGWLQNNGSWYYLNANG 792 Query: 748 GDVYGGWNTLRPFAIDNATGELVIGTKLSASLNGNALTATKLQTPR 793 G + NA G++ G + ++ + Sbjct: 793 DMATGWLQYNGSWYYLNANGDMETGWVKDGDTWYYLEASGAMKASQ 838 >UniRef50_UPI000023F160 hypothetical protein FG00031.1 n=1 Tax=Gibberella zeae PH-1 RepID=UPI000023F160 Length = 1534 Score = 60.5 bits (144), Expect = 3e-07, Method: Composition-based stats. Identities = 77/538 (14%), Positives = 150/538 (27%), Gaps = 32/538 (5%) Query: 48 RYSMDVEYGQYSVILLVEGFPPSHAGTITV-YEDSQPGTLNDFLGAMTEDDARPEALRRF 106 Y + GQ S G P +G I+V S+ + A++ PE Sbjct: 175 TYGDSTKTGQDSAHDTSTGTSPHESGQISVSIPFSESDLNTNTQDAVSNTIGLPEGSVTG 234 Query: 107 ELMVEEVARNASAVAQNTAAAKKSASDASTSAREAATHAADAADSARAASTSAGQAASSA 166 E +V D+ ++ A + ++ + + Sbjct: 235 TGSFPEPTSEGGSVPGTVTDG-----DSEETSGGAGNSGSPSSPQTDGSFPTGTADFPGE 289 Query: 167 QSASSSAGTASTKATEASKSAAAAESSKSAAATSAGAAKTSETNASASLQSAATSASTAT 226 S + + AS AT S + ++ S+G +E + + + AT Sbjct: 290 HSGTVTNPDASGTATGPEDSGNPSGTASGFPEQSSGTLVNTEVSGQPTGTADVPGGPGAT 349 Query: 227 TKASEAATSARDAAASKEAAKSSETNASSSASSAASSATAAGNSAKAAKTSETNARSSET 286 + AT S +S+T + S + +AT G + A S + Sbjct: 350 DANT--ATDLPTTIPSDVPGGASDTATNPEGSGPSGTATGPGEAGTTASGGVPGEASGTS 407 Query: 287 AAGQSASAAAGSKTAAASSASAASTSAGQASASATAAGKSAESAASSASTATTKAGEATE 346 AS A + S +A + S T + ++S +T +A Sbjct: 408 PQDSGASDTATIPEGSGPSGTATVPGEPGTTDSPTGISEGVSDSSSGTATVPGEADNTAS 467 Query: 347 QASAAARSASAAKTSETNAKASETSAESSK-TAAASSASSAASSASSASASKDEATRQAS 405 S ++ S+ N T S+ + ++ S ++ T Q Sbjct: 468 GGVPGDASGTSPPGSDANPTEVATDGTSNGGPTVSHDPTNTPGSPEETVSNPATGTEQGQ 527 Query: 406 AAKSSATTASTKATEAAGSATAAAQSKSTAESAATRAETAAKRAEDIASAVALEDASTTK 465 + A + G T+ A +TA + + + T+ Sbjct: 528 DTATQTDAAPSTEVAPTGIDTSIAGPATTAPGNTLPNDQGTLTQSNGDEFTSAAPGDATE 587 Query: 466 KGIVQLSSATNSTSETLAATPKAVKSAYDNAEKRLQKDQNGADIPDKGCFLNNINAVSKT 525 +++ N TL + + + + + + + +A S T Sbjct: 588 DHSGAVTTDANGAPTTLPGNQDQTTGSPEQSTSASSDQNSDGNSNETSLGQEDSSAASTT 647 Query: 526 DFADKRGMRYVRVNAPAGATSGKYYPVVVMRSAGSVSELASRVIITTATRTAGDPMNN 583 
D + G T A+ T +NN Sbjct: 648 D-----------------------GNAAQPTTEGDGVTTKPVADNTDASSTGSSEVNN 682 Score = 44.0 bits (101), Expect = 0.031, Method: Composition-based stats. Identities = 47/364 (12%), Positives = 113/364 (31%), Gaps = 4/364 (1%) Query: 25 KAKRNSTTVVVNTLASENPDEAGRYSMDVEYGQYSVILLVEGFPPSHAGTITVYEDSQPG 84 +A S + + P+ +G G+ G + + + PG Sbjct: 402 EASGTSPQDSGASDTATIPEGSGPSGTATVPGEPGTTDSPTGISEGVSDSSSGTAT-VPG 460 Query: 85 TLNDFLGAMTEDDARPEALRRFELMVEEVARNASAVAQNTAAAKKSAS--DASTSAREAA 142 ++ DA + + EVA + ++ T + + + + A Sbjct: 461 EADNTASGGVPGDASGTSPPGSDANPTEVATDGTSNGGPTVSHDPTNTPGSPEETVSNPA 520 Query: 143 THAADAADSARAASTSAGQAASSAQSASSSAGTASTKATEASKSAAAAESSKSAAATSAG 202 T D+A + + +S AG A+T + + + ++ Sbjct: 521 TGTEQGQDTATQTDAAPSTEVAPTGIDTSIAGPATTAPGNTLPNDQGTLTQSNGDEFTSA 580 Query: 203 AAKTSETNASASLQSAATSASTATTKASEAATSARDAAASKEAAKSSETNASSSASSAAS 262 A + + S ++ + A A T + T + + + S + ++S+ N++ ++ Sbjct: 581 APGDATEDHSGAVTTDANGAPTTLPGNQDQTTGSPEQSTSASSDQNSDGNSNETSLGQED 640 Query: 263 SATAAGNSAKAAKTSETNARSSETAAGQSASAAAGSKTAAASSASAASTSAGQASASATA 322 S +AA + A T T + A+ + ++ ++ + + S + Sbjct: 641 S-SAASTTDGNAAQPTTEGDGVTTKPVADNTDASSTGSSEVNNGATTTGSGPVETTDFDG 699 Query: 323 AGKSAESAASSASTATTKAGEATEQASAAARSASAAKTSETNAKASETSAESSKTAAASS 382 + A S+ + T A + S T+ T + Sbjct: 700 VPSTTVPAPSATANPVTSAATTDTDMPVVTEAPPGFNPSTVIGHPEWTTNTWITTTTSEG 759 Query: 383 ASSA 386 + Sbjct: 760 SDPT 763 >UniRef50_B7CDC6 Putative uncharacterized protein n=1 Tax=Eubacterium biforme DSM 3989 RepID=B7CDC6_9FIRM Length = 1127 Score = 60.5 bits (144), Expect = 4e-07, Method: Composition-based stats. 
Identities = 63/436 (14%), Positives = 146/436 (33%), Gaps = 19/436 (4%) Query: 1 MAVKISGVLKDGTGKPVQNCTIQLKAKRNSTTVVVNTLASENPDEAGRYSMDVEYGQYSV 60 M+V G DGT + + ++ +E +A E Q Sbjct: 36 MSVFAKGATADGTDINSTPANKEESKIEKTEKELLQEKINEAQKKADEAKEAEEKAQKVY 95 Query: 61 ILLVEG-FPPSHAGTITVYEDSQPGTLNDFLGAMTEDDARPEALRRFELMVEEVARNASA 119 EG + P+ A T+ + +L ++E + + L + + + A Sbjct: 96 DTYNEGTYVPTKANVNTLKSNYDASSLEAQKAIVSELEKQVANLEENQKKLGQANEQKKA 155 Query: 120 VAQNTAAAKKSASDASTSAREAA---------THAADAADSARAASTSAGQAASSAQSAS 170 + + +DA++ +EA T + + + +A + + A Sbjct: 156 LQTQLDKTTNALADANSKLQEAQKKYDALLNGTSEEEISKDVEEKEKALAEAQKALEEAQ 215 Query: 171 SSAGTASTKATEASKSAAAAESSKSAAATSAGAAKTSETNASASLQSAAT---------S 221 ++ T + + TE A A + A + A+TS A +L +A Sbjct: 216 NTVSTLTQQKTETEAKATEATQNVENAQKAYAEAQTSVNEAQKALDTAVANYNEKKAIYD 275 Query: 222 ASTATTKASEAATSARDAAASKEAAKSSETNASSSASSAASSATAAGNSAKAAKTSETNA 281 ++ T ++ ++A A+++ A + ++ A++ T+A S N Sbjct: 276 GASDPTIKAQYEADIKNAENELVTAQTNLQVALENQTAKANAVTSAEKSLSDVNQEILNL 335 Query: 282 RSSETAAGQSASAAAGSKTAAASSASAASTSAGQASASATAAGKSAESAASSASTATTKA 341 + ++ + A ++ + A A A+ + K E+A ++ A Sbjct: 336 DAQIAEKQKALDELNQTINDAETALNEAKAQLETAKANQASKEKELETAKANLEKAKQDV 395 Query: 342 GEATEQASAAARSASAAKTSETNAKASETSAESSKTAAASSASSAASSASSASASKDEAT 401 + + A + A + T + + A++ + A + ++++T Sbjct: 396 TTQQSKVNEAQEAVIAQEKVVTQLRTDKEQAQAKIEQGSKGFFEAYGYTDALKILEEQST 455 Query: 402 RQASAAKSSATTASTK 417 + A +T Sbjct: 456 ANGGSTNIGAENDATS 471 >UniRef50_B4VM02 Putative uncharacterized protein n=3 Tax=Microcoleus chthonoplastes PCC 7420 RepID=B4VM02_9CYAN Length = 1889 Score = 60.2 bits (143), Expect = 4e-07, Method: Composition-based stats. 
Identities = 41/413 (9%), Positives = 111/413 (26%), Gaps = 9/413 (2%) Query: 73 GTITVYEDSQPGTLNDFLGAMTEDDARPEALRRFELMVEEVARNASAVAQNTAAAKKSAS 132 G + ++ + + A+ + PEA + + + A + + Sbjct: 676 GDLKTTLETALQSAQAQIAAIQTPEKPPEAEPISQAIAPSAKSPETKTAPASPGKGEQQK 735 Query: 133 DASTSAREAATHAADAADSARAASTSAGQAASSAQSASSSAGTASTKATEASKSAAAAES 192 + T A+ +AT + A ++ + SA + +E+ Sbjct: 736 PSDTEAKSSATPEQVSQTPESAKPQPVSESPQREKQPEQSAVKGDNTQETPKPKSPESET 795 Query: 193 SKSAAATSAGAAKTSETNASASLQSAATSASTATTKASEAATSARDAAASKEAAKSSETN 252 A + + + + S S + A + + + Sbjct: 796 VAGGDAIAKSSDTEDKPSVEVSQPSEPDTIQRACAACEQEKEEETVQPLLIQRQPMGISP 855 Query: 253 ASSSASSAASSATAAGNS---------AKAAKTSETNARSSETAAGQSASAAAGSKTAAA 303 S S ++ + A ++ + A + A K+ Sbjct: 856 TSDSQATIIQAKLENPIEWARGILDGLRGDASAKQSELQGDAAAKQSGVNRQAQGKSVEI 915 Query: 304 SSASAASTSAGQASASATAAGKSAESAASSASTATTKAGEATEQASAAARSASAAKTSET 363 S S A A+ A+S ++ + ++ + + AS Sbjct: 916 DSQGKGKASEVDAEGKNKASELDADSQGKASEVDAEGKNKGSQLDTDSQGKASEVDAEGK 975 Query: 364 NAKASETSAESSKTAAASSASSAASSASSASASKDEATRQASAAKSSATTASTKATEAAG 423 N + + +K++ + S + T + + + + T+ + Sbjct: 976 NKGSQLDADSQAKSSELDADGKTKGSELQTDSQGKGETLTSDSQTKGSELETDSQTKGSQ 1035 Query: 424 SATAAAQSKSTAESAATRAETAAKRAEDIASAVALEDASTTKKGIVQLSSATN 476 T + + E+ A + + + D + ++ + Sbjct: 1036 LETESQAKGNDLETEAIAKGNDLETEAETQAQELQGDEAGFEQEAENTKTDIE 1088 >UniRef50_A5K4N9 Dynein heavy chain, putative n=2 Tax=Plasmodium RepID=A5K4N9_PLAVI Length = 5331 Score = 59.8 bits (142), Expect = 7e-07, Method: Composition-based stats. 
Identities = 52/349 (14%), Positives = 104/349 (29%), Gaps = 10/349 (2%) Query: 30 STTVVVNTLASENPDEAGRYSMDVEYGQYSVILLVEGFPPSHAGTITVYEDSQPGTLNDF 89 T V T+ +E P E E G V HAG+ + + L++ Sbjct: 3976 EATQVELTVKAEEPTEPEVIPNASEPG-------VTANTNEHAGSASPTAPEEAANLDEP 4028 Query: 90 LGAMTEDDARPEALRRFELMVEEVA--RNASAVAQNTAAAKKSASDASTSAREAATHAAD 147 TE PEA ++E + + A+ + +A+ + E Sbjct: 4029 KEETTEKPEAPEAAVEVAPNLDEPKEGETGAEPEEPKVEAESTEPEAAANPDEPKEEETT 4088 Query: 148 AADSARAASTSAGQAASSAQSASSSAGTASTKATEASKSAAAAESSKSAAATSAGAAKTS 207 A + +A + + T EA+ +A + + + Sbjct: 4089 EKQEAPEVAVE-TAEPEAAANPEAPEAAVETVEPEAAANADEPKEEDTTEKPEEPVVEAE 4147 Query: 208 ETNASASLQSAATSASTATTKASEAATSARDAAASKEAAKSSETNASSSASSAASSATAA 267 + + A+ A + A++ A + N + A Sbjct: 4148 PEEPQVAAEPTEPEAAANPEAPEAAVETVEPVVAAEPAEPEAAANPDGPKEEETTEKQEA 4207 Query: 268 GNSAKAAKTSETNARSSETAAGQSASAAAGSKTAAASSASAASTSAGQASASATAAGKSA 327 +A A E A + T + A+ A + ++ + A A A ++ Sbjct: 4208 LEAAVEAVEPEAAAEPAATPTALESDASPVESEVVAKAEVSSKSEAVVNPAEVDAKPEAP 4267 Query: 328 ESAASSASTATTKAGEATEQASAAARSASAAKTSETNAKASETSAESSK 376 +A A + K E + S + ++ + E +E +K Sbjct: 4268 PAAEVEAPPSARKGEEDEDSTSIESSHLDGSELESDDLDGEEAQSEETK 4316 >UniRef50_C9S6D8 Putative uncharacterized protein n=1 Tax=Verticillium albo-atrum VaMs.102 RepID=C9S6D8_VERA1 Length = 2466 Score = 58.6 bits (139), Expect = 1e-06, Method: Composition-based stats. 
Identities = 72/582 (12%), Positives = 158/582 (27%), Gaps = 7/582 (1%) Query: 11 DGTGKPVQNCTIQLKAKRNSTTVVVNTLASENPDEAGRYSMDVEYGQYSVILLVEGFPPS 70 + +P+ ++ + + V + A VE + P Sbjct: 439 EPASEPITASAVEPASGSAAEPAVEPVSEPADEPPAESAEPAVEPVSETAAEPAAEPAPE 498 Query: 71 HAGTITVYEDSQPGTLNDFLGAMTEDDARPEALRRFELMVEEVARNASAVAQNTAAAKKS 130 A ++P + A L A + + T Sbjct: 499 AAAEPPAKPAAEPAAASVVEPASESVAEPAVELSPEPAAEPAAEPPAESATEPTVEPAPE 558 Query: 131 --ASDASTSAREAATHAADAADSARAASTSAGQAASSAQSASSSAGTASTKATEASKSAA 188 A A A E AT A + AA +A A A + A ++ Sbjct: 559 PVAEPAVEPAPEPATEPAAVPAAEPAAEPAAELAVEPAPQPVAEPAPEPAAEPAAVPASE 618 Query: 189 AAESSKSAAATSAGAAKTSETNASASLQSAATSASTATTKASEAATSARDAAASKEAAKS 248 A A A + A + + AA A+ + + + A + E+ Sbjct: 619 PAAEPAVEPAPQPVAEPAPQPVAEPAPEPAAEPAAVPAVEPAVDSAPESTAEPAAESNVE 678 Query: 249 SETNASSSASSAASSATAAGNSAKAAKTSETNARSSETAAGQSASAAAGSKTAAASSASA 308 + ++ +S +++ A + + ++ + AA + + Sbjct: 679 PAVVPDAEPATEPASEPTIEPTSEPAAEPVVESATEPAEPPAESAVEPSPEPAAEPATNL 738 Query: 309 ASTSAGQASASATAAGKSAESAASSASTATTKAGEATEQASAAARSASAAKTSETNAKAS 368 A ++ A S + + T E+T + A SA + + A S Sbjct: 739 AVEPNDKSVIEPAAKPASGPNTEPVVESPTQPTAESTIETVAEPAVESADEAATDPAVGS 798 Query: 369 ETSAESSKTAAASSASSAASSASSASASKDEATRQASAAKSSATTASTKATEAAGSATAA 428 A + + SSA + A + + T + A+ Sbjct: 799 AVEPALEDVTEAVIKPAEPQAESSAEPVAESTDVPAIESSTQEIAIEPPVTSPSDLASEP 858 Query: 429 A-----QSKSTAESAATRAETAAKRAEDIASAVALEDASTTKKGIVQLSSATNSTSETLA 483 +S + + +++ + STTK +++ S T + Sbjct: 859 TFEITIESAAEVVTESSKDAIEPTAVPTVDPLTEPAAESTTKSAVLRTSPETAPEAAAEP 918 Query: 484 ATPKAVKSAYDNAEKRLQKDQNGADIPDKGCFLNNINAVSKTDFADKRGMRYVRVNAPAG 543 A + + ++ + + ++ + + A + + Sbjct: 919 AVEPSTACSTEDTPEAVTENTIESTSEPTHEPATELAAGTPAGPTIESSAVPDDNPTTES 978 Query: 544 ATSGKYYPVVVMRSAGSVSELASRVIITTATRTAGDPMNNCE 585 G + V+ AS + TA++ P + Sbjct: 979 TAKGSDGEDTLEPVGKPVAPDASAEGVQTASKQTSAPEDEAS 1020 Score = 52.5 bits (123), Expect = 1e-04, Method: Composition-based stats. 
Identities = 56/412 (13%), Positives = 132/412 (32%), Gaps = 1/412 (0%) Query: 73 GTITVYEDSQPGTLNDFLGAMTEDDARPEALRRFELMVEEVARNASAVAQNTAAAKKSAS 132 +V E + + E +P A E + E +A A + A Sbjct: 743 NDKSVIEPAAKPASGPNTEPVVESPTQPTAESTIETVAEPAVESADEAATDPAVGSAVEP 802 Query: 133 DASTSAREAATHA-ADAADSARAASTSAGQAASSAQSASSSAGTASTKATEASKSAAAAE 191 A A SA + S A + + + T ++ + Sbjct: 803 ALEDVTEAVIKPAEPQAESSAEPVAESTDVPAIESSTQEIAIEPPVTSPSDLASEPTFEI 862 Query: 192 SSKSAAATSAGAAKTSETNASASLQSAATSASTATTKASEAATSARDAAASKEAAKSSET 251 + +SAA ++K + + T + +T S ++ + A A + E Sbjct: 863 TIESAAEVVTESSKDAIEPTAVPTVDPLTEPAAESTTKSAVLRTSPETAPEAAAEPAVEP 922 Query: 252 NASSSASSAASSATAAGNSAKAAKTSETNARSSETAAGQSASAAAGSKTAAASSASAAST 311 + + S + T + + T E + ++ ++ S A Sbjct: 923 STACSTEDTPEAVTENTIESTSEPTHEPATELAAGTPAGPTIESSAVPDDNPTTESTAKG 982 Query: 312 SAGQASASATAAGKSAESAASSASTATTKAGEATEQASAAARSASAAKTSETNAKASETS 371 S G+ + + +++A TA+ + ++AS + + A +S A S Sbjct: 983 SDGEDTLEPVGKPVAPDASAEGVQTASKQTSAPEDEASDESDAPEADTSSPPLADTSSAE 1042 Query: 372 AESSKTAAASSASSAASSASSASASKDEATRQASAAKSSATTASTKATEAAGSATAAAQS 431 ++ +++ + + + +A+ S AT ++ + + AA Sbjct: 1043 PALTELEGDDDSATGPNEDEPIKDVQHAISEEAAVLISEATPSNVSSPTKSDETATAAGD 1102 Query: 432 KSTAESAATRAETAAKRAEDIASAVALEDASTTKKGIVQLSSATNSTSETLA 483 + + E+A + A + + D + + LS+ + + + + Sbjct: 1103 EQSTEAAIASSGDAIIFINKVDGGKSEADENEAPSPDLGLSADVDEPAPSPS 1154 >UniRef50_B4VJR6 Putative uncharacterized protein n=1 Tax=Microcoleus chthonoplastes PCC 7420 RepID=B4VJR6_9CYAN Length = 1792 Score = 58.2 bits (138), Expect = 2e-06, Method: Composition-based stats. 
Identities = 62/582 (10%), Positives = 165/582 (28%), Gaps = 6/582 (1%) Query: 82 QPGTLNDFLGAMTEDDARPEALRRFELMVEEVARNASAVAQNTAAAKKSASDASTSAREA 141 QP + + E + + E E A A+ ++ + ++ + + Sbjct: 206 QPVETEEKVEPEAEQELQEEDETI--QRQEMPDSEAEAIPESEVDSIENPEEEPDIQAKV 263 Query: 142 ATHAADAADSARAASTSAGQAASSAQSASSSAGTASTKATEASKSAAAAESSKSAAATSA 201 + ++ AS++ + S + E + + Sbjct: 264 EQPKHNLSELLANASSTPPSSIQRQDDLGESGNPIQQQLVENEEKLEPEAKQEPPELDET 323 Query: 202 GAAKTSETNASASLQSAATSASTATTKASEAATSARDAAASKEAAKSSETNASSSASSAA 261 + + S+ ++ + ++ + N + SS+ Sbjct: 324 IQHQEIPDAEEDEILSSEPQTVQRQSETPDIPDQEKEKSEDYHNFLEIPINVPGTPSSSI 383 Query: 262 SSATAAGNSAKAAKTSETNARSSETAAGQSASAAAGSKTAAASSASAASTSAGQASASAT 321 G + + + + + + + S+ Sbjct: 384 QRQVNFGRLSNQVVQPIKPTSNPGLNRLNTFKSQRIN-SPVNPVNPVLNQSSSHTVERQE 442 Query: 322 AAGKSAESAASSASTATTKAGEATEQASAAARSASAAKTSETNAKASETSAESSKTAAAS 381 ++ + S + S E ++ ++ + + +S +S+ T Sbjct: 443 SSTEEKISRKVTTSEEFRPVKEGEQEEDNKPDTSVSEEVPSEEETTEVSSTKSTLTTNTH 502 Query: 382 SASSAASSASSASASKDEATRQASAAKSSATTASTKATEAAGSATAAAQSKSTAESAATR 441 + + S+ S SK ++ +S + +T ++A A ++K+ + T Sbjct: 503 NDNKNTSNKSQEKPSKALNEQEQPETESKPESETTTGSDAQEVELAVEEAKTELGAKVTE 562 Query: 442 AETAAKRAEDIASAVALEDASTTKKGIVQLSSATNSTSETLAATPKAVKSAYDNAEKRLQ 501 +K +E + A E+A+ K + + A +E T + + + Sbjct: 563 PAQGSKDSEIAQADTAKEEATQQNKSTEESTEAVAVDNEANPETVAGEDTTQGDLDS--- 619 Query: 502 KDQNGADIPDKGCFLNNINAVSKTDFADKRGMRYVRVNAPAGATSGKYYPVVVMRSAGSV 561 + + + + A + T + A + + + S Sbjct: 620 QQTEQTEKVAQTAQAESAVAETSTQMEAAQSNVEQLPTAQPNFVPSQPKAMFLTTEDNSA 679 Query: 562 SELASRVIITTATRTAGDPMNNCEFNGFVMPGGWTDRGRYAYGMFWQYQNNERAIHSIMM 621 + A V+ + NN E GG + A G+ + ++ + + Sbjct: 680 TVGAENVLGSVGDGVVLFQTNNPEQFQTDENGGGLVQQIPASGVAAIEEGKQQESQAALA 739 Query: 622 SNKGDDLRSVFYVDGAAFPVFAFIEDGLSISAPGADLVVNDT 663 D SV V A + I++ + + D + Sbjct: 740 QFMSDGATSVSQVTDAGQTIQPRIQEATTQAQALIDAAIQQN 781 >UniRef50_Q9NU22 Midasin n=31 Tax=Coelomata RepID=MDN1_HUMAN Length = 5596 Score = 58.2 bits (138), Expect = 2e-06, Method: Composition-based stats. 
Identities = 49/503 (9%), Positives = 116/503 (23%), Gaps = 36/503 (7%) Query: 71 HAGTITVYEDSQPGT------------LNDFLGAMTEDDARPEALRRFELMVEEVARNAS 118 H G + E + + E + + S Sbjct: 4762 HMGDLNGEEADKLDERLWGDDDEEEDEEEEDNKTEETGPGMDEEDSELVAKDDNLDSGNS 4821 Query: 119 AVAQNTAAAKKSASDASTSAREAATHAADAADSARAAST--------SAGQAASSAQSAS 170 ++ K+ +A + R + + Sbjct: 4822 NKDKSQQDKKEEKEEAEADDGGQGEDKINEQIDERDYDENEVDPYHGNQEKVPEPEALDL 4881 Query: 171 SSAGTASTKATEASKSAAAAESSKSAAATSAGAAKTSETNASASLQSAATSASTATTKAS 230 ++ + E + + + A ++ + + + Sbjct: 4882 PDDLNLDSEDKNGGEDTDNEEGEEENPLEIKEKPEEAGHEAEERGETETDQNESQSPQEP 4941 Query: 231 EAATSARDAAASKEAAKSSETNASSSASSAASSATAAGNSAKAAKTSETNARSSETAAGQ 290 E S D A +E + + A+ + + K E + E Sbjct: 4942 EEGPSEDDKAEGEEEMDTGADDQDGDAAQHPEEHSEEQQQSVEEKDKEADEEGGENGPAD 5001 Query: 291 S----ASAAAGSKTAAASSASAASTSAGQASASATAAGKSAESAASSASTATTKAGEATE 346 + A AS T + A + A + + E Sbjct: 5002 QGFQPQEEEEREDSDTEEQVPEALERKEHASCGQTGVENMQNTQAMELAGAAPEKEQGKE 5061 Query: 347 QASAAARSASAAKTSETN-----AKASETSAESSKTAAASSASSAASSASSASASKDEAT 401 + + A A+ A+ E+N A T + + S + + Sbjct: 5062 EHGSGAADANQAEGHESNFIAQLASQKHTRKNTQSFKRKPGQADNERSMGDHNERVHKRL 5121 Query: 402 RQASAAKSSATTASTKATEAAGSATAAAQSKSTAESAATRAETAAKRAEDIASAVALEDA 461 R + + + A A K ++ A A+T +++ + Sbjct: 5122 RTVDTDSHAEQGPAQQPQAQVEDADAFEHIKQGSD--AYDAQTYDVASKEQQQSAKDSGK 5179 Query: 462 STTKKGIVQLSSATNSTSETLAATPKAVKSAYDNAEKRLQKDQNGADIPDKGCFLNNINA 521 ++ I T E AA + +K + + ++ + + Sbjct: 5180 DQEEEEIEDTLMDTEEQEEFKAADVEQLKPEEIKSGTTAPLGFDEMEVE-----IQTVKT 5234 Query: 522 VSKTDFADKRGMRYVRVNAPAGA 544 D + + P + Sbjct: 5235 EEDQDPRTDKAHKETENEKPERS 5257 >UniRef50_UPI000194DC45 PREDICTED: similar to mucin 16 n=1 Tax=Taeniopygia guttata RepID=UPI000194DC45 Length = 3422 Score = 58.2 bits (138), Expect = 2e-06, Method: Composition-based stats. 
Identities = 73/512 (14%), Positives = 162/512 (31%), Gaps = 14/512 (2%) Query: 9 LKDGTGKPVQNCTIQLKAKRNSTTVVVNTLASENPDEAGRYSMDVEYGQYSVILLVEGFP 68 LK PVQ+ + + + +TT T+ + + + G P Sbjct: 393 LKTPATTPVQSSSTVIPTQVKATTEGTQTMVTPSMHAQTSQTSSTAPGTSPTSATTSSLP 452 Query: 69 PSHAGTITVYEDSQPGTLNDFLGAMTEDDARPEALRRFELMVEEVARNASAVAQNTAAAK 128 S + S P + A + +T A+ Sbjct: 453 WS-----STLGTSSPALSTPTSTTTLLETTYTAAATSALTSEPSTVPTSMRTEFSTLTAQ 507 Query: 129 KSASDASTSAREAATHAADAADSARAASTSAGQAASSAQSASSSAGTASTKATEASKSAA 188 ++ ST+ + +A ++ Q + S++++ ++ T+++ + + Sbjct: 508 STSRPESTAPASSTGAGTSHFQTASTVVSTMEQTSKVETSSTTTMQSSPTESSVSQPVST 567 Query: 189 AAESSKSAAATSAGAAKTSETNASASLQSAATSASTATTKASEAATSARDAAASKEAAKS 248 AA + + + TS + + + + ++T + + S S+ K+ Sbjct: 568 AATEAATLSTPGPAQMVTSVPGSRPTSAATSGLPLSSTLEITSPTHSTTTPTTSRIPEKT 627 Query: 249 SETNASSSASSAASSATAAGNSAKAAKTSETNARSSETAAGQSASAAAGSKTAAASSASA 308 S + S S ++ T+A S + + S A+ A TA SS Sbjct: 628 STESTSVSPTTRLEGITSAAQSTSEPGSKALTSSSQPEASPIVAVIQNLGTTAEYSSLKT 687 Query: 309 ASTSAGQASASATAAGKSAESAASSASTATTKAGEATEQASAAARSASAAKTSETNAKAS 368 + T S+ + T + A S +++ T+ +S Sbjct: 688 ---------PATTPVQSSSTVIPTHEKATTEGTQTMVTPSMHAQTSQTSSTAPGTSPTSS 738 Query: 369 ETSAESSKTAAASSASSAASSASSASASKDEATRQASAAKSSATTASTKATEAAGSATAA 428 TS+ ++ +S+ + ++ S+ + + T A++A +S + + S A Sbjct: 739 TTSSLPWSSSLGTSSPALSTPTSTTTPLETTYTAAATSALTSEPSTVPTSMRTEFSTLTA 798 Query: 429 AQSKSTAESAATRAETAAKRAEDIASAVALEDASTTKKGIVQLSSATNSTSETLAATPKA 488 + +A A AS V T+K S+ +S +E+ + P Sbjct: 799 QSTSHPESTAPASTTGAGTSHFQTASTVVSTMEQTSKGETSSTSTMQSSNTESSVSQPVT 858 Query: 489 VKSAYDNAEKRLQKDQNGADIPDKGCFLNNIN 520 + Q +P + Sbjct: 859 TAATEAATLSTTGPAQMVTSVPGSRPTSAATS 890 >UniRef50_B9PU60 Putative uncharacterized protein n=3 Tax=Toxoplasma gondii RepID=B9PU60_TOXGO Length = 1920 Score = 58.2 bits (138), Expect = 2e-06, Method: Composition-based stats. 
Identities = 61/488 (12%), Positives = 127/488 (26%), Gaps = 21/488 (4%) Query: 17 VQNCTIQLKAKRNSTTVVVNTLASENPDEAGRYSMDVEYGQYSVILLVEGFPPSHAGTIT 76 + +C + + + + S DE + + + P Sbjct: 1006 LPHCPLLISTSSLQSALSPFFALSSQTDETREKAARGKPSEKKGRRAFALPPLWGLKGGQ 1065 Query: 77 VYEDSQPGTLNDFLGAMTEDDARPEALRRFELMVEEVARNASAVAQNTAAAKKSASDAST 136 SQ N A+ A + + ++A Sbjct: 1066 SLGASQLMAGNAIYAALHLALMTYPAAFDSVSSSQAPRDEDEGAESEAGGSAQAAGPDDE 1125 Query: 137 SAREAATHAADAADSARAASTSAGQAASSAQSASSSAGT------------------AST 178 S EA + A A + Sbjct: 1126 SLVEAKREEEKKEGEKPQGDRRRMEENRRHGEAIRFAVQDRLLKNIAGTVDLLEDLRVAA 1185 Query: 179 KATEASKSAAAAESSKSAAATSAGAAKTSETNASASLQSAATSASTATTKASEAATSARD 238 EA++S + + + + + +SAA + A + Sbjct: 1186 SLGEAAESLKGPKGASGNSQVDEDCTREESRKEGDAEKSAAKDSDDACRREEGQKGEGDG 1245 Query: 239 AAASKEAAKSSETNASSSASSAASSATAAGNSAKAAKTSETNARSSETAAGQSASAAAGS 298 +++ A E A + + +K ++ + + TA A S Sbjct: 1246 KLVTEDEAAEREKRRGHQAEDEGTEDRGTKSEGDDSKKTDEKTKEATTADATKKGAVPAS 1305 Query: 299 KTAAASSASAASTSAGQASASATAAGKSAESAASSASTATTKAGEATEQASAAARSASAA 358 + +A A A+ A G+++ +AA + T + A A A Sbjct: 1306 Q---GPTAKNAKKKGMHAAGPLRALGQASVTAADVLAFLNTICWAGRREEKPAGEKAEEA 1362 Query: 359 KTSETNAKASETSAESSKTAAASSASSAASSASSASASKDEATRQASAAKSSATTASTKA 418 E A+ E +A + A + ++ + A K+E +Q S + Sbjct: 1363 VKPEQEARKEEEAALAPAKKAEETDGVTPDASGTELAKKEETDKQPPKQTSQGLVEARDH 1422 Query: 419 TEAAGSATAAAQSKSTAESAATRAETAAKRAEDIASAVALEDASTTKKGIVQLSSATNST 478 + A A + ++ +A SA + + Q +S ++ Sbjct: 1423 LPVGLLSAATAAAALLEKNEIRLLTASASGKSQDGSASSGPALPSGPAAGAQSASLSSHQ 1482 Query: 479 SETLAATP 486 S + P Sbjct: 1483 SPLVGQQP 1490 >UniRef50_A9RX33 Predicted protein n=1 Tax=Physcomitrella patens subsp. patens RepID=A9RX33_PHYPA Length = 1222 Score = 57.8 bits (137), Expect = 2e-06, Method: Composition-based stats. 
Identities = 58/460 (12%), Positives = 123/460 (26%), Gaps = 5/460 (1%) Query: 77 VYEDSQPGTLNDFLGAMTEDDARPEALRRFELMVEEVARNASAVAQNTAAAKKSASDAST 136 + + ++ ED+ PE + V A + + A Sbjct: 491 YDRNEEDREAKPLSKSVKEDEETPEVAEKGLASPAPVPEEKDAPSFELDTTPLEKTAAPK 550 Query: 137 SAREAATHAADAADSARAASTSAGQAASSAQSA---SSSAGTASTKATEASKSAAAAESS 193 + + A A A + S +G +E + ++ Sbjct: 551 HSEYDPAELVEEAGIAAPALSEDAHVDQPDVEQVHDSKVSGEVEASTSEPVIETESGSAA 610 Query: 194 KSAAATSAGAAKTSETNASASLQSAATSASTATTKASEAATSARDAA--ASKEAAKSSET 251 + +S A + AA A DA + ++T Sbjct: 611 PAEGVSSVEEAVHESITTPVAENLAAEVGEQDGIGVEFPANDNPDADSWRIFKQEIENDT 670 Query: 252 NASSSASSAASSATAAGNSAKAAKTSETNARSSETAAGQSASAAAGSKTAAASSASAAST 311 + +S A + A AA +E +S A +A+ A Sbjct: 671 SQASEGPEANVAEEAKVEELAAAADAEVETTASAGKRELLGGPGAFQVDSASVEVEVAPA 730 Query: 312 SAGQASASATAAGKSAESAASSASTATTKAGEATEQASAAARSASAAKTSETNAKASETS 371 +++ + SA ++ + E ++A+A++ +AS + S Sbjct: 731 RVEESAVTEVEKPTSATEGNNAEKASALVEEEVKQEATASSATASPLDANAAETALELQS 790 Query: 372 AESSKTAAASSASSAASSASSASASKDEATRQASAAKSSATTASTKATEAAGSATAAAQS 431 ES A S S S A +D + + ++ + + + Sbjct: 791 RESELPVAGSEPLSETVSEIQAVPDRDADSPVSEDQAAAIGDGTLEVQHDDAPMDVVESA 850 Query: 432 KSTAESAATRAETAAKRAEDIASAVALEDASTTKKGIVQLSSATNSTSETLAATPKAVKS 491 +++ +TA +++ A + + + SE + V+ Sbjct: 851 PLGNDASTPALDTAQAEEPVASTSSTSATADGVPEVQSREFEIPGAGSEPIPEASLGVEG 910 Query: 492 AYDNAEKRLQKDQNGADIPDKGCFLNNINAVSKTDFADKR 531 A + + A D S D Sbjct: 911 ANEVDVDSPVSAEQDAPAGDDTPVALEQAVPSSLDSPTPG 950 >UniRef50_C7YJJ9 Putative uncharacterized protein n=1 Tax=Nectria haematococca mpVI 77-13-4 RepID=C7YJJ9_NECH7 Length = 662 Score = 57.8 bits (137), Expect = 2e-06, Method: Composition-based stats. 
Identities = 44/407 (10%), Positives = 115/407 (28%) Query: 72 AGTITVYEDSQPGTLNDFLGAMTEDDARPEALRRFELMVEEVARNASAVAQNTAAAKKSA 131 +G + + +P DD PE EV A + + Sbjct: 47 SGDVPANTEPEPAPEQPTQSEPLPDDTPPEVSPTAAEEDPEVPSGEPDPEVPGGAREGPS 106 Query: 132 SDASTSAREAATHAADAADSARAASTSAGQAASSAQSASSSAGTASTKATEASKSAAAAE 191 A E + A + A + + + + + + A + A +++ Sbjct: 107 EAAPEVPSETPSEAPEVPTEAAPEGPAETPSETPGEPSEAPADLTTPAAEPDTQAPDTQA 166 Query: 192 SSKSAAATSAGAAKTSETNASASLQSAATSASTATTKASEAATSARDAAASKEAAKSSET 251 + A T A+ + + A++ T + + +S + A+ T Sbjct: 167 PTTQAPVTQTPEAQPPVNLDGTNTLTPASTTPTDDEEPAPTGSSGGEDTEPTGGAEPEPT 226 Query: 252 NASSSASSAASSATAAGNSAKAAKTSETNARSSETAAGQSASAAAGSKTAAASSASAAST 311 + T +A + + S+T A + + G + + + Sbjct: 227 VETGGGDDDTPEETGGSGETEAPEETPQETGGSDTDAPEPTNTDDGEEPTETRNLVPDTV 286 Query: 312 SAGQASASATAAGKSAESAASSASTATTKAGEATEQASAAARSASAAKTSETNAKASETS 371 + A A++ S +A S +A +T+A + Sbjct: 287 EPTEEPTGAATNPSYQGPEATTGSDDDGEATATDVDQPEETGSEPSATGEDTSATTGSSD 346 Query: 372 AESSKTAAASSASSAASSASSASASKDEATRQASAAKSSATTASTKATEAAGSATAAAQS 431 + ++ + +S ++ S ++ + + + + + A + Sbjct: 347 DDDAEESVVASKTAGGSGDKTSFTTVATSAATSKSIELPLDNAEPGENAKFSTVNGKTAI 406 Query: 432 KSTAESAATRAETAAKRAEDIASAVALEDASTTKKGIVQLSSATNST 478 + ++ + T + + K S+++ T Sbjct: 407 VLSPKANSKADCTINVESPLEIPKDDYVRVIASVKVQKPAGSSSDKT 453 >UniRef50_Q9HR92 Halobacterial transducer protein 6 n=3 Tax=Halobacterium salinarum RepID=HTR6_HALSA Length = 778 Score = 57.8 bits (137), Expect = 3e-06, Method: Composition-based stats. 
Identities = 74/474 (15%), Positives = 150/474 (31%), Gaps = 9/474 (1%) Query: 22 IQLKAKRNSTTVVVNTLASENPDEAGRYSMDVEYGQYSVILLVEGFPPSHAGTITVYEDS 81 I L R + + + A E G Y D++ + + + S T+ Sbjct: 312 IALTLGRGTVRALNDLEAKAAALERGEYDTDLDVARVDELGRLFEAFASLRDTVQAR--- 368 Query: 82 QPGTLNDFLGAMTEDDARPEALRRFELMVEEVARNASAVAQNTAAAKKSASDASTSAREA 141 + + + AR EA A +A ++ A A++ + A + Sbjct: 369 ---IRDANEQQVDAEAARSEAEAAQADAEAAQAEAEAAREESEAQARRLETTAEAFSETM 425 Query: 142 ATHAADAADSARAASTSAGQAASSAQSASSSAGTASTKATEASKSAAAAESSKSAAATSA 201 +AA A A A + + A + A ++ + A+ SA Sbjct: 426 RAYAAGDLTVRLDADVEQAAMADIAAAFNEMAADMEATIADVVAFADEVATASTDASDSA 485 Query: 202 GAAKTSETNASASLQSAATSASTATTKASEAATSARDAAASKEAAKSSETNASSSASSAA 261 A + + + S ++ A+ + A+ + +A+ E +S + ++ AA Sbjct: 486 AAVEQTGRDVSDAVGRIRDRAADQRDQLEAVASETDEMSATIEEVAASADQVAETSQRAA 545 Query: 262 SSATAAGNSAKAAKTSETNARSSETAAGQSASAAAGSKTAAASSASAASTSAGQASASAT 321 + +A+ A AA + + + +A + A Q + A Sbjct: 546 ALGDDGQAAAQDAVAQLEEIEDETQAAATAVDDLEAKMSEIETIVAAITDIAEQTNMLAL 605 Query: 322 AAGKSAESAASSASTATTKAGEATEQASAAARSASAAKTSETNAKASETSAESSKTAAAS 381 A A A A E + A + SA+ + +A ++ ++ Sbjct: 606 NANIEAARADQDGDGFAVVADEVKDLADESKASAAEIEALVAEVRAQTETSVAAMDRIQE 665 Query: 382 SASSAASSASSASASKDEATRQASAAKSSATTASTKATEAAGSATAAAQSKSTAESAATR 441 S + S S E + + A + S + A S + + A Sbjct: 666 RVSDGVETVSETERSLSEIAGRIAEADTGVQEISNAMDDQAASVSDVTTAVGD---VAAL 722 Query: 442 AETAAKRAEDIASAVALEDASTTKKGIVQLSSATNSTSETLAATPKAVKSAYDN 495 E A AE A A A + + + + A ++ + A V + + Sbjct: 723 GEETATEAESTADAAAEQATTLSDVAAQTETLAEHAVALREHAAQFEVAADNEP 776 >UniRef50_A6SPK9 Putative uncharacterized protein n=1 Tax=Botryotinia fuckeliana B05.10 RepID=A6SPK9_BOTFB Length = 3554 Score = 57.8 bits (137), Expect = 3e-06, Method: Composition-based stats. 
Identities = 52/519 (10%), Positives = 126/519 (24%), Gaps = 28/519 (5%) Query: 75 ITVYEDSQPGTLNDFLGAMTEDDARPEALRRFELMVEEVARNASAVAQNTAA-------- 126 I E +P TL + E E + + A ++++ Sbjct: 1592 IPTEEKPEPATLETTEETPAIESQYTEKDLPGEETIPQGEAEPIATPEDSSEPHQGIEVP 1651 Query: 127 --AKKSASDASTSAREAATHAADAADSAR--AASTSAGQAASSAQSASSSAGTASTKATE 182 + +A +E ++ + + + ++ + + Sbjct: 1652 ASIENREPEALEKEQEIEVTTPNSVEQSDLVQDTPGEDDVTELSKDELDPERELAVEEIP 1711 Query: 183 ASKSAAAAESSKSAAATSAGAAKTSE------TNASASLQSAATSASTATTKASEAATSA 236 + A A E S+ A AK E + ++ + + + A + Sbjct: 1712 GEEEAVAMEGSEEEAVDEGERAKVQEIEDLGDDDLKSTEEIVPDAVEEEKSTEDIAPENV 1771 Query: 237 RDAAASKEAAKSSETNASSSASSAASSATAAGNSAKAAKTSETNARSSETAAGQSASAAA 296 + E A + + + S + ++ + E S Sbjct: 1772 VEYVNPSEEALQAGEDKPVDEPISQESDVNLTTDLQHTLPADEEEKLPEIKESNEPSLEE 1831 Query: 297 GSKTAAASSASAASTSAGQASASATAAGKSAESAASSASTATTKAGEATEQA-------- 348 + A+ + +A+ E+ ++ S + E E Sbjct: 1832 TNIENASPEVLIDKPTDLEATPPLEINEPVPETEPANVSGFADPSVETEEIPIVPDHDVD 1891 Query: 349 SAAARSASAAKTSETNAKASETSAESSKTAAASSASSAASSASSASASKDEATRQASAAK 408 S ++ + S + + S + A D Sbjct: 1892 SHTQVPEASGEVSADDLEIPTDSEVIEPFNEEQKVDEETENERLAEHPIDPQETNLKNED 1951 Query: 409 SSATTASTKATEAAGSATAAAQSKSTAESAATRAETAAKRAEDIASAVALEDASTTKKGI 468 A A + + KS +ES A + + ++ + + Sbjct: 1952 REPNNEDIPIENAESVAEPSKEDKS-SESVAEIETPHLDSNDQNEGSAEVDTKDLETEAL 2010 Query: 469 VQLSSATNSTSETLAATPKAVKSAYDNAEKRLQKDQNGADIPDKGCFLNNINAVSKTDFA 528 T +E + E + + D +K + + + Sbjct: 2011 YPSKEETPDQTEEAVELSNDQSNPSPIFETDVPVSE-IDDQDEKPVEVEARDLEMEDGEH 2069 Query: 529 DKRGMRYVRVNAPAGATSGKYYPVVVMRSAGSVSELASR 567 + P+ + V+ + V E S Sbjct: 2070 HSDEVPEKSAEKPSQTLQEESDSEPVVETETYVPESNSH 2108 >UniRef50_UPI0000F2DCD9 PREDICTED: similar to MICAL-like 2 n=1 Tax=Monodelphis domestica RepID=UPI0000F2DCD9 Length = 910 Score = 57.5 bits (136), Expect = 3e-06, Method: Composition-based stats. 
Identities = 70/518 (13%), Positives = 155/518 (29%), Gaps = 7/518 (1%) Query: 41 ENPDEAGRYSMDVEYGQYSVILLVEGFPPSHAGTITVYEDSQPGTLNDFLGAMTEDDARP 100 + +G Y E G + + H+ + + D A P Sbjct: 108 SSTLHSGAYKSTGEPGIFVCMNHHTKASIPHSKSPHPDRKQPAAGYASRTNPVPMDTASP 167 Query: 101 EALRRFELMV--EEVARNASAVAQNTAAAKKSASDASTSAREAATHAADAADSARAASTS 158 LRR + EV +N S ++ + A +S +++ + A +S +S + Sbjct: 168 GVLRRAQEPDKLREVGQNTSQPSREPVRNSPAKGLAQSSGWNSSSVGSSAMNSLSPSSPA 227 Query: 159 AGQAASSAQSASSSAGTASTKATEASKSAAAAESSKSAAATSAGAAKTSETNAS----AS 214 ++ SS S+ + T T+ + + +S + + ++ T + + Sbjct: 228 LQKSPSSTSSSPNPFLTRFTQNSPVGGKPSVTNNSPVGWSPAPRKSEPLTTPSKLDLHTT 287 Query: 215 LQSAATSASTATTKASEAATSARDAAASKEAAKSSETNASSSASSAASSATAAGNSAKAA 274 SA T A S R + S+ A+S + S + S + + Sbjct: 288 SPQGKFSAGATTQSPPSAGASPRGKSDSQATAQSITSQMKSDFRALPQSQANSHIATSEH 347 Query: 275 KTSETNARSSETAAGQSASAAAGSKTAAASSASAASTSAGQASASATAAGKSAESAASSA 334 + ++ S + ++ + S ++A T A A S Sbjct: 348 RLDLGSSSSPNSWTSCASKTQQAREKFFQPSCTSADKPPASKEYGLTFTSPQAAGKAPSG 407 Query: 335 STATTKAGEATEQASAAARSASAAKTSETNAKASETSAESSKTAAASSASSAASSASSAS 394 +T A + + ++A T ++ + S A + S Sbjct: 408 ATVAPGQAVQERTKDKARSFLIQNLKAGLASNGPGSTAAQGPTRSSPTPSHAPTPDSKPD 467 Query: 395 ASKDEATRQASAAKSSATTASTKATEAAGSATAAAQSKSTAES-AATRAETAAKRAEDIA 453 + + + A T ++++ S A+ Q + + + E R + Sbjct: 468 GLRGVSKAKVEPASPRPGTMMELSSKSKPSTPASPQRVKKSPTFSRPSQELLNPRQKPDT 527 Query: 454 SAVALEDASTTKKGIVQLSSATNSTSETLAATPKAVKSAYDNAEKRLQKDQNGADIPDKG 513 S V + +T+ I ++ + + S D G + K Sbjct: 528 SGVNGQGPRSTENQIEDPAAWRSKLKPVEKISSVDKVSEQKEKVTPTPIDTKGLALSKKP 587 Query: 514 CFLNNINAVSKTDFADKRGMRYVRVNAPAGATSGKYYP 551 ++ + K P+ + + P Sbjct: 588 LENSSTSIEITLSSPLKDKASTSASALPSKSQASVPAP 625 >UniRef50_B9CN32 Putative uncharacterized protein n=1 Tax=Atopobium rimae ATCC 49626 RepID=B9CN32_9ACTN Length = 814 Score = 57.1 bits (135), Expect = 4e-06, Method: Composition-based stats. 
Identities = 79/433 (18%), Positives = 129/433 (29%), Gaps = 12/433 (2%) Query: 187 AAAAESSKSAAATSAGAAKTSETNASASLQSAATSASTATTKASEAATSARDAAASKEAA 246 A+ + A A A + T SAS ++ A A T K +E A A + E A Sbjct: 377 ASTLTKEGTTAQDDADAQQRRVTALSASTRTIAEDAKKTTIKVAEVDDRAVKATTTAEEA 436 Query: 247 KSSETNASSSASSAASSATAAGNSAKAAKTSETNARSSETAAGQSASAAAGSKTAAASSA 306 K + ++AA A AA + + T A + T +AA + T A +A Sbjct: 437 KKTVVKVEEQVTTAAKKADAAASKVEEVSTKAEKAAQAVTTVVSDVAAAKAAATEAKKTA 496 Query: 307 SAASTSAGQASASATAAGKSAESAASSASTATTKAGEATEQASAAARSASAAKTSETNAK 366 AA A A A A A + A+ A A A A+ A +A+AAK + Sbjct: 497 DAAGQEAHDAKTQAKTAATQAAAVDGKATEAKQMASAAGTVATQAEETATAAKQAVDKLS 556 Query: 367 ASETSAESSKTAAASSASSAASSASSASASKDEATRQASAAKSSATTASTKATEAAGSAT 426 + ++ + +A+ A ++ + + + ATEA S Sbjct: 557 NAFSTDAEGAHVGSKAAAHTTIDAQGLHVMSG--AKEIATIAQNTVALAKGATEAEISMV 614 Query: 427 AAAQSKSTAESAATRAETAAKRAEDIASAVALEDASTTKKGIVQLSSATNSTSE---TLA 483 +A + + S S T V +A S + Sbjct: 615 DSAFNLNVKRENNPDIGVIQDAVFTGESFSFNPHYSYTTMSSVLKFAAIESEINRGIEVF 674 Query: 484 ATPKAVKSAYDNAEKRLQKDQNGADIPDKGCFLNNINAVSKTDFADKRGMRYVRVNAPAG 543 P+ + AY A + + G + + Sbjct: 675 IKPQGYEVAY-IANENVLSSATGTHKHLINLLTVTPWVTMRVTDGSNVIPKAY---VKWR 730 Query: 544 ATSG-KYYPVVVMRSAGSVSELASRVIITTATRTAGDPMNNCEFNGFVMPGGWTDRGRYA 602 G V V V + + + + G G W D + Sbjct: 731 IYGGFLLLDVYVPAGYSGVHTVEKLPEKWRPADSNYVVLATQQAEGT--AGVWVDGVGGS 788 Query: 603 YGMFWQYQNNERA 615 YG W Y + Sbjct: 789 YGDIWIYNSARGY 801 >UniRef50_D2R8K8 Putative uncharacterized protein n=1 Tax=Pirellula staleyi DSM 6068 RepID=D2R8K8_9PLAN Length = 1678 Score = 56.3 bits (133), Expect = 7e-06, Method: Composition-based stats. 
Identities = 40/388 (10%), Positives = 98/388 (25%), Gaps = 5/388 (1%) Query: 199 TSAGAAKTSETNASASLQSAATSASTATTKASEAATSARDAAASKEAAKSSETNASSSAS 258 A + + + R T Sbjct: 115 DEPAATEQIVERRPPKVIPEYRPEQLLPAEDRPRQDFERPVETQTPEPVEPVTEIVRQPE 174 Query: 259 SAASSATAAGNSAKAAKTSETNARSSETAAGQSASAAAGSKTAAASSASAASTSAGQASA 318 + T + + +A ++ A+ S + + Sbjct: 175 QEKQETPPEPQPVPVPEQVATTEPNVVKRPRPNEAAPRQAEQASKLSRQTKPSEMKVSQV 234 Query: 319 SATAAGKSAESAASSASTATTKAGEATEQASAAARSASAAKTSETNAKASETSAESSKTA 378 + T A+ A + AS + + A+ A +S + + ++++ + Sbjct: 235 TETPQITEPRPTGVEATAAKSTVKRQDPAASPSGKVAADAPSSTLDTPQPRVARQTAQRS 294 Query: 379 AASSASSAASSASSASASKDEATRQASAAKSSATTASTKATEAAGSATAAAQSKSTAESA 438 A SA+ + + + Q + +++ AT+ +T TE A S + + +T+ Sbjct: 295 AEQSATKTPTLERAVATPAATPRSQVAVSETPATSKATTPTELAPSTVSPTKRTATSVEV 354 Query: 439 ATRAETAAKRAEDIASAVALEDASTTKKGIVQLSSATNSTSETLAATPKAVKSAYDNAEK 498 A++ T A + +++ + + S T + A Sbjct: 355 ASKEATDVPLARATPDSKPQRAELPSEQRPQVAQAQQPTPSRQ---TRNTARPDLATAAA 411 Query: 499 RLQKDQNGADIPDKGCFLNNINAVSKTDFADKRGMRYVRVNAPAGATSGKYYPVVVMRSA 558 + + + P A + + + + Sbjct: 412 EVAPGEASPTAQPTEMAPAATALARASAEVSSPTPSTTPATEPNTANTQQAAARIARTTG 471 Query: 559 GSVSELASRVIIT--TATRTAGDPMNNC 584 S + T T +NN Sbjct: 472 QSAPAATANPQATPTRTRATGNPSINNA 499 Score = 44.4 bits (102), Expect = 0.027, Method: Composition-based stats. 
Identities = 80/568 (14%), Positives = 155/568 (27%), Gaps = 7/568 (1%) Query: 15 KPVQNCTIQLKAKRNSTTVVVNTLASENPDEAGRYSMDVEYGQYSVILLVEGFPPSHAG- 73 PV + A NS T + G E + ++ + G + Sbjct: 672 APVAGAPRRAVAASNSATSPAAVESPAAAVAQGSGDQSAEPARMALSRSIAGTAGAGRSP 731 Query: 74 --TITVYEDSQPGTLNDFLGAMTEDDARPEALRRFELMVEEVARNASAVAQNTAAAKKSA 131 + P + E + E + + A A+ ++ Sbjct: 732 NFDRALPGAESPAQVASAAARRAEATQKAEPGDALSPSAPATVARSRSDADLPTASLRAE 791 Query: 132 SDASTSAREAATHAADAADSARAASTSAGQAASSAQSASSSAGTASTKATEASKSAAAAE 191 + +A + T A +A S+ A S + A S + + A +T+ Sbjct: 792 ATEVATAPGSNTTADISASSSAALSRADANARPSQVTGAPGASDVDVGSTQVVAEQGMGR 851 Query: 192 SSKSAAATSAGAAKTSETNASASLQSAA-TSASTATTKASEAATSARDAAASKEAAKSSE 250 +S ++ + S A S + A + +A A A + Sbjct: 852 ASGGGQTQLNFDTQSPQIARRTSSGGAPIVSLAAADLGDTASAPMADGGGQPSAAQPDAT 911 Query: 251 TNASSSASSAASSATAAGNSAKAAKTSETNARSSETAAGQSASAAAGSKTAAASSASAAS 310 T A + ++ S + G S T +++T A S A S +A+ A Sbjct: 912 TLAINRTTAGGESTISGGPSKADEAGPVTEVNTAQTLAQSQVSRADNSDGSASGGGEPAL 971 Query: 311 TSAGQASASATAAGKSAESAASSASTATTKAGEATEQASAAARSASAAKTSETNAKASET 370 T+ + ++A A + + + A ++ Sbjct: 972 TAEEEEEERKRRLARAAAGGAPQLALSGPTLADVAASPMGDGGDGGTPSPEPNAAPSALA 1031 Query: 371 SAESSKTAAASSASSAASSASSASASKDEATRQA--SAAKSSATTASTKATEAAGSATAA 428 + + A A + ++A + A + +T T G TA+ Sbjct: 1032 TNRQQSPDGGAPAGGAPQALAAAGEPGESGAETKGAIAIARAEAAEATPGTPEVGGGTAS 1091 Query: 429 AQSKSTAESAATRAETAAKRAEDIASAVALEDASTTKKGIVQLSSATNSTSETLAATPKA 488 T + A+ + + + + V T + P Sbjct: 1092 PSRAPTGPAFVASAQADVAMVGGMPESGGSPQGAPLEAQGVDGGRIAGGARATADSGPAG 1151 Query: 489 VKSAYDNAEKRLQKDQNGADIPDKGCFLNNIN-AVSKTDFADKRGMRYVRVNAPAGATSG 547 + + A A N V +D A V P GA+ Sbjct: 1152 AMAGSEVAIADSVGGAGSAPGSRSTSAAGNEGPMVDGSDVAGGPARSSVDSPGPLGASVV 1211 Query: 548 KYYPVVVMRSAGSVSELASRVIITTATR 575 P + SA + +EL + T Sbjct: 1212 AEIPEIGPNSAVAQAELDHSMGGMGDTP 1239 >UniRef50_D2PNZ5 Putative uncharacterized protein n=1 Tax=Kribbella flavida DSM 17836 RepID=D2PNZ5_9ACTO Length = 1905 Score = 56.3 bits (133), Expect = 7e-06, Method: Composition-based 
stats. Identities = 62/441 (14%), Positives = 115/441 (26%) Query: 114 ARNASAVAQNTAAAKKSASDASTSAREAATHAADAADSARAASTSAGQAASSAQSASSSA 173 + A + A D+ A ++ A + A S Sbjct: 454 TPDTPASLRRAHDGDPPAPDSPADAWPDNDATSELEAVAGDVTPYAPADPSDETGVLEPL 513 Query: 174 GTASTKATEASKSAAAAESSKSAAATSAGAAKTSETNASASLQSAATSASTATTKASEAA 233 T + A S + +A A S S + + S+ + A+ Sbjct: 514 PTDTAPAAPLSATPDDDTGVIPTLDATAYAGPFPSDAGSFSSDAGSISSEADAEDSGPAS 573 Query: 234 TSARDAAASKEAAKSSETNASSSASSAASSATAAGNSAKAAKTSETNARSSETAAGQSAS 293 S + E ++ E +A SA + A A + A Sbjct: 574 PSDASQSVRYEVDQADEDSARPSALGRSPFVQFNQPGATPASPASDTAPPFVDPDATEPF 633 Query: 294 AAAGSKTAAASSASAASTSAGQASASATAAGKSAESAASSASTATTKAGEATEQASAAAR 353 A ++S A+ + A S + S+ TA + Sbjct: 634 TPYDPSAADSTSPQDATGDSPFAQYEVDQNEDSGTAPPSAQPTAGDNSPFVQFNQPGGRP 693 Query: 354 SASAAKTSETNAKASETSAESSKTAAASSASSAASSASSASASKDEATRQASAAKSSATT 413 +A++A + AS + AA S S A + + A + S + Sbjct: 694 AATSAPPEHDSDLASPFIDPDATEPFTPYDPPAADSTSPAPQDASDGSSIARSGTDSGSP 753 Query: 414 ASTKATEAAGSATAAAQSKSTAESAATRAETAAKRAEDIASAVALEDASTTKKGIVQLSS 473 + + + + S + + + ++ + + Sbjct: 754 SPQHDGTSPFAQRDNTDADPDGFSPFPQYQQPGATPPSPGNQAHRGPSALDRHDQLDEEL 813 Query: 474 ATNSTSETLAATPKAVKSAYDNAEKRLQKDQNGADIPDKGCFLNNINAVSKTDFADKRGM 533 S S ATP A ++ + + G D P+ A Sbjct: 814 DATSPSARRDATPPAPYGQSEDDPDTAAQYEQGGDKPNAAEASAQYEPTDDAPDASSPFA 873 Query: 534 RYVRVNAPAGATSGKYYPVVV 554 RY R ATS P + Sbjct: 874 RYDRTENEPDATSQVARPGDL 894 >UniRef50_B9LNH9 Putative uncharacterized protein n=1 Tax=Halorubrum lacusprofundi ATCC 49239 RepID=B9LNH9_HALLT Length = 783 Score = 55.9 bits (132), Expect = 8e-06, Method: Composition-based stats. 
Identities = 56/428 (13%), Positives = 125/428 (29%), Gaps = 12/428 (2%) Query: 42 NPDEAGRYSMDVEYGQYSVILL----VEGFPPSHAGTITVYEDSQPGTLNDFLGAMTEDD 97 D+ D E G Y+V + E P G +T D+ ED+ Sbjct: 149 ETDDDAFELADEEVGIYTVYEVDLDVREIPDPEATGDVTAAGDAVVVEDEVVDEGTAEDE 208 Query: 98 ARPEALRRFELMVEEVARNASAVAQNTAAAKKSASDASTSAREAATHAADAADSARAAST 157 A + + ++ + + +A +R + Sbjct: 209 ASEDETPEDDPTEDDPVNDDLVRDATLESETDDDGGEPARGVHDGEGGDEAPSPSRERAG 268 Query: 158 SAGQAASSAQ--------SASSSAGTASTKATEASKSAAAAESSKSAAATSAGAAKTSET 209 + ++ +++ + ++ T AS A + + S Sbjct: 269 EPDEPNEPSKIGGATDTAETAATGESEASTETNASTGRGPAGRVGARSTDDGTDDAPSPQ 328 Query: 210 NASASLQSAATSASTATTKASEAATSARDAAASKEAAKSSETNASSSASSAASSATAAGN 269 + + S S AS+A S D ++ + S + S + Sbjct: 329 SVGDRSDVSDRSDSPDAHDASDAPESDVDHPNDDVFSEEEQWREQKSIPALDPSEASDPP 388 Query: 270 SAKAAKTSETNARSSETAAGQSASAAAGSKTAAASSASAASTSAGQASASATAAGKSAES 329 +++ ++ NA T+ + + + ++AA +T +S S Sbjct: 389 GSRSRDETDPNAGGRSTSEAGDRRRSRSAGRGGDTGSAAAPVDDRSPEPRSTEGDRSNRS 448 Query: 330 AASSASTATTKAGEATEQASAAARSASAAKTSETNAKASETSAESSKTAAASSASSAASS 389 A S ++ + + E+ + AA + +A + + + A A+ Sbjct: 449 AVSQRASGSGASERVDERVEKLEAALEAATRERESLEAERDTLATDRDEIADERDELATE 508 Query: 390 ASSASASKDEATRQASAAKSSATTASTKATEAAGSATAAAQSKSTAESAATRAETAAKRA 449 + + KS + A + E+ + +AA T ++ A Sbjct: 509 VDRLESELSTLREERDRLKSELSAARDRLPESDRTISAAEARSGTNLFVRYESKGGATLE 568 Query: 450 EDIASAVA 457 + AV Sbjct: 569 DAHDGAVD 576 >UniRef50_UPI00015B5167 PREDICTED: similar to ENSANGP00000017739 n=1 Tax=Nasonia vitripennis RepID=UPI00015B5167 Length = 2721 Score = 55.9 bits (132), Expect = 9e-06, Method: Composition-based stats. 
Identities = 49/459 (10%), Positives = 126/459 (27%) Query: 74 TITVYEDSQPGTLNDFLGAMTEDDARPEALRRFELMVEEVARNASAVAQNTAAAKKSASD 133 + +P T A + +PE E E+ + + Sbjct: 2112 ETPIEAQPEPDTTESGEAATVKSIEQPEVDTEMEKTTEKPEEKQPEEEKPEEKIPEEEKL 2171 Query: 134 ASTSAREAATHAADAADSARAASTSAGQAASSAQSASSSAGTASTKATEASKSAAAAESS 193 + E + T+ ++ ++ + S T + + + E Sbjct: 2172 EEQTPEEEKPEEQKPEEEKAKQDTTESTDEATGEAQTVSEETITLSTPSEAGESDVKEKP 2231 Query: 194 KSAAATSAGAAKTSETNASASLQSAATSASTATTKASEAATSARDAAASKEAAKSSETNA 253 + + ++S + + +A A+++ + ++ Sbjct: 2232 TESLIETEKETSEPSVELTSSGTIDSKIGVETEGSTAAEDQAAVTEASTEASKVDESSSP 2291 Query: 254 SSSASSAASSATAAGNSAKAAKTSETNARSSETAAGQSASAAAGSKTAAASSASAASTSA 313 S+ + + A + A ++ A Sbjct: 2292 ISAEPTDEDQKSPALAETSTETPVAKDKIDEPEEATSEKPDEEIPSHEELTTVEPTKVEA 2351 Query: 314 GQASASATAAGKSAESAASSASTATTKAGEATEQASAAARSASAAKTSETNAKASETSAE 373 ++ A +S + S +++ + + A+ ++TS + SE E Sbjct: 2352 YPDEETSHAPSESPDKELMSTVFDSSEETSSESEKKDLTTEAAISETSSSEPTVSEKPDE 2411 Query: 374 SSKTAAASSASSAASSASSASASKDEATRQASAAKSSATTASTKATEAAGSATAAAQSKS 433 + A++AS +S+ + S++ + T ++ A+ +T A+ A E + + A + Sbjct: 2412 KEEEPTATAASEEPTSSDAGSSTSEVPTTASTDAEDQSTKAAVPADEQSSVEPSEAITPE 2471 Query: 434 TAESAATRAETAAKRAEDIASAVALEDASTTKKGIVQLSSATNSTSETLAATPKAVKSAY 493 + + + T + P+ ++ Sbjct: 2472 ADQYHVDQQTEEPAAPGAPSEEEQKPVEDHTSTVHPETVKPYTRPDFPDQQMPETDDASI 2531 Query: 494 DNAEKRLQKDQNGADIPDKGCFLNNINAVSKTDFADKRG 532 + KDQ D + + AV G Sbjct: 2532 FPPDGADIKDQFNPDEAEPETDDYDDQAVYGPGTCRYGG 2570 >UniRef50_B6XJ97 Putative uncharacterized protein n=2 Tax=Enterobacteriaceae RepID=B6XJ97_9ENTR Length = 432 Score = 55.9 bits (132), Expect = 1e-05, Method: Composition-based stats. 
Identities = 33/246 (13%), Positives = 68/246 (27%), Gaps = 22/246 (8%) Query: 833 WNAESGAYNVTRSGDSYILVNFYTGVGSCRTLQMKAHYRNGGLFYRSSRDGYGFEEDWAE 892 + G G+ ++ T L K + + +S ++ + Sbjct: 180 YTKAEGDGRYQSKGNYAPAGDYATNTALTNGLNTKLNISSIAQATGTSTTNVMSQKAVTD 239 Query: 893 VYTSKNLPPESYPVGAPIPWPSDTVPSGYALMQGQAFDKSAYPKLAAAYPSGVIPDMRGW 952 + YP+G + + + P+ L G + + + + Sbjct: 240 ALQNAVNLDTIYPIGVVVWFAQNKNPNT--LFPGTKWQYIGENRTIRLAAASGANVL--- 294 Query: 953 TIKGKPASGRAVLSQEQDGIKSHTHSASASSTDLGTKTTSSFDYGTKSTNNTGAHTHSVS 1012 G ++ + H HS S ++T G T D GT + Sbjct: 295 -----STGGSDSITLNASQMPVHNHSFSGTATSSGGHT---HDKGTMNIT---------G 337 Query: 1013 GSTNSAGAHTHSLANVNTASANSGAGSASTRLSVVHNQNYATSSAGAHTHSLSGTAASAG 1072 G + + + + + T + +G +S+G Sbjct: 338 AFAIGGGKSEGQAPGFASGVFSKTTRTLKVNTASGVVDSSVTQINMNAASAWTGNTSSSG 397 Query: 1073 AHAHTV 1078 AH HTV Sbjct: 398 AHTHTV 403 >UniRef50_B0WIK0 Putative uncharacterized protein n=1 Tax=Culex quinquefasciatus RepID=B0WIK0_CULQU Length = 1943 Score = 55.5 bits (131), Expect = 1e-05, Method: Composition-based stats. 
Identities = 74/492 (15%), Positives = 166/492 (33%), Gaps = 12/492 (2%) Query: 98 ARPEALRRFELMVEEVARNASAVAQNTAAAKKSASDASTSAREAATHAADAADSARAAST 157 PE + + L + N + + A + T++ + +++ + + A+ Sbjct: 138 TTPELVEFYYLWKKTPGANNNRPHRRRRAGSLRRRNTRTNSNASNSNSTPPNSNKKEATP 197 Query: 158 SAGQA--ASSAQSASSSAGTASTKATEASKSAAAAESSKSAAATSAGAAKTSETNASASL 215 A SS++ + S S+ + + S+ A SA AA + A+A Sbjct: 198 EPSSAVTESSSRPSPISKEENSSVTEDDISECDSDSSATKAIKESAAAAVAAAAAAAAVA 257 Query: 216 QSAATSASTATTKASEAATSARDAAASKEAAKSSETNASSSASSAASSATAAGNSAKAAK 275 +A+ +A+ A T + + K + ++A + NSA Sbjct: 258 TAASATAAAAGGGGGGGETGEDSPSRMRTRQKPTAKEQQQQQQQQQANANSNSNSAVTTG 317 Query: 276 TSETNARSSETAAGQSASAAAGSKTAAASSASAASTSAGQASASATAAGKSAESAASSAS 335 + + T ++ + + A++ G++ A GK + S Sbjct: 318 NTGKRPKRGGTETPETTPVDSPKTPSKKDEAASGKGQKGKSKAETPIKGKKRANELDPDS 377 Query: 336 TATTKAGEATEQASAAARSASAAKTSETNAKASETSAESSKTAAASSASSAASSASSASA 395 + + +++ +S + +S + S S T + S + S++ A Sbjct: 378 NDDKDSQKRKRSDVSSSSDSSQSSSSSNECSSHLQSPTESITTDSRPGSVLDEAESNSEA 437 Query: 396 SKDEATRQASAAKSSATTASTKATEAAGSATAAAQSKSTAESAATRAETAAKRAEDIASA 455 + + + ++AA + ++ + + S AA+ +T ++ T+ + +D Sbjct: 438 ATEVSINVSAAAATPTAPSTKEEEKDPLSLGPAAEPMATESPESSNPPTSVTKTDDKEDD 497 Query: 456 VALEDASTTKKGIVQLSSATNSTSETLAATPKAVKSAYDNAEKRLQKDQNGADIPDKGCF 515 + A + + ++ T ET TP S E+ + + Sbjct: 498 GTVAAAPASVTAALPTAADTAKLPETEDITPPPTASPTPVTEESIPPAVPTTTPEECSDK 557 Query: 516 LNNINAVSKTDFADKRGMRYVRVNAPAGATSGKYYPVVVMRSAGSVSELASRVIITTATR 575 A + T ++ P A + + +S+LA+ Sbjct: 558 SEEAPAAAATTAPEQPPAIGPIPPTPGQAPNEQEM----------LSKLANMKQEINLQA 607 Query: 576 TAGDPMNNCEFN 587 AG +NN EFN Sbjct: 608 GAGSALNNTEFN 619 >UniRef50_UPI00016C0C40 TPR repeat n=1 Tax=Epulopiscium sp. 'N.t. morphotype B' RepID=UPI00016C0C40 Length = 2606 Score = 55.5 bits (131), Expect = 1e-05, Method: Composition-based stats. 
Identities = 91/737 (12%), Positives = 209/737 (28%), Gaps = 9/737 (1%) Query: 79 EDSQPGTLNDFLGAMTEDDARPEALRRFELMVEEVARNASAVAQNTAAAKKSASDASTSA 138 +D+ + ++ +AS A ++ S +D ++ Sbjct: 1719 DDASDNAKKSSATTAASHADASDNAKKSSATPVASHDDASDNADKSSTTTASHADDASDD 1778 Query: 139 REAATHAADAADSARAASTSAGQAASSAQSASSSAGTASTKATEASKSAAAAESSKSAAA 198 + ++ A+ + + A A +S AT + A++++ ++A Sbjct: 1779 ADESSTTTAASHDDASNNAKKSSATPVASHDDASDNAGKFWATTTASHEDASDNATKSSA 1838 Query: 199 TSAGAAKTSETNASASLQSAATSASTATTKASEAATSARDAAASKEAAKSSETNASSSAS 258 T+A + + NA S + A S A+ A +++ + + A + + +++ + AS Sbjct: 1839 TTAASHDDASDNADESSTTTAASHDDASDNAKKSSATTTASHADDASDDADKSSTTPVAS 1898 Query: 259 SAASSATAAGNSAKAAKTSETNARSSETAAGQSASAAAGSKTAAASSASAASTSAGQASA 318 A ++ A S S + S + + A+ + A+ + + + + + Sbjct: 1899 HANDASDDADESPTITVASHADDASDDADESSTTPVASHANDASDDADESPTITVASHAD 1958 Query: 319 SATAAGKSAESAASSASTATTKAGEATEQASAAARSASAAKTSETNAKASETSAESSKTA 378 A+ + + ++ A A+ A + ++ + + Sbjct: 1959 DASDDADESSTITVASHADDASDDADESSTITVASHANDASDDADESSTITVASHADASD 2018 Query: 379 AASSASSAASSASSASASKDEATRQASAAKSSATTASTKATEAAGSATAAAQSKSTAESA 438 A+ +S+ ++ + AS D + A + + A + ++ +++ Sbjct: 2019 NATKSSATPVASHADDASDDADKSSTTTASHADDASGNADETAPAAIKLTSEIHVLSDND 2078 Query: 439 ATRAETAAKRAEDIASAVALEDASTTKKGIVQLSSATNSTSETLAATPKAVKSAYDNAEK 498 + ++A AS S+T + + A+ A KS+ A Sbjct: 2079 DESVSKIDDILAEAHKSIATTTASHDDASDNADESSTTTAASHDDASDNATKSSATTAAS 2138 Query: 499 RLQKDQNGADIPDKGCFLNNINAVSKTDFADKRGMRYVRVNAPAGATSGKYYPVVVMRSA 558 N + + + T + + + A +S A Sbjct: 2139 HDDASDNAKKSSATPVASHEDASDNATKSSATTAASHDDASDNAKKSSA--TTTASHADA 2196 Query: 559 GSVSELASRVIITTATRTAGDPMNNCEFNGFVMPGGWTDRGRYAYGMFWQYQNNERAIHS 618 ++ + I + N E + A + A S Sbjct: 2197 SDNADETAPAAIKLTSEIHVLSDNTDESVSKIDDILAEAHKSIATTTASHADASNNAKKS 2256 Query: 619 IMMSNKGDDLRSVFYVDGAAFPVFAFIEDGLSISAPGADLVVNDTTYKFGATNPATECIA 678 + D S +A A + + + A + A A I Sbjct: 2257 SATTAASHDDASDNATKSSATTAAAHEDASDNATKSSATTTASHEDASDNADETAPAAIK 2316 Query: 679 ADVILDFKSGRGFYESHSLIVNDNLSCKKLFATDEIVARGGNQIRMIGGEYGALWRNDGA 738 + S + 
+ + L + +A G W A Sbjct: 2317 LTSEIHV-------LSDNTDESVSKIDDILAEAHKSIATTTASHADASDNAGKFWATTTA 2369 Query: 739 KTYLLLTNQGDVYGGWNTLRPFAIDNATGELVIGTKLSASLNGNALTATKLQTPRRVSGV 798 N A DNA + NA ++ T Sbjct: 2370 SHDDASDNAKKSSATTTASHADASDNAKKSSATTAASYDDASDNAKKSSATTTASHADAS 2429 Query: 799 EFDGSKDITLTAAHVAA 815 + T A+H A Sbjct: 2430 DDATKSSATTAASHEDA 2446 >UniRef50_Q84CW8 Putative transmembrane protein n=1 Tax=uncultured bacterium RepID=Q84CW8_9BACT Length = 406 Score = 54.8 bits (129), Expect = 2e-05, Method: Composition-based stats. Identities = 46/201 (22%), Positives = 85/201 (42%), Gaps = 6/201 (2%) Query: 915 DTVPSGYALMQGQAFD-KSAYPKL-AAAYPSGVIPDMRGWTIKGKPASGRAVLSQEQDGI 972 P+G+ L +S YP L A SG+ M G +I + R ++ G Sbjct: 186 AAAPTGWLLFGQTYLSGQSTYPALWAVLVASGLTSWMSGTSIVLPDLADRVLMDGGTLGA 245 Query: 973 KSHTHSASASSTDL-GTKTTSSFDYGTKST-NNTGAHTHSVSGSTNSAGAHTHSLANVNT 1030 ++ + S+ +L + ++G+ ++ N+ HTH+ S +T G H H+ V+ Sbjct: 246 TGGANAVTLSTANLPAHDHSIDHNHGSVTSAGNSVNHTHTFSDTTGGTGEHNHNAWFVDV 305 Query: 1031 ASANSGAGSASTRLSVVHNQNYATSSAGAHTHSLSGTAAS-AGAHAHTVGIGAHTHSVAI 1089 + + +A N + G HTH++SGT + AH H V + + Sbjct: 306 TGGGAASRAAPASTGSGTNAQITIAGGGDHTHTVSGTTGGDSVAHTHAVDLPNFAGTSGS 365 Query: 1090 GSHGHTITVNAAGNAENTVKN 1110 G +T + G+ +N ++N Sbjct: 366 VGSGTAVTTHP-GSPQNQLRN 385 >UniRef50_C1CP01 Pneumococcal surface protein A n=75 Tax=Streptococcus pneumoniae RepID=C1CP01_STRZT Length = 724 Score = 54.4 bits (128), Expect = 2e-05, Method: Composition-based stats. 
Identities = 90/696 (12%), Positives = 182/696 (26%), Gaps = 35/696 (5%) Query: 106 FELMVEEVARNASAVAQNTAAAKKSASDASTSAREAATHAADAADSARAASTSAGQAASS 165 F V R + + + A+K A + A H +A A A + Sbjct: 21 FVTSQPTVVRAEESPVASQSKAEKDYDAAVKKSEAAKKHYEEAKKKAEDAQKKYDEDQKK 80 Query: 166 AQSASSSAGTASTKATEASKSAAAAESSKSAAATSAGAAKTSETNASASLQSAATSASTA 225 ++ + AS K EA+K A + A+ ++ A ++ A A Sbjct: 81 TEAKAEKERKASEKIAEATKEVQQAYLAYLQASN-----ESQRKEADKKIKEATQRKDEA 135 Query: 226 TTKASEAATSARDAAASKEAAKSSETNASSSASSAASSATAAGNSAKAAKTSETNARSSE 285 + T+ S+ A + + + A A E A+ E Sbjct: 136 EAAFATIRTTIVVPEPSELAETKKKAEEAKAEEKVAKRKYDYATLKLALAKKEVEAKELE 195 Query: 286 TAAGQSASAAAGSKTAAASSASAA-STSAGQASASATAAGKSAESAASSASTATTKAGEA 344 Q + + A A A A+ A +A A Sbjct: 196 IEKLQYEISTLEQEVATAQHQVDNLKKLLAGADPDDGTEVIEAKLKKGEAELNAKQAELA 255 Query: 345 TEQASAAARSASAAKTSETNAKASETSAESSKTAAASSASSAASSASSASASKDEATRQA 404 +Q S +T + + + E+ A + + ++ + A Sbjct: 256 KKQTELEKLLDSLDPEGKTQDELDKEAEEAELDKKADELQNKVADLEKEISNLEILLGGA 315 Query: 405 SAAKSSATTASTKATEAAGSATAAAQSKSTAESAATRAETAAKRAEDIASAVALEDASTT 464 +A + A + A A + + +S +T + ++ A + A Sbjct: 316 DPEDDTAALQNKLAAKKAELAKKQTELEKLLDSLDPEGKTQDELDKEAEEAELDKKADEL 375 Query: 465 KKGIVQLSSATNSTSETLAATPKAVKSA---YDNAEKRLQKDQNGADIPDKGCFLNNINA 521 + + L ++ L +A A K+ + ++ ++ L Sbjct: 376 QNKVADLEKEISNLEILLGGADSEDDTAALQNKLATKKAELEKTQKELDAALNELGPDGD 435 Query: 522 VSKTDFADKRGMRYVRVNAPAGATSGKYYPVVVMRSAGSVSELASRVIITTATRTAGDPM 581 +T + + P A + + A Sbjct: 436 EEETPAPAPQPEQPAPAPKPEQPAPAPKPEQPAPAPKPEQPAPAPKP-----EQPAKPEK 490 Query: 582 NNCEFNGFVMPGGWTDRGRYAYGMFWQYQNNERAIHSIMMSNKGDDLRSVFYVDGAAFPV 641 E P + GM++ Y + ++ + + N G A V Sbjct: 491 PAEEPTQPEKPATPKTGWKQENGMWYFYNTD-GSMATGWLQNNGSWYYLNANGSMATGWV 549 Query: 642 FAFIEDGLSISAPGADLVVNDTTYK----FGATNPATECIAADVILDFKSGRGFYESHSL 697 + + + +K + N + A L + +G +Y + Sbjct: 550 K---DGDTWYYLEASGAMKASQWFKVSDKWYYVN--SNGAMATGWLQY-NGSWYYLN--- 600 Query: 698 IVNDNLSCKKLFATDEIVARGGNQIRMIGGEYGALWRNDGAKTYLLLTNQGDVYGGWNTL 757 AT + G G+ W Y L N G Sbjct: 601 
-------ANGDMATGWLQYNGSWYYLNANGDMATGWAKVNGSWYYLNANGAMATGWAKVN 653 Query: 758 RPFAIDNATGELVIGTKLSASLNGNALTATKLQTPR 793 + NA G + G + ++ + Sbjct: 654 GSWYYLNANGSMATGWVKDGDTWYYLEASGAMKASQ 689 >UniRef50_Q6ZZ82 Eukaryotic initiation factor 4G n=2 Tax=Echinacea RepID=Q6ZZ82_SPHGR Length = 1745 Score = 54.4 bits (128), Expect = 2e-05, Method: Composition-based stats. Identities = 59/494 (11%), Positives = 122/494 (24%), Gaps = 19/494 (3%) Query: 75 ITVYEDSQPGTLNDFLGAMTEDDARPEALRRFELMVEEVARNASAVAQNTAAAKKSASDA 134 I + + S + D L T + P + R + V+ ++ + AA + Sbjct: 269 IRIVDPSTNQDVTDTLLRDTSGSSSPHSGRSSANVTPPVSTSSEDAKRIQAAFASKVAMV 328 Query: 135 STSAREAATHAADAADSARAASTSAGQAASSAQSASSSAGTASTKATEASKSAAAAES-- 192 +T++ + ++ Q Q + Sbjct: 329 ATASDLPQSDIPPPHRVPPQSAPQQPQTQQPPQQHQPVPQPIPQPVPQQLPMQPGQPPVV 388 Query: 193 --SKSAAATSAGAAKTSETNASASLQSAATSASTATTKASEAATSARDAAASKEAAKSSE 250 A+ A + E A + AA A Sbjct: 389 VMPGGQPVPVVAGGNFPVVAPGAAPVVAQVPSQPVEPAVPEGAPNTVVAATPVAEAVQPP 448 Query: 251 TNASSSASSAASSATAAGNSAKAAKTSETNARSSETAAGQSASAAA----GSKTAAASSA 306 +A++A + + T A+ A +A A Sbjct: 449 RPVPQAAAAAVPAPLPVQQPVQIQSVPGTPAQQIPPEAIPAAVQPPVNPVTPAVPPAGPP 508 Query: 307 SAASTSAGQASASATAAGKSAESAASSASTATTKAGEATEQASAAARSASAAKTSETNAK 366 + + A TA S AS + +T +EQ A A +T Sbjct: 509 AGPPANVQAAPVQETAPPSSKHELASVNTASTVGRASPSEQEDAPDSDAVPPVVQQTVQP 568 Query: 367 ASETSAESSKTAAASSASSAASSASSASASKDEATRQASAAKSSATTASTKATEAAGSAT 426 +ET + +++ + S + +K +A+T S Sbjct: 569 TTETKPNQVDDDSKQISANNP----AVQQSMPASVVPQPESKEAASTLPESKDGKKKSQK 624 Query: 427 AAAQSKSTAESAATRAETAAKRAEDIASAVALEDASTTKKGIVQLSSATNSTSETLAATP 486 Q + + A K + + E+ + + ++ ET Sbjct: 625 KRFQELDKKTTKGSDEFDAYKTDDVQEPTPSQENPGKEVEPEPEAKASKEPVEET----- 679 Query: 487 KAVKSAYDNAEKRLQKDQNGADIPDKGCFLNNINAVSKTDFADKRGMRYVRVNAPAGATS 546 V+ + + + Q D + + ++K + Sbjct: 680 --VQDDIPSEKPKETTTQPQEDKKSTEKTDESPQSDVAKRKSEKESSVVSEKESEETVVP 737 Query: 547 GKYYPVVVMRSAGS 560 P V + S Sbjct: 738 DTSVPAKVEKEEKS 751 >UniRef50_Q58MY1 Predicted protein n=1 Tax=Prochlorococcus phage P-SSM2 
RepID=Q58MY1_BPPRM Length = 597 Score = 54.4 bits (128), Expect = 3e-05, Method: Composition-based stats. Identities = 40/175 (22%), Positives = 67/175 (38%), Gaps = 13/175 (7%) Query: 905 PVGAPIPW--PSDTVPSGYALMQGQAFDKSAYPKLAAAYPSGVIPDMRGWTIKGKPASGR 962 P G + W + +PSG+ L G K S D G + Sbjct: 357 PAGVVVMWSGAQNAIPSGWVLCDGNNSSPDLRDKFVIGAGSNYAVDNTGGSADAVVVDHS 416 Query: 963 AVLSQEQDGIKSHTHSASASSTDLGTKTTSSFDYGTKSTNNTGAHTHSVSGSTNSA---- 1018 S G +HTHS SAS + + + S G+ + + +G+HTHS SGS + Sbjct: 417 HSASTSVSGAGAHTHSFSASDSHTHSFSGS----GSDTFSGSGSHTHSFSGSGSHGHSLS 472 Query: 1019 ---GAHTHSLANVNTASANSGAGSASTRLSVVHNQNYATSSAGAHTHSLSGTAAS 1070 AH H+ A +GS + SV ++ + ++ + + + + S Sbjct: 473 LSDSAHQHTSAIPAQNQVAGNSGSQTIWGSVTNSPTWGATANVSGSADSASVSIS 527 >UniRef50_A8N2G9 Putative uncharacterized protein n=1 Tax=Coprinopsis cinerea okayama7#130 RepID=A8N2G9_COPC7 Length = 1150 Score = 54.0 bits (127), Expect = 3e-05, Method: Composition-based stats. Identities = 60/419 (14%), Positives = 125/419 (29%), Gaps = 1/419 (0%) Query: 52 DVEYGQYSVILLVEGFPPSHAGTITVYEDSQPGTLNDFLGAMTEDDARPEALRRFELMVE 111 + QY I VE S + IT + + A P Sbjct: 610 TNQPEQYQQISPVEAKADSFSPDITNANQDKAKLDSPSTIARAPSWDVPSDNTAARRAGP 669 Query: 112 EVARNASAVAQNTAAAKKSASDASTSAREAATHAADAADSARAASTSAGQAASSAQSASS 171 A + + +S + A A + + S G + Sbjct: 670 FDAVQPTPLNTTVPTIPSQSSPWGQPTSASFQGPAQAKEVSPWGVPSQGPNEPAWSEPQQ 729 Query: 172 SAGTASTKATEASKSAAAAESSKSAAATSAGAAKTSETNASASLQSAATSASTATTKASE 231 +A + A + + ES + A + T E A + +S + + T + Sbjct: 730 AAPEPTVDQQGAVEERSPVESRTAPAPGVEVSESTPEPAAPSPTKSKGSKSPTTASTPLT 789 Query: 232 AATSARDAAASKEAAKSSETNASSSASSAASSATAAGNSAKAAKTSETNARSSETAAGQS 291 A + ++ + K+ A +++ + AK +E + A + Sbjct: 790 DAGESTESPVVAQPPKAPWAKADKKKGKMSTTISLREIQDAEAKLAEARKEKEKERARLA 849 Query: 292 ASAAAGSKTAAASSASAASTSAGQASASATAAGKSAESAASSASTATTKAGEATEQASAA 351 A+AAA S + + S G ++ A + +S + T A + A Sbjct: 850 AAAAATSG-DSKEDIQPFTASWGLPTSQAGSRSTILPKEVTSTTPTPTPAAAPVWTSVAK 908 Query: 352 ARSASAAKTSETNAKASETSAESSKTAAASSASSAASSASSASASKDEATRQASAAKSSA 411 A 
A + A + + + A++ + + A+ ++ + Sbjct: 909 APIAKKTMKEIQEEEERRKKAAAKEVSVAAAPVKRGYAEQAVKATPTQSAAPSPGNAWVT 968 Query: 412 TTASTKATEAAGSATAAAQSKSTAESAATRAETAAKRAEDIASAVALEDASTTKKGIVQ 470 AS K + GSA A + ++ A T + + ++A + + Sbjct: 969 VGASGKPSTPVGSAPRPAATVISSVPAGTSSMKTTTPSTTRSAASVARPSPAPGAKVED 1027 >UniRef50_UPI00006A011C mucin 16 (MUC16), mRNA n=3 Tax=Xenopus (Silurana) tropicalis RepID=UPI00006A011C Length = 1660 Score = 53.6 bits (126), Expect = 4e-05, Method: Composition-based stats. Identities = 83/557 (14%), Positives = 203/557 (36%), Gaps = 4/557 (0%) Query: 17 VQNCTIQLKAKRNSTTVVVNTLASENPDEAGRYSMDVEYGQYSVILLVEGFPPSHAGTIT 76 Q+ T L + +T NT A+ A S + Q + + E + T Sbjct: 30 TQSATTPLTSTEVPSTTTENTSATHTTPSASSLSTE----QKTTAVTHETQSATTPLAST 85 Query: 77 VYEDSQPGTLNDFLGAMTEDDARPEALRRFELMVEEVARNASAVAQNTAAAKKSASDAST 136 + T + + E + S + + +++S T Sbjct: 86 EVPSTTTETTSATHTTPSASSISTEQETTAVTQETQSVTTPSTSTEVPSTTTETSSATHT 145 Query: 137 SAREAATHAADAADSARAASTSAGQAASSAQSASSSAGTASTKATEASKSAAAAESSKSA 196 + ++ + + SA +S + S++ T S T S S+ + E +A Sbjct: 146 TPSASSLSTEQKTTAVTHETQSATTPLASTEVPSTTTETTSATHTTPSASSISTEQETTA 205 Query: 197 AATSAGAAKTSETNASASLQSAATSASTATTKASEAATSARDAAASKEAAKSSETNASSS 256 + T T+ S+ TS++T TT + + T+ + + A +S+ +S+ Sbjct: 206 VMQETQSVTTPSTSTEVPSTSSETSSATHTTPSVSSITTEQKSTAVTHETQSATKPLTST 265 Query: 257 ASSAASSATAAGNSAKAAKTSETNARSSETAAGQSASAAAGSKTAAASSASAASTSAGQA 316 + ++ T++ + + +S T +++ G +++ + T S+ + S++ Sbjct: 266 EVPSTTTETSSASHTTPSVSSITTEKTTAVTHGTQSASTPLTSTEVPSTTTETSSAIHFT 325 Query: 317 SASATAAGKSAESAASSASTATTKAGEATEQASAAARSASAAKTSETNAKASETSAESSK 376 ++ +++ +A + + + + + +TE S +++SA ++ + + S S Sbjct: 326 LSNISSSAGQKTTAVTHETQSASTSLTSTEVPSTTTKASSAIHSTPSVSSISTEQKTSIV 385 Query: 377 TAAASSASSAASSASSASASKDEATRQASAAKSSATTASTKATEAAGSATAAAQSKSTAE 436 T SA++ +S S S + ++ + +S+++ K T + E Sbjct: 386 THETQSATTPLTSTEVLSTSTETSSAIQTTVSTSSSSTEQKTTFGTQETQRVTFPLTRTE 445 Query: 437 SAATRAETAAKRAEDIASAVALEDASTTKKGIVQLSSATNSTSETLAATPKAVKSAYDNA 496 +T ET++ ++ TT S T TS + +T + Sbjct: 446 
VPSTTNETSSAIHTTPSALSISTKQETTAVTHGTQSVTTPLTSTEVPSTSTGTSNKTTTV 505 Query: 497 EKRLQKDQNGADIPDKGCFLNNINAVSKTDFADKRGMRYVRVNAPAGATSGKYYPVVVMR 556 + Q + +K ++ T ++ + ++ + +T + V Sbjct: 506 TQETQNVTIPSTSAEKPSTTTETSSAIHTTPSEASSAIHSTPSSSSISTKQETSAVTRET 565 Query: 557 SAGSVSELASRVIITTA 573 + + S +++ V T+ Sbjct: 566 QSATTSFISTEVPSTST 582 >UniRef50_UPI00006CB316 hypothetical protein TTHERM_00456860 n=1 Tax=Tetrahymena thermophila RepID=UPI00006CB316 Length = 1627 Score = 53.2 bits (125), Expect = 5e-05, Method: Composition-based stats. Identities = 55/533 (10%), Positives = 144/533 (27%), Gaps = 5/533 (0%) Query: 81 SQPGTLNDFLGAMTEDDARPEALRRFELMVEEVARNASAVAQNTAAAKKSASDASTSARE 140 LN + + + + V ++ N +A+ + + ++ Sbjct: 584 EIEEPLNQQTVSTQPSQNNSAQTAALQNIQKSVPPKVNSTPSNQGSAQIKPQQSQQNVQK 643 Query: 141 AATHAADAADSARAASTSAGQAASSAQSASSSAGTASTKATEASKSAAAAESSKSAAATS 200 A A+SA + ++ + + + + ++A T Sbjct: 644 PQNPANQQANSAPNTKPQSANPQTTQNQQQNKINNPAATTNSNTSNVLNQNKPNTSAVTQ 703 Query: 201 AGAAKTSETNASASLQSAATSASTATTKASEAATSARDAAASKEAAKSSETNASSSASSA 260 K S +A S S+ A+ + + + N ++ ++A Sbjct: 704 NSQTKPLNQTNGQSSNGSAISNSSQIKPANPSQQNQN--KPQTQNQTGVVQNKPANNTTA 761 Query: 261 ASSATAAGNSAKAAKTSETNARSSETAAGQSASAAAGSKTAAASSASAASTSAGQASASA 320 + A + N+ T N + + Q++ ++ ++S ++ S+ Sbjct: 762 GTPAGSTPNTTPKNTTQVQNQQGTNQKPVQNSQGIQQNQQNRSASQNSVSSGTNTTQNKV 821 Query: 321 TAAGKSAESAASSASTATTKAGEATEQASAAARSASAAKTSETNAKASETSAESSKTAAA 380 +++ + S +A Q + + + AA Sbjct: 822 AVQNQNSSNIKPQNSVNPNQAKPVNSQQAQGNTQNPVKTQVNPAIGTGQGQKTQANPQAA 881 Query: 381 SSASSAASSASSASASKDEATRQASAAKSSATTASTKATEAAGSATAAAQSKSTAESAAT 440 + S+ A K + Q + +++T ++ + A + S + T Sbjct: 882 VKIQNTQ---SNPQALKPQTPGQNTNLNKPGQSSATNISKPNPANPAGQVNASKPVNNGT 938 Query: 441 RAETAAKRAEDIASAVALEDASTTKKGIVQLSSATNSTSETLAATPKAVKSAYDNAEKRL 500 + T ++ A++ + + Q + + + + L Sbjct: 939 ASNTNVQKTPQQQQNQAIKQQNPQNQSQSQQVKPVQTQNTQRPQPQQPGTINQGQKQPIL 998 Query: 501 QKDQNGADIPDKGCFLNNINAVSKTDFADKRGMRYVRVNAPAGATSGKYYPVVVMRSAGS 560 + Q + + N+ T + + A T K S Sbjct: 999 
AQKQVNPNGTQIQPGVGASNSQRSTPATPTKATHSNQYRLTASTTPTKQASTPTKIPQSS 1058 Query: 561 VSELASRVIITTATRTAGDPMNNCEFNGFVMPGGWTDRGRYAYGMFWQYQNNE 613 ++ +NN + G + + NN Sbjct: 1059 TPTKTVSSNSQQKKESSQTHVNNFSLSTSSTNQGEKNYSQSQQTKDSGSSNNP 1111 >UniRef50_C3ZPV4 Putative uncharacterized protein n=1 Tax=Branchiostoma floridae RepID=C3ZPV4_BRAFL Length = 1379 Score = 52.8 bits (124), Expect = 7e-05, Method: Composition-based stats. Identities = 56/542 (10%), Positives = 145/542 (26%), Gaps = 17/542 (3%) Query: 51 MDVEYGQYSVILLVEGFPPSHAGTITVYEDSQPGTLNDFLGAMTEDDAR--------PEA 102 + +E + + PP + ++ + + + + + E + PE Sbjct: 636 LHLEPETGEARTVTDTEPPWKSFDLSPTSKKEMSSELESVAKVDEVSSPVTIETLSEPEV 695 Query: 103 LRRFELMVEEVARNASAVAQNTAAAKKSASDASTSAREAATHAADAADSARAASTSAGQA 162 ++ E E+ A S+ +S+ + ++ + + +A + + Sbjct: 696 VKIDEQFEEKALTFAKPEETYRITESASSESSSSESEKSNVDSDTSTQTASIETPLKTEK 755 Query: 163 ASSAQSASSSAGTASTKATEASKSAAAAESSKSAAATSAGAAKTSETNASASLQSAATSA 222 + + + S A+ A S +S Sbjct: 756 EEEVPVTPQKQVEPTVVQESKTFESILDSPSMQKKASEPTAVTPSVGIGGDPTDDVVSSV 815 Query: 223 STATTKASEAAT---------SARDAAASKEAAKSSETNASSSASSAASSATAAGNSAKA 273 +A T + S ++ ++ + S+ ++ Sbjct: 816 GSALTSWRTRKPWRTRGARFDDDSSSGRSTPESEKTKDASFFSSLLGSTKEDTIKGIRSD 875 Query: 274 AKTSETNARSSETAAGQSASAAAGSKTAAASSASAASTSAGQASASATAAGKSAESAASS 333 K +E S T + + S S+ +A + + ++ +SS Sbjct: 876 KKRTEVARSDSRTPRFKYKGFLKTYEDDEDEDRKGDSFSSPEAKTDIESQREESQEKSSS 935 Query: 334 ASTATTKAGEATEQASAAARSASAAKTSETNAKASETSAESSKTAAASSASSAASSASSA 393 + ++ +A A + T A+ + + + Sbjct: 936 NEIEEKLSESSSFEAPARKTGQFNGPSVSTTAQKDQVTDTPPMRNIDDIIQHLNNRPDKQ 995 Query: 394 SASKDEATRQASAAKSSATTASTKATEAAGSATAAAQSKSTAESAATRAETAAKRAEDIA 453 ++S D++ R +S T + + Q + + Sbjct: 996 NSSVDQSPRPEHVPAASETPSRPPLPSPDQLTASQRQLAQQDSAPPAAPGRPPLPSTPNL 1055 Query: 454 SAVALEDASTTKKGIVQLSSATNSTSETLAATPKAVKSAYDNAEKRLQKDQNGADIPDKG 513 S A T ++ +S+ + TS T+ A + ++ + AD P Sbjct: 1056 SRTESYKALTPEQQSEIISAKSTPTSRTVKAEVENGTPKVTETTPKVAETPKVADTPPNI 1115 Query: 514 CFLNNINAVSKTDFADKRGMRYVRVNAPAGATSGKYYPVVVMRSAGSVSELASRVIITTA 
573 + + + T+ K + S +++ T+ Sbjct: 1116 PDSTPRTPDTSAPKVAEVTPKVPVTTPKVPETTPKIQAETTPKVPDSTPKVSQTTPKTSE 1175 Query: 574 TR 575 R Sbjct: 1176 AR 1177 >UniRef50_Q2Y2L9 Major surface glycoprotein G n=13 Tax=Avian metapneumovirus RepID=VGLG_AMPV1 Length = 585 Score = 52.8 bits (124), Expect = 8e-05, Method: Composition-based stats. Identities = 53/465 (11%), Positives = 119/465 (25%), Gaps = 12/465 (2%) Query: 120 VAQNTAAAKKSASDASTSAREAATHAADAADSARAASTSAGQAASSAQSASSSAGTASTK 179 + + + T D + +++ S + + + Sbjct: 68 TPAPRDNKTNTENTKKETTFHTTTTTRDPEVRETKTTKPKTNEGATSPSRNLTTKGDIHQ 127 Query: 180 ATEASKSAAAAESSKSAAATSAGAAKTSETNASASLQSAATSASTATTKASEAATSARDA 239 T A+ A + SK K + T S+ + + + TT + A+ A Sbjct: 128 TTRATTEAELEKQSKQTIEPDTSTKKHTPTRPSSESPTTTQATAQLTTPTAPKASIAPKN 187 Query: 240 AASKEAAKSSETNASSSASSAASSATAAGNSAKAAKTSETNARSSETAAGQSASAAAGSK 299 + + T +S A + A + KA + + + Sbjct: 188 RQATTKKTETGTTTTSRAKKTNNPTETATTTLKATTETGKGKEGPTQHTIKEQPETTAGE 247 Query: 300 TAAASSASAASTSAGQASASATAAGKSAESAASSASTATTKAGEATEQASAAARSASAAK 359 T S S A A + S+ ++ + + A + Sbjct: 248 TTTPQSRRTTSRPAPTTKTEEEAETTKTRTTKSTQTSTGPPGPTRSTPSKTATENNKRTT 307 Query: 360 TSETNAKASETSAESSKTAAASSASSAASSASSASASKDEATRQASAAKSSATTASTKAT 419 T + A+ S + ++T A + + + + + T + + + Sbjct: 308 TIKRPNTANTDSRQQTRTTAEQDRQIQTKAKPTTNGAHAQTTTTPEHNTDTTNSTKESSK 367 Query: 420 EAAGSATAAAQSKSTAESAATRAETAAKRAEDIASAVALEDASTTKKGIVQLSSATNSTS 479 E + ++++ + E A+ A R A+ + T+ +S T + Sbjct: 368 EDKTTRDPSSKTPTDQEDASKGTTAANPRKNTEANTRTPPTTTPTRHTTESATSTTGDKT 427 Query: 480 ETLAATPKAVKSAYDNAEKRLQKDQNGADIPDKGCFLNNINAVSKTDFADKRGMRYVRVN 539 + K+ + + L+N T + + Sbjct: 428 KAKTTRWKSTADRQPIRNSTTAETKTAQSKQPTPKQLSNNTTPENTTPPNNKSSSQTD-- 485 Query: 540 APAGATSGKYYPVVVMRSAGSVSELASRVIITTATRTAGDPMNNC 584 S V PMN C Sbjct: 486 ----------AAPTEEIEIRSSLWRRRYVYGPCRENVLEHPMNPC 520 >UniRef50_Q1JCN4 Extracellular matrix binding protein n=6 Tax=Streptococcus pyogenes RepID=Q1JCN4_STRPB Length = 1755 Score = 52.5 bits (123), Expect = 1e-04, Method: Composition-based stats. 
Identities = 68/497 (13%), Positives = 126/497 (25%), Gaps = 1/497 (0%) Query: 50 SMDVEYGQYSVILLVEGFPPSHAGTITVYEDSQPGTLNDFLGAMTEDDARPEALRRFELM 109 ++ QY V+ + G + + D + A Sbjct: 1203 GIENINNQYQHGDGVDVRKATAKGDLEKEAAKVKALIAKDPTLTQADKDKQTAAVDAAKN 1262 Query: 110 VEEVARNASAVAQNTAAAKKSASDASTSAREAATHAADAADSARAASTSAGQAASSAQSA 169 A + + A A A ++A+A + + Sbjct: 1263 TAIAAVDKATTADGVNQELGKGITAINKAYRPGEGVKARKEAAKADLEKEAAKVKALIAK 1322 Query: 170 SSSAGTASTKATEASKSAAAAESSKSAAATSAGAAKTSETNASASLQSAATSASTATTKA 229 + A A+ AA + + + E + + A Sbjct: 1323 DPTLTQADKDKQTAAVDAAKNTAIAAVDKATTAEGINQELGKGITAINKAYRPGEGVKAR 1382 Query: 230 SEAATSARDAAASKEAAKSSETNASSSASSAASSATAAGNSAKAAKTSETNARSSETAAG 289 EAA + + A+K A + + A A + A A + + Sbjct: 1383 KEAAKADLEKEAAKVKALITNDPTLTKADKAKQTGAVAKALKAAIAAVDKATTAEGINQE 1442 Query: 290 QSASAAAGSKTAAASSASAASTSAGQASASATAAGKSAESAASSASTATTKAGEATEQAS 349 A +K A A +A AA A T KA + A Sbjct: 1443 LGKGITAINKAYRPGEGVKARKEAAKADLEREAAKVREAIANDPTLTKADKAKQTEAVAK 1502 Query: 350 AAARSASAAKTSETNAKASETSAESSKTAAASSASSAASSASSASASKDEATRQASAAKS 409 A + +A + T ++ + + A +A + Sbjct: 1503 ALKAAIAAVDKATTAEGINQELGKGITAINKAYRPGEGVKARKEAAKANLEKVAKETKAL 1562 Query: 410 SATTASTKATEAAGSATAAAQSKSTAESAATRAETAAKRAEDIASAVALEDASTTKKGIV 469 + TE A A Q+ + A A+T ++ Sbjct: 1563 ISGDRYLSETEKAAQKQAVEQALAKALGQVEAAKTVEAVKLAENLGTVAIRSAYVAGLAK 1622 Query: 470 QLSSATNSTSETLAATPKAVKSAYDNAEKRLQKDQNGADIPDKGCFLNNINAVSKTDFAD 529 AT + +E A +A+K A ++ D + K N++ KT A Sbjct: 1623 DTDQATAALNEAKQAAIEALKQAAAETLAKITTDAKLTEAQ-KAEQSENVSLALKTAIAT 1681 Query: 530 KRGMRYVRVNAPAGATS 546 R + + A Sbjct: 1682 VRSAQSIASVKEAKDKG 1698 >UniRef50_B4CVU3 Putative uncharacterized protein n=1 Tax=Chthoniobacter flavus Ellin428 RepID=B4CVU3_9BACT Length = 755 Score = 52.1 bits (122), Expect = 1e-04, Method: Composition-based stats. 
Identities = 67/545 (12%), Positives = 143/545 (26%), Gaps = 21/545 (3%) Query: 80 DSQPGTLNDFLGAMTEDDARPEALRRFELMVEEVARNASAVAQNTA-AAKKSASDASTSA 138 D + + E++ + E L + E R + A+ Sbjct: 209 DEKKRYETNKEAKPQENEQQREQLAILNRLKELAQRQQDINERLKELQTALQAAKTEEQK 268 Query: 139 REAATHAADAADSARAASTSAGQAASSAQSASSSAGTASTKATEASKSAAAAESSKSAAA 198 E + + + +A + A A A + A ++++S Sbjct: 269 EEIRRQLKRLQEEEKQMLSDIDEAKQKMEQAQQQAQLADERQQLDKIRGEAQQAAESMEK 328 Query: 199 TSAGAAKTSETNASASLQSAATSASTATTKASEAATSARDAAASKEAAKSSETNASSSAS 258 + A + T A LQ T+ + A + A E Sbjct: 329 GATSQALANGTRAQRDLQQMHDEFRKKTSGQFNEEMRQMRSDARELAQNQEELAEKLQPQ 388 Query: 259 SAASSATAAGNSAKAAKTSETNARSSETAAGQSASAAAGSKTAAASSASAAST------S 312 A +++ + +E A+ + S+ A + A Sbjct: 389 PAKQERPTLDGNSEREQLTEKFAKQEGDLKKLTEKMKDISEQAETAEPLLAKELYDSLRK 448 Query: 313 AGQASASATAAGKSAESAASSASTATTKAGEATEQASAAARSASAAKTSETNAKASETSA 372 QA T + A+ A +A ++ A S +A A Sbjct: 449 TTQAGTDQTLEKTQELAQRGYANEARQFEQKAHQEIDDLKSGVERAAESVLGNEADALRA 508 Query: 373 ESSKTAAASSASSAASSASSASASKDEATRQASAAKSSATTASTKATEAAGSATAAAQSK 432 ++ + + +++ + + + + T G A Q Sbjct: 509 ARAEVDQLKKDLDREIARARPDLAQNSEKQPGNQPGEPNSKEGQQPTGKDGQQGKAGQQG 568 Query: 433 STAESAATRAETAAKRAEDIASAVALEDASTTKKGIVQLSSATNSTSETLAATPKAVKSA 492 S + + +++A + K Q +S ++T A + Sbjct: 569 SPQSADREQKTADSQKAGGQQKGEGQQKGEGEGKEEGQQKGEGSSENQTADAKGQGKGQG 628 Query: 493 YDNAEKRLQKDQNGADIPDKGCFLNNINAVSKTDFADKRGMRYVRVNAPAGATSGKYYPV 552 + +Q G P +A + + + +G+ S + Sbjct: 629 KGGQNGQASGEQAGGAPP------QEASAPEQFAGRENPSAQQAGNGGKSGSPSARNANG 682 Query: 553 VVMRSAGSVSELASRVIITTATRTAGDPMNNCEFNGFVMPGGWTDRGRYAYGMFWQYQNN 612 R+A + +LAS + + NN T G F + + Sbjct: 683 GGPRTASRLLQLAS--------QAGENSSNNNGGGNGGGGADTTFDGPLTGDRFVDWSDR 734 Query: 613 ERAIH 617 R + Sbjct: 735 LRNVE 739 >UniRef50_UPI00016C0209 S-layer domain protein n=1 Tax=Epulopiscium sp. 'N.t. morphotype B' RepID=UPI00016C0209 Length = 3575 Score = 52.1 bits (122), Expect = 1e-04, Method: Composition-based stats. 
Identities = 68/505 (13%), Positives = 159/505 (31%), Gaps = 2/505 (0%) Query: 89 FLGAMTEDDARPEALRRFELMVEEVARNASAVAQNTAAAKKSASDASTSAREAATHAADA 148 A+ + + + + V+ SA + AK++ + + + A + Sbjct: 2463 LTDAIADAQSVYDNSDASKSEVDFAKAGLSAAISSFNNAKQNGTMVNAVDKAALETLLSS 2522 Query: 149 ADSARAASTSAGQAASSAQSASSSAGTASTKATEASKSAAAAESSKSAAATSAGAAKTSE 208 A++ A + S+ + S+ + T+A A + A+ + + K Sbjct: 2523 ANATLATAKSSDDGTDISPSSKWVSTEIYAALTDAIXXAQGVYDNSDASKSEVDSTKXXL 2582 Query: 209 TNASASLQSAATSASTATTKASEAATSARDAAASKEAAKSSETNASSSASSAASSATAAG 268 + A S +A + A + +A + A S + + + S+ +T Sbjct: 2583 SAAIXSFNNAKQXGTMVNAVDKAALETLLSSANATLATAKSXDDGTDISPSSKWVSTEIY 2642 Query: 269 NSAKAAKTSETNARSSETAAGQSASAAAGSKTAAASSASAASTSAGQASASATAAGKSAE 328 + A + A+ + +AA SS + A + +A AA ++ Sbjct: 2643 AALTDAIADAQXVYDNSDASKSEVDSTKXELSAAISSFNNAKQNGTMVNAVDKAALETLL 2702 Query: 329 SAASSASTATTKAGEATEQASAAARSASAAKTSETNAKASETSAESSKTAAASSASSAAS 388 S+A++ + + T+ + ++ ++ + T+A A S + A+ S S + Sbjct: 2703 SSANATLATAKSSNDGTDISPSSKWVSTEIYAALTDAIADAQSVYDNSDASKSEVDSTKA 2762 Query: 389 SASSASASKDEATRQASAAKSSATTASTKATEAAGSATAAAQSKSTAESAATRAETAAKR 448 S+A +S + A + + + A +A + A A+S + ++ + Sbjct: 2763 GLSAAISSFNNAKQNGTMVNAVDKAALETLLSSANATLATAKSSDDGTDISPSSKWVSSE 2822 Query: 449 AEDIASAVALEDASTTKKGIVQLSSATNSTSETLAATPKAVKSAYDNAEKRLQKDQNGAD 508 + S ++ SE AA + + Sbjct: 2823 IYAALTDAISSAQGVYDNSDASKSEVDSTKSELSAAISSFNNAKQNGTMVNAVDKAALET 2882 Query: 509 IPDKGCFLNNINAVSKTDFADKRGMRYVRVNAPAGATSGKYYPVVVMRSAGSVSELASRV 568 + S ++V A T V ++ + Sbjct: 2883 LLSSANATLATAKSSDDGTDISPSSKWVSTEIYAALTDAIADAQSVYDNSDASKSEVDFA 2942 Query: 569 IITTATRTAGDPMNNCEFNGFVMPG 593 + + NN + NG ++ Sbjct: 2943 KADLSAAISSF--NNAKQNGTMVNA 2965 Score = 50.9 bits (119), Expect = 3e-04, Method: Composition-based stats. 
Identities = 68/505 (13%), Positives = 158/505 (31%), Gaps = 2/505 (0%) Query: 89 FLGAMTEDDARPEALRRFELMVEEVARNASAVAQNTAAAKKSASDASTSAREAATHAADA 148 A+ + + V+ SA + AK+ + + + A + Sbjct: 2554 LTDAIXXAQGVYDNSDASKSEVDSTKXXLSAAIXSFNNAKQXGTMVNAVDKAALETLLSS 2613 Query: 149 ADSARAASTSAGQAASSAQSASSSAGTASTKATEASKSAAAAESSKSAAATSAGAAKTSE 208 A++ A + S + S+ + T+A A + A+ + + K Sbjct: 2614 ANATLATAKSXDDGTDISPSSKWVSTEIYAALTDAIADAQXVYDNSDASKSEVDSTKXEL 2673 Query: 209 TNASASLQSAATSASTATTKASEAATSARDAAASKEAAKSSETNASSSASSAASSATAAG 268 + A +S +A + + A + +A + A S + + + S+ +T Sbjct: 2674 SAAISSFNNAKQNGTMVNAVDKAALETLLSSANATLATAKSSNDGTDISPSSKWVSTEIY 2733 Query: 269 NSAKAAKTSETNARSSETAAGQSASAAAGSKTAAASSASAASTSAGQASASATAAGKSAE 328 + A + + A+ + +AA SS + A + +A AA ++ Sbjct: 2734 AALTDAIADAQSVYDNSDASKSEVDSTKAGLSAAISSFNNAKQNGTMVNAVDKAALETLL 2793 Query: 329 SAASSASTATTKAGEATEQASAAARSASAAKTSETNAKASETSAESSKTAAASSASSAAS 388 S+A++ + + T+ + ++ +S + T+A +S + A+ S S S Sbjct: 2794 SSANATLATAKSSDDGTDISPSSKWVSSEIYAALTDAISSAQGVYDNSDASKSEVDSTKS 2853 Query: 389 SASSASASKDEATRQASAAKSSATTASTKATEAAGSATAAAQSKSTAESAATRAETAAKR 448 S+A +S + A + + + A +A + A A+S + ++ + Sbjct: 2854 ELSAAISSFNNAKQNGTMVNAVDKAALETLLSSANATLATAKSSDDGTDISPSSKWVSTE 2913 Query: 449 AEDIASAVALEDASTTKKGIVQLSSATNSTSETLAATPKAVKSAYDNAEKRLQKDQNGAD 508 + + S S + ++ AA + + Sbjct: 2914 IYAALTDAIADAQSVYDNSDASKSEVDFAKADLSAAISSFNNAKQNGTMVNAVDKAALET 2973 Query: 509 IPDKGCFLNNINAVSKTDFADKRGMRYVRVNAPAGATSGKYYPVVVMRSAGSVSELASRV 568 + S ++V A T V ++ + Sbjct: 2974 LLSSANATLATAKSSDDGTDISPSSKWVSTEIYAALTDAIADAQSVYDNSDASKSEVDFA 3033 Query: 569 IITTATRTAGDPMNNCEFNGFVMPG 593 + NN + NG ++ Sbjct: 3034 KADLSAAITSF--NNAKQNGTMVSA 3056 Score = 49.8 bits (116), Expect = 6e-04, Method: Composition-based stats. 
Identities = 66/505 (13%), Positives = 161/505 (31%), Gaps = 2/505 (0%) Query: 89 FLGAMTEDDARPEALRRFELMVEEVARNASAVAQNTAAAKKSASDASTSAREAATHAADA 148 A++ + + + V+ + SA + AK++ + + A + Sbjct: 2281 LTDAISSAQSVYDNSDASKSEVDSTKADLSAAISSFNNAKQNGTMIDAVDKAALETLLSS 2340 Query: 149 ADSARAASTSAGQAASSAQSASSSAGTASTKATEASKSAAAAESSKSAAATSAGAAKTSE 208 A++ A + S+ + S+ + T+A SA + + A+ + + K Sbjct: 2341 ANATLATAKSSDDGTDISPSSKWVSSEIYAALTDAIASAQSVYDNSDASKSEVDSTKADL 2400 Query: 209 TNASASLQSAATSASTATTKASEAATSARDAAASKEAAKSSETNASSSASSAASSATAAG 268 + A +S +A + + + + + +A + A S + + + S+ +T Sbjct: 2401 SAAISSFNNAKQNGTMVSAVDKTSLETLLSSANATLATAKSSNDGTDISPSSKWVSTEIY 2460 Query: 269 NSAKAAKTSETNARSSETAAGQSASAAAGSKTAAASSASAASTSAGQASASATAAGKSAE 328 + A + + A+ A +AA SS + A + +A AA ++ Sbjct: 2461 AALTDAIADAQSVYDNSDASKSEVDFAKAGLSAAISSFNNAKQNGTMVNAVDKAALETLL 2520 Query: 329 SAASSASTATTKAGEATEQASAAARSASAAKTSETNAKASETSAESSKTAAASSASSAAS 388 S+A++ + + T+ + ++ ++ + T+A + A+ S S Sbjct: 2521 SSANATLATAKSSDDGTDISPSSKWVSTEIYAALTDAIXXAQGVYDNSDASKSEVDSTKX 2580 Query: 389 SASSASASKDEATRQASAAKSSATTASTKATEAAGSATAAAQSKSTAESAATRAETAAKR 448 S+A S + A + + + A +A + A A+S + ++ + Sbjct: 2581 XLSAAIXSFNNAKQXGTMVNAVDKAALETLLSSANATLATAKSXDDGTDISPSSKWVSTE 2640 Query: 449 AEDIASAVALEDASTTKKGIVQLSSATNSTSETLAATPKAVKSAYDNAEKRLQKDQNGAD 508 + + S ++ E AA + + Sbjct: 2641 IYAALTDAIADAQXVYDNSDASKSEVDSTKXELSAAISSFNNAKQNGTMVNAVDKAALET 2700 Query: 509 IPDKGCFLNNINAVSKTDFADKRGMRYVRVNAPAGATSGKYYPVVVMRSAGSVSELASRV 568 + S ++V A T V ++ + Sbjct: 2701 LLSSANATLATAKSSNDGTDISPSSKWVSTEIYAALTDAIADAQSVYDNSDASKSEVDST 2760 Query: 569 IITTATRTAGDPMNNCEFNGFVMPG 593 + + NN + NG ++ Sbjct: 2761 KAGLSAAISSF--NNAKQNGTMVNA 2783 Score = 49.8 bits (116), Expect = 6e-04, Method: Composition-based stats. 
Identities = 96/596 (16%), Positives = 208/596 (34%), Gaps = 14/596 (2%) Query: 8 VLKDGTGKPVQNCTIQLKAKRNSTTVVVNTLASENPDEAGRYSMDVEYGQYSVILLVEGF 67 L D K + V AS + D A R +E + +L + Sbjct: 1919 ALSDAISTATAVANDATKQASVDSAVEALDKASNSFDVAKRAGTKIEVADKTSLLTLIDA 1978 Query: 68 PPSHAGTITVYEDSQPGTLNDFLGAMTEDDARPEALRRFELMVEEVARNASAVAQNTAAA 127 ++ ++TV +D + + A ++ DA +A+ + + + A V + A Sbjct: 1979 ATANLASVTVSDDGSDVDVANMWVAHSDYDALSDAISTATAVANDATKQA--VVDSAVEA 2036 Query: 128 KKSASDASTSAREAATHAADAADSARAASTSAGQAASSAQSASSSAGTASTK---ATEAS 184 AS+A A+ A T A ++ AS A A ++ + S T + Sbjct: 2037 LDKASNAFDEAKRAGTKIEIADKTSLLASIDAAAANLASATVSDDGADVDVADMWVTHSD 2096 Query: 185 KSAAAAESSKSAAATSAGAAKTSETNASASLQSAATSASTATTKASEAATSARDAAASKE 244 A + S + A + + +A +L A+ + A ++ + + + Sbjct: 2097 YDALSDAISTATAVANDATKQAVVDSAVEALDKASKAFDEAKRAGTQIGIGNKSSLETLL 2156 Query: 245 AAKSSETNASSSASSAASSATAAGNSAKAAKTSETNARSSETAAGQSASAAAGSKTAAAS 304 ++ ++ + S+ + ++ + + T+A +S + ++ A+ A + Sbjct: 2157 SSANATLATAKSSDDGTDISPSSKWVSTEIYAALTDAIASAQSVYDNSDASKSEVDFAKA 2216 Query: 305 SASAASTSAGQASASATAAGKSAESAASSASTATTKAGEATEQASAAARSASAAKTSETN 364 SAA TS A + T ++A + ++ + + + ++K + Sbjct: 2217 DLSAAITSFNNAKQNGTMVNAVDKAALETLLSSANATLATAKSSDDGTDISPSSKWVSSE 2276 Query: 365 AKASETSAESSKTAAASSASSAASSASSASASKDEATRQASAAKSSATTASTKATEAAGS 424 A+ T A SS + ++ ++ S S A A + AK + T A + A Sbjct: 2277 IYAALTDAISSAQSVYDNSDASKSEVDSTKADLSAAISSFNNAKQNGTM--IDAVDKAAL 2334 Query: 425 ATAAAQSKSTAESAATRAETAAKRAEDIASAVALEDASTTKKGIVQLSSATNSTSETLAA 484 T + + +T +A + + + + A T Q + S++ Sbjct: 2335 ETLLSSANATLATAKSSDDGTDISPSSKWVSSEIYAALTDAIASAQSVYDNSDASKSEVD 2394 Query: 485 TPKAVKSAYDNAEKRLQKDQNGADIPDKGCFLNNINAVSKTDFADKRGMRYVRVNAPAGA 544 + KA SA ++ +++ DK +++ + T K ++ + Sbjct: 2395 STKADLSAAISSFNNAKQNGTMVSAVDKTSLETLLSSANATLATAKSSNDGTDISPSSKW 2454 Query: 545 TSGKYYP--VVVMRSAGSVSELASRVIITTATRTAGDPM-----NNCEFNGFVMPG 593 S + Y + A SV + + AG NN + NG ++ Sbjct: 2455 VSTEIYAALTDAIADAQSVYDNSDASKSEVDFAKAGLSAAISSFNNAKQNGTMVNA 2510 Score = 45.5 bits (105), Expect = 0.012, 
Method: Composition-based stats. Identities = 87/598 (14%), Positives = 181/598 (30%), Gaps = 21/598 (3%) Query: 8 VLKDGTGKPVQNCTIQLKAKRNSTTVVVNTLASENPDEAGRYSMDVEYGQYSVILLVEGF 67 L D K + V AS DEA R +E S +L + Sbjct: 1649 ALSDAISTATAVANDATKQAVVDSAVEALDKASNAFDEAKRAGTKIEIADKSALLALIDA 1708 Query: 68 PPSHAGTITVYEDSQPGTLNDFLGAMTEDDARPEALRRFELMVEEVARNASAVAQNTAAA 127 ++ ++TV +D + A ++ DA +A+ + + + A V + A Sbjct: 1709 AAANLASVTVSDDGSDVDAANMWVAHSDYDALFDAISTATAVANDATKQA--VVDSAVEA 1766 Query: 128 KKSASDASTSAREAATHAADAADSARAASTSAGQAASSAQSASSSAGTASTK---ATEAS 184 AS A A+ A T A ++ AS A A ++ + S T + Sbjct: 1767 LDKASKAFDEAKRAGTKIEVADKTSLLASIDAATANLASVTVSDDGTDVDVADMWVTHSD 1826 Query: 185 KSAAAAESSKSAAATSAGAAKTSETNASASLQSAATSASTATTKASEAATSARDAAASKE 244 A + S + A + + +A +L A+ + A ++ + + + + Sbjct: 1827 YDALSDAISTATAVANDATKQAVVDSAVEALDKASNAFDEAKRAGTKIEIADKTSLLALI 1886 Query: 245 AAKSSETNASSSASSAASSATAA---------GNSAKAAKTSETNARSSETAAGQSASAA 295 A ++ + + + A A S + + +++ A+ SA A Sbjct: 1887 DAATANLASVTVSDDGADVDVADMWVTHSDYDALSDAISTATAVANDATKQASVDSAVEA 1946 Query: 296 AGSKTAAASSASAASTSAGQASASATAAGKSAESAASSASTATTKAGEATEQASAAARSA 355 + + A A T A ++ A +A ++ T + + A S Sbjct: 1947 LDKASNSFDVAKRAGTKIEVADKTSLLTLIDAATANLASVTVSDDGSDVDVANMWVAHSD 2006 Query: 356 SAAKTSETNAKASETSAESSKTAAASSASSAASSASSASASKDEATRQASAAKSSATTAS 415 A + + + + + + S+ + ++++ +K T+ A K+S + Sbjct: 2007 YDALSDAISTATAVANDATKQAVVDSAVEALDKASNAFDEAKRAGTKIEIADKTSLLASI 2066 Query: 416 TKATEAAGSATAAAQSKSTAESAATRAETAAKRAEDIASAVALEDASTTKKGIVQLSSAT 475 A SAT + + + D S TK+ +V + Sbjct: 2067 DAAAANLASATVSDDGADVDVADMWVTHSDYDALSDAISTATAVANDATKQAVVDSAVEA 2126 Query: 476 NSTSETLAATPKAVKSAYDNAEKRLQKDQNGADIPDKGCFLNNINAVSKTDFADKRGMRY 535 + K + K + + S ++ Sbjct: 2127 LDKASKAFDEAKRAGTQIGIGNKSSLET-----LLSSANATLATAKSSDDGTDISPSSKW 2181 Query: 536 VRVNAPAGATSGKYYPVVVMRSAGSVSELASRVIITTATRTAGDPMNNCEFNGFVMPG 593 V A T V ++ + + NN + NG ++ Sbjct: 2182 VSTEIYAALTDAIASAQSVYDNSDASKSEVDFAKADLSAAITSF--NNAKQNGTMVNA 2237 Score = 45.1 bits (104), Expect = 0.016, Method: 
Composition-based stats. Identities = 95/565 (16%), Positives = 200/565 (35%), Gaps = 16/565 (2%) Query: 39 ASENPDEAGRYSMDVEYGQYSVILLVEGFPPSHAGTITVYEDSQPGTLNDFLGAMTEDDA 98 AS+ DEA R +E + +L ++ ++TV +D + D ++ DA Sbjct: 1770 ASKAFDEAKRAGTKIEVADKTSLLASIDAATANLASVTVSDDGTDVDVADMWVTHSDYDA 1829 Query: 99 RPEALRRFELMVEEVARNASAVAQNTAAAKKSASDASTSAREAATHAADAADSARAASTS 158 +A+ + + + A V + A AS+A A+ A T A ++ A Sbjct: 1830 LSDAISTATAVANDATKQA--VVDSAVEALDKASNAFDEAKRAGTKIEIADKTSLLALID 1887 Query: 159 AGQAASSAQSASSSAGTASTK---ATEASKSAAAAESSKSAAATSAGAAKTSETNASASL 215 A A ++ + S T + A + S + A + + S +A +L Sbjct: 1888 AATANLASVTVSDDGADVDVADMWVTHSDYDALSDAISTATAVANDATKQASVDSAVEAL 1947 Query: 216 QSAATSASTATTKASEAATSARDAAASKEAAKSSETNASSSASSAASSATAAGNSAKAAK 275 A+ S A ++ + + + + A ++ + + + + A A + Sbjct: 1948 DKASNSFDVAKRAGTKIEVADKTSLLTLIDAATANLASVTVSDDGSDVDVANMWVAHSDY 2007 Query: 276 TSETNARSSETAAGQSASAAAGSKTAAASSASAASTSAGQASASATAAGKSAESAASSAS 335 + ++A S+ TA A+ + +A A +++A + A + A+ + AS Sbjct: 2008 DALSDAISTATAVANDAT--KQAVVDSAVEALDKASNAFDEAKRAGTKIEIADKTSLLAS 2065 Query: 336 TATTKAGEATEQASAAARSASAAKTSETNAKASETSAESSKTAAASSASSAASSASSASA 395 A A+ S A T++ S S A ++ ++ + SA Sbjct: 2066 IDAAAANLASATVSDDGADVDVADMWVTHSDYDALSDAISTATAVANDATKQAVVDSAVE 2125 Query: 396 SKDEATRQASAAKSSATTASTKATEAAGSATAAAQSKSTAESAATRAETAAKRAEDIASA 455 + D+A++ AK + T + T + + +T +A + + + Sbjct: 2126 ALDKASKAFDEAKRAGT--QIGIGNKSSLETLLSSANATLATAKSSDDGTDISPSSKWVS 2183 Query: 456 VALEDASTTKKGIVQLSSATNSTSETLAATPKAVKSAYDNAEKRLQKDQNGADIPDKGCF 515 + A T Q + S++ KA SA + +++ + DK Sbjct: 2184 TEIYAALTDAIASAQSVYDNSDASKSEVDFAKADLSAAITSFNNAKQNGTMVNAVDKAAL 2243 Query: 516 LNNINAVSKTDFADKRGMRYVRVNAPAGATSGKYYP--VVVMRSAGSVSELASRVIITTA 573 +++ + T K ++ + S + Y + SA SV + + Sbjct: 2244 ETLLSSANATLATAKSSDDGTDISPSSKWVSSEIYAALTDAISSAQSVYDNSDASKSEVD 2303 Query: 574 TRTAGDPM-----NNCEFNGFVMPG 593 + A NN + NG ++ Sbjct: 2304 STKADLSAAISSFNNAKQNGTMIDA 2328 >UniRef50_Q4SE75 Chromosome undetermined SCAF14625, whole genome shotgun sequence. 
(Fragment) n=1 Tax=Tetraodon nigroviridis RepID=Q4SE75_TETNG Length = 2835 Score = 52.1 bits (122), Expect = 1e-04, Method: Composition-based stats. Identities = 55/457 (12%), Positives = 122/457 (26%), Gaps = 11/457 (2%) Query: 55 YGQYSVILLVEGFPPSHAGTITVYEDSQPGTLNDFLGAMTEDDARPEALRRFELMVEEVA 114 G E ++ ED + + ED RP A + E E+ + Sbjct: 1101 PGLEDKRSSGEAPEGLDGDIFSLEEDEKKDSTPAAPRPAEEDTVRPTAGGQREGKEEQSS 1160 Query: 115 RNASAVAQNTAAAKKSASDASTSAREAATHAADAADSARAASTSAGQAASSAQSASSSAG 174 + + ++ +STS +E + ++ S S + G Sbjct: 1161 TGPAQPEDDKDMSRGEQEFSSTSVKETEKSKEAEGGGDVHPAATSQDLPSVDTSEVQTGG 1220 Query: 175 TASTKA-TEASKSAAAAESSKSAAATSAGAAKTSETNASASLQSAATSASTATTKASEAA 233 T A TEA E + + + +A ++ T SEAA Sbjct: 1221 TGVPAADTEAVHQETGTEEKEEVDTPQPLRQEVQDPSAGIQTTREEDTSKDLTADVSEAA 1280 Query: 234 TSARDAAA-SKEAAKSSETNASSSASSAASSATAAGNSAKAAKTSETNARSSETAAGQSA 292 A + + T A + S+ + +S + E SE + ++ Sbjct: 1281 GDAPEVKGHQGTQPPETGTGAIQTFSTNTQEDKTSDSSQQQEPMKEIKEEKSEAISEETV 1340 Query: 293 SAAAGSKTAAASSASAASTSAGQASASATAAGKSAESAASSASTATTKAGEATEQ----- 347 K + ++ ++AT + A++ A E Sbjct: 1341 EKDREEKKSQEEELKDGGLQEDRSRSAATEGEMDTDLQAATTDGAIGGPHAEAEALQDHR 1400 Query: 348 ----ASAAARSASAAKTSETNAKASETSAESSKTAAASSASSAASSASSASASKDEATRQ 403 ++ + + + + + S SS ++E Sbjct: 1401 DPDGQTSPSPAEKKTTKENICEEEVKPVRRDICSGVQDKTSEEESSDEDREDEEEEDICM 1460 Query: 404 ASAAKSSATTASTKATEAAGSATAAAQSKSTAESAATRAETAAKRAEDIASAVALEDAST 463 A + + T S + + E + +++ + Sbjct: 1461 GGAGSRPLSVIFSSRTPPLTSEHVIGVIPTLTMQEPSIDEDVPQSSKEEPKRSPQLEEDR 1520 Query: 464 TKKGIVQLSSATNSTSETLAATPKAVKSAYDNAEKRL 500 +V +++ + + ++ E R Sbjct: 1521 PVVRVVPSEKMSSALEQQSPSDAPPSRAEPAETEPRT 1557 >UniRef50_Q9I7U4 Titin n=39 Tax=Eukaryota RepID=TITIN_DROME Length = 18141 Score = 52.1 bits (122), Expect = 1e-04, Method: Composition-based stats. 
Identities = 35/478 (7%), Positives = 92/478 (19%), Gaps = 17/478 (3%) Query: 91 GAMTEDDARPEALRRFELMVEEVARNASAVAQNTAAAKKSASDASTSAREAATHAADAAD 150 + EDD +PE E + EV + + + + + + Sbjct: 10973 ETVEEDDKQPETTVTVEEVPYEVEKP-DEIQELPEEVRVVETVTEDGKPKKKKIRTRVIK 11031 Query: 151 SARAASTSAGQAASSAQSASSSAGTASTKA----TEASKSAAAAESSKSAAATSAGAAKT 206 + + + + T + + E + T K Sbjct: 11032 KVKGDKQEVTKIETVEEDDKQPETTVTVEEVPYEEEKPEEIQELPEEVRVVETVTEDGKP 11091 Query: 207 SETNASASLQSAATSASTATTKASEAATSARDAAASKEAAKSSETNASSSASSAASSATA 266 + + TK + + + Sbjct: 11092 KKKKIRTRVIKKVKGDKQEVTKIETVEEDDKQPETTVTVEEVPYEEEKPEEIQELPEEVR 11151 Query: 267 AGNSAKAAKTSETNA------RSSETAAGQSASAAAGSKTAAASSASAASTSAGQASASA 320 + + + + + + + Sbjct: 11152 VVETVTEDGKPKKKKIRTRVIKKVKGDKQEVTKIETVEEDDKQPETTVTVEEVPYEEEKP 11211 Query: 321 TAAGKSAESAASSASTATTKAGEATEQASAAARSASAAKTSETNAKASETSAESSKTAAA 380 + E + + + + + K T + E + +T Sbjct: 11212 EEIQELPEEVRVVETVTEDGKPKKKKIRTRVIKKVKGDKQEVTKIETVEEDDKQPETTVT 11271 Query: 381 SSASSAASSASSASASKDEATRQASAAKSSATTASTKATEAAGSATAAAQSKSTAESAAT 440 E R K + + T Sbjct: 11272 VEEVPYEEEKPEEIQELPEEVRVVETVTEDGKPKKKKIRTRVIKKVKGDKQEVTKIETVE 11331 Query: 441 RAETAAKRAEDIASAVALEDASTTKKGIVQLSSATNSTSETLAATPKAVKS------AYD 494 + + + E+ + + + + +E K +++ D Sbjct: 11332 EDDKQPETTVTVEEVPYEEEKLEEIQELPEEVRVVETVTEDGKPKKKKIRTRVIKKVKGD 11391 Query: 495 NAEKRLQKDQNGADIPDKGCFLNNINAVSKTDFADKRGMRYVRVNAPAGATSGKYYPV 552 E + D + +S + + V V P A + P Sbjct: 11392 KQEVTTIETVEEDDKKAETTVTVEETELSAPSVGKVQLKKRVIVQKPEDAVTVFELPE 11449 Score = 48.6 bits (113), Expect = 0.001, Method: Composition-based stats. 
Identities = 44/532 (8%), Positives = 112/532 (21%), Gaps = 21/532 (3%) Query: 10 KDGTGKPVQNCTIQLKAKRNSTTVVVNTLASENPDEAGRYSMDVEYGQYSVILLVEGFPP 69 +G G T+ L + N + +V ++ E S+ Q ++ P Sbjct: 9263 IEGHGDFKPG-TVNLTSNSNLNSELVVSVVQEVTSVPSLGSLATVEPQ-----ELKAMPV 9316 Query: 70 SHAGTITVYEDSQPGTLNDFL--GAMTEDDARPEALRRFELMV---------EEVARNAS 118 + + T Y + G +F + EDD +PE E + +E+ Sbjct: 9317 TKSSTNLAYSEEVKGNKQEFTKIETVEEDDKQPETTVTVEELPYEEEKPEEIQELPEEVC 9376 Query: 119 AVAQNTAAAKKSASDASTSAREAATHAADAADSARAASTSAGQAASSAQSASSSAGTAST 178 V T K T + Q ++ Sbjct: 9377 VVETVTEDGKPKKKKIRTRVIKKVKGDKQEVTKIETVEEDDKQPETTVTVEEVPYEEEKP 9436 Query: 179 KATEASKSAAAAESSKSAAATSAGAAKTSETNASASLQSAATSASTATTKASEAATSARD 238 + + + + K T ++ + T + Sbjct: 9437 EEIQELPEEVRVVETVTEDGKPKK--KKIRTRVIKKVKGDKQEVTKIETVEEDDKQPETT 9494 Query: 239 AAASKEAAKSSETNASSSASSAASSATAAGNSAKAAKTSETNA--RSSETAAGQSASAAA 296 + + + K K + + + Sbjct: 9495 VTVEEVPYEEEKPEEIQELPEEVRVVETVTEDGKPKKKKIRTRFIKKVKGDKQEVTKIET 9554 Query: 297 GSKTAAASSASAASTSAGQASASATAAGKSAESAASSASTATTKAGEATEQASAAARSAS 356 + + + E + + + + + Sbjct: 9555 VEEDDKQPETTVTVEEVPYEEEKPEEIQELPEEVRVVETVTEDGKPKKKKIRTRVIKKVK 9614 Query: 357 AAKTSETNAKASETSAESSKTAAASSASSAASSASSASASKDEATRQASAAKSSATTAST 416 K T + E + +T E R Sbjct: 9615 GDKQEVTKIETVEEDDKQPETTVTVEEVPYEEEKPEEIQELPEEVRVVETVTEDGKPKKK 9674 Query: 417 KATEAAGSATAAAQSKSTAESAATRAETAAKRAEDIASAVALEDASTTKKGIVQLSSATN 476 K + + T + + + E+ + + + Sbjct: 9675 KIRTRVIKKVKGDKQEVTKIETVEEDDKQPETTVTVEEVPYEEEKPEEIQELPEEVRVVE 9734 Query: 477 STSETLAATPKAVKSAYDNAEKRLQKDQNGADIPDKGCFLNNINAVSKTDFA 528 + +E K +++ K +++ + ++ + Sbjct: 9735 TVTEDGKPKKKKIRTRVIKKVKGDKQEVTKIETVEEDDKQPETTVTVEEVPY 9786 Score = 48.2 bits (112), Expect = 0.002, Method: Composition-based stats. 
Identities = 46/491 (9%), Positives = 96/491 (19%), Gaps = 48/491 (9%) Query: 91 GAMTEDDARPEALRRFELMV------------EEVARNASAVAQNTAAAKKSASDASTSA 138 + EDD +PE E + E R V ++ KK Sbjct: 10547 ETVEEDDKQPETTVTVEEVPYEEEKPEEIQELPEEVRVVETVTEDGKPKKKKIRTRVIKK 10606 Query: 139 REAATHAADAADSARAASTSAGQAASSAQ---------SASSSAGTASTKATEASKSAAA 189 + ++ + + T Sbjct: 10607 VKGDKQEVTKIETVEEDDKQPETTVTVEEVPYEEEKPEEIQELPEEVRVVETVTEDGKPK 10666 Query: 190 AESSKSAAATSAGAAKTSETNASASLQSAATSASTATTKA---------------SEAAT 234 + ++ K T + +T T + E Sbjct: 10667 KKKIRTRVIKKVKGDKQEVTKIETVEEDDKQPETTVTVEEVPYEEEKPEEIQELPEEVRV 10726 Query: 235 SARDAAASKEAAKSSETNASSSASSAASSATAAGNSAKAAKTSETNARSSETAAGQSASA 294 K K T T + K ET E + Sbjct: 10727 VETVTEDGKPKKKKIRTRVIKKVKGDKQEVTKIETVEEDDKQPETTVTVEEVPYEEEKPE 10786 Query: 295 AAGSKTAAASSASAASTSAGQASAS------ATAAGKSAESAASSASTATTKAGEATEQA 348 + G E K E T Sbjct: 10787 EIQELPEEVRVVETVTEDGKPKKKKIRTRVIKKVKGDKQEVTKIETVEEDDKQPETTVTV 10846 Query: 349 SAAARSASAAKTSETNAKASETSAESSKTAAASSASSAASSASSASASKDEATRQASAAK 408 + + + ++ K E T+ + + Sbjct: 10847 EEVPYEEEKPEEIQELPEEVRVVETVTEDGKPKKKKIRTRVIKKVKGDKQEVTKIETVEE 10906 Query: 409 SSATTASTKATEAAGSATAAAQSKSTAESAATRAETAAKRAEDIASAVALEDASTTKKGI 468 +T E + ET + + + K Sbjct: 10907 DDKQPETTVTVEEVPYEEEKPEEIQELPEEVRVVETVTEDGKPKKKKIRTRVIKKVKGDK 10966 Query: 469 VQLSSATNSTSETLAATPKAVKSAYDNAEKRLQKDQNGADIPDKGCFLNNINAVSKTDFA 528 +++ ET+ K ++ E + ++ +I + + + V++ Sbjct: 10967 QEVT-----KIETVEEDDKQPETTVTVEEVPYEVEKPD-EIQELPEEVRVVETVTEDGKP 11020 Query: 529 DKRGMRYVRVN 539 K+ +R + Sbjct: 11021 KKKKIRTRVIK 11031 Score = 48.2 bits (112), Expect = 0.002, Method: Composition-based stats. 
Identities = 38/488 (7%), Positives = 93/488 (19%), Gaps = 18/488 (3%) Query: 74 TITVYEDSQPGTLNDFLGAMTEDDARPEALRRFELMVEEVARNASAVAQNTAAAKKSASD 133 I + T+ D ++ F + ++E + + + ++ S Sbjct: 14211 KILEENVPEDTVEKPLEALHTDSDLEKPDVQEFSISIKEEEQKHTHPEKKKSSKISSEQP 14270 Query: 134 ASTSAREAATHAADAADSARAASTSAGQAASSAQSASSSAGTASTKATEASKS-----AA 188 S + + Q S + + + + A Sbjct: 14271 KQPSTEQYEISVTEHDLKPEEEKPFTVQVIQSETNVEETKDDTGKVHKQVTTKRMLRRPA 14330 Query: 189 AAESSKSAAATSAGAAKTSETNASASLQSAATSASTATTKASEAATSARD---------- 238 + + T + K D Sbjct: 14331 GEGEIEIIEVVRDDQPEAEITIVEYEPEPVNQDEKPKEPKKKTRKVKKDDIHDYIQKLIE 14390 Query: 239 -AAASKEAAKSSETNASSSASSAASSATAAGNSAKAAKTSETNARSSETAAGQSASAAAG 297 E K + + + + + +S T + Sbjct: 14391 LETPKTELEKYEKIEFEPIVKDKPLDSPIDVLDESPKEVQKKDKKSRSTKVPNEETPVQE 14450 Query: 298 SKTAAASSASAASTSAGQASASATAAGKSAESAASSASTATTKAGEATEQASAAARSASA 357 A + + T++ Sbjct: 14451 QYAKVNVVEEEAPEQPEIPVQILEVKPVEVDVKEVITEDGKPVQEKTTKRVLKKIGPEEQ 14510 Query: 358 AKTSETNAKASETSAESSKTAAASSASSAASSASSASASKDEATRQASAAKSSATTASTK 417 T ++ + + + +S S SK++ + Sbjct: 14511 TTFKITMIESEDNDSVTVIVDEEPEIASPQSIEEHPEQSKEKLAPKPKKTVRKVK--KDD 14568 Query: 418 ATEAAGSATAAAQSKSTAESAATRAETAAKRAEDIASAVALEDASTTKKGIVQLSSATNS 477 ++ K E ++ ++ E + I L T Sbjct: 14569 LSDYVKKLIEEEIPKVDLEKYEKVEMPEKPVKLTVSDSIPEEPKPDKSQPISVLPDTTKP 14628 Query: 478 TSETLAATPKAVKSAYDNAEKRLQKDQNGADIPDKGCFLNNINAVSKTDFADKRGMRYVR 537 TPK + ++ + + DIP+ + T Sbjct: 14629 KKTKTPKTPKTEDTDQQVPDEPTETTVDTTDIPELTPTQTAQPEDTATAQITPSAQEEKS 14688 Query: 538 VNAPAGAT 545 T Sbjct: 14689 TQDDTKDT 14696 Score = 46.7 bits (108), Expect = 0.005, Method: Composition-based stats. 
Identities = 45/483 (9%), Positives = 88/483 (18%), Gaps = 47/483 (9%) Query: 91 GAMTEDDARPEALRRFELMV------------EEVARNASAVAQNTAAAKKSASDASTSA 138 + EDD +PE E + E R V ++ KK Sbjct: 9766 ETVEEDDKQPETTVTVEEVPYEEEKPEEIQELPEEVRVVETVTEDGKPKKKKIRTRVIKK 9825 Query: 139 REAATHAADAADSARAASTSAGQAASSAQ---------SASSSAGTASTKATEASKSAAA 189 + ++A + + T Sbjct: 9826 VKGDKQEVTKIETAEEDDKQPETTVTVEEVPYEEEKPEEIQELPEEVRVVETVTEDGKPK 9885 Query: 190 AESSKSAAATSAGAAKTSETNASASLQSAATSASTATTKA---------------SEAAT 234 + ++ K T + +T T + E Sbjct: 9886 KKKIRTRVIKKVKGDKQEVTKIETVEEDDKQPETTVTVEEVPYEEEKPEEIQELPEEVRV 9945 Query: 235 SARDAAASKEAAKSSETNASSSASSAASSATAAGNSAKAAKTSETNARSSETAAGQSASA 294 K K T T + K ET E + Sbjct: 9946 VETVTEDGKPKKKKIRTRVIKKVKGDKQEVTKIETVEEDDKQPETTVTVEEVPYEEEKPE 10005 Query: 295 AAGSKTAAASSASAASTSAGQASAS------ATAAGKSAESAASSASTATTKAGEATEQA 348 + G E K E T Sbjct: 10006 EIQELPEEVRVVETVTEDGKPKKKKIRTRVIKKVKGDKQEVTKIETVEEDDKQPETTVTV 10065 Query: 349 SAAARSASAAKTSETNAKASETSAESSKTAAASSASSAASSASSASASKDEATRQASAAK 408 + + + ++ K E T+ + + Sbjct: 10066 EEVPYEEEKPEEIQELPEEVRVVETVTEDGKPKKKKIRTRVIKKVKGDKQEVTKIETVEE 10125 Query: 409 SSATTASTKATEAAGSATAAAQSKSTAESAATRAETAAKRAEDIASAVALEDASTTKKGI 468 + +T E + ET + + + K Sbjct: 10126 NDKQPETTVTVEEVPYEEEKPEEIQELPEEVRVVETVTEDGKPKKKKIRTRVIKKVKGDK 10185 Query: 469 VQLSSATNSTSETLAATPKAVKSAYDNAEKRLQKDQNGADIPDKGCFLNNINAVSKTDFA 528 +++ ET+ K K+ E ++++ Sbjct: 10186 QEVT-----KIETVEEDDKQPKTTVTVEEVPYEEEKPEEIQELPEEVRVVETVTEDGKPK 10240 Query: 529 DKR 531 K+ Sbjct: 10241 KKK 10243 Score = 46.3 bits (107), Expect = 0.007, Method: Composition-based stats. 
Identities = 31/452 (6%), Positives = 86/452 (19%), Gaps = 19/452 (4%) Query: 91 GAMTEDDARPEALRRFELMV------------EEVARNASAVAQNTAAAKKSASDASTSA 138 + EDD +PE E + E R V ++ KK Sbjct: 10902 ETVEEDDKQPETTVTVEEVPYEEEKPEEIQELPEEVRVVETVTEDGKPKKKKIRTRVIKK 10961 Query: 139 REAATHAADAADSARAASTSAGQAASSAQSASSSAGTASTKATEASKSAAAAESSKSAAA 198 + ++ + + K E + ++ Sbjct: 10962 VKGDKQEVTKIETVEEDDKQPETTVTVEE-----VPYEVEKPDEIQELPEEVRVVETVTE 11016 Query: 199 TSAGAAKTSETNASASLQSAATSASTATTKASEAATSARDAAASKEAAKSSETNASSSAS 258 K T ++ + T + + + + Sbjct: 11017 DGKPKKKKIRTRVIKKVKGDKQEVTKIETVEEDDKQPETTVTVEEVPYEEEKPEEIQELP 11076 Query: 259 SAASSATAAGNSAKAAKTSETNA--RSSETAAGQSASAAAGSKTAAASSASAASTSAGQA 316 K K + + + + + Sbjct: 11077 EEVRVVETVTEDGKPKKKKIRTRVIKKVKGDKQEVTKIETVEEDDKQPETTVTVEEVPYE 11136 Query: 317 SASATAAGKSAESAASSASTATTKAGEATEQASAAARSASAAKTSETNAKASETSAESSK 376 + E + + + + + K T + E + + Sbjct: 11137 EEKPEEIQELPEEVRVVETVTEDGKPKKKKIRTRVIKKVKGDKQEVTKIETVEEDDKQPE 11196 Query: 377 TAAASSASSAASSASSASASKDEATRQASAAKSSATTASTKATEAAGSATAAAQSKSTAE 436 T E R K + + T Sbjct: 11197 TTVTVEEVPYEEEKPEEIQELPEEVRVVETVTEDGKPKKKKIRTRVIKKVKGDKQEVTKI 11256 Query: 437 SAATRAETAAKRAEDIASAVALEDASTTKKGIVQLSSATNSTSETLAATPKAVKSAYDNA 496 + + + E+ + + + + +E K +++ Sbjct: 11257 ETVEEDDKQPETTVTVEEVPYEEEKPEEIQELPEEVRVVETVTEDGKPKKKKIRTRVIKK 11316 Query: 497 EKRLQKDQNGADIPDKGCFLNNINAVSKTDFA 528 K +++ + ++ + Sbjct: 11317 VKGDKQEVTKIETVEEDDKQPETTVTVEEVPY 11348 Score = 45.9 bits (106), Expect = 0.009, Method: Composition-based stats. 
Identities = 45/485 (9%), Positives = 99/485 (20%), Gaps = 36/485 (7%) Query: 91 GAMTEDDARPEALRRFELMV------------EEVARNASAVAQNTAAAKKSASDASTSA 138 + EDD +PE E + E R V ++ KK Sbjct: 10618 ETVEEDDKQPETTVTVEEVPYEEEKPEEIQELPEEVRVVETVTEDGKPKKKKIRTRVIKK 10677 Query: 139 REAATHAADAADSARAASTSAGQAASSAQ---------SASSSAGTASTKATEASKSAAA 189 + ++ + + T Sbjct: 10678 VKGDKQEVTKIETVEEDDKQPETTVTVEEVPYEEEKPEEIQELPEEVRVVETVTEDGKPK 10737 Query: 190 AESSKSAAATSAGAAKTSETNASASLQSAATSASTATTKASEAATSARDAAAS-KEAAKS 248 + ++ K T + +T T + + E + Sbjct: 10738 KKKIRTRVIKKVKGDKQEVTKIETVEEDDKQPETTVTVEEVPYEEEKPEEIQELPEEVRV 10797 Query: 249 SETNASSSASSAASSATAAGNSAKAAKTSETNARSSETAAGQSASAAAGSKTAAASSASA 308 ET T K K T + E Q + + Sbjct: 10798 VETVTEDGKPKKKKIRTRVIKKVKGDKQEVTKIETVEEDDKQPETTVTVEEVPYEEEKPE 10857 Query: 309 ASTSAGQASASATAAGKSAESAASSASTATTKAGEATEQA----SAAARSASAAKTSETN 364 + + + T K + +Q +T+ T Sbjct: 10858 EIQELPEEVRVVETVTEDGKPKKKKIRTRVIKKVKGDKQEVTKIETVEEDDKQPETTVTV 10917 Query: 365 AKASETSAESSKTAAASSASSAASSASSASASKDEATRQASAAKSSATTASTKATEAAGS 424 + + + + + K + R K E Sbjct: 10918 EEVPYEEEKPEEIQELPEEVRVVETVTEDGKPKKKKIRTRVIKKVKGDKQEVTKIETVEE 10977 Query: 425 ATAAAQSKSTAESAATRAETAAKRAEDIASAVALEDASTTKKGIVQLSSATNSTSETLAA 484 ++ T E E + E +E + K + Sbjct: 10978 DDKQPETTVTVEEVPYEVEKPDEIQELPEEVRVVETVTEDGKPKKKKIRTRVIKKVKGDK 11037 Query: 485 TPKAVKSAYDNAEKRLQKDQNGADIPDK----------GCFLNNINAVSKTDFADKRGMR 534 + +K+ + ++P + + + V++ K+ +R Sbjct: 11038 QEVTKIETVEEDDKQPETTVTVEEVPYEEEKPEEIQELPEEVRVVETVTEDGKPKKKKIR 11097 Query: 535 YVRVN 539 + Sbjct: 11098 TRVIK 11102 Score = 45.9 bits (106), Expect = 0.009, Method: Composition-based stats. 
Identities = 36/485 (7%), Positives = 90/485 (18%), Gaps = 36/485 (7%) Query: 91 GAMTEDDARPEALRRFELMV------------EEVARNASAVAQNTAAAKKSASDASTSA 138 + EDD +PE E + E R V ++ KK Sbjct: 10760 ETVEEDDKQPETTVTVEEVPYEEEKPEEIQELPEEVRVVETVTEDGKPKKKKIRTRVIKK 10819 Query: 139 REAATHAADAADSARAASTSAGQAASSAQ---------SASSSAGTASTKATEASKSAAA 189 + ++ + + T Sbjct: 10820 VKGDKQEVTKIETVEEDDKQPETTVTVEEVPYEEEKPEEIQELPEEVRVVETVTEDGKPK 10879 Query: 190 AESSKSAAATSAGAAKTSETNASASLQSAATSASTATTKA-----SEAATSARDAAASKE 244 + ++ K T + +T T + + + Sbjct: 10880 KKKIRTRVIKKVKGDKQEVTKIETVEEDDKQPETTVTVEEVPYEEEKPEEIQELPEEVRV 10939 Query: 245 AAKSSETNASSSASSAASSATAAGNSAKAAKTSETNARSSETAAGQSASAAAGSKTAAAS 304 +E + ET + + Sbjct: 10940 VETVTEDGKPKKKKIRTRVIKKVKGDKQEVTKIETVEEDDKQPETTVTVEEVPYEVEKPD 10999 Query: 305 SASAASTSAGQASASATAAGKSAESAASSASTATTKAGEATEQASAAARSASAAKTSETN 364 + + + + +T+ T Sbjct: 11000 EIQELPEEVRVVETVTEDGKPKKKKIRTRVIKKVKGDKQEVTKIETVEEDDKQPETTVTV 11059 Query: 365 AKASETSAESSKTAAASSASSAASSASSASASKDEATRQASAAKSSATTASTKATEAAGS 424 + + + + + K + R K E Sbjct: 11060 EEVPYEEEKPEEIQELPEEVRVVETVTEDGKPKKKKIRTRVIKKVKGDKQEVTKIETVEE 11119 Query: 425 ATAAAQSKSTAESAATRAETAAKRAEDIASAVALEDASTTKKGIVQLSSATNSTSETLAA 484 ++ T E E + E +E + K + Sbjct: 11120 DDKQPETTVTVEEVPYEEEKPEEIQELPEEVRVVETVTEDGKPKKKKIRTRVIKKVKGDK 11179 Query: 485 TPKAVKSAYDNAEKRLQKDQNGADIPDK----------GCFLNNINAVSKTDFADKRGMR 534 + +K+ + ++P + + + V++ K+ +R Sbjct: 11180 QEVTKIETVEEDDKQPETTVTVEEVPYEEEKPEEIQELPEEVRVVETVTEDGKPKKKKIR 11239 Query: 535 YVRVN 539 + Sbjct: 11240 TRVIK 11244 Score = 45.9 bits (106), Expect = 0.010, Method: Composition-based stats. 
Identities = 46/485 (9%), Positives = 99/485 (20%), Gaps = 36/485 (7%) Query: 91 GAMTEDDARPEALRRFELMV------------EEVARNASAVAQNTAAAKKSASDASTSA 138 + EDD +PE E + E R V ++ KK Sbjct: 10831 ETVEEDDKQPETTVTVEEVPYEEEKPEEIQELPEEVRVVETVTEDGKPKKKKIRTRVIKK 10890 Query: 139 REAATHAADAADSARAASTSAGQAASSAQ---------SASSSAGTASTKATEASKSAAA 189 + ++ + + T Sbjct: 10891 VKGDKQEVTKIETVEEDDKQPETTVTVEEVPYEEEKPEEIQELPEEVRVVETVTEDGKPK 10950 Query: 190 AESSKSAAATSAGAAKTSETNASASLQSAATSASTATTKASEAATSARDAAAS-KEAAKS 248 + ++ K T + +T T + D E + Sbjct: 10951 KKKIRTRVIKKVKGDKQEVTKIETVEEDDKQPETTVTVEEVPYEVEKPDEIQELPEEVRV 11010 Query: 249 SETNASSSASSAASSATAAGNSAKAAKTSETNARSSETAAGQSASAAAGSKTAAASSASA 308 ET T K K T + E Q + + Sbjct: 11011 VETVTEDGKPKKKKIRTRVIKKVKGDKQEVTKIETVEEDDKQPETTVTVEEVPYEEEKPE 11070 Query: 309 ASTSAGQASASATAAGKSAESAASSASTATTKAGEATEQA----SAAARSASAAKTSETN 364 + + + T K + +Q +T+ T Sbjct: 11071 EIQELPEEVRVVETVTEDGKPKKKKIRTRVIKKVKGDKQEVTKIETVEEDDKQPETTVTV 11130 Query: 365 AKASETSAESSKTAAASSASSAASSASSASASKDEATRQASAAKSSATTASTKATEAAGS 424 + + + + + K + R K E Sbjct: 11131 EEVPYEEEKPEEIQELPEEVRVVETVTEDGKPKKKKIRTRVIKKVKGDKQEVTKIETVEE 11190 Query: 425 ATAAAQSKSTAESAATRAETAAKRAEDIASAVALEDASTTKKGIVQLSSATNSTSETLAA 484 ++ T E E + E +E + K + Sbjct: 11191 DDKQPETTVTVEEVPYEEEKPEEIQELPEEVRVVETVTEDGKPKKKKIRTRVIKKVKGDK 11250 Query: 485 TPKAVKSAYDNAEKRLQKDQNGADIPDK----------GCFLNNINAVSKTDFADKRGMR 534 + +K+ + ++P + + + V++ K+ +R Sbjct: 11251 QEVTKIETVEEDDKQPETTVTVEEVPYEEEKPEEIQELPEEVRVVETVTEDGKPKKKKIR 11310 Query: 535 YVRVN 539 + Sbjct: 11311 TRVIK 11315 Score = 45.5 bits (105), Expect = 0.013, Method: Composition-based stats. 
Identities = 41/445 (9%), Positives = 79/445 (17%), Gaps = 43/445 (9%) Query: 91 GAMTEDDARPEALRRFELMV------------EEVARNASAVAQNTAAAKKSASDASTSA 138 + EDD +PE E + E R V ++ KK Sbjct: 9624 ETVEEDDKQPETTVTVEEVPYEEEKPEEIQELPEEVRVVETVTEDGKPKKKKIRTRVIKK 9683 Query: 139 REAATHAADAADSARAASTSAGQAASSAQ---------SASSSAGTASTKATEASKSAAA 189 + ++ + + T Sbjct: 9684 VKGDKQEVTKIETVEEDDKQPETTVTVEEVPYEEEKPEEIQELPEEVRVVETVTEDGKPK 9743 Query: 190 AESSKSAAATSAGAAKTSETNASASLQSAATSASTATTKA---------------SEAAT 234 + ++ K T + +T T + E Sbjct: 9744 KKKIRTRVIKKVKGDKQEVTKIETVEEDDKQPETTVTVEEVPYEEEKPEEIQELPEEVRV 9803 Query: 235 SARDAAASKEAAKSSETNASSSASSAASSATAAGNSAKAAKTSETNARSSETAAGQSASA 294 K K T T + + K ET E + Sbjct: 9804 VETVTEDGKPKKKKIRTRVIKKVKGDKQEVTKIETAEEDDKQPETTVTVEEVPYEEEKPE 9863 Query: 295 AAGSKTAAASSASAASTSAGQASAS------ATAAGKSAESAASSASTATTKAGEATEQA 348 + G E K E T Sbjct: 9864 EIQELPEEVRVVETVTEDGKPKKKKIRTRVIKKVKGDKQEVTKIETVEEDDKQPETTVTV 9923 Query: 349 SAAARSASAAKTSETNAKASETSAESSKTAAASSASSAASSASSASASKDEATRQASAAK 408 + + + ++ K E T+ + + Sbjct: 9924 EEVPYEEEKPEEIQELPEEVRVVETVTEDGKPKKKKIRTRVIKKVKGDKQEVTKIETVEE 9983 Query: 409 SSATTASTKATEAAGSATAAAQSKSTAESAATRAETAAKRAEDIASAVALEDASTTKKGI 468 +T E + ET + + + K Sbjct: 9984 DDKQPETTVTVEEVPYEEEKPEEIQELPEEVRVVETVTEDGKPKKKKIRTRVIKKVKGDK 10043 Query: 469 VQLSS-ATNSTSETLAATPKAVKSA 492 +++ T + T V+ Sbjct: 10044 QEVTKIETVEEDDKQPETTVTVEEV 10068 Score = 44.8 bits (103), Expect = 0.019, Method: Composition-based stats. 
Identities = 41/445 (9%), Positives = 79/445 (17%), Gaps = 43/445 (9%) Query: 91 GAMTEDDARPEALRRFELMV------------EEVARNASAVAQNTAAAKKSASDASTSA 138 + EDD +PE E + E R V ++ KK Sbjct: 9553 ETVEEDDKQPETTVTVEEVPYEEEKPEEIQELPEEVRVVETVTEDGKPKKKKIRTRVIKK 9612 Query: 139 REAATHAADAADSARAASTSAGQAASSAQ---------SASSSAGTASTKATEASKSAAA 189 + ++ + + T Sbjct: 9613 VKGDKQEVTKIETVEEDDKQPETTVTVEEVPYEEEKPEEIQELPEEVRVVETVTEDGKPK 9672 Query: 190 AESSKSAAATSAGAAKTSETNASASLQSAATSASTATTKA---------------SEAAT 234 + ++ K T + +T T + E Sbjct: 9673 KKKIRTRVIKKVKGDKQEVTKIETVEEDDKQPETTVTVEEVPYEEEKPEEIQELPEEVRV 9732 Query: 235 SARDAAASKEAAKSSETNASSSASSAASSATAAGNSAKAAKTSETNARSSETAAGQSASA 294 K K T T + K ET E + Sbjct: 9733 VETVTEDGKPKKKKIRTRVIKKVKGDKQEVTKIETVEEDDKQPETTVTVEEVPYEEEKPE 9792 Query: 295 AAGSKTAAASSASAASTSAGQASAS------ATAAGKSAESAASSASTATTKAGEATEQA 348 + G E + K E T Sbjct: 9793 EIQELPEEVRVVETVTEDGKPKKKKIRTRVIKKVKGDKQEVTKIETAEEDDKQPETTVTV 9852 Query: 349 SAAARSASAAKTSETNAKASETSAESSKTAAASSASSAASSASSASASKDEATRQASAAK 408 + + + ++ K E T+ + + Sbjct: 9853 EEVPYEEEKPEEIQELPEEVRVVETVTEDGKPKKKKIRTRVIKKVKGDKQEVTKIETVEE 9912 Query: 409 SSATTASTKATEAAGSATAAAQSKSTAESAATRAETAAKRAEDIASAVALEDASTTKKGI 468 +T E + ET + + + K Sbjct: 9913 DDKQPETTVTVEEVPYEEEKPEEIQELPEEVRVVETVTEDGKPKKKKIRTRVIKKVKGDK 9972 Query: 469 VQLSS-ATNSTSETLAATPKAVKSA 492 +++ T + T V+ Sbjct: 9973 QEVTKIETVEEDDKQPETTVTVEEV 9997 Score = 44.4 bits (102), Expect = 0.027, Method: Composition-based stats. 
Identities = 41/445 (9%), Positives = 78/445 (17%), Gaps = 43/445 (9%) Query: 91 GAMTEDDARPEALRRFELMV------------EEVARNASAVAQNTAAAKKSASDASTSA 138 + EDD +PE E + E R V ++ KK Sbjct: 10405 ETVEEDDKQPETTVTVEEVPYEEEKPEEIQELPEEVRVVETVTEDGKPKKKKIRTRVIKK 10464 Query: 139 REAATHAADAADSARAASTSAGQAASSAQ---------SASSSAGTASTKATEASKSAAA 189 + ++ + + T Sbjct: 10465 VKGDKQEVTKIETVEEDDKQPETTVTVEEVPYEEEKPEEIQELPEEVRVVETVTEDGKPK 10524 Query: 190 AESSKSAAATSAGAAKTSETNASASLQSAATSASTATTKA---------------SEAAT 234 + ++ K T + +T T + E Sbjct: 10525 KKKIRTRVIKKVKGDKQEVTKIETVEEDDKQPETTVTVEEVPYEEEKPEEIQELPEEVRV 10584 Query: 235 SARDAAASKEAAKSSETNASSSASSAASSATAAGNSAKAAKTSETNARSSETAAGQSASA 294 K K T T + K ET E + Sbjct: 10585 VETVTEDGKPKKKKIRTRVIKKVKGDKQEVTKIETVEEDDKQPETTVTVEEVPYEEEKPE 10644 Query: 295 AAGSKTAAASSASAASTSAGQASAS------ATAAGKSAESAASSASTATTKAGEATEQA 348 + G E K E T Sbjct: 10645 EIQELPEEVRVVETVTEDGKPKKKKIRTRVIKKVKGDKQEVTKIETVEEDDKQPETTVTV 10704 Query: 349 SAAARSASAAKTSETNAKASETSAESSKTAAASSASSAASSASSASASKDEATRQASAAK 408 + + + ++ K E T+ + + Sbjct: 10705 EEVPYEEEKPEEIQELPEEVRVVETVTEDGKPKKKKIRTRVIKKVKGDKQEVTKIETVEE 10764 Query: 409 SSATTASTKATEAAGSATAAAQSKSTAESAATRAETAAKRAEDIASAVALEDASTTKKGI 468 +T E + ET + + + K Sbjct: 10765 DDKQPETTVTVEEVPYEEEKPEEIQELPEEVRVVETVTEDGKPKKKKIRTRVIKKVKGDK 10824 Query: 469 VQLSS-ATNSTSETLAATPKAVKSA 492 +++ T + T V+ Sbjct: 10825 QEVTKIETVEEDDKQPETTVTVEEV 10849 Score = 44.4 bits (102), Expect = 0.028, Method: Composition-based stats. 
Identities = 46/485 (9%), Positives = 99/485 (20%), Gaps = 36/485 (7%) Query: 91 GAMTEDDARPEALRRFELMV------------EEVARNASAVAQNTAAAKKSASDASTSA 138 + EDD +PE E + E R V ++ KK Sbjct: 9908 ETVEEDDKQPETTVTVEEVPYEEEKPEEIQELPEEVRVVETVTEDGKPKKKKIRTRVIKK 9967 Query: 139 REAATHAADAADSARAASTSAGQAASSAQ---------SASSSAGTASTKATEASKSAAA 189 + ++ + + T Sbjct: 9968 VKGDKQEVTKIETVEEDDKQPETTVTVEEVPYEEEKPEEIQELPEEVRVVETVTEDGKPK 10027 Query: 190 AESSKSAAATSAGAAKTSETNASASLQSAATSASTATTKASEAATSARDAAAS-KEAAKS 248 + ++ K T + +T T + + E + Sbjct: 10028 KKKIRTRVIKKVKGDKQEVTKIETVEEDDKQPETTVTVEEVPYEEEKPEEIQELPEEVRV 10087 Query: 249 SETNASSSASSAASSATAAGNSAKAAKTSETNARSSETAAGQSASAAAGSKTAAASSASA 308 ET T K K T + E Q + + Sbjct: 10088 VETVTEDGKPKKKKIRTRVIKKVKGDKQEVTKIETVEENDKQPETTVTVEEVPYEEEKPE 10147 Query: 309 ASTSAGQASASATAAGKSAESAASSASTATTKAGEATEQA----SAAARSASAAKTSETN 364 + + + T K + +Q KT+ T Sbjct: 10148 EIQELPEEVRVVETVTEDGKPKKKKIRTRVIKKVKGDKQEVTKIETVEEDDKQPKTTVTV 10207 Query: 365 AKASETSAESSKTAAASSASSAASSASSASASKDEATRQASAAKSSATTASTKATEAAGS 424 + + + + + K + R K E Sbjct: 10208 EEVPYEEEKPEEIQELPEEVRVVETVTEDGKPKKKKIRTRVIKKVKGDNQEVTKIETVEE 10267 Query: 425 ATAAAQSKSTAESAATRAETAAKRAEDIASAVALEDASTTKKGIVQLSSATNSTSETLAA 484 ++ T E E + E +E + K + Sbjct: 10268 DDKQPETTVTVEEVPYEEEKPEEIQELPEEVRVVETVTEDGKPKKKKIRTRVIKKVKGDK 10327 Query: 485 TPKAVKSAYDNAEKRLQKDQNGADIPDK----------GCFLNNINAVSKTDFADKRGMR 534 + +K+ + ++P + + + V++ K+ +R Sbjct: 10328 QEVTKIETVEEDDKQPETTVTVEEVPYEEEKPEEIQELPEEVRVVETVTEDGKPKKKKIR 10387 Query: 535 YVRVN 539 + Sbjct: 10388 TRVIK 10392 Score = 44.0 bits (101), Expect = 0.035, Method: Composition-based stats. 
Identities = 41/445 (9%), Positives = 78/445 (17%), Gaps = 43/445 (9%) Query: 91 GAMTEDDARPEALRRFELMV------------EEVARNASAVAQNTAAAKKSASDASTSA 138 + EDD +PE E + E R V ++ KK Sbjct: 9411 ETVEEDDKQPETTVTVEEVPYEEEKPEEIQELPEEVRVVETVTEDGKPKKKKIRTRVIKK 9470 Query: 139 REAATHAADAADSARAASTSAGQAASSAQ---------SASSSAGTASTKATEASKSAAA 189 + ++ + + T Sbjct: 9471 VKGDKQEVTKIETVEEDDKQPETTVTVEEVPYEEEKPEEIQELPEEVRVVETVTEDGKPK 9530 Query: 190 AESSKSAAATSAGAAKTSETNASASLQSAATSASTATTKA---------------SEAAT 234 + ++ K T + +T T + E Sbjct: 9531 KKKIRTRFIKKVKGDKQEVTKIETVEEDDKQPETTVTVEEVPYEEEKPEEIQELPEEVRV 9590 Query: 235 SARDAAASKEAAKSSETNASSSASSAASSATAAGNSAKAAKTSETNARSSETAAGQSASA 294 K K T T + K ET E + Sbjct: 9591 VETVTEDGKPKKKKIRTRVIKKVKGDKQEVTKIETVEEDDKQPETTVTVEEVPYEEEKPE 9650 Query: 295 AAGSKTAAASSASAASTSAGQASAS------ATAAGKSAESAASSASTATTKAGEATEQA 348 + G E K E T Sbjct: 9651 EIQELPEEVRVVETVTEDGKPKKKKIRTRVIKKVKGDKQEVTKIETVEEDDKQPETTVTV 9710 Query: 349 SAAARSASAAKTSETNAKASETSAESSKTAAASSASSAASSASSASASKDEATRQASAAK 408 + + + ++ K E T+ + + Sbjct: 9711 EEVPYEEEKPEEIQELPEEVRVVETVTEDGKPKKKKIRTRVIKKVKGDKQEVTKIETVEE 9770 Query: 409 SSATTASTKATEAAGSATAAAQSKSTAESAATRAETAAKRAEDIASAVALEDASTTKKGI 468 +T E + ET + + + K Sbjct: 9771 DDKQPETTVTVEEVPYEEEKPEEIQELPEEVRVVETVTEDGKPKKKKIRTRVIKKVKGDK 9830 Query: 469 VQLSS-ATNSTSETLAATPKAVKSA 492 +++ T + T V+ Sbjct: 9831 QEVTKIETAEEDDKQPETTVTVEEV 9855 Score = 44.0 bits (101), Expect = 0.037, Method: Composition-based stats. 
Identities = 42/445 (9%), Positives = 79/445 (17%), Gaps = 43/445 (9%) Query: 91 GAMTEDDARPEALRRFELMV------------EEVARNASAVAQNTAAAKKSASDASTSA 138 + EDD +PE E + E R V ++ KK Sbjct: 9482 ETVEEDDKQPETTVTVEEVPYEEEKPEEIQELPEEVRVVETVTEDGKPKKKKIRTRFIKK 9541 Query: 139 REAATHAADAADSARAASTSAGQAASSAQ---------SASSSAGTASTKATEASKSAAA 189 + ++ + + T Sbjct: 9542 VKGDKQEVTKIETVEEDDKQPETTVTVEEVPYEEEKPEEIQELPEEVRVVETVTEDGKPK 9601 Query: 190 AESSKSAAATSAGAAKTSETNASASLQSAATSASTATTKA---------------SEAAT 234 + ++ K T + +T T + E Sbjct: 9602 KKKIRTRVIKKVKGDKQEVTKIETVEEDDKQPETTVTVEEVPYEEEKPEEIQELPEEVRV 9661 Query: 235 SARDAAASKEAAKSSETNASSSASSAASSATAAGNSAKAAKTSETNARSSETAAGQSASA 294 K K T T + K ET E + Sbjct: 9662 VETVTEDGKPKKKKIRTRVIKKVKGDKQEVTKIETVEEDDKQPETTVTVEEVPYEEEKPE 9721 Query: 295 AAGSKTAAASSASAASTSAGQASAS------ATAAGKSAESAASSASTATTKAGEATEQA 348 + G E K E T Sbjct: 9722 EIQELPEEVRVVETVTEDGKPKKKKIRTRVIKKVKGDKQEVTKIETVEEDDKQPETTVTV 9781 Query: 349 SAAARSASAAKTSETNAKASETSAESSKTAAASSASSAASSASSASASKDEATRQASAAK 408 + + + ++ K E T+ +A + Sbjct: 9782 EEVPYEEEKPEEIQELPEEVRVVETVTEDGKPKKKKIRTRVIKKVKGDKQEVTKIETAEE 9841 Query: 409 SSATTASTKATEAAGSATAAAQSKSTAESAATRAETAAKRAEDIASAVALEDASTTKKGI 468 +T E + ET + + + K Sbjct: 9842 DDKQPETTVTVEEVPYEEEKPEEIQELPEEVRVVETVTEDGKPKKKKIRTRVIKKVKGDK 9901 Query: 469 VQLSS-ATNSTSETLAATPKAVKSA 492 +++ T + T V+ Sbjct: 9902 QEVTKIETVEEDDKQPETTVTVEEV 9926 Score = 43.2 bits (99), Expect = 0.067, Method: Composition-based stats. 
Identities = 41/445 (9%), Positives = 78/445 (17%), Gaps = 43/445 (9%) Query: 91 GAMTEDDARPEALRRFELMV------------EEVARNASAVAQNTAAAKKSASDASTSA 138 + EDD +PE E + E R V ++ KK Sbjct: 10334 ETVEEDDKQPETTVTVEEVPYEEEKPEEIQELPEEVRVVETVTEDGKPKKKKIRTRVIKK 10393 Query: 139 REAATHAADAADSARAASTSAGQAASSAQ---------SASSSAGTASTKATEASKSAAA 189 + ++ + + T Sbjct: 10394 VKGDMQEVTKIETVEEDDKQPETTVTVEEVPYEEEKPEEIQELPEEVRVVETVTEDGKPK 10453 Query: 190 AESSKSAAATSAGAAKTSETNASASLQSAATSASTATTKA---------------SEAAT 234 + ++ K T + +T T + E Sbjct: 10454 KKKIRTRVIKKVKGDKQEVTKIETVEEDDKQPETTVTVEEVPYEEEKPEEIQELPEEVRV 10513 Query: 235 SARDAAASKEAAKSSETNASSSASSAASSATAAGNSAKAAKTSETNARSSETAAGQSASA 294 K K T T + K ET E + Sbjct: 10514 VETVTEDGKPKKKKIRTRVIKKVKGDKQEVTKIETVEEDDKQPETTVTVEEVPYEEEKPE 10573 Query: 295 AAGSKTAAASSASAASTSAGQASAS------ATAAGKSAESAASSASTATTKAGEATEQA 348 + G E K E T Sbjct: 10574 EIQELPEEVRVVETVTEDGKPKKKKIRTRVIKKVKGDKQEVTKIETVEEDDKQPETTVTV 10633 Query: 349 SAAARSASAAKTSETNAKASETSAESSKTAAASSASSAASSASSASASKDEATRQASAAK 408 + + + ++ K E T+ + + Sbjct: 10634 EEVPYEEEKPEEIQELPEEVRVVETVTEDGKPKKKKIRTRVIKKVKGDKQEVTKIETVEE 10693 Query: 409 SSATTASTKATEAAGSATAAAQSKSTAESAATRAETAAKRAEDIASAVALEDASTTKKGI 468 +T E + ET + + + K Sbjct: 10694 DDKQPETTVTVEEVPYEEEKPEEIQELPEEVRVVETVTEDGKPKKKKIRTRVIKKVKGDK 10753 Query: 469 VQLSS-ATNSTSETLAATPKAVKSA 492 +++ T + T V+ Sbjct: 10754 QEVTKIETVEEDDKQPETTVTVEEV 10778 Score = 42.8 bits (98), Expect = 0.081, Method: Composition-based stats. 
Identities = 34/443 (7%), Positives = 82/443 (18%), Gaps = 19/443 (4%) Query: 91 GAMTEDDARPEALRRFELMV---------EEVARNASAVAQNTAAAKKSASDASTSAREA 141 EDD +PE E + +E+ V T K T + Sbjct: 9837 ETAEEDDKQPETTVTVEEVPYEEEKPEEIQELPEEVRVVETVTEDGKPKKKKIRTRVIKK 9896 Query: 142 ATHAADAADSARAASTSAGQAASSAQSASSSAGTASTKATEASKSAAAAESSKSAAATSA 201 Q ++ + + + + Sbjct: 9897 VKGDKQEVTKIETVEEDDKQPETTVTVEEVPYEEEKPEEIQELPEEVRVVETVTEDGKPK 9956 Query: 202 GAAKTSETNASASLQSAATSASTATTKASEAATSARDAAASKEAAKSSETNASSSASSAA 261 K T ++ + T + + + + Sbjct: 9957 K--KKIRTRVIKKVKGDKQEVTKIETVEEDDKQPETTVTVEEVPYEEEKPEEIQELPEEV 10014 Query: 262 SSATAAGNSAKAAKTSETNA--RSSETAAGQSASAAAGSKTAAASSASAASTSAGQASAS 319 K K + + + + + Sbjct: 10015 RVVETVTEDGKPKKKKIRTRVIKKVKGDKQEVTKIETVEEDDKQPETTVTVEEVPYEEEK 10074 Query: 320 ATAAGKSAESAASSASTATTKAGEATEQASAAARSASAAKTSETNAKASETSAESSKTAA 379 + E + + + + + K T + E + + +T Sbjct: 10075 PEEIQELPEEVRVVETVTEDGKPKKKKIRTRVIKKVKGDKQEVTKIETVEENDKQPETTV 10134 Query: 380 ASSASSAASSASSASASKDEATRQASAAKSSATTASTKATEAAGSATAAAQSKSTAESAA 439 E R K + + T Sbjct: 10135 TVEEVPYEEEKPEEIQELPEEVRVVETVTEDGKPKKKKIRTRVIKKVKGDKQEVTKIETV 10194 Query: 440 TRAETAAKRAEDIASAVALEDASTTKKGIVQLSSATNSTSETLAATPKAVKS------AY 493 + K + E+ + + + + +E K +++ Sbjct: 10195 EEDDKQPKTTVTVEEVPYEEEKPEEIQELPEEVRVVETVTEDGKPKKKKIRTRVIKKVKG 10254 Query: 494 DNAEKRLQKDQNGADIPDKGCFL 516 DN E + D + Sbjct: 10255 DNQEVTKIETVEEDDKQPETTVT 10277 Score = 42.8 bits (98), Expect = 0.085, Method: Composition-based stats. 
Identities = 40/445 (8%), Positives = 77/445 (17%), Gaps = 43/445 (9%) Query: 91 GAMTEDDARPEALRRFELMV------------EEVARNASAVAQNTAAAKKSASDASTSA 138 + EDD +PE E + E R V ++ KK Sbjct: 10263 ETVEEDDKQPETTVTVEEVPYEEEKPEEIQELPEEVRVVETVTEDGKPKKKKIRTRVIKK 10322 Query: 139 REAATHAADAADSARAASTSAGQAASSAQ---------SASSSAGTASTKATEASKSAAA 189 + ++ + + T Sbjct: 10323 VKGDKQEVTKIETVEEDDKQPETTVTVEEVPYEEEKPEEIQELPEEVRVVETVTEDGKPK 10382 Query: 190 AESSKSAAATSAGAAKTSETNASASLQSAATSASTATTKA---------------SEAAT 234 + ++ T + +T T + E Sbjct: 10383 KKKIRTRVIKKVKGDMQEVTKIETVEEDDKQPETTVTVEEVPYEEEKPEEIQELPEEVRV 10442 Query: 235 SARDAAASKEAAKSSETNASSSASSAASSATAAGNSAKAAKTSETNARSSETAAGQSASA 294 K K T T + K ET E + Sbjct: 10443 VETVTEDGKPKKKKIRTRVIKKVKGDKQEVTKIETVEEDDKQPETTVTVEEVPYEEEKPE 10502 Query: 295 AAGSKTAAASSASAASTSAGQASAS------ATAAGKSAESAASSASTATTKAGEATEQA 348 + G E K E T Sbjct: 10503 EIQELPEEVRVVETVTEDGKPKKKKIRTRVIKKVKGDKQEVTKIETVEEDDKQPETTVTV 10562 Query: 349 SAAARSASAAKTSETNAKASETSAESSKTAAASSASSAASSASSASASKDEATRQASAAK 408 + + + ++ K E T+ + + Sbjct: 10563 EEVPYEEEKPEEIQELPEEVRVVETVTEDGKPKKKKIRTRVIKKVKGDKQEVTKIETVEE 10622 Query: 409 SSATTASTKATEAAGSATAAAQSKSTAESAATRAETAAKRAEDIASAVALEDASTTKKGI 468 +T E + ET + + + K Sbjct: 10623 DDKQPETTVTVEEVPYEEEKPEEIQELPEEVRVVETVTEDGKPKKKKIRTRVIKKVKGDK 10682 Query: 469 VQLSS-ATNSTSETLAATPKAVKSA 492 +++ T + T V+ Sbjct: 10683 QEVTKIETVEEDDKQPETTVTVEEV 10707 >UniRef50_C1FYY4 Predicted protein n=3 Tax=Paracoccidioides brasiliensis RepID=C1FYY4_PARBD Length = 1654 Score = 52.1 bits (122), Expect = 1e-04, Method: Composition-based stats. 
Identities = 56/479 (11%), Positives = 116/479 (24%), Gaps = 3/479 (0%) Query: 50 SMDVEYGQYSVILLVEGFPPSHAGTITVYEDSQPGTLNDFLGAMTEDDARPEALRRFELM 109 S DV GQ+ + G N+ + + + Sbjct: 69 SFDVATGQHQYKFRLGPDGAWFCDKDATTVVDDDGFENNVVAVEAVPIIQKPNNSAKKDD 128 Query: 110 VEEVARNASAVAQNTAAAKKSASDASTSAREAATHAADAADSARAASTSAGQAASSAQSA 169 + + + A D A + + + A AG + Sbjct: 129 SDPQNVASETTVLDQEEKVDGAKDGDGEATKIEILSDAKETNDVPAEPLAGGDDVEKAAK 188 Query: 170 SSSAGTASTKATEASKSAAAAESSKSAAATSAGAAKTSETNASASLQSAATSASTATTKA 229 ++ E KSAA S A S ++ S + L + Sbjct: 189 DIKDDDVPVQSVENWKSAADTTKVDSIPADSTIKSEASTDDILPELAQPTEHTAAEVLAV 248 Query: 230 SEAATSARDAAASKEAAKSSETNASSSASSAASSATAAGNSAKAAKTSETNARSSETAAG 289 E + AA +++A+ ++T A + A ++ AK Sbjct: 249 GE---DPQTAAVTQDASIVADTVVDEKLGDVADVSEAKDIISEPAKVPVEALTDLVHRPA 305 Query: 290 QSASAAAGSKTAAASSASAASTSAGQASASATAAGKSAESAASSASTATTKAGEATEQAS 349 + K A + + A + +A + TAT +A + Sbjct: 306 AAKEEPPAEKEAGKTKEEDDTAQATERAAKTDILTTPTDLKVEPLETATIVTEDAASNDN 365 Query: 350 AAARSASAAKTSETNAKASETSAESSKTAAASSASSAASSASSASASKDEATRQASAAKS 409 A + A + + ++A + AK+ Sbjct: 366 EPTAEPVEALKPIEQETKVDPLALTEEEEEKEEEEEEKEEETAAKTEEPVEPVVNVPAKA 425 Query: 410 SATTASTKATEAAGSATAAAQSKSTAESAATRAETAAKRAEDIASAVALEDASTTKKGIV 469 + A ++ + A A A + + Sbjct: 426 DSKPGEAAADIEHQEVEQKVIEHRRSQEDILAGDDKAPEAALAPPAPPAPPKQPKQTPVP 485 Query: 470 QLSSATNSTSETLAATPKAVKSAYDNAEKRLQKDQNGADIPDKGCFLNNINAVSKTDFA 528 S T + A P+ +S +AE+ + + + + + Sbjct: 486 AASEPTEIPAVDTPAEPEPTQSHAVDAEQTTMVNGDIKKGYEGDEKAEEVAEAPADQLS 544 >UniRef50_C0FZM7 Putative uncharacterized protein n=1 Tax=Roseburia inulinivorans DSM 16841 RepID=C0FZM7_9FIRM Length = 592 Score = 51.3 bits (120), Expect = 2e-04, Method: Composition-based stats. 
Identities = 54/380 (14%), Positives = 121/380 (31%), Gaps = 9/380 (2%) Query: 28 RNSTTVVVNTLASENPDEAGRYSMDVEYGQYSVILLVEGFPPSHAGTITVYEDSQPGTLN 87 S + S + + + YS+ +E + + V+ G + + Sbjct: 53 VTSGQITSKVRGSGSVEASESYSVTIEETRKIATVNVKKDAEVATGDLLFTLEDTDS--- 109 Query: 88 DFLGAMTEDDARPEALRRFELMVEEVARNASAVAQNTAAAKKSASDASTSAREAATHAAD 147 E DA ++L + E A + + + T + A Sbjct: 110 ------DELDAAKKSLNEAQAAYESAVLTAGITVAERQSIEAGKGSSLTQKQNEIAAANQ 163 Query: 148 AADSARAASTSAGQAASSAQSASSSAGTASTKATEASKSAAAAESSKSAAATSAGAAKTS 207 A+AA +A + ++ + ++ T K+ AE S A S +A+++ Sbjct: 164 RVKDAQAAVDAAQASVDKIKAQIDAVSNSTADTTAEEKAVLDAEKKNSEAQDSLTSAESA 223 Query: 208 ETNASASLQSAATSASTATTKASEAATSARDAAASKEAAKSSETNASSSASSAASSATAA 267 T ++ +A S A EA + AS A + + A ++ ++A A Sbjct: 224 YTPVKSAYDTALESLQRAQADLEEAKATRDALNASSSATAADKQTAETAVATAQVKVNTA 283 Query: 268 GNSAKAAKTSETNARSSETAAGQSASAAAGSKTAAASSASAASTSAGQASASATAAGKSA 327 ++ K + + + S +A +A+ + + + A + S + + + + Sbjct: 284 DSNFKTCQANLNKVQGSYDSAKSAATDSKNALSNANYNLSVKKLTGTNTAEANNLQAQLN 343 Query: 328 ESAASSASTATTKAGEATEQASAAARSASAAKTSETNAKASETSAESSKTAAASSASSAA 387 + + T +Q + + + ++ E +K A S + Sbjct: 344 TATVALTDANTALTSATNDQKKVTDKISGEVTIASAYKTMTDLQEEVAKLQAKSIGTEIT 403 Query: 388 SSASSASASKDEATRQASAA 407 S S A Sbjct: 404 SPISGTVTDIAVTAGTTVNA 423 >UniRef50_C5VC26 Secreted cell wall-associated hydrolase n=2 Tax=Corynebacterium matruchotii RepID=C5VC26_9CORY Length = 500 Score = 51.3 bits (120), Expect = 2e-04, Method: Composition-based stats. 
Identities = 59/469 (12%), Positives = 125/469 (26%), Gaps = 28/469 (5%) Query: 219 ATSASTATTKASEAATSARDAAASKEAAKSSETNASSSASSAASSATAAGNSAKAAKTSE 278 + + A A A A + A+++ + A N +S Sbjct: 41 REAVNKALVDFHSAQADATKARTEADQARAALDTTQGQITKAQKVLDDISNIVYRGSSSS 100 Query: 279 TNARSSETAAGQSASAAAGSKTAAASSASAASTSAGQASASATAAGKSAESAASSASTAT 338 + + A A T + T +A A Sbjct: 101 AISGVVSKPKAEDTLDRHTLLRTNADKQRDAMTELDKLRTQQTNEESGLRAARDVAEKRE 160 Query: 339 TKAGEATEQASAAARSASAAKTSETNAKASETSAESSKTAAASSASSAASSASSASASKD 398 + +A + A A A+A A+ ++ + + A ++ + + Sbjct: 161 QETDKAKQDAQKAIDDAAAELKQHQAEHAALVASRDAAQK--ELDTIQAKQKTTTAKATT 218 Query: 399 EATRQASAAKSSATTASTKATEAAGSATAAAQSKSTAESAATRAETAAKRAEDIASAVAL 458 A + + A A + A S + + + A+ + V Sbjct: 219 SAAPTPTTPAKQKSEAKPTEATAIADSIAKIVGSSQPDHTSLNLKPVAENITLTENDVDD 278 Query: 459 EDASTTKKGIVQLSSATNSTSETLAATPKAVKSAYDNAEKRLQKDQNGADIPDKGCFLNN 518 ED + Q + T + +++ + + A + L D + G + Sbjct: 279 EDDDEEENEDTQDQAQTQAPAQSQGGSTNPTQEANKVKDGELTLDMHALSSNGPGSVADL 338 Query: 519 INAVSKTDFADKRGMRYVRVNAPAGATSGKYYPVVVMRSAGSVSELASRVIITTATRTAG 578 + +S+ G + N V+ R+ + + T G Sbjct: 339 LKKLSEA-----GGAGSDQANINLNGDRASKIETVIARAESQLGVPYAWGGGDANGPTLG 393 Query: 579 DPMN--------------NCEFNGFVMPGGWTDRGRYAYGMFWQYQNNERAIHSIMMSNK 624 +C G + G +Q+ + + Sbjct: 394 IRDGGVADSFGDFEKVGFDCSGLTLYAFAGVGIALPHYTGYQYQFGTK-------VSPQE 446 Query: 625 GDDLRSVFYVDGAAFPVFAFIEDGLSISAPGADLVVNDTTYKFGATNPA 673 +FY A V ++ DG I AP + V + ++G +P Sbjct: 447 MQRGDLIFYGANAEDHVAIYLGDGQMIEAPQSGSEVVVSPVRWGGMSPQ 495 >UniRef50_D2H4Z1 Putative uncharacterized protein n=1 Tax=Ailuropoda melanoleuca RepID=D2H4Z1_AILME Length = 2496 Score = 51.3 bits (120), Expect = 2e-04, Method: Composition-based stats. 
Identities = 97/762 (12%), Positives = 205/762 (26%), Gaps = 18/762 (2%) Query: 323 AGKSAESAASSASTATTKAGEATEQASAAARSASAAKTSETNAKASETSAESSKTAAASS 382 A+ S+ ++ + + S + + T S TA + + Sbjct: 193 KPAVAQGTTSTQKPTGQTVEKSNKSQHTTISLGITSTHSPSVSGYDPTDHNSQDTATSGT 252 Query: 383 ASSAASSASSASASKDEATRQASAAKSSATTASTKATEAAGSATAAAQSKSTAESAATRA 442 AS ++S+ S S +SA+ + +++S+ + S Sbjct: 253 ASMPSTSSVSHKPSTSSEMPTSSASTADISSSSSANNTVSISLEVPTTPIGAHTPGGNTK 312 Query: 443 ETAAKRAEDIASAVALEDASTTKKGIVQLSSATNSTSETLAATPKAVKSAYDNAEKRLQK 502 T+ + +T ++G S+ S ET A++ + + + Sbjct: 313 STSQTETGSFSPVTTAVSMTTEQEGPSTAHSSFTSAPETTASSHHHTSQTVETSRQSQSA 372 Query: 503 DQNGADIPDKGCFLNNINAVSKTDFADKRGMRYVRVNAPAGATSGKYYPVVVMRSAGSVS 562 +I++ T N + + SG + PV S + Sbjct: 373 STADISSSSSAKSTVSISSEVPTVPIRTDTPGRNTKNT-SHSESGNFSPVTATVSMTTER 431 Query: 563 ELASRVIITTATRTAGDPMNNCEFNGFVMPGGWTDRGRYAYGMFWQYQNNERAIHSIMMS 622 E SR + + + + + + G + G ++ + H Sbjct: 432 EGPSRARSSFTSAPETTASSQHHTSQTMETSRQSQSGTVSQG-ITSTPSSSLSGHPSTDR 490 Query: 623 NKGDDLRSVFYVDGAAFPVFAFIEDGLSISAPGADLVVNDTTYKFGATNPATECIAADVI 682 D S P + S P + D + A+N + Sbjct: 491 FSQDTATS--GTASTPSPSSESHKPSTSSEIPTSSASTADISSSSIASNTHSVSSEVPTT 548 Query: 683 LDFKSGRGFYE-SHSLIVNDNLSCKKLFATDEIVARGGNQIRMIGGEYGALWRNDGAKTY 741 + G + S + S + G ++ R A ++ + Sbjct: 549 PIRRDTPGRNTKNMSQTETGSFSPDTATVSMTTEREGPSRARSSFTS--APETTASSQHH 606 Query: 742 LLLTNQGDVYGGWNTLRPFAIDNATGELVIGTKLSASLNGNALTATKLQTPRRVSGVEFD 801 T + T+ + L A + T + Sbjct: 607 TSQTMETSRQSQSGTVSQGITSTPSSSLSGHPSTDRFSQDTATSGTASTPSPSSESHKPS 666 Query: 802 GSKDITLTAAHVAAFARRATDTYADADGGVPWNAESGAYNVTRSGDSYILV---NFYTGV 858 S +I ++A A ++ + A V + G + + + Sbjct: 667 TSSEIPTSSAST---ADISSSSIASNTHSVSSEVPTTPIRRDTPGRNTKNMSHTETGSFG 723 Query: 859 GSCRTLQMKAHYRNGGLFYRSSRDGYG--FEEDWAEVYTSKNLPPESYPVGAP-IPWPSD 915 + M + S + T + P I S Sbjct: 724 PVTAIVSMITDLEGPSTAHLSFTSAPEIMASSQYHTSQTMETNRQSQTSPILPGITSTSS 783 Query: 916 TVPSGYALMQGQAFDKSAYPKLAAAYPSGVIPDMRGWTIKGKPASGRAVLSQEQDGIKSH 975 + SG+ + + D + + PS + + +P+ + G Sbjct: 784 
SPLSGHTPTEDISQDPATLGTANTSLPSDKSNETSPQSEGIQPSIAHSFSMSASSGNLGT 843 Query: 976 THSASASSTDLGTKTTSSFDYGTKSTNNTGAH--THSVSGSTNSAGAHTHSLANVNTASA 1033 TH S + + S G T ++ HS S S+ + + + + Sbjct: 844 THIPSNFYSSTSNSSFSQSALGFAVTWSSATLFSGHSSPLSITSSFPPSTASSMSTVKTQ 903 Query: 1034 NSGAGSASTRLSVVHNQNYATSSAGAHTHSLSGTAASAGAHA 1075 SG + S R+ + S + TH+ S G H+ Sbjct: 904 TSGTITTSQRIGSQRRTTPSVPSTTSQTHTTFSPRPSIGTHS 945 >UniRef50_UPI000023CDE0 hypothetical protein FG09994.1 n=1 Tax=Gibberella zeae PH-1 RepID=UPI000023CDE0 Length = 1220 Score = 50.9 bits (119), Expect = 3e-04, Method: Composition-based stats. Identities = 93/752 (12%), Positives = 206/752 (27%), Gaps = 6/752 (0%) Query: 100 PEALRRFELMVEEVARNASAVAQNTAAAKKSASDASTSAREAATHAADAADSARAASTSA 159 PE + A K S++++ A E + A A Sbjct: 261 PEEDNDEPTPTKVTAAKKRKADDEDTDMKDSSTNSKRRAPEHEEDVQEQQAPAPAPVLVP 320 Query: 160 GQAASSAQSASSSAGTASTKATEASKSAAAAESSKSAAATSAGAAKTSETNASASLQSAA 219 + SS + A ++ ++ K A+A S + + + S + A Sbjct: 321 TSTLGKNKRRSSISDEADSQPSKMQKGQASAAKSLFEKIANKSSTTPVSSPLKPSAKPAE 380 Query: 220 TSASTATTKASEAATSARDAAASKEAAKSSETNASSSASSAASSATAAGNSAKAAKTSET 279 +A+ + T+ ++ ++ ++ + +++ AS + S+ + Sbjct: 381 DNAAAKPNPFAFNKTNGSGSSLARSIFQNPKPTSTAGASGGNIFGYLSDASSAKNSGVDA 440 Query: 280 NARSSETAAGQSASAAAGSKTAAASSASAASTSAGQASASATAAGKSAESAASSASTATT 339 +A S + + S AA+ + + + G + A +++ + +T Sbjct: 441 DAESEADSDAEDDSQGDEPSAAASGAETVSQVGNGLFGQKTALSSGLAAGSSAPGTREST 500 Query: 340 KAGEATEQASAAARSASAAKTSETNAKASETSAESSKTAAASSASSAASSASSASASKDE 399 ++ + + A + + + S + + S+ +++ Sbjct: 501 PGPSLFDRVTKDTDGQPMRLEDKAEVPAEKPFPKLADQTWNPSTTPIKFAPSAPASTGQA 560 Query: 400 ATR--QASAAKSSATTASTKATEAAGSATAAAQSKSTAESAATRAETAAKRAEDIASAVA 457 A+ +A++A +S+ A+ AT + A K++ S + ++ A Sbjct: 561 ASLFGKATSAPTSSLFANKPATISNLFGAAKPAEKTSTPSDHADKTGGDESDKENDEAPK 620 Query: 458 LEDASTTKKGIVQLSSATNSTSETLAATPKAVKSAYDNAEKRLQKDQNGADIPDKGCFLN 517 + + S T A + D+ P F Sbjct: 621 KSMFESKASAAQPSFGSMFSKPATEAVKAPEPAKPVTSLFGAKPDDKASTPAPTANLFGA 680 Query: 518 
NINAVSKTDFADKRGMRYVRVNAPAGATSGKYYPVVVMRSAGSVSELASRVIITTATRTA 577 + + + + P + A S S T+ A Sbjct: 681 VSKPAESSGPVLQSSTLFGAKPTGDDKPAATEAPKTSLFGAPSTSAANGESTTATSLFGA 740 Query: 578 GDPMNNCEFNGFVMPGGWTDRGRYAYGMFWQYQNNERAIHSIMMSNKGDDLRSVFYVDGA 637 G + + M S G ++GA Sbjct: 741 KPTTTASNLFGNTSTPAAAPLFGAPSSSDATAAKKDTPATTSMFSF-GGASNGADKINGA 799 Query: 638 AFPVFAFIEDGLSISAPGADLVVNDTTYKFGATNPATECIAADVILDFKSGRGFYESHSL 697 A P+F S ++ + K +PA + + + + Sbjct: 800 AKPLF---GAPQSPKPSSGTAGLDGSPMKQDEPSPAKRAFNGGTVSGASAPIFSFGGSTT 856 Query: 698 IVNDNLSCKKLFATDEIVARGGNQIRMIGGEYGALWRNDGAKTYLLLTNQGDVYGGWNTL 757 S + GG + GA + ++ + + G GG + Sbjct: 857 PAAAPPSFGGGASGTSTPLFGGASTTPAANDVGASFGSNSSASGSFGFQFGGGGGGNSAS 916 Query: 758 RPFAIDNATGELVIGTKLSASLNGNALTATKLQTPRRVSGVEFDGSKDITLTAAHVAAFA 817 F A G + + S +G + P SG A A Sbjct: 917 SSFNNPFAPGNNGGDSGAAPSSSGGMFSFGASSAPSAPSGGAAPFQFGGPSNATAFGANN 976 Query: 818 RRATDTYADADGGVPWNAESGAYNVTRSGDSY 849 A G P + +GA + ++ Sbjct: 977 STPAFGGASGSSGAPGFSFTGASPAQNATPTF 1008 >UniRef50_Q04HY6 Choline binding protein A n=135 Tax=Streptococcus pneumoniae RepID=Q04HY6_STRP2 Length = 701 Score = 50.5 bits (118), Expect = 3e-04, Method: Composition-based stats. 
Identities = 70/678 (10%), Positives = 151/678 (22%), Gaps = 51/678 (7%) Query: 118 SAVAQNTAAAKKSASDASTSAREAATHAADA--ADSARAASTSAGQAASSAQSASSSAGT 175 + + + A S++ A T R+AA D R + + + Sbjct: 37 ATENEGSTQAATSSNMAKTEHRKAAKQVVDEYIEKMLREIQLDRRKHTQNVALNIKLSAI 96 Query: 176 ASTKATEASKSAAAAESSKSAAATSAGAAKTSETNASASLQSAATSASTATTKASEAATS 235 + E + ++ + + A + + + + ++ Sbjct: 97 KTKYLRELNVLEEKSKDELPSEIKAKLDAAFEKFKKDTLKPGEKVAEAKKKVEEAKKKAE 156 Query: 236 ARDAAASKEAAKSSETNASSSASSAASSATAAGNSAKAAKTSETNARSSETAAGQSASAA 295 + + ++ + A + E+ + A + + Sbjct: 157 DQKEEDRRNYPTNTYKTLELEIAEFDVKVKEAELELVKEEAKESRNEGTIKQAKEKVESK 216 Query: 296 AGSKTAAASSASAASTSAGQASASATAAGKSAESAASSASTATTKAGEATEQASAAARSA 355 T + + + +A A A K A A S +A A Sbjct: 217 KAEATRLENIKTDRKKAEEEAKRKADAKLKEANVATSDQGKPKGRAKRGVPGELATPDKK 276 Query: 356 SAAKTSETNAKASETSAESSKTAAASSASSAASSASSASASKDEATRQASAAKSSATTAS 415 S ++ ET SS + A + + +KD+ ++ Sbjct: 277 ENDAKSSDSSVGEETLPSSSLKSGKKVAEAEKKVEEAEKKAKDQKEEDRRNYPTNTYKTL 336 Query: 416 TKATEAAGSATAAAQSKSTAESAATRAETAAKRAEDIASAVALEDASTTKKGIVQLSSAT 475 + A+ + E A + + +A+ + A Sbjct: 337 DLEIAESDVKVKEAELELVKEEAKEPRDEEKIKQAKAKVESKKAEATRLENIKTDRKKAE 396 Query: 476 NSTSETLAATPKAVKSAYDNAEKRLQKDQNGADIPDKGCFLNNINAVSKTDFADKRGMRY 535 A K + + + P A D + Sbjct: 397 EEAKRKAAEEDKVKEKPAEQPQPAPATQPEKP-APKPEKPAEQPKAEKTDDQQAEEDYAR 455 Query: 536 VRVNAPAGATSGKYYPVVVMRSAGSVSELASRVIITTATRTAGDPMNNCEFNGFVMPGGW 595 + S N + + M GW Sbjct: 456 R-------SEEEYNRLTQQQPPKTEKPAQPSTPKTGWKQENGMWYFYNTDGS---MATGW 505 Query: 596 TDRGRYAYGMFWQYQNNERAIHSIMMSNKGDDLRSVFYVDGAAFPVFAFIEDGLSISAPG 655 W Y N A+ + + N G +G+ + Sbjct: 506 LQN-----NGSWYYLNANGAMATGWLQNNGS--WYYLNANGSMATGWLQNNGSWYYLNAN 558 Query: 656 ADLVVNDTTYKFGATNPATECIAADVILDFKSGRGFYESHSLIVNDNLSCKKLFATDEIV 715 + Y +G +Y + + AT + Sbjct: 559 GAMATGWLQY---------------------NGSWYYLNSN----------GAMATGWLQ 587 Query: 716 ARGGNQIRMIGGEYGALWRNDGAKTYLLLTNQGDVYGGWNTLRPFAIDNATGELVIGTKL 775 G G+ W + Y L N G + NA G++ G Sbjct: 588 YNGSWYYLNANGDMATGWLQNNGSWYYLNANGDMATGWLQYNGSWYYLNANGDMATGWVK 
647 Query: 776 SASLNGNALTATKLQTPR 793 + ++ + Sbjct: 648 DGDTWYYLEASGAMKASQ 665 >UniRef50_A8XJP6 Putative uncharacterized protein n=2 Tax=Caenorhabditis briggsae RepID=A8XJP6_CAEBR Length = 2416 Score = 50.5 bits (118), Expect = 4e-04, Method: Composition-based stats. Identities = 54/489 (11%), Positives = 107/489 (21%), Gaps = 14/489 (2%) Query: 94 TEDDARPEALRRFELMVEEVARNASAVAQNTAAAKKSASDASTSAREA---ATHAADAAD 150 EDD E ++ E + + A A+ K + + +A++ A+ + Sbjct: 955 DEDDTPAEPVKEPEPVKKTPVLAKKAPAKKPDTEKPAEPVSGPTAKDPRLSTKAPAEKPN 1014 Query: 151 SARAASTSAGQAASSAQSASSSAGTASTKATEASKSAAAAESSKSAAATSAGAAKTSETN 210 A A +A + A+ S + + A A A Sbjct: 1015 PATAPPKDTPKAVDPPKPAAPKKWRPSWEDDPDDEPEADFTVPAPAKKPDTEDAAGPLGG 1074 Query: 211 ASASLQSAATSASTATTKASEAATSARDAAAS-----------KEAAKSSETNASSSASS 259 + + A T K + + A + + A + + Sbjct: 1075 PKSKTPALGKKAPTDKPKPASKKPDTENPADALGGPTPKDPKLAKKAPAKKPEDKPKEKP 1134 Query: 260 AASSATAAGNSAKAAKTSETNARSSETAAGQSASAAAGSKTAAASSASAASTSAGQASAS 319 + A A K + + + + + G+ + Sbjct: 1135 KEAPKPAEPPKPAAPKKWKPPWELDSEPEDEPEADFTVPAPSKKPDTEDPADPLGRPKKT 1194 Query: 320 ATAAGKSAESAASSASTATTKAGEATEQASAAARSASAAKTSETNAKASETSAESSKTAA 379 K A + + K E E+ T Sbjct: 1195 DPKLAKKAPAKKPEDKPKEKPKEAPKPAEPPEPAAPKKWKPPWELDSEPEDEPEADFTMP 1254 Query: 380 ASSASSAASSASSASASKDEATRQASAAKSSATTASTKATEAAGSATAAAQSKSTAESAA 439 A + A K + + A A + K A + + Sbjct: 1255 APKKPDSEEPADPLGGPKPKDPKLAKKALAKKPEDKPKEKPKEVPKPAEPSKPAAPKKWK 1314 Query: 440 TRAETAAKRAEDIASAVALEDASTTKKGIVQLSSATNSTSETLAATPKAVKSAYDNAEKR 499 E + V + + S LA K A + Sbjct: 1315 PPWEEYPDDEPEADFTVPAPKKPEDSEDPAEPVSTPKPKDPKLAKKVPTKKPADKPKDAP 1374 Query: 500 LQKDQNGADIPDKGCFLNNINAVSKTDFADKRGMRYVRVNAPAGATSGKYYPVVVMRSAG 559 + + + P + V AP + V Sbjct: 1375 KETPKKLEEPPKPAAPKKWKPPWELDSEPEDEPEADFTVPAPKKPEDSEDPAEPVSAPKP 1434 Query: 560 SVSELASRV 568 +LA + Sbjct: 1435 KDPKLAKKA 1443 >UniRef50_UPI000023F3CB hypothetical protein FG08587.1 n=1 Tax=Gibberella zeae PH-1 RepID=UPI000023F3CB Length = 928 Score = 50.5 bits (118), Expect = 4e-04, Method: 
Composition-based stats. Identities = 41/299 (13%), Positives = 97/299 (32%), Gaps = 21/299 (7%) Query: 346 EQASAAARSASAAKTSETNAKASETSAESSKTAAASSASSAASSASSASASKDEATRQAS 405 ++ A S S+ ++ T+ +A++ A+++ + S+ +A+ + Sbjct: 377 ASSTEDATSTDITTASDVSSMEEATATSDVSASASTDATTSMDATSTDTAASTDTAASTE 436 Query: 406 AAKSSATTASTKATEAAGSATAAAQSKSTAESAATRAETAAKRAEDIASAVALEDASTTK 465 A S+ TTA+T + + + A + +T+ A A+ A +S ++ +T Sbjct: 437 AVASTYTTAATNSDASTSADVATSMDAATSTDATASADATASTDVTTSSDASVSTDATAS 496 Query: 466 KGIVQLSSATNSTSETLAATPKAVKSAYDNAEKRLQKDQNGADIPDKGCFLNNINAVSKT 525 + T+S AA+ + + +A+ + A D +++ + Sbjct: 497 TETTATGANTSSVVTDPAASGVSATATESSADTATDSTTDSASKSDVLSATSDLTV-TSD 555 Query: 526 DFADKRGMRYVRVNAPAGATSGKYYPVVVMRSAGSVSELASRVIITTATRTAGDPMNNCE 585 + ++AP + + S + + T Sbjct: 556 TSSVTGAASSSTIDAPVESDTATGTAATTNVSGATEPTTIATAYSTYR------------ 603 Query: 586 FNGFVMPGGWTDRGRYAYGMFWQYQNNERAIHSIMMSNKGDDLRSVFYVDGAAFPVFAF 644 GW A Y N HS ++ +G + + + A + Sbjct: 604 --------GWNTTSTGAAAYTQAYGNPTTLAHSGSLTTEGSPAYTNYPGNTGAVSMTTS 654 >UniRef50_C4UDV3 Putative uncharacterized protein n=1 Tax=Yersinia aldovae ATCC 35236 RepID=C4UDV3_YERAL Length = 2487 Score = 50.1 bits (117), Expect = 4e-04, Method: Composition-based stats. 
Identities = 108/881 (12%), Positives = 232/881 (26%), Gaps = 37/881 (4%) Query: 10 KDGTGKPVQNCTIQLKAKRNSTTVVVNTLASENPDEAGRYSMDVEYGQYSVILLVEGFPP 69 D G PV + + NS + T + A G ++ + V P Sbjct: 655 TDAFGHPVPGVEVTWVSDLNSPGLEHVTSITNEHGIAENNFSSTVTGTANITVQVGTSTP 714 Query: 70 SHAGTITVYEDSQPGTLNDFLGAMTEDDARPEALRRFELMVEEVARNASAVAQNTAAAKK 129 AG I + D+ T+ +T + ++ + +AV Sbjct: 715 VAAGAIEIKTDNSTMTVKASDFTVTTTSVVANGTDKTVYKLKVTDKQGNAVPAAAVEWSN 774 Query: 130 SA-----SDASTSAREAATHAADAADSARAASTSAGQAASSAQSASSSAGTASTKATEAS 184 + + +T+ T A+ A A +A ++ + + A A Sbjct: 775 NIGIFIQASPTTTDANGETFIELASTKAGTAKVTATIGGKPYHASDVTFIASRQSAQIAL 834 Query: 185 KSAAAAESSKSAAATSAGAAKTSETNASASLQSAATSASTATTKASEAATSARDAAASKE 244 A+ +++ + AK + + + + + T E Sbjct: 835 LPASKIKAAANDKDWITLTAKIVDAHNNPIKSEKIVWDAVSHQMTFSPTTKITHTNDQGE 894 Query: 245 AAKSSETNASSSASSAASSATAAGNSAKAAKTSETNARSSETAAGQSASAAAGSKTAAAS 304 + + AS +A + +T +A ++ + + + A Sbjct: 895 SEIQVASAQVGDASVSAQVVANDLLINQDNQTLTFSADAATAGISKWLAPGDQALIANGV 954 Query: 305 SASAASTSAGQASASATAAGKSAESAASSASTATTKAGEATEQASAAARSASAAKTSETN 364 + + + + + T++ T S + A T Sbjct: 955 ATVSYTVLVKDSKDHIVPNSQVLWETDLGKFTSSQTMKTVTTTDSQGEATVELASTQAGF 1014 Query: 365 AKASETSAESSKTAAASSASSAASSASSASASKDEATRQASAAKSSATTASTKATEAAGS 424 AK S T+ A +A SS ++ + + + + T + Sbjct: 1015 AKVKAAVNGGSITSTAKVEFTADSSTATIAITPVNKQVYVANGGDTVTYNVIVLDKHQNP 1074 Query: 425 ATAAAQSKSTAESAATRAETAAKRAEDIASAVALEDASTTKKGIVQLSSATNSTSETLAA 484 A + ST + A+ + + A ++ VQ + A N+T Sbjct: 1075 VRDADITWSTVNHHVVKYSPASSKTDSDGKATVAVTSTAAGSTQVQATLANNATDIADQI 1134 Query: 485 TPKAVK-SAYDNAEKRLQKDQNGADIPDKGCFLNNINAVSKTDFA-------------DK 530 + A + +A +Q D ++ + ++ + Sbjct: 1135 SFNADRQTAVIKTVVVKGNNQPAPDGSGSVSYVTTVEDINGNPVSGMILSWGSNINKVTN 1194 Query: 531 RGMRYVRVNAPAGATSGKYYPVVVMRSAGSVSELASRVI--ITTATRTAGDPMNNCEFNG 588 + +G V + A + A + I A A P+N Sbjct: 1195 QTTTTDANGMSTQTITGTQAGKVEVSVALNEGNNAKNPVKSINEAEFVAVAPVNANSSLL 1254 Query: 589 FVMPGGWTDRGRYAYGMFWQYQNNERAIHSIMMSNKGDDLRSVFYVDGAAFPVFAFIEDG 648 + D + A F +N + + K + + + GA Sbjct: 1255 
LLPNLIIADGKQNATLKFTLRDDNHNPVSGLANQIKVTQVTANYVTVGAIAETTVK-GVY 1313 Query: 649 LSISAPGADLVVNDTTYKFGATNPATECIAADVILDFKSGRGFYESHSLIVNDNLSCKKL 708 + + V+ T G T+ + D K+ + ++ + Sbjct: 1314 EAEIKGTKEGTVDLTATVTGRNVSKTQKLTLQA--DNKTATLKTVTSNIKTAKADGMDGI 1371 Query: 709 FATDEIVARGGNQIRMIGGEYGALWRNDGAKTYLLLTNQGDVYGGWNTLRPFAIDNATGE 768 T ++ GN + G WR + + + A Sbjct: 1372 TYTATVIDAQGNAS-LANVSVG--WRTNLGELVAMSKTNASGMATVTLTSKQAGKATVTA 1428 Query: 769 LVIGTKLSASLNGNALTATKLQTPRRVSGVEFDGSKDITLTAAHVAAFARRATDTYADAD 828 +V + +L+ + S + D T T + Sbjct: 1429 IVSSSSQMDALSVTFTAGGIVIAQSSASVSAANLVADSTTT----------TLTVKVNDT 1478 Query: 829 GGVPWNAESGAYNVTRSGDSYILVNFYTGVGSCRTLQMKAH 869 G P +SG VT + + + G Sbjct: 1479 NGNPLTGQSGKIKVTAANFPGLTLPTQFTEGPNGVYTATIS 1519 >UniRef50_B6HHF2 Pc20g14690 protein n=1 Tax=Penicillium chrysogenum Wisconsin 54-1255 RepID=B6HHF2_PENCW Length = 824 Score = 50.1 bits (117), Expect = 5e-04, Method: Composition-based stats. Identities = 45/430 (10%), Positives = 107/430 (24%) Query: 81 SQPGTLNDFLGAMTEDDARPEALRRFELMVEEVARNASAVAQNTAAAKKSASDASTSARE 140 T+ R + + + + + + Sbjct: 174 EGKDTVLKAETPRPNLHGRTGQPLATTVPIAQDKAKPEPQHTALSVSPTTKESPKIQPAP 233 Query: 141 AATHAADAADSARAASTSAGQAASSAQSASSSAGTASTKATEASKSAAAAESSKSAAATS 200 A+ + A A + K + A + + + Sbjct: 234 PVAQPQAPLPQAQTPTPKPQDIAGQAPQPQPTTTLPPQKPSIQPPLAPSTQKAPPTTPAG 293 Query: 201 AGAAKTSETNASASLQSAATSASTATTKASEAATSARDAAASKEAAKSSETNASSSASSA 260 A + + + A A S R A + + S Sbjct: 294 ARVTPAHPSQIAPRANPVPPAPIVADKAGPLAPASQRTPAQPSFQQWQLDPSPHSPYPVV 353 Query: 261 ASSATAAGNSAKAAKTSETNARSSETAAGQSASAAAGSKTAAASSASAASTSAGQASASA 320 + AT + AK ++ G A S+ +A + Sbjct: 354 SPGATTPQAAQATAKRPVPPLSTNPPLPGSQAQPPQPPVPQTPSAIPSAVHTPVPIGLPR 413 Query: 321 TAAGKSAESAASSASTATTKAGEATEQASAAARSASAAKTSETNAKASETSAESSKTAAA 380 + SA + S S A+ Q S+ + + + + + Sbjct: 414 VSQTPSATFPSFSESRASHPRLSIDTQGSSTPWKRTPRPRVSRSPSSPPRPRPEDVSPIS 473 Query: 381 SSASSAASSASSASASKDEATRQASAAKSSATTASTKATEAAGSATAAAQSKSTAESAAT 440 + S + + + + ++A + + T+ A + AA+ Sbjct: 474 
ERSQSPVDAMEMSPPPRIDLGQRAESVEGKKPRRGQPDTKPAPVVKTEKLTSMRRTRAAS 533 Query: 441 RAETAAKRAEDIASAVALEDASTTKKGIVQLSSATNSTSETLAATPKAVKSAYDNAEKRL 500 A + ++ + ++ + T + + + TPK A + + Sbjct: 534 SASSRSRGRSMASCDGSVAPSEDTVRESRATGRRKGAAATADEGTPKPRTKRKRGASEAV 593 Query: 501 QKDQNGADIP 510 + + + ADIP Sbjct: 594 ELEPHPADIP 603 >UniRef50_C9SNW3 Putative uncharacterized protein n=1 Tax=Verticillium albo-atrum VaMs.102 RepID=C9SNW3_VERA1 Length = 1132 Score = 50.1 bits (117), Expect = 5e-04, Method: Composition-based stats. Identities = 59/519 (11%), Positives = 143/519 (27%), Gaps = 5/519 (0%) Query: 28 RNSTTVVVNTLASENPDEAGRYSMDVEYGQYSVILLVEGFPPSHAGTITVYEDSQPGTLN 87 S TV+ + A + + V P + T+ +P Sbjct: 304 ITSPTVMHTSSPQAPSSAADHFPAETPAPSRWVGEEAVAETPIELPSGTLRRRVEPMAHY 363 Query: 88 DFLGAMTEDDARPEALRRFELMVEEVARNASAVAQNTAAAKKSASDASTSAREAATHAAD 147 + + E +AL E+ A+ ++ AA K + A + + Sbjct: 364 ETMKKSQERRLSGDAL-PPVDSQEDSDDALEAMQRSRRAAMKKRAAAEQLSGISLRRTIP 422 Query: 148 AADSARAASTSAGQAASSAQSASSSAGTASTKATEASKSAAAAESSKSAAATSAGAAKTS 207 D Q + + + + A+S K + Sbjct: 423 NDDPDITPRPQRSQRRAQVAIVPDDECEVAVGSENEEDISTVADSQDPLNVPDTVK-KPT 481 Query: 208 ETNASASLQSAATSASTATTKASEAATSARDAAASKEAAKSSETNASSSASSAASSATAA 267 TN A +A ++ +A +T + ++ SS + Sbjct: 482 TTNELPMENHAPLTALDSSVEAIPDSTEETTHNKAGNPLSVDTSDTSSKQRDRIPETSPV 541 Query: 268 GNSAKAAKTSETNARSSETAAGQSASAAAGSKTAAASSASAASTSAGQASASATAAGKSA 327 + + + S A + + A+ + S ++ KS Sbjct: 542 NRPTTQVNVASPERDLALEEVCEDPSPAPMDEDS--PLATPNEPRPSLSMPSRRSSRKSR 599 Query: 328 ESAASSASTATTKAGEATEQASAAARSASAAKTSETNAKASETSAESSKTAAASSASSAA 387 +A ++A+ + K + +A +AAK+ T +++ + S S Sbjct: 600 PTAKAAATDSKAKPSPSEPRAGR-EVKTAAAKSGMLRKSVRATKSKADEPNIQSDPSVVL 658 Query: 388 SSASSASASKDEATRQASAAKSSATTASTKATEAAGSATAAAQSKSTAESAATRAETAAK 447 S+ +++ +S + + + + + A ++ + + Sbjct: 659 DSSPQVRSTEPAFDLPSSPPLLTPRLPRGPSKLRSHNTVDQNPATPVAPESSVESAPEST 718 Query: 448 RAEDIASAVALEDASTTKKGIVQLSSATNSTSETLAATPKAVKSAYDNAEKRLQKDQNGA 507 S + TT A S + + P+ + A ++ + + Sbjct: 719 
STLSTLSVTPSLSSKTTPATQQSDQVAHTSPMKPRSRHPRPSLPPLNTAPVLSERGKKRS 778 Query: 508 DIPDKGCFLNNINAVSKTDFADKRGMRYVRVNAPAGATS 546 + ++ + + + + P A S Sbjct: 779 KKQTRRSTRHDSASTDELAMSPSAIGFENSMAHPPKAKS 817 >UniRef50_C3Z0S4 Putative uncharacterized protein n=1 Tax=Branchiostoma floridae RepID=C3Z0S4_BRAFL Length = 1184 Score = 50.1 bits (117), Expect = 5e-04, Method: Composition-based stats. Identities = 60/469 (12%), Positives = 127/469 (27%), Gaps = 9/469 (1%) Query: 81 SQPGTLNDFLGAMTEDDARPEAL-RRFELMVEEVARNASAVAQNTAAAKKSASDASTSAR 139 + ED+ PE + M A + + K S Sbjct: 98 EDDLEPEETSTVNGEDEVPPEEVPADAVSMETGDAPETVTMPTDGEEGKGEV-----SEE 152 Query: 140 EAATHAADAADSARAASTSAGQAASSAQSASSSAGTASTKATEASKSAAAAESSKSAAAT 199 A +++ AD A ++ + A++ + TA AA Sbjct: 153 SAEVPSSETADVAAESTAAPPDTAAAGEGQVPEGATAQDSQEATETEMDAASLVDKVLNE 212 Query: 200 SAGAAKTSETNASASLQSAATSASTATTKASEAATSARDAAASKEAAKSSETNASSSASS 259 + A +A + A ++ + +K+A K + + Sbjct: 213 TEETADDVALSAEQEEELLTEDEVGEGEAEPPAEEKDKEGSNNKKAPKEEPAKPAGAKVI 272 Query: 260 AASSATAAGNSAKAAKTSETNARSSETAAGQSASAAAGSKTAAASSASAASTSAGQASAS 319 A A +S+ + A + + A +S Sbjct: 273 APGDAPVPFPLTTKDLLEMHAPTASKDEEENISLVVHADDQMIADLDADLLDTPKDAESS 332 Query: 320 ATAAGKSAESAASSASTATTKAGEATEQASAAARSASAAKTSETNAKASETSAESSKTAA 379 A + ES T A A+ + ++ + +AE+++ ++ Sbjct: 333 VEGAPTAEESGKEQEKVEETTKEAADLPEGEGAKKEAE---EGKGSEEGKKAAEAARHSS 389 Query: 380 ASSASSAASSASSASASKDEATRQASAAKSSATTASTKATEAAGSATAAAQSKSTAESAA 439 +SS+S + +S ++ + A + A + A ++ A Sbjct: 390 SSSSSKNLWVSGLSSTTRATDLKAAFSKYGKVVGAKVVTNARSPGARCYGFVTMSSSEEA 449 Query: 440 TRAETAAKRAEDIASAVALEDASTTKKGIVQLSSATNSTSETLAATPKAVKSAYDNAEKR 499 R T R E +++E A + + + K + D + Sbjct: 450 ARCITHLHRTELHGRMISVERAKNDPSSSSKKPPEKKEPEKRDLRSSSTRKPSADTRKPA 509 Query: 500 LQKDQNGADIPDKGCFLNNINAVSKTDFADKRGMRYVRVNAPAGATSGK 548 +K+ +K K D + + R A + K Sbjct: 510 PKKEDKDKKEAEKKPEDKKDEKEEKKDEEKRSASQTRRSTGDKAAPAAK 558 >UniRef50_A7E639 Putative uncharacterized protein n=1 Tax=Sclerotinia sclerotiorum 1980 UF-70 RepID=A7E639_SCLS1 Length = 839 Score = 49.8 
bits (116), Expect = 6e-04, Method: Composition-based stats. Identities = 59/576 (10%), Positives = 135/576 (23%), Gaps = 11/576 (1%) Query: 8 VLKDGTGKPVQNCTIQLKAKRNSTTVVVNTLASENPDEAGRYSMDVEYGQYSVILLVEGF 67 VLK P++ K+ S ++ + S DV Sbjct: 120 VLKKTNKTPIKKKPATSKSDAPSVPSEKPSVKDTPTPPTPKESNDVGKSTKQASTTKAKK 179 Query: 68 PPSHAGTITVYEDSQPGTLNDFLGAMTEDDARPEALRRFELMVEEVARNASAVAQNTAA- 126 + + P + +++P+ + + + + Q+ Sbjct: 180 FAPKLPATPPADKNVPQPEVTSPSPAPKSESKPKPKKFKKPVKKPEPVPEPEPEQDGEHD 239 Query: 127 -------AKKSASDASTSAREAATHAADAADSARAASTSAGQAASSAQSASSSAGTASTK 179 + + +A + A + A Sbjct: 240 EPEESENEEPEPEQKPKKSSKANKPEPAPEHEQEEPEEPEQEEAENEPEPKKEAPRPKKF 299 Query: 180 ATEASKSAAAAESSKSAAATSAGAAKTSETNASASLQSAATSASTATTKASEAATSARDA 239 + A + + + S+ + A + E + Sbjct: 300 SKAKKPEPKPAPEEQEHDSEPEPELEQSKPEKVNQKEEHDEGIEMADNEEDEEENEGEND 359 Query: 240 AASKEAAK--SSETNASSSASSAASSATAAGNSAKAAKTSETNARSSETAAGQSASAAAG 297 + E +S + + E + A Sbjct: 360 EGEVDDDSDVQDEEEQDNSEGEDGEGEEEEDEENEEGEEEEDDENELAEAIPDVQPDVIT 419 Query: 298 SKTAAASSASAASTSAGQASASATAAGKSAESAASSASTATTKAGEATEQASAAARSASA 357 S + A+ S + + S + ++++ A Sbjct: 420 SAAPIPEEFKESIEPLKSATKSEPVKQATKPVEDAKDSVPEPAKEATKQPKKFLSKASKA 479 Query: 358 AKTSETNAKASETSAESSKTAAASSASSAASSASSASASKDEATRQASAAKSSATTASTK 417 A+ ++ A A A A +AS + ++A A +A + K Sbjct: 480 AQGAQGQANGVLGDATKKAQDVTKDAPGADQVKDTASGTTEQAKDVGDKASKTAKGGADK 539 Query: 418 ATEAAGSATAAAQSKSTAESAATRAETAAKRAEDIASAVALEDASTTKKGIVQLSSATNS 477 + A A A+Q+ + A A+ +A+ ++ T + Q S Sbjct: 540 IKDTAEQAPDASQATHAVQGATDTAKDVGSKAKGDVEKISGNVKDTAQDKADQASGLGKD 599 Query: 478 TSETLAATPKAVKSAYDNAEKRLQKDQNGADIPDKGCFLNNINAVSKTDFADKRGMRYVR 537 S+ + K D + A G + + V + + V+ Sbjct: 600 ASDKVGEVAGDAKDKADGVTDGAKDKLGDAQSQVNGVAGDAKSKVEGATGDAQEKLGSVK 659 Query: 538 VNAPAGATSGKYYPVVVMRSAGSVSELASRVIITTA 573 G +S G ++ + Sbjct: 660 DTT-DDVKGGLPSTDDAQKSLGDGTDKTKDALGGLP 694 >UniRef50_B2INK9 Choline binding protein A n=25 Tax=Streptococcus pneumoniae RepID=B2INK9_STRPS Length = 720 Score = 49.8 bits (116), Expect = 6e-04, Method: 
Composition-based stats. Identities = 75/569 (13%), Positives = 144/569 (25%), Gaps = 44/569 (7%) Query: 286 TAAGQSASAAAGSKTAAASSASAASTSAGQASASATAAGKSAESAASSASTATTKAGEAT 345 A + S + QA A + A + EA Sbjct: 184 AEFDVKVKEAELELVKKEADESRNEGTINQAKAKVESEKAEATRLKKIKTDREKAEEEAK 243 Query: 346 EQASAAARSASAAKTSETNAKASETSAESSKTAAASSASSAASSASSASASKDEATRQAS 405 +A A + S + S A K + +S ++ + + + ++ + Sbjct: 244 RRADAKEQDESKRRKSRVKRGDLGEQATPDKKENDAKSSDSSVGEETLPSPSLKPGKKVA 303 Query: 406 AAKSSATTASTKATEAAGSATA--AAQSKSTAESAATRAETAAKRAEDIASAVALEDAST 463 A+ A KA + + T E ++ K AE +++ Sbjct: 304 EAQKKVEEAKKKAKDQKEEDRRNYPTNTYKTLELEIAESDVKVKEAELELVKEEAKESQN 363 Query: 464 TKKGIVQLSSATNSTSETLAATPKAV--KSAYDNAEKRLQKDQNGADIPDKGCFLNNINA 521 +K + + +E K A + A+++ ++ + P + Sbjct: 364 EEKIKQAKAKVESKKAEATRLENIKTDRKKAEEEAKRKAAEEDKVKEKPAEQPQPAPAPQ 423 Query: 522 VSKTDFADKRGMRYVRVNAPAGATSGKYYP----VVVMRSAGSVSELASRVIITTATRTA 577 K + + PA + + Y R + + +T Sbjct: 424 PEKPAPKPENPAEQPKAEKPADQQAEEDYARRSEEEYNRLTQQQPPKTEKPAQPSTPKTG 483 Query: 578 GDPMNNCEFNGFVMPGGWTDRGRYAYGMFWQYQNNERAIHSIMMSNKGDDLRSVFYVDGA 637 N + F G G W Y N+ A+ + + N G +G+ Sbjct: 484 WKQENGMWY--FYNTDGSMATGWLQNNGSWYYLNSNGAMATGWLQNNGS--WYYLNANGS 539 Query: 638 AFPVFAFIEDGLSISAPGADLVVNDTTYKFGATNPATECIAADVILDFKSGRGFYESHSL 697 + D+ Y +G +Y + Sbjct: 540 MATGWFQYNGSWYYLNANGDMATGWLQY---------------------NGSWYYLN--- 575 Query: 698 IVNDNLSCKKLFATDEIVARGGNQIRMIGGEYGALWRNDGAKTYLLLTNQGDVYGGWNTL 757 AT + G G+ W Y L N G Sbjct: 576 -------ANGDMATGWLQYNGSWYYLNANGDMATGWLQYNGSWYYLNANGDMATGWLQYN 628 Query: 758 RPFAIDNATGELVIGTKLSASLNGNALTATKLQTPRRVSGVEFDGSKDITLTAAHVAAFA 817 + NA G++ G L + + L A V + + + F Sbjct: 629 GSWYYLNANGDMATGW-LQYNGSWYYLNANGSMATDWVKDGDTWYYLEASGAMKASQWFK 687 Query: 818 RRATDTYADADGGVPWNAESGAYNVTRSG 846 Y + G + N Y V +G Sbjct: 688 VSDKWYYVNGLGALAVNTTVDGYRVNANG 716 >UniRef50_C6WQE4 YD repeat protein n=1 Tax=Actinosynnema mirum DSM 43827 RepID=C6WQE4_ACTMD Length = 2144 Score = 49.8 bits (116), Expect = 6e-04, Method: Composition-based stats. 
Identities = 71/461 (15%), Positives = 124/461 (26%), Gaps = 17/461 (3%) Query: 174 GTASTKATEASKSAAAAESSKSAAATSAGAAKTSETNASASLQSAATSASTATTKASEAA 233 A+ A + SA A E + A+ A A + + + A + A+ Sbjct: 80 DPATVTAQAETWSAVATELATIASDLVAAVAADTTAWTGEAADNYRVRAEQTAALLTAAS 139 Query: 234 TSARDAAASKEAAKSSETNASSSASS--AASSATAAGNSAKAAKTSETNARSSETAAGQS 291 A A+ A A + + + T A + Q+ Sbjct: 140 QGAHGASGGLRKAGELVGAVRGLVRDVIADAVDNLVKVALQIVGTGGVAAPWTIPQISQT 199 Query: 292 ASAAAGSKTAAASSASAASTSAGQASASATAAGKSAESAASSASTATTKAGEATEQASAA 351 +A A TA S A A +A + TAT T A+A Sbjct: 200 VAATAARITALTSKLLDALRRLTPLLTQTGDLFGEATTALRTLQTATAPIPSTTLAAAAP 259 Query: 352 ARSASAAKTSETNAKASETSAESSKTAAASSASSAASSASSASASKDEATRQASAAKSSA 411 S + S + +++ TA S S + + S + + A Sbjct: 260 LPPQSPEPSQAAGPSQSSGAPQTAVTADQQSGMSQPGTPGTNSDTHGGSNTDNPGAALQP 319 Query: 412 TTASTKATEAAGSATAAAQSK---STAESAATRAETAAKRAEDIASAVALEDASTTKKGI 468 A ++ A+ S A S +T T A + A A + A G Sbjct: 320 GAADLDTPRTGTASLGTAEPDAFRSGAPSPSTADTTNPSGASTVPQAAAAQTAPVQASGA 379 Query: 469 VQLSSATN--STSETLAATPKAVKSAYDNAEKRLQKDQNGADIPDKGCFLNNINAVSKTD 526 +Q S A +++ P ++ A L + GA ++ + +D Sbjct: 380 MQSSVAQPWLQSADQSLEQPVPQQAVQAEATPDLTQS-TGATSAWPPSPAESVWTLPPSD 438 Query: 527 FADKRGMRYV------RVNAPAGATSGKYYPVVVMRSAGSVSELASRVIITTATRTAGDP 580 + +AP ++ + + S + + A Sbjct: 439 PPVTQVYAEAFAWPTPSADAPTATSAAETQWPPMSTQPPSPTSTGADAF---AWPPGQAD 495 Query: 581 MNNCEFNGFVMPGGWTDRGRYAYGMFWQYQNNERAIHSIMM 621 V GW RA I Sbjct: 496 GMTSAVGTPVAEAGWPRAEGTTTTSSADLATAPRADLPITT 536 >UniRef50_D0N8F2 Mucin-like protein n=1 Tax=Phytophthora infestans T30-4 RepID=D0N8F2_PHYIN Length = 1445 Score = 49.4 bits (115), Expect = 8e-04, Method: Composition-based stats. 
Identities = 67/511 (13%), Positives = 133/511 (26%), Gaps = 7/511 (1%) Query: 68 PPSHAGTITVYEDSQPGTLNDFLGAMTEDDARPEALRRFELMVEEVARNASAVAQNTAAA 127 PS +T P T T+ + PE L A Sbjct: 226 APSILSPVTNLLTLAPDTDAPSAPTATDAPSPPETDAPSILSP----VTNLLTPAPDTDA 281 Query: 128 KKSASDASTSAREAATHAADAADSARAASTSAGQAASSAQSASSSAGTASTKATEASKSA 187 S + +T A + + + A + S + + S Sbjct: 282 PSSPTTPATDAPSILSLVTNLLTPVPDTDAPSTPATDAPSILSPVTNLLTPAPDTDAPST 341 Query: 188 AAAESSKSAAATSAGAAKTSETNASASLQSAATSASTATTKASEAATSARDAAASKEAAK 247 A ++ + + +T+A ++ + A S + T A + A Sbjct: 342 PATDAPSILSPVTNLLTPAPDTDAPSTPVTDAPSILSPVTNLLTPAPDTDAPSTPATDAP 401 Query: 248 SSETNASSSASSAASSATAAGNSAKAAKTSETNAR-SSETAAGQSASAAAGSKTAAASSA 306 S + ++ + + + + A + + S A + S Sbjct: 402 SILSPVTNLLTPVPDTDAPSTPATDAPSILSPVTNLLTPAPDTDAPSTPATDAPSILSPV 461 Query: 307 SAASTSAGQASASATAAGKSAESAASSASTATTKAGEATEQASAAARSASAAKTSETNAK 366 + T A A +T A + + + T A+ A S T+ Sbjct: 462 TNLLTPAPDTDAPSTPATDAPSILSPVTNLLTPTPDTDAPSATDAPAPTSILSPITTSPL 521 Query: 367 ASETSAESSKTAAASSASSAASSASSASASKDEATRQASAAKSSATTASTKATEAAGSAT 426 T ++ S ++A S + A+ A AT T A + T Sbjct: 522 DPITDLLTTAPTTVPSGTTAP-SPVTLLPDLPTLVPSATTAPLPATLLPDLPTLAPSATT 580 Query: 427 AAAQSKSTAESAATRAETAAKRAEDIASAVALEDASTTKKGIVQLSSATNSTSETLAATP 486 + + S + A+ + T + + + T + T Sbjct: 581 TPSGTDSPTTLLPDLPTLTPSGTDLPATLLPDLPTLTPSGTDSPATLLPDLPTLTPSGTD 640 Query: 487 KAVKSAYD-NAEKRLQKDQNGADIPDKGCFLNNINAVSKTDFADKRGMRYVRVNAPAGAT 545 D D +PD + T D + ++PA Sbjct: 641 SPATLLPDLPTLTPSGTDSPATLLPDLPTLTPSGTDSPATLLPDLPTLTPSGTDSPATLL 700 Query: 546 SGKYYPVVVMRSAGSVSELASRVIITTATRT 576 V + S +++ +I +T Sbjct: 701 PDLPTLVPSATTTPSATDVPLTLIPSTDLPV 731 >UniRef50_Q8VQ55 Pneumococcal surface protein A (Fragment) n=21 Tax=Streptococcus pneumoniae RepID=Q8VQ55_STRPN Length = 608 Score = 49.4 bits (115), Expect = 8e-04, Method: Composition-based stats. 
Identities = 77/618 (12%), Positives = 151/618 (24%), Gaps = 49/618 (7%) Query: 180 ATEASKSAAAAESSKSAAATSAGAAKTSETNASASLQSAATSASTATTKASEAATSARDA 239 S + AE AA + AAK A + A K A R A Sbjct: 1 EESPVASRSKAEKDYDAAVKKSEAAKKHYEEAKKKAEDAQKKYDEDQKKTEAKAEKERKA 60 Query: 240 AASKEAAKSSETNASSSASSAASSATAAGNSAKAAKTSETNARSSETAAGQSASAAAGSK 299 + A A + A++ + K A A + A + + + Sbjct: 61 SEKIAEATKEVQQAYLAYLQASNE-----SQRKEADKKIKEATQRKDEAEAAFATIRTTI 115 Query: 300 TAAASSASAASTSAGQASASATAAGKSAESAASSASTATTKAGEATEQASAAARSASAAK 359 S A + +A A A + + A + A + + S + Sbjct: 116 VVPEPSELAETKKKAEAKAEEKVAKRKYDYATLKLALAKKEVEAKELEIEKLQYEISTLE 175 Query: 360 TSETNAKASETSAESSKTAAASSASSAASSASSASASKDEATRQASAAKSSATTASTKAT 419 A+ + + A + A + +QA AK K Sbjct: 176 QEVATAQHQVDNLKKLLAGADPDDGTEVIEAKLKKGEAELNAKQAELAKKQTELE--KLL 233 Query: 420 EAAGSATAAAQSKSTAESAATRAETAAKRAEDIASAVALEDASTTKKGIVQLSSATNSTS 479 ++ A + A + +A G T + Sbjct: 234 DSLDPEGKTQDELDKEAEEAELDKKADELQNKVADLEKEISNLEILLGGADSEDDTAALQ 293 Query: 480 ETLAATPKAVKSAYDNAEKRLQKDQNGADIPDKGCFLNNINAVSKTDFADKRGMRYVRVN 539 LA ++ + L + D + + ++ + Sbjct: 294 NKLATKKAELEKTQKELDAALNELGPDGDEEETPAPAPQPEQPAPAPKPEQPAP-APKPE 352 Query: 540 APAGATSGKYYPVVVMRSAGSVSELASRVIITTATRTAGDPMNNCEFNGFVMPGGWTDRG 599 PA A ++ + +S + ++T GW Sbjct: 353 QPAPAPKPEHQLQLQNQSNQLSRRNQLKSLLT-------------------RKTGWKQEN 393 Query: 600 RYAYGMFWQYQNNERAIHSIMMSNKGDDLRSVFYVDGAAFPVFAFIEDGLSISAPGADLV 659 GM++ Y + ++ + + N G + + + + Sbjct: 394 ----GMWYFYNTD-GSMATGWLQNNGSWYYLNATYANGSMATGWVKDGDTWYYLEASGAM 448 Query: 660 VNDTTYK----FGATNPATECIAADVILDFKSGRGFYESHSLIVNDNLSCKKLFATDEIV 715 +K + N + A L + +G +Y + AT + Sbjct: 449 KASQWFKVSDKWYYVN--SNGAMATGWLQY-NGSWYYLN----------ANGDMATGWLQ 495 Query: 716 ARGGNQIRMIGGEYGALWRNDGAKTYLLLTNQGDVYGGWNTLRPFAIDNATGELVIGTKL 775 G G+ W Y L N G + NA G + G Sbjct: 496 YNGSWYYLNANGDMATGWAKVNGSWYYLNANGAMATGWAKVNGSWYYLNANGSMATGWVK 555 Query: 776 SASLNGNALTATKLQTPR 793 + ++ + Sbjct: 556 DGDTWYYLEASGAMKASQ 573 >UniRef50_A7BAA1 Putative uncharacterized protein n=1 Tax=Actinomyces 
odontolyticus ATCC 17982 RepID=A7BAA1_9ACTO Length = 967 Score = 49.4 bits (115), Expect = 8e-04, Method: Composition-based stats. Identities = 50/394 (12%), Positives = 109/394 (27%), Gaps = 5/394 (1%) Query: 119 AVAQNTAAAKKSASDASTSAREAATHAADAADSARAASTSAGQAASSAQSASSSAGTAST 178 + + A SA + AA AR A+ + + +A +A Sbjct: 450 SAQKPVAPEPSSAPAQAARQEAPRHEAAHETAPAREATPAQAEPRPAAPPQQRHESSAPQ 509 Query: 179 KATEASKSAAAAESSKSAAATSAGAAKTSETNASASLQSAATSASTATTKASEAATSARD 238 ++ +++ A A + ++ + + + + A A + + Sbjct: 510 RSETPARAEAPAWGRNADMLRGRWNEVVERLSSISRVTWSMVGGN-AQLGAVDGSQVVLL 568 Query: 239 AAASKEAAKSSETNASSSASSAASSATAAGNSAKAAKTSETNARSSETAAGQSASAAAGS 298 S ++ A + T S A + ++ + Q++ + Sbjct: 569 FPVEAMVNAFSRGPRAADVEKAINEVTGLTVSVSAQVGQASGGAATTGPSAQASHPGPAA 628 Query: 299 KTAAASSASAASTSAGQASASATAAGKSAESAASSASTATTKAGEATEQASAAARSASAA 358 + S + +A+A A G ESA + A Q+ + + Sbjct: 629 QHFQPGSWVSEPPPFDEAAAQAAPQGDL-ESADTGWPEPVLPPEPA--QSGWPTTARAPE 685 Query: 359 KTSETNAKASETSAESSKTAAASSASSAASSASSASASKDEATRQASAAKSSATTASTKA 418 E S ++ T A + A A + + + + +A + Sbjct: 686 PALEPEPVESAWPEPATVTPIRRDEPVAPAPAPITRAPDPQPVERPALPERAARALAQAP 745 Query: 419 TEAAGSATAAAQSKSTAESAATRAETAAKRAEDIASAVALEDASTTKKGIVQLSSATNST 478 + +A A++ + A + + A A + ++ Sbjct: 746 ADTPEAAQASSPNGDAAPRKRSFTVFRYPGDPEPADDPADAPVQP-EPAPASSPVFDDAP 804 Query: 479 SETLAATPKAVKSAYDNAEKRLQKDQNGADIPDK 512 E A TP D N D D Sbjct: 805 IEPAAHTPSTPTGWGDPVVISGGASVNFDDGADS 838 >UniRef50_Q4P4T5 Putative uncharacterized protein n=1 Tax=Ustilago maydis RepID=Q4P4T5_USTMA Length = 5431 Score = 49.0 bits (114), Expect = 0.001, Method: Composition-based stats. 
Identities = 51/453 (11%), Positives = 103/453 (22%), Gaps = 13/453 (2%) Query: 95 EDDARPEALRRFELMVEEVARNASAVAQNTAAAKKSASDASTSAREAATHAADAADSARA 154 E D + ++ + + + DA A ++ Sbjct: 4535 EGDEQEPEDAVGDVDPLDPNAVDEKLWDGNDEDQDDKKDAPEDKTNKDLGADNSDSKEAE 4594 Query: 155 ASTSAGQAASSAQSASSSAGTASTKATEASKSAAAAESSKSAAATSAGAAKTSETNASAS 214 + A A SA A E K+ ++AA G + SE Sbjct: 4595 SVPKADDAG-SANDDQKQEQPAEGSRDEHQKAEDDPSQEQTAADQQDGQDELSEPEDDDG 4653 Query: 215 LQSAATSASTATTKASEAATSARDAAASKEAAKSSETNASSSASSAASSATAAGNSAKAA 274 + ++ R A+ + + + Sbjct: 4654 EEGQGAEEKENEEAEAQGEGGGRQLDQ---EAEQGDNLNLDDDLNMDGGDGESDGDRTED 4710 Query: 275 KTSETNARSSETAAGQSASAAAGSKTAAASSASAASTSAGQASASATAAGKSAESAASSA 334 + E A+ + + + E A Sbjct: 4711 DDGLDDFSDIEKMPDDDHRDDKSDDKTMKDLVDDAAVEDKKQTEETAEGEAAEEEAEEDT 4770 Query: 335 STATTKAGEATEQASAAARSASAAKTSETNAKASETSAESSKT--AAASSASSAASSASS 392 + +A ++ S +K + + A + Sbjct: 4771 KVDKDQGQDAMDEDGEEEGSEPDSKHQRPDNDDPAGGQLDQVDPFDVDAVDQGADNGGED 4830 Query: 393 ASASKDEATRQASAAKSSATTASTKATEAAGSATAAAQSKSTAESAATRAETAAKRAEDI 452 SA++ +++R + A ++A + AA + + A RA Sbjct: 4831 DSANQQQSSRSQRGRRVDARNDQSQAEQGLDMDDAAHAAPQESAEKPPAAGQNGARAPKA 4890 Query: 453 ASAVALEDASTTKKGIVQLSS------ATNSTSETLAATPKAVKSAYDNAEKRLQKDQNG 506 DA K V + A L A + D A + +G Sbjct: 4891 EHEETSADAEQDGKAEVDPNPLRRMGDALEQFRRRLQQIQDAREDEEDQASGNKAE-HDG 4949 Query: 507 ADIPDKGCFLNNINAVSKTDFADKRGMRYVRVN 539 +P+ + N + A V Sbjct: 4950 EGLPEDADVEHIANDDAAELQALGAAQEQDEVR 4982 >UniRef50_B6K087 GYF domain-containing protein n=1 Tax=Schizosaccharomyces japonicus yFS275 RepID=B6K087_SCHJY Length = 943 Score = 49.0 bits (114), Expect = 0.001, Method: Composition-based stats. 
Identities = 54/379 (14%), Positives = 114/379 (30%), Gaps = 8/379 (2%) Query: 205 KTSETNASASLQSAATSASTATTKASEAATSARDAAASKEAAKSSETNASSSASSAASSA 264 ++S + A A T AAA+ + S +A S ++ Sbjct: 411 AEGTDLPASSFKLNAPPKVAAQASVPVPETGTSIAAAAAPSYASVSKHAVGVEPSHPANV 470 Query: 265 TAAGNSAKAAKTSETNARSSETAAGQSASAAAGSKTAAASSASAASTSAGQASASATAAG 324 ++ T A+ S A+ +++A + + + + A+ A Sbjct: 471 KMVEEEHLSSATEPVQAKQSSDASAPFSTSADKEEVVPKTENAKSGNQKETATTVENLAE 530 Query: 325 KSAESAASSASTATTKAGEATEQASAAARSASAAKTSETNAKASETSAESSKTAAASSAS 384 K A+ A S + + A + + + TS +S + A ++ Sbjct: 531 KLADVAIKSQNVTAASPVKQFVNNIQAKEKPATKLVTNAQPQEKPTSKTNSPSLARTTPW 590 Query: 385 SAASSASSASASKDEATRQASAAKSSATTASTKATEAAGSATAAAQSKSTAESAATRAET 444 + S K+ + AS++ + ++ + A A S +E + Sbjct: 591 KPLRAKPVPSLDKNISNALASSSPKAEPAKPAQSRLGSPWAKVAEPPISLSEEIQKMEQE 650 Query: 445 AAKRAEDIASAVALEDASTTKKGIVQLSSATNSTSETLAATPKAVKSAYDNAEKRLQKDQ 504 A+ + +A + + + +S S + T A P AVK+ + Sbjct: 651 DAEMKKQNQAAALASAVAAARTPTLPAASVWGSVNGTGAKKPPAVKT--------SPVKK 702 Query: 505 NGADIPDKGCFLNNINAVSKTDFADKRGMRYVRVNAPAGATSGKYYPVVVMRSAGSVSEL 564 N + + V+ + + + + V SA + Sbjct: 703 NVTVLQKEAHPKTTATNVATNNHSVPNAWAKMASKPVTSSVPVPKPATVATNSASTAVAE 762 Query: 565 ASRVIITTATRTAGDPMNN 583 S T G +NN Sbjct: 763 GSPDDSWTVVGPGGKLVNN 781 >UniRef50_B6H2C8 Pc13g04790 protein n=1 Tax=Penicillium chrysogenum Wisconsin 54-1255 RepID=B6H2C8_PENCW Length = 3674 Score = 48.6 bits (113), Expect = 0.001, Method: Composition-based stats. 
Identities = 69/514 (13%), Positives = 134/514 (26%), Gaps = 10/514 (1%) Query: 78 YEDSQPGTLNDFLGAMTEDDARPEALRRFELMVEEVARNASAVAQNTAAAKKSASDASTS 137 + TED +PEA + + + + A K S Sbjct: 2620 DTKEINVPEAEAAAPPTEDTTQPEAPVDVADIEATEDASMTPAQKRKAKKDKKKQRQSII 2679 Query: 138 AREAATHAADAADSA-RAASTSAGQAASSAQSASSSAGTASTKATEASKSAAAAESSKSA 196 A+E A +A AA TSA +A + ++A T A + A + Sbjct: 2680 AQEPEAEAQPEEKTAPEAAETSAQEAGITTEAAPGEIVTQDEAVPSAEEPTAQPIDAIQE 2739 Query: 197 AATSAGAAKTSETNASASLQSAATSASTATTKASEAATSARDAAASKEAAKSSETNASSS 256 AA S + S AA + + S Sbjct: 2740 AAKDPVTPSEDTEQHVLSQEIENGERSLELPSTESKVEEGDKPTEQAAAAPAENPDDEQS 2799 Query: 257 ASSAASSATAAGNSAKAAKTSETNARSSETAAGQSASAAAGSKTAAASSASAASTSAGQA 316 + A A + + + + AA +A+AA + A S + Sbjct: 2800 SKKGLPEAGPEPEGESATVPKKMSKKEKKKAAAAAAAAALVKEEKLAESQPEPKEALVST 2859 Query: 317 SASATAAGKSAESAASSASTATTKAGEATE---------QASAAARSASAAKTSETNAKA 367 + + +S K E AS ++ Sbjct: 2860 PQAPEEVVDQSMVQEPQSSETEDKPPPEAEKNPDEITGLPASRLKEDVLEQPAEALDSTE 2919 Query: 368 SETSAESSKTAAASSASSAASSASSASASKDEATRQASAAKSSATTASTKATEAAGSATA 427 S + + A A++ + + ++A T + + Sbjct: 2920 PLESEVKLEESTKELAEEPAAAETITRKMSKKDKKRAKKQAQEEIIEPTSPPTEDHAQSE 2979 Query: 428 AAQSKSTAESAATRAETAAKRAEDIASAVALEDASTTKKGIVQLSSATNSTSETLAATPK 487 ++ + + T + + LE + + +S + AA + Sbjct: 2980 KNEAVISLGTLDTPSPVEPTDQNTNNEPLDLERPAEKNDKKDEALVVADSQNLENAAVDQ 3039 Query: 488 AVKSAYDNAEKRLQKDQNGADIPDKGCFLNNINAVSKTDFADKRGMRYVRVNAPAGATSG 547 V+ A + ++ + + A + AD + PA + Sbjct: 3040 PVQPEVMPAMSKKMSKKDKRKVKKNAGITEEVLAQQQEPQADVAELPTESEVQPAEPAAE 3099 Query: 548 KYYPVVVMRSAGSVSELASRVIITTATRTAGDPM 581 K SA + L + I T +P+ Sbjct: 3100 KEIHQEPEHSAEAQPSLERELPIETPASIENEPI 3133 >UniRef50_A4R522 Putative uncharacterized protein n=2 Tax=Eukaryota RepID=A4R522_MAGGR Length = 3251 Score = 48.2 bits (112), Expect = 0.002, Method: Composition-based stats. 
Identities = 53/466 (11%), Positives = 122/466 (26%), Gaps = 3/466 (0%) Query: 81 SQPGTLNDFLGAMTEDDARPEALRRFELMVEEVARNASAVAQNTAAAKKSASDASTSARE 140 P D + D++P + V++ + ++ ++++ A + Sbjct: 2077 EAPVPAEDLPSTVDATDSQPNETAADQAPVDDEVPTTEETEEPSSEDVETSATALAAETV 2136 Query: 141 AATHAADAADSARAASTSAGQAASSAQSASSSAGTASTKATEASKSAAAAESSKSAAATS 200 + + D+ A+ SA S A A + A + Sbjct: 2137 NNSETSPPNDAEGASDVQPIAPEESAVDVSKDELAAEPIDLGAIAVSDAPVTETVDVDKD 2196 Query: 201 AGAAKTSETNASASLQSAATSASTATTKASEAATSARDAAASKEAAKSSETNASSSASSA 260 + A+A+ + A A AA ++ + Sbjct: 2197 VAEGEAVIGPATAADDATAEPVVEQPAVEEPPAVEGTVAADKSGHQNATAAPDEDIHDAV 2256 Query: 261 ASSATAAG--NSAKAAKTSETNARSSETAAGQSASAAAGSKTAAASSASAASTSAGQASA 318 +AT G + A + E SS + S A A + Sbjct: 2257 EKAATEPGVLDVPAATQAPEDQEESSAAPEPATTSEEVEVAAGQTVLAPEAPEPVVKDDI 2316 Query: 319 SATAAGKSAESAASSASTATTKAGEATEQASAAARSASAAKTSETNAKASETSAESSKTA 378 + AE+ S + + A ++ + A ++ Sbjct: 2317 TPIDGDIPAEADVESGAKVDIEPEIPASSAEVEESKEPGILAADAATEVEAAPAPQAEAM 2376 Query: 379 AASSASSAASSASSASASKDEATRQASAAKSSATTASTKATEAAGSATAAAQSKSTAESA 438 AS + A + + + + A AT + +E + +TAE Sbjct: 2377 VASDEAVAKEADDKSLDAVEAAQPPTEVETLEATGDADAVSEEKAVDDIPEEPSNTAEPV 2436 Query: 439 ATRAETAAKRAEDIASAVALEDASTTKKGIVQLSSATNSTSETLAA-TPKAVKSAYDNAE 497 A A + A + +E+ + + S+ + + E Sbjct: 2437 AEAATDIVDVNDSDAVSKEIEEEPAAEAAEAAARESLAEDSQVVEEPAAEGAVETAAVEE 2496 Query: 498 KRLQKDQNGADIPDKGCFLNNINAVSKTDFADKRGMRYVRVNAPAG 543 + ++ + + + + + + V A + Sbjct: 2497 PKAFEESPIDETVVEPATATESDVPHEDNSDEDAPEAAVEAAAASD 2542 >UniRef50_Q0CYT2 Putative uncharacterized protein n=1 Tax=Aspergillus terreus NIH2624 RepID=Q0CYT2_ASPTN Length = 4731 Score = 47.8 bits (111), Expect = 0.002, Method: Composition-based stats. 
Identities = 41/402 (10%), Positives = 88/402 (21%), Gaps = 8/402 (1%) Query: 100 P----EALRRFELMVEEVARNASAVAQNTAAAKKSASDASTSAREAATHAADAADSARAA 155 P EA+ R ++ V + + E+ D D Sbjct: 4205 PEDEGEAIGREDMDVTDPHAKEEEALDLPDEMQLDGDGEDNDEGESDDGLDDGLDDDLPD 4264 Query: 156 STSAGQAASSAQSASSSAGTASTKATEASKSAAAAESSKSAAATSAGAAKTSETNASASL 215 A + + + A + E+++ A + + Sbjct: 4265 DGPAPEDGQPLEDGAEQDDGADQPTEDQQMEDGLEEATEEQEMQEAEGEEEANAPGEDEP 4324 Query: 216 QSAATSASTATTKASEAATSARDAAASKE---AAKSSETNASSSASSAASSATAAGNSAK 272 + A +E + + A + S+ +A + SA Sbjct: 4325 EEPEQDNILAQQDETEHTGDEVAPSEAVSGGLGADQDQNKDKGSSGNAQQEDGSTDPSAD 4384 Query: 273 AAKTSETNARSSETAAGQSASAAAGSKTAAASSASAASTSAGQASASATAAGKSAESAAS 332 + + E + A SK A + ++ Sbjct: 4385 KNQQTGAAQEGEENERHRDTGGGADSKPE-DPQLQAFKKMGDILEQWHRRQKEILNASKQ 4443 Query: 333 SASTATTKAGEATEQASAAARSASAAKTSETNAKASETSAESSKTAAASSASSAASSASS 392 E T+ A A + + S E ++ + + + Sbjct: 4444 DEEEPEKPLPEDTDMADADFEHLADQDDVADTQALGQASEEQAQALDQNKGVESDVKPTE 4503 Query: 393 ASASKDEATRQASAAKSSATTASTKATEAAGSATAAAQSKSTAESAATRAETAAKRAEDI 452 D + Q + S + S A T A+ ++ Sbjct: 4504 KDTLPDVSEDQQDLPDNQMEDEMQVDRHGVPSDEQKPGALIPGGSRAHERTTDAQAQHEV 4563 Query: 453 ASAVALEDASTTKKGIVQLSSATNSTSETLAATPKAVKSAYD 494 + DA + + E + D Sbjct: 4564 NEELDEVDAQLAEIHLSSTLPPLTPRDEAQRLWSHYESTTND 4605 >UniRef50_UPI00017F7AFB YALI0E22572p n=1 Tax=Yarrowia lipolytica RepID=UPI00017F7AFB Length = 1601 Score = 47.8 bits (111), Expect = 0.002, Method: Composition-based stats. 
Identities = 69/545 (12%), Positives = 157/545 (28%), Gaps = 2/545 (0%) Query: 40 SENPDEAGRYSMDVEYGQYSVILLVEGFPPSHAGTITVYEDSQPGTLNDFLGAMTEDDAR 99 G Y+ + + G L G + T + + Sbjct: 303 WTRRGNGGGYATNQQPGSNIPSLNGGGSGSGSERSTTSSKPTSTSVKTTSTTPTVTATPT 362 Query: 100 PEALRRFELMVEEVARNASAVAQNTAAAKKSASDASTSAREAATHAADAADSARAASTSA 159 SA S++ ++S ++ + S + +S Sbjct: 363 TSTSSTSATPTTSTTPTTSATPPAATTPNTSSTPTTSSDLITSSVFESSTGSIPQSISSV 422 Query: 160 GQAASSAQSASSSAGTASTKATEASKSAAAAESSKSAAATSAGAAKTSETNASASLQSAA 219 + + + S T ++ S + SS+S+ S T+ S Sbjct: 423 ESSVEPSSTGPISESTPVGPSSSVESSPSVEPSSESSVGPSTNEPIPETTSEVPSSVETV 482 Query: 220 TSASTATTKASEAATSARDAAASKEAAKSSETNASSSASSAASSATAAGNSAKAAKTSET 279 S +TT+AS + +S + S T + +S ++ + + A+ + Sbjct: 483 ESTPESTTEASTESVEPSSTESSTDPVPESTTVSVTSDPTSEPFPETTSSDPEIAQNDSS 542 Query: 280 NARSSETAAGQSASAAAGSKTAAASSASAASTSAGQASASATAAGKSAESAASSASTATT 339 S T + A S++ + +S S + T + ST + Sbjct: 543 TVGPSTTEEYSVEPTPDSTSEAPQSASGTSESSTEYTSEAVTPFPTPSSETTFINSTTSE 602 Query: 340 KAGEATEQASAAARSAS-AAKTSETNAKASETSAESSKTAAASSASSAASSASSASASKD 398 + S + A+ T T+ + SS+T+ +S +S ++ S S + Sbjct: 603 PGTTESLSTSELSTEATVEPSTESTSEAVTPLPTPSSETSFINSTTSELGASESTSETAT 662 Query: 399 EATRQASAAKSSATTASTKATEAAGSATAAAQSKSTAESAATRAETAAKRAEDIASAVAL 458 +S +T S + + T ++E++ + T+ A + S A Sbjct: 663 PLPTPSSETSFINSTTSELGSTGSTPETVTPLPTPSSETSFINSTTSELGASESTSETAT 722 Query: 459 EDASTTKKGIVQLSSATNSTSETLAATPKAVKSAYDNAEKRLQKDQNGADIPDKGCFLNN 518 + + + S+ + S + + + + + Sbjct: 723 PLPTPSSETSFINSTTSELGSTGSTSEAVTPLPTPSSETSFINSTTSELGTTESTSEAVT 782 Query: 519 INAVSKTDFADKRGMRYVRVNAPAGATSGKYYPVVVMRSAGSVSELASRVIITTATRTAG 578 ++ + + + P + S +ITT+ T+ Sbjct: 783 SLPTPSSETTFINSTTSALGTTESTSETATPLPTS-SSETDFANSTTSEPVITTSPPTSE 841 Query: 579 DPMNN 583 P+ Sbjct: 842 SPVTT 846 >UniRef50_C7QJN3 Hedgehog/intein hint domain protein n=1 Tax=Catenulispora acidiphila DSM 44928 RepID=C7QJN3_CATAD Length = 846 Score = 47.8 bits (111), Expect = 0.002, Method: Composition-based stats. 
Identities = 64/586 (10%), Positives = 145/586 (24%), Gaps = 2/586 (0%) Query: 52 DVEYGQYSVILLVEGFPPSHAGTITVYEDSQPGTLNDFLGAMTEDDARPEALRRFELMVE 111 + +Y + +G AG + E S + +++ E L E+ Sbjct: 92 QNDQQKYDQLAAEKGTLEQRAGDVQNRESSLETQATNLEDQSNALNSQAETLNS-EIDAH 150 Query: 112 EVARNASAVAQNTAAAKKSASDASTSAREAATHAADAADSARAASTSAGQAASSAQSASS 171 + + A + + + A + A + + Sbjct: 151 NAEPHTFELPDEEAEYAAYNEEKANLDEQKANLQNQIDALTAQSKKLQSDQAQADADQTQ 210 Query: 172 SAGTASTKATEASKSAAAAESSKSAAATSAGAAKTSETNASASLQSAATSASTATTKASE 231 T S ++ + + + + + A Sbjct: 211 LETDVQTHNDAVSALEGDVGKLEAERQQILTQIDSLLQDYAGAEPGGEGAPLAAEGGDES 270 Query: 232 AATSARDAAASKEAAKSSETNASSSASSAASSATAAGNSAKAAKTSETNARSSETAAGQS 291 +A + A S + S+ S + A + + + + A + Sbjct: 271 EPAAAAAPSPGARALPSGGGDQSAPPRSTYAPVQNAPSGSGSGQAQAQAAPAPAPTQTPV 330 Query: 292 ASAAAGSKTAAASSASAASTSAGQASASATAAGKSAESAASSASTATTKAGEATEQASAA 351 A S + ++ A + + +A A A + Sbjct: 331 TVTLAPSTVSGLPASEAENLQPSETFDGLIPEANGDYAAEEIQPPAGESVPPAQKAFDNV 390 Query: 352 ARSASAAKTSETNAKASETSAESSKTAAASSASSAASSASSASASKDEATRQASAAKSSA 411 A T A+ A A + + A+A ++ + + + Sbjct: 391 VNKGGKASTRIGGRPATIDKIVPEAAAPAENQGGDTPRPAKAAAPPARSSWVPAVNQPNP 450 Query: 412 TTASTKATEAAGSATAAAQSKSTAESAATRAETAAKRAEDIASAVALEDASTTKKGIVQL 471 + +A S S A+ + + + + Sbjct: 451 ANGPPVSIDALKSLLDQQGLGSDADQFDLEYSPTVLGQDGEPAYAVAPTDAAGNPELGAE 510 Query: 472 SSATNSTSETLAATPKAVKSAYDNAEKRLQKDQNGADIPDK-GCFLNNINAVSKTDFADK 530 S P+ + A++N ++ Q+ + P + A T + Sbjct: 511 GKPILRFSNLGLQNPEVAQDAFENEGLDVEPAQDDSPCPHSFSGATRVLMADGSTKAIAE 570 Query: 531 RGMRYVRVNAPAGATSGKYYPVVVMRSAGSVSELASRVIITTATRTAGDPMNNCEFNGFV 590 G+ + NA G + + V + + + V Sbjct: 571 VGVGDLVENAEPGGRAEVHRVDQVHTTTTDAAFVDVVVASAAPGGGGTLTGTANHPYYDA 630 Query: 591 MPGGWTDRGRYAYGMFWQYQNNERAIHSIMMSNKGDDLRSVFYVDG 636 GG+ D G G Q +A S + + G + +DG Sbjct: 631 TAGGFVDAGALRAGDRLQSAGGGQATVSGVHARFGPLVTYDLTIDG 676 >UniRef50_C9SUW3 Putative uncharacterized protein n=1 Tax=Verticillium albo-atrum VaMs.102 RepID=C9SUW3_VERA1 Length = 456 Score = 47.8 bits (111), Expect = 0.003, Method: 
Composition-based stats. Identities = 43/319 (13%), Positives = 89/319 (27%) Query: 95 EDDARPEALRRFELMVEEVARNASAVAQNTAAAKKSASDASTSAREAATHAADAADSARA 154 EDD +PE + E E K+ + A +A D + R Sbjct: 89 EDDQQPERGQDPEQADEMPEEEMDMGGNEIDDEPKAEEEQDGEAPDAEPEQPDEPEDKRQ 148 Query: 155 ASTSAGQAASSAQSASSSAGTASTKATEASKSAAAAESSKSAAATSAGAAKTSETNASAS 214 + A + +A S + S +A A + + + + Sbjct: 149 DADQDTANADTENAAPSDVKSGGQDQNADSMDVENEPDDSAAQRDEGDAGEGAAKQEADA 208 Query: 215 LQSAATSASTATTKASEAATSARDAAASKEAAKSSETNASSSASSAASSATAAGNSAKAA 274 + A S S + +E ++ +A + A + Sbjct: 209 GKKGAVSRSDDQAQPTEQPEDEAAREEPRQDPFKKLGDALERWHRQQTEIKEAEQKEEEG 268 Query: 275 KTSETNARSSETAAGQSASAAAGSKTAAASSASAASTSAGQASASATAAGKSAESAASSA 334 + + + + AA + A + AAS Q A A + E AS Sbjct: 269 EAKQPQQDADDMAAKEFQHMQDDETAPDAQALGAASDDQVQPIDDAMAIDEEKEDPASRV 328 Query: 335 STATTKAGEATEQASAAARSASAAKTSETNAKASETSAESSKTAAASSASSAASSASSAS 394 + + E ++A+ + + + + + A + A +A Sbjct: 329 MPESEQEPEQQDEAAGDQMDTDEPEDKKDGPERDDGRSGVKTRQGAFDREATPEDADAAQ 388 Query: 395 ASKDEATRQASAAKSSATT 413 DE + + ++ Sbjct: 389 MDADEDEHEDNKVDEASAQ 407 >UniRef50_A3X9B4 Putative uncharacterized protein n=1 Tax=Roseobacter sp. MED193 RepID=A3X9B4_9RHOB Length = 876 Score = 47.4 bits (110), Expect = 0.003, Method: Composition-based stats. 
Identities = 57/411 (13%), Positives = 116/411 (28%), Gaps = 13/411 (3%) Query: 99 RPEALRRFELMVEEVARNASAVAQNTAAAKKSASDASTSAREAATHAADAADSARAASTS 158 E A +A+ + + S+ + AA+ D A + Sbjct: 176 PAEVPNPKPNASPSAAGSANTGGKASLPGSTSSPVEALPKAPAASFPVQVEDQVTAVDAT 235 Query: 159 AGQAASSAQ----------SASSSAGTASTKATEASKSAAAAESSKSAAATSAGAAKTSE 208 + A +A +S+ + + KSA A S ++AA + + Sbjct: 236 IEDSGPIAGFSSRRRKSSGAAPASSKSPAQATPAPQKSAPPAVSPEAAAPVTTKPVQAPP 295 Query: 209 TN---ASASLQSAATSASTATTKASEAATSARDAAASKEAAKSSETNASSSASSAASSAT 265 A+ +AA A A A +A S A +S + A+ A A Sbjct: 296 AKGDGAAKPSGAAAAPKVDAAMPAGPTPALAPEAKPSASAKPASTNLENPGAAKAIPQAP 355 Query: 266 AAGNSAKAAKTSETNARSSETAAGQSASAAAGSKTAAASSASAASTSAGQASASATAAGK 325 A G SA+ ++ A A +G+ AA+S + A+ ++ + G Sbjct: 356 APGASAELSRKPTVVKAPPPAAVHPKGPAVSGTAAKAANSLTQAAENSQGSKGKPRFLGL 415 Query: 326 SAESAASSASTATTKAGEATEQASAAARSASAAKTSETNAKASETSAESSKTAAASSASS 385 + A A +E + + + E+ T + Sbjct: 416 ILTAVLLLAMAAIAAFALFSEGGLFFSSERPPVEQTTVPPLDEPVEGETGATEETPTTVE 475 Query: 386 AASSASSASASKDEATRQASAAKSSATTASTKATEAAGSATAAAQSKSTAESAATRAETA 445 +++ ++ + + + A + S S A ET Sbjct: 476 QPTASPPDPSAAADVPVAEVETEPDSIAPQVSAIPSQPDPGVVELSDLALPSDAASPETD 535 Query: 446 AKRAEDIASAVALEDASTTKKGIVQLSSATNSTSETLAATPKAVKSAYDNA 496 ++ V + + + + + + A A Sbjct: 536 SQLDASTGQPVLDALEDNASQQAATEALSQAALYAATGIWQQVPEIAEVPA 586 >UniRef50_Q9KW53 Tail fiber n=9 Tax=Enterobacteriaceae RepID=Q9KW53_PECCC Length = 632 Score = 47.4 bits (110), Expect = 0.003, Method: Composition-based stats. 
Identities = 88/540 (16%), Positives = 154/540 (28%), Gaps = 15/540 (2%) Query: 276 TSETNARSSETAAGQSASAAAGSKTAAASSASAASTSAGQASASATAAGKSAESAASSAS 335 ET+ G S A + + + A AA + A + Sbjct: 16 QIETSDPVVGGPDGVSNRQAKELASRTRYLKKEQEKTGSDLATHAAAADPHTQYAPKANP 75 Query: 336 TATTKAGEATEQASAAARSASAAKTSETNAKASETSAESSKTAAASSASSAASSASSASA 395 T T T ++ + ++ A ++ A S + + ++ Sbjct: 76 TFTGMPKAPTPATDNNSQQVATTAFVKSVVATLINGAPAALDTLQELAKSLGNDPNFSAT 135 Query: 396 SKDEATRQASAAKSSATTASTKATEAAGSATAAAQSKSTAESAATRAETAAKRAEDIASA 455 + + A + + A A A+ + +A T A+ + + Sbjct: 136 VLNAIADVKAEAANKLNAHNVAADPHTQYAPKASPVLTGKPTAPTAAQASNDTQVATTAF 195 Query: 456 VALEDASTTKKGIVQLSSATNSTSETLAATPKAVKSAYDNAEKRLQKDQNGADIPDKGCF 515 V A+ L + + L P+ + + +L KDQNGADI DK F Sbjct: 196 VKAAVAALVNGSPAALDTLQE-LANALGNDPQFSTTVLNALAGKLAKDQNGADIADKNLF 254 Query: 516 LNNINAVSKTDFADKRGMRYVRVNAPAGATSGKY-YPVVVMRSAGSVSELASRVIITTAT 574 + NI A A G K S + I T T Sbjct: 255 VKNIGAARAFHGAINIGGDSGAWKTSDFIAWLKNQGAFNHPYWICKGSWSYANNKIITDT 314 Query: 575 RTAGDPMNNCEFNGFVMPGGWTDRGRYAYGMFWQYQNNERAIHSIMMSNKGDDLRSVFYV 634 + + + + + S + + + + Sbjct: 315 GVGNIQL------------AGSVIEVFGVESATTIRVTTPSTVSAAGAIPNANFTYINHG 362 Query: 635 DGAAFPVFAFIEDGLSISAPGADLVVNDTTYKFGATNPATECIAADVILDFKSGRGFYES 694 D + + +T + A A + G S Sbjct: 363 DNYSPGWRRDYNTRNPTAIDVGTYTKAETDTRVTAATAIANNAATSATNANTNANGRVPS 422 Query: 695 HSLIVNDNLSCKKLFATDEIVARGGNQIRMIGGEYGALWRNDGAKTYLLLTNQGDVYGGW 754 ++ LS A ++ A + + N TN Sbjct: 423 GRMVNGKALSADISLAAGDVGAYTKAETDTRVASATTVANNAATAAVNANTNANGRVPSG 482 Query: 755 NTLRPFAIDNATGELVIGTKLSASLNGNALTATKLQTPRRVSGVEFDGSKDITLTAAHVA 814 + A+ + L G + S NG A+ ATKL TPR+++GV FDGS DI LT A++ Sbjct: 483 RMVNGKALSSDIA-LNAGDIGALSANGTAVAATKLATPRKINGVAFDGSADIILTPANLG 541 >UniRef50_UPI00015FF553 UPI00015FF553 related cluster n=2 Tax=Drosophila melanogaster RepID=UPI00015FF553 Length = 1736 Score = 47.1 bits (109), Expect = 0.004, Method: Composition-based stats. 
Identities = 36/412 (8%), Positives = 104/412 (25%), Gaps = 3/412 (0%) Query: 73 GTITVYEDSQPGTLNDFLGAMTEDDARPEALRRFELMVEEVARNASAVAQNTAAAKKSAS 132 G + V D++ + + PE + + V E+ + + +++ Sbjct: 269 GDLLVELDARSAEQEGESNNKADTECVPE-VSELKTEVSEIEFESVIMETRSSSPPPPLP 327 Query: 133 DASTSAREAATHAADAADSARAASTSAGQAASSAQSASSSAGTASTKATEASKSAAAAES 192 + +R +A ++ + + A + + TE + + Sbjct: 328 KSPPPSRVSAFVLSEEVIEEQVTPNVPEVSDVKPDEIEQLAISIVAEITEQAAEFVTEQE 387 Query: 193 SKSAAATSAGAAKTSETNASASLQSAATSASTATTKASEAATSARDAAASKEAAKSSETN 252 + A +T E ++S + A + + + Sbjct: 388 KQQEEAKVDPVPETIEESSSTVVVEEVLPVQNDKVTAPSPTPDEVQKPIEDQDTPDEKES 447 Query: 253 --ASSSASSAASSATAAGNSAKAAKTSETNARSSETAAGQSASAAAGSKTAAASSASAAS 310 A + A + + + +S A A + + Sbjct: 448 YPVQDPIDPADVNDEVAVTESVDCEVEKETYPTSRRAIENQDEILQEQPAAVKETTEQET 507 Query: 311 TSAGQASASATAAGKSAESAASSASTATTKAGEATEQASAAARSASAAKTSETNAKASET 370 + S A + A T + S++ + + + + Sbjct: 508 SDQQVISEEAHSDNDKKNEIDLQQEEAKVDPVPETIEESSSPVVVEEVLPVQNDKVTAPS 567 Query: 371 SAESSKTAAASSASSAASSASSASASKDEATRQASAAKSSATTASTKATEAAGSATAAAQ 430 + S + + + E ++ A+ Sbjct: 568 PTPDEVQKPIEDQDTPDEKESYPVQDPIDPADVNDEVAVTESVDCEVEKETVSISSNVAE 627 Query: 431 SKSTAESAATRAETAAKRAEDIASAVALEDASTTKKGIVQLSSATNSTSETL 482 S S ++ A E A+ + T+ + ++ + +++ + Sbjct: 628 SSSVSDEQAAIENQDEILQEQPAAVKETTEQETSDQQVISEEAHSDNDKKNE 679 Score = 43.6 bits (100), Expect = 0.049, Method: Composition-based stats. 
Identities = 43/454 (9%), Positives = 114/454 (25%), Gaps = 8/454 (1%) Query: 73 GTITVYEDSQPGTLNDFLGAMTEDDARPEALRRFELMVEEVARNASAVAQNTAAAKKSAS 132 G + V D A E ++ +A V E+ S + + + +S Sbjct: 699 GDLLVE--------LDARSAEQEGESNNKADTECVPEVSELKTEVSEIEFESVIMETRSS 750 Query: 133 DASTSAREAATHAADAADSARAASTSAGQAASSAQSASSSAGTASTKATEASKSAAAAES 192 ++ + +A + + + A + Sbjct: 751 SPPPPLPKSPPPSRVSAFVLSEEVIEEQVTPNVPEVSDVKPDEIEQLAISIVAEITEQAA 810 Query: 193 SKSAAATSAGAAKTSETNASASLQSAATSASTATTKASEAATSARDAAASKEAAKSSETN 252 + +S++T +A + + + Sbjct: 811 EFVTEQEKQQEEAKVDPVPETIEESSSTVVVEEVLPVQNDKVTAPSPTPDEVQKPIEDQD 870 Query: 253 ASSSASSAASSATAAGNSAKAAKTSETNARSSETAAGQSASAAAGSKTAAASSASAASTS 312 S + S S+ ++ + +A Sbjct: 871 TPDEKESYPVPDPIDPADVNDEVAVTESVDCEVEKETVSISSNVAESSSVSDEQAAIENQ 930 Query: 313 AGQASASATAAGKSAESAASSASTATTKAGEATEQASAAARSASAAKTSETNAKASETSA 372 A ++ E S + +A ++ + S KT + + Sbjct: 931 DEILQEQPAAVKETTEQETSDQQVISEEAHSDNDKKNEIDPEVSELKTEVSEIEFESVIM 990 Query: 373 ESSKTAAASSASSAASSASSASASKDEATRQASAAKSSATTASTKATEAAGSATAAAQSK 432 E+ ++ A + +S E + + K E A + Sbjct: 991 ETRSSSPPPPLPKAPPPSRVSSFVLSEEVIEEQVTPNVPEVNDVKPDEIEQLAISIVAEI 1050 Query: 433 STAESAATRAETAAKRAEDIASAVALEDASTTKKGIVQLSSATNSTSETLAATPKAVKSA 492 + + + + + + S++ + ++ N + TP V+ Sbjct: 1051 TEQAAEFVTEQEKQQEEAKVDPVPETIEESSSTVVVEEVLPVQNDKVTAPSPTPDEVQKP 1110 Query: 493 YDNAEKRLQKDQNGADIPDKGCFLNNINAVSKTD 526 ++ + +K+ P +N+ AV+++ Sbjct: 1111 IEDQDTPDEKESYPVPDPIDQAAVNDEVAVTESV 1144 >UniRef50_Q6C7I9 YALI0E00484p n=1 Tax=Yarrowia lipolytica RepID=Q6C7I9_YARLI Length = 853 Score = 47.1 bits (109), Expect = 0.004, Method: Composition-based stats. 
Identities = 49/410 (11%), Positives = 107/410 (26%), Gaps = 1/410 (0%) Query: 42 NPDEAGRYSMDVEYGQYSVILLVEGFPPSHAGTITVYEDSQPGTLNDFLGAMTEDDARPE 101 + YS +V G Y V+G E + + + +D PE Sbjct: 445 RNNADPTYSEEVREGIYDKEPTVDGKTVEQL-DAEEAEIGREAKAHKKAAPIKDDGMSPE 503 Query: 102 ALRRFELMVEEVARNASAVAQNTAAAKKSASDASTSAREAATHAADAADSARAASTSAGQ 161 A E + A ++ A +A + A + Sbjct: 504 AKAAAEKVAAARNDPAPTYSEEVREGIHDKEPTVDGKTVEQLDAEEAKIATEAKANKKAA 563 Query: 162 AASSAQSASSSAGTASTKATEASKSAAAAESSKSAAATSAGAAKTSETNASASLQSAATS 221 A ++ + + A A + A +T + A + Sbjct: 564 APNTDGMSPEAKAAAEKVAAARNDPAPTYSEEVREGIYDQEPTVDGKTVDQLDAEEAKIA 623 Query: 222 ASTATTKASEAATSARDAAASKEAAKSSETNASSSASSAASSATAAGNSAKAAKTSETNA 281 +S+ AK++ +++ + A + + + Sbjct: 624 TQAKLHPYKAGQSSSIKDDGMSPEAKAAADKVAAARNDPAPTYSEEVREGIYDAEPTVDG 683 Query: 282 RSSETAAGQSASAAAGSKTAAASSASAASTSAGQASASATAAGKSAESAASSASTATTKA 341 ++ E + A A +K A+A ST A A A A +A + + + + Sbjct: 684 KTVEQLDAEEAKIATQAKLNPYKKAAAPSTDGMSAEAKAAAEKVAAARSNTDPTYSEEVR 743 Query: 342 GEATEQASAAARSASAAKTSETNAKASETSAESSKTAAASSASSAASSASSASASKDEAT 401 ++ +E E A T S + A++ + Sbjct: 744 EGIYDKEPTVDGKTVEQLDAEEAEIGREAKAHKKATPIKDDGMSPEAKAAAEKVAAARND 803 Query: 402 RQASAAKSSATTASTKATEAAGSATAAAQSKSTAESAATRAETAAKRAED 451 + ++ K G ++ + + K ++ Sbjct: 804 PAPTYSEEVREGIHDKEPTVDGKTVEQLDAEEAKIATDAKLRREGKLPKE 853 >UniRef50_B7PES8 Secreted mucin MUC17, putative (Fragment) n=1 Tax=Ixodes scapularis RepID=B7PES8_IXOSC Length = 2021 Score = 47.1 bits (109), Expect = 0.004, Method: Composition-based stats. 
Identities = 63/643 (9%), Positives = 165/643 (25%), Gaps = 6/643 (0%) Query: 22 IQLKAKRNSTTVVVNTLASENPDEAGRYSMDVEYGQYSVILLVEGFPPSHAGTITVYEDS 81 + ++ S+TVV T + ++ +E G + I + T+T S Sbjct: 771 MTFTTEQGSSTVVEGTSFVPTEESTFVSTVTIEEGSSATIEITGNTREIVTPTVTGESLS 830 Query: 82 QPGTLNDFLGAMTEDDARPEALRRFELMVEEVARNAS-----AVAQNTAAAKKSASDAST 136 + D+ + + + + + +T + + Sbjct: 831 TESSTEHLSTITLITDSSTKVEESTSFLPTKESTFITTVLTMETIGSTEETMTPSVTEKS 890 Query: 137 SAREAATHAADAADSARAASTSAGQAASSAQSASSSAGTASTKATEASKSAAAAESSKSA 196 A E +T ST S + ++ + + + + Sbjct: 891 LATETSTGEPSTITLIYDPSTEMESTPLSESTTVFLTEESTFISAAVTVETTGSAAETKT 950 Query: 197 AATSAGAAKTSETNASASLQSAATSASTATTKASEAATSARDAAASKEAAKSSETNASSS 256 + + T + S + +T + ++ + T SS Sbjct: 951 PPVTGESLGTETSTGEPSTITHIIEPTTEMESTPPSESTLFYPTEESTFVPTIVTEEESS 1010 Query: 257 ASSAASSATAAGNSAKAAKTSETNARSSETAAGQSASAAAGSKTAAASSASAASTSAGQA 316 A+ + + + S S+ + +A S+ + + S + Sbjct: 1011 ATIETTGNPEETMTPSTTEESLGTETSTAELSTITAMTEPTSELGSTPLSEGTSFFYTEE 1070 Query: 317 SASATAAGKSAESAASSASTATTKAGEATEQASAAARSASAAKTSETNAKASETSAESSK 376 S + A ES+A+ +T T+ + + ++ + T +E + E Sbjct: 1071 STFMSTAVTEEESSATIETTGNTEETMTPSTTEESLGTETSTEELSTITVITEPTTEVES 1130 Query: 377 TAAASSASSAASSASSASASKDEATRQASAAKSSATTASTKATEAAGSATAAAQSKSTAE 436 T + S ++ S+ ++ ++ + + T T + S Sbjct: 1131 TPLSEGTSLFSTEESTFMSTAVTEEESSATIEITGNTQETMTPSVTEESLGTESSTEELS 1190 Query: 437 SAATRAETAAKRAEDIASAVALEDASTTKKGIVQLSSATNSTSETLAATPKAVKSAYDNA 496 + E + S + + + S++ Sbjct: 1191 TITMITEPTTEVESTPLSEGTSLIYTEESTFMSTGVTEEESSATIEITGNTEETMTPSVT 1250 Query: 497 EKRLQKDQNGADIPDKGCFLNNINAVSKTDFADKRGMRYVRVNAPAGATSGKYYPVVVMR 556 E+ L + + ++ V T ++ + Y + + + Sbjct: 1251 EESLGTESSTEELSTITMITEPTTEVESTPLSEGTSLFYTEESTFMSTAVTEEESSATIE 1310 Query: 557 SAGSVSELASRVIITTATRTAGDPMNNCEFNGFVMPGGWTDRGRYAYGMFWQYQNNERAI 616 + G+ E +T + + + E Sbjct: 1311 TTGNT-EKTVTPSVTEESLGTETSTEELSTITVITEPTTGVESTPISESTSFFYTEENTF 1369 Query: 617 HSIMMSNKGDDLRSVFYVDGAAFPVFAFIEDGLSISAPGADLV 659 S ++ + + + E L +L Sbjct: 1370 
MSTAVTEEESSPTMKTTSNTEETITPSATEGSLGTETSTEELS 1412 >UniRef50_C2HLK2 Surface protein n=5 Tax=Lactobacillales RepID=C2HLK2_LACAC Length = 1676 Score = 46.7 bits (108), Expect = 0.005, Method: Composition-based stats. Identities = 67/605 (11%), Positives = 131/605 (21%), Gaps = 37/605 (6%) Query: 30 STTVVVNTLASENPDEAGRYSMDVEYGQYSVILLVEGFPPSHAGTITVYEDSQPGTLNDF 89 S+ VV T+ P G+ + D V + P +T + G L + Sbjct: 1061 SSEVVNATVKVTEPTTPGQTADDHNPKYEDVDVK-----PGETNKVTPTNTDKDGNLANI 1115 Query: 90 L--GAMTEDDARPEALRRFELMVEEVARNASAVAQNTAAAKKSASDASTSARE------- 140 +D P + E K + S E Sbjct: 1116 PDGTKFEKDPDAPSWVEVDPNTGELTVAPPEGTPSGEHEIKVKVTYPDGSTDEVPVTVKV 1175 Query: 141 -----AATHAADAADSARAASTSAGQAASSAQSASSSAGTASTKATEASKSAAAAESSKS 195 A D G+ + + G + S Sbjct: 1176 SEPTTPGQTADDHNPKYEDVDVKPGETNKVTPTNTDKDGNPANIPDGTKFEKDPDAPSWV 1235 Query: 196 AAATSAGAAKTSETNASASLQSAATSASTATTKASEAATSARDAAASKEAAKSSETNASS 255 + G + + S T +++ + ++ + Sbjct: 1236 EVDPNTGELTVAPPEGTPSGGHEIKVKVTYPDGSTDEVPVTVKVSDPTTPGQTDADKYTP 1295 Query: 256 SASSAASSATAAGNSAK---------AAKTSETNARSSETAAGQSASAAAGSKTAAASSA 306 A + + A+ + E T G S ++ Sbjct: 1296 EAKDITVTPGPTPDPAEGIGNKDTLPSGTKYEWKDPVDTTTPGDKTGTIVVSYPDGSTDE 1355 Query: 307 SAASTSAGQASASATAAGKSAESAASSASTATTKAGEATEQASAAARSASAAKTSETNAK 366 + + A + + + E S K + Sbjct: 1356 IQVTVKVTDPTTPGQTDADKYTPEAKDITVTLGQTPDPAEGIGNKDTLPSGTKYEWKDPV 1415 Query: 367 ASETSAESSKTAAASSASSAASSASSASASKDEATRQASAAKSSATTASTKATEAAGSAT 426 + T + + T S + + T + A A + Sbjct: 1416 DTTTPGDKTGTIVVSYPDGSTDEIQVTVKVAEPTTPGPTDADKHTPEAKDVTVVQGQTPD 1475 Query: 427 AAAQSKSTAE-----SAATRAETAAKRAEDIASAVALEDASTTKKGIVQLSSATNSTSET 481 A + A + D + + + + T S S T Sbjct: 1476 PAEGIGNKDTLPPGTRYAWKDPVDTTTPGDKTGTIVVTYPDGSTDEVSVTLHVTPSESGT 1535 Query: 482 LAATPKAVKSAYDNAEKRLQKDQNGADIPDKGCFLNNINAVSKTDFADKRGMRYVRVNAP 541 + + K + AD P ++ + K + D G + N Sbjct: 1536 TDTSTTPPTDTSGSDTDTTSKGETPADTPPTDTASDSTDTTPKDENTDNTGGTHKSTNTD 1595 Query: 542 AGATSGKYYPVVVMRSAGSVSELASRVIITTATRTAGDPMNNCEFNGFVMPGGWTDRGRY 601 + + S + S T +N + G TDR Sbjct: 1596 
SSQSGA----TGNTSSGANASSNTEIHASDVTTDQYTTVNDNTADMNTLPQTGETDRNVG 1651 Query: 602 AYGMF 606 +GM Sbjct: 1652 VWGMI 1656 >UniRef50_Q6W4X9 Mucin-6 n=10 Tax=Catarrhini RepID=MUC6_HUMAN Length = 2392 Score = 46.3 bits (107), Expect = 0.007, Method: Composition-based stats. Identities = 46/356 (12%), Positives = 96/356 (26%), Gaps = 28/356 (7%) Query: 258 SSAASSATAAGNSAKAAKTSETNARSSETAAGQSASAAAGSKTAAASSASAASTSAGQAS 317 SS S ++ A +T A + + S+ TA A++ Sbjct: 1233 SSTGPSPSSNHTPASPTQTPLLPATLTSSKPTASSGEPPRPTTAVTPQATSGLPPTATLR 1292 Query: 318 ASATAAGKSAESAASSASTATTKAGEATEQASAAARSASAAKTSETNAK-ASETSAESSK 376 ++AT + + ++ASTA+ + + + TS T+ T+ E Sbjct: 1293 STATKPTVTQATTRATASTASPATTSTAQSTTRTTMTLPTPATSGTSPTLPKSTNQELPG 1352 Query: 377 TAAASSASSAASSASSASASKDEATRQASA--------------------AKSSATTAST 416 T A + + AS+ + + + + + + Sbjct: 1353 TTATQTTGPRPTPASTTGPTTPQPGQPTRPTATETTQTRTTTEYTTPQTPHTTHSPPTAG 1412 Query: 417 KATEAAGSATAAAQSKSTAESAATRAETAAKRAEDI--ASAVALEDASTTKKGIVQLSSA 474 + G TA + +T + ET S V + Q++S+ Sbjct: 1413 SPVPSTGPVTATSFHATTTYPTPSHPETTLPTHVPPFSTSLVTPSTHTVITPTHAQMASS 1472 Query: 475 TN--STSETLAATPKAVKSAYDNAEKRLQKDQNGADIPDKGCFLNN---INAVSKTDFAD 529 + S P +K+ F N + S T Sbjct: 1473 ASNHSAPTGTIPPPTTLKATGSTHTAPPITPTTSGTSQAHSSFSTNKTPTSLHSHTSSTH 1532 Query: 530 KRGMRYVRVNAPAGATSGKYYPVVVMRSAGSVSELASRVIITTATRTAGDPMNNCE 585 + + + + + + S T + T P ++ Sbjct: 1533 HPEVTPTSTTSITPNPTSTRTRTPMAHTNSATSSRPPPPFTTHSPPTGSSPFSSTG 1588 >UniRef50_C0WA14 Putative uncharacterized protein n=1 Tax=Acidaminococcus sp. D21 RepID=C0WA14_9FIRM Length = 1706 Score = 45.9 bits (106), Expect = 0.008, Method: Composition-based stats. 
Identities = 107/910 (11%), Positives = 241/910 (26%), Gaps = 45/910 (4%) Query: 200 SAGAAKTSETNASASLQSAATSASTATTKASEAATSARDAAASKEAAKSSETNASSSASS 259 ++ A + A+ + + A +++ + A + +++ ++ + +S Sbjct: 62 ASLIAAPLPSYAATTEEQVAANSAAIAQNKTNIAANKTAIESNQSGIRAVTRTVEAQQAS 121 Query: 260 AASSATAAGNSAKAAKTSETNARSSETAAGQSASAAAGSKTAAASSASAASTSAGQASAS 319 + A + + A+ + ++E A A+ A + A+ A + + Sbjct: 122 ISGKAEQTDLNKETAERKAADTATNEKVAANKAAIDQNKTDIAKNKTDIAAAQASLSGKA 181 Query: 320 ATAAGKSAESAASSASTATTKAGEATEQASAAARSASAAKTSETNAKASETSAESSKTAA 379 + + S S +TA + + + + + +T+ + T Sbjct: 182 DQSDLDTLSSRVSENATAIGTNRSKINENRSKIKELADSLGFKTDESGTITGQSGLTKYF 241 Query: 380 ASSASSAASSASSASASKDEATRQASAAKSSATTASTKATEAAGSATAAAQSKSTAESAA 439 + +S+++ + + T+ + A + +A + A T A Sbjct: 242 KTKSSTSSYTETMTKEDGTTETKTTTMTDGEAKAEGGNSVAIGPNANSKANQSITLGDGA 301 Query: 440 TRAETAAKRAEDIASAVALEDASTTKKGIVQLSSATNSTSETLAATPKAVKSAYDNAEKR 499 TA++ A+ A ++ T + A Sbjct: 302 VTKSTASRGIAIGQGAITGATADMGADLGSDTATNQGGVDSISIGTLSNARGNDAIAIGH 361 Query: 500 LQKDQNGADIPDKGCFLNNINAVSKTDFADKRGMRYVRVNAPAGATSGKYYPVVVMRSAG 559 + QN A+ A A Sbjct: 362 NAEVQNVPIDGSGTVASKGSLAIGSDAKVYGASYSLALGAGATIAADNLNGNTTNEAIAI 421 Query: 560 SVSELASRVIITTATRTAGDPMNNCEFNGFVMPGGWTDRGRYAYGMFWQYQNNERAIHSI 619 + + + + + ++++ A G + + AI Sbjct: 422 GYNAKVNNNATHAIVIGSNANADKADAIAIGYKA-FSEKNSMALGNNAKASEDSLAIGFG 480 Query: 620 MMSNKGDDLRSVFYVDGAAFPVFAFIEDGLSISAPGADLVVNDTTYKFGATNPATECIAA 679 S + + +GA I G N A + + Sbjct: 481 ATS---SAPNAQAFGNGAVATSGGDISIGNLAGVGSDAKRANVDGSLIAIGVAAGQNVVG 537 Query: 680 DVILDFKSGRGFYESHSLIVNDNLSCKKLFATDEIVARGGNQIRMIGG----EYGALWRN 735 + G + V+ + F T++ + N + G + + Sbjct: 538 TANVAIGDKAGSNVHSNYNVSIGSEAGQGFKTEQTLDNPQNGYNVSIGYKANNFSEISGT 597 Query: 736 DGAKTYLLLTNQGDVYGGWNTLRPFAIDNATGELVIGTKL-------------SASLNGN 782 D + + + Y + A+ N + G S + NGN Sbjct: 598 DTTQYAIAIGANATSYSNSTAIGRAALSNGQYAMAFGDNAHAYDTGSIAFGYNSVAKNGN 657 Query: 783 ALTATKLQTPRRVSG-----VEFDGSKDITLTAAHVAAFARRATDTYADADGGVPWNAES 837 + VSG + S +++ + D D+D ++ Sbjct: 658 
VAIGSGSDAQAIVSGTGYLTQQIAPSSYVSVGTSENLRRISNVADGSLDSDAVTVRQLKT 717 Query: 838 GAYNVTRSGDSYILVNFYTGVGSCRTLQMKAHYRNGGLFYRSSRDGYGFEEDWAEVYTSK 897 + G S V L + F SS + + + Sbjct: 718 AMSQIPSGGTSSGDVTKNYVDQQISNLNSSIEALSKKYFSVSSNENTSTGNKSNDGTSPD 777 Query: 898 NLPPESYPVGAPIPWPSDTVPSGYALMQGQAFDKSAYPKLAAAYPSGVIPDMRGWTIKGK 957 N + G QA D A + +G I IK Sbjct: 778 NKNAMAIGPGT----------------AAQADDALAIGNNTKSTGAGSIAIGSEGPIKST 821 Query: 958 PASGRAVLSQEQDGIKSHTHSASASSTDLG---TKTTSSFDYGTKSTNNTGAHTHSVSGS 1014 L++ + S S + TD ++++ ++ ++ HS++ Sbjct: 822 DPGDSTHLTEAKGERSVSIGSGSIAQTDHSIAIGTRATNYNQVNENNESSNQGNHSIAIG 881 Query: 1015 TNSAGAHTHSLANVNTASANSGAGSASTRLSVVHNQNYATSSAGAHTHSLSGTAASAGAH 1074 + + S+A + A A + + S + + G T T+ GAH Sbjct: 882 YYAETSGDSSIAAGDQAVAAGNGSISIGKESGTDASATDSIAVGTSTKVNGSTSTVVGAH 941 Query: 1075 AHTVGIGAHT 1084 G H Sbjct: 942 NTVEGDRNHA 951 >UniRef50_A8I7A5 Histone methyltransferase n=2 Tax=Eukaryota RepID=A8I7A5_CHLRE Length = 1735 Score = 45.9 bits (106), Expect = 0.009, Method: Composition-based stats. 
Identities = 57/412 (13%), Positives = 112/412 (27%) Query: 87 NDFLGAMTEDDARPEALRRFELMVEEVARNASAVAQNTAAAKKSASDASTSAREAATHAA 146 G ++ + A R+ + + A A + +S+ + E Sbjct: 534 KKAKGLRSQTTSPDAAHRQQPQRKAHSGPSPDKAKRAAAEAASESEASSSESEEDEAEDT 593 Query: 147 DAADSARAASTSAGQAASSAQSASSSAGTASTKATEASKSAAAAESSKSAAATSAGAAKT 206 + A + A ++A +A+ S + +AA S+ + + Sbjct: 594 EDAGAKGKQQPKRSAAPAAAPAAAGSGKAVPLELRRLQADLSAAPSALDLSERRTLRDRP 653 Query: 207 SETNASASLQSAATSASTATTKASEAATSARDAAASKEAAKSSETNASSSASSAASSATA 266 S + SAA + + A + + EAA S E +A S Sbjct: 654 SPARRPPASGSAAPTPTQTGGTAHAGSKRGDKHGSDDEAAMSPEAEDVGAAQQRGLSPRR 713 Query: 267 AGNSAKAAKTSETNARSSETAAGQSASAAAGSKTAAASSASAASTSAGQASASATAAGKS 326 + +E +AA ++ + AA+ + A A A Sbjct: 714 QAAEHSIKSEPSPDTGKAEQDKPVPGAAAGAEGQSSIGAPVAAAAATQAGGAMAAAVPVK 773 Query: 327 AESAASSASTATTKAGEATEQASAAARSASAAKTSETNAKASETSAESSKTAAASSASSA 386 + + +G A + A A + + SA S +A S+A Sbjct: 774 KKPPSLGVPAPARSSGHAMARQLPAIMPAPGSVANRVAGATPAVSAAVSGISAGGGTSTA 833 Query: 387 ASSASSASASKDEATRQASAAKSSATTASTKATEAAGSATAAAQSKSTAESAATRAETAA 446 + A+ + A + A S + E+ A A + Sbjct: 834 GAVPPLAAVVVVSKKPSSGPPTRLAIMERSLDAGTAASTDLPDAVSADGEADARHASSPD 893 Query: 447 KRAEDIASAVALEDASTTKKGIVQLSSATNSTSETLAATPKAVKSAYDNAEK 498 A A + G + + A D ++ Sbjct: 894 VAAVAAVVASDKGQVPRSPGGGAAEEADVAEEEDQQEAAALHATEELDAVDE 945 >UniRef50_D0NLX0 Putative uncharacterized protein n=1 Tax=Phytophthora infestans T30-4 RepID=D0NLX0_PHYIN Length = 732 Score = 45.9 bits (106), Expect = 0.009, Method: Composition-based stats. 
Identities = 52/333 (15%), Positives = 95/333 (28%), Gaps = 5/333 (1%) Query: 81 SQPGTLNDFLGAMTEDDARPEALRRFELMVEEVARNASAVAQNTAAAKKSASDASTSARE 140 ++ E+ + E + + + AS+ K + + + + Sbjct: 387 DNQAEEEKERASIKEEAPATSENQDVEEQQKSLKKPASSAKLLAFLQKVEPASVTPAKVD 446 Query: 141 AATHAADAADSARAASTSAGQAASSAQSASSSAGTASTKATEASKSAAAAESSKSAAATS 200 + D + A S A+ + TA +E + A + TS Sbjct: 447 ISEDEDDDFVAVEAEDISTEDEAAVEEETPVEEETAVETESEEEDESVATSDALEDEETS 506 Query: 201 AGAAKTSETNASASLQSAATSASTATTKASEAATSAR----DAAASKEAAKSSETNASSS 256 A+ + ++++ S +A E T AA EA S + Sbjct: 507 VEEAEVAPEVVEEQVEASKEETSEEEAEAPEDETPEELKVETAAPVAEAIAVSPELLQEN 566 Query: 257 ASSAASSATAAGNSAKAAKTSETNARSSETAAGQSASAAAGSKTAAASSASAASTSAGQA 316 A+ S A + T E+ + A + AS+ + + Sbjct: 567 ERLASESEALTQKLQTAEASVATKDTELESLKHELELLKAQLQEEQASAQEKLLAAEKKQ 626 Query: 317 SASATAAGKSAESAASSASTATTKAGEATEQASAAARSASAAKTSETNAKASETSAESSK 376 T A + A S A T+A A+ + A + KA TS Sbjct: 627 DELVTEAATARAEVAKSQE-AATEASSASREVKEAKIQMKELAVQNESLKADLTSLREEV 685 Query: 377 TAAASSASSAASSASSASASKDEATRQASAAKS 409 S A S +A + A +A Sbjct: 686 QTLEQSVQLARDSEEAARYAAQVAFAARDSADE 718 >UniRef50_B1Z702 Putative uncharacterized protein n=7 Tax=Alphaproteobacteria RepID=B1Z702_METPB Length = 776 Score = 45.5 bits (105), Expect = 0.010, Method: Composition-based stats. 
Identities = 72/473 (15%), Positives = 141/473 (29%), Gaps = 5/473 (1%) Query: 83 PGTLNDFLGAMTEDDARPEALRRFELMVEEVARNASAVAQNTAAAKKSASDASTSAREAA 142 PGT + ++ A+P+A + +SA + + ++A + + Sbjct: 93 PGTASRPQDTKSDVSAKPDASTKPGATGSSAVPPSSAASATAGSPSGKPAEAGRPSTGSG 152 Query: 143 THAADAADSARAASTSAGQAASSAQSASSSAGTASTKATEASKSAAAAESSKSAAATSAG 202 + AA A AA G SSA + S +K A A+ + ++ Sbjct: 153 SPAAAAGTPGGAAPLKPGATGSSAVPPGTGGARFSDAKPADAKPEAPAKPGATGSSAVRP 212 Query: 203 AAKTSETNASASLQSAATSASTATTKASEAATSARDAAASKEAAKSSETNASSSASSAAS 262 A SE+ + SA + + + AAS + A + A + Sbjct: 213 EAPASESAKPDTAASAKPDTAAGAKPGAAPGATRVTEAASLTDGPILDLKAKRLSDPAEA 272 Query: 263 SATAAGNSAKAAKTSETNARSSETAAGQSASAAAGSKTAAASSASAASTSAGQASASATA 322 A ++ K A + +A+ + A + AS +T + A + A++ Sbjct: 273 GKDAGKDTGKNAPGASASAKDASKDAPKDASKDPSKETVRPAPARGGAGFGTVAASGLLG 332 Query: 323 AGKSAESAASSAS-----TATTKAGEATEQASAAARSASAAKTSETNAKASETSAESSKT 377 A + A + + A + ++ + + + Sbjct: 333 GLIGAGLLYGVTTYQRGADPRLAALDTSIAGLATKDAVASLDKRLAANEQALKPLPDAIA 392 Query: 378 AAASSASSAASSASSASASKDEATRQASAAKSSATTASTKATEAAGSATAAAQSKSTAES 437 A ++A +A A A A +A + + A A A +A + Sbjct: 393 RAEAAAKAANDRAGEALQKAGPAPAADGSAPAPSVPADLAARLDALDQRVSALQEEPGRD 452 Query: 438 AATRAETAAKRAEDIASAVALEDASTTKKGIVQLSSATNSTSETLAATPKAVKSAYDNAE 497 +TA A+ + G SS L + A + AE Sbjct: 453 GTAEVKTAPTLDPAALGALDQRIKALEASGAKDASSDLADKIAALQGEVASRTKADEAAE 512 Query: 498 KRLQKDQNGADIPDKGCFLNNINAVSKTDFADKRGMRYVRVNAPAGATSGKYY 550 L K + +G A + A ++ + A + Sbjct: 513 AALGKRVDELQKALEGRITAASQAAQEATQAGRQAADAAQTRADEAVRGLERR 565 >UniRef50_Q6CGV5 YALI0A15796p n=1 Tax=Yarrowia lipolytica RepID=Q6CGV5_YARLI Length = 982 Score = 45.5 bits (105), Expect = 0.011, Method: Composition-based stats. 
Identities = 50/519 (9%), Positives = 117/519 (22%), Gaps = 2/519 (0%) Query: 10 KDGTGKPVQNCTIQLKAKRNSTTVVVNTLASENPDEAGRYSMDVEYGQYSVILLVEGFPP 69 D P S + E S + P Sbjct: 122 TDPATIPESTEPATSTESTTSPETSPPISEEPSTSEEPTTSEKPITSEEPTTSPETSKQP 181 Query: 70 SHAGTITVYEDSQPGTLNDFLGAMTEDDARPEALRRFELMVEEVARNASAVAQNTAAAKK 129 + + T ED T P E + ++ T+ Sbjct: 182 TTSEQPTTSEDPTTSEEPTTSEEPTTTSEGPTTSEEPTTSPETSEQPTTSEQPTTSEQPT 241 Query: 130 SASDASTSAREAATHAADAADSARAASTSAGQAASSAQSASSSAGTASTKATEASKSAAA 189 + S+ T++ E T + + + S+ + ++ + + Sbjct: 242 ATSEEPTTSEEPTTSHETSEQPTTSEQPTTSPETSAPTTNEEPTSSSPVTDPCTTVTTIV 301 Query: 190 AESSKSAAATSAGAAKTSETNASASLQSAATSASTATTKASEAATSARDAAASKEAAKSS 249 T T ++ + Q++ ++ + + + ++ S + Sbjct: 302 TTPPGEEPTTYTTTVDTCSSDPTPQPQTSGSTTTDPCLETTTIVSTPITGEPSTFTITTD 361 Query: 250 ETNASSSASSAASSATAAGNSAKAAKTSETNARSSETAAGQSASAAAGSKTAAASSASAA 309 ++S + ++ + + T + + SA T S Sbjct: 362 VCSSSVPITECVTTIVSNPPGEVPTTLTVTTDVCTPEPTTEITSAPVTECTTTVVSTPPG 421 Query: 310 STSAGQASASATAAGKSAESAASSASTATTKAGEATEQASAAARSASAAKTSETNAKASE 369 + + S ++ T++ + S S T+ + Sbjct: 422 GDPTTMTVTTDVCTPEPVTSEPVTSEPVTSEPVTSEPVTSEPVTSEPVTSEPVTSEPVTS 481 Query: 370 TSAESSKTAAASSASSAASSASSASASKDEATRQASAAKSSATTASTKATEAAGSATAAA 429 S + + + ++ +E T T T+ T + T Sbjct: 482 EPVTSEPVTSEPVTPTECITTVVSTPPGEEPTTLTITTDICTTEPPTQTT--SAPVTECT 539 Query: 430 QSKSTAESAATRAETAAKRAEDIASAVALEDASTTKKGIVQLSSATNSTSETLAATPKAV 489 + + V E S T+ +S+ N TL T Sbjct: 540 TTVVSTPPGGDPTTLTITTDVCTPEPVTSEPVSPTECITTVVSTPPNGEPTTLTVTTDYC 599 Query: 490 KSAYDNAEKRLQKDQNGADIPDKGCFLNNINAVSKTDFA 528 D + + Sbjct: 600 PPVSTPVVPTDCVTTTTVTTTDPAGEPSTVTTTIDNCPT 638 >UniRef50_C1CZC2 Putative uncharacterized protein n=1 Tax=Deinococcus deserti VCD115 RepID=C1CZC2_DEIDV Length = 700 Score = 45.5 bits (105), Expect = 0.011, Method: Composition-based stats. 
Identities = 73/497 (14%), Positives = 118/497 (23%), Gaps = 4/497 (0%) Query: 100 PEALRRFELMVEEVARNASAVAQNTAAAKKSASDASTSAREAATHAADAADSARAASTSA 159 PE R + + + A A Sbjct: 45 PEPRRTPADARRAPLEVVTLAPAPQPTPPPEQPAEAAAPLRPPAAQPTPPKPAPTKPEPA 104 Query: 160 GQAASSAQSASSSAGTASTKATEASKSAAAAESSKSAAATSAGAAKTSETNASASLQSAA 219 A + + T A + + + A + T S + AA Sbjct: 105 KPAPARPVAVKPPPVKTPTVTPPARPTPVPPAVAPAPATPAPSRPTTGRDAQSNNTPQAA 164 Query: 220 TSASTATTKASEAATSARDAAASKEAAKSSETNASSSASSAASSATAAGNSAKAAKTSET 279 + AT + A A + + A T + A + + SA + +TS+ Sbjct: 165 SGPPKATAAPAVEAAGAPSSQVAGAAPAPRTTVDTPVAVAPSPSAPVEEPAELTPETSQA 224 Query: 280 NARSSETAAGQSASAAAGSKTAAASSASAASTSAGQASASATAAGKSAESAASSASTATT 339 A+ A S S A S +A +A + Sbjct: 225 PGPGELDPVAAEPGASEAPAPELAPSESEAVAPPEPESVAALPRRDEITAAPETEPERLP 284 Query: 340 KAGEATEQASAAARSASAAKTSETNA-KASETSAESSKTAAASSASSAASSASSASASKD 398 A E + + A TS A A +S+T +A + + AS Sbjct: 285 AAPEPIQAPARTPTPAPEGATSPVAAIPARPAQTSASQTLEVPAAPRGTLAPAPASEVAP 344 Query: 399 EATRQASAAKSSATTASTKATEAAGSATAAAQSKSTAESAATRAETAAKRAEDIASAVAL 458 R A + A A A A +T + +AA A + V Sbjct: 345 VPARPIPRAPEVPARPPAASVAPAPQAAAPAPVPATPANRPAAPVSAAAEAPSTSGPVPS 404 Query: 459 EDASTTKKGIV--QLSSATNSTSETLAATPKAVKSAYDNAEKRLQKDQNGADIPDKGCFL 516 + T + +A + T A T + A P Sbjct: 405 RSTTPTSGTAEERPVEAARRNPDRTPAETRAPATTPEPRPATGAGGSP-VAQAPSARAGT 463 Query: 517 NNINAVSKTDFADKRGMRYVRVNAPAGATSGKYYPVVVMRSAGSVSELASRVIITTATRT 576 N A + T R A +G S + A Sbjct: 464 RNPAAGTPTAGTAPERPAATRTPAATTPAAGTAPAGAAPSGTAPESTAPGGTAPSGAAPG 523 Query: 577 AGDPMNNCEFNGFVMPG 593 G +G G Sbjct: 524 GGTSGATPGGSGTAPAG 540 Database: uniref50.fasta Posted date: Mar 8, 2010 10:38 AM Number of letters in database: 1,040,396,356 Number of sequences in database: 3,077,464 Lambda K H 0.293 0.0968 0.199 Lambda K H 0.267 0.0298 0.140 Matrix: BLOSUM62 Gap Penalties: Existence: 11, Extension: 1 Number of Hits to DB: 3,354,059,381 Number of Sequences: 3077464 Number of extensions: 117814148 Number of successful extensions: 1572026 Number of sequences better 
than 1.0e-01: 250
Number of HSP's better than 0.1 without gapping: 12524
Number of HSP's successfully gapped in prelim test: 29535
Number of HSP's that attempted gapping in prelim test: 892245
Number of HSP's gapped (non-prelim): 332743
length of query: 1120
length of database: 1,040,396,356
effective HSP length: 140
effective length of query: 980
effective length of database: 609,551,396
effective search space: 597360368080
effective search space used: 597360368080
T: 11
A: 40
X1: 16 ( 6.8 bits)
X2: 38 (14.6 bits)
X3: 64 (24.7 bits)
S1: 42 (21.1 bits)
S2: 98 (42.8 bits)