BLASTP 2.2.22 [Sep-27-2009] Reference: Altschul, Stephen F., Thomas L. Madden, Alejandro A. Schaffer, Jinghui Zhang, Zheng Zhang, Webb Miller, and David J. Lipman (1997), "Gapped BLAST and PSI-BLAST: a new generation of protein database search programs", Nucleic Acids Res. 25:3389-3402. Reference for compositional score matrix adjustment: Altschul, Stephen F., John C. Wootton, E. Michael Gertz, Richa Agarwala, Aleksandr Morgulis, Alejandro A. Schaffer, and Yi-Kuo Yu (2005) "Protein database searches using compositionally adjusted substitution matrices", FEBS J. 272:5101-5109. Reference for composition-based statistics starting in round 2: Schaffer, Alejandro A., L. Aravind, Thomas L. Madden, Sergei Shavirin, John L. Spouge, Yuri I. Wolf, Eugene V. Koonin, and Stephen F. Altschul (2001), "Improving the accuracy of PSI-BLAST protein database searches with composition-based statistics and other refinements", Nucleic Acids Res. 29:2994-3005. Query= batch____ (1520 letters) Database: uniref50.fasta 3,077,464 sequences; 1,040,396,356 total letters Searching..................................................done Results from round 1 Score E Sequences producing significant alignments: (bits) Value UniRef50_Q46837 Putative lipoprotein acfD homolog n=40 Tax=Gamma... 3155 0.0 UniRef50_A6Y2J7 Large exoproteins involved in heme utilization o... 1414 0.0 UniRef50_A5F372 Accessory colonization factor AcfD n=25 Tax=Vibr... 1391 0.0 UniRef50_A6AXB7 AcfD n=3 Tax=Gammaproteobacteria RepID=A6AXB7_VIBPA 1360 0.0 UniRef50_D0YX39 Accessory colonization factor AcfD n=1 Tax=Photo... 1009 0.0 UniRef50_A6ALV4 AcfD n=5 Tax=Vibrio RepID=A6ALV4_VIBHA 969 0.0 UniRef50_A8H5W9 Inner membrane lipoprotein n=1 Tax=Shewanella pe... 964 0.0 UniRef50_B5FBF1 Inner membrane lipoprotein n=1 Tax=Vibrio fische... 958 0.0 UniRef50_Q5E705 Accessory colonization factor AcfD-like protein,... 672 0.0 UniRef50_B8B8E6 Putative uncharacterized protein n=5 Tax=cellula... 157 3e-36 UniRef50_UPI0001B9ED55 hypothetical protein GYMC10_4682 n=1 Tax=... 117 5e-24 UniRef50_UPI000178AA33 hypothetical protein GYMC10_4678 n=1 Tax=... 99 1e-18 UniRef50_C1ZFD9 Putative uncharacterized protein n=1 Tax=Plancto... 97 4e-18 UniRef50_B2ULE8 Putative uncharacterized protein n=1 Tax=Akkerma... 95 2e-17 UniRef50_UPI00017445F8 hypothetical protein VspiD_04825 n=1 Tax=... 91 3e-16 UniRef50_A4GHK6 Putative uncharacterized protein n=1 Tax=uncultu... 86 1e-14 UniRef50_B4DBQ1 Putative uncharacterized protein n=1 Tax=Chthoni... 74 5e-11 UniRef50_A4IG42 Protein FAM115 n=14 Tax=Clupeocephala RepID=F115... 73 9e-11 UniRef50_C2G0M2 Putative uncharacterized protein n=2 Tax=Sphingo... 73 1e-10 UniRef50_D2VFD2 Predicted protein n=1 Tax=Naegleria gruberi RepI... 72 2e-10 UniRef50_C2G4C5 Putative uncharacterized protein n=2 Tax=Sphingo... 67 4e-09 UniRef50_D2VUE2 Predicted protein n=1 Tax=Naegleria gruberi RepI... 65 2e-08 UniRef50_C3XPE8 Putative uncharacterized protein n=1 Tax=Branchi... 65 2e-08 UniRef50_A2EC39 Putative uncharacterized protein n=1 Tax=Trichom... 62 2e-07 UniRef50_UPI000180C854 PREDICTED: similar to transmembrane agrin... 59 1e-06 UniRef50_D1BQB2 Putative uncharacterized protein n=1 Tax=Veillon... 52 2e-04 UniRef50_B9GV44 Predicted protein n=3 Tax=Populus trichocarpa Re... 52 2e-04 UniRef50_A9A5M1 Putative uncharacterized protein n=1 Tax=Nitroso... 48 0.003 UniRef50_A0YJH0 Polymorphic membrane protein n=1 Tax=Lyngbya sp.... 47 0.008 UniRef50_Q8YLY9 All5153 protein n=6 Tax=Nostocaceae RepID=Q8YLY9... 47 0.008 UniRef50_C0WIN3 Putative uncharacterized protein n=1 Tax=Coryneb... 46 0.010 UniRef50_B7P4H6 Putative uncharacterized protein n=1 Tax=Ixodes ... 45 0.019 UniRef50_C2M7W3 Putative uncharacterized protein n=1 Tax=Capnocy... 45 0.025 UniRef50_Q99109 Rep1-C n=1 Tax=Ustilago maydis RepID=REP1_USTMA 45 0.029 UniRef50_B4FNV9 Early nodulin 75 protein n=2 Tax=Zea mays RepID=... 44 0.067 >UniRef50_Q46837 Putative lipoprotein acfD homolog n=40 Tax=Gammaproteobacteria RepID=ACFD_ECOLI Length = 1520 Score = 3155 bits (8180), Expect = 0.0, Method: Compositional matrix adjust. Identities = 1520/1520 (100%), Positives = 1520/1520 (100%) Query: 1 MNKKFKYKKSLLAAILSATLLAGCDGGGSGSSSDTPPVDSGTGSLPEVKPDPTPNPEPTP 60 MNKKFKYKKSLLAAILSATLLAGCDGGGSGSSSDTPPVDSGTGSLPEVKPDPTPNPEPTP Sbjct: 1 MNKKFKYKKSLLAAILSATLLAGCDGGGSGSSSDTPPVDSGTGSLPEVKPDPTPNPEPTP 60 Query: 61 EPTPDPEPTPEPIPDPEPTPEPEPEPVPTKTGYLTLGGSQRVTGATCNGESSDGFTFKPG 120 EPTPDPEPTPEPIPDPEPTPEPEPEPVPTKTGYLTLGGSQRVTGATCNGESSDGFTFKPG Sbjct: 61 EPTPDPEPTPEPIPDPEPTPEPEPEPVPTKTGYLTLGGSQRVTGATCNGESSDGFTFKPG 120 Query: 121 EDVTCVAGNTTIATFNTQSEAARSLRAVEKVSFSLEDAQELAGSDDKKSNAVSLVTSSNS 180 EDVTCVAGNTTIATFNTQSEAARSLRAVEKVSFSLEDAQELAGSDDKKSNAVSLVTSSNS Sbjct: 121 EDVTCVAGNTTIATFNTQSEAARSLRAVEKVSFSLEDAQELAGSDDKKSNAVSLVTSSNS 180 Query: 181 CPANTEQVCLTFSSVIESKRFDSLYKQIDLAPEEFKKLVNEEVENNAATDKAPSTHTSPV 240 CPANTEQVCLTFSSVIESKRFDSLYKQIDLAPEEFKKLVNEEVENNAATDKAPSTHTSPV Sbjct: 181 CPANTEQVCLTFSSVIESKRFDSLYKQIDLAPEEFKKLVNEEVENNAATDKAPSTHTSPV 240 Query: 241 VPVTTPGTKPDLNASFVSANAEQFYQYQPTEIILSEGRLVDSQGYGVAGVNYYTNSGRGV 300 VPVTTPGTKPDLNASFVSANAEQFYQYQPTEIILSEGRLVDSQGYGVAGVNYYTNSGRGV Sbjct: 241 VPVTTPGTKPDLNASFVSANAEQFYQYQPTEIILSEGRLVDSQGYGVAGVNYYTNSGRGV 300 Query: 301 TGENGEFSFSWGETISFGIDTFELGSVRGNKSTIALTELGDEVRGANIDQLIHRYSTTGQ 360 TGENGEFSFSWGETISFGIDTFELGSVRGNKSTIALTELGDEVRGANIDQLIHRYSTTGQ Sbjct: 301 TGENGEFSFSWGETISFGIDTFELGSVRGNKSTIALTELGDEVRGANIDQLIHRYSTTGQ 360 Query: 361 NNTRVVPDDVRKVFAEYPNVINEIINLSLSNGATLGEGEQVVNLPNEFIEQFNTGQAKEI 420 NNTRVVPDDVRKVFAEYPNVINEIINLSLSNGATLGEGEQVVNLPNEFIEQFNTGQAKEI Sbjct: 361 NNTRVVPDDVRKVFAEYPNVINEIINLSLSNGATLGEGEQVVNLPNEFIEQFNTGQAKEI 420 Query: 421 DTAICAKTDGCNEARWFSLTTRNVNDGQIQGVINKLWGVDTNYKSVSKFHVFHDSTNFYG 480 DTAICAKTDGCNEARWFSLTTRNVNDGQIQGVINKLWGVDTNYKSVSKFHVFHDSTNFYG Sbjct: 421 DTAICAKTDGCNEARWFSLTTRNVNDGQIQGVINKLWGVDTNYKSVSKFHVFHDSTNFYG 480 Query: 481 STGNARGQAVVNISNAAFPILMARNDKNYWLAFGEKRAWDKNELAYITEAPSLVEPENVT 540 STGNARGQAVVNISNAAFPILMARNDKNYWLAFGEKRAWDKNELAYITEAPSLVEPENVT Sbjct: 481 STGNARGQAVVNISNAAFPILMARNDKNYWLAFGEKRAWDKNELAYITEAPSLVEPENVT 540 Query: 541 RDTATFNLPFISLGQVGEGKLMVIGNPHYNSILRCPNGYSWNGGVNKDGQCTLNSDPDDM 600 RDTATFNLPFISLGQVGEGKLMVIGNPHYNSILRCPNGYSWNGGVNKDGQCTLNSDPDDM Sbjct: 541 RDTATFNLPFISLGQVGEGKLMVIGNPHYNSILRCPNGYSWNGGVNKDGQCTLNSDPDDM 600 Query: 601 KNFMENVLRYLSDDKWKPDAKASMTVGTNLDTVYFKRHGQVTGNSAAFDFHPDFAGISVE 660 KNFMENVLRYLSDDKWKPDAKASMTVGTNLDTVYFKRHGQVTGNSAAFDFHPDFAGISVE Sbjct: 601 KNFMENVLRYLSDDKWKPDAKASMTVGTNLDTVYFKRHGQVTGNSAAFDFHPDFAGISVE 660 Query: 661 HLSSYGDLDPQEMPLLILNGFEYVTQVGNDPYAIPLRADTSKPKLTQQDVTDLIAYLNKG 720 HLSSYGDLDPQEMPLLILNGFEYVTQVGNDPYAIPLRADTSKPKLTQQDVTDLIAYLNKG Sbjct: 661 HLSSYGDLDPQEMPLLILNGFEYVTQVGNDPYAIPLRADTSKPKLTQQDVTDLIAYLNKG 720 Query: 721 GSVLIMENVMSNLKEESASGFVRLLDAAGLSMALNKSVVNNDPQGYPNRVRQQRATGIWV 780 GSVLIMENVMSNLKEESASGFVRLLDAAGLSMALNKSVVNNDPQGYPNRVRQQRATGIWV Sbjct: 721 GSVLIMENVMSNLKEESASGFVRLLDAAGLSMALNKSVVNNDPQGYPNRVRQQRATGIWV 780 Query: 781 YERYPAVDGALPYTIDSKTGEVKWKYQVENKPDDKPKLEVASWLEDVDGKQETRYAFIDE 840 YERYPAVDGALPYTIDSKTGEVKWKYQVENKPDDKPKLEVASWLEDVDGKQETRYAFIDE Sbjct: 781 YERYPAVDGALPYTIDSKTGEVKWKYQVENKPDDKPKLEVASWLEDVDGKQETRYAFIDE 840 Query: 841 ADHKTEDSLKAAKEKIFAAFPGLKECTNPAYHYEVNCLEYRPGTGVPVTGGMYVPQYTQL 900 ADHKTEDSLKAAKEKIFAAFPGLKECTNPAYHYEVNCLEYRPGTGVPVTGGMYVPQYTQL Sbjct: 841 ADHKTEDSLKAAKEKIFAAFPGLKECTNPAYHYEVNCLEYRPGTGVPVTGGMYVPQYTQL 900 Query: 901 SLNADTAKAMVQAADLGTNIQRLYQHELYFRTNGRKGERLSSVDLERLYQNMSVWLWNDT 960 SLNADTAKAMVQAADLGTNIQRLYQHELYFRTNGRKGERLSSVDLERLYQNMSVWLWNDT Sbjct: 901 SLNADTAKAMVQAADLGTNIQRLYQHELYFRTNGRKGERLSSVDLERLYQNMSVWLWNDT 960 Query: 961 SYRYEEGKNDELGFKTFTEFLNCYANDAYAGGTKCSADLKKSLVDNNMIYGDGSSKAGMM 1020 SYRYEEGKNDELGFKTFTEFLNCYANDAYAGGTKCSADLKKSLVDNNMIYGDGSSKAGMM Sbjct: 961 SYRYEEGKNDELGFKTFTEFLNCYANDAYAGGTKCSADLKKSLVDNNMIYGDGSSKAGMM 1020 Query: 1021 NPSYPLNYMEKPLTRLMLGRSWWDLNIKVDVEKYPGAVSEEGQNVTETISLYSNPTKWFA 1080 NPSYPLNYMEKPLTRLMLGRSWWDLNIKVDVEKYPGAVSEEGQNVTETISLYSNPTKWFA Sbjct: 1021 NPSYPLNYMEKPLTRLMLGRSWWDLNIKVDVEKYPGAVSEEGQNVTETISLYSNPTKWFA 1080 Query: 1081 GNMQSTGLWAPAQKEVTIKSNANVPVTVTVALADDLTGREKHEVALNRPPRVTKTYSLDA 1140 GNMQSTGLWAPAQKEVTIKSNANVPVTVTVALADDLTGREKHEVALNRPPRVTKTYSLDA Sbjct: 1081 GNMQSTGLWAPAQKEVTIKSNANVPVTVTVALADDLTGREKHEVALNRPPRVTKTYSLDA 1140 Query: 1141 SGTVKFKVPYGGLIYIKGNSSTNESASFTFTGVVKAPFYKDGAWKNDLNSPAPLGELESD 1200 SGTVKFKVPYGGLIYIKGNSSTNESASFTFTGVVKAPFYKDGAWKNDLNSPAPLGELESD Sbjct: 1141 SGTVKFKVPYGGLIYIKGNSSTNESASFTFTGVVKAPFYKDGAWKNDLNSPAPLGELESD 1200 Query: 1201 AFVYTTPKKNLNASNYTGGLEQFANDLDTFASSMNDFYGRDSEDGKHRMFTYKNLPGHKH 1260 AFVYTTPKKNLNASNYTGGLEQFANDLDTFASSMNDFYGRDSEDGKHRMFTYKNLPGHKH Sbjct: 1201 AFVYTTPKKNLNASNYTGGLEQFANDLDTFASSMNDFYGRDSEDGKHRMFTYKNLPGHKH 1260 Query: 1261 RFTNDVQISIGDAHSGYPVMNSSFSPNSTTLPTTPLNDWLIWHEVGHNAAETPLTVPGAT 1320 RFTNDVQISIGDAHSGYPVMNSSFSPNSTTLPTTPLNDWLIWHEVGHNAAETPLTVPGAT Sbjct: 1261 RFTNDVQISIGDAHSGYPVMNSSFSPNSTTLPTTPLNDWLIWHEVGHNAAETPLTVPGAT 1320 Query: 1321 EVANNVLALYMQDRYLGKMNRVADDITVAPEYLEESNNQAWARGGAGDRLLMYAQLKEWA 1380 EVANNVLALYMQDRYLGKMNRVADDITVAPEYLEESNNQAWARGGAGDRLLMYAQLKEWA Sbjct: 1321 EVANNVLALYMQDRYLGKMNRVADDITVAPEYLEESNNQAWARGGAGDRLLMYAQLKEWA 1380 Query: 1381 EKNFDIKKWYPDGTPLPEFYSEREGMKGWNLFQLMHRKARGDEVSNDKFGGKNYCAESNG 1440 EKNFDIKKWYPDGTPLPEFYSEREGMKGWNLFQLMHRKARGDEVSNDKFGGKNYCAESNG Sbjct: 1381 EKNFDIKKWYPDGTPLPEFYSEREGMKGWNLFQLMHRKARGDEVSNDKFGGKNYCAESNG 1440 Query: 1441 NAADTLMLCASWVAQTDLSEFFKKWNPGANAYQLPGASEMSFEGGVSQSAYNTLASLDLP 1500 NAADTLMLCASWVAQTDLSEFFKKWNPGANAYQLPGASEMSFEGGVSQSAYNTLASLDLP Sbjct: 1441 NAADTLMLCASWVAQTDLSEFFKKWNPGANAYQLPGASEMSFEGGVSQSAYNTLASLDLP 1500 Query: 1501 KPEQGPETINQVTEHKMSAE 1520 KPEQGPETINQVTEHKMSAE Sbjct: 1501 KPEQGPETINQVTEHKMSAE 1520 >UniRef50_A6Y2J7 Large exoproteins involved in heme utilization or adhesion n=14 Tax=Vibrionaceae RepID=A6Y2J7_VIBCH Length = 1526 Score = 1414 bits (3661), Expect = 0.0, Method: Compositional matrix adjust. Identities = 770/1575 (48%), Positives = 1000/1575 (63%), Gaps = 122/1575 (7%) Query: 9 KSLLAAILSATLLAGCDGGGSGSSSDTPPVDSGTGSLPEVKPDPTPNPEPTPEPTPDPEP 68 K L AI+ A +AGC+ + SDT +KPD P Sbjct: 5 KIKLTAIMVALAIAGCNHDSIPNPSDT------------IKPD-------IPNIGEGDSV 45 Query: 69 TPEPIPDPEPTPEPEPEPVPTKTGYLTLGGSQRV-TGATCNGESSDGFTFKPGEDVTCVA 127 T +PD P L+L GS CN + ++ F+ + + V C Sbjct: 46 TGGELPDIGVDD-------PIIISRLSLDGSLMFGESVQCNDQPANQFSVEQKDHVVCTL 98 Query: 128 GNTTIATFNTQSEAARSLRAVEKVSF---SLEDAQELAGSDDKKSNAVSLVTSSNSCPAN 184 T+ATF++ S A +L +A E S +++N +L+ + + Sbjct: 99 DGQTLATFSSPFNIPNSRMAARPSGLELLTLTNADEYKESVLRQANLQTLIKNMGNLQG- 157 Query: 185 TEQVCLTFSSVIESKRFDS-LYKQIDLAPEEFKKLVNEEVENNAATDKAPSTHTSPVVPV 243 + + F S +S F + L +D+ EEF++L+ E++ N+ DK PSTH + P Sbjct: 158 -KNIDFNFESSRDSLTFQNYLRNNLDMPAEEFRELITEKISNDNQVDKQPSTHVPDIPPA 216 Query: 244 TTPGTKPDLNASFVSANAEQFYQYQPTEIILSEGRLVDSQGYGVAGVNYYTNSGRGVTGE 303 TPG DLN+ FVSANAE+ Y+PTEIILS+G+L+DSQG V G+ Y++N RGVTG Sbjct: 217 VTPGASNDLNSGFVSANAEENLVYKPTEIILSQGQLLDSQGRPVNGIAYFSNHSRGVTGI 276 Query: 304 N--------GEFSFSWGETISFGIDTFELGSVRGNKSTIALTELGDEVRGANIDQLIHRY 355 N G F FSWG+TISF IDTFELG +RGNK+T L ELG E G N + L+ RY Sbjct: 277 NKNGQATGDGSFEFSWGDTISFAIDTFELGHIRGNKNTFKLNELGSEWAGKNAETLVLRY 336 Query: 356 STTGQNNTRVVPDDVRKVFAEYPNVINEIINLSLSNGAT---LGEGEQVVNLPNEFIEQF 412 + + + D V +VF++YPNV+NE I+LSLSN +G GE+ +P EF +QF Sbjct: 337 ANI-SGDIVSLSDKVTQVFSQYPNVVNESISLSLSNEDVELDVGGGEKQT-VPGEFHKQF 394 Query: 413 NTGQAKEIDTAICAKTDGCNEAR----WFSLTTRNVND---GQIQGVINKLWGV-----D 460 + G A EID A+ N +R W + T++ D +I I KLWG Sbjct: 395 SQGIAAEIDQAL-------NPSRASQMWSTFATKSAVDPEASRILADIQKLWGATEEVQK 447 Query: 461 TNYKSVSKFHVFHDSTNFYGSTGNARGQAVVNISNAAFPILMARNDKNYWLAFGEKRAWD 520 +K V +FHVFHDSTNFYGSTG+ARGQA VNI+N AFP+LMARND NYW+ FG+ +AWD Sbjct: 448 QGWKKVERFHVFHDSTNFYGSTGHARGQAAVNIANTAFPVLMARNDNNYWIDFGKPKAWD 507 Query: 521 KNELAYITEAPSLVEPENVTRDTATFNLPFISLGQVGEGKLMVIGNPHYNSILRCPNGYS 580 LA+ITEAPSLV+PE V+ +TATFNLPFIS+G +G GK+MVIGN YNSIL CPNG+S Sbjct: 508 DQGLAFITEAPSLVQPEKVSAETATFNLPFISVGDLGRGKVMVIGNSRYNSILVCPNGFS 567 Query: 581 WNGGVNKDGQCTLNSDPDDMKNFMENVLRYLSDDKWKPDAKASMTVGTNLDTVYFKRHGQ 640 WNGGVN G+C +SD +DM NF NVLRYL+ A +TVGTN+ VYFKR+GQ Sbjct: 568 WNGGVNHQGECIASSDSNDMGNFFSNVLRYLTGKN-----SAELTVGTNIPYVYFKRYGQ 622 Query: 641 VTGNSAAFDFHPDFAGISVEHLSSYGDLDPQEMPLLILNGFEYVTQVGNDPYAIPLRADT 700 V G+SA F FA E L+S+ LDP+ +P++ILNG+EY G Y +PL ADT Sbjct: 623 VMGSSAPFILDTRFAA-RTETLTSFEGLDPETLPVVILNGYEYRGLRGMGSYDLPLSADT 681 Query: 701 SKPKLTQQDVTDLIAYLNKGGSVLIMENVMSNLKEESASGFVRLLDAAGLSMALNKSVV- 759 +PK++Q DVT+LI Y+N+GG+VL+ME + + + +A RLLD++G++ + SV+ Sbjct: 682 DEPKMSQDDVTNLIDYVNRGGNVLMMETI---IDQANAGEMTRLLDSSGIAFGMGSSVIA 738 Query: 760 --NNDPQGYPNRVRQQRATGIWVYERYPAVDGA------LPYTIDSKTGEVKWKYQVENK 811 N GYP+RVR QR GIWV ERY A++G LPYTI + G V+W + +E K Sbjct: 739 NGNGPSGGYPDRVRNQRQHGIWVLERYAAIEGGNGAAPMLPYTI-KEDGTVEWTFIIEGK 797 Query: 812 PDDKPKLEVASWLE-DVDGKQETRYAFIDEADH------------------KTEDSLKAA 852 PDDKP LEVASWLE + +G + AFI EADH E SL AA Sbjct: 798 PDDKPNLEVASWLEKNSEGSLVKQVAFIYEADHWQKNEQGQIIYNESGKPVLNEASLAAA 857 Query: 853 KEKIFAAFP------GLKECTNPAYHYEVNCLEYRPGTGVPVTGGMYVPQYTQLSLNADT 906 K++I AF +EC+N YHYEVNCLEYRPG +P GGM+VP YT+L L + Sbjct: 858 KQRILNAFVTSDGKLAYQECSNSHYHYEVNCLEYRPGNAIPTGGGMHVPFYTELKLGDEE 917 Query: 907 AKAMVQAADLGTNIQRLYQHELYFRTNGRKGERLSSVDLERLYQNMSVWLWNDTSYRYEE 966 AKAM++AA+LGTNI+ LYQHE YFRT G++GERLSSVDL R+YQNM+VWLWND YRYE Sbjct: 918 AKAMIKAANLGTNIEALYQHERYFRTKGKQGERLSSVDLNRIYQNMTVWLWNDLDYRYEA 977 Query: 967 GKNDELGFKTFTEFLNCYANDAYAGGTKCSADLKKSLVDNNMIYGDGSSKAGMMNPSYPL 1026 G +DELGF+ FTEFLNCY ND G T+C DLK L M+YG+G AG MNPSYPL Sbjct: 978 GHDDELGFQRFTEFLNCYTNDVAGGNTQCPTDLKLELNQMGMVYGEG-EYAGQMNPSYPL 1036 Query: 1027 NYMEKPLTRLMLGRSWWDLNIKVDVEKYPGAVSEEGQNVTETISLYSNPTKWFAGNMQST 1086 NYMEKPLTRLMLGRS+WDL+IKVDV +PG ++ Q T + + + T WFAGN Q T Sbjct: 1037 NYMEKPLTRLMLGRSFWDLDIKVDVRAFPGE-AKGSQGRTIILDMRNQTTAWFAGNRQPT 1095 Query: 1087 GLWAPAQKEVTIK-SNANVPVTVTVALADDLTGREKHEVALNRPPRVTKTYSLDASG--- 1142 G WA AQ+E ++ S + PVT+T+ALADDLTGREKHE+ L RPPR++K++ + Sbjct: 1096 GQWAVAQQEFSVAVSGEDSPVTITIALADDLTGREKHELGLKRPPRMSKSFVIGGENGNP 1155 Query: 1143 -TVKFKVPYGGLIYIKGNSSTNESASFTFTGVVKAPFYKDGAWKNDLNSPAPLGELESDA 1201 + F VPYGGLIY +G +S E S TFTG + AP YK+G W+N L+SPAP+GE+ S++ Sbjct: 1156 TSKTFTVPYGGLIYAQGGNS--ELVSLTFTGTIDAPLYKEGKWENGLDSPAPIGEVVSNS 1213 Query: 1202 FVYTTPKKNLNASNYTGGLEQFANDLDTFASSMNDFYGRD-SEDGKH-RMFTYKNLPGHK 1259 FV+T PK NLNAS YTGG+ QFA DLD FA +NDFY RD +G+H R T ++ P ++ Sbjct: 1214 FVFTAPKANLNASGYTGGIAQFAQDLDRFALDLNDFYARDEGVEGQHNRKATSESNPNNR 1273 Query: 1260 HRFTNDVQISIGDAHSGYPVMNSSFSPNSTTLPTTPLNDWLIWHEVGHNAAETPLTVPGA 1319 H F NDV ISIG AHSGYPVMN+SF+ S +L T PLN WL+WHEVGHNAAE P V GA Sbjct: 1274 HHFVNDVAISIGAAHSGYPVMNASFNATSKSLNTAPLNSWLLWHEVGHNAAEAPFNVDGA 1333 Query: 1320 TEVANNVLALYMQDRYLGKMNRVADDITVAPEYLEESNNQAWARGGAGDRLLMYAQLKEW 1379 TEV NN+LALYMQD +LGKM RV DI +APE++ AW GGAG+RL+M+AQLKEW Sbjct: 1334 TEVVNNLLALYMQDCHLGKMARVEQDIRIAPEFVSMERGHAWGAGGAGERLVMFAQLKEW 1393 Query: 1380 AEKNFDIKKWYPDGTPLPEFYSEREGMKGWNLFQLMHRKARGDEVSNDKFGGKNYCAESN 1439 AE F I++WY LP +YS+ +G+KGWNLF+LMHR R + G+N C S Sbjct: 1394 AETEFQIERWY--SGELPTYYSQEDGVKGWNLFKLMHRLTRNADDGVMTLKGENLCQPSG 1451 Query: 1440 GNAADTLMLCASWVAQTDLSEFFKKWNPGANAYQLPGASEMSFEGGVSQSAYNTLASLDL 1499 +D LMLCAS+ AQTDL+EFF+ WNPG+ A+ P + +EGGV+ + + + + Sbjct: 1452 LGKSDQLMLCASYAAQTDLTEFFQTWNPGSKAFIYPNDPKPHYEGGVTSAGVDRVKEQNY 1511 Query: 1500 PKPEQGPETINQVTE 1514 KP + P INQ+++ Sbjct: 1512 LKPNRDPLKINQISQ 1526 >UniRef50_A5F372 Accessory colonization factor AcfD n=25 Tax=Vibrio RepID=ACFD_VIBC3 Length = 1520 Score = 1391 bits (3601), Expect = 0.0, Method: Compositional matrix adjust. Identities = 730/1493 (48%), Positives = 966/1493 (64%), Gaps = 87/1493 (5%) Query: 86 PVPTKTGYLTLGGSQRV-TGATCNGESSDGFTFKPGEDVTCVAGNTTIATFNTQSEAARS 144 P P + +TL G+ + + CN + + F ++V C +IATF +A ++ Sbjct: 52 PDPIISLSMTLDGNLKFDSSLLCNDQDASHFQISQKDNVFCTINGRSIATFTAPFDANKN 111 Query: 145 LRAVEKVSFSLEDAQELAGSDDKKSNAVSLVTSSNSCPANTEQVCLTFSSVIESKRFDS- 203 R + SL A E S ++ N L+ N + +++ L F S +++ F++ Sbjct: 112 GRNTDSEVLSLISADEYRDSPVRQENLQILM--KNMATIHGDKISLVFRSTLDALTFENY 169 Query: 204 LYKQIDLAPEEFKKLVNEEVENNAATDKAPSTHTSPVVPVTTPGTKPDLNASFVSANAEQ 263 L +DL ++F + + E++ N+ DK PSTH + P TPGT +LN+ FVSANAE+ Sbjct: 170 LRHNLDLPKDQFLEAITEKIANDNQVDKQPSTHVPNISPSFTPGTSSNLNSPFVSANAEE 229 Query: 264 FYQYQPTEIILSEGRLVDSQGYGVAGVNYYTNSGRGVTG--------ENGEFSFSWGETI 315 Y PT++I S GRL+DSQG + GV+Y++N+ RG+TG +G F FSWG+ I Sbjct: 230 SLSYIPTDVIPSLGRLLDSQGRVINGVSYFSNNTRGITGVDKTGAILNDGSFEFSWGDII 289 Query: 316 SFGIDTFELGSVRGNKSTIALTELGDEVRGANIDQLIHRYSTTGQNNTRVVPDDVRKVFA 375 SF IDTFELGS R NK+ ++ELG + G N + LIHRY++ ++ ++PD V ++F+ Sbjct: 290 SFSIDTFELGSTRANKTDFYISELGKDNEGKNAEALIHRYASI-DDSKLIIPDKVTQIFS 348 Query: 376 EYPNVINEIINLSLSNGAT---LGEGEQVVNLPNEFIEQFNTGQAKEIDTAICAKT---- 428 YPNVINE+I+LSL NG +G+G+ + +P EF +QF++G A ID +I + Sbjct: 349 LYPNVINEVISLSLPNGDIELDIGDGKTQI-VPGEFFKQFDSGLAALIDQSISPISRFKF 407 Query: 429 -DGCNEARWFSLTTRNVNDGQIQGVINKLWGV-DT----NYKSVSKFHVFHDSTNFYGST 482 D + + + + QIQ +INKLWG DT +K V +FH+FHDSTNFYGST Sbjct: 408 EDSLPKKK----SAIDSESSQIQDIINKLWGATDTVQANGWKKVDRFHIFHDSTNFYGST 463 Query: 483 GNARGQAVVNISNAAFPILMARNDKNYWLAFGEKRAWDKNELAYITEAPSLVEPENVTRD 542 G+AR QA VNI+N+AFP+LMARND NYW+ FG+ +AWD N LA+ITEAPS V P+ V+ D Sbjct: 464 GSARAQAAVNIANSAFPVLMARNDNNYWIDFGKPKAWDSNSLAFITEAPSTVVPDKVSED 523 Query: 543 TATFNLPFISLGQVGEGKLMVIGNPHYNSILRCPNGYSWNGGVNKDGQCTLNSDPDDMKN 602 T+TFNLPFISLG++G+GKLMV+GN YNS+L CPNG+SW GG K+G C+L+SD DDM N Sbjct: 524 TSTFNLPFISLGEIGKGKLMVLGNARYNSVLVCPNGFSW-GGTVKNGTCSLSSDRDDMAN 582 Query: 603 FMENVLRYLSDDKWKPDAKASMTVGTNLDTVYFKRHGQVTGNSAAFDFHPDFAGISVEHL 662 F NV+RYL+ + VGTN+ VYFK GQ G+ A F+ F+ + L Sbjct: 583 FFSNVIRYLTGS-----TSNDVIVGTNIPEVYFKSSGQTMGSKANFELDSRFSK-QTQQL 636 Query: 663 SSYGDLDPQEMPLLILNGFEYVTQVGNDPYAIPLRADTSKPKLTQQDVTDLIAYLNKGGS 722 +S+ DLD +PL+I+N ++Y + N PY IPL AD PKL++ DVTDLI Y+N GGS Sbjct: 637 TSFHDLDVNTIPLIIINAYDYKGKNINSPYDIPLSADVGSPKLSRSDVTDLIDYINNGGS 696 Query: 723 VLIMENVMSNLKEESASGFVRLLDAAGLSMALNKSVV---NNDPQGYPNRVRQQRATGIW 779 VL+ME +++ E + RLLD+AG++ + SVV N G+P+R R QR GIW Sbjct: 697 VLMMETIINTNNSEIS----RLLDSAGIAFGIGNSVVADGNGPSGGHPDRPRSQREHGIW 752 Query: 780 VYERYPAVDG------ALPYTIDSKTGEVKWKYQVENKPDDKPKLEVASWLEDVDG-KQE 832 V ERY AV+ LPY I+S G ++WKY VEN+PDDKPKLEVASW+E G K Sbjct: 753 VIERYAAVEDESSGQQTLPYVINSD-GSIEWKYIVENRPDDKPKLEVASWVESEAGDKLI 811 Query: 833 TRYAFIDEADHKTED------------------SLKAAKEKIFAAFP------GLKECTN 868 T YAFIDE+ H +D SL AK K+ AF EC N Sbjct: 812 THYAFIDESQHWKKDISGKIIYNVAGKPEVDNASLSLAKNKVLDAFKNSSGQRAYSECKN 871 Query: 869 PAYHYEVNCLEYRPGTGVPVTGGMYVPQYTQLSLNADTAKAMVQAADLGTNIQRLYQHEL 928 +HYE+NCLEYRPG +P+TGG+YVP+YT + L A AMV+AA+LGTNI LYQHE Sbjct: 872 SEFHYEINCLEYRPGNSIPITGGLYVPRYTDIKLGESEANAMVKAANLGTNIHALYQHER 931 Query: 929 YFRTNGRKGERLSSVDLERLYQNMSVWLWNDTSYRYEEGKNDELGFKTFTEFLNCYANDA 988 YFRT G+ G RL+SVDL R+YQNMSVWLWND YRY++ ++DELGFK FT++LNCY ++ Sbjct: 932 YFRTKGKSGARLNSVDLNRIYQNMSVWLWNDLDYRYDDKQSDELGFKVFTQYLNCYTSNN 991 Query: 989 YAGGTKCSADLKKSLVDNNMIYGDGS-SKAGMMNPSYPLNYMEKPLTRLMLGRSWWDLNI 1047 G T C +LK L MIY + S S AG M+PSYPLNYMEKPLTRLMLGRS+WDL+I Sbjct: 992 AGGNTTCPEELKDELTQLGMIYDEKSGSYAGQMDPSYPLNYMEKPLTRLMLGRSFWDLDI 1051 Query: 1048 KVDVEKYPGAVSEEGQNVTETISLYSNPTKWFAGNMQSTGLWAPAQKEVTIK-SNANVPV 1106 KVDV KYPG V+ T+ + +N WFAGN Q TG WA A + ++ S PV Sbjct: 1052 KVDVRKYPGEVTTRSGGGDITLDMRNNTAAWFAGNRQPTGQWAEAHQPFSVSVSGETSPV 1111 Query: 1107 TVTVALADDLTGREKHEVALNRPPRVTKTYSL--DASGTVKFKVPYGGLIYIKGNSSTNE 1164 T+T+ALADDLTGREKHE+ L RPPR++K++ + D+ F VPYGGLIY +G +S + Sbjct: 1112 TITIALADDLTGREKHELGLKRPPRMSKSFVIGGDSPKMQTFTVPYGGLIYAQGGNS--Q 1169 Query: 1165 SASFTFTGVVKAPFYKDGAWKNDLNSPAPLGELESDAFVYTTPKKNLNASNYTGGLEQFA 1224 TF+G + AP Y DG W+N L S AP+GE+ SD F++T PK NLNA Y GG+EQFA Sbjct: 1170 QVKLTFSGTIDAPLYIDGKWRNPLLSGAPIGEVVSDTFIFTAPKANLNADGYLGGIEQFA 1229 Query: 1225 NDLDTFASSMNDFYGRD--SEDGKHRMFTYKNLPGHKHRFTNDVQISIGDAHSGYPVMNS 1282 DLD F++ +NDFY RD ++ K+R T K++P ++H F NDV IS+G AHSGYPVMN Sbjct: 1230 KDLDQFSADLNDFYARDEGADGDKNRKATDKSMPNNRHHFVNDVAISVGAAHSGYPVMND 1289 Query: 1283 SFSPNSTTLPTTPLNDWLIWHEVGHNAAETPLTVPGATEVANNVLALYMQDRYLGKMNRV 1342 SF +S +L T PLN WL+WHEVGHN+AE P V GATEV NN+LALYMQDR+ GKM+RV Sbjct: 1290 SFITSSRSLNTMPLNSWLLWHEVGHNSAEAPFNVDGATEVVNNLLALYMQDRHQGKMSRV 1349 Query: 1343 ADDITVAPEYLEESNNQAWARGGAGDRLLMYAQLKEWAEKNFDIKKWYPDGTPLPEFYSE 1402 DI A +++ + AW GGAG+RL+M+AQLKEWAE FDI WY D LP FY E Sbjct: 1350 EQDIRYAFDFVNAEHGHAWGAGGAGERLVMFAQLKEWAETEFDINDWYND--KLPGFYIE 1407 Query: 1403 REGMKGWNLFQLMHRKARGDEVSNDKFGGKNYCAESNGNAADTLMLCASWVAQTDLSEFF 1462 G+KGWNLF+LMHR R + G+N C S +D LMLCAS+ AQTDLSEFF Sbjct: 1408 ESGIKGWNLFKLMHRLMRNENDDQINMKGENQCKISGIGKSDLLMLCASYAAQTDLSEFF 1467 Query: 1463 KKWNPGANAYQLPGASEMSFEGGVSQSAYNTLASLDLPKPEQGPETINQVTEH 1515 K WNPG+ A+ P + +EGG++ S + SL L P++ P +IN VT+H Sbjct: 1468 KAWNPGSKAFLYPDDPQPYYEGGITPSGIQRVKSLKLNLPQKNPLSINSVTQH 1520 >UniRef50_A6AXB7 AcfD n=3 Tax=Gammaproteobacteria RepID=A6AXB7_VIBPA Length = 1366 Score = 1360 bits (3521), Expect = 0.0, Method: Compositional matrix adjust. Identities = 724/1360 (53%), Positives = 920/1360 (67%), Gaps = 66/1360 (4%) Query: 204 LYKQIDLAPEEFKKLVNEEVENNAATDKAPSTHTSPVVPVTTPGTKPDLNASFVSANAEQ 263 L Q+D+ + FKKL+ E + N++ TDK PSTHT V P TPG DL+ +FVSANAE+ Sbjct: 26 LNNQLDVEIDTFKKLLQERLSNDSQTDKQPSTHTPEVEPAVTPGASSDLSQAFVSANAEK 85 Query: 264 FYQYQPTEIILSEGRLVDSQGYGVAGVNYYTNSGRGVTGE-------NGEFSFSWGETIS 316 +Y+P E+IL+ G LVDS G V G+ Y+T+ GRG+TG +G FSWG+TI+ Sbjct: 86 SLEYKPKELILTTGYLVDSFGRSVNGIAYFTSKGRGLTGYKDGRLIGDGSLEFSWGDTIN 145 Query: 317 FGIDTFELGSVRGNKSTIALTELGDEVRGANIDQLIHRYSTTGQNNTRV-VPDDVRKVFA 375 FGIDTFELGS RGNK+TI L +LG G NI+ L+ R+S +N+ V V D V +VF+ Sbjct: 146 FGIDTFELGSTRGNKNTIKLQDLGSGNEGKNIESLVMRFSE--ENDQSVFVTDKVTEVFS 203 Query: 376 EYPNVINEIINLSLSN---GATLGEGEQVVNLPNEFIEQFNTGQAKEIDTAICAKTDGCN 432 +YPNVINE I+LSLSN LG G V + EF +QF +G A++ID + + Sbjct: 204 KYPNVINEAISLSLSNEDIQLDLGNGNTEV-VKGEFEKQFESGLAEDIDKELGRQKLAFG 262 Query: 433 EARWFSLTTRNVN-DGQ-IQGVINKLWGVDTN-----YKSVSKFHVFHDSTNFYGSTGNA 485 E + V+ D Q +Q + +LWG +K V +FH+FHDSTNFYGSTG+A Sbjct: 263 EQYREPKQIKAVDSDAQNVQRDVERLWGATQQAQREGWKPVERFHIFHDSTNFYGSTGSA 322 Query: 486 RGQAVVNISNAAFPILMARNDKNYWLAFGEKRAWDKNELAYITEAPSLVEPENVTRDTAT 545 R QA VNISN AFP++MARNDKNYW+ F + +AWD+N LAYITEAPS V+P+ V AT Sbjct: 323 RAQAAVNISNKAFPVVMARNDKNYWIDFDKPQAWDENGLAYITEAPSKVKPKKVDASNAT 382 Query: 546 FNLPFISLGQVGEGKLMVIGNPHYNSILRCPNGYSWNGGVNKDGQCTLNSDPDDMKNFME 605 FNLPFIS+G +G+GK+MV+GN YNS+L CPNG+SWNGGVN GQCT N+D DDM NF Sbjct: 383 FNLPFISIGDLGKGKVMVMGNARYNSVLVCPNGFSWNGGVNDQGQCTGNTDSDDMANFFN 442 Query: 606 NVLRYLSDDKWKPDAKASMTVGTNLDTVYFKRHGQVTGNSAAFDFHPDFAGISVEHLSSY 665 N +YL+ K + +V TN+ VYFKR GQV G+ A++ FA + L S+ Sbjct: 443 NAFQYLTGKK-----AGTFSVATNIPHVYFKRGGQVLGSKASYLIDKRFAQ-DTQQLDSF 496 Query: 666 GDLDPQEMPLLILNGFEYV-TQVGNDPYAIPLRADTSKPKLTQQDVTDLIAYLNKGGSVL 724 LDP ++PL+ILN + Y+ Q G Y +P++A+ PKLTQQD++DLIAY+ GGSVL Sbjct: 497 SGLDPNDIPLVILNAYSYLGEQGGLGAYDLPMQANLDAPKLTQQDISDLIAYVEDGGSVL 556 Query: 725 IMENVMSNLKEESASGFV-RLLDAAGLSMALNKSVVN--NDPQ-GYPNRVRQQRATGIWV 780 +ME + K + SG V RLLDAAG++ + +SV N P GYP+RVR QR GIWV Sbjct: 557 MMETI----KGQKDSGVVSRLLDAAGIAFGIGESVARDGNGPNGGYPDRVRSQRQQGIWV 612 Query: 781 YERYPAVDG------ALPYTIDSKTGEVKWKYQVENKPDDKPKLEVASWLE-DVDGKQET 833 ERY A D +LPY I + G V+WKY +EN+PDDKPKLEVA W+E + G + Sbjct: 613 LERYAAEDSSNGEGPSLPYVI-KEDGSVEWKYIIENRPDDKPKLEVAKWIEINEQGDSKV 671 Query: 834 RYAFIDEADHKTE-----DSLKAAKEKIFAAFP------GLKECTNPAYHYEVNCLEYRP 882 + AFIDEA+ + ++L AK +I AF +ECTN YHYEVNCLEYRP Sbjct: 672 QVAFIDEANFYQDGTFDNEALTVAKNRILDAFKDNSGKRAYEECTNNEYHYEVNCLEYRP 731 Query: 883 GTGVPVTGGMYVPQYTQLSLNADTAKAMVQAADLGTNIQRLYQHELYFRTNGRKGERLSS 942 G +P++GG+YVP YT++ L AKAMV+AA++G+NI+ LYQHE YFRT G++G RL+S Sbjct: 732 GNKIPISGGLYVPNYTEMKLGEHEAKAMVKAANIGSNIEALYQHERYFRTKGKQGFRLNS 791 Query: 943 VDLERLYQNMSVWLWNDTSYRYEEGKNDELGFKTFTEFLNCYANDAYAGGTKCSADLKKS 1002 VD+ R+YQN+SVWLWND Y Y++ KNDELGFK FTEFLNCY +D G T C LK Sbjct: 792 VDMSRMYQNLSVWLWNDLRYSYDQEKNDELGFKRFTEFLNCYTDDKAGGNTICPESLKLE 851 Query: 1003 LVDNNMIYGDGSSKAGMMNPSYPLNYMEKPLTRLMLGRSWWDLNIKVDVEKYPGAVSEEG 1062 L +MIY +G AG MNPSYPLNYMEKPLTRLMLGRS+WDL++KVD +PG S G Sbjct: 852 LQKMDMIYAEGEY-AGYMNPSYPLNYMEKPLTRLMLGRSFWDLDVKVDTRPFPGVASSSG 910 Query: 1063 QN-VTETISLYSNPTKWFAGNMQSTGLWAPAQKEVTIK-SNANVPVTVTVALADDLTGRE 1120 N T T+ + +N T WFAG+ Q+TG WA A T+ S A PVT+TVALADDLT RE Sbjct: 911 SNGGTITLDMSNNVTAWFAGSRQATGQWAQAHVPFTVSVSGAKAPVTITVALADDLTARE 970 Query: 1121 KHEVALNRPPRVTKTYSL--DASGTVKFKVPYGGLIYIKGNSSTNESASFTFTGVVKAPF 1178 KHEV L RPPR+TK++ + + + + VPYGGLIY +G +S ES TFTG + AP Sbjct: 971 KHEVGLKRPPRMTKSFIIGGNKATSETITVPYGGLIYAQGGNS--ESVQLTFTGTLAAPL 1028 Query: 1179 YKDGAWKNDLNSPAPLGELESDAFVYTTPKKNLNASNYTGGLEQFANDLDTFASSMNDFY 1238 + DG+WKNDL+SPAP+GE+ S +F+YT PK NL A NY GG+EQFA DLD FAS +NDFY Sbjct: 1029 FIDGSWKNDLDSPAPVGEVVSKSFIYTGPKANLRAENYPGGIEQFAKDLDQFASDLNDFY 1088 Query: 1239 GRDSE-DGK-HRMFTYKNLPGHKHRFTNDVQISIGDAHSGYPVMNSSFSPNSTTLPTTPL 1296 RD DG+ +R T P +H F NDV ISIG AHSGYPVMNSS++ NS+ + TTPL Sbjct: 1089 ARDEGLDGQANRKVTGDENPNSRHHFVNDVAISIGAAHSGYPVMNSSYNLNSSNINTTPL 1148 Query: 1297 NDWLIWHEVGHNAAETPLTVPGATEVANNVLALYMQDRYLGKMNRVADDITVAPEYLEES 1356 NDWL+WHEVGHNAAE P V GATEV NN+LALYMQD ++GKM RV DI VAPE++ Sbjct: 1149 NDWLLWHEVGHNAAEAPFVVEGATEVVNNLLALYMQDLHIGKMTRVEQDIQVAPEFVRTE 1208 Query: 1357 NNQAWARGGAGDRLLMYAQLKEWAEKNFDIKKWYPDGTPLPEFYSEREGMKGWNLFQLMH 1416 + AWA GGA +RL+M+AQLKEWAE FDI+ WY LP +YSE EG+KGWNLF+LMH Sbjct: 1209 HGHAWAAGGAAERLVMFAQLKEWAESEFDIRDWYQ--GELPSYYSEVEGVKGWNLFKLMH 1266 Query: 1417 RKARGDEVSNDKFGGKNYCAESNGNAADTLMLCASWVAQTDLSEFFKKWNPGANAYQLPG 1476 R R + N C + +D LM+CAS+ AQTDLS+FF WNPGA ++ PG Sbjct: 1267 RLTRNESDGIFYLKSTNACRWQGLSKSDQLMVCASYAAQTDLSDFFLAWNPGARSFIYPG 1326 Query: 1477 ASEMSFEGGVSQSAYNTLASLDLPKPEQGPETINQVTEHK 1516 +SE S+EGGV+Q + + L L KP PE IN +T K Sbjct: 1327 SSEPSYEGGVTQKGLDVVRKLGLKKPSLDPEEINTITVRK 1366 >UniRef50_D0YX39 Accessory colonization factor AcfD n=1 Tax=Photobacterium damselae subsp. damselae CIP 102761 RepID=D0YX39_LISDA Length = 1509 Score = 1009 bits (2609), Expect = 0.0, Method: Compositional matrix adjust. Identities = 605/1570 (38%), Positives = 889/1570 (56%), Gaps = 133/1570 (8%) Query: 11 LLAAILSATLLAGCDGGGSGSSSDTP--PVDSGTGSLPE---VKPDPTPNPEPTPEPTPD 65 LL+ ++A LL GC+G + ++D P P+D PE + +P+ EP P Sbjct: 5 LLSLAITAALLTGCNGDSNSQNTDLPLTPLDPSIPVKPEQPLIPLEPSIPVEPEIPEPPV 64 Query: 66 PEPTPEPIPDPEPTPEPEPEP-VPTKTGYLTLGGSQRVTGATCNGES---SDGFTFKPGE 121 PEP +P+ P +P + G L L G Q V +CNG+ + FTFK G+ Sbjct: 65 EPELPEPPTEPDTIPPSVLDPAIKIHKGGLQLSGKQLVGDISCNGQELALNGQFTFKDGD 124 Query: 122 DVTCVAGNTTIATFNTQSEAARSLR--AVEKVSFSLEDAQELAGSDDKKSNAVSLVTSSN 179 D+ C G +I F+ Q+ RSL A + + F +E D N V ++ + Sbjct: 125 DIRCNFG--SIELFSQQAPQPRSLHSDAQKVIHFDIEHFLHDGAVD----NTVQVLNKID 178 Query: 180 SCPANTEQVCLTFSSVIESKRFDSLYKQIDLAPEEFKKLVNEEVEN-NAATDKAPSTHTS 238 +C ++ +VCL VI S +LY D E K+ +N E DKAPS+H Sbjct: 179 TCKSDNNKVCL---DVINSYDIATLYNSTDT--EAVKEFINPSSEQVTEEVDKAPSSHVD 233 Query: 239 PVV-PVTTPGTKPDLNASFVSANAEQFYQYQPT----EIILSEGRLVDSQGYGVAGVNYY 293 + P TPGT DLN+ FVSA+AE YQY+P+ EI ++ +L+D++G +AGV+YY Sbjct: 234 VTLKPEVTPGTSTDLNSQFVSASAESAYQYKPSVDNQEITVA--KLLDAKGLPIAGVHYY 291 Query: 294 TNSGRGVTGENGEFSFSWGETISFGIDTFELGSVRGNKSTIALTELGD-EVRGANIDQLI 352 T S RGVT G+ + WGE I+FGIDTF GSV+GN+ + LT++ + + NID LI Sbjct: 292 TPSSRGVTDSQGQIEYIWGEEITFGIDTFTFGSVKGNQLSYQLTDVTENSLVKQNIDSLI 351 Query: 353 HRYSTTGQNNTRVVPDDVRKVFAEYPNVINEIINLSLSNGATLGEGEQVVNLPNEFIEQF 412 RYS ++ R V ++FA YPN INEIIN+SL NGA + EG ++PNEF QF Sbjct: 352 ERYSKNLHDH-REFDTKVHQIFALYPNAINEIINISLPNGAKI-EGTNF-HVPNEFEYQF 408 Query: 413 NTGQAKEIDTAICA-KTDGCNEARWFSLTTRNVNDGQIQGVINKLWGVDTNYKSVSKFHV 471 ++G AKEID + K+ + + N+N ++++ Y V +FHV Sbjct: 409 DSGLAKEIDEQLKQPKSLWAKQTKIVKAHGSNIN-----ATLHQI------YSGVQQFHV 457 Query: 472 FHDSTNFYGSTGNARGQAVVNISNAAFPILMARNDKNYWLAFGEKRAWDKNELAYITEAP 531 FHD ++YG++G AR +NISN AFPILM R D NYWL FG+++AW + +I +A Sbjct: 458 FHDVGSYYGASGFARLMRNLNISNTAFPILMPRMDSNYWLPFGKEQAWTREFKPHIVDAT 517 Query: 532 S--------LVEPENVTRDTATFNLPFISLGQVGEGKLMVIGNPHYNSILRCPNGYSWNG 583 + ++ P V+ D ATFNLP IS GQ+G+G ++ +G+ HY +L CP+ Y N Sbjct: 518 TIDADSKVTMLRPPKVSEDNATFNLPGISTGQIGKGSIVFMGSGHYPIVLSCPDSYWGNK 577 Query: 584 GVN-KDGQCT--LNSDPDD-----------MKNFMENVLRYLSDDKWKPDAKASMTVGTN 629 ++ KD QCT +N++ D M+ F +N+ +L + + + ++ V TN Sbjct: 578 SLSIKDQQCTYSINNNIVDPTTDRQFDNGSMQRFFKNLFTWL--EPSYQNGQNAINVATN 635 Query: 630 LDTVYFKRHGQVTGNSAAFDFHPDFAGISVEHLSS--YGDLDPQEMPLLILNGFEYVTQV 687 ++ HG + F +S+E ++S + ++P+ P+L+L +E + Sbjct: 636 IELAPKFDHGHQSWLPKYEFFINKSYNVSLERITSGNFSGINPETTPILLLQSYE-IGAF 694 Query: 688 GNDPYAIPLRADTSKPKLTQQDVTDLIAYLNKGGSVLIMENVMSNLKEESASGFVRLLDA 747 G D +D S+PKLT DV DLI Y+N GG ++ + + ++ + +L D Sbjct: 695 G-DGTTTKNISDLSQPKLTVNDVNDLIQYVNAGGHIVFFDAI----EQVNPEPIAKLADM 749 Query: 748 AGLSMALNKSVVNNDPQGYPNRVRQQRATGI------------WVYERYPAVDGALPYTI 795 AG+S+ Q Y +G+ VYER+ ++ + Sbjct: 750 AGVSLGGANVAQAKTTQAYCGSSYYCHGSGVKPNVHAVTEHDLVVYERFETLNDDASKIV 809 Query: 796 DSKTGEVKWKYQVENKPDDKPKLEVASWLE-----DVDGKQETRYAFIDEADHKTEDSLK 850 + G + W P+ PKLEVA + +DG + R+AF K+ED + Sbjct: 810 INSDGTITWP-----APNKMPKLEVAKYTTPYMPLTIDGIPQERFAFFQV---KSEDEKR 861 Query: 851 AAKEKIFAAFPGLKECTNPAYHYEVNCLEYRPGTGVPVTGGMYVPQYTQLSLNADTAKAM 910 AA ++ AFPG+K C + Y +EVNC+E+R G G+P G Y + S++ +M Sbjct: 862 AAIHELQVAFPGVKVCQDD-YEFEVNCIEFRKGHGIPSFGNYQRANYERYSISPKVIDSM 920 Query: 911 VQAADLGTNIQRLYQHELYFRTNGRKGERLSSVDLERLYQNMSVWLWNDTSYRYEEGKND 970 V+AA+LGTN+ +LYQHELY+RT G +G RLS +L + Y N SVW+WND YRY+ D Sbjct: 921 VEAANLGTNLTKLYQHELYYRTRGEQGHRLSLTELNQTYDNTSVWMWNDEPYRYDNSVED 980 Query: 971 ELGFKTFTEFLNCYANDAYAGGTKCSADLKKSLVDNNMIYGDGSSKAGMMNPSYPLNYME 1030 ELGFKT ++LNCY N+ + GG +CS D +++L+ ++ K G +NPSYPLNY E Sbjct: 981 ELGFKTAVDYLNCYTNNQHQGGIECSVDKQQALIKYGFLH-----KNGELNPSYPLNYQE 1035 Query: 1031 KPLTRLMLGRSWWDLNIKVDVEKYPGAVSEEGQNVTETISLYSNPTKWFAGNMQSTGLWA 1090 KPLTR+MLGRS+WDL+IKVD +YPG + T T+S +N NMQSTGLWA Sbjct: 1036 KPLTRIMLGRSYWDLDIKVDTTQYPGRPAFTNGTQTVTVSTLNNAVTGTVNNMQSTGLWA 1095 Query: 1091 PAQKEVTIKSNANVPVTVTVALADDLTGREKHEVALNRPPRVTKTYSLDASGTVKFKVPY 1150 ++V + + VP T+TV+L DDLTG E+HEVALNRPPRV K+++ D S + F+VPY Sbjct: 1096 HQHQQVQV--SGGVPATITVSLIDDLTGLEQHEVALNRPPRVQKSFNYDGS-NLSFRVPY 1152 Query: 1151 GGLIYIKGNSSTNESASFTFTGVVKAPFYKDGAWKNDLNSPAPLGELESDAFVYTTPKKN 1210 GGLIYIK +S+ +A F+F+GV A F+KD W S PL E+++ +YTTP +N Sbjct: 1153 GGLIYIKPHSNIEGTAEFSFSGVATAAFWKDNQWMYGKASDVPLAEIDTGHVIYTTPVEN 1212 Query: 1211 LNASNYTGGLEQFANDLDTFASSMNDFYGRDS--EDGKHRMFTYKNLPGHKHRFTNDVQI 1268 + + ++ F ++++ FA+S +DFYGRD GKHR FTY++L H+HRF ND+QI Sbjct: 1213 IEQQD----IQIFVDEMNKFANSASDFYGRDEVVSVGKHRRFTYQDLADHRHRFVNDIQI 1268 Query: 1269 SIGDAHSGYPVMNSSFSPNSTTLPTTPLNDWLIWHEVGHNAAETPLTVPGATEVANNVLA 1328 SIG AHSGYPV +++++ +PTTP NDWL+WHE+GHN A P ++ G TEV NN+LA Sbjct: 1269 SIGAAHSGYPVQSTTYN-KGNKIPTTPTNDWLLWHEIGHNLASAPFSMTGGTEVTNNILA 1327 Query: 1329 LYMQDRYL---GKMNRVADDITVAPEYLEESNNQAWARGGAGDRLLMYAQLKEWAEKNFD 1385 LYMQ++ KM+RV DI P N W+ G AG RL+M+AQLK WAE +F Sbjct: 1328 LYMQEQRPEPNNKMSRVESDIQKMPLLFSRYNKHVWSNGDAGIRLVMFAQLKLWAENHFR 1387 Query: 1386 IKKWYPDGTPLPEFYSEREGMKGWNLFQLMHRKARGDEVSNDKFGGKNYCAESNG--NAA 1443 I WY + L + + +GWN+F+LMHRK+RGD + + G NYC+ S+ + Sbjct: 1388 IDNWYSEKDLLTIYNQD----QGWNMFKLMHRKSRGDSIGDQ---GINYCSSSDTGLSGG 1440 Query: 1444 DTLMLCASWVAQTDLSEFFKKWNPGANAYQLPGASEMSFEGGVSQSAYNTLASL-DLPKP 1502 D LM+C+S+V+ DLS F+ WNP + LP ++ + GG++ Y L + +L +P Sbjct: 1441 DLLMVCSSYVSGFDLSNFYTLWNPSESMNILPNGDKL-YSGGITSKGYQVLNQIPNLKQP 1499 Query: 1503 EQGPETINQV 1512 E PE+I + Sbjct: 1500 ETSPESITHL 1509 >UniRef50_A6ALV4 AcfD n=5 Tax=Vibrio RepID=A6ALV4_VIBHA Length = 1466 Score = 969 bits (2505), Expect = 0.0, Method: Compositional matrix adjust. Identities = 594/1561 (38%), Positives = 857/1561 (54%), Gaps = 165/1561 (10%) Query: 8 KKSLLAAILSATLLAGCDGGGSGSSSDTPPVDSGTGSLPEVKPDPTPNPEPTPEPTPDPE 67 K+SLL+ IL A+LL GC G + S+ T P P V+PD P+ PD Sbjct: 2 KRSLLS-ILVASLLFGCGGDENNHSTSTTP--------PTVEPDLPPD-------QPDI- 44 Query: 68 PTPEPIPDPEPTPEPEPEPVPTKTGYLTLGGSQRVTGATCNGE---SSDGFTFKPGE-DV 123 V T G L + G Q C+G+ S FT+ E + Sbjct: 45 ------------------GVTTYQGKLFINGKQLTGDIQCDGQDNSESGYFTYAASEGNF 86 Query: 124 TCVAGNTTIATFNTQSEAARSLRAVEKVSFSLEDAQELAGSDDKKSNAVSLVTSSNSCPA 183 +C G ++ F+ Q A + D +++ G+ +NA L+ ++CP+ Sbjct: 87 SCEFGAVSLGEFSYQIPAQTRTGSQPAELTQNYDLKDVLGT--HANNAAKLLHKIDTCPS 144 Query: 184 NTEQVCLTFSSVIESKRFDSLYKQIDLAPEEFKKLVNEEVENNAATDKAPSTHTSP-VVP 242 QVCL I S LY+ D A +N V N + PS H P + P Sbjct: 145 QDTQVCL---DEINSYDIQDLYESDDQAA--IDAFLNPSVVNEG---EQPSAHVDPELQP 196 Query: 243 VTTPGTKPDLNASFVSANAEQFYQYQPTEI--ILSEGRLVDSQGYGVAGVNYYTNSGRGV 300 TPG +L SFVSANAE Y+Y+P+ L + RL DSQG +AG+ +++ S RG+ Sbjct: 197 EVTPGASNNLTGSFVSANAEAAYEYKPSAANKPLIKSRLTDSQGNALAGIEFFSQSARGI 256 Query: 301 TGENGEFSFSWGETISFGIDTFELGSVRGNKSTIALTELGDE-VRGANIDQLIHRYSTTG 359 T NGEF + WGE + FGIDTF LG V+GN+ + L +L D + N+D +HRY + Sbjct: 257 TDANGEFEYLWGENLIFGIDTFTLGQVKGNQVSYQLADLSDNPLVKQNLDAFVHRYGLSS 316 Query: 360 QNNTRVVPDDVRKVFAEYPNVINEIINLSLSNGATLGEGEQVVNLPNEFIEQFNTGQAKE 419 NN + D+VR+VFA+YPNVINE+INLSL NGA + EG PNEF QF+ G + Sbjct: 317 GNNIE-IGDNVRQVFAQYPNVINELINLSLPNGAKI-EGTNFTT-PNEFEAQFSQGLTQI 373 Query: 420 IDTAICAKTDGCNEARWFSLTTRNVNDGQIQGVINKLWGVDTNYKSVSKFHVFHDSTNFY 479 ID + +W TT + + G + Y V H+FHD+ + Sbjct: 374 IDGQLKQT------PQWSGFTTPMLRTVRASGSNYVTQSLHQIYAGVDSVHIFHDNHG-W 426 Query: 480 GSTGNARGQAVVNISNAAFPILMARNDKNYWLAFGEKRAWD----KNELAYITEAPSLVE 535 G +G R N++N AFP+LM RND +YWL FGE+ AW K++ AYI +A ++ E Sbjct: 427 GGSGYTRAMRNFNLTNEAFPVLMPRNDNSYWLGFGEEAAWTRGSGKDQKAYIVDATTIDE 486 Query: 536 --------PENVTRDTATFNLPFISLGQVGEGKLMVIGNPHYNSILRCPNGYSWNG---G 584 PE +++ TATFNLP ++ G +G GK++ +GN Y SI CP Y W G G Sbjct: 487 NSTVVMQRPEVISKQTATFNLPTMTAGMIGSGKVVFLGNAMYTSIFSCPENY-WAGADLG 545 Query: 585 VNKDGQCTLNSDPDD--------------MKNFMENVLRYLSDDKWKPDA-KASMTVGTN 629 ++ + Q S P + M+ N++ +L P+A + S+ + TN Sbjct: 546 IDSEVQQCRYSTPHNQEAQDADTRTDNGSMQVMFGNLIDWLV-----PNATQESVAIATN 600 Query: 630 LDTVYFKRHGQVTGNSAAFDFHPDFAGISVEHLSS--YGDLDPQEMPLLILNGFEYVTQV 687 ++ + R + G F +P + ++ LSS + LDP PLL+L +E T Sbjct: 601 INKGHAFRWDRKEGQIYDFFVNPSYKLGEMDVLSSGQFDSLDPTSTPLLLLQSYEIKTD- 659 Query: 688 GNDPYAIPLRADTSKPKLTQQDVTDLIAYLNKGGSVLIMENVMSNLKEESASGFVRLLDA 747 G D ++ +D ++PKL DVT LI Y+N GGS++ + L+E + RL DA Sbjct: 660 GYDTKSV--VSDINQPKLDADDVTALIEYVNNGGSIIFFD----ALEESNPEPIARLADA 713 Query: 748 AGLSMA---LNKSVVNNDPQGY-------PN--RVRQQRATGIWVYERYPAVDGALPYTI 795 AG+S+ + K+ + Y PN + + VYERY A I Sbjct: 714 AGVSVGGANVAKTFQSLCTDSYWCHSTSGPNVPNLHTVAEYDLVVYERY-----ADTTKI 768 Query: 796 D-SKTGEVKWKYQVENKPDDKPKLEVASWLEDVDGKQETRYAFIDEADH--KTEDSLKAA 852 + + G V W + D P LE+ + +DG++ RYAF H K+E +AA Sbjct: 769 EINDNGTVTWPGNI-----DMPTLEIPLYKASIDGQEHQRYAF-----HMVKSEQEKQAA 818 Query: 853 KEKIFAAFPGLKECTNPAYHYEVNCLEYRPGTGVPVTGGMYVPQYTQLSLNADTAKAMVQ 912 ++ FPG+ C + Y YEVNC+E R G G+P G + P +T+ ++ + +MV+ Sbjct: 819 VAELQREFPGVPVCKDD-YQYEVNCIEVREGHGIPSRGNHHRPDFTRYEMSPEVVDSMVK 877 Query: 913 AADLGTNIQRLYQHELYFRTNGRKGERLSSVDLERLYQNMSVWLWNDTSYRYEEGKNDEL 972 AA+LG NI RL HELY+R+ G GERLS +L Y N+SVWLWND Y + DEL Sbjct: 878 AANLGANIDRLLSHELYYRSKGEIGERLSQAELTSTYDNLSVWLWNDEQYEFNPNVQDEL 937 Query: 973 GFKTFTEFLNCYANDAYAGGTKCSADLKKSLVDNNMIYGDGSSKAGMMNPSYPLNYMEKP 1032 GF+ E LNCY ++A+ GG C + ++ L +MI +++G +NPSYPLN+MEKP Sbjct: 938 GFERAVEMLNCYTDNAHQGGNVCGQETREQLAKWSMI-----TESGELNPSYPLNWMEKP 992 Query: 1033 LTRLMLGRSWWDLNIKVDVEKYPGAVSEEGQNVTETISLYSNPTKWFAGNMQSTGLWAPA 1092 LTR+MLGRS+WDL+I VD YPG S+ G + I + AG+MQSTGLWAP Sbjct: 993 LTRMMLGRSYWDLDISVDTTSYPGRPSQSGSAASVAIHTDNKTVIGTAGSMQSTGLWAPQ 1052 Query: 1093 QKEVTIKSNANVPVTVTVALADDLTGREKHEVALNRPPRVTKTYSLDASGTVKFKVPYGG 1152 +EVTI + V ++ VAL DDLTGR HE++L RPPRV KT+ D S ++ FKVPYGG Sbjct: 1053 LEEVTI--SGGVKASINVALVDDLTGRANHELSLKRPPRVQKTFQYDGS-SLSFKVPYGG 1109 Query: 1153 LIYIKG-NSSTNESASFTFTGVVKAPFYKDGAWKNDLNSPAPLGELESDAFVYTTPKKNL 1211 LIYI+ + + +F FTGV++A ++K+G+W N +N+ PL E++S F+YTTP N+ Sbjct: 1110 LIYIQPLEVDSRDVVTFNFTGVLRASWWKNGSWLNPINTDVPLAEIDSGHFIYTTPTNNV 1169 Query: 1212 NASNYTGGLEQFANDLDTFASSMNDFYGRDS--EDGKHRMFTYKNLPGHKHRFTNDVQIS 1269 ++ + +F ++L+ FA+ +DFYGRD E G+HR FTY L ++HRF NDVQIS Sbjct: 1170 QDTD----VPKFVDELNAFANHASDFYGRDQVIEQGQHRRFTYDALLANRHRFVNDVQIS 1225 Query: 1270 IGDAHSGYPVMNSSFSPNSTTLPTTPLNDWLIWHEVGHNAAETPLTVPGATEVANNVLAL 1329 IG AHSGYPV ++S+ P T +PT P+NDWL+WHEVGHN A P + G+TEV NN+LAL Sbjct: 1226 IGAAHSGYPVQSNSYWPTWTVIPTNPINDWLLWHEVGHNLASAPFMMAGSTEVTNNILAL 1285 Query: 1330 YMQDRYLGK--MNRVADDITVAPEYLEESNNQAWARGGAGDRLLMYAQLKEWAEKNFDIK 1387 YMQ++ K M+R+A D++ +P +L+ AW+ G RL M+ QLK WAE +F+I Sbjct: 1286 YMQEQREEKPYMDRIASDLSKSPLWLDRFEGHAWSEADVGMRLAMFGQLKLWAEDHFNID 1345 Query: 1388 KWYPDGTPLPEFYSEREGMKGWNLFQLMHRKARGDEVSNDKFGGKNYCA--ESNGNAADT 1445 WY + P + E + GWN F+L HRKARGD +S+ G NYC+ S + D Sbjct: 1346 DWYSNQAEKPSIFGEDQ---GWNFFKLAHRKARGDSISDQ---GINYCSTQSSQLSQGDL 1399 Query: 1446 LMLCASWVAQTDLSEFFKKWNPGANAYQLPGASEMSFEGGVSQSAYNTLASLDLPKPEQG 1505 +M C S++ DL+++F+ WNP LP + + + GG++ + +N +A++ LPKPE+ Sbjct: 1400 MMACTSYLTGYDLTDYFRMWNPSETKANLPNGT-VDYSGGLTPAGFNAVAAMGLPKPEKS 1458 Query: 1506 P 1506 P Sbjct: 1459 P 1459 >UniRef50_A8H5W9 Inner membrane lipoprotein n=1 Tax=Shewanella pealeana ATCC 700345 RepID=A8H5W9_SHEPA Length = 1426 Score = 964 bits (2491), Expect = 0.0, Method: Compositional matrix adjust. Identities = 591/1545 (38%), Positives = 858/1545 (55%), Gaps = 162/1545 (10%) Query: 9 KSLLAAILSATLLAGCDGGGSGSSSDTPPVDSGTGSLPEVKPDPTPNPEPTPEPTPDPEP 68 K L+ A++ + LLAGC G +D P TP EP+ PT Sbjct: 2 KKLILAVVISNLLAGC-----GDYTDAPS---------------TP-VEPSIPPT----- 35 Query: 69 TPEPIPDPEPTPEPEPEPVPTKTGYLTLGGSQRVTGATCNGES---SDGFTFKPGEDVTC 125 + IP + T G L L G + +CNG++ FTFK G++V+C Sbjct: 36 --DLIPAKK-----------TYQGSLLLSGKKLSGHISCNGQALGHGGSFTFKDGDNVSC 82 Query: 126 VAGNTTIATFNTQSEAARSLRAVEKVSFSLEDAQELAGSDDKKSNAVSLVTSSNSCPANT 185 G+ + + + + ++ ++D A S ++A +++ ++CPA Sbjct: 83 TYGSLELLNKDIPLPDGWTRDSHNAMALEIKDDWAHAIS---VTDAAKVMSKVSTCPALA 139 Query: 186 EQVCLTFSSVIESKRFDSLYKQIDLAPEEFKKLVNE-EVENNAATDKAPSTHT-SPVVPV 243 +++CL I+S L+ + A + +N E DKAPS+H S + P Sbjct: 140 DEICL---DEIDSFDVSPLFSNGNAA--DINAFLNPPAAEETDEIDKAPSSHVDSSLTPE 194 Query: 244 TTPGTKPDLNASFVSANAEQFYQYQPTE--IILSEGRLVDSQGYGVAGVNYYTNSGRGVT 301 + GTKPD+NA FVSA+AE Y Y+P+E + SE L D+QG +AGVNYYT S RG+T Sbjct: 195 VSAGTKPDINADFVSASAEDAYTYKPSEDARVESESVLTDNQGKPIAGVNYYTKSSRGIT 254 Query: 302 GENGEFSFSWGETISFGIDTFELGSVRGNKSTIALTELGD-EVRGANIDQLIHRYSTTGQ 360 +G S+ WGETI+FG+DTF SV+GN+ L++ + E+ NI LI RY+T Sbjct: 255 DASGIVSYVWGETITFGLDTFTFSSVKGNQIEYKLSDGSENEIVKQNISALIERYATHTT 314 Query: 361 NNTRVVPDDVRKVFAEYPNVINEIINLSLSNGATLGEGEQVVNLPNEFIEQFNTGQAKEI 420 ++ ++V +VF +YPNVINEIINL+L NGA + V PNEF QFNTG A I Sbjct: 315 DSVSF-DENVHRVFGQYPNVINEIINLNLPNGAEIESSGYFV--PNEFNAQFNTGLALII 371 Query: 421 DTAICAKTDGCNEARWFSLTTRNVND-GQIQGVINKLWGVDTNYKSVSKFHVFHDSTNFY 479 D + + R+ T + G + + +L YK V +FHVFHD+++FY Sbjct: 372 DAEL-----NLSPTRFSQQATPLLQKAGYVTNSLQQL------YKDVDQFHVFHDNSSFY 420 Query: 480 GSTGNARGQAVVNISNAAFPILMARNDKNYWLAFGEKRAWDKNE-LAYITEAPS------ 532 G G AR +N SN AFP+LM RND NYWL FG ++A+ +++ Y+T+A + Sbjct: 421 GEVGYARFMRSMNTSNTAFPVLMPRNDVNYWLPFGSEQAYRRDDGFPYVTDAKTIDASSD 480 Query: 533 --LVEPENVTRDTATFNLPFISLGQVGEGKLMVIGNPHYNSILRCPNGYSWNGGVNKDGQ 590 L PE V DTAT+NLP I+ G++G GK++ +GN Y +IL P Y W GG Sbjct: 481 VILKRPERVGTDTATYNLPVITAGEIGLGKVVFMGNSMYPNILSKPENY-WAGGEEA--- 536 Query: 591 CTLNSDPDDMKNFMENVLRYLSDDKWKPDAKASMTVGTNLDTVYFKRHGQVTGNSAAFDF 650 D M F N+ + + + K ++ VG+N+D V+ N+ +DF Sbjct: 537 ---GKDNGSMPTFFMNMFTWFTPGY--DNGKTTINVGSNIDKVWQSN----VNNNQTYDF 587 Query: 651 --HPDFAGISVEHLSS--YGDLDPQEMPLLILNGFEYVTQVGNDPYAIPLRADTSKPKLT 706 H + ++VE LSS Y LDP+ P+LIL +E T + D ++ + AD ++PKLT Sbjct: 588 FVHGSYK-LNVEPLSSGSYAGLDPKTTPVLILQAYE--TGLFGDGMSVKVLADIAQPKLT 644 Query: 707 QQDVTDLIAYLNKGGSVLIMENVMSNLKEESASGFVRLLDAAGLSMA-----------LN 755 DVT LI Y+N GG+VL M+ + ++ + RL D AG+++ Sbjct: 645 TADVTALIKYINAGGNVLFMDGI----EQLNPEPIARLADTAGIALGGANLARTRQAYCG 700 Query: 756 KSVVNNDPQGYPNRVRQQRATGIWVYERYPAVDGALPYTIDSKTGEVKWKYQVENKPDDK 815 +S P YPN R + YE++ D + ++ + G V + P DK Sbjct: 701 ESYYCQAP--YPN-ARASFTDTLVTYEKF---DDMSKFVVN-QDGTVNFP-----SPIDK 748 Query: 816 PKLEVASWLEDV-DGKQETRYAFIDEADHKTEDSLKAAKEKIFAAFPGLKECTNPAYHYE 874 P+ +A + DG ++ +AF KTE A KI AAFP +KECT+ +Y YE Sbjct: 749 PEFGIAQFKTTAEDGSEQDNFAFYSV---KTEAERLEAVAKIKAAFPKVKECTDASYDYE 805 Query: 875 VNCLEYRPGTGVPVTGGMYVPQYTQLSLNADTAKAMVQAADLGTNIQRLYQHELYFRTNG 934 + C+E R G G+ Y P++T+ ++ D MV+AA+LG N+++LYQHE+Y+R+ G Sbjct: 806 IGCIETRKGHGLATGSRYYRPRFTRYEISPDVVNTMVKAANLGGNVEKLYQHEIYYRSQG 865 Query: 935 RKGERLSSVDLERLYQNMSVWLWNDTSYRYEEGKNDELGFKTFTEFLNCYANDAYAGGTK 994 ++G RLS +L + Y N+S+W WND Y Y DELGFK TEFLNCY +D + Sbjct: 866 KEGSRLSLNELNQTYDNLSIWFWNDEQYSYNSEVQDELGFKKATEFLNCYTSDVHQPDNA 925 Query: 995 CSADLKKSLVDNNMIYGDGSSKAGMMNPSYPLNYMEKPLTRLMLGRSWWDLNIKVDVEKY 1054 C+AD ++ L+ M+ + +G +NPSYPLNY EKPLTR+MLGRS+WD +I VD E Y Sbjct: 926 CAADTREKLLKYGML-----TSSGELNPSYPLNYQEKPLTRIMLGRSYWDNDISVDTEMY 980 Query: 1055 PGAVSEEGQNVTETISLYSNPTKWFAGNMQSTGLWAPAQKEVTIKSNANVPVTVTVALAD 1114 PG + EG N + I ++N A NMQSTGLWA + VT+ N + T+TVAL D Sbjct: 981 PGNTAAEGSNASVQIETFNNAVVGTANNMQSTGLWAVKRSVVTVSGNHD--ATITVALVD 1038 Query: 1115 DLTGREKHEVALNRPPRVTKTYSLDASGTVKFKVPYGGLIYIK-GNSSTNESASFTFTGV 1173 D+TG+ +HE++L RP RV K++S A T + PYGGLIYIK ++ T F F+GV Sbjct: 1039 DVTGKHEHELSLKRPSRVQKSWSHKAGSTTEIIAPYGGLIYIKPASTDTANRVEFNFSGV 1098 Query: 1174 VKAPFYKDGAWKNDLNSPAPLGELESDAFVYTTPKKNLNASNYTGGLEQFANDLDTFASS 1233 ++A +K+G W+N +N PL E+ + FVYTTP N+ ++ ++ FA+ ++ FA Sbjct: 1099 LEASLWKNGQWQNPVNQEVPLAEVVTGQFVYTTPVNNVTDTD----IQAFASGMNDFAEK 1154 Query: 1234 MNDFYGRDSEDGKHRMFTYKNLPGHKHRFTNDVQISIGDAHSGYPVMNSSFSPNSTTLPT 1293 +DF+ RD+ DG R FT K LP H HRF NDVQISIG AHSGYPVM+++++ ++ ++PT Sbjct: 1155 ASDFHARDNSDGNMR-FTGKLLPEHSHRFVNDVQISIGAAHSGYPVMSTTYNRDANSIPT 1213 Query: 1294 TPLNDWLIWHEVGHNAAETPLTVPGATEVANNVLALYMQDRY---LGKMNRVADDITVAP 1350 P NDWL+WHE+GHN A P V GATEVANN+LALYMQD G+M+RV DI AP Sbjct: 1214 IPDNDWLLWHEIGHNLAAAPFNVKGATEVANNLLALYMQDLRDNGDGQMDRVKTDIQKAP 1273 Query: 1351 EYLEESNNQAWARGGAGDRLLMYAQLKEWAEKNFDIKKWYPDGTPLPEFYSEREGMKGWN 1410 + W+ G AG RL+M+AQLK WA+++F I + G +P +Y E E GWN Sbjct: 1274 MMISRDEGHVWSHGNAGSRLVMFAQLKVWAQEHFKIADHF-KGQTIPSYYGEDE---GWN 1329 Query: 1411 LFQLMHRKARGDEVSNDKFGGKNYCAESNGNA---ADTLMLCASWVAQTDLSEFFKKWNP 1467 +F+LMH +AR + S+ C+ N N D LM C S V+ DL+ FF+ WNP Sbjct: 1330 MFKLMHHEARNNNNSS--------CSAQNANGLSQGDLLMACTSAVSGYDLTPFFEAWNP 1381 Query: 1468 GANAYQLPGASEMSFEGGVSQSAYNTLASLDLPKPEQGPETINQV 1512 + + ++GG++ + SLDLP+P+ PETI+ + Sbjct: 1382 -SEVSVITADGSRDYQGGITADGIGYVKSLDLPEPKVKPETIDYI 1425 >UniRef50_B5FBF1 Inner membrane lipoprotein n=1 Tax=Vibrio fischeri MJ11 RepID=B5FBF1_VIBFM Length = 1482 Score = 958 bits (2476), Expect = 0.0, Method: Compositional matrix adjust. Identities = 585/1491 (39%), Positives = 837/1491 (56%), Gaps = 134/1491 (8%) Query: 96 LGGSQRVTG-ATCNGES----SDGFTFKPGEDVTCVAGNTTIATFNTQSEAARSLRAVEK 150 + + +TG CNG+S S FT K G C G T+ F + A+ Sbjct: 50 MASGKIITGDVHCNGKSLNTDSGTFTVKEGSVFDCSLGGVTLGEFKAPTPEAKISGVTNT 109 Query: 151 VSFSLEDAQELAGSDDKKSNAVSLVTSSNSCPANTEQVCLTFSSVIESKRFDSLYKQIDL 210 S + D Q + GS NA ++ S ++C + +CL ++S +Y +D Sbjct: 110 TSEASFDLQAVKGS-----NATRILQSISTC-TQEDSICL---DDLDSIDIQDIYSDLDN 160 Query: 211 APEEFKKLVNEEVENNAATDKAPSTHT-SPVVPVTTPGTKPDLNASFVSANAEQFYQYQP 269 L ++E E K PS+H + +VP TPGT DLN+ FVSANAE Y Y+P Sbjct: 161 NESVNAFLKSKEEEKTDEVGKTPSSHVDAEIVPEVTPGTSNDLNSGFVSANAEDSYAYKP 220 Query: 270 TE--IILSEGRLVDSQGYGVAGVNYYTNSGRGVTGENGEFSFSWGETISFGIDTFELGSV 327 + +L++ +L DS G +AGVN+++ + G+TGENGEF + WG+ ++FGIDTFE GSV Sbjct: 221 SAEAKVLTKSQLTDSTGTPLAGVNFFSANAVGITGENGEFEYLWGDKLTFGIDTFEFGSV 280 Query: 328 RGNKSTIALTELGDE-VRGANIDQLIHRYSTTGQNNTRVVPDDVRKVFAEYPNVINEIIN 386 GN+ + +T++ D V ANI LI RY+ N ++ + V+ F+ YPNVINE+IN Sbjct: 281 AGNQVSYKITDVSDNAVVKANIQSLITRYAENNHNGL-LISEKVQDTFSLYPNVINELIN 339 Query: 387 LSLSNGATLGEGEQVVNLPNEFIEQFNTGQAKEIDTAICAKTDGCNEARWFSLTTRNVND 446 LSL NG + EG +LP+EF QF G ID + + + +FS + Sbjct: 340 LSLPNGGQI-EGTNF-SLPDEFDAQFQNGLTAAIDAELQQQ----RASFYFSDFPHVFSL 393 Query: 447 GQIQGVINKLWGVDTNYKSVSKFHVFHDSTNFYGSTGNARGQAVVNISNAAFPILMARND 506 V + L + + V+ FHVF+D+ +FYG+TG RG +N+SN AFPI+M R D Sbjct: 394 DNGTYVTDSLTRI---FNGVTSFHVFNDNGSFYGATGYTRGMRALNLSNRAFPIMMPRAD 450 Query: 507 KNYWLAFGEKRAWDKNELAYITEAPSLVEPEN--VTRDTATFNLPFISLGQVGEGKLMVI 564 N + FGE++AW + YI P++ P V++D ATF PF++ G++G GK++ + Sbjct: 451 INKDIPFGEQQAWTREGRPYIAVHPTIEMPPIPLVSKDNATFGFPFVTAGEIGSGKVVFM 510 Query: 565 GNPHYNSILRCPNGYSWNGGVNKDG---QCT----LNSDPDD----MKNFMENVLRYLSD 613 GN Y SI+ CP+ Y N + D CT L +DP + MK F N+ +L++ Sbjct: 511 GNSMYPSIISCPDNYWANDALRIDSALQSCTSSFDLANDPRNDNGSMKTFFNNLFTWLNN 570 Query: 614 DKWKPDAKASMTVGTNLDTVYFKRHGQVTGNSAAFDFHPDFAGISVEHLSS---YGDLDP 670 DK + + V TN+D R G GN+ F +P F SVE L+ G L Sbjct: 571 DK----SIKGINVATNIDVATALRSGTSHGNAYDFFVNPSFGFSSVEKLTKDGFSGRLSA 626 Query: 671 QEMPLLILNGFEYVTQVGNDPYAIPLRADTSKPKLTQQDVTDLIAYLNKGGSVLIMENVM 730 E PLLIL + Q D + AD P L+Q D+T LI Y+N+GGSVL M+ + Sbjct: 627 SETPLLILQAYPPKPQ--GDGMSHRFIADLDNPNLSQDDITALITYINEGGSVLFMDAID 684 Query: 731 SNLKEESASGFVRLLDAAGLSMALNKSVVNNDPQGY-----------PN-RVRQQRATGI 778 E RL D+AG+S L S V Q + PN V+ Q + Sbjct: 685 KVTNPEPIG---RLADSAGVS--LGGSNVTPTSQAFCGSSYYCQAPSPNLHVKSQYE--M 737 Query: 779 WVYERYPAVDGALPYTIDSKTGEVKWKYQVENKPDDKPKLEVASW-----------LEDV 827 V ER+ VDG PYT++ + G V+W K + K K E+ ++ L D Sbjct: 738 VVLERFQDVDGQQPYTVN-QDGSVEW-----TKDETKIKFEIPTYEIIKRDDKGDPLLDK 791 Query: 828 DGK--QETRYA--FIDEADHKTEDSLKAAKEKIFAAFPGLKECTNPAYHYEVNCLEYRPG 883 DG ET++A F+ + + AA ++ AF G C++ +Y YE NC+E R G Sbjct: 792 DGNPVMETKFARIFVKNGEERA-----AAISELQEAFEGTPLCSH-SYEYEFNCIETRQG 845 Query: 884 TGVPVTGGMYVPQYTQLSLNADTAKAMVQAADLGTNIQRLYQHELYFRTNGRKGERLSSV 943 G+ V G + + + +N D ++MV+AA+LG N L +HE+Y+RT G++G RLS+V Sbjct: 846 DGIQVRGAYWRADFDRYQMNQDVVESMVKAANLGDNFNALMEHEMYYRTKGKQGTRLSTV 905 Query: 944 DLERLYQNMSVWLWNDTSYRYEEGKNDELGFKTFTEFLNCYANDAYAGGT---KCSADLK 1000 +L + Y N+S+W+WND Y Y+ DELGFKT FLNCY ++ + C +LK Sbjct: 906 ELNQTYDNLSIWMWNDNPYAYDPNVQDELGFKTAVNFLNCYTDNQHQTDVPEAACPVELK 965 Query: 1001 KSLVDNNMIYGDGSSKAGMMNPSYPLNYMEKPLTRLMLGRSWWDLNIKVDVEKYPGAVSE 1060 +L+ N MI+G+G AG MNPSYPLNYMEKPLTR+MLGRS+WD I VD KYPG + Sbjct: 966 ATLIANGMIHGEGE-LAGQMNPSYPLNYMEKPLTRIMLGRSFWDHEITVDTTKYPGRTNG 1024 Query: 1061 EGQNVTETISLYSNPTKWFAGNMQSTGLWAPAQKEVTIKSNANVPVTVTVALADDLTGRE 1120 + I + AGN QSTGLWAP EVT++ +V +TV +ADDLTG+ Sbjct: 1025 ATTSEVVNIETAGKAVSYSAGNNQSTGLWAPQLSEVTVR--GDVTAMITVMMADDLTGKP 1082 Query: 1121 KHEVALNRPPRVTKTYSLDASGTVKFKVPYGGLIYIKGN---SSTNESASFTFTGVVKAP 1177 +HE +LNRPPR+ +++ D T FKVPYGGLIYIK S + A F+ GV KA Sbjct: 1083 QHETSLNRPPRMQMSFAHDGRSTT-FKVPYGGLIYIKPTEILSGASTVAEFSLDGVEKAA 1141 Query: 1178 FYKD------GAWKNDLNSP-APLGELESDAFVYTTPKKNLNASNYTGGLEQFANDLDTF 1230 ++K G W N +S AP+ E+++ +F+YTT N+ T L +F+ +++ F Sbjct: 1142 WWKKDPANNLGEWVNTPDSSTAPIAEIDTGSFIYTTALNNVK----TADLNEFSKNMNRF 1197 Query: 1231 ASSMNDFYGRDSE--DGKHRMFTYKNLPGHKHRFTNDVQISIGDAHSGYPVMNSSFSPNS 1288 A + +DFYGRD E DGKHR FTY L +HRF NDVQISIG AHSGYPVM+SSF+ +S Sbjct: 1198 ADAASDFYGRDEESADGKHRRFTYPELKEFRHRFVNDVQISIGAAHSGYPVMSSSFNASS 1257 Query: 1289 TTLPTTPLNDWLIWHEVGHNAAETPLTVPGATEVANNVLALYMQDRYLGK-----MNRVA 1343 +PT ++DWL+WHEVGHN A P + PG+TEV NN+LALYMQ+ G+ M+R+ Sbjct: 1258 NKIPTNAIDDWLVWHEVGHNLASAPFSAPGSTEVTNNLLALYMQE-LEGRNANPEMDRIR 1316 Query: 1344 DDITVAPEYLEESNNQAWARGGAGDRLLMYAQLKEWAEKNFDIKKWYPDGTPLPEFYSER 1403 I AP +L ++ AW+ G AG RL+M+ QLK WAE +F+I +WY DG P Y++ Sbjct: 1317 TSIQKAPAWLSSNDGHAWSHGDAGLRLVMFGQLKIWAENHFEIDRWYVDGETKPAIYNQD 1376 Query: 1404 EGMKGWNLFQLMHRKARGDEVSNDKFGGKNYCA--ESNGNAADTLMLCASWVAQTDLSEF 1461 + GWN+ +LMHRKARGD+ + G NYC+ ++ +A D +M+C+S+V+ DL EF Sbjct: 1377 Q---GWNMIKLMHRKARGDQQGD---AGINYCSSGDTGLSAGDLMMVCSSYVSGYDLGEF 1430 Query: 1462 FKKWNPGANAYQLPGASEMSFEGGVSQSAYNTLASLDLPKPEQGPETINQV 1512 F+ WN G + +++ + GG+S + + LA L L P++ P TIN + Sbjct: 1431 FQAWNVGETSVTNADGTKV-YSGGISSAGLSKLAELKLNNPKKDPLTINAL 1480 >UniRef50_Q5E705 Accessory colonization factor AcfD-like protein, predicted inner membrane lipoprotein n=1 Tax=Vibrio fischeri ES114 RepID=Q5E705_VIBF1 Length = 1569 Score = 672 bits (1733), Expect = 0.0, Method: Compositional matrix adjust. Identities = 526/1637 (32%), Positives = 791/1637 (48%), Gaps = 251/1637 (15%) Query: 9 KSLLAAILSATLLAGCDGGGSGSSSDTPPVDSGTGSLPEVKPDPTPNPEPTPEPTPDPEP 68 K LL A L LLAGC+ +G G++P++ D P P+P Sbjct: 3 KKLLLASLIPMLLAGCN--------QEEININGNGTVPDIGGDGGVTP---------PKP 45 Query: 69 TPEPIPDPEPTPEPEPEPVPTKTGYLTLGGSQRVTGATCNGESSDGFTFKP--------G 120 TP+PI K ++ + GATC+G SD Sbjct: 46 TPDPI----------------KYRFMITSSGAPIEGATCDGRLSDHLGVIALDYDSNTLP 89 Query: 121 EDVTCVAGNTTIATFNTQSEAARSLRAVEKVSFSLEDAQELAGSDDKKSNAVSLVTS--S 178 + + C+ T +ATF T + + +RA + + + D + D+ N SL+ + + Sbjct: 90 QSIDCLIAGTPLATFATSTN--KRVRA-QDYNLDIADGKISDLQGDQLVNIQSLLRTVDA 146 Query: 179 NSCPANTEQVCLTFSSVIESKRFDSLYKQIDLAPEEFKKLVNEEVENNAATDKAPST--- 235 + +N Q SV K + + Y A E+ ++L ++ DK Sbjct: 147 DGVDSNGYQFVEGEKSV---KNYSANYAD---ALEKTQELFLKDNIGYLKDDKPAGVEPG 200 Query: 236 HTSPVVPVTTPGTKPDLNAS-FVSANAEQFYQYQPTEIILSEGRLVDSQGYGVAGVNYYT 294 H + V PV TPG+ S VS+NAE+ Y+Y+P E+ + E ++ G V GV YY Sbjct: 201 HGTDVEPVVTPGSDDVTGGSGIVSSNAEKQYEYKP-EVAVPEKSILMLDGQPVVGVEYYG 259 Query: 295 NSGRGVTGENGEFSFSWGETISFGIDTFELGSVRGNKSTIALTELG-DEVRGANIDQLIH 353 + RG T +G F ++WG+ ++FGI LGS++ + L L D + N++ LI Sbjct: 260 PTYRGKTDVDGSFEYNWGDEVTFGIQALTLGSIKAKGLDVQLGALAADPSKSKNVENLIK 319 Query: 354 RYSTTGQNNTR--VVPDDVRKVFAEYPNVINEIINLSLSNGAT------LGEGEQVVNLP 405 ++ ++N+ V+ DV + FA N I E+IN++L++G T G QV Sbjct: 320 QFD---KDNSAPWVIEQDVHERFALESNNIVELINMNLASGDTSNFDPSFGTPPQV---K 373 Query: 406 NEFIEQFNTG-QAKEIDTAICAKTDGCNE---ARWFSLTTRNVNDGQIQGVINKLWGVDT 461 NEFI QFN G A +I T++ N + + SL + N + ++ + D Sbjct: 374 NEFIAQFNDGGSAFDIVTSLGLSPINVNTYTFSPYVSLRSVNRVETASDALLTMMGQNDN 433 Query: 462 NYKS-VSKFHVFHDSTNFYGSTGNARGQAVVNISNAAFPILMARNDKNYWLAFGEKRAWD 520 N + V+ FHVF + + Y + +NI N A P++M R+D N ++ FG+ D Sbjct: 434 NKDNDVTHFHVFGNQNDGY-HMPSPHAATFINIDNNAAPVVMPRSDLNAYIPFGQLAVTD 492 Query: 521 KNELAYIT------EAPSLVEPENVT---------------------RDTATFNLPFISL 553 K + T P+ ++ +N T ++TATF LPF+ Sbjct: 493 KFSRPFFTLTNDPKTTPTYIDAKNKTHWNLKEQDVAADSVESSYKMNKETATFELPFVVS 552 Query: 554 GQVGEGKLMVIGNPHYNSILRCPNGYSWNGGVNKDGQCTLN---SDPDDMKNFMENVLRY 610 G++GEGK++V+GN YNSIL CP YS+N +NKDG C+ +D DM NF N + Sbjct: 553 GKIGEGKVLVLGNSLYNSILVCPENYSFNASINKDGVCSNGNGVTDSLDMFNFFVNAFNW 612 Query: 611 LSDDKWKPDAKASMTVGTNLDTVYFKRHGQVTGN-SAAFDFHPDFAGISVEHLSSYG--D 667 L K D + + TN V F R ++G+ S F + F + +S + Sbjct: 613 LDTKKLNQD----INIATNRSEVSFSR---ISGSASHPFKLNESFKALGFRLMSDFSPQG 665 Query: 668 LDPQEMPLLILNGFEYVTQVGNDPYAIPLRADTSKPKLTQQDVTDLIAYLNKGGSVLIME 727 LD P+ IL Y T GN R D P + DV LI Y+N+GGSVLIME Sbjct: 666 LDVASTPIYILQA--YPTLGGNTD-----RPDYENPIINDDDVNALIDYVNQGGSVLIME 718 Query: 728 NVMSNLKEESASGFVRLLDAAGLSMALNKSVVNNDPQGYPNRVRQQRATG------IWVY 781 + L + R LD AG+S A+ K+ P+ + G ++ Sbjct: 719 S----LYNRNLPILGRFLDTAGIS-AIGKNNGVKFAGKLPSNFIAELGKGGTSLRPVYTE 773 Query: 782 ERYPAVDG-----------ALP---YTIDSKTGEVKWKYQVENKPDDKPKLEVASWLEDV 827 E Y ++G +P Y D T W + ++K L + V Sbjct: 774 EVY-VLEGLAFEANTTDANGIPEKGYKFDKGTNTYYWG---DKSINNKAVLRALYRPQQV 829 Query: 828 DGKQETRYAFIDEADHKTE---DSLKAAKEKIF--AAFPGLKE-------------CTNP 869 + E R+ E +T+ D+L+A + A GL++ CTN Sbjct: 830 --RDENRHNVTKECQQQTDLVDDALQACIDTKLNQLAQDGLEQWVSDVQAIYEVPMCTNS 887 Query: 870 AYHYEVNCLEYRPGTGVPVTGGMY------VPQYTQLSLNADTAKAMVQAADLGTNIQRL 923 AY Y+++C+E R G G+P++ +Y V + +L ++ D + AM++AA++GTN+ L Sbjct: 888 AYQYQLDCIERREGNGIPLSKTIYPGATDMVQAFARLPMSKDVSNAMIEAANMGTNLTDL 947 Query: 924 YQHELYFRTNGRKGERLSSVDLERLYQNMSVWLWNDTSYRYEEGKNDELGFKTFTEFLNC 983 YQHELY+RT G++G RLS +D++R+Y N+S W+WN+ YRY+ DE G KT EFLNC Sbjct: 948 YQHELYYRTGGKEGVRLSGIDVDRIYNNLSAWMWNNEQYRYDSSTKDEFGHKTVVEFLNC 1007 Query: 984 YANDAYAGGTK-----CSADLKKSLVDNN-MIYGDGSSKAGMMNPSYPLNYMEKPLTRLM 1037 Y+N+ Y + C +LK ++ ++ DG +K +NPSYPLNYMEKPLTR+M Sbjct: 1008 YSNNTYGNPSSDNVIGCPEELKAEMLTKGFLVTVDGVNK---LNPSYPLNYMEKPLTRMM 1064 Query: 1038 LGRSWWDLN----------IKVDVEKYPGAVSEEGQNVTETISLYSNPTKWFAGNMQSTG 1087 LGRS++D++ ++VDV YPG + TI G+ QS G Sbjct: 1065 LGRSYFDVDAKNPTAEDRGVQVDVRSYPGVATTTAAAKDITIH---------KGSRQSAG 1115 Query: 1088 LWAPAQKEVTIKSNANVPVTVTVALADDLTGREKHEVALNRPPRVTKTYS-LDASGTVKF 1146 +W PA +EV + TV +A+AD+LTGR HE+ALNRPPRV+ +++ ++AS F Sbjct: 1116 VWIPA-REVAYVHGLSSDDTVMIAMADNLTGRVNHEMALNRPPRVSMSFNGVEASN--GF 1172 Query: 1147 KVPYGGLIYIKGNSSTNESASFTFTG-VVKAPFY-----KDGAW-KNDLNSPAPLGELES 1199 KVPYGG +YI S ESA +F G + AP + +G+W S AP+ E+ Sbjct: 1173 KVPYGGSVYITLGSK--ESAQVSFGGSAIAAPMFMMTSATEGSWITTPEESDAPITEIVG 1230 Query: 1200 DAFVYTTPKKNLNASNYTGGLEQFANDLDTFASSMNDFYGRDSEDGKHRMFT-------Y 1252 F YTT + + LE D F +N+FYGRD G H+MFT Y Sbjct: 1231 KRFSYTTTTAGIKGHSEVDVLE-MTKQFDLFTIGVNEFYGRDGVSGAHKMFTDSAPELEY 1289 Query: 1253 KNLPGHKHRFTNDVQISIGDAHSGYPVMNSSFSPNSTTLPTTPLNDWLIWHEVGHNAAET 1312 +N+ R +D+QISIG AHSGYPVM++SF ++L ++W++ HE+GHN A Sbjct: 1290 QNM-----RLVDDIQISIGSAHSGYPVMSTSFPRQKSSL-FKATDNWMLGHEIGHNQAAN 1343 Query: 1313 PLTVPGATEVANNVLALYMQDRYLGKMNRVADDITVAPEYLEESNNQAWARGGAGDRLLM 1372 L V GA E ANNVLALY Q+R G M R+ IT A E+ + + WA G DRL Sbjct: 1344 WLNVVGAGETANNVLALYTQERNTGDMPRIKVSITNATEW--ANGDHPWADGTNADRLNF 1401 Query: 1373 YAQLKEWAEKNFDIKKWYPDGTPLPE--FYSEREGM---KGWNLFQLMHRKARGDEVSND 1427 + QLK WAE NFDI +W + E Y++ E +GWN ++ +HR AR E + Sbjct: 1402 FGQLKLWAEDNFDIAQWESEAKLAEERSIYNKNEAGQYDQGWNFYKYLHRAARMPETFTE 1461 Query: 1428 KF--GGKNYC----AESNG-NAADTLMLCASWVAQTDLSEFFKKWNPGANAYQLPGASEM 1480 G NYC A+ N + D +M+C+S++ D+ FF KW G + L + Sbjct: 1462 GLNKGDVNYCSSEFAQVNSLSKQDMMMICSSFLTGKDIETFFIKWKFGESKVTLQSGDKY 1521 Query: 1481 SFEGGVSQSAYNTLASL 1497 S G+S+ A + + Sbjct: 1522 SV--GISEPALGVMEDM 1536 >UniRef50_B8B8E6 Putative uncharacterized protein n=5 Tax=cellular organisms RepID=B8B8E6_ORYSI Length = 753 Score = 157 bits (397), Expect = 3e-36, Method: Compositional matrix adjust. Identities = 69/73 (94%), Positives = 72/73 (98%) Query: 538 NVTRDTATFNLPFISLGQVGEGKLMVIGNPHYNSILRCPNGYSWNGGVNKDGQCTLNSDP 597 +VTRDTATFNLPFISLGQVGEGKLMVIGNPHYNSILRCPNGYSWNGGVNKDGQCTLNSDP Sbjct: 479 SVTRDTATFNLPFISLGQVGEGKLMVIGNPHYNSILRCPNGYSWNGGVNKDGQCTLNSDP 538 Query: 598 DDMKNFMENVLRY 610 DDMKNFMEN L++ Sbjct: 539 DDMKNFMENTLKW 551 >UniRef50_UPI0001B9ED55 hypothetical protein GYMC10_4682 n=1 Tax=Geobacillus sp. Y412MC10 RepID=UPI0001B9ED55 Length = 836 Score = 117 bits (292), Expect = 5e-24, Method: Compositional matrix adjust. Identities = 135/531 (25%), Positives = 215/531 (40%), Gaps = 123/531 (23%) Query: 1015 SKAGMMNPS-YPLNYMEKPLTRLMLGRSWWDLNIKVDVEK------YPGAVSEEGQNVTE 1067 ++AG + P+ +P+ ++P T + + + EK +PG VSEE V + Sbjct: 377 TEAGSLPPAEFPIQKFQQPYTNALHNFRFSHFTLDPANEKSPYADAFPGVVSEEAAIVND 436 Query: 1068 -----------TISLYSNPTKWFAGNMQSTGLWAPAQKEVTIKSNANVP-VTVTV-ALAD 1114 T+ ++ P+K N STGL+AP K +T++ V +TV + + D Sbjct: 437 REVEVDFDFPNTMYTHALPSK----NWISTGLYAPPGKVITLEVPEGVEHLTVQIGSHDD 492 Query: 1115 DLTGREKHEVALNRPPRVTKTYSLDASGTVKFKVPYGGLIYIKGNSSTNE-SASFTFTGV 1173 DL G K E R P + L G + PYGGLIY+ + N+ A+ +G Sbjct: 493 DLRGSGKWE----RVPLIVNHQKL-TPGIHQVNSPYGGLIYLIPLKAKNDFRATVKISGA 547 Query: 1174 VKAPFYKDGA-----WKNDLNSPA--PLGELESDAFVYTTPKKNLNASNYTGGLEQFAND 1226 V+AP+Y G W+ P P EL+ + + T P + E+ Sbjct: 548 VEAPYYVLGKTTLEEWERIRTGPVTVPFAELQGERIILTVPSDLIRQ---VADPEELMRT 604 Query: 1227 LDTFASSMNDFYGRDSEDGKHRMFTYKNLPGHKH----RFTNDVQISIGDAHSGYPVMNS 1282 D S ++ G D + + +P H R+ D QIS G H+GYP+M Sbjct: 605 WDEIYDSYDELVGLDPD---------RAMPHTAHQLNRRYVADGQISSGAMHAGYPIMLP 655 Query: 1283 -SFSPNSTTLPTTPLNDWLIWHEVGHNAAETPLTVPGATEVANNVLALYMQDRY--LGKM 1339 S++ N + + W WHE+GH + T EV N+ +LY Q++Y ++ Sbjct: 656 FSYAANLLDVHYVKTSAWGFWHELGHEYQQRTWTWSDVGEVTVNLFSLYTQEKYGNASEL 715 Query: 1340 NRVADDITVAPEYLE------ESNNQAWARGGAG--DRLLMYAQLKEWAEKNFDIKKWYP 1391 +V +D +Y + E+N+ A G G +RL+M+ QL+ Sbjct: 716 LKVGND---GKDYYDRGIAFVENNDPAKKYGQIGNYERLVMFKQLQL------------- 759 Query: 1392 DGTPLPEFYSEREGMKGWNLFQLMHRKARGDEVSNDKFGGKNYCAESNGNAADTLMLCAS 1451 GW + + R E+S D+ G DT + AS Sbjct: 760 --------------AYGWEFYTRIFETYR--ELSRDEIQG----------TVDTFAVIAS 793 Query: 1452 WVAQTDLSEFFKKWNPGANAYQLPGASEMSFEGGVSQSAYNTLASLDLPKP 1502 A DL+EFF KW G++ +ASL+LP+P Sbjct: 794 QTAGEDLTEFFDKWAI-----------------GLTDDGRARIASLNLPEP 827 >UniRef50_UPI000178AA33 hypothetical protein GYMC10_4678 n=1 Tax=Geobacillus sp. Y412MC10 RepID=UPI000178AA33 Length = 1078 Score = 99.4 bits (246), Expect = 1e-18, Method: Compositional matrix adjust. Identities = 132/528 (25%), Positives = 211/528 (39%), Gaps = 121/528 (22%) Query: 1023 SYPLNYMEKPLTRLML----GRSWWDLNIKVD--VEKYPGAVSEEGQNVT-ETISL---Y 1072 ++PL+ + P T +L GR DL +++PGAV + VT +++ + Y Sbjct: 360 TFPLDRSKAPYTSALLAFQLGRIGNDLTAPKSPYADQFPGAVPGDAPRVTGQSVHVNFDY 419 Query: 1073 SN---------PTKWFAGNMQSTGLWAPAQKEVTI---KSNANVPVTVTVALADDLTGRE 1120 S P W STGL+APA + VTI + N+ V V A D+LT + Sbjct: 420 STYDYLRQGTVPKNWI-----STGLYAPAGEWVTIHVPEGTQNLDVQVG-AHTDNLTSKT 473 Query: 1121 KHEVALNRPPRVTKTYSLDASGTVKFKVPYGGLIY-IKGNSSTNESASFTFTGVVKAPFY 1179 + E RPP +T+ L SG + + PYGGLIY I + + +G V+AP+Y Sbjct: 474 EWE----RPPVITQRKPL-LSGENRIRSPYGGLIYLIPTKPQPSVAKDVEISGGVRAPYY 528 Query: 1180 KDGA-----WKNDL-NSPAPLGELESDAFVYTTPKKNLNASNYTGGLE---QFANDLDTF 1230 G W++ + + PAP EL+ + T P S Y LE Q D Sbjct: 529 ILGETSPSEWEDAIRHHPAPWAELQGRRVILTLP------SEYIRQLEDPQQLVEKWDAI 582 Query: 1231 ASSMNDFYGRDSEDG-KHRMFTYKNLPGHKHRFTNDVQISIGDAHSGYPVM--NSSFSPN 1287 + G + H+ +LP R+ D QIS G H+GYP+M + + Sbjct: 583 VDYTEEVAGLSPDQQLPHKSI---DLP---FRYVADRQISAGYMHAGYPIMFHIDPSAGH 636 Query: 1288 STTLPTTPLNDWLIWHEVGHNAAETPLTVPGATEVANNVLALYMQDRYLGKMNRVADDIT 1347 + + W WHE GH + E+ N+ +LY+Q+++ Sbjct: 637 AVDISRVTQGGWGFWHETGHEYQQGAWNWNVTGEITVNIYSLYVQEKF------------ 684 Query: 1348 VAPEYLEESNNQAWARGGAGDRLLMYAQLKEWAEKNFDI------KKWYPDGTPLPEF-- 1399 G + + L+ A+ K++ ++ FD K + L F Sbjct: 685 ----------------GNSSNLLIRNAEGKDFYDRAFDYIESDLPGKSFGTSGQLDLFGY 728 Query: 1400 ---YSEREGMKGWNLFQLMHRKARGDEVSNDKFGGKNYCAESNGNAADTLMLCASWVAQT 1456 + + GW+ + +H+ R S +N DT ++ AS A Sbjct: 729 LVMFRQLSLAYGWDFYAELHKAYRELPASQ--------LPATNQEEIDTFVIMASKTAGE 780 Query: 1457 DLSEFFKKWNPGANAYQLPGASEMSFEGGVSQSAYNTLASLDLPKPEQ 1504 +L+EFF KW + + QS +A+L+LP P Q Sbjct: 781 NLTEFFDKW-------------ALPYSKAEVQS---RIAALNLPLPSQ 812 >UniRef50_C1ZFD9 Putative uncharacterized protein n=1 Tax=Planctomyces limnophilus DSM 3776 RepID=C1ZFD9_PLALI Length = 785 Score = 97.4 bits (241), Expect = 4e-18, Method: Compositional matrix adjust. Identities = 103/354 (29%), Positives = 153/354 (43%), Gaps = 64/354 (18%) Query: 1054 YPGAVSEEGQNVTETISLYSNPTKWFAGNMQSTGLWAPAQKEVTIKSNANV---PVTVTV 1110 +PGAV + VT +++ + T+W +STGL+A V +K N+ + + Sbjct: 387 FPGAVPANAKKVTRKVTINTETTRW-----KSTGLYAAPGTLVKVKVPRNIVGQKFEIQI 441 Query: 1111 -ALADDLTGREKHEVALNRPPRVTKTYSLDASGTVKFKV--PYGGLIYIKGNSSTNESA- 1166 + +D L +++ RPP V + + +D V+F+V YGGLIY+ T Sbjct: 442 GSHSDSLWSKDE----WRRPPAVIRQFPID---KVEFEVGNAYGGLIYVVVPQKTPAGKF 494 Query: 1167 SFTFTGVVKAPFYKDGA-----WKNDL-NSPAPLGELESDAFVYTTPKKNLNASNYTGGL 1220 F+ VV AP++ G W+ + N PAP ELE+ V T P + + ++ L Sbjct: 495 EVEFSNVVDAPYFVHGETDISDWRFTIRNYPAPWAELETRHLVITVPSELVRKLDFPDKL 554 Query: 1221 -EQFANDLDTFASSMNDFYGRDSEDGKHRMFTYKNLPGHKHRFTNDVQISIGDAHSGYPV 1279 +A LD A D Y +N P + RF D QIS G HSGYP+ Sbjct: 555 MNHWAAVLDACA----DLYS-----------ISRNRP-YAERFVFDDQISAGFMHSGYPI 598 Query: 1280 MNSSFSPNSTTLPTTPLN------DWLIWHEVGHNAAETPLTVPGATEVANNVLALYMQD 1333 M + N + LN W +HE+GHN + T G EV NN+ LY+ D Sbjct: 599 MCFT---NPSAPEVVDLNFLENKGGWGFYHELGHNHQKGDWTFQGTGEVTNNLTPLYVID 655 Query: 1334 RYLGKMNRVADDITVAPEYLEESNNQAWARGGAGDR---------LLMYAQLKE 1378 K + D PE ++ + + GA L MY QLKE Sbjct: 656 TLTPKA--FSHDAIQQPE--RDNRERKYVMNGAPFSTWQEDPFLALTMYIQLKE 705 >UniRef50_B2ULE8 Putative uncharacterized protein n=1 Tax=Akkermansia muciniphila ATCC BAA-835 RepID=B2ULE8_AKKM8 Length = 660 Score = 95.1 bits (235), Expect = 2e-17, Method: Compositional matrix adjust. Identities = 83/296 (28%), Positives = 122/296 (41%), Gaps = 31/296 (10%) Query: 1054 YPGAVSEEGQNVTETISLYSNPTKWFAGNMQSTGLWAPAQKEVTIKSNANVPVTVTVALA 1113 +PG Q V T+ + SN G STGL+AP E++ S + P ++++ Sbjct: 255 FPGVPENGAQTVRRTVEIDSN-----IGGWHSTGLYAPPGAEISC-SLSGAPKDGSISVR 308 Query: 1114 DDLTGREKHEV-ALNRPPRVTKTYSLDASGTVKFKVPYGGLIYIKGNSSTNESASFT--F 1170 H++ R P +T G VK P GGL+Y+ F Sbjct: 309 IGCHTDSLHKLDEWKRVPEITMQVPA-GRGRVKMVNPMGGLVYVNVGQRPRRGKVFKVQI 367 Query: 1171 TGVVKAPFYKDGA-----WKNDL-NSPAPLGELESDAFVYTTPKKNLNASNYTGGLEQFA 1224 +G V +P + G W L N+ AP GE+ + T P + L +++ A Sbjct: 368 SGAVPSPLFVMGKTTPEQWAEQLENTKAPWGEIRMPRLIVTMPVEQLKQCP---DVQKTA 424 Query: 1225 NDLDTFASSMNDFYGRDSE-DGKHRMFTYKNLPGHKHRFTNDVQISIGDAHSGYPVMNSS 1283 L + + G D++ D H H RF D QIS G HSGYP M + Sbjct: 425 EFLQKNMALQDWIMGWDTKPDRLH----------HPMRFVVDRQISAGAGHSGYPAMATK 474 Query: 1284 FSPNS-TTLPTTPLNDWLIWHEVGHNAAETPLTVPGATEVANNVLALYMQDRYLGK 1338 NS T W +WHE+GHN P T+ G TEV+ N+ ++ + GK Sbjct: 475 DWTNSIATGSIIHSGSWGLWHELGHNHQSPPFTMEGQTEVSVNIFSMVCEVMGTGK 530 >UniRef50_UPI00017445F8 hypothetical protein VspiD_04825 n=1 Tax=Verrucomicrobium spinosum DSM 4136 RepID=UPI00017445F8 Length = 763 Score = 91.3 bits (225), Expect = 3e-16, Method: Compositional matrix adjust. Identities = 85/320 (26%), Positives = 135/320 (42%), Gaps = 38/320 (11%) Query: 1054 YPGAVSEEGQNVTETISLYSNPTKWFAGNMQSTGLWAPAQKEVTI---KSNANVPVTVTV 1110 +PG +++ VT+ I++ +N W STGL+A A + +T+ ++ ++ V Sbjct: 365 FPGQPAKDAPRVTKEITVDANIDGW-----TSTGLYAVAGEPITVTVPEAMVGNGFSIRV 419 Query: 1111 ALADDLTGREKHEVALNRPPRVTKTYSLDASGTVKFKVPYGGLIYIK--GNSSTNESASF 1168 D H + R P +T+T ++ T K +GGL+YI G + N S F Sbjct: 420 GCHSDTL---YHLESWRRAPDITRTVGIENVET-KMGTAFGGLVYITVPGRARRNASEPF 475 Query: 1169 T--FTGVVKAPFYK-----DGAWKNDLNSPAPLGELESDAFVYTTPKKNLNASNYTGGLE 1221 G V++P++K D W N + P EL + V + P + L Sbjct: 476 KVKIAGAVESPYFKLGRDTDEQWNNIKKAQGPWAELAGEKMVVSLPSEVARKITNPTELM 535 Query: 1222 QFANDLDTFASSMNDFYGRDSEDGKHRMFTYKNLPGHKHRFTNDVQISIGDAHSGYPVM- 1280 +F D ++ +D + +E + R DVQIS G HSGYP+M Sbjct: 536 EF---WDRVVTAQDDISNQTAERTR------------PERMVADVQISAGFMHSGYPIMI 580 Query: 1281 NSSFSPNSTTLPTTPLNDWLIWHEVGHNAAETPLTVPGATEVANNVLALYMQDRYLGKMN 1340 ++ S T W +HE+GHN T G EV NNV LY L K Sbjct: 581 HTPESAEMVTYGRIKYPGWGFYHEIGHNHQRGNFTFEGTGEVTNNVFGLYCYTEVLKKEL 640 Query: 1341 RVADDITVAPEYLEESNNQA 1360 + V+PE +++ + A Sbjct: 641 LIGHG-GVSPESIKKHIDAA 659 >UniRef50_A4GHK6 Putative uncharacterized protein n=1 Tax=uncultured marine bacterium EB0_35D03 RepID=A4GHK6_9BACT Length = 1173 Score = 85.9 bits (211), Expect = 1e-14, Method: Compositional matrix adjust. Identities = 101/415 (24%), Positives = 175/415 (42%), Gaps = 71/415 (17%) Query: 1069 ISLYSNPTKWFAGNMQSTGLWAPAQKEVTIKS-----NANVPVTVTVALADDLTGREKHE 1123 + ++ + WF STGL+A A +++TIK NA + + + + D + Sbjct: 563 VHIHMSGDNWF-----STGLFASAGQKITIKVPKDLVNAGLKIQIGSHIWGDYIF---NH 614 Query: 1124 VALNRPPRVTKTYSLDASGTVKFKVPYGGLIYIKG--NSSTN--ESASFTFTGVVKAPFY 1179 + + R P +T ++LD S + +GGLIYI N N +++S++ +G AP Y Sbjct: 615 MDMRRFPYITYQWNLDQSEVI-VNSSFGGLIYIVDPVNQQINFPKTSSWSISGAYLAPRY 673 Query: 1180 KDGA-----WKNDLNS-PAPLGELESDAFVYTTPKKNLNASNYTGGLEQFANDLDTFASS 1233 G WKN++ PAP E+ESD + T P + DLD Sbjct: 674 IHGKTALNDWKNEIRKYPAPWAEIESDKVILTVPSHAIR-------------DLDN-PDD 719 Query: 1234 MNDFYGRDSEDGKHRMFTYKNLPGHKHRFTNDVQISIGDAHSGYPVMNSS-FSPNSTTLP 1292 + +F+ R + D + + + R+ D G AH+GYP+M + + P Sbjct: 720 LMEFWAR-AIDAAADLASISRVREFPQRYVTDPNWQWG-AHAGYPIMMAGPWYPYLLNHK 777 Query: 1293 TTPLNDWL-IWHEVGHNAAETPLTVPGATEVANNVLALYMQDRYLG-KMNRVADDITVAP 1350 LN W +HE+GHN G EV+ N+ ++Y+ + G + D + + P Sbjct: 778 KIGLNYWWGTFHELGHNHQMNDWMWDGWGEVSTNLWSVYILETIAGLERKNTWDGMLLFP 837 Query: 1351 EYLEESNNQAWARGGAGDRLLMYAQLKEWAEKNFDIKKWYPDGTPLPEFYSEREGMKGWN 1410 ++ N+ RG ++F I + D E + + + GW Sbjct: 838 GKRQKRINKFIDRG-----------------RSFAILQ--ADPELALEHLLQLQEVFGWE 878 Query: 1411 LFQLMHRKARGDEVSNDKFGGKNYCAESNGNAADTLMLCASWVAQTDLSEFFKKW 1465 LF +H+ +DK KN S+ ++ S + +T+L F+K+W Sbjct: 879 LFMALHQSY------HDKPVHKNV---SDNEKIQQFVIRTSQITKTNLINFYKEW 924 >UniRef50_B4DBQ1 Putative uncharacterized protein n=1 Tax=Chthoniobacter flavus Ellin428 RepID=B4DBQ1_9BACT Length = 759 Score = 73.9 bits (180), Expect = 5e-11, Method: Compositional matrix adjust. Identities = 84/346 (24%), Positives = 142/346 (41%), Gaps = 48/346 (13%) Query: 1054 YPGAVSEEGQNV-TETISLYSNPTKWFAGNMQSTGLWAPAQKEVTIKSNANVP---VTVT 1109 +PGAV + + + + ++ W STGL+A + + ++ A + + V Sbjct: 361 FPGAVPKNAPRLPNRVVVIDTSVPAW-----HSTGLYAAPGELIKVQVPAELADKGLAVR 415 Query: 1110 VAL-ADDLTGREKHEVALNRPPRVTKTYSLDASGTVKFKVPYGGLIYIK-GNSSTNESAS 1167 + +D L EK + R P +++ + + P+GGLIYI+ + T + Sbjct: 416 IGCHSDSLFHLEKWQ----RAPEISRRDPIKTPASTAAN-PFGGLIYIEVPDKLTAAKVN 470 Query: 1168 FTFTGVVKAPFYKDGA-----WKNDLN-SPAPLGELESDAFVYTTPKKNLNASNYTGGLE 1221 +G V++P + G WK L +PAP ELE+ + + P + + + L Sbjct: 471 VAISGGVESPRFVLGETKLLEWKMRLRMAPAPWAELETKKVILSVPSEKIRQLDDPEALL 530 Query: 1222 QFANDLDTFASSMNDFYGRDSEDGKHRMFTYKNLPGHKHRFTNDVQISIGDAHSGYPVMN 1281 +F + + D + + T + R DVQIS G HSGYP+M Sbjct: 531 KFWDQI---------------LDAEADLATIPHERKRPERIVPDVQISAGYMHSGYPIMT 575 Query: 1282 --SSFSPNSTTLPTTPLNDWLIWHEVGHNAAETPLTVPGATEVANNVLALYMQDRYLGKM 1339 +S +L W +HE+GHN T G EV N+ +LY + GK Sbjct: 576 PLDKSVEHSLSLVEMKQGSWGHFHELGHNHQVGDWTFDGTVEVTCNLFSLYCMETLCGKP 635 Query: 1340 NRVADDITVAPEYLEESNNQAWARGGAGDR--------LLMYAQLK 1377 D + PE +E+ + +R L+MY QL+ Sbjct: 636 PGQGHD-AMKPEAVEKRLRGYLSSTDKFNRWKSDPFLALIMYHQLR 680 >UniRef50_A4IG42 Protein FAM115 n=14 Tax=Clupeocephala RepID=F115_DANRE Length = 912 Score = 72.8 bits (177), Expect = 9e-11, Method: Compositional matrix adjust. Identities = 68/228 (29%), Positives = 99/228 (43%), Gaps = 38/228 (16%) Query: 1126 LNRPPRVTKTYSLDASGTVKFKVPYGGLIYIKGNSSTN-ESASFTFTGVVKAPFYKDGA- 1183 L R P V + LD S V+ +GGLIY+ S T + V+AP++K G Sbjct: 584 LKRAPVVHARFPLD-SEMVQVWNLWGGLIYLIAPSQTKVDGVEIVVQNAVQAPYFKSGET 642 Query: 1184 ----WKNDL-NSPAPLGELESDAFVYTTPK---KNLNASNYTGGLEQFANDLDTFASSMN 1235 W + + +PAP ELE + + T +NL+ ++ A DT ++ Sbjct: 643 SVADWVSHIRQAPAPWAELEFENLIMTFDSAFIRNLDRP------DEVAKLWDTIMRTIT 696 Query: 1236 DFYGRDSEDGKHRMFTYKNLPGHKHRFTNDVQISIGDAHSGYPV-MNSSFSPNSTTLPTT 1294 D R + LP K RF DVQIS G H+GYP+ M+S +P + Sbjct: 697 DLAARPPK-----------LP-RKERFVADVQISYGFMHAGYPIMMHSGSAPGLVNVEEA 744 Query: 1295 -PLNDWLIWHEVGHNAA----ETPLTVPGATEVANNVLALYMQDRYLG 1337 W HE+GHN E P P TE N+ +LY+ ++ G Sbjct: 745 YKCGLWGAIHELGHNQQRGVWEFP---PHTTECTCNLWSLYVHEQVFG 789 >UniRef50_C2G0M2 Putative uncharacterized protein n=2 Tax=Sphingobacterium spiritivorum RepID=C2G0M2_9SPHI Length = 675 Score = 72.8 bits (177), Expect = 1e-10, Method: Compositional matrix adjust. Identities = 75/246 (30%), Positives = 109/246 (44%), Gaps = 25/246 (10%) Query: 1083 MQSTGLWAPAQKEVTIKS-NANVPVTVTV-ALADDLTGREKHEVALNRPPRVTKTYSLDA 1140 + STGL+AP + V I + +TV + A D+LTG+E L R P + L A Sbjct: 107 IYSTGLYAPPGENVKITVPEGLIGLTVQIGAHMDNLTGKE----TLKRDPVIYTVKEL-A 161 Query: 1141 SGTVKFKVPYGGLIYIKGNSSTNESASFTFTGVVKAPFYKDG-----AWKND-LNSPAPL 1194 G + YGG I+++ N + + F+G V+A + G AWK D L + P Sbjct: 162 PGVNYVRNLYGGTIWVRSNVARPIPVNLKFSGPVRASDFVHGQSDIAAWKKDVLANNVPW 221 Query: 1195 GELESDAFVYTTPKKNLNASNYTGGLEQFANDLDTFASSMNDFYGRDSEDGKHRMFTYKN 1254 E+ V T P+ N+ G NDL+ + N Y +D D T Sbjct: 222 LEIRGKHMVMTVPRANVVTFINQGRF----NDLNEVMAEWNIVYEKDYYDWMGLSATAAE 277 Query: 1255 L----PGHKHRFTNDVQISIGDAHSGYPVM---NSSFSPNSTTLPTTP-LNDWLIWHEVG 1306 + P R D+Q S+G AHSG+P + + + T L T N W +HEVG Sbjct: 278 VKNRYPEFPQRVVLDIQPSLGYAHSGFPWVAQNDLQWLDELTNLTTIHNGNSWGSYHEVG 337 Query: 1307 HNAAET 1312 HN +T Sbjct: 338 HNFQQT 343 >UniRef50_D2VFD2 Predicted protein n=1 Tax=Naegleria gruberi RepID=D2VFD2_NAEGR Length = 934 Score = 71.6 bits (174), Expect = 2e-10, Method: Compositional matrix adjust. Identities = 84/287 (29%), Positives = 125/287 (43%), Gaps = 50/287 (17%) Query: 1066 TETISL--YSNPTKWFAGNMQSTGLWA-PAQKEVTIKSNANVPVTVTVALADDLTGREKH 1122 TET+ + +N +W Q TG +A P I SN P V+ T H Sbjct: 372 TETLKMNVSTNRKRW-----QCTGFYALPGATLEVILSN---PKGVSNVRIGGHTDGIAH 423 Query: 1123 EVALNRPPRVTKTYSLDAS--GTVKFKVPYGGLIYIKGNSSTNESASFTFTG-VVKAPFY 1179 + +R P ++KT+S+ A+ T GG IYI+ SS S T G VVK PF+ Sbjct: 424 LDSWSRWPSISKTFSISATTNSTGTITCLNGGTIYIEV-SSIPLSVDVTVIGQVVKTPFF 482 Query: 1180 KDGA-----WKNDL-NSPAPLGELESDAFVYTTPKKNLNASNYTGGLEQFANDLDTFASS 1233 KDG W + + N P P E+ES+ F + N+ A+ L + + + + Sbjct: 483 KDGIHTDQEWNSTIRNYPGPWVEIESEHFAF-----NVQATPTARQLSEVSTVAKYWGNV 537 Query: 1234 MNDFY---GRDSEDGKHRMFTYKNLPGHKHRFTNDVQISIGDAHSGYPVMNSSFSP--NS 1288 + +Y R + D K RM D+QIS G HSG + S S N Sbjct: 538 VAMYYELSQRQTRDYKERM-------------QADIQISAGYMHSGLSFYDFSRSERWNY 584 Query: 1289 TTLPTTPLNDWLIWHEVGHNAAETPLTVPGATEVANNVLALYMQDRY 1335 ++ W +HE+GHN E T + EV N+ +LY+++ Y Sbjct: 585 SS------RRWGHYHELGHNFQEGAWTYDQSGEVTCNIFSLYLEEHY 625 >UniRef50_C2G4C5 Putative uncharacterized protein n=2 Tax=Sphingobacterium spiritivorum RepID=C2G4C5_9SPHI Length = 665 Score = 67.4 bits (163), Expect = 4e-09, Method: Compositional matrix adjust. Identities = 85/324 (26%), Positives = 129/324 (39%), Gaps = 34/324 (10%) Query: 1081 GNMQSTGLWAPAQKEVTIKSNANVPVTVTVALADDLTGREKHEVALNRPPRVTKTYSLDA 1140 G QST ++APA + + I V TG K E R ++ + L Sbjct: 101 GAWQSTSMFAPAGELIVIDVPQGVYGLKAQVGPHVYTGSTKIEFP-RRDEKIVVSKDL-F 158 Query: 1141 SGTVKFKVPYGGLIYIKGNSSTNESASFTFTGVVKAPFYK-----DGAWKNDLN-SPAPL 1194 G + YGGL+YI F+G AP +K D WK+ +N S P Sbjct: 159 PGKNYIRNLYGGLVYIIPERPLGRVVDLLFSGTTLAPSFKLGKMTDQQWKDLVNKSSVPW 218 Query: 1195 GELESDAFVYTTPKKNLNASNYTGGLEQFANDLDTFASSMNDFYG--RDSEDGKHRMFTY 1252 ELE + V+T + L E + D+ G + D KHR Sbjct: 219 FELEGNRIVFTLQTERLKRFPINSPTELMELWDKMIKEAYWDWTGMTEGNPDVKHRA--- 275 Query: 1253 KNLPGHKHRFTNDVQISIGDAH-SGYPV---MNSSFSPNSTTLPTTPLNDWLIWHEVGHN 1308 P +K R +DV G A SGYPV N + + T+ + +W +HE+GHN Sbjct: 276 ---PFNKWRIVHDVLFEPGVAQVSGYPVRAGANDQYFGQAVTINSVRTQNWGTYHELGHN 332 Query: 1309 AAETPL-TVPGATEVANNVLAL---YMQDRYLGKMNRV----ADDITVAPEYLEESNNQA 1360 + + + G EV NN+ + + R K+ V + I P+ ++N + Sbjct: 333 MQQGRVWSFDGNGEVTNNLFSFKVAMINGRQHTKIAEVWPTGLEWINYVPKDAADANRKI 392 Query: 1361 WA------RGGAGDRLLMYAQLKE 1378 WA + +L+MYAQ+ E Sbjct: 393 WANMPTLSKNHNDAKLIMYAQIFE 416 >UniRef50_D2VUE2 Predicted protein n=1 Tax=Naegleria gruberi RepID=D2VUE2_NAEGR Length = 808 Score = 65.5 bits (158), Expect = 2e-08, Method: Compositional matrix adjust. Identities = 46/165 (27%), Positives = 77/165 (46%), Gaps = 26/165 (15%) Query: 1181 DGAWKNDL-NSPAPLGELESDAFVYTTPKKNLNASNYTGGLEQFANDLDTFASSMNDFYG 1239 D WKN + N P P E+ESD FV+ S+Y L + +S+ +++ Sbjct: 349 DSDWKNTIRNYPGPWVEVESDYFVFNVE------SSYARNLTEL--------TSVTNYWK 394 Query: 1240 RDSEDGKHRMFTYKNLPGHKHRFTNDVQISIGDAHSGYPVMN---------SSFSPNSTT 1290 + E + + + + +K R D+ ISIG HSGYP+M + +P + Sbjct: 395 KVLE--LYYELSQRPIRDYKERMQIDIDISIGYMHSGYPIMAFQDQNEGTIKAVNPANMA 452 Query: 1291 LPTTPLNDWLIWHEVGHNAAETPLTVPGATEVANNVLALYMQDRY 1335 L + W +HE+GHN + T A EV N+ +LY+++ + Sbjct: 453 LASKTPGRWGHYHELGHNFQVSDWTYSQAVEVTCNIFSLYLEENF 497 >UniRef50_C3XPE8 Putative uncharacterized protein n=1 Tax=Branchiostoma floridae RepID=C3XPE8_BRAFL Length = 676 Score = 65.1 bits (157), Expect = 2e-08, Method: Compositional matrix adjust. Identities = 78/312 (25%), Positives = 126/312 (40%), Gaps = 61/312 (19%) Query: 1051 VEKYPGAVSEEGQNVTETISLYSNPTKWFAGNMQSTGLWAPAQKEVTIKSNA------NV 1104 + +PG EE Q ++ + S + STG + PA K +TIK+ + Sbjct: 298 INDFPGDFEEEPQLQCASVIIKSARKE-----RHSTGYYLPAGKVLTIKATTTGEHLDDW 352 Query: 1105 PVTVTVALADDLTGREKHEVALNRPPRVTKTYSLDASGTVKFKVPYGGLIYIKGNSSTNE 1164 V V A +D+L+G++K R P+++ L+A K PYGGLIY + S E Sbjct: 353 RVRVG-AHSDNLSGKKK----FKRWPKLSVVKRLEAE--TKISSPYGGLIYFE---SPKE 402 Query: 1165 SASFT--FTGVVKAPFY------KDGAWKNDLNSPAPLGELESDAFVYTTPKKNL-NASN 1215 + T VV+APFY W +P +L + ++T P ++ + + Sbjct: 403 AGDLTASMENVVEAPFYDLEDPRSVQNWSERRKAPGLWADLAGEHIIFTLPAASVRDLED 462 Query: 1216 YTGGLEQFANDLDTFASSMNDFYGRDSEDGKHRMFTYKNLPGHKHRFTNDVQISIGDAHS 1275 + GL + D + ++ G + G+HR R D Q G H+ Sbjct: 463 PSEGLRAW----DDVVKAHHELRGTNP-SGEHR-----------QRIVPDRQPKAGWMHA 506 Query: 1276 GYPVM-NSSFSPNSTTL----PTTPLNDWLIWHEVGHNAAETPLTVPGATE--------- 1321 GYP++ N + + +W ++HE+GHN T G E Sbjct: 507 GYPIVTNMDIAARDKFILDGKKIRKAGNWGLFHELGHNMQRKWWTFEGTGEDGAQHDVWK 566 Query: 1322 -VANNVLALYMQ 1332 A LA+Y Q Sbjct: 567 KKAGIALAVYAQ 578 >UniRef50_A2EC39 Putative uncharacterized protein n=1 Tax=Trichomonas vaginalis RepID=A2EC39_TRIVA Length = 734 Score = 62.0 bits (149), Expect = 2e-07, Method: Compositional matrix adjust. Identities = 47/194 (24%), Positives = 80/194 (41%), Gaps = 10/194 (5%) Query: 1085 STGLWAPAQKEVTIKSNANVPVTVTVALADDLTGREKHEVALNRPPRVTKTYSLDASGTV 1144 STGLW PA I S+ ++ + + R P V+KTY +D SG Sbjct: 393 STGLWLPAGSLGEIISDPEDKSIFSIQIGSHTESLLSRQGPWKRWPVVSKTYDIDPSGKT 452 Query: 1145 KFKVPYGGLIYI---KGNSSTNESASFTFTGVVKAP---FYKDGAWKNDLNSPAPLGELE 1198 + P+GG++YI + N T+ F G V+ P K W++ + P GE+ Sbjct: 453 EIASPFGGIVYITVRELNEETDNRVKLKFNGFVRHPRAVIRKPEIWESSKDYQVPWGEMC 512 Query: 1199 SDAFVYTTPKKNLNASNYTGGLEQFANDLDTFASSMNDFYGRDSEDGKHRMFTYKNLPGH 1258 S ++T P + +++ +D + + +F S + +R+ LPG Sbjct: 513 SKTVIFTLPSDEIRK---ITDIDKVLEHVDKVVTHVLEFMNA-SMNRPYRVVFDTQLPGD 568 Query: 1259 KHRFTNDVQISIGD 1272 K + + I D Sbjct: 569 KTESEYPIVLDIKD 582 >UniRef50_UPI000180C854 PREDICTED: similar to transmembrane agrin n=1 Tax=Ciona intestinalis RepID=UPI000180C854 Length = 2114 Score = 59.3 bits (142), Expect = 1e-06, Method: Compositional matrix adjust. Identities = 30/73 (41%), Positives = 39/73 (53%), Gaps = 22/73 (30%) Query: 38 VDSGTGSLPEVKPDPTPN----------------------PEPTPEPTPDPEPTPEPIPD 75 + + T +P V+P+PTPN PEPTP+ P+PEPT +P P+ Sbjct: 1115 ITTTTHPVPVVEPEPTPNAKPEPEPTSKPEPEPEPTPNAKPEPTPKSEPEPEPTSKPEPE 1174 Query: 76 PEPTPEPEPEPVP 88 PEPT PEPEP P Sbjct: 1175 PEPTSNPEPEPTP 1187 Score = 58.9 bits (141), Expect = 2e-06, Method: Compositional matrix adjust. Identities = 25/41 (60%), Positives = 28/41 (68%) Query: 49 KPDPTPNPEPTPEPTPDPEPTPEPIPDPEPTPEPEPEPVPT 89 KP+PTP EP PEPT PEP PEP +PEP P P +P PT Sbjct: 1154 KPEPTPKSEPEPEPTSKPEPEPEPTSNPEPEPTPNAKPEPT 1194 Score = 54.7 bits (130), Expect = 3e-05, Method: Compositional matrix adjust. Identities = 25/41 (60%), Positives = 29/41 (70%), Gaps = 2/41 (4%) Query: 47 EVKPDPTPNPEPTPEPT--PDPEPTPEPIPDPEPTPEPEPE 85 E +P+PT PEP PEPT P+PEPTP P+P PEPEPE Sbjct: 1162 EPEPEPTSKPEPEPEPTSNPEPEPTPNAKPEPTSNPEPEPE 1202 Score = 49.7 bits (117), Expect = 9e-04, Method: Compositional matrix adjust. Identities = 26/52 (50%), Positives = 31/52 (59%), Gaps = 6/52 (11%) Query: 44 SLPEVKPDPTPNPEPTPEPTPDPEPTPEPIPDPEPT------PEPEPEPVPT 89 S PE +P+PT NPEP P P PEPT P P+PE T P+ EPE + T Sbjct: 1169 SKPEPEPEPTSNPEPEPTPNAKPEPTSNPEPEPERTTKTPLVPKSEPETLAT 1220 Score = 44.7 bits (104), Expect = 0.032, Method: Compositional matrix adjust. Identities = 27/75 (36%), Positives = 34/75 (45%), Gaps = 22/75 (29%) Query: 37 PVDSGTGSLPEVKPDPTPNPEPTPEPTPD----------------------PEPTPEPIP 74 PV T + ++ P P PEPTP+ PEPTP+ P Sbjct: 1104 PVTIETTRMVQITTTTHPVPVVEPEPTPNAKPEPEPTSKPEPEPEPTPNAKPEPTPKSEP 1163 Query: 75 DPEPTPEPEPEPVPT 89 +PEPT +PEPEP PT Sbjct: 1164 EPEPTSKPEPEPEPT 1178 >UniRef50_D1BQB2 Putative uncharacterized protein n=1 Tax=Veillonella parvula DSM 2008 RepID=D1BQB2_VEIPT Length = 467 Score = 52.0 bits (123), Expect = 2e-04, Method: Compositional matrix adjust. Identities = 26/51 (50%), Positives = 32/51 (62%) Query: 46 PEVKPDPTPNPEPTPEPTPDPEPTPEPIPDPEPTPEPEPEPVPTKTGYLTL 96 PEVKP P P P+P +P P P P PE P P+PTP+PE +P P T L + Sbjct: 413 PEVKPAPQPTPKPEVKPAPVPTPKPEVKPAPQPTPKPEVKPAPVPTPKLEV 463 Score = 51.6 bits (122), Expect = 2e-04, Method: Compositional matrix adjust. Identities = 27/46 (58%), Positives = 31/46 (67%), Gaps = 2/46 (4%) Query: 46 PEVKPDPTPNPEPTPEPTPDPEPTPEPIPDPEPTPEPE--PEPVPT 89 PEVKP P P P+P +P P P P PE P P+PTP+PE P PVPT Sbjct: 389 PEVKPAPQPTPKPEVKPAPVPTPKPEVKPAPQPTPKPEVKPAPVPT 434 Score = 50.4 bits (119), Expect = 5e-04, Method: Compositional matrix adjust. Identities = 25/46 (54%), Positives = 29/46 (63%) Query: 46 PEVKPDPTPNPEPTPEPTPDPEPTPEPIPDPEPTPEPEPEPVPTKT 91 PEVKP P P P+P +P P P P PE P P PTP+PE +P P T Sbjct: 401 PEVKPAPVPTPKPEVKPAPQPTPKPEVKPAPVPTPKPEVKPAPQPT 446 Score = 48.5 bits (114), Expect = 0.002, Method: Compositional matrix adjust. Identities = 24/46 (52%), Positives = 29/46 (63%) Query: 46 PEVKPDPTPNPEPTPEPTPDPEPTPEPIPDPEPTPEPEPEPVPTKT 91 PEV+P P P P+P +P P P P PE P P PTP+PE +P P T Sbjct: 377 PEVQPAPAPMPKPEVKPAPQPTPKPEVKPAPVPTPKPEVKPAPQPT 422 Score = 45.4 bits (106), Expect = 0.018, Method: Compositional matrix adjust. Identities = 23/43 (53%), Positives = 27/43 (62%) Query: 46 PEVKPDPTPNPEPTPEPTPDPEPTPEPIPDPEPTPEPEPEPVP 88 PEVKP P P P+P +P P P P PE P P PTP+ E +P P Sbjct: 425 PEVKPAPVPTPKPEVKPAPQPTPKPEVKPAPVPTPKLEVKPTP 467 Score = 43.5 bits (101), Expect = 0.072, Method: Compositional matrix adjust. Identities = 21/38 (55%), Positives = 24/38 (63%) Query: 52 PTPNPEPTPEPTPDPEPTPEPIPDPEPTPEPEPEPVPT 89 PTP PE P P P P+P +P P P P PE +P PVPT Sbjct: 373 PTPKPEVQPAPAPMPKPEVKPAPQPTPKPEVKPAPVPT 410 >UniRef50_B9GV44 Predicted protein n=3 Tax=Populus trichocarpa RepID=B9GV44_POPTR Length = 452 Score = 52.0 bits (123), Expect = 2e-04, Method: Compositional matrix adjust. Identities = 28/68 (41%), Positives = 34/68 (50%), Gaps = 2/68 (2%) Query: 50 PDPTPNPEPTPEPTPDPEPTPEPIPDPEPTPEPEPEPVPTKTGYLTLGGSQRVTGATCNG 109 PDPTP+ P P+PTP P P EP P P P P+P P P P TL S + T Sbjct: 60 PDPTPSQAPVPDPTPSPAPVHEPTPSPAPVPDPTPNPAPAPDS--TLSPSPAASTTTLTS 117 Query: 110 ESSDGFTF 117 S+ +F Sbjct: 118 RVSENISF 125 >UniRef50_A9A5M1 Putative uncharacterized protein n=1 Tax=Nitrosopumilus maritimus SCM1 RepID=A9A5M1_NITMS Length = 268 Score = 47.8 bits (112), Expect = 0.003, Method: Compositional matrix adjust. Identities = 30/55 (54%), Positives = 35/55 (63%), Gaps = 4/55 (7%) Query: 37 PVDSGTGSLPEVKP--DPTPNPEPTPEPTPDPEPTPEPIPDPEPT--PEPEPEPV 87 P + PE +P +PTP PEP EPTP+PEP EP P+PEP P PEPEPV Sbjct: 29 PTEPVVEPTPEPEPVVEPTPEPEPVVEPTPEPEPVVEPTPEPEPVVEPTPEPEPV 83 Score = 47.4 bits (111), Expect = 0.005, Method: Compositional matrix adjust. Identities = 32/74 (43%), Positives = 43/74 (58%), Gaps = 2/74 (2%) Query: 35 TPPVDSGTGSLPEVKP--DPTPNPEPTPEPTPDPEPTPEPIPDPEPTPEPEPEPVPTKTG 92 TP + PE +P +PTP PEP EPTP+PEP EP P+PEP EP PEP ++ Sbjct: 57 TPEPEPVVEPTPEPEPVVEPTPEPEPVVEPTPEPEPVVEPTPEPEPVVEPTPEPEESQLP 116 Query: 93 YLTLGGSQRVTGAT 106 +++ S T +T Sbjct: 117 EISIKTSYDETPST 130 Score = 47.4 bits (111), Expect = 0.005, Method: Compositional matrix adjust. Identities = 31/57 (54%), Positives = 36/57 (63%), Gaps = 4/57 (7%) Query: 35 TPPVDSGTGSLPEVKP--DPTPNPEPTPEPTPDPEPTPEPIPDPEPT--PEPEPEPV 87 TP + PE +P +PTP PEP EPTP+PEP EP P+PEP P PEPEPV Sbjct: 37 TPEPEPVVEPTPEPEPVVEPTPEPEPVVEPTPEPEPVVEPTPEPEPVVEPTPEPEPV 93 Score = 47.4 bits (111), Expect = 0.005, Method: Compositional matrix adjust. Identities = 31/57 (54%), Positives = 36/57 (63%), Gaps = 4/57 (7%) Query: 35 TPPVDSGTGSLPEVKP--DPTPNPEPTPEPTPDPEPTPEPIPDPEPT--PEPEPEPV 87 TP + PE +P +PTP PEP EPTP+PEP EP P+PEP P PEPEPV Sbjct: 47 TPEPEPVVEPTPEPEPVVEPTPEPEPVVEPTPEPEPVVEPTPEPEPVVEPTPEPEPV 103 Score = 45.8 bits (107), Expect = 0.014, Method: Compositional matrix adjust. Identities = 28/44 (63%), Positives = 31/44 (70%), Gaps = 6/44 (13%) Query: 51 DPTPNPEPTPEPTPDPEPTPEPIPDPEP----TPEPEP--EPVP 88 +PTP PEP EPTP+PEP EP P+PEP TPEPEP EP P Sbjct: 35 EPTPEPEPVVEPTPEPEPVVEPTPEPEPVVEPTPEPEPVVEPTP 78 >UniRef50_A0YJH0 Polymorphic membrane protein n=1 Tax=Lyngbya sp. PCC 8106 RepID=A0YJH0_9CYAN Length = 2103 Score = 46.6 bits (109), Expect = 0.008, Method: Compositional matrix adjust. Identities = 26/64 (40%), Positives = 31/64 (48%), Gaps = 18/64 (28%) Query: 46 PEVKPDPTPNPEPTPEPTPDPEPTPEPIPDP------------------EPTPEPEPEPV 87 P ++P PTP E PEP+ +P PT EP P+P E TPEP EP Sbjct: 1564 PSIEPTPTPTIELIPEPSIEPTPTIEPTPEPSIEPTPTPTFEPTPTPTIELTPEPSIEPT 1623 Query: 88 PTKT 91 PT T Sbjct: 1624 PTPT 1627 Score = 45.8 bits (107), Expect = 0.013, Method: Compositional matrix adjust. Identities = 26/48 (54%), Positives = 31/48 (64%), Gaps = 8/48 (16%) Query: 50 PDPTPNPEPTPEPTPDP--EPTPEP----IPDP--EPTPEPEPEPVPT 89 P+P+ P PT EPTP+P EPTP P IP+P EPTP EP P P+ Sbjct: 1548 PEPSVEPTPTIEPTPEPSIEPTPTPTIELIPEPSIEPTPTIEPTPEPS 1595 >UniRef50_Q8YLY9 All5153 protein n=6 Tax=Nostocaceae RepID=Q8YLY9_ANASP Length = 499 Score = 46.6 bits (109), Expect = 0.008, Method: Compositional matrix adjust. Identities = 37/85 (43%), Positives = 43/85 (50%), Gaps = 4/85 (4%) Query: 26 GGGSGSSSDTPPVDSGTGSLPEVKPDPTPNPEPTPEPTP--DPEPTPE--PIPDPEPTPE 81 GGG S P S T +LPEV P P PE TP P+P PEP PE P P TPE Sbjct: 370 GGGFFSRPRPQPSPSLTPTLPEVTESPLPQPEVTPIPSPTITPEPQPEVTPTASPTVTPE 429 Query: 82 PEPEPVPTKTGYLTLGGSQRVTGAT 106 P+PE PT + +T VT + Sbjct: 430 PQPEVTPTASPTITPEPQPEVTPTS 454 >UniRef50_C0WIN3 Putative uncharacterized protein n=1 Tax=Corynebacterium accolens ATCC 49725 RepID=C0WIN3_9CORY Length = 323 Score = 46.2 bits (108), Expect = 0.010, Method: Composition-based stats. Identities = 29/91 (31%), Positives = 40/91 (43%), Gaps = 2/91 (2%) Query: 22 AGCDGGGSGSSSDTPPVDSGTGSLPEVKPDPTPNPEPTPEPTPDPEPTPEPIPDPEPTPE 81 +GC + PP + + + P DP P PEP P+ P PE +P P PE P Sbjct: 57 SGCQCQQCRGFVNEPPRQNESTTTP-AHADPNPEPEPKPQDCP-PEQQEQPAPQPEQDPV 114 Query: 82 PEPEPVPTKTGYLTLGGSQRVTGATCNGESS 112 PEP P ++ + A C+ ESS Sbjct: 115 PEPVPEDSECEKEPTTPAAVEDDAACDIESS 145 >UniRef50_B7P4H6 Putative uncharacterized protein n=1 Tax=Ixodes scapularis RepID=B7P4H6_IXOSC Length = 1255 Score = 45.4 bits (106), Expect = 0.019, Method: Compositional matrix adjust. Identities = 21/41 (51%), Positives = 26/41 (63%) Query: 46 PEVKPDPTPNPEPTPEPTPDPEPTPEPIPDPEPTPEPEPEP 86 P KP P P+ EP PEP+ +P P P P PEP+ EP P+P Sbjct: 302 PSAKPTPEPSAEPAPEPSAEPVPEPSAEPVPEPSAEPAPQP 342 Score = 44.7 bits (104), Expect = 0.029, Method: Compositional matrix adjust. Identities = 21/43 (48%), Positives = 25/43 (58%) Query: 46 PEVKPDPTPNPEPTPEPTPDPEPTPEPIPDPEPTPEPEPEPVP 88 P ++P P PEP+ EP P+P P P P EP PEP EP P Sbjct: 298 PVLEPSAKPTPEPSAEPAPEPSAEPVPEPSAEPVPEPSAEPAP 340 >UniRef50_C2M7W3 Putative uncharacterized protein n=1 Tax=Capnocytophaga gingivalis ATCC 33624 RepID=C2M7W3_CAPGI Length = 1067 Score = 45.1 bits (105), Expect = 0.025, Method: Compositional matrix adjust. Identities = 16/29 (55%), Positives = 21/29 (72%) Query: 60 PEPTPDPEPTPEPIPDPEPTPEPEPEPVP 88 P+P P P P P+P+P P+P P P+PEP P Sbjct: 935 PKPDPKPAPEPKPVPKPDPIPAPKPEPTP 963 Score = 44.3 bits (103), Expect = 0.043, Method: Compositional matrix adjust. Identities = 18/29 (62%), Positives = 20/29 (68%) Query: 49 KPDPTPNPEPTPEPTPDPEPTPEPIPDPE 77 KPDP P PEP P P PDP P P+P P P+ Sbjct: 936 KPDPKPAPEPKPVPKPDPIPAPKPEPTPD 964 Score = 43.5 bits (101), Expect = 0.073, Method: Compositional matrix adjust. Identities = 16/32 (50%), Positives = 23/32 (71%) Query: 56 PEPTPEPTPDPEPTPEPIPDPEPTPEPEPEPV 87 P+P P+P P+P+P P+P P P P PEP P+ + Sbjct: 935 PKPDPKPAPEPKPVPKPDPIPAPKPEPTPDSI 966 >UniRef50_Q99109 Rep1-C n=1 Tax=Ustilago maydis RepID=REP1_USTMA Length = 652 Score = 44.7 bits (104), Expect = 0.029, Method: Compositional matrix adjust. Identities = 18/38 (47%), Positives = 21/38 (55%) Query: 44 SLPEVKPDPTPNPEPTPEPTPDPEPTPEPIPDPEPTPE 81 S P KP+P P P PEP P +P P+P PEP P Sbjct: 503 SKPTTKPEPKPQPSDKPEPKPSDKPEPKPSDKPEPKPS 540 >UniRef50_B4FNV9 Early nodulin 75 protein n=2 Tax=Zea mays RepID=B4FNV9_MAIZE Length = 279 Score = 43.5 bits (101), Expect = 0.067, Method: Compositional matrix adjust. Identities = 26/49 (53%), Positives = 32/49 (65%) Query: 39 DSGTGSLPEVKPDPTPNPEPTPEPTPDPEPTPEPIPDPEPTPEPEPEPV 87 D G P+ KP+P P P+P PEP P P P PEP P P+P P+PEP+P Sbjct: 146 DPKPGPQPDPKPEPKPTPQPGPEPKPKPAPQPEPKPTPQPDPKPEPKPT 194 Searching..................................................done Results from round 2 Score E Sequences producing significant alignments: (bits) Value Sequences used in model and found again: UniRef50_Q46837 Putative lipoprotein acfD homolog n=40 Tax=Gamma... 2222 0.0 UniRef50_A6Y2J7 Large exoproteins involved in heme utilization o... 1895 0.0 UniRef50_A5F372 Accessory colonization factor AcfD n=25 Tax=Vibr... 1840 0.0 UniRef50_D0YX39 Accessory colonization factor AcfD n=1 Tax=Photo... 1740 0.0 UniRef50_A6AXB7 AcfD n=3 Tax=Gammaproteobacteria RepID=A6AXB7_VIBPA 1730 0.0 UniRef50_A6ALV4 AcfD n=5 Tax=Vibrio RepID=A6ALV4_VIBHA 1703 0.0 UniRef50_A8H5W9 Inner membrane lipoprotein n=1 Tax=Shewanella pe... 1664 0.0 UniRef50_B5FBF1 Inner membrane lipoprotein n=1 Tax=Vibrio fische... 1660 0.0 UniRef50_Q5E705 Accessory colonization factor AcfD-like protein,... 1421 0.0 UniRef50_UPI000178AA33 hypothetical protein GYMC10_4678 n=1 Tax=... 399 e-109 UniRef50_UPI0001B9ED55 hypothetical protein GYMC10_4682 n=1 Tax=... 382 e-104 UniRef50_B4DBQ1 Putative uncharacterized protein n=1 Tax=Chthoni... 358 9e-97 UniRef50_UPI00017445F8 hypothetical protein VspiD_04825 n=1 Tax=... 330 3e-88 UniRef50_A4GHK6 Putative uncharacterized protein n=1 Tax=uncultu... 322 6e-86 UniRef50_C1ZFD9 Putative uncharacterized protein n=1 Tax=Plancto... 318 1e-84 UniRef50_B2ULE8 Putative uncharacterized protein n=1 Tax=Akkerma... 296 4e-78 UniRef50_C2G4C5 Putative uncharacterized protein n=2 Tax=Sphingo... 282 6e-74 UniRef50_D2VFD2 Predicted protein n=1 Tax=Naegleria gruberi RepI... 265 1e-68 UniRef50_A4IG42 Protein FAM115 n=14 Tax=Clupeocephala RepID=F115... 258 1e-66 UniRef50_C2G0M2 Putative uncharacterized protein n=2 Tax=Sphingo... 253 4e-65 UniRef50_C3XPE8 Putative uncharacterized protein n=1 Tax=Branchi... 244 2e-62 UniRef50_A2EC39 Putative uncharacterized protein n=1 Tax=Trichom... 196 5e-48 UniRef50_D2VUE2 Predicted protein n=1 Tax=Naegleria gruberi RepI... 176 5e-42 UniRef50_B8B8E6 Putative uncharacterized protein n=5 Tax=cellula... 142 1e-31 UniRef50_B9GV44 Predicted protein n=3 Tax=Populus trichocarpa Re... 84 5e-14 UniRef50_D1BQB2 Putative uncharacterized protein n=1 Tax=Veillon... 74 5e-11 UniRef50_UPI000180C854 PREDICTED: similar to transmembrane agrin... 64 6e-08 Sequences not found previously or not previously below threshold: UniRef50_Q9Y4C2 Protein FAM115A n=54 Tax=Amniota RepID=F115A_HUMAN 238 1e-60 UniRef50_Q5XHI4 Protein FAM115 n=4 Tax=Anura RepID=F115_XENLA 232 1e-58 UniRef50_B4DK02 cDNA FLJ57809 n=1 Tax=Homo sapiens RepID=B4DK02_... 220 3e-55 UniRef50_UPI000155BFC0 PREDICTED: similar to FLJ00264 protein n=... 217 3e-54 UniRef50_C7PHT0 Putative uncharacterized protein n=1 Tax=Chitino... 195 9e-48 UniRef50_B6A9M3 Putative uncharacterized protein n=1 Tax=Cryptos... 189 7e-46 UniRef50_C5LZE4 Putative uncharacterized protein n=1 Tax=Perkins... 185 1e-44 UniRef50_UPI0001B7B86F Similar to experimental autoimmune prosta... 164 2e-38 UniRef50_A9VC57 Predicted protein n=1 Tax=Monosiga brevicollis R... 162 1e-37 UniRef50_A3FQI6 Putative uncharacterized protein n=3 Tax=Cryptos... 158 2e-36 UniRef50_UPI0001692BE8 S-layer domain protein n=1 Tax=Paenibacil... 149 1e-33 UniRef50_B7INX6 Wall-associated protein n=62 Tax=Bacillus cereus... 148 2e-33 UniRef50_A9VG09 S-layer domain protein n=29 Tax=Bacillus cereus ... 141 2e-31 UniRef50_C2U768 S-layer domain protein n=4 Tax=Bacillus cereus R... 137 4e-30 UniRef50_A1JU21 Putative exported protein n=4 Tax=Yersinia RepID... 137 4e-30 UniRef50_C3BVL2 S-layer domain protein n=1 Tax=Bacillus pseudomy... 132 1e-28 UniRef50_A7GU48 S-layer domain protein n=5 Tax=Bacillus cereus g... 126 6e-27 UniRef50_C7MAR4 Putative uncharacterized protein n=1 Tax=Brachyb... 124 3e-26 UniRef50_A2DZU5 Putative uncharacterized protein n=1 Tax=Trichom... 119 8e-25 UniRef50_UPI0001BC7CE9 hypothetical protein BacD2_03774 n=1 Tax=... 119 1e-24 UniRef50_C2G2A5 Putative uncharacterized protein n=2 Tax=Sphingo... 119 1e-24 UniRef50_A2E8P4 Putative uncharacterized protein n=1 Tax=Trichom... 116 6e-24 UniRef50_B7HIS0 S-layer domain protein n=19 Tax=Bacillus RepID=B... 116 7e-24 UniRef50_C3XY93 Putative uncharacterized protein n=1 Tax=Branchi... 116 7e-24 UniRef50_Q2U0W8 Predicted protein n=2 Tax=Aspergillus RepID=Q2U0... 116 9e-24 UniRef50_C9KXI1 Putative uncharacterized protein n=1 Tax=Bactero... 115 1e-23 UniRef50_C6IEC9 Coagulation factor 5/8 type n=2 Tax=Bacteroides ... 115 1e-23 UniRef50_A2F4J7 Immuno-dominant variable surface antigen-like n=... 115 1e-23 UniRef50_C7BLH9 Putative uncharacterized protein n=1 Tax=Photorh... 114 2e-23 UniRef50_C2C0C9 Possible wall-associated protein n=1 Tax=Listeri... 113 7e-23 UniRef50_A2EPB9 Putative uncharacterized protein n=1 Tax=Trichom... 112 1e-22 UniRef50_A2FC48 Putative uncharacterized protein n=2 Tax=Trichom... 111 2e-22 UniRef50_B1V640 Putative antigenic protein NP1 n=1 Tax=Clostridi... 110 4e-22 UniRef50_C2G2G0 Possible wall-associated protein n=2 Tax=Sphingo... 109 9e-22 UniRef50_A2EKB8 Putative uncharacterized protein n=1 Tax=Trichom... 109 9e-22 UniRef50_Q87WH6 Putative uncharacterized protein n=3 Tax=Pseudom... 105 1e-20 UniRef50_A2DDW5 Immuno-dominant variable surface antigen-like n=... 104 3e-20 UniRef50_C2FV33 Possible wall-associated protein n=2 Tax=Sphingo... 104 4e-20 UniRef50_A2F335 Immuno-dominant variable surface antigen-like n=... 103 5e-20 UniRef50_A5ZEQ5 Putative uncharacterized protein n=1 Tax=Bactero... 102 1e-19 UniRef50_Q4ZNJ4 Putative uncharacterized protein n=1 Tax=Pseudom... 101 2e-19 UniRef50_A2DY87 Putative uncharacterized protein n=1 Tax=Trichom... 100 5e-19 UniRef50_Q8EW84 Predicted integral membrane protein n=1 Tax=Myco... 100 5e-19 UniRef50_A2GCT2 Putative uncharacterized protein n=1 Tax=Trichom... 100 9e-19 UniRef50_Q5LB88 Putative lipoprotein n=9 Tax=Bacteroides RepID=Q... 99 1e-18 UniRef50_A5ZF31 Putative uncharacterized protein n=1 Tax=Bactero... 98 3e-18 UniRef50_B1KGL9 Putative uncharacterized protein n=4 Tax=Shewane... 95 3e-17 UniRef50_A2DW23 Putative uncharacterized protein n=1 Tax=Trichom... 93 9e-17 UniRef50_A2FU45 Immuno-dominant variable surface antigen-like n=... 92 1e-16 UniRef50_A2F153 Immuno-dominant variable surface antigen-like n=... 92 2e-16 UniRef50_C9YCF1 Putative uncharacterized protein n=1 Tax=Curviba... 91 3e-16 UniRef50_C5PJT0 Lipoprotein n=2 Tax=Sphingobacterium spiritivoru... 89 1e-15 UniRef50_B2V178 Fibronectin type III domain protein n=2 Tax=Clos... 88 2e-15 UniRef50_A5ZED4 Putative uncharacterized protein n=1 Tax=Bactero... 87 6e-15 UniRef50_B7V4F8 Putative uncharacterized protein n=7 Tax=Pseudom... 86 1e-14 UniRef50_B2UPI7 Putative uncharacterized protein n=3 Tax=Bacteri... 83 1e-13 UniRef50_B2UP41 Putative lipoprotein n=1 Tax=Akkermansia mucinip... 83 1e-13 UniRef50_A2DKX5 Immuno-dominant variable surface antigen-like n=... 81 3e-13 UniRef50_UPI000197B0FB hypothetical protein BACCOPRO_00998 n=1 T... 81 3e-13 UniRef50_D1PPF9 Putative fibronectin type III domain protein n=1... 79 1e-12 UniRef50_B2UQK5 Putative uncharacterized protein n=1 Tax=Akkerma... 78 4e-12 UniRef50_A5ZFW6 Putative uncharacterized protein n=1 Tax=Bactero... 77 5e-12 UniRef50_B0N0I0 Putative uncharacterized protein n=3 Tax=Bacteri... 75 2e-11 UniRef50_Q7MJ28 Putative uncharacterized protein VV2335 n=13 Tax... 74 3e-11 UniRef50_C1I981 Leucine rich repeat domain-containing protein n=... 74 6e-11 UniRef50_A5ZER3 Putative uncharacterized protein n=1 Tax=Bactero... 74 6e-11 UniRef50_A2EK19 Putative uncharacterized protein n=1 Tax=Trichom... 73 7e-11 UniRef50_UPI0001C36412 coagulation factor 5/8 type domain protei... 73 7e-11 UniRef50_B6FXD4 Putative uncharacterized protein n=1 Tax=Clostri... 73 9e-11 UniRef50_Q2SHR7 Putative uncharacterized protein n=2 Tax=Gammapr... 73 1e-10 UniRef50_A9MNV3 Putative uncharacterized protein n=1 Tax=Salmone... 72 2e-10 UniRef50_A8R9D2 Putative uncharacterized protein n=1 Tax=Eubacte... 69 9e-10 UniRef50_C8WI41 Coagulation factor 5/8 type domain protein n=1 T... 69 1e-09 UniRef50_A5ZKM8 Putative uncharacterized protein n=3 Tax=Bactero... 69 1e-09 UniRef50_B0A9L1 Putative uncharacterized protein n=1 Tax=Clostri... 69 2e-09 UniRef50_C8QBU1 Putative uncharacterized protein n=1 Tax=Pantoea... 68 2e-09 UniRef50_A5ZG25 Putative uncharacterized protein n=1 Tax=Bactero... 68 3e-09 UniRef50_B0A9L0 Putative uncharacterized protein n=1 Tax=Clostri... 68 3e-09 UniRef50_C5VKR9 Putative uncharacterized protein n=1 Tax=Prevote... 66 9e-09 UniRef50_Q8EVX9 Putative uncharacterized protein MYPE4300 n=1 Ta... 66 2e-08 UniRef50_Q183N3 Putative exported protein n=9 Tax=Clostridium di... 63 8e-08 UniRef50_B4FNV9 Early nodulin 75 protein n=2 Tax=Zea mays RepID=... 58 2e-06 UniRef50_A0YJH0 Polymorphic membrane protein n=1 Tax=Lyngbya sp.... 51 5e-04 UniRef50_A5ZN83 Putative uncharacterized protein n=1 Tax=Ruminoc... 49 0.001 UniRef50_Q7U5X7 Putative uncharacterized protein n=1 Tax=Synecho... 48 0.003 UniRef50_C2M7W3 Putative uncharacterized protein n=1 Tax=Capnocy... 48 0.003 UniRef50_D2NSM0 Putative uncharacterized protein n=1 Tax=Rothia ... 47 0.006 UniRef50_UPI0001695557 hypothetical protein Plarl_10627 n=1 Tax=... 47 0.007 UniRef50_C0WIN3 Putative uncharacterized protein n=1 Tax=Coryneb... 46 0.008 UniRef50_B0BZP0 Putative uncharacterized protein n=2 Tax=Bacteri... 45 0.027 >UniRef50_Q46837 Putative lipoprotein acfD homolog n=40 Tax=Gammaproteobacteria RepID=ACFD_ECOLI Length = 1520 Score = 2222 bits (5758), Expect = 0.0, Method: Composition-based stats. Identities = 1520/1520 (100%), Positives = 1520/1520 (100%) Query: 1 MNKKFKYKKSLLAAILSATLLAGCDGGGSGSSSDTPPVDSGTGSLPEVKPDPTPNPEPTP 60 MNKKFKYKKSLLAAILSATLLAGCDGGGSGSSSDTPPVDSGTGSLPEVKPDPTPNPEPTP Sbjct: 1 MNKKFKYKKSLLAAILSATLLAGCDGGGSGSSSDTPPVDSGTGSLPEVKPDPTPNPEPTP 60 Query: 61 EPTPDPEPTPEPIPDPEPTPEPEPEPVPTKTGYLTLGGSQRVTGATCNGESSDGFTFKPG 120 EPTPDPEPTPEPIPDPEPTPEPEPEPVPTKTGYLTLGGSQRVTGATCNGESSDGFTFKPG Sbjct: 61 EPTPDPEPTPEPIPDPEPTPEPEPEPVPTKTGYLTLGGSQRVTGATCNGESSDGFTFKPG 120 Query: 121 EDVTCVAGNTTIATFNTQSEAARSLRAVEKVSFSLEDAQELAGSDDKKSNAVSLVTSSNS 180 EDVTCVAGNTTIATFNTQSEAARSLRAVEKVSFSLEDAQELAGSDDKKSNAVSLVTSSNS Sbjct: 121 EDVTCVAGNTTIATFNTQSEAARSLRAVEKVSFSLEDAQELAGSDDKKSNAVSLVTSSNS 180 Query: 181 CPANTEQVCLTFSSVIESKRFDSLYKQIDLAPEEFKKLVNEEVENNAATDKAPSTHTSPV 240 CPANTEQVCLTFSSVIESKRFDSLYKQIDLAPEEFKKLVNEEVENNAATDKAPSTHTSPV Sbjct: 181 CPANTEQVCLTFSSVIESKRFDSLYKQIDLAPEEFKKLVNEEVENNAATDKAPSTHTSPV 240 Query: 241 VPVTTPGTKPDLNASFVSANAEQFYQYQPTEIILSEGRLVDSQGYGVAGVNYYTNSGRGV 300 VPVTTPGTKPDLNASFVSANAEQFYQYQPTEIILSEGRLVDSQGYGVAGVNYYTNSGRGV Sbjct: 241 VPVTTPGTKPDLNASFVSANAEQFYQYQPTEIILSEGRLVDSQGYGVAGVNYYTNSGRGV 300 Query: 301 TGENGEFSFSWGETISFGIDTFELGSVRGNKSTIALTELGDEVRGANIDQLIHRYSTTGQ 360 TGENGEFSFSWGETISFGIDTFELGSVRGNKSTIALTELGDEVRGANIDQLIHRYSTTGQ Sbjct: 301 TGENGEFSFSWGETISFGIDTFELGSVRGNKSTIALTELGDEVRGANIDQLIHRYSTTGQ 360 Query: 361 NNTRVVPDDVRKVFAEYPNVINEIINLSLSNGATLGEGEQVVNLPNEFIEQFNTGQAKEI 420 NNTRVVPDDVRKVFAEYPNVINEIINLSLSNGATLGEGEQVVNLPNEFIEQFNTGQAKEI Sbjct: 361 NNTRVVPDDVRKVFAEYPNVINEIINLSLSNGATLGEGEQVVNLPNEFIEQFNTGQAKEI 420 Query: 421 DTAICAKTDGCNEARWFSLTTRNVNDGQIQGVINKLWGVDTNYKSVSKFHVFHDSTNFYG 480 DTAICAKTDGCNEARWFSLTTRNVNDGQIQGVINKLWGVDTNYKSVSKFHVFHDSTNFYG Sbjct: 421 DTAICAKTDGCNEARWFSLTTRNVNDGQIQGVINKLWGVDTNYKSVSKFHVFHDSTNFYG 480 Query: 481 STGNARGQAVVNISNAAFPILMARNDKNYWLAFGEKRAWDKNELAYITEAPSLVEPENVT 540 STGNARGQAVVNISNAAFPILMARNDKNYWLAFGEKRAWDKNELAYITEAPSLVEPENVT Sbjct: 481 STGNARGQAVVNISNAAFPILMARNDKNYWLAFGEKRAWDKNELAYITEAPSLVEPENVT 540 Query: 541 RDTATFNLPFISLGQVGEGKLMVIGNPHYNSILRCPNGYSWNGGVNKDGQCTLNSDPDDM 600 RDTATFNLPFISLGQVGEGKLMVIGNPHYNSILRCPNGYSWNGGVNKDGQCTLNSDPDDM Sbjct: 541 RDTATFNLPFISLGQVGEGKLMVIGNPHYNSILRCPNGYSWNGGVNKDGQCTLNSDPDDM 600 Query: 601 KNFMENVLRYLSDDKWKPDAKASMTVGTNLDTVYFKRHGQVTGNSAAFDFHPDFAGISVE 660 KNFMENVLRYLSDDKWKPDAKASMTVGTNLDTVYFKRHGQVTGNSAAFDFHPDFAGISVE Sbjct: 601 KNFMENVLRYLSDDKWKPDAKASMTVGTNLDTVYFKRHGQVTGNSAAFDFHPDFAGISVE 660 Query: 661 HLSSYGDLDPQEMPLLILNGFEYVTQVGNDPYAIPLRADTSKPKLTQQDVTDLIAYLNKG 720 HLSSYGDLDPQEMPLLILNGFEYVTQVGNDPYAIPLRADTSKPKLTQQDVTDLIAYLNKG Sbjct: 661 HLSSYGDLDPQEMPLLILNGFEYVTQVGNDPYAIPLRADTSKPKLTQQDVTDLIAYLNKG 720 Query: 721 GSVLIMENVMSNLKEESASGFVRLLDAAGLSMALNKSVVNNDPQGYPNRVRQQRATGIWV 780 GSVLIMENVMSNLKEESASGFVRLLDAAGLSMALNKSVVNNDPQGYPNRVRQQRATGIWV Sbjct: 721 GSVLIMENVMSNLKEESASGFVRLLDAAGLSMALNKSVVNNDPQGYPNRVRQQRATGIWV 780 Query: 781 YERYPAVDGALPYTIDSKTGEVKWKYQVENKPDDKPKLEVASWLEDVDGKQETRYAFIDE 840 YERYPAVDGALPYTIDSKTGEVKWKYQVENKPDDKPKLEVASWLEDVDGKQETRYAFIDE Sbjct: 781 YERYPAVDGALPYTIDSKTGEVKWKYQVENKPDDKPKLEVASWLEDVDGKQETRYAFIDE 840 Query: 841 ADHKTEDSLKAAKEKIFAAFPGLKECTNPAYHYEVNCLEYRPGTGVPVTGGMYVPQYTQL 900 ADHKTEDSLKAAKEKIFAAFPGLKECTNPAYHYEVNCLEYRPGTGVPVTGGMYVPQYTQL Sbjct: 841 ADHKTEDSLKAAKEKIFAAFPGLKECTNPAYHYEVNCLEYRPGTGVPVTGGMYVPQYTQL 900 Query: 901 SLNADTAKAMVQAADLGTNIQRLYQHELYFRTNGRKGERLSSVDLERLYQNMSVWLWNDT 960 SLNADTAKAMVQAADLGTNIQRLYQHELYFRTNGRKGERLSSVDLERLYQNMSVWLWNDT Sbjct: 901 SLNADTAKAMVQAADLGTNIQRLYQHELYFRTNGRKGERLSSVDLERLYQNMSVWLWNDT 960 Query: 961 SYRYEEGKNDELGFKTFTEFLNCYANDAYAGGTKCSADLKKSLVDNNMIYGDGSSKAGMM 1020 SYRYEEGKNDELGFKTFTEFLNCYANDAYAGGTKCSADLKKSLVDNNMIYGDGSSKAGMM Sbjct: 961 SYRYEEGKNDELGFKTFTEFLNCYANDAYAGGTKCSADLKKSLVDNNMIYGDGSSKAGMM 1020 Query: 1021 NPSYPLNYMEKPLTRLMLGRSWWDLNIKVDVEKYPGAVSEEGQNVTETISLYSNPTKWFA 1080 NPSYPLNYMEKPLTRLMLGRSWWDLNIKVDVEKYPGAVSEEGQNVTETISLYSNPTKWFA Sbjct: 1021 NPSYPLNYMEKPLTRLMLGRSWWDLNIKVDVEKYPGAVSEEGQNVTETISLYSNPTKWFA 1080 Query: 1081 GNMQSTGLWAPAQKEVTIKSNANVPVTVTVALADDLTGREKHEVALNRPPRVTKTYSLDA 1140 GNMQSTGLWAPAQKEVTIKSNANVPVTVTVALADDLTGREKHEVALNRPPRVTKTYSLDA Sbjct: 1081 GNMQSTGLWAPAQKEVTIKSNANVPVTVTVALADDLTGREKHEVALNRPPRVTKTYSLDA 1140 Query: 1141 SGTVKFKVPYGGLIYIKGNSSTNESASFTFTGVVKAPFYKDGAWKNDLNSPAPLGELESD 1200 SGTVKFKVPYGGLIYIKGNSSTNESASFTFTGVVKAPFYKDGAWKNDLNSPAPLGELESD Sbjct: 1141 SGTVKFKVPYGGLIYIKGNSSTNESASFTFTGVVKAPFYKDGAWKNDLNSPAPLGELESD 1200 Query: 1201 AFVYTTPKKNLNASNYTGGLEQFANDLDTFASSMNDFYGRDSEDGKHRMFTYKNLPGHKH 1260 AFVYTTPKKNLNASNYTGGLEQFANDLDTFASSMNDFYGRDSEDGKHRMFTYKNLPGHKH Sbjct: 1201 AFVYTTPKKNLNASNYTGGLEQFANDLDTFASSMNDFYGRDSEDGKHRMFTYKNLPGHKH 1260 Query: 1261 RFTNDVQISIGDAHSGYPVMNSSFSPNSTTLPTTPLNDWLIWHEVGHNAAETPLTVPGAT 1320 RFTNDVQISIGDAHSGYPVMNSSFSPNSTTLPTTPLNDWLIWHEVGHNAAETPLTVPGAT Sbjct: 1261 RFTNDVQISIGDAHSGYPVMNSSFSPNSTTLPTTPLNDWLIWHEVGHNAAETPLTVPGAT 1320 Query: 1321 EVANNVLALYMQDRYLGKMNRVADDITVAPEYLEESNNQAWARGGAGDRLLMYAQLKEWA 1380 EVANNVLALYMQDRYLGKMNRVADDITVAPEYLEESNNQAWARGGAGDRLLMYAQLKEWA Sbjct: 1321 EVANNVLALYMQDRYLGKMNRVADDITVAPEYLEESNNQAWARGGAGDRLLMYAQLKEWA 1380 Query: 1381 EKNFDIKKWYPDGTPLPEFYSEREGMKGWNLFQLMHRKARGDEVSNDKFGGKNYCAESNG 1440 EKNFDIKKWYPDGTPLPEFYSEREGMKGWNLFQLMHRKARGDEVSNDKFGGKNYCAESNG Sbjct: 1381 EKNFDIKKWYPDGTPLPEFYSEREGMKGWNLFQLMHRKARGDEVSNDKFGGKNYCAESNG 1440 Query: 1441 NAADTLMLCASWVAQTDLSEFFKKWNPGANAYQLPGASEMSFEGGVSQSAYNTLASLDLP 1500 NAADTLMLCASWVAQTDLSEFFKKWNPGANAYQLPGASEMSFEGGVSQSAYNTLASLDLP Sbjct: 1441 NAADTLMLCASWVAQTDLSEFFKKWNPGANAYQLPGASEMSFEGGVSQSAYNTLASLDLP 1500 Query: 1501 KPEQGPETINQVTEHKMSAE 1520 KPEQGPETINQVTEHKMSAE Sbjct: 1501 KPEQGPETINQVTEHKMSAE 1520 >UniRef50_A6Y2J7 Large exoproteins involved in heme utilization or adhesion n=14 Tax=Vibrionaceae RepID=A6Y2J7_VIBCH Length = 1526 Score = 1895 bits (4908), Expect = 0.0, Method: Composition-based stats. Identities = 764/1568 (48%), Positives = 993/1568 (63%), Gaps = 108/1568 (6%) Query: 9 KSLLAAILSATLLAGCDGGGSGSSSDTPPVDSGTGSLPEVKPDPTPNPEPTPEPTPDPEP 68 K L AI+ A +AGC+ + SDT +KPD P Sbjct: 5 KIKLTAIMVALAIAGCNHDSIPNPSDT------------IKPD-------IPNIGEGDSV 45 Query: 69 TPEPIPDPEPTPEPEPEPVPTKTGYLTLGGSQRVT-GATCNGESSDGFTFKPGEDVTCVA 127 T +PD P L+L GS CN + ++ F+ + + V C Sbjct: 46 TGGELPDIGVDD-------PIIISRLSLDGSLMFGESVQCNDQPANQFSVEQKDHVVCTL 98 Query: 128 GNTTIATFNTQSEAARSLRAVEKVSFSL---EDAQELAGSDDKKSNAVSLVTSSNSCPAN 184 T+ATF++ S A L +A E S +++N +L+ + + Sbjct: 99 DGQTLATFSSPFNIPNSRMAARPSGLELLTLTNADEYKESVLRQANLQTLIKNMGNL--Q 156 Query: 185 TEQVCLTFSSVIESKRFDS-LYKQIDLAPEEFKKLVNEEVENNAATDKAPSTHTSPVVPV 243 + + F S +S F + L +D+ EEF++L+ E++ N+ DK PSTH + P Sbjct: 157 GKNIDFNFESSRDSLTFQNYLRNNLDMPAEEFRELITEKISNDNQVDKQPSTHVPDIPPA 216 Query: 244 TTPGTKPDLNASFVSANAEQFYQYQPTEIILSEGRLVDSQGYGVAGVNYYTNSGRGVTGE 303 TPG DLN+ FVSANAE+ Y+PTEIILS+G+L+DSQG V G+ Y++N RGVTG Sbjct: 217 VTPGASNDLNSGFVSANAEENLVYKPTEIILSQGQLLDSQGRPVNGIAYFSNHSRGVTGI 276 Query: 304 N--------GEFSFSWGETISFGIDTFELGSVRGNKSTIALTELGDEVRGANIDQLIHRY 355 N G F FSWG+TISF IDTFELG +RGNK+T L ELG E G N + L+ RY Sbjct: 277 NKNGQATGDGSFEFSWGDTISFAIDTFELGHIRGNKNTFKLNELGSEWAGKNAETLVLRY 336 Query: 356 STTGQNNTRVVPDDVRKVFAEYPNVINEIINLSLSNGA---TLGEGEQVVNLPNEFIEQF 412 + + + D V +VF++YPNV+NE I+LSLSN +G GE+ +P EF +QF Sbjct: 337 ANISGD-IVSLSDKVTQVFSQYPNVVNESISLSLSNEDVELDVGGGEKQT-VPGEFHKQF 394 Query: 413 NTGQAKEIDTAICAKTDGCNEARWFSLTTRNVNDGQIQGVINKLWGV-----DTNYKSVS 467 + G A EID A+ + + + + + +I I KLWG +K V Sbjct: 395 SQGIAAEIDQALNPSRASQMWSTFATKSAVDPEASRILADIQKLWGATEEVQKQGWKKVE 454 Query: 468 KFHVFHDSTNFYGSTGNARGQAVVNISNAAFPILMARNDKNYWLAFGEKRAWDKNELAYI 527 +FHVFHDSTNFYGSTG+ARGQA VNI+N AFP+LMARND NYW+ FG+ +AWD LA+I Sbjct: 455 RFHVFHDSTNFYGSTGHARGQAAVNIANTAFPVLMARNDNNYWIDFGKPKAWDDQGLAFI 514 Query: 528 TEAPSLVEPENVTRDTATFNLPFISLGQVGEGKLMVIGNPHYNSILRCPNGYSWNGGVNK 587 TEAPSLV+PE V+ +TATFNLPFIS+G +G GK+MVIGN YNSIL CPNG+SWNGGVN Sbjct: 515 TEAPSLVQPEKVSAETATFNLPFISVGDLGRGKVMVIGNSRYNSILVCPNGFSWNGGVNH 574 Query: 588 DGQCTLNSDPDDMKNFMENVLRYLSDDKWKPDAKASMTVGTNLDTVYFKRHGQVTGNSAA 647 G+C +SD +DM NF NVLRYL+ A +TVGTN+ VYFKR+GQV G+SA Sbjct: 575 QGECIASSDSNDMGNFFSNVLRYLTGKN-----SAELTVGTNIPYVYFKRYGQVMGSSAP 629 Query: 648 FDFHPDFAGISVEHLSSYGDLDPQEMPLLILNGFEYVTQVGNDPYAIPLRADTSKPKLTQ 707 F FA E L+S+ LDP+ +P++ILNG+EY G Y +PL ADT +PK++Q Sbjct: 630 FILDTRFAA-RTETLTSFEGLDPETLPVVILNGYEYRGLRGMGSYDLPLSADTDEPKMSQ 688 Query: 708 QDVTDLIAYLNKGGSVLIMENVMSNLKEESASGFVRLLDAAGLSMALNKS---VVNNDPQ 764 DVT+LI Y+N+GG+VL+ME + + + +A RLLD++G++ + S N Sbjct: 689 DDVTNLIDYVNRGGNVLMMETI---IDQANAGEMTRLLDSSGIAFGMGSSVIANGNGPSG 745 Query: 765 GYPNRVRQQRATGIWVYERYPAVDGA------LPYTIDSKTGEVKWKYQVENKPDDKPKL 818 GYP+RVR QR GIWV ERY A++G LPYTI + G V+W + +E KPDDKP L Sbjct: 746 GYPDRVRNQRQHGIWVLERYAAIEGGNGAAPMLPYTI-KEDGTVEWTFIIEGKPDDKPNL 804 Query: 819 EVASWLE-DVDGKQETRYAFIDEADHK------------------TEDSLKAAKEKIFAA 859 EVASWLE + +G + AFI EADH E SL AAK++I A Sbjct: 805 EVASWLEKNSEGSLVKQVAFIYEADHWQKNEQGQIIYNESGKPVLNEASLAAAKQRILNA 864 Query: 860 FP------GLKECTNPAYHYEVNCLEYRPGTGVPVTGGMYVPQYTQLSLNADTAKAMVQA 913 F +EC+N YHYEVNCLEYRPG +P GGM+VP YT+L L + AKAM++A Sbjct: 865 FVTSDGKLAYQECSNSHYHYEVNCLEYRPGNAIPTGGGMHVPFYTELKLGDEEAKAMIKA 924 Query: 914 ADLGTNIQRLYQHELYFRTNGRKGERLSSVDLERLYQNMSVWLWNDTSYRYEEGKNDELG 973 A+LGTNI+ LYQHE YFRT G++GERLSSVDL R+YQNM+VWLWND YRYE G +DELG Sbjct: 925 ANLGTNIEALYQHERYFRTKGKQGERLSSVDLNRIYQNMTVWLWNDLDYRYEAGHDDELG 984 Query: 974 FKTFTEFLNCYANDAYAGGTKCSADLKKSLVDNNMIYGDGSSKAGMMNPSYPLNYMEKPL 1033 F+ FTEFLNCY ND G T+C DLK L M+YG+G AG MNPSYPLNYMEKPL Sbjct: 985 FQRFTEFLNCYTNDVAGGNTQCPTDLKLELNQMGMVYGEGE-YAGQMNPSYPLNYMEKPL 1043 Query: 1034 TRLMLGRSWWDLNIKVDVEKYPGAVSEEGQNVTETISLYSNPTKWFAGNMQSTGLWAPAQ 1093 TRLMLGRS+WDL+IKVDV +PG ++ Q T + + + T WFAGN Q TG WA AQ Sbjct: 1044 TRLMLGRSFWDLDIKVDVRAFPGE-AKGSQGRTIILDMRNQTTAWFAGNRQPTGQWAVAQ 1102 Query: 1094 KEVTI-KSNANVPVTVTVALADDLTGREKHEVALNRPPRVTKTYSLDA----SGTVKFKV 1148 +E ++ S + PVT+T+ALADDLTGREKHE+ L RPPR++K++ + + F V Sbjct: 1103 QEFSVAVSGEDSPVTITIALADDLTGREKHELGLKRPPRMSKSFVIGGENGNPTSKTFTV 1162 Query: 1149 PYGGLIYIKGNSSTNESASFTFTGVVKAPFYKDGAWKNDLNSPAPLGELESDAFVYTTPK 1208 PYGGLIY +G +S E S TFTG + AP YK+G W+N L+SPAP+GE+ S++FV+T PK Sbjct: 1163 PYGGLIYAQGGNS--ELVSLTFTGTIDAPLYKEGKWENGLDSPAPIGEVVSNSFVFTAPK 1220 Query: 1209 KNLNASNYTGGLEQFANDLDTFASSMNDFYGRDS-EDGKH-RMFTYKNLPGHKHRFTNDV 1266 NLNAS YTGG+ QFA DLD FA +NDFY RD +G+H R T ++ P ++H F NDV Sbjct: 1221 ANLNASGYTGGIAQFAQDLDRFALDLNDFYARDEGVEGQHNRKATSESNPNNRHHFVNDV 1280 Query: 1267 QISIGDAHSGYPVMNSSFSPNSTTLPTTPLNDWLIWHEVGHNAAETPLTVPGATEVANNV 1326 ISIG AHSGYPVMN+SF+ S +L T PLN WL+WHEVGHNAAE P V GATEV NN+ Sbjct: 1281 AISIGAAHSGYPVMNASFNATSKSLNTAPLNSWLLWHEVGHNAAEAPFNVDGATEVVNNL 1340 Query: 1327 LALYMQDRYLGKMNRVADDITVAPEYLEESNNQAWARGGAGDRLLMYAQLKEWAEKNFDI 1386 LALYMQD +LGKM RV DI +APE++ AW GGAG+RL+M+AQLKEWAE F I Sbjct: 1341 LALYMQDCHLGKMARVEQDIRIAPEFVSMERGHAWGAGGAGERLVMFAQLKEWAETEFQI 1400 Query: 1387 KKWYPDGTPLPEFYSEREGMKGWNLFQLMHRKARGDEVSNDKFGGKNYCAESNGNAADTL 1446 ++WY LP +YS+ +G+KGWNLF+LMHR R + G+N C S +D L Sbjct: 1401 ERWYS--GELPTYYSQEDGVKGWNLFKLMHRLTRNADDGVMTLKGENLCQPSGLGKSDQL 1458 Query: 1447 MLCASWVAQTDLSEFFKKWNPGANAYQLPGASEMSFEGGVSQSAYNTLASLDLPKPEQGP 1506 MLCAS+ AQTDL+EFF+ WNPG+ A+ P + +EGGV+ + + + + KP + P Sbjct: 1459 MLCASYAAQTDLTEFFQTWNPGSKAFIYPNDPKPHYEGGVTSAGVDRVKEQNYLKPNRDP 1518 Query: 1507 ETINQVTE 1514 INQ+++ Sbjct: 1519 LKINQISQ 1526 >UniRef50_A5F372 Accessory colonization factor AcfD n=25 Tax=Vibrio RepID=ACFD_VIBC3 Length = 1520 Score = 1840 bits (4766), Expect = 0.0, Method: Composition-based stats. Identities = 735/1566 (46%), Positives = 980/1566 (62%), Gaps = 106/1566 (6%) Query: 9 KSLLAAILSATLLAGCDGGGSGSSSDTPPVDSGTGSLPEVKPDPTPNPEPTPEPTPDPEP 68 K + +++ L GC P V D + +P Sbjct: 2 KIRIVSLIVLGFLIGCKHESI--------------ITPTVPADGSGGNAL------NPGL 41 Query: 69 TPEPIPDPEPTPEPEPEPVPTKTGYLTLGGSQRV-TGATCNGESSDGFTFKPGEDVTCVA 127 +PD P P + +TL G+ + + CN + + F ++V C Sbjct: 42 VGGYLPDIGV-------PDPIISLSMTLDGNLKFDSSLLCNDQDASHFQISQKDNVFCTI 94 Query: 128 GNTTIATFNTQSEAARSLRAVEKVSFSLEDAQELAGSDDKKSNAVSLVTSSNSCPANTEQ 187 +IATF +A ++ R + SL A E S ++ N L+ + + + ++ Sbjct: 95 NGRSIATFTAPFDANKNGRNTDSEVLSLISADEYRDSPVRQENLQILMKNMAT--IHGDK 152 Query: 188 VCLTFSSVIESKRFDS-LYKQIDLAPEEFKKLVNEEVENNAATDKAPSTHTSPVVPVTTP 246 + L F S +++ F++ L +DL ++F + + E++ N+ DK PSTH + P TP Sbjct: 153 ISLVFRSTLDALTFENYLRHNLDLPKDQFLEAITEKIANDNQVDKQPSTHVPNISPSFTP 212 Query: 247 GTKPDLNASFVSANAEQFYQYQPTEIILSEGRLVDSQGYGVAGVNYYTNSGRGVTG---- 302 GT +LN+ FVSANAE+ Y PT++I S GRL+DSQG + GV+Y++N+ RG+TG Sbjct: 213 GTSSNLNSPFVSANAEESLSYIPTDVIPSLGRLLDSQGRVINGVSYFSNNTRGITGVDKT 272 Query: 303 ----ENGEFSFSWGETISFGIDTFELGSVRGNKSTIALTELGDEVRGANIDQLIHRYSTT 358 +G F FSWG+ ISF IDTFELGS R NK+ ++ELG + G N + LIHRY++ Sbjct: 273 GAILNDGSFEFSWGDIISFSIDTFELGSTRANKTDFYISELGKDNEGKNAEALIHRYASI 332 Query: 359 GQNNTRVVPDDVRKVFAEYPNVINEIINLSLSNGA---TLGEGEQVVNLPNEFIEQFNTG 415 ++ ++PD V ++F+ YPNVINE+I+LSL NG +G+G+ + +P EF +QF++G Sbjct: 333 -DDSKLIIPDKVTQIFSLYPNVINEVISLSLPNGDIELDIGDGKTQI-VPGEFFKQFDSG 390 Query: 416 QAKEIDTAICAKT-DGCNEARWFSLTTRNVNDGQIQGVINKLWGVD-----TNYKSVSKF 469 A ID +I + ++ + + QIQ +INKLWG +K V +F Sbjct: 391 LAALIDQSISPISRFKFEDSLPKKKSAIDSESSQIQDIINKLWGATDTVQANGWKKVDRF 450 Query: 470 HVFHDSTNFYGSTGNARGQAVVNISNAAFPILMARNDKNYWLAFGEKRAWDKNELAYITE 529 H+FHDSTNFYGSTG+AR QA VNI+N+AFP+LMARND NYW+ FG+ +AWD N LA+ITE Sbjct: 451 HIFHDSTNFYGSTGSARAQAAVNIANSAFPVLMARNDNNYWIDFGKPKAWDSNSLAFITE 510 Query: 530 APSLVEPENVTRDTATFNLPFISLGQVGEGKLMVIGNPHYNSILRCPNGYSWNGGVNKDG 589 APS V P+ V+ DT+TFNLPFISLG++G+GKLMV+GN YNS+L CPNG+SW GG K+G Sbjct: 511 APSTVVPDKVSEDTSTFNLPFISLGEIGKGKLMVLGNARYNSVLVCPNGFSW-GGTVKNG 569 Query: 590 QCTLNSDPDDMKNFMENVLRYLSDDKWKPDAKASMTVGTNLDTVYFKRHGQVTGNSAAFD 649 C+L+SD DDM NF NV+RYL+ + VGTN+ VYFK GQ G+ A F+ Sbjct: 570 TCSLSSDRDDMANFFSNVIRYLTGS-----TSNDVIVGTNIPEVYFKSSGQTMGSKANFE 624 Query: 650 FHPDFAGISVEHLSSYGDLDPQEMPLLILNGFEYVTQVGNDPYAIPLRADTSKPKLTQQD 709 F+ + L+S+ DLD +PL+I+N ++Y + N PY IPL AD PKL++ D Sbjct: 625 LDSRFSK-QTQQLTSFHDLDVNTIPLIIINAYDYKGKNINSPYDIPLSADVGSPKLSRSD 683 Query: 710 VTDLIAYLNKGGSVLIMENVMSNLKEESASGFVRLLDAAGLSMALNKSVV---NNDPQGY 766 VTDLI Y+N GGSVL+ME +++ E RLLD+AG++ + SVV N G+ Sbjct: 684 VTDLIDYINNGGSVLMMETIINTNNSE----ISRLLDSAGIAFGIGNSVVADGNGPSGGH 739 Query: 767 PNRVRQQRATGIWVYERYPAVDG------ALPYTIDSKTGEVKWKYQVENKPDDKPKLEV 820 P+R R QR GIWV ERY AV+ LPY I+S G ++WKY VEN+PDDKPKLEV Sbjct: 740 PDRPRSQREHGIWVIERYAAVEDESSGQQTLPYVINS-DGSIEWKYIVENRPDDKPKLEV 798 Query: 821 ASWLEDVDG-KQETRYAFIDEADHKTED------------------SLKAAKEKIFAAFP 861 ASW+E G K T YAFIDE+ H +D SL AK K+ AF Sbjct: 799 ASWVESEAGDKLITHYAFIDESQHWKKDISGKIIYNVAGKPEVDNASLSLAKNKVLDAFK 858 Query: 862 ------GLKECTNPAYHYEVNCLEYRPGTGVPVTGGMYVPQYTQLSLNADTAKAMVQAAD 915 EC N +HYE+NCLEYRPG +P+TGG+YVP+YT + L A AMV+AA+ Sbjct: 859 NSSGQRAYSECKNSEFHYEINCLEYRPGNSIPITGGLYVPRYTDIKLGESEANAMVKAAN 918 Query: 916 LGTNIQRLYQHELYFRTNGRKGERLSSVDLERLYQNMSVWLWNDTSYRYEEGKNDELGFK 975 LGTNI LYQHE YFRT G+ G RL+SVDL R+YQNMSVWLWND YRY++ ++DELGFK Sbjct: 919 LGTNIHALYQHERYFRTKGKSGARLNSVDLNRIYQNMSVWLWNDLDYRYDDKQSDELGFK 978 Query: 976 TFTEFLNCYANDAYAGGTKCSADLKKSLVDNNMIYGDGS-SKAGMMNPSYPLNYMEKPLT 1034 FT++LNCY ++ G T C +LK L MIY + S S AG M+PSYPLNYMEKPLT Sbjct: 979 VFTQYLNCYTSNNAGGNTTCPEELKDELTQLGMIYDEKSGSYAGQMDPSYPLNYMEKPLT 1038 Query: 1035 RLMLGRSWWDLNIKVDVEKYPGAVSEEGQNVTETISLYSNPTKWFAGNMQSTGLWAPAQK 1094 RLMLGRS+WDL+IKVDV KYPG V+ T+ + +N WFAGN Q TG WA A + Sbjct: 1039 RLMLGRSFWDLDIKVDVRKYPGEVTTRSGGGDITLDMRNNTAAWFAGNRQPTGQWAEAHQ 1098 Query: 1095 EVTIKSNAN-VPVTVTVALADDLTGREKHEVALNRPPRVTKTYSL--DASGTVKFKVPYG 1151 ++ + PVT+T+ALADDLTGREKHE+ L RPPR++K++ + D+ F VPYG Sbjct: 1099 PFSVSVSGETSPVTITIALADDLTGREKHELGLKRPPRMSKSFVIGGDSPKMQTFTVPYG 1158 Query: 1152 GLIYIKGNSSTNESASFTFTGVVKAPFYKDGAWKNDLNSPAPLGELESDAFVYTTPKKNL 1211 GLIY +G +S + TF+G + AP Y DG W+N L S AP+GE+ SD F++T PK NL Sbjct: 1159 GLIYAQGGNS--QQVKLTFSGTIDAPLYIDGKWRNPLLSGAPIGEVVSDTFIFTAPKANL 1216 Query: 1212 NASNYTGGLEQFANDLDTFASSMNDFYGRD--SEDGKHRMFTYKNLPGHKHRFTNDVQIS 1269 NA Y GG+EQFA DLD F++ +NDFY RD ++ K+R T K++P ++H F NDV IS Sbjct: 1217 NADGYLGGIEQFAKDLDQFSADLNDFYARDEGADGDKNRKATDKSMPNNRHHFVNDVAIS 1276 Query: 1270 IGDAHSGYPVMNSSFSPNSTTLPTTPLNDWLIWHEVGHNAAETPLTVPGATEVANNVLAL 1329 +G AHSGYPVMN SF +S +L T PLN WL+WHEVGHN+AE P V GATEV NN+LAL Sbjct: 1277 VGAAHSGYPVMNDSFITSSRSLNTMPLNSWLLWHEVGHNSAEAPFNVDGATEVVNNLLAL 1336 Query: 1330 YMQDRYLGKMNRVADDITVAPEYLEESNNQAWARGGAGDRLLMYAQLKEWAEKNFDIKKW 1389 YMQDR+ GKM+RV DI A +++ + AW GGAG+RL+M+AQLKEWAE FDI W Sbjct: 1337 YMQDRHQGKMSRVEQDIRYAFDFVNAEHGHAWGAGGAGERLVMFAQLKEWAETEFDINDW 1396 Query: 1390 YPDGTPLPEFYSEREGMKGWNLFQLMHRKARGDEVSNDKFGGKNYCAESNGNAADTLMLC 1449 Y D LP FY E G+KGWNLF+LMHR R + G+N C S +D LMLC Sbjct: 1397 YNDK--LPGFYIEESGIKGWNLFKLMHRLMRNENDDQINMKGENQCKISGIGKSDLLMLC 1454 Query: 1450 ASWVAQTDLSEFFKKWNPGANAYQLPGASEMSFEGGVSQSAYNTLASLDLPKPEQGPETI 1509 AS+ AQTDLSEFFK WNPG+ A+ P + +EGG++ S + SL L P++ P +I Sbjct: 1455 ASYAAQTDLSEFFKAWNPGSKAFLYPDDPQPYYEGGITPSGIQRVKSLKLNLPQKNPLSI 1514 Query: 1510 NQVTEH 1515 N VT+H Sbjct: 1515 NSVTQH 1520 >UniRef50_D0YX39 Accessory colonization factor AcfD n=1 Tax=Photobacterium damselae subsp. damselae CIP 102761 RepID=D0YX39_LISDA Length = 1509 Score = 1740 bits (4506), Expect = 0.0, Method: Composition-based stats. Identities = 601/1569 (38%), Positives = 879/1569 (56%), Gaps = 129/1569 (8%) Query: 10 SLLAAILSATLLAGCDGGGSGSSSDTP--PVDSGTGSLPE---VKPDPTPNPEPTPEPTP 64 LL+ ++A LL GC+G + ++D P P+D PE + +P+ EP P Sbjct: 4 KLLSLAITAALLTGCNGDSNSQNTDLPLTPLDPSIPVKPEQPLIPLEPSIPVEPEIPEPP 63 Query: 65 DPEPTPEPIPDPEPTPEPEPEP-VPTKTGYLTLGGSQRVTGATCNGES---SDGFTFKPG 120 PEP +P+ P +P + G L L G Q V +CNG+ + FTFK G Sbjct: 64 VEPELPEPPTEPDTIPPSVLDPAIKIHKGGLQLSGKQLVGDISCNGQELALNGQFTFKDG 123 Query: 121 EDVTCVAGNTTIATFNTQSEAARSLR--AVEKVSFSLEDAQELAGSDDKKSNAVSLVTSS 178 +D+ C G+ + F+ Q+ RSL A + + F +E D N V ++ Sbjct: 124 DDIRCNFGSIEL--FSQQAPQPRSLHSDAQKVIHFDIEHFLHDGAVD----NTVQVLNKI 177 Query: 179 NSCPANTEQVCLTFSSVIESKRFDSLYKQIDLAPEEFKKLVNEEVEN-NAATDKAPSTHT 237 ++C ++ +VCL VI S +LY D E K+ +N E DKAPS+H Sbjct: 178 DTCKSDNNKVCL---DVINSYDIATLYNSTDT--EAVKEFINPSSEQVTEEVDKAPSSHV 232 Query: 238 SP-VVPVTTPGTKPDLNASFVSANAEQFYQYQPT--EIILSEGRLVDSQGYGVAGVNYYT 294 + P TPGT DLN+ FVSA+AE YQY+P+ ++ +L+D++G +AGV+YYT Sbjct: 233 DVTLKPEVTPGTSTDLNSQFVSASAESAYQYKPSVDNQEITVAKLLDAKGLPIAGVHYYT 292 Query: 295 NSGRGVTGENGEFSFSWGETISFGIDTFELGSVRGNKSTIALTELGDE-VRGANIDQLIH 353 S RGVT G+ + WGE I+FGIDTF GSV+GN+ + LT++ + + NID LI Sbjct: 293 PSSRGVTDSQGQIEYIWGEEITFGIDTFTFGSVKGNQLSYQLTDVTENSLVKQNIDSLIE 352 Query: 354 RYSTTGQNNTRVVPDDVRKVFAEYPNVINEIINLSLSNGATLGEGEQVVNLPNEFIEQFN 413 RYS ++ R V ++FA YPN INEIIN+SL NGA + EG ++PNEF QF+ Sbjct: 353 RYSKNLHDH-REFDTKVHQIFALYPNAINEIINISLPNGAKI-EGTNF-HVPNEFEYQFD 409 Query: 414 TGQAKEIDTAICA-KTDGCNEARWFSLTTRNVNDGQIQGVINKLWGVDTNYKSVSKFHVF 472 +G AKEID + K+ + + N+N + Y V +FHVF Sbjct: 410 SGLAKEIDEQLKQPKSLWAKQTKIVKAHGSNINAT-----------LHQIYSGVQQFHVF 458 Query: 473 HDSTNFYGSTGNARGQAVVNISNAAFPILMARNDKNYWLAFGEKRAWDKNELAYITEAPS 532 HD ++YG++G AR +NISN AFPILM R D NYWL FG+++AW + +I +A + Sbjct: 459 HDVGSYYGASGFARLMRNLNISNTAFPILMPRMDSNYWLPFGKEQAWTREFKPHIVDATT 518 Query: 533 --------LVEPENVTRDTATFNLPFISLGQVGEGKLMVIGNPHYNSILRCPNGYSWNGG 584 ++ P V+ D ATFNLP IS GQ+G+G ++ +G+ HY +L CP+ Y N Sbjct: 519 IDADSKVTMLRPPKVSEDNATFNLPGISTGQIGKGSIVFMGSGHYPIVLSCPDSYWGNKS 578 Query: 585 VN-KDGQCTLNS-------------DPDDMKNFMENVLRYLSDDKWKPDAKASMTVGTNL 630 ++ KD QCT + D M+ F +N+ +L + + + ++ V TN+ Sbjct: 579 LSIKDQQCTYSINNNIVDPTTDRQFDNGSMQRFFKNLFTWL--EPSYQNGQNAINVATNI 636 Query: 631 DTVYFKRHGQVTGNSAAFDFHPDFAGISVEHLSS--YGDLDPQEMPLLILNGFEYVTQVG 688 + HG + F +S+E ++S + ++P+ P+L+L +E Sbjct: 637 ELAPKFDHGHQSWLPKYEFFINKSYNVSLERITSGNFSGINPETTPILLLQSYEI--GAF 694 Query: 689 NDPYAIPLRADTSKPKLTQQDVTDLIAYLNKGGSVLIMENVMSNLKEESASGFVRLLDAA 748 D +D S+PKLT DV DLI Y+N GG ++ + + ++ + +L D A Sbjct: 695 GDGTTTKNISDLSQPKLTVNDVNDLIQYVNAGGHIVFFDAI----EQVNPEPIAKLADMA 750 Query: 749 GLSMALNKSVVNNDPQGYPNRVRQQRATGI------------WVYERYPAVDGALPYTID 796 G+S+ Q Y +G+ VYER+ ++ + Sbjct: 751 GVSLGGANVAQAKTTQAYCGSSYYCHGSGVKPNVHAVTEHDLVVYERFETLNDDASKIVI 810 Query: 797 SKTGEVKWKYQVENKPDDKPKLEVASWLE-----DVDGKQETRYAFIDEADHKTEDSLKA 851 + G + W P+ PKLEVA + +DG + R+AF K+ED +A Sbjct: 811 NSDGTITWP-----APNKMPKLEVAKYTTPYMPLTIDGIPQERFAFFQV---KSEDEKRA 862 Query: 852 AKEKIFAAFPGLKECTNPAYHYEVNCLEYRPGTGVPVTGGMYVPQYTQLSLNADTAKAMV 911 A ++ AFPG+K C + Y +EVNC+E+R G G+P G Y + S++ +MV Sbjct: 863 AIHELQVAFPGVKVCQDD-YEFEVNCIEFRKGHGIPSFGNYQRANYERYSISPKVIDSMV 921 Query: 912 QAADLGTNIQRLYQHELYFRTNGRKGERLSSVDLERLYQNMSVWLWNDTSYRYEEGKNDE 971 +AA+LGTN+ +LYQHELY+RT G +G RLS +L + Y N SVW+WND YRY+ DE Sbjct: 922 EAANLGTNLTKLYQHELYYRTRGEQGHRLSLTELNQTYDNTSVWMWNDEPYRYDNSVEDE 981 Query: 972 LGFKTFTEFLNCYANDAYAGGTKCSADLKKSLVDNNMIYGDGSSKAGMMNPSYPLNYMEK 1031 LGFKT ++LNCY N+ + GG +CS D +++L+ ++ K G +NPSYPLNY EK Sbjct: 982 LGFKTAVDYLNCYTNNQHQGGIECSVDKQQALIKYGFLH-----KNGELNPSYPLNYQEK 1036 Query: 1032 PLTRLMLGRSWWDLNIKVDVEKYPGAVSEEGQNVTETISLYSNPTKWFAGNMQSTGLWAP 1091 PLTR+MLGRS+WDL+IKVD +YPG + T T+S +N NMQSTGLWA Sbjct: 1037 PLTRIMLGRSYWDLDIKVDTTQYPGRPAFTNGTQTVTVSTLNNAVTGTVNNMQSTGLWAH 1096 Query: 1092 AQKEVTIKSNANVPVTVTVALADDLTGREKHEVALNRPPRVTKTYSLDASGTVKFKVPYG 1151 ++V + VP T+TV+L DDLTG E+HEVALNRPPRV K+++ D S + F+VPYG Sbjct: 1097 QHQQVQVS--GGVPATITVSLIDDLTGLEQHEVALNRPPRVQKSFNYDGSN-LSFRVPYG 1153 Query: 1152 GLIYIKGNSSTNESASFTFTGVVKAPFYKDGAWKNDLNSPAPLGELESDAFVYTTPKKNL 1211 GLIYIK +S+ +A F+F+GV A F+KD W S PL E+++ +YTTP +N+ Sbjct: 1154 GLIYIKPHSNIEGTAEFSFSGVATAAFWKDNQWMYGKASDVPLAEIDTGHVIYTTPVENI 1213 Query: 1212 NASNYTGGLEQFANDLDTFASSMNDFYGRDS--EDGKHRMFTYKNLPGHKHRFTNDVQIS 1269 + ++ F ++++ FA+S +DFYGRD GKHR FTY++L H+HRF ND+QIS Sbjct: 1214 EQQD----IQIFVDEMNKFANSASDFYGRDEVVSVGKHRRFTYQDLADHRHRFVNDIQIS 1269 Query: 1270 IGDAHSGYPVMNSSFSPNSTTLPTTPLNDWLIWHEVGHNAAETPLTVPGATEVANNVLAL 1329 IG AHSGYPV +++++ +PTTP NDWL+WHE+GHN A P ++ G TEV NN+LAL Sbjct: 1270 IGAAHSGYPVQSTTYN-KGNKIPTTPTNDWLLWHEIGHNLASAPFSMTGGTEVTNNILAL 1328 Query: 1330 YMQDRYL---GKMNRVADDITVAPEYLEESNNQAWARGGAGDRLLMYAQLKEWAEKNFDI 1386 YMQ++ KM+RV DI P N W+ G AG RL+M+AQLK WAE +F I Sbjct: 1329 YMQEQRPEPNNKMSRVESDIQKMPLLFSRYNKHVWSNGDAGIRLVMFAQLKLWAENHFRI 1388 Query: 1387 KKWYPDGTPLPEFYSEREGMKGWNLFQLMHRKARGDEVSNDKFGGKNYCAESN--GNAAD 1444 WY + L Y++ + GWN+F+LMHRK+RGD + + G NYC+ S+ + D Sbjct: 1389 DNWYSEKDLL-TIYNQDQ---GWNMFKLMHRKSRGDSIGDQ---GINYCSSSDTGLSGGD 1441 Query: 1445 TLMLCASWVAQTDLSEFFKKWNPGANAYQLPGASEMSFEGGVSQSAYNTLASL-DLPKPE 1503 LM+C+S+V+ DLS F+ WNP + LP ++ + GG++ Y L + +L +PE Sbjct: 1442 LLMVCSSYVSGFDLSNFYTLWNPSESMNILPNGDKL-YSGGITSKGYQVLNQIPNLKQPE 1500 Query: 1504 QGPETINQV 1512 PE+I + Sbjct: 1501 TSPESITHL 1509 >UniRef50_A6AXB7 AcfD n=3 Tax=Gammaproteobacteria RepID=A6AXB7_VIBPA Length = 1366 Score = 1730 bits (4479), Expect = 0.0, Method: Composition-based stats. Identities = 715/1381 (51%), Positives = 918/1381 (66%), Gaps = 63/1381 (4%) Query: 182 PANTEQVCLTFSSVIESKRFDS-LYKQIDLAPEEFKKLVNEEVENNAATDKAPSTHTSPV 240 + +++ L+ F + L Q+D+ + FKKL+ E + N++ TDK PSTHT V Sbjct: 3 TIHGDELDLSLEKTSHRLIFKNYLNNQLDVEIDTFKKLLQERLSNDSQTDKQPSTHTPEV 62 Query: 241 VPVTTPGTKPDLNASFVSANAEQFYQYQPTEIILSEGRLVDSQGYGVAGVNYYTNSGRGV 300 P TPG DL+ +FVSANAE+ +Y+P E+IL+ G LVDS G V G+ Y+T+ GRG+ Sbjct: 63 EPAVTPGASSDLSQAFVSANAEKSLEYKPKELILTTGYLVDSFGRSVNGIAYFTSKGRGL 122 Query: 301 TGE-------NGEFSFSWGETISFGIDTFELGSVRGNKSTIALTELGDEVRGANIDQLIH 353 TG +G FSWG+TI+FGIDTFELGS RGNK+TI L +LG G NI+ L+ Sbjct: 123 TGYKDGRLIGDGSLEFSWGDTINFGIDTFELGSTRGNKNTIKLQDLGSGNEGKNIESLVM 182 Query: 354 RYSTTGQNNTRVVPDDVRKVFAEYPNVINEIINLSLSNGA---TLGEGEQVVNLPNEFIE 410 R+S + + V D V +VF++YPNVINE I+LSLSN LG G V + EF + Sbjct: 183 RFSEEN-DQSVFVTDKVTEVFSKYPNVINEAISLSLSNEDIQLDLGNGNTEV-VKGEFEK 240 Query: 411 QFNTGQAKEIDTAICAKTDGCNEARWFSLTTRNVNDG--QIQGVINKLWGVDT-----NY 463 QF +G A++ID + + E + V+ +Q + +LWG + Sbjct: 241 QFESGLAEDIDKELGRQKLAFGEQYREPKQIKAVDSDAQNVQRDVERLWGATQQAQREGW 300 Query: 464 KSVSKFHVFHDSTNFYGSTGNARGQAVVNISNAAFPILMARNDKNYWLAFGEKRAWDKNE 523 K V +FH+FHDSTNFYGSTG+AR QA VNISN AFP++MARNDKNYW+ F + +AWD+N Sbjct: 301 KPVERFHIFHDSTNFYGSTGSARAQAAVNISNKAFPVVMARNDKNYWIDFDKPQAWDENG 360 Query: 524 LAYITEAPSLVEPENVTRDTATFNLPFISLGQVGEGKLMVIGNPHYNSILRCPNGYSWNG 583 LAYITEAPS V+P+ V ATFNLPFIS+G +G+GK+MV+GN YNS+L CPNG+SWNG Sbjct: 361 LAYITEAPSKVKPKKVDASNATFNLPFISIGDLGKGKVMVMGNARYNSVLVCPNGFSWNG 420 Query: 584 GVNKDGQCTLNSDPDDMKNFMENVLRYLSDDKWKPDAKASMTVGTNLDTVYFKRHGQVTG 643 GVN GQCT N+D DDM NF N +YL+ K + +V TN+ VYFKR GQV G Sbjct: 421 GVNDQGQCTGNTDSDDMANFFNNAFQYLTGKK-----AGTFSVATNIPHVYFKRGGQVLG 475 Query: 644 NSAAFDFHPDFAGISVEHLSSYGDLDPQEMPLLILNGFEYVT-QVGNDPYAIPLRADTSK 702 + A++ FA + L S+ LDP ++PL+ILN + Y+ Q G Y +P++A+ Sbjct: 476 SKASYLIDKRFAQ-DTQQLDSFSGLDPNDIPLVILNAYSYLGEQGGLGAYDLPMQANLDA 534 Query: 703 PKLTQQDVTDLIAYLNKGGSVLIMENVMSNLKEESASGFVRLLDAAGLSMALNKSV---V 759 PKLTQQD++DLIAY+ GGSVL+ME + ++ + RLLDAAG++ + +SV Sbjct: 535 PKLTQQDISDLIAYVEDGGSVLMMETIKG---QKDSGVVSRLLDAAGIAFGIGESVARDG 591 Query: 760 NNDPQGYPNRVRQQRATGIWVYERYPAVDGA------LPYTIDSKTGEVKWKYQVENKPD 813 N GYP+RVR QR GIWV ERY A D + LPY I + G V+WKY +EN+PD Sbjct: 592 NGPNGGYPDRVRSQRQQGIWVLERYAAEDSSNGEGPSLPYVI-KEDGSVEWKYIIENRPD 650 Query: 814 DKPKLEVASWLE-DVDGKQETRYAFIDEADHKTE-----DSLKAAKEKIFAAFP------ 861 DKPKLEVA W+E + G + + AFIDEA+ + ++L AK +I AF Sbjct: 651 DKPKLEVAKWIEINEQGDSKVQVAFIDEANFYQDGTFDNEALTVAKNRILDAFKDNSGKR 710 Query: 862 GLKECTNPAYHYEVNCLEYRPGTGVPVTGGMYVPQYTQLSLNADTAKAMVQAADLGTNIQ 921 +ECTN YHYEVNCLEYRPG +P++GG+YVP YT++ L AKAMV+AA++G+NI+ Sbjct: 711 AYEECTNNEYHYEVNCLEYRPGNKIPISGGLYVPNYTEMKLGEHEAKAMVKAANIGSNIE 770 Query: 922 RLYQHELYFRTNGRKGERLSSVDLERLYQNMSVWLWNDTSYRYEEGKNDELGFKTFTEFL 981 LYQHE YFRT G++G RL+SVD+ R+YQN+SVWLWND Y Y++ KNDELGFK FTEFL Sbjct: 771 ALYQHERYFRTKGKQGFRLNSVDMSRMYQNLSVWLWNDLRYSYDQEKNDELGFKRFTEFL 830 Query: 982 NCYANDAYAGGTKCSADLKKSLVDNNMIYGDGSSKAGMMNPSYPLNYMEKPLTRLMLGRS 1041 NCY +D G T C LK L +MIY +G AG MNPSYPLNYMEKPLTRLMLGRS Sbjct: 831 NCYTDDKAGGNTICPESLKLELQKMDMIYAEGE-YAGYMNPSYPLNYMEKPLTRLMLGRS 889 Query: 1042 WWDLNIKVDVEKYPGAVSEEGQN-VTETISLYSNPTKWFAGNMQSTGLWAPAQKEVTIK- 1099 +WDL++KVD +PG S G N T T+ + +N T WFAG+ Q+TG WA A T+ Sbjct: 890 FWDLDVKVDTRPFPGVASSSGSNGGTITLDMSNNVTAWFAGSRQATGQWAQAHVPFTVSV 949 Query: 1100 SNANVPVTVTVALADDLTGREKHEVALNRPPRVTKTYSLDA--SGTVKFKVPYGGLIYIK 1157 S A PVT+TVALADDLT REKHEV L RPPR+TK++ + + + VPYGGLIY + Sbjct: 950 SGAKAPVTITVALADDLTAREKHEVGLKRPPRMTKSFIIGGNKATSETITVPYGGLIYAQ 1009 Query: 1158 GNSSTNESASFTFTGVVKAPFYKDGAWKNDLNSPAPLGELESDAFVYTTPKKNLNASNYT 1217 G +S ES TFTG + AP + DG+WKNDL+SPAP+GE+ S +F+YT PK NL A NY Sbjct: 1010 GGNS--ESVQLTFTGTLAAPLFIDGSWKNDLDSPAPVGEVVSKSFIYTGPKANLRAENYP 1067 Query: 1218 GGLEQFANDLDTFASSMNDFYGRDS--EDGKHRMFTYKNLPGHKHRFTNDVQISIGDAHS 1275 GG+EQFA DLD FAS +NDFY RD + +R T P +H F NDV ISIG AHS Sbjct: 1068 GGIEQFAKDLDQFASDLNDFYARDEGLDGQANRKVTGDENPNSRHHFVNDVAISIGAAHS 1127 Query: 1276 GYPVMNSSFSPNSTTLPTTPLNDWLIWHEVGHNAAETPLTVPGATEVANNVLALYMQDRY 1335 GYPVMNSS++ NS+ + TTPLNDWL+WHEVGHNAAE P V GATEV NN+LALYMQD + Sbjct: 1128 GYPVMNSSYNLNSSNINTTPLNDWLLWHEVGHNAAEAPFVVEGATEVVNNLLALYMQDLH 1187 Query: 1336 LGKMNRVADDITVAPEYLEESNNQAWARGGAGDRLLMYAQLKEWAEKNFDIKKWYPDGTP 1395 +GKM RV DI VAPE++ + AWA GGA +RL+M+AQLKEWAE FDI+ WY Sbjct: 1188 IGKMTRVEQDIQVAPEFVRTEHGHAWAAGGAAERLVMFAQLKEWAESEFDIRDWY--QGE 1245 Query: 1396 LPEFYSEREGMKGWNLFQLMHRKARGDEVSNDKFGGKNYCAESNGNAADTLMLCASWVAQ 1455 LP +YSE EG+KGWNLF+LMHR R + N C + +D LM+CAS+ AQ Sbjct: 1246 LPSYYSEVEGVKGWNLFKLMHRLTRNESDGIFYLKSTNACRWQGLSKSDQLMVCASYAAQ 1305 Query: 1456 TDLSEFFKKWNPGANAYQLPGASEMSFEGGVSQSAYNTLASLDLPKPEQGPETINQVTEH 1515 TDLS+FF WNPGA ++ PG+SE S+EGGV+Q + + L L KP PE IN +T Sbjct: 1306 TDLSDFFLAWNPGARSFIYPGSSEPSYEGGVTQKGLDVVRKLGLKKPSLDPEEINTITVR 1365 Query: 1516 K 1516 K Sbjct: 1366 K 1366 >UniRef50_A6ALV4 AcfD n=5 Tax=Vibrio RepID=A6ALV4_VIBHA Length = 1466 Score = 1703 bits (4410), Expect = 0.0, Method: Composition-based stats. Identities = 580/1555 (37%), Positives = 839/1555 (53%), Gaps = 154/1555 (9%) Query: 9 KSLLAAILSATLLAGCDGGGSGSSSDTPPVDSGTGSLPEVKPDPTPNPEPTPEPTPDPEP 68 K L +IL A+LL GC G + S+ T P P V+PD P+ Sbjct: 2 KRSLLSILVASLLFGCGGDENNHSTSTTP--------PTVEPDLPPDQPDI--------- 44 Query: 69 TPEPIPDPEPTPEPEPEPVPTKTGYLTLGGSQRVTGATCNGE---SSDGFTFKPGE-DVT 124 V T G L + G Q C+G+ S FT+ E + + Sbjct: 45 -----------------GVTTYQGKLFINGKQLTGDIQCDGQDNSESGYFTYAASEGNFS 87 Query: 125 CVAGNTTIATFNTQSEAARSLRAVEKVSFSLEDAQELAGSDDKKSNAVSLVTSSNSCPAN 184 C G ++ F+ Q A + D +++ G+ +NA L+ ++CP+ Sbjct: 88 CEFGAVSLGEFSYQIPAQTRTGSQPAELTQNYDLKDVLGT--HANNAAKLLHKIDTCPSQ 145 Query: 185 TEQVCLTFSSVIESKRFDSLYKQIDLAPEEFKKLVNEEVENNAATDKAPSTHT-SPVVPV 243 QVCL I S LY+ D A +N V N PS H + P Sbjct: 146 DTQVCL---DEINSYDIQDLYESDDQAA--IDAFLNPSVVNEGE---QPSAHVDPELQPE 197 Query: 244 TTPGTKPDLNASFVSANAEQFYQYQPTE--IILSEGRLVDSQGYGVAGVNYYTNSGRGVT 301 TPG +L SFVSANAE Y+Y+P+ L + RL DSQG +AG+ +++ S RG+T Sbjct: 198 VTPGASNNLTGSFVSANAEAAYEYKPSAANKPLIKSRLTDSQGNALAGIEFFSQSARGIT 257 Query: 302 GENGEFSFSWGETISFGIDTFELGSVRGNKSTIALTELGD-EVRGANIDQLIHRYSTTGQ 360 NGEF + WGE + FGIDTF LG V+GN+ + L +L D + N+D +HRY + Sbjct: 258 DANGEFEYLWGENLIFGIDTFTLGQVKGNQVSYQLADLSDNPLVKQNLDAFVHRYGLSSG 317 Query: 361 NNTRVVPDDVRKVFAEYPNVINEIINLSLSNGATLGEGEQVVNLPNEFIEQFNTGQAKEI 420 NN + D+VR+VFA+YPNVINE+INLSL NGA + EG PNEF QF+ G + I Sbjct: 318 NNIE-IGDNVRQVFAQYPNVINELINLSLPNGAKI-EGTNFTT-PNEFEAQFSQGLTQII 374 Query: 421 DTAICAKTDGCNEARWFSLTTRNVNDGQIQGVINKLWGVDTNYKSVSKFHVFHDSTNFYG 480 D + +W TT + + G + Y V H+FHD+ +G Sbjct: 375 DGQLKQTP------QWSGFTTPMLRTVRASGSNYVTQSLHQIYAGVDSVHIFHDNHG-WG 427 Query: 481 STGNARGQAVVNISNAAFPILMARNDKNYWLAFGEKRAWDK----NELAYITEAPSLV-- 534 +G R N++N AFP+LM RND +YWL FGE+ AW + ++ AYI +A ++ Sbjct: 428 GSGYTRAMRNFNLTNEAFPVLMPRNDNSYWLGFGEEAAWTRGSGKDQKAYIVDATTIDEN 487 Query: 535 ------EPENVTRDTATFNLPFISLGQVGEGKLMVIGNPHYNSILRCPNGYSWNG--GVN 586 PE +++ TATFNLP ++ G +G GK++ +GN Y SI CP Y G++ Sbjct: 488 STVVMQRPEVISKQTATFNLPTMTAGMIGSGKVVFLGNAMYTSIFSCPENYWAGADLGID 547 Query: 587 KDGQCTLNSDPDD--------------MKNFMENVLRYLSDDKWKPDAKASMTVGTNLDT 632 + Q S P + M+ N++ +L + + S+ + TN++ Sbjct: 548 SEVQQCRYSTPHNQEAQDADTRTDNGSMQVMFGNLIDWLVPNA----TQESVAIATNINK 603 Query: 633 VYFKRHGQVTGNSAAFDFHPDFAGISVEHLSS--YGDLDPQEMPLLILNGFEYVTQVGND 690 + R + G F +P + ++ LSS + LDP PLL+L +E T G D Sbjct: 604 GHAFRWDRKEGQIYDFFVNPSYKLGEMDVLSSGQFDSLDPTSTPLLLLQSYEIKTD-GYD 662 Query: 691 PYAIPLRADTSKPKLTQQDVTDLIAYLNKGGSVLIMENVMSNLKEESASGFVRLLDAAGL 750 ++ +D ++PKL DVT LI Y+N GGS++ + L+E + RL DAAG+ Sbjct: 663 TKSV--VSDINQPKLDADDVTALIEYVNNGGSIIFFDA----LEESNPEPIARLADAAGV 716 Query: 751 SMA---LNKSVVNNDPQGY-------PN--RVRQQRATGIWVYERYPAVDGALPYTIDSK 798 S+ + K+ + Y PN + + VYERY + Sbjct: 717 SVGGANVAKTFQSLCTDSYWCHSTSGPNVPNLHTVAEYDLVVYERYADT----TKIEIND 772 Query: 799 TGEVKWKYQVENKPDDKPKLEVASWLEDVDGKQETRYAFIDEADHKTEDSLKAAKEKIFA 858 G V W + D P LE+ + +DG++ RYAF K+E +AA ++ Sbjct: 773 NGTVTWPGNI-----DMPTLEIPLYKASIDGQEHQRYAFHMV---KSEQEKQAAVAELQR 824 Query: 859 AFPGLKECTNPAYHYEVNCLEYRPGTGVPVTGGMYVPQYTQLSLNADTAKAMVQAADLGT 918 FPG+ C + Y YEVNC+E R G G+P G + P +T+ ++ + +MV+AA+LG Sbjct: 825 EFPGVPVCKDD-YQYEVNCIEVREGHGIPSRGNHHRPDFTRYEMSPEVVDSMVKAANLGA 883 Query: 919 NIQRLYQHELYFRTNGRKGERLSSVDLERLYQNMSVWLWNDTSYRYEEGKNDELGFKTFT 978 NI RL HELY+R+ G GERLS +L Y N+SVWLWND Y + DELGF+ Sbjct: 884 NIDRLLSHELYYRSKGEIGERLSQAELTSTYDNLSVWLWNDEQYEFNPNVQDELGFERAV 943 Query: 979 EFLNCYANDAYAGGTKCSADLKKSLVDNNMIYGDGSSKAGMMNPSYPLNYMEKPLTRLML 1038 E LNCY ++A+ GG C + ++ L +MI +++G +NPSYPLN+MEKPLTR+ML Sbjct: 944 EMLNCYTDNAHQGGNVCGQETREQLAKWSMI-----TESGELNPSYPLNWMEKPLTRMML 998 Query: 1039 GRSWWDLNIKVDVEKYPGAVSEEGQNVTETISLYSNPTKWFAGNMQSTGLWAPAQKEVTI 1098 GRS+WDL+I VD YPG S+ G + I + AG+MQSTGLWAP +EVTI Sbjct: 999 GRSYWDLDISVDTTSYPGRPSQSGSAASVAIHTDNKTVIGTAGSMQSTGLWAPQLEEVTI 1058 Query: 1099 KSNANVPVTVTVALADDLTGREKHEVALNRPPRVTKTYSLDASGTVKFKVPYGGLIYIKG 1158 V ++ VAL DDLTGR HE++L RPPRV KT+ D S ++ FKVPYGGLIYI+ Sbjct: 1059 S--GGVKASINVALVDDLTGRANHELSLKRPPRVQKTFQYDGS-SLSFKVPYGGLIYIQP 1115 Query: 1159 -NSSTNESASFTFTGVVKAPFYKDGAWKNDLNSPAPLGELESDAFVYTTPKKNLNASNYT 1217 + + +F FTGV++A ++K+G+W N +N+ PL E++S F+YTTP N+ ++ Sbjct: 1116 LEVDSRDVVTFNFTGVLRASWWKNGSWLNPINTDVPLAEIDSGHFIYTTPTNNVQDTDVP 1175 Query: 1218 GGLEQFANDLDTFASSMNDFYGRDS--EDGKHRMFTYKNLPGHKHRFTNDVQISIGDAHS 1275 +F ++L+ FA+ +DFYGRD E G+HR FTY L ++HRF NDVQISIG AHS Sbjct: 1176 ----KFVDELNAFANHASDFYGRDQVIEQGQHRRFTYDALLANRHRFVNDVQISIGAAHS 1231 Query: 1276 GYPVMNSSFSPNSTTLPTTPLNDWLIWHEVGHNAAETPLTVPGATEVANNVLALYMQDRY 1335 GYPV ++S+ P T +PT P+NDWL+WHEVGHN A P + G+TEV NN+LALYMQ++ Sbjct: 1232 GYPVQSNSYWPTWTVIPTNPINDWLLWHEVGHNLASAPFMMAGSTEVTNNILALYMQEQR 1291 Query: 1336 LGK--MNRVADDITVAPEYLEESNNQAWARGGAGDRLLMYAQLKEWAEKNFDIKKWYPDG 1393 K M+R+A D++ +P +L+ AW+ G RL M+ QLK WAE +F+I WY + Sbjct: 1292 EEKPYMDRIASDLSKSPLWLDRFEGHAWSEADVGMRLAMFGQLKLWAEDHFNIDDWYSNQ 1351 Query: 1394 TPLPEFYSEREGMKGWNLFQLMHRKARGDEVSNDKFGGKNYCA--ESNGNAADTLMLCAS 1451 P + E + GWN F+L HRKARGD +S+ G NYC+ S + D +M C S Sbjct: 1352 AEKPSIFGEDQ---GWNFFKLAHRKARGDSISDQ---GINYCSTQSSQLSQGDLMMACTS 1405 Query: 1452 WVAQTDLSEFFKKWNPGANAYQLPGASEMSFEGGVSQSAYNTLASLDLPKPEQGP 1506 ++ DL+++F+ WNP LP + + + GG++ + +N +A++ LPKPE+ P Sbjct: 1406 YLTGYDLTDYFRMWNPSETKANLPNGT-VDYSGGLTPAGFNAVAAMGLPKPEKSP 1459 >UniRef50_A8H5W9 Inner membrane lipoprotein n=1 Tax=Shewanella pealeana ATCC 700345 RepID=A8H5W9_SHEPA Length = 1426 Score = 1664 bits (4308), Expect = 0.0, Method: Composition-based stats. Identities = 583/1541 (37%), Positives = 848/1541 (55%), Gaps = 152/1541 (9%) Query: 9 KSLLAAILSATLLAGCDGGGSGSSSDTPPVDSGTGSLPEVKPDPTPNPEPTPEPTPDPEP 68 K L+ A++ + LLAGC G +D P+ EP+ PT Sbjct: 2 KKLILAVVISNLLAGC-----GDYTDA----------------PSTPVEPSIPPT----- 35 Query: 69 TPEPIPDPEPTPEPEPEPVPTKTGYLTLGGSQRVTGATCNGESSDG---FTFKPGEDVTC 125 + IP + T G L L G + +CNG++ FTFK G++V+C Sbjct: 36 --DLIPAKK-----------TYQGSLLLSGKKLSGHISCNGQALGHGGSFTFKDGDNVSC 82 Query: 126 VAGNTTIATFNTQSEAARSLRAVEKVSFSLEDAQELAGSDDKKSNAVSLVTSSNSCPANT 185 G+ + + + + ++ ++D A S ++A +++ ++CPA Sbjct: 83 TYGSLELLNKDIPLPDGWTRDSHNAMALEIKDDWAHAIS---VTDAAKVMSKVSTCPALA 139 Query: 186 EQVCLTFSSVIESKRFDSLYKQIDLAPEEFKKLVNE-EVENNAATDKAPSTHTSP-VVPV 243 +++CL I+S L+ + A + +N E DKAPS+H + P Sbjct: 140 DEICL---DEIDSFDVSPLFSNGNAA--DINAFLNPPAAEETDEIDKAPSSHVDSSLTPE 194 Query: 244 TTPGTKPDLNASFVSANAEQFYQYQPTE--IILSEGRLVDSQGYGVAGVNYYTNSGRGVT 301 + GTKPD+NA FVSA+AE Y Y+P+E + SE L D+QG +AGVNYYT S RG+T Sbjct: 195 VSAGTKPDINADFVSASAEDAYTYKPSEDARVESESVLTDNQGKPIAGVNYYTKSSRGIT 254 Query: 302 GENGEFSFSWGETISFGIDTFELGSVRGNKSTIALTELGDE-VRGANIDQLIHRYSTTGQ 360 +G S+ WGETI+FG+DTF SV+GN+ L++ + + NI LI RY+T Sbjct: 255 DASGIVSYVWGETITFGLDTFTFSSVKGNQIEYKLSDGSENEIVKQNISALIERYATHTT 314 Query: 361 NNTRVVPDDVRKVFAEYPNVINEIINLSLSNGATLGEGEQVVNLPNEFIEQFNTGQAKEI 420 ++ ++V +VF +YPNVINEIINL+L NGA + +PNEF QFNTG A I Sbjct: 315 DSVS-FDENVHRVFGQYPNVINEIINLNLPNGAEIESSGYF--VPNEFNAQFNTGLALII 371 Query: 421 DTAICAKTDGCNEARWFSLTTRNVND-GQIQGVINKLWGVDTNYKSVSKFHVFHDSTNFY 479 D + + R+ T + G + + +L YK V +FHVFHD+++FY Sbjct: 372 DAEL-----NLSPTRFSQQATPLLQKAGYVTNSLQQL------YKDVDQFHVFHDNSSFY 420 Query: 480 GSTGNARGQAVVNISNAAFPILMARNDKNYWLAFGEKRAWDKN-ELAYITEAPSLV---- 534 G G AR +N SN AFP+LM RND NYWL FG ++A+ ++ Y+T+A ++ Sbjct: 421 GEVGYARFMRSMNTSNTAFPVLMPRNDVNYWLPFGSEQAYRRDDGFPYVTDAKTIDASSD 480 Query: 535 ----EPENVTRDTATFNLPFISLGQVGEGKLMVIGNPHYNSILRCPNGYSWNGGVNKDGQ 590 PE V DTAT+NLP I+ G++G GK++ +GN Y +IL P Y W GG Sbjct: 481 VILKRPERVGTDTATYNLPVITAGEIGLGKVVFMGNSMYPNILSKPENY-WAGGEEA--- 536 Query: 591 CTLNSDPDDMKNFMENVLRYLSDDKWKPDAKASMTVGTNLDTVYFKRHGQVTGNSAAFDF 650 D M F N+ + + + K ++ VG+N+D V+ + F Sbjct: 537 ---GKDNGSMPTFFMNMFTWFTPG--YDNGKTTINVGSNIDKVWQSNVNN--NQTYDFFV 589 Query: 651 HPDFAGISVEHLSS--YGDLDPQEMPLLILNGFEYVTQVGNDPYAIPLRADTSKPKLTQQ 708 H + ++VE LSS Y LDP+ P+LIL +E T + D ++ + AD ++PKLT Sbjct: 590 HGSYK-LNVEPLSSGSYAGLDPKTTPVLILQAYE--TGLFGDGMSVKVLADIAQPKLTTA 646 Query: 709 DVTDLIAYLNKGGSVLIMENVMSNLKEESASGFVRLLDAAGLSMA-----------LNKS 757 DVT LI Y+N GG+VL M+ + ++ + RL D AG+++ +S Sbjct: 647 DVTALIKYINAGGNVLFMDGI----EQLNPEPIARLADTAGIALGGANLARTRQAYCGES 702 Query: 758 VVNNDPQGYPNRVRQQRATGIWVYERYPAVDGALPYTIDSKTGEVKWKYQVENKPDDKPK 817 P YPN R + YE++ D + ++ + G V + P DKP+ Sbjct: 703 YYCQAP--YPN-ARASFTDTLVTYEKF---DDMSKFVVN-QDGTVNFP-----SPIDKPE 750 Query: 818 LEVASWLED-VDGKQETRYAFIDEADHKTEDSLKAAKEKIFAAFPGLKECTNPAYHYEVN 876 +A + DG ++ +AF KTE A KI AAFP +KECT+ +Y YE+ Sbjct: 751 FGIAQFKTTAEDGSEQDNFAFYSV---KTEAERLEAVAKIKAAFPKVKECTDASYDYEIG 807 Query: 877 CLEYRPGTGVPVTGGMYVPQYTQLSLNADTAKAMVQAADLGTNIQRLYQHELYFRTNGRK 936 C+E R G G+ Y P++T+ ++ D MV+AA+LG N+++LYQHE+Y+R+ G++ Sbjct: 808 CIETRKGHGLATGSRYYRPRFTRYEISPDVVNTMVKAANLGGNVEKLYQHEIYYRSQGKE 867 Query: 937 GERLSSVDLERLYQNMSVWLWNDTSYRYEEGKNDELGFKTFTEFLNCYANDAYAGGTKCS 996 G RLS +L + Y N+S+W WND Y Y DELGFK TEFLNCY +D + C+ Sbjct: 868 GSRLSLNELNQTYDNLSIWFWNDEQYSYNSEVQDELGFKKATEFLNCYTSDVHQPDNACA 927 Query: 997 ADLKKSLVDNNMIYGDGSSKAGMMNPSYPLNYMEKPLTRLMLGRSWWDLNIKVDVEKYPG 1056 AD ++ L+ M+ + +G +NPSYPLNY EKPLTR+MLGRS+WD +I VD E YPG Sbjct: 928 ADTREKLLKYGML-----TSSGELNPSYPLNYQEKPLTRIMLGRSYWDNDISVDTEMYPG 982 Query: 1057 AVSEEGQNVTETISLYSNPTKWFAGNMQSTGLWAPAQKEVTIKSNANVPVTVTVALADDL 1116 + EG N + I ++N A NMQSTGLWA + VT+ N + T+TVAL DD+ Sbjct: 983 NTAAEGSNASVQIETFNNAVVGTANNMQSTGLWAVKRSVVTVSGNHD--ATITVALVDDV 1040 Query: 1117 TGREKHEVALNRPPRVTKTYSLDASGTVKFKVPYGGLIYIKGNS-STNESASFTFTGVVK 1175 TG+ +HE++L RP RV K++S A T + PYGGLIYIK S T F F+GV++ Sbjct: 1041 TGKHEHELSLKRPSRVQKSWSHKAGSTTEIIAPYGGLIYIKPASTDTANRVEFNFSGVLE 1100 Query: 1176 APFYKDGAWKNDLNSPAPLGELESDAFVYTTPKKNLNASNYTGGLEQFANDLDTFASSMN 1235 A +K+G W+N +N PL E+ + FVYTTP N+ ++ ++ FA+ ++ FA + Sbjct: 1101 ASLWKNGQWQNPVNQEVPLAEVVTGQFVYTTPVNNVTDTD----IQAFASGMNDFAEKAS 1156 Query: 1236 DFYGRDSEDGKHRMFTYKNLPGHKHRFTNDVQISIGDAHSGYPVMNSSFSPNSTTLPTTP 1295 DF+ RD+ DG R FT K LP H HRF NDVQISIG AHSGYPVM+++++ ++ ++PT P Sbjct: 1157 DFHARDNSDGNMR-FTGKLLPEHSHRFVNDVQISIGAAHSGYPVMSTTYNRDANSIPTIP 1215 Query: 1296 LNDWLIWHEVGHNAAETPLTVPGATEVANNVLALYMQDRY---LGKMNRVADDITVAPEY 1352 NDWL+WHE+GHN A P V GATEVANN+LALYMQD G+M+RV DI AP Sbjct: 1216 DNDWLLWHEIGHNLAAAPFNVKGATEVANNLLALYMQDLRDNGDGQMDRVKTDIQKAPMM 1275 Query: 1353 LEESNNQAWARGGAGDRLLMYAQLKEWAEKNFDIKKWYPDGTPLPEFYSEREGMKGWNLF 1412 + W+ G AG RL+M+AQLK WA+++F I + G +P +Y E E GWN+F Sbjct: 1276 ISRDEGHVWSHGNAGSRLVMFAQLKVWAQEHFKIADHF-KGQTIPSYYGEDE---GWNMF 1331 Query: 1413 QLMHRKARGDEVSNDKFGGKNYCAESNGNAADTLMLCASWVAQTDLSEFFKKWNPGANAY 1472 +LMH +AR + S+ N + D LM C S V+ DL+ FF+ WNP + Sbjct: 1332 KLMHHEARNNNNSSCSAQNAN-----GLSQGDLLMACTSAVSGYDLTPFFEAWNPSEVSV 1386 Query: 1473 QLPGASEMSFEGGVSQSAYNTLASLDLPKPEQGPETINQVT 1513 S ++GG++ + SLDLP+P+ PETI+ + Sbjct: 1387 ITADGSR-DYQGGITADGIGYVKSLDLPEPKVKPETIDYIN 1426 >UniRef50_B5FBF1 Inner membrane lipoprotein n=1 Tax=Vibrio fischeri MJ11 RepID=B5FBF1_VIBFM Length = 1482 Score = 1660 bits (4298), Expect = 0.0, Method: Composition-based stats. Identities = 593/1572 (37%), Positives = 840/1572 (53%), Gaps = 162/1572 (10%) Query: 9 KSLLAAILSATLLAGCDGGGSGSSSDTPPVDSGTGSLPEVKPDPTPNPEPTPEPTPDPEP 68 K LL A L LLAGC+ S + Sbjct: 3 KKLLLASLIPMLLAGCNQEEINIGSGS--------------------------------- 29 Query: 69 TPEPIPDPEPTPEPEPEPVPTKTGYLTLGGSQRVTGATCNGE----SSDGFTFKPGEDVT 124 D T P P T L G CNG+ S FT K G Sbjct: 30 ------DSGATTPPTPAIPTTYISTLMASGKIITGDVHCNGKSLNTDSGTFTVKEGSVFD 83 Query: 125 CVAGNTTIATFNTQSEAARSLRAVEKVSFSLEDAQELAGSDDKKSNAVSLVTSSNSCPAN 184 C G T+ F + A+ S + D Q + GS NA ++ S ++C Sbjct: 84 CSLGGVTLGEFKAPTPEAKISGVTNTTSEASFDLQAVKGS-----NATRILQSISTC-TQ 137 Query: 185 TEQVCLTFSSVIESKRFDSLYKQIDLAPEEFKKLVNEEVENNAATDKAPSTHTS-PVVPV 243 + +CL ++S +Y +D L ++E E K PS+H +VP Sbjct: 138 EDSICL---DDLDSIDIQDIYSDLDNNESVNAFLKSKEEEKTDEVGKTPSSHVDAEIVPE 194 Query: 244 TTPGTKPDLNASFVSANAEQFYQYQPTE--IILSEGRLVDSQGYGVAGVNYYTNSGRGVT 301 TPGT DLN+ FVSANAE Y Y+P+ +L++ +L DS G +AGVN+++ + G+T Sbjct: 195 VTPGTSNDLNSGFVSANAEDSYAYKPSAEAKVLTKSQLTDSTGTPLAGVNFFSANAVGIT 254 Query: 302 GENGEFSFSWGETISFGIDTFELGSVRGNKSTIALTELGDE-VRGANIDQLIHRYSTTGQ 360 GENGEF + WG+ ++FGIDTFE GSV GN+ + +T++ D V ANI LI RY+ Sbjct: 255 GENGEFEYLWGDKLTFGIDTFEFGSVAGNQVSYKITDVSDNAVVKANIQSLITRYAENNH 314 Query: 361 NNTRVVPDDVRKVFAEYPNVINEIINLSLSNGATLGEGEQVVNLPNEFIEQFNTGQAKEI 420 N ++ + V+ F+ YPNVINE+INLSL NG + EG +LP+EF QF G I Sbjct: 315 NGL-LISEKVQDTFSLYPNVINELINLSLPNGGQI-EGTNF-SLPDEFDAQFQNGLTAAI 371 Query: 421 DTAICAKTDGCNEARWFSLTTRNVNDGQIQGVINKLWGVDTNYKSVSKFHVFHDSTNFYG 480 D + + + +FS + V + L + + V+ FHVF+D+ +FYG Sbjct: 372 DAELQQQ----RASFYFSDFPHVFSLDNGTYVTDSLTRI---FNGVTSFHVFNDNGSFYG 424 Query: 481 STGNARGQAVVNISNAAFPILMARNDKNYWLAFGEKRAWDKNELAYITEAPSLVEPEN-- 538 +TG RG +N+SN AFPI+M R D N + FGE++AW + YI P++ P Sbjct: 425 ATGYTRGMRALNLSNRAFPIMMPRADINKDIPFGEQQAWTREGRPYIAVHPTIEMPPIPL 484 Query: 539 VTRDTATFNLPFISLGQVGEGKLMVIGNPHYNSILRCPNGYSWNGGVNKD---GQCTLN- 594 V++D ATF PF++ G++G GK++ +GN Y SI+ CP+ Y N + D CT + Sbjct: 485 VSKDNATFGFPFVTAGEIGSGKVVFMGNSMYPSIISCPDNYWANDALRIDSALQSCTSSF 544 Query: 595 -------SDPDDMKNFMENVLRYLSDDKWKPDAKASMTVGTNLDTVYFKRHGQVTGNSAA 647 +D MK F N+ +L++DK + + V TN+D R G GN+ Sbjct: 545 DLANDPRNDNGSMKTFFNNLFTWLNNDK----SIKGINVATNIDVATALRSGTSHGNAYD 600 Query: 648 FDFHPDFAGISVEHLSS---YGDLDPQEMPLLILNGFEYVTQVGNDPYAIPLRADTSKPK 704 F +P F SVE L+ G L E PLLIL + Q D + AD P Sbjct: 601 FFVNPSFGFSSVEKLTKDGFSGRLSASETPLLILQAYPPKPQG--DGMSHRFIADLDNPN 658 Query: 705 LTQQDVTDLIAYLNKGGSVLIMENVMSNLKEESASGFVRLLDAAGLSMALNKSVVNNDPQ 764 L+Q D+T LI Y+N+GGSVL M+ + + RL D+AG+S L S V Q Sbjct: 659 LSQDDITALITYINEGGSVLFMDAIDKVT---NPEPIGRLADSAGVS--LGGSNVTPTSQ 713 Query: 765 GYPNRVRQQR----------ATGIWVYERYPAVDGALPYTIDSKTGEVKWKYQVENKPDD 814 + + + V ER+ VDG PYT++ + G V+W K + Sbjct: 714 AFCGSSYYCQAPSPNLHVKSQYEMVVLERFQDVDGQQPYTVN-QDGSVEW-----TKDET 767 Query: 815 KPKLEVASW-----------LEDVDGKQ--ETRYAFIDEADHKTEDSLKAAKEKIFAAFP 861 K K E+ ++ L D DG ET++A I K + AA ++ AF Sbjct: 768 KIKFEIPTYEIIKRDDKGDPLLDKDGNPVMETKFARIFV---KNGEERAAAISELQEAFE 824 Query: 862 GLKECTNPAYHYEVNCLEYRPGTGVPVTGGMYVPQYTQLSLNADTAKAMVQAADLGTNIQ 921 G C++ Y YE NC+E R G G+ V G + + + +N D ++MV+AA+LG N Sbjct: 825 GTPLCSHS-YEYEFNCIETRQGDGIQVRGAYWRADFDRYQMNQDVVESMVKAANLGDNFN 883 Query: 922 RLYQHELYFRTNGRKGERLSSVDLERLYQNMSVWLWNDTSYRYEEGKNDELGFKTFTEFL 981 L +HE+Y+RT G++G RLS+V+L + Y N+S+W+WND Y Y+ DELGFKT FL Sbjct: 884 ALMEHEMYYRTKGKQGTRLSTVELNQTYDNLSIWMWNDNPYAYDPNVQDELGFKTAVNFL 943 Query: 982 NCYANDAYAGGT---KCSADLKKSLVDNNMIYGDGSSKAGMMNPSYPLNYMEKPLTRLML 1038 NCY ++ + C +LK +L+ N MI+G+G AG MNPSYPLNYMEKPLTR+ML Sbjct: 944 NCYTDNQHQTDVPEAACPVELKATLIANGMIHGEGE-LAGQMNPSYPLNYMEKPLTRIML 1002 Query: 1039 GRSWWDLNIKVDVEKYPGAVSEEGQNVTETISLYSNPTKWFAGNMQSTGLWAPAQKEVTI 1098 GRS+WD I VD KYPG + + I + AGN QSTGLWAP EVT+ Sbjct: 1003 GRSFWDHEITVDTTKYPGRTNGATTSEVVNIETAGKAVSYSAGNNQSTGLWAPQLSEVTV 1062 Query: 1099 KSNANVPVTVTVALADDLTGREKHEVALNRPPRVTKTYSLDASGTVKFKVPYGGLIYIKG 1158 + +V +TV +ADDLTG+ +HE +LNRPPR+ +++ D T FKVPYGGLIYIK Sbjct: 1063 R--GDVTAMITVMMADDLTGKPQHETSLNRPPRMQMSFAHDGRSTT-FKVPYGGLIYIKP 1119 Query: 1159 N---SSTNESASFTFTGVVKAPFYK------DGAWKNDLNSP-APLGELESDAFVYTTPK 1208 S + A F+ GV KA ++K G W N +S AP+ E+++ +F+YTT Sbjct: 1120 TEILSGASTVAEFSLDGVEKAAWWKKDPANNLGEWVNTPDSSTAPIAEIDTGSFIYTTAL 1179 Query: 1209 KNLNASNYTGGLEQFANDLDTFASSMNDFYGRDSE--DGKHRMFTYKNLPGHKHRFTNDV 1266 N+ T L +F+ +++ FA + +DFYGRD E DGKHR FTY L +HRF NDV Sbjct: 1180 NNVK----TADLNEFSKNMNRFADAASDFYGRDEESADGKHRRFTYPELKEFRHRFVNDV 1235 Query: 1267 QISIGDAHSGYPVMNSSFSPNSTTLPTTPLNDWLIWHEVGHNAAETPLTVPGATEVANNV 1326 QISIG AHSGYPVM+SSF+ +S +PT ++DWL+WHEVGHN A P + PG+TEV NN+ Sbjct: 1236 QISIGAAHSGYPVMSSSFNASSNKIPTNAIDDWLVWHEVGHNLASAPFSAPGSTEVTNNL 1295 Query: 1327 LALYMQD----RYLGKMNRVADDITVAPEYLEESNNQAWARGGAGDRLLMYAQLKEWAEK 1382 LALYMQ+ +M+R+ I AP +L ++ AW+ G AG RL+M+ QLK WAE Sbjct: 1296 LALYMQELEGRNANPEMDRIRTSIQKAPAWLSSNDGHAWSHGDAGLRLVMFGQLKIWAEN 1355 Query: 1383 NFDIKKWYPDGTPLPEFYSEREGMKGWNLFQLMHRKARGDEVSNDKFGGKNYCAE--SNG 1440 +F+I +WY DG P Y++ + GWN+ +LMHRKARGD+ + G NYC+ + Sbjct: 1356 HFEIDRWYVDGETKPAIYNQDQ---GWNMIKLMHRKARGDQQGD---AGINYCSSGDTGL 1409 Query: 1441 NAADTLMLCASWVAQTDLSEFFKKWNPGANAYQLPGASEMSFEGGVSQSAYNTLASLDLP 1500 +A D +M+C+S+V+ DL EFF+ WN G + +++ + GG+S + + LA L L Sbjct: 1410 SAGDLMMVCSSYVSGYDLGEFFQAWNVGETSVTNADGTKV-YSGGISSAGLSKLAELKLN 1468 Query: 1501 KPEQGPETINQV 1512 P++ P TIN + Sbjct: 1469 NPKKDPLTINAL 1480 >UniRef50_Q5E705 Accessory colonization factor AcfD-like protein, predicted inner membrane lipoprotein n=1 Tax=Vibrio fischeri ES114 RepID=Q5E705_VIBF1 Length = 1569 Score = 1421 bits (3677), Expect = 0.0, Method: Composition-based stats. Identities = 516/1641 (31%), Positives = 777/1641 (47%), Gaps = 231/1641 (14%) Query: 9 KSLLAAILSATLLAGCDGGGSGSSSDTPPVDSGTGSLPEVKPDPTPNPEPTPEPTPDPEP 68 K LL A L LLAGC+ +G G++P++ D P Sbjct: 3 KKLLLASLIPMLLAGCN--------QEEININGNGTVPDIGGDGGVTP------------ 42 Query: 69 TPEPIPDPEPTPEPEPEPVPTKTGYLTLGGSQRVTGATCNGESSDGFTFKP--------G 120 P+P P P K ++ + GATC+G SD Sbjct: 43 -------------PKPTPDPIKYRFMITSSGAPIEGATCDGRLSDHLGVIALDYDSNTLP 89 Query: 121 EDVTCVAGNTTIATFNTQSEAARSLRAVEKVSFSLEDAQELAGSDDKKSNAVSLVTSSN- 179 + + C+ T +ATF T + + +RA + + + D + D+ N SL+ + + Sbjct: 90 QSIDCLIAGTPLATFATSTN--KRVRAQD-YNLDIADGKISDLQGDQLVNIQSLLRTVDA 146 Query: 180 -SCPANTEQVCLTFSSVIESKRFDSLYKQIDLAPEEFKKLVNEEVENNAATDKAPST--- 235 +N Q SV K + + Y A E+ ++L ++ DK Sbjct: 147 DGVDSNGYQFVEGEKSV---KNYSANYAD---ALEKTQELFLKDNIGYLKDDKPAGVEPG 200 Query: 236 HTSPVVPVTTPGTKPDLNAS-FVSANAEQFYQYQPTEIILSEGRLVDSQGYGVAGVNYYT 294 H + V PV TPG+ S VS+NAE+ Y+Y+P E+ + E ++ G V GV YY Sbjct: 201 HGTDVEPVVTPGSDDVTGGSGIVSSNAEKQYEYKP-EVAVPEKSILMLDGQPVVGVEYYG 259 Query: 295 NSGRGVTGENGEFSFSWGETISFGIDTFELGSVRGNKSTIALTELG-DEVRGANIDQLIH 353 + RG T +G F ++WG+ ++FGI LGS++ + L L D + N++ LI Sbjct: 260 PTYRGKTDVDGSFEYNWGDEVTFGIQALTLGSIKAKGLDVQLGALAADPSKSKNVENLIK 319 Query: 354 RYSTTGQNNTRVVPDDVRKVFAEYPNVINEIINLSLSNGATLGEGEQVVNLP---NEFIE 410 ++ V+ DV + FA N I E+IN++L++G T P NEFI Sbjct: 320 QFDKDNS-APWVIEQDVHERFALESNNIVELINMNLASGDTSNFDPSFGTPPQVKNEFIA 378 Query: 411 QFNTG-QAKEIDTAICAKTDGCNE---ARWFSLTTRNVNDGQIQGVINKLWGVDTNYKS- 465 QFN G A +I T++ N + + SL + N + ++ + D N + Sbjct: 379 QFNDGGSAFDIVTSLGLSPINVNTYTFSPYVSLRSVNRVETASDALLTMMGQNDNNKDND 438 Query: 466 VSKFHVFHDSTNFYGSTGNARGQAVVNISNAAFPILMARNDKNYWLAFGEKRAWDKNELA 525 V+ FHVF + + Y + +NI N A P++M R+D N ++ FG+ DK Sbjct: 439 VTHFHVFGNQNDGY-HMPSPHAATFINIDNNAAPVVMPRSDLNAYIPFGQLAVTDKFSRP 497 Query: 526 YIT------EAPSLVEPENVT---------------------RDTATFNLPFISLGQVGE 558 + T P+ ++ +N T ++TATF LPF+ G++GE Sbjct: 498 FFTLTNDPKTTPTYIDAKNKTHWNLKEQDVAADSVESSYKMNKETATFELPFVVSGKIGE 557 Query: 559 GKLMVIGNPHYNSILRCPNGYSWNGGVNKDGQCTLNS---DPDDMKNFMENVLRYLSDDK 615 GK++V+GN YNSIL CP YS+N +NKDG C+ + D DM NF N +L K Sbjct: 558 GKVLVLGNSLYNSILVCPENYSFNASINKDGVCSNGNGVTDSLDMFNFFVNAFNWLDTKK 617 Query: 616 WKPDAKASMTVGTNLDTVYFKRHGQVTGN-SAAFDFHPDFAGISVEHLSSYG--DLDPQE 672 D + + TN V F R ++G+ S F + F + +S + LD Sbjct: 618 LNQD----INIATNRSEVSFSR---ISGSASHPFKLNESFKALGFRLMSDFSPQGLDVAS 670 Query: 673 MPLLILNGFEYVTQVGNDPYAIPLRADTSKPKLTQQDVTDLIAYLNKGGSVLIMENVMSN 732 P+ IL + T GN R D P + DV LI Y+N+GGSVLIME+ Sbjct: 671 TPIYILQAY--PTLGGNTD-----RPDYENPIINDDDVNALIDYVNQGGSVLIMES---- 719 Query: 733 LKEESASGFVRLLDAAGLSMALNKSVVNNDPQGYPNRVRQQRATG------IWVYERY-- 784 L + R LD AG+S A+ K+ P+ + G ++ E Y Sbjct: 720 LYNRNLPILGRFLDTAGIS-AIGKNNGVKFAGKLPSNFIAELGKGGTSLRPVYTEEVYVL 778 Query: 785 --------PAVDGALP---YTIDSKTGEVKWKYQVENKPDDKPKLEVASWLEDVDGKQET 833 +P Y D T W + ++K L + V + E Sbjct: 779 EGLAFEANTTDANGIPEKGYKFDKGTNTYYWG---DKSINNKAVLRALYRPQQV--RDEN 833 Query: 834 RYAFIDEADHKTE---DSLKAAKEKIFA--AFPGLKE-------------CTNPAYHYEV 875 R+ E +T+ D+L+A + A GL++ CTN AY Y++ Sbjct: 834 RHNVTKECQQQTDLVDDALQACIDTKLNQLAQDGLEQWVSDVQAIYEVPMCTNSAYQYQL 893 Query: 876 NCLEYRPGTGVPVTGGMY------VPQYTQLSLNADTAKAMVQAADLGTNIQRLYQHELY 929 +C+E R G G+P++ +Y V + +L ++ D + AM++AA++GTN+ LYQHELY Sbjct: 894 DCIERREGNGIPLSKTIYPGATDMVQAFARLPMSKDVSNAMIEAANMGTNLTDLYQHELY 953 Query: 930 FRTNGRKGERLSSVDLERLYQNMSVWLWNDTSYRYEEGKNDELGFKTFTEFLNCYANDAY 989 +RT G++G RLS +D++R+Y N+S W+WN+ YRY+ DE G KT EFLNCY+N+ Y Sbjct: 954 YRTGGKEGVRLSGIDVDRIYNNLSAWMWNNEQYRYDSSTKDEFGHKTVVEFLNCYSNNTY 1013 Query: 990 AGGTK-----CSADLKKSLVDNN-MIYGDGSSKAGMMNPSYPLNYMEKPLTRLMLGRSWW 1043 + C +LK ++ ++ DG +K +NPSYPLNYMEKPLTR+MLGRS++ Sbjct: 1014 GNPSSDNVIGCPEELKAEMLTKGFLVTVDGVNK---LNPSYPLNYMEKPLTRMMLGRSYF 1070 Query: 1044 DLN----------IKVDVEKYPGAVSEEGQNVTETISLYSNPTKWFAGNMQSTGLWAPAQ 1093 D++ ++VDV YPG + TI G+ QS G+W PA Sbjct: 1071 DVDAKNPTAEDRGVQVDVRSYPGVATTTAAAKDITIH---------KGSRQSAGVWIPA- 1120 Query: 1094 KEVTIKSNANVPVTVTVALADDLTGREKHEVALNRPPRVTKTYSLDASGTVKFKVPYGGL 1153 +EV + TV +A+AD+LTGR HE+ALNRPPRV+ +++ + FKVPYGG Sbjct: 1121 REVAYVHGLSSDDTVMIAMADNLTGRVNHEMALNRPPRVSMSFN-GVEASNGFKVPYGGS 1179 Query: 1154 IYIKGNSSTNESASFTFTG-VVKAPFY-----KDGAW-KNDLNSPAPLGELESDAFVYTT 1206 +YI S ESA +F G + AP + +G+W S AP+ E+ F YTT Sbjct: 1180 VYITLGSK--ESAQVSFGGSAIAAPMFMMTSATEGSWITTPEESDAPITEIVGKRFSYTT 1237 Query: 1207 PKKNLNASNYTGGLEQFANDLDTFASSMNDFYGRDSEDGKHRMFTY--KNLPGHKHRFTN 1264 + + LE D F +N+FYGRD G H+MFT L R + Sbjct: 1238 TTAGIKGHSEVDVLE-MTKQFDLFTIGVNEFYGRDGVSGAHKMFTDSAPELEYQNMRLVD 1296 Query: 1265 DVQISIGDAHSGYPVMNSSFSPNSTTLPTTPLNDWLIWHEVGHNAAETPLTVPGATEVAN 1324 D+QISIG AHSGYPVM++SF ++L ++W++ HE+GHN A L V GA E AN Sbjct: 1297 DIQISIGSAHSGYPVMSTSFPRQKSSL-FKATDNWMLGHEIGHNQAANWLNVVGAGETAN 1355 Query: 1325 NVLALYMQDRYLGKMNRVADDITVAPEYLEESNNQAWARGGAGDRLLMYAQLKEWAEKNF 1384 NVLALY Q+R G M R+ IT A E+ + + WA G DRL + QLK WAE NF Sbjct: 1356 NVLALYTQERNTGDMPRIKVSITNATEW--ANGDHPWADGTNADRLNFFGQLKLWAEDNF 1413 Query: 1385 DIKKWYPDGTPLPE--FYSEREGM---KGWNLFQLMHRKARGDEVSNDKF--GGKNYCAE 1437 DI +W + E Y++ E +GWN ++ +HR AR E + G NYC+ Sbjct: 1414 DIAQWESEAKLAEERSIYNKNEAGQYDQGWNFYKYLHRAARMPETFTEGLNKGDVNYCSS 1473 Query: 1438 -----SNGNAADTLMLCASWVAQTDLSEFFKKWNPGANAYQLPGASEMSFEGGVSQSAYN 1492 ++ + D +M+C+S++ D+ FF KW G + L + + G+S+ A Sbjct: 1474 EFAQVNSLSKQDMMMICSSFLTGKDIETFFIKWKFGESKVTLQSGDK--YSVGISEPALG 1531 Query: 1493 TLASLD----LPKPEQGPETI 1509 + + + P+ P I Sbjct: 1532 VMEDMRKQGIIVTPKTSPLDI 1552 >UniRef50_UPI000178AA33 hypothetical protein GYMC10_4678 n=1 Tax=Geobacillus sp. Y412MC10 RepID=UPI000178AA33 Length = 1078 Score = 399 bits (1026), Expect = e-109, Method: Composition-based stats. Identities = 129/553 (23%), Positives = 215/553 (38%), Gaps = 109/553 (19%) Query: 991 GGTKCSADLKKSLVDNNMIYGDGSSKAGMMNPSYPLNYMEKPLTRLML----GRSWWDLN 1046 GGT S + L +I D S + ++PL+ + P T +L GR DL Sbjct: 332 GGTVSSITPQSPL--YAVIEQDASDLESQL--TFPLDRSKAPYTSALLAFQLGRIGNDLT 387 Query: 1047 IKVD--VEKYPGAVSEEGQNVT-ETISLYSNPTKW-------FAGNMQSTGLWAPAQKEV 1096 +++PGAV + VT +++ + + + + N STGL+APA + V Sbjct: 388 APKSPYADQFPGAVPGDAPRVTGQSVHVNFDYSTYDYLRQGTVPKNWISTGLYAPAGEWV 447 Query: 1097 TIKSNAN---VPVTVTVALADDLTGREKHEVALNRPPRVTKTYSLDASGTVKFKVPYGGL 1153 TI + V V A D+LT + + E RPP +T+ L SG + + PYGGL Sbjct: 448 TIHVPEGTQNLDVQVG-AHTDNLTSKTEWE----RPPVITQRKPL-LSGENRIRSPYGGL 501 Query: 1154 IYIKGNS-STNESASFTFTGVVKAPFYKDGA-----WKNDLNS-PAPLGELESDAFVYTT 1206 IY+ + + +G V+AP+Y G W++ + PAP EL+ + T Sbjct: 502 IYLIPTKPQPSVAKDVEISGGVRAPYYILGETSPSEWEDAIRHHPAPWAELQGRRVILTL 561 Query: 1207 PKKNLNASNYTGGLEQFANDLDTFASSMNDFYGRDSEDG-KHRMFTYKNLPGHKHRFTND 1265 P + + +Q D + G + H+ R+ D Sbjct: 562 PSEYIRQLEDP---QQLVEKWDAIVDYTEEVAGLSPDQQLPHKSI------DLPFRYVAD 612 Query: 1266 VQISIGDAHSGYPVM--NSSFSPNSTTLPTTPLNDWLIWHEVGHNAAETPLTVPGATEVA 1323 QIS G H+GYP+M + ++ + W WHE GH + E+ Sbjct: 613 RQISAGYMHAGYPIMFHIDPSAGHAVDISRVTQGGWGFWHETGHEYQQGAWNWNVTGEIT 672 Query: 1324 NNVLALYMQDRYLGKMNRVADDITVAPEYLEESNNQAWARGGAGDRLLMYAQLKEWAEKN 1383 N+ +LY+Q+++ G + + L+ A+ K++ ++ Sbjct: 673 VNIYSLYVQEKF----------------------------GNSSNLLIRNAEGKDFYDRA 704 Query: 1384 FDI------KKWYPDGTPLPEF-----YSEREGMKGWNLFQLMHRKARGDEVSNDKFGGK 1432 FD K + L F + + GW+ + +H+ R S Sbjct: 705 FDYIESDLPGKSFGTSGQLDLFGYLVMFRQLSLAYGWDFYAELHKAYRELPASQ------ 758 Query: 1433 NYCAESNGNAADTLMLCASWVAQTDLSEFFKKWNPGANAYQLPGASEMSFEGGVSQSAYN 1492 +N DT ++ AS A +L+EFF KW + + QS Sbjct: 759 --LPATNQEEIDTFVIMASKTAGENLTEFFDKW-------------ALPYSKAEVQS--- 800 Query: 1493 TLASLDLPKPEQG 1505 +A+L+LP P Q Sbjct: 801 RIAALNLPLPSQE 813 >UniRef50_UPI0001B9ED55 hypothetical protein GYMC10_4682 n=1 Tax=Geobacillus sp. Y412MC10 RepID=UPI0001B9ED55 Length = 836 Score = 382 bits (981), Expect = e-104, Method: Composition-based stats. Identities = 132/541 (24%), Positives = 214/541 (39%), Gaps = 109/541 (20%) Query: 998 DLKKSLVDNNMIYGDGSSKAGMMNPS-YPLNYMEKPLTRLMLGRSWWDLNIKVDVEK--- 1053 + ++L + + ++AG + P+ +P+ ++P T + + + EK Sbjct: 360 ETLETLSSESPLNAWADTEAGSLPPAEFPIQKFQQPYTNALHNFRFSHFTLDPANEKSPY 419 Query: 1054 ---YPGAVSEEGQNVT-ETISLY------SNPTKWFAGNMQSTGLWAPAQKEVTIKSNAN 1103 +PG VSEE V + + + N STGL+AP K +T++ Sbjct: 420 ADAFPGVVSEEAAIVNDREVEVDFDFPNTMYTHALPSKNWISTGLYAPPGKVITLEVPEG 479 Query: 1104 VP-VTVTV-ALADDLTGREKHEVALNRPPRVTKTYSLDASGTVKFKVPYGGLIYIKGNSS 1161 V +TV + + DDL G K E R P + L G + PYGGLIY+ + Sbjct: 480 VEHLTVQIGSHDDDLRGSGKWE----RVPLIVNHQKL-TPGIHQVNSPYGGLIYLIPLKA 534 Query: 1162 TNE-SASFTFTGVVKAPFYKDG-----AWKNDLNSPA--PLGELESDAFVYTTPKKNLNA 1213 N+ A+ +G V+AP+Y G W+ P P EL+ + + T P + Sbjct: 535 KNDFRATVKISGAVEAPYYVLGKTTLEEWERIRTGPVTVPFAELQGERIILTVPSDLIRQ 594 Query: 1214 SNYTGGLEQFANDLDTFASSMNDFYGRDSEDGKHRMFTYKNLPGHKH----RFTNDVQIS 1269 E+ D S ++ G D + + +P H R+ D QIS Sbjct: 595 ---VADPEELMRTWDEIYDSYDELVGLDPD---------RAMPHTAHQLNRRYVADGQIS 642 Query: 1270 IGDAHSGYPVMN-SSFSPNSTTLPTTPLNDWLIWHEVGHNAAETPLTVPGATEVANNVLA 1328 G H+GYP+M S++ N + + W WHE+GH + T EV N+ + Sbjct: 643 SGAMHAGYPIMLPFSYAANLLDVHYVKTSAWGFWHELGHEYQQRTWTWSDVGEVTVNLFS 702 Query: 1329 LYMQDRY--LGKMNRVADDITVAPE---YLEESNNQAWARGGAGD--RLLMYAQLKEWAE 1381 LY Q++Y ++ +V +D + E+N+ A G G+ RL+M+ QL Sbjct: 703 LYTQEKYGNASELLKVGNDGKDYYDRGIAFVENNDPAKKYGQIGNYERLVMFKQL----- 757 Query: 1382 KNFDIKKWYPDGTPLPEFYSEREGMKGWNLFQLMHRKARGDEVSNDKFGGKNYCAESNGN 1441 + GW + + R E+S D+ G Sbjct: 758 ----------------------QLAYGWEFYTRIFETYR--ELSRDEIQG---------- 783 Query: 1442 AADTLMLCASWVAQTDLSEFFKKWNPGANAYQLPGASEMSFEGGVSQSAYNTLASLDLPK 1501 DT + AS A DL+EFF KW G++ +ASL+LP+ Sbjct: 784 TVDTFAVIASQTAGEDLTEFFDKWAI-----------------GLTDDGRARIASLNLPE 826 Query: 1502 P 1502 P Sbjct: 827 P 827 >UniRef50_B4DBQ1 Putative uncharacterized protein n=1 Tax=Chthoniobacter flavus Ellin428 RepID=B4DBQ1_9BACT Length = 759 Score = 358 bits (919), Expect = 9e-97, Method: Composition-based stats. Identities = 88/445 (19%), Positives = 156/445 (35%), Gaps = 59/445 (13%) Query: 1047 IKVDVEKYPGAVSEEGQNV-TETISLYSNPTKWFAGNMQSTGLWAPAQKEVTIKSNANVP 1105 + +PGAV + + + + ++ W STGL+A + + ++ A + Sbjct: 354 ANPVADVFPGAVPKNAPRLPNRVVVIDTSVPAW-----HSTGLYAAPGELIKVQVPAELA 408 Query: 1106 VTVTVALADDLTGREKHEVALNRPPRVTKTYSLDASGTVKFKVPYGGLIYIK-GNSSTNE 1164 + H R P +++ + + P+GGLIYI+ + T Sbjct: 409 DKGLAVRIGCHSDSLFHLEKWQRAPEISRRDPIKTPASTA-ANPFGGLIYIEVPDKLTAA 467 Query: 1165 SASFTFTGVVKAPFYKDGA-----WKNDLN-SPAPLGELESDAFVYTTPKKNLNASNYTG 1218 + +G V++P + G WK L +PAP ELE+ + + P + + + Sbjct: 468 KVNVAISGGVESPRFVLGETKLLEWKMRLRMAPAPWAELETKKVILSVPSEKIRQLDDPE 527 Query: 1219 GLEQFANDLDTFASSMNDFYGRDSEDGKHRMFTYKNLPGHKHRFTNDVQISIGDAHSGYP 1278 L +F + + D + + T + R DVQIS G HSGYP Sbjct: 528 ALLKFWDQI---------------LDAEADLATIPHERKRPERIVPDVQISAGYMHSGYP 572 Query: 1279 VMN--SSFSPNSTTLPTTPLNDWLIWHEVGHNAAETPLTVPGATEVANNVLALYMQDRYL 1336 +M +S +L W +HE+GHN T G EV N+ +LY + Sbjct: 573 IMTPLDKSVEHSLSLVEMKQGSWGHFHELGHNHQVGDWTFDGTVEVTCNLFSLYCMETLC 632 Query: 1337 GKMNRVADDITVAPEYLEESNNQAWARGGAGDRLLMYAQLKEWAEKNFDIKKWYPDGTPL 1396 GK D + PE +E+ + +R W D Sbjct: 633 GKPPGQGHD-AMKPEAVEKRLRGYLSSTDKFNR-------------------WKSDPFLA 672 Query: 1397 PEFYSEREGMKGWNLFQLMHRKARGDEVSNDKFGGKNYCAESNGNAADTLMLCASWVAQT 1456 Y + GW ++ + + R +++ D ++ S A Sbjct: 673 LIMYHQLRVGFGWETYKKVFAEYRDLSKEQR--------PKTDEEKHDQWLVRFSKAAGK 724 Query: 1457 DLSEFFKKWNPGANAYQLPGASEMS 1481 +L FF W + + + Sbjct: 725 NLGPFFDAWGIPTSTTARESINSLP 749 >UniRef50_UPI00017445F8 hypothetical protein VspiD_04825 n=1 Tax=Verrucomicrobium spinosum DSM 4136 RepID=UPI00017445F8 Length = 763 Score = 330 bits (845), Expect = 3e-88, Method: Composition-based stats. Identities = 111/523 (21%), Positives = 181/523 (34%), Gaps = 79/523 (15%) Query: 986 NDAYAGGTKCSADLKKSLVDNNMIYGDGSSKAGMMNPSYPLNYMEKPLT-------RLML 1038 +D G L N + G G + P E+PLT R+ L Sbjct: 284 DDIGQGTNAIQVALAAQPPGRNSLQGAVMGALGEGDSGVPTR--EQPLTMAQHASQRIRL 341 Query: 1039 GR--------SWWDLNIKVDVEKYPGAVSEEGQNVTETISLYSNPTKWFAGNMQSTGLWA 1090 G S + +PG +++ VT+ I++ +N W STGL+A Sbjct: 342 GMETRVLRLASSPTVAAHPASAVFPGQPAKDAPRVTKEITVDANIDGWT-----STGLYA 396 Query: 1091 PAQKEVTIKSNA---NVPVTVTVALADDLTGREKHEVALNRPPRVTKTYSLDASGTVKFK 1147 A + +T+ ++ V D H + R P +T+T ++ K Sbjct: 397 VAGEPITVTVPEAMVGNGFSIRVGCHSDT---LYHLESWRRAPDITRTVGIENVE-TKMG 452 Query: 1148 VPYGGLIYIKGNSSTNESA----SFTFTGVVKAPFYKDG-----AWKNDLNSPAPLGELE 1198 +GGL+YI +A G V++P++K G W N + P EL Sbjct: 453 TAFGGLVYITVPGRARRNASEPFKVKIAGAVESPYFKLGRDTDEQWNNIKKAQGPWAELA 512 Query: 1199 SDAFVYTTPKKNLNASNYTGGLEQFANDLDTFASSMNDFYGRDSEDGKHRMFTYKNLPGH 1258 + V + P + L +F D ++ +D + +E Sbjct: 513 GEKMVVSLPSEVARKITNPTELMEF---WDRVVTAQDDISNQTAE------------RTR 557 Query: 1259 KHRFTNDVQISIGDAHSGYPVM-NSSFSPNSTTLPTTPLNDWLIWHEVGHNAAETPLTVP 1317 R DVQIS G HSGYP+M ++ S T W +HE+GHN T Sbjct: 558 PERMVADVQISAGFMHSGYPIMIHTPESAEMVTYGRIKYPGWGFYHEIGHNHQRGNFTFE 617 Query: 1318 GATEVANNVLALYMQDRYLGKMNRVADDITVAPEYLEESNNQAWARGGAGDRLLMYAQLK 1377 G EV NNV LY L K + V+PE +++ + A G++ Sbjct: 618 GTGEVTNNVFGLYCYTEVLKKELLIGHG-GVSPESIKKHIDAAKKAKDQGEK-------- 668 Query: 1378 EWAEKNFDIKKWYPDGTPLPEFYSEREGMKGWNLFQLMHRKARGDEVSNDKFGGKNYCAE 1437 WA W D Y + GW ++ + Sbjct: 669 -WAI-------WKGDPFHALTTYVQLVQGFGWENYKKYIWSFADPSFG--------PTPK 712 Query: 1438 SNGNAADTLMLCASWVAQTDLSEFFKKWNPGANAYQLPGASEM 1480 ++ D ++ S + + +L FF+ W + S++ Sbjct: 713 NDEEKRDQFLIRFSKITKKNLGPFFEFWGIPVTSSAKAEVSKL 755 >UniRef50_A4GHK6 Putative uncharacterized protein n=1 Tax=uncultured marine bacterium EB0_35D03 RepID=A4GHK6_9BACT Length = 1173 Score = 322 bits (826), Expect = 6e-86, Method: Composition-based stats. Identities = 96/432 (22%), Positives = 175/432 (40%), Gaps = 71/432 (16%) Query: 1069 ISLYSNPTKWFAGNMQSTGLWAPAQKEVTIKSN-----ANVPVTVTVALADDLTGREKHE 1123 + ++ + WF STGL+A A +++TIK A + + + + D + Sbjct: 563 VHIHMSGDNWF-----STGLFASAGQKITIKVPKDLVNAGLKIQIGSHIWGDYIF---NH 614 Query: 1124 VALNRPPRVTKTYSLDASGTVKFKVPYGGLIYIKGNSST----NESASFTFTGVVKAPFY 1179 + + R P +T ++LD S + +GGLIYI + +++S++ +G AP Y Sbjct: 615 MDMRRFPYITYQWNLDQSEVI-VNSSFGGLIYIVDPVNQQINFPKTSSWSISGAYLAPRY 673 Query: 1180 KDGA-----WKNDLN-SPAPLGELESDAFVYTTPKKNLNASNYTGGLEQFANDLDTFASS 1233 G WKN++ PAP E+ESD + T P + + L Sbjct: 674 IHGKTALNDWKNEIRKYPAPWAEIESDKVILTVPSHAIRDLDNPDDL------------- 720 Query: 1234 MNDFYGRDSEDGKHRMFTYKNLPGHKHRFTNDVQISIGDAHSGYPVMNS-SFSPNSTTLP 1292 +F+ R + D + + + R+ D G AH+GYP+M + + P Sbjct: 721 -MEFWAR-AIDAAADLASISRVREFPQRYVTDPNWQWG-AHAGYPIMMAGPWYPYLLNHK 777 Query: 1293 TTPLNDW-LIWHEVGHNAAETPLTVPGATEVANNVLALYMQDRYLG-KMNRVADDITVAP 1350 LN W +HE+GHN G EV+ N+ ++Y+ + G + D + + P Sbjct: 778 KIGLNYWWGTFHELGHNHQMNDWMWDGWGEVSTNLWSVYILETIAGLERKNTWDGMLLFP 837 Query: 1351 EYLEESNNQAWARGGAGDRLLMYAQLKEWAEKNFDIKKWYPDGTPLPEFYSEREGMKGWN 1410 ++ N+ RG ++F I + D E + + + GW Sbjct: 838 GKRQKRINKFIDRG-----------------RSFAILQ--ADPELALEHLLQLQEVFGWE 878 Query: 1411 LFQLMHRKARGDEVSNDKFGGKNYCAESNGNAADTLMLCASWVAQTDLSEFFKKWNPGAN 1470 LF +H+ +DK KN S+ ++ S + +T+L F+K+W Sbjct: 879 LFMALHQSY------HDKPVHKNV---SDNEKIQQFVIRTSQITKTNLINFYKEWGFPIE 929 Query: 1471 AYQLPGASEMSF 1482 + + ++ Sbjct: 930 RSTIDFLANFNY 941 >UniRef50_C1ZFD9 Putative uncharacterized protein n=1 Tax=Planctomyces limnophilus DSM 3776 RepID=C1ZFD9_PLALI Length = 785 Score = 318 bits (814), Expect = 1e-84, Method: Composition-based stats. Identities = 100/438 (22%), Positives = 162/438 (36%), Gaps = 69/438 (15%) Query: 1049 VDVEKYPGAVSEEGQNVTETISLYSNPTKWFAGNMQSTGLWAPAQKEVTIKSNANV---P 1105 +PGAV + VT +++ + T+W +STGL+A V +K N+ Sbjct: 382 PTASVFPGAVPANAKKVTRKVTINTETTRW-----KSTGLYAAPGTLVKVKVPRNIVGQK 436 Query: 1106 VTVTV-ALADDLTGREKHEVALNRPPRVTKTYSLDASGTVKFKVPYGGLIYIKGNSSTNE 1164 + + + +D L +++ RPP V + + +D + YGGLIY+ T Sbjct: 437 FEIQIGSHSDSLWSKDE----WRRPPAVIRQFPIDKVE-FEVGNAYGGLIYVVVPQKTPA 491 Query: 1165 SA-SFTFTGVVKAPFYKDGA-----WKNDL-NSPAPLGELESDAFVYTTPKKNLNASNYT 1217 F+ VV AP++ G W+ + N PAP ELE+ V T P + + ++ Sbjct: 492 GKFEVEFSNVVDAPYFVHGETDISDWRFTIRNYPAPWAELETRHLVITVPSELVRKLDFP 551 Query: 1218 GGLEQFANDLDTFASSMNDFYGRDSEDGKHRMFTYKNLPGHKHRFTNDVQISIGDAHSGY 1277 ++ N + D Y + + RF D QIS G HSGY Sbjct: 552 ---DKLMNHWAAVLDACADLY------------SISRNRPYAERFVFDDQISAGFMHSGY 596 Query: 1278 PVMNS--SFSPNSTTLPTTP-LNDWLIWHEVGHNAAETPLTVPGATEVANNVLALYMQDR 1334 P+M +P L W +HE+GHN + T G EV NN+ LY+ D Sbjct: 597 PIMCFTNPSAPEVVDLNFLENKGGWGFYHELGHNHQKGDWTFQGTGEVTNNLTPLYVIDT 656 Query: 1335 YLGKMNRVADDITVAPEYLEESNNQAWARGGAGDRLLMYAQLKEWAEKNFDIKKWYPDGT 1394 K + D PE ++ + + GA W D Sbjct: 657 LTPKA--FSHDAIQQPE--RDNRERKYVMNGAPFS------------------TWQEDPF 694 Query: 1395 PLPEFYSEREGMKGWNLFQLMHRKARGDEVSNDKFGGKNYCAESNGNAADTLMLCASWVA 1454 Y + + GW F+ + + + +S + D M+ S Sbjct: 695 LALTMYIQLKEQFGWQPFRDVFLEYEKLQKDEH--------PKSEMDKRDQWMVRFSRKV 746 Query: 1455 QTDLSEFFKKWNPGANAY 1472 +L FF+ W + Sbjct: 747 NRNLGPFFQYWGVPTSEN 764 >UniRef50_B2ULE8 Putative uncharacterized protein n=1 Tax=Akkermansia muciniphila ATCC BAA-835 RepID=B2ULE8_AKKM8 Length = 660 Score = 296 bits (758), Expect = 4e-78, Method: Composition-based stats. Identities = 105/471 (22%), Positives = 165/471 (35%), Gaps = 75/471 (15%) Query: 1045 LNIKVDVEKYPGAVSEEGQNVTETISLYSNPTKWFAGNMQSTGLWAPAQKEVTIKSNANV 1104 L + +PG Q V T+ + SN G STGL+AP E++ S + Sbjct: 246 LKACPAAKDFPGVPENGAQTVRRTVEIDSN-----IGGWHSTGLYAPPGAEISC-SLSGA 299 Query: 1105 PVTVTVALADDLTGREKHEVA-LNRPPRVTKTYSLDASGTVKFKVPYGGLIYIKGNSSTN 1163 P ++++ H++ R P +T G VK P GGL+Y+ Sbjct: 300 PKDGSISVRIGCHTDSLHKLDEWKRVPEITMQVPA-GRGRVKMVNPMGGLVYVNVGQRPR 358 Query: 1164 ESASFT--FTGVVKAPFYKDG-----AWKNDL-NSPAPLGELESDAFVYTTPKKNLNASN 1215 F +G V +P + G W L N+ AP GE+ + T P + L Sbjct: 359 RGKVFKVQISGAVPSPLFVMGKTTPEQWAEQLENTKAPWGEIRMPRLIVTMPVEQLKQCP 418 Query: 1216 YTGGLEQFANDLDTFASSMNDFYGRDSEDGKHRMFTYKNLPGHKHRFTNDVQISIGDAHS 1275 +++ A L + + G D T + H RF D QIS G HS Sbjct: 419 ---DVQKTAEFLQKNMALQDWIMGWD---------TKPDRLHHPMRFVVDRQISAGAGHS 466 Query: 1276 GYPVMNSSFSPNS-TTLPTTPLNDWLIWHEVGHNAAETPLTVPGATEVANNVLALYMQDR 1334 GYP M + NS T W +WHE+GHN P T+ G TEV+ N+ ++ + Sbjct: 467 GYPAMATKDWTNSIATGSIIHSGSWGLWHELGHNHQSPPFTMEGQTEVSVNIFSMVCEVM 526 Query: 1335 YLGKMNRVADDITVAPEYLEESNNQAWARGGAGDRLLMYAQLKEWAEKNFDIKKWYPDGT 1394 GK + P + + ++ + QL W E Sbjct: 527 GTGKDFESCWGGGMGPYGMSAEMKKYFSGTQTYNEAPNKVQLFFWVE------------- 573 Query: 1395 PLPEFYSEREGMKGWNLFQLMHRKARGDEVSNDKFGGKNYCAESNGNAADTLMLCASWVA 1454 +Y G++ F+ + + N + S+ + +M S V Sbjct: 574 --LMYYL------GFDAFRQVALQFHDKPYDNGEL--------SDEKKWEWVMNAFSKVT 617 Query: 1455 QTDLSEFFKKWNPGANAYQLPGASEMSFEGGVSQSAYNTLASLDLPKPEQG 1505 ++ FFK W VS+ A + L P + Sbjct: 618 GKNMGPFFKIWRTP-----------------VSERATGRMKDLPAWLPSKD 651 >UniRef50_C2G4C5 Putative uncharacterized protein n=2 Tax=Sphingobacterium spiritivorum RepID=C2G4C5_9SPHI Length = 665 Score = 282 bits (722), Expect = 6e-74, Method: Composition-based stats. Identities = 95/447 (21%), Positives = 154/447 (34%), Gaps = 81/447 (18%) Query: 1051 VEKYPGAVSEEGQN-VTETISLYSN---------PTKWFAGNMQSTGLWAPAQKEVTIKS 1100 +PG V+ +S+ N G QST ++APA + + I Sbjct: 61 ARLFPGVVATTEPRLENYKVSIDLNYVEVSPSDLRISVAPGAWQSTSMFAPAGELIVIDV 120 Query: 1101 NANVPVTVTVALADDLTGREKHEVALNRPPRVTKTYSLDASGTVKFKVPYGGLIYIKGNS 1160 V TG K E R ++ + L G + YGGL+YI Sbjct: 121 PQGVYGLKAQVGPHVYTGSTKIEFP-RRDEKIVVSKDL-FPGKNYIRNLYGGLVYIIPER 178 Query: 1161 STNESASFTFTGVVKAPFYK-----DGAWKNDLN-SPAPLGELESDAFVYTTPKKNLNAS 1214 F+G AP +K D WK+ +N S P ELE + V+T + L Sbjct: 179 PLGRVVDLLFSGTTLAPSFKLGKMTDQQWKDLVNKSSVPWFELEGNRIVFTLQTERLKRF 238 Query: 1215 NYTGGLEQFANDLDTFASSMNDFYG--RDSEDGKHRMFTYKNLPGHKHRFTNDVQISIGD 1272 E + D+ G + D KHR P +K R +DV G Sbjct: 239 PINSPTELMELWDKMIKEAYWDWTGMTEGNPDVKHRA------PFNKWRIVHDVLFEPGV 292 Query: 1273 AH-SGYPV---MNSSFSPNSTTLPTTPLNDWLIWHEVGHNAAETP-LTVPGATEVANNVL 1327 A SGYPV N + + T+ + +W +HE+GHN + + G EV NN+ Sbjct: 293 AQVSGYPVRAGANDQYFGQAVTINSVRTQNWGTYHELGHNMQQGRVWSFDGNGEVTNNLF 352 Query: 1328 AL---YMQDRYLGKMNRVA----DDITVAPEYLEESNNQAW------ARGGAGDRLLMYA 1374 + + R K+ V + I P+ ++N + W ++ +L+MYA Sbjct: 353 SFKVAMINGRQHTKIAEVWPTGLEWINYVPKDAADANRKIWANMPTLSKNHNDAKLIMYA 412 Query: 1375 QLKEWAEKNFDIKKWYPDGTPLPEFYSEREGMKGWNLFQLMHRKARGDEVSNDKFGGKNY 1434 Q+ E G+ ++ +AR + Sbjct: 413 QIFE---------------------------KYGYGFMTYLYTRARNA----------RF 435 Query: 1435 CAESNGNAADTLMLCASWVAQTDLSEF 1461 + ++ + D + D+ F Sbjct: 436 ESANDQSKIDFFYEALCEYTKVDMEPF 462 >UniRef50_D2VFD2 Predicted protein n=1 Tax=Naegleria gruberi RepID=D2VFD2_NAEGR Length = 934 Score = 265 bits (677), Expect = 1e-68, Method: Composition-based stats. Identities = 92/430 (21%), Positives = 151/430 (35%), Gaps = 76/430 (17%) Query: 1052 EKYPGAVSE-----EGQNVTETISLYSNPTKWFAGNMQSTGLWAPAQKEVTIKSNANVPV 1106 E++PG + T +++ +N +W Q TG +A T++ + P Sbjct: 355 EQFPGLPNNGLTYRNAPTETLKMNVSTNRKRW-----QCTGFYALPGA--TLEVILSNPK 407 Query: 1107 TVTVALADDLTGREKHEVALNRPPRVTKTYSLDAS--GTVKFKVPYGGLIYIKGNSSTNE 1164 V+ T H + +R P ++KT+S+ A+ T GG IYI+ SS Sbjct: 408 GVSNVRIGGHTDGIAHLDSWSRWPSISKTFSISATTNSTGTITCLNGGTIYIEV-SSIPL 466 Query: 1165 SASFTFTG-VVKAPFYKDG-----AWKNDL-NSPAPLGELESDAFVYTTPKKNLNASNYT 1217 S T G VVK PF+KDG W + + N P P E+ES+ F + N+ A+ Sbjct: 467 SVDVTVIGQVVKTPFFKDGIHTDQEWNSTIRNYPGPWVEIESEHFAF-----NVQATPTA 521 Query: 1218 GGLEQFANDLDTFASSMNDFYGRDSEDGKHRMFTYKNLPGHKHRFTNDVQISIGDAHSGY 1277 L + + + + + +Y + + +K R D+QIS G HSG Sbjct: 522 RQLSEVSTVAKYWGNVVAMYY----------ELSQRQTRDYKERMQADIQISAGYMHSGL 571 Query: 1278 PVMNSSFSPNSTTLPTTPLNDWLIWHEVGHNAAETPLTVPGATEVANNVLALYMQDRYLG 1337 + S S W +HE+GHN E T + EV N+ +LY+++ Y Sbjct: 572 SFYDFSRSERWN----YSSRRWGHYHELGHNFQEGAWTYDQSGEVTCNIFSLYLEEHYPD 627 Query: 1338 KMNRVADDITVAPEYLEESNNQAWARGGAGDRLLMYAQLKEWAEKNFDIKKWYPDGTPLP 1397 N + + W G Sbjct: 628 SANTYGKGFN-----PPRGLETTYKGNTQPFK-----------------GDWTSAG---L 662 Query: 1398 EFYSEREGMKGWNLFQLMHRKARGDEVSNDKFGGKNYCAESNGNAADTLMLCASWVAQTD 1457 FY + GW + + + +++ D M S V + Sbjct: 663 LFYLDLIQAFGWVAIRKTFVDYQKNGPQ----------PQNDDQKRDQWMTRFSKVVGRN 712 Query: 1458 LSEFFKKWNP 1467 L F W+ Sbjct: 713 LGPFCDAWSF 722 >UniRef50_A4IG42 Protein FAM115 n=14 Tax=Clupeocephala RepID=F115_DANRE Length = 912 Score = 258 bits (659), Expect = 1e-66, Method: Composition-based stats. Identities = 101/473 (21%), Positives = 156/473 (32%), Gaps = 78/473 (16%) Query: 1030 EKPLTRLML--GRSWWDLNIKVD------VEKYPGAVSEEGQNVTETISLYSNPTKWFAG 1081 E P LML G + + D ++ P + V T + Sbjct: 486 ESPKDHLMLHIGTEVYKVTPDPDALLPYIIKDRPNLPTLSNARVRITANT------GGCE 539 Query: 1082 NMQSTGLWAPAQKEVTIKSNANVPV---TVTVALADDLTGREKHEVALNRPPRVTKTYSL 1138 STGL+ + I + V + D G L R P V + L Sbjct: 540 EWISTGLYLSPGMKTYIAVPPEIVGKNWQVQLGCQTDNIGGSN---TLKRAPVVHARFPL 596 Query: 1139 DASGTVKFKVPYGGLIYIKGNSSTN-ESASFTFTGVVKAPFYKDGA-----WKNDLN-SP 1191 D S V+ +GGLIY+ S T + V+AP++K G W + + +P Sbjct: 597 D-SEMVQVWNLWGGLIYLIAPSQTKVDGVEIVVQNAVQAPYFKSGETSVADWVSHIRQAP 655 Query: 1192 APLGELESDAFVYTTPKKNLNASNYTGGLEQFANDLDTFASSMNDFYGRDSEDGKHRMFT 1251 AP ELE + + T + + ++ A DT ++ D R + Sbjct: 656 APWAELEFENLIMTFDSAFIRNLDRP---DEVAKLWDTIMRTITDLAARPPK-------- 704 Query: 1252 YKNLPGHKHRFTNDVQISIGDAHSGYPV-MNSSFSPNSTTLPTT-PLNDWLIWHEVGHNA 1309 LP K RF DVQIS G H+GYP+ M+S +P + W HE+GHN Sbjct: 705 ---LP-RKERFVADVQISYGFMHAGYPIMMHSGSAPGLVNVEEAYKCGLWGAIHELGHNQ 760 Query: 1310 AETPLTV-PGATEVANNVLALYMQDRYLGKMNRVADDITVAPEYLEESNNQAWARGGAGD 1368 P TE N+ +LY+ ++ G + A + P + + G + Sbjct: 761 QRGVWEFPPHTTECTCNLWSLYVHEQVFGIKSANAH-PAITPADRQARTKMYFDGGKDLN 819 Query: 1369 RLLMYAQLKEWAEKNFDIKKWYPDGTPLPEFYSEREGMKGWNLFQLMHRKARGDEVSNDK 1428 M+ E Y + + GW+ F+ + Sbjct: 820 SWCMW---------------------MALETYMQLQEKFGWDAFKKVFSLYH-------- 850 Query: 1429 FGGKNYCAESNGNAADTLMLCASWVAQTDLSEFFKKWNPGANAYQLPGASEMS 1481 N + S V +LS FFK W S + Sbjct: 851 --DMTGVPNDNAGKMNLYAQTFSKVVNLNLSPFFKAWGWPIQPNTEQNLSHLP 901 >UniRef50_C2G0M2 Putative uncharacterized protein n=2 Tax=Sphingobacterium spiritivorum RepID=C2G0M2_9SPHI Length = 675 Score = 253 bits (645), Expect = 4e-65, Method: Composition-based stats. Identities = 102/455 (22%), Positives = 153/455 (33%), Gaps = 79/455 (17%) Query: 1051 VEKYPGAVSEEGQNV---TETISLYSNPTKW-------FAGNMQSTGLWAPAQKEVTIKS 1100 +PG V E + T I + N + STGL+AP + V I Sbjct: 65 ARVFPGLVGENVPRIKDTTVIIDMNKNVISSRDYKISVAPQAIYSTGLYAPPGENVKITV 124 Query: 1101 NAN-VPVTVTV-ALADDLTGREKHEVALNRPPRVTKTYSLDASGTVKFKVPYGGLIYIKG 1158 + +TV + A D+LTG+E L R P + L A G + YGG I+++ Sbjct: 125 PEGLIGLTVQIGAHMDNLTGKE----TLKRDPVIYTVKEL-APGVNYVRNLYGGTIWVRS 179 Query: 1159 NSSTNESASFTFTGVVKAPFYKDGA-----WKND-LNSPAPLGELESDAFVYTTPKKNLN 1212 N + + F+G V+A + G WK D L + P E+ V T P+ N+ Sbjct: 180 NVARPIPVNLKFSGPVRASDFVHGQSDIAAWKKDVLANNVPWLEIRGKHMVMTVPRANVV 239 Query: 1213 ASNYTGGLEQFANDLDTFASSMNDFYGRDSEDGKHRMFTYKNL----PGHKHRFTNDVQI 1268 G NDL+ + N Y +D D T + P R D+Q Sbjct: 240 TFINQGRF----NDLNEVMAEWNIVYEKDYYDWMGLSATAAEVKNRYPEFPQRVVLDIQP 295 Query: 1269 SIGDAHSGYPVM---NSSFSPNSTTLPTTP-LNDWLIWHEVGHNAAET-PLTVPGATEVA 1323 S+G AHSG+P + + + T L T N W +HEVGHN +T + E + Sbjct: 296 SLGYAHSGFPWVAQNDLQWLDELTNLTTIHNGNSWGSYHEVGHNFQQTSTWSWSDLGETS 355 Query: 1324 NNVL------------ALYMQDRYLGKMNRVADDITVAPEYLEESNNQAWARGGAG-DRL 1370 NN+ L + + + A G A RL Sbjct: 356 NNLFIFNGGHRRGNATILNFHPALKTAIPTALTFAASLTGKNFSNLDGVIADGEAPFFRL 415 Query: 1371 LMYAQLKEWAEKNFDIKKWYPDGTPLPEFYSEREGMKGWNLFQLMHRKARGDEVSNDKFG 1430 + QL + + G GW ++ K+R + Sbjct: 416 TPFLQLFDKIQGK--------------------NGESGWAFMTYLYNKSRNSD------- 448 Query: 1431 GKNYCAESNGNAADTLMLCASWVAQTDLSEFFKKW 1465 Y + D D F W Sbjct: 449 ---YQFSLDQAKRDYFYRSLCEFTGRDYYRFMVAW 480 >UniRef50_C3XPE8 Putative uncharacterized protein n=1 Tax=Branchiostoma floridae RepID=C3XPE8_BRAFL Length = 676 Score = 244 bits (623), Expect = 2e-62, Method: Composition-based stats. Identities = 86/436 (19%), Positives = 145/436 (33%), Gaps = 113/436 (25%) Query: 1049 VDVEKYPGAVSEEGQNVTETISLYSNPTKWFAGNMQSTGLWAPAQKEVTIKSNA------ 1102 + +PG EE Q ++ + S + STG + PA K +TIK+ Sbjct: 296 PGINDFPGDFEEEPQLQCASVIIKSARKE-----RHSTGYYLPAGKVLTIKATTTGEHLD 350 Query: 1103 NVPVTVTVALADDLTGREKHEVALNRPPRVTKTYSLDASGTVKFKVPYGGLIYIKGNSST 1162 + V V A +D+L+G++K R P+++ L+A K PYGGLIY + Sbjct: 351 DWRVRVG-AHSDNLSGKKK----FKRWPKLSVVKRLEAE--TKISSPYGGLIYFESPKEA 403 Query: 1163 NESASFTFTGVVKAPFY------KDGAWKNDLNSPAPLGELESDAFVYTTPKKNLNASNY 1216 + + + VV+APFY W +P +L + ++T P ++ Sbjct: 404 GD-LTASMENVVEAPFYDLEDPRSVQNWSERRKAPGLWADLAGEHIIFTLPAASVRDLED 462 Query: 1217 TGGLEQFANDLDTFASSMNDFYGRDSEDGKHRMFTYKNLPGHKHRFTNDVQISIGDAHSG 1276 + D + ++ G + H+ R D Q G H+G Sbjct: 463 PS---EGLRAWDDVVKAHHELRGTNPSG------------EHRQRIVPDRQPKAGWMHAG 507 Query: 1277 YPVM-NSSFSPNSTTL----PTTPLNDWLIWHEVGHNAAETPLTVPGATEVANNVLALYM 1331 YP++ N + + +W ++HE+GHN T G E Sbjct: 508 YPIVTNMDIAARDKFILDGKKIRKAGNWGLFHELGHNMQRKWWTFEGTGE---------- 557 Query: 1332 QDRYLGKMNRVADDITVAPEYLEESNNQAWARGGAGDRLLMYAQLKEWAEKNFDIKKWYP 1391 + D+ W + AG L +YAQL +F Sbjct: 558 --------DGAQHDV--------------WKK-KAGIALAVYAQL----AHHF------- 583 Query: 1392 DGTPLPEFYSEREGMKGWNLFQLMHRKARGDEVSNDKFGGKNYCAESNGNAADTLMLCAS 1451 GW ++ + R+ D E+N ++ S Sbjct: 584 ----------------GWEPYKKVFRQYEKDPKKKQ--------PENNKERIVLWIVRFS 619 Query: 1452 WVAQTDLSEFFKKWNP 1467 +L F W Sbjct: 620 EEVGRNLVPLFDFWGF 635 >UniRef50_Q9Y4C2 Protein FAM115A n=54 Tax=Amniota RepID=F115A_HUMAN Length = 921 Score = 238 bits (607), Expect = 1e-60, Method: Composition-based stats. Identities = 98/444 (22%), Positives = 155/444 (34%), Gaps = 68/444 (15%) Query: 1038 LGRSWWDLNIKV-DVEKYPGAVSEEGQNVTETISLYSNPTKWFAGNMQSTGLWAPAQKEV 1096 L S DL++ V ++E + T+ + + STGL+ P ++ + Sbjct: 499 LAHSGSDLSLLVPEIEDMYSSPYLRPSESPITVEV-NCTNPGTRYCWMSTGLYIPGRQII 557 Query: 1097 TIKSN---ANVPVTVTVA-LADDLTGREKHEVALNRPPRVTKTYSLDASGTVKFKVPYGG 1152 + A+ + + + DDLT K L R P V LD T +GG Sbjct: 558 EVSLPEAAASADLKIQIGCHTDDLTRASK----LFRGPLVINRCCLDKP-TKSITCLWGG 612 Query: 1153 LIYIKGNSSTN-ESASFTFTGVVKAPFYKDGA-----WKNDLN-SPAPLGELESDAFVYT 1205 L+YI ++ S T G V AP+YK G WK + +P P GEL +D + T Sbjct: 613 LLYIIVPQNSKLGSVPVTVKGAVHAPYYKLGETTLEEWKRRIQENPGPWGELATDNIILT 672 Query: 1206 TPKKNLNASNYTGGLEQFANDLDTFASSMNDFYGRDSEDGKHRMFTYKNLPGHKHRFTND 1265 P NL L + D ++ R+ R D Sbjct: 673 VPTANLRTLENPEPLLRL---WDEVMQAV------------ARLGAEPFPLRLPQRIVAD 717 Query: 1266 VQISIGDAHSGYPVM-NSSFSPNSTTLPTTPL-NDWLIWHEVGHNAAETPLTV-PGATEV 1322 VQIS+G H+GYP+M + W HE+G N P TE Sbjct: 718 VQISVGWMHAGYPIMCHLESVQELINEKLIRTKGLWGPVHELGRNQQRQEWEFPPHTTEA 777 Query: 1323 ANNVLALYMQDRYLGKMNRVADDITVAPEYLEESNNQAWARGGAGDRLLMYAQLKEWAEK 1382 N+ +Y+ + LG + R +I + P E+ ++G +K W Sbjct: 778 TCNLWCVYVHETVLG-IPRSRANIALWPPVREKRVRIYLSKGPN---------VKNW--- 824 Query: 1383 NFDIKKWYPDGTPLPEFYSEREGMKGWNLFQLMHRKARGDEVSNDKFGGKNYCAESNGNA 1442 + E Y + + GW F + + R N + Sbjct: 825 ---------NAWTALETYLQLQEAFGWEPFIRLFTEYRNQTN----------LPTENVDK 865 Query: 1443 ADTLMLCASWVAQTDLSEFFKKWN 1466 + + S Q +L+ FF+ W Sbjct: 866 MNLWVKMFSHQVQKNLAPFFEAWA 889 >UniRef50_Q5XHI4 Protein FAM115 n=4 Tax=Anura RepID=F115_XENLA Length = 905 Score = 232 bits (590), Expect = 1e-58, Method: Composition-based stats. Identities = 92/414 (22%), Positives = 157/414 (37%), Gaps = 68/414 (16%) Query: 1066 TETISLYSNPTKWFAGNMQSTGLWAPAQKEVTIKSNANV---PVTVTVAL-ADDLTGREK 1121 + T+ + +STGL+ +K ++ A+ + V V +DDL+ +K Sbjct: 511 SITVEIDGTNPG--NNAWRSTGLYLAPRKTAVLEFPASAVHQGLQVQVGCQSDDLSSADK 568 Query: 1122 HEVALNRPPRVTKTYSLDASGTVKFKVPYGGLIYIKGNSSTNESA-SFTFTGVVKAPFYK 1180 + R P V + + +D S V +GGL+YI +++N AP Y Sbjct: 569 YC----RAPVVVRRFHVD-SQRVSVSCFWGGLVYITVKANSNLGIIPVKVYEAEPAPIYI 623 Query: 1181 DGA-----W-KNDLNSPAPLGELESDAFVYTTPKKNLNASNYTGGLEQFANDLDTFASSM 1234 G W ++ N PAP EL ++ + T P + + + E + D ++ Sbjct: 624 KGKTSLDTWIQSIRNLPAPWAELITENIILTVPSDAIRSLSDP---EALLSLWDKIMVAI 680 Query: 1235 NDFYGRDSEDGKHRMFTYKNLPGHKHRFTNDVQISIGDAHSGYPVM-NSSFSPNSTTLPT 1293 + K LP RF DVQIS G H+GYP+M + + T L Sbjct: 681 TEL-----------AAIPKKLP-RPERFVADVQISAGWMHAGYPIMCHLESAKELTDLNI 728 Query: 1294 TPLND-WLIWHEVGHNAAETPLTV-PGATEVANNVLALYMQDRYLGKMNRVADDITVAPE 1351 W HE+GHN +T + P TE N+ ++Y+ + LG A A Sbjct: 729 MQTGGIWGPIHELGHNQQKTNWELPPHTTEATCNLWSVYVHETVLGIPRSQAHCCLQA-- 786 Query: 1352 YLEESNNQAWARGGAGDRLLMYAQLKEWAEKNFDIKKWYPDGTPLPEFYSEREGMKGWNL 1411 E N ++E+ +I++W + E Y + + GW Sbjct: 787 --ETRANH----------------IQEYLRNGSNIEQW--NVWTALETYLQLQEGFGWEP 826 Query: 1412 FQLMHRKARGDEVSNDKFGGKNYCAESNGNAADTLMLCASWVAQTDLSEFFKKW 1465 F+ + + + ++ N + + S QT+L FF+ W Sbjct: 827 FKQLFKDYQSMSGIRNE----------NKSKMNLWAEKFSEAVQTNLVPFFEAW 870 >UniRef50_B4DK02 cDNA FLJ57809 n=1 Tax=Homo sapiens RepID=B4DK02_HUMAN Length = 815 Score = 220 bits (561), Expect = 3e-55, Method: Composition-based stats. Identities = 93/415 (22%), Positives = 140/415 (33%), Gaps = 71/415 (17%) Query: 1065 VTETISLYSNPTKWFAGNMQSTGLWAPAQKEVTI---KSNANVPVTVTVA-LADDLTGRE 1120 V N W STGL+ + + ++ A+ + V + DDLT Sbjct: 427 VEREFHRKGNNDCWV-----STGLYLLEGQNAEVSLSEAAASAGLRVQIGCHTDDLTKAR 481 Query: 1121 KHEVALNRPPRVTKTYSLDASGTVKFKVPYGGLIYIKGNSSTNES-ASFTFTGVVKAPFY 1179 K L+R P VT +D + +GGL+Y+ + T G V AP+Y Sbjct: 482 K----LSRAPVVTHQCWMDRTER-SVSCLWGGLLYVIVPKGSQLGPVPVTIRGAVPAPYY 536 Query: 1180 KDG-----AWKNDLNSP-APLGELESDAFVYTTPKKNLNASNYTGGLEQFANDLDTFASS 1233 K G WK + AP GEL +D + T P NL A + + +++ Sbjct: 537 KLGKTSLEEWKRQMQENLAPWGELATDNIILTVPTTNLQALKDPEPVLRLWDEM------ 590 Query: 1234 MNDFYGRDSEDGKHRMFTYKNLPGHKHRFTNDVQISIGDAHSGYPVMN--SSFSPNSTTL 1291 R+ R DVQIS G HSGYP+M S + Sbjct: 591 ---------MQAVARLAAEPFPFRRPERIVADVQISAGWMHSGYPIMCHLESVKEIINEM 641 Query: 1292 PTTPLNDWLIWHEVGHNAAETPLTV-PGATEVANNVLALYMQDRYLGKMNRVADDITVAP 1350 W HE+GHN P TE N+ ++Y+ + LG A + P Sbjct: 642 DMRSRGVWGPIHELGHNQQRHGWEFPPHTTEATCNLWSVYVHETVLGIPRAQAHEALSPP 701 Query: 1351 EYLEESNNQAWARGGAGDRLLMYAQLKEWAEKNFDIKKWYPDGTPLPEFYSEREGMKGWN 1410 E + A G G L D W E Y + + GW Sbjct: 702 E----RERRIKAHLGKGAPLC-------------DWNVW-----TALETYLQLQEAFGWE 739 Query: 1411 LFQLMHRKARGDEVSNDKFGGKNYCAESNGNAADTLMLCASWVAQTDLSEFFKKW 1465 F + + + + + N + + S + +L FF+ W Sbjct: 740 PFTQLFAEYQTLS----------HLPKDNTGRMNLWVKKFSEKVKKNLVPFFEAW 784 >UniRef50_UPI000155BFC0 PREDICTED: similar to FLJ00264 protein n=1 Tax=Ornithorhynchus anatinus RepID=UPI000155BFC0 Length = 540 Score = 217 bits (552), Expect = 3e-54, Method: Composition-based stats. Identities = 86/385 (22%), Positives = 137/385 (35%), Gaps = 63/385 (16%) Query: 1091 PAQKEVTIKSNANVPVTVTVALADDLTGREKHEVALNRPPRVTKTYSLDASGTVKFKVPY 1150 A T V V +DDLT E+ L RPP VT + + S V + Sbjct: 173 VAGVYFTXXXXXXXXVQVG-CHSDDLTEAEE----LKRPPVVTSRFDV-CSERVTVSSLW 226 Query: 1151 GGLIYIKGNSSTN-ESASFTFTGVVKAPFYKDGA-----W-KNDLNSPAPLGELESDAFV 1203 GGL+YI + + T G ++APF++ G W K + PAP ELE+D Sbjct: 227 GGLMYIVLPTGCQLDPIPVTVRGAMQAPFFRLGETSPSAWHKTLRHHPAPWAELETDNLT 286 Query: 1204 YTTPKKNLNASNYTGGLEQFANDLDTFASSMNDFYGRDSEDGKHRMFTYKNLPGHKHRFT 1263 T P +N+ + L + +++ ++ R Sbjct: 287 LTVPAENICLLDSPEPL---LDIWQQIMGAVS------------KLAAVPQTLRRPERIV 331 Query: 1264 NDVQISIGDAHSGYPVM-NSSFSPNSTTLPTT-PLNDWLIWHEVGHNAAETPLTV-PGAT 1320 DVQIS G H+GYP+M + L W HE+GHN + P + Sbjct: 332 ADVQISAGWMHAGYPIMIHLESVEEVVNLQRIWEKGLWGPLHELGHNQQRSNWEFPPHTS 391 Query: 1321 EVANNVLALYMQDRYLGKMNRVADDITVAPEYLEESNNQAWARGGAGDRLLMYAQLKEWA 1380 E N+ ++Y+ + LG + R ++ PE +G A LK+W Sbjct: 392 EATCNLWSVYVSETVLG-IPRHEAHTSLRPEARASRVKAFLEQG---------APLKKW- 440 Query: 1381 EKNFDIKKWYPDGTPLPEFYSEREGMKGWNLFQLMHRKARGDEVSNDKFGGKNYCAESNG 1440 + E Y + + GW F + + + + N Sbjct: 441 -----------EVFVALETYLQLQEAFGWEPFIQLFADYQTETG----------VPDDNK 479 Query: 1441 NAADTLMLCASWVAQTDLSEFFKKW 1465 + S + + +L+ FFK W Sbjct: 480 AKMNLWAQKFSQLVKKNLAPFFKAW 504 >UniRef50_A2EC39 Putative uncharacterized protein n=1 Tax=Trichomonas vaginalis RepID=A2EC39_TRIVA Length = 734 Score = 196 bits (499), Expect = 5e-48, Method: Composition-based stats. Identities = 52/234 (22%), Positives = 91/234 (38%), Gaps = 16/234 (6%) Query: 1045 LNIKVDVEKYPGAVSEEGQNVTETISLYSNPTKWFAGNMQSTGLWAPAQKEVTIKSNANV 1104 + D E +PG T+ + + + STGLW PA I S+ Sbjct: 359 IKASPDAEDFPGLCP------NATVENHELEVRLHEESWISTGLWLPAGSLGEIISDPED 412 Query: 1105 PVTVTVALADDLTGREKHEVALNRPPRVTKTYSLDASGTVKFKVPYGGLIYI---KGNSS 1161 ++ + + R P V+KTY +D SG + P+GG++YI + N Sbjct: 413 KSIFSIQIGSHTESLLSRQGPWKRWPVVSKTYDIDPSGKTEIASPFGGIVYITVRELNEE 472 Query: 1162 TNESASFTFTGVVKAPF---YKDGAWKNDLNSPAPLGELESDAFVYTTPKKNLNASNYTG 1218 T+ F G V+ P K W++ + P GE+ S ++T P + Sbjct: 473 TDNRVKLKFNGFVRHPRAVIRKPEIWESSKDYQVPWGEMCSKTVIFTLPSDEIRK---IT 529 Query: 1219 GLEQFANDLDTFASSMNDFYGRDSEDGKHRMFTYKNLPGHKHRFTNDVQISIGD 1272 +++ +D + + +F S + +R+ LPG K + + I D Sbjct: 530 DIDKVLEHVDKVVTHVLEFMNA-SMNRPYRVVFDTQLPGDKTESEYPIVLDIKD 582 >UniRef50_C7PHT0 Putative uncharacterized protein n=1 Tax=Chitinophaga pinensis DSM 2588 RepID=C7PHT0_CHIPD Length = 770 Score = 195 bits (496), Expect = 9e-48, Method: Composition-based stats. Identities = 101/495 (20%), Positives = 170/495 (34%), Gaps = 85/495 (17%) Query: 1022 PSYPL---NYMEK---PLTRLMLG--RSWWD--LNIKVDVEKYPGAVSEEGQNVTETISL 1071 P PL N EK LTR G RS D + +PG V + +T +I++ Sbjct: 308 PQNPLILSNTEEKVRYHLTRRFFGKTRSLIDDKHAVSPGARYFPGLVPDTATRITTSITV 367 Query: 1072 YSNP-------TKWFAGNMQSTGLWAPAQKEVTI-----KSNANVPVTVTVALADDLTGR 1119 TGL+ P EV + ++ + V DDL Sbjct: 368 PVQVGTQGLLEPTAIYFRPHPTGLYVPPGTEVKVILQSKDKTQHLKAQIGV-HNDDLADL 426 Query: 1120 EKHEVALNR-PPRVTKTYSLDASGTVKFKVPYGGLIYIKGNSSTN-ESASFTFTGVVKAP 1177 + L R + +T+SLD T P+GGL+ + + +T + + T TGVVKAP Sbjct: 427 TQ----LTRSAENMVRTFSLDN-DTTLIYSPFGGLLQLNVSDTTTLKEITITVTGVVKAP 481 Query: 1178 FYKDGA-----WKNDL-NSPAPLGELESDAFVYTTPKKNLNASNYTGGLEQFANDLDTFA 1231 ++K G W N + N+PAP EL +D + T P + + L QF + Sbjct: 482 YFKLGQTSEASWINSIRNNPAPWAELATDKIILTVPAYRIRQLDNPVKLMQF---WNEVM 538 Query: 1232 SSMNDFYGRDSEDGKHRMFTYKNLPGHKHRFTNDVQISIGDAHSGYP----VMNSSFSPN 1287 + D + H R D ++ G ++ P V + Sbjct: 539 DADADL------------ARISRIRSHPERVVVDQDVAYGYMYTA-PERIIVPDDQSCAL 585 Query: 1288 STTLPTTPLN-DWLIWHEVGHNAAETPLTVPGATEVANNVLALYMQDRYLGKMNRVADDI 1346 N W ++HE+GH + EV N+ +++ ++ L Sbjct: 586 MLDESQVRANGSWGLFHELGHRHQFWGIDFGELQEVTVNLFTMHVYNKVL---------- 635 Query: 1347 TVAPEYLEESNNQAWARGGAGDRLLMYAQLKEWAEKNFDIKKWYPDGTPLPEFYSEREGM 1406 + + + ++ ++ + + N +KW D Y E Sbjct: 636 ----------HKGIYNHEEIASKEIVLKKINNYLQNNPSFEKWGQDPFLALCMYIELIQQ 685 Query: 1407 KGWNLFQLMHRKARGDEVSNDKFGGKNYCAESNGNAADTLMLCASWVAQTDLSEFFKKWN 1466 GW + K R K +S + + L V +T+L++FF W Sbjct: 686 FGWQSIEDTFTKYRAMP--------KEQYPQSQEDKRNLWFLTICDVTKTNLTQFFDIWK 737 Query: 1467 PGANAYQLPGASEMS 1481 + + S Sbjct: 738 VPVSDHVKEKVSTYP 752 >UniRef50_B6A9M3 Putative uncharacterized protein n=1 Tax=Cryptosporidium muris RN66 RepID=B6A9M3_9CRYT Length = 978 Score = 189 bits (480), Expect = 7e-46, Method: Composition-based stats. Identities = 85/459 (18%), Positives = 144/459 (31%), Gaps = 114/459 (24%) Query: 1079 FAGNMQSTGLWAPAQKEV-----------------TIKSNANVPVTV--TVALADDLTGR 1119 + Q+ GL+ K + I + + VT+ + D+ Sbjct: 503 ISNGWQTLGLYIIPNKTLELFFFKDKIINKYFNEDVIIHSKSNYVTIRVQIGCHTDILRT 562 Query: 1120 EKHEVALNRPPRVTKT--YSLD---ASGTVKFKVPYGGLIYIKGNSS-TNESASFTFTGV 1173 + L R P + K+ + +D + + K PYGGL+Y + + G Sbjct: 563 DSDSSPLQRLPIIVKSYIWKIDLNNSRNNISIKSPYGGLLYFELMDRWVKDKFQSNILGY 622 Query: 1174 V---------KAPFYKDG--------------------------AWKNDLNS-PAPLGEL 1197 V APF+ WK+ +N+ AP GEL Sbjct: 623 VYTNSSESCETAPFFVSSKINKDVEFPQVNLGETGQIICTSSINEWKDIINTKSAPWGEL 682 Query: 1198 ESDAFVYTTPKKNLNASNYTGGLEQFANDLDTFASSMNDFYGRDSEDGKHRMFTYKNLPG 1257 + T P L + ++ + D ++ D Sbjct: 683 HGLRVIITLPLSTLK---FIENPQEIVDFWDNVIQIQDELLNEGLNDR------------ 727 Query: 1258 HKHRFTNDVQISIGDAHSGYPVMNS----SFSPNSTTLPTTPL-NDWLIWHEVGHNAAET 1312 K R D+QIS G HSGYP+M +P + DW ++HE+GHN + Sbjct: 728 -KERIVCDIQISDGYMHSGYPIMTHMDMCQRHGKLVNIPEIMIEGDWGLYHEIGHNRQKP 786 Query: 1313 PLTVPGATEVANNVLALYMQDRYLG-----KMNRVADDITVAPEYLEESNNQAWARGGAG 1367 T G EV N+ LY ++ +++ + D + EYL + + G Sbjct: 787 QWTFAGTEEVTVNIFTLYTFNKLHAYLKPFQIDFINDQKSKVLEYLGSVKDGNFPSGELF 846 Query: 1368 DRLLMYAQLKEWAEKNFDIKKWYPDGTPLPEFYSEREGMKGWNLFQLMHRKARGDEVSND 1427 + + W D Y GWN + + E+ Sbjct: 847 NSI------------------WKADPGIALYTYLVIIIYYGWNSIKKVFELYDQCELPE- 887 Query: 1428 KFGGKNYCAESNGNAADTLMLCASWVAQTDLSEFFKKWN 1466 + + +L S DL +FK+W+ Sbjct: 888 --------TIQDQDKIQIWILLLSLTTNCDLRPYFKQWD 918 >UniRef50_C5LZE4 Putative uncharacterized protein n=1 Tax=Perkinsus marinus ATCC 50983 RepID=C5LZE4_9ALVE Length = 1326 Score = 185 bits (469), Expect = 1e-44, Method: Composition-based stats. Identities = 90/540 (16%), Positives = 172/540 (31%), Gaps = 126/540 (23%) Query: 1041 SWWDLNIKVDVEKYPGAVSE-------------EGQNVTETISL------YSNPTKWFAG 1081 S+ D+ + + + +PG G+ V T +S+ T Sbjct: 312 SFNDV-LALAGKDFPGVCPPMGQDALSRSQGIINGEEVKITFEAIAKDTPHSSVTGRSFD 370 Query: 1082 NMQSTGLWAPAQKEVTIKS--------------NANVPVTVTVALADDLTGREKH----- 1122 STG++A + + ++ P+++ L DD R + Sbjct: 371 PWISTGVYARPGEPIRVRLESLIRAGNAQGAVKGEGSPLSMEEVLTDDFGFRVRIGCHKD 430 Query: 1123 ----EVALNRPPRVT-KTYSLDASGTVKFK--VPYGGLIYIK------GNSSTNESASFT 1169 + R PR++ + + + ++ + +GGL+Y++ S + + Sbjct: 431 DNTKHDSWKRWPRISAVSKDILSGRNLEVELVSVFGGLVYLERIRENGVEPSKAQGINLN 490 Query: 1170 F----------TGVVKAPFYKDGAW-------KNDLNSP------APLGELESDAFVYTT 1206 G V ++ +G W +N N P P GE+++ + + Sbjct: 491 VDSILMVCARVQGGVPTLWWSNGVWMLGGHPVENLKNLPRAGEAYPPWGEIQAKHVILSL 550 Query: 1207 PKKNLNASNYTGGLEQFANDLDTFASSMNDFYGRDSEDGKHRMFTYKNLPGHKHRFTNDV 1266 P L + ++ D R D R + + RF DV Sbjct: 551 PMHLLLPHALPSSGGSPRDGWADTIAAFWDRVIRSHCDLACRPVCGEE----RFRFVFDV 606 Query: 1267 QISIGDAHSGYPVM----------NSSFSPNSTTLPTTPL-----NDWLIWHEVGHNAAE 1311 +IS G HSGYP+M + + + DW ++HE+GH+ Sbjct: 607 RISAGWMHSGYPIMAHENPTAEDACAVWGEGNLRGARQGGKLLLEGDWGLFHELGHHFQR 666 Query: 1312 TPLTVPGATEVANNVLALYMQDRYLGK------MNRVADDITVAPEYLEESNNQAWARGG 1365 T EV N+ ++Y +G+ M V + A +++ E + GG Sbjct: 667 RRWTYKACGEVTVNLFSMYSMITIIGRKIPTRDMENVREGHEKAEKFITERIQ---SEGG 723 Query: 1366 AGDRLLMYAQLKEWAEKNFDIKKWYPDGTPLPEFYSEREGMKGWNLFQLMHRKARGDEVS 1425 G + + + +W P Y + GW+L + + R D Sbjct: 724 GGAIYSAAHNRRPITD---ALDQWSPWVGL--TMYVDIIETFGWDLLKGVLRSYEQDGTF 778 Query: 1426 NDKFGGKNYCAESNGNAADTLMLCASWVAQTDLSEFFKKWNPGANAYQLPGASEMSFEGG 1485 ++ + + S LS + + W +M EGG Sbjct: 779 GEEPDPTEWWS------------RVSRACGKGLSMYCRIWG------VPIDLDKMDREGG 820 >UniRef50_D2VUE2 Predicted protein n=1 Tax=Naegleria gruberi RepID=D2VUE2_NAEGR Length = 808 Score = 176 bits (446), Expect = 5e-42, Method: Composition-based stats. Identities = 58/297 (19%), Positives = 94/297 (31%), Gaps = 59/297 (19%) Query: 1181 DGAWKNDL-NSPAPLGELESDAFVYTTPKKNLNASNYTGGLEQFANDLDTFASSMNDFYG 1239 D WKN + N P P E+ESD FV+ +Y L + + + + + +Y Sbjct: 349 DSDWKNTIRNYPGPWVEVESDYFVFNVES------SYARNLTELTSVTNYWKKVLELYY- 401 Query: 1240 RDSEDGKHRMFTYKNLPGHKHRFTNDVQISIGDAHSGYPVMNSSF---------SPNSTT 1290 + + + +K R D+ ISIG HSGYP+M +P + Sbjct: 402 ---------ELSQRPIRDYKERMQIDIDISIGYMHSGYPIMAFQDQNEGTIKAVNPANMA 452 Query: 1291 LPTTPLNDWLIWHEVGHNAAETPLTVPGATEVANNVLALYMQDRYLGKMNRVADDITVAP 1350 L + W +HE+GHN + T A EV N+ +LY+++ + N Sbjct: 453 LASKTPGRWGHYHELGHNFQVSDWTYSQAVEVTCNIFSLYLEENFPDSANTYGKSFNPPR 512 Query: 1351 EYLEESNNQAWARGGAGDRLLMYAQLKEWAEKNFDIKKWYPDGTPLPEFYSEREGMKGWN 1410 G W G FY + GW Sbjct: 513 GLETTFKGNTQDYKG----------------------DWTSAG---LLFYLDLIQAFGWV 547 Query: 1411 LFQLMHRKARGDEVSNDKFGGKNYCAESNGNAADTLMLCASWVAQTDLSEFFKKWNP 1467 + + S D ++ S + +L F W+ Sbjct: 548 SIRKAFAEYYTAASGT--------LPTSEDEKRDQWVVRYSKIVGRNLGPFCDAWSF 596 >UniRef50_UPI0001B7B86F Similar to experimental autoimmune prostatitis antigen 2. n=1 Tax=Rattus norvegicus RepID=UPI0001B7B86F Length = 864 Score = 164 bits (415), Expect = 2e-38, Method: Composition-based stats. Identities = 71/316 (22%), Positives = 105/316 (33%), Gaps = 56/316 (17%) Query: 1159 NSSTNESASFTFTGVVKAPFYKDG-----AWKNDL-NSPAPLGELESDAFVYTTPKKNLN 1212 N S + + V AP + G WKN + +S AP GEL +D + T P NL Sbjct: 561 NKSEPDQQAAHINRAVPAPHFGLGKTTQEEWKNLIEHSKAPWGELATDNIILTIPTVNLK 620 Query: 1213 ASNYTGGLEQFANDLDTFASSMNDFYGRDSEDGKHRMFTYKNLPGHKHRFTNDVQISIGD 1272 L Q D ++ R R D QIS G Sbjct: 621 VLQDPYPLLQL---WDKIVKAVAKLAARPFP------------FQRPERIVLDKQISFGF 665 Query: 1273 AHSGYPVMNSSFSPNS--TTLPTTPLNDWLIWHEVGHNAAETPLTV-PGATEVANNVLAL 1329 HSGYP+M W + HE+GHN + T P TE N+ ++ Sbjct: 666 LHSGYPIMGLIIIVEGIINEFKIRSHGVWGVTHELGHNHQKPGWTFRPHTTEALCNLWSI 725 Query: 1330 YMQDRYLGKMNRVADDITVAPEYLEESNNQAWARGGAGDRLLMYAQLKEWAEKNFDIKKW 1389 Y+ + L + R ++ PE ++ +G A L W W Sbjct: 726 YVHETVLN-IPRDQAHPSLNPELRKQRIKDHLNKG---------APLSNWIV-------W 768 Query: 1390 YPDGTPLPEFYSEREGMKGWNLFQLMHRKARGDEVSNDKFGGKNYCAESNGNAADTLMLC 1449 E Y + + GW F + ++N + + + Sbjct: 769 -----TALETYLQLQEGFGWEPFIQLFANY----------QTLTGLPQNNEDKMNLWVKK 813 Query: 1450 ASWVAQTDLSEFFKKW 1465 S V Q +L+ FFK W Sbjct: 814 FSEVVQKNLAPFFKAW 829 >UniRef50_A9VC57 Predicted protein n=1 Tax=Monosiga brevicollis RepID=A9VC57_MONBE Length = 1025 Score = 162 bits (409), Expect = 1e-37, Method: Composition-based stats. Identities = 68/332 (20%), Positives = 101/332 (30%), Gaps = 48/332 (14%) Query: 1033 LTRLMLGRSWWDLNI-----KVDVEKYPGAVSEEGQNVTETISLYSNPTKWFAGNMQSTG 1087 L L+L +SW L D ++PG + +A Sbjct: 689 LEALLLSQSWQQLPCHAVPCSFDAAQFPGCTKPTSASTDVLHEGLVIEPNGWAFLRA--- 745 Query: 1088 LWAPAQKEVTIKSNANVPVTVTVALADDLTGREKHE--VALNRPPRVTKTYSLDASGTVK 1145 W + + N V+ L E R P ++ + + T Sbjct: 746 -WVNGGEPFRVLVNDVAADAVS--LRIGCHADELWHCSGTWKRGPAISGVWPVTPDVTRS 802 Query: 1146 FKVPYGGLIYIKGNSSTNESASFTFTGVVKAPFY-----------KDGAWKNDLNSPAPL 1194 P+GGL+Y++ + +A V KA + S P Sbjct: 803 LAHPWGGLLYLQNRTDAPLTARVYLEQVHKASCVEMHPATATVASVSADLQALGTSEVPW 862 Query: 1195 GELESDAFVYTTPKKNLNASNYTGGLEQFANDLDTFASSMNDFYGRDSEDGKHRMFTYKN 1254 EL + T P + + + D + D Sbjct: 863 CELAGANIILTVPTPAAQRLGH--RIPRLLRFWDDVLRAHRDLRRLGPLVR--------- 911 Query: 1255 LPGHKHRFTNDVQISIGDAHSGYPVMN--------SSFSPNSTTLPTT-PLNDWLIWHEV 1305 + R DVQI+ G HSGYPVM P + L +W I+HE Sbjct: 912 ----RERIVFDVQIAAGYMHSGYPVMAHLDQIEGAVDGGPTALQLERLLREGNWGIFHEF 967 Query: 1306 GHNAAETPLTVPGATEVANNVLALYMQDRYLG 1337 GHN E+ T GA EV N+ L DR +G Sbjct: 968 GHNLQESVWTFEGAGEVTVNLFTLNAMDRVVG 999 >UniRef50_A3FQI6 Putative uncharacterized protein n=3 Tax=Cryptosporidium RepID=A3FQI6_CRYPV Length = 1025 Score = 158 bits (398), Expect = 2e-36, Method: Composition-based stats. Identities = 89/457 (19%), Positives = 149/457 (32%), Gaps = 110/457 (24%) Query: 1083 MQSTGLWAPAQKEVTIK-------SNANVPVTVTVALADDLTGREKHEVALNRPPRVTKT 1135 Q+TG++ + + + S + + V D R K E + R P + K Sbjct: 533 WQNTGVYIKHKSFLKAEFIPINKYSLTPIKSMLHVGSHSDFL-RYKEEKIMRRVPLIKKI 591 Query: 1136 YSLDASGTVKFK--VPYGGLIYIK----------------GNSSTNESASFTFTG----- 1172 Y+ S T P G++Y + NS+ + + F Sbjct: 592 YNWSISETNYITIESPCEGIVYFEYIPNYDHTNTKGSLKRPNSNESHGSLIGFIKFIPLI 651 Query: 1173 ---VVKAPFYKDGA-------------------WKNDLNS---------PAPLGELESDA 1201 VV P Y W + PA EL Sbjct: 652 GKEVVTTPIYTIDKLQLDTCSFDNRTVITDKHLWARIFSEANRKFSNLLPA-WTELHGKK 710 Query: 1202 FVYTTPKKNLNASNYTGGLEQFANDLDTFASSMNDFYGRDSEDGKHRMFTYKNLPGHKHR 1261 + T P + L + + ++ D + N+ Y N K R Sbjct: 711 IILTIPSRILYSIDCL-IVDDLLEFWDKVIDTQNELY--------------FNYKWTKER 755 Query: 1262 FTNDVQISIGDAHSGYPVMNSSF-------SPNSTTLPTTPL-NDWLIWHEVGHNAAETP 1313 D+QIS G HSGYP+ S L T +W I+HE+GHN + Sbjct: 756 IVCDIQISDGYMHSGYPIATHLDIVEEQINSGGILDLKTLKDEGNWGIYHEIGHNRQSSY 815 Query: 1314 LTVPGATEVANNVLALYMQDRYLGK-----MNRVADDITVAPEYLEESNNQAWARGGAGD 1368 T G EV N+ +Y + + ++ + D I +A EYL + N+++ + Sbjct: 816 WTFNGTEEVTVNLFTMYSYYKLHPRLYPFNISYIKDQINLAVEYLIDMNSESKSP----- 870 Query: 1369 RLLMYAQLKEWAEKNFDIKKWYPDGTPLPEFYSEREGMKGWNLFQLMHRKARGDEVSNDK 1428 E E+ F +KW + Y + GWN + + + +++ Sbjct: 871 ---------ENMERGFR-EKWMANHGIAFCNYLILINLFGWNTLKTVFQVYDQLSEVDEE 920 Query: 1429 FGGKNYCAESNGNAADTLMLCASWVAQTDLSEFFKKW 1465 + SN ++ S V +L FF +W Sbjct: 921 LVQTH----SNQQKMAMWIIIFSSVVGFNLKYFFTQW 953 >UniRef50_UPI0001692BE8 S-layer domain protein n=1 Tax=Paenibacillus larvae subsp. larvae BRL-230010 RepID=UPI0001692BE8 Length = 591 Score = 149 bits (375), Expect = 1e-33, Method: Composition-based stats. Identities = 94/475 (19%), Positives = 154/475 (32%), Gaps = 86/475 (18%) Query: 1033 LTRLMLGRSWWDLNIKVDVEKYPGAVSEEGQNVTETISLYSNP--------TKWFAGNMQ 1084 L+ LMLG + + + E G Q +T + + + N Q Sbjct: 13 LSALMLGSTVATVPVYAQNEPIEGKTVASKQEMTMKLEQTGSIYDIRNRLKVTFGPSNRQ 72 Query: 1085 STGLWAPAQKEVTIKSNANVPVTVTVALADDLTGREKHEVALNRPPRV--TKTYSLDASG 1142 STG++ + + I + N +V T + H + R + K ++ G Sbjct: 73 STGVYKNSNDVIEIYVDPNTDQSVMPQYVISPTVLKSH-TDVGRDSGIPLQKGWNFIREG 131 Query: 1143 TVKFKVPYGGLIYIKGNS-STNESASFTFTGVVKAPFYKDGA-----WKNDL--NSPAPL 1194 G+I++ S T + A+ T +G + P + G W+ N AP Sbjct: 132 A--------GIIHLINESGPTQKPATVTISGGQELPRFILGKHTDADWEKMKLSNPNAPG 183 Query: 1195 GELESDAFVYTTPKKNLNASNYTGGLEQFANDLDTFASSMNDFYGRDSEDGKHRMFTYKN 1254 EL S + T + + + D N G DS H + Sbjct: 184 FELVSKHVIIT---GGNKSMSLVHSPNELMKAHDEAIEEENRVAGLDSSKDVH-----QP 235 Query: 1255 LPGHKHRFTNDVQISIGDA-----HSGYPVMNSSFSPNSTTLPTTPLNDWLIWHEVGHNA 1309 P H D S G H+GY + + W WHEVGH Sbjct: 236 APMKHHMREKD---SGGWMYAWLQHTGYVSNAMKYI---LNPDIFKRDGWGPWHEVGHTH 289 Query: 1310 AETPLTVPGATEVANNVLALYMQDRYLGKMNRVADDITV--APEYLEESNNQAWARGGAG 1367 L G EV NN+ ++ +Q RY G +R+ +D T YLE++N Sbjct: 290 QMNILNWSGLGEVTNNIYSMSVQ-RYFGNHSRLEEDKTYNKVFAYLEQTNKDYNKINDLF 348 Query: 1368 DRLLMYAQLKEWAEKNFDIKKWYPDGTPLPEFYSEREGMKGWNLFQLMHRKARGDEVSND 1427 +L M+ QL + G + + +H+ R Sbjct: 349 VKLAMFCQL---------------------------DLAYGKDFYPSLHKAYREL----- 376 Query: 1428 KFGGKNYCAESNGNAADTLMLCASWVAQTDLSEFFKKWNPGANAYQLPGASEMSF 1482 N + S+ + AS A +L+ FF+KW + + Sbjct: 377 -----NEWSGSDAVKQQRFIRLASRTANRNLTPFFEKWGLMVTQETKQQLASLPH 426 >UniRef50_B7INX6 Wall-associated protein n=62 Tax=Bacillus cereus group RepID=B7INX6_BACC2 Length = 1476 Score = 148 bits (373), Expect = 2e-33, Method: Composition-based stats. Identities = 75/459 (16%), Positives = 135/459 (29%), Gaps = 92/459 (20%) Query: 1062 GQNVTETISLYSNPTKWFAGNMQSTGLWAPAQKEVTI--KSNANVPVTVTVALADDLTGR 1119 + ++ + TG++A +E+ I K ++ + D+ + Sbjct: 59 PGKGSVEEEQKRLKVRYVLSTNEPTGIYAGPNEEIKIEIKGTQSIKAFIGTKSYDE---K 115 Query: 1120 EKHEVALNRPPRVTKTYSLDASGTVKFKVPYGGLIYIKGNSSTNESASFTFTGVVKAPFY 1179 E L G GG++Y ++ E + G P + Sbjct: 116 GFEEFELK-------------PGENNISSSRGGILYFYNMNNDGEVTASVIHGGSHFPLF 162 Query: 1180 KDGA-----W-KNDLNSPAPLG-ELESDAFVYTTPKKNLNASNYTGGLEQFANDLDTFAS 1232 G W P EL+++ + T + + + D Sbjct: 163 VLGKHTKKDWDAMLKKYKNPYAVELKAERSLITASPEAVANYMGETDPVELMRLHDKIIR 222 Query: 1233 SMNDFYGRDSEDGKHRMFTYKNLPGHKHRFTNDVQIS------IGDAHSGYPVMNSSFSP 1286 N G + P H +F + H+GY Sbjct: 223 FENSVAGLSEDG-----IGVSKAPNHYIQFVEKRKPDKDDWMFATHYHTGY---VPETMD 274 Query: 1287 NSTTLPTTPLNDWLIWHEVGHNAAETPLTVPGATEVANNVLALYMQDRYLGKMNRVADDI 1346 + + W WHEVGH + P G EV N+ +L +Q R LG + + +D Sbjct: 275 RVLNIKRLQGDGWGPWHEVGHLHQQAPWFWSGVGEVTVNIYSLSVQ-RMLGNKSSLEEDG 333 Query: 1347 TVAPEYLEESNNQAWARGGAGDRLLMYAQLKEWAEKNFDIKKWYPDGTPLPEFYSEREGM 1406 + N A + ++L+M+ QL + Sbjct: 334 HYKKAFAYLDNPDAQKKMEEFEKLVMFWQL---------------------------DLA 366 Query: 1407 KGWNLFQLMHRKARGDEVSNDKFGGKNYCAESNGNAADTLMLCASWVAQTDLSEFFKKWN 1466 G + + +H+ R S S+ + + AS A+ +L FF+KW Sbjct: 367 YGEHFYPNLHQMYRLLPESEM--------PASDEDKKQMFIYMASKAAKQNLVPFFEKWG 418 Query: 1467 PGANAYQLPGASEMSFEGGVSQSAYNTLASLDLPKPEQG 1505 G N + +L+LPK E+ Sbjct: 419 LGPN-----------------DEVRGKIENLNLPKLEKE 440 >UniRef50_B8B8E6 Putative uncharacterized protein n=5 Tax=cellular organisms RepID=B8B8E6_ORYSI Length = 753 Score = 142 bits (357), Expect = 1e-31, Method: Composition-based stats. Identities = 83/174 (47%), Positives = 101/174 (58%), Gaps = 16/174 (9%) Query: 538 NVTRDTATFNLPFISLGQVGEGKLMVIGNPHYNSILRCPNGYSWNGGVNKDGQCTLNSDP 597 +VTRDTATFNLPFISLGQVGEGKLMVIGNPHYNSILRCPNGYSWNGGVNKDGQCTLNSDP Sbjct: 479 SVTRDTATFNLPFISLGQVGEGKLMVIGNPHYNSILRCPNGYSWNGGVNKDGQCTLNSDP 538 Query: 598 DDMKNFMENVLRYLSDDKWKPDAKASMTVGTNLDTVYFKRHGQVTGNSAAFDFHPDFAGI 657 DDMKNFMEN L++ D + + + V + G+ + A Sbjct: 539 DDMKNFMENTLKWEIVDMGSLNGTFVNSRAVHHPNVGSRHWGEPA----------ELADG 588 Query: 658 SVEHLSSYGDLDPQEMPLLILNGFEYVTQVGNDPYAIPLRADTSKPKLTQQDVT 711 + L + L Q + L +G + P+ S KL +D++ Sbjct: 589 DIITLGTSSKLSVQ----ISLQNQRVPAGIGMA--SDPMVGRRSGKKLAMEDIS 636 >UniRef50_A9VG09 S-layer domain protein n=29 Tax=Bacillus cereus group RepID=A9VG09_BACWK Length = 869 Score = 141 bits (356), Expect = 2e-31, Method: Composition-based stats. Identities = 74/433 (17%), Positives = 136/433 (31%), Gaps = 64/433 (14%) Query: 1062 GQNVTETISLYSNPTKWFAGNMQSTGLWAPAQKEVTIKSNANVPVTVTVALADDLTGREK 1121 + + TGL+A +++TI N + G Sbjct: 63 PGKGDVEVLKQQERKSMAFSPYEPTGLYAKPNEQITINVEGNQNIQA-------YIGTYS 115 Query: 1122 HEVALNRPPRVTKTYSLDASGTVKFKVPYGGLIYIKGNSSTNESASFTFTGVVKAPFYKD 1181 ++ + ++ K+++L G + P GG+IY + TG PF++ Sbjct: 116 YDASWREDSKI-KSFTLK-PGINTIQSPNGGMIYFYNKQQGGTIRTTVTTGGTTTPFFEL 173 Query: 1182 GAW--KNDLN----SP-APLGELESDAFVYTTPKKNLNA--SNYTGGLEQFANDLDTFAS 1232 G ++ +N P A EL+ + + T + Q +D Sbjct: 174 GKHTKQDLINMLDQYPNAHAVELKGERVLITASPARVKKYLLGSNTDPVQLLKKMDEATR 233 Query: 1233 SMNDFYGRDSEDGKHRMFTYKNLPGHKHRFTNDVQISIGDAHSGYPVMNSSFSPNSTTLP 1292 + G E Y + S A+ G + + Sbjct: 234 IQDKVAGLSEEQVDKHYIHYVEENHSPDYYMY--ATSYRTAYVGDAIQ------YVLNIN 285 Query: 1293 TTPLNDWLIWHEVGHNAAETPLTVPGATEVANNVLALYMQDRYLGK---MNRVADDITVA 1349 + W WHE GH ++P TEV NN+ +L ++ + + T A Sbjct: 286 KFIKDGWGPWHEAGHLRQQSPWKFYNMTEVQNNIYSLSVEKAFTSNQSFRLQQEGAYTKA 345 Query: 1350 PEYLEESNNQAWARGGAGDRLLMYAQLKEWAEKNFDIKKWYPDGTPLPEFYSEREGMKGW 1409 +YLE+ N +L+M QL + G Sbjct: 346 FQYLEQPNKNYDEVSDVFVKLVMLWQL---------------------------QLAYGE 378 Query: 1410 NLFQLMHRKARGDEVSNDKFGGKNYCAESNGNAADTLMLCASWVAQTDLSEFFKKWNPGA 1469 + + +H+ R S +++ N M+ AS VA+ +L FF+KW Sbjct: 379 DFYPKLHQLYRDMSSSE--------LPQTDENKKQLFMISASKVAKQNLIPFFEKWGLRP 430 Query: 1470 NAYQLPGASEMSF 1482 N + + + + Sbjct: 431 NNDTIQKVAALGY 443 >UniRef50_C2U768 S-layer domain protein n=4 Tax=Bacillus cereus RepID=C2U768_BACCE Length = 1064 Score = 137 bits (344), Expect = 4e-30, Method: Composition-based stats. Identities = 78/442 (17%), Positives = 139/442 (31%), Gaps = 85/442 (19%) Query: 1063 QNVTETISLYSNPTKWFAGNMQSTGLWAPAQKEVTIKSNANVPVTVTVALADDLTGREKH 1122 ++ + TGL+A +++TIK + G + Sbjct: 53 GKGDVNQIKDKERRQFSFSPYEPTGLYAGPNEKITIKVEGTQNIKA-------YIGTYSY 105 Query: 1123 EVALNRPPRVTKTYSLDASGTVKFKVPYGGLIYIKGNSSTNESASFTF-TGVVKAPFYKD 1181 + A N+ + K+++L+ G + P GG+IY N S TG V PF++ Sbjct: 106 DGAWNQD-NLVKSFTLN-PGENTIESPNGGMIYFY-NQQNGGSVQAEVKTGGVPVPFFEL 162 Query: 1182 GAWK-----NDLNS--PAPLGELESDAFVYTTPKKNLNA--SNYTGGLEQFANDLDTFAS 1232 G N L++ A EL+ + + T + + + +D Sbjct: 163 GKHTKQDLINMLDTYPNAHAVELKGERSLITASPERVKKYLIGSNTDPVELLKKIDESIR 222 Query: 1233 SMNDFYGRDSEDGKHRMFTYKNLPGHKHRFTNDVQISIGDAHSGYPVMNSSFSPNST--- 1289 + G E+ H F D HS Y + ++ Sbjct: 223 LEDRVAGLSEEEAD----------KHYVHFVEDN-------HSSYYMYAYTYRTAYVKDA 265 Query: 1290 -----TLPTTPLNDWLIWHEVGHNAAETPLTVPGATEVANNVLALYMQDRYLGKMNRVAD 1344 + + W WHE+GH P T G EV NN+ ++ +Q R G +R+ Sbjct: 266 IQVVLDINQFTKDGWGPWHEMGHQRQPNPWTWNGLGEVTNNIYSMSVQ-RAYGLPSRLEK 324 Query: 1345 DITVAPEYLEESNNQAWAR----GGAGDRLLMYAQLKEWAEKNFDIKKWYPDGTPLPEFY 1400 D + + Q+ +L+M+ QL + F Sbjct: 325 DGVYQKAFTYLNKPQSEKDYNKIDDVFVKLVMFWQLDLAFREEF---------------- 368 Query: 1401 SEREGMKGWNLFQLMHRKARGDEVSNDKFGGKNYCAESNGNAADTLMLCASWVAQTDLSE 1460 + +H+ R K ++N + S VAQ +L Sbjct: 369 -----------YPKLHQLYRAIP--------KEELPKTNDEKIQNFIYNTSKVAQKNLLP 409 Query: 1461 FFKKWNPGANAYQLPGASEMSF 1482 FF +W A +++ Sbjct: 410 FFDQWGLIATPEIRQKIESLNY 431 >UniRef50_A1JU21 Putative exported protein n=4 Tax=Yersinia RepID=A1JU21_YERE8 Length = 808 Score = 137 bits (344), Expect = 4e-30, Method: Composition-based stats. Identities = 76/411 (18%), Positives = 134/411 (32%), Gaps = 69/411 (16%) Query: 1079 FAGNMQSTGLWAPAQKEVTIKSNANVPVTVTVALADDLTGREKHEVALNRPPRVTKTYSL 1138 F QS+G WA + I+ + + R+ +E R Sbjct: 76 FTHPTQSSGFWANKGDNLRIEYQHQGEDIDALPELWIVPVRKGNEEDFER------QVVK 129 Query: 1139 DASGTVKFKVPYGGLIYIKGNSSTNES---ASFTFTGVVKAPFYKDGA-----WKNDLNS 1190 G + +V G +Y + S + G P + G W+ L Sbjct: 130 LQRGINEIEVENTGPLYFVATNQPGSSEITVNL-LEGGKPMPRFILGENTAEDWQAQLVK 188 Query: 1191 --PAPLGELESDAFVYTTPKKNLNASNYTGGLEQFANDLDTFASSMNDFYGRDSEDGKHR 1248 AP EL + T P ++ E D S + +G ++ R Sbjct: 189 FGDAPFAELVGKRMILTMPIADMR--EKATDPEGVLVLWDRIVSLAEEQFGLSAK----R 242 Query: 1249 MFTYKNLPGHKHRFTNDVQISIGDAHSGYPVMNSSFSPNSTTLPTTPLND--WLIWHEVG 1306 F ++ P +++F + + G + + S+ S + + T L + W WHE+G Sbjct: 243 AFPHRATP-FQYQFVSKPDSTPGYMSAANYWLGSNASGIAEVINTDKLQNEGWGPWHELG 301 Query: 1307 HNAAETPLTVPGATEVANNVLALYMQDRYLG----KMNRVADDITVAPEYLEESNNQAWA 1362 H+ T G TEV N+ +LY+Q G + + +DI V + ++A+ Sbjct: 302 HHYQMPAWTFNGNTEVTVNLTSLYVQRALGGTSRMEADSRWEDIAV----FLKDKSKAYE 357 Query: 1363 RGGAGDRLLMYAQLKEWAEKNFDIKKWYPDGTPLPEFYSEREGMKGWNLFQLMHRKARGD 1422 G L M+ QL + G + +Q + + R Sbjct: 358 DAGVFVSLGMFWQL---------------------------DLAFGKDFYQRLGDRYRTL 390 Query: 1423 EVSNDKFGGKNYCAESNGNAADTLMLCASWVAQTDLSEFFKKWNPGANAYQ 1473 + ++N +L AS V+ L FF +W Sbjct: 391 TPAEQ--------PKNNDEKKQRFLLEASRVSGMSLKSFFHQWGIKPTKET 433 >UniRef50_C3BVL2 S-layer domain protein n=1 Tax=Bacillus pseudomycoides DSM 12442 RepID=C3BVL2_9BACI Length = 823 Score = 132 bits (332), Expect = 1e-28, Method: Composition-based stats. Identities = 78/444 (17%), Positives = 138/444 (31%), Gaps = 70/444 (15%) Query: 1084 QSTGLWAPAQKEVTIKSNANVPVTVTVALADDLTGREKHEVALNRPPRVTKTYSLDASGT 1143 + TGL+A +++TI + G ++ A N+ + K+++L G Sbjct: 51 EPTGLYASPNEKITILVEGTQNIQA-------YIGTFSYDAAWNQDSLI-KSFTLK-PGE 101 Query: 1144 VKFKVPYGGLIYIKGNSSTNESASFTFTGVVKAPFYKDGA-----WKNDLNS--PAPLGE 1196 + P GG+IY + G PF++ G N L++ A E Sbjct: 102 NTIESPNGGMIYFYNPQQGGTVRAEITAGGSPTPFFELGKHTQQDLVNMLDTYPNAHAVE 161 Query: 1197 LESDAFVYTTPKKNLNA--SNYTGGLEQFANDLDTFASSMNDFYGRDSEDGKHRMFTYKN 1254 L+ + + T + + Q +D + G + K Sbjct: 162 LKGERVLITASPERVKKYLIGSNTDPVQLLKKMDESIRIQDRVSG----------LSEKE 211 Query: 1255 LPGHKHRFTNDVQISIGDAHSGYP---VMNSSFSPNSTTLPTTPLNDWLIWHEVGHNAAE 1311 H F D +S Y + + + W WHE GH + Sbjct: 212 ADKHYVHFVEDNHSKDYYMYS-YSSRTAYVGDAIQHVLDVNDFIKDGWGPWHEAGHQRQQ 270 Query: 1312 TPLTVPGATEVANNVLALYMQDRYLGKMNRVADDITVAPEYLEE--SNNQAWARGGAGDR 1369 TP T G EV N+ ++ +Q + K A YL + S + Sbjct: 271 TPWTWDGLGEVTVNIYSMSVQRAFGSKSRLEKGTYEKAFNYLNKPQSEKDYNKIDDLFVK 330 Query: 1370 LLMYAQLKEWAEKNFDIKKWYPDGTPLPEFYSEREGMKGWNLFQLMHRKARGDEVSNDKF 1429 L M+ QL + G + +H++ R Sbjct: 331 LAMFWQL---------------------------DLAFGEEFYPKLHQEYRSLS------ 357 Query: 1430 GGKNYCAESNGNAADTLMLCASWVAQTDLSEFFKKWNPGANAYQLPGASEMSFEGGVSQS 1489 K ++N + S VA+ +L FF KW A ++ + ++ Sbjct: 358 --KEELPKNNEEKIQGFIYNTSKVAKQNLLPFFDKWGLVATPETRQKVEALN-DPIITAP 414 Query: 1490 AYNTLASLDLPKPEQGPETINQVT 1513 + S + + P NQV+ Sbjct: 415 IWEATDSKPIKVKVEEPTITNQVS 438 >UniRef50_A7GU48 S-layer domain protein n=5 Tax=Bacillus cereus group RepID=A7GU48_BACCN Length = 876 Score = 126 bits (316), Expect = 6e-27, Method: Composition-based stats. Identities = 73/395 (18%), Positives = 123/395 (31%), Gaps = 69/395 (17%) Query: 1084 QSTGLWAPAQKEVTIKSNANVPVTVTVALADDLTGREKHEVALNRPPRVTKTYSLDASGT 1143 + TGL+A + + I + G ++ + G Sbjct: 78 EPTGLYAKPNETIIIHVEGKQNIQA-------YIGTYSYDGNPKKF--------YLKPGK 122 Query: 1144 VKFKVPYGGLIYIKGNSSTNESASFTFTGVVKAPFYKDGA-----WKNDLNS--PAPLGE 1196 + P GG++Y + + E+ +G P+++ G W L A E Sbjct: 123 NEISAPKGGMLYFENANLNGETKVTVSSGGTPIPYFELGKHTKEDWDAMLEKFPNAYAVE 182 Query: 1197 LESDAFVYTTPKKNLNASNYTGGLEQFANDLDTFASSMNDFYGRDSEDGKHRMFTYKNLP 1256 L+ + ++T K D + G E Sbjct: 183 LKGERSLFTVTYKAAKQYLGGKDPSPLLRKHDEAIRIQDKVSGVSEE-----YTGVAQAD 237 Query: 1257 GHKHRFTNDVQISIGDAHS-----GYPVMNSSFSPNSTTLPTTPLNDWLIWHEVGHNAAE 1311 H F D G ++ GY S L + W WHE GH + Sbjct: 238 THYVHFVEDFNKKDGWMYATNYRTGY---VSDAMKYVLDLEHFEKDGWGPWHEAGHQRQQ 294 Query: 1312 TPLTVPGATEVANNVLALYMQDRYLGKMNRVADDITVAPEYLEESNNQAWARGGAGDRLL 1371 T G TEV N+ ++ +Q R G +R+ E E N+ + Sbjct: 295 T-WKWSGLTEVTVNIYSMAVQ-RAFGNPSRL--------EAEERYND------------I 332 Query: 1372 MYAQLKEWAEKNFD-IKKWYPDGTPLPEFYSEREGMKGWNLFQLMHRKARGDEVSNDKFG 1430 +K +EKN+D I + + + + G N + +H+ R + G Sbjct: 333 FLYLMKPQSEKNYDQIDNLFVKLGM----FWQLDLAFGDNFYPKLHQMYRLTSLEELGDG 388 Query: 1431 GKNYCAESNGNAADTLMLCASWVAQTDLSEFFKKW 1465 S+ + AS VA DL+ FF+ W Sbjct: 389 -------SDEEKKQLFITMASKVANRDLTPFFEIW 416 >UniRef50_C7MAR4 Putative uncharacterized protein n=1 Tax=Brachybacterium faecium DSM 4810 RepID=C7MAR4_BRAFD Length = 464 Score = 124 bits (310), Expect = 3e-26, Method: Composition-based stats. Identities = 85/441 (19%), Positives = 137/441 (31%), Gaps = 69/441 (15%) Query: 1084 QSTGLWAPAQKEVTIKSNAN---VPVTVTVALADDLTGREKHEVALNRPPRVTKTYSLDA 1140 Q TGL+ P ++ ++ A+ VP V A L + E + L+ Sbjct: 73 QPTGLYLPPDVQLGVEVEADGDDVPSIVIGAPGSQLDPDAEDEDDFVGRTLTPRETELE- 131 Query: 1141 SGTVKFKVPYGGLIYIK-GNSSTNESASFTFTGVVKAPF--YKDGAWKNDL------NSP 1191 G +GG+IY+ + +A+ +F G P + G S Sbjct: 132 PGANLVSDAHGGVIYLSFPGADGAATATVSF-GAAAEPMATFVRGTTAEAEFQAQLDRSS 190 Query: 1192 APLGELESDAFVYTTPKKNLNASNYTGGLEQFANDLDTFASSMNDFYGRDSEDGKHRMFT 1251 AP EL S + T + L + E+ LD + G +E G Sbjct: 191 APFAELVSTHAIVTVERDRLLLFRH-EDHEKLLGILDRVVEIEQETAGYTTEGG-----V 244 Query: 1252 YKNLPGHKHRFT--NDVQISIG-DAHSGYPVMNSSFSPNSTTLPTTPLNDWLIWHEVGHN 1308 P H D +G A +GY T+ L+ W +HE+GH+ Sbjct: 245 ESRPPSGPHHLVGYPDGIEGVGAYATNGYTAYPPPIQSTLLTVSGLTLDGWGPYHELGHH 304 Query: 1309 AAETPLTVPGATEVANNVLALYMQDRYLGKMNRVADDITVAPEYLEESNNQAWARGGAGD 1368 + P+ EV N+ +L +Q + + +V V PE + A AG Sbjct: 305 HQQEPVNPGDVVEVTVNIFSLAVQREFEREYGQVPRMREVDPETGTSHWDTAMEALEAGI 364 Query: 1369 RLLMYAQLKEWAEKNFDIKKWYPDGTPLPEFYSEREGMKGWNLFQLMHRKARGDEVSNDK 1428 R D + P P + + G R E ++ Sbjct: 365 R---------------DYAELGPFEQLAP--FDQLRLQYGDEFAPAWATLVRQQEPLTEE 407 Query: 1429 FGGKNYCAESNGNAADTLMLCASWVAQTDLSEFFKKWNPGANAYQLPGASEMSFEGGVSQ 1488 +N S A DL++F++ W V+ Sbjct: 408 DRWQNVIYS------------TSAAAGDDLADFWQAWGVE-----------------VAD 438 Query: 1489 SAYNTLASLDLPKPEQGPETI 1509 + L L L P P T+ Sbjct: 439 ETRSALGQLGLEPPVVDPSTL 459 >UniRef50_A2DZU5 Putative uncharacterized protein n=1 Tax=Trichomonas vaginalis RepID=A2DZU5_TRIVA Length = 727 Score = 119 bits (298), Expect = 8e-25, Method: Composition-based stats. Identities = 69/318 (21%), Positives = 102/318 (32%), Gaps = 35/318 (11%) Query: 992 GTKC-SADLKKSLVDN-----NMIYGDGSSKAGMMNPSYPLNYMEKPLTRLMLGRSWWDL 1045 C D LV + + P + L +M S D Sbjct: 295 NVSCMPQDTNPELVQLAASALGYLRSHNFDTPEGLCPDISHGVISVLLCEIMAKLSAQDF 354 Query: 1046 NIKVDVEKYPGAVSEEGQNVTETISLYSNPTKWFAGNMQSTGLWAPAQKEVTIKSNANVP 1105 E++P SEE + + STG W A I N P Sbjct: 355 AGHDFSERFPERASEEPSEQ----DVGTVRLTLQNEGWYSTGYWLTAGVVAKITLNEVPP 410 Query: 1106 VTVTVALADDLTGREKHEVALNRPPRVTKTYSLDASGTVKFKVPYGGLIYIKGNSS--TN 1163 V + V + E R P +T T+ LD + P+GG+IYI + S T Sbjct: 411 VPLVVQVGMHAEAIFTKEGPWKRWPMITTTFELD--EVTEIANPFGGMIYIIFDRSVNTP 468 Query: 1164 ESASFTFTGVVKAPFYKDGA---WKNDLNSPAPLGELESDAFVYTTPKKNLNASNYTGGL 1220 + T V P Y A W+ + AP E+E+ + P K + A L Sbjct: 469 ITIDMTIDHVFPYPLYSPEAPDTWEKTKDRGAPWAEVETYYMTFVAPSKVVRACPN---L 525 Query: 1221 EQFANDLDTFASSMNDFYGRDSEDGKHRMFTYKNLPGHKHRFTNDVQISIGDAHSGYPVM 1280 E +D S+ +F G + HK R D+++ I GYP++ Sbjct: 526 EANCKMIDGLIESVLEFLG--------------DHSEHKFRCIFDIEL-IDMPVCGYPII 570 Query: 1281 NSSFSPNSTTLPTTPLND 1298 + S T P Sbjct: 571 MHVDNAESFFSDTNPSES 588 >UniRef50_UPI0001BC7CE9 hypothetical protein BacD2_03774 n=1 Tax=Bacteroides sp. D2 RepID=UPI0001BC7CE9 Length = 814 Score = 119 bits (297), Expect = 1e-24, Method: Composition-based stats. Identities = 87/494 (17%), Positives = 158/494 (31%), Gaps = 101/494 (20%) Query: 1044 DLNIKVDVEKYPGAVSE------EGQNVTETISLYSNPT-----KWFAGNMQSTGLWAPA 1092 D+++ +PG V + Q V ++ S + + STGL+A A Sbjct: 54 DVSLYDKARIFPGLVDTLTEKRVDEQVVNIDMAYRSAKAVNVNLAMTSPAIYSTGLYAGA 113 Query: 1093 QKEVTIKSNANVP-VTVTVALADDLTGREKHEVALNRPPRVTKTYSLDASGTVKFKVPYG 1151 +++T+ + +V +TV + + L R P+V + +L G + + PYG Sbjct: 114 GEKITVMLDDDVKGLTVQIGIHSRDLSSLVGSSYLERDPKVVTSMAL-FKGKNEIRNPYG 172 Query: 1152 GLIYIKGNSSTNES--ASFTFTGVVKAPFYKDG-----AW-KNDLNSPAPLGELESDAFV 1203 G I+IK + +++ G AP Y G W + + P EL Sbjct: 173 GYIWIKRSGDASDTGIVPLKVQGAYLAPDYVVGETEAAEWGEKIKTTTVPWIELRGKQIA 232 Query: 1204 YTTPKKNLN------ASNYTGGLEQFANDLDTFASSMNDFYGRDSEDGKHRMFTYKNLPG 1257 ++ P K + ++ LEQ D + N+FYG D + + P Sbjct: 233 FSVPVKYMKLKLQSEGQSFVTRLEQSLELWDDWVLCYNEFYGLDDAESE-----TFPKPD 287 Query: 1258 HKHRFTNDVQISIGDAH------SGYP--------------VMNSSFSPNSTTLPTTPLN 1297 R D AH S Y ++ + L T+ + Sbjct: 288 FPVRVVMD-------AHLVTERYSYYSNTNLELLQTEELIDMIADPEQVKAGALNTSHVV 340 Query: 1298 DWLIWHEVGHNAAE-----TPLTVPGATEVANNVLALYMQDRYLGKMNRVADDITVAPEY 1352 W+ +G P + + N LY + + D + Sbjct: 341 GWMS---LGLFVQTYWPTPAPNSFKDMYSLMPNFYFLYKHGWWGNQ-----QDAKLFAYK 392 Query: 1353 LEESNNQAWARGGAGDRLLMYAQLKEWAEKNFDIKKWYPDGTPLP--------------- 1397 L + + + L WA+ D K Y D P Sbjct: 393 LFGRDQKVINTTQYNLNADEFENLVSWAKA--DSCKIYSDEAKRPSKSGNDYWPAALTFY 450 Query: 1398 ----EFYSEREGMKGWNLFQLMHRKARGDEVSNDKFGGKNYCAESNGNAADTLMLCASWV 1453 + E G GW F ++R + G+N + + ++ ++ C S Sbjct: 451 SAILSYKQEDTGKDGWKYFAYLNRFLSNE--------GQNVSIFNRLSMSEAMLTCLSHY 502 Query: 1454 AQTDLSEFFKKWNP 1467 + D + F ++ Sbjct: 503 FERDFTPLFDRYGI 516 >UniRef50_C2G2A5 Putative uncharacterized protein n=2 Tax=Sphingobacterium spiritivorum RepID=C2G2A5_9SPHI Length = 529 Score = 119 bits (297), Expect = 1e-24, Method: Composition-based stats. Identities = 73/444 (16%), Positives = 138/444 (31%), Gaps = 52/444 (11%) Query: 1086 TGLWAPAQKEVTIKSNANVPVTVTVALADDLTGREKHEVALNRPPR---VTKT-YSLDAS 1141 TG++ P K+ + + + + + + + P+ + K + L Sbjct: 108 TGIYLPVGKQTILVDGISAKNQIKLVIPNWDRHAPQGIDPTK-DPKGWGIEKQEFDLRNG 166 Query: 1142 GTVKFKVPYGGLIYI-----KGNSSTNESASF---TFTGVVKAPFYKDGAWKNDLNSPA- 1192 + +GGL YI K T F G + D W N ++ Sbjct: 167 VNLINIKDFGGLAYITYFSDKPKEQTPIQVHFLNAAVNGYFDISKHNDQDWNNLVDHAVY 226 Query: 1193 PLGELESDAFVYTTPKKNLNASNYTGGLEQFANDLDTFASSMNDF----YGRDSEDGKHR 1248 P+ + P + + Y G++ +N + R E+ Sbjct: 227 PIVDAIGKHIQIAYPTEAIKKYAYGKGVQLISNYDSLVYRQHRILGLIKHNRVPENKILS 286 Query: 1249 MFTYKNLPGHKHRFTNDVQISIGDAHSGYPVMNSSFSPNSTTLPTTPLNDWLIWHEVGHN 1308 Y + G A M P S W HEVGH Sbjct: 287 RVNYNYYMFRDGDGVAYMGDKPGYA------MAMVVDPASV---IKGDPCWGFSHEVGHV 337 Query: 1309 AAETPL-TVPGATEVANNVLALYMQDRYLGK-MNRVADDITVAPEYLEESNNQAWARGGA 1366 P + G EV+NN+ +LY+ + K ++ L G Sbjct: 338 HQTRPYLSWGGLGEVSNNIFSLYVTTSFGNKSRLSEQNNYEKVRNELYGKGKSYLQDGDV 397 Query: 1367 GDRLLMYAQLKEWAEKNFDIKKWYPDGTPLPEFYSEREGMKGWNLFQLMHRKARGDEVSN 1426 +RL+ + QL+ + + P+FY +LF+ +A D N Sbjct: 398 FNRLVPFWQLQLY----------FAGQGVNPDFYP--------DLFEAFRMQAAADLTKN 439 Query: 1427 DKFGGKNYCAESNGNAADT--LMLCASWVAQTDLSEFFKKWN-PGANAYQLPGASEMSFE 1483 + N+ A + + + DL+EFF + +++ + ++E Sbjct: 440 RRSRRSNFMDRGQNPAVYQLNFVKTVCEIGKVDLTEFFDGYGFFYVGKFEMDDYGKYNYE 499 Query: 1484 --GGVSQSAYNTLASLDLPKPEQG 1505 + + +++LPKP+ Sbjct: 500 MTQAMVDECKAAIRNMNLPKPKTD 523 >UniRef50_A2E8P4 Putative uncharacterized protein n=1 Tax=Trichomonas vaginalis RepID=A2E8P4_TRIVA Length = 706 Score = 116 bits (291), Expect = 6e-24, Method: Composition-based stats. Identities = 44/223 (19%), Positives = 75/223 (33%), Gaps = 20/223 (8%) Query: 1049 VDVEKYPGAVSEEGQNVTETISLYSNPTKWFAGNMQSTGLWAPAQKEVTIKSNANVPVTV 1108 D+E +PG E+ I AG STGLW A + T++++ ++ + V Sbjct: 348 PDLELFPGVTDEKTCTKRINIETK-------AGIWHSTGLWLQAGQIATVETSNHLTIQV 400 Query: 1109 TVALADDLTGREKHEVALNRPPRVTKTYSLDASGTVKFKVPYGGLIYIKGNSSTNESASF 1168 E R P V TY++ + +GG +++ S N + S Sbjct: 401 GAQ----TLCLLVKEFPWKRWPSVITTYNISPNTPTAIATQFGGPVFVL--SEKNHNVSV 454 Query: 1169 TFTGVVKAPFY---KDGAWKNDLNSPAPLGELESDAFVYTTPKKNLNASNYTGGLEQFAN 1225 G P++ K W+ S P GE+E+ P + + Q Sbjct: 455 QIEGCALYPYFYNDKPSIWEQTKTSTIPWGEIETKYVCINLPA---RMIHNEQKISQMCK 511 Query: 1226 DLDTFASSMNDFYGRDSEDGKHRMFTYKNLPGHKHRFTNDVQI 1268 +D + +F K R+ + DV Sbjct: 512 RIDQLMQYIREFEAIQYNTFKSRLVFDVEVAKG-EPIVEDVVF 553 >UniRef50_B7HIS0 S-layer domain protein n=19 Tax=Bacillus RepID=B7HIS0_BACC4 Length = 591 Score = 116 bits (290), Expect = 7e-24, Method: Composition-based stats. Identities = 66/423 (15%), Positives = 126/423 (29%), Gaps = 73/423 (17%) Query: 1076 TKWFAGNMQSTGLWAPAQKEVTIKS--NANVPVTVTVALADDLTGREKHEVALNRPPRVT 1133 + N TG++ ++VTI + + D G+E Sbjct: 69 KRSEQKNYMPTGIYVKPNEQVTITVSGTQKIRAIIGTHQYDKEWGKEIDL---------- 118 Query: 1134 KTYSLDASGTVKFKVPYGGLIYIKGNSSTNESASFTFTGVVKAPFYKDGA-----WKNDL 1188 + G+ P GG++ + ST G PF+ G W + Sbjct: 119 ------SPGSNTISSPNGGVLGLDNFQSTGTVKVQVTQGGSPIPFFVLGKHTKADWIAMM 172 Query: 1189 NS--PAPLGELESDAFVYTTPKKNLNASNYTGGLEQFANDLDTFASSMNDFYGRDSEDGK 1246 N+ A +L+S+ V T + + N N D + + G D Sbjct: 173 NNYPNAHAVQLKSERAVLTVTRDSANKYIVNQDPVPLLNKYDEMIRAQDKLAGLSETDPN 232 Query: 1247 HRMFTYKNLPGHKHRFTNDVQISIGDAHSGYPVMNSSFSPNS--TTLPTTPLNDWLIWHE 1304 + + + F + ++ + + + + +TL W HE Sbjct: 233 PLHRSTRRI----WAFVENPNAQSWGMYASWDGAVFTTAGEAIKSTLNVNEFG-WGQMHE 287 Query: 1305 VGHNAAETPLTVP---GATEVANNVLALYMQDRYLGKMN---RVADDITVAPEYLEESNN 1358 GH + P T G EV NN+ +L + + D A YL+++N Sbjct: 288 AGHARQQYPWTWNDLRGMGEVTNNLYSLAAFKKIYPNIPTRLDTEGDYNRAFAYLKQTNK 347 Query: 1359 QAWARGGAGDRLLMYAQLKEWAEKNFDIKKWYPDGTPLPEFYSEREGMKGWNLFQLMHRK 1418 + +L+M QL G + + +H+ Sbjct: 348 EYKNIDDLFVKLVMLWQLHL---------------------------AYGDDFYPNLHKL 380 Query: 1419 ARGDEVSNDKFGGKNYCAESNGNAADTLMLCASWVAQTDLSEFFKKWNPGANAYQLPGAS 1478 R +++ + + S +A+ ++ FF KW A Sbjct: 381 YREIPEDQ--------LPKTDEDKIQEFIYNTSKIAKQNVLPFFDKWGLKATQETRQKVE 432 Query: 1479 EMS 1481 ++ Sbjct: 433 ALN 435 >UniRef50_C3XY93 Putative uncharacterized protein n=1 Tax=Branchiostoma floridae RepID=C3XY93_BRAFL Length = 533 Score = 116 bits (290), Expect = 7e-24, Method: Composition-based stats. Identities = 42/196 (21%), Positives = 74/196 (37%), Gaps = 26/196 (13%) Query: 1049 VDVEKYPGAVSEEGQNVTETISLYSNPTKWFAGNMQSTGLWAPAQKEVTIKS-------N 1101 +E +PG E Q + TI++ S + +TG + PA + +KS Sbjct: 284 PGIESFPGDFESEPQLHSVTITINSMRQE-----RHTTGYYLPAGTFLLVKSCNTNSGAL 338 Query: 1102 ANVPVTVTVALADDLTGREKHEVALNRPPRVTKTYSLDASGTVKFKVPYGGLIYIKGNSS 1161 + + V T R ++ L+R P ++ +L+ + PYGG IY++ Sbjct: 339 SGWKIRVGA-----HTDRLSNQHTLHRWPNISVVTNLNHE--TQLFSPYGGNIYLESPEK 391 Query: 1162 TNESASFTFTGVVKAPFY------KDGAWKNDLNSPAPLGELESDAFVYTTPKKNLNASN 1215 + S T VV+AP++ W+ SP EL ++T P L Sbjct: 392 PSL-LSITLENVVEAPWFDLTKPETVNNWQVSCQSPGLWAELAGRHIIFTLPSTCLKDGF 450 Query: 1216 YTGGLEQFANDLDTFA 1231 + F + + Sbjct: 451 NPTEMLMFWDKIVQVT 466 >UniRef50_Q2U0W8 Predicted protein n=2 Tax=Aspergillus RepID=Q2U0W8_ASPOR Length = 673 Score = 116 bits (289), Expect = 9e-24, Method: Composition-based stats. Identities = 70/440 (15%), Positives = 140/440 (31%), Gaps = 72/440 (16%) Query: 1048 KVDVEKYPG-AVSEEGQNVTETISLYSNPTKWFAGNMQSTGLWAPAQKEVTIKSNANVP- 1105 + D K+P + V+ ++ ++QSTG + + +T+ ++V Sbjct: 237 QTDPSKFPQPRTFKVSPLVSPPDEALRLRQQFHWSDLQSTGFYLNPNEPLTVFVESSVRD 296 Query: 1106 ------VTVTVALADDLTGREKHEVALNRPPRVTKTYSLDASGTVKFKVPYGGLIYIK-- 1157 V AL G+E L P L+ +GG+IYI+ Sbjct: 297 GPKPRLVLGPPALVHPDHGKEHVPAQLVELP------PLENGRNKSVHN-FGGIIYIRYT 349 Query: 1158 --GNSSTNESASFTFTGVVKA-PFYKDG-----AWKNDLN-SPAPLGELESDAFVYT-TP 1207 + + P +++G WK+ L+ + P E + + T Sbjct: 350 HRASDQPPPPVFLRLGDTAEPFPLFREGSTTDAQWKSMLDVTKVPFAEHDGKRVIITGLA 409 Query: 1208 KKNLNASNYTGGLEQFANDLDTFASSMNDFYGRDSEDGKHRMFTYKNLPGHKHRFTNDVQ 1267 K ++ ++ + + + F K+ ++ P + Sbjct: 410 KHAKKYADNGQRQQELLDTYAHIIAIQDRFSAL-----KYNARDPRDRPSLLRPMVVESV 464 Query: 1268 ISIGDAHSGYPVMNSSFSPNSTTLPTTPLNDWLIWHEVGHNAAETP-LTVPGATEVANNV 1326 S + Y + + N W+++HE+GH T + TEV N+ Sbjct: 465 NSGVATATNYRAAIPNRLSDQIYWVPRLRNSWMVFHELGHQRQITRTWSWRAMTEVTVNI 524 Query: 1327 LALY-MQDRYLGKMNRVADDITVAPEYLEESNNQAWARGGAGDRLLMYAQLKEWAEKNFD 1385 +L +++ VA+ + S + + G L+M+ QL+ Sbjct: 525 YSLANLREYKPPGHKNVAEWDNAKQYLTKASKEKDFDSAGFYLSLVMFEQLRV------- 577 Query: 1386 IKKWYPDGTPLPEFYSEREGMKGWNLFQLMHRKARGDEVSNDKFGGKNYCAESNGNAADT 1445 + G + +HR AR + + + Sbjct: 578 --------------------VFGDGFYHELHRDARH-----------TLVVDKDADKKHH 606 Query: 1446 LMLCASWVAQTDLSEFFKKW 1465 M A+ + DL+E+F KW Sbjct: 607 FMTKAAQLTGQDLTEYFTKW 626 >UniRef50_C9KXI1 Putative uncharacterized protein n=1 Tax=Bacteroides finegoldii DSM 17565 RepID=C9KXI1_9BACE Length = 705 Score = 115 bits (288), Expect = 1e-23, Method: Composition-based stats. Identities = 65/285 (22%), Positives = 106/285 (37%), Gaps = 40/285 (14%) Query: 1044 DLNIKVDVEKYPGAV-SEEGQNVTETISLYSN---------PTKWFAGNMQSTGLWAPAQ 1093 D+++ +PG V + + + + TI L N + STGL+A A Sbjct: 60 DVSMYEKARIFPGLVDTAKEERINATIELDLNKQYIDSITLGVSKVPQPIYSTGLYAGAG 119 Query: 1094 KEVTIKS---NANVPVTVTVALADDLTGREKHEVALNRPPRVTKTYSLDASGTVKFKVPY 1150 ++V I + V V + DDLT R P V +L G K P Sbjct: 120 EQVAITVEDNTMGLSVIVG-SHMDDLTELA----PYQRMPLVYVAKAL-FPGKNIIKNPL 173 Query: 1151 GGLIYIKGNSSTNESA--SFTFTGVVKAPFYKDGA-----WKNDL-NSPAPLGELESDAF 1202 GG I+IK + N S S GV K+P + G WK + + P E+ F Sbjct: 174 GGPIWIKKSLGLNASGTCSLKIEGVYKSPDFVIGETDLQDWKRRISETTVPWLEIRGKHF 233 Query: 1203 VYTTPKKNL--NASNYTGGLEQFANDLDT-FASSMNDFYGRDSEDGKHRMFTYKNLPGHK 1259 +T K + N + + L++ + D ++YG + + P Sbjct: 234 AFTVQKDRVLDNLESISSTLQEVGKEWDEAIEEFFFEYYGLKIDKDAEEK---ERAPEFP 290 Query: 1260 HRFTNDVQISIGDAH---SGYPVM---NSSFSPNSTTLPTTPLND 1298 R DVQ+ +G+ + S Y ++ N+ L T + Sbjct: 291 FRVVLDVQV-LGNLYLRNSDYAIVAINNTYMLEEMLNLRTLRTGN 334 >UniRef50_C6IEC9 Coagulation factor 5/8 type n=2 Tax=Bacteroides RepID=C6IEC9_9BACE Length = 705 Score = 115 bits (287), Expect = 1e-23, Method: Composition-based stats. Identities = 89/470 (18%), Positives = 169/470 (35%), Gaps = 59/470 (12%) Query: 1044 DLNIKVDVEKYPGAVSEEGQNVTETISLYSNPTKWFAG----------NMQSTGLWAPAQ 1093 D+++ +PG V N T+ ++ + STGL+A A Sbjct: 58 DVSMYESARVFPGLVDTLVDNTVNTLLALDLSKRYIPAYDLDVQQVPRPIYSTGLYAGAG 117 Query: 1094 KEVTIKSNAN-VPVTVTV-ALADDLTGREKHEVALNRPPRVTKTYSLDASGTVKFKVPYG 1151 + +TI N N + +TV + + DDLT R P VT + L G + P G Sbjct: 118 ELITITINDNTMGLTVIIGSHLDDLTDIS----PYLRLPVVTTSKQL-FPGKNTIRNPLG 172 Query: 1152 GLIYIKGNSSTNESASF--TFTGVVKAPFYKDGA-----WKNDLN-SPAPLGELESDAFV 1203 G+I+I+ + N SA F G ++P + G+ W L + P EL Sbjct: 173 GMIWIEKSKDVNGSADFVMEINGAYRSPDFIVGSTDVTAWVEQLRTTTVPWLELRGRHVA 232 Query: 1204 YTTPKKNLNASNYTGGL--EQFANDLDTFASSMNDFYGRDSEDGKHRMFTYKNLPGHKHR 1261 ++ ++ L + E+ N L+ + +++ +Y P R Sbjct: 233 FSVQRERLLDMINDDPIIAEKMPNTLEAWDNAVETYYYNYYSLQVGAQDFSMRAPDFPER 292 Query: 1262 FTNDV----QISIGDAHSGYPVMNSSFSPN-STTLPTTPL-NDWLIWHEVGHNAAETPLT 1315 DV + I +A G +N+++ N + T N I++ + N + + Sbjct: 293 VVLDVELLDNLYIRNADYGVVALNTNYLLNELASYQTLKSGNSVAIFNALYRNYSFRDIK 352 Query: 1316 VPGATEVANNVLALYMQDRYLGKMNRVADDIT-VAPE---YLEESNNQAWARGGAGDRLL 1371 P +EV++ V A+ + + + + PE + E +A A Sbjct: 353 SPWWSEVSDAVKAIPLYRMAEKGLREDGYPMGPIFPEEGSSIAEQFPKALAYADTDSSRW 412 Query: 1372 MYAQLKEWAEKNFDIKKWYPDGTPLPEFYSEREGMKGWNLFQLMHRKARGDEVSNDKFGG 1431 + +K +++ Y + + + + W + ++R + D++S D Sbjct: 413 FVSDIKS------EVRPTYALASLVQLANYKDDD---WAFYIELNRMIK-DKISIDHSTS 462 Query: 1432 KNYCAESNGNAADTLMLCASWVAQTDLSEFFKKWNPGANAYQLPGASEMS 1481 + LC + + D S FF+ W AS+ Sbjct: 463 TYFFKA----------LCDYF--KEDFSPFFEHWGYSLTDEARSYASKYP 500 >UniRef50_A2F4J7 Immuno-dominant variable surface antigen-like n=1 Tax=Trichomonas vaginalis RepID=A2F4J7_TRIVA Length = 504 Score = 115 bits (287), Expect = 1e-23, Method: Composition-based stats. Identities = 53/297 (17%), Positives = 97/297 (32%), Gaps = 32/297 (10%) Query: 1048 KVDVEKYPGAVS--EEGQNVTETISLYSNPTKWFAGNMQSTGLWAPAQKEVTIKSNANVP 1105 ++++ G + + + T + T ++AP + +TI+ N Sbjct: 27 PAGIKQFYGDEQLINNAPRYKIRVYINTRYT-----DRHWTAIYAPPGELITIEVPPNAV 81 Query: 1106 VTVTVALADDLTGREKHEVALNRPPRVTKTYSLDASGTVKFKVPYGGLIYIKGNSST-NE 1164 + V + + + R K+ + P+GG I I N+ Sbjct: 82 GKIGVCFNHHINDDSGYRTRM----RTLKSSTTLNREVNNVSWPFGGAITITSGIDRFNQ 137 Query: 1165 SASFTFTGVVKAPFY-----KDGAWKNDLNS-PAPLGELESDAFVYTTPKKNLNASNYTG 1218 T TG ++ P++ D W+ +L+ P PL ++ + P + + Sbjct: 138 GLEVTITGGIRMPYFRYGYTTDQEWEEELSLLPGPLANIDMGNAIAQLPSRQIRGKVKLN 197 Query: 1219 GLEQFANDLDTFASSMNDFYGRDSEDGKHRMFTYKNLPGHKHRFTNDVQISIGDAHSGYP 1278 F S+N+ G D H ++T D + G+A + Y Sbjct: 198 DACAFWRSASRNLFSVNEINGGPRRD--------DGRIKHPTQWTFDTYVPFGEAATAYG 249 Query: 1279 ----VMNSSFSPNSTTLPTTPLNDWLIWHEVGHNAAETP-L-TVPGATEVANNVLAL 1329 + +S + W HE GHN + EV NNVL L Sbjct: 250 GNRIIFPPHWSEGIVNYDSAKWGCWGQLHEYGHNFQYGWGWPSFRDYIEVTNNVLNL 306 >UniRef50_C7BLH9 Putative uncharacterized protein n=1 Tax=Photorhabdus asymbiotica subsp. asymbiotica ATCC 43949 RepID=C7BLH9_PHOAA Length = 793 Score = 114 bits (286), Expect = 2e-23, Method: Composition-based stats. Identities = 80/436 (18%), Positives = 140/436 (32%), Gaps = 76/436 (17%) Query: 1084 QSTGLWAPAQKEVTIKSNANVPVTVTVALADDLTGREKHEVALNRPPRVTK-TYSLDASG 1142 Q TG + + +E+ + +V + P + K L +G Sbjct: 33 QPTGFYVISGQEIMVNVEGETDGSVNAVIG---------------VPELNKPEKYLLTTG 77 Query: 1143 TVKFKVPYGGLIYIKGNSSTNESASFTFTGVVKAPFYKDGA-----WKNDLNS--PAPLG 1195 KF GL+ N++ + + K P +K W+N + S AP+ Sbjct: 78 LNKFTSKNEGLLSFTNNNNHGYVKIIIQSELQKIPSFKLNETNNTDWENMMASYSDAPVV 137 Query: 1196 ELESDAFVYTTPKKNLNASNYTGGLEQFANDLDTFASSMNDFYGRDSEDGKHRMFTYKNL 1255 +L S+ + + Y D F ++ G EDGK N Sbjct: 138 QLSSERAIIVVRYNSAKK--YLTDPNALMKYYDDFIRFQDNISG-ILEDGKADYRVDSNK 194 Query: 1256 PGHKHRFTNDVQISIGDAHSGYPVMNSSFSPNSTTLPTTPLNDWLIWHEVGHNAAETPLT 1315 + D + M + L TT N W IWHE GH ++P T Sbjct: 195 FLYVE---ADRL----YMFATNGHMGFNGDAALQRLLTTN-NGWGIWHESGHQRQQSPYT 246 Query: 1316 VPGA---TEVANNVLALYMQDRYLGKMNRVADDITVAPEYLEESNNQAWARGGAGDRLLM 1372 G TEV N+ +L +Q+ + + + + EYL +L M Sbjct: 247 WSGGTGMTEVTVNLYSLAVQEGFHDRASFIDKYYPKIKEYLVTEEKNF-DAQDINIKLGM 305 Query: 1373 YAQLKEWAEKNFDIKKWYPDGTPLPEFYSEREGMKGWNLFQLMHRKARGDEVSNDKFGGK 1432 QL+ G + +H+ R + Sbjct: 306 LWQLRL---------------------------TFGNGFYPQLHQAYRLMDS-------- 330 Query: 1433 NYCAESNGNAADTLMLCASWVAQTDLSEFFKKWNPGANAYQLPG-ASEMSFEGGVSQSAY 1491 SN + L++ +S + +L+ FF KW N L + E + ++ Sbjct: 331 --LPVSNDDKKQQLIISSSQLTNINLAAFFDKWGITPNEKTLEILKTLPPLEKNIWENDD 388 Query: 1492 NTLASLDLPKPEQGPE 1507 ++++P+ + PE Sbjct: 389 KNSITIEMPQQKYVPE 404 >UniRef50_C2C0C9 Possible wall-associated protein n=1 Tax=Listeria grayi DSM 20601 RepID=C2C0C9_LISGR Length = 1217 Score = 113 bits (282), Expect = 7e-23, Method: Composition-based stats. Identities = 69/438 (15%), Positives = 129/438 (29%), Gaps = 59/438 (13%) Query: 1082 NMQSTGLWAPAQKEVTIKSNANVP-VTVTVALADDLTGREKHEVALNRPPRVTKTYSLDA 1140 + +STGL+ E+T++ + + V T L Sbjct: 81 SFESTGLFLYKGDEITVEVEGEPENLELRVGQWGGYTNTPYIATGSQSYAFKEGVVKLHG 140 Query: 1141 SGTVKFKVPYGGLIYIKGNSSTNESASFTFTGVVKAPFYKDGA-----WKNDLNS--PAP 1193 + GG++Y+ N ST ++ + G VK P+Y G +KN+L P Sbjct: 141 GTNTFTRNFSGGMVYL-VNYSTTKAENVAIKGGVKVPYYVQGKTSIDSFKNELEQYKDVP 199 Query: 1194 LGELESDAFVYTTPKKNLNASNYTGGLEQFANDLDTFASSMNDFYGRDSEDGKHRMFTYK 1253 E ++ + T + + + N +D F SS+ + Sbjct: 200 FMEFVNNDAIATI------RIDRAKDIFEKGNQVDVFMSSLAKIVKLQNGAAGLSYDGQG 253 Query: 1254 NLPGHKHRF-TNDVQISIGDAHS-----GYPVMNSSFSPNSTTLPTTPLNDWLIWHEVGH 1307 R + + G + G + +++ DW + HE GH Sbjct: 254 AERKDLQRIHVMNPEWGAGQLFATNNFIG--IHSATTKDREIFSKGLDSTDWGMLHESGH 311 Query: 1308 NAAETPLTVPGATEVANNVLALYMQDRYLGKMNRVADDITVAPEYLEESNNQAWARGGAG 1367 TEV N+ A Y Q + D S + + + Sbjct: 312 TYQNKMYQWRNMTEVTVNIYADYAQKMWSADGTG-RYDAVNVKNGSRASVQKYFKKLETD 370 Query: 1368 DRLLMYAQLKEWAEKNFDIKKWYPDGTPLPEFYSEREGMKGWNLFQLMHRKARGDEVSND 1427 + E + +F L + G++ + ++++ R Sbjct: 371 PTWNFDRESTETNDYHFA----------LLGMFLTLPRTFGYDFYPVLNQSYRSLPEEEL 420 Query: 1428 KFGGKNYCAESNGNAADTLMLCASWVAQTDLSEFFKKWNPGANAYQLPGASEMSFEGGVS 1487 ++ +L S VA DL+ FF+ W ++ Sbjct: 421 P--------KTEEEQKQLFILMTSKVAHRDLTPFFEHWRFT-----------------IT 455 Query: 1488 QSAYNTLASLDLPKPEQG 1505 L +L LP E+ Sbjct: 456 DETKEKLKALKLPTLEKE 473 >UniRef50_A2EPB9 Putative uncharacterized protein n=1 Tax=Trichomonas vaginalis RepID=A2EPB9_TRIVA Length = 664 Score = 112 bits (280), Expect = 1e-22, Method: Composition-based stats. Identities = 51/254 (20%), Positives = 98/254 (38%), Gaps = 28/254 (11%) Query: 1049 VDVEKYPGAVSEEGQNVTETISLYSNPTKWFAGNMQSTGLWAPAQKEVTIKSNANVPVTV 1108 D++++PG + + + + I++ W +TGLWA + I+ + V Sbjct: 323 PDIDEFPGTIEQ-SETSSFEITIDVKKKSWS-----TTGLWALPGICMEIEFKNDDWENV 376 Query: 1109 TVALADDLTGREKHEVALNRPPRVTKTYSLDASGTVKFKVPYGGLIYIKGNS-STNESAS 1167 + + + ++ R P V +T + A + P GG+IY+ N ++S S Sbjct: 377 IIQVGSHTEDFLQTDIPWPRWPVVYRT--VQAQKKLSLTSPTGGIIYLYLNEGEKSKSVS 434 Query: 1168 FTFTGVVKAPFYKDGA---WKNDLNSPAPLGELESDAFVYTTPKKNLNASNYTGGLEQFA 1224 +FT V++ P W S AP E + V+T P ++ L Q+ Sbjct: 435 VSFTNVIQYPHAILDQPTIWDETCFSMAPWCEFDCGNIVFTLPTTKSREADTDWLLTQYC 494 Query: 1225 NDLDTFASSMNDFYGRDSEDGKHRMFTYKNLPGHKHRFTNDVQISIGDAHSGYPVMNSSF 1284 +S+ D+ F + K+R D+ +S GYP++ + Sbjct: 495 K----LVTSIWDY------------FEVPDEERIKYRVVFDIALSGEGPTLGYPIVLLTS 538 Query: 1285 SPNSTTLPTTPLND 1298 S ++ + N Sbjct: 539 SASNIIQKVSKPNS 552 >UniRef50_A2FC48 Putative uncharacterized protein n=2 Tax=Trichomonas vaginalis RepID=A2FC48_TRIVA Length = 717 Score = 111 bits (277), Expect = 2e-22, Method: Composition-based stats. Identities = 56/255 (21%), Positives = 91/255 (35%), Gaps = 31/255 (12%) Query: 1052 EKYPGAVSEEGQNVTETISLYSNPTKWFAGNMQSTGLWAPAQKEVTIKSNANVPVTVTVA 1111 E +PG V+ + + I L N +W STGLW PA I P T Sbjct: 357 EIFPG-VTGDVELRDFEIDLTINSQEWT-----STGLWLPAGVMGEIIIENVPPSTFIQI 410 Query: 1112 LADDLTGREKHEVALNRPPRVTKTYSLDASGTVKFKVPYGGLIYIKGNSS---TNESASF 1168 + K ++ R P T + S K PYGG++Y+ + Sbjct: 411 GCHNEKNLPK-KLPWGRWPSTLVTEPV-ISEETKVGSPYGGIVYLMNGEEEELISYDIHA 468 Query: 1169 TFTGVVKAPFYKDGA--WKNDLNSPAPLGELESDAFVYTTPKKNLNASNYTGGLEQFAND 1226 F + P + A W + + P GE+E+ + T P +NL + +E FA Sbjct: 469 KFCNFCEFPRFVLEADTWTDTKDIEVPWGEIETSNVIVTMPSENLRKID----VEAFAKK 524 Query: 1227 LDTFASSMNDFYGRDSEDGKHRMFTYKNLPGHKHRFTNDVQISIGDAHSGYPVMNSSFSP 1286 +D+ S + ++ +K R DV + G+ YP+ S+ Sbjct: 525 IDSLISCIKNY--------------LDIEWEYKFRIIFDVMVDPGEPSISYPIAFSTLDI 570 Query: 1287 NSTTLPTTPLNDWLI 1301 + N +L Sbjct: 571 DHVVEGFNSPNKYLF 585 >UniRef50_B1V640 Putative antigenic protein NP1 n=1 Tax=Clostridium perfringens D str. JGS1721 RepID=B1V640_CLOPE Length = 1269 Score = 110 bits (275), Expect = 4e-22, Method: Composition-based stats. Identities = 58/303 (19%), Positives = 109/303 (35%), Gaps = 49/303 (16%) Query: 1053 KYPGAVSEEGQNVTETISLYSNPTKWFAGNMQSTGLWAPAQKEVTIKSNANV-------P 1105 ++ G++ ++ V + + + +N S + PA + TIK N Sbjct: 82 QFFGSIPDDVLGVEKKVYINTNAK-----GSHSLASYVPAGEIATIKLNNEALKYAKKGK 136 Query: 1106 VTVTVAL-ADDLTGREKHEVALNRPPRVTKTYSLDASGTVKFKVPYGGLIYIKGNSSTNE 1164 + ++V + + + NR P + KT+S++ + P+GG+IYI + S Sbjct: 137 LKISVGMTMVNAEDYNYNNNNQNRMPYLGKTFSVN-ENETQVGTPFGGMIYIDIDDSVPS 195 Query: 1165 SASFTFT--GVVKAPFY-----KDGAWKNDLNSPAPLGELESDAFVYTTPKKNLNASNYT 1217 F GVV P+Y D WK N+P E+ + + P K + N Sbjct: 196 GLRFEIDVKGVVDTPYYDLGRTTDEEWKESKNAPGLFAEIRTPYLRFMVPSKFIRNINNP 255 Query: 1218 GGLEQFANDLDTFASSMNDFYGRDSEDGKHRMFTYKNLPGHKHRFTNDVQISIGDAHS-- 1275 F + +S++ D + D I+ G A++ Sbjct: 256 YNAALFWTNSVALSSNIMD----------------QQYRIKPMSLIFDQYITAGIAYASV 299 Query: 1276 GYPVMNSS--FSPNSTTL-PTTPLNDWLIWHEVGHNAAETPLTVP---GAT----EVANN 1325 G + N ++ ++ +W HE+ H+ + L G E+ NN Sbjct: 300 GAWICNLPPEWATSALDYDSIMKSGEWGTIHEINHHYQKRYLNYSDEWGVGDEFSEITNN 359 Query: 1326 VLA 1328 L+ Sbjct: 360 ALS 362 >UniRef50_C2G2G0 Possible wall-associated protein n=2 Tax=Sphingobacterium spiritivorum RepID=C2G2G0_9SPHI Length = 622 Score = 109 bits (272), Expect = 9e-22, Method: Composition-based stats. Identities = 85/438 (19%), Positives = 144/438 (32%), Gaps = 86/438 (19%) Query: 1085 STGLWAPAQKEVTIKSNANVPVTVTVALADDLTGREKHEVALNRPPRVTKTYSLDASGTV 1144 TG + + I N T+ T H+ + N PP Y+L Sbjct: 92 PTGYYVAPNQSFNINVNLQQGSTLPKVQVG--THSRNHDYSYN-PPV----YNLSTGSNT 144 Query: 1145 KFKVPYGGLIYIKGNSSTNESAS------FTFTGVV-KAPFYK-----DGAWKNDL-NSP 1191 GG+I+I N +A+ +F+G + + P Y W+N L Sbjct: 145 ITANADGGIIWITYEQPANATAAPGASALLSFSGNISRIPVYIKNSTTLSNWQNQLLTYN 204 Query: 1192 APLGELESDAFVYTTPKKNLNASNYTGGLEQFANDLDTFASSMNDFYGRDSEDGKHRMFT 1251 L VY KNL+++ T + N+ D + + G ++ +H Sbjct: 205 NATDVLMVGQNVYMVYAKNLHSAVSTQNNDLILNNADAVWDNHYTYAGLNNSSTQHTR-- 262 Query: 1252 YKNLPGHKHRFT-NDVQISIGDAHSGYPVMNSSFSPNSTTLPTTPLNDWLIWHEVGHNAA 1310 P H +V + A Y +++ + N W +WHE GH Sbjct: 263 ----PVIPHLMVQTEVPFGLYYAFF-YRTAYANYDAHKVFGEDIVTN-WGVWHEFGHLMQ 316 Query: 1311 ETPLTVPGATEVANNVLALYMQDRYLGKMNRVADDITVAPE---YLEESNNQAWARGGAG 1367 P+ G EV N+ +L + R+ D V P+ YL + + + G Sbjct: 317 MQPIDWDGLGEVTVNIFSLKAERALGITPTRLTKD-NVWPQVHTYLSSTGTKNFDSQGVW 375 Query: 1368 DRLLMYAQLKEWAEKNFDIKKWYPDGTPLPEFYSEREGMKGWNLFQLMHRKARGDEVSND 1427 +L M+ QL W G + + ++R R D + Sbjct: 376 VKLAMFHQL--W-------------------------LAYGDSFYTNLYRSVREDTGTY- 407 Query: 1428 KFGGKNYCAESNGNAADTLMLCASWVAQTDLSEFFKKWNPGANAYQLPGASEMSFEGGVS 1487 S+ +L A + +L+ FF+ W N+ + Sbjct: 408 ---------SSSDAKKKNFILKACQQSGYNLTSFFQAWGIQDNSLNI------------- 445 Query: 1488 QSAYNTLASLDLPKPEQG 1505 YN +A+L+LP P Sbjct: 446 ---YNAVAALNLPTPSYD 460 >UniRef50_A2EKB8 Putative uncharacterized protein n=1 Tax=Trichomonas vaginalis RepID=A2EKB8_TRIVA Length = 685 Score = 109 bits (272), Expect = 9e-22, Method: Composition-based stats. Identities = 53/291 (18%), Positives = 102/291 (35%), Gaps = 34/291 (11%) Query: 994 KCSADLKKSLVD-----NNMIYGDGSSKAGMM--NPSYPLNYMEKPLTRLMLGRSWWDLN 1046 C + K L+D + G ++ G++ P +PL + LT L ++ Sbjct: 218 VCDNNFKDELIDLLNTSWEFLKRTGYNENGLICTKPCHPL--IAILLTDLYTKVPPENVV 275 Query: 1047 IKVDVEKYPGAVSEEGQNVTETISLYSNPTKWFAGNMQSTGLWAPAQKEVTIKSNANVPV 1106 + + +PGA +S + + STGL+ PA I+ + +P Sbjct: 276 AIPEYKDFPGA------TGNVELSNFEEHLELGPEMWVSTGLYLPAGVIGEIEISEPMP- 328 Query: 1107 TVTVALADDLTGREKHEVALNRPPRVTKTYSLDASGTVKFKVPYGGLIYIKGNSSTNESA 1166 V + + R P + L K P+GG++Y+ N T+E Sbjct: 329 DVHIHIGCHHESLVPKSPPWKRWPLTVCVFPL-TEKVTKVVSPFGGIVYVAMNIETDEPV 387 Query: 1167 --SFTFTGVVKAPF--YKDGA-WKNDLNSPAPLGELESDAFVYTTPKKNLNASNYTGGLE 1221 + F+ + P + D + W+ N P GE+ ++ + T + + G Sbjct: 388 RITVKFSNFCRHPVAQFDDSSVWEMTKNFEVPCGEIVAENLIITLSSQKMR---EIGNFS 444 Query: 1222 QFANDLDTFASSMNDFYGRDSEDGKHRMFTYKNLPGHKHRFTNDVQISIGD 1272 + + + S +++ +R LP D + S G Sbjct: 445 KIFDIFNKIISRLSENLSY-PITRPYRFVFDIELP--------DDEPSYGY 486 >UniRef50_Q87WH6 Putative uncharacterized protein n=3 Tax=Pseudomonas syringae group RepID=Q87WH6_PSESM Length = 827 Score = 105 bits (262), Expect = 1e-20, Method: Composition-based stats. Identities = 73/405 (18%), Positives = 129/405 (31%), Gaps = 81/405 (20%) Query: 1084 QSTGLWAPAQKEVTIKSNANVPVTVTVALADDLTGREKHEVALNRPPRVTK-TYSLDASG 1142 Q TG++ + V + + V + P + K L G Sbjct: 48 QPTGIYVTKGEHVELSYYQDTTVKIWAVFG---------------VPELNKPVTELLVFG 92 Query: 1143 TVKFKVPYGGLIYIKGNSSTNESASFTFTGV---VKAPFYK---DGAWKNDL--NSPAPL 1194 F+V GL+ S + G V A + K + W+N + + AP+ Sbjct: 93 YNSFEVRESGLLSFIC-QDAGHSVTVIIKGAYSGVPAFWLKETTNAMWQNMMVQYNNAPV 151 Query: 1195 GELESDAFVYTTPKKNLNASNYTGGLEQFANDLDTFASSMNDFYGRDSEDGKHRMFTYKN 1254 L S+ + ++ +Y E+ D S +D G E T Sbjct: 152 VMLTSERAIIVVRHESAR--DYITDPEKLMTYYDELIRSQDDISGVLGEGE-----TEWA 204 Query: 1255 LPGHKHRFTNDVQISIGDA-----HSGYPVMNSSFSPNSTTLPTTPLNDWLIWHEVGHNA 1309 + +KH + + H G+ + + S L P W WHE GH Sbjct: 205 IDPNKHLYVEADSL---YMFATNGHMGF----TGATALSYLLSGNPAQGWGPWHESGHQR 257 Query: 1310 AETPLTVPGATEVANNVLALYMQDRYLGKMNRVADDITVAPEYLEESNNQAWARGGAGDR 1369 +P+ TEV N+ +L Q+R G+ +R+ + EYL + + R Sbjct: 258 QLSPMNWEDMTEVTVNIYSLATQERMEGRASRLDVEYPFIKEYLNSPHREFSRLPDHFQR 317 Query: 1370 LLMYAQLKEWAEKNFDIKKWYPDGTPLPEFYSEREGMKGWNLFQLMHRKARGDEVSNDKF 1429 ++M QL F + +H++ R + Sbjct: 318 VVMLWQLHLTFRTGF---------------------------YPQLHQRYRLMQN----- 345 Query: 1430 GGKNYCAESNGNAADTLMLCASWVAQTDLSEFFKKWNPGANAYQL 1474 + + + ++ S ++ DLS FF +W L Sbjct: 346 -----LPQGSEDVLQRFIVETSLLSGRDLSTFFDRWGIYPTPETL 385 >UniRef50_A2DDW5 Immuno-dominant variable surface antigen-like n=4 Tax=Trichomonas vaginalis RepID=A2DDW5_TRIVA Length = 1162 Score = 104 bits (259), Expect = 3e-20, Method: Composition-based stats. Identities = 64/348 (18%), Positives = 118/348 (33%), Gaps = 35/348 (10%) Query: 1048 KVDVEKYPGAVSEEGQNVTETISLYSNPTKWFAGNMQSTGLWAPAQKEVTIKSNANVPVT 1107 V ++ G + + L+ N + G TGL+AP + +TI+ + + Sbjct: 104 PVGTRQFYGNENLVNNSKRYKARLFINTRR---GPFHPTGLYAPPGELITIEISEKIVNG 160 Query: 1108 VTVA---LADDLTGREKHEVALNRPPRVTKTYSLDASGTVKFKVPYGGLIYIKGNSST-N 1163 +T+ D+ TK + + KF PYGG I + T N Sbjct: 161 ITLKINRHVDEAANDSNILGG-------TKCDIVLSGTVSKFCWPYGGTIEFERGVDTAN 213 Query: 1164 ESASFTFTGVVKAPFYK-----DGAWKNDLNSPA-PLGELESDAFVYTTPKKNLNASNYT 1217 + +GV++ P++ D W+ DL+ A P+ ++ A T P + S Sbjct: 214 QGFDVNISGVIRCPYFIYGSTTDEEWEEDLSKQAGPVMFIDYGAGFVTMPSTDAKNSIRL 273 Query: 1218 GGLEQFANDLDTFASSMNDFYGRDSEDGKHRMFTYKNLPGHKHRFTNDVQISIGDAHSGY 1277 F + S+N+ R + + + +F + S A G Sbjct: 274 NDAMAFWRGVSRVLYSVNEVTYR---SRRSDGRVTTPMMTNLDKFV---RASEAVALVGA 327 Query: 1278 PVMNSS--FSPNSTTLPTTPLNDWLIWHEVGHNAAETPLTVPGATEVANNVL---ALYMQ 1332 V+ + P + W HE H+ EV+NNVL ++ + Sbjct: 328 NVIYEPPYWYPGWVNYESARWGCWGQLHEYAHHFQYF-WGWGDYGEVSNNVLNLISMSLM 386 Query: 1333 DRYLGKMNRVADD-ITVAPEYLEESNNQAWARGGAGDRLLMYAQLKEW 1379 K + I++ + + + + + D L Y W Sbjct: 387 TEIDSKRQIYLNGCISLGDGW--DYTSHPYGNINSKDLLFWYGLHLYW 432 >UniRef50_C2FV33 Possible wall-associated protein n=2 Tax=Sphingobacterium spiritivorum RepID=C2FV33_9SPHI Length = 620 Score = 104 bits (258), Expect = 4e-20, Method: Composition-based stats. Identities = 78/446 (17%), Positives = 134/446 (30%), Gaps = 84/446 (18%) Query: 1085 STGLWAPAQKEVTIKSNANVPVTVTVALADDLTGREKHEVALNRPPRVTKTYSLDASGTV 1144 TGL A + ++ + TV L + + T L+ Sbjct: 89 PTGLKPIAGTALQVQVTFHGGSTVKPVLIVGTPELDTEQTY-------TLNAGLNTLNPT 141 Query: 1145 KFKVPYGGLIYIKGNSSTNESASFTF-TGVVKAPFYKDGA-----WKNDLN--SPAPLGE 1196 K Y L Y+ +TN+S + F +G + P +K G W + L S A Sbjct: 142 LIKNMY--LQYVSATPNTNDSVTVEFISGYTQVPLFKLGQTTAAEWADQLTTFSDAEYVT 199 Query: 1197 LESDAFVYTTPKKNLNASNYTGGLEQFANDLDTFASSMNDFYGRDSEDGKHRMFTYKNLP 1256 L + N + +D S N+ G D+ HR K Sbjct: 200 LTGQKNYILLTRSRYNNY-TSTDPNTVLAAVDLIISRENEISGLDNSSTLHREPDGK--- 255 Query: 1257 GHKHRFTNDVQISIGDA---HSGY-----PVMNSSFSPNSTTLPTTPLNDWLIWHEVGHN 1308 ++ + G H G + + ++ W +WHE+GH Sbjct: 256 ------VCIIEKNSGYMDATHKGLVRLTGAAAWDKVFKAKNIVSGSTVDQWGLWHEIGHL 309 Query: 1309 AAETPLT-VPGATEVANNVLALYMQDRYLGKMNRVA--DDITVAPEYLEESNNQAWARGG 1365 P+T E A N+ A Y + Y V+ + + + + A Sbjct: 310 HQLWPITPYTVLGEAAPNIYASYTKKYYEPTYRYVSGTNWMNAKVYLAQPDAAKNLAAAE 369 Query: 1366 AGDRLLMYAQLKEWAEKNFDIKKWYPDGTPLPEFYSEREGMKGWNLFQLMHRKARGDEVS 1425 +L+M+ QL G + + +H+ R + Sbjct: 370 NYTKLMMFEQLML---------------------------AFGEDFMKNLHKMVREEIGI 402 Query: 1426 NDKFGGKNYCAESNGNAADTLMLCASWVAQTDLSEFFKKWNPGANAYQLPGASEMSFEGG 1485 +N L AS + +L+ FF+KW Sbjct: 403 TYPLPNRNTTNVD--ERLGALAFYASKTSGKNLTNFFQKWGFN----------------- 443 Query: 1486 VSQSAYNTLASLDLPKPEQGPETINQ 1511 +S S +++L+LP+P IN Sbjct: 444 LSSSRIALISALNLPEPATDVSLINS 469 >UniRef50_A2F335 Immuno-dominant variable surface antigen-like n=2 Tax=Trichomonas vaginalis RepID=A2F335_TRIVA Length = 1247 Score = 103 bits (257), Expect = 5e-20, Method: Composition-based stats. Identities = 48/259 (18%), Positives = 88/259 (33%), Gaps = 25/259 (9%) Query: 1082 NMQSTGLWAPAQKEVTIKSNANVPVTVTVALADDLTGREKHEVALNRPPRVTKTYSLDAS 1141 TG++ P + ++I N + V + + L R + ++L+ S Sbjct: 132 GYHPTGIYVPPGEVISIDIPGNTIKRIVVQFNHHTHDQNNYNTRLGR---LKCRFTLN-S 187 Query: 1142 GTVKFKVPYGG-LIYIKGNSSTNESASFTFTGVVKAPFYK-----DGAWKNDLNS-PAPL 1194 +F PYGG L + +G ++ P + D W+ DL APL Sbjct: 188 QHTEFAWPYGGNLDLVTYQDEFPLGFEVNISGGIRMPHFIYGVNTDEEWETDLRKLAAPL 247 Query: 1195 GELESDAFVYTTPKKNLNASNYTGGLEQFANDLDTFASSMNDFYGRDSEDGKHRMFTYKN 1254 ++ F+ P N+ + +F + + +N+ + DG + Sbjct: 248 TTFDTGTFLARMPTHNIRGAVCVNDGMRFWQTVSWNSYDVNEVTSVNRRDGN----VIRP 303 Query: 1255 LPGHKHRFTNDVQISIGDAHS---GYPVMN-SSFSPNSTTLPTTPLNDWLIWHEVGHNAA 1310 L + + G A + G + S++ + W + HE H+ Sbjct: 304 LFYNFDSYVP-----AGAAVAFVGGNFIQAPPSWAGGIVNYDSAKWGCWGLLHEYHHHFQ 358 Query: 1311 ETPLTVPGATEVANNVLAL 1329 EV NNVL L Sbjct: 359 SG-WGFANPGEVTNNVLNL 376 >UniRef50_A5ZEQ5 Putative uncharacterized protein n=1 Tax=Bacteroides caccae ATCC 43185 RepID=A5ZEQ5_9BACE Length = 937 Score = 102 bits (254), Expect = 1e-19, Method: Composition-based stats. Identities = 77/458 (16%), Positives = 145/458 (31%), Gaps = 77/458 (16%) Query: 1085 STGLWAPAQKEVTIKSN--ANVPVTVTVALADDLTGREKHEVALNRPPRVTKTYSLDASG 1142 TG++ +E+ + N + + + D G TY L +G Sbjct: 453 PTGIYVAQGQELVVLVADAHNEDMGICIQNLDKPGGDGFGGD----------TYPL-TTG 501 Query: 1143 TVKFKVPYGGLIYIKGNSST------NESASFTFTGVVKAPFYKDGA-----WKNDLNSP 1191 K KV GL+Y+ ++++ + F ++ W LN+ Sbjct: 502 VNKIKVKNKGLVYVIYHTTSLEELAGKQPVKIHFASGKVNGYFDSNKHEASRWSELLNNT 561 Query: 1192 A-PLGELESDAFVYTTPKKNLNASNYTGGLEQFANDLDTFASSMNDFYGRDSEDGKHRMF 1250 ++ T P L S G ++ + D F G G Sbjct: 562 VCGYFDVLGTYAHLTFPVNRLRNSTGNRG-KELIDLYDEIVEKEQIFMGLKKYGGMFMNR 620 Query: 1251 TYKNLPGHKHRFTNDVQISIGDAHSGYPVMNSSFSPNSTTLPTTPL-NDWLIWHEVGH-N 1308 Y N+ H + +D H+ Y + W HE+GH N Sbjct: 621 MYLNVMYHNFMYASDY-------HTAY---HDDTMDELCNPDRLKTTGCWGPAHEIGHCN 670 Query: 1309 AAETPLTVPGATEVANNVLALYMQDRYLGKMNRVADDITVAPEY---LEESNNQAWARGG 1365 L G TEV NN+++ Y+Q G +R+ + + + + A Sbjct: 671 QTRPGLKWHGLTEVTNNIMSQYIQTTVWGNTSRLQSEGWYTKAWDEIIAKRRAHA-QETD 729 Query: 1366 AGDRLLMYAQLKEWAEKNFDIKKWYPDGTPLPEFYSEREGMKGWN-LFQLMHRKARGDEV 1424 +L+ + QL+ + W P + GW+ + ++ R + Sbjct: 730 FFMKLVPFWQLELY---------WGKVKGFTP------KESNGWDGFYPQIYEHIRKNPD 774 Query: 1425 SNDKFGGKNYCAESNGNAADTLMLCASWVAQTDLSEFFKKWNPGANAYQLPGASEMSFEG 1484 + + + +C A+ DL+ FF+KW + Sbjct: 775 ----------LPTAGEQQLEFVYICCLK-AEKDLTGFFRKWGFLTPVDVTVDD----YGD 819 Query: 1485 G---VSQSAYNTL-ASLDLPKPEQGPETINQVTEHKMS 1518 G V+Q + + A +DL E+ +T+ ++ Sbjct: 820 GKIIVTQKQIDEILAKIDLLGFEKETAAFEYITDDNLN 857 >UniRef50_Q4ZNJ4 Putative uncharacterized protein n=1 Tax=Pseudomonas syringae pv. syringae B728a RepID=Q4ZNJ4_PSEU2 Length = 824 Score = 101 bits (252), Expect = 2e-19, Method: Composition-based stats. Identities = 66/355 (18%), Positives = 121/355 (34%), Gaps = 61/355 (17%) Query: 1166 ASFTFTGVVKA-PFYKDG----AWKNDLN--SPAPLGELESDAFVYTTPKKNLNASNYTG 1218 TG PF++ W+ + S AP+ L S + ++ Y Sbjct: 115 VVLDITGQYNHVPFFRMDMTNLEWEQQMAQYSNAPVVLLTSPRAIVVVRYQSAQL--YLT 172 Query: 1219 GLEQFANDLDTFASSMNDFYGRDSEDGKHRMFTYKNLPGHKHRFTNDVQISIGDA--HSG 1276 + + D S+ + G + T L KH + ++ + H G Sbjct: 173 DPAKLMGNFDDAISAQDGISG-----VINYSTTEWALDPSKHFYVEADRLYMFAMDGHMG 227 Query: 1277 YPVMNSSFSPNSTTLPTTPLNDWLIWHEVGHNAAETPLTV---PGATEVANNVLALYMQD 1333 + + + + L T P + W WHE GH +P+T G TEV N+ ++ Q+ Sbjct: 228 F----NGSAALARLLSTAPEDGWGPWHESGHQRQLSPMTWGTGTGMTEVTVNLYSMAAQE 283 Query: 1334 RYLGKMNRVADDITVAPEYLEESNNQAWARGGAGDRLLMYAQLKEWAEKNFDIKKWYPDG 1393 +LG+ EYL S + +G +L+M QL+ Sbjct: 284 VFLGRATGADSSYAPMKEYLASSLREYDNIKDSGHKLVMLWQLRL--------------- 328 Query: 1394 TPLPEFYSEREGMKGWNLFQLMHRKARGDEVSNDKFGGKNYCAESNGNAADTLMLCASWV 1453 G + + +H++ R + + A ++ S + Sbjct: 329 ------------SFGTSFYPQLHQRYRLMHNP----------PTVSDDKAQRFIVETSLL 366 Query: 1454 AQTDLSEFFKKWNPGANAYQL-PGASEMSFEGGVSQSAYNTLASLDLPKPEQGPE 1507 + +L+EFF +W N L A + + ++ +T + LP PE Sbjct: 367 SHVNLAEFFDRWGLYPNPETLNQIADLPALTLAIWETDADTTIPIPLPLSTYIPE 421 >UniRef50_A2DY87 Putative uncharacterized protein n=1 Tax=Trichomonas vaginalis RepID=A2DY87_TRIVA Length = 725 Score = 100 bits (249), Expect = 5e-19, Method: Composition-based stats. Identities = 66/419 (15%), Positives = 126/419 (30%), Gaps = 77/419 (18%) Query: 1045 LNIKVDVEKYPGAVS-EEGQNVTETISLYSNPTKWFAGNMQSTGLWAPAQKEVTIKSNAN 1103 L++ D +PG E + T I ++ + STGLW P T K N Sbjct: 357 LSVIPDCSTFPGLAKIREFKEYTIEIPIHEQT-------LHSTGLWLPPGVTATCKIEDN 409 Query: 1104 VPVTVTVALADDLTGREKHEVALNRPPRVTKTYSLDASGTVKFKVPYGGLIYIKGNSSTN 1163 + + + R P V Y ++ G + GG++YI ++ Sbjct: 410 FTDNIAIQIGAHSQSLVSQPPPWKRWPHVVMAYKVNN-GVTEIFSQVGGMVYIGVAETSV 468 Query: 1164 ESASFTFTGVVKAPFYK---DGAWKNDLNSPAPLGELESDAFVYTTPKKNLNASNYTGGL 1220 S TFT P ++ N P E+ ++ + P + Sbjct: 469 SSLKMTFTNCALYPRAIRSDPKIFEQTQNFDVPWSEISANNVTFVLPTVEFKKIYDMNEI 528 Query: 1221 EQFANDLDTFASSMNDFYGRDSEDGKHRMFTYKNLPGHKHRFTNDVQISIGDAHSGYPVM 1280 ++ +D + + ++ N+P R DVQ GYP+ Sbjct: 529 FKYYDDYIEIVAKIMNY----------------NIP-RPFRVVFDVQTPDDLPVPGYPI- 570 Query: 1281 NSSFSPNSTTLPTTPLNDWLIWHEVGHNAAETPLTVPGATEVANNVLALYM--------Q 1332 T+P +N+ L + N E ++ L + Sbjct: 571 ---------TIPIDSINN-LFY-----NLEEPN----------CDLFDLLTSIATSCLRE 605 Query: 1333 DRYLGKMNRVADDITVAPEYLEESNNQAWARGGAGDRLLMYAQL-----KEWAEKNFDIK 1387 + + R I +A ++++ + + ++ +L K F I Sbjct: 606 GYFDSETERALSQIVIAQIFMDKYDGFDVESDESFQVTQLFHELWRIHRKINNTALFQII 665 Query: 1388 K---------WYPDGTPLPEFYSEREGMKGWNLFQLMHRKARGDEVSNDKFGGKNYCAE 1437 + +Y D F + + NL ++ R R + C + Sbjct: 666 RESMSPDGPEYYDDEERWTAFTRDLSHITQLNLTKIFERLKRIPLNISANLEMFPACPD 724 >UniRef50_Q8EW84 Predicted integral membrane protein n=1 Tax=Mycoplasma penetrans RepID=Q8EW84_MYCPE Length = 984 Score = 100 bits (248), Expect = 5e-19, Method: Composition-based stats. Identities = 85/434 (19%), Positives = 143/434 (32%), Gaps = 86/434 (19%) Query: 1084 QSTGLWAPAQKEVTIKSNA---------NVPVTVTVALADDLTGR---EKHEVALNRPPR 1131 +TGL+ PA + +TI N+ + + DLT + NR P Sbjct: 272 YTTGLYLPAGEVITINFPGLTDEQVAALNIRLVINDNEIQDLTSSNVESQWSKCKNRMPV 331 Query: 1132 VTKTYSLDASGTVKFKVPYGGLIYIKGNSSTNESAS------FTFTGVVKAPFYKDG--- 1182 + + ++L F P GG+I ++ ++TN + G V+A Y G Sbjct: 332 MRQVFTLRK-NNFSFGNPLGGMINLEHINNTNTVVNGSNIFRVVIDGAVEALHYVHGYTT 390 Query: 1183 --AWKNDLN-SPAPLGELESDAFVYTTPKKNLNASNYTGGLEQFANDLDTFASSMNDFYG 1239 W+ + S AP E+E+D + PK NL ++ D + S + Sbjct: 391 EDEWQRLVKESTAPFVEIENDYTKFLVPKSNLGQFANKENY-EWELDENNNVISYSHKET 449 Query: 1240 RDSEDGKHRMFTYKNLPGHKHRFT-----------------NDVQ--ISIGDAHS--GYP 1278 + ++ N ++ R+ D Q + G A++ Y Sbjct: 450 LINNTYPYKSLDLWNKLSYESRYVSGLNETPTVRPQIKNYFTDYQYYVDGGAAYTSNAYN 509 Query: 1279 VMNSSFSPNSTTLPT-TPLNDWLIWHEVGHNAAETPLTV------PGATEVANNVLALYM 1331 VM S+ T T +W + HE H+ P V EV NNVL L Sbjct: 510 VMPRSWGSAVTNYDTNNNSGNWGVIHEYNHHFQVNPSNVSWGFIRNDQNEVTNNVLNLLA 569 Query: 1332 QDRYLGKMNRVADDITVAPEYLEESNNQAWARGGAGDRLLMYAQLKEWAEKNFDIKKWYP 1391 +Y + D + + +W G + A LK I+ Sbjct: 570 YAKY----ANIGQD------RAGKQDISSWPSGHLSNINSYNAILKV-------IRNSTT 612 Query: 1392 DG--TPLPEFYSEREGMKGWNLFQLMHRKARGDEVSNDKFGGKNYCAESNGNAADT-LML 1448 T Y+ GW + + R ++ A SN A+T + Sbjct: 613 SSIITAQTFHYTSVMANFGWE---GLEKAIRLANTTS---------APSNITDANTKFVY 660 Query: 1449 CASWVAQTDLSEFF 1462 S + +F+ Sbjct: 661 FISKATNYNWGQFY 674 >UniRef50_A2GCT2 Putative uncharacterized protein n=1 Tax=Trichomonas vaginalis RepID=A2GCT2_TRIVA Length = 694 Score = 99.5 bits (246), Expect = 9e-19, Method: Composition-based stats. Identities = 56/309 (18%), Positives = 103/309 (33%), Gaps = 43/309 (13%) Query: 1001 KSLVDNNMIYGDGSSKAGMMNPSYPLNYMEK--PLTRLMLGRSWW---DLNIKVDVEK-- 1053 S+ I G+ +A ++N +Y + E P+ M+ + W D I + K Sbjct: 277 DSIQTMATIVGEEIERASIINTNYAYDVCETFGPILHSMIQQGGWKYGDNQISATLTKLI 336 Query: 1054 ----------------YPGAVSEEGQNV-TETISLYSNPTKWFAGNMQSTGLWAPAQKEV 1096 + G N + +++ + + STGLW Q Sbjct: 337 VKAASVLPLSYFATYDFSGRFVGSSMNSDCQQVTVSLS---FPDPGWYSTGLWIQPQYLS 393 Query: 1097 TIKSNANVPVTVTVALADDLTGREKHEVALNRPPRVTKTYSLDASGTVKFKVPYGGLIYI 1156 ++ + VP + + +E R P V+ + + PYGG++Y Sbjct: 394 QVQFDQRVP-NCQIIIGCHTFNAIDNEPPWKRFPIVSLRRQITDR-FHEIATPYGGMVYF 451 Query: 1157 KGNSS----TNESASFTFTGVVKAPFYKDGA---WKNDLNSPAPLGELESDAFVYTTPKK 1209 S N+S T + VK P G WKN P GE+ + + Sbjct: 452 APGESEIIEKNKSIHVTLSDAVKYPMMILGKPDSWKNTYTEDIPWGEIVTKTIIIAARTD 511 Query: 1210 NLNASNYTGGLEQFANDLDTFASSMNDFYGRDSEDGKHRMFTYKNLPG---HKHRFTNDV 1266 +N + G + + L ++ + + K R+ P + + T D+ Sbjct: 512 QINMISDQEGQLSYIDSLIAPMVEVSGY----TMSKKFRLVFDIEYPHTFMNSYPITIDI 567 Query: 1267 QISIGDAHS 1275 + HS Sbjct: 568 NLHKKLLHS 576 >UniRef50_Q5LB88 Putative lipoprotein n=9 Tax=Bacteroides RepID=Q5LB88_BACFN Length = 939 Score = 99.1 bits (245), Expect = 1e-18, Method: Composition-based stats. Identities = 62/403 (15%), Positives = 119/403 (29%), Gaps = 64/403 (15%) Query: 1134 KTYSLDASGTVKFKVPYGGLIYI---KGNSSTNESASFTFTGVVKAPFY---KDG--AWK 1185 +Y L G K GL+YI + T + ++ K W Sbjct: 488 TSYPLS-EGANKITARNKGLMYILYHTPDYETAQPVKIHIASGQVNGYFDVAKHQASDWN 546 Query: 1186 NDLNSPA-PLGELESDAFVYTTPKKNLNASNYTGGLEQFANDLDTFASSMNDFYGRDSED 1244 L++ ++ T P + G + D +S + G + Sbjct: 547 KLLSNAVDKYFDVVGHYAHLTFPTERFRTHTPDGK--ALIDAYDQIVNSEMELMGLYKYN 604 Query: 1245 GKHRMFTYKNLPGHKHRFTNDVQISIGDAHSGYPVMNSSFSPNSTTLPTTPLND-WLIWH 1303 + Y ++ S A S + N + + W H Sbjct: 605 KLFKNRMYLHVMYT----------SYMYATSYHTAYNDGTLAELCNVDKLKTSACWGPAH 654 Query: 1304 EVGH-NAAETPLTVPGATEVANNVLALYMQDRYLGKMNRV-------ADDITVAPEYLE- 1354 E+GH N L G TEV NN+++ Y+Q G+ +R+ + + + Sbjct: 655 EIGHCNQTRPGLKWLGTTEVTNNIMSEYIQTTIFGQPSRLQTEDMGDGSRNRYSKAWTQI 714 Query: 1355 ----ESNNQAWARGGAGDRLLMYAQLKEWAEKNFDIKKWYPDGTPLPEFYSEREGMKGWN 1410 + + +L+ + QL+ + K + + + FY + Sbjct: 715 IAAGAPHGNFGSDSDVFCKLVPFWQLELYFGKV--LGRTPLQQSDKGGFYPDVYE----- 767 Query: 1411 LFQLMHRKARGDEVSNDKFGGKNYCAESNGNAADTLMLCASWVAQTDLSEFFKKWNPGAN 1470 + H R + G + S +A+ +L +FF KW Sbjct: 768 -YIRTHDNLR-----------------TAGEQQTEFVYICSLIAKANLLDFFTKWGFLTP 809 Query: 1471 AYQLPGA---SEMSFEGGVSQSAYNTLASLDLPKPEQGPETIN 1510 +++ + + +L PKP+ E I Sbjct: 810 VDITVDDYGTGKLTVTQARIDEIRSRVEALGYPKPDVALEYIT 852 >UniRef50_A5ZF31 Putative uncharacterized protein n=1 Tax=Bacteroides caccae ATCC 43185 RepID=A5ZF31_9BACE Length = 959 Score = 97.6 bits (241), Expect = 3e-18, Method: Composition-based stats. Identities = 85/456 (18%), Positives = 147/456 (32%), Gaps = 74/456 (16%) Query: 1069 ISLYSNPTKWFAGNMQSTGLWAPAQKEVTIKS--NANVPVTVTVALAD----DLTGREKH 1122 I N T ++ TG++ PA + + + + N V + V D D G ++ Sbjct: 454 IDAAVNKTAKYSLLDNPTGIFVPAGENLVVMADLNGLDKVNIRVQNLDKPGQDGFGGTEY 513 Query: 1123 EVALNRPPRVTKTYSLDASGTVKFKVPYGGLIYI--------KGNSSTNESASFTFTGVV 1174 + +G + GL+Y+ T AS G Sbjct: 514 TI---------------VNGVNTISIKEKGLVYVMYHKDDYENAPEITLHFASGKVNGYY 558 Query: 1175 KA--PFYKDGAWKNDLNSPAPL-GELESDAFVYTTPKKNLNASNYTGGLEQFANDLDTFA 1231 + P K G WK LN+ ++ T ++ NYT ++ N D Sbjct: 559 DSQNPKLK-GRWKELLNNSVDTHFDVIGKYVHLTFTTRSF--LNYTKDVDNLINLYDDMI 615 Query: 1232 SSMNDFYGRDSEDGKHRMFTYKNLPGHKHRFTNDVQISIGDAHSGYPVMNSSFSPNSTTL 1291 +F G + D +Y ++ + F D H+ Y S Sbjct: 616 YRQQEFLGLEKYDRMFHNRSYFHVHYNSGSFMY-----ATDYHTAY---IESSLNYLADE 667 Query: 1292 PTTPLNDWLIWHEVGHNAAETP-LTVPGATEVANNVLALYMQDRYLGKMNR--VADDITV 1348 N W HE+GH P L G TEV NN+ A+Y+Q + + +R V D Sbjct: 668 TQMAANCWGPAHELGHIHQTRPGLKWHGMTEVTNNITAIYVQTKVYNEPSRLTVQDRYVS 727 Query: 1349 APEYLEESNNQAWARGGAGDRLLMYAQLKEWAEKNFDIKKWYPDGTPLPEFYSEREGMKG 1408 A + A ++L+ + QL+ + + + G Sbjct: 728 AFNSIMAGQKAHNAESDVFNKLVPFWQLELY----------FGEVKGNTPMKRTDHGG-- 775 Query: 1409 WNLFQLMHRKARGDEVSNDKFGGKNYCAESNGNAADTLMLCASWVAQTDLSEFFKKWNPG 1468 + ++ K R + C + + A DL+ FF+KW Sbjct: 776 --FYADIYEKVR----TTANPTTDGLCQLE-------FVYNSCVSAGMDLTGFFEKWGFL 822 Query: 1469 ANAYQLPGA---SEMSFEGGVSQSAYNTLASLDLPK 1501 + + G + + + N + +L LPK Sbjct: 823 SPIDMMIGDYTNKQFTITETEIANVRNRIVALGLPK 858 >UniRef50_B1KGL9 Putative uncharacterized protein n=4 Tax=Shewanella RepID=B1KGL9_SHEWM Length = 991 Score = 94.5 bits (233), Expect = 3e-17, Method: Composition-based stats. Identities = 92/516 (17%), Positives = 166/516 (32%), Gaps = 67/516 (12%) Query: 999 LKKSLVDNNMIYGDGSSKAGMMNPSYPLNYMEKPLTRLMLGRSWWDLNIKVDVEKYPGAV 1058 ++S I G S + SYP++ T L + D + E P Sbjct: 456 KQESHQLYKYITLLGDSYRSQL--SYPMDVATSD-TMDYLQAMFADNTVYNYREINPAPA 512 Query: 1059 SEEGQNVTETISLYSNPTKWFAGNMQ---STGLWAPAQKEVTIKSN--ANVPVTVTVALA 1113 + T+ + + Q S G++A + VT+ N ++V V + Sbjct: 513 DLGNFSRTDFSHITPTDKSVSITSKQGFRSAGVYALPGQTVTVSRNDSSDVKTWVFINTQ 572 Query: 1114 DDLTGREKHEVALNRPPRV-TKTYSLDASGTVKFKVPYGGLIYIKGNSSTNESASFTFTG 1172 + E NRP + + + T+KF PYGG + +K + + + FTF+ Sbjct: 573 RSASTHEYATNGYNRPKYLQSTHVEIKPGETIKFTSPYGGPMQVKFDKG-DLATQFTFSS 631 Query: 1173 VVKAPFYKDG----AWKNDLN-SPAPLGELESDAFVYTTPKKNLNASNYTGGLEQFANDL 1227 V P++++G + LN S EL ++ F + + + L + Sbjct: 632 VGLHPYWRNGMDGAQFMQQLNDSEFDWAELATEHFEVHSRLDKMKTTMSHEPLWDTPEKM 691 Query: 1228 -DTFASSMNDFYGRDSEDGKHRMFTYKNLPGHK-------------HRFTNDVQISIGDA 1273 + ++++ + + + + D Q + G Sbjct: 692 GQAIMTHVHNYPHLLAGFKGPYIDSVSEITDFAIAQGWELDNLDTVKHMNAD-QATCGAG 750 Query: 1274 HSGYPVMNSSFSPNSTTLPTTPLNDWLIWHEVGHNAAETPLTVPG-ATEVANNVLALYMQ 1332 SG P + N + PT HE+GH + L G + N + Y + Sbjct: 751 CSGNP-----YDANWSFSPTGH----GDIHELGHGLEKGKLRFDGHEGHASTNPYSYYTK 801 Query: 1333 DRYL---GKMNR-VADDITVAPEYLEESNNQAWARGGAGDRLLMYAQLKEWAEKNFDIKK 1388 R GK+ I E L+ S QA + A+L W+ + + Sbjct: 802 SRGFKESGKLPSCQGLSIKDEFEVLQASMKQA-----DPFNYMQEAKLTSWSNGMATMLQ 856 Query: 1389 WYPDGTPLPEFYSEREGMKGWNLFQLMHRKARGDEVS------------NDKFGGKNYCA 1436 GW+L +H R E + F + A Sbjct: 857 MMVAAQK------NGALEDGWHLLARLHILLREFERAKTSEALWLQKRAQLGFSQFSLDA 910 Query: 1437 ESNGNAADTLMLCASWVAQTDLSEFFKKWNPGANAY 1472 + D LM+ S+ Q D E ++ W + Sbjct: 911 AKGISNNDFLMVAMSYSTQLDYREVYQMWGLATSQA 946 >UniRef50_A2DW23 Putative uncharacterized protein n=1 Tax=Trichomonas vaginalis RepID=A2DW23_TRIVA Length = 386 Score = 93.0 bits (229), Expect = 9e-17, Method: Composition-based stats. Identities = 39/158 (24%), Positives = 67/158 (42%), Gaps = 11/158 (6%) Query: 1049 VDVEKYPGAVSEEGQNVTETISLYSNPTKWFAGNMQSTGLWAPAQKEVTIKSNANVPVTV 1108 + E +PG + + + L P W A TGLW PA K T++ +++ P+ + Sbjct: 235 PEYEFFPGK-TGDEPLGEFDVELVVQPDIWIA-----TGLWLPAGKIGTVELHSDYPLNL 288 Query: 1109 TVALADDLTGREKHEVALNRPPRVTKTYSLDASGTVKFKVPYGGLIYIKGNSSTNE-SAS 1167 + + +TG AL R P V + L S + +GG+ Y+ N + + Sbjct: 289 QIQIGSQVTGLLAKNGALKRWPNVVSYFQL-TSEVTQVATSFGGITYVTCNDVMDSVTVK 347 Query: 1168 FTFTGVVKAPFY---KDGAWKNDLNSPAPLGELESDAF 1202 FT P WK+ N+ P GE+E+ ++ Sbjct: 348 IHFTNFCLYPRACCDDPSVWKSTQNTQVPWGEIETPSY 385 >UniRef50_A2FU45 Immuno-dominant variable surface antigen-like n=1 Tax=Trichomonas vaginalis RepID=A2FU45_TRIVA Length = 669 Score = 92.2 bits (227), Expect = 1e-16, Method: Composition-based stats. Identities = 59/314 (18%), Positives = 105/314 (33%), Gaps = 32/314 (10%) Query: 1081 GNMQSTGLWAPAQKEVTIKSNANVPVTVTVALADDLTGREKHEVALNRPPRVTKTYSLDA 1140 G TGL+ P + +TI+ VT+ + + + +R PR++ ++ Sbjct: 133 GYNYITGLYVPPGEVITIELPFFFSTPVTILINRHIESFNNNNKPEDRLPRLSCGITIKT 192 Query: 1141 SGTVKFKVPYGGLIYIKGNSSTNE-SASFTFTGVVKAPFYKDG-----AWKNDLNS-PAP 1193 G F PYGG + ++ + +G VK P G + N++ AP Sbjct: 193 QGPHYFAWPYGGSLELRYDIDKFTFGLPIKISGCVKMPTLIYGRDSEEDYMNEIRYLKAP 252 Query: 1194 LGELESDAFVYTTPKKNLNASNYTGGLEQFANDLDTFASSMNDFYGRDSEDGKHRMFTYK 1253 + ++ T P K N F Y + G R + Sbjct: 253 VTVFDTGNLKITMPSKLARNPNRLYDTLHFWQGAGRV------LYSFNEVQGMPRRSDGR 306 Query: 1254 NLPGHKHRFTNDVQISIGDAH----SGYPVMNSSFSPNSTTLPTTPLND-WLIWHEVGHN 1308 +F D +S G A+ S Y + F+ T + + W HE HN Sbjct: 307 V--KVPVQFNIDRYVSHGLAYCQQGSYYIQAPTGFAGAFTDFESVFSDGCWGPIHEYAHN 364 Query: 1309 AAETP---LTVPGATEVANNV-----LALYMQDRYLGKMNRVADDITVAPEYLEESNNQA 1360 + + E+ NV +L M + R+ + + ++ N Sbjct: 365 FQQNWGFGWFYSYS-EITVNVPNYISYSL-MTTIDSARQARIDGNYIMKSDW--NYNTHI 420 Query: 1361 WARGGAGDRLLMYA 1374 ++ G+ L YA Sbjct: 421 YSTIGSTALLFFYA 434 >UniRef50_A2F153 Immuno-dominant variable surface antigen-like n=1 Tax=Trichomonas vaginalis RepID=A2F153_TRIVA Length = 1148 Score = 91.8 bits (226), Expect = 2e-16, Method: Composition-based stats. Identities = 70/395 (17%), Positives = 122/395 (30%), Gaps = 36/395 (9%) Query: 1084 QSTGLWAPAQKEVTIKSNANVPVTVTVALADDLTGREKHEVALNRPPRVTKTYSLDASGT 1143 ST L+A + +T + + + L + R P + +++L+A Sbjct: 142 HSTTLYAMPGELITFEIPEFAVGKLNLVLN---RQAPSNGDLTQRYPNLQCSFTLNA-KK 197 Query: 1144 VKFKVPYGGLIYIK--GNSSTNESASFTFTGVVKAPFYK-----DGAWKN-DLNSPAPLG 1195 V F P GG + IK ++ S TG ++ P+++ D W+ N APL Sbjct: 198 VTFGYPLGGYMDIKCWIDTFPLHSVEINITGGIRIPYFRYGAESDQDWEEETRNYVAPLT 257 Query: 1196 ELESDAFVYTTPKKNLNASNYTGGLEQFANDLDTFASSMNDFYGRDS-EDGKHRMFTYKN 1254 ++ P F + S+N+ D +DG+ + + N Sbjct: 258 FYDTGNVKCIFPSTFSRNQIRMSDAGAFWRTVGRVMYSVNEVTNYDRRKDGRIKTAMWFN 317 Query: 1255 LPGHKHRFTNDVQISIGDAHSGYPVMNSSFSPNSTTLPTTPLNDWLIWHEVGHNAAETPL 1314 + + + + S+S W HE GH+ Sbjct: 318 FDSYVPAGAAVAFVGANFIQAPF-----SWSTAMINYEGAKWGCWGNVHEYGHHFQSG-W 371 Query: 1315 TVPGATEVANNVLALYMQDRYLGKMNRVADDITVAPEYLEESNNQAWARGGAGDRLLM-- 1372 + G E NNV+ +++ IT+ +N A+ G+ Sbjct: 372 GISGTGETTNNVINFITYAMLT-EIDATRQ-ITLGGASFNAANGWAYITHEFGNMDATKD 429 Query: 1373 YAQLKEWAEKN---FDIKKWYP--DGTPLPEFY-SEREGMKGWNLFQLMHRKARGDEVSN 1426 YA W N F + W +Y G LMH Sbjct: 430 YAGPFFWYGNNAYFFGLDAWRKCLRAHTHQIYYKRSDYGTYTSEF--LMHCAKFFHRDLR 487 Query: 1427 DKFGGKNYCAESNGNAADTLMLCASWVAQTDLSEF 1461 F + S + C + ++Q L EF Sbjct: 488 AYFKTFEFPEASQISDR-----CNNELSQMKLKEF 517 >UniRef50_C9YCF1 Putative uncharacterized protein n=1 Tax=Curvibacter putative symbiont of Hydra magnipapillata RepID=C9YCF1_9BURK Length = 872 Score = 91.0 bits (224), Expect = 3e-16, Method: Composition-based stats. Identities = 92/490 (18%), Positives = 155/490 (31%), Gaps = 110/490 (22%) Query: 1046 NIKVDVEKYPGAVSEEGQNVTETISLYSNPTKWFAGNMQSTGLWAPAQKEVTIKSNANVP 1105 + D+ + A + T + T + G A K VT++ + Sbjct: 373 TAQADLGSF--ASAMTAGMAVSTTDETVDVTVPSTSGFTAIGRLAAPGKPVTVELLSAGS 430 Query: 1106 VTVTVALADDLTGREKHEVA--LNRP-----PRVTKTYSLDASGTVKFKVPYGGLIYIKG 1158 TV++ L TG + NRP P + L ++ PYGG + + Sbjct: 431 ATVSLRLNTQRTGSTRLWDPNRYNRPRFLASPDMV----LSTGQAMQLVSPYGGTLQLVF 486 Query: 1159 NSSTNES-ASFTFTGVVKAPFYKDGAWKNDLNSPAP--------LGEL---------ESD 1200 +++T + GV K PF D + E+ +D Sbjct: 487 SNATPQQNVQLRLRGVAKHPFLDQSNGAGDKAAFVTALNAAQHEWAEIKLAGIEIHSRAD 546 Query: 1201 AFVYTTPKKNLNASNYTGGLEQFANDLDT-FASSMNDFYGR---------------DSED 1244 + +N ++Y G +++F N++ T F + G S Sbjct: 547 KM-----RAVINGTDYAGDIDKFLNEVKTLFFEDLYMLAGYALPGKSLTAHVQAMCTSLG 601 Query: 1245 GKHRMFTYKNLPGHKHRFTNDVQISIGDAHSGYPVMNSSFSPNSTTLPTTPLNDWLIWHE 1304 T +PG +H D G SG P W HE Sbjct: 602 WNCTDATLHRVPGTQHINV-DNYSQCGSGCSGNPY--DQDWG-------LSPRGWGESHE 651 Query: 1305 VGHNAAETPLTV--PGATEVANNVLALYMQDRYLGKMNRVADDITVA------------- 1349 VGHN + V +TEV+NN+ L+ R L +M D V+ Sbjct: 652 VGHNQQKGMHKVYDDRSTEVSNNLFPLHKGWRMLSEMGYNTGDTRVSYLSAFNMIKAAKL 711 Query: 1350 -PEYLEESNNQAWARG----GAGDRLLMYAQ-LKEWAEKNFDIKKWYPDGTPLPEFYSER 1403 + +E + W G+R+ Y Q + WA++ + Sbjct: 712 QADPVEAAYQSIWGNAAYAVQNGERMAFYMQWVHYWAQR-------------------QV 752 Query: 1404 EGMKGWNLFQLMHRKARG--------DEVSNDKFGGKNYCAESNGNAADTLMLCASWVAQ 1455 GW++ L++ R + +K G Y + + D L++ SW+ Q Sbjct: 753 SIATGWDIITLLYLHQRQFDAVADADWAANRNKLGYSTYATKPSPTGNDNLLITLSWITQ 812 Query: 1456 TDLSEFFKKW 1465 D F W Sbjct: 813 RDQRPTFDLW 822 >UniRef50_C5PJT0 Lipoprotein n=2 Tax=Sphingobacterium spiritivorum RepID=C5PJT0_9SPHI Length = 601 Score = 89.1 bits (219), Expect = 1e-15, Method: Composition-based stats. Identities = 68/464 (14%), Positives = 150/464 (32%), Gaps = 70/464 (15%) Query: 1085 STGLWAPAQ--KEVTIKSNANVPVTVTVALADDLTGREKHEVALNRPPRVTKTYSLDASG 1142 TG++ + ++ +++ V DD ++K + L + + ++ G Sbjct: 96 PTGIYFSEGDSAVLWVEKTNATQLSLRVTNWDDEEFKQK-DYPLTQG---YNAFKIENKG 151 Query: 1143 TVKFKVPYGGLIYIKGNSSTNESASF-----TFTGVVKAPFYKDGAWKNDL-NSPAPLGE 1196 + Y + + N G+ + + W N L N+ AP+ + Sbjct: 152 NSYIQ-------YFTPDKAGNNKVKIHILSGKVNGIFDISKHTNEDWDNLLANATAPVLD 204 Query: 1197 LESDAFVYTTPKKNLNASNYTGGLEQFANDLDTFASSMNDFYGRDSEDGKHRMFTYKNLP 1256 + ++L + G+E D+ ++ G + +P Sbjct: 205 IVGKQVQLAYAVQSLQTNAAHQGVE-LVRLYDSIIGIQHELMGLKQTNR---------IP 254 Query: 1257 GHKHRFTNDVQISIGDAHS---GYPVMNSSFSPNSTTLPTTPLNDWLIWHEVGH-NAAET 1312 ++ I G H+ G N++ + + N W + HE GH N Sbjct: 255 KNRM---FGRVIWKGFMHADGIGAAFHNNT-MKDVANVAGLRKNSWGVAHEFGHVNQVRP 310 Query: 1313 PLTVPGATEVANNVLAL---YMQDRYLGKMNRVADDITVAPEYLEE---------SNNQA 1360 + G TEV NN+ ++ Y+ ++ K+ R P+ + Q Sbjct: 311 NMKWVGTTEVTNNIYSVWTQYIYNQNQPKLEREKLKDYDEPKIGGRITSYMESAFIHRQP 370 Query: 1361 WARGGAGDRLLMYAQLKEWAEKNFD--IKKWYPDGTPLPEFYSEREGMKGW---NLFQLM 1415 W DR + ++W +F + W L +++ W N + + Sbjct: 371 WLTQAGPDRWDR-ERPRDWGGDHFVKLVPLW-----QLQLYFNVAGEGNTWENKNFYGDI 424 Query: 1416 HRKARGDEVSNDKFGGKNYCAESNGNAADTLMLCASWVAQTDLSEFFKKWNPGANAYQLP 1475 KA + DK + A + DL++FF++ + L Sbjct: 425 FTKAINAPTTKDKPDAYYQLE---------FIKNACDAGKLDLTDFFEQ-SGLLIPIDLW 474 Query: 1476 GASEMSFEGGVSQSAYNTLASLDLPKPEQGPETINQVTEHKMSA 1519 + ++ + + + P+ ++ +T + + A Sbjct: 475 VDDYTCAQMTITPNDIQQVKTYAAKYPKPNTSVLHYITANSVQA 518 >UniRef50_B2V178 Fibronectin type III domain protein n=2 Tax=Clostridium botulinum E RepID=B2V178_CLOBA Length = 1886 Score = 88.3 bits (217), Expect = 2e-15, Method: Composition-based stats. Identities = 83/486 (17%), Positives = 149/486 (30%), Gaps = 121/486 (24%) Query: 1081 GNMQSTGLWAPAQKEVTIKSNANVPVTVTVALADDLTGREKHEVALNRPPRVTKTYS-LD 1139 + Q G+ A A E++I + V V + + +TK + ++ Sbjct: 1003 NDYQPLGISARAGDEISIYVGTDGNVMPEVIFTQYYPESSQWTTTVK---SLTKGKNVIE 1059 Query: 1140 ASGTVKFKVPYGGLIYIK--GNSSTNESASFTFTGVVKAP-------------------- 1177 GG +Y++ NS+TN +G K P Sbjct: 1060 VPKIGSMATERGGSVYVRYPRNSATNNEIKIRVSGGTKIPYLNLSNIVDEEISKKEIEKY 1119 Query: 1178 -------------------------------FYKDGAWKNDLNSPAPLGELESDAFVYTT 1206 YK + LNS E+ +D F+ T Sbjct: 1120 ITTLEEFNEKLPTYYEDANRLMFNKTRENKNLYKFDEKTSVLNS----TEIVTDKFLLTL 1175 Query: 1207 PKK------NLNASNYTGGLEQF------ANDLDTFASSMNDFYGRDSEDGKHRMFTYK- 1253 P NL+AS + +++ +L A + Y + ++ + + Sbjct: 1176 PATEVLRGINLDASTLSDKIDKVYDALLAWEELGDIAYGVKGLYENPDLNNDGKVDSSEI 1235 Query: 1254 --NLPGHKHRFTNDVQISIGDAH--SGY---------PVMNSSFSPNSTTLPTTPLN--D 1298 ++P + S + SG+ P++N ++ T N Sbjct: 1236 KHSMPSSRLNIRYTRMFSGAFMYASSGHIGIEFGSVAPLLNGKPYTKNSDNDITAYNYYG 1295 Query: 1299 WLIWHEVGHNAAETPLTVPGATEVANNVLALYMQDRYLGKMNRVADDITVAPEYLEESNN 1358 W I HE+GH E E NN+++L Q ++R+ Y + ++ Sbjct: 1296 WGIAHEIGHVIDEGNAIY---GETTNNIISLMAQTIDDKALSRLESSNLYPKIYEKVNSG 1352 Query: 1359 QAWARGGAGDRLLMYAQLKEWAEKNFDIKKWYPDGTPLPEFYSEREGMKGWNLFQLMHRK 1418 L M+ QL + I + + ++R Sbjct: 1353 SIGVASNVFVSLGMFWQLHLAYDNEASINQEDS-------------------FYAKLNRL 1393 Query: 1419 ARGDEVSNDKFGGKNYCAESNGNAADTLMLCASWVAQTDLSEFFKKWNPGANAYQLPGAS 1478 R + S K+ + L+ AS AQ DL+EFFK+W AN + + Sbjct: 1394 YRENTESTPNVQAKD----------NLLIRLASDAAQKDLTEFFKRWGLIANNDTITYLA 1443 Query: 1479 EMSFEG 1484 +E Sbjct: 1444 SKGYEK 1449 >UniRef50_A5ZED4 Putative uncharacterized protein n=1 Tax=Bacteroides caccae ATCC 43185 RepID=A5ZED4_9BACE Length = 800 Score = 86.8 bits (213), Expect = 6e-15, Method: Composition-based stats. Identities = 77/437 (17%), Positives = 152/437 (34%), Gaps = 67/437 (15%) Query: 1063 QNVTETISLYSNPTKWFAGNMQSTGLWAPAQKEVTIKSNANVPVTVTVALADDLTGREKH 1122 + +N T ++ TG++A A + + I + L DL G + Sbjct: 345 PYQNPAVMATANKTSKYSLRDNPTGIYAKAGETLAIFVDDIYEGGRISMLIQDLNGGYNN 404 Query: 1123 EVALNRPPRVTKTYSLDASGTVKFKVPYGGLIYI------------------KGNSSTNE 1164 +KTY L G + V GGLIYI + + + Sbjct: 405 ----------SKTYELS-EGYNEITVEVGGLIYILNHVNDDIPLRHEDADNDQKRNIEAK 453 Query: 1165 SASFTFTGVVKAPFY-----KDGAWKNDLNSPAPLGELE--SDAFVYTTPKKNLNASNYT 1217 + F ++ K+ W ++ A E++ + T + Y Sbjct: 454 TVKVHFANGKVNGYFDIQKNKESDWAQIRDN-AKYQEIDILGEYSHLTWRISDFKK--YN 510 Query: 1218 GGLEQFANDLDTFASSMNDFYGRDSEDGKHRMFTYKNLPGHKHRFTNDVQISIGDAHSGY 1277 + + +LD +F G + Y + ++ F+ D + +A S Y Sbjct: 511 TEITKTIENLDRLVYLEEEFMG---------LVKYGKMFNNRMHFSIDYKAKSPNA-SDY 560 Query: 1278 -PVMNSS--FSPNSTTLPTTPLNDWLIWHEVGH-NAAETPLTVPGATEVANNVLALYMQD 1333 V N+S ++ P W HEVGH N L G TEV NN+++L++Q Sbjct: 561 RTVYNASDYYAEPFCKPENFPTRCWGPAHEVGHCNQTRPGLKWAGLTEVTNNIMSLFIQT 620 Query: 1334 RYLGKMNRVADDITVAPEYLEESNNQAWARGGAGDRLLMYAQLKEWAEKNFDIKKWYPDG 1393 + G+ ++ D + +++ L++ + I + Sbjct: 621 SF-GRPCKLLVDGCTLKDENDQTLGTYNNIYQGATSLIVDGKRPHCLP---GIANITRET 676 Query: 1394 TPLPEFYSER---EGMKGWNLFQLMHRKARGDEVSNDKFGGKNYCAESNGNAADTLMLCA 1450 +P + + + ++ + + ++ R E +DK G+N N D + Sbjct: 677 QLVPFWQLKLYMIDVLEKTDFYHKLYEYFRTHESPSDK--GEN----QGMNQLD-FVRQV 729 Query: 1451 SWVAQTDLSEFFKKWNP 1467 ++ ++ +FF+KW Sbjct: 730 CDISGLNMLDFFEKWGF 746 >UniRef50_B7V4F8 Putative uncharacterized protein n=7 Tax=Pseudomonas aeruginosa RepID=B7V4F8_PSEA8 Length = 923 Score = 86.0 bits (211), Expect = 1e-14, Method: Composition-based stats. Identities = 92/482 (19%), Positives = 158/482 (32%), Gaps = 82/482 (17%) Query: 1059 SEEGQNVTETISLYSNPTKWFAGNMQSTGLWAPAQKEVTIKSNANVPVTVTVALADDLTG 1118 G T T++L S A + G A K ++I+ ++ V L G Sbjct: 434 PVSGSEETLTLTLPS------AQGFTAIGRMAAPGKRLSIRIEDAGQASLAVGLNTQRIG 487 Query: 1119 REK-HEVALNRPPRVTKTYSLD--ASGTVKFKVPYGGLIYIK-GNSSTNESASFTFTGVV 1174 + PR K+ + A+ +V PYGGL+ + ++ ++ + TG Sbjct: 488 STRLWNTRQYDRPRFLKSPDIKLQANQSVALVSPYGGLLQLVYSGATPGQTVTVKVTGAA 547 Query: 1175 KAPFYKDGAWKNDLNS-----------PAPLGELESDAFVYTTPKKNLNAS---NYTGGL 1220 PF ++ + A E+ S + + + S +Y G + Sbjct: 548 SQPFLDIQPGEDSSQAIADFIQALDADKADWLEIRSGSVEVHAKVEKVRGSIDKDYGGDV 607 Query: 1221 EQFANDLDT-FASSMNDFYGRD---------------SEDGKHRMFTYKNLPGHKHRFTN 1264 ++F +L+ F G + T LPG Sbjct: 608 QRFIRELNEVFIDDAYTLAGFAIPNQAKTPAIQQECAARGWDCDSETLHKLPGT-QHINV 666 Query: 1265 DVQISIGDAHSGYPVMNSSFSPNSTTLPTTPLNDWLIWHEVGHNAAETPLTVPG--ATEV 1322 D G SG P + ++ N W HE+GHN L V G + E+ Sbjct: 667 DQYAQCGGGCSGNP-YDQTWGLN--------PRGWGESHELGHNLQVNRLKVYGGRSGEI 717 Query: 1323 ANNVLALYMQDRYLGKMNRVADDITVAPEYLEESNNQAWARGGAGDRLLMYAQLKEWAEK 1382 +N + L+ R L + + DD V Y N R A +Y +L W + Sbjct: 718 SNQIFPLHKDWRVLREFGQNLDDTRV--NYRNAYNLIVAGRAEADPLAGVYKRL--WEDP 773 Query: 1383 NFDIKKWYPDGTPLPEFYSE---------REGMKGWNLFQLMHRKARGDEVSNDKFGGK- 1432 Y FY++ + ++GW+++ L++ R + S+ Sbjct: 774 G-----TYALNGERMAFYTQWVHYWADLKNDPLQGWDIWTLLYLHQRQVDKSDWDANKAA 828 Query: 1433 -----------NYCAESNGNAADTLMLCASWVAQTDLSEFFKKWNPGANAYQLPGASEMS 1481 N S+ + D L+L SW+ Q D F W +A + Sbjct: 829 LGYGTYAQRPGNSGDASSTDGNDNLLLGLSWLTQRDQRPTFALWGIRTSAAAQAQVAAYG 888 Query: 1482 FE 1483 F Sbjct: 889 FA 890 >UniRef50_B9GV44 Predicted protein n=3 Tax=Populus trichocarpa RepID=B9GV44_POPTR Length = 452 Score = 83.7 bits (205), Expect = 5e-14, Method: Composition-based stats. Identities = 31/89 (34%), Positives = 38/89 (42%), Gaps = 4/89 (4%) Query: 32 SSDTPPVDSGTGSLPEVKPDPTPNPEPTPEPTPDPEPTPEPIPDPEPTPEPEPEPVPTKT 91 + P D P PDPTP+ P P+PTP P P EP P P P P+P P P P Sbjct: 44 PNQAPVPDPTPSQAPV--PDPTPSQAPVPDPTPSPAPVHEPTPSPAPVPDPTPNPAPAPD 101 Query: 92 GYLTLGGSQRVTGATCNGESSDGFTFKPG 120 TL S + T S+ +F Sbjct: 102 S--TLSPSPAASTTTLTSRVSENISFSKK 128 >UniRef50_B2UPI7 Putative uncharacterized protein n=3 Tax=Bacteria RepID=B2UPI7_AKKM8 Length = 506 Score = 82.6 bits (202), Expect = 1e-13, Method: Composition-based stats. Identities = 78/460 (16%), Positives = 138/460 (30%), Gaps = 88/460 (19%) Query: 1086 TGLWAPAQKEVTIKSNANVPVTVTVALAD---------------DLTGREKHEVALNRPP 1130 TG++ + V + +++ L + + G K ++ L Sbjct: 104 TGVYLEKGRHV-VLVGKTEGQEISLLLPNLMRKPAEGVQPTKDPNGWGLHKKQIPLKEGI 162 Query: 1131 RVTKTYSLDASGTVKFKVPYGGLIYIKGNSSTNESASFTFTGVVKAPFY------KDGAW 1184 + ++ Y ++ F ++ + W Sbjct: 163 NII---DVETPANAYIS-------YFTEDAGKAPKIPVHFVTGKANGYFDTTRGDTNKDW 212 Query: 1185 KNDLNSPA-PLGELESDAFVYTTPKKNLNASNYTGGLEQFANDLDTFASSMNDFYGRDSE 1243 L+ P+ + P + L G + N D G D Sbjct: 213 VRLLDQAVSPIMDARGKYIQVAYPVEFLKKFTKDRG-TELINAYDKLIGIQYQLMGLDKY 271 Query: 1244 DG--KHRMFTYKNLPGHKHRFTNDVQISIGDAHSGYPVMNSSFSPNSTTLPTTPLND--W 1299 ++R+ N + R D G A+ G N T D W Sbjct: 272 GKIPENRVLARVNFNYYMFR---DGD---GVAYLG----NDGTMRMVTDPENVLKGDACW 321 Query: 1300 LIWHEVGHNAAETPLTVPGATEVANNVLALYMQDR--YLGKMNRVADDITVAPEYLEESN 1357 HEVGH P+T G TEV+NN+ +L + ++ R E +E Sbjct: 322 GFSHEVGHVMQMRPMTWGGMTEVSNNIFSLQAAAKTGNESRLKRQGSYDKARKEIIEGEI 381 Query: 1358 NQAWARGGAGDRLLMYAQLKEWAEKNFDIKKWYPDGTPLPEFYSEREGMKGWNLFQLMHR 1417 ++L+ QL + KN +YPD + E+ G G Sbjct: 382 -AYLQSKDVFNKLVPLWQLHLYFTKN-GHPDFYPD---VMEYLRNNAGNYG--------- 427 Query: 1418 KARGDEVSNDKFGGKNYCAESNGNAADTLMLCASWVAQTDLSEFFKKW---NPGANAYQL 1474 G++ +F C + V +TDL++FF+KW PG Sbjct: 428 ---GNDTVKYQFEFVKACCD---------------VTKTDLTDFFEKWGFFKPGKFHIGD 469 Query: 1475 PGASEMSFEGGVSQSAYNTLASLDLPKPEQGPETINQVTE 1514 + + + + +A PKPE I +++E Sbjct: 470 YAQYDFNVTPEMVEETKKWIAGKGYPKPETD---ITELSE 506 >UniRef50_B2UP41 Putative lipoprotein n=1 Tax=Akkermansia muciniphila ATCC BAA-835 RepID=B2UP41_AKKM8 Length = 679 Score = 82.6 bits (202), Expect = 1e-13, Method: Composition-based stats. Identities = 64/374 (17%), Positives = 108/374 (28%), Gaps = 89/374 (23%) Query: 1133 TKTYSLDASGTVKFKVPYGGLIYIK---GNSSTNESASFTF-----TGVVKAPFYKDGAW 1184 +Y L G + GL YI+ N + GV K+ W Sbjct: 210 HSSYPLK-EGVNIIRAKNKGLGYIEYFTPNYKKAPKVHLSILSGKVNGVFVGGVSKNSDW 268 Query: 1185 KNDL-NSPAPLGELESDAFVYTTPKKNLNASNYTGGLEQFANDLDTFASSMNDFYGRDSE 1243 K L NSP + ++ P + L G E+ D G Sbjct: 269 KKMLENSPTEVVDIVGSRVHLVYPVEELKQFCPDKG-EELIALYDRIIGMEQQIMG---- 323 Query: 1244 DGKHRMFTYKNLPGHKHRFTNDVQISIGDAHS---GYPVMNSSFSPNSTTLPTTPLNDWL 1300 ++ Y+ LP ++ I G H+ G ++ P + W Sbjct: 324 -----LYKYRMLPKNRM---FGRVIWNGFMHADGTG-AAFHNGTMKEVGNPDRIPGSAWG 374 Query: 1301 IWHEVGH-NAAETPLTVPGATEVANNVLALYMQDRYLGKMNRVADD-------------- 1345 I HE GH N + EV NN+ + Y+ R+ + Sbjct: 375 IAHEFGHVNQVRPAMKWVSTGEVTNNIYSAYVNYMLNPSSMRLEHERINGGDGNMIGGRF 434 Query: 1346 ---------------ITVAPEYLEESNNQAWARGGAGDRLLMYAQLKEWAEKNFDIKKWY 1390 + P+ +N+ +L QL+ + F + Sbjct: 435 NAYLNNGILKGENWLVQSGPDKRSGGDNRPMVH-DHFVKLAPLWQLELY----FKVA--- 486 Query: 1391 PDGTPLPEFYSEREGMKGWNLFQLMHRKARGDEVSNDKFGGKNYCAESNGNAADTLMLCA 1450 G P+FY + ++ + RG + + M A Sbjct: 487 --GKGNPDFYPDI-------FYKAIKMDTRGKKDGELQLA---------------FMKNA 522 Query: 1451 SWVAQTDLSEFFKK 1464 A+ DL++FF+K Sbjct: 523 CDAARQDLTDFFRK 536 >UniRef50_A2DKX5 Immuno-dominant variable surface antigen-like n=1 Tax=Trichomonas vaginalis RepID=A2DKX5_TRIVA Length = 308 Score = 81.4 bits (199), Expect = 3e-13, Method: Composition-based stats. Identities = 34/189 (17%), Positives = 64/189 (33%), Gaps = 21/189 (11%) Query: 1048 KVDVEKYPGAVS--EEGQNVTETISLYS--NPTKWFAGNMQSTGLWAPAQKEVTIKSNAN 1103 +++ G S + I + + N W TG +AP + +T++ Sbjct: 102 PAGTKQFYGDDSLVSNAERYKAKIFIDTRYNVQHW-------TGFYAPPGELITVEIPDK 154 Query: 1104 VPVTVTVALADDLTGREKHEVALNRPPRVTKTYSLDASGTVKFKVPYGGLI-YIKGNSST 1162 + V + T R + R + + + KF PYGG I + S Sbjct: 155 ALNRIKVDINVITTSRSYN---YRRADQTSCRTDYINTTVTKFGWPYGGAIDFFIPIDSF 211 Query: 1163 NESASFTFTGVVKAPFYKDG-----AWKNDL-NSPAPLGELESDAFVYTTPKKNLNASNY 1216 E + V+K P+++ G W + + PAPL ++ + T P + Sbjct: 212 PEGLEVNISSVIKMPYFRYGATTEEEWNDKISKYPAPLAVFDTGSLHITGPSTFVRQKKN 271 Query: 1217 TGGLEQFAN 1225 + Sbjct: 272 LNDVMAIWR 280 >UniRef50_UPI000197B0FB hypothetical protein BACCOPRO_00998 n=1 Tax=Bacteroides coprophilus DSM 18228 RepID=UPI000197B0FB Length = 567 Score = 81.4 bits (199), Expect = 3e-13, Method: Composition-based stats. Identities = 74/464 (15%), Positives = 146/464 (31%), Gaps = 54/464 (11%) Query: 1084 QSTGLWAPAQKEVTIKSNANVPVTVTVALADDLT---GREKHEVALNRPPRVTKTYSLDA 1140 TG+ K + + + V +AD G E + R + L Sbjct: 124 HITGIVLKPGKHIIVVDGLKEGSKLGVKVADLYAPNQGDEDWSLHFER-------FELKN 176 Query: 1141 S-GTVKFKVPYGGLIYIK---GNSSTNESASFTF-----TGVVKAPFYKDGAWKNDLNSP 1191 ++ + GL Y+ N + F G A + W L + Sbjct: 177 GINVIEKTSEWTGLAYMDYYFDNPEKENTVKVHFITGEVNGYFDASVNTNEDWDRMLANA 236 Query: 1192 A-PLGELESDAFVYTTPKKNLNASNYTGGLEQFANDLDTFASSMNDFYGRDSED--GKHR 1248 P+ + P ++L G ++ + + S ++ G + ++ Sbjct: 237 VYPVFDATGSNIHLAYPVEDLKKYA-PGQGKELIDVYEQLVSKQHEIIGWKKYNHITNNK 295 Query: 1249 MFTYKNLPGHKHRFTNDVQISIGDAHSGYPVMNSSFSPNSTTLPTTPLND-WLIWHEVGH 1307 +F N + R + V + + T + W HEVGH Sbjct: 296 IFARVNYGYYMFRDGDGVAFKFDTM---------KRVADPVHMRTKDEDACWGFSHEVGH 346 Query: 1308 NAA-ETPLTVPGATEVANNVLALYMQDRYLGKMNRVADDITVAPEYLEESNNQAWA---- 1362 +T L+ G E +NN+ Y + K + +LE+S + Sbjct: 347 VHQLQTYLSWGGLGETSNNICTRYCTQAFGYKNRLSSAFANAEKSFLEDSKAGTVSPSRR 406 Query: 1363 RGGAGDRLLMYAQLKEWAEKNFDIKKWYPDGTPLPEFYSEREGMK-GW-NLFQLMHRKAR 1420 GG D ++ + K +K + +P + + + G+ + + ++ K R Sbjct: 407 AGGMNDSII--SSCKVNPDKAISYLETDVFERLVPFWKLQCYFTQNGYPDFYPDLYEKMR 464 Query: 1421 GDEVSNDKFGGKNYCAESNGNAADTLMLC-ASWVAQTDLSEFFKKWNPGANAYQLPGASE 1479 E + + + SN + AS +A +L +F+K+ +L Sbjct: 465 NSEKEHPELKDLD--RSSNVVPFQLNFIRGASLLAGKNLYPYFEKFGFF-RILKLSYGDY 521 Query: 1480 MSFEGGVSQSAYNT-------LASLDLPKPEQGPETINQVTEHK 1516 + ++ + L L KP PE +N + K Sbjct: 522 GDYNYEMTTEMRDRFKKEMEELEKQQLIKP-LTPEELNALIYAK 564 >UniRef50_D1PPF9 Putative fibronectin type III domain protein n=1 Tax=Subdoligranulum variabile DSM 15176 RepID=D1PPF9_9FIRM Length = 1764 Score = 79.5 bits (194), Expect = 1e-12, Method: Composition-based stats. Identities = 97/592 (16%), Positives = 173/592 (29%), Gaps = 124/592 (20%) Query: 948 LYQNMSVWLWNDTSYRYEEGKNDELGFKTFTEFLNCYANDAYAGGTKCSADLKKSLVDNN 1007 +Y N V + Y Y+ + D L Y +D + + + Sbjct: 792 IYNNRPVSISEMRFYAYDSLEAD---------ILALYDDDLHVTLRSGVDEKTIEALQTR 842 Query: 1008 MIYGDGSSKAGMMNPSYPLNYMEKPLTRLMLGRSWWDLNIKVDVEKYPGAVSEEGQNVTE 1067 + D +G ++P E R +L + D+ ++V T Sbjct: 843 LDTPD--EASGELHPEREALQRELDNARSLLTTTLSDV-VEVHT--------------TI 885 Query: 1068 TISLYSNPTKWFAGNMQSTGLWAPAQKEVTIKSNANVPVTVTVALADDLTGREKHEVALN 1127 T + Q G+ A A +E+ + ++ T A + + E Sbjct: 886 TAKKDGHLGFTGLNAWQPLGVTAKAGEEIIVYVGSSNVATGANAWLQLVATQYNSESG-- 943 Query: 1128 RPPRVTK-------TYSLDASGTVKFKVPYGGLIYIK-GNSSTNESASFTFTGVVKAPFY 1179 V++ + V F +GG +Y++ + N+ S +G P Sbjct: 944 --EVVSQPIQLKVGRNEIMVPELVTFDAEHGGALYVQFTGDNANDRYSVRVSGGATIPVL 1001 Query: 1180 K------DGAWKNDLNSPAPLGE------------LESDA-------------------F 1202 G K + + E L + Sbjct: 1002 DLYGIDDPGQRKERVTAYVAALEAANDTLNSRHDELHGEHGTCEPQTCILNTTDIMLDQM 1061 Query: 1203 VYTTPKKNL-------NASNYTGGLEQFANDLDTFAS------SMNDFYGRDSEDG---K 1246 +Y+ P L + ++ L + +D + + D G ++ + Sbjct: 1062 MYSVPVSQLLAGLGSGSTADKAARLLSSLDAMDQMMTLFYQHKGLTDLAGAGDKNRLPSQ 1121 Query: 1247 HRMFTYKNLPGHKHRFTNDVQISIGD-----AHSGYPVMNSSFSPNSTTLPTTPLNDWLI 1301 H Y + + I IG G P+ + + + L W I Sbjct: 1122 HLNIRYMRMFAGAFMYAAGNHIGIGWDSVPGLGGGEPITLN----EDGSYRSGDLFGWGI 1177 Query: 1302 WHEVGHNAAETPLTVPGATEVANNVLALYMQDRYLGKMNRVADDITVAPEYLEESNNQAW 1361 HE+GHN + V EV NN + Q + R D + Sbjct: 1178 AHEIGHNINQGSYAV---VEVTNNYFS---QISQAHEGVRFGYDAIYSK----------V 1221 Query: 1362 ARGGAGDRLLMYAQLKEWAEKNFDIKKWYPDGTPLPEFYSEREGMKGWNLFQLMHRKARG 1421 G G ++ QL + + + K Y E Y E + + + AR Sbjct: 1222 TSGTTGHSSDVFTQLGLYWQLHLAYDKGYEY-----EIYDNYEELFASRFYARVDTYARN 1276 Query: 1422 DEVSNDKFGGKNYCAESNGNAADTLMLCASWVAQTDLSEFFKKWNPGANAYQ 1473 + GG G+A M AS AQ DL++FF +W +A Sbjct: 1277 TAQA-PAPGGIKL--TLGGDADQNFMRLASAAAQKDLTDFFVRWGMTPDAAT 1325 >UniRef50_B2UQK5 Putative uncharacterized protein n=1 Tax=Akkermansia muciniphila ATCC BAA-835 RepID=B2UQK5_AKKM8 Length = 747 Score = 77.6 bits (189), Expect = 4e-12, Method: Composition-based stats. Identities = 65/438 (14%), Positives = 131/438 (29%), Gaps = 68/438 (15%) Query: 1085 STGLWAPAQKEVTIKSNANVPVTVTVALADDLTGREKHEVALNRPPRV-TKTYSLDASGT 1143 TG+ A E+ + + +A P + + +Y L+ G Sbjct: 247 PTGIVAEKGDEILVFVGPTHGEDIGLASVS--------------PAGIESSSYPLN-EGV 291 Query: 1144 VKFKVPYGGLIYI-----KGNSSTNESASFTFTGVVKAPFY-----KDGAWKNDL-NSPA 1192 K ++ GL+Y+ + + ++ D WK + N+P Sbjct: 292 NKIRINRSGLLYVMYHTDISPPKKPITVHIPVGSGIVNGYFDVTRHTDKDWKRMISNAPH 351 Query: 1193 PLGELESDAFVYTTPKKNLNASNYTGGLEQFANDLDTFASSMNDFYGRDSEDGKHR-MFT 1251 + ++ + K L + + + D +M G D H Sbjct: 352 SMFDIVGRNSMMILHTKYLKDYS-PDSITKSVRVWDESVKAMWKIMGFDKYPQPHNNRQL 410 Query: 1252 YKNLPGHKHRFTNDVQISIGDAHSGYPV--MNSSFSPNSTTLPTTPLND-WLIWHEVGHN 1308 ++ G H F + GY + ++ N W I HE+GH Sbjct: 411 GVSVEGGAHMF-------ATWYYCGYSIGDQGNTLKNEVLAPGVLQGNRLWGIGHEIGHC 463 Query: 1309 AAETPLTVPGATEVANNVLALYMQDRYLGKMNRVADDITVAPEYLEESNNQAWARGGAGD 1368 P +E +NN A + D+ +N + N Sbjct: 464 YQH-PFNWRSMSESSNNFFAQLILDQVTNAINGNEQ--------ASDMENPCKYLLSEAV 514 Query: 1369 RLLMYAQLKEWAEKNFDIKKWYPDGTPLPEFYSEREGMKGWNLFQLMHRKARGDEVSNDK 1428 + + + L WA+ F +Y ++ + + + R +S Sbjct: 515 KGMPFHDLNGWAKWGFAQYSFY-------LYFHKLGINP--EFYPRLFESLRRKPLSRQ- 564 Query: 1429 FGGKNYCAESNGNAADTLMLCASWVAQTDLSEFFKKWN---PGANAYQLPGASEMSFEGG 1485 A A L +++TD ++ F+ +N P G Sbjct: 565 -------AYEVSEAHLALYERICNISRTDFTDDFEIFNWFVPIDRKGHQYGDYSFKMTEE 617 Query: 1486 VSQSAYNTLASLDLPKPE 1503 +++++ +A+ PKP+ Sbjct: 618 MARASKARIAAKRYPKPK 635 >UniRef50_A5ZFW6 Putative uncharacterized protein n=1 Tax=Bacteroides caccae ATCC 43185 RepID=A5ZFW6_9BACE Length = 951 Score = 77.2 bits (188), Expect = 5e-12, Method: Composition-based stats. Identities = 79/406 (19%), Positives = 134/406 (33%), Gaps = 63/406 (15%) Query: 1085 STGLWAPAQKEVTIKSNANVPVTVTVALADDLTGREKHEVALNRPPRVTKTYSLDASGTV 1144 TG++ +E + V + + D + R P T +Y L G Sbjct: 442 PTGIYFKDGEEAVVILGNTNGEQVNLKVYD---FDAIRQG--QRTPDPT-SYPLS-EGIN 494 Query: 1145 KFKVPYGGLIYIK---GNSSTNESASF-----TFTGVVKAPFYKDGAWKNDLNSPAPLG- 1195 K ++ +GGL YI+ N T + G W+ LN A G Sbjct: 495 KLRIAHGGLSYIEYYTPNWKTAPALKLHIASGKVNGYYDKHRDVSADWREILN-KATYGC 553 Query: 1196 -ELESDAFVYTTPKKNLNASNYTGGLEQFANDLDTFASSMNDFYGRDSEDGKHRMFTYKN 1254 +++ D ++ Y L + + D ++ G D + + + Sbjct: 554 IDIKGDRVNLVFGVNSIKT--YCDNLGKLIQNYDDIVELEHELMGLDKWGRRPKNHMFA- 610 Query: 1255 LPGHKHRFTNDVQISIGDAHSGYPVMNSSFSPNSTTLPTTPLNDWLIWHEVGH-NAAETP 1313 R T D + G G + ++T + W I HE GH N Sbjct: 611 ------RVTKDGLFADGW---GAGWYEGCMNELASTTKSLREGVWAIAHEFGHVNQIRPG 661 Query: 1314 LTVPGATEVANNVLALYMQDRYLGKMNRVADDITVAPEYLEESNNQAWARGGAGDRLLMY 1373 L TEV NNV ++ + +++ + E + N+ RGG + L Y Sbjct: 662 LKWVSTTEVTNNVYSV------CARYKFYRENMPLEHERCNDGNDNN-VRGGRFNSYLNY 714 Query: 1374 AQLK--EW----AEKNFDIKKWYPDGT---------PLPEFYSEREGMKGWNLFQLMHRK 1418 +K +W + N D K+ G L +Y E G + + + + Sbjct: 715 GIIKGEQWLCQKGQDNMDPSKYPYGGDHFVKLCPLWQLLLYYREIVGGEKRDWYGDVAEI 774 Query: 1419 ARGDEVSNDKFGGKNYCAESNGNAADTLMLCASWVAQTDLSEFFKK 1464 R + S +NG M V + DL++FF K Sbjct: 775 VRNTDESQL----------TNGQLQLNFMRNTMDVVKEDLTDFFIK 810 >UniRef50_B0N0I0 Putative uncharacterized protein n=3 Tax=Bacteria RepID=B0N0I0_9FIRM Length = 1739 Score = 75.2 bits (183), Expect = 2e-11, Method: Composition-based stats. Identities = 58/289 (20%), Positives = 91/289 (31%), Gaps = 38/289 (13%) Query: 1196 ELESDAFVYTTP----KKNLNASNYTGGLEQFANDLDTFASSMNDFYGRDSED----GKH 1247 E+ D +Y+ L + Q N L+ M FY + + Sbjct: 1021 EIMLDHMLYSVSGKQIMAGLKGTTLDEKANQLLNSLNAMDQMMELFYQNKGLNENAAAIN 1080 Query: 1248 RMFTYKNLPGHKHRFTNDVQISIGDAHSGYPVMNSSFSPNSTTLPTTPLN--------DW 1299 + ++L R + G H G + S N W Sbjct: 1081 DRYPAQHLNIRYQRMFAGAFMYAGGNHIGIEWGSVSGLSNGIPFEAAENGKYLSGSLFGW 1140 Query: 1300 LIWHEVGHNAAETPLTVPGATEVANNVLALYMQDRYLGKMNRVADDITVAPEYLEE-SNN 1358 I HE+GHN + + E+ NN +L Q+R R P+ E+ ++N Sbjct: 1141 GIAHEIGHNINQGSYAIA---EITNNYFSLLSQNRDSNDTTRF-----KYPDVYEKVTSN 1192 Query: 1359 QAWARGGAGDRLLMYAQLKEWAEKNFDIKKWYPDGTPLPEFYSEREGMKGWNLFQLMHRK 1418 +L MY QL ++N+ K + L F + Sbjct: 1193 TVGMSSNVFTQLAMYWQLHLAYDQNYHYKLYDSHEEQLNSL-----------FFARVDYY 1241 Query: 1419 ARGDEVSNDKFGGKNYCAESNGNAADTLMLCASWVAQTDLSEFFKKWNP 1467 AR N GG + N +A + AS A DL++FF W Sbjct: 1242 ARNPGKVNIPEGGTAL--KLNSDAQQNFVRLASAAANKDLTDFFTAWGI 1288 >UniRef50_Q7MJ28 Putative uncharacterized protein VV2335 n=13 Tax=Vibrionales RepID=Q7MJ28_VIBVY Length = 929 Score = 74.5 bits (181), Expect = 3e-11, Method: Composition-based stats. Identities = 73/450 (16%), Positives = 145/450 (32%), Gaps = 67/450 (14%) Query: 1057 AVSEEGQNVTETISLYSNPTKWFAGNMQSTGLWAPAQKEVTIKSNANVPVTVTVALADDL 1116 E +++T+ L S N +S G++A + I N V V++A+ + L Sbjct: 457 EFGAEIARISKTVQLESKR------NFRSAGVYALPGETFQITRRDNSAVKVSIAI-NSL 509 Query: 1117 TGREKHEVALN---RPPRV-TKTYSLDASGTVKFKVPYGGLIYIKGNSSTNESASFTFTG 1172 HE + N RP + + TY + + T++ YGG I + +++ + FT Sbjct: 510 RSGATHEFSTNGYSRPKHLTSTTYEIKSGETIRLTSAYGGPIQVHFDTN-DLPVELRFTN 568 Query: 1173 VVKAPFYKDGAWKNDLNSPA-----PLGELESDAFVYTTPKKNLNASNYTGG-------- 1219 V + P ++ + EL + F + + + S Sbjct: 569 VAQHPVWRSAEDNEPFAAQLNQDQFDWAELITPGFEVHSKRDKMLQSISATEWAGSAAAM 628 Query: 1220 LEQFANDLDTFASSMNDFYG------RDSEDGKHRMFTYKNLPGHKHRFTNDVQISIGDA 1273 + + F ++ F G + D Q + G Sbjct: 629 AQATERYMHNFPHALAGFKGPGITVFEQVQTYGESKGWQVETIDMVKHMNAD-QATCGYG 687 Query: 1274 HSGYPVMNSSFSPNSTTLPTTPLNDWLIWHEVGHNAAETPLTVPGA-TEVANNVLALYMQ 1332 SG P + P HE+GH + G N + Y + Sbjct: 688 CSGNP-----YDAYWAFSPVGH----GDLHELGHGLEKGRFRFAGWEGHSTTNYYSYYSK 738 Query: 1333 DRYLGKMNRVADDITVAPEYLEESNNQAWARGGAGDRLLMYAQLKE-WAEKNFDIKKWYP 1391 +Y ++ + + ++ + +R A M AQ + W+ W Sbjct: 739 SQYF--IDTGKESQCQSLDFKGQYELLQQSRQQADPNAFMAAQNQTGWS--------WGG 788 Query: 1392 DGTPLPEFYSEREGMK--GWNLFQLMHRKARGDE---------VSNDKFGGKNYCAESNG 1440 ++++G+ GW+L +H R + + G + + Sbjct: 789 RVYIQMMMATQQQGILNDGWHLLGRLHLIEREFNRLKGSAELWDARKESIGFSQYSLDEA 848 Query: 1441 NAA---DTLMLCASWVAQTDLSEFFKKWNP 1467 NA D L++ S++ + D+ + W Sbjct: 849 NAISNNDWLLVALSYITERDMRAYLNMWGF 878 >UniRef50_D1BQB2 Putative uncharacterized protein n=1 Tax=Veillonella parvula DSM 2008 RepID=D1BQB2_VEIPT Length = 467 Score = 74.1 bits (180), Expect = 5e-11, Method: Composition-based stats. Identities = 28/73 (38%), Positives = 36/73 (49%), Gaps = 2/73 (2%) Query: 30 GSSSDTPPVDSGT--GSLPEVKPDPTPNPEPTPEPTPDPEPTPEPIPDPEPTPEPEPEPV 87 + P V PEVKP P P P+P +P P P P PE P P+PTP+PE +P Sbjct: 395 PQPTPKPEVKPAPVPTPKPEVKPAPQPTPKPEVKPAPVPTPKPEVKPAPQPTPKPEVKPA 454 Query: 88 PTKTGYLTLGGSQ 100 P T L + + Sbjct: 455 PVPTPKLEVKPTP 467 Score = 71.8 bits (174), Expect = 2e-10, Method: Composition-based stats. Identities = 26/75 (34%), Positives = 34/75 (45%) Query: 26 GGGSGSSSDTPPVDSGTGSLPEVKPDPTPNPEPTPEPTPDPEPTPEPIPDPEPTPEPEPE 85 G + PEVKP P P P+P +P P P P PE P P+PTP+PE + Sbjct: 369 GKVQPTPKPEVQPAPAPMPKPEVKPAPQPTPKPEVKPAPVPTPKPEVKPAPQPTPKPEVK 428 Query: 86 PVPTKTGYLTLGGSQ 100 P P T + + Sbjct: 429 PAPVPTPKPEVKPAP 443 >UniRef50_C1I981 Leucine rich repeat domain-containing protein n=1 Tax=Clostridium sp. 7_2_43FAA RepID=C1I981_9CLOT Length = 2664 Score = 73.7 bits (179), Expect = 6e-11, Method: Composition-based stats. Identities = 80/458 (17%), Positives = 140/458 (30%), Gaps = 96/458 (20%) Query: 1063 QNVTETISLYSNPTKWFAGNMQSTGLWAPAQKEVTIKSNANVPVTVTVALADDLTGREKH 1122 Q+ N F N Q TG+ A E+T+ +A+ + + G + Sbjct: 952 QHGDMVAHANRNLKFGFGNNNQPTGISAKPGDEITVYVDADPSQPMPKLVFSQQEGSFAN 1011 Query: 1123 EVALNRPPRVTKTYSLDASGTVKFKVPY------GGLIYI----KGNSSTNESASFTFTG 1172 R + ++ + Y GG IYI F Sbjct: 1012 ---WMRTVNLNPGKNVITVPDIAVDNWYRHDVTRGGSIYILNPYTSEQQPKTPV-IRFAS 1067 Query: 1173 VVKAPF----------------YKDGAWKNDLNSPAPL-------GELESDAFVYTTPKK 1209 K PF YK ++ +P L E SD V+T Sbjct: 1068 GDKYPFLTADTNVEEFKEFLIEYKKAIDEDIAKNPNVLDREVLDVFEFVSDHIVWTGTAT 1127 Query: 1210 NLNASNYTGGLEQFANDLDTF------ASSMNDFYGRDSEDGKHRMFTYKNLPGHKHRFT 1263 Y +E+ AN LDT + +YG D + ++ + F Sbjct: 1128 GA----YKAYIEKGANPLDTVNRYNNHMKELFKYYGLDGSNEQNDPKYIRENVRLAQPF- 1182 Query: 1264 NDVQISIGDAHSGYPVMNSSFSPNSTTLPTTPLNDWLIWHEVGHNAAETPLTVPGATEVA 1323 G ++ Y + + T W + HE+GH + EV Sbjct: 1183 -------GYMYA-Y-TNHIGVQGDVMTSLLVGEPGWGLDHEIGHRMDVSTRLY---GEVT 1230 Query: 1324 NNVLALYMQDRYLGKMNRVADDITVAPEYLEESNNQAWARGGAGDRLLMYAQLKEWAEKN 1383 NN+L +YM Y NR+ + + + E++N+ ++ G + L +Y QL+ + Sbjct: 1231 NNMLPMYMSVYYNKIDNRIPFENKIYKNVISENSNK-YSEGELAENLAVYWQLEMY---- 1285 Query: 1384 FDIKKWYPDGTPLPEFYSEREGMKGWNLFQLMHRKARGDEVSNDKFGGKNYCAESNGNAA 1443 P ++ + + R + ++ + Sbjct: 1286 ------------KPGYWGNLNKLY----------RERNVNLGSENP---------DNIKM 1314 Query: 1444 DTLMLCASWVAQTDLSEFFKKWNPGANAYQLPGASEMS 1481 L+ +S V DLSE+F + N S+ Sbjct: 1315 QYLVKFSSEVIGEDLSEYFARHGFEVNEETRQETSKYP 1352 >UniRef50_A5ZER3 Putative uncharacterized protein n=1 Tax=Bacteroides caccae ATCC 43185 RepID=A5ZER3_9BACE Length = 558 Score = 73.7 bits (179), Expect = 6e-11, Method: Composition-based stats. Identities = 85/462 (18%), Positives = 147/462 (31%), Gaps = 68/462 (14%) Query: 1085 STGLWAPAQKEVTIKSNANVPVTVTVALADDLTGREKHEVALNRPPRVTKTYSLDASGTV 1144 TGL+ + + + + + + L D + E T ++L G Sbjct: 53 PTGLYFEEGETIQVTAPDLQGYQLNLLLVD---FSKPAEGEKK---EKTTVFTLKT-GNN 105 Query: 1145 KFKVPYGGLIYIKGNSSTNESA---SFTF-----TGVVKAPFYKDGAWKNDLNSP-APLG 1195 KF P+ GL+Y+ A TF GV A + + WK L+S A + Sbjct: 106 KFYAPHKGLVYVSYYVKDCRKAPEQKLTFHTGINNGVFNAYQHTNDEWKRMLDSAIAEVI 165 Query: 1196 ELESDAFVYTTPKKNLNASNYTGGLEQFANDLDTFASSMNDFYGRDSEDGKHRMFTYKNL 1255 +++ T K L G+E D + G D Sbjct: 166 DMQGKYVHLTFDVKTLREKGSDCGVE-MIRMYDRIILWQQEMLGIDQFGY---------- 214 Query: 1256 PGHKHRFTNDVQISIGDAHSGYPVMNSS--FSPNSTTL---PTTPLNDWLIWHEVGH-NA 1309 + H F +IS +G P N P ++++ ++W+I HE GH N Sbjct: 215 RTNNHMFA---RISW----AGPPNANGKGVSFPRTSSIIRPEDIRNSNWVIGHEFGHVNQ 267 Query: 1310 AETPLTVPGATEVANNVLALYMQDRYLGKMNRVADDITVAPEYLEESNNQAWARGGAGDR 1369 L G TE+ NN+ A ++Q + + A G G + Sbjct: 268 VRPGLKWHGTTEITNNIQAAWIQYLLRPEGPF--------------RIEHSKAPDGTGQK 313 Query: 1370 LLMYAQLKEWAEKNFDIKK---WYPDGTPLPEFYSEREGMK-----GWNLFQLMHRKARG 1421 + Y L W + +++ Y T YS+ + W L G Sbjct: 314 V--YGGLFNWHFNHCVVQQKPLLYNPRTSFTPPYSDNKNPFVRLCPFWQLQIYNALTNFG 371 Query: 1422 DEVSNDKFGGK-NYCAESNGNAADT---LMLCASWVAQTDLSEFFKKWNPGANAYQLPGA 1477 + E + + + A V Q DL++FF + + G Sbjct: 372 KPDFYARISEIVRRTNEQDLTVGELQLNFVKNACDVIQEDLTDFFIRCGMLRSVDTEIGD 431 Query: 1478 SEMSFEGGVSQSAYNTLASLDLPKPEQGPETINQVTEHKMSA 1519 + +SQ + P+ I+ +T + + A Sbjct: 432 YGGNRHLSISQKQVEEVIRYASRYPKPKSPVIHYITMNSVKA 473 >UniRef50_A2EK19 Putative uncharacterized protein n=1 Tax=Trichomonas vaginalis RepID=A2EK19_TRIVA Length = 286 Score = 73.3 bits (178), Expect = 7e-11, Method: Composition-based stats. Identities = 31/141 (21%), Positives = 50/141 (35%), Gaps = 20/141 (14%) Query: 1142 GTVKFKVPYGGLIYIKGNSSTNESASFTFTGVVKAPFYK---DGAWKNDLNSPAPLGELE 1198 G + GGL+YI S S F +K+P Y + N P E+E Sbjct: 8 GDNEIFSQTGGLVYIAVEDSEKMELSVKFQKFLKSPRYIEEMPEIFDQTKNFQVPWAEIE 67 Query: 1199 SDAFVYTTPKKNLNASNYTGGLEQFANDLDTFASSMNDFYGRDSEDGKHRMFTYKNLPGH 1258 ++ + T PK + +F N +++ + ++DF H Sbjct: 68 TNNLILTLPK---QKYDKIQDFPKFCNFINSVCTKISDFM--------HYTI------RR 110 Query: 1259 KHRFTNDVQISIGDAHSGYPV 1279 + R DVQ G YP+ Sbjct: 111 QFRVVFDVQTPGGKPIPVYPI 131 >UniRef50_UPI0001C36412 coagulation factor 5/8 type domain protein n=1 Tax=Clostridium hathewayi DSM 13479 RepID=UPI0001C36412 Length = 1749 Score = 73.3 bits (178), Expect = 7e-11, Method: Composition-based stats. Identities = 47/303 (15%), Positives = 96/303 (31%), Gaps = 38/303 (12%) Query: 1181 DGAWKNDLNSPAPLG-ELESDAFVYTTPKKNLNASNYTGGLEQFANDLDTFASSMNDFYG 1239 DG W + + ++ + +++ + + A G +E+ A L ++ ++ Sbjct: 1000 DGDWSSAKKNCILGATDIVTKYMMFSVSSQQILAGLSGGTVEEKAEQLYQSLTAADEMVN 1059 Query: 1240 R---------DSEDGKHRMFTYKNLPGHKHRFTNDVQISIGDAHSGYPVMNSSFSPNSTT 1290 D + G+ L R + G H G + Sbjct: 1060 LFYQHKGLSSDPDAGEKNKLPVSRLNLRYQRMFAGAFMYAGGLHIGIEWGSIPGLTRGVP 1119 Query: 1291 LPTTPLN--------DWLIWHEVGHNAAETPLTVPGATEVANNVLALYMQDRYLGKMNRV 1342 + P W I HE+GH E + E+ NN ++ Q R Sbjct: 1120 VKADPKGRYESGQYFGWGIAHEIGHEINEGAYAIA---EITNNYFSVLAQAHDTNDSVRF 1176 Query: 1343 ADDITVAPEYLEESNNQAWARGGAGDRLLMYAQLKEWAEKNFDIKKWYPDGTPLPEFYSE 1402 PE ++ G G ++ QL + + + + ++ + Sbjct: 1177 -----QYPEVYKK-----VTSGVTGRSSNVFTQLGLYWQLHLAYDMGGYNYKTYDKYRDQ 1226 Query: 1403 REGMKGWNLFQLMHRKARGDEVSNDKFGGKNYCAESNGNAADTLMLCASWVAQTDLSEFF 1462 + F + R E + K GG + + + + CA A+ ++ EFF Sbjct: 1227 FNNL----FFARVDSYVRNTEAA-PKPGGVSLSLSGDVDNKLMRLACA--AAEKNILEFF 1279 Query: 1463 KKW 1465 ++W Sbjct: 1280 ERW 1282 >UniRef50_B6FXD4 Putative uncharacterized protein n=1 Tax=Clostridium hiranonis DSM 13275 RepID=B6FXD4_9CLOT Length = 1937 Score = 72.9 bits (177), Expect = 9e-11, Method: Composition-based stats. Identities = 59/260 (22%), Positives = 93/260 (35%), Gaps = 39/260 (15%) Query: 1218 GGLEQFANDLDTFASSMNDFYGRDSEDGKH----------RMFTYKNLPGHKHRFTNDVQ 1267 G E+ A+ + + +D +D ++ K RMF + H DV Sbjct: 1152 GVFEKVADFNNDGQINADDANSKDYQNNKASKRRVNVKYQRMFIGAFMYASGHHVGIDVG 1211 Query: 1268 ISIGDAHSGYPVMNSSFSPNSTTLPTTPLNDWLIWHEVGHNAAETPLTVPGATEVANNVL 1327 S D G P + T L W I HE+GH A T E +NN+L Sbjct: 1212 -SSKDLMKGVPF-KFDENGYVTNPDEARLFGWGISHEIGHKADIGNRTYS---ETSNNIL 1266 Query: 1328 ALYMQDRYLGKMNRVADDITVAPEYLEESNNQAWARGGAGDRLLMYAQLKEWAEKNF--D 1385 AL Q +R+ ++ Y + +++ A L M+ QL+ E + + Sbjct: 1267 ALITQTFDGKDKSRLEENGIYPKIYKKVTSSSVGVSQDATTLLGMFWQLQLAYEPGYTSE 1326 Query: 1386 IKKWYPDGTPLPEFYSEREGMKGWNLFQLMHRKARGDEVSNDKFGGKNYCAESNGNAADT 1445 + K DG + Y + M+R R AE + Sbjct: 1327 MLKRNNDGNLTNDSY-----------YAKMNRLYRSLTD-----------AEKALDKDQL 1364 Query: 1446 LMLCASWVAQTDLSEFFKKW 1465 L+ AS A DL++FF+ W Sbjct: 1365 LIRKASESAGKDLTDFFESW 1384 >UniRef50_Q2SHR7 Putative uncharacterized protein n=2 Tax=Gammaproteobacteria RepID=Q2SHR7_HAHCH Length = 1031 Score = 72.9 bits (177), Expect = 1e-10, Method: Composition-based stats. Identities = 92/533 (17%), Positives = 159/533 (29%), Gaps = 114/533 (21%) Query: 1023 SYPLNYMEKPLTRLMLGRSWWDLNI---------KVDVEKYPGAVSEEGQNVTETISLYS 1073 +YP++ + L + D ++ + D+ + + VT+TI + S Sbjct: 519 TYPMDKITTE-DNAFLKSLYADYSVYNYRALNPAQKDMGNFSRSSFNHVSPVTKTIDMES 577 Query: 1074 NPTKWFAGNMQSTGLWAPAQKEVTIKSNANVPVTVTVALADDLTGREKHEVALN---RPP 1130 ++ G++A V + N V ++ + +G H+ N RP Sbjct: 578 KR------YFRAAGVYALPGYTVKVTRLDNADVATSIFVNTQRSGAT-HQFEKNGYTRPK 630 Query: 1131 RVTK-TYSLDASGTVKFKVPYGGLIYIKGNSSTNESASFTFTGVVKAPFYKDGAWKNDLN 1189 + +S+ + ++ F YGG + I+ + SF F V + PF+ +G N Sbjct: 631 FLQTPKFSIKSGESITFTSTYGGPLQIEFGGNGKN-VSFRFENVGEHPFW-NGEEDNASF 688 Query: 1190 SPAP------LGELESDAFVYTTPKKNLNASNYTGGLEQFANDLDTFASSMNDFY----- 1238 A EL + F + + S + A +++F Sbjct: 689 EQALAQGDYDWAELVTPGFEVHSKLDKMRKSMQDENWKTGAALAAGTMRYIHNFPHVLAG 748 Query: 1239 ----GRDSEDGKHRMFTYKNLPGHKHRFT---NDVQISIGDAHSGYPVMNSSFSPNSTTL 1291 G D H T NL N Q + G SG P + Sbjct: 749 FKGPGIDVVAEIHDFATAHNLDIEHIDIVKHMNADQATCGYGCSGNP-----YDAYWEFS 803 Query: 1292 PTTPLNDWLIWHEVGHNAAETPLTVPGA-TEVANNVLALY--------------MQDRYL 1336 P HE+GH G + N + Y Q Sbjct: 804 PIGH----GDIHELGHGLERGRFRFSGWDGHASTNPYSYYSKSHYYIDTGKNPGCQSLPF 859 Query: 1337 GKMNRVADDI---TVAPEYLEESNNQAWARG-GAGDRLLMYAQLKEWAEKNFDIKKWYPD 1392 M +V D + Y++ W+ G G +++M AQ Sbjct: 860 ESMFKVLRDSVSESNPQAYVQSQKLTGWSNGAGITVQMMMTAQ----------------- 902 Query: 1393 GTPLPEFYSEREGMKGWNLFQLMHRKARGDEV------------SNDKFGGKNYCAESNG 1440 E + GWNL +H R SN F + + Sbjct: 903 --------GEGKLNDGWNLLPRLHILDRNFNAALASEEAWSAAKSNLGFSQYSLAEAKSI 954 Query: 1441 NAADTLMLCASWVAQTDLSEFFKKWNPGANA--------YQLPGASEMSFEGG 1485 N+ D + + S D ++ W +A + LP A + G Sbjct: 955 NSNDWMTVAVSHATGLDFRDYLTMWALPFSAKASAQVASFSLPVAPRKYYASG 1007 >UniRef50_A9MNV3 Putative uncharacterized protein n=1 Tax=Salmonella enterica subsp. arizonae serovar 62:z4,z23:-- RepID=A9MNV3_SALAR Length = 644 Score = 71.8 bits (174), Expect = 2e-10, Method: Composition-based stats. Identities = 55/326 (16%), Positives = 98/326 (30%), Gaps = 53/326 (16%) Query: 1177 PFYKDG-----AWKNDLNSPAPLGE---LESDAFVYTTPKKNLNASNYTGGLEQFAND-- 1226 P + G WKN P G+ + Y N+ ++ +EQ + Sbjct: 215 PVFVLGVNTLDDWKNISQQSTPSGQTLLFDGRTRYY--AANNVGKASKDHNIEQTLREHL 272 Query: 1227 LDTFA-SSMNDFYGRDSEDGKHRMFTYKNLPGHKHRFTNDVQISIGDAHSGYPVMNSSFS 1285 L+T +N G + R + D +I IG Sbjct: 273 LNTIVYDKLNGIDGSSPINEALRSLDIASYNSCCWAEGGDGRIGIGFG------------ 320 Query: 1286 PNSTTLPTTPLNDWLIWHEVGHNAAETPLTVPGATEVANNVLALYMQDRYLGKMNRVADD 1345 +++PT + W WHE GH + G E N+ ++ + R D Sbjct: 321 ---SSIPT--QSSWGEWHEFGHQNQMQ-WSWNGLGETTVNIYSI-----AACRATRGEVD 369 Query: 1346 ITVAPEYLEESNNQAWARGGAGDRLLMYAQLKEWAEKNFDIKKWYPDGTPLPEFYSEREG 1405 + E L+ N W + G+ L + W D + + Sbjct: 370 VKTCHENLQ-YNGFQWDQQAVGNFL---KSGQTW--------DLDTDTNVFHQLMMFAQL 417 Query: 1406 MKGW-NLFQLMHRKARGDEVSNDKFGGKNYCAESNGNAADTLMLCASWVAQTDLSEFFKK 1464 W +L+ + + R + +S D ++ AS + DL +FF Sbjct: 418 ETSWPDLYPALGKAYREINNYYNGSAKV----DSKQEKVDFFVVNASKYSGHDLRKFFTH 473 Query: 1465 WNPGANAYQLPGASEMSFEGGVSQSA 1490 W + + M+ + S Sbjct: 474 WGVDYSTDADNQITAMNLPQVIEPSG 499 >UniRef50_A8R9D2 Putative uncharacterized protein n=1 Tax=Eubacterium dolichum DSM 3991 RepID=A8R9D2_9FIRM Length = 1702 Score = 69.5 bits (168), Expect = 9e-10, Method: Composition-based stats. Identities = 36/185 (19%), Positives = 61/185 (32%), Gaps = 27/185 (14%) Query: 1298 DWLIWHEVGHNAAETPLTVPGATEVANNVLALYMQDRYLGKMNRVADDITVAPEYLEESN 1357 W I HE+GHN + + EV NN +L Q + R D + + + Sbjct: 1106 GWGIAHEIGHNINQGKYAIA---EVTNNYFSLIAQAKDTNDSVRF--DYEKVYDRVTSNV 1160 Query: 1358 NQAWARGGAGDRLLMYAQLKEWAEKNFDIKKWYPDGTPLPEFYSEREGMKGWNLFQLMHR 1417 +L MY QL ++ ++ K + E + F + Sbjct: 1161 KS--RSEDVFTQLAMYWQLHLAYDRYYNYK-----------LFDNYEDIYNHVFFARIDS 1207 Query: 1418 KARGDEVSNDKFGGKNYCAESNGNAADTLMLCASWVAQTDLSEFFKKWNPGANAYQLPGA 1477 R ++ + M AS A+ DLSEFF +W +P Sbjct: 1208 FVR---DTSKAPSPNEIALTLGNDKDQNFMRLASAAAEKDLSEFFMRWG------LIPDD 1258 Query: 1478 SEMSF 1482 + ++ Sbjct: 1259 TTKAY 1263 >UniRef50_C8WI41 Coagulation factor 5/8 type domain protein n=1 Tax=Eggerthella lenta DSM 2243 RepID=C8WI41_EGGLE Length = 1787 Score = 69.1 bits (167), Expect = 1e-09, Method: Composition-based stats. Identities = 39/186 (20%), Positives = 60/186 (32%), Gaps = 21/186 (11%) Query: 1298 DWLIWHEVGHNAAETPLTVPGATEVANNVLALYMQDRYLGKMNRVADDITVAPEYLEESN 1357 W I HE+GHN + EV NN A Q R + D + Sbjct: 1175 GWGIAHEIGHNINQAQYAYS---EVTNNYFAQLSQTDGTSASARFSYDEVYDRVTSGDEG 1231 Query: 1358 NQAWARGGAGDRLLMYAQLKEWAEKNFDIKKWYPDGTPLPEFYSEREGMKGWNLFQLMHR 1417 G +L MY QL+ D + Y + F + Sbjct: 1232 R----TGSVFTQLAMYWQLRLAY-----------DAGGAYQLYDTYQQAFDNRFFARVDS 1276 Query: 1418 KARGDEVSNDKFGGKNYCAESNGNAADTLMLCASWVAQTDLSEFFKKWNPGANAYQLPGA 1477 AR + + G + G ++ AS A+ DL++FF++W A+ Sbjct: 1277 YARAPKTAPAPEGTELVL---GGGEKQNIIRLASAAAERDLTDFFQRWGFTADEATKAYV 1333 Query: 1478 SEMSFE 1483 S+ E Sbjct: 1334 SQFPAE 1339 >UniRef50_A5ZKM8 Putative uncharacterized protein n=3 Tax=Bacteroides RepID=A5ZKM8_9BACE Length = 857 Score = 69.1 bits (167), Expect = 1e-09, Method: Composition-based stats. Identities = 58/406 (14%), Positives = 117/406 (28%), Gaps = 64/406 (15%) Query: 1085 STGLWAPAQKEVTIKSNANVPVTVTVALADDLTGREKHEVALNRPPRVTKTYSLDASGTV 1144 TG++ EV + ++++ + T + + ++N Y L G Sbjct: 352 PTGIYVNEGDEVVVLVGDTHGQSISIQNIGEETSKGYAQTSVNGD-----IYPLK-EGVN 405 Query: 1145 KFKVPYGGLIY------IKGNSSTNESASFTFTGVVKAPFY------KDGAWKNDLN-SP 1191 K G+++ I+ + G F+ + +K ++ + Sbjct: 406 KLTAKQTGMLFVMYNTNIQNPDAQPIKIHIPLGGGKVCGFFSLKEHQTNEKYKELIDKAD 465 Query: 1192 APLGELESDAFVYTTPKKNLNASNYTGGLEQFANDLDTFASSMNDFYGRDSEDGKHRMFT 1251 + +A + K L A+ + D + G Sbjct: 466 YKYFCVIGNAIILYFHHKQLKAA-VPYDILSSIELWDNMIQWQQELMG-----------I 513 Query: 1252 YKNLPGHKHRFTNDVQISIGDAHSG-------YPVMNSSFSPNSTTLPTTPLNDWLIWHE 1304 P + + G + Y V+ + N W HE Sbjct: 514 EDVYPKQMNNHIFAISPEGGYMWASEGRIGFVYTVLGDILRKSYLMASR---NSWGPAHE 570 Query: 1305 VGHNAAETPLTVPGATEVANNVLALYMQDRYLGKMNRVADDITVAPEYLEESNNQAWARG 1364 +GH + TE +NN+ + Y ++ +R + PEY Sbjct: 571 IGHVHQ-GAINWASTTESSNNLFSNYTIYKFGQNCSRGTE--LAVPEYAAN----VKKAT 623 Query: 1365 GAGDRLLMYAQLKEWAEKNFDIKKWYPDGTPLPEFYSEREGMKGWNLFQLMHRKARGDEV 1424 R + + K W + D PE ++ W L+ HR + Sbjct: 624 LVFRRCV---ENKAWCDFGTDY------QGEDPEMHARMN----WQLWNYYHRCGYNPQF 670 Query: 1425 SN--DKFGGKNYCAESNGNAADTLML-CASWVAQTDLSEFFKKWNP 1467 K +N + + + A A +L++FF++W Sbjct: 671 FPTLFKLMRENRVSTQDPGENQMMYARMACRAANENLTDFFERWGF 716 >UniRef50_B0A9L1 Putative uncharacterized protein n=1 Tax=Clostridium bartlettii DSM 16795 RepID=B0A9L1_9CLOT Length = 2011 Score = 68.7 bits (166), Expect = 2e-09, Method: Composition-based stats. Identities = 76/490 (15%), Positives = 147/490 (30%), Gaps = 84/490 (17%) Query: 1064 NVTETISLYSNPTKWFAGNM-----QSTGLWAPAQKEVTIKSNANVPV---TVTVALADD 1115 +T+ +S + QSTG+ A + I A+ + + Sbjct: 298 TLTQNGHTHSKSRNVLRMSRLGTDLQSTGIVARPGQVFKIFVEADSNTKLPQIVFTQQEG 357 Query: 1116 LTGREKHEVALNRPPRVTKTYSLDASGTVKFKVPYGGLIYIKGNSSTNESAS---FTFTG 1172 + E L + V + + + K GG +Y+ + +E G Sbjct: 358 HFSHWQKEYQLKKGLNVITVPEIYSDSWSQ-KSVKGGAVYLMNRYTADEQGKAPVVRIDG 416 Query: 1173 VVKAPFYKDGAWKND---------------LNSPAPLGELESDAFVYTTPKKNLNASNYT 1217 + P Y +G K+ ++ + E + +YT K Sbjct: 417 GEEFPLYNEGDDKDAFLEKLKAYKEKLDKNPDTTVDIFEFNTKRLLYTGTAKAAYQVYVK 476 Query: 1218 GGLE--QFANDLDTFASSMNDFYGRDSEDGKHRMFTYKNLPGHKHRFTNDVQISIGDAHS 1275 G++ + + DF G +D R G A++ Sbjct: 477 EGVDVGESIQVWNDKIQEAFDFAGL--KDDPSDPTNDSTNVRTTIRLMQP----YGAAYA 530 Query: 1276 GYPVMNSSFSPNSTTLPTTPLN----DWLIWHEVGHNAAETPLTVPGATEVANNVLA--L 1329 Y + L T + W + HEVGH ++ E+ NN+ A Sbjct: 531 AYGHVGIQRGIQEIALRTDKDSINSILWGMVHEVGHQM---DISEREWGEITNNMFANNA 587 Query: 1330 YMQDRYLGKMNRVADDITVAPEYLEESNNQAWARGGAGDRLLMYAQLKE-----WAEKNF 1384 Y+ + ++ ++AP+ + + RL M+ QL WAE Sbjct: 588 YINNGAGDRVPYSQIQTSLAPDDASTNFDNL----DYSQRLGMFWQLHLKDNTYWAE--- 640 Query: 1385 DIKKWYPDGTPLPEFYSEREGMKGWNLFQLMHRKARGDEVSNDKFGGKNYCAESNGNAAD 1444 ++K Y P E K + + + + G N Sbjct: 641 -VEKLYRKRKPSVS----NEQAKRDTFAKYASEVLNMNLTKHFEKYGFNL---------- 685 Query: 1445 TLMLCASWVAQTDLSEF---FKKWNPGANAYQLPGASEMSFEGG----VSQSAYNTLASL 1497 S + +L ++ K W + A G + G +S+S+ S+ Sbjct: 686 ------SESCKKELEKYPDGQKTWYLDSRALTYEGNGFEDKDTGLDVSLSKSSSGIRLSM 739 Query: 1498 DLPKPEQGPE 1507 ++P+ ++ Sbjct: 740 NMPQDKRDDL 749 >UniRef50_C8QBU1 Putative uncharacterized protein n=1 Tax=Pantoea sp. At-9b RepID=C8QBU1_9ENTR Length = 557 Score = 68.3 bits (165), Expect = 2e-09, Method: Composition-based stats. Identities = 55/372 (14%), Positives = 110/372 (29%), Gaps = 73/372 (19%) Query: 1157 KGNSSTNESASFTFTGVVKAPFYKDGAWKNDLNSPAPLGELES--DAFVYTTPKKNLNAS 1214 E F TG + + G N+ + ++ D F + + Sbjct: 185 PDKQQPGEFVPFKITGGGNSHLFILGQ-----NTQSDWAASKTIADKFGFALLYDGHANT 239 Query: 1215 NYTGGLEQ-----FANDLDTFASSMNDFYGRDSEDGKHRMFTYKNLPGHKHRFTNDVQIS 1269 + Q L + + + DG +FT Sbjct: 240 VVPTRIAQHTDEMIGKVLGDNLRVVALYEKINGMDGSEYLFTSP---------------- 283 Query: 1270 IGDAHSGYPVMNSSFSPN--------STTLPTTPLNDWLIWHEVGHNAAETPLTVPGATE 1321 +G + Y + N + T+ ++W +WHE+GH +E Sbjct: 284 MGSMFTNYDNCCFADYRNGYIGVGFHANTMNDKKGDNWGVWHELGHTYEPMKENFNLFSE 343 Query: 1322 VANNVLALYMQDRYLG-KMNRVADDITVAPEYLEESNNQAWARGGAGDRLLMYAQLKEWA 1380 + N ++ ++G ++ + ++PE W + + + + Sbjct: 344 IQVNRYSIEACQMFMGREIPLNKCHVDISPE------EGIWEKQAVAN----FIASGMYY 393 Query: 1381 EKNFDIKKWYPDGTPLPEFYSEREGMKGWNLFQLMHR---KARGDEVSNDKFGGKNYCAE 1437 I + F+S G + F +++ K N +Y Sbjct: 394 PDYSTINNLWKQ----LNFFSRLRFSYGEDFFPKVNQARLKTIQQAPGNTIAEKTDYVIG 449 Query: 1438 SNGNAADTLMLCASWVAQTDLSEFFKKWNPGANAYQLPGASEMSFEGGVSQSAYNTLASL 1497 S D ++ S A DL ++F +W + +Q+ +A L Sbjct: 450 SKQKVIDFSVVAYSQAAGQDLRQYFTQWGLNFS----------------TQAG-EKVAEL 492 Query: 1498 DLPKP--EQGPE 1507 LP+P EQ P+ Sbjct: 493 QLPQPGAEQAPQ 504 >UniRef50_A5ZG25 Putative uncharacterized protein n=1 Tax=Bacteroides caccae ATCC 43185 RepID=A5ZG25_9BACE Length = 886 Score = 67.9 bits (164), Expect = 3e-09, Method: Composition-based stats. Identities = 41/239 (17%), Positives = 73/239 (30%), Gaps = 42/239 (17%) Query: 1290 TLPTTPLNDWLIWHEVGHNAAETPLTVPGATEVANNVLALYMQDRYLGKMNRVADDITVA 1349 + N W HE+GH + P +TE +NN+ + Y+ R LGK +T Sbjct: 574 NVMAAEDNAWGPAHEMGHVHQAA-INWPSSTESSNNLFSNYVI-RRLGKYKSRGRGLTSL 631 Query: 1350 PEYLEESNNQAWARG-------GAGDRLLMYAQLKEWAEKNFDIKKWYPDGTPLPEFYSE 1402 + W G + M QL + + Sbjct: 632 ANAIYRDKQVWWNMGTSTHQNEDTEIHMRMNWQLWIYYD----------------LCKGN 675 Query: 1403 REGMKGW-NLFQLMHRKARGDEVSNDKFGGKNYCAESNGNAADTLMLCASWVAQTDLSEF 1461 + K W +F +M + S+ + +C AQ DL++F Sbjct: 676 EQEAKFWPKVFDIMRTTYKNVPESDPGARQLAFVKA----------VC--EAAQEDLTDF 723 Query: 1462 FKKWNPGANAYQLPGASEMSFEGGVSQSAYNTLASL--DLPKPEQGPETINQVTEHKMS 1518 F+ W + ++ V+ + P+ P I + + K+S Sbjct: 724 FETWGFFKTVDNVKVEQYGTWTYTVTDKMIADTKAWIKTQNYPKAAP--IQYIEDRKIS 780 >UniRef50_B0A9L0 Putative uncharacterized protein n=1 Tax=Clostridium bartlettii DSM 16795 RepID=B0A9L0_9CLOT Length = 1263 Score = 67.9 bits (164), Expect = 3e-09, Method: Composition-based stats. Identities = 75/461 (16%), Positives = 143/461 (31%), Gaps = 104/461 (22%) Query: 1062 GQNVTETISLYSN-PTKWFAGNMQSTGLWAPAQKEVTIKSNANVPVTVTVALADDLTGRE 1120 QN + ++ +F + QSTG+ A + T+ A+ + Sbjct: 72 AQNGNISWKARNDLRMTYFGTDYQSTGIVARPGETFTVYVEADEGAPMPKIAFSQHEALY 131 Query: 1121 KHEVALNRPPRVTKTYSLDASGTVKFKVP------------YGGLIYIKGNSSTNESAS- 1167 + R Y L G VP GG +Y+ + E Sbjct: 132 SN---WVRW------YDLK-PGKNVITVPEIYDNSWSNKTVKGGAVYLLNRYTAKEQGKA 181 Query: 1168 --FTFTGVVKAPFYKDGA-----------WKNDLNSP----APLGELESDAFVYTTPKKN 1210 T G P Y +G +K L+ L E ++ +YT Sbjct: 182 PVVTIEGGETFPIYNEGDDKAAFIEKLKAYKQKLDQDPENTVDLFEFNTERLLYTGTASA 241 Query: 1211 LNASNYTGGLE--QFANDLDTFASSMNDFYGRDSE------DGKHRMFTYKNLPGH--KH 1260 G++ + A +L++ M DF G ++ D + T + + + + Sbjct: 242 AYKVYVEEGVDVGESAANLNSQVQEMFDFSGLKNDPTDPNNDSTNVKTTIRLMQPYGLAY 301 Query: 1261 RFTNDVQISIGDAHSGYPVMNSSFSPNSTTLPTTPLNDWLIWHEVGHNAAETPLTVPGAT 1320 + + V I P M + + ++ W HEVGH + Sbjct: 302 AYVDHVGIQRDYE----PGMLRTDQESLNSV------LWATVHEVGHQM---DIMGRDWP 348 Query: 1321 EVANNVLALYMQDRYLGKMNRVADDITVAPEYLEESNNQAWARGGAGDRLLMYAQLKEWA 1380 E+ NN+ + Y ++ G +RV + E+S+ +++ G RL M+ QL+ Sbjct: 349 EITNNMWSNYAHIKH-GMNDRVPYNDIYNDLAPEDSH-KSFDDLGYFQRLGMFWQLQL-- 404 Query: 1381 EKNFDIKKWYPDGTPLPEFYSEREGMKGWNLFQLMHRKARGDEVSNDKFGGKNYCAESNG 1440 +++E E R + S + Sbjct: 405 --------------KKDTYWTELET------------LYRERKQS----------PANYQ 428 Query: 1441 NAADTLMLCASWVAQTDLSEFFKKWNPGANAYQLPGASEMS 1481 D L +S + +L+ F+K+ + ++ Sbjct: 429 EKKDMLATYSSEILGINLTNHFEKYGFTLSDKCKENLKKLP 469 >UniRef50_C5VKR9 Putative uncharacterized protein n=1 Tax=Prevotella melaninogenica ATCC 25845 RepID=C5VKR9_9BACT Length = 1052 Score = 66.4 bits (160), Expect = 9e-09, Method: Composition-based stats. Identities = 39/302 (12%), Positives = 82/302 (27%), Gaps = 49/302 (16%) Query: 1180 KDGAWKNDLNSPAPLGE---LESDAFVYTTPKKNLNASNYTGGLEQFANDLDTFASSMND 1236 + W+ + + ++ + V P + +++ ++ +D Sbjct: 556 TNEDWQKMQSDGLVWAKAFNMKGELVVMNMPSQACKDYTPV-HMKELVEIWNSIVQREDD 614 Query: 1237 FYGRDSEDGKHRMFTYKNLPGHKHRFTNDVQISIGDAHS--GYPVMNSSFSPNSTTLPTT 1294 G + N + G ++ G N + + Sbjct: 615 LMG-----------FRAAKRDKCNNVLNATAVDHGYMYATTGGTYYNYNTLADVLNYDKM 663 Query: 1295 PLND---WLIWHEVGHNAAETPLTVPGATEVANNVLALYMQDRYLGKMNRVADDITVAPE 1351 + W HE GHN + G TE++ N+ + + G++ ++ Sbjct: 664 KWGNGTLWGPAHEFGHNHQQL-FNTAGMTEISVNMYSNMVMFT-SGRVTSRSEHCNYTDV 721 Query: 1352 YLEESNNQAWARGGAGDRLLMYAQLKEWAEKNFDIKKWYPDGTPLPEFYSEREGMKGWNL 1411 +E + +A K W F+ W +Y L Sbjct: 722 DGQEHRGVCESA--VSTYADRFANKKMW----FEYGTW----GTTQMYYK---------L 762 Query: 1412 FQLMHRKARGDEVSND--------KFGGKNYCAESNGNAADTLMLCASWVAQTDLSEFFK 1463 + + H D+ + + G+ N A DLS FF+ Sbjct: 763 YMMFHSTGLDDQFWHKCLDYLRTHRLEGQGTANCQGQNDYLLFAKACCVAANQDLSSFFE 822 Query: 1464 KW 1465 W Sbjct: 823 AW 824 >UniRef50_Q8EVX9 Putative uncharacterized protein MYPE4300 n=1 Tax=Mycoplasma penetrans RepID=Q8EVX9_MYCPE Length = 413 Score = 65.6 bits (158), Expect = 2e-08, Method: Composition-based stats. Identities = 34/182 (18%), Positives = 67/182 (36%), Gaps = 26/182 (14%) Query: 1050 DVEKYPGAVSEEGQNVTETISLYSNPTKWFAGNMQSTGLWAPAQKEVTIKSNANVPVTV- 1108 + +Y + ++ V + ++ + TGL+AP + + IK V Sbjct: 225 NTSEYYHKLPDDALAVEKNFNIDLTQPGYVV-----TGLYAPPGEVINIKIPGLTDEEVK 279 Query: 1109 ----TVALADDLT-----GREKHEVALNRPPRVTKTYSLDASGTVKFKVPYGGLIYIKGN 1159 T+ + D+ + A R P + + S+ S K+ P+GG++ + N Sbjct: 280 ALNLTLTIGDNENPAPGYNASNYSRASKRLPVMKTSISITTSE-FKYGSPFGGIVNVVVN 338 Query: 1160 SSTN----ESASFTFTGVVKAPFYKDG-----AWKN-DLNSPAPLGELESDAFVYTTPKK 1209 ++ G V+A Y G WK + AP+ ++ +D + P Sbjct: 339 KEPGLNALKNVGMVINGAVEALHYIHGYTTEAEWKRLTKEAKAPIFDIAADHIKFAGPVH 398 Query: 1210 NL 1211 L Sbjct: 399 TL 400 >UniRef50_UPI000180C854 PREDICTED: similar to transmembrane agrin n=1 Tax=Ciona intestinalis RepID=UPI000180C854 Length = 2114 Score = 63.7 bits (153), Expect = 6e-08, Method: Composition-based stats. Identities = 32/98 (32%), Positives = 43/98 (43%), Gaps = 22/98 (22%) Query: 31 SSSDTPPVDSGTGSLPEVKPDPTPN----------------------PEPTPEPTPDPEP 68 ++ + + T +P V+P+PTPN PEPTP+ P+PEP Sbjct: 1108 ETTRMVQITTTTHPVPVVEPEPTPNAKPEPEPTSKPEPEPEPTPNAKPEPTPKSEPEPEP 1167 Query: 69 TPEPIPDPEPTPEPEPEPVPTKTGYLTLGGSQRVTGAT 106 T +P P+PEPT PEPEP P T T Sbjct: 1168 TSKPEPEPEPTSNPEPEPTPNAKPEPTSNPEPEPERTT 1205 >UniRef50_Q183N3 Putative exported protein n=9 Tax=Clostridium difficile RepID=Q183N3_CLOD6 Length = 1987 Score = 63.3 bits (152), Expect = 8e-08, Method: Composition-based stats. Identities = 41/189 (21%), Positives = 66/189 (34%), Gaps = 26/189 (13%) Query: 1298 DWLIWHEVGHNAAETPLTVPGATEVANNVLALYMQDRYLGKMNRVADDITVAPEYLEESN 1357 W I HE+GH +T E +NNVLAL Q + K + + T+ Y ++ Sbjct: 1293 GWGIGHEIGHVTDIGKMTYS---ETSNNVLALLAQ-TFDDKTHSRLEGSTMDKIYEHVTS 1348 Query: 1358 NQAWARGGAGDRLLMYAQLKEWAEKNF--DIKKWYPDGTPLPEFYSEREGMKGWNLFQLM 1415 N +RL M QL + +F + K D + + + Sbjct: 1349 NSLGIPSNVFERLGMLWQLHLAYDDDFTGSMLKNNSDADLSND-----------TFYAKI 1397 Query: 1416 HRKARGDEVSNDKFGGKNYCAESNGNAADTLMLCASWVAQTDLSEFFKKWNPGANAYQLP 1475 RK R S+ ++ L+ AS V + DL ++FK W Sbjct: 1398 SRKYRALSSSD---------PINSLPKDQMLVAMASSVVEKDLRDYFKAWGVEITPELNS 1448 Query: 1476 GASEMSFEG 1484 ++E Sbjct: 1449 IMDSKNYEK 1457 >UniRef50_B4FNV9 Early nodulin 75 protein n=2 Tax=Zea mays RepID=B4FNV9_MAIZE Length = 279 Score = 58.3 bits (139), Expect = 2e-06, Method: Composition-based stats. Identities = 26/49 (53%), Positives = 32/49 (65%) Query: 39 DSGTGSLPEVKPDPTPNPEPTPEPTPDPEPTPEPIPDPEPTPEPEPEPV 87 D G P+ KP+P P P+P PEP P P P PEP P P+P P+PEP+P Sbjct: 146 DPKPGPQPDPKPEPKPTPQPGPEPKPKPAPQPEPKPTPQPDPKPEPKPT 194 Score = 57.5 bits (137), Expect = 4e-06, Method: Composition-based stats. Identities = 25/48 (52%), Positives = 35/48 (72%) Query: 44 SLPEVKPDPTPNPEPTPEPTPDPEPTPEPIPDPEPTPEPEPEPVPTKT 91 P +PDP P P+PTP+P P+P+P P P P+P+PTP+P+P+P P T Sbjct: 147 PKPGPQPDPKPEPKPTPQPGPEPKPKPAPQPEPKPTPQPDPKPEPKPT 194 Score = 56.0 bits (133), Expect = 1e-05, Method: Composition-based stats. Identities = 22/45 (48%), Positives = 29/45 (64%) Query: 49 KPDPTPNPEPTPEPTPDPEPTPEPIPDPEPTPEPEPEPVPTKTGY 93 KP P P+P+P P+PTP P P P+P P P+P P+P P+P P Sbjct: 148 KPGPQPDPKPEPKPTPQPGPEPKPKPAPQPEPKPTPQPDPKPEPK 192 Score = 54.8 bits (130), Expect = 3e-05, Method: Composition-based stats. Identities = 24/47 (51%), Positives = 28/47 (59%) Query: 46 PEVKPDPTPNPEPTPEPTPDPEPTPEPIPDPEPTPEPEPEPVPTKTG 92 P+ P P P PEP P P P PEP P+P P PEP P P+P+P P Sbjct: 147 PKPGPQPDPKPEPKPTPQPGPEPKPKPAPQPEPKPTPQPDPKPEPKP 193 Score = 52.5 bits (124), Expect = 1e-04, Method: Composition-based stats. Identities = 27/107 (25%), Positives = 37/107 (34%), Gaps = 32/107 (29%) Query: 27 GGSGSSSDTPPVDSGTGSLPEVKPDPTPNPEP---------------------------- 58 P D P++KP+P P P+P Sbjct: 82 DTKSDPKPAPQSDPKPAPQPDLKPEPKPTPQPDPKPSPQPDPEPKPKPAPQPEPKPEPKP 141 Query: 59 ----TPEPTPDPEPTPEPIPDPEPTPEPEPEPVPTKTGYLTLGGSQR 101 P+P P P+P PEP P P+P PEP+P+P P T + Sbjct: 142 TPQPDPKPGPQPDPKPEPKPTPQPGPEPKPKPAPQPEPKPTPQPDPK 188 Score = 48.3 bits (113), Expect = 0.003, Method: Composition-based stats. Identities = 27/101 (26%), Positives = 36/101 (35%), Gaps = 30/101 (29%) Query: 30 GSSSDTPPVDSGTGSLPEVKPDPTPN------------------------------PEPT 59 P D P +PDP P+ P P Sbjct: 93 SDPKPAPQPDLKPEPKPTPQPDPKPSPQPDPEPKPKPAPQPEPKPEPKPTPQPDPKPGPQ 152 Query: 60 PEPTPDPEPTPEPIPDPEPTPEPEPEPVPTKTGYLTLGGSQ 100 P+P P+P+PTP+P P+P+P P P+PEP PT Sbjct: 153 PDPKPEPKPTPQPGPEPKPKPAPQPEPKPTPQPDPKPEPKP 193 Score = 44.0 bits (102), Expect = 0.048, Method: Composition-based stats. Identities = 17/49 (34%), Positives = 22/49 (44%) Query: 27 GGSGSSSDTPPVDSGTGSLPEVKPDPTPNPEPTPEPTPDPEPTPEPIPD 75 P + P +P P P P+P P+PTP P+P PEP P Sbjct: 146 DPKPGPQPDPKPEPKPTPQPGPEPKPKPAPQPEPKPTPQPDPKPEPKPT 194 >UniRef50_A0YJH0 Polymorphic membrane protein n=1 Tax=Lyngbya sp. PCC 8106 RepID=A0YJH0_9CYAN Length = 2103 Score = 50.6 bits (119), Expect = 5e-04, Method: Composition-based stats. Identities = 21/48 (43%), Positives = 24/48 (50%) Query: 44 SLPEVKPDPTPNPEPTPEPTPDPEPTPEPIPDPEPTPEPEPEPVPTKT 91 P V+P PT P P P P P PT E IP+P P P EP P + Sbjct: 1548 PEPSVEPTPTIEPTPEPSIEPTPTPTIELIPEPSIEPTPTIEPTPEPS 1595 Score = 49.1 bits (115), Expect = 0.001, Method: Composition-based stats. Identities = 30/89 (33%), Positives = 35/89 (39%), Gaps = 18/89 (20%) Query: 29 SGSSSDTPPVDSGTGSLPEVKPDPTPNPEPTPEPTPDPEPTPEPIPDP------------ 76 S P P ++P PTP E PEP+ +P PT EP P+P Sbjct: 1547 IPEPSVEPTPTIEPTPEPSIEPTPTPTIELIPEPSIEPTPTIEPTPEPSIEPTPTPTFEP 1606 Query: 77 ------EPTPEPEPEPVPTKTGYLTLGGS 99 E TPEP EP PT T L S Sbjct: 1607 TPTPTIELTPEPSIEPTPTPTIELIPEPS 1635 Score = 43.3 bits (100), Expect = 0.073, Method: Composition-based stats. Identities = 28/88 (31%), Positives = 32/88 (36%), Gaps = 20/88 (22%) Query: 27 GGSGSSSDTPPVDSGTGSLPEVKPDPTPNPEPTPEP--------------------TPDP 66 + S P +PE +PTP EPTPEP TP+P Sbjct: 1559 EPTPEPSIEPTPTPTIELIPEPSIEPTPTIEPTPEPSIEPTPTPTFEPTPTPTIELTPEP 1618 Query: 67 EPTPEPIPDPEPTPEPEPEPVPTKTGYL 94 P P P E PEP E PT T L Sbjct: 1619 SIEPTPTPTIELIPEPSIELTPTPTFEL 1646 >UniRef50_A5ZN83 Putative uncharacterized protein n=1 Tax=Ruminococcus obeum ATCC 29174 RepID=A5ZN83_9FIRM Length = 607 Score = 49.4 bits (116), Expect = 0.001, Method: Composition-based stats. Identities = 28/81 (34%), Positives = 33/81 (40%), Gaps = 1/81 (1%) Query: 50 PDPTPNPEPTPEPTPDPEPTPEPIPDPEPTPEPEPEPVPTKTGYL-TLGGSQRVTGATCN 108 P+PT PEPT P P P P P+P TPEP P PT GS TG N Sbjct: 475 PEPTAAPEPTAAPEPTAAPEPTATPEPTATPEPTATPEPTTAPGAGYKDGSYTGTGEGFN 534 Query: 109 GESSDGFTFKPGEDVTCVAGN 129 G+ + G V+ Sbjct: 535 GQVTVTINVSGGNIVSAGYDG 555 >UniRef50_Q7U5X7 Putative uncharacterized protein n=1 Tax=Synechococcus sp. WH 8102 RepID=Q7U5X7_SYNPX Length = 1154 Score = 47.9 bits (112), Expect = 0.003, Method: Composition-based stats. Identities = 42/213 (19%), Positives = 67/213 (31%), Gaps = 11/213 (5%) Query: 46 PEVKPDPTP----NPEPTPEPTPDPEPTPEPIPDPEPTPEPEPEPVPTKTGYLTLGGSQR 101 P P+ TP P P P P PTPE P P TP PE PVP+ T + Sbjct: 519 PTPTPESTPIPSATPTPESTPIPSATPTPESTPIPSATPTPESAPVPSATPTPESTPAPS 578 Query: 102 VT-GATCNGESSDGFTFKPGEDVTCVAGNTTIATFNTQSEAARSLRAVEKVSFSLEDAQE 160 T + + E D F V G+ + + ++ K S D + Sbjct: 579 ATPDPSIDQELIDDFANNSSTSGVVVIGDVLYGNLESIYDDDWFKVSLTKGSVYRFDLEG 638 Query: 161 LAGSDDKKS----NAVSLVTSSNSCPANTEQVCLTFSSVIESKRFDSLYKQIDLAPEEFK 216 + +D + + N L + + T +S + + + Sbjct: 639 IQLNDPRMTLYGSNLKELTYDDDGGSGYDSLIEFTATSSDNYFISAKSWGETGTYTLKAT 698 Query: 217 KLVNEEVENNAATDKAPSTHTSPVVPVTTPGTK 249 + A + P +P P +TP Sbjct: 699 D-ITPAPSATPAPEPTPVPSATP-TPESTPAPS 729 >UniRef50_C2M7W3 Putative uncharacterized protein n=1 Tax=Capnocytophaga gingivalis ATCC 33624 RepID=C2M7W3_CAPGI Length = 1067 Score = 47.9 bits (112), Expect = 0.003, Method: Composition-based stats. Identities = 19/60 (31%), Positives = 26/60 (43%) Query: 62 PTPDPEPTPEPIPDPEPTPEPEPEPVPTKTGYLTLGGSQRVTGATCNGESSDGFTFKPGE 121 P PDP+P PEP P P+P P P P+P PT + S + F + + Sbjct: 935 PKPDPKPAPEPKPVPKPDPIPAPKPEPTPDSIEETREIAIYNAVSTQDNSQNYFKVEGYD 994 >UniRef50_D2NSM0 Putative uncharacterized protein n=1 Tax=Rothia mucilaginosa DY-18 RepID=D2NSM0_9MICC Length = 586 Score = 47.1 bits (110), Expect = 0.006, Method: Composition-based stats. Identities = 23/65 (35%), Positives = 27/65 (41%) Query: 27 GGSGSSSDTPPVDSGTGSLPEVKPDPTPNPEPTPEPTPDPEPTPEPIPDPEPTPEPEPEP 86 + + TP V + P V P TP EPT PT +P P P PTPEP P Sbjct: 159 DATAEPTATPTVAPSASAEPSVAPTATPTAEPTVAPTAEPTVQPTAEPTVAPTPEPTVAP 218 Query: 87 VPTKT 91 T Sbjct: 219 TVEPT 223 >UniRef50_UPI0001695557 hypothetical protein Plarl_10627 n=1 Tax=Paenibacillus larvae subsp. larvae BRL-230010 RepID=UPI0001695557 Length = 444 Score = 46.7 bits (109), Expect = 0.007, Method: Composition-based stats. Identities = 19/46 (41%), Positives = 22/46 (47%) Query: 37 PVDSGTGSLPEVKPDPTPNPEPTPEPTPDPEPTPEPIPDPEPTPEP 82 PV P P+PTP P P P P+PTP P+P P P P Sbjct: 158 PVQVKAEPAPVPTPEPTPAVTPEPATVPTPKPTPAATPEPAPVPTP 203 >UniRef50_C0WIN3 Putative uncharacterized protein n=1 Tax=Corynebacterium accolens ATCC 49725 RepID=C0WIN3_9CORY Length = 323 Score = 46.4 bits (108), Expect = 0.008, Method: Composition-based stats. Identities = 29/91 (31%), Positives = 40/91 (43%), Gaps = 2/91 (2%) Query: 22 AGCDGGGSGSSSDTPPVDSGTGSLPEVKPDPTPNPEPTPEPTPDPEPTPEPIPDPEPTPE 81 +GC + PP + + + P DP P PEP P+ P PE +P P PE P Sbjct: 57 SGCQCQQCRGFVNEPPRQNESTTTPA-HADPNPEPEPKPQDCP-PEQQEQPAPQPEQDPV 114 Query: 82 PEPEPVPTKTGYLTLGGSQRVTGATCNGESS 112 PEP P ++ + A C+ ESS Sbjct: 115 PEPVPEDSECEKEPTTPAAVEDDAACDIESS 145 >UniRef50_B0BZP0 Putative uncharacterized protein n=2 Tax=Bacteria RepID=B0BZP0_ACAM1 Length = 988 Score = 44.8 bits (104), Expect = 0.027, Method: Composition-based stats. Identities = 21/73 (28%), Positives = 26/73 (35%), Gaps = 8/73 (10%) Query: 37 PVDSGTGSLPEVKPDPTPNPEPTPEPT--------PDPEPTPEPIPDPEPTPEPEPEPVP 88 P P PD TP P+P+ +P P P P P P P P PE P P Sbjct: 861 PTIPQPTPGPAPSPDVTPVPQPSTDPPSGSTSQPFPFPVPLPSPSPQPGKVPESLPSPGE 920 Query: 89 TKTGYLTLGGSQR 101 L + + Sbjct: 921 VPESGLPQTPAPK 933 Searching..................................................done Results from round 3 Score E Sequences producing significant alignments: (bits) Value Sequences used in model and found again: UniRef50_Q46837 Putative lipoprotein acfD homolog n=40 Tax=Gamma... 1986 0.0 UniRef50_A6Y2J7 Large exoproteins involved in heme utilization o... 1737 0.0 UniRef50_A5F372 Accessory colonization factor AcfD n=25 Tax=Vibr... 1697 0.0 UniRef50_D0YX39 Accessory colonization factor AcfD n=1 Tax=Photo... 1586 0.0 UniRef50_A6AXB7 AcfD n=3 Tax=Gammaproteobacteria RepID=A6AXB7_VIBPA 1585 0.0 UniRef50_A6ALV4 AcfD n=5 Tax=Vibrio RepID=A6ALV4_VIBHA 1551 0.0 UniRef50_B5FBF1 Inner membrane lipoprotein n=1 Tax=Vibrio fische... 1524 0.0 UniRef50_A8H5W9 Inner membrane lipoprotein n=1 Tax=Shewanella pe... 1514 0.0 UniRef50_Q5E705 Accessory colonization factor AcfD-like protein,... 1292 0.0 UniRef50_UPI000178AA33 hypothetical protein GYMC10_4678 n=1 Tax=... 363 4e-98 UniRef50_UPI00017445F8 hypothetical protein VspiD_04825 n=1 Tax=... 355 7e-96 UniRef50_B4DBQ1 Putative uncharacterized protein n=1 Tax=Chthoni... 354 2e-95 UniRef50_UPI0001B9ED55 hypothetical protein GYMC10_4682 n=1 Tax=... 339 4e-91 UniRef50_C1ZFD9 Putative uncharacterized protein n=1 Tax=Plancto... 316 5e-84 UniRef50_B2ULE8 Putative uncharacterized protein n=1 Tax=Akkerma... 299 6e-79 UniRef50_A4IG42 Protein FAM115 n=14 Tax=Clupeocephala RepID=F115... 294 1e-77 UniRef50_Q9Y4C2 Protein FAM115A n=54 Tax=Amniota RepID=F115A_HUMAN 291 2e-76 UniRef50_A4GHK6 Putative uncharacterized protein n=1 Tax=uncultu... 284 1e-74 UniRef50_C7PHT0 Putative uncharacterized protein n=1 Tax=Chitino... 274 2e-71 UniRef50_Q5XHI4 Protein FAM115 n=4 Tax=Anura RepID=F115_XENLA 272 5e-71 UniRef50_B4DK02 cDNA FLJ57809 n=1 Tax=Homo sapiens RepID=B4DK02_... 271 1e-70 UniRef50_C3BVL2 S-layer domain protein n=1 Tax=Bacillus pseudomy... 267 4e-69 UniRef50_C2U768 S-layer domain protein n=4 Tax=Bacillus cereus R... 259 5e-67 UniRef50_A7GU48 S-layer domain protein n=5 Tax=Bacillus cereus g... 259 9e-67 UniRef50_B7INX6 Wall-associated protein n=62 Tax=Bacillus cereus... 257 2e-66 UniRef50_A9VG09 S-layer domain protein n=29 Tax=Bacillus cereus ... 254 2e-65 UniRef50_C2G0M2 Putative uncharacterized protein n=2 Tax=Sphingo... 252 1e-64 UniRef50_D2VFD2 Predicted protein n=1 Tax=Naegleria gruberi RepI... 252 1e-64 UniRef50_C3XPE8 Putative uncharacterized protein n=1 Tax=Branchi... 249 5e-64 UniRef50_UPI0001692BE8 S-layer domain protein n=1 Tax=Paenibacil... 249 6e-64 UniRef50_UPI000155BFC0 PREDICTED: similar to FLJ00264 protein n=... 249 9e-64 UniRef50_C2G4C5 Putative uncharacterized protein n=2 Tax=Sphingo... 245 1e-62 UniRef50_B7HIS0 S-layer domain protein n=19 Tax=Bacillus RepID=B... 241 2e-61 UniRef50_D1PPF9 Putative fibronectin type III domain protein n=1... 240 3e-61 UniRef50_A5ZEQ5 Putative uncharacterized protein n=1 Tax=Bactero... 233 5e-59 UniRef50_B6A9M3 Putative uncharacterized protein n=1 Tax=Cryptos... 231 1e-58 UniRef50_C5LZE4 Putative uncharacterized protein n=1 Tax=Perkins... 230 4e-58 UniRef50_Q5LB88 Putative lipoprotein n=9 Tax=Bacteroides RepID=Q... 230 5e-58 UniRef50_C7BLH9 Putative uncharacterized protein n=1 Tax=Photorh... 225 1e-56 UniRef50_Q87WH6 Putative uncharacterized protein n=3 Tax=Pseudom... 225 1e-56 UniRef50_B1KGL9 Putative uncharacterized protein n=4 Tax=Shewane... 220 2e-55 UniRef50_C2G2A5 Putative uncharacterized protein n=2 Tax=Sphingo... 218 1e-54 UniRef50_UPI0001B7B86F Similar to experimental autoimmune prosta... 218 1e-54 UniRef50_A5ZF31 Putative uncharacterized protein n=1 Tax=Bactero... 217 2e-54 UniRef50_A1JU21 Putative exported protein n=4 Tax=Yersinia RepID... 213 3e-53 UniRef50_C2C0C9 Possible wall-associated protein n=1 Tax=Listeri... 213 6e-53 UniRef50_Q2U0W8 Predicted protein n=2 Tax=Aspergillus RepID=Q2U0... 212 8e-53 UniRef50_A5ZER3 Putative uncharacterized protein n=1 Tax=Bactero... 209 6e-52 UniRef50_B8B8E6 Putative uncharacterized protein n=5 Tax=cellula... 208 1e-51 UniRef50_C6IEC9 Coagulation factor 5/8 type n=2 Tax=Bacteroides ... 208 1e-51 UniRef50_Q4ZNJ4 Putative uncharacterized protein n=1 Tax=Pseudom... 208 2e-51 UniRef50_C5PJT0 Lipoprotein n=2 Tax=Sphingobacterium spiritivoru... 205 2e-50 UniRef50_UPI0001BC7CE9 hypothetical protein BacD2_03774 n=1 Tax=... 204 2e-50 UniRef50_A9VC57 Predicted protein n=1 Tax=Monosiga brevicollis R... 204 2e-50 UniRef50_B2UPI7 Putative uncharacterized protein n=3 Tax=Bacteri... 202 8e-50 UniRef50_A3FQI6 Putative uncharacterized protein n=3 Tax=Cryptos... 202 8e-50 UniRef50_A5ZED4 Putative uncharacterized protein n=1 Tax=Bactero... 201 2e-49 UniRef50_C7MAR4 Putative uncharacterized protein n=1 Tax=Brachyb... 200 5e-49 UniRef50_UPI000197B0FB hypothetical protein BACCOPRO_00998 n=1 T... 199 8e-49 UniRef50_A5ZFW6 Putative uncharacterized protein n=1 Tax=Bactero... 196 6e-48 UniRef50_A2F4J7 Immuno-dominant variable surface antigen-like n=... 193 4e-47 UniRef50_C2G2G0 Possible wall-associated protein n=2 Tax=Sphingo... 192 7e-47 UniRef50_A2EC39 Putative uncharacterized protein n=1 Tax=Trichom... 191 1e-46 UniRef50_B2UP41 Putative lipoprotein n=1 Tax=Akkermansia mucinip... 191 2e-46 UniRef50_A2DY87 Putative uncharacterized protein n=1 Tax=Trichom... 190 4e-46 UniRef50_A5ZKM8 Putative uncharacterized protein n=3 Tax=Bactero... 188 1e-45 UniRef50_A8R9D2 Putative uncharacterized protein n=1 Tax=Eubacte... 184 3e-44 UniRef50_B2UQK5 Putative uncharacterized protein n=1 Tax=Akkerma... 183 3e-44 UniRef50_Q2SHR7 Putative uncharacterized protein n=2 Tax=Gammapr... 183 5e-44 UniRef50_C2FV33 Possible wall-associated protein n=2 Tax=Sphingo... 181 2e-43 UniRef50_A2DDW5 Immuno-dominant variable surface antigen-like n=... 180 3e-43 UniRef50_A2DZU5 Putative uncharacterized protein n=1 Tax=Trichom... 180 4e-43 UniRef50_D2VUE2 Predicted protein n=1 Tax=Naegleria gruberi RepI... 179 1e-42 UniRef50_C9YCF1 Putative uncharacterized protein n=1 Tax=Curviba... 176 4e-42 UniRef50_Q7MJ28 Putative uncharacterized protein VV2335 n=13 Tax... 175 1e-41 UniRef50_B0N0I0 Putative uncharacterized protein n=3 Tax=Bacteri... 175 1e-41 UniRef50_B7V4F8 Putative uncharacterized protein n=7 Tax=Pseudom... 175 2e-41 UniRef50_A2F153 Immuno-dominant variable surface antigen-like n=... 174 2e-41 UniRef50_A2F335 Immuno-dominant variable surface antigen-like n=... 172 1e-40 UniRef50_B2V178 Fibronectin type III domain protein n=2 Tax=Clos... 171 2e-40 UniRef50_A5ZG25 Putative uncharacterized protein n=1 Tax=Bactero... 171 2e-40 UniRef50_A2EKB8 Putative uncharacterized protein n=1 Tax=Trichom... 170 3e-40 UniRef50_A2FU45 Immuno-dominant variable surface antigen-like n=... 169 8e-40 UniRef50_C1I981 Leucine rich repeat domain-containing protein n=... 167 3e-39 UniRef50_B0A9L1 Putative uncharacterized protein n=1 Tax=Clostri... 165 1e-38 UniRef50_C8WI41 Coagulation factor 5/8 type domain protein n=1 T... 165 2e-38 UniRef50_C9KXI1 Putative uncharacterized protein n=1 Tax=Bactero... 164 2e-38 UniRef50_UPI0001C36412 coagulation factor 5/8 type domain protei... 164 2e-38 UniRef50_B0A9L0 Putative uncharacterized protein n=1 Tax=Clostri... 163 4e-38 UniRef50_C5VKR9 Putative uncharacterized protein n=1 Tax=Prevote... 158 2e-36 UniRef50_Q8EW84 Predicted integral membrane protein n=1 Tax=Myco... 154 3e-35 UniRef50_A2EPB9 Putative uncharacterized protein n=1 Tax=Trichom... 151 1e-34 UniRef50_A2E8P4 Putative uncharacterized protein n=1 Tax=Trichom... 150 4e-34 UniRef50_B1V640 Putative antigenic protein NP1 n=1 Tax=Clostridi... 148 1e-33 UniRef50_Q183N3 Putative exported protein n=9 Tax=Clostridium di... 145 1e-32 UniRef50_A2GCT2 Putative uncharacterized protein n=1 Tax=Trichom... 145 2e-32 UniRef50_A2FC48 Putative uncharacterized protein n=2 Tax=Trichom... 141 3e-31 UniRef50_C8QBU1 Putative uncharacterized protein n=1 Tax=Pantoea... 135 1e-29 UniRef50_A9MNV3 Putative uncharacterized protein n=1 Tax=Salmone... 133 6e-29 UniRef50_C3XY93 Putative uncharacterized protein n=1 Tax=Branchi... 130 4e-28 UniRef50_B6FXD4 Putative uncharacterized protein n=1 Tax=Clostri... 129 9e-28 UniRef50_A2DW23 Putative uncharacterized protein n=1 Tax=Trichom... 125 2e-26 UniRef50_A2DKX5 Immuno-dominant variable surface antigen-like n=... 120 4e-25 UniRef50_B9GV44 Predicted protein n=3 Tax=Populus trichocarpa Re... 91 3e-16 UniRef50_A2EK19 Putative uncharacterized protein n=1 Tax=Trichom... 88 2e-15 UniRef50_D1BQB2 Putative uncharacterized protein n=1 Tax=Veillon... 88 4e-15 UniRef50_Q8EVX9 Putative uncharacterized protein MYPE4300 n=1 Ta... 85 2e-14 UniRef50_A5ZN83 Putative uncharacterized protein n=1 Tax=Ruminoc... 80 9e-13 UniRef50_UPI000180C854 PREDICTED: similar to transmembrane agrin... 78 2e-12 UniRef50_B4FNV9 Early nodulin 75 protein n=2 Tax=Zea mays RepID=... 71 3e-10 UniRef50_A0YJH0 Polymorphic membrane protein n=1 Tax=Lyngbya sp.... 68 4e-09 Sequences not found previously or not previously below threshold: UniRef50_A6KYY0 Putative uncharacterized protein n=5 Tax=Bactero... 159 7e-37 UniRef50_Q4ZLQ6 Putative uncharacterized protein n=3 Tax=Pseudom... 154 3e-35 UniRef50_A5ZL13 Putative uncharacterized protein n=1 Tax=Bactero... 149 9e-34 UniRef50_A5ZBD5 Putative uncharacterized protein n=1 Tax=Bactero... 133 5e-29 UniRef50_A5ZER4 Putative uncharacterized protein n=1 Tax=Bactero... 132 1e-28 UniRef50_Q0TR08 F5/8 type C domain protein n=8 Tax=Clostridium p... 119 1e-24 UniRef50_C6IL86 Putative uncharacterized protein n=2 Tax=Bactero... 115 1e-23 UniRef50_A9L5A4 Putative uncharacterized protein n=12 Tax=Gammap... 113 4e-23 UniRef50_Q0TS71 Discoidin domain protein n=8 Tax=Clostridium per... 112 1e-22 UniRef50_A2EZ91 Putative uncharacterized protein n=1 Tax=Trichom... 99 2e-18 UniRef50_C4IBV1 Fibronectin type III domain protein n=2 Tax=Clos... 98 2e-18 UniRef50_Q638I8 Enhancin family protein n=51 Tax=Bacillales RepI... 98 3e-18 UniRef50_A2G8C7 Immuno-dominant variable surface antigen-like n=... 93 9e-17 UniRef50_A2DIS8 Putative uncharacterized protein n=1 Tax=Trichom... 83 1e-13 UniRef50_B5FJ10 Viral enhancin protein n=5 Tax=Salmonella enteri... 81 3e-13 UniRef50_B7WZG6 Putative uncharacterized protein n=1 Tax=Comamon... 81 4e-13 UniRef50_C7XZ78 Enhancin family protein n=3 Tax=Lactobacillus je... 80 5e-13 UniRef50_B8MWN6 Viral-enhancing factor, putative n=3 Tax=Eurotio... 80 6e-13 UniRef50_UPI000169559C enhancin family protein n=3 Tax=Paenibaci... 77 7e-12 UniRef50_A9A5M1 Putative uncharacterized protein n=1 Tax=Nitroso... 76 9e-12 UniRef50_P20301 Antigenic protein NP1 (Fragment) n=6 Tax=Entamoe... 76 1e-11 UniRef50_A2DBM1 F5/8 type C domain containing protein n=3 Tax=Tr... 76 1e-11 UniRef50_B1IFS9 Enhancing factor n=5 Tax=Clostridium botulinum R... 75 2e-11 UniRef50_B1V638 LRR adjacent family n=3 Tax=Clostridium perfring... 73 7e-11 UniRef50_Q8EVX8 Putative integral membrane protein n=1 Tax=Mycop... 73 1e-10 UniRef50_A9B8D9 Rhs element Vgr protein n=1 Tax=Herpetosiphon au... 67 7e-09 UniRef50_A2D8B9 Megakaryocyte stimulating factor, putative n=1 T... 65 2e-08 UniRef50_Q7U5X7 Putative uncharacterized protein n=1 Tax=Synecho... 64 3e-08 UniRef50_B7P4H6 Putative uncharacterized protein n=1 Tax=Ixodes ... 63 9e-08 UniRef50_D2NSM0 Putative uncharacterized protein n=1 Tax=Rothia ... 56 1e-05 UniRef50_B0BZP0 Putative uncharacterized protein n=2 Tax=Bacteri... 52 2e-04 UniRef50_Q82F59 Putative uncharacterized protein n=8 Tax=Strepto... 49 0.001 UniRef50_UPI0001695557 hypothetical protein Plarl_10627 n=1 Tax=... 49 0.002 UniRef50_A7B9W7 Putative uncharacterized protein n=1 Tax=Actinom... 48 0.002 UniRef50_D1C606 Peptidase M23 n=1 Tax=Sphaerobacter thermophilus... 47 0.005 UniRef50_C2M7W3 Putative uncharacterized protein n=1 Tax=Capnocy... 47 0.007 UniRef50_C0WIN3 Putative uncharacterized protein n=1 Tax=Coryneb... 44 0.047 UniRef50_B0C0T3 Putative uncharacterized protein n=1 Tax=Acaryoc... 44 0.063 UniRef50_A3DJW7 Fibronectin, type III n=2 Tax=Clostridium thermo... 43 0.082 >UniRef50_Q46837 Putative lipoprotein acfD homolog n=40 Tax=Gammaproteobacteria RepID=ACFD_ECOLI Length = 1520 Score = 1986 bits (5144), Expect = 0.0, Method: Composition-based stats. Identities = 1520/1520 (100%), Positives = 1520/1520 (100%) Query: 1 MNKKFKYKKSLLAAILSATLLAGCDGGGSGSSSDTPPVDSGTGSLPEVKPDPTPNPEPTP 60 MNKKFKYKKSLLAAILSATLLAGCDGGGSGSSSDTPPVDSGTGSLPEVKPDPTPNPEPTP Sbjct: 1 MNKKFKYKKSLLAAILSATLLAGCDGGGSGSSSDTPPVDSGTGSLPEVKPDPTPNPEPTP 60 Query: 61 EPTPDPEPTPEPIPDPEPTPEPEPEPVPTKTGYLTLGGSQRVTGATCNGESSDGFTFKPG 120 EPTPDPEPTPEPIPDPEPTPEPEPEPVPTKTGYLTLGGSQRVTGATCNGESSDGFTFKPG Sbjct: 61 EPTPDPEPTPEPIPDPEPTPEPEPEPVPTKTGYLTLGGSQRVTGATCNGESSDGFTFKPG 120 Query: 121 EDVTCVAGNTTIATFNTQSEAARSLRAVEKVSFSLEDAQELAGSDDKKSNAVSLVTSSNS 180 EDVTCVAGNTTIATFNTQSEAARSLRAVEKVSFSLEDAQELAGSDDKKSNAVSLVTSSNS Sbjct: 121 EDVTCVAGNTTIATFNTQSEAARSLRAVEKVSFSLEDAQELAGSDDKKSNAVSLVTSSNS 180 Query: 181 CPANTEQVCLTFSSVIESKRFDSLYKQIDLAPEEFKKLVNEEVENNAATDKAPSTHTSPV 240 CPANTEQVCLTFSSVIESKRFDSLYKQIDLAPEEFKKLVNEEVENNAATDKAPSTHTSPV Sbjct: 181 CPANTEQVCLTFSSVIESKRFDSLYKQIDLAPEEFKKLVNEEVENNAATDKAPSTHTSPV 240 Query: 241 VPVTTPGTKPDLNASFVSANAEQFYQYQPTEIILSEGRLVDSQGYGVAGVNYYTNSGRGV 300 VPVTTPGTKPDLNASFVSANAEQFYQYQPTEIILSEGRLVDSQGYGVAGVNYYTNSGRGV Sbjct: 241 VPVTTPGTKPDLNASFVSANAEQFYQYQPTEIILSEGRLVDSQGYGVAGVNYYTNSGRGV 300 Query: 301 TGENGEFSFSWGETISFGIDTFELGSVRGNKSTIALTELGDEVRGANIDQLIHRYSTTGQ 360 TGENGEFSFSWGETISFGIDTFELGSVRGNKSTIALTELGDEVRGANIDQLIHRYSTTGQ Sbjct: 301 TGENGEFSFSWGETISFGIDTFELGSVRGNKSTIALTELGDEVRGANIDQLIHRYSTTGQ 360 Query: 361 NNTRVVPDDVRKVFAEYPNVINEIINLSLSNGATLGEGEQVVNLPNEFIEQFNTGQAKEI 420 NNTRVVPDDVRKVFAEYPNVINEIINLSLSNGATLGEGEQVVNLPNEFIEQFNTGQAKEI Sbjct: 361 NNTRVVPDDVRKVFAEYPNVINEIINLSLSNGATLGEGEQVVNLPNEFIEQFNTGQAKEI 420 Query: 421 DTAICAKTDGCNEARWFSLTTRNVNDGQIQGVINKLWGVDTNYKSVSKFHVFHDSTNFYG 480 DTAICAKTDGCNEARWFSLTTRNVNDGQIQGVINKLWGVDTNYKSVSKFHVFHDSTNFYG Sbjct: 421 DTAICAKTDGCNEARWFSLTTRNVNDGQIQGVINKLWGVDTNYKSVSKFHVFHDSTNFYG 480 Query: 481 STGNARGQAVVNISNAAFPILMARNDKNYWLAFGEKRAWDKNELAYITEAPSLVEPENVT 540 STGNARGQAVVNISNAAFPILMARNDKNYWLAFGEKRAWDKNELAYITEAPSLVEPENVT Sbjct: 481 STGNARGQAVVNISNAAFPILMARNDKNYWLAFGEKRAWDKNELAYITEAPSLVEPENVT 540 Query: 541 RDTATFNLPFISLGQVGEGKLMVIGNPHYNSILRCPNGYSWNGGVNKDGQCTLNSDPDDM 600 RDTATFNLPFISLGQVGEGKLMVIGNPHYNSILRCPNGYSWNGGVNKDGQCTLNSDPDDM Sbjct: 541 RDTATFNLPFISLGQVGEGKLMVIGNPHYNSILRCPNGYSWNGGVNKDGQCTLNSDPDDM 600 Query: 601 KNFMENVLRYLSDDKWKPDAKASMTVGTNLDTVYFKRHGQVTGNSAAFDFHPDFAGISVE 660 KNFMENVLRYLSDDKWKPDAKASMTVGTNLDTVYFKRHGQVTGNSAAFDFHPDFAGISVE Sbjct: 601 KNFMENVLRYLSDDKWKPDAKASMTVGTNLDTVYFKRHGQVTGNSAAFDFHPDFAGISVE 660 Query: 661 HLSSYGDLDPQEMPLLILNGFEYVTQVGNDPYAIPLRADTSKPKLTQQDVTDLIAYLNKG 720 HLSSYGDLDPQEMPLLILNGFEYVTQVGNDPYAIPLRADTSKPKLTQQDVTDLIAYLNKG Sbjct: 661 HLSSYGDLDPQEMPLLILNGFEYVTQVGNDPYAIPLRADTSKPKLTQQDVTDLIAYLNKG 720 Query: 721 GSVLIMENVMSNLKEESASGFVRLLDAAGLSMALNKSVVNNDPQGYPNRVRQQRATGIWV 780 GSVLIMENVMSNLKEESASGFVRLLDAAGLSMALNKSVVNNDPQGYPNRVRQQRATGIWV Sbjct: 721 GSVLIMENVMSNLKEESASGFVRLLDAAGLSMALNKSVVNNDPQGYPNRVRQQRATGIWV 780 Query: 781 YERYPAVDGALPYTIDSKTGEVKWKYQVENKPDDKPKLEVASWLEDVDGKQETRYAFIDE 840 YERYPAVDGALPYTIDSKTGEVKWKYQVENKPDDKPKLEVASWLEDVDGKQETRYAFIDE Sbjct: 781 YERYPAVDGALPYTIDSKTGEVKWKYQVENKPDDKPKLEVASWLEDVDGKQETRYAFIDE 840 Query: 841 ADHKTEDSLKAAKEKIFAAFPGLKECTNPAYHYEVNCLEYRPGTGVPVTGGMYVPQYTQL 900 ADHKTEDSLKAAKEKIFAAFPGLKECTNPAYHYEVNCLEYRPGTGVPVTGGMYVPQYTQL Sbjct: 841 ADHKTEDSLKAAKEKIFAAFPGLKECTNPAYHYEVNCLEYRPGTGVPVTGGMYVPQYTQL 900 Query: 901 SLNADTAKAMVQAADLGTNIQRLYQHELYFRTNGRKGERLSSVDLERLYQNMSVWLWNDT 960 SLNADTAKAMVQAADLGTNIQRLYQHELYFRTNGRKGERLSSVDLERLYQNMSVWLWNDT Sbjct: 901 SLNADTAKAMVQAADLGTNIQRLYQHELYFRTNGRKGERLSSVDLERLYQNMSVWLWNDT 960 Query: 961 SYRYEEGKNDELGFKTFTEFLNCYANDAYAGGTKCSADLKKSLVDNNMIYGDGSSKAGMM 1020 SYRYEEGKNDELGFKTFTEFLNCYANDAYAGGTKCSADLKKSLVDNNMIYGDGSSKAGMM Sbjct: 961 SYRYEEGKNDELGFKTFTEFLNCYANDAYAGGTKCSADLKKSLVDNNMIYGDGSSKAGMM 1020 Query: 1021 NPSYPLNYMEKPLTRLMLGRSWWDLNIKVDVEKYPGAVSEEGQNVTETISLYSNPTKWFA 1080 NPSYPLNYMEKPLTRLMLGRSWWDLNIKVDVEKYPGAVSEEGQNVTETISLYSNPTKWFA Sbjct: 1021 NPSYPLNYMEKPLTRLMLGRSWWDLNIKVDVEKYPGAVSEEGQNVTETISLYSNPTKWFA 1080 Query: 1081 GNMQSTGLWAPAQKEVTIKSNANVPVTVTVALADDLTGREKHEVALNRPPRVTKTYSLDA 1140 GNMQSTGLWAPAQKEVTIKSNANVPVTVTVALADDLTGREKHEVALNRPPRVTKTYSLDA Sbjct: 1081 GNMQSTGLWAPAQKEVTIKSNANVPVTVTVALADDLTGREKHEVALNRPPRVTKTYSLDA 1140 Query: 1141 SGTVKFKVPYGGLIYIKGNSSTNESASFTFTGVVKAPFYKDGAWKNDLNSPAPLGELESD 1200 SGTVKFKVPYGGLIYIKGNSSTNESASFTFTGVVKAPFYKDGAWKNDLNSPAPLGELESD Sbjct: 1141 SGTVKFKVPYGGLIYIKGNSSTNESASFTFTGVVKAPFYKDGAWKNDLNSPAPLGELESD 1200 Query: 1201 AFVYTTPKKNLNASNYTGGLEQFANDLDTFASSMNDFYGRDSEDGKHRMFTYKNLPGHKH 1260 AFVYTTPKKNLNASNYTGGLEQFANDLDTFASSMNDFYGRDSEDGKHRMFTYKNLPGHKH Sbjct: 1201 AFVYTTPKKNLNASNYTGGLEQFANDLDTFASSMNDFYGRDSEDGKHRMFTYKNLPGHKH 1260 Query: 1261 RFTNDVQISIGDAHSGYPVMNSSFSPNSTTLPTTPLNDWLIWHEVGHNAAETPLTVPGAT 1320 RFTNDVQISIGDAHSGYPVMNSSFSPNSTTLPTTPLNDWLIWHEVGHNAAETPLTVPGAT Sbjct: 1261 RFTNDVQISIGDAHSGYPVMNSSFSPNSTTLPTTPLNDWLIWHEVGHNAAETPLTVPGAT 1320 Query: 1321 EVANNVLALYMQDRYLGKMNRVADDITVAPEYLEESNNQAWARGGAGDRLLMYAQLKEWA 1380 EVANNVLALYMQDRYLGKMNRVADDITVAPEYLEESNNQAWARGGAGDRLLMYAQLKEWA Sbjct: 1321 EVANNVLALYMQDRYLGKMNRVADDITVAPEYLEESNNQAWARGGAGDRLLMYAQLKEWA 1380 Query: 1381 EKNFDIKKWYPDGTPLPEFYSEREGMKGWNLFQLMHRKARGDEVSNDKFGGKNYCAESNG 1440 EKNFDIKKWYPDGTPLPEFYSEREGMKGWNLFQLMHRKARGDEVSNDKFGGKNYCAESNG Sbjct: 1381 EKNFDIKKWYPDGTPLPEFYSEREGMKGWNLFQLMHRKARGDEVSNDKFGGKNYCAESNG 1440 Query: 1441 NAADTLMLCASWVAQTDLSEFFKKWNPGANAYQLPGASEMSFEGGVSQSAYNTLASLDLP 1500 NAADTLMLCASWVAQTDLSEFFKKWNPGANAYQLPGASEMSFEGGVSQSAYNTLASLDLP Sbjct: 1441 NAADTLMLCASWVAQTDLSEFFKKWNPGANAYQLPGASEMSFEGGVSQSAYNTLASLDLP 1500 Query: 1501 KPEQGPETINQVTEHKMSAE 1520 KPEQGPETINQVTEHKMSAE Sbjct: 1501 KPEQGPETINQVTEHKMSAE 1520 >UniRef50_A6Y2J7 Large exoproteins involved in heme utilization or adhesion n=14 Tax=Vibrionaceae RepID=A6Y2J7_VIBCH Length = 1526 Score = 1737 bits (4499), Expect = 0.0, Method: Composition-based stats. Identities = 762/1568 (48%), Positives = 987/1568 (62%), Gaps = 108/1568 (6%) Query: 9 KSLLAAILSATLLAGCDGGGSGSSSDTPPVDSGTGSLPEVKPDPTPNPEPTPEPTPDPEP 68 K L AI+ A +AGC+ + SDT +KPD P Sbjct: 5 KIKLTAIMVALAIAGCNHDSIPNPSDT------------IKPD-------IPNIGEGDSV 45 Query: 69 TPEPIPDPEPTPEPEPEPVPTKTGYLTLGGSQRVT-GATCNGESSDGFTFKPGEDVTCVA 127 T +PD P L+L GS CN + ++ F+ + + V C Sbjct: 46 TGGELPDIGVD-------DPIIISRLSLDGSLMFGESVQCNDQPANQFSVEQKDHVVCTL 98 Query: 128 GNTTIATFNTQSEAARSLRAVEKVSFSL---EDAQELAGSDDKKSNAVSLVTSSNSCPAN 184 T+ATF++ S A L +A E S +++N +L+ + + Sbjct: 99 DGQTLATFSSPFNIPNSRMAARPSGLELLTLTNADEYKESVLRQANLQTLIKNMGNL--Q 156 Query: 185 TEQVCLTFSSVIESKRFDS-LYKQIDLAPEEFKKLVNEEVENNAATDKAPSTHTSPVVPV 243 + + F S +S F + L +D+ EEF++L+ E++ N+ DK PSTH + P Sbjct: 157 GKNIDFNFESSRDSLTFQNYLRNNLDMPAEEFRELITEKISNDNQVDKQPSTHVPDIPPA 216 Query: 244 TTPGTKPDLNASFVSANAEQFYQYQPTEIILSEGRLVDSQGYGVAGVNYYTNSGRGVTGE 303 TPG DLN+ FVSANAE+ Y+PTEIILS+G+L+DSQG V G+ Y++N RGVTG Sbjct: 217 VTPGASNDLNSGFVSANAEENLVYKPTEIILSQGQLLDSQGRPVNGIAYFSNHSRGVTGI 276 Query: 304 N--------GEFSFSWGETISFGIDTFELGSVRGNKSTIALTELGDEVRGANIDQLIHRY 355 N G F FSWG+TISF IDTFELG +RGNK+T L ELG E G N + L+ RY Sbjct: 277 NKNGQATGDGSFEFSWGDTISFAIDTFELGHIRGNKNTFKLNELGSEWAGKNAETLVLRY 336 Query: 356 STTGQNNTRVVPDDVRKVFAEYPNVINEIINLSLSNGA---TLGEGEQVVNLPNEFIEQF 412 + + + D V +VF++YPNV+NE I+LSLSN +G GE+ +P EF +QF Sbjct: 337 ANISGD-IVSLSDKVTQVFSQYPNVVNESISLSLSNEDVELDVGGGEKQ-TVPGEFHKQF 394 Query: 413 NTGQAKEIDTAICAKTDGCNEARWFSLTTRNVNDGQIQGVINKLWGVD-----TNYKSVS 467 + G A EID A+ + + + + + +I I KLWG +K V Sbjct: 395 SQGIAAEIDQALNPSRASQMWSTFATKSAVDPEASRILADIQKLWGATEEVQKQGWKKVE 454 Query: 468 KFHVFHDSTNFYGSTGNARGQAVVNISNAAFPILMARNDKNYWLAFGEKRAWDKNELAYI 527 +FHVFHDSTNFYGSTG+ARGQA VNI+N AFP+LMARND NYW+ FG+ +AWD LA+I Sbjct: 455 RFHVFHDSTNFYGSTGHARGQAAVNIANTAFPVLMARNDNNYWIDFGKPKAWDDQGLAFI 514 Query: 528 TEAPSLVEPENVTRDTATFNLPFISLGQVGEGKLMVIGNPHYNSILRCPNGYSWNGGVNK 587 TEAPSLV+PE V+ +TATFNLPFIS+G +G GK+MVIGN YNSIL CPNG+SWNGGVN Sbjct: 515 TEAPSLVQPEKVSAETATFNLPFISVGDLGRGKVMVIGNSRYNSILVCPNGFSWNGGVNH 574 Query: 588 DGQCTLNSDPDDMKNFMENVLRYLSDDKWKPDAKASMTVGTNLDTVYFKRHGQVTGNSAA 647 G+C +SD +DM NF NVLRYL+ A +TVGTN+ VYFKR+GQV G+SA Sbjct: 575 QGECIASSDSNDMGNFFSNVLRYLTGKN-----SAELTVGTNIPYVYFKRYGQVMGSSAP 629 Query: 648 FDFHPDFAGISVEHLSSYGDLDPQEMPLLILNGFEYVTQVGNDPYAIPLRADTSKPKLTQ 707 F FA E L+S+ LDP+ +P++ILNG+EY G Y +PL ADT +PK++Q Sbjct: 630 FILDTRFAA-RTETLTSFEGLDPETLPVVILNGYEYRGLRGMGSYDLPLSADTDEPKMSQ 688 Query: 708 QDVTDLIAYLNKGGSVLIMENVMSNLKEESASGFVRLLDAAGLSMALNKS---VVNNDPQ 764 DVT+LI Y+N+GG+VL+ME + + + +A RLLD++G++ + S N Sbjct: 689 DDVTNLIDYVNRGGNVLMMETI---IDQANAGEMTRLLDSSGIAFGMGSSVIANGNGPSG 745 Query: 765 GYPNRVRQQRATGIWVYERYPAVDGA------LPYTIDSKTGEVKWKYQVENKPDDKPKL 818 GYP+RVR QR GIWV ERY A++G LPYTI + G V+W + +E KPDDKP L Sbjct: 746 GYPDRVRNQRQHGIWVLERYAAIEGGNGAAPMLPYTI-KEDGTVEWTFIIEGKPDDKPNL 804 Query: 819 EVASWLE-DVDGKQETRYAFIDEADHK------------------TEDSLKAAKEKIFAA 859 EVASWLE + +G + AFI EADH E SL AAK++I A Sbjct: 805 EVASWLEKNSEGSLVKQVAFIYEADHWQKNEQGQIIYNESGKPVLNEASLAAAKQRILNA 864 Query: 860 FP------GLKECTNPAYHYEVNCLEYRPGTGVPVTGGMYVPQYTQLSLNADTAKAMVQA 913 F +EC+N YHYEVNCLEYRPG +P GGM+VP YT+L L + AKAM++A Sbjct: 865 FVTSDGKLAYQECSNSHYHYEVNCLEYRPGNAIPTGGGMHVPFYTELKLGDEEAKAMIKA 924 Query: 914 ADLGTNIQRLYQHELYFRTNGRKGERLSSVDLERLYQNMSVWLWNDTSYRYEEGKNDELG 973 A+LGTNI+ LYQHE YFRT G++GERLSSVDL R+YQNM+VWLWND YRYE G +DELG Sbjct: 925 ANLGTNIEALYQHERYFRTKGKQGERLSSVDLNRIYQNMTVWLWNDLDYRYEAGHDDELG 984 Query: 974 FKTFTEFLNCYANDAYAGGTKCSADLKKSLVDNNMIYGDGSSKAGMMNPSYPLNYMEKPL 1033 F+ FTEFLNCY ND G T+C DLK L M+YG+G AG MNPSYPLNYMEKPL Sbjct: 985 FQRFTEFLNCYTNDVAGGNTQCPTDLKLELNQMGMVYGEG-EYAGQMNPSYPLNYMEKPL 1043 Query: 1034 TRLMLGRSWWDLNIKVDVEKYPGAVSEEGQNVTETISLYSNPTKWFAGNMQSTGLWAPAQ 1093 TRLMLGRS+WDL+IKVDV +PG Q T + + + T WFAGN Q TG WA AQ Sbjct: 1044 TRLMLGRSFWDLDIKVDVRAFPGEAKGS-QGRTIILDMRNQTTAWFAGNRQPTGQWAVAQ 1102 Query: 1094 KEVTIKS-NANVPVTVTVALADDLTGREKHEVALNRPPRVTKTYSLDASGTV----KFKV 1148 +E ++ + PVT+T+ALADDLTGREKHE+ L RPPR++K++ + F V Sbjct: 1103 QEFSVAVSGEDSPVTITIALADDLTGREKHELGLKRPPRMSKSFVIGGENGNPTSKTFTV 1162 Query: 1149 PYGGLIYIKGNSSTNESASFTFTGVVKAPFYKDGAWKNDLNSPAPLGELESDAFVYTTPK 1208 PYGGLIY +G +S E S TFTG + AP YK+G W+N L+SPAP+GE+ S++FV+T PK Sbjct: 1163 PYGGLIYAQGGNS--ELVSLTFTGTIDAPLYKEGKWENGLDSPAPIGEVVSNSFVFTAPK 1220 Query: 1209 KNLNASNYTGGLEQFANDLDTFASSMNDFYGRDS--EDGKHRMFTYKNLPGHKHRFTNDV 1266 NLNAS YTGG+ QFA DLD FA +NDFY RD E +R T ++ P ++H F NDV Sbjct: 1221 ANLNASGYTGGIAQFAQDLDRFALDLNDFYARDEGVEGQHNRKATSESNPNNRHHFVNDV 1280 Query: 1267 QISIGDAHSGYPVMNSSFSPNSTTLPTTPLNDWLIWHEVGHNAAETPLTVPGATEVANNV 1326 ISIG AHSGYPVMN+SF+ S +L T PLN WL+WHEVGHNAAE P V GATEV NN+ Sbjct: 1281 AISIGAAHSGYPVMNASFNATSKSLNTAPLNSWLLWHEVGHNAAEAPFNVDGATEVVNNL 1340 Query: 1327 LALYMQDRYLGKMNRVADDITVAPEYLEESNNQAWARGGAGDRLLMYAQLKEWAEKNFDI 1386 LALYMQD +LGKM RV DI +APE++ AW GGAG+RL+M+AQLKEWAE F I Sbjct: 1341 LALYMQDCHLGKMARVEQDIRIAPEFVSMERGHAWGAGGAGERLVMFAQLKEWAETEFQI 1400 Query: 1387 KKWYPDGTPLPEFYSEREGMKGWNLFQLMHRKARGDEVSNDKFGGKNYCAESNGNAADTL 1446 ++WY LP +YS+ +G+KGWNLF+LMHR R + G+N C S +D L Sbjct: 1401 ERWYS--GELPTYYSQEDGVKGWNLFKLMHRLTRNADDGVMTLKGENLCQPSGLGKSDQL 1458 Query: 1447 MLCASWVAQTDLSEFFKKWNPGANAYQLPGASEMSFEGGVSQSAYNTLASLDLPKPEQGP 1506 MLCAS+ AQTDL+EFF+ WNPG+ A+ P + +EGGV+ + + + + KP + P Sbjct: 1459 MLCASYAAQTDLTEFFQTWNPGSKAFIYPNDPKPHYEGGVTSAGVDRVKEQNYLKPNRDP 1518 Query: 1507 ETINQVTE 1514 INQ+++ Sbjct: 1519 LKINQISQ 1526 >UniRef50_A5F372 Accessory colonization factor AcfD n=25 Tax=Vibrio RepID=ACFD_VIBC3 Length = 1520 Score = 1697 bits (4395), Expect = 0.0, Method: Composition-based stats. Identities = 734/1566 (46%), Positives = 976/1566 (62%), Gaps = 106/1566 (6%) Query: 9 KSLLAAILSATLLAGCDGGGSGSSSDTPPVDSGTGSLPEVKPDPTPNPEPTPEPTPDPEP 68 K + +++ L GC P V D + +P Sbjct: 2 KIRIVSLIVLGFLIGCKHESI--------------ITPTVPADGSGGNAL------NPGL 41 Query: 69 TPEPIPDPEPTPEPEPEPVPTKTGYLTLGGSQRV-TGATCNGESSDGFTFKPGEDVTCVA 127 +PD P P + +TL G+ + + CN + + F ++V C Sbjct: 42 VGGYLPDIGV-------PDPIISLSMTLDGNLKFDSSLLCNDQDASHFQISQKDNVFCTI 94 Query: 128 GNTTIATFNTQSEAARSLRAVEKVSFSLEDAQELAGSDDKKSNAVSLVTSSNSCPANTEQ 187 +IATF +A ++ R + SL A E S ++ N L+ + + + ++ Sbjct: 95 NGRSIATFTAPFDANKNGRNTDSEVLSLISADEYRDSPVRQENLQILMKNMAT--IHGDK 152 Query: 188 VCLTFSSVIESKRFDS-LYKQIDLAPEEFKKLVNEEVENNAATDKAPSTHTSPVVPVTTP 246 + L F S +++ F++ L +DL ++F + + E++ N+ DK PSTH + P TP Sbjct: 153 ISLVFRSTLDALTFENYLRHNLDLPKDQFLEAITEKIANDNQVDKQPSTHVPNISPSFTP 212 Query: 247 GTKPDLNASFVSANAEQFYQYQPTEIILSEGRLVDSQGYGVAGVNYYTNSGRGVTG---- 302 GT +LN+ FVSANAE+ Y PT++I S GRL+DSQG + GV+Y++N+ RG+TG Sbjct: 213 GTSSNLNSPFVSANAEESLSYIPTDVIPSLGRLLDSQGRVINGVSYFSNNTRGITGVDKT 272 Query: 303 ----ENGEFSFSWGETISFGIDTFELGSVRGNKSTIALTELGDEVRGANIDQLIHRYSTT 358 +G F FSWG+ ISF IDTFELGS R NK+ ++ELG + G N + LIHRY++ Sbjct: 273 GAILNDGSFEFSWGDIISFSIDTFELGSTRANKTDFYISELGKDNEGKNAEALIHRYASI 332 Query: 359 GQNNTRVVPDDVRKVFAEYPNVINEIINLSLSNGA---TLGEGEQVVNLPNEFIEQFNTG 415 ++ ++PD V ++F+ YPNVINE+I+LSL NG +G+G+ + +P EF +QF++G Sbjct: 333 -DDSKLIIPDKVTQIFSLYPNVINEVISLSLPNGDIELDIGDGKTQI-VPGEFFKQFDSG 390 Query: 416 QAKEIDTAICA-KTDGCNEARWFSLTTRNVNDGQIQGVINKLWGVD-----TNYKSVSKF 469 A ID +I ++ + + QIQ +INKLWG +K V +F Sbjct: 391 LAALIDQSISPISRFKFEDSLPKKKSAIDSESSQIQDIINKLWGATDTVQANGWKKVDRF 450 Query: 470 HVFHDSTNFYGSTGNARGQAVVNISNAAFPILMARNDKNYWLAFGEKRAWDKNELAYITE 529 H+FHDSTNFYGSTG+AR QA VNI+N+AFP+LMARND NYW+ FG+ +AWD N LA+ITE Sbjct: 451 HIFHDSTNFYGSTGSARAQAAVNIANSAFPVLMARNDNNYWIDFGKPKAWDSNSLAFITE 510 Query: 530 APSLVEPENVTRDTATFNLPFISLGQVGEGKLMVIGNPHYNSILRCPNGYSWNGGVNKDG 589 APS V P+ V+ DT+TFNLPFISLG++G+GKLMV+GN YNS+L CPNG+SW GG K+G Sbjct: 511 APSTVVPDKVSEDTSTFNLPFISLGEIGKGKLMVLGNARYNSVLVCPNGFSW-GGTVKNG 569 Query: 590 QCTLNSDPDDMKNFMENVLRYLSDDKWKPDAKASMTVGTNLDTVYFKRHGQVTGNSAAFD 649 C+L+SD DDM NF NV+RYL+ + VGTN+ VYFK GQ G+ A F+ Sbjct: 570 TCSLSSDRDDMANFFSNVIRYLTGS-----TSNDVIVGTNIPEVYFKSSGQTMGSKANFE 624 Query: 650 FHPDFAGISVEHLSSYGDLDPQEMPLLILNGFEYVTQVGNDPYAIPLRADTSKPKLTQQD 709 F+ + L+S+ DLD +PL+I+N ++Y + N PY IPL AD PKL++ D Sbjct: 625 LDSRFSK-QTQQLTSFHDLDVNTIPLIIINAYDYKGKNINSPYDIPLSADVGSPKLSRSD 683 Query: 710 VTDLIAYLNKGGSVLIMENVMSNLKEESASGFVRLLDAAGLSMALNKSV---VNNDPQGY 766 VTDLI Y+N GGSVL+ME +++ E RLLD+AG++ + SV N G+ Sbjct: 684 VTDLIDYINNGGSVLMMETIINTNNSE----ISRLLDSAGIAFGIGNSVVADGNGPSGGH 739 Query: 767 PNRVRQQRATGIWVYERYPAVDG------ALPYTIDSKTGEVKWKYQVENKPDDKPKLEV 820 P+R R QR GIWV ERY AV+ LPY I+S G ++WKY VEN+PDDKPKLEV Sbjct: 740 PDRPRSQREHGIWVIERYAAVEDESSGQQTLPYVINS-DGSIEWKYIVENRPDDKPKLEV 798 Query: 821 ASWLEDVDG-KQETRYAFIDEADHKTED------------------SLKAAKEKIFAAFP 861 ASW+E G K T YAFIDE+ H +D SL AK K+ AF Sbjct: 799 ASWVESEAGDKLITHYAFIDESQHWKKDISGKIIYNVAGKPEVDNASLSLAKNKVLDAFK 858 Query: 862 ------GLKECTNPAYHYEVNCLEYRPGTGVPVTGGMYVPQYTQLSLNADTAKAMVQAAD 915 EC N +HYE+NCLEYRPG +P+TGG+YVP+YT + L A AMV+AA+ Sbjct: 859 NSSGQRAYSECKNSEFHYEINCLEYRPGNSIPITGGLYVPRYTDIKLGESEANAMVKAAN 918 Query: 916 LGTNIQRLYQHELYFRTNGRKGERLSSVDLERLYQNMSVWLWNDTSYRYEEGKNDELGFK 975 LGTNI LYQHE YFRT G+ G RL+SVDL R+YQNMSVWLWND YRY++ ++DELGFK Sbjct: 919 LGTNIHALYQHERYFRTKGKSGARLNSVDLNRIYQNMSVWLWNDLDYRYDDKQSDELGFK 978 Query: 976 TFTEFLNCYANDAYAGGTKCSADLKKSLVDNNMIYGDGS-SKAGMMNPSYPLNYMEKPLT 1034 FT++LNCY ++ G T C +LK L MIY + S S AG M+PSYPLNYMEKPLT Sbjct: 979 VFTQYLNCYTSNNAGGNTTCPEELKDELTQLGMIYDEKSGSYAGQMDPSYPLNYMEKPLT 1038 Query: 1035 RLMLGRSWWDLNIKVDVEKYPGAVSEEGQNVTETISLYSNPTKWFAGNMQSTGLWAPAQK 1094 RLMLGRS+WDL+IKVDV KYPG V+ T+ + +N WFAGN Q TG WA A + Sbjct: 1039 RLMLGRSFWDLDIKVDVRKYPGEVTTRSGGGDITLDMRNNTAAWFAGNRQPTGQWAEAHQ 1098 Query: 1095 EVTIKS-NANVPVTVTVALADDLTGREKHEVALNRPPRVTKTYSL--DASGTVKFKVPYG 1151 ++ PVT+T+ALADDLTGREKHE+ L RPPR++K++ + D+ F VPYG Sbjct: 1099 PFSVSVSGETSPVTITIALADDLTGREKHELGLKRPPRMSKSFVIGGDSPKMQTFTVPYG 1158 Query: 1152 GLIYIKGNSSTNESASFTFTGVVKAPFYKDGAWKNDLNSPAPLGELESDAFVYTTPKKNL 1211 GLIY +G +S + TF+G + AP Y DG W+N L S AP+GE+ SD F++T PK NL Sbjct: 1159 GLIYAQGGNS--QQVKLTFSGTIDAPLYIDGKWRNPLLSGAPIGEVVSDTFIFTAPKANL 1216 Query: 1212 NASNYTGGLEQFANDLDTFASSMNDFYGRDS--EDGKHRMFTYKNLPGHKHRFTNDVQIS 1269 NA Y GG+EQFA DLD F++ +NDFY RD + K+R T K++P ++H F NDV IS Sbjct: 1217 NADGYLGGIEQFAKDLDQFSADLNDFYARDEGADGDKNRKATDKSMPNNRHHFVNDVAIS 1276 Query: 1270 IGDAHSGYPVMNSSFSPNSTTLPTTPLNDWLIWHEVGHNAAETPLTVPGATEVANNVLAL 1329 +G AHSGYPVMN SF +S +L T PLN WL+WHEVGHN+AE P V GATEV NN+LAL Sbjct: 1277 VGAAHSGYPVMNDSFITSSRSLNTMPLNSWLLWHEVGHNSAEAPFNVDGATEVVNNLLAL 1336 Query: 1330 YMQDRYLGKMNRVADDITVAPEYLEESNNQAWARGGAGDRLLMYAQLKEWAEKNFDIKKW 1389 YMQDR+ GKM+RV DI A +++ + AW GGAG+RL+M+AQLKEWAE FDI W Sbjct: 1337 YMQDRHQGKMSRVEQDIRYAFDFVNAEHGHAWGAGGAGERLVMFAQLKEWAETEFDINDW 1396 Query: 1390 YPDGTPLPEFYSEREGMKGWNLFQLMHRKARGDEVSNDKFGGKNYCAESNGNAADTLMLC 1449 Y D LP FY E G+KGWNLF+LMHR R + G+N C S +D LMLC Sbjct: 1397 YND--KLPGFYIEESGIKGWNLFKLMHRLMRNENDDQINMKGENQCKISGIGKSDLLMLC 1454 Query: 1450 ASWVAQTDLSEFFKKWNPGANAYQLPGASEMSFEGGVSQSAYNTLASLDLPKPEQGPETI 1509 AS+ AQTDLSEFFK WNPG+ A+ P + +EGG++ S + SL L P++ P +I Sbjct: 1455 ASYAAQTDLSEFFKAWNPGSKAFLYPDDPQPYYEGGITPSGIQRVKSLKLNLPQKNPLSI 1514 Query: 1510 NQVTEH 1515 N VT+H Sbjct: 1515 NSVTQH 1520 >UniRef50_D0YX39 Accessory colonization factor AcfD n=1 Tax=Photobacterium damselae subsp. damselae CIP 102761 RepID=D0YX39_LISDA Length = 1509 Score = 1586 bits (4105), Expect = 0.0, Method: Composition-based stats. Identities = 599/1569 (38%), Positives = 876/1569 (55%), Gaps = 129/1569 (8%) Query: 10 SLLAAILSATLLAGCDGGGSGSSSDTP--PVDSGTGSLPE---VKPDPTPNPEPTPEPTP 64 LL+ ++A LL GC+G + ++D P P+D PE + +P+ EP P Sbjct: 4 KLLSLAITAALLTGCNGDSNSQNTDLPLTPLDPSIPVKPEQPLIPLEPSIPVEPEIPEPP 63 Query: 65 DPEPTPEPIPDPEPTPEPEPEPV-PTKTGYLTLGGSQRVTGATCNGES---SDGFTFKPG 120 PEP +P+ P +P G L L G Q V +CNG+ + FTFK G Sbjct: 64 VEPELPEPPTEPDTIPPSVLDPAIKIHKGGLQLSGKQLVGDISCNGQELALNGQFTFKDG 123 Query: 121 EDVTCVAGNTTIATFNTQSEAARSLR--AVEKVSFSLEDAQELAGSDDKKSNAVSLVTSS 178 +D+ C G+ + F+ Q+ RSL A + + F +E D N V ++ Sbjct: 124 DDIRCNFGSIEL--FSQQAPQPRSLHSDAQKVIHFDIEHFLHDGAVD----NTVQVLNKI 177 Query: 179 NSCPANTEQVCLTFSSVIESKRFDSLYKQIDLAPEEFKKLVNEEVEN-NAATDKAPSTHT 237 ++C ++ +VCL VI S +LY D E K+ +N E DKAPS+H Sbjct: 178 DTCKSDNNKVCL---DVINSYDIATLYNSTDT--EAVKEFINPSSEQVTEEVDKAPSSHV 232 Query: 238 SP-VVPVTTPGTKPDLNASFVSANAEQFYQYQPT--EIILSEGRLVDSQGYGVAGVNYYT 294 + P TPGT DLN+ FVSA+AE YQY+P+ ++ +L+D++G +AGV+YYT Sbjct: 233 DVTLKPEVTPGTSTDLNSQFVSASAESAYQYKPSVDNQEITVAKLLDAKGLPIAGVHYYT 292 Query: 295 NSGRGVTGENGEFSFSWGETISFGIDTFELGSVRGNKSTIALTELGDE-VRGANIDQLIH 353 S RGVT G+ + WGE I+FGIDTF GSV+GN+ + LT++ + + NID LI Sbjct: 293 PSSRGVTDSQGQIEYIWGEEITFGIDTFTFGSVKGNQLSYQLTDVTENSLVKQNIDSLIE 352 Query: 354 RYSTTGQNNTRVVPDDVRKVFAEYPNVINEIINLSLSNGATLGEGEQVVNLPNEFIEQFN 413 RYS ++ R V ++FA YPN INEIIN+SL NGA + EG ++PNEF QF+ Sbjct: 353 RYSKNLHDH-REFDTKVHQIFALYPNAINEIINISLPNGAKI-EGTNF-HVPNEFEYQFD 409 Query: 414 TGQAKEIDTAIC-AKTDGCNEARWFSLTTRNVNDGQIQGVINKLWGVDTNYKSVSKFHVF 472 +G AKEID + K+ + + N+N + Y V +FHVF Sbjct: 410 SGLAKEIDEQLKQPKSLWAKQTKIVKAHGSNINAT-----------LHQIYSGVQQFHVF 458 Query: 473 HDSTNFYGSTGNARGQAVVNISNAAFPILMARNDKNYWLAFGEKRAWDKNELAYITEAPS 532 HD ++YG++G AR +NISN AFPILM R D NYWL FG+++AW + +I +A + Sbjct: 459 HDVGSYYGASGFARLMRNLNISNTAFPILMPRMDSNYWLPFGKEQAWTREFKPHIVDATT 518 Query: 533 LV--------EPENVTRDTATFNLPFISLGQVGEGKLMVIGNPHYNSILRCPNGYSWNGG 584 + P V+ D ATFNLP IS GQ+G+G ++ +G+ HY +L CP+ Y N Sbjct: 519 IDADSKVTMLRPPKVSEDNATFNLPGISTGQIGKGSIVFMGSGHYPIVLSCPDSYWGNKS 578 Query: 585 VN-KDGQCTLNS-------------DPDDMKNFMENVLRYLSDDKWKPDAKASMTVGTNL 630 ++ KD QCT + D M+ F +N+ +L + + + ++ V TN+ Sbjct: 579 LSIKDQQCTYSINNNIVDPTTDRQFDNGSMQRFFKNLFTWL--EPSYQNGQNAINVATNI 636 Query: 631 DTVYFKRHGQVTGNSAAFDFHPDFAGISVEHLSS--YGDLDPQEMPLLILNGFEYVTQVG 688 + HG + F +S+E ++S + ++P+ P+L+L +E Sbjct: 637 ELAPKFDHGHQSWLPKYEFFINKSYNVSLERITSGNFSGINPETTPILLLQSYEI--GAF 694 Query: 689 NDPYAIPLRADTSKPKLTQQDVTDLIAYLNKGGSVLIMENVMSNLKEESASGFVRLLDAA 748 D +D S+PKLT DV DLI Y+N GG ++ + + ++ + +L D A Sbjct: 695 GDGTTTKNISDLSQPKLTVNDVNDLIQYVNAGGHIVFFDAI----EQVNPEPIAKLADMA 750 Query: 749 GLSMALNKSVVNNDPQGYPNRVRQQRATGI------------WVYERYPAVDGALPYTID 796 G+S+ Q Y +G+ VYER+ ++ + Sbjct: 751 GVSLGGANVAQAKTTQAYCGSSYYCHGSGVKPNVHAVTEHDLVVYERFETLNDDASKIVI 810 Query: 797 SKTGEVKWKYQVENKPDDKPKLEVASWLE-----DVDGKQETRYAFIDEADHKTEDSLKA 851 + G + W P+ PKLEVA + +DG + R+AF K+ED +A Sbjct: 811 NSDGTITWP-----APNKMPKLEVAKYTTPYMPLTIDGIPQERFAFFQV---KSEDEKRA 862 Query: 852 AKEKIFAAFPGLKECTNPAYHYEVNCLEYRPGTGVPVTGGMYVPQYTQLSLNADTAKAMV 911 A ++ AFPG+K C + Y +EVNC+E+R G G+P G Y + S++ +MV Sbjct: 863 AIHELQVAFPGVKVCQDD-YEFEVNCIEFRKGHGIPSFGNYQRANYERYSISPKVIDSMV 921 Query: 912 QAADLGTNIQRLYQHELYFRTNGRKGERLSSVDLERLYQNMSVWLWNDTSYRYEEGKNDE 971 +AA+LGTN+ +LYQHELY+RT G +G RLS +L + Y N SVW+WND YRY+ DE Sbjct: 922 EAANLGTNLTKLYQHELYYRTRGEQGHRLSLTELNQTYDNTSVWMWNDEPYRYDNSVEDE 981 Query: 972 LGFKTFTEFLNCYANDAYAGGTKCSADLKKSLVDNNMIYGDGSSKAGMMNPSYPLNYMEK 1031 LGFKT ++LNCY N+ + GG +CS D +++L+ ++ K G +NPSYPLNY EK Sbjct: 982 LGFKTAVDYLNCYTNNQHQGGIECSVDKQQALIKYGFLH-----KNGELNPSYPLNYQEK 1036 Query: 1032 PLTRLMLGRSWWDLNIKVDVEKYPGAVSEEGQNVTETISLYSNPTKWFAGNMQSTGLWAP 1091 PLTR+MLGRS+WDL+IKVD +YPG + T T+S +N NMQSTGLWA Sbjct: 1037 PLTRIMLGRSYWDLDIKVDTTQYPGRPAFTNGTQTVTVSTLNNAVTGTVNNMQSTGLWAH 1096 Query: 1092 AQKEVTIKSNANVPVTVTVALADDLTGREKHEVALNRPPRVTKTYSLDASGTVKFKVPYG 1151 ++ ++ + VP T+TV+L DDLTG E+HEVALNRPPRV K+++ D S + F+VPYG Sbjct: 1097 QHQQ--VQVSGGVPATITVSLIDDLTGLEQHEVALNRPPRVQKSFNYDGS-NLSFRVPYG 1153 Query: 1152 GLIYIKGNSSTNESASFTFTGVVKAPFYKDGAWKNDLNSPAPLGELESDAFVYTTPKKNL 1211 GLIYIK +S+ +A F+F+GV A F+KD W S PL E+++ +YTTP +N+ Sbjct: 1154 GLIYIKPHSNIEGTAEFSFSGVATAAFWKDNQWMYGKASDVPLAEIDTGHVIYTTPVENI 1213 Query: 1212 NASNYTGGLEQFANDLDTFASSMNDFYGRDS--EDGKHRMFTYKNLPGHKHRFTNDVQIS 1269 ++ F ++++ FA+S +DFYGRD GKHR FTY++L H+HRF ND+QIS Sbjct: 1214 EQ----QDIQIFVDEMNKFANSASDFYGRDEVVSVGKHRRFTYQDLADHRHRFVNDIQIS 1269 Query: 1270 IGDAHSGYPVMNSSFSPNSTTLPTTPLNDWLIWHEVGHNAAETPLTVPGATEVANNVLAL 1329 IG AHSGYPV +++++ +PTTP NDWL+WHE+GHN A P ++ G TEV NN+LAL Sbjct: 1270 IGAAHSGYPVQSTTYNKG-NKIPTTPTNDWLLWHEIGHNLASAPFSMTGGTEVTNNILAL 1328 Query: 1330 YMQDRY---LGKMNRVADDITVAPEYLEESNNQAWARGGAGDRLLMYAQLKEWAEKNFDI 1386 YMQ++ KM+RV DI P N W+ G AG RL+M+AQLK WAE +F I Sbjct: 1329 YMQEQRPEPNNKMSRVESDIQKMPLLFSRYNKHVWSNGDAGIRLVMFAQLKLWAENHFRI 1388 Query: 1387 KKWYPDGTPLPEFYSEREGMKGWNLFQLMHRKARGDEVSNDKFGGKNYCAESN--GNAAD 1444 WY + L + + +GWN+F+LMHRK+RGD + + G NYC+ S+ + D Sbjct: 1389 DNWYSEKDLLTIYNQD----QGWNMFKLMHRKSRGDSIGD---QGINYCSSSDTGLSGGD 1441 Query: 1445 TLMLCASWVAQTDLSEFFKKWNPGANAYQLPGASEMSFEGGVSQSAYNTLASL-DLPKPE 1503 LM+C+S+V+ DLS F+ WNP + LP ++ + GG++ Y L + +L +PE Sbjct: 1442 LLMVCSSYVSGFDLSNFYTLWNPSESMNILPNGDKL-YSGGITSKGYQVLNQIPNLKQPE 1500 Query: 1504 QGPETINQV 1512 PE+I + Sbjct: 1501 TSPESITHL 1509 >UniRef50_A6AXB7 AcfD n=3 Tax=Gammaproteobacteria RepID=A6AXB7_VIBPA Length = 1366 Score = 1585 bits (4103), Expect = 0.0, Method: Composition-based stats. Identities = 714/1381 (51%), Positives = 916/1381 (66%), Gaps = 63/1381 (4%) Query: 182 PANTEQVCLTFSSVIESKRFDS-LYKQIDLAPEEFKKLVNEEVENNAATDKAPSTHTSPV 240 + +++ L+ F + L Q+D+ + FKKL+ E + N++ TDK PSTHT V Sbjct: 3 TIHGDELDLSLEKTSHRLIFKNYLNNQLDVEIDTFKKLLQERLSNDSQTDKQPSTHTPEV 62 Query: 241 VPVTTPGTKPDLNASFVSANAEQFYQYQPTEIILSEGRLVDSQGYGVAGVNYYTNSGRGV 300 P TPG DL+ +FVSANAE+ +Y+P E+IL+ G LVDS G V G+ Y+T+ GRG+ Sbjct: 63 EPAVTPGASSDLSQAFVSANAEKSLEYKPKELILTTGYLVDSFGRSVNGIAYFTSKGRGL 122 Query: 301 TGE-------NGEFSFSWGETISFGIDTFELGSVRGNKSTIALTELGDEVRGANIDQLIH 353 TG +G FSWG+TI+FGIDTFELGS RGNK+TI L +LG G NI+ L+ Sbjct: 123 TGYKDGRLIGDGSLEFSWGDTINFGIDTFELGSTRGNKNTIKLQDLGSGNEGKNIESLVM 182 Query: 354 RYSTTGQNNTRVVPDDVRKVFAEYPNVINEIINLSLSNGA---TLGEGEQVVNLPNEFIE 410 R+S + + V D V +VF++YPNVINE I+LSLSN LG G V + EF + Sbjct: 183 RFSEEN-DQSVFVTDKVTEVFSKYPNVINEAISLSLSNEDIQLDLGNGNTEV-VKGEFEK 240 Query: 411 QFNTGQAKEIDTAICAKTDGCNEARWFSLTTRNVNDG--QIQGVINKLWGVDT-----NY 463 QF +G A++ID + + E + V+ +Q + +LWG + Sbjct: 241 QFESGLAEDIDKELGRQKLAFGEQYREPKQIKAVDSDAQNVQRDVERLWGATQQAQREGW 300 Query: 464 KSVSKFHVFHDSTNFYGSTGNARGQAVVNISNAAFPILMARNDKNYWLAFGEKRAWDKNE 523 K V +FH+FHDSTNFYGSTG+AR QA VNISN AFP++MARNDKNYW+ F + +AWD+N Sbjct: 301 KPVERFHIFHDSTNFYGSTGSARAQAAVNISNKAFPVVMARNDKNYWIDFDKPQAWDENG 360 Query: 524 LAYITEAPSLVEPENVTRDTATFNLPFISLGQVGEGKLMVIGNPHYNSILRCPNGYSWNG 583 LAYITEAPS V+P+ V ATFNLPFIS+G +G+GK+MV+GN YNS+L CPNG+SWNG Sbjct: 361 LAYITEAPSKVKPKKVDASNATFNLPFISIGDLGKGKVMVMGNARYNSVLVCPNGFSWNG 420 Query: 584 GVNKDGQCTLNSDPDDMKNFMENVLRYLSDDKWKPDAKASMTVGTNLDTVYFKRHGQVTG 643 GVN GQCT N+D DDM NF N +YL+ K + +V TN+ VYFKR GQV G Sbjct: 421 GVNDQGQCTGNTDSDDMANFFNNAFQYLTGKK-----AGTFSVATNIPHVYFKRGGQVLG 475 Query: 644 NSAAFDFHPDFAGISVEHLSSYGDLDPQEMPLLILNGFEYVT-QVGNDPYAIPLRADTSK 702 + A++ FA + L S+ LDP ++PL+ILN + Y+ Q G Y +P++A+ Sbjct: 476 SKASYLIDKRFAQ-DTQQLDSFSGLDPNDIPLVILNAYSYLGEQGGLGAYDLPMQANLDA 534 Query: 703 PKLTQQDVTDLIAYLNKGGSVLIMENVMSNLKEESASGFVRLLDAAGLSMALNKSV---V 759 PKLTQQD++DLIAY+ GGSVL+ME + ++ + RLLDAAG++ + +SV Sbjct: 535 PKLTQQDISDLIAYVEDGGSVLMMETIKG---QKDSGVVSRLLDAAGIAFGIGESVARDG 591 Query: 760 NNDPQGYPNRVRQQRATGIWVYERYPAVDG------ALPYTIDSKTGEVKWKYQVENKPD 813 N GYP+RVR QR GIWV ERY A D +LPY I + G V+WKY +EN+PD Sbjct: 592 NGPNGGYPDRVRSQRQQGIWVLERYAAEDSSNGEGPSLPYVI-KEDGSVEWKYIIENRPD 650 Query: 814 DKPKLEVASWLE-DVDGKQETRYAFIDEADHKTE-----DSLKAAKEKIFAAFP------ 861 DKPKLEVA W+E + G + + AFIDEA+ + ++L AK +I AF Sbjct: 651 DKPKLEVAKWIEINEQGDSKVQVAFIDEANFYQDGTFDNEALTVAKNRILDAFKDNSGKR 710 Query: 862 GLKECTNPAYHYEVNCLEYRPGTGVPVTGGMYVPQYTQLSLNADTAKAMVQAADLGTNIQ 921 +ECTN YHYEVNCLEYRPG +P++GG+YVP YT++ L AKAMV+AA++G+NI+ Sbjct: 711 AYEECTNNEYHYEVNCLEYRPGNKIPISGGLYVPNYTEMKLGEHEAKAMVKAANIGSNIE 770 Query: 922 RLYQHELYFRTNGRKGERLSSVDLERLYQNMSVWLWNDTSYRYEEGKNDELGFKTFTEFL 981 LYQHE YFRT G++G RL+SVD+ R+YQN+SVWLWND Y Y++ KNDELGFK FTEFL Sbjct: 771 ALYQHERYFRTKGKQGFRLNSVDMSRMYQNLSVWLWNDLRYSYDQEKNDELGFKRFTEFL 830 Query: 982 NCYANDAYAGGTKCSADLKKSLVDNNMIYGDGSSKAGMMNPSYPLNYMEKPLTRLMLGRS 1041 NCY +D G T C LK L +MIY +G AG MNPSYPLNYMEKPLTRLMLGRS Sbjct: 831 NCYTDDKAGGNTICPESLKLELQKMDMIYAEG-EYAGYMNPSYPLNYMEKPLTRLMLGRS 889 Query: 1042 WWDLNIKVDVEKYPGAVSEEGQN-VTETISLYSNPTKWFAGNMQSTGLWAPAQKEVTIKS 1100 +WDL++KVD +PG S G N T T+ + +N T WFAG+ Q+TG WA A T+ Sbjct: 890 FWDLDVKVDTRPFPGVASSSGSNGGTITLDMSNNVTAWFAGSRQATGQWAQAHVPFTVSV 949 Query: 1101 -NANVPVTVTVALADDLTGREKHEVALNRPPRVTKTYSLDASGTV--KFKVPYGGLIYIK 1157 A PVT+TVALADDLT REKHEV L RPPR+TK++ + + VPYGGLIY + Sbjct: 950 SGAKAPVTITVALADDLTAREKHEVGLKRPPRMTKSFIIGGNKATSETITVPYGGLIYAQ 1009 Query: 1158 GNSSTNESASFTFTGVVKAPFYKDGAWKNDLNSPAPLGELESDAFVYTTPKKNLNASNYT 1217 G +S ES TFTG + AP + DG+WKNDL+SPAP+GE+ S +F+YT PK NL A NY Sbjct: 1010 GGNS--ESVQLTFTGTLAAPLFIDGSWKNDLDSPAPVGEVVSKSFIYTGPKANLRAENYP 1067 Query: 1218 GGLEQFANDLDTFASSMNDFYGRDS--EDGKHRMFTYKNLPGHKHRFTNDVQISIGDAHS 1275 GG+EQFA DLD FAS +NDFY RD + +R T P +H F NDV ISIG AHS Sbjct: 1068 GGIEQFAKDLDQFASDLNDFYARDEGLDGQANRKVTGDENPNSRHHFVNDVAISIGAAHS 1127 Query: 1276 GYPVMNSSFSPNSTTLPTTPLNDWLIWHEVGHNAAETPLTVPGATEVANNVLALYMQDRY 1335 GYPVMNSS++ NS+ + TTPLNDWL+WHEVGHNAAE P V GATEV NN+LALYMQD + Sbjct: 1128 GYPVMNSSYNLNSSNINTTPLNDWLLWHEVGHNAAEAPFVVEGATEVVNNLLALYMQDLH 1187 Query: 1336 LGKMNRVADDITVAPEYLEESNNQAWARGGAGDRLLMYAQLKEWAEKNFDIKKWYPDGTP 1395 +GKM RV DI VAPE++ + AWA GGA +RL+M+AQLKEWAE FDI+ WY Sbjct: 1188 IGKMTRVEQDIQVAPEFVRTEHGHAWAAGGAAERLVMFAQLKEWAESEFDIRDWY--QGE 1245 Query: 1396 LPEFYSEREGMKGWNLFQLMHRKARGDEVSNDKFGGKNYCAESNGNAADTLMLCASWVAQ 1455 LP +YSE EG+KGWNLF+LMHR R + N C + +D LM+CAS+ AQ Sbjct: 1246 LPSYYSEVEGVKGWNLFKLMHRLTRNESDGIFYLKSTNACRWQGLSKSDQLMVCASYAAQ 1305 Query: 1456 TDLSEFFKKWNPGANAYQLPGASEMSFEGGVSQSAYNTLASLDLPKPEQGPETINQVTEH 1515 TDLS+FF WNPGA ++ PG+SE S+EGGV+Q + + L L KP PE IN +T Sbjct: 1306 TDLSDFFLAWNPGARSFIYPGSSEPSYEGGVTQKGLDVVRKLGLKKPSLDPEEINTITVR 1365 Query: 1516 K 1516 K Sbjct: 1366 K 1366 >UniRef50_A6ALV4 AcfD n=5 Tax=Vibrio RepID=A6ALV4_VIBHA Length = 1466 Score = 1551 bits (4015), Expect = 0.0, Method: Composition-based stats. Identities = 578/1553 (37%), Positives = 835/1553 (53%), Gaps = 150/1553 (9%) Query: 9 KSLLAAILSATLLAGCDGGGSGSSSDTPPVDSGTGSLPEVKPDPTPNPEPTPEPTPDPEP 68 K L +IL A+LL GC G + S+ T P P V+PD P+ Sbjct: 2 KRSLLSILVASLLFGCGGDENNHSTSTTP--------PTVEPDLPPDQ------------ 41 Query: 69 TPEPIPDPEPTPEPEPEPVPTKTGYLTLGGSQRVTGATCNGES---SDGFTFKPGE-DVT 124 PD T T G L + G Q C+G+ S FT+ E + + Sbjct: 42 -----PDIGVT---------TYQGKLFINGKQLTGDIQCDGQDNSESGYFTYAASEGNFS 87 Query: 125 CVAGNTTIATFNTQSEAARSLRAVEKVSFSLEDAQELAGSDDKKSNAVSLVTSSNSCPAN 184 C G ++ F+ Q A + D +++ G+ +NA L+ ++CP+ Sbjct: 88 CEFGAVSLGEFSYQIPAQTRTGSQPAELTQNYDLKDVLGT--HANNAAKLLHKIDTCPSQ 145 Query: 185 TEQVCLTFSSVIESKRFDSLYKQIDLAPEEFKKLVNEEVENNAATDKAPSTHT-SPVVPV 243 QVCL I S LY+ D A +N V N PS H + P Sbjct: 146 DTQVCL---DEINSYDIQDLYESDDQAA--IDAFLNPSVVNEGE---QPSAHVDPELQPE 197 Query: 244 TTPGTKPDLNASFVSANAEQFYQYQPT--EIILSEGRLVDSQGYGVAGVNYYTNSGRGVT 301 TPG +L SFVSANAE Y+Y+P+ L + RL DSQG +AG+ +++ S RG+T Sbjct: 198 VTPGASNNLTGSFVSANAEAAYEYKPSAANKPLIKSRLTDSQGNALAGIEFFSQSARGIT 257 Query: 302 GENGEFSFSWGETISFGIDTFELGSVRGNKSTIALTELGD-EVRGANIDQLIHRYSTTGQ 360 NGEF + WGE + FGIDTF LG V+GN+ + L +L D + N+D +HRY + Sbjct: 258 DANGEFEYLWGENLIFGIDTFTLGQVKGNQVSYQLADLSDNPLVKQNLDAFVHRYGLSSG 317 Query: 361 NNTRVVPDDVRKVFAEYPNVINEIINLSLSNGATLGEGEQVVNLPNEFIEQFNTGQAKEI 420 NN + D+VR+VFA+YPNVINE+INLSL NGA + EG PNEF QF+ G + I Sbjct: 318 NNIE-IGDNVRQVFAQYPNVINELINLSLPNGAKI-EGTNFTT-PNEFEAQFSQGLTQII 374 Query: 421 DTAICAKTDGCNEARWFSLTTRNVNDGQIQGVINKLWGVDTNYKSVSKFHVFHDSTNFYG 480 D + +W TT + + G + Y V H+FHD+ +G Sbjct: 375 DGQLKQTP------QWSGFTTPMLRTVRASGSNYVTQSLHQIYAGVDSVHIFHDNHG-WG 427 Query: 481 STGNARGQAVVNISNAAFPILMARNDKNYWLAFGEKRAWDK----NELAYITEAPSLV-- 534 +G R N++N AFP+LM RND +YWL FGE+ AW + ++ AYI +A ++ Sbjct: 428 GSGYTRAMRNFNLTNEAFPVLMPRNDNSYWLGFGEEAAWTRGSGKDQKAYIVDATTIDEN 487 Query: 535 ------EPENVTRDTATFNLPFISLGQVGEGKLMVIGNPHYNSILRCPNGYSWNG--GVN 586 PE +++ TATFNLP ++ G +G GK++ +GN Y SI CP Y G++ Sbjct: 488 STVVMQRPEVISKQTATFNLPTMTAGMIGSGKVVFLGNAMYTSIFSCPENYWAGADLGID 547 Query: 587 KDGQCTLNS--------------DPDDMKNFMENVLRYLSDDKWKPDAKASMTVGTNLDT 632 + Q S D M+ N++ +L + + S+ + TN++ Sbjct: 548 SEVQQCRYSTPHNQEAQDADTRTDNGSMQVMFGNLIDWLVPNA----TQESVAIATNINK 603 Query: 633 VYFKRHGQVTGNSAAFDFHPDFAGISVEHLSS--YGDLDPQEMPLLILNGFEYVTQVGND 690 + R + G F +P + ++ LSS + LDP PLL+L +E T G D Sbjct: 604 GHAFRWDRKEGQIYDFFVNPSYKLGEMDVLSSGQFDSLDPTSTPLLLLQSYEIKTD-GYD 662 Query: 691 PYAIPLRADTSKPKLTQQDVTDLIAYLNKGGSVLIMENVMSNLKEESASGFVRLLDAAGL 750 ++ +D ++PKL DVT LI Y+N GGS++ + L+E + RL DAAG+ Sbjct: 663 TKSV--VSDINQPKLDADDVTALIEYVNNGGSIIFFDA----LEESNPEPIARLADAAGV 716 Query: 751 SMA---LNKSVVNNDPQGY-------PN--RVRQQRATGIWVYERYPAVDGALPYTIDSK 798 S+ + K+ + Y PN + + VYERY + Sbjct: 717 SVGGANVAKTFQSLCTDSYWCHSTSGPNVPNLHTVAEYDLVVYERYADT----TKIEIND 772 Query: 799 TGEVKWKYQVENKPDDKPKLEVASWLEDVDGKQETRYAFIDEADHKTEDSLKAAKEKIFA 858 G V W + D P LE+ + +DG++ RYAF K+E +AA ++ Sbjct: 773 NGTVTWPGNI-----DMPTLEIPLYKASIDGQEHQRYAFHMV---KSEQEKQAAVAELQR 824 Query: 859 AFPGLKECTNPAYHYEVNCLEYRPGTGVPVTGGMYVPQYTQLSLNADTAKAMVQAADLGT 918 FPG+ C + Y YEVNC+E R G G+P G + P +T+ ++ + +MV+AA+LG Sbjct: 825 EFPGVPVCKDD-YQYEVNCIEVREGHGIPSRGNHHRPDFTRYEMSPEVVDSMVKAANLGA 883 Query: 919 NIQRLYQHELYFRTNGRKGERLSSVDLERLYQNMSVWLWNDTSYRYEEGKNDELGFKTFT 978 NI RL HELY+R+ G GERLS +L Y N+SVWLWND Y + DELGF+ Sbjct: 884 NIDRLLSHELYYRSKGEIGERLSQAELTSTYDNLSVWLWNDEQYEFNPNVQDELGFERAV 943 Query: 979 EFLNCYANDAYAGGTKCSADLKKSLVDNNMIYGDGSSKAGMMNPSYPLNYMEKPLTRLML 1038 E LNCY ++A+ GG C + ++ L +MI ++G +NPSYPLN+MEKPLTR+ML Sbjct: 944 EMLNCYTDNAHQGGNVCGQETREQLAKWSMIT-----ESGELNPSYPLNWMEKPLTRMML 998 Query: 1039 GRSWWDLNIKVDVEKYPGAVSEEGQNVTETISLYSNPTKWFAGNMQSTGLWAPAQKEVTI 1098 GRS+WDL+I VD YPG S+ G + I + AG+MQSTGLWAP +EVTI Sbjct: 999 GRSYWDLDISVDTTSYPGRPSQSGSAASVAIHTDNKTVIGTAGSMQSTGLWAPQLEEVTI 1058 Query: 1099 KSNANVPVTVTVALADDLTGREKHEVALNRPPRVTKTYSLDASGTVKFKVPYGGLIYIKG 1158 V ++ VAL DDLTGR HE++L RPPRV KT+ D S ++ FKVPYGGLIYI+ Sbjct: 1059 S--GGVKASINVALVDDLTGRANHELSLKRPPRVQKTFQYDGS-SLSFKVPYGGLIYIQP 1115 Query: 1159 -NSSTNESASFTFTGVVKAPFYKDGAWKNDLNSPAPLGELESDAFVYTTPKKNLNASNYT 1217 + + +F FTGV++A ++K+G+W N +N+ PL E++S F+YTTP N+ ++ Sbjct: 1116 LEVDSRDVVTFNFTGVLRASWWKNGSWLNPINTDVPLAEIDSGHFIYTTPTNNVQDTDVP 1175 Query: 1218 GGLEQFANDLDTFASSMNDFYGRDS--EDGKHRMFTYKNLPGHKHRFTNDVQISIGDAHS 1275 +F ++L+ FA+ +DFYGRD E G+HR FTY L ++HRF NDVQISIG AHS Sbjct: 1176 ----KFVDELNAFANHASDFYGRDQVIEQGQHRRFTYDALLANRHRFVNDVQISIGAAHS 1231 Query: 1276 GYPVMNSSFSPNSTTLPTTPLNDWLIWHEVGHNAAETPLTVPGATEVANNVLALYMQDRY 1335 GYPV ++S+ P T +PT P+NDWL+WHEVGHN A P + G+TEV NN+LALYMQ++ Sbjct: 1232 GYPVQSNSYWPTWTVIPTNPINDWLLWHEVGHNLASAPFMMAGSTEVTNNILALYMQEQR 1291 Query: 1336 LGK--MNRVADDITVAPEYLEESNNQAWARGGAGDRLLMYAQLKEWAEKNFDIKKWYPDG 1393 K M+R+A D++ +P +L+ AW+ G RL M+ QLK WAE +F+I WY + Sbjct: 1292 EEKPYMDRIASDLSKSPLWLDRFEGHAWSEADVGMRLAMFGQLKLWAEDHFNIDDWYSNQ 1351 Query: 1394 TPLPEFYSEREGMKGWNLFQLMHRKARGDEVSNDKFGGKNYCAESNGNAADTLMLCASWV 1453 P + E + GWN F+L HRKARGD +S+ + S + D +M C S++ Sbjct: 1352 AEKPSIFGEDQ---GWNFFKLAHRKARGDSISDQGINYCS-TQSSQLSQGDLMMACTSYL 1407 Query: 1454 AQTDLSEFFKKWNPGANAYQLPGASEMSFEGGVSQSAYNTLASLDLPKPEQGP 1506 DL+++F+ WNP LP + + + GG++ + +N +A++ LPKPE+ P Sbjct: 1408 TGYDLTDYFRMWNPSETKANLPNGT-VDYSGGLTPAGFNAVAAMGLPKPEKSP 1459 >UniRef50_B5FBF1 Inner membrane lipoprotein n=1 Tax=Vibrio fischeri MJ11 RepID=B5FBF1_VIBFM Length = 1482 Score = 1524 bits (3945), Expect = 0.0, Method: Composition-based stats. Identities = 587/1572 (37%), Positives = 837/1572 (53%), Gaps = 162/1572 (10%) Query: 9 KSLLAAILSATLLAGCDGGGSGSSSDTPPVDSGTGSLPEVKPDPTPNPEPTPEPTPDPEP 68 K LL A L LLAGC+ S + Sbjct: 3 KKLLLASLIPMLLAGCNQEEINIGSGS--------------------------------- 29 Query: 69 TPEPIPDPEPTPEPEPEPVPTKTGYLTLGGSQRVTGATCNGE----SSDGFTFKPGEDVT 124 D T P P T L G CNG+ S FT K G Sbjct: 30 ------DSGATTPPTPAIPTTYISTLMASGKIITGDVHCNGKSLNTDSGTFTVKEGSVFD 83 Query: 125 CVAGNTTIATFNTQSEAARSLRAVEKVSFSLEDAQELAGSDDKKSNAVSLVTSSNSCPAN 184 C G T+ F + A+ S + D Q + GS NA ++ S ++C Sbjct: 84 CSLGGVTLGEFKAPTPEAKISGVTNTTSEASFDLQAVKGS-----NATRILQSISTC-TQ 137 Query: 185 TEQVCLTFSSVIESKRFDSLYKQIDLAPEEFKKLVNEEVENNAATDKAPSTHTS-PVVPV 243 + +CL ++S +Y +D L ++E E K PS+H +VP Sbjct: 138 EDSICL---DDLDSIDIQDIYSDLDNNESVNAFLKSKEEEKTDEVGKTPSSHVDAEIVPE 194 Query: 244 TTPGTKPDLNASFVSANAEQFYQYQPTE--IILSEGRLVDSQGYGVAGVNYYTNSGRGVT 301 TPGT DLN+ FVSANAE Y Y+P+ +L++ +L DS G +AGVN+++ + G+T Sbjct: 195 VTPGTSNDLNSGFVSANAEDSYAYKPSAEAKVLTKSQLTDSTGTPLAGVNFFSANAVGIT 254 Query: 302 GENGEFSFSWGETISFGIDTFELGSVRGNKSTIALTELGDE-VRGANIDQLIHRYSTTGQ 360 GENGEF + WG+ ++FGIDTFE GSV GN+ + +T++ D V ANI LI RY+ Sbjct: 255 GENGEFEYLWGDKLTFGIDTFEFGSVAGNQVSYKITDVSDNAVVKANIQSLITRYAENNH 314 Query: 361 NNTRVVPDDVRKVFAEYPNVINEIINLSLSNGATLGEGEQVVNLPNEFIEQFNTGQAKEI 420 N ++ + V+ F+ YPNVINE+INLSL NG + EG +LP+EF QF G I Sbjct: 315 NGL-LISEKVQDTFSLYPNVINELINLSLPNGGQI-EGTNF-SLPDEFDAQFQNGLTAAI 371 Query: 421 DTAICAKTDGCNEARWFSLTTRNVNDGQIQGVINKLWGVDTNYKSVSKFHVFHDSTNFYG 480 D + + + +FS + V + L + + V+ FHVF+D+ +FYG Sbjct: 372 DAELQQQ----RASFYFSDFPHVFSLDNGTYVTDSLTRI---FNGVTSFHVFNDNGSFYG 424 Query: 481 STGNARGQAVVNISNAAFPILMARNDKNYWLAFGEKRAWDKNELAYITEAPSLVEPEN-- 538 +TG RG +N+SN AFPI+M R D N + FGE++AW + YI P++ P Sbjct: 425 ATGYTRGMRALNLSNRAFPIMMPRADINKDIPFGEQQAWTREGRPYIAVHPTIEMPPIPL 484 Query: 539 VTRDTATFNLPFISLGQVGEGKLMVIGNPHYNSILRCPNGYSWNGGVNKD---GQCTLN- 594 V++D ATF PF++ G++G GK++ +GN Y SI+ CP+ Y N + D CT + Sbjct: 485 VSKDNATFGFPFVTAGEIGSGKVVFMGNSMYPSIISCPDNYWANDALRIDSALQSCTSSF 544 Query: 595 -------SDPDDMKNFMENVLRYLSDDKWKPDAKASMTVGTNLDTVYFKRHGQVTGNSAA 647 +D MK F N+ +L++DK + + V TN+D R G GN+ Sbjct: 545 DLANDPRNDNGSMKTFFNNLFTWLNNDK----SIKGINVATNIDVATALRSGTSHGNAYD 600 Query: 648 FDFHPDFAGISVEHLSS---YGDLDPQEMPLLILNGFEYVTQVGNDPYAIPLRADTSKPK 704 F +P F SVE L+ G L E PLLIL + Q D + AD P Sbjct: 601 FFVNPSFGFSSVEKLTKDGFSGRLSASETPLLILQAYPPKPQG--DGMSHRFIADLDNPN 658 Query: 705 LTQQDVTDLIAYLNKGGSVLIMENVMSNLKEESASGFVRLLDAAGLSMALNKSVVNNDPQ 764 L+Q D+T LI Y+N+GGSVL M+ + + RL D+AG+S L S V Q Sbjct: 659 LSQDDITALITYINEGGSVLFMDAIDKVT---NPEPIGRLADSAGVS--LGGSNVTPTSQ 713 Query: 765 GYPNRVRQQR----------ATGIWVYERYPAVDGALPYTIDSKTGEVKWKYQVENKPDD 814 + + + V ER+ VDG PYT++ + G V+W K + Sbjct: 714 AFCGSSYYCQAPSPNLHVKSQYEMVVLERFQDVDGQQPYTVN-QDGSVEW-----TKDET 767 Query: 815 KPKLEVASW-----------LEDVDGKQ--ETRYAFIDEADHKTEDSLKAAKEKIFAAFP 861 K K E+ ++ L D DG ET++A I K + AA ++ AF Sbjct: 768 KIKFEIPTYEIIKRDDKGDPLLDKDGNPVMETKFARIFV---KNGEERAAAISELQEAFE 824 Query: 862 GLKECTNPAYHYEVNCLEYRPGTGVPVTGGMYVPQYTQLSLNADTAKAMVQAADLGTNIQ 921 G C++ Y YE NC+E R G G+ V G + + + +N D ++MV+AA+LG N Sbjct: 825 GTPLCSHS-YEYEFNCIETRQGDGIQVRGAYWRADFDRYQMNQDVVESMVKAANLGDNFN 883 Query: 922 RLYQHELYFRTNGRKGERLSSVDLERLYQNMSVWLWNDTSYRYEEGKNDELGFKTFTEFL 981 L +HE+Y+RT G++G RLS+V+L + Y N+S+W+WND Y Y+ DELGFKT FL Sbjct: 884 ALMEHEMYYRTKGKQGTRLSTVELNQTYDNLSIWMWNDNPYAYDPNVQDELGFKTAVNFL 943 Query: 982 NCYANDAYAGGT---KCSADLKKSLVDNNMIYGDGSSKAGMMNPSYPLNYMEKPLTRLML 1038 NCY ++ + C +LK +L+ N MI+G+G AG MNPSYPLNYMEKPLTR+ML Sbjct: 944 NCYTDNQHQTDVPEAACPVELKATLIANGMIHGEG-ELAGQMNPSYPLNYMEKPLTRIML 1002 Query: 1039 GRSWWDLNIKVDVEKYPGAVSEEGQNVTETISLYSNPTKWFAGNMQSTGLWAPAQKEVTI 1098 GRS+WD I VD KYPG + + I + AGN QSTGLWAP + + Sbjct: 1003 GRSFWDHEITVDTTKYPGRTNGATTSEVVNIETAGKAVSYSAGNNQSTGLWAP--QLSEV 1060 Query: 1099 KSNANVPVTVTVALADDLTGREKHEVALNRPPRVTKTYSLDASGTVKFKVPYGGLIYIKG 1158 +V +TV +ADDLTG+ +HE +LNRPPR+ +++ D + FKVPYGGLIYIK Sbjct: 1061 TVRGDVTAMITVMMADDLTGKPQHETSLNRPPRMQMSFAHDGR-STTFKVPYGGLIYIKP 1119 Query: 1159 NS---STNESASFTFTGVVKAPFYK------DGAWKNDLN-SPAPLGELESDAFVYTTPK 1208 + A F+ GV KA ++K G W N + S AP+ E+++ +F+YTT Sbjct: 1120 TEILSGASTVAEFSLDGVEKAAWWKKDPANNLGEWVNTPDSSTAPIAEIDTGSFIYTTAL 1179 Query: 1209 KNLNASNYTGGLEQFANDLDTFASSMNDFYGRDSE--DGKHRMFTYKNLPGHKHRFTNDV 1266 N+ + L +F+ +++ FA + +DFYGRD E DGKHR FTY L +HRF NDV Sbjct: 1180 NNVKTA----DLNEFSKNMNRFADAASDFYGRDEESADGKHRRFTYPELKEFRHRFVNDV 1235 Query: 1267 QISIGDAHSGYPVMNSSFSPNSTTLPTTPLNDWLIWHEVGHNAAETPLTVPGATEVANNV 1326 QISIG AHSGYPVM+SSF+ +S +PT ++DWL+WHEVGHN A P + PG+TEV NN+ Sbjct: 1236 QISIGAAHSGYPVMSSSFNASSNKIPTNAIDDWLVWHEVGHNLASAPFSAPGSTEVTNNL 1295 Query: 1327 LALYMQDR----YLGKMNRVADDITVAPEYLEESNNQAWARGGAGDRLLMYAQLKEWAEK 1382 LALYMQ+ +M+R+ I AP +L ++ AW+ G AG RL+M+ QLK WAE Sbjct: 1296 LALYMQELEGRNANPEMDRIRTSIQKAPAWLSSNDGHAWSHGDAGLRLVMFGQLKIWAEN 1355 Query: 1383 NFDIKKWYPDGTPLPEFYSEREGMKGWNLFQLMHRKARGDEVSNDKFGGKNYCA--ESNG 1440 +F+I +WY DG P Y++ + GWN+ +LMHRKARGD+ + G NYC+ ++ Sbjct: 1356 HFEIDRWYVDGETKPAIYNQDQ---GWNMIKLMHRKARGDQQGDA---GINYCSSGDTGL 1409 Query: 1441 NAADTLMLCASWVAQTDLSEFFKKWNPGANAYQLPGASEMSFEGGVSQSAYNTLASLDLP 1500 +A D +M+C+S+V+ DL EFF+ WN G + +++ + GG+S + + LA L L Sbjct: 1410 SAGDLMMVCSSYVSGYDLGEFFQAWNVGETSVTNADGTKV-YSGGISSAGLSKLAELKLN 1468 Query: 1501 KPEQGPETINQV 1512 P++ P TIN + Sbjct: 1469 NPKKDPLTINAL 1480 >UniRef50_A8H5W9 Inner membrane lipoprotein n=1 Tax=Shewanella pealeana ATCC 700345 RepID=A8H5W9_SHEPA Length = 1426 Score = 1514 bits (3919), Expect = 0.0, Method: Composition-based stats. Identities = 574/1541 (37%), Positives = 835/1541 (54%), Gaps = 152/1541 (9%) Query: 9 KSLLAAILSATLLAGCDGGGSGSSSDTPPVDSGTGSLPEVKPDPTPNPEPTPEPTPDPEP 68 K L+ A++ + LLAGC G +D P +P+ P Sbjct: 2 KKLILAVVISNLLAGC-----GDYTDA----------PSTPVEPSIPPTDLIPAK----- 41 Query: 69 TPEPIPDPEPTPEPEPEPVPTKTGYLTLGGSQRVTGATCNGESSDG---FTFKPGEDVTC 125 T G L L G + +CNG++ FTFK G++V+C Sbjct: 42 -------------------KTYQGSLLLSGKKLSGHISCNGQALGHGGSFTFKDGDNVSC 82 Query: 126 VAGNTTIATFNTQSEAARSLRAVEKVSFSLEDAQELAGSDDKKSNAVSLVTSSNSCPANT 185 G+ + + + + ++ ++D A S ++A +++ ++CPA Sbjct: 83 TYGSLELLNKDIPLPDGWTRDSHNAMALEIKDDWAHAIS---VTDAAKVMSKVSTCPALA 139 Query: 186 EQVCLTFSSVIESKRFDSLYKQIDLAPEEFKKLVNE-EVENNAATDKAPSTHTSP-VVPV 243 +++CL I+S L+ + A + +N E DKAPS+H + P Sbjct: 140 DEICL---DEIDSFDVSPLFSNGNAA--DINAFLNPPAAEETDEIDKAPSSHVDSSLTPE 194 Query: 244 TTPGTKPDLNASFVSANAEQFYQYQPTE--IILSEGRLVDSQGYGVAGVNYYTNSGRGVT 301 + GTKPD+NA FVSA+AE Y Y+P+E + SE L D+QG +AGVNYYT S RG+T Sbjct: 195 VSAGTKPDINADFVSASAEDAYTYKPSEDARVESESVLTDNQGKPIAGVNYYTKSSRGIT 254 Query: 302 GENGEFSFSWGETISFGIDTFELGSVRGNKSTIALTELGDE-VRGANIDQLIHRYSTTGQ 360 +G S+ WGETI+FG+DTF SV+GN+ L++ + + NI LI RY+T Sbjct: 255 DASGIVSYVWGETITFGLDTFTFSSVKGNQIEYKLSDGSENEIVKQNISALIERYATHTT 314 Query: 361 NNTRVVPDDVRKVFAEYPNVINEIINLSLSNGATLGEGEQVVNLPNEFIEQFNTGQAKEI 420 ++ ++V +VF +YPNVINEIINL+L NGA + +PNEF QFNTG A I Sbjct: 315 DSVS-FDENVHRVFGQYPNVINEIINLNLPNGAEIESSGYF--VPNEFNAQFNTGLALII 371 Query: 421 DTAICAKTDGCNEARWFSLTTRNVN-DGQIQGVINKLWGVDTNYKSVSKFHVFHDSTNFY 479 D + R+ T + G + + +L YK V +FHVFHD+++FY Sbjct: 372 DAELNLSP-----TRFSQQATPLLQKAGYVTNSLQQL------YKDVDQFHVFHDNSSFY 420 Query: 480 GSTGNARGQAVVNISNAAFPILMARNDKNYWLAFGEKRAWDKN-ELAYITEAPSLV---- 534 G G AR +N SN AFP+LM RND NYWL FG ++A+ ++ Y+T+A ++ Sbjct: 421 GEVGYARFMRSMNTSNTAFPVLMPRNDVNYWLPFGSEQAYRRDDGFPYVTDAKTIDASSD 480 Query: 535 ----EPENVTRDTATFNLPFISLGQVGEGKLMVIGNPHYNSILRCPNGYSWNGGVNKDGQ 590 PE V DTAT+NLP I+ G++G GK++ +GN Y +IL P Y G Sbjct: 481 VILKRPERVGTDTATYNLPVITAGEIGLGKVVFMGNSMYPNILSKPENYWAGGEE----- 535 Query: 591 CTLNSDPDDMKNFMENVLRYLSDDKWKPDAKASMTVGTNLDTVYFKRHGQVTGNSAAFDF 650 D M F N+ + + + K ++ VG+N+D V+ + F Sbjct: 536 --AGKDNGSMPTFFMNMFTWFTP--GYDNGKTTINVGSNIDKVWQSNVNN--NQTYDFFV 589 Query: 651 HPDFAGISVEHLSS--YGDLDPQEMPLLILNGFEYVTQVGNDPYAIPLRADTSKPKLTQQ 708 H + ++VE LSS Y LDP+ P+LIL +E T + D ++ + AD ++PKLT Sbjct: 590 HGSYK-LNVEPLSSGSYAGLDPKTTPVLILQAYE--TGLFGDGMSVKVLADIAQPKLTTA 646 Query: 709 DVTDLIAYLNKGGSVLIMENVMSNLKEESASGFVRLLDAAGLSMA-----------LNKS 757 DVT LI Y+N GG+VL M+ + ++ + RL D AG+++ +S Sbjct: 647 DVTALIKYINAGGNVLFMDGI----EQLNPEPIARLADTAGIALGGANLARTRQAYCGES 702 Query: 758 VVNNDPQGYPNRVRQQRATGIWVYERYPAVDGALPYTIDSKTGEVKWKYQVENKPDDKPK 817 P YPN R + YE++ + + ++ G V + P DKP+ Sbjct: 703 YYCQAP--YPN-ARASFTDTLVTYEKF----DDMSKFVVNQDGTVNFP-----SPIDKPE 750 Query: 818 LEVASWLED-VDGKQETRYAFIDEADHKTEDSLKAAKEKIFAAFPGLKECTNPAYHYEVN 876 +A + DG ++ +AF KTE A KI AAFP +KECT+ +Y YE+ Sbjct: 751 FGIAQFKTTAEDGSEQDNFAFYSV---KTEAERLEAVAKIKAAFPKVKECTDASYDYEIG 807 Query: 877 CLEYRPGTGVPVTGGMYVPQYTQLSLNADTAKAMVQAADLGTNIQRLYQHELYFRTNGRK 936 C+E R G G+ Y P++T+ ++ D MV+AA+LG N+++LYQHE+Y+R+ G++ Sbjct: 808 CIETRKGHGLATGSRYYRPRFTRYEISPDVVNTMVKAANLGGNVEKLYQHEIYYRSQGKE 867 Query: 937 GERLSSVDLERLYQNMSVWLWNDTSYRYEEGKNDELGFKTFTEFLNCYANDAYAGGTKCS 996 G RLS +L + Y N+S+W WND Y Y DELGFK TEFLNCY +D + C+ Sbjct: 868 GSRLSLNELNQTYDNLSIWFWNDEQYSYNSEVQDELGFKKATEFLNCYTSDVHQPDNACA 927 Query: 997 ADLKKSLVDNNMIYGDGSSKAGMMNPSYPLNYMEKPLTRLMLGRSWWDLNIKVDVEKYPG 1056 AD ++ L+ M+ +G +NPSYPLNY EKPLTR+MLGRS+WD +I VD E YPG Sbjct: 928 ADTREKLLKYGMLTS-----SGELNPSYPLNYQEKPLTRIMLGRSYWDNDISVDTEMYPG 982 Query: 1057 AVSEEGQNVTETISLYSNPTKWFAGNMQSTGLWAPAQKEVTIKSNANVPVTVTVALADDL 1116 + EG N + I ++N A NMQSTGLWA + VT+ N T+TVAL DD+ Sbjct: 983 NTAAEGSNASVQIETFNNAVVGTANNMQSTGLWAVKRSVVTVS--GNHDATITVALVDDV 1040 Query: 1117 TGREKHEVALNRPPRVTKTYSLDASGTVKFKVPYGGLIYIKG-NSSTNESASFTFTGVVK 1175 TG+ +HE++L RP RV K++S A T + PYGGLIYIK ++ T F F+GV++ Sbjct: 1041 TGKHEHELSLKRPSRVQKSWSHKAGSTTEIIAPYGGLIYIKPASTDTANRVEFNFSGVLE 1100 Query: 1176 APFYKDGAWKNDLNSPAPLGELESDAFVYTTPKKNLNASNYTGGLEQFANDLDTFASSMN 1235 A +K+G W+N +N PL E+ + FVYTTP N+ ++ FA+ ++ FA + Sbjct: 1101 ASLWKNGQWQNPVNQEVPLAEVVTGQFVYTTPVNNV----TDTDIQAFASGMNDFAEKAS 1156 Query: 1236 DFYGRDSEDGKHRMFTYKNLPGHKHRFTNDVQISIGDAHSGYPVMNSSFSPNSTTLPTTP 1295 DF+ RD+ DG R FT K LP H HRF NDVQISIG AHSGYPVM+++++ ++ ++PT P Sbjct: 1157 DFHARDNSDGNMR-FTGKLLPEHSHRFVNDVQISIGAAHSGYPVMSTTYNRDANSIPTIP 1215 Query: 1296 LNDWLIWHEVGHNAAETPLTVPGATEVANNVLALYMQDRY---LGKMNRVADDITVAPEY 1352 NDWL+WHE+GHN A P V GATEVANN+LALYMQD G+M+RV DI AP Sbjct: 1216 DNDWLLWHEIGHNLAAAPFNVKGATEVANNLLALYMQDLRDNGDGQMDRVKTDIQKAPMM 1275 Query: 1353 LEESNNQAWARGGAGDRLLMYAQLKEWAEKNFDIKKWYPDGTPLPEFYSEREGMKGWNLF 1412 + W+ G AG RL+M+AQLK WA+++F I + G +P +Y E E GWN+F Sbjct: 1276 ISRDEGHVWSHGNAGSRLVMFAQLKVWAQEHFKIADHFK-GQTIPSYYGEDE---GWNMF 1331 Query: 1413 QLMHRKARGDEVSNDKFGGKNYCAESNGNAADTLMLCASWVAQTDLSEFFKKWNPGANAY 1472 +LMH +AR + S+ + + D LM C S V+ DL+ FF+ WNP + Sbjct: 1332 KLMHHEARNNNNSSCSAQ-----NANGLSQGDLLMACTSAVSGYDLTPFFEAWNPSEVSV 1386 Query: 1473 QLPGASEMSFEGGVSQSAYNTLASLDLPKPEQGPETINQVT 1513 S ++GG++ + SLDLP+P+ PETI+ + Sbjct: 1387 ITADGSR-DYQGGITADGIGYVKSLDLPEPKVKPETIDYIN 1426 >UniRef50_Q5E705 Accessory colonization factor AcfD-like protein, predicted inner membrane lipoprotein n=1 Tax=Vibrio fischeri ES114 RepID=Q5E705_VIBF1 Length = 1569 Score = 1292 bits (3342), Expect = 0.0, Method: Composition-based stats. Identities = 511/1648 (31%), Positives = 775/1648 (47%), Gaps = 229/1648 (13%) Query: 9 KSLLAAILSATLLAGCDGGGSGSSSDTPPVDSGTGSLPEVKPDPTPNPEPTPEPTPDPEP 68 K LL A L LLAGC+ +G G++P++ D P Sbjct: 3 KKLLLASLIPMLLAGCNQEEIN--------INGNGTVPDIGGDGGVTP------------ 42 Query: 69 TPEPIPDPEPTPEPEPEPVPTKTGYLTLGGSQRVTGATCNGESSDGFTFKP--------G 120 P+P P P K ++ + GATC+G SD Sbjct: 43 -------------PKPTPDPIKYRFMITSSGAPIEGATCDGRLSDHLGVIALDYDSNTLP 89 Query: 121 EDVTCVAGNTTIATFNTQSEAARSLRAVEKVSFSLEDAQELAGSDDKKSNAVSLVTSSN- 179 + + C+ T +ATF T + + +RA + + + D + D+ N SL+ + + Sbjct: 90 QSIDCLIAGTPLATFATSTN--KRVRAQD-YNLDIADGKISDLQGDQLVNIQSLLRTVDA 146 Query: 180 -SCPANTEQVCLTFSSVIESKRFDSLYKQIDLAPEEFKKLVNEEVENNAATDKAPST--- 235 +N Q SV K + + Y A E+ ++L ++ DK Sbjct: 147 DGVDSNGYQFVEGEKSV---KNYSANYAD---ALEKTQELFLKDNIGYLKDDKPAGVEPG 200 Query: 236 HTSPVVPVTTPGTKPDLNAS-FVSANAEQFYQYQPTEIILSEGRLVDSQGYGVAGVNYYT 294 H + V PV TPG+ S VS+NAE+ Y+Y+P E+ + E ++ G V GV YY Sbjct: 201 HGTDVEPVVTPGSDDVTGGSGIVSSNAEKQYEYKP-EVAVPEKSILMLDGQPVVGVEYYG 259 Query: 295 NSGRGVTGENGEFSFSWGETISFGIDTFELGSVRGNKSTIALTELG-DEVRGANIDQLIH 353 + RG T +G F ++WG+ ++FGI LGS++ + L L D + N++ LI Sbjct: 260 PTYRGKTDVDGSFEYNWGDEVTFGIQALTLGSIKAKGLDVQLGALAADPSKSKNVENLIK 319 Query: 354 RYSTTGQNNTRVVPDDVRKVFAEYPNVINEIINLSLSNGATLGEGEQVVNLP---NEFIE 410 ++ V+ DV + FA N I E+IN++L++G T P NEFI Sbjct: 320 QFDKDNS-APWVIEQDVHERFALESNNIVELINMNLASGDTSNFDPSFGTPPQVKNEFIA 378 Query: 411 QFNTG-QAKEIDTAICAKTDGCNE---ARWFSLTTRNVNDGQIQGVINKLWGVDTNYKS- 465 QFN G A +I T++ N + + SL + N + ++ + D N + Sbjct: 379 QFNDGGSAFDIVTSLGLSPINVNTYTFSPYVSLRSVNRVETASDALLTMMGQNDNNKDND 438 Query: 466 VSKFHVFHDSTNFYGSTGNARGQAVVNISNAAFPILMARNDKNYWLAFGEKRAWDKNELA 525 V+ FHVF + + Y + +NI N A P++M R+D N ++ FG+ DK Sbjct: 439 VTHFHVFGNQNDGY-HMPSPHAATFINIDNNAAPVVMPRSDLNAYIPFGQLAVTDKFSRP 497 Query: 526 YIT------EAPSLVEPENVT---------------------RDTATFNLPFISLGQVGE 558 + T P+ ++ +N T ++TATF LPF+ G++GE Sbjct: 498 FFTLTNDPKTTPTYIDAKNKTHWNLKEQDVAADSVESSYKMNKETATFELPFVVSGKIGE 557 Query: 559 GKLMVIGNPHYNSILRCPNGYSWNGGVNKDGQCTLNS---DPDDMKNFMENVLRYLSDDK 615 GK++V+GN YNSIL CP YS+N +NKDG C+ + D DM NF N +L K Sbjct: 558 GKVLVLGNSLYNSILVCPENYSFNASINKDGVCSNGNGVTDSLDMFNFFVNAFNWLDTKK 617 Query: 616 WKPDAKASMTVGTNLDTVYFKRHGQVTGNSAAFDFHPDFAGISVEHLSSYG--DLDPQEM 673 D + + TN V F R S F + F + +S + LD Sbjct: 618 LNQD----INIATNRSEVSFSRISGSA--SHPFKLNESFKALGFRLMSDFSPQGLDVAST 671 Query: 674 PLLILNGFEYVTQVGNDPYAIPLRADTSKPKLTQQDVTDLIAYLNKGGSVLIMENVMSNL 733 P+ IL + T GN R D P + DV LI Y+N+GGSVLIME+ L Sbjct: 672 PIYILQAY--PTLGGNTD-----RPDYENPIINDDDVNALIDYVNQGGSVLIMES----L 720 Query: 734 KEESASGFVRLLDAAGLSMALNKSVVNNDPQGYPNRVRQQRATG------IWVYERY--- 784 + R LD AG+S A+ K+ P+ + G ++ E Y Sbjct: 721 YNRNLPILGRFLDTAGIS-AIGKNNGVKFAGKLPSNFIAELGKGGTSLRPVYTEEVYVLE 779 Query: 785 -------PAVDGALP---YTIDSKTGEVKWKYQVENKPDDKPKLEVASWLEDVDGKQETR 834 +P Y D T W + ++K L + V + E R Sbjct: 780 GLAFEANTTDANGIPEKGYKFDKGTNTYYWG---DKSINNKAVLRALYRPQQV--RDENR 834 Query: 835 YAFIDEADHKTE---DSLKAAKEKIFA--AFPGLKE-------------CTNPAYHYEVN 876 + E +T+ D+L+A + A GL++ CTN AY Y+++ Sbjct: 835 HNVTKECQQQTDLVDDALQACIDTKLNQLAQDGLEQWVSDVQAIYEVPMCTNSAYQYQLD 894 Query: 877 CLEYRPGTGVPVTGGMY------VPQYTQLSLNADTAKAMVQAADLGTNIQRLYQHELYF 930 C+E R G G+P++ +Y V + +L ++ D + AM++AA++GTN+ LYQHELY+ Sbjct: 895 CIERREGNGIPLSKTIYPGATDMVQAFARLPMSKDVSNAMIEAANMGTNLTDLYQHELYY 954 Query: 931 RTNGRKGERLSSVDLERLYQNMSVWLWNDTSYRYEEGKNDELGFKTFTEFLNCYANDAYA 990 RT G++G RLS +D++R+Y N+S W+WN+ YRY+ DE G KT EFLNCY+N+ Y Sbjct: 955 RTGGKEGVRLSGIDVDRIYNNLSAWMWNNEQYRYDSSTKDEFGHKTVVEFLNCYSNNTYG 1014 Query: 991 GGTK-----CSADLKKSLVDNNMI-YGDGSSKAGMMNPSYPLNYMEKPLTRLMLGRSWWD 1044 + C +LK ++ + DG +K +NPSYPLNYMEKPLTR+MLGRS++D Sbjct: 1015 NPSSDNVIGCPEELKAEMLTKGFLVTVDGVNK---LNPSYPLNYMEKPLTRMMLGRSYFD 1071 Query: 1045 LN----------IKVDVEKYPGAVSEEGQNVTETISLYSNPTKWFAGNMQSTGLWAPAQK 1094 ++ ++VDV YPG + TI G+ QS G+W PA++ Sbjct: 1072 VDAKNPTAEDRGVQVDVRSYPGVATTTAAAKDITIHK---------GSRQSAGVWIPARE 1122 Query: 1095 EVTIKSNANVPVTVTVALADDLTGREKHEVALNRPPRVTKTYSLDASGTVKFKVPYGGLI 1154 + + TV +A+AD+LTGR HE+ALNRPPRV+ +++ + FKVPYGG + Sbjct: 1123 VAYVH-GLSSDDTVMIAMADNLTGRVNHEMALNRPPRVSMSFN-GVEASNGFKVPYGGSV 1180 Query: 1155 YIKGNSSTNESASFTFTG-VVKAPFY-----KDGAW-KNDLNSPAPLGELESDAFVYTTP 1207 YI S ESA +F G + AP + +G+W S AP+ E+ F YTT Sbjct: 1181 YITLGSK--ESAQVSFGGSAIAAPMFMMTSATEGSWITTPEESDAPITEIVGKRFSYTTT 1238 Query: 1208 KKNLNASNYTGGLEQFANDLDTFASSMNDFYGRDSEDGKHRMFTY--KNLPGHKHRFTND 1265 + + + + D F +N+FYGRD G H+MFT L R +D Sbjct: 1239 TAGIKGHSEV-DVLEMTKQFDLFTIGVNEFYGRDGVSGAHKMFTDSAPELEYQNMRLVDD 1297 Query: 1266 VQISIGDAHSGYPVMNSSFSPNSTTLPTTPLNDWLIWHEVGHNAAETPLTVPGATEVANN 1325 +QISIG AHSGYPVM++SF ++L ++W++ HE+GHN A L V GA E ANN Sbjct: 1298 IQISIGSAHSGYPVMSTSFPRQKSSL-FKATDNWMLGHEIGHNQAANWLNVVGAGETANN 1356 Query: 1326 VLALYMQDRYLGKMNRVADDITVAPEYLEESNNQAWARGGAGDRLLMYAQLKEWAEKNFD 1385 VLALY Q+R G M R+ IT A E+ + + WA G DRL + QLK WAE NFD Sbjct: 1357 VLALYTQERNTGDMPRIKVSITNATEW--ANGDHPWADGTNADRLNFFGQLKLWAEDNFD 1414 Query: 1386 IKKWYPDGTPLPE--FYSEREGM---KGWNLFQLMHRKARGDEVSNDKF--GGKNYCAE- 1437 I +W + E Y++ E +GWN ++ +HR AR E + G NYC+ Sbjct: 1415 IAQWESEAKLAEERSIYNKNEAGQYDQGWNFYKYLHRAARMPETFTEGLNKGDVNYCSSE 1474 Query: 1438 ----SNGNAADTLMLCASWVAQTDLSEFFKKWNPGANAYQLPGASEMSFEGGVSQSAYNT 1493 ++ + D +M+C+S++ D+ FF KW G + L + + G+S+ A Sbjct: 1475 FAQVNSLSKQDMMMICSSFLTGKDIETFFIKWKFGESKVTLQSGDK--YSVGISEPALGV 1532 Query: 1494 LASLD----LPKPEQGPETINQVTEHKM 1517 + + + P+ P I + ++ Sbjct: 1533 MEDMRKQGIIVTPKTSPLDIKETNPKEV 1560 >UniRef50_UPI000178AA33 hypothetical protein GYMC10_4678 n=1 Tax=Geobacillus sp. Y412MC10 RepID=UPI000178AA33 Length = 1078 Score = 363 bits (930), Expect = 4e-98, Method: Composition-based stats. Identities = 119/538 (22%), Positives = 196/538 (36%), Gaps = 79/538 (14%) Query: 991 GGTKCSADLKKSLVDNNMIYGDGSSKAGMMNPSYPLNYMEKPLTRLML----GRSWWDLN 1046 GGT S + L +I D S + ++PL+ + P T +L GR DL Sbjct: 332 GGTVSSITPQSPL--YAVIEQDASDLESQL--TFPLDRSKAPYTSALLAFQLGRIGNDLT 387 Query: 1047 IK--VDVEKYPGAVSEEGQNVT-ETISLYSN-------PTKWFAGNMQSTGLWAPAQKEV 1096 +++PGAV + VT +++ + + N STGL+APA + V Sbjct: 388 APKSPYADQFPGAVPGDAPRVTGQSVHVNFDYSTYDYLRQGTVPKNWISTGLYAPAGEWV 447 Query: 1097 TIKSNANVPVTVTVALADDLTGREKHEVALNRPPRVTKTYSLDASGTVKFKVPYGGLIYI 1156 TI + V + RPP +T+ L SG + + PYGGLIY+ Sbjct: 448 TIHVPEGT-QNLDVQVGAHTDNLTSK-TEWERPPVITQRKPLL-SGENRIRSPYGGLIYL 504 Query: 1157 KGNSSTNESAS-FTFTGVVKAPFYKDGA-----WKNDLNS-PAPLGELESDAFVYTTPKK 1209 A +G V+AP+Y G W++ + PAP EL+ + T P + Sbjct: 505 IPTKPQPSVAKDVEISGGVRAPYYILGETSPSEWEDAIRHHPAPWAELQGRRVILTLPSE 564 Query: 1210 NLNASNYTGGLEQFANDLDTFASSMNDFYGRDSEDGKHRMFTYKNLPGHKHRFTNDVQIS 1269 + +Q D + G + R+ D QIS Sbjct: 565 YIRQL---EDPQQLVEKWDAIVDYTEEVAGLSPDQQLPHKSI-----DLPFRYVADRQIS 616 Query: 1270 IGDAHSGYPVM--NSSFSPNSTTLPTTPLNDWLIWHEVGHNAAETPLTVPGATEVANNVL 1327 G H+GYP+M + ++ + W WHE GH + E+ N+ Sbjct: 617 AGYMHAGYPIMFHIDPSAGHAVDISRVTQGGWGFWHETGHEYQQGAWNWNVTGEITVNIY 676 Query: 1328 ALYMQDRYLGKMNRVADDITVAPEYLEESNNQAWARGGAGDRLLMYAQLKEWAEKNFDIK 1387 +LY+Q+++ N + + ++ + + + G QL Sbjct: 677 SLYVQEKFGNSSNLLIRNAE-GKDFYDRAFD-YIESDLPGKSFGTSGQL----------- 723 Query: 1388 KWYPDGTPLPEFYSEREGMKGWNLFQLMHRKARGDEVSNDKFGGKNYCAESNGNAADTLM 1447 D + + GW+ + +H+ R S +N DT + Sbjct: 724 ----DLFGYLVMFRQLSLAYGWDFYAELHKAYRELPASQ--------LPATNQEEIDTFV 771 Query: 1448 LCASWVAQTDLSEFFKKWNPGANAYQLPGASEMSFEGGVSQSAYNTLASLDLPKPEQG 1505 + AS A +L+EFF KW + + QS +A+L+LP P Q Sbjct: 772 IMASKTAGENLTEFFDKW-------------ALPYSKAEVQS---RIAALNLPLPSQE 813 >UniRef50_UPI00017445F8 hypothetical protein VspiD_04825 n=1 Tax=Verrucomicrobium spinosum DSM 4136 RepID=UPI00017445F8 Length = 763 Score = 355 bits (911), Expect = 7e-96, Method: Composition-based stats. Identities = 105/520 (20%), Positives = 177/520 (34%), Gaps = 73/520 (14%) Query: 986 NDAYAGGTKCSADLKKSLVDNNMIYGDGSSKAGMMNPSYPLNYMEKPLT-------RLML 1038 +D G L N + G G + P E+PLT R+ L Sbjct: 284 DDIGQGTNAIQVALAAQPPGRNSLQGAVMGALGEGDSGVP--TREQPLTMAQHASQRIRL 341 Query: 1039 GR--------SWWDLNIKVDVEKYPGAVSEEGQNVTETISLYSNPTKWFAGNMQSTGLWA 1090 G S + +PG +++ VT+ I++ +N STGL+A Sbjct: 342 GMETRVLRLASSPTVAAHPASAVFPGQPAKDAPRVTKEITVDANID-----GWTSTGLYA 396 Query: 1091 PAQKEVTIKSNANVPVTVTVALADDLTGREKHEVALNRPPRVTKTYSLDASGTVKFKVPY 1150 A + +T+ + + H + R P +T+T ++ K + Sbjct: 397 VAGEPITVTVPEAMVGNGFSIRVGCHSDTLYHLESWRRAPDITRTVGIENVE-TKMGTAF 455 Query: 1151 GGLIYIKGN----SSTNESASFTFTGVVKAPFYKDG-----AWKNDLNSPAPLGELESDA 1201 GGL+YI + +E G V++P++K G W N + P EL + Sbjct: 456 GGLVYITVPGRARRNASEPFKVKIAGAVESPYFKLGRDTDEQWNNIKKAQGPWAELAGEK 515 Query: 1202 FVYTTPKKNLNASNYTGGLEQFANDLDTFASSMNDFYGRDSEDGKHRMFTYKNLPGHKHR 1261 V + P + L +F D ++ +D R Sbjct: 516 MVVSLPSEVARKITNPTELMEF---WDRVVTAQDDISN------------QTAERTRPER 560 Query: 1262 FTNDVQISIGDAHSGYPVM-NSSFSPNSTTLPTTPLNDWLIWHEVGHNAAETPLTVPGAT 1320 DVQIS G HSGYP+M ++ S T W +HE+GHN T G Sbjct: 561 MVADVQISAGFMHSGYPIMIHTPESAEMVTYGRIKYPGWGFYHEIGHNHQRGNFTFEGTG 620 Query: 1321 EVANNVLALYMQDRYLGKMNRVADDITVAPEYLEESNNQAWARGGAGDRLLMYAQLKEWA 1380 EV NNV LY L K + V+PE +++ + A G++ + Sbjct: 621 EVTNNVFGLYCYTEVLKKELLIGHG-GVSPESIKKHIDAAKKAKDQGEKWAI-------- 671 Query: 1381 EKNFDIKKWYPDGTPLPEFYSEREGMKGWNLFQLMHRKARGDEVSNDKFGGKNYCAESNG 1440 W D Y + GW ++ + + +++ Sbjct: 672 --------WKGDPFHALTTYVQLVQGFGWENYKKY--------IWSFADPSFGPTPKNDE 715 Query: 1441 NAADTLMLCASWVAQTDLSEFFKKWNPGANAYQLPGASEM 1480 D ++ S + + +L FF+ W + S++ Sbjct: 716 EKRDQFLIRFSKITKKNLGPFFEFWGIPVTSSAKAEVSKL 755 >UniRef50_B4DBQ1 Putative uncharacterized protein n=1 Tax=Chthoniobacter flavus Ellin428 RepID=B4DBQ1_9BACT Length = 759 Score = 354 bits (907), Expect = 2e-95, Method: Composition-based stats. Identities = 87/450 (19%), Positives = 151/450 (33%), Gaps = 59/450 (13%) Query: 1042 WWDLNIKVDVEKYPGAVSEEGQNV-TETISLYSNPTKWFAGNMQSTGLWAPAQKEVTIKS 1100 + + +PGAV + + + + ++ STGL+A + + ++ Sbjct: 349 PEQIRANPVADVFPGAVPKNAPRLPNRVVVIDTSVPA-----WHSTGLYAAPGELIKVQV 403 Query: 1101 NANVPVTVTVALADDLTGREKHEVALNRPPRVTKTYSLDASGTVKFKVPYGGLIYIKGNS 1160 A + + H R P +++ + + P+GGLIYI+ Sbjct: 404 PAELADKGLAVRIGCHSDSLFHLEKWQRAPEISRRDPIKTPASTA-ANPFGGLIYIEVPD 462 Query: 1161 S-TNESASFTFTGVVKAPFYKDGA-----WKNDLN-SPAPLGELESDAFVYTTPKKNLNA 1213 T + +G V++P + G WK L +PAP ELE+ + + P + + Sbjct: 463 KLTAAKVNVAISGGVESPRFVLGETKLLEWKMRLRMAPAPWAELETKKVILSVPSEKIRQ 522 Query: 1214 SNYTGGLEQFANDLDTFASSMNDFYGRDSEDGKHRMFTYKNLPGHKHRFTNDVQISIGDA 1273 + E D + D T + R DVQIS G Sbjct: 523 LD---DPEALLKFWDQILDAEADLA------------TIPHERKRPERIVPDVQISAGYM 567 Query: 1274 HSGYPVMN--SSFSPNSTTLPTTPLNDWLIWHEVGHNAAETPLTVPGATEVANNVLALYM 1331 HSGYP+M +S +L W +HE+GHN T G EV N+ +LY Sbjct: 568 HSGYPIMTPLDKSVEHSLSLVEMKQGSWGHFHELGHNHQVGDWTFDGTVEVTCNLFSLYC 627 Query: 1332 QDRYLGKMNRVADDITVAPEYLEESNNQAWARGGAGDRLLMYAQLKEWAEKNFDIKKWYP 1391 + GK D + PE +E+ + +R W Sbjct: 628 METLCGKPPGQGHDA-MKPEAVEKRLRGYLSSTDKFNR-------------------WKS 667 Query: 1392 DGTPLPEFYSEREGMKGWNLFQLMHRKARGDEVSNDKFGGKNYCAESNGNAADTLMLCAS 1451 D Y + GW ++ + + R +++ D ++ S Sbjct: 668 DPFLALIMYHQLRVGFGWETYKKVFAEYRDLSKEQR--------PKTDEEKHDQWLVRFS 719 Query: 1452 WVAQTDLSEFFKKWNPGANAYQLPGASEMS 1481 A +L FF W + + + Sbjct: 720 KAAGKNLGPFFDAWGIPTSTTARESINSLP 749 >UniRef50_UPI0001B9ED55 hypothetical protein GYMC10_4682 n=1 Tax=Geobacillus sp. Y412MC10 RepID=UPI0001B9ED55 Length = 836 Score = 339 bits (870), Expect = 4e-91, Method: Composition-based stats. Identities = 123/540 (22%), Positives = 197/540 (36%), Gaps = 99/540 (18%) Query: 997 ADLKKSLVDNNMIYGDGSSKAGMMNP-SYPLNYMEKPLTRLMLGRSWWDLNIKVDVEK-- 1053 + ++L + + ++AG + P +P+ ++P T + + + EK Sbjct: 359 NETLETLSSESPLNAWADTEAGSLPPAEFPIQKFQQPYTNALHNFRFSHFTLDPANEKSP 418 Query: 1054 ----YPGAVSEEGQNVT-ETISLYSN------PTKWFAGNMQSTGLWAPAQKEVTIKSNA 1102 +PG VSEE V + + + + N STGL+AP K +T++ Sbjct: 419 YADAFPGVVSEEAAIVNDREVEVDFDFPNTMYTHALPSKNWISTGLYAPPGKVITLEVPE 478 Query: 1103 NVP-VTVTVALADDLTGREKHEVALNRPPRVTKTYSLDASGTVKFKVPYGGLIYIKGNSS 1161 V +TV + DD + R P + L G + PYGGLIY+ + Sbjct: 479 GVEHLTVQIGSHDD---DLRGSGKWERVPLIVNHQKL-TPGIHQVNSPYGGLIYLIPLKA 534 Query: 1162 TNE-SASFTFTGVVKAPFYKDG-----AWKNDLNSPA--PLGELESDAFVYTTPKKNLNA 1213 N+ A+ +G V+AP+Y G W+ P P EL+ + + T P + Sbjct: 535 KNDFRATVKISGAVEAPYYVLGKTTLEEWERIRTGPVTVPFAELQGERIILTVPSDLIRQ 594 Query: 1214 SNYTGGLEQFANDLDTFASSMNDFYGRDSEDGKHRMFTYKNLPGHKHRFTNDVQISIGDA 1273 E+ D S ++ G D + N R+ D QIS G Sbjct: 595 VA---DPEELMRTWDEIYDSYDELVGLDPDRAMPHTAHQLNR-----RYVADGQISSGAM 646 Query: 1274 HSGYPVM-NSSFSPNSTTLPTTPLNDWLIWHEVGHNAAETPLTVPGATEVANNVLALYMQ 1332 H+GYP+M S++ N + + W WHE+GH + T EV N+ +LY Q Sbjct: 647 HAGYPIMLPFSYAANLLDVHYVKTSAWGFWHELGHEYQQRTWTWSDVGEVTVNLFSLYTQ 706 Query: 1333 DRYLG-----KMNRVADDITVAPEYLEESNNQAWARGGAGD--RLLMYAQLKEWAEKNFD 1385 ++Y K+ D E+N+ A G G+ RL+M+ QL+ Sbjct: 707 EKYGNASELLKVGNDGKDYYDRGIAFVENNDPAKKYGQIGNYERLVMFKQLQL------- 759 Query: 1386 IKKWYPDGTPLPEFYSEREGMKGWNLFQLMHRKARGDEVSNDKFGGKNYCAESNGNAADT 1445 GW + + R DT Sbjct: 760 --------------------AYGWEFYTRIFETYRELSRDEI------------QGTVDT 787 Query: 1446 LMLCASWVAQTDLSEFFKKWNPGANAYQLPGASEMSFEGGVSQSAYNTLASLDLPKPEQG 1505 + AS A DL+EFF KW G++ +ASL+LP+P Sbjct: 788 FAVIASQTAGEDLTEFFDKWAI-----------------GLTDDGRARIASLNLPEPLAD 830 >UniRef50_C1ZFD9 Putative uncharacterized protein n=1 Tax=Planctomyces limnophilus DSM 3776 RepID=C1ZFD9_PLALI Length = 785 Score = 316 bits (809), Expect = 5e-84, Method: Composition-based stats. Identities = 98/459 (21%), Positives = 155/459 (33%), Gaps = 63/459 (13%) Query: 1034 TRLMLGRSWWDLNIKVDVEKYPGAVSEEGQNVTETISLYSNPTKWFAGNMQSTGLWAPAQ 1093 T L ++ +PGAV + VT +++ + T+ +STGL+A Sbjct: 367 TGYYLQLPLEEMKPAPTASVFPGAVPANAKKVTRKVTINTETTR-----WKSTGLYAAPG 421 Query: 1094 KEVTIKSNAN-VPVTVTVALADDLTGREKHEVALNRPPRVTKTYSLDASGTVKFKVPYGG 1152 V +K N V + + + RPP V + + +D + YGG Sbjct: 422 TLVKVKVPRNIVGQKFEIQIGSHSDSLWSKD-EWRRPPAVIRQFPIDKVE-FEVGNAYGG 479 Query: 1153 LIYIKGNSSTNESA-SFTFTGVVKAPFYKDGA-----WK-NDLNSPAPLGELESDAFVYT 1205 LIY+ T F+ VV AP++ G W+ N PAP ELE+ V T Sbjct: 480 LIYVVVPQKTPAGKFEVEFSNVVDAPYFVHGETDISDWRFTIRNYPAPWAELETRHLVIT 539 Query: 1206 TPKKNLNASNYTGGLEQFANDLDTFASSMNDFYGRDSEDGKHRMFTYKNLPGHKHRFTND 1265 P + + ++ ++ N + D Y + + RF D Sbjct: 540 VPSELVRKLDFP---DKLMNHWAAVLDACADLY------------SISRNRPYAERFVFD 584 Query: 1266 VQISIGDAHSGYPVMNS--SFSPNSTTLPTTP-LNDWLIWHEVGHNAAETPLTVPGATEV 1322 QIS G HSGYP+M +P L W +HE+GHN + T G EV Sbjct: 585 DQISAGFMHSGYPIMCFTNPSAPEVVDLNFLENKGGWGFYHELGHNHQKGDWTFQGTGEV 644 Query: 1323 ANNVLALYMQDRYLGKMNRVADDITVAPEYLEESNNQAWARGGAGDRLLMYAQLKEWAEK 1382 NN+ LY+ D K + D PE + Sbjct: 645 TNNLTPLYVIDTLTPKA--FSHDAIQQPERDNRERK--------------------YVMN 682 Query: 1383 NFDIKKWYPDGTPLPEFYSEREGMKGWNLFQLMHRKARGDEVSNDKFGGKNYCAESNGNA 1442 W D Y + + GW F+ + + + +S + Sbjct: 683 GAPFSTWQEDPFLALTMYIQLKEQFGWQPFRDVFLEYEKLQKDEH--------PKSEMDK 734 Query: 1443 ADTLMLCASWVAQTDLSEFFKKWNPGANAYQLPGASEMS 1481 D M+ S +L FF+ W + ++ Sbjct: 735 RDQWMVRFSRKVNRNLGPFFQYWGVPTSENARQMIKDLP 773 >UniRef50_B2ULE8 Putative uncharacterized protein n=1 Tax=Akkermansia muciniphila ATCC BAA-835 RepID=B2ULE8_AKKM8 Length = 660 Score = 299 bits (765), Expect = 6e-79, Method: Composition-based stats. Identities = 102/473 (21%), Positives = 157/473 (33%), Gaps = 73/473 (15%) Query: 1042 WWDLNIKVDVEKYPGAVSEEGQNVTETISLYSNPTKWFAGNMQSTGLWAPAQKEVTIKSN 1101 L + +PG Q V T+ + SN G STGL+AP E++ + Sbjct: 243 LASLKACPAAKDFPGVPENGAQTVRRTVEIDSNI-----GGWHSTGLYAPPGAEISCSLS 297 Query: 1102 ANVPVTVTVALADDLTGREKHEVALNRPPRVTKTYSLDASGTVKFKVPYGGLIYIKGNSS 1161 T R P +T G VK P GGL+Y+ Sbjct: 298 GAPKDGSISVRIGCHTDSLHKLDEWKRVPEITMQVP-AGRGRVKMVNPMGGLVYVNVGQR 356 Query: 1162 T--NESASFTFTGVVKAPFYKDG-----AWKNDL-NSPAPLGELESDAFVYTTPKKNLNA 1213 + +G V +P + G W L N+ AP GE+ + T P + L Sbjct: 357 PRRGKVFKVQISGAVPSPLFVMGKTTPEQWAEQLENTKAPWGEIRMPRLIVTMPVEQLKQ 416 Query: 1214 SNYTGGLEQFANDLDTFASSMNDFYGRDSEDGKHRMFTYKNLPGHKHRFTNDVQISIGDA 1273 +++ A L + + G D T + H RF D QIS G Sbjct: 417 CP---DVQKTAEFLQKNMALQDWIMGWD---------TKPDRLHHPMRFVVDRQISAGAG 464 Query: 1274 HSGYPVMNSSFSPNST-TLPTTPLNDWLIWHEVGHNAAETPLTVPGATEVANNVLALYMQ 1332 HSGYP M + NS T W +WHE+GHN P T+ G TEV+ N+ ++ + Sbjct: 465 HSGYPAMATKDWTNSIATGSIIHSGSWGLWHELGHNHQSPPFTMEGQTEVSVNIFSMVCE 524 Query: 1333 DRYLGKMNRVADDITVAPEYLEESNNQAWARGGAGDRLLMYAQLKEWAEKNFDIKKWYPD 1392 GK + P + + ++ + QL W E Sbjct: 525 VMGTGKDFESCWGGGMGPYGMSAEMKKYFSGTQTYNEAPNKVQLFFWVE----------- 573 Query: 1393 GTPLPEFYSEREGMKGWNLFQLMHRKARGDEVSNDKFGGKNYCAESNGNAADTLMLCASW 1452 +Y G++ F+ + + N + S+ + +M S Sbjct: 574 ----LMYYL------GFDAFRQVALQFHDKPYDNGEL--------SDEKKWEWVMNAFSK 615 Query: 1453 VAQTDLSEFFKKWNPGANAYQLPGASEMSFEGGVSQSAYNTLASLDLPKPEQG 1505 V ++ FFK W VS+ A + L P + Sbjct: 616 VTGKNMGPFFKIWRTP-----------------VSERATGRMKDLPAWLPSKD 651 >UniRef50_A4IG42 Protein FAM115 n=14 Tax=Clupeocephala RepID=F115_DANRE Length = 912 Score = 294 bits (753), Expect = 1e-77, Method: Composition-based stats. Identities = 99/473 (20%), Positives = 152/473 (32%), Gaps = 78/473 (16%) Query: 1030 EKPLTRLML--GRSWWDLNIKVDV------EKYPGAVSEEGQNVTETISLYSNPTKWFAG 1081 E P LML G + + D + P + V T + Sbjct: 486 ESPKDHLMLHIGTEVYKVTPDPDALLPYIIKDRPNLPTLSNARVRITAN------TGGCE 539 Query: 1082 NMQSTGLWAPAQKEVTIKSNANVPV---TVTVALADDLTGREKHEVALNRPPRVTKTYSL 1138 STGL+ + I + V + D G L R P V + L Sbjct: 540 EWISTGLYLSPGMKTYIAVPPEIVGKNWQVQLGCQTDNIGGSN---TLKRAPVVHARFPL 596 Query: 1139 DASGTVKFKVPYGGLIYIKGNSSTN-ESASFTFTGVVKAPFYKDGA-----WKN-DLNSP 1191 D S V+ +GGLIY+ S T + V+AP++K G W + +P Sbjct: 597 D-SEMVQVWNLWGGLIYLIAPSQTKVDGVEIVVQNAVQAPYFKSGETSVADWVSHIRQAP 655 Query: 1192 APLGELESDAFVYTTPKKNLNASNYTGGLEQFANDLDTFASSMNDFYGRDSEDGKHRMFT 1251 AP ELE + + T + + ++ A DT ++ D R + Sbjct: 656 APWAELEFENLIMTFDSAFIRNLDRP---DEVAKLWDTIMRTITDLAARPPK-------- 704 Query: 1252 YKNLPGHKHRFTNDVQISIGDAHSGYPVM-NSSFSPNSTTLPTT-PLNDWLIWHEVGHNA 1309 K RF DVQIS G H+GYP+M +S +P + W HE+GHN Sbjct: 705 ----LPRKERFVADVQISYGFMHAGYPIMMHSGSAPGLVNVEEAYKCGLWGAIHELGHNQ 760 Query: 1310 AETPLTVP-GATEVANNVLALYMQDRYLGKMNRVADDITVAPEYLEESNNQAWARGGAGD 1368 P TE N+ +LY+ ++ G + A + P + + G + Sbjct: 761 QRGVWEFPPHTTECTCNLWSLYVHEQVFGIKSANAHPA-ITPADRQARTKMYFDGGKDLN 819 Query: 1369 RLLMYAQLKEWAEKNFDIKKWYPDGTPLPEFYSEREGMKGWNLFQLMHRKARGDEVSNDK 1428 M+ E Y + + GW+ F+ + Sbjct: 820 SWCMW---------------------MALETYMQLQEKFGWDAFKKVFSLYHDMTG---- 854 Query: 1429 FGGKNYCAESNGNAADTLMLCASWVAQTDLSEFFKKWNPGANAYQLPGASEMS 1481 N + S V +LS FFK W S + Sbjct: 855 ------VPNDNAGKMNLYAQTFSKVVNLNLSPFFKAWGWPIQPNTEQNLSHLP 901 >UniRef50_Q9Y4C2 Protein FAM115A n=54 Tax=Amniota RepID=F115A_HUMAN Length = 921 Score = 291 bits (744), Expect = 2e-76, Method: Composition-based stats. Identities = 92/458 (20%), Positives = 148/458 (32%), Gaps = 66/458 (14%) Query: 1038 LGRSWWDLNIK-VDVEKYPGAVSEEGQNVTETISLYSNPTKWFAGNMQSTGLWAPAQKEV 1096 L S DL++ ++E + T+ + STGL+ P ++ + Sbjct: 499 LAHSGSDLSLLVPEIEDMYSSPYLRPSESPITVEVNCTNP-GTRYCWMSTGLYIPGRQII 557 Query: 1097 TIKSNA---NVPVTVTVALADDLTGREKHEVALNRPPRVTKTYSLDASGTVKFKVPYGGL 1153 + + + + + D R L R P V LD +GGL Sbjct: 558 EVSLPEAAASADLKIQIGCHTDDLTRASK---LFRGPLVINRCCLDKPTK-SITCLWGGL 613 Query: 1154 IYIKGNSSTNE-SASFTFTGVVKAPFYKDG-----AWKNDLN-SPAPLGELESDAFVYTT 1206 +YI ++ S T G V AP+YK G WK + +P P GEL +D + T Sbjct: 614 LYIIVPQNSKLGSVPVTVKGAVHAPYYKLGETTLEEWKRRIQENPGPWGELATDNIILTV 673 Query: 1207 PKKNLNASNYTGGLEQFANDLDTFASSMNDFYGRDSEDGKHRMFTYKNLPGHKHRFTNDV 1266 P NL D ++ R+ R DV Sbjct: 674 PTANLRTLENPEP---LLRLWDEVMQAV------------ARLGAEPFPLRLPQRIVADV 718 Query: 1267 QISIGDAHSGYPVMNS--SFSPNSTTLPTTPLNDWLIWHEVGHNAAETPLTV-PGATEVA 1323 QIS+G H+GYP+M S W HE+G N P TE Sbjct: 719 QISVGWMHAGYPIMCHLESVQELINEKLIRTKGLWGPVHELGRNQQRQEWEFPPHTTEAT 778 Query: 1324 NNVLALYMQDRYLGKMNRVADDITVAPEYLEESNNQAWARGGAGDRLLMYAQLKEWAEKN 1383 N+ +Y+ + LG + R +I + P E+ ++G +K W Sbjct: 779 CNLWCVYVHETVLG-IPRSRANIALWPPVREKRVRIYLSKGP---------NVKNW---- 824 Query: 1384 FDIKKWYPDGTPLPEFYSEREGMKGWNLFQLMHRKARGDEVSNDKFGGKNYCAESNGNAA 1443 + E Y + + GW F + + R N + Sbjct: 825 --------NAWTALETYLQLQEAFGWEPFIRLFTEYRNQTN----------LPTENVDKM 866 Query: 1444 DTLMLCASWVAQTDLSEFFKKWNPGANAYQLPGASEMS 1481 + + S Q +L+ FF+ W + + Sbjct: 867 NLWVKMFSHQVQKNLAPFFEAWAWPIQKEVATSLAYLP 904 >UniRef50_A4GHK6 Putative uncharacterized protein n=1 Tax=uncultured marine bacterium EB0_35D03 RepID=A4GHK6_9BACT Length = 1173 Score = 284 bits (727), Expect = 1e-74, Method: Composition-based stats. Identities = 96/475 (20%), Positives = 175/475 (36%), Gaps = 69/475 (14%) Query: 1044 DLNIKVDVEKYPGAVSEEGQNVTETISLYSN------PTKWFAGNMQSTGLWAPAQKEVT 1097 D+ + PG E V + N STGL+A A +++T Sbjct: 527 DVIAHPGAQFTPGLPIEGSSPVVHKFQINKIISPTDVHIHMSGDNWFSTGLFASAGQKIT 586 Query: 1098 IKSNAN-VPVTVTVALADDLTGRE-KHEVALNRPPRVTKTYSLDASGTVKFKVPYGGLIY 1155 IK + V + + + + G + + + R P +T ++LD S + +GGLIY Sbjct: 587 IKVPKDLVNAGLKIQIGSHIWGDYIFNHMDMRRFPYITYQWNLDQSEVI-VNSSFGGLIY 645 Query: 1156 IKGNSST----NESASFTFTGVVKAPFYKDG-----AWKNDLN-SPAPLGELESDAFVYT 1205 I + +++S++ +G AP Y G WKN++ PAP E+ESD + T Sbjct: 646 IVDPVNQQINFPKTSSWSISGAYLAPRYIHGKTALNDWKNEIRKYPAPWAEIESDKVILT 705 Query: 1206 TPKKNLNASNYTGGLEQFANDLDTFASSMNDFYGRDSEDGKHRMFTYKNLPGHKHRFTND 1265 P + + L +F + D + + + R+ D Sbjct: 706 VPSHAIRDLDNPDDLMEFWAR---------------AIDAAADLASISRVREFPQRYVTD 750 Query: 1266 VQISIGDAHSGYPVMNS-SFSPNSTTLPTTPLNDW-LIWHEVGHNAAETPLTVPGATEVA 1323 G AH+GYP+M + + P LN W +HE+GHN G EV+ Sbjct: 751 PNWQWG-AHAGYPIMMAGPWYPYLLNHKKIGLNYWWGTFHELGHNHQMNDWMWDGWGEVS 809 Query: 1324 NNVLALYMQDRYLG-KMNRVADDITVAPEYLEESNNQAWARGGAGDRLLMYAQLKEWAEK 1382 N+ ++Y+ + G + D + + P ++ N+ RG + L Sbjct: 810 TNLWSVYILETIAGLERKNTWDGMLLFPGKRQKRINKFIDRGRSFAILQ----------- 858 Query: 1383 NFDIKKWYPDGTPLPEFYSEREGMKGWNLFQLMHRKARGDEVSNDKFGGKNYCAESNGNA 1442 D E + + + GW LF +H+ V S+ Sbjct: 859 --------ADPELALEHLLQLQEVFGWELFMALHQSYHDKPVHK---------NVSDNEK 901 Query: 1443 ADTLMLCASWVAQTDLSEFFKKWNPGANAYQLPGASEMSF---EGGVSQSAYNTL 1494 ++ S + +T+L F+K+W + + ++ + ++Q + Sbjct: 902 IQQFVIRTSQITKTNLINFYKEWGFPIERSTIDFLANFNYPSKDLAIAQKIAEAI 956 >UniRef50_C7PHT0 Putative uncharacterized protein n=1 Tax=Chitinophaga pinensis DSM 2588 RepID=C7PHT0_CHIPD Length = 770 Score = 274 bits (701), Expect = 2e-71, Method: Composition-based stats. Identities = 96/489 (19%), Positives = 162/489 (33%), Gaps = 73/489 (14%) Query: 1022 PSYPL---NYMEK---PLTRLMLG--RSWWD--LNIKVDVEKYPGAVSEEGQNVTETISL 1071 P PL N EK LTR G RS D + +PG V + +T +I++ Sbjct: 308 PQNPLILSNTEEKVRYHLTRRFFGKTRSLIDDKHAVSPGARYFPGLVPDTATRITTSITV 367 Query: 1072 YSNP-------TKWFAGNMQSTGLWAPAQKEVTIKSNANVPVTVTVALADDLTGREKHEV 1124 TGL+ P EV + + A Sbjct: 368 PVQVGTQGLLEPTAIYFRPHPTGLYVPPGTEVKVILQSKDKTQHLKAQIGVHNDDLADLT 427 Query: 1125 ALNR-PPRVTKTYSLDASGTVKFKVPYGGLIYIKGNSSTN-ESASFTFTGVVKAPFYKDG 1182 L R + +T+SLD T P+GGL+ + + +T + + T TGVVKAP++K G Sbjct: 428 QLTRSAENMVRTFSLDND-TTLIYSPFGGLLQLNVSDTTTLKEITITVTGVVKAPYFKLG 486 Query: 1183 A-----WKNDL-NSPAPLGELESDAFVYTTPKKNLNASNYTGGLEQFANDLDTFASSMND 1236 W N + N+PAP EL +D + T P + + L QF + + D Sbjct: 487 QTSEASWINSIRNNPAPWAELATDKIILTVPAYRIRQLDNPVKLMQF---WNEVMDADAD 543 Query: 1237 FYGRDSEDGKHRMFTYKNLPGHKHRFTNDVQISIGDAHSG---YPVMNSSFSPNSTTLPT 1293 + H R D ++ G ++ V + Sbjct: 544 LA------------RISRIRSHPERVVVDQDVAYGYMYTAPERIIVPDDQSCALMLDESQ 591 Query: 1294 TPLN-DWLIWHEVGHNAAETPLTVPGATEVANNVLALYMQDRYLGKMNRVADDITVAPEY 1352 N W ++HE+GH + EV N+ +++ ++ L Sbjct: 592 VRANGSWGLFHELGHRHQFWGIDFGELQEVTVNLFTMHVYNKVL---------------- 635 Query: 1353 LEESNNQAWARGGAGDRLLMYAQLKEWAEKNFDIKKWYPDGTPLPEFYSEREGMKGWNLF 1412 + + + ++ ++ + + N +KW D Y E GW Sbjct: 636 ----HKGIYNHEEIASKEIVLKKINNYLQNNPSFEKWGQDPFLALCMYIELIQQFGWQSI 691 Query: 1413 QLMHRKARGDEVSNDKFGGKNYCAESNGNAADTLMLCASWVAQTDLSEFFKKWNPGANAY 1472 + K R +S + + L V +T+L++FF W + + Sbjct: 692 EDTFTKYRAMPK--------EQYPQSQEDKRNLWFLTICDVTKTNLTQFFDIWKVPVSDH 743 Query: 1473 QLPGASEMS 1481 S Sbjct: 744 VKEKVSTYP 752 >UniRef50_Q5XHI4 Protein FAM115 n=4 Tax=Anura RepID=F115_XENLA Length = 905 Score = 272 bits (696), Expect = 5e-71, Method: Composition-based stats. Identities = 82/426 (19%), Positives = 146/426 (34%), Gaps = 60/426 (14%) Query: 1066 TETISLYSNPTKWFAGNMQSTGLWAPAQKEVTIKSNANVPVTVTVALADDLTGREKHEVA 1125 + T+ + +STGL+ +K ++ A+ + Sbjct: 511 SITVEIDGTNP--GNNAWRSTGLYLAPRKTAVLEFPASAVHQGLQVQVGCQSDDLSSADK 568 Query: 1126 LNRPPRVTKTYSLDASGTVKFKVPYGGLIYIKGNSSTNESA-SFTFTGVVKAPFYKDGA- 1183 R P V + + +D S V +GGL+YI +++N AP Y G Sbjct: 569 YCRAPVVVRRFHVD-SQRVSVSCFWGGLVYITVKANSNLGIIPVKVYEAEPAPIYIKGKT 627 Query: 1184 ----W-KNDLNSPAPLGELESDAFVYTTPKKNLNASNYTGGLEQFANDLDTFASSMNDFY 1238 W ++ N PAP EL ++ + T P + + + E + D ++ + Sbjct: 628 SLDTWIQSIRNLPAPWAELITENIILTVPSDAIRSLS---DPEALLSLWDKIMVAITELA 684 Query: 1239 GRDSEDGKHRMFTYKNLPGHKHRFTNDVQISIGDAHSGYPVMNS-SFSPNSTTLPTTPLN 1297 + RF DVQIS G H+GYP+M + T L Sbjct: 685 AIPKK------------LPRPERFVADVQISAGWMHAGYPIMCHLESAKELTDLNIMQTG 732 Query: 1298 D-WLIWHEVGHNAAETPLTV-PGATEVANNVLALYMQDRYLGKMNRVADDITVAPEYLEE 1355 W HE+GHN +T + P TE N+ ++Y+ + LG + R + E Sbjct: 733 GIWGPIHELGHNQQKTNWELPPHTTEATCNLWSVYVHETVLG-IPRSQAHCCLQAETRAN 791 Query: 1356 SNNQAWARGGAGDRLLMYAQLKEWAEKNFDIKKWYPDGTPLPEFYSEREGMKGWNLFQLM 1415 ++E+ +I++W + E Y + + GW F+ + Sbjct: 792 H-------------------IQEYLRNGSNIEQW--NVWTALETYLQLQEGFGWEPFKQL 830 Query: 1416 HRKARGDEVSNDKFGGKNYCAESNGNAADTLMLCASWVAQTDLSEFFKKWNPGANAYQLP 1475 + + ++ N + + S QT+L FF+ W Sbjct: 831 FKDYQSMSGIRNE----------NKSKMNLWAEKFSEAVQTNLVPFFEAWGWPIEEATHS 880 Query: 1476 GASEMS 1481 S + Sbjct: 881 KLSVLP 886 >UniRef50_B4DK02 cDNA FLJ57809 n=1 Tax=Homo sapiens RepID=B4DK02_HUMAN Length = 815 Score = 271 bits (693), Expect = 1e-70, Method: Composition-based stats. Identities = 87/427 (20%), Positives = 133/427 (31%), Gaps = 63/427 (14%) Query: 1065 VTETISLYSNPTKWFAGNMQSTGLWAPAQKEVTIKSNANVPVTVTVALADDLTGREKHEV 1124 V N W STGL+ + + + T Sbjct: 427 VEREFHRKGNNDCWV-----STGLYLLEGQNAEVSLSEAAASAGLRVQIGCHTDDLTKAR 481 Query: 1125 ALNRPPRVTKTYSLDASGTVKFKVPYGGLIYIKGNSSTNE-SASFTFTGVVKAPFYKDG- 1182 L+R P VT +D + +GGL+Y+ + T G V AP+YK G Sbjct: 482 KLSRAPVVTHQCWMDRTER-SVSCLWGGLLYVIVPKGSQLGPVPVTIRGAVPAPYYKLGK 540 Query: 1183 ----AWKNDLNSP-APLGELESDAFVYTTPKKNLNASNYTGGLEQFANDLDTFASSMNDF 1237 WK + AP GEL +D + T P NL A E D ++ Sbjct: 541 TSLEEWKRQMQENLAPWGELATDNIILTVPTTNLQAL---KDPEPVLRLWDEMMQAV--- 594 Query: 1238 YGRDSEDGKHRMFTYKNLPGHKHRFTNDVQISIGDAHSGYPVMNS--SFSPNSTTLPTTP 1295 R+ R DVQIS G HSGYP+M S + Sbjct: 595 ---------ARLAAEPFPFRRPERIVADVQISAGWMHSGYPIMCHLESVKEIINEMDMRS 645 Query: 1296 LNDWLIWHEVGHNAAETPLTV-PGATEVANNVLALYMQDRYLGKMNRVADDITVAPEYLE 1354 W HE+GHN P TE N+ ++Y+ + LG + R ++P E Sbjct: 646 RGVWGPIHELGHNQQRHGWEFPPHTTEATCNLWSVYVHETVLG-IPRAQAHEALSPPERE 704 Query: 1355 ESNNQAWARGGAGDRLLMYAQLKEWAEKNFDIKKWYPDGTPLPEFYSEREGMKGWNLFQL 1414 +G A L +W + E Y + + GW F Sbjct: 705 RRIKAHLGKG---------APLCDW------------NVWTALETYLQLQEAFGWEPFTQ 743 Query: 1415 MHRKARGDEVSNDKFGGKNYCAESNGNAADTLMLCASWVAQTDLSEFFKKWNPGANAYQL 1474 + + ++ + N + + S + +L FF+ W Sbjct: 744 LFAEY----------QTLSHLPKDNTGRMNLWVKKFSEKVKKNLVPFFEAWGWPIQKEVA 793 Query: 1475 PGASEMS 1481 + + Sbjct: 794 DSLASLP 800 >UniRef50_C3BVL2 S-layer domain protein n=1 Tax=Bacillus pseudomycoides DSM 12442 RepID=C3BVL2_9BACI Length = 823 Score = 267 bits (681), Expect = 4e-69, Method: Composition-based stats. Identities = 75/465 (16%), Positives = 139/465 (29%), Gaps = 68/465 (14%) Query: 1063 QNVTETISLYSNPTKWFAGNMQSTGLWAPAQKEVTIKSNANVPVTVTVALADDLTGREKH 1122 + ++ + TGL+A +++TI + G + Sbjct: 30 GKGNVDQIKDKDRRQFRFSPYEPTGLYASPNEKITILVEGTQNIQA-------YIGTFSY 82 Query: 1123 EVALNRPPRVTKTYSLDASGTVKFKVPYGGLIYIKGNSSTNESASFTFTGVVKAPFYKDG 1182 + A N+ + K+++L G + P GG+IY + G PF++ G Sbjct: 83 DAAWNQDSLI-KSFTLK-PGENTIESPNGGMIYFYNPQQGGTVRAEITAGGSPTPFFELG 140 Query: 1183 AWK-----NDLNS--PAPLGELESDAFVYTTPKKNLNAS--NYTGGLEQFANDLDTFASS 1233 N L++ A EL+ + + T + + Q +D Sbjct: 141 KHTQQDLVNMLDTYPNAHAVELKGERVLITASPERVKKYLIGSNTDPVQLLKKMDESIRI 200 Query: 1234 MNDFYGRDSEDGKHRMFTYKNLPGHKHRFTNDVQISIGDAHSGY--PVMNSSFSPNSTTL 1291 + G + H F D +S + + Sbjct: 201 QDRVSGLSEK----------EADKHYVHFVEDNHSKDYYMYSYSSRTAYVGDAIQHVLDV 250 Query: 1292 PTTPLNDWLIWHEVGHNAAETPLTVPGATEVANNVLALYMQDRYLGKMNRVADDITVAPE 1351 + W WHE GH +TP T G EV N+ ++ +Q + K A Sbjct: 251 NDFIKDGWGPWHEAGHQRQQTPWTWDGLGEVTVNIYSMSVQRAFGSKSRLEKGTYEKAFN 310 Query: 1352 YLEE--SNNQAWARGGAGDRLLMYAQLKEWAEKNFDIKKWYPDGTPLPEFYSEREGMKGW 1409 YL + S +L M+ QL + G Sbjct: 311 YLNKPQSEKDYNKIDDLFVKLAMFWQL---------------------------DLAFGE 343 Query: 1410 NLFQLMHRKARGDEVSNDKFGGKNYCAESNGNAADTLMLCASWVAQTDLSEFFKKWNPGA 1469 + +H++ R ++N + S VA+ +L FF KW A Sbjct: 344 EFYPKLHQEYRSLSKEE--------LPKNNEEKIQGFIYNTSKVAKQNLLPFFDKWGLVA 395 Query: 1470 NAYQLPGASEMSFEGGVSQSAYNTLASLDLPKPEQGPETINQVTE 1514 ++ + ++ + S + + P NQV++ Sbjct: 396 TPETRQKVEALN-DPIITAPIWEATDSKPIKVKVEEPTITNQVSD 439 >UniRef50_C2U768 S-layer domain protein n=4 Tax=Bacillus cereus RepID=C2U768_BACCE Length = 1064 Score = 259 bits (662), Expect = 5e-67, Method: Composition-based stats. Identities = 74/450 (16%), Positives = 137/450 (30%), Gaps = 70/450 (15%) Query: 1063 QNVTETISLYSNPTKWFAGNMQSTGLWAPAQKEVTIKSNANVPVTVTVALADDLTGREKH 1122 ++ + TGL+A +++TIK + G + Sbjct: 53 GKGDVNQIKDKERRQFSFSPYEPTGLYAGPNEKITIKVEGTQNIKA-------YIGTYSY 105 Query: 1123 EVALNRPPRVTKTYSLDASGTVKFKVPYGGLIYIKGNSSTNESASFTFTGVVKAPFYKDG 1182 + A N+ + K+++L+ G + P GG+IY + + TG V PF++ G Sbjct: 106 DGAWNQD-NLVKSFTLN-PGENTIESPNGGMIYFYNQQNGGSVQAEVKTGGVPVPFFELG 163 Query: 1183 AWK-----NDLNS--PAPLGELESDAFVYTTPKKNLNAS--NYTGGLEQFANDLDTFASS 1233 N L++ A EL+ + + T + + + +D Sbjct: 164 KHTKQDLINMLDTYPNAHAVELKGERSLITASPERVKKYLIGSNTDPVELLKKIDESIRL 223 Query: 1234 MNDFYGRDSEDGKHRMFTYKNLPGHKHRFTNDVQISIG-DAHSGYPVMNSSFSPNSTTLP 1292 + G E H F D S A++ + Sbjct: 224 EDRVAGLSEE----------EADKHYVHFVEDNHSSYYMYAYTYRTAYVKDAIQVVLDIN 273 Query: 1293 TTPLNDWLIWHEVGHNAAETPLTVPGATEVANNVLALYMQDRYLGKMNRVADDITVAPEY 1352 + W WHE+GH P T G EV NN+ ++ +Q Y G +R+ D + Sbjct: 274 QFTKDGWGPWHEMGHQRQPNPWTWNGLGEVTNNIYSMSVQRAY-GLPSRLEKDGVYQKAF 332 Query: 1353 LE----ESNNQAWARGGAGDRLLMYAQLKEWAEKNFDIKKWYPDGTPLPEFYSEREGMKG 1408 +S +L+M+ QL + Sbjct: 333 TYLNKPQSEKDYNKIDDVFVKLVMFWQL---------------------------DLAFR 365 Query: 1409 WNLFQLMHRKARGDEVSNDKFGGKNYCAESNGNAADTLMLCASWVAQTDLSEFFKKWNPG 1468 + +H+ R ++N + S VAQ +L FF +W Sbjct: 366 EEFYPKLHQLYRAIPKEE--------LPKTNDEKIQNFIYNTSKVAQKNLLPFFDQWGLI 417 Query: 1469 ANAYQLPGASEMSFEGGVSQSAYNTLASLD 1498 A +++ ++ + S Sbjct: 418 ATPEIRQKIESLNY-PILTAPIWEATDSKP 446 >UniRef50_A7GU48 S-layer domain protein n=5 Tax=Bacillus cereus group RepID=A7GU48_BACCN Length = 876 Score = 259 bits (660), Expect = 9e-67, Method: Composition-based stats. Identities = 70/481 (14%), Positives = 128/481 (26%), Gaps = 79/481 (16%) Query: 1057 AVSEEGQNVTETISLYSNPTKWFAGNMQSTGLWAPAQKEVTIKSNANVPVTVTVALADDL 1116 + + + TGL+A + + I + Sbjct: 51 RTFLLKGKGNVDHLANIHQRAFAFSPFEPTGLYAKPNETIIIHVEGKQNIQA-------Y 103 Query: 1117 TGREKHEVALNRPPRVTKTYSLDASGTVKFKVPYGGLIYIKGNSSTNESASFTFTGVVKA 1176 G ++ + G + P GG++Y + + E+ +G Sbjct: 104 IGTYSYDGNPKKF--------YLKPGKNEISAPKGGMLYFENANLNGETKVTVSSGGTPI 155 Query: 1177 PFYKDG-----AWKNDLNS--PAPLGELESDAFVYTTPKKNLNASNYTGGLEQFANDLDT 1229 P+++ G W L A EL+ + ++T K D Sbjct: 156 PYFELGKHTKEDWDAMLEKFPNAYAVELKGERSLFTVTYKAAKQYLGGKDPSPLLRKHDE 215 Query: 1230 FASSMNDFYGRDSEDGKHRMFTYKNLPGHKHRFTNDVQISIGDAHSGY--PVMNSSFSPN 1287 + G E H F D G ++ S Sbjct: 216 AIRIQDKVSGVSEE-----YTGVAQADTHYVHFVEDFNKKDGWMYATNYRTGYVSDAMKY 270 Query: 1288 STTLPTTPLNDWLIWHEVGHNAAETPLTVPGATEVANNVLALYMQDRYLGKMNRVADDIT 1347 L + W WHE GH +T G TEV N+ ++ +Q + G +R+ + Sbjct: 271 VLDLEHFEKDGWGPWHEAGHQRQQT-WKWSGLTEVTVNIYSMAVQRAF-GNPSRLEAEER 328 Query: 1348 VAPEYLE----ESNNQAWARGGAGDRLLMYAQLKEWAEKNFDIKKWYPDGTPLPEFYSER 1403 +L +S +L M+ QL Sbjct: 329 YNDIFLYLMKPQSEKNYDQIDNLFVKLGMFWQL--------------------------- 361 Query: 1404 EGMKGWNLFQLMHRKARGDEVSNDKFGGKNYCAESNGNAADTLMLCASWVAQTDLSEFFK 1463 + G N + +H+ R + G S+ + AS VA DL+ FF+ Sbjct: 362 DLAFGDNFYPKLHQMYRLTSLEELGDG-------SDEEKKQLFITMASKVANRDLTPFFE 414 Query: 1464 KWNPGANAYQLPGASEMSFEGGVSQSAYN-------TLASLDLPK---PEQGPETINQVT 1513 W + + + ++ +P P I + Sbjct: 415 IWGLMPSEKTKNRIKHLPKLQKEIWKGRDLRPVVEERVSKYQVPMGGIPVPTTVDIGAIN 474 Query: 1514 E 1514 + Sbjct: 475 D 475 >UniRef50_B7INX6 Wall-associated protein n=62 Tax=Bacillus cereus group RepID=B7INX6_BACC2 Length = 1476 Score = 257 bits (657), Expect = 2e-66, Method: Composition-based stats. Identities = 71/457 (15%), Positives = 134/457 (29%), Gaps = 74/457 (16%) Query: 1062 GQNVTETISLYSNPTKWFAGNMQSTGLWAPAQKEVTIKSNANVPVTVTVALADDLTGREK 1121 + ++ + TG++A +E+ I+ + + + Sbjct: 59 PGKGSVEEEQKRLKVRYVLSTNEPTGIYAGPNEEIKIEIKGTQSIKAFIG------TKSY 112 Query: 1122 HEVALNRPPRVTKTYSLDASGTVKFKVPYGGLIYIKGNSSTNESASFTFTGVVKAPFYKD 1181 E + L G GG++Y ++ E + G P + Sbjct: 113 DEKGFEE-------FELK-PGENNISSSRGGILYFYNMNNDGEVTASVIHGGSHFPLFVL 164 Query: 1182 G-----AWKNDLN-SPAPLG-ELESDAFVYTTPKKNLNASNYTGGLEQFANDLDTFASSM 1234 G W L P EL+++ + T + + + D Sbjct: 165 GKHTKKDWDAMLKKYKNPYAVELKAERSLITASPEAVANYMGETDPVELMRLHDKIIRFE 224 Query: 1235 NDFYGRDSEDGKHRMFTYKNLPGHKHRFTN------DVQISIGDAHSGYPVMNSSFSPNS 1288 N G + P H +F D + H+GY Sbjct: 225 NSVAGLSEDG-----IGVSKAPNHYIQFVEKRKPDKDDWMFATHYHTGY---VPETMDRV 276 Query: 1289 TTLPTTPLNDWLIWHEVGHNAAETPLTVPGATEVANNVLALYMQDRYLGKMNRVADDITV 1348 + + W WHEVGH + P G EV N+ +L +Q K + + +D Sbjct: 277 LNIKRLQGDGWGPWHEVGHLHQQAPWFWSGVGEVTVNIYSLSVQRMLGNK-SSLEEDGHY 335 Query: 1349 APEYLEESNNQAWARGGAGDRLLMYAQLKEWAEKNFDIKKWYPDGTPLPEFYSEREGMKG 1408 + N A + ++L+M+ QL + G Sbjct: 336 KKAFAYLDNPDAQKKMEEFEKLVMFWQL---------------------------DLAYG 368 Query: 1409 WNLFQLMHRKARGDEVSNDKFGGKNYCAESNGNAADTLMLCASWVAQTDLSEFFKKWNPG 1468 + + +H+ R S S+ + + AS A+ +L FF+KW G Sbjct: 369 EHFYPNLHQMYRLLPESE--------MPASDEDKKQMFIYMASKAAKQNLVPFFEKWGLG 420 Query: 1469 ANAYQLPGASEMS---FEGGVSQSAYNTLASLDLPKP 1502 N ++ E + ++ + + KP Sbjct: 421 PNDEVRGKIENLNLPKLEKEIWKATDSNIIREKQVKP 457 >UniRef50_A9VG09 S-layer domain protein n=29 Tax=Bacillus cereus group RepID=A9VG09_BACWK Length = 869 Score = 254 bits (649), Expect = 2e-65, Method: Composition-based stats. Identities = 75/465 (16%), Positives = 138/465 (29%), Gaps = 65/465 (13%) Query: 1057 AVSEEGQNVTETISLYSNPTKWFAGNMQSTGLWAPAQKEVTIKSNANVPVTVTVALADDL 1116 + + + TGL+A +++TI N + Sbjct: 58 RIYTVPGKGDVEVLKQQERKSMAFSPYEPTGLYAKPNEQITINVEGNQNIQA-------Y 110 Query: 1117 TGREKHEVALNRPPRVTKTYSLDASGTVKFKVPYGGLIYIKGNSSTNESASFTFTGVVKA 1176 G ++ + ++ K+++L G + P GG+IY + TG Sbjct: 111 IGTYSYDASWREDSKI-KSFTLK-PGINTIQSPNGGMIYFYNKQQGGTIRTTVTTGGTTT 168 Query: 1177 PFYKDGAWK-----NDLNS--PAPLGELESDAFVYTTPKKNLNAS--NYTGGLEQFANDL 1227 PF++ G N L+ A EL+ + + T + Q + Sbjct: 169 PFFELGKHTKQDLINMLDQYPNAHAVELKGERVLITASPARVKKYLLGSNTDPVQLLKKM 228 Query: 1228 DTFASSMNDFYGRDSEDGKHRMFTYKNLPGHKHRFTNDVQISIGDAHSGYPVMNSSFSPN 1287 D + G E Y + A S Sbjct: 229 DEATRIQDKVAGLSEEQVDKHYIHYVEENHSPDYYM--------YATSYRTAYVGDAIQY 280 Query: 1288 STTLPTTPLNDWLIWHEVGHNAAETPLTVPGATEVANNVLALYMQDRYLGKMNRVADD-- 1345 + + W WHE GH ++P TEV NN+ +L ++ + + Sbjct: 281 VLNINKFIKDGWGPWHEAGHLRQQSPWKFYNMTEVQNNIYSLSVEKAFTSNQSFRLQQEG 340 Query: 1346 -ITVAPEYLEESNNQAWARGGAGDRLLMYAQLKEWAEKNFDIKKWYPDGTPLPEFYSERE 1404 T A +YLE+ N +L+M QL+ Sbjct: 341 AYTKAFQYLEQPNKNYDEVSDVFVKLVMLWQLQL-------------------------- 374 Query: 1405 GMKGWNLFQLMHRKARGDEVSNDKFGGKNYCAESNGNAADTLMLCASWVAQTDLSEFFKK 1464 G + + +H+ R S +++ N M+ AS VA+ +L FF+K Sbjct: 375 -AYGEDFYPKLHQLYRDMSSSE--------LPQTDENKKQLFMISASKVAKQNLIPFFEK 425 Query: 1465 WNPGANAYQLPGASEMSFEGGVSQSAYNTLASLDLPKPEQGPETI 1509 W N + + + + ++ + S + I Sbjct: 426 WGLRPNNDTIQKVAALGY-PILTAEIWKGTDSNPIKPDVPNENNI 469 >UniRef50_C2G0M2 Putative uncharacterized protein n=2 Tax=Sphingobacterium spiritivorum RepID=C2G0M2_9SPHI Length = 675 Score = 252 bits (642), Expect = 1e-64, Method: Composition-based stats. Identities = 102/510 (20%), Positives = 162/510 (31%), Gaps = 82/510 (16%) Query: 1042 WWDLNIKVDVEKYPGAVSEEGQNV---TETISLYSN-------PTKWFAGNMQSTGLWAP 1091 D ++ +PG V E + T I + N + STGL+AP Sbjct: 56 VADKSLYDRARVFPGLVGENVPRIKDTTVIIDMNKNVISSRDYKISVAPQAIYSTGLYAP 115 Query: 1092 AQKEVTIKSNAN-VPVTVTVA-LADDLTGREKHEVALNRPPRVTKTYSLDASGTVKFKVP 1149 + V I + +TV + D+LTG+E L R P + L A G + Sbjct: 116 PGENVKITVPEGLIGLTVQIGAHMDNLTGKE----TLKRDPVIYTVKEL-APGVNYVRNL 170 Query: 1150 YGGLIYIKGNSSTNESASFTFTGVVKAPFYKDGA-----W-KNDLNSPAPLGELESDAFV 1203 YGG I+++ N + + F+G V+A + G W K+ L + P E+ V Sbjct: 171 YGGTIWVRSNVARPIPVNLKFSGPVRASDFVHGQSDIAAWKKDVLANNVPWLEIRGKHMV 230 Query: 1204 YTTPKKNLNASNYTGGLEQFANDLDTFASSMNDFYGRDSEDGKHRMFTYKN----LPGHK 1259 T P+ N+ G NDL+ + N Y +D D T P Sbjct: 231 MTVPRANVVTFINQGRF----NDLNEVMAEWNIVYEKDYYDWMGLSATAAEVKNRYPEFP 286 Query: 1260 HRFTNDVQISIGDAHSGYPVMN---SSFSPNSTTLPTTP-LNDWLIWHEVGHNAAE-TPL 1314 R D+Q S+G AHSG+P + + T L T N W +HEVGHN + + Sbjct: 287 QRVVLDIQPSLGYAHSGFPWVAQNDLQWLDELTNLTTIHNGNSWGSYHEVGHNFQQTSTW 346 Query: 1315 TVPGATEVANNVL------------ALYMQDRYLGKMNRVADDITVAPEYLEESNNQAWA 1362 + E +NN+ L + + + A Sbjct: 347 SWSDLGETSNNLFIFNGGHRRGNATILNFHPALKTAIPTALTFAASLTGKNFSNLDGVIA 406 Query: 1363 RGGA-GDRLLMYAQLKEWAEKNFDIKKWYPDGTPLPEFYSEREGMKGWNLFQLMHRKARG 1421 G A RL + QL + + G GW ++ K+R Sbjct: 407 DGEAPFFRLTPFLQLFDKIQGK--------------------NGESGWAFMTYLYNKSRN 446 Query: 1422 DEVSNDKFGGKNYCAESNGNAADTLMLCASWVAQTDLSEFFKKWNPGANAYQLPGASEMS 1481 + Y + D D F W + Sbjct: 447 SD----------YQFSLDQAKRDYFYRSLCEFTGRDYYRFMVAWGMAVSNVAKKEMRAKY 496 Query: 1482 FEGGVSQSAYNTLASLDLP---KPEQGPET 1508 ++ Y+ + P+ + Sbjct: 497 PPMEMTTWTYDPIKKTGGNSAMNPKYDLPS 526 >UniRef50_D2VFD2 Predicted protein n=1 Tax=Naegleria gruberi RepID=D2VFD2_NAEGR Length = 934 Score = 252 bits (642), Expect = 1e-64, Method: Composition-based stats. Identities = 86/444 (19%), Positives = 142/444 (31%), Gaps = 76/444 (17%) Query: 1052 EKYPGAVSE-----EGQNVTETISLYSNPTKWFAGNMQSTGLWAPAQKEVTIKSNANVPV 1106 E++PG + T +++ +N + Q TG +A + + + P Sbjct: 355 EQFPGLPNNGLTYRNAPTETLKMNVSTNRKR-----WQCTGFYALPGATLEVIL--SNPK 407 Query: 1107 TVTVALADDLTGREKHEVALNRPPRVTKTYSLDASGTVK--FKVPYGGLIYIKGNSSTNE 1164 V+ T H + +R P ++KT+S+ A+ GG IYI+ SS Sbjct: 408 GVSNVRIGGHTDGIAHLDSWSRWPSISKTFSISATTNSTGTITCLNGGTIYIEV-SSIPL 466 Query: 1165 SASFTFTG-VVKAPFYKDG-----AWKNDL-NSPAPLGELESDAFVYTTPKKNLNASNYT 1217 S T G VVK PF+KDG W + + N P P E+ES+ F + + Sbjct: 467 SVDVTVIGQVVKTPFFKDGIHTDQEWNSTIRNYPGPWVEIESEHFAFNVQATP--TARQL 524 Query: 1218 GGLEQFANDLDTFASSMNDFYGRDSEDGKHRMFTYKNLPGHKHRFTNDVQISIGDAHSGY 1277 + A + + + + +K R D+QIS G HSG Sbjct: 525 SEVSTVAKYWGNVVAMYYEL-------------SQRQTRDYKERMQADIQISAGYMHSGL 571 Query: 1278 PVMNSSFSPNSTTLPTTPLNDWLIWHEVGHNAAETPLTVPGATEVANNVLALYMQDRYLG 1337 + S S W +HE+GHN E T + EV N+ +LY+++ Y Sbjct: 572 SFYDFSRSERWNYSSR----RWGHYHELGHNFQEGAWTYDQSGEVTCNIFSLYLEEHYPD 627 Query: 1338 KMNRVADDITVAPEYLEESNNQAWARGGAGDRLLMYAQLKEWAEKNFDIKKWYPDGTPLP 1397 N G Sbjct: 628 SANTYGKGFNPPRGLETTYKGNTQPFKGDWTSAG-------------------------L 662 Query: 1398 EFYSEREGMKGWNLFQLMHRKARGDEVSNDKFGGKNYCAESNGNAADTLMLCASWVAQTD 1457 FY + GW + + + +++ D M S V + Sbjct: 663 LFYLDLIQAFGWVAIRKTFVDYQKNGP----------QPQNDDQKRDQWMTRFSKVVGRN 712 Query: 1458 LSEFFKKWNPGANAYQLPGASEMS 1481 L F W+ + S ++ Sbjct: 713 LGPFCDAWSFPVSDAAKASISNLT 736 >UniRef50_C3XPE8 Putative uncharacterized protein n=1 Tax=Branchiostoma floridae RepID=C3XPE8_BRAFL Length = 676 Score = 249 bits (636), Expect = 5e-64, Method: Composition-based stats. Identities = 84/449 (18%), Positives = 142/449 (31%), Gaps = 113/449 (25%) Query: 1049 VDVEKYPGAVSEEGQNVTETISLYSNPTKWFAGNMQSTGLWAPAQKEVTI------KSNA 1102 + +PG EE Q ++ + S + STG + PA K +TI + Sbjct: 296 PGINDFPGDFEEEPQLQCASVIIKSARKE-----RHSTGYYLPAGKVLTIKATTTGEHLD 350 Query: 1103 NVPVTVTVALADDLTGREKHEVALNRPPRVTKTYSLDASGTVKFKVPYGGLIYIKGNSST 1162 + V V A +D+L+G++K R P+++ L+A K PYGGLIY + Sbjct: 351 DWRVRVG-AHSDNLSGKKK----FKRWPKLSVVKRLEAE--TKISSPYGGLIYFESPKEA 403 Query: 1163 NESASFTFTGVVKAPFYKDGA------WKNDLNSPAPLGELESDAFVYTTPKKNLNASNY 1216 + + + VV+APFY W +P +L + ++T P ++ Sbjct: 404 GD-LTASMENVVEAPFYDLEDPRSVQNWSERRKAPGLWADLAGEHIIFTLPAASVRDL-- 460 Query: 1217 TGGLEQFANDLDTFASSMNDFYGRDSEDGKHRMFTYKNLPGHKHRFTNDVQISIGDAHSG 1276 + D + ++ G + H+ R D Q G H+G Sbjct: 461 -EDPSEGLRAWDDVVKAHHELRGTNPSG------------EHRQRIVPDRQPKAGWMHAG 507 Query: 1277 YPVM-NSSFSPNSTTL----PTTPLNDWLIWHEVGHNAAETPLTVPGATEVANNVLALYM 1331 YP++ N + + +W ++HE+GHN T G E Sbjct: 508 YPIVTNMDIAARDKFILDGKKIRKAGNWGLFHELGHNMQRKWWTFEGTGE---------- 557 Query: 1332 QDRYLGKMNRVADDITVAPEYLEESNNQAWARGGAGDRLLMYAQLKEWAEKNFDIKKWYP 1391 + D+ AG L +YAQL Sbjct: 558 --------DGAQHDVWKKK---------------AGIALAVYAQL--------------- 579 Query: 1392 DGTPLPEFYSEREGMKGWNLFQLMHRKARGDEVSNDKFGGKNYCAESNGNAADTLMLCAS 1451 GW ++ + R+ D E+N ++ S Sbjct: 580 ------------AHHFGWEPYKKVFRQYEKDPKKKQ--------PENNKERIVLWIVRFS 619 Query: 1452 WVAQTDLSEFFKKWNPGANAYQLPGASEM 1480 +L F W S + Sbjct: 620 EEVGRNLVPLFDFWGFPHIEEAQAQVSGL 648 >UniRef50_UPI0001692BE8 S-layer domain protein n=1 Tax=Paenibacillus larvae subsp. larvae BRL-230010 RepID=UPI0001692BE8 Length = 591 Score = 249 bits (636), Expect = 6e-64, Method: Composition-based stats. Identities = 91/491 (18%), Positives = 159/491 (32%), Gaps = 89/491 (18%) Query: 1033 LTRLMLGRSWWDLNIKVDVEKYPGAVSEEGQNVTETISLYSN--------PTKWFAGNMQ 1084 L+ LMLG + + + E G Q +T + + + N Q Sbjct: 13 LSALMLGSTVATVPVYAQNEPIEGKTVASKQEMTMKLEQTGSIYDIRNRLKVTFGPSNRQ 72 Query: 1085 STGLWAPAQKEVTIKSNANVPVTVTVALADDLTGREKHEVALNRPPRV--TKTYSLDASG 1142 STG++ + + I + N +V T + H + R + K ++ G Sbjct: 73 STGVYKNSNDVIEIYVDPNTDQSVMPQYVISPTVLKSH-TDVGRDSGIPLQKGWNFIREG 131 Query: 1143 TVKFKVPYGGLIYIKGNS-STNESASFTFTGVVKAPFYKDG-----AWKNDL--NSPAPL 1194 G+I++ S T + A+ T +G + P + G W+ N AP Sbjct: 132 A--------GIIHLINESGPTQKPATVTISGGQELPRFILGKHTDADWEKMKLSNPNAPG 183 Query: 1195 GELESDAFVYTTPKKNLNASNYTGGLEQFANDLDTFASSMNDFYGRDSEDGKHRMFTYKN 1254 EL S + T + + + D N G DS H+ K+ Sbjct: 184 FELVSKHVIITG---GNKSMSLVHSPNELMKAHDEAIEEENRVAGLDSSKDVHQPAPMKH 240 Query: 1255 LPGHKHRFTNDVQISIGDA-----HSGYPVMNSSFSPNSTTLPTTPLNDWLIWHEVGHNA 1309 K S G H+GY S+ + W WHEVGH Sbjct: 241 HMREKD--------SGGWMYAWLQHTGY---VSNAMKYILNPDIFKRDGWGPWHEVGHTH 289 Query: 1310 AETPLTVPGATEVANNVLALYMQDRYLGKMNRVADD--ITVAPEYLEESNNQAWARGGAG 1367 L G EV NN+ ++ +Q + G +R+ +D YLE++N Sbjct: 290 QMNILNWSGLGEVTNNIYSMSVQRYF-GNHSRLEEDKTYNKVFAYLEQTNKDYNKINDLF 348 Query: 1368 DRLLMYAQLKEWAEKNFDIKKWYPDGTPLPEFYSEREGMKGWNLFQLMHRKARGDEVSND 1427 +L M+ QL + G + + +H+ R Sbjct: 349 VKLAMFCQL---------------------------DLAYGKDFYPSLHKAYREL----- 376 Query: 1428 KFGGKNYCAESNGNAADTLMLCASWVAQTDLSEFFKKWNPGANAYQLPGASEMSFEGGVS 1487 N + S+ + AS A +L+ FF+KW + + + Sbjct: 377 -----NEWSGSDAVKQQRFIRLASRTANRNLTPFFEKWGLMVTQETKQQLASLPH---LE 428 Query: 1488 QSAYNTLASLD 1498 + + + ++ Sbjct: 429 KKIWQYMDDMN 439 >UniRef50_UPI000155BFC0 PREDICTED: similar to FLJ00264 protein n=1 Tax=Ornithorhynchus anatinus RepID=UPI000155BFC0 Length = 540 Score = 249 bits (634), Expect = 9e-64, Method: Composition-based stats. Identities = 85/390 (21%), Positives = 132/390 (33%), Gaps = 63/390 (16%) Query: 1092 AQKEVTIKSNANVPVTVTVALADDLTGREKHEVALNRPPRVTKTYSLDASGTVKFKVPYG 1151 A T V V +DDLT E+ L RPP VT + + S V +G Sbjct: 174 AGVYFTXXXXXXXXVQVG-CHSDDLTEAEE----LKRPPVVTSRFDV-CSERVTVSSLWG 227 Query: 1152 GLIYIKGNSSTN-ESASFTFTGVVKAPFYKDGA-----WKNDLNS-PAPLGELESDAFVY 1204 GL+YI + + T G ++APF++ G W L PAP ELE+D Sbjct: 228 GLMYIVLPTGCQLDPIPVTVRGAMQAPFFRLGETSPSAWHKTLRHHPAPWAELETDNLTL 287 Query: 1205 TTPKKNLNASNYTGGLEQFANDLDTFASSMNDFYGRDSEDGKHRMFTYKNLPGHKHRFTN 1264 T P +N+ + + +++ R Sbjct: 288 TVPAENICLLDSPEP---LLDIWQQIMGAVSKLAA------------VPQTLRRPERIVA 332 Query: 1265 DVQISIGDAHSGYPVM-NSSFSPNSTTLPTT-PLNDWLIWHEVGHNAAETPLTV-PGATE 1321 DVQIS G H+GYP+M + L W HE+GHN + P +E Sbjct: 333 DVQISAGWMHAGYPIMIHLESVEEVVNLQRIWEKGLWGPLHELGHNQQRSNWEFPPHTSE 392 Query: 1322 VANNVLALYMQDRYLGKMNRVADDITVAPEYLEESNNQAWARGGAGDRLLMYAQLKEWAE 1381 N+ ++Y+ + LG + R ++ PE +G A LK+W Sbjct: 393 ATCNLWSVYVSETVLG-IPRHEAHTSLRPEARASRVKAFLEQG---------APLKKW-- 440 Query: 1382 KNFDIKKWYPDGTPLPEFYSEREGMKGWNLFQLMHRKARGDEVSNDKFGGKNYCAESNGN 1441 + E Y + + GW F + + + N Sbjct: 441 ----------EVFVALETYLQLQEAFGWEPFIQLFADY----------QTETGVPDDNKA 480 Query: 1442 AADTLMLCASWVAQTDLSEFFKKWNPGANA 1471 + S + + +L+ FFK W Sbjct: 481 KMNLWAQKFSQLVKKNLAPFFKAWGWPIEE 510 >UniRef50_C2G4C5 Putative uncharacterized protein n=2 Tax=Sphingobacterium spiritivorum RepID=C2G4C5_9SPHI Length = 665 Score = 245 bits (624), Expect = 1e-62, Method: Composition-based stats. Identities = 86/464 (18%), Positives = 147/464 (31%), Gaps = 68/464 (14%) Query: 1044 DLNIKVDVEKYPGAVSEEGQN-VTETISLYSN---------PTKWFAGNMQSTGLWAPAQ 1093 D + +PG V+ +S+ N G QST ++APA Sbjct: 54 DYSKITQARLFPGVVATTEPRLENYKVSIDLNYVEVSPSDLRISVAPGAWQSTSMFAPAG 113 Query: 1094 KEVTIKSNANVPVTVTVALADDLTGREKHEVALNRPPRVTKTYSLDASGTVKFKVPYGGL 1153 + + I V TG K E R ++ + L G + YGGL Sbjct: 114 ELIVIDVPQGVYGLKAQVGPHVYTGSTKIEFP-RRDEKIVVSKDL-FPGKNYIRNLYGGL 171 Query: 1154 IYIKGNSSTNESASFTFTGVVKAPFYKDG-----AWKNDLN-SPAPLGELESDAFVYTTP 1207 +YI F+G AP +K G WK+ +N S P ELE + V+T Sbjct: 172 VYIIPERPLGRVVDLLFSGTTLAPSFKLGKMTDQQWKDLVNKSSVPWFELEGNRIVFTLQ 231 Query: 1208 KKNLNASNYTGGLEQFANDLDTFA-SSMNDFYGRDSEDGKHRMFTYKNLPGHKHRFTNDV 1266 + L + D + D+ G + P +K R +DV Sbjct: 232 TERLKRFP-INSPTELMELWDKMIKEAYWDWTGMTEGN----PDVKHRAPFNKWRIVHDV 286 Query: 1267 QISIGDAH-SGYPV---MNSSFSPNSTTLPTTPLNDWLIWHEVGHNAAETP-LTVPGATE 1321 G A SGYPV N + + T+ + +W +HE+GHN + + G E Sbjct: 287 LFEPGVAQVSGYPVRAGANDQYFGQAVTINSVRTQNWGTYHELGHNMQQGRVWSFDGNGE 346 Query: 1322 VANNVLALYM-------QDRYLGKMNRVADDITVAPEYLEESNNQAWARGGAGDRLLMYA 1374 V NN+ + + + + I P+ ++N + WA + Sbjct: 347 VTNNLFSFKVAMINGRQHTKIAEVWPTGLEWINYVPKDAADANRKIWANMPTLSK----- 401 Query: 1375 QLKEWAEKNFDIKKWYPDGTPLPEFYSEREGMKGWNLFQLMHRKARGDEVSNDKFGGKNY 1434 Y++ G+ ++ +AR + Sbjct: 402 ----------------NHNDAKLIMYAQIFEKYGYGFMTYLYTRARN----------ARF 435 Query: 1435 CAESNGNAADTLMLCASWVAQTDLSEF-FKKWNPGANAYQLPGA 1477 + ++ + D + D+ F + W + Sbjct: 436 ESANDQSKIDFFYEALCEYTKVDMEPFLWIGWGIKVSDVSRNYV 479 >UniRef50_B7HIS0 S-layer domain protein n=19 Tax=Bacillus RepID=B7HIS0_BACC4 Length = 591 Score = 241 bits (614), Expect = 2e-61, Method: Composition-based stats. Identities = 66/501 (13%), Positives = 146/501 (29%), Gaps = 73/501 (14%) Query: 1034 TRLMLGRSWWDLNIKVDV----EKYPGAVSEEGQNVTETISLYSNPTKWFAGNMQSTGLW 1089 T L +L++ + ++ + + N TG++ Sbjct: 24 TTLFTPIMSNNLDVHAETAQTATQFEQREFDLPGTGSFWDEAKREKRS-EQKNYMPTGIY 82 Query: 1090 APAQKEVTIKSNANVPVTVTVALADDLTGREKHEVALNRPPRVTKTYSLDASGTVKFKVP 1149 ++VTI + + + G +++ + ++ G+ P Sbjct: 83 VKPNEQVTITVSGTQKIRAII-------GTHQYDKEWGKEIDLS-------PGSNTISSP 128 Query: 1150 YGGLIYIKGNSSTNESASFTFTGVVKAPFYKDG-----AWKNDLNS--PAPLGELESDAF 1202 GG++ + ST G PF+ G W +N+ A +L+S+ Sbjct: 129 NGGVLGLDNFQSTGTVKVQVTQGGSPIPFFVLGKHTKADWIAMMNNYPNAHAVQLKSERA 188 Query: 1203 VYTTPKKNLNASNYTGGLEQFANDLDTFASSMNDFYGRDSEDGKHRMFTYKNLPGHKHRF 1262 V T + + N N D + + G D + + + F Sbjct: 189 VLTVTRDSANKYIVNQDPVPLLNKYDEMIRAQDKLAGLSETDPNPLHRSTRRI----WAF 244 Query: 1263 TNDVQISIGDAHSGYPVMNSSFSPNSTTLP-TTPLNDWLIWHEVGHNAAETPLTVP---G 1318 + ++ + + + + W HE GH + P T G Sbjct: 245 VENPNAQSWGMYASWDGAVFTTAGEAIKSTLNVNEFGWGQMHEAGHARQQYPWTWNDLRG 304 Query: 1319 ATEVANNVLALYMQDRYLGKMNR---VADDITVAPEYLEESNNQAWARGGAGDRLLMYAQ 1375 EV NN+ +L + + D A YL+++N + +L+M Q Sbjct: 305 MGEVTNNLYSLAAFKKIYPNIPTRLDTEGDYNRAFAYLKQTNKEYKNIDDLFVKLVMLWQ 364 Query: 1376 LKEWAEKNFDIKKWYPDGTPLPEFYSEREGMKGWNLFQLMHRKARGDEVSNDKFGGKNYC 1435 L G + + +H+ R + Sbjct: 365 LHL---------------------------AYGDDFYPNLHKLYREIPE--------DQL 389 Query: 1436 AESNGNAADTLMLCASWVAQTDLSEFFKKWNPGANAYQLPGASEMSFEGGVSQSAYNTLA 1495 +++ + + S +A+ ++ FF KW A ++ + + Sbjct: 390 PKTDEDKIQEFIYNTSKIAKQNVLPFFDKWGLKATQETRQKVEALN-NPTLIAPIWEATD 448 Query: 1496 SLDLPKPEQGPETINQVTEHK 1516 + + + I + + + Sbjct: 449 AKPVKPLAVLSKKIMKASANS 469 >UniRef50_D1PPF9 Putative fibronectin type III domain protein n=1 Tax=Subdoligranulum variabile DSM 15176 RepID=D1PPF9_9FIRM Length = 1764 Score = 240 bits (612), Expect = 3e-61, Method: Composition-based stats. Identities = 92/591 (15%), Positives = 163/591 (27%), Gaps = 108/591 (18%) Query: 948 LYQNMSVWLWNDTSYRYEEGKNDELGFKTFTEFLNCYANDAYAGGTKCSADLKKSLVDNN 1007 +Y N V + Y Y+ + D L Y +D + + + Sbjct: 792 IYNNRPVSISEMRFYAYDSLEAD---------ILALYDDDLHVTLRSGVDEKTIEALQTR 842 Query: 1008 MIYGDGSSKAGMMNPSYPLNYMEKPLTRLMLGRSWWDLNIKVDVEKYPGAVSEEGQNVTE 1067 + D +G ++P E R +L + D+ ++V T Sbjct: 843 LDTPD--EASGELHPEREALQRELDNARSLLTTTLSDV-VEVHT--------------TI 885 Query: 1068 TISLYSNPTKWFAGNMQSTGLWAPAQKEVTIKSNAN---VPVTVTVALADDLTGREKHEV 1124 T + Q G+ A A +E+ + ++ + L E EV Sbjct: 886 TAKKDGHLGFTGLNAWQPLGVTAKAGEEIIVYVGSSNVATGANAWLQLVATQYNSESGEV 945 Query: 1125 ALNRPPRVTKTYSLDASGTVKFKVPYGGLIYIKG-NSSTNESASFTFTGVVKAPFYKD-- 1181 + V F +GG +Y++ + N+ S +G P Sbjct: 946 VSQPIQLKVGRNEIMVPELVTFDAEHGGALYVQFTGDNANDRYSVRVSGGATIPVLDLYG 1005 Query: 1182 ----GAWKNDLNSPAPLGE-------------------------------LESDAFVYTT 1206 G K + + E + D +Y+ Sbjct: 1006 IDDPGQRKERVTAYVAALEAANDTLNSRHDELHGEHGTCEPQTCILNTTDIMLDQMMYSV 1065 Query: 1207 PKKNLN-------ASNYTGGLEQFANDLDTFASSMNDFYGRDSEDGKHR--MFTYKNLPG 1257 P L ++ L + +D + G G ++L Sbjct: 1066 PVSQLLAGLGSGSTADKAARLLSSLDAMDQMMTLFYQHKGLTDLAGAGDKNRLPSQHLNI 1125 Query: 1258 HKHRFTNDVQISIGDAHSGYPVMNSSFSPNSTTLPTTPLND--------WLIWHEVGHNA 1309 R + H G + + W I HE+GHN Sbjct: 1126 RYMRMFAGAFMYAAGNHIGIGWDSVPGLGGGEPITLNEDGSYRSGDLFGWGIAHEIGHNI 1185 Query: 1310 AETPLTVPGATEVANNVLALYMQDRYLGKMNRVADDITVAPEYLEESNNQAWARGGAGDR 1369 + V EV NN + Q + R D + ++ + Sbjct: 1186 NQGSYAV---VEVTNNYFSQISQAH---EGVRFGYDAIYSK----VTSGTTGHSSDVFTQ 1235 Query: 1370 LLMYAQLKEWAEKNFDIKKWYPDGTPLPEFYSEREGMKGWNLFQLMHRKARGDEVSNDKF 1429 L +Y QL +K ++ E Y E + + + AR + Sbjct: 1236 LGLYWQLHLAYDKGYEY-----------EIYDNYEELFASRFYARVDTYARNTAQAPAPG 1284 Query: 1430 GGKNYCAESNGNAADTLMLCASWVAQTDLSEFFKKWNPGANAYQLPGASEM 1480 G K G+A M AS AQ DL++FF +W +A ++ Sbjct: 1285 GIKLTL---GGDADQNFMRLASAAAQKDLTDFFVRWGMTPDAATAAYLAQF 1332 >UniRef50_A5ZEQ5 Putative uncharacterized protein n=1 Tax=Bacteroides caccae ATCC 43185 RepID=A5ZEQ5_9BACE Length = 937 Score = 233 bits (593), Expect = 5e-59, Method: Composition-based stats. Identities = 79/471 (16%), Positives = 147/471 (31%), Gaps = 71/471 (15%) Query: 1069 ISLYSNPTKWFAGNMQSTGLWAPAQKEVTIKSNA--NVPVTVTVALADDLTGREKHEVAL 1126 I N T ++ TG++ +E+ + N + + + D G Sbjct: 437 IQKAINKTSAYSLLDNPTGIYVAQGQELVVLVADAHNEDMGICIQNLDKPGGDGFGGD-- 494 Query: 1127 NRPPRVTKTYSLDASGTVKFKVPYGGLIYIKGNSST------NESASFTFTGVVKAPFYK 1180 TY L +G K KV GL+Y+ ++++ + F ++ Sbjct: 495 --------TYPL-TTGVNKIKVKNKGLVYVIYHTTSLEELAGKQPVKIHFASGKVNGYFD 545 Query: 1181 DGA-----WKNDLNSPA-PLGELESDAFVYTTPKKNLNASNYTGGLEQFANDLDTFASSM 1234 W LN+ ++ T P L S G ++ + D Sbjct: 546 SNKHEASRWSELLNNTVCGYFDVLGTYAHLTFPVNRLRNSTGNRG-KELIDLYDEIVEKE 604 Query: 1235 NDFYGRDSEDGKHRMFTYKNLPGHKHRFTNDVQISIGDAHSGYPVMNSSFSPNSTTLPTT 1294 F G G Y N+ H + +D H+ Y + Sbjct: 605 QIFMGLKKYGGMFMNRMYLNVMYHNFMYASDY-------HTAY---HDDTMDELCNPDRL 654 Query: 1295 PL-NDWLIWHEVGH-NAAETPLTVPGATEVANNVLALYMQDRYLGKMNRVADDITVAPEY 1352 W HE+GH N L G TEV NN+++ Y+Q G +R+ + + Sbjct: 655 KTTGCWGPAHEIGHCNQTRPGLKWHGLTEVTNNIMSQYIQTTVWGNTSRLQSEGWYTKAW 714 Query: 1353 ---LEESNNQAWARGGAGDRLLMYAQLKEWAEKNFDIKKWYPDGTPLPEFYSEREGMKGW 1409 + + A +L+ + QL+ + W P + GW Sbjct: 715 DEIIAKRRAHAQET-DFFMKLVPFWQLELY---------WGKVKGFTP------KESNGW 758 Query: 1410 N-LFQLMHRKARGDEVSNDKFGGKNYCAESNGNAADTLMLCASWVAQTDLSEFFKKWNPG 1468 + + ++ R + + G + A+ DL+ FF+KW Sbjct: 759 DGFYPQIYEHIRKNPDLP-----------TAGEQQLEFVYICCLKAEKDLTGFFRKWGFL 807 Query: 1469 ANAYQLPGASEMSFEGGVSQSAYNTL-ASLDLPKPEQGPETINQVTEHKMS 1518 + V+Q + + A +DL E+ +T+ ++ Sbjct: 808 TPVDVT-VDDYGDGKIIVTQKQIDEILAKIDLLGFEKETAAFEYITDDNLN 857 >UniRef50_B6A9M3 Putative uncharacterized protein n=1 Tax=Cryptosporidium muris RN66 RepID=B6A9M3_9CRYT Length = 978 Score = 231 bits (589), Expect = 1e-58, Method: Composition-based stats. Identities = 86/467 (18%), Positives = 140/467 (29%), Gaps = 114/467 (24%) Query: 1076 TKWFAGNMQSTGLWAPAQKEVTIKS-------------------NANVPVTVTVALADDL 1116 + Q+ GL+ K + + + V + V + D+ Sbjct: 500 PILISNGWQTLGLYIIPNKTLELFFFKDKIINKYFNEDVIIHSKSNYVTIRVQIGCHTDI 559 Query: 1117 TGREKHEVALNRPPRVTKTY----SLDASGTV-KFKVPYGGLIYIKGNSS-TNESASFTF 1170 + L R P + K+Y L+ S K PYGGL+Y + + Sbjct: 560 LRTDSDSSPLQRLPIIVKSYIWKIDLNNSRNNISIKSPYGGLLYFELMDRWVKDKFQSNI 619 Query: 1171 TGVV---------KAPFYKDG--------------------------AWKNDLNSP-APL 1194 G V APF+ WK+ +N+ AP Sbjct: 620 LGYVYTNSSESCETAPFFVSSKINKDVEFPQVNLGETGQIICTSSINEWKDIINTKSAPW 679 Query: 1195 GELESDAFVYTTPKKNLNASNYTGGLEQFANDLDTFASSMNDFYGRDSEDGKHRMFTYKN 1254 GEL + T P L + ++ + D ++ Sbjct: 680 GELHGLRVIITLPLSTLK---FIENPQEIVDFWDNVIQIQDELLNEG------------- 723 Query: 1255 LPGHKHRFTNDVQISIGDAHSGYPVMNSSFS-----PNSTTLPTTPLNDWLIWHEVGHNA 1309 L K R D+QIS G HSGYP+M DW ++HE+GHN Sbjct: 724 LNDRKERIVCDIQISDGYMHSGYPIMTHMDMCQRHGKLVNIPEIMIEGDWGLYHEIGHNR 783 Query: 1310 AETPLTVPGATEVANNVLALYMQDRYLG-----KMNRVADDITVAPEYLEESNNQAWARG 1364 + T G EV N+ LY ++ +++ + D + EYL + + G Sbjct: 784 QKPQWTFAGTEEVTVNIFTLYTFNKLHAYLKPFQIDFINDQKSKVLEYLGSVKDGNFPSG 843 Query: 1365 GAGDRLLMYAQLKEWAEKNFDIKKWYPDGTPLPEFYSEREGMKGWNLFQLMHRKARGDEV 1424 + + W D Y GWN + + E+ Sbjct: 844 ELFNSI------------------WKADPGIALYTYLVIIIYYGWNSIKKVFELYDQCEL 885 Query: 1425 SNDKFGGKNYCAESNGNAADTLMLCASWVAQTDLSEFFKKWNPGANA 1471 + + +L S DL +FK+W+ N Sbjct: 886 PET---------IQDQDKIQIWILLLSLTTNCDLRPYFKQWDWEINT 923 >UniRef50_C5LZE4 Putative uncharacterized protein n=1 Tax=Perkinsus marinus ATCC 50983 RepID=C5LZE4_9ALVE Length = 1326 Score = 230 bits (585), Expect = 4e-58, Method: Composition-based stats. Identities = 87/537 (16%), Positives = 167/537 (31%), Gaps = 120/537 (22%) Query: 1041 SWWDLNIKVDVEKYPGAVSE-------------EGQNVTETISL------YSNPTKWFAG 1081 S+ D+ + +PG G+ V T +S+ T Sbjct: 312 SFNDVLALAG-KDFPGVCPPMGQDALSRSQGIINGEEVKITFEAIAKDTPHSSVTGRSFD 370 Query: 1082 NMQSTGLWAPAQKEVTIKSN--------------ANVPVTVTVALADD---------LTG 1118 STG++A + + ++ P+++ L DD Sbjct: 371 PWISTGVYARPGEPIRVRLESLIRAGNAQGAVKGEGSPLSMEEVLTDDFGFRVRIGCHKD 430 Query: 1119 REKHEVALNRPPRVT-KTYSLDASGTVKFK--VPYGGLIYIK------GNSSTNESASF- 1168 + R PR++ + + + ++ + +GGL+Y++ S + + Sbjct: 431 DNTKHDSWKRWPRISAVSKDILSGRNLEVELVSVFGGLVYLERIRENGVEPSKAQGINLN 490 Query: 1169 ---------TFTGVVKAPFYKDGAW-------KNDLNSP------APLGELESDAFVYTT 1206 G V ++ +G W +N N P P GE+++ + + Sbjct: 491 VDSILMVCARVQGGVPTLWWSNGVWMLGGHPVENLKNLPRAGEAYPPWGEIQAKHVILSL 550 Query: 1207 PKKNLNASNYTGGLEQFANDLDTFASSMNDFYGRDSEDGKHRMFTYKNLPGHKHRFTNDV 1266 P L + ++ D R D R + + RF DV Sbjct: 551 PMHLLLPHALPSSGGSPRDGWADTIAAFWDRVIRSHCDLACRPVCGEE----RFRFVFDV 606 Query: 1267 QISIGDAHSGYPVMN--SSFSPNSTTL-------------PTTPLNDWLIWHEVGHNAAE 1311 +IS G HSGYP+M + + ++ + DW ++HE+GH+ Sbjct: 607 RISAGWMHSGYPIMAHENPTAEDACAVWGEGNLRGARQGGKLLLEGDWGLFHELGHHFQR 666 Query: 1312 TPLTVPGATEVANNVLALYMQDRYLGK--MNRVADDITVAPEYLEESN-NQAWARGGAGD 1368 T EV N+ ++Y +G+ R +++ E E+ + + GG G Sbjct: 667 RRWTYKACGEVTVNLFSMYSMITIIGRKIPTRDMENVREGHEKAEKFITERIQSEGGGGA 726 Query: 1369 RLLMYAQLKEWAEKNFDIKKWYPDGTPLPEFYSEREGMKGWNLFQLMHRKARGDEVSNDK 1428 + + W Y + GW+L + + R D ++ Sbjct: 727 IYSAAHNRRPITDALDQWSPWVG-----LTMYVDIIETFGWDLLKGVLRSYEQDGTFGEE 781 Query: 1429 FGGKNYCAESNGNAADTLMLCASWVAQTDLSEFFKKWNPGANAYQLPGASEMSFEGG 1485 + + S LS + + W +M EGG Sbjct: 782 PDPTEWWS------------RVSRACGKGLSMYCRIWGVPI------DLDKMDREGG 820 >UniRef50_Q5LB88 Putative lipoprotein n=9 Tax=Bacteroides RepID=Q5LB88_BACFN Length = 939 Score = 230 bits (585), Expect = 5e-58, Method: Composition-based stats. Identities = 64/473 (13%), Positives = 142/473 (30%), Gaps = 77/473 (16%) Query: 1073 SNPTKWFAGNMQSTGLWAPAQKEVTIKSNANVPVTVTVALAD-DLTGREKHEVALNRPPR 1131 ++ T ++ TG+ +++ I TV+ + + D+ G + Sbjct: 435 THKTSPYSLRDNPTGISVKDGEQLMIFVGDTHGQTVSAVIQNLDVPGGDGFGG------- 487 Query: 1132 VTKTYSLDASGTVKFKVPYGGLIYI---KGNSSTNESASFTFTGVVKAPFYKD-----GA 1183 +Y L + G K GL+YI + T + ++ Sbjct: 488 --TSYPL-SEGANKITARNKGLMYILYHTPDYETAQPVKIHIASGQVNGYFDVAKHQASD 544 Query: 1184 WKNDLNSPAP-LGELESDAFVYTTPKKNLNASNYTGGLEQFANDLDTFASSMNDFYGRDS 1242 W L++ ++ T P + + + D +S + G Sbjct: 545 WNKLLSNAVDKYFDVVGHYAHLTFPTERFRTHTP--DGKALIDAYDQIVNSEMELMGLYK 602 Query: 1243 EDGKHRMFTYKNLPGHKHRFTNDVQISIGDAHSGYPVMNSSFSPNSTTLPTTPLN-DWLI 1301 + + Y ++ S A S + N + + W Sbjct: 603 YNKLFKNRMYLHVMYT----------SYMYATSYHTAYNDGTLAELCNVDKLKTSACWGP 652 Query: 1302 WHEVGH-NAAETPLTVPGATEVANNVLALYMQDRYLGKMNRVADDI-------TVAPEYL 1353 HE+GH N L G TEV NN+++ Y+Q G+ +R+ + + + Sbjct: 653 AHEIGHCNQTRPGLKWLGTTEVTNNIMSEYIQTTIFGQPSRLQTEDMGDGSRNRYSKAWT 712 Query: 1354 E-----ESNNQAWARGGAGDRLLMYAQLKEWAEKNFDIKKWYPDGTPLPEFYSEREGMKG 1408 + + + +L+ + QL+ + K + Sbjct: 713 QIIAAGAPHGNFGSDSDVFCKLVPFWQLELYFGKVLGRTP--------------LQQSDK 758 Query: 1409 WNLFQLMHRKARGDEVSNDKFGGKNYCAESNGNAADTLMLCASWVAQTDLSEFFKKWNPG 1468 + ++ R + + G + S +A+ +L +FF KW Sbjct: 759 GGFYPDVYEYIRTHDNLR-----------TAGEQQTEFVYICSLIAKANLLDFFTKWGFL 807 Query: 1469 ANAYQLPGAS---EMSFEGGVSQSAYNTLASLDLPKPEQGPETINQVTEHKMS 1518 +++ + + +L PKP+ + +T++ + Sbjct: 808 TPVDITVDDYGTGKLTVTQARIDEIRSRVEALGYPKPDVA---LEYITDNSVE 857 >UniRef50_C7BLH9 Putative uncharacterized protein n=1 Tax=Photorhabdus asymbiotica subsp. asymbiotica ATCC 43949 RepID=C7BLH9_PHOAA Length = 793 Score = 225 bits (572), Expect = 1e-56, Method: Composition-based stats. Identities = 73/456 (16%), Positives = 144/456 (31%), Gaps = 74/456 (16%) Query: 1063 QNVTETISLYSNPTKWFAGNMQSTGLWAPAQKEVTIKSNANVPVTVTVALADDLTGREKH 1122 N + +Q TG + + +E+ + +V + + + Sbjct: 12 GNGSANEYKKMQRRSLAHSELQPTGFYVISGQEIMVNVEGETDGSVNAVIGVPELNKPE- 70 Query: 1123 EVALNRPPRVTKTYSLDASGTVKFKVPYGGLIYIKGNSSTNESASFTFTGVVKAPFYKDG 1182 L +G KF GL+ N++ + + K P +K Sbjct: 71 -------------KYLLTTGLNKFTSKNEGLLSFTNNNNHGYVKIIIQSELQKIPSFKLN 117 Query: 1183 A-----WKNDLNS--PAPLGELESDAFVYTTPKKNLNASNYTGGLEQFANDLDTFASSMN 1235 W+N + S AP+ +L S+ + + Y D F + Sbjct: 118 ETNNTDWENMMASYSDAPVVQLSSERAIIVVRYNSAKK--YLTDPNALMKYYDDFIRFQD 175 Query: 1236 DFYGRDSEDGKHRMFTYKNLPGHKHRFTNDVQISIGDAHSGYPVMNSSFSPNSTTLPTTP 1295 + G + + +K + ++ + M + L T Sbjct: 176 NISGILEDGKADY-----RVDSNKFLYVEADRL---YMFATNGHMGFNGDAALQRLLT-T 226 Query: 1296 LNDWLIWHEVGHNAAETPLTVPG---ATEVANNVLALYMQDRYLGKMNRVADDITVAPEY 1352 N W IWHE GH ++P T G TEV N+ +L +Q+ + + + + EY Sbjct: 227 NNGWGIWHESGHQRQQSPYTWSGGTGMTEVTVNLYSLAVQEGFHDRASFIDKYYPKIKEY 286 Query: 1353 LEESNNQAWARGGAGDRLLMYAQLKEWAEKNFDIKKWYPDGTPLPEFYSEREGMKGWNLF 1412 L + + + +L M QL+ G + Sbjct: 287 L-VTEEKNFDAQDINIKLGMLWQLRL---------------------------TFGNGFY 318 Query: 1413 QLMHRKARGDEVSNDKFGGKNYCAESNGNAADTLMLCASWVAQTDLSEFFKKWNPGANAY 1472 +H+ R + SN + L++ +S + +L+ FF KW N Sbjct: 319 PQLHQAYRLMDS----------LPVSNDDKKQQLIISSSQLTNINLAAFFDKWGITPNEK 368 Query: 1473 QLPGASEM-SFEGGVSQSAYNTLASLDLPKPEQGPE 1507 L + E + ++ ++++P+ + PE Sbjct: 369 TLEILKTLPPLEKNIWENDDKNSITIEMPQQKYVPE 404 >UniRef50_Q87WH6 Putative uncharacterized protein n=3 Tax=Pseudomonas syringae group RepID=Q87WH6_PSESM Length = 827 Score = 225 bits (572), Expect = 1e-56, Method: Composition-based stats. Identities = 75/459 (16%), Positives = 143/459 (31%), Gaps = 74/459 (16%) Query: 1063 QNVTETISLYSNPTKWFAGNMQSTGLWAPAQKEVTIKSNANVPVTVTVALADDLTGREKH 1122 + ++ + Q TG++ + V + + V + Sbjct: 27 GRTSADLNKVRQRRSLPHSDYQPTGIYVTKGEHVELSYYQDTTVKIWAVFG--------- 77 Query: 1123 EVALNRPPRVTK-TYSLDASGTVKFKVPYGGLIYIKGNSSTNESASFTFTG---VVKAPF 1178 P + K L G F+V GL+ S + G V A + Sbjct: 78 ------VPELNKPVTELLVFGYNSFEVRESGLLSFIC-QDAGHSVTVIIKGAYSGVPAFW 130 Query: 1179 YK---DGAWKNDL--NSPAPLGELESDAFVYTTPKKNLNASNYTGGLEQFANDLDTFASS 1233 K + W+N + + AP+ L S+ + ++ +Y E+ D S Sbjct: 131 LKETTNAMWQNMMVQYNNAPVVMLTSERAIIVVRHESAR--DYITDPEKLMTYYDELIRS 188 Query: 1234 MNDFYGRDSEDGKHRMFTYKNLPGHKHRFTNDVQISIGDAHSGYPVMNSSFSPNSTTL-P 1292 +D G E T + +KH + + + M + + + L Sbjct: 189 QDDISGVLGEGE-----TEWAIDPNKHLYVEADSL---YMFATNGHMGFTGATALSYLLS 240 Query: 1293 TTPLNDWLIWHEVGHNAAETPLTVPGATEVANNVLALYMQDRYLGKMNRVADDITVAPEY 1352 P W WHE GH +P+ TEV N+ +L Q+R G+ +R+ + EY Sbjct: 241 GNPAQGWGPWHESGHQRQLSPMNWEDMTEVTVNIYSLATQERMEGRASRLDVEYPFIKEY 300 Query: 1353 LEESNNQAWARGGAGDRLLMYAQLKEWAEKNFDIKKWYPDGTPLPEFYSEREGMKGWNLF 1412 L + + R++M QL + Sbjct: 301 LNSPHREFSRLPDHFQRVVMLWQLHL---------------------------TFRTGFY 333 Query: 1413 QLMHRKARGDEVSNDKFGGKNYCAESNGNAADTLMLCASWVAQTDLSEFFKKWNPGANAY 1472 +H++ R + + + + ++ S ++ DLS FF +W Sbjct: 334 PQLHQRYRLMQN----------LPQGSEDVLQRFIVETSLLSGRDLSTFFDRWGIYPTPE 383 Query: 1473 Q-LPGASEMSFEGGVSQSAYNTLASLDLPKPEQGPETIN 1510 + + E + ++ T + LP PE + Sbjct: 384 TLRQISDLLPLEKNIWETDATTSFPIFLPVLTYFPELAH 422 >UniRef50_B1KGL9 Putative uncharacterized protein n=4 Tax=Shewanella RepID=B1KGL9_SHEWM Length = 991 Score = 220 bits (561), Expect = 2e-55, Method: Composition-based stats. Identities = 86/524 (16%), Positives = 168/524 (32%), Gaps = 59/524 (11%) Query: 999 LKKSLVDNNMIYGDGSSKAGMMNPSYPLNYMEKPLTRLMLGRSWWDLNIKVDVEKYPGAV 1058 ++S I G S + SYP++ T L + D + E P Sbjct: 456 KQESHQLYKYITLLGDSYRSQL--SYPMDVATSD-TMDYLQAMFADNTVYNYREINPAPA 512 Query: 1059 SEEGQNVTETISLYSNPTKWFAGNMQ---STGLWAPAQKEVTIKSNANVPVTVTVALADD 1115 + T+ + + Q S G++A + VT+ N + V V + Sbjct: 513 DLGNFSRTDFSHITPTDKSVSITSKQGFRSAGVYALPGQTVTVSRNDSSDVKTWVFINTQ 572 Query: 1116 LTGR--EKHEVALNRPPRV-TKTYSLDASGTVKFKVPYGGLIYIKGNSSTNESASFTFTG 1172 + E NRP + + + T+KF PYGG + +K + + + FTF+ Sbjct: 573 RSASTHEYATNGYNRPKYLQSTHVEIKPGETIKFTSPYGGPMQVKFD-KGDLATQFTFSS 631 Query: 1173 VVKAPFYKDG----AWKNDLN-SPAPLGELESDAFVYTTPKKNLNASNYTGGLEQFANDL 1227 V P++++G + LN S EL ++ F + + + L + Sbjct: 632 VGLHPYWRNGMDGAQFMQQLNDSEFDWAELATEHFEVHSRLDKMKTTMSHEPLWDTPEKM 691 Query: 1228 DT-FASSMNDFYGRDSEDGKHRMFTYKNLPGHK-------------HRFTNDVQISIGDA 1273 + ++++ + + + + D Q + G Sbjct: 692 GQAIMTHVHNYPHLLAGFKGPYIDSVSEITDFAIAQGWELDNLDTVKHMNAD-QATCGAG 750 Query: 1274 HSGYPVMNSSFSPNSTTLPTTPLNDWLIWHEVGHNAAETPLTVPG-ATEVANNVLALYMQ 1332 SG P + N + PT HE+GH + L G + N + Y + Sbjct: 751 CSGNP-----YDANWSFSPTGH----GDIHELGHGLEKGKLRFDGHEGHASTNPYSYYTK 801 Query: 1333 DRYLGKMNRVADDITVAPEYLEESNNQAWARGGAGDRLLMYAQLKEWAEKNFDIKKWYPD 1392 R + ++ ++ + E + + + + A+L W+ + + Sbjct: 802 SRGFKESGKLPSCQGLSIKDEFEVLQASMKQADPFNYMQE-AKLTSWSNGMATMLQMMVA 860 Query: 1393 GTPLPEFYSEREGMKGWNLFQLMHRKARGDE------------VSNDKFGGKNYCAESNG 1440 GW+L +H R E + F + A Sbjct: 861 AQK------NGALEDGWHLLARLHILLREFERAKTSEALWLQKRAQLGFSQFSLDAAKGI 914 Query: 1441 NAADTLMLCASWVAQTDLSEFFKKWNPGANAYQLPGASEMSFEG 1484 + D LM+ S+ Q D E ++ W + + +F Sbjct: 915 SNNDFLMVAMSYSTQLDYREVYQMWGLATSQAAKDQVAGFNFSM 958 >UniRef50_C2G2A5 Putative uncharacterized protein n=2 Tax=Sphingobacterium spiritivorum RepID=C2G2A5_9SPHI Length = 529 Score = 218 bits (555), Expect = 1e-54, Method: Composition-based stats. Identities = 65/441 (14%), Positives = 137/441 (31%), Gaps = 38/441 (8%) Query: 1086 TGLWAPAQKEVTIKSNANVPVTVTVALADDLTGREKHEVALN---RPPRVTKTYSLDASG 1142 TG++ P K+ + + + + + + + + + L Sbjct: 108 TGIYLPVGKQTILVDGISAKNQIKLVIPNWDRHAPQGIDPTKDPKGWGIEKQEFDLRNGV 167 Query: 1143 TVKFKVPYGGLIYIKGNSSTNE---SASFTFTGVVKAPFYK-----DGAWKNDLNSPA-P 1193 + +GGL YI S + F ++ D W N ++ P Sbjct: 168 NLINIKDFGGLAYITYFSDKPKEQTPIQVHFLNAAVNGYFDISKHNDQDWNNLVDHAVYP 227 Query: 1194 LGELESDAFVYTTPKKNLNASNYTGGLEQFANDLDTFASSMNDFYGRDSEDGKHRMFTYK 1253 + + P + + Y G+ Q ++ D+ + G + Sbjct: 228 IVDAIGKHIQIAYPTEAIKKYAYGKGV-QLISNYDSLVYRQHRILGLIKHNRVPENKILS 286 Query: 1254 NLPGHKHRFTNDVQISIGDAHSGYPVMNSSFSPNSTTLPTTPLNDWLIWHEVGHNAAETP 1313 + + + F + ++ GY + + W HEVGH P Sbjct: 287 RVNYNYYMFRDGDGVAYMGDKPGYAMAMVVDPASVI----KGDPCWGFSHEVGHVHQTRP 342 Query: 1314 L-TVPGATEVANNVLALYMQDRYLGKMNRVAD-DITVAPEYLEESNNQAWARGGAGDRLL 1371 + G EV+NN+ +LY+ + K + L G +RL+ Sbjct: 343 YLSWGGLGEVSNNIFSLYVTTSFGNKSRLSEQNNYEKVRNELYGKGKSYLQDGDVFNRLV 402 Query: 1372 MYAQLKEWAEKNFDIKKWYPDGTPLPEFYSEREGMKGWNLFQLMHRKARGDEVSNDKFGG 1431 + QL+ + + P+FY + + K R SN G Sbjct: 403 PFWQLQLY----------FAGQGVNPDFYPDLFEAFRMQAAADL-TKNRRSRRSNFMDRG 451 Query: 1432 KNYCAESNGNAADTLMLCASWVAQTDLSEFFKKWN-PGANAYQLPGASEMSFE--GGVSQ 1488 +N + + + DL+EFF + +++ + ++E + Sbjct: 452 QNPAVYQ-----LNFVKTVCEIGKVDLTEFFDGYGFFYVGKFEMDDYGKYNYEMTQAMVD 506 Query: 1489 SAYNTLASLDLPKPEQGPETI 1509 + +++LPKP+ + Sbjct: 507 ECKAAIRNMNLPKPKTDLSLL 527 >UniRef50_UPI0001B7B86F Similar to experimental autoimmune prostatitis antigen 2. n=1 Tax=Rattus norvegicus RepID=UPI0001B7B86F Length = 864 Score = 218 bits (555), Expect = 1e-54, Method: Composition-based stats. Identities = 70/333 (21%), Positives = 105/333 (31%), Gaps = 56/333 (16%) Query: 1158 GNSSTNESASFTFTGVVKAPFYKDG-----AWKNDLN-SPAPLGELESDAFVYTTPKKNL 1211 N S + + V AP + G WKN + S AP GEL +D + T P NL Sbjct: 560 NNKSEPDQQAAHINRAVPAPHFGLGKTTQEEWKNLIEHSKAPWGELATDNIILTIPTVNL 619 Query: 1212 NASNYTGGLEQFANDLDTFASSMNDFYGRDSEDGKHRMFTYKNLPGHKHRFTNDVQISIG 1271 L Q D ++ R R D QIS G Sbjct: 620 KVLQDPYPLLQL---WDKIVKAVAKLAAR------------PFPFQRPERIVLDKQISFG 664 Query: 1272 DAHSGYPVMNSSFSPN--STTLPTTPLNDWLIWHEVGHNAAETPLTV-PGATEVANNVLA 1328 HSGYP+M W + HE+GHN + T P TE N+ + Sbjct: 665 FLHSGYPIMGLIIIVEGIINEFKIRSHGVWGVTHELGHNHQKPGWTFRPHTTEALCNLWS 724 Query: 1329 LYMQDRYLGKMNRVADDITVAPEYLEESNNQAWARGGAGDRLLMYAQLKEWAEKNFDIKK 1388 +Y+ + L + R ++ PE ++ +G A L W Sbjct: 725 IYVHETVLN-IPRDQAHPSLNPELRKQRIKDHLNKG---------APLSNWI-------- 766 Query: 1389 WYPDGTPLPEFYSEREGMKGWNLFQLMHRKARGDEVSNDKFGGKNYCAESNGNAADTLML 1448 E Y + + GW F + ++N + + + Sbjct: 767 ----VWTALETYLQLQEGFGWEPFIQLFANY----------QTLTGLPQNNEDKMNLWVK 812 Query: 1449 CASWVAQTDLSEFFKKWNPGANAYQLPGASEMS 1481 S V Q +L+ FFK W + + Sbjct: 813 KFSEVVQKNLAPFFKAWGWPVQHAVAKSLASLP 845 >UniRef50_A5ZF31 Putative uncharacterized protein n=1 Tax=Bacteroides caccae ATCC 43185 RepID=A5ZF31_9BACE Length = 959 Score = 217 bits (553), Expect = 2e-54, Method: Composition-based stats. Identities = 77/465 (16%), Positives = 153/465 (32%), Gaps = 63/465 (13%) Query: 1069 ISLYSNPTKWFAGNMQSTGLWAPAQKEVTIKSNANVPVTVTVALADDLTGREKHEVALNR 1128 I N T ++ TG++ PA + + + ++ N V + R ++ + Sbjct: 454 IDAAVNKTAKYSLLDNPTGIFVPAGENLVVMADLNGLDKVNI--------RVQNLDKPGQ 505 Query: 1129 PPRVTKTYSLDASGTVKFKVPYGGLIYIKG---NSSTNESASFTFTGVVKAPFYKD---- 1181 Y++ +G + GL+Y+ + + F +Y Sbjct: 506 DGFGGTEYTI-VNGVNTISIKEKGLVYVMYHKDDYENAPEITLHFASGKVNGYYDSQNPK 564 Query: 1182 --GAWKNDLNSPAPL-GELESDAFVYTTPKKNLNASNYTGGLEQFANDLDTFASSMNDFY 1238 G WK LN+ ++ T ++ YT ++ N D +F Sbjct: 565 LKGRWKELLNNSVDTHFDVIGKYVHLTFTTRSFLN--YTKDVDNLINLYDDMIYRQQEFL 622 Query: 1239 GRDSEDGKHRMFTYKNLPGHKHRFTNDVQISIGDAHSGYPVMNSSFSPNSTTLPTTPLND 1298 G + D +Y ++ + + + D H+ Y S N Sbjct: 623 GLEKYDRMFHNRSYFHV-----HYNSGSFMYATDYHTAY---IESSLNYLADETQMAANC 674 Query: 1299 WLIWHEVGHNAAETP-LTVPGATEVANNVLALYMQDRYLGKMNR--VADDITVAPEYLEE 1355 W HE+GH P L G TEV NN+ A+Y+Q + + +R V D A + Sbjct: 675 WGPAHELGHIHQTRPGLKWHGMTEVTNNITAIYVQTKVYNEPSRLTVQDRYVSAFNSIMA 734 Query: 1356 SNNQAWARGGAGDRLLMYAQLKEWAEKNFDIKKWYPDGTPLPEFYSEREGMKGWNLFQLM 1415 A ++L+ + QL+ + + + G + + Sbjct: 735 GQKAHNAESDVFNKLVPFWQLELY----------FGEVKGNTPMKRTDHGG----FYADI 780 Query: 1416 HRKARGDEVSNDKFGGKNYCAESNGNAADTLMLCASWVAQTDLSEFFKKWNPGANAYQLP 1475 + K R ++G + + A DL+ FF+KW + + Sbjct: 781 YEKVRTTAN-----------PTTDGLCQLEFVYNSCVSAGMDLTGFFEKWGFLSPIDMMI 829 Query: 1476 GAS---EMSFEGGVSQSAYNTLASLDLPKPEQGPETINQVTEHKM 1517 G + + + N + +L LPK + + + ++ + Sbjct: 830 GDYTNKQFTITETEIANVRNRIVALGLPK---CTDAVEYIVDNTV 871 >UniRef50_A1JU21 Putative exported protein n=4 Tax=Yersinia RepID=A1JU21_YERE8 Length = 808 Score = 213 bits (543), Expect = 3e-53, Method: Composition-based stats. Identities = 73/434 (16%), Positives = 126/434 (29%), Gaps = 59/434 (13%) Query: 1062 GQNVTETISLYSNPTKWFAGNMQSTGLWAPAQKEVTIKSNANVPVTVTVALADDLTGREK 1121 QN + F QS+G WA + I+ + + R+ Sbjct: 59 VQNGSPIYFNEQQKRNRFTHPTQSSGFWANKGDNLRIEYQHQGEDIDALPELWIVPVRKG 118 Query: 1122 HEVALNRPPRVTKTYSLDASGTVKFKVPYGGLIYIKGNSSTNES-ASFTF-TGVVKAPFY 1179 +E R + G + +V G +Y + S + G P + Sbjct: 119 NEEDFERQVVKLQR------GINEIEVENTGPLYFVATNQPGSSEITVNLLEGGKPMPRF 172 Query: 1180 KDGA-----WKNDL--NSPAPLGELESDAFVYTTPKKNLNASNYTGGLEQFANDLDTFAS 1232 G W+ L AP EL + T P ++ E D S Sbjct: 173 ILGENTAEDWQAQLVKFGDAPFAELVGKRMILTMPIADMRE--KATDPEGVLVLWDRIVS 230 Query: 1233 SMNDFYGRDSEDGKHRMFTYKNLPGHKHRFTNDVQISIGDAHSGYPVM--NSSFSPNSTT 1290 + +G + + +++F + + G + + N+S Sbjct: 231 LAEEQFGLSA-----KRAFPHRATPFQYQFVSKPDSTPGYMSAANYWLGSNASGIAEVIN 285 Query: 1291 LPTTPLNDWLIWHEVGHNAAETPLTVPGATEVANNVLALYMQDRYLGKMNRVADDITVAP 1350 W WHE+GH+ T G TEV N+ +LY+Q G AD Sbjct: 286 TDKLQNEGWGPWHELGHHYQMPAWTFNGNTEVTVNLTSLYVQRALGGTSRMEADSRWEDI 345 Query: 1351 EYLEESNNQAWARGGAGDRLLMYAQLKEWAEKNFDIKKWYPDGTPLPEFYSEREGMKGWN 1410 + ++A+ G L M+ QL + G + Sbjct: 346 AVFLKDKSKAYEDAGVFVSLGMFWQL---------------------------DLAFGKD 378 Query: 1411 LFQLMHRKARGDEVSNDKFGGKNYCAESNGNAADTLMLCASWVAQTDLSEFFKKWNPGAN 1470 +Q + + R + ++N +L AS V+ L FF +W Sbjct: 379 FYQRLGDRYRTLTPAEQ--------PKNNDEKKQRFLLEASRVSGMSLKSFFHQWGIKPT 430 Query: 1471 AYQLPGASEMSFEG 1484 M+ Sbjct: 431 KETSAQLKAMNLPQ 444 >UniRef50_C2C0C9 Possible wall-associated protein n=1 Tax=Listeria grayi DSM 20601 RepID=C2C0C9_LISGR Length = 1217 Score = 213 bits (541), Expect = 6e-53, Method: Composition-based stats. Identities = 69/455 (15%), Positives = 127/455 (27%), Gaps = 55/455 (12%) Query: 1063 QNVTETISLYSNPTKWFAGNMQSTGLWAPAQKEVTIKSNANV-PVTVTVALADDLTGREK 1121 S + +STGL+ E+T++ + + V T Sbjct: 62 PRGNPATSRAREQRGNEHTSFESTGLFLYKGDEITVEVEGEPENLELRVGQWGGYTNTPY 121 Query: 1122 HEVALNRPPRVTKTYSLDASGTVKFKVPYGGLIYIKGNSSTNESASFTFTGVVKAPFYKD 1181 L + GG++Y+ N ST ++ + G VK P+Y Sbjct: 122 IATGSQSYAFKEGVVKLHGGTNTFTRNFSGGMVYL-VNYSTTKAENVAIKGGVKVPYYVQ 180 Query: 1182 GA-----WKNDLNS--PAPLGELESDAFVYTTPKKNLNASNYTGGLEQFANDLDTFASSM 1234 G +KN+L P E ++ + T + + N +D F SS+ Sbjct: 181 GKTSIDSFKNELEQYKDVPFMEFVNNDAIATIRIDRAK------DIFEKGNQVDVFMSSL 234 Query: 1235 NDFYGRDSEDGKHRMFTYKNLPGHKHRF-TNDVQISIGDAHSGYP---VMNSSFSPNSTT 1290 + R + + G + + +++ Sbjct: 235 AKIVKLQNGAAGLSYDGQGAERKDLQRIHVMNPEWGAGQLFATNNFIGIHSATTKDREIF 294 Query: 1291 LPTTPLNDWLIWHEVGHNAAETPLTVPGATEVANNVLALYMQDRYLGKMNRVADDITVAP 1350 DW + HE GH TEV N+ A Y Q + D Sbjct: 295 SKGLDSTDWGMLHESGHTYQNKMYQWRNMTEVTVNIYADYAQKMWSADGTG-RYDAVNVK 353 Query: 1351 EYLEESNNQAWARGGAGDRLLMYAQLKEWAEKNFDIKKWYPDGTPLPEFYSEREGMKGWN 1410 S + + + + E + +F L + G++ Sbjct: 354 NGSRASVQKYFKKLETDPTWNFDRESTETNDYHFA----------LLGMFLTLPRTFGYD 403 Query: 1411 LFQLMHRKARGDEVSNDKFGGKNYCAESNGNAADTLMLCASWVAQTDLSEFFKKWNPGAN 1470 + ++++ R + +L S VA DL+ FF+ W Sbjct: 404 FYPVLNQSYRSLPEEELPKTEEE--------QKQLFILMTSKVAHRDLTPFFEHWRF--- 452 Query: 1471 AYQLPGASEMSFEGGVSQSAYNTLASLDLPKPEQG 1505 ++ L +L LP E+ Sbjct: 453 --------------TITDETKEKLKALKLPTLEKE 473 >UniRef50_Q2U0W8 Predicted protein n=2 Tax=Aspergillus RepID=Q2U0W8_ASPOR Length = 673 Score = 212 bits (540), Expect = 8e-53, Method: Composition-based stats. Identities = 65/452 (14%), Positives = 134/452 (29%), Gaps = 58/452 (12%) Query: 1048 KVDVEKYPG-AVSEEGQNVTETISLYSNPTKWFAGNMQSTGLWAPAQKEVTIKSNANVPV 1106 + D K+P + V+ ++ ++QSTG + + +T+ ++V Sbjct: 237 QTDPSKFPQPRTFKVSPLVSPPDEALRLRQQFHWSDLQSTGFYLNPNEPLTVFVESSVRD 296 Query: 1107 TVTVALADDLTGREKHEVALNRPPRVTKTYSLDASGTVKFKVPYGGLIYIKG----NSST 1162 L + P +G K +GG+IYI+ + Sbjct: 297 GPKPRLVLGPPALVHPDHGKEHVPAQLVELPPLENGRNKSVHNFGGIIYIRYTHRASDQP 356 Query: 1163 NESASFTFTG-VVKAPFYKDG-----AWKNDLN-SPAPLGELESDAFVYTTPKKNLNAS- 1214 P +++G WK+ L+ + P E + + T K+ Sbjct: 357 PPPVFLRLGDTAEPFPLFREGSTTDAQWKSMLDVTKVPFAEHDGKRVIITGLAKHAKKYA 416 Query: 1215 NYTGGLEQFANDLDTFASSMNDFYGRDSEDGKHRMFTYKNLPGHKHRFTNDVQISIGDAH 1274 + ++ + + + F R + P + S Sbjct: 417 DNGQRQQELLDTYAHIIAIQDRFSALKYNARDPR-----DRPSLLRPMVVESVNSGVATA 471 Query: 1275 SGYPVMNSSFSPNSTTLPTTPLNDWLIWHEVGHNAA-ETPLTVPGATEVANNVLALY-MQ 1332 + Y + + N W+++HE+GH + TEV N+ +L ++ Sbjct: 472 TNYRAAIPNRLSDQIYWVPRLRNSWMVFHELGHQRQITRTWSWRAMTEVTVNIYSLANLR 531 Query: 1333 DRYLGKMNRVADDITVAPEYLEESNNQAWARGGAGDRLLMYAQLKEWAEKNFDIKKWYPD 1392 + VA+ + S + + G L+M+ QL+ Sbjct: 532 EYKPPGHKNVAEWDNAKQYLTKASKEKDFDSAGFYLSLVMFEQLRV-------------- 577 Query: 1393 GTPLPEFYSEREGMKGWNLFQLMHRKARGDEVSNDKFGGKNYCAESNGNAADTLMLCASW 1452 + G + +HR AR + + + M A+ Sbjct: 578 -------------VFGDGFYHELHRDARH-----------TLVVDKDADKKHHFMTKAAQ 613 Query: 1453 VAQTDLSEFFKKWNPGANAYQLPGASEMSFEG 1484 + DL+E+F KW + + Sbjct: 614 LTGQDLTEYFTKWGLKPEDRTINEMKKQPKPK 645 >UniRef50_A5ZER3 Putative uncharacterized protein n=1 Tax=Bacteroides caccae ATCC 43185 RepID=A5ZER3_9BACE Length = 558 Score = 209 bits (532), Expect = 6e-52, Method: Composition-based stats. Identities = 81/479 (16%), Positives = 145/479 (30%), Gaps = 58/479 (12%) Query: 1063 QNVTETISLYSNPTKWFAGNMQSTGLWAPAQKEVTIKSNANVPVTVTVALADDLTGREKH 1122 T ++ TGL+ + + + + + + L D + Sbjct: 31 PYSDINSLAKQLKTSRYSPFENPTGLYFEEGETIQVTAPDLQGYQLNLLLVD---FSKPA 87 Query: 1123 EVALNRPPRVTKTYSLDASGTVKFKVPYGGLIYIKGNSSTNESA---SFTF-----TGVV 1174 E T ++L G KF P+ GL+Y+ A TF GV Sbjct: 88 EGEKK---EKTTVFTLKT-GNNKFYAPHKGLVYVSYYVKDCRKAPEQKLTFHTGINNGVF 143 Query: 1175 KAPFYKDGAWKNDLNSP-APLGELESDAFVYTTPKKNLNASNYTGGLEQFANDLDTFASS 1233 A + + WK L+S A + +++ T K L G+ + D Sbjct: 144 NAYQHTNDEWKRMLDSAIAEVIDMQGKYVHLTFDVKTLREKGSDCGV-EMIRMYDRIILW 202 Query: 1234 MNDFYGRDSEDGKHRMFTYKNLPGHKHRFTNDVQISIGDAHSGYPVMNSSFSPNSTTLPT 1293 + G D + H F G ++ ++ + + Sbjct: 203 QQEMLGIDQFG----------YRTNNHMFA--RISWAGPPNANGKGVSFPRTSSIIRPED 250 Query: 1294 TPLNDWLIWHEVGH-NAAETPLTVPGATEVANNVLALYMQDRYLGKMNRVADDITVAPEY 1352 ++W+I HE GH N L G TE+ NN+ A ++Q + Sbjct: 251 IRNSNWVIGHEFGHVNQVRPGLKWHGTTEITNNIQAAWIQYLLRPEGPF----------- 299 Query: 1353 LEESNNQAWARGGAGDRLLMYAQLKEWAEKNFDIKK---WYPDGTPLPEFYSEREGMK-- 1407 + A G G + +Y L W + +++ Y T YS+ + Sbjct: 300 ---RIEHSKAPDGTGQK--VYGGLFNWHFNHCVVQQKPLLYNPRTSFTPPYSDNKNPFVR 354 Query: 1408 ---GWNLFQLMHRKARGDEVSNDKFG----GKNYCAESNGNAADTLMLCASWVAQTDLSE 1460 W L G + N + G + A V Q DL++ Sbjct: 355 LCPFWQLQIYNALTNFGKPDFYARISEIVRRTNEQDLTVGELQLNFVKNACDVIQEDLTD 414 Query: 1461 FFKKWNPGANAYQLPGASEMSFEGGVSQSAYNTLASLDLPKPEQGPETINQVTEHKMSA 1519 FF + + G + +SQ + P+ I+ +T + + A Sbjct: 415 FFIRCGMLRSVDTEIGDYGGNRHLSISQKQVEEVIRYASRYPKPKSPVIHYITMNSVKA 473 >UniRef50_B8B8E6 Putative uncharacterized protein n=5 Tax=cellular organisms RepID=B8B8E6_ORYSI Length = 753 Score = 208 bits (530), Expect = 1e-51, Method: Composition-based stats. Identities = 83/174 (47%), Positives = 101/174 (58%), Gaps = 16/174 (9%) Query: 538 NVTRDTATFNLPFISLGQVGEGKLMVIGNPHYNSILRCPNGYSWNGGVNKDGQCTLNSDP 597 +VTRDTATFNLPFISLGQVGEGKLMVIGNPHYNSILRCPNGYSWNGGVNKDGQCTLNSDP Sbjct: 479 SVTRDTATFNLPFISLGQVGEGKLMVIGNPHYNSILRCPNGYSWNGGVNKDGQCTLNSDP 538 Query: 598 DDMKNFMENVLRYLSDDKWKPDAKASMTVGTNLDTVYFKRHGQVTGNSAAFDFHPDFAGI 657 DDMKNFMEN L++ D + + + V + G+ + A Sbjct: 539 DDMKNFMENTLKWEIVDMGSLNGTFVNSRAVHHPNVGSRHWGEPA----------ELADG 588 Query: 658 SVEHLSSYGDLDPQEMPLLILNGFEYVTQVGNDPYAIPLRADTSKPKLTQQDVT 711 + L + L Q + L +G + P+ S KL +D++ Sbjct: 589 DIITLGTSSKLSVQ----ISLQNQRVPAGIGMA--SDPMVGRRSGKKLAMEDIS 636 Score = 71.0 bits (172), Expect = 3e-10, Method: Composition-based stats. Identities = 16/110 (14%), Positives = 34/110 (30%), Gaps = 16/110 (14%) Query: 602 NFMENVLRYLSDDKWKPDAKASMTVGTNLDTVYFKRHGQVTGNSAAFDFHPDFAGISVEH 661 N+ L++ D + + + V + G+ + A + Sbjct: 188 NWNAKTLKWEIVDMGSLNGTFVNSRAVHHPNVGSRHWGEPA----------ELADGDIIT 237 Query: 662 LSSYGDLDPQEMPLLILNGFEYVTQVGNDPYAIPLRADTSKPKLTQQDVT 711 L + L Q + L +G + P+ S KL +D++ Sbjct: 238 LGTSSKLSVQ----ISLQNQRVPAGIGMA--SDPMVGRRSGKKLAMEDIS 281 >UniRef50_C6IEC9 Coagulation factor 5/8 type n=2 Tax=Bacteroides RepID=C6IEC9_9BACE Length = 705 Score = 208 bits (529), Expect = 1e-51, Method: Composition-based stats. Identities = 79/468 (16%), Positives = 156/468 (33%), Gaps = 55/468 (11%) Query: 1044 DLNIKVDVEKYPGAVSEEGQNVTETISLYSNPTKWFAG----------NMQSTGLWAPAQ 1093 D+++ +PG V N T+ ++ + STGL+A A Sbjct: 58 DVSMYESARVFPGLVDTLVDNTVNTLLALDLSKRYIPAYDLDVQQVPRPIYSTGLYAGAG 117 Query: 1094 KEVTIKSNANVPVTVTVALADDLTGREKHEVALNRPPRVTKTYSLDASGTVKFKVPYGGL 1153 + +TI N N + +TV + L R P VT + L G + P GG+ Sbjct: 118 ELITITINDNT-MGLTVIIGSHLDDLTDIS-PYLRLPVVTTSKQL-FPGKNTIRNPLGGM 174 Query: 1154 IYIKGNSSTNESASFT--FTGVVKAPFYKDGA-----W-KNDLNSPAPLGELESDAFVYT 1205 I+I+ + N SA F G ++P + G+ W + + P EL ++ Sbjct: 175 IWIEKSKDVNGSADFVMEINGAYRSPDFIVGSTDVTAWVEQLRTTTVPWLELRGRHVAFS 234 Query: 1206 TPKKNLNASNYTGGL--EQFANDLDTFASSMNDFYGRDSEDGKHRMFTYKNLPGHKHRFT 1263 ++ L + E+ N L+ + +++ +Y P R Sbjct: 235 VQRERLLDMINDDPIIAEKMPNTLEAWDNAVETYYYNYYSLQVGAQDFSMRAPDFPERVV 294 Query: 1264 NDV----QISIGDAHSGYPVMNSSFS-PNSTTLPTTPL-NDWLIWHEVGHNAAETPLTVP 1317 DV + I +A G +N+++ + T N I++ + N + + P Sbjct: 295 LDVELLDNLYIRNADYGVVALNTNYLLNELASYQTLKSGNSVAIFNALYRNYSFRDIKSP 354 Query: 1318 GATEVANNVLALYMQDRYLGKMNRVADDIT-VAPE---YLEESNNQAWARGGAGDRLLMY 1373 +EV++ V A+ + + + + PE + E +A A Sbjct: 355 WWSEVSDAVKAIPLYRMAEKGLREDGYPMGPIFPEEGSSIAEQFPKALAYADTDSSRWFV 414 Query: 1374 AQLKEWAEKNFDIKKWYPDGTPLPEFYSEREGMKGWNLFQLMHRKARGDEVSNDKFGGKN 1433 + +K +++ Y + + + + W + ++R + + Sbjct: 415 SDIKS------EVRPTYALASLVQLANYKDDD---WAFYIELNRMIKDKISIDHSTSTY- 464 Query: 1434 YCAESNGNAADTLMLCASWVAQTDLSEFFKKWNPGANAYQLPGASEMS 1481 + D S FF+ W AS+ Sbjct: 465 ------------FFKALCDYFKEDFSPFFEHWGYSLTDEARSYASKYP 500 >UniRef50_Q4ZNJ4 Putative uncharacterized protein n=1 Tax=Pseudomonas syringae pv. syringae B728a RepID=Q4ZNJ4_PSEU2 Length = 824 Score = 208 bits (528), Expect = 2e-51, Method: Composition-based stats. Identities = 74/461 (16%), Positives = 144/461 (31%), Gaps = 74/461 (16%) Query: 1065 VTETISLYSNPTKWFAGNMQSTGLWAPAQKEVTIKSNANVPVTVTVALADDLTGREKHEV 1124 + + + Q T ++ ++ I + + V+ + Sbjct: 29 GSAESHRERHARARAHTDFQPTNIYVTKGDQLEITATSLYMNRVSAVIGVPELDTP---- 84 Query: 1125 ALNRPPRVTKTYSLDASGTVKFKVPYGGLIYIKGNSSTNESASFTFTGVVKA-PFYKDG- 1182 +Y L G GL+ TG PF++ Sbjct: 85 ---------TSYPLQ-RGLNVLVATNTGLLGFTNLDPLGHVV-LDITGQYNHVPFFRMDM 133 Query: 1183 ---AWKNDLN--SPAPLGELESDAFVYTTPKKNLNASNYTGGLEQFANDLDTFASSMNDF 1237 W+ + S AP+ L S + ++ Y + + D S+ + Sbjct: 134 TNLEWEQQMAQYSNAPVVLLTSPRAIVVVRYQSAQ--LYLTDPAKLMGNFDDAISAQDGI 191 Query: 1238 YGRDSEDGKHRMFTYKNLPGHKHRFTNDVQISIGDAHSGYPVMNSSFSPNSTTL-PTTPL 1296 G + T L KH + ++ + M + S L T P Sbjct: 192 SGV-----INYSTTEWALDPSKHFYVEADRL---YMFAMDGHMGFNGSAALARLLSTAPE 243 Query: 1297 NDWLIWHEVGHNAAETPLTVP---GATEVANNVLALYMQDRYLGKMNRVADDITVAPEYL 1353 + W WHE GH +P+T G TEV N+ ++ Q+ +LG+ EYL Sbjct: 244 DGWGPWHESGHQRQLSPMTWGTGTGMTEVTVNLYSMAAQEVFLGRATGADSSYAPMKEYL 303 Query: 1354 EESNNQAWARGGAGDRLLMYAQLKEWAEKNFDIKKWYPDGTPLPEFYSEREGMKGWNLFQ 1413 S + +G +L+M QL+ G + + Sbjct: 304 ASSLREYDNIKDSGHKLVMLWQLRL---------------------------SFGTSFYP 336 Query: 1414 LMHRKARGDEVSNDKFGGKNYCAESNGNAADTLMLCASWVAQTDLSEFFKKWNPGANAYQ 1473 +H++ R + + A ++ S ++ +L+EFF +W N Sbjct: 337 QLHQRYRLMHNP----------PTVSDDKAQRFIVETSLLSHVNLAEFFDRWGLYPNPET 386 Query: 1474 LPGASEMS-FEGGVSQSAYNTLASLDLPKPEQGPETINQVT 1513 L +++ + ++ +T + LP PE + ++ Sbjct: 387 LNQIADLPALTLAIWETDADTTIPIPLPLSTYIPELAHILS 427 >UniRef50_C5PJT0 Lipoprotein n=2 Tax=Sphingobacterium spiritivorum RepID=C5PJT0_9SPHI Length = 601 Score = 205 bits (520), Expect = 2e-50, Method: Composition-based stats. Identities = 61/488 (12%), Positives = 143/488 (29%), Gaps = 90/488 (18%) Query: 1071 LYSNPTKWFAGNMQSTGLWAPAQK--EVTIKSNANVPVTVTVALADDLTGREKHEVALNR 1128 T ++ TG++ + ++ +++ V DD + + + L + Sbjct: 82 AKKLKTSPYSQYENPTGIYFSEGDSAVLWVEKTNATQLSLRVTNWDDEEFK-QKDYPLTQ 140 Query: 1129 PPRVTKTYSLDASGTVKFKVPYGGLIYIKGNSSTNESASFTF-----TGVVKAPFYKDGA 1183 + ++ G + Y + + N G+ + + Sbjct: 141 G---YNAFKIENKGNSYIQ-------YFTPDKAGNNKVKIHILSGKVNGIFDISKHTNED 190 Query: 1184 WKNDL-NSPAPLGELESDAFVYTTPKKNLNASNYTGGLEQFANDLDTFASSMNDFYGRDS 1242 W N L N+ AP+ ++ ++L + G+ + D+ ++ G Sbjct: 191 WDNLLANATAPVLDIVGKQVQLAYAVQSLQTNAAHQGV-ELVRLYDSIIGIQHELMGLKQ 249 Query: 1243 EDGKHRMFTYKNLPGHKHRFTNDVQISIGDAHSGY--PVMNSSFSPNSTTLPTTPLNDWL 1300 + + I G H+ +++ + + N W Sbjct: 250 TNRIPKNR------------MFGRVIWKGFMHADGIGAAFHNNTMKDVANVAGLRKNSWG 297 Query: 1301 IWHEVGH-NAAETPLTVPGATEVANNVLALYMQDRYLGKMNRVADDITVAPE-------- 1351 + HE GH N + G TEV NN+ +++ Q Y ++ + + Sbjct: 298 VAHEFGHVNQVRPNMKWVGTTEVTNNIYSVWTQYIYNQNQPKLEREKLKDYDEPKIGGRI 357 Query: 1352 -------------YLEESNNQAWAR-------GGAGDRLLMYAQLKEWAEKNFDIKKWYP 1391 +L ++ W R G +L+ QL+ + + W Sbjct: 358 TSYMESAFIHRQPWLTQAGPDRWDRERPRDWGGDHFVKLVPLWQLQLYFNVAGEGNTWEN 417 Query: 1392 DGTPLPEFYSEREGMKGWNLFQLMHRKARGDEVSNDKFGGKNYCAESNGNAADTLMLCAS 1451 N + + KA + DK + A Sbjct: 418 K-----------------NFYGDIFTKAINAPTTKDKPDAYYQLE---------FIKNAC 451 Query: 1452 WVAQTDLSEFFKKWNPGANAYQLPGASEMSFEGGVSQSAYNTLASLDLPKPEQGPETINQ 1511 + DL++FF++ L + ++ + + + P+ ++ Sbjct: 452 DAGKLDLTDFFEQSGLLIPID-LWVDDYTCAQMTITPNDIQQVKTYAAKYPKPNTSVLHY 510 Query: 1512 VTEHKMSA 1519 +T + + A Sbjct: 511 ITANSVQA 518 >UniRef50_UPI0001BC7CE9 hypothetical protein BacD2_03774 n=1 Tax=Bacteroides sp. D2 RepID=UPI0001BC7CE9 Length = 814 Score = 204 bits (519), Expect = 2e-50, Method: Composition-based stats. Identities = 84/500 (16%), Positives = 160/500 (32%), Gaps = 87/500 (17%) Query: 1044 DLNIKVDVEKYPGAVSE------EGQNVTETISLYSNPT-----KWFAGNMQSTGLWAPA 1092 D+++ +PG V + Q V ++ S + + STGL+A A Sbjct: 54 DVSLYDKARIFPGLVDTLTEKRVDEQVVNIDMAYRSAKAVNVNLAMTSPAIYSTGLYAGA 113 Query: 1093 QKEVTIKSNANV-PVTVTVALADDLTGREKHEVALNRPPRVTKTYSLDASGTVKFKVPYG 1151 +++T+ + +V +TV + + L R P+V + +L G + + PYG Sbjct: 114 GEKITVMLDDDVKGLTVQIGIHSRDLSSLVGSSYLERDPKVVTSMAL-FKGKNEIRNPYG 172 Query: 1152 GLIYIKGNSSTNES--ASFTFTGVVKAPFYKDGA-----W-KNDLNSPAPLGELESDAFV 1203 G I+IK + +++ G AP Y G W + + P EL Sbjct: 173 GYIWIKRSGDASDTGIVPLKVQGAYLAPDYVVGETEAAEWGEKIKTTTVPWIELRGKQIA 232 Query: 1204 YTTPKKNLN------ASNYTGGLEQFANDLDTFASSMNDFYGRDSEDGKHRMFTYKNLPG 1257 ++ P K + ++ LEQ D + N+FYG D + + P Sbjct: 233 FSVPVKYMKLKLQSEGQSFVTRLEQSLELWDDWVLCYNEFYGLDDAESE-----TFPKPD 287 Query: 1258 HKHRFTNDVQ-ISIGDAHSGYP------------VMNSSFSPNSTTLPTTPLNDWLIWHE 1304 R D ++ ++ ++ + L T+ + W+ Sbjct: 288 FPVRVVMDAHLVTERYSYYSNTNLELLQTEELIDMIADPEQVKAGALNTSHVVGWMS--- 344 Query: 1305 VGHNAAE-----TPLTVPGATEVANNVLALYMQDRYLGKMNRVADDITVAPEYLEESNNQ 1359 +G P + + N LY + + D + L + + Sbjct: 345 LGLFVQTYWPTPAPNSFKDMYSLMPNFYFLYKHGWWGNQ-----QDAKLFAYKLFGRDQK 399 Query: 1360 AWARGGAGDRLLMYAQLKEWAEKNFDIKKWYPDGTPLPE-------------------FY 1400 + L WA+ D K Y D P + Sbjct: 400 VINTTQYNLNADEFENLVSWAKA--DSCKIYSDEAKRPSKSGNDYWPAALTFYSAILSYK 457 Query: 1401 SEREGMKGWNLFQLMHRKARGDEVSNDKFGGKNYCAESNGNAADTLMLCASWVAQTDLSE 1460 E G GW F ++R G+N + + ++ ++ C S + D + Sbjct: 458 QEDTGKDGWKYFAYLNRFLSN--------EGQNVSIFNRLSMSEAMLTCLSHYFERDFTP 509 Query: 1461 FFKKWNPGANAYQLPGASEM 1480 F ++ + A + Sbjct: 510 LFDRYGIEISDKMRAEALQY 529 >UniRef50_A9VC57 Predicted protein n=1 Tax=Monosiga brevicollis RepID=A9VC57_MONBE Length = 1025 Score = 204 bits (518), Expect = 2e-50, Method: Composition-based stats. Identities = 70/370 (18%), Positives = 112/370 (30%), Gaps = 44/370 (11%) Query: 996 SADLKKSLVDNNMIYGDGSSKAGMMNPSYPLNYMEKPLTRLMLGRSWWDLNIKV-----D 1050 L SL + + ++ S + + + L L+L +SW L D Sbjct: 652 PLHLAPSLPTTDPTVAITAQVETILEASPQQSQLRRGLEALLLSQSWQQLPCHAVPCSFD 711 Query: 1051 VEKYPGAVSEEGQNVTETISLYSNPTKWFAGNMQSTGLWAPAQKEVTIKSNANVPVTVTV 1110 ++PG + +A W + + N V++ Sbjct: 712 AAQFPGCTKPTSASTDVLHEGLVIEPNGWAFLRA----WVNGGEPFRVLVNDVAADAVSL 767 Query: 1111 ALADDLTGREKHEVALNRPPRVTKTYSLDASGTVKFKVPYGGLIYIKGNSSTNESASFTF 1170 + R P ++ + + T P+GGL+Y++ + +A Sbjct: 768 RIGCHADELWHCSGTWKRGPAISGVWPVTPDVTRSLAHPWGGLLYLQNRTDAPLTARVYL 827 Query: 1171 TGVVKAPFY-----------KDGAWKNDLNSPAPLGELESDAFVYTTPKKNLNASNYTGG 1219 V KA + S P EL + T P + Sbjct: 828 EQVHKASCVEMHPATATVASVSADLQALGTSEVPWCELAGANIILTVPTPAAQRLGH--R 885 Query: 1220 LEQFANDLDTFASSMNDFYGRDSEDGKHRMFTYKNLPGHKHRFTNDVQISIGDAHSGYPV 1279 + + D + D + R DVQI+ G HSGYPV Sbjct: 886 IPRLLRFWDDVLRAHRDLRRLGPL-------------VRRERIVFDVQIAAGYMHSGYPV 932 Query: 1280 MNS--------SFSPNSTTLPT-TPLNDWLIWHEVGHNAAETPLTVPGATEVANNVLALY 1330 M P + L +W I+HE GHN E+ T GA EV N+ L Sbjct: 933 MAHLDQIEGAVDGGPTALQLERLLREGNWGIFHEFGHNLQESVWTFEGAGEVTVNLFTLN 992 Query: 1331 MQDRYLGKMN 1340 DR +G Sbjct: 993 AMDRVVGLTP 1002 >UniRef50_B2UPI7 Putative uncharacterized protein n=3 Tax=Bacteria RepID=B2UPI7_AKKM8 Length = 506 Score = 202 bits (514), Expect = 8e-50, Method: Composition-based stats. Identities = 72/518 (13%), Positives = 141/518 (27%), Gaps = 96/518 (18%) Query: 1027 NYMEKPLTRLMLGRSWWDLNIKVDVEKYPGAVSEEGQNVTETISLYSNPTKWFAGNMQST 1086 + M++ T+++ G D + + P T F+ T Sbjct: 55 DAMKELATKILAGHYKPDY-LYAEYRALP------SPRQTGK---NLRIGDGFSKYDNMT 104 Query: 1087 GLWAPAQKEVTIKSNANVPVTVTVALAD---------------DLTGREKHEVALNRPPR 1131 G++ + V + +++ L + + G K ++ L Sbjct: 105 GVYLEKGRHV-VLVGKTEGQEISLLLPNLMRKPAEGVQPTKDPNGWGLHKKQIPLKEGIN 163 Query: 1132 VTKTYSLDASGTVKFKVPYGGLIYIKGNSSTNESASFTFTGVVKAPFY------KDGAWK 1185 + ++ Y ++ F ++ + W Sbjct: 164 II---DVETPANAYIS-------YFTEDAGKAPKIPVHFVTGKANGYFDTTRGDTNKDWV 213 Query: 1186 NDLNSPA-PLGELESDAFVYTTPKKNLNASNYTGGLEQFANDLDTFASSMNDFYGRDSED 1244 L+ P+ + P + L G + N D G D Sbjct: 214 RLLDQAVSPIMDARGKYIQVAYPVEFLKKFTKDRG-TELINAYDKLIGIQYQLMGLDKYG 272 Query: 1245 GKHRMFTYKNLPGHKHRFTNDVQISIGDAHSGYPVMNSSFSPNSTTLPTTPLN--DWLIW 1302 + + + F + G A+ G N T W Sbjct: 273 KIPENRVLARVNFNYYMF----RDGDGVAYLG----NDGTMRMVTDPENVLKGDACWGFS 324 Query: 1303 HEVGHNAAETPLTVPGATEVANNVLAL--YMQDRYLGKMNRVADDITVAPEYLEESNNQA 1360 HEVGH P+T G TEV+NN+ +L + ++ R E +E Sbjct: 325 HEVGHVMQMRPMTWGGMTEVSNNIFSLQAAAKTGNESRLKRQGSYDKARKEIIEGEI-AY 383 Query: 1361 WARGGAGDRLLMYAQLKEWAEKNFDIKKWYPDGTPLPEFYSEREGMKGWNLFQLMHRKAR 1420 ++L+ QL + KN + + + R Sbjct: 384 LQSKDVFNKLVPLWQLHLYFTKNGHP-----------------------DFYPDVMEYLR 420 Query: 1421 GDEVSNDKFGGKNYCAESNGNAADT-LMLCASWVAQTDLSEFFKKWNPGANAYQLPGASE 1479 + + ++ + V +TDL++FF+KW G Sbjct: 421 NNAGNYGG---------NDTVKYQFEFVKACCDVTKTDLTDFFEKWGFFKPGKFHIGDYA 471 Query: 1480 ---MSFEGGVSQSAYNTLASLDLPKPEQGPETINQVTE 1514 + + + +A PKPE I +++E Sbjct: 472 QYDFNVTPEMVEETKKWIAGKGYPKPETD---ITELSE 506 >UniRef50_A3FQI6 Putative uncharacterized protein n=3 Tax=Cryptosporidium RepID=A3FQI6_CRYPV Length = 1025 Score = 202 bits (514), Expect = 8e-50, Method: Composition-based stats. Identities = 86/461 (18%), Positives = 147/461 (31%), Gaps = 106/461 (22%) Query: 1079 FAGNMQSTGLWAP-----AQKEVTIKSNANVPVTVTVALADDLTG-REKHEVALNRPPRV 1132 Q+TG++ + + I + P+ + + R K E + R P + Sbjct: 529 IPFEWQNTGVYIKHKSFLKAEFIPINKYSLTPIKSMLHVGSHSDFLRYKEEKIMRRVPLI 588 Query: 1133 TKTYSLDASGTVKFK--VPYGGLIYIK----------------GNSSTNESASFTF---- 1170 K Y+ S T P G++Y + NS+ + + F Sbjct: 589 KKIYNWSISETNYITIESPCEGIVYFEYIPNYDHTNTKGSLKRPNSNESHGSLIGFIKFI 648 Query: 1171 ----TGVVKAPFYKDGA-------------------WKNDLNSPA--------PLGELES 1199 VV P Y W + EL Sbjct: 649 PLIGKEVVTTPIYTIDKLQLDTCSFDNRTVITDKHLWARIFSEANRKFSNLLPAWTELHG 708 Query: 1200 DAFVYTTPKKNLNASNYTGGLEQFANDLDTFASSMNDFYGRDSEDGKHRMFTYKNLPGHK 1259 + T P + L + + ++ D + N+ Y N K Sbjct: 709 KKIILTIPSRILYSIDCL-IVDDLLEFWDKVIDTQNELY--------------FNYKWTK 753 Query: 1260 HRFTNDVQISIGDAHSGYPVMNSSF-------SPNSTTLPTTP-LNDWLIWHEVGHNAAE 1311 R D+QIS G HSGYP+ S L T +W I+HE+GHN Sbjct: 754 ERIVCDIQISDGYMHSGYPIATHLDIVEEQINSGGILDLKTLKDEGNWGIYHEIGHNRQS 813 Query: 1312 TPLTVPGATEVANNVLALYMQDRYLGK-----MNRVADDITVAPEYLEESNNQAWARGGA 1366 + T G EV N+ +Y + + ++ + D I +A EYL + N+++ + Sbjct: 814 SYWTFNGTEEVTVNLFTMYSYYKLHPRLYPFNISYIKDQINLAVEYLIDMNSESKSP--- 870 Query: 1367 GDRLLMYAQLKEWAEKNFDIKKWYPDGTPLPEFYSEREGMKGWNLFQLMHRKARGDEVSN 1426 E E+ F +KW + Y + GWN + + + + Sbjct: 871 -----------ENMERGFR-EKWMANHGIAFCNYLILINLFGWNTLKTVFQVYDQLSEVD 918 Query: 1427 DKFGGKNYCAESNGNAADTLMLCASWVAQTDLSEFFKKWNP 1467 ++ + SN ++ S V +L FF +W Sbjct: 919 EELVQTH----SNQQKMAMWIIIFSSVVGFNLKYFFTQWGW 955 >UniRef50_A5ZED4 Putative uncharacterized protein n=1 Tax=Bacteroides caccae ATCC 43185 RepID=A5ZED4_9BACE Length = 800 Score = 201 bits (511), Expect = 2e-49, Method: Composition-based stats. Identities = 73/489 (14%), Positives = 151/489 (30%), Gaps = 66/489 (13%) Query: 1063 QNVTETISLYSNPTKWFAGNMQSTGLWAPAQKEVTIKSNANVPVTVTVALADDLTGREKH 1122 + +N T ++ TG++A A + + I + L DL G + Sbjct: 345 PYQNPAVMATANKTSKYSLRDNPTGIYAKAGETLAIFVDDIYEGGRISMLIQDLNGGYNN 404 Query: 1123 EVALNRPPRVTKTYSLDASGTVKFKVPYGGLIYIKGNSS------------------TNE 1164 +KTY L + G + V GGLIYI + + + Sbjct: 405 ----------SKTYEL-SEGYNEITVEVGGLIYILNHVNDDIPLRHEDADNDQKRNIEAK 453 Query: 1165 SASFTFTGVVKAPFY-----KDGAWKNDLNSPAPLGELE--SDAFVYTTPKKNLNASNYT 1217 + F ++ K+ W ++ A E++ + T + Y Sbjct: 454 TVKVHFANGKVNGYFDIQKNKESDWAQIRDN-AKYQEIDILGEYSHLTWRISDFKK--YN 510 Query: 1218 GGLEQFANDLDTFASSMNDFYGRDSEDGKHRMFTYKNLPGHKHRFTNDVQISIGDAHSGY 1277 + + +LD +F G + ++ F+ D + +A Sbjct: 511 TEITKTIENLDRLVYLEEEFMGLVKYGK---------MFNNRMHFSIDYKAKSPNASDYR 561 Query: 1278 PVMNSS--FSPNSTTLPTTPLNDWLIWHEVGH-NAAETPLTVPGATEVANNVLALYMQDR 1334 V N+S ++ P W HEVGH N L G TEV NN+++L++Q Sbjct: 562 TVYNASDYYAEPFCKPENFPTRCWGPAHEVGHCNQTRPGLKWAGLTEVTNNIMSLFIQTS 621 Query: 1335 YLGKMNRVADDITVAPEYLEESNNQAWARGGAGDRLLMYAQLKEWAEKNFDIKKWYPDGT 1394 + G+ ++ D + +++ L++ + +I + Sbjct: 622 F-GRPCKLLVDGCTLKDENDQTLGTYNNIYQGATSLIVDGKRPHCLPGIANITRETQLVP 680 Query: 1395 PLPEFYSEREGMKGWNLFQLMHRKARGDEVSNDKFGGKNYCAESNGNAADT-LMLCASWV 1453 + ++ + + ++ R E +DK N + + Sbjct: 681 FWQLKLYMIDVLEKTDFYHKLYEYFRTHESPSDKGE--------NQGMNQLDFVRQVCDI 732 Query: 1454 AQTDLSEFFKKWNPGANAYQLPGASEMSFEGGVSQSAYNTLAS----LDLPKPEQGPETI 1509 + ++ +FF+KW + +++ L P I Sbjct: 733 SGLNMLDFFEKWGFLYPVKTTLNDY-GNKAFEITEEQIELLKEEINGKGYEMPHPNVHQI 791 Query: 1510 NQVTEHKMS 1518 ++ Sbjct: 792 TEINLDDYK 800 >UniRef50_C7MAR4 Putative uncharacterized protein n=1 Tax=Brachybacterium faecium DSM 4810 RepID=C7MAR4_BRAFD Length = 464 Score = 200 bits (507), Expect = 5e-49, Method: Composition-based stats. Identities = 84/446 (18%), Positives = 141/446 (31%), Gaps = 69/446 (15%) Query: 1079 FAGNMQSTGLWAPAQKEVTIKSN---ANVPVTVTVALADDLTGREKHEVALNRPPRVTKT 1135 ++Q TGL+ P ++ ++ +VP V A L + E + Sbjct: 68 STTDLQPTGLYLPPDVQLGVEVEADGDDVPSIVIGAPGSQLDPDAEDEDDFVGRTLTPRE 127 Query: 1136 YSLDASGTVKFKVPYGGLIYIK-GNSSTNESASFTFTGVVKAPF--YKDG-----AWKND 1187 L+ G +GG+IY+ + +A+ +F G P + G ++ Sbjct: 128 TELE-PGANLVSDAHGGVIYLSFPGADGAATATVSF-GAAAEPMATFVRGTTAEAEFQAQ 185 Query: 1188 LN-SPAPLGELESDAFVYTTPKKNLNASNYTGGLEQFANDLDTFASSMNDFYGRDSEDGK 1246 L+ S AP EL S + T + L E+ LD + G +E G Sbjct: 186 LDRSSAPFAELVSTHAIVTVERDRLLLF-RHEDHEKLLGILDRVVEIEQETAGYTTEGG- 243 Query: 1247 HRMFTYKNLPGHKHRFT--NDVQISIG-DAHSGYPVMNSSFSPNSTTLPTTPLNDWLIWH 1303 P H D +G A +GY T+ L+ W +H Sbjct: 244 ----VESRPPSGPHHLVGYPDGIEGVGAYATNGYTAYPPPIQSTLLTVSGLTLDGWGPYH 299 Query: 1304 EVGHNAAETPLTVPGATEVANNVLALYMQDRYLGKMNRVADDITVAPEYLEESNNQAWAR 1363 E+GH+ + P+ EV N+ +L +Q + + +V V PE + A Sbjct: 300 ELGHHHQQEPVNPGDVVEVTVNIFSLAVQREFEREYGQVPRMREVDPETGTSHWDTAMEA 359 Query: 1364 GGAGDRLLMYAQLKEWAEKNFDIKKWYPDGTPLPEFYSEREGMKGWNLFQLMHRKARGDE 1423 AG R D + P P + + G R E Sbjct: 360 LEAGIR---------------DYAELGPFEQLAP--FDQLRLQYGDEFAPAWATLVRQQE 402 Query: 1424 VSNDKFGGKNYCAESNGNAADTLMLCASWVAQTDLSEFFKKWNPGANAYQLPGASEMSFE 1483 + + ++ S A DL++F++ W Sbjct: 403 ------------PLTEEDRWQNVIYSTSAAAGDDLADFWQAWGV---------------- 434 Query: 1484 GGVSQSAYNTLASLDLPKPEQGPETI 1509 V+ + L L L P P T+ Sbjct: 435 -EVADETRSALGQLGLEPPVVDPSTL 459 >UniRef50_UPI000197B0FB hypothetical protein BACCOPRO_00998 n=1 Tax=Bacteroides coprophilus DSM 18228 RepID=UPI000197B0FB Length = 567 Score = 199 bits (505), Expect = 8e-49, Method: Composition-based stats. Identities = 70/467 (14%), Positives = 145/467 (31%), Gaps = 48/467 (10%) Query: 1078 WFAGNMQSTGLWAPAQKEVTIKSNANVPVTVTVALADDLT---GREKHEVALNRPPRVTK 1134 ++ TG+ K + + + V +AD G E + R Sbjct: 118 GYSTYQHITGIVLKPGKHIIVVDGLKEGSKLGVKVADLYAPNQGDEDWSLHFER------ 171 Query: 1135 TYSLDAS-GTVKFKVPYGGLIYIKG---NSSTNESASFTF-----TGVVKAPFYKDGAWK 1185 + L ++ + GL Y+ N + F G A + W Sbjct: 172 -FELKNGINVIEKTSEWTGLAYMDYYFDNPEKENTVKVHFITGEVNGYFDASVNTNEDWD 230 Query: 1186 NDLNSPA-PLGELESDAFVYTTPKKNLNASNYTGGLEQFANDLDTFASSMNDFYGRDSED 1244 L + P+ + P ++L G ++ + + S ++ G + Sbjct: 231 RMLANAVYPVFDATGSNIHLAYPVEDLKKYA-PGQGKELIDVYEQLVSKQHEIIGWKKYN 289 Query: 1245 --GKHRMFTYKNLPGHKHRFTNDVQISIGDAHSGYPVMNSSFSPNSTTLPTTPLN-DWLI 1301 +++F N + R + V + + T + W Sbjct: 290 HITNNKIFARVNYGYYMFRDGDGVAFKFDTM---------KRVADPVHMRTKDEDACWGF 340 Query: 1302 WHEVGHNAA-ETPLTVPGATEVANNVLALYMQDRYLGKMNRVADDITVAPEYLEESNNQA 1360 HEVGH +T L+ G E +NN+ Y + K + +LE+S Sbjct: 341 SHEVGHVHQLQTYLSWGGLGETSNNICTRYCTQAFGYKNRLSSAFANAEKSFLEDSKAGT 400 Query: 1361 WARG--GAGDRLLMYAQLKEWAEKNFDIKKWYPDGTPLPEFYSEREGMK-GW-NLFQLMH 1416 + G + + K +K + +P + + + G+ + + ++ Sbjct: 401 VSPSRRAGGMNDSIISSCKVNPDKAISYLETDVFERLVPFWKLQCYFTQNGYPDFYPDLY 460 Query: 1417 RKARGDEVSNDKFGGKNYCAESNGNAADTLMLCASWVAQTDLSEFFKKWNPGANAYQLPG 1476 K R E + + + + + + AS +A +L +F+K+ +L Sbjct: 461 EKMRNSEKEHPELKDLDR-SSNVVPFQLNFIRGASLLAGKNLYPYFEKFGF-FRILKLSY 518 Query: 1477 ASEMSFEGGVSQSAYNT-------LASLDLPKPEQGPETINQVTEHK 1516 + ++ + L L KP PE +N + K Sbjct: 519 GDYGDYNYEMTTEMRDRFKKEMEELEKQQLIKP-LTPEELNALIYAK 564 >UniRef50_A5ZFW6 Putative uncharacterized protein n=1 Tax=Bacteroides caccae ATCC 43185 RepID=A5ZFW6_9BACE Length = 951 Score = 196 bits (498), Expect = 6e-48, Method: Composition-based stats. Identities = 86/559 (15%), Positives = 169/559 (30%), Gaps = 63/559 (11%) Query: 986 NDAYAGGTKCSADLKKSLVDNNMIYGDGSSKAGMMNPSYPLNYMEKPLTRL------MLG 1039 N+A G C + + + G A + + ++ + L M+ Sbjct: 344 NEAVGGVVSCGE--MEFYRNAAPVAGLTEVFADELCSELRADVDQRKIDGLENSFFRMIA 401 Query: 1040 RSWWDLNIKVDVEKYPGAVSEEGQNVTETISLYSNPTKWFAGNMQSTGLWAPAQKEVTIK 1099 +S +D ++ V T + TG++ +E + Sbjct: 402 QSLYDKTYDLEYR-----VQTYEPYREINDLAAEMKTSGYNPFENPTGIYFKDGEEAVVI 456 Query: 1100 SNANVPVTVTVALADDLTGREKHEVALNRPPRVTKTYSLDASGTVKFKVPYGGLIYIK-- 1157 V + + D R+ R P T +Y L + G K ++ +GGL YI+ Sbjct: 457 LGNTNGEQVNLKVYDFDAIRQG-----QRTPDPT-SYPL-SEGINKLRIAHGGLSYIEYY 509 Query: 1158 -GNSSTNESASFTFTGVVKAPFY-----KDGAWKNDLN-SPAPLGELESDAFVYTTPKKN 1210 N T + +Y W+ LN + +++ D + Sbjct: 510 TPNWKTAPALKLHIASGKVNGYYDKHRDVSADWREILNKATYGCIDIKGDRVNLVFGVNS 569 Query: 1211 LNASNYTGGLEQFANDLDTFASSMNDFYGRDSEDGKHRMFTYKNLPGHKHRFTNDVQISI 1270 + Y L + + D ++ G D + + + R T D + Sbjct: 570 IK--TYCDNLGKLIQNYDDIVELEHELMGLDKWGRRPKNHMFA-------RVTKDGLFAD 620 Query: 1271 GDAHSGYPVMNSSFSPNSTTLPTTPLNDWLIWHEVGH-NAAETPLTVPGATEVANNVLAL 1329 G G + ++T + W I HE GH N L TEV NNV ++ Sbjct: 621 GW---GAGWYEGCMNELASTTKSLREGVWAIAHEFGHVNQIRPGLKWVSTTEVTNNVYSV 677 Query: 1330 YMQDRYLGKMNRVADDITVAPEYLEESNNQAWARGGAGDRLLMYAQLKEWAEKNFDIKKW 1389 + ++ + + + + + G L + + N D K+ Sbjct: 678 CARYKFYRENMPLEHERCNDGNDNNVRGGRFNSYLNYGIIKGEQW-LCQKGQDNMDPSKY 736 Query: 1390 YPDG---------TPLPEFYSEREGMKGWNLFQLMHRKARGDEVSNDKFGGKNYCAESNG 1440 G L +Y E G + + + + R + S +NG Sbjct: 737 PYGGDHFVKLCPLWQLLLYYREIVGGEKRDWYGDVAEIVRNTDESQL----------TNG 786 Query: 1441 NAADTLMLCASWVAQTDLSEFFKKWNPGANAYQLPGASEMSFEGGVSQSAYNTLASLDLP 1500 M V + DL++FF K + + ++Q+ + L Sbjct: 787 QLQLNFMRNTMDVVKEDLTDFFIKAGMLKPIDKELDDYARG-QMTITQTDCDELVKYASK 845 Query: 1501 KPEQGPETINQVTEHKMSA 1519 + + ++ + A Sbjct: 846 YSKPATPVLYYLSANSQKA 864 >UniRef50_A2F4J7 Immuno-dominant variable surface antigen-like n=1 Tax=Trichomonas vaginalis RepID=A2F4J7_TRIVA Length = 504 Score = 193 bits (491), Expect = 4e-47, Method: Composition-based stats. Identities = 78/490 (15%), Positives = 147/490 (30%), Gaps = 79/490 (16%) Query: 1048 KVDVEKYPGAVS--EEGQNVTETISLYSNPTKWFAGNMQSTGLWAPAQKEVTIKSNANVP 1105 ++++ G + + + T + T ++AP + +TI+ N Sbjct: 27 PAGIKQFYGDEQLINNAPRYKIRVYINTRYT-----DRHWTAIYAPPGELITIEVPPNAV 81 Query: 1106 VTVTVALADDLTGREKHEVALNRPPRVTKTYSLDASGTVKFKVPYGGLIYIKGNSST-NE 1164 + V + + + R K+ + P+GG I I N+ Sbjct: 82 GKIGVCFNHHINDDSGYRTRM----RTLKSSTTLNREVNNVSWPFGGAITITSGIDRFNQ 137 Query: 1165 SASFTFTGVVKAPFY-----KDGAWKNDLNS-PAPLGELESDAFVYTTPKKNLNASNYTG 1218 T TG ++ P++ D W+ +L+ P PL ++ + P + + Sbjct: 138 GLEVTITGGIRMPYFRYGYTTDQEWEEELSLLPGPLANIDMGNAIAQLPSRQIRGKVKLN 197 Query: 1219 GLEQFANDLDTFASSMNDFYGRDSEDGKHRMFTYKNLPGHKHRFTNDVQISIGDAHSGYP 1278 F S+N+ G D H ++T D + G+A + Y Sbjct: 198 DACAFWRSASRNLFSVNEINGGPRRDDG--------RIKHPTQWTFDTYVPFGEAATAYG 249 Query: 1279 ----VMNSSFSPNSTTLPTTPLNDWLIWHEVGHNAAETP-L-TVPGATEVANNVLALYMQ 1332 + +S + W HE GHN + EV NNVL L Sbjct: 250 GNRIIFPPHWSEGIVNYDSAKWGCWGQLHEYGHNFQYGWGWPSFRDYIEVTNNVLNLISY 309 Query: 1333 DRYLGKMNRVADDITVAPEYLEESNNQAWARGGAGDRLLMYAQLKEWAEKNFDIKKWYPD 1392 + M +T+ Y+ N GG G+ ++ ++ Sbjct: 310 SKLT--MVDSRRQMTMGRNYIVGPQN-----GGHGETTHQFSLIQ--------------- 347 Query: 1393 GTPLPEFYSEREGMKGWNLFQLMHRKARGDEVSNDKFGGKNYCAESNGNAADTLMLCASW 1452 L FY+ G + F+ R + + +L A Sbjct: 348 RNDLFSFYANFIYFFGTDKFRKFLHAHR----------WDKLYSRNEFPYISQFLLTACD 397 Query: 1453 VAQTDLSEFFKKWNPGANAYQLPGASEMSFEG--GVSQSAYNTLASLDLPKPEQGPETIN 1510 + + D ++FK ++ +S L + K P I Sbjct: 398 MYKRDFRDYFK-----------SFDKVFNYSDPNIISPRVDEILNQKNYKKF--NPLAII 444 Query: 1511 QVTEHKMSAE 1520 T + + E Sbjct: 445 YQTGYIVDGE 454 >UniRef50_C2G2G0 Possible wall-associated protein n=2 Tax=Sphingobacterium spiritivorum RepID=C2G2G0_9SPHI Length = 622 Score = 192 bits (488), Expect = 7e-47, Method: Composition-based stats. Identities = 82/463 (17%), Positives = 146/463 (31%), Gaps = 82/463 (17%) Query: 1063 QNVTETISLYSNPTKWFAGNMQSTGLWAPAQKEVTIKSNANVPVTVTVALADDLTGREKH 1122 + + T T + + TG + + I N T+ T H Sbjct: 70 ETINSTTLKNRLKTSYEPTRILPTGYYVAPNQSFNINVNLQQGSTLPKVQVG--THSRNH 127 Query: 1123 EVALNRPPRVTKTYSLDASGTVKFKVPYGGLIYIKGNSSTNESAS------FTFTGVV-K 1175 + + N P Y+L GG+I+I N +A+ +F+G + + Sbjct: 128 DYSYNPP-----VYNLSTGSNTITANADGGIIWITYEQPANATAAPGASALLSFSGNISR 182 Query: 1176 APFYK-----DGAWKNDL-NSPAPLGELESDAFVYTTPKKNLNASNYTGGLEQFANDLDT 1229 P Y W+N L L VY KNL+++ T + N+ D Sbjct: 183 IPVYIKNSTTLSNWQNQLLTYNNATDVLMVGQNVYMVYAKNLHSAVSTQNNDLILNNADA 242 Query: 1230 FASSMNDFYGRDSEDGKHRMFTYKNLPGHKHRFTNDVQISIGDAHSGYPVMNSSFSPNST 1289 + + G ++ +H +L +V + A Y +++ + Sbjct: 243 VWDNHYTYAGLNNSSTQHTRPVIPHL-----MVQTEVPFGLYYAFF-YRTAYANYDAHKV 296 Query: 1290 TLPTTPLNDWLIWHEVGHNAAETPLTVPGATEVANNVLALYMQDRYLGKMNRVADDIT-- 1347 N W +WHE GH P+ G EV N+ +L + R+ D Sbjct: 297 FGEDIVTN-WGVWHEFGHLMQMQPIDWDGLGEVTVNIFSLKAERALGITPTRLTKDNVWP 355 Query: 1348 VAPEYLEESNNQAWARGGAGDRLLMYAQLKEWAEKNFDIKKWYPDGTPLPEFYSEREGMK 1407 YL + + + G +L M+ QL Sbjct: 356 QVHTYLSSTGTKNFDSQGVWVKLAMFHQLWL---------------------------AY 388 Query: 1408 GWNLFQLMHRKARGDEVSNDKFGGKNYCAESNGNAADTLMLCASWVAQTDLSEFFKKWNP 1467 G + + ++R R D + S+ +L A + +L+ FF+ W Sbjct: 389 GDSFYTNLYRSVREDTGTY----------SSSDAKKKNFILKACQQSGYNLTSFFQAWGI 438 Query: 1468 GANAYQLPGASEMSFEGGVSQSAYNTLASLDLPKPEQGPETIN 1510 N+ + YN +A+L+LP P + Sbjct: 439 QDNSL----------------NIYNAVAALNLPTPSYDLSQLT 465 >UniRef50_A2EC39 Putative uncharacterized protein n=1 Tax=Trichomonas vaginalis RepID=A2EC39_TRIVA Length = 734 Score = 191 bits (486), Expect = 1e-46, Method: Composition-based stats. Identities = 71/424 (16%), Positives = 133/424 (31%), Gaps = 41/424 (9%) Query: 1004 VDNNMIYGDGSSKAGMMNPSYPLNYMEKPLTRLMLGRSWWDLNIKVDVEKYPGAVSEEGQ 1063 N + G + P M + L+ + D E +PG Sbjct: 318 QAYNHLVNTGYRTEQGLCPLLSHCIMSTLIPDLVSKLPAKYIKASPDAEDFPGLCP---- 373 Query: 1064 NVTETISLYSNPTKWFAGNMQSTGLWAPAQKEVTIKSNANVPVTVTVALADDLTGREKHE 1123 T+ + + + STGLW PA I S+ ++ + + Sbjct: 374 --NATVENHELEVRLHEESWISTGLWLPAGSLGEIISDPEDKSIFSIQIGSHTESLLSRQ 431 Query: 1124 VALNRPPRVTKTYSLDASGTVKFKVPYGGLIYIKG---NSSTNESASFTFTGVVKAPFY- 1179 R P V+KTY +D SG + P+GG++YI N T+ F G V+ P Sbjct: 432 GPWKRWPVVSKTYDIDPSGKTEIASPFGGIVYITVRELNEETDNRVKLKFNGFVRHPRAV 491 Query: 1180 --KDGAWKNDLNSPAPLGELESDAFVYTTPKKNLNASNYTGGLEQFANDLDTFASSMNDF 1237 K W++ + P GE+ S ++T P + +++ +D + + +F Sbjct: 492 IRKPEIWESSKDYQVPWGEMCSKTVIFTLPSDEIRKI---TDIDKVLEHVDKVVTHVLEF 548 Query: 1238 YGRDSEDGKHRMFTYKNLPGHKHRFTNDVQISIGDAHSGYPVMNSSFSPNSTTLPTTPLN 1297 +R D Q+ S YP++ + Sbjct: 549 MNAS--------------MNRPYRVVFDTQLPGDKTESEYPIVLDIKDLDDIINNYNKP- 593 Query: 1298 DWLIWHEVGHN--AAETPLTVPGATE--VANNVLALYMQDRYLGKMNRVADDITVAPEYL 1353 ++ + + E ++N V + + + G + D P + Sbjct: 594 TVAMFELIARVSLLSIREAYFDNLVETAISNLVACIIFNEFFPGFDPNIFKDYNFPPLFA 653 Query: 1354 EESNNQAWARGGAGDRLLMYAQLKEWAEKNFDIKKWYPDGTPLPEFYSEREGMKGWNLFQ 1413 E G LL +Q + + D +F + + +N + Sbjct: 654 ELWVLHQHINQGLIKDLLALSQ-------APEAPYFDSDEDMWTQFVKDVSVIGKYNFTK 706 Query: 1414 LMHR 1417 L+ R Sbjct: 707 LLER 710 >UniRef50_B2UP41 Putative lipoprotein n=1 Tax=Akkermansia muciniphila ATCC BAA-835 RepID=B2UP41_AKKM8 Length = 679 Score = 191 bits (485), Expect = 2e-46, Method: Composition-based stats. Identities = 65/498 (13%), Positives = 125/498 (25%), Gaps = 98/498 (19%) Query: 1062 GQNVTETISLYSNPTKWFAGNMQSTGLWAPAQKEVTIKSNANVPVTVTVALADDLTGREK 1121 T ++ TG++ ++V + + + + + Sbjct: 151 EPYEPVKDLAERLRTSQYSKFENPTGIFFEEGEDVLLVMGDPKGEKLNLVIHNFGRDGGH 210 Query: 1122 HEVALNRPPRVTKTYSLDASGTVKFKVPYGGLIYIK---GNSSTNESASFTFTGVVKAPF 1178 L G + GL YI+ N + Sbjct: 211 SSYPLK-------------EGVNIIRAKNKGLGYIEYFTPNYKKAPKVHLSILSGKVNGV 257 Query: 1179 Y-----KDGAWKNDL-NSPAPLGELESDAFVYTTPKKNLNASNYTGGLEQFANDLDTFAS 1232 + K+ WK L NSP + ++ P + L E+ D Sbjct: 258 FVGGVSKNSDWKKMLENSPTEVVDIVGSRVHLVYPVEELKQF-CPDKGEELIALYDRIIG 316 Query: 1233 SMNDFYGRDSEDGKHRMFTYKNLPGHKHRFTNDVQISIGDAHSGYP--VMNSSFSPNSTT 1290 G Y+ LP ++ I G H+ ++ Sbjct: 317 MEQQIMGLYK---------YRMLPKNRM---FGRVIWNGFMHADGTGAAFHNGTMKEVGN 364 Query: 1291 LPTTPLNDWLIWHEVGH-NAAETPLTVPGATEVANNVLALYMQDRYLGKMNRVADDITVA 1349 P + W I HE GH N + EV NN+ + Y+ R+ + Sbjct: 365 PDRIPGSAWGIAHEFGHVNQVRPAMKWVSTGEVTNNIYSAYVNYMLNPSSMRLEHERING 424 Query: 1350 PE--YLEESNNQAWARG--------------------------GAGDRLLMYAQLKEWAE 1381 + + N G +L QL+ + + Sbjct: 425 GDGNMIGGRFNAYLNNGILKGENWLVQSGPDKRSGGDNRPMVHDHFVKLAPLWQLELYFK 484 Query: 1382 KNFDIKKWYPDGTPLPEFYSEREGMKGWNLFQLMHRKARGDEVSNDKFGGKNYCAESNGN 1441 G P+FY + ++ + RG + + Sbjct: 485 ---------VAGKGNPDFYPDI-------FYKAIKMDTRGKKDGELQLA----------- 517 Query: 1442 AADTLMLCASWVAQTDLSEFFKKWNPGANAYQLPGASEMSFEGGVSQSAYNTLASLDLPK 1501 M A A+ DL++FF+K ++++ L + Sbjct: 518 ----FMKNACDAARQDLTDFFRKTGMLKPID-QELDDYTCARMTITEADCKNLIAYARKY 572 Query: 1502 PEQGPETINQVTEHKMSA 1519 + I ++ + A Sbjct: 573 KKPESPVIYYISVNSAEA 590 >UniRef50_A2DY87 Putative uncharacterized protein n=1 Tax=Trichomonas vaginalis RepID=A2DY87_TRIVA Length = 725 Score = 190 bits (482), Expect = 4e-46, Method: Composition-based stats. Identities = 90/626 (14%), Positives = 175/626 (27%), Gaps = 113/626 (18%) Query: 873 YEVNCLEYRPGTGVPVTGGMYVPQYTQL---------SLNADTAKAMVQAADLGTNIQRL 923 YE ++ G V G + +++ + + ++ A+ G +I Sbjct: 153 YESQLVKIVRALGFIVDKGDFKKDFSRYNLIICASTLKMTPEEMDKFLRYANNGGSICCC 212 Query: 924 Y------QHELYFRTNGRKGERLSSVDLE--------------RLYQ---NMSVWLWNDT 960 Y Q E + + LS +D Y N + L +T Sbjct: 213 YCPSFSTQEECFNINSFLLEFGLSFMDCSINSETSEPIKTNIPVTYDLVKNKPLPLMVNT 272 Query: 961 SYR------YEEGKNDELGFKTFTEFLNC---YANDAYAGGTKCSADLKKSLVDNNMIYG 1011 + D+ + L+C Y+N + C + + Sbjct: 273 FKEMMALNNFNSETIDDFVTELRYVCLSCKGRYSNVLCDILSAC----------WDYLNQ 322 Query: 1012 DGS-SKAGMMNPSYPLNYMEKPLTRLMLGRSWWDLNIKVDVEKYPGAVS-EEGQNVTETI 1069 ++ G ++P+ N + + + L++ D +PG E + T I Sbjct: 323 TNIRNEDGFLDPNIIQNIIIVLIIDVYNQLPIDKLSVIPDCSTFPGLAKIREFKEYTIEI 382 Query: 1070 SLYSNPTKWFAGNMQSTGLWAPAQKEVTIKSNANVPVTVTVALADDLTGREKHEVALNRP 1129 ++ STGLW P T K N + + + R Sbjct: 383 PIHEQTL-------HSTGLWLPPGVTATCKIEDNFTDNIAIQIGAHSQSLVSQPPPWKRW 435 Query: 1130 PRVTKTYSLDASGTVKFKVPYGGLIYIKGNSSTNESASFTFTGVVKAPFYKDGA---WKN 1186 P V Y ++ G + GG++YI ++ S TFT P ++ Sbjct: 436 PHVVMAYKVNN-GVTEIFSQVGGMVYIGVAETSVSSLKMTFTNCALYPRAIRSDPKIFEQ 494 Query: 1187 DLNSPAPLGELESDAFVYTTPKKNLNASNYTGGLEQFANDLDTFASSMNDFYGRDSEDGK 1246 N P E+ ++ + P + + D + + + Sbjct: 495 TQNFDVPWSEISANNVTFVLPTVEFKKI---YDMNEIFKYYDDYIEIVAKIMNYN----- 546 Query: 1247 HRMFTYKNLPGHKHRFTNDVQISIGDAHSGYPVMNSSFSPNSTTLPTTPLNDWLIWHEVG 1306 R DVQ GYP+ S N+ N L Sbjct: 547 ---------IPRPFRVVFDVQTPDDLPVPGYPITIPIDSINNLFYNLEEPNCDL------ 591 Query: 1307 HNAAETPLTVPGATEVANNVLALYMQDRYLGKMNRVADDITVAPEYLEESNNQAWARGGA 1366 T +A + + + + R I +A ++++ + + Sbjct: 592 ---------FDLLTSIAT---SCLREGYFDSETERALSQIVIAQIFMDKYDGFDVESDES 639 Query: 1367 GDRLLMYAQL-----KEWAEKNFDI---------KKWYPDGTPLPEFYSEREGMKGWNLF 1412 ++ +L K F I ++Y D F + + NL Sbjct: 640 FQVTQLFHELWRIHRKINNTALFQIIRESMSPDGPEYYDDEERWTAFTRDLSHITQLNLT 699 Query: 1413 QLMHRKARGDEVSNDKFGGKNYCAES 1438 ++ R R + C + Sbjct: 700 KIFERLKRIPLNISANLEMFPACPDD 725 >UniRef50_A5ZKM8 Putative uncharacterized protein n=3 Tax=Bacteroides RepID=A5ZKM8_9BACE Length = 857 Score = 188 bits (478), Expect = 1e-45, Method: Composition-based stats. Identities = 62/468 (13%), Positives = 133/468 (28%), Gaps = 64/468 (13%) Query: 1071 LYSNPTKWFAGNMQSTGLWAPAQKEVTIKSNANVPVTVTVALADDLTGREKHEVALNRPP 1130 T+ + TG++ EV + ++++ + T + + ++N Sbjct: 338 ADKLMTRKYGDLDNPTGIYVNEGDEVVVLVGDTHGQSISIQNIGEETSKGYAQTSVNGD- 396 Query: 1131 RVTKTYSLDASGTVKFKVPYGGLIY------IKGNSSTNESASFTFTGVVKAPFY----- 1179 Y L G K G+++ I+ + G F+ Sbjct: 397 ----IYPLK-EGVNKLTAKQTGMLFVMYNTNIQNPDAQPIKIHIPLGGGKVCGFFSLKEH 451 Query: 1180 -KDGAWKNDLN-SPAPLGELESDAFVYTTPKKNLNASNYTGGLEQFANDLDTFASSMNDF 1237 + +K ++ + + +A + K L A+ + D + Sbjct: 452 QTNEKYKELIDKADYKYFCVIGNAIILYFHHKQLKAAV-PYDILSSIELWDNMIQWQQEL 510 Query: 1238 YGRDSEDGKHRMFTYKNLPGHKHRFTNDVQISIGDAHS--GYPVMNSSFSPNST--TLPT 1293 G + P + + G + G + + + Sbjct: 511 MGIEDV-----------YPKQMNNHIFAISPEGGYMWASEGRIGFVYTVLGDILRKSYLM 559 Query: 1294 TPLNDWLIWHEVGHNAAETPLTVPGATEVANNVLALYMQDRYLGKMNRVADDITVAPEYL 1353 N W HE+GH + TE +NN+ + Y ++ +R + PEY Sbjct: 560 ASRNSWGPAHEIGHVHQ-GAINWASTTESSNNLFSNYTIYKFGQNCSRGTE--LAVPEYA 616 Query: 1354 EESNNQAWARGGAGDRLLMYAQLKEWAEKNFDIKKWYPDGTPLPEFYSEREGMKGWNLFQ 1413 R + + K W + D PE ++ W L+ Sbjct: 617 A----NVKKATLVFRRCV---ENKAWCDFGTDY------QGEDPEMHARMN----WQLWN 659 Query: 1414 LMHRKARGDEVSN--DKFGGKNYCAESNGNAADTLM-LCASWVAQTDLSEFFKKWNPGAN 1470 HR + K +N + + + A A +L++FF++W Sbjct: 660 YYHRCGYNPQFFPTLFKLMRENRVSTQDPGENQMMYARMACRAANENLTDFFERWGFFV- 718 Query: 1471 AYQLPGASEMSFEGGVSQSAYNTLAS--LDLPKPEQGPETINQVTEHK 1516 + ++ V+ + P P+ + + K Sbjct: 719 PISMKVNQYGTYNYIVTDAMIKETKEFMKQFPAPKHA---FYYLEDRK 763 >UniRef50_A8R9D2 Putative uncharacterized protein n=1 Tax=Eubacterium dolichum DSM 3991 RepID=A8R9D2_9FIRM Length = 1702 Score = 184 bits (466), Expect = 3e-44, Method: Composition-based stats. Identities = 84/626 (13%), Positives = 161/626 (25%), Gaps = 117/626 (18%) Query: 928 LYFRTNGRKGERLSSVDL---ERLYQNMSVWLWNDTSYRYEEGKNDELGFKTFTEFLNCY 984 Y+R R S V + + + S+ + Y Y+ ++D N Y Sbjct: 694 RYYRIKLPSPIRTSEVIVGLGRYVAGSPSITIAEMRFYSYDSLEDD---------ISNLY 744 Query: 985 ANDAYAGGTKCSADLKKSLVDNNMIYGDGSSKAGMMNPSYPLNYMEKPLTRLMLGRSWWD 1044 +D + + + + N + D L R +L + Sbjct: 745 EDDLHVQLKNSVDEKQLEDLQNRLDTTDNGEY---------------HLDRNLLQKELDT 789 Query: 1045 L-NIKVDVEKYPGAVSEEGQNVTETISLYSNPTKWFAGNMQSTGLWAPAQKEVTIKSNAN 1103 ++ D EK T + Q G+ A A +++ + Sbjct: 790 AKSLFEDSEKLYDVTKVNT---NITAKKDGHLGFSGLNAWQPLGVSASANEKIVVYVGGK 846 Query: 1104 ---VPVTVTVALADDLTGREKHEVALNRPPRVTKTYSLDASGTVKFKVPYGGLIYIKG-N 1159 V + L E ++ T + V GG +Y++ Sbjct: 847 GKKTGQNVNLQLVATQQHAESSNLSKVVATLKTGRNEITIPELTSTDVERGGALYVQYTG 906 Query: 1160 SSTNESASFTFTGVVKAP------------------------------------------ 1177 + + E + K P Sbjct: 907 NDSKEELAVRVMNGTKIPVLNLYGVTDEKERQNKINAYMEEVKAHTKQLKTLHRKQKGFS 966 Query: 1178 FYKDGAWKNDLNSPAPLGELESDAFVYTTPKKNLNASNY--TGGLEQFANDLDTFASSMN 1235 + ++ S ++ D + + P + A L +D Sbjct: 967 LFSLFQGYDEKTSIVNTTDIMLDQMMISIPASQVVAGTNGDADTLASSLQAMDEMMILFY 1026 Query: 1236 DFYGRDSEDGKHR---------MFTYKNLPGHKHRFTNDVQISIGDAHSGYPVMNSSFSP 1286 G + + ++L + + H G + Sbjct: 1027 QQKGLTNSFSEKDLSNKLNVKNSLPSQHLNIRYMKMFAGAFMYAAGNHIGIEWNETIGMV 1086 Query: 1287 NSTTLPTTPLNDW--------LIWHEVGHNAAETPLTVPGATEVANNVLALYMQDRYLGK 1338 N+T++ W I HE+GHN + + EV NN +L Q + Sbjct: 1087 NTTSVQADKDGRWLSGQYFGWGIAHEIGHNINQGKYAIA---EVTNNYFSLIAQAKDTND 1143 Query: 1339 MNRVADDITVAPEYLEESNNQAWARGGAGDRLLMYAQLKEWAEKNFDIKKWYPDGTPLPE 1398 R D + + + +L MY QL ++ ++ K Sbjct: 1144 SVRF--DYEKVYDRVTSNVKS--RSEDVFTQLAMYWQLHLAYDRYYNYK----------- 1188 Query: 1399 FYSEREGMKGWNLFQLMHRKARGDEVSNDKFGGKNYCAESNGNAADTLMLCASWVAQTDL 1458 + E + F + R ++ + M AS A+ DL Sbjct: 1189 LFDNYEDIYNHVFFARIDSFVR---DTSKAPSPNEIALTLGNDKDQNFMRLASAAAEKDL 1245 Query: 1459 SEFFKKWNPGANAYQLPGASEMSFEG 1484 SEFF +W + + E Sbjct: 1246 SEFFMRWGLIPDDTTKAYIEQFEKEM 1271 >UniRef50_B2UQK5 Putative uncharacterized protein n=1 Tax=Akkermansia muciniphila ATCC BAA-835 RepID=B2UQK5_AKKM8 Length = 747 Score = 183 bits (465), Expect = 3e-44, Method: Composition-based stats. Identities = 65/445 (14%), Positives = 130/445 (29%), Gaps = 66/445 (14%) Query: 1085 STGLWAPAQKEVTIKSNANVPVTVTVALADDLTGREKHEVALNRPPRVTKTYSLDASGTV 1144 TG+ A E+ + + +A G E LN G Sbjct: 247 PTGIVAEKGDEILVFVGPTHGEDIGLASVS-PAGIESSSYPLN-------------EGVN 292 Query: 1145 KFKVPYGGLIYI-----KGNSSTNESASFTFTGVVKAPFY-----KDGAWKNDL-NSPAP 1193 K ++ GL+Y+ + + ++ D WK + N+P Sbjct: 293 KIRINRSGLLYVMYHTDISPPKKPITVHIPVGSGIVNGYFDVTRHTDKDWKRMISNAPHS 352 Query: 1194 LGELESDAFVYTTPKKNLNASNYTGGLEQFANDLDTFASSMNDFYGRDSEDGKHRMF-TY 1252 + ++ + K L + + + D +M G D H Sbjct: 353 MFDIVGRNSMMILHTKYLKDYS-PDSITKSVRVWDESVKAMWKIMGFDKYPQPHNNRQLG 411 Query: 1253 KNLPGHKHRFTNDVQISIGDAHSGYPV--MNSSFSPNSTTLPTTPLND-WLIWHEVGHNA 1309 ++ G H F + GY + ++ N W I HE+GH Sbjct: 412 VSVEGGAHMF-------ATWYYCGYSIGDQGNTLKNEVLAPGVLQGNRLWGIGHEIGHCY 464 Query: 1310 AETPLTVPGATEVANNVLALYMQDRYLGKMNRVADDITVAPEYLEESNNQAWARGGAGDR 1369 P +E +NN A + D+ +N + N + Sbjct: 465 QH-PFNWRSMSESSNNFFAQLILDQVTNAINGNEQ--------ASDMENPCKYLLSEAVK 515 Query: 1370 LLMYAQLKEWAEKNFDIKKWYPDGTPLPEFYSEREGMKGWNLFQLMHRKARGDEVSNDKF 1429 + + L WA+ F +Y ++ + + + R +S + Sbjct: 516 GMPFHDLNGWAKWGFAQYSFY-------LYFHKLGINP--EFYPRLFESLRRKPLSRQAY 566 Query: 1430 GGKNYCAESNGNAADTLMLCASWVAQTDLSEFFKKWNPGANAY---QLPGASEMSFEGGV 1486 A L +++TD ++ F+ +N G + Sbjct: 567 EVS--------EAHLALYERICNISRTDFTDDFEIFNWFVPIDRKGHQYGDYSFKMTEEM 618 Query: 1487 SQSAYNTLASLDLPKPEQGPETINQ 1511 ++++ +A+ PKP+ ++Q Sbjct: 619 ARASKARIAAKRYPKPKFRIAFLHQ 643 >UniRef50_Q2SHR7 Putative uncharacterized protein n=2 Tax=Gammaproteobacteria RepID=Q2SHR7_HAHCH Length = 1031 Score = 183 bits (463), Expect = 5e-44, Method: Composition-based stats. Identities = 86/578 (14%), Positives = 166/578 (28%), Gaps = 86/578 (14%) Query: 958 NDTSYRYEEGKNDELG-----------FKTFTEFLNCYAN-DAYAGGTKCSADLKKSLVD 1005 Y ++ D F+ + N +N D + Sbjct: 448 ESEDYAFDWSACDGENCSAVSGLNAGFFEGADQVRNIMSNLDKNKTNIFADEGKRYQ--- 504 Query: 1006 NNMIYGDGSSKAGMMNPSYPLNYMEKPLTRLMLGRSWWDLNIK---------VDVEKYPG 1056 ++ G + +YP++ + L + D ++ D+ + Sbjct: 505 -KLLALLGDHYRASV--TYPMDKITTE-DNAFLKSLYADYSVYNYRALNPAQKDMGNFSR 560 Query: 1057 AVSEEGQNVTETISLYSNPTKWFAGNMQSTGLWAPAQKEVTIKSNANVPVTVTVALADDL 1116 + VT+TI + S ++ G++A V + N V ++ + Sbjct: 561 SSFNHVSPVTKTIDMESKR------YFRAAGVYALPGYTVKVTRLDNADVATSIFVNTQR 614 Query: 1117 TGREK--HEVALNRPPRVTK-TYSLDASGTVKFKVPYGGLIYIKGNSSTNESASFTFTGV 1173 +G + RP + +S+ + ++ F YGG + I+ + ++ SF F V Sbjct: 615 SGATHQFEKNGYTRPKFLQTPKFSIKSGESITFTSTYGGPLQIEFGGN-GKNVSFRFENV 673 Query: 1174 VKAPFYKDGAWKNDLNSPA------PLGELESDAFVYTTPKKNLNASNYTGGLEQFANDL 1227 + PF+ +G N A EL + F + + S + A Sbjct: 674 GEHPFW-NGEEDNASFEQALAQGDYDWAELVTPGFEVHSKLDKMRKSMQDENWKTGAALA 732 Query: 1228 DTFASSMNDFY---------GRDSEDGKHRMFTYKNLPGHKHRFT---NDVQISIGDAHS 1275 +++F G D H T NL N Q + G S Sbjct: 733 AGTMRYIHNFPHVLAGFKGPGIDVVAEIHDFATAHNLDIEHIDIVKHMNADQATCGYGCS 792 Query: 1276 GYPVMNSSFSPNSTTLPTTPLNDWLIWHEVGHNAAETPLTVPGA-TEVANNVLALYMQDR 1334 G P + P HE+GH G + N + Y + Sbjct: 793 GNP-----YDAYWEFSPIGH----GDIHELGHGLERGRFRFSGWDGHASTNPYSYYSKSH 843 Query: 1335 YLGKMNRVADDITVAPEYLEESNNQAWARGGAGDRLLMYAQLKEWAEKNFDIKKWYPDGT 1394 Y + ++ E + + + + + +L W+ Sbjct: 844 YYIDTGKNPGCQSLPFESMFKVLRDSVSESN-PQAYVQSQKLTGWSNGA------GITVQ 896 Query: 1395 PLPEFYSEREGMKGWNLFQLMHRKARGDE------------VSNDKFGGKNYCAESNGNA 1442 + E + GWNL +H R SN F + + N+ Sbjct: 897 MMMTAQGEGKLNDGWNLLPRLHILDRNFNAALASEEAWSAAKSNLGFSQYSLAEAKSINS 956 Query: 1443 ADTLMLCASWVAQTDLSEFFKKWNPGANAYQLPGASEM 1480 D + + S D ++ W +A + Sbjct: 957 NDWMTVAVSHATGLDFRDYLTMWALPFSAKASAQVASF 994 >UniRef50_C2FV33 Possible wall-associated protein n=2 Tax=Sphingobacterium spiritivorum RepID=C2FV33_9SPHI Length = 620 Score = 181 bits (459), Expect = 2e-43, Method: Composition-based stats. Identities = 80/468 (17%), Positives = 138/468 (29%), Gaps = 84/468 (17%) Query: 1063 QNVTETISLYSNPTKWFAGNMQSTGLWAPAQKEVTIKSNANVPVTVTVALADDLTGREKH 1122 Q+ T + TGL A + ++ + TV L + Sbjct: 67 QSTEYVTDGNRLRTALSYSDFDPTGLKPIAGTALQVQVTFHGGSTVKPVLIVGTPELDTE 126 Query: 1123 EVALNRPPRVTKTYSLDASGTVKFKVPYGGLIYIKGNSSTNESASFTF-TGVVKAPFYKD 1181 + T L+ K Y L Y+ +TN+S + F +G + P +K Sbjct: 127 QTY-------TLNAGLNTLNPTLIKNMY--LQYVSATPNTNDSVTVEFISGYTQVPLFKL 177 Query: 1182 G-----AWKNDLN--SPAPLGELESDAFVYTTPKKNLNASNYTGGLEQFANDLDTFASSM 1234 G W + L S A L + N +D S Sbjct: 178 GQTTAAEWADQLTTFSDAEYVTLTGQKNYILLTRSRYNNYTS-TDPNTVLAAVDLIISRE 236 Query: 1235 NDFYGRDSEDGKHRMFTYKNLPGHKHRFTNDVQISIGDA---HSGY-----PVMNSSFSP 1286 N+ G D+ HR K ++ + G H G Sbjct: 237 NEISGLDNSSTLHREPDGK---------VCIIEKNSGYMDATHKGLVRLTGAAAWDKVFK 287 Query: 1287 NSTTLPTTPLNDWLIWHEVGHNAAETPLT-VPGATEVANNVLALYMQDRYLGKMNRVA-- 1343 + + ++ W +WHE+GH P+T E A N+ A Y + Y V+ Sbjct: 288 AKNIVSGSTVDQWGLWHEIGHLHQLWPITPYTVLGEAAPNIYASYTKKYYEPTYRYVSGT 347 Query: 1344 DDITVAPEYLEESNNQAWARGGAGDRLLMYAQLKEWAEKNFDIKKWYPDGTPLPEFYSER 1403 + + + + A +L+M+ QL Sbjct: 348 NWMNAKVYLAQPDAAKNLAAAENYTKLMMFEQLML------------------------- 382 Query: 1404 EGMKGWNLFQLMHRKARGDEVSNDKFGGKNYCAESNGNAADTLMLCASWVAQTDLSEFFK 1463 G + + +H+ R + +N + L AS + +L+ FF+ Sbjct: 383 --AFGEDFMKNLHKMVREEIGITYPLPNRNT--TNVDERLGALAFYASKTSGKNLTNFFQ 438 Query: 1464 KWNPGANAYQLPGASEMSFEGGVSQSAYNTLASLDLPKPEQGPETINQ 1511 KW +S S +++L+LP+P IN Sbjct: 439 KWGFN-----------------LSSSRIALISALNLPEPATDVSLINS 469 >UniRef50_A2DDW5 Immuno-dominant variable surface antigen-like n=4 Tax=Trichomonas vaginalis RepID=A2DDW5_TRIVA Length = 1162 Score = 180 bits (457), Expect = 3e-43, Method: Composition-based stats. Identities = 73/427 (17%), Positives = 135/427 (31%), Gaps = 61/427 (14%) Query: 1048 KVDVEKYPGAVSEEGQNVTETISLYSNPTKWFAGNMQSTGLWAPAQKEVTIKSNANVPVT 1107 V ++ G + + L+ N + G TGL+AP + +TI+ + + Sbjct: 104 PVGTRQFYGNENLVNNSKRYKARLFINTRR---GPFHPTGLYAPPGELITIEISEKIVNG 160 Query: 1108 VTVALADDLTGREKHEVALNRPPRVTKTYSLDASGTVKFKVPYGGLIYIKGNSST-NESA 1166 +T+ + + L TK + + KF PYGG I + T N+ Sbjct: 161 ITLKINRHVDEAANDSNILGG----TKCDIVLSGTVSKFCWPYGGTIEFERGVDTANQGF 216 Query: 1167 SFTFTGVVKAPFYKDG-----AWKNDLNSPA-PLGELESDAFVYTTPKKNLNASNYTGGL 1220 +GV++ P++ G W+ DL+ A P+ ++ A T P + S Sbjct: 217 DVNISGVIRCPYFIYGSTTDEEWEEDLSKQAGPVMFIDYGAGFVTMPSTDAKNSIRLNDA 276 Query: 1221 EQFANDLDTFASSMNDFYGRDSEDGKHRMFTYKNLPGHKHRFTNDVQISIGDAHSGYPVM 1280 F + S+N+ R + + + +F + S A G V+ Sbjct: 277 MAFWRGVSRVLYSVNEVTYRS---RRSDGRVTTPMMTNLDKFV---RASEAVALVGANVI 330 Query: 1281 NSS--FSPNSTTLPTTPLNDWLIWHEVGHNAAETPLTVPGATEVANNVLALYMQDRYLGK 1338 + P + W HE H+ EV+NNVL L Sbjct: 331 YEPPYWYPGWVNYESARWGCWGQLHEYAHHFQYF-WGWGDYGEVSNNVLNLISMSLMTEI 389 Query: 1339 MNRVA--DDITVAPEYLEESNNQAWARGGAGDRLLMYAQLKEWAEKNFDIKKWYPDGTPL 1396 ++ + ++ + + + + D L Y W + K Y Sbjct: 390 DSKRQIYLNGCISLGDGWDYTSHPYGNINSKDLLFWYGLHLYWFGPDLQRKILYA----- 444 Query: 1397 PEFYSEREGMKGWNLFQLMHRKARGDEVSNDKFGGKNYCAESNGNAADTLMLCASWVAQT 1456 HR + ++ Y +L S + + Sbjct: 445 -------------------HRSYQYFTRNDTYPYVTEY------------LLHISLLTKR 473 Query: 1457 DLSEFFK 1463 DL +F+ Sbjct: 474 DLRPYFR 480 >UniRef50_A2DZU5 Putative uncharacterized protein n=1 Tax=Trichomonas vaginalis RepID=A2DZU5_TRIVA Length = 727 Score = 180 bits (456), Expect = 4e-43, Method: Composition-based stats. Identities = 67/316 (21%), Positives = 100/316 (31%), Gaps = 35/316 (11%) Query: 992 GTKC-SADLKKSLVDN-----NMIYGDGSSKAGMMNPSYPLNYMEKPLTRLMLGRSWWDL 1045 C D LV + + P + L +M S D Sbjct: 295 NVSCMPQDTNPELVQLAASALGYLRSHNFDTPEGLCPDISHGVISVLLCEIMAKLSAQDF 354 Query: 1046 NIKVDVEKYPGAVSEEGQNVTETISLYSNPTKWFAGNMQSTGLWAPAQKEVTIKSNANVP 1105 E++P SEE + STG W A I N P Sbjct: 355 AGHDFSERFPERASEEPSEQDV----GTVRLTLQNEGWYSTGYWLTAGVVAKITLNEVPP 410 Query: 1106 VTVTVALADDLTGREKHEVALNRPPRVTKTYSLDASGTVKFKVPYGGLIYIKGNS--STN 1163 V + V + E R P +T T+ LD + P+GG+IYI + +T Sbjct: 411 VPLVVQVGMHAEAIFTKEGPWKRWPMITTTFELD--EVTEIANPFGGMIYIIFDRSVNTP 468 Query: 1164 ESASFTFTGVVKAPFYKDG---AWKNDLNSPAPLGELESDAFVYTTPKKNLNASNYTGGL 1220 + T V P Y W+ + AP E+E+ + P K + A L Sbjct: 469 ITIDMTIDHVFPYPLYSPEAPDTWEKTKDRGAPWAEVETYYMTFVAPSKVVRACP---NL 525 Query: 1221 EQFANDLDTFASSMNDFYGRDSEDGKHRMFTYKNLPGHKHRFTNDVQISIGDAHSGYPVM 1280 E +D S+ +F G + HK R D+++ I GYP++ Sbjct: 526 EANCKMIDGLIESVLEFLG--------------DHSEHKFRCIFDIEL-IDMPVCGYPII 570 Query: 1281 NSSFSPNSTTLPTTPL 1296 + S T P Sbjct: 571 MHVDNAESFFSDTNPS 586 >UniRef50_D2VUE2 Predicted protein n=1 Tax=Naegleria gruberi RepID=D2VUE2_NAEGR Length = 808 Score = 179 bits (453), Expect = 1e-42, Method: Composition-based stats. Identities = 56/311 (18%), Positives = 89/311 (28%), Gaps = 59/311 (18%) Query: 1181 DGAWKNDL-NSPAPLGELESDAFVYTTPKKNLNASNYTGGLEQFANDLDTFASSMNDFYG 1239 D WKN + N P P E+ESD FV+ L N + Sbjct: 349 DSDWKNTIRNYPGPWVEVESDYFVFNVESSYARNL---TELTSVTNYWKKVLELYYEL-- 403 Query: 1240 RDSEDGKHRMFTYKNLPGHKHRFTNDVQISIGDAHSGYPVMNSSF---------SPNSTT 1290 + + + +K R D+ ISIG HSGYP+M +P + Sbjct: 404 -----------SQRPIRDYKERMQIDIDISIGYMHSGYPIMAFQDQNEGTIKAVNPANMA 452 Query: 1291 LPTTPLNDWLIWHEVGHNAAETPLTVPGATEVANNVLALYMQDRYLGKMNRVADDITVAP 1350 L + W +HE+GHN + T A EV N+ +LY+++ + N Sbjct: 453 LASKTPGRWGHYHELGHNFQVSDWTYSQAVEVTCNIFSLYLEENFPDSANTYGKSFNPPR 512 Query: 1351 EYLEESNNQAWARGGAGDRLLMYAQLKEWAEKNFDIKKWYPDGTPLPEFYSEREGMKGWN 1410 G FY + GW Sbjct: 513 GLETTFKGNTQDYKGDWTSAG-------------------------LLFYLDLIQAFGWV 547 Query: 1411 LFQLMHRKARGDEVSNDKFGGKNYCAESNGNAADTLMLCASWVAQTDLSEFFKKWNPGAN 1470 + + S D ++ S + +L F W+ + Sbjct: 548 SIRKAFAEYYTAASGT--------LPTSEDEKRDQWVVRYSKIVGRNLGPFCDAWSFPIS 599 Query: 1471 AYQLPGASEMS 1481 S ++ Sbjct: 600 QSAKSQVSNLT 610 >UniRef50_C9YCF1 Putative uncharacterized protein n=1 Tax=Curvibacter putative symbiont of Hydra magnipapillata RepID=C9YCF1_9BURK Length = 872 Score = 176 bits (447), Expect = 4e-42, Method: Composition-based stats. Identities = 91/518 (17%), Positives = 158/518 (30%), Gaps = 71/518 (13%) Query: 1006 NNMIYGDGSSKAGMMNPSYPLNYMEKPLTRLMLGRSWWDL---------NIKVDVEKYPG 1056 + + YP++ P D + D+ + Sbjct: 328 LRYLVLWADVVRRQLR--YPMDKASTP--AAFQKALIADALVAYVRPVATAQADLGSF-- 381 Query: 1057 AVSEEGQNVTETISLYSNPTKWFAGNMQSTGLWAPAQKEVTIKSNANVPVTVTVALADDL 1116 A + T + T + G A K VT++ + TV++ L Sbjct: 382 ASAMTAGMAVSTTDETVDVTVPSTSGFTAIGRLAAPGKPVTVELLSAGSATVSLRLNTQR 441 Query: 1117 TGREKHEVA--LNRPPRV-TKTYSLDASGTVKFKVPYGGLIYIKGNSSTNES-ASFTFTG 1172 TG + NRP + + L ++ PYGG + + +++T + G Sbjct: 442 TGSTRLWDPNRYNRPRFLASPDMVLSTGQAMQLVSPYGGTLQLVFSNATPQQNVQLRLRG 501 Query: 1173 VVKAPFYKDGAWKNDLNSPA--------PLGELESDAFVYTTPKKNLN----ASNYTGGL 1220 V K PF D + E++ + + ++Y G + Sbjct: 502 VAKHPFLDQSNGAGDKAAFVTALNAAQHEWAEIKLAGIEIHSRADKMRAVINGTDYAGDI 561 Query: 1221 EQFANDLDT-FASSMNDFYGR---------------DSEDGKHRMFTYKNLPGHKHRFTN 1264 ++F N++ T F + G S T +PG Sbjct: 562 DKFLNEVKTLFFEDLYMLAGYALPGKSLTAHVQAMCTSLGWNCTDATLHRVPGT-QHINV 620 Query: 1265 DVQISIGDAHSGYPVMNSSFSPNSTTLPTTPLNDWLIWHEVGHNAAETPLTVPG--ATEV 1322 D G SG P + + W HEVGHN + V +TEV Sbjct: 621 DNYSQCGSGCSGNP-----YDQDW----GLSPRGWGESHEVGHNQQKGMHKVYDDRSTEV 671 Query: 1323 ANNVLALYMQDRYLGKMNRVADDITVAPEYLEESNNQAWARGGAGDRLLMYAQLKEWAEK 1382 +NN+ L+ R L +M D V+ YL N A+ A Y + A Sbjct: 672 SNNLFPLHKGWRMLSEMGYNTGDTRVS--YLSAFNMIKAAKLQADPVEAAYQSIWGNAAY 729 Query: 1383 NFDIKKWYPDGTPLPEFY--SEREGMKGWNLFQLMHRKARGDEVS--------NDKFGGK 1432 + ++ + GW++ L++ R + +K G Sbjct: 730 AVQNGERMAFYMQWVHYWAQRQVSIATGWDIITLLYLHQRQFDAVADADWAANRNKLGYS 789 Query: 1433 NYCAESNGNAADTLMLCASWVAQTDLSEFFKKWNPGAN 1470 Y + + D L++ SW+ Q D F W + Sbjct: 790 TYATKPSPTGNDNLLITLSWITQRDQRPTFDLWGVRYS 827 >UniRef50_Q7MJ28 Putative uncharacterized protein VV2335 n=13 Tax=Vibrionales RepID=Q7MJ28_VIBVY Length = 929 Score = 175 bits (444), Expect = 1e-41, Method: Composition-based stats. Identities = 71/501 (14%), Positives = 150/501 (29%), Gaps = 74/501 (14%) Query: 1024 YPLNYMEKPLTRLMLGRSWWD---------LNIKVDVEKYP-GAVSEEGQNVTETISLYS 1073 +P++ ++ L + D + ++ + E +++T+ L S Sbjct: 415 FPMDKSTT-VSLEFLKSYFADYVQYHSRSNNPKQPNMGNFSRSEFGAEIARISKTVQLES 473 Query: 1074 NPTKWFAGNMQSTGLWAPAQKEVTIKSNANVPVTVTVALADDLTGREKH--EVALNRPPR 1131 N +S G++A + I N V V++A+ +G +RP Sbjct: 474 KR------NFRSAGVYALPGETFQITRRDNSAVKVSIAINSLRSGATHEFSTNGYSRPKH 527 Query: 1132 VT-KTYSLDASGTVKFKVPYGGLIYIKGNSSTNESASFTFTGVVKAPFYKDGAWKNDLNS 1190 +T TY + + T++ YGG I + +++ + FT V + P ++ + Sbjct: 528 LTSTTYEIKSGETIRLTSAYGGPIQVHFDTN-DLPVELRFTNVAQHPVWRSAEDNEPFAA 586 Query: 1191 PA-----PLGELESDAFVYTTPKKNLNASNYTGGLEQ--------FANDLDTFASSMNDF 1237 EL + F + + + S + F ++ F Sbjct: 587 QLNQDQFDWAELITPGFEVHSKRDKMLQSISATEWAGSAAAMAQATERYMHNFPHALAGF 646 Query: 1238 YG------RDSEDGKHRMFTYKNLPGHKHRFTNDVQISIGDAHSGYPVMNSSFSPNSTTL 1291 G + D Q + G SG P + Sbjct: 647 KGPGITVFEQVQTYGESKGWQVETIDMVKHMNAD-QATCGYGCSGNP-----YDAYWAFS 700 Query: 1292 PTTPLNDWLIWHEVGHNAAETPLTVPGA-TEVANNVLALYMQDRYLGKMNRVADDITVAP 1350 P HE+GH + G N + Y + +Y ++ + + Sbjct: 701 PVGH----GDLHELGHGLEKGRFRFAGWEGHSTTNYYSYYSKSQYF--IDTGKESQCQSL 754 Query: 1351 EYLEESNNQAWARGGAGDRLLMYAQLKEWAEKNFDIKKWYPDGTPLPEFYSEREG--MKG 1408 ++ + +R A M AQ + W ++++G G Sbjct: 755 DFKGQYELLQQSRQQADPNAFMAAQNQTGW-------SWGGRVYIQMMMATQQQGILNDG 807 Query: 1409 WNLFQLMHRKARGDE------------VSNDKFGGKNYCAESNGNAADTLMLCASWVAQT 1456 W+L +H R + F + + + D L++ S++ + Sbjct: 808 WHLLGRLHLIEREFNRLKGSAELWDARKESIGFSQYSLDEANAISNNDWLLVALSYITER 867 Query: 1457 DLSEFFKKWNPGANAYQLPGA 1477 D+ + W + Sbjct: 868 DMRAYLNMWGFTFSDKAKQQV 888 >UniRef50_B0N0I0 Putative uncharacterized protein n=3 Tax=Bacteria RepID=B0N0I0_9FIRM Length = 1739 Score = 175 bits (443), Expect = 1e-41, Method: Composition-based stats. Identities = 88/609 (14%), Positives = 154/609 (25%), Gaps = 105/609 (17%) Query: 951 NMSVWLWNDTSYRYEEGKNDELGFKTFTEFLNCYANDAYAGGTKCSADLKKSLVDNNMIY 1010 N ++ + Y Y+ + D + + + + + Sbjct: 757 NRTMRISEFNFYYYDSLEED---------VNALFTDSFHLTVRDDVTSTTLDDLQTRLNT 807 Query: 1011 GDGSSKAGMMNPSYPLNYMEKPLTRLMLGRSWWDLNIKVDVEKYPGAVSEEGQNVTETIS 1070 D S+ ++P L +E R ++ + ++ S Sbjct: 808 PDEVSQ--ELHPFKDLIQLELNQARQVVEGTALQNMQEIHNG--------------IAAS 851 Query: 1071 LYSNPTKWFAGNMQSTGLWAPAQKEVTIKSNAN---VPVTVTVALADDLTGREKHEVALN 1127 N + Q G + V + L E + Sbjct: 852 KQGNLGFGGLNSWQPLGYVTYPGDTFIVYVGQEGKRNGQAVNLQLVYSQYHAESASFVSS 911 Query: 1128 RPPRVTKTYSLDASGTVKFKVPYGGLIYIKGNSSTNESASFTFTGVVKAPFYKDGA---- 1183 + V GG +Y++ ++NE + +G K Sbjct: 912 PISLKVGKNEISMKELQSIGVEKGGSVYVQYTGNSNEKIAVRVSGGEKIATLDLYQVSDE 971 Query: 1184 ------------------------------WKNDLNSPAP-------LGELESDAFVYTT 1206 N +N E+ D +Y+ Sbjct: 972 NERLEKVKTYLQSLQTQINKMASKHEELHRDDNSVNYDYDEKNCILGATEIMLDHMLYSV 1031 Query: 1207 P----KKNLNASNYTGGLEQFANDLDTFASSMNDFYGRDSEDGK----HRMFTYKNLPGH 1258 L + Q N L+ M FY + + + ++L Sbjct: 1032 SGKQIMAGLKGTTLDEKANQLLNSLNAMDQMMELFYQNKGLNENAAAINDRYPAQHLNIR 1091 Query: 1259 KHRFTNDVQISIGDAHSGYPVMNSSFSPNSTTLPTTPLND--------WLIWHEVGHNAA 1310 R + G H G + S N W I HE+GHN Sbjct: 1092 YQRMFAGAFMYAGGNHIGIEWGSVSGLSNGIPFEAAENGKYLSGSLFGWGIAHEIGHNIN 1151 Query: 1311 ETPLTVPGATEVANNVLALYMQDRYLGKMNRVADDITVAPEYLEESNNQAWARGGAGDRL 1370 + + E+ NN +L Q+R R ++N +L Sbjct: 1152 QGSYAIA---EITNNYFSLLSQNRDSNDTTRFKYPDVYEK----VTSNTVGMSSNVFTQL 1204 Query: 1371 LMYAQLKEWAEKNFDIKKWYPDGTPLPEFYSEREGMKGWNLFQLMHRKARGDEVSNDKFG 1430 MY QL ++N+ K + L F + AR N G Sbjct: 1205 AMYWQLHLAYDQNYHYKLYDSHEEQLNSL-----------FFARVDYYARNPGKVNIPEG 1253 Query: 1431 GKNYCAESNGNAADTLMLCASWVAQTDLSEFFKKWNPGANAYQLPGASEMSFEGGVSQSA 1490 G + N +A + AS A DL++FF W N S+ E Q Sbjct: 1254 GTAL--KLNSDAQQNFVRLASAAANKDLTDFFTAWGIIPNEETKAFISQFEAETDKIQYL 1311 Query: 1491 YNTLASLDL 1499 + + L Sbjct: 1312 DDDSMAYRL 1320 >UniRef50_B7V4F8 Putative uncharacterized protein n=7 Tax=Pseudomonas aeruginosa RepID=B7V4F8_PSEA8 Length = 923 Score = 175 bits (442), Expect = 2e-41, Method: Composition-based stats. Identities = 86/476 (18%), Positives = 152/476 (31%), Gaps = 68/476 (14%) Query: 1059 SEEGQNVTETISLYSNPTKWFAGNMQSTGLWAPAQKEVTIKSNANVPVTVTVALADDLTG 1118 G T T++L S A + G A K ++I+ ++ V L G Sbjct: 434 PVSGSEETLTLTLPS------AQGFTAIGRMAAPGKRLSIRIEDAGQASLAVGLNTQRIG 487 Query: 1119 REK-HEVALNRPPRVTKTYSLDASGTVKFK--VPYGGLIY-IKGNSSTNESASFTFTGVV 1174 + PR K+ + PYGGL+ + ++ ++ + TG Sbjct: 488 STRLWNTRQYDRPRFLKSPDIKLQANQSVALVSPYGGLLQLVYSGATPGQTVTVKVTGAA 547 Query: 1175 KAPFYKDGAWKNDLNS-----------PAPLGELESDAFVYTTPKKNLNAS---NYTGGL 1220 PF ++ + A E+ S + + + S +Y G + Sbjct: 548 SQPFLDIQPGEDSSQAIADFIQALDADKADWLEIRSGSVEVHAKVEKVRGSIDKDYGGDV 607 Query: 1221 EQFANDLDTF-ASSMNDFYGRD---------------SEDGKHRMFTYKNLPGHKHRFTN 1264 ++F +L+ G + T LPG Sbjct: 608 QRFIRELNEVFIDDAYTLAGFAIPNQAKTPAIQQECAARGWDCDSETLHKLPGT-QHINV 666 Query: 1265 DVQISIGDAHSGYPVMNSSFSPNSTTLPTTPLNDWLIWHEVGHNAAETPLTVPG--ATEV 1322 D G SG P + ++ W HE+GHN L V G + E+ Sbjct: 667 DQYAQCGGGCSGNP-YDQTWG--------LNPRGWGESHELGHNLQVNRLKVYGGRSGEI 717 Query: 1323 ANNVLALYMQDRYLGKMNRVADDITVAPEYLEESNNQAWARGGAGDRLLMYAQLKEWAEK 1382 +N + L+ R L + + DD V Y N R A +Y +L E Sbjct: 718 SNQIFPLHKDWRVLREFGQNLDDTRV--NYRNAYNLIVAGRAEADPLAGVYKRLWEDPGT 775 Query: 1383 NFDIKKWYPDGTPLPEFYSER--EGMKGWNLFQLMHRKARGDEVSNDKFGGK-------- 1432 + T ++++ + ++GW+++ L++ R + S+ Sbjct: 776 YALNGERMAFYTQWVHYWADLKNDPLQGWDIWTLLYLHQRQVDKSDWDANKAALGYGTYA 835 Query: 1433 ----NYCAESNGNAADTLMLCASWVAQTDLSEFFKKWNPGANAYQLPGASEMSFEG 1484 N S+ + D L+L SW+ Q D F W +A + F Sbjct: 836 QRPGNSGDASSTDGNDNLLLGLSWLTQRDQRPTFALWGIRTSAAAQAQVAAYGFAE 891 >UniRef50_A2F153 Immuno-dominant variable surface antigen-like n=1 Tax=Trichomonas vaginalis RepID=A2F153_TRIVA Length = 1148 Score = 174 bits (441), Expect = 2e-41, Method: Composition-based stats. Identities = 69/396 (17%), Positives = 117/396 (29%), Gaps = 36/396 (9%) Query: 1083 MQSTGLWAPAQKEVTIKSNANVPVTVTVALADDLTGREKHEVALNRPPRVTKTYSLDASG 1142 ST L+A + +T + + + L + R P + +++L+A Sbjct: 141 YHSTTLYAMPGELITFEIPEFAVGKLNLVLNRQ---APSNGDLTQRYPNLQCSFTLNAKK 197 Query: 1143 TVKFKVPYGGLIYIKG--NSSTNESASFTFTGVVKAPFYKDG-----AWKN-DLNSPAPL 1194 F P GG + IK ++ S TG ++ P+++ G W+ N APL Sbjct: 198 VT-FGYPLGGYMDIKCWIDTFPLHSVEINITGGIRIPYFRYGAESDQDWEEETRNYVAPL 256 Query: 1195 GELESDAFVYTTPKKNLNASNYTGGLEQFANDLDTFASSMNDFYGRDS-EDGKHRMFTYK 1253 ++ P F + S+N+ D +DG+ + + Sbjct: 257 TFYDTGNVKCIFPSTFSRNQIRMSDAGAFWRTVGRVMYSVNEVTNYDRRKDGRIKTAMWF 316 Query: 1254 NLPGHKHRFTNDVQISIGDAHSGYPVMNSSFSPNSTTLPTTPLNDWLIWHEVGHNAAETP 1313 N + + + S+S W HE GH+ Sbjct: 317 NFDSYVPAGAAVAFVGANFIQA-----PFSWSTAMINYEGAKWGCWGNVHEYGHHFQSG- 370 Query: 1314 LTVPGATEVANNVLALYMQDRYLGKMNRVADDITVAPEYLEESNNQAWARGGAGDRLLM- 1372 + G E NNV+ IT+ +N A+ G+ Sbjct: 371 WGISGTGETTNNVINFITYAMLTEIDAT--RQITLGGASFNAANGWAYITHEFGNMDATK 428 Query: 1373 -YAQLKEWAEKN---FDIKKWYP--DGTPLPEFY-SEREGMKGWNLFQLMHRKARGDEVS 1425 YA W N F + W +Y G LMH Sbjct: 429 DYAGPFFWYGNNAYFFGLDAWRKCLRAHTHQIYYKRSDYGTYTSEF--LMHCAKFFHRDL 486 Query: 1426 NDKFGGKNYCAESNGNAADTLMLCASWVAQTDLSEF 1461 F + S + C + ++Q L EF Sbjct: 487 RAYFKTFEFPEASQISD-----RCNNELSQMKLKEF 517 >UniRef50_A2F335 Immuno-dominant variable surface antigen-like n=2 Tax=Trichomonas vaginalis RepID=A2F335_TRIVA Length = 1247 Score = 172 bits (435), Expect = 1e-40, Method: Composition-based stats. Identities = 63/433 (14%), Positives = 125/433 (28%), Gaps = 65/433 (15%) Query: 1048 KVDVEKYPGAVS--EEGQNVTETISLYSNPTKWFAGNMQSTGLWAPAQKEVTIKSNANVP 1105 ++++ G + + + TG++ P + ++I N Sbjct: 101 PAGIKQFNGNETWINNATRHKYRFLINNKR-----MGYHPTGIYVPPGEVISIDIPGNTI 155 Query: 1106 VTVTVALADDLTGREKHEVALNRPPRVTKTYSLDASGTVKFKVPYGG-LIYIKGNSSTNE 1164 + V + + L R + ++L S +F PYGG L + Sbjct: 156 KRIVVQFNHHTHDQNNYNTRLGR---LKCRFTL-NSQHTEFAWPYGGNLDLVTYQDEFPL 211 Query: 1165 SASFTFTGVVKAPFYKDG-----AWKNDLNS-PAPLGELESDAFVYTTPKKNLNASNYTG 1218 +G ++ P + G W+ DL APL ++ F+ P N+ + Sbjct: 212 GFEVNISGGIRMPHFIYGVNTDEEWETDLRKLAAPLTTFDTGTFLARMPTHNIRGAVCVN 271 Query: 1219 GLEQFANDLDTFASSMNDFYGRDSEDGKHRMFTYKNLPGHKHRFTNDVQISIGDAHS--- 1275 +F + + +N+ + DG + D + G A + Sbjct: 272 DGMRFWQTVSWNSYDVNEVTSVNRRDGN---------VIRPLFYNFDSYVPAGAAVAFVG 322 Query: 1276 GYPV-MNSSFSPNSTTLPTTPLNDWLIWHEVGHNAAETPLTVPGATEVANNVLALYMQDR 1334 G + S++ + W + HE H+ EV NNVL L Sbjct: 323 GNFIQAPPSWAGGIVNYDSAKWGCWGLLHEYHHHFQSG-WGFANPGEVTNNVLNLIDMVL 381 Query: 1335 YLGKMNRVADDITVAPEYLEESNNQAWARGGAGDRLLMYAQLKEWAEKNFDIKKWYPDGT 1394 + + N +R+ + + Sbjct: 382 ICEIDSTRQVNF----------NGDYVGNKDGWNRM--------------SYQYNTINNG 417 Query: 1395 PLPEFYSEREGMKGWNLFQLMHRKARGDEVSNDKFGGKNYCAESNGNAADTLMLCASWVA 1454 L FY G L + + K+Y G +L + V Sbjct: 418 DLLSFYGNNVYYFGSEL---------QRKCLKEHITQKHYKRNEVGGYVSEFLLTCAKVF 468 Query: 1455 QTDLSEFFKKWNP 1467 + D+ +F +++ Sbjct: 469 KRDMRAYFSRFSF 481 >UniRef50_B2V178 Fibronectin type III domain protein n=2 Tax=Clostridium botulinum E RepID=B2V178_CLOBA Length = 1886 Score = 171 bits (434), Expect = 2e-40, Method: Composition-based stats. Identities = 86/668 (12%), Positives = 173/668 (25%), Gaps = 164/668 (24%) Query: 942 SVDLERLYQNMSVWLWNDTSYRYEEGKNDELGFKTFTEFLNCYANDAYAGGTKCSADLKK 1001 V+L + +V + Y Y+ + D N + +D + Sbjct: 888 QVNLSNVSAGTNVTVSELKFYHYDSLETD---------VKNLFTDDMRIELK--DDVTLE 936 Query: 1002 SLVDNNMIYGDGSSKAGMMNPS-----YPLNYMEKPLTRLMLGRSWWDLNIKVDVEKYPG 1056 + + + +P Y LN + L Sbjct: 937 EITELENRANTLDDISKEYHPQRDNILYDLNTAKSILEDR-------------------- 976 Query: 1057 AVSEEGQNVTETISLYSNPTK-----WFAGNMQSTGLWAPAQKEVTIKSNANVPVTVTVA 1111 + +T ++ ++ + Q G+ A A E++I + V V Sbjct: 977 ---NIAETITLDQNINNSRNSHLGFAMQLNDYQPLGISARAGDEISIYVGTDGNVMPEVI 1033 Query: 1112 LADDLTGREKHEVALNRPPRVTKTYS-LDASGTVKFKVPYGGLIYIKGNSS--TNESASF 1168 + + +TK + ++ GG +Y++ + TN Sbjct: 1034 FTQYYPESSQWTTTVK---SLTKGKNVIEVPKIGSMATERGGSVYVRYPRNSATNNEIKI 1090 Query: 1169 TFTGVVKAPFYKDGA------------------------------------WKNDLNSPA 1192 +G K P+ + + Sbjct: 1091 RVSGGTKIPYLNLSNIVDEEISKKEIEKYITTLEEFNEKLPTYYEDANRLMFNKTRENKN 1150 Query: 1193 PL-----------GELESDAFVYTTPKKNLN------------ASNYTGGLEQFANDLDT 1229 E+ +D F+ T P + + +L Sbjct: 1151 LYKFDEKTSVLNSTEIVTDKFLLTLPATEVLRGINLDASTLSDKIDKVYDALLAWEELGD 1210 Query: 1230 FASSMNDFYGRDSEDGKHRMFTY---KNLPGHKHRFTNDVQISIGDAH--SGYPVMNSSF 1284 A + Y + ++ + ++P + S + SG+ + Sbjct: 1211 IAYGVKGLYENPDLNNDGKVDSSEIKHSMPSSRLNIRYTRMFSGAFMYASSGHIGIEFGS 1270 Query: 1285 SPNSTTLPTTPLND-----------WLIWHEVGHNAAETPLTVPGATEVANNVLALYMQD 1333 N W I HE+GH E E NN+++L Q Sbjct: 1271 VAPLLNGKPYTKNSDNDITAYNYYGWGIAHEIGHVIDEGNAIY---GETTNNIISLMAQT 1327 Query: 1334 RYLGKMNRVADDITVAPEYLEESNNQAWARGGAGDRLLMYAQLKEWAEKNFDIKKWYPDG 1393 ++R+ Y + ++ L M+ QL + I + Sbjct: 1328 IDDKALSRLESSNLYPKIYEKVNSGSIGVASNVFVSLGMFWQLHLAYDNEASINQEDS-- 1385 Query: 1394 TPLPEFYSEREGMKGWNLFQLMHRKARGDEVSNDKFGGKNYCAESNGNAADTLMLCASWV 1453 + ++R R + S + L+ AS Sbjct: 1386 -----------------FYAKLNRLYRENTESTPNV----------QAKDNLLIRLASDA 1418 Query: 1454 AQTDLSEFFKKWNPGANAYQLPGASEMSFEGG------VSQSAYNTLASLDLPKPEQGPE 1507 AQ DL+EFFK+W AN + + +E ++ A + S + + + Sbjct: 1419 AQKDLTEFFKRWGLIANNDTITYLASKGYEKENKAIYYMNDDARRKVLS-GISQMSPNTK 1477 Query: 1508 TINQVTEH 1515 I +T Sbjct: 1478 VIANLTYD 1485 >UniRef50_A5ZG25 Putative uncharacterized protein n=1 Tax=Bacteroides caccae ATCC 43185 RepID=A5ZG25_9BACE Length = 886 Score = 171 bits (433), Expect = 2e-40, Method: Composition-based stats. Identities = 64/478 (13%), Positives = 132/478 (27%), Gaps = 71/478 (14%) Query: 1071 LYSNPTKWFAGNMQSTGLWAPAQKEVTIKS---NANVPVTVTVALADDLTGREKHEVALN 1127 TK ++ TG++ +E+ + V++ +++ Sbjct: 344 ATRLQTKKYSNLDNPTGIYVNKDEEIIVLVGNIPEGQKVSLQCIWEENVYTSGSSTDYYK 403 Query: 1128 RPPRVTKTYSLDASGTVKFKVPYGGLIYIKGN--------SSTNESASFTFTGVVKAPFY 1179 + TYSL+ G K+ G +++ N + V F+ Sbjct: 404 QTQATGTTYSLE-EGVNLLKMQGPGQLFVMHNVDGEQLLNNPAPIKIHIPLGHGVVNGFF 462 Query: 1180 KDGAWKN-------DLNSPAPLGELESDAFVYTTPKKNLNASNYTGGLEQFANDLDTFAS 1232 K + + + ++ + + + + + D Sbjct: 463 DLEEHKTDAKYAELISKATHKYFCVRGERMMFYFHRLKMLDAA-PTEILSAIHLWDDIVG 521 Query: 1233 SMNDFYGRDSE--DGKHRMFTYKNLPGHKHRFTNDVQISIGDAHSGYPVMNSSFSPNSTT 1290 G DGK + P + + +D ++ + ++ + Sbjct: 522 WEQSLMGISQYRQDGKINNHMFAISPEGSYMWASDYRMGFVYTYLKNILLRENVMAA--- 578 Query: 1291 LPTTPLNDWLIWHEVGHNAAETPLTVPGATEVANNVLALYMQDRYLGKMNRVADDITVAP 1350 N W HE+GH + P +TE +NN+ + Y+ R GK +T Sbjct: 579 ----EDNAWGPAHEMGHVHQ-AAINWPSSTESSNNLFSNYVIRRL-GKYKSRGRGLTSLA 632 Query: 1351 EYLEESNNQAWARG-------GAGDRLLMYAQLKEWAEKNFDIKKWYPDGTPLPEFYSER 1403 + W G + M QL + + Sbjct: 633 NAIYRDKQVWWNMGTSTHQNEDTEIHMRMNWQLWIYYD----------------LCKGNE 676 Query: 1404 EGMKGWNLFQLMHRKARGDEVSNDKFGGKNYCAESNGNAADT-LMLCASWVAQTDLSEFF 1462 + K W + R ES+ A + AQ DL++FF Sbjct: 677 QEAKFW---PKVFDIMR---------TTYKNVPESDPGARQLAFVKAVCEAAQEDLTDFF 724 Query: 1463 KKWNPGANAYQLPGASEMSFEGGVSQSAYNTLASL--DLPKPEQGPETINQVTEHKMS 1518 + W + ++ V+ + P+ P I + + K+S Sbjct: 725 ETWGFFKTVDNVKVEQYGTWTYTVTDKMIADTKAWIKTQNYPKAAP--IQYIEDRKIS 780 >UniRef50_A2EKB8 Putative uncharacterized protein n=1 Tax=Trichomonas vaginalis RepID=A2EKB8_TRIVA Length = 685 Score = 170 bits (431), Expect = 3e-40, Method: Composition-based stats. Identities = 52/308 (16%), Positives = 106/308 (34%), Gaps = 39/308 (12%) Query: 994 KCSADLKKSLVD-----NNMIYGDGSSKAGMMN--PSYPLNYMEKPLTRLMLGRSWWDLN 1046 C + K L+D + G ++ G++ P +PL + LT L ++ Sbjct: 218 VCDNNFKDELIDLLNTSWEFLKRTGYNENGLICTKPCHPL--IAILLTDLYTKVPPENVV 275 Query: 1047 IKVDVEKYPGAVSEEGQNVTETISLYSNPTKWFAGNMQSTGLWAPAQKEVTIKSNANVPV 1106 + + +PGA +S + + STGL+ PA I+ + +P Sbjct: 276 AIPEYKDFPGAT------GNVELSNFEEHLELGPEMWVSTGLYLPAGVIGEIEISEPMPD 329 Query: 1107 TVTVALADDLTGREKHEVALNRPPRVTKTYSLDASGTVKFKVPYGGLIYIKGNSSTNESA 1166 V + + R P + L K P+GG++Y+ N T+E Sbjct: 330 -VHIHIGCHHESLVPKSPPWKRWPLTVCVFPL-TEKVTKVVSPFGGIVYVAMNIETDEPV 387 Query: 1167 --SFTFTGVVKAPFYKDGA---WKNDLNSPAPLGELESDAFVYTTPKKNLNASNYTGGLE 1221 + F+ + P + W+ N P GE+ ++ + T + + G Sbjct: 388 RITVKFSNFCRHPVAQFDDSSVWEMTKNFEVPCGEIVAENLIITLSSQKMREI---GNFS 444 Query: 1222 QFANDLDTFASSMNDFYGRDSEDGKHRMFTYKNLPGHKHRFTNDVQISIGDAHSGYPVMN 1281 + + + S +++ +RF D+++ + GYP++ Sbjct: 445 KIFDIFNKIISRLSE--------------NLSYPITRPYRFVFDIELPDDEPSYGYPLVF 490 Query: 1282 SSFSPNST 1289 + Sbjct: 491 LEDDIDLI 498 >UniRef50_A2FU45 Immuno-dominant variable surface antigen-like n=1 Tax=Trichomonas vaginalis RepID=A2FU45_TRIVA Length = 669 Score = 169 bits (427), Expect = 8e-40, Method: Composition-based stats. Identities = 75/474 (15%), Positives = 144/474 (30%), Gaps = 80/474 (16%) Query: 1048 KVDVEKYPGAVSEEGQNVTETISLYSNPTKWFAGNMQSTGLWAPAQKEVTIKSNANVPVT 1107 + + G + + Y+N G TGL+ P + +TI+ Sbjct: 103 PAGLRQVYGEPHDLDNQTRYQATFYANSR---YGYNYITGLYVPPGEVITIELPFFFSTP 159 Query: 1108 VTVALADDLTGREKHEVALNRPPRVTKTYSLDASGTVKFKVPYGGLIYIKGNSSTNE-SA 1166 VT+ + + + +R PR++ ++ G F PYGG + ++ + Sbjct: 160 VTILINRHIESFNNNNKPEDRLPRLSCGITIKTQGPHYFAWPYGGSLELRYDIDKFTFGL 219 Query: 1167 SFTFTGVVKAPFYKDG-----AWKNDLNS-PAPLGELESDAFVYTTPKKNLNASNYTGGL 1220 +G VK P G + N++ AP+ ++ T P K N Sbjct: 220 PIKISGCVKMPTLIYGRDSEEDYMNEIRYLKAPVTVFDTGNLKITMPSKLARNPNRLYDT 279 Query: 1221 EQFANDLDTFASSMNDFYGRDSEDGKHRMFTYKNLPGHKHRFTNDVQISIGDAH----SG 1276 F S N+ G +F D +S G A+ S Sbjct: 280 LHFWQGAGRVLYSFNEVQGMPRRSDGRVKV--------PVQFNIDRYVSHGLAYCQQGSY 331 Query: 1277 YPVMNSSFSPNSTTLPTT-PLNDWLIWHEVGHNAAETPLTVPG---ATEVANNV-----L 1327 Y + F+ T + W HE HN + +E+ NV Sbjct: 332 YIQAPTGFAGAFTDFESVFSDGCWGPIHEYAHNFQQ-NWGFGWFYSYSEITVNVPNYISY 390 Query: 1328 ALYMQDRYLGKMNRVADDITVAPEYLEESNNQAWARGGAGDRLLMYAQLKEWAEKNFDIK 1387 +L + R+ + + ++ N ++ G+ L YA + + + Sbjct: 391 SLMT-TIDSARQARIDGNYIMKSDW--NYNTHIYSTIGSTALLFFYAN-NLYHFGSAKQR 446 Query: 1388 KWYPDGTPLPEFYSEREGMKGWNLFQLMHRKARGDEVSNDKFGGKNYCAESNGNAADTLM 1447 + D + + G G + Sbjct: 447 QSLHDHIFQTTYKRDTYGYYGE------------------------------------YL 470 Query: 1448 LCASWVAQTDLSEFFKKWNPGANAYQLPGASEMSFEGGVSQSAYNTLASLDLPK 1501 L S + D+ +FK ++ E ++ ++Q+ + L SL+L + Sbjct: 471 LHCSTLFNRDMRPYFKTFS--------NNEFEFNYSYHINQATEDMLDSLNLRE 516 >UniRef50_C1I981 Leucine rich repeat domain-containing protein n=1 Tax=Clostridium sp. 7_2_43FAA RepID=C1I981_9CLOT Length = 2664 Score = 167 bits (422), Expect = 3e-39, Method: Composition-based stats. Identities = 73/454 (16%), Positives = 133/454 (29%), Gaps = 86/454 (18%) Query: 1062 GQNVTETISLYSNPTKWFAGNMQSTGLWAPAQKEVTIKSNANVPVTVTVALADDLTGREK 1121 Q+ N F N Q TG+ A E+T+ +A+ + + G Sbjct: 951 EQHGDMVAHANRNLKFGFGNNNQPTGISAKPGDEITVYVDADPSQPMPKLVFSQQEGSFA 1010 Query: 1122 HEVALNRPPRVTKTYSLDASGTVKFKVPY------GGLIYIKGN---SSTNESASFTFTG 1172 + R + ++ + Y GG IYI ++ F Sbjct: 1011 N---WMRTVNLNPGKNVITVPDIAVDNWYRHDVTRGGSIYILNPYTSEQQPKTPVIRFAS 1067 Query: 1173 VVKAPF----------------YKDGAWKNDLNSPA-------PLGELESDAFVYTTPKK 1209 K PF YK ++ +P + E SD V+T Sbjct: 1068 GDKYPFLTADTNVEEFKEFLIEYKKAIDEDIAKNPNVLDREVLDVFEFVSDHIVWTGTAT 1127 Query: 1210 NLNA--SNYTGGLEQFANDLDTFASSMNDFYGRDSEDGKHRMFTYKNLPGHKHRFTNDVQ 1267 N + + +YG D + ++ + F Sbjct: 1128 GAYKAYIEKGANPLDTVNRYNNHMKELFKYYGLDGSNEQNDPKYIRENVRLAQPF----- 1182 Query: 1268 ISIGDAHSGYPVMNSSFSPNSTTLPTTPLNDWLIWHEVGHNAAETPLTVPGATEVANNVL 1327 G ++ + + T W + HE+GH + EV NN+L Sbjct: 1183 ---GYMYAY--TNHIGVQGDVMTSLLVGEPGWGLDHEIGHRMDVSTRLY---GEVTNNML 1234 Query: 1328 ALYMQDRYLGKMNRVADDITVAPEYLEESNNQAWARGGAGDRLLMYAQLKEWAEKNFDIK 1387 +YM Y NR+ + + + E++N+ + G + L +Y QL+ + Sbjct: 1235 PMYMSVYYNKIDNRIPFENKIYKNVISENSNKY-SEGELAENLAVYWQLEMYKPGY---- 1289 Query: 1388 KWYPDGTPLPEFYSEREGMKGWNLFQLMHRKARGDEVSNDKFGGKNYCAESNGNAADTLM 1447 W ++R+ R + ++ + L+ Sbjct: 1290 ---------------------WGNLNKLYRE-RNVNLGSENP---------DNIKMQYLV 1318 Query: 1448 LCASWVAQTDLSEFFKKWNPGANAYQLPGASEMS 1481 +S V DLSE+F + N S+ Sbjct: 1319 KFSSEVIGEDLSEYFARHGFEVNEETRQETSKYP 1352 >UniRef50_B0A9L1 Putative uncharacterized protein n=1 Tax=Clostridium bartlettii DSM 16795 RepID=B0A9L1_9CLOT Length = 2011 Score = 165 bits (417), Expect = 1e-38, Method: Composition-based stats. Identities = 72/521 (13%), Positives = 150/521 (28%), Gaps = 116/521 (22%) Query: 1057 AVSEEGQNVTETISLYSNPTKWFAGNM-----QSTGLWAPAQKEVTIKSNANVPVTVTVA 1111 ++ +T+ +S + QSTG+ A + I A+ + Sbjct: 291 DFTDRTFTLTQNGHTHSKSRNVLRMSRLGTDLQSTGIVARPGQVFKIFVEADSNTKLPQI 350 Query: 1112 LADDLTGREKH---EVALNRPPRVTKTYSLDASGTVKFKVPYGGLIYIKGNSSTNESAS- 1167 + G H E L + V + + + K GG +Y+ + +E Sbjct: 351 VFTQQEGHFSHWQKEYQLKKGLNVITVPEIYSDSWSQ-KSVKGGAVYLMNRYTADEQGKA 409 Query: 1168 --FTFTGVVKAPFYKDGAWKNDL---------------NSPAPLGELESDAFVYTTPKKN 1210 G + P Y +G K+ ++ + E + +YT K Sbjct: 410 PVVRIDGGEEFPLYNEGDDKDAFLEKLKAYKEKLDKNPDTTVDIFEFNTKRLLYTGTAKA 469 Query: 1211 LNASNYTG--GLEQFANDLDTFASSMNDFYGR--DSEDGKHRMFTYKNLPGHKHRFTNDV 1266 + + + DF G D D + + + Sbjct: 470 AYQVYVKEGVDVGESIQVWNDKIQEAFDFAGLKDDPSDPTNDSTNVRTTIRLMQPY---- 525 Query: 1267 QISIGDAHSGYPVMNSSFSPNSTTLPTTPLN----DWLIWHEVGHNAAETPLTVPGATEV 1322 G A++ Y + L T + W + HEVGH ++ E+ Sbjct: 526 ----GAAYAAYGHVGIQRGIQEIALRTDKDSINSILWGMVHEVGHQM---DISEREWGEI 578 Query: 1323 ANNVLAL--YMQDRYLGKMNRVADDITVAPEYLEESNNQAWARGGAGDRLLMYAQLKEWA 1380 NN+ A Y+ + ++ ++AP+ + + RL M+ QL Sbjct: 579 TNNMFANNAYINNGAGDRVPYSQIQTSLAPDDASTN----FDNLDYSQRLGMFWQLHLKD 634 Query: 1381 EKNFDIKKWYPDGTPLPEFYSEREGMKGWNLFQLMHRKARGDEVSNDKFGGKNYCAESNG 1440 + + + R + S + Sbjct: 635 NT----------------------------YWAEVEKLYRKRKPSV----------SNEQ 656 Query: 1441 NAADTLMLCASWVAQTDLSEFFKKWNPGANAYQLPGASEMS--------------FEG-- 1484 DT AS V +L++ F+K+ + + +EG Sbjct: 657 AKRDTFAKYASEVLNMNLTKHFEKYGFNLSESCKKELEKYPDGQKTWYLDSRALTYEGNG 716 Query: 1485 ----------GVSQSAYNTLASLDLPKPEQGPETINQVTEH 1515 +S+S+ S+++P+ ++ ++ ++ Sbjct: 717 FEDKDTGLDVSLSKSSSGIRLSMNMPQDKRDDLLGYEIIKN 757 >UniRef50_C8WI41 Coagulation factor 5/8 type domain protein n=1 Tax=Eggerthella lenta DSM 2243 RepID=C8WI41_EGGLE Length = 1787 Score = 165 bits (416), Expect = 2e-38, Method: Composition-based stats. Identities = 80/598 (13%), Positives = 154/598 (25%), Gaps = 119/598 (19%) Query: 958 NDTSYRYEEGKNDELGFKTFTEFLNCYANDAYAGGTKCSADLKKSLVDNNMIYGDGSSKA 1017 + Y+ ++D + YA+D + + + D + Sbjct: 789 EMRFHAYDSLESD---------IMALYADDLHLELKDDVTSAAVDELQQRLDTPD--PAS 837 Query: 1018 GMMNPSYPLNYMEKPLTRLMLGRSWWDLNIKVDVEKYPGAVSEEGQNVTETISLYSNPTK 1077 G NP +E R +L + ++V ++ + Sbjct: 838 GEFNPYRVELQVELDNARKLLATQGLEGTVRVHNGI-------SSARDNRSLGISG---- 886 Query: 1078 WFAGNMQSTGLWAPAQKEVTIKSNANVPVT-----VTVALADDLTGREKHEVALNRPPRV 1132 Q G ++ + + A VT + + ++ + Sbjct: 887 --LNAWQPLGAVVAEGDQIVVYTGAKGAVTGKEAPLRLVVSQQHPESSNVSKTI--ATLK 942 Query: 1133 TKTYSLDASGTVKFKVPYGGLIYIKG-NSSTNESASFTFTGVVKAPFYKDGA-------- 1183 + V +GG +Y++ + +G P Sbjct: 943 VGRNEITIPSLSSLDVEHGGQLYVEYTGDNDAADWGVRVSGAQAVPVLDLYQVDDPAERL 1002 Query: 1184 -----WKNDLNSPAPLGE------------------------------LESDAFVYTTPK 1208 + L + P E + D +Y+ P Sbjct: 1003 ARTTAYVQALEAYVPALEESHGKLHGAGGNAAVRYGYDPKNCVLNATDVMLDQMMYSVPA 1062 Query: 1209 KNLNAS----NYTGGLEQFANDLDTFASSMNDFY-----------GRDSEDGKHRMFTYK 1253 + + A + D M FY G D+ K + + Sbjct: 1063 QQMLAGAGSGTADERAARLLASFDAMDQMMELFYQHKGLADSFDAGTDAAVIKSNLLPSQ 1122 Query: 1254 NLPGHKHRFTNDVQISIGDAHSGYPVMNSSFSPNSTTLPTTPLND--------WLIWHEV 1305 +L R + H G + + + + W I HE+ Sbjct: 1123 HLNIRYTRMFAGAFMYAAGNHIGIEWDSVPGLGKGSPVAVDGDGEKASGSYFGWGIAHEI 1182 Query: 1306 GHNAAETPLTVPGATEVANNVLALYMQDRYLGKMNRVADDITVAPEYLEESNNQAWARGG 1365 GHN + +EV NN A Q R + D + G Sbjct: 1183 GHNINQAQYAY---SEVTNNYFAQLSQTDGTSASARFSYDEVYDRVTSGDEGR----TGS 1235 Query: 1366 AGDRLLMYAQLKEWAEKNFDIKKWYPDGTPLPEFYSEREGMKGWNLFQLMHRKARGDEVS 1425 +L MY QL+ + + Y + F + AR + + Sbjct: 1236 VFTQLAMYWQLRLAYDAGGAYQ-----------LYDTYQQAFDNRFFARVDSYARAPKTA 1284 Query: 1426 NDKFGGKNYCAESNGNAADTLMLCASWVAQTDLSEFFKKWNPGANAYQLPGASEMSFE 1483 G + G ++ AS A+ DL++FF++W A+ S+ E Sbjct: 1285 PAPEGTELVL---GGGEKQNIIRLASAAAERDLTDFFQRWGFTADEATKAYVSQFPAE 1339 >UniRef50_C9KXI1 Putative uncharacterized protein n=1 Tax=Bacteroides finegoldii DSM 17565 RepID=C9KXI1_9BACE Length = 705 Score = 164 bits (415), Expect = 2e-38, Method: Composition-based stats. Identities = 86/508 (16%), Positives = 150/508 (29%), Gaps = 69/508 (13%) Query: 1006 NNMIYGDGSSKAGMMNPSYPLNYMEKPLTRLMLGRSWWDLNIKVDVEKYPGAVSE-EGQN 1064 DG G P ++ D+++ +PG V + + Sbjct: 33 YKYDITDGIDNNGASEPDITIDTE-----------GGIDVSMYEKARIFPGLVDTAKEER 81 Query: 1065 VTETISLYSNPT---------KWFAGNMQSTGLWAPAQKEVTIKSNANV-PVTVTVA-LA 1113 + TI L N + STGL+A A ++V I N ++V V Sbjct: 82 INATIELDLNKQYIDSITLGVSKVPQPIYSTGLYAGAGEQVAITVEDNTMGLSVIVGSHM 141 Query: 1114 DDLTGREKHEVALNRPPRVTKTYSLDASGTVKFKVPYGGLIYIK--GNSSTNESASFTFT 1171 DDLT R P V +L G K P GG I+IK + + + S Sbjct: 142 DDLTELA----PYQRMPLVYVAKAL-FPGKNIIKNPLGGPIWIKKSLGLNASGTCSLKIE 196 Query: 1172 GVVKAPFYKDGA-----WKNDLN-SPAPLGELESDAFVYTTPKKNLNA--SNYTGGLEQF 1223 GV K+P + G WK ++ + P E+ F +T K + + + L++ Sbjct: 197 GVYKSPDFVIGETDLQDWKRRISETTVPWLEIRGKHFAFTVQKDRVLDNLESISSTLQEV 256 Query: 1224 ANDLDTFASSMN-DFYGRDSEDGKHRMFTYKNLPGHKHRFTNDVQISIGDAH---SGYPV 1279 + D ++YG + + P R DVQ+ +G+ + S Y + Sbjct: 257 GKEWDEAIEEFFFEYYGLKIDKDAEEK---ERAPEFPFRVVLDVQV-LGNLYLRNSDYAI 312 Query: 1280 M---NSSFSPNSTTLPTTPL-NDWLIWHEVGH--NAAETPLTVPGATEVANNVLALYMQD 1333 + N+ L T N + + P N + LY Sbjct: 313 VAINNTYMLEEMLNLRTLRTGNSVALLSAINSMCTYRSRNNPWPADYRAVANAIPLYRIG 372 Query: 1334 RYLGKMNRVADDITVAPEYLEESNNQAWARGGAGDRLLMYAQLKEWAEKNFDIKKWYPDG 1393 + +I E + +A A + K + Sbjct: 373 KKNFSKENAFGEIFPGEENITTLFPKAIEYAMADSSKWAKEDAATKYDDKTAYKAFDLLS 432 Query: 1394 TPLPEFYSEREGMKGWNLFQLMHRKARGDEVSNDKFGGKNYCAESNGNAADTLMLCASWV 1453 Y W + ++ KA+ + + + + Sbjct: 433 LIQLANYDNNN----WEAMEYLNLKAKEERSI-------------DNSTLSYVFRRLCDY 475 Query: 1454 AQTDLSEFFKKWNPGANAYQLPGASEMS 1481 + +L FF W A + Sbjct: 476 FKQNLCPFFDYWGVEQLDEDRKYAEQYP 503 >UniRef50_UPI0001C36412 coagulation factor 5/8 type domain protein n=1 Tax=Clostridium hathewayi DSM 13479 RepID=UPI0001C36412 Length = 1749 Score = 164 bits (415), Expect = 2e-38, Method: Composition-based stats. Identities = 78/591 (13%), Positives = 162/591 (27%), Gaps = 105/591 (17%) Query: 954 VWLWNDTSYRYEEGKNDELGFKTFTEFLNCYANDAYAGGTKCSADLKKSLVDNNMIYGDG 1013 + + Y Y+ ++ E + +A+D + + ++S+ Sbjct: 754 ISIAEMKFYYYDPIEH---------EVYDLFADDMHLSLKEGV--TQESIDSLRARLEVK 802 Query: 1014 SSKAGMMNPSYPLNYMEKPLTRLMLGRSWWDLNIKVDVEKYPGAVSEEGQNVTETISLYS 1073 +G ++P + E +L D+ V+++ + T Sbjct: 803 DEVSGELHPKKEILERELTTAEQILKD-----EALADILTIDNRVTKKAD-GSITF---- 852 Query: 1074 NPTKWFAGNMQSTGLWAPAQKEVTIKSNANVPVTVTVALADDLTGREKHEVA-LNRPPRV 1132 Q G+ A A V + + T + + E + ++ + Sbjct: 853 ---GGGLNAWQPLGVTAMAGDTVMVYVGSPEKKTGDSTNLRLIATQYHGESSAWSKTIGM 909 Query: 1133 TKT--YSLDASGTVKFKVPYGGLIYIKG-NSSTNESASFTFTGVVKAPFYK--------- 1180 K + V GG +Y++ + E S +G + P Sbjct: 910 LKAGINEITIPKITDLDVEQGGQLYVEYTGAQGKERYSVRVSGGHQIPTLNVTMASDSNA 969 Query: 1181 ------------------------------DGAWKNDLNSPAPLG-ELESDAFVYTTPKK 1209 DG W + + ++ + +++ + Sbjct: 970 ARALVTKYVEELEATAANLETEHESHKKAHDGDWSSAKKNCILGATDIVTKYMMFSVSSQ 1029 Query: 1210 NLNASNYTGGLEQFANDLDTFASSMNDFYGR---------DSEDGKHRMFTYKNLPGHKH 1260 + A G +E+ A L ++ ++ D + G+ L Sbjct: 1030 QILAGLSGGTVEEKAEQLYQSLTAADEMVNLFYQHKGLSSDPDAGEKNKLPVSRLNLRYQ 1089 Query: 1261 RFTNDVQISIGDAHSGYPVMNSSFSPNSTTLPTTPLND--------WLIWHEVGHNAAET 1312 R + G H G + + P W I HE+GH E Sbjct: 1090 RMFAGAFMYAGGLHIGIEWGSIPGLTRGVPVKADPKGRYESGQYFGWGIAHEIGHEINEG 1149 Query: 1313 PLTVPGATEVANNVLALYMQDRYLGKMNRVADDITVAPEYLEESNNQAWARGGAGDRLLM 1372 + E+ NN ++ Q R ++ +L + Sbjct: 1150 AYAIA---EITNNYFSVLAQAHDTNDSVRFQYPEVYKKV----TSGVTGRSSNVFTQLGL 1202 Query: 1373 YAQLKEWAEKNFDIKKWYPDGTPLPEFYSEREGMKGWNLFQLMHRKARGDEVSNDKFGGK 1432 Y QL + + Y + F + R E + K GG Sbjct: 1203 YWQLHLAYDMG----------GYNYKTYDKYRDQFNNLFFARVDSYVRNTEAA-PKPGGV 1251 Query: 1433 NYCAESNGNAADTLMLCASWVAQTDLSEFFKKWNPGANAYQLPGASEMSFE 1483 + + + LM A A+ ++ EFF++W + A + E Sbjct: 1252 SLSLSGDVDNK--LMRLACAAAEKNILEFFERWGMVPDETTKKYAQQFEKE 1300 >UniRef50_B0A9L0 Putative uncharacterized protein n=1 Tax=Clostridium bartlettii DSM 16795 RepID=B0A9L0_9CLOT Length = 1263 Score = 163 bits (413), Expect = 4e-38, Method: Composition-based stats. Identities = 74/477 (15%), Positives = 135/477 (28%), Gaps = 96/477 (20%) Query: 1048 KVDVEKYPGAVSEEGQNVTETISLYSN-PTKWFAGNMQSTGLWAPAQKEVTIKSNANVPV 1106 E QN + ++ +F + QSTG+ A + T+ A+ Sbjct: 58 SKSNESNSERTFTLAQNGNISWKARNDLRMTYFGTDYQSTGIVARPGETFTVYVEADEGA 117 Query: 1107 TVTVALADDLTGREKHEVALNRPPRVTKTYSLDASGTVKFKVP------------YGGLI 1154 + + R Y L G VP GG + Sbjct: 118 PMPKIAFSQHEALYSN---WVRW------YDLK-PGKNVITVPEIYDNSWSNKTVKGGAV 167 Query: 1155 YIKGNSSTNESAS---FTFTGVVKAPFYKDGAWK---------------NDLNSPAPLGE 1196 Y+ + E T G P Y +G K D + L E Sbjct: 168 YLLNRYTAKEQGKAPVVTIEGGETFPIYNEGDDKAAFIEKLKAYKQKLDQDPENTVDLFE 227 Query: 1197 LESDAFVYTTPKKNLNASNYTG--GLEQFANDLDTFASSMNDFYGR--DSEDGKHRMFTY 1252 ++ +YT + + A +L++ M DF G D D + Sbjct: 228 FNTERLLYTGTASAAYKVYVEEGVDVGESAANLNSQVQEMFDFSGLKNDPTDPNNDSTNV 287 Query: 1253 KNLPGHKHRFTNDVQISIGDAHSGYPVMNSSFSPNSTTLPTTPLND--WLIWHEVGHNAA 1310 K + + A+ + + + P LN W HEVGH Sbjct: 288 KTTIRLMQPY------GLAYAYVDHVGIQRDYEPGMLRTDQESLNSVLWATVHEVGHQM- 340 Query: 1311 ETPLTVPGATEVANNVLALYMQDRYLGKMNRVADDITVAPEYLEESNNQAWARGGAGDRL 1370 + E+ NN+ + Y ++ G +RV + E+S+ +++ G RL Sbjct: 341 --DIMGRDWPEITNNMWSNYAHIKH-GMNDRVPYNDIYNDLAPEDSH-KSFDDLGYFQRL 396 Query: 1371 LMYAQLKEWAEKNFDIKKWYPDGTPLPEFYSEREGMKGWNLFQLMHRKARGDEVSNDKFG 1430 M+ QL+ + + + R + S Sbjct: 397 GMFWQLQLKKDT----------------------------YWTELETLYRERKQS----- 423 Query: 1431 GKNYCAESNGNAADTLMLCASWVAQTDLSEFFKKWNPGANAYQLPGASEMSFEGGVS 1487 + D L +S + +L+ F+K+ + ++ + Sbjct: 424 -----PANYQEKKDMLATYSSEILGINLTNHFEKYGFTLSDKCKENLKKLPETNEKT 475 >UniRef50_A6KYY0 Putative uncharacterized protein n=5 Tax=Bacteroides RepID=A6KYY0_BACV8 Length = 969 Score = 159 bits (402), Expect = 7e-37, Method: Composition-based stats. Identities = 59/474 (12%), Positives = 125/474 (26%), Gaps = 65/474 (13%) Query: 1066 TETISLYSNPTKWFAGNMQSTGLWAPAQKEVTIKSNANVPVTVTVALADDLTGREKHEVA 1125 + TK ++ TG+ A E+ + V + + T E Sbjct: 433 DINVWADRLMTKHYSCLDNLTGIAVEANDEIIVLVGNTHGYPVALQCIGEETTSFGEEKN 492 Query: 1126 LNRPPRVTKTYSLDASGTVKFKVPYGGLIYIKG-----NSSTNESASFTFTGVVKAPFY- 1179 + Y L G K + G +++ ++ T+ ++ Sbjct: 493 YVQTAASGDIYFLK-EGVNKITIRNRGQLFVMYTADLQSNPTSIRIHIPLGSGQVTGYFD 551 Query: 1180 -----KDGAWKNDLNSPAP-LGELESDAFVYTTPKKNLNASNYTGGLEQFANDLDTFASS 1233 + + L+ ++ + ++ + + + + D Sbjct: 552 LQRHQTNEKYAELLSKATDKYFGVKGEKMIFYFHRSEMLKHVR-TEILSAIHLWDNIVEW 610 Query: 1234 MNDFYGRDSE-DGKHRMFTYKNLPGHKHRFTNDVQISIGDAHSGYPVMNSSFSPNSTTLP 1292 G D + + P + + +D +I+ + ++ Sbjct: 611 EQSLMGIDKMHANQFNNHLFAISPEGAYMWASDYRIAFVYTYLDNILLYDKVM------- 663 Query: 1293 TTPLNDWLIWHEVGHNAAETPLTVPGATEVANNVLALYMQDRYLGKMNRVADDITVAPEY 1352 N W HE+GH + G+TE +NN+ + Y+ LGK + E Sbjct: 664 LAEDNAWGPAHEIGHVHQLA-IDWMGSTESSNNLFSNYI-IYQLGKYKSRGRGLDYLAEC 721 Query: 1353 LEESNNQAWARG-------GAGDRLLMYAQLKEWAEKNFDIKKWYPDGTPLPEFYSEREG 1405 + N + G + M QL + E K Sbjct: 722 VYGQNQAWYNMGSATHQGEDTEIHMRMNWQLWLYYELCKGHDKKPMV------------- 768 Query: 1406 MKGWNLFQLMHRKARGDEVSNDKFGGKNYCAESNGNAADT-LMLCASWVAQTDLSEFFKK 1464 + + R + + Q DL+EFF+ Sbjct: 769 ------WPKIFELMR---------TTYQHVPTHQPGQRQMAFVKAVCDATQEDLTEFFET 813 Query: 1465 WNPGANAYQLPGASEMSFEGGVSQSAYNTLASL--DLPKPEQGPETINQVTEHK 1516 W P + V++ + + P+ P I + + K Sbjct: 814 WGFFKAVDA-PIEQYGQAQYTVTEQMICETKAYIQNKKYPKAAP--IQYIEDRK 864 >UniRef50_C5VKR9 Putative uncharacterized protein n=1 Tax=Prevotella melaninogenica ATCC 25845 RepID=C5VKR9_9BACT Length = 1052 Score = 158 bits (398), Expect = 2e-36, Method: Composition-based stats. Identities = 51/441 (11%), Positives = 114/441 (25%), Gaps = 57/441 (12%) Query: 1085 STGLWAPAQKEVTIKSNANVP--VTVTVALADDLTGREKHEVALNRPPRVTKTYSLDASG 1142 TG+ A +++ + ++P TV + L T L + + L+ Sbjct: 457 PTGVTARDGEDLFVFLGDDIPQDATVQIELVPLGTRSAGKYHNLKKGLNII----LNQGE 512 Query: 1143 TVKFKVPYGGLIYIKGNSSTNE------SASFTFTGVVKAPFY------KDGAWKNDLNS 1190 F YI + + + G ++ + W+ + Sbjct: 513 NNVFVN------YIGRTFNNGKYLRDYKPMNIHIEGGKVNGYFDLTKGNTNEDWQKMQSD 566 Query: 1191 PAPLGE---LESDAFVYTTPKKNLNASNYTGGLEQFANDLDTFASSMNDFYGRDSEDGKH 1247 + ++ + V P + +++ ++ +D G Sbjct: 567 GLVWAKAFNMKGELVVMNMPSQACKDYT-PVHMKELVEIWNSIVQREDDLMG-------- 617 Query: 1248 RMFTYKNLPGHKHRFTNDVQISIGDAHS--GYPVMNSSFSPNSTTLPTTPLND---WLIW 1302 + N + G ++ G N + + + W Sbjct: 618 ---FRAAKRDKCNNVLNATAVDHGYMYATTGGTYYNYNTLADVLNYDKMKWGNGTLWGPA 674 Query: 1303 HEVGHNAAETPLTVPGATEVANNVLALYMQDRYLGKMNRVADDITVAPEYLEESNNQAWA 1362 HE GHN + G TE++ N+ + + G++ ++ +E + Sbjct: 675 HEFGHNHQQL-FNTAGMTEISVNMYSNMVMFT-SGRVTSRSEHCNYTDVDGQEHRGVCES 732 Query: 1363 RGGAGDRLLMYAQLKEWAEKNFDIKKWYPDGTPLPEFYSEREGMKGWNLFQLMHRKARGD 1422 +A K W F+ W + + R Sbjct: 733 A--VSTYADRFANKKMW----FEYGTWGTTQMYYKLYMMFHSTGLDDQFWHKCLDYLRTH 786 Query: 1423 EVSNDKFGGKNYCAESNGNAADTLMLCASWVAQTDLSEFFKKWNPGANAYQLPGASEMSF 1482 + G+ N A DLS FF+ W + + Sbjct: 787 -----RLEGQGTANCQGQNDYLLFAKACCVAANQDLSSFFEAWGHFYDVNGSVIGDYSNT 841 Query: 1483 EGGVSQSAYNTLASLDLPKPE 1503 +++ + P+ Sbjct: 842 TMYTTRAQWVEAKKFMQQFPK 862 >UniRef50_Q8EW84 Predicted integral membrane protein n=1 Tax=Mycoplasma penetrans RepID=Q8EW84_MYCPE Length = 984 Score = 154 bits (388), Expect = 3e-35, Method: Composition-based stats. Identities = 74/431 (17%), Positives = 134/431 (31%), Gaps = 80/431 (18%) Query: 1084 QSTGLWAPAQKEVTIKSNANVPVT-----VTVALADDLTGR-------EKHEVALNRPPR 1131 +TGL+ PA + +TI + + + D+ + NR P Sbjct: 272 YTTGLYLPAGEVITINFPGLTDEQVAALNIRLVINDNEIQDLTSSNVESQWSKCKNRMPV 331 Query: 1132 VTKTYSLDASGTVKFKVPYGGLIYIKGNSSTNESAS------FTFTGVVKAPFYKDG--- 1182 + + ++L F P GG+I ++ ++TN + G V+A Y G Sbjct: 332 MRQVFTLRK-NNFSFGNPLGGMINLEHINNTNTVVNGSNIFRVVIDGAVEALHYVHGYTT 390 Query: 1183 --AWKND-LNSPAPLGELESDAFVYTTPKKNLNASNYTGGLEQFANDLDTFASSMNDFYG 1239 W+ S AP E+E+D + PK NL E ++ + S + Sbjct: 391 EDEWQRLVKESTAPFVEIENDYTKFLVPKSNLGQFANKENYEWELDENNNVISYSHK-ET 449 Query: 1240 RDSEDGKHRMFTYKNLPGHKHRFTND-------------------VQISIGDAHS--GYP 1278 + ++ N ++ R+ + + G A++ Y Sbjct: 450 LINNTYPYKSLDLWNKLSYESRYVSGLNETPTVRPQIKNYFTDYQYYVDGGAAYTSNAYN 509 Query: 1279 VMNSSFSPNSTTLPT-TPLNDWLIWHEVGHNAAETPLTVPG------ATEVANNVLALYM 1331 VM S+ T T +W + HE H+ P V EV NNVL L Sbjct: 510 VMPRSWGSAVTNYDTNNNSGNWGVIHEYNHHFQVNPSNVSWGFIRNDQNEVTNNVLNLLA 569 Query: 1332 QDRYLGKMNRVADDITVAPEYLEESNNQAWARGGAGDRLLMYAQLKEWAEKNFDIKKWYP 1391 +Y + + + + +W G + A LK Sbjct: 570 YAKY----------ANIGQDRAGKQDISSWPSGHLSNINSYNAILKV-----IRNSTTSS 614 Query: 1392 DGTPLPEFYSEREGMKGWNLFQLMHRKARGDEVSNDKFGGKNYCAESNGNAADTLMLCAS 1451 T Y+ GW + + R ++ + +A + S Sbjct: 615 IITAQTFHYTSVMANFGWE---GLEKAIRLANTTSA--------PSNITDANTKFVYFIS 663 Query: 1452 WVAQTDLSEFF 1462 + +F+ Sbjct: 664 KATNYNWGQFY 674 >UniRef50_Q4ZLQ6 Putative uncharacterized protein n=3 Tax=Pseudomonas syringae group RepID=Q4ZLQ6_PSEU2 Length = 607 Score = 154 bits (388), Expect = 3e-35, Method: Composition-based stats. Identities = 69/466 (14%), Positives = 121/466 (25%), Gaps = 74/466 (15%) Query: 1053 KYPG-AVSEEGQNVTETISLYSNPTKWFAGNMQSTGLWAPAQKEVTIKSNANVPVTVTVA 1111 ++P + + + Q TG +A A +E ++ NV + Sbjct: 200 EFPHPRSVLIQPMQDAAAEQAAMSWRIRKADYQPTGYFALAGQEFEVRVWGNVD---NLT 256 Query: 1112 LADDLTGREKHEVALNRPPRVTKTYSLDASGTVKFKVPYGGLIYIKGNSSTNESASFTF- 1170 L G + + + G + P GG I+I+ + + Sbjct: 257 LLVGTQGLADRDDPSEQSENMRARP--LTRGINIIRDPLGGAIHIRNLIGSRPGVARVIF 314 Query: 1171 -TGVVKAPFYKDG----AWKNDLN-SPAPLGELESDAFVYTTPKKNLNASNYTGGLEQFA 1224 +G + P+Y +G W+ L + AP EL V + + Sbjct: 315 GSGAIPMPYYVNGITATQWRKMLLLTKAPEVELVGTHIVVAAFRSTALKFSDVE-PSAIV 373 Query: 1225 NDLDTFASSMNDFYGRDSEDGKHRMFTYKNLPGHKHRFTNDVQISIGDAHSGYPVMNSSF 1284 + + + G D H T L + + P + Sbjct: 374 HSHEEVMRLEAEVSGFDGSAPIH---TRSRLLIYAVEGSASKIPHATTGCIALPYREAIG 430 Query: 1285 SPNSTTLPTTPLNDWLIWHEVGHNAAETPLTVPGATEVANNVLALYMQDRYLGKMNRVAD 1344 N L W+ HE GH+ + + E+ N+ AL + Y+ + V Sbjct: 431 EFNEALLGGLAAERWVTLHEYGHHYQTSYNSYGLFGEITVNLYALAVGRHYINEYTYV-- 488 Query: 1345 DITVAPEYLEESNNQAWARGGAGDRLLMYAQLKEWAEKNFDIKKWYPDGTPLPE-FYSER 1403 L +K Y P + + Sbjct: 489 -------------------------LPERWNGTVNWLALPRTEKIYGAPESDPLAMFEQL 523 Query: 1404 EGMKGWNLFQLMHRKARGDEVSNDKFGGKNYCAESNGNAADTLMLCASWVAQTDLSEFFK 1463 G + HR R +L AS A+ +L+EFF Sbjct: 524 RKGLGEDFLPDWHRYIREHPGEALGLK--------------YFVLTASIAAKRNLTEFFA 569 Query: 1464 KWNPGANAYQLPGASEMSFEGGVSQSAYNTLASLDLPKPEQGPETI 1509 W + + +L P P Q I Sbjct: 570 DWGLLKLTDT---------------DVWIAVNALGFPYPSQRLAAI 600 >UniRef50_A2EPB9 Putative uncharacterized protein n=1 Tax=Trichomonas vaginalis RepID=A2EPB9_TRIVA Length = 664 Score = 151 bits (382), Expect = 1e-34, Method: Composition-based stats. Identities = 64/346 (18%), Positives = 126/346 (36%), Gaps = 50/346 (14%) Query: 964 YEEGKNDELGFKTFTEFLNC--YA-NDAYAGGTKCSADLKKSLVDNNMIYGDGSSKAGMM 1020 Y+ D++ F L C Y ++ ++ C + K Sbjct: 246 YDSSFRDKIDVLKF-HMLACGVYNPSEVHSCYRSC----------YKYLERTKFKKGS-- 292 Query: 1021 NPSYPLNYMEKPLTRLMLGRS----WWDLNIKVDVEKYPGAVSEEGQNVTETISLYSNPT 1076 SY +++ + LML + + + D++++PG + + + + I++ Sbjct: 293 --SYCQTDVQRCIAELMLCITENIDPHEYLMLPDIDEFPGTIEQS-ETSSFEITIDVKKK 349 Query: 1077 KWFAGNMQSTGLWAPAQKEVTIKSNANVPVTVTVALADDLTGREKHEVALNRPPRVTKTY 1136 W +TGLWA + I+ + V + + + ++ R P V +T Sbjct: 350 SWS-----TTGLWALPGICMEIEFKNDDWENVIIQVGSHTEDFLQTDIPWPRWPVVYRT- 403 Query: 1137 SLDASGTVKFKVPYGGLIYIKGNS-STNESASFTFTGVVKAPFYKDGA---WKNDLNSPA 1192 + A + P GG+IY+ N ++S S +FT V++ P W S A Sbjct: 404 -VQAQKKLSLTSPTGGIIYLYLNEGEKSKSVSVSFTNVIQYPHAILDQPTIWDETCFSMA 462 Query: 1193 PLGELESDAFVYTTPKKNLNASNYTGGLEQFANDLDTFASSMNDFYGRDSEDGKHRMFTY 1252 P E + V+T P ++ L Q+ +S+ D+ F Sbjct: 463 PWCEFDCGNIVFTLPTTKSREADTDWLLTQYCK----LVTSIWDY------------FEV 506 Query: 1253 KNLPGHKHRFTNDVQISIGDAHSGYPVMNSSFSPNSTTLPTTPLND 1298 + K+R D+ +S GYP++ + S ++ + N Sbjct: 507 PDEERIKYRVVFDIALSGEGPTLGYPIVLLTSSASNIIQKVSKPNS 552 >UniRef50_A2E8P4 Putative uncharacterized protein n=1 Tax=Trichomonas vaginalis RepID=A2E8P4_TRIVA Length = 706 Score = 150 bits (378), Expect = 4e-34, Method: Composition-based stats. Identities = 54/314 (17%), Positives = 101/314 (32%), Gaps = 23/314 (7%) Query: 959 DTSYRYEEGKNDELG-FKTFTEFLNCYANDAYAGGTKCSADLKKSLVDNNMIYGDGSSKA 1017 D +R + ++ E+ L Y ++ + L+ + G Sbjct: 259 DEYFRVLDEQDSEVSELDNVVSKLRYYISEMTPENAEIILKLQDK--SMEYLQKTGFRNG 316 Query: 1018 GMMNPSYPLNYMEKPLTRLMLGRSWWDLNIKVDVEKYPGAVSEEGQNVTETISLYSNPTK 1077 +M + L+ L+ + D+E +PG E+ I Sbjct: 317 ELMCNDTRQCMVCIILSELINKIPPESVKPVPDLELFPGVTDEKTCTKRINIETK----- 371 Query: 1078 WFAGNMQSTGLWAPAQKEVTIKSNANVPVTVTVALADDLTGREKHEVALNRPPRVTKTYS 1137 AG STGLW A + T++++ + +T+ + E R P V TY+ Sbjct: 372 --AGIWHSTGLWLQAGQIATVETSNH----LTIQVGAQTLCLLVKEFPWKRWPSVITTYN 425 Query: 1138 LDASGTVKFKVPYGGLIYIKGNSSTNESASFTFTGVVKAPFY---KDGAWKNDLNSPAPL 1194 + + +GG +++ S N + S G P++ K W+ S P Sbjct: 426 ISPNTPTAIATQFGGPVFVL--SEKNHNVSVQIEGCALYPYFYNDKPSIWEQTKTSTIPW 483 Query: 1195 GELESDAFVYTTPKKNLNASNYTGGLEQFANDLDTFASSMNDFYGRDSEDGKHRMFTYKN 1254 GE+E+ P + + Q +D + +F K R+ Sbjct: 484 GEIETKYVCINLP---ARMIHNEQKISQMCKRIDQLMQYIREFEAIQYNTFKSRLVFDVE 540 Query: 1255 LPGHKHRFTNDVQI 1268 + DV Sbjct: 541 VAKG-EPIVEDVVF 553 >UniRef50_A5ZL13 Putative uncharacterized protein n=1 Tax=Bacteroides caccae ATCC 43185 RepID=A5ZL13_9BACE Length = 952 Score = 149 bits (375), Expect = 9e-34, Method: Composition-based stats. Identities = 55/471 (11%), Positives = 129/471 (27%), Gaps = 73/471 (15%) Query: 1063 QNVTETISLYSNPTKWFAGNMQSTGLWAPAQKEVTIKSNANVPVTVTVALADDLTGREKH 1122 T K + TG++ + E+ + T+++ Sbjct: 438 PYSTPNNWADKLYMKSYTDLDNPTGIYVNSGDELIVMVGETYGNTISLQAI--------- 488 Query: 1123 EVALNRPPRVTKTYSLDASGTVKFKVPYGGLIYIKGNSS------TNESASFTFTGVVKA 1176 R ++ + G K ++ G++++ N+ T + Sbjct: 489 -----RSSNLSGDKYMLNEGINKLQMKGDGMLFVMYNTELTSENAKPVKIHIPLTSGTVS 543 Query: 1177 PFYKDGAWKN-------DLNSPAPLGELESDAFVYTTPKKNLNASNYTGGLEQFANDLDT 1229 ++ K + ++ + + + L + ++ D Sbjct: 544 GYFDLERDKTDAVYTELLQKATYEYFLIKGNEMLLNFHRTKLLQW-QPNSIVEYITMFDH 602 Query: 1230 FASSMNDFYGRDSEDGKHRMFTYKNLPGHKHRFTNDVQISIGDAHSGYPVMNSSFSPNST 1289 F + + G + N + +D + G+ G+ + + Sbjct: 603 FVNWQYELLGLEDI-----RPALFNNHVNGSSINDDSYMWAGNGQIGFGINALDEFMPTE 657 Query: 1290 TLPTTPLNDWLIWHEVGHNAAETPLTVPGATEVANNVLALYMQDRYLGKMNRVADDITVA 1349 L T W HE+GH + G E +NN+ + Y+ + + + A +A Sbjct: 658 KLYT-ERRCWGPAHEIGHLHQ-GAIAWTGCFESSNNLFSNYVLYKIGRECSNGAPLSVLA 715 Query: 1350 PEYLEES----NNQAWARGGAGDRLLMYAQLKEWAEKNFDIKKWYPDGTPLPEFYSEREG 1405 L + + +Y QL + + Sbjct: 716 DRKLNNRPFCNFLGDPKKEDTEIHMRIYWQLWLYFHRC---------------------- 753 Query: 1406 MKGWNLFQLMHRKARGDEVSNDKFGGKNYCAESNGNAADTLMLCASWVAQTDLSEFFKKW 1465 + + + +K R + N+ G + AS +AQ +L++FF W Sbjct: 754 GIKSDFYPELFKKLRNNRNLNNIPVG---------ERQMLFVKYASDIAQKNLADFFDMW 804 Query: 1466 NPGANAYQLPGASEMSFEGGVSQSAYNTLASLDLPKPEQGPETINQVTEHK 1516 + + V+ + P P+ + + K Sbjct: 805 GFMTPVDETIEQYGSNR-YTVTNAMIAETREYTSKYP--NPQPFYYIEDRK 852 >UniRef50_B1V640 Putative antigenic protein NP1 n=1 Tax=Clostridium perfringens D str. JGS1721 RepID=B1V640_CLOPE Length = 1269 Score = 148 bits (374), Expect = 1e-33, Method: Composition-based stats. Identities = 77/498 (15%), Positives = 157/498 (31%), Gaps = 86/498 (17%) Query: 1026 LNYMEKPLTR-LMLGRSWWDLNIKVDVEKYPGAVSEEGQNVTETISLYSNPTKWFAGNMQ 1084 ++ +EK + + D ++ G++ ++ V + + + +N Sbjct: 61 MDNIEKDIENGTLTKHKAAD-------GQFFGSIPDDVLGVEKKVYINTNAK-----GSH 108 Query: 1085 STGLWAPAQKEVTIKSNANV-------PVTVTVAL-ADDLTGREKHEVALNRPPRVTKTY 1136 S + PA + TIK N + ++V + + + NR P + KT+ Sbjct: 109 SLASYVPAGEIATIKLNNEALKYAKKGKLKISVGMTMVNAEDYNYNNNNQNRMPYLGKTF 168 Query: 1137 SLDASGTVKFKVPYGGLIYIKGNSSTNESASFTFT--GVVKAPFYKDG-----AWKNDLN 1189 S+ + P+GG+IYI + S F GVV P+Y G WK N Sbjct: 169 SV-NENETQVGTPFGGMIYIDIDDSVPSGLRFEIDVKGVVDTPYYDLGRTTDEEWKESKN 227 Query: 1190 SPAPLGELESDAFVYTTPKKNLNASNYTGGLEQFANDLDTFASSMNDFYGRDSEDGKHRM 1249 +P E+ + + P K + N F + +S++ D Sbjct: 228 APGLFAEIRTPYLRFMVPSKFIRNINNPYNAALFWTNSVALSSNIMD------------- 274 Query: 1250 FTYKNLPGHKHRFTNDVQISIGDAHS--GYPVMNSS--FSPNSTTLP-TTPLNDWLIWHE 1304 + D I+ G A++ G + N ++ ++ +W HE Sbjct: 275 ---QQYRIKPMSLIFDQYITAGIAYASVGAWICNLPPEWATSALDYDSIMKSGEWGTIHE 331 Query: 1305 VGHNAAETPLTVPG-------ATEVANNVLALYMQDRYLGKMNRVADDITVAPEYLEESN 1357 + H+ + L +E+ NN L+ Y ++ T + + Sbjct: 332 INHHYQKRYLNYSDEWGVGDEFSEITNNALSSVSYILYTNIAVYRGEEGTYDWNKVAD-- 389 Query: 1358 NQAWARGGAGDRLLMYAQLKEWAEKNFDIKKWYPDGTPLPEFYSEREGMKG-WNLFQLMH 1416 Y+ LK+ + + YS G N ++ Sbjct: 390 --------------PYSSLKQQIYEGKQY--YPNKPNIGNFMYSTFAHEIGPINFINVVK 433 Query: 1417 RKARGDEVSNDKFGGKNYCAESNGNAA-----DTLMLCASWVAQTDLSEFFK---KWNPG 1468 G + +Y ES+G + D + ++ D + + + W Sbjct: 434 STYDGGTFNGIYIPPYDYKLESDGGKSRSDRYDDIAYRLCVASERDYTWYIQNELLW--P 491 Query: 1469 ANAYQLPGASEMSFEGGV 1486 + ++ + Sbjct: 492 IKQETINKIKSHKYKETI 509 >UniRef50_Q183N3 Putative exported protein n=9 Tax=Clostridium difficile RepID=Q183N3_CLOD6 Length = 1987 Score = 145 bits (365), Expect = 1e-32, Method: Composition-based stats. Identities = 92/617 (14%), Positives = 165/617 (26%), Gaps = 132/617 (21%) Query: 956 LWNDTSYRYEEGKNDELGFKTFTEFLNCYANDAYAGGTKCSADLKKSLVDNNMIYGDGSS 1015 + Y Y+ ++D N + +D + K + + + + D Sbjct: 887 ISELKFYEYDSIEDD---------VRNLFMDDLQVELKEDVTQEKITELKDRLNTPDK-- 935 Query: 1016 KAGMMNPSYPLNYMEKPLTRLMLGRSWWDLNIKVDVEKYPGAVSEEGQNVTETISLYSNP 1075 +G +P + E L + + + D DV +++ G+ + N Sbjct: 936 ASGEYHPYKTVIERELKLAQDL----YNDRATLTDVTTVNQEINKLGKYKDKNGQYTVNS 991 Query: 1076 TK-WFAGNMQSTGLWAPAQKEVTIKSNANVPVTVTVALADDLTGREKH---EVALNRPPR 1131 + Q+ G+ A A E+T+ + T + + E+ LN Sbjct: 992 NNLGMQNDWQALGVAARAGDEITVYVGSKSGKTPKLIYTQYYGESGAYKSGEINLNTGKN 1051 Query: 1132 VTKTYSLDASGTVKFKVPYGGLIYIKGNSSTNES--ASFTFTGVVKAPFYKDGAWKNDLN 1189 V + S + GG +YI+ T G K P +N Sbjct: 1052 V-----IQLSKLHSLDIECGGSLYIRYAEDTPSGGDIKVRVAGATKIPHLNL---NGTIN 1103 Query: 1190 SPAPLG-------------------------------------------------ELESD 1200 +P G ++ESD Sbjct: 1104 DKSPAGVSESKKKIKAYIEELARYKDAVKGEPYYPGTNVGNSYGYSEKTGVLNTTDIESD 1163 Query: 1201 AFVYTTPKKNLNASNYTGGLE---------QFANDLDTFASSMN---------DFYGRDS 1242 P + G + + + D G + Sbjct: 1164 KVTLNVPATAVYQGISKGNADLNTQVDRLYNSMLAWEQIMDLVYSERGVFKTQDLDGNGT 1223 Query: 1243 EDG------KHRMFTYKNLPGHKHRFTNDVQISIGDAHSGYPVMNSSFSPNSTTLPTTPL 1296 D K+ M + R + H G + N Sbjct: 1224 VDDNEFNLTKNDMAPKSRMNIKYQRMFIGAFMYASGLHVGVEYGSVPGLMNGVPFQIDDS 1283 Query: 1297 ND--------WLIWHEVGHNAAETPLTVPGATEVANNVLALYMQDRYLGKMNRVADDITV 1348 W I HE+GH +T +E +NNVLAL Q + K + + T+ Sbjct: 1284 GKATGGNLFGWGIGHEIGHVTDIGKMTY---SETSNNVLALLAQ-TFDDKTHSRLEGSTM 1339 Query: 1349 APEYLEESNNQAWARGGAGDRLLMYAQLKEWAEKNFDIKKWYPDGTPLPEFYSEREGMKG 1408 Y ++N +RL M QL + +F + Sbjct: 1340 DKIYEHVTSNSLGIPSNVFERLGMLWQLHLAYDDDFTGSMLKNNSDADLS---------N 1390 Query: 1409 WNLFQLMHRKARGDEVSNDKFGGKNYCAESNGNAADTLMLCASWVAQTDLSEFFKKWNPG 1468 + + RK R S+ ++ L+ AS V + DL ++FK W Sbjct: 1391 DTFYAKISRKYRALSSSD---------PINSLPKDQMLVAMASSVVEKDLRDYFKAWGVE 1441 Query: 1469 ANAYQLPGASEMSFEGG 1485 ++E Sbjct: 1442 ITPELNSIMDSKNYEKE 1458 >UniRef50_A2GCT2 Putative uncharacterized protein n=1 Tax=Trichomonas vaginalis RepID=A2GCT2_TRIVA Length = 694 Score = 145 bits (365), Expect = 2e-32, Method: Composition-based stats. Identities = 54/308 (17%), Positives = 96/308 (31%), Gaps = 41/308 (13%) Query: 1001 KSLVDNNMIYGDGSSKAGMMNPSYPLNYMEK--PLTRLMLGRSWW---DLNIKVDVE--- 1052 S+ I G+ +A ++N +Y + E P+ M+ + W D I + Sbjct: 277 DSIQTMATIVGEEIERASIINTNYAYDVCETFGPILHSMIQQGGWKYGDNQISATLTKLI 336 Query: 1053 ---------------KYPGAVSEEGQNVTETISLYSNPTKWFAGNMQSTGLWAPAQKEVT 1097 + G N + + STGLW Q Sbjct: 337 VKAASVLPLSYFATYDFSGRFVGSSMNSDCQ--QVTVSLSFPDPGWYSTGLWIQPQYLSQ 394 Query: 1098 IKSNANVPVTVTVALADDLTGREKHEVALNRPPRVTKTYSLDASGTVKFKVPYGGLIYIK 1157 ++ + VP + + +E R P V+ + + PYGG++Y Sbjct: 395 VQFDQRVP-NCQIIIGCHTFNAIDNEPPWKRFPIVSLRRQI-TDRFHEIATPYGGMVYFA 452 Query: 1158 GNS----STNESASFTFTGVVKAPFYKDGA---WKNDLNSPAPLGELESDAFVYTTPKKN 1210 N+S T + VK P G WKN P GE+ + + Sbjct: 453 PGESEIIEKNKSIHVTLSDAVKYPMMILGKPDSWKNTYTEDIPWGEIVTKTIIIAARTDQ 512 Query: 1211 LNASNYTGGLEQFANDLDTFASSMNDFYGRDSEDGKHRMFTYKNLPG---HKHRFTNDVQ 1267 +N + G + + L ++ + K R+ P + + T D+ Sbjct: 513 INMISDQEGQLSYIDSLIAPMVEVSGYT----MSKKFRLVFDIEYPHTFMNSYPITIDIN 568 Query: 1268 ISIGDAHS 1275 + HS Sbjct: 569 LHKKLLHS 576 >UniRef50_A2FC48 Putative uncharacterized protein n=2 Tax=Trichomonas vaginalis RepID=A2FC48_TRIVA Length = 717 Score = 141 bits (354), Expect = 3e-31, Method: Composition-based stats. Identities = 53/255 (20%), Positives = 89/255 (34%), Gaps = 31/255 (12%) Query: 1052 EKYPGAVSEEGQNVTETISLYSNPTKWFAGNMQSTGLWAPAQKEVTIKSNANVPVTVTVA 1111 E +PG V+ + + I L N +W STGLW PA I P T + Sbjct: 357 EIFPG-VTGDVELRDFEIDLTINSQEWT-----STGLWLPAGVMGEIIIENVPPSTF-IQ 409 Query: 1112 LADDLTGREKHEVALNRPPRVTKTYSLDASGTVKFKVPYGGLIYIKGNSST---NESASF 1168 + ++ R P T + + K PYGG++Y+ + Sbjct: 410 IGCHNEKNLPKKLPWGRWPSTLVTEPVISEE-TKVGSPYGGIVYLMNGEEEELISYDIHA 468 Query: 1169 TFTGVVKAPFYKDG--AWKNDLNSPAPLGELESDAFVYTTPKKNLNASNYTGGLEQFAND 1226 F + P + W + + P GE+E+ + T P +NL +E FA Sbjct: 469 KFCNFCEFPRFVLEADTWTDTKDIEVPWGEIETSNVIVTMPSENLRKI----DVEAFAKK 524 Query: 1227 LDTFASSMNDFYGRDSEDGKHRMFTYKNLPGHKHRFTNDVQISIGDAHSGYPVMNSSFSP 1286 +D+ S + ++ +K R DV + G+ YP+ S+ Sbjct: 525 IDSLISCIKNY--------------LDIEWEYKFRIIFDVMVDPGEPSISYPIAFSTLDI 570 Query: 1287 NSTTLPTTPLNDWLI 1301 + N +L Sbjct: 571 DHVVEGFNSPNKYLF 585 >UniRef50_C8QBU1 Putative uncharacterized protein n=1 Tax=Pantoea sp. At-9b RepID=C8QBU1_9ENTR Length = 557 Score = 135 bits (340), Expect = 1e-29, Method: Composition-based stats. Identities = 71/491 (14%), Positives = 145/491 (29%), Gaps = 87/491 (17%) Query: 1054 YPGAVSEEGQNVTETISLYSNPTKWFAGNMQSTGLWAPAQKEVTIKSNANVPVTVTVALA 1113 +P + + + + G T + A + VT+ N+P A Sbjct: 86 FPDRYNMPVAHHETKDWVNTPYATVGRGIFFFTRTFGEAGQNVTVSVG-NIPAGAKCYAA 144 Query: 1114 DDLTGREKHEVALNRPPRVTKTYSLDASGTVKFKVPYGGLIYIKGN----SSTNESASFT 1169 +K + L+ L A + G++ + E F Sbjct: 145 TGREFADKDAMYLD-------QQQLTAKSDNSYTFKKTGVLLLGCGDPDKQQPGEFVPFK 197 Query: 1170 FTGVVKAPFYKDGAWKNDLNSPAPLGELES--DAFVYTTPKKNLNASNYTGGL-----EQ 1222 TG + + G N+ + ++ D F + + + E Sbjct: 198 ITGGGNSHLFILGQ-----NTQSDWAASKTIADKFGFALLYDGHANTVVPTRIAQHTDEM 252 Query: 1223 FANDLDTFASSMNDFYGRDSEDGKHRMFTYKNLPGHKHRFTNDVQISIGDAHSGYPVMNS 1282 L + + + DG +FT +G + Y Sbjct: 253 IGKVLGDNLRVVALYEKINGMDGSEYLFTSP----------------MGSMFTNYDNCCF 296 Query: 1283 SFSPN--------STTLPTTPLNDWLIWHEVGHNAAETPLTVPGATEVANNVLALYMQDR 1334 + N + T+ ++W +WHE+GH +E+ N ++ Sbjct: 297 ADYRNGYIGVGFHANTMNDKKGDNWGVWHELGHTYEPMKENFNLFSEIQVNRYSIEACQM 356 Query: 1335 YLG-KMNRVADDITVAPEYLEESNNQAWARGGAGDRLLMYAQLKEWAEKNFDIKKWYPDG 1393 ++G ++ + ++PE W + + + + I + Sbjct: 357 FMGREIPLNKCHVDISPE------EGIWEKQAVAN----FIASGMYYPDYSTINNLWKQ- 405 Query: 1394 TPLPEFYSEREGMKGWNLFQLMHR---KARGDEVSNDKFGGKNYCAESNGNAADTLMLCA 1450 F+S G + F +++ K N +Y S D ++ Sbjct: 406 ---LNFFSRLRFSYGEDFFPKVNQARLKTIQQAPGNTIAEKTDYVIGSKQKVIDFSVVAY 462 Query: 1451 SWVAQTDLSEFFKKWNPGANAYQLPGASEMSFEGGVSQSAYNTLASLDLPKP--EQGPET 1508 S A DL ++F +W + +Q+ +A L LP+P EQ P+ Sbjct: 463 SQAAGQDLRQYFTQWGLNFS----------------TQAG-EKVAELQLPQPGAEQAPQI 505 Query: 1509 INQVTEHKMSA 1519 ++ K+ A Sbjct: 506 --SLSRDKIIA 514 >UniRef50_A5ZBD5 Putative uncharacterized protein n=1 Tax=Bacteroides caccae ATCC 43185 RepID=A5ZBD5_9BACE Length = 977 Score = 133 bits (335), Expect = 5e-29, Method: Composition-based stats. Identities = 77/608 (12%), Positives = 155/608 (25%), Gaps = 117/608 (19%) Query: 950 QNMSVWLWNDTSYRYEEGKNDELGFKTFTEFLNCYANDAYA--GGTKCSADLKKSLVDNN 1007 N++V L Y+Y + ND L + F + ++ Y+ GT S LK+ + Sbjct: 363 DNVNVALAEFECYQYSDNTNDILEAQKF------FTDETYSELKGTVTSESLKE--IKTA 414 Query: 1008 MIYGDGSSKAGMMNPSYPLNYMEKPLTRLMLGRSWWDLNIKVDVEKYPGAVSEEGQNVTE 1067 +IY +K+ S + Sbjct: 415 VIYQLAKELLEG-----------------------------KYDKKF--RFSTYHSCKSP 443 Query: 1068 TISLYSNPTKWFAGNMQSTGLWAPAQKEVTIKS--NANVPVTVTVALADDLTGREKHEVA 1125 I + TG++ + V + +++A+AD G +K ++ Sbjct: 444 EIVAEELTIGSRSIYDNPTGIYFTQGEPVLVFVMYKGASNTPLSLAIADYREGGKKSVIS 503 Query: 1126 LNRPPRVTKTYSLDASGTVKFKVPYGGLIYIKG---NSSTNESASFTFTGVVKAPFY--- 1179 L G G YI+ + + + F + ++ Sbjct: 504 LR-------------GGLNVITPANSGNGYIQYWTRDDAGDTDVDIHFCFGKQIGYWDVR 550 Query: 1180 ---KDGAWKNDL--------NSPAPLGELESDAFVYTTPKKNLNASNYTGGLEQFANDLD 1228 D W L + P + ++ ++ + D Sbjct: 551 RGDTDATWPEILERAKRSAVDIPNAMMDILGQRVHLQNTVNAFAKCA-PNAIQAVVDMHD 609 Query: 1229 TFASSMNDFYGRDSEDGKHRMFTYKNLPGHKHRFTNDVQISIGDA-----HSGYPVMNSS 1283 G + V+ G + YP + Sbjct: 610 RMLDFEYLMMGLVKNNAVPANRF------------FGVRSWGGSPNWNGVCANYPNTEDA 657 Query: 1284 FSPNSTTLPTTPLNDWLIWHEVGHNAAETPLTVPGATEVANNVLALYMQDRYLGKM---- 1339 N W+ HE GH + G TEV NN+ + Q Sbjct: 658 MLVPKVFY--RKNNVWVFGHEFGHGNQVAQMKGNGWTEVTNNLYCSFAQYMMRNDPLSEG 715 Query: 1340 -NRVADDITVAPEYLEE----SNNQAWARGGAGDRLLMYAQLKEWAEKNFDIKKWYPDGT 1394 R+ + P N + + Q+ + + + P Sbjct: 716 YLRLEHESFKRPGARSALAGGRINAFLNEALVAHK-SYFMQVATISTDKPGVWESDPFVK 774 Query: 1395 PLPEFYSEREGMKGW---NLFQLMHRKARGDEVSNDKFGGKNYCAESNGNAADTLMLCAS 1451 +P + M + + +H A D + G + M A Sbjct: 775 LIPLWQMTMYFMAADIKPDFWPDVHWAAIHDNDKSYSPGRRYV----------NFMKRAI 824 Query: 1452 WVAQTDLSEFFKKWNPGANAYQLPGASEMSFEGGVSQSAYNTLASLDLPKPEQGPETINQ 1511 + +L FF+ + ++Q + + + KP + Sbjct: 825 DASGLNLCGFFEGMGLLKVFDNVKVDDYTVATINITQEMVDEVKAYGEGKPLPS-GGMQY 883 Query: 1512 VTEHKMSA 1519 ++ + + A Sbjct: 884 ISANSVEA 891 >UniRef50_A9MNV3 Putative uncharacterized protein n=1 Tax=Salmonella enterica subsp. arizonae serovar 62:z4,z23:-- RepID=A9MNV3_SALAR Length = 644 Score = 133 bits (334), Expect = 6e-29, Method: Composition-based stats. Identities = 67/451 (14%), Positives = 126/451 (27%), Gaps = 80/451 (17%) Query: 1067 ETISLYSNPTKWFAGNMQS---TGLWAPAQKEVTIKSNANVPVTVTVALADDLTGREKHE 1123 T + ++ T +A +T+++ V L Sbjct: 102 VTEQHDNGRSRTTTNGRTPAFFTYYYANVNDTMTLRTENQPDD---VKCYSLLENNYSE- 157 Query: 1124 VALNRPPRVTKTYSLDASGTVKFKVPYGGLIYIKG----------NSSTNESASFTFTGV 1173 P+ L+ + K + G++ ++ + +T S T Sbjct: 158 ------PQAVDAVMLNHNSETKITFNHKGVVLLQCWGTSEDASLKHVNTLISTRVISTTA 211 Query: 1174 VKAPFYKDG-----AWKNDLNSPAPLGE---LESDAFVYTTPKKNLNASNYTGGLEQFAN 1225 P + G WKN P G+ + Y N+ ++ +EQ Sbjct: 212 STQPVFVLGVNTLDDWKNISQQSTPSGQTLLFDGRTRYYA--ANNVGKASKDHNIEQTLR 269 Query: 1226 D--LDTFASS-MNDFYGRDSEDGKHRMFTYKNLPGHKHRFTNDVQISIGDAHSGYPVMNS 1282 + L+T +N G + R + D +I IG S Sbjct: 270 EHLLNTIVYDKLNGIDGSSPINEALRSLDIASYNSCCWAEGGDGRIGIGFGSS------- 322 Query: 1283 SFSPNSTTLPTTPLNDWLIWHEVGHNAAETPLTVPGATEVANNVLALYMQDRYLGKMNRV 1342 + W WHE GH + G E N+ ++ + R Sbjct: 323 ----------IPTQSSWGEWHEFGHQNQMQ-WSWNGLGETTVNIYSIAAC-----RATRG 366 Query: 1343 ADDITVAPEYLEESNNQAWARGGAGDRL--LMYAQLKEWAEKNFDIKKWYPDGTPLPEFY 1400 D+ E L+ N W + G+ L L + + Sbjct: 367 EVDVKTCHENLQ-YNGFQWDQQAVGNFLKSGQTWDL-----------DTDTNVFHQLMMF 414 Query: 1401 SEREGMKGW-NLFQLMHRKARGDEVSNDKFGGKNYCAESNGNAADTLMLCASWVAQTDLS 1459 ++ E W +L+ + + R + +S D ++ AS + DL Sbjct: 415 AQLE--TSWPDLYPALGKAYREINNYYNGSAKV----DSKQEKVDFFVVNASKYSGHDLR 468 Query: 1460 EFFKKWNPGANAYQLPGASEMSFEGGVSQSA 1490 +FF W + + M+ + S Sbjct: 469 KFFTHWGVDYSTDADNQITAMNLPQVIEPSG 499 >UniRef50_A5ZER4 Putative uncharacterized protein n=1 Tax=Bacteroides caccae ATCC 43185 RepID=A5ZER4_9BACE Length = 973 Score = 132 bits (331), Expect = 1e-28, Method: Composition-based stats. Identities = 63/466 (13%), Positives = 119/466 (25%), Gaps = 69/466 (14%) Query: 1085 STGLWAPAQKEVTIKSNANVPV-TVTVALADDLTGREKHEVALNRPPRVTKTYSLDASGT 1143 TGL+ K+ I + T+ + + D G E + L SG Sbjct: 456 PTGLFFEKDKKYIIFVGDEIGDKTLNLYIKDWREGGENQTIRLK-------------SGL 502 Query: 1144 VKFKVPYGGLIYIKGNSSTN---ESASFTFTGVVKAPFY------KDGAWKNDLNS---- 1190 G YI+ + + + F+ + WK LN Sbjct: 503 NTIITTVDGTGYIQYWTDMEVYEPAVKVHVCYGNEIGFWDVRAGHTNEDWKRILNLANIC 562 Query: 1191 ------PAPLGELESDAFVYTTPKKNLNASNYTGGLEQFANDLDTFASSMNDFYGRDSED 1244 + ++ + N + N D G + Sbjct: 563 VQRLNVTNAMLDVLGERVQLINTVNAFNTY-CPDDIMSIMNMHDELMQIEYMMMGLVKNN 621 Query: 1245 GKHRMFTYKNLPGHKHRFTNDVQISIGDA-HSGYPVMNSSFSPNSTTLPTTPLNDWLIWH 1303 R V+ G +G + N W+ H Sbjct: 622 AVPRNR------------MLGVRSWGGSPNWNGTCANFPNSEQAMLDKGVFLQNIWVFGH 669 Query: 1304 EVGHNAAETPLTVPGATEVANNVLALYMQDRYLG--------KMNRVADDITVAPEYLEE 1355 E GH + G EV NN+ A + + R + V + Sbjct: 670 EFGHGNQVAQMKGAGWAEVTNNIYAQQAMYQMNNAACRLEHTEFKRQGYNDKVVADRFNA 729 Query: 1356 SNNQAWARGGAGDR----LLMYAQLKEWAEKNFDIKKWYPDGTPLPEFYSEREGMKGWNL 1411 N A + L+ + E+ + + L +E + Sbjct: 730 YLNDAIVKKKPYLTHEGGLVNDPEKGEYYSADPFVSLAPLWQLSLFFMLTEDAPWSKPDF 789 Query: 1412 FQLMHRKARGDEVSNDKFGGKNYCAESNGNAADTLMLCASWVAQTDLSEFFKKWNPGANA 1471 + +H A D S + G M A ++ +L++FFKK Sbjct: 790 WPDVHWAAIHDNNSVY----------TYGEKYVNFMKRAMDASEMNLTDFFKKMGLLREI 839 Query: 1472 YQLPGASEMSFEGGVSQSAYNTLASLDLPKPEQGPETINQVTEHKM 1517 G + + +++ + + K I ++ + + Sbjct: 840 NMKVGDYGPAKQITITKEMVGEIENYGKSKSPVPTPVIYYISGNSL 885 >UniRef50_C3XY93 Putative uncharacterized protein n=1 Tax=Branchiostoma floridae RepID=C3XY93_BRAFL Length = 533 Score = 130 bits (327), Expect = 4e-28, Method: Composition-based stats. Identities = 44/209 (21%), Positives = 79/209 (37%), Gaps = 22/209 (10%) Query: 1049 VDVEKYPGAVSEEGQNVTETISLYSNPTKWFAGNMQSTGLWAPAQKEVTIKS--NANVPV 1106 +E +PG E Q + TI++ S + +TG + PA + +KS + + Sbjct: 284 PGIESFPGDFESEPQLHSVTITINSMRQE-----RHTTGYYLPAGTFLLVKSCNTNSGAL 338 Query: 1107 TVTVALADDLTGREKHEVALNRPPRVTKTYSLDASGTVKFKVPYGGLIYIKGNSSTNESA 1166 + T R ++ L+R P ++ +L+ + PYGG IY++ + Sbjct: 339 SGWKIRVGAHTDRLSNQHTLHRWPNISVVTNLNHE--TQLFSPYGGNIYLESPEKPSL-L 395 Query: 1167 SFTFTGVVKAPFYKD------GAWKNDLNSPAPLGELESDAFVYTTPKKNLNASNYTGGL 1220 S T VV+AP++ W+ SP EL ++T P L + Sbjct: 396 SITLENVVEAPWFDLTKPETVNNWQVSCQSPGLWAELAGRHIIFTLPSTCLKDGFNPTEM 455 Query: 1221 EQFANDLDTF------ASSMNDFYGRDSE 1243 F + + +M+ G D Sbjct: 456 LMFWDKIVQVTCNIFSLHAMHTIVGIDPW 484 >UniRef50_B6FXD4 Putative uncharacterized protein n=1 Tax=Clostridium hiranonis DSM 13275 RepID=B6FXD4_9CLOT Length = 1937 Score = 129 bits (324), Expect = 9e-28, Method: Composition-based stats. Identities = 100/685 (14%), Positives = 189/685 (27%), Gaps = 158/685 (23%) Query: 942 SVDLERLYQNMS-----VWLWNDTSYRYEEGKNDELGFKTFTEFLNCYANDAYAGGTKCS 996 +V+ R+ N+S + + Y+Y++ + T N + +D K Sbjct: 802 AVNARRIQVNLSTSPRRLSISEMKFYQYDDIE---------TSVENLFEDDLKVELRKNK 852 Query: 997 ADLKKSLVDNNMI----YGDGSSKAGMMNPSYPLNYMEKPLTRLMLGRSWWDLNIKVDVE 1052 + K + + ++ G NP E L +L D + Sbjct: 853 ENGKVEVTQEEIDQIRKTLKTPNEEGEYNPRKDTLLNEVKLAETILK----DNKLS---- 904 Query: 1053 KYPGAVSEEGQNVTETISLYSNPTKWFAGNMQSTGLWAPAQKEVTIKSNANVPVTVTVAL 1112 E+ +V I+ T + Q+ G A +++ + N V +A Sbjct: 905 -------EKVISVDPLINGSGANTGY-NNTWQALGYSVKAGQKIDVYMGRNENKKVVLAY 956 Query: 1113 ADDLTGR---EKHEVALNRPPRVTKTYSLDA--------------------SGTVKFKVP 1149 E+ L LD S + KV Sbjct: 957 EQHYGESGSYLSKEIELKPGRNTISIEKLDNFDLNYEKGGNLYVRVTDNVDSSNAEIKVR 1016 Query: 1150 YGGLI---------------YIKGNSSTNESASFT----------------FTGVVKAPF 1178 G Y+ N ++ E+ Sbjct: 1017 VSGATEIPHLDLNNHLEDVDYLVNNPNSKEALEIKEKLKAYIESLKAHVQTMQSKYPESA 1076 Query: 1179 YKDGAWKNDLNSPAPL-----GELESDAFVYTTPKKNL-------------NASNYTGGL 1220 K KN +E D F T P +++ Sbjct: 1077 TKSDNTKNIYTYDKDTSVLNSTNIEGDRFTLTLPAQDVYEGITEGAENNLDKQVENLYNT 1136 Query: 1221 EQFANDLDTFASS-------MNDFYG--------RDSEDGKHRMFTYKNLPGHKHRFTND 1265 + ++ + DF +S+D ++ + + + R Sbjct: 1137 VLAWEQIIQITNAKKGVFEKVADFNNDGQINADDANSKDYQNNKASKRRVNVKYQRMFIG 1196 Query: 1266 VQISIGDAHSGYPVMNSSF-----------SPNSTTLPTTPLNDWLIWHEVGHNAAETPL 1314 + H G V +S + T L W I HE+GH A Sbjct: 1197 AFMYASGHHVGIDVGSSKDLMKGVPFKFDENGYVTNPDEARLFGWGISHEIGHKADIGNR 1256 Query: 1315 TVPGATEVANNVLALYMQDRYLGKMNRVADDITVAPEYLEESNNQAWARGGAGDRLLMYA 1374 T +E +NN+LAL Q +R+ ++ Y + +++ A L M+ Sbjct: 1257 TY---SETSNNILALITQTFDGKDKSRLEENGIYPKIYKKVTSSSVGVSQDATTLLGMFW 1313 Query: 1375 QLKEWAEKNFDIKKWYPDGTPLPEFYSEREGMKGWNLFQLMHRKARGDEVSNDKFGGKNY 1434 QL+ E + + + + + + M+R R + Sbjct: 1314 QLQLAYEPGYTSEMLKRN---------NDGNLTNDSYYAKMNRLYRSLTDAEKAL----- 1359 Query: 1435 CAESNGNAADTLMLCASWVAQTDLSEFFKKWNPGANAYQLPGASEMSFEGGVSQSAYNTL 1494 + L+ AS A DL++FF+ W A++ + + E Q + Sbjct: 1360 ------DKDQLLIRKASESAGKDLTDFFESWGLVADSKTKAALTGLEKETKKIQYLNDEA 1413 Query: 1495 ASLDLPKPEQGPETINQVTEHKMSA 1519 L +N + K++A Sbjct: 1414 YRKRL---ANDSSDLNMSKDLKVNA 1435 >UniRef50_A2DW23 Putative uncharacterized protein n=1 Tax=Trichomonas vaginalis RepID=A2DW23_TRIVA Length = 386 Score = 125 bits (313), Expect = 2e-26, Method: Composition-based stats. Identities = 45/200 (22%), Positives = 76/200 (38%), Gaps = 12/200 (6%) Query: 1006 NNMIYGDGSS-KAGMMNPSYPLNYMEKPLTRLMLGRSWWDLNIKVDVEKYPGAVSEEGQN 1064 + + G S + G+ P + + LM + E +PG + Sbjct: 191 WDYLKKTGYSLENGLCCPDVKHGIIVVLIHELMPKLPLTVYKPIPEYEFFPGKT-GDEPL 249 Query: 1065 VTETISLYSNPTKWFAGNMQSTGLWAPAQKEVTIKSNANVPVTVTVALADDLTGREKHEV 1124 + L P W A TGLW PA K T++ +++ P+ + + + +TG Sbjct: 250 GEFDVELVVQPDIWIA-----TGLWLPAGKIGTVELHSDYPLNLQIQIGSQVTGLLAKNG 304 Query: 1125 ALNRPPRVTKTYSLDASGTVKFKVPYGGLIYIKGNSSTNE-SASFTFTGVVKAPFY---K 1180 AL R P V + L S + +GG+ Y+ N + + FT P Sbjct: 305 ALKRWPNVVSYFQL-TSEVTQVATSFGGITYVTCNDVMDSVTVKIHFTNFCLYPRACCDD 363 Query: 1181 DGAWKNDLNSPAPLGELESD 1200 WK+ N+ P GE+E+ Sbjct: 364 PSVWKSTQNTQVPWGEIETP 383 >UniRef50_A2DKX5 Immuno-dominant variable surface antigen-like n=1 Tax=Trichomonas vaginalis RepID=A2DKX5_TRIVA Length = 308 Score = 120 bits (301), Expect = 4e-25, Method: Composition-based stats. Identities = 32/199 (16%), Positives = 65/199 (32%), Gaps = 14/199 (7%) Query: 1048 KVDVEKYPGAVSEEGQNVTETISLYSNPTKWFAGNMQSTGLWAPAQKEVTIKSNANVPVT 1107 +++ G S ++ + TG +AP + +T++ Sbjct: 102 PAGTKQFYGDDSLVSNAERYKAKIFIDTR---YNVQHWTGFYAPPGELITVEIPDKALNR 158 Query: 1108 VTVALADDLTGREKHEVALNRPPRVTKTYSLDASGTVKFKVPYGGLI-YIKGNSSTNESA 1166 + V + T R + R + + + KF PYGG I + S E Sbjct: 159 IKVDINVITTSRSYN---YRRADQTSCRTDYINTTVTKFGWPYGGAIDFFIPIDSFPEGL 215 Query: 1167 SFTFTGVVKAPFYKDG-----AWKNDL-NSPAPLGELESDAFVYTTPKKNLNASNYTGGL 1220 + V+K P+++ G W + + PAPL ++ + T P + + Sbjct: 216 EVNISSVIKMPYFRYGATTEEEWNDKISKYPAPLAVFDTGSLHITGPSTFVRQKKNLNDV 275 Query: 1221 EQFAN-DLDTFASSMNDFY 1238 + F + + Sbjct: 276 MAIWRTTMQIFIHQLQEMM 294 >UniRef50_Q0TR08 F5/8 type C domain protein n=8 Tax=Clostridium perfringens RepID=Q0TR08_CLOP1 Length = 1687 Score = 119 bits (297), Expect = 1e-24, Method: Composition-based stats. Identities = 69/456 (15%), Positives = 134/456 (29%), Gaps = 103/456 (22%) Query: 1068 TISLYSNPTKWFAGNMQSTGLWAPAQKEVTIKSNANVPVTVTVALADDLTGREKHEVALN 1127 +IS W + Q TGL A A ++T+ + L +H A Sbjct: 506 SISEAKKRKVWNFQDWQITGLSARAGDKITVYVDVAEGDPTPTLLYKQ--SLTQHGGA-- 561 Query: 1128 RPPRVTKTYSLDASGTVKFKVPY--------------GGLIYIKGNSSTNESA--SFTFT 1171 ++ L G + +P GG ++ S ++ Sbjct: 562 ------TSFQLK-PGKNEITIPEINYESNGIPKDVIQGGDLFFTNYKSDSQKRAPKVRIE 614 Query: 1172 GVVKAPFYKDGA-----------------WKNDLNSPAPLGELESDAFVYTTPKKNLNAS 1214 G K P + G +P + + + L+ Sbjct: 615 GASKYPVFILGKSDENEVMKELEAYVEKIKAEPKTTPNIFAVSSNKSLEFVQATYALDWY 674 Query: 1215 NYTGGLEQF-ANDLDTFASSMNDFYGRDSEDGKHRMFTYKNLPGHKHRFTNDVQISIG-D 1272 ++ A D + + F+G D+ + F ++ +P K +S G Sbjct: 675 KKNNKTPKYTAEQWDQYIADAMGFWGFDNSKDVNSDFNFRIMPMVK-------NLSGGAF 727 Query: 1273 AHSGYPVMNSSFSPNSTTLPTTPLNDWLIWHEVGHNAAETPLTVPGATEVANNVLALYMQ 1332 ++G V+ L W + HE+GHN T+ EV NN++ L+ + Sbjct: 728 MNAGNGVIGIRPGNQDAILAANK--GWGVAHELGHNFDTGGRTI---VEVTNNMMPLFFE 782 Query: 1333 DRYLGKMN----RVADDITVAPEYLEES-NNQAWARGGAGD--RLLMYAQLKEWAEKNFD 1385 +Y K + ++ T L++ NN+ + + + +L QL + Sbjct: 783 SKYKTKTRITDQNIWENNTYPKVGLDDYSNNELYNKADSTHLAQLAPLWQLYLYDNT--- 839 Query: 1386 IKKWYPDGTPLPEFYSEREGMKGWNLFQLMHRKARGDEVSNDKFGGKNYCAESNGNAADT 1445 + R+ R + N Sbjct: 840 -------------------------FYGKFERQFRERDFGNKNREDIYKS---------- 864 Query: 1446 LMLCASWVAQTDLSEFFKKWNPGANAYQLPGASEMS 1481 ++ AS + DL+EFF + + ++ Sbjct: 865 WVVAASDAMELDLTEFFARHGIRVDDKVKEDLAKYP 900 >UniRef50_C6IL86 Putative uncharacterized protein n=2 Tax=Bacteroides RepID=C6IL86_9BACE Length = 996 Score = 115 bits (288), Expect = 1e-23, Method: Composition-based stats. Identities = 66/562 (11%), Positives = 153/562 (27%), Gaps = 74/562 (13%) Query: 1000 KKSLVDNNMIYGDGSSKAGMMNPSYPLNYMEK-PLTRLMLGRSWWDLNIKVDVEKYPGAV 1058 ++ +D +I + P + + P +L +S +++ + Sbjct: 369 AENPLDEQLITVFTDRSCSELRPDASDETINRLPAFFNVLAKSLQSNTYPEAEKRF--RI 426 Query: 1059 SEEGQNVTETISLYSNPTKWFAGNMQSTGLWAPAQKEVTIKSNA-NVPVTVTVALADDLT 1117 T +++ TG+ A +E+ + ++ ++++ DL Sbjct: 427 QSYQAYSVPEYWGDKLRTNYYSPLCNPTGIITNAGEEMVVLADGIPQGESISLRCCSDLG 486 Query: 1118 GREKHEVALNRPPRVTKTYSLDASGTVKFKVPYGGLIYIKGNSSTNES---ASFTF---- 1170 + N G KF G +++ F Sbjct: 487 PDGEERFLKN--------------GINKFSFSRAGNLFVIYQKLDPRGMPAVKIHFPPQY 532 Query: 1171 TGVVKAPFYKDGAWKNDLNSPAP--------------------LGELESDAFVYTTPKKN 1210 + + W ++ + L+ ++T K Sbjct: 533 VEITEHARVGFNVWDLTVDKTDDLFREYIRKAKSVTLDGSDKCVFVLKGRKILFTALKDL 592 Query: 1211 LNASNY--TGGLEQFANDLDTFASSMNDFYGRDSEDGKHRMFTYKNL----------PGH 1258 L + G+ + D + D+ + ++ + Sbjct: 593 LQNQDNFKQYGVVRGMERWDNLIDWEQELAAIDTYSNTGEFNSLMHVTTFTDGLYATNYY 652 Query: 1259 KHRFTNDVQISIGDAHSGYPVMNSSFSPNSTTLPTTPLNDWLIWHEVGHNAAETPLTVPG 1318 + DV G + + N+W HE+GH + P Sbjct: 653 INMAAGDVSTKDGWGFKNNF--------DPRDMDKNQDNEWGPGHELGHMHQ-GAINWPS 703 Query: 1319 ATEVANNVLALYMQDRYLGKMNRVADDITVAPEYLEESNNQAWARGGAGDRLLMYAQLKE 1378 TE +NN+ + Y+ + +R + T+A + L + Sbjct: 704 TTESSNNLFSNYVVYKINQWGSRGSSIGTLATYRYAPPTPWSRFMHPRDPNTLAFTPQDM 763 Query: 1379 WAEKNFDIKKWYPDGTPLPEFYSEREGMKGWNLFQLMHRKARGD-EVSNDKFGGKNYCAE 1437 ++ D K+ E + W F+ + +K ++ + + Sbjct: 764 TSD---DANKYGLYQGEASEMHMRLNQQL-WTYFERIGKKPNTIRKIFEQGRTPEFWLPF 819 Query: 1438 SNGNAADTLM-LCASWVAQTDLSEFFKKWNPGANAYQLPGASEMSFEGGVSQSAYNTLAS 1496 ++ AA + + A D++EFF W + SF V+Q N + Sbjct: 820 NDPGAAQLMYARNVAKAANMDMTEFFDAWGFFIPVSFKLY-AYGSFSYTVTQDMINQTLA 878 Query: 1497 LDLPKPEQGPETINQVTEHKMS 1518 + I + + + Sbjct: 879 Y-MKTFSTKCPPIEYIEDRRYQ 899 >UniRef50_A9L5A4 Putative uncharacterized protein n=12 Tax=Gammaproteobacteria RepID=A9L5A4_SHEB9 Length = 1077 Score = 113 bits (283), Expect = 4e-23, Method: Composition-based stats. Identities = 77/607 (12%), Positives = 148/607 (24%), Gaps = 126/607 (20%) Query: 992 GTKCSADLKKSLVDNNMIYGDGSSKAGMMNPSYPLNYMEKPLTRLMLGRSWWDLNI---- 1047 K ++ YP+ Y E + D + Sbjct: 439 NNAIDVFTKTDFPLMKAGLLLADKYRSEID--YPIAYSEH---AQWQQALFADWTVSYAR 493 Query: 1048 -----KVDVEKYPGAVSEEGQNVTE------TISLYSNPTKWFAGNMQSTGLWAPAQKEV 1096 + D+ +Y + + T+S + + G +TG +A + + Sbjct: 494 THNLAQPDLGEYVTDRANLSKGSNAHYAYPATVSERKTISVPYTGQWTTTGWYALPGQTI 553 Query: 1097 TIKSNANVPVTVTVALADDLTGREK-HEVALNRPPR--VTKTYSLDASGTVKFKVPYGGL 1153 + + V + L + +E + R P + L +++F PYGG Sbjct: 554 KLTRLDSSAANVEIKLNYHRRNTNRAYEQKVYRGPLELAQQRLRLAQGKSIEFSSPYGGP 613 Query: 1154 IYIKGNSSTNE-----SASFTFTGVVKAP----FYKDGAW----KNDLNSPAPLGELESD 1200 IY+ N + S V K P F N+ P +L +D Sbjct: 614 IYLYINGDASSVDGALSVDVNAEHVTKHPTIMDFSNPAEIAAFNDRIQNTELPHVDLRTD 673 Query: 1201 AFVYTTPKKNLNASNYTG--GLEQFANDL-DTFASSMNDFYGRDSEDGKHRM-------- 1249 + + T + N + D +S+ G + Sbjct: 674 GAEQHLRRDRFLNTIGTDVPDVNALLNSIVDDHINSVYTLAGLKIQGKSLSESLPADVLA 733 Query: 1250 -----------FTYKNLPGHKHRFTNDVQISIGDAHSGYPVMNSSFSPNSTTLPTTPLND 1298 + D G SG P + Sbjct: 734 ACKGLFGDDCIDNSLHTRTIIQHANYDQNAQCGAGCSGNP-----WDAAWN----IDPTG 784 Query: 1299 WLIWHEVGHNAAETPL-----------TVPGA----TEVANNVLALYM----QDRYLGKM 1339 W HE+GHN L G E +NN+ + Sbjct: 785 WGDNHELGHNLQTNRLNVQYAAANNSDNWAGYSSRAGENSNNIFPYVVKWKTHYLRDNNT 844 Query: 1340 NRVADDITVAPEYLEESNNQAWARGGAGDRLLMYAQL-------KEWAEKNFDIKKWYPD 1392 + V D + + A + ++ ++ + + Sbjct: 845 DTVTDGHMNHKDLFYVFMSDAAGTTDTSGKRVVVGANCKVLDAGEDRYTAPWASNTYAVH 904 Query: 1393 GTPLPEFYSER-------------EGMKGWNLFQLMHRKAR------------GDEVSND 1427 FY + G+NLF L+++ +R S Sbjct: 905 NGYRMAFYIQMALKAHGMTLSDGTMLNNGFNLFTLLYQHSRIFGKYANNASDWEANRSKL 964 Query: 1428 KFGGKNYCAES--------NGNAADTLMLCASWVAQTDLSEFFKKWNPGANAYQLPGASE 1479 F + S + D +++ S + D F ++ Sbjct: 965 GFSQFPFDGNSVYGGKTVKDIPGNDFMLVSLSQLTGKDWRSHFDMLGLRYSSLAAAQTVA 1024 Query: 1480 MSFEGGV 1486 + G + Sbjct: 1025 NATSGTM 1031 >UniRef50_Q0TS71 Discoidin domain protein n=8 Tax=Clostridium perfringens RepID=Q0TS71_CLOP1 Length = 2142 Score = 112 bits (280), Expect = 1e-22, Method: Composition-based stats. Identities = 59/455 (12%), Positives = 128/455 (28%), Gaps = 79/455 (17%) Query: 1056 GAVSEEGQNVTETISLYSNPTKWFAGNMQSTGLWAPAQKEVTIKSNANVPVTVTVALADD 1115 G+ E Q+ +I W + Q TG + + +T+ + + Sbjct: 712 GSTVFELQSRGNSIKESQKRKVWNFQDWQPTGYAVKSGQVITVYVDVEDGKPTPKLVFKQ 771 Query: 1116 LTGREKHE--VALNRPPRVTKTYSLDASGTVKFKVPYGGLIYIKGN---SSTNESASFTF 1170 + + + ++L++ V + ++ G++Y Sbjct: 772 MDSQHNGDVTISLSKGKNVITIPE-KPTNELRPGTAKAGVLYTSNPYTSEEQGRKPKIRI 830 Query: 1171 TGVVKAPFYKDG------------AWKNDLNSPA---PLGELESDAFVYTTPKKNLNASN 1215 G + P Y G + + L + ++ SD + Sbjct: 831 EGAINYPNYIKGIDNDEEVMNDLEEYVDLLKKDPQLPDVFDVFSDKTLVNVTATYALNWY 890 Query: 1216 YTGG--LEQFANDLDTFASSMNDFYGRDSEDGKHRMFTYKNLPGHKHRFTNDVQISIGDA 1273 + AN D ++G D + F ++ + K + G Sbjct: 891 KNNNKLPSETANKSDEVIKETMKYWGFDESSEVNSDFNFRYISMLKW------LDNGGFM 944 Query: 1274 HSGYPVMNSSFSPNSTTLPTTPLNDWLIWHEVGHNAAETPLTVPGATEVANNVLALYMQD 1333 ++G + + + L W HE+GHN T+ EV NN+L L+ + Sbjct: 945 NAGNGITGFNKAEQGGALGVDT--GWGFMHEMGHNFDTNNRTI---VEVTNNMLPLHFER 999 Query: 1334 RYL---GKMNRVADDITVAPEYLEESNNQAWARGGAGDRL----LMYAQLKEWAEKNFDI 1386 + + + P+ + + + L QL+ + + Sbjct: 1000 IKGVPSNITRQNLWERNILPKVALDDYSNNEYYPESDKSLLSHVAPLWQLQLYDKT---- 1055 Query: 1387 KKWYPDGTPLPEFYSEREGMKGWNLFQLMHRKARGDEVSNDKFGGKNYCAESNGNAADTL 1446 + ++ R ++ S N + Sbjct: 1056 ------------------------FWPRFEQEFRSRDIGG----------GSWENKHNAW 1081 Query: 1447 MLCASWVAQTDLSEFFKKWNPGANAYQLPGASEMS 1481 ++ AS V + DLSE F++ S+ Sbjct: 1082 VMAASDVFKLDLSEHFERHGMDVWKETKEYTSKYP 1116 >UniRef50_A2EZ91 Putative uncharacterized protein n=1 Tax=Trichomonas vaginalis RepID=A2EZ91_TRIVA Length = 215 Score = 98.8 bits (244), Expect = 2e-18, Method: Composition-based stats. Identities = 29/201 (14%), Positives = 63/201 (31%), Gaps = 14/201 (6%) Query: 1048 KVDVEKYPGAVSEEGQNVTETISLYSNPTKWFAGNMQSTGLWAPAQKEVTIKSNANVPVT 1107 ++++ G + + N T ++ P + +T++ Sbjct: 10 PAGIKQFNGDEQKIQVTQRYKFKVVLNTR---VKEYHWTAIYVPPGERITVEIPKKAVNY 66 Query: 1108 VTVALADDLTGREKHEVALNRPPRVTKTYSLDASGTVKFKVPYGGLI-YIKGNSSTNESA 1166 V + + + + L K + KF PYGG + ++ Sbjct: 67 VWANINRQVNHTDSYNERLKDLKCQIK----MTNEVNKFGWPYGGNLEFLTFVDYFPSGL 122 Query: 1167 SFTFTGVVKAPFY-----KDGAWKND-LNSPAPLGELESDAFVYTTPKKNLNASNYTGGL 1220 TG ++ P + + W++ N PAP+ ++ + P + L + Sbjct: 123 EIYVTGGIRMPHFQYGVTTEEEWEDLNSNLPAPVAVVDCGNMLGVGPSEKLLQQSRLNDA 182 Query: 1221 EQFANDLDTFASSMNDFYGRD 1241 F + S+ND Y Sbjct: 183 LAFWRSVGQIFYSLNDVYNYP 203 >UniRef50_C4IBV1 Fibronectin type III domain protein n=2 Tax=Clostridium butyricum RepID=C4IBV1_CLOBU Length = 1989 Score = 98.4 bits (243), Expect = 2e-18, Method: Composition-based stats. Identities = 88/660 (13%), Positives = 166/660 (25%), Gaps = 162/660 (24%) Query: 956 LWNDTSYRYEEGKNDELGFKTFTEFLNCYANDAYAGGTKCSADLKKSLVDNNMIYGDGSS 1015 + Y Y+ +D G Y++D ++ + + D Sbjct: 921 ISELRFYEYDSLSDDIAG---------LYSDDLRI------------VIRDGVKQSDLDR 959 Query: 1016 KAGMMNPSYPLNYMEKPLTRLMLGRSWWDLNIKVDVEKYPGAVSEEGQNVTETISLYSNP 1075 +N P+ P +L + D E VS + +I + Sbjct: 960 LYERLNTKDPVCDEYHPNRETLLKELDNAQKLFNDQE-----VSANITTLDASIRADNIG 1014 Query: 1076 TK-WFAGNMQSTGLWAPAQ-------KEVTIKSNA-NVPVTVTVALADDLTGREKHEVAL 1126 + QS G A +++ + + + V +A + + Sbjct: 1015 PSLGMENSYQSLGSVARPNALKATEKEKIIVYMGSTDSNTKVDIAFLQNYGQPSSY---- 1070 Query: 1127 NRPPRVTKTYSLDASGTVKFKVP--------YGGLIYIKGNS-STNESASFTFTGVVKAP 1177 ++K Y++ + G + ++P GG + + S ST + +GV + P Sbjct: 1071 -----MSKVYTI-SPGVTEIEIPTIFSADVEKGGQVMARVTSGSTGATVKIRLSGVDEIP 1124 Query: 1178 FY----------------------------------------------KDGAWKNDLNSP 1191 + +K + + Sbjct: 1125 HLNVKNIINDTSKVNEVKENIRTYIRDLKTYVNELPQRYPETVSDEDKINNIYKYEKTTS 1184 Query: 1192 AP-LGELESDAFVYTTPKK----NLNASNYTGGLEQFANDLDTFASSMNDF--------- 1237 ++E D F T P + E+ + + + Sbjct: 1185 VLNTTDIEGDRFTLTLPATEILAGIEQGLNGNEEEEVERVYNALLAWEQEIQVGYAKKGV 1244 Query: 1238 -------------YGRDSEDGKHRMFTYKNLPGHKHRFTNDVQISIGDAHSGYPVMNSSF 1284 D L R H G SS+ Sbjct: 1245 FEEVKDFNNNGEIDSEDDVYFNSNKAPLTRLNVKYQRMMMGAAAYSSSHHIGVGFGASSY 1304 Query: 1285 S----------PNSTTLPTTPLNDWLIWHEVGHNAAETPLTVPGATEVANNVLALYMQDR 1334 N T L L+ HE+GH+ + P E +NN+LA Q Sbjct: 1305 MQGIPYKFDEDGNVTNPDEARLYGTLMGHEIGHSMDISNRIYP---ETSNNLLASITQTM 1361 Query: 1335 YLGKMNRVADDITVAPEYLEESNNQAWARGGAGDRLLMYAQLKEWAEKNFDIKKWYPDGT 1394 + + E + + L M Q E N Sbjct: 1362 LNEDNPLTSGAMNTLYEKVTSN--TIGLSTNRSVVLGMLWQPYLAYEDNDTY-------- 1411 Query: 1395 PLPEFYSEREGMKGWNLFQLMHRKARGDEVSNDKFGGKNYCAESNGNAADTLMLCASWVA 1454 E + F ++R R +++G+ L+ S VA Sbjct: 1412 -KMLITDFDENTSNDSYFAKLNRAYRNMSAEE----------KADGDRDQYLIRMTSKVA 1460 Query: 1455 QTDLSEFFKKWNPGANAYQLPGASEMSFEGGVSQSAYNTLASLDLPKPEQGPETINQVTE 1514 +LS F+ NA S+ E Q + + + K + E+ + Sbjct: 1461 GRNLSSFYMAHGIIPNATTRAYVSQFKEETRPIQYINDEARRMRMSK-KADMESDTYLIA 1519 >UniRef50_Q638I8 Enhancin family protein n=51 Tax=Bacillales RepID=Q638I8_BACCZ Length = 742 Score = 98.0 bits (242), Expect = 3e-18, Method: Composition-based stats. Identities = 67/434 (15%), Positives = 126/434 (29%), Gaps = 66/434 (15%) Query: 1081 GNMQSTGLWAPAQKEVTI-KSNANVPVTVTVALADDLTGREKHEVALNRPPRVTKTYSLD 1139 + Q G + + + N N +TV L + + EK N + L Sbjct: 53 HDRQDLGFILQRNTPLKVRQINPNFKNKLTVRLLSNDSKNEKSIQVGNEWVTIQGDTPLV 112 Query: 1140 ASGTVKFKVPYGG----LIYIKGNSSTNESASFTFTGVVKAPFYKDGAWKNDLNSPAPLG 1195 PYG L Y N S + + F+ W Sbjct: 113 P----FIDTPYGEERAILEYQVENKSATKPLPIYKQQGSVSQFF--STWDQ---FDGEYA 163 Query: 1196 ELESDAFVYTTPKKN---LNASNYTGGLEQFANDLDTFASSMNDFYGRDSEDGKHRMFTY 1252 ++ ++F PKK+ + + L++ + + + G D +++ Sbjct: 164 LIQGESFQLFVPKKDKELVRSLKDFQSLDELIAYYEDIFAIYDSIIGLDGSTVENK---- 219 Query: 1253 KNLPGHKHRFTNDVQIS-IGDAHSGYPVMNSSFSPNSTTLPTTPLNDWLIWHEVGHNAAE 1311 +R+ IS G A+ G +S L W HE+ H Sbjct: 220 ----KSHNRYFLKADISGAGGAYYGANWTANSTDSTKMWLDKLS---WGTLHEIAHGYQA 272 Query: 1312 T-PLTVPGATEVANNVLAL-YMQDRYLGKMNRVADDITVAPEYLEESNNQAWARGGAGDR 1369 EV+NN+ + Y +Y K ++V +L + + Sbjct: 273 GFDNQGIFTGEVSNNLFGVQYQYSKYGKKADQVG--------WLFNFGKKEQVERNLYNA 324 Query: 1370 LLMYAQLKEWAEKNFDIKKWYPDGTPLPEFYSEREGMKGWNLFQLMHRKARGDEVSNDKF 1429 L+ ++N + + ++ G F M++ R + Sbjct: 325 LM---------KENKNYNDLDLRQKLILLTMAK--QKAGDEAFAKMYQGYRKLASNVAFK 373 Query: 1430 GGKNYCAESNGNAADTLMLCASWVAQTDLSEFFKKWNPGANAYQLPGASEMSFEGGVSQS 1489 G + D + S A+ D + F++W N Q+ + Sbjct: 374 KGDHSLP-------DLMNQYYSENAKVDFTPVFERWGFKLNHKQIEMNRAKGYP------ 420 Query: 1490 AYNTLASLDLPKPE 1503 + SL PE Sbjct: 421 ---AVTSLAYIVPE 431 >UniRef50_A2G8C7 Immuno-dominant variable surface antigen-like n=1 Tax=Trichomonas vaginalis RepID=A2G8C7_TRIVA Length = 939 Score = 93.0 bits (229), Expect = 9e-17, Method: Composition-based stats. Identities = 44/303 (14%), Positives = 80/303 (26%), Gaps = 68/303 (22%) Query: 1206 TPKKNLNASNYTGGLEQFANDLDTFASSMNDFYGRDSEDGKHRMFTYKNLPGHKHRFTND 1265 P ++ ++ F + S+ND H + D Sbjct: 1 MPSQDAKSAIKLNDAMGFYRGVSRVLYSVNDVT--------HYPRRKDGRVLTPLQINLD 52 Query: 1266 VQISIGDAHS----GYPVMNSSFSPNSTTLPTTPLNDWLIWHEVGHNAAETPLTVPGATE 1321 + G A S Y M +S+ W HE+GH + E Sbjct: 53 SFVKTGAAFSVVGANYIQMPNSWYSGIVNFDAIKWGSWGCVHEIGHQFQ-SNWGWGSYGE 111 Query: 1322 VANNV-----LALYMQDRYLGKMNRVADDITVAPEYLEESNNQAWARGGAGDRLLMYAQL 1376 V NNV ++ + ++ D + + LL YA Sbjct: 112 VTNNVLNYIHYSMMTEIDSTRQITLNGDFNVYNGGW--GRFCHQYNTINEDSLLLWYANN 169 Query: 1377 KEWAEKNFDIKKWYPDGTPLPEFYSEREGMKGWNLFQLMHRKARGDEVSNDKFGGKNYCA 1436 W F + W L+ +HR Sbjct: 170 LYW----FGQETWRKV------------------LYAHIHRTMFPRNNVYPYVSEY---- 203 Query: 1437 ESNGNAADTLMLCASWVAQTDLSEFFKKWNPGANAYQLPGASEMSFEGGVSQSAYNTLAS 1496 +L S ++ D+ +FK ++ +++ ++Q +N L S Sbjct: 204 ----------LLHLSKISNRDMRAYFKTFSL------------YNYDTDITQETHNLLDS 241 Query: 1497 LDL 1499 +L Sbjct: 242 WNL 244 >UniRef50_B9GV44 Predicted protein n=3 Tax=Populus trichocarpa RepID=B9GV44_POPTR Length = 452 Score = 91.5 bits (225), Expect = 3e-16, Method: Composition-based stats. Identities = 31/89 (34%), Positives = 38/89 (42%), Gaps = 4/89 (4%) Query: 32 SSDTPPVDSGTGSLPEVKPDPTPNPEPTPEPTPDPEPTPEPIPDPEPTPEPEPEPVPTKT 91 + P D P PDPTP+ P P+PTP P P EP P P P P+P P P P Sbjct: 44 PNQAPVPDPTPSQAPV--PDPTPSQAPVPDPTPSPAPVHEPTPSPAPVPDPTPNPAPAPD 101 Query: 92 GYLTLGGSQRVTGATCNGESSDGFTFKPG 120 TL S + T S+ +F Sbjct: 102 S--TLSPSPAASTTTLTSRVSENISFSKK 128 >UniRef50_A2EK19 Putative uncharacterized protein n=1 Tax=Trichomonas vaginalis RepID=A2EK19_TRIVA Length = 286 Score = 88.4 bits (217), Expect = 2e-15, Method: Composition-based stats. Identities = 30/141 (21%), Positives = 49/141 (34%), Gaps = 20/141 (14%) Query: 1142 GTVKFKVPYGGLIYIKGNSSTNESASFTFTGVVKAPFYK---DGAWKNDLNSPAPLGELE 1198 G + GGL+YI S S F +K+P Y + N P E+E Sbjct: 8 GDNEIFSQTGGLVYIAVEDSEKMELSVKFQKFLKSPRYIEEMPEIFDQTKNFQVPWAEIE 67 Query: 1199 SDAFVYTTPKKNLNASNYTGGLEQFANDLDTFASSMNDFYGRDSEDGKHRMFTYKNLPGH 1258 ++ + T PK + +F N +++ + ++DF Sbjct: 68 TNNLILTLPK---QKYDKIQDFPKFCNFINSVCTKISDFMHY--------------TIRR 110 Query: 1259 KHRFTNDVQISIGDAHSGYPV 1279 + R DVQ G YP+ Sbjct: 111 QFRVVFDVQTPGGKPIPVYPI 131 >UniRef50_D1BQB2 Putative uncharacterized protein n=1 Tax=Veillonella parvula DSM 2008 RepID=D1BQB2_VEIPT Length = 467 Score = 87.6 bits (215), Expect = 4e-15, Method: Composition-based stats. Identities = 27/78 (34%), Positives = 36/78 (46%) Query: 26 GGGSGSSSDTPPVDSGTGSLPEVKPDPTPNPEPTPEPTPDPEPTPEPIPDPEPTPEPEPE 85 G + PEVKP P P P+P +P P P P PE P P+PTP+PE + Sbjct: 369 GKVQPTPKPEVQPAPAPMPKPEVKPAPQPTPKPEVKPAPVPTPKPEVKPAPQPTPKPEVK 428 Query: 86 PVPTKTGYLTLGGSQRVT 103 P P T + + + T Sbjct: 429 PAPVPTPKPEVKPAPQPT 446 Score = 84.9 bits (208), Expect = 2e-14, Method: Composition-based stats. Identities = 26/72 (36%), Positives = 34/72 (47%) Query: 29 SGSSSDTPPVDSGTGSLPEVKPDPTPNPEPTPEPTPDPEPTPEPIPDPEPTPEPEPEPVP 88 + PEVKP P P P+P +P P P P PE P P+PTP+PE +P P Sbjct: 396 QPTPKPEVKPAPVPTPKPEVKPAPQPTPKPEVKPAPVPTPKPEVKPAPQPTPKPEVKPAP 455 Query: 89 TKTGYLTLGGSQ 100 T L + + Sbjct: 456 VPTPKLEVKPTP 467 Score = 81.4 bits (199), Expect = 3e-13, Method: Composition-based stats. Identities = 26/74 (35%), Positives = 32/74 (43%) Query: 30 GSSSDTPPVDSGTGSLPEVKPDPTPNPEPTPEPTPDPEPTPEPIPDPEPTPEPEPEPVPT 89 PEVKP P P P+P +P P P P PE P P PTP+PE +P P Sbjct: 385 PMPKPEVKPAPQPTPKPEVKPAPVPTPKPEVKPAPQPTPKPEVKPAPVPTPKPEVKPAPQ 444 Query: 90 KTGYLTLGGSQRVT 103 T + + T Sbjct: 445 PTPKPEVKPAPVPT 458 >UniRef50_Q8EVX9 Putative uncharacterized protein MYPE4300 n=1 Tax=Mycoplasma penetrans RepID=Q8EVX9_MYCPE Length = 413 Score = 85.3 bits (209), Expect = 2e-14, Method: Composition-based stats. Identities = 54/320 (16%), Positives = 102/320 (31%), Gaps = 48/320 (15%) Query: 917 GTNIQRLYQHELYFRTN-GRKGERLSSVDLER----LYQNMSVWLWNDTSYRYEEGKNDE 971 G NI LY H+ Y R R S L +N + + + Y + + Sbjct: 104 GNNIDTLYSHDSYITKKYRRSAARTYSGTLSGFPAITTRNGAEAKTSISFYSADYYPSPV 163 Query: 972 LGFKTFTEFLNCYANDAYAGGTKCSADLKKSLVDNNMIYGDGSSKAGMMNPSYPLNYMEK 1031 + T T + +D + L + I ++K +P+ L Y + Sbjct: 164 HPYFTNTGTQSGTPSDTEKMNNVWDENTL--LNSSQFIQWAIANKQLKKHPAIDLQYYQ- 220 Query: 1032 PLTRLMLGRSWWDLNIKVDVEKYPGAVSEEGQNVTETISLYSNPTKWFAGNMQSTGLWAP 1091 + +Y + ++ V + ++ + TGL+AP Sbjct: 221 --------------AATNNTSEYYHKLPDDALAVEKNFNIDLTQPGYVV-----TGLYAP 261 Query: 1092 AQKEVTIKSNANVPVTVTVALADDLTGREKHEVA----------LNRPPRVTKTYSLDAS 1141 + + IK V G ++ R P + + S+ S Sbjct: 262 PGEVINIKIPGLTDEEVKALNLTLTIGDNENPAPGYNASNYSRASKRLPVMKTSISITTS 321 Query: 1142 GTVKFKVPYGGLIYIKGNSSTN----ESASFTFTGVVKAPFYKDG-----AWKN-DLNSP 1191 K+ P+GG++ + N ++ G V+A Y G WK + Sbjct: 322 E-FKYGSPFGGIVNVVVNKEPGLNALKNVGMVINGAVEALHYIHGYTTEAEWKRLTKEAK 380 Query: 1192 APLGELESDAFVYTTPKKNL 1211 AP+ ++ +D + P L Sbjct: 381 APIFDIAADHIKFAGPVHTL 400 >UniRef50_A2DIS8 Putative uncharacterized protein n=1 Tax=Trichomonas vaginalis RepID=A2DIS8_TRIVA Length = 988 Score = 82.6 bits (202), Expect = 1e-13, Method: Composition-based stats. Identities = 64/431 (14%), Positives = 123/431 (28%), Gaps = 76/431 (17%) Query: 1077 KWFAGNMQSTGLWAPAQKEVTIKSNANVPVTVTVALADDLTGREKHEVALNRPPRVT-KT 1135 + + + +T+ N V+ L E + + T K Sbjct: 171 SGLVRAYTPSDRYYVQGESLTVYCNTYDSVSYGALLVKPF----DKESSWHEWGLSTGKN 226 Query: 1136 YSLDASGTVKFKVPYGGLIYIKGNSSTNESASFTFTGVVKAPFYKDGAWKN--------- 1186 ++ + G + NS N TG +KDG N Sbjct: 227 KNIKNPSDMTNHPTKLGTLMFYCNSKVNNPPKCRATGGRPILKFKDGDNVNEFMNKAEEY 286 Query: 1187 ------DLNSPAPLGELESDAFVYT-TPKKNLNASNYTGGLEQFA-NDLDTFASSMNDFY 1238 N A + +L+ K + Y G N+ +++ S+ D Sbjct: 287 AKSPNFISNVQASVTDLKDKKVNLMCVHSKKIVMYTYADGGLSVLKNNFNSWGKSVQDL- 345 Query: 1239 GRDSEDGKHRMFTY-----KNLPGHKHRFTNDVQISIGDAHSGYP--VMNSSFSPNSTTL 1291 D + H ++T+ D++ + A+ Y +N + + Sbjct: 346 -IDEYNKMHEIYTWYAGYEPGYIQATWLQMIDIR-TAAWAYCNYRYIAINGGDGASMLSW 403 Query: 1292 PTTPLNDWLIWHEVGHNAAETPLTVPGATEVANNVLALYMQDRYLGKMNRVADDITVAPE 1351 T + W +HE GH+ T E+ NN+ +L MQ + G+ RV + Sbjct: 404 RTITYDGWGYYHEWGHHYDNRDTTEA---EMTNNLYSLMMQRK-NGRPARVEWIFPMLKS 459 Query: 1352 YLEESNNQAWARGGAGDRLLMYAQLKEWAEKNFDIKKWYPDGTPLPEFYSEREGMKGWNL 1411 + N + D + E + G + Sbjct: 460 WGINQNGNQY------------------------------DIWSRLAILRQLEIILGEEM 489 Query: 1412 FQLMHRKARGDEVSNDKFGGKNYCAESNGNAADTLMLCASWVAQTDL-SEFFKKWNPGAN 1470 ++ R + + + D + S + Q +L + FFK++N N Sbjct: 490 VPSFYKWQRAGNMRDLESKMW---------VHDIWVAGLSRLHQKNLYTSFFKQYNWPCN 540 Query: 1471 AYQLPGASEMS 1481 S+ Sbjct: 541 DACKAECSKYP 551 >UniRef50_B5FJ10 Viral enhancin protein n=5 Tax=Salmonella enterica subsp. enterica RepID=B5FJ10_SALDC Length = 1010 Score = 81.1 bits (198), Expect = 3e-13, Method: Composition-based stats. Identities = 56/409 (13%), Positives = 109/409 (26%), Gaps = 53/409 (12%) Query: 1081 GNMQSTGLWAPAQKEVTIKSNANVPVTVTVALADDLTGREKHEVALNRPPRVTKTYSLDA 1140 + Q+ G PA ++ I+ N + L + + EK ++ T Sbjct: 29 HDRQALGFILPANTQLQIRQPNNNAGNARLRLLCNDSACEKSLTLNGNWQTISTTVD-SV 87 Query: 1141 SGTVKFKVPYGGLIYIKGNSSTNESASFTFTGVVKAPFYKDGA--------WKNDLNSPA 1192 GG + T+ P ++ G W+ + Sbjct: 88 PFIDTLFFAQGGEFSVIYRQPTSNK---------NLPHWRKGQSEDAFFQTWEEQ---AS 135 Query: 1193 PLGELESDAFVYTTPKKNLNASNYTGGLEQFANDLDTFASSMNDFYGRDSEDGKHRMFTY 1252 P LE D + P + A+ G+ + N + G Sbjct: 136 PFALLELDRVRFLLPWAD-RANVINAGITALDAYYTRVIDAYNGWTGLSDSPASPLNQNV 194 Query: 1253 KNLPGHKHRFTNDVQISIGDAHSGYPVMNSSFSPNSTTLPTTPLNDWLIWHEVGHNAAET 1312 N F + +G A+ + + S W I HE+GH Sbjct: 195 ANRY-----FIKADKHGVGAAYYLPWWCAQTAATLSQGWIDNVATQWTILHEIGHGYQGV 249 Query: 1313 PLTVPGA--TEVANNVLALYMQDRYLGKMNRVADDITVAPEYLEESNNQAWARGGAGDRL 1370 + EV NN+ A + Q L + N + D + + Sbjct: 250 FMNDVDLPVGEVWNNIYAAFFQQLNLNQGNHLYTDGWLYNYGRQSEQEL----------- 298 Query: 1371 LMYAQLKEWAEKNFDIKKWYPDGTPLPEFYSEREGMKGWNLFQLMHRKARGDEVSNDKFG 1430 Q + + W +G G F++ ++ R + Sbjct: 299 ----QFITYLRNRTPVNAWGVRPRLQFLMLMLFKG--GTEAFRVFNQNYRELGADENFLP 352 Query: 1431 GKNYCAESNGNAADTLMLCASWVAQTDLSEFFKKWNPGANAYQLPGASE 1479 ++ D L + + D++ F + +A+ + Sbjct: 353 CEHRLT-------DLLADAIATASGYDVAPFIQLCGLPVDAFTREQIAA 394 >UniRef50_B7WZG6 Putative uncharacterized protein n=1 Tax=Comamonas testosteroni KF-1 RepID=B7WZG6_COMTE Length = 954 Score = 80.7 bits (197), Expect = 4e-13, Method: Composition-based stats. Identities = 64/471 (13%), Positives = 123/471 (26%), Gaps = 63/471 (13%) Query: 1058 VSEEGQNVTETISLYSNPTKWFAGNMQSTGLWAPAQKEVTIKSNANVPVTVTVALADDLT 1117 T T + G A K V ++ + + + + T Sbjct: 460 PKASASLATSDSWETIEVTIAQTDGRTAIGRGAVPGKTVQVQIVDAADAALALRVGNIRT 519 Query: 1118 ---GREKHEVALNRPPRVTKTYSLDASGTVKFKVPYGGLIYIKG-NSSTNESASFTFTGV 1173 + R P + L T+ + +GG +++ + G Sbjct: 520 RGNPLAQENYTRPRFPDGHQA-RLSPGQTLSYSTAWGGPLFLNYSGAKAGSVVKLRVRGS 578 Query: 1174 VKAPFY----------KDGAWKNDLNSPAPL--GELESDAFVYTTPKKNLN------ASN 1215 VK + D A + ++ T + Sbjct: 579 VKYAHFDFTRNPGAQEIDEAVQALQRGDFGWQTSKMVGGEVQQTIGYAQSAIGSHHPRTY 638 Query: 1216 YTGGLEQFANDLDTFASSMNDFYGRDSEDGKHRMFTYK-----NLPGHKHRFTNDVQISI 1270 L+ D + A+ N+ + + + F + Sbjct: 639 VVERLKGMIFDSNHLANGYNNMSASANVNNVCATLGWDCRGSIQRAPGVQHFV-GWLAAC 697 Query: 1271 GDAHSGYPVMNSSFSPNSTTLPTTPLNDWLIWHEVGHNAAETPLTVPGA------TEVAN 1324 G SG P + W WHE+GHN +T+ E N Sbjct: 698 GFLCSGNP----------SDGAAGLAPGWGWWHELGHNTVMRHMTLLTTDGGGCPVECDN 747 Query: 1325 NVLALYM----QDRYLGKMNRVADDITVAPEYLEESNNQAWARGGAGDRLLMYAQLKEWA 1380 N+LA G N + I YL+ +A + G + M+ + W Sbjct: 748 NILANASALRQYAITNGAENNSGERIDHKKLYLDIQAARATGKTGDALQADMFQ--RFWT 805 Query: 1381 EKNFDIKKWYPDGTPLPEFYS-------EREGMKGWNLFQLMHRKARGDEV-----SNDK 1428 + N L Y+ + + + L+ R R +N Sbjct: 806 KANKSDNAMRAVHFQLAFIYTRERLGQAQPQPADVIDFLGLLGRGERLIYDNAYWSANKN 865 Query: 1429 FGGKNYCAESNGNAADTLMLCASWVAQTDLSEFFKKWNPGANAYQLPGASE 1479 G A + + + L + +S + D+ + F + + L + Sbjct: 866 ALGMKDYASRDISNHELLYVLSSRIIGRDMRQVFAHYGIPLSPVALSSIAA 916 >UniRef50_C7XZ78 Enhancin family protein n=3 Tax=Lactobacillus jensenii RepID=C7XZ78_9LACO Length = 701 Score = 80.3 bits (196), Expect = 5e-13, Method: Composition-based stats. Identities = 69/474 (14%), Positives = 128/474 (27%), Gaps = 68/474 (14%) Query: 962 YRYEE--GKNDELGFKTFTEFLN-CYANDAYAGGTKCSADLKKSLVDNNMIYGDGSSKAG 1018 Y Y +++ F+ LN + +D + ++ + S + N I G + Sbjct: 156 YEYTGAGEISNQTNFENLQTQLNTLFTSDKDDELSTKASKEQISAIKLNFILSTGLTAEQ 215 Query: 1019 MMNPSYPLNYMEKPLTRLMLGRSWWDLNI----KVDVEKYPGAVSEEGQNVTETISLYSN 1074 ++ T L DL +V + + ++ N Sbjct: 216 K------SIIKQRLATANRLFDEAADLTSIKKDQVQSKSFYVLPTQSDSNAE-------G 262 Query: 1075 PTKWFAGNMQSTGLWAPAQKEVTI-KSNANVPVTVTVALADDLTGREKHEVALNRPPRVT 1133 + + Q G+ + I + N N V V L + K + +T Sbjct: 263 RQMGTSQDRQPLGIILTEGSTIKIRQVNDNYKGQVNVELVGYASAEIKSTSVGSDWVEIT 322 Query: 1134 KTYS-----LDASGTVKFKVPYGGLIYIKGNSSTNESASFTFTGVVKA-PFY-------- 1179 T L + V +GV A P + Sbjct: 323 ATRDDAVVFLKTPQSNATSV------------QPKLEYEL-VSGVAPALPTFTATNSQTD 369 Query: 1180 KDGAWKNDLNSPAPLGELESDAFVYTTPKKNLNASNYTGGLEQFANDLDTFASSMNDF-Y 1238 W+ A +E++ P + ++Q + D + ++ Sbjct: 370 VLKQWQKTR---AAFALIEANNINILVPLSD-YNLVAKTDIKQLIDQYDNQVFKLYNYLT 425 Query: 1239 GRDSEDGKHRMFTYKNLPGHKHRFTNDVQISIGDAHSGYPVMNSSFSPNSTTLPTTPLND 1298 G D D + K F Q G +GY + + +N Sbjct: 426 GYDYNDNPDKNVKGKY-------FVMSDQTGSG---AGYYSGRLTAQNGKSVSAYLSIN- 474 Query: 1299 WLIWHEVGHNAAETPLTVPGATEVANNVLALYMQDRYLGKMNRVADDITVAPEYLEESN- 1357 WL HE+GH P + NN+ Q Y + ++ ++ Sbjct: 475 WLPLHEIGHGYEI-PSDGLWIRDSFNNIFGTLYQLEYEKDNFEKKSWLLYGQKFASVNDF 533 Query: 1358 NQAWARGGAGDRLLMYAQLKEWAEKNFDIKKWYPDGTPLPEFYSEREGMKGWNL 1411 + G + L + +L W N D Y + KG Sbjct: 534 IKQVKSGKDFNGLNFFGRLHFWM--NLAYNLKGMDAFRNFNIYYKEAARKGLKF 585 >UniRef50_B8MWN6 Viral-enhancing factor, putative n=3 Tax=Eurotiomycetidae RepID=B8MWN6_ASPFN Length = 858 Score = 79.9 bits (195), Expect = 6e-13, Method: Composition-based stats. Identities = 62/414 (14%), Positives = 119/414 (28%), Gaps = 59/414 (14%) Query: 1078 WFAGNMQSTGLWAPAQKEVTIKSNANV---PVTVTVALADDLTGREKHEVALNRPPRVTK 1134 + Q + A + + + +T+ + D+ T K + ++ Sbjct: 25 GIDHDRQHLSIVLAAGQTIKARQTNTAITGELTLRLLNDDNQTEASKKVGS--DWAELSA 82 Query: 1135 TYSLDASGTVKFKVPYGGLIYIKGNSSTNESASFTFT---GVVKAPFYKDGAWKNDL--- 1188 + VP+ +Y +++ E + G + P Y+ G +++ Sbjct: 83 SV---------VSVPFIDTLYTDTSNAAVEPV-VEYEYPDGSKQLPVYRKGQSQSEFFNH 132 Query: 1189 --NSPAPLGELESDAFVYTTPK---KNLNASNYTGGLEQFANDLDTFASSMNDFYGRDSE 1243 N + L+S P + L ++ ++ S N G E Sbjct: 133 WDNQGSEFALLDSSYTQVLVPVIDKEALRHPQEVDNIDGLIGYYESVFSFYNALAGLSFE 192 Query: 1244 DGKHRMFTYKNLPGHKHRFTNDVQISIGDAHSGYPVMNSSFSPNSTTLPTTPLNDWLIWH 1303 + +L F + G A+ G +S + ++W H Sbjct: 193 PERP-----SDLNSTNRYFMKADKHGAGGAYYGQNWTAASTNAIKEWWLLPSASNWGNLH 247 Query: 1304 EVGHNAA--ETPLTVPGATEVANNVLALYMQDRYLGKMNRVADDITVAPEYLEESNNQAW 1361 E+GH EV+NNV A Q YLG D +L QA Sbjct: 248 EIGHGYQMSFRNDRYFWNGEVSNNVYAALYQSAYLG-------DRKYQEGWLYNYGKQAQ 300 Query: 1362 ARGGAGDRLLMYAQLKEWAEKNFDIKKWYPDGTPLPEFYSEREGMKGWNLFQLMHRKARG 1421 G + + L +W D F G + F +++ R Sbjct: 301 VEQGIISDITSHKSLNDW------------DLRAKLYFLVLMVEKAGVDSFADFNQQYRL 348 Query: 1422 DEVSNDKFGGKNYCAESNGNAADTLMLCASWVAQTDLSEFFKKWNPGANAYQLP 1475 + + D L S + D++ F + + + Q Sbjct: 349 VSNQPNFDSTNHLL-------LDMLSDSFSRIGHIDVTPFVELCSGYISPGQRE 395 >UniRef50_A5ZN83 Putative uncharacterized protein n=1 Tax=Ruminococcus obeum ATCC 29174 RepID=A5ZN83_9FIRM Length = 607 Score = 79.5 bits (194), Expect = 9e-13, Method: Composition-based stats. Identities = 28/81 (34%), Positives = 33/81 (40%), Gaps = 1/81 (1%) Query: 50 PDPTPNPEPTPEPTPDPEPTPEPIPDPEPTPEPEPEPVPTKTGYL-TLGGSQRVTGATCN 108 P+PT PEPT P P P P P+P TPEP P PT GS TG N Sbjct: 475 PEPTAAPEPTAAPEPTAAPEPTATPEPTATPEPTATPEPTTAPGAGYKDGSYTGTGEGFN 534 Query: 109 GESSDGFTFKPGEDVTCVAGN 129 G+ + G V+ Sbjct: 535 GQVTVTINVSGGNIVSAGYDG 555 >UniRef50_UPI000180C854 PREDICTED: similar to transmembrane agrin n=1 Tax=Ciona intestinalis RepID=UPI000180C854 Length = 2114 Score = 78.4 bits (191), Expect = 2e-12, Method: Composition-based stats. Identities = 32/98 (32%), Positives = 43/98 (43%), Gaps = 22/98 (22%) Query: 31 SSSDTPPVDSGTGSLPEVKPDPTPN----------------------PEPTPEPTPDPEP 68 ++ + + T +P V+P+PTPN PEPTP+ P+PEP Sbjct: 1108 ETTRMVQITTTTHPVPVVEPEPTPNAKPEPEPTSKPEPEPEPTPNAKPEPTPKSEPEPEP 1167 Query: 69 TPEPIPDPEPTPEPEPEPVPTKTGYLTLGGSQRVTGAT 106 T +P P+PEPT PEPEP P T T Sbjct: 1168 TSKPEPEPEPTSNPEPEPTPNAKPEPTSNPEPEPERTT 1205 >UniRef50_UPI000169559C enhancin family protein n=3 Tax=Paenibacillus larvae subsp. larvae BRL-230010 RepID=UPI000169559C Length = 870 Score = 76.8 bits (187), Expect = 7e-12, Method: Composition-based stats. Identities = 64/452 (14%), Positives = 122/452 (26%), Gaps = 64/452 (14%) Query: 984 YANDAYAGGTKCSADLKKSLVDNNMIYGDGSSKAGMMNPSYPL-NYMEKPLTRLMLGRSW 1042 + + D + ++ + +P + L N +M Sbjct: 467 FNKEIQGTNVSIGQDTIPLKEGYTIKIYHAETRTRLKSPDFNLINSKSNDNMFIMTKNGL 526 Query: 1043 WDLNIKVDVEKYPGAVSEEGQNVTETISLYSNPTKWFAGNMQSTGLWAPAQKEVTI-KSN 1101 + V + V+ T + +K + Q G+ Q + I + N Sbjct: 527 INTLTPVQKKSIHHLVAP-------TWIFNAGISKGKYHDRQDLGVILQPQATIRIRQVN 579 Query: 1102 ANVPVTVTVALADDLTGREKHEVALNRPPRVTKTYSLDASGTVKFKVPYGGLIYIKGNSS 1161 + +TV L +D EK E + + DA PYG Sbjct: 580 PHFKDKLTVRLLNDDRLTEKKEGVTSEWSTIQA----DAISVPFIDTPYG---------D 626 Query: 1162 TNESASFTFTG-VVKAPFYKD--------GAWKNDLNSPAPLGELESDAFVYTTPK---K 1209 N +T G + P Y+ W + A ++ +F P + Sbjct: 627 QNAEVEYTIEGKQIPLPIYQPCGNKMEFFQQWDKE---QAGFALVQGPSFQLLVPAKDKE 683 Query: 1210 NLNASNYTGGLEQFANDLDTFASSMNDFYGRDSEDGKHRMFTYKNLPGHKHRFTNDVQIS 1269 L +++ + ND G + D + D + Sbjct: 684 ALRNLKEFKSIDELIQYYEEIFQLFNDMIGLEDTDTGTNKMSQNRYFLK-----ADAHGA 738 Query: 1270 IGDAHSGYPVMNSSFSPNSTTLPTTPLNDWLIWHEVGHNAAET-PLTVPGATEVANNVLA 1328 G +S + + + +T N+W HE+ H EV+NN+ Sbjct: 739 GGAYYS----HDYTANSYATVDMWLKKNNWGPLHEIAHGYQAAFDNKGMYTGEVSNNLFG 794 Query: 1329 LYMQDRYLGKMNRVADDITVAPEYLEESNNQAWARGGAGDRLLMYAQLKEWAEKNFDIKK 1388 + Q GK ++ G + + L + K Sbjct: 795 VQHQYSKNGK-----------------DADKIGWLFDYGKKESVEKNLYQAIIKEGKGYT 837 Query: 1389 WYPDGTPLPEFYSEREGMKGWNLFQLMHRKAR 1420 D + + G F ++R+ R Sbjct: 838 EVDDLRFQLILLTMLKQKAGNEAFTHLYREYR 869 >UniRef50_A9A5M1 Putative uncharacterized protein n=1 Tax=Nitrosopumilus maritimus SCM1 RepID=A9A5M1_NITMS Length = 268 Score = 76.4 bits (186), Expect = 9e-12, Method: Composition-based stats. Identities = 30/73 (41%), Positives = 34/73 (46%) Query: 31 SSSDTPPVDSGTGSLPEVKPDPTPNPEPTPEPTPDPEPTPEPIPDPEPTPEPEPEPVPTK 90 + P PE +PTP PEP EPTP+PEP EP P+PEP EP PEP P Sbjct: 45 EPTPEPEPVVEPTPEPEPVVEPTPEPEPVVEPTPEPEPVVEPTPEPEPVVEPTPEPEPVV 104 Query: 91 TGYLTLGGSQRVT 103 SQ Sbjct: 105 EPTPEPEESQLPE 117 Score = 75.7 bits (184), Expect = 1e-11, Method: Composition-based stats. Identities = 40/156 (25%), Positives = 53/156 (33%), Gaps = 9/156 (5%) Query: 31 SSSDTPPVDSGTGSLPEVKPDPTPNPEPTPEPTPDPEPTPEPIPDPEPTPEPEPEPVPTK 90 + P PE +PTP PEP EPTP+PEP EP P+PEP EP PEP P Sbjct: 35 EPTPEPEPVVEPTPEPEPVVEPTPEPEPVVEPTPEPEPVVEPTPEPEPVVEPTPEPEPVV 94 Query: 91 TGYLTLGGSQRVTGATCNGESSDGFTFKPGEDVTCVAGNTTIATFNTQ--SEAARSLRAV 148 T + P + T FN+ E A Sbjct: 95 EPTPEPEPVVEPTPEPEESQL-------PEISIKTSYDETPSTEFNSAALLETASDEEVN 147 Query: 149 EKVSFSLEDAQELAGSDDKKSNAVSLVTSSNSCPAN 184 +++F ++ S D N + N Sbjct: 148 SRITFEEFTDEKTLISHDAFGNKQMKIKVLEVSDEN 183 Score = 74.1 bits (180), Expect = 4e-11, Method: Composition-based stats. Identities = 31/80 (38%), Positives = 38/80 (47%), Gaps = 2/80 (2%) Query: 30 GSSSDTPPVDSGTGSLPEVKP--DPTPNPEPTPEPTPDPEPTPEPIPDPEPTPEPEPEPV 87 + P V+ P V+P +P P EPTPEP P EPTPEP P EPTPEPEP Sbjct: 36 PTPEPEPVVEPTPEPEPVVEPTPEPEPVVEPTPEPEPVVEPTPEPEPVVEPTPEPEPVVE 95 Query: 88 PTKTGYLTLGGSQRVTGATC 107 PT + + + Sbjct: 96 PTPEPEPVVEPTPEPEESQL 115 Score = 71.0 bits (172), Expect = 3e-10, Method: Composition-based stats. Identities = 22/72 (30%), Positives = 29/72 (40%) Query: 31 SSSDTPPVDSGTGSLPEVKPDPTPNPEPTPEPTPDPEPTPEPIPDPEPTPEPEPEPVPTK 90 P + P +P+P P P PEP +P P PEP+ +P P PEP EP P Sbjct: 31 EPVVEPTPEPEPVVEPTPEPEPVVEPTPEPEPVVEPTPEPEPVVEPTPEPEPVVEPTPEP 90 Query: 91 TGYLTLGGSQRV 102 + Sbjct: 91 EPVVEPTPEPEP 102 Score = 69.1 bits (167), Expect = 1e-09, Method: Composition-based stats. Identities = 31/73 (42%), Positives = 36/73 (49%), Gaps = 2/73 (2%) Query: 35 TPPVDSGTGSLPEVKP--DPTPNPEPTPEPTPDPEPTPEPIPDPEPTPEPEPEPVPTKTG 92 P V+ P V+P +P P EPTPEP P EPTPEP P EPTPEPEP PT Sbjct: 31 EPVVEPTPEPEPVVEPTPEPEPVVEPTPEPEPVVEPTPEPEPVVEPTPEPEPVVEPTPEP 90 Query: 93 YLTLGGSQRVTGA 105 + + Sbjct: 91 EPVVEPTPEPEPV 103 Score = 63.7 bits (153), Expect = 5e-08, Method: Composition-based stats. Identities = 29/75 (38%), Positives = 36/75 (48%) Query: 31 SSSDTPPVDSGTGSLPEVKPDPTPNPEPTPEPTPDPEPTPEPIPDPEPTPEPEPEPVPTK 90 + + + T + E P+P P EPTPEP P EPTPEP P EPTPEPEP PT Sbjct: 19 EAKRIQQIKNPTEPVVEPTPEPEPVVEPTPEPEPVVEPTPEPEPVVEPTPEPEPVVEPTP 78 Query: 91 TGYLTLGGSQRVTGA 105 + + Sbjct: 79 EPEPVVEPTPEPEPV 93 Score = 63.0 bits (151), Expect = 8e-08, Method: Composition-based stats. Identities = 32/99 (32%), Positives = 43/99 (43%), Gaps = 9/99 (9%) Query: 30 GSSSDTPPVDSGTGSLPEVKP--DPTPNPEPTPEPTPDPEPTPEPIPDPEPTPEPEPEPV 87 + P V+ P V+P +P P EPTPEP P EPTPEP P EPTPEPE + Sbjct: 56 PTPEPEPVVEPTPEPEPVVEPTPEPEPVVEPTPEPEPVVEPTPEPEPVVEPTPEPEESQL 115 Query: 88 PTKTGYLTLGGSQRVT-------GATCNGESSDGFTFKP 119 P + + + + E + TF+ Sbjct: 116 PEISIKTSYDETPSTEFNSAALLETASDEEVNSRITFEE 154 Score = 63.0 bits (151), Expect = 9e-08, Method: Composition-based stats. Identities = 25/67 (37%), Positives = 27/67 (40%) Query: 37 PVDSGTGSLPEVKPDPTPNPEPTPEPTPDPEPTPEPIPDPEPTPEPEPEPVPTKTGYLTL 96 P + PE +P P PEP P P PEP P P PEP P EP P P T Sbjct: 29 PTEPVVEPTPEPEPVVEPTPEPEPVVEPTPEPEPVVEPTPEPEPVVEPTPEPEPVVEPTP 88 Query: 97 GGSQRVT 103 V Sbjct: 89 EPEPVVE 95 Score = 56.8 bits (135), Expect = 7e-06, Method: Composition-based stats. Identities = 23/85 (27%), Positives = 27/85 (31%) Query: 19 TLLAGCDGGGSGSSSDTPPVDSGTGSLPEVKPDPTPNPEPTPEPTPDPEPTPEPIPDPEP 78 +L+ + +P P PEP P P PEP P P PEP Sbjct: 1 MVLSEVNKDEEAQLQWEKEAKRIQQIKNPTEPVVEPTPEPEPVVEPTPEPEPVVEPTPEP 60 Query: 79 TPEPEPEPVPTKTGYLTLGGSQRVT 103 P EP P P T V Sbjct: 61 EPVVEPTPEPEPVVEPTPEPEPVVE 85 >UniRef50_P20301 Antigenic protein NP1 (Fragment) n=6 Tax=Entamoeba RepID=APRN_ENTHI Length = 640 Score = 76.1 bits (185), Expect = 1e-11, Method: Composition-based stats. Identities = 48/407 (11%), Positives = 107/407 (26%), Gaps = 77/407 (18%) Query: 1106 VTVTVALADDLTGREKHEVALNRPPR-------------VTKTYSLDASGTVKFKVPYGG 1152 V V++ + + ++R + T + T K P GG Sbjct: 1 VQVSIGKCNHNPSD-QWLDNISRWSNDRMPIDSIGIDLGLQTTQPYSINDTFKIGSPIGG 59 Query: 1153 LIYIKGNSSTNESASFTFTGVVKAPFYK-----DGAWKN-DLNSPAPLGELESDAFVYTT 1206 +IY++ +++ S TF V +A + W + N+P + E+ + Sbjct: 60 MIYLRFDTTFTNSFYVTFYNVGRASIINYNITMNEEWNSVLKNAPGNVAEIRTPGNRLVF 119 Query: 1207 PKKNLNASNYTGGLEQFANDLDTFASSMNDFYGRDSEDGKHRMFTYKNLPGHKHRFTNDV 1266 +++ + + + + T +N+P D Sbjct: 120 TSRHIRNLEDAQYISDYW---------------LKAISISNYAVTLENIPIT---LNFDQ 161 Query: 1267 QISIGDAHS----GYPVMNSSFSPNSTTLPT-TPLNDWLIWHEVGHNAAET-----PLTV 1316 ++ G A + + S ++ +W HE+ H+ + Sbjct: 162 RVDAGAAVAFVDRWFTQHPSDWASGCVNKEGLINSGNWGPLHEMNHHMQGPYLRGGNWGI 221 Query: 1317 PGATEVANNVLALYMQDRYLGKMNRVADDITVAPEYLEESNNQAWARGGAGDRLLMYAQL 1376 E NNV+ Y + + Y+ + + + Sbjct: 222 KEPGEETNNVMTSINYILYTN-IAGHRNQGLSGWNYVSDGYSTIYKILN----------- 269 Query: 1377 KEWAEKNFDIKKWYPDGTPLPEFYSEREGMKGWNLFQLMHRKARGDEVSNDKFGGKNYCA 1436 + P Y G + + + G N+ G + Sbjct: 270 -------------GENDQPHLRSYVNIAHAFGTDTLIALVKSYYGLWYENNYEGEYSIKR 316 Query: 1437 ESNGNAADTLMLCASWVAQTDLSEFFKKWNPGANAYQLPGASEMSFE 1483 +S L A+ + D + M++ Sbjct: 317 DSTS----AFCLLAAIATKRDTRYLCSLFKYDIQQNVSEAIKNMNYP 359 >UniRef50_A2DBM1 F5/8 type C domain containing protein n=3 Tax=Trichomonas vaginalis RepID=A2DBM1_TRIVA Length = 1128 Score = 76.1 bits (185), Expect = 1e-11, Method: Composition-based stats. Identities = 50/415 (12%), Positives = 102/415 (24%), Gaps = 95/415 (22%) Query: 1081 GNMQSTGLWAPAQKEVTIKSNANVPVTVTVALADDLTGREKHEVALNRPPRVTKTYSLDA 1140 Q TG + + + + + + + + + P Sbjct: 125 NTNQPTGKYMITGERIIVYCENYGRKSNSKLVFSKNAQDGDNMMQYVDLP--------QG 176 Query: 1141 SGTVKFKVPYGG----LIYIKGNSSTNESASFTFTGVVKAPFYKDGA------------- 1183 +GG +++ ++ +G + +K G Sbjct: 177 KTNTSATYTWGGLYAPMLWNCYSNEDFPPPKCRISGGHELLSFKLGDNATLFEERLENYL 236 Query: 1184 -----WKNDLNSPAP-----LGELESDAFVYTTPKKNLNASNYTG-----GLEQFANDLD 1228 N L++ + E+ FV T G L + Sbjct: 237 KDPNYLSNTLSNDVQGKLMNVAEVSGPNFVMHTTAGACLTGIRWGKNAGLDLNVAMAQWN 296 Query: 1229 TFASSMNDFYGRDSEDGKHRMFTYKNLPGHKHRFTNDVQISIGDAH-SGYPVMNSSFSPN 1287 ++ G +D + + R + + + A+ +G + + Sbjct: 297 EIRDLYLEYLGYVDDDPNP-----VHKKQYGTRVVSKIGKTGAYAYKTGNHIFYNQDYFG 351 Query: 1288 STTLP----TTPLNDWLIWHEVGHNAAETPLTVPGATEVANNVLALYMQDRYLGKMNRVA 1343 + ++W WHE GH P E+ NN+ L MQ RY +R+ Sbjct: 352 AAACNAQFIRASGDNWGAWHENGHAYDFGP---TEEAELTNNMFPLMMQRRY-NLTSRLE 407 Query: 1344 DDITVAPEYLEESNNQAWARGGAGDRLLMYAQLKEWAEKNFDIKKWYPDGTPLPEFYSER 1403 E N G W + + Sbjct: 408 S---------ESRWNNVLNYLNYGQ-------------------AWGGENWYGLGILRQL 439 Query: 1404 EGMKGWNLFQLMHRKARGDEVSNDKFGGKNYCAESNGNAADTLMLCASWVAQTDL 1458 E G + F R R + S + + ++ S + +L Sbjct: 440 EIFAGKHSFAQFCRLIREHD-------------YSGVSRREKFVIGFSKLVGQNL 481 >UniRef50_B1IFS9 Enhancing factor n=5 Tax=Clostridium botulinum RepID=B1IFS9_CLOBK Length = 925 Score = 75.3 bits (183), Expect = 2e-11, Method: Composition-based stats. Identities = 67/446 (15%), Positives = 123/446 (27%), Gaps = 72/446 (16%) Query: 1078 WFAGNMQSTGLWAPAQKEVTI-KSNANVPVTVTVALADDLTGREKHEVALNRPPRVTKTY 1136 + Q G+ PA ++TI + N N +TV L +D E +KT+ Sbjct: 52 GIGHDKQDLGIILPANVQLTIRQVNPNFKGNLTVRLLNDNNQHE-----------TSKTF 100 Query: 1137 SLDASGTVKFKVPYGGLIYI--KGNSSTNESASFTFTGVVKA-PFYKDGAWKNDL----- 1188 + ++ PY + ++ S + T ++ P YK G + Sbjct: 101 N---QTSITISTPYSSVPFVDTVYGGSEKPKIEYRVTSSMQTLPIYKKGQNEQSFFYGWD 157 Query: 1189 NSPAPLGELESDAFVYTTP---KKNLNASNYTGGLEQFANDLDTFASSMNDFYGRDSEDG 1245 + AP + D F P K + ++ N G Sbjct: 158 KASAPFALVTDDYFQLLVPQKDKAYMKNMKDFSSIDNLILYYREIFEYYNKLAGISFNTN 217 Query: 1246 KHRMFTYKNLPGHKHRFTNDVQISIGDAHSGYPVMNSSFSPNSTTLPTTPLNDWLIWHEV 1305 N K + G H+ + S + T W HE+ Sbjct: 218 VQTDKNVPNKYFIKADKSGPGGGYYGGNHTAET--SDSVASFWLT------RGWGALHEI 269 Query: 1306 GHNAAETPLTVPGATEVANNVLALYMQDRYLGKMNRVADDITVAPEYLEESNNQAWARGG 1365 GH T EV NN+ A +Q + +L + + Sbjct: 270 GHGYQ-NDFT---RGEVWNNIYAHSLQKKDPNV-------NIFKSGWLYDYGRKDAVDNN 318 Query: 1366 AGDRLLMYAQLKEWAEKNFDIKKWYPDGTPLPEFYSEREGMKGWNLFQLMHRKARGDEVS 1425 K W + W Y + G + F +++ R Sbjct: 319 VN---------KLWHQDKAAFNTWGLREQLYG--YVLMKDKAGDDSFTHFNQEYR----- 362 Query: 1426 NDKFGGKNYCAESNGNAADTLMLCASWVAQTDLSEFFKKWNPGANAYQLPGASEMSFEGG 1485 K S+ D + +++ D + + + ++Q +++ Sbjct: 363 --KLANTPGFNTSDYKQFDLISKAYGEISKLDFTPVIESFGGVMTSWQKEENRYKNYKP- 419 Query: 1486 VSQSAYNTLASLDLPKPEQGPETINQ 1511 +A L+ P I Q Sbjct: 420 --------VAPLNEVVPTSQVSQIQQ 437 >UniRef50_B1V638 LRR adjacent family n=3 Tax=Clostridium perfringens D str. JGS1721 RepID=B1V638_CLOPE Length = 1829 Score = 73.4 bits (178), Expect = 7e-11, Method: Composition-based stats. Identities = 74/425 (17%), Positives = 118/425 (27%), Gaps = 80/425 (18%) Query: 1077 KWFAGNMQSTGLWAPAQKEVTI-KSNANVPVTVTVALADDLTGREKHEVALNRPPRVTKT 1135 + + G PA E I + N N+ +T+ L +D + E VT + Sbjct: 71 RGNYHGREPLGFILPANVEFQIRQVNTNLNKNLTLDLFNDDSNTENSITIPKNGEWVTVS 130 Query: 1136 YSLDASGTVKFKVPYGGLIYIKGNSSTNESASFTFTGVVKAPFYKDG--------AWKND 1187 + P+ I E V K P Y+ G W N Sbjct: 131 -------SASLSAPF---IRKTSGEQVPEVEFILRGEVDKLPIYRTGMNESEFFKEWDNL 180 Query: 1188 LNSPAPLGELESDAFVYTTPK---KNLNASNYTGGLEQFANDLDTFASSMNDFYGRDSED 1244 ++ +E+D PK K+L N +++ D ++ + G + Sbjct: 181 DSN---FAFIENDRVQMLIPKVDKKSLRNMNDFSSIDELFKFYDEMFNTYDRLLGLEKNS 237 Query: 1245 GKHRMFTYKNLPGHKHRFTNDVQISIGDAH--SGYPVMNSSFSPNSTTLPTTPLNDWLIW 1302 + F + G A+ S Y NS WL Sbjct: 238 ENPL-----HNNVDTQYFIKADKHGAGAAYYSSNYTAQNSDSMSAYL------EKGWLPL 286 Query: 1303 HEVGHNAAETPLTVPGAT--EVANNVLALYMQDRYLGKMN---RVADDITVAPEYLEESN 1357 HEVGH + G +V NN+LA Q YL D + Sbjct: 287 HEVGHGYEY-DIKNKGLYLVDVFNNILAHTYQSTYLKNDEGWLFNGDRLGRDKSMKNVRE 345 Query: 1358 NQAWARGGAGDRLLMYAQLKEWAEKNFDIKKWYPDGTPLPEFYSEREGMKGWNLFQLMHR 1417 N ++ D+L M+ L ++ G F +R Sbjct: 346 NGSYDSANYQDKLGMFIYLLDFI---------------------------GEEKFAEFNR 378 Query: 1418 KARGDEVSNDKFGGKNYCAESNGNAADTLMLCASWVAQTDLSEFFKKWNPGANAYQLPGA 1477 R S + N NA L S V + E+F + + Sbjct: 379 LYREKNNSGE---------LRNKNAITILNELLSEVTGYNFKEYFNSYKLIDSNNLFNNE 429 Query: 1478 SEMSF 1482 + Sbjct: 430 KYSKY 434 >UniRef50_Q8EVX8 Putative integral membrane protein n=1 Tax=Mycoplasma penetrans RepID=Q8EVX8_MYCPE Length = 832 Score = 72.6 bits (176), Expect = 1e-10, Method: Composition-based stats. Identities = 37/232 (15%), Positives = 67/232 (28%), Gaps = 7/232 (3%) Query: 1233 SMNDFYGRDSEDGKHRMFTYKNLPGHKHRFTNDVQISIGDAHSGYPVMNSSFSPNSTTL- 1291 + D D+ ++ +K H H ++ + Y S+S + Sbjct: 2 DLYDKMALDALYTINKNAVWKQSYRHIHSNFVAYGAAVSFVGASYINSPWSWSASILNYE 61 Query: 1292 PTTPLNDWLIWHEVGHNAAETPLTVPGATEVANNVLAL--YMQDRYLGKMNRVADDITVA 1349 W + HE H G+ EV NNVL + Y+ + + Sbjct: 62 NVLNEGSWGVMHENNHQNQ-GSWGYTGSGEVTNNVLTIASYLNFTNVSNNRYKWINYNAT 120 Query: 1350 PEYLEESNNQAWARGGAGDRLLMYAQLKEWAEKNFDIKKWYPDGTPLPEFYSEREGMKGW 1409 + + N + + Y L E+ N K + YS G Sbjct: 121 DGKISKRGNNHIYKLNGYLNVDTY--LDEYVNDNNSPDKNKAREGIDYD-YSVIMSHFGT 177 Query: 1410 NLFQLMHRKARGDEVSNDKFGGKNYCAESNGNAADTLMLCASWVAQTDLSEF 1461 + FQ + R V++ + + + S V D E+ Sbjct: 178 STFQKIIRSYSDSTVTDFEGNKITIPESIKNDRNAIFVYRISAVTGYDWIEY 229 >UniRef50_B4FNV9 Early nodulin 75 protein n=2 Tax=Zea mays RepID=B4FNV9_MAIZE Length = 279 Score = 71.4 bits (173), Expect = 3e-10, Method: Composition-based stats. Identities = 27/109 (24%), Positives = 37/109 (33%), Gaps = 32/109 (29%) Query: 27 GGSGSSSDTPPVDSGTGSLPEVKPDPTPNPEP---------------------------- 58 P D P++KP+P P P+P Sbjct: 82 DTKSDPKPAPQSDPKPAPQPDLKPEPKPTPQPDPKPSPQPDPEPKPKPAPQPEPKPEPKP 141 Query: 59 ----TPEPTPDPEPTPEPIPDPEPTPEPEPEPVPTKTGYLTLGGSQRVT 103 P+P P P+P PEP P P+P PEP+P+P P T + Sbjct: 142 TPQPDPKPGPQPDPKPEPKPTPQPGPEPKPKPAPQPEPKPTPQPDPKPE 190 Score = 68.0 bits (164), Expect = 3e-09, Method: Composition-based stats. Identities = 26/49 (53%), Positives = 32/49 (65%) Query: 39 DSGTGSLPEVKPDPTPNPEPTPEPTPDPEPTPEPIPDPEPTPEPEPEPV 87 D G P+ KP+P P P+P PEP P P P PEP P P+P P+PEP+P Sbjct: 146 DPKPGPQPDPKPEPKPTPQPGPEPKPKPAPQPEPKPTPQPDPKPEPKPT 194 Score = 66.0 bits (159), Expect = 1e-08, Method: Composition-based stats. Identities = 25/100 (25%), Positives = 32/100 (32%), Gaps = 26/100 (26%) Query: 29 SGSSSDTPPVDSGTGSLPEVKPDPTPN--------------------------PEPTPEP 62 P D P +PDP P+ P+P P P Sbjct: 92 QSDPKPAPQPDLKPEPKPTPQPDPKPSPQPDPEPKPKPAPQPEPKPEPKPTPQPDPKPGP 151 Query: 63 TPDPEPTPEPIPDPEPTPEPEPEPVPTKTGYLTLGGSQRV 102 PDP+P P+P P P P P+P+P P P Sbjct: 152 QPDPKPEPKPTPQPGPEPKPKPAPQPEPKPTPQPDPKPEP 191 Score = 65.3 bits (157), Expect = 2e-08, Method: Composition-based stats. Identities = 27/93 (29%), Positives = 35/93 (37%), Gaps = 24/93 (25%) Query: 27 GGSGSSSDTPPVDSGTGSLP------------------------EVKPDPTPNPEPTPEP 62 TP D P + KP P P+P+P P+P Sbjct: 102 DLKPEPKPTPQPDPKPSPQPDPEPKPKPAPQPEPKPEPKPTPQPDPKPGPQPDPKPEPKP 161 Query: 63 TPDPEPTPEPIPDPEPTPEPEPEPVPTKTGYLT 95 TP P P P+P P P+P P+P P+P P T Sbjct: 162 TPQPGPEPKPKPAPQPEPKPTPQPDPKPEPKPT 194 Score = 51.0 bits (120), Expect = 3e-04, Method: Composition-based stats. Identities = 23/96 (23%), Positives = 32/96 (33%), Gaps = 30/96 (31%) Query: 37 PVDSGTGSLPEVKPDPTPNPEPTPEPTPDPEPTPEPIPDPE------------------- 77 D+ + P + DP P P+P +P P P P P+P P P+ Sbjct: 80 QSDTKSDPKPAPQSDPKPAPQPDLKPEPKPTPQPDPKPSPQPDPEPKPKPAPQPEPKPEP 139 Query: 78 -----------PTPEPEPEPVPTKTGYLTLGGSQRV 102 P P+P+PEP PT Sbjct: 140 KPTPQPDPKPGPQPDPKPEPKPTPQPGPEPKPKPAP 175 Score = 49.5 bits (116), Expect = 0.001, Method: Composition-based stats. Identities = 24/89 (26%), Positives = 34/89 (38%), Gaps = 16/89 (17%) Query: 23 GCDGGGSGSSSDTPPVDSGTGSLPEVKPDPTPNPEPTPEPTPDPEPTPEPIPDPEPTPEP 82 G TP P +P+P P P+P P+P P P P P+P P+P+PTP+P Sbjct: 150 GPQPDPKPEPKPTPQPGPEPKPKPAPQPEPKPTPQPDPKPEPKPTPQPDPKPEPKPTPQP 209 Query: 83 EP----------------EPVPTKTGYLT 95 +P P + Sbjct: 210 PFPQPEPQPDPKPQPEPSKPDPKPQPEPS 238 Score = 44.5 bits (103), Expect = 0.037, Method: Composition-based stats. Identities = 20/87 (22%), Positives = 29/87 (33%), Gaps = 28/87 (32%) Query: 45 LPEVKPDPTPNPEPTPEPTPDPEPTPEPIPDPEPTPEPE--------------------- 83 + K DP P P+ P+P P P+ PEP P P+P P+P Sbjct: 80 QSDTKSDPKPAPQSDPKPAPQPDLKPEPKPTPQPDPKPSPQPDPEPKPKPAPQPEPKPEP 139 Query: 84 -------PEPVPTKTGYLTLGGSQRVT 103 P+P P + + Sbjct: 140 KPTPQPDPKPGPQPDPKPEPKPTPQPG 166 Score = 43.3 bits (100), Expect = 0.085, Method: Composition-based stats. Identities = 21/70 (30%), Positives = 30/70 (42%) Query: 26 GGGSGSSSDTPPVDSGTGSLPEVKPDPTPNPEPTPEPTPDPEPTPEPIPDPEPTPEPEPE 85 G + P PE KP+P P P+ + P P P +P P P+P +PEP+ Sbjct: 49 GVSQPDRNQEPKPTPQPEPKPEPKPEPKPAPQSDTKSDPKPAPQSDPKPAPQPDLKPEPK 108 Query: 86 PVPTKTGYLT 95 P P + Sbjct: 109 PTPQPDPKPS 118 >UniRef50_A0YJH0 Polymorphic membrane protein n=1 Tax=Lyngbya sp. PCC 8106 RepID=A0YJH0_9CYAN Length = 2103 Score = 67.6 bits (163), Expect = 4e-09, Method: Composition-based stats. Identities = 30/93 (32%), Positives = 36/93 (38%), Gaps = 18/93 (19%) Query: 29 SGSSSDTPPVDSGTGSLPEVKPDPTPNPEPTPEPTPDPEPTPEPIPDPEP---------- 78 S P P ++P PTP E PEP+ +P PT EP P+P Sbjct: 1547 IPEPSVEPTPTIEPTPEPSIEPTPTPTIELIPEPSIEPTPTIEPTPEPSIEPTPTPTFEP 1606 Query: 79 --------TPEPEPEPVPTKTGYLTLGGSQRVT 103 TPEP EP PT T L S +T Sbjct: 1607 TPTPTIELTPEPSIEPTPTPTIELIPEPSIELT 1639 Score = 54.1 bits (128), Expect = 5e-05, Method: Composition-based stats. Identities = 20/87 (22%), Positives = 26/87 (29%), Gaps = 10/87 (11%) Query: 27 GGSGSSSDTPPVDSGTG--SLPEVKPDPTPNPEPTPEPTPDPEPTP--------EPIPDP 76 + S P P ++P PT P P P P P PT E P+P Sbjct: 1559 EPTPEPSIEPTPTPTIELIPEPSIEPTPTIEPTPEPSIEPTPTPTFEPTPTPTIELTPEP 1618 Query: 77 EPTPEPEPEPVPTKTGYLTLGGSQRVT 103 P P P + L + Sbjct: 1619 SIEPTPTPTIELIPEPSIELTPTPTFE 1645 Score = 51.4 bits (121), Expect = 3e-04, Method: Composition-based stats. Identities = 25/73 (34%), Positives = 30/73 (41%) Query: 27 GGSGSSSDTPPVDSGTGSLPEVKPDPTPNPEPTPEPTPDPEPTPEPIPDPEPTPEPEPEP 86 + TP P ++ P P+ EPTP P PEP+ EP P P PEP Sbjct: 1523 DDIPTVEPTPIPTIEPTPTPTIELIPEPSVEPTPTIEPTPEPSIEPTPTPTIELIPEPSI 1582 Query: 87 VPTKTGYLTLGGS 99 PT T T S Sbjct: 1583 EPTPTIEPTPEPS 1595 Score = 44.9 bits (104), Expect = 0.024, Method: Composition-based stats. Identities = 21/82 (25%), Positives = 26/82 (31%), Gaps = 16/82 (19%) Query: 37 PVDSGTGSLPEVKP----------------DPTPNPEPTPEPTPDPEPTPEPIPDPEPTP 80 + P V+P + P P PTP EP+ EP P+P Sbjct: 1667 ELIPTPTIEPSVEPTPTPFLEPTPTPTPTIELIPESFIEPTPTPTFEPSVEPTPEPSIEL 1726 Query: 81 EPEPEPVPTKTGYLTLGGSQRV 102 P P P T LT Q Sbjct: 1727 TPTPTIEPIPTPQLTFEPQQPT 1748 >UniRef50_A9B8D9 Rhs element Vgr protein n=1 Tax=Herpetosiphon aurantiacus ATCC 23779 RepID=A9B8D9_HERA2 Length = 258 Score = 66.8 bits (161), Expect = 7e-09, Method: Composition-based stats. Identities = 25/76 (32%), Positives = 34/76 (44%) Query: 27 GGSGSSSDTPPVDSGTGSLPEVKPDPTPNPEPTPEPTPDPEPTPEPIPDPEPTPEPEPEP 86 G + TP + P+ +P TP PEPT P P P+PT P P+P P P+P+P Sbjct: 167 GTQPEPTQTPRSEPTANPYPQPQPTRTPRPEPTANPYPQPQPTRTPRPEPTANPYPQPQP 226 Query: 87 VPTKTGYLTLGGSQRV 102 T T + Sbjct: 227 TRTPRPEPTANPYPQP 242 Score = 64.5 bits (155), Expect = 3e-08, Method: Composition-based stats. Identities = 24/76 (31%), Positives = 33/76 (43%) Query: 25 DGGGSGSSSDTPPVDSGTGSLPEVKPDPTPNPEPTPEPTPDPEPTPEPIPDPEPTPEPEP 84 + + TP + P+ +P TP PEPT P P P+PT P P+P P P+P Sbjct: 183 NPYPQPQPTRTPRPEPTANPYPQPQPTRTPRPEPTANPYPQPQPTRTPRPEPTANPYPQP 242 Query: 85 EPVPTKTGYLTLGGSQ 100 +P T T Sbjct: 243 QPTRTPRPEPTQHPYP 258 Score = 61.0 bits (146), Expect = 4e-07, Method: Composition-based stats. Identities = 21/75 (28%), Positives = 24/75 (32%) Query: 27 GGSGSSSDTPPVDSGTGSLPEVKPDPTPNPEPTPEPTPDPEPTPEPIPDPEPTPEPEPEP 86 + + P P P P P P TP P P P P+P P P PEP P Sbjct: 179 EPTANPYPQPQPTRTPRPEPTANPYPQPQPTRTPRPEPTANPYPQPQPTRTPRPEPTANP 238 Query: 87 VPTKTGYLTLGGSQR 101 P T Sbjct: 239 YPQPQPTRTPRPEPT 253 Score = 60.6 bits (145), Expect = 5e-07, Method: Composition-based stats. Identities = 23/77 (29%), Positives = 28/77 (36%) Query: 30 GSSSDTPPVDSGTGSLPEVKPDPTPNPEPTPEPTPDPEPTPEPIPDPEPTPEPEPEPVPT 89 + P P +P P P+P P TP PEPT P P P+PT P PEP Sbjct: 178 SEPTANPYPQPQPTRTPRPEPTANPYPQPQPTRTPRPEPTANPYPQPQPTRTPRPEPTAN 237 Query: 90 KTGYLTLGGSQRVTGAT 106 + R Sbjct: 238 PYPQPQPTRTPRPEPTQ 254 Score = 59.1 bits (141), Expect = 1e-06, Method: Composition-based stats. Identities = 25/85 (29%), Positives = 29/85 (34%) Query: 18 ATLLAGCDGGGSGSSSDTPPVDSGTGSLPEVKPDPTPNPEPTPEPTPDPEPTPEPIPDPE 77 TL+ G ++ P S P P P P P TP P P P P+P P Sbjct: 152 PTLVIQPTAGVQPTTGTQPEPTQTPRSEPTANPYPQPQPTRTPRPEPTANPYPQPQPTRT 211 Query: 78 PTPEPEPEPVPTKTGYLTLGGSQRV 102 P PEP P P T Sbjct: 212 PRPEPTANPYPQPQPTRTPRPEPTA 236 Score = 54.5 bits (129), Expect = 3e-05, Method: Composition-based stats. Identities = 23/70 (32%), Positives = 27/70 (38%) Query: 34 DTPPVDSGTGSLPEVKPDPTPNPEPTPEPTPDPEPTPEPIPDPEPTPEPEPEPVPTKTGY 93 T P +P P P+P P TP PEPT P P P+PT P PEP Sbjct: 164 PTTGTQPEPTQTPRSEPTANPYPQPQPTRTPRPEPTANPYPQPQPTRTPRPEPTANPYPQ 223 Query: 94 LTLGGSQRVT 103 + R Sbjct: 224 PQPTRTPRPE 233 Score = 54.5 bits (129), Expect = 3e-05, Method: Composition-based stats. Identities = 22/75 (29%), Positives = 31/75 (41%), Gaps = 2/75 (2%) Query: 30 GSSSDTPPVDSGTGSLPEVKPDPTPNP--EPTPEPTPDPEPTPEPIPDPEPTPEPEPEPV 87 + +G +P+PT P EPT P P P+PT P P+P P P+P+P Sbjct: 150 SQPTLVIQPTAGVQPTTGTQPEPTQTPRSEPTANPYPQPQPTRTPRPEPTANPYPQPQPT 209 Query: 88 PTKTGYLTLGGSQRV 102 T T + Sbjct: 210 RTPRPEPTANPYPQP 224 Score = 52.6 bits (124), Expect = 1e-04, Method: Composition-based stats. Identities = 21/76 (27%), Positives = 25/76 (32%) Query: 28 GSGSSSDTPPVDSGTGSLPEVKPDPTPNPEPTPEPTPDPEPTPEPIPDPEPTPEPEPEPV 87 + T P PT +P P TP EPT P P P+PT P PEP Sbjct: 140 VNPYPQPTTGSQPTLVIQPTAGVQPTTGTQPEPTQTPRSEPTANPYPQPQPTRTPRPEPT 199 Query: 88 PTKTGYLTLGGSQRVT 103 + R Sbjct: 200 ANPYPQPQPTRTPRPE 215 Score = 49.9 bits (117), Expect = 8e-04, Method: Composition-based stats. Identities = 16/73 (21%), Positives = 24/73 (32%) Query: 29 SGSSSDTPPVDSGTGSLPEVKPDPTPNPEPTPEPTPDPEPTPEPIPDPEPTPEPEPEPVP 88 + P +G+ ++P P +P P P EP +P P P+P P P Sbjct: 137 IPPVNPYPQPTTGSQPTLVIQPTAGVQPTTGTQPEPTQTPRSEPTANPYPQPQPTRTPRP 196 Query: 89 TKTGYLTLGGSQR 101 T Sbjct: 197 EPTANPYPQPQPT 209 Score = 49.1 bits (115), Expect = 0.001, Method: Composition-based stats. Identities = 21/78 (26%), Positives = 27/78 (34%), Gaps = 4/78 (5%) Query: 30 GSSSDTPPVDSGTGSLPEVKPDPTPN----PEPTPEPTPDPEPTPEPIPDPEPTPEPEPE 85 + TP ++P V P P P P +PT +PT P+P TP EP Sbjct: 122 PEPTLTPRPAPTATNIPPVNPYPQPTTGSQPTLVIQPTAGVQPTTGTQPEPTQTPRSEPT 181 Query: 86 PVPTKTGYLTLGGSQRVT 103 P T T Sbjct: 182 ANPYPQPQPTRTPRPEPT 199 Score = 48.3 bits (113), Expect = 0.002, Method: Composition-based stats. Identities = 20/76 (26%), Positives = 26/76 (34%) Query: 30 GSSSDTPPVDSGTGSLPEVKPDPTPNPEPTPEPTPDPEPTPEPIPDPEPTPEPEPEPVPT 89 P TGS P + PT +PT P+P TP P P P+P+P P Sbjct: 136 NIPPVNPYPQPTTGSQPTLVIQPTAGVQPTTGTQPEPTQTPRSEPTANPYPQPQPTRTPR 195 Query: 90 KTGYLTLGGSQRVTGA 105 + T Sbjct: 196 PEPTANPYPQPQPTRT 211 Score = 44.9 bits (104), Expect = 0.025, Method: Composition-based stats. Identities = 16/70 (22%), Positives = 21/70 (30%) Query: 33 SDTPPVDSGTGSLPEVKPDPTPNPEPTPEPTPDPEPTPEPIPDPEPTPEPEPEPVPTKTG 92 + P P P NP P P P +P +PT +PEP T Sbjct: 119 TPRPEPTLTPRPAPTATNIPPVNPYPQPTTGSQPTLVIQPTAGVQPTTGTQPEPTQTPRS 178 Query: 93 YLTLGGSQRV 102 T + Sbjct: 179 EPTANPYPQP 188 >UniRef50_A2D8B9 Megakaryocyte stimulating factor, putative n=1 Tax=Trichomonas vaginalis RepID=A2D8B9_TRIVA Length = 563 Score = 65.3 bits (157), Expect = 2e-08, Method: Composition-based stats. Identities = 25/82 (30%), Positives = 30/82 (36%), Gaps = 2/82 (2%) Query: 26 GGGSGSSSDTPPVDSG--TGSLPEVKPDPTPNPEPTPEPTPDPEPTPEPIPDPEPTPEPE 83 G + TP P P P P P P+PT P P P P P+PT P Sbjct: 354 GTPIPKPTATPIPKPTGTPIPKPTATPIPKPTATPIPKPTATPIPKPTGTPIPKPTATPI 413 Query: 84 PEPVPTKTGYLTLGGSQRVTGA 105 P+P T T + TG Sbjct: 414 PKPTATPIPKPTATPMPKPTGT 435 Score = 64.5 bits (155), Expect = 4e-08, Method: Composition-based stats. Identities = 23/77 (29%), Positives = 31/77 (40%) Query: 29 SGSSSDTPPVDSGTGSLPEVKPDPTPNPEPTPEPTPDPEPTPEPIPDPEPTPEPEPEPVP 88 +++ P + P P P P P P+PT P P P P P+PT P P+P Sbjct: 327 KPTATPIPKPTATPIPKPTATPMPKPTGTPIPKPTATPIPKPTGTPIPKPTATPIPKPTA 386 Query: 89 TKTGYLTLGGSQRVTGA 105 T T + TG Sbjct: 387 TPIPKPTATPIPKPTGT 403 Score = 63.7 bits (153), Expect = 5e-08, Method: Composition-based stats. Identities = 22/77 (28%), Positives = 29/77 (37%) Query: 29 SGSSSDTPPVDSGTGSLPEVKPDPTPNPEPTPEPTPDPEPTPEPIPDPEPTPEPEPEPVP 88 +++ P P P P P P P+PT P P P P P+PT P P+P Sbjct: 343 KPTATPMPKPTGTPIPKPTATPIPKPTGTPIPKPTATPIPKPTATPIPKPTATPIPKPTG 402 Query: 89 TKTGYLTLGGSQRVTGA 105 T T + T Sbjct: 403 TPIPKPTATPIPKPTAT 419 Score = 63.3 bits (152), Expect = 6e-08, Method: Composition-based stats. Identities = 22/77 (28%), Positives = 29/77 (37%) Query: 29 SGSSSDTPPVDSGTGSLPEVKPDPTPNPEPTPEPTPDPEPTPEPIPDPEPTPEPEPEPVP 88 + + P + P P P P P P+PT P P P P P+PT P P+P Sbjct: 351 KPTGTPIPKPTATPIPKPTGTPIPKPTATPIPKPTATPIPKPTATPIPKPTGTPIPKPTA 410 Query: 89 TKTGYLTLGGSQRVTGA 105 T T + T Sbjct: 411 TPIPKPTATPIPKPTAT 427 Score = 62.6 bits (150), Expect = 1e-07, Method: Composition-based stats. Identities = 22/77 (28%), Positives = 29/77 (37%) Query: 29 SGSSSDTPPVDSGTGSLPEVKPDPTPNPEPTPEPTPDPEPTPEPIPDPEPTPEPEPEPVP 88 + + P + P P P P P P+PT P P P P P+PT P P+P Sbjct: 367 KPTGTPIPKPTATPIPKPTATPIPKPTATPIPKPTGTPIPKPTATPIPKPTATPIPKPTA 426 Query: 89 TKTGYLTLGGSQRVTGA 105 T T + T Sbjct: 427 TPMPKPTGTPIPKPTAT 443 Score = 62.6 bits (150), Expect = 1e-07, Method: Composition-based stats. Identities = 25/78 (32%), Positives = 29/78 (37%), Gaps = 4/78 (5%) Query: 26 GGGSGSSSDTPPVDSGTGSLP----EVKPDPTPNPEPTPEPTPDPEPTPEPIPDPEPTPE 81 G + TP +P P PT P P P TP P+PT PIP P TP Sbjct: 370 GTPIPKPTATPIPKPTATPIPKPTATPIPKPTGTPIPKPTATPIPKPTATPIPKPTATPM 429 Query: 82 PEPEPVPTKTGYLTLGGS 99 P+P P T Sbjct: 430 PKPTGTPIPKPTATPIPK 447 Score = 62.2 bits (149), Expect = 1e-07, Method: Composition-based stats. Identities = 22/77 (28%), Positives = 30/77 (38%) Query: 29 SGSSSDTPPVDSGTGSLPEVKPDPTPNPEPTPEPTPDPEPTPEPIPDPEPTPEPEPEPVP 88 +++ P + P P P P P P+PT P P P P P+PT P P+P Sbjct: 335 KPTATPIPKPTATPMPKPTGTPIPKPTATPIPKPTGTPIPKPTATPIPKPTATPIPKPTA 394 Query: 89 TKTGYLTLGGSQRVTGA 105 T T + T Sbjct: 395 TPIPKPTGTPIPKPTAT 411 Score = 62.2 bits (149), Expect = 1e-07, Method: Composition-based stats. Identities = 23/73 (31%), Positives = 27/73 (36%) Query: 30 GSSSDTPPVDSGTGSLPEVKPDPTPNPEPTPEPTPDPEPTPEPIPDPEPTPEPEPEPVPT 89 T P KP TP P+PT P P P TP P P P P+P P+P Sbjct: 340 PIPKPTATPMPKPTGTPIPKPTATPIPKPTGTPIPKPTATPIPKPTATPIPKPTATPIPK 399 Query: 90 KTGYLTLGGSQRV 102 TG + Sbjct: 400 PTGTPIPKPTATP 412 Score = 62.2 bits (149), Expect = 1e-07, Method: Composition-based stats. Identities = 22/73 (30%), Positives = 27/73 (36%) Query: 30 GSSSDTPPVDSGTGSLPEVKPDPTPNPEPTPEPTPDPEPTPEPIPDPEPTPEPEPEPVPT 89 T + P KP TP P+PT P P P TP P P P P+P P+P Sbjct: 348 PMPKPTGTPIPKPTATPIPKPTGTPIPKPTATPIPKPTATPIPKPTATPIPKPTGTPIPK 407 Query: 90 KTGYLTLGGSQRV 102 T + Sbjct: 408 PTATPIPKPTATP 420 Score = 61.0 bits (146), Expect = 4e-07, Method: Composition-based stats. Identities = 23/81 (28%), Positives = 28/81 (34%), Gaps = 4/81 (4%) Query: 29 SGSSSDTPPVDSGTGSLPEVKPDPTPNPEPTPEPTPDPEPTPE----PIPDPEPTPEPEP 84 + TP +P+ P P P TP P P P P+ PIP P TP P+P Sbjct: 341 IPKPTATPMPKPTGTPIPKPTATPIPKPTGTPIPKPTATPIPKPTATPIPKPTATPIPKP 400 Query: 85 EPVPTKTGYLTLGGSQRVTGA 105 P T T Sbjct: 401 TGTPIPKPTATPIPKPTATPI 421 Score = 60.3 bits (144), Expect = 5e-07, Method: Composition-based stats. Identities = 22/74 (29%), Positives = 27/74 (36%) Query: 29 SGSSSDTPPVDSGTGSLPEVKPDPTPNPEPTPEPTPDPEPTPEPIPDPEPTPEPEPEPVP 88 + TP +P+ P P P TP P P P P+P P P P P P P Sbjct: 365 IPKPTGTPIPKPTATPIPKPTATPIPKPTATPIPKPTGTPIPKPTATPIPKPTATPIPKP 424 Query: 89 TKTGYLTLGGSQRV 102 T T G+ Sbjct: 425 TATPMPKPTGTPIP 438 Score = 59.9 bits (143), Expect = 9e-07, Method: Composition-based stats. Identities = 23/81 (28%), Positives = 28/81 (34%), Gaps = 4/81 (4%) Query: 29 SGSSSDTPPVDSGTGSLPEVKPDPTPNPEPTPEPTPDPEPTPE----PIPDPEPTPEPEP 84 + TP +P+ P P P TP P P P P+ PIP P TP P+P Sbjct: 333 IPKPTATPIPKPTATPMPKPTGTPIPKPTATPIPKPTGTPIPKPTATPIPKPTATPIPKP 392 Query: 85 EPVPTKTGYLTLGGSQRVTGA 105 P T T Sbjct: 393 TATPIPKPTGTPIPKPTATPI 413 Score = 46.8 bits (109), Expect = 0.007, Method: Composition-based stats. Identities = 24/78 (30%), Positives = 31/78 (39%) Query: 28 GSGSSSDTPPVDSGTGSLPEVKPDPTPNPEPTPEPTPDPEPTPEPIPDPEPTPEPEPEPV 87 S T +S + S + P P P P P+PT P P P P P+PT P P+P Sbjct: 302 SIEPSETTSSTESSSSSEIPIPPIPKPTATPIPKPTATPIPKPTATPMPKPTGTPIPKPT 361 Query: 88 PTKTGYLTLGGSQRVTGA 105 T T + T Sbjct: 362 ATPIPKPTGTPIPKPTAT 379 Score = 45.6 bits (106), Expect = 0.016, Method: Composition-based stats. Identities = 19/54 (35%), Positives = 22/54 (40%) Query: 30 GSSSDTPPVDSGTGSLPEVKPDPTPNPEPTPEPTPDPEPTPEPIPDPEPTPEPE 83 T + P KP TP P+PT P P P TP P P P P+P Sbjct: 396 PIPKPTGTPIPKPTATPIPKPTATPIPKPTATPMPKPTGTPIPKPTATPIPKPT 449 Score = 45.2 bits (105), Expect = 0.018, Method: Composition-based stats. Identities = 21/79 (26%), Positives = 31/79 (39%) Query: 27 GGSGSSSDTPPVDSGTGSLPEVKPDPTPNPEPTPEPTPDPEPTPEPIPDPEPTPEPEPEP 86 + ++S P ++ + + + P P P TP P+PT PIP P TP P+P Sbjct: 295 ESNETTSSIEPSETTSSTESSSSSEIPIPPIPKPTATPIPKPTATPIPKPTATPMPKPTG 354 Query: 87 VPTKTGYLTLGGSQRVTGA 105 P T T Sbjct: 355 TPIPKPTATPIPKPTGTPI 373 >UniRef50_Q7U5X7 Putative uncharacterized protein n=1 Tax=Synechococcus sp. WH 8102 RepID=Q7U5X7_SYNPX Length = 1154 Score = 64.5 bits (155), Expect = 3e-08, Method: Composition-based stats. Identities = 44/232 (18%), Positives = 72/232 (31%), Gaps = 9/232 (3%) Query: 41 GTGSLPEVKPDPTPNPEPTPEPTPDPEPTPEPIPDPEPTPEPEPEPVPTKTGYLTLGGSQ 100 G PE P P+ P P P P PTPE P P TP PE PVP+ T + Sbjct: 518 GPTPTPESTPIPSATPTPESTPIPSATPTPESTPIPSATPTPESAPVPSATPTPESTPAP 577 Query: 101 RVT-GATCNGESSDGFTFKPGEDVTCVAGNTTIATFNTQSEAARSLRAVEKVSFSLEDAQ 159 T + + E D F V G+ + + ++ K S D + Sbjct: 578 SATPDPSIDQELIDDFANNSSTSGVVVIGDVLYGNLESIYDDDWFKVSLTKGSVYRFDLE 637 Query: 160 ELAGSDDKKS----NAVSLVTSSNSCPANTEQVCLTFSSVIESKRFDSLYKQIDLAPEEF 215 + +D + + N L + + T +S + + + Sbjct: 638 GIQLNDPRMTLYGSNLKELTYDDDGGSGYDSLIEFTATSSDNYFISAKSWGETGTYTLKA 697 Query: 216 KKLVNEEVENNAATDKAP---STHTSPVVPVTTPGTKPDLNASFVSANAEQF 264 + A + P +T T P + P ++ + A Sbjct: 698 TD-ITPAPSATPAPEPTPVPSATPTPESTPAPSATPDPSIDQELIDDFANNS 748 >UniRef50_B7P4H6 Putative uncharacterized protein n=1 Tax=Ixodes scapularis RepID=B7P4H6_IXOSC Length = 1255 Score = 63.0 bits (151), Expect = 9e-08, Method: Composition-based stats. Identities = 18/79 (22%), Positives = 31/79 (39%) Query: 27 GGSGSSSDTPPVDSGTGSLPEVKPDPTPNPEPTPEPTPDPEPTPEPIPDPEPTPEPEPEP 86 + S P + +PE +P P P P P P EP P+ + + + P +P P Sbjct: 313 EPAPEPSAEPVPEPSAEPVPEPSAEPAPQPSAEPAPEPSAEPAPKTLAEFQAKPTSKPNP 372 Query: 87 VPTKTGYLTLGGSQRVTGA 105 P+ + +Q+ Sbjct: 373 EPSPEPHAPSEHTQKPESE 391 Score = 57.6 bits (137), Expect = 3e-06, Method: Composition-based stats. Identities = 20/114 (17%), Positives = 28/114 (24%) Query: 27 GGSGSSSDTPPVDSGTGSLPEVKPDPTPNPEPTPEPTPDPEPTPEPIPDPEPTPEPEPEP 86 S P + E P P+ P P P P P+ E P P PEP P Sbjct: 317 EPSAEPVPEPSAEPVPEPSAEPAPQPSAEPAPEPSAEPAPKTLAEFQAKPTSKPNPEPSP 376 Query: 87 VPTKTGYLTLGGSQRVTGATCNGESSDGFTFKPGEDVTCVAGNTTIATFNTQSE 140 P T + E ++ + + Sbjct: 377 EPHAPSEHTQKPESEPSAEPEPQPEPKPEPEPKPEPQPATPEGRSLGPHDHRYP 430 Score = 47.5 bits (111), Expect = 0.004, Method: Composition-based stats. Identities = 21/118 (17%), Positives = 28/118 (23%), Gaps = 47/118 (39%) Query: 32 SSDTPPVDSGTGSLPEVKP----------------------------------------- 50 PP P+ +P Sbjct: 237 PRTEPPPVIKPT-EPKAEPVAEPTPEPTAEPTPEPAPEPSPEPAAEPSPEPSSEPTPEPS 295 Query: 51 -----DPTPNPEPTPEPTPDPEPTPEPIPDPEPTPEPEPEPVPTKTGYLTLGGSQRVT 103 +P+ P P P P PEP+ EP+P+P P PEP P Sbjct: 296 AEPVLEPSAKPTPEPSAEPAPEPSAEPVPEPSAEPVPEPSAEPAPQPSAEPAPEPSAE 353 Score = 47.5 bits (111), Expect = 0.005, Method: Composition-based stats. Identities = 22/135 (16%), Positives = 31/135 (22%), Gaps = 47/135 (34%) Query: 16 LSATLLAGCDGGGSGSSSDTPP-------------------------------------- 37 L TLL+ P Sbjct: 227 LRTTLLSSLKPRTEPPPVIKPTEPKAEPVAEPTPEPTAEPTPEPAPEPSPEPAAEPSPEP 286 Query: 38 ---------VDSGTGSLPEVKPDPTPNPEPTPEPTPDPEPTPEPIPDPEPTPEPEPEPVP 88 + + P+P+ P P P P PEP+ EP+P+P P P+P P Sbjct: 287 SSEPTPEPSAEPVLEPSAKPTPEPSAEPAPEPSAEPVPEPSAEPVPEPSAEPAPQPSAEP 346 Query: 89 TKTGYLTLGGSQRVT 103 Sbjct: 347 APEPSAEPAPKTLAE 361 >UniRef50_D2NSM0 Putative uncharacterized protein n=1 Tax=Rothia mucilaginosa DY-18 RepID=D2NSM0_9MICC Length = 586 Score = 56.0 bits (133), Expect = 1e-05, Method: Composition-based stats. Identities = 24/71 (33%), Positives = 28/71 (39%) Query: 27 GGSGSSSDTPPVDSGTGSLPEVKPDPTPNPEPTPEPTPDPEPTPEPIPDPEPTPEPEPEP 86 + + TP V + P V P TP EPT PT +P P P PTPEP P Sbjct: 159 DATAEPTATPTVAPSASAEPSVAPTATPTAEPTVAPTAEPTVQPTAEPTVAPTPEPTVAP 218 Query: 87 VPTKTGYLTLG 97 T T Sbjct: 219 TVEPTAEPTKD 229 Score = 50.2 bits (118), Expect = 7e-04, Method: Composition-based stats. Identities = 22/89 (24%), Positives = 28/89 (31%) Query: 13 AAILSATLLAGCDGGGSGSSSDTPPVDSGTGSLPEVKPDPTPNPEPTPEPTPDPEPTPEP 72 +A +AT A S T + P +P+ P TP P PT EP Sbjct: 139 SASATATPSATASPSVEPSKDATAEPTATPTVAPSASAEPSVAPTATPTAEPTVAPTAEP 198 Query: 73 IPDPEPTPEPEPEPVPTKTGYLTLGGSQR 101 P P P P PT + Sbjct: 199 TVQPTAEPTVAPTPEPTVAPTVEPTAEPT 227 Score = 47.5 bits (111), Expect = 0.004, Method: Composition-based stats. Identities = 21/73 (28%), Positives = 28/73 (38%) Query: 31 SSSDTPPVDSGTGSLPEVKPDPTPNPEPTPEPTPDPEPTPEPIPDPEPTPEPEPEPVPTK 90 S S P D+ P+ + EP+ PT P P P EPT +P EP Sbjct: 151 SPSVEPSKDATAEPTATPTVAPSASAEPSVAPTATPTAEPTVAPTAEPTVQPTAEPTVAP 210 Query: 91 TGYLTLGGSQRVT 103 T T+ + T Sbjct: 211 TPEPTVAPTVEPT 223 Score = 44.9 bits (104), Expect = 0.026, Method: Composition-based stats. Identities = 22/82 (26%), Positives = 32/82 (39%) Query: 22 AGCDGGGSGSSSDTPPVDSGTGSLPEVKPDPTPNPEPTPEPTPDPEPTPEPIPDPEPTPE 81 A + S++ +P V+ + E PT P + EP+ P TP P PT E Sbjct: 138 ASASATATPSATASPSVEPSKDATAEPTATPTVAPSASAEPSVAPTATPTAEPTVAPTAE 197 Query: 82 PEPEPVPTKTGYLTLGGSQRVT 103 P +P T T + T Sbjct: 198 PTVQPTAEPTVAPTPEPTVAPT 219 Score = 44.9 bits (104), Expect = 0.030, Method: Composition-based stats. Identities = 14/81 (17%), Positives = 30/81 (37%) Query: 23 GCDGGGSGSSSDTPPVDSGTGSLPEVKPDPTPNPEPTPEPTPDPEPTPEPIPDPEPTPEP 82 G + + + T + + + P +P+ + P+P+ TP P T +P +P + Sbjct: 83 GDNKTDNAQAEPTAQPSASSSAQPTASAEPSASATPSPDATPTPNATAQPTAEPTASASA 142 Query: 83 EPEPVPTKTGYLTLGGSQRVT 103 P T + + Sbjct: 143 TATPSATASPSVEPSKDATAE 163 >UniRef50_B0BZP0 Putative uncharacterized protein n=2 Tax=Bacteria RepID=B0BZP0_ACAM1 Length = 988 Score = 52.2 bits (123), Expect = 2e-04, Method: Composition-based stats. Identities = 20/78 (25%), Positives = 26/78 (33%) Query: 26 GGGSGSSSDTPPVDSGTGSLPEVKPDPTPNPEPTPEPTPDPEPTPEPIPDPEPTPEPEPE 85 G + P +P+ DP P P P P P+P P P P P P Sbjct: 860 GPTIPQPTPGPAPSPDVTPVPQPSTDPPSGSTSQPFPFPVPLPSPSPQPGKVPESLPSPG 919 Query: 86 PVPTKTGYLTLGGSQRVT 103 VP T + V+ Sbjct: 920 EVPESGLPQTPAPKEPVS 937 Score = 47.9 bits (112), Expect = 0.003, Method: Composition-based stats. Identities = 20/66 (30%), Positives = 26/66 (39%) Query: 29 SGSSSDTPPVDSGTGSLPEVKPDPTPNPEPTPEPTPDPEPTPEPIPDPEPTPEPEPEPVP 88 S + P P PD TP P+P+ +P P P P P P+P P+P VP Sbjct: 853 SPPGTIPGPTIPQPTPGPAPSPDVTPVPQPSTDPPSGSTSQPFPFPVPLPSPSPQPGKVP 912 Query: 89 TKTGYL 94 Sbjct: 913 ESLPSP 918 >UniRef50_Q82F59 Putative uncharacterized protein n=8 Tax=Streptomyces RepID=Q82F59_STRAW Length = 582 Score = 49.1 bits (115), Expect = 0.001, Method: Composition-based stats. Identities = 22/47 (46%), Positives = 24/47 (51%) Query: 43 GSLPEVKPDPTPNPEPTPEPTPDPEPTPEPIPDPEPTPEPEPEPVPT 89 S PE +P P P P P PEP E P+PEPT E PEP P Sbjct: 275 TSEPEPAVEPAPEPVAEVTPEPQPEPVAEATPEPEPTVEATPEPEPV 321 Score = 48.7 bits (114), Expect = 0.002, Method: Composition-based stats. Identities = 17/48 (35%), Positives = 19/48 (39%) Query: 55 NPEPTPEPTPDPEPTPEPIPDPEPTPEPEPEPVPTKTGYLTLGGSQRV 102 EP P P PEP E P+P+P P E P P T T Sbjct: 275 TSEPEPAVEPAPEPVAEVTPEPQPEPVAEATPEPEPTVEATPEPEPVA 322 Score = 47.2 bits (110), Expect = 0.005, Method: Composition-based stats. Identities = 26/86 (30%), Positives = 28/86 (32%), Gaps = 20/86 (23%) Query: 39 DSGTGSLPEVKPDPTPNPEPTPEPTPDPEPTPEPIPDPEPTPEPEP-------------- 84 S E P+P P P+P P E TPEP P E TPEPEP Sbjct: 275 TSEPEPAVEPAPEPVAEVTPEPQPEPVAEATPEPEPTVEATPEPEPVAETTPEPEPEPTV 334 Query: 85 -----EPVPTKTGY-LTLGGSQRVTG 104 EP P T S Sbjct: 335 AEKTAEPEPEPVAEQPTPEPSAADGD 360 >UniRef50_UPI0001695557 hypothetical protein Plarl_10627 n=1 Tax=Paenibacillus larvae subsp. larvae BRL-230010 RepID=UPI0001695557 Length = 444 Score = 48.7 bits (114), Expect = 0.002, Method: Composition-based stats. Identities = 19/47 (40%), Positives = 23/47 (48%) Query: 37 PVDSGTGSLPEVKPDPTPNPEPTPEPTPDPEPTPEPIPDPEPTPEPE 83 V + +P +P P PEP PTP P P P P P PTP+P Sbjct: 160 QVKAEPAPVPTPEPTPAVTPEPATVPTPKPTPAATPEPAPVPTPKPT 206 Score = 44.9 bits (104), Expect = 0.027, Method: Composition-based stats. Identities = 17/53 (32%), Positives = 21/53 (39%) Query: 49 KPDPTPNPEPTPEPTPDPEPTPEPIPDPEPTPEPEPEPVPTKTGYLTLGGSQR 101 K D + EP P P P P P PEP P P+P P T + + Sbjct: 152 KLDTLKPVQVKAEPAPVPTPEPTPAVTPEPATVPTPKPTPAATPEPAPVPTPK 204 Score = 44.5 bits (103), Expect = 0.031, Method: Composition-based stats. Identities = 44/231 (19%), Positives = 73/231 (31%), Gaps = 13/231 (5%) Query: 27 GGSGSSSDTPPVDSGTGSLPEVKPDPTPNPEPTPEPTPDPEPTPEPIPDPEPTPEPEPEP 86 + TP P P P P P TPEP P P P P PIP P PTP P+P P Sbjct: 162 KAEPAPVPTPEPTPAVTPEPATVPTPKPTPAATPEPAPVPTPKPTPIPAPVPTPVPKPVP 221 Query: 87 VPTKTGYLTLGGSQRVTGATCNGESSDGFTFKPGEDVTCVAGNTTIA---TFNTQSEAAR 143 P T D +T KP +T G I+ T ++ Sbjct: 222 APILQDGTYTIDFTLYKDKTQEVSKMDTYTIKPA-ALTVNNGKMKISHTLTHSSWITQYE 280 Query: 144 SLRAVEKVSFSLEDAQELAGSDDKKSNAVSLVTSSNSCPANTEQVCLTFSSVIESKRFDS 203 + ++ +A + + + + + N+ +V + + + + Sbjct: 281 IEQNGSLKETTILSLDGVADTRVVQFDIADITETLNA----RVKVDIPEMNYLHTYDVQL 336 Query: 204 LYKQIDLAPEEFKKLVNEEVENNAATDKAPSTHTSPVVPVTTPGTKPDLNA 254 + + + + + + K S P T T N+ Sbjct: 337 AFDS-----DSIESVQGKPDPSRGPESKPEPKEVSASDPDVTLLTDATSNS 382 Score = 44.5 bits (103), Expect = 0.036, Method: Composition-based stats. Identities = 15/44 (34%), Positives = 16/44 (36%) Query: 31 SSSDTPPVDSGTGSLPEVKPDPTPNPEPTPEPTPDPEPTPEPIP 74 P P V P+P P P P P PEP P P P Sbjct: 160 QVKAEPAPVPTPEPTPAVTPEPATVPTPKPTPAATPEPAPVPTP 203 >UniRef50_A7B9W7 Putative uncharacterized protein n=1 Tax=Actinomyces odontolyticus ATCC 17982 RepID=A7B9W7_9ACTO Length = 989 Score = 48.3 bits (113), Expect = 0.002, Method: Composition-based stats. Identities = 34/162 (20%), Positives = 43/162 (26%), Gaps = 39/162 (24%) Query: 26 GGGSGSS-SDTPPVDSGTGSLPEVKPDPTPN--------------------------PEP 58 + + P G P +P PTP P P Sbjct: 728 HNERPAPVEEEPVPTPSPGPSPVPEPVPTPAPKPTPAPAPTADPAPAPAPSVDPAPVPAP 787 Query: 59 TPEPTPDPEPTPEPIPDPEPTPEPEPEPVPTKTGY------------LTLGGSQRVTGAT 106 T +P P P PT +P P P PT +P P P PT + Sbjct: 788 TMDPAPAPAPTTDPAPAPAPTTDPAPAPAPTMDPAPAPTADPAPMPHPFPSPAPTGGPVR 847 Query: 107 CNGESSDGFTFKPGEDVTCVAGNTTIATFNTQSEAARSLRAV 148 S+ T A T + AARS A Sbjct: 848 VPAASATNAGTPASSGATRTAAPAATGGSATGAPAARSNGAA 889 >UniRef50_D1C606 Peptidase M23 n=1 Tax=Sphaerobacter thermophilus DSM 20745 RepID=D1C606_SPHTD Length = 719 Score = 47.2 bits (110), Expect = 0.005, Method: Composition-based stats. Identities = 19/47 (40%), Positives = 23/47 (48%) Query: 45 LPEVKPDPTPNPEPTPEPTPDPEPTPEPIPDPEPTPEPEPEPVPTKT 91 P P+PT P P PTP+ PTPE P P+P P+P T Sbjct: 616 TPSATPEPTETPVPEATPTPEASPTPEATPLAAPSPARTVVPLPEPT 662 Score = 46.8 bits (109), Expect = 0.007, Method: Composition-based stats. Identities = 20/55 (36%), Positives = 24/55 (43%) Query: 49 KPDPTPNPEPTPEPTPDPEPTPEPIPDPEPTPEPEPEPVPTKTGYLTLGGSQRVT 103 + P+ PEPT P P+ PTPE P PE TP P P T + T Sbjct: 614 EATPSATPEPTETPVPEATPTPEASPTPEATPLAAPSPARTVVPLPEPTWNPEST 668 Score = 46.4 bits (108), Expect = 0.009, Method: Composition-based stats. Identities = 16/55 (29%), Positives = 23/55 (41%) Query: 31 SSSDTPPVDSGTGSLPEVKPDPTPNPEPTPEPTPDPEPTPEPIPDPEPTPEPEPE 85 ++ + + +PE P P +P P P P P +P PEPT PE Sbjct: 614 EATPSATPEPTETPVPEATPTPEASPTPEATPLAAPSPARTVVPLPEPTWNPEST 668 Score = 45.2 bits (105), Expect = 0.022, Method: Composition-based stats. Identities = 20/73 (27%), Positives = 28/73 (38%) Query: 30 GSSSDTPPVDSGTGSLPEVKPDPTPNPEPTPEPTPDPEPTPEPIPDPEPTPEPEPEPVPT 89 ++TP ++ P+ TP P+P T P P P P+ P P P P P+ Sbjct: 621 PEPTETPVPEATPTPEASPTPEATPLAAPSPARTVVPLPEPTWNPESTEVPTPAPVPTPS 680 Query: 90 KTGYLTLGGSQRV 102 T S V Sbjct: 681 PHPTSTAMPSPTV 693 >UniRef50_C2M7W3 Putative uncharacterized protein n=1 Tax=Capnocytophaga gingivalis ATCC 33624 RepID=C2M7W3_CAPGI Length = 1067 Score = 46.8 bits (109), Expect = 0.007, Method: Composition-based stats. Identities = 19/61 (31%), Positives = 26/61 (42%) Query: 62 PTPDPEPTPEPIPDPEPTPEPEPEPVPTKTGYLTLGGSQRVTGATCNGESSDGFTFKPGE 121 P PDP+P PEP P P+P P P P+P PT + S + F + + Sbjct: 935 PKPDPKPAPEPKPVPKPDPIPAPKPEPTPDSIEETREIAIYNAVSTQDNSQNYFKVEGYD 994 Query: 122 D 122 Sbjct: 995 I 995 Score = 44.9 bits (104), Expect = 0.027, Method: Composition-based stats. Identities = 16/30 (53%), Positives = 21/30 (70%) Query: 59 TPEPTPDPEPTPEPIPDPEPTPEPEPEPVP 88 P+P P P P P+P+P P+P P P+PEP P Sbjct: 934 IPKPDPKPAPEPKPVPKPDPIPAPKPEPTP 963 >UniRef50_C0WIN3 Putative uncharacterized protein n=1 Tax=Corynebacterium accolens ATCC 49725 RepID=C0WIN3_9CORY Length = 323 Score = 44.1 bits (102), Expect = 0.047, Method: Composition-based stats. Identities = 28/93 (30%), Positives = 40/93 (43%), Gaps = 6/93 (6%) Query: 22 AGCDGGGSGSSSDTPPVDSGTGSLPEVKPDPTPNPEPTPEPTPDPEPTPEPIPDPEPTPE 81 +GC + PP + + + P DP P PEP P+ P P + P P+P + Sbjct: 57 SGCQCQQCRGFVNEPPRQNESTTTPA-HADPNPEPEPKPQDCP---PEQQEQPAPQPEQD 112 Query: 82 PEPEPVPTKT--GYLTLGGSQRVTGATCNGESS 112 P PEPVP + + A C+ ESS Sbjct: 113 PVPEPVPEDSECEKEPTTPAAVEDDAACDIESS 145 >UniRef50_B0C0T3 Putative uncharacterized protein n=1 Tax=Acaryochloris marina MBIC11017 RepID=B0C0T3_ACAM1 Length = 1003 Score = 43.7 bits (101), Expect = 0.063, Method: Composition-based stats. Identities = 25/110 (22%), Positives = 32/110 (29%) Query: 38 VDSGTGSLPEVKPDPTPNPEPTPEPTPDPEPTPEPIPDPEPTPEPEPEPVPTKTGYLTLG 97 D P +P P+ +P P P P P P P P P P P G G Sbjct: 871 PDPIPSPTPTPEPAPSSSPAPGATPPDSSSPNSIPFPVPFPGSGPSPGPGSVPGGTPAPG 930 Query: 98 GSQRVTGATCNGESSDGFTFKPGEDVTCVAGNTTIATFNTQSEAARSLRA 147 T T + S T PG ++ + T A Sbjct: 931 SIPAPTTGTPSTVPSGIETPTPGAEIPVSEPGGQLPTPADSLPDPAERPA 980 >UniRef50_A3DJW7 Fibronectin, type III n=2 Tax=Clostridium thermocellum RepID=A3DJW7_CLOTH Length = 667 Score = 43.3 bits (100), Expect = 0.082, Method: Composition-based stats. Identities = 15/60 (25%), Positives = 20/60 (33%) Query: 31 SSSDTPPVDSGTGSLPEVKPDPTPNPEPTPEPTPDPEPTPEPIPDPEPTPEPEPEPVPTK 90 + + P P P P+P P P+ P PE T P P P+P Sbjct: 67 EPTPIGSLTPAVTETPLATPTQETLPSPSPSEFSTPTPSFTPDASPESTSTPFPSPLPFP 126 Database: uniref50.fasta Posted date: Mar 8, 2010 10:38 AM Number of letters in database: 1,040,396,356 Number of sequences in database: 3,077,464 Lambda K H 0.306 0.118 0.306 Lambda K H 0.267 0.0361 0.140 Matrix: BLOSUM62 Gap Penalties: Existence: 11, Extension: 1 Number of Hits to DB: 7,411,646,549 Number of Sequences: 3077464 Number of extensions: 301074391 Number of successful extensions: 4750846 Number of sequences better than 1.0e-01: 250 Number of HSP's better than 0.1 without gapping: 9828 Number of HSP's successfully gapped in prelim test: 5422 Number of HSP's that attempted gapping in prelim test: 3806977 Number of HSP's gapped (non-prelim): 462102 length of query: 1520 length of database: 1,040,396,356 effective HSP length: 142 effective length of query: 1378 effective length of database: 603,396,468 effective search space: 831480332904 effective search space used: 831480332904 T: 11 A: 40 X1: 16 ( 7.1 bits) X2: 38 (14.6 bits) X3: 64 (24.7 bits) S1: 42 (21.6 bits) S2: 99 (42.9 bits)