BLASTP 2.2.22 [Sep-27-2009] Reference: Altschul, Stephen F., Thomas L. Madden, Alejandro A. Schaffer, Jinghui Zhang, Zheng Zhang, Webb Miller, and David J. Lipman (1997), "Gapped BLAST and PSI-BLAST: a new generation of protein database search programs", Nucleic Acids Res. 25:3389-3402. Reference for compositional score matrix adjustment: Altschul, Stephen F., John C. Wootton, E. Michael Gertz, Richa Agarwala, Aleksandr Morgulis, Alejandro A. Schaffer, and Yi-Kuo Yu (2005) "Protein database searches using compositionally adjusted substitution matrices", FEBS J. 272:5101-5109. Reference for composition-based statistics starting in round 2: Schaffer, Alejandro A., L. Aravind, Thomas L. Madden, Sergei Shavirin, John L. Spouge, Yuri I. Wolf, Eugene V. Koonin, and Stephen F. Altschul (2001), "Improving the accuracy of PSI-BLAST protein database searches with composition-based statistics and other refinements", Nucleic Acids Res. 29:2994-3005. Query= batch____ (219 letters) Database: uniref50.fasta 3,077,464 sequences; 1,040,396,356 total letters Searching..................................................done Results from round 1 Score E Sequences producing significant alignments: (bits) Value UniRef50_P76396 Uncharacterized protein yegL n=64 Tax=cellular o... 449 e-125 UniRef50_B8HUC4 von Willebrand factor type A n=7 Tax=Bacteria Re... 268 1e-70 UniRef50_UPI00016C400A von Willebrand factor, type A n=1 Tax=Gem... 254 2e-66 UniRef50_Q8YTA2 Alr2822 protein n=11 Tax=Bacteria RepID=Q8YTA2_A... 233 4e-60 UniRef50_B5W3I3 von Willebrand factor type A n=4 Tax=Cyanobacter... 232 6e-60 UniRef50_Q3M2E0 von Willebrand factor, type A n=2 Tax=Cyanobacte... 224 2e-57 UniRef50_B9XNU9 von Willebrand factor type A n=1 Tax=bacterium E... 215 8e-55 UniRef50_Q114A2 von Willebrand factor, type A n=3 Tax=Cyanobacte... 204 2e-51 UniRef50_Q2FNC6 von Willebrand factor, type A n=1 Tax=Methanospi... 185 9e-46 UniRef50_C0AYK3 Putative uncharacterized protein n=1 Tax=Proteus... 175 8e-43 UniRef50_Q2SNJ1 Uncharacterized protein encoded in toxicity prot... 172 5e-42 UniRef50_C9RRF6 von Willebrand factor type A n=3 Tax=Bacteria Re... 169 8e-41 UniRef50_C9RJ63 von Willebrand factor type A n=1 Tax=Fibrobacter... 163 3e-39 UniRef50_A9AV55 von Willebrand factor type A n=2 Tax=Bacteria Re... 161 2e-38 UniRef50_A3PUP3 von Willebrand factor, type A n=1 Tax=Mycobacter... 154 1e-36 UniRef50_D0W6S8 Glycosyl transferase, group 2 family n=1 Tax=Nei... 153 5e-36 UniRef50_C8PVC3 von Willebrand factor type A domain protein n=1 ... 148 1e-34 UniRef50_Q87W17 von Willebrand factor type A domain protein n=2 ... 145 9e-34 UniRef50_C1TQB2 Uncharacterized protein n=1 Tax=Dethiosulfovibri... 136 5e-31 UniRef50_C0DBA5 Putative uncharacterized protein n=1 Tax=Clostri... 130 3e-29 UniRef50_D1PS09 von Willebrand factor type A domain protein n=1 ... 129 7e-29 UniRef50_A6G3C6 von Willebrand factor, type A n=1 Tax=Plesiocyst... 109 6e-23 UniRef50_C9ZGR0 Putative uncharacterized protein n=1 Tax=Strepto... 103 5e-21 UniRef50_Q19NM5 TerY1 n=54 Tax=root RepID=Q19NM5_ECOK1 88 3e-16 UniRef50_C8W3F9 von Willebrand factor type A n=1 Tax=Desulfotoma... 87 3e-16 UniRef50_UPI000185CA78 von Willebrand factor type A n=1 Tax=Capn... 85 2e-15 UniRef50_C6Z299 Tellurium resistance protein n=1 Tax=Bacteroides... 81 2e-14 UniRef50_D1TTW6 von Willebrand factor type A domain protein n=10... 80 6e-14 UniRef50_A9BLP5 von Willebrand factor type A n=3 Tax=Burkholderi... 77 4e-13 UniRef50_Q5NWS3 Tellurium resistance protein n=2 Tax=Proteobacte... 75 1e-12 UniRef50_B7AFM8 Putative uncharacterized protein n=1 Tax=Bactero... 75 1e-12 UniRef50_Q8DK92 Tlr0974 protein n=1 Tax=Thermosynechococcus elon... 75 2e-12 UniRef50_A1VV60 von Willebrand factor, type A n=4 Tax=Proteobact... 72 1e-11 UniRef50_Q5NWS4 Tellurium resistance protein n=8 Tax=Bacteria Re... 72 1e-11 UniRef50_B6BJ58 Phage/colicin/tellurite resistance cluster TerY ... 71 2e-11 UniRef50_B2K3B2 von Willebrand factor type A n=39 Tax=Gammaprote... 71 2e-11 UniRef50_UPI0001AED79F von Willebrand factor type A n=1 Tax=Stre... 70 5e-11 UniRef50_C9LWT6 Tellurium resistance protein n=1 Tax=Selenomonas... 67 4e-10 UniRef50_A1VWQ4 von Willebrand factor, type A n=20 Tax=Proteobac... 67 5e-10 UniRef50_A1THU3 von Willebrand factor, type A n=1 Tax=Mycobacter... 65 1e-09 UniRef50_Q0I303 Putative uncharacterized protein n=5 Tax=Pasteur... 65 1e-09 UniRef50_D1AAR7 von Willebrand factor type A n=1 Tax=Thermomonos... 64 3e-09 UniRef50_UPI00018742C1 von Willebrand factor type A n=1 Tax=Cory... 63 7e-09 UniRef50_C7PNX3 von Willebrand factor type A n=2 Tax=Sphingobact... 63 7e-09 UniRef50_A7BNL3 Tellurium resistance protein n=1 Tax=Beggiatoa s... 63 9e-09 UniRef50_C9RK46 von Willebrand factor type A n=1 Tax=Fibrobacter... 60 7e-08 UniRef50_A7C6I0 Protein containing Von Willebrand factor, type A... 59 1e-07 UniRef50_UPI0001745BB0 TerY3 n=1 Tax=Verrucomicrobium spinosum D... 57 4e-07 UniRef50_Q0RFF8 Putative uncharacterized protein n=1 Tax=Frankia... 56 9e-07 UniRef50_A1SV07 Putative uncharacterized protein n=1 Tax=Psychro... 54 4e-06 UniRef50_D2AQW9 Uncharacterized protein encoded in toxicity prot... 54 4e-06 UniRef50_UPI0001B52F00 von Willebrand factor type A n=1 Tax=Fuso... 53 6e-06 UniRef50_B4VGR3 Putative uncharacterized protein n=1 Tax=Strepto... 52 1e-05 UniRef50_C7N2G1 Uncharacterized protein n=2 Tax=Slackia heliotri... 51 2e-05 UniRef50_C5PP99 von Willebrand factor, type A n=2 Tax=Bacteroide... 51 2e-05 UniRef50_C9RJN5 von Willebrand factor type A n=1 Tax=Fibrobacter... 50 5e-05 UniRef50_B2UUD5 Phage/colicin/tellurite resistance cluster terY ... 50 7e-05 UniRef50_C8N9L6 Tellurium resistance protein n=1 Tax=Cardiobacte... 50 8e-05 UniRef50_C2HF13 von Willebrand factor type A n=1 Tax=Finegoldia ... 49 2e-04 UniRef50_C0F0K9 Putative uncharacterized protein n=2 Tax=Eubacte... 49 2e-04 UniRef50_Q4GZD0 Putative uncharacterized protein n=2 Tax=Trypano... 47 3e-04 UniRef50_C3XUD0 Putative uncharacterized protein n=1 Tax=Branchi... 47 5e-04 UniRef50_B0A9L7 Putative uncharacterized protein n=2 Tax=Clostri... 46 8e-04 UniRef50_UPI00016C09D7 hypothetical protein Epulo_01596 n=1 Tax=... 46 0.001 UniRef50_D0N9W4 Putative uncharacterized protein n=2 Tax=Phytoph... 45 0.001 UniRef50_UPI0001C37785 von Willebrand factor type A n=1 Tax=Rumi... 45 0.002 UniRef50_A7RKA1 Predicted protein n=2 Tax=Nematostella vectensis... 45 0.002 UniRef50_A8L6A2 von Willebrand factor type A n=1 Tax=Frankia sp.... 43 0.009 UniRef50_C5VFZ9 von Willebrand factor type A n=2 Tax=Corynebacte... 42 0.012 UniRef50_C1A2B7 Putative uncharacterized protein n=1 Tax=Rhodoco... 41 0.037 UniRef50_C7DHI4 von Willebrand factor type A n=1 Tax=Candidatus ... 40 0.044 UniRef50_B9GVZ4 Predicted protein n=4 Tax=rosids RepID=B9GVZ4_POPTR 40 0.045 >UniRef50_P76396 Uncharacterized protein yegL n=64 Tax=cellular organisms RepID=YEGL_ECOLI Length = 219 Score = 449 bits (1154), Expect = e-125, Method: Compositional matrix adjust. Identities = 219/219 (100%), Positives = 219/219 (100%) Query: 1 MSEQITFATSDFASNPEPRCPCILLLDVSGSMNGRPINELNAGLVTFRDELLADPLALKR 60 MSEQITFATSDFASNPEPRCPCILLLDVSGSMNGRPINELNAGLVTFRDELLADPLALKR Sbjct: 1 MSEQITFATSDFASNPEPRCPCILLLDVSGSMNGRPINELNAGLVTFRDELLADPLALKR 60 Query: 61 VELGIVTFGPVHVEQPFTSAANFFPPILFAQGDTPMGAAITKALDMVEERKREYRANGIS 120 VELGIVTFGPVHVEQPFTSAANFFPPILFAQGDTPMGAAITKALDMVEERKREYRANGIS Sbjct: 61 VELGIVTFGPVHVEQPFTSAANFFPPILFAQGDTPMGAAITKALDMVEERKREYRANGIS 120 Query: 121 YYRPWIFLITDGAPTDEWQAAANKVFRGEEDKRFAFFSIGVQGADMKTLAQISVRQPLPL 180 YYRPWIFLITDGAPTDEWQAAANKVFRGEEDKRFAFFSIGVQGADMKTLAQISVRQPLPL Sbjct: 121 YYRPWIFLITDGAPTDEWQAAANKVFRGEEDKRFAFFSIGVQGADMKTLAQISVRQPLPL 180 Query: 181 QGLQFRELFSWLSSSLRSVSRSTPGTEVVLEAPKGWTSV 219 QGLQFRELFSWLSSSLRSVSRSTPGTEVVLEAPKGWTSV Sbjct: 181 QGLQFRELFSWLSSSLRSVSRSTPGTEVVLEAPKGWTSV 219 >UniRef50_B8HUC4 von Willebrand factor type A n=7 Tax=Bacteria RepID=B8HUC4_CYAP4 Length = 254 Score = 268 bits (685), Expect = 1e-70, Method: Compositional matrix adjust. Identities = 143/216 (66%), Positives = 162/216 (75%), Gaps = 3/216 (1%) Query: 7 FATSDFASNPEPRCPCILLLDVSGSMNGRPINELNAGLVTFRDELLADPLALKRVELGIV 66 F T DFA+NPEPRCP ILLLD SGSM G PI ELNAG+ FRDELLAD LA KRVE+ IV Sbjct: 39 FGTDDFANNPEPRCPVILLLDTSGSMRGTPIQELNAGVELFRDELLADALASKRVEVAIV 98 Query: 67 TFGPVHVEQPFTSAANFFPPILFAQGDTPMGAAITKALDMVEERKREYRANGISYYRPWI 126 FGPV V Q F +A F PP L A+ DTP+GAAI ALD+++ RK Y+ANGI+YYRPW+ Sbjct: 99 GFGPVQVIQDFVTADYFNPPKLRAEADTPLGAAIETALDLLQSRKDTYKANGIAYYRPWV 158 Query: 127 FLITDGAPTDEWQAAANKVFRGEEDKRFAFFSIGVQGADMKTLAQISVRQPLPLQGLQFR 186 FLITDG PTD WQ AA +V GE K FAFFSIGV+GA + LAQIS R PL L+ L+FR Sbjct: 159 FLITDGGPTDHWQTAARRVKEGESKKSFAFFSIGVEGARIDILAQISTRTPLKLKELRFR 218 Query: 187 ELFSWLSSSLRSVSRSTPGTEVVL---EAPKGWTSV 219 +LF WLSSSL+SVSRSTPG EV L P GW SV Sbjct: 219 DLFQWLSSSLKSVSRSTPGDEVPLLNPATPDGWASV 254 >UniRef50_UPI00016C400A von Willebrand factor, type A n=1 Tax=Gemmata obscuriglobus UQM 2246 RepID=UPI00016C400A Length = 249 Score = 254 bits (648), Expect = 2e-66, Method: Compositional matrix adjust. Identities = 131/249 (52%), Positives = 163/249 (65%), Gaps = 30/249 (12%) Query: 1 MSEQITFATSDFASNPEPRCPCILLLDVSGSM------NGRP------------------ 36 M+EQI F+ A+NPEPRCPC+LL+D SGSM GR Sbjct: 1 MAEQIPFSDVALATNPEPRCPCVLLIDTSGSMAEVVSGTGRDLGRTAQVDGKTYRVVSGG 60 Query: 37 ---INELNAGLVTFRDELLADPLALKRVELGIVTFG-PVHVEQPFTSAANFFPPILFAQG 92 I+ +N GL ++ ++ DPLA +RVE+ +VTFG V PF + + F PP+L A G Sbjct: 61 TTRIDLVNEGLRVYQADVTNDPLAAQRVEVSVVTFGDTVRTVTPFVTTSQFTPPVLTANG 120 Query: 93 DTPMGAAITKALDMVEERKREYRANGISYYRPWIFLITDGAPTDEWQAAANKVFRGEEDK 152 +TPMGAAI KA+D V ERKREYR NG+ +YRPWIFLITDG PTD W+AAA +V GEE K Sbjct: 121 ETPMGAAILKAIDAVTERKREYRQNGLHFYRPWIFLITDGEPTDAWEAAAARVREGEEKK 180 Query: 153 RFAFFSIGVQGADMKTLAQISVRQPLPLQGLQFRELFSWLSSSLRSVSRSTPGTE--VVL 210 +FAFF++GV+GA+M L QISVRQPL L+G F+E+F WLS S RSVS S PG E V L Sbjct: 181 QFAFFAVGVEGANMDRLKQISVRQPLHLKGYSFKEMFLWLSQSQRSVSHSNPGQEEQVKL 240 Query: 211 EAPKGWTSV 219 P GW S+ Sbjct: 241 APPAGWASL 249 >UniRef50_Q8YTA2 Alr2822 protein n=11 Tax=Bacteria RepID=Q8YTA2_ANASP Length = 224 Score = 233 bits (593), Expect = 4e-60, Method: Compositional matrix adjust. Identities = 117/214 (54%), Positives = 150/214 (70%), Gaps = 6/214 (2%) Query: 11 DFASNPEPRCPCILLLDVSGSMNGRPINELNAGLVTFRDELLADPLALKRVELGIVTF-G 69 +FA NPEPRCPC+LLLD SGSM G I LN GL++ +DEL+ + +A +RVE+ IVTF Sbjct: 12 EFAENPEPRCPCVLLLDTSGSMQGAAIEALNQGLLSLKDELMKNSIAARRVEIAIVTFDS 71 Query: 70 PVHVEQPFTSAANFFPPILFAQGDTPMGAAITKALDMVEERKREYRANGISYYRPWIFLI 129 ++V Q F +A F PPIL AQG T MGA I KALDMV+ERK YRANG++YYRPW+F+I Sbjct: 72 HINVIQDFVTADQFNPPILTAQGLTSMGAGIHKALDMVQERKSLYRANGVAYYRPWVFMI 131 Query: 130 TDGAPTDEW----QAAANKVFRGEEDKRFAFFSIGVQGADMKTLAQISVRQPLPLQGLQF 185 TDG P E + AA ++ E +KR AFFS+GV+ A+M L QI+VR PL L+GL F Sbjct: 132 TDGEPQGELDHLVEQAALRLQGDEVNKRVAFFSVGVENANMTRLNQIAVRTPLKLKGLNF 191 Query: 186 RELFSWLSSSLRSVSRSTPGTEVVLEAPKGWTSV 219 E+F WLS+S+ +VS S +V L P GW S+ Sbjct: 192 IEMFVWLSASMSAVSHSQIDEQVAL-PPIGWGSI 224 >UniRef50_B5W3I3 von Willebrand factor type A n=4 Tax=Cyanobacteria RepID=B5W3I3_SPIMA Length = 228 Score = 232 bits (592), Expect = 6e-60, Method: Compositional matrix adjust. Identities = 116/214 (54%), Positives = 152/214 (71%), Gaps = 6/214 (2%) Query: 11 DFASNPEPRCPCILLLDVSGSMNGRPINELNAGLVTFRDELLADPLALKRVELGIVTF-G 69 +FA NPEPRCPC+LLLD S SM G P++ LNAGL+TFR+ L+ D LA KRVE+ I+TF Sbjct: 16 EFAENPEPRCPCVLLLDTSASMQGEPLDGLNAGLMTFRENLIKDELAKKRVEIAIITFDN 75 Query: 70 PVHVEQPFTSAANFFPPILFAQGDTPMGAAITKALDMVEERKREYRANGISYYRPWIFLI 129 V + Q F +A F PP+L AQG T MG AI +ALDM+ RK EYR NGI+YYRPW+F+I Sbjct: 76 QVKIIQDFVTADRFEPPLLNAQGQTYMGTAIGEALDMIASRKAEYRNNGITYYRPWVFMI 135 Query: 130 TDGAP---TDEWQAAANKVFRGEE-DKRFAFFSIGVQGADMKTLAQISVRQPLPLQGLQF 185 TDG P +D A K R EE +K+ AFF++GV+GA+M+ L +++ R PL L+GL F Sbjct: 136 TDGEPQGESDRITEQAIKRIRDEEANKQVAFFAVGVEGANMERLGEMAQRTPLKLKGLDF 195 Query: 186 RELFSWLSSSLRSVSRSTPGTEVVLEAPKGWTSV 219 RE+F WLS+S+++VS S +V L P GW +V Sbjct: 196 REMFIWLSASMQTVSHSKVDEQVAL-PPPGWGTV 228 >UniRef50_Q3M2E0 von Willebrand factor, type A n=2 Tax=Cyanobacteria RepID=Q3M2E0_ANAVT Length = 218 Score = 224 bits (570), Expect = 2e-57, Method: Compositional matrix adjust. Identities = 113/211 (53%), Positives = 141/211 (66%), Gaps = 3/211 (1%) Query: 11 DFASNPEPRCPCILLLDVSGSMNGRPINELNAGLVTFRDELLADPLALKRVELGIVTFGP 70 +F NPE RCP ILLLD SGSM+G+PI ELN GL TF+++++ D A VE+ I+TFGP Sbjct: 7 EFVENPENRCPVILLLDTSGSMSGQPIQELNRGLATFKEDVIKDSQASLSVEVAIITFGP 66 Query: 71 VHVEQPFTSAANFFPPILFAQGDTPMGAAITKALDMVEERKREYRANGISYYRPWIFLIT 130 V + Q F + F PP L A+G TPMG AI ALD++E RK Y+ NGI YYRPWIFLIT Sbjct: 67 VRLVQDFVNIDQFTPPQLEAEGVTPMGEAIEYALDLLETRKSAYKENGILYYRPWIFLIT 126 Query: 131 DGAPTDEWQAAANKVFRGEEDKRFAFFSIGVQGADMKTLAQI--SVRQPLPLQGLQFREL 188 DGAPTD + AA +V E ++R FF++GVQGAD L QI + R P+ L GL FR L Sbjct: 127 DGAPTDYYHLAAQRVKEAEANRRLCFFTVGVQGADFNKLRQIAPAERPPVILNGLDFRSL 186 Query: 189 FSWLSSSLRSVSRSTPGTEVVLEAPKGWTSV 219 F WLS+S++ VS G V L P GW + Sbjct: 187 FVWLSTSMKRVSSGKIGEAVAL-PPVGWGQI 216 >UniRef50_B9XNU9 von Willebrand factor type A n=1 Tax=bacterium Ellin514 RepID=B9XNU9_9BACT Length = 229 Score = 215 bits (548), Expect = 8e-55, Method: Compositional matrix adjust. Identities = 112/227 (49%), Positives = 148/227 (65%), Gaps = 8/227 (3%) Query: 1 MSEQITFATSDFASNPEPRCPCILLLDVSGSMNGRPINELNAGLVTFRDELLADPLALKR 60 MS+Q+ F +F NPEPRCPC+L+LDVS SM G I+ LN G+ F +L LA KR Sbjct: 1 MSDQLPFIDVEFVDNPEPRCPCVLVLDVSSSMRGAAIDFLNLGVDLFAHDLTRSRLACKR 60 Query: 61 VELGIVTFGP-VHVEQPFTSAANFFPPILFAQGDTPMGAAITKALDMVEERKREYRANGI 119 VE I+TFG VH+ Q F S + F PP A G TPMG A+ +A +++E+RKR+YRA G+ Sbjct: 61 VETAIITFGDGVHIVQDFVSPSAFVPPRFEAGGKTPMGEAVVQACELLEKRKRKYRAAGV 120 Query: 120 SYYRPWIFLITDGAPTD----EWQAAANKVFRGEEDKRFAFFSIGVQGADMKTLAQI--S 173 SY+RPWIFLITDG PTD W+ A V GE DK+ FF + V A+ L ++ + Sbjct: 121 SYFRPWIFLITDGEPTDYETANWRQAVEIVRAGEVDKKLMFFGVAVSDANQGKLNELCPA 180 Query: 174 VRQPLPLQGLQFRELFSWLSSSLRSVSRSTPGTE-VVLEAPKGWTSV 219 R + L GL F+ LF+WLSSSLR+VS + PGT+ +VL + GW +V Sbjct: 181 SRPAIKLNGLDFQGLFTWLSSSLRTVSSANPGTQGIVLPSIAGWATV 227 >UniRef50_Q114A2 von Willebrand factor, type A n=3 Tax=Cyanobacteria RepID=Q114A2_TRIEI Length = 1204 Score = 204 bits (518), Expect = 2e-51, Method: Composition-based stats. Identities = 102/210 (48%), Positives = 134/210 (63%), Gaps = 1/210 (0%) Query: 11 DFASNPEPRCPCILLLDVSGSMNGRPINELNAGLVTFRDELLADPLALKRVELGIVTFGP 70 +F NPE RCP ILLLD S SM+G I ELN G+ F+ + D LA RVE+ ++TF Sbjct: 622 EFVENPENRCPIILLLDTSYSMSGEAITELNQGVKIFQASVKEDELASLRVEIAVITFNS 681 Query: 71 -VHVEQPFTSAANFFPPILFAQGDTPMGAAITKALDMVEERKREYRANGISYYRPWIFLI 129 + V Q F + F P L A G T MG AI KAL+++E+RK++Y+ + I YYRPWIFLI Sbjct: 682 EIEVVQDFVTVDKFIPKTLEASGVTHMGKAIEKALELLEKRKQDYKNSDIQYYRPWIFLI 741 Query: 130 TDGAPTDEWQAAANKVFRGEEDKRFAFFSIGVQGADMKTLAQISVRQPLPLQGLQFRELF 189 TDG PTD WQ AA K+ E +++ FF++GV+ ADM+TL++ISV P L GL F+ LF Sbjct: 742 TDGQPTDTWQDAAKKIEEAETNRKLLFFAVGVRDADMETLSEISVCPPKKLNGLDFQSLF 801 Query: 190 SWLSSSLRSVSRSTPGTEVVLEAPKGWTSV 219 WLS SL+ VS S G + L W + Sbjct: 802 KWLSFSLQQVSVSKIGEKNRLPPTNAWEEI 831 >UniRef50_Q2FNC6 von Willebrand factor, type A n=1 Tax=Methanospirillum hungatei JF-1 RepID=Q2FNC6_METHJ Length = 233 Score = 185 bits (470), Expect = 9e-46, Method: Compositional matrix adjust. Identities = 101/215 (46%), Positives = 131/215 (60%), Gaps = 12/215 (5%) Query: 17 EPRCPCILLLDVSGSMNGRPINELNAGLVTFRDELLADPLALKRVELGIVTFGP-VHVEQ 75 P C +L+LD S SM+G I ELN GL DEL D LA+KR++L ++TFG V + + Sbjct: 17 HPHCATVLVLDTSASMSGNKIAELNEGLRILTDELKEDDLAVKRIDLAVITFGKGVELVR 76 Query: 76 PFTSAANFFPPILFAQGDTPMGAAITKALDMVEERKREYRANGISYYRPWIFLITDGAPT 135 PFT + F PP L A G TPMG AI +A+ +VEERK EYR G YYRPWIFLITDG PT Sbjct: 77 PFTGISAFDPPELSAGGYTPMGQAILEAVRLVEERKAEYRTIGTDYYRPWIFLITDGQPT 136 Query: 136 DE------WQAAANKVFRGEEDKRFAFFSIGVQGADMKTLAQIS--VRQPLPLQGLQFRE 187 D W+ V GE D +F F+++GV A+M L +IS R PL L+ ++ E Sbjct: 137 DMRKGDEIWEKVIEAVHGGERDHKFLFWALGVDQANMTVLREISPPGRTPLMLKEAKWAE 196 Query: 188 LFSWLSSSLRSVSRSTPGTEVVLE---APKGWTSV 219 +F WLS SL +S S G ++ LE P+GW + Sbjct: 197 MFLWLSKSLSQISDSRIGEQISLENPVGPEGWGVI 231 >UniRef50_C0AYK3 Putative uncharacterized protein n=1 Tax=Proteus penneri ATCC 35198 RepID=C0AYK3_9ENTR Length = 227 Score = 175 bits (444), Expect = 8e-43, Method: Compositional matrix adjust. Identities = 93/225 (41%), Positives = 130/225 (57%), Gaps = 6/225 (2%) Query: 1 MSEQITFATSDFASNPEPRCPCILLLDVSGSMNGRPINELNAGLVTFRDELLADPLALKR 60 M E + N E R P IL+LD SGSM G+PI +LN GL EL D +A KR Sbjct: 1 MMEHLMIPDVALVDNSEQRTPLILVLDSSGSMYGQPIQQLNEGLKLLEQELKNDVIAAKR 60 Query: 61 VELGIVTFG---PVHVEQPFTSAANFFPPILFAQGDTPMGAAITKALDMVEERKREYRAN 117 V + ++ +G + + A +F P+L A G TPMG AIT AL+ +E K+ ++ Sbjct: 61 VRILVIEYGGYDQCTIHGDWKDAMDFTAPVLEANGTTPMGQAITLALEEIEAEKQRFKQA 120 Query: 118 GISYYRPWIFLITDGAPTDEWQAAANKVFRGEEDKRFAFFSIGVQGADMKTLAQIS---V 174 G++Y RPW+FL++DG PTD+W+ AA + EE ++ A F I V GA + + S V Sbjct: 121 GVAYTRPWLFLMSDGVPTDQWEQAAQLCRQAEESQKTAVFPIMVDGASAEVMGSFSRNGV 180 Query: 175 RQPLPLQGLQFRELFSWLSSSLRSVSRSTPGTEVVLEAPKGWTSV 219 L+GLQF+ELF WLS+S++ VS+STPG L + W SV Sbjct: 181 NGVKMLKGLQFKELFLWLSASMQVVSQSTPGGTAQLPSTDSWASV 225 >UniRef50_Q2SNJ1 Uncharacterized protein encoded in toxicity protection region of plasmid R478, contains von Willebrand factor (VWF) domain n=1 Tax=Hahella chejuensis KCTC 2396 RepID=Q2SNJ1_HAHCH Length = 223 Score = 172 bits (437), Expect = 5e-42, Method: Compositional matrix adjust. Identities = 95/212 (44%), Positives = 126/212 (59%), Gaps = 5/212 (2%) Query: 12 FASNPEPRCPCILLLDVSGSMNGRPINELNAGLVTFRDELLADPLALKRVELGIVTFG-- 69 F N R PC+L+LD S SM G PI +LN GL L D RV+L ++ G Sbjct: 11 FNDNNSQRTPCVLVLDGSSSMFGEPIRQLNEGLKLLERALKEDASTAMRVQLLVIRAGNH 70 Query: 70 -PVHVEQPFTSAANFFPPILFAQGDTPMGAAITKALDMVEERKREYRANGISYYRPWIFL 128 V + A +F P +FA G TP+G A+ ALD +E++K Y ANGIS RPWI L Sbjct: 71 DQAEVLTDWVDAMDFNAPEVFANGTTPLGGAMNLALDKIEDQKAAYDANGISSTRPWIIL 130 Query: 129 ITDGAPTD-EWQAAANKVFRGEEDKRFAFFSIGVQGADMKTLAQISVRQPLPLQGLQFRE 187 I+DGAPTD W+A A++ E++++ F IGV+GA +TL Q S + L+GLQFRE Sbjct: 131 ISDGAPTDFNWEAVADRCRHAEQNRKVVIFPIGVEGATFETLNQFSNKGAKKLKGLQFRE 190 Query: 188 LFSWLSSSLRSVSRSTPGTEVVLEAPKGWTSV 219 LF WLS S+ +VS S+PG +V L A W+ V Sbjct: 191 LFVWLSRSMATVSVSSPGEKVQLPATD-WSEV 221 >UniRef50_C9RRF6 von Willebrand factor type A n=3 Tax=Bacteria RepID=C9RRF6_FIBSS Length = 228 Score = 169 bits (427), Expect = 8e-41, Method: Compositional matrix adjust. Identities = 98/228 (42%), Positives = 135/228 (59%), Gaps = 9/228 (3%) Query: 1 MSEQITFATSDFASNPEPRCPCILLLDVSGSMNGRPINELNAGLVTFRDELLADPLALKR 60 M+ + D +NP R P L+LD SGSM G+PI+ELN G+ F D + +D AL Sbjct: 1 MTNEKLLNIEDLENNPSTRVPVCLVLDTSGSMEGQPISELNEGINCFYDAVRSDETALYA 60 Query: 61 VELGIVTFGPVHVEQPFTSAANFFP--PILFAQGDTPMGAAITKALDMVEERKREYRANG 118 E+ +VTFG V + S P P FA G TPMG A+ ALD++E+RK EY+A+G Sbjct: 61 AEIAVVTFGGSAVLKTDFSTLEHQPDSPNFFANGGTPMGEAMNMALDLLEKRKGEYKASG 120 Query: 119 ISYYRPWIFLITDGAP---TDEWQAAANKVFRGEEDKRFAFFSIGV-QGADMKTLAQIS- 173 + YY+PWI L+TDG P + E+ A + ++++ F IG+ + ADM LA S Sbjct: 121 VDYYQPWIVLMTDGKPNGDSSEYARAVQRTCEMIKNRKLTIFPIGIGEDADMNALAAFSP 180 Query: 174 VRQPLPLQGLQFRELFSWLSSSLRSVSRSTPGTEVVLEAP--KGWTSV 219 R PL LQGL FRE F+WLS S+ VS+STPG ++ L+ KGW + Sbjct: 181 KRSPLKLQGLNFREFFAWLSKSVSKVSQSTPGDKIQLDTDGIKGWAEL 228 >UniRef50_C9RJ63 von Willebrand factor type A n=1 Tax=Fibrobacter succinogenes subsp. succinogenes S85 RepID=C9RJ63_FIBSS Length = 227 Score = 163 bits (413), Expect = 3e-39, Method: Compositional matrix adjust. Identities = 97/222 (43%), Positives = 129/222 (58%), Gaps = 9/222 (4%) Query: 7 FATSDFASNPEPRCPCILLLDVSGSMNGRPINELNAGLVTFRDELLADPLALKRVELGIV 66 + D +NP R P L+LD SGSM G INELN G+ F D + +D AL E+ +V Sbjct: 6 LSIEDLENNPSSRVPVCLVLDTSGSMEGDSINELNEGVRLFYDAVRSDETALYAAEISVV 65 Query: 67 TFGPVHVEQPFTSAANFFP--PILFAQGDTPMGAAITKALDMVEERKREYRANGISYYRP 124 TFG Q S P P +A G TPMG A+ ALDM+E+RK EY+A+G+ YY+P Sbjct: 66 TFGGHASCQAGFSTLEHQPDAPQFYADGGTPMGEAMNMALDMLEKRKSEYKASGVDYYQP 125 Query: 125 WIFLITDGAPTDEWQAAANKVFRGEE---DKRFAFFSIGV-QGADMKTLAQIS-VRQPLP 179 WI L+TDG P + + R + D++ F IG+ + ADM LA+ S R PL Sbjct: 126 WIVLMTDGMPNGSQAELSRSIQRTCDMINDRKLTIFPIGIGEDADMDVLARFSPKRSPLK 185 Query: 180 LQGLQFRELFSWLSSSLRSVSRSTPGTEVVLEAP--KGWTSV 219 LQGL F+E F+WLS S+ VS+STPG +V L+ KGW + Sbjct: 186 LQGLNFKEFFAWLSKSVSKVSQSTPGDKVQLDVDGIKGWAEL 227 >UniRef50_A9AV55 von Willebrand factor type A n=2 Tax=Bacteria RepID=A9AV55_HERA2 Length = 222 Score = 161 bits (407), Expect = 2e-38, Method: Compositional matrix adjust. Identities = 92/214 (42%), Positives = 121/214 (56%), Gaps = 19/214 (8%) Query: 15 NPEPRCPCILLLDVSGSMNGRPINELNAGLVTFRDELLADPLALKRVELGIVTFG----- 69 N E +C CIL++D SGSM GRPI+ELN GL F ++ +R+E+ +V F Sbjct: 10 NYEQKCLCILVVDTSGSMQGRPIDELNQGLQVFHQDISNSFSTAQRLEICLVEFNSQADC 69 Query: 70 ---PVHVEQPFTSAANFFPPILFAQGDTPMGAAITKALDMVEERKREYRANGISYYRPWI 126 P V+Q F PIL G T + + A+ V+ERK YR+ G YYRPWI Sbjct: 70 IVEPSLVDQ-------FHMPILAVAGTTKLVDGVRLAIHKVQERKSWYRSTGQPYYRPWI 122 Query: 127 FLITDGAPTDEWQAA--ANKVFRGEEDKRFAFFSIGVQGADMKTLAQISV--RQPLPLQG 182 L+TDG P + A A ++ G +K+F FF IGVQGADM+ L QIS R P+ LQG Sbjct: 123 ILMTDGEPDSDQDVAGLAREIQHGVNNKQFVFFPIGVQGADMRMLQQISTPDRPPMLLQG 182 Query: 183 LQFRELFSWLSSSLRSVSRSTPGTEVVLEAPKGW 216 L+F F WLS+SL V+ ST G + L + GW Sbjct: 183 LRFEAFFDWLSASLSMVASSTDGQVIQLPSTSGW 216 >UniRef50_A3PUP3 von Willebrand factor, type A n=1 Tax=Mycobacterium sp. JLS RepID=A3PUP3_MYCSJ Length = 233 Score = 154 bits (390), Expect = 1e-36, Method: Compositional matrix adjust. Identities = 92/220 (41%), Positives = 121/220 (55%), Gaps = 15/220 (6%) Query: 14 SNPEPRCPCILLLDVSGSMNGRPINELNAGLVTFRDELLADPLALKRVELGIVTFGPVH- 72 +NP+PR C++L DVSGSM G PI L G F L + LA KRVE+ +VTFG V Sbjct: 13 ANPDPRVACVVLADVSGSMQGEPIAALERGFAAFTRYLQNEVLASKRVEVAVVTFGTVAT 72 Query: 73 VEQPFTSAANFFPPILFAQGDTPMGAAITKALDMVEERKREYRANGISYYRPWIFLITDG 132 V P A P A G T M A I ALD++E+RK Y+A G+ YYRPWI L+TDG Sbjct: 73 VLVPMQEARTLQPVAFTASGTTNMAAGIHLALDILEDRKHAYKAAGLQYYRPWILLLTDG 132 Query: 133 APT-DEWQAAANKVFRGEEDKRFAFFSIGVQ-GADMKTLAQISV-RQPLPLQGLQFRELF 189 P D + A ++ E + F++G D + L ++S+ R P PL+GL++ ELF Sbjct: 133 KPNLDGFDEAVARLNAVESARGVTVFAVGAGPRVDYQQLGRLSLQRSPAPLEGLKYEELF 192 Query: 190 SWLSSSLRSVSRSTP-----------GTEVVLEAPKGWTS 218 WLS+SL +VS ST V L + GWTS Sbjct: 193 EWLSASLSNVSNSTEFARDDQTHEAMNGRVPLPSAAGWTS 232 >UniRef50_D0W6S8 Glycosyl transferase, group 2 family n=1 Tax=Neisseria lactamica ATCC 23970 RepID=D0W6S8_NEILA Length = 191 Score = 153 bits (386), Expect = 5e-36, Method: Compositional matrix adjust. Identities = 81/192 (42%), Positives = 115/192 (59%), Gaps = 5/192 (2%) Query: 32 MNGRPINELNAGLVTFRDELLADPLALKRVELGIVTFGPVHVEQ--PFTSAANF-FPPIL 88 M G PI +LN G+ F L D +A VE+GI+ G HVE+ PFT+A + Sbjct: 1 MYGEPIEQLNQGVQQFIQALQEDEIASYSVEVGILAAGG-HVEEIIPFTTAEQLDYTSTF 59 Query: 89 FAQGDTPMGAAITKALDMVEERKREYRANGISYYRPWIFLITDGAPTDEWQAAANKVFRG 148 AQG TP+G+A+ + L M+E+RKREY+ NG++YY+PW+ +I+DG+PTD WQ AA + Sbjct: 60 TAQGSTPLGSAVEQGLKMLEDRKREYQKNGVAYYQPWLVVISDGSPTDSWQNAAQETRTL 119 Query: 149 EEDKRFAFFSIGVQGADMKTLAQISVRQPLPLQGLQFRELFSWLSSSLRSVSRS-TPGTE 207 E+++ +GV ADM L Q S R L L GL+F + F WLS+S+ VS S + + Sbjct: 120 AENRKLVSLMVGVNDADMDKLGQFSNRPALKLDGLRFGDFFQWLSASMSRVSASNSTAAQ 179 Query: 208 VVLEAPKGWTSV 219 V L W S+ Sbjct: 180 VSLPPIDTWASI 191 >UniRef50_C8PVC3 von Willebrand factor type A domain protein n=1 Tax=Enhydrobacter aerosaccus SK60 RepID=C8PVC3_9GAMM Length = 260 Score = 148 bits (374), Expect = 1e-34, Method: Compositional matrix adjust. Identities = 85/230 (36%), Positives = 129/230 (56%), Gaps = 17/230 (7%) Query: 7 FATSDFASNPEPRCPCILLLDVSGSM-----NG--RPINELNAGLVTFRDELLADPLALK 59 F D+ +N R C+L+LD+SGSM NG R I+ LN G+ F +L+ D A Sbjct: 29 FREIDYGNNVAQRTLCVLVLDLSGSMAIRSGNGDKRRIDMLNEGIEAFYHDLMKDETARN 88 Query: 60 RVELGIVTFGPVH----VEQPFTSAANFFPPILFAQGDTPMGAAITKALDMVEERKREYR 115 RV L IV G V+ + +T A +FFP G TP+G + AL+++E+ + R Sbjct: 89 RVRLAIVIVGGVNDTAELMMDWTDAIDFFPIKFRENGMTPLGQGMLLALNLIEQERINLR 148 Query: 116 ANGISYYRPWIFLITDGAPTDE---WQAAANKVFRGEEDKRFAFFSIGVQGA--DMKTLA 170 NGI+Y RPW+ +TDG PTD WQAA N+ + E++ + + I + ++K L Sbjct: 149 DNGINYTRPWVIAMTDGLPTDSQDVWQAAINQCHQAEQNNQCIIYPIAIDAGVQEVKMLK 208 Query: 171 QISVRQ-PLPLQGLQFRELFSWLSSSLRSVSRSTPGTEVVLEAPKGWTSV 219 Q+S+ P+ L ++F E F WLS+SL++VS+S PG V L + W ++ Sbjct: 209 QLSILTPPVHLNSVKFVEFFVWLSASLKTVSQSAPGETVQLGSISPWATI 258 >UniRef50_Q87W17 von Willebrand factor type A domain protein n=2 Tax=Proteobacteria RepID=Q87W17_PSESM Length = 224 Score = 145 bits (366), Expect = 9e-34, Method: Compositional matrix adjust. Identities = 86/214 (40%), Positives = 120/214 (56%), Gaps = 3/214 (1%) Query: 1 MSEQITFATSDFASNPEPRCPCILLLDVSGSMNGRPINELNAGLVTFRDELLADPLALKR 60 M E+ + D NP R P L+LDVSGSM G PI EL AG+ F + D +A Sbjct: 1 MQEEYILSQEDLVDNPTARVPICLVLDVSGSMAGEPIRELQAGVNMFYQAIREDEVAQYA 60 Query: 61 VELGIVTFGP-VHVEQPFTSAANFFPPILFAQGDTPMGAAITKALDMVEERKREYRANGI 119 E+ IVTFG F + P L A+G T MG + ALD++E RK +Y+ G+ Sbjct: 61 AEISIVTFGSEAKRTVDFMAIERQDVPALIAEGTTSMGQGVNLALDLLEVRKGDYQRAGV 120 Query: 120 SYYRPWIFLITDGAPTDEWQAAANKVFRGEEDKRFAFFSIGV-QGADMKTLAQIS-VRQP 177 YY+PW+ ++TDG PTD+ A+ ++ E K+ F I + A++ L +S R P Sbjct: 121 DYYQPWMVVMTDGEPTDDITRASERIREMCESKKLTVFPIAIGTAANLDILGMLSPGRPP 180 Query: 178 LPLQGLQFRELFSWLSSSLRSVSRSTPGTEVVLE 211 L L+GL F+E F WLS S+ VS+STPG V+L+ Sbjct: 181 LRLKGLNFKEFFLWLSRSVSRVSQSTPGETVILD 214 >UniRef50_C1TQB2 Uncharacterized protein n=1 Tax=Dethiosulfovibrio peptidovorans DSM 11002 RepID=C1TQB2_9BACT Length = 225 Score = 136 bits (343), Expect = 5e-31, Method: Compositional matrix adjust. Identities = 89/223 (39%), Positives = 122/223 (54%), Gaps = 5/223 (2%) Query: 2 SEQITFATSDFASNPEPRCPCILLLDVSGSMNGRPINELNAGLVTFRDELLADPLALKRV 61 S + ++ NP PR P L+LDVSGSM G PI ELN G+ F L D +A Sbjct: 3 SLNMILDQNEMVENPTPRVPVSLVLDVSGSMLGAPIEELNRGVELFFKSLKDDDVARYSA 62 Query: 62 ELGIVTF-GPVHVEQPFTSAANFFPPILFAQGDTPMGAAITKALDMVEERKREYRANGIS 120 E+ +++F V E F P L A G T MG A++ AL+ +E+RK YR G+ Sbjct: 63 EVSVISFSNEVTQEVDFGPLEKCDIPELKAIGKTRMGGAVSLALESLEKRKELYRTLGVD 122 Query: 121 YYRPWIFLITDGAPTDEWQAAANKVFRGEEDKRFAFFSIGV-QGADMKTLAQIS-VRQPL 178 YY+PW+ ++TDG P D+WQ AA K + + F I + A TL + S R PL Sbjct: 123 YYQPWMVIMTDGKPNDDWQLAAAKTSALVDKGKLTVFPIAIGDNACTDTLKEFSPARNPL 182 Query: 179 PLQGLQFRELFSWLSSSLRSVSRSTPG--TEVVLEAPKGWTSV 219 L+ L F+E F WLSSS+ VS+S PG E+ L+ +GW S+ Sbjct: 183 RLKDLNFQEFFRWLSSSVSKVSQSIPGEKVELDLKGLEGWASL 225 >UniRef50_C0DBA5 Putative uncharacterized protein n=1 Tax=Clostridium asparagiforme DSM 15981 RepID=C0DBA5_9CLOT Length = 231 Score = 130 bits (327), Expect = 3e-29, Method: Compositional matrix adjust. Identities = 74/183 (40%), Positives = 100/183 (54%), Gaps = 7/183 (3%) Query: 17 EPRCPCILLLDVSGSMNGRPINELNAGLVTFRDELLADPLALKRVELGIVTFGP-VHVEQ 75 E C+LL+D SGSM G INELN GL+ F + L D A ++ +++F V Sbjct: 20 ERHIACVLLVDTSGSMAGASINELNQGLLEFGNALDQDEHARGVADVCVISFNSNVETVV 79 Query: 76 PFTSAANFFPPILFAQGDTPMGAAITKALDMVEERKREYRANGISYYRPWIFLITDGAPT 135 PF AAN+ P L A G T M A+ LD +EERK+ YR G SYYRPW+FL+TDG PT Sbjct: 80 PFCPAANYSAPTLSAGGLTSMNEAVIAGLDAIEERKQLYRQLGCSYYRPWMFLLTDGEPT 139 Query: 136 DEWQ--AAANKVFRGEEDKRFAFFSIGVQG----ADMKTLAQISVRQPLPLQGLQFRELF 189 D+ A N++ + DK+ FF +G+ A +K+ + L QF+E F Sbjct: 140 DQNMEGEAKNRLQQALNDKKVNFFPMGIGSGANYAHLKSYTKGGNGAVLKASASQFKEAF 199 Query: 190 SWL 192 WL Sbjct: 200 VWL 202 >UniRef50_D1PS09 von Willebrand factor type A domain protein n=1 Tax=Subdoligranulum variabile DSM 15176 RepID=D1PS09_9FIRM Length = 246 Score = 129 bits (324), Expect = 7e-29, Method: Compositional matrix adjust. Identities = 86/245 (35%), Positives = 122/245 (49%), Gaps = 37/245 (15%) Query: 11 DFASNPEPRCPCILLLDVSGSM----------------NGRP----------INELNAGL 44 D NP PR P L LD SGSM +GR ++EL G+ Sbjct: 3 DLVENPTPRVPICLCLDTSGSMGAVQGDCVDTGKTLFEDGRQWNLVTGGTSRLDELQKGI 62 Query: 45 VTFRDELLADPLALKRVELGIVTF-GPVHVEQPFTSAANFFP-PILFAQGDTPMGAAITK 102 F + + D +A E+ IVTF F + P L A GDT MG + Sbjct: 63 KLFYNSVREDEVARYAAEICIVTFDSEAKCRMDFANLDRQSDLPELTATGDTAMGEGVNL 122 Query: 103 ALDMVEERKREYRANGISYYRPWIFLITDGAPT---DEWQAAANKVFRGEEDKRFAFF-- 157 ALD++E RKREY+ G+ Y++PW+ L+TDG P E++ A + E K+ F Sbjct: 123 ALDLLESRKREYQDKGVDYFQPWLVLMTDGVPNGNEGEFERAVQRCRDMEAQKKLTVFPI 182 Query: 158 SIGVQGADMKTLAQISVRQ-PLPLQGLQFRELFSWLSSSLRSVSRSTPGTEVVLE--APK 214 +IG +G D LA+ S ++ PL LQGL+FRE F+WLS S+ S+S PG + L+ + + Sbjct: 183 AIGDEG-DQTALAKFSAKRPPLKLQGLKFREFFAWLSQSVAKTSQSMPGETIKLDLNSIQ 241 Query: 215 GWTSV 219 GW + Sbjct: 242 GWAEL 246 >UniRef50_A6G3C6 von Willebrand factor, type A n=1 Tax=Plesiocystis pacifica SIR-1 RepID=A6G3C6_9DELT Length = 211 Score = 109 bits (273), Expect = 6e-23, Method: Compositional matrix adjust. Identities = 63/168 (37%), Positives = 89/168 (52%), Gaps = 16/168 (9%) Query: 37 INELNAGLVTFRDELLADPLALKRVELGIVTFGPVHV-EQPFTSAANFFPPILFAQGDTP 95 I +N L FR +++ DPLA KR++L +++F V E F SA N+ PP L G T Sbjct: 13 IGAVNRSLQAFRADIMEDPLARKRLDLCVISFNHECVTENHFCSAQNWRPPTLVPGGATG 72 Query: 96 MGAAITKALDMVEERKREYRANGISYYRPWIFLITDGAPTD------EWQAAANKVFRGE 149 MG AI L+ + R + YR +G+ YRPW+ L+TDG PTD W + GE Sbjct: 73 MGQAIKVGLETLRGRLQRYRLDGVDCYRPWVMLVTDGLPTDMQPNDARWMEVRQLIQDGE 132 Query: 150 EDKRFAFFSIGVQGADMKTLAQISVRQPLPLQGLQFRE-----LFSWL 192 + +RF FFS+ V + L Q+ ++P LQ RE +F WL Sbjct: 133 QKRRFMFFSVAVLPEAIPALRQLGAQRP----PLQVREGKIPTMFKWL 176 >UniRef50_C9ZGR0 Putative uncharacterized protein n=1 Tax=Streptomyces scabiei 87.22 RepID=C9ZGR0_STRSW Length = 253 Score = 103 bits (256), Expect = 5e-21, Method: Compositional matrix adjust. Identities = 77/226 (34%), Positives = 104/226 (46%), Gaps = 26/226 (11%) Query: 7 FATSDFASNPEPRCPCILLLDVSGSMNGRPINELNAGLVTFRDELLADPLALKRVELGIV 66 +A +F +N + R P +L LD S SM G PI LN L + EL D VE+ +V Sbjct: 13 YADIEFENNAQ-RMPLVLCLDTSSSMAGPPIQTLNNALAEWTRELHDDVSLSYSVEVAVV 71 Query: 67 TFG--------------PVHVEQPFTSAANFFPPILFAQGDTPMGAAITKALDMVEERKR 112 TFG P PF A F P L A G T M A+ A+ +V RK Sbjct: 72 TFGGQGVGAWRGPQLLDPRTRTSPFIPAHAFQAPQLTAAGVTLMTEALELAMHIVAARKS 131 Query: 113 EYRANGISYYRPWIFLITDGAP-------TDEWQAAANKVFRGEEDKRFAFFSIGVQGAD 165 E RA+G+ YYRP I L+TDG P TD W + + +RF ++IGV G Sbjct: 132 ELRASGLQYYRPQICLVTDGLPTDPTGHLTDSWHRLVPVLAEEQSARRFRLYAIGVGGIT 191 Query: 166 ---MKTLAQISVRQPLPLQGLQFRELFSWLSSSLRSVSRSTPGTEV 208 + L + + +QG FREL +S+S + + G EV Sbjct: 192 DRGEQVLKAFAPKFNARIQGFPFRELLQMMSASANAEQKGA-GDEV 236 >UniRef50_Q19NM5 TerY1 n=54 Tax=root RepID=Q19NM5_ECOK1 Length = 239 Score = 87.8 bits (216), Expect = 3e-16, Method: Compositional matrix adjust. Identities = 56/181 (30%), Positives = 86/181 (47%), Gaps = 5/181 (2%) Query: 19 RCPCILLLDVSGSMNGRPINELNAGLVTFRDELLADPLALKRVELGIVTF-GPVHVEQPF 77 R P LLLD SGSM+G PI + G+ T L DP AL+ + ++TF P Sbjct: 30 RLPVYLLLDTSGSMHGEPIEAVKNGVQTLLTTLKQDPYALETAYVSVITFDSSARQAVPL 89 Query: 78 TSAANFFPPILFAQGDTPMGAAITKALDMVEERKREYRANGISYYRPWIFLITDGAPTDE 137 T +F P L A G T +G A++ + + ++ A+ +RP +FL+TDG+P D+ Sbjct: 90 TDLLSFQMPALTASGTTSLGEALSLTASSIAKEVQKTTADTKGDWRPLVFLMTDGSPNDD 149 Query: 138 WQAAANKVFRGEEDKRFAFFSIGVQGADMKTLAQIS--VRQPLPLQGLQFRELFSWLSSS 195 W+ N F+ + G AD L +I+ V Q + F W+S+S Sbjct: 150 WRKGLND-FKAARTGVVVACAAG-HDADTSVLKEITEIVVQLDTADSSTIKAFFKWVSAS 207 Query: 196 L 196 + Sbjct: 208 I 208 >UniRef50_C8W3F9 von Willebrand factor type A n=1 Tax=Desulfotomaculum acetoxidans DSM 771 RepID=C8W3F9_DESAS Length = 219 Score = 87.4 bits (215), Expect = 3e-16, Method: Compositional matrix adjust. Identities = 57/187 (30%), Positives = 92/187 (49%), Gaps = 7/187 (3%) Query: 19 RCPCILLLDVSGSMNGRPINELNAGLVTFRDELLADPLALKRVELGIVTFGP-VHVEQPF 77 R P LLLD SGSM G PI + G+ EL +P A++ + ++TFG + Sbjct: 11 RLPVYLLLDRSGSMFGEPIEAVKQGVKYMISELKKEPQAIETAYISVITFGSDARQDVQL 70 Query: 78 TSAANFFPPILFAQGDTPMGAAITKALDMVEERKREYRANGISYYRPWIFLITDGAPTDE 137 T A F P + A G T +GAA+ + + R+ Y+P +F++TDG PTD+ Sbjct: 71 TELAAFKEPQIEANGTTSLGAALHILNNCFDNEVRKSTPTQKGDYKPLVFIMTDGEPTDD 130 Query: 138 WQAAANKVFRGEEDKRFAFFSIGVQGADMKTLAQISVRQPLPL----QGLQFRELFSWLS 193 W+ AA ++ + + K ++G G D+ T + + L Q F++ F W+S Sbjct: 131 WENAAREI-KQKSGKVANIVAVGC-GPDVNTDTLKKITDIVLLMSSYQPEDFKQFFRWVS 188 Query: 194 SSLRSVS 200 S++ S Sbjct: 189 QSVKQAS 195 >UniRef50_UPI000185CA78 von Willebrand factor type A n=1 Tax=Capnocytophaga sputigena ATCC 33612 RepID=UPI000185CA78 Length = 347 Score = 84.7 bits (208), Expect = 2e-15, Method: Compositional matrix adjust. Identities = 55/192 (28%), Positives = 92/192 (47%), Gaps = 5/192 (2%) Query: 19 RCPCILLLDVSGSMNGRPINELNAGLVTFRDELLADPLALKRVELGIVTF-GPVHVEQPF 77 R P LLDVS SM G PI + G+ T EL ADP AL+ V L I+ F G V P Sbjct: 3 RLPIYFLLDVSESMVGDPIEHVQDGMATIIKELKADPFALETVWLSIIGFAGKSKVITPL 62 Query: 78 TSAANFFPPILFAQGDTPMGAAITKALDMVEERKREYRANGISYYRPWIFLITDGAPTDE 137 F+PP + G T + + + + ++ ++ + ++P +FL TDG PTD+ Sbjct: 63 QDIITFYPPKIPIGGGTSLASGLNELMNAIDREVVKTTLERKGDWKPLVFLFTDGIPTDD 122 Query: 138 WQAAANKVFRGEEDKRFAFFSIGV-QGADMKTLAQIS--VRQPLPLQGLQFRELFSWLSS 194 A A + + ++ +I + + + L Q++ V Q ++E F W+++ Sbjct: 123 -PAQAIERWNAHYRRKVNLVAISLGENTNYNLLGQLTDQVLQFNNTNAAAYKEFFKWITA 181 Query: 195 SLRSVSRSTPGT 206 S+++ S T Sbjct: 182 SIKTTSEQVNNT 193 >UniRef50_C6Z299 Tellurium resistance protein n=1 Tax=Bacteroides sp. 4_3_47FAA RepID=C6Z299_9BACE Length = 348 Score = 81.3 bits (199), Expect = 2e-14, Method: Compositional matrix adjust. Identities = 57/190 (30%), Positives = 89/190 (46%), Gaps = 9/190 (4%) Query: 19 RCPCILLLDVSGSMNGRPINELNAGLVTFRDELLADPLALKRVELGIVTF-GPVHVEQPF 77 R P L+DVS SM G PI ++ G+ EL DP AL+ + ++ F G P Sbjct: 3 RLPVYFLVDVSESMVGAPIQQVQDGMRMIVQELRTDPYALETAYISVIAFAGKAKCVSPL 62 Query: 78 TSAANFFPPILFAQGDTPMGAAITKALDMVEERKREYRANGISYYRPWIFLITDGAPTDE 137 T F+PP G T +G A+ +D +++ ++P +FL TDG PTD Sbjct: 63 TELYKFYPPTFPIGGGTSLGNALEFLMDDMDKTLVRTTTEQKGDWKPIVFLFTDGNPTDN 122 Query: 138 WQAAA---NKVFRGEEDKRFAFFSIGVQGADMKTLAQIS--VRQPLPLQGLQFRELFSWL 192 A N +RG+ + SIG + + L QIS V + + F+ F W+ Sbjct: 123 PSNAFTRWNNKYRGKAN--IVAISIG-DNVNTQLLGQISDNVLRLNKTDEISFKSFFKWV 179 Query: 193 SSSLRSVSRS 202 ++S+++ S S Sbjct: 180 TASIKATSVS 189 >UniRef50_D1TTW6 von Willebrand factor type A domain protein n=10 Tax=Enterobacteriaceae RepID=D1TTW6_YERPE Length = 327 Score = 79.7 bits (195), Expect = 6e-14, Method: Compositional matrix adjust. Identities = 52/188 (27%), Positives = 93/188 (49%), Gaps = 5/188 (2%) Query: 19 RCPCILLLDVSGSMNGRPINELNAGLVTFRDELLADPLALKRVELGIVTF-GPVHVEQPF 77 R P +LD S SM G + ++N GL ++L DP AL+ + ++ F G P Sbjct: 3 RLPIFFVLDCSESMIGENLKKMNDGLQMIINDLKKDPHALETAWISVIAFAGVAKTIVPL 62 Query: 78 TSAANFFPPILFAQGDTPMGAAITKALDMVEERKREYRANGISYYRPWIFLITDGAPTDE 137 +F+PP L G T +GAA+ + ++ + R+ ++P ++L+TDG PTD+ Sbjct: 63 VEVVSFYPPRLPIGGGTSLGAALQELTRQIDTQVRKTTEERKGDWKPVVYLLTDGRPTDD 122 Query: 138 WQAAANKVFRGEEDKRFAFFSIGVQ-GADMKTLAQISVRQPL--PLQGLQFRELFSWLSS 194 A + ++ ++ +IG+ AD+ L Q++ L Q F + W+++ Sbjct: 123 TTAEITR-WKTHYARKVNLIAIGLGPSADLNILRQLTENVLLFNDTQEGDFTQFIKWITA 181 Query: 195 SLRSVSRS 202 S+ + SRS Sbjct: 182 SVSAHSRS 189 >UniRef50_A9BLP5 von Willebrand factor type A n=3 Tax=Burkholderiales RepID=A9BLP5_DELAS Length = 244 Score = 77.0 bits (188), Expect = 4e-13, Method: Compositional matrix adjust. Identities = 60/207 (28%), Positives = 92/207 (44%), Gaps = 14/207 (6%) Query: 4 QITFATSDFASNPEPRCPCILLLDVSGSMNGRPINELNAGLVTFRDELLADPLALKRVEL 63 I F S F + P +LLLDVSGSM+G I +N + D + + Sbjct: 3 NIPFDPSKFTAPKAKPLPVVLLLDVSGSMSGEKIRNVNDAVRDMLDTFSDTENGETEIHV 62 Query: 64 GIVTFGP-VHVEQPFTSAANFFPPILFAQGDTPMGAAITKALDMVEERKR-EYRANGISY 121 I+TFG V + QP SA++ L A G TP+G A+ A M+E++ RA Sbjct: 63 AIITFGSQVALHQPLASASDIHWQDLSAGGMTPLGTALQMAKAMIEDKDVIPSRA----- 117 Query: 122 YRPWIFLITDGAPTDEWQAAANKVFRGEEDKRFAFFSIGVQGADM------KTLAQISVR 175 YRP + L++DG P D W+ N + ++ + GAD K + S R Sbjct: 118 YRPTVVLVSDGGPNDAWEKPLNAFISDGRSAKCDRLAMAI-GADADEAVLGKFIEGTSNR 176 Query: 176 QPLPLQGLQFRELFSWLSSSLRSVSRS 202 Q R+ F +++ S+ ++S Sbjct: 177 LFYAENAKQLRDFFKFVTMSVTIRTKS 203 >UniRef50_Q5NWS3 Tellurium resistance protein n=2 Tax=Proteobacteria RepID=Q5NWS3_AZOSE Length = 349 Score = 75.5 bits (184), Expect = 1e-12, Method: Compositional matrix adjust. Identities = 57/188 (30%), Positives = 84/188 (44%), Gaps = 5/188 (2%) Query: 19 RCPCILLLDVSGSMNGRPINELNAGLVTFRDELLADPLALKRVELGIVTF-GPVHVEQPF 77 R P L+DVS SM G + +L GL L ADP AL+ V + ++ F G P Sbjct: 3 RLPIFFLVDVSESMAGDNLRQLQEGLERLVRSLRADPYALETVFISVIAFAGKPKTLTPL 62 Query: 78 TSAANFFPPILFAQGDTPMGAAITKALDMVEERKREYRANGISYYRPWIFLITDGAPTDE 137 F+ P L T +G+A+ +D +E + +RP ++L+TDG PTD+ Sbjct: 63 VELYQFYAPRLPLGSGTSLGSAMAHLMDEMERTVQRSTPEKKGDWRPVVYLLTDGKPTDD 122 Query: 138 WQAAANKVFRGEEDKRFAFFSIGVQGADMKTLAQISVRQPLPLQGL---QFRELFSWLSS 194 + A + R E+ R +IGV + Q L L F+ W+S Sbjct: 123 IEPAIKRWKRDFEE-RSNLVAIGVGKHASLSALQRFTENVLSLDATTEDDFKRFIDWISQ 181 Query: 195 SLRSVSRS 202 S+ S SRS Sbjct: 182 SVASQSRS 189 >UniRef50_B7AFM8 Putative uncharacterized protein n=1 Tax=Bacteroides eggerthii DSM 20697 RepID=B7AFM8_9BACE Length = 333 Score = 75.5 bits (184), Expect = 1e-12, Method: Compositional matrix adjust. Identities = 56/175 (32%), Positives = 84/175 (48%), Gaps = 5/175 (2%) Query: 32 MNGRPINELNAGLVTFRDELLADPLALKRVELGIVTF-GPVHVEQPFTSAANFFPPILFA 90 M G PI ++ G+ EL DP AL+ V + ++ F G V P T F+PP Sbjct: 1 MVGEPIIQVEKGMRNIIQELRTDPYALETVFVSVIVFAGKEKVLSPLTELYKFYPPQFPI 60 Query: 91 QGDTPMGAAITKALDMVEERKREYRANGISYYRPWIFLITDGAPTDEWQAAANKVFRGEE 150 G T +G A+ ++ +++ ++ ++P IFL TDG PTD Q A N+ + Sbjct: 61 GGGTSLGTALDCLMNDIDKSVKKTTVEMKGDWKPIIFLFTDGMPTDNPQQAFNRWNAHYK 120 Query: 151 DK-RFAFFSIGVQGADMKTLAQISVRQ-PLPLQGLQ-FRELFSWLSSSLRSVSRS 202 K SIG D K L +IS L G Q F+ F W+++S++S S S Sbjct: 121 RKANLVCISIG-DNTDTKMLGKISDNVLRLNDTGEQSFKAFFKWVTASIKSTSVS 174 >UniRef50_Q8DK92 Tlr0974 protein n=1 Tax=Thermosynechococcus elongatus BP-1 RepID=Q8DK92_THEEB Length = 241 Score = 74.7 bits (182), Expect = 2e-12, Method: Compositional matrix adjust. Identities = 60/201 (29%), Positives = 94/201 (46%), Gaps = 16/201 (7%) Query: 17 EPRCPCILLLDVSGSMNGRPINELNAGLVTFRDELLADPLALKRVELGIVTFGPVH--VE 74 E P LLLD S SM G PI L+ GL F+ E+ +D A V++G++TF V Sbjct: 19 ERHLPVYLLLDTSSSMEGAPIESLHQGLEQFQREVSSDQFARDIVKVGVITFASDAQLVT 78 Query: 75 QPFTSAANFFPPILFAQGDTPMGAAITKALDMVE-ERKREYRANGISYYRPWIFLITDGA 133 ++F PP+L A G T + A T L+ ++ + R + ++P +F++TDG Sbjct: 79 GGLVPISDFQPPMLTASGVTRLDLAFTVLLESIDRDVVRPVKGGQKGDWKPAVFVLTDGR 138 Query: 134 PTDEWQAAANKVFRGEED----------KRFAFFSIGVQ-GADMKTLAQISVRQPLPLQG 182 PTD A ++++R D K ++G D TL IS + Sbjct: 139 PTDRHGIATDELWRPARDALVNRPKGEIKPSVIVAVGCGPHVDDDTLKAISTGTAFKMGT 198 Query: 183 LQ--FRELFSWLSSSLRSVSR 201 + F LF +LS SL + ++ Sbjct: 199 SEAAFVALFQYLSQSLTTSTQ 219 >UniRef50_A1VV60 von Willebrand factor, type A n=4 Tax=Proteobacteria RepID=A1VV60_POLNA Length = 240 Score = 72.4 bits (176), Expect = 1e-11, Method: Compositional matrix adjust. Identities = 71/215 (33%), Positives = 109/215 (50%), Gaps = 23/215 (10%) Query: 16 PEPR-CPCILLLDVSGSM--NGRPINELNAGLVTFRDELLADPLALKRVELGIVTFG--P 70 P+ R P I+L DVSGSM NG+ I+ LN L + +++G++TFG Sbjct: 24 PQARPLPVIVLADVSGSMSENGK-IDALNVALKEMILSFGKESGLRAEIQVGLITFGGRE 82 Query: 71 VHVEQPFTSAANFFPPILF-AQGDTPMGAAITKALDMVEERKR-EYRANGISYYRPWIFL 128 H P +A F A G TPMG+A A ++E++++ RA YRP + L Sbjct: 83 AHEHLPLVAAKVIGGVEAFKANGGTPMGSAFALARKLLEDKEQIPSRA-----YRPVLIL 137 Query: 129 ITDGAPTDEWQAAANKVF---RGEEDKRFAFFSIGVQGADMKTLAQISVRQPLPL-QGLQ 184 ++DGAPTD W+A + RG++ RFA +IG AD+ LAQ + P+ + + Sbjct: 138 VSDGAPTDAWEAPLADLKASERGQKATRFA-MAIGAD-ADLDMLAQFPNDREAPVFKTHE 195 Query: 185 FRELFSWLSS-SLRSVSRST---PGTEVVLEAPKG 215 R++ + + ++ VSRST P V L+ G Sbjct: 196 ARDIGRFFRAVTMSVVSRSTSAAPDQPVTLDMEDG 230 >UniRef50_Q5NWS4 Tellurium resistance protein n=8 Tax=Bacteria RepID=Q5NWS4_AZOSE Length = 214 Score = 72.0 bits (175), Expect = 1e-11, Method: Compositional matrix adjust. Identities = 59/193 (30%), Positives = 86/193 (44%), Gaps = 13/193 (6%) Query: 19 RCPCILLLDVSGSMNGRPINELNAGLVTFRDELLADPLALKRVELGIVTF-GPVHVEQPF 77 R P LL+D SGSM G P+ +N GL + L +P A++ V L + TF + P Sbjct: 6 RLPVYLLIDTSGSMRGEPVESVNVGLRAMQTSLRQNPYAIETVHLSVTTFDSQIKDVLPL 65 Query: 78 TSAANFFPP--ILFAQGDTPMGAAITKALDMVEERKREYRANGISYYRPWIFLITDGAPT 135 T+ + P + A G T +G A+ LD ++ R+ A + P +F++TDG PT Sbjct: 66 TALEDATIPEIVCPASGATLLGEALEHILDRAKKEVRQSSAEQKGDWAPLLFIMTDGKPT 125 Query: 136 DEWQAAANKVFRGEEDKRFAFFSIGVQGADMKT------LAQISVRQPLPLQGLQFRELF 189 D + N+V K F F SI A K L V + F F Sbjct: 126 DTF--VFNQV--APAIKAFKFGSIIACAAGPKADPAGLRLITDHVVSLDTMDSAAFTAFF 181 Query: 190 SWLSSSLRSVSRS 202 W+S ++ S S S Sbjct: 182 QWVSVTVSSGSMS 194 >UniRef50_B6BJ58 Phage/colicin/tellurite resistance cluster TerY protein n=1 Tax=Campylobacterales bacterium GD 1 RepID=B6BJ58_9PROT Length = 229 Score = 71.2 bits (173), Expect = 2e-11, Method: Compositional matrix adjust. Identities = 51/169 (30%), Positives = 76/169 (44%), Gaps = 9/169 (5%) Query: 5 ITFATSDFASNPEPRCPCILLLDVSGSMNGRPINELNAGLVTFRDELLADPLALKRVELG 64 + F +DF P +LLLDVS SM G I+ LN + + + ++L Sbjct: 1 MAFNPADFVVEEPKSIPVVLLLDVSYSMQGENIDTLNKAVESMLNSFKKAETMETFIKLS 60 Query: 65 IVTFGP---VHVEQPFTSAANF-FPPILFAQGDTPMGAAITKALDMVEERKREYRANGIS 120 I+TFG V + P T + F P L G TPMGAA M+E+ K ++ Sbjct: 61 IITFGSENGVDLHTPLTEVSKIDFKP-LTVSGSTPMGAAFKMGKAMIED-KDIFKGRD-- 116 Query: 121 YYRPWIFLITDGAPTDEWQAAANKVFRGEEDKRFAFFSIGVQGADMKTL 169 YRP I L++DG P D+W+ + K+ ++ + AD L Sbjct: 117 -YRPTIVLLSDGEPNDDWRQPLDDFVSTGRTKKCDRMALAIGAADKTVL 164 >UniRef50_B2K3B2 von Willebrand factor type A n=39 Tax=Gammaproteobacteria RepID=B2K3B2_YERPB Length = 233 Score = 71.2 bits (173), Expect = 2e-11, Method: Compositional matrix adjust. Identities = 52/144 (36%), Positives = 73/144 (50%), Gaps = 7/144 (4%) Query: 19 RCPCILLLDVSGSMNGRPINELNAGLVTFRDELLADPLALKRVELGIVTFGPVHVEQ-PF 77 R P LL+D SGSM G I+ +N G+ L DP AL+ V L I+T+ E P Sbjct: 24 RLPVYLLIDTSGSMRGESIHAVNVGIQAMMSALRQDPYALESVHLSIITYDNQAREYIPL 83 Query: 78 TSAANF-FPPILF-AQGDTPMGAAITKALDMVEERKREYRANGISYYRPWIFLITDGAPT 135 T+ NF F I + G T GAA+ + VE + + +RP +FL+TDG P+ Sbjct: 84 TALENFQFTDITVPSAGGTFTGAALECLIHCVERDIQRSDGDQKGDWRPLVFLMTDGTPS 143 Query: 136 DEWQAAANKVFRGEEDKRFAFFSI 159 D + A + + E K+ AF SI Sbjct: 144 DVY--AYGEAIK--EVKKRAFGSI 163 >UniRef50_UPI0001AED79F von Willebrand factor type A n=1 Tax=Streptomyces albus J1074 RepID=UPI0001AED79F Length = 221 Score = 70.1 bits (170), Expect = 5e-11, Method: Compositional matrix adjust. Identities = 49/156 (31%), Positives = 70/156 (44%), Gaps = 5/156 (3%) Query: 21 PCILLLDVSGSMNGRPINELNAGLVTFRDELLADPLALKRVELGIVTFGP-VHVEQPFTS 79 P LL D SGSM G PI+ +N L E+ +P + ++ F V QP Sbjct: 5 PFYLLCDESGSMTGDPIDAINRALPDLHHEISTNPTVADKTRFCLIGFSDDASVLQPLVD 64 Query: 80 AANFFP-PILFAQGDTPMGAAITKALDMVEERKREYRANGISYYRPWIFLITDGAPTDE- 137 ++ P L A G T G A L VE+ E +A G YRP F ++DG PTDE Sbjct: 65 LSDIDEVPALSAGGLTDYGTAFRTLLRSVEKDVAELKAQGHEVYRPVAFFLSDGIPTDED 124 Query: 138 WQAAANKVFRGEEDKRFAFFSIGVQGADMKTLAQIS 173 W A ++ + F IG A+ + + Q++ Sbjct: 125 WPTAHRELLNSRYAPKIIAFGIG--DAEAQIIGQVA 158 >UniRef50_C9LWT6 Tellurium resistance protein n=1 Tax=Selenomonas sputigena ATCC 35185 RepID=C9LWT6_9FIRM Length = 212 Score = 67.0 bits (162), Expect = 4e-10, Method: Compositional matrix adjust. Identities = 57/193 (29%), Positives = 87/193 (45%), Gaps = 14/193 (7%) Query: 32 MNGRPINELNAGLVTFRDELLADPLALKRVELGIVTFGP-VHVEQPFTSAANFFPPILFA 90 M G PI + G+ EL DP AL+ L ++TF V T F P L A Sbjct: 1 MMGEPIEAVRQGIKALLSELRGDPQALETAYLSVITFASQVRQTTKLTELMLFKEPRLEA 60 Query: 91 QGDTPMGAAITKALDMVEERKREYRANGISYYRPWIFLITDGAPTD--EWQAAANKVFRG 148 +G T MG A+ + V R+ +RP +FL+TDG+PTD +++ AA ++ Sbjct: 61 EGCTLMGGALKLLAECVRTEVRKNTETQKGDWRPLVFLLTDGSPTDLEDFRQAAAEI--- 117 Query: 149 EEDKRFAFFSIGVQGADMKT--LAQIS--VRQPLPLQGLQFRELFSWLSSSLRSVSRS-- 202 + + GAD T L Q++ V L + F+W+S S++ S+S Sbjct: 118 -KSLKLGNIIACAAGADADTSYLKQLTDNVLMMNSLSAGDMAKFFAWVSGSIKMSSKSLD 176 Query: 203 -TPGTEVVLEAPK 214 PG + L P+ Sbjct: 177 AKPGAAIELPPPR 189 >UniRef50_A1VWQ4 von Willebrand factor, type A n=20 Tax=Proteobacteria RepID=A1VWQ4_POLNA Length = 350 Score = 66.6 bits (161), Expect = 5e-10, Method: Compositional matrix adjust. Identities = 48/188 (25%), Positives = 87/188 (46%), Gaps = 5/188 (2%) Query: 19 RCPCILLLDVSGSMNGRPINELNAGLVTFRDELLADPLALKRVELGIVTF-GPVHVEQPF 77 R P +LD S SM G + ++ + L DP AL+ V ++ F G P Sbjct: 3 RLPVFFVLDCSESMVGANLKKMEGAVAAIVKSLRTDPQALETVFFSVIAFAGVARTIAPL 62 Query: 78 TSAANFFPPILFAQGDTPMGAAITKALDMVEERKREYRANGISYYRPWIFLITDGAPTDE 137 +F+PP L G T +G+A+ + ++ + A +RP I+L+TDG PTD Sbjct: 63 VEIVSFYPPKLPLGGGTNLGSALDALMGEIDRSVIKTTAERKGDWRPIIYLVTDGRPTDN 122 Query: 138 WQAAANKVFRGEEDKRFAFFSIGV-QGADMKTLAQIS--VRQPLPLQGLQFRELFSWLSS 194 A + + K+ +IG+ + D L +++ V ++ F++ +W+++ Sbjct: 123 PSRAIER-WNSHYAKKATLIAIGLGRSVDFTALRRLTENVISFEDIKESDFKKFINWVTA 181 Query: 195 SLRSVSRS 202 S+ S+S Sbjct: 182 SVVVQSKS 189 >UniRef50_A1THU3 von Willebrand factor, type A n=1 Tax=Mycobacterium vanbaalenii PYR-1 RepID=A1THU3_MYCVP Length = 248 Score = 65.5 bits (158), Expect = 1e-09, Method: Compositional matrix adjust. Identities = 54/162 (33%), Positives = 76/162 (46%), Gaps = 12/162 (7%) Query: 21 PCILLLDVSGSMNGRPINELNAGLVTFRDELLADPLALKRVELGIVTFGPVHVEQ-PFT- 78 P L+ DVS SM G I LN L FRD L +P+ +V+ G++ F E P Sbjct: 20 PFWLVCDVSASM-GPHIGTLNQSLRDFRDSLATNPVLADKVQFGVIDFSDTATEVIPLGD 78 Query: 79 -SAANFFPPILFAQGDTPMGAAITKALDMVEERKREYRANGISYYRPWIFLITDGAPTD- 136 S+A+ L +G T G A T ++E R A+ Y+RP +F +TDG PTD Sbjct: 79 FSSADLERHQLRTRGGTSYGQAFTTVQQIIE-RDLAAGADRFRYFRPAVFFLTDGQPTDR 137 Query: 137 EWQAAANKV--FRGEEDKRF----AFFSIGVQGADMKTLAQI 172 W+ A + F + F F G+ AD TLA++ Sbjct: 138 HWREAFRDLTFFDQASGQGFRSYPLFVPFGIGDADAATLAEL 179 >UniRef50_Q0I303 Putative uncharacterized protein n=5 Tax=Pasteurellaceae RepID=Q0I303_HAES1 Length = 343 Score = 65.1 bits (157), Expect = 1e-09, Method: Compositional matrix adjust. Identities = 42/157 (26%), Positives = 76/157 (48%), Gaps = 3/157 (1%) Query: 19 RCPCILLLDVSGSMNGRPINELNAGLVTFRDELLADPLALKRVELGIVTF-GPVHVEQPF 77 R P L++DVS SM G ++ + L DP AL+ V + ++ F G V P Sbjct: 3 RLPIFLVVDVSESMAGDSHRQMQEAINRLVQRLRCDPYALESVYISVIAFAGAAGVIAPL 62 Query: 78 TSAANFFPPILFAQGDTPMGAAITKALDMVEERKREYRANGISYYRPWIFLITDGAPTDE 137 T +F+ P L T +GAA+ +D ++ + ++P +++++DG TD+ Sbjct: 63 TELMSFYAPRLPMGSGTSLGAALNLTMDEIQRNVVRSSGDQKGDFKPLVYILSDGVATDD 122 Query: 138 WQAAANKVFRGEEDKRFAFFSIGVQG-ADMKTLAQIS 173 +A + ++ E R ++G+ AD+ L QI+ Sbjct: 123 PTSAIQR-WQQEFKSRTKLIAVGLGNFADLSALNQIA 158 >UniRef50_D1AAR7 von Willebrand factor type A n=1 Tax=Thermomonospora curvata DSM 43183 RepID=D1AAR7_THECD Length = 228 Score = 64.3 bits (155), Expect = 3e-09, Method: Compositional matrix adjust. Identities = 46/156 (29%), Positives = 71/156 (45%), Gaps = 3/156 (1%) Query: 21 PCILLLDVSGSMNGRPINELNAGLVTFRDELLADPLALKRVELGIVTFGPV-HVEQPFTS 79 P L+ D S SM G P+ E+N L E+ ++P + L I++F V P Sbjct: 7 PFYLVCDESYSMAGNPLQEINDQLPQIVTEIASNPTVADKARLCIISFSDTAEVLLPLAD 66 Query: 80 AANFFP-PILFAQGDTPMGAAITKALDMVEERKREYRANGISYYRPWIFLITDGAPTD-E 137 + P L +G T GAA T D +E R+ +A G +RP +F +TDG PTD + Sbjct: 67 LNDVHQVPQLAPKGATSYGAAFTLLRDTIERDIRDLKAAGHVPFRPTVFFLTDGQPTDSD 126 Query: 138 WQAAANKVFRGEEDKRFAFFSIGVQGADMKTLAQIS 173 W A ++ + R + G +TL ++ Sbjct: 127 WATAHQRLTAKDFGPRPTILAFGFGDVRPETLRAVA 162 >UniRef50_UPI00018742C1 von Willebrand factor type A n=1 Tax=Corynebacterium amycolatum SK46 RepID=UPI00018742C1 Length = 228 Score = 63.2 bits (152), Expect = 7e-09, Method: Compositional matrix adjust. Identities = 64/223 (28%), Positives = 97/223 (43%), Gaps = 30/223 (13%) Query: 15 NPEPR---CPCILLLDVSGSMNGRPINELNAGLVTFRDELLADPLALKRVELGIVTFGPV 71 N EPR P + D SGSM G I ELN+GL + +E+ P A V ++ F Sbjct: 6 NMEPRGNILPIYFVADESGSM-GPDIAELNSGLQSLLNEIRMAPFAAANVRFSVIGFDNE 64 Query: 72 -----------HVEQPFTSAANFFPPILFAQGDTPMGAAITKALDMVEERKREYRANGIS 120 HVEQ T +A F F+ + A I D+ + + YR N Sbjct: 65 ARLYLSNADLRHVEQMPTLSARF--ATYFSTAFDLLNAQIPD--DVAQLKAEGYRVN--- 117 Query: 121 YYRPWIFLITDGAPT--DEWQAAANKVFRGEEDKRFAFFSIGVQGADMKTLAQISVRQPL 178 RP +FL+TDG P D WQ A + + F IG +D + + Q++ + Sbjct: 118 --RPAVFLLTDGYPMSDDLWQEALSDLQNAPHHPNILAFGIG--ESDAQIIGQMASKNGW 173 Query: 179 PLQGLQFRELFSWLSSSLRSVSRS--TPGTEVVLEAPKGWTSV 219 Q Q + + LS + S+++S + GT V +P+ T + Sbjct: 174 AFQAAQGADTGAMLSEFMSSLTQSVISSGTSVANGSPEIITDI 216 >UniRef50_C7PNX3 von Willebrand factor type A n=2 Tax=Sphingobacteriales RepID=C7PNX3_CHIPD Length = 352 Score = 62.8 bits (151), Expect = 7e-09, Method: Compositional matrix adjust. Identities = 54/191 (28%), Positives = 87/191 (45%), Gaps = 21/191 (10%) Query: 19 RCPCILLLDVSGSMNGRPINELNAGLVTFRDELLADPLALKRVELGIVTF-GPVHVEQPF 77 R P L+DVS SM G I + GL EL +DP AL+ + I+ F G P Sbjct: 3 RLPIYFLIDVSESMVGEQIQFVEEGLAAIIKELKSDPYALETAWVSIIVFAGQAKTIVPL 62 Query: 78 TSAANFFPPILFAQGDTPMGA--AITKALD--MVEERKREYRANGISY--YRPWIFLITD 131 +F+PP P+GA +++ L M E RK ++P +FL TD Sbjct: 63 QEVISFYPPKF------PIGAGTSLSNGLGHLMYEMRKNTIHTTATQKGDWKPIVFLFTD 116 Query: 132 GAPTDEWQAAANKVFRGEEDK-RFAFFSIGVQG--ADMKTLAQISV--RQPLPLQGLQFR 186 G PTD+ AA + + ++K S G + + +K L + + + P ++ Sbjct: 117 GTPTDDTSAAVREWKQNWQNKSNLIAISFGDENNLSALKELTETVLLFKNATP---QSYK 173 Query: 187 ELFSWLSSSLR 197 E F W+++S++ Sbjct: 174 EFFRWVTASIK 184 >UniRef50_A7BNL3 Tellurium resistance protein n=1 Tax=Beggiatoa sp. SS RepID=A7BNL3_9GAMM Length = 171 Score = 62.8 bits (151), Expect = 9e-09, Method: Compositional matrix adjust. Identities = 42/150 (28%), Positives = 75/150 (50%), Gaps = 5/150 (3%) Query: 56 LALKRVELGIVTFGPVHVE-QPFTSAANFFPPILFAQGDTPMGAAITKALDMVEERKREY 114 + ++ V L +TF + P + F P++ A G T +G+A+ +D +E+ R+ Sbjct: 1 MGIETVYLWDITFHSTAQQVTPLSELMLFKEPLISASGATALGSALRLLMDCLEKEVRKN 60 Query: 115 RANGISYYRPWIFLITDGAPTDEWQAAANKVFRGEEDKRFAFFSIGVQGADMKTLAQIS- 173 A ++P +FL+TDG PTD W++AA+K + ++ F+ G AD+ L I+ Sbjct: 61 TAVQKGDWKPLVFLMTDGMPTDAWESAADK-LKNQKSANLIAFAAG-PNADVANLKGITD 118 Query: 174 -VRQPLPLQGLQFRELFSWLSSSLRSVSRS 202 V + L + F W+S S+ +S Sbjct: 119 IVLKSEELSPGALKAFFQWMSQSILQTGKS 148 >UniRef50_C9RK46 von Willebrand factor type A n=1 Tax=Fibrobacter succinogenes subsp. succinogenes S85 RepID=C9RK46_FIBSS Length = 236 Score = 59.7 bits (143), Expect = 7e-08, Method: Compositional matrix adjust. Identities = 55/206 (26%), Positives = 95/206 (46%), Gaps = 27/206 (13%) Query: 21 PCILLLDVSGSMN--GRPINELNAGLVTFRDELLADPLALKRVEL--GIVTFGPVHVE-- 74 P I+L DVSGSMN G+ ++ L L + E+ I+TFG Sbjct: 16 PVIILADVSGSMNEIGK-LDSLKHALNNMISSFKDASSSSLEAEIYVSIITFGNQAANII 74 Query: 75 ---QPFTSAANFFPPI-----LFAQGDTPMGAAITKALDMVEERK-REYRANGISYYRPW 125 Q + AN + + A G+TP+G A+T +D++E R+ RA YRP+ Sbjct: 75 LEPQSASEIANDPSKMNVINKMQAIGNTPLGKALTSLVDLLENREIYPSRA-----YRPF 129 Query: 126 IFLITDGAPTDEWQAAANKVFRGEEDKRFAFFSIGV-QGADMKTLAQISVRQPLPL---- 180 I L +DG P D WQ +++ E K+ ++ + AD L + + +P+ Sbjct: 130 IVLASDGMPNDLWQQPLDRLLNSERSKKANRLALAIGADADESMLKKFVNNEEMPIFKAN 189 Query: 181 QGLQFRELFSWLS-SSLRSVSRSTPG 205 ++ ++ F ++ S+++S + PG Sbjct: 190 NAIEIQKFFKCVTMSAIKSSQSAKPG 215 >UniRef50_A7C6I0 Protein containing Von Willebrand factor, type A n=1 Tax=Beggiatoa sp. PS RepID=A7C6I0_9GAMM Length = 127 Score = 58.9 bits (141), Expect = 1e-07, Method: Compositional matrix adjust. Identities = 35/102 (34%), Positives = 52/102 (50%), Gaps = 3/102 (2%) Query: 61 VELGIVTF-GPVHVEQPFTSAANFFPPILFAQGDTPMGAAITKALDMVEERKREYRANGI 119 +E+ I+TF V E NF PIL +G T + + +A+ VE RK+ Y+ G Sbjct: 1 MEISIITFHSDVKNELEPALVDNFTMPILTTKGSTKLVDGVREAIAKVEARKQWYKETGQ 60 Query: 120 SYYRPWIFLITDGAPTDEW--QAAANKVFRGEEDKRFAFFSI 159 YYRPWI ITDG P + + ++ E K+F F+ + Sbjct: 61 PYYRPWIIAITDGEPDSDQDVEGLTQEIRTAIEGKKFVFWRL 102 >UniRef50_UPI0001745BB0 TerY3 n=1 Tax=Verrucomicrobium spinosum DSM 4136 RepID=UPI0001745BB0 Length = 345 Score = 57.0 bits (136), Expect = 4e-07, Method: Compositional matrix adjust. Identities = 48/191 (25%), Positives = 92/191 (48%), Gaps = 12/191 (6%) Query: 19 RCPCILLLDVSGSMNGRPINELNAGLVTFRDELLADPLALKRVELGIVTFGP-VHVEQPF 77 R P +LLD S SM G + + G+ + L +P AL+ + +TF ++ P Sbjct: 3 RLPVYVLLDCSESMIGNGLRGMRTGISSMLKALRQNPHALETAWISFITFDSRAELKSPL 62 Query: 78 TSAANFFPPILFAQGDTPMGAAITKALDMVEERKREYRANGISY--YRPWIFLITDGAPT 135 S PP L + T +G+A+ + + + + + ++ +RP + LITDG PT Sbjct: 63 QSLDEVQPPRLLVRPGTSLGSALLLLSERILQEVKRTQPGTLTKGDFRPIVILITDGQPT 122 Query: 136 DEWQAAANKVFRGEEDKRFAFFSIG----VQGADMKTLAQISVR-QPLPLQGLQFRELFS 190 D+W++A ++ K +++G + A ++ + + + Q QG + +LF Sbjct: 123 DDWRSALREM--NSTVKIANLYAVGCGDDIDFAGLREMTDVVLNLQQTDEQG--WAKLFV 178 Query: 191 WLSSSLRSVSR 201 W+S ++ + SR Sbjct: 179 WISETVSTASR 189 >UniRef50_Q0RFF8 Putative uncharacterized protein n=1 Tax=Frankia alni ACN14a RepID=Q0RFF8_FRAAA Length = 209 Score = 55.8 bits (133), Expect = 9e-07, Method: Compositional matrix adjust. Identities = 55/187 (29%), Positives = 83/187 (44%), Gaps = 17/187 (9%) Query: 28 VSGSMNGRPINELNAGLVTFRDELLADPLALKRVELGIVTFGPV-HVEQPFTSAANFFPP 86 +S SM G P+ LN L + E+ ++P + + IVTF V P A + P Sbjct: 1 MSASMAGGPLEALNDSLPALQKEMQSNPTVGEIARISIVTFSDVGRTVVPLCDLAEVYLP 60 Query: 87 ILFAQGDTPMGAAITKALDMVEERKREYRANGISYYRPWIFLITDG---APTDEWQAAAN 143 L +G T AA + +E R G YRP +F ++DG AP D W AA N Sbjct: 61 ELMVEGGTNFAAAFQETRRAIEGGLRSL-PKGTPIYRPVVFFMSDGEHQAPGD-WTAALN 118 Query: 144 KVFRGEEDKRFA----FFSIGVQGADMKTLAQISVRQPLPLQ----GLQFRELFSWLSSS 195 + + RFA F G Q ++ ++ +I+ R + Q RE+ + L S Sbjct: 119 DL--RDRSWRFAPEVVAFGFGDQ-VNVDSIRRIATRFSFLARDADPATQVREIMNALIGS 175 Query: 196 LRSVSRS 202 +R+ S S Sbjct: 176 IRTTSTS 182 >UniRef50_A1SV07 Putative uncharacterized protein n=1 Tax=Psychromonas ingrahamii 37 RepID=A1SV07_PSYIN Length = 154 Score = 53.9 bits (128), Expect = 4e-06, Method: Compositional matrix adjust. Identities = 34/130 (26%), Positives = 62/130 (47%), Gaps = 3/130 (2%) Query: 86 PILFAQG-DTPMGAAITKALDMVEERKREYRANGISYYRPWIFLITDGAPTDEWQAAANK 144 P+ A G D M + A +M+ R + + + RPW+ L TDG +D + A K Sbjct: 26 PLFGAYGKDLAMAQGLEVAFNMLLTRVHQLKKKQVKVKRPWLILFTDGLGSDYDKETAKK 85 Query: 145 VFRGEEDKRFAFFSIGVQGADMKTLAQISVRQPLPLQGLQFRELFSWLSSSLRSVSRSTP 204 ++R + + F + + G + LA+ S PL + + +F+W +S + + Sbjct: 86 LYRYSLENKLIVFIVNI-GKETDDLAKFSSIAPLVINISKIESIFTWFYNSFTEIILAED 144 Query: 205 GTEVVLEAPK 214 G V+L+AP+ Sbjct: 145 GI-VILKAPE 153 >UniRef50_D2AQW9 Uncharacterized protein encoded in toxicity protection region of plasmid 478 contains von Willebrand factor (VWF) domain n=1 Tax=Streptosporangium roseum DSM 43021 RepID=D2AQW9_STRRD Length = 202 Score = 53.9 bits (128), Expect = 4e-06, Method: Compositional matrix adjust. Identities = 45/160 (28%), Positives = 68/160 (42%), Gaps = 8/160 (5%) Query: 20 CPCILLLDVSGSMNGRPINELNAGLVTFRDELLADPLALKRVELGIVTFG-PVHVEQPFT 78 P L+++ S SM GR + E+ L F DELL P+ RV + +V F P + Sbjct: 5 VPIYLVVNTSRSMAGR-LVEIEHVLAAFADELLFSPILGDRVRVCVVAFSDSARCVLPLS 63 Query: 79 SAANFFP-PILFAQGDTPMGAAITKALDMVEERKREYRANGISYYRPWIFLITDGAPTDE 137 P L +T + + + A G+ Y RP L+TDG P D Sbjct: 64 DLTGIVELPKLVPGRETRFAPMFDLLTSLTDVDASAHLAAGLVYDRPLALLVTDGVPADP 123 Query: 138 -WQAAANKVFRGEEDKRFAFFSIGV---QGADMKTLAQIS 173 WQ A ++ +R R +IGV Q +M + ++S Sbjct: 124 GWQRAFDRFYR-HAHPRIVLITIGVGAPQAEEMGSAMRVS 162 >UniRef50_UPI0001B52F00 von Willebrand factor type A n=1 Tax=Fusobacterium sp. D11 RepID=UPI0001B52F00 Length = 218 Score = 53.1 bits (126), Expect = 6e-06, Method: Compositional matrix adjust. Identities = 48/157 (30%), Positives = 72/157 (45%), Gaps = 14/157 (8%) Query: 11 DFASNPEPRCPCILLLDVSGSMNGRPINELNAGLVTFRDEL--LADPLALK-RVELGIVT 67 +F S P+ P ILL D S SM + ELN + RD L L + +LK + + +T Sbjct: 2 EFTSQPKKVLPLILLADTSSSMR-EWMRELNTAI---RDMLGTLKEQESLKAEIHISFIT 57 Query: 68 FGP--VHVEQPFTSAANFFPPILFAQGDTPMGAAITKALDMVEERKREYRANGISYYRPW 125 FG ++ T +N G TP+G A+ A +MVE R+ + Y P Sbjct: 58 FGNGGANLHTALTPVSNIEFNDFTEGGMTPLGGALRIAKEMVENREIIPSKS----YAPI 113 Query: 126 IFLITDGAPTDE-WQAAANKVFRGEEDKRFAFFSIGV 161 I L++DGAP D W+ + K+ S+G+ Sbjct: 114 ILLLSDGAPNDNGWENEMYRFINDGRSKKCMRMSLGI 150 >UniRef50_B4VGR3 Putative uncharacterized protein n=1 Tax=Streptomyces sp. Mg1 RepID=B4VGR3_9ACTO Length = 206 Score = 52.4 bits (124), Expect = 1e-05, Method: Compositional matrix adjust. Identities = 50/192 (26%), Positives = 85/192 (44%), Gaps = 28/192 (14%) Query: 28 VSGSMNGRPINELNAGLVTFRDELLADPLALKRVELGIVTFG-PVHVEQPFTSAANFFPP 86 +SGSM+G P+ +N L + +L DP + + +VTF P + A+ P Sbjct: 1 MSGSMSGGPMAAMNTALPAMQRAILDDPTTGEIARVSVVTFSDTAACVLPLSDMAHARMP 60 Query: 87 ILFAQGDTPMGAAITKALDMVEERK--REYRANGIS-------YYRPWIFLITDGA--PT 135 L QG T D E + RE +GI Y+RP +F ++DG + Sbjct: 61 TLSPQGGT----------DFAEGFRVGREALVDGIGALGRGARYHRPVVFFLSDGQHNSS 110 Query: 136 DEWQAAANKVFRGEEDKRFA-FFSIGVQGADMKTLAQISVRQPLPLQGLQ----FRELFS 190 W++ +++ R +EDK A S G A+ +AQ+S R + + +E+ Sbjct: 111 QSWKSGFDRL-RSKEDKYGAEVVSFGFGQANRDVIAQVSTRHAFFAEDMDPAVAVKEILH 169 Query: 191 WLSSSLRSVSRS 202 + S+++ S S Sbjct: 170 TVLMSIKTTSGS 181 >UniRef50_C7N2G1 Uncharacterized protein n=2 Tax=Slackia heliotrinireducens DSM 20476 RepID=C7N2G1_SLAHD Length = 272 Score = 51.2 bits (121), Expect = 2e-05, Method: Compositional matrix adjust. Identities = 52/173 (30%), Positives = 82/173 (47%), Gaps = 20/173 (11%) Query: 21 PCILLLDVSGSMN--GR--PINE-LNAGLVTFRDELLADPLA-LKRVELGI------VTF 68 P I ++D SGSMN GR +N +N L D +P A +K LG +T Sbjct: 25 PIIYVIDTSGSMNFYGRISAVNRAMNETLDVLGDVAAKNPTADVKVAVLGFSTGAEWITT 84 Query: 69 GPVHVEQPFTSAANFFPPILFAQGDTPMGAAITKALDMVEERKREYR-ANGISYYRPWIF 127 PV + +F+ L A+G T +GAA+ +++ E+ R+ + + P I Sbjct: 85 DPVTGKPALMDLEDFYWDKLKARGSTDLGAAL---IELGEQLTRDAMLVSETGFKVPVII 141 Query: 128 LITDGAPTDEWQAAANKVFRGEEDKRFAF---FSIGVQGADMKTLAQISVRQP 177 ++DG PTD+W++A KV R A ++G AD + LA+I+ P Sbjct: 142 FMSDGGPTDDWESAFEKVCANNRWVRAATKIALAVG-DNADREVLARIADGNP 193 >UniRef50_C5PP99 von Willebrand factor, type A n=2 Tax=Bacteroidetes RepID=C5PP99_9SPHI Length = 256 Score = 51.2 bits (121), Expect = 2e-05, Method: Compositional matrix adjust. Identities = 38/121 (31%), Positives = 54/121 (44%), Gaps = 3/121 (2%) Query: 19 RCPCILLLDVSGSMNGRPINELNAGLVTFRDELLADPLALKRVELGIVTFG-PVHVEQPF 77 R P LLD SGSM+G PI LN L + L D A + + + ++TF V P Sbjct: 48 RLPVYFLLDTSGSMHGEPIQALNNALSGMINNLRTDAQAAETLWISMITFDREVKEIVPL 107 Query: 78 TSAANFFPPILFA--QGDTPMGAAITKALDMVEERKREYRANGISYYRPWIFLITDGAPT 135 T+ +F P + G T G A+ D + +RP +F+ TDG P+ Sbjct: 108 TALESFQLPEISCPESGPTFTGKALEILYDTATREVIKGSPEQKGDWRPLLFIFTDGKPS 167 Query: 136 D 136 D Sbjct: 168 D 168 >UniRef50_C9RJN5 von Willebrand factor type A n=1 Tax=Fibrobacter succinogenes subsp. succinogenes S85 RepID=C9RJN5_FIBSS Length = 256 Score = 50.4 bits (119), Expect = 5e-05, Method: Compositional matrix adjust. Identities = 49/164 (29%), Positives = 73/164 (44%), Gaps = 16/164 (9%) Query: 7 FATSD-FASNPEPR--CPCILLLDVSGSMNGRPINELNAGLVTFRDEL--LADPLALKRV 61 AT D FA P PR I ++D SGSM+G I LN + D++ ++ ++ Sbjct: 1 MATKDPFALEPIPRRVTHLIFMVDTSGSMSGSKIASLNTAVRDALDDVGDISKNCGDSQI 60 Query: 62 ELGIVTFGPV---HVEQPFTSAANFFPPILFAQGDTPMGAAITKALDMVEERKREYRANG 118 ++ ++ F EQP A F L A G T G+A + LD R + Sbjct: 61 KIAVLEFSSAVNWMYEQPL-EAEKFQWQDLSASGTTSFGSACAE-LDAKLSRSNGFMGEK 118 Query: 119 ISYYRPWIFLITDGAPTDEWQAAANKVFRGEEDKRFAFFSIGVQ 162 P I L++DGAPTD + V + E+ K +F GV+ Sbjct: 119 TGCRAPAIVLLSDGAPTDGY------VRKLEKLKGNRWFKAGVK 156 >UniRef50_B2UUD5 Phage/colicin/tellurite resistance cluster terY protein n=5 Tax=Helicobacter pylori RepID=B2UUD5_HELPS Length = 217 Score = 49.7 bits (117), Expect = 7e-05, Method: Compositional matrix adjust. Identities = 41/130 (31%), Positives = 59/130 (45%), Gaps = 14/130 (10%) Query: 21 PCILLLDVSGSMN-----GRPINELNAGLVTFRDELLADPLALKRVELGIVTFGP--VHV 73 P LLLD SGSMN I LN + + L + ++ I+TFG + Sbjct: 16 PVFLLLDTSGSMNESLGNCTRIEALNLCIQKMIETLKQEAKKELFSKMAIITFGENGAVL 75 Query: 74 EQPFTSAANF-FPPILFAQGDTPMGAAITKALDMVEERKREYRANGISYYRPWIFLITDG 132 PF N F P L A G TP+ A A D++E++ +Y+ + L++DG Sbjct: 76 HTPFDDVKNINFKP-LSASGGTPLDQAFRLAKDLIEDKD----TFPTKFYKLYSILVSDG 130 Query: 133 APTDE-WQAA 141 P D+ WQ A Sbjct: 131 EPNDDKWQKA 140 >UniRef50_C8N9L6 Tellurium resistance protein n=1 Tax=Cardiobacterium hominis ATCC 15826 RepID=C8N9L6_9GAMM Length = 149 Score = 49.7 bits (117), Expect = 8e-05, Method: Compositional matrix adjust. Identities = 41/148 (27%), Positives = 68/148 (45%), Gaps = 7/148 (4%) Query: 54 DPLALKRVELGIVTFGPVHVEQ--PFTSAANFFPPILFAQGDTPMGAAITKALDMVEERK 111 DP AL+ V L I+T+ H ++ P T + P + G + +GAA+ + V + Sbjct: 4 DPYALESVYLSIITYN-THAKEVLPPTPLMDVVVPAFTSGGASCLGAALECVVRAVRRDR 62 Query: 112 REYRANGISYYRPWIFLITDGAPTDEWQAAAN-KVFRGEEDKRFAFFSIGVQGADMKTLA 170 S Y+P +F+ TDG P+D + A + + E A G + AD+ L Sbjct: 63 IAANKAQRSDYKPILFIFTDGTPSDPFVYNATVPIIKALEFTNIAVCVAGAK-ADVAVLK 121 Query: 171 QIS--VRQPLPLQGLQFRELFSWLSSSL 196 Q++ V + Q ++ W+SSSL Sbjct: 122 QLTDLVISLADDEARQIKDCIRWVSSSL 149 >UniRef50_C2HF13 von Willebrand factor type A n=1 Tax=Finegoldia magna ATCC 53516 RepID=C2HF13_PEPMA Length = 249 Score = 48.5 bits (114), Expect = 2e-04, Method: Compositional matrix adjust. Identities = 50/183 (27%), Positives = 86/183 (46%), Gaps = 14/183 (7%) Query: 18 PRCPCIL--LLDVSGSMNGRPINELNAGLVTFRDEL--LADPLALKRVELGIVTFGP-VH 72 PR +L ++D SGSM G I E+N+ + EL +++ V++ I++F + Sbjct: 11 PRKMMVLFFVIDTSGSMKGTKIGEVNSAIEEILPELSDISNSNPDAEVKMAILSFNSEIQ 70 Query: 73 VEQPFTSAAN---FFPPILFAQGDTPMGAAITKALDMVEERKREYRANGISYYRPWIFLI 129 P T + + L A G T MGAA + + K + + S Y P IFL+ Sbjct: 71 WITPKTGPVDPGVYLWRDLNANGTTRMGAAFEELESKLHGDK--FMKSATSSYAPVIFLM 128 Query: 130 TDGAPT---DEWQAAANKVFRGEEDKRFAFFSIGV-QGADMKTLAQISVRQPLPLQGLQF 185 +DG PT +++Q+ NK+ + K ++G+ Q AD+ L + + LQ Sbjct: 129 SDGMPTETEEQFQSGLNKLKANKWFKSGIKVALGIGQDADLDVLEAFTGTKEAVLQTNNV 188 Query: 186 REL 188 ++L Sbjct: 189 KKL 191 >UniRef50_C0F0K9 Putative uncharacterized protein n=2 Tax=Eubacterium hallii DSM 3353 RepID=C0F0K9_9FIRM Length = 291 Score = 48.5 bits (114), Expect = 2e-04, Method: Compositional matrix adjust. Identities = 53/201 (26%), Positives = 86/201 (42%), Gaps = 22/201 (10%) Query: 8 ATSDFASNPEPRCPCI---LLLDVSGSMNGRPINELNAGLVTFRDELLADPLALKRVELG 64 A DF EP + ++DVSGSM G I LN+ + L+ A V++ Sbjct: 37 AEEDFLDTMEPAKKSMTIFFMIDVSGSMKGTKIGSLNSTMEELLPSLIGVGEASTDVKIA 96 Query: 65 IVTFGPVHVE----QPFTSAANFFPPILFAQGDTPMGAAITKALDMVEERKREYRANGIS 120 I+ F VE +P + L A G T MG A +E K+ R+ +S Sbjct: 97 IMKFS-TDVEWVTPEPVKIEEYQYWNRLEADGLTFMGDAF------MELSKKLSRSTFLS 149 Query: 121 Y----YRPWIFLITDGAPTDEWQAAANKVFRGEEDKRFAFFSIGV-QGADMKTLAQISVR 175 + P IFL++DG+P D+W+ + + + + + ++G+ +M L + Sbjct: 150 SPSLSFAPVIFLLSDGSPNDDWKKGLDTLKQNKWFQHGLKIALGIGSKVNMDVLRAFTGN 209 Query: 176 QPLPLQGL---QFRELFSWLS 193 L +Q Q REL L+ Sbjct: 210 DELAVQAKNADQLRELIKLLA 230 >UniRef50_Q4GZD0 Putative uncharacterized protein n=2 Tax=Trypanosoma brucei RepID=Q4GZD0_9TRYP Length = 4600 Score = 47.4 bits (111), Expect = 3e-04, Method: Compositional matrix adjust. Identities = 49/161 (30%), Positives = 74/161 (45%), Gaps = 22/161 (13%) Query: 14 SNPEPRCPCILL-LDVSGSMNGRPINELNAGLVTFRD-ELLADPLALKRV-ELGIVTFGP 70 + P R IL+ LD S SM NAG+++ R L+A+ L V ELGI FG Sbjct: 4338 TKPNKRSYQILVALDDSLSMQCN-----NAGIMSCRAVALIAEALQQLEVGELGIACFGK 4392 Query: 71 ----VH-VEQPFT--SAANFFPPILFAQGDTPMGAAITKALDMVEERKREYRANG----- 118 VH + +PF S F I FAQ T + + LD +++ +R R NG Sbjct: 4393 ETRIVHEMHEPFVAESGPRAFSEITFAQKSTNLKLLLETTLDYLDDARR--RMNGQIRSS 4450 Query: 119 ISYYRPWIFLITDGAPTDEWQAAANKVFRGEEDKRFAFFSI 159 + +F+I+DG T++ + R EE+ + F + Sbjct: 4451 TQRLQQMMFIISDGQITEDRMELRKLLMRAEENHQMVVFVL 4491 >UniRef50_C3XUD0 Putative uncharacterized protein n=1 Tax=Branchiostoma floridae RepID=C3XUD0_BRAFL Length = 443 Score = 46.6 bits (109), Expect = 5e-04, Method: Compositional matrix adjust. Identities = 39/123 (31%), Positives = 60/123 (48%), Gaps = 11/123 (8%) Query: 23 ILLLDVSGSMNGRPINELNAGLVTF--RDELLADPLALKRVELGIVTF-GPVHVEQPFTS 79 +L LD SGSMNGR + EL G+ F + A+ ++L R + +V F G + QP + Sbjct: 6 VLCLDTSGSMNGRGMAELKKGVRHFLLGVQETANKMSL-RENVAVVEFGGGARIIQPLS- 63 Query: 80 AANFFPPI-----LFAQGDTPMGAAITKALDMVEERKREYRANGISYYRPWIFLITDGAP 134 N+ + L A G TPM + +A+ + +R G P + L+TDG P Sbjct: 64 -GNYGTVMQSVDNLKAGGTTPMFEGLMEAMKEILQRGGVLTLPGGRKMTPRVILMTDGYP 122 Query: 135 TDE 137 D+ Sbjct: 123 DDK 125 >UniRef50_B0A9L7 Putative uncharacterized protein n=2 Tax=Clostridium bartlettii DSM 16795 RepID=B0A9L7_9CLOT Length = 273 Score = 46.2 bits (108), Expect = 8e-04, Method: Compositional matrix adjust. Identities = 38/116 (32%), Positives = 58/116 (50%), Gaps = 5/116 (4%) Query: 25 LLDVSGSMNGRPINELNAGLVTFRDELLADPLALKRVELGIVTFGPVH---VEQPFTSAA 81 L+D SGSM+G+ I LN + EL A ++L ++TF ++P + Sbjct: 39 LVDTSGSMSGKKIGTLNTTMEELLPELRGLGGATTDIKLAVMTFSSGCEWITKEPMSVDD 98 Query: 82 NFFPPILFAQGDTPMGAAITKALDMVEERKREYRANGISYYRPWIFLITDGAPTDE 137 + L A+G T +G A T+ + + RK A +S Y P IFL+TDG TD+ Sbjct: 99 YQYWTRLKAEGLTDLGEAFTELSNKL-SRKEFLNAPSLS-YAPVIFLLTDGYATDD 152 >UniRef50_UPI00016C09D7 hypothetical protein Epulo_01596 n=1 Tax=Epulopiscium sp. 'N.t. morphotype B' RepID=UPI00016C09D7 Length = 262 Score = 45.8 bits (107), Expect = 0.001, Method: Compositional matrix adjust. Identities = 59/218 (27%), Positives = 92/218 (42%), Gaps = 27/218 (12%) Query: 16 PEPRCPCILLLDVSGSMNGRPINELNAGL----VTFRDELLADPLALKRVELGIVTFGP- 70 P +LD SGSM G PI LN + V +D LA A ++++ ++ F Sbjct: 11 PRKELHVFYVLDTSGSMTGVPIAALNTAMEECTVALKD--LAKKNADAKLKIAVLEFSTG 68 Query: 71 ---VHVEQPFTSAANFFPPILFAQGDTPMGAAITKALDMVEERKREYRANGISYYRPWIF 127 V P + F L A G T +GAA+ + LD ++ + + + P I Sbjct: 69 AKWVTYNGPESLDDEFEWEHLSAGGVTDIGAAL-RELD-IKLSRNGFLKSMTGALMPVII 126 Query: 128 LITDGAPTDEWQAAANKVFRGE--EDKRFAFFSIGVQGADMKTLAQISVRQPLPLQGLQF 185 +TDG PTDE+ AA ++ + F+IG AD ++ I ++ Sbjct: 127 FMTDGYPTDEYAAALAELRKNRWYTSSTKIGFAIG-DDADAAIISSIVGNSEAVIKTSDL 185 Query: 186 RELF-----------SWLSSSLRSVSRSTPGTEVVLEA 212 ELF S L+SS R+ S G+ +V +A Sbjct: 186 -ELFKRLMKFVTVRASMLASSSRTTSSFVNGSAIVQDA 222 >UniRef50_D0N9W4 Putative uncharacterized protein n=2 Tax=Phytophthora infestans T30-4 RepID=D0N9W4_PHYIN Length = 2146 Score = 45.4 bits (106), Expect = 0.001, Method: Compositional matrix adjust. Identities = 47/184 (25%), Positives = 78/184 (42%), Gaps = 21/184 (11%) Query: 23 ILLLDVSGSMNGRPINELNAGLVTFRDELLADPLALKRVELGIVTFGP----VHVEQPFT 78 + +LD SGSMNG+P N+L A + +AD L V +VTF V+ + T Sbjct: 1905 VFVLDCSGSMNGQPWNDLMAAWKEYVYNRIADGATLDLV--SVVTFDNSAQIVYEARSIT 1962 Query: 79 SAANFFPPILFAQGDTPMGAAITKALDMVEERKREYRANGISYYRPWIFLITDGAPTDEW 138 + N I + G T A + A +++ R N ++P I +DG P D Sbjct: 1963 TVTN--ARIQYRGGGTNYAAGLRSANEVLS------RVN-FDMFKPAIVFFSDGHPCDPL 2013 Query: 139 QAA--ANKVFRGEEDKRFAFFSIGVQGADMKTLAQISVR----QPLPLQGLQFRELFSWL 192 Q A + E F++G ++ L +++ + L G + + F + Sbjct: 2014 QGEELATHIRGCYERNGLQAFAVGFGSINLNMLERVAEKLGGTYHHVLTGNELKATFFSI 2073 Query: 193 SSSL 196 S+SL Sbjct: 2074 SASL 2077 >UniRef50_UPI0001C37785 von Willebrand factor type A n=1 Tax=Ruminococcus flavefaciens FD-1 RepID=UPI0001C37785 Length = 285 Score = 45.1 bits (105), Expect = 0.002, Method: Compositional matrix adjust. Identities = 39/127 (30%), Positives = 60/127 (47%), Gaps = 18/127 (14%) Query: 22 CILLLDVSGSMNGRPINELNAGLVTFRDELLADPLALKRVELGIVTFG---------PVH 72 L+D SGSM G+ + ELN + E+ A V++ ++TF P+ Sbjct: 53 IFFLIDTSGSMKGKKMGELNTVMEELIPEIRRVGEADTEVKVAVLTFSTDVRWMYSTPIP 112 Query: 73 VEQPFTSAANFFPPILFAQGDTPMGAAITKALDMVEERKREYRANGISYYRPWIFLITDG 132 +E +F L A G T MGAA K L + R + +S + P IFL+TDG Sbjct: 113 IE-------DFEWARLRANGVTSMGAAF-KELSLRMSRNSFLNSPSLS-FAPVIFLMTDG 163 Query: 133 APTDEWQ 139 P+D+++ Sbjct: 164 YPSDDYR 170 >UniRef50_A7RKA1 Predicted protein n=2 Tax=Nematostella vectensis RepID=A7RKA1_NEMVE Length = 1128 Score = 45.1 bits (105), Expect = 0.002, Method: Compositional matrix adjust. Identities = 57/208 (27%), Positives = 87/208 (41%), Gaps = 26/208 (12%) Query: 13 ASNPEPRCPCILLLDVSGSMNGRPINELNAGLVTFRDELLA-DPLALKRVELGI----VT 67 A++P+P+ IL++D SGSM G + T D L D +A E G+ VT Sbjct: 205 AASPQPK-DVILVVDYSGSMGGSRLPIAKEAAKTVLDTLNPRDRVAFLAFESGVRRVKVT 263 Query: 68 FGPVHVEQPFTSAANFFPPI-----------LFAQGDTPMGAAITKALDMVEERKREYRA 116 G E+ F S+ P+ +A G T A A D++++ +E Sbjct: 264 SGDAKDEKCFESSLAKASPVNIDILKKFLDGEYASGGTMYAIAFNAAFDILDKYYKEKNT 323 Query: 117 NGISYYRPWIFLITDGAPTDEWQAAANKVFRGEE--DKRFAFFSIGVQGADMKTLAQISV 174 RP I +TDGAP D+ N V + + + G+ G + A + + Sbjct: 324 T----RRPVILFMTDGAPNDDPGTILNTVKTRNQGLSTKADILTFGMGGG--ISPAGVDL 377 Query: 175 RQPLPLQGLQFRELFSW-LSSSLRSVSR 201 Q L Q L F L+++LR VSR Sbjct: 378 LQSLAEQTLDGGARFEVSLTTALRDVSR 405 >UniRef50_A8L6A2 von Willebrand factor type A n=1 Tax=Frankia sp. EAN1pec RepID=A8L6A2_FRASN Length = 238 Score = 42.7 bits (99), Expect = 0.009, Method: Compositional matrix adjust. Identities = 34/118 (28%), Positives = 50/118 (42%), Gaps = 2/118 (1%) Query: 24 LLLDVSGSMNGRPINELNAGLVTFRDELLADPLALKRVELGIVTFGP-VHVEQPFTSAAN 82 +L+D S SM+G P+ +N L + P V LG + F V N Sbjct: 19 ILVDASYSMSGAPMLAVNEILPEVISTIEQSPTLGDVVRLGALDFADDARVVLRLDDLRN 78 Query: 83 FFP-PILFAQGDTPMGAAITKALDMVEERKREYRANGISYYRPWIFLITDGAPTDEWQ 139 P A+G T A + +E + + +G YRP +F ITDG PTD+ + Sbjct: 79 IGGVPQFAARGGTSYAAGFRQLRKEIESDLAQLKGDGYKVYRPAVFFITDGEPTDDQK 136 >UniRef50_C5VFZ9 von Willebrand factor type A n=2 Tax=Corynebacterium matruchotii RepID=C5VFZ9_9CORY Length = 236 Score = 42.4 bits (98), Expect = 0.012, Method: Compositional matrix adjust. Identities = 57/224 (25%), Positives = 93/224 (41%), Gaps = 33/224 (14%) Query: 21 PCILLLDVSGSM-NGRP--------INELNAGLVTFRDELLADPLALKRVELGIVTF-GP 70 P L+DVS SM +P N+L G+V ++ + +R+ LG++ F Sbjct: 10 PVFFLIDVSYSMLEEKPGGGTLLDAANQLVPGIVEACEKY---SVLDQRLRLGLIEFCDE 66 Query: 71 VHVEQPFTSAANFFP--PILFAQGDTPMGAAITKALDMVEERKREYRANGISYYRPWIFL 128 V P + F P L A+G T AA + + R I +RP +F Sbjct: 67 ARVVIPLSEIDAFSENIPQLVAKGGTNFAAAFWAVFNEMGVAVESLRKPEIGIHRPTVFF 126 Query: 129 ITDGAPTDE-------WQAAANKVFRGEEDKRFAFFSIGVQGADMKTLAQISVRQPL--- 178 ITDG + W A +++ FR R FF+ GV A+++ + + Sbjct: 127 ITDGEDIGDVEERARAWAALSDEGFR----YRPNFFTFGVGNANLEGIRAFKLGSGFAAA 182 Query: 179 ---PLQGLQ-FRELFSWLSSSLRSVSRSTPGTEVVLEAPKGWTS 218 P + +Q +E+ + L SS+ S S T ++ PK + S Sbjct: 183 TKDPTRAVQRLQEILNTLVSSIVSSSAGDNPTGKIIVDPKTFNS 226 >UniRef50_C1A2B7 Putative uncharacterized protein n=1 Tax=Rhodococcus erythropolis PR4 RepID=C1A2B7_RHOE4 Length = 233 Score = 40.8 bits (94), Expect = 0.037, Method: Compositional matrix adjust. Identities = 36/138 (26%), Positives = 63/138 (45%), Gaps = 18/138 (13%) Query: 21 PCILLLDVSGSMNGRPINELNAGLVTFRDELLADPLALKRVELGIVTFGP-VHVEQPFTS 79 P L+ DVS SM I E+N + ++E+L DP+ + +++F ++ P Sbjct: 8 PFYLVFDVSYSME-PVIGEVNNAMRALKNEILKDPILGDIARVCVLSFSDEARIDVPMCD 66 Query: 80 AAN----FFPPILFAQGDTPMGAAITKALDMVEERKR----EYRANGI-SYYRPWIFLIT 130 A+ L +G G + D++ ER + + +G +RP +F +T Sbjct: 67 LADDTRITREDFLQVRG----GTSFAPIFDLIGERIAADIADLKGHGEGKVFRPTVFFVT 122 Query: 131 DGAPTD---EWQAAANKV 145 DG PTD EW +A ++ Sbjct: 123 DGVPTDAVHEWNSAFTRL 140 >UniRef50_C7DHI4 von Willebrand factor type A n=1 Tax=Candidatus Micrarchaeum acidiphilum ARMAN-2 RepID=C7DHI4_9EURY Length = 705 Score = 40.4 bits (93), Expect = 0.044, Method: Composition-based stats. Identities = 36/113 (31%), Positives = 50/113 (44%), Gaps = 13/113 (11%) Query: 24 LLLDVSGSMNGRPINELNAGLVTFRDELLADPLALKRVELGIVTFGPVHVEQPFTSAANF 83 +LLD+SGSM G+ IN L + D L K V L + F F Sbjct: 530 MLLDISGSMGGQKINAAKRILGSIHDSLDGS----KYVHLRMFGFYGSDGTHVFEFDRKM 585 Query: 84 FPPILFAQGDTPMGAAITKALDMVEERKREYRANGISYYRPWIFLITDGAPTD 136 L A GDTP AI A+D++++ K S + +F+ITDG P + Sbjct: 586 LMN-LAAMGDTPTDIAIYYAMDLMKKDK--------SNFDKTLFIITDGDPNN 629 >UniRef50_B9GVZ4 Predicted protein n=4 Tax=rosids RepID=B9GVZ4_POPTR Length = 757 Score = 40.4 bits (93), Expect = 0.045, Method: Compositional matrix adjust. Identities = 35/119 (29%), Positives = 52/119 (43%), Gaps = 13/119 (10%) Query: 23 ILLLDVSGSMNGRPINELNAGLVTFRDELL-ADPLALKRVELGIVTFGPVH---VEQPFT 78 I ++D+SGSM G P GL++ +L D + ++ F V E+ Sbjct: 334 IFIIDISGSMKGGPFESAKNGLLSSLQKLNPEDSFNIIAFKMDTYLFSSVMEQATEEAII 393 Query: 79 SAANFFPPILFAQGDTPMGAAITKALDMVEERKREYRANGISYYRPWIFLITDGAPTDE 137 A + L A G T + + +A+ ++ E N I P IFLITDGA DE Sbjct: 394 EATRWLNDKLTADGGTNILGPLKQAIKLLAE-----TTNSI----PVIFLITDGAVEDE 443 Searching..................................................done Results from round 2 Score E Sequences producing significant alignments: (bits) Value Sequences used in model and found again: UniRef50_P76396 Uncharacterized protein yegL n=64 Tax=cellular o... 335 5e-91 UniRef50_UPI00016C400A von Willebrand factor, type A n=1 Tax=Gem... 281 1e-74 UniRef50_Q3M2E0 von Willebrand factor, type A n=2 Tax=Cyanobacte... 279 3e-74 UniRef50_Q114A2 von Willebrand factor, type A n=3 Tax=Cyanobacte... 278 8e-74 UniRef50_B8HUC4 von Willebrand factor type A n=7 Tax=Bacteria Re... 276 4e-73 UniRef50_Q8YTA2 Alr2822 protein n=11 Tax=Bacteria RepID=Q8YTA2_A... 269 4e-71 UniRef50_B5W3I3 von Willebrand factor type A n=4 Tax=Cyanobacter... 269 5e-71 UniRef50_B9XNU9 von Willebrand factor type A n=1 Tax=bacterium E... 264 1e-69 UniRef50_C0AYK3 Putative uncharacterized protein n=1 Tax=Proteus... 255 6e-67 UniRef50_C9RRF6 von Willebrand factor type A n=3 Tax=Bacteria Re... 246 4e-64 UniRef50_Q2SNJ1 Uncharacterized protein encoded in toxicity prot... 242 7e-63 UniRef50_Q87W17 von Willebrand factor type A domain protein n=2 ... 240 3e-62 UniRef50_Q2FNC6 von Willebrand factor, type A n=1 Tax=Methanospi... 239 4e-62 UniRef50_C9RJ63 von Willebrand factor type A n=1 Tax=Fibrobacter... 235 8e-61 UniRef50_A9AV55 von Willebrand factor type A n=2 Tax=Bacteria Re... 231 1e-59 UniRef50_C1TQB2 Uncharacterized protein n=1 Tax=Dethiosulfovibri... 228 1e-58 UniRef50_D1PS09 von Willebrand factor type A domain protein n=1 ... 222 8e-57 UniRef50_C8PVC3 von Willebrand factor type A domain protein n=1 ... 221 1e-56 UniRef50_C6Z299 Tellurium resistance protein n=1 Tax=Bacteroides... 220 2e-56 UniRef50_A3PUP3 von Willebrand factor, type A n=1 Tax=Mycobacter... 213 3e-54 UniRef50_D1TTW6 von Willebrand factor type A domain protein n=10... 210 3e-53 UniRef50_UPI000185CA78 von Willebrand factor type A n=1 Tax=Capn... 209 5e-53 UniRef50_D0W6S8 Glycosyl transferase, group 2 family n=1 Tax=Nei... 209 5e-53 UniRef50_C8W3F9 von Willebrand factor type A n=1 Tax=Desulfotoma... 207 2e-52 UniRef50_Q19NM5 TerY1 n=54 Tax=root RepID=Q19NM5_ECOK1 203 4e-51 UniRef50_Q5NWS3 Tellurium resistance protein n=2 Tax=Proteobacte... 196 5e-49 UniRef50_A1VWQ4 von Willebrand factor, type A n=20 Tax=Proteobac... 196 6e-49 UniRef50_B7AFM8 Putative uncharacterized protein n=1 Tax=Bactero... 193 2e-48 UniRef50_C7PNX3 von Willebrand factor type A n=2 Tax=Sphingobact... 192 5e-48 UniRef50_C0DBA5 Putative uncharacterized protein n=1 Tax=Clostri... 191 1e-47 UniRef50_A9BLP5 von Willebrand factor type A n=3 Tax=Burkholderi... 182 8e-45 UniRef50_C9ZGR0 Putative uncharacterized protein n=1 Tax=Strepto... 180 2e-44 UniRef50_Q0I303 Putative uncharacterized protein n=5 Tax=Pasteur... 177 3e-43 UniRef50_C9LWT6 Tellurium resistance protein n=1 Tax=Selenomonas... 172 7e-42 UniRef50_Q8DK92 Tlr0974 protein n=1 Tax=Thermosynechococcus elon... 171 2e-41 UniRef50_B6BJ58 Phage/colicin/tellurite resistance cluster TerY ... 171 2e-41 UniRef50_Q5NWS4 Tellurium resistance protein n=8 Tax=Bacteria Re... 168 9e-41 UniRef50_C5PP99 von Willebrand factor, type A n=2 Tax=Bacteroide... 168 2e-40 UniRef50_B2K3B2 von Willebrand factor type A n=39 Tax=Gammaprote... 163 4e-39 UniRef50_UPI0001AED79F von Willebrand factor type A n=1 Tax=Stre... 162 6e-39 UniRef50_UPI0001745BB0 TerY3 n=1 Tax=Verrucomicrobium spinosum D... 161 2e-38 UniRef50_UPI00018742C1 von Willebrand factor type A n=1 Tax=Cory... 160 5e-38 UniRef50_D1AAR7 von Willebrand factor type A n=1 Tax=Thermomonos... 158 9e-38 UniRef50_A1VV60 von Willebrand factor, type A n=4 Tax=Proteobact... 156 5e-37 UniRef50_UPI0001B52F00 von Willebrand factor type A n=1 Tax=Fuso... 155 1e-36 UniRef50_UPI00016C09D7 hypothetical protein Epulo_01596 n=1 Tax=... 153 4e-36 UniRef50_A6G3C6 von Willebrand factor, type A n=1 Tax=Plesiocyst... 151 1e-35 UniRef50_B4VGR3 Putative uncharacterized protein n=1 Tax=Strepto... 149 7e-35 UniRef50_C0F0K9 Putative uncharacterized protein n=2 Tax=Eubacte... 146 6e-34 UniRef50_C2HF13 von Willebrand factor type A n=1 Tax=Finegoldia ... 144 2e-33 UniRef50_Q0RFF8 Putative uncharacterized protein n=1 Tax=Frankia... 140 3e-32 UniRef50_C9RJN5 von Willebrand factor type A n=1 Tax=Fibrobacter... 140 4e-32 UniRef50_UPI0001C37785 von Willebrand factor type A n=1 Tax=Rumi... 140 5e-32 UniRef50_B0A9L7 Putative uncharacterized protein n=2 Tax=Clostri... 136 4e-31 UniRef50_A7BNL3 Tellurium resistance protein n=1 Tax=Beggiatoa s... 135 8e-31 UniRef50_C9RK46 von Willebrand factor type A n=1 Tax=Fibrobacter... 134 2e-30 UniRef50_B2UUD5 Phage/colicin/tellurite resistance cluster terY ... 133 4e-30 UniRef50_A1THU3 von Willebrand factor, type A n=1 Tax=Mycobacter... 131 1e-29 UniRef50_C7N2G1 Uncharacterized protein n=2 Tax=Slackia heliotri... 126 3e-28 UniRef50_D0N9W4 Putative uncharacterized protein n=2 Tax=Phytoph... 124 2e-27 UniRef50_C8N9L6 Tellurium resistance protein n=1 Tax=Cardiobacte... 120 3e-26 UniRef50_D2AQW9 Uncharacterized protein encoded in toxicity prot... 119 9e-26 UniRef50_A7RKA1 Predicted protein n=2 Tax=Nematostella vectensis... 106 6e-22 UniRef50_C3XUD0 Putative uncharacterized protein n=1 Tax=Branchi... 103 3e-21 UniRef50_A7C6I0 Protein containing Von Willebrand factor, type A... 95 2e-18 UniRef50_A1SV07 Putative uncharacterized protein n=1 Tax=Psychro... 92 1e-17 UniRef50_Q4GZD0 Putative uncharacterized protein n=2 Tax=Trypano... 89 1e-16 Sequences not found previously or not previously below threshold: UniRef50_UPI0001B4AB93 von Willebrand factor type A n=1 Tax=Bact... 124 3e-27 UniRef50_B7AIG3 Putative uncharacterized protein n=2 Tax=Bactero... 121 1e-26 UniRef50_A8SDD9 Putative uncharacterized protein n=2 Tax=Ruminoc... 119 6e-26 UniRef50_C1A2B7 Putative uncharacterized protein n=1 Tax=Rhodoco... 115 1e-24 UniRef50_UPI0001B4E5FD von Willebrand factor type A n=1 Tax=Stre... 114 2e-24 UniRef50_A8L6A2 von Willebrand factor type A n=1 Tax=Frankia sp.... 114 3e-24 UniRef50_B8HT03 von Willebrand factor type A n=2 Tax=Cyanothece ... 108 1e-22 UniRef50_C5VFZ9 von Willebrand factor type A n=2 Tax=Corynebacte... 106 8e-22 UniRef50_A3YVK5 Tellurium resistance protein n=3 Tax=Cyanobacter... 103 6e-21 UniRef50_UPI0001B4AD96 von Willebrand factor type A n=1 Tax=Bact... 100 7e-20 UniRef50_B5W7H4 von Willebrand factor type A n=1 Tax=Arthrospira... 93 5e-18 UniRef50_UPI0000D560E4 PREDICTED: similar to inter-alpha (globul... 91 3e-17 UniRef50_A7I7X6 Putative uncharacterized protein n=1 Tax=Candida... 88 1e-16 UniRef50_UPI0001C38CFE von Willebrand factor, type A n=1 Tax=Art... 86 5e-16 UniRef50_A5N5I8 DnaK4 n=2 Tax=Clostridium kluyveri RepID=A5N5I8_... 83 5e-15 UniRef50_A7C1J8 von Willebrand factor, type A n=1 Tax=Beggiatoa ... 83 6e-15 UniRef50_C5CDX4 von Willebrand factor type A n=1 Tax=Kosmotoga o... 83 8e-15 UniRef50_A4YGU7 von Willebrand factor, type A n=12 Tax=Sulfoloba... 81 2e-14 UniRef50_Q10Z89 von Willebrand factor, type A n=1 Tax=Trichodesm... 80 4e-14 UniRef50_UPI000180CCF8 PREDICTED: similar to PK-120 n=1 Tax=Cion... 80 4e-14 UniRef50_UPI0001C161B1 von Willebrand factor, type A Precursor n... 80 6e-14 UniRef50_C3ZEP6 Putative uncharacterized protein n=2 Tax=Branchi... 78 1e-13 UniRef50_C0GKG1 von Willebrand factor type A n=1 Tax=Dethiobacte... 78 2e-13 UniRef50_Q3M1S2 von Willebrand factor, type A n=1 Tax=Anabaena v... 78 2e-13 UniRef50_UPI000155CC23 PREDICTED: similar to ITI-like protein n=... 78 2e-13 UniRef50_UPI000180D2FB PREDICTED: similar to inter-alpha (globul... 78 2e-13 UniRef50_C0HF51 Putative uncharacterized protein n=1 Tax=Zea may... 76 6e-13 UniRef50_Q86UX2 Inter-alpha-trypsin inhibitor heavy chain H5 n=4... 76 6e-13 UniRef50_UPI00016E8A41 UPI00016E8A41 related cluster n=3 Tax=Tak... 76 7e-13 UniRef50_C3XUD1 Putative uncharacterized protein n=1 Tax=Branchi... 76 7e-13 UniRef50_UPI00017B0D26 UPI00017B0D26 related cluster n=2 Tax=Tet... 76 1e-12 UniRef50_C0PS55 Putative uncharacterized protein n=1 Tax=Picea s... 75 1e-12 UniRef50_Q82LZ6 Putative uncharacterized protein n=1 Tax=Strepto... 75 2e-12 UniRef50_Q4SBF6 Chromosome 11 SCAF14674, whole genome shotgun se... 75 2e-12 UniRef50_B9XQ17 Vault protein inter-alpha-trypsin domain protein... 75 2e-12 UniRef50_UPI00006CDDCC von Willebrand factor type A domain conta... 74 3e-12 UniRef50_UPI0001C378BC von Willebrand factor, type A n=1 Tax=Rum... 74 3e-12 UniRef50_A7BVG3 von Willebrand factor type A domain protein n=1 ... 73 6e-12 UniRef50_A9GUK8 Putative uncharacterized protein yfbK n=1 Tax=So... 73 7e-12 UniRef50_UPI00016DFBC7 UPI00016DFBC7 related cluster n=4 Tax=Tak... 73 7e-12 UniRef50_B7KCF7 von Willebrand factor type A n=1 Tax=Cyanothece ... 73 7e-12 UniRef50_Q22SJ4 von Willebrand factor type A domain containing p... 73 9e-12 UniRef50_C5WYU9 Putative uncharacterized protein Sb01g047460 n=3... 72 1e-11 UniRef50_B0CZQ4 Predicted protein n=1 Tax=Laccaria bicolor S238N... 72 1e-11 UniRef50_P19823 Inter-alpha-trypsin inhibitor heavy chain H2 n=4... 72 1e-11 UniRef50_Q6UXX5 Inter-alpha-trypsin inhibitor heavy chain H5-lik... 72 1e-11 UniRef50_Q231J4 von Willebrand factor type A domain containing p... 71 2e-11 UniRef50_UPI00016E1D1D UPI00016E1D1D related cluster n=9 Tax=Tet... 71 2e-11 UniRef50_D0LTR8 von Willebrand factor type A n=1 Tax=Haliangium ... 71 2e-11 UniRef50_B2UZB2 von Willebrand factor type A domain protein n=1 ... 71 2e-11 UniRef50_Q24FW2 von Willebrand factor type A domain containing p... 71 2e-11 UniRef50_C7N1C6 Uncharacterized protein n=1 Tax=Slackia heliotri... 71 2e-11 UniRef50_C3YC17 Putative uncharacterized protein n=1 Tax=Branchi... 71 2e-11 UniRef50_B2A702 von Willebrand factor type A n=1 Tax=Natranaerob... 71 3e-11 UniRef50_UPI00016E1D58 UPI00016E1D58 related cluster n=1 Tax=Tak... 71 3e-11 UniRef50_D2R2I7 von Willebrand factor type A n=2 Tax=Planctomyce... 71 3e-11 UniRef50_Q22ST4 von Willebrand factor type A domain containing p... 71 3e-11 UniRef50_B1KDL1 LPXTG-motif cell wall anchor domain protein n=1 ... 70 4e-11 UniRef50_Q503P4 Zgc:110377 n=9 Tax=Clupeocephala RepID=Q503P4_DANRE 70 4e-11 UniRef50_B3RZ89 Putative uncharacterized protein n=1 Tax=Trichop... 70 5e-11 UniRef50_O28828 Putative uncharacterized protein n=1 Tax=Archaeo... 70 6e-11 UniRef50_Q67LM1 Magnesium chelatase n=1 Tax=Symbiobacterium ther... 70 8e-11 UniRef50_A2E0T6 von Willebrand factor type A domain containing p... 69 8e-11 UniRef50_A0E9B3 Chromosome undetermined scaffold_84, whole genom... 69 9e-11 UniRef50_Q1AYC2 Protoporphyrin IX magnesium-chelatase n=15 Tax=B... 69 1e-10 UniRef50_Q8H924 Os10g0464900 protein n=3 Tax=Oryza sativa RepID=... 69 1e-10 UniRef50_A8SU73 Putative uncharacterized protein n=1 Tax=Coproco... 69 1e-10 UniRef50_Q55874 Uncharacterized protein sll0103 n=16 Tax=Cyanoba... 69 1e-10 UniRef50_B9GK57 Predicted protein (Fragment) n=2 Tax=Populus tri... 69 1e-10 UniRef50_Q5UWJ9 Calcium-binding protein-like n=1 Tax=Haloarcula ... 69 1e-10 UniRef50_UPI000180BC4A PREDICTED: similar to predicted protein n... 69 1e-10 UniRef50_UPI000180C2AF PREDICTED: similar to FiBrilliN homolog f... 69 1e-10 UniRef50_P79263 Inter-alpha-trypsin inhibitor heavy chain H4 n=3... 68 1e-10 UniRef50_UPI00006A1915 Transmembrane protein 110. n=2 Tax=Xenopu... 68 2e-10 UniRef50_A0C9G5 Chromosome undetermined scaffold_16, whole genom... 68 2e-10 UniRef50_B0CZQ2 Predicted protein n=2 Tax=Laccaria bicolor S238N... 68 2e-10 UniRef50_Q498Q0 Inter-alpha (Globulin) inhibitor H3 n=6 Tax=Clup... 68 2e-10 UniRef50_UPI00017B4DF5 UPI00017B4DF5 related cluster n=3 Tax=Tet... 68 2e-10 UniRef50_Q15NW6 Vault protein inter-alpha-trypsin n=1 Tax=Pseudo... 68 2e-10 UniRef50_UPI0000E460BF PREDICTED: similar to inter-alpha-trypsin... 68 2e-10 UniRef50_A9V370 Predicted protein n=1 Tax=Monosiga brevicollis R... 68 2e-10 UniRef50_UPI00005843FB PREDICTED: hypothetical protein n=1 Tax=S... 68 2e-10 UniRef50_C5Z1W1 Putative uncharacterized protein Sb10g030210 n=1... 68 2e-10 UniRef50_Q8YP40 Alr4360 protein n=15 Tax=Cyanobacteria RepID=Q8Y... 68 2e-10 UniRef50_Q6PGW2 Zgc:112265 protein (Fragment) n=15 Tax=Clupeocep... 68 2e-10 UniRef50_A9U149 Predicted protein n=1 Tax=Physcomitrella patens ... 68 2e-10 UniRef50_Q14624 35 kDa inter-alpha-trypsin inhibitor heavy chain... 68 3e-10 UniRef50_A0D1M1 Chromosome undetermined scaffold_34, whole genom... 67 3e-10 UniRef50_Q5RHF3 Novel protein similar to vertebrate inter-alpha ... 67 3e-10 UniRef50_UPI00016C377F protein containing a von Willebrand facto... 67 3e-10 UniRef50_A1ZQA6 Putative uncharacterized protein n=3 Tax=cellula... 67 3e-10 UniRef50_B6TZ81 Protein binding protein n=9 Tax=Magnoliophyta Re... 67 4e-10 UniRef50_A8U1E2 Putative uncharacterized protein n=1 Tax=alpha p... 67 4e-10 UniRef50_UPI00006A1B4A Collagen alpha-3(VI) chain precursor. n=5... 67 4e-10 UniRef50_UPI00006CDA4D Glutathionylspermidine synthase family pr... 67 5e-10 UniRef50_A5UTA6 von Willebrand factor, type A n=6 Tax=Chloroflex... 67 5e-10 UniRef50_A6H584 Collagen alpha-5(VI) chain n=2 Tax=Mus musculus ... 67 5e-10 UniRef50_Q9M1S2 Putative uncharacterized protein T5N23_140 n=5 T... 67 5e-10 UniRef50_A0DIJ2 Chromosome undetermined scaffold_52, whole genom... 67 5e-10 UniRef50_UPI0001C1630F hypothetical protein CRD_00534 n=2 Tax=No... 67 5e-10 UniRef50_A4C730 Putative uncharacterized protein n=1 Tax=Pseudoa... 67 5e-10 UniRef50_Q235T9 von Willebrand factor type A domain containing p... 67 5e-10 UniRef50_A4YGI9 von Willebrand factor, type A n=1 Tax=Metallosph... 66 6e-10 UniRef50_A2RNJ3 Putative uncharacterized protein n=2 Tax=Lactoco... 66 6e-10 UniRef50_Q0A603 von Willebrand factor, type A n=1 Tax=Alkalilimn... 66 6e-10 UniRef50_B4VXM6 Vault protein inter-alpha-trypsin n=1 Tax=Microc... 66 6e-10 UniRef50_B8AXM1 Putative uncharacterized protein n=1 Tax=Oryza s... 66 7e-10 UniRef50_Q0AZS1 Mg-chelatase subunit ChlD-like protein n=1 Tax=S... 66 8e-10 UniRef50_Q22UB9 von Willebrand factor type A domain containing p... 66 8e-10 UniRef50_Q9LN03 T6D22.13 n=2 Tax=Arabidopsis thaliana RepID=Q9LN... 66 8e-10 UniRef50_Q1D9B7 von Willebrand factor type A domain protein n=3 ... 66 8e-10 UniRef50_C9LLI0 Magnesium-chelatase, subunit D/I family n=1 Tax=... 66 8e-10 UniRef50_A2AX52 Collagen alpha-4(VI) chain n=12 Tax=Chordata Rep... 66 8e-10 UniRef50_B4BQC0 von Willebrand factor type A n=2 Tax=Geobacillus... 66 9e-10 UniRef50_UPI000174639B hypothetical protein VspiD_18615 n=1 Tax=... 66 9e-10 UniRef50_UPI0000E4A663 PREDICTED: similar to calcium activated c... 66 9e-10 UniRef50_Q23JA0 von Willebrand factor type A domain containing p... 66 9e-10 UniRef50_A6WMD3 Vault protein inter-alpha-trypsin domain protein... 66 1e-09 UniRef50_Q8PU63 Putative chloride channel n=1 Tax=Methanosarcina... 66 1e-09 UniRef50_UPI0000F1FEC5 PREDICTED: similar to Clca1 protein n=2 T... 66 1e-09 UniRef50_C1XTM1 Uncharacterized protein containing a von Willebr... 66 1e-09 UniRef50_C3ZCZ5 Putative uncharacterized protein (Fragment) n=1 ... 66 1e-09 UniRef50_C9RU69 von Willebrand factor type A n=2 Tax=Geobacillus... 65 1e-09 UniRef50_Q9U7P4 Putative uncharacterized protein (Fragment) n=1 ... 65 1e-09 UniRef50_C7P2A9 von Willebrand factor type A n=2 Tax=Halobacteri... 65 1e-09 UniRef50_UPI0000EB12CB UPI0000EB12CB related cluster n=1 Tax=Can... 65 1e-09 UniRef50_B3RP11 Putative uncharacterized protein n=1 Tax=Trichop... 65 2e-09 UniRef50_Q2W311 Putative uncharacterized protein n=1 Tax=Magneto... 65 2e-09 UniRef50_D1VKI5 von Willebrand factor type A n=1 Tax=Frankia sp.... 65 2e-09 UniRef50_UPI0001AEB92D inter-alpha-trypsin inhibitor domain-cont... 65 2e-09 UniRef50_B3RZT6 Putative uncharacterized protein n=2 Tax=Trichop... 65 2e-09 UniRef50_A0DZ93 Chromosome undetermined scaffold_7, whole genome... 65 2e-09 UniRef50_C8NJ92 Secreted Mg-chelatase subunit n=3 Tax=Corynebact... 65 2e-09 UniRef50_D2VKS7 von Willebrand factor type A domain-containing p... 65 2e-09 UniRef50_UPI0001C34E55 hypothetical protein ClM62_13922 n=1 Tax=... 65 2e-09 UniRef50_UPI000155D2F0 PREDICTED: similar to matrilin-3, partial... 65 2e-09 UniRef50_B8HNU4 von Willebrand factor type A n=4 Tax=Chroococcal... 65 2e-09 UniRef50_UPI0000E488A7 PREDICTED: similar to Clca1 protein n=5 T... 65 2e-09 UniRef50_Q67LZ3 Putative uncharacterized protein n=1 Tax=Symbiob... 65 2e-09 UniRef50_C8N8N5 von Willebrand factor type A domain protein n=2 ... 65 2e-09 UniRef50_A6DLI0 Putative uncharacterized protein n=1 Tax=Lentisp... 65 2e-09 UniRef50_UPI0000F2DDBB PREDICTED: similar to Inter-alpha (globul... 64 3e-09 UniRef50_C8XJ05 Magnesium chelatase n=20 Tax=cellular organisms ... 64 3e-09 UniRef50_Q10RY0 Os03g0142500 protein n=2 Tax=Oryza sativa RepID=... 64 3e-09 UniRef50_D2RGD5 von Willebrand factor type A n=1 Tax=Archaeoglob... 64 3e-09 UniRef50_Q2BCF0 Possible D-amino acid dehydrogenase, large subun... 64 3e-09 UniRef50_Q562D1 LOC594926 protein (Fragment) n=18 Tax=Euteleosto... 64 3e-09 UniRef50_A0P2E4 Von Willebrand factor type A like domain n=1 Tax... 64 3e-09 UniRef50_UPI0000E80A5E PREDICTED: similar to calcium-activated c... 64 4e-09 UniRef50_UPI0001C39385 von Willebrand factor type A n=1 Tax=Arth... 64 4e-09 UniRef50_UPI0001760236 PREDICTED: similar to mCG140660 n=1 Tax=D... 64 4e-09 UniRef50_A0CDA1 Chromosome undetermined scaffold_17, whole genom... 64 4e-09 UniRef50_C4JI95 Putative uncharacterized protein n=3 Tax=Onygena... 64 4e-09 UniRef50_B5ZN80 von Willebrand factor type A n=8 Tax=Rhizobiales... 64 4e-09 UniRef50_Q09E12 Inter-alpha-inhibitor H4 heavy chain, putative n... 64 4e-09 UniRef50_B9GN58 Predicted protein (Fragment) n=3 Tax=rosids RepI... 64 4e-09 UniRef50_C5E9N8 von Willebrand factor type A n=4 Tax=Bifidobacte... 63 5e-09 UniRef50_B8CPU4 Von Willebrand factor, type A n=1 Tax=Shewanella... 63 5e-09 UniRef50_D0N9Y3 Putative uncharacterized protein n=2 Tax=Phytoph... 63 5e-09 UniRef50_B2BSQ4 Structural toxin protein n=4 Tax=Legionella pneu... 63 5e-09 UniRef50_C3YPR2 Putative uncharacterized protein (Fragment) n=1 ... 63 5e-09 UniRef50_B7AA98 von Willebrand factor type A n=3 Tax=Thermus Rep... 63 5e-09 UniRef50_UPI00016E1D39 UPI00016E1D39 related cluster n=1 Tax=Tak... 63 5e-09 UniRef50_C7DHI4 von Willebrand factor type A n=1 Tax=Candidatus ... 63 5e-09 UniRef50_UPI0000F2E695 PREDICTED: hypothetical protein n=1 Tax=M... 63 5e-09 UniRef50_A9BS02 von Willebrand factor type A n=1 Tax=Delftia aci... 63 5e-09 UniRef50_Q2QSE5 Os12g0431700 protein n=3 Tax=Oryza sativa RepID=... 63 5e-09 UniRef50_Q8YNZ7 Alr4412 protein n=6 Tax=Cyanobacteria RepID=Q8YN... 63 6e-09 UniRef50_A6GBY0 Putative uncharacterized protein n=1 Tax=Plesioc... 63 6e-09 UniRef50_A2E6Y7 von Willebrand factor type A domain containing p... 63 6e-09 UniRef50_C5YMJ6 Putative uncharacterized protein Sb07g002215 (Fr... 63 6e-09 UniRef50_A6M139 von Willebrand factor, type A n=1 Tax=Clostridiu... 63 6e-09 UniRef50_C9ZGQ6 Putative membrane protein n=3 Tax=Streptomyces R... 63 6e-09 UniRef50_Q5NJK1 Matrilin-3a n=5 Tax=Danio rerio RepID=Q5NJK1_DANRE 63 6e-09 UniRef50_A4QDZ6 Putative uncharacterized protein n=1 Tax=Coryneb... 63 6e-09 UniRef50_O05809 Uncharacterized protein Rv2850c/MT2916 n=51 Tax=... 63 6e-09 UniRef50_A4XTA4 Hemolysin-type calcium-binding region n=1 Tax=Ps... 63 7e-09 UniRef50_Q083T9 Vault protein inter-alpha-trypsin domain protein... 63 7e-09 UniRef50_B0WHU4 Sushi n=3 Tax=Culicini RepID=B0WHU4_CULQU 63 7e-09 UniRef50_C5WZE3 Putative uncharacterized protein Sb01g019910 n=1... 63 8e-09 UniRef50_D1HBR9 Whole genome shotgun sequence of line PN40024, s... 63 8e-09 UniRef50_B5ZY26 Vault protein inter-alpha-trypsin domain protein... 63 8e-09 UniRef50_B1G2X7 Putative uncharacterized protein n=1 Tax=Burkhol... 63 8e-09 >UniRef50_P76396 Uncharacterized protein yegL n=64 Tax=cellular organisms RepID=YEGL_ECOLI Length = 219 Score = 335 bits (860), Expect = 5e-91, Method: Composition-based stats. Identities = 219/219 (100%), Positives = 219/219 (100%) Query: 1 MSEQITFATSDFASNPEPRCPCILLLDVSGSMNGRPINELNAGLVTFRDELLADPLALKR 60 MSEQITFATSDFASNPEPRCPCILLLDVSGSMNGRPINELNAGLVTFRDELLADPLALKR Sbjct: 1 MSEQITFATSDFASNPEPRCPCILLLDVSGSMNGRPINELNAGLVTFRDELLADPLALKR 60 Query: 61 VELGIVTFGPVHVEQPFTSAANFFPPILFAQGDTPMGAAITKALDMVEERKREYRANGIS 120 VELGIVTFGPVHVEQPFTSAANFFPPILFAQGDTPMGAAITKALDMVEERKREYRANGIS Sbjct: 61 VELGIVTFGPVHVEQPFTSAANFFPPILFAQGDTPMGAAITKALDMVEERKREYRANGIS 120 Query: 121 YYRPWIFLITDGAPTDEWQAAANKVFRGEEDKRFAFFSIGVQGADMKTLAQISVRQPLPL 180 YYRPWIFLITDGAPTDEWQAAANKVFRGEEDKRFAFFSIGVQGADMKTLAQISVRQPLPL Sbjct: 121 YYRPWIFLITDGAPTDEWQAAANKVFRGEEDKRFAFFSIGVQGADMKTLAQISVRQPLPL 180 Query: 181 QGLQFRELFSWLSSSLRSVSRSTPGTEVVLEAPKGWTSV 219 QGLQFRELFSWLSSSLRSVSRSTPGTEVVLEAPKGWTSV Sbjct: 181 QGLQFRELFSWLSSSLRSVSRSTPGTEVVLEAPKGWTSV 219 >UniRef50_UPI00016C400A von Willebrand factor, type A n=1 Tax=Gemmata obscuriglobus UQM 2246 RepID=UPI00016C400A Length = 249 Score = 281 bits (718), Expect = 1e-74, Method: Composition-based stats. Identities = 130/249 (52%), Positives = 163/249 (65%), Gaps = 30/249 (12%) Query: 1 MSEQITFATSDFASNPEPRCPCILLLDVSGSMN----------GR--------------- 35 M+EQI F+ A+NPEPRCPC+LL+D SGSM GR Sbjct: 1 MAEQIPFSDVALATNPEPRCPCVLLIDTSGSMAEVVSGTGRDLGRTAQVDGKTYRVVSGG 60 Query: 36 --PINELNAGLVTFRDELLADPLALKRVELGIVTFGP-VHVEQPFTSAANFFPPILFAQG 92 I+ +N GL ++ ++ DPLA +RVE+ +VTFG V PF + + F PP+L A G Sbjct: 61 TTRIDLVNEGLRVYQADVTNDPLAAQRVEVSVVTFGDTVRTVTPFVTTSQFTPPVLTANG 120 Query: 93 DTPMGAAITKALDMVEERKREYRANGISYYRPWIFLITDGAPTDEWQAAANKVFRGEEDK 152 +TPMGAAI KA+D V ERKREYR NG+ +YRPWIFLITDG PTD W+AAA +V GEE K Sbjct: 121 ETPMGAAILKAIDAVTERKREYRQNGLHFYRPWIFLITDGEPTDAWEAAAARVREGEEKK 180 Query: 153 RFAFFSIGVQGADMKTLAQISVRQPLPLQGLQFRELFSWLSSSLRSVSRSTPG--TEVVL 210 +FAFF++GV+GA+M L QISVRQPL L+G F+E+F WLS S RSVS S PG +V L Sbjct: 181 QFAFFAVGVEGANMDRLKQISVRQPLHLKGYSFKEMFLWLSQSQRSVSHSNPGQEEQVKL 240 Query: 211 EAPKGWTSV 219 P GW S+ Sbjct: 241 APPAGWASL 249 >UniRef50_Q3M2E0 von Willebrand factor, type A n=2 Tax=Cyanobacteria RepID=Q3M2E0_ANAVT Length = 218 Score = 279 bits (715), Expect = 3e-74, Method: Composition-based stats. Identities = 113/217 (52%), Positives = 142/217 (65%), Gaps = 3/217 (1%) Query: 5 ITFATSDFASNPEPRCPCILLLDVSGSMNGRPINELNAGLVTFRDELLADPLALKRVELG 64 + +F NPE RCP ILLLD SGSM+G+PI ELN GL TF+++++ D A VE+ Sbjct: 1 MPVGLPEFVENPENRCPVILLLDTSGSMSGQPIQELNRGLATFKEDVIKDSQASLSVEVA 60 Query: 65 IVTFGPVHVEQPFTSAANFFPPILFAQGDTPMGAAITKALDMVEERKREYRANGISYYRP 124 I+TFGPV + Q F + F PP L A+G TPMG AI ALD++E RK Y+ NGI YYRP Sbjct: 61 IITFGPVRLVQDFVNIDQFTPPQLEAEGVTPMGEAIEYALDLLETRKSAYKENGILYYRP 120 Query: 125 WIFLITDGAPTDEWQAAANKVFRGEEDKRFAFFSIGVQGADMKTLAQI--SVRQPLPLQG 182 WIFLITDGAPTD + AA +V E ++R FF++GVQGAD L QI + R P+ L G Sbjct: 121 WIFLITDGAPTDYYHLAAQRVKEAEANRRLCFFTVGVQGADFNKLRQIAPAERPPVILNG 180 Query: 183 LQFRELFSWLSSSLRSVSRSTPGTEVVLEAPKGWTSV 219 L FR LF WLS+S++ VS G V L P GW + Sbjct: 181 LDFRSLFVWLSTSMKRVSSGKIGEAVALP-PVGWGQI 216 >UniRef50_Q114A2 von Willebrand factor, type A n=3 Tax=Cyanobacteria RepID=Q114A2_TRIEI Length = 1204 Score = 278 bits (712), Expect = 8e-74, Method: Composition-based stats. Identities = 102/214 (47%), Positives = 134/214 (62%), Gaps = 1/214 (0%) Query: 7 FATSDFASNPEPRCPCILLLDVSGSMNGRPINELNAGLVTFRDELLADPLALKRVELGIV 66 +F NPE RCP ILLLD S SM+G I ELN G+ F+ + D LA RVE+ ++ Sbjct: 618 LPEPEFVENPENRCPIILLLDTSYSMSGEAITELNQGVKIFQASVKEDELASLRVEIAVI 677 Query: 67 TFGP-VHVEQPFTSAANFFPPILFAQGDTPMGAAITKALDMVEERKREYRANGISYYRPW 125 TF + V Q F + F P L A G T MG AI KAL+++E+RK++Y+ + I YYRPW Sbjct: 678 TFNSEIEVVQDFVTVDKFIPKTLEASGVTHMGKAIEKALELLEKRKQDYKNSDIQYYRPW 737 Query: 126 IFLITDGAPTDEWQAAANKVFRGEEDKRFAFFSIGVQGADMKTLAQISVRQPLPLQGLQF 185 IFLITDG PTD WQ AA K+ E +++ FF++GV+ ADM+TL++ISV P L GL F Sbjct: 738 IFLITDGQPTDTWQDAAKKIEEAETNRKLLFFAVGVRDADMETLSEISVCPPKKLNGLDF 797 Query: 186 RELFSWLSSSLRSVSRSTPGTEVVLEAPKGWTSV 219 + LF WLS SL+ VS S G + L W + Sbjct: 798 QSLFKWLSFSLQQVSVSKIGEKNRLPPTNAWEEI 831 >UniRef50_B8HUC4 von Willebrand factor type A n=7 Tax=Bacteria RepID=B8HUC4_CYAP4 Length = 254 Score = 276 bits (706), Expect = 4e-73, Method: Composition-based stats. Identities = 143/217 (65%), Positives = 162/217 (74%), Gaps = 3/217 (1%) Query: 6 TFATSDFASNPEPRCPCILLLDVSGSMNGRPINELNAGLVTFRDELLADPLALKRVELGI 65 F T DFA+NPEPRCP ILLLD SGSM G PI ELNAG+ FRDELLAD LA KRVE+ I Sbjct: 38 AFGTDDFANNPEPRCPVILLLDTSGSMRGTPIQELNAGVELFRDELLADALASKRVEVAI 97 Query: 66 VTFGPVHVEQPFTSAANFFPPILFAQGDTPMGAAITKALDMVEERKREYRANGISYYRPW 125 V FGPV V Q F +A F PP L A+ DTP+GAAI ALD+++ RK Y+ANGI+YYRPW Sbjct: 98 VGFGPVQVIQDFVTADYFNPPKLRAEADTPLGAAIETALDLLQSRKDTYKANGIAYYRPW 157 Query: 126 IFLITDGAPTDEWQAAANKVFRGEEDKRFAFFSIGVQGADMKTLAQISVRQPLPLQGLQF 185 +FLITDG PTD WQ AA +V GE K FAFFSIGV+GA + LAQIS R PL L+ L+F Sbjct: 158 VFLITDGGPTDHWQTAARRVKEGESKKSFAFFSIGVEGARIDILAQISTRTPLKLKELRF 217 Query: 186 RELFSWLSSSLRSVSRSTPGTEVVL---EAPKGWTSV 219 R+LF WLSSSL+SVSRSTPG EV L P GW SV Sbjct: 218 RDLFQWLSSSLKSVSRSTPGDEVPLLNPATPDGWASV 254 >UniRef50_Q8YTA2 Alr2822 protein n=11 Tax=Bacteria RepID=Q8YTA2_ANASP Length = 224 Score = 269 bits (688), Expect = 4e-71, Method: Composition-based stats. Identities = 118/225 (52%), Positives = 154/225 (68%), Gaps = 7/225 (3%) Query: 1 MSEQITFATS-DFASNPEPRCPCILLLDVSGSMNGRPINELNAGLVTFRDELLADPLALK 59 M++ +T +FA NPEPRCPC+LLLD SGSM G I LN GL++ +DEL+ + +A + Sbjct: 1 MNDTLTLDEVVEFAENPEPRCPCVLLLDTSGSMQGAAIEALNQGLLSLKDELMKNSIAAR 60 Query: 60 RVELGIVTFGP-VHVEQPFTSAANFFPPILFAQGDTPMGAAITKALDMVEERKREYRANG 118 RVE+ IVTF ++V Q F +A F PPIL AQG T MGA I KALDMV+ERK YRANG Sbjct: 61 RVEIAIVTFDSHINVIQDFVTADQFNPPILTAQGLTSMGAGIHKALDMVQERKSLYRANG 120 Query: 119 ISYYRPWIFLITDGAPTDEW----QAAANKVFRGEEDKRFAFFSIGVQGADMKTLAQISV 174 ++YYRPW+F+ITDG P E + AA ++ E +KR AFFS+GV+ A+M L QI+V Sbjct: 121 VAYYRPWVFMITDGEPQGELDHLVEQAALRLQGDEVNKRVAFFSVGVENANMTRLNQIAV 180 Query: 175 RQPLPLQGLQFRELFSWLSSSLRSVSRSTPGTEVVLEAPKGWTSV 219 R PL L+GL F E+F WLS+S+ +VS S +V L GW S+ Sbjct: 181 RTPLKLKGLNFIEMFVWLSASMSAVSHSQIDEQVALPPI-GWGSI 224 >UniRef50_B5W3I3 von Willebrand factor type A n=4 Tax=Cyanobacteria RepID=B5W3I3_SPIMA Length = 228 Score = 269 bits (688), Expect = 5e-71, Method: Composition-based stats. Identities = 113/215 (52%), Positives = 152/215 (70%), Gaps = 6/215 (2%) Query: 10 SDFASNPEPRCPCILLLDVSGSMNGRPINELNAGLVTFRDELLADPLALKRVELGIVTFG 69 +FA NPEPRCPC+LLLD S SM G P++ LNAGL+TFR+ L+ D LA KRVE+ I+TF Sbjct: 15 VEFAENPEPRCPCVLLLDTSASMQGEPLDGLNAGLMTFRENLIKDELAKKRVEIAIITFD 74 Query: 70 P-VHVEQPFTSAANFFPPILFAQGDTPMGAAITKALDMVEERKREYRANGISYYRPWIFL 128 V + Q F +A F PP+L AQG T MG AI +ALDM+ RK EYR NGI+YYRPW+F+ Sbjct: 75 NQVKIIQDFVTADRFEPPLLNAQGQTYMGTAIGEALDMIASRKAEYRNNGITYYRPWVFM 134 Query: 129 ITDGAP---TDE-WQAAANKVFRGEEDKRFAFFSIGVQGADMKTLAQISVRQPLPLQGLQ 184 ITDG P +D + A ++ E +K+ AFF++GV+GA+M+ L +++ R PL L+GL Sbjct: 135 ITDGEPQGESDRITEQAIKRIRDEEANKQVAFFAVGVEGANMERLGEMAQRTPLKLKGLD 194 Query: 185 FRELFSWLSSSLRSVSRSTPGTEVVLEAPKGWTSV 219 FRE+F WLS+S+++VS S +V L P GW +V Sbjct: 195 FREMFIWLSASMQTVSHSKVDEQVALPPP-GWGTV 228 >UniRef50_B9XNU9 von Willebrand factor type A n=1 Tax=bacterium Ellin514 RepID=B9XNU9_9BACT Length = 229 Score = 264 bits (676), Expect = 1e-69, Method: Composition-based stats. Identities = 112/227 (49%), Positives = 148/227 (65%), Gaps = 8/227 (3%) Query: 1 MSEQITFATSDFASNPEPRCPCILLLDVSGSMNGRPINELNAGLVTFRDELLADPLALKR 60 MS+Q+ F +F NPEPRCPC+L+LDVS SM G I+ LN G+ F +L LA KR Sbjct: 1 MSDQLPFIDVEFVDNPEPRCPCVLVLDVSSSMRGAAIDFLNLGVDLFAHDLTRSRLACKR 60 Query: 61 VELGIVTFGP-VHVEQPFTSAANFFPPILFAQGDTPMGAAITKALDMVEERKREYRANGI 119 VE I+TFG VH+ Q F S + F PP A G TPMG A+ +A +++E+RKR+YRA G+ Sbjct: 61 VETAIITFGDGVHIVQDFVSPSAFVPPRFEAGGKTPMGEAVVQACELLEKRKRKYRAAGV 120 Query: 120 SYYRPWIFLITDGAPTD----EWQAAANKVFRGEEDKRFAFFSIGVQGADMKTLAQI--S 173 SY+RPWIFLITDG PTD W+ A V GE DK+ FF + V A+ L ++ + Sbjct: 121 SYFRPWIFLITDGEPTDYETANWRQAVEIVRAGEVDKKLMFFGVAVSDANQGKLNELCPA 180 Query: 174 VRQPLPLQGLQFRELFSWLSSSLRSVSRSTPGTE-VVLEAPKGWTSV 219 R + L GL F+ LF+WLSSSLR+VS + PGT+ +VL + GW +V Sbjct: 181 SRPAIKLNGLDFQGLFTWLSSSLRTVSSANPGTQGIVLPSIAGWATV 227 >UniRef50_C0AYK3 Putative uncharacterized protein n=1 Tax=Proteus penneri ATCC 35198 RepID=C0AYK3_9ENTR Length = 227 Score = 255 bits (652), Expect = 6e-67, Method: Composition-based stats. Identities = 93/225 (41%), Positives = 130/225 (57%), Gaps = 6/225 (2%) Query: 1 MSEQITFATSDFASNPEPRCPCILLLDVSGSMNGRPINELNAGLVTFRDELLADPLALKR 60 M E + N E R P IL+LD SGSM G+PI +LN GL EL D +A KR Sbjct: 1 MMEHLMIPDVALVDNSEQRTPLILVLDSSGSMYGQPIQQLNEGLKLLEQELKNDVIAAKR 60 Query: 61 VELGIVTFG---PVHVEQPFTSAANFFPPILFAQGDTPMGAAITKALDMVEERKREYRAN 117 V + ++ +G + + A +F P+L A G TPMG AIT AL+ +E K+ ++ Sbjct: 61 VRILVIEYGGYDQCTIHGDWKDAMDFTAPVLEANGTTPMGQAITLALEEIEAEKQRFKQA 120 Query: 118 GISYYRPWIFLITDGAPTDEWQAAANKVFRGEEDKRFAFFSIGVQGADMKTLAQIS---V 174 G++Y RPW+FL++DG PTD+W+ AA + EE ++ A F I V GA + + S V Sbjct: 121 GVAYTRPWLFLMSDGVPTDQWEQAAQLCRQAEESQKTAVFPIMVDGASAEVMGSFSRNGV 180 Query: 175 RQPLPLQGLQFRELFSWLSSSLRSVSRSTPGTEVVLEAPKGWTSV 219 L+GLQF+ELF WLS+S++ VS+STPG L + W SV Sbjct: 181 NGVKMLKGLQFKELFLWLSASMQVVSQSTPGGTAQLPSTDSWASV 225 >UniRef50_C9RRF6 von Willebrand factor type A n=3 Tax=Bacteria RepID=C9RRF6_FIBSS Length = 228 Score = 246 bits (628), Expect = 4e-64, Method: Composition-based stats. Identities = 95/228 (41%), Positives = 135/228 (59%), Gaps = 9/228 (3%) Query: 1 MSEQITFATSDFASNPEPRCPCILLLDVSGSMNGRPINELNAGLVTFRDELLADPLALKR 60 M+ + D +NP R P L+LD SGSM G+PI+ELN G+ F D + +D AL Sbjct: 1 MTNEKLLNIEDLENNPSTRVPVCLVLDTSGSMEGQPISELNEGINCFYDAVRSDETALYA 60 Query: 61 VELGIVTF-GPVHVEQPFTSAANF-FPPILFAQGDTPMGAAITKALDMVEERKREYRANG 118 E+ +VTF G ++ F++ + P FA G TPMG A+ ALD++E+RK EY+A+G Sbjct: 61 AEIAVVTFGGSAVLKTDFSTLEHQPDSPNFFANGGTPMGEAMNMALDLLEKRKGEYKASG 120 Query: 119 ISYYRPWIFLITDGAPTDE---WQAAANKVFRGEEDKRFAFFSIGVQ-GADMKTLAQIS- 173 + YY+PWI L+TDG P + + A + ++++ F IG+ ADM LA S Sbjct: 121 VDYYQPWIVLMTDGKPNGDSSEYARAVQRTCEMIKNRKLTIFPIGIGEDADMNALAAFSP 180 Query: 174 VRQPLPLQGLQFRELFSWLSSSLRSVSRSTPGTEVVLEA--PKGWTSV 219 R PL LQGL FRE F+WLS S+ VS+STPG ++ L+ KGW + Sbjct: 181 KRSPLKLQGLNFREFFAWLSKSVSKVSQSTPGDKIQLDTDGIKGWAEL 228 >UniRef50_Q2SNJ1 Uncharacterized protein encoded in toxicity protection region of plasmid R478, contains von Willebrand factor (VWF) domain n=1 Tax=Hahella chejuensis KCTC 2396 RepID=Q2SNJ1_HAHCH Length = 223 Score = 242 bits (618), Expect = 7e-63, Method: Composition-based stats. Identities = 97/223 (43%), Positives = 130/223 (58%), Gaps = 6/223 (2%) Query: 1 MSEQITFATSDFASNPEPRCPCILLLDVSGSMNGRPINELNAGLVTFRDELLADPLALKR 60 MS+ + F N R PC+L+LD S SM G PI +LN GL L D R Sbjct: 1 MSDTLI-PDVVFNDNNSQRTPCVLVLDGSSSMFGEPIRQLNEGLKLLERALKEDASTAMR 59 Query: 61 VELGIVTFGP---VHVEQPFTSAANFFPPILFAQGDTPMGAAITKALDMVEERKREYRAN 117 V+L ++ G V + A +F P +FA G TP+G A+ ALD +E++K Y AN Sbjct: 60 VQLLVIRAGNHDQAEVLTDWVDAMDFNAPEVFANGTTPLGGAMNLALDKIEDQKAAYDAN 119 Query: 118 GISYYRPWIFLITDGAPTD-EWQAAANKVFRGEEDKRFAFFSIGVQGADMKTLAQISVRQ 176 GIS RPWI LI+DGAPTD W+A A++ E++++ F IGV+GA +TL Q S + Sbjct: 120 GISSTRPWIILISDGAPTDFNWEAVADRCRHAEQNRKVVIFPIGVEGATFETLNQFSNKG 179 Query: 177 PLPLQGLQFRELFSWLSSSLRSVSRSTPGTEVVLEAPKGWTSV 219 L+GLQFRELF WLS S+ +VS S+PG +V L A W+ V Sbjct: 180 AKKLKGLQFRELFVWLSRSMATVSVSSPGEKVQLPATD-WSEV 221 >UniRef50_Q87W17 von Willebrand factor type A domain protein n=2 Tax=Proteobacteria RepID=Q87W17_PSESM Length = 224 Score = 240 bits (612), Expect = 3e-62, Method: Composition-based stats. Identities = 88/224 (39%), Positives = 122/224 (54%), Gaps = 5/224 (2%) Query: 1 MSEQITFATSDFASNPEPRCPCILLLDVSGSMNGRPINELNAGLVTFRDELLADPLALKR 60 M E+ + D NP R P L+LDVSGSM G PI EL AG+ F + D +A Sbjct: 1 MQEEYILSQEDLVDNPTARVPICLVLDVSGSMAGEPIRELQAGVNMFYQAIREDEVAQYA 60 Query: 61 VELGIVTFGP-VHVEQPFTSAANFFPPILFAQGDTPMGAAITKALDMVEERKREYRANGI 119 E+ IVTFG F + P L A+G T MG + ALD++E RK +Y+ G+ Sbjct: 61 AEISIVTFGSEAKRTVDFMAIERQDVPALIAEGTTSMGQGVNLALDLLEVRKGDYQRAGV 120 Query: 120 SYYRPWIFLITDGAPTDEWQAAANKVFRGEEDKRFAFFSIGVQ-GADMKTLAQIS-VRQP 177 YY+PW+ ++TDG PTD+ A+ ++ E K+ F I + A++ L +S R P Sbjct: 121 DYYQPWMVVMTDGEPTDDITRASERIREMCESKKLTVFPIAIGTAANLDILGMLSPGRPP 180 Query: 178 LPLQGLQFRELFSWLSSSLRSVSRSTPGTEVVLE--APKGWTSV 219 L L+GL F+E F WLS S+ VS+STPG V+L+ W V Sbjct: 181 LRLKGLNFKEFFLWLSRSVSRVSQSTPGETVILDKAGIDAWGQV 224 >UniRef50_Q2FNC6 von Willebrand factor, type A n=1 Tax=Methanospirillum hungatei JF-1 RepID=Q2FNC6_METHJ Length = 233 Score = 239 bits (611), Expect = 4e-62, Method: Composition-based stats. Identities = 103/231 (44%), Positives = 134/231 (58%), Gaps = 12/231 (5%) Query: 1 MSEQITFATSDFASNPEPRCPCILLLDVSGSMNGRPINELNAGLVTFRDELLADPLALKR 60 M + D P C +L+LD S SM+G I ELN GL DEL D LA+KR Sbjct: 1 MGLEKLEDIVDIPYPQHPHCATVLVLDTSASMSGNKIAELNEGLRILTDELKEDDLAVKR 60 Query: 61 VELGIVTFGP-VHVEQPFTSAANFFPPILFAQGDTPMGAAITKALDMVEERKREYRANGI 119 ++L ++TFG V + +PFT + F PP L A G TPMG AI +A+ +VEERK EYR G Sbjct: 61 IDLAVITFGKGVELVRPFTGISAFDPPELSAGGYTPMGQAILEAVRLVEERKAEYRTIGT 120 Query: 120 SYYRPWIFLITDGAPTDE------WQAAANKVFRGEEDKRFAFFSIGVQGADMKTLAQIS 173 YYRPWIFLITDG PTD W+ V GE D +F F+++GV A+M L +IS Sbjct: 121 DYYRPWIFLITDGQPTDMRKGDEIWEKVIEAVHGGERDHKFLFWALGVDQANMTVLREIS 180 Query: 174 --VRQPLPLQGLQFRELFSWLSSSLRSVSRSTPGTEVVLE---APKGWTSV 219 R PL L+ ++ E+F WLS SL +S S G ++ LE P+GW + Sbjct: 181 PPGRTPLMLKEAKWAEMFLWLSKSLSQISDSRIGEQISLENPVGPEGWGVI 231 >UniRef50_C9RJ63 von Willebrand factor type A n=1 Tax=Fibrobacter succinogenes subsp. succinogenes S85 RepID=C9RJ63_FIBSS Length = 227 Score = 235 bits (600), Expect = 8e-61, Method: Composition-based stats. Identities = 97/227 (42%), Positives = 128/227 (56%), Gaps = 9/227 (3%) Query: 2 SEQITFATSDFASNPEPRCPCILLLDVSGSMNGRPINELNAGLVTFRDELLADPLALKRV 61 + + D +NP R P L+LD SGSM G INELN G+ F D + +D AL Sbjct: 1 MNRNLLSIEDLENNPSSRVPVCLVLDTSGSMEGDSINELNEGVRLFYDAVRSDETALYAA 60 Query: 62 ELGIVTFGPVHVEQPFTSAANFFP--PILFAQGDTPMGAAITKALDMVEERKREYRANGI 119 E+ +VTFG Q S P P +A G TPMG A+ ALDM+E+RK EY+A+G+ Sbjct: 61 EISVVTFGGHASCQAGFSTLEHQPDAPQFYADGGTPMGEAMNMALDMLEKRKSEYKASGV 120 Query: 120 SYYRPWIFLITDGAPTDEWQAAANKVFRGE---EDKRFAFFSIGVQ-GADMKTLAQISV- 174 YY+PWI L+TDG P + + R D++ F IG+ ADM LA+ S Sbjct: 121 DYYQPWIVLMTDGMPNGSQAELSRSIQRTCDMINDRKLTIFPIGIGEDADMDVLARFSPK 180 Query: 175 RQPLPLQGLQFRELFSWLSSSLRSVSRSTPGTEVVL--EAPKGWTSV 219 R PL LQGL F+E F+WLS S+ VS+STPG +V L + KGW + Sbjct: 181 RSPLKLQGLNFKEFFAWLSKSVSKVSQSTPGDKVQLDVDGIKGWAEL 227 >UniRef50_A9AV55 von Willebrand factor type A n=2 Tax=Bacteria RepID=A9AV55_HERA2 Length = 222 Score = 231 bits (590), Expect = 1e-59, Method: Composition-based stats. Identities = 89/210 (42%), Positives = 119/210 (56%), Gaps = 5/210 (2%) Query: 15 NPEPRCPCILLLDVSGSMNGRPINELNAGLVTFRDELLADPLALKRVELGIVTFGP-VHV 73 N E +C CIL++D SGSM GRPI+ELN GL F ++ +R+E+ +V F Sbjct: 10 NYEQKCLCILVVDTSGSMQGRPIDELNQGLQVFHQDISNSFSTAQRLEICLVEFNSQADC 69 Query: 74 EQPFTSAANFFPPILFAQGDTPMGAAITKALDMVEERKREYRANGISYYRPWIFLITDGA 133 + F PIL G T + + A+ V+ERK YR+ G YYRPWI L+TDG Sbjct: 70 IVEPSLVDQFHMPILAVAGTTKLVDGVRLAIHKVQERKSWYRSTGQPYYRPWIILMTDGE 129 Query: 134 PTDEWQAA--ANKVFRGEEDKRFAFFSIGVQGADMKTLAQISV--RQPLPLQGLQFRELF 189 P + A A ++ G +K+F FF IGVQGADM+ L QIS R P+ LQGL+F F Sbjct: 130 PDSDQDVAGLAREIQHGVNNKQFVFFPIGVQGADMRMLQQISTPDRPPMLLQGLRFEAFF 189 Query: 190 SWLSSSLRSVSRSTPGTEVVLEAPKGWTSV 219 WLS+SL V+ ST G + L + GW + Sbjct: 190 DWLSASLSMVASSTDGQVIQLPSTSGWGIL 219 >UniRef50_C1TQB2 Uncharacterized protein n=1 Tax=Dethiosulfovibrio peptidovorans DSM 11002 RepID=C1TQB2_9BACT Length = 225 Score = 228 bits (581), Expect = 1e-58, Method: Composition-based stats. Identities = 89/223 (39%), Positives = 122/223 (54%), Gaps = 5/223 (2%) Query: 2 SEQITFATSDFASNPEPRCPCILLLDVSGSMNGRPINELNAGLVTFRDELLADPLALKRV 61 S + ++ NP PR P L+LDVSGSM G PI ELN G+ F L D +A Sbjct: 3 SLNMILDQNEMVENPTPRVPVSLVLDVSGSMLGAPIEELNRGVELFFKSLKDDDVARYSA 62 Query: 62 ELGIVTFGP-VHVEQPFTSAANFFPPILFAQGDTPMGAAITKALDMVEERKREYRANGIS 120 E+ +++F V E F P L A G T MG A++ AL+ +E+RK YR G+ Sbjct: 63 EVSVISFSNEVTQEVDFGPLEKCDIPELKAIGKTRMGGAVSLALESLEKRKELYRTLGVD 122 Query: 121 YYRPWIFLITDGAPTDEWQAAANKVFRGEEDKRFAFFSIGVQG-ADMKTLAQIS-VRQPL 178 YY+PW+ ++TDG P D+WQ AA K + + F I + A TL + S R PL Sbjct: 123 YYQPWMVIMTDGKPNDDWQLAAAKTSALVDKGKLTVFPIAIGDNACTDTLKEFSPARNPL 182 Query: 179 PLQGLQFRELFSWLSSSLRSVSRSTPGTEVVLE--APKGWTSV 219 L+ L F+E F WLSSS+ VS+S PG +V L+ +GW S+ Sbjct: 183 RLKDLNFQEFFRWLSSSVSKVSQSIPGEKVELDLKGLEGWASL 225 >UniRef50_D1PS09 von Willebrand factor type A domain protein n=1 Tax=Subdoligranulum variabile DSM 15176 RepID=D1PS09_9FIRM Length = 246 Score = 222 bits (565), Expect = 8e-57, Method: Composition-based stats. Identities = 83/244 (34%), Positives = 115/244 (47%), Gaps = 35/244 (14%) Query: 11 DFASNPEPRCPCILLLDVSGSM---NGR-----------------------PINELNAGL 44 D NP PR P L LD SGSM G ++EL G+ Sbjct: 3 DLVENPTPRVPICLCLDTSGSMGAVQGDCVDTGKTLFEDGRQWNLVTGGTSRLDELQKGI 62 Query: 45 VTFRDELLADPLALKRVELGIVTFGP-VHVEQPFTSAANF-FPPILFAQGDTPMGAAITK 102 F + + D +A E+ IVTF F + P L A GDT MG + Sbjct: 63 KLFYNSVREDEVARYAAEICIVTFDSEAKCRMDFANLDRQSDLPELTATGDTAMGEGVNL 122 Query: 103 ALDMVEERKREYRANGISYYRPWIFLITDGAPTDE---WQAAANKVFRGEEDKRFAFFSI 159 ALD++E RKREY+ G+ Y++PW+ L+TDG P ++ A + E K+ F I Sbjct: 123 ALDLLESRKREYQDKGVDYFQPWLVLMTDGVPNGNEGEFERAVQRCRDMEAQKKLTVFPI 182 Query: 160 GVQG-ADMKTLAQISV-RQPLPLQGLQFRELFSWLSSSLRSVSRSTPGTEVVLE--APKG 215 + D LA+ S R PL LQGL+FRE F+WLS S+ S+S PG + L+ + +G Sbjct: 183 AIGDEGDQTALAKFSAKRPPLKLQGLKFREFFAWLSQSVAKTSQSMPGETIKLDLNSIQG 242 Query: 216 WTSV 219 W + Sbjct: 243 WAEL 246 >UniRef50_C8PVC3 von Willebrand factor type A domain protein n=1 Tax=Enhydrobacter aerosaccus SK60 RepID=C8PVC3_9GAMM Length = 260 Score = 221 bits (564), Expect = 1e-56, Method: Composition-based stats. Identities = 84/230 (36%), Positives = 127/230 (55%), Gaps = 17/230 (7%) Query: 7 FATSDFASNPEPRCPCILLLDVSGSM-----NG--RPINELNAGLVTFRDELLADPLALK 59 F D+ +N R C+L+LD+SGSM NG R I+ LN G+ F +L+ D A Sbjct: 29 FREIDYGNNVAQRTLCVLVLDLSGSMAIRSGNGDKRRIDMLNEGIEAFYHDLMKDETARN 88 Query: 60 RVELGIVTFG----PVHVEQPFTSAANFFPPILFAQGDTPMGAAITKALDMVEERKREYR 115 RV L IV G + +T A +FFP G TP+G + AL+++E+ + R Sbjct: 89 RVRLAIVIVGGVNDTAELMMDWTDAIDFFPIKFRENGMTPLGQGMLLALNLIEQERINLR 148 Query: 116 ANGISYYRPWIFLITDGAPTDE---WQAAANKVFRGEEDKRFAFFSIGVQGA--DMKTLA 170 NGI+Y RPW+ +TDG PTD WQAA N+ + E++ + + I + ++K L Sbjct: 149 DNGINYTRPWVIAMTDGLPTDSQDVWQAAINQCHQAEQNNQCIIYPIAIDAGVQEVKMLK 208 Query: 171 QISVR-QPLPLQGLQFRELFSWLSSSLRSVSRSTPGTEVVLEAPKGWTSV 219 Q+S+ P+ L ++F E F WLS+SL++VS+S PG V L + W ++ Sbjct: 209 QLSILTPPVHLNSVKFVEFFVWLSASLKTVSQSAPGETVQLGSISPWATI 258 >UniRef50_C6Z299 Tellurium resistance protein n=1 Tax=Bacteroides sp. 4_3_47FAA RepID=C6Z299_9BACE Length = 348 Score = 220 bits (562), Expect = 2e-56, Method: Composition-based stats. Identities = 58/207 (28%), Positives = 91/207 (43%), Gaps = 8/207 (3%) Query: 18 PRCPCILLLDVSGSMNGRPINELNAGLVTFRDELLADPLALKRVELGIVTF-GPVHVEQP 76 R P L+DVS SM G PI ++ G+ EL DP AL+ + ++ F G P Sbjct: 2 RRLPVYFLVDVSESMVGAPIQQVQDGMRMIVQELRTDPYALETAYISVIAFAGKAKCVSP 61 Query: 77 FTSAANFFPPILFAQGDTPMGAAITKALDMVEERKREYRANGISYYRPWIFLITDGAPTD 136 T F+PP G T +G A+ +D +++ ++P +FL TDG PTD Sbjct: 62 LTELYKFYPPTFPIGGGTSLGNALEFLMDDMDKTLVRTTTEQKGDWKPIVFLFTDGNPTD 121 Query: 137 EWQAAANKVFRGEEDK-RFAFFSIGVQGADMKTLAQIS--VRQPLPLQGLQFRELFSWLS 193 A + K SIG + + L QIS V + + F+ F W++ Sbjct: 122 NPSNAFTRWNNKYRGKANIVAISIG-DNVNTQLLGQISDNVLRLNKTDEISFKSFFKWVT 180 Query: 194 SSLRSVSRSTP---GTEVVLEAPKGWT 217 +S+++ S S +V L + G + Sbjct: 181 ASIKATSVSVSDMGDDDVKLASTSGIS 207 >UniRef50_A3PUP3 von Willebrand factor, type A n=1 Tax=Mycobacterium sp. JLS RepID=A3PUP3_MYCSJ Length = 233 Score = 213 bits (543), Expect = 3e-54, Method: Composition-based stats. Identities = 91/220 (41%), Positives = 120/220 (54%), Gaps = 15/220 (6%) Query: 14 SNPEPRCPCILLLDVSGSMNGRPINELNAGLVTFRDELLADPLALKRVELGIVTFGPVHV 73 +NP+PR C++L DVSGSM G PI L G F L + LA KRVE+ +VTFG V Sbjct: 13 ANPDPRVACVVLADVSGSMQGEPIAALERGFAAFTRYLQNEVLASKRVEVAVVTFGTVAT 72 Query: 74 E-QPFTSAANFFPPILFAQGDTPMGAAITKALDMVEERKREYRANGISYYRPWIFLITDG 132 P A P A G T M A I ALD++E+RK Y+A G+ YYRPWI L+TDG Sbjct: 73 VLVPMQEARTLQPVAFTASGTTNMAAGIHLALDILEDRKHAYKAAGLQYYRPWILLLTDG 132 Query: 133 APT-DEWQAAANKVFRGEEDKRFAFFSIGVQG-ADMKTLAQISV-RQPLPLQGLQFRELF 189 P D + A ++ E + F++G D + L ++S+ R P PL+GL++ ELF Sbjct: 133 KPNLDGFDEAVARLNAVESARGVTVFAVGAGPRVDYQQLGRLSLQRSPAPLEGLKYEELF 192 Query: 190 SWLSSSLRSVSRSTP-----------GTEVVLEAPKGWTS 218 WLS+SL +VS ST V L + GWTS Sbjct: 193 EWLSASLSNVSNSTEFARDDQTHEAMNGRVPLPSAAGWTS 232 >UniRef50_D1TTW6 von Willebrand factor type A domain protein n=10 Tax=Enterobacteriaceae RepID=D1TTW6_YERPE Length = 327 Score = 210 bits (535), Expect = 3e-53, Method: Composition-based stats. Identities = 53/197 (26%), Positives = 94/197 (47%), Gaps = 5/197 (2%) Query: 18 PRCPCILLLDVSGSMNGRPINELNAGLVTFRDELLADPLALKRVELGIVTF-GPVHVEQP 76 R P +LD S SM G + ++N GL ++L DP AL+ + ++ F G P Sbjct: 2 RRLPIFFVLDCSESMIGENLKKMNDGLQMIINDLKKDPHALETAWISVIAFAGVAKTIVP 61 Query: 77 FTSAANFFPPILFAQGDTPMGAAITKALDMVEERKREYRANGISYYRPWIFLITDGAPTD 136 +F+PP L G T +GAA+ + ++ + R+ ++P ++L+TDG PTD Sbjct: 62 LVEVVSFYPPRLPIGGGTSLGAALQELTRQIDTQVRKTTEERKGDWKPVVYLLTDGRPTD 121 Query: 137 EWQAAANKVFRGEEDKRFAFFSIGVQ-GADMKTLAQISVRQPL--PLQGLQFRELFSWLS 193 + A + ++ ++ +IG+ AD+ L Q++ L Q F + W++ Sbjct: 122 DTTAEITR-WKTHYARKVNLIAIGLGPSADLNILRQLTENVLLFNDTQEGDFTQFIKWIT 180 Query: 194 SSLRSVSRSTPGTEVVL 210 +S+ + SRS L Sbjct: 181 ASVSAHSRSVGEESPPL 197 >UniRef50_UPI000185CA78 von Willebrand factor type A n=1 Tax=Capnocytophaga sputigena ATCC 33612 RepID=UPI000185CA78 Length = 347 Score = 209 bits (533), Expect = 5e-53, Method: Composition-based stats. Identities = 56/204 (27%), Positives = 93/204 (45%), Gaps = 8/204 (3%) Query: 18 PRCPCILLLDVSGSMNGRPINELNAGLVTFRDELLADPLALKRVELGIVTF-GPVHVEQP 76 R P LLDVS SM G PI + G+ T EL ADP AL+ V L I+ F G V P Sbjct: 2 RRLPIYFLLDVSESMVGDPIEHVQDGMATIIKELKADPFALETVWLSIIGFAGKSKVITP 61 Query: 77 FTSAANFFPPILFAQGDTPMGAAITKALDMVEERKREYRANGISYYRPWIFLITDGAPTD 136 F+PP + G T + + + + ++ ++ + ++P +FL TDG PTD Sbjct: 62 LQDIITFYPPKIPIGGGTSLASGLNELMNAIDREVVKTTLERKGDWKPLVFLFTDGIPTD 121 Query: 137 EWQAAANKVFRGEEDKRFAFFSIGVQ-GADMKTLAQIS--VRQPLPLQGLQFRELFSWLS 193 + A + + ++ +I + + L Q++ V Q ++E F W++ Sbjct: 122 DPAQAIER-WNAHYRRKVNLVAISLGENTNYNLLGQLTDQVLQFNNTNAAAYKEFFKWIT 180 Query: 194 SSLRSVSRSTPGT---EVVLEAPK 214 +S+++ S T + L P Sbjct: 181 ASIKTTSEQVNNTNTDVIKLAKPD 204 >UniRef50_D0W6S8 Glycosyl transferase, group 2 family n=1 Tax=Neisseria lactamica ATCC 23970 RepID=D0W6S8_NEILA Length = 191 Score = 209 bits (532), Expect = 5e-53, Method: Composition-based stats. Identities = 79/191 (41%), Positives = 112/191 (58%), Gaps = 3/191 (1%) Query: 32 MNGRPINELNAGLVTFRDELLADPLALKRVELGIVTFG-PVHVEQPFTSAANF-FPPILF 89 M G PI +LN G+ F L D +A VE+GI+ G V PFT+A + Sbjct: 1 MYGEPIEQLNQGVQQFIQALQEDEIASYSVEVGILAAGGHVEEIIPFTTAEQLDYTSTFT 60 Query: 90 AQGDTPMGAAITKALDMVEERKREYRANGISYYRPWIFLITDGAPTDEWQAAANKVFRGE 149 AQG TP+G+A+ + L M+E+RKREY+ NG++YY+PW+ +I+DG+PTD WQ AA + Sbjct: 61 AQGSTPLGSAVEQGLKMLEDRKREYQKNGVAYYQPWLVVISDGSPTDSWQNAAQETRTLA 120 Query: 150 EDKRFAFFSIGVQGADMKTLAQISVRQPLPLQGLQFRELFSWLSSSLRSVSRS-TPGTEV 208 E+++ +GV ADM L Q S R L L GL+F + F WLS+S+ VS S + +V Sbjct: 121 ENRKLVSLMVGVNDADMDKLGQFSNRPALKLDGLRFGDFFQWLSASMSRVSASNSTAAQV 180 Query: 209 VLEAPKGWTSV 219 L W S+ Sbjct: 181 SLPPIDTWASI 191 >UniRef50_C8W3F9 von Willebrand factor type A n=1 Tax=Desulfotomaculum acetoxidans DSM 771 RepID=C8W3F9_DESAS Length = 219 Score = 207 bits (528), Expect = 2e-52, Method: Composition-based stats. Identities = 60/206 (29%), Positives = 95/206 (46%), Gaps = 8/206 (3%) Query: 17 EPRCPCILLLDVSGSMNGRPINELNAGLVTFRDELLADPLALKRVELGIVTFGP-VHVEQ 75 R P LLLD SGSM G PI + G+ EL +P A++ + ++TFG + Sbjct: 9 SRRLPVYLLLDRSGSMFGEPIEAVKQGVKYMISELKKEPQAIETAYISVITFGSDARQDV 68 Query: 76 PFTSAANFFPPILFAQGDTPMGAAITKALDMVEERKREYRANGISYYRPWIFLITDGAPT 135 T A F P + A G T +GAA+ + + R+ Y+P +F++TDG PT Sbjct: 69 QLTELAAFKEPQIEANGTTSLGAALHILNNCFDNEVRKSTPTQKGDYKPLVFIMTDGEPT 128 Query: 136 DEWQAAANKVFRGEEDKRFAFFSIGVQ-GADMKTLAQISVRQPLP--LQGLQFRELFSWL 192 D+W+ AA ++ + + K ++G + TL +I+ L Q F++ F W+ Sbjct: 129 DDWENAAREIKQ-KSGKVANIVAVGCGPDVNTDTLKKITDIVLLMSSYQPEDFKQFFRWV 187 Query: 193 SSSLRSVS---RSTPGTEVVLEAPKG 215 S S++ S L AP Sbjct: 188 SQSVKQASIKFTKDSDQPTNLPAPPP 213 >UniRef50_Q19NM5 TerY1 n=54 Tax=root RepID=Q19NM5_ECOK1 Length = 239 Score = 203 bits (516), Expect = 4e-51, Method: Composition-based stats. Identities = 56/187 (29%), Positives = 86/187 (45%), Gaps = 5/187 (2%) Query: 18 PRCPCILLLDVSGSMNGRPINELNAGLVTFRDELLADPLALKRVELGIVTF-GPVHVEQP 76 R P LLLD SGSM+G PI + G+ T L DP AL+ + ++TF P Sbjct: 29 RRLPVYLLLDTSGSMHGEPIEAVKNGVQTLLTTLKQDPYALETAYVSVITFDSSARQAVP 88 Query: 77 FTSAANFFPPILFAQGDTPMGAAITKALDMVEERKREYRANGISYYRPWIFLITDGAPTD 136 T +F P L A G T +G A++ + + ++ A+ +RP +FL+TDG+P D Sbjct: 89 LTDLLSFQMPALTASGTTSLGEALSLTASSIAKEVQKTTADTKGDWRPLVFLMTDGSPND 148 Query: 137 EWQAAANKVFRGEEDKRFAFFSIGVQGADMKTLAQIS--VRQPLPLQGLQFRELFSWLSS 194 +W+ N + G AD L +I+ V Q + F W+S+ Sbjct: 149 DWRKGLNDFKAA-RTGVVVACAAG-HDADTSVLKEITEIVVQLDTADSSTIKAFFKWVSA 206 Query: 195 SLRSVSR 201 S+ S+ Sbjct: 207 SISVGSQ 213 >UniRef50_Q5NWS3 Tellurium resistance protein n=2 Tax=Proteobacteria RepID=Q5NWS3_AZOSE Length = 349 Score = 196 bits (498), Expect = 5e-49, Method: Composition-based stats. Identities = 56/191 (29%), Positives = 85/191 (44%), Gaps = 5/191 (2%) Query: 18 PRCPCILLLDVSGSMNGRPINELNAGLVTFRDELLADPLALKRVELGIVTF-GPVHVEQP 76 R P L+DVS SM G + +L GL L ADP AL+ V + ++ F G P Sbjct: 2 RRLPIFFLVDVSESMAGDNLRQLQEGLERLVRSLRADPYALETVFISVIAFAGKPKTLTP 61 Query: 77 FTSAANFFPPILFAQGDTPMGAAITKALDMVEERKREYRANGISYYRPWIFLITDGAPTD 136 F+ P L T +G+A+ +D +E + +RP ++L+TDG PTD Sbjct: 62 LVELYQFYAPRLPLGSGTSLGSAMAHLMDEMERTVQRSTPEKKGDWRPVVYLLTDGKPTD 121 Query: 137 EWQAAANKVFRGEEDKRFAFFSIGVQ-GADMKTLAQISVR--QPLPLQGLQFRELFSWLS 193 + + A + R E+ R +IGV A + L + + F+ W+S Sbjct: 122 DIEPAIKRWKRDFEE-RSNLVAIGVGKHASLSALQRFTENVLSLDATTEDDFKRFIDWIS 180 Query: 194 SSLRSVSRSTP 204 S+ S SRS Sbjct: 181 QSVASQSRSVS 191 >UniRef50_A1VWQ4 von Willebrand factor, type A n=20 Tax=Proteobacteria RepID=A1VWQ4_POLNA Length = 350 Score = 196 bits (498), Expect = 6e-49, Method: Composition-based stats. Identities = 47/194 (24%), Positives = 85/194 (43%), Gaps = 5/194 (2%) Query: 18 PRCPCILLLDVSGSMNGRPINELNAGLVTFRDELLADPLALKRVELGIVTF-GPVHVEQP 76 R P +LD S SM G + ++ + L DP AL+ V ++ F G P Sbjct: 2 RRLPVFFVLDCSESMVGANLKKMEGAVAAIVKSLRTDPQALETVFFSVIAFAGVARTIAP 61 Query: 77 FTSAANFFPPILFAQGDTPMGAAITKALDMVEERKREYRANGISYYRPWIFLITDGAPTD 136 +F+PP L G T +G+A+ + ++ + A +RP I+L+TDG PTD Sbjct: 62 LVEIVSFYPPKLPLGGGTNLGSALDALMGEIDRSVIKTTAERKGDWRPIIYLVTDGRPTD 121 Query: 137 EWQAAANKVFRGEEDKRFAFFSIGVQ-GADMKTLAQISVR--QPLPLQGLQFRELFSWLS 193 A + + K+ +IG+ D L +++ ++ F++ +W++ Sbjct: 122 NPSRAIER-WNSHYAKKATLIAIGLGRSVDFTALRRLTENVISFEDIKESDFKKFINWVT 180 Query: 194 SSLRSVSRSTPGTE 207 +S+ S+S Sbjct: 181 ASVVVQSKSVGDGT 194 >UniRef50_B7AFM8 Putative uncharacterized protein n=1 Tax=Bacteroides eggerthii DSM 20697 RepID=B7AFM8_9BACE Length = 333 Score = 193 bits (492), Expect = 2e-48, Method: Composition-based stats. Identities = 57/192 (29%), Positives = 88/192 (45%), Gaps = 8/192 (4%) Query: 32 MNGRPINELNAGLVTFRDELLADPLALKRVELGIVTF-GPVHVEQPFTSAANFFPPILFA 90 M G PI ++ G+ EL DP AL+ V + ++ F G V P T F+PP Sbjct: 1 MVGEPIIQVEKGMRNIIQELRTDPYALETVFVSVIVFAGKEKVLSPLTELYKFYPPQFPI 60 Query: 91 QGDTPMGAAITKALDMVEERKREYRANGISYYRPWIFLITDGAPTDEWQAAANKVFRGEE 150 G T +G A+ ++ +++ ++ ++P IFL TDG PTD Q A N+ + Sbjct: 61 GGGTSLGTALDCLMNDIDKSVKKTTVEMKGDWKPIIFLFTDGMPTDNPQQAFNRWNAHYK 120 Query: 151 DK-RFAFFSIGVQGADMKTLAQIS--VRQPLPLQGLQFRELFSWLSSSLRSVSRSTPG-- 205 K SIG D K L +IS V + F+ F W+++S++S S S Sbjct: 121 RKANLVCISIG-DNTDTKMLGKISDNVLRLNDTGEQSFKAFFKWVTASIKSTSVSVTDMG 179 Query: 206 -TEVVLEAPKGW 216 E+ L + G Sbjct: 180 TDEIQLASTSGI 191 >UniRef50_C7PNX3 von Willebrand factor type A n=2 Tax=Sphingobacteriales RepID=C7PNX3_CHIPD Length = 352 Score = 192 bits (489), Expect = 5e-48, Method: Composition-based stats. Identities = 47/184 (25%), Positives = 81/184 (44%), Gaps = 5/184 (2%) Query: 18 PRCPCILLLDVSGSMNGRPINELNAGLVTFRDELLADPLALKRVELGIVTF-GPVHVEQP 76 R P L+DVS SM G I + GL EL +DP AL+ + I+ F G P Sbjct: 2 RRLPIYFLIDVSESMVGEQIQFVEEGLAAIIKELKSDPYALETAWVSIIVFAGQAKTIVP 61 Query: 77 FTSAANFFPPILFAQGDTPMGAAITKALDMVEERKREYRANGISYYRPWIFLITDGAPTD 136 +F+PP T + + + + + A ++P +FL TDG PTD Sbjct: 62 LQEVISFYPPKFPIGAGTSLSNGLGHLMYEMRKNTIHTTATQKGDWKPIVFLFTDGTPTD 121 Query: 137 EWQAAANKVFRGEEDK-RFAFFSIGVQGADMKTLAQISVRQPLPLQGL--QFRELFSWLS 193 + AA + + ++K S G + ++ L +++ L ++E F W++ Sbjct: 122 DTSAAVREWKQNWQNKSNLIAISFGDEN-NLSALKELTETVLLFKNATPQSYKEFFRWVT 180 Query: 194 SSLR 197 +S++ Sbjct: 181 ASIK 184 >UniRef50_C0DBA5 Putative uncharacterized protein n=1 Tax=Clostridium asparagiforme DSM 15981 RepID=C0DBA5_9CLOT Length = 231 Score = 191 bits (486), Expect = 1e-47, Method: Composition-based stats. Identities = 74/183 (40%), Positives = 100/183 (54%), Gaps = 7/183 (3%) Query: 17 EPRCPCILLLDVSGSMNGRPINELNAGLVTFRDELLADPLALKRVELGIVTF-GPVHVEQ 75 E C+LL+D SGSM G INELN GL+ F + L D A ++ +++F V Sbjct: 20 ERHIACVLLVDTSGSMAGASINELNQGLLEFGNALDQDEHARGVADVCVISFNSNVETVV 79 Query: 76 PFTSAANFFPPILFAQGDTPMGAAITKALDMVEERKREYRANGISYYRPWIFLITDGAPT 135 PF AAN+ P L A G T M A+ LD +EERK+ YR G SYYRPW+FL+TDG PT Sbjct: 80 PFCPAANYSAPTLSAGGLTSMNEAVIAGLDAIEERKQLYRQLGCSYYRPWMFLLTDGEPT 139 Query: 136 DEWQ--AAANKVFRGEEDKRFAFFSIGVQG----ADMKTLAQISVRQPLPLQGLQFRELF 189 D+ A N++ + DK+ FF +G+ A +K+ + L QF+E F Sbjct: 140 DQNMEGEAKNRLQQALNDKKVNFFPMGIGSGANYAHLKSYTKGGNGAVLKASASQFKEAF 199 Query: 190 SWL 192 WL Sbjct: 200 VWL 202 >UniRef50_A9BLP5 von Willebrand factor type A n=3 Tax=Burkholderiales RepID=A9BLP5_DELAS Length = 244 Score = 182 bits (462), Expect = 8e-45, Method: Composition-based stats. Identities = 60/219 (27%), Positives = 92/219 (42%), Gaps = 12/219 (5%) Query: 2 SEQITFATSDFASNPEPRCPCILLLDVSGSMNGRPINELNAGLVTFRDELLADPLALKRV 61 I F S F + P +LLLDVSGSM+G I +N + D + Sbjct: 1 MSNIPFDPSKFTAPKAKPLPVVLLLDVSGSMSGEKIRNVNDAVRDMLDTFSDTENGETEI 60 Query: 62 ELGIVTFGP-VHVEQPFTSAANFFPPILFAQGDTPMGAAITKALDMVEE-RKREYRANGI 119 + I+TFG V + QP SA++ L A G TP+G A+ A M+E+ RA Sbjct: 61 HVAIITFGSQVALHQPLASASDIHWQDLSAGGMTPLGTALQMAKAMIEDKDVIPSRA--- 117 Query: 120 SYYRPWIFLITDGAPTDEWQAAANKVFRGEEDKRFAFFSIGVQ-GADMKTLAQI----SV 174 YRP + L++DG P D W+ N + ++ + AD L + S Sbjct: 118 --YRPTVVLVSDGGPNDAWEKPLNAFISDGRSAKCDRLAMAIGADADEAVLGKFIEGTSN 175 Query: 175 RQPLPLQGLQFRELFSWLSSSLRSVSRSTPGTEVVLEAP 213 R Q R+ F +++ S+ ++S V + Sbjct: 176 RLFYAENAKQLRDFFKFVTMSVTIRTKSQTPNNVPEAST 214 >UniRef50_C9ZGR0 Putative uncharacterized protein n=1 Tax=Streptomyces scabiei 87.22 RepID=C9ZGR0_STRSW Length = 253 Score = 180 bits (458), Expect = 2e-44, Method: Composition-based stats. Identities = 77/226 (34%), Positives = 103/226 (45%), Gaps = 26/226 (11%) Query: 7 FATSDFASNPEPRCPCILLLDVSGSMNGRPINELNAGLVTFRDELLADPLALKRVELGIV 66 +A +F +N R P +L LD S SM G PI LN L + EL D VE+ +V Sbjct: 13 YADIEFENN-AQRMPLVLCLDTSSSMAGPPIQTLNNALAEWTRELHDDVSLSYSVEVAVV 71 Query: 67 TFGPVHV--------------EQPFTSAANFFPPILFAQGDTPMGAAITKALDMVEERKR 112 TFG V PF A F P L A G T M A+ A+ +V RK Sbjct: 72 TFGGQGVGAWRGPQLLDPRTRTSPFIPAHAFQAPQLTAAGVTLMTEALELAMHIVAARKS 131 Query: 113 EYRANGISYYRPWIFLITDGAPTDE-------WQAAANKVFRGEEDKRFAFFSIGVQGA- 164 E RA+G+ YYRP I L+TDG PTD W + + +RF ++IGV G Sbjct: 132 ELRASGLQYYRPQICLVTDGLPTDPTGHLTDSWHRLVPVLAEEQSARRFRLYAIGVGGIT 191 Query: 165 --DMKTLAQISVRQPLPLQGLQFRELFSWLSSSLRSVSRSTPGTEV 208 + L + + +QG FREL +S+S + + G EV Sbjct: 192 DRGEQVLKAFAPKFNARIQGFPFRELLQMMSASANAEQKGA-GDEV 236 >UniRef50_Q0I303 Putative uncharacterized protein n=5 Tax=Pasteurellaceae RepID=Q0I303_HAES1 Length = 343 Score = 177 bits (448), Expect = 3e-43, Method: Composition-based stats. Identities = 47/191 (24%), Positives = 86/191 (45%), Gaps = 6/191 (3%) Query: 18 PRCPCILLLDVSGSMNGRPINELNAGLVTFRDELLADPLALKRVELGIVTF-GPVHVEQP 76 R P L++DVS SM G ++ + L DP AL+ V + ++ F G V P Sbjct: 2 RRLPIFLVVDVSESMAGDSHRQMQEAINRLVQRLRCDPYALESVYISVIAFAGAAGVIAP 61 Query: 77 FTSAANFFPPILFAQGDTPMGAAITKALDMVEERKREYRANGISYYRPWIFLITDGAPTD 136 T +F+ P L T +GAA+ +D ++ + ++P +++++DG TD Sbjct: 62 LTELMSFYAPRLPMGSGTSLGAALNLTMDEIQRNVVRSSGDQKGDFKPLVYILSDGVATD 121 Query: 137 EWQAAANKVFRGEEDKRFAFFSIGVQG-ADMKTLAQISVRQPLPLQGLQFRELFSWLSSS 195 + +A + + E R ++G+ AD+ L QI+ + E + L+ S Sbjct: 122 DPTSAIQRWQQ-EFKSRTKLIAVGLGNFADLSALNQIAELT-FRIDDQDLEEAYLTLTRS 179 Query: 196 L--RSVSRSTP 204 + +S+S Sbjct: 180 IEDSILSQSRS 190 >UniRef50_C9LWT6 Tellurium resistance protein n=1 Tax=Selenomonas sputigena ATCC 35185 RepID=C9LWT6_9FIRM Length = 212 Score = 172 bits (437), Expect = 7e-42, Method: Composition-based stats. Identities = 55/191 (28%), Positives = 86/191 (45%), Gaps = 10/191 (5%) Query: 32 MNGRPINELNAGLVTFRDELLADPLALKRVELGIVTF-GPVHVEQPFTSAANFFPPILFA 90 M G PI + G+ EL DP AL+ L ++TF V T F P L A Sbjct: 1 MMGEPIEAVRQGIKALLSELRGDPQALETAYLSVITFASQVRQTTKLTELMLFKEPRLEA 60 Query: 91 QGDTPMGAAITKALDMVEERKREYRANGISYYRPWIFLITDGAPTD--EWQAAANKVFRG 148 +G T MG A+ + V R+ +RP +FL+TDG+PTD +++ AA ++ Sbjct: 61 EGCTLMGGALKLLAECVRTEVRKNTETQKGDWRPLVFLLTDGSPTDLEDFRQAAAEIKSL 120 Query: 149 EEDKRFAFFSIGVQGADMKTLAQISVRQPLP--LQGLQFRELFSWLSSSLRSVSRS---T 203 + + G AD L Q++ + L + F+W+S S++ S+S Sbjct: 121 KLG-NIIACAAGA-DADTSYLKQLTDNVLMMNSLSAGDMAKFFAWVSGSIKMSSKSLDAK 178 Query: 204 PGTEVVLEAPK 214 PG + L P+ Sbjct: 179 PGAAIELPPPR 189 >UniRef50_Q8DK92 Tlr0974 protein n=1 Tax=Thermosynechococcus elongatus BP-1 RepID=Q8DK92_THEEB Length = 241 Score = 171 bits (433), Expect = 2e-41, Method: Composition-based stats. Identities = 59/201 (29%), Positives = 92/201 (45%), Gaps = 16/201 (7%) Query: 17 EPRCPCILLLDVSGSMNGRPINELNAGLVTFRDELLADPLALKRVELGIVTFGPVHVEQP 76 E P LLLD S SM G PI L+ GL F+ E+ +D A V++G++TF Sbjct: 19 ERHLPVYLLLDTSSSMEGAPIESLHQGLEQFQREVSSDQFARDIVKVGVITFASDAQLVT 78 Query: 77 --FTSAANFFPPILFAQGDTPMGAAITKALDMVEER-KREYRANGISYYRPWIFLITDGA 133 ++F PP+L A G T + A T L+ ++ R + ++P +F++TDG Sbjct: 79 GGLVPISDFQPPMLTASGVTRLDLAFTVLLESIDRDVVRPVKGGQKGDWKPAVFVLTDGR 138 Query: 134 PTDEWQAAANKVFRGEED----------KRFAFFSIGVQ-GADMKTLAQISVRQPLPLQG 182 PTD A ++++R D K ++G D TL IS + Sbjct: 139 PTDRHGIATDELWRPARDALVNRPKGEIKPSVIVAVGCGPHVDDDTLKAISTGTAFKMGT 198 Query: 183 LQ--FRELFSWLSSSLRSVSR 201 + F LF +LS SL + ++ Sbjct: 199 SEAAFVALFQYLSQSLTTSTQ 219 >UniRef50_B6BJ58 Phage/colicin/tellurite resistance cluster TerY protein n=1 Tax=Campylobacterales bacterium GD 1 RepID=B6BJ58_9PROT Length = 229 Score = 171 bits (433), Expect = 2e-41, Method: Composition-based stats. Identities = 52/207 (25%), Positives = 82/207 (39%), Gaps = 11/207 (5%) Query: 5 ITFATSDFASNPEPRCPCILLLDVSGSMNGRPINELNAGLVTFRDELLADPLALKRVELG 64 + F +DF P +LLLDVS SM G I+ LN + + + ++L Sbjct: 1 MAFNPADFVVEEPKSIPVVLLLDVSYSMQGENIDTLNKAVESMLNSFKKAETMETFIKLS 60 Query: 65 IVTFGP---VHVEQPFTSAANFFPPILFAQGDTPMGAAITKALDMVEERKREYRANGISY 121 I+TFG V + P T + L G TPMGAA M+E++ Sbjct: 61 IITFGSENGVDLHTPLTEVSKIDFKPLTVSGSTPMGAAFKMGKAMIEDKDIF----KGRD 116 Query: 122 YRPWIFLITDGAPTDEWQAAANKVFRGEEDKRFAFFSIGVQGADMKTLAQISVRQPLPL- 180 YRP I L++DG P D+W+ + K+ ++ + AD L L Sbjct: 117 YRPTIVLLSDGEPNDDWRQPLDDFVSTGRTKKCDRMALAIGAADKTVLNMFIEGCENSLF 176 Query: 181 ---QGLQFRELFSWLSSSLRSVSRSTP 204 + F ++ S+ ++S Sbjct: 177 YAEDAENIIDEFKKITMSVTQRTKSVN 203 >UniRef50_Q5NWS4 Tellurium resistance protein n=8 Tax=Bacteria RepID=Q5NWS4_AZOSE Length = 214 Score = 168 bits (427), Expect = 9e-41, Method: Composition-based stats. Identities = 55/206 (26%), Positives = 86/206 (41%), Gaps = 8/206 (3%) Query: 17 EPRCPCILLLDVSGSMNGRPINELNAGLVTFRDELLADPLALKRVELGIVTF-GPVHVEQ 75 R P LL+D SGSM G P+ +N GL + L +P A++ V L + TF + Sbjct: 4 SRRLPVYLLIDTSGSMRGEPVESVNVGLRAMQTSLRQNPYAIETVHLSVTTFDSQIKDVL 63 Query: 76 PFTSAANFFPPIL--FAQGDTPMGAAITKALDMVEERKREYRANGISYYRPWIFLITDGA 133 P T+ + P + A G T +G A+ LD ++ R+ A + P +F++TDG Sbjct: 64 PLTALEDATIPEIVCPASGATLLGEALEHILDRAKKEVRQSSAEQKGDWAPLLFIMTDGK 123 Query: 134 PTDEWQ-AAANKVFRGEEDKRFAFFSIGVQGADMKTLAQIS--VRQPLPLQGLQFRELFS 190 PTD + + + + G AD L I+ V + F F Sbjct: 124 PTDTFVFNQVAPAIKAFKFGSIIACAAG-PKADPAGLRLITDHVVSLDTMDSAAFTAFFQ 182 Query: 191 WLSSSLRSVSRSTPGT-EVVLEAPKG 215 W+S ++ S S S + L Sbjct: 183 WVSVTVSSGSMSVGAANTLSLPPTPP 208 >UniRef50_C5PP99 von Willebrand factor, type A n=2 Tax=Bacteroidetes RepID=C5PP99_9SPHI Length = 256 Score = 168 bits (425), Expect = 2e-40, Method: Composition-based stats. Identities = 52/204 (25%), Positives = 78/204 (38%), Gaps = 8/204 (3%) Query: 18 PRCPCILLLDVSGSMNGRPINELNAGLVTFRDELLADPLALKRVELGIVTFG-PVHVEQP 76 R P LLD SGSM+G PI LN L + L D A + + + ++TF V P Sbjct: 47 RRLPVYFLLDTSGSMHGEPIQALNNALSGMINNLRTDAQAAETLWISMITFDREVKEIVP 106 Query: 77 FTSAANFFPPILFA--QGDTPMGAAITKALDMVEERKREYRANGISYYRPWIFLITDGAP 134 T+ +F P + G T G A+ D + +RP +F+ TDG P Sbjct: 107 LTALESFQLPEISCPESGPTFTGKALEILYDTATREVIKGSPEQKGDWRPLLFIFTDGKP 166 Query: 135 TD--EWQAAANKVFRGEEDKRFAFFSIGVQGADMKTLAQIS-VRQPLPLQGLQFRELFSW 191 +D + K+ + G D K L S V ++ F W Sbjct: 167 SDLQLYSQMIPKIRSLNFGT-IVGCAAGHMADDKKLLELTSDVVHLNTADSSTLKQFFKW 225 Query: 192 LSSSLRSVSRST-PGTEVVLEAPK 214 +S ++ ++S V L P Sbjct: 226 VSDTIEQGNKSRGTTDTVALPPPP 249 >UniRef50_B2K3B2 von Willebrand factor type A n=39 Tax=Gammaproteobacteria RepID=B2K3B2_YERPB Length = 233 Score = 163 bits (413), Expect = 4e-39, Method: Composition-based stats. Identities = 53/181 (29%), Positives = 80/181 (44%), Gaps = 9/181 (4%) Query: 18 PRCPCILLLDVSGSMNGRPINELNAGLVTFRDELLADPLALKRVELGIVTFGP-VHVEQP 76 R P LL+D SGSM G I+ +N G+ L DP AL+ V L I+T+ P Sbjct: 23 RRLPVYLLIDTSGSMRGESIHAVNVGIQAMMSALRQDPYALESVHLSIITYDNQAREYIP 82 Query: 77 FTSAANFFPPILF--AQGDTPMGAAITKALDMVEERKREYRANGISYYRPWIFLITDGAP 134 T+ NF + + G T GAA+ + VE + + +RP +FL+TDG P Sbjct: 83 LTALENFQFTDITVPSAGGTFTGAALECLIHCVERDIQRSDGDQKGDWRPLVFLMTDGTP 142 Query: 135 TD--EWQAAANKVFRGEEDKRFAFFSIGVQGADMKTLAQIS--VRQPLPLQGLQFRELFS 190 +D + A +V + ++G + A + L Q++ V L F F Sbjct: 143 SDVYAYGEAIKEVKKRAFGS-IIACAVGAK-AKHEHLKQLTSQVVALETLDSTAFSGFFK 200 Query: 191 W 191 W Sbjct: 201 W 201 >UniRef50_UPI0001AED79F von Willebrand factor type A n=1 Tax=Streptomyces albus J1074 RepID=UPI0001AED79F Length = 221 Score = 162 bits (411), Expect = 6e-39, Method: Composition-based stats. Identities = 51/186 (27%), Positives = 79/186 (42%), Gaps = 11/186 (5%) Query: 20 CPCILLLDVSGSMNGRPINELNAGLVTFRDELLADPLALKRVELGIVTFG-PVHVEQPFT 78 P LL D SGSM G PI+ +N L E+ +P + ++ F V QP Sbjct: 4 LPFYLLCDESGSMTGDPIDAINRALPDLHHEISTNPTVADKTRFCLIGFSDDASVLQPLV 63 Query: 79 SAANFFP-PILFAQGDTPMGAAITKALDMVEERKREYRANGISYYRPWIFLITDGAPTD- 136 ++ P L A G T G A L VE+ E +A G YRP F ++DG PTD Sbjct: 64 DLSDIDEVPALSAGGLTDYGTAFRTLLRSVEKDVAELKAQGHEVYRPVAFFLSDGIPTDE 123 Query: 137 EWQAAANKVFRGEEDKRFAFFSIGVQGADMKTLAQISVRQPLPLQ------GLQFRELFS 190 +W A ++ + + G+ A+ + + Q++ + + RE S Sbjct: 124 DWPTAHRELLNSRYAPK--IIAFGIGDAEAQIIGQVANFRAFIQKDNSVSPAQALREFAS 181 Query: 191 WLSSSL 196 L+ S+ Sbjct: 182 SLTRSI 187 >UniRef50_UPI0001745BB0 TerY3 n=1 Tax=Verrucomicrobium spinosum DSM 4136 RepID=UPI0001745BB0 Length = 345 Score = 161 bits (407), Expect = 2e-38, Method: Composition-based stats. Identities = 50/200 (25%), Positives = 89/200 (44%), Gaps = 9/200 (4%) Query: 18 PRCPCILLLDVSGSMNGRPINELNAGLVTFRDELLADPLALKRVELGIVTFGP-VHVEQP 76 R P +LLD S SM G + + G+ + L +P AL+ + +TF ++ P Sbjct: 2 RRLPVYVLLDCSESMIGNGLRGMRTGISSMLKALRQNPHALETAWISFITFDSRAELKSP 61 Query: 77 FTSAANFFPPILFAQGDTPMGAAITKALDMVEERKREYRAN--GISYYRPWIFLITDGAP 134 S PP L + T +G+A+ + + + + + +RP + LITDG P Sbjct: 62 LQSLDEVQPPRLLVRPGTSLGSALLLLSERILQEVKRTQPGTLTKGDFRPIVILITDGQP 121 Query: 135 TDEWQAAANKVFRGEEDKRFAFFSIGVQ-GADMKTLAQIS--VRQPLPLQGLQFRELFSW 191 TD+W++A ++ K +++G D L +++ V + +LF W Sbjct: 122 TDDWRSALREM--NSTVKIANLYAVGCGDDIDFAGLREMTDVVLNLQQTDEQGWAKLFVW 179 Query: 192 LSSSLRSVSRST-PGTEVVL 210 +S ++ + SR G E L Sbjct: 180 ISETVSTASRGVADGDEREL 199 >UniRef50_UPI00018742C1 von Willebrand factor type A n=1 Tax=Corynebacterium amycolatum SK46 RepID=UPI00018742C1 Length = 228 Score = 160 bits (404), Expect = 5e-38, Method: Composition-based stats. Identities = 55/214 (25%), Positives = 91/214 (42%), Gaps = 12/214 (5%) Query: 15 NPEPR---CPCILLLDVSGSMNGRPINELNAGLVTFRDELLADPLALKRVELGIVTFGP- 70 N EPR P + D SGSM G I ELN+GL + +E+ P A V ++ F Sbjct: 6 NMEPRGNILPIYFVADESGSM-GPDIAELNSGLQSLLNEIRMAPFAAANVRFSVIGFDNE 64 Query: 71 VHVEQPFTSAANF-FPPILFAQGDTPMGAAITKALDMVEERKREYRANGISYYRPWIFLI 129 + + P L A+ T A + + + +A G RP +FL+ Sbjct: 65 ARLYLSNADLRHVEQMPTLSARFATYFSTAFDLLNAQIPDDVAQLKAEGYRVNRPAVFLL 124 Query: 130 TDGAP--TDEWQAAANKVFRGEEDKRFAFFSIGVQGADMKTLAQISVRQPLPLQGLQFRE 187 TDG P D WQ A + + + G+ +D + + Q++ + Q Q + Sbjct: 125 TDGYPMSDDLWQEALSDLQNAPHHPN--ILAFGIGESDAQIIGQMASKNGWAFQAAQGAD 182 Query: 188 LFSWLSSSLRSVSRS--TPGTEVVLEAPKGWTSV 219 + LS + S+++S + GT V +P+ T + Sbjct: 183 TGAMLSEFMSSLTQSVISSGTSVANGSPEIITDI 216 >UniRef50_D1AAR7 von Willebrand factor type A n=1 Tax=Thermomonospora curvata DSM 43183 RepID=D1AAR7_THECD Length = 228 Score = 158 bits (401), Expect = 9e-38, Method: Composition-based stats. Identities = 48/169 (28%), Positives = 74/169 (43%), Gaps = 3/169 (1%) Query: 17 EPRCPCILLLDVSGSMNGRPINELNAGLVTFRDELLADPLALKRVELGIVTFGP-VHVEQ 75 E P L+ D S SM G P+ E+N L E+ ++P + L I++F V Sbjct: 3 EQILPFYLVCDESYSMAGNPLQEINDQLPQIVTEIASNPTVADKARLCIISFSDTAEVLL 62 Query: 76 PFTSAAN-FFPPILFAQGDTPMGAAITKALDMVEERKREYRANGISYYRPWIFLITDGAP 134 P + P L +G T GAA T D +E R+ +A G +RP +F +TDG P Sbjct: 63 PLADLNDVHQVPQLAPKGATSYGAAFTLLRDTIERDIRDLKAAGHVPFRPTVFFLTDGQP 122 Query: 135 TD-EWQAAANKVFRGEEDKRFAFFSIGVQGADMKTLAQISVRQPLPLQG 182 TD +W A ++ + R + G +TL ++ + G Sbjct: 123 TDSDWATAHQRLTAKDFGPRPTILAFGFGDVRPETLRAVATFRAFIANG 171 >UniRef50_A1VV60 von Willebrand factor, type A n=4 Tax=Proteobacteria RepID=A1VV60_POLNA Length = 240 Score = 156 bits (394), Expect = 5e-37, Method: Composition-based stats. Identities = 60/216 (27%), Positives = 96/216 (44%), Gaps = 14/216 (6%) Query: 10 SDFASNPEPRCPCILLLDVSGSMNGR-PINELNAGLVTFRDELLADPLALKRVELGIVTF 68 FA+ P I+L DVSGSM+ I+ LN L + +++G++TF Sbjct: 19 KAFAAPQARPLPVIVLADVSGSMSENGKIDALNVALKEMILSFGKESGLRAEIQVGLITF 78 Query: 69 G--PVHVEQPFTSAANFF-PPILFAQGDTPMGAAITKALDMVEERKREYRANGISYYRPW 125 G H P +A A G TPMG+A A ++E++++ YRP Sbjct: 79 GGREAHEHLPLVAAKVIGGVEAFKANGGTPMGSAFALARKLLEDKEQIPSRA----YRPV 134 Query: 126 IFLITDGAPTDEWQAAANKVFRGEEDKRFAFFSIGVQ-GADMKTLAQISVRQPLPL---- 180 + L++DGAPTD W+A + E ++ F++ + AD+ LAQ + P+ Sbjct: 135 LILVSDGAPTDAWEAPLADLKASERGQKATRFAMAIGADADLDMLAQFPNDREAPVFKTH 194 Query: 181 QGLQFRELFSWLSSSLRSVSRS-TPGTEVVLEAPKG 215 + F ++ S+ S S S P V L+ G Sbjct: 195 EARDIGRFFRAVTMSVVSRSTSAAPDQPVTLDMEDG 230 >UniRef50_UPI0001B52F00 von Willebrand factor type A n=1 Tax=Fusobacterium sp. D11 RepID=UPI0001B52F00 Length = 218 Score = 155 bits (392), Expect = 1e-36, Method: Composition-based stats. Identities = 48/200 (24%), Positives = 80/200 (40%), Gaps = 11/200 (5%) Query: 11 DFASNPEPRCPCILLLDVSGSMNGRPINELNAGLVTFRDELLADPLALKRVELGIVTFGP 70 +F S P+ P ILL D S SM + ELN + L + + +TFG Sbjct: 2 EFTSQPKKVLPLILLADTSSSMR-EWMRELNTAIRDMLGTLKEQESLKAEIHISFITFGN 60 Query: 71 --VHVEQPFTSAANFFPPILFAQGDTPMGAAITKALDMVEERKREYRANGISYYRPWIFL 128 ++ T +N G TP+G A+ A +MVE R+ + Y P I L Sbjct: 61 GGANLHTALTPVSNIEFNDFTEGGMTPLGGALRIAKEMVENREIIPSKS----YAPIILL 116 Query: 129 ITDGAPTDE-WQAAANKVFRGEEDKRFAFFSIGVQ-GADMKTLAQISVRQPLPL--QGLQ 184 ++DGAP D W+ + K+ S+G+ D L S + + Sbjct: 117 LSDGAPNDNGWENEMYRFINDGRSKKCMRMSLGIGRDYDYDVLKGFSSNGEVYEAKDSMN 176 Query: 185 FRELFSWLSSSLRSVSRSTP 204 + F +++ +++ + S Sbjct: 177 IIDFFKFMTMTIKEKTLSKD 196 >UniRef50_UPI00016C09D7 hypothetical protein Epulo_01596 n=1 Tax=Epulopiscium sp. 'N.t. morphotype B' RepID=UPI00016C09D7 Length = 262 Score = 153 bits (387), Expect = 4e-36, Method: Composition-based stats. Identities = 52/225 (23%), Positives = 86/225 (38%), Gaps = 21/225 (9%) Query: 8 ATSDFASNPEPRCPCILLLDVSGSMNGRPINELNAGLVTFRDELL--ADPLALKRVELGI 65 A ++ P +LD SGSM G PI LN + L A A ++++ + Sbjct: 3 AINEIDEMPRKELHVFYVLDTSGSMTGVPIAALNTAMEECTVALKDLAKKNADAKLKIAV 62 Query: 66 VTFGP----VHVEQPFTSAANFFPPILFAQGDTPMGAAITKALDMVEERKREYRANGISY 121 + F V P + F L A G T +GAA+ + ++ + + + Sbjct: 63 LEFSTGAKWVTYNGPESLDDEFEWEHLSAGGVTDIGAALREL--DIKLSRNGFLKSMTGA 120 Query: 122 YRPWIFLITDGAPTDEWQAAANKVFRGEEDKRFAFFSIGVQ-GADMKTLAQISVRQPLPL 180 P I +TDG PTDE+ AA ++ + + AD ++ I + Sbjct: 121 LMPVIIFMTDGYPTDEYAAALAELRKNRWYTSSTKIGFAIGDDADAAIISSIVGNSEAVI 180 Query: 181 QGLQFRELFS-----------WLSSSLRSVSRSTPGTEVVLEAPK 214 + ELF L+SS R+ S G+ +V +A Sbjct: 181 KTSDL-ELFKRLMKFVTVRASMLASSSRTTSSFVNGSAIVQDAID 224 >UniRef50_A6G3C6 von Willebrand factor, type A n=1 Tax=Plesiocystis pacifica SIR-1 RepID=A6G3C6_9DELT Length = 211 Score = 151 bits (382), Expect = 1e-35, Method: Composition-based stats. Identities = 75/207 (36%), Positives = 106/207 (51%), Gaps = 14/207 (6%) Query: 27 DVSGSM-NGRPINELNAGLVTFRDELLADPLALKRVELGIVTFGPVHV-EQPFTSAANFF 84 D S SM I +N L FR +++ DPLA KR++L +++F V E F SA N+ Sbjct: 2 DRSHSMIFNDHIGAVNRSLQAFRADIMEDPLARKRLDLCVISFNHECVTENHFCSAQNWR 61 Query: 85 PPILFAQGDTPMGAAITKALDMVEERKREYRANGISYYRPWIFLITDGAPTD------EW 138 PP L G T MG AI L+ + R + YR +G+ YRPW+ L+TDG PTD W Sbjct: 62 PPTLVPGGATGMGQAIKVGLETLRGRLQRYRLDGVDCYRPWVMLVTDGLPTDMQPNDARW 121 Query: 139 QAAANKVFRGEEDKRFAFFSIGVQGADMKTLAQI-SVRQPLPLQGLQFRELFSWLSSSLR 197 + GE+ +RF FFS+ V + L Q+ + R PL ++ + +F WLS+S Sbjct: 122 MEVRQLIQDGEQKRRFMFFSVAVLPEAIPALRQLGAQRPPLQVREGKIPTMFKWLSASFS 181 Query: 198 SVSRSTPGTEV-----VLEAPKGWTSV 219 S+SRS G V GW + Sbjct: 182 SISRSQLGAPVSGITEPTATETGWADI 208 >UniRef50_B4VGR3 Putative uncharacterized protein n=1 Tax=Streptomyces sp. Mg1 RepID=B4VGR3_9ACTO Length = 206 Score = 149 bits (376), Expect = 7e-35, Method: Composition-based stats. Identities = 44/195 (22%), Positives = 80/195 (41%), Gaps = 9/195 (4%) Query: 28 VSGSMNGRPINELNAGLVTFRDELLADPLALKRVELGIVTFGP-VHVEQPFTSAANFFPP 86 +SGSM+G P+ +N L + +L DP + + +VTF P + A+ P Sbjct: 1 MSGSMSGGPMAAMNTALPAMQRAILDDPTTGEIARVSVVTFSDTAACVLPLSDMAHARMP 60 Query: 87 ILFAQGDTPMGAAITKALDMVEERKREYRANGISYYRPWIFLITDGA--PTDEWQAAANK 144 L QG T + + G Y+RP +F ++DG + W++ ++ Sbjct: 61 TLSPQGGTDFAEGFRVGREAL-VDGIGALGRGARYHRPVVFFLSDGQHNSSQSWKSGFDR 119 Query: 145 VFRGEEDKRFAFFSIGVQGADMKTLAQISVRQPLPLQGLQ----FRELFSWLSSSLRSVS 200 + E+ S G A+ +AQ+S R + + +E+ + S+++ S Sbjct: 120 LRSKEDKYGAEVVSFGFGQANRDVIAQVSTRHAFFAEDMDPAVAVKEILHTVLMSIKTTS 179 Query: 201 RS-TPGTEVVLEAPK 214 S G L P+ Sbjct: 180 GSFQAGGAAGLTIPE 194 >UniRef50_C0F0K9 Putative uncharacterized protein n=2 Tax=Eubacterium hallii DSM 3353 RepID=C0F0K9_9FIRM Length = 291 Score = 146 bits (368), Expect = 6e-34, Method: Composition-based stats. Identities = 51/206 (24%), Positives = 86/206 (41%), Gaps = 13/206 (6%) Query: 8 ATSDFASNPEPRCP---CILLLDVSGSMNGRPINELNAGLVTFRDELLADPLALKRVELG 64 A DF EP ++DVSGSM G I LN+ + L+ A V++ Sbjct: 37 AEEDFLDTMEPAKKSMTIFFMIDVSGSMKGTKIGSLNSTMEELLPSLIGVGEASTDVKIA 96 Query: 65 IVTFG-PVHVEQ--PFTSAANFFPPILFAQGDTPMGAAITKALDMVEERKREYRANGISY 121 I+ F V P + L A G T MG A + + + + ++ Sbjct: 97 IMKFSTDVEWVTPEPVKIEEYQYWNRLEADGLTFMGDAFMELSKKL--SRSTFLSSPSLS 154 Query: 122 YRPWIFLITDGAPTDEWQAAANKVFRGEEDKRFAFFSIGVQG-ADMKTLAQISVRQPLPL 180 + P IFL++DG+P D+W+ + + + + + ++G+ +M L + L + Sbjct: 155 FAPVIFLLSDGSPNDDWKKGLDTLKQNKWFQHGLKIALGIGSKVNMDVLRAFTGNDELAV 214 Query: 181 QGL---QFRELFSWLSSSLRSV-SRS 202 Q Q REL L+ + + SRS Sbjct: 215 QAKNADQLRELIKLLAVTSSQIGSRS 240 >UniRef50_C2HF13 von Willebrand factor type A n=1 Tax=Finegoldia magna ATCC 53516 RepID=C2HF13_PEPMA Length = 249 Score = 144 bits (364), Expect = 2e-33, Method: Composition-based stats. Identities = 51/204 (25%), Positives = 91/204 (44%), Gaps = 16/204 (7%) Query: 16 PEPRCPCILLLDVSGSMNGRPINELNAGLVTFRDEL--LADPLALKRVELGIVTFGP-VH 72 P ++D SGSM G I E+N+ + EL +++ V++ I++F + Sbjct: 11 PRKMMVLFFVIDTSGSMKGTKIGEVNSAIEEILPELSDISNSNPDAEVKMAILSFNSEIQ 70 Query: 73 VEQPFTSAAN---FFPPILFAQGDTPMGAAITKALDMVEERKREYRANGISYYRPWIFLI 129 P T + + L A G T MGAA + + K + + S Y P IFL+ Sbjct: 71 WITPKTGPVDPGVYLWRDLNANGTTRMGAAFEELESKLHGDK--FMKSATSSYAPVIFLM 128 Query: 130 TDGAPT---DEWQAAANKVFRGEEDKRFAFFSIGVQ-GADMKTLAQISVRQPLPLQGLQF 185 +DG PT +++Q+ NK+ + K ++G+ AD+ L + + LQ Sbjct: 129 SDGMPTETEEQFQSGLNKLKANKWFKSGIKVALGIGQDADLDVLEAFTGTKEAVLQTNNV 188 Query: 186 ---RELFSWLS-SSLRSVSRSTPG 205 + + ++S +S + S+S G Sbjct: 189 KKLKAMIQFVSVTSSQIASQSVSG 212 >UniRef50_Q0RFF8 Putative uncharacterized protein n=1 Tax=Frankia alni ACN14a RepID=Q0RFF8_FRAAA Length = 209 Score = 140 bits (353), Expect = 3e-32, Method: Composition-based stats. Identities = 49/184 (26%), Positives = 77/184 (41%), Gaps = 11/184 (5%) Query: 28 VSGSMNGRPINELNAGLVTFRDELLADPLALKRVELGIVTFGPV-HVEQPFTSAANFFPP 86 +S SM G P+ LN L + E+ ++P + + IVTF V P A + P Sbjct: 1 MSASMAGGPLEALNDSLPALQKEMQSNPTVGEIARISIVTFSDVGRTVVPLCDLAEVYLP 60 Query: 87 ILFAQGDTPMGAAITKALDMVEERKREYRANGISYYRPWIFLITDGA---PTDEWQAAAN 143 L +G T AA + +E R G YRP +F ++DG P D W AA N Sbjct: 61 ELMVEGGTNFAAAFQETRRAIEGGLR-SLPKGTPIYRPVVFFMSDGEHQAPGD-WTAALN 118 Query: 144 KVFRGEEDKRFAFFSIGVQG-ADMKTLAQISVRQPLPLQ----GLQFRELFSWLSSSLRS 198 + + G ++ ++ +I+ R + Q RE+ + L S+R+ Sbjct: 119 DLRDRSWRFAPEVVAFGFGDQVNVDSIRRIATRFSFLARDADPATQVREIMNALIGSIRT 178 Query: 199 VSRS 202 S S Sbjct: 179 TSTS 182 >UniRef50_C9RJN5 von Willebrand factor type A n=1 Tax=Fibrobacter succinogenes subsp. succinogenes S85 RepID=C9RJN5_FIBSS Length = 256 Score = 140 bits (352), Expect = 4e-32, Method: Composition-based stats. Identities = 54/211 (25%), Positives = 87/211 (41%), Gaps = 15/211 (7%) Query: 7 FATSD-FASNPEPR--CPCILLLDVSGSMNGRPINELNAGLVTFRDEL--LADPLALKRV 61 AT D FA P PR I ++D SGSM+G I LN + D++ ++ ++ Sbjct: 1 MATKDPFALEPIPRRVTHLIFMVDTSGSMSGSKIASLNTAVRDALDDVGDISKNCGDSQI 60 Query: 62 ELGIVTFGPV---HVEQPFTSAANFFPPILFAQGDTPMGAAITKALDMVEERKREYRANG 118 ++ ++ F EQP A F L A G T G+A + LD R + Sbjct: 61 KIAVLEFSSAVNWMYEQPL-EAEKFQWQDLSASGTTSFGSACAE-LDAKLSRSNGFMGEK 118 Query: 119 ISYYRPWIFLITDGAPTDEWQAAANKVFRGEEDKRFAFFSIGVQ-GADMKTLAQISVRQP 177 P I L++DGAPTD + K+ K +I + A+ L + + Sbjct: 119 TGCRAPAIVLLSDGAPTDGYVRKLEKLKGNRWFKAGVKVAIAIGDDANNDVLREFTGSSE 178 Query: 178 LPL---QGLQFRELFSWLSSSLRSV-SRSTP 204 + Q +++ +S S +V S+S Sbjct: 179 SVITVHNVDQLKKMIHTVSVSATTVASQSAS 209 >UniRef50_UPI0001C37785 von Willebrand factor type A n=1 Tax=Ruminococcus flavefaciens FD-1 RepID=UPI0001C37785 Length = 285 Score = 140 bits (352), Expect = 5e-32, Method: Composition-based stats. Identities = 46/210 (21%), Positives = 86/210 (40%), Gaps = 14/210 (6%) Query: 8 ATSDFASNP-------EPRCPCILLLDVSGSMNGRPINELNAGLVTFRDELLADPLALKR 60 + DF +P L+D SGSM G+ + ELN + E+ A Sbjct: 32 NSLDFDDDPLSATGVSRKSLVIFFLIDTSGSMKGKKMGELNTVMEELIPEIRRVGEADTE 91 Query: 61 VELGIVTFG-PVHVEQP-FTSAANFFPPILFAQGDTPMGAAITKALDMVEERKREYRANG 118 V++ ++TF V +F L A G T MGAA + + + + + Sbjct: 92 VKVAVLTFSTDVRWMYSTPIPIEDFEWARLRANGVTSMGAAFKEL--SLRMSRNSFLNSP 149 Query: 119 ISYYRPWIFLITDGAPTDEWQAAANKVFRGEEDKRFAFFSIGVQG-ADMKTLAQISVRQP 177 + P IFL+TDG P+D+++ ++ K ++G+ A+ LA+ + + Sbjct: 150 SLSFAPVIFLMTDGYPSDDYREGLKELQSNSWYKFGLKAALGIGNEANDDVLAEFTGSKD 209 Query: 178 LPLQGLQFRELFSWLSSSLRSVSRSTPGTE 207 + +L + + +V+ S G++ Sbjct: 210 TVVHAYSGGQLAQMIK--IIAVTSSQIGSK 237 >UniRef50_B0A9L7 Putative uncharacterized protein n=2 Tax=Clostridium bartlettii DSM 16795 RepID=B0A9L7_9CLOT Length = 273 Score = 136 bits (344), Expect = 4e-31, Method: Composition-based stats. Identities = 49/207 (23%), Positives = 85/207 (41%), Gaps = 10/207 (4%) Query: 4 QITFATSDFASNPEPRCPCILLLDVSGSMNGRPINELNAGLVTFRDELLADPLALKRVEL 63 + F + + L+D SGSM+G+ I LN + EL A ++L Sbjct: 18 EYDFDPLEVKPISKKNLVIFFLVDTSGSMSGKKIGTLNTTMEELLPELRGLGGATTDIKL 77 Query: 64 GIVTFGPVH---VEQPFTSAANFFPPILFAQGDTPMGAAITKALDMVEERKREYRANGIS 120 ++TF ++P + + L A+G T +G A T+ + + RK A +S Sbjct: 78 AVMTFSSGCEWITKEPMSVDDYQYWTRLKAEGLTDLGEAFTELSNKL-SRKEFLNAPSLS 136 Query: 121 YYRPWIFLITDGAPTDEWQAAANKVFRGEEDKRFAFFSIGVQGA-DMKTLAQISVRQPLP 179 Y P IFL+TDG TD+ + K ++G+ D + L + + L Sbjct: 137 -YAPVIFLLTDGYATDDALEGLKTLQHNNWYKYGLKVALGLGEKFDEELLKKFTGNPELV 195 Query: 180 LQGL---QFRELFSWLSSSLRSV-SRS 202 + Q +L ++ + + SRS Sbjct: 196 VTAKTSDQLSKLVKTIAVTSSQIGSRS 222 >UniRef50_A7BNL3 Tellurium resistance protein n=1 Tax=Beggiatoa sp. SS RepID=A7BNL3_9GAMM Length = 171 Score = 135 bits (341), Expect = 8e-31, Method: Composition-based stats. Identities = 42/151 (27%), Positives = 75/151 (49%), Gaps = 5/151 (3%) Query: 56 LALKRVELGIVTF-GPVHVEQPFTSAANFFPPILFAQGDTPMGAAITKALDMVEERKREY 114 + ++ V L +TF P + F P++ A G T +G+A+ +D +E+ R+ Sbjct: 1 MGIETVYLWDITFHSTAQQVTPLSELMLFKEPLISASGATALGSALRLLMDCLEKEVRKN 60 Query: 115 RANGISYYRPWIFLITDGAPTDEWQAAANKVFRGEEDKRFAFFSIGVQGADMKTLAQIS- 173 A ++P +FL+TDG PTD W++AA+K+ + ++ F+ G AD+ L I+ Sbjct: 61 TAVQKGDWKPLVFLMTDGMPTDAWESAADKL-KNQKSANLIAFAAG-PNADVANLKGITD 118 Query: 174 -VRQPLPLQGLQFRELFSWLSSSLRSVSRST 203 V + L + F W+S S+ +S Sbjct: 119 IVLKSEELSPGALKAFFQWMSQSILQTGKSV 149 >UniRef50_C9RK46 von Willebrand factor type A n=1 Tax=Fibrobacter succinogenes subsp. succinogenes S85 RepID=C9RK46_FIBSS Length = 236 Score = 134 bits (338), Expect = 2e-30, Method: Composition-based stats. Identities = 51/211 (24%), Positives = 93/211 (44%), Gaps = 23/211 (10%) Query: 20 CPCILLLDVSGSMNG-RPINELNAGLVTFRDELLADPLALKRVEL--GIVTFGPVH---V 73 P I+L DVSGSMN ++ L L + E+ I+TFG + Sbjct: 15 LPVIILADVSGSMNEIGKLDSLKHALNNMISSFKDASSSSLEAEIYVSIITFGNQAANII 74 Query: 74 EQPFTSAANFFPP-------ILFAQGDTPMGAAITKALDMVEERKREYRANGISYYRPWI 126 +P +++ P + A G+TP+G A+T +D++E R+ YRP+I Sbjct: 75 LEPQSASEIANDPSKMNVINKMQAIGNTPLGKALTSLVDLLENREIYPSRA----YRPFI 130 Query: 127 FLITDGAPTDEWQAAANKVFRGEEDKRFAFFSIGVQ-GADMKTLAQISVRQPLPL----Q 181 L +DG P D WQ +++ E K+ ++ + AD L + + +P+ Sbjct: 131 VLASDGMPNDLWQQPLDRLLNSERSKKANRLALAIGADADESMLKKFVNNEEMPIFKANN 190 Query: 182 GLQFRELFSWLSSS-LRSVSRSTPGTEVVLE 211 ++ ++ F ++ S ++S + PG + Sbjct: 191 AIEIQKFFKCVTMSAIKSSQSAKPGEIAPND 221 >UniRef50_B2UUD5 Phage/colicin/tellurite resistance cluster terY protein n=5 Tax=Helicobacter pylori RepID=B2UUD5_HELPS Length = 217 Score = 133 bits (335), Expect = 4e-30, Method: Composition-based stats. Identities = 48/204 (23%), Positives = 78/204 (38%), Gaps = 16/204 (7%) Query: 15 NPEPR-CPCILLLDVSGSMN-----GRPINELNAGLVTFRDELLADPLALKRVELGIVTF 68 E R P LLLD SGSMN I LN + + L + ++ I+TF Sbjct: 9 TMEERFIPVFLLLDTSGSMNESLGNCTRIEALNLCIQKMIETLKQEAKKELFSKMAIITF 68 Query: 69 GP--VHVEQPFTSAANFFPPILFAQGDTPMGAAITKALDMVEERKREYRANGISYYRPWI 126 G + PF N L A G TP+ A A D++E++ +Y+ + Sbjct: 69 GENGAVLHTPFDDVKNINFKPLSASGGTPLDQAFRLAKDLIEDKD----TFPTKFYKLYS 124 Query: 127 FLITDGAPTDE-WQAAANKVFRGEEDKRFAFFS--IGVQGADMKTLAQISVRQPLPLQGL 183 L++DG P D+ WQ A + + +S IG + + + + Sbjct: 125 ILVSDGEPNDDKWQKALSNFHHDGRSAKSVCWSIFIGDRNTNPQVNKDFGKDGVFYADDV 184 Query: 184 Q-FRELFSWLSSSLRSVSRSTPGT 206 + LF ++ ++ S S Sbjct: 185 EKLVGLFEIMTQTISKGSTSIKDD 208 >UniRef50_A1THU3 von Willebrand factor, type A n=1 Tax=Mycobacterium vanbaalenii PYR-1 RepID=A1THU3_MYCVP Length = 248 Score = 131 bits (331), Expect = 1e-29, Method: Composition-based stats. Identities = 51/164 (31%), Positives = 72/164 (43%), Gaps = 12/164 (7%) Query: 20 CPCILLLDVSGSMNGRPINELNAGLVTFRDELLADPLALKRVELGIVTFGPVHVEQ-PFT 78 P L+ DVS SM G I LN L FRD L +P+ +V+ G++ F E P Sbjct: 19 LPFWLVCDVSASM-GPHIGTLNQSLRDFRDSLATNPVLADKVQFGVIDFSDTATEVIPLG 77 Query: 79 S--AANFFPPILFAQGDTPMGAAITKALDMVEERKREYRANGISYYRPWIFLITDGAPTD 136 +A+ L +G T G A T ++E R A+ Y+RP +F +TDG PTD Sbjct: 78 DFSSADLERHQLRTRGGTSYGQAFTTVQQIIE-RDLAAGADRFRYFRPAVFFLTDGQPTD 136 Query: 137 -EWQAAANKVF------RGEEDKRFAFFSIGVQGADMKTLAQIS 173 W+ A + F G+ AD TLA++ Sbjct: 137 RHWREAFRDLTFFDQASGQGFRSYPLFVPFGIGDADAATLAELV 180 >UniRef50_C7N2G1 Uncharacterized protein n=2 Tax=Slackia heliotrinireducens DSM 20476 RepID=C7N2G1_SLAHD Length = 272 Score = 126 bits (318), Expect = 3e-28, Method: Composition-based stats. Identities = 53/232 (22%), Positives = 93/232 (40%), Gaps = 35/232 (15%) Query: 6 TFATSDFA-SNPEPRCPCILLLDVSGSMN--GRPINELNAGLVTFRDEL----LADPLAL 58 ++ + P I ++D SGSMN GR I+ +N + D L +P A Sbjct: 9 PMPNIEYTMAKARKLLPIIYVIDTSGSMNFYGR-ISAVNRAMNETLDVLGDVAAKNPTAD 67 Query: 59 KRVELGIVTFG---------PVHVEQPFTSAANFFPPILFAQGDTPMGAAITKALDMVEE 109 V++ ++ F PV + +F+ L A+G T +GAA+ + + + Sbjct: 68 --VKVAVLGFSTGAEWITTDPVTGKPALMDLEDFYWDKLKARGSTDLGAALIELGEQLTR 125 Query: 110 RKREYRANGISYYRPWIFLITDGAPTDEWQAAANKVFRGEEDKRF---AFFSIGVQGADM 166 + + P I ++DG PTD+W++A KV R ++G AD Sbjct: 126 D--AMLVSETGFKVPVIIFMSDGGPTDDWESAFEKVCANNRWVRAATKIALAVG-DNADR 182 Query: 167 KTLAQISVRQPLPL----QGLQFRELFSWLS--SSL----RSVSRSTPGTEV 208 + LA+I+ P + ++L +S +S+ S S V Sbjct: 183 EVLARIADGNPEAVVPVSDSATLQKLIKVVSVTASMINGRSRTSDSNQNDIV 234 >UniRef50_D0N9W4 Putative uncharacterized protein n=2 Tax=Phytophthora infestans T30-4 RepID=D0N9W4_PHYIN Length = 2146 Score = 124 bits (312), Expect = 2e-27, Method: Composition-based stats. Identities = 46/190 (24%), Positives = 77/190 (40%), Gaps = 21/190 (11%) Query: 19 RCPCILLLDVSGSMNGRPINELNAGLVTFRDELLADPLALKRVELGIVTFGPVHVEQ--- 75 + + +LD SGSMNG+P N+L A + +AD L V +VTF Sbjct: 1901 KMHHVFVLDCSGSMNGQPWNDLMAAWKEYVYNRIADGATLDLV--SVVTFDNSAQIVYEA 1958 Query: 76 -PFTSAANFFPPILFAQGDTPMGAAITKALDMVEERKREYRANGISYYRPWIFLITDGAP 134 T+ N I + G T A + A +++ R N ++P I +DG P Sbjct: 1959 RSITTVTN--ARIQYRGGGTNYAAGLRSANEVL------SRVN-FDMFKPAIVFFSDGHP 2009 Query: 135 TDEWQ--AAANKVFRGEEDKRFAFFSIGVQGADMKTLAQISVR----QPLPLQGLQFREL 188 D Q A + E F++G ++ L +++ + L G + + Sbjct: 2010 CDPLQGEELATHIRGCYERNGLQAFAVGFGSINLNMLERVAEKLGGTYHHVLTGNELKAT 2069 Query: 189 FSWLSSSLRS 198 F +S+SL + Sbjct: 2070 FFSISASLST 2079 >UniRef50_UPI0001B4AB93 von Willebrand factor type A n=1 Tax=Bacteroides sp. 2_1_7 RepID=UPI0001B4AB93 Length = 250 Score = 124 bits (311), Expect = 3e-27, Method: Composition-based stats. Identities = 35/183 (19%), Positives = 67/183 (36%), Gaps = 6/183 (3%) Query: 16 PEPRCPCILLLDVSGSMNGRPINELNAGLVTFRDEL--LADPLALKRVELGIVTFGPVH- 72 P + ++D SGSM G+ I +N + + ++D + + + F Sbjct: 10 PRRKMILFFVIDTSGSMIGKKIGSVNDAIENVLPMIGEISDENPDAEINVAALEFSTGTR 69 Query: 73 -VEQPFTSAANFFPPILFAQGDTPMGAAITKALDMVEERKREY-RANGISYYRPWIFLIT 130 + A +F + A G T +G A + + ++G Y+ P I L++ Sbjct: 70 WLYDEPKEAKDFIWQKVEANGLTSLGEACEELNKKLSRSGGFMPTSSGSGYFSPAIILLS 129 Query: 131 DGAPTDEWQAAANKVFRGEEDKRFAFFSIGVQ-GADMKTLAQISVRQPLPLQGLQFRELF 189 DG PTD ++ + K +I + AD + L Q + + L Sbjct: 130 DGGPTDNFEGGLKTLQGNSWFKHAIKIAIAIGDDADKEVLKQFTGSSEAVITVHNIEALK 189 Query: 190 SWL 192 + Sbjct: 190 KMI 192 >UniRef50_B7AIG3 Putative uncharacterized protein n=2 Tax=Bacteroidales RepID=B7AIG3_9BACE Length = 247 Score = 121 bits (305), Expect = 1e-26, Method: Composition-based stats. Identities = 40/212 (18%), Positives = 75/212 (35%), Gaps = 12/212 (5%) Query: 10 SDFASNPEPRCPCILLLDVSGSMNGRPINELNAGLVTFRDEL--LADPLALKRVELGIVT 67 D S P ++D SGSM G I +N + L ++ +++ + Sbjct: 5 DDVVSVPRRTMTLFFVIDTSGSMAGNKIGAVNDAVENVLPMLDEISASNPDAEIKVAALE 64 Query: 68 FGPVH--VEQPFTSAANFFPPILFAQGDTPMGAAITKALDMVEERKREYRANGISYYRPW 125 F + A+ F + A G T +GAA + + + + + P Sbjct: 65 FSSGCNWLYDEPKLASEFVWQDVTASGLTSLGAACQELNTKL--SRNGFMQTPSGSFAPA 122 Query: 126 IFLITDGAPTDEWQAAANKVFRGEEDKRFAFFSIGVQ-GADMKTLAQISVRQPLPLQGLQ 184 I L++DG PTD++ +K+ K +I + AD L Q + Sbjct: 123 IILLSDGGPTDDFYGGLSKLKANNWFKNAIKIAIAIGDDADKDVLTQFTGTNEAVFTVHN 182 Query: 185 F---RELFSWLSSSLRSVS--RSTPGTEVVLE 211 +++ ++ + + ST G E Sbjct: 183 IDALKQIIRVVAVTSSQIGSKSSTAGDTTKQE 214 >UniRef50_C8N9L6 Tellurium resistance protein n=1 Tax=Cardiobacterium hominis ATCC 15826 RepID=C8N9L6_9GAMM Length = 149 Score = 120 bits (302), Expect = 3e-26, Method: Composition-based stats. Identities = 40/150 (26%), Positives = 66/150 (44%), Gaps = 5/150 (3%) Query: 51 LLADPLALKRVELGIVTFGP-VHVEQPFTSAANFFPPILFAQGDTPMGAAITKALDMVEE 109 + DP AL+ V L I+T+ P T + P + G + +GAA+ + V Sbjct: 1 MRQDPYALESVYLSIITYNTHAKEVLPPTPLMDVVVPAFTSGGASCLGAALECVVRAVRR 60 Query: 110 RKREYRANGISYYRPWIFLITDGAPTDEWQ-AAANKVFRGEEDKRFAFFSIGVQGADMKT 168 + S Y+P +F+ TDG P+D + A + + E A G + AD+ Sbjct: 61 DRIAANKAQRSDYKPILFIFTDGTPSDPFVYNATVPIIKALEFTNIAVCVAGAK-ADVAV 119 Query: 169 LAQIS--VRQPLPLQGLQFRELFSWLSSSL 196 L Q++ V + Q ++ W+SSSL Sbjct: 120 LKQLTDLVISLADDEARQIKDCIRWVSSSL 149 >UniRef50_A8SDD9 Putative uncharacterized protein n=2 Tax=Ruminococcaceae RepID=A8SDD9_9FIRM Length = 247 Score = 119 bits (299), Expect = 6e-26, Method: Composition-based stats. Identities = 42/194 (21%), Positives = 70/194 (36%), Gaps = 14/194 (7%) Query: 22 CILLLDVSGSMNGRPINELNAGLVTFRDELLADPLAL---KRVELGIVTFGPVH-VEQP- 76 ++D SGSM G I +N+ + L + A +++ I+ F P Sbjct: 3 LFYVVDTSGSMCGSKIGSVNSAMEEAITSDLPEISAANDDAEIKVAIMQFSSGCSWITPQ 62 Query: 77 --FTSAANFFPPILFAQGDTPMGAAITKALDMVEERKREYRANGISYYRPWIFLITDGAP 134 + + L A G T +GAA + + + EY + Y P I L +DG P Sbjct: 63 SGPIAIGDVIWNDLNAGGVTDLGAACKELDKKL--SRNEYLNSQTGAYAPVILLFSDGGP 120 Query: 135 TDEWQAAANKVFRGEEDKRFAFFSIGVQ-GADMKTLAQISVRQPLPLQGLQ---FRELFS 190 TD W+ ++ K +I + AD LA+ + + + L Sbjct: 121 TDNWEKELKQLKLNNWFKHAIKIAIAIGDDADKTVLAEFTGTIESVITVNDKHTLKALIR 180 Query: 191 WLSSSLRS-VSRST 203 +S S S Sbjct: 181 KVSVRASEFQSHSK 194 >UniRef50_D2AQW9 Uncharacterized protein encoded in toxicity protection region of plasmid 478 contains von Willebrand factor (VWF) domain n=1 Tax=Streptosporangium roseum DSM 43021 RepID=D2AQW9_STRRD Length = 202 Score = 119 bits (298), Expect = 9e-26, Method: Composition-based stats. Identities = 42/158 (26%), Positives = 63/158 (39%), Gaps = 5/158 (3%) Query: 16 PEPRCPCILLLDVSGSMNGRPINELNAGLVTFRDELLADPLALKRVELGIVTFG-PVHVE 74 P L+++ S SM GR + E+ L F DELL P+ RV + +V F Sbjct: 1 MRRVVPIYLVVNTSRSMAGR-LVEIEHVLAAFADELLFSPILGDRVRVCVVAFSDSARCV 59 Query: 75 QPFTSAANFFP-PILFAQGDTPMGAAITKALDMVEERKREYRANGISYYRPWIFLITDGA 133 P + P L +T + + + A G+ Y RP L+TDG Sbjct: 60 LPLSDLTGIVELPKLVPGRETRFAPMFDLLTSLTDVDASAHLAAGLVYDRPLALLVTDGV 119 Query: 134 PTDE-WQAAANKVFRGEEDKRFAFFSIGVQGADMKTLA 170 P D WQ A ++ +R R +IGV + + Sbjct: 120 PADPGWQRAFDRFYR-HAHPRIVLITIGVGAPQAEEMG 156 >UniRef50_C1A2B7 Putative uncharacterized protein n=1 Tax=Rhodococcus erythropolis PR4 RepID=C1A2B7_RHOE4 Length = 233 Score = 115 bits (289), Expect = 1e-24, Method: Composition-based stats. Identities = 48/221 (21%), Positives = 87/221 (39%), Gaps = 29/221 (13%) Query: 20 CPCILLLDVSGSMNGRPINELNAGLVTFRDELLADPLALKRVELGIVTFGP-VHVEQPFT 78 P L+ DVS SM I E+N + ++E+L DP+ + +++F ++ P Sbjct: 7 LPFYLVFDVSYSME-PVIGEVNNAMRALKNEILKDPILGDIARVCVLSFSDEARIDVPMC 65 Query: 79 SAANF----FPPILFAQGDTPMGAAITKALDMVEERKREYRANGISY-YRPWIFLITDGA 133 A+ L +G T + + + + +G +RP +F +TDG Sbjct: 66 DLADDTRITREDFLQVRGGTSFAPIFDLIGERIAADIADLKGHGEGKVFRPTVFFVTDGV 125 Query: 134 PTD---EWQAAANKVFRGEEDKRFAFFSIGVQGADMKTLAQISVRQPLPLQGLQF----- 185 PTD EW +A ++ + G+ AD + L I+ P G F Sbjct: 126 PTDAVHEWNSAFTRLTSVKAYPNLV--PFGLGDADEEVLRAITF-PPYRQDGYFFMANAG 182 Query: 186 -------RELFSWLSSSLRSVSRS----TPGTEVVLEAPKG 215 + + ++ S+ S ++S TPG + +G Sbjct: 183 TSAEQAMQAITRIVTQSVVSCTQSAVAGTPGVVMDTRGTEG 223 >UniRef50_UPI0001B4E5FD von Willebrand factor type A n=1 Tax=Streptomyces hygroscopicus ATCC 53653 RepID=UPI0001B4E5FD Length = 228 Score = 114 bits (286), Expect = 2e-24, Method: Composition-based stats. Identities = 45/213 (21%), Positives = 78/213 (36%), Gaps = 19/213 (8%) Query: 17 EPRCPCILLLDVSGSMNGRPINELNAGLVTFRDELLA----DPLALKRVELGIVTFGPVH 72 I LLD S SM G I LN + E+ + +P A + +TF Sbjct: 11 NRPVHFIWLLDCSYSMQGEKIARLNYAIREAIPEMRSVAHDNPAAQLLLRT--LTFSTTA 68 Query: 73 V--EQPFTSAANFFPPILFAQGDTPMGAAITKALDMVEERKREYRANGISYYRPWIFLIT 130 + +F + G T +G A+ ++ RA +P + L++ Sbjct: 69 QWHHKNPVPVDDFTWQDVQVDGMTNLGEALDLVSRELQTPPMPQRA-----LKPVLALVS 123 Query: 131 DGAPTDEWQAAANKVFRGEEDKRFAFFSIGVQ-GADMKTLAQISVRQPL-PLQGL---QF 185 DG PTD+W+A + ++ +I + AD L + L PL Q Sbjct: 124 DGVPTDDWKAGLKAIDATPWGRKAVRVAIAIGEDADRNVLQEFLGNPELRPLDANSPKQL 183 Query: 186 RELFSWLS-SSLRSVSRSTPGTEVVLEAPKGWT 217 W S +++++ S+ + L P + Sbjct: 184 AAAIRWASTAAVKAASQPVAASSDTLSKPLPYA 216 >UniRef50_A8L6A2 von Willebrand factor type A n=1 Tax=Frankia sp. EAN1pec RepID=A8L6A2_FRASN Length = 238 Score = 114 bits (285), Expect = 3e-24, Method: Composition-based stats. Identities = 41/156 (26%), Positives = 61/156 (39%), Gaps = 6/156 (3%) Query: 20 CPCILLLDVSGSMNGRPINELNAGLVTFRDELLADPLALKRVELGIVTF-GPVHVEQPFT 78 +L+D S SM+G P+ +N L + P V LG + F V Sbjct: 15 LAFYILVDASYSMSGAPMLAVNEILPEVISTIEQSPTLGDVVRLGALDFADDARVVLRLD 74 Query: 79 SAANFF-PPILFAQGDTPMGAAITKALDMVEERKREYRANGISYYRPWIFLITDGAPTDE 137 N P A+G T A + +E + + +G YRP +F ITDG PTD+ Sbjct: 75 DLRNIGGVPQFAARGGTSYAAGFRQLRKEIESDLAQLKGDGYKVYRPAVFFITDGEPTDD 134 Query: 138 WQ---AAANKVFRGEEDKRFAFFSIG-VQGADMKTL 169 + AA ++ R G V + +TL Sbjct: 135 QKDLDAAFAELTDANFRGRPNIIPFGVVSSVNKQTL 170 >UniRef50_B8HT03 von Willebrand factor type A n=2 Tax=Cyanothece RepID=B8HT03_CYAP4 Length = 236 Score = 108 bits (271), Expect = 1e-22, Method: Composition-based stats. Identities = 47/217 (21%), Positives = 78/217 (35%), Gaps = 25/217 (11%) Query: 20 CPCILLLDVSGSMNGR-PINELNAGLVTFRDELLA----DPLALKRVELGIVTFGP-VHV 73 I L D SGSM + I LNA + L +P A V + F Sbjct: 20 LHFIWLCDCSGSMVSQGKIQSLNAAIKETIPMLQQTAADNPNAQVLVRA--IKFSDGAEW 77 Query: 74 EQP-FTSAANFFPPILFAQGDTPMGAAITKALDMVEERKREYRANGISYYRPWIFLITDG 132 P T F L A G T +G A+ + + R + P + LI+DG Sbjct: 78 HIPTPTPVDQFRWTDLTAGGVTDLGMALEMVAEQL--RVPPMSERALP---PVLVLISDG 132 Query: 133 APTDEWQAAANKVFRGEEDKRFAFFSIGVQ-GADMKTLAQISVRQPLPL----QGLQFRE 187 PTD++ + + ++ +I V A+ + L + L + Q Sbjct: 133 QPTDDFGSGLKALMAQPWGQKAVRVAIAVGQDANHEVLQKFIGPSELRVLQANNPDQLVN 192 Query: 188 LFSWLSSSLRSVSRS------TPGTEVVLEAPKGWTS 218 W S+ +++VS+ P V+L+ + + Sbjct: 193 QIRWTSTLIKTVSQPRLEKSHRPDQMVILQPQEPAAT 229 >UniRef50_A7RKA1 Predicted protein n=2 Tax=Nematostella vectensis RepID=A7RKA1_NEMVE Length = 1128 Score = 106 bits (264), Expect = 6e-22, Method: Composition-based stats. Identities = 57/208 (27%), Positives = 87/208 (41%), Gaps = 26/208 (12%) Query: 13 ASNPEPRCPCILLLDVSGSMNGRPINELNAGLVTFRDELL-ADPLALKRVELGI----VT 67 A++P+P+ IL++D SGSM G + T D L D +A E G+ VT Sbjct: 205 AASPQPK-DVILVVDYSGSMGGSRLPIAKEAAKTVLDTLNPRDRVAFLAFESGVRRVKVT 263 Query: 68 FGPVHVEQPFTSAANFFPPI-----------LFAQGDTPMGAAITKALDMVEERKREYRA 116 G E+ F S+ P+ +A G T A A D++++ +E Sbjct: 264 SGDAKDEKCFESSLAKASPVNIDILKKFLDGEYASGGTMYAIAFNAAFDILDKYYKEKNT 323 Query: 117 NGISYYRPWIFLITDGAPTDEWQAAANKVFRGEE--DKRFAFFSIGVQGADMKTLAQISV 174 RP I +TDGAP D+ N V + + + G+ G + A + + Sbjct: 324 TR----RPVILFMTDGAPNDDPGTILNTVKTRNQGLSTKADILTFGMGGG--ISPAGVDL 377 Query: 175 RQPLPLQGLQFRELFSW-LSSSLRSVSR 201 Q L Q L F L+++LR VSR Sbjct: 378 LQSLAEQTLDGGARFEVSLTTALRDVSR 405 >UniRef50_C5VFZ9 von Willebrand factor type A n=2 Tax=Corynebacterium matruchotii RepID=C5VFZ9_9CORY Length = 236 Score = 106 bits (264), Expect = 8e-22, Method: Composition-based stats. Identities = 46/213 (21%), Positives = 81/213 (38%), Gaps = 20/213 (9%) Query: 20 CPCILLLDVSGSM------NGRPINELNAGLVTFRDELLADPLALKRVELGIVTF-GPVH 72 P L+DVS SM G ++ N + + + +R+ LG++ F Sbjct: 9 LPVFFLIDVSYSMLEEKPGGGTLLDAANQLVPGIVEACEKYSVLDQRLRLGLIEFCDEAR 68 Query: 73 VEQPFTSAANFFP--PILFAQGDTPMGAAITKALDMVEERKREYRANGISYYRPWIFLIT 130 V P + F P L A+G T AA + + R I +RP +F IT Sbjct: 69 VVIPLSEIDAFSENIPQLVAKGGTNFAAAFWAVFNEMGVAVESLRKPEIGIHRPTVFFIT 128 Query: 131 DGAPTDEWQAAANKVFRGEEDK---RFAFFSIGVQGADMKTLAQISVRQPLPLQGLQ--- 184 DG + + A ++ R FF+ GV A+++ + + Sbjct: 129 DGEDIGDVEERARAWAALSDEGFRYRPNFFTFGVGNANLEGIRAFKLGSGFAAATKDPTR 188 Query: 185 ----FRELFSWLSSSLRSVSRS-TPGTEVVLEA 212 +E+ + L SS+ S S P +++++ Sbjct: 189 AVQRLQEILNTLVSSIVSSSAGDNPTGKIIVDP 221 >UniRef50_C3XUD0 Putative uncharacterized protein n=1 Tax=Branchiostoma floridae RepID=C3XUD0_BRAFL Length = 443 Score = 103 bits (258), Expect = 3e-21, Method: Composition-based stats. Identities = 44/187 (23%), Positives = 67/187 (35%), Gaps = 16/187 (8%) Query: 20 CPCILLLDVSGSMNGRPINELNAGLVTFRDELLADPL-ALKRVELGIVTFGP-VHVEQPF 77 +L LD SGSMNGR + EL G+ F + R + +V FG + QP Sbjct: 3 LDTVLCLDTSGSMNGRGMAELKKGVRHFLLGVQETANKMSLRENVAVVEFGGGARIIQPL 62 Query: 78 TS---AANFFPPILFAQGDTPMGAAITKALDMVEERKREYRANGISYYRPWIFLITDGAP 134 + L A G TPM + +A+ + +R G P + L+TDG P Sbjct: 63 SGNYGTVMQSVDNLKAGGTTPMFEGLMEAMKEILQRGGVLTLPGGRKMTPRVILMTDGYP 122 Query: 135 TDEWQAAANKVFRGEEDKRFAFFS-------IGVQ-GADMKTL---AQISVRQPLPLQGL 183 D+ + G + +G D L A+++ + Sbjct: 123 DDKENVLKAALSFGPAGWQAVGLPHPIPIACVGCGDDVDKDLLQAIAKLTNGMYILGDVS 182 Query: 184 QFRELFS 190 Q E F Sbjct: 183 QLSEFFR 189 >UniRef50_A3YVK5 Tellurium resistance protein n=3 Tax=Cyanobacteria RepID=A3YVK5_9SYNE Length = 260 Score = 103 bits (256), Expect = 6e-21, Method: Composition-based stats. Identities = 47/182 (25%), Positives = 71/182 (39%), Gaps = 20/182 (10%) Query: 5 ITFATSDFASNPEPRCPCILLLDVSGSMNGR-PINELNAGLVT----FRDELLADPLALK 59 + F A+ P I + D SGSM + I LN + + +P A Sbjct: 1 MPFPNVRLANRP---LHFIYICDCSGSMAAQGKIQALNQAIRQSLPGMAEVARQNPEARV 57 Query: 60 RVELGIVTFGPV---HVEQPFTSAANFFPPILFAQGDTPMGAAITKALDMVEERKREYRA 116 V V+F H+E+P T L A G T MG A+ +++ E RA Sbjct: 58 LVRA--VSFADRAAWHLEKP-TEVHQLQWLDLQAGGITAMGEALELVAAVLQSPPMEERA 114 Query: 117 NGISYYRPWIFLITDGAPTDEWQAAANKVFRGEEDKRFAFFSIGVQ-GADMKTLAQISVR 175 P + LI+DG PTD++ A + R ++ +I + AD + L Q Sbjct: 115 LP-----PVLVLISDGQPTDDFDAGLASLMRQPWAQKAVRLAIAMGHDADTEVLQQFIGS 169 Query: 176 QP 177 P Sbjct: 170 DP 171 >UniRef50_UPI0001B4AD96 von Willebrand factor type A n=1 Tax=Bacteroides sp. 2_1_7 RepID=UPI0001B4AD96 Length = 194 Score = 99.6 bits (247), Expect = 7e-20, Method: Composition-based stats. Identities = 35/160 (21%), Positives = 64/160 (40%), Gaps = 10/160 (6%) Query: 22 CILLLDVSGSMNGRPINELNAGLVTFRDELLADPLALKRVELGIVTF--GPVHVEQPFTS 79 +LLD SGSM+G I+ LN + +L K +++ +++F + + Sbjct: 3 LYILLDTSGSMDGSKISALNDSMENIIIDLQEKAFNGKNIDIVVLSFARDVTWMHDKPIN 62 Query: 80 AANFFPPILFAQGDTPMGAAITKALDMVEERKREYRANGISYYRPWIFLITDGAPTDEWQ 139 +F L A G T +G A + + I L++DG PTD++ Sbjct: 63 ILDFNWKPLTASGMTSLGKACCELAKNISTYPANNENT-------AIVLLSDGCPTDDYD 115 Query: 140 AAANKVFRGEEDKRFAFFSIGVQG-ADMKTLAQISVRQPL 178 ++ + F+I + AD++TL + Q Sbjct: 116 EGIMELRNLQTFNDADKFAIALGDNADLQTLIRFVDVQEN 155 >UniRef50_A7C6I0 Protein containing Von Willebrand factor, type A n=1 Tax=Beggiatoa sp. PS RepID=A7C6I0_9GAMM Length = 127 Score = 95.0 bits (235), Expect = 2e-18, Method: Composition-based stats. Identities = 35/100 (35%), Positives = 51/100 (51%), Gaps = 3/100 (3%) Query: 61 VELGIVTF-GPVHVEQPFTSAANFFPPILFAQGDTPMGAAITKALDMVEERKREYRANGI 119 +E+ I+TF V E NF PIL +G T + + +A+ VE RK+ Y+ G Sbjct: 1 MEISIITFHSDVKNELEPALVDNFTMPILTTKGSTKLVDGVREAIAKVEARKQWYKETGQ 60 Query: 120 SYYRPWIFLITDGAPTDEW--QAAANKVFRGEEDKRFAFF 157 YYRPWI ITDG P + + ++ E K+F F+ Sbjct: 61 PYYRPWIIAITDGEPDSDQDVEGLTQEIRTAIEGKKFVFW 100 >UniRef50_B5W7H4 von Willebrand factor type A n=1 Tax=Arthrospira maxima CS-328 RepID=B5W7H4_SPIMA Length = 488 Score = 93.4 bits (231), Expect = 5e-18, Method: Composition-based stats. Identities = 40/173 (23%), Positives = 66/173 (38%), Gaps = 24/173 (13%) Query: 12 FASNPEPRCPCILLLDVSGSMNGRPINELNAGLVTFRDELLADPLALKRVELGIVTFGP- 70 F + P+ +LL+D SGSM+G+ + E+ F + LKR +L +V F Sbjct: 46 FLTKPKA---VVLLIDTSGSMSGQKLREVQTAASEFVS--RQN---LKRHDLAVVEFSSR 97 Query: 71 VHVEQPFT---SAANFFPPILFAQGDTPMGAAITKALDMVEERKREYRANGISYYRPWIF 127 V FT + L A+G T + A +++ R P I Sbjct: 98 ASVVADFTRNETELQQAIARLSARGGTNLSEGFNLATSVLQNSDRT----------PNIL 147 Query: 128 LITDGAPTDEWQAAANKVFRGEEDKRFAFFSIGVQGADMKTLAQISVRQPLPL 180 L TDG P + AA+ + + ++G A + L ++ L Sbjct: 148 LFTDGVPNNPPMAAS--IAQQIRASGINLVAVGTGDAQINYLTALTGDPDLVF 198 >UniRef50_A1SV07 Putative uncharacterized protein n=1 Tax=Psychromonas ingrahamii 37 RepID=A1SV07_PSYIN Length = 154 Score = 91.9 bits (227), Expect = 1e-17, Method: Composition-based stats. Identities = 36/154 (23%), Positives = 66/154 (42%), Gaps = 4/154 (2%) Query: 63 LGIVTFGPVHVEQPFTSAANFFP-PILFAQGDT-PMGAAITKALDMVEERKREYRANGIS 120 + I T V P + P+ A G M + A +M+ R + + + Sbjct: 2 VLIQTADYCRVISPLKRVEDINSIPLFGAYGKDLAMAQGLEVAFNMLLTRVHQLKKKQVK 61 Query: 121 YYRPWIFLITDGAPTDEWQAAANKVFRGEEDKRFAFFSIGVQGADMKTLAQISVRQPLPL 180 RPW+ L TDG +D + A K++R + + F + + + LA+ S PL + Sbjct: 62 VKRPWLILFTDGLGSDYDKETAKKLYRYSLENKLIVFIVNIGK-ETDDLAKFSSIAPLVI 120 Query: 181 QGLQFRELFSWLSSSLRSVSRSTPGTEVVLEAPK 214 + +F+W +S + + G V+L+AP+ Sbjct: 121 NISKIESIFTWFYNSFTEIILAEDGI-VILKAPE 153 >UniRef50_UPI0000D560E4 PREDICTED: similar to inter-alpha (globulin) inhibitor H4 (plasma Kallikrein-sensitive glycoprotein) n=5 Tax=Tribolium castaneum RepID=UPI0000D560E4 Length = 842 Score = 90.7 bits (224), Expect = 3e-17, Method: Composition-based stats. Identities = 53/238 (22%), Positives = 85/238 (35%), Gaps = 58/238 (24%) Query: 7 FATSDFASNPEPRCPCILLLDVSGSMNGRPINELNAGLVTFRDELLADPLALKRVELGIV 66 FA S+ + P+ I +LD SGSM+G I +L + + EL + + IV Sbjct: 297 FAPSEVEALPKQ---VIFVLDTSGSMDGNRIKQLKEAMNSILSELKKEDVFNIVEFSSIV 353 Query: 67 TFGPVHVEQPFTSAANFFPP------------------------------------ILFA 90 V Q P L A Sbjct: 354 KVWNVDKVQVDYEVGEDPWPLYDSPEAPQKNKTNQVLPPAYKATDENKEKAKKVVEKLNA 413 Query: 91 QGDTPMGAAITKALDMVEERKREYRANGISYYRPWIFLITDGAP------TDEWQAAANK 144 G T + +A+ L +V++ K N ++P I +TDG P T++ +A ++ Sbjct: 414 YGGTDIKSALEVGLKLVKKNKE----NKEDAHQPIIVFLTDGEPTMGETNTEKITSAISE 469 Query: 145 VFRGEEDKRFAFFSIGVQGADMKTLAQISVRQ--------PLPLQGLQFRELFSWLSS 194 + GE S G GAD + L +IS++ LQ +E + +SS Sbjct: 470 MNSGETRAPIFSLSFG-DGADREFLQKISLKNLGFARHIYEAADASLQLQEFYKQISS 526 >UniRef50_Q4GZD0 Putative uncharacterized protein n=2 Tax=Trypanosoma brucei RepID=Q4GZD0_9TRYP Length = 4600 Score = 88.8 bits (219), Expect = 1e-16, Method: Composition-based stats. Identities = 44/159 (27%), Positives = 70/159 (44%), Gaps = 18/159 (11%) Query: 14 SNPEPRCPCILL-LDVSGSMNGRPINELNAGLVTFRD-ELLADPLALKRV-ELGIVTFGP 70 + P R IL+ LD S SM NAG+++ R L+A+ L V ELGI FG Sbjct: 4338 TKPNKRSYQILVALDDSLSMQCN-----NAGIMSCRAVALIAEALQQLEVGELGIACFGK 4392 Query: 71 V-----HVEQPFT--SAANFFPPILFAQGDTPMGAAITKALDMVEERKREYRA---NGIS 120 + +PF S F I FAQ T + + LD +++ +R + Sbjct: 4393 ETRIVHEMHEPFVAESGPRAFSEITFAQKSTNLKLLLETTLDYLDDARRRMNGQIRSSTQ 4452 Query: 121 YYRPWIFLITDGAPTDEWQAAANKVFRGEEDKRFAFFSI 159 + +F+I+DG T++ + R EE+ + F + Sbjct: 4453 RLQQMMFIISDGQITEDRMELRKLLMRAEENHQMVVFVL 4491 >UniRef50_A7I7X6 Putative uncharacterized protein n=1 Tax=Candidatus Methanoregula boonei 6A8 RepID=A7I7X6_METB6 Length = 229 Score = 88.4 bits (218), Expect = 1e-16, Method: Composition-based stats. Identities = 45/223 (20%), Positives = 74/223 (33%), Gaps = 29/223 (13%) Query: 17 EPRCPCILLLDVSGSMNGRPINELNAGLVTFRDELLADPLALKRVELGI--VTFGPV--- 71 + L D S SM G+ I LN + E+ A +V++ + + F Sbjct: 3 RKQLHFFWLADCSDSMRGKKIATLNQAIREALPEVQKAVAAYPQVDIRMRAIKFSNDAAW 62 Query: 72 HVEQPFTSAANFFPPILFAQGDTPMGAAITKALDMVEERKREYRANGISYYRPWIFLITD 131 HV F P L +G T AI + + R P LI+D Sbjct: 63 HVGPDPVPINEFVWPELETEGLTATAKAIRLLTGELSIERMPRRGLP-----PICILISD 117 Query: 132 GAPTDEWQA---AANKVFRGEEDKRFAFFSIGVQGA---DMKTLAQISVRQPLPL----Q 181 G TD + A ++ + + +I + + L + + ++ + L Sbjct: 118 GFCTDPREEYDTAIAELGKIPWGIKAVRLAIAIGDESDYNATELLKFANQESVGLLKAHS 177 Query: 182 GLQFRELFSW-----LSSSLRSVSRSTPGTE----VVLEAPKG 215 + W +S R SR P E V LE+P Sbjct: 178 PEELVAYIKWASVSASVASSRGRSRGAPAEEDTSNVALESPPP 220 >UniRef50_UPI0001C38CFE von Willebrand factor, type A n=1 Tax=Arthrospira platensis str. Paraca RepID=UPI0001C38CFE Length = 489 Score = 86.5 bits (213), Expect = 5e-16, Method: Composition-based stats. Identities = 38/175 (21%), Positives = 61/175 (34%), Gaps = 28/175 (16%) Query: 12 FASNPEPRCPCILLLDVSGSMNGRPINELNAGLVTF--RDELLADPLALKRVELGIVTFG 69 F + P+ ++L+D SGSM+G + E+ F R L D +L +V F Sbjct: 47 FLTKPQA---VVMLIDTSGSMSGSKLPEVQRAASEFVSRQNLKRD-------DLAVVEFS 96 Query: 70 P-VHVEQPFTSAAN---FFPPILFAQGDTPMGAAITKALDMVEERKREYRANGISYYRPW 125 V FT L A G T + A +++ R Sbjct: 97 SRASVVADFTRDERELQQAIARLSAWGGTNLSEGFNLATSVLQNSDRPGN---------- 146 Query: 126 IFLITDGAPTDEWQAAANKVFRGEEDKRFAFFSIGVQGADMKTLAQISVRQPLPL 180 I L TDG P + AA+ + + ++G A + L ++ L Sbjct: 147 ILLFTDGEPNNRRMAAS--IAQQIRASGINLVAVGTGDAPVNYLTALTGDPDLVF 199 >UniRef50_A5N5I8 DnaK4 n=2 Tax=Clostridium kluyveri RepID=A5N5I8_CLOK5 Length = 604 Score = 83.4 bits (205), Expect = 5e-15, Method: Composition-based stats. Identities = 34/146 (23%), Positives = 56/146 (38%), Gaps = 9/146 (6%) Query: 12 FASNPEPRCPCILLLDVSGSMN-GRPINELNAGLVTFRDELLADPLALK-RVELGIVTFG 69 F E I ++D SGSMN ++ + G+ E+ + G++ F Sbjct: 381 FTHCMEKSLYFIWMIDCSGSMNIDNRLDYVKEGMKCVISEIEGKCRLDNITLNFGVIKFS 440 Query: 70 PVHV--EQPFTSAANFFPPILFAQGDTPMGAAITKALDMVEERKREYRANGISYYRPWIF 127 V + + + G T +G+AI + + +YR P IF Sbjct: 441 DTAVWNVEIGEKINDSIYEDMNPGGITSLGSAIDMVSSELNKINIDYRT-----LTPVIF 495 Query: 128 LITDGAPTDEWQAAANKVFRGEEDKR 153 LITDG PTD ++ A ++ K Sbjct: 496 LITDGMPTDNYEGAVERLLVNSVGKN 521 >UniRef50_A7C1J8 von Willebrand factor, type A n=1 Tax=Beggiatoa sp. PS RepID=A7C1J8_9GAMM Length = 478 Score = 83.4 bits (205), Expect = 6e-15, Method: Composition-based stats. Identities = 43/182 (23%), Positives = 66/182 (36%), Gaps = 18/182 (9%) Query: 16 PEPRCPCILLLDVSGSMN-GRPINELNAGLVTFRDELLADPLALKRVELGIVTFGPVHVE 74 P P ILL+D SGSM G + E+ A + F ++ +V FG Sbjct: 39 PLPPHDVILLIDTSGSMAEGTKLQEVQAAAIQFIQRRHGLTHLANN-KIAVVGFGGRAYL 97 Query: 75 QP--FTSAANFFPP--ILFAQGDTPMGAAITKALDMVEERKREYRANGISYYRPWIFLIT 130 + N P L A G TPM + A++ + + G + I L T Sbjct: 98 VANLTSDLMNLEQPIQKLRAVGGTPMDRGLQSAMNQL--------SAGSDSEQRSILLFT 149 Query: 131 DGAPTDEWQAAANKVFRGEEDKRFAFFSIGVQGADMKTLAQISVRQPLPLQGL--QFREL 188 DG P D + N + ++ +I AD+ L Q++ L F + Sbjct: 150 DGKP-DNQRTTLNA-SQLVKNANIQIVAIATDDADIGLLTQVTGDAALVFPTSVGNFDQA 207 Query: 189 FS 190 F Sbjct: 208 FQ 209 >UniRef50_C5CDX4 von Willebrand factor type A n=1 Tax=Kosmotoga olearia TBF 19.5.1 RepID=C5CDX4_KOSOT Length = 730 Score = 82.6 bits (203), Expect = 8e-15, Method: Composition-based stats. Identities = 42/223 (18%), Positives = 86/223 (38%), Gaps = 35/223 (15%) Query: 15 NPEPRCP--CILLLDVSGSMNGRPINELNAGLVTFRDELLADPLALKRVELGIVTFGP-- 70 PE P + +LD+SGSM+G+ I + L+ L I+TF Sbjct: 267 EPERIIPKDIVFILDISGSMSGQKIEKAKLALLQVLQMLHEGD------RFSIITFNNEV 320 Query: 71 ---VHVEQPFTSAANFFPP--ILFAQGDTPMGAAITKALDMVEERKREYRANGISYYRPW 125 PF+ ++P + A G T + A+ + ++++ + Y+ Sbjct: 321 NNLTERLLPFSDRTEWYPAVKQIMAGGMTNIHDALLEGIEVL------GTQSTDDRYK-V 373 Query: 126 IFLITDGAPTD---EWQAAANKVFRGEEDKRFAFFSIGVQ-GADMKTLAQISVRQPLPLQ 181 + +TDGAPT+ + + + + F GV + + L +++ + ++ Sbjct: 374 VLFLTDGAPTEGITDIGTIIRDSTKLAKVRDVHLFVFGVGYDVNAELLDELAEKGGGKVK 433 Query: 182 --------GLQFRELFSWL-SSSLRSVSRSTPGTEVVLEAPKG 215 + EL+ + + + +V GT++ PKG Sbjct: 434 YIVENEEIDEKVLELYRMIETPVMSNVHLEINGTDISYVLPKG 476 >UniRef50_A4YGU7 von Willebrand factor, type A n=12 Tax=Sulfolobaceae RepID=A4YGU7_METS5 Length = 383 Score = 81.1 bits (199), Expect = 2e-14, Method: Composition-based stats. Identities = 38/157 (24%), Positives = 61/157 (38%), Gaps = 20/157 (12%) Query: 23 ILLLDVSGSMNGRPINELNAGLVTFRDELLADPLALKRVELGIVTFGP-VHVEQPFTSAA 81 I+LLD SGSM+G I G + + ++ VTF V++ + F Sbjct: 41 IVLLDTSGSMDGLKIESAKKGAIELLKRIPQGN------KVSFVTFSSRVNIVREFVDPE 94 Query: 82 NFFPPI--LFAQGDTPMGAAITKALDMVEERKREYRANGISYYRPWIFLITDGAPTDEWQ 139 + I L A G T A+ A ++ + +GI Y + L+TDG PTD+ Sbjct: 95 DLTAEISSLSAGGQTAFFTALLTAFNL-------HNKHGIPSY---VILLTDGNPTDDTN 144 Query: 140 AAANKVFRGEEDKRFAFFSIGVQGADMKTLAQISVRQ 176 K + F +G + L ++ R Sbjct: 145 VETYKRIAIPNGVQTISFGLG-DDYNETILKSLADRS 180 >UniRef50_Q10Z89 von Willebrand factor, type A n=1 Tax=Trichodesmium erythraeum IMS101 RepID=Q10Z89_TRIEI Length = 477 Score = 80.3 bits (197), Expect = 4e-14, Method: Composition-based stats. Identities = 37/191 (19%), Positives = 61/191 (31%), Gaps = 25/191 (13%) Query: 16 PEPRCPCILLLDVSGSMNGRPINELNAGLVTFRDELLADPLALKRVELGIVTFGPVHVEQ 75 P+P+ +LL+D S SM G + E+ A F + L L IV F Sbjct: 43 PQPQT-VVLLIDTSSSMWGGKLPEVQAAATGFV-----ERQNLTVNNLAIVEFSSNSQVL 96 Query: 76 PFTSAANFFPPI----LFAQGDTPMGAAITKALDMVEERKREYRANGISYYRPWIFLITD 131 A L G T + + ++ P I L TD Sbjct: 97 TNFDADKTELKQAIANLTPSGGTNLSQGLKTVASLLRNSNT-----------PNILLFTD 145 Query: 132 GAPTDEWQAAANKVFRGEEDKRFAFFSIGVQGADMKTLAQISVRQPLP--LQGLQFRELF 189 G P D A+ + R + ++G A+ L ++ L + + F Sbjct: 146 GQPNDP--RASKSIAREIREAGINLVTVGTGDANSNYLTSLTENPDLVFFANSGEIDQAF 203 Query: 190 SWLSSSLRSVS 200 ++ +S Sbjct: 204 RAAEKAISQLS 214 >UniRef50_UPI000180CCF8 PREDICTED: similar to PK-120 n=1 Tax=Ciona intestinalis RepID=UPI000180CCF8 Length = 864 Score = 80.3 bits (197), Expect = 4e-14, Method: Composition-based stats. Identities = 41/180 (22%), Positives = 66/180 (36%), Gaps = 23/180 (12%) Query: 12 FASNPEPRCP--CILLLDVSGSMNGRPINELNAGLVTFRDELLADPLALKRVELGIVTFG 69 FA P P + ++DVSGSM+G I + L T D+L + + I+TF Sbjct: 288 FAPTNLPVIPKKVVFVIDVSGSMSGHKIVQTKEALRTILDDLN------EIDQFNIITFS 341 Query: 70 PVHVEQPFTSAANFFP----------PILFAQGDTPMGAAITKALDMVEERKREYRANGI 119 + P ++A+G T AA + ++E Sbjct: 342 STTNVWHPNEMVDVNPTNIRNAKKHVRSMYARGGTNFNAAALDGIQLLET--ISSNRTNT 399 Query: 120 SYYRPWIFLITDGAPTDEWQ--AAANKVFRGEEDKRFAFFSIGVQGA-DMKTLAQISVRQ 176 + L+TDG PT A + R + R++ F +G D + L QI+ Sbjct: 400 LEEASMMILLTDGQPTVGVTGNEAIRRNIRERVNGRYSIFCLGFGQHLDHEFLDQIASEN 459 >UniRef50_UPI0001C161B1 von Willebrand factor, type A Precursor n=2 Tax=Nostocaceae RepID=UPI0001C161B1 Length = 474 Score = 80.0 bits (196), Expect = 6e-14, Method: Composition-based stats. Identities = 44/174 (25%), Positives = 63/174 (36%), Gaps = 23/174 (13%) Query: 22 CILLLDVSGSMNGRPINELNAGLVTFRDELLADPLALKRVELGIVTFG-PVHVEQPFTSA 80 +LL+D S SM+ + E+ F L+ ++ +V FG V P T+ Sbjct: 52 IVLLIDTSSSMSDGKLAEVKTAASQFIQRRN-----LESDQIAVVNFGATVQTPAPLTND 106 Query: 81 ANF---FPPILFAQGDTPMGAAITKALDMVEERKREYRANGISYYRPWIFLITDGAPTDE 137 N L G TPMG I A D ++ I L TDG P D+ Sbjct: 107 INTLNNAIDQLLEIGSTPMGEGINTAQDQLQATTLNKN----------IILFTDGLP-DD 155 Query: 138 WQAAANKVFRGEEDKRFAFFSIGVQGADMKTLAQISVRQPLP--LQGLQFRELF 189 A N + ++ GAD L QI+ + L QF + F Sbjct: 156 PNFAYNSAL-SVRNAGIKLIAVATGGADTNYLTQITGDRSLVFYANSGQFDQAF 208 >UniRef50_C3ZEP6 Putative uncharacterized protein n=2 Tax=Branchiostoma floridae RepID=C3ZEP6_BRAFL Length = 994 Score = 78.4 bits (192), Expect = 1e-13, Method: Composition-based stats. Identities = 38/141 (26%), Positives = 52/141 (36%), Gaps = 25/141 (17%) Query: 11 DFASNPEPRCPCILLLDVSGSMNGRPINELNAGLVTFRDELLADPLALKRVELGIVTFG- 69 DF E +L+LD SGSM G PI LN F L LGIVTF Sbjct: 321 DFVLLQEKEPMMVLVLDTSGSMRGDPIRRLNQAATHFIRS-----TVLDDSWLGIVTFST 375 Query: 70 PVHVEQPF---------TSAANFFPPILFAQGDTPMGAAITKALDMVEERKREYRANGIS 120 + P TS N P G T +G A+ + + ++E + Sbjct: 376 TANTYHPLLQITSAADRTSLINRVPS--TVGGTTCIGCALLEGVKVLEAQGDPSGG---- 429 Query: 121 YYRPWIFLITDGAPTDEWQAA 141 +FL++DG + A Sbjct: 430 ----ILFLMSDGQENEAPDIA 446 >UniRef50_C0GKG1 von Willebrand factor type A n=1 Tax=Dethiobacter alkaliphilus AHT 1 RepID=C0GKG1_9FIRM Length = 272 Score = 78.4 bits (192), Expect = 2e-13, Method: Composition-based stats. Identities = 46/205 (22%), Positives = 77/205 (37%), Gaps = 36/205 (17%) Query: 18 PRCPCILLLDVSGSMNGRPINELNAGLVTFRDELLADPLALKRVELGIVTFGPVHVEQPF 77 P LL+D SGSMNG+ I E+ ++ L ++TF V Sbjct: 90 PPLEVCLLVDTSGSMNGKRIREVKTLADNLVRQMHE--------PLSLITFQEGDVGVKV 141 Query: 78 TSAANFFPPI-----LFAQGDTPMGAAITKALDMVEERKREYRANGISYYRPWIFLITDG 132 S N + A G TPMG I A++ + R+ + + LITDG Sbjct: 142 RSTRNDLMVRRGLAAMSAAGLTPMGEGIRTAVNYLCGRRGKKH---------LVILITDG 192 Query: 133 APT------DEWQAAANKVFRGEEDKRFAFFSIGVQGAD--MKTLAQISVRQPLPLQGLQ 184 PT D + A + + + IG++ ++ LA+ + + L Sbjct: 193 LPTWASGDKDPYLDAI-EAGALIKKHKMHLICIGLEPQRKFLEKLAESADASLYIVDDLD 251 Query: 185 FRELFSWLSSSLRSVSRSTPGTEVV 209 RE+ +++ RS G+ + Sbjct: 252 HREI-----AAITRRERSRVGSLIP 271 >UniRef50_Q3M1S2 von Willebrand factor, type A n=1 Tax=Anabaena variabilis ATCC 29413 RepID=Q3M1S2_ANAVT Length = 592 Score = 78.4 bits (192), Expect = 2e-13, Method: Composition-based stats. Identities = 43/177 (24%), Positives = 66/177 (37%), Gaps = 27/177 (15%) Query: 22 CILLLDVSGSMNGRPINELNAGLVTF--RDELLADPLALKRVELGIVTFG-PVHVEQPFT 78 +LL+D S SM+ + E+ F R L D +L +V+FG + P T Sbjct: 52 IVLLIDASSSMSDGKLTEVKTAATKFVERRNLTQD-------KLAVVSFGLDIQTATPLT 104 Query: 79 SAANFFP---PILFAQGDTPMGAAITKALDMVEERKREYRANGISYYRPWIFLITDGAPT 135 A+ L G TPM + A+ ++ I L TDG P Sbjct: 105 DNADTLESAIASLSEAGGTPMAQGLDAAIGELQATFLSRN----------ILLFTDGVP- 153 Query: 136 DEWQAAANKVFRGEEDKRFAFFSIGVQGADMKTLAQISVRQPLP--LQGLQFRELFS 190 + QA A+ + +R ++ AD LAQ++ L QF + F Sbjct: 154 -DSQALASLSAQSARSQRINLIAVATGDADTNYLAQLTADPSLVFYANSGQFDQAFR 209 >UniRef50_UPI000155CC23 PREDICTED: similar to ITI-like protein n=3 Tax=Amniota RepID=UPI000155CC23 Length = 1374 Score = 78.4 bits (192), Expect = 2e-13, Method: Composition-based stats. Identities = 33/168 (19%), Positives = 59/168 (35%), Gaps = 19/168 (11%) Query: 22 CILLLDVSGSMNGRPINELNAGLVTFRDELLADPLALKRVELGIVTFGPVHVEQPFTSAA 81 + ++DVSGSM G + + + ++L D IVTF + + Sbjct: 320 VVFVIDVSGSMFGTKMKQTKKAMHVILNDLHHDDY------FNIVTFSDAVSVWKASGSI 373 Query: 82 NFFPP----------ILFAQGDTPMGAAITKALDMVEERKREYRANGISYYRPWIFLITD 131 PP + A G T + AA+ A + + E P I +TD Sbjct: 374 QATPPNIKSAKVYVNKMEADGWTDINAALLVAASVFNQSTGETGRGKGLKKIPLIIFLTD 433 Query: 132 GAPTDEWQAAANKVFRGEEDKRFAFFSIGV---QGADMKTLAQISVRQ 176 G T A+ + ++ + G+ AD + ++S+ Sbjct: 434 GEATAGVTVASRILSNAKQSLKGNISLFGLAFGDDADYHLMRRLSLEN 481 >UniRef50_UPI000180D2FB PREDICTED: similar to inter-alpha (globulin) inhibitor H5 n=1 Tax=Ciona intestinalis RepID=UPI000180D2FB Length = 1586 Score = 78.0 bits (191), Expect = 2e-13, Method: Composition-based stats. Identities = 44/198 (22%), Positives = 70/198 (35%), Gaps = 33/198 (16%) Query: 2 SEQITFATSDFASNPEPRCP-----CILLLDVSGSMNGRPINELNAGLVTFRDELLADPL 56 + ++ S FA P + L+DVSGSM G I+++ + T L Sbjct: 923 TTEMVIDQSYFAHFITSNLPPMSKRVVFLIDVSGSMFGIKIDQVRQAMNTILHGL----- 977 Query: 57 ALKRVELGIVTF----------GPVHVEQPFT-----SAANFFPPILFAQGDTPMGAAIT 101 + ++ F G V T SA NF + +G T + A+ Sbjct: 978 -AETDFFSVIAFNSSVSRWSPSGTAAVLASGTTANINSAMNFLNTTVVTRGGTDILQAVE 1036 Query: 102 KALDMVEERKREYRANGISYYRPWIFLITDGAPTDEW--QAAANKVFRGEEDKRFAFFSI 159 A+ + + + + L+TDG PTD A R RF +I Sbjct: 1037 AAIQLFDSAATGGTNTASDF----MVLLTDGRPTDGTVSSTAIISAIRNLNRGRFGINTI 1092 Query: 160 GVQG-ADMKTLAQISVRQ 176 G DM L +I+ + Sbjct: 1093 GFGTLVDMNLLRKIAAQN 1110 >UniRef50_C0HF51 Putative uncharacterized protein n=1 Tax=Zea mays RepID=C0HF51_MAIZE Length = 459 Score = 76.5 bits (187), Expect = 6e-13, Method: Composition-based stats. Identities = 41/201 (20%), Positives = 72/201 (35%), Gaps = 40/201 (19%) Query: 20 CPCILLLDVSGSMNGRPINELNAGLVTFRDE---LLADPLALKRVELGIVTF-GPVHVEQ 75 +++LD+SGSM G + + + F E + D L I+TF H Sbjct: 44 LDIVVVLDISGSMRGTKLEHMKHAMTRFIIEKLGIRGD-------RLAIITFESKAHKVF 96 Query: 76 PFTSAANFFPPI-------LFAQGDTPMGAAITKALDMVEERKREYRANGISYYRPWIFL 128 +S L A GDT + A + LD+++ R+ IFL Sbjct: 97 DLSSMLPDQVKKAVAVVEGLKAGGDTNIKAGLEAGLDVLKTRRGHSHNASC------IFL 150 Query: 129 ITDGAPT-DEWQAAANKVFRGEEDKRFAFFSIGVQGADMKTLAQISVRQPL--------P 179 ++DG D+ + ++V F G + +D + L I+ Sbjct: 151 MSDGHENVDKARTLLDRVGE----HSVVTFGFG-EKSDEQLLYDIAYHSHAGTYHHVREK 205 Query: 180 LQGLQFRELFSWLS--SSLRS 198 Q + F++L+ S+ Sbjct: 206 EDENQLMKAFAFLAIYRSISM 226 >UniRef50_Q86UX2 Inter-alpha-trypsin inhibitor heavy chain H5 n=40 Tax=Euteleostomi RepID=ITIH5_HUMAN Length = 942 Score = 76.5 bits (187), Expect = 6e-13, Method: Composition-based stats. Identities = 37/185 (20%), Positives = 62/185 (33%), Gaps = 29/185 (15%) Query: 7 FATSDFASNPEPRCP--CILLLDVSGSMNGRPINELNAGLVTFRDELLADPLALKRVELG 64 FA D P P + +LD S SM G + + L T +L Sbjct: 284 FAPKDL-----PPLPKNVVFVLDSSASMVGTKLRQTKDALFTILHDLRPQD------RFS 332 Query: 65 IVTFGP---------VHVEQPFTSAANFFPPILFAQGDTPMGAAITKALDMVEERKREYR 115 I+ F + V + + G T + A+ +A+ ++ + Sbjct: 333 IIGFSNRIKVWKDHLISVTPDSIRDGKVYIHHMSPTGGTDINGALQRAIRLLNKYVAH-- 390 Query: 116 ANGISYYR-PWIFLITDGAPTDEWQAAANKVFRGEEDKR--FAFFSIGVQ-GADMKTLAQ 171 +GI I +TDG PT + E R F+IG+ D + L + Sbjct: 391 -SGIGDRSVSLIVFLTDGKPTVGETHTLKILNNTREAARGQVCIFTIGIGNDVDFRLLEK 449 Query: 172 ISVRQ 176 +S+ Sbjct: 450 LSLEN 454 >UniRef50_UPI00016E8A41 UPI00016E8A41 related cluster n=3 Tax=Takifugu rubripes RepID=UPI00016E8A41 Length = 945 Score = 76.1 bits (186), Expect = 7e-13, Method: Composition-based stats. Identities = 40/185 (21%), Positives = 71/185 (38%), Gaps = 28/185 (15%) Query: 7 FATSDFASNPEPRCPCILLLDVSGSMNGRPINELNAGLVTFRDELLADPLALKRVELGIV 66 FA D + P+ + ++D S SM G+ I + L T +L + Sbjct: 288 FAPKDLPAVPKN---VVFVIDTSASMLGKKIRQTKEALFTILGDLRPGD------HFNFI 338 Query: 67 TFGP-VHVEQP--FTSA-------ANFFPPILFAQGDTPMGAAITKALDMVEE--RKREY 114 +F V V QP A F +L G T + +AI ++++ ++ Sbjct: 339 SFSSRVKVWQPGRLVPVTPNNVRDAKKFIFMLPTSGGTNINSAIQTGSSLLQDYLSAQDA 398 Query: 115 RANGISYYRPWIFLITDGAPTDEWQAAANKV--FRGEEDKRFAFFSIGVQ-GADMKTLAQ 171 N +S I +TDG PT + + R +F F+IG+ D + L + Sbjct: 399 SPNSVS----LIIFLTDGQPTVGEVQSVTILGNTRSAVQGKFCIFTIGIGNDVDYRLLER 454 Query: 172 ISVRQ 176 +++ Sbjct: 455 MALDN 459 >UniRef50_C3XUD1 Putative uncharacterized protein n=1 Tax=Branchiostoma floridae RepID=C3XUD1_BRAFL Length = 370 Score = 76.1 bits (186), Expect = 7e-13, Method: Composition-based stats. Identities = 44/186 (23%), Positives = 64/186 (34%), Gaps = 39/186 (20%) Query: 20 CPCILLLDVSGSMNGRPINELNAGLVTFRDELLADPLALK---RVELGIVTFG-PVHVEQ 75 +L LD SGSM GR + EL + F L A K R +G+V FG + Q Sbjct: 3 LDTVLCLDTSGSMAGRGMRELKKAVREFI--LGVQETASKHNLRENVGVVEFGAKTRIVQ 60 Query: 76 PFTSAANFFPPILFAQGDTPMGAAITKALDMVEERKREYRANGISYYRPWIFLITDGAPT 135 P T N + +L A G +G P + L+TDG P Sbjct: 61 PLT---NNYAAVLRAVGG-------------------VLVLSGQKKMTPRVILMTDGHPD 98 Query: 136 DEWQAAANKVFRGEEDKRFAFFS-------IGV-QGADMKTLAQIS-VRQPLPLQG--LQ 184 D+ + G + +G D L ++ + + +QG Q Sbjct: 99 DKQNVLKAALSFGPAGWQAVGLPHPIPIACVGCGGDVDGDLLQAVAKLTNGMYVQGDIGQ 158 Query: 185 FRELFS 190 E F Sbjct: 159 LSEFFR 164 >UniRef50_UPI00017B0D26 UPI00017B0D26 related cluster n=2 Tax=Tetraodon nigroviridis RepID=UPI00017B0D26 Length = 856 Score = 75.7 bits (185), Expect = 1e-12, Method: Composition-based stats. Identities = 41/185 (22%), Positives = 71/185 (38%), Gaps = 28/185 (15%) Query: 7 FATSDFASNPEPRCPCILLLDVSGSMNGRPINELNAGLVTFRDELLADPLALKRVELGIV 66 FA D + P+ + ++D S SM G+ + + L+T +L + Sbjct: 217 FAPKDLPAVPKN---VVFVIDTSASMLGKKMRQTKEALLTILGDLRPAD------RFNFI 267 Query: 67 TFGP-VHVEQP--FTSA-------ANFFPPILFAQGDTPMGAAITKALDMVEE--RKREY 114 +F + V QP A A F +L G T + AI ++ + R+ Sbjct: 268 SFSSRIRVWQPGRLVPATPSAVRDAKKFVVMLPTSGGTDIDGAIQTGSSLLRDHLSGRDA 327 Query: 115 RANGISYYRPWIFLITDGAPT--DEWQAAANKVFRGEEDKRFAFFSIGVQ-GADMKTLAQ 171 N +S I +TDG PT + A R +F F+IG+ D + L + Sbjct: 328 GPNSVS----LIIFLTDGQPTVGEVRPGAILGNARAAVRDKFCIFTIGMGDDVDYRLLER 383 Query: 172 ISVRQ 176 +++ Sbjct: 384 MALDN 388 >UniRef50_C0PS55 Putative uncharacterized protein n=1 Tax=Picea sitchensis RepID=C0PS55_PICSI Length = 829 Score = 75.3 bits (184), Expect = 1e-12, Method: Composition-based stats. Identities = 42/200 (21%), Positives = 69/200 (34%), Gaps = 39/200 (19%) Query: 12 FASNPEPRCPCILL--LDVSGSMNGRPINELNAGLVTFRDELLADPLALKRVELGIVTFG 69 +P R P L+ LDVSGSM+G + L + L + L +V F Sbjct: 348 MVKDPGCRAPIDLVTVLDVSGSMSGTKLALLKRAMAFVISNLSPED------RLSVVVFS 401 Query: 70 PV--------HVEQPFTSAANFFPPILFAQGDTPMGAAITKALDMVEERKREYRANGISY 121 + AAN L G T + + K ++E+R++ Sbjct: 402 STAKRVFSLKRMTPDGQRAANRVVERLLCTGGTNIAEGLRKGAKVLEDRRQRNPVAS--- 458 Query: 122 YRPWIFLITDGA-------------PTDEWQAAANKVFRGEEDKRFA-FFSIGV--QGAD 165 I L++DG P+DE + +A + R + F GV A Sbjct: 459 ----IMLLSDGQDTYSLSSRGVVLFPSDEQRRSARQSTRYGHVQIPVHAFGFGVDHDAAT 514 Query: 166 MKTLAQISVRQPLPLQGLQF 185 M ++++S +Q Sbjct: 515 MHAISEVSGGTFSFIQAESL 534 >UniRef50_Q82LZ6 Putative uncharacterized protein n=1 Tax=Streptomyces avermitilis RepID=Q82LZ6_STRAW Length = 462 Score = 74.9 bits (183), Expect = 2e-12, Method: Composition-based stats. Identities = 35/158 (22%), Positives = 54/158 (34%), Gaps = 26/158 (16%) Query: 20 CPC--ILLLDVSGSMNGRPINELNAGLVTFRDELLADPLALKRVELGIVTFG-PVHVEQP 76 P + ++D SGSM G ++ + + L T EL LGI+TF V P Sbjct: 42 LPVNFVFVVDTSGSMTGTKLDTVKSALQTIYRELRPADC------LGIITFDHNVRTVLP 95 Query: 77 FTSAANFFPPI-------LFAQGDTPMGAAITKALDMVEERKREYRANGISYYRPWIFLI 129 + + P L QG T + + +D + R ++L Sbjct: 96 AVAKQDLPPERFAEVVSALTTQGGTDIDLGVQYGIDEISRHSVSGRTVNC------LYLF 149 Query: 130 TDGAPTD---EWQAAANKVFRGEE-DKRFAFFSIGVQG 163 +DG PT +W V D + F G Sbjct: 150 SDGDPTSGERDWIKVRANVAAKLRGDLTLSCFGFGSDA 187 >UniRef50_Q4SBF6 Chromosome 11 SCAF14674, whole genome shotgun sequence. (Fragment) n=16 Tax=Euteleostomi RepID=Q4SBF6_TETNG Length = 1039 Score = 74.9 bits (183), Expect = 2e-12, Method: Composition-based stats. Identities = 36/170 (21%), Positives = 67/170 (39%), Gaps = 26/170 (15%) Query: 7 FATSDFASNPEPRCP--CILLLDVSGSMNGRPINELNAGLVTFRDELLADPLALKRVELG 64 FA D PR P + ++D+SGSM+G + + ++ ++L + G Sbjct: 423 FAPKDL-----PRLPKNVVFVIDMSGSMSGTKMQQTREAMLKILEDLDPED------HFG 471 Query: 65 IVTFGP---------VHVEQPFTSAANFFPPILFAQGDTPMGAAITKALDMVEERKREYR 115 I+ F + A + + + G T + A + KA+DM++E ++ R Sbjct: 472 IILFDHRIQFWNTSLSKATKENIDEAMVYVKAIQSYGGTDINAPVLKAVDMLKEDRKAKR 531 Query: 116 ANGISYYRPWIFLITDGAPT--DEWQAAANKVFRGEEDKRFAFFSIGVQG 163 S I L+TDG P + + + + + FS+G Sbjct: 532 LPEKSID--MIILLTDGDPNSGESRIPVIQENVKAAIGGQMSLFSLGFGN 579 Score = 42.6 bits (99), Expect = 0.010, Method: Composition-based stats. Identities = 16/68 (23%), Positives = 27/68 (39%), Gaps = 15/68 (22%) Query: 7 FATSDFASNPEPRCP--CILLLDVSGSMNGRPI-NELNAGLVTFRDELLADPLALKRVEL 63 FA D PR P + ++D+SGSM+G + E + + + A Sbjct: 323 FAPKDL-----PRLPKNVVFVIDMSGSMSGTKMQQEAHRAARSLQKRSTDGGTAR----- 372 Query: 64 GIVTFGPV 71 ++F P Sbjct: 373 --ISFSPT 378 >UniRef50_B9XQ17 Vault protein inter-alpha-trypsin domain protein n=1 Tax=bacterium Ellin514 RepID=B9XQ17_9BACT Length = 806 Score = 74.6 bits (182), Expect = 2e-12, Method: Composition-based stats. Identities = 40/175 (22%), Positives = 61/175 (34%), Gaps = 34/175 (19%) Query: 22 CILLLDVSGSMNGRPINELNAGLVTFRDELLADPLALKRVELGIVTFGPVHVEQPFTSAA 81 + +LD SGSM+G+ + + L + L R E I+ F +P Sbjct: 314 VVFVLDTSGSMSGKKMEQAKKALQFCVESLND----GDRFE--IIRFSTES--EPLFDKL 365 Query: 82 NFFPPI-----------LFAQGDTPMGAAITKALDMVEERKREYRANGISYYRPWIFLIT 130 L A G T + A+ KAL + + R + + +T Sbjct: 366 AAVSKENREKAGDFIKNLKAMGGTAIDEALKKALSLESKEGRPF----------VVVFLT 415 Query: 131 DGAP----TDEWQAAANKVFRGEEDKRFAFFSIGVQGADMKTLAQISVRQPLPLQ 181 DG P TDE Q R +E +R F IG + L +I+ Q Sbjct: 416 DGLPTVGTTDEDQILKGMQERNKEKRRIFCFGIG-TDVNTHLLDRIAEETRAFSQ 469 >UniRef50_UPI00006CDDCC von Willebrand factor type A domain containing protein n=1 Tax=Tetrahymena thermophila RepID=UPI00006CDDCC Length = 2138 Score = 74.2 bits (181), Expect = 3e-12, Method: Composition-based stats. Identities = 37/175 (21%), Positives = 67/175 (38%), Gaps = 36/175 (20%) Query: 19 RCPC--ILLLDVSGSMNGRPINELNAGLVTFRDELLADPLALKRVELGIVTFG-PVHVEQ 75 R P I ++D SGSMNG+P++ L L+ D L + ++ F Sbjct: 1443 RFPIDLICVIDTSGSMNGQPLDLLKETLLFLVDLLQTGD------RICLIQFSTNAQRLT 1496 Query: 76 PFTSAANF--------FPPILFAQGDTPMGAAITKALDMVEERKREYRANGISYYRPW-- 125 P S + L A+G T + + A D++++R+ Y P Sbjct: 1497 PLLSIESKDNIKSIKNEINRLVAKGGTNICQGMQLAFDVLKQRR---------YKNPITS 1547 Query: 126 IFLITDGAPTDEWQAAANKV------FRGEEDKRFAFFSIGVQ-GADMKTLAQIS 173 +FL++DG D + + ++ ++ F + G D + +IS Sbjct: 1548 VFLLSDGL-NDGAENKIRDLLKQLNFYQNYNEENFTIQTFGFGKDHDPNLMDKIS 1601 >UniRef50_UPI0001C378BC von Willebrand factor, type A n=1 Tax=Ruminococcus flavefaciens FD-1 RepID=UPI0001C378BC Length = 565 Score = 74.2 bits (181), Expect = 3e-12, Method: Composition-based stats. Identities = 36/175 (20%), Positives = 68/175 (38%), Gaps = 26/175 (14%) Query: 15 NPEPRCPCILLLDVSGSMNGRPINELNAGLVTFRDELLADPLALKRVELGIVTFGPVHVE 74 N + +LD SGSM+G P+N L A L + + +G+V++ +V Sbjct: 386 NSGKPIAAVFVLDTSGSMSGAPLNSLKASLRNSIKYINSSNY------IGVVSYSS-NVN 438 Query: 75 QPFTSAANFFPPI----------LFAQGDTPMGAAITKALDMVEERKREYRANGISYYRP 124 A F L A G+T +A+++A+ M+ + ++ P Sbjct: 439 VDL-ELAKFDLNQQAYFMGAVDSLTASGNTATFSALSQAMIMLRDFTKDNPNVS-----P 492 Query: 125 WIFLITDGAPTDEWQAAANKVFRGEEDKRFAFFSIGVQGADMKTLAQISVRQPLP 179 +FL++DG + + + + ++IG A++ L IS Sbjct: 493 MVFLLSDGQSNS--GSEFSDIDGAIATAQIPIYTIGY-NANLNELKAISEINEAA 544 >UniRef50_A7BVG3 von Willebrand factor type A domain protein n=1 Tax=Beggiatoa sp. PS RepID=A7BVG3_9GAMM Length = 280 Score = 73.4 bits (179), Expect = 6e-12, Method: Composition-based stats. Identities = 41/198 (20%), Positives = 64/198 (32%), Gaps = 31/198 (15%) Query: 22 CILLLDVSGSMNGRPINELNAGLVTFRDELLADPLALKRVELGIVTFGP-VHVEQPFTSA 80 LL+DVS SM+G + E F + LA +G++ FG + T Sbjct: 91 VFLLIDVSYSMDGSALAEAKQAAQEF---VRKSDLAHTA--IGLIEFGSKAKIISGLTQN 145 Query: 81 ANF---FPPILFAQGDTPMGAAITKALDMVEERKREYRANGISYYRPWIFLITDGAPT-- 135 A L G T M +T A + + R +I L+TDG P Sbjct: 146 AKHLYKAINRLKTNGSTNMTEGLTTAY---------LKLKNVDDPR-FIILLTDGLPNHP 195 Query: 136 DEWQAAANKVFRGEEDKRFAFFSIGVQGADMKTLAQIS--VRQPLPLQGLQFRELF---- 189 Q A ++ +IG AD L ++ + + F Sbjct: 196 KNTQQIAQEICAD----GIELITIGTGDADKTYLQSLACYDQNSFFAKAGTMVSTFSRIA 251 Query: 190 SWLSSSLRSVSRSTPGTE 207 L+ S + + G Sbjct: 252 QVLTESGSYIQITQNGQR 269 >UniRef50_A9GUK8 Putative uncharacterized protein yfbK n=1 Tax=Sorangium cellulosum 'So ce 56' RepID=A9GUK8_SORC5 Length = 521 Score = 73.0 bits (178), Expect = 7e-12, Method: Composition-based stats. Identities = 36/196 (18%), Positives = 66/196 (33%), Gaps = 26/196 (13%) Query: 8 ATSDFASNPEPRCPCILLLDVSGSMNGRPINELNAGLVTFRDELLADPLALKRVELGIVT 67 + D + P ++ +D SGSM G PI + AGLV D L + +V Sbjct: 119 SPVDLGALERPPLHLVIAVDTSGSMEGDPIAYVRAGLVEMIDALQP------TDRISLVR 172 Query: 68 FGPVHVEQPFTSAANFFPPI------LFAQGDTPMGAAITKALDMVEERKREYRANGISY 121 + + + + L A+G T + + A + E+ Sbjct: 173 YSDAAEVVLEQAEGSDREALTEAFEGLTARGSTNLYEGLFTAYALAEQHLDPA------- 225 Query: 122 YRPWIFLITDGAPTDEWQAAANK--VFRGEEDKRFAFFSIGVQGA-DMKTLAQIS----V 174 ++ + ++DG T + + G +K +IGV D+ + IS Sbjct: 226 WQNRVIFLSDGVATAGLTSPQRLVSLAAGYAEKGIGLTAIGVGAEFDVDAMRGISEVGAG 285 Query: 175 RQPLPLQGLQFRELFS 190 E+F+ Sbjct: 286 NFYFLEDPKAVEEVFA 301 >UniRef50_UPI00016DFBC7 UPI00016DFBC7 related cluster n=4 Tax=Takifugu rubripes RepID=UPI00016DFBC7 Length = 883 Score = 73.0 bits (178), Expect = 7e-12, Method: Composition-based stats. Identities = 36/173 (20%), Positives = 63/173 (36%), Gaps = 29/173 (16%) Query: 7 FATSDFASNPEPRCP--CILLLDVSGSMNGRPINELNAGLVTFRDELLADPLALKRVELG 64 FA D R P + ++D SGSM+G + ++ ++ ++L + G Sbjct: 241 FAPKDLT-----RLPKNVVFVIDRSGSMSGTKMQQIQEAMIKILEDLHPED------HFG 289 Query: 65 IVTFGPV----------HVEQPFTSAANFFPPILFAQGDTPMGAAITKALDMVEERKREY 114 I+ F E+ + A + I T + AA+ KA+DM+ + Sbjct: 290 IIQFDSSVDSWRNSLSLATEENISEAMAYVNQISHKIQATNINAAVLKAVDMLVTDREAK 349 Query: 115 RANGISYYRPWIFLITDGAPTDEWQA----AANKVFRGEEDKRFAFFSIGVQG 163 R S I L+TDG PT + + R + + +G Sbjct: 350 RLPEKSID--MIILLTDGDPTTDIGETRIPVIQENVRNAIGGNMSLYGLGFGN 400 >UniRef50_B7KCF7 von Willebrand factor type A n=1 Tax=Cyanothece sp. PCC 7424 RepID=B7KCF7_CYAP7 Length = 573 Score = 73.0 bits (178), Expect = 7e-12, Method: Composition-based stats. Identities = 35/170 (20%), Positives = 60/170 (35%), Gaps = 23/170 (13%) Query: 22 CIL--LLDVSGSMNGRPINELNAGLVTFRDELLADPLALKRVELGIVTFGP-VHVEQPFT 78 L ++D SGSM+G P+ + GL E+ +G+VT+G P Sbjct: 398 VYLMTVIDTSGSMDGAPLEAVKKGLRIASKEINPGNY------VGLVTYGDRAAEVVPLG 451 Query: 79 SAANFFPPI-------LFAQGDTPMGAAITKALDMVEERKREYRANGISYYRPWIFLITD 131 L A G T M + L + E+K+ +G Y + L+TD Sbjct: 452 LFDELQHKRFLAAIDNLRADGATAMYDGMMIGLSKLMEQKKN-NPDGRFY----LLLLTD 506 Query: 132 GAPTDEWQAAANKVFRGEEDKRFAFFSIGVQGADMKTLAQISVRQPLPLQ 181 G ++V E + I + + L I+ + ++ Sbjct: 507 GQAN--MGVTFDEVKEVIEYSGVRVYPIAYGDVNQEELEAIASLRESTVK 554 >UniRef50_Q22SJ4 von Willebrand factor type A domain containing protein n=8 Tax=Tetrahymena thermophila RepID=Q22SJ4_TETTH Length = 646 Score = 72.6 bits (177), Expect = 9e-12, Method: Composition-based stats. Identities = 34/195 (17%), Positives = 64/195 (32%), Gaps = 32/195 (16%) Query: 13 ASNPEPRCPCILLLDVSGSMNGRPINELNAGLVTFRDELLADPLALKRVELGIVTFGPVH 72 P + ++D SGSM G I + L+ D L ++ L ++ F Sbjct: 201 VEQSRPSIDLVCVIDNSGSMQGEKIQNVKTTLLQLLDMLNSND------RLSLILFNSYP 254 Query: 73 VEQP-FTSAANFFPPI-------LFAQGDTPMGAAITKALDMVEERKREYRANGISYYRP 124 + P + A G T + + + A +++++R ++ P Sbjct: 255 TLLCNLRKVDDENTPNIQSIINSITADGGTDINSGMLMAFNILQKR---------QFFNP 305 Query: 125 W--IFLITDGAPTDEWQAAANKVFRGEEDKR--FAFFSIGVQ-GADMKTLAQI----SVR 175 IFL++DG + + + K F+ S G D + +I Sbjct: 306 VSSIFLLSDGQDNGADEKIKKYINSNQSLKNECFSIHSFGFGSDHDGPLMNRICQLKDGN 365 Query: 176 QPLPLQGLQFRELFS 190 + Q E F Sbjct: 366 FYYVEKINQVDEFFV 380 >UniRef50_C5WYU9 Putative uncharacterized protein Sb01g047460 n=3 Tax=Andropogoneae RepID=C5WYU9_SORBI Length = 698 Score = 72.2 bits (176), Expect = 1e-11, Method: Composition-based stats. Identities = 37/186 (19%), Positives = 59/186 (31%), Gaps = 38/186 (20%) Query: 20 CPCILLLDVSGSMNGRPINELNAGLVTFRDELLADPLALKRVELGIVTFGP-VHVEQPF- 77 + +LDVSGSM G I L + L + L ++ F P Sbjct: 240 LDLVTVLDVSGSMAGTKIALLKNAMSFVIQTLGPND------RLSVIAFSSTARRLFPLR 293 Query: 78 --TSAANFFP----PILFAQGDTPMGAAITKALDMVEERKREYRANGISYYRPW--IFLI 129 T A L A G T + + K ++E+R+ + P I L+ Sbjct: 294 RMTLAGRQQALQAVSSLVASGGTNIADGLKKGAKVIEDRRLKN---------PVCSIILL 344 Query: 130 TDGA-----PTD----EWQAAANKVFRGEEDKRFAFFSIGVQ----GADMKTLAQISVRQ 176 +DG P+D ++ A + G A M +A+IS Sbjct: 345 SDGQDTYTLPSDRNLLDYSALVPPSILPGTGHHVQIHTFGFGSDHDSAAMHAIAEISSGT 404 Query: 177 PLPLQG 182 + Sbjct: 405 FSFIDA 410 >UniRef50_B0CZQ4 Predicted protein n=1 Tax=Laccaria bicolor S238N-H82 RepID=B0CZQ4_LACBS Length = 461 Score = 71.9 bits (175), Expect = 1e-11, Method: Composition-based stats. Identities = 45/206 (21%), Positives = 74/206 (35%), Gaps = 39/206 (18%) Query: 14 SNPEPRCP----CILLLDVSGSMNGRPINELNAGLVTFRDELLADPLALKRVELGIVTFG 69 ++P P I ++D SGSM+G ++ A L E + V++ + Sbjct: 257 NDPGKPTPGNYKIIFVIDDSGSMSGGRWKQVRAALAEIGKEAMQ--YDADGVDVCFI--- 311 Query: 70 PVHVEQPFTSAANF--------------FPPILF------AQGDTPMGAAITKALD-MVE 108 V + +N+ IL G TP GA + L ++E Sbjct: 312 NSPVHKELVKVSNYLAFLIAELNKHNQTQEEILSVYDQVQPSGFTPTGAKLEAILGPVIE 371 Query: 109 ERKREYRANGISYYRPW-IFLITDGAPTDEW----QAAANKVFRGEEDKRFA---FFSIG 160 + + +P I ++TDG PTD+ Q AA K+ G F IG Sbjct: 372 KLDAAVDTQAYGHIKPADIIVLTDGVPTDDPAAVIQTAAQKLDEGLHHMNAVGIQFVQIG 431 Query: 161 VQGADMKTLAQISVRQPLPLQGLQFR 186 + L + P+ ++ L FR Sbjct: 432 NDEGADQALMAL-CNGPVRVRSLMFR 456 >UniRef50_P19823 Inter-alpha-trypsin inhibitor heavy chain H2 n=40 Tax=Euteleostomi RepID=ITIH2_HUMAN Length = 946 Score = 71.9 bits (175), Expect = 1e-11, Method: Composition-based stats. Identities = 37/199 (18%), Positives = 76/199 (38%), Gaps = 40/199 (20%) Query: 22 CILLLDVSGSMNGRPINELNAGLVTFRDELLADPLALKRVELGIVTFGP----------V 71 + ++DVSGSM G + + + T D+L A+ ++ F Sbjct: 311 ILFVIDVSGSMWGVKMKQTVEAMKTILDDLRAED------HFSVIDFNQNIRTWRNDLIS 364 Query: 72 HVEQPFTSAANFFPPILFAQGDTPMGAAITKALDMVEERKREYRANGISYYRP----WIF 127 + A + + G T + A+ +A+ ++ E AN + P I Sbjct: 365 ATKTQVADAKR-YIEKIQPSGGTNINEALLRAIFILNE------ANNLGLLDPNSVSLII 417 Query: 128 LITDGAPTDEWQAAANKVFRGEEDK---RFAFFSIGVQ-GADMKTLAQISVRQPLPLQ-- 181 L++DG PT + +K+ + ++ + FS+G+ D L ++S Q Sbjct: 418 LVSDGDPTVG-ELKLSKIQKNVKENIQDNISLFSLGMGFDVDYDFLKRLSNENHGIAQRI 476 Query: 182 ------GLQFRELFSWLSS 194 Q ++ ++ +S+ Sbjct: 477 YGNQDTSSQLKKFYNQVST 495 >UniRef50_Q6UXX5 Inter-alpha-trypsin inhibitor heavy chain H5-like protein n=3 Tax=Eutheria RepID=ITH5L_HUMAN Length = 1313 Score = 71.9 bits (175), Expect = 1e-11, Method: Composition-based stats. Identities = 33/169 (19%), Positives = 61/169 (36%), Gaps = 21/169 (12%) Query: 22 CILLLDVSGSMNGRPINELNAGLVTFRDELLADPLALKRVELGIVTFGPV---------- 71 + ++DVS SM G + + + +L A+ I++F Sbjct: 284 VVFVIDVSSSMFGTKMEQTKTAMNVILSDLQANDY------FNIISFSDTVNVWKAGGSI 337 Query: 72 -HVEQPFTSAANFFPPILFAQGDTPMGAAITKALDMVEERKREYRANGISYYRPWIFLIT 130 Q SA + + + A G T + +A+ A ++ +E P I +T Sbjct: 338 QATIQNVHSAKD-YLHCMEADGWTDVNSALLAAASVLNHSNQEPGRGPSVGRIPLIIFLT 396 Query: 131 DGAPTDEWQAAANKVFRGEED--KRFAFFSIGVQ-GADMKTLAQISVRQ 176 DG PT + + + R + FS+ AD L ++S+ Sbjct: 397 DGEPTAGVTTPSVILSNVRQALGHRVSLFSLAFGDDADFTLLRRLSLEN 445 >UniRef50_Q231J4 von Willebrand factor type A domain containing protein n=1 Tax=Tetrahymena thermophila SB210 RepID=Q231J4_TETTH Length = 520 Score = 71.5 bits (174), Expect = 2e-11, Method: Composition-based stats. Identities = 28/124 (22%), Positives = 44/124 (35%), Gaps = 21/124 (16%) Query: 18 PRCPCILLLDVSGSMNGRPINELNAGLVTFRDELLADPLALKRVELGIVTFGP-VHVEQP 76 I ++D SGSM G+ I + ++ + D + +V F V Sbjct: 94 QPLDLIFVIDTSGSMQGKKIELVKKSILQVLHIIQGDD------RISLVGFNSQAKVLLE 147 Query: 77 FTSAA-------NFFPPILFAQGDTPMGAAITKALDMVEERKREYRANGISYYRPWIFLI 129 T L A G T +G + KA D+++ER IFL+ Sbjct: 148 LTQLTKNSKKKIQKTVDELQAGGGTQIGFGMQKAFDIIKERTNSKNLAS-------IFLL 200 Query: 130 TDGA 133 +DG Sbjct: 201 SDGQ 204 >UniRef50_UPI00016E1D1D UPI00016E1D1D related cluster n=9 Tax=Tetraodontidae RepID=UPI00016E1D1D Length = 2191 Score = 71.5 bits (174), Expect = 2e-11, Method: Composition-based stats. Identities = 41/162 (25%), Positives = 69/162 (42%), Gaps = 20/162 (12%) Query: 22 CILLLDVSGSMNGRPINELN-AGLVTFRDELLAD-PLALKRVELGIVTFGPVHVEQPF-- 77 + ++D SGS I N + +F L++ +A RV +GIV + + Q F Sbjct: 383 IVFIIDESGS-----IGSANFQLMRSFLHSLISGLQVASNRVRVGIVMYNVEPMAQVFLN 437 Query: 78 -----TSAANFFPPILFAQGDTPMGAAITKALDMVEERKREYRANGISYYRPWIFLITDG 132 + +F + + G T GAA+ L V ++R R + + +ITDG Sbjct: 438 TFKDKSELLDFIKILPYHGGGTNTGAALNFTLQEVFIKQRGSRKDLGV--QQVAVVITDG 495 Query: 133 APTDEWQAAANKVFRGEEDKRFAFFSIGVQGADMKTLAQISV 174 DE + A + R +++GV+ AD L QI+ Sbjct: 496 KSQDEVSSPAANLRRA----GVTVYAVGVKDADKAQLDQIAS 533 Score = 65.7 bits (159), Expect = 1e-09, Method: Composition-based stats. Identities = 35/174 (20%), Positives = 61/174 (35%), Gaps = 19/174 (10%) Query: 22 CILLLDVSGSMNGRPINELNAGLVTFRDELLADPLALKRVELGIVTFGPV-HVEQPF--- 77 I L+D SGS+ ++ + + + + V +G++ + + + P Sbjct: 792 LIFLIDSSGSIYPEDYKKMKDFMKSV---IKQSIVGKNEVHVGVMQYSTIQKLVFPLNQY 848 Query: 78 ---TSAANFFPPILFAQGDTPMGAAITKALDMVEERKREYRANGISYYRPWIFLITDGAP 134 + + G T G AIT + R G + + ++TDG Sbjct: 849 YTKDELSKAIDEMQQIGGGTHTGEAITDVSQYFDAR-----NGGRPDLKQRLVVVTDGES 903 Query: 135 TDEWQAAANKVFRGEEDKRFAFFSIGVQGADMKTLAQISVRQPLPLQGLQFREL 188 DE + A + K +SIGV A+ L +IS G F L Sbjct: 904 QDEVRQPAEALRA----KGVIVYSIGVVAANTSQLLEISGTPNRMYAGRDFDAL 953 Score = 65.7 bits (159), Expect = 1e-09, Method: Composition-based stats. Identities = 37/174 (21%), Positives = 64/174 (36%), Gaps = 18/174 (10%) Query: 22 CILLLDVSGSMNGRPINELNAGLVTFRDELLADPLALKRVELGIVTFGPVHVEQ------ 75 L+D SGS+N +++ ++ F P + +G+V F + Sbjct: 590 IFFLIDHSGSINPADFHDMKKFMIEFLHTFRVGPQ---HIRIGVVKFADSPQLEFDLQAY 646 Query: 76 -PFTSAANFFPPILFAQGDTPMGAAITKALDMVEERKREYRANGISYYRPWIFLITDGAP 134 S + I G T G A+ ++ + Y + +ITDG Sbjct: 647 SDVKSLEDAILNIKQIGGGTETGRALEFMSPQFDQALATHGHKVKEY----LVVITDGKS 702 Query: 135 TDEWQAAANKVFRGEEDKRFAFFSIGVQGADMKTLAQISVRQPLPLQGLQFREL 188 TD+ +A A+K+ + ++IGV+ AD L +IS F L Sbjct: 703 TDKVKAPADKLRSQD----VVVYAIGVKNADENQLLEISGDPQRTFFVNNFDAL 752 Score = 63.4 bits (153), Expect = 5e-09, Method: Composition-based stats. Identities = 38/168 (22%), Positives = 59/168 (35%), Gaps = 18/168 (10%) Query: 22 CILLLDVSGSMNGRPINELNAGLVTFRDELLADPLALKRVELGIVTFGPVHVEQPFTSAA 81 + L+D S S+ E+ L F L P ++++G+ + Q F Sbjct: 2 IVFLVDGSSSIGTDNFQEVRLFLRNFTSGLDIGP---DKIQIGLAQYSNDP-HQEFLLKD 57 Query: 82 NFFPPILFA--------QGDTPMGAAITKALDMVEERKREYRANGISYYRPWIFLITDGA 133 + L A G T G AI ++ RAN +ITDG Sbjct: 58 HMEKTALLAALDSFPYRTGGTETGKAIDFLRTQYFTKEAGSRANQRVP--QIAVVITDGD 115 Query: 134 PTDEWQAAANKVFRGEEDKRFAFFSIGVQGADMKTLAQISVRQPLPLQ 181 TD+ A + + F+IGV A+ L I+ R P + Sbjct: 116 STDDVTVPAQSLRK----HGVIVFAIGVGNANQNELESIANRPPKRFK 159 Score = 49.5 bits (117), Expect = 8e-05, Method: Composition-based stats. Identities = 34/175 (19%), Positives = 54/175 (30%), Gaps = 19/175 (10%) Query: 22 CILLLDVSGSMNGRPINELNAGLVTFRDELLADPLALKRVELGIVTFGPVHVEQPFT--- 78 I L+D S S+ + L + + G++ + FT Sbjct: 995 IIFLVDGSTSITQPKFRSM---LKFMASMVNQTTVGSDLTRFGVILYSNDA-NSMFTLKQ 1050 Query: 79 -----SAANFFPPILFAQGDTPMGAAITKALDMVEERKREYRANGISYYRPWIFLITDGA 133 + GDT G A+T +L E A + + +ITDG Sbjct: 1051 YSAKREVLQAIAALKSPLGDTYTGKALTYSLQFFNEEHGGRAALQVP---QILMVITDGE 1107 Query: 134 PTDEWQAAANKVFRGEEDKRFAFFSIGVQGADMKTLAQISVRQPLPLQGLQFREL 188 D+ + AA + F+IG+ A L QI+ F L Sbjct: 1108 SQDDVEDAARLLRSL----GVEVFTIGIGNAHDLELLQIAGSPERVFTVKSFGNL 1158 >UniRef50_D0LTR8 von Willebrand factor type A n=1 Tax=Haliangium ochraceum DSM 14365 RepID=D0LTR8_HALO1 Length = 903 Score = 71.5 bits (174), Expect = 2e-11, Method: Composition-based stats. Identities = 31/166 (18%), Positives = 58/166 (34%), Gaps = 26/166 (15%) Query: 17 EPRCPCILLLDVSGSMNGRPINELNAGLVTFRDELLADPLALKRVELGIVTFGPV-HVEQ 75 +P L++D SGSM+G I + L L + +V F Sbjct: 455 QPHVAIALVVDRSGSMSGLKIEAAKESARATAEVLSPSDL------ITVVAFDNQPTTIV 508 Query: 76 PFTSAAN-----FFPPILFAQGDTPMGAAITKALDMVEERKREYRANGISYYRPWIFLIT 130 A+N L A G T + A+ +A ++++ + + + +++ Sbjct: 509 RLQRASNRMRIATDIARLQAGGGTNIYPALREAYEILQGANAKVKH---------VIVLS 559 Query: 131 DGA-PTDEWQAAANKVFRGEEDKRFAFFSIGVQGADMKTLAQISVR 175 DG P D + + R ++G+ AD L I+ Sbjct: 560 DGQAPYD----GIADLCQEMRSARITVSAVGIGDADRNLLNLITDN 601 >UniRef50_B2UZB2 von Willebrand factor type A domain protein n=1 Tax=Clostridium botulinum E3 str. Alaska E43 RepID=B2UZB2_CLOBA Length = 984 Score = 71.5 bits (174), Expect = 2e-11, Method: Composition-based stats. Identities = 32/135 (23%), Positives = 55/135 (40%), Gaps = 24/135 (17%) Query: 9 TSDFASNPEPRCPCILLLDVSGSMNGRPINELNAGLVTFRDELLADPLALKRVELGIVTF 68 SDF N P+ +L+LD SGSM I ++ + F +++ P +++ IVT+ Sbjct: 80 PSDFKCN-IPKKEIVLVLDTSGSMKDSKIKKMKNAAMEFVNKIKKIPN----LDIDIVTY 134 Query: 69 GPV--------HVEQPFTSAANFFPPILFAQGDTPMGAAITKALDMVEERKREYRANGIS 120 + E+ N + A G T G + KA +++ K + Sbjct: 135 STSGYTYLNNGNTEEDLLKIIN----SIKADGGTNTGEGLRKANYILDLEKNKNADKS-- 188 Query: 121 YYRPWIFLITDGAPT 135 I ++DG PT Sbjct: 189 -----IVFMSDGMPT 198 >UniRef50_Q24FW2 von Willebrand factor type A domain containing protein n=1 Tax=Tetrahymena thermophila SB210 RepID=Q24FW2_TETTH Length = 1074 Score = 71.5 bits (174), Expect = 2e-11, Method: Composition-based stats. Identities = 42/181 (23%), Positives = 71/181 (39%), Gaps = 27/181 (14%) Query: 12 FASNPEPRCPC--ILLLDVSGSMNGRPINELNAGLVTFRDELLADPLALKRVELGIVTFG 69 F + R P I ++D SGSM+G IN L L+ D+L ++ LG+V F Sbjct: 354 FDAKAYQRPPIDLICVMDNSGSMHGEKINMLKETLLYLIDQL------DEKDRLGLVLFN 407 Query: 70 PVHVEQPFTSAA-------NFFPPILFAQGDTPMGAAITKALDMVEERKREYRANGISYY 122 +P S + + AQG T + +T+A ++ RK Y Sbjct: 408 SEVTFRPMKSMDTTNKLKLKQYISDIRAQGGTDINLGMTEAFKFIKTRK---------YC 458 Query: 123 RPW--IFLITDGAPTDEWQAAANKVFRGEEDKRFAFFSIGVQ-GADMKTLAQISVRQPLP 179 P +FL++DG + A + +++F+ G D + QI + Sbjct: 459 NPVTSVFLLSDGLDSKAQDRVAVTLKNMSINEQFSINCFGFGRDHDPILMNQIKKIDQVD 518 Query: 180 L 180 + Sbjct: 519 M 519 >UniRef50_C7N1C6 Uncharacterized protein n=1 Tax=Slackia heliotrinireducens DSM 20476 RepID=C7N1C6_SLAHD Length = 744 Score = 71.5 bits (174), Expect = 2e-11, Method: Composition-based stats. Identities = 38/197 (19%), Positives = 63/197 (31%), Gaps = 27/197 (13%) Query: 6 TFATSDFASNPEPRCPCILLLDVSGSMNGRPINELNAGLVTFRDELLADPLALKRVELGI 65 S N +L LD SGSM+G P+NE F + ++ + Sbjct: 366 LMGDSKVDPNDASSRHVVLALDTSGSMDGEPLNETKTATREFASTIFKS-----DADVCL 420 Query: 66 VTF-GPVHVEQPFTS---AANFFPPILFAQGDTPMGAAITKALDMVEERKREYRANGISY 121 V++ T A L A G T + A+ + + +E + R Sbjct: 421 VSYDSSARNVIDSTDNEYALKAAVRDLSAGGGTNIEDALRVSYERLEGSGSDKR------ 474 Query: 122 YRPWIFLITDGAPT-----DEWQAAANKVFRGEEDKRFAFF--SIGVQGADMKTLAQIS- 173 I L++DG D+ A AN++ F S+ + + + I+ Sbjct: 475 ---IIVLMSDGEANEGLVGDDLIAYANEIKDDGVTIYTLGFFQSVSDKAECQRVMEGIAS 531 Query: 174 -VRQPLPLQGLQFRELF 189 Q R F Sbjct: 532 PGCHYEVDDASQLRYFF 548 >UniRef50_C3YC17 Putative uncharacterized protein n=1 Tax=Branchiostoma floridae RepID=C3YC17_BRAFL Length = 951 Score = 71.1 bits (173), Expect = 2e-11, Method: Composition-based stats. Identities = 33/185 (17%), Positives = 62/185 (33%), Gaps = 27/185 (14%) Query: 1 MSEQITFATSDFASNPEPRCPCILLLDVSGSMNGRPINELNAGLVTFRDELLA-DPLALK 59 + QI + R I+ D SGSM+G P ++ L+ ++ + +P Sbjct: 61 LKLQIPLPMEWTSLLQNKRTHVIVAADKSGSMSGNPWRQVQQALLYMIGDVASVNP---- 116 Query: 60 RVELGIVTFGPVHVEQPFTSAANFFPPILFAQGDTPMGAAITKALDMVEERKREYRANGI 119 V L +V + + + + A G T AA + D ++ + + Sbjct: 117 SVALDVVIYNDKASLLQYAGSYQDAVNRVNADGMTSFAAAFSCIKDCLKTEIQGTPVSKT 176 Query: 120 SYYRPWIFLITDGAPT-----D------EWQAAANKVFRGEEDKRFAFFSIGVQG-ADMK 167 + +TDGA T D W+ A ++ +G D Sbjct: 177 -----VVVFMTDGADTCNRGADIDRSVRSWKEALARL-----GHEAIVHVVGFSAQHDYN 226 Query: 168 TLAQI 172 L ++ Sbjct: 227 FLGRL 231 >UniRef50_B2A702 von Willebrand factor type A n=1 Tax=Natranaerobius thermophilus JW/NM-WN-LF RepID=B2A702_NATTJ Length = 599 Score = 71.1 bits (173), Expect = 3e-11, Method: Composition-based stats. Identities = 42/170 (24%), Positives = 66/170 (38%), Gaps = 28/170 (16%) Query: 22 CILLLDVSGSMNGRPINELNAGLVTFRDELLADPLALKRVELGIVTF--GPVHVEQPFT- 78 L+D SGSM GR + E+ L R ++ I+TF V+VE PFT Sbjct: 424 VCFLVDASGSMGGRRMQEVKF--------FAEHVLLKGRDKIAILTFREDNVNVEIPFTR 475 Query: 79 --SAANFFPPILFAQGDTPMGAAITKALDMVEERKREYRANGISYYRPWIFLITDGAPT- 135 + A G TPM I A +E + + ++ LITDG PT Sbjct: 476 NWDKLRSGLNKIKAFGLTPMSKGIEMARKYLESEVGQQKNT-------FLVLITDGLPTI 528 Query: 136 -----DEWQAAANKVFRGEEDKRFAFFSIGVQGADMKTLAQISVRQPLPL 180 D ++ + + F IG++ ++K L +++ L Sbjct: 529 SDGGEDPFKETLKAAQKLSQ-TSIKFVCIGLEP-NVKFLKKLAQASQASL 576 >UniRef50_UPI00016E1D58 UPI00016E1D58 related cluster n=1 Tax=Takifugu rubripes RepID=UPI00016E1D58 Length = 451 Score = 70.7 bits (172), Expect = 3e-11, Method: Composition-based stats. Identities = 41/162 (25%), Positives = 69/162 (42%), Gaps = 20/162 (12%) Query: 22 CILLLDVSGSMNGRPINELN-AGLVTFRDELLAD-PLALKRVELGIVTFGPVHVEQPF-- 77 + ++D SGS I N + +F L++ +A RV +GIV + + Q F Sbjct: 2 IVFIIDESGS-----IGSANFQLMRSFLHSLISGLQVASNRVRVGIVMYNVEPMAQVFLN 56 Query: 78 -----TSAANFFPPILFAQGDTPMGAAITKALDMVEERKREYRANGISYYRPWIFLITDG 132 + +F + + G T GAA+ L V ++R R + + +ITDG Sbjct: 57 TFKDKSELLDFIKILPYHGGGTNTGAALNFTLQEVFIKQRGSRKDLGV--QQVAVVITDG 114 Query: 133 APTDEWQAAANKVFRGEEDKRFAFFSIGVQGADMKTLAQISV 174 DE + A + R +++GV+ AD L QI+ Sbjct: 115 KSQDEVSSPAANLRRA----GVTVYAVGVKDADKAQLDQIAS 152 Score = 65.3 bits (158), Expect = 1e-09, Method: Composition-based stats. Identities = 35/174 (20%), Positives = 61/174 (35%), Gaps = 19/174 (10%) Query: 22 CILLLDVSGSMNGRPINELNAGLVTFRDELLADPLALKRVELGIVTFGPV-HVEQPF--- 77 I L+D SGS+ ++ + + + + V +G++ + + + P Sbjct: 233 LIFLIDSSGSIYPEDYKKMKDFMKSV---IKQSIVGKNEVHVGVMQYSTIQKLVFPLNQY 289 Query: 78 ---TSAANFFPPILFAQGDTPMGAAITKALDMVEERKREYRANGISYYRPWIFLITDGAP 134 + + G T G AIT + R G + + ++TDG Sbjct: 290 YTKDELSKAIDEMQQIGGGTHTGEAITDVSQYFDAR-----NGGRPDLKQRLVVVTDGES 344 Query: 135 TDEWQAAANKVFRGEEDKRFAFFSIGVQGADMKTLAQISVRQPLPLQGLQFREL 188 DE + A + K +SIGV A+ L +IS G F L Sbjct: 345 QDEVRQPAEALRA----KGVIVYSIGVVAANTSQLLEISGTPNRMYAGRDFDAL 394 >UniRef50_D2R2I7 von Willebrand factor type A n=2 Tax=Planctomycetaceae RepID=D2R2I7_9PLAN Length = 786 Score = 70.7 bits (172), Expect = 3e-11, Method: Composition-based stats. Identities = 31/164 (18%), Positives = 56/164 (34%), Gaps = 28/164 (17%) Query: 22 CILLLDVSGSMNGRPINELNAGLVTFRDELLADPLALKRVELGIVT--------FGPVHV 73 I ++D SGSM G+ I + + + L V F Sbjct: 309 VIFVVDRSGSMQGKKIEQAREAMRYVLNNLHEGDTFNIVAYDSTVESFKPELQKFDDATR 368 Query: 74 EQPFTSAANFFPPILFAQGDTPMGAAITKALDMVEERKREYRANGISYYRP-WIFLITDG 132 + + L+A G T + A+ A M+ RP +I +TDG Sbjct: 369 KSALA-----YVDGLYAGGSTNISGALDSAFAML-----------TGSDRPNYILFLTDG 412 Query: 133 APT--DEWQAAANKVFRGEEDKRFAFFSIGVQ-GADMKTLAQIS 173 PT + + ++ + + R + GV + + L ++S Sbjct: 413 LPTAGETNEGKIVELAKQKNVHRARMINFGVGYDVNSRLLDRMS 456 >UniRef50_Q22ST4 von Willebrand factor type A domain containing protein n=1 Tax=Tetrahymena thermophila SB210 RepID=Q22ST4_TETTH Length = 648 Score = 70.7 bits (172), Expect = 3e-11, Method: Composition-based stats. Identities = 39/209 (18%), Positives = 72/209 (34%), Gaps = 28/209 (13%) Query: 20 CPCILLLDVSGSMNGRPINELNAGLVTFRDELLADPLALKRVELGIVTFGPVHVE-QPFT 78 ++++D SGSM G I + LV + + + + IV F FT Sbjct: 222 VDLVVVIDKSGSMEGEKIQLVKETLVKIINLMSSMD------RICIVCFNESGDRPLTFT 275 Query: 79 SAANFFPPIL-------FAQGDTPMGAAITKALDMVEERKREYRANGISYYRPWIFLITD 131 + L +A G T + I AL ++ RK + I L++D Sbjct: 276 RVTDENKQTLLNLIQQIYAGGGTNISEGINHALKAIQNRKFKNNVTS-------ILLLSD 328 Query: 132 GAPTDEWQAAANKVFRGEEDKRFAFFSIGVQ-GADMKTLAQIS---VRQPLPLQGLQF-- 185 G T + + + + F +IG D K L +S +Q + + Sbjct: 329 GQDTKAYTRVKAYIDKYQIKDAFNIETIGFGEDHDPKLLRTLSDLRNGTFNFMQDVNYLD 388 Query: 186 RELFSWLSSSLRSVSRSTPGTEVVLEAPK 214 + + + +V++ V P+ Sbjct: 389 TAFINIFAGMISTVAQ-NIKVGVKFTPPE 416 >UniRef50_B1KDL1 LPXTG-motif cell wall anchor domain protein n=1 Tax=Shewanella woodyi ATCC 51908 RepID=B1KDL1_SHEWM Length = 739 Score = 70.3 bits (171), Expect = 4e-11, Method: Composition-based stats. Identities = 41/152 (26%), Positives = 63/152 (41%), Gaps = 23/152 (15%) Query: 22 CILLLDVSGSMNGRPINELNAGLVTFRDELLADPLALKRVELGIVTF-GPVHVEQPFTS- 79 IL++D SGSM+G I + L L A ++ F V P++ Sbjct: 365 LILVIDTSGSMSGASIAQAKRALNYALAGLKAKDT------FNVIEFNSNVGSLSPYSLP 418 Query: 80 -------AANFFPPILFAQGDTPMGAAITKALDMVEERKREYRANGISYYRPWIFLITDG 132 AN + L A G T M A+ ALD + E A G R +F +TDG Sbjct: 419 ATAKNIGLANQYVRSLKANGGTEMQLALNAALD----KGTETEALGSERLRQVLF-MTDG 473 Query: 133 APTDEWQAAANKVFRGEEDKRFAFFSIGVQGA 164 + DE Q+ + + + + R F++G+ A Sbjct: 474 SVGDE-QSLFHLIKQKIGESRL--FTLGIGSA 502 >UniRef50_Q503P4 Zgc:110377 n=9 Tax=Clupeocephala RepID=Q503P4_DANRE Length = 868 Score = 70.3 bits (171), Expect = 4e-11, Method: Composition-based stats. Identities = 40/205 (19%), Positives = 65/205 (31%), Gaps = 30/205 (14%) Query: 12 FASNPEPRCP--CILLLDVSGSMNGRPINELNAGLVTFRDELLADPLALKRVELGIVTFG 69 FA PR P + ++D S SM G + + L T EL D I+ F Sbjct: 245 FAPANLPRVPKMVVFVIDNSYSMYGNKMAQTKEALGTILGELPEDDY------FAIIVFS 298 Query: 70 PV---------HVEQPFTSAANFFPPILFAQGDTPMGAAITKALDMVEERKREYRANGIS 120 + A + + G T + A ++M+ +R A Sbjct: 299 TTFVVWRPYLSKATEENVKEAQEYVKTIEVIGGTELHDATIHGVEMLYAAQRNGTAPKNM 358 Query: 121 YYRPWIFLITDGAPTDEWQA--AANKVFRGEEDKRFAFFSIGVQ-GADMKTLAQISVRQ- 176 + L+TDG P ++ + R D F + AD L +S + Sbjct: 359 VL--MMILLTDGQPNQYPRSLPEIQESIRKAIDGNITLFGLAFGNDADYGFLDTLSKQNN 416 Query: 177 -------PLPLQGLQFRELFSWLSS 194 LQ + + +SS Sbjct: 417 GIVRRIYEDSDAPLQLKGFYEEVSS 441 >UniRef50_B3RZ89 Putative uncharacterized protein n=1 Tax=Trichoplax adhaerens RepID=B3RZ89_TRIAD Length = 933 Score = 69.9 bits (170), Expect = 5e-11, Method: Composition-based stats. Identities = 31/155 (20%), Positives = 57/155 (36%), Gaps = 22/155 (14%) Query: 17 EPRCPCILLLDVSGSMNGRPINELNAGLVTFRDELLADPLALKRVELGIVTFGPVHVEQP 76 +PR +L+LDVSGSM G+P+ +L F + + +GI+TF + Sbjct: 312 KPRT--VLVLDVSGSMRGKPMEQLQQAATNFLLNVAQNGS-----FVGIITFSSAASIRS 364 Query: 77 --FTSAANFFPPIL------FAQGDTPMGAAITKALDMVEERKREYRANGISYYRPWIFL 128 + L A G T +GA I + +++ +G + + + Sbjct: 365 SLVQINDDADRQRLILLLPSGASGSTSIGAGIQAGVKILKASVGNKSPSGGT-----LIV 419 Query: 129 ITDGAPTDEWQAAANKVFRGEEDKRFAFFSIGVQG 163 ++DG + V + D + I Sbjct: 420 LSDGR--ENRSPTIADVKKQVLDNKITVQGISFGS 452 >UniRef50_O28828 Putative uncharacterized protein n=1 Tax=Archaeoglobus fulgidus RepID=O28828_ARCFU Length = 410 Score = 69.9 bits (170), Expect = 6e-11, Method: Composition-based stats. Identities = 44/205 (21%), Positives = 71/205 (34%), Gaps = 36/205 (17%) Query: 1 MSEQITFATSDFASNPEPR---CPCILLLDVSGSMNGRPINELNAGLVTFRDELLADPLA 57 M +I ++ + C ++L+DVS SM GR I + R + Sbjct: 219 MRGEIELNENEMVARQPKHTEKCVYVMLIDVSDSMRGRKIVGAIEAALCLRKAIRRAGSG 278 Query: 58 LKRVELGIVTFGPVHVEQPFTSAANFFPPILFAQGDTPMGAAITKALDMVEERKREYRAN 117 EL ++ F E N L A+G T +G A+ +A RK ++ Sbjct: 279 D---ELRVIAFNHRAHEIKEGEILN-----LEARGRTDIGLALKRA------RKILKGSS 324 Query: 118 GISYYRPWIFLITDGAPTD-------EWQAAANKVFRGEE-DKRFAFFSIGVQG------ 163 G +FLI+DG PT W+ A + + D R G +G Sbjct: 325 GTG----VVFLISDGEPTSSYNPYLTPWRCALKEAEKMRNVDARLQIIMFGKEGRFLELC 380 Query: 164 ADMKTLAQISVRQPLPLQGLQFREL 188 +M L+ + L + Sbjct: 381 KNMAKLSGNANLFHFS-DPLNLKNF 404 >UniRef50_Q67LM1 Magnesium chelatase n=1 Tax=Symbiobacterium thermophilum RepID=Q67LM1_SYMTH Length = 741 Score = 69.6 bits (169), Expect = 8e-11, Method: Composition-based stats. Identities = 41/187 (21%), Positives = 65/187 (34%), Gaps = 31/187 (16%) Query: 17 EPRCPCILLLDVSGSMNGRPINELNAGLVTFRDELLADPLALKRVELGIVTFG--PVHVE 74 E LL+D S SM GR I + ++ F V V Sbjct: 559 EQSLDICLLIDASASMAGRRILAAKHLARHLLVSTRD--------RIAVIAFQERDVRVY 610 Query: 75 QPFT---SAANFFPPILFAQGDTPMGAAITKALDMVEERKREYRANGISYYRPWIFLITD 131 PFT SA + G TP+ + ++++++ + RP + LITD Sbjct: 611 VPFTRSYSAVEDGLARIQPMGLTPLAHGLIRSMELIHSARVR---------RPLLLLITD 661 Query: 132 GAPT------DEWQAAANKVFRGEEDKRFAFFSIGVQGAD--MKTLAQISVRQPLPLQGL 183 G PT D A + R +R F IG+Q + ++ L + + + L Sbjct: 662 GIPTVPKWSVDPLADAV-EAARQLRAQRIPFTCIGLQPSRRYLEQLVRQAGGTLHVVDEL 720 Query: 184 QFRELFS 190 L Sbjct: 721 SEESLIR 727 >UniRef50_A2E0T6 von Willebrand factor type A domain containing protein n=1 Tax=Trichomonas vaginalis RepID=A2E0T6_TRIVA Length = 753 Score = 69.2 bits (168), Expect = 8e-11, Method: Composition-based stats. Identities = 34/165 (20%), Positives = 54/165 (32%), Gaps = 12/165 (7%) Query: 12 FASNPEPRCPCILLLDVSGSMNGRPINELNAGLVTFRDELLADPLALKRVELGIVTFGPV 71 F + ++D SGSM G I + L L P+ I+ FG Sbjct: 234 FEGKVQANTEFYFIIDCSGSMYGSRIKNAKSCLNVLLHSL---PIGC---RFSIIKFG-T 286 Query: 72 HVEQPFTSAANFFPPILFAQGDTPMGAAITKALDMVEERKREYRANGISYYRPWIFLITD 131 E + A + A DM+ K Y +FL+TD Sbjct: 287 KFEVALEPCDYTDENMSKAMHQLDLIDADMCGNDMISPLKYISEHPQKKDYIKQVFLLTD 346 Query: 132 GAPTDEWQAAANKVFRGEEDKRFAFFSIGVQ-GADMKTLAQISVR 175 G D+ + V ++ F F+IG+ AD + ++ Sbjct: 347 GE--DDRISICAMVQANRDN--FRVFTIGIGSDADRNLIIDVARN 387 >UniRef50_A0E9B3 Chromosome undetermined scaffold_84, whole genome shotgun sequence n=1 Tax=Paramecium tetraurelia RepID=A0E9B3_PARTE Length = 603 Score = 69.2 bits (168), Expect = 9e-11, Method: Composition-based stats. Identities = 47/178 (26%), Positives = 71/178 (39%), Gaps = 34/178 (19%) Query: 13 ASNPEPRCPCILLL--DVSGSMNGRPINELNAGLVTFRDELLADPLALKRVELGIVTFGP 70 SN R P L+ DVSGSM GR IN + L L + + I+ F Sbjct: 111 VSNNLDRPPIDLVCVVDVSGSMIGRKINLVKDSLRYLMKILGPED------RICIIVFTT 164 Query: 71 V-HVEQPFTSAANFFPPILFAQ-------GDTPMGAAITKALDMVEERKREYRANGISYY 122 V H+ F P+L T + + KAL M++ RK Y Sbjct: 165 VAHIVTSFIRNTQENKPLLKKAILELKGLASTNISDGMNKALWMLKNRK---------YK 215 Query: 123 RPW--IFLITDGAPTDEWQAAANKVFRGEE----DKRFAFFSIGVQ-GADMKTLAQIS 173 P IFL++DG D+++ A +VF + +++F + G D + QI+ Sbjct: 216 NPVSCIFLLSDGQ--DDYKGAEQRVFDQLQLLKIEEKFVIHTFGYGQDHDAYVMNQIA 271 >UniRef50_Q1AYC2 Protoporphyrin IX magnesium-chelatase n=15 Tax=Bacteria RepID=Q1AYC2_RUBXD Length = 616 Score = 69.2 bits (168), Expect = 1e-10, Method: Composition-based stats. Identities = 33/133 (24%), Positives = 53/133 (39%), Gaps = 16/133 (12%) Query: 22 CILLLDVSGSMNGRPINELNAGLVTFRDELLADPLALKRVELGIVTF--GPVHVEQPF-- 77 +L++D SGSM R + + LL D +R +++F + P Sbjct: 450 LVLVVDSSGSMAAR---SRMSAVKGAVRALLEDAY-RRRDRAAVISFRGEEARLLVPPAS 505 Query: 78 -TSAANFFPPILFAQGDTPMGAAITKALDMVEERKREYRANGISYYRPWIFLITDGAPT- 135 AA L G TP+ A + A + V A+ RP + +ITDG T Sbjct: 506 GVEAAAARLEELPTGGRTPLAAGLELAAETVLRE-----ASREPERRPLLVVITDGRATA 560 Query: 136 -DEWQAAANKVFR 147 ++ AAA ++ Sbjct: 561 GEDPLAAARRLRE 573 >UniRef50_Q8H924 Os10g0464900 protein n=3 Tax=Oryza sativa RepID=Q8H924_ORYSJ Length = 646 Score = 69.2 bits (168), Expect = 1e-10, Method: Composition-based stats. Identities = 29/124 (23%), Positives = 46/124 (37%), Gaps = 21/124 (16%) Query: 20 CPCILLLDVSGSMNGRPINELNAGLVTFRDELLADPLALKRVELGIVTFGPVHVEQP--- 76 + +LDVSGSM G + L + D L L +++F Sbjct: 174 LDLVTVLDVSGSMVGNKLALLKQAMGFVIDNLGPGD------RLCVISFSSGASRLMRLS 227 Query: 77 -FTSAANFFPPI----LFAQGDTPMGAAITKALDMVEERKREYRANGISYYRPWIFLITD 131 T A L A+G T +GAA+ KA ++++R + L++D Sbjct: 228 RMTDAGKAHAKRAVGSLSARGGTNIGAALRKAAKVLDDRLYRNAVES-------VILLSD 280 Query: 132 GAPT 135 G T Sbjct: 281 GQDT 284 >UniRef50_A8SU73 Putative uncharacterized protein n=1 Tax=Coprococcus eutactus ATCC 27759 RepID=A8SU73_9FIRM Length = 550 Score = 68.8 bits (167), Expect = 1e-10, Method: Composition-based stats. Identities = 34/168 (20%), Positives = 62/168 (36%), Gaps = 26/168 (15%) Query: 23 ILLLDVSGSMNGRPINELNAGLVTFRDELLADPLALKRVELGIVTFGPVHVEQPFTSAAN 82 + + D SGSM+G P+N+L L + + +G+V++ + A Sbjct: 377 VFVADCSGSMDGDPMNQLKNSLTNGAQYINDNNY------VGLVSYSNSVTIE--VPIAQ 428 Query: 83 FFPPI----------LFAQGDTPMGAAITKALDMVEERKREYRANGISYYRPWIFLITDG 132 F L A G T A+ A+ M+ E K + +FL++DG Sbjct: 429 FDLNQRSYFQGAVNNLIASGGTASYDAVVVAVKMITEAK-----AQHPDAKCMLFLLSDG 483 Query: 133 APTDEWQAAANKVFRGEEDKRFAFFSIGVQ-GADMKTLAQISVRQPLP 179 + + +++ ++IG AD LA++S Sbjct: 484 YANNGYS--MDEITSALRTSGIPVYTIGYGDDADTGELARLSGINEAA 529 >UniRef50_Q55874 Uncharacterized protein sll0103 n=16 Tax=Cyanobacteria RepID=Y103_SYNY3 Length = 420 Score = 68.8 bits (167), Expect = 1e-10, Method: Composition-based stats. Identities = 43/209 (20%), Positives = 72/209 (34%), Gaps = 31/209 (14%) Query: 2 SEQITFATSDFASNPEPRCPC--ILLLDVSGSMNGRPINELNAGLVTFRDELLADPLALK 59 Q+ A + A + + R P L+LD SGSM+G+P+ + + + D L D Sbjct: 22 QRQLRIAVAAKADDHDRRLPLNLCLVLDHSGSMDGQPLETVKSAALGLIDRLEEDD---- 77 Query: 60 RVELGIVTFGPVHVEQPF------TSAANFFPPILFAQGDTPMGAAITKALDMVEERKRE 113 L ++ F +A L A+G T + + + Sbjct: 78 --RLSVIAFDHRAKIVIENQQVRNGAAIAKAIERLKAEGGTAIDEGLKLGIQEA------ 129 Query: 114 YRANGISYYRPWIFLITDGAPTDEWQAAANKVFRGEEDKRFAFFSIGVQGA-DMKTLAQI 172 A G IFL+TDG K+ D + ++G + L I Sbjct: 130 --AKGKEDRVSHIFLLTDGENEHGDNDRCLKLGTVASDYKLTVHTLGFGDHWNQDVLEAI 187 Query: 173 SVRQPLPLQGLQ--------FRELFSWLS 193 + L ++ FR+LF +S Sbjct: 188 AASAQGSLSYIENPSEALHTFRQLFQRMS 216 >UniRef50_B9GK57 Predicted protein (Fragment) n=2 Tax=Populus trichocarpa RepID=B9GK57_POPTR Length = 595 Score = 68.8 bits (167), Expect = 1e-10, Method: Composition-based stats. Identities = 53/237 (22%), Positives = 86/237 (36%), Gaps = 53/237 (22%) Query: 16 PEPRCPC--ILLLDVSGSMNGRPINELNAGLVTFRDELLADPLALKRVELGIVTF-GPVH 72 P R P + +LDVSGSM G+ I L + L L IVTF Sbjct: 149 PHHRAPIDIVNVLDVSGSMAGKLI-LLKRAVNFIIQNLGPSD------RLSIVTFSSSAR 201 Query: 73 VEQPFTSAANFFPPI-------LFAQGDTPMGAAITKALDMVEERKREYRANGISYYRPW 125 P + + L A G T + A + K + ++EER++ Sbjct: 202 RILPLRTMSGSGREDAISVVNSLSATGGTNIVAGLRKGVRVLEERRQHNSVAS------- 254 Query: 126 IFLITDGAPTDEWQAAANKVF-----------RGEEDKRFAF----FSIGV--QGADMKT 168 I L++DG T + N++ GEE ++ F F G+ A M Sbjct: 255 IILLSDGCDTQS-HSTHNRLEYLKLIFPSNNASGEESRQPTFPIHTFGFGLDHDSAAMHA 313 Query: 169 LAQISVRQPLPLQGLQ-----FRELFSWLSSSLR-----SVSRSTPGTEVVLEAPKG 215 ++ +S ++ + F L+S + V ++PG ++ L P G Sbjct: 314 ISDVSGGTFSFIESIDILQDAFARCIGGLTSIVARDVQLKVRSASPGVQI-LSTPSG 369 >UniRef50_Q5UWJ9 Calcium-binding protein-like n=1 Tax=Haloarcula marismortui RepID=Q5UWJ9_HALMA Length = 1562 Score = 68.8 bits (167), Expect = 1e-10, Method: Composition-based stats. Identities = 36/161 (22%), Positives = 58/161 (36%), Gaps = 23/161 (14%) Query: 20 CPCILLLDVSGSMNGRPINELNAGLVTFRDELLADPLALKRVELGIVTF-GPVHVEQPFT 78 L++D SGSM+ + N F LL A +V F +V Q T Sbjct: 500 VDVTLVMDTSGSMS-SSVKLRNTAGQRFVAGLLDVDRA------AVVDFDSSAYVAQDLT 552 Query: 79 S---AANFFPPILFAQGDTPMGAAITKALDMVEERKREYRANGISYYRPWIFLITDGAPT 135 S AAN L + G T +G+ ++ A + RA + L+TDG Sbjct: 553 SDFGAANSTLDNLGSGGGTDIGSGLSTANSQFASNSNDSRAQ-------VMILLTDGRGN 605 Query: 136 DEWQAAANKVFRGEEDKRFAFFSIGVQGADMKTLAQISVRQ 176 A + ++ +++G A+ L I+ Sbjct: 606 GGISEA-----QTAANQNTTVYTVGFDNANRDKLRDIANIT 641 >UniRef50_UPI000180BC4A PREDICTED: similar to predicted protein n=1 Tax=Ciona intestinalis RepID=UPI000180BC4A Length = 1038 Score = 68.8 bits (167), Expect = 1e-10, Method: Composition-based stats. Identities = 34/184 (18%), Positives = 60/184 (32%), Gaps = 25/184 (13%) Query: 14 SNPEPRCPCILLLDVSGSMNGRPINELNAGLVTFRDELL-ADPLALKRVELGIVTF-GPV 71 +N ++++D SGSM +N + + L D A+ V F V Sbjct: 185 ANVPKPKQIVIVIDKSGSMGVTNMNLAKEAAKSVVNTLNPQDRFAVMAFSSIFVPFQSTV 244 Query: 72 HVEQPFTSAANFFPPI-----------LFAQGDTPMGAAITKALDMVEER--KREYRANG 118 +Q F + P + + G T A+ KA ++ ++ Sbjct: 245 ASDQCFATTFADASPQNKKKVEDFVDTISSGGGTNYAPALQKAFSFFQQEPSVSDFNIKK 304 Query: 119 ISYYRP-----WIFLITDGAPTDEWQAAANKVFRGEE--DKRFAFFSIGVQGADMKTLAQ 171 I P I ++DG P D + R E + + G+ AD L Sbjct: 305 ID---PSEIDRVILFMSDGIPNDPGSTILSAQIRANEQLNNSVIILTYGLGNADFGVLRN 361 Query: 172 ISVR 175 ++ Sbjct: 362 MATN 365 >UniRef50_UPI000180C2AF PREDICTED: similar to FiBrilliN homolog family member (fbn-1) n=1 Tax=Ciona intestinalis RepID=UPI000180C2AF Length = 990 Score = 68.8 bits (167), Expect = 1e-10, Method: Composition-based stats. Identities = 31/169 (18%), Positives = 62/169 (36%), Gaps = 22/169 (13%) Query: 16 PEPRCPCILLLDVSGSMNGRPINELNAGLVTFRDELLADPLALKRVELGIVTFGPVHVEQ 75 P+ R I ++D SGS+ + + + L+ + + +VTF V Sbjct: 808 PKARMDLIFIMDSSGSIGEENFKTMKQFVKNVYERFT---LSDEFTRIAVVTFHSVVQLA 864 Query: 76 -------PFTSAANFFPPILFAQGDTPMGAAITKALDMVEERKREYRANGISYYRPWIFL 128 T N + FA T G A+T + + ++ + Sbjct: 865 NDTEWFYSKTELDNAIDSLQFAGKGTLTGQALTFTREHLIGKREGSTN--------VVIA 916 Query: 129 ITDGAPTDEWQAAANKVFRGEEDKRFAFFSIGVQGADMKTLAQISVRQP 177 +TDG D +AAA ++ ++G+ G+ ++ L+ I+ + Sbjct: 917 VTDGNSKDNSKAAAAELRNM----NVHVMAVGITGSHLRDLSMIASKPA 961 >UniRef50_P79263 Inter-alpha-trypsin inhibitor heavy chain H4 n=3 Tax=Theria RepID=ITIH4_PIG Length = 921 Score = 68.4 bits (166), Expect = 1e-10, Method: Composition-based stats. Identities = 33/121 (27%), Positives = 51/121 (42%), Gaps = 15/121 (12%) Query: 22 CILLLDVSGSMNGRPINELNAGLVTFRDELLADPLALKRVELGIVTFGP-------VHVE 74 I ++D SGSM GR I + L+ +L + R + +V+F V Sbjct: 273 VIFVIDTSGSMRGRKIQQTREALIKILGDLGS------RDQFNLVSFSGEAPRRRAVAAS 326 Query: 75 QPFTSAANFFPPILFAQGDTPMGAAITKALDMVEERKREYRANGISYYRPWIFLITDGAP 134 A + + AQG T + A+ A+ ++E RE S +I L+TDG P Sbjct: 327 AENVEEAKSYAAEIHAQGGTNINDAMLMAVQLLERANREELLPARSVT--FIILLTDGDP 384 Query: 135 T 135 T Sbjct: 385 T 385 >UniRef50_UPI00006A1915 Transmembrane protein 110. n=2 Tax=Xenopus (Silurana) tropicalis RepID=UPI00006A1915 Length = 728 Score = 68.4 bits (166), Expect = 2e-10, Method: Composition-based stats. Identities = 39/180 (21%), Positives = 65/180 (36%), Gaps = 24/180 (13%) Query: 12 FASNPEPRCP--CILLLDVSGSMNGRPINELNAGLVTFRDELLADPLALKRVELGIVTFG 69 FA + P + ++D SGSM+G+ I F L P + GI+ F Sbjct: 238 FAPASLQKVPKNVVFVIDHSGSMHGQKI---KQTYEAFLKILADLP---EEDHFGILIFD 291 Query: 70 P---------VHVEQPFTSAANFFPPILFAQGDTPMGAAITKALDMVEERKREYRANGIS 120 V A F + A+G T + A+ A+ M++ R IS Sbjct: 292 DKVDKWQNTLVKAVPDNIIKAKQFVSKISARGGTDINKALLAAVKMLKNTSRNKLLPKIS 351 Query: 121 YYRPWIFLITDGAPTD---EWQAAANKVFRGEEDKRFAFFSIGVQ-GADMKTLAQISVRQ 176 I ++DG PT N V + E ++ + +G D L ++++ Sbjct: 352 --TSIILFLSDGEPTSGVTNHNEIINNVKKANE-RQTTLYCLGFGNDVDFNFLEKMALEN 408 >UniRef50_A0C9G5 Chromosome undetermined scaffold_16, whole genome shotgun sequence n=2 Tax=Paramecium tetraurelia RepID=A0C9G5_PARTE Length = 648 Score = 68.4 bits (166), Expect = 2e-10, Method: Composition-based stats. Identities = 33/157 (21%), Positives = 58/157 (36%), Gaps = 24/157 (15%) Query: 25 LLDVSGSMNGRPINELNAGLVTFRDELLADPLALKRVELGIVTF-GPVHVEQPFTSAANF 83 ++D SGSM G+ I + LV D L ++ L ++TF G P + Sbjct: 233 VIDKSGSMEGKKIASVQQSLVQLLDFL------SEKDRLCLITFDGSAQRLTPLKTLTQD 286 Query: 84 FPPILF-------AQGDTPMGAAITKALDMVEERKREYRANGISYYRPWIFLITDGAPTD 136 A G T + A + +++RK + + IFL++DG D Sbjct: 287 NKNYFKKAIYSIRASGQTNIAKGTEIAFNQIQQRKMKNQVTS-------IFLLSDGQ--D 337 Query: 137 EWQAAANKVFRGEEDKRFAFFSIGVQ-GADMKTLAQI 172 + A + + + S G D +++I Sbjct: 338 QGAAEYIQRQKDVVEDIVTIHSFGYGSDHDAALMSKI 374 >UniRef50_B0CZQ2 Predicted protein n=2 Tax=Laccaria bicolor S238N-H82 RepID=B0CZQ2_LACBS Length = 1228 Score = 68.4 bits (166), Expect = 2e-10, Method: Composition-based stats. Identities = 31/174 (17%), Positives = 60/174 (34%), Gaps = 17/174 (9%) Query: 6 TFATSDFASNPEPRCPCILLLDVSGSMNGRPINELNAGLVTFRDELLADPLALKRVELGI 65 + +++ +N I ++D S SM G+ E+ L E + +E+ Sbjct: 146 PSSMAEYMTNY----HIIFVIDDSSSMRGQRWTEVRLALTELSKEAMQ--YDADGMEICF 199 Query: 66 VTFGPVH-VEQPFTSAANFFPPILFAQGDTPMGAAITKALD-MVEERKREYRANGISYYR 123 + + F + +G T GA + L+ ++ + + Sbjct: 200 LNSQKRKDCIANPADMLDIF-DAVQPRGWTYTGAKLKFLLNRCIKRFDDAAGKPEYADIK 258 Query: 124 PW-IFLITDGAPTDEW----QAAANKVFRGEEDKRFA---FFSIGVQGADMKTL 169 P I ++TDGAPTD+ A ++ + F IG + L Sbjct: 259 PVDIIVLTDGAPTDDPAVVIADAIRRLDGAKHHLNAIGIQFVQIGDEDGAAAAL 312 >UniRef50_Q498Q0 Inter-alpha (Globulin) inhibitor H3 n=6 Tax=Clupeocephala RepID=Q498Q0_DANRE Length = 892 Score = 68.4 bits (166), Expect = 2e-10, Method: Composition-based stats. Identities = 31/135 (22%), Positives = 52/135 (38%), Gaps = 24/135 (17%) Query: 12 FASNPEPRCP--CILLLDVSGSMNGRPINELNAGLVTFRDELLADPLALKRVELGIVTFG 69 FA R P + ++D SGSM G I + ++ +L D G++TF Sbjct: 257 FAPTDVQRIPKNVVFIIDQSGSMQGNKIEQTRMAMLRILSDLAKDDY------FGLITFS 310 Query: 70 PV---------HVEQPFTSAANFFPPILFAQGDTPMGAAITKALDMVEERKREYRANGIS 120 A F + + G T + A+ A++M+ + +E A+ Sbjct: 311 SHIQAWKPELLKATAENVEEAKTFVKQIRSGGATDINGAVLNAVNMINQYTQEGSAS--- 367 Query: 121 YYRPWIFLITDGAPT 135 + L+TDG PT Sbjct: 368 ----ILILLTDGDPT 378 >UniRef50_UPI00017B4DF5 UPI00017B4DF5 related cluster n=3 Tax=Tetraodontidae RepID=UPI00017B4DF5 Length = 2436 Score = 68.4 bits (166), Expect = 2e-10, Method: Composition-based stats. Identities = 36/166 (21%), Positives = 57/166 (34%), Gaps = 15/166 (9%) Query: 22 CILLLDVSGSMNGRPINELNAGLVTFRDELLADPLALKRVELGIVTFGPVHVEQPF---- 77 + L+D S S+ E+ L + L P + +G+ F H E Sbjct: 5 IVFLVDGSSSIGPSNFQEVRLFLRSLASGLNVSP---DNIRIGLAQFDEPHQEFLLKYHI 61 Query: 78 --TSAANFFPPILFAQGDTPMGAAITKALDMVEERKREYRANGISYYRPWIFLITDGAPT 135 + F + G T G AI +K RA+ +ITDG T Sbjct: 62 EKMNLLAAFESFPYRNGGTETGKAINFLRKQYFTKKAGSRADQRVP--QIAVVITDGDST 119 Query: 136 DEWQAAANKVFRGEEDKRFAFFSIGVQGADMKTLAQISVRQPLPLQ 181 D+ A ++ + F+IGV A+ L I+ R + Sbjct: 120 DDVVVPARELRK----HGVIVFAIGVGNANQGELKSIANRPSERFK 161 Score = 62.6 bits (151), Expect = 1e-08, Method: Composition-based stats. Identities = 36/180 (20%), Positives = 64/180 (35%), Gaps = 19/180 (10%) Query: 17 EPRCPCILLLDVSGSMNGRPINELNAGLVTFRDELLAD-PLALKRVELGIVTFGPVHVEQ 75 + + + LLD SGS+ + F +L+ ++ V +G+ F ++ Sbjct: 985 KQKADLVFLLDQSGSIQSDDY----TTMKKFTIDLINKFQISRDLVHVGLAQFSSTFKDE 1040 Query: 76 -------PFTSAANFFPPILFAQGDTPMGAAITKALDMVEERKREYRANGISYYRPWIFL 128 + + + +G T +G A+ E +A GIS + L Sbjct: 1041 FYLNKFFDEQAISAHIKDMQQEEGGTLIGLALNSIRKYFEASHGSRKAEGIS---QNLVL 1097 Query: 129 ITDGAPTDEWQAAANKVFRGEEDKRFAFFSIGVQGADMKTLAQISVRQPLPLQGLQFREL 188 ITDG D+ + AA + F+IG+ L QI+ F +L Sbjct: 1098 ITDGDSQDDVEEAARLLRGL----GVEVFAIGIGNVHDLELLQIAGTPENVFTVKNFDKL 1153 Score = 62.2 bits (150), Expect = 1e-08, Method: Composition-based stats. Identities = 47/222 (21%), Positives = 80/222 (36%), Gaps = 44/222 (19%) Query: 22 CILLLDVSGSMNGRPINELNAGLVTFRDELLADPLALKRVELGIVTFGPV-------HVE 74 L+D SGS++ ++ ++ F P V +G+V + H Sbjct: 395 IFFLIDQSGSIHPPDFYDMKKFILEFLQTFRVGPN---HVRIGVVKYADSPTLEFDLHTY 451 Query: 75 QPFTSAANFFPPILFAQGDTPMGAAITKALDMVEERKREYRANGISYYRPWIFLITDGAP 134 S I G T G A+ + R R + + Y + +ITDG Sbjct: 452 TDVKSLEKAITNIHQVGGGTETGKALDFMRPQFD-RAVTTRGHKVKEY---LVVITDGNS 507 Query: 135 TDEWQAAANKVFRGEEDKRFAFFSIGVQGADMKTLAQIS--------VRQPLPLQGLQ-- 184 TD+ + A+K+ + ++IGV+ A K L +IS V L+ ++ Sbjct: 508 TDKVKDPADKLRA----QGVVVYAIGVKDAVEKELLEISGEPQRTFYVNNFDALKPIKDD 563 Query: 185 ------------FRELFSWLSSS----LRSVSRSTPGTEVVL 210 FR+ + L S L +V + PG + L Sbjct: 564 IITDICSTDAAVFRKFWVELGGSDLSLLSTVCKDVPGDLIFL 605 Score = 61.5 bits (148), Expect = 2e-08, Method: Composition-based stats. Identities = 31/174 (17%), Positives = 58/174 (33%), Gaps = 19/174 (10%) Query: 22 CILLLDVSGSMNGRPINELNAGLVTFRDELLADPLALKRVELGIVTFG-PVHVEQPFT-- 78 I L+D SGS+ ++ + + + +V +G++ + + P Sbjct: 602 LIFLIDSSGSIYPEDYQKMKDFMKSLVQ---KSNIGKDQVHVGVLQYSTEQKLVFPLIQY 658 Query: 79 ----SAANFFPPILFAQGDTPMGAAITKALDMVEERKREYRANGISYYRPWIFLITDGAP 134 + + G T G AI + + G + + ++TDG Sbjct: 659 YTKDQLSKAIDDMQQIGGGTHTGEAIAVVSKYFD-----AQNGGRPDLKQRLVVVTDGES 713 Query: 135 TDEWQAAANKVFRGEEDKRFAFFSIGVQGADMKTLAQISVRQPLPLQGLQFREL 188 D+ + A + K +SIGV A+ L +IS F L Sbjct: 714 QDDVKLPAEALRA----KGVIVYSIGVVAANTSQLLEISGDADRMYAERDFDAL 763 Score = 49.5 bits (117), Expect = 8e-05, Method: Composition-based stats. Identities = 27/111 (24%), Positives = 41/111 (36%), Gaps = 10/111 (9%) Query: 91 QGDTPMGAAITKALDMVEERKREYRANGISYYRPWIFLITDGAPTDEWQAAANKVFRGEE 150 G T GAA+ V R++ R + +ITDG D+ A + G Sbjct: 275 GGGTNTGAALNFTQHQVFVREKGSRIELGV--QQVAVVITDGRSQDDVSTPAANLRAG-- 330 Query: 151 DKRFAFFSIGVQGADMKTLAQISVRQPLPLQGLQFREL--FSWLSSSLRSV 199 +++GV+ AD L QI+ P L +SL+ V Sbjct: 331 ---VTVYAVGVKDADEAQLHQIAS-YPTKEHTFTVDSFSKLKTLETSLQRV 377 >UniRef50_Q15NW6 Vault protein inter-alpha-trypsin n=1 Tax=Pseudoalteromonas atlantica T6c RepID=Q15NW6_PSEA6 Length = 701 Score = 68.4 bits (166), Expect = 2e-10, Method: Composition-based stats. Identities = 34/152 (22%), Positives = 56/152 (36%), Gaps = 19/152 (12%) Query: 22 CILLLDVSGSMNGRPINELNAGLVTFRDELLADPLALKRVELGIVTFGPVHVEQ----PF 77 + LLD SGSM G I + + +L + + I+ F Sbjct: 305 VVFLLDTSGSMAGESIVQAKRAVDFALTQLRPEDN------VNIIQFNDAPQALWKRAMP 358 Query: 78 TSAANFFPPI-----LFAQGDTPMGAAITKALDMVEERKREYRANGISYYRPWIFLITDG 132 +A + L A G T M A+T AL+ + + G R +F ITDG Sbjct: 359 ATAKHIQRARNWVASLHADGGTEMAPALTLALNKPSLHRDDSDLLGSHKLRQVVF-ITDG 417 Query: 133 APTDEWQAAANKVFRGEEDKRFAFFSIGVQGA 164 + ++ + A + + F+IG+ A Sbjct: 418 SVSN--EDALMSLIESKLADN-RLFTIGIGSA 446 >UniRef50_UPI0000E460BF PREDICTED: similar to inter-alpha-trypsin inhibitor heavy chain3 n=5 Tax=Strongylocentrotus purpuratus RepID=UPI0000E460BF Length = 1028 Score = 68.0 bits (165), Expect = 2e-10, Method: Composition-based stats. Identities = 39/174 (22%), Positives = 63/174 (36%), Gaps = 26/174 (14%) Query: 16 PEPRCPCILLLDVSGSMNGRPINELNAGLVTFRDELLADPLALKRVELGIVTFGP----- 70 P R I ++DVSGSM G+ + T D++ + I+ F Sbjct: 305 PNTRKNVIFVIDVSGSMYGQKTRQTKRAFTTILDDVRP------IDRINIILFSSYAHVW 358 Query: 71 -----VHVEQPFTSAANFFPPILFAQGDTPMGAAITKALDMVEERKREYRANGISYYRPW 125 V +AA L G T + ++ KA++++ E P Sbjct: 359 REDQMVEATSDNIAAAKRHVNGLSVGGGTNIYDSLMKAVEILLEHD-------TGDAMPL 411 Query: 126 IFLITDGAPTDEWQAAANKVFRGEEDKRFAFFSIGVQ-GADMKTLAQISVRQPL 178 I ++TDG + AA + R + FSIG G D L ++S+ Sbjct: 412 IIMLTDGQVGN--AAAIVRDVTSVIGGRLSLFSIGFGNGVDFPFLEKLSLSNQA 463 >UniRef50_A9V370 Predicted protein n=1 Tax=Monosiga brevicollis RepID=A9V370_MONBE Length = 471 Score = 68.0 bits (165), Expect = 2e-10, Method: Composition-based stats. Identities = 34/177 (19%), Positives = 58/177 (32%), Gaps = 27/177 (15%) Query: 11 DFASNPEPRCPCILLLDVSGSMNGRPINELNAGLVTFRDELLADPLALKRVELGIVTF-G 69 DF P + ++DVSGSM G+ + + + L L +VTF Sbjct: 49 DFEPVERPAIDLVAVIDVSGSMAGQKLKMVQSTLEFLMRNLK------DTDRFALVTFDS 102 Query: 70 PVHVEQPFTSAANFFPP-------ILFAQGDTPMGAAITKALDMVEERKREYRANGISYY 122 V L A T + + + ++++++R +S Sbjct: 103 DVKTVFDLRPMTTAHKEACLADVQKLRAGSCTNLSGGLFRGVELMQQR--GATKGAVSS- 159 Query: 123 RPWIFLITDGAPT------DEWQAAANKVFRGEEDKRFAFFSIGVQGADMKTLAQIS 173 I L+TDG D+ A + D F G + + L Q+S Sbjct: 160 ---ILLMTDGIANEGVRDKDDMCRALRGLMGPAPDYTIYTFGYG-KDHNENMLRQLS 212 >UniRef50_UPI00005843FB PREDICTED: hypothetical protein n=1 Tax=Strongylocentrotus purpuratus RepID=UPI00005843FB Length = 429 Score = 68.0 bits (165), Expect = 2e-10, Method: Composition-based stats. Identities = 39/191 (20%), Positives = 66/191 (34%), Gaps = 32/191 (16%) Query: 22 CILLLDVSGSMNGRPINELNAGLVTFRDELLADPLALKRVELGIVTFGP----------- 70 + ++DVS SM G +++ L T D L I+TF Sbjct: 165 IVFVIDVSASMYGTKLSQTKEALKTMLDNLNP------TDYFNIITFSDGVQYWRENNRL 218 Query: 71 VHVEQPFTSAANFFPPILFAQGDTPMGAAITKALDMVEERKREYRANGISYYRPWIFLIT 130 ++ + A + L +T + AI KA ++++ R Y G S Y + L+T Sbjct: 219 APAQRRYMDDAMAYVDSLRDDSETNLNEAIVKAGELLDSEAR-YNRPGDSVYS-MMILLT 276 Query: 131 DGAP---TDEWQAAANKVFRGEEDKR-FAFFSIGVQGADMKTLAQISVRQPLPLQ----- 181 DG P T + Q + K G + D L +++ + Sbjct: 277 DGRPSVGTTDQQEILDNAREVIAGKHSLNILGFG-RLVDFDLLVKLAYENNGTAKMIYEG 335 Query: 182 ---GLQFRELF 189 Q RE + Sbjct: 336 TTAAEQLREFY 346 >UniRef50_C5Z1W1 Putative uncharacterized protein Sb10g030210 n=1 Tax=Sorghum bicolor RepID=C5Z1W1_SORBI Length = 607 Score = 68.0 bits (165), Expect = 2e-10, Method: Composition-based stats. Identities = 45/196 (22%), Positives = 79/196 (40%), Gaps = 35/196 (17%) Query: 20 CPCILLLDVSGSMN--GRPINELNAGLVTFRDELLADPLALKRVELGIVTF-GPVHVEQP 76 +++LDVSGSM GR +++L + + +L L +VTF G E P Sbjct: 105 LDLVVVLDVSGSMRDFGR-LDKLKSAMRFIIKKLAPMD------RLSVVTFNGGATRECP 157 Query: 77 FTSAANFFPPILF-------AQGDTPMGAAITKALDMVEERKR-EYRANGISYYRPWIFL 128 + + P+L A+G T + A + L +++ R+ R G + L Sbjct: 158 LRAMSEDAVPVLTDIVDGLVARGGTNIEAGLKMGLQVLDGRRYTGARTAG-------VIL 210 Query: 129 ITDGAPTDEWQAAANKVFRGEEDKRFAFFSIGVQGADMKTLAQISV-----RQPLPLQGL 183 ++DG + R ++ S G ADM L +++ L G+ Sbjct: 211 MSDGEQN----SGDATRVRNPQNYPVYTLSFG-SNADMNLLQKLAGGGGTYNPVLDSGGM 265 Query: 184 QFRELFSWLSSSLRSV 199 ++FS L + L +V Sbjct: 266 SMLDVFSQLMAGLLTV 281 >UniRef50_Q8YP40 Alr4360 protein n=15 Tax=Cyanobacteria RepID=Q8YP40_ANASP Length = 427 Score = 68.0 bits (165), Expect = 2e-10, Method: Composition-based stats. Identities = 39/193 (20%), Positives = 64/193 (33%), Gaps = 35/193 (18%) Query: 2 SEQITFATSDFASNPEPRCPC--ILLLDVSGSMNGRPINELNAGLVTFRDELLADPLALK 59 Q+ + S A E P L+LD SGSM+G+P+ + + D L Sbjct: 22 QRQLAISISAVAEQFEQNLPLNLCLILDQSGSMHGQPLKMVVEAVEKLLDRLQPGD---- 77 Query: 60 RVELGIVTF-GPVHVEQP------FTSAANFFPPILFAQGDTPMGAAITKALDMVEERKR 112 + +V F G V P S L A G T + + + + + + R Sbjct: 78 --RISVVAFAGSATVIIPNQIVENPESIKTQIRKKLQASGGTVIAEGLQQGITELMKGTR 135 Query: 113 EYRANGISYYRPWIFLITDGAPTDE-----WQAAANKVFRGEEDKR------FAFFSIGV 161 + FL+TDG D W+ + R E + ++G Sbjct: 136 GAVSQA--------FLLTDGHGEDSLKIWKWEIGPDDSRRCLEFAKKAAKINLTINTLGF 187 Query: 162 QGA-DMKTLAQIS 173 + L I+ Sbjct: 188 GNNWNQDLLETIA 200 >UniRef50_Q6PGW2 Zgc:112265 protein (Fragment) n=15 Tax=Clupeocephala RepID=Q6PGW2_DANRE Length = 927 Score = 68.0 bits (165), Expect = 2e-10, Method: Composition-based stats. Identities = 39/185 (21%), Positives = 72/185 (38%), Gaps = 29/185 (15%) Query: 12 FASNPEPRCP--CILLLDVSGSMNGRPINELNAGLVTFRDELLADPLALKRVELGIVTFG 69 FA + P P + ++D SGSM+GR I + + L+T +L D G++TF Sbjct: 265 FAPSDVPHIPKNVVFIIDRSGSMHGRKIRQTRSALLTILKDLDEDD------HFGLITFD 318 Query: 70 PV---------HVEQPFTSAANFFPPILFAQGDTPMGAAITKALDMVEERKREYRANGIS 120 + A F + +G T + A+ +DM+ R+ A+ Sbjct: 319 AEIDFWRRELLQATKANRENAESFVKRIQDRGATNINDAVLAGVDMINRNPRKGTAS--- 375 Query: 121 YYRPWIFLITDGAPTDEWQAAANKVFRGEE---DKRFAFFSIGVQ-GADMKTLAQISVRQ 176 + L+TDG PT + K+ + +F + +G + L ++S+ Sbjct: 376 ----ILILLTDGDPTAG-ETNIEKIMANVKEAIGSKFPLYCLGFGYDVNFDFLTKMSLEN 430 Query: 177 PLPLQ 181 + Sbjct: 431 NAVAR 435 >UniRef50_A9U149 Predicted protein n=1 Tax=Physcomitrella patens subsp. patens RepID=A9U149_PHYPA Length = 1185 Score = 68.0 bits (165), Expect = 2e-10, Method: Composition-based stats. Identities = 40/165 (24%), Positives = 59/165 (35%), Gaps = 29/165 (17%) Query: 12 FASNPEPRCPCILLLDVSGSMNGRPINELNAGLVTFRDELLADPLALKRVELGIVTFG-P 70 FA P I ++D SGSM G PI + L F + + I+ FG Sbjct: 457 FALRPMSSSELIFVVDRSGSMQGTPIKQAGQALELFLRSIPCEDH-----YFNIIGFGDN 511 Query: 71 VHVEQPFTSAANF--------FPPILFAQ-GDTPMGAAITKALDMVEERKREYRANGISY 121 P ++ N + L A G T M +A ++ E R+R+ Sbjct: 512 HKTLFPKSTPYNEETLTKGLRYAQALEADMGGTEMMSAFE---EIFEHRRRDVPTQ---- 564 Query: 122 YRPWIFLITDGAP--TDEWQAAANKVFRGEEDKRFA-FFSIGVQG 163 IFL+TDG D + E+ F FS+G+ Sbjct: 565 ----IFLLTDGEIWDVDSLIECIRDAKKEEKSDNFVRVFSLGIGS 605 >UniRef50_Q14624 35 kDa inter-alpha-trypsin inhibitor heavy chain H4 n=38 Tax=Eutheria RepID=ITIH4_HUMAN Length = 930 Score = 67.6 bits (164), Expect = 3e-10, Method: Composition-based stats. Identities = 39/204 (19%), Positives = 73/204 (35%), Gaps = 31/204 (15%) Query: 7 FATSDFASNPEPRCPCILLLDVSGSMNGRPINELNAGLVTFRDELLADPLALKRVELGIV 66 FA + P+ + ++D SGSM+GR I + L+ D+L R + ++ Sbjct: 263 FAPEGLTTMPKN---VVFVIDKSGSMSGRKIQQTREALIKILDDLSP------RDQFNLI 313 Query: 67 TFGPVHV-----EQPFTSAANFFPPILFAQ----GDTPMGAAITKALDMVEERKREYRAN 117 F P ++ A G T + A+ A+ +++ +E R Sbjct: 314 VFSTEATQWRPSLVPASAENVNKARSFAAGIQALGGTNINDAMLMAVQLLDSSNQEERLP 373 Query: 118 GISYYRPWIFLITDGAPT--DEWQAAANKVFRGEEDKRFAFFSIGVQ-GADMKTLAQISV 174 S I L+TDG PT + + R R++ F +G L ++++ Sbjct: 374 EGSVS--LIILLTDGDPTVGETNPRSIQNNVREAVSGRYSLFCLGFGFDVSYAFLEKLAL 431 Query: 175 --------RQPLPLQGLQFRELFS 190 LQ ++ + Sbjct: 432 DNGGLARRIHEDSDSALQLQDFYQ 455 >UniRef50_A0D1M1 Chromosome undetermined scaffold_34, whole genome shotgun sequence n=1 Tax=Paramecium tetraurelia RepID=A0D1M1_PARTE Length = 1460 Score = 67.2 bits (163), Expect = 3e-10, Method: Composition-based stats. Identities = 36/134 (26%), Positives = 46/134 (34%), Gaps = 13/134 (9%) Query: 23 ILLLDVSGSMNGRPINELNAGLVTFRDELLADPLALKRVELGIVTFG-PVHVEQPFT--S 79 IL+LD SGSM G GLV F E+ +P + I+ F + Sbjct: 1277 ILILDDSGSMEGAFFEAAKKGLVAFLQEIQKNP----ESRVTIILFNHQARCVVDYEIPD 1332 Query: 80 AANFFPPILFAQGDTPMGAAITKALDMVEERKREYRANGISYYRPWIFLITDGAPTDEWQ 139 A I F G T + A D + + S IF TDG Sbjct: 1333 AQVQQKEIQFRGGGTDFDEPLKLAFDKIANNPDFDNFSSHS-----IFFYTDGQ-AQYPT 1386 Query: 140 AAANKVFRGEEDKR 153 A KV + DKR Sbjct: 1387 KAMEKVKQFPSDKR 1400 >UniRef50_Q5RHF3 Novel protein similar to vertebrate inter-alpha (Globulin) inhibitor H5 (ITIH5) (Fragment) n=2 Tax=Danio rerio RepID=Q5RHF3_DANRE Length = 906 Score = 67.2 bits (163), Expect = 3e-10, Method: Composition-based stats. Identities = 36/191 (18%), Positives = 63/191 (32%), Gaps = 39/191 (20%) Query: 7 FATSDFASNPEPRCPCILLLDVSGSMNGRPINELNAGLVTFRDELLADPLALKRVELGIV 66 FA D P+ + ++D S SM G + + L T +EL + V Sbjct: 242 FAPRDLPVVPKN---VVFVIDTSASMLGTKMKQTKQALFTIINELRPNDN------FNFV 292 Query: 67 TFGP-VHVEQP--FTSAANFFPP-------ILFAQGDTPMGAAITKALDMVEERKREYRA 116 TF + V QP ++ G T + I ++ + + Sbjct: 293 TFSNRIRVWQPGKLVPVTPISIRDAKKFIYMISVTGGTDINGGIQTGSALLSDYLS-SKD 351 Query: 117 NGISYYRPWIFLITDGAPT----------DEWQAAANKVFRGEEDKRFAFFSIGVQ-GAD 165 + I +TDG PT + A + +F F+IG+ D Sbjct: 352 ESHHHSVSLIIFLTDGRPTVGVLQSPTIISNTKTAVQE--------KFCLFTIGMGDDVD 403 Query: 166 MKTLAQISVRQ 176 + L ++S+ Sbjct: 404 YRLLERMSLDN 414 >UniRef50_UPI00016C377F protein containing a von Willebrand factor type A domain n=1 Tax=Gemmata obscuriglobus UQM 2246 RepID=UPI00016C377F Length = 821 Score = 67.2 bits (163), Expect = 3e-10, Method: Composition-based stats. Identities = 33/165 (20%), Positives = 54/165 (32%), Gaps = 29/165 (17%) Query: 22 CILLLDVSGSMNGRPINELNAGLVTFRDELLADPLALKRVELGIVTFGP---------VH 72 +L+LD S SM+ + + + +L + G+V F V Sbjct: 273 LVLVLDTSSSMSDIKMQQAKKAVKFCLSQLQPED------RFGVVRFSTTVTKFRSELVA 326 Query: 73 VEQPFTSAANFFPPILFAQGDTPMGAAITKALDMVEERKREYRANGISYYRPW-IFLITD 131 + A + L G T + A+ AL M RP+ + TD Sbjct: 327 ANTDYLDLATKWIDGLKTSGGTAIWPALNDALAMRSSDPS----------RPFTMVFFTD 376 Query: 132 GAPTDEWQAAANKVFR--GEEDKRFAFFSIGVQ-GADMKTLAQIS 173 G PT + A V + F+ GV + L Q++ Sbjct: 377 GQPTVDETNADKIVKNVLAKNTGNTRIFTFGVGDDVNAAMLDQLA 421 >UniRef50_A1ZQA6 Putative uncharacterized protein n=3 Tax=cellular organisms RepID=A1ZQA6_9SPHI Length = 3238 Score = 67.2 bits (163), Expect = 3e-10, Method: Composition-based stats. Identities = 29/134 (21%), Positives = 46/134 (34%), Gaps = 23/134 (17%) Query: 17 EPRCPCILLLDVSGSM---------NGRPINELNAGLVTFRDELLADPLALKRVELGIVT 67 EP I +LD+SGSM G+ I+ A A ++ Sbjct: 2205 EPPIEVIYVLDMSGSMKWEYPKTDDAGKTISRFRAAQDALIYANSALAQQGMSSRSALIV 2264 Query: 68 FGPVHVEQP------FTSAANFFPPILFAQGDTPMGAAITKALDMVEERKREYRANGISY 121 F + F + + G TPM + A ++++ R + Sbjct: 2265 FNDTTAQVMSGFTNNFQQLNSIVENLGAPNGGTPMSKGMLSAKELLKTRS--------AD 2316 Query: 122 YRPWIFLITDGAPT 135 +P + LITDG PT Sbjct: 2317 KKPVVVLITDGVPT 2330 >UniRef50_B6TZ81 Protein binding protein n=9 Tax=Magnoliophyta RepID=B6TZ81_MAIZE Length = 516 Score = 67.2 bits (163), Expect = 4e-10, Method: Composition-based stats. Identities = 32/167 (19%), Positives = 55/167 (32%), Gaps = 29/167 (17%) Query: 20 CPCILLLDVSGSMNGRPINELNAGLVTFRDELLADPLALKRVELGIVTF-GPVHVEQPFT 78 + +LDVSGSM G I ++ + +L + L IVTF + P Sbjct: 62 LDLVAVLDVSGSMQGEKIEKMKTAMKFVVKKLSS------IDRLSIVTFLDTANRICPLQ 115 Query: 79 SAANFFPPI-------LFAQGDTPMGAAITKALDMVEERKREY-RANGISYYRPWIFLIT 130 P L G+T + + L ++ +RK R G + L++ Sbjct: 116 QVTEDSQPQLLKLIDALQPGGNTNISDGLQTGLKVLADRKLSSGRVVG-------VMLMS 168 Query: 131 DGAPTDEWQAAANKVFRGEEDKRFAFFSIGVQ-GADMKTLAQISVRQ 176 DG Q + + ++ G D L ++ Sbjct: 169 DG------QQNRGEPAANVKIGNVPVYTFGFGADYDPTVLNAVARNS 209 >UniRef50_A8U1E2 Putative uncharacterized protein n=1 Tax=alpha proteobacterium BAL199 RepID=A8U1E2_9PROT Length = 683 Score = 66.9 bits (162), Expect = 4e-10, Method: Composition-based stats. Identities = 39/152 (25%), Positives = 58/152 (38%), Gaps = 28/152 (18%) Query: 22 CILLLDVSGSMNGRPINELNAGLVTFRDELLADPLALKRVELGIVTFGP---VHVEQPFT 78 I ++DVSGSM G P+ A L + + L + +V F + P Sbjct: 330 VIFVIDVSGSMKGEPLRAAKASLTSGIEGLGRNDT------FNVVAFNNKAAAFYDAPVR 383 Query: 79 SAANFFPPI------LFAQGDTPMGAAITKALDMVEERKREYRANGISYYRPWIFLITDG 132 ++ F L A G T M AA AL M G + ITDG Sbjct: 384 ASGKFHRAALKVIDGLKAGGGTEMAAAFELALQM----------PGDPDRLQQVVFITDG 433 Query: 133 APTDEWQAAANKVFRGEEDKRFAFFSIGVQGA 164 A ++E A N++ +R F++G+ A Sbjct: 434 AVSNE-AALFNQIKGELGARRL--FTVGIGSA 462 >UniRef50_UPI00006A1B4A Collagen alpha-3(VI) chain precursor. n=5 Tax=Xenopus (Silurana) tropicalis RepID=UPI00006A1B4A Length = 2535 Score = 66.9 bits (162), Expect = 4e-10, Method: Composition-based stats. Identities = 36/163 (22%), Positives = 59/163 (36%), Gaps = 19/163 (11%) Query: 22 CILLLDVSGSMNGRPINELNAGLVTFRDELLADPLALKRVELGIVTFGP-VHVEQPFTSA 80 + L+D S S+N + + + + + RV++G++ F E P Sbjct: 999 IVFLVDSSASINSDDYETMKEFMESM---VKQAEIGPDRVQIGLIQFSSETKEEFPLNRY 1055 Query: 81 ANFFPPILFAQG------DTPMGAAITKALDMVEERKREYRANGISYYRPWIFLITDGAP 134 +G T MG A+ L K R N Y + +ITDG Sbjct: 1056 KRKDEIQSAIRGIQQLSQGTLMGEALKYTLPYFSASKGG-RVNTKQY----LIVITDGEA 1110 Query: 135 TDEWQAAANKVFRGEEDKRFAFFSIGVQGADMKTLAQISVRQP 177 D A + D ++IGVQ A+ L +I+ +Q Sbjct: 1111 QD----AVGNPAKAIRDHGVIIYAIGVQQANNTQLLEIAGKQE 1149 Score = 64.9 bits (157), Expect = 2e-09, Method: Composition-based stats. Identities = 34/176 (19%), Positives = 59/176 (33%), Gaps = 22/176 (12%) Query: 22 CILLLDVSGSMNGRPINELNAGLVTFRDELLADPLALKRVELGIVTFGPVHVEQPFTS-- 79 L+D SGS+ ++ ++ + RV G+V + V + F S Sbjct: 797 IYFLIDGSGSIYPEDFEDMKKFMIELISMFQ---VGANRVRFGVVQYSDVRRTEFFISEH 853 Query: 80 -----AANFFPPILFAQGDTPMGAAITKALDMV--EERKREYRANGISYYRPWIFLITDG 132 + I G T G A+T + + R ++ + +ITDG Sbjct: 854 NTQKMLKDAISQIEQLGGGTLTGEALTSMKQLFVNAAKDRPHKVPQS------LVVITDG 907 Query: 133 APTDEWQAAANKVFRGEEDKRFAFFSIGVQGADMKTLAQISVRQPLPLQGLQFREL 188 D AA ++ F+IGV+ A + + I+ F L Sbjct: 908 ESQDRVTEAAAEIRND----GITIFAIGVKNAVEEEIRDIAGSNEKMFFVNNFDSL 959 Score = 49.9 bits (118), Expect = 6e-05, Method: Composition-based stats. Identities = 31/165 (18%), Positives = 59/165 (35%), Gaps = 20/165 (12%) Query: 22 CILLLDVSGSMNGRPINELNAGLVT--FRDELLADPLALKRVELGIVTFGPV-HVEQPFT 78 + L+D S S I +N L + A ++L V +G+V + +E Sbjct: 602 IVFLIDESSS-----IGPINFQLTRVFLHKVVSALDISLSNVRVGLVLYSDEPRLELKLN 656 Query: 79 ------SAANFFPPILFAQGDTPMGAAITKALDMVEERKREYRANGISYYRPWIFLITDG 132 +F + + G GAA+ + ++ R + + ++T+G Sbjct: 657 TFNEKYEILDFITKLPYRGGKAHTGAALDFLRKKMFTKQNGGRPHQGV--QQIAVVMTNG 714 Query: 133 APTDEWQAAANKVFRGEEDKRFAFFSIGVQGADMKTLAQISVRQP 177 D + A K+ R F++G Q + L I+ P Sbjct: 715 QSMDNFTKPAAKLRRS----GVEVFAVGFQNINDTELDIIASHPP 755 Score = 48.4 bits (114), Expect = 2e-04, Method: Composition-based stats. Identities = 39/159 (24%), Positives = 63/159 (39%), Gaps = 13/159 (8%) Query: 22 CILLLDVSGSM-NGRPINELNAGLVTFRDELLADPLAL-KRVELGIVTFGPVHVEQPFTS 79 I LLD S S+ G + F + ++ D L V+ G V +G EQ + Sbjct: 1206 IIFLLDASASITRGEF-----RLMQRFVEAVVNDSLVGKDNVQFGAVVYGTNPAEQFSLN 1260 Query: 80 AANFFPPILFAQGDTPMGAAITKALDMVEERKREYRANGISYYRP----WIFLITDGAPT 135 + IL A P + T +E + + + RP + L+TDGA T Sbjct: 1261 TYSTKLDILKAVFSLPQVSGYTYTAKALEYTRIRFGTSYGG--RPGISHILILVTDGATT 1318 Query: 136 DEWQAAANKVFRGEEDKRFAFFSIGVQGADMKTLAQISV 174 + + V + +D F++GV A + L QI+ Sbjct: 1319 EADRPNLPIVSKALKDDGIIVFAVGVGKAVPQELQQIAG 1357 >UniRef50_UPI00006CDA4D Glutathionylspermidine synthase family protein n=1 Tax=Tetrahymena thermophila RepID=UPI00006CDA4D Length = 1547 Score = 66.9 bits (162), Expect = 5e-10, Method: Composition-based stats. Identities = 37/154 (24%), Positives = 59/154 (38%), Gaps = 10/154 (6%) Query: 19 RCPCILLLDVSGSMNGRPINELNAGLVTFRDELLADPLALKRVELGIVTFGPV-HVEQPF 77 R I LLD SGSM+G+PI+ L F L D +++FG + P Sbjct: 305 RSEFIFLLDRSGSMSGQPIDRACQALTLFLKSLPTDSY------FNVISFGSSFKLLFPQ 358 Query: 78 TSAANFFPPILFAQGDTPMGAAITKALDMVEERKREYRANGISYYRPWIFLITDGAPTDE 137 + N + A + ++ + K + N I Y +FL+TDG D Sbjct: 359 SEKYNSQSLEKAISNISKYKADLG-GTEIYKPLKNVFVQNKIQGYNKQVFLLTDGE-VDS 416 Query: 138 WQAAANKVFRGEEDKRFAFFSIGVQGADMKTLAQ 171 + + + + + R G GAD + Q Sbjct: 417 PEQVISLIRKNNKFSRVHSIGFG-SGADQYLINQ 449 >UniRef50_A5UTA6 von Willebrand factor, type A n=6 Tax=Chloroflexi (class) RepID=A5UTA6_ROSS1 Length = 425 Score = 66.9 bits (162), Expect = 5e-10, Method: Composition-based stats. Identities = 38/186 (20%), Positives = 66/186 (35%), Gaps = 27/186 (14%) Query: 18 PRCPC--ILLLDVSGSMNGRPINELNAGLVTFRDELLADPLALKRVELGIVTFGP-VHVE 74 P+ P L+LD S SM G + ++ D+L D +V F V Sbjct: 41 PKLPLNLCLVLDRSSSMRGERLMQVKEAAARIVDQLGPDDY------FSLVVFNDRADVV 94 Query: 75 QPFT-----SAANFFPPILFAQGDTPMGAAITKALDMVEERKREYRANGISYYRPWIFLI 129 P S + A G T M + AL V+ R + GIS + L+ Sbjct: 95 IPAQRAIKKSDLKAAIAQIEAAGGTEMAQGLALALQEVQ---RPFLTRGISR----LILL 147 Query: 130 TDGAPTDEWQAAANKVFRGEEDKRFAFFSIGVQGA-DMKTLAQIS----VRQPLPLQGLQ 184 TDG T ++ ++ R + + ++G+ + L ++ R Sbjct: 148 TDGR-TYGDESRCVEIARRGQSRGIGLTALGIGTEWNEDLLETMTASENSRAQYIATAQD 206 Query: 185 FRELFS 190 ++F+ Sbjct: 207 VVKVFA 212 >UniRef50_A6H584 Collagen alpha-5(VI) chain n=2 Tax=Mus musculus RepID=CO6A5_MOUSE Length = 2640 Score = 66.9 bits (162), Expect = 5e-10, Method: Composition-based stats. Identities = 39/194 (20%), Positives = 72/194 (37%), Gaps = 20/194 (10%) Query: 2 SEQITFATSDFASNPEPRCPCILLLDVSGSMNGRPINELNAGLVTFRDELLADPLALKRV 61 +EQ+ + E L+D S S+ + ++ + + D P+ +V Sbjct: 457 AEQMELDKTGCVDTKEA--DIYFLIDGSSSIRKKEFEQIQIFMSSVIDMF---PIGPNKV 511 Query: 62 ELGIVTFG-------PVHVEQPFTSAANFFPPILFAQGDTPMGAAITKALDMVEERKREY 114 +G+V + PV I +G T G A+ L ++++ K E Sbjct: 512 RVGVVQYSHKNEVEFPVSRYTDGIDLKKAVFNIKQLKGLTFTGKALDFILPLIKKGKTE- 570 Query: 115 RANGISYYRPWIFLITDGAPTDEWQAAANKVFRGEEDKRFAFFSIGVQGADMKTLAQISV 174 R + Y + ++TDG D AN++ + +IG+ A+ L QI+ Sbjct: 571 RTDRAPCY---LIVLTDGKSNDSVLEPANRLRAE----QITIHAIGIGEANKTQLRQIAG 623 Query: 175 RQPLPLQGLQFREL 188 + G F L Sbjct: 624 KDERVNFGQNFDSL 637 Score = 61.8 bits (149), Expect = 2e-08, Method: Composition-based stats. Identities = 31/176 (17%), Positives = 65/176 (36%), Gaps = 15/176 (8%) Query: 20 CPCILLLDVSGSMNGRPINELNAGLVTFRDELLADPLALKRVELGIVTFGPVHVEQ---- 75 + +LD SGS+ R + + + + RV++G +T+ Sbjct: 845 LDIVFVLDHSGSIGPREQESM---MNLTIHLVKKADVGRDRVQIGALTYSNHPEILFYLN 901 Query: 76 ---PFTSAANFFPPILFAQGDTPMGAAITKALDMVEERKREYRANGISYYRPWIFLITDG 132 ++ A G+T A+ + +++ + R R + +ITDG Sbjct: 902 TYSSGSAIAEHLRRPRDTGGETYTAKALQHS-NVLFTEEHGSRLTQNV--RQLMIVITDG 958 Query: 133 APTDEWQAAANKVFRGEEDKRFAFFSIGVQGADMKTLAQISVRQPLPLQGLQFREL 188 D + ++ R DK F++GV A+ L ++ ++ + F +L Sbjct: 959 VSHDRDK--LDEAARELRDKGITIFAVGVGNANQDELETMAGKKENTVHVDNFDKL 1012 Score = 59.1 bits (142), Expect = 1e-07, Method: Composition-based stats. Identities = 36/171 (21%), Positives = 65/171 (38%), Gaps = 19/171 (11%) Query: 22 CILLLDVSGSMNGRPINELNAGLVTFRDELLADPLALKRVELGIVTFGPVHVEQ----PF 77 + L+D SGS+ + + ++ + R ++G+V F + E+ + Sbjct: 661 IMFLVDSSGSIGPTNFETMKTFMKNLVGKIQ---IGADRSQVGVVQFSDYNREEFQLNKY 717 Query: 78 TSAANFFP---PILFAQGDTPMGAAITKALDMVEERKREYRANGISYYRPWIFLITDGAP 134 ++ + + +T G A+T + + K G R ++ L+TDG Sbjct: 718 STHEEIYAAIDRMSPINRNTLTGGALTFVNEYFDLSKG-----GRPQVRKFLILLTDGKA 772 Query: 135 TDEWQAAANKVFRGEEDKRFAFFSIGVQGADMKTLAQISVRQPLPLQGLQF 185 DE A + K FS+GV GA+ L +IS L F Sbjct: 773 QDEVGGPATALR----SKSVTIFSVGVYGANRAQLEEISGDGSLVFHVENF 819 >UniRef50_Q9M1S2 Putative uncharacterized protein T5N23_140 n=5 Tax=Arabidopsis thaliana RepID=Q9M1S2_ARATH Length = 676 Score = 66.9 bits (162), Expect = 5e-10, Method: Composition-based stats. Identities = 32/170 (18%), Positives = 54/170 (31%), Gaps = 26/170 (15%) Query: 18 PRCPCILL--LDVSGSMNGRPINELNAGLVTFRDELLADPLALKRVELGIVTFGP-VHVE 74 R P L+ LD+SGSM G + L + L + L ++ F Sbjct: 239 RRAPIDLVTVLDISGSMGGTKLALLKRAMGFVIQNLGSSD------RLSVIAFSSTARRL 292 Query: 75 QPFTSAANFFPPI-------LFAQGDTPMGAAITKALDMVEERKREYRANGISYYRPWIF 127 P T ++ + L A G T + + K ++E+R I Sbjct: 293 FPLTRMSDAGRQLALQAVNSLVANGGTNIVDGLRKGAKVMEDRLERNSVAS-------II 345 Query: 128 LITDGAPTDEWQAAANKVFRGEEDKRFAFFSIGVQ-GADMKTLAQISVRQ 176 L++DG D + + + S G D + +S Sbjct: 346 LLSDGR--DTYTTNHPDPSYKVMLPQISVHSFGFGSDHDASVMHSVSEVS 393 >UniRef50_A0DIJ2 Chromosome undetermined scaffold_52, whole genome shotgun sequence n=4 Tax=Eukaryota RepID=A0DIJ2_PARTE Length = 2542 Score = 66.9 bits (162), Expect = 5e-10, Method: Composition-based stats. Identities = 28/179 (15%), Positives = 59/179 (32%), Gaps = 17/179 (9%) Query: 17 EPRCPCILLLDVSGSMNGRPINELNAGLVTFRDELLADPLALKRVELGIVTFGPVHVEQ- 75 P+ I ++D SGSM+G P N + + + + ++ F Sbjct: 2355 SPKIHYIFMIDDSGSMSGSPWNTAKNCCLNCLSTIEKN----LNARVSVIIFNSTARIAI 2410 Query: 76 --PFTSAANFFPPILFAQGDTPMGAAITKALDMV-EERKREYRANGISYYRPWIFLITDG 132 + I F G T G+A +A ++ + + ++ + +Y TDG Sbjct: 2411 NCEIVNLVEMEKKIQFNSGSTDFGSAFQQAYKLIVQHQNDAFQKTEVLFY-------TDG 2463 Query: 133 APTDEWQAAANKVFRGEEDKRFAFFSIGV-QGADMKTLAQISVRQPLPLQGLQFRELFS 190 + + ++ F + A+ +L I L + ++ F Sbjct: 2464 GAA-YPKEQVKLFTEIPDHQKARIFIHCCTEEANATSLQMIVNEMNRSLIKSELKQKFQ 2521 >UniRef50_UPI0001C1630F hypothetical protein CRD_00534 n=2 Tax=Nostocaceae RepID=UPI0001C1630F Length = 587 Score = 66.9 bits (162), Expect = 5e-10, Method: Composition-based stats. Identities = 36/182 (19%), Positives = 65/182 (35%), Gaps = 26/182 (14%) Query: 22 CIL--LLDVSGSMNGRPINELNAGLVTFRDELLADPLALKRVELGIVTFGPVHVEQ---- 75 L ++D SGSM+G P+ + GL ++ +G+V++G + Sbjct: 412 VYLMTVIDTSGSMSGGPLEAVKNGLRIASQQINPGNY------VGLVSYGDQPINLVKLA 465 Query: 76 PFTSAAN----FFPPILFAQGDTPMGAAITKALDMVEERKREYRANGISYYRPWIFLITD 131 PF + L A G T M + L + ++ R+ NG Y + L+TD Sbjct: 466 PFDDLQHKRFLAGIDGLEADGATAMYDGVMVGLSELLQQ-RKTNPNGKFY----LLLLTD 520 Query: 132 GAPTDEWQAAANKVFRGEEDKRFAFFSIGVQGADMKTLAQISVRQPLPLQ---GLQFREL 188 G + +V E + I + L I+ + ++ +EL Sbjct: 521 GQTNQGF--NFEQVKEIIEYSGVRVYPIAYGEVNEAELNAIAALRESTVKKGTPENVQEL 578 Query: 189 FS 190 Sbjct: 579 LK 580 >UniRef50_A4C730 Putative uncharacterized protein n=1 Tax=Pseudoalteromonas tunicata D2 RepID=A4C730_9GAMM Length = 684 Score = 66.9 bits (162), Expect = 5e-10, Method: Composition-based stats. Identities = 40/171 (23%), Positives = 64/171 (37%), Gaps = 32/171 (18%) Query: 5 ITFATSDFASNPEPRCP--CILLLDVSGSMNGRPINELNAGLVTFRDELLADPLALKRVE 62 + + +F + R P I ++D SGSM+G + + + L L DP Sbjct: 315 LMPPSDEFIA--AQRLPREVIFVIDTSGSMHGESLEQAKSALFFALANL--DPQDS---- 366 Query: 63 LGIVTFGPV--HVEQPFTSAANFFPPI-------LFAQGDTPMGAAITKALDMVEERKRE 113 I+ F + A +F L A G T +G A + LD Sbjct: 367 FNIIEFNSKVNALNAQALPANDFNIRRARNFVYGLKADGGTEIGLAFEQVLD-------- 418 Query: 114 YRANGISYYRPWIFLITDGAPTDEWQAAANKVFRGEEDKRFAFFSIGVQGA 164 + Y R +FL TDG+ ++E ++ D R F+IG+ A Sbjct: 419 -NSEHADYLRQIVFL-TDGSISNE-TEVFAQIKGSLGDSR--IFTIGIGSA 464 >UniRef50_Q235T9 von Willebrand factor type A domain containing protein n=5 Tax=Tetrahymena thermophila RepID=Q235T9_TETTH Length = 703 Score = 66.9 bits (162), Expect = 5e-10, Method: Composition-based stats. Identities = 42/204 (20%), Positives = 70/204 (34%), Gaps = 37/204 (18%) Query: 8 ATSDFASNP---EPRCPCILLLDVSGSMNG-RPINELNAGLVTFRDELLADPLALKRVEL 63 + SNP P I ++D SGSMN I + ++ + L + L Sbjct: 196 NDMEVKSNPLEGRPNLDLICVIDNSGSMNDFSKIENVKNTILQLLEMLNEND------RL 249 Query: 64 GIVTFG-PVHVEQPFTSAANFFPPIL-------FAQGDTPMGAAITKALDMVEERKREYR 115 ++TF + N L A G T + I A +++ RK++ Sbjct: 250 SLITFNTKAKQLCGLKNVNNQNKKSLQTITKSIKADGGTDIIRGIEIAFQILQSRKQKNS 309 Query: 116 ANGISYYRPWIFLITDGAPTDEWQAA-----ANKVFRGEEDKRFAFFSIGVQGAD----M 166 + IFL++DG D A ++ +++ F S G M Sbjct: 310 VSS-------IFLLSDGQ--DNLADAGIKNLLKTTYKQLQEESFTIHSFGFGNDHDGPLM 360 Query: 167 KTLAQISVRQP-LPLQGLQFRELF 189 + +AQI + Q E F Sbjct: 361 QKIAQIKDGSFYFVEKNDQVDEFF 384 >UniRef50_A4YGI9 von Willebrand factor, type A n=1 Tax=Metallosphaera sedula DSM 5348 RepID=A4YGI9_METS5 Length = 363 Score = 66.5 bits (161), Expect = 6e-10, Method: Composition-based stats. Identities = 32/192 (16%), Positives = 62/192 (32%), Gaps = 38/192 (19%) Query: 1 MSEQITFATSDFASNPEPRCPCILLLDVSGSMNGRPINELNAGLVTFRDELLADPLALKR 60 M + S F + ++L+D S SM G + G D L D Sbjct: 1 MFRIVLVPESKFEA---KNLHYVILIDRSYSMKGEKLEMAKEGARLLVDNLPKDS----- 52 Query: 61 VELGIVTFGP----VHVEQPFTSAANFFPPILFAQGDTPMGAAITKALDMVEERKREYRA 116 ++ F + + + L T M A+ +A ++ + Sbjct: 53 -RFSLLAFNEKVSIIKEHEHPSEMGK-ELESLKVGSGTAMYKALQEAFNLARKYGEPT-- 108 Query: 117 NGISYYRPWIFLITDGAPTD-------EWQAAANKV------FRGEEDKRFAFFSIGVQG 163 ++ L+TDG P+D + N+ E+ + F IG Sbjct: 109 --------YVILLTDGVPSDMGCMPGLSRKFDLNRCLPVYQGLSVPENVQIISFGIGDDY 160 Query: 164 ADMKTLAQISVR 175 ++ + L ++S + Sbjct: 161 SE-EILTEVSEK 171 >UniRef50_A2RNJ3 Putative uncharacterized protein n=2 Tax=Lactococcus lactis RepID=A2RNJ3_LACLM Length = 1444 Score = 66.5 bits (161), Expect = 6e-10, Method: Composition-based stats. Identities = 33/139 (23%), Positives = 51/139 (36%), Gaps = 27/139 (19%) Query: 13 ASNPEPRCPCILLLDVSGSMNGRPINELNAGLVTFRDELLADPLALKRVELGIVTFG--- 69 NP +L++D+SGSM G + G+ +F + A V +G+V + Sbjct: 297 VQNPIKPVDIVLVVDMSGSMEGAREGAIKQGVKSFLSSIENTAYAQY-VNVGLVGYSSPG 355 Query: 70 ----PVHVEQPF--------TSAANFFPPILFAQGD-TPMGAAITKALDMVEERKREYRA 116 ++ P SA N F G T +G I + M++E Sbjct: 356 YISNSGYITVPMESLATDGHVSAMNKALERQFVGGTFTQLG--IRQGAQMLKEDASGNEK 413 Query: 117 NGISYYRPWIFLITDGAPT 135 I L+TDG PT Sbjct: 414 --------MIILMTDGVPT 424 >UniRef50_Q0A603 von Willebrand factor, type A n=1 Tax=Alkalilimnicola ehrlichii MLHE-1 RepID=Q0A603_ALHEH Length = 972 Score = 66.5 bits (161), Expect = 6e-10, Method: Composition-based stats. Identities = 39/209 (18%), Positives = 68/209 (32%), Gaps = 26/209 (12%) Query: 22 CILLLDVSGSMNGRPINELNAGLVTFRDELLADPLALKRVELGIVTFG-PVHVEQPFTSA 80 L++D SGSM+G PI T D + A +G+V F V P + Sbjct: 331 ISLIVDTSGSMSGAPIINARTAGRTLVDVVEPGRTA-----MGVVRFSASASVVHPMIAI 385 Query: 81 ANFFPPI----------LFAQGDTPMGAAITKALDMVEERKREYRANGISYYRPWIFLIT 130 + L A G T M + LD +++ + FL++ Sbjct: 386 PDPGTAEKDQLKDAIDSLPASGLTAMFDGLILGLDELQDYSAANDTDAGQ----VAFLLS 441 Query: 131 DGAPTDEWQAAAN-KVFRGEEDKRFAFFSIGVQG-ADMKTLAQISVRQ--PLPLQGLQFR 186 DG D AA + + +D + G A L +++ Sbjct: 442 DGG--DNSSAATEPQTVQAYQDANVPIIAFGYGSFAPTGVLRRLADNTGGEFFASPTTLA 499 Query: 187 ELFSWLSSSLRSVSRSTPGTEVVLEAPKG 215 E+ ++ +VS + ++ G Sbjct: 500 EIQEAFLAANAAVSDAVNLSQESQPVAAG 528 >UniRef50_B4VXM6 Vault protein inter-alpha-trypsin n=1 Tax=Microcoleus chthonoplastes PCC 7420 RepID=B4VXM6_9CYAN Length = 928 Score = 66.5 bits (161), Expect = 6e-10, Method: Composition-based stats. Identities = 38/172 (22%), Positives = 57/172 (33%), Gaps = 27/172 (15%) Query: 15 NPEPRCP--CILLLDVSGSMNGRPINELNAGLVTFRDELLADPLALKRVELGIVTFGPVH 72 NP P + L+D SGS +G P+N+ + F + L I+ F Sbjct: 419 NPHQLVPKDVVFLIDTSGSQSGEPLNKCQELMRRFINGLNPHDT------FTIIDFSDTT 472 Query: 73 VEQPFTSAANF---------FPPILFAQGDTPMGAAITKALDMVEERKREYRANGISYYR 123 + AN + L A G T + I L+ E R+ Sbjct: 473 RQLSPVPLANTVQNRNSAMNYINQLNASGGTQLRRGIQAVLNFPEVDPGRLRS------- 525 Query: 124 PWIFLITDGAPTDEWQAAANKVFRGEEDKRFAFFSIGVQGADMKTLAQISVR 175 I L+TDG +E Q A + R F G + L +I+ Sbjct: 526 --IVLLTDGYIGNENQILAEVQRHLKLGNRLHSFGAG-SSVNRFLLNRIAEI 574 >UniRef50_B8AXM1 Putative uncharacterized protein n=1 Tax=Oryza sativa Indica Group RepID=B8AXM1_ORYSI Length = 614 Score = 66.1 bits (160), Expect = 7e-10, Method: Composition-based stats. Identities = 33/170 (19%), Positives = 63/170 (37%), Gaps = 29/170 (17%) Query: 20 CPCILLLDVSGSMNGRPINELNAGLVTFRDELLADPLALKRVELGIVTFG-PVHVEQPFT 78 + +LDVSGSM G + + + F +L D L +V+F V T Sbjct: 30 VDVVAVLDVSGSMEGERLEHVKEAMEIFIGKLGPDD------RLSVVSFATSVRRLTELT 83 Query: 79 SAANFFPPI-------LFAQGDTPMGAAITKALDMVEERK--REYRANGISYYRPWIFLI 129 + + L A G T MGAA+ + ++ +RK R+ + + + Sbjct: 84 YMSEQGRAVAKEIVDGLVADGSTNMGAALLEGAMILRDRKGARDESNGRVGC----MMFL 139 Query: 130 TDGAPTDEWQAAANKVFRGEEDKRFAFFSIGVQ-GADMKTLAQISVRQPL 178 +DG D ++++ + F + G+ + + I+ Sbjct: 140 SDGT-ND-------EIYKEDISGEFPAHTFGLGSDHNPNVMRHIADETSA 181 >UniRef50_Q0AZS1 Mg-chelatase subunit ChlD-like protein n=1 Tax=Syntrophomonas wolfei subsp. wolfei str. Goettingen RepID=Q0AZS1_SYNWW Length = 592 Score = 66.1 bits (160), Expect = 8e-10, Method: Composition-based stats. Identities = 40/165 (24%), Positives = 62/165 (37%), Gaps = 30/165 (18%) Query: 22 CILLLDVSGSMNGRPINELNAGLVTFRDELLADPLALKRVELGIVTFG--PVHVEQPFT- 78 LL+D SGSM G F L + L + ++ +VTF V PFT Sbjct: 412 VCLLIDASGSMAGDKRQA-----ACF---LAQNLLLSGKEKVAVVTFQERSSEVVVPFTR 463 Query: 79 --SAANFFPPILFAQGDTPMGAAITKALDMVEERKREYRANGISYYRPWIFLITDGAPT- 135 + N + G TPM I A+++++ + P + LI+DG P Sbjct: 464 NQNILNKGLSTISPAGLTPMADGIMTAVNLIKNNRVRN---------PLLVLISDGIPNI 514 Query: 136 -----DEWQAAANKVFRGEEDKRFAFFSIGVQGADMKTLAQISVR 175 D A E+K F IG++ + L ++S Sbjct: 515 PLWTLDAQADALEAATHIRENK-IHFICIGLES-NRFYLEKLSAN 557 >UniRef50_Q22UB9 von Willebrand factor type A domain containing protein n=1 Tax=Tetrahymena thermophila SB210 RepID=Q22UB9_TETTH Length = 2269 Score = 66.1 bits (160), Expect = 8e-10, Method: Composition-based stats. Identities = 32/165 (19%), Positives = 62/165 (37%), Gaps = 19/165 (11%) Query: 15 NPEPRCPCILLLDVSGSMNGRPINELNAGLVTFRDELLADPLALKRVELGIVTFGPV--- 71 NP I++ D SGSM G L L+ F D + A + ++ F Sbjct: 2097 NP---VHFIIVFDESGSMEGEKWITLRKELLNFIDN-RSRATAQD--FITLIGFAHTVKL 2150 Query: 72 -HVEQPFTSAANFFPPILFAQGDTPMGAAITKALDMVEERK-REYRANGISYYRPWIFLI 129 + P F G T A + +AL+++ + + + ++ N IF + Sbjct: 2151 YTKVEKLNEQIKQKVPQEFMDGGTNYSAPLQQALNILSQEQCQTFKKNN------VIFFL 2204 Query: 130 TDGAPTDEWQAAANKVFRGEEDKRFAFFSIGVQGADMKTLAQISV 174 +DG E + ++ + + F +G + +TL ++ Sbjct: 2205 SDGD-AKEPKTEIQQLQKLGHLIKLIQF-VGYGDENFQTLKSMAN 2247 >UniRef50_Q9LN03 T6D22.13 n=2 Tax=Arabidopsis thaliana RepID=Q9LN03_ARATH Length = 641 Score = 66.1 bits (160), Expect = 8e-10, Method: Composition-based stats. Identities = 25/140 (17%), Positives = 48/140 (34%), Gaps = 21/140 (15%) Query: 20 CPCILLLDVSGSMNGRPINELNAGLVTFRDELLADPLALKRVELGIVTFGP-VHVEQPFT 78 I +LDVSGSM+G + + + L + L +++F P Sbjct: 203 LDLITVLDVSGSMDGVKMELMKNAMSFVIQNL------GETDRLSVISFSSMARRLFPLR 256 Query: 79 SAAN-------FFPPILFAQGDTPMGAAITKALDMVEERKREYRANGISYYRPWIFLITD 131 + L A G T + + ++E R+ + +G + L++D Sbjct: 257 LMSETGKQAAMQAVNSLVADGGTNIAEGLKIGARVIEGRRWKNPVSG-------MMLLSD 309 Query: 132 GAPTDEWQAAANKVFRGEED 151 G + A ++ E Sbjct: 310 GQDNFTFSHAGVRLRTDYES 329 >UniRef50_Q1D9B7 von Willebrand factor type A domain protein n=3 Tax=Cystobacterineae RepID=Q1D9B7_MYXXD Length = 476 Score = 66.1 bits (160), Expect = 8e-10, Method: Composition-based stats. Identities = 35/189 (18%), Positives = 59/189 (31%), Gaps = 29/189 (15%) Query: 18 PRCPC--ILLLDVSGSMNGRPINELNAGLVTFRDELLADPLALKRVELGIVTFGPVHVEQ 75 R P L++D SGSM+G + + L L I+ +G Sbjct: 91 QRSPVNLALVIDRSGSMSGYKLAQAKQAARHLIGLLNDQD------RLAIIHYGSDVKSL 144 Query: 76 PFTSAANFFPPILFA-------QGDTPMGAAITKALDMVEERKREYRANGISYYRPWIFL 128 P A +F +G T +GA ++ + +R Y N + L Sbjct: 145 PSLEATAANRERMFQYVDGIWDEGGTNIGAGLSAGRYQLSTAQRTYGVNR-------LIL 197 Query: 129 ITDGAPTDEWQ--AAANKVFRGEEDKRFAFFSIGVQ-GADMKTLAQI----SVRQPLPLQ 181 ++DG PT+ ++ R +IGV + + + Sbjct: 198 MSDGQPTEGLTADEELTRMARELRATGLTLSAIGVGTDFNEDLMQAFAEYGAGAYGFLED 257 Query: 182 GLQFRELFS 190 Q LF Sbjct: 258 AAQLSTLFQ 266 >UniRef50_C9LLI0 Magnesium-chelatase, subunit D/I family n=1 Tax=Dialister invisus DSM 15470 RepID=C9LLI0_9FIRM Length = 640 Score = 66.1 bits (160), Expect = 8e-10, Method: Composition-based stats. Identities = 38/130 (29%), Positives = 56/130 (43%), Gaps = 14/130 (10%) Query: 24 LLLDVSGSMNGRPINELNAGLVTFRDELLADPLALKRVELGIVTF--GPVHVEQPFTSAA 81 L+D SGSM R E + ++LAD KR +G++ F V P T + Sbjct: 453 FLVDASGSMGAR---ERMKAVKGVVFKMLADAY-QKRDRVGMIAFRRDRAEVLLPITRSI 508 Query: 82 NFFPPILFA---QGDTPMGAAITKALDMVEERKREYRANGISYYRPWIFLITDGAPTDEW 138 F L A G TP+ + KA DM++ R Y+ + + P + LITDG T+ Sbjct: 509 EFAQKKLAALPTGGKTPLAQGLIKAEDMLD---RLYKQDPLQD--PVLILITDGRATNSL 563 Query: 139 QAAANKVFRG 148 + V Sbjct: 564 NKNTDPVRDA 573 >UniRef50_A2AX52 Collagen alpha-4(VI) chain n=12 Tax=Chordata RepID=CO6A4_MOUSE Length = 2309 Score = 66.1 bits (160), Expect = 8e-10, Method: Composition-based stats. Identities = 36/171 (21%), Positives = 59/171 (34%), Gaps = 23/171 (13%) Query: 22 CILLLDVSGSMNGRPINELNAGLVTFRDELLADPLALKRVELGIVTFGPVHVEQPFTSAA 81 L+D SGS+ E+ + P RV G+V + + F Sbjct: 850 IYFLIDGSGSIKPNDFIEMKDFMKEVIKMFHIGP---DRVRFGVVQYSD-KIISQFFLTQ 905 Query: 82 NFFPPILFA--------QGDTPMGAAITKALDMVEERKREYRANGISYYRPWIFLITDGA 133 L A G T G A++ MV + R + Y + +ITDG Sbjct: 906 YASMAGLSAAIDNIQQVGGGTTTGKALS---KMVPVFQNTARIDVARY----LIVITDGQ 958 Query: 134 PTDEWQAAANKVFRGEEDKRFAFFSIGVQGADMKTLAQISVRQPLPLQGLQ 184 TD AA + ++IGV+ A+ L +I+ ++ + Sbjct: 959 STDPVAEAAQGLRDI----GVNIYAIGVRDANTTELEEIASKKMFFIYEFD 1005 Score = 59.5 bits (143), Expect = 8e-08, Method: Composition-based stats. Identities = 31/179 (17%), Positives = 57/179 (31%), Gaps = 19/179 (10%) Query: 17 EPRCPCILLLDVSGSMNGRPINELNAGLVTFRDELLADPLALKRVELGIVTFGPVHVEQ- 75 + I L+D S S+ + ++ + + + +++G++ F E+ Sbjct: 1026 SQKADIIFLIDGSESIAPKDFEKMKDFMERMVN---QSNIGADEIQIGLLQFSSNPQEEF 1082 Query: 76 ------PFTSAANFFPPILFAQGDTPMGAAITKALDMVEERKREYRANGISYYRPWIFLI 129 + T G A+ L + + R Y + +I Sbjct: 1083 RLNRYSSKVDMCRAILSVQQMSDGTHTGKALNFTLPFFDSSRGG-RPRVHQY----LIVI 1137 Query: 130 TDGAPTDEWQAAANKVFRGEEDKRFAFFSIGVQGADMKTLAQISVRQPLPLQGLQFREL 188 TDG D A + D+ F+IGV L +I+ Q Q F L Sbjct: 1138 TDGVSQDNVAPPAKALR----DRNIIIFAIGVGNVQRAQLLEITNDQDKVFQEENFESL 1192 Score = 50.3 bits (119), Expect = 5e-05, Method: Composition-based stats. Identities = 40/208 (19%), Positives = 83/208 (39%), Gaps = 22/208 (10%) Query: 22 CILLLDVSGSMNGRPINELNAGLVTFRDELLADPLALKRVELGIVTFGP-VHVEQPF--T 78 + L+D S S+ + ++ L + L + +V++G+V + ++ P + Sbjct: 236 IVFLVDSSTSIGLQNFQKVKHFLHSVVSGL---DVRSDQVQVGLVQYSDNIYPAFPLKQS 292 Query: 79 SAANFFPPIL----FAQGDTPMGAAITKALDMVEERKREYRA-NGISYYRPWIFLITDGA 133 S + + ++ G T G+A+ RA +G+ + L+TDG Sbjct: 293 SLKSAVLDRIRNLPYSMGGTSTGSALEFIRANSLTEMSGSRAKDGVP---QIVVLVTDGE 349 Query: 134 PTDEWQAAANKVFRGEEDKRFAFFSIGVQGADMKTLAQISVRQ--PLPLQGLQFRELFSW 191 +DE Q A+++ R F +G+ D++ L +I+ F + Sbjct: 350 SSDEVQDVADQLKRD----GVFVFVVGINIQDVQELQKIANEPFEEFLFTTENF-SILQA 404 Query: 192 LSSSLRSVSRSTPGTEVVLEAPKGWTSV 219 LS +L ST ++ ++ K + V Sbjct: 405 LSGTLLQALCSTVERQMK-KSTKTYADV 431 >UniRef50_B4BQC0 von Willebrand factor type A n=2 Tax=Geobacillus RepID=B4BQC0_9BACI Length = 668 Score = 66.1 bits (160), Expect = 9e-10, Method: Composition-based stats. Identities = 27/129 (20%), Positives = 43/129 (33%), Gaps = 20/129 (15%) Query: 17 EPRCPCILLLDVSGSMNGRPINELNAGLVTFRDELLAD----------PLALKRVELGIV 66 P + ++DVSGSM + + L + + P + +V Sbjct: 194 RPPIDVVFVMDVSGSMTTMKLQSAKSALQAAVNYFKTNYHPNDRFALIPFSDDVKATSVV 253 Query: 67 TFGPVHVEQPFTSAANFFPPILFAQGDTPMGAAITKALDMVEERKREYRANGISYYRPWI 126 FG A L A G T AA++ A + +R+ +I Sbjct: 254 PFGSKSNVISQLDAILDEGNRLTANGGTNYSAALSLAQSYFNDPERKK----------YI 303 Query: 127 FLITDGAPT 135 +TDG PT Sbjct: 304 IFLTDGMPT 312 >UniRef50_UPI000174639B hypothetical protein VspiD_18615 n=1 Tax=Verrucomicrobium spinosum DSM 4136 RepID=UPI000174639B Length = 868 Score = 66.1 bits (160), Expect = 9e-10, Method: Composition-based stats. Identities = 35/176 (19%), Positives = 65/176 (36%), Gaps = 27/176 (15%) Query: 24 LLLDVSGSMNGRPINELNAGLVTFRDELLADPLALKRVELGIVTFGP-VHVEQPFTSAAN 82 L++D SGSM+G + + + + L + +G+ F HV P T + Sbjct: 417 LVIDRSGSMSGEKLEMAKSAAIATAEVLTRNDS------IGVYAFDSEAHVVVPMTRLTS 470 Query: 83 FFPPI-----LFAQGDTPMGAAITKALDMVEERKREYRANGISYYRPWIFLITDGAPTDE 137 L + G T + A T+A + ++ K + + + ++TDG + + Sbjct: 471 SSAVAGQIAGLTSGGGTNLHPAFTEARNALQRTKAKIKH---------MIILTDGQTSGQ 521 Query: 138 WQAAANKVFRGEEDKRFAFFSIGVQGADMKTLAQIS----VRQPLPLQGLQFRELF 189 A R E + +IG GA + L I+ + L +F Sbjct: 522 GYEALASQCRAE-GVTISTVAIG-DGAHVGLLQAIASLGGGKSYTTLDAANIVRIF 575 >UniRef50_UPI0000E4A663 PREDICTED: similar to calcium activated chloride channel 1 precursor n=2 Tax=Strongylocentrotus purpuratus RepID=UPI0000E4A663 Length = 1245 Score = 66.1 bits (160), Expect = 9e-10, Method: Composition-based stats. Identities = 41/177 (23%), Positives = 67/177 (37%), Gaps = 27/177 (15%) Query: 20 CPCILLLDVSGSM-NGRPINELNAGLVTFRDELLADPLALKRVELGIVTF-GPVHVEQPF 77 C +L+LD SGSM I+++N+ F + L+ D ++ +GIVTF G Sbjct: 523 CRVVLVLDTSGSMGTSNRIDKVNSAATAFVN-LVDDGIS-----IGIVTFTGSPTTRHAL 576 Query: 78 T---------SAANFFPPILFAQGDTPMGAAITKALDMVEERKREYRANGISYYRPWIFL 128 T S + F L A G T +G + + L+++ G I L Sbjct: 577 TQINTQADRDSLRDIF--QLTASGGTCIGCGLEQGLEVLMAHPSGSADGG------IIVL 628 Query: 129 ITDGAPTDEWQAAANKVFRGEEDKRFAFFSIGVQG-ADMKTLAQISVRQPLPLQGLQ 184 +TDG Q + + R +IG ++ +AQ + Q Sbjct: 629 MTDGQ-DSGIQNHIIRQTLQDMGVRVNTVAIGEDAYGELSLIAQETGVNVEITQPNN 684 >UniRef50_Q23JA0 von Willebrand factor type A domain containing protein n=2 Tax=Tetrahymena thermophila SB210 RepID=Q23JA0_TETTH Length = 1049 Score = 65.7 bits (159), Expect = 9e-10, Method: Composition-based stats. Identities = 39/168 (23%), Positives = 56/168 (33%), Gaps = 38/168 (22%) Query: 19 RCPCILLLDVSGSMNGRPINELNAGLVTFRDELLADPLALKRVELGIVTFGPVHVEQPFT 78 R I LLD SGSM+G+PI L F L D +++FG Sbjct: 310 RSEFIFLLDRSGSMSGQPIRRACEALTLFLKSLPNDSY------FNVISFGSS------- 356 Query: 79 SAANFFPPILFAQGDTPMGAAITKALDMVEERKREYRANGISYYRPW------------- 125 LF ++ KA+ ++ K + G Y P Sbjct: 357 ------FDKLFPSSTKYTSESLEKAILLI--SKYQADLGGTEIYNPLNNVFVQNKIQGYN 408 Query: 126 --IFLITDGAPTDEWQAAANKVFRGEEDKRFAFFSIGVQGADMKTLAQ 171 IFL+TDG D Q + + + R G GAD + + Sbjct: 409 KQIFLLTDGE-VDSPQQVVRLIKKNNKYNRVHSIGFG-SGADQYLIKE 454 >UniRef50_A6WMD3 Vault protein inter-alpha-trypsin domain protein n=11 Tax=Shewanella RepID=A6WMD3_SHEB8 Length = 772 Score = 65.7 bits (159), Expect = 1e-09, Method: Composition-based stats. Identities = 37/159 (23%), Positives = 55/159 (34%), Gaps = 24/159 (15%) Query: 17 EPRCP--CILLLDVSGSMNGRPINELNAGLVTFRDELLADPLALKRVELGIVTFGPVHVE 74 +P P IL++D SGSM G I + L+ L + I+ F Sbjct: 387 QPSLPRELILVIDTSGSMAGDSIVQAKNALLYALKGLKPEDS------FNIIEFNSSLSL 440 Query: 75 QPFTSA---------ANFFPPILFAQGDTPMGAAITKALDMVEERKREYRANGISYYRPW 125 T A F L A G T M A+ AL + + R Sbjct: 441 LSATPLPATSSNLSRARQFVSRLQADGGTEMALALDAALPKSLGSVS---PDAVQPLRQV 497 Query: 126 IFLITDGAPTDEWQAAANKVFRGEEDKRFAFFSIGVQGA 164 IF +TDG + + A + R + F++G+ A Sbjct: 498 IF-MTDG--SVGNEQALFDLIR-YQIGESRLFTVGIGSA 532 >UniRef50_Q8PU63 Putative chloride channel n=1 Tax=Methanosarcina mazei RepID=Q8PU63_METMA Length = 1004 Score = 65.7 bits (159), Expect = 1e-09, Method: Composition-based stats. Identities = 37/174 (21%), Positives = 65/174 (37%), Gaps = 26/174 (14%) Query: 13 ASNPEPRCPCILLLDVSGSMNGRPINELNAGLVTFRDELLADPLALKRVELGIVTFGPV- 71 N +L++D SGSM+G PI+ F D + A+ +A G+V+F Sbjct: 306 EDNANANANVMLVIDRSGSMSGSPISSAKNSANLFIDYMEAEDMA------GVVSFSSSA 359 Query: 72 ------HVEQPFT-SAANFFPPILFAQGDTPMGAAITKALDMVEERKREYRANGISYYRP 124 P ++ ++A G T +G+ + L+ + P Sbjct: 360 RYDYHLATLTPEVKNSIKQKINSIYASGVTAIGSGMRYGLNDLLNYGDPNN--------P 411 Query: 125 W-IFLITDGAPTDEWQAAANKVFRGEEDKRFAFFSIGVQGA-DMKTLAQISVRQ 176 W I L++DG N V + +++G+ A D K L I+ + Sbjct: 412 WAIVLLSDGYQNS--GENPNNVIPSIKASNIQVYTVGLGPAVDQKLLGNIADQT 463 >UniRef50_UPI0000F1FEC5 PREDICTED: similar to Clca1 protein n=2 Tax=Danio rerio RepID=UPI0000F1FEC5 Length = 903 Score = 65.7 bits (159), Expect = 1e-09, Method: Composition-based stats. Identities = 36/212 (16%), Positives = 62/212 (29%), Gaps = 36/212 (16%) Query: 17 EPRCPCILLLDVSGSMNGR-PINELNAGLVTFRDELLADPLALKRVELGIVTFGPVHVEQ 75 + L+LDVSGSM I + ++ +GIV F Sbjct: 295 RKKRAVCLILDVSGSMATESRILRMRQAATHLLRN-----YVEEQASVGIVKFSTAASIV 349 Query: 76 P-----FTSAANFFPPIL---FAQGDTPMGAAITKALDMVEERKREYRANGISYYRPWIF 127 + A L G T M + L ++ E + + I Sbjct: 350 SSLTIIESDATRDHLINLLPETPGGSTNMCNGLRLGLQVLSEDDMDAIGDE-------II 402 Query: 128 LITDGAPTDEWQAAANKVFRGEEDKRFAFFSIGVQGADMKTLAQISVRQP-------LPL 180 +TDG TD+ + +I + + L +++ + Sbjct: 403 FLTDGQATDD----VTLCIPDAINSGAIIHTIALSDSAHNALQEMADKTGGIFFYSKDDF 458 Query: 181 QGLQFRELFSWLSSSLRSVSRSTPGTEVVLEA 212 Q + F+ L+ S S V LE+ Sbjct: 459 TSNQLMDAFASLTLSTGDHS----NEPVQLES 486 >UniRef50_C1XTM1 Uncharacterized protein containing a von Willebrand factor type A (VWA) domain n=2 Tax=Deinococci RepID=C1XTM1_9DEIN Length = 464 Score = 65.7 bits (159), Expect = 1e-09, Method: Composition-based stats. Identities = 47/205 (22%), Positives = 68/205 (33%), Gaps = 47/205 (22%) Query: 5 ITFATSDFASNPEPRCPCILLLDVSGSMN-----------------GRPINELNAG---- 43 + S A+ P+ ++D SGSM G+ + Sbjct: 29 LKIRPSAEATRSRPQLVVAFVVDTSGSMREVVTEPTERTGQSVRVDGKDYEVVRGAKSKI 88 Query: 44 --LVTFRDELLADPLALKRVELGIVTFGPVH-VEQPFTSAANFFPPILFA--------QG 92 ++ LL+ P L IV F V V QPFT A L A G Sbjct: 89 DLVIEALQNLLSSPQLQPSDRLAIVKFDDVAEVVQPFTPANE--KARLVAAAERLTQYSG 146 Query: 93 DTPMGAAITKALDMVEERKREYRANGISYYRPWIFLITDGAPTDEWQAAANKVFRGEEDK 152 T MGA + + + ++E R + L+TDG DE V Sbjct: 147 GTQMGAGMREGMRLLEREAGSRR----------LILLTDGQTFDEP--LVETVAAQLAQA 194 Query: 153 RFAFFSIGVQGA-DMKTLAQISVRQ 176 R +IGV + LA+I+ R Sbjct: 195 RIPVTAIGVGDEWNDDLLAEITDRT 219 >UniRef50_C3ZCZ5 Putative uncharacterized protein (Fragment) n=1 Tax=Branchiostoma floridae RepID=C3ZCZ5_BRAFL Length = 371 Score = 65.7 bits (159), Expect = 1e-09, Method: Composition-based stats. Identities = 42/173 (24%), Positives = 71/173 (41%), Gaps = 30/173 (17%) Query: 14 SNPEPRCPCILLLDVSGSMNGRPINELNAGLVTFRDELLADPLALKRVELGIVTFGP-VH 72 +NP + +LD SGS+ R ++ AG+ + +AL +G+V + V Sbjct: 217 NNP---VDIVFVLDGSGSVGRRNFEKVQAGVKKIVGDFN---IALDSTRVGVVQYSSIVR 270 Query: 73 VEQPFTSAANFF------PPILFAQGDTPMGAAITKALDMVEERKREYRANGISYYRP-- 124 E + +N I + G T GAA+ A+ + ANG RP Sbjct: 271 QEFALDTFSNLQGLESGIQSIPYMAGGTRTGAAMEYAI-----QNSFTSANGA---RPDV 322 Query: 125 --WIFLITDGAPTDEWQAAANKVFRGEEDKRFAFFSIGVQGADMKT-LAQISV 174 I L+TDG D+ A+ K + F++G+ +++ L QI+ Sbjct: 323 GHVIVLVTDGRSYDDVSQASQKAKQA----GIVVFAVGIGDGAVESQLNQIAS 371 Score = 49.1 bits (116), Expect = 1e-04, Method: Composition-based stats. Identities = 32/164 (19%), Positives = 60/164 (36%), Gaps = 20/164 (12%) Query: 22 CILLLDVSGSMNGRPINELNAGLVTFRDELLADPLALKRVELGIVTFGPVHVEQPFTSAA 81 I +LD SGS+ N++ + L P + ++ + ++ A Sbjct: 4 IIFMLDGSGSVGPDNFNKMKEFVKKTVGGYLIGPS---NTRVAVMQYSSSVRQEFALDAF 60 Query: 82 NFFPPIL-------FAQGDTPMGAAITKALDMVEERKREYRANGISYYRP-WIFLITDGA 133 N +L + +G T G A+T+ R+ +NG P ++TDG Sbjct: 61 NTLEDLLVGIEEIRYMRGGTRTGKALTRL-----RRQGFLESNGARKNVPHVAVIVTDGR 115 Query: 134 PTDEWQAAANKVFRGEEDKRFAFFSIGVQGADMKTLAQISVRQP 177 +D AA + + +++GV D+ L I+ Sbjct: 116 SSDSVDQAALETRQS----GIVLYAVGVGNYDLGQLTDIASTNE 155 >UniRef50_C9RU69 von Willebrand factor type A n=2 Tax=Geobacillus RepID=C9RU69_GEOSY Length = 1077 Score = 65.3 bits (158), Expect = 1e-09, Method: Composition-based stats. Identities = 29/129 (22%), Positives = 43/129 (33%), Gaps = 20/129 (15%) Query: 17 EPRCPCILLLDVSGSMNGRPINELNAGLVTFRDELLAD----------PLALKRVELGIV 66 P + ++DVSGSM + + L + ++ P + E +V Sbjct: 195 RPPIDVVFVMDVSGSMTAMKLQSAKSALQAAVNYFKSNYNQNDRFALIPFSDGVREASVV 254 Query: 67 TFGPVHVEQPFTSAANFFPPILFAQGDTPMGAAITKALDMVEERKREYRANGISYYRPWI 126 FG A L A G T AA++ A K + Y I Sbjct: 255 PFGKYSNVASQLDAILNTGNSLTAGGGTNYSAALSLA-------KSYFTDPTRKKY---I 304 Query: 127 FLITDGAPT 135 +TDG PT Sbjct: 305 IFLTDGMPT 313 >UniRef50_Q9U7P4 Putative uncharacterized protein (Fragment) n=1 Tax=Eufolliculina uhligi RepID=Q9U7P4_9CILI Length = 494 Score = 65.3 bits (158), Expect = 1e-09, Method: Composition-based stats. Identities = 23/124 (18%), Positives = 43/124 (34%), Gaps = 25/124 (20%) Query: 20 CPCILLLDVSGSMNGRPINELNAGLVTFRDELLADPLALKRVELGIVTFGPVHVEQPFTS 79 + ++DVSGSM G I + L + L + +++F + + Sbjct: 89 VDIVCVIDVSGSMQGEKIQLVQTTLNFMVERLSPAD------RICLISFSNDATK--ISR 140 Query: 80 AANFFP----------PILFAQGDTPMGAAITKALDMVEERKREYRANGISYYRPWIFLI 129 P P L A G T + + L + +R+ + + I L+ Sbjct: 141 LVQMSPKGKKQLKSMIPRLVASGGTNIVGGLEYGLQALRQRRTINQLSS-------IILL 193 Query: 130 TDGA 133 +DG Sbjct: 194 SDGQ 197 >UniRef50_C7P2A9 von Willebrand factor type A n=2 Tax=Halobacteriaceae RepID=C7P2A9_HALMD Length = 393 Score = 65.3 bits (158), Expect = 1e-09, Method: Composition-based stats. Identities = 36/184 (19%), Positives = 55/184 (29%), Gaps = 25/184 (13%) Query: 19 RCPCILLLDVSGSMNGRPINELNAGLVTFRDELLADPLALKRVELGIVTFG-PVHVEQP- 76 R L +D SGSM G I G + LLAD + IV F V P Sbjct: 36 RRHIALCIDTSGSMEGDNIKRARDG-AAWVFGLLADED-----YVSIVAFDTEATVILPA 89 Query: 77 --FTSAANFFP----PILFAQGDTPMGAAITKALDMVEERKREYRANGISYYRPWIFLIT 130 ++ L A G T M + A + + + L++ Sbjct: 90 TRWSDLDRQTAMDHVEELTAGGGTDMYNGLKAAKETLSSSATGPDTVKR------LLLLS 143 Query: 131 DGAPTDEWQAAANKVFRGEEDKRFAFFSIGVQ----GADMKTLAQISVRQPLPLQG-LQF 185 DG + + +D S G+ A ++TL L+ Sbjct: 144 DGKDNERTPDEFEGLAEAIDDAGIRIQSAGIGTDYNEATIRTLGTAGRGTWTHLEAPGDI 203 Query: 186 RELF 189 + F Sbjct: 204 EDFF 207 >UniRef50_UPI0000EB12CB UPI0000EB12CB related cluster n=1 Tax=Canis lupus familiaris RepID=UPI0000EB12CB Length = 2186 Score = 65.3 bits (158), Expect = 1e-09, Method: Composition-based stats. Identities = 39/175 (22%), Positives = 63/175 (36%), Gaps = 21/175 (12%) Query: 22 CILLLDVSGSMNGRPINELNAGLVTFRDELLADPLALKRVELGIVTFGPVHVEQPFTSAA 81 + L+D SGS+ ++ + ++ P ++G+V F ++ E+ F Sbjct: 588 IMFLVDSSGSIGHDNFGKMKTFMKNLLAKIQIGP---DSTQIGVVQFSDINQEE-FQLNK 643 Query: 82 NFFPPILF--------AQGDTPMGAAITKALDMVEERKREYRANGISYYRPWIFLITDGA 133 F T G+A+T K ++ LITDG Sbjct: 644 YFTQNETSDAIDRMSLINRGTLTGSALTFVGQYFTPTKGARTKVKK-----FLILITDGE 698 Query: 134 PTDEWQAAANKVFRGEEDKRFAFFSIGVQGADMKTLAQISVRQPLPLQGLQFREL 188 D + A + DK FS+GV GA+ L +IS L Q F +L Sbjct: 699 AQDPVRDPAKALR----DKGVVIFSVGVYGANRTQLEEISGDSSLVFQVENFDDL 749 Score = 54.5 bits (130), Expect = 3e-06, Method: Composition-based stats. Identities = 38/194 (19%), Positives = 76/194 (39%), Gaps = 21/194 (10%) Query: 4 QITFATSDFASNP-EPRCPCILLLDVSGSMNGRPINELNAGLVTFRD--ELLADPLALKR 60 ++ S+ P L+D S S+N ++ ++ + +D + Sbjct: 364 NLSILDSECNDKPHTKEADIYFLIDGSTSINTEGFEQIKQFMLAVTGMFSIGSDKVQAGA 423 Query: 61 VELGI---VTFGPVHVEQPFTSAANFFPPILFA---QGDTPMGAAITKALDMVEERKREY 114 V+ V F ++ + IL QG+T G A+ L ++ ++ R++ Sbjct: 424 VQYSDKIRVEF----YINASSNDMDLRKAILNIEQLQGNTHTGKALDFMLSII-KKDRKH 478 Query: 115 RANGISYYRPWIFLITDGAPTDEWQAAANKVFRGEEDKRFAFFSIGVQGADMKTLAQISV 174 R + I + + ++TDG DE A ++ D++ ++G+ AD L QI+ Sbjct: 479 RISEIPCH---LIVLTDGKSQDEVLKPAERLR----DEQITIHAVGIGEADKIQLQQIAG 531 Query: 175 RQPLPLQGLQFREL 188 + G F L Sbjct: 532 EEERVNFGQNFDSL 545 Score = 49.9 bits (118), Expect = 6e-05, Method: Composition-based stats. Identities = 28/173 (16%), Positives = 55/173 (31%), Gaps = 13/173 (7%) Query: 18 PRCPCILLLDVSGSMNGRPINELNAGLVTFRDELLADPLALKRVELGIVTFGPVHVEQ-- 75 + +LD SGS G E + + + + RV +G + + Sbjct: 790 KVLDIVFVLDHSGS-IGT--QEQESMMNLTIHLVKKADVDSDRVRVGALKYSDYPEVLFY 846 Query: 76 ---PFTSAANFFPPILFAQGDTPMGAAITKALDMVEERKREYRANGISYYRPWIFLITDG 132 ++ + G T A+ A M EY + + + +ITDG Sbjct: 847 LSGNKSAVIEHLRRRRYTSGHTYTARALEHANIM---FTEEYGSRIQQNVKQMLIIITDG 903 Query: 133 APTDEWQAAANKVFRGEEDKRFAFFSIGVQGADMKTLAQISVRQPLPLQGLQF 185 D + + +K +++GV A+ L ++ + F Sbjct: 904 VSHD--RDNLSDTASKLRNKGINIYAVGVGQANQLELETMAGNKSNTFHVDNF 954 >UniRef50_B3RP11 Putative uncharacterized protein n=1 Tax=Trichoplax adhaerens RepID=B3RP11_TRIAD Length = 356 Score = 65.3 bits (158), Expect = 2e-09, Method: Composition-based stats. Identities = 32/154 (20%), Positives = 51/154 (33%), Gaps = 27/154 (17%) Query: 13 ASNPEPRCPCILLLDVSGSMNGRPINELNAGLVTFRDELLADPLALKRVELGIVTFGP-- 70 AS R +L+LD SGSM+G ++ D L + E+G++ F Sbjct: 213 ASPSSKRL--VLVLDRSGSMSGDRFLKVKEAATAVLDSLGPND------EIGVIAFDDEI 264 Query: 71 -------VHVEQPFTS-----AANFFPPILFAQ-GDTPMGAAITKALDMVEERKREYRAN 117 V P T +F + + G T A+ A DM+ Sbjct: 265 RIHGGCKVTTVSPATPQSIIFLKDFINNKIQPEFGSTGYVPALKHAFDMLSTNMTSKAKT 324 Query: 118 GISYYRPWIFLITDGAPTDEWQAAANKVFRGEED 151 + I +TDG P + + + E Sbjct: 325 KTN----LIVFLTDGHPDEPESQILDVIKNRNEA 354 >UniRef50_Q2W311 Putative uncharacterized protein n=1 Tax=Magnetospirillum magneticum AMB-1 RepID=Q2W311_MAGSA Length = 1171 Score = 64.9 bits (157), Expect = 2e-09, Method: Composition-based stats. Identities = 37/182 (20%), Positives = 66/182 (36%), Gaps = 27/182 (14%) Query: 20 CPCILLLDVSGSMN---GRPINELNAGLVTFRDELLADPLALKRVELGIVTFG---PVHV 73 ++LLD S SM G P+ + F +L D + +V F VH Sbjct: 63 LDVVMLLDHSSSMGAAPGSPLQMMLRAAGNFLRQLSPDS------RVAVVGFNQVPSVHC 116 Query: 74 EQPFTSA-ANFFPPILFAQGDTPMGAAITKALDMVEERKREYRANGISYYRPWIFLITDG 132 T A A + G T + AA+ +A++++ A+G + L +DG Sbjct: 117 TLAATPAQARSALQAISPGGATSIAAALNQAVELL--------AHGRPGMDKVVVLCSDG 168 Query: 133 APTDEWQAAANKVFRGEEDKRFAFFSIGVQG--ADMKTLAQISVRQ--PLPLQGLQFREL 188 D+ A+ + R + ++G LA ++ RQ + ++ Sbjct: 169 Q--DDIAEIADALARLKAIPSVRVLAVGFGDEVIHATFLAMVADRQDYFHLTRARDMDDV 226 Query: 189 FS 190 F Sbjct: 227 FQ 228 >UniRef50_D1VKI5 von Willebrand factor type A n=1 Tax=Frankia sp. EuI1c RepID=D1VKI5_9ACTO Length = 560 Score = 64.9 bits (157), Expect = 2e-09, Method: Composition-based stats. Identities = 41/184 (22%), Positives = 65/184 (35%), Gaps = 24/184 (13%) Query: 10 SDFASNPEPRCPCILLLDVSGSMNGRPINELNAGLVTFR--DELLADPLALKRVE--LGI 65 + + I +LD SGSM G + L L D+ L+ A R + I Sbjct: 355 AAYLDQYRRPTRAIYVLDTSGSMEGPRLAALQQALTGLTGADDSLSGRFARFRAREQVTI 414 Query: 66 VTFGP-VHVEQPFTSAA-----------NFFPPILFAQGDTPMGAAITKALDMVEERKRE 113 +TF V + FT + + + L A G+T + +A+ A + Sbjct: 415 ITFNDKVTATRQFTVSDPTPGSADLKAISDYGAALRAGGNTAIYSALDAAYTTAAAGMKA 474 Query: 114 YRANGISYYRPWIFLITDGAPT---DEWQAAANKVFRGEEDKRFAFFSIGVQGADMKTLA 170 + S I L+TDG D A R + + F++ AD L Sbjct: 475 DPSALTS-----IVLMTDGENNRGLDSAGFLARYNTRPPDVRGVRTFAVDFGDADRAALT 529 Query: 171 QISV 174 QI+ Sbjct: 530 QIAT 533 >UniRef50_UPI0001AEB92D inter-alpha-trypsin inhibitor domain-containing protein n=1 Tax=Alteromonas macleodii ATCC 27126 RepID=UPI0001AEB92D Length = 586 Score = 64.9 bits (157), Expect = 2e-09, Method: Composition-based stats. Identities = 38/152 (25%), Positives = 54/152 (35%), Gaps = 28/152 (18%) Query: 22 CILLLDVSGSMNGRPINELNAGLVTFRDELLADPLALKRVELGIVTFGPVHVEQPFTSAA 81 ++D SGSM GRPI + L D L ++ +V F TS Sbjct: 316 ITFVIDTSGSMGGRPIVDAKESLQLAIDRL------SEKDRFNVVAFNNDTTRLFETSVE 369 Query: 82 ---------NFFPPILFAQGDTPMGAAITKALDMVEERKREYRANGISYYRPWIFLITDG 132 F L A G T M A+ AL R + + +F ITDG Sbjct: 370 GTTRNKQYARDFVKHLNAGGGTEMAPALNAALK---------RTTTKDFIKQVVF-ITDG 419 Query: 133 APTDEWQAAANKVFRGEEDKRFAFFSIGVQGA 164 A A +++ D R F++G+ A Sbjct: 420 A-VGNEAALFSQIKNELGDARL--FTVGIGSA 448 >UniRef50_B3RZT6 Putative uncharacterized protein n=2 Tax=Trichoplax adhaerens RepID=B3RZT6_TRIAD Length = 1343 Score = 64.9 bits (157), Expect = 2e-09, Method: Composition-based stats. Identities = 32/163 (19%), Positives = 58/163 (35%), Gaps = 26/163 (15%) Query: 22 CILLLDVSGSMNGRPINELNAGLVTFRDELLA-DPLALKRVELGIVTFGPVHVEQ-PFTS 79 +++LD SGSM G + +L +L D +GI+ F P + Sbjct: 299 IVMVLDKSGSMRGSNLQQLIQAATNVILQLGQIDGS------IGIIIFSTSATVTCPLMA 352 Query: 80 AANFF---------PPILFAQGDTPMGAAITKALDMVEERKREYRANGISYYRPWIFLIT 130 N PP A G T +G+ I K ++++ E + +G + +++ Sbjct: 353 VNNDQDKNKLIGCLPPE--ASGGTSIGSGILKGIELLLGSVGEQKPSGGH-----LIVMS 405 Query: 131 DGAPTDEWQAAANKVFRGEEDKRFAFFSIGVQGADMKTLAQIS 173 DG + V + SI + K L ++ Sbjct: 406 DGQENANPR--IKDVMSNITENDVVVTSISFGQSASKVLEDLA 446 >UniRef50_A0DZ93 Chromosome undetermined scaffold_7, whole genome shotgun sequence n=1 Tax=Paramecium tetraurelia RepID=A0DZ93_PARTE Length = 522 Score = 64.9 bits (157), Expect = 2e-09, Method: Composition-based stats. Identities = 31/145 (21%), Positives = 54/145 (37%), Gaps = 33/145 (22%) Query: 22 CILLLDVSGSMNGRPINELNAGLVTFRDELLADPLALKRVELGIVTFGPVHVEQPFTSAA 81 I L+D SGSM+G ++ + L L + L ++ F + T Sbjct: 114 LICLIDHSGSMSGEKMHLVKKSLKHLLKMLQPND------RLCLIEFDDQNYR--LTRLM 165 Query: 82 NFFPP----------ILFAQGDTPMGAAITKALDMVEERKREYRANGISYYRPW--IFLI 129 + A G T +G A+ AL +++ R+ + P IFL+ Sbjct: 166 RATQENMYKFLIAIDTIEANGATDIGNAMKMALSILKHRR---------FKNPIASIFLL 216 Query: 130 TDGAPTDEWQAAANKVFRGEEDKRF 154 +DG + AA +V+ + K Sbjct: 217 SDGE----DEGAAGRVWNDIQSKNI 237 >UniRef50_C8NJ92 Secreted Mg-chelatase subunit n=3 Tax=Corynebacterium RepID=C8NJ92_COREF Length = 530 Score = 64.9 bits (157), Expect = 2e-09, Method: Composition-based stats. Identities = 46/200 (23%), Positives = 71/200 (35%), Gaps = 36/200 (18%) Query: 15 NPEPRCP--CILLLDVSGSMNGRPINELNAGLVTFR----DELLADPLALKRVELGIVTF 68 N + R P +LDVSGSM G + L + ++ L D +R + I+ F Sbjct: 337 NNDLRVPGDTTFVLDVSGSMAGTRMELLRSTMLEMISGEASSLTGDVSLRERENVTIIPF 396 Query: 69 GPVHVEQPFTSAANFFPPI----------LFAQGDTPMGAAITKALDMVEERKREYRANG 118 E + P L A+G T + A+ +A + VE Sbjct: 397 NFSPGEPITATVDEVGGPQRQELVDGVTALQAEGGTGIYDALLRAYEQVEPGASI----- 451 Query: 119 ISYYRPWIFLITDGAPTD-----EWQAAANKVFRGEEDKRFAFFSIGVQGAD---MKTLA 170 P I L+TDG T +Q +++ E KR F I A+ M+ LA Sbjct: 452 -----PSIVLMTDGEQTSGLSFGHFQRLYSELPT--EKKRIPVFVILYGEANITEMENLA 504 Query: 171 QISVRQPLPLQGLQFRELFS 190 ++ + E F Sbjct: 505 GLTGGKTFDAMNGGLEEAFK 524 >UniRef50_D2VKS7 von Willebrand factor type A domain-containing protein n=2 Tax=Naegleria gruberi RepID=D2VKS7_NAEGR Length = 923 Score = 64.9 bits (157), Expect = 2e-09, Method: Composition-based stats. Identities = 31/179 (17%), Positives = 57/179 (31%), Gaps = 28/179 (15%) Query: 13 ASNPEPRCPCILLLDVSGSMNGRPINELNAGLVTFRDELLADPLALKRVELGIVTFG-PV 71 A +L++D SGSM G+ ++ + + L D+L ++ + IV F V Sbjct: 684 AQKERKGVDLVLVVDKSGSMAGQKLDMVKSTLSFMVDQLK------EKDRVAIVEFDTQV 737 Query: 72 HVEQPFTSAA-------NFFPPILFAQGDTPMGAAITKALDMVEERKREYRANGISYYRP 124 T + T + A+ +L ++ R++E Sbjct: 738 KTNLDLTKMDIEGKKKAKQVSSAISPGSCTNLSGALFTSLKLLASRQQEKNEVTS----- 792 Query: 125 WIFLITDGA------PTDEWQAAANKVF-RGEEDKRFAFFSIGVQ-GADMKTLAQISVR 175 + L TDG T+E + + G D L I+ + Sbjct: 793 -VILFTDGLANRGLISTNEILQNMQDLMDELLSTSNVTIHTFGFGQDTDANMLTSIAQK 850 >UniRef50_UPI0001C34E55 hypothetical protein ClM62_13922 n=1 Tax=Clostridium sp. M62/1 RepID=UPI0001C34E55 Length = 466 Score = 64.9 bits (157), Expect = 2e-09, Method: Composition-based stats. Identities = 29/167 (17%), Positives = 57/167 (34%), Gaps = 24/167 (14%) Query: 22 CILLLDVSGSMNGRPINELNAGLVTFRDEL-----LADPLALKRVELGIVTFGPVHVE-- 74 + +D S M G + G+ F + L + A ++ +G+V+F Sbjct: 40 IVFAIDRSAKMEGSALEAAKKGIKAFIETLERESAQPEGYAGEK-RVGLVSFSDTATVNS 98 Query: 75 --QPFTSAANFFPPILFAQGDTPMGAAITKALDMVEERKREYRANGISYYRPWIFLITDG 132 P A L A G + AI A+ +++ + + +FLITDG Sbjct: 99 MLSPVVEQAARAAEGLTAGGKSNQAEAIRAAVKLLDMKTPGEK---------MLFLITDG 149 Query: 133 APTDEWQAAANKVFRGEEDKRFAFFSIGV---QGADMKTLAQISVRQ 176 +++ + + IG+ G + + L + Sbjct: 150 Q--TPFRSQTDSAAAEARQAGVTVYCIGIAAPDGVNREALRSWASGP 194 >UniRef50_UPI000155D2F0 PREDICTED: similar to matrilin-3, partial n=1 Tax=Ornithorhynchus anatinus RepID=UPI000155D2F0 Length = 354 Score = 64.9 bits (157), Expect = 2e-09, Method: Composition-based stats. Identities = 34/164 (20%), Positives = 59/164 (35%), Gaps = 15/164 (9%) Query: 20 CPCILLLDVSGSMNGRPINELNAGLVTFRDELLADPLALKRVELGIVTFGPV-------H 72 + ++D S S+ R ++ L D L A + +V + Sbjct: 150 LDLVFIVDSSRSVRPREFEKVKTFLSQVIDTLDIGETAT---RVAVVNYASTVKVEFHLQ 206 Query: 73 VEQPFTSAANFFPPILFAQGDTPMGAAITKALDMVEERKREYRANGISYYRPWIFLITDG 132 S I T G AI A+D V + RA + + + ++TDG Sbjct: 207 THSDKESLKQAVSRIAPLATGTMSGLAIRTAMDEVFTVEAGARAPAFNIPK-VVVIVTDG 265 Query: 133 APTDEWQAAANKVFRGEEDKRFAFFSIGVQGADMKTLAQISVRQ 176 P D+ Q A + +++GV ADM++L Q++ Sbjct: 266 RPQDQVQEAV----AQAQASGIEIYAVGVGRADMQSLRQLASEP 305 >UniRef50_B8HNU4 von Willebrand factor type A n=4 Tax=Chroococcales RepID=B8HNU4_CYAP4 Length = 421 Score = 64.9 bits (157), Expect = 2e-09, Method: Composition-based stats. Identities = 38/159 (23%), Positives = 57/159 (35%), Gaps = 25/159 (15%) Query: 24 LLLDVSGSMNGRPINELNAGLVTFRDELLADPLALKRVELGIVTFGPVHVE----QPFTS 79 L+LD SGSM G+P+ + D LL L ++ F V QP T Sbjct: 46 LILDHSGSMAGQPLETVKRAAQKLVDRLLPSD------RLAVIVFDHVAKVLIPNQPVTD 99 Query: 80 AANFFPPI--LFAQGDTPMGAAITKALDMVEERKREYRANGISYYRPWIFLITDGAPTDE 137 I L A G T + + L + K + IFL+TDG +E Sbjct: 100 RDKIKTRISHLAAMGGTAIDEGLQLGLTELIAAKAGAISQ--------IFLLTDGE--NE 149 Query: 138 WQAAANKVFRGEEDKR--FAFFSIGVQGA-DMKTLAQIS 173 + + EE + ++G + L QI+ Sbjct: 150 HGNNSRCLQLAEEAAKENITLNTLGFGYHWNQDVLEQIA 188 >UniRef50_UPI0000E488A7 PREDICTED: similar to Clca1 protein n=5 Tax=Strongylocentrotus purpuratus RepID=UPI0000E488A7 Length = 966 Score = 64.9 bits (157), Expect = 2e-09, Method: Composition-based stats. Identities = 35/178 (19%), Positives = 62/178 (34%), Gaps = 26/178 (14%) Query: 8 ATSDFASNPEPRCPCILLLDVSGSMNGRPINELNAGLVTFRDELLADPLALKRVELGIVT 67 + + P +L+LD SGSM+G +++ G F ++ + + IV Sbjct: 302 SPNFIVVQPSGSLRIVLVLDTSGSMDGERFDKMIRGAKNFIQSIVPNNS-----YVAIVE 356 Query: 68 FGPVHVEQP-FTSAANFFP--------PILFAQGDTPMGAAITKALDMVEERKREYRANG 118 F + T + P L A G T +G I A+ + + + R Sbjct: 357 FNYESIVDSYMTELTSVISRKDLASLLPTL-ADGATCIGCGIVTAIQVAQYNDMDSRGV- 414 Query: 119 ISYYRPWIFLITDGAPTDEWQAAANKVFRGEEDKRFAFFSIGVQGADMKT--LAQISV 174 ++ L++DG + E SI AD + LAQ++ Sbjct: 415 ------YLILLSDGE--ENHGTPIADTMDDIEGSGVIVHSIAFYEADTQLEDLAQMTG 464 >UniRef50_Q67LZ3 Putative uncharacterized protein n=1 Tax=Symbiobacterium thermophilum RepID=Q67LZ3_SYMTH Length = 414 Score = 64.9 bits (157), Expect = 2e-09, Method: Composition-based stats. Identities = 36/160 (22%), Positives = 54/160 (33%), Gaps = 29/160 (18%) Query: 16 PEPRCPCIL--LLDVSGSMNGRPINELNAGLVTFRDELLADPLALKRVELGIVTFGPVHV 73 PE R P L ++D SGSM G + L D++ + L IVT+ V Sbjct: 37 PEGRPPLNLAAVVDRSGSMAGAALYFTKQALRFLVDQM------AEEDRLAIVTYDD-QV 89 Query: 74 EQPF-------TSAANFFPPILFAQGDTPMGAAITKALDMVEERKREYRANGISYYRPWI 126 PF A + A G T + + + + R + + Sbjct: 90 HVPFPSQPVVQKDAVRLLVDGITAGGTTNLSGGLATGMQQIRPHAGPGRVSR-------V 142 Query: 127 FLITDGAP----TDEWQAAANKVFRGEEDKRFAFFSIGVQ 162 L+TDG TD R +K A ++GV Sbjct: 143 LLMTDGLANVGVTDP--DVLAGWARAWREKGLAVSTMGVG 180 >UniRef50_C8N8N5 von Willebrand factor type A domain protein n=2 Tax=Gammaproteobacteria RepID=C8N8N5_9GAMM Length = 563 Score = 64.5 bits (156), Expect = 2e-09, Method: Composition-based stats. Identities = 50/218 (22%), Positives = 86/218 (39%), Gaps = 42/218 (19%) Query: 16 PEPRCP--CILLLDVSGSMNG-RPINELNAGLVTFRDELLADPLALKRVELGIVTF-GPV 71 PE R P + L+D SGSM+ + + + F + L AD + ++T+ G Sbjct: 197 PEKRPPANLVFLIDTSGSMDDPDKLPLVKKTVCHFAEALRADD------RISLITYSGST 250 Query: 72 HVEQPFTSAANFFPPI-----LFAQGDTPMGAAITKALDMVEERKREYRANGISYYRPWI 126 P T+ I L A G T G A+ A D + YR +GI+ I Sbjct: 251 AEILPPTAGDQKETIIAALKPLRAHGATAGGEALRMAYDAA---AKNYRKDGINR----I 303 Query: 127 FLITDGAPTDEWQAAANK---VFRGEEDKRFAFFSI-----GVQGADMKTLAQIS----V 174 L TDG ++ + + DKR + S+ G + + + Q++ Sbjct: 304 LLATDG----DFNVGISDPATLKNYVADKRKSGISLTTLGYGSGNYNDEMMEQLADAGDG 359 Query: 175 RQPLPLQGLQFRE-LFSWLSSSLRSVSRSTPGTEVVLE 211 + ++ L L+S+L +V+R ++ LE Sbjct: 360 NYSYIDSEAEAKKVLVRQLTSTLATVAR---DIKIQLE 394 >UniRef50_A6DLI0 Putative uncharacterized protein n=1 Tax=Lentisphaera araneosa HTCC2155 RepID=A6DLI0_9BACT Length = 833 Score = 64.5 bits (156), Expect = 2e-09, Method: Composition-based stats. Identities = 42/203 (20%), Positives = 77/203 (37%), Gaps = 28/203 (13%) Query: 14 SNPEPRCPCILLLDVSGSMNGRPINELNAGLVTFRDELLADPLALKRVELGIVTF-GPVH 72 +P +L++D SGSMNG+PI A L R ++G++ F G Sbjct: 406 EKEQPSLALVLVIDKSGSMNGQPIVLAREASKA------AAELLSSRDQVGVIAFDGSAK 459 Query: 73 VEQPFTSAANFFPPI-----LFAQGDTPMGAAITKALDMVEERKREYRANGISYYRPWIF 127 + TSAAN + + A G T + A+ DM+ + + + Sbjct: 460 LVTDLTSAANKGEVLSQIDGIGAGGGTNLYPAMVMGRDMLGIASAKIKH---------MI 510 Query: 128 LITDGAPT-DEWQAAANKVFRGEEDKRFAFFSIGVQGAD--MKTLAQISVRQPLPLQGLQ 184 +++DG +++ ++++ + + S+G A M +AQI + Sbjct: 511 VLSDGQSQGGDFEGISSELAQMGVT--ISTVSLGQGAAVDLMAAIAQIGNGRAYVTN--N 566 Query: 185 FRELFSWLSSSLRSVSRSTPGTE 207 E+ + SRS E Sbjct: 567 AEEMPRIFTKETMEASRSAIKEE 589 >UniRef50_UPI0000F2DDBB PREDICTED: similar to Inter-alpha (globulin) inhibitor H4 (plasma Kallikrein-sensitive glycoprotein) n=1 Tax=Monodelphis domestica RepID=UPI0000F2DDBB Length = 819 Score = 64.2 bits (155), Expect = 3e-09, Method: Composition-based stats. Identities = 42/185 (22%), Positives = 70/185 (37%), Gaps = 34/185 (18%) Query: 12 FASNPEPRCP--CILLLDVSGSMNGRPINELNAGLVTFRDELLADPLALKRVELGIVTFG 69 FA P P + L+D SGSM GR I + A L+ D+L + ++TF Sbjct: 251 FAPTQLPMVPKNIVFLIDKSGSMAGRKIKKTKAALIKILDDLKPED------HFNMITFS 304 Query: 70 PV----------HVEQPFTSAANFFPPILFAQGDTPMGAAITKALDMV-EERKREYRANG 118 +++ A F A G T + A+ A+ M+ E K++ G Sbjct: 305 GHVTRWKPELVLALDEHLKEAKTFLSNT-PALGVTNVNGAVLAAVSMLDESNKKKELPEG 363 Query: 119 ISYYRPWIFLITDGAPTDEWQAAANKVFRGEEDKRFAF------FSIGVQ-GADMKTLAQ 171 I L+TDG T K+ + E+ + A F +G + L + Sbjct: 364 SVS---MIILLTDGDST----EGETKLQKIHENVKAAIRGQYHLFCLGFGFDINYVFLER 416 Query: 172 ISVRQ 176 +++ Sbjct: 417 LALDN 421 >UniRef50_C8XJ05 Magnesium chelatase n=20 Tax=cellular organisms RepID=C8XJ05_NAKMY Length = 705 Score = 64.2 bits (155), Expect = 3e-09, Method: Composition-based stats. Identities = 31/132 (23%), Positives = 48/132 (36%), Gaps = 14/132 (10%) Query: 22 CILLLDVSGSMNGRPINELNAGLVTFRDELLADPLALKRVELGIVTF--GPVHVEQPFTS 79 + +D SGSM R + LL D + ++G+VTF G + P TS Sbjct: 524 VLFCVDASGSMAAR---ARMEAVKAAVLSLLTDAYQRRD-KVGLVTFRGGAADLALPPTS 579 Query: 80 AANFFP---PILFAQGDTPMGAAITKALDMVEERKREYRANGISYYRPWIFLITDGAPTD 136 + +L A G TP+ + A + + RP + ++TDG T Sbjct: 580 SVEAAARRLEMLPAGGRTPLAEGLLCAAHTLRVERIR-----DPRRRPLLVVVTDGRATS 634 Query: 137 EWQAAANKVFRG 148 A A Sbjct: 635 GPDAVARSRRAA 646 >UniRef50_Q10RY0 Os03g0142500 protein n=2 Tax=Oryza sativa RepID=Q10RY0_ORYSJ Length = 694 Score = 64.2 bits (155), Expect = 3e-09, Method: Composition-based stats. Identities = 35/184 (19%), Positives = 60/184 (32%), Gaps = 34/184 (18%) Query: 20 CPCILLLDVSGSMNGRPINELNAGLVTFRDELLADPLALKRVELGIVTFGPVHVEQ-PF- 77 + +LDVSGSM+G ++ L + L + L +V F P Sbjct: 246 LDLVTVLDVSGSMSGIKLSLLKRAMSFVIQTLGPND------RLSVVAFSSTAQRLFPLR 299 Query: 78 --TSAANFFP----PILFAQGDTPMGAAITKALDMVEERKREYRANGISYYRPWIFLITD 131 T L A G T + A+ K +V++R+R+ + I L++D Sbjct: 300 RMTLTGRQQALQAISSLVASGGTNIADALKKGAKVVKDRRRKNPVSS-------IILLSD 352 Query: 132 GAPTDEWQAAANKVF-----------RGEEDKRFAFFSIGV--QGADMKTLAQISVRQPL 178 G T + + + + F G A M +A+ S Sbjct: 353 GQDTHSFLSGEADINYSILVPPSILPGTSHHVQIHTFGFGTDHDSAAMHAIAETSNGTFS 412 Query: 179 PLQG 182 + Sbjct: 413 FIDA 416 >UniRef50_D2RGD5 von Willebrand factor type A n=1 Tax=Archaeoglobus profundus DSM 5631 RepID=D2RGD5_ARCPR Length = 411 Score = 64.2 bits (155), Expect = 3e-09, Method: Composition-based stats. Identities = 45/201 (22%), Positives = 77/201 (38%), Gaps = 37/201 (18%) Query: 7 FATSDFASNPEPR---CPCILLLDVSGSMNGRPI-NELNAGLVTFRDELLADPLALKRVE 62 F D + C ++L+DVS SM GR I L + L R + + E Sbjct: 222 FDERDLVAREGKHMEKCVYVMLIDVSDSMRGRRIVGALESAL-ALRKVIKKSNMD----E 276 Query: 63 LGIVTFGPVHVEQPFTSAANFFPPILFAQGDTPMGAAITKALDMVEERKREYRANGISYY 122 L +V F + N L +G T +G A+ A +++++R R +G Sbjct: 277 LHVVAFNHRVRKIKDEEILN-----LRTRGRTDIGLALKTAREIIKKR----RGSG---- 323 Query: 123 RPWIFLITDGAPTDEWQAAANK----VFRGEEDKR----FAFFSIGVQG---ADMKTLAQ 171 IFLITDG PT + + E+ ++ + + A + +A+ Sbjct: 324 --VIFLITDGEPTSSYDPYLTPTMCALREAEKLRKVDANLTIVMLSPEKRFLALCERIAK 381 Query: 172 ISVRQPLP--LQGLQFRELFS 190 +S + L L ++ F Sbjct: 382 LSRKANLVYIENPLNMKKFFV 402 >UniRef50_Q2BCF0 Possible D-amino acid dehydrogenase, large subunit n=1 Tax=Bacillus sp. NRRL B-14911 RepID=Q2BCF0_9BACI Length = 459 Score = 64.2 bits (155), Expect = 3e-09, Method: Composition-based stats. Identities = 31/192 (16%), Positives = 62/192 (32%), Gaps = 27/192 (14%) Query: 2 SEQITFATSDFASNPEPRCPCILLLDVSGSMN-----GRPINELNAGLVTFRDELLADPL 56 ++ + + ++L+D SGSM G + + F L D Sbjct: 135 MPELPDGEDEIQQAKNQKSNIVILMDASGSMKADVSGGNKMMLAKETIKEFTSSLEDDAS 194 Query: 57 ALKRVELGIVTFGPVHVEQPFTSAANFFP-------------PILFAQGDTPMGAAITKA 103 + T + + FP A G TP+ AI KA Sbjct: 195 VSLMAYGHVGTGNDEDKAESCSRIDEVFPLGAYEKTAFNKSMDSFEASGWTPLAGAIDKA 254 Query: 104 LDMVEERKREYRANGISYYRPWIFLITDGAPT--DEWQAAANKVFRGEEDKRFAFFSIGV 161 +++ Y + Y+ +++++DG T + AA ++ + + V Sbjct: 255 RELL----SAYNST---DYKNTLYIVSDGVETCDGDPVEAAQQLQGSNIEAKVNIIGFDV 307 Query: 162 QGADMKTLAQIS 173 K L +++ Sbjct: 308 DDEGQKQLKEVA 319 >UniRef50_Q562D1 LOC594926 protein (Fragment) n=18 Tax=Euteleostomi RepID=Q562D1_XENTR Length = 895 Score = 64.2 bits (155), Expect = 3e-09, Method: Composition-based stats. Identities = 41/185 (22%), Positives = 67/185 (36%), Gaps = 29/185 (15%) Query: 7 FATSDFASNPEPRCPCILLLDVSGSMNGRPINELNAGLVTFRDELLADPLALKRVELGIV 66 FA S P+ I ++D S SM G + + L+ D++ V Sbjct: 265 FAPSKLKEVPKN---IIFIIDRSISMIGLKMQQTKEALLKILDDVKEHD------HFNFV 315 Query: 67 TF--GPVHVEQPFTSA-------ANFFPPILFAQGDTPMGAAITKALDMVEERKREYRAN 117 F G EQ A A + L+ +G T + A+ A+ ++++ Sbjct: 316 IFDWGVEIWEQSLVKATPENLNRAKAYVRNLYPKGWTNINDALLSAISLLDQAHDARSVP 375 Query: 118 GISYYRPWIFLITDGAPT------DEWQAAANKVFRGEEDKRFAFFSIGVQGADMKTLAQ 171 S I +TDG P+ D+ Q A RG+ F +GV D L + Sbjct: 376 KRS--ASLIIFMTDGQPSTGERNLDKIQENARNAIRGKYSLYSLGFGVGV---DYPFLEK 430 Query: 172 ISVRQ 176 +S+ Sbjct: 431 LSLEN 435 >UniRef50_A0P2E4 Von Willebrand factor type A like domain n=1 Tax=Labrenzia aggregata IAM 12614 RepID=A0P2E4_9RHOB Length = 772 Score = 64.2 bits (155), Expect = 3e-09, Method: Composition-based stats. Identities = 31/155 (20%), Positives = 51/155 (32%), Gaps = 34/155 (21%) Query: 22 CILLLDVSGSMNGRPINELNAGLVTFRDELLADPLALKRVELGIVTFGP----------V 71 + +LD SGSM+G+PI + L D I+ F + Sbjct: 366 LVFVLDTSGSMSGQPIEASKTFMTAAIKALRPDDY------FRILHFSNDTSQFAGQAVL 419 Query: 72 HVEQPFTSAANFFPPILFAQGDTPMGAAITKALDMVEERKREYRANGISYYRPWIFLITD 131 E+ A F L A G T + A+ A D + + +TD Sbjct: 420 ATERNKQKALKFVA-DLSAGGGTEINQAVNAAFDQAQPDNTTR----------IVVFLTD 468 Query: 132 GAPTDEWQAAANKVFRGEEDK--RFAFFSIGVQGA 164 G DE V + ++ + ++ GV + Sbjct: 469 GYIGDE-----ATVIKSIANRIGKARIYAFGVGNS 498 >UniRef50_UPI0000E80A5E PREDICTED: similar to calcium-activated chloride channel n=2 Tax=Gallus gallus RepID=UPI0000E80A5E Length = 928 Score = 63.8 bits (154), Expect = 4e-09, Method: Composition-based stats. Identities = 43/217 (19%), Positives = 69/217 (31%), Gaps = 43/217 (19%) Query: 22 CILLLDVSGSMN-GRPINELNAGLVTFRDELLADPLALKRVELGIVTF-GPVHVEQPFTS 79 L+LDVSGSMN I L F +++ +GIVTF + + P Sbjct: 308 VSLVLDVSGSMNTNNRITNLRTAAEVFLIQIIEIGS-----RVGIVTFESSAYEKSPLLQ 362 Query: 80 AANFFPPI-------LFAQGDTPMGAAITKALDMVEERKREYRANGISYYRPWIFLITDG 132 + A G T + A I K L+++ + I L+TDG Sbjct: 363 ITSVATRQRLVQNLPTTAGGGTKICAGIEKGLEIITNAIGTTYGSE-------IVLLTDG 415 Query: 133 APTDEWQAAANKVFRGEEDKRFAFFSIGVQGADMKTLAQISVRQ-------PLPLQGLQF 185 D + ++ +I + + K L + S + Sbjct: 416 E--DSTMSL---CREKVKESGAIIHTIALGPSAAKELEEFSNITGGLQLYAVDVDVPSKL 470 Query: 186 RELFSWLSSSLRSVSRSTPGTEVVLEAPK------GW 216 E F S + + S + LE+ GW Sbjct: 471 VEAF----SEITTGSGDISEQSIQLESKDQEVVSSGW 503 >UniRef50_UPI0001C39385 von Willebrand factor type A n=1 Tax=Arthrospira platensis str. Paraca RepID=UPI0001C39385 Length = 361 Score = 63.8 bits (154), Expect = 4e-09, Method: Composition-based stats. Identities = 22/84 (26%), Positives = 40/84 (47%), Gaps = 2/84 (2%) Query: 90 AQGDTPMGAAITKALDMVEERKREYRANGISYYRPWIFLITDGAPTDEWQAAANKVFRGE 149 A G T G + AL +V + + N + P + LI+DG PTD++Q+ +++ + Sbjct: 278 ADGTTSSGTDMGTALKLVSQELKIPPMNERA-LPPVLVLISDGQPTDDFQSGLDELMKQP 336 Query: 150 EDKRFAFFSIGVQ-GADMKTLAQI 172 K+ +I + AD L + Sbjct: 337 WGKKAVRIAIAIGKDADEAVLQKF 360 Score = 48.4 bits (114), Expect = 2e-04, Method: Composition-based stats. Identities = 20/65 (30%), Positives = 34/65 (52%), Gaps = 6/65 (9%) Query: 7 FATSDFASNP--EPRCPCILLLDVSGSMNGRPINELNAGLVTFRDELLADPLALKRVELG 64 F D ++P +P+ + L+D SGSM+ I+ + V+F D+++ A V LG Sbjct: 57 FQWKDLVADPLQQPKLDIVFLVDTSGSMS-DEIDAVKRSCVSFADQIIK---AGANVRLG 112 Query: 65 IVTFG 69 +V F Sbjct: 113 LVGFD 117 >UniRef50_UPI0001760236 PREDICTED: similar to mCG140660 n=1 Tax=Danio rerio RepID=UPI0001760236 Length = 1753 Score = 63.8 bits (154), Expect = 4e-09, Method: Composition-based stats. Identities = 39/188 (20%), Positives = 70/188 (37%), Gaps = 27/188 (14%) Query: 22 CILLLDVSGSMNGRPINELN-AGLVTFRDELLAD-PLALKRVELGIVTFGPV-------H 72 + L+D S S I N + F L+ + +A +V +G+V + + Sbjct: 33 IVFLVDGSAS-----IGLDNFQQIRQFLSSLVENFEVAPDKVRIGLVQYSDTPRTEFSLN 87 Query: 73 VEQPFTSAANFFPPILFAQGDTPMGAAITKAL--DMVEERKREYRANGISYYRPWIFLIT 130 Q ++ + + G T G + L +EE + N +IT Sbjct: 88 TYQNKEEILDYIRNLRYKTGGTHTGQGLEFILKQHFIEEAGSRAQQNVPQ----IAIVIT 143 Query: 131 DGAPTDEWQAAANKVFRGEEDKRFAFFSIGVQGADMKTLAQISVRQ--PLPLQGLQFREL 188 DG DE A ++ + + F+IG++ AD++ L QI+ F L Sbjct: 144 DGDSQDEVDLQAQELRQ----RGIKIFAIGIKDADVRLLRQIANEPYDQYVYSVSDFAAL 199 Query: 189 FSWLSSSL 196 +S S+ Sbjct: 200 -QGISQSV 206 Score = 58.4 bits (140), Expect = 2e-07, Method: Composition-based stats. Identities = 42/186 (22%), Positives = 66/186 (35%), Gaps = 24/186 (12%) Query: 22 CILLLDVSGSMNGRPINELNAGLVTFRDELLADPLALKRVELGIVTFGPVHVEQ------ 75 +LL+D SGS+ E+ L F D P V LG+ F ++ Sbjct: 232 IVLLVDSSGSIGDNDFEEVKKFLHAFVDRFNLRP---DLVRLGLAQFSDRPYQEFLLGDY 288 Query: 76 -PFTSAANFFPPILFAQGDTPMGAAITKALDMVEERKREYRANGISYYRPWIFLITDGAP 134 +++ +G T G A+T E R N +ITDG Sbjct: 289 ADKKDLHQKLNNLIYRKGGTQTGQALTFIR---ENYFSLARPNVPG----IAIVITDGES 341 Query: 135 TDEWQAAANKVFRGEEDKRFAFFSIGVQGADMKTLAQISVRQ--PLPLQGLQFRELFSWL 192 D+ + A ++ + + F I V +M+ L I+ ++EL L Sbjct: 342 RDDVEEPAQRLR----NTGVSLFVIRVGKGNMEKLRAIANIPHEEFLFSINNYQEL-QGL 396 Query: 193 SSSLRS 198 SLRS Sbjct: 397 KESLRS 402 Score = 56.1 bits (134), Expect = 8e-07, Method: Composition-based stats. Identities = 35/176 (19%), Positives = 69/176 (39%), Gaps = 18/176 (10%) Query: 22 CILLLDVSGSMNGRPINELNAGLVTFRDELLADPLALKRVELGIVTFGP----VHVEQPF 77 ++D S S+ R + L L + ++++ ++ + F Sbjct: 613 IAFIVDQSSSIKSRNFQLVRDFLENTIGRL---DVGKDKIQIAVILYSDFPRADVYLNTF 669 Query: 78 TSAANF--FPPILFAQ-GDTPMGAAITKALDMVEERKREYRANGISYYRPWIFLITDGAP 134 ++ + + L G T GAA+ A + V + R R + Y + +ITDG Sbjct: 670 SNKNDILRYINTLPYGRGKTYTGAALRFAKEHVFTKARGSRRD--KYVQQVAVVITDGKS 727 Query: 135 TDEWQAAANKVFRGEEDKRFAFFSIGVQGADMKTLAQISVRQP--LPLQGLQFREL 188 TD+ +AA ++ R + F++G++ L +I+ P L F +L Sbjct: 728 TDDAASAAAELRRS----GVSIFALGIKDTKEDDLREIASYPPKKFVLNVENFDQL 779 Score = 49.9 bits (118), Expect = 5e-05, Method: Composition-based stats. Identities = 24/174 (13%), Positives = 54/174 (31%), Gaps = 16/174 (9%) Query: 22 CILLLDVSGSMNGRPINELNAGLVTFRDELLADPLALKRVELGIVTFGPVHVEQ------ 75 LLD SGS++ ++ ++ D + V +G+V F Sbjct: 819 IYFLLDESGSISYPDFEDMKKFIMECLDVFQ---IGKDHVRIGVVKFASKATTVFRLHDY 875 Query: 76 -PFTSAANFFPPILFAQGDTPMGAAITKALDMVEERKREYRANGISYYRPWIFLITDGAP 134 + + G T + + + + E + R + +ITDG Sbjct: 876 STKSDVEKAVKDLEMYGGGTRTDLGLRQMIPLFREAVQ----TRGEKARELLIVITDGES 931 Query: 135 TDEWQAAANKVFRGEEDKRFAFFSIGVQGADMKTLAQISVRQPLPLQGLQFREL 188 T + ++ + ++IG +G + + + F ++ Sbjct: 932 TGTVEPVEVPAKHLRAEQNVSIYAIGCEGLLADVV--FLIDGSDSVSAEDFEKM 983 Score = 47.6 bits (112), Expect = 3e-04, Method: Composition-based stats. Identities = 32/172 (18%), Positives = 60/172 (34%), Gaps = 39/172 (22%) Query: 22 CILLLDVSGSMNGRPINELNAGLVTFRDELLADPLALKRVELGIVTFGPVHVEQPFTSAA 81 + L+D S S+ + L ++ EL +A + +G+ F + + Q Sbjct: 1232 LVFLIDGSESIKPPSWDILKQTMIGIVKEL---DIAKDKWRVGVAQFSDILLHQ------ 1282 Query: 82 NFFPPILFAQGDTPMGAAITKALDMVEERKREYRA----NGISYY-------------RP 124 + T + +A++ +++RK+ I YY Sbjct: 1283 ------FYLNTYTSFAE-VEEAINNIKQRKQGTNTWDALKLIKYYFTKENGSRIEGGVAQ 1335 Query: 125 WIFLITDGAPTDEWQ-AAANKVFRGEEDKRFAFFSIGVQG-ADMKTLAQISV 174 + LITDG DE A + +K+ A IG+ L +I+ Sbjct: 1336 NLLLITDGEANDEKDLNALADLK----NKKIAITVIGIGNEIKKSELREIAG 1383 >UniRef50_A0CDA1 Chromosome undetermined scaffold_17, whole genome shotgun sequence n=1 Tax=Paramecium tetraurelia RepID=A0CDA1_PARTE Length = 604 Score = 63.8 bits (154), Expect = 4e-09, Method: Composition-based stats. Identities = 33/164 (20%), Positives = 56/164 (34%), Gaps = 32/164 (19%) Query: 25 LLDVSGSMNGRPINELNAGLVTFRDELLADPLALKRVELGIVTFGPVHVEQPFTSAANFF 84 ++D SGSM+G I + L + L L ++ F Q T+ Sbjct: 190 VIDRSGSMSGEKIEMVKQTLNILLNFLGPKD------RLCLIQFDDTC--QRLTNLRRVT 241 Query: 85 PP----------ILFAQGDTPMGAAITKALDMVEERKREYRANGISYYRPWIFLITDGAP 134 ++A G T +G AL ++ RK IF+++DG Sbjct: 242 DENKTYYSDIISKIYANGGTVIGLGTQMALKQIKYRKSVNNVT-------AIFVLSDGQ- 293 Query: 135 TDEWQAAANKVFR--GEEDKRFAFFSIGVQ-GADMKTLAQISVR 175 +AA + + + + S G D K + +IS Sbjct: 294 ---DEAAISSLQKQLAYYKQTLTIHSFGFGSDHDAKLMTKISNL 334 >UniRef50_C4JI95 Putative uncharacterized protein n=3 Tax=Onygenales RepID=C4JI95_UNCRE Length = 314 Score = 63.8 bits (154), Expect = 4e-09, Method: Composition-based stats. Identities = 43/179 (24%), Positives = 66/179 (36%), Gaps = 39/179 (21%) Query: 23 ILLLDVSGSMNGRPINELNAGLVT----------------FRDELLADPLALKRVELGIV 66 + L+D SGSM GR E A L F + + P I Sbjct: 92 VFLIDDSGSMAGRSWRETEAALSAIAPICTQFDADGVDIYFLNHINRQPSQNTGAYRSIT 151 Query: 67 TFGPVHVEQPFTSAANFFPPILFAQGDTP----MGAAITKALDMVEERKREYR-ANGISY 121 PV V + FTS +G TP +G + LD +E R A+ S Sbjct: 152 --SPVEVHEVFTSV--------SPRGGTPTGKRLGQILKPYLDQLESLIENDRFASSDSL 201 Query: 122 YRPW-IFLITDGAPTDEWQAA-------ANKVFRGEEDKRFAFFSIGVQGADMKTLAQI 172 RP + +ITDG PTD+ ++ +++ FF +G + + L ++ Sbjct: 202 LRPLNLIVITDGVPTDDVESVIVSAARKLDRLNAQPWQIGIQFFQVGNEPDAAEDLREL 260 >UniRef50_B5ZN80 von Willebrand factor type A n=8 Tax=Rhizobiales RepID=B5ZN80_RHILW Length = 522 Score = 63.8 bits (154), Expect = 4e-09, Method: Composition-based stats. Identities = 37/170 (21%), Positives = 58/170 (34%), Gaps = 24/170 (14%) Query: 24 LLLDVSGSMNGRPINELNAGLVTFRDELLADPLALKRV------ELGIVTFGPVHVEQPF 77 L LD SGSM G ++L + L D + V + ++ F Sbjct: 346 LCLDFSGSMQGDGEDQLQKAMRFL---LTPDEASKVLVQWSPADRIIVIPFDGSVRNTFM 402 Query: 78 TSAANFFPPIL-------FAQGDTPMGAAITKALDMVEERKREYRANGISYYRPWIFLIT 130 S L A G T M +AL + R++ +S Y P I ++T Sbjct: 403 ASGNPLEQEGLLNEISRQKAGGGTDMYTCAAQALQQI------ARSDRLSTYLPAIVIMT 456 Query: 131 DGAPTDEWQAAANKVFRGEEDKRFAFFSIGVQGADMKTLAQISVRQPLPL 180 DG +D+ A + E F I AD L ++ + + Sbjct: 457 DGR-SDDQSQAFMSEWNATE-PHVPVFGITFGDADKTQLDSLAKQTSARV 504 >UniRef50_Q09E12 Inter-alpha-inhibitor H4 heavy chain, putative n=1 Tax=Stigmatella aurantiaca DW4/3-1 RepID=Q09E12_STIAU Length = 540 Score = 63.8 bits (154), Expect = 4e-09, Method: Composition-based stats. Identities = 39/194 (20%), Positives = 62/194 (31%), Gaps = 32/194 (16%) Query: 22 CILLLDVSGSMNGRPINELNAGLVTFRDELLADPLALKRVELGIVTFGPVHVEQPFTSAA 81 ++D SGSM G + L L +V F VE F + Sbjct: 72 VTFVIDTSGSMQGSRMQIAKDALKYCVTRLNPQDT------FNVVRFS-TDVEALFPALK 124 Query: 82 NFFP----------PILFAQGDTPMGAAITKALDMVEERKREYRANGISYYRP-WIFLIT 130 + P L A G T + A+ R + N P + IT Sbjct: 125 SAQPENIQKAVAFVEQLEAIGGTAIDEAL----------VRGLQDNDGKSSAPHLLMFIT 174 Query: 131 DGAPT--DEWQAAANKVFRGEEDKRFAFFSIGVQ-GADMKTLAQISVRQPLPLQ-GLQFR 186 DG PT + + A + + + F+ GV + + L ++S + Sbjct: 175 DGQPTIGETDEGAIAQHAKDGRKAKTRLFTFGVGEDLNARLLDRLSSDGAGTSDFVRDGK 234 Query: 187 ELFSWLSSSLRSVS 200 E + +SS VS Sbjct: 235 EFETKISSFYDKVS 248 >UniRef50_B9GN58 Predicted protein (Fragment) n=3 Tax=rosids RepID=B9GN58_POPTR Length = 705 Score = 63.8 bits (154), Expect = 4e-09, Method: Composition-based stats. Identities = 30/128 (23%), Positives = 44/128 (34%), Gaps = 23/128 (17%) Query: 16 PEPRCPC--ILLLDVSGSMNGRPINELNAGLVTFRDELLADPLALKRVELGIVTFGPV-H 72 P R P I +LDVS SM G + L + L + L IV F Sbjct: 325 PSRRAPIDLITVLDVSASMTGAKLQMLKRAMRLVISSLGSAD------RLSIVAFSSSPK 378 Query: 73 VEQPF-------TSAANFFPPILFAQGDTPMGAAITKALDMVEERKREYRANGISYYRPW 125 P +A L + +G A+ KA ++E+R+ Sbjct: 379 RLLPLKRMTPNGQRSARRIIDRLVCGQGSSVGEALRKATKVLEDRRERNPVAS------- 431 Query: 126 IFLITDGA 133 I L++DG Sbjct: 432 IMLLSDGQ 439 >UniRef50_C5E9N8 von Willebrand factor type A n=4 Tax=Bifidobacterium longum RepID=C5E9N8_BIFLO Length = 401 Score = 63.4 bits (153), Expect = 5e-09, Method: Composition-based stats. Identities = 39/185 (21%), Positives = 62/185 (33%), Gaps = 30/185 (16%) Query: 23 ILLLDVSGSMNGRPINELNAGLVTFRDELLADPLALKRVELGIVTFGPVHVEQPF----- 77 I ++D SGSM+G N + GL DP K+ + + G V++ PF Sbjct: 224 IWVVDYSGSMSGEGKNGVVKGLNAAL-----DPDQAKKSYIEPAS-GDVNILIPFETEAH 277 Query: 78 ---------TSAANFFPPILFAQGDTPMGAAITKALDMVEERKREYRANGISYYRPWIFL 128 TS A G T + + ALD + + Y I L Sbjct: 278 RPVKATGTSTSDLLHEADATDASGGTDIYEGLLSALDELPSESEASQ------YTTAIVL 331 Query: 129 ITDGAPTDEWQAAANKVFRGEEDKRFAFFSIGVQGADMKTLAQISVRQPLPL---QGLQF 185 +TDG + Q ++ + FSI AD L ++ + + Sbjct: 332 MTDGRSNSDHQDEFESAYKS-RGRDLPIFSIMFGDADPSQLKSLATLSNAKVFDGRSGDL 390 Query: 186 RELFS 190 +F Sbjct: 391 AAVFR 395 >UniRef50_B8CPU4 Von Willebrand factor, type A n=1 Tax=Shewanella piezotolerans WP3 RepID=B8CPU4_SHEPW Length = 710 Score = 63.4 bits (153), Expect = 5e-09, Method: Composition-based stats. Identities = 38/152 (25%), Positives = 59/152 (38%), Gaps = 27/152 (17%) Query: 22 CILLLDVSGSMNGRPINELNAGLVTFRDELLADPLALKRVELGIVTFGP---VHVEQPFT 78 IL++D SGSM+G I + A ++ L A I+ F + P Sbjct: 351 LILVIDTSGSMSGEAIEQAKASIIYALAGLSAQDS------FNILQFNSNVYALSDTPLN 404 Query: 79 SA------ANFFPPILFAQGDTPMGAAITKALDMVEERKREYRANGISYYRPWIFLITDG 132 ++ A + L A G T M A+ KAL + + R + ITDG Sbjct: 405 ASAKNIGRAQAYVQRLQANGGTEMSLALDKALSQQDANRERLRQ---------VLFITDG 455 Query: 133 APTDEWQAAANKVFRGEEDKRFAFFSIGVQGA 164 A +E Q ++ + R F+IG+ A Sbjct: 456 AVGNEPQ-LFTQIRNQLQQSRL--FTIGIGDA 484 >UniRef50_D0N9Y3 Putative uncharacterized protein n=2 Tax=Phytophthora infestans T30-4 RepID=D0N9Y3_PHYIN Length = 1481 Score = 63.4 bits (153), Expect = 5e-09, Method: Composition-based stats. Identities = 37/182 (20%), Positives = 58/182 (31%), Gaps = 21/182 (11%) Query: 23 ILLLDVSGSMNGRPINELNAGLVTFRDELLADPLALKRVELGIVTFGPVHVE----QPFT 78 + +LD SGSM+G+P +L F L D V VTF P Sbjct: 1292 VFVLDNSGSMSGQPWKDLLCACDEFGISRLKDGGEKDLV--SYVTFDHEGRIFCEGVPLP 1349 Query: 79 SAANFFPPILFAQGDTPMGAAITKALDMVEERKREYRANGISYYRPWIFLITDGAPTD-E 137 A P G T + N ++ + +DG P D E Sbjct: 1350 EALEMSVP---------FGGGGTSYGKGLRAANEVLSRNDFEEFKVVLIFFSDGQPCDIE 1400 Query: 138 WQAAANKVFRGEEDKR-FAFFSIGVQGADMKTLAQIS----VRQPLPLQGLQFRELFSWL 192 A + R K F++G ++ L +++ R L R F + Sbjct: 1401 MGVALARHIRLSYAKYDLKAFAVGFGCINLPVLQRVASEMGGRYRQVLDANALRTEFQRI 1460 Query: 193 SS 194 ++ Sbjct: 1461 AA 1462 >UniRef50_B2BSQ4 Structural toxin protein n=4 Tax=Legionella pneumophila RepID=B2BSQ4_LEGPN Length = 4669 Score = 63.4 bits (153), Expect = 5e-09, Method: Composition-based stats. Identities = 35/126 (27%), Positives = 49/126 (38%), Gaps = 12/126 (9%) Query: 22 CILLLDVSGSMNGRPINEL-NAGLVTFRDELLADPLALKRVELGIVTFGPVHV----EQP 76 +L+LD SGSM G I L N+ L AL V++ IVTF Sbjct: 3423 LMLILDTSGSMAGSGIQTLINSTLELLERY-----EALGNVKVRIVTFNTSATAIGSVWM 3477 Query: 77 FTSAANFFPPILFAQGDTPMGAAITKALDMVEERKREYRANGISYYRPWIFLITDGAPT- 135 AA L A G+T AA+ A++ I + + I+DG PT Sbjct: 3478 TVDAAKNALLGLTAGGNTNFDAALITAMNAFNSGTVGGADGRIGGAQNVSYFISDGNPTV 3537 Query: 136 -DEWQA 140 +W + Sbjct: 3538 NQDWPS 3543 >UniRef50_C3YPR2 Putative uncharacterized protein (Fragment) n=1 Tax=Branchiostoma floridae RepID=C3YPR2_BRAFL Length = 863 Score = 63.4 bits (153), Expect = 5e-09, Method: Composition-based stats. Identities = 33/182 (18%), Positives = 61/182 (33%), Gaps = 26/182 (14%) Query: 12 FASNPEPRCP--CILLLDVSGSMNGRPINELNAGLVTFRDELLADPLALKRVELGIVTFG 69 FA + P P + ++D SGSM G + + + T +L ++ F Sbjct: 225 FAPSGLPVVPKNIVFIIDKSGSMGGTKMRQTKQAMNTILKDLRDHD------RFNVMPFS 278 Query: 70 PVHV-----EQPFTSAANFFPPILF------AQGDTPMGAAITKALDMVEERKREYRANG 118 E + N + A G T + AI A D++ R+ Sbjct: 279 YSSTMWRPNEMVLATRENIESARTYVRRSINAGGGTNINQAIIDAADLL--RRVTDDQPN 336 Query: 119 ISYYRPWIFLITDGAPT---DEWQAAANKVFRGEEDKRFAFFSIGVQ-GADMKTLAQISV 174 I +TDG P+ + + V + + + F +G D L ++++ Sbjct: 337 SPRSASLIIFLTDGLPSVGESKPRNIMVNVKNAIRE-QVSLFCLGFGKDVDFPFLEKMAL 395 Query: 175 RQ 176 Sbjct: 396 EN 397 >UniRef50_B7AA98 von Willebrand factor type A n=3 Tax=Thermus RepID=B7AA98_THEAQ Length = 706 Score = 63.4 bits (153), Expect = 5e-09, Method: Composition-based stats. Identities = 38/177 (21%), Positives = 60/177 (33%), Gaps = 28/177 (15%) Query: 9 TSDFASNPEPRC--PCILLLDVSGSMNGRPINELNAGLVTFRDELLADPLALKRVELGIV 66 D P R +L+LDVSGSM G + AG + A LG+V Sbjct: 294 PEDLPLKPLGRKGAALVLVLDVSGSMEGEKLAMAVAGALELVRS------AAPEDYLGVV 347 Query: 67 TFGPV-HVEQPFTSAANFFPPI-------LFAQGDTPMGAAITKALDMVEERKREYRANG 118 F V P L A G T +G A +AL ++++ Sbjct: 348 LFSSSPRVLFPPRPMTAQGKKEAESLLLSLRAGGGTVLGGAFREALRLLQD--------- 398 Query: 119 ISYYRPWIFLITDGAPTDEWQAAANKVFRGEEDKRFAFFSIGVQGADMKTLAQISVR 175 + R + +++DG D + + ++G AD L ++ R Sbjct: 399 VPVERKALLVLSDGIIFDPKEPILA--LAATAGVEVSALALG-PDADAAFLEALAQR 452 >UniRef50_UPI00016E1D39 UPI00016E1D39 related cluster n=1 Tax=Takifugu rubripes RepID=UPI00016E1D39 Length = 753 Score = 63.4 bits (153), Expect = 5e-09, Method: Composition-based stats. Identities = 44/203 (21%), Positives = 71/203 (34%), Gaps = 27/203 (13%) Query: 22 CILLLDVSGSMNGRPINELNAGLVTFRDELLADPLALKRVELGIVTFGPVHVEQPFTSAA 81 + L+D S S+ E+ L F L P ++++G+ + Q F Sbjct: 2 IVFLVDGSSSIGTDNFQEVRLFLRNFTSGLDIGP---DKIQIGLAQYSNDP-HQEFLLKD 57 Query: 82 NFFPPILFA--------QGDTPMGAAITKALDMVEERKREYRANGISYYRPWIFLITDGA 133 + L A G T G AI ++ RAN +ITDG Sbjct: 58 HMEKTALLAALDSFPYRTGGTETGKAIDFLRTQYFTKEAGSRANQRVP--QIAVVITDGD 115 Query: 134 PTDEWQAAANKVFRGEEDKRFAFFSIGVQGADMKTLAQISVRQPLP-------LQGLQ-- 184 TD+ A + + F+IGV A+ L I+ R P Q LQ Sbjct: 116 STDDVTVPAQSLRK----HGVIVFAIGVGNANQNELESIANRPPKRFKFTIDSFQALQRL 171 Query: 185 FRELFSWLSSSLRSVSRSTPGTE 207 + L + S++ ++ G+ Sbjct: 172 TKGLLQTMCVSIKDQHQAEAGSR 194 >UniRef50_C7DHI4 von Willebrand factor type A n=1 Tax=Candidatus Micrarchaeum acidiphilum ARMAN-2 RepID=C7DHI4_9EURY Length = 705 Score = 63.4 bits (153), Expect = 5e-09, Method: Composition-based stats. Identities = 41/155 (26%), Positives = 60/155 (38%), Gaps = 19/155 (12%) Query: 22 CILLLDVSGSMNGRPINELNAGLVTFRDELLADPLALKRVELGIVTF-GPVHVEQPFTSA 80 +LLD+SGSM G+ IN L + D L V L + F G Sbjct: 528 IWMLLDISGSMGGQKINAAKRILGSIHDSLDGSKY----VHLRMFGFYGSDGTHV--FEF 581 Query: 81 ANFFPPILFAQGDTPMGAAITKALDMVEERKREYRANGISYYRPWIFLITDGAPT--DEW 138 L A GDTP AI A+D++++ K + +F+ITDG P E Sbjct: 582 DRKMLMNLAAMGDTPTDIAIYYAMDLMKKDKSNFDKT--------LFIITDGDPNNGQET 633 Query: 139 QAAANKVFRGEEDKRFAFFSIGVQGADMKTLAQIS 173 + A N + K F+I + + + S Sbjct: 634 KNALNSLKNAM--KNVNVFTIFISREAARAVEIFS 666 >UniRef50_UPI0000F2E695 PREDICTED: hypothetical protein n=1 Tax=Monodelphis domestica RepID=UPI0000F2E695 Length = 2439 Score = 63.4 bits (153), Expect = 5e-09, Method: Composition-based stats. Identities = 37/176 (21%), Positives = 61/176 (34%), Gaps = 25/176 (14%) Query: 22 CILLLDVSGSMNGRPINELNAGLVTFRDELLADPLALKRVELGIVTFG-------PVHVE 74 L+D SGS+N E+ ++ + V G+V + + Sbjct: 624 FYFLIDGSGSINHDDFAEMKTFMIELISTFR---VGADHVRFGVVQYSDSPTVEFDIRQH 680 Query: 75 QPFTSAANFFPPILFAQGDTPMGAAITKALDMVEE--RKREYRANGISYYRPWIFLITDG 132 + I G T G A+T + E R + R ++ +ITDG Sbjct: 681 SSVAQLKSAITKIWQTGGGTRTGEALTFMKRLFSEVARDKVLR---------FLIVITDG 731 Query: 133 APTDEWQAAANKVFRGEEDKRFAFFSIGVQGADMKTLAQISVRQPLPLQGLQFREL 188 D+ AA ++ + ++IGV+ A K L +IS Q F L Sbjct: 732 QSQDQVAQAAEELRQE----NITIYAIGVKSAVTKELLEISGSQNRMFFVNDFDSL 783 Score = 53.0 bits (126), Expect = 6e-06, Method: Composition-based stats. Identities = 34/189 (17%), Positives = 66/189 (34%), Gaps = 37/189 (19%) Query: 17 EPRCPCILLLDVSGSMNGRPINELNAGLVTFRDELLADPLALKRVELGIVTFGPVHVEQP 76 + L+D S S++ R + + + + RV +G+ + Sbjct: 1182 HREIDLVFLIDGSSSIHPRNFTAMKTFMKQIVNSFT---IGKDRVRIGVAQYST------ 1232 Query: 77 FTSAANFFPPILFAQGDTPMGAAITKALDMV-EERKREYRANG----ISYYRPW------ 125 F+ ++ GA I + +D + + R + Y G S++ P Sbjct: 1233 -NPQKEFYLNTFYS------GAEINQHIDKITQLRTQTYTGKGLRFVKSFFEPANGSRKN 1285 Query: 126 ------IFLITDGAPTDEWQAAANKVFRGEEDKRFAFFSIGVQGADMKTLAQISVRQPLP 179 + +ITDG D AAN + +++ FSIG+ ++ L I+ Sbjct: 1286 LHVLQSLVVITDGMSNDSVVEAANDLR----NEKIQIFSIGIGVINLFELQLIAGNVKRV 1341 Query: 180 LQGLQFREL 188 F +L Sbjct: 1342 FVVGDFGQL 1350 Score = 53.0 bits (126), Expect = 7e-06, Method: Composition-based stats. Identities = 31/166 (18%), Positives = 60/166 (36%), Gaps = 16/166 (9%) Query: 22 CILLLDVSGSMNGRPINELNAGLVTFRDELLADPLALKRVELGIVTFGP----VHVEQPF 77 + L+D S S+ ++ + L + L + +V +G+ F + F Sbjct: 18 LVFLVDSSTSIGPENFQKVKSFLYSLVLGL---EIGRDQVRVGLAQFNDNIYKAFLLNQF 74 Query: 78 TSAANFFPPILF---AQGDTPMGAAITKALDMVEERKREYRANGISYYRPWIFLITDGAP 134 ++ IL G T G+A+ RA + L+TDG Sbjct: 75 PRKSDVLEQILSLPYRTGGTRTGSALNFLRTEFFTESAGSRAKDNVP--QIVILVTDGES 132 Query: 135 TDEWQAAANKVFRGEEDKRFAFFSIGVQGADMKTLAQISVRQPLPL 180 DE AA+K+ + + + +G+ D++ L I+ + Sbjct: 133 NDEVAEAASKLK----GQGVSIYVVGINVQDVQELKTIASKPLEKF 174 >UniRef50_A9BS02 von Willebrand factor type A n=1 Tax=Delftia acidovorans SPH-1 RepID=A9BS02_DELAS Length = 536 Score = 63.4 bits (153), Expect = 5e-09, Method: Composition-based stats. Identities = 40/183 (21%), Positives = 68/183 (37%), Gaps = 30/183 (16%) Query: 23 ILLLDVSGSMNGRPINELNAGLVTFRDELLADPLALKRV-----ELGIVTFGP------- 70 I +LDVSGSM G + ++ L + + ++ F Sbjct: 342 IFVLDVSGSMKGARLAQMKEALKLLSGAEASAASQRYAAFQARERVLLIPFSGLVGQPAR 401 Query: 71 ----VHVEQPFTSAANFFPPILFAQGDTPMGAAITKALDMVEERKREYRANGISYYRPWI 126 Q ++ + L A G T + A+T A ++ ++E RA+ + I Sbjct: 402 VQFAAGDLQAASAQVLAYADSLVADGGTAIYDALTLAQ---QQARQELRADPERFVS--I 456 Query: 127 FLITDGAPT--DEWQAAANKVFRGEEDKR---FAFFSIGVQGA---DMKTLAQISVRQPL 178 L+TDGA T +W AA + R D F I A +M+ LA ++ + Sbjct: 457 VLLTDGANTAGRDW-AAFEREQRMARDGGAPLVRVFPIIFGEAQSGEMQALAALTGGRAF 515 Query: 179 PLQ 181 + Sbjct: 516 DAR 518 >UniRef50_Q2QSE5 Os12g0431700 protein n=3 Tax=Oryza sativa RepID=Q2QSE5_ORYSJ Length = 524 Score = 63.4 bits (153), Expect = 5e-09, Method: Composition-based stats. Identities = 38/167 (22%), Positives = 62/167 (37%), Gaps = 25/167 (14%) Query: 20 CPCILLLDVSGSMNGRPINELNAGLVTFRDELLADPLALKRVELGIVTFGPV-------- 71 + ++DVSGSM G I + L +L L IVTF Sbjct: 61 LDLVAVVDVSGSMRGHKIESVKKALQFVIMKLTPVD------RLSIVTFESSAKRLTKLR 114 Query: 72 HVEQPFTSAANFFPPILFAQGDTPMGAAITKALDMVEERKREYRANGISYYRPWIFLITD 131 + Q F + L A G T + A + L ++ +R + + + IFL++D Sbjct: 115 AMTQDFRGELDGIVKSLIANGGTDIKAGLDLGLAVLADRV--FTESRTAN----IFLMSD 168 Query: 132 GAPTDEWQAAANKVFRGEEDKRFAFFSIGVQ-GADMKTLAQISVRQP 177 G + +V GE + ++ G G D + L I+ P Sbjct: 169 GKLEGKTSGDPTQVNPGE----VSVYTFGFGHGTDHQLLTDIAKNSP 211 >UniRef50_Q8YNZ7 Alr4412 protein n=6 Tax=Cyanobacteria RepID=Q8YNZ7_ANASP Length = 820 Score = 63.4 bits (153), Expect = 6e-09, Method: Composition-based stats. Identities = 36/163 (22%), Positives = 52/163 (31%), Gaps = 25/163 (15%) Query: 22 CILLLDVSGSMNGRPINELNAGLVTFRDELLADPLALKRVELGIVTFGPVHVEQPFTSAA 81 + L+D SGS G P+ + + F + L D IV F + A Sbjct: 301 VVFLIDTSGSQMGAPLMQCQELMRRFINGLNPDDT------FSIVDFSDTTRQLSPVPLA 354 Query: 82 N---------FFPPILFAQGDTPMGAAITKALDMVEERKREYRANGISYYRPWIFLITDG 132 N + L A G T M I L+ R+ I L+TDG Sbjct: 355 NNAQNRTRAINYINQLSANGGTEMLRGIRAVLNFPVTDPGRLRS---------IVLLTDG 405 Query: 133 APTDEWQAAANKVFRGEEDKRFAFFSIGVQGADMKTLAQISVR 175 +E Q A + R F G + L +I+ Sbjct: 406 YIGNENQILAEVQQHLKSGNRLYSFGAG-SSVNRFLLNRIAEL 447 >UniRef50_A6GBY0 Putative uncharacterized protein n=1 Tax=Plesiocystis pacifica SIR-1 RepID=A6GBY0_9DELT Length = 996 Score = 63.4 bits (153), Expect = 6e-09, Method: Composition-based stats. Identities = 36/185 (19%), Positives = 61/185 (32%), Gaps = 30/185 (16%) Query: 17 EPRCPCILLLDVSGSM-NGRPINELNAGLVTFRDELLADPLALKRVELGIVTFGPVHVE- 74 +P IL++D SGSM +G ++ + L DP E+G++ F Sbjct: 525 QPTLALILVIDKSGSMSSGDRLDLVKEAARATARTL--DPSD----EIGVIAFDNSPQVL 578 Query: 75 ---QPFTSAANFFP--PILFAQGDTPMGAAITKALDMVEERKREYRANGISYYRPWIFLI 129 QP + L A G T A+ +A + K + + L+ Sbjct: 579 VRLQPAANRLRISSSIRRLSAGGGTNAMPALREAYLQLAGSKALVKH---------VILL 629 Query: 130 TDGAPTDEWQAAANKVFRGEEDKRFAFFSIGVQG-ADMKTLAQIS----VRQPLPLQGLQ 184 +DG + N + S+GV A L +++ R G Sbjct: 630 SDGE---SPENGINALLGDMRQSDITVSSVGVGDGAGKDFLIRVAERGRGRYFYSEDGTD 686 Query: 185 FRELF 189 +F Sbjct: 687 VPRIF 691 >UniRef50_A2E6Y7 von Willebrand factor type A domain containing protein n=4 Tax=Trichomonas vaginalis RepID=A2E6Y7_TRIVA Length = 720 Score = 63.4 bits (153), Expect = 6e-09, Method: Composition-based stats. Identities = 43/220 (19%), Positives = 77/220 (35%), Gaps = 31/220 (14%) Query: 12 FASNPEPRCPCILLLDVSGSMNGRPINELNAGLVTFRDEL--------LADPLALKRVEL 63 F E + ++D SGSM+G I L L + + K V + Sbjct: 233 FEGKVEQKSEFYFIIDCSGSMSGSRIENAKFCLNILIHSLPIGCRFSIIQFGNSYKEV-V 291 Query: 64 GIVTFGPVHVEQPFTSAANFFPPILFAQGDTPMGAAITKALDMVEERKREYRANGISYYR 123 I + +V+ ++ A + G T + + + + G + R Sbjct: 292 SICDYSNKNVKYAMSAIARINADM----GGTDILSPLEYVFK---------KKLGKGFIR 338 Query: 124 PWIFLITDGAPTDEWQAAANKVFRGEEDKRFAFFSIGVQ-GADMKTLAQISVRQ----PL 178 IFL+TDG + ++V + E+ R F+IG+ GAD + IS + L Sbjct: 339 -KIFLLTDGEVHNSDM-ICSRVQKERENNR--IFAIGLGSGADPGLIKNISAKSGGNYVL 394 Query: 179 PLQGLQFRELFSWLSSSLRSVSRSTPGTEVVLEAPKGWTS 218 + + S S S S + + + W + Sbjct: 395 IADDDNMNNMIVEIMKSALSPSLSNISIQGESDQTEMWPT 434 >UniRef50_C5YMJ6 Putative uncharacterized protein Sb07g002215 (Fragment) n=1 Tax=Sorghum bicolor RepID=C5YMJ6_SORBI Length = 423 Score = 63.4 bits (153), Expect = 6e-09, Method: Composition-based stats. Identities = 24/122 (19%), Positives = 44/122 (36%), Gaps = 21/122 (17%) Query: 20 CPCILLLDVSGSMNGRPINELNAGLVTFRDELLADPLALKRVELGIVTFG-PVHVEQPFT 78 + +LDVSGSM G+ + + + D L +D L +V F T Sbjct: 125 LDLVTVLDVSGSMAGKKMERVKRAMGFLIDNLGSDD------RLSVVAFSTDARRIIRLT 178 Query: 79 SAANFFP-------PILFAQGDTPMGAAITKALDMVEERKREYRANGISYYRPWIFLITD 131 ++ L A G T + + A +++ R+ + + L++D Sbjct: 179 RMSDDGKAAAKRAVESLAASGSTNIRGGLDVAAMVLDGRRHKNAVAS-------VILLSD 231 Query: 132 GA 133 G Sbjct: 232 GQ 233 >UniRef50_A6M139 von Willebrand factor, type A n=1 Tax=Clostridium beijerinckii NCIMB 8052 RepID=A6M139_CLOB8 Length = 962 Score = 63.4 bits (153), Expect = 6e-09, Method: Composition-based stats. Identities = 30/135 (22%), Positives = 49/135 (36%), Gaps = 20/135 (14%) Query: 18 PRCPCILLLDVSGSMNGR-PINELNAGLVTFRDELLADPLALKRVELGIVTFGPVHVE-- 74 P +L+LD SGSM + L F ++ +K +++ IV F Sbjct: 78 PPKEIVLVLDSSGSMADNYKLTNLKKAATDFITKM----STVKNLKIAIVDFDTQATIIN 133 Query: 75 --QPFTSAANFFP-----PILFAQGDTPMGAAITKALDMVEERKREYRANGISYYRPWIF 127 +S+ N L A G T G + +A ++ + + I Sbjct: 134 KLTDVSSSTNVTALKRSINNLTAGGGTNTGEGLRQAAYLLSNSSEQNPLASKN-----II 188 Query: 128 LITDGAPT-DEWQAA 141 ++DG PT WQ A Sbjct: 189 FMSDGEPTYYNWQTA 203 >UniRef50_C9ZGQ6 Putative membrane protein n=3 Tax=Streptomyces RepID=C9ZGQ6_STRSW Length = 534 Score = 63.4 bits (153), Expect = 6e-09, Method: Composition-based stats. Identities = 34/168 (20%), Positives = 60/168 (35%), Gaps = 28/168 (16%) Query: 23 ILLLDVSGSMNGRPINELNAGLVTFRDELLADPLALKRVELGIVTFGP------VHVEQP 76 + +LD SGSM G ++ L L + R E+ ++ FG HV +P Sbjct: 358 VYVLDTSGSMEGDRLDRLKTALTELTGDFR------DREEVTLMPFGSDVKSVRTHVVRP 411 Query: 77 FT-----SAANFFPPILFAQGDTPMGAAITKALDMVEERKREYRANGISYYRPWIFLITD 131 L A G+T + ++ +A + + R+ + I L+TD Sbjct: 412 ADPKAGLDGIRADTRKLSAAGETAIYTSLRRAYEHLGAVDRDTFTS--------IVLMTD 463 Query: 132 GAPTDEWQAA-ANKVFRG--EEDKRFAFFSIGVQGADMKTLAQISVRQ 176 G T+ A + + + + F I +D L I+ Sbjct: 464 GENTEGASPADFDDFYGRLPDAARHIPVFPILFGDSDRDELEHIAEVT 511 >UniRef50_Q5NJK1 Matrilin-3a n=5 Tax=Danio rerio RepID=Q5NJK1_DANRE Length = 460 Score = 63.0 bits (152), Expect = 6e-09, Method: Composition-based stats. Identities = 31/164 (18%), Positives = 56/164 (34%), Gaps = 15/164 (9%) Query: 20 CPCILLLDVSGSMNGRPINELNAGLVTFRDELLADPLALKRVELGIVTFGPVHVEQPF-- 77 + ++D S S+ ++ L D L P A + +V + + Sbjct: 63 LDLVFIIDSSRSVRPGEFEKVKIFLADMVDTLDVGPDAT---RVAVVNYASTVKIESLLK 119 Query: 78 -----TSAANFFPPILFAQGDTPMGAAITKALDMVEERKREYRANGISYYRPWIFLITDG 132 + I T G AI KA+D K R + + ++TDG Sbjct: 120 SHLTKDTIKQAITRIEPLAAGTMTGMAIKKAMDEAFTEKSGARPKSKNISK-VAIIVTDG 178 Query: 133 APTDEWQAAANKVFRGEEDKRFAFFSIGVQGADMKTLAQISVRQ 176 P D+ +V +++GV ADM++L ++ Sbjct: 179 RPQDQ----VEEVSAAARASGIEIYAVGVDRADMRSLKLMASNP 218 >UniRef50_A4QDZ6 Putative uncharacterized protein n=1 Tax=Corynebacterium glutamicum R RepID=A4QDZ6_CORGB Length = 354 Score = 63.0 bits (152), Expect = 6e-09, Method: Composition-based stats. Identities = 44/187 (23%), Positives = 62/187 (33%), Gaps = 30/187 (16%) Query: 24 LLLDVSGSMNGRPINELNAGLVTFRD----ELLADPLALKRVELGIV--TFGPVHVE--- 74 +LDVSGSM G+ I L + LA+ R ++ I+ +FGP V Sbjct: 172 FVLDVSGSMLGQRITLLKDTMSDLISGGATTDLANVSLRGREKVSIIPFSFGPHEVISET 231 Query: 75 -----QPFTSAANFFPPILFAQGDTPMGAAITKALDMVEERKREYRANGISYYRPWIFLI 129 P L A G T + A+ A Y + Y P I L+ Sbjct: 232 LGAVGSPSRIDLQQRVEALQADGGTGIYDAVLAA----------YAESAGGDYIPSIVLM 281 Query: 130 TDGAPT-----DEWQAAANKVFRGEEDKRFAFFSIGVQG-ADMKTLAQISVRQPLPLQGL 183 TDG T D++ N + G ADM+ LA + + Sbjct: 282 TDGELTAGRTYDQFLTEWNALPSNIRSIPVFVILYGEANVADMEQLAATTGGKTFDAING 341 Query: 184 QFRELFS 190 E F Sbjct: 342 DLDEAFK 348 >UniRef50_O05809 Uncharacterized protein Rv2850c/MT2916 n=51 Tax=Bacteria RepID=Y2850_MYCTU Length = 629 Score = 63.0 bits (152), Expect = 6e-09, Method: Composition-based stats. Identities = 29/119 (24%), Positives = 48/119 (40%), Gaps = 14/119 (11%) Query: 22 CILLLDVSGSMNGRPINELNAGLVTFRDELLADPLALKRVELGIVTF--GPVHVEQPFTS 79 I ++D SGSM R + A + LL D + ++ ++TF + TS Sbjct: 452 VIFVVDASGSMAAR---DRMAAVSGATLSLLRDAYQRRD-KVAVITFRQHEATLLLSPTS 507 Query: 80 AANF---FPPILFAQGDTPMGAAITKALDMVEERKREYRANGISYYRPWIFLITDGAPT 135 +A+ G TP+ + A ++ K RA RP + ++TDG T Sbjct: 508 SAHIAGRRLARFSTGGKTPLAEGLLAARALIIREKVRDRA-----RRPLVVVLTDGRAT 561 >UniRef50_A4XTA4 Hemolysin-type calcium-binding region n=1 Tax=Pseudomonas mendocina ymp RepID=A4XTA4_PSEMY Length = 3184 Score = 63.0 bits (152), Expect = 7e-09, Method: Composition-based stats. Identities = 43/193 (22%), Positives = 66/193 (34%), Gaps = 26/193 (13%) Query: 24 LLLDVSGSMNGRPINELNAGLVTFRDELLADPLALKR--VELGIVTFGP-VHVEQPFT-- 78 L+LD+S SM G + L +++ A V + ++TF FT Sbjct: 2367 LVLDISLSMAGDKLTALKQAVISLAQ-----GYAGLSAPVHVNLITFNSGAAEIGDFTFS 2421 Query: 79 -------SAANFFPPILFAQGDTPMGAAITKALDMVEERKREYRANGISYYRPWIFLITD 131 +A L A G T A++ A V A+ ++ ++ I+D Sbjct: 2422 SVGDAGYTALLTAVNGLTASGFTNYEQALSVAKAQVLSDISAPGADPAQQHK--LYFISD 2479 Query: 132 GAPTDEWQAAANKVFRGEEDKRFAFFSIGVQGADMKTLAQISVRQPLPLQGLQFRELFSW 191 G PT Q A + F G + + SV G+ F F Sbjct: 2480 GEPTVGAQGATLTTWIANNWTNFIGNVDGDGDSATNSFTAHSV-------GISFTGSFMG 2532 Query: 192 LSSSLRSVSRSTP 204 +S SV STP Sbjct: 2533 QIASDGSVINSTP 2545 >UniRef50_Q083T9 Vault protein inter-alpha-trypsin domain protein n=1 Tax=Shewanella frigidimarina NCIMB 400 RepID=Q083T9_SHEFN Length = 722 Score = 63.0 bits (152), Expect = 7e-09, Method: Composition-based stats. Identities = 40/164 (24%), Positives = 61/164 (37%), Gaps = 22/164 (13%) Query: 22 CILLLDVSGSMNGRPINELNAGLVTFRDELLADPLALKRVELGIVTFGPVHVEQPFTSA- 80 IL++D SGSM+G+ I + L L I+ F T Sbjct: 346 LILVIDTSGSMSGQSITQAKQALQFALAGLRDIDS------FNIIEFNSDVTMLSATPLS 399 Query: 81 --------ANFFPPILFAQGDTPMGAAITKAL-DMVEERKREYRANGISYYRPWIFLITD 131 AN F L A G T M +A+ AL D V++ + A+ R IF +TD Sbjct: 400 ANSRNIGKANRFIQSLDADGGTEMRSALQTALVDSVQQDSDQTDAHS-EMLRQVIF-MTD 457 Query: 132 GAPTDEWQAAANKVFRGEEDKRFAFFSIGVQGA-DMKTLAQISV 174 GA + D R F++G+ A + + + + Sbjct: 458 GA-VGNEHELYQLINDQLGDSRL--FTVGIGSAPNSDFMRRAAT 498 >UniRef50_B0WHU4 Sushi n=3 Tax=Culicini RepID=B0WHU4_CULQU Length = 2239 Score = 63.0 bits (152), Expect = 7e-09, Method: Composition-based stats. Identities = 36/177 (20%), Positives = 61/177 (34%), Gaps = 31/177 (17%) Query: 17 EPRCPCILLLDVSGSMNGRPINELNAGLV-TFRDELLADPLALKR-VELGIVTFGPVHVE 74 R + L+D S S GR N F +LL+D + ++TF Sbjct: 132 NKRVDIVFLIDASSS-VGRQ----NFASEIKFVKKLLSDFNVSYNYTRVAVITFSSQKKI 186 Query: 75 -------------QPFTSAANFFPPIL-FAQGDTPMGAAITKALDMVEERKREYRANGIS 120 N+ P + F+ G T A+ +A ++ + + + + Sbjct: 187 FRHIDQISQSVEDNDKCLLLNYQVPRIAFSGGGTYTYGALKEAEEIFKNARLDSKK---- 242 Query: 121 YYRPWIFLITDGAPTDEWQAAANKVFRGEEDKRFAFFSIGVQGADMKTLAQISVRQP 177 IFLITDG + R ++D +SIG+Q + L I+ Sbjct: 243 ----IIFLITDGFSNG--RDPIPLAGRLKKDNNVVIYSIGIQSGNYAELHAIASAPE 293 >UniRef50_C5WZE3 Putative uncharacterized protein Sb01g019910 n=1 Tax=Sorghum bicolor RepID=C5WZE3_SORBI Length = 704 Score = 62.6 bits (151), Expect = 8e-09, Method: Composition-based stats. Identities = 28/123 (22%), Positives = 42/123 (34%), Gaps = 25/123 (20%) Query: 20 CPCILLLDVSGSMNGRPINELNAGLVTFRDELLADPLALKRVELGIVTFGPVHVEQ-PFT 78 + +LDVS SM+G + L + + L L +V F P Sbjct: 234 LDLVTVLDVSRSMSGPKLALLKRAMRFVIENLEPSD------RLSVVAFSSSACRLFPLR 287 Query: 79 SAANFFPPI-------LFAQGDTPMGAAITKALDMVEERKREYRANGISYYRPW--IFLI 129 F L A G T + + KA +VE+R+ P I L+ Sbjct: 288 KMTAFGQQQSQQAVDSLVADGGTNIAEGLRKAARVVEDRQARN---------PVCSIILL 338 Query: 130 TDG 132 +DG Sbjct: 339 SDG 341 >UniRef50_D1HBR9 Whole genome shotgun sequence of line PN40024, scaffold_205.assembly12x (Fragment) n=1 Tax=Vitis vinifera RepID=D1HBR9_VITVI Length = 656 Score = 62.6 bits (151), Expect = 8e-09, Method: Composition-based stats. Identities = 31/129 (24%), Positives = 43/129 (33%), Gaps = 23/129 (17%) Query: 15 NPEPRCPCILL--LDVSGSMNGRPINELNAGLVTFRDELLADPLALKRVELGIVTFG-PV 71 NP R P L+ LDV G M G + + + L + L IV F Sbjct: 277 NPARRAPIDLVTVLDVGGGMTGAKLQMMKRAMRLVISSLSS------TDRLSIVAFSASS 330 Query: 72 HVEQPFTSA-------ANFFPPILFAQGDTPMGAAITKALDMVEERKREYRANGISYYRP 124 P A L A T G A+ KA ++E+R+ Sbjct: 331 KRLMPLKRMTTTGRRSARRIIESLIAGQGTSAGEALKKASKVLEDRRERNPVAS------ 384 Query: 125 WIFLITDGA 133 I L++DG Sbjct: 385 -IMLLSDGQ 392 >UniRef50_B5ZY26 Vault protein inter-alpha-trypsin domain protein n=11 Tax=Rhizobium RepID=B5ZY26_RHILW Length = 794 Score = 62.6 bits (151), Expect = 8e-09, Method: Composition-based stats. Identities = 35/152 (23%), Positives = 52/152 (34%), Gaps = 26/152 (17%) Query: 22 CILLLDVSGSMNGRPINELNAGLVTFRDELLADPLALKRVELGIVTFGP---------VH 72 + ++D SGSM+G I + L L + ++ F V Sbjct: 356 VVFVIDNSGSMSGPSIEQAKQSLALAISRLTPND------RFNVIRFDDTMTDYFKGLVA 409 Query: 73 VEQPFTSAANFFPPILFAQGDTPMGAAITKALDMVEERKREYRANGISYYRPWIFLITDG 132 A + L A G T M A+ AL R R +FL TDG Sbjct: 410 ATPDNREKAIAYVRGLPADGGTEMLPALEDAL-------RNQGPVATGALRQVVFL-TDG 461 Query: 133 APTDEWQAAANKVFRGEEDKRFAFFSIGVQGA 164 A +E Q ++ D R F++G+ A Sbjct: 462 AIGNE-QQLFQEITANRGDAR--VFTVGIGSA 490 >UniRef50_B1G2X7 Putative uncharacterized protein n=1 Tax=Burkholderia graminis C4D1M RepID=B1G2X7_9BURK Length = 182 Score = 62.6 bits (151), Expect = 8e-09, Method: Composition-based stats. Identities = 34/121 (28%), Positives = 48/121 (39%), Gaps = 16/121 (13%) Query: 23 ILLLDVSGSM-NGRPINELNAGLVTFRDELLADPLALKRVELGIVTFGPVHVEQPFTSAA 81 +LD SGSM G + L GL+ D + R E +V FG + F A Sbjct: 28 CFVLDCSGSMLAGERL-ALAKGLLIAL----FDRASAMRAEAALVCFGGAGADLRFGPAV 82 Query: 82 NFFPPI--LFA---QGDTPMGAAITKALDMVEERKREYRANGISYYRPWIFLITDGAPTD 136 + L G TP A + A ++E R A W++++TDG TD Sbjct: 83 PRWWNERWLEPVGGGGGTPFAAGVQCATQLLERSARRKPAQQR-----WVWILTDGRTTD 137 Query: 137 E 137 E Sbjct: 138 E 138 Searching..................................................done Results from round 3 Score E Sequences producing significant alignments: (bits) Value Sequences used in model and found again: UniRef50_P76396 Uncharacterized protein yegL n=64 Tax=cellular o... 258 1e-67 UniRef50_Q114A2 von Willebrand factor, type A n=3 Tax=Cyanobacte... 224 1e-57 UniRef50_Q3M2E0 von Willebrand factor, type A n=2 Tax=Cyanobacte... 222 6e-57 UniRef50_UPI00016C400A von Willebrand factor, type A n=1 Tax=Gem... 219 7e-56 UniRef50_B9XNU9 von Willebrand factor type A n=1 Tax=bacterium E... 210 2e-53 UniRef50_B8HUC4 von Willebrand factor type A n=7 Tax=Bacteria Re... 210 3e-53 UniRef50_B5W3I3 von Willebrand factor type A n=4 Tax=Cyanobacter... 210 3e-53 UniRef50_Q8YTA2 Alr2822 protein n=11 Tax=Bacteria RepID=Q8YTA2_A... 209 8e-53 UniRef50_C0AYK3 Putative uncharacterized protein n=1 Tax=Proteus... 203 4e-51 UniRef50_Q2SNJ1 Uncharacterized protein encoded in toxicity prot... 195 9e-49 UniRef50_Q2FNC6 von Willebrand factor, type A n=1 Tax=Methanospi... 195 1e-48 UniRef50_C9RRF6 von Willebrand factor type A n=3 Tax=Bacteria Re... 192 6e-48 UniRef50_Q87W17 von Willebrand factor type A domain protein n=2 ... 192 1e-47 UniRef50_C1TQB2 Uncharacterized protein n=1 Tax=Dethiosulfovibri... 185 1e-45 UniRef50_C9RJ63 von Willebrand factor type A n=1 Tax=Fibrobacter... 185 1e-45 UniRef50_A9AV55 von Willebrand factor type A n=2 Tax=Bacteria Re... 182 9e-45 UniRef50_C6Z299 Tellurium resistance protein n=1 Tax=Bacteroides... 179 7e-44 UniRef50_C8W3F9 von Willebrand factor type A n=1 Tax=Desulfotoma... 175 8e-43 UniRef50_A3PUP3 von Willebrand factor, type A n=1 Tax=Mycobacter... 174 2e-42 UniRef50_D1TTW6 von Willebrand factor type A domain protein n=10... 174 2e-42 UniRef50_D1PS09 von Willebrand factor type A domain protein n=1 ... 173 4e-42 UniRef50_UPI000185CA78 von Willebrand factor type A n=1 Tax=Capn... 171 2e-41 UniRef50_C8PVC3 von Willebrand factor type A domain protein n=1 ... 170 4e-41 UniRef50_A1VWQ4 von Willebrand factor, type A n=20 Tax=Proteobac... 166 5e-40 UniRef50_Q19NM5 TerY1 n=54 Tax=root RepID=Q19NM5_ECOK1 163 4e-39 UniRef50_A9BLP5 von Willebrand factor type A n=3 Tax=Burkholderi... 163 4e-39 UniRef50_Q5NWS3 Tellurium resistance protein n=2 Tax=Proteobacte... 162 7e-39 UniRef50_D0W6S8 Glycosyl transferase, group 2 family n=1 Tax=Nei... 162 9e-39 UniRef50_C0DBA5 Putative uncharacterized protein n=1 Tax=Clostri... 155 7e-37 UniRef50_B7AFM8 Putative uncharacterized protein n=1 Tax=Bactero... 154 2e-36 UniRef50_C7PNX3 von Willebrand factor type A n=2 Tax=Sphingobact... 153 4e-36 UniRef50_Q0I303 Putative uncharacterized protein n=5 Tax=Pasteur... 153 5e-36 UniRef50_B6BJ58 Phage/colicin/tellurite resistance cluster TerY ... 150 3e-35 UniRef50_C9ZGR0 Putative uncharacterized protein n=1 Tax=Strepto... 146 6e-34 UniRef50_C5PP99 von Willebrand factor, type A n=2 Tax=Bacteroide... 140 5e-32 UniRef50_C9LWT6 Tellurium resistance protein n=1 Tax=Selenomonas... 138 1e-31 UniRef50_A1VV60 von Willebrand factor, type A n=4 Tax=Proteobact... 137 4e-31 UniRef50_Q5NWS4 Tellurium resistance protein n=8 Tax=Bacteria Re... 135 1e-30 UniRef50_D1AAR7 von Willebrand factor type A n=1 Tax=Thermomonos... 133 3e-30 UniRef50_UPI0001745BB0 TerY3 n=1 Tax=Verrucomicrobium spinosum D... 132 9e-30 UniRef50_B0A9L7 Putative uncharacterized protein n=2 Tax=Clostri... 131 2e-29 UniRef50_Q8DK92 Tlr0974 protein n=1 Tax=Thermosynechococcus elon... 131 2e-29 UniRef50_B2K3B2 von Willebrand factor type A n=39 Tax=Gammaprote... 130 4e-29 UniRef50_UPI0001B52F00 von Willebrand factor type A n=1 Tax=Fuso... 129 5e-29 UniRef50_UPI00016C09D7 hypothetical protein Epulo_01596 n=1 Tax=... 129 6e-29 UniRef50_C0F0K9 Putative uncharacterized protein n=2 Tax=Eubacte... 128 9e-29 UniRef50_UPI0001AED79F von Willebrand factor type A n=1 Tax=Stre... 128 1e-28 UniRef50_UPI00018742C1 von Willebrand factor type A n=1 Tax=Cory... 127 2e-28 UniRef50_B4VGR3 Putative uncharacterized protein n=1 Tax=Strepto... 127 3e-28 UniRef50_B7AIG3 Putative uncharacterized protein n=2 Tax=Bactero... 124 2e-27 UniRef50_B8HT03 von Willebrand factor type A n=2 Tax=Cyanothece ... 124 2e-27 UniRef50_D0N9W4 Putative uncharacterized protein n=2 Tax=Phytoph... 124 3e-27 UniRef50_UPI0001C37785 von Willebrand factor type A n=1 Tax=Rumi... 123 3e-27 UniRef50_UPI000155CC23 PREDICTED: similar to ITI-like protein n=... 123 4e-27 UniRef50_C9RJN5 von Willebrand factor type A n=1 Tax=Fibrobacter... 123 4e-27 UniRef50_C2HF13 von Willebrand factor type A n=1 Tax=Finegoldia ... 122 8e-27 UniRef50_UPI0001B4E5FD von Willebrand factor type A n=1 Tax=Stre... 121 2e-26 UniRef50_C5CDX4 von Willebrand factor type A n=1 Tax=Kosmotoga o... 121 2e-26 UniRef50_UPI0001B4AB93 von Willebrand factor type A n=1 Tax=Bact... 120 2e-26 UniRef50_A6G3C6 von Willebrand factor, type A n=1 Tax=Plesiocyst... 120 5e-26 UniRef50_UPI000180CCF8 PREDICTED: similar to PK-120 n=1 Tax=Cion... 120 5e-26 UniRef50_C3XUD0 Putative uncharacterized protein n=1 Tax=Branchi... 118 1e-25 UniRef50_Q0RFF8 Putative uncharacterized protein n=1 Tax=Frankia... 118 1e-25 UniRef50_UPI00016E8A41 UPI00016E8A41 related cluster n=3 Tax=Tak... 116 7e-25 UniRef50_C9RK46 von Willebrand factor type A n=1 Tax=Fibrobacter... 116 7e-25 UniRef50_B5W7H4 von Willebrand factor type A n=1 Tax=Arthrospira... 115 1e-24 UniRef50_Q6UXX5 Inter-alpha-trypsin inhibitor heavy chain H5-lik... 115 1e-24 UniRef50_D0LTR8 von Willebrand factor type A n=1 Tax=Haliangium ... 115 1e-24 UniRef50_UPI00017B0D26 UPI00017B0D26 related cluster n=2 Tax=Tet... 114 2e-24 UniRef50_Q4SBF6 Chromosome 11 SCAF14674, whole genome shotgun se... 114 2e-24 UniRef50_B9XQ17 Vault protein inter-alpha-trypsin domain protein... 113 3e-24 UniRef50_A8SDD9 Putative uncharacterized protein n=2 Tax=Ruminoc... 113 6e-24 UniRef50_C7N2G1 Uncharacterized protein n=2 Tax=Slackia heliotri... 113 6e-24 UniRef50_Q22SJ4 von Willebrand factor type A domain containing p... 113 6e-24 UniRef50_UPI00006CDDCC von Willebrand factor type A domain conta... 112 8e-24 UniRef50_B2UUD5 Phage/colicin/tellurite resistance cluster terY ... 112 9e-24 UniRef50_Q22ST4 von Willebrand factor type A domain containing p... 112 1e-23 UniRef50_C5VFZ9 von Willebrand factor type A n=2 Tax=Corynebacte... 111 2e-23 UniRef50_Q14624 35 kDa inter-alpha-trypsin inhibitor heavy chain... 111 2e-23 UniRef50_P79263 Inter-alpha-trypsin inhibitor heavy chain H4 n=3... 111 2e-23 UniRef50_Q10Z89 von Willebrand factor, type A n=1 Tax=Trichodesm... 111 2e-23 UniRef50_UPI0001C38CFE von Willebrand factor, type A n=1 Tax=Art... 111 2e-23 UniRef50_UPI00006A1B4A Collagen alpha-3(VI) chain precursor. n=5... 111 2e-23 UniRef50_A7RKA1 Predicted protein n=2 Tax=Nematostella vectensis... 110 3e-23 UniRef50_Q503P4 Zgc:110377 n=9 Tax=Clupeocephala RepID=Q503P4_DANRE 110 3e-23 UniRef50_A3YVK5 Tellurium resistance protein n=3 Tax=Cyanobacter... 110 3e-23 UniRef50_A1THU3 von Willebrand factor, type A n=1 Tax=Mycobacter... 110 4e-23 UniRef50_Q9LN03 T6D22.13 n=2 Tax=Arabidopsis thaliana RepID=Q9LN... 109 9e-23 UniRef50_P19823 Inter-alpha-trypsin inhibitor heavy chain H2 n=4... 108 1e-22 UniRef50_Q86UX2 Inter-alpha-trypsin inhibitor heavy chain H5 n=4... 108 1e-22 UniRef50_C5WYU9 Putative uncharacterized protein Sb01g047460 n=3... 108 1e-22 UniRef50_C3YPR2 Putative uncharacterized protein (Fragment) n=1 ... 108 1e-22 UniRef50_Q6PGW2 Zgc:112265 protein (Fragment) n=15 Tax=Clupeocep... 108 1e-22 UniRef50_UPI0001760236 PREDICTED: similar to mCG140660 n=1 Tax=D... 108 1e-22 UniRef50_Q231J4 von Willebrand factor type A domain containing p... 108 2e-22 UniRef50_Q1D9B7 von Willebrand factor type A domain protein n=3 ... 108 2e-22 UniRef50_Q55874 Uncharacterized protein sll0103 n=16 Tax=Cyanoba... 107 2e-22 UniRef50_A7BNL3 Tellurium resistance protein n=1 Tax=Beggiatoa s... 107 2e-22 UniRef50_A2AX52 Collagen alpha-4(VI) chain n=12 Tax=Chordata Rep... 107 3e-22 UniRef50_UPI0000F2E695 PREDICTED: hypothetical protein n=1 Tax=M... 107 3e-22 UniRef50_B4DPQ4 cDNA FLJ60769, highly similar to Inter-alpha-try... 106 3e-22 UniRef50_Q10RY0 Os03g0142500 protein n=2 Tax=Oryza sativa RepID=... 106 4e-22 UniRef50_B7KCF7 von Willebrand factor type A n=1 Tax=Cyanothece ... 106 4e-22 UniRef50_UPI00016C377F protein containing a von Willebrand facto... 106 4e-22 UniRef50_UPI00016DFBC7 UPI00016DFBC7 related cluster n=4 Tax=Tak... 106 4e-22 UniRef50_Q235T9 von Willebrand factor type A domain containing p... 106 7e-22 UniRef50_UPI00006A1915 Transmembrane protein 110. n=2 Tax=Xenopu... 106 7e-22 UniRef50_UPI000180BC4A PREDICTED: similar to predicted protein n... 106 7e-22 UniRef50_UPI00017B4DF5 UPI00017B4DF5 related cluster n=3 Tax=Tet... 106 7e-22 UniRef50_A7C1J8 von Willebrand factor, type A n=1 Tax=Beggiatoa ... 106 7e-22 UniRef50_C0PS55 Putative uncharacterized protein n=1 Tax=Picea s... 105 9e-22 UniRef50_C1A2B7 Putative uncharacterized protein n=1 Tax=Rhodoco... 105 9e-22 UniRef50_A9GUK8 Putative uncharacterized protein yfbK n=1 Tax=So... 105 1e-21 UniRef50_Q9FF49 Retroelement pol polyprotein-like n=34 Tax=Magno... 105 1e-21 UniRef50_UPI00016E1D1D UPI00016E1D1D related cluster n=9 Tax=Tet... 105 1e-21 UniRef50_Q9M1S2 Putative uncharacterized protein T5N23_140 n=5 T... 105 1e-21 UniRef50_Q09E12 Inter-alpha-inhibitor H4 heavy chain, putative n... 105 1e-21 UniRef50_A6H584 Collagen alpha-5(VI) chain n=2 Tax=Mus musculus ... 105 1e-21 UniRef50_UPI000180D155 PREDICTED: similar to integrin alpha Hr1 ... 105 1e-21 UniRef50_UPI0000E460BF PREDICTED: similar to inter-alpha-trypsin... 105 1e-21 UniRef50_Q498Q0 Inter-alpha (Globulin) inhibitor H3 n=6 Tax=Clup... 105 2e-21 UniRef50_A2E6Y7 von Willebrand factor type A domain containing p... 105 2e-21 UniRef50_Q5RHF3 Novel protein similar to vertebrate inter-alpha ... 104 2e-21 UniRef50_Q5NJK1 Matrilin-3a n=5 Tax=Danio rerio RepID=Q5NJK1_DANRE 104 2e-21 UniRef50_UPI00017450FB von Willebrand factor type A domain prote... 104 2e-21 UniRef50_UPI000155C0BD PREDICTED: similar to collagen type VI al... 104 2e-21 UniRef50_A6NMZ7 Collagen alpha-6(VI) chain n=2 Tax=Theria RepID=... 104 2e-21 UniRef50_UPI000155C0BC PREDICTED: hypothetical protein n=1 Tax=O... 104 2e-21 UniRef50_C7PTX8 von Willebrand factor type A n=1 Tax=Chitinophag... 104 2e-21 UniRef50_Q4TBC0 Chromosome undetermined SCAF7164, whole genome s... 104 2e-21 UniRef50_A0CDA1 Chromosome undetermined scaffold_17, whole genom... 104 2e-21 UniRef50_B9MS10 von Willebrand factor type A n=2 Tax=Anaerocellu... 104 3e-21 UniRef50_A0E9B3 Chromosome undetermined scaffold_84, whole genom... 104 3e-21 UniRef50_Q24FW2 von Willebrand factor type A domain containing p... 104 3e-21 UniRef50_A2E0T6 von Willebrand factor type A domain containing p... 103 3e-21 UniRef50_UPI0001B4AD96 von Willebrand factor type A n=1 Tax=Bact... 103 3e-21 UniRef50_A5UYN7 von Willebrand factor, type A n=1 Tax=Roseiflexu... 103 4e-21 UniRef50_D1HHA4 Whole genome shotgun sequence of line PN40024, s... 103 4e-21 UniRef50_A0C051 Chromosome undetermined scaffold_14, whole genom... 103 4e-21 UniRef50_A8TX70 Collagen alpha-5(VI) chain n=18 Tax=Eutheria Rep... 103 4e-21 UniRef50_C0HF51 Putative uncharacterized protein n=1 Tax=Zea may... 103 4e-21 UniRef50_A8L6A2 von Willebrand factor type A n=1 Tax=Frankia sp.... 103 4e-21 UniRef50_UPI00016E1D39 UPI00016E1D39 related cluster n=1 Tax=Tak... 103 5e-21 UniRef50_UPI000155D2F0 PREDICTED: similar to matrilin-3, partial... 103 5e-21 UniRef50_Q8C6K9 Collagen alpha-6(VI) chain n=26 Tax=cellular org... 103 5e-21 UniRef50_D2R2I7 von Willebrand factor type A n=2 Tax=Planctomyce... 103 5e-21 UniRef50_Q7ZX63 Matn2-prov protein n=16 Tax=Euteleostomi RepID=Q... 103 5e-21 UniRef50_B1HPN0 Putative uncharacterized protein n=2 Tax=Bacilla... 103 5e-21 UniRef50_Q9U7P4 Putative uncharacterized protein (Fragment) n=1 ... 103 6e-21 UniRef50_UPI0000F2DDBB PREDICTED: similar to Inter-alpha (globul... 103 6e-21 UniRef50_UPI000174639B hypothetical protein VspiD_18615 n=1 Tax=... 102 7e-21 UniRef50_UPI000180D2FB PREDICTED: similar to inter-alpha (globul... 102 8e-21 UniRef50_A6DLI0 Putative uncharacterized protein n=1 Tax=Lentisp... 102 9e-21 UniRef50_Q5NIW0 Matrilin 3b n=19 Tax=Clupeocephala RepID=Q5NIW0_... 102 1e-20 UniRef50_UPI0000EB12CB UPI0000EB12CB related cluster n=1 Tax=Can... 101 1e-20 UniRef50_A0EHG0 Chromosome undetermined scaffold_97, whole genom... 101 1e-20 UniRef50_C3ZZV2 Putative uncharacterized protein n=1 Tax=Branchi... 101 1e-20 UniRef50_C3YBZ5 Putative uncharacterized protein n=1 Tax=Branchi... 101 1e-20 UniRef50_Q15NW6 Vault protein inter-alpha-trypsin n=1 Tax=Pseudo... 101 1e-20 UniRef50_B8AXM1 Putative uncharacterized protein n=1 Tax=Oryza s... 101 2e-20 UniRef50_UPI00016E6A6D UPI00016E6A6D related cluster n=2 Tax=Tak... 101 2e-20 UniRef50_UPI000194D9FE PREDICTED: similar to matrilin 4 n=2 Tax=... 101 2e-20 UniRef50_C3JL94 von Willebrand factor type A domain protein n=1 ... 101 2e-20 UniRef50_UPI00006CDA4D Glutathionylspermidine synthase family pr... 101 2e-20 UniRef50_C7N1C6 Uncharacterized protein n=1 Tax=Slackia heliotri... 101 2e-20 UniRef50_D2VKS7 von Willebrand factor type A domain-containing p... 101 2e-20 UniRef50_B8HNU4 von Willebrand factor type A n=4 Tax=Chroococcal... 101 2e-20 UniRef50_C8N8N5 von Willebrand factor type A domain protein n=2 ... 101 2e-20 UniRef50_C3ZZV3 Putative uncharacterized protein (Fragment) n=1 ... 100 3e-20 UniRef50_UPI00006CAF43 von Willebrand factor type A domain conta... 100 3e-20 UniRef50_C0Z5D5 Putative uncharacterized protein n=1 Tax=Breviba... 100 4e-20 UniRef50_UPI0001760CA2 PREDICTED: inter-alpha (globulin) inhibit... 100 4e-20 UniRef50_Q7G2L9 Os10g0464500 protein n=3 Tax=Magnoliophyta RepID... 100 4e-20 UniRef50_A5UTA6 von Willebrand factor, type A n=6 Tax=Chloroflex... 100 4e-20 UniRef50_Q8H924 Os10g0464900 protein n=3 Tax=Oryza sativa RepID=... 100 5e-20 UniRef50_A9V370 Predicted protein n=1 Tax=Monosiga brevicollis R... 100 5e-20 UniRef50_UPI0000F1FEC5 PREDICTED: similar to Clca1 protein n=2 T... 100 5e-20 UniRef50_A7I7X6 Putative uncharacterized protein n=1 Tax=Candida... 100 5e-20 UniRef50_A9V8A7 Predicted protein n=3 Tax=root RepID=A9V8A7_MONBE 100 6e-20 UniRef50_UPI0001C161B1 von Willebrand factor, type A Precursor n... 100 6e-20 UniRef50_Q562D1 LOC594926 protein (Fragment) n=18 Tax=Euteleosto... 100 6e-20 UniRef50_B4VXM6 Vault protein inter-alpha-trypsin n=1 Tax=Microc... 99 8e-20 UniRef50_UPI000194DA30 PREDICTED: collagen, type XX, alpha 1 n=1... 99 8e-20 UniRef50_A0C9G5 Chromosome undetermined scaffold_16, whole genom... 99 8e-20 UniRef50_B9GVZ4 Predicted protein n=4 Tax=rosids RepID=B9GVZ4_POPTR 99 8e-20 UniRef50_A0DZ93 Chromosome undetermined scaffold_7, whole genome... 99 8e-20 UniRef50_A7BVG3 von Willebrand factor type A domain protein n=1 ... 99 9e-20 UniRef50_UPI00006A1B4F Collagen alpha-3(VI) chain precursor. n=1... 99 9e-20 UniRef50_UPI0001C378BC von Willebrand factor, type A n=1 Tax=Rum... 99 9e-20 UniRef50_C3Y2U7 Putative uncharacterized protein n=1 Tax=Branchi... 99 9e-20 UniRef50_Q2W311 Putative uncharacterized protein n=1 Tax=Magneto... 99 9e-20 UniRef50_B8G546 von Willebrand factor type A n=6 Tax=Chloroflexi... 99 1e-19 UniRef50_B7AA98 von Willebrand factor type A n=3 Tax=Thermus Rep... 99 1e-19 UniRef50_A2E1S5 von Willebrand factor type A domain containing p... 99 1e-19 UniRef50_A6WMD3 Vault protein inter-alpha-trypsin domain protein... 99 1e-19 UniRef50_A3QDW1 Vault protein inter-alpha-trypsin domain protein... 99 1e-19 UniRef50_UPI0001AEB92D inter-alpha-trypsin inhibitor domain-cont... 99 1e-19 UniRef50_C5WZE3 Putative uncharacterized protein Sb01g019910 n=1... 99 1e-19 UniRef50_Q2B7C3 Putative uncharacterized protein n=1 Tax=Bacillu... 99 1e-19 UniRef50_Q23FU3 von Willebrand factor type A domain containing p... 99 1e-19 UniRef50_A8SU73 Putative uncharacterized protein n=1 Tax=Coproco... 99 2e-19 UniRef50_UPI00016E1D58 UPI00016E1D58 related cluster n=1 Tax=Tak... 98 2e-19 UniRef50_UPI00017B1702 UPI00017B1702 related cluster n=1 Tax=Tet... 98 2e-19 UniRef50_Q4T9U6 Chromosome 14 SCAF7491, whole genome shotgun seq... 98 2e-19 UniRef50_Q4S2X7 Chromosome 8 SCAF14759, whole genome shotgun seq... 98 2e-19 UniRef50_A0P2E4 Von Willebrand factor type A like domain n=1 Tax... 98 2e-19 UniRef50_Q90ZA0 Collagen type XX alpha 1 n=11 Tax=cellular organ... 98 2e-19 UniRef50_B6TZ81 Protein binding protein n=9 Tax=Magnoliophyta Re... 98 2e-19 UniRef50_UPI000058940A PREDICTED: similar to inter-alpha (globul... 98 2e-19 UniRef50_A9GVG2 Putative uncharacterized protein n=1 Tax=Sorangi... 98 2e-19 UniRef50_A2AR69 Novel protein similar to vertebrate inter-alpha ... 98 3e-19 UniRef50_Q1DFU7 von Willebrand factor type A domain protein n=2 ... 98 3e-19 UniRef50_Q2BJ22 Putative uncharacterized protein n=1 Tax=Neptuni... 97 3e-19 UniRef50_B8CPU4 Von Willebrand factor, type A n=1 Tax=Shewanella... 97 3e-19 UniRef50_Q67LZ3 Putative uncharacterized protein n=1 Tax=Symbiob... 97 3e-19 UniRef50_UPI0001C34E55 hypothetical protein ClM62_13922 n=1 Tax=... 97 3e-19 UniRef50_D1CG77 von Willebrand factor type A; type II secretion ... 97 3e-19 UniRef50_UPI00016E9735 UPI00016E9735 related cluster n=2 Tax=Tak... 97 3e-19 UniRef50_B8HSI1 von Willebrand factor type A n=8 Tax=Cyanobacter... 97 3e-19 UniRef50_B0T5X0 von Willebrand factor type A n=1 Tax=Caulobacter... 97 3e-19 UniRef50_B1KDL1 LPXTG-motif cell wall anchor domain protein n=1 ... 97 4e-19 UniRef50_A6GBY0 Putative uncharacterized protein n=1 Tax=Plesioc... 97 4e-19 UniRef50_B8F8Z5 von Willebrand factor type A n=1 Tax=Desulfatiba... 97 4e-19 UniRef50_Q2QZH3 Os11g0687100 protein n=79 Tax=Eukaryota RepID=Q2... 97 4e-19 UniRef50_Q4RP12 Chromosome 10 SCAF15009, whole genome shotgun se... 97 4e-19 UniRef50_UPI0001788F71 von Willebrand factor type A n=1 Tax=Geob... 97 4e-19 UniRef50_A4C730 Putative uncharacterized protein n=1 Tax=Pseudoa... 97 4e-19 UniRef50_B9GK57 Predicted protein (Fragment) n=2 Tax=Populus tri... 97 4e-19 UniRef50_Q4SQ43 Chromosome 19 SCAF14535, whole genome shotgun se... 97 4e-19 UniRef50_Q99715 Collagen alpha-1(XII) chain n=75 Tax=Euteleostom... 97 5e-19 UniRef50_UPI000180BDFB PREDICTED: similar to cartilage matrix pr... 97 5e-19 UniRef50_Q8PU63 Putative chloride channel n=1 Tax=Methanosarcina... 97 5e-19 UniRef50_B5JSY7 Inter-alpha-trypsin inhibitor domain protein n=1... 97 5e-19 UniRef50_Q083T9 Vault protein inter-alpha-trypsin domain protein... 97 6e-19 UniRef50_B9RR85 Inter-alpha-trypsin inhibitor heavy chain, putat... 97 6e-19 Sequences not found previously or not previously below threshold: UniRef50_Q32NR2 MGC130922 protein n=3 Tax=Tetrapoda RepID=Q32NR2... 103 6e-21 UniRef50_Q3A188 von Willebrand factor type A domain protein n=1 ... 101 2e-20 UniRef50_O95460 Matrilin-4 n=32 Tax=Amniota RepID=MATN4_HUMAN 101 2e-20 UniRef50_O00339 Matrilin-2 n=30 Tax=Euteleostomi RepID=MATN2_HUMAN 100 3e-20 UniRef50_Q5CZQ6 Matn4 protein (Fragment) n=12 Tax=Euteleostomi R... 100 5e-20 UniRef50_C3YUL3 Putative uncharacterized protein n=1 Tax=Branchi... 100 5e-20 UniRef50_P21941 Cartilage matrix protein n=35 Tax=Euteleostomi R... 99 9e-20 UniRef50_C1D2W8 Putative uncharacterized protein n=1 Tax=Deinoco... 99 1e-19 UniRef50_C3ZCS4 Putative uncharacterized protein n=1 Tax=Branchi... 98 2e-19 UniRef50_A4J6Q3 von Willebrand factor, type A n=1 Tax=Desulfotom... 98 2e-19 UniRef50_UPI00006A0494 Matrilin-4 precursor. n=5 Tax=Euteleostom... 97 3e-19 UniRef50_UPI000180D037 PREDICTED: similar to polydomain protein-... 97 3e-19 UniRef50_A6GC99 Putative uncharacterized protein n=1 Tax=Plesioc... 97 3e-19 UniRef50_Q8NFW1 Collagen alpha-1(XXII) chain n=23 Tax=Euteleosto... 97 3e-19 UniRef50_O15232 Matrilin-3 n=28 Tax=Euteleostomi RepID=MATN3_HUMAN 97 5e-19 >UniRef50_P76396 Uncharacterized protein yegL n=64 Tax=cellular organisms RepID=YEGL_ECOLI Length = 219 Score = 258 bits (659), Expect = 1e-67, Method: Composition-based stats. Identities = 219/219 (100%), Positives = 219/219 (100%) Query: 1 MSEQITFATSDFASNPEPRCPCILLLDVSGSMNGRPINELNAGLVTFRDELLADPLALKR 60 MSEQITFATSDFASNPEPRCPCILLLDVSGSMNGRPINELNAGLVTFRDELLADPLALKR Sbjct: 1 MSEQITFATSDFASNPEPRCPCILLLDVSGSMNGRPINELNAGLVTFRDELLADPLALKR 60 Query: 61 VELGIVTFGPVHVEQPFTSAANFFPPILFAQGDTPMGAAITKALDMVEERKREYRANGIS 120 VELGIVTFGPVHVEQPFTSAANFFPPILFAQGDTPMGAAITKALDMVEERKREYRANGIS Sbjct: 61 VELGIVTFGPVHVEQPFTSAANFFPPILFAQGDTPMGAAITKALDMVEERKREYRANGIS 120 Query: 121 YYRPWIFLITDGAPTDEWQAAANKVFRGEEDKRFAFFSIGVQGADMKTLAQISVRQPLPL 180 YYRPWIFLITDGAPTDEWQAAANKVFRGEEDKRFAFFSIGVQGADMKTLAQISVRQPLPL Sbjct: 121 YYRPWIFLITDGAPTDEWQAAANKVFRGEEDKRFAFFSIGVQGADMKTLAQISVRQPLPL 180 Query: 181 QGLQFRELFSWLSSSLRSVSRSTPGTEVVLEAPKGWTSV 219 QGLQFRELFSWLSSSLRSVSRSTPGTEVVLEAPKGWTSV Sbjct: 181 QGLQFRELFSWLSSSLRSVSRSTPGTEVVLEAPKGWTSV 219 >UniRef50_Q114A2 von Willebrand factor, type A n=3 Tax=Cyanobacteria RepID=Q114A2_TRIEI Length = 1204 Score = 224 bits (572), Expect = 1e-57, Method: Composition-based stats. Identities = 102/214 (47%), Positives = 133/214 (62%), Gaps = 1/214 (0%) Query: 7 FATSDFASNPEPRCPCILLLDVSGSMNGRPINELNAGLVTFRDELLADPLALKRVELGIV 66 +F NPE RCP ILLLD S SM+G I ELN G+ F+ + D LA RVE+ ++ Sbjct: 618 LPEPEFVENPENRCPIILLLDTSYSMSGEAITELNQGVKIFQASVKEDELASLRVEIAVI 677 Query: 67 TFGPV-HVEQPFTSAANFFPPILFAQGDTPMGAAITKALDMVEERKREYRANGISYYRPW 125 TF V Q F + F P L A G T MG AI KAL+++E+RK++Y+ + I YYRPW Sbjct: 678 TFNSEIEVVQDFVTVDKFIPKTLEASGVTHMGKAIEKALELLEKRKQDYKNSDIQYYRPW 737 Query: 126 IFLITDGAPTDEWQAAANKVFRGEEDKRFAFFSIGVQGADMKTLAQISVRQPLPLQGLQF 185 IFLITDG PTD WQ AA K+ E +++ FF++GV+ ADM+TL++ISV P L GL F Sbjct: 738 IFLITDGQPTDTWQDAAKKIEEAETNRKLLFFAVGVRDADMETLSEISVCPPKKLNGLDF 797 Query: 186 RELFSWLSSSLRSVSRSTPGTEVVLEAPKGWTSV 219 + LF WLS SL+ VS S G + L W + Sbjct: 798 QSLFKWLSFSLQQVSVSKIGEKNRLPPTNAWEEI 831 >UniRef50_Q3M2E0 von Willebrand factor, type A n=2 Tax=Cyanobacteria RepID=Q3M2E0_ANAVT Length = 218 Score = 222 bits (566), Expect = 6e-57, Method: Composition-based stats. Identities = 113/217 (52%), Positives = 142/217 (65%), Gaps = 3/217 (1%) Query: 5 ITFATSDFASNPEPRCPCILLLDVSGSMNGRPINELNAGLVTFRDELLADPLALKRVELG 64 + +F NPE RCP ILLLD SGSM+G+PI ELN GL TF+++++ D A VE+ Sbjct: 1 MPVGLPEFVENPENRCPVILLLDTSGSMSGQPIQELNRGLATFKEDVIKDSQASLSVEVA 60 Query: 65 IVTFGPVHVEQPFTSAANFFPPILFAQGDTPMGAAITKALDMVEERKREYRANGISYYRP 124 I+TFGPV + Q F + F PP L A+G TPMG AI ALD++E RK Y+ NGI YYRP Sbjct: 61 IITFGPVRLVQDFVNIDQFTPPQLEAEGVTPMGEAIEYALDLLETRKSAYKENGILYYRP 120 Query: 125 WIFLITDGAPTDEWQAAANKVFRGEEDKRFAFFSIGVQGADMKTLAQI--SVRQPLPLQG 182 WIFLITDGAPTD + AA +V E ++R FF++GVQGAD L QI + R P+ L G Sbjct: 121 WIFLITDGAPTDYYHLAAQRVKEAEANRRLCFFTVGVQGADFNKLRQIAPAERPPVILNG 180 Query: 183 LQFRELFSWLSSSLRSVSRSTPGTEVVLEAPKGWTSV 219 L FR LF WLS+S++ VS G V L P GW + Sbjct: 181 LDFRSLFVWLSTSMKRVSSGKIGEAVALP-PVGWGQI 216 >UniRef50_UPI00016C400A von Willebrand factor, type A n=1 Tax=Gemmata obscuriglobus UQM 2246 RepID=UPI00016C400A Length = 249 Score = 219 bits (557), Expect = 7e-56, Method: Composition-based stats. Identities = 130/249 (52%), Positives = 163/249 (65%), Gaps = 30/249 (12%) Query: 1 MSEQITFATSDFASNPEPRCPCILLLDVSGSMN----------GR--------------- 35 M+EQI F+ A+NPEPRCPC+LL+D SGSM GR Sbjct: 1 MAEQIPFSDVALATNPEPRCPCVLLIDTSGSMAEVVSGTGRDLGRTAQVDGKTYRVVSGG 60 Query: 36 --PINELNAGLVTFRDELLADPLALKRVELGIVTFGP-VHVEQPFTSAANFFPPILFAQG 92 I+ +N GL ++ ++ DPLA +RVE+ +VTFG V PF + + F PP+L A G Sbjct: 61 TTRIDLVNEGLRVYQADVTNDPLAAQRVEVSVVTFGDTVRTVTPFVTTSQFTPPVLTANG 120 Query: 93 DTPMGAAITKALDMVEERKREYRANGISYYRPWIFLITDGAPTDEWQAAANKVFRGEEDK 152 +TPMGAAI KA+D V ERKREYR NG+ +YRPWIFLITDG PTD W+AAA +V GEE K Sbjct: 121 ETPMGAAILKAIDAVTERKREYRQNGLHFYRPWIFLITDGEPTDAWEAAAARVREGEEKK 180 Query: 153 RFAFFSIGVQGADMKTLAQISVRQPLPLQGLQFRELFSWLSSSLRSVSRSTPG--TEVVL 210 +FAFF++GV+GA+M L QISVRQPL L+G F+E+F WLS S RSVS S PG +V L Sbjct: 181 QFAFFAVGVEGANMDRLKQISVRQPLHLKGYSFKEMFLWLSQSQRSVSHSNPGQEEQVKL 240 Query: 211 EAPKGWTSV 219 P GW S+ Sbjct: 241 APPAGWASL 249 >UniRef50_B9XNU9 von Willebrand factor type A n=1 Tax=bacterium Ellin514 RepID=B9XNU9_9BACT Length = 229 Score = 210 bits (536), Expect = 2e-53, Method: Composition-based stats. Identities = 112/227 (49%), Positives = 148/227 (65%), Gaps = 8/227 (3%) Query: 1 MSEQITFATSDFASNPEPRCPCILLLDVSGSMNGRPINELNAGLVTFRDELLADPLALKR 60 MS+Q+ F +F NPEPRCPC+L+LDVS SM G I+ LN G+ F +L LA KR Sbjct: 1 MSDQLPFIDVEFVDNPEPRCPCVLVLDVSSSMRGAAIDFLNLGVDLFAHDLTRSRLACKR 60 Query: 61 VELGIVTFGP-VHVEQPFTSAANFFPPILFAQGDTPMGAAITKALDMVEERKREYRANGI 119 VE I+TFG VH+ Q F S + F PP A G TPMG A+ +A +++E+RKR+YRA G+ Sbjct: 61 VETAIITFGDGVHIVQDFVSPSAFVPPRFEAGGKTPMGEAVVQACELLEKRKRKYRAAGV 120 Query: 120 SYYRPWIFLITDGAPTD----EWQAAANKVFRGEEDKRFAFFSIGVQGADMKTLAQI--S 173 SY+RPWIFLITDG PTD W+ A V GE DK+ FF + V A+ L ++ + Sbjct: 121 SYFRPWIFLITDGEPTDYETANWRQAVEIVRAGEVDKKLMFFGVAVSDANQGKLNELCPA 180 Query: 174 VRQPLPLQGLQFRELFSWLSSSLRSVSRSTPGTE-VVLEAPKGWTSV 219 R + L GL F+ LF+WLSSSLR+VS + PGT+ +VL + GW +V Sbjct: 181 SRPAIKLNGLDFQGLFTWLSSSLRTVSSANPGTQGIVLPSIAGWATV 227 >UniRef50_B8HUC4 von Willebrand factor type A n=7 Tax=Bacteria RepID=B8HUC4_CYAP4 Length = 254 Score = 210 bits (535), Expect = 3e-53, Method: Composition-based stats. Identities = 143/217 (65%), Positives = 162/217 (74%), Gaps = 3/217 (1%) Query: 6 TFATSDFASNPEPRCPCILLLDVSGSMNGRPINELNAGLVTFRDELLADPLALKRVELGI 65 F T DFA+NPEPRCP ILLLD SGSM G PI ELNAG+ FRDELLAD LA KRVE+ I Sbjct: 38 AFGTDDFANNPEPRCPVILLLDTSGSMRGTPIQELNAGVELFRDELLADALASKRVEVAI 97 Query: 66 VTFGPVHVEQPFTSAANFFPPILFAQGDTPMGAAITKALDMVEERKREYRANGISYYRPW 125 V FGPV V Q F +A F PP L A+ DTP+GAAI ALD+++ RK Y+ANGI+YYRPW Sbjct: 98 VGFGPVQVIQDFVTADYFNPPKLRAEADTPLGAAIETALDLLQSRKDTYKANGIAYYRPW 157 Query: 126 IFLITDGAPTDEWQAAANKVFRGEEDKRFAFFSIGVQGADMKTLAQISVRQPLPLQGLQF 185 +FLITDG PTD WQ AA +V GE K FAFFSIGV+GA + LAQIS R PL L+ L+F Sbjct: 158 VFLITDGGPTDHWQTAARRVKEGESKKSFAFFSIGVEGARIDILAQISTRTPLKLKELRF 217 Query: 186 RELFSWLSSSLRSVSRSTPGTEVVL---EAPKGWTSV 219 R+LF WLSSSL+SVSRSTPG EV L P GW SV Sbjct: 218 RDLFQWLSSSLKSVSRSTPGDEVPLLNPATPDGWASV 254 >UniRef50_B5W3I3 von Willebrand factor type A n=4 Tax=Cyanobacteria RepID=B5W3I3_SPIMA Length = 228 Score = 210 bits (534), Expect = 3e-53, Method: Composition-based stats. Identities = 113/215 (52%), Positives = 151/215 (70%), Gaps = 6/215 (2%) Query: 10 SDFASNPEPRCPCILLLDVSGSMNGRPINELNAGLVTFRDELLADPLALKRVELGIVTFG 69 +FA NPEPRCPC+LLLD S SM G P++ LNAGL+TFR+ L+ D LA KRVE+ I+TF Sbjct: 15 VEFAENPEPRCPCVLLLDTSASMQGEPLDGLNAGLMTFRENLIKDELAKKRVEIAIITFD 74 Query: 70 P-VHVEQPFTSAANFFPPILFAQGDTPMGAAITKALDMVEERKREYRANGISYYRPWIFL 128 V + Q F +A F PP+L AQG T MG AI +ALDM+ RK EYR NGI+YYRPW+F+ Sbjct: 75 NQVKIIQDFVTADRFEPPLLNAQGQTYMGTAIGEALDMIASRKAEYRNNGITYYRPWVFM 134 Query: 129 ITDGAPTDE----WQAAANKVFRGEEDKRFAFFSIGVQGADMKTLAQISVRQPLPLQGLQ 184 ITDG P E + A ++ E +K+ AFF++GV+GA+M+ L +++ R PL L+GL Sbjct: 135 ITDGEPQGESDRITEQAIKRIRDEEANKQVAFFAVGVEGANMERLGEMAQRTPLKLKGLD 194 Query: 185 FRELFSWLSSSLRSVSRSTPGTEVVLEAPKGWTSV 219 FRE+F WLS+S+++VS S +V L P GW +V Sbjct: 195 FREMFIWLSASMQTVSHSKVDEQVALPPP-GWGTV 228 >UniRef50_Q8YTA2 Alr2822 protein n=11 Tax=Bacteria RepID=Q8YTA2_ANASP Length = 224 Score = 209 bits (531), Expect = 8e-53, Method: Composition-based stats. Identities = 118/225 (52%), Positives = 153/225 (68%), Gaps = 7/225 (3%) Query: 1 MSEQITFATS-DFASNPEPRCPCILLLDVSGSMNGRPINELNAGLVTFRDELLADPLALK 59 M++ +T +FA NPEPRCPC+LLLD SGSM G I LN GL++ +DEL+ + +A + Sbjct: 1 MNDTLTLDEVVEFAENPEPRCPCVLLLDTSGSMQGAAIEALNQGLLSLKDELMKNSIAAR 60 Query: 60 RVELGIVTFGPV-HVEQPFTSAANFFPPILFAQGDTPMGAAITKALDMVEERKREYRANG 118 RVE+ IVTF +V Q F +A F PPIL AQG T MGA I KALDMV+ERK YRANG Sbjct: 61 RVEIAIVTFDSHINVIQDFVTADQFNPPILTAQGLTSMGAGIHKALDMVQERKSLYRANG 120 Query: 119 ISYYRPWIFLITDGAPTDE----WQAAANKVFRGEEDKRFAFFSIGVQGADMKTLAQISV 174 ++YYRPW+F+ITDG P E + AA ++ E +KR AFFS+GV+ A+M L QI+V Sbjct: 121 VAYYRPWVFMITDGEPQGELDHLVEQAALRLQGDEVNKRVAFFSVGVENANMTRLNQIAV 180 Query: 175 RQPLPLQGLQFRELFSWLSSSLRSVSRSTPGTEVVLEAPKGWTSV 219 R PL L+GL F E+F WLS+S+ +VS S +V L GW S+ Sbjct: 181 RTPLKLKGLNFIEMFVWLSASMSAVSHSQIDEQVALPPI-GWGSI 224 >UniRef50_C0AYK3 Putative uncharacterized protein n=1 Tax=Proteus penneri ATCC 35198 RepID=C0AYK3_9ENTR Length = 227 Score = 203 bits (516), Expect = 4e-51, Method: Composition-based stats. Identities = 92/225 (40%), Positives = 129/225 (57%), Gaps = 6/225 (2%) Query: 1 MSEQITFATSDFASNPEPRCPCILLLDVSGSMNGRPINELNAGLVTFRDELLADPLALKR 60 M E + N E R P IL+LD SGSM G+PI +LN GL EL D +A KR Sbjct: 1 MMEHLMIPDVALVDNSEQRTPLILVLDSSGSMYGQPIQQLNEGLKLLEQELKNDVIAAKR 60 Query: 61 VELGIVTFGPV---HVEQPFTSAANFFPPILFAQGDTPMGAAITKALDMVEERKREYRAN 117 V + ++ +G + + A +F P+L A G TPMG AIT AL+ +E K+ ++ Sbjct: 61 VRILVIEYGGYDQCTIHGDWKDAMDFTAPVLEANGTTPMGQAITLALEEIEAEKQRFKQA 120 Query: 118 GISYYRPWIFLITDGAPTDEWQAAANKVFRGEEDKRFAFFSIGVQGADMKTLAQISVRQP 177 G++Y RPW+FL++DG PTD+W+ AA + EE ++ A F I V GA + + S Sbjct: 121 GVAYTRPWLFLMSDGVPTDQWEQAAQLCRQAEESQKTAVFPIMVDGASAEVMGSFSRNGV 180 Query: 178 L---PLQGLQFRELFSWLSSSLRSVSRSTPGTEVVLEAPKGWTSV 219 L+GLQF+ELF WLS+S++ VS+STPG L + W SV Sbjct: 181 NGVKMLKGLQFKELFLWLSASMQVVSQSTPGGTAQLPSTDSWASV 225 >UniRef50_Q2SNJ1 Uncharacterized protein encoded in toxicity protection region of plasmid R478, contains von Willebrand factor (VWF) domain n=1 Tax=Hahella chejuensis KCTC 2396 RepID=Q2SNJ1_HAHCH Length = 223 Score = 195 bits (496), Expect = 9e-49, Method: Composition-based stats. Identities = 97/223 (43%), Positives = 130/223 (58%), Gaps = 6/223 (2%) Query: 1 MSEQITFATSDFASNPEPRCPCILLLDVSGSMNGRPINELNAGLVTFRDELLADPLALKR 60 MS+ + F N R PC+L+LD S SM G PI +LN GL L D R Sbjct: 1 MSDTL-IPDVVFNDNNSQRTPCVLVLDGSSSMFGEPIRQLNEGLKLLERALKEDASTAMR 59 Query: 61 VELGIVTFGP---VHVEQPFTSAANFFPPILFAQGDTPMGAAITKALDMVEERKREYRAN 117 V+L ++ G V + A +F P +FA G TP+G A+ ALD +E++K Y AN Sbjct: 60 VQLLVIRAGNHDQAEVLTDWVDAMDFNAPEVFANGTTPLGGAMNLALDKIEDQKAAYDAN 119 Query: 118 GISYYRPWIFLITDGAPTD-EWQAAANKVFRGEEDKRFAFFSIGVQGADMKTLAQISVRQ 176 GIS RPWI LI+DGAPTD W+A A++ E++++ F IGV+GA +TL Q S + Sbjct: 120 GISSTRPWIILISDGAPTDFNWEAVADRCRHAEQNRKVVIFPIGVEGATFETLNQFSNKG 179 Query: 177 PLPLQGLQFRELFSWLSSSLRSVSRSTPGTEVVLEAPKGWTSV 219 L+GLQFRELF WLS S+ +VS S+PG +V L A W+ V Sbjct: 180 AKKLKGLQFRELFVWLSRSMATVSVSSPGEKVQLPATD-WSEV 221 >UniRef50_Q2FNC6 von Willebrand factor, type A n=1 Tax=Methanospirillum hungatei JF-1 RepID=Q2FNC6_METHJ Length = 233 Score = 195 bits (495), Expect = 1e-48, Method: Composition-based stats. Identities = 103/231 (44%), Positives = 134/231 (58%), Gaps = 12/231 (5%) Query: 1 MSEQITFATSDFASNPEPRCPCILLLDVSGSMNGRPINELNAGLVTFRDELLADPLALKR 60 M + D P C +L+LD S SM+G I ELN GL DEL D LA+KR Sbjct: 1 MGLEKLEDIVDIPYPQHPHCATVLVLDTSASMSGNKIAELNEGLRILTDELKEDDLAVKR 60 Query: 61 VELGIVTFG-PVHVEQPFTSAANFFPPILFAQGDTPMGAAITKALDMVEERKREYRANGI 119 ++L ++TFG V + +PFT + F PP L A G TPMG AI +A+ +VEERK EYR G Sbjct: 61 IDLAVITFGKGVELVRPFTGISAFDPPELSAGGYTPMGQAILEAVRLVEERKAEYRTIGT 120 Query: 120 SYYRPWIFLITDGAPTD------EWQAAANKVFRGEEDKRFAFFSIGVQGADMKTLAQIS 173 YYRPWIFLITDG PTD W+ V GE D +F F+++GV A+M L +IS Sbjct: 121 DYYRPWIFLITDGQPTDMRKGDEIWEKVIEAVHGGERDHKFLFWALGVDQANMTVLREIS 180 Query: 174 --VRQPLPLQGLQFRELFSWLSSSLRSVSRSTPGTEVVLE---APKGWTSV 219 R PL L+ ++ E+F WLS SL +S S G ++ LE P+GW + Sbjct: 181 PPGRTPLMLKEAKWAEMFLWLSKSLSQISDSRIGEQISLENPVGPEGWGVI 231 >UniRef50_C9RRF6 von Willebrand factor type A n=3 Tax=Bacteria RepID=C9RRF6_FIBSS Length = 228 Score = 192 bits (489), Expect = 6e-48, Method: Composition-based stats. Identities = 96/228 (42%), Positives = 134/228 (58%), Gaps = 9/228 (3%) Query: 1 MSEQITFATSDFASNPEPRCPCILLLDVSGSMNGRPINELNAGLVTFRDELLADPLALKR 60 M+ + D +NP R P L+LD SGSM G+PI+ELN G+ F D + +D AL Sbjct: 1 MTNEKLLNIEDLENNPSTRVPVCLVLDTSGSMEGQPISELNEGINCFYDAVRSDETALYA 60 Query: 61 VELGIVTFGPVHVE-QPFTSAANF-FPPILFAQGDTPMGAAITKALDMVEERKREYRANG 118 E+ +VTFG V F++ + P FA G TPMG A+ ALD++E+RK EY+A+G Sbjct: 61 AEIAVVTFGGSAVLKTDFSTLEHQPDSPNFFANGGTPMGEAMNMALDLLEKRKGEYKASG 120 Query: 119 ISYYRPWIFLITDGAPTDE---WQAAANKVFRGEEDKRFAFFSIGVQ-GADMKTLAQIS- 173 + YY+PWI L+TDG P + + A + ++++ F IG+ ADM LA S Sbjct: 121 VDYYQPWIVLMTDGKPNGDSSEYARAVQRTCEMIKNRKLTIFPIGIGEDADMNALAAFSP 180 Query: 174 VRQPLPLQGLQFRELFSWLSSSLRSVSRSTPGTEVVLEA--PKGWTSV 219 R PL LQGL FRE F+WLS S+ VS+STPG ++ L+ KGW + Sbjct: 181 KRSPLKLQGLNFREFFAWLSKSVSKVSQSTPGDKIQLDTDGIKGWAEL 228 >UniRef50_Q87W17 von Willebrand factor type A domain protein n=2 Tax=Proteobacteria RepID=Q87W17_PSESM Length = 224 Score = 192 bits (487), Expect = 1e-47, Method: Composition-based stats. Identities = 88/224 (39%), Positives = 122/224 (54%), Gaps = 5/224 (2%) Query: 1 MSEQITFATSDFASNPEPRCPCILLLDVSGSMNGRPINELNAGLVTFRDELLADPLALKR 60 M E+ + D NP R P L+LDVSGSM G PI EL AG+ F + D +A Sbjct: 1 MQEEYILSQEDLVDNPTARVPICLVLDVSGSMAGEPIRELQAGVNMFYQAIREDEVAQYA 60 Query: 61 VELGIVTFGP-VHVEQPFTSAANFFPPILFAQGDTPMGAAITKALDMVEERKREYRANGI 119 E+ IVTFG F + P L A+G T MG + ALD++E RK +Y+ G+ Sbjct: 61 AEISIVTFGSEAKRTVDFMAIERQDVPALIAEGTTSMGQGVNLALDLLEVRKGDYQRAGV 120 Query: 120 SYYRPWIFLITDGAPTDEWQAAANKVFRGEEDKRFAFFSIGVQ-GADMKTLAQIS-VRQP 177 YY+PW+ ++TDG PTD+ A+ ++ E K+ F I + A++ L +S R P Sbjct: 121 DYYQPWMVVMTDGEPTDDITRASERIREMCESKKLTVFPIAIGTAANLDILGMLSPGRPP 180 Query: 178 LPLQGLQFRELFSWLSSSLRSVSRSTPGTEVVLE--APKGWTSV 219 L L+GL F+E F WLS S+ VS+STPG V+L+ W V Sbjct: 181 LRLKGLNFKEFFLWLSRSVSRVSQSTPGETVILDKAGIDAWGQV 224 >UniRef50_C1TQB2 Uncharacterized protein n=1 Tax=Dethiosulfovibrio peptidovorans DSM 11002 RepID=C1TQB2_9BACT Length = 225 Score = 185 bits (469), Expect = 1e-45, Method: Composition-based stats. Identities = 89/223 (39%), Positives = 122/223 (54%), Gaps = 5/223 (2%) Query: 2 SEQITFATSDFASNPEPRCPCILLLDVSGSMNGRPINELNAGLVTFRDELLADPLALKRV 61 S + ++ NP PR P L+LDVSGSM G PI ELN G+ F L D +A Sbjct: 3 SLNMILDQNEMVENPTPRVPVSLVLDVSGSMLGAPIEELNRGVELFFKSLKDDDVARYSA 62 Query: 62 ELGIVTFGP-VHVEQPFTSAANFFPPILFAQGDTPMGAAITKALDMVEERKREYRANGIS 120 E+ +++F V E F P L A G T MG A++ AL+ +E+RK YR G+ Sbjct: 63 EVSVISFSNEVTQEVDFGPLEKCDIPELKAIGKTRMGGAVSLALESLEKRKELYRTLGVD 122 Query: 121 YYRPWIFLITDGAPTDEWQAAANKVFRGEEDKRFAFFSIGVQG-ADMKTLAQIS-VRQPL 178 YY+PW+ ++TDG P D+WQ AA K + + F I + A TL + S R PL Sbjct: 123 YYQPWMVIMTDGKPNDDWQLAAAKTSALVDKGKLTVFPIAIGDNACTDTLKEFSPARNPL 182 Query: 179 PLQGLQFRELFSWLSSSLRSVSRSTPGTEVVLE--APKGWTSV 219 L+ L F+E F WLSSS+ VS+S PG +V L+ +GW S+ Sbjct: 183 RLKDLNFQEFFRWLSSSVSKVSQSIPGEKVELDLKGLEGWASL 225 >UniRef50_C9RJ63 von Willebrand factor type A n=1 Tax=Fibrobacter succinogenes subsp. succinogenes S85 RepID=C9RJ63_FIBSS Length = 227 Score = 185 bits (469), Expect = 1e-45, Method: Composition-based stats. Identities = 96/227 (42%), Positives = 127/227 (55%), Gaps = 9/227 (3%) Query: 2 SEQITFATSDFASNPEPRCPCILLLDVSGSMNGRPINELNAGLVTFRDELLADPLALKRV 61 + + D +NP R P L+LD SGSM G INELN G+ F D + +D AL Sbjct: 1 MNRNLLSIEDLENNPSSRVPVCLVLDTSGSMEGDSINELNEGVRLFYDAVRSDETALYAA 60 Query: 62 ELGIVTFGPVHVEQPFTSAANFF--PPILFAQGDTPMGAAITKALDMVEERKREYRANGI 119 E+ +VTFG Q S P +A G TPMG A+ ALDM+E+RK EY+A+G+ Sbjct: 61 EISVVTFGGHASCQAGFSTLEHQPDAPQFYADGGTPMGEAMNMALDMLEKRKSEYKASGV 120 Query: 120 SYYRPWIFLITDGAPTDEWQAAANKVFR---GEEDKRFAFFSIGVQ-GADMKTLAQIS-V 174 YY+PWI L+TDG P + + R D++ F IG+ ADM LA+ S Sbjct: 121 DYYQPWIVLMTDGMPNGSQAELSRSIQRTCDMINDRKLTIFPIGIGEDADMDVLARFSPK 180 Query: 175 RQPLPLQGLQFRELFSWLSSSLRSVSRSTPGTEVVL--EAPKGWTSV 219 R PL LQGL F+E F+WLS S+ VS+STPG +V L + KGW + Sbjct: 181 RSPLKLQGLNFKEFFAWLSKSVSKVSQSTPGDKVQLDVDGIKGWAEL 227 >UniRef50_A9AV55 von Willebrand factor type A n=2 Tax=Bacteria RepID=A9AV55_HERA2 Length = 222 Score = 182 bits (461), Expect = 9e-45, Method: Composition-based stats. Identities = 89/210 (42%), Positives = 119/210 (56%), Gaps = 5/210 (2%) Query: 15 NPEPRCPCILLLDVSGSMNGRPINELNAGLVTFRDELLADPLALKRVELGIVTFGP-VHV 73 N E +C CIL++D SGSM GRPI+ELN GL F ++ +R+E+ +V F Sbjct: 10 NYEQKCLCILVVDTSGSMQGRPIDELNQGLQVFHQDISNSFSTAQRLEICLVEFNSQADC 69 Query: 74 EQPFTSAANFFPPILFAQGDTPMGAAITKALDMVEERKREYRANGISYYRPWIFLITDGA 133 + F PIL G T + + A+ V+ERK YR+ G YYRPWI L+TDG Sbjct: 70 IVEPSLVDQFHMPILAVAGTTKLVDGVRLAIHKVQERKSWYRSTGQPYYRPWIILMTDGE 129 Query: 134 PTDEWQAA--ANKVFRGEEDKRFAFFSIGVQGADMKTLAQIS--VRQPLPLQGLQFRELF 189 P + A A ++ G +K+F FF IGVQGADM+ L QIS R P+ LQGL+F F Sbjct: 130 PDSDQDVAGLAREIQHGVNNKQFVFFPIGVQGADMRMLQQISTPDRPPMLLQGLRFEAFF 189 Query: 190 SWLSSSLRSVSRSTPGTEVVLEAPKGWTSV 219 WLS+SL V+ ST G + L + GW + Sbjct: 190 DWLSASLSMVASSTDGQVIQLPSTSGWGIL 219 >UniRef50_C6Z299 Tellurium resistance protein n=1 Tax=Bacteroides sp. 4_3_47FAA RepID=C6Z299_9BACE Length = 348 Score = 179 bits (454), Expect = 7e-44, Method: Composition-based stats. Identities = 57/207 (27%), Positives = 90/207 (43%), Gaps = 8/207 (3%) Query: 18 PRCPCILLLDVSGSMNGRPINELNAGLVTFRDELLADPLALKRVELGIVTF-GPVHVEQP 76 R P L+DVS SM G PI ++ G+ EL DP AL+ + ++ F G P Sbjct: 2 RRLPVYFLVDVSESMVGAPIQQVQDGMRMIVQELRTDPYALETAYISVIAFAGKAKCVSP 61 Query: 77 FTSAANFFPPILFAQGDTPMGAAITKALDMVEERKREYRANGISYYRPWIFLITDGAPTD 136 T F+PP G T +G A+ +D +++ ++P +FL TDG PTD Sbjct: 62 LTELYKFYPPTFPIGGGTSLGNALEFLMDDMDKTLVRTTTEQKGDWKPIVFLFTDGNPTD 121 Query: 137 EWQAAANKVFRGEEDK-RFAFFSIGVQGADMKTLAQISVR--QPLPLQGLQFRELFSWLS 193 A + K SIG + + L QIS + + F+ F W++ Sbjct: 122 NPSNAFTRWNNKYRGKANIVAISIG-DNVNTQLLGQISDNVLRLNKTDEISFKSFFKWVT 180 Query: 194 SSLRSVSRSTP---GTEVVLEAPKGWT 217 +S+++ S S +V L + G + Sbjct: 181 ASIKATSVSVSDMGDDDVKLASTSGIS 207 >UniRef50_C8W3F9 von Willebrand factor type A n=1 Tax=Desulfotomaculum acetoxidans DSM 771 RepID=C8W3F9_DESAS Length = 219 Score = 175 bits (444), Expect = 8e-43, Method: Composition-based stats. Identities = 59/206 (28%), Positives = 93/206 (45%), Gaps = 8/206 (3%) Query: 17 EPRCPCILLLDVSGSMNGRPINELNAGLVTFRDELLADPLALKRVELGIVTFGP-VHVEQ 75 R P LLLD SGSM G PI + G+ EL +P A++ + ++TFG + Sbjct: 9 SRRLPVYLLLDRSGSMFGEPIEAVKQGVKYMISELKKEPQAIETAYISVITFGSDARQDV 68 Query: 76 PFTSAANFFPPILFAQGDTPMGAAITKALDMVEERKREYRANGISYYRPWIFLITDGAPT 135 T A F P + A G T +GAA+ + + R+ Y+P +F++TDG PT Sbjct: 69 QLTELAAFKEPQIEANGTTSLGAALHILNNCFDNEVRKSTPTQKGDYKPLVFIMTDGEPT 128 Query: 136 DEWQAAANKVFRGEEDKRFAFFSIGVQ-GADMKTLAQISVRQPLP--LQGLQFRELFSWL 192 D+W+ AA ++ + ++G + TL +I+ L Q F++ F W+ Sbjct: 129 DDWENAAREIKQKSGKV-ANIVAVGCGPDVNTDTLKKITDIVLLMSSYQPEDFKQFFRWV 187 Query: 193 SSSLRSVS---RSTPGTEVVLEAPKG 215 S S++ S L AP Sbjct: 188 SQSVKQASIKFTKDSDQPTNLPAPPP 213 >UniRef50_A3PUP3 von Willebrand factor, type A n=1 Tax=Mycobacterium sp. JLS RepID=A3PUP3_MYCSJ Length = 233 Score = 174 bits (442), Expect = 2e-42, Method: Composition-based stats. Identities = 91/221 (41%), Positives = 119/221 (53%), Gaps = 15/221 (6%) Query: 13 ASNPEPRCPCILLLDVSGSMNGRPINELNAGLVTFRDELLADPLALKRVELGIVTFGPVH 72 +NP+PR C++L DVSGSM G PI L G F L + LA KRVE+ +VTFG V Sbjct: 12 DANPDPRVACVVLADVSGSMQGEPIAALERGFAAFTRYLQNEVLASKRVEVAVVTFGTVA 71 Query: 73 VE-QPFTSAANFFPPILFAQGDTPMGAAITKALDMVEERKREYRANGISYYRPWIFLITD 131 P A P A G T M A I ALD++E+RK Y+A G+ YYRPWI L+TD Sbjct: 72 TVLVPMQEARTLQPVAFTASGTTNMAAGIHLALDILEDRKHAYKAAGLQYYRPWILLLTD 131 Query: 132 GAPT-DEWQAAANKVFRGEEDKRFAFFSIGVQ-GADMKTLAQIS-VRQPLPLQGLQFREL 188 G P D + A ++ E + F++G D + L ++S R P PL+GL++ EL Sbjct: 132 GKPNLDGFDEAVARLNAVESARGVTVFAVGAGPRVDYQQLGRLSLQRSPAPLEGLKYEEL 191 Query: 189 FSWLSSSLRSVSRSTP-----------GTEVVLEAPKGWTS 218 F WLS+SL +VS ST V L + GWTS Sbjct: 192 FEWLSASLSNVSNSTEFARDDQTHEAMNGRVPLPSAAGWTS 232 >UniRef50_D1TTW6 von Willebrand factor type A domain protein n=10 Tax=Enterobacteriaceae RepID=D1TTW6_YERPE Length = 327 Score = 174 bits (441), Expect = 2e-42, Method: Composition-based stats. Identities = 52/197 (26%), Positives = 91/197 (46%), Gaps = 5/197 (2%) Query: 18 PRCPCILLLDVSGSMNGRPINELNAGLVTFRDELLADPLALKRVELGIVTF-GPVHVEQP 76 R P +LD S SM G + ++N GL ++L DP AL+ + ++ F G P Sbjct: 2 RRLPIFFVLDCSESMIGENLKKMNDGLQMIINDLKKDPHALETAWISVIAFAGVAKTIVP 61 Query: 77 FTSAANFFPPILFAQGDTPMGAAITKALDMVEERKREYRANGISYYRPWIFLITDGAPTD 136 +F+PP L G T +GAA+ + ++ + R+ ++P ++L+TDG PTD Sbjct: 62 LVEVVSFYPPRLPIGGGTSLGAALQELTRQIDTQVRKTTEERKGDWKPVVYLLTDGRPTD 121 Query: 137 EWQAAANKVFRGEEDKRFAFFSIGVQ-GADMKTLAQISVRQ--PLPLQGLQFRELFSWLS 193 + A + ++ +IG+ AD+ L Q++ Q F + W++ Sbjct: 122 DTTAEITRWKT-HYARKVNLIAIGLGPSADLNILRQLTENVLLFNDTQEGDFTQFIKWIT 180 Query: 194 SSLRSVSRSTPGTEVVL 210 +S+ + SRS L Sbjct: 181 ASVSAHSRSVGEESPPL 197 >UniRef50_D1PS09 von Willebrand factor type A domain protein n=1 Tax=Subdoligranulum variabile DSM 15176 RepID=D1PS09_9FIRM Length = 246 Score = 173 bits (438), Expect = 4e-42, Method: Composition-based stats. Identities = 81/244 (33%), Positives = 114/244 (46%), Gaps = 35/244 (14%) Query: 11 DFASNPEPRCPCILLLDVSGSMNG--------------------------RPINELNAGL 44 D NP PR P L LD SGSM ++EL G+ Sbjct: 3 DLVENPTPRVPICLCLDTSGSMGAVQGDCVDTGKTLFEDGRQWNLVTGGTSRLDELQKGI 62 Query: 45 VTFRDELLADPLALKRVELGIVTFGP-VHVEQPFTSAANF-FPPILFAQGDTPMGAAITK 102 F + + D +A E+ IVTF F + P L A GDT MG + Sbjct: 63 KLFYNSVREDEVARYAAEICIVTFDSEAKCRMDFANLDRQSDLPELTATGDTAMGEGVNL 122 Query: 103 ALDMVEERKREYRANGISYYRPWIFLITDGAPTDE---WQAAANKVFRGEEDKRFAFFSI 159 ALD++E RKREY+ G+ Y++PW+ L+TDG P ++ A + E K+ F I Sbjct: 123 ALDLLESRKREYQDKGVDYFQPWLVLMTDGVPNGNEGEFERAVQRCRDMEAQKKLTVFPI 182 Query: 160 GVQGA-DMKTLAQI-SVRQPLPLQGLQFRELFSWLSSSLRSVSRSTPGTEVVLE--APKG 215 + D LA+ + R PL LQGL+FRE F+WLS S+ S+S PG + L+ + +G Sbjct: 183 AIGDEGDQTALAKFSAKRPPLKLQGLKFREFFAWLSQSVAKTSQSMPGETIKLDLNSIQG 242 Query: 216 WTSV 219 W + Sbjct: 243 WAEL 246 >UniRef50_UPI000185CA78 von Willebrand factor type A n=1 Tax=Capnocytophaga sputigena ATCC 33612 RepID=UPI000185CA78 Length = 347 Score = 171 bits (433), Expect = 2e-41, Method: Composition-based stats. Identities = 54/204 (26%), Positives = 91/204 (44%), Gaps = 8/204 (3%) Query: 18 PRCPCILLLDVSGSMNGRPINELNAGLVTFRDELLADPLALKRVELGIVTF-GPVHVEQP 76 R P LLDVS SM G PI + G+ T EL ADP AL+ V L I+ F G V P Sbjct: 2 RRLPIYFLLDVSESMVGDPIEHVQDGMATIIKELKADPFALETVWLSIIGFAGKSKVITP 61 Query: 77 FTSAANFFPPILFAQGDTPMGAAITKALDMVEERKREYRANGISYYRPWIFLITDGAPTD 136 F+PP + G T + + + + ++ ++ + ++P +FL TDG PTD Sbjct: 62 LQDIITFYPPKIPIGGGTSLASGLNELMNAIDREVVKTTLERKGDWKPLVFLFTDGIPTD 121 Query: 137 EWQAAANKVFRGEEDKRFAFFSIGVQ-GADMKTLAQISVR--QPLPLQGLQFRELFSWLS 193 + A + ++ +I + + L Q++ + Q ++E F W++ Sbjct: 122 DPAQAIERW-NAHYRRKVNLVAISLGENTNYNLLGQLTDQVLQFNNTNAAAYKEFFKWIT 180 Query: 194 SSLRSVSRSTPG---TEVVLEAPK 214 +S+++ S + L P Sbjct: 181 ASIKTTSEQVNNTNTDVIKLAKPD 204 >UniRef50_C8PVC3 von Willebrand factor type A domain protein n=1 Tax=Enhydrobacter aerosaccus SK60 RepID=C8PVC3_9GAMM Length = 260 Score = 170 bits (430), Expect = 4e-41, Method: Composition-based stats. Identities = 81/230 (35%), Positives = 123/230 (53%), Gaps = 17/230 (7%) Query: 7 FATSDFASNPEPRCPCILLLDVSGSMN-------GRPINELNAGLVTFRDELLADPLALK 59 F D+ +N R C+L+LD+SGSM R I+ LN G+ F +L+ D A Sbjct: 29 FREIDYGNNVAQRTLCVLVLDLSGSMAIRSGNGDKRRIDMLNEGIEAFYHDLMKDETARN 88 Query: 60 RVELGIVTFG----PVHVEQPFTSAANFFPPILFAQGDTPMGAAITKALDMVEERKREYR 115 RV L IV G + +T A +FFP G TP+G + AL+++E+ + R Sbjct: 89 RVRLAIVIVGGVNDTAELMMDWTDAIDFFPIKFRENGMTPLGQGMLLALNLIEQERINLR 148 Query: 116 ANGISYYRPWIFLITDGAPTDE---WQAAANKVFRGEEDKRFAFFSIGVQGADMK--TLA 170 NGI+Y RPW+ +TDG PTD WQAA N+ + E++ + + I + + L Sbjct: 149 DNGINYTRPWVIAMTDGLPTDSQDVWQAAINQCHQAEQNNQCIIYPIAIDAGVQEVKMLK 208 Query: 171 QIS-VRQPLPLQGLQFRELFSWLSSSLRSVSRSTPGTEVVLEAPKGWTSV 219 Q+S + P+ L ++F E F WLS+SL++VS+S PG V L + W ++ Sbjct: 209 QLSILTPPVHLNSVKFVEFFVWLSASLKTVSQSAPGETVQLGSISPWATI 258 >UniRef50_A1VWQ4 von Willebrand factor, type A n=20 Tax=Proteobacteria RepID=A1VWQ4_POLNA Length = 350 Score = 166 bits (420), Expect = 5e-40, Method: Composition-based stats. Identities = 49/219 (22%), Positives = 88/219 (40%), Gaps = 19/219 (8%) Query: 18 PRCPCILLLDVSGSMNGRPINELNAGLVTFRDELLADPLALKRVELGIVTF-GPVHVEQP 76 R P +LD S SM G + ++ + L DP AL+ V ++ F G P Sbjct: 2 RRLPVFFVLDCSESMVGANLKKMEGAVAAIVKSLRTDPQALETVFFSVIAFAGVARTIAP 61 Query: 77 FTSAANFFPPILFAQGDTPMGAAITKALDMVEERKREYRANGISYYRPWIFLITDGAPTD 136 +F+PP L G T +G+A+ + ++ + A +RP I+L+TDG PTD Sbjct: 62 LVEIVSFYPPKLPLGGGTNLGSALDALMGEIDRSVIKTTAERKGDWRPIIYLVTDGRPTD 121 Query: 137 EWQAAANKVFRGEEDKRFAFFSIGVQ-GADMKTLAQISVRQPLPLQ--GLQFRELFSWLS 193 A + K+ +IG+ D L +++ F++ +W++ Sbjct: 122 NPSRAIERW-NSHYAKKATLIAIGLGRSVDFTALRRLTENVISFEDIKESDFKKFINWVT 180 Query: 194 SSLRSVSRSTPGT--------------EVVLEAPKGWTS 218 +S+ S+S ++++E P Sbjct: 181 ASVVVQSKSVGDGTDFQGLRILDKSVMKIIMEPPSTIAD 219 >UniRef50_Q19NM5 TerY1 n=54 Tax=root RepID=Q19NM5_ECOK1 Length = 239 Score = 163 bits (413), Expect = 4e-39, Method: Composition-based stats. Identities = 57/201 (28%), Positives = 89/201 (44%), Gaps = 10/201 (4%) Query: 18 PRCPCILLLDVSGSMNGRPINELNAGLVTFRDELLADPLALKRVELGIVTF-GPVHVEQP 76 R P LLLD SGSM+G PI + G+ T L DP AL+ + ++TF P Sbjct: 29 RRLPVYLLLDTSGSMHGEPIEAVKNGVQTLLTTLKQDPYALETAYVSVITFDSSARQAVP 88 Query: 77 FTSAANFFPPILFAQGDTPMGAAITKALDMVEERKREYRANGISYYRPWIFLITDGAPTD 136 T +F P L A G T +G A++ + + ++ A+ +RP +FL+TDG+P D Sbjct: 89 LTDLLSFQMPALTASGTTSLGEALSLTASSIAKEVQKTTADTKGDWRPLVFLMTDGSPND 148 Query: 137 EWQAAANKVFRGEEDKRFAFFSIGVQ-GADMKTLAQISVR--QPLPLQGLQFRELFSWLS 193 +W+ N + AD L +I+ Q + F W+S Sbjct: 149 DWRKGLNDFKAARTG---VVVACAAGHDADTSVLKEITEIVVQLDTADSSTIKAFFKWVS 205 Query: 194 SSLRSVSR---STPGTEVVLE 211 +S+ S+ S+ + LE Sbjct: 206 ASISVGSQKVESSKKEVIGLE 226 >UniRef50_A9BLP5 von Willebrand factor type A n=3 Tax=Burkholderiales RepID=A9BLP5_DELAS Length = 244 Score = 163 bits (412), Expect = 4e-39, Method: Composition-based stats. Identities = 58/218 (26%), Positives = 91/218 (41%), Gaps = 10/218 (4%) Query: 2 SEQITFATSDFASNPEPRCPCILLLDVSGSMNGRPINELNAGLVTFRDELLADPLALKRV 61 I F S F + P +LLLDVSGSM+G I +N + D + Sbjct: 1 MSNIPFDPSKFTAPKAKPLPVVLLLDVSGSMSGEKIRNVNDAVRDMLDTFSDTENGETEI 60 Query: 62 ELGIVTFGP-VHVEQPFTSAANFFPPILFAQGDTPMGAAITKALDMVEERKREYRANGIS 120 + I+TFG V + QP SA++ L A G TP+G A+ A M+E++ Sbjct: 61 HVAIITFGSQVALHQPLASASDIHWQDLSAGGMTPLGTALQMAKAMIEDK----DVIPSR 116 Query: 121 YYRPWIFLITDGAPTDEWQAAANKVFRGEEDKRFAFFSIGVQ-GADMKTLAQI----SVR 175 YRP + L++DG P D W+ N + ++ + AD L + S R Sbjct: 117 AYRPTVVLVSDGGPNDAWEKPLNAFISDGRSAKCDRLAMAIGADADEAVLGKFIEGTSNR 176 Query: 176 QPLPLQGLQFRELFSWLSSSLRSVSRSTPGTEVVLEAP 213 Q R+ F +++ S+ ++S V + Sbjct: 177 LFYAENAKQLRDFFKFVTMSVTIRTKSQTPNNVPEAST 214 >UniRef50_Q5NWS3 Tellurium resistance protein n=2 Tax=Proteobacteria RepID=Q5NWS3_AZOSE Length = 349 Score = 162 bits (410), Expect = 7e-39, Method: Composition-based stats. Identities = 56/191 (29%), Positives = 85/191 (44%), Gaps = 5/191 (2%) Query: 18 PRCPCILLLDVSGSMNGRPINELNAGLVTFRDELLADPLALKRVELGIVTF-GPVHVEQP 76 R P L+DVS SM G + +L GL L ADP AL+ V + ++ F G P Sbjct: 2 RRLPIFFLVDVSESMAGDNLRQLQEGLERLVRSLRADPYALETVFISVIAFAGKPKTLTP 61 Query: 77 FTSAANFFPPILFAQGDTPMGAAITKALDMVEERKREYRANGISYYRPWIFLITDGAPTD 136 F+ P L T +G+A+ +D +E + +RP ++L+TDG PTD Sbjct: 62 LVELYQFYAPRLPLGSGTSLGSAMAHLMDEMERTVQRSTPEKKGDWRPVVYLLTDGKPTD 121 Query: 137 EWQAAANKVFRGEEDKRFAFFSIGVQG-ADMKTLAQISVR--QPLPLQGLQFRELFSWLS 193 + + A + R E+ R +IGV A + L + + F+ W+S Sbjct: 122 DIEPAIKRWKRDFEE-RSNLVAIGVGKHASLSALQRFTENVLSLDATTEDDFKRFIDWIS 180 Query: 194 SSLRSVSRSTP 204 S+ S SRS Sbjct: 181 QSVASQSRSVS 191 >UniRef50_D0W6S8 Glycosyl transferase, group 2 family n=1 Tax=Neisseria lactamica ATCC 23970 RepID=D0W6S8_NEILA Length = 191 Score = 162 bits (409), Expect = 9e-39, Method: Composition-based stats. Identities = 79/191 (41%), Positives = 111/191 (58%), Gaps = 3/191 (1%) Query: 32 MNGRPINELNAGLVTFRDELLADPLALKRVELGIVT-FGPVHVEQPFTSAAN-FFPPILF 89 M G PI +LN G+ F L D +A VE+GI+ G V PFT+A + Sbjct: 1 MYGEPIEQLNQGVQQFIQALQEDEIASYSVEVGILAAGGHVEEIIPFTTAEQLDYTSTFT 60 Query: 90 AQGDTPMGAAITKALDMVEERKREYRANGISYYRPWIFLITDGAPTDEWQAAANKVFRGE 149 AQG TP+G+A+ + L M+E+RKREY+ NG++YY+PW+ +I+DG+PTD WQ AA + Sbjct: 61 AQGSTPLGSAVEQGLKMLEDRKREYQKNGVAYYQPWLVVISDGSPTDSWQNAAQETRTLA 120 Query: 150 EDKRFAFFSIGVQGADMKTLAQISVRQPLPLQGLQFRELFSWLSSSLRSVSRSTP-GTEV 208 E+++ +GV ADM L Q S R L L GL+F + F WLS+S+ VS S +V Sbjct: 121 ENRKLVSLMVGVNDADMDKLGQFSNRPALKLDGLRFGDFFQWLSASMSRVSASNSTAAQV 180 Query: 209 VLEAPKGWTSV 219 L W S+ Sbjct: 181 SLPPIDTWASI 191 >UniRef50_C0DBA5 Putative uncharacterized protein n=1 Tax=Clostridium asparagiforme DSM 15981 RepID=C0DBA5_9CLOT Length = 231 Score = 155 bits (393), Expect = 7e-37, Method: Composition-based stats. Identities = 84/207 (40%), Positives = 111/207 (53%), Gaps = 8/207 (3%) Query: 17 EPRCPCILLLDVSGSMNGRPINELNAGLVTFRDELLADPLALKRVELGIVTFGP-VHVEQ 75 E C+LL+D SGSM G INELN GL+ F + L D A ++ +++F V Sbjct: 20 ERHIACVLLVDTSGSMAGASINELNQGLLEFGNALDQDEHARGVADVCVISFNSNVETVV 79 Query: 76 PFTSAANFFPPILFAQGDTPMGAAITKALDMVEERKREYRANGISYYRPWIFLITDGAPT 135 PF AAN+ P L A G T M A+ LD +EERK+ YR G SYYRPW+FL+TDG PT Sbjct: 80 PFCPAANYSAPTLSAGGLTSMNEAVIAGLDAIEERKQLYRQLGCSYYRPWMFLLTDGEPT 139 Query: 136 DEW--QAAANKVFRGEEDKRFAFFSIGVQ-GADMKTLAQIS---VRQPLPLQGLQFRELF 189 D+ A N++ + DK+ FF +G+ GA+ L + L QF+E F Sbjct: 140 DQNMEGEAKNRLQQALNDKKVNFFPMGIGSGANYAHLKSYTKGGNGAVLKASASQFKEAF 199 Query: 190 SWLSSSLRSVSRSTPG-TEVVLEAPKG 215 WLSSS+ +S S P V LE Sbjct: 200 VWLSSSMSVISNSDPSLGNVTLEPTPM 226 >UniRef50_B7AFM8 Putative uncharacterized protein n=1 Tax=Bacteroides eggerthii DSM 20697 RepID=B7AFM8_9BACE Length = 333 Score = 154 bits (390), Expect = 2e-36, Method: Composition-based stats. Identities = 53/193 (27%), Positives = 86/193 (44%), Gaps = 8/193 (4%) Query: 32 MNGRPINELNAGLVTFRDELLADPLALKRVELGIVTF-GPVHVEQPFTSAANFFPPILFA 90 M G PI ++ G+ EL DP AL+ V + ++ F G V P T F+PP Sbjct: 1 MVGEPIIQVEKGMRNIIQELRTDPYALETVFVSVIVFAGKEKVLSPLTELYKFYPPQFPI 60 Query: 91 QGDTPMGAAITKALDMVEERKREYRANGISYYRPWIFLITDGAPTDEWQAAANKVFRGEE 150 G T +G A+ ++ +++ ++ ++P IFL TDG PTD Q A N+ Sbjct: 61 GGGTSLGTALDCLMNDIDKSVKKTTVEMKGDWKPIIFLFTDGMPTDNPQQAFNRW-NAHY 119 Query: 151 DKRFAFFSIGVQG-ADMKTLAQISVRQPLPLQGLQ--FRELFSWLSSSLRSVSRSTPG-- 205 ++ I + D K L +IS + F+ F W+++S++S S S Sbjct: 120 KRKANLVCISIGDNTDTKMLGKISDNVLRLNDTGEQSFKAFFKWVTASIKSTSVSVTDMG 179 Query: 206 -TEVVLEAPKGWT 217 E+ L + G Sbjct: 180 TDEIQLASTSGID 192 >UniRef50_C7PNX3 von Willebrand factor type A n=2 Tax=Sphingobacteriales RepID=C7PNX3_CHIPD Length = 352 Score = 153 bits (387), Expect = 4e-36, Method: Composition-based stats. Identities = 45/184 (24%), Positives = 78/184 (42%), Gaps = 5/184 (2%) Query: 18 PRCPCILLLDVSGSMNGRPINELNAGLVTFRDELLADPLALKRVELGIVTF-GPVHVEQP 76 R P L+DVS SM G I + GL EL +DP AL+ + I+ F G P Sbjct: 2 RRLPIYFLIDVSESMVGEQIQFVEEGLAAIIKELKSDPYALETAWVSIIVFAGQAKTIVP 61 Query: 77 FTSAANFFPPILFAQGDTPMGAAITKALDMVEERKREYRANGISYYRPWIFLITDGAPTD 136 +F+PP T + + + + + A ++P +FL TDG PTD Sbjct: 62 LQEVISFYPPKFPIGAGTSLSNGLGHLMYEMRKNTIHTTATQKGDWKPIVFLFTDGTPTD 121 Query: 137 EWQAAANKVFRGEEDKRFAFFSIGVQGA-DMKTLAQISVRQPLPLQGL--QFRELFSWLS 193 + AA + + + +I ++ L +++ L ++E F W++ Sbjct: 122 DTSAAVREWKQN-WQNKSNLIAISFGDENNLSALKELTETVLLFKNATPQSYKEFFRWVT 180 Query: 194 SSLR 197 +S++ Sbjct: 181 ASIK 184 >UniRef50_Q0I303 Putative uncharacterized protein n=5 Tax=Pasteurellaceae RepID=Q0I303_HAES1 Length = 343 Score = 153 bits (386), Expect = 5e-36, Method: Composition-based stats. Identities = 47/201 (23%), Positives = 87/201 (43%), Gaps = 6/201 (2%) Query: 18 PRCPCILLLDVSGSMNGRPINELNAGLVTFRDELLADPLALKRVELGIVTFGPVH-VEQP 76 R P L++DVS SM G ++ + L DP AL+ V + ++ F V P Sbjct: 2 RRLPIFLVVDVSESMAGDSHRQMQEAINRLVQRLRCDPYALESVYISVIAFAGAAGVIAP 61 Query: 77 FTSAANFFPPILFAQGDTPMGAAITKALDMVEERKREYRANGISYYRPWIFLITDGAPTD 136 T +F+ P L T +GAA+ +D ++ + ++P +++++DG TD Sbjct: 62 LTELMSFYAPRLPMGSGTSLGAALNLTMDEIQRNVVRSSGDQKGDFKPLVYILSDGVATD 121 Query: 137 EWQAAANKVFRGEEDKRFAFFSIGVQG-ADMKTLAQISVRQPLPLQGLQFRELFSWLSSS 195 + +A + + E R ++G+ AD+ L QI+ + E + L+ S Sbjct: 122 DPTSAIQRWQQ-EFKSRTKLIAVGLGNFADLSALNQIA-ELTFRIDDQDLEEAYLTLTRS 179 Query: 196 L--RSVSRSTPGTEVVLEAPK 214 + +S+S L + Sbjct: 180 IEDSILSQSRSLGVAPLTIDE 200 >UniRef50_B6BJ58 Phage/colicin/tellurite resistance cluster TerY protein n=1 Tax=Campylobacterales bacterium GD 1 RepID=B6BJ58_9PROT Length = 229 Score = 150 bits (379), Expect = 3e-35, Method: Composition-based stats. Identities = 52/210 (24%), Positives = 83/210 (39%), Gaps = 11/210 (5%) Query: 5 ITFATSDFASNPEPRCPCILLLDVSGSMNGRPINELNAGLVTFRDELLADPLALKRVELG 64 + F +DF P +LLLDVS SM G I+ LN + + + ++L Sbjct: 1 MAFNPADFVVEEPKSIPVVLLLDVSYSMQGENIDTLNKAVESMLNSFKKAETMETFIKLS 60 Query: 65 IVTFGP---VHVEQPFTSAANFFPPILFAQGDTPMGAAITKALDMVEERKREYRANGISY 121 I+TFG V + P T + L G TPMGAA M+E++ Sbjct: 61 IITFGSENGVDLHTPLTEVSKIDFKPLTVSGSTPMGAAFKMGKAMIEDKDIF----KGRD 116 Query: 122 YRPWIFLITDGAPTDEWQAAANKVFRGEEDKRFAFFSIGVQGADMKTLAQISVRQPLPL- 180 YRP I L++DG P D+W+ + K+ ++ + AD L L Sbjct: 117 YRPTIVLLSDGEPNDDWRQPLDDFVSTGRTKKCDRMALAIGAADKTVLNMFIEGCENSLF 176 Query: 181 ---QGLQFRELFSWLSSSLRSVSRSTPGTE 207 + F ++ S+ ++S + Sbjct: 177 YAEDAENIIDEFKKITMSVTQRTKSVNKNQ 206 >UniRef50_C9ZGR0 Putative uncharacterized protein n=1 Tax=Streptomyces scabiei 87.22 RepID=C9ZGR0_STRSW Length = 253 Score = 146 bits (368), Expect = 6e-34, Method: Composition-based stats. Identities = 74/236 (31%), Positives = 101/236 (42%), Gaps = 31/236 (13%) Query: 1 MSEQIT------FATSDFASNPEPRCPCILLLDVSGSMNGRPINELNAGLVTFRDELLAD 54 M + +A +F +N R P +L LD S SM G PI LN L + EL D Sbjct: 1 MHREYPPTLPAEYADIEFENN-AQRMPLVLCLDTSSSMAGPPIQTLNNALAEWTRELHDD 59 Query: 55 PLALKRVELGIVTFGP--------------VHVEQPFTSAANFFPPILFAQGDTPMGAAI 100 VE+ +VTFG PF A F P L A G T M A+ Sbjct: 60 VSLSYSVEVAVVTFGGQGVGAWRGPQLLDPRTRTSPFIPAHAFQAPQLTAAGVTLMTEAL 119 Query: 101 TKALDMVEERKREYRANGISYYRPWIFLITDGAPTDE-------WQAAANKVFRGEEDKR 153 A+ +V RK E RA+G+ YYRP I L+TDG PTD W + + +R Sbjct: 120 ELAMHIVAARKSELRASGLQYYRPQICLVTDGLPTDPTGHLTDSWHRLVPVLAEEQSARR 179 Query: 154 FAFFSIGVQGA---DMKTLAQISVRQPLPLQGLQFRELFSWLSSSLRSVSRSTPGT 206 F ++IGV G + L + + +QG FREL +S+S + + Sbjct: 180 FRLYAIGVGGITDRGEQVLKAFAPKFNARIQGFPFRELLQMMSASANAEQKGAGDE 235 >UniRef50_C5PP99 von Willebrand factor, type A n=2 Tax=Bacteroidetes RepID=C5PP99_9SPHI Length = 256 Score = 140 bits (352), Expect = 5e-32, Method: Composition-based stats. Identities = 50/206 (24%), Positives = 78/206 (37%), Gaps = 12/206 (5%) Query: 18 PRCPCILLLDVSGSMNGRPINELNAGLVTFRDELLADPLALKRVELGIVTFG-PVHVEQP 76 R P LLD SGSM+G PI LN L + L D A + + + ++TF V P Sbjct: 47 RRLPVYFLLDTSGSMHGEPIQALNNALSGMINNLRTDAQAAETLWISMITFDREVKEIVP 106 Query: 77 FTSAANFFPPILFA--QGDTPMGAAITKALDMVEERKREYRANGISYYRPWIFLITDGAP 134 T+ +F P + G T G A+ D + +RP +F+ TDG P Sbjct: 107 LTALESFQLPEISCPESGPTFTGKALEILYDTATREVIKGSPEQKGDWRPLLFIFTDGKP 166 Query: 135 TDE--WQAAANKVFRGEEDKRFAFFSIGVQG-ADMKTLAQISVR--QPLPLQGLQFRELF 189 +D + K+ AD K L +++ ++ F Sbjct: 167 SDLQLYSQMIPKIRSLNFG---TIVGCAAGHMADDKKLLELTSDVVHLNTADSSTLKQFF 223 Query: 190 SWLSSSLRSVSRST-PGTEVVLEAPK 214 W+S ++ ++S V L P Sbjct: 224 KWVSDTIEQGNKSRGTTDTVALPPPP 249 >UniRef50_C9LWT6 Tellurium resistance protein n=1 Tax=Selenomonas sputigena ATCC 35185 RepID=C9LWT6_9FIRM Length = 212 Score = 138 bits (347), Expect = 1e-31, Method: Composition-based stats. Identities = 55/190 (28%), Positives = 85/190 (44%), Gaps = 10/190 (5%) Query: 32 MNGRPINELNAGLVTFRDELLADPLALKRVELGIVTFGP-VHVEQPFTSAANFFPPILFA 90 M G PI + G+ EL DP AL+ L ++TF V T F P L A Sbjct: 1 MMGEPIEAVRQGIKALLSELRGDPQALETAYLSVITFASQVRQTTKLTELMLFKEPRLEA 60 Query: 91 QGDTPMGAAITKALDMVEERKREYRANGISYYRPWIFLITDGAPTD--EWQAAANKVFRG 148 +G T MG A+ + V R+ +RP +FL+TDG+PTD +++ AA ++ + Sbjct: 61 EGCTLMGGALKLLAECVRTEVRKNTETQKGDWRPLVFLLTDGSPTDLEDFRQAAAEI-KS 119 Query: 149 EEDKRFAFFSIGVQGADMKTLAQISVR--QPLPLQGLQFRELFSWLSSSLRSVSRS---T 203 + + G AD L Q++ L + F+W+S S++ S+S Sbjct: 120 LKLGNIIACAAGA-DADTSYLKQLTDNVLMMNSLSAGDMAKFFAWVSGSIKMSSKSLDAK 178 Query: 204 PGTEVVLEAP 213 PG + L P Sbjct: 179 PGAAIELPPP 188 >UniRef50_A1VV60 von Willebrand factor, type A n=4 Tax=Proteobacteria RepID=A1VV60_POLNA Length = 240 Score = 137 bits (344), Expect = 4e-31, Method: Composition-based stats. Identities = 60/216 (27%), Positives = 96/216 (44%), Gaps = 14/216 (6%) Query: 10 SDFASNPEPRCPCILLLDVSGSMNGR-PINELNAGLVTFRDELLADPLALKRVELGIVTF 68 FA+ P I+L DVSGSM+ I+ LN L + +++G++TF Sbjct: 19 KAFAAPQARPLPVIVLADVSGSMSENGKIDALNVALKEMILSFGKESGLRAEIQVGLITF 78 Query: 69 GP--VHVEQPFTSAANFF-PPILFAQGDTPMGAAITKALDMVEERKREYRANGISYYRPW 125 G H P +A A G TPMG+A A ++E++++ YRP Sbjct: 79 GGREAHEHLPLVAAKVIGGVEAFKANGGTPMGSAFALARKLLEDKEQI----PSRAYRPV 134 Query: 126 IFLITDGAPTDEWQAAANKVFRGEEDKRFAFFSIGVQ-GADMKTLAQISVRQPLPL---- 180 + L++DGAPTD W+A + E ++ F++ + AD+ LAQ + P+ Sbjct: 135 LILVSDGAPTDAWEAPLADLKASERGQKATRFAMAIGADADLDMLAQFPNDREAPVFKTH 194 Query: 181 QGLQFRELFSWLSSSLRSVSRS-TPGTEVVLEAPKG 215 + F ++ S+ S S S P V L+ G Sbjct: 195 EARDIGRFFRAVTMSVVSRSTSAAPDQPVTLDMEDG 230 >UniRef50_Q5NWS4 Tellurium resistance protein n=8 Tax=Bacteria RepID=Q5NWS4_AZOSE Length = 214 Score = 135 bits (340), Expect = 1e-30, Method: Composition-based stats. Identities = 56/207 (27%), Positives = 89/207 (42%), Gaps = 10/207 (4%) Query: 17 EPRCPCILLLDVSGSMNGRPINELNAGLVTFRDELLADPLALKRVELGIVTFGP-VHVEQ 75 R P LL+D SGSM G P+ +N GL + L +P A++ V L + TF + Sbjct: 4 SRRLPVYLLIDTSGSMRGEPVESVNVGLRAMQTSLRQNPYAIETVHLSVTTFDSQIKDVL 63 Query: 76 PFTSAANFFPPIL--FAQGDTPMGAAITKALDMVEERKREYRANGISYYRPWIFLITDGA 133 P T+ + P + A G T +G A+ LD ++ R+ A + P +F++TDG Sbjct: 64 PLTALEDATIPEIVCPASGATLLGEALEHILDRAKKEVRQSSAEQKGDWAPLLFIMTDGK 123 Query: 134 PTDEWQAAANKVFRGEEDKRF-AFFSIGVQ-GADMKTLAQISV--RQPLPLQGLQFRELF 189 PTD + N+V + +F + + AD L I+ + F F Sbjct: 124 PTDTF--VFNQVAPAIKAFKFGSIIACAAGPKADPAGLRLITDHVVSLDTMDSAAFTAFF 181 Query: 190 SWLSSSLRSVSRST-PGTEVVLEAPKG 215 W+S ++ S S S + L Sbjct: 182 QWVSVTVSSGSMSVGAANTLSLPPTPP 208 >UniRef50_D1AAR7 von Willebrand factor type A n=1 Tax=Thermomonospora curvata DSM 43183 RepID=D1AAR7_THECD Length = 228 Score = 133 bits (336), Expect = 3e-30, Method: Composition-based stats. Identities = 51/184 (27%), Positives = 77/184 (41%), Gaps = 8/184 (4%) Query: 17 EPRCPCILLLDVSGSMNGRPINELNAGLVTFRDELLADPLALKRVELGIVTFGPVH-VEQ 75 E P L+ D S SM G P+ E+N L E+ ++P + L I++F V Sbjct: 3 EQILPFYLVCDESYSMAGNPLQEINDQLPQIVTEIASNPTVADKARLCIISFSDTAEVLL 62 Query: 76 PFTSAAN-FFPPILFAQGDTPMGAAITKALDMVEERKREYRANGISYYRPWIFLITDGAP 134 P + P L +G T GAA T D +E R+ +A G +RP +F +TDG P Sbjct: 63 PLADLNDVHQVPQLAPKGATSYGAAFTLLRDTIERDIRDLKAAGHVPFRPTVFFLTDGQP 122 Query: 135 TD-EWQAAANKVFRGEEDKRFAFFSIGVQGADMKTLAQISVRQPLPLQG-----LQFREL 188 TD +W A ++ + R + G +TL ++ + G RE Sbjct: 123 TDSDWATAHQRLTAKDFGPRPTILAFGFGDVRPETLRAVATFRAFIANGELDPRNALREF 182 Query: 189 FSWL 192 L Sbjct: 183 AKQL 186 >UniRef50_UPI0001745BB0 TerY3 n=1 Tax=Verrucomicrobium spinosum DSM 4136 RepID=UPI0001745BB0 Length = 345 Score = 132 bits (332), Expect = 9e-30, Method: Composition-based stats. Identities = 45/195 (23%), Positives = 85/195 (43%), Gaps = 8/195 (4%) Query: 18 PRCPCILLLDVSGSMNGRPINELNAGLVTFRDELLADPLALKRVELGIVTFGP-VHVEQP 76 R P +LLD S SM G + + G+ + L +P AL+ + +TF ++ P Sbjct: 2 RRLPVYVLLDCSESMIGNGLRGMRTGISSMLKALRQNPHALETAWISFITFDSRAELKSP 61 Query: 77 FTSAANFFPPILFAQGDTPMGAAITKALDMVEERKREYRAN--GISYYRPWIFLITDGAP 134 S PP L + T +G+A+ + + + + + +RP + LITDG P Sbjct: 62 LQSLDEVQPPRLLVRPGTSLGSALLLLSERILQEVKRTQPGTLTKGDFRPIVILITDGQP 121 Query: 135 TDEWQAAANKVFRGEEDKRFAFFSIGVQ-GADMKTLAQISVRQPLP--LQGLQFRELFSW 191 TD+W++A ++ + +++G D L +++ + +LF W Sbjct: 122 TDDWRSALREMNSTVKIANL--YAVGCGDDIDFAGLREMTDVVLNLQQTDEQGWAKLFVW 179 Query: 192 LSSSLRSVSRSTPGT 206 +S ++ + SR Sbjct: 180 ISETVSTASRGVADG 194 >UniRef50_B0A9L7 Putative uncharacterized protein n=2 Tax=Clostridium bartlettii DSM 16795 RepID=B0A9L7_9CLOT Length = 273 Score = 131 bits (330), Expect = 2e-29, Method: Composition-based stats. Identities = 46/208 (22%), Positives = 84/208 (40%), Gaps = 10/208 (4%) Query: 4 QITFATSDFASNPEPRCPCILLLDVSGSMNGRPINELNAGLVTFRDELLADPLALKRVEL 63 + F + + L+D SGSM+G+ I LN + EL A ++L Sbjct: 18 EYDFDPLEVKPISKKNLVIFFLVDTSGSMSGKKIGTLNTTMEELLPELRGLGGATTDIKL 77 Query: 64 GIVTFGPVH---VEQPFTSAANFFPPILFAQGDTPMGAAITKALDMVEERKREYRANGIS 120 ++TF ++P + + L A+G T +G A T+ + + ++E+ Sbjct: 78 AVMTFSSGCEWITKEPMSVDDYQYWTRLKAEGLTDLGEAFTELSNKL--SRKEFLNAPSL 135 Query: 121 YYRPWIFLITDGAPTDEWQAAANKVFRGEEDKRFAFFSIGVQG-ADMKTLAQISVRQPLP 179 Y P IFL+TDG TD+ + K ++G+ D + L + + L Sbjct: 136 SYAPVIFLLTDGYATDDALEGLKTLQHNNWYKYGLKVALGLGEKFDEELLKKFTGNPELV 195 Query: 180 LQG---LQFRELFSWLSSSLRSV-SRST 203 + Q +L ++ + + SRS Sbjct: 196 VTAKTSDQLSKLVKTIAVTSSQIGSRSM 223 >UniRef50_Q8DK92 Tlr0974 protein n=1 Tax=Thermosynechococcus elongatus BP-1 RepID=Q8DK92_THEEB Length = 241 Score = 131 bits (329), Expect = 2e-29, Method: Composition-based stats. Identities = 61/219 (27%), Positives = 95/219 (43%), Gaps = 18/219 (8%) Query: 1 MSEQITFATSDFASNP--EPRCPCILLLDVSGSMNGRPINELNAGLVTFRDELLADPLAL 58 M+ Q P E P LLLD S SM G PI L+ GL F+ E+ +D A Sbjct: 1 MTIQSNVPEWANVEVPGGERHLPVYLLLDTSSSMEGAPIESLHQGLEQFQREVSSDQFAR 60 Query: 59 KRVELGIVTFGPVHVEQP--FTSAANFFPPILFAQGDTPMGAAITKALDMVEER-KREYR 115 V++G++TF ++F PP+L A G T + A T L+ ++ R + Sbjct: 61 DIVKVGVITFASDAQLVTGGLVPISDFQPPMLTASGVTRLDLAFTVLLESIDRDVVRPVK 120 Query: 116 ANGISYYRPWIFLITDGAPTDEWQAAANKVFRGEEDKRF----------AFFSIGVQGA- 164 ++P +F++TDG PTD A ++++R D ++G Sbjct: 121 GGQKGDWKPAVFVLTDGRPTDRHGIATDELWRPARDALVNRPKGEIKPSVIVAVGCGPHV 180 Query: 165 DMKTLAQISVRQPLPLQGLQ--FRELFSWLSSSLRSVSR 201 D TL IS + + F LF +LS SL + ++ Sbjct: 181 DDDTLKAISTGTAFKMGTSEAAFVALFQYLSQSLTTSTQ 219 >UniRef50_B2K3B2 von Willebrand factor type A n=39 Tax=Gammaproteobacteria RepID=B2K3B2_YERPB Length = 233 Score = 130 bits (326), Expect = 4e-29, Method: Composition-based stats. Identities = 57/207 (27%), Positives = 86/207 (41%), Gaps = 11/207 (5%) Query: 18 PRCPCILLLDVSGSMNGRPINELNAGLVTFRDELLADPLALKRVELGIVTFGP-VHVEQP 76 R P LL+D SGSM G I+ +N G+ L DP AL+ V L I+T+ P Sbjct: 23 RRLPVYLLIDTSGSMRGESIHAVNVGIQAMMSALRQDPYALESVHLSIITYDNQAREYIP 82 Query: 77 FTSAANFFPPILF--AQGDTPMGAAITKALDMVEERKREYRANGISYYRPWIFLITDGAP 134 T+ NF + + G T GAA+ + VE + + +RP +FL+TDG P Sbjct: 83 LTALENFQFTDITVPSAGGTFTGAALECLIHCVERDIQRSDGDQKGDWRPLVFLMTDGTP 142 Query: 135 TD--EWQAAANKVFRGEEDKRFAFFSIGVQGADMKTLAQISVRQPL--PLQGLQFRELFS 190 +D + A +V + ++G A + L Q++ + L F F Sbjct: 143 SDVYAYGEAIKEVKKRAFGS-IIACAVGA-KAKHEHLKQLTSQVVALETLDSTAFSGFFK 200 Query: 191 W--LSSSLRSVSRSTPGTEVVLEAPKG 215 W S + S S + L P Sbjct: 201 WVSASVAAGSSSAGINTGQDNLPPPPP 227 >UniRef50_UPI0001B52F00 von Willebrand factor type A n=1 Tax=Fusobacterium sp. D11 RepID=UPI0001B52F00 Length = 218 Score = 129 bits (325), Expect = 5e-29, Method: Composition-based stats. Identities = 48/200 (24%), Positives = 79/200 (39%), Gaps = 11/200 (5%) Query: 11 DFASNPEPRCPCILLLDVSGSMNGRPINELNAGLVTFRDELLADPLALKRVELGIVTFGP 70 +F S P+ P ILL D S SM + ELN + L + + +TFG Sbjct: 2 EFTSQPKKVLPLILLADTSSSMR-EWMRELNTAIRDMLGTLKEQESLKAEIHISFITFGN 60 Query: 71 --VHVEQPFTSAANFFPPILFAQGDTPMGAAITKALDMVEERKREYRANGISYYRPWIFL 128 ++ T +N G TP+G A+ A +MVE R+ Y P I L Sbjct: 61 GGANLHTALTPVSNIEFNDFTEGGMTPLGGALRIAKEMVENREII----PSKSYAPIILL 116 Query: 129 ITDGAPTDE-WQAAANKVFRGEEDKRFAFFSIGVQ-GADMKTLAQISVRQPLPL--QGLQ 184 ++DGAP D W+ + K+ S+G+ D L S + + Sbjct: 117 LSDGAPNDNGWENEMYRFINDGRSKKCMRMSLGIGRDYDYDVLKGFSSNGEVYEAKDSMN 176 Query: 185 FRELFSWLSSSLRSVSRSTP 204 + F +++ +++ + S Sbjct: 177 IIDFFKFMTMTIKEKTLSKD 196 >UniRef50_UPI00016C09D7 hypothetical protein Epulo_01596 n=1 Tax=Epulopiscium sp. 'N.t. morphotype B' RepID=UPI00016C09D7 Length = 262 Score = 129 bits (325), Expect = 6e-29, Method: Composition-based stats. Identities = 51/225 (22%), Positives = 85/225 (37%), Gaps = 21/225 (9%) Query: 8 ATSDFASNPEPRCPCILLLDVSGSMNGRPINELNAGLVTFRDELLADP--LALKRVELGI 65 A ++ P +LD SGSM G PI LN + L A ++++ + Sbjct: 3 AINEIDEMPRKELHVFYVLDTSGSMTGVPIAALNTAMEECTVALKDLAKKNADAKLKIAV 62 Query: 66 VTFGP----VHVEQPFTSAANFFPPILFAQGDTPMGAAITKALDMVEERKREYRANGISY 121 + F V P + F L A G T +GAA+ + ++ + + + Sbjct: 63 LEFSTGAKWVTYNGPESLDDEFEWEHLSAGGVTDIGAALREL--DIKLSRNGFLKSMTGA 120 Query: 122 YRPWIFLITDGAPTDEWQAAANKVFRGEEDKRFAFFSIGVQ-GADMKTLAQISVRQPLPL 180 P I +TDG PTDE+ AA ++ + + AD ++ I + Sbjct: 121 LMPVIIFMTDGYPTDEYAAALAELRKNRWYTSSTKIGFAIGDDADAAIISSIVGNSEAVI 180 Query: 181 QGLQFRELFS-----------WLSSSLRSVSRSTPGTEVVLEAPK 214 + ELF L+SS R+ S G+ +V +A Sbjct: 181 KTSDL-ELFKRLMKFVTVRASMLASSSRTTSSFVNGSAIVQDAID 224 >UniRef50_C0F0K9 Putative uncharacterized protein n=2 Tax=Eubacterium hallii DSM 3353 RepID=C0F0K9_9FIRM Length = 291 Score = 128 bits (323), Expect = 9e-29, Method: Composition-based stats. Identities = 52/224 (23%), Positives = 88/224 (39%), Gaps = 15/224 (6%) Query: 8 ATSDFASNPEP---RCPCILLLDVSGSMNGRPINELNAGLVTFRDELLADPLALKRVELG 64 A DF EP ++DVSGSM G I LN+ + L+ A V++ Sbjct: 37 AEEDFLDTMEPAKKSMTIFFMIDVSGSMKGTKIGSLNSTMEELLPSLIGVGEASTDVKIA 96 Query: 65 IVTFG-PVHVEQ--PFTSAANFFPPILFAQGDTPMGAAITKALDMVEERKREYRANGISY 121 I+ F V P + L A G T MG A + + + + ++ Sbjct: 97 IMKFSTDVEWVTPEPVKIEEYQYWNRLEADGLTFMGDAFMELSKKL--SRSTFLSSPSLS 154 Query: 122 YRPWIFLITDGAPTDEWQAAANKVFRGEEDKRFAFFSIGVQG-ADMKTLAQISVRQPLPL 180 + P IFL++DG+P D+W+ + + + + + ++G+ +M L + L + Sbjct: 155 FAPVIFLLSDGSPNDDWKKGLDTLKQNKWFQHGLKIALGIGSKVNMDVLRAFTGNDELAV 214 Query: 181 ---QGLQFRELFSWL---SSSLRSVSRSTPGTEVVLEAPKGWTS 218 Q REL L SS + S S + + A + Sbjct: 215 QAKNADQLRELIKLLAVTSSQIGSRSLALVDSNGRQPAAETVAQ 258 >UniRef50_UPI0001AED79F von Willebrand factor type A n=1 Tax=Streptomyces albus J1074 RepID=UPI0001AED79F Length = 221 Score = 128 bits (322), Expect = 1e-28, Method: Composition-based stats. Identities = 50/190 (26%), Positives = 78/190 (41%), Gaps = 6/190 (3%) Query: 18 PRCPCILLLDVSGSMNGRPINELNAGLVTFRDELLADPLALKRVELGIVTFG-PVHVEQP 76 P LL D SGSM G PI+ +N L E+ +P + ++ F V QP Sbjct: 2 QILPFYLLCDESGSMTGDPIDAINRALPDLHHEISTNPTVADKTRFCLIGFSDDASVLQP 61 Query: 77 FTSAANF-FPPILFAQGDTPMGAAITKALDMVEERKREYRANGISYYRPWIFLITDGAPT 135 ++ P L A G T G A L VE+ E +A G YRP F ++DG PT Sbjct: 62 LVDLSDIDEVPALSAGGLTDYGTAFRTLLRSVEKDVAELKAQGHEVYRPVAFFLSDGIPT 121 Query: 136 D-EWQAAANKVFRGEEDKRFAFFSIGVQGADMKTLAQISVRQPLPLQGLQFRELFSWLSS 194 D +W A ++ + + G+ A+ + + Q++ + + L Sbjct: 122 DEDWPTAHRELLNSRYAPKI--IAFGIGDAEAQIIGQVANFRAFIQKDNSVSPA-QALRE 178 Query: 195 SLRSVSRSTP 204 S++RS Sbjct: 179 FASSLTRSIV 188 >UniRef50_UPI00018742C1 von Willebrand factor type A n=1 Tax=Corynebacterium amycolatum SK46 RepID=UPI00018742C1 Length = 228 Score = 127 bits (320), Expect = 2e-28, Method: Composition-based stats. Identities = 55/214 (25%), Positives = 89/214 (41%), Gaps = 12/214 (5%) Query: 15 NPEPR---CPCILLLDVSGSMNGRPINELNAGLVTFRDELLADPLALKRVELGIVTFGPV 71 N EPR P + D SGSM G I ELN+GL + +E+ P A V ++ F Sbjct: 6 NMEPRGNILPIYFVADESGSM-GPDIAELNSGLQSLLNEIRMAPFAAANVRFSVIGFDNE 64 Query: 72 HVE-QPFTSAANF-FPPILFAQGDTPMGAAITKALDMVEERKREYRANGISYYRPWIFLI 129 + P L A+ T A + + + +A G RP +FL+ Sbjct: 65 ARLYLSNADLRHVEQMPTLSARFATYFSTAFDLLNAQIPDDVAQLKAEGYRVNRPAVFLL 124 Query: 130 TDGAP--TDEWQAAANKVFRGEEDKRFAFFSIGVQGADMKTLAQISVRQPLPLQGLQFRE 187 TDG P D WQ A + + + G+ +D + + Q++ + Q Q + Sbjct: 125 TDGYPMSDDLWQEALSDLQNAPHHPNI--LAFGIGESDAQIIGQMASKNGWAFQAAQGAD 182 Query: 188 LFSWLSSSLRSVSRSTP--GTEVVLEAPKGWTSV 219 + LS + S+++S GT V +P+ T + Sbjct: 183 TGAMLSEFMSSLTQSVISSGTSVANGSPEIITDI 216 >UniRef50_B4VGR3 Putative uncharacterized protein n=1 Tax=Streptomyces sp. Mg1 RepID=B4VGR3_9ACTO Length = 206 Score = 127 bits (319), Expect = 3e-28, Method: Composition-based stats. Identities = 44/194 (22%), Positives = 79/194 (40%), Gaps = 9/194 (4%) Query: 29 SGSMNGRPINELNAGLVTFRDELLADPLALKRVELGIVTFGP-VHVEQPFTSAANFFPPI 87 SGSM+G P+ +N L + +L DP + + +VTF P + A+ P Sbjct: 2 SGSMSGGPMAAMNTALPAMQRAILDDPTTGEIARVSVVTFSDTAACVLPLSDMAHARMPT 61 Query: 88 LFAQGDTPMGAAITKALDMVEERKREYRANGISYYRPWIFLITDGAPTD--EWQAAANKV 145 L QG T + + + G Y+RP +F ++DG W++ +++ Sbjct: 62 LSPQGGTDFAEGFRVGREALVDGIGALGR-GARYHRPVVFFLSDGQHNSSQSWKSGFDRL 120 Query: 146 FRGEEDKRFAFFSIGVQGADMKTLAQISVRQPLPLQGLQ----FRELFSWLSSSLRSVSR 201 E+ S G A+ +AQ+S R + + +E+ + S+++ S Sbjct: 121 RSKEDKYGAEVVSFGFGQANRDVIAQVSTRHAFFAEDMDPAVAVKEILHTVLMSIKTTSG 180 Query: 202 S-TPGTEVVLEAPK 214 S G L P+ Sbjct: 181 SFQAGGAAGLTIPE 194 >UniRef50_B7AIG3 Putative uncharacterized protein n=2 Tax=Bacteroidales RepID=B7AIG3_9BACE Length = 247 Score = 124 bits (311), Expect = 2e-27, Method: Composition-based stats. Identities = 43/212 (20%), Positives = 71/212 (33%), Gaps = 12/212 (5%) Query: 10 SDFASNPEPRCPCILLLDVSGSMNGRPINELNAGLVTFRDELLADPLALKRVELGI--VT 67 D S P ++D SGSM G I +N + L + E+ + + Sbjct: 5 DDVVSVPRRTMTLFFVIDTSGSMAGNKIGAVNDAVENVLPMLDEISASNPDAEIKVAALE 64 Query: 68 FGPVHVEQ--PFTSAANFFPPILFAQGDTPMGAAITKALDMVEERKREYRANGISYYRPW 125 F A+ F + A G T +GAA + + + + + P Sbjct: 65 FSSGCNWLYDEPKLASEFVWQDVTASGLTSLGAACQELNTKL--SRNGFMQTPSGSFAPA 122 Query: 126 IFLITDGAPTDEWQAAANKVFRGEEDKRFAFFSIGVQ-GADMKTLAQISVRQPLPLQGLQ 184 I L++DG PTD++ +K+ K +I + AD L Q + Sbjct: 123 IILLSDGGPTDDFYGGLSKLKANNWFKNAIKIAIAIGDDADKDVLTQFTGTNEAVFTVHN 182 Query: 185 FRELFSWLSSSLRSVSR-----STPGTEVVLE 211 L + + S+ ST G E Sbjct: 183 IDALKQIIRVVAVTSSQIGSKSSTAGDTTKQE 214 >UniRef50_B8HT03 von Willebrand factor type A n=2 Tax=Cyanothece RepID=B8HT03_CYAP4 Length = 236 Score = 124 bits (311), Expect = 2e-27, Method: Composition-based stats. Identities = 45/218 (20%), Positives = 77/218 (35%), Gaps = 21/218 (9%) Query: 17 EPRCPCILLLDVSGSM-NGRPINELNAGLVTFRDELLADPLALKRVELGI--VTFGP-VH 72 I L D SGSM + I LNA + L ++ + + F Sbjct: 17 SRPLHFIWLCDCSGSMVSQGKIQSLNAAIKETIPMLQQTAADNPNAQVLVRAIKFSDGAE 76 Query: 73 VEQP-FTSAANFFPPILFAQGDTPMGAAITKALDMVEERKREYRANGISYYRPWIFLITD 131 P T F L A G T +G A+ + + RA P + LI+D Sbjct: 77 WHIPTPTPVDQFRWTDLTAGGVTDLGMALEMVAEQLRVPPMSERA-----LPPVLVLISD 131 Query: 132 GAPTDEWQAAANKVFRGEEDKRFAFFSIGVQ-GADMKTLAQISVRQPLPL----QGLQFR 186 G PTD++ + + ++ +I V A+ + L + L + Q Sbjct: 132 GQPTDDFGSGLKALMAQPWGQKAVRVAIAVGQDANHEVLQKFIGPSELRVLQANNPDQLV 191 Query: 187 ELFSWLSSSLRSVSRS------TPGTEVVLEAPKGWTS 218 W S+ +++VS+ P V+L+ + + Sbjct: 192 NQIRWTSTLIKTVSQPRLEKSHRPDQMVILQPQEPAAT 229 >UniRef50_D0N9W4 Putative uncharacterized protein n=2 Tax=Phytophthora infestans T30-4 RepID=D0N9W4_PHYIN Length = 2146 Score = 124 bits (311), Expect = 3e-27, Method: Composition-based stats. Identities = 42/192 (21%), Positives = 74/192 (38%), Gaps = 21/192 (10%) Query: 19 RCPCILLLDVSGSMNGRPINELNAGLVTFRDELLADPLALKRVELGIVTFGPVHVEQ--- 75 + + +LD SGSMNG+P N+L A + +AD L + +VTF Sbjct: 1901 KMHHVFVLDCSGSMNGQPWNDLMAAWKEYVYNRIADGATLDL--VSVVTFDNSAQIVYEA 1958 Query: 76 -PFTSAANFFPPILFAQGDTPMGAAITKALDMVEERKREYRANGISYYRPWIFLITDGAP 134 T+ N + G T A + A +++ ++P I +DG P Sbjct: 1959 RSITTVTNARIQ--YRGGGTNYAAGLRSANEVLSR-------VNFDMFKPAIVFFSDGHP 2009 Query: 135 TDEWQ--AAANKVFRGEEDKRFAFFSIGVQGADMKTLAQIS----VRQPLPLQGLQFREL 188 D Q A + E F++G ++ L +++ L G + + Sbjct: 2010 CDPLQGEELATHIRGCYERNGLQAFAVGFGSINLNMLERVAEKLGGTYHHVLTGNELKAT 2069 Query: 189 FSWLSSSLRSVS 200 F +S+SL + + Sbjct: 2070 FFSISASLSTRA 2081 >UniRef50_UPI0001C37785 von Willebrand factor type A n=1 Tax=Ruminococcus flavefaciens FD-1 RepID=UPI0001C37785 Length = 285 Score = 123 bits (310), Expect = 3e-27, Method: Composition-based stats. Identities = 45/207 (21%), Positives = 84/207 (40%), Gaps = 14/207 (6%) Query: 11 DFASNP-------EPRCPCILLLDVSGSMNGRPINELNAGLVTFRDELLADPLALKRVEL 63 DF +P L+D SGSM G+ + ELN + E+ A V++ Sbjct: 35 DFDDDPLSATGVSRKSLVIFFLIDTSGSMKGKKMGELNTVMEELIPEIRRVGEADTEVKV 94 Query: 64 GIVTFGPVHV--EQPFTSAANFFPPILFAQGDTPMGAAITKALDMVEERKREYRANGISY 121 ++TF +F L A G T MGAA + + + + + Sbjct: 95 AVLTFSTDVRWMYSTPIPIEDFEWARLRANGVTSMGAAFKEL--SLRMSRNSFLNSPSLS 152 Query: 122 YRPWIFLITDGAPTDEWQAAANKVFRGEEDKRFAFFSIGVQG-ADMKTLAQISVRQPLPL 180 + P IFL+TDG P+D+++ ++ K ++G+ A+ LA+ + + + Sbjct: 153 FAPVIFLMTDGYPSDDYREGLKELQSNSWYKFGLKAALGIGNEANDDVLAEFTGSKDTVV 212 Query: 181 QGLQFRELFSWLSSSLRSVSRSTPGTE 207 +L + + +V+ S G++ Sbjct: 213 HAYSGGQLAQMIK--IIAVTSSQIGSK 237 >UniRef50_UPI000155CC23 PREDICTED: similar to ITI-like protein n=3 Tax=Amniota RepID=UPI000155CC23 Length = 1374 Score = 123 bits (309), Expect = 4e-27, Method: Composition-based stats. Identities = 37/200 (18%), Positives = 69/200 (34%), Gaps = 27/200 (13%) Query: 16 PEPRCPCILLLDVSGSMNGRPINELNAGLVTFRDELLADPLALKRVELGIVTFGPVHVEQ 75 P + + ++DVSGSM G + + + ++L D IVTF Sbjct: 314 PPVQKNVVFVIDVSGSMFGTKMKQTKKAMHVILNDLHHDD------YFNIVTFSDAVSVW 367 Query: 76 ----------PFTSAANFFPPILFAQGDTPMGAAITKALDMVEERKREYRANGISYYRPW 125 P +A + + A G T + AA+ A + + E P Sbjct: 368 KASGSIQATPPNIKSAKVYVNKMEADGWTDINAALLVAASVFNQSTGETGRGKGLKKIPL 427 Query: 126 IFLITDGAPTDEWQAAANKVFRGEE--DKRFAFFSIGVQ-GADMKTLAQISVRQPLPLQ- 181 I +TDG T A+ + ++ + F + AD + ++S+ + Sbjct: 428 IIFLTDGEATAGVTVASRILSNAKQSLKGNISLFGLAFGDDADYHLMRRLSLENRGVARR 487 Query: 182 -------GLQFRELFSWLSS 194 LQ + + ++S Sbjct: 488 IYEDADATLQLKGFYDEIAS 507 >UniRef50_C9RJN5 von Willebrand factor type A n=1 Tax=Fibrobacter succinogenes subsp. succinogenes S85 RepID=C9RJN5_FIBSS Length = 256 Score = 123 bits (309), Expect = 4e-27, Method: Composition-based stats. Identities = 47/207 (22%), Positives = 80/207 (38%), Gaps = 12/207 (5%) Query: 8 ATSDFASNPEPRCPCILLLDVSGSMNGRPINELNAGLVTFRDEL--LADPLALKRVELGI 65 P I ++D SGSM+G I LN + D++ ++ ++++ + Sbjct: 5 DPFALEPIPRRVTHLIFMVDTSGSMSGSKIASLNTAVRDALDDVGDISKNCGDSQIKIAV 64 Query: 66 VTFGPVH---VEQPFTSAANFFPPILFAQGDTPMGAAITKALDMVEERKREYRANGISYY 122 + F EQP A F L A G T G+A + LD R + Sbjct: 65 LEFSSAVNWMYEQPL-EAEKFQWQDLSASGTTSFGSACAE-LDAKLSRSNGFMGEKTGCR 122 Query: 123 RPWIFLITDGAPTDEWQAAANKVFRGEEDKRFAFFSIGVQ-GADMKTLAQISVRQPLPL- 180 P I L++DGAPTD + K+ K +I + A+ L + + + Sbjct: 123 APAIVLLSDGAPTDGYVRKLEKLKGNRWFKAGVKVAIAIGDDANNDVLREFTGSSESVIT 182 Query: 181 --QGLQFRELFSWLSSSLRSV-SRSTP 204 Q +++ +S S +V S+S Sbjct: 183 VHNVDQLKKMIHTVSVSATTVASQSAS 209 >UniRef50_C2HF13 von Willebrand factor type A n=1 Tax=Finegoldia magna ATCC 53516 RepID=C2HF13_PEPMA Length = 249 Score = 122 bits (307), Expect = 8e-27, Method: Composition-based stats. Identities = 50/205 (24%), Positives = 88/205 (42%), Gaps = 16/205 (7%) Query: 16 PEPRCPCILLLDVSGSMNGRPINELNAGLVTFRDELL--ADPLALKRVELGIVTFGPV-H 72 P ++D SGSM G I E+N+ + EL ++ V++ I++F Sbjct: 11 PRKMMVLFFVIDTSGSMKGTKIGEVNSAIEEILPELSDISNSNPDAEVKMAILSFNSEIQ 70 Query: 73 VEQPFTSAAN---FFPPILFAQGDTPMGAAITKALDMVEERKREYRANGISYYRPWIFLI 129 P T + + L A G T MGAA + + K + + S Y P IFL+ Sbjct: 71 WITPKTGPVDPGVYLWRDLNANGTTRMGAAFEELESKLHGDK--FMKSATSSYAPVIFLM 128 Query: 130 TDGAPT---DEWQAAANKVFRGEEDKRFAFFSIGVQ-GADMKTLAQISVRQPLPLQGLQ- 184 +DG PT +++Q+ NK+ + K ++G+ AD+ L + + LQ Sbjct: 129 SDGMPTETEEQFQSGLNKLKANKWFKSGIKVALGIGQDADLDVLEAFTGTKEAVLQTNNV 188 Query: 185 --FRELFSWLSSSLRSV-SRSTPGT 206 + + ++S + + S+S G Sbjct: 189 KKLKAMIQFVSVTSSQIASQSVSGG 213 >UniRef50_UPI0001B4E5FD von Willebrand factor type A n=1 Tax=Streptomyces hygroscopicus ATCC 53653 RepID=UPI0001B4E5FD Length = 228 Score = 121 bits (304), Expect = 2e-26, Method: Composition-based stats. Identities = 42/211 (19%), Positives = 75/211 (35%), Gaps = 15/211 (7%) Query: 17 EPRCPCILLLDVSGSMNGRPINELNAGLVTFRDELLADPLALKRVELGI--VTFGPVHV- 73 I LLD S SM G I LN + E+ + +L + +TF Sbjct: 11 NRPVHFIWLLDCSYSMQGEKIARLNYAIREAIPEMRSVAHDNPAAQLLLRTLTFSTTAQW 70 Query: 74 -EQPFTSAANFFPPILFAQGDTPMGAAITKALDMVEERKREYRANGISYYRPWIFLITDG 132 + +F + G T +G A+ ++ RA +P + L++DG Sbjct: 71 HHKNPVPVDDFTWQDVQVDGMTNLGEALDLVSRELQTPPMPQRA-----LKPVLALVSDG 125 Query: 133 APTDEWQAAANKVFRGEEDKRFAFFSIGVQ-GADMKTLAQISVRQPLP----LQGLQFRE 187 PTD+W+A + ++ +I + AD L + L Q Sbjct: 126 VPTDDWKAGLKAIDATPWGRKAVRVAIAIGEDADRNVLQEFLGNPELRPLDANSPKQLAA 185 Query: 188 LFSWLS-SSLRSVSRSTPGTEVVLEAPKGWT 217 W S +++++ S+ + L P + Sbjct: 186 AIRWASTAAVKAASQPVAASSDTLSKPLPYA 216 >UniRef50_C5CDX4 von Willebrand factor type A n=1 Tax=Kosmotoga olearia TBF 19.5.1 RepID=C5CDX4_KOSOT Length = 730 Score = 121 bits (303), Expect = 2e-26, Method: Composition-based stats. Identities = 37/217 (17%), Positives = 79/217 (36%), Gaps = 33/217 (15%) Query: 19 RCPCILLLDVSGSMNGRPINELNAGLVTFRDELLADPLALKRVELGIVTFGP-----VHV 73 + +LD+SGSM+G+ I + L+ L I+TF Sbjct: 273 PKDIVFILDISGSMSGQKIEKAKLALLQVLQMLHEGD------RFSIITFNNEVNNLTER 326 Query: 74 EQPFTSAANFF--PPILFAQGDTPMGAAITKALDMVEERKREYRANGISYYRPWIFLITD 131 PF+ ++ + A G T + A+ + ++++ + +TD Sbjct: 327 LLPFSDRTEWYPAVKQIMAGGMTNIHDALLEGIEVL-------GTQSTDDRYKVVLFLTD 379 Query: 132 GAPTD---EWQAAANKVFRGEEDKRFAFFSIGVQ-GADMKTLAQISVRQPLPLQ------ 181 GAPT+ + + + + F GV + + L +++ + ++ Sbjct: 380 GAPTEGITDIGTIIRDSTKLAKVRDVHLFVFGVGYDVNAELLDELAEKGGGKVKYIVENE 439 Query: 182 --GLQFRELFSWL-SSSLRSVSRSTPGTEVVLEAPKG 215 + EL+ + + + +V GT++ PKG Sbjct: 440 EIDEKVLELYRMIETPVMSNVHLEINGTDISYVLPKG 476 >UniRef50_UPI0001B4AB93 von Willebrand factor type A n=1 Tax=Bacteroides sp. 2_1_7 RepID=UPI0001B4AB93 Length = 250 Score = 120 bits (302), Expect = 2e-26, Method: Composition-based stats. Identities = 37/188 (19%), Positives = 68/188 (36%), Gaps = 6/188 (3%) Query: 11 DFASNPEPRCPCILLLDVSGSMNGRPINELNAGLVTFRDEL--LADPLALKRVELGIVTF 68 D S P + ++D SGSM G+ I +N + + ++D + + + F Sbjct: 5 DNESIPRRKMILFFVIDTSGSMIGKKIGSVNDAIENVLPMIGEISDENPDAEINVAALEF 64 Query: 69 GPVHVEQ--PFTSAANFFPPILFAQGDTPMGAAITKALDMVEERKREY-RANGISYYRPW 125 A +F + A G T +G A + + ++G Y+ P Sbjct: 65 STGTRWLYDEPKEAKDFIWQKVEANGLTSLGEACEELNKKLSRSGGFMPTSSGSGYFSPA 124 Query: 126 IFLITDGAPTDEWQAAANKVFRGEEDKRFAFFSIGVQG-ADMKTLAQISVRQPLPLQGLQ 184 I L++DG PTD ++ + K +I + AD + L Q + + Sbjct: 125 IILLSDGGPTDNFEGGLKTLQGNSWFKHAIKIAIAIGDDADKEVLKQFTGSSEAVITVHN 184 Query: 185 FRELFSWL 192 L + Sbjct: 185 IEALKKMI 192 >UniRef50_A6G3C6 von Willebrand factor, type A n=1 Tax=Plesiocystis pacifica SIR-1 RepID=A6G3C6_9DELT Length = 211 Score = 120 bits (300), Expect = 5e-26, Method: Composition-based stats. Identities = 74/208 (35%), Positives = 107/208 (51%), Gaps = 14/208 (6%) Query: 26 LDVSGSM-NGRPINELNAGLVTFRDELLADPLALKRVELGIVTFGPVHVEQ-PFTSAANF 83 +D S SM I +N L FR +++ DPLA KR++L +++F V + F SA N+ Sbjct: 1 MDRSHSMIFNDHIGAVNRSLQAFRADIMEDPLARKRLDLCVISFNHECVTENHFCSAQNW 60 Query: 84 FPPILFAQGDTPMGAAITKALDMVEERKREYRANGISYYRPWIFLITDGAPTD------E 137 PP L G T MG AI L+ + R + YR +G+ YRPW+ L+TDG PTD Sbjct: 61 RPPTLVPGGATGMGQAIKVGLETLRGRLQRYRLDGVDCYRPWVMLVTDGLPTDMQPNDAR 120 Query: 138 WQAAANKVFRGEEDKRFAFFSIGVQGADMKTLAQI-SVRQPLPLQGLQFRELFSWLSSSL 196 W + GE+ +RF FFS+ V + L Q+ + R PL ++ + +F WLS+S Sbjct: 121 WMEVRQLIQDGEQKRRFMFFSVAVLPEAIPALRQLGAQRPPLQVREGKIPTMFKWLSASF 180 Query: 197 RSVSRSTPGTEV-----VLEAPKGWTSV 219 S+SRS G V GW + Sbjct: 181 SSISRSQLGAPVSGITEPTATETGWADI 208 >UniRef50_UPI000180CCF8 PREDICTED: similar to PK-120 n=1 Tax=Ciona intestinalis RepID=UPI000180CCF8 Length = 864 Score = 120 bits (300), Expect = 5e-26, Method: Composition-based stats. Identities = 42/209 (20%), Positives = 74/209 (35%), Gaps = 32/209 (15%) Query: 7 FATSDFASNPEPRCPCILLLDVSGSMNGRPINELNAGLVTFRDELLADPLALKRVELGIV 66 FA ++ P+ + ++DVSGSM+G I + L T D+L + I+ Sbjct: 288 FAPTNLPVIPKK---VVFVIDVSGSMSGHKIVQTKEALRTILDDLNEID------QFNII 338 Query: 67 TFGPVHVEQPFTSAANFF----------PPILFAQGDTPMGAAITKALDMVEERKREYRA 116 TF + ++A+G T AA + ++E Sbjct: 339 TFSSTTNVWHPNEMVDVNPTNIRNAKKHVRSMYARGGTNFNAAALDGIQLLET--ISSNR 396 Query: 117 NGISYYRPWIFLITDGAPTDEWQ--AAANKVFRGEEDKRFAFFSIGVQGA-DMKTLAQIS 173 + L+TDG PT A + R + R++ F +G D + L QI+ Sbjct: 397 TNTLEEASMMILLTDGQPTVGVTGNEAIRRNIRERVNGRYSIFCLGFGQHLDHEFLDQIA 456 Query: 174 VR--------QPLPLQGLQFRELFSWLSS 194 LQ ++ + ++S Sbjct: 457 SENKGLSRKIYNDADAALQLKDFYDEVAS 485 >UniRef50_C3XUD0 Putative uncharacterized protein n=1 Tax=Branchiostoma floridae RepID=C3XUD0_BRAFL Length = 443 Score = 118 bits (297), Expect = 1e-25, Method: Composition-based stats. Identities = 44/188 (23%), Positives = 65/188 (34%), Gaps = 16/188 (8%) Query: 19 RCPCILLLDVSGSMNGRPINELNAGLVTFRDELLADPL-ALKRVELGIVTFGP-VHVEQP 76 +L LD SGSMNGR + EL G+ F + R + +V FG + QP Sbjct: 2 PLDTVLCLDTSGSMNGRGMAELKKGVRHFLLGVQETANKMSLRENVAVVEFGGGARIIQP 61 Query: 77 FTS---AANFFPPILFAQGDTPMGAAITKALDMVEERKREYRANGISYYRPWIFLITDGA 133 + L A G TPM + +A+ + +R G P + L+TDG Sbjct: 62 LSGNYGTVMQSVDNLKAGGTTPMFEGLMEAMKEILQRGGVLTLPGGRKMTPRVILMTDGY 121 Query: 134 PTDEWQAAANKVFRGEEDKR-------FAFFSIGVQ-GADMKTLAQIS---VRQPLPLQG 182 P D+ + G + +G D L I+ + Sbjct: 122 PDDKENVLKAALSFGPAGWQAVGLPHPIPIACVGCGDDVDKDLLQAIAKLTNGMYILGDV 181 Query: 183 LQFRELFS 190 Q E F Sbjct: 182 SQLSEFFR 189 >UniRef50_Q0RFF8 Putative uncharacterized protein n=1 Tax=Frankia alni ACN14a RepID=Q0RFF8_FRAAA Length = 209 Score = 118 bits (296), Expect = 1e-25, Method: Composition-based stats. Identities = 47/182 (25%), Positives = 75/182 (41%), Gaps = 9/182 (4%) Query: 29 SGSMNGRPINELNAGLVTFRDELLADPLALKRVELGIVTFGPV-HVEQPFTSAANFFPPI 87 S SM G P+ LN L + E+ ++P + + IVTF V P A + P Sbjct: 2 SASMAGGPLEALNDSLPALQKEMQSNPTVGEIARISIVTFSDVGRTVVPLCDLAEVYLPE 61 Query: 88 LFAQGDTPMGAAITKALDMVEERKREYRANGISYYRPWIFLITDGAPT--DEWQAAANKV 145 L +G T AA + +E R G YRP +F ++DG +W AA N + Sbjct: 62 LMVEGGTNFAAAFQETRRAIEGGLRSL-PKGTPIYRPVVFFMSDGEHQAPGDWTAALNDL 120 Query: 146 FRGEEDKRFAFFSIGVQG-ADMKTLAQISVRQPLPLQ----GLQFRELFSWLSSSLRSVS 200 + G ++ ++ +I+ R + Q RE+ + L S+R+ S Sbjct: 121 RDRSWRFAPEVVAFGFGDQVNVDSIRRIATRFSFLARDADPATQVREIMNALIGSIRTTS 180 Query: 201 RS 202 S Sbjct: 181 TS 182 >UniRef50_UPI00016E8A41 UPI00016E8A41 related cluster n=3 Tax=Takifugu rubripes RepID=UPI00016E8A41 Length = 945 Score = 116 bits (290), Expect = 7e-25, Method: Composition-based stats. Identities = 38/174 (21%), Positives = 64/174 (36%), Gaps = 12/174 (6%) Query: 7 FATSDFASNPEPRCPCILLLDVSGSMNGRPINELNAGLVTFRDELLADPLALKRVELGIV 66 FA D + P+ + ++D S SM G+ I + L T +L V Sbjct: 288 FAPKDLPAVPK---NVVFVIDTSASMLGKKIRQTKEALFTILGDLRPGDHFNFISFSSRV 344 Query: 67 TFGPVHVEQPFTS----AANFFPPILFAQGDTPMGAAITKALDMVEERKREYRANGISYY 122 P T A F +L G T + +AI ++++ A+ S Sbjct: 345 KVWQPGRLVPVTPNNVRDAKKFIFMLPTSGGTNINSAIQTGSSLLQDYLSAQDASPNSV- 403 Query: 123 RPWIFLITDGAPTDEWQAAANKV--FRGEEDKRFAFFSIGVQ-GADMKTLAQIS 173 I +TDG PT + + R +F F+IG+ D + L +++ Sbjct: 404 -SLIIFLTDGQPTVGEVQSVTILGNTRSAVQGKFCIFTIGIGNDVDYRLLERMA 456 >UniRef50_C9RK46 von Willebrand factor type A n=1 Tax=Fibrobacter succinogenes subsp. succinogenes S85 RepID=C9RK46_FIBSS Length = 236 Score = 116 bits (290), Expect = 7e-25, Method: Composition-based stats. Identities = 50/211 (23%), Positives = 89/211 (42%), Gaps = 22/211 (10%) Query: 17 EPRCPCILLLDVSGSMNG-RPINELNAGLVTFRDELLADPLALKR--VELGIVTFGP--V 71 P I+L DVSGSMN ++ L L + + + I+TFG Sbjct: 12 SRPLPVIILADVSGSMNEIGKLDSLKHALNNMISSFKDASSSSLEAEIYVSIITFGNQAA 71 Query: 72 HVEQPFTSAANFF--------PPILFAQGDTPMGAAITKALDMVEERKREYRANGISYYR 123 ++ SA+ + A G+TP+G A+T +D++E R+ YR Sbjct: 72 NIILEPQSASEIANDPSKMNVINKMQAIGNTPLGKALTSLVDLLENREI----YPSRAYR 127 Query: 124 PWIFLITDGAPTDEWQAAANKVFRGEEDKRFAFFSIGVQ-GADMKTLAQISVRQPLPL-- 180 P+I L +DG P D WQ +++ E K+ ++ + AD L + + +P+ Sbjct: 128 PFIVLASDGMPNDLWQQPLDRLLNSERSKKANRLALAIGADADESMLKKFVNNEEMPIFK 187 Query: 181 --QGLQFRELFSWLSSSLRSVSRSTPGTEVV 209 ++ ++ F ++ S S+S E+ Sbjct: 188 ANNAIEIQKFFKCVTMSAIKSSQSAKPGEIA 218 >UniRef50_B5W7H4 von Willebrand factor type A n=1 Tax=Arthrospira maxima CS-328 RepID=B5W7H4_SPIMA Length = 488 Score = 115 bits (288), Expect = 1e-24, Method: Composition-based stats. Identities = 38/179 (21%), Positives = 60/179 (33%), Gaps = 23/179 (12%) Query: 18 PRCPCILLLDVSGSMNGRPINELNAGLVTFRDELLADPLALKRVELGIVTFGP-VHVEQP 76 +LL+D SGSM+G+ + E+ F LKR +L +V F V Sbjct: 49 KPKAVVLLIDTSGSMSGQKLREVQTAASEFVS-----RQNLKRHDLAVVEFSSRASVVAD 103 Query: 77 FT---SAANFFPPILFAQGDTPMGAAITKALDMVEERKREYRANGISYYRPWIFLITDGA 133 FT + L A+G T + A +++ P I L TDG Sbjct: 104 FTRNETELQQAIARLSARGGTNLSEGFNLATSVLQNS----------DRTPNILLFTDGV 153 Query: 134 PTDEWQAAANKVFRGEEDKRFAFFSIGVQGADMKTLAQISVRQPLPLQGL--QFRELFS 190 P + AA + + ++G A + L ++ L F Sbjct: 154 PNNPPMAA--SIAQQIRASGINLVAVGTGDAQINYLTALTGDPDLVFYANFGDLDRAFR 210 >UniRef50_Q6UXX5 Inter-alpha-trypsin inhibitor heavy chain H5-like protein n=3 Tax=Eutheria RepID=ITH5L_HUMAN Length = 1313 Score = 115 bits (288), Expect = 1e-24, Method: Composition-based stats. Identities = 36/200 (18%), Positives = 65/200 (32%), Gaps = 27/200 (13%) Query: 16 PEPRCPCILLLDVSGSMNGRPINELNAGLVTFRDELLADPLALKRVELGIVTFGPVHVEQ 75 P + ++DVS SM G + + + +L A+ I++F Sbjct: 278 PPMEKNVVFVIDVSSSMFGTKMEQTKTAMNVILSDLQAND------YFNIISFSDTVNVW 331 Query: 76 PFTSAANFFPPI----------LFAQGDTPMGAAITKALDMVEERKREYRANGISYYRPW 125 + + A G T + +A+ A ++ +E P Sbjct: 332 KAGGSIQATIQNVHSAKDYLHCMEADGWTDVNSALLAAASVLNHSNQEPGRGPSVGRIPL 391 Query: 126 IFLITDGAPTDEWQAAANKV--FRGEEDKRFAFFSIGVQ-GADMKTLAQISVR------- 175 I +TDG PT + + R R + FS+ AD L ++S+ Sbjct: 392 IIFLTDGEPTAGVTTPSVILSNVRQALGHRVSLFSLAFGDDADFTLLRRLSLENRGIARR 451 Query: 176 -QPLPLQGLQFRELFSWLSS 194 LQ + L+ +S Sbjct: 452 IYEDTDAALQLKGLYEEISM 471 >UniRef50_D0LTR8 von Willebrand factor type A n=1 Tax=Haliangium ochraceum DSM 14365 RepID=D0LTR8_HALO1 Length = 903 Score = 115 bits (287), Expect = 1e-24, Method: Composition-based stats. Identities = 33/206 (16%), Positives = 61/206 (29%), Gaps = 25/206 (12%) Query: 5 ITFATSDFASNPEPRCPCILLLDVSGSMNGRPINELNAGLVTFRDELLADPLALKRVELG 64 + +P L++D SGSM+G I + L L + Sbjct: 443 MPVRFDSEKQREQPHVAIALVVDRSGSMSGLKIEAAKESARATAEVLSPSDL------IT 496 Query: 65 IVTFGPVH-VEQPFTSAAN-----FFPPILFAQGDTPMGAAITKALDMVEERKREYRANG 118 +V F A+N L A G T + A+ +A ++++ + + Sbjct: 497 VVAFDNQPTTIVRLQRASNRMRIATDIARLQAGGGTNIYPALREAYEILQGANAKVKH-- 554 Query: 119 ISYYRPWIFLITDGAPTDEWQAAANKVFRGEEDKRFAFFSIGVQGADMKTLAQISVRQPL 178 + +++DG + + R ++G+ AD L I+ Sbjct: 555 -------VIVLSDGQA---PYDGIADLCQEMRSARITVSAVGIGDADRNLLNLITDNGDG 604 Query: 179 PLQ-GLQFRELFSWLSSSLRSVSRST 203 L L RS Sbjct: 605 RLYMTDDLAALPRIFMKETTEAQRSA 630 >UniRef50_UPI00017B0D26 UPI00017B0D26 related cluster n=2 Tax=Tetraodon nigroviridis RepID=UPI00017B0D26 Length = 856 Score = 114 bits (285), Expect = 2e-24, Method: Composition-based stats. Identities = 36/174 (20%), Positives = 62/174 (35%), Gaps = 12/174 (6%) Query: 7 FATSDFASNPEPRCPCILLLDVSGSMNGRPINELNAGLVTFRDELLADPLALKRVELGIV 66 FA D + P+ + ++D S SM G+ + + L+T +L + Sbjct: 217 FAPKDLPAVPK---NVVFVIDTSASMLGKKMRQTKEALLTILGDLRPADRFNFISFSSRI 273 Query: 67 TFGPVHVEQPFTSA----ANFFPPILFAQGDTPMGAAITKALDMVEERKREYRANGISYY 122 P T + A F +L G T + AI ++ + A S Sbjct: 274 RVWQPGRLVPATPSAVRDAKKFVVMLPTSGGTDIDGAIQTGSSLLRDHLSGRDAGPNSV- 332 Query: 123 RPWIFLITDGAPTDEWQAAANKV--FRGEEDKRFAFFSIGVQ-GADMKTLAQIS 173 I +TDG PT + R +F F+IG+ D + L +++ Sbjct: 333 -SLIIFLTDGQPTVGEVRPGAILGNARAAVRDKFCIFTIGMGDDVDYRLLERMA 385 >UniRef50_Q4SBF6 Chromosome 11 SCAF14674, whole genome shotgun sequence. (Fragment) n=16 Tax=Euteleostomi RepID=Q4SBF6_TETNG Length = 1039 Score = 114 bits (285), Expect = 2e-24, Method: Composition-based stats. Identities = 39/199 (19%), Positives = 74/199 (37%), Gaps = 18/199 (9%) Query: 12 FASNPEPRCP--CILLLDVSGSMNGRPINELNAGLVTFRDELLADPLALKRVELGIVTFG 69 FA PR P + ++D+SGSM+G + + ++ ++L + + + F Sbjct: 423 FAPKDLPRLPKNVVFVIDMSGSMSGTKMQQTREAMLKILEDLDPEDHFGIILFDHRIQFW 482 Query: 70 PV---HVEQPFTSAANFFPPILFAQGDTPMGAAITKALDMVEERKREYRANGISYYRPWI 126 + A + + + G T + A + KA+DM++E ++ R S I Sbjct: 483 NTSLSKATKENIDEAMVYVKAIQSYGGTDINAPVLKAVDMLKEDRKAKRLPEKSID--MI 540 Query: 127 FLITDGAPTDEWQ--AAANKVFRGEEDKRFAFFSIGVQ-GADMKTLAQISVR-------- 175 L+TDG P + + + + FS+G L +S Sbjct: 541 ILLTDGDPNSGESRIPVIQENVKAAIGGQMSLFSLGFGNDVKYPFLDVMSRENNGLARRI 600 Query: 176 QPLPLQGLQFRELFSWLSS 194 LQ + + +SS Sbjct: 601 YEGSDAALQLQGFYDEVSS 619 Score = 46.9 bits (110), Expect = 5e-04, Method: Composition-based stats. Identities = 14/63 (22%), Positives = 26/63 (41%), Gaps = 10/63 (15%) Query: 12 FASNPEPRCP--CILLLDVSGSMNGRPINE-LNAGLVTFRDELLADPLALKRVELGIVTF 68 FA PR P + ++D+SGSM+G + + + + + A ++F Sbjct: 323 FAPKDLPRLPKNVVFVIDMSGSMSGTKMQQEAHRAARSLQKRSTDGGTAR-------ISF 375 Query: 69 GPV 71 P Sbjct: 376 SPT 378 >UniRef50_B9XQ17 Vault protein inter-alpha-trypsin domain protein n=1 Tax=bacterium Ellin514 RepID=B9XQ17_9BACT Length = 806 Score = 113 bits (284), Expect = 3e-24, Method: Composition-based stats. Identities = 35/189 (18%), Positives = 60/189 (31%), Gaps = 28/189 (14%) Query: 5 ITFATSDFASNPEPRCPCILLLDVSGSMNGRPINELNAGLVTFRDELLADPLALKRVELG 64 + D + + +LD SGSM+G+ + + L + L Sbjct: 297 LASPGVDAKAKQIVSKDVVFVLDTSGSMSGKKMEQAKKALQFCVESLNDGD------RFE 350 Query: 65 IVTFGP---------VHVEQPFTSAANFFPPILFAQGDTPMGAAITKALDMVEERKREYR 115 I+ F V + A F L A G T + A+ KAL + + R + Sbjct: 351 IIRFSTESEPLFDKLAAVSKENREKAGDFIKNLKAMGGTAIDEALKKALSLESKEGRPF- 409 Query: 116 ANGISYYRPWIFLITDGAPTDEWQAAANKVFRGEEDK--RFAFFSIGVQ-GADMKTLAQI 172 + +TDG PT + +E + F G+ + L +I Sbjct: 410 ---------VVVFLTDGLPTVGTTDEDQILKGMQERNKEKRRIFCFGIGTDVNTHLLDRI 460 Query: 173 SVRQPLPLQ 181 + Q Sbjct: 461 AEETRAFSQ 469 >UniRef50_A8SDD9 Putative uncharacterized protein n=2 Tax=Ruminococcaceae RepID=A8SDD9_9FIRM Length = 247 Score = 113 bits (282), Expect = 6e-24, Method: Composition-based stats. Identities = 42/199 (21%), Positives = 70/199 (35%), Gaps = 15/199 (7%) Query: 22 CILLLDVSGSMNGRPINELNAGLVTFRDELLADPLALKRV---ELGIVTFGP-VHVEQP- 76 ++D SGSM G I +N+ + L + A ++ I+ F P Sbjct: 3 LFYVVDTSGSMCGSKIGSVNSAMEEAITSDLPEISAANDDAEIKVAIMQFSSGCSWITPQ 62 Query: 77 --FTSAANFFPPILFAQGDTPMGAAITKALDMVEERKREYRANGISYYRPWIFLITDGAP 134 + + L A G T +GAA + + + EY + Y P I L +DG P Sbjct: 63 SGPIAIGDVIWNDLNAGGVTDLGAACKELDKKL--SRNEYLNSQTGAYAPVILLFSDGGP 120 Query: 135 TDEWQAAANKVFRGEEDKRFAFFSIGVQ-GADMKTLAQISVRQPLPLQGLQ---FRELFS 190 TD W+ ++ K +I + AD LA+ + + + L Sbjct: 121 TDNWEKELKQLKLNNWFKHAIKIAIAIGDDADKTVLAEFTGTIESVITVNDKHTLKALIR 180 Query: 191 WLS--SSLRSVSRSTPGTE 207 +S +S G Sbjct: 181 KVSVRASEFQSHSKQSGDT 199 >UniRef50_C7N2G1 Uncharacterized protein n=2 Tax=Slackia heliotrinireducens DSM 20476 RepID=C7N2G1_SLAHD Length = 272 Score = 113 bits (282), Expect = 6e-24, Method: Composition-based stats. Identities = 47/236 (19%), Positives = 86/236 (36%), Gaps = 29/236 (12%) Query: 6 TFATSDFA-SNPEPRCPCILLLDVSGSMN--GRPINELNAGLVTFRDELLADPLALKR-- 60 ++ + P I ++D SGSMN G I+ +N + D L Sbjct: 9 PMPNIEYTMAKARKLLPIIYVIDTSGSMNFYG-RISAVNRAMNETLDVLGDVAAKNPTAD 67 Query: 61 VELGIVTFGPVHVEQ---------PFTSAANFFPPILFAQGDTPMGAAITKALDMVEERK 111 V++ ++ F +F+ L A+G T +GAA+ + + + + Sbjct: 68 VKVAVLGFSTGAEWITTDPVTGKPALMDLEDFYWDKLKARGSTDLGAALIELGEQL--TR 125 Query: 112 REYRANGISYYRPWIFLITDGAPTDEWQAAANKVFRGEEDKRF-AFFSIGVQG-ADMKTL 169 + + P I ++DG PTD+W++A KV R ++ V AD + L Sbjct: 126 DAMLVSETGFKVPVIIFMSDGGPTDDWESAFEKVCANNRWVRAATKIALAVGDNADREVL 185 Query: 170 AQISVRQPLPL----QGLQFRELFSWLSSSL------RSVSRSTPGTEVVLEAPKG 215 A+I+ P + ++L +S + S S V + Sbjct: 186 ARIADGNPEAVVPVSDSATLQKLIKVVSVTASMINGRSRTSDSNQNDIVNAVRTEM 241 >UniRef50_Q22SJ4 von Willebrand factor type A domain containing protein n=8 Tax=Tetrahymena thermophila RepID=Q22SJ4_TETTH Length = 646 Score = 113 bits (282), Expect = 6e-24, Method: Composition-based stats. Identities = 33/192 (17%), Positives = 62/192 (32%), Gaps = 28/192 (14%) Query: 13 ASNPEPRCPCILLLDVSGSMNGRPINELNAGLVTFRDELLADPLALKRVELGIVTFGPVH 72 P + ++D SGSM G I + L+ D L ++ L ++ F Sbjct: 201 VEQSRPSIDLVCVIDNSGSMQGEKIQNVKTTLLQLLDMLNSND------RLSLILFNSYP 254 Query: 73 VEQ--------PFTSAANFFPPILFAQGDTPMGAAITKALDMVEERKREYRANGISYYRP 124 T + A G T + + + A +++++R+ + Sbjct: 255 TLLCNLRKVDDENTPNIQSIINSITADGGTDINSGMLMAFNILQKRQFFNPVSS------ 308 Query: 125 WIFLITDGAPTDEWQAAANKVFRGEEDKR--FAFFSIGVQ-GADMKTLAQI----SVRQP 177 IFL++DG + + + K F+ S G D + +I Sbjct: 309 -IFLLSDGQDNGADEKIKKYINSNQSLKNECFSIHSFGFGSDHDGPLMNRICQLKDGNFY 367 Query: 178 LPLQGLQFRELF 189 + Q E F Sbjct: 368 YVEKINQVDEFF 379 >UniRef50_UPI00006CDDCC von Willebrand factor type A domain containing protein n=1 Tax=Tetrahymena thermophila RepID=UPI00006CDDCC Length = 2138 Score = 112 bits (281), Expect = 8e-24, Method: Composition-based stats. Identities = 38/215 (17%), Positives = 74/215 (34%), Gaps = 36/215 (16%) Query: 19 RCPCILLLDVSGSMNGRPINELNAGLVTFRDELLADPLALKRVELGIVTFG-PVHVEQPF 77 I ++D SGSMNG+P++ L L+ D L + ++ F P Sbjct: 1445 PIDLICVIDTSGSMNGQPLDLLKETLLFLVDLLQTGD------RICLIQFSTNAQRLTPL 1498 Query: 78 TSAANF--------FPPILFAQGDTPMGAAITKALDMVEERKREYRANGISYYRPWIFLI 129 S + L A+G T + + A D++++R+ + +FL+ Sbjct: 1499 LSIESKDNIKSIKNEINRLVAKGGTNICQGMQLAFDVLKQRRYKNPITS-------VFLL 1551 Query: 130 TDGAPTDEWQAAANKV------FRGEEDKRFAFFSIGVQ-GADMKTLAQIS----VRQPL 178 +DG D + + ++ ++ F + G D + +IS Sbjct: 1552 SDGL-NDGAENKIRDLLKQLNFYQNYNEENFTIQTFGFGKDHDPNLMDKISQLMDGNFYY 1610 Query: 179 PLQGLQFRE-LFSWLSSSLRSVSRST-PGTEVVLE 211 + E L +S++ +V E Sbjct: 1611 IGDIHRIDECFIDALGGLFSVISQNVSINVQVPQE 1645 >UniRef50_B2UUD5 Phage/colicin/tellurite resistance cluster terY protein n=5 Tax=Helicobacter pylori RepID=B2UUD5_HELPS Length = 217 Score = 112 bits (280), Expect = 9e-24, Method: Composition-based stats. Identities = 46/202 (22%), Positives = 75/202 (37%), Gaps = 15/202 (7%) Query: 16 PEPRCPCILLLDVSGSMNGR-----PINELNAGLVTFRDELLADPLALKRVELGIVTFG- 69 E P LLLD SGSMN I LN + + L + ++ I+TFG Sbjct: 11 EERFIPVFLLLDTSGSMNESLGNCTRIEALNLCIQKMIETLKQEAKKELFSKMAIITFGE 70 Query: 70 -PVHVEQPFTSAANFFPPILFAQGDTPMGAAITKALDMVEERKREYRANGISYYRPWIFL 128 + PF N L A G TP+ A A D++E++ +Y+ + L Sbjct: 71 NGAVLHTPFDDVKNINFKPLSASGGTPLDQAFRLAKDLIEDK----DTFPTKFYKLYSIL 126 Query: 129 ITDGAPTDE-WQAAANKVFRGEEDKRFAFFSIGVQG--ADMKTLAQIS-VRQPLPLQGLQ 184 ++DG P D+ WQ A + + +SI + + + + Sbjct: 127 VSDGEPNDDKWQKALSNFHHDGRSAKSVCWSIFIGDRNTNPQVNKDFGKDGVFYADDVEK 186 Query: 185 FRELFSWLSSSLRSVSRSTPGT 206 LF ++ ++ S S Sbjct: 187 LVGLFEIMTQTISKGSTSIKDD 208 >UniRef50_Q22ST4 von Willebrand factor type A domain containing protein n=1 Tax=Tetrahymena thermophila SB210 RepID=Q22ST4_TETTH Length = 648 Score = 112 bits (280), Expect = 1e-23, Method: Composition-based stats. Identities = 39/212 (18%), Positives = 72/212 (33%), Gaps = 28/212 (13%) Query: 17 EPRCPCILLLDVSGSMNGRPINELNAGLVTFRDELLADPLALKRVELGIVTFG-PVHVEQ 75 ++++D SGSM G I + LV + + + + IV F Sbjct: 219 RQTVDLVVVIDKSGSMEGEKIQLVKETLVKIINLMSSMD------RICIVCFNESGDRPL 272 Query: 76 PFTSAANFFPPIL-------FAQGDTPMGAAITKALDMVEERKREYRANGISYYRPWIFL 128 FT + L +A G T + I AL ++ RK + I L Sbjct: 273 TFTRVTDENKQTLLNLIQQIYAGGGTNISEGINHALKAIQNRKFKNNVTS-------ILL 325 Query: 129 ITDGAPTDEWQAAANKVFRGEEDKRFAFFSIGVQ-GADMKTLAQIS---VRQPLPLQGLQ 184 ++DG T + + + + F +IG D K L +S +Q + Sbjct: 326 LSDGQDTKAYTRVKAYIDKYQIKDAFNIETIGFGEDHDPKLLRTLSDLRNGTFNFMQDVN 385 Query: 185 F--RELFSWLSSSLRSVSRSTPGTEVVLEAPK 214 + + + + +V++ V P+ Sbjct: 386 YLDTAFINIFAGMISTVAQ-NIKVGVKFTPPE 416 >UniRef50_C5VFZ9 von Willebrand factor type A n=2 Tax=Corynebacterium matruchotii RepID=C5VFZ9_9CORY Length = 236 Score = 111 bits (278), Expect = 2e-23, Method: Composition-based stats. Identities = 46/213 (21%), Positives = 81/213 (38%), Gaps = 20/213 (9%) Query: 20 CPCILLLDVSGSM------NGRPINELNAGLVTFRDELLADPLALKRVELGIVTF-GPVH 72 P L+DVS SM G ++ N + + + +R+ LG++ F Sbjct: 9 LPVFFLIDVSYSMLEEKPGGGTLLDAANQLVPGIVEACEKYSVLDQRLRLGLIEFCDEAR 68 Query: 73 VEQPFTSAANF--FPPILFAQGDTPMGAAITKALDMVEERKREYRANGISYYRPWIFLIT 130 V P + F P L A+G T AA + + R I +RP +F IT Sbjct: 69 VVIPLSEIDAFSENIPQLVAKGGTNFAAAFWAVFNEMGVAVESLRKPEIGIHRPTVFFIT 128 Query: 131 DGAPTDEWQAAANKVFRGEEDK---RFAFFSIGVQGADMKTLAQISVRQPLPLQGLQ--- 184 DG + + A ++ R FF+ GV A+++ + + Sbjct: 129 DGEDIGDVEERARAWAALSDEGFRYRPNFFTFGVGNANLEGIRAFKLGSGFAAATKDPTR 188 Query: 185 ----FRELFSWLSSSLRSVSRS-TPGTEVVLEA 212 +E+ + L SS+ S S P +++++ Sbjct: 189 AVQRLQEILNTLVSSIVSSSAGDNPTGKIIVDP 221 >UniRef50_Q14624 35 kDa inter-alpha-trypsin inhibitor heavy chain H4 n=38 Tax=Eutheria RepID=ITIH4_HUMAN Length = 930 Score = 111 bits (278), Expect = 2e-23, Method: Composition-based stats. Identities = 41/207 (19%), Positives = 76/207 (36%), Gaps = 31/207 (14%) Query: 7 FATSDFASNPEPRCPCILLLDVSGSMNGRPINELNAGLVTFRDELLADPLALKRVELGIV 66 FA + P+ + ++D SGSM+GR I + L+ D+L R + ++ Sbjct: 263 FAPEGLTTMPK---NVVFVIDKSGSMSGRKIQQTREALIKILDDLSP------RDQFNLI 313 Query: 67 TFGPVHV-----EQPFT----SAANFFPPILFAQGDTPMGAAITKALDMVEERKREYRAN 117 F P + + A F + A G T + A+ A+ +++ +E R Sbjct: 314 VFSTEATQWRPSLVPASAENVNKARSFAAGIQALGGTNINDAMLMAVQLLDSSNQEERLP 373 Query: 118 GISYYRPWIFLITDGAPTDEWQAA--ANKVFRGEEDKRFAFFSIGVQ-GADMKTLAQISV 174 S I L+TDG PT R R++ F +G L ++++ Sbjct: 374 EGSV--SLIILLTDGDPTVGETNPRSIQNNVREAVSGRYSLFCLGFGFDVSYAFLEKLAL 431 Query: 175 R--------QPLPLQGLQFRELFSWLS 193 LQ ++ + ++ Sbjct: 432 DNGGLARRIHEDSDSALQLQDFYQEVA 458 >UniRef50_P79263 Inter-alpha-trypsin inhibitor heavy chain H4 n=3 Tax=Theria RepID=ITIH4_PIG Length = 921 Score = 111 bits (278), Expect = 2e-23, Method: Composition-based stats. Identities = 41/193 (21%), Positives = 70/193 (36%), Gaps = 26/193 (13%) Query: 19 RCPCILLLDVSGSMNGRPINELNAGLVTFRDELLADPLALKRVELGIVTFGP-------V 71 I ++D SGSM GR I + L+ +L + R + +V+F V Sbjct: 270 PKNVIFVIDTSGSMRGRKIQQTREALIKILGDLGS------RDQFNLVSFSGEAPRRRAV 323 Query: 72 HVEQPFTSAANFFPPILFAQGDTPMGAAITKALDMVEERKREYRANGISYYRPWIFLITD 131 A + + AQG T + A+ A+ ++E RE S +I L+TD Sbjct: 324 AASAENVEEAKSYAAEIHAQGGTNINDAMLMAVQLLERANREELLPARSV--TFIILLTD 381 Query: 132 GAPTDEWQAA--ANKVFRGEEDKRFAFFSIGVQ-GADMKTLAQISVR--------QPLPL 180 G PT K R D + + F +G L ++++ Sbjct: 382 GDPTVGETNPSKIQKNVREAIDGQHSLFCLGFGFDVPYAFLEKMALENGGLARRIYEDSD 441 Query: 181 QGLQFRELFSWLS 193 LQ + + ++ Sbjct: 442 SALQLEDFYQEVA 454 >UniRef50_Q10Z89 von Willebrand factor, type A n=1 Tax=Trichodesmium erythraeum IMS101 RepID=Q10Z89_TRIEI Length = 477 Score = 111 bits (278), Expect = 2e-23, Method: Composition-based stats. Identities = 35/195 (17%), Positives = 57/195 (29%), Gaps = 24/195 (12%) Query: 12 FASNPEPRCPCILLLDVSGSMNGRPINELNAGLVTFRDELLADPLALKRVELGIVTFGPV 71 P +LL+D S SM G + E+ A F + L L IV F Sbjct: 38 LTRRPPQPQTVVLLIDTSSSMWGGKLPEVQAAATGFV-----ERQNLTVNNLAIVEFSSN 92 Query: 72 HVEQPFTSAA----NFFPPILFAQGDTPMGAAITKALDMVEERKREYRANGISYYRPWIF 127 A L G T + + ++ P I Sbjct: 93 SQVLTNFDADKTELKQAIANLTPSGGTNLSQGLKTVASLLRNSN-----------TPNIL 141 Query: 128 LITDGAPTDEWQAAANKVFRGEEDKRFAFFSIGVQGADMKTLAQISVRQP--LPLQGLQF 185 L TDG P D A+ + R + ++G A+ L ++ + Sbjct: 142 LFTDGQPNDP--RASKSIAREIREAGINLVTVGTGDANSNYLTSLTENPDLVFFANSGEI 199 Query: 186 RELFSWLSSSLRSVS 200 + F ++ +S Sbjct: 200 DQAFRAAEKAISQLS 214 >UniRef50_UPI0001C38CFE von Willebrand factor, type A n=1 Tax=Arthrospira platensis str. Paraca RepID=UPI0001C38CFE Length = 489 Score = 111 bits (278), Expect = 2e-23, Method: Composition-based stats. Identities = 38/188 (20%), Positives = 61/188 (32%), Gaps = 26/188 (13%) Query: 9 TSDFASNPEPRCPCILLLDVSGSMNGRPINELNAGLVTFRDELLADPLALKRVELGIVTF 68 F + P+ ++L+D SGSM+G + E+ F LKR +L +V F Sbjct: 44 PPGFLTKPQ---AVVMLIDTSGSMSGSKLPEVQRAASEFVS-----RQNLKRDDLAVVEF 95 Query: 69 GP-VHVEQPFTSAAN---FFPPILFAQGDTPMGAAITKALDMVEERKREYRANGISYYRP 124 V FT L A G T + A +++ R Sbjct: 96 SSRASVVADFTRDERELQQAIARLSAWGGTNLSEGFNLATSVLQNSDRPGN--------- 146 Query: 125 WIFLITDGAPTDEWQAAANKVFRGEEDKRFAFFSIGVQGADMKTLAQISVRQPLPLQGL- 183 I L TDG P + + A + + ++G A + L ++ L Sbjct: 147 -ILLFTDGEPNN--RRMAASIAQQIRASGINLVAVGTGDAPVNYLTALTGDPDLVFYANF 203 Query: 184 -QFRELFS 190 F Sbjct: 204 GDLDSAFR 211 >UniRef50_UPI00006A1B4A Collagen alpha-3(VI) chain precursor. n=5 Tax=Xenopus (Silurana) tropicalis RepID=UPI00006A1B4A Length = 2535 Score = 111 bits (277), Expect = 2e-23, Method: Composition-based stats. Identities = 34/176 (19%), Positives = 60/176 (34%), Gaps = 18/176 (10%) Query: 20 CPCILLLDVSGSMNGRPINELNAGLVTFRDELLADPLALKRVELGIVTFGPVHVEQPFTS 79 L+D SGS+ ++ ++ + RV G+V + V + F S Sbjct: 795 ADIYFLIDGSGSIYPEDFEDMKKFMIELISMFQ---VGANRVRFGVVQYSDVRRTEFFIS 851 Query: 80 -------AANFFPPILFAQGDTPMGAAITKALDMVEERKREYRANGISYYRPWIFLITDG 132 + I G T G A+T + + + R + + + +ITDG Sbjct: 852 EHNTQKMLKDAISQIEQLGGGTLTGEALTS-MKQLFVNAAKDRPHKVP---QSLVVITDG 907 Query: 133 APTDEWQAAANKVFRGEEDKRFAFFSIGVQGADMKTLAQISVRQPLPLQGLQFREL 188 D AA ++ F+IGV+ A + + I+ F L Sbjct: 908 ESQDRVTEAAAEIRND----GITIFAIGVKNAVEEEIRDIAGSNEKMFFVNNFDSL 959 Score = 101 bits (253), Expect = 1e-20, Method: Composition-based stats. Identities = 36/177 (20%), Positives = 61/177 (34%), Gaps = 19/177 (10%) Query: 19 RCPCILLLDVSGSMNGRPINELNAGLVTFRDELLADPLALKRVELGIVTFGP-VHVEQPF 77 + + L+D S S+N + + + + RV++G++ F E P Sbjct: 996 KADIVFLVDSSASINSDDYETMKEFMESMV---KQAEIGPDRVQIGLIQFSSETKEEFPL 1052 Query: 78 TSAANFFPPILFAQG------DTPMGAAITKALDMVEERKREYRANGISYYRPWIFLITD 131 +G T MG A+ L K G + ++ +ITD Sbjct: 1053 NRYKRKDEIQSAIRGIQQLSQGTLMGEALKYTLPYFSASKG-----GRVNTKQYLIVITD 1107 Query: 132 GAPTDEWQAAANKVFRGEEDKRFAFFSIGVQGADMKTLAQISVRQPLPLQGLQFREL 188 G D A + D ++IGVQ A+ L +I+ +Q F L Sbjct: 1108 GEAQDAVGNPAKAIR----DHGVIIYAIGVQQANNTQLLEIAGKQEQVYYEDSFDSL 1160 Score = 90.8 bits (224), Expect = 3e-17, Method: Composition-based stats. Identities = 33/196 (16%), Positives = 67/196 (34%), Gaps = 19/196 (9%) Query: 20 CPCILLLDVSGSMNGRPINELNAGLVTFRDELLADPLALKRVELGIVTFGPVHVEQPFTS 79 + L+D S S+ + L T + + L ++++G++ + + F + Sbjct: 1 ADIVFLMDGSWSIGTENFITMKNFLYTLVNGF---DVGLDKIQIGLIQYSDNARTEFFLN 57 Query: 80 AANFFPPIL-------FAQGDTPMGAAITKALDMVEERKREYRANGISYYRPWIFLITDG 132 + + +L + G T G ++ L RA +ITDG Sbjct: 58 SYSNKEDVLKYIQNLKYKGGGTKTGLSLEFMLTQHFSEAAGSRAA--EGVPQIAVVITDG 115 Query: 133 APTDEWQAAANKVFRGEEDKRFAFFSIGVQGADMKTLAQISVRQP--LPLQGLQFRELFS 190 D + A V ++IG++ A + L +I+ F L Sbjct: 116 QAQDSIREPAIAVKNA----GIILYAIGIKDAVLSELNEIASDPDDKHVYSVADFNAL-Q 170 Query: 191 WLSSSLRSVSRSTPGT 206 +S ++ V +T Sbjct: 171 SISQNMIQVLCTTVEE 186 Score = 87.7 bits (216), Expect = 2e-16, Method: Composition-based stats. Identities = 36/178 (20%), Positives = 62/178 (34%), Gaps = 5/178 (2%) Query: 13 ASNPEPRCPCILLLDVSGSMNGRPINELNAGLVTFRDELLADPLALKRVELGIVTFGPVH 72 A I LLD S S+ + + + + V+ G V +G Sbjct: 1197 ACKKTEVADIIFLLDASASITRGEFRLMQRFVEAVVN---DSLVGKDNVQFGAVVYGTNP 1253 Query: 73 VEQPFTSAANFFPPILFAQGDTPMGAAITKALDMVEERKREYRAN--GISYYRPWIFLIT 130 EQ + + IL A P + T +E + + + G + L+T Sbjct: 1254 AEQFSLNTYSTKLDILKAVFSLPQVSGYTYTAKALEYTRIRFGTSYGGRPGISHILILVT 1313 Query: 131 DGAPTDEWQAAANKVFRGEEDKRFAFFSIGVQGADMKTLAQISVRQPLPLQGLQFREL 188 DGA T+ + V + +D F++GV A + L QI+ ++ L Sbjct: 1314 DGATTEADRPNLPIVSKALKDDGIIVFAVGVGKAVPQELQQIAGYPDRWFLVQNYKGL 1371 Score = 85.0 bits (209), Expect = 2e-15, Method: Composition-based stats. Identities = 28/179 (15%), Positives = 54/179 (30%), Gaps = 19/179 (10%) Query: 17 EPRCPCILLLDVSGSMNGRPINELNAGLVTFRDELLADPLALKRVELGIVTFGPVHVEQP 76 + + L+D S S+ + D ++ RV +G+ + ++ Sbjct: 1404 HEQLDLVFLIDGSASITSSNFTSAKTFMKEIVDSFT---ISENRVRIGVAQYSANPKKEF 1460 Query: 77 F-------TSAANFFPPILFAQGDTPMGAAITKALDMVEERKREYRANGISYYRPWIFLI 129 F + I + T G + + + R N Y + ++ Sbjct: 1461 FLNEYYSSSDMKKQIDSISQLKATTYTGKGLRF-VKQFFDPANGGRKNVPQY----LIVM 1515 Query: 130 TDGAPTDEWQAAANKVFRGEEDKRFAFFSIGVQGADMKTLAQISVRQPLPLQGLQFREL 188 TDG D A + FSIG+ + L I+ + F+ L Sbjct: 1516 TDGMSNDSVNEDAAALR----SSGVKIFSIGIGLRNSFELVMIAGSPKNVYEVETFQAL 1570 Score = 80.0 bits (196), Expect = 6e-14, Method: Composition-based stats. Identities = 29/197 (14%), Positives = 58/197 (29%), Gaps = 17/197 (8%) Query: 1 MSEQITFATSDFASNPEPRCPCILLLDVSGSMNGRPINELNAGLVTFRDELLADPLALKR 60 M + + + + L+D S S+ L L ++L Sbjct: 581 MLQFSPVSVVPAVCSSASVADIVFLIDESSSIGPINFQLTRVFLHKVVSAL---DISLSN 637 Query: 61 VELGIVTFGPVHVEQ-------PFTSAANFFPPILFAQGDTPMGAAITKALDMVEERKRE 113 V +G+V + + +F + + G GAA+ + ++ Sbjct: 638 VRVGLVLYSDEPRLELKLNTFNEKYEILDFITKLPYRGGKAHTGAALDFLRKKMFTKQNG 697 Query: 114 YRANGISYYRPWIFLITDGAPTDEWQAAANKVFRGEEDKRFAFFSIGVQGADMKTLAQIS 173 R + ++T+G D + A K+ R F++G Q + L I+ Sbjct: 698 GRP--HQGVQQIAVVMTNGQSMDNFTKPAAKLRR----SGVEVFAVGFQNINDTELDIIA 751 Query: 174 VRQPLPLQGLQFRELFS 190 P Sbjct: 752 SHPPRK-HVTNVESFLQ 767 Score = 70.8 bits (172), Expect = 3e-11, Method: Composition-based stats. Identities = 23/175 (13%), Positives = 59/175 (33%), Gaps = 17/175 (9%) Query: 17 EPRCPCILLLDVSGSMNGRPINELNAGLVTFRDELLADPLALKRVELGIVTFGPVHVEQP 76 + + L+D S SM ++ ++ ++L + + + +G+ + + + Sbjct: 396 KRYADVVFLVDSSTSMGTIFFQKMKDFIIHIINQLN---VGINKHRIGLAQYSGLPQTEF 452 Query: 77 FTSAANFFPPIL--------FAQGDTPMGAAITKALDMVEERKREYRANGISYYRPWIFL 128 + IL + G G A+ + R N + ++ + Sbjct: 453 LLNHYETKEEILKHIKETFTYRGGPLKTGHALEFVRSTFFIEEAGSRINYGNP--QFLVV 510 Query: 129 ITDGAPTDEWQAAANKVFRGEEDKRFAFFSIGVQGADMKTLAQISVRQPLPLQGL 183 IT + + + A + + S+G+ +D K L +I+ + Sbjct: 511 IT----SSKSEDAVRRHAEELKSVGVTTISVGIGNSDRKELEKIATDPFVFQTTG 561 Score = 54.6 bits (130), Expect = 2e-06, Method: Composition-based stats. Identities = 22/176 (12%), Positives = 55/176 (31%), Gaps = 17/176 (9%) Query: 20 CPCILLLDVSGSMNGRPINELNAGLVTFRDELLADPLALKRVELGIVTFGPVHVEQPF-- 77 +LL++ + M + L L + + ++ +G+VT+ + Sbjct: 204 ADIVLLVESTTRMGDATFEKAKNFLYDLVSNL---DVGINKIRIGLVTYNDETNPEFLLN 260 Query: 78 -----TSAANFFPPILFAQGDTPMGAAITKALDMVEERKREYRANGISYYRPWIFLITDG 132 T + + +G T G A+ + R + ++T+G Sbjct: 261 SYSSKTEILESIQNMKYVEGYTYTGRALEYVNTTYFTQAAGSRFE--ESVAQILIIVTEG 318 Query: 133 APTDEWQAAANKVFRGEEDKRFAFFSIGVQGADMKTLAQISVRQPLPLQGLQFREL 188 +D A ++ + + + +G + L + S + Q + Sbjct: 319 DSSDTLTEPAKELK----SRGISVYVVGTNIKYDRQLQEASSKPDEKFF-YQLDDF 369 >UniRef50_A7RKA1 Predicted protein n=2 Tax=Nematostella vectensis RepID=A7RKA1_NEMVE Length = 1128 Score = 110 bits (276), Expect = 3e-23, Method: Composition-based stats. Identities = 55/212 (25%), Positives = 82/212 (38%), Gaps = 34/212 (16%) Query: 13 ASNPEPRCPCILLLDVSGSMNGRPINELNAGLVTFRDELLA-DPLALKRVELGI----VT 67 A++P+P+ IL++D SGSM G + T D L D +A E G+ VT Sbjct: 205 AASPQPK-DVILVVDYSGSMGGSRLPIAKEAAKTVLDTLNPRDRVAFLAFESGVRRVKVT 263 Query: 68 FGPVHVEQPF-----------TSAANFFPPILFAQGDTPMGAAITKALDMVEERKREYRA 116 G E+ F F +A G T A A D++++ +E Sbjct: 264 SGDAKDEKCFESSLAKASPVNIDILKKFLDGEYASGGTMYAIAFNAAFDILDKYYKEKNT 323 Query: 117 NGISYYRPWIFLITDGAPTDEWQAAANKV--FRGEEDKRFAFFSIGVQG----ADMKTLA 170 RP I +TDGAP D+ N V + + G+ G A + L Sbjct: 324 TR----RPVILFMTDGAPNDDPGTILNTVKTRNQGLSTKADILTFGMGGGISPAGVDLLQ 379 Query: 171 QISVRQPLPLQGLQFRELFSW-LSSSLRSVSR 201 ++ Q L F L+++LR VSR Sbjct: 380 SLAE------QTLDGGARFEVSLTTALRDVSR 405 >UniRef50_Q503P4 Zgc:110377 n=9 Tax=Clupeocephala RepID=Q503P4_DANRE Length = 868 Score = 110 bits (275), Expect = 3e-23, Method: Composition-based stats. Identities = 41/205 (20%), Positives = 68/205 (33%), Gaps = 30/205 (14%) Query: 12 FASNPEPRCP--CILLLDVSGSMNGRPINELNAGLVTFRDELLADPLALKRVELGIVTFG 69 FA PR P + ++D S SM G + + L T EL D I+ F Sbjct: 245 FAPANLPRVPKMVVFVIDNSYSMYGNKMAQTKEALGTILGELPEDD------YFAIIVFS 298 Query: 70 PVHVE---------QPFTSAANFFPPILFAQGDTPMGAAITKALDMVEERKREYRANGIS 120 V + A + + G T + A ++M+ +R A Sbjct: 299 TTFVVWRPYLSKATEENVKEAQEYVKTIEVIGGTELHDATIHGVEMLYAAQRNGTAPKNM 358 Query: 121 YYRPWIFLITDGAPTDEWQAA--ANKVFRGEEDKRFAFFSIGVQ-GADMKTLAQISVRQP 177 + L+TDG P ++ + R D F + AD L +S + Sbjct: 359 VL--MMILLTDGQPNQYPRSLPEIQESIRKAIDGNITLFGLAFGNDADYGFLDTLSKQNN 416 Query: 178 LPLQ--------GLQFRELFSWLSS 194 ++ LQ + + +SS Sbjct: 417 GIVRRIYEDSDAPLQLKGFYEEVSS 441 >UniRef50_A3YVK5 Tellurium resistance protein n=3 Tax=Cyanobacteria RepID=A3YVK5_9SYNE Length = 260 Score = 110 bits (275), Expect = 3e-23, Method: Composition-based stats. Identities = 41/180 (22%), Positives = 65/180 (36%), Gaps = 14/180 (7%) Query: 5 ITFATSDFASNPEPRCPCILLLDVSGSMNGR-PINELNAGLVTFRDELLADPLALKRVEL 63 + F A+ P I + D SGSM + I LN + + + Sbjct: 1 MPFPNVRLANRP---LHFIYICDCSGSMAAQGKIQALNQAIRQSLPGMAEVARQNPEARV 57 Query: 64 GI--VTFGPVHV--EQPFTSAANFFPPILFAQGDTPMGAAITKALDMVEERKREYRANGI 119 + V+F + T L A G T MG A+ +++ E RA Sbjct: 58 LVRAVSFADRAAWHLEKPTEVHQLQWLDLQAGGITAMGEALELVAAVLQSPPMEERA--- 114 Query: 120 SYYRPWIFLITDGAPTDEWQAAANKVFRGEEDKRFAFFSIGVQ-GADMKTLAQISVRQPL 178 P + LI+DG PTD++ A + R ++ +I + AD + L Q P Sbjct: 115 --LPPVLVLISDGQPTDDFDAGLASLMRQPWAQKAVRLAIAMGHDADTEVLQQFIGSDPG 172 >UniRef50_A1THU3 von Willebrand factor, type A n=1 Tax=Mycobacterium vanbaalenii PYR-1 RepID=A1THU3_MYCVP Length = 248 Score = 110 bits (275), Expect = 4e-23, Method: Composition-based stats. Identities = 49/163 (30%), Positives = 72/163 (44%), Gaps = 12/163 (7%) Query: 20 CPCILLLDVSGSMNGRPINELNAGLVTFRDELLADPLALKRVELGIVTFGPVHVEQ-PFT 78 P L+ DVS SM G I LN L FRD L +P+ +V+ G++ F E P Sbjct: 19 LPFWLVCDVSASM-GPHIGTLNQSLRDFRDSLATNPVLADKVQFGVIDFSDTATEVIPLG 77 Query: 79 SAANFFPPI--LFAQGDTPMGAAITKALDMVEERKREYRANGISYYRPWIFLITDGAPTD 136 ++ L +G T G A T ++E A+ Y+RP +F +TDG PTD Sbjct: 78 DFSSADLERHQLRTRGGTSYGQAFTTVQQIIERDLA-AGADRFRYFRPAVFFLTDGQPTD 136 Query: 137 -EWQAAANKVF--RGEEDKRFAFF----SIGVQGADMKTLAQI 172 W+ A + + F + G+ AD TLA++ Sbjct: 137 RHWREAFRDLTFFDQASGQGFRSYPLFVPFGIGDADAATLAEL 179 >UniRef50_Q9LN03 T6D22.13 n=2 Tax=Arabidopsis thaliana RepID=Q9LN03_ARATH Length = 641 Score = 109 bits (272), Expect = 9e-23, Method: Composition-based stats. Identities = 33/195 (16%), Positives = 62/195 (31%), Gaps = 27/195 (13%) Query: 10 SDFASNPEPRCPCILLLDVSGSMNGRPINELNAGLVTFRDELLADPLALKRVELGIVTFG 69 SD A I +LDVSGSM+G + + + L + L +++F Sbjct: 193 SDDARRARAPLDLITVLDVSGSMDGVKMELMKNAMSFVIQNL------GETDRLSVISFS 246 Query: 70 P-VHVEQPFTSAAN-------FFPPILFAQGDTPMGAAITKALDMVEERKREYRANGISY 121 P + L A G T + + ++E R+ + +G Sbjct: 247 SMARRLFPLRLMSETGKQAAMQAVNSLVADGGTNIAEGLKIGARVIEGRRWKNPVSG--- 303 Query: 122 YRPWIFLITDGAPTDEWQAAANKVFRGEEDK-----RFAFFSIGVQ-GADMKTLAQISVR 175 + L++DG + A ++ E R + G D + + IS Sbjct: 304 ----MMLLSDGQDNFTFSHAGVRLRTDYESLLPSSCRIPIHTFGFGSDHDAELMHTISEV 359 Query: 176 QPLPLQGLQFRELFS 190 ++ + Sbjct: 360 SSGTFSFIETETVIQ 374 >UniRef50_P19823 Inter-alpha-trypsin inhibitor heavy chain H2 n=40 Tax=Euteleostomi RepID=ITIH2_HUMAN Length = 946 Score = 108 bits (271), Expect = 1e-22, Method: Composition-based stats. Identities = 39/210 (18%), Positives = 75/210 (35%), Gaps = 35/210 (16%) Query: 7 FATSDFASNPEPRCPCILLLDVSGSMNGRPINELNAGLVTFRDELLADPLALKRVELGIV 66 FA + P+ + ++DVSGSM G + + + T D+L A+ ++ Sbjct: 299 FAPDNLDPIPK---NILFVIDVSGSMWGVKMKQTVEAMKTILDDLRAED------HFSVI 349 Query: 67 TFG-----------PVHVEQPFTSAANFFPPILFAQGDTPMGAAITKALDMVEERKREYR 115 F Q + A + + G T + A+ +A+ ++ E Sbjct: 350 DFNQNIRTWRNDLISATKTQ--VADAKRYIEKIQPSGGTNINEALLRAIFILNEANNLGL 407 Query: 116 ANGISYYRPWIFLITDGAPTDEWQAA--ANKVFRGEEDKRFAFFSIGVQ-GADMKTLAQI 172 + S I L++DG PT K + + FS+G+ D L ++ Sbjct: 408 LDPNSV--SLIILVSDGDPTVGELKLSKIQKNVKENIQDNISLFSLGMGFDVDYDFLKRL 465 Query: 173 SVRQPLPLQ--------GLQFRELFSWLSS 194 S Q Q ++ ++ +S+ Sbjct: 466 SNENHGIAQRIYGNQDTSSQLKKFYNQVST 495 >UniRef50_Q86UX2 Inter-alpha-trypsin inhibitor heavy chain H5 n=40 Tax=Euteleostomi RepID=ITIH5_HUMAN Length = 942 Score = 108 bits (271), Expect = 1e-22, Method: Composition-based stats. Identities = 35/182 (19%), Positives = 59/182 (32%), Gaps = 23/182 (12%) Query: 7 FATSDFASNPEPRCPCILLLDVSGSMNGRPINELNAGLVTFRDELLADPLALKRVELGIV 66 FA D P+ + +LD S SM G + + L T +L I+ Sbjct: 284 FAPKDLPPLPK---NVVFVLDSSASMVGTKLRQTKDALFTILHDLRPQD------RFSII 334 Query: 67 TFGPV---------HVEQPFTSAANFFPPILFAQGDTPMGAAITKALDMVEERKREYRAN 117 F V + + G T + A+ +A+ ++ + Sbjct: 335 GFSNRIKVWKDHLISVTPDSIRDGKVYIHHMSPTGGTDINGALQRAIRLLNKYVAHSGIG 394 Query: 118 GISYYRPWIFLITDGAPTDEWQAAANKVFRGEEDKR--FAFFSIGVQ-GADMKTLAQISV 174 S I +TDG PT + E R F+IG+ D + L ++S+ Sbjct: 395 DRSV--SLIVFLTDGKPTVGETHTLKILNNTREAARGQVCIFTIGIGNDVDFRLLEKLSL 452 Query: 175 RQ 176 Sbjct: 453 EN 454 >UniRef50_C5WYU9 Putative uncharacterized protein Sb01g047460 n=3 Tax=Andropogoneae RepID=C5WYU9_SORBI Length = 698 Score = 108 bits (270), Expect = 1e-22, Method: Composition-based stats. Identities = 32/195 (16%), Positives = 56/195 (28%), Gaps = 31/195 (15%) Query: 14 SNPEPRCPCILLLDVSGSMNGRPINELNAGLVTFRDELLADPLALKRVELGIVTFGP-VH 72 ++ + +LDVSGSM G I L + L + L ++ F Sbjct: 234 ASSRAPLDLVTVLDVSGSMAGTKIALLKNAMSFVIQTLGPND------RLSVIAFSSTAR 287 Query: 73 VEQPFTSA-------ANFFPPILFAQGDTPMGAAITKALDMVEERKREYRANGISYYRPW 125 P A L A G T + + K ++E+R+ + Sbjct: 288 RLFPLRRMTLAGRQQALQAVSSLVASGGTNIADGLKKGAKVIEDRRLKNPVCS------- 340 Query: 126 IFLITDGA-----PTD----EWQAAANKVFRGEEDKRFAFFSIGVQ-GADMKTLAQISVR 175 I L++DG P+D ++ A + G D + I+ Sbjct: 341 IILLSDGQDTYTLPSDRNLLDYSALVPPSILPGTGHHVQIHTFGFGSDHDSAAMHAIAEI 400 Query: 176 QPLPLQGLQFRELFS 190 + Sbjct: 401 SSGTFSFIDAEGSIQ 415 >UniRef50_C3YPR2 Putative uncharacterized protein (Fragment) n=1 Tax=Branchiostoma floridae RepID=C3YPR2_BRAFL Length = 863 Score = 108 bits (270), Expect = 1e-22, Method: Composition-based stats. Identities = 32/201 (15%), Positives = 65/201 (32%), Gaps = 30/201 (14%) Query: 16 PEPRCPCILLLDVSGSMNGRPINELNAGLVTFRDELLADPLALKRVELGIVTFGPV---- 71 P + ++D SGSM G + + + T +L ++ F Sbjct: 231 PVVPKNIVFIIDKSGSMGGTKMRQTKQAMNTILKDLRDHD------RFNVMPFSYSSTMW 284 Query: 72 -------HVEQPFTSAANFFPPILFAQGDTPMGAAITKALDMVEERKREYRANGISYYRP 124 + SA + + A G T + AI A D++ R+ Sbjct: 285 RPNEMVLATRENIESARTYVRRSINAGGGTNINQAIIDAADLL--RRVTDDQPNSPRSAS 342 Query: 125 WIFLITDGAPTDEWQAAANKV--FRGEEDKRFAFFSIGVQ-GADMKTLAQISVR------ 175 I +TDG P+ N + + ++ + F +G D L ++++ Sbjct: 343 LIIFLTDGLPSVGESKPRNIMVNVKNAIREQVSLFCLGFGKDVDFPFLEKMALENRGLAR 402 Query: 176 --QPLPLQGLQFRELFSWLSS 194 LQ + + +++ Sbjct: 403 RIYEDSDAALQLKGFYDEVAT 423 >UniRef50_Q6PGW2 Zgc:112265 protein (Fragment) n=15 Tax=Clupeocephala RepID=Q6PGW2_DANRE Length = 927 Score = 108 bits (270), Expect = 1e-22, Method: Composition-based stats. Identities = 38/180 (21%), Positives = 66/180 (36%), Gaps = 16/180 (8%) Query: 7 FATSDFASNPEPRCPCILLLDVSGSMNGRPINELNAGLVTFRDELLADPLALKRVELGIV 66 FA SD P + ++D SGSM+GR I + + L+T +L D + Sbjct: 265 FAPSDV---PHIPKNVVFIIDRSGSMHGRKIRQTRSALLTILKDLDEDDHFGLITFDAEI 321 Query: 67 TFGPVHVEQ---PFTSAANFFPPILFAQGDTPMGAAITKALDMVEERKREYRANGISYYR 123 F + Q A F + +G T + A+ +DM+ R+ A+ Sbjct: 322 DFWRRELLQATKANRENAESFVKRIQDRGATNINDAVLAGVDMINRNPRKGTAS------ 375 Query: 124 PWIFLITDGAPTDEWQAA--ANKVFRGEEDKRFAFFSIGVQ-GADMKTLAQISVRQPLPL 180 + L+TDG PT + +F + +G + L ++S+ Sbjct: 376 -ILILLTDGDPTAGETNIEKIMANVKEAIGSKFPLYCLGFGYDVNFDFLTKMSLENNAVA 434 >UniRef50_UPI0001760236 PREDICTED: similar to mCG140660 n=1 Tax=Danio rerio RepID=UPI0001760236 Length = 1753 Score = 108 bits (270), Expect = 1e-22, Method: Composition-based stats. Identities = 35/202 (17%), Positives = 70/202 (34%), Gaps = 19/202 (9%) Query: 17 EPRCPCILLLDVSGSMNGRPINELNAGLVTFRDELLADPLALKRVELGIVTFGPVHVE-- 74 + + L+D S S+ ++ L + + +A +V +G+V + Sbjct: 28 KQVADIVFLVDGSASIGLDNFQQIRQFLSSLVENF---EVAPDKVRIGLVQYSDTPRTEF 84 Query: 75 -----QPFTSAANFFPPILFAQGDTPMGAAITKALDMVEERKREYRANGISYYRPWIFLI 129 Q ++ + + G T G + L + RA +I Sbjct: 85 SLNTYQNKEEILDYIRNLRYKTGGTHTGQGLEFILKQHFIEEAGSRAQQN--VPQIAIVI 142 Query: 130 TDGAPTDEWQAAANKVFRGEEDKRFAFFSIGVQGADMKTLAQISVRQP--LPLQGLQFRE 187 TDG DE A ++ + + F+IG++ AD++ L QI+ F Sbjct: 143 TDGDSQDEVDLQAQELRQ----RGIKIFAIGIKDADVRLLRQIANEPYDQYVYSVSDFAA 198 Query: 188 LFSWLSSSLRSVSRSTPGTEVV 209 L +S S+ ++ + Sbjct: 199 L-QGISQSVVRELCTSVKDVIE 219 Score = 91.2 bits (225), Expect = 2e-17, Method: Composition-based stats. Identities = 40/188 (21%), Positives = 65/188 (34%), Gaps = 24/188 (12%) Query: 20 CPCILLLDVSGSMNGRPINELNAGLVTFRDELLADPLALKRVELGIVTFGPVHVEQ---- 75 +LL+D SGS+ E+ L F D L V LG+ F ++ Sbjct: 230 ADIVLLVDSSGSIGDNDFEEVKKFLHAFVDRFN---LRPDLVRLGLAQFSDRPYQEFLLG 286 Query: 76 ---PFTSAANFFPPILFAQGDTPMGAAITKALDMVEERKREYRANGISYYRPWIFLITDG 132 +++ +G T G A+T + R +ITDG Sbjct: 287 DYADKKDLHQKLNNLIYRKGGTQTGQALTFIRENYFSLARPNVPG-------IAIVITDG 339 Query: 133 APTDEWQAAANKVFRGEEDKRFAFFSIGVQGADMKTLAQISVRQ--PLPLQGLQFRELFS 190 D+ + A ++ + + F I V +M+ L I+ ++EL Sbjct: 340 ESRDDVEEPAQRLR----NTGVSLFVIRVGKGNMEKLRAIANIPHEEFLFSINNYQEL-Q 394 Query: 191 WLSSSLRS 198 L SLRS Sbjct: 395 GLKESLRS 402 Score = 90.8 bits (224), Expect = 3e-17, Method: Composition-based stats. Identities = 34/178 (19%), Positives = 66/178 (37%), Gaps = 18/178 (10%) Query: 20 CPCILLLDVSGSMNGRPINELNAGLVTFRDELLADPLALKRVELGIVTFGPVHV------ 73 ++D S S+ R + L L + ++++ ++ + Sbjct: 611 ADIAFIVDQSSSIKSRNFQLVRDFLENTIGRL---DVGKDKIQIAVILYSDFPRADVYLN 667 Query: 74 EQPFTSAANFFPPILFAQ-GDTPMGAAITKALDMVEERKREYRANGISYYRPWIFLITDG 132 + + L G T GAA+ A + V + R R + Y + +ITDG Sbjct: 668 TFSNKNDILRYINTLPYGRGKTYTGAALRFAKEHVFTKARGSRRD--KYVQQVAVVITDG 725 Query: 133 APTDEWQAAANKVFRGEEDKRFAFFSIGVQGADMKTLAQISVRQP--LPLQGLQFREL 188 TD+ +AA ++ R + F++G++ L +I+ P L F +L Sbjct: 726 KSTDDAASAAAELRR----SGVSIFALGIKDTKEDDLREIASYPPKKFVLNVENFDQL 779 Score = 80.8 bits (198), Expect = 3e-14, Method: Composition-based stats. Identities = 29/169 (17%), Positives = 54/169 (31%), Gaps = 17/169 (10%) Query: 20 CPCILLLDVSGSMNGRPINELNAGLVTFRDELLADPLALKRVELGIVTFGPV-------H 72 + L+D S S+ + L ++ EL +A + +G+ F + + Sbjct: 1230 ADLVFLIDGSESIKPPSWDILKQTMIGIVKEL---DIAKDKWRVGVAQFSDILLHQFYLN 1286 Query: 73 VEQPFTSAANFFPPILFAQGDTPMGAAITKALDMVEERKREYRANGISYYRPWIFLITDG 132 F I + T A+ + G+ + LITDG Sbjct: 1287 TYTSFAEVEEAINNIKQRKQGTNTWDALKLIKYYFTKENGSRIEGGV---AQNLLLITDG 1343 Query: 133 APTDEWQAAANKVFRGEEDKRFAFFSIGVQGA-DMKTLAQISVRQPLPL 180 D + N + ++K+ A IG+ L +I+ L Sbjct: 1344 EAND--EKDLNAL-ADLKNKKIAITVIGIGNEIKKSELREIAGSPDRVL 1389 Score = 78.5 bits (192), Expect = 1e-13, Method: Composition-based stats. Identities = 29/183 (15%), Positives = 57/183 (31%), Gaps = 20/183 (10%) Query: 15 NPEPRCPCILLLDVSGSMNGRPINELNAGLVTFRDELLADPLALKRVELGIVTFGPVHVE 74 + LLD SGS++ ++ ++ D + V +G+V F Sbjct: 812 KQTAKADIYFLLDESGSISYPDFEDMKKFIMECLDVFQ---IGKDHVRIGVVKFASKATT 868 Query: 75 Q------PFTSAANFFPPILFA-QGDTPMGAAITKALDMVEERKREYRANGISYYRPWIF 127 S L G T + + + + E + R + Sbjct: 869 VFRLHDYSTKSDVEKAVKDLEMYGGGTRTDLGLRQMIPLFREAVQ----TRGEKARELLI 924 Query: 128 LITDGAPTDEWQAAANKVFRGEEDKRFAFFSIGVQG--ADMKTLAQISVRQPLPLQGLQF 185 +ITDG T + ++ + ++IG +G AD+ L + F Sbjct: 925 VITDGESTGTVEPVEVPAKHLRAEQNVSIYAIGCEGLLADVVFLI----DGSDSVSAEDF 980 Query: 186 REL 188 ++ Sbjct: 981 EKM 983 Score = 58.8 bits (141), Expect = 1e-07, Method: Composition-based stats. Identities = 29/205 (14%), Positives = 65/205 (31%), Gaps = 44/205 (21%) Query: 20 CPCILLLDVSGSMNGRPINELNAGLVTFRDELLADPLALKRVELGIVTFGPVHVEQPFTS 79 + L+D S S++ ++ + ++ + ++ + +V +G E+ Sbjct: 963 ADVVFLIDGSDSVSAEDFEKMKDIMEYVIEKF---AIGSEKERVAVVQYGTNPNEEF--- 1016 Query: 80 AANFFPPILFAQGDTPMGAAITKALDMVEERKREYRANGISYYRPWIFLITDGAPTDEWQ 139 + N F D + + R R +TDG D+ Sbjct: 1017 SLNAFDNK-----------------DRLLQEIRNIRQ------------VTDGESRDDVA 1047 Query: 140 AAANKVFRGEEDKRFAFFSIGVQGADMKTLAQISVRQPLPLQ---GLQFRELFSWLSSSL 196 A + D ++IG++ A+ + I+ +EL S + + Sbjct: 1048 LPAKALR----DNSINTYAIGLRHANRSQILAIAGSHGEVFYEDAVASLKELSSEVLLKI 1103 Query: 197 RSVSRSTPG--TEVVLEAPKGWTSV 219 + TP + L W ++ Sbjct: 1104 CNTECKTPELIDIIFLVDTSAWPNI 1128 Score = 56.1 bits (134), Expect = 7e-07, Method: Composition-based stats. Identities = 29/187 (15%), Positives = 61/187 (32%), Gaps = 19/187 (10%) Query: 20 CPCILLLDVSGSMNGRPINELNAGLVTFRDELLADPLALKRVELGIVTFGPVHVEQPFTS 79 +L+D S S + ++ + L +L + + +G+ F E+ + Sbjct: 420 ADLFILVDSSAS--KQELSIIKNFLQKLIGQLN---VGINGNRVGLAQFSENVKEEFLLN 474 Query: 80 AANFFPP--------ILFAQGDTPMGAAITKALDMVEERKREYRANGISYYRPWIFLITD 131 L G+ +G AI A R RA Y+ ++ +I Sbjct: 475 THRTRNEMSTSIRNLQLTPTGERRIGHAIEHARSNFFNRDAGSRAA--EGYKQFLLVIAA 532 Query: 132 GAPTDEWQAAANKVFRGEEDKRFAFFSIGVQGADMKTLAQISVRQPLPLQGLQFRELFSW 191 G D A+ K+ + F+ G+ AD + I+ + + Sbjct: 533 GESADGVIQASRKIKKDA----VTVFAAGLNRADAYEMKDIASQSHNYKLVGNIPLVQQK 588 Query: 192 LSSSLRS 198 + ++ + Sbjct: 589 IKVAVDT 595 Score = 43.4 bits (101), Expect = 0.005, Method: Composition-based stats. Identities = 16/147 (10%), Positives = 38/147 (25%), Gaps = 18/147 (12%) Query: 1 MSEQITFATSDFASNPEPRCPCILLLDVSGSMNGRPINELNAGLVTFRDELLADPLALKR 60 +S ++ + I L+D S I+ L + + ++ KR Sbjct: 1095 LSSEVLLKICNTECKTPELIDIIFLVDTS---AWPNIDGLQEMINLMTHVVRKSVVSEKR 1151 Query: 61 VELGIVTFGPVHVEQPFTSAANFFPPILF----------AQGD-----TPMGAAITKALD 105 V G +T+ + ++ A G+ + Sbjct: 1152 VRFGAITYSNSPQLEFTLQQYKSQADVMRDAEVAQLREIAGGNGRVHYASTYQGLRGLQK 1211 Query: 106 MVEERKREYRANGISYYRPWIFLITDG 132 ++ + + + DG Sbjct: 1212 IITQELCNLTKPICEMEVADLVFLIDG 1238 >UniRef50_Q231J4 von Willebrand factor type A domain containing protein n=1 Tax=Tetrahymena thermophila SB210 RepID=Q231J4_TETTH Length = 520 Score = 108 bits (269), Expect = 2e-22, Method: Composition-based stats. Identities = 37/210 (17%), Positives = 69/210 (32%), Gaps = 26/210 (12%) Query: 18 PRCPCILLLDVSGSMNGRPINELNAGLVTFRDELLADPLALKRVELGIVTFGP-VHVEQP 76 I ++D SGSM G+ I + ++ + D + +V F V Sbjct: 94 QPLDLIFVIDTSGSMQGKKIELVKKSILQVLHIIQGDD------RISLVGFNSQAKVLLE 147 Query: 77 FTSAAN-------FFPPILFAQGDTPMGAAITKALDMVEERKREYRANGISYYRPWIFLI 129 T L A G T +G + KA D+++ER IFL+ Sbjct: 148 LTQLTKNSKKKIQKTVDELQAGGGTQIGFGMQKAFDIIKERTNSKNLAS-------IFLL 200 Query: 130 TDGAPTDEWQAAANKVFRGEEDKRFAFFSIGVQ-GADMKTLAQI----SVRQPLPLQGLQ 184 +DG + + + + + + F G D TL++I Q Sbjct: 201 SDGQDNCGFSQTQHFMNQSKIEYPFCIDCFGFGDDHDSLTLSKINQLQQGTFNFIRDISQ 260 Query: 185 FRELFSWLSSSLRSVSRSTPGTEVVLEAPK 214 + F+ + + +++ V + Sbjct: 261 IDDAFTIILAGIKTFVAQNVKISVNFGNTE 290 >UniRef50_Q1D9B7 von Willebrand factor type A domain protein n=3 Tax=Cystobacterineae RepID=Q1D9B7_MYXXD Length = 476 Score = 108 bits (269), Expect = 2e-22, Method: Composition-based stats. Identities = 39/215 (18%), Positives = 68/215 (31%), Gaps = 31/215 (14%) Query: 16 PEPRCPCILLLDVSGSMNGRPINELNAGLVTFRDELLADPLALKRVELGIVTFGPVHVEQ 75 L++D SGSM+G + + L + L I+ +G Sbjct: 91 QRSPVNLALVIDRSGSMSGYKLAQAKQAARHLIGLL------NDQDRLAIIHYGSDVKSL 144 Query: 76 PFTSAANFFPPILF-------AQGDTPMGAAITKALDMVEERKREYRANGISYYRPWIFL 128 P A +F +G T +GA ++ + +R Y N + L Sbjct: 145 PSLEATAANRERMFQYVDGIWDEGGTNIGAGLSAGRYQLSTAQRTYGVNR-------LIL 197 Query: 129 ITDGAPTDEWQ--AAANKVFRGEEDKRFAFFSIGVQ-GADMKTLAQI----SVRQPLPLQ 181 ++DG PT+ ++ R +IGV + + + Sbjct: 198 MSDGQPTEGLTADEELTRMARELRATGLTLSAIGVGTDFNEDLMQAFAEYGAGAYGFLED 257 Query: 182 GLQFRELFSW-LSSSLRSVSRSTPGTEVVLEAPKG 215 Q LF L + +V+R G + P G Sbjct: 258 AAQLSTLFQKDLQQAGTTVAR---GVTMTFTLPPG 289 >UniRef50_Q55874 Uncharacterized protein sll0103 n=16 Tax=Cyanobacteria RepID=Y103_SYNY3 Length = 420 Score = 107 bits (268), Expect = 2e-22, Method: Composition-based stats. Identities = 42/209 (20%), Positives = 72/209 (34%), Gaps = 31/209 (14%) Query: 2 SEQITFATSDFASNPEPRCPC--ILLLDVSGSMNGRPINELNAGLVTFRDELLADPLALK 59 Q+ A + A + + R P L+LD SGSM+G+P+ + + + D L D Sbjct: 22 QRQLRIAVAAKADDHDRRLPLNLCLVLDHSGSMDGQPLETVKSAALGLIDRLEEDD---- 77 Query: 60 RVELGIVTFGPVHVE------QPFTSAANFFPPILFAQGDTPMGAAITKALDMVEERKRE 113 L ++ F +A L A+G T + + + + Sbjct: 78 --RLSVIAFDHRAKIVIENQQVRNGAAIAKAIERLKAEGGTAIDEGLKLGIQEAAK---- 131 Query: 114 YRANGISYYRPWIFLITDGAPTDEWQAAANKVFRGEEDKRFAFFSIGVQGA-DMKTLAQI 172 G IFL+TDG K+ D + ++G + L I Sbjct: 132 ----GKEDRVSHIFLLTDGENEHGDNDRCLKLGTVASDYKLTVHTLGFGDHWNQDVLEAI 187 Query: 173 SVRQPLPLQGLQ--------FRELFSWLS 193 + L ++ FR+LF +S Sbjct: 188 AASAQGSLSYIENPSEALHTFRQLFQRMS 216 >UniRef50_A7BNL3 Tellurium resistance protein n=1 Tax=Beggiatoa sp. SS RepID=A7BNL3_9GAMM Length = 171 Score = 107 bits (268), Expect = 2e-22, Method: Composition-based stats. Identities = 41/151 (27%), Positives = 72/151 (47%), Gaps = 5/151 (3%) Query: 56 LALKRVELGIVTF-GPVHVEQPFTSAANFFPPILFAQGDTPMGAAITKALDMVEERKREY 114 + ++ V L +TF P + F P++ A G T +G+A+ +D +E+ R+ Sbjct: 1 MGIETVYLWDITFHSTAQQVTPLSELMLFKEPLISASGATALGSALRLLMDCLEKEVRKN 60 Query: 115 RANGISYYRPWIFLITDGAPTDEWQAAANKVFRGEEDKRFAFFSIGVQGADMKTLAQISV 174 A ++P +FL+TDG PTD W++AA+K+ + F+ G AD+ L I+ Sbjct: 61 TAVQKGDWKPLVFLMTDGMPTDAWESAADKLKNQ-KSANLIAFAAG-PNADVANLKGITD 118 Query: 175 R--QPLPLQGLQFRELFSWLSSSLRSVSRST 203 + L + F W+S S+ +S Sbjct: 119 IVLKSEELSPGALKAFFQWMSQSILQTGKSV 149 >UniRef50_A2AX52 Collagen alpha-4(VI) chain n=12 Tax=Chordata RepID=CO6A4_MOUSE Length = 2309 Score = 107 bits (267), Expect = 3e-22, Method: Composition-based stats. Identities = 36/203 (17%), Positives = 68/203 (33%), Gaps = 24/203 (11%) Query: 18 PRCPCILLLDVSGSMNGRPINELNAGLVTFRDELLADPLALKRVELGIVTFGPV------ 71 + L+D SGS+ E+ + P RV G+V + Sbjct: 846 EKADIYFLIDGSGSIKPNDFIEMKDFMKEVIKMFHIGP---DRVRFGVVQYSDKIISQFF 902 Query: 72 -HVEQPFTSAANFFPPILFAQGDTPMGAAITKALDMVEERKREYRANGISYYRPWIFLIT 130 + I G T G A++K + + + R ++ +IT Sbjct: 903 LTQYASMAGLSAAIDNIQQVGGGTTTGKALSKMVPVFQNTARI-------DVARYLIVIT 955 Query: 131 DGAPTDEWQAAANKVFRGEEDKRFAFFSIGVQGADMKTLAQISVRQPLPLQGLQ-FRELF 189 DG TD AA + D ++IGV+ A+ L +I+ ++ + + + Sbjct: 956 DGQSTDPVAEAAQGLR----DIGVNIYAIGVRDANTTELEEIASKKMFFIYEFDSLKSIH 1011 Query: 190 SWLSSSLRSV--SRSTPGTEVVL 210 + + S +S + L Sbjct: 1012 QEVIRDICSSENCKSQKADIIFL 1034 Score = 104 bits (259), Expect = 3e-21, Method: Composition-based stats. Identities = 35/206 (16%), Positives = 66/206 (32%), Gaps = 20/206 (9%) Query: 13 ASNPEPRCPCILLLDVSGSMNGRPINELNAGLVTFRDELLADPLALKRVELGIVTFGPVH 72 + + I L+D S S+ + ++ + + + +++G++ F Sbjct: 1022 ENCKSQKADIIFLIDGSESIAPKDFEKMKDFMERMVN---QSNIGADEIQIGLLQFSSNP 1078 Query: 73 VEQ-------PFTSAANFFPPILFAQGDTPMGAAITKALDMVEERKREYRANGISYYRPW 125 E+ + T G A+ L + + G + Sbjct: 1079 QEEFRLNRYSSKVDMCRAILSVQQMSDGTHTGKALNFTLPFFDSSRG-----GRPRVHQY 1133 Query: 126 IFLITDGAPTDEWQAAANKVFRGEEDKRFAFFSIGVQGADMKTLAQISVRQPLPLQGLQF 185 + +ITDG D A + D+ F+IGV L +I+ Q Q F Sbjct: 1134 LIVITDGVSQDNVAPPAKALR----DRNIIIFAIGVGNVQRAQLLEITNDQDKVFQEENF 1189 Query: 186 RELFSWLSSSLRSVSRSTPGTEVVLE 211 L L + S S+ G + L Sbjct: 1190 ESL-QSLEKEILSEVCSSQGCNIDLS 1214 Score = 98.1 bits (243), Expect = 2e-19, Method: Composition-based stats. Identities = 39/210 (18%), Positives = 79/210 (37%), Gaps = 20/210 (9%) Query: 19 RCPCILLLDVSGSMNGRPINELNAGLVTFRDELLADPLALKRVELGIVTFGP-VHVEQPF 77 + L+D S S+ + ++ L + L + +V++G+V + ++ P Sbjct: 233 PADIVFLVDSSTSIGLQNFQKVKHFLHSVVSGL---DVRSDQVQVGLVQYSDNIYPAFPL 289 Query: 78 --TSAANFFPPILF----AQGDTPMGAAITKALDMVEERKREYRANGISYYRPWIFLITD 131 +S + + + G T G+A+ RA + L+TD Sbjct: 290 KQSSLKSAVLDRIRNLPYSMGGTSTGSALEFIRANSLTEMSGSRA--KDGVPQIVVLVTD 347 Query: 132 GAPTDEWQAAANKVFRGEEDKRFAFFSIGVQGADMKTLAQISVRQ--PLPLQGLQFRELF 189 G +DE Q A+++ R F +G+ D++ L +I+ F + Sbjct: 348 GESSDEVQDVADQLKRD----GVFVFVVGINIQDVQELQKIANEPFEEFLFTTENF-SIL 402 Query: 190 SWLSSSLRSVSRSTPGTEVVLEAPKGWTSV 219 LS +L ST ++ ++ K + V Sbjct: 403 QALSGTLLQALCSTVERQMK-KSTKTYADV 431 Score = 61.5 bits (148), Expect = 2e-08, Method: Composition-based stats. Identities = 28/177 (15%), Positives = 56/177 (31%), Gaps = 20/177 (11%) Query: 21 PCILLLDVSGSMNGRPINELNAGLVTFRDELLADPLALKRVELGIVTFGPVHVEQPFTSA 80 + L+ S+N + + + L + L + + +G+ + + S Sbjct: 34 DIVFLVHN--SINPQHAHSVRNFLYILANSLQ---VGRDNIRVGLAQYSDTPTSEFLLSV 88 Query: 81 ANFFPPILFAQ-------GDTPMGAAITKALDMVEERKREYRANGISYYRPWIFLITDGA 133 + +L G MG A+ L+ RA+ +++ G Sbjct: 89 YHRKGDVLKHIRGLQFKPGGNRMGQALQFILEHHFREGAGSRASQ--GVPQVAVVVSSGL 146 Query: 134 PTDEWQAAANKVFRGEEDKRFAFFSIGVQGADMKTLAQISV--RQPLPLQGLQFREL 188 D + A + R ++IGV+ A L +IS + F L Sbjct: 147 TEDHIREPAEALRRA----GILVYAIGVKDASQAELREISSSPKDNFTFFVPNFPGL 199 Score = 59.2 bits (142), Expect = 9e-08, Method: Composition-based stats. Identities = 26/194 (13%), Positives = 56/194 (28%), Gaps = 21/194 (10%) Query: 20 CPCILLLDVSGSMNGRPINELNAGLVTFRDELLADPLALKRVELGIVTFGPVHVEQPFTS 79 + L+D S + + + L + + ++G+ + F Sbjct: 429 ADVVFLIDTSQGTSQASFQWMQNFISRIIGIL---EVGQDKYQIGLAQYSDQG-HTEFLF 484 Query: 80 AANFFPPILFA---------QGDTPMGAAITKALDMVEERKREYRANGISYYRPWIFLIT 130 + + A G G + + R + ++ +IT Sbjct: 485 NTHKTRNEMVAHIHELLVFQGGSRKTGQGLRFLHRTFFQEAAGSRL--LQGVPQYVVVIT 542 Query: 131 DGAPTDEWQAAANKVFRGEEDKRFAFFSIGVQGADMKTLAQISVRQPLPLQGLQFRELFS 190 G DE A + + + S+G+Q D L I + + LQ + Sbjct: 543 SGKSEDEVGEVAQILRK----RGVDIVSVGLQDFDRAELEGI--GPVVLVSDLQGEDRIR 596 Query: 191 WLSSSLRSVSRSTP 204 L + + +P Sbjct: 597 QLMLDVNMFIQGSP 610 >UniRef50_UPI0000F2E695 PREDICTED: hypothetical protein n=1 Tax=Monodelphis domestica RepID=UPI0000F2E695 Length = 2439 Score = 107 bits (267), Expect = 3e-22, Method: Composition-based stats. Identities = 34/176 (19%), Positives = 59/176 (33%), Gaps = 21/176 (11%) Query: 20 CPCILLLDVSGSMNGRPINELNAGLVTFRDELLADPLALKRVELGIVTFGPVHVEQ---- 75 L+D SGS+N E+ ++ + V G+V + + Sbjct: 622 ADFYFLIDGSGSINHDDFAEMKTFMIELISTFR---VGADHVRFGVVQYSDSPTVEFDIR 678 Query: 76 ---PFTSAANFFPPILFAQGDTPMGAAITKALDMVEERKREYRANGISYYRPWIFLITDG 132 + I G T G A+T + + + ++ +ITDG Sbjct: 679 QHSSVAQLKSAITKIWQTGGGTRTGEALTF-MKRLFSEVARDKVLR------FLIVITDG 731 Query: 133 APTDEWQAAANKVFRGEEDKRFAFFSIGVQGADMKTLAQISVRQPLPLQGLQFREL 188 D+ AA ++ + ++IGV+ A K L +IS Q F L Sbjct: 732 QSQDQVAQAAEELRQE----NITIYAIGVKSAVTKELLEISGSQNRMFFVNDFDSL 783 Score = 93.9 bits (232), Expect = 4e-18, Method: Composition-based stats. Identities = 28/179 (15%), Positives = 54/179 (30%), Gaps = 17/179 (9%) Query: 17 EPRCPCILLLDVSGSMNGRPINELNAGLVTFRDELLADPLALKRVELGIVTFGPVHVEQ- 75 + L+D S S++ R + + + + RV +G+ + ++ Sbjct: 1182 HREIDLVFLIDGSSSIHPRNFTAMKTFMKQIVNSFT---IGKDRVRIGVAQYSTNPQKEF 1238 Query: 76 ------PFTSAANFFPPILFAQGDTPMGAAITKALDMVEERKREYRANGISYYRPWIFLI 129 I + T G + E + + + +I Sbjct: 1239 YLNTFYSGAEINQHIDKITQLRTQTYTGKGLRFVKSFFEPANGSRKNLHV---LQSLVVI 1295 Query: 130 TDGAPTDEWQAAANKVFRGEEDKRFAFFSIGVQGADMKTLAQISVRQPLPLQGLQFREL 188 TDG D AAN + + FSIG+ ++ L I+ F +L Sbjct: 1296 TDGMSNDSVVEAANDLRNE----KIQIFSIGIGVINLFELQLIAGNVKRVFVVGDFGQL 1350 Score = 89.3 bits (220), Expect = 9e-17, Method: Composition-based stats. Identities = 34/209 (16%), Positives = 70/209 (33%), Gaps = 20/209 (9%) Query: 20 CPCILLLDVSGSMNGRPINELNAGLVTFRDELLADPLALKRVELGIVTFGPVHVEQPF-- 77 + L+D S S+ ++ + L + L + +V +G+ F + Sbjct: 16 ADLVFLVDSSTSIGPENFQKVKSFLYSLVLGL---EIGRDQVRVGLAQFNDNIYKAFLLN 72 Query: 78 -----TSAANFFPPILFAQGDTPMGAAITKALDMVEERKREYRANGISYYRPWIFLITDG 132 + + + G T G+A+ RA + L+TDG Sbjct: 73 QFPRKSDVLEQILSLPYRTGGTRTGSALNFLRTEFFTESAGSRA--KDNVPQIVILVTDG 130 Query: 133 APTDEWQAAANKVFRGEEDKRFAFFSIGVQGADMKTLAQISVRQ--PLPLQGLQFRELFS 190 DE AA+K+ + + + +G+ D++ L I+ + F + Sbjct: 131 ESNDEVAEAASKLK----GQGVSIYVVGINVQDVQELKTIASKPLEKFLFSIEDF-NILE 185 Query: 191 WLSSSLRSVSRSTPGTEVVLEAPKGWTSV 219 LS ++ S ++ K + V Sbjct: 186 GLSGNILPTLCSAVENQIR-AFTKSYADV 213 Score = 73.9 bits (180), Expect = 4e-12, Method: Composition-based stats. Identities = 35/174 (20%), Positives = 66/174 (37%), Gaps = 11/174 (6%) Query: 19 RCPCILLLDVSGSMNGRPINELNAGLVTFRDELLADPLALKRVELGIVTFGPVHVEQPFT 78 + + L+D S +N R +++ ++ + L ++V++G++ F E+ Sbjct: 804 KADILFLVDGSERINTRDFDKMKEFMMQMVN---KSDLGPEKVQIGLLQFSSNPQEEFRL 860 Query: 79 SAANFFPPILFA-QGDTPMGAA--ITKALDMVEERKREYRANGISYYRPWIFLITDGAPT 135 + IL A G + A + AL R ++ + I +I+ G Sbjct: 861 NTYYSKVDILRAITGMVQIRAGARVGSALSFSLPYFERSRGGRLNVPQYLIIIIS-GKTG 919 Query: 136 DEWQAAANKVFRGEEDKRFAFFSIGVQGADMKTLAQISVRQPLPLQGLQFRELF 189 D + A + DK F+IGV A+ L +I+ Q F L Sbjct: 920 DAVKMPAKALR----DKGIKIFAIGVHKANNSQLLEITGAQDKVYYEENFDSLL 969 Score = 62.7 bits (151), Expect = 9e-09, Method: Composition-based stats. Identities = 27/184 (14%), Positives = 58/184 (31%), Gaps = 19/184 (10%) Query: 20 CPCILLLDVSGSMNGRPINELNAGLVTFRDELLADPLALKRVELGIVTFGPVHVEQPFTS 79 + L+D S + N++ + D L + + ++G+ +G + + Sbjct: 211 ADVVFLVDTSEGTSSVSFNQMKDFICRVIDTL---EVGRDKDQIGLAQYGNQGHVEFLLN 267 Query: 80 AAN--------FFPPILFAQGDTPMGAAITKALDMVEERKREYRANGISYYRPWIFLITD 131 A L G G + + + + R + + +IT Sbjct: 268 AYQNPVEMISHIQQNFLPRGGARKTGNGLQYIQETFFQEEAGSR--FLQGIPQYAVVITS 325 Query: 132 GAPTDEWQAAANKVFRGEEDKRFAFFSIGVQGADMKTLAQISVRQPLPLQGLQ--FRELF 189 G D A K+ + +G+Q D + L ++ + Q R+L Sbjct: 326 GQSEDLVLEKAQKLKE----RGVKIMVVGIQDFDSRELKAMATPPLVFEIEGQDGIRQLH 381 Query: 190 SWLS 193 +S Sbjct: 382 QGVS 385 Score = 60.4 bits (145), Expect = 4e-08, Method: Composition-based stats. Identities = 28/166 (16%), Positives = 58/166 (34%), Gaps = 17/166 (10%) Query: 20 CPCILLLDVSGSMNGRPINELNAGLVTFRDELLADPLALKRVELGIVTFGPVHVE----- 74 + L++ S + L L+ P +V +G+V + Sbjct: 414 ADIVFLVEASSRIGLENFQLAVELLRKIIHTLIIGPN---KVRVGLVLYSDEPRLEFGLN 470 Query: 75 --QPFTSAANFFPPILFAQGDTPMGAAITKALDMVEERKREYRANGISYYRPWIFLITDG 132 + + + F G T GAA+ + V +++ R + +IT+G Sbjct: 471 TFLSQSEILSHLNKLPFIGGKTKTGAALDFLRNTVFTQQKGSR--YRQGVQQLAVVITEG 528 Query: 133 APTDEWQAAANKVFRGEEDKRFAFFSIG-VQGADMKTLAQISVRQP 177 DE A+ + R F++G ++ + + L +I+ P Sbjct: 529 YSQDEVDRPASLLRRA----GVTVFAVGTLKASGSRDLNKIASHPP 570 Score = 53.8 bits (128), Expect = 4e-06, Method: Composition-based stats. Identities = 23/188 (12%), Positives = 56/188 (29%), Gaps = 17/188 (9%) Query: 11 DFASNPEPRCPCILLLDVSGSMNGRPINELNAGLVTFRDELLADPLALKRVELGIVTFGP 70 + A + L+ SG + + + ++ + + + V G +++ Sbjct: 983 EEACKKTEVADILFLVHASGGITSQELLAIHRLMEAIIN---DSLVGKDNVRFGAISYSD 1039 Query: 71 VHVEQPFTSAANFFPPILF--------AQGDTPMGAAITKALDMVEERKREYRANGISYY 122 F+ + G A+ A + E Sbjct: 1040 NSEVL-FSLDTYITKAQIRDAVFHLKPKVGKAHTATALKFAKERFSEMHGGR---QSLAV 1095 Query: 123 RPWIFLITDGAPTDEWQA-AANKVFRGEEDKRFAFFSIGVQGADMKTLAQISVRQPLPLQ 181 + LIT+ PT+ + + + ++ F+IG++ L I+ + Sbjct: 1096 TQILVLITN-KPTESEEKKYLQESAQTLQEAGIDVFAIGIKNVKRPELQAITKHRDRSFM 1154 Query: 182 GLQFRELF 189 + EL+ Sbjct: 1155 VQSYNELY 1162 >UniRef50_B4DPQ4 cDNA FLJ60769, highly similar to Inter-alpha-trypsin inhibitor heavy chain H3 n=11 Tax=Tetrapoda RepID=B4DPQ4_HUMAN Length = 698 Score = 106 bits (266), Expect = 3e-22, Method: Composition-based stats. Identities = 28/171 (16%), Positives = 64/171 (37%), Gaps = 8/171 (4%) Query: 16 PEPRCPCILLLDVSGSMNGRPINELNAGLVTFRDELLADPLALKRVELGIVTFGPVHVEQ 75 P ++D+SGSM GR + + L+ +++ + + G V+ H+ Q Sbjct: 279 PVVPKNVAFVIDISGSMAGRKLEQTKEALLRILEDMKEEDYLNFTLFSGDVSTWKEHLVQ 338 Query: 76 PFTS---AANFFPPILFAQGDTPMGAAITKALDMVEERKREYRANGISYYRPWIFLITDG 132 A F + +G T + + + + M+ + + E+R S + ++TDG Sbjct: 339 ATPENLQEARTFVKSMEDKGMTNINDGLLRGISMLNKAREEHRIPERS--TSIVIMLTDG 396 Query: 133 APT--DEWQAAANKVFRGEEDKRFAFFSIGVQ-GADMKTLAQISVRQPLPL 180 + + R +F +++G + L +++ Sbjct: 397 DANVGESRPEKIQENVRNAIGGKFPLYNLGFGNNLNYNFLENMALENHGFA 447 >UniRef50_Q10RY0 Os03g0142500 protein n=2 Tax=Oryza sativa RepID=Q10RY0_ORYSJ Length = 694 Score = 106 bits (266), Expect = 4e-22, Method: Composition-based stats. Identities = 32/192 (16%), Positives = 57/192 (29%), Gaps = 31/192 (16%) Query: 17 EPRCPCILLLDVSGSMNGRPINELNAGLVTFRDELLADPLALKRVELGIVTFGPVH-VEQ 75 + +LDVSGSM+G ++ L + L + L +V F Sbjct: 243 RAPLDLVTVLDVSGSMSGIKLSLLKRAMSFVIQTLGPND------RLSVVAFSSTAQRLF 296 Query: 76 PFTSA-------ANFFPPILFAQGDTPMGAAITKALDMVEERKREYRANGISYYRPWIFL 128 P A L A G T + A+ K +V++R+R+ + I L Sbjct: 297 PLRRMTLTGRQQALQAISSLVASGGTNIADALKKGAKVVKDRRRKNPVSS-------IIL 349 Query: 129 ITDGAPTDEWQAAANKVFRGE---------EDKRFAFFSIGVQ-GADMKTLAQISVRQPL 178 ++DG T + + + + G D + I+ Sbjct: 350 LSDGQDTHSFLSGEADINYSILVPPSILPGTSHHVQIHTFGFGTDHDSAAMHAIAETSNG 409 Query: 179 PLQGLQFRELFS 190 + Sbjct: 410 TFSFIDAEGSIQ 421 >UniRef50_B7KCF7 von Willebrand factor type A n=1 Tax=Cyanothece sp. PCC 7424 RepID=B7KCF7_CYAP7 Length = 573 Score = 106 bits (266), Expect = 4e-22, Method: Composition-based stats. Identities = 35/182 (19%), Positives = 62/182 (34%), Gaps = 24/182 (13%) Query: 25 LLDVSGSMNGRPINELNAGLVTFRDELLADPLALKRVELGIVTFGP-VHVEQPFTSAAN- 82 ++D SGSM+G P+ + GL E+ +G+VT+G P Sbjct: 403 VIDTSGSMDGAPLEAVKKGLRIASKEINPGN------YVGLVTYGDRAAEVVPLGLFDEL 456 Query: 83 ------FFPPILFAQGDTPMGAAITKALDMVEERKREYRANGISYYRPWIFLITDGAPTD 136 L A G T M + L + E+K+ R ++ L+TDG Sbjct: 457 QHKRFLAAIDNLRADGATAMYDGMMIGLSKLMEQKKNN-----PDGRFYLLLLTDGQANM 511 Query: 137 EWQAAANKVFRGEEDKRFAFFSIGVQGADMKTLAQISVRQPLPLQ---GLQFRELFSWLS 193 ++V E + I + + L I+ + ++ +L L Sbjct: 512 GVT--FDEVKEVIEYSGVRVYPIAYGDVNQEELEAIASLRESTVKKGTPENVEDLLKGLF 569 Query: 194 SS 195 + Sbjct: 570 QT 571 >UniRef50_UPI00016C377F protein containing a von Willebrand factor type A domain n=1 Tax=Gemmata obscuriglobus UQM 2246 RepID=UPI00016C377F Length = 821 Score = 106 bits (266), Expect = 4e-22, Method: Composition-based stats. Identities = 35/222 (15%), Positives = 63/222 (28%), Gaps = 29/222 (13%) Query: 6 TFATSDFASNPEPRCPCILLLDVSGSMNGRPINELNAGLVTFRDELLADPLALKRVELGI 65 + A +L+LD S SM+ + + + +L + G+ Sbjct: 257 LISPQVEAEKKRVARDLVLVLDTSSSMSDIKMQQAKKAVKFCLSQLQPED------RFGV 310 Query: 66 VTFGPVHVE---------QPFTSAANFFPPILFAQGDTPMGAAITKALDMVEERKREYRA 116 V F + + A + L G T + A+ AL M Sbjct: 311 VRFSTTVTKFRSELVAANTDYLDLATKWIDGLKTSGGTAIWPALNDALAM---------R 361 Query: 117 NGISYYRPWIFLITDGAPTDEWQAAANKVFRGEEDK--RFAFFSIGVQ-GADMKTLAQIS 173 + + TDG PT + A V F+ GV + L Q++ Sbjct: 362 SSDPSRPFTMVFFTDGQPTVDETNADKIVKNVLAKNTGNTRIFTFGVGDDVNAAMLDQLA 421 Query: 174 VRQPLP-LQGLQFRELFSWLSSSLRSVSRSTPGTEVVLEAPK 214 + ++ +S +S V L + Sbjct: 422 DSTRAVSTYVREAEDIEVKVSGLYAKISNPVLTD-VQLATSE 462 >UniRef50_UPI00016DFBC7 UPI00016DFBC7 related cluster n=4 Tax=Takifugu rubripes RepID=UPI00016DFBC7 Length = 883 Score = 106 bits (266), Expect = 4e-22, Method: Composition-based stats. Identities = 42/211 (19%), Positives = 75/211 (35%), Gaps = 34/211 (16%) Query: 7 FATSDFASNPEPRCPCILLLDVSGSMNGRPINELNAGLVTFRDELLADPLALKRVELGIV 66 FA D P+ + ++D SGSM+G + ++ ++ ++L + GI+ Sbjct: 241 FAPKDLTRLPK---NVVFVIDRSGSMSGTKMQQIQEAMIKILEDLHPED------HFGII 291 Query: 67 TFGPV----------HVEQPFTSAANFFPPILFAQGDTPMGAAITKALDMVEERKREYRA 116 F E+ + A + I T + AA+ KA+DM+ + R Sbjct: 292 QFDSSVDSWRNSLSLATEENISEAMAYVNQISHKIQATNINAAVLKAVDMLVTDREAKRL 351 Query: 117 NGISYYRPWIFLITDGAPTDEWQA----AANKVFRGEEDKRFAFFSIGVQ-GADMKTL-- 169 S I L+TDG PT + + R + + +G D L Sbjct: 352 PEKSID--MIILLTDGDPTTDIGETRIPVIQENVRNAIGGNMSLYGLGFGNDVDYGFLDV 409 Query: 170 -----AQISVRQPLPLQ-GLQFRELFSWLSS 194 ++ R LQ + + +SS Sbjct: 410 MSRENKGLARRIYTGADAALQLQGFYDEVSS 440 >UniRef50_Q235T9 von Willebrand factor type A domain containing protein n=5 Tax=Tetrahymena thermophila RepID=Q235T9_TETTH Length = 703 Score = 106 bits (264), Expect = 7e-22, Method: Composition-based stats. Identities = 39/202 (19%), Positives = 68/202 (33%), Gaps = 33/202 (16%) Query: 8 ATSDFASNP---EPRCPCILLLDVSGSMNG-RPINELNAGLVTFRDELLADPLALKRVEL 63 + SNP P I ++D SGSMN I + ++ + L + L Sbjct: 196 NDMEVKSNPLEGRPNLDLICVIDNSGSMNDFSKIENVKNTILQLLEMLNEND------RL 249 Query: 64 GIVTFGP-VHVEQPFTSAANFFPPILF-------AQGDTPMGAAITKALDMVEERKREYR 115 ++TF + N L A G T + I A +++ RK++ Sbjct: 250 SLITFNTKAKQLCGLKNVNNQNKKSLQTITKSIKADGGTDIIRGIEIAFQILQSRKQKNS 309 Query: 116 ANGISYYRPWIFLITDGAPTDEWQAAANKVF---RGEEDKRFAFFSIGVQ-GADMKTLAQ 171 + IFL++DG N + + +++ F S G D + + Sbjct: 310 VSS-------IFLLSDGQDNLADAGIKNLLKTTYKQLQEESFTIHSFGFGNDHDGPLMQK 362 Query: 172 IS----VRQPLPLQGLQFRELF 189 I+ + Q E F Sbjct: 363 IAQIKDGSFYFVEKNDQVDEFF 384 >UniRef50_UPI00006A1915 Transmembrane protein 110. n=2 Tax=Xenopus (Silurana) tropicalis RepID=UPI00006A1915 Length = 728 Score = 106 bits (264), Expect = 7e-22, Method: Composition-based stats. Identities = 39/205 (19%), Positives = 73/205 (35%), Gaps = 32/205 (15%) Query: 12 FASNPEPRCP--CILLLDVSGSMNGRPINELNAGLVTFRDELLADPLALKRVELGIVTFG 69 FA + P + ++D SGSM+G+ I + + +L + GI+ F Sbjct: 238 FAPASLQKVPKNVVFVIDHSGSMHGQKIKQTYEAFLKILADLPEED------HFGILIFD 291 Query: 70 P---------VHVEQPFTSAANFFPPILFAQGDTPMGAAITKALDMVEERKREYRANGIS 120 V A F + A+G T + A+ A+ M++ R IS Sbjct: 292 DKVDKWQNTLVKAVPDNIIKAKQFVSKISARGGTDINKALLAAVKMLKNTSRNKLLPKIS 351 Query: 121 YYRPWIFLITDGAPTDEWQ---AAANKVFRGEEDKRFAFFSIGVQ-GADMKTLAQISVR- 175 I ++DG PT N V + E ++ + +G D L ++++ Sbjct: 352 --TSIILFLSDGEPTSGVTNHNEIINNVKKANE-RQTTLYCLGFGNDVDFNFLEKMALEN 408 Query: 176 -------QPLPLQGLQFRELFSWLS 193 LQ + ++ ++ Sbjct: 409 GGLARRIYEDSDAALQLQGFYNEVA 433 >UniRef50_UPI000180BC4A PREDICTED: similar to predicted protein n=1 Tax=Ciona intestinalis RepID=UPI000180BC4A Length = 1038 Score = 106 bits (264), Expect = 7e-22, Method: Composition-based stats. Identities = 31/187 (16%), Positives = 54/187 (28%), Gaps = 19/187 (10%) Query: 13 ASNPEPRCPCILLLDVSGSMNGRPINELNAGLVTFRDELLADPLALKRVELGI-VTF--- 68 +N ++++D SGSM +N + + L I V F Sbjct: 184 QANVPKPKQIVIVIDKSGSMGVTNMNLAKEAAKSVVNTLNPQDRFAVMAFSSIFVPFQST 243 Query: 69 --GPVHVEQPFTSAA-------NFFPPILFAQGDTPMGAAITKALDMVEERKREYRANGI 119 F A+ F + + G T A+ KA ++ N Sbjct: 244 VASDQCFATTFADASPQNKKKVEDFVDTISSGGGTNYAPALQKAFSFFQQEPSVSDFNIK 303 Query: 120 SY----YRPWIFLITDGAPTDEWQAAANKVFRGEE--DKRFAFFSIGVQGADMKTLAQIS 173 I ++DG P D + R E + + G+ AD L ++ Sbjct: 304 KIDPSEIDRVILFMSDGIPNDPGSTILSAQIRANEQLNNSVIILTYGLGNADFGVLRNMA 363 Query: 174 VRQPLPL 180 + Sbjct: 364 TNKGDVY 370 >UniRef50_UPI00017B4DF5 UPI00017B4DF5 related cluster n=3 Tax=Tetraodontidae RepID=UPI00017B4DF5 Length = 2436 Score = 106 bits (264), Expect = 7e-22, Method: Composition-based stats. Identities = 36/196 (18%), Positives = 69/196 (35%), Gaps = 17/196 (8%) Query: 16 PEPRCPCILLLDVSGSMNGRPINELNAGLVTFRDELLADPLALKRVELGIVTFGPVHVEQ 75 + + + LLD SGS+ + + ++ ++ V +G+ F ++ Sbjct: 984 EKQKADLVFLLDQSGSIQSDDYTTMKKFTIDLINKFQ---ISRDLVHVGLAQFSSTFKDE 1040 Query: 76 -------PFTSAANFFPPILFAQGDTPMGAAITKALDMVEERKREYRANGISYYRPWIFL 128 + + + +G T +G A+ E +A GIS + L Sbjct: 1041 FYLNKFFDEQAISAHIKDMQQEEGGTLIGLALNSIRKYFEASHGSRKAEGIS---QNLVL 1097 Query: 129 ITDGAPTDEWQAAANKVFRGEEDKRFAFFSIGVQGADMKTLAQISVRQPLPLQGLQFREL 188 ITDG D+ + AA + F+IG+ L QI+ F +L Sbjct: 1098 ITDGDSQDDVEEAARLLRGL----GVEVFAIGIGNVHDLELLQIAGTPENVFTVKNFDKL 1153 Query: 189 FSWLSSSLRSVSRSTP 204 + ++ +S P Sbjct: 1154 EGIHQKVVDTICQSKP 1169 Score = 103 bits (258), Expect = 3e-21, Method: Composition-based stats. Identities = 35/176 (19%), Positives = 60/176 (34%), Gaps = 18/176 (10%) Query: 20 CPCILLLDVSGSMNGRPINELNAGLVTFRDELLADPLALKRVELGIVTFGPVHVE----- 74 L+D SGS++ ++ ++ F P V +G+V + Sbjct: 393 ADIFFLIDQSGSIHPPDFYDMKKFILEFLQTFRVGPN---HVRIGVVKYADSPTLEFDLH 449 Query: 75 --QPFTSAANFFPPILFAQGDTPMGAAITKALDMVEERKREYRANGISYYRPWIFLITDG 132 S I G T G A+ D + + + ++ +ITDG Sbjct: 450 TYTDVKSLEKAITNIHQVGGGTETGKAL----DFMRPQFDRAVTTRGHKVKEYLVVITDG 505 Query: 133 APTDEWQAAANKVFRGEEDKRFAFFSIGVQGADMKTLAQISVRQPLPLQGLQFREL 188 TD+ + A+K+ ++IGV+ A K L +IS F L Sbjct: 506 NSTDKVKDPADKLRAQ----GVVVYAIGVKDAVEKELLEISGEPQRTFYVNNFDAL 557 Score = 103 bits (257), Expect = 4e-21, Method: Composition-based stats. Identities = 41/201 (20%), Positives = 65/201 (32%), Gaps = 17/201 (8%) Query: 20 CPCILLLDVSGSMNGRPINELNAGLVTFRDELLADPLALKRVELGIVTFGPVHVEQPFT- 78 + L+D S S+ E+ L + L P + +G+ F H E Sbjct: 3 ADIVFLVDGSSSIGPSNFQEVRLFLRSLASGLNVSP---DNIRIGLAQFDEPHQEFLLKY 59 Query: 79 -----SAANFFPPILFAQGDTPMGAAITKALDMVEERKREYRANGISYYRPWIFLITDGA 133 + F + G T G AI +K RA+ +ITDG Sbjct: 60 HIEKMNLLAAFESFPYRNGGTETGKAINFLRKQYFTKKAGSRADQR--VPQIAVVITDGD 117 Query: 134 PTDEWQAAANKVFRGEEDKRFAFFSIGVQGADMKTLAQISVRQP--LPLQGLQFRELFSW 191 TD+ A ++ + F+IGV A+ L I+ R F+ L Sbjct: 118 STDDVVVPARELRK----HGVIVFAIGVGNANQGELKSIANRPSERFKFTIDSFQALKRL 173 Query: 192 LSSSLRSVSRSTPGTEVVLEA 212 L ++ S V + Sbjct: 174 TERLLETMCVSMEDQHQVFPS 194 Score = 98.9 bits (245), Expect = 1e-19, Method: Composition-based stats. Identities = 31/175 (17%), Positives = 58/175 (33%), Gaps = 19/175 (10%) Query: 21 PCILLLDVSGSMNGRPINELNAGLVTFRDELLADPLALKRVELGIVTFGPV-HVEQPFT- 78 I L+D SGS+ ++ + + + +V +G++ + + P Sbjct: 601 DLIFLIDSSGSIYPEDYQKMKDFMKSLVQ---KSNIGKDQVHVGVLQYSTEQKLVFPLIQ 657 Query: 79 -----SAANFFPPILFAQGDTPMGAAITKALDMVEERKREYRANGISYYRPWIFLITDGA 133 + + G T G AI + + G + + ++TDG Sbjct: 658 YYTKDQLSKAIDDMQQIGGGTHTGEAIAVVSKYFD-----AQNGGRPDLKQRLVVVTDGE 712 Query: 134 PTDEWQAAANKVFRGEEDKRFAFFSIGVQGADMKTLAQISVRQPLPLQGLQFREL 188 D+ + A + K +SIGV A+ L +IS F L Sbjct: 713 SQDDVKLPAEALRA----KGVIVYSIGVVAANTSQLLEISGDADRMYAERDFDAL 763 Score = 83.1 bits (204), Expect = 5e-15, Method: Composition-based stats. Identities = 43/209 (20%), Positives = 70/209 (33%), Gaps = 26/209 (12%) Query: 1 MSEQITFATSDFASNPEPRCPCILLLDVSGSMNGRPINELNAGLVTFRDELLADPLALKR 60 M +Q S E +LLD G + ++ L + ++L P Sbjct: 185 MEDQHQVFPSALK---EKFADVFILLD--GGITQAEFRQIRTFLGSLVNQLNFSPS---T 236 Query: 61 VELGIVTFGPVHVEQPFTSAANFFPPILFA--------QGDTPMGAAITKALDMVEERKR 112 LG+ +G +L A G T GAA+ V R++ Sbjct: 237 YRLGLAQYGQDIKVDFLFKDHQTNKDLLTAVKNAQQHHGGGTNTGAALNFTQHQVFVREK 296 Query: 113 EYRANGISYYRPWIFLITDGAPTDEWQAAANKVFRGEEDKRFAFFSIGVQGADMKTLAQI 172 R + +ITDG D+ A + +++GV+ AD L QI Sbjct: 297 GSRIE--LGVQQVAVVITDGRSQDDVSTPAANLRA-----GVTVYAVGVKDADEAQLHQI 349 Query: 173 SV--RQPLPLQGLQFRELFSWLSSSLRSV 199 + + F +L L +SL+ V Sbjct: 350 ASYPTKEHTFTVDSFSKL-KTLETSLQRV 377 Score = 80.4 bits (197), Expect = 4e-14, Method: Composition-based stats. Identities = 32/207 (15%), Positives = 69/207 (33%), Gaps = 19/207 (9%) Query: 3 EQITFATSDFASNPEPRCPCILLLDVSGSMNGRP-INELNAGLVTFRDELLADPLALKRV 61 +++ + I L+DVS S+ + + + + + + Sbjct: 778 DRVEVRPVISDCKKTAQADIIFLVDVSTSILKEKAFPSVTVFMESVVN---QSSVGPELT 834 Query: 62 ELGIVTFGPV-------HVEQPFTSAANFFPPILFAQGDTPMGAAITKALDMVEERKREY 114 G++TF + G+T G A+ +L + Sbjct: 835 RFGVITFSTGVQSIFTLKQYSSKRDVLQAVGAVTAPGGNTNTGDALDYSLQYFGKEHGGR 894 Query: 115 RANGISYYRPWIFLITDGAPTDEWQAAANKVFRGEEDKRFAFFSIGVQGADMKTLAQISV 174 A + + +ITDGA + + + + FSIGV+ A + L ++ Sbjct: 895 AALKVP---QILMVITDGAAQEPSK--LPGPSEALRKQGVSVFSIGVKNASREQLDIMAG 949 Query: 175 RQPLPLQGLQFRELFSWLSSSLRSVSR 201 P + F + F L + +++S+ Sbjct: 950 NDPSRVF---FVDTFDALETLYKNISK 973 >UniRef50_A7C1J8 von Willebrand factor, type A n=1 Tax=Beggiatoa sp. PS RepID=A7C1J8_9GAMM Length = 478 Score = 106 bits (264), Expect = 7e-22, Method: Composition-based stats. Identities = 41/188 (21%), Positives = 66/188 (35%), Gaps = 18/188 (9%) Query: 16 PEPRCPCILLLDVSGSMN-GRPINELNAGLVTFRDELLADPLALKRVELGIVTFGPVHVE 74 P P ILL+D SGSM G + E+ A + F L ++ +V FG Sbjct: 39 PLPPHDVILLIDTSGSMAEGTKLQEVQAAAIQFIQR-RHGLTHLANNKIAVVGFGGRAYL 97 Query: 75 QPFTSAA----NFFPPILFAQGDTPMGAAITKALDMVEERKREYRANGISYYRPWIFLIT 130 ++ L A G TPM + A++ + G + I L T Sbjct: 98 VANLTSDLMNLEQPIQKLRAVGGTPMDRGLQSAMNQLS--------AGSDSEQRSILLFT 149 Query: 131 DGAPTDEWQAAANKVFRGEEDKRFAFFSIGVQGADMKTLAQISVRQPLPL--QGLQFREL 188 DG P + Q + ++ +I AD+ L Q++ L F + Sbjct: 150 DGKP--DNQRTTLNASQLVKNANIQIVAIATDDADIGLLTQVTGDAALVFPTSVGNFDQA 207 Query: 189 FSWLSSSL 196 F ++ Sbjct: 208 FQKAEQAI 215 >UniRef50_C0PS55 Putative uncharacterized protein n=1 Tax=Picea sitchensis RepID=C0PS55_PICSI Length = 829 Score = 105 bits (263), Expect = 9e-22, Method: Composition-based stats. Identities = 35/195 (17%), Positives = 58/195 (29%), Gaps = 34/195 (17%) Query: 17 EPRCPCILLLDVSGSMNGRPINELNAGLVTFRDELLADPLALKRVELGIVTFGPVHVEQ- 75 + +LDVSGSM+G + L + L + L +V F Sbjct: 355 RAPIDLVTVLDVSGSMSGTKLALLKRAMAFVISNLSPED------RLSVVVFSSTAKRVF 408 Query: 76 -------PFTSAANFFPPILFAQGDTPMGAAITKALDMVEERKREYRANGISYYRPWIFL 128 AAN L G T + + K ++E+R++ I L Sbjct: 409 SLKRMTPDGQRAANRVVERLLCTGGTNIAEGLRKGAKVLEDRRQRNPVAS-------IML 461 Query: 129 ITDGAPT------------DEWQAAANKVFRGEEDKRFAFFSIGVQ-GADMKTLAQISVR 175 ++DG T + Q + + + + G D T+ IS Sbjct: 462 LSDGQDTYSLSSRGVVLFPSDEQRRSARQSTRYGHVQIPVHAFGFGVDHDAATMHAISEV 521 Query: 176 QPLPLQGLQFRELFS 190 +Q L Sbjct: 522 SGGTFSFIQAESLVQ 536 >UniRef50_C1A2B7 Putative uncharacterized protein n=1 Tax=Rhodococcus erythropolis PR4 RepID=C1A2B7_RHOE4 Length = 233 Score = 105 bits (263), Expect = 9e-22, Method: Composition-based stats. Identities = 45/220 (20%), Positives = 82/220 (37%), Gaps = 27/220 (12%) Query: 20 CPCILLLDVSGSMNGRPINELNAGLVTFRDELLADPLALKRVELGIVTFGPVHVE-QPFT 78 P L+ DVS SM I E+N + ++E+L DP+ + +++F P Sbjct: 7 LPFYLVFDVSYSMEPV-IGEVNNAMRALKNEILKDPILGDIARVCVLSFSDEARIDVPMC 65 Query: 79 SAAN----FFPPILFAQGDTPMGAAITKALDMVEERKREYRANGISY-YRPWIFLITDGA 133 A+ L +G T + + + + +G +RP +F +TDG Sbjct: 66 DLADDTRITREDFLQVRGGTSFAPIFDLIGERIAADIADLKGHGEGKVFRPTVFFVTDGV 125 Query: 134 PTD---EWQAAANKVFRGEEDKRFAFFSIGVQGADMKTLAQIS------VRQPLPLQGL- 183 PTD EW +A ++ + G+ AD + L I+ Sbjct: 126 PTDAVHEWNSAFTRLTSVKAYPNL--VPFGLGDADEEVLRAITFPPYRQDGYFFMANAGT 183 Query: 184 ----QFRELFSWLSSSLRSVSRS----TPGTEVVLEAPKG 215 + + ++ S+ S ++S TPG + +G Sbjct: 184 SAEQAMQAITRIVTQSVVSCTQSAVAGTPGVVMDTRGTEG 223 >UniRef50_A9GUK8 Putative uncharacterized protein yfbK n=1 Tax=Sorangium cellulosum 'So ce 56' RepID=A9GUK8_SORC5 Length = 521 Score = 105 bits (263), Expect = 1e-21, Method: Composition-based stats. Identities = 36/195 (18%), Positives = 63/195 (32%), Gaps = 26/195 (13%) Query: 8 ATSDFASNPEPRCPCILLLDVSGSMNGRPINELNAGLVTFRDELLADPLALKRVELGIVT 67 + D + P ++ +D SGSM G PI + AGLV D L + +V Sbjct: 119 SPVDLGALERPPLHLVIAVDTSGSMEGDPIAYVRAGLVEMIDALQPTD------RISLVR 172 Query: 68 FGPVHVEQ------PFTSAANFFPPILFAQGDTPMGAAITKALDMVEERKREYRANGISY 121 + A L A+G T + + A + E+ Sbjct: 173 YSDAAEVVLEQAEGSDREALTEAFEGLTARGSTNLYEGLFTAYALAEQHLD-------PA 225 Query: 122 YRPWIFLITDGAPTDEWQAA--ANKVFRGEEDKRFAFFSIGVQGA-DMKTLAQI----SV 174 ++ + ++DG T + + G +K +IGV D+ + I + Sbjct: 226 WQNRVIFLSDGVATAGLTSPQRLVSLAAGYAEKGIGLTAIGVGAEFDVDAMRGISEVGAG 285 Query: 175 RQPLPLQGLQFRELF 189 E+F Sbjct: 286 NFYFLEDPKAVEEVF 300 >UniRef50_Q9FF49 Retroelement pol polyprotein-like n=34 Tax=Magnoliophyta RepID=Q9FF49_ARATH Length = 704 Score = 105 bits (262), Expect = 1e-21, Method: Composition-based stats. Identities = 27/191 (14%), Positives = 54/191 (28%), Gaps = 30/191 (15%) Query: 17 EPRCPCILLLDVSGSMNGRPINELNAGLVTFRDELLADPLALKRVELGIVTFGP-VHVEQ 75 + +LDVSGSM G + L + L L +++F Sbjct: 248 RAPVDLVTVLDVSGSMAGTKLALLKRAMGFVIQNLGPFD------RLSVISFSSTARRNF 301 Query: 76 PFTSAAN-------FFPPILFAQGDTPMGAAITKALDMVEERKREYRANGISYYRPWIFL 128 P L + G T + + K ++ +R+ + + I L Sbjct: 302 PLRLMTETGKQEALQAVNSLVSNGGTNIAEGLKKGARVLIDRRFKNPVSS-------IVL 354 Query: 129 ITDGAPTDEWQAA--------ANKVFRGEEDKRFAFFSIGVQ-GADMKTLAQISVRQPLP 179 ++DG T + + + R + G D + I+ Sbjct: 355 LSDGQDTYTMTSPNGSRGTDYKALLPKEINGNRIPVHAFGFGADHDASLMHSIAENSGGT 414 Query: 180 LQGLQFRELFS 190 ++ + Sbjct: 415 FSFIESETVIQ 425 >UniRef50_UPI00016E1D1D UPI00016E1D1D related cluster n=9 Tax=Tetraodontidae RepID=UPI00016E1D1D Length = 2191 Score = 105 bits (262), Expect = 1e-21, Method: Composition-based stats. Identities = 39/196 (19%), Positives = 74/196 (37%), Gaps = 19/196 (9%) Query: 20 CPCILLLDVSGSMNGRPINELNAGLVTFRDELLADPLALKRVELGIVTFGPVHVEQ---- 75 L+D SGS+N +++ ++ F P + + +G+V F + Sbjct: 588 ADIFFLIDHSGSINPADFHDMKKFMIEFLHTFRVGP---QHIRIGVVKFADSPQLEFDLQ 644 Query: 76 ---PFTSAANFFPPILFAQGDTPMGAAITKALDMVEERKREYRANGISYYRPWIFLITDG 132 S + I G T G A+ + + + A + ++ +ITDG Sbjct: 645 AYSDVKSLEDAILNIKQIGGGTETGRALEF----MSPQFDQALATHGHKVKEYLVVITDG 700 Query: 133 APTDEWQAAANKVFRGEEDKRFAFFSIGVQGADMKTLAQISVRQPLPLQGLQFRELFSWL 192 TD+ +A A+K+ + ++IGV+ AD L +IS F L + Sbjct: 701 KSTDKVKAPADKLR----SQDVVVYAIGVKNADENQLLEISGDPQRTFFVNNFDAL-RPI 755 Query: 193 SSSLRSVSRSTPGTEV 208 + + S G ++ Sbjct: 756 KDDIITDICSQDGKQI 771 Score = 101 bits (253), Expect = 1e-20, Method: Composition-based stats. Identities = 43/189 (22%), Positives = 75/189 (39%), Gaps = 19/189 (10%) Query: 20 CPCILLLDVSGSMNGRPINELNAGLVTFRDELLADPLALKRVELGIVTFGPVHVEQ---- 75 + ++D SGS+ + + L + L +A RV +GIV + + Q Sbjct: 381 ADIVFIIDESGSIGSANFQLMRSFLHSLISGLQ---VASNRVRVGIVMYNVEPMAQVFLN 437 Query: 76 ---PFTSAANFFPPILFAQGDTPMGAAITKALDMVEERKREYRANGISYYRPWIFLITDG 132 + +F + + G T GAA+ L V ++R R + + +ITDG Sbjct: 438 TFKDKSELLDFIKILPYHGGGTNTGAALNFTLQEVFIKQRGSRKD--LGVQQVAVVITDG 495 Query: 133 APTDEWQAAANKVFRGEEDKRFAFFSIGVQGADMKTLAQISV--RQPLPLQGLQFRELFS 190 DE + A + R +++GV+ AD L QI+ F +L Sbjct: 496 KSQDEVSSPAANLRRA----GVTVYAVGVKDADKAQLDQIASYPTNKHTFIIDSFTKL-K 550 Query: 191 WLSSSLRSV 199 L +SL+ + Sbjct: 551 TLEASLQRI 559 Score = 100 bits (250), Expect = 3e-20, Method: Composition-based stats. Identities = 41/196 (20%), Positives = 66/196 (33%), Gaps = 20/196 (10%) Query: 21 PCILLLDVSGSMNGRPINELNAGLVTFRDELLADPLALKRVELGIVTFGPVHVEQPFTSA 80 + L+D S S+ E+ L F L + ++++G+ + Q F Sbjct: 1 DIVFLVDGSSSIGTDNFQEVRLFLRNFTSGL---DIGPDKIQIGLAQYSNDP-HQEFLLK 56 Query: 81 ANFFPPILFAQ--------GDTPMGAAITKALDMVEERKREYRANGISYYRPWIFLITDG 132 + L A G T G AI ++ RAN +ITDG Sbjct: 57 DHMEKTALLAALDSFPYRTGGTETGKAIDFLRTQYFTKEAGSRANQR--VPQIAVVITDG 114 Query: 133 APTDEWQAAANKVFRGEEDKRFAFFSIGVQGADMKTLAQISVRQP--LPLQGLQFRELFS 190 TD+ A + + F+IGV A+ L I+ R P F+ L Sbjct: 115 DSTDDVTVPAQSLRK----HGVIVFAIGVGNANQNELESIANRPPKRFKFTIDSFQALQR 170 Query: 191 WLSSSLRSVSRSTPGT 206 L+++ S Sbjct: 171 LTKGLLQTMCVSIKDQ 186 Score = 98.5 bits (244), Expect = 1e-19, Method: Composition-based stats. Identities = 35/177 (19%), Positives = 61/177 (34%), Gaps = 19/177 (10%) Query: 19 RCPCILLLDVSGSMNGRPINELNAGLVTFRDELLADPLALKRVELGIVTFGPV-HVEQPF 77 + I L+D SGS+ ++ + + + V +G++ + + + P Sbjct: 789 KRDLIFLIDSSGSIYPEDYKKMKDFMKSVI---KQSIVGKNEVHVGVMQYSTIQKLVFPL 845 Query: 78 T------SAANFFPPILFAQGDTPMGAAITKALDMVEERKREYRANGISYYRPWIFLITD 131 + + G T G AIT + R G + + ++TD Sbjct: 846 NQYYTKDELSKAIDEMQQIGGGTHTGEAITDVSQYFD-----ARNGGRPDLKQRLVVVTD 900 Query: 132 GAPTDEWQAAANKVFRGEEDKRFAFFSIGVQGADMKTLAQISVRQPLPLQGLQFREL 188 G DE + A + K +SIGV A+ L +IS G F L Sbjct: 901 GESQDEVRQPAEALRA----KGVIVYSIGVVAANTSQLLEISGTPNRMYAGRDFDAL 953 Score = 92.0 bits (227), Expect = 1e-17, Method: Composition-based stats. Identities = 31/183 (16%), Positives = 54/183 (29%), Gaps = 17/183 (9%) Query: 13 ASNPEPRCPCILLLDVSGSMNGRPINELNAGLVTFRDELLADPLALKRVELGIVTFGPVH 72 + I L+D S S+ + + + + + G++ + Sbjct: 986 ECKKTEKADIIFLVDGSTSITQPKFRSMLKFMASMVN---QTTVGSDLTRFGVILYSNDA 1042 Query: 73 -------VEQPFTSAANFFPPILFAQGDTPMGAAITKALDMVEERKREYRANGISYYRPW 125 + GDT G A+T +L E A + Sbjct: 1043 NSMFTLKQYSAKREVLQAIAALKSPLGDTYTGKALTYSLQFFNEEHGGRAALQVP---QI 1099 Query: 126 IFLITDGAPTDEWQAAANKVFRGEEDKRFAFFSIGVQGADMKTLAQISVRQPLPLQGLQF 185 + +ITDG D+ + AA + F+IG+ A L QI+ F Sbjct: 1100 LMVITDGESQDDVEDAARLLRSL----GVEVFTIGIGNAHDLELLQIAGSPERVFTVKSF 1155 Query: 186 REL 188 L Sbjct: 1156 GNL 1158 Score = 45.7 bits (107), Expect = 0.001, Method: Composition-based stats. Identities = 21/175 (12%), Positives = 51/175 (29%), Gaps = 17/175 (9%) Query: 18 PRCPCILLLDVSGSMNGRPINELNAGLVTFRDELLADPL------ALKRVELGI-VTFGP 70 L+D SG +N ++ L +++ + A + + F Sbjct: 186 QHQDIFFLVD-SG-LNPTDFQQVKTTLSRLVNQMNFNAYTYRLGLAQYGQNIDVKFLFNT 243 Query: 71 VHVEQPFTSAANFFPPI-LFAQGDTPMGAAITKALDMVEERKREYRANGISYYRPWIFLI 129 ++ A L +G+A+ + + R + +R ++ ++ Sbjct: 244 HQTKEELLKAIKAVNNRRLQPNEVHNLGSALQYVYKNLFTAEAGSRTDQS--FRQYLVVV 301 Query: 130 TDGAPTDEWQAAANKVFRGEEDKRFAFFSIGVQGADMKTLAQISVRQPLPLQGLQ 184 + D + A + + A MK L+ +S + Sbjct: 302 SGKDSNDPFYKEARLLKSA----GIYIITFSAG-ASMKELSVLSSPRYSYQSISN 351 >UniRef50_Q9M1S2 Putative uncharacterized protein T5N23_140 n=5 Tax=Arabidopsis thaliana RepID=Q9M1S2_ARATH Length = 676 Score = 105 bits (262), Expect = 1e-21, Method: Composition-based stats. Identities = 29/183 (15%), Positives = 53/183 (28%), Gaps = 24/183 (13%) Query: 17 EPRCPCILLLDVSGSMNGRPINELNAGLVTFRDELLADPLALKRVELGIVTFGP-VHVEQ 75 + +LD+SGSM G + L + L + L ++ F Sbjct: 240 RAPIDLVTVLDISGSMGGTKLALLKRAMGFVIQNLGSSD------RLSVIAFSSTARRLF 293 Query: 76 PFTSAAN-------FFPPILFAQGDTPMGAAITKALDMVEERKREYRANGISYYRPWIFL 128 P T ++ L A G T + + K ++E+R I L Sbjct: 294 PLTRMSDAGRQLALQAVNSLVANGGTNIVDGLRKGAKVMEDRLERNSVAS-------IIL 346 Query: 129 ITDGAPTDEWQAAANKVFRGEEDKRFAFFSIGVQ-GADMKTLAQISVRQPLPLQGLQFRE 187 ++DG T + + S G D + +S ++ Sbjct: 347 LSDGRDTYTTNHPDPSYKVML--PQISVHSFGFGSDHDASVMHSVSEVSGGTFSFIESES 404 Query: 188 LFS 190 + Sbjct: 405 VIQ 407 >UniRef50_Q09E12 Inter-alpha-inhibitor H4 heavy chain, putative n=1 Tax=Stigmatella aurantiaca DW4/3-1 RepID=Q09E12_STIAU Length = 540 Score = 105 bits (262), Expect = 1e-21, Method: Composition-based stats. Identities = 39/212 (18%), Positives = 68/212 (32%), Gaps = 28/212 (13%) Query: 5 ITFATSDFASNPEPRCPCILLLDVSGSMNGRPINELNAGLVTFRDELLADPLALKRVELG 64 + ++ +++ ++D SGSM G + L L Sbjct: 55 LIAPKTEVSASEIAAKRVTFVIDTSGSMQGSRMQIAKDALKYCVTRLNPQDT------FN 108 Query: 65 IVTFG-PVHVEQPFTSAAN--------FFPPILFAQGDTPMGAAITKALDMVEERKREYR 115 +V F V P +A F L A G T + A+ + L Sbjct: 109 VVRFSTDVEALFPALKSAQPENIQKAVAFVEQLEAIGGTAIDEALVRGLQ---------D 159 Query: 116 ANGISYYRPWIFLITDGAPT--DEWQAAANKVFRGEEDKRFAFFSIGVQ-GADMKTLAQI 172 +G S + ITDG PT + + A + + + F+ GV + + L ++ Sbjct: 160 NDGKSSAPHLLMFITDGQPTIGETDEGAIAQHAKDGRKAKTRLFTFGVGEDLNARLLDRL 219 Query: 173 SVRQPLPLQ-GLQFRELFSWLSSSLRSVSRST 203 S +E + +SS VS Sbjct: 220 SSDGAGTSDFVRDGKEFETKISSFYDKVSNPV 251 >UniRef50_A6H584 Collagen alpha-5(VI) chain n=2 Tax=Mus musculus RepID=CO6A5_MOUSE Length = 2640 Score = 105 bits (262), Expect = 1e-21, Method: Composition-based stats. Identities = 35/211 (16%), Positives = 72/211 (34%), Gaps = 21/211 (9%) Query: 2 SEQITFATSDFASNPEPRCPCILLLDVSGSMNGRPINELNAGLVTFRDELLADPLALKRV 61 +EQ+ + L+D S S+ + ++ + + D P+ +V Sbjct: 457 AEQMELDKTGCVDT--KEADIYFLIDGSSSIRKKEFEQIQIFMSSVIDMF---PIGPNKV 511 Query: 62 ELGIVTFGP-------VHVEQPFTSAANFFPPILFAQGDTPMGAAITKALDMVEERKREY 114 +G+V + V I +G T G A+ L +++ + Sbjct: 512 RVGVVQYSHKNEVEFPVSRYTDGIDLKKAVFNIKQLKGLTFTGKALDFILPLIK----KG 567 Query: 115 RANGISYYRPWIFLITDGAPTDEWQAAANKVFRGEEDKRFAFFSIGVQGADMKTLAQISV 174 + ++ ++TDG D AN++ + +IG+ A+ L QI+ Sbjct: 568 KTERTDRAPCYLIVLTDGKSNDSVLEPANRLRAE----QITIHAIGIGEANKTQLRQIAG 623 Query: 175 RQPLPLQGLQFRELFSWLSSSLRSVSRSTPG 205 + G F L + + + S G Sbjct: 624 KDERVNFGQNFDSL-KSIKNEIVHRICSEKG 653 Score = 103 bits (257), Expect = 4e-21, Method: Composition-based stats. Identities = 31/176 (17%), Positives = 64/176 (36%), Gaps = 15/176 (8%) Query: 20 CPCILLLDVSGSMNGRPINELNAGLVTFRDELLADPLALKRVELGIVTFGPVHVEQ---- 75 + +LD SGS+ R + + + + RV++G +T+ Sbjct: 845 LDIVFVLDHSGSIGPREQESM---MNLTIHLVKKADVGRDRVQIGALTYSNHPEILFYLN 901 Query: 76 ---PFTSAANFFPPILFAQGDTPMGAAITKALDMVEERKREYRANGISYYRPWIFLITDG 132 ++ A G+T A+ +++ + R R + +ITDG Sbjct: 902 TYSSGSAIAEHLRRPRDTGGETYTAKALQH-SNVLFTEEHGSRLTQN--VRQLMIVITDG 958 Query: 133 APTDEWQAAANKVFRGEEDKRFAFFSIGVQGADMKTLAQISVRQPLPLQGLQFREL 188 D + ++ R DK F++GV A+ L ++ ++ + F +L Sbjct: 959 VSHD--RDKLDEAARELRDKGITIFAVGVGNANQDELETMAGKKENTVHVDNFDKL 1012 Score = 100 bits (248), Expect = 4e-20, Method: Composition-based stats. Identities = 35/175 (20%), Positives = 61/175 (34%), Gaps = 19/175 (10%) Query: 19 RCPCILLLDVSGSMNGRPINELNAGLVTFRDELLADPLALKRVELGIVTFGPVHVEQ--- 75 + + L+D SGS+ + + ++ + R ++G+V F + E+ Sbjct: 658 KADIMFLVDSSGSIGPTNFETMKTFMKNLVGKIQ---IGADRSQVGVVQFSDYNREEFQL 714 Query: 76 ---PFTSAANFFPPILFA-QGDTPMGAAITKALDMVEERKREYRANGISYYRPWIFLITD 131 + +T G A+T + + G R ++ L+TD Sbjct: 715 NKYSTHEEIYAAIDRMSPINRNTLTGGALTFVNEYFD-----LSKGGRPQVRKFLILLTD 769 Query: 132 GAPTDEWQAAANKVFRGEEDKRFAFFSIGVQGADMKTLAQISVRQPLPLQGLQFR 186 G DE A + K FS+GV GA+ L +IS L F Sbjct: 770 GKAQDEVGGPATALR----SKSVTIFSVGVYGANRAQLEEISGDGSLVFHVENFD 820 Score = 63.5 bits (153), Expect = 5e-09, Method: Composition-based stats. Identities = 32/201 (15%), Positives = 63/201 (31%), Gaps = 19/201 (9%) Query: 20 CPCILLLDVSGSMNGRPINELNAGLVTFRDELLADPLALKRVELGIVTFGPV-HVEQPFT 78 + L+D S + + + L L P+ + + + + H E Sbjct: 29 ADVVFLVDSSNYLGIKSFPFVRTFLNRMISSL---PIEANKYRVALAQYSDALHNEFQLG 85 Query: 79 SAANFFP--PILF-----AQGDTPMGAAITKALDMVEERKREYRANGISYYRPWIFLITD 131 + N P L G +G A+ +A R + P + ++ Sbjct: 86 TFKNRNPMLNHLKKNFGFIGGSLKIGNALQEAHRTYFSAPTNGRD--KKQFPPILVVLAS 143 Query: 132 GAPTDEWQAAANKVFRGEEDKRFAFFSIGVQGADMKTLAQISVRQPLPL--QGLQFRELF 189 D+ + AA + S+GVQ A + L ++ Q Sbjct: 144 AESEDDVEEAAKALRED----GVKIISVGVQKASEENLKAMATSQFHFNLRTARDLGMFA 199 Query: 190 SWLSSSLRSVSRSTPGTEVVL 210 ++ ++ V++ GT V L Sbjct: 200 PNMTRIIKDVTQYREGTTVDL 220 Score = 55.4 bits (132), Expect = 2e-06, Method: Composition-based stats. Identities = 26/178 (14%), Positives = 58/178 (32%), Gaps = 17/178 (9%) Query: 18 PRCPCILLLDVSGSMNGRPINELNAGLVTFRDELLADPLALKRVELGIVTFGPVHVE--- 74 P I L D S ++ + L D + +R+++G+ +G + E Sbjct: 1034 PEADVIFLCDGSDMVSDSEFVTMTTFLSDLIDNF---DIESQRMKIGMAQYGSRYQEIIE 1090 Query: 75 ----QPFTSAANFFPPILFAQGDTPMGAAITKALDMVEERKREYRANGISYYRPWIFLIT 130 T + + ++G + A+ DM + R G+ + +IT Sbjct: 1091 LESSLNKTQWKSQVHSVAQSKGLPRLDFALKHVSDMFDPSVGGRRNAGVP---QTLVVIT 1147 Query: 131 DGAPTDEWQAAANKVFRGEEDKRFAFFSIGVQGADMKTLAQISVRQPLPLQGLQFREL 188 + + + +D ++G+ + L I+ + F +L Sbjct: 1148 ----SSSPRYDVTDAVKVLKDLGICVLALGIGDVYKEQLLPITGNSEKIITFRDFNKL 1201 Score = 49.6 bits (117), Expect = 8e-05, Method: Composition-based stats. Identities = 33/163 (20%), Positives = 55/163 (33%), Gaps = 19/163 (11%) Query: 20 CPCILLLDVS-GSMNGRPINELNAGLVTFRDELLADPLALKRVELGIVTFGP-VHVEQPF 77 I L+D S G + + +L L + + + LG+++F Sbjct: 267 ADLIFLVDESVG--TTQNLRDLQNFLENVTSSV---DVKDNCMRLGLMSFSDRAQTISSL 321 Query: 78 TSAANF-----FPPILFAQ-GDTPMGAAITKALDMVEERKREYRANGISYYRPWIFLITD 131 S+AN L Q G + +GAAI + R L+T Sbjct: 322 RSSANQSEFQQQIQKLSLQTGASNVGAAIEQMRKEGFSESSGSRKAQ--GVPQIAVLVTH 379 Query: 132 GAPTDEWQAAANKVFRGEEDKRFAFFSIGVQGADMKTLAQISV 174 A D + AA + F++G++GA+ L I Sbjct: 380 RASDDMVREAALDLRLE----GVTMFAMGIEGANNTQLEDIVS 418 Score = 48.4 bits (114), Expect = 2e-04, Method: Composition-based stats. Identities = 26/174 (14%), Positives = 60/174 (34%), Gaps = 26/174 (14%) Query: 20 CPCILLLDVSGSMNGRPINELNAGLVTFRDE--LLADPLALK-RVELGIVTFGP------ 70 LLD S ++ + A + + D + ++P A + + ++++ P Sbjct: 1995 MDVAFLLDNSKNIASDDFQAVKALVSSVIDSFHITSNPSASESGDRVALLSYSPSESSRR 2054 Query: 71 ---VHVEQPFTSAANF-------FPPILFAQGDTPMGAAITKALDMVEERKREYRANGIS 120 V E FT+ N + + GD +G A+ A++ + + Sbjct: 2055 KGRVKTEFAFTTYDNQSIMKNYIYTSLQQLNGDATIGLALQWAMEGL------FLGTPNP 2108 Query: 121 YYRPWIFLITDGAPTDEWQAAANKVFRGEEDKRFAFFSIGVQGADMKTLAQISV 174 I +I+ G E + V + + + F I + + +++ Sbjct: 2109 RKHKVIIVISAGE-NHEEKEFVKTVALRAKCQGYVVFVISLGSTQRDEMEELAS 2161 >UniRef50_UPI000180D155 PREDICTED: similar to integrin alpha Hr1 n=1 Tax=Ciona intestinalis RepID=UPI000180D155 Length = 1595 Score = 105 bits (261), Expect = 1e-21, Method: Composition-based stats. Identities = 36/214 (16%), Positives = 63/214 (29%), Gaps = 25/214 (11%) Query: 17 EPRCPCILLLDVSGSMNGRPINELNAGLVTFRDELLADPLALKRVELGIVTFG---PVHV 73 + +L++D SGS+N ++ L L + ++G+V + Sbjct: 413 RGKIDIVLVVDQSGSVNQCNFQKVKRWLRDIVRSFN---LGVTEQDVGVVVYSKKATTST 469 Query: 74 EQPF-------------TSAANFFPPILFAQGDTPMGAAITKALDMVEERKREYRANGIS 120 + + G T G A A +M+ K Sbjct: 470 VVDLGFSDYDSDGHTKKQEMTKILKKLAYEGGTTYTGYAFKLANEMLTGNKSR------P 523 Query: 121 YYRPWIFLITDGAPTDEWQAAANKVFRGEEDKRFAFFSIGVQGADMKTLAQISVRQPLPL 180 + I L+TDGA T + ++GV + L QI+ + Sbjct: 524 DAKKMIILLTDGATTAANTLQLKEELDVSRAANVMILAVGVGKFNQTELIQIAGDRKNFF 583 Query: 181 QGLQFRELFSWLSSSLRSVSRSTPGTEVVLEAPK 214 +F EL SVS + E+ Sbjct: 584 AVTKFSELEKVRDKLRSSVSVGNLEGSLKNESTD 617 >UniRef50_UPI0000E460BF PREDICTED: similar to inter-alpha-trypsin inhibitor heavy chain3 n=5 Tax=Strongylocentrotus purpuratus RepID=UPI0000E460BF Length = 1028 Score = 105 bits (261), Expect = 1e-21, Method: Composition-based stats. Identities = 38/169 (22%), Positives = 60/169 (35%), Gaps = 26/169 (15%) Query: 16 PEPRCPCILLLDVSGSMNGRPINELNAGLVTFRDELLADPLALKRVELGIVTFGP----- 70 P R I ++DVSGSM G+ + T D++ + I+ F Sbjct: 305 PNTRKNVIFVIDVSGSMYGQKTRQTKRAFTTILDDVRPID------RINIILFSSYAHVW 358 Query: 71 -----VHVEQPFTSAANFFPPILFAQGDTPMGAAITKALDMVEERKREYRANGISYYRPW 125 V +AA L G T + ++ KA++++ E P Sbjct: 359 REDQMVEATSDNIAAAKRHVNGLSVGGGTNIYDSLMKAVEILLEHD-------TGDAMPL 411 Query: 126 IFLITDGAPTDEWQAAANKVFRGEEDKRFAFFSIGVQG-ADMKTLAQIS 173 I ++TDG AA + R + FSIG D L ++S Sbjct: 412 IIMLTDGQV--GNAAAIVRDVTSVIGGRLSLFSIGFGNGVDFPFLEKLS 458 >UniRef50_Q498Q0 Inter-alpha (Globulin) inhibitor H3 n=6 Tax=Clupeocephala RepID=Q498Q0_DANRE Length = 892 Score = 105 bits (261), Expect = 2e-21, Method: Composition-based stats. Identities = 34/183 (18%), Positives = 65/183 (35%), Gaps = 27/183 (14%) Query: 12 FASNPEPRCP--CILLLDVSGSMNGRPINELNAGLVTFRDELLADPLALKRVELGIVTFG 69 FA R P + ++D SGSM G I + ++ +L D G++TF Sbjct: 257 FAPTDVQRIPKNVVFIIDQSGSMQGNKIEQTRMAMLRILSDLAKDD------YFGLITFS 310 Query: 70 PV---------HVEQPFTSAANFFPPILFAQGDTPMGAAITKALDMVEERKREYRANGIS 120 A F + + G T + A+ A++M+ + +E A+ Sbjct: 311 SHIQAWKPELLKATAENVEEAKTFVKQIRSGGATDINGAVLNAVNMINQYTQEGSAS--- 367 Query: 121 YYRPWIFLITDGAPTDEWQAA--ANKVFRGEEDKRFAFFSIGVQ-GADMKTLAQISVRQP 177 + L+TDG PT + + ++ + +G + L ++S+ Sbjct: 368 ----ILILLTDGDPTSGVTNPVTIQQNVKTAIGGKYPLYCLGFGFNVRFEFLEKMSLENN 423 Query: 178 LPL 180 Sbjct: 424 GAA 426 >UniRef50_A2E6Y7 von Willebrand factor type A domain containing protein n=4 Tax=Trichomonas vaginalis RepID=A2E6Y7_TRIVA Length = 720 Score = 105 bits (261), Expect = 2e-21, Method: Composition-based stats. Identities = 41/213 (19%), Positives = 69/213 (32%), Gaps = 17/213 (7%) Query: 12 FASNPEPRCPCILLLDVSGSMNGRPINELNAGLVTFRDELLADPLALKRVELGIVTFGPV 71 F E + ++D SGSM+G I L L I+ FG Sbjct: 233 FEGKVEQKSEFYFIIDCSGSMSGSRIENAKFCLNILIHSL------PIGCRFSIIQFGNS 286 Query: 72 -HVEQPFTSAANFFPPILFAQGDTPMGAAITKALDMVEERKREYRANGISYYRPWIFLIT 130 +N +A A D++ + ++ + IFL+T Sbjct: 287 YKEVVSICDYSNKNVK--YAMSAIARINADMGGTDILSPLEYVFKKKLGKGFIRKIFLLT 344 Query: 131 DGAPTDEWQAAANKVFRGEEDKRFAFFSIGVQ-GADMKTLAQIS----VRQPLPLQGLQF 185 DG + ++V + E+ R F+IG+ GAD + IS L Sbjct: 345 DGEVHNSDM-ICSRVQKERENNRI--FAIGLGSGADPGLIKNISAKSGGNYVLIADDDNM 401 Query: 186 RELFSWLSSSLRSVSRSTPGTEVVLEAPKGWTS 218 + + S S S S + + + W + Sbjct: 402 NNMIVEIMKSALSPSLSNISIQGESDQTEMWPT 434 >UniRef50_Q5RHF3 Novel protein similar to vertebrate inter-alpha (Globulin) inhibitor H5 (ITIH5) (Fragment) n=2 Tax=Danio rerio RepID=Q5RHF3_DANRE Length = 906 Score = 104 bits (260), Expect = 2e-21, Method: Composition-based stats. Identities = 33/174 (18%), Positives = 59/174 (33%), Gaps = 11/174 (6%) Query: 7 FATSDFASNPEPRCPCILLLDVSGSMNGRPINELNAGLVTFRDELLADPLALKRVELGIV 66 FA D P+ + ++D S SM G + + L T +EL + + Sbjct: 242 FAPRDLPVVPK---NVVFVIDTSASMLGTKMKQTKQALFTIINELRPNDNFNFVTFSNRI 298 Query: 67 TFGPVHVEQPFTSA----ANFFPPILFAQGDTPMGAAITKALDMVEERKREYRANGISYY 122 P T A F ++ G T + I ++ + + + Sbjct: 299 RVWQPGKLVPVTPISIRDAKKFIYMISVTGGTDINGGIQTGSALLSDYLS-SKDESHHHS 357 Query: 123 RPWIFLITDGAPTDEW--QAAANKVFRGEEDKRFAFFSIGVQ-GADMKTLAQIS 173 I +TDG PT + ++F F+IG+ D + L ++S Sbjct: 358 VSLIIFLTDGRPTVGVLQSPTIISNTKTAVQEKFCLFTIGMGDDVDYRLLERMS 411 >UniRef50_Q5NJK1 Matrilin-3a n=5 Tax=Danio rerio RepID=Q5NJK1_DANRE Length = 460 Score = 104 bits (260), Expect = 2e-21, Method: Composition-based stats. Identities = 29/167 (17%), Positives = 55/167 (32%), Gaps = 15/167 (8%) Query: 17 EPRCPCILLLDVSGSMNGRPINELNAGLVTFRDELLADPLALKRVELGIVTFGPVHVEQP 76 + ++D S S+ ++ L D L + + +V + + Sbjct: 60 SRPLDLVFIIDSSRSVRPGEFEKVKIFLADMVDTL---DVGPDATRVAVVNYASTVKIES 116 Query: 77 F------TSAANFFPPILFA-QGDTPMGAAITKALDMVEERKREYRANGISYYRPWIFLI 129 + T G AI KA+D K R + + I + Sbjct: 117 LLKSHLTKDTIKQAITRIEPLAAGTMTGMAIKKAMDEAFTEKSGARPKSKNISKVAIIV- 175 Query: 130 TDGAPTDEWQAAANKVFRGEEDKRFAFFSIGVQGADMKTLAQISVRQ 176 TDG P D+ + +V +++GV ADM++L ++ Sbjct: 176 TDGRPQDQVE----EVSAAARASGIEIYAVGVDRADMRSLKLMASNP 218 >UniRef50_UPI00017450FB von Willebrand factor type A domain protein n=1 Tax=Verrucomicrobium spinosum DSM 4136 RepID=UPI00017450FB Length = 424 Score = 104 bits (260), Expect = 2e-21, Method: Composition-based stats. Identities = 26/186 (13%), Positives = 56/186 (30%), Gaps = 14/186 (7%) Query: 13 ASNPEPRCPCILLLDVSGSMNGRPINELNAGLVTFRDELLADPLALKRVELGIVTFGPVH 72 AS +++D SGSM G + D L A + V+ Sbjct: 33 ASAKRAPVNVTIVIDKSGSMGGDKMVHAREAAKQALDRLGAGDMVSVVAYDDAVSLISPA 92 Query: 73 VEQPFTSAANFFPPILFAQGDTPMGAAITKALDMVEERKREYRANGISYYRPWIFLITDG 132 + + A G T + + I+K + + KR + N + L++DG Sbjct: 93 TDLTDRDRVKAAIDRIQAGGSTALFSGISKGAEELRRNKRPNQVNR-------VVLLSDG 145 Query: 133 APTDEWQAA--ANKVFRGEEDKRFAFFSIGVQ-GADMKTLAQIS----VRQPLPLQGLQF 185 + ++ + ++G+ G + + +++ Sbjct: 146 MANVGPSSPQDLGRLGASLAKEGITVTTLGLGLGYNEDLMTELALRSDGNHAFIENSQNL 205 Query: 186 RELFSW 191 +F Sbjct: 206 AGIFQT 211 >UniRef50_UPI000155C0BD PREDICTED: similar to collagen type VI alpha 4 n=2 Tax=Mammalia RepID=UPI000155C0BD Length = 1844 Score = 104 bits (260), Expect = 2e-21, Method: Composition-based stats. Identities = 33/178 (18%), Positives = 62/178 (34%), Gaps = 19/178 (10%) Query: 19 RCPCILLLDVSGSMNGRPINELNAGLVTFRDELLADPLALKRVELGIVTFGPVHVEQPF- 77 + I L+D S S+ ++ + + + + V +G++ F E+ Sbjct: 620 KADIIFLIDGSESIKESNFEKMKEFMKLMVNM---SNIGPENVRIGVLQFSSSPREEFML 676 Query: 78 ------TSAANFFPPILFAQGDTPMGAAITKALDMVEERKREYRANGISYYRPWIFLITD 131 + I + T G A+T L + + G ++ +ITD Sbjct: 677 NKYTTKEDLSRAISDIKQIKAGTQTGQALTFTLPYFDTSRW-----GRPTEPQYLIVITD 731 Query: 132 GAPTDEWQAAANKVFRGEEDKRFAFFSIGVQGADMKTLAQISVRQPLPLQGLQFRELF 189 G D + A + DK + F+IGV A+ L +I+ + F L Sbjct: 732 GEAQDSVKGPAKALR----DKGISIFAIGVLEANKTQLLEITGTEDQVFYENDFDSLI 785 Score = 74.6 bits (182), Expect = 2e-12, Method: Composition-based stats. Identities = 27/178 (15%), Positives = 55/178 (30%), Gaps = 24/178 (13%) Query: 18 PRCPCILLLDVSGSMNGRPINELNAGLVTFRDELLADPLALKRVELGIVTFGPVHVEQ-- 75 + L+D S S++ ++ + + RV G+V + + Sbjct: 439 EKVDIYFLIDGSSSIDHGDFLDMKMFMSEVLSVFQ---MGNNRVRFGVVQYSDSPHLEFE 495 Query: 76 -----PFTSAANFFPPILFAQGDTPMGAAITKALDMVEERKREYRANGISYYRPWIFLIT 130 I +G +G A+ + + + LIT Sbjct: 496 VGQYHSTVKLKEAIRGIKQLRGRDRIGEALNYMNQRFMD----------NDRVKILILIT 545 Query: 131 DGAPTDEWQAAANKVFRGEEDKRFAFFSIGVQGADMKTLAQISVRQPLPLQGLQFREL 188 G DE +A ++ + + ++IGV+ + L I+ + L F L Sbjct: 546 AGNFQDEVAESAQELRQ----RGIVIYAIGVKTDNQLKLISIAGTEENVLCVNDFDTL 599 Score = 60.4 bits (145), Expect = 5e-08, Method: Composition-based stats. Identities = 33/197 (16%), Positives = 58/197 (29%), Gaps = 23/197 (11%) Query: 17 EPRCPCILLLDVSGSMNGRPINELNAGLVTFRDELLADPLALKRVELGIVTFGPVHVEQP 76 +P + L+D S + + + L + R ++G+ F + Sbjct: 28 KPYADVVFLVDASEKLELNNFPLIKNFIFRIIRTL---EVGSNRYQIGLAQFSGTGHVEF 84 Query: 77 F--TSAANFFPPILFAQGDT------PMGAAITKALDMVEERKREYRANGISYYRPWIFL 128 T QG T G A+ + R + + Sbjct: 85 LLNTYLTKAEMIDYVQQGFTLRHGPRRTGNALQFLQKTFFKEAAGSRFGQ--GVPQYAVV 142 Query: 129 ITDGAPTDEWQAAANKVFRGEEDKRFAFFSIGVQGADMKTLAQISV-----RQPLPLQGL 183 IT G D AA K+ K S+G+Q D K L ++ + Sbjct: 143 ITSGKSEDAVAGAARKLR----GKGVNILSVGIQNFDKKELETMASPSLVFKIQREEGAS 198 Query: 184 QF-RELFSWLSSSLRSV 199 Q R++ SS++ Sbjct: 199 QLERKVIDLFRSSIKKR 215 >UniRef50_A6NMZ7 Collagen alpha-6(VI) chain n=2 Tax=Theria RepID=CO6A6_HUMAN Length = 2263 Score = 104 bits (260), Expect = 2e-21, Method: Composition-based stats. Identities = 39/196 (19%), Positives = 69/196 (35%), Gaps = 18/196 (9%) Query: 19 RCPCILLLDVSGSMNGRPINELNAGLVTFRDELLADPLALKRVELGIVTFGPV-HVEQPF 77 + + L+D S S+ ++ L + + ++L RV +G F H E P Sbjct: 998 KVDLVFLMDGSTSIQPNDFKKMKEFLASVVQDF---DVSLNRVRIGAAQFSDTYHPEFPL 1054 Query: 78 ------TSAANFFPPILFAQGDTPMGAAITKALDMVEERKREYRANGISYYRPWIFLITD 131 + I G+T +GAA+ + G + ++TD Sbjct: 1055 GTFIGEKEISFQIENIKQIFGNTHIGAALREVEHYFRPDMGSRINTGTP---QVLLVLTD 1111 Query: 132 GAPTDEWQAAANKVFRGEEDKRFAFFSIGVQGADMKTLAQISVRQPLPLQGLQFRELFSW 191 G DE AA + + +S+G+ D + L QI+ L F EL Sbjct: 1112 GQSQDEVAQAAEALR----HRGIDIYSVGIGDVDDQQLIQITGTAEKKLTVHNFDEL-KK 1166 Query: 192 LSSSLRSVSRSTPGTE 207 ++ + +T G Sbjct: 1167 VNKRIVRNICTTAGES 1182 Score = 95.8 bits (237), Expect = 9e-19, Method: Composition-based stats. Identities = 34/176 (19%), Positives = 66/176 (37%), Gaps = 18/176 (10%) Query: 20 CPCILLLDVSGSMNGRPINELNAGLVTFRDELLADPLALKRVELGIVTFGPVHVEQ---- 75 LL+D SGS +E+ L +A +V +G V + + Sbjct: 435 ADIYLLIDGSGSTQATDFHEMKTFLSEVVGMFN---IAPHKVRVGAVQYADSWDLEFEIN 491 Query: 76 ---PFTSAANFFPPILFAQGDTPMGAAITKALDMVEERKREYRANGISYYRPWIFLITDG 132 I G+T GAA+ L ++++ K++ R N + + + ++T+G Sbjct: 492 KYSNKQDLGKAIENIRQMGGNTNTGAALNFTLSLLQKAKKQ-RGNKVPCH---LVVLTNG 547 Query: 133 APTDEWQAAANKVFRGEEDKRFAFFSIGVQGADMKTLAQISVRQPLPLQGLQFREL 188 D AN++ ++IG++ A+ L +I+ + F L Sbjct: 548 MSKDSILEPANRLREEH----IRVYAIGIKEANQTQLREIAGEEKRVYYVHDFDAL 599 Score = 95.8 bits (237), Expect = 1e-18, Method: Composition-based stats. Identities = 33/175 (18%), Positives = 66/175 (37%), Gaps = 19/175 (10%) Query: 19 RCPCILLLDVSGSMNGRPINELNAGLVTFRDELLADPLALKRVELGIVTFGPVHVEQ--- 75 + + L+D SGS+ +++ + + RV++G+V F ++ E+ Sbjct: 620 KADIMFLVDSSGSIGPENFSKMKTFMKNLVS---KSQIGPDRVQIGVVQFSDINKEEFQL 676 Query: 76 ----PFTSAANFFPPILFAQGDTPMGAAITKALDMVEERKREYRANGISYYRPWIFLITD 131 + +N + T G+A++ K R ++ LITD Sbjct: 677 NRFMSQSDISNAIDQMAHIGQTTLTGSALSFVSQYFSPTKGA-----RPNIRKFLILITD 731 Query: 132 GAPTDEWQAAANKVFRGEEDKRFAFFSIGVQGADMKTLAQISVRQPLPLQGLQFR 186 G D + A + + +S+GV G+++ L +IS R + F Sbjct: 732 GEAQDIVKEPAVVLRQE----GVIIYSVGVFGSNVTQLEEISGRPEMVFYVENFD 782 Score = 95.0 bits (235), Expect = 2e-18, Method: Composition-based stats. Identities = 29/192 (15%), Positives = 66/192 (34%), Gaps = 18/192 (9%) Query: 20 CPCILLLDVSGSMNGRPINELNAGLVTFRDELLADPLALKRVELGIVTFGPVHVEQPFTS 79 + ++D SGS++ N + ++ + +V G + + + Sbjct: 808 LDVVFVIDSSGSIDYDEYNIMKDFMIGLV---KKADVGKNQVRFGALKYADDPEVLFYLD 864 Query: 80 -------AANFFPPILFAQGDTPMGAAITKALDMVEERKREYRANGISYYRPWIFLITDG 132 + G T A+ + M E + G+ + +ITDG Sbjct: 865 DFGTKLEVISVLQNDQAMGGSTYTAEALGFSDHMFTEARGSRLNKGVP---QVLIVITDG 921 Query: 133 APTDEWQAAANKVFRGEEDKRFAFFSIGVQGADMKTLAQISV---RQPLPLQGLQFRELF 189 D + N + DK ++G+ GA+ L ++ + + +F Sbjct: 922 ESHDADK--LNATAKALRDKGILVLAVGIDGANPVELLAMAGSSDKYFFVETFGGLKGIF 979 Query: 190 SWLSSSLRSVSR 201 S +++S+ + S+ Sbjct: 980 SDVTASVCNSSK 991 Score = 62.7 bits (151), Expect = 8e-09, Method: Composition-based stats. Identities = 32/164 (19%), Positives = 51/164 (31%), Gaps = 20/164 (12%) Query: 20 CPCILLLDVS--GSMNGRPINELNAGLVTFRDELLADPLALKRVELGIVTFGPVHVEQP- 76 + LLD+S GS + L L L + + +G+V + Sbjct: 228 ADVVFLLDMSINGS--EENFDYLKGFLEESVSAL---DIKENCMRVGLVAYSNETKVINS 282 Query: 77 ------FTSAANFFPPILFAQGDTPMGAAITKALDMVEERKREYRANGISYYRPWIFLIT 130 + + G GAAI K V + R N L+T Sbjct: 283 LSMGINKSEVLQHIQNLSPRTGKAYTGAAIKKLRKEVFSARNGSRKNQ--GVPQIAVLVT 340 Query: 131 DGAPTDEWQAAANKVFRGEEDKRFAFFSIGVQGADMKTLAQISV 174 D AA + R F++G++GA L +I+ Sbjct: 341 HRDSEDNVTKAAVNLRRE----GVTIFTLGIEGASDTQLEKIAS 380 Score = 58.4 bits (140), Expect = 2e-07, Method: Composition-based stats. Identities = 26/200 (13%), Positives = 58/200 (29%), Gaps = 27/200 (13%) Query: 20 CPCILLLDVS---GSMNGRPINELNAGLVTFRDELLADPLALKRVELGIVTFGPVHVEQP 76 + L+D S GS + + + L P+ + + + + + Sbjct: 26 ADVVFLVDSSDRLGS---KSFPFVKMFITKMISSL---PIEADKYRVALAQYSD-KLHSE 78 Query: 77 F--------TSAANFFPPIL-FAQGDTPMGAAITKALDMVEERKREYRANGISYYRPWIF 127 F + N F G +G A+ +A R + P + Sbjct: 79 FHLSTFKGRSPMLNHLRKNFGFIGGSLQIGKALQEAHRTYFSAPANGRD--KKQFPPILV 136 Query: 128 LITDGAPTDEWQAAANKVFRGEEDKRFAFFSIGVQGADMKTLAQISVRQPLPL--QGLQF 185 ++ + E + + + S+GVQ A + L ++ Q Sbjct: 137 VL----ASSESEDNVEEASKALRKDGVKIISVGVQKASEENLKAMATSQFHFNLRTVRDL 192 Query: 186 RELFSWLSSSLRSVSRSTPG 205 ++ ++ V + G Sbjct: 193 SMFSQNMTHIIKDVIKYKEG 212 >UniRef50_UPI000155C0BC PREDICTED: hypothetical protein n=1 Tax=Ornithorhynchus anatinus RepID=UPI000155C0BC Length = 2392 Score = 104 bits (260), Expect = 2e-21, Method: Composition-based stats. Identities = 33/192 (17%), Positives = 69/192 (35%), Gaps = 18/192 (9%) Query: 20 CPCILLLDVSGSMNGRPINELNAGLVTFRDELLADPLALKRVELGIVTFGPVHVEQ---- 75 + ++D SGS++ N + A + D + + +V+ G + + Sbjct: 806 LDIVFVIDSSGSIDSNEYNIMKAFM---IDLVKKADVGKNQVQFGALKYSDFPEVLFNLN 862 Query: 76 ---PFTSAANFFPPILFAQGDTPMGAAITKALDMVEERKREYRANGISYYRPWIFLITDG 132 + +F G T A+ + + E G+ + +ITDG Sbjct: 863 EFSSKSEIISFIQNDHPRGGSTYTAKALAHSAHLFSESLGSRMHRGVP---QVLIVITDG 919 Query: 133 APTDEWQAAANKVFRGEEDKRFAFFSIGVQGADMKTLAQISVRQPLPLQGLQF---RELF 189 D N R DK ++G++GA+ + L ++ F + +F Sbjct: 920 ESHDAH--LLNATARALRDKGILVLAVGIEGANHEELLSMAGSTDRYFFVENFEGLKGIF 977 Query: 190 SWLSSSLRSVSR 201 +S+S+ + S+ Sbjct: 978 ENVSASVCNTSK 989 Score = 94.7 bits (234), Expect = 2e-18, Method: Composition-based stats. Identities = 34/176 (19%), Positives = 56/176 (31%), Gaps = 18/176 (10%) Query: 20 CPCILLLDVSGSMNGRPINELNAGLVTFRDELLADPLALKRVELGIVTFGP-------VH 72 LL+D SGS+ E+ L P +V G V + + Sbjct: 433 ADIYLLIDGSGSIQVADFQEMKRFLAEVIGMFNIGPH---KVRFGAVQYSHLWEWEFEMD 489 Query: 73 VEQPFTSAANFFPPILFAQGDTPMGAAITKALDMVEERKREYRANGISYYRPWIFLITDG 132 I G+T GAA+ K L + + R + ++TDG Sbjct: 490 RYSNKNDLVKAVENIRQLGGNTDTGAALDKMLPLFQ----RARQQRARKVPQHLVVLTDG 545 Query: 133 APTDEWQAAANKVFRGEEDKRFAFFSIGVQGADMKTLAQISVRQPLPLQGLQFREL 188 D + A ++ ++IGV+ A+ L +I+ F L Sbjct: 546 LSHDSVREPAGRLRGD----NINVYAIGVKEANHTQLEEIAGSDSRVYYVHNFDSL 597 Score = 94.3 bits (233), Expect = 2e-18, Method: Composition-based stats. Identities = 35/174 (20%), Positives = 66/174 (37%), Gaps = 19/174 (10%) Query: 20 CPCILLLDVSGSMNGRPINELNAGLVTFRDELLADPLALKRVELGIVTFGPVHVEQ---- 75 + L+D SGS+ G ++ + + + +V++G+V F ++ E Sbjct: 619 ADIMFLVDSSGSIGGDNFEKMKTFMKNVVNRTK---IGANQVQVGLVQFSDINKEGFQLN 675 Query: 76 --PFTSAANFFPPILFAQG-DTPMGAAITKALDMVEERKREYRANGISYYRPWIFLITDG 132 + + L G T +G A+T D K + ++ L+TDG Sbjct: 676 QYDTKTKISDAIDGLSLIGRGTLIGGALTFVSDYFSVSKGA-----RPNVKKFLVLLTDG 730 Query: 133 APTDEWQAAANKVFRGEEDKRFAFFSIGVQGADMKTLAQISVRQPLPLQGLQFR 186 D + AA + + +S+GV G++ L +IS R + F Sbjct: 731 KSQDAVKEAAVALRQD----GVIIYSVGVFGSEYSQLEEISGRSDMVFYVENFD 780 Score = 94.3 bits (233), Expect = 3e-18, Method: Composition-based stats. Identities = 42/190 (22%), Positives = 69/190 (36%), Gaps = 17/190 (8%) Query: 20 CPCILLLDVSGSMNGRPINELNAGLVTFRDELLADPLALKRVELGIVTFGPVHV----EQ 75 + L+D S S+ ++ LVT ++ + +V +G+ F + Sbjct: 997 ADLVFLIDGSTSILEEDFKKMKDFLVTIVNDF---DIRPGKVHVGLAQFSHEYRPEFSLI 1053 Query: 76 PFTS---AANFFPPILFAQGDTPMGAAITKALDMVEERKREYRANGISYYRPWIFLITDG 132 PF N I G+T +GAA+ G+ + + ++TDG Sbjct: 1054 PFRDKIEVKNQIGRIQQIFGNTLIGAALRNVGSYFWPDFGSRINAGV---QQVLLVLTDG 1110 Query: 133 APTDEWQAAANKVFRGEEDKRFAFFSIGVQGADMKTLAQISVRQPLPLQGLQFRELFSWL 192 DE AA + +K +S+GV + + L QIS L F EL Sbjct: 1111 QSQDEVAQAAEDLR----NKGIDIYSLGVGQVNDQQLIQISGSAKKKLTVDNFSELDKIK 1166 Query: 193 SSSLRSVSRS 202 +R V S Sbjct: 1167 KRVVRDVCTS 1176 Score = 61.5 bits (148), Expect = 2e-08, Method: Composition-based stats. Identities = 33/164 (20%), Positives = 53/164 (32%), Gaps = 20/164 (12%) Query: 20 CPCILLLDVSGSMNG--RPINELNAGLVTFRDELLADPLALKRVELGIVTFGPVHVEQP- 76 + L+D S +NG L LV D + + +G+V + Sbjct: 226 VDIVFLVDES--VNGTDENFEHLKGFLVETIDSF---DVKENCMRIGLVMYSNETKLVSR 280 Query: 77 -----FTSAANFFPPILFAQGDTPM-GAAITKALDMVEERKREYRANGISYYRPWIFLIT 130 S L + + GAAI + R R + LIT Sbjct: 281 LGTGTNKSDILQQIDGLSPKAGRALTGAAINVTRKEIFSRGAGSRKSQ--GVLQITVLIT 338 Query: 131 DGAPTDEWQAAANKVFRGEEDKRFAFFSIGVQGADMKTLAQISV 174 + D AA + R F++G++GA+ L QI+ Sbjct: 339 HRSSEDNVSEAALSLRRE----GVTVFAVGIEGANETQLDQIAS 378 Score = 55.7 bits (133), Expect = 1e-06, Method: Composition-based stats. Identities = 32/194 (16%), Positives = 67/194 (34%), Gaps = 29/194 (14%) Query: 20 CPCILLLDVSGSMNGRPINELNAGLVTFRDELLADPLALKRVELGIVTFGPV-------H 72 + L+D S ++ + + + + L P+ + + + + + Sbjct: 16 ADVVFLVDSSDNLGNKAFPFVKTFVNKMINAL---PIEASKYRIALAQYSDDLHSEFQLN 72 Query: 73 VEQPFTSAANFFPPILFAQGDTP-MGAAITKALDMVEERKREYRANGISYYRPWIFLITD 131 + N +G +P +G A+ KA R + P + ++ Sbjct: 73 TFKSKNPMLNHVKKNFAFRGGSPRLGLALQKAHKTYFSGLTNGRDP--KRFPPVLVVLAS 130 Query: 132 GAPTDEWQAAANKVFRGEEDKRFAFFSIGVQGADMKTLAQISVRQPLPLQGLQFRELFSW 191 G D+ +A A + R R S+G+Q A + L ++ Q F + Sbjct: 131 GPSEDDVEAPAKALQRD----RVKIISLGMQAASDRDLKAMATPQ------------FDF 174 Query: 192 LSSSLRSVSRSTPG 205 L ++R VS +P Sbjct: 175 LLRTIREVSMFSPN 188 >UniRef50_C7PTX8 von Willebrand factor type A n=1 Tax=Chitinophaga pinensis DSM 2588 RepID=C7PTX8_CHIPD Length = 462 Score = 104 bits (259), Expect = 2e-21, Method: Composition-based stats. Identities = 35/192 (18%), Positives = 61/192 (31%), Gaps = 29/192 (15%) Query: 13 ASNPEPRCPCILLLDVSGSMNGRPINELNAGLVTFRDELLADPLALKRVELGIVTFGPVH 72 AS P L+LD SGSM+G I D+L + L IV + Sbjct: 74 ASKPRVPLNISLVLDRSGSMSGDKIKYARQAAKFLIDQLNSTD------HLSIVNYDDRV 127 Query: 73 VEQPFT------SAANFFPPILFAQGDTPMGAAITKALDMVEERKREYRANGISYYRPWI 126 + A + +G T + + + V+ ++E Y + Sbjct: 128 EVTSPSQSVKNKEALKAAIDKIHDRGSTNLSGGMLEGYTQVKSTRKE-------GYVNRV 180 Query: 127 FLITDGAPTDEWQAAANKVFRGEEDK----RFAFFSIGVQ-GADMKTLAQISV----RQP 177 L+TDG ++ R E+K A + GV + L ++ Sbjct: 181 LLLTDGLANQGITDPL-ELKRLAENKYKEDGIALSTFGVGADYNEDLLTMLAENGRANYY 239 Query: 178 LPLQGLQFRELF 189 + ++F Sbjct: 240 FIDSPDKIPQIF 251 >UniRef50_Q4TBC0 Chromosome undetermined SCAF7164, whole genome shotgun sequence n=1 Tax=Tetraodon nigroviridis RepID=Q4TBC0_TETNG Length = 1636 Score = 104 bits (259), Expect = 2e-21, Method: Composition-based stats. Identities = 36/196 (18%), Positives = 69/196 (35%), Gaps = 17/196 (8%) Query: 16 PEPRCPCILLLDVSGSMNGRPINELNAGLVTFRDELLADPLALKRVELGIVTFGPVHVEQ 75 + + + LLD SGS+ + + ++ ++ V +G+ F ++ Sbjct: 1041 EKQKADLVFLLDQSGSIQSDDYTTMKKFTIDLINKFQ---ISRDLVHVGLAQFSSTFKDE 1097 Query: 76 -------PFTSAANFFPPILFAQGDTPMGAAITKALDMVEERKREYRANGISYYRPWIFL 128 + + + +G T +G A+ E +A GIS + L Sbjct: 1098 FYLNKFFDEQAISAHIKDMQQEEGGTLIGLALNSIRKYFEASHGSRKAEGIS---QNLVL 1154 Query: 129 ITDGAPTDEWQAAANKVFRGEEDKRFAFFSIGVQGADMKTLAQISVRQPLPLQGLQFREL 188 ITDG D+ + AA + F+IG+ L QI+ F +L Sbjct: 1155 ITDGDSQDDVEEAARLLRGL----GVEVFAIGIGNVHDLELLQIAGTPENVFTVKNFDKL 1210 Query: 189 FSWLSSSLRSVSRSTP 204 + ++ +S P Sbjct: 1211 EGIHQKVVDTICQSKP 1226 Score = 104 bits (259), Expect = 3e-21, Method: Composition-based stats. Identities = 39/201 (19%), Positives = 71/201 (35%), Gaps = 19/201 (9%) Query: 20 CPCILLLDVSGSMNGRPINELNAGLVTFRDELLADPLALKRVELGIVTFGPVHVE----- 74 L+D SGS++ ++ ++ F P V +G+V + Sbjct: 458 ADIFFLIDQSGSIHPPDFYDMKKFILEFLQTFRVGPN---HVRIGVVKYADSPTLEFDLH 514 Query: 75 --QPFTSAANFFPPILFAQGDTPMGAAITKALDMVEERKREYRANGISYYRPWIFLITDG 132 S I G T G A+ D + + + ++ +ITDG Sbjct: 515 TYTDVKSLEKAITNIHQVGGGTETGKAL----DFMRPQFDRAVTTRGHKVKEYLVVITDG 570 Query: 133 APTDEWQAAANKVFRGEEDKRFAFFSIGVQGADMKTLAQISVRQPLPLQGLQFRELFSWL 192 TD+ + A+K+ ++IGV+ A K L +IS F L + Sbjct: 571 NSTDKVKDPADKLRAQ----GVVVYAIGVKDAVEKELLEISGEPQRTFYVNNFDAL-KPI 625 Query: 193 SSSLRSVSRSTPGTEVVLEAP 213 + + ST G+++ L + Sbjct: 626 KDDIITDICSTDGSDLSLLST 646 Score = 97.4 bits (241), Expect = 3e-19, Method: Composition-based stats. Identities = 31/175 (17%), Positives = 58/175 (33%), Gaps = 19/175 (10%) Query: 21 PCILLLDVSGSMNGRPINELNAGLVTFRDELLADPLALKRVELGIVTFGPV-HVEQPFT- 78 I L+D SGS+ ++ + + + +V +G++ + + P Sbjct: 654 DLIFLIDSSGSIYPEDYQKMKDFMKSLVQ---KSNIGKDQVHVGVLQYSTEQKLVFPLIQ 710 Query: 79 -----SAANFFPPILFAQGDTPMGAAITKALDMVEERKREYRANGISYYRPWIFLITDGA 133 + + G T G AI + + G + + ++TDG Sbjct: 711 YYTKDQLSKAIDDMQQIGGGTHTGEAIAVVSKYFD-----AQNGGRPDLKQRLVVVTDGE 765 Query: 134 PTDEWQAAANKVFRGEEDKRFAFFSIGVQGADMKTLAQISVRQPLPLQGLQFREL 188 D+ + A + K +SIGV A+ L +IS F L Sbjct: 766 SQDDVKLPAEALRA----KGVIVYSIGVVAANTSQLLEISGDADRMYAERDFDAL 816 Score = 78.5 bits (192), Expect = 2e-13, Method: Composition-based stats. Identities = 32/195 (16%), Positives = 66/195 (33%), Gaps = 19/195 (9%) Query: 15 NPEPRCPCILLLDVSGSMNGRP-INELNAGLVTFRDELLADPLALKRVELGIVTFGPV-- 71 + I L+DVS S+ + + + + + + G++TF Sbjct: 847 KKTAQADIIFLVDVSTSILKEKAFPSVTVFMESVVN---QSSVGPELTRFGVITFSTGVQ 903 Query: 72 -----HVEQPFTSAANFFPPILFAQGDTPMGAAITKALDMVEERKREYRANGISYYRPWI 126 + G+T G A+ +L + A + + Sbjct: 904 SIFTLKQYSSKRDVLQAVGAVTAPGGNTNTGDALDYSLQYFGKEHGGRAALKVP---QIL 960 Query: 127 FLITDGAPTDEWQAAANKVFRGEEDKRFAFFSIGVQGADMKTLAQISVRQPLPLQGLQFR 186 +ITDGA + + + + FSIGV+ A + L ++ P + F Sbjct: 961 MVITDGAAQEPSK--LPGPSEALRKQGVSVFSIGVKNASREQLDIMAGNDPSRVF---FV 1015 Query: 187 ELFSWLSSSLRSVSR 201 + F L + +++S+ Sbjct: 1016 DTFDALETLYKNISK 1030 Score = 73.5 bits (179), Expect = 5e-12, Method: Composition-based stats. Identities = 27/118 (22%), Positives = 40/118 (33%), Gaps = 8/118 (6%) Query: 91 QGDTPMGAAITKALDMVEERKREYRANGISYYRPWIFLITDGAPTDEWQAAANKVFRGEE 150 G T G AI +K RA+ +ITDG TD+ A ++ + Sbjct: 83 NGGTETGKAINFLRKQYFTKKAGSRADQR--VPQIAVVITDGDSTDDVVVPARELRK--- 137 Query: 151 DKRFAFFSIGVQGADMKTLAQISVRQP--LPLQGLQFRELFSWLSSSLRSVSRSTPGT 206 F+IGV A+ L I+ R F+ L L ++ S Sbjct: 138 -HGVIVFAIGVGNANQGELKSIANRPSERFKFTIDSFQALKRLTERLLETMCVSMEDQ 194 Score = 66.9 bits (162), Expect = 5e-10, Method: Composition-based stats. Identities = 25/120 (20%), Positives = 42/120 (35%), Gaps = 12/120 (10%) Query: 20 CPCILLLDVSGSMNGRPINELNAGLVTFRDELLADPLALKRVELGIVTFGPVH------- 72 + ++D SGS+ + L + L P RV +GIV + Sbjct: 337 ADIVFIIDESGSIGSSDFQLVRTFLHSLVSGLEVSPN---RVRVGIVVYHGEPKAEVFLN 393 Query: 73 VEQPFTSAANFFPPILFAQGDTPMGAAITKALDMVEERKREYRANGISYYRPWIFLITDG 132 + +F + + G T GAA+ V R++ R + +ITDG Sbjct: 394 TFTDKSELLDFIRILPYHGGGTNTGAALNFTQHQVFVREKGSRIE--LGVQQVAVVITDG 451 >UniRef50_A0CDA1 Chromosome undetermined scaffold_17, whole genome shotgun sequence n=1 Tax=Paramecium tetraurelia RepID=A0CDA1_PARTE Length = 604 Score = 104 bits (259), Expect = 2e-21, Method: Composition-based stats. Identities = 38/207 (18%), Positives = 66/207 (31%), Gaps = 34/207 (16%) Query: 20 CPCILLLDVSGSMNGRPINELNAGLVTFRDELLADPLALKRVELGIVTFGPV-------- 71 + ++D SGSM+G I + L + L L ++ F Sbjct: 185 VDLLCVIDRSGSMSGEKIEMVKQTLNILLNFLGPKD------RLCLIQFDDTCQRLTNLR 238 Query: 72 HVEQPFTSAANFFPPILFAQGDTPMGAAITKALDMVEERKREYRANGISYYRPWIFLITD 131 V + + ++A G T +G AL ++ RK IF+++D Sbjct: 239 RVTDENKTYYSDIISKIYANGGTVIGLGTQMALKQIKYRKSVNNVTA-------IFVLSD 291 Query: 132 GAPTDEWQAAANKVFRGEEDKR--FAFFSIGVQ-GADMKTLAQISVRQPLPLQGLQ---- 184 G +AA + + + + S G D K + +IS + Sbjct: 292 GQ----DEAAISSLQKQLAYYKQTLTIHSFGFGSDHDAKLMTKISNLGKGSFYFVNNISL 347 Query: 185 FRELFSWLSSSLRSVSRSTPGTEVVLE 211 E F +L S + LE Sbjct: 348 LDEFFVDALGALT--SMVVTDISINLE 372 >UniRef50_B9MS10 von Willebrand factor type A n=2 Tax=Anaerocellum thermophilum DSM 6725 RepID=B9MS10_ANATD Length = 1188 Score = 104 bits (259), Expect = 3e-21, Method: Composition-based stats. Identities = 37/206 (17%), Positives = 69/206 (33%), Gaps = 29/206 (14%) Query: 4 QITFATSDFASNPEPR-CPCILLLDVSGSMN-GRPINELNAGLVTFRDELLADPLALKRV 61 ++ + N + + +LD SGSM+ P +F D L+ Sbjct: 481 EVPINKGEREINQQVNYIDLVFVLDSSGSMSWNDPNGYRKIAAKSFVDALIQGD------ 534 Query: 62 ELGIVTFGP-VHVEQPFTS---AANFFPPILFAQGDTPMGAAITKALDMVEERKREYRAN 117 +V F ++ QP T+ A + + G T + I A + R E R Sbjct: 535 RAAVVDFDNFGYLLQPLTTDFQAVKNAIDRIDSWGGTNIAEGIRIANQQLISRSSEDR-- 592 Query: 118 GISYYRPWIFLITDGAPTDEWQAAANKVFRGEEDKRFAFFSIGVQ-GADMKTLAQISVRQ 176 I L+TDG + N + ++ ++IG+ D L I+ + Sbjct: 593 -----IKVIILLTDGEGYYD-----NNLTTEAKNNGITIYTIGLGTSVDENLLRDIATQT 642 Query: 177 PLPL----QGLQFRELFSWLSSSLRS 198 Q ++F ++ + Sbjct: 643 GGMYFPVSSASQLPQVFKRITEIVTE 668 >UniRef50_A0E9B3 Chromosome undetermined scaffold_84, whole genome shotgun sequence n=1 Tax=Paramecium tetraurelia RepID=A0E9B3_PARTE Length = 603 Score = 104 bits (259), Expect = 3e-21, Method: Composition-based stats. Identities = 43/215 (20%), Positives = 81/215 (37%), Gaps = 34/215 (15%) Query: 17 EPRCPCILLLDVSGSMNGRPINELNAGLVTFRDELLADPLALKRVELGIVTFGP-VHVEQ 75 P + ++DVSGSM GR IN + L L + + I+ F H+ Sbjct: 117 RPPIDLVCVVDVSGSMIGRKINLVKDSLRYLMKILGPED------RICIIVFTTVAHIVT 170 Query: 76 PFTSAANFFPP-----ILFAQG--DTPMGAAITKALDMVEERKREYRANGISYYRPWIFL 128 F P IL +G T + + KAL M++ RK + + IFL Sbjct: 171 SFIRNTQENKPLLKKAILELKGLASTNISDGMNKALWMLKNRKYKNPVS-------CIFL 223 Query: 129 ITDGAPTDEWQAA----ANKVFRGEEDKRFAFFSIGVQ-GADMKTLAQIS----VRQPLP 179 ++DG D+++ A +++ + +++F + G D + QI+ Sbjct: 224 LSDGQ--DDYKGAEQRVFDQLQLLKIEEKFVIHTFGYGQDHDAYVMNQIAKYREGNFYYI 281 Query: 180 LQGLQFRELFSWLSSSLRSVSRSTPGTEVVLEAPK 214 + + F + + +S + L++ Sbjct: 282 DNINKASDYF--ILAMSGMLSIYAQNVSINLKSND 314 >UniRef50_Q24FW2 von Willebrand factor type A domain containing protein n=1 Tax=Tetrahymena thermophila SB210 RepID=Q24FW2_TETTH Length = 1074 Score = 104 bits (259), Expect = 3e-21, Method: Composition-based stats. Identities = 45/210 (21%), Positives = 75/210 (35%), Gaps = 29/210 (13%) Query: 11 DFASNPEPRCPCILLLDVSGSMNGRPINELNAGLVTFRDELLADPLALKRVELGIVTFGP 70 D + P I ++D SGSM+G IN L L+ D+L LG+V F Sbjct: 355 DAKAYQRPPIDLICVMDNSGSMHGEKINMLKETLLYLIDQLDEKD------RLGLVLFNS 408 Query: 71 VHVEQPFTSAA-------NFFPPILFAQGDTPMGAAITKALDMVEERKREYRANGISYYR 123 +P S + + AQG T + +T+A ++ RK Sbjct: 409 EVTFRPMKSMDTTNKLKLKQYISDIRAQGGTDINLGMTEAFKFIKTRKYCNPVTS----- 463 Query: 124 PWIFLITDGAPTDEWQAAANKVFRGEEDKRFAFFSIGVQ-GADMKTLAQISV-RQPLPLQ 181 +FL++DG + A + +++F+ G D + QI Q Sbjct: 464 --VFLLSDGLDSKAQDRVAVTLKNMSINEQFSINCFGFGRDHDPILMNQIKKIDQVDMFF 521 Query: 182 GLQFRELFSWLSS-------SLRSVSRSTP 204 LFS + S++ +S+ Sbjct: 522 VDALGGLFSVIGQDVLIKVKSVKELSKDVN 551 >UniRef50_A2E0T6 von Willebrand factor type A domain containing protein n=1 Tax=Trichomonas vaginalis RepID=A2E0T6_TRIVA Length = 753 Score = 103 bits (258), Expect = 3e-21, Method: Composition-based stats. Identities = 33/170 (19%), Positives = 51/170 (30%), Gaps = 12/170 (7%) Query: 12 FASNPEPRCPCILLLDVSGSMNGRPINELNAGLVTFRDELLADPLALKRVELGIVTFGPV 71 F + ++D SGSM G I + L L I+ FG Sbjct: 234 FEGKVQANTEFYFIIDCSGSMYGSRIKNAKSCLNVLLHSL------PIGCRFSIIKFG-T 286 Query: 72 HVEQPFTSAANFFPPILFAQGDTPMGAAITKALDMVEERKREYRANGISYYRPWIFLITD 131 E + A + A DM+ K Y +FL+TD Sbjct: 287 KFEVALEPCDYTDENMSKAMHQLDLIDADMCGNDMISPLKYISEHPQKKDYIKQVFLLTD 346 Query: 132 GAPTDEWQAAANKVFRGEEDKRFAFFSIGVQ-GADMKTLAQISVRQPLPL 180 G D+ + V + F F+IG+ AD + ++ Sbjct: 347 GE--DDRISICAMVQANRD--NFRVFTIGIGSDADRNLIIDVARNGSGRY 392 >UniRef50_UPI0001B4AD96 von Willebrand factor type A n=1 Tax=Bacteroides sp. 2_1_7 RepID=UPI0001B4AD96 Length = 194 Score = 103 bits (258), Expect = 3e-21, Method: Composition-based stats. Identities = 37/183 (20%), Positives = 73/183 (39%), Gaps = 10/183 (5%) Query: 22 CILLLDVSGSMNGRPINELNAGLVTFRDELLADPLALKRVELGIVTF--GPVHVEQPFTS 79 +LLD SGSM+G I+ LN + +L K +++ +++F + + Sbjct: 3 LYILLDTSGSMDGSKISALNDSMENIIIDLQEKAFNGKNIDIVVLSFARDVTWMHDKPIN 62 Query: 80 AANFFPPILFAQGDTPMGAAITKALDMVEERKREYRANGISYYRPWIFLITDGAPTDEWQ 139 +F L A G T +G A + + I L++DG PTD++ Sbjct: 63 ILDFNWKPLTASGMTSLGKACCELAKNISTYPANNENTA-------IVLLSDGCPTDDYD 115 Query: 140 AAANKVFRGEEDKRFAFFSIGVQG-ADMKTLAQISVRQPLPLQGLQFRELFSWLSSSLRS 198 ++ + F+I + AD++TL + Q + L L++ + + Sbjct: 116 EGIMELRNLQTFNDADKFAIALGDNADLQTLIRFVDVQENIFIENKADRLIDALNTIMGN 175 Query: 199 VSR 201 ++ Sbjct: 176 ITN 178 >UniRef50_A5UYN7 von Willebrand factor, type A n=1 Tax=Roseiflexus sp. RS-1 RepID=A5UYN7_ROSS1 Length = 459 Score = 103 bits (258), Expect = 4e-21, Method: Composition-based stats. Identities = 34/219 (15%), Positives = 69/219 (31%), Gaps = 28/219 (12%) Query: 14 SNPEPRCPCILLLDVSGSMNGRPINELNAGLVTFRDELLADPLALKRVELGIVTFGP--- 70 P + +LDVSGSM+G + L L + +VTF Sbjct: 85 EQHRPPLHLVAVLDVSGSMSGTKLASAKEALRQALHFLQDGDV------FSLVTFSDQVQ 138 Query: 71 -----VHVEQPFTSAANFFPPILFAQGDTPMGAAITKALDMVEERKREYRANGISYYRPW 125 Q + A G T + + + +D+ +++++ Sbjct: 139 THLKAESYAQRKRDKMENLLDEIRASGMTALDGGLAQGIDLGQKKRQATT---------L 189 Query: 126 IFLITDGAPTDEWQAA--ANKVFRGEEDKRFAFFSIGVQ-GADMKTLAQISVRQPLPLQG 182 + L++DG + ++GV + + +I+ + Sbjct: 190 VLLLSDGQANVGETDLEKIGLRAQKARQSGLIVSTLGVGLDYNEALMVEIANQGGGRFYH 249 Query: 183 LQ-FRELFSWLSSSLRSVS-RSTPGTEVVLEAPKGWTSV 219 +Q ++ + L L S + + EV + P G V Sbjct: 250 IQEGSQIPAALMQELGSAAMLAARQVEVEFDLPSGAALV 288 >UniRef50_D1HHA4 Whole genome shotgun sequence of line PN40024, scaffold_125.assembly12x (Fragment) n=1 Tax=Vitis vinifera RepID=D1HHA4_VITVI Length = 630 Score = 103 bits (258), Expect = 4e-21, Method: Composition-based stats. Identities = 38/218 (17%), Positives = 62/218 (28%), Gaps = 34/218 (15%) Query: 17 EPRCPCILLLDVSGSMNGRPINELNAGLVTFRDELLADPLALKRVELGIVTFGP-VHVEQ 75 + +LDVSGSM G ++ L + L L IV+F Sbjct: 202 RAPIDLVAVLDVSGSMAGSKLSLLKRAVCFLIQNLGPSD------RLSIVSFSSTARRIF 255 Query: 76 PFTSAANF-------FPPILFAQGDTPMGAAITKALDMVEERKREYRANGISYYRPWIFL 128 P ++ L + G T + + K + ++EER + I L Sbjct: 256 PLRRMSDNGREAAGLAINSLTSSGGTNIVEGLKKGVRVLEERSEQNPVAS-------IIL 308 Query: 129 ITDGAPTDEWQAAANKVF--------RGEEDKRFAFFSIGVQ-GADMKTLAQIS----VR 175 ++DG T + R + G D + IS Sbjct: 309 LSDGKDTYNCDNVNRRQTSHCASSNPRQGRQAIIPVHTFGFGSDHDSTAMHAISDESGGT 368 Query: 176 QPLPLQGLQFRELFSWLSSSLRSVSRSTPGTEVVLEAP 213 ++ F+ L SV V +P Sbjct: 369 FSFIESVATVQDAFAMCIGGLLSVVAQELRLTVKSVSP 406 >UniRef50_A0C051 Chromosome undetermined scaffold_14, whole genome shotgun sequence n=3 Tax=Paramecium tetraurelia RepID=A0C051_PARTE Length = 636 Score = 103 bits (258), Expect = 4e-21, Method: Composition-based stats. Identities = 43/217 (19%), Positives = 69/217 (31%), Gaps = 27/217 (12%) Query: 15 NPEPRCPCILLLDVSGSMNGRPINELNAGLVTFRDELLADPLALKRVELGIVTF-GPVHV 73 N I L+D+SGSM G I + A L+ L + L ++TF H Sbjct: 155 NQRVGVDLICLIDISGSMIGVKIEMVKASLIVLLQFLGDND------RLQLITFDNDAHR 208 Query: 74 EQPFTSAANFF-------PPILFAQGDTPMGAAITKALDMVEERKREYRANGISYYRPWI 126 P + N + A G + A A ++ RK + Sbjct: 209 LTPLKTVTNQNKSYFTQIIKQIKANGGNRISEATKMAFYQLKSRKYINNVTS-------V 261 Query: 127 FLITDGAPTDEWQAAANKVFRGEEDKRFAFFSIGVQGADMKTLAQI----SVRQPLPLQG 182 FL++DG + N++ E F G + D + + Q+ S Sbjct: 262 FLLSDGVDYTYPE-VKNQIQTVNEVFTLHTFGFG-EDHDAQMMTQLCNLKSGSFYFVQDV 319 Query: 183 LQFRELFSWLSSSLRSVSRSTPGTEVVLEAPKGWTSV 219 E F+ L SV + AP + + Sbjct: 320 TLLDEFFADALGGLISVVGEQLEITLSSSAPPPYQDI 356 >UniRef50_A8TX70 Collagen alpha-5(VI) chain n=18 Tax=Eutheria RepID=CO6A5_HUMAN Length = 2615 Score = 103 bits (257), Expect = 4e-21, Method: Composition-based stats. Identities = 29/178 (16%), Positives = 63/178 (35%), Gaps = 18/178 (10%) Query: 18 PRCPCILLLDVSGSMNGRPINELNAGLVTFRDELLADPLALKRVELGIVTFGPVHVEQ-- 75 L+D S S+ + ++ ++ + P +V +G+V + + Sbjct: 439 KEADIHFLIDGSSSIQEKQFEQIKRFMLEVTEMFSIGP---DKVRVGVVQYSDDTEVEFY 495 Query: 76 -----PFTSAANFFPPILFAQGDTPMGAAITKALDMVEERKREYRANGISYYRPWIFLIT 130 I G T G A+ L +++ + R + + Y + ++T Sbjct: 496 ITDYSNDIDLRKAIFNIKQLTGGTYTGKALDYILQIIKN-GMKDRMSKVPCY---LIVLT 551 Query: 131 DGAPTDEWQAAANKVFRGEEDKRFAFFSIGVQGADMKTLAQISVRQPLPLQGLQFREL 188 DG TD A ++ + ++G+ A+ L +I+ ++ G F L Sbjct: 552 DGMSTDRVVEPAKRLRAE----QITVHAVGIGAANKIELQEIAGKEERVSFGQNFDAL 605 Score = 98.5 bits (244), Expect = 2e-19, Method: Composition-based stats. Identities = 30/181 (16%), Positives = 64/181 (35%), Gaps = 15/181 (8%) Query: 20 CPCILLLDVSGSMNGRPINELNAGLVTFRDELLADPLALKRVELGIVTFGPVHVEQPFTS 79 + +LD SGS+ + + + + + + RV+ G + + + + Sbjct: 813 LDVVFVLDHSGSIKKQYQDHM---INLTIHLVKKADVGRDRVQFGALKYSDQPNILFYLN 869 Query: 80 AANFFPPILF-------AQGDTPMGAAITKALDMVEERKREYRANGISYYRPWIFLITDG 132 + I+ G+T A+ A + E + + + + +ITDG Sbjct: 870 TYSNRSAIIENLRKRRDTGGNTYTAKALKHANALFTEE---HGSRIKQNVKQMLIVITDG 926 Query: 133 APTDEWQAAANKVFRGEEDKRFAFFSIGVQGADMKTLAQISVRQPLPLQGLQFRELFSWL 192 D Q N +K F++GV A+ K L ++ + + F +L Sbjct: 927 ESHDHDQ--LNDTALELRNKGITIFAVGVGKANQKELEGMAGNKNNTIYVDNFDKLKDVF 984 Query: 193 S 193 + Sbjct: 985 T 985 Score = 94.3 bits (233), Expect = 3e-18, Method: Composition-based stats. Identities = 30/175 (17%), Positives = 57/175 (32%), Gaps = 19/175 (10%) Query: 19 RCPCILLLDVSGSMNGRPINELNAGLVTFRDELLADPLALKRVELGIVTFGP-VHVEQPF 77 + + L+D S S+ ++ + ++ + + ++G+V F E Sbjct: 626 KADIMFLVDSSWSIGNENFRKMKIFMKNLLTKIQ---IGADKTQIGVVQFSDKTKEEFQL 682 Query: 78 ------TSAANFFPPILFAQGDTPMGAAITKALDMVEERKREYRANGISYYRPWIFLITD 131 ++ + T G A+ K + ++ LITD Sbjct: 683 NRYFTQQEISDAIDRMSLINEGTLTGKALNFVGQYFTHSKGA-----RLGAKKFLILITD 737 Query: 132 GAPTDEWQAAANKVFRGEEDKRFAFFSIGVQGADMKTLAQISVRQPLPLQGLQFR 186 G D+ + A + K FS+GV A+ L +IS L F Sbjct: 738 GVAQDDVRDPARILR----GKDVTIFSVGVYNANRSQLEEISGDSSLVFHVENFD 788 Score = 64.2 bits (155), Expect = 3e-09, Method: Composition-based stats. Identities = 23/196 (11%), Positives = 60/196 (30%), Gaps = 19/196 (9%) Query: 20 CPCILLLDVSGSMNGRPINELNAGLVTFRDELLADPLALKRVELGIVTFGPV-HVEQPFT 78 + L+D S + + + + + L P+ + + + + H E + Sbjct: 29 ADVVFLVDSSDHLGPKSFPFVKTFINKMINSL---PIEANKYRVALAQYSDEFHSEFHLS 85 Query: 79 S-------AANFFPPILFAQGDTPMGAAITKALDMVEERKREYRANGISYYRPWIFLITD 131 + + F G +G A+ +A R + P + ++ Sbjct: 86 TFKGRSPMLNHLKKNFQFIGGSLQIGKALQEAHRTYFSAPINGRD--RKQFPPILVVL-- 141 Query: 132 GAPTDEWQAAANKVFRGEEDKRFAFFSIGVQGADMKTLAQISVRQPLPL--QGLQFRELF 189 + E + + + + S+GVQ A + L ++ Sbjct: 142 --ASAESEDEVEEASKALQKDGVKIISVGVQKASEENLKAMATSHFHFNLRTIRDLSTFS 199 Query: 190 SWLSSSLRSVSRSTPG 205 ++ ++ V++ G Sbjct: 200 QNMTQIIKDVTKYKEG 215 Score = 59.2 bits (142), Expect = 9e-08, Method: Composition-based stats. Identities = 28/178 (15%), Positives = 53/178 (29%), Gaps = 17/178 (9%) Query: 18 PRCPCILLLDVSGSMNGRPINELNAGLVTFRDELLADPLALKRVELGIVTFGPV------ 71 I L D S ++ + L D + +R+++G+ FG Sbjct: 1002 QEADVIFLCDGSDRVSNSDFVTMTTFLSDLIDNF---DIQSQRMKIGMAQFGSNYQSIIE 1058 Query: 72 -HVEQPFTSAANFFPPILFAQGDTPMGAAITKALDMVEERKREYRANGISYYRPWIFLIT 130 T + + G + A+ K +M R G+ + +IT Sbjct: 1059 LKNSLTKTQWKTQIQNVSKSGGFPRIDFALKKVSNMFNLHAGGRRNAGVP---QTLVVIT 1115 Query: 131 DGAPTDEWQAAANKVFRGEEDKRFAFFSIGVQGADMKTLAQISVRQPLPLQGLQFREL 188 G P + A + D +G+ + L I+ + F +L Sbjct: 1116 SGDPRYDVADAVKTLK----DLGICVLVLGIGDVYKEHLLPITGNSEKIITFQDFDKL 1169 Score = 50.0 bits (118), Expect = 5e-05, Method: Composition-based stats. Identities = 30/163 (18%), Positives = 53/163 (32%), Gaps = 19/163 (11%) Query: 20 CPCILLLDVS-GSMNGRPINELNAGLVTFRDELLADPLALKRVELGIVTFGPVHVEQPF- 77 + L+D S G G + L L + + + LG++++ F Sbjct: 235 ADLVFLVDESLG--TGGNLRHLQTFLENITSSM---DVKENCMRLGLMSYSNSAKTISFL 289 Query: 78 -----TSAANFFPPILFAQ-GDTPMGAAITKALDMVEERKREYRANGISYYRPWIFLITD 131 S L Q G + GAAI + + Y + L+T Sbjct: 290 KSSTTQSEFQQQIKNLSIQVGKSNTGAAIDQMRR--DGFSESYGSRRAQGVPQIAVLVTH 347 Query: 132 GAPTDEWQAAANKVFRGEEDKRFAFFSIGVQGADMKTLAQISV 174 P+D+ A + F++ +QGA+ L +I Sbjct: 348 -RPSDDEVHDAAL---NLRLEDVNVFALSIQGANNTQLEEIVS 386 Score = 48.0 bits (113), Expect = 2e-04, Method: Composition-based stats. Identities = 28/179 (15%), Positives = 56/179 (31%), Gaps = 33/179 (18%) Query: 20 CPCILLLDVS---GSMNGRPINELNAGLVTFRDELLADPL---ALKRVELGIVTFGP--- 70 L+D S GS E+ A + + D P + + ++++ P Sbjct: 2290 MDVAFLIDASQRVGS---DEFKEVKAFITSVLDYFHIAPTPLTSTLGDRVAVLSYSPPGY 2346 Query: 71 --------VHVEQPFTS------AANFFPPILFAQGDTPMGAAITKALDMVEERKREYRA 116 V++E + + GD +G A+ +D V R Sbjct: 2347 MPNTEECPVYLEFDLVTYNSIHQMKHHLQDSQQLNGDVFIGHALQWTIDNVFVGTPNLRK 2406 Query: 117 NGISYYRPWIFLITDGAPTDEWQAAANKVFRGEEDKRFAFFSIGVQ-GADMKTLAQISV 174 N IF+I+ G + V + + ++ F + K L +++ Sbjct: 2407 N------KVIFVISAGETNSLDKDVLRNVSLRAKCQGYSIFVFSFGPKHNDKELEELAS 2459 Score = 46.1 bits (108), Expect = 7e-04, Method: Composition-based stats. Identities = 21/170 (12%), Positives = 53/170 (31%), Gaps = 17/170 (10%) Query: 20 CPCILLLDVSGSMNGRPINELNAGLVTFRDELL--ADPLALK-RVELGIVTFG------- 69 + L+D S ++ + A + + D +DPL + ++++ Sbjct: 1962 MDVVFLIDNSRNIAKDEFKAVKALVSSVIDNFNIASDPLISDSGDRIALLSYSPWESSRR 2021 Query: 70 ---PVHVEQPFTSAANF--FPPILFAQGDTPMGAAITKALDMVEERKREYRANGISYYRP 124 V E F + N + G A T ++ + + Sbjct: 2022 KMGTVKTEFDFITYDNQLLMKNHIQTSFQQLNGEA-TIGRALLWTTENLFPETPYLRKHK 2080 Query: 125 WIFLITDGAPTDEWQAAANKVFRGEEDKRFAFFSIGVQGADMKTLAQISV 174 IF+++ G E + + + + + F I + + +++ Sbjct: 2081 VIFVVSAGE-NYERKEFVKMMALRAKCQGYVIFVISLGSTRKDDMEELAS 2129 >UniRef50_C0HF51 Putative uncharacterized protein n=1 Tax=Zea mays RepID=C0HF51_MAIZE Length = 459 Score = 103 bits (257), Expect = 4e-21, Method: Composition-based stats. Identities = 42/209 (20%), Positives = 78/209 (37%), Gaps = 40/209 (19%) Query: 19 RCPCILLLDVSGSMNGRPINELNAGLVTFRDELLADPLALKRVELGIVTF-GPVHVEQPF 77 +++LD+SGSM G + + + F E L ++ L I+TF H Sbjct: 43 PLDIVVVLDISGSMRGTKLEHMKHAMTRFIIE----KLGIRGDRLAIITFESKAHKVFDL 98 Query: 78 TSAANFFPPI-------LFAQGDTPMGAAITKALDMVEERKREYRANGISYYRPWIFLIT 130 +S L A GDT + A + LD+++ R+ G S+ IFL++ Sbjct: 99 SSMLPDQVKKAVAVVEGLKAGGDTNIKAGLEAGLDVLKTRR------GHSHNASCIFLMS 152 Query: 131 DGAPT-DEWQAAANKVFRGEEDKRFAFFSIGVQG-ADMKTLAQIS--------------V 174 DG D+ + ++V + + G +D + L I+ Sbjct: 153 DGHENVDKARTLLDRVGEH------SVVTFGFGEKSDEQLLYDIAYHSHAGTYHHVREKE 206 Query: 175 RQPLPLQGLQFRELFSWLSSSLRSVSRST 203 + ++ F ++ +S V+ S Sbjct: 207 DENQLMKAFAFLAIYRSISMLDLKVTVSA 235 >UniRef50_A8L6A2 von Willebrand factor type A n=1 Tax=Frankia sp. EAN1pec RepID=A8L6A2_FRASN Length = 238 Score = 103 bits (257), Expect = 4e-21, Method: Composition-based stats. Identities = 40/156 (25%), Positives = 60/156 (38%), Gaps = 6/156 (3%) Query: 20 CPCILLLDVSGSMNGRPINELNAGLVTFRDELLADPLALKRVELGIVTF-GPVHVEQPFT 78 +L+D S SM+G P+ +N L + P V LG + F V Sbjct: 15 LAFYILVDASYSMSGAPMLAVNEILPEVISTIEQSPTLGDVVRLGALDFADDARVVLRLD 74 Query: 79 SAANFF-PPILFAQGDTPMGAAITKALDMVEERKREYRANGISYYRPWIFLITDGAPTDE 137 N P A+G T A + +E + + +G YRP +F ITDG PTD+ Sbjct: 75 DLRNIGGVPQFAARGGTSYAAGFRQLRKEIESDLAQLKGDGYKVYRPAVFFITDGEPTDD 134 Query: 138 WQA---AANKVFRGEEDKRFAFFSIG-VQGADMKTL 169 + A ++ R G V + +TL Sbjct: 135 QKDLDAAFAELTDANFRGRPNIIPFGVVSSVNKQTL 170 >UniRef50_UPI00016E1D39 UPI00016E1D39 related cluster n=1 Tax=Takifugu rubripes RepID=UPI00016E1D39 Length = 753 Score = 103 bits (257), Expect = 5e-21, Method: Composition-based stats. Identities = 41/208 (19%), Positives = 69/208 (33%), Gaps = 27/208 (12%) Query: 21 PCILLLDVSGSMNGRPINELNAGLVTFRDELLADPLALKRVELGIVTFGPVHVEQPFTSA 80 + L+D S S+ E+ L F L + ++++G+ + Q F Sbjct: 1 DIVFLVDGSSSIGTDNFQEVRLFLRNFTSGL---DIGPDKIQIGLAQYSNDP-HQEFLLK 56 Query: 81 ANFFPPILFAQ--------GDTPMGAAITKALDMVEERKREYRANGISYYRPWIFLITDG 132 + L A G T G AI ++ RAN +ITDG Sbjct: 57 DHMEKTALLAALDSFPYRTGGTETGKAIDFLRTQYFTKEAGSRANQR--VPQIAVVITDG 114 Query: 133 APTDEWQAAANKVFRGEEDKRFAFFSIGVQGADMKTLAQISVRQP--LPLQGLQFRE--- 187 TD+ A + + F+IGV A+ L I+ R P F+ Sbjct: 115 DSTDDVTVPAQSLRK----HGVIVFAIGVGNANQNELESIANRPPKRFKFTIDSFQALQR 170 Query: 188 ----LFSWLSSSLRSVSRSTPGTEVVLE 211 L + S++ ++ G+ Sbjct: 171 LTKGLLQTMCVSIKDQHQAEAGSRTDQS 198 Score = 55.0 bits (131), Expect = 2e-06, Method: Composition-based stats. Identities = 22/98 (22%), Positives = 36/98 (36%), Gaps = 10/98 (10%) Query: 56 LALKRVELGIVTFGPVHVE-----QPFTS--AANFFPPILFAQGDTPMGAAITKALDMVE 108 + + V +G+ F Q FT + + G T +G A+ + E Sbjct: 300 FSKEFVHVGLAQFSSSFQHEFYLNQFFTEQVISKHVMDLQQLGGGTNIGLALNSIREYFE 359 Query: 109 ERKREYRANGISYYRPWIFLITDGAPTDEWQAAANKVF 146 + GIS + LITDG D+ + AA + Sbjct: 360 ASRGSRSPEGIS---QNLVLITDGESQDDVEDAARLLR 394 >UniRef50_UPI000155D2F0 PREDICTED: similar to matrilin-3, partial n=1 Tax=Ornithorhynchus anatinus RepID=UPI000155D2F0 Length = 354 Score = 103 bits (257), Expect = 5e-21, Method: Composition-based stats. Identities = 33/167 (19%), Positives = 58/167 (34%), Gaps = 15/167 (8%) Query: 17 EPRCPCILLLDVSGSMNGRPINELNAGLVTFRDELLADPLALKRVELGIVTFGPV----- 71 + ++D S S+ R ++ L D L + + +V + Sbjct: 147 SRPLDLVFIVDSSRSVRPREFEKVKTFLSQVIDTL---DIGETATRVAVVNYASTVKVEF 203 Query: 72 --HVEQPFTSAANFFPPILFAQGDTPMGAAITKALDMVEERKREYRANGISYYRPWIFLI 129 S I T G AI A+D V + RA + + + + Sbjct: 204 HLQTHSDKESLKQAVSRIAPLATGTMSGLAIRTAMDEVFTVEAGARAPAFNIPKVVVIV- 262 Query: 130 TDGAPTDEWQAAANKVFRGEEDKRFAFFSIGVQGADMKTLAQISVRQ 176 TDG P D+ Q A + +++GV ADM++L Q++ Sbjct: 263 TDGRPQDQVQEAVAQAQA----SGIEIYAVGVGRADMQSLRQLASEP 305 >UniRef50_Q8C6K9 Collagen alpha-6(VI) chain n=26 Tax=cellular organisms RepID=CO6A6_MOUSE Length = 2265 Score = 103 bits (257), Expect = 5e-21, Method: Composition-based stats. Identities = 38/197 (19%), Positives = 70/197 (35%), Gaps = 17/197 (8%) Query: 13 ASNPEPRCPCILLLDVSGSMNGRPINELNAGLVTFRDELLADPLALKRVELGIVTFGPVH 72 + + L+D S S++ ++ LV+ + ++L RV +G+ F + Sbjct: 991 VDCEIEKVDLVFLMDGSNSIHPDDFQKMKGFLVSVVQDF---DVSLNRVRIGVAQFSDSY 1047 Query: 73 VEQPF-------TSAANFFPPILFAQGDTPMGAAITKALDMVEERKREYRANGISYYRPW 125 + + I G T +G A+ K + G Sbjct: 1048 RSEFLLGTFTGEREISTQIEGIQQIFGYTHIGDALRKVKYYFQPDMGSRINAGTP---QV 1104 Query: 126 IFLITDGAPTDEWQAAANKVFRGEEDKRFAFFSIGVQGADMKTLAQISVRQPLPLQGLQF 185 + ++TDG DE AA ++ K +S+G+ D + L QI+ L F Sbjct: 1105 LLVLTDGRSQDEVAQAAEELR----HKGVDIYSVGIGDVDDQELVQITGTAEKKLTVHNF 1160 Query: 186 RELFSWLSSSLRSVSRS 202 EL +R++ S Sbjct: 1161 DELKKVKKRIVRNICTS 1177 Score = 95.0 bits (235), Expect = 1e-18, Method: Composition-based stats. Identities = 34/175 (19%), Positives = 65/175 (37%), Gaps = 19/175 (10%) Query: 19 RCPCILLLDVSGSMNGRPINELNAGLVTFRDELLADPLALKRVELGIVTFGPVHVEQ--- 75 + + L+D SGS+ +++ + + RV++G+V F + E+ Sbjct: 619 KADIMFLVDSSGSIGPENFSKMKMFMKNLVS---KSQIGADRVQIGVVQFSHENKEEFQL 675 Query: 76 ----PFTSAANFFPPILFAQGDTPMGAAITKALDMVEERKREYRANGISYYRPWIFLITD 131 + AN + T G+A+T K R ++ LITD Sbjct: 676 NTFMSQSDIANAIDRMTHIGETTLTGSALTFVSQYFSPDKGA-----RPNVRKFLILITD 730 Query: 132 GAPTDEWQAAANKVFRGEEDKRFAFFSIGVQGADMKTLAQISVRQPLPLQGLQFR 186 G D + A + + +S+GV G+++ L +IS + + F Sbjct: 731 GEAQDIVRDPAIALRKE----GVIIYSVGVFGSNVTQLEEISGKPEMVFYVENFD 781 Score = 91.6 bits (226), Expect = 2e-17, Method: Composition-based stats. Identities = 30/192 (15%), Positives = 67/192 (34%), Gaps = 18/192 (9%) Query: 20 CPCILLLDVSGSMNGRPINELNAGLVTFRDELLADPLALKRVELGIVTFGPVHVEQPFTS 79 + ++D SGS++ + N + ++ + +V G + + + Sbjct: 807 LDVVFVIDSSGSIDYQEYNIMKDFMIGLV---KKADVGKNQVRFGALKYADDPEVLFYLD 863 Query: 80 -------AANFFPPILFAQGDTPMGAAITKALDMVEERKREYRANGISYYRPWIFLITDG 132 + G+T A+ + M E + G+ + +ITDG Sbjct: 864 ELGTKLEVVSVLQNDHPMGGNTYTAEALAFSDHMFTEARGSRLHKGVP---QVLIVITDG 920 Query: 133 APTDEWQAAANKVFRGEEDKRFAFFSIGVQGADMKTLAQISV---RQPLPLQGLQFRELF 189 D N + DK ++G+ GA+ L ++ + + +F Sbjct: 921 ESHD--AEKLNTTAKALRDKGILVLAVGIAGANSWELLAMAGSSDKYYFVETFGGLKGIF 978 Query: 190 SWLSSSLRSVSR 201 S +S+S+ + S+ Sbjct: 979 SDVSASVCNSSK 990 Score = 90.8 bits (224), Expect = 3e-17, Method: Composition-based stats. Identities = 35/176 (19%), Positives = 65/176 (36%), Gaps = 18/176 (10%) Query: 20 CPCILLLDVSGSMNGRPINELNAGLVTFRDELLADPLALKRVELGIVTFGPVHVEQ---- 75 LL+D SGS +E+ L +A +V +G V + + Sbjct: 434 ADIYLLIDGSGSTQPTDFHEMKTFLSEVVGMFN---IAPHKVRVGAVQYADTWDLEFEIS 490 Query: 76 ---PFTSAANFFPPILFAQGDTPMGAAITKALDMVEERKREYRANGISYYRPWIFLITDG 132 I G+T GAA+ L +++ K+E R + + + + ++T+G Sbjct: 491 KYSNKPDLGKAIENIRQMGGNTNTGAALNFTLKLLQRAKKE-RGSKVPCH---LVVLTNG 546 Query: 133 APTDEWQAAANKVFRGEEDKRFAFFSIGVQGADMKTLAQISVRQPLPLQGLQFREL 188 D A+K+ +IGV+ A+ L +I+ + +F L Sbjct: 547 MSRDSVLGPAHKLREE----NIRVHAIGVKEANQTQLREIAGEEKRVYYVHEFDAL 598 Score = 68.5 bits (166), Expect = 2e-10, Method: Composition-based stats. Identities = 38/182 (20%), Positives = 60/182 (32%), Gaps = 26/182 (14%) Query: 20 CPCILLLDVSGSM----NGRPINELNAGLVTFRDELLADPLALKRVELGIVTFGPVHVEQ 75 + LLD M + ++ L A L L + + +G+VT+ Sbjct: 227 ADVVFLLD----MAINGSQEDLDHLKAFLGESISAL---DIKENCMRVGLVTYSNETRVI 279 Query: 76 PFTSAANFFPPILFA-------QGDTPMGAAITKALDMVEERKREYRANGISYYRPWIFL 128 S N +L G GAA+ K + +R R N L Sbjct: 280 SSLSTGNNKTEVLQRIQDLSPQVGQAYTGAALRKTRKEIFSAQRGSRKNQ--GVPQIAVL 337 Query: 129 ITDGAPTDEWQAAANKVFRGEEDKRFAFFSIGVQGADMKTLAQISVRQPLPLQG--LQFR 186 +T A D AA + R F++G++GA+ L +I+ F Sbjct: 338 VTHRASEDNVTKAAVNLRRE----GVTIFTMGIEGANPDELEKIASHPAEQFTSKLGNFS 393 Query: 187 EL 188 EL Sbjct: 394 EL 395 Score = 60.0 bits (144), Expect = 5e-08, Method: Composition-based stats. Identities = 28/196 (14%), Positives = 60/196 (30%), Gaps = 19/196 (9%) Query: 20 CPCILLLDVSGSMNGRPINELNAGLVTFRDELLADPLALKRVELGIVTFGPV-HVEQPFT 78 + L+D S + + + + L P+ + + + + H E Sbjct: 25 ADVVFLVDSSDHLGLKSFPLVKTFIHKMISSL---PIEANKYRVALAQYSDALHNEFQLG 81 Query: 79 SAANFFP--PILF-----AQGDTPMGAAITKALDMVEERKREYRANGISYYRPWIFLITD 131 + N P L G +G A+ +A R + P + ++ Sbjct: 82 TFKNRNPMLNHLKKNFGFIGGSLKIGNALQEAHRTYFSAPTNGRD--KKQFPPILVVLAS 139 Query: 132 GAPTDEWQAAANKVFRGEEDKRFAFFSIGVQGADMKTLAQISVRQPLPL--QGLQFRELF 189 D+ + AA + S+GVQ A + L ++ Q Sbjct: 140 AESEDDVEEAAKALRED----GVKIISVGVQKASEENLKAMATSQFHFNLRTARDLSVFA 195 Query: 190 SWLSSSLRSVSRSTPG 205 ++ ++ V++ G Sbjct: 196 PNMTEIIKDVTQYREG 211 >UniRef50_D2R2I7 von Willebrand factor type A n=2 Tax=Planctomycetaceae RepID=D2R2I7_9PLAN Length = 786 Score = 103 bits (257), Expect = 5e-21, Method: Composition-based stats. Identities = 31/171 (18%), Positives = 54/171 (31%), Gaps = 26/171 (15%) Query: 22 CILLLDVSGSMNGRPINELNAGLVTFRDELLADPLALKRVELGIVT--------FGPVHV 73 I ++D SGSM G+ I + + + L V F Sbjct: 309 VIFVVDRSGSMQGKKIEQAREAMRYVLNNLHEGDTFNIVAYDSTVESFKPELQKFDDATR 368 Query: 74 EQPFTSAANFFPPILFAQGDTPMGAAITKALDMVEERKREYRANGISYYRPWIFLITDGA 133 + + L+A G T + A+ A M+ R +I +TDG Sbjct: 369 KSAL-----AYVDGLYAGGSTNISGALDSAFAMLTGSDRPN----------YILFLTDGL 413 Query: 134 PTDEWQAA--ANKVFRGEEDKRFAFFSIGVQ-GADMKTLAQISVRQPLPLQ 181 PT ++ + + R + GV + + L ++S Q Sbjct: 414 PTAGETNEGKIVELAKQKNVHRARMINFGVGYDVNSRLLDRMSRENFGQSQ 464 >UniRef50_Q7ZX63 Matn2-prov protein n=16 Tax=Euteleostomi RepID=Q7ZX63_XENLA Length = 589 Score = 103 bits (256), Expect = 5e-21, Method: Composition-based stats. Identities = 30/181 (16%), Positives = 58/181 (32%), Gaps = 17/181 (9%) Query: 17 EPRCPCILLLDVSGSMNGRPINELNAGLVTFRDELLADPLALKRVELGIVTFGPV----- 71 + ++D S S+ ++ L+T L + +G++ +G Sbjct: 48 NKPLDLVFIIDSSRSVRPADFEKVKEFLITMLKFL---DIGPDTTRVGLLQYGSTVKNEF 104 Query: 72 --HVEQPFTSAANFFPPILFAQGDTPMGAAITKALDMVEERKREYRANGISYYRPWIFLI 129 + + + ++ T G AI A+++ R R + + Sbjct: 105 SLKMYKRKSDIERAVKRMMHLATGTMTGLAIQYAMNIAFSEAEGARPLNQYVPRIAMIV- 163 Query: 130 TDGAPTDEWQAAANKVFRGEEDKRFAFFSIGVQGADMKTLAQISVRQ--PLPLQGLQFRE 187 TDG P D + ++ F+IGV DM TL I F + Sbjct: 164 TDGRPQDPVE----EISAKARMSGILIFAIGVGRVDMSTLKTIGSEPHSEHVFLVANFSQ 219 Query: 188 L 188 + Sbjct: 220 I 220 >UniRef50_B1HPN0 Putative uncharacterized protein n=2 Tax=Bacillaceae RepID=B1HPN0_LYSSC Length = 825 Score = 103 bits (256), Expect = 5e-21, Method: Composition-based stats. Identities = 40/232 (17%), Positives = 75/232 (32%), Gaps = 32/232 (13%) Query: 2 SEQITFATSDFASNPEPRCP---CILLLDVSGSMNGRPINELNAGLVTFRDELLADPLAL 58 + T + + + P +++LD SGSM+G + + L + Sbjct: 346 TPIETLLPVEMEIKGKEQLPSLGLVIVLDRSGSMSGSKLELAKEAAARSVEMLRDEDT-- 403 Query: 59 KRVELGIVTFGPVHVEQ----PFTSAANFFPPIL--FAQGDTPMGAAITKALDMVEERKR 112 LG + F E P + IL G T + ++ KA + + + K Sbjct: 404 ----LGFIAFDDRPWEIIETGPLNNKEEAVDTILSVTPGGGTEIYGSLAKAYENLADMKL 459 Query: 113 EYRANGISYYRPWIFLITDGAPTDEWQAAANKVFRGEEDKRFAFFSIGVQ-GADMKTLAQ 171 + R I L+TDG + + +D ++ + AD L Sbjct: 460 Q---------RKHIILLTDGQSQ---PGNYDDLIEQGKDNGITLSTVAIGQDADANLLEA 507 Query: 172 ISV-RQPLPLQGLQFRELFSWLSSSLRSVSRSTPGTEVVLEA---PKGWTSV 219 +S + + + S LS +SR+ GW ++ Sbjct: 508 LSEMGSGRFYNVIDEQTIPSILSRETAMISRTYIEDNPFYPVLYNAGGWNTL 559 Score = 50.0 bits (118), Expect = 6e-05, Method: Composition-based stats. Identities = 23/123 (18%), Positives = 43/123 (34%), Gaps = 21/123 (17%) Query: 16 PEPRCPCILLLDVSGSMNGRPINELNAGLVTFRDELLADPLALKRVELGIVTFGPVHVEQ 75 P + L+D S SMNG +E+ + + LA G+ +F + Sbjct: 22 PIKEEQIVYLVDRSASMNGTE-DEMVQFIQDSLQSKKDEQLA------GLYSFSSTLQTE 74 Query: 76 PFTSAANFFPPI---LFAQGDTPMGAAITKALDMVEERKREYRANGISYYRPWIFLITDG 132 + P + A T + ++ A +++ +K + L+TDG Sbjct: 75 AIMTKTLKEVPKFTEIKATDQTNIEQSLQLATGIIDPKKA-----------TRLVLLTDG 123 Query: 133 APT 135 T Sbjct: 124 NET 126 >UniRef50_Q32NR2 MGC130922 protein n=3 Tax=Tetrapoda RepID=Q32NR2_XENLA Length = 840 Score = 103 bits (256), Expect = 6e-21, Method: Composition-based stats. Identities = 30/181 (16%), Positives = 57/181 (31%), Gaps = 17/181 (9%) Query: 17 EPRCPCILLLDVSGSMNGRPINELNAGLVTFRDELLADPLALKRVELGIVTFGPV----- 71 + ++D S S+ ++ L+T L + +G++ +G Sbjct: 48 NKPMDLVFIIDSSRSVRPADFEKVKEFLITMLKFL---DIGPDNTRVGLLQYGSTVKNEF 104 Query: 72 --HVEQPFTSAANFFPPILFAQGDTPMGAAITKALDMVEERKREYRANGISYYRPWIFLI 129 + ++ T G AI A+++ R R + + Sbjct: 105 SLKTYKRKPDIERAVKRMMHLATGTMTGLAIQYAMNIAFSEAEGARPLNQYVPRIAMIV- 163 Query: 130 TDGAPTDEWQAAANKVFRGEEDKRFAFFSIGVQGADMKTLAQISVRQ--PLPLQGLQFRE 187 TDG P D ++ + F+IGV DM TL I + F + Sbjct: 164 TDGRPQDP----VAEIAAKARNSGILIFAIGVGRVDMSTLKTIGSQPHSEHVFLVANFSQ 219 Query: 188 L 188 + Sbjct: 220 I 220 Score = 96.2 bits (238), Expect = 7e-19, Method: Composition-based stats. Identities = 36/202 (17%), Positives = 66/202 (32%), Gaps = 22/202 (10%) Query: 19 RCPCILLLDVSGSMNGRPINELNAGLVTFRDELLADPLALKRVELGIVTFGPVHVEQPFT 78 + ++D S S+ + + D L ++ K +G++ + HV FT Sbjct: 569 PVDLVFVIDGSKSLGEDNFEIVKQFVKGILDSL---EISQKAARVGLIQY-STHVRTEFT 624 Query: 79 --------SAANFFPPILFAQGDTPMGAAITKALDMVEERKREYRANGISYYRPWIFLIT 130 I + + G A+ + + RA + R I T Sbjct: 625 MAQYSSAKDVKKAVSQIKYMGRGSMTGLALKLMHEKSFSEAQGARARPMRVPRVAIVF-T 683 Query: 131 DGAPTDEWQAAANKVFRGEEDKRFAFFSIGVQGADMKTLAQISV--RQPLPLQGLQFREL 188 DG DE A K + ++IG+ A + L +I+ ++ + F + Sbjct: 684 DGRAQDEVSEYAEKAKQ----SGITIYAIGIGKAIDEELQEIASAPQEKHVIYAEDFSAM 739 Query: 189 ---FSWLSSSLRSVSRSTPGTE 207 L SS+ TP Sbjct: 740 GYIMEKLKSSMCEGWSKTPDVS 761 >UniRef50_Q9U7P4 Putative uncharacterized protein (Fragment) n=1 Tax=Eufolliculina uhligi RepID=Q9U7P4_9CILI Length = 494 Score = 103 bits (256), Expect = 6e-21, Method: Composition-based stats. Identities = 30/224 (13%), Positives = 64/224 (28%), Gaps = 31/224 (13%) Query: 8 ATSDFASNPEPRCPCILLLDVSGSMNGRPINELNAGLVTFRDELLADPLALKRVELGIVT 67 + + + + ++DVSGSM G I + L + L + +++ Sbjct: 77 SPAQTSEASRSGVDIVCVIDVSGSMQGEKIQLVQTTLNFMVERLSPAD------RICLIS 130 Query: 68 FGPVHVEQP--------FTSAANFFPPILFAQGDTPMGAAITKALDMVEERKREYRANGI 119 F + P L A G T + + L + +R+ + + Sbjct: 131 FSNDATKISRLVQMSPKGKKQLKSMIPRLVASGGTNIVGGLEYGLQALRQRRTINQLSS- 189 Query: 120 SYYRPWIFLITDGAPTDEWQAAANKVFRGEEDKRFA----FFSIGVQ-GADMKTLAQIS- 173 I L++DG + + + + G G D L ++ Sbjct: 190 ------IILLSDGQDNNG-TTVLQRAKATMDSIVIRDDYSVHTFGYGHGHDSTLLNALAE 242 Query: 174 ---VRQPLPLQGLQFRELFSWLSSSLRSVSRSTPGTEVVLEAPK 214 F+ L SV +++ + + Sbjct: 243 PKNGAFYYVKDEETIATAFANCLGELMSVVADQIEVKLMTQPTE 286 >UniRef50_UPI0000F2DDBB PREDICTED: similar to Inter-alpha (globulin) inhibitor H4 (plasma Kallikrein-sensitive glycoprotein) n=1 Tax=Monodelphis domestica RepID=UPI0000F2DDBB Length = 819 Score = 103 bits (256), Expect = 6e-21, Method: Composition-based stats. Identities = 41/199 (20%), Positives = 75/199 (37%), Gaps = 18/199 (9%) Query: 11 DFASNPEPRCP--CILLLDVSGSMNGRPINELNAGLVTFRDELLADPLALKRVELGIVTF 68 +FA P P + L+D SGSM GR I + A L+ D+L + G VT Sbjct: 250 NFAPTQLPMVPKNIVFLIDKSGSMAGRKIKKTKAALIKILDDLKPEDHFNMITFSGHVTR 309 Query: 69 GPVHVEQPFTS---AANFFPPILFAQGDTPMGAAITKALDMVEERKREYRANGISYYRPW 125 + A F A G T + A+ A+ M++E ++ S Sbjct: 310 WKPELVLALDEHLKEAKTFLSNTPALGVTNVNGAVLAAVSMLDESNKKKELPEGSV--SM 367 Query: 126 IFLITDGAPTDEWQAA--ANKVFRGEEDKRFAFFSIGVQ-GADMKTLAQISVRQPLPLQ- 181 I L+TDG T+ ++ + ++ F +G + L ++++ + Sbjct: 368 IILLTDGDSTEGETKLQKIHENVKAAIRGQYHLFCLGFGFDINYVFLERLALDNGGMARH 427 Query: 182 -------GLQFRELFSWLS 193 LQ ++ + ++ Sbjct: 428 IFEGLDAELQLQDFYQEVA 446 >UniRef50_UPI000174639B hypothetical protein VspiD_18615 n=1 Tax=Verrucomicrobium spinosum DSM 4136 RepID=UPI000174639B Length = 868 Score = 102 bits (255), Expect = 7e-21, Method: Composition-based stats. Identities = 29/196 (14%), Positives = 63/196 (32%), Gaps = 29/196 (14%) Query: 5 ITFATSDFASNPEPRCPCILLLDVSGSMNGRPINELNAGLVTFRDELLADPLALKRVELG 64 + + L++D SGSM+G + + + + L + +G Sbjct: 398 LPVRLKAPDEEEKQSSALALVIDRSGSMSGEKLEMAKSAAIATAEVLTRNDS------IG 451 Query: 65 IVTFGP-VHVEQPFTSAANF-----FPPILFAQGDTPMGAAITKALDMVEERKREYRANG 118 + F HV P T + L + G T + A T+A + ++ K + + Sbjct: 452 VYAFDSEAHVVVPMTRLTSSSAVAGQIAGLTSGGGTNLHPAFTEARNALQRTKAKIKH-- 509 Query: 119 ISYYRPWIFLITDGAPTDEWQAAANKVFRGEEDKRFAFFSIGVQG-ADMKTLAQIS---- 173 + ++TDG + + + ++ + A + L I+ Sbjct: 510 -------MIILTDGQTSG---QGYEALASQCRAEGVTISTVAIGDGAHVGLLQAIASLGG 559 Query: 174 VRQPLPLQGLQFRELF 189 + L +F Sbjct: 560 GKSYTTLDAANIVRIF 575 >UniRef50_UPI000180D2FB PREDICTED: similar to inter-alpha (globulin) inhibitor H5 n=1 Tax=Ciona intestinalis RepID=UPI000180D2FB Length = 1586 Score = 102 bits (255), Expect = 8e-21, Method: Composition-based stats. Identities = 47/239 (19%), Positives = 76/239 (31%), Gaps = 41/239 (17%) Query: 2 SEQITFATSDFASNPEPRCP-----CILLLDVSGSMNGRPINELNAGLVTFRDELLADPL 56 + ++ S FA P + L+DVSGSM G I+++ + T L Sbjct: 923 TTEMVIDQSYFAHFITSNLPPMSKRVVFLIDVSGSMFGIKIDQVRQAMNTILHGLAETD- 981 Query: 57 ALKRVELGIVTFGP---------------VHVEQPFTSAANFFPPILFAQGDTPMGAAIT 101 ++ F SA NF + +G T + A+ Sbjct: 982 -----FFSVIAFNSSVSRWSPSGTAAVLASGTTANINSAMNFLNTTVVTRGGTDILQAVE 1036 Query: 102 KALDMVEERKREYRANGISYYRPWIFLITDGAPTDEW--QAAANKVFRGEEDKRFAFFSI 159 A+ + + + + L+TDG PTD A R RF +I Sbjct: 1037 AAIQLFDSAATGGTNTASDF----MVLLTDGRPTDGTVSSTAIISAIRNLNRGRFGINTI 1092 Query: 160 GVQG-ADMKTLAQISVRQPLPL--------QGLQFRELFSWLSSSLRSVSRSTPGTEVV 209 G DM L +I+ + Q + +S + S + T EV Sbjct: 1093 GFGTLVDMNLLRKIAAQNSGTSIQIFIDLNSYAQISNFYEEISQPILSNTTMTYEQEVD 1151 Score = 46.1 bits (108), Expect = 7e-04, Method: Composition-based stats. Identities = 25/139 (17%), Positives = 38/139 (27%), Gaps = 16/139 (11%) Query: 22 CILLLDVSGSMNGRPINELNAGLVTFRDELLADPLALKRVELGIVTFGPVHV----EQPF 77 +D +GSM+G I +E L R + +V F V Sbjct: 194 LAFAIDDTGSMSGE-IRAAKQRAKMIIEE-RQGSLDEPRDFV-LVPFNDPTVGPITVTSN 250 Query: 78 TSAANFFPPILFAQGDTPMGAAITKALDMVEERKREYRANGISYYRPWIFLITDGAPTDE 137 L A G G A AL + + +F+ITD D Sbjct: 251 PDTFKASIDRLHAHGG---GDAPELALRGILLAIENSQEGST------VFVITDVDAKDI 301 Query: 138 WQAAANKVFRGEEDKRFAF 156 + + + F Sbjct: 302 ELQDVVVAQARQRNIKITF 320 >UniRef50_A6DLI0 Putative uncharacterized protein n=1 Tax=Lentisphaera araneosa HTCC2155 RepID=A6DLI0_9BACT Length = 833 Score = 102 bits (254), Expect = 9e-21, Method: Composition-based stats. Identities = 41/214 (19%), Positives = 75/214 (35%), Gaps = 27/214 (12%) Query: 3 EQITFATSDFA-SNPEPRCPCILLLDVSGSMNGRPINELNAGLVTFRDELLADPLALKRV 61 E++ TS + +P +L++D SGSMNG+PI + L + R Sbjct: 394 EEVLPVTSRYEKEKEQPSLALVLVIDKSGSMNGQPIVLAREASKAAAELLSS------RD 447 Query: 62 ELGIVTF-GPVHVEQPFTSAANF-----FPPILFAQGDTPMGAAITKALDMVEERKREYR 115 ++G++ F G + TSAAN + A G T + A+ DM+ + + Sbjct: 448 QVGVIAFDGSAKLVTDLTSAANKGEVLSQIDGIGAGGGTNLYPAMVMGRDMLGIASAKIK 507 Query: 116 ANGISYYRPWIFLITDGAPTDEWQAAANKVFRGEEDKRFAFFSIGVQ-GADMKTLAQISV 174 + +++DG + ++ + GA + +A I+ Sbjct: 508 H---------MIVLSDGQSQGGD---FEGISSELAQMGVTISTVSLGQGAAVDLMAAIAQ 555 Query: 175 RQPLPLQG-LQFRELFSWLSSSLRSVSRSTPGTE 207 E+ + SRS E Sbjct: 556 IGNGRAYVTNNAEEMPRIFTKETMEASRSAIKEE 589 >UniRef50_Q5NIW0 Matrilin 3b n=19 Tax=Clupeocephala RepID=Q5NIW0_DANRE Length = 478 Score = 102 bits (254), Expect = 1e-20, Method: Composition-based stats. Identities = 28/178 (15%), Positives = 57/178 (32%), Gaps = 15/178 (8%) Query: 6 TFATSDFASNPEPRCPCILLLDVSGSMNGRPINELNAGLVTFRDELLADPLALKRVELGI 65 + + ++D S S+ ++ L + L + + + Sbjct: 189 PTVPAPAEPCKSRPLDLVFIIDSSRSVRPAEFEKVKIFLSEMVNSL---DIGSDATRVAL 245 Query: 66 VTFGPVHVEQ-------PFTSAANFFPPILFAQGDTPMGAAITKALDMVEERKREYRANG 118 V + + F I T G AI A++ V R Sbjct: 246 VNYASTVNIEFHLKKYFSKAEVKQAFSRIDPLSTGTMTGMAIKTAMEQVFTENAGARPLK 305 Query: 119 ISYYRPWIFLITDGAPTDEWQAAANKVFRGEEDKRFAFFSIGVQGADMKTLAQISVRQ 176 + I + TDG P D+ + +V +++GV A+M++L Q++ + Sbjct: 306 KGIGKVAIIV-TDGRPQDKVE----EVSAAARASGIEIYAVGVDRAEMRSLKQMASQP 358 >UniRef50_UPI0000EB12CB UPI0000EB12CB related cluster n=1 Tax=Canis lupus familiaris RepID=UPI0000EB12CB Length = 2186 Score = 101 bits (253), Expect = 1e-20, Method: Composition-based stats. Identities = 36/193 (18%), Positives = 70/193 (36%), Gaps = 19/193 (9%) Query: 4 QITFATSDFASNPE-PRCPCILLLDVSGSMNGRPINELNAGLVTFRDELLADPLALKRVE 62 ++ S+ P L+D S S+N ++ ++ + +V+ Sbjct: 364 NLSILDSECNDKPHTKEADIYFLIDGSTSINTEGFEQIKQFMLAVTGMFS---IGSDKVQ 420 Query: 63 LGIVTFGPVHVEQPFT-------SAANFFPPILFAQGDTPMGAAITKALDMVEERKREYR 115 G V + + + I QG+T G A+ L ++++ R Sbjct: 421 AGAVQYSDKIRVEFYINASSNDMDLRKAILNIEQLQGNTHTGKALDFMLSIIKKD----R 476 Query: 116 ANGISYYRPWIFLITDGAPTDEWQAAANKVFRGEEDKRFAFFSIGVQGADMKTLAQISVR 175 + IS + ++TDG DE A ++ D++ ++G+ AD L QI+ Sbjct: 477 KHRISEIPCHLIVLTDGKSQDEVLKPAERLR----DEQITIHAVGIGEADKIQLQQIAGE 532 Query: 176 QPLPLQGLQFREL 188 + G F L Sbjct: 533 EERVNFGQNFDSL 545 Score = 101 bits (252), Expect = 1e-20, Method: Composition-based stats. Identities = 39/177 (22%), Positives = 68/177 (38%), Gaps = 19/177 (10%) Query: 19 RCPCILLLDVSGSMNGRPINELNAGLVTFRDELLADPLALKRVELGIVTFGPVHVEQP-- 76 + + L+D SGS+ ++ + ++ P ++G+V F ++ E+ Sbjct: 585 KADIMFLVDSSGSIGHDNFGKMKTFMKNLLAKIQIGP---DSTQIGVVQFSDINQEEFQL 641 Query: 77 ---FT--SAANFFPPILFAQGDTPMGAAITKALDMVEERKREYRANGISYYRPWIFLITD 131 FT ++ + T G+A+T K + ++ LITD Sbjct: 642 NKYFTQNETSDAIDRMSLINRGTLTGSALTFVGQYFTPTKGARTK-----VKKFLILITD 696 Query: 132 GAPTDEWQAAANKVFRGEEDKRFAFFSIGVQGADMKTLAQISVRQPLPLQGLQFREL 188 G D + A + DK FS+GV GA+ L +IS L Q F +L Sbjct: 697 GEAQDPVRDPAKALR----DKGVVIFSVGVYGANRTQLEEISGDSSLVFQVENFDDL 749 Score = 93.9 bits (232), Expect = 4e-18, Method: Composition-based stats. Identities = 25/179 (13%), Positives = 57/179 (31%), Gaps = 13/179 (7%) Query: 15 NPEPRCPCILLLDVSGSMNGRPINELNAGLVTFRDELLADPLALKRVELGIVTFGPVHVE 74 + +LD SGS+ + + + + + RV +G + + Sbjct: 787 KNIKVLDIVFVLDHSGSIGTQEQESM---MNLTIHLVKKADVDSDRVRVGALKYSDYPEV 843 Query: 75 Q-----PFTSAANFFPPILFAQGDTPMGAAITKALDMVEERKREYRANGISYYRPWIFLI 129 ++ + G T A+ A +++ + R + + +I Sbjct: 844 LFYLSGNKSAVIEHLRRRRYTSGHTYTARALEHA-NIMFTEEYGSRIQQN--VKQMLIII 900 Query: 130 TDGAPTDEWQAAANKVFRGEEDKRFAFFSIGVQGADMKTLAQISVRQPLPLQGLQFREL 188 TDG D + + +K +++GV A+ L ++ + F L Sbjct: 901 TDGVSHD--RDNLSDTASKLRNKGINIYAVGVGQANQLELETMAGNKSNTFHVDNFSNL 957 Score = 58.4 bits (140), Expect = 2e-07, Method: Composition-based stats. Identities = 22/169 (13%), Positives = 51/169 (30%), Gaps = 19/169 (11%) Query: 21 PCILLLDVSGSMNGRPINELNAGLVTFRDELLADPLALKRVELGIVTFGPVHVEQPFT-- 78 + L+D S + + + + + L P+ + + + + + F Sbjct: 1 DVVFLVDSSNHLGTKSFPFVKTFISKIINSL---PIEAHKYRVALAQYSD-QLHSEFQLG 56 Query: 79 ------SAANFFPPIL-FAQGDTPMGAAITKALDMVEERKREYRANGISYYRPWIFLITD 131 N F G +G A+ +A R R + P + ++ Sbjct: 57 TFKSRNPMLNHLKKNFGFVGGSLRIGQALREAHRTYFSRPDSGRD--KKQFPPILVVL-- 112 Query: 132 GAPTDEWQAAANKVFRGEEDKRFAFFSIGVQGADMKTLAQISVRQPLPL 180 + E + + + S+G+Q A + L ++ L Sbjct: 113 --ASAESEDDVEEPSKALRGDGVRIISVGLQSASEQELKAMATVSEKVL 159 Score = 44.2 bits (103), Expect = 0.003, Method: Composition-based stats. Identities = 32/165 (19%), Positives = 54/165 (32%), Gaps = 21/165 (12%) Query: 19 RCPCILLLDVSGSMNGRP--INELNAGLVTFRDELLADPLALKRVELGIVTFGPVHVEQP 76 + + L+D S G + L L + + LG++++ Sbjct: 192 KTDLMFLVDES---VGTRQDLRNLQNFLRNITASM---DMRYNCTRLGLMSYSDGAKTIS 245 Query: 77 F----TSAANFFPPILFAQ---GDTPMGAAITKALDMVEERKREYRANGISYYRPWIFLI 129 TS F I G + GAAI K +E + L+ Sbjct: 246 LLNSSTSQYEFQEQIQKLSFQAGKSHAGAAIEKMR--LEAFSESSGSRRAQGVPQIAVLV 303 Query: 130 TDGAPTDEWQAAANKVFRGEEDKRFAFFSIGVQGADMKTLAQISV 174 T TDE + AA ++ F++ +QGA+ L +I Sbjct: 304 THRPSTDEVRDAALQLRLQ----DVTVFAMNIQGANDTQLEEIVS 344 >UniRef50_A0EHG0 Chromosome undetermined scaffold_97, whole genome shotgun sequence n=2 Tax=Paramecium tetraurelia RepID=A0EHG0_PARTE Length = 533 Score = 101 bits (253), Expect = 1e-20, Method: Composition-based stats. Identities = 33/186 (17%), Positives = 59/186 (31%), Gaps = 26/186 (13%) Query: 17 EPRCPCILLLDVSGSMNGRPINELNAGLVTFRDELLADPLALKRVELGIVTFG-PVHVEQ 75 + L+D SGSM G I + L L L ++ F V+ Sbjct: 119 RQGVDLVCLIDHSGSMQGEKIKLVRKTLKQMLTFLQPCD------RLCLIMFDCKVYRLT 172 Query: 76 PFTSAANFFPPI-------LFAQGDTPMGAAITKALDMVEERKREYRANGISYYRPWIFL 128 L A+G T +G + AL +++ RK + + IFL Sbjct: 173 RLMRVTQENVQKFRVAISSLQARGGTDIGNGMKMALSILKHRKYKNPVSA-------IFL 225 Query: 129 ITDGAPTDEWQAAANKVFRGEEDKRFAFFSIGVQ-GADMKTLAQIS----VRQPLPLQGL 183 ++DG + + + + F + G K +++I+ + Sbjct: 226 LSDGVDEGAEERVRDDLIQYNIRDSFTIKTFGFGRDCCPKIMSEIAHYKEGQFYFVPNLT 285 Query: 184 QFRELF 189 E F Sbjct: 286 NIDECF 291 >UniRef50_C3ZZV2 Putative uncharacterized protein n=1 Tax=Branchiostoma floridae RepID=C3ZZV2_BRAFL Length = 4065 Score = 101 bits (253), Expect = 1e-20, Method: Composition-based stats. Identities = 38/176 (21%), Positives = 59/176 (33%), Gaps = 19/176 (10%) Query: 21 PCILLLDVSGSMNGRPINELNAGLVTFRDELLADPLALKRVELGIVTFGPVHVEQPFTSA 80 + LLD SGS+ + D P+ ++G+V F ++ Sbjct: 1842 DLVFLLDGSGSVGSNNFLNVKNFTKLITDLF---PVGDNATKVGLVQFSDTIQKEFDLRD 1898 Query: 81 ANFFPPILFA-------QGDTPMGAAITKALDMVEERKREYRANGISYYRPWIFLITDGA 133 + IL A G T G AI + R + + ++TDG Sbjct: 1899 YDTKAEILSAIDNISYLGGGTYTGNAIDYVRQVSFNTINGNR----GSHPDMLIVLTDGE 1954 Query: 134 PTDEWQAAANKVFRGEEDKRFAFFSIGVQ-GADMKTLAQISVRQPLPLQGLQFREL 188 D A + D+ F+IGV G D TL +I+ Q F +L Sbjct: 1955 SFDPVTFA----SQSARDQGITIFAIGVGTGVDYATLEEIAGDPQKVQQVTDFADL 2006 Score = 100 bits (249), Expect = 4e-20, Method: Composition-based stats. Identities = 34/177 (19%), Positives = 59/177 (33%), Gaps = 19/177 (10%) Query: 20 CPCILLLDVSGSMNGRPINELNAGLVTFRDELLADPLALKRVELGIVTFGPVHVEQPFTS 79 + LLD SGS+ + L ++ +G+V + + + Sbjct: 1588 LDVVFLLDGSGSVGSANFDLLKTFTTRIATNF---DVSTNLTRVGVVQYSDQTNSEFVLN 1644 Query: 80 AANFFPPILFA-------QGDTPMGAAITKALDMVEERKREYRANGISYYRPWIFLITDG 132 + +L A G T GAA+ D V + + + ++TDG Sbjct: 1645 TFSTEAEVLAAIAAISYQNGGTSTGAAL----DYVRQNVFISASGDRPDAANILIVLTDG 1700 Query: 133 APTDEWQAAANKVFRGEEDKRFAFFSIGVQG-ADMKTLAQISVRQPLPLQGLQFREL 188 +D+ A + +S+G+ D TL QI+ LQ F L Sbjct: 1701 VSSDDVSFPAM----AARNAGITIYSVGIGDGVDYNTLQQIAGDPNKVLQATGFSSL 1753 Score = 96.2 bits (238), Expect = 7e-19, Method: Composition-based stats. Identities = 37/181 (20%), Positives = 61/181 (33%), Gaps = 19/181 (10%) Query: 16 PEPRCPCILLLDVSGSMNGRPINELNAGLVTFRDELLADPLALKRVELGIVTFGPVHVEQ 75 P + LLD SGS+ + ++ +G+V + + Sbjct: 1016 PYGGLDLVFLLDGSGSVGTTNFELVKDFTSEVVLNFN---ISADTTNVGVVQYSDTVRNE 1072 Query: 76 PFTSAANFFPPILFA-------QGDTPMGAAITKALDMVEERKREYRANGISYYRPWIFL 128 F S+ + P++ A G T G AI R R + + + Sbjct: 1073 FFLSSYDTKLPLIDAINQISYLTGGTLTGFAIDYVRQSSFSRPAGARNT----FPDVLVV 1128 Query: 129 ITDGAPTDEWQAAANKVFRGEEDKRFAFFSIGVQG-ADMKTLAQISVRQPLPLQGLQFRE 187 +TDG D+ ++A + F++G+ D TL QIS LQ F Sbjct: 1129 LTDGQSQDDVVSSA----AAARSQGITIFAVGIGSEVDFTTLLQISGYPSRILQIQDFAT 1184 Query: 188 L 188 L Sbjct: 1185 L 1185 Score = 94.7 bits (234), Expect = 2e-18, Method: Composition-based stats. Identities = 39/178 (21%), Positives = 68/178 (38%), Gaps = 20/178 (11%) Query: 29 SGSMNGRPINELNAGLVTFRDELLADPLALKRVELGIVTFGPVHVEQPFTSAANFFPPIL 88 SGS+ N + D ++ ++G+V + + + +A + +L Sbjct: 2421 SGSVGADNFNLVKQFAKRLVDNF---EISQTDTKVGVVQYSSSSNVEFYLNAFSTKQAVL 2477 Query: 89 FA-------QGDTPMGAAITKALDMVEERKREYRANGISYYRPWIFLITDGAPTDEWQAA 141 A QG T GAAIT + + RAN Y + ++TDG +D+ Sbjct: 2478 DAINAVTYQQGGTNTGAAITYTMQEIFASANGARAN----YPDVLIVVTDGESSDDVAVP 2533 Query: 142 ANKVFRGEEDKRFAFFSIGVQG-ADMKTLAQISVRQPLPLQGLQFRELFSWLSSSLRS 198 A + +++GV + TL QI+ LQ F L + + SL+ Sbjct: 2534 AL----SARNAGTLIYAVGVGNGVNQATLLQIAGNAGQVLQAADFAGL-TTVVQSLQQ 2586 Score = 94.7 bits (234), Expect = 2e-18, Method: Composition-based stats. Identities = 36/178 (20%), Positives = 58/178 (32%), Gaps = 19/178 (10%) Query: 19 RCPCILLLDVSGSMNGRPINELNAGLVTFRDELLADPLALKRVELGIVTFGPVHVEQ--- 75 + LLD SGS+ + + T +A ++G+V + + Sbjct: 750 PLDIVFLLDGSGSVGSANFDLVKDFTRTLARNF---DIAANMTQIGVVQYSDTVNREFGL 806 Query: 76 ----PFTSAANFFPPILFAQGDTPMGAAITKALDMVEERKREYRANGISYYRPWIFLITD 131 N + + QG T GAAI R + + + ++TD Sbjct: 807 GDFHNRQDVLNAISAVSYQQGGTLTGAAIDFVRQTSFTTGDGDRPDVPN----MLIVVTD 862 Query: 132 GAPTDEWQAAANKVFRGEEDKRFAFFSIGVQG-ADMKTLAQISVRQPLPLQGLQFREL 188 G D Q A+ R F +G+ D TL +I+ LQ F L Sbjct: 863 GVSGDSVQGPADAARRE----GITTFGVGIGNGIDFGTLLEIAGDSARVLQADDFGAL 916 Score = 89.3 bits (220), Expect = 8e-17, Method: Composition-based stats. Identities = 40/190 (21%), Positives = 63/190 (33%), Gaps = 25/190 (13%) Query: 16 PEPRCPCILLLDVSGSMNGRPINELNAGLVTFRDELLADPLALKRV---------ELGIV 66 P P + LLD S S+ + + +G V Sbjct: 546 PYPGADLVFLLDGSASITSPNFELVKDFAERVARHFTISSSRNDNMSYRSFTAATNVGAV 605 Query: 67 TFGPVHVEQPFTSAANFFPPILFA-------QGDTPMGAAITKALDMVEERKREYRANGI 119 + + F S+ + ++ A G T G ALD V++ A Sbjct: 606 QYSDTVRSEFFLSSFDTDFEVVRALDGISYLAGGTFTG----FALDFVQQSAFSPVAGAR 661 Query: 120 SYYRPWIFLITDGAPTDEWQAAANKVFRGEEDKRFAFFSIGVQGA-DMKTLAQISVRQPL 178 Y + ++TDG D+ A A + A F++G+ A D TL QI+ Sbjct: 662 DGYPDILVVVTDGVSQDDVVAPAESARKE----GIAVFAVGIGSAVDYATLLQIAGIDGR 717 Query: 179 PLQGLQFREL 188 LQ F +L Sbjct: 718 ILQINNFVDL 727 Score = 88.1 bits (217), Expect = 2e-16, Method: Composition-based stats. Identities = 35/168 (20%), Positives = 61/168 (36%), Gaps = 19/168 (11%) Query: 29 SGSMNGRPINELNAGLVTFRDELLADPLALKRVELGIVTFGPVHVEQPFTSAANFFPPIL 88 SGS+ N L A +A+ +G+V + + + +A +L Sbjct: 2707 SGSVGSDNFNLLKAFTQNIVGNF---DIAVNNTRVGVVQYSDFNNIEFNLNAYATEAEVL 2763 Query: 89 FA-------QGDTPMGAAITKALDMVEERKREYRANGISYYRPWIFLITDGAPTDEWQAA 141 A +G T GAAI V RA+ + ++TDG +D Sbjct: 2764 AAIGAISYQRGGTFTGAAIDFVRQDVFTTAGGNRADKPD----ILLVLTDGESSDSVAGP 2819 Query: 142 ANKVFRGEEDKRFAFFSIGVQ-GADMKTLAQISVRQPLPLQGLQFREL 188 A + + +++G+ G + TL +I+ LQ F+ L Sbjct: 2820 A----QNTLNAGITIYAVGIGSGVNADTLQEIAGDPGRVLQVADFQGL 2863 Score = 87.7 bits (216), Expect = 2e-16, Method: Composition-based stats. Identities = 32/178 (17%), Positives = 58/178 (32%), Gaps = 21/178 (11%) Query: 21 PCILLLDVSGSMNGRPINELNAGLVTFRDELLADPLALKRVELGIVTFGPVHVE-QPFTS 79 + LLD SGS+ + + + ++ +G+V + F Sbjct: 2126 DLVFLLDGSGSVGASSFDLMKSFTNRITTNF---DVSPTSTRVGVVQYSSQGSVATEFRL 2182 Query: 80 AANFFPPILFA--------QGDTPMGAAITKALDMVEERKREYRANGISYYRPWIFLITD 131 + + A G+T G A+ + V + G + + +ITD Sbjct: 2183 DSYSNKDDVIAAVNGIVYQNGNTYTGEAL----NYVRQNSFAVANGGRADVANILVVITD 2238 Query: 132 GAPTDEWQAAANKVFRGEEDKRFAFFSIGVQG-ADMKTLAQISVRQPLPLQGLQFREL 188 G D+ A + R +++G+ TL I+ Q LQ F L Sbjct: 2239 GQSVDDVTGPAQDLLRE----GVTVYALGIGDGIQYSTLEAIAQDQSRVLQANTFTNL 2292 Score = 86.6 bits (213), Expect = 6e-16, Method: Composition-based stats. Identities = 32/177 (18%), Positives = 58/177 (32%), Gaps = 19/177 (10%) Query: 20 CPCILLLDVSGSMNGRPINELNAGLVTFRDELLADPLALKRVELGIVTFGPVHVEQPFTS 79 I LLD SGS+ + + + P A +G+ + + + + Sbjct: 1302 LDLIFLLDGSGSITAPNFELVKSFTYSVSRNFDVSPNA---TRIGVAQYSDTNSLEFNLN 1358 Query: 80 AANFFPPILFA-------QGDTPMGAAITKALDMVEERKREYRANGISYYRPWIFLITDG 132 + +L A G T GAA+ + R + + + TDG Sbjct: 1359 RYSTKDEVLNAVNGISYQGGGTYTGAALDFVRQTMMVESAGDRTMSPN----ILVVATDG 1414 Query: 133 APTDEWQAAANKVFRGEEDKRFAFFSIGVQ-GADMKTLAQISVRQPLPLQGLQFREL 188 +D+ + A + +++G+ G TL I+ LQ F L Sbjct: 1415 ESSDDQRTPAEVLRNA----GTLVYAVGIGAGVSSTTLLDIAGYNSRVLQATDFASL 1467 Score = 77.7 bits (190), Expect = 2e-13, Method: Composition-based stats. Identities = 31/177 (17%), Positives = 57/177 (32%), Gaps = 19/177 (10%) Query: 20 CPCILLLDVSGSMNGRPINELNAGLVTFRDELLADPLALKRVELGIVTFGPVHVEQPFTS 79 + L+D S S+ + L ++ + +G+V + + + Sbjct: 135 LDLVFLVDGSSSVGSDNFETIKVFLEAITAGF---EVSSSQTRVGVVQYSTGINTEFDLN 191 Query: 80 AANFFPPILFA-------QGDTPMGAAITKALDMVEERKREYRANGISYYRPWIFLITDG 132 + ++ A +G T GA IT R + + + +ITDG Sbjct: 192 SFATEAEVINAIRGLSHQRGSTFTGAGITFTRLESFTGASGDRPDAPN----VLIVITDG 247 Query: 133 APTDEWQAAANKVFRGEEDKRFAFFSIGVQGA-DMKTLAQISVRQPLPLQGLQFREL 188 D A A +SIG+ + TL I+ + L F +L Sbjct: 248 ISADSVDAPA----EAARADNITTYSIGIGDEINYLTLLSIAGMRERVLNVTTFGDL 300 Score = 77.3 bits (189), Expect = 3e-13, Method: Composition-based stats. Identities = 39/189 (20%), Positives = 65/189 (34%), Gaps = 22/189 (11%) Query: 20 CPCILLLDVSGSMNGRPINELNAGLVTFRDELLADPLALKRVELGIVTFG-PVHVEQPFT 78 I L+D S S++ + L L + ++ +G+V + V+ E Sbjct: 353 IDVIFLIDGSSSISLLNFDLLKTFLQNITMKF---DVSSDITRIGVVQYSTDVNTEFELK 409 Query: 79 ------SAANFFPPILFAQGDTPMGAAITKALDMVEERKREYRANGISYYRPWIFL-ITD 131 + I +G T +GA I A G P I + ITD Sbjct: 410 TYATEAEVIHAISNITRQRGSTFIGAGIN-----FVRTNSFTVAAGDRPLAPNILVTITD 464 Query: 132 GAPTDEWQAAANKVFRGEEDKRFAFFSIGVQGADM-KTLAQISVRQPLPLQGLQFRELFS 190 G D+ A + D+ +SIG+ TL I+ + F EL Sbjct: 465 GISADDVAGPA----QAARDQGILTYSIGIGEEIQWPTLLSIAGARHRVFNVTSFSEL-P 519 Query: 191 WLSSSLRSV 199 + +SL ++ Sbjct: 520 GIEASLTAL 528 >UniRef50_C3YBZ5 Putative uncharacterized protein n=1 Tax=Branchiostoma floridae RepID=C3YBZ5_BRAFL Length = 515 Score = 101 bits (253), Expect = 1e-20, Method: Composition-based stats. Identities = 42/198 (21%), Positives = 69/198 (34%), Gaps = 20/198 (10%) Query: 2 SEQITFATSDFASNPE---PRCPCILLLDVSGSMNGRPINELNAGLVTFRDELLADPLAL 58 + A ++ E P + +LD SGS+ + +V+ D + Sbjct: 155 KQTYIDAVNELEQESECKIPGMDMVFVLDGSGSVGADNFETVKDFVVSVVDGF---EIGQ 211 Query: 59 KRVELGIVTFGP-VHVEQPFT------SAANFFPPILFAQGDTPMGAAITKALDMVEERK 111 R +G+V + V E T + I + QG T GAA+ D+ + Sbjct: 212 SRTRIGVVQYSDEVQNEFNLTEYGNKADVQSAISNITYLQGRTYTGAALRYMTDVSFSEE 271 Query: 112 REYRANGISYYRPWIFLITDGAPTDEWQAAANKVFRGEEDKRFAFFSIGVQGADMKTLAQ 171 R + + I + TDG TD Q A+ F+IG+ G D++ L Q Sbjct: 272 AGARPPYQAIPKVGIVV-TDGEATDNVQGPASSAHEA----GVNVFAIGIGGYDVRELRQ 326 Query: 172 ISVRQ--PLPLQGLQFRE 187 I+ F Sbjct: 327 IATDPDATHVFAVDNFAA 344 >UniRef50_Q15NW6 Vault protein inter-alpha-trypsin n=1 Tax=Pseudoalteromonas atlantica T6c RepID=Q15NW6_PSEA6 Length = 701 Score = 101 bits (252), Expect = 1e-20, Method: Composition-based stats. Identities = 37/202 (18%), Positives = 70/202 (34%), Gaps = 21/202 (10%) Query: 13 ASNPEPRCPCILLLDVSGSMNGRPINELNAGLVTFRDELLADPLALKRVELGIVTFGPVH 72 + P + LLD SGSM G I + + +L + + I+ F Sbjct: 296 VAQQMPSREVVFLLDTSGSMAGESIVQAKRAVDFALTQLRPEDN------VNIIQFNDAP 349 Query: 73 VEQPFTSA---------ANFFPPILFAQGDTPMGAAITKALDMVEERKREYRANGISYYR 123 + A + L A G T M A+T AL+ + + G R Sbjct: 350 QALWKRAMPATAKHIQRARNWVASLHADGGTEMAPALTLALNKPSLHRDDSDLLGSHKLR 409 Query: 124 PWIFLITDGAPTDEWQAAANKVFRGEEDKRFAFFSIGVQGA-DMKTLAQISV-RQPLPLQ 181 +F ITDG + + A + + F+IG+ A + + Q + + Sbjct: 410 QVVF-ITDG--SVSNEDALMSLIESKLADN-RLFTIGIGSAPNSYFMTQAAQAGRGTFTY 465 Query: 182 GLQFRELFSWLSSSLRSVSRST 203 +++ +++ ++R Sbjct: 466 IGDIQQVQHKMTALFNKLTRPV 487 >UniRef50_B8AXM1 Putative uncharacterized protein n=1 Tax=Oryza sativa Indica Group RepID=B8AXM1_ORYSI Length = 614 Score = 101 bits (252), Expect = 2e-20, Method: Composition-based stats. Identities = 37/205 (18%), Positives = 68/205 (33%), Gaps = 30/205 (14%) Query: 8 ATSDFASNPEPRCPCILLLDVSGSMNGRPINELNAGLVTFRDELLADPLALKRVELGIVT 67 A + +LDVSGSM G + + + F +L D L +V+ Sbjct: 18 APPVLEGTARAGVDVVAVLDVSGSMEGERLEHVKEAMEIFIGKLGPDD------RLSVVS 71 Query: 68 FG-PVHVEQPFT-------SAANFFPPILFAQGDTPMGAAITKALDMVEERKREYRANGI 119 F V T + A L A G T MGAA+ + ++ +RK + Sbjct: 72 FATSVRRLTELTYMSEQGRAVAKEIVDGLVADGSTNMGAALLEGAMILRDRKGA--RDES 129 Query: 120 SYYRPWIFLITDGAPTDEWQAAANKVFRGEEDKRFAFFSIGVQ-GADMKTLAQISVRQPL 178 + + ++DG +++++ + F + G+ + + I+ Sbjct: 130 NGRVGCMMFLSDG--------TNDEIYKEDISGEFPAHTFGLGSDHNPNVMRHIADETSA 181 Query: 179 PL-----QGLQFRELFSWLSSSLRS 198 + F S L S Sbjct: 182 TYSFVNRNIADIKGAFDLFISGLTS 206 >UniRef50_Q3A188 von Willebrand factor type A domain protein n=1 Tax=Pelobacter carbinolicus DSM 2380 RepID=Q3A188_PELCD Length = 442 Score = 101 bits (252), Expect = 2e-20, Method: Composition-based stats. Identities = 30/194 (15%), Positives = 59/194 (30%), Gaps = 24/194 (12%) Query: 8 ATSDFASNPEPRCPCILLLDVSGSMNGRPINELNAGLVTFRDELLADPLALKRVELGIVT 67 A + P L+LD SGSM+G I + + L L V Sbjct: 54 APRAPRTAQRPPVNLALVLDRSGSMSGNKIAKAREAAIEAVRRLSDGDLFSLVVYD---- 109 Query: 68 FGPVHVEQPFTSAAN-----FFPPILFAQGDTPMGAAITKALDMVEERKREYRANGISYY 122 V P ++ + G T + A+++ V R + + Y Sbjct: 110 -DSVETLVPAQPVSDIGDIEARIRRIRPGGSTALFGAVSQGAAEV-------RKHSDAPY 161 Query: 123 RPWIFLITDGAPTDEWQAAAN--KVFRGEEDKRFAFFSIGVQ-GADMKTLAQIS----VR 175 + L++DG A+ ++ + + ++GV + + Q++ Sbjct: 162 VNRVVLLSDGLANVGPSRPADLARLGAALLKEGISVTTVGVGTDFNEDLMTQLAERSDGN 221 Query: 176 QPLPLQGLQFRELF 189 +F Sbjct: 222 HYFVESSRDLPRIF 235 >UniRef50_UPI00016E6A6D UPI00016E6A6D related cluster n=2 Tax=Takifugu rubripes RepID=UPI00016E6A6D Length = 832 Score = 101 bits (252), Expect = 2e-20, Method: Composition-based stats. Identities = 42/212 (19%), Positives = 68/212 (32%), Gaps = 22/212 (10%) Query: 3 EQITFATSDFASNPEP--RCPCILLLDVSGSMNGRPINELNAGLVTFRDELLADPLALKR 60 + T + NP + ++D S S+ ++ +V L + Sbjct: 30 QNETTVQNKAVENPCKAVPLDFVFVIDSSRSIRPNDYEKVKTFIVNLIQFL---EIGPDA 86 Query: 61 VELGIVTFGPV-------HVEQPFTSAANFFPPILFAQGDTPMGAAITKALDMVEERKRE 113 +G++ +G V + + T G AI A++ + Sbjct: 87 TRVGLLQYGSVVQPEFSLNTFTSKAEVEQAVRNMRHLATGTMTGLAIKYAMETSFTEEDG 146 Query: 114 YRANGISYYRPWIFLITDGAPTDEWQAAANKVFRGEEDKRFAFFSIGVQGADMKTLAQI- 172 RA + R + + TDG P D + A + + F+IGV DM TL I Sbjct: 147 ARAAHLHIPRIAVIV-TDGRPQDTVEQVAAQARQA----GIQIFAIGVGRVDMNTLKTIG 201 Query: 173 ----SVRQPLPLQGLQFRELFSWLSSSLRSVS 200 S L Q L S S L S Sbjct: 202 SEPHSEHVHLVANFSQIETLISVFHSKLCGGS 233 Score = 100 bits (249), Expect = 4e-20, Method: Composition-based stats. Identities = 27/212 (12%), Positives = 63/212 (29%), Gaps = 26/212 (12%) Query: 20 CPCILLLDVSGSMNGRPINELNAGLVTFRDELLADPLALKRVELGIVTFGPV-------H 72 + ++D S SM + +++ + L ++ +G++ + Sbjct: 562 MDLVFVIDGSKSMGPANFELVKHFVISIVESLN---VSQMGSHVGLLQYSTKVRTEFTLR 618 Query: 73 VEQPFTSAANFFPPILFAQGDTPMGAAITKALDMVEERKREYRANGISYYRPWIFLITDG 132 S + + + G+A+ K R N + TDG Sbjct: 619 QHTSAQSIRQAVSRMQYMGRGSMTGSALRHMFQFSFSAKEGARPNVPHVG----IVFTDG 674 Query: 133 APTDEWQAAANKVFRGEEDKRFAFFSIGVQGADMKTLAQISVRQ-----PLPLQGLQFRE 187 D+ + + +++GV A + L +I+ E Sbjct: 675 RSQDDVS----EWANKAKKSGVTIYAVGVGKAIEQELREIASEPDEKHLYYAEDFQDMGE 730 Query: 188 LFSWLSSSLRSV---SRSTPGTEVVLEAPKGW 216 + L S + + S++ +L + W Sbjct: 731 ITKKLKSRMCTALKMSQTIISGSGLLPSNCNW 762 >UniRef50_UPI000194D9FE PREDICTED: similar to matrilin 4 n=2 Tax=Neognathae RepID=UPI000194D9FE Length = 580 Score = 101 bits (252), Expect = 2e-20, Method: Composition-based stats. Identities = 29/185 (15%), Positives = 58/185 (31%), Gaps = 18/185 (9%) Query: 19 RCPCILLLDVSGSMNGRPINELNAGLVTFRDELLADPLALKRVELGIVTFGP-VHVEQPF 77 + ++D S S+ + ++ L + +G++ + V Sbjct: 32 PLDIVFVIDSSRSVRPFEFETMRRFMMDIIGNL---DVGPNATRVGVIQYSSQVQNIFSL 88 Query: 78 T------SAANFFPPILFAQGDTPMGAAITKALDMVEERKREYRANGISYYRPWIFLITD 131 I+ T G AI A+++ + R R ++TD Sbjct: 89 KTFFTRADMERAINSIIPLAQGTMTGLAIQYAMNVAFTTQEGARPLHKRIPR-IAIVVTD 147 Query: 132 GAPTDEWQAAANKVFRGEEDKRFAFFSIGVQGADMKTLAQISV--RQPLPLQGLQFRELF 189 G P D A + +++G+Q ADM +L ++ + F EL Sbjct: 148 GRPQDRVTEVATQARNA----GIEIYAVGIQRADMNSLRAMASPPLEEHVFLVESF-ELI 202 Query: 190 SWLSS 194 + Sbjct: 203 QQFAK 207 Score = 95.0 bits (235), Expect = 2e-18, Method: Composition-based stats. Identities = 29/182 (15%), Positives = 61/182 (33%), Gaps = 17/182 (9%) Query: 19 RCPCILLLDVSGSMNGRPINELNAGLVTFRDELLADPLALKRVELGIVTFGP-VHVEQPF 77 ++++D S S+ + + + D L P +G+V + V E P Sbjct: 343 HVDLVMVIDGSKSVRPQNFELVKQFVNRIVDLLEVSPHG---TRVGLVQYSSRVRTEFPL 399 Query: 78 T------SAANFFPPILFAQGDTPMGAAITKALDMVEERKREYRANGISYYRPWIFLITD 131 + + + T G A+ ++ R + R + TD Sbjct: 400 NKYHSADEIKKAVMDVEYMEKGTMTGLALKHMVEHSFSELEGARPLSYNIPRIGLVF-TD 458 Query: 132 GAPTDEWQAAANKVFRGEEDKRFAFFSIGVQGADMKTLAQISVRQPLPLQGLQFRELFSW 191 G D+ ++ R ++ F++GV A + L I+ Q + F+ Sbjct: 459 GRSQDD----ISEWARRAKESGIVMFAVGVGKAVEEELRAIASEP--VEQHFSYSADFTT 512 Query: 192 LS 193 ++ Sbjct: 513 MT 514 >UniRef50_C3JL94 von Willebrand factor type A domain protein n=1 Tax=Rhodococcus erythropolis SK121 RepID=C3JL94_RHOER Length = 614 Score = 101 bits (252), Expect = 2e-20, Method: Composition-based stats. Identities = 39/198 (19%), Positives = 65/198 (32%), Gaps = 18/198 (9%) Query: 22 CILLLDVSGSMNGRPINELNAGLVTFRDELLADPLALKRVELGIVTFGP---VHVEQPFT 78 + ++D SGSM G P+ + L L + A R G G V V Sbjct: 52 VLFVVDTSGSMAGSPLAQAKDALRAGIGALSSGQAAGLRSFAGDCGNGGQLLVPVATDNR 111 Query: 79 SAANFFPPILFAQGDTPMGAAITKALDMVEERKREYRANGISYYRPWIFLITDGAPTDEW 138 N L A G TP A+ A + I LI+DG T Sbjct: 112 DQLNNATNQLTAGGTTPTPDALRAAAGDLPSTGDRT-----------IILISDGQSTCGD 160 Query: 139 QAAANKVFRGEEDKRFAFFSIGVQ--GADMKTLAQISV-RQPLPLQGLQFRELFSWLSSS 195 A + + F ++G L+ I+ EL +S++ Sbjct: 161 PCAVATELKTQLGIDFRVHAVGFNAPDVAESELSCIANATGGRYFTATNTTELSDAISAA 220 Query: 196 LRSVSRSTPGTEVVLEAP 213 + + S + +++ ++P Sbjct: 221 VTTGS-AEIRSDIDCDSP 237 >UniRef50_UPI00006CDA4D Glutathionylspermidine synthase family protein n=1 Tax=Tetrahymena thermophila RepID=UPI00006CDA4D Length = 1547 Score = 101 bits (251), Expect = 2e-20, Method: Composition-based stats. Identities = 41/184 (22%), Positives = 67/184 (36%), Gaps = 13/184 (7%) Query: 19 RCPCILLLDVSGSMNGRPINELNAGLVTFRDELLADPLALKRVELGIVTFGPVHVEQPFT 78 R I LLD SGSM+G+PI+ L F L D +++FG Sbjct: 305 RSEFIFLLDRSGSMSGQPIDRACQALTLFLKSLPTD------SYFNVISFGSSFKLLFPQ 358 Query: 79 SAANFFPPILFAQGDTPMGAAITKALDMVEERKREYRANGISYYRPWIFLITDGAPTDEW 138 S + A + A ++ + K + N I Y +FL+TDG D Sbjct: 359 SEKYNSQSLEKAISNISKYKADLGGTEIYKPLKNVFVQNKIQGYNKQVFLLTDGE-VDSP 417 Query: 139 QAAANKVFRGEEDKRFAFFSIGVQGADMKTLAQ--ISVRQPLPLQGL--QFRE-LFSWLS 193 + + + + + R G AD + Q I+ + + L E + + LS Sbjct: 418 EQVISLIRKNNKFSRVHSIGFGSG-ADQYLINQSAIAGKGISKIVDLKCDLSEVIINMLS 476 Query: 194 SSLR 197 + Sbjct: 477 MCIT 480 >UniRef50_C7N1C6 Uncharacterized protein n=1 Tax=Slackia heliotrinireducens DSM 20476 RepID=C7N1C6_SLAHD Length = 744 Score = 101 bits (251), Expect = 2e-20, Method: Composition-based stats. Identities = 34/197 (17%), Positives = 60/197 (30%), Gaps = 27/197 (13%) Query: 6 TFATSDFASNPEPRCPCILLLDVSGSMNGRPINELNAGLVTFRDELLADPLALKRVELGI 65 S N +L LD SGSM+G P+NE F + ++ + Sbjct: 366 LMGDSKVDPNDASSRHVVLALDTSGSMDGEPLNETKTATREFASTIFKSD-----ADVCL 420 Query: 66 VTF-GPVHVEQPFTS---AANFFPPILFAQGDTPMGAAITKALDMVEERKREYRANGISY 121 V++ T A L A G T + A+ + + +E G Sbjct: 421 VSYDSSARNVIDSTDNEYALKAAVRDLSAGGGTNIEDALRVSYERLE---------GSGS 471 Query: 122 YRPWIFLITDGAPTDE-WQAAANKVFRGEEDKRFAFFSIGV------QGADMKTLAQIS- 173 + I L++DG + +D +++G + + + I+ Sbjct: 472 DKRIIVLMSDGEANEGLVGDDLIAYANEIKDDGVTIYTLGFFQSVSDKAECQRVMEGIAS 531 Query: 174 -VRQPLPLQGLQFRELF 189 Q R F Sbjct: 532 PGCHYEVDDASQLRYFF 548 >UniRef50_D2VKS7 von Willebrand factor type A domain-containing protein n=2 Tax=Naegleria gruberi RepID=D2VKS7_NAEGR Length = 923 Score = 101 bits (251), Expect = 2e-20, Method: Composition-based stats. Identities = 33/217 (15%), Positives = 62/217 (28%), Gaps = 32/217 (14%) Query: 13 ASNPEPRCPCILLLDVSGSMNGRPINELNAGLVTFRDELLADPLALKRVELGIVTFGP-V 71 A +L++D SGSM G+ ++ + + L D+L + IV F V Sbjct: 684 AQKERKGVDLVLVVDKSGSMAGQKLDMVKSTLSFMVDQLKEKD------RVAIVEFDTQV 737 Query: 72 HVEQPFTSAA-------NFFPPILFAQGDTPMGAAITKALDMVEERKREYRANGISYYRP 124 T + T + A+ +L ++ R++E Sbjct: 738 KTNLDLTKMDIEGKKKAKQVSSAISPGSCTNLSGALFTSLKLLASRQQEKNEVTS----- 792 Query: 125 WIFLITDGAPTDEWQAAANKVFRGEE-------DKRFAFFSIGVQ-GADMKTLAQIS--- 173 + L TDG + + ++ + G D L I+ Sbjct: 793 -VILFTDGLANRGLISTNEILQNMQDLMDELLSTSNVTIHTFGFGQDTDANMLTSIAQKG 851 Query: 174 -VRQPLPLQGLQFRELFSWLSSSLRSVSRSTPGTEVV 209 + F + +L SV + Sbjct: 852 NGLYDYLETADDIPKAFGNVIGNLVSVVGQNIKIRIQ 888 >UniRef50_B8HNU4 von Willebrand factor type A n=4 Tax=Chroococcales RepID=B8HNU4_CYAP4 Length = 421 Score = 101 bits (251), Expect = 2e-20, Method: Composition-based stats. Identities = 39/196 (19%), Positives = 64/196 (32%), Gaps = 24/196 (12%) Query: 2 SEQITFATSDFASNPEPR---CPCILLLDVSGSMNGRPINELNAGLVTFRDELLADPLAL 58 Q+ + + A R L+LD SGSM G+P+ + D LL Sbjct: 21 QRQLEISVAAIAQASGERNAPLNLGLILDHSGSMAGQPLETVKRAAQKLVDRLLPSD--- 77 Query: 59 KRVELGIVTFGPVHVE----QPFTSAANFFPPI--LFAQGDTPMGAAITKALDMVEERKR 112 L ++ F V QP T I L A G T + + L + K Sbjct: 78 ---RLAVIVFDHVAKVLIPNQPVTDRDKIKTRISHLAAMGGTAIDEGLQLGLTELIAAKA 134 Query: 113 EYRANGISYYRPWIFLITDGAPTDEWQAAANKVFRGEEDKRFAFFSIGVQGA-DMKTLAQ 171 IFL+TDG + ++ + ++G + L Q Sbjct: 135 GA--------ISQIFLLTDGENEHGNNSRCLQLAEEAAKENITLNTLGFGYHWNQDVLEQ 186 Query: 172 ISVRQPLPLQGLQFRE 187 I+ L +++ + Sbjct: 187 IADAAGGSLMFIEYPQ 202 >UniRef50_C8N8N5 von Willebrand factor type A domain protein n=2 Tax=Gammaproteobacteria RepID=C8N8N5_9GAMM Length = 563 Score = 101 bits (251), Expect = 2e-20, Method: Composition-based stats. Identities = 42/224 (18%), Positives = 75/224 (33%), Gaps = 32/224 (14%) Query: 4 QITFATSDFASNPEPRCPCILLLDVSGSM-NGRPINELNAGLVTFRDELLADPLALKRVE 62 +I +D A P + L+D SGSM + + + + F + L AD Sbjct: 187 RIAIQAADLAPEKRPPANLVFLIDTSGSMDDPDKLPLVKKTVCHFAEALRADD------R 240 Query: 63 LGIVTFG-PVHVEQPFTSAAN-----FFPPILFAQGDTPMGAAITKALDMVEERKREYRA 116 + ++T+ P T+ L A G T G A+ A D + R+ Sbjct: 241 ISLITYSGSTAEILPPTAGDQKETIIAALKPLRAHGATAGGEALRMAYDAAAKNYRKDGI 300 Query: 117 NGISYYRPWIFLITDGAPTDEWQAA--ANKVFRGEEDKRFAFFSIGV--QGADMKTLAQI 172 N I L TDG + + ++G + + + Q+ Sbjct: 301 NR-------ILLATDGDFNVGISDPATLKNYVADKRKSGISLTTLGYGSGNYNDEMMEQL 353 Query: 173 SVRQPLPLQGLQFRE-----LFSWLSSSLRSVSRSTPGTEVVLE 211 + + L L+S+L +V+R ++ LE Sbjct: 354 ADAGDGNYSYIDSEAEAKKVLVRQLTSTLATVAR---DIKIQLE 394 >UniRef50_O95460 Matrilin-4 n=32 Tax=Amniota RepID=MATN4_HUMAN Length = 622 Score = 101 bits (251), Expect = 2e-20, Method: Composition-based stats. Identities = 30/182 (16%), Positives = 59/182 (32%), Gaps = 19/182 (10%) Query: 19 RCPCILLLDVSGSMNGRPINELNAGLVTFRDELLADPLALKRVELGIVTFGP-VHVEQPF 77 + ++D S S+ + L+ L P A +G++ + V P Sbjct: 32 PLDLVFVIDSSRSVRPFEFETMRQFLMGLLRGLNVGPNA---TRVGVIQYSSQVQSVFPL 88 Query: 78 T------SAANFFPPILFAQGDTPMGAAITKALDMVEERKREYRANGISYYRPWIFLITD 131 ++ T G AI A+++ R R + + TD Sbjct: 89 RAFSRREDMERAIRDLVPLAQGTMTGLAIQYAMNVAFSVAEGARPPEERVPRVAVIV-TD 147 Query: 132 GAPTDEWQAAANKVFRGEEDKRFAFFSIGVQGADMKTLAQIS----VRQPLPLQGLQFRE 187 G P D +V + +++GVQ AD+ +L ++ ++ + Sbjct: 148 GRPQDR----VAEVAAQARARGIEIYAVGVQRADVGSLRAMASPPLDEHVFLVESFDLIQ 203 Query: 188 LF 189 F Sbjct: 204 EF 205 Score = 91.2 bits (225), Expect = 2e-17, Method: Composition-based stats. Identities = 27/165 (16%), Positives = 58/165 (35%), Gaps = 15/165 (9%) Query: 19 RCPCILLLDVSGSMNGRPINELNAGLVTFRDELLADPLALKRVELGIVTFGP-VHVEQPF 77 +LL+D S S+ + + + D L ++ + +G+V F V E P Sbjct: 384 HVDLVLLVDGSKSVRPQNFELVKRFVNQIVDFL---DVSPEGTRVGLVQFSSRVRTEFPL 440 Query: 78 ------TSAANFFPPILFAQGDTPMGAAITKALDMVEERKREYRANGISYYRPWIFLITD 131 + + + T G A+ ++ + R ++ R + TD Sbjct: 441 GRYGTAAEVKQAVLAVEYMERGTMTGLALRHMVEHSFSEAQGARPRALNVPRVGLVF-TD 499 Query: 132 GAPTDEWQAAANKVFRGEEDKRFAFFSIGVQGADMKTLAQISVRQ 176 G D+ + +++ +++GV A L +I+ Sbjct: 500 GRSQDD----ISVWAARAKEEGIVMYAVGVGKAVEAELREIASEP 540 >UniRef50_C3ZZV3 Putative uncharacterized protein (Fragment) n=1 Tax=Branchiostoma floridae RepID=C3ZZV3_BRAFL Length = 2692 Score = 100 bits (250), Expect = 3e-20, Method: Composition-based stats. Identities = 44/178 (24%), Positives = 67/178 (37%), Gaps = 19/178 (10%) Query: 20 CPCILLLDVSGSMNGRPINELNAGLVTFRDELLADPLALKRVELGIVTFGPVHVEQPFTS 79 + LLD SGS+ + + E ++ +G+V F + F S Sbjct: 1635 LDLVFLLDGSGSVTAVNFDLVKDFASGVVSEFQ---ISTTETRVGVVQFSDTLRTEFFMS 1691 Query: 80 AANFFPPILFA-------QGDTPMGAAITKALDMVEERKREYRANGISYYRPWIFLITDG 132 + + +L A QG+T GAAIT A RAN + + ++TDG Sbjct: 1692 SFSTKQQVLQAISDIDYIQGNTLTGAAITFATASSFSTPAGNRANFPDF----MIVVTDG 1747 Query: 133 APTDEWQAAANKVFRGEEDKRFAFFSIGVQG-ADMKTLAQISVRQPLPLQGLQFRELF 189 D A + D+ F++GV D TL QI+ LQ F +L Sbjct: 1748 LSQDSVVQPA----QSARDQGITIFAVGVGNEVDFATLLQITGVPEYILQVTDFSDLL 1801 Score = 98.5 bits (244), Expect = 1e-19, Method: Composition-based stats. Identities = 38/187 (20%), Positives = 60/187 (32%), Gaps = 20/187 (10%) Query: 20 CPCILLLDVSGSMNGRPINELNAGLVTFRDELLADPLALKRVELGIVTFGPV-HVEQPF- 77 + LLD SGS+ + + ++ + +V + E Sbjct: 1097 TDLVFLLDGSGSVGSNNFDLVKTFTKNVVQNF---DISETATRVAVVQYSDQFSTEFSLN 1153 Query: 78 -----TSAANFFPPILFAQGDTPMGAAITKALDMVEERKREYRANGISYYRPWIFLITDG 132 T N I + G T G AI + V R Y + ++TDG Sbjct: 1154 AFSTKTEVYNAIDNISYLTGGTFTGFAIDFVMQSVFTSISGER----DGYPDLLVVVTDG 1209 Query: 133 APTDEWQAAANKVFRGEEDKRFAFFSIGVQ-GADMKTLAQISVRQPLPLQGLQFRELFSW 191 TD+ A+ + +++GV D TL QI+ Q F L Sbjct: 1210 LSTDDVSGPAD----TARAQGVTIYAVGVGSDIDFNTLEQIAGLTSRVSQVSDFSSL-VT 1264 Query: 192 LSSSLRS 198 LS +L Sbjct: 1265 LSQTLSQ 1271 Score = 98.5 bits (244), Expect = 2e-19, Method: Composition-based stats. Identities = 34/178 (19%), Positives = 63/178 (35%), Gaps = 19/178 (10%) Query: 20 CPCILLLDVSGSMNGRPINELNAGLVTFRDELLADPLALKRVELGIVTFGPVHVEQPFTS 79 + LLD SGS+ + + + ++L +G+V + + + Sbjct: 1379 LDLVFLLDGSGSVTTANFDIVKEFTRRLANNF---DISLADTRVGVVQYSDSPTLEFNLN 1435 Query: 80 AAN-------FFPPILFAQGDTPMGAAITKALDMVEERKREYRANGISYYRPWIFLITDG 132 + N I + QG T G AI D V S + ++TDG Sbjct: 1436 SFNTNELVDLAIRNIQYQQGGTNTGQAI----DFVRVNSFSANNGDRSDVPNVMIVVTDG 1491 Query: 133 APTDEWQAAANKVFRGEEDKRFAFFSIGVQG-ADMKTLAQISVRQPLPLQGLQFRELF 189 +D+ A + + + +++G+ D L QI+ + +Q F LF Sbjct: 1492 QSSDDVVGPA----QTARNAGISMYAVGIGNGVDTNELLQIAGQVDRVVQSADFSTLF 1545 Score = 88.1 bits (217), Expect = 2e-16, Method: Composition-based stats. Identities = 38/199 (19%), Positives = 65/199 (32%), Gaps = 22/199 (11%) Query: 3 EQITFATSDFASNPEPRCPCILLLDVSGSMNGRPINELNAGLVTFRDELLADPLALKRVE 62 E + T NPE I LLD SGS+ + + + ++ Sbjct: 282 ENCQYYTPCLVRNPE--FDLIFLLDESGSIGTDNFKLVKSFTERMANNF---DISPNSTR 336 Query: 63 LGIVTFGPVHVE-------QPFTSAANFFPPILFAQGDTPMGAAITKALDMVEERKREYR 115 +G+V + + + I + G T GAAI + R Sbjct: 337 VGVVQYSNFPGTEFSLNAFTDKAAVLDAISKIDYNGGSTFTGAAIDFVRNNEFTSVNGDR 396 Query: 116 ANGISYYRPWIFLITDGAPTDEWQAAANKVFRGEEDKRFAFFSIGVQ-GADMKTLAQI-S 173 + +ITDG P D+ A + +++G+ D L Q+ + Sbjct: 397 ----DDVPNILIVITDGNPNDDVSGPAI----SANNAGITTYAVGIGSNVDQANLVQMTA 448 Query: 174 VRQPLPLQGLQFRELFSWL 192 R LQ F +L + + Sbjct: 449 GRPGRVLQAADFTDLTTVV 467 Score = 81.9 bits (201), Expect = 1e-14, Method: Composition-based stats. Identities = 35/180 (19%), Positives = 59/180 (32%), Gaps = 20/180 (11%) Query: 18 PRCPCILLLDVSGSMNGRPINELNAGLVTFRDELLADPLALKRVELGIVTFGPVHVEQPF 77 P + LLD SGS+ + A P A +G++ + + F Sbjct: 552 PGADLVFLLDGSGSIGTDNFQLVKAFTKEVIRNFAISPTA---TRVGLLQYSDTIDNEFF 608 Query: 78 TSAANFFPPILFA-------QGDTPMGAAITKALDMVEERKREYRANGISYYRPWIFLIT 130 + N + A G T G A+ + R N Y + ++T Sbjct: 609 MNEFNTRDELYTAVDNVVYKTGGTFTGFAVEFTRQIAFRTSAGTRDN----YPDILIVVT 664 Query: 131 DGAPTDEWQAAANKVFRGEEDKRFAFFSIGVQ-GADMKTLAQISVR-QPLPLQGLQFREL 188 DG D +A D+ +++GV D +L +++ LQ F L Sbjct: 665 DGNSEDVVTSAV----ASAIDQGILIYAVGVGSNVDFASLLELTGGVNSRVLQVSDFTGL 720 Score = 81.2 bits (199), Expect = 2e-14, Method: Composition-based stats. Identities = 39/195 (20%), Positives = 65/195 (33%), Gaps = 24/195 (12%) Query: 19 RCPCILLLDVSGSMNGRPINELNAGLVTFRDELLADPLALKRVELGIVTFGPVHVEQPFT 78 + LLD SGS+ + ++ +G+V + + F Sbjct: 1892 PVDMVFLLDGSGSVTQPNFELVKQFTQNVVVNFN---ISSATTRVGLVQYSDTIRTEFFL 1948 Query: 79 SAANFFPPILFAQGDTPMGAAITKALDMVEERKREYRANGISYYRPWIFLITDGAPTDEW 138 ++ +T GAAI D V A + ++TDG D+ Sbjct: 1949 NS--------HPSRNTLTGAAI----DFVRTSSFSIPAGNRLTLPDVLVVVTDGLSQDDV 1996 Query: 139 QAAANKVFRGEEDKRFAFFSIGVQG-ADMKTLAQISVRQPLPLQGLQFRELF---SWLSS 194 A + D A +++G+ D TL I+ Q LQ F L L+ Sbjct: 1997 AGPA----QIARDNGIAIYAVGIGSEVDFATLLDIAGLQSRVLQINDFSSLLDAEEQLTE 2052 Query: 195 SLRSVS-RSTPGTEV 208 + ++S PG V Sbjct: 2053 IVCNISYCGDPGAPV 2067 Score = 80.8 bits (198), Expect = 3e-14, Method: Composition-based stats. Identities = 36/178 (20%), Positives = 59/178 (33%), Gaps = 19/178 (10%) Query: 19 RCPCILLLDVSGSMNGRPINELNAGLVTFRDELLADPLALKRVELGIVTFGPVHVEQPFT 78 + LLD S S+ + ++ +G+V + + F Sbjct: 2507 PVDLVFLLDGSSSITSPNFQIVKDFTADVVRTFN---VSSAATNVGLVQYSDTIRTEFFL 2563 Query: 79 SAANFFPPILFA-------QGDTPMGAAITKALDMVEERKREYRANGISYYRPWIFLITD 131 ++ + +L A QG+T GAAI R N Y + ++TD Sbjct: 2564 NSFDTKSEVLNAIGNIGYLQGNTRTGAAIDFVRISSFSVPAGNRGNQPDY----LIVVTD 2619 Query: 132 GAPTDEWQAAANKVFRGEEDKRFAFFSIGVQGA-DMKTLAQISVRQPLPLQGLQFREL 188 G DE A + + F++G+ D TL I+ LQ F L Sbjct: 2620 GLSQDEVLGPA----QTARFEGINIFAVGIGNEIDFTTLLHIAGSPNRVLQINDFAGL 2673 Score = 80.0 bits (196), Expect = 5e-14, Method: Composition-based stats. Identities = 35/178 (19%), Positives = 60/178 (33%), Gaps = 19/178 (10%) Query: 19 RCPCILLLDVSGSMNGRPINELNAGLVTFRDELLADPLALKRVELGIVTFGPVHVEQPFT 78 + LLD S S+ + ++ +G+V + + F Sbjct: 2248 PVDLVFLLDGSSSITSPNFQIVKDFTADVVRTFN---VSSAATNVGLVQYSDTIRTEFFL 2304 Query: 79 SAANFFPPILFA-------QGDTPMGAAITKALDMVEERKREYRANGISYYRPWIFLITD 131 ++ + +L A QG+T GAAI R N Y + ++TD Sbjct: 2305 NSFDTKSGVLNAIGNIGYLQGNTRTGAAIDFVRISSFSVPAGNRGNQPDY----LIVVTD 2360 Query: 132 GAPTDEWQAAANKVFRGEEDKRFAFFSIGVQGA-DMKTLAQISVRQPLPLQGLQFREL 188 G D+ A + + + F++G+ D TL I+ LQ F L Sbjct: 2361 GLSQDDVVVPA----QTARNDGISIFAVGIGSEIDFATLLNIAGSPNRILQINDFAGL 2414 Score = 77.7 bits (190), Expect = 3e-13, Method: Composition-based stats. Identities = 28/180 (15%), Positives = 52/180 (28%), Gaps = 20/180 (11%) Query: 20 CPCILLLDVSGSMNGRPINELNAGLVTFRDELLADPLALKRVELGIVTFGPVHVEQ---- 75 + LLD S S+ + + ++ + + + + Sbjct: 14 VDLVFLLDGSASVGASNFELVKDFTQQTTAKF---DISDGSTRVAVAQYSSTPQVEFNLN 70 Query: 76 ---PFTSAANFFPPILFAQG-DTPMGAAITKALDMVEERKREYRANGISYYRPWIFLITD 131 + +N I + G T G AI R + + ++TD Sbjct: 71 TNSDVDTLSNAIEQITYMNGDSTFTGFAIEFVRQSAFSSFNGARDDKPD----IMVVVTD 126 Query: 132 GAPTDEWQAAANKVFRGEEDKRFAFFSIGVQ-GADMKTLAQISVRQPLPLQGLQFRELFS 190 G D ++A F++GV G + L I+ LQ F +L Sbjct: 127 GQSADSVTSSAATAREQ----GVTMFAVGVGTGVGLSELQDIAGYTDRVLQLNDFVQLAQ 182 Score = 63.1 bits (152), Expect = 6e-09, Method: Composition-based stats. Identities = 28/177 (15%), Positives = 53/177 (29%), Gaps = 18/177 (10%) Query: 21 PCILLLDVSGSMNGRPINELNAGLVTFRDELLADPLALKRVELGIVTFGPVHVEQPFTSA 80 + L+D S S+ + + F + + L +G V F + Sbjct: 838 DLVFLVDKSSSVGPANFELVKEFMYDFTNTFS---VGLSDTRIGAVQFADAQTKD--FDM 892 Query: 81 ANFFPPILFAQGDTPM--------GAAITKALDMVEERKREYRANGISYYRPWIFLITDG 132 F G + G A A+D V + + + ++T Sbjct: 893 DTFATKEQTLAGIQNIVYTDNQVGGVATGAAIDFVRQNSYTRGNGDRTSVPDLLVVVTSS 952 Query: 133 APTDEWQAAANKVFRGEEDKRFAFFSIGVQG-ADMKTLAQISVRQPLPLQGLQFREL 188 A TD+ +A E + +++GV L + L+ F +L Sbjct: 953 ASTDDVASA----QETAEKEGITIYTVGVTNSVSFAELTSTAGSFSRVLRANDFSDL 1005 Score = 57.7 bits (138), Expect = 3e-07, Method: Composition-based stats. Identities = 24/101 (23%), Positives = 37/101 (36%), Gaps = 9/101 (8%) Query: 89 FAQGDTPMGAAITKALDMVEERKREYRANGISYYRPWIFLITDGAPTDEWQAAANKVFRG 148 A+ +T GAAI RA + + ++TDG D A + Sbjct: 2121 PAERNTLTGAAIDFVRTSTFSTPAGNRAGQPDF----LIVVTDGLSQDNVAVPA----QT 2172 Query: 149 EEDKRFAFFSIGVQG-ADMKTLAQISVRQPLPLQGLQFREL 188 + + F++G+ D TL QI+ LQ F L Sbjct: 2173 ARNNGISIFAVGIGSEVDADTLLQIAGTPSRTLQINDFAGL 2213 >UniRef50_O00339 Matrilin-2 n=30 Tax=Euteleostomi RepID=MATN2_HUMAN Length = 956 Score = 100 bits (250), Expect = 3e-20, Method: Composition-based stats. Identities = 34/182 (18%), Positives = 57/182 (31%), Gaps = 17/182 (9%) Query: 16 PEPRCPCILLLDVSGSMNGRPINELNAGLVTFRDELLADPLALKRVELGIVTFGPV---- 71 R + ++D S S+N ++ +V L + +G++ +G Sbjct: 52 ENKRADLVFIIDSSRSVNTHDYAKVKEFIVDILQFL---DIGPDVTRVGLLQYGSTVKNE 108 Query: 72 ---HVEQPFTSAANFFPPILFAQGDTPMGAAITKALDMVEERKREYRANGISYYRPWIFL 128 + + + T G AI AL++ R + R + + Sbjct: 109 FSLKTFKRKSEVERAVKRMRHLSTGTMTGLAIQYALNIAFSEAEGARPLRENVPRVIMIV 168 Query: 129 ITDGAPTDEWQAAANKVFRGEEDKRFAFFSIGVQGADMKTLAQISVRQP--LPLQGLQFR 186 TDG P D A K D F+IGV D TL I F Sbjct: 169 -TDGRPQDSVAEVAAK----ARDTGILIFAIGVGQVDFNTLKSIGSEPHEDHVFLVANFS 223 Query: 187 EL 188 ++ Sbjct: 224 QI 225 Score = 91.6 bits (226), Expect = 2e-17, Method: Composition-based stats. Identities = 30/208 (14%), Positives = 64/208 (30%), Gaps = 21/208 (10%) Query: 19 RCPCILLLDVSGSMNGRPINELNAGLVTFRDELLADPLALKRVELGIVTFGPVHVEQPFT 78 + ++D S S+ + + D L P A +G++ + V FT Sbjct: 653 PIDLVFVIDGSKSLGEENFEVVKQFVTGIIDSLTISPKA---ARVGLLQY-STQVHTEFT 708 Query: 79 --------SAANFFPPILFAQGDTPMGAAITKALDMVEERKREYRANGISYYRPWIFLIT 130 + + + G A+ + + R R I T Sbjct: 709 LRNFNSAKDMKKAVAHMKYMGKGSMTGLALKHMFERSFTQGEGARPLSTRVPRAAIVF-T 767 Query: 131 DGAPTDEWQAAANKVFRGEEDKRFAFFSIGVQGADMKTLAQISVRQPLP--LQGLQFREL 188 DG D+ + + +++GV A + L +I+ F + Sbjct: 768 DGRAQDDVS----EWASKAKANGITMYAVGVGKAIEEELQEIASEPTNKHLFYAEDFSTM 823 Query: 189 FSWLSSSLRS-VSRSTPGTEVVLEAPKG 215 +S L+ + + ++ ++P G Sbjct: 824 -DEISEKLKKGICEALEDSDGRQDSPAG 850 >UniRef50_UPI00006CAF43 von Willebrand factor type A domain containing protein n=1 Tax=Tetrahymena thermophila RepID=UPI00006CAF43 Length = 631 Score = 100 bits (250), Expect = 3e-20, Method: Composition-based stats. Identities = 34/210 (16%), Positives = 70/210 (33%), Gaps = 30/210 (14%) Query: 13 ASNPEPRCPCILLLDVSGSMNGRPINELNAGLVTFRDELLADPLALKRVELGIVTFGPV- 71 + I ++D SGSM+G+ + L + + + +++F V Sbjct: 136 EQSERVPMDLICVIDDSGSMSGKKAQLVRKSLKYLLKIMNEND------RICLISFDSVE 189 Query: 72 HVEQPF-------TSAANFFPPILFAQGDTPMGAAITKALDMVEERKREYRANGISYYRP 124 + PF S + +G T + A + L M++ RK + Sbjct: 190 KILTPFLRNNLENKSELKKAIKNIVGRGSTNIEAGMEAGLWMIKNRKEKNPIT------- 242 Query: 125 WIFLITDGAPTDEWQAAANKVFRGEEDKRF----AFFSIGVQ-GADMKTLAQISVRQ-PL 178 +FL++DG D+ +V + + + G D + I+ Sbjct: 243 CMFLLSDGQ--DDSPQVDLRVQKLIQSYDIQDTFIVNTYGYGADHDATQMRNIAETHKGG 300 Query: 179 PLQGLQFRELFSWLSSSLRSVSRSTPGTEV 208 +++ W S+ + S G +V Sbjct: 301 YYYIEDVKKVSEWFVLSISGL-LSAVGEDV 329 >UniRef50_C0Z5D5 Putative uncharacterized protein n=1 Tax=Brevibacillus brevis NBRC 100599 RepID=C0Z5D5_BREBN Length = 513 Score = 100 bits (249), Expect = 4e-20, Method: Composition-based stats. Identities = 38/224 (16%), Positives = 73/224 (32%), Gaps = 30/224 (13%) Query: 7 FATSDFASNPEPRCPCILLLDVSGSMNGR-PINELNAGLVTFRDELLADPLALKRVELGI 65 + + + ++DVSGSMN + + L D+L +GI Sbjct: 165 IKGKELSPKERKPANLVFVIDVSGSMNQENRLELVKKSLHVLVDQLQPTDS------VGI 218 Query: 66 VTFGPV-HVEQPFTSAANFF-----PPILFAQGDTPMGAAITKALDMVEERKREYRANGI 119 V +G V P TS + L +G T + +M + N Sbjct: 219 VVYGSEGRVLLPPTSTEDKQAILSAIDELQPEGSTNAEQGLVLGYEMAARSFKPPAINR- 277 Query: 120 SYYRPWIFLITDGAPTDEWQAAANKVFR----GEEDKRFAFFSIGVQGADMKTLAQISVR 175 + L +DG A + +D + F G+ + + Q++ + Sbjct: 278 ------VILCSDGVANVGETGAEGILRSIEDYARKDIYLSSFGFGMGNYNDVMMEQLANK 331 Query: 176 QPLPLQGLQ-FRE----LFSWLSSSLRSVSRSTPGTEVVLEAPK 214 + F E L+ +L++++R +V + K Sbjct: 332 GEGSYAYIDTFSEARRIFTESLTGTLQTIARDVK-IQVEFDPKK 374 >UniRef50_UPI0001760CA2 PREDICTED: inter-alpha (globulin) inhibitor H5-like n=1 Tax=Danio rerio RepID=UPI0001760CA2 Length = 1157 Score = 100 bits (249), Expect = 4e-20, Method: Composition-based stats. Identities = 36/211 (17%), Positives = 67/211 (31%), Gaps = 18/211 (8%) Query: 16 PEPRCPCILLLDVSGSMNGRPINELNAGLVTFRDELLADPLALKRVELGIVTFG----PV 71 P I ++D+SGSM G I + A +V+ +L V V Sbjct: 288 PVVPKDVIFVIDISGSMIGTKIKQTKAAMVSILSDLREGDYFNLITFSDDVHTWKKDRTV 347 Query: 72 HVEQPFTSAANFFPPILFAQGDTPMG---AAITKALDMVEERKREYRANGISYYRPWIFL 128 + A F + A G T + + K L+ S P I Sbjct: 348 RATRQNVRDAKEFVRKIIAAGWTNINAALLSAAKLLNPSTRSSSSTGRAPSSQRVPMIIF 407 Query: 129 ITDGAPTDEWQAAANKVFRGEEDKR-FAFFSIGVQ-GADMKTLAQISVRQPLPLQ----- 181 +TDG T + ++ + F + AD L ++++ + Sbjct: 408 LTDGEATIGETETDVILHNAQKSLGLVSLFGLAFGDDADFPMLRRLALENRGVARMVYED 467 Query: 182 ---GLQFRELFSWLSS-SLRSVSRSTPGTEV 208 +Q + + +++ L + S +V Sbjct: 468 DDAAIQLKGFYDEVATPLLSDIQLSYLDDQV 498 >UniRef50_Q7G2L9 Os10g0464500 protein n=3 Tax=Magnoliophyta RepID=Q7G2L9_ORYSJ Length = 719 Score = 100 bits (249), Expect = 4e-20, Method: Composition-based stats. Identities = 36/216 (16%), Positives = 60/216 (27%), Gaps = 43/216 (19%) Query: 4 QITFATSDFASNPEPRCPCILLLDVSGSMNGRPINELNAGLVTFRDELLADPLALKRVEL 63 + +S + +LDVS SM G + L + L L Sbjct: 244 HLKAPSSPATVTSRAPIDLVTVLDVSWSMAGTKLALLKRAMSFVIQALGPGD------RL 297 Query: 64 GIVTF-GPVHVEQPFTSAAN-------FFPPILFAQGDTPMGAAITKALDMVEERKREYR 115 +VTF P L A G T + A+ KA ++E+R+ Sbjct: 298 SVVTFSSSARRLFPLRKMTESGRQRALQRVSSLVADGGTNIADALRKAARVMEDRRERNP 357 Query: 116 ANGISYYRPWIFLITDGAPT-------------DEWQAAA----NKVFRGEEDKRFAFFS 158 I L++DG T D+ A + + G + + Sbjct: 358 VCS-------IVLLSDGRDTYTVPVPRGGGGGGDQPDYAVLVPSSLLPGGGSARHVQVHA 410 Query: 159 IGVQ-GADMKTLAQIS----VRQPLPLQGLQFRELF 189 G D + I+ ++ F Sbjct: 411 FGFGADHDSPAMHSIAEMSGGTFSFIDAAGSIQDAF 446 >UniRef50_A5UTA6 von Willebrand factor, type A n=6 Tax=Chloroflexi (class) RepID=A5UTA6_ROSS1 Length = 425 Score = 100 bits (248), Expect = 4e-20, Method: Composition-based stats. Identities = 33/185 (17%), Positives = 61/185 (32%), Gaps = 25/185 (13%) Query: 16 PEPRCPCILLLDVSGSMNGRPINELNAGLVTFRDELLADPLALKRVELGIVTFGP-VHVE 74 P+ L+LD S SM G + ++ D+L D +V F V Sbjct: 41 PKLPLNLCLVLDRSSSMRGERLMQVKEAAARIVDQLGPDD------YFSLVVFNDRADVV 94 Query: 75 QPFTSAA-----NFFPPILFAQGDTPMGAAITKALDMVEERKREYRANGISYYRPWIFLI 129 P A + A G T M + AL V+ ++ + L+ Sbjct: 95 IPAQRAIKKSDLKAAIAQIEAAGGTEMAQGLALALQEVQR-------PFLTRGISRLILL 147 Query: 130 TDGAPTDEWQAAANKVFRGEEDKRFAFFSIGVQ-GADMKTLAQIS----VRQPLPLQGLQ 184 TDG T ++ ++ R + + ++G+ + L ++ R Sbjct: 148 TDGR-TYGDESRCVEIARRGQSRGIGLTALGIGTEWNEDLLETMTASENSRAQYIATAQD 206 Query: 185 FRELF 189 ++F Sbjct: 207 VVKVF 211 >UniRef50_Q5CZQ6 Matn4 protein (Fragment) n=12 Tax=Euteleostomi RepID=Q5CZQ6_DANRE Length = 261 Score = 100 bits (248), Expect = 5e-20, Method: Composition-based stats. Identities = 28/178 (15%), Positives = 60/178 (33%), Gaps = 17/178 (9%) Query: 20 CPCILLLDVSGSMNGRPINELNAGLVTFRDELLADPLALKRVELGIVTFGP-VHVEQPFT 78 +LL+D S S+ + + + D+L ++ K +G+V + V E P + Sbjct: 28 IDLVLLIDGSKSVRPQNFELVKQFVNQVVDQL---DVSAKGTRVGLVQYSSCVRTEFPLS 84 Query: 79 ------SAANFFPPILFAQGDTPMGAAITKALDMVEERKREYRANGISYYRPWIFLITDG 132 + + + T G A+ ++ R + R + TDG Sbjct: 85 MYHSKDEIKKAVMNVEYMEKGTMTGLALKHMVENSFSEAEGARPAEKNIPRVGLVF-TDG 143 Query: 133 APTDEWQAAANKVFRGEEDKRFAFFSIGVQGADMKTLAQISVRQP--LPLQGLQFREL 188 D+ + + ++ +++GV A L +I+ F + Sbjct: 144 RSQDD----IQEWAKKAKEAGITMYAVGVGKAVEDELREIASDPVEKHFFYSADFTAI 197 >UniRef50_Q8H924 Os10g0464900 protein n=3 Tax=Oryza sativa RepID=Q8H924_ORYSJ Length = 646 Score = 100 bits (248), Expect = 5e-20, Method: Composition-based stats. Identities = 35/198 (17%), Positives = 58/198 (29%), Gaps = 39/198 (19%) Query: 19 RCPCILLLDVSGSMNGRPINELNAGLVTFRDELLADPLALKRVELGIVTFGPVH----VE 74 + +LDVSGSM G + L + D L L +++F Sbjct: 173 PLDLVTVLDVSGSMVGNKLALLKQAMGFVIDNLGPGD------RLCVISFSSGASRLMRL 226 Query: 75 QPFTSAANFFPPI----LFAQGDTPMGAAITKALDMVEERKREYRANGISYYRPWIFLIT 130 T A L A+G T +GAA+ KA ++++R + L++ Sbjct: 227 SRMTDAGKAHAKRAVGSLSARGGTNIGAALRKAAKVLDDRLYRNAVES-------VILLS 279 Query: 131 DGAPT-----------DEWQAAA---NKVFRGEEDKRFA---FFSIGVQ-GADMKTLAQI 172 DG T D A + V + G D + I Sbjct: 280 DGQDTYTVPPRGGYDRDANYDALVPPSLVRADAGGGGGRAPPVHTFGFGKDHDAAAMHTI 339 Query: 173 SVRQPLPLQGLQFRELFS 190 + ++ Sbjct: 340 AEVTGGTFSFIENEAAIQ 357 >UniRef50_A9V370 Predicted protein n=1 Tax=Monosiga brevicollis RepID=A9V370_MONBE Length = 471 Score = 100 bits (248), Expect = 5e-20, Method: Composition-based stats. Identities = 34/193 (17%), Positives = 61/193 (31%), Gaps = 27/193 (13%) Query: 11 DFASNPEPRCPCILLLDVSGSMNGRPINELNAGLVTFRDELLADPLALKRVELGIVTF-G 69 DF P + ++DVSGSM G+ + + + L L +VTF Sbjct: 49 DFEPVERPAIDLVAVIDVSGSMAGQKLKMVQSTLEFLMRNLK------DTDRFALVTFDS 102 Query: 70 PVHVEQPFTSAANFFPP-------ILFAQGDTPMGAAITKALDMVEERKREYRANGISYY 122 V L A T + + + ++++++R A Sbjct: 103 DVKTVFDLRPMTTAHKEACLADVQKLRAGSCTNLSGGLFRGVELMQQRGATKGAVSS--- 159 Query: 123 RPWIFLITDGAPT------DEWQAAANKVFRGEEDKRFAFFSIGVQGADMKTLAQISVRQ 176 I L+TDG D+ A + D F G + + L Q+S Sbjct: 160 ---ILLMTDGIANEGVRDKDDMCRALRGLMGPAPDYTIYTFGYG-KDHNENMLRQLSETG 215 Query: 177 PLPLQGLQFRELF 189 ++ ++ Sbjct: 216 NGMYYFIESNDII 228 >UniRef50_UPI0000F1FEC5 PREDICTED: similar to Clca1 protein n=2 Tax=Danio rerio RepID=UPI0000F1FEC5 Length = 903 Score = 100 bits (248), Expect = 5e-20, Method: Composition-based stats. Identities = 35/217 (16%), Positives = 61/217 (28%), Gaps = 36/217 (16%) Query: 12 FASNPEPRCPCILLLDVSGSMNGR-PINELNAGLVTFRDELLADPLALKRVELGIVTFGP 70 F + L+LDVSGSM I + ++ +GIV F Sbjct: 290 FKLLQRKKRAVCLILDVSGSMATESRILRMRQAATHLLRN-----YVEEQASVGIVKFST 344 Query: 71 VHVEQPFTSAANFFPPI--------LFAQGDTPMGAAITKALDMVEERKREYRANGISYY 122 + G T M + L ++ E + + Sbjct: 345 AASIVSSLTIIESDATRDHLINLLPETPGGSTNMCNGLRLGLQVLSEDDMDAIGDE---- 400 Query: 123 RPWIFLITDGAPTDEWQAAANKVFRGEEDKRFAFFSIGVQGADMKTLAQISVRQPL---- 178 I +TDG TD+ + +I + + L +++ + Sbjct: 401 ---IIFLTDGQATDDVTLCIPDAI----NSGAIIHTIALSDSAHNALQEMADKTGGIFFY 453 Query: 179 ---PLQGLQFRELFSWLSSSLRSVSRSTPGTEVVLEA 212 Q + F+ L+ S S V LE+ Sbjct: 454 SKDDFTSNQLMDAFASLTLSTGDHS----NEPVQLES 486 >UniRef50_A7I7X6 Putative uncharacterized protein n=1 Tax=Candidatus Methanoregula boonei 6A8 RepID=A7I7X6_METB6 Length = 229 Score = 100 bits (248), Expect = 5e-20, Method: Composition-based stats. Identities = 41/223 (18%), Positives = 68/223 (30%), Gaps = 29/223 (13%) Query: 17 EPRCPCILLLDVSGSMNGRPINELNAGLVTFRDELLA--DPLALKRVELGIVTFG-PVHV 73 + L D S SM G+ I LN + E+ + + + F Sbjct: 3 RKQLHFFWLADCSDSMRGKKIATLNQAIREALPEVQKAVAAYPQVDIRMRAIKFSNDAAW 62 Query: 74 EQ--PFTSAANFFPPILFAQGDTPMGAAITKALDMVEERKREYRANGISYYRPWIFLITD 131 F P L +G T AI + + R P LI+D Sbjct: 63 HVGPDPVPINEFVWPELETEGLTATAKAIRLLTGELSIERMPRR-----GLPPICILISD 117 Query: 132 GAPTDEWQA---AANKVFRGEEDKRFAFFSIGVQ---GADMKTLAQISVRQPLPL----Q 181 G TD + A ++ + + +I + + L + + ++ + L Sbjct: 118 GFCTDPREEYDTAIAELGKIPWGIKAVRLAIAIGDESDYNATELLKFANQESVGLLKAHS 177 Query: 182 GLQFRELFSW-----LSSSLRSVSRSTPGTE----VVLEAPKG 215 + W +S R SR P E V LE+P Sbjct: 178 PEELVAYIKWASVSASVASSRGRSRGAPAEEDTSNVALESPPP 220 >UniRef50_C3YUL3 Putative uncharacterized protein n=1 Tax=Branchiostoma floridae RepID=C3YUL3_BRAFL Length = 1201 Score = 100 bits (248), Expect = 5e-20, Method: Composition-based stats. Identities = 38/202 (18%), Positives = 71/202 (35%), Gaps = 25/202 (12%) Query: 19 RCPCILLLDVSGSMNGRPINELNAGLVTFRDELLADPLALKRVELGIVTFGPVHVE---- 74 LLD SGS+N ++ V + ++L +G+V + + Sbjct: 667 PLDLFFLLDGSGSVNAANFVKVKQFAVNVVNTF---DVSLTATRVGVVQYSDRNTLVFNL 723 Query: 75 ---QPFTSAANFFPPILFAQGDTPMGAAITKALDMVEERKREYRANGISYYRPWIFLITD 131 S + I++ G T GAA+ R+Y A I ++TD Sbjct: 724 GNKVNKPSTVSAINNIVYQSGGTNTGAALQY--------VRQYAAWRGGNVPKVIIVLTD 775 Query: 132 GAPTDEWQAAANKVFRGEEDKRFAFFSIGVQGADMKTLAQISVRQPLPLQGLQFRELFSW 191 G +D + + ++IGV D L QI+ + + L F+ Sbjct: 776 GKSSDSVSGPSQNLVAA----GVEVYAIGVGSFDHGQLLQIANNKQNNVIELNN---FNA 828 Query: 192 LSSSLRSVSRSTPGTEVVLEAP 213 L++ + +S + + + P Sbjct: 829 LATKIDMISTNVCSYALHVPTP 850 >UniRef50_A9V8A7 Predicted protein n=3 Tax=root RepID=A9V8A7_MONBE Length = 2847 Score = 99.7 bits (247), Expect = 6e-20, Method: Composition-based stats. Identities = 36/189 (19%), Positives = 70/189 (37%), Gaps = 17/189 (8%) Query: 18 PRCPCILLLDVSGSMNGRPINELNAGLVTFRDELLADPLALKRVELGIVTFGPVHV--EQ 75 LLD SGS++GR + + L++ V +G+ + + Sbjct: 817 REIDVFFLLDGSGSIDGRDFELQRSFVRDLVSNLMSGDN---DVRVGVAEYSSTYTQIVF 873 Query: 76 PFTSAANFFPPILFAQ----GDTPMGAAITKALDMVEERKREYRANGISYYRPWIFLITD 131 PF+S+ + L + G T G ++ +A D + + S + L+TD Sbjct: 874 PFSSSQSAIDSSLSSMIQTAGATATGTSLGEAADDI-------GSTARSSAARVLILMTD 926 Query: 132 GAPTDEWQAAANKVFRGEEDKRFAFFSIGVQG-ADMKTLAQISVRQPLPLQGLQFRELFS 190 G +D + + + +IGV A L QI+ + F +L S Sbjct: 927 GETSDGDEQNIDPSVDALRALGVSITAIGVGNSASESELLQIAGSSDHVFNNIAFVDLSS 986 Query: 191 WLSSSLRSV 199 +++ + + Sbjct: 987 FINQIIGQI 995 Score = 71.2 bits (173), Expect = 3e-11, Method: Composition-based stats. Identities = 35/181 (19%), Positives = 57/181 (31%), Gaps = 27/181 (14%) Query: 20 CPCILLLDVSGSMNGRPINELNAGLVTFRDELLADPLALKRVELGIVTFGPVHVEQ---- 75 IL+ D SGS+ R L + + + +TF Sbjct: 1243 TDLILIQDNSGSIEERDFQTSIQFLRALVNG---ADIESSGSRIAAITFCSEPTLLTDYV 1299 Query: 76 ----PFTSAANFFPPILFAQGDTPMGAAITKALDMVEERKREYRANGISYYRPWIFLITD 131 A N L T GAA+ + + + A R + +ITD Sbjct: 1300 STTSEALDALNTASNTLTC--GTATGAALDFVRENILTDRSNSGA------RRVVIVITD 1351 Query: 132 GAPTDEW---QAAANKVFRGEEDKRFAFFSIGVQGA-DMKTLAQISVRQPLPLQGLQFRE 187 G +++ Q A ++ +D ++IGV D+ L I+ Q F Sbjct: 1352 GESQEDFSVVQNAGARLQAEVDD----VYAIGVGSGTDLAELRVIASSDDNTFQEASFDN 1407 Query: 188 L 188 L Sbjct: 1408 L 1408 Score = 52.7 bits (125), Expect = 9e-06, Method: Composition-based stats. Identities = 38/212 (17%), Positives = 65/212 (30%), Gaps = 24/212 (11%) Query: 21 PCILLLDVSGSMNGRPINELNAGLVTFRDELLADPLALKRVELGIVTFGPVHVEQPFTSA 80 +LD S S++ R L L ++ + I F + S Sbjct: 450 DVAFILDSSDSIDDRDYQLLKDFTSALVRSLT---VSSTNARVAIELFSSEPQIETGFSY 506 Query: 81 ANFF----PPILFAQG-DTPMGAAITKAL-DMVEERKREYRANGISYYRPWIFLITDGAP 134 + L T G A+ A D+ + +R+ + + +ITDG Sbjct: 507 DESYLISVINSLPHLKLGTATGEALRMARQDIFSDNDALFRSFSVPAFA---IVITDGNS 563 Query: 135 TDEWQAAANKVFRGEEDKRFAFFSIGVQG-ADMKTLAQISVRQPLPLQGLQFRELF--SW 191 ++ A + R + F++GV + L I+ F L S+ Sbjct: 564 LEDASYVAEQARR-LREHGVQVFALGVGSQITVSQLVDIAGDNARVFGVADFGVLNASSF 622 Query: 192 LSSSLRSVSRSTPGTEVV--------LEAPKG 215 +S L V S + L P G Sbjct: 623 VSQFLEDVFCSETPDDGDVGRPGDRGLPGPDG 654 >UniRef50_UPI0001C161B1 von Willebrand factor, type A Precursor n=2 Tax=Nostocaceae RepID=UPI0001C161B1 Length = 474 Score = 99.7 bits (247), Expect = 6e-20, Method: Composition-based stats. Identities = 46/196 (23%), Positives = 70/196 (35%), Gaps = 24/196 (12%) Query: 18 PRCPCILLLDVSGSMNGRPINELNAGLVTFRDELLADPLALKRVELGIVTFG-PVHVEQP 76 +LL+D S SM+ + E+ F L+ ++ +V FG V P Sbjct: 48 KPQAIVLLIDTSSSMSDGKLAEVKTAASQFIQRRN-----LESDQIAVVNFGATVQTPAP 102 Query: 77 FTSAANF---FPPILFAQGDTPMGAAITKALDMVEERKREYRANGISYYRPWIFLITDGA 133 T+ N L G TPMG I A D ++ I L TDG Sbjct: 103 LTNDINTLNNAIDQLLEIGSTPMGEGINTAQDQLQATT----------LNKNIILFTDGL 152 Query: 134 PTDEWQAAANKVFRGEEDKRFAFFSIGVQGADMKTLAQISVRQPLPL--QGLQFRELFSW 191 P D A + + + ++ GAD L QI+ + L QF + FS Sbjct: 153 PDDPNFAYNSAL--SVRNAGIKLIAVATGGADTNYLTQITGDRSLVFYANSGQFDQAFSQ 210 Query: 192 LSSSL-RSVSRSTPGT 206 + + + + S G Sbjct: 211 AEAVIYKQLIESNTGE 226 >UniRef50_Q562D1 LOC594926 protein (Fragment) n=18 Tax=Euteleostomi RepID=Q562D1_XENTR Length = 895 Score = 99.7 bits (247), Expect = 6e-20, Method: Composition-based stats. Identities = 33/180 (18%), Positives = 66/180 (36%), Gaps = 11/180 (6%) Query: 7 FATSDFASNPEPRCPCILLLDVSGSMNGRPINELNAGLVTFRDELLADPLALKRVE---L 63 FA S P+ I ++D S SM G + + L+ D++ + + Sbjct: 265 FAPSKLKEVPK---NIIFIIDRSISMIGLKMQQTKEALLKILDDVKEHDHFNFVIFDWGV 321 Query: 64 GIVTFGPVHVEQPFTSAANFFPPILFAQGDTPMGAAITKALDMVEERKREYRANGISYYR 123 I V + A + L+ +G T + A+ A+ ++++ S Sbjct: 322 EIWEQSLVKATPENLNRAKAYVRNLYPKGWTNINDALLSAISLLDQAHDARSVPKRS--A 379 Query: 124 PWIFLITDGAPTDEWQAA--ANKVFRGEEDKRFAFFSIGVQ-GADMKTLAQISVRQPLPL 180 I +TDG P+ + + R +++ +S+G G D L ++S+ Sbjct: 380 SLIIFMTDGQPSTGERNLDKIQENARNAIRGKYSLYSLGFGVGVDYPFLEKLSLENSGVA 439 >UniRef50_B4VXM6 Vault protein inter-alpha-trypsin n=1 Tax=Microcoleus chthonoplastes PCC 7420 RepID=B4VXM6_9CYAN Length = 928 Score = 99.3 bits (246), Expect = 8e-20, Method: Composition-based stats. Identities = 36/192 (18%), Positives = 62/192 (32%), Gaps = 25/192 (13%) Query: 5 ITFATSDFASNPEPRCPCILLLDVSGSMNGRPINELNAGLVTFRDELLADPLALKRVELG 64 ++ + + L+D SGS +G P+N+ + F + L Sbjct: 411 YLIPAIEYNPHQLVPKDVVFLIDTSGSQSGEPLNKCQELMRRFINGLNPHDT------FT 464 Query: 65 IVTFGPVHVEQPFTSAANF---------FPPILFAQGDTPMGAAITKALDMVEERKREYR 115 I+ F + AN + L A G T + I L+ E R Sbjct: 465 IIDFSDTTRQLSPVPLANTVQNRNSAMNYINQLNASGGTQLRRGIQAVLNFPEVDPGRLR 524 Query: 116 ANGISYYRPWIFLITDGAPTDEWQAAANKVFRGEEDKRFAFFSIGVQGADMKTLAQISVR 175 + I L+TDG +E Q A + R F G + L +I+ Sbjct: 525 S---------IVLLTDGYIGNENQILAEVQRHLKLGNRLHSFGAG-SSVNRFLLNRIAEI 574 Query: 176 QPLPLQGLQFRE 187 + +++ E Sbjct: 575 GRGISRIVRYDE 586 >UniRef50_UPI000194DA30 PREDICTED: collagen, type XX, alpha 1 n=1 Tax=Taeniopygia guttata RepID=UPI000194DA30 Length = 1505 Score = 99.3 bits (246), Expect = 8e-20, Method: Composition-based stats. Identities = 44/217 (20%), Positives = 76/217 (35%), Gaps = 28/217 (12%) Query: 10 SDFASNPEPRCPCILLLDVSGSMNGRPINELNAGLVTFRDELLADPLALKRVELGIVTFG 69 S F + +LL+D S S+ + L +A ++ +G+ + Sbjct: 250 SQFQCDTPAMIDLVLLVDGSWSIGRNNFKLIKEFLSNLIS---PFSIAEDKIRVGLSQYS 306 Query: 70 PVHVEQPFTSAANFFPPILFA-------QGDTPMGAAITKALDMVEERKREYRANGISYY 122 + SA + +L A G+T G A+T L+ + R Sbjct: 307 SDPRTEWELSAYSTREQVLEAVRNLRYKGGNTFTGLALTHVLEQNLKPDAGARLEAEK-- 364 Query: 123 RPWIFLITDGAPTDEWQAAANKVFRGEEDKRFAFFSIGVQGADMKTLAQISVRQ--PLPL 180 + L+TDG D+ AA + F+IGV+ AD L Q++ Sbjct: 365 --LVILLTDGKSQDDANLAAQTLKNL----GIEIFAIGVKNADEAELRQVASEPLELSVY 418 Query: 181 QGLQFR-------ELFSWLSSSLRSVS-RSTPGTEVV 209 L F +L L + ++ S + T G+ V Sbjct: 419 NVLDFPLLSSLVGKLTRVLCARIKERSHKDTTGSAVK 455 >UniRef50_A0C9G5 Chromosome undetermined scaffold_16, whole genome shotgun sequence n=2 Tax=Paramecium tetraurelia RepID=A0C9G5_PARTE Length = 648 Score = 99.3 bits (246), Expect = 8e-20, Method: Composition-based stats. Identities = 39/205 (19%), Positives = 67/205 (32%), Gaps = 32/205 (15%) Query: 20 CPCILLLDVSGSMNGRPINELNAGLVTFRDELLADPLALKRVELGIVTF-GPVHVEQPFT 78 + ++D SGSM G+ I + LV D L L ++TF G P Sbjct: 228 IDLLCVIDKSGSMEGKKIASVQQSLVQLLDFLSEKD------RLCLITFDGSAQRLTPLK 281 Query: 79 SAANFFPP-------ILFAQGDTPMGAAITKALDMVEERKREYRANGISYYRPWIFLITD 131 + + A G T + A + +++RK + + IFL++D Sbjct: 282 TLTQDNKNYFKKAIYSIRASGQTNIAKGTEIAFNQIQQRKMKNQVTS-------IFLLSD 334 Query: 132 GAPTDEWQAAANKVFRGEEDKRFAFFSIGVQ-GADMKTLAQI----SVRQPLPLQGLQFR 186 G + ED S G D +++I Sbjct: 335 GQDQ-GAAEYIQRQKDVVEDI-VTIHSFGYGSDHDAALMSKICKVGQGSFYYIEDVKLLD 392 Query: 187 ELFSWLSSSLRSVSRSTPGTEVVLE 211 E F + +L +S S +V ++ Sbjct: 393 EFF---ADALGRLS-SALAEKVQID 413 >UniRef50_B9GVZ4 Predicted protein n=4 Tax=rosids RepID=B9GVZ4_POPTR Length = 757 Score = 99.3 bits (246), Expect = 8e-20, Method: Composition-based stats. Identities = 37/204 (18%), Positives = 71/204 (34%), Gaps = 18/204 (8%) Query: 19 RCPCILLLDVSGSMNGRPINELNAGLVTFRDELLADPLAL-KRVELGIVTFGPV---HVE 74 R I ++D+SGSM G P GL++ +L + ++ F V E Sbjct: 330 RKEVIFIIDISGSMKGGPFESAKNGLLSSLQKLNPEDSFNIIAFKMDTYLFSSVMEQATE 389 Query: 75 QPFTSAANFFPPILFAQGDTPMGAAITKALDMVEERKREYRANGISYYRPWIFLITDGAP 134 + A + L A G T + + +A+ ++ E P IFLITDG Sbjct: 390 EAIIEATRWLNDKLTADGGTNILGPLKQAIKLLAETTNS---------IPVIFLITDG-A 439 Query: 135 TDEWQAAANKVFRGEEDKR---FAFFSIGVQGA-DMKTLAQISVRQPLPLQGLQFRELFS 190 ++ + N V + G+ + L ++ + Sbjct: 440 VEDERDICNFVKGYLPSGGSISLRISTFGIGTYCNHHFLRMLAQIGRGHFDTAYDADSVD 499 Query: 191 WLSSSLRSVSRSTPGTEVVLEAPK 214 + L + + S ++ ++A + Sbjct: 500 FRMQKLFTTASSIILADITVDALE 523 >UniRef50_A0DZ93 Chromosome undetermined scaffold_7, whole genome shotgun sequence n=1 Tax=Paramecium tetraurelia RepID=A0DZ93_PARTE Length = 522 Score = 99.3 bits (246), Expect = 8e-20, Method: Composition-based stats. Identities = 33/186 (17%), Positives = 62/186 (33%), Gaps = 26/186 (13%) Query: 17 EPRCPCILLLDVSGSMNGRPINELNAGLVTFRDELLADPLALKRVELGIVTFGPVH-VEQ 75 I L+D SGSM+G ++ + L L + L ++ F + Sbjct: 109 RQGIDLICLIDHSGSMSGEKMHLVKKSLKHLLKMLQPND------RLCLIEFDDQNYRLT 162 Query: 76 PFTSAAN-------FFPPILFAQGDTPMGAAITKALDMVEERKREYRANGISYYRPWIFL 128 A + A G T +G A+ AL +++ R+ + IFL Sbjct: 163 RLMRATQENMYKFLIAIDTIEANGATDIGNAMKMALSILKHRRFKNPIAS-------IFL 215 Query: 129 ITDGAPTDEWQAAANKVFRGEEDKRFAFFSIGVQ-GADMKTLAQIS----VRQPLPLQGL 183 ++DG N + + F + G K +++I+ + + Sbjct: 216 LSDGEDEGAAGRVWNDIQSKNIKEPFTINTFGFGRDCCPKIMSEIAHFKEGQFYYISEIS 275 Query: 184 QFRELF 189 + E F Sbjct: 276 KIDECF 281 >UniRef50_A7BVG3 von Willebrand factor type A domain protein n=1 Tax=Beggiatoa sp. PS RepID=A7BVG3_9GAMM Length = 280 Score = 99.3 bits (246), Expect = 9e-20, Method: Composition-based stats. Identities = 42/222 (18%), Positives = 67/222 (30%), Gaps = 33/222 (14%) Query: 2 SEQITFATSDFASNPEPRC------PCILLLDVSGSMNGRPINELNAGLVTFRDELLADP 55 QI S + P P LL+DVS SM+G + E F + Sbjct: 65 QAQIPDDLSWLSLPPSPESVTLVHQSVFLLIDVSYSMDGSALAEAKQAAQEFVRK----- 119 Query: 56 LALKRVELGIVTFGP-VHVEQPFTSAAN---FFPPILFAQGDTPMGAAITKALDMVEERK 111 L +G++ FG + T A L G T M +T A ++ Sbjct: 120 SDLAHTAIGLIEFGSKAKIISGLTQNAKHLYKAINRLKTNGSTNMTEGLTTAYLKLKN-- 177 Query: 112 REYRANGISYYRPWIFLITDGAPTDEWQAAANKVFRGEEDKRFAFFSIGVQGADMKTLAQ 171 +I L+TDG P ++ + +IG AD L Sbjct: 178 --------VDDPRFIILLTDGLPN--HPKNTQQIAQEICADGIELITIGTGDADKTYLQS 227 Query: 172 IS--VRQPLPLQGLQFRELF----SWLSSSLRSVSRSTPGTE 207 ++ + + F L+ S + + G Sbjct: 228 LACYDQNSFFAKAGTMVSTFSRIAQVLTESGSYIQITQNGQR 269 >UniRef50_UPI00006A1B4F Collagen alpha-3(VI) chain precursor. n=1 Tax=Xenopus (Silurana) tropicalis RepID=UPI00006A1B4F Length = 556 Score = 99.3 bits (246), Expect = 9e-20, Method: Composition-based stats. Identities = 35/180 (19%), Positives = 63/180 (35%), Gaps = 14/180 (7%) Query: 20 CPCILLLDVSGSMNGRPINELNAGLVTFRDELLADPLALKRVELGIVTFGP-VHVEQPFT 78 + L+D S S+N + + + + RV++G++ F E P Sbjct: 1 ADIVFLVDSSASINSDDYETMKEFMESMV---KQAEIGPDRVQIGLIQFSSETKEEFPLN 57 Query: 79 SAANFFP--------PILFAQGDTPMGAAITKALDMVEERKREYRAN--GISYYRPWIFL 128 F IL A P + T +E + + + G + L Sbjct: 58 RYKKQFSLNTYSTKLDILKAVFSLPQVSGYTYTAKALEYTRIRFGTSYGGRPGISHILIL 117 Query: 129 ITDGAPTDEWQAAANKVFRGEEDKRFAFFSIGVQGADMKTLAQISVRQPLPLQGLQFREL 188 +TDGA T+ + V + +D F++GV A + L QI+ ++ L Sbjct: 118 VTDGATTEADRPNLPIVSKALKDDGIIVFAVGVGKAVPQELQQIAGYPDRWFLVQNYKGL 177 >UniRef50_P21941 Cartilage matrix protein n=35 Tax=Euteleostomi RepID=MATN1_HUMAN Length = 496 Score = 99.3 bits (246), Expect = 9e-20, Method: Composition-based stats. Identities = 29/167 (17%), Positives = 50/167 (29%), Gaps = 15/167 (8%) Query: 18 PRCPCILLLDVSGSMNGRPINELNAGLVTFRDELLADPLALKRVELGIVTFGPV------ 71 + ++D S S+ ++ L + L + +G+V + Sbjct: 38 RPTDLVFVVDSSRSVRPVEFEKVKVFLSQVIESL---DVGPNATRVGMVNYASTVKQEFS 94 Query: 72 -HVEQPFTSAANFFPPILFAQGDTPMGAAITKALDMVEERKREYRANGISYYRPWIFLIT 130 + I T G AI A+ R + + ++T Sbjct: 95 LRAHVSKAALLQAVRRIQPLSTGTMTGLAIQFAITKAFGDAEGGR-SRSPDISKVVIVVT 153 Query: 131 DGAPTDEWQAAANKVFRGEEDKRFAFFSIGVQGADMKTLAQISVRQP 177 DG P D Q + + F+IGV D TL QI+ Sbjct: 154 DGRPQDSVQDVSARARA----SGVELFAIGVGSVDKATLRQIASEPQ 196 Score = 92.3 bits (228), Expect = 1e-17, Method: Composition-based stats. Identities = 32/180 (17%), Positives = 57/180 (31%), Gaps = 20/180 (11%) Query: 20 CPCILLLDVSGSMNGRPINELNAGLVTFRDELLADPLALKRVELGIVTF-GPVHVEQPF- 77 + L+D S S+ + + D L ++ K ++G+V + V E P Sbjct: 274 TDLVFLIDGSKSVRPENFELVKKFISQIVDTL---DVSDKLAQVGLVQYSSSVRQEFPLG 330 Query: 78 -----TSAANFFPPILFAQGDTPMGAAITKALDMVEERKREYRANGISYYRPWIFLITDG 132 + + + T GAA+ +D R + TDG Sbjct: 331 RFHTKKDIKAAVRNMSYMEKGTMTGAALKYLIDNSFTVSSGARPGAQKVG----IVFTDG 386 Query: 133 APTDEWQAAANKVFRGEEDKRFAFFSIGVQGADMKTLAQISVRQ--PLPLQGLQFRELFS 190 D N + +D F F++GV A L +I+ F+ + Sbjct: 387 RSQDY----INDAAKKAKDLGFKMFAVGVGNAVEDELREIASEPVAEHYFYTADFKTINQ 442 >UniRef50_UPI0001C378BC von Willebrand factor, type A n=1 Tax=Ruminococcus flavefaciens FD-1 RepID=UPI0001C378BC Length = 565 Score = 99.3 bits (246), Expect = 9e-20, Method: Composition-based stats. Identities = 36/186 (19%), Positives = 70/186 (37%), Gaps = 22/186 (11%) Query: 15 NPEPRCPCILLLDVSGSMNGRPINELNAGLVTFRDELLADPLALKRVELGIVTFGP---V 71 N + +LD SGSM+G P+N L A L + + +G+V++ V Sbjct: 386 NSGKPIAAVFVLDTSGSMSGAPLNSLKASLRNSIKYINSSN------YIGVVSYSSNVNV 439 Query: 72 HVEQPFTSAANF-----FPPILFAQGDTPMGAAITKALDMVEERKREYRANGISYYRPWI 126 +E L A G+T +A+++A+ M+ + ++ P + Sbjct: 440 DLELAKFDLNQQAYFMGAVDSLTASGNTATFSALSQAMIMLRDFTKDNPNVS-----PMV 494 Query: 127 FLITDGAPTDEWQAAANKVFRGEEDKRFAFFSIGVQGADMKTLAQISVRQPLPLQGLQFR 186 FL++DG + + + + ++IG A++ L IS Sbjct: 495 FLLSDGQSNSGSE--FSDIDGAIATAQIPIYTIGY-NANLNELKAISEINEAATINADTD 551 Query: 187 ELFSWL 192 ++ L Sbjct: 552 DVIYQL 557 >UniRef50_C3Y2U7 Putative uncharacterized protein n=1 Tax=Branchiostoma floridae RepID=C3Y2U7_BRAFL Length = 421 Score = 98.9 bits (245), Expect = 9e-20, Method: Composition-based stats. Identities = 30/174 (17%), Positives = 50/174 (28%), Gaps = 12/174 (6%) Query: 21 PCILLLDVSGSMNGRPINELNAGLVTFRDELLADPLALKRVELGIVTFGPVHVEQ----- 75 + +LD SGS++ + + D +A +G+V F E+ Sbjct: 31 DIMFVLDGSGSISADDFVSAKSFISRVVDAF---DIAADFTRVGVVQFSSFFTEEFPLDR 87 Query: 76 --PFTSAANFFPPILFAQGDTPMGAAITKALDMVEERKREYRANGISYYRPWIFLITDGA 133 S I G T +G I ++ + R R L+TDG+ Sbjct: 88 YSDKASLKQAIGNIPQRGGGTLLGQVINYLVNTSFTEAKGARPLSDGIPR-IAVLMTDGS 146 Query: 134 PTDEWQAAANKVFRGEEDKRFAFFSIGVQ-GADMKTLAQISVRQPLPLQGLQFR 186 D FSIGV + L ++ + Sbjct: 147 AHDNPTTVLAPAIDALRASGIIAFSIGVGPSVNRDQLEAVAGDTDRVFLVGAYS 200 >UniRef50_Q2W311 Putative uncharacterized protein n=1 Tax=Magnetospirillum magneticum AMB-1 RepID=Q2W311_MAGSA Length = 1171 Score = 98.9 bits (245), Expect = 9e-20, Method: Composition-based stats. Identities = 40/207 (19%), Positives = 73/207 (35%), Gaps = 29/207 (14%) Query: 20 CPCILLLDVSGSMN---GRPINELNAGLVTFRDELLADPLALKRVELGIVTFG---PVHV 73 ++LLD S SM G P+ + F +L D + +V F VH Sbjct: 63 LDVVMLLDHSSSMGAAPGSPLQMMLRAAGNFLRQLSPD------SRVAVVGFNQVPSVHC 116 Query: 74 EQPFTSA-ANFFPPILFAQGDTPMGAAITKALDMVEERKREYRANGISYYRPWIFLITDG 132 T A A + G T + AA+ +A++++ A+G + L +DG Sbjct: 117 TLAATPAQARSALQAISPGGATSIAAALNQAVELL--------AHGRPGMDKVVVLCSDG 168 Query: 133 APTDEWQAAANKVFRGEEDKRFAFFSIGVQGA--DMKTLAQISVRQPLP--LQGLQFREL 188 D+ A+ + R + ++G LA ++ RQ + ++ Sbjct: 169 Q--DDIAEIADALARLKAIPSVRVLAVGFGDEVIHATFLAMVADRQDYFHLTRARDMDDV 226 Query: 189 FSWLSSSLR--SVSRSTPGTEVVLEAP 213 F L+ + + + V P Sbjct: 227 FQRLAKEVNGPTGLLAAVTEPVATPVP 253 >UniRef50_B8G546 von Willebrand factor type A n=6 Tax=Chloroflexi (class) RepID=B8G546_CHLAD Length = 418 Score = 98.9 bits (245), Expect = 1e-19, Method: Composition-based stats. Identities = 35/213 (16%), Positives = 64/213 (30%), Gaps = 29/213 (13%) Query: 2 SEQITFATSDFASNPEP----RCPCILLLDVSGSMNGRPINELNAGLVTFRDELLADPLA 57 + Q+ + + + P +LD SGSM G + + A + L +A Sbjct: 21 TPQVVYLLVEAVAPASPTSALPLNLCFVLDRSGSMQGAKLESMKAATRRVIELLRPHDVA 80 Query: 58 LKRVELGIVTFGP-VHVEQPFTSAAN-----FFPPILFAQGDTPMGAAITKALDMVEERK 111 IV F V P T + + G T M + A +++ Sbjct: 81 ------AIVIFDDTVQTLIPATPVGDRSALLAAVETITEAGGTAMSLGMQAAQTELQKHL 134 Query: 112 REYRANGISYYRPWIFLITDGAPTDEWQAAANKVFRGEEDKRFAFFSIGVQ-GADMKTLA 170 R + + L+TDG T + + R ++G+ + + L Sbjct: 135 GPDRISR-------MLLLTDGQ-TWGDEPICRDLARTLGQAGVRITALGLGTEWNEQLLD 186 Query: 171 QIS----VRQPLPLQGLQFRELFSWLSSSLRSV 199 I+ Q F ++V Sbjct: 187 DIAAASDGYSDYIADPAQIETFFQQAVKEAQAV 219 >UniRef50_B7AA98 von Willebrand factor type A n=3 Tax=Thermus RepID=B7AA98_THEAQ Length = 706 Score = 98.9 bits (245), Expect = 1e-19, Method: Composition-based stats. Identities = 36/196 (18%), Positives = 60/196 (30%), Gaps = 34/196 (17%) Query: 9 TSDFASNP--EPRCPCILLLDVSGSMNGRPINELNAGLVTFRDELLADPLALKRVELGIV 66 D P +L+LDVSGSM G + AG + + LG+V Sbjct: 294 PEDLPLKPLGRKGAALVLVLDVSGSMEGEKLAMAVAGALELVRSAAPED------YLGVV 347 Query: 67 TFGPVHVEQ-PFTSAANFFPPI-------LFAQGDTPMGAAITKALDMVEERKREYRANG 118 F P L A G T +G A +AL ++++ Sbjct: 348 LFSSSPRVLFPPRPMTAQGKKEAESLLLSLRAGGGTVLGGAFREALRLLQD--------- 398 Query: 119 ISYYRPWIFLITDGAPTDEWQAAANKVFRGEEDKRFAFFSIGVQ-GADMKTLAQIS---- 173 + R + +++DG D + ++ + AD L ++ Sbjct: 399 VPVERKALLVLSDGIIFDPKEPILALAATA----GVEVSALALGPDADAAFLEALAQRGG 454 Query: 174 VRQPLPLQGLQFRELF 189 R + LF Sbjct: 455 GRFYRAATPKELPRLF 470 >UniRef50_A2E1S5 von Willebrand factor type A domain containing protein n=2 Tax=Trichomonas vaginalis RepID=A2E1S5_TRIVA Length = 688 Score = 98.9 bits (245), Expect = 1e-19, Method: Composition-based stats. Identities = 36/194 (18%), Positives = 57/194 (29%), Gaps = 14/194 (7%) Query: 12 FASNPEPRCPCILLLDVSGSMNGRPINELNAGLVTFRDELLADPLALKRVELGIVTFGPV 71 F + ++D SGSM+G I L F L I+ FG Sbjct: 226 FETKVHSNSEFYFIIDCSGSMSGSCIQNAKLCLNIFMHSL------PIGCRFSIIKFGS- 278 Query: 72 HVEQPFTSAANFFPPILFAQGDTPMGAAITKALDMVEERKREYRANGISYYRPWIFLITD 131 E + A A D++ K + +FL+TD Sbjct: 279 DYEVALHPCDYTDENVSEAMKQLNNIDAEMGGTDILSPLKYVMELTPKQGFIKQVFLLTD 338 Query: 132 GAPTDEWQAAANKVFRGEEDKRFAFFSIGVQ-GADMKTLAQI---SVRQPLPLQGLQFRE 187 G D + + FSIG+ GAD + + S + + + + Sbjct: 339 GQ--DSNTNELCALAQENRTNN-RIFSIGIGSGADKDLIINVSQKSGGNYVFVDDDESEK 395 Query: 188 LFSWLSSSLRSVSR 201 L + L S Sbjct: 396 LNEKVIELLNSAIS 409 >UniRef50_A6WMD3 Vault protein inter-alpha-trypsin domain protein n=11 Tax=Shewanella RepID=A6WMD3_SHEB8 Length = 772 Score = 98.9 bits (245), Expect = 1e-19, Method: Composition-based stats. Identities = 41/229 (17%), Positives = 77/229 (33%), Gaps = 28/229 (12%) Query: 5 ITFATSDFASNPEPRCPCILLLDVSGSMNGRPINELNAGLVTFRDELLADPLALKRVELG 64 + + ++ P IL++D SGSM G I + L+ L + Sbjct: 377 VLPPKVEKSTQPSLPRELILVIDTSGSMAGDSIVQAKNALLYALKGLKPEDS------FN 430 Query: 65 IVTFGPVHVEQPFTSA---------ANFFPPILFAQGDTPMGAAITKALDMVEERKREYR 115 I+ F T A F L A G T M A+ A + + Sbjct: 431 IIEFNSSLSLLSATPLPATSSNLSRARQFVSRLQADGGTEMALALDAA---LPKSLGSVS 487 Query: 116 ANGISYYRPWIFLITDGAPTDEWQAAANKVFRGEEDKRFAFFSIGVQGA-DMKTLAQISV 174 + + R + +TDG QA + + + R F++G+ A + + + + Sbjct: 488 PDAVQPLRQ-VIFMTDG-SVGNEQALFDLIRYQIGESRL--FTVGIGSAPNSHFMQRAAE 543 Query: 175 -RQPLPLQGLQFRELFSWLSSSLRSVSRST-PGTEVVLE---APKGWTS 218 + + E+ + +S+ L + +V + P W S Sbjct: 544 LGRGTFTYIGKVDEVDAKISALLSKIQYPVLTDIQVRYDDGSVPDYWPS 592 >UniRef50_A3QDW1 Vault protein inter-alpha-trypsin domain protein n=1 Tax=Shewanella loihica PV-4 RepID=A3QDW1_SHELP Length = 776 Score = 98.9 bits (245), Expect = 1e-19, Method: Composition-based stats. Identities = 40/225 (17%), Positives = 69/225 (30%), Gaps = 28/225 (12%) Query: 5 ITFATSDFASNPEPRCPCILLLDVSGSMNGRPINELNAGLVTFRDELLADPLALKRVELG 64 + D A PR L++D SGSM G I + + ++ L + Sbjct: 388 MLMPPQDKARVRLPR-ELTLVIDTSGSMTGDSIAQAKSAILNALAGLGSQDT------FN 440 Query: 65 IVTFGPVHVEQPFTSA---------ANFFPPILFAQGDTPMGAAITKALDMVEERKREYR 115 ++ F + AN F L A G T M A+ +AL E Sbjct: 441 VIAFDSSVRSLSPVALSATAANLGKANLFVQSLEADGGTEMAPALLRALSQPESGVSSIS 500 Query: 116 ANGISYYRPWIFLITDGAPTDEWQAAANKVFRGEEDKRFAFFSIGVQGA-DMKTLAQIS- 173 + + ITDG + + +R F++G+ A + + + + Sbjct: 501 SAVKPERLKQVVFITDG-AVGNEASLFALIAANIGRQRL--FTVGIGAAPNGYFMERAAR 557 Query: 174 ---VRQPLPLQGLQFRELFSWLSSSLRSVSRSTPGTEVVLEAPKG 215 + + L + S S +V L G Sbjct: 558 AGRGTYTYVGKISEVDAKIGELLEKIESPQIS----DVTLTLDDG 598 >UniRef50_UPI0001AEB92D inter-alpha-trypsin inhibitor domain-containing protein n=1 Tax=Alteromonas macleodii ATCC 27126 RepID=UPI0001AEB92D Length = 586 Score = 98.9 bits (245), Expect = 1e-19, Method: Composition-based stats. Identities = 42/208 (20%), Positives = 65/208 (31%), Gaps = 29/208 (13%) Query: 21 PCILLLDVSGSMNGRPINELNAGLVTFRDELLADPLALKRVELGIVTFGPVHVEQPFTSA 80 ++D SGSM GRPI + L D L +V F TS Sbjct: 315 DITFVIDTSGSMGGRPIVDAKESLQLAIDRLSEKD------RFNVVAFNNDTTRLFETSV 368 Query: 81 ---------ANFFPPILFAQGDTPMGAAITKALDMVEERKREYRANGISYYRPWIFLITD 131 A F L A G T M A+ AL + + ITD Sbjct: 369 EGTTRNKQYARDFVKHLNAGGGTEMAPALNAALK----------RTTTKDFIKQVVFITD 418 Query: 132 GAPTDEWQAAANKVFRGEEDKRFAFFSIGVQGA-DMKTLAQISVRQPLPLQGLQFRELFS 190 G A +++ D R F++G+ A + + + + ++ Sbjct: 419 G-AVGNEAALFSQIKNELGDARL--FTVGIGSAPNSYFMTRAAQFGLGSYVFVRNTADIK 475 Query: 191 WLSSSLRSVSRSTPGTEVVLEAPKGWTS 218 SL S +++ L P G+ Sbjct: 476 QQMDSLLYKLESPVLSDLSLTLPAGYAQ 503 >UniRef50_C5WZE3 Putative uncharacterized protein Sb01g019910 n=1 Tax=Sorghum bicolor RepID=C5WZE3_SORBI Length = 704 Score = 98.5 bits (244), Expect = 1e-19, Method: Composition-based stats. Identities = 33/206 (16%), Positives = 60/206 (29%), Gaps = 44/206 (21%) Query: 17 EPRCPCILLLDVSGSMNGRPINELNAGLVTFRDELLADPLALKRVELGIVTFGPVH-VEQ 75 + +LDVS SM+G + L + + L L +V F Sbjct: 231 RAPLDLVTVLDVSRSMSGPKLALLKRAMRFVIENLEPSD------RLSVVAFSSSACRLF 284 Query: 76 PFTSAANF-------FPPILFAQGDTPMGAAITKALDMVEERKREYRANGISYYRPWIFL 128 P F L A G T + + KA +VE+R+ I L Sbjct: 285 PLRKMTAFGQQQSQQAVDSLVADGGTNIAEGLRKAARVVEDRQARNPVCS-------IIL 337 Query: 129 ITDG------------APTDEWQAAANKVFRGEEDKRFAFFSIGVQ-----GADMKTLAQ 171 ++DG AP ++ + + + G+ D + + Sbjct: 338 LSDGVDSHNLPPRDGSAPEPDYAPLVPRSILPGSEHHVPIHAFGLGMDHDHDHDSRAMHA 397 Query: 172 ISVRQPLPLQGLQFRELFSWLSSSLR 197 ++ + SS++ Sbjct: 398 VAQMSSGTFS------FIDMVGSSIQ 417 >UniRef50_C1D2W8 Putative uncharacterized protein n=1 Tax=Deinococcus deserti VCD115 RepID=C1D2W8_DEIDV Length = 418 Score = 98.5 bits (244), Expect = 1e-19, Method: Composition-based stats. Identities = 29/196 (14%), Positives = 57/196 (29%), Gaps = 26/196 (13%) Query: 8 ATSDFASNPEPRCPCILLLDVSGSMNGRPINELNAGLVTFRDELLADPLALKRVELGIVT 67 A + P ++D SGSM+G P+ + + D + +V Sbjct: 32 APVTTQVSQRPPLNLAFVIDRSGSMSGLPLQMAKQAAIAAVRQARPDD------RVSVVA 85 Query: 68 FGPVHVEQPFTSAANFFPPILFA------QGDTPMGAAITKALDMVEERKREYRANGISY 121 F + A ++ A +G T + + V + N Sbjct: 86 FDDRVDVIVPSQLATSREAVIQAIGTIDDRGSTNLHGGWLEGATQVAQHLTPGALNR--- 142 Query: 122 YRPWIFLITDGAPTDEWQA--AANKVFRGEEDKRFAFFSIGVQGA-DMKTLAQIS----V 174 + L++DG + RG ++ + +IG+ D + L I+ Sbjct: 143 ----VILLSDGQANVGVTDRREIARQVRGLTERGISTTTIGLGSHYDEELLLAIANAGDG 198 Query: 175 RQPLPLQGLQFRELFS 190 + F Sbjct: 199 NFEHVEDPSRLPTFFE 214 >UniRef50_Q2B7C3 Putative uncharacterized protein n=1 Tax=Bacillus sp. NRRL B-14911 RepID=Q2B7C3_9BACI Length = 920 Score = 98.5 bits (244), Expect = 1e-19, Method: Composition-based stats. Identities = 32/212 (15%), Positives = 63/212 (29%), Gaps = 25/212 (11%) Query: 15 NPEPRCPCILLLDVSGSMNGRPINELNAGLVTFRDELLADPLALKRVELGIVTFGPVHVE 74 P ++++D SGSM G + + L LG + F Sbjct: 398 KEMPSLGLMIVMDRSGSMAGSKLELAKEAAARSVELLREKDT------LGFIAFDDRPWV 451 Query: 75 Q----PFTSAANFFPPI--LFAQGDTPMGAAITKALDMVEERKREYRANGISYYRPWIFL 128 P + I + G T + ++ KA + +E K + R I L Sbjct: 452 IVETGPLEDKKDAVDKIGSVTPGGGTEIFTSLEKAYEELENLKLQ---------RKHIIL 502 Query: 129 ITDGAPTDEWQAAANKVFRGEEDKRFAFFSIGVQ-GADMKTLAQISV-RQPLPLQGLQFR 186 +TDG + ++ ++ + AD L +++ Sbjct: 503 LTDGQS--ARSTDYESMIETGKENNITLSTVALGSDADRNLLEELAGLGAGRFYDVTDSS 560 Query: 187 ELFSWLSSSLRSVSRSTPGTEVVLEAPKGWTS 218 + S LS +R+ + + + Sbjct: 561 VIPSILSRETVMATRTYIEDNPFYPSLRPYPE 592 Score = 42.3 bits (98), Expect = 0.014, Method: Composition-based stats. Identities = 22/136 (16%), Positives = 41/136 (30%), Gaps = 21/136 (15%) Query: 16 PEPRCPCILLLDVSGSMNGRPINELNAGLVTFRDELLADPLALKRVELGIVTFGPVHVEQ 75 P P + + D S S+ GR L + + +++ G + Sbjct: 60 PAPGKTVVFIADRSASVQGREGELL-DFIDAGIQSKGKEDS------YAVISAGETAAAE 112 Query: 76 PFTSAANFFPPILFAQ---GDTPMGAAITKALDMVEERKREYRANGISYYRPWIFLITDG 132 ++ G+T + A I A ++ E I L++DG Sbjct: 113 SSLASMKGEFREFSTDTGKGETNLEAGIQLASTLMPEETPGR-----------IVLLSDG 161 Query: 133 APTDEWQAAANKVFRG 148 T A K+ + Sbjct: 162 RETAGSSREAAKLLKN 177 >UniRef50_Q23FU3 von Willebrand factor type A domain containing protein n=1 Tax=Tetrahymena thermophila SB210 RepID=Q23FU3_TETTH Length = 755 Score = 98.5 bits (244), Expect = 1e-19, Method: Composition-based stats. Identities = 32/195 (16%), Positives = 62/195 (31%), Gaps = 28/195 (14%) Query: 19 RCPCILLLDVSGSMNGRPINELNAGLVTFRDELLADPLALKRVELGIVTFGP-VHVEQPF 77 I L+D SGSM G+ + L L ++ +V+F P Sbjct: 47 PVDIICLIDNSGSMAGKKAQLVRKSLKYLLKILEKGD------QISLVSFSSTAKTLCPL 100 Query: 78 TSAANFFPPILFA-------QGDTPMGAAITKALDMVEERKREYRANGISYYRPWIFLIT 130 T + + + QG T + + ++ RK + +I L+T Sbjct: 101 TQVNDENKQQIKSAIKQINGQGGTFVIPGFKEVTKILNSRKE-------QREQTFILLLT 153 Query: 131 DGAPTD-EWQAAANKVFRGEEDKRFA----FFSIGVQ-GADMKTLAQISVRQPLPL-QGL 183 DG D + + R ++ G + + L +I+ + Sbjct: 154 DGEFGDIDSGKVIQNINRLFTQSEIQKTPYIYTYGYGDDVNPEILQEIAQKFQGKYCLIS 213 Query: 184 QFRELFSWLSSSLRS 198 +++ W S+ S Sbjct: 214 NVQQVTDWFLLSVSS 228 >UniRef50_A8SU73 Putative uncharacterized protein n=1 Tax=Coprococcus eutactus ATCC 27759 RepID=A8SU73_9FIRM Length = 550 Score = 98.5 bits (244), Expect = 2e-19, Method: Composition-based stats. Identities = 34/185 (18%), Positives = 67/185 (36%), Gaps = 22/185 (11%) Query: 23 ILLLDVSGSMNGRPINELNAGLVTFRDELLADPLALKRVELGIVTFGPVHVEQPFTSAAN 82 + + D SGSM+G P+N+L L + + +G+V++ + + + Sbjct: 377 VFVADCSGSMDGDPMNQLKNSLTNGAQYINDNN------YVGLVSYSNSVTIEVPIAQFD 430 Query: 83 FF--------PPILFAQGDTPMGAAITKALDMVEERKREYRANGISYYRPWIFLITDGAP 134 L A G T A+ A+ M+ E K + +FL++DG Sbjct: 431 LNQRSYFQGAVNNLIASGGTASYDAVVVAVKMITEAKA-----QHPDAKCMLFLLSDGYA 485 Query: 135 TDEWQAAANKVFRGEEDKRFAFFSIGVQ-GADMKTLAQISVRQPLPLQGLQFRELFSWLS 193 + + +++ ++IG AD LA++S ++ + Sbjct: 486 NNGYS--MDEITSALRTSGIPVYTIGYGDDADTGELARLSGINEAASINADSDDIIYKIK 543 Query: 194 SSLRS 198 S S Sbjct: 544 SLFNS 548 >UniRef50_UPI00016E1D58 UPI00016E1D58 related cluster n=1 Tax=Takifugu rubripes RepID=UPI00016E1D58 Length = 451 Score = 98.1 bits (243), Expect = 2e-19, Method: Composition-based stats. Identities = 43/188 (22%), Positives = 75/188 (39%), Gaps = 19/188 (10%) Query: 21 PCILLLDVSGSMNGRPINELNAGLVTFRDELLADPLALKRVELGIVTFGPVHVEQ----- 75 + ++D SGS+ + + L + L +A RV +GIV + + Q Sbjct: 1 DIVFIIDESGSIGSANFQLMRSFLHSLISGLQ---VASNRVRVGIVMYNVEPMAQVFLNT 57 Query: 76 --PFTSAANFFPPILFAQGDTPMGAAITKALDMVEERKREYRANGISYYRPWIFLITDGA 133 + +F + + G T GAA+ L V ++R R + + +ITDG Sbjct: 58 FKDKSELLDFIKILPYHGGGTNTGAALNFTLQEVFIKQRGSRKD--LGVQQVAVVITDGK 115 Query: 134 PTDEWQAAANKVFRGEEDKRFAFFSIGVQGADMKTLAQISV--RQPLPLQGLQFRELFSW 191 DE + A + R +++GV+ AD L QI+ F +L Sbjct: 116 SQDEVSSPAANLRRA----GVTVYAVGVKDADKAQLDQIASYPTNKHTFIIDSFTKL-KT 170 Query: 192 LSSSLRSV 199 L +SL+ + Sbjct: 171 LEASLQRI 178 Score = 94.7 bits (234), Expect = 2e-18, Method: Composition-based stats. Identities = 35/175 (20%), Positives = 60/175 (34%), Gaps = 19/175 (10%) Query: 21 PCILLLDVSGSMNGRPINELNAGLVTFRDELLADPLALKRVELGIVTFGPV-HVEQPFT- 78 I L+D SGS+ ++ + + + V +G++ + + + P Sbjct: 232 DLIFLIDSSGSIYPEDYKKMKDFMKSVI---KQSIVGKNEVHVGVMQYSTIQKLVFPLNQ 288 Query: 79 -----SAANFFPPILFAQGDTPMGAAITKALDMVEERKREYRANGISYYRPWIFLITDGA 133 + + G T G AIT + R G + + ++TDG Sbjct: 289 YYTKDELSKAIDEMQQIGGGTHTGEAITDVSQYFD-----ARNGGRPDLKQRLVVVTDGE 343 Query: 134 PTDEWQAAANKVFRGEEDKRFAFFSIGVQGADMKTLAQISVRQPLPLQGLQFREL 188 DE + A + K +SIGV A+ L +IS G F L Sbjct: 344 SQDEVRQPAEALRA----KGVIVYSIGVVAANTSQLLEISGTPNRMYAGRDFDAL 394 >UniRef50_C3ZCS4 Putative uncharacterized protein n=1 Tax=Branchiostoma floridae RepID=C3ZCS4_BRAFL Length = 949 Score = 98.1 bits (243), Expect = 2e-19, Method: Composition-based stats. Identities = 34/189 (17%), Positives = 69/189 (36%), Gaps = 25/189 (13%) Query: 19 RCPCILLLDVSGSMNGRPINELNAGLVTFRDELLADPLALKRVELGIVTFGPVH------ 72 + +L+LD SGS+ + + + +G+V + Sbjct: 26 KLDLMLVLDGSGSVGDADFAKTLEFAENVVNAF---DIGTDLTRVGVVQYSDTPTMEFNL 82 Query: 73 -VEQPFTSAANFFPPILFAQGDTPMGAAITKALDMVEERKREYRANGISYYRPWIFLITD 131 V S I + G T GAA+ A R + + ++TD Sbjct: 83 GVHADKGSTIAAVNNIQYQNGGTATGAALEFA--------RANANWRGAPVPKVMIVVTD 134 Query: 132 GAPTDEWQAAANKVFRGEEDKRFAFFSIGVQGADMKTLAQISVRQPLPLQGLQFRELFSW 191 G D+ AAA + + A ++IGV D+ L QI+ ++ ++ ++ Sbjct: 135 GKSGDDVTAAAQAL----AGEGVAVYAIGVGNYDLPELQQIANGNNN--NVIELQD-YNA 187 Query: 192 LSSSLRSVS 200 L++++ ++ Sbjct: 188 LTAAIDQIA 196 Score = 70.0 bits (170), Expect = 6e-11, Method: Composition-based stats. Identities = 28/181 (15%), Positives = 53/181 (29%), Gaps = 27/181 (14%) Query: 34 GRPINELNAGLVTFRDELLADPLALKRVELGIVTFGPVHV-------EQPFTSAANFFPP 86 G N + + L +G++ + T+ N Sbjct: 698 GLQWNPVKWTMYRLLVTL------SVASAVGVIQYSSTVQEEFSLNAHFTKTAVLNAIDN 751 Query: 87 ILFAQGDTPMGAAITKALDMVEERKREYRANGISYYRPWIFLITDGAPTDEWQAAANKVF 146 I++ G T GAAIT + + ++TDG +D+ ++ Sbjct: 752 IVYMGGGTLTGAAITY---------MKDNSQWRPGVAKIAIVVTDGKSSDDVGPPSSAAQ 802 Query: 147 RGEEDKRFAFFSIGVQ-GADMKTLAQISVRQPLPLQGLQFRELFSWLSSSLRSVSRSTPG 205 + +IGV D L+QI+ + L + ++ SV Sbjct: 803 Q----TGITMHAIGVGANVDQTELSQIASTSQYVTTVADYDALDAQMAQLTASVCDGAND 858 Query: 206 T 206 Sbjct: 859 Q 859 Score = 63.8 bits (154), Expect = 3e-09, Method: Composition-based stats. Identities = 25/146 (17%), Positives = 49/146 (33%), Gaps = 21/146 (14%) Query: 62 ELGIVTFGPVHV-------EQPFTSAANFFPPILFAQGDTPMGAAITKALDMVEERKREY 114 ++G++ + T+ N I++ G T G AIT + Sbjct: 366 QIGVIQYSSTVQEEFSLNAHFTKTAVLNAIDNIVYMGGGTLTGTAITY---------MKD 416 Query: 115 RANGISYYRPWIFLITDGAPTDEWQAAANKVFRGEEDKRFAFFSIGVQ-GADMKTLAQIS 173 + ++TDG +D+ A ++ + +IGV D L+QI+ Sbjct: 417 NSQWRPNVAKIAIVVTDGKSSDDVAAPSS----AAQQAGITMHAIGVGANVDQTELSQIA 472 Query: 174 VRQPLPLQGLQFRELFSWLSSSLRSV 199 + L + ++ SV Sbjct: 473 STSQYVTNVADYDALDAQMAQLTASV 498 Score = 60.8 bits (146), Expect = 3e-08, Method: Composition-based stats. Identities = 15/94 (15%), Positives = 32/94 (34%), Gaps = 10/94 (10%) Query: 17 EPRCPCILLLDVSGSMNGRPINELNAGLVTFRDELLADPLALKRVELGIVTFGPVHVEQ- 75 +LD SGS+ G +++ + ++ +G+V + + + Sbjct: 510 NAPLDLFFVLDGSGSVTGANFDKVKQFTKNVVNAF---DISATATRVGVVQYSDSNTLEF 566 Query: 76 ------PFTSAANFFPPILFAQGDTPMGAAITKA 103 S I++ G T G+A+ A Sbjct: 567 NLGDHADKPSTLAAIDSIVYQGGGTTTGSALEFA 600 >UniRef50_UPI00017B1702 UPI00017B1702 related cluster n=1 Tax=Tetraodon nigroviridis RepID=UPI00017B1702 Length = 455 Score = 98.1 bits (243), Expect = 2e-19, Method: Composition-based stats. Identities = 31/166 (18%), Positives = 58/166 (34%), Gaps = 17/166 (10%) Query: 17 EPRCPCILLLDVSGSMNGRPINELNAGLVTFRDELLADPLALKRVELGIVTFGPVHVEQP 76 + ++D S S+ ++ L D L + + +V + V+ Sbjct: 61 SRPLDLVFIIDSSRSVRPSEFEKVKIFLADMVDTL---DVGADATRVAVVNYAST-VKTE 116 Query: 77 FTSAANFFPPILFA--------QGDTPMGAAITKALDMVEERKREYRANGISYYRPWIFL 128 F +F P L T G AI A+ + R + + I + Sbjct: 117 FLLKDHFNKPNLKKAISRIEPLATGTMTGLAIKTAVSEAFTEQSGARPRPRNIAKVAIIV 176 Query: 129 ITDGAPTDEWQAAANKVFRGEEDKRFAFFSIGVQGADMKTLAQISV 174 TDG P D+ + +V +++GV ADM++L ++ Sbjct: 177 -TDGRPQDQVE----EVSAAARASGVEIYAVGVDRADMRSLQLMAS 217 >UniRef50_Q4T9U6 Chromosome 14 SCAF7491, whole genome shotgun sequence. (Fragment) n=1 Tax=Tetraodon nigroviridis RepID=Q4T9U6_TETNG Length = 443 Score = 98.1 bits (243), Expect = 2e-19, Method: Composition-based stats. Identities = 31/166 (18%), Positives = 58/166 (34%), Gaps = 17/166 (10%) Query: 17 EPRCPCILLLDVSGSMNGRPINELNAGLVTFRDELLADPLALKRVELGIVTFGPVHVEQP 76 + ++D S S+ ++ L D L + + +V + V+ Sbjct: 218 SRPLDLVFIIDSSRSVRPSEFEKVKIFLADMVDTL---DVGADATRVAVVNYAST-VKTE 273 Query: 77 FTSAANFFPPILFA--------QGDTPMGAAITKALDMVEERKREYRANGISYYRPWIFL 128 F +F P L T G AI A+ + R + + I + Sbjct: 274 FLLKDHFNKPNLKKAISRIEPLATGTMTGLAIKTAVSEAFTEQSGARPRPRNIAKVAIIV 333 Query: 129 ITDGAPTDEWQAAANKVFRGEEDKRFAFFSIGVQGADMKTLAQISV 174 TDG P D+ + +V +++GV ADM++L ++ Sbjct: 334 -TDGRPQDQVE----EVSAAARASGVEIYAVGVDRADMRSLQLMAS 374 >UniRef50_Q4S2X7 Chromosome 8 SCAF14759, whole genome shotgun sequence n=3 Tax=Euteleostomi RepID=Q4S2X7_TETNG Length = 647 Score = 98.1 bits (243), Expect = 2e-19, Method: Composition-based stats. Identities = 39/186 (20%), Positives = 62/186 (33%), Gaps = 21/186 (11%) Query: 3 EQITFATSDFASNPEP--RCPCILLLDVSGSMNGRPINELNAGLVTFRDELLADPLALKR 60 + T A S NP + ++D S S+ R ++ +V L + + Sbjct: 34 QNETTAQSRAVGNPCKAVPLDFVFVIDSSRSIRPRDYEKVKTFIVNLVQFL---EVGPEA 90 Query: 61 VELGIVTFGPVHVEQP---------FTSAANFFPPILFAQGDTPMGAAITKALDMVEERK 111 +G++ +G V QP + T G AI A + Sbjct: 91 TRVGLLQYGSV--VQPEFSLSTFSTKAEVEQAVRNMKHLATGTMTGLAIQYAAETSFTEA 148 Query: 112 REYRANGISYYRPWIFLITDGAPTDEWQAAANKVFRGEEDKRFAFFSIGVQGADMKTLAQ 171 R + R + + TDG P D + A + + F+IGV DMKTL Sbjct: 149 DGARPAHLHIPRIAVVV-TDGRPQDRVEEVAAQARQA----GIQIFAIGVGRVDMKTLKT 203 Query: 172 ISVRQP 177 I Sbjct: 204 IGSEPH 209 Score = 78.9 bits (193), Expect = 1e-13, Method: Composition-based stats. Identities = 24/197 (12%), Positives = 57/197 (28%), Gaps = 36/197 (18%) Query: 20 CPCILLLDVSGSMNGRPINELNAGLVTFRDELLADPLALKRVELGIVTFG-PVHVEQPFT 78 + ++D S SM + +++ + L ++ +G++ + V E + Sbjct: 406 MDLVFMIDGSKSMGPANFERVKQFVISIVESL---DVSPTGAHVGLLQYSTNVRTEFTLS 462 Query: 79 S------AANFFPPILFAQGDTPMGAAITKALDMVEERKREYRANGISYYRPWIFLITDG 132 + + + G+A+ + + R N + TDG Sbjct: 463 QHTSAQGIRQAVSRMQYMGRGSMTGSALRRMFQSSFSAEEGARPNVPR----VSVVFTDG 518 Query: 133 APTDEWQAAANKVFRGEEDK--------------------RFAFFSIGVQGADMKTLAQI 172 D+ A K +++GV A + L +I Sbjct: 519 RSQDDASEWAKKAKNSGIPGSFSYFGGTGRFLSCSFLLVLGVTIYAVGVGKAIEQELREI 578 Query: 173 SVRQP--LPLQGLQFRE 187 + +F++ Sbjct: 579 ASEPEEKHLYYAQEFKD 595 >UniRef50_A0P2E4 Von Willebrand factor type A like domain n=1 Tax=Labrenzia aggregata IAM 12614 RepID=A0P2E4_9RHOB Length = 772 Score = 98.1 bits (243), Expect = 2e-19, Method: Composition-based stats. Identities = 31/187 (16%), Positives = 55/187 (29%), Gaps = 17/187 (9%) Query: 5 ITFATSDFASNPEPRCPCILLLDVSGSMNGRPINELNAGLVTFRDELLADPLALKRVELG 64 + A + + + +LD SGSM+G+PI + L D Sbjct: 349 LIEPPKLPAEDMIGQRELVFVLDTSGSMSGQPIEASKTFMTAAIKALRPDDYFRILHFSN 408 Query: 65 IVT-FGPVHVEQPFTSAANF--FPPILFAQGDTPMGAAITKALDMVEERKREYRANGISY 121 + F V + F L A G T + A+ A D + Sbjct: 409 DTSQFAGQAVLATERNKQKALKFVADLSAGGGTEINQAVNAAFDQAQPDN---------- 458 Query: 122 YRPWIFLITDGAPTDEWQAAANKVFRGEEDKRFAFFSIGVQG-ADMKTLAQISVRQPLPL 180 + +TDG DE + + ++ GV + L ++ Sbjct: 459 TTRIVVFLTDGYIGDE-ATVIKSI--ANRIGKARIYAFGVGNSVNRFLLDAMATEGRGYA 515 Query: 181 QGLQFRE 187 + + E Sbjct: 516 RYVALGE 522 >UniRef50_Q90ZA0 Collagen type XX alpha 1 n=11 Tax=cellular organisms RepID=Q90ZA0_CHICK Length = 1472 Score = 97.7 bits (242), Expect = 2e-19, Method: Composition-based stats. Identities = 42/189 (22%), Positives = 70/189 (37%), Gaps = 20/189 (10%) Query: 20 CPCILLLDVSGSMNGRPINELNAGLVTFRDELLADPLALKRVELGIVTFGPVHVEQPFTS 79 +LL+D S S+ + L +A ++ +G+ + + S Sbjct: 240 TDIVLLVDGSWSIGRSNFKLIKEFLSALISPFN---IAQDKIRVGLSQYSSDPRTEWDLS 296 Query: 80 AANFFPPILFA-------QGDTPMGAAITKALDMVEERKREYRANGISYYRPWIFLITDG 132 A +L A G+T G A+T L+ + R + L+TDG Sbjct: 297 AYATRDQVLEAVRNLRYKGGNTFTGLALTHVLEQNLKPDAGARLEAEK----LVILLTDG 352 Query: 133 APTDEWQAAANKVFRGEEDKRFAFFSIGVQGADMKTLAQISVRQPLPLQGLQFRELFSWL 192 D+ AA + F+IGV+ AD L Q++ PL L + F L Sbjct: 353 KSQDDANLAAQTLKNM----GIEIFAIGVKNADEAELKQVASE-PLELTVYNVLD-FPLL 406 Query: 193 SSSLRSVSR 201 SS + ++R Sbjct: 407 SSLVGRLTR 415 >UniRef50_A4J6Q3 von Willebrand factor, type A n=1 Tax=Desulfotomaculum reducens MI-1 RepID=A4J6Q3_DESRM Length = 416 Score = 97.7 bits (242), Expect = 2e-19, Method: Composition-based stats. Identities = 33/218 (15%), Positives = 64/218 (29%), Gaps = 20/218 (9%) Query: 13 ASNPEPRCPCILLLDVSGSMNGRPINELNAGLVTFRDELLADPLALKRVELGIVTFGPVH 72 P ++D SGSM G ++ + L +VT Sbjct: 35 VEKERPVQNLSFVIDRSGSMAGEKLDYTKKAVAFAVGHLSPQDYCSVVAFDDMVTMVASS 94 Query: 73 VEQPFTSAANFFPPILFAQGDTPMGAAITKALDMVEERKREYRANGISYYRPWIFLITDG 132 + A ++ G T + + + V+ +E + N + L+TDG Sbjct: 95 HQVANKDALKMAVESIYPGGSTNLSGGMLLGVREVKLAHKENQINR-------VLLLTDG 147 Query: 133 APTDEWQ--AAANKVFRGEEDKRFAFFSIGVQ-GADMKTLAQIS----VRQPLPLQGLQF 185 +A + R + G+ + L + + Q Sbjct: 148 MANVGVTDHSALVEKSREMAAGGVNLSTFGLGEDFEEDLLQAMVEAGGGNFYYIEKPDQI 207 Query: 186 RELF-SWLSSSLRSVSRS-----TPGTEVVLEAPKGWT 217 +F L+ L V+++ PG V + G+ Sbjct: 208 PGIFEQELTGLLSIVAQNLSVKVKPGQGVSITGVLGYP 245 >UniRef50_B6TZ81 Protein binding protein n=9 Tax=Magnoliophyta RepID=B6TZ81_MAIZE Length = 516 Score = 97.7 bits (242), Expect = 2e-19, Method: Composition-based stats. Identities = 31/165 (18%), Positives = 53/165 (32%), Gaps = 27/165 (16%) Query: 20 CPCILLLDVSGSMNGRPINELNAGLVTFRDELLADPLALKRVELGIVTF-GPVHVEQPFT 78 + +LDVSGSM G I ++ + +L + L IVTF + P Sbjct: 62 LDLVAVLDVSGSMQGEKIEKMKTAMKFVVKKLSSID------RLSIVTFLDTANRICPLQ 115 Query: 79 SAANFFPPI-------LFAQGDTPMGAAITKALDMVEERKREYRANGISYYRPWIFLITD 131 P L G+T + + L ++ +RK + L++D Sbjct: 116 QVTEDSQPQLLKLIDALQPGGNTNISDGLQTGLKVLADRKLSSGRVVG------VMLMSD 169 Query: 132 GAPTDEWQAAANKVFRGEEDKRFAFFSIGVQ-GADMKTLAQISVR 175 G AA + ++ G D L ++ Sbjct: 170 GQQNRGEPAA------NVKIGNVPVYTFGFGADYDPTVLNAVARN 208 >UniRef50_UPI000058940A PREDICTED: similar to inter-alpha (globulin) inhibitor H3 n=1 Tax=Strongylocentrotus purpuratus RepID=UPI000058940A Length = 964 Score = 97.7 bits (242), Expect = 2e-19, Method: Composition-based stats. Identities = 33/169 (19%), Positives = 60/169 (35%), Gaps = 9/169 (5%) Query: 19 RCPCILLLDVSGSMNGRPINELNAGLVTFRDELLADPLALKRVELGIVTFGPVH----VE 74 R I ++D+SGSM+G + ++ L T D++ V F Sbjct: 348 RKNIIFVIDISGSMSGTKLAQVKDALSTILDDMSETDKFNILPFSDDVHFLESTGMLYST 407 Query: 75 QPFTSAANFFPPILFAQGDTPMGAAITKALDMVEERKREYRANGISYYRPWIFLITDGAP 134 + A F L +T + AI ++M+ R + + ++TDG P Sbjct: 408 KENVRRAKRFVMGLQEMDNTNLHKAIISGVNML--RAESEQDPQEEEIVSMLIVLTDGNP 465 Query: 135 TDE--WQAAANKVFRGEEDKRFAFFSIGVQ-GADMKTLAQISVRQPLPL 180 + + + F+ F IG AD L ++S++ Sbjct: 466 NHGEIDKTIIERNVHEAINGDFSLFCIGFGADADYPFLRRLSLQNHGVA 514 >UniRef50_A9GVG2 Putative uncharacterized protein n=1 Tax=Sorangium cellulosum 'So ce 56' RepID=A9GVG2_SORC5 Length = 656 Score = 97.7 bits (242), Expect = 2e-19, Method: Composition-based stats. Identities = 37/213 (17%), Positives = 63/213 (29%), Gaps = 28/213 (13%) Query: 15 NPEPRCPCILLLDVSGSM-NGRPINELNAGLVTFRDELLADPLALKRVELGIVTF-GPVH 72 + L+D SGSM + I L D L + + T+ G V Sbjct: 294 KERTPVHLVYLVDTSGSMQSPDKIELAKKSLKMLTDTLKPGDT------VALCTYAGSVR 347 Query: 73 VEQPFTSAAN-----FFPPILFAQGDTPMGAAITKALDMVEERKREYRANGISYYRPWIF 127 T + L A G T M + I A + E + N + Sbjct: 348 EVLAPTGIESKGKILAALADLTAGGSTAMSSGIDLAYSLAERTLVKGHVNR-------VI 400 Query: 128 LITDGAPTDEWQA--AANKVFRGEEDKRFAFFSIGVQGADMK--TLAQIS----VRQPLP 179 +++DG + K + DK ++G + K + Q++ Sbjct: 401 VLSDGDANVGPTSHDEILKTIKRARDKGITLSTVGFGQGNYKDLMMEQLANQGDGNYAYI 460 Query: 180 LQGLQFRELFSWLSSSLRSVSRSTPGTEVVLEA 212 Q R +FS + V +V + Sbjct: 461 DSEAQARRVFSEQVGGMLQVIARDVKIQVEFDP 493 >UniRef50_A2AR69 Novel protein similar to vertebrate inter-alpha (Globulin) inhibitor H family (Plasma Kallikrein-sensitive glycoprotein) (ITIH) (Fragment) n=10 Tax=Clupeocephala RepID=A2AR69_DANRE Length = 860 Score = 97.7 bits (242), Expect = 3e-19, Method: Composition-based stats. Identities = 36/211 (17%), Positives = 67/211 (31%), Gaps = 18/211 (8%) Query: 16 PEPRCPCILLLDVSGSMNGRPINELNAGLVTFRDELLADPLALKRVELGIVTFG----PV 71 P I ++D+SGSM G I + A +V+ +L V V Sbjct: 285 PVVPKDVIFVIDISGSMIGTKIKQTKAAMVSILSDLREGDYFNLITFSDDVHTWKKDRTV 344 Query: 72 HVEQPFTSAANFFPPILFAQGDTPMG---AAITKALDMVEERKREYRANGISYYRPWIFL 128 + A F + A G T + + K L+ S P I Sbjct: 345 RATRQNVRDAKEFVRKIIAAGWTNINAALLSAAKLLNPSTRSSSSTGRAPSSQRVPMIIF 404 Query: 129 ITDGAPTDEWQAAANKVFRGEEDKR-FAFFSIGVQ-GADMKTLAQISVRQPLPLQ----- 181 +TDG T + ++ + F + AD L ++++ + Sbjct: 405 LTDGEATIGETETDVILHNAQKSLGLVSLFGLAFGDDADFPMLRRLALENRGVARMVYED 464 Query: 182 ---GLQFRELFSWLSS-SLRSVSRSTPGTEV 208 +Q + + +++ L + S +V Sbjct: 465 DDAAIQLKGFYDEVATPLLSDIQLSYLDDQV 495 >UniRef50_Q1DFU7 von Willebrand factor type A domain protein n=2 Tax=Cystobacterineae RepID=Q1DFU7_MYXXD Length = 422 Score = 97.7 bits (242), Expect = 3e-19, Method: Composition-based stats. Identities = 41/216 (18%), Positives = 69/216 (31%), Gaps = 33/216 (15%) Query: 6 TFATSDFASNPE---PRCPC--ILLLDVSGSMNGRPINELNAGLVTFRDELLADPLALKR 60 FA + + P R P L+LD SGSMNG+ + + L + Sbjct: 28 AFAWMELKARPAETGQRVPVSLALVLDRSGSMNGQKLADARRAATELVQRLKPED----- 82 Query: 61 VELGIVTFGPVHVEQPFTSAANFFPPI-------LFAQGDTPMGAAITKALDMVEERKRE 113 L + +G QP L G T + A+ A + + RE Sbjct: 83 -RLAFIDYGTDVRVQPSRRMTEEAREELLTLISGLQDDGSTNISGALDAAANALRPHMRE 141 Query: 114 YRANGISYYRPWIFLITDGAPTDEW--QAAANKVFRGEEDKRFAFFSIGVQ-GADMKTLA 170 YR + L++DG PT + R ++GV + Sbjct: 142 YRVSRA-------ILLSDGQPTTGIVSEPGLLDQVRQLRRDGITVSALGVGRDYQETLMR 194 Query: 171 QISVR----QPLPLQGLQFRELF-SWLSSSLRSVSR 201 ++ + + E+F L + +V+R Sbjct: 195 GMAEQGGGFSGFIDDSARLAEVFSRELDQATSTVAR 230 >UniRef50_UPI00006A0494 Matrilin-4 precursor. n=5 Tax=Euteleostomi RepID=UPI00006A0494 Length = 466 Score = 97.4 bits (241), Expect = 3e-19, Method: Composition-based stats. Identities = 33/184 (17%), Positives = 62/184 (33%), Gaps = 18/184 (9%) Query: 19 RCPCILLLDVSGSMNGRPINELNAGLVTFRDELLADPLALKRVELGIVTFGP-VHVEQPF 77 + ++D S S+ + ++ + L + L +G+V + V Sbjct: 5 PMDLVFIIDSSRSVRPFEFETMRKFMIDIINSL---EVGLSTTRVGVVQYSSQVQTVFSL 61 Query: 78 ------TSAANFFPPILFAQGDTPMGAAITKALDMVEERKREYRANGISYYRPWIFLITD 131 + I+ T G AI A+++ + R + R I + TD Sbjct: 62 KTFSNKSDMEKAINEIIPLAQGTMTGLAIQYAMNVAFTEEEGARPLSKNIPRVAIIV-TD 120 Query: 132 GAPTDEWQAAANKVFRGEEDKRFAFFSIGVQGADMKTLAQISV--RQPLPLQGLQFRELF 189 G P D A + +++GVQ AD+ +L ++ F +L Sbjct: 121 GRPQDRVTEVAVQAREA----GIEIYAVGVQRADVSSLRAMASHPLDDHVFHVESF-DLI 175 Query: 190 SWLS 193 LS Sbjct: 176 QHLS 179 Score = 90.0 bits (222), Expect = 5e-17, Method: Composition-based stats. Identities = 25/178 (14%), Positives = 58/178 (32%), Gaps = 20/178 (11%) Query: 20 CPCILLLDVSGSMNGRPINELNAGLVTFRDELLADPLALKRVELGIVTFGP-VHVEQPFT 78 + ++D S S+ + + ++ D ++ + +G+V + V E P + Sbjct: 235 IDLVFVIDGSKSVRPQNFELVKEFVINIVDS---SAISAQGTHIGLVQYSSRVRTEFPLS 291 Query: 79 ------SAANFFPPILFAQGDTPMGAAITKALDMVEERKREYRANGISYYRPWIFLITDG 132 I + + T G A+ ++ R N + TDG Sbjct: 292 QYTNGQDIKTAVKNIQYMEKGTMTGLALKHMVEQSFSEAEGARKNVPKIG----LVFTDG 347 Query: 133 APTDEWQAAANKVFRGEEDKRFAFFSIGVQGADMKTLAQISVRQPLP--LQGLQFREL 188 D+ ++ + ++ +++GV A L +I+ F + Sbjct: 348 RSQDD----ISEWAKKAKEAGITMYAVGVGKAVEDELNEIASDPVNKHSFYTADFSTM 401 >UniRef50_Q2BJ22 Putative uncharacterized protein n=1 Tax=Neptuniibacter caesariensis RepID=Q2BJ22_9GAMM Length = 445 Score = 97.4 bits (241), Expect = 3e-19, Method: Composition-based stats. Identities = 28/185 (15%), Positives = 64/185 (34%), Gaps = 26/185 (14%) Query: 19 RCPCILLLDVSGSMNGRPINELNAGLVTFRDELLADPLALKRVELGIVTFGP-VHVEQPF 77 ++LD SGSM G + + + L + + + +V++ V+V P Sbjct: 69 PANIAIVLDKSGSMQGDKLFRAKEAAIMAINRLSQNDI------VSVVSYDSRVNVVVPA 122 Query: 78 TSAANFF-----PPILFAQGDTPMGAAITKALDMVEERKREYRANGISYYRPWIFLITDG 132 T ++ + A G+T + A ++K + + + + N + L++DG Sbjct: 123 TKVSDTNTIARAINRIQANGNTALFAGVSKGANELRKFLDLNKVNR-------VILLSDG 175 Query: 133 APTDEWQAA--ANKVFRGEEDKRFAFFSIGVQ-GADMKTLAQIS----VRQPLPLQGLQF 185 K+ + + +IG+ G + + Q++ Sbjct: 176 LANIGPSTPNELGKLGLSLAKEGMSVTTIGLGLGYNEDLMTQLAGFSDGNHAFVENADDL 235 Query: 186 RELFS 190 +F Sbjct: 236 ARVFQ 240 >UniRef50_B8CPU4 Von Willebrand factor, type A n=1 Tax=Shewanella piezotolerans WP3 RepID=B8CPU4_SHEPW Length = 710 Score = 97.4 bits (241), Expect = 3e-19, Method: Composition-based stats. Identities = 35/174 (20%), Positives = 63/174 (36%), Gaps = 16/174 (9%) Query: 5 ITFATSDFASNPEPRCPCILLLDVSGSMNGRPINELNAGLVTFRDELL-ADPLALKRVEL 63 + + IL++D SGSM+G I + A ++ L D + + Sbjct: 334 LLPPQDKMRLSALAPRELILVIDTSGSMSGEAIEQAKASIIYALAGLSAQDSFNILQFNS 393 Query: 64 GIVTFGPVHVEQPFTSA--ANFFPPILFAQGDTPMGAAITKALDMVEERKREYRANGISY 121 + + + A + L A G T M A+ KAL + + R Sbjct: 394 NVYALSDTPLNASAKNIGRAQAYVQRLQANGGTEMSLALDKALSQQDANRERLRQ----- 448 Query: 122 YRPWIFLITDGAPTDEWQAAANKVFRGEEDKRFAFFSIGVQGA-DMKTLAQISV 174 + ITDGA +E Q ++ + R F+IG+ A + + + + Sbjct: 449 ----VLFITDGAVGNEPQ-LFTQIRNQLQQSRL--FTIGIGDAPNAHFMQRAAE 495 >UniRef50_UPI000180D037 PREDICTED: similar to polydomain protein-like n=1 Tax=Ciona intestinalis RepID=UPI000180D037 Length = 3908 Score = 97.4 bits (241), Expect = 3e-19, Method: Composition-based stats. Identities = 28/162 (17%), Positives = 49/162 (30%), Gaps = 18/162 (11%) Query: 20 CPCILLLDVSGSMNGRPINELNAGLVTFRDELLADPLALKRVELGIVTFGPVHVEQPF-- 77 I ++D S S+ + L + + +V + + Sbjct: 3424 TDVIFIVDGSWSVGEINFRKAKDFLKALVE---PFEVGWDNSRFAVVQYSDDPRTEFLMN 3480 Query: 78 -----TSAANFFPPILFAQGDTPMGAAITKALDMVEERKREYRANGISYYRPWIFLITDG 132 T N I + G+T G A+ +L R Y ++TDG Sbjct: 3481 EHFTVTDVLNAIDAIPYKGGNTNTGKALAFSLYTALSPANGARP----YVNKVALVLTDG 3536 Query: 133 APTDEWQAAANKVFRGEEDKRFAFFSIGVQGADMKTLAQISV 174 DE A ++ + ++GV AD L I+ Sbjct: 3537 RSQDEVGNPARELRQA----GVKVLTVGVGDADKNELKSIAS 3574 Score = 71.9 bits (175), Expect = 1e-11, Method: Composition-based stats. Identities = 33/210 (15%), Positives = 60/210 (28%), Gaps = 22/210 (10%) Query: 15 NPEPRCPCILLLDVSGSMNGRPINELNAGLVTFRDELLADPLALKRVELGIVTFGPVHV- 73 + ++L D S S+ + ++ A LV+ D + V +G + Sbjct: 394 KKAQKTDLVVLTDGSWSVGPQNFKKIQAFLVSLVDAFS---IGFNNVLMGYAQYSDDART 450 Query: 74 ------EQPFTSAANFFPPILFAQGDTPMGAAITKALDMVEERKREYRANGISYYRPWIF 127 + + G+T G A+ + + R + Sbjct: 451 EFNLNEHVTKDDLIRAINQVQYKGGNTATGGALDYIRTNLFTSEGGTRRGVLKTA----I 506 Query: 128 LITDGAPT-DEWQAAANKVFRGEEDKRFAFFSIGVQGADMKTLAQISVRQP--LPLQGLQ 184 +ITDG D+ A + FSIGV A L I+ Sbjct: 507 VITDGESILDDVTEPARMLKEI----GVEVFSIGVAAALRSELEDIASSPASDHVFSVDN 562 Query: 185 FRELFSWLSSSLRSVSRSTPGTEVVLEAPK 214 F + + + L + E P+ Sbjct: 563 FDD-IKNIKNILLKETCKAVAVCRNFEPPE 591 >UniRef50_Q67LZ3 Putative uncharacterized protein n=1 Tax=Symbiobacterium thermophilum RepID=Q67LZ3_SYMTH Length = 414 Score = 97.4 bits (241), Expect = 3e-19, Method: Composition-based stats. Identities = 35/197 (17%), Positives = 56/197 (28%), Gaps = 28/197 (14%) Query: 8 ATSDFASNPEPRCPCILLLDVSGSMNGRPINELNAGLVTFRDELLADPLALKRVELGIVT 67 A A P ++D SGSM G + L D++ + L IVT Sbjct: 31 APRMPAPEGRPPLNLAAVVDRSGSMAGAALYFTKQALRFLVDQMAEED------RLAIVT 84 Query: 68 FGPVHVEQPF-------TSAANFFPPILFAQGDTPMGAAITKALDMVEERKREYRANGIS 120 + V PF A + A G T + + + + R + Sbjct: 85 YDD-QVHVPFPSQPVVQKDAVRLLVDGITAGGTTNLSGGLATGMQQIRPHAGPGRVSR-- 141 Query: 121 YYRPWIFLITDGAPTDEWQAA--ANKVFRGEEDKRFAFFSIGVQGA-DMKTLAQIS---- 173 + L+TDG R +K A ++GV L ++ Sbjct: 142 -----VLLMTDGLANVGVTDPDVLAGWARAWREKGLAVSTMGVGPHFSEDLLVALAEAGG 196 Query: 174 VRQPLPLQGLQFRELFS 190 Q +F Sbjct: 197 GNFHYIANPDQIPRIFQ 213 >UniRef50_UPI0001C34E55 hypothetical protein ClM62_13922 n=1 Tax=Clostridium sp. M62/1 RepID=UPI0001C34E55 Length = 466 Score = 97.4 bits (241), Expect = 3e-19, Method: Composition-based stats. Identities = 32/185 (17%), Positives = 63/185 (34%), Gaps = 23/185 (12%) Query: 21 PCILLLDVSGSMNGRPINELNAGLVTFRDELLADPLALK----RVELGIVTFGPVHVE-- 74 + +D S M G + G+ F + L + + +G+V+F Sbjct: 39 DIVFAIDRSAKMEGSALEAAKKGIKAFIETLERESAQPEGYAGEKRVGLVSFSDTATVNS 98 Query: 75 --QPFTSAANFFPPILFAQGDTPMGAAITKALDMVEERKREYRANGISYYRPWIFLITDG 132 P A L A G + AI A+ +++ + + +FLITDG Sbjct: 99 MLSPVVEQAARAAEGLTAGGKSNQAEAIRAAVKLLDMKTPGEK---------MLFLITDG 149 Query: 133 APTDEWQAAANKVFRGEEDKRFAFFSIGV---QGADMKTLAQISVRQPLPLQGLQFRELF 189 +++ + + IG+ G + + L + P ++ REL Sbjct: 150 QT--PFRSQTDSAAAEARQAGVTVYCIGIAAPDGVNREALRSWASG-PSDSHIIEIRELG 206 Query: 190 SWLSS 194 ++ Sbjct: 207 EAQTA 211 >UniRef50_A6GC99 Putative uncharacterized protein n=1 Tax=Plesiocystis pacifica SIR-1 RepID=A6GC99_9DELT Length = 546 Score = 97.4 bits (241), Expect = 3e-19, Method: Composition-based stats. Identities = 28/226 (12%), Positives = 60/226 (26%), Gaps = 25/226 (11%) Query: 8 ATSDFASNPEPRCPCILLLDVSGSMNGRPINELNAGLVTFRDELLADPLALKRVELGIVT 67 A + P P ++LD SGSM G + + + L + +++ Sbjct: 119 ADDEAGQGPRPGLDLAIVLDRSGSMGGDKLRFAKQAGLDLVNRLDEQD------RVTLIS 172 Query: 68 FGPVHV-EQPFTSAANFFPPIL-------FAQGDTPMGAAITKALDMV---EERKREYRA 116 + + +L G T +G A+ L + E + R Sbjct: 173 YDDTVTPLSNLQRVDDDGIEVLRRQLLDIQVGGTTALGPALFMGLQRLAAPEPFGPQTRT 232 Query: 117 NGISYYRPWIFLITDGAPTDEWQ--AAANKVFRGEEDKRFAFFSIGVQ-GADMKTLAQIS 173 + L++DG + ++G+ + + +I+ Sbjct: 233 EARHDRLRHVILLSDGIANVGETRPEVIGGRVAEHFGGGVSVSTLGMGLDYNEDLMTRIA 292 Query: 174 ----VRQPLPLQGLQFRELF-SWLSSSLRSVSRSTPGTEVVLEAPK 214 R + L+ +V+ L Sbjct: 293 DEGGGRYHFIEDAESIPAMLGDELAGLTATVASEVDSVFATLPGTD 338 >UniRef50_D1CG77 von Willebrand factor type A; type II secretion system protein n=1 Tax=Thermobaculum terrenum ATCC BAA-798 RepID=D1CG77_THET1 Length = 643 Score = 97.4 bits (241), Expect = 3e-19, Method: Composition-based stats. Identities = 42/227 (18%), Positives = 73/227 (32%), Gaps = 35/227 (15%) Query: 2 SEQITFATSDFASNPEPRCPCILLLDVSGSMNGRPINELNAGLVTFRDELLADPLALKRV 61 EQ A F NP+P +L LD S SMN + L + Sbjct: 80 KEQSDIAVYPFYQNPDP-IDVVLALDTSASMNDDAFTAAQDAAYGLINGLSPED------ 132 Query: 62 ELGIVTFGPVHVEQPFTSAANFFP----PILFAQGDTPMGAAITKALDMVEERKREYRAN 117 ++G++TF + + L T + ++ A V + Sbjct: 133 KVGLITFDKTARVIEPLAQDHARVQESIQKLSRSVGTALYQGLSLAAQEVAK-------- 184 Query: 118 GISYYRPWIFLITDGAPTDEWQAAANKVFRGEEDKRFAFFSIGVQ-GADMKTLAQISVRQ 176 I L+TDG T + ++ + F++G D + L +I+ Sbjct: 185 --GQNTKAIVLMTDGFNT-SRNTTLEEAVAKAQEVGASVFTVGFGKKVDTQGLQKIANET 241 Query: 177 PLPL----QGLQFRELFSWLSS--------SLRSVSRSTPGTEVVLE 211 Q R +F+ +S S S + S G +V ++ Sbjct: 242 GGEYFSAPTNAQLRRVFADISQKLHQEYRLSYTSSTISKEGEKVKVD 288 >UniRef50_Q8NFW1 Collagen alpha-1(XXII) chain n=23 Tax=Euteleostomi RepID=COMA1_HUMAN Length = 1626 Score = 97.4 bits (241), Expect = 3e-19, Method: Composition-based stats. Identities = 28/177 (15%), Positives = 48/177 (27%), Gaps = 17/177 (9%) Query: 21 PCILLLDVSGSMNGRPINELNAGLVTFRDELLADPLALKRVELGIVTFGPVHV------E 74 + LLD S S+ ++ + D + R +G+V + Sbjct: 38 DLVFLLDTSSSVGKEDFEKVRQWVANLVDTF---EVGPDRTRVGVVRYSDRPTTAFELGL 94 Query: 75 QPFTSAANFFPPIL-FAQGDTPMGAAITKALDMVEERKREYRANGISYYRPWIFLITDGA 133 L + G+T G A+ R Y+ L+TDG Sbjct: 95 FGSQEEVKAAARRLAYHGGNTNTGDALRYITARSFSPHAGGRP-RDRAYKQVAILLTDGR 153 Query: 134 PTDEWQAAANKVFRGEEDKRFAFFSIGVQGADMKTLAQISVRQ--PLPLQGLQFREL 188 D F++GV A + L +I+ F + Sbjct: 154 SQD----LVLDAAAAAHRAGIRIFAVGVGEALKEELEEIASEPKSAHVFHVSDFNAI 206 >UniRef50_UPI00016E9735 UPI00016E9735 related cluster n=2 Tax=Takifugu rubripes RepID=UPI00016E9735 Length = 1617 Score = 97.4 bits (241), Expect = 3e-19, Method: Composition-based stats. Identities = 42/206 (20%), Positives = 70/206 (33%), Gaps = 22/206 (10%) Query: 20 CPCILLLDVSGSMNGRPINELNAGLVTFRDELLADPLALKRVELGIVTFGPVHVE----- 74 + L+D S S+ + + + + + V G+ FG V Sbjct: 7 ADIVFLVDESWSVGPSSFSHVKDFVSAIITSFKDSVVGSEGVRFGVTVFGDVPKMRIALT 66 Query: 75 --QPFTSAANFFPPILFAQGDTPMGAAITKALDMVEERKREYRANGISYYRPWI-FLITD 131 + + +G A+ + V + + P I LIT+ Sbjct: 67 DYSSLEEVLRAIRDLPYEGRSRRIGDALAFLVHQV------FSPAISRDHTPKIAVLITN 120 Query: 132 GAPTDEWQAAANKVFRGEEDKRFAFFSIGVQGADMKTLAQISV--RQPLPLQGLQFRELF 189 G D +AAA V D + F++GV GAD L +I R+ L G + L Sbjct: 121 GRSDDPVEAAARLV----ADNGISLFAVGVGGADESELRRIVSEPREEHLLLGTHYSALE 176 Query: 190 SWLSSSLRSV--SRSTPGTEVVLEAP 213 + L+ R V + S P V + Sbjct: 177 NILARLSRRVCITASEPPRPVKISPT 202 Score = 84.3 bits (207), Expect = 3e-15, Method: Composition-based stats. Identities = 26/181 (14%), Positives = 58/181 (32%), Gaps = 19/181 (10%) Query: 17 EPRCPCILLLDVSGSMNGRPINELNAGLVTFRDELLADPLALKRVELGIVTFGPVHVEQP 76 + + + L+D S S+ ++ + + + ++ +V + + Sbjct: 883 QSKADIVFLVDESSSIGPNNFVKIKDFIFRVATYFP--AIGPQTTQIAVVHYSDDPRIEF 940 Query: 77 FTSAANFFPPILFA-------QGDTPMGAAITKALDMVEERKREYRANGISYYRPWIFLI 129 + +L A G+T G I+ ++ + + R + LI Sbjct: 941 RLNDFKDRNSVLRALRGLRYVGGNTRTGKGISYVVEELFQESLGMRPEAAH----VLVLI 996 Query: 130 TDGAPTDEWQAAANKVFRGEEDKRFAFFSIGVQGADMKTLAQIS--VRQPLPLQGLQFRE 187 TDG D R + ++GV AD++ L +I+ F + Sbjct: 997 TDGRAQDNVVPP----SRIARALGVSVLAVGVSNADIEELHRIAAPAGYKNIFYSPTFDD 1052 Query: 188 L 188 Sbjct: 1053 F 1053 >UniRef50_B8HSI1 von Willebrand factor type A n=8 Tax=Cyanobacteria RepID=B8HSI1_CYAP4 Length = 589 Score = 97.4 bits (241), Expect = 3e-19, Method: Composition-based stats. Identities = 40/202 (19%), Positives = 71/202 (35%), Gaps = 25/202 (12%) Query: 4 QITFATSDFASNPEPRCPCILLLDVSGSMNGRPINELNAGLVTFRDELLADPLALKRVEL 63 + + SDFA P ++++D SGSM G + + L T+ + L Sbjct: 395 TMLKSWSDFAKKPS---LVVIVVDTSGSMAGEKLANVQNTLNTYINGLSPQDQVALMRFS 451 Query: 64 GIVTFGPVHVEQPFTSAAN----FFPPILFAQGDTPMGAAITKALDMVEERKREYRANGI 119 V V T A F L A G+T + A A + + + R Sbjct: 452 SDV---GTPVVVDGTPAGRDRGLQFISSLRANGNTHLYDATLAARNWLTQNLR------- 501 Query: 120 SYYRPWIFLITDGAPTD---EWQAAANKVFRG--EEDKRFAFFSIGVQGA---DMKTLAQ 171 S + ++TDG T + ++ + D+R +FF++G D + L Q Sbjct: 502 SDAINAVLVLTDGEDTGSAISLEQLGPELQKSGFNSDQRISFFTVGYGEEGEFDPQALQQ 561 Query: 172 ISVRQPLPLQGLQFRELFSWLS 193 I+ + ++ Sbjct: 562 IANVNGGYYSKGDPASIGRLMA 583 >UniRef50_B0T5X0 von Willebrand factor type A n=1 Tax=Caulobacter sp. K31 RepID=B0T5X0_CAUSK Length = 592 Score = 97.4 bits (241), Expect = 3e-19, Method: Composition-based stats. Identities = 27/187 (14%), Positives = 51/187 (27%), Gaps = 24/187 (12%) Query: 17 EPRCPCILLLDVSGSMNG-RPINELNAGLVTFRDELLADPLALKRVELGIVTF-GPVHVE 74 +P + L+D SGSM+G + L D+L + +V + G Sbjct: 227 QPPLNLVFLIDTSGSMSGPDRLPLAKKALNVLIDQLRPQD------RVSMVAYAGSAGAV 280 Query: 75 QPFTSAANF-----FPPILFAQGDTPMGAAITKALDMVEERKREYRANGISYYRPWIFLI 129 T + L + G T G + A + R N + L+ Sbjct: 281 LSPTDGKSKLKMRCALTALRSGGSTAGGQGLELAYAL-------ARQNLDPKAVNRVILM 333 Query: 130 TDGAPTDEWQAA--ANKVFRGEEDKRF--AFFSIGVQGADMKTLAQISVRQPLPLQGLQF 185 TDG + + + G + + ++ + Sbjct: 334 TDGDFNVGIADPTRLKDFVADQRKSGVYLSVYGFGRGNYNDTMMQALAQNGNGTAAYVDG 393 Query: 186 RELFSWL 192 + L Sbjct: 394 LQEARKL 400 >UniRef50_B1KDL1 LPXTG-motif cell wall anchor domain protein n=1 Tax=Shewanella woodyi ATCC 51908 RepID=B1KDL1_SHEWM Length = 739 Score = 97.0 bits (240), Expect = 4e-19, Method: Composition-based stats. Identities = 40/180 (22%), Positives = 66/180 (36%), Gaps = 24/180 (13%) Query: 5 ITFATSDFASNPEPRCPCILLLDVSGSMNGRPINELNAGLVTFRDELLADPLALKRVELG 64 + SD + IL++D SGSM+G I + L L A Sbjct: 348 MLLPPSDQKQDVSISRELILVIDTSGSMSGASIAQAKRALNYALAGLKAKDT------FN 401 Query: 65 IVTFGP-----VHVEQPFTS----AANFFPPILFAQGDTPMGAAITKALDMVEERKREYR 115 ++ F P T+ AN + L A G T M A+ ALD K Sbjct: 402 VIEFNSNVGSLSPYSLPATAKNIGLANQYVRSLKANGGTEMQLALNAALD-----KGTET 456 Query: 116 ANGISYYRPWIFLITDGAPTDEWQAAANKVFRGEEDKRFAFFSIGVQGA-DMKTLAQISV 174 S + +TDG+ DE Q+ + + + + R F++G+ A + + + + Sbjct: 457 EALGSERLRQVLFMTDGSVGDE-QSLFHLIKQKIGESRL--FTLGIGSAPNSHFMRRAAE 513 >UniRef50_A6GBY0 Putative uncharacterized protein n=1 Tax=Plesiocystis pacifica SIR-1 RepID=A6GBY0_9DELT Length = 996 Score = 97.0 bits (240), Expect = 4e-19, Method: Composition-based stats. Identities = 35/188 (18%), Positives = 59/188 (31%), Gaps = 30/188 (15%) Query: 14 SNPEPRCPCILLLDVSGSM-NGRPINELNAGLVTFRDELLADPLALKRVELGIVTFGPVH 72 +P IL++D SGSM +G ++ + L E+G++ F Sbjct: 522 QREQPTLALILVIDKSGSMSSGDRLDLVKEAARATARTLDPSD------EIGVIAFDNSP 575 Query: 73 VE-QPFTSAAN-----FFPPILFAQGDTPMGAAITKALDMVEERKREYRANGISYYRPWI 126 AAN L A G T A+ +A + K + + Sbjct: 576 QVLVRLQPAANRLRISSSIRRLSAGGGTNAMPALREAYLQLAGSKALVKH---------V 626 Query: 127 FLITDGAPTDEWQAAANKVFRGEEDKRFAFFSIGVQG-ADMKTLAQIS----VRQPLPLQ 181 L++DG + N + S+GV A L +++ R Sbjct: 627 ILLSDGES---PENGINALLGDMRQSDITVSSVGVGDGAGKDFLIRVAERGRGRYFYSED 683 Query: 182 GLQFRELF 189 G +F Sbjct: 684 GTDVPRIF 691 Score = 44.6 bits (104), Expect = 0.003, Method: Composition-based stats. Identities = 22/171 (12%), Positives = 51/171 (29%), Gaps = 25/171 (14%) Query: 1 MSEQITFATSDFASNPEPRCPC-----ILLLDVSGSMNGRPINELNAGLVTFRDELLADP 55 M + A + + P R P + ++DVS S++ + + + ++ Sbjct: 142 MRGLVVMAVALALAQPSLRSPIRGKTVVFVVDVSESIDDSQLAAAEQAVREAAELAASEA 201 Query: 56 LAL----KRVELGIVTFGPVHVEQPFTSAANFFPPILFAQGDTPM----GAAITKALDMV 107 R + ++T+ A L D M +A+ A ++ Sbjct: 202 ELGIEKEDRTRVRVITYAGRARLLEL-EAGEAGELSLPRDPDNAMASDHASALRLAEALL 260 Query: 108 EERKREYRANGISYYRPWIFLITDGAPTDEWQAAANKVFRGEEDKRFAFFS 158 + + L+TD + + ED+ + + Sbjct: 261 DPDTEGR-----------VVLMTDATGDLAEREGLGQAIFDLEDRGVSVHT 300 >UniRef50_B8F8Z5 von Willebrand factor type A n=1 Tax=Desulfatibacillum alkenivorans AK-01 RepID=B8F8Z5_DESAA Length = 558 Score = 97.0 bits (240), Expect = 4e-19, Method: Composition-based stats. Identities = 42/195 (21%), Positives = 58/195 (29%), Gaps = 24/195 (12%) Query: 1 MSEQITFATSDFASNPEPRCPCILLLDVSGSMNGR-PINELNAGLVTFRDELLADPLALK 59 M + + LLDVSGSMN + + + EL A Sbjct: 176 MLVHVGLQGRCLDYKDVKPSNLVFLLDVSGSMNSENKLPLVKRSMEMLVKELGAGD---- 231 Query: 60 RVELGIVTF-GPVHVEQPFTSAANFFP-----PILFAQGDTPMGAAITKALDMVEERKRE 113 + IVT+ G + P TSA N L A G T G I A + E Sbjct: 232 --RVSIVTYAGSAGLVLPSTSARNKRKIITALDRLEAGGSTAGGEGIELAYRVAWENLIP 289 Query: 114 YRANGISYYRPWIFLITDGAPTDEWQAAANKVFRGEEDKR----FAFFSIGVQGADMKTL 169 N + L TDG + V EE +R G+ + + Sbjct: 290 EGNNR-------VILCTDGDFNVGVSSTPELVRMIEEKRRAGIYLTICGFGMGNYKDEKM 342 Query: 170 AQISVRQPLPLQGLQ 184 IS + Sbjct: 343 EAISNAGNGNFYYID 357 >UniRef50_Q2QZH3 Os11g0687100 protein n=79 Tax=Eukaryota RepID=Q2QZH3_ORYSJ Length = 633 Score = 97.0 bits (240), Expect = 4e-19, Method: Composition-based stats. Identities = 46/234 (19%), Positives = 81/234 (34%), Gaps = 39/234 (16%) Query: 8 ATSDFASNPEPRCPCILLLDVSGSM-------------NGRPINELNAGLVTFRDELLAD 54 A N + +LDVSGSM G ++ L A + +L Sbjct: 58 APPAADLNSHVPLDVVAVLDVSGSMNDPVAAASPKSNLQGSRLDVLKASMKFVIRKLADG 117 Query: 55 PLALKRVELGIVTFGPVHVEQ----------PFTSAANFFPPILFAQGDTPMGAAITKAL 104 L IV F V++ S A L A+G T + A+ +A+ Sbjct: 118 D------RLSIVAFNDGPVKEYSSGLLDVSGDGRSIAGKKIDRLQARGGTALMPALEEAV 171 Query: 105 DMVEERKREYRANGISYYRPWIFLITDGAPTDEWQAAANKVFRGEEDKRFAFFSIGVQGA 164 +++ER+ R + +I L+TDG T ++ + + F +G Sbjct: 172 KILDERQGSSRN-----HVGFILLLTDGDDTTGFRWTRDAIHGAVFKYPVHTFGLGA-SH 225 Query: 165 DMKTLAQISV---RQPLPLQGLQFRELFSWLSSSLRSV-SRSTPGTEVVLEAPK 214 D + L I+ + + L+ L + + + T V L+A + Sbjct: 226 DPEALLHIAQGSRGTYSFVDDDNLANIAGALAVCLGGLKTVAAVDTRVSLKAAE 279 >UniRef50_Q4RP12 Chromosome 10 SCAF15009, whole genome shotgun sequence n=1 Tax=Tetraodon nigroviridis RepID=Q4RP12_TETNG Length = 1259 Score = 97.0 bits (240), Expect = 4e-19, Method: Composition-based stats. Identities = 33/202 (16%), Positives = 64/202 (31%), Gaps = 20/202 (9%) Query: 20 CPCILLLDVSGSMNGRPINELNAGLVTFRDELLADPLALKRVELGIVTFG-PVHVEQPFT 78 + L+D S S+ + + + + + + + +V + E P T Sbjct: 94 TDLVFLVDGSWSVGRENFKHIRSFIASLAGAF---DIGEDKTRVAVVQYSTDTRTEFPLT 150 Query: 79 SAAN-----FFPPIL-FAQGDTPMGAAITKALDMVEERKREYRANGISYYRPWIFLITDG 132 L + G+T G AI L + R + + +ITDG Sbjct: 151 RYTRRGDLLQAINSLPYKGGNTMTGDAIDYLLQNIFTEAGGSRKS----FPKVAMIITDG 206 Query: 133 APTDEWQAAANKVFRGEEDKRFAFFSIGVQGADMKTLAQISVRQP--LPLQGLQFRELFS 190 D + A ++ + F +G++GAD L +I+ F ++ Sbjct: 207 KSQDPVEEHARRLR----NIGVEIFVLGIKGADEDELREIASTPHSKHMYNVPNFDKIQE 262 Query: 191 WLSSSLRSVSRSTPGTEVVLEA 212 +R V L + Sbjct: 263 VQKKIIREVCSGVDEQLSSLVS 284 Score = 93.9 bits (232), Expect = 3e-18, Method: Composition-based stats. Identities = 38/189 (20%), Positives = 67/189 (35%), Gaps = 21/189 (11%) Query: 20 CPCILLLDVSGSMNGRPINELNAGLVTFRDELLADPLALKRVELGIVTFG-PVHVEQPFT 78 +LL+D S S+ + ++ A L + + +V++ +V + H E Sbjct: 394 ADVVLLVDGSYSIGLQNFAKVRAFLEVLVNSF---DIGPSKVQISLVQYSRDPHTEFALN 450 Query: 79 S--AANFFPPILFA----QGDTPMGAAITKALDMVEERKREYRANGISYYRPWIFLITDG 132 + N + G T G A+ D + R R N + LITDG Sbjct: 451 THHDINAVVRAVRTFPYRGGSTNTGKAMKYVKDKIFVASRGARQNVPR----VMVLITDG 506 Query: 133 APTDEWQAAANKVFRGEEDKRFAFFSIGVQGADMKTLAQISV--RQPLPLQGLQFRELFS 190 +D ++ AA + + F++GV+ A L I+ + F F Sbjct: 507 KSSDSFKDAATNLR----NIDVEIFAVGVKDAVRSELEAIANPPADNHVFEVEDFDA-FQ 561 Query: 191 WLSSSLRSV 199 +S L Sbjct: 562 RISKELTQS 570 Score = 66.1 bits (160), Expect = 8e-10, Method: Composition-based stats. Identities = 29/202 (14%), Positives = 59/202 (29%), Gaps = 46/202 (22%) Query: 6 TFATSDFASNPEPRCPCILLLDVSGSMNGRPINELNAGLVTFRDELLADPLALKRVELGI 65 ++ SD + + +LLLD S S+ + + + + +V++G+ Sbjct: 1099 PYSPSDVTCKTKAQADIVLLLDGSWSIGRLNFKTIRTFISRMVEVF---DIGPDKVQVGL 1155 Query: 66 VTFGPVHVEQPFTSAANFFPPILFAQGDTPMGAAITKALDMVEERKREYRANGISYYRPW 125 AL+ V + + R Sbjct: 1156 -------------------------------------ALNYVLQNNFKENVGMRRNSRKI 1178 Query: 126 IFLITDGAPTDEWQAAANKVFRGEEDKRFAFFSIGVQGADMKTLAQISVRQP--LPLQGL 183 L+TDG D+ A + +++GV+ A+ L I+ Sbjct: 1179 GVLVTDGKSQDDVHEKAQNLRNE----NIELYAVGVKNAEENELRSIASDPDDIHMYNVA 1234 Query: 184 QFRELFSWLSSSLRSVSRSTPG 205 F L + + ++ S G Sbjct: 1235 DFSFLLDIVDNLTNNLCNSVKG 1256 >UniRef50_UPI0001788F71 von Willebrand factor type A n=1 Tax=Geobacillus sp. Y412MC10 RepID=UPI0001788F71 Length = 1007 Score = 97.0 bits (240), Expect = 4e-19, Method: Composition-based stats. Identities = 39/216 (18%), Positives = 72/216 (33%), Gaps = 34/216 (15%) Query: 18 PRCPCILLLDVSGSMNGRPINELNAGLVTFRDELLADPLALKRVELGIVTFGPVH-VEQP 76 P IL++D SGSM+G I + + + A +G+V F P Sbjct: 405 PSLGLILVIDRSGSMDGNKIELAKESAMRTVELMRAKDT------VGVVAFDDQPWWVVP 458 Query: 77 FTSAANFF-----PPILFAQGDTPMGAAITKALDMVEERKREYRANGISYYRPWIFLITD 131 + + + G T + A++ AL+ + + + R I L+TD Sbjct: 459 PQKLGDKEEVLSSIQSIPSAGGTNIYPAVSSALEEMLKIDAQRRH---------IILMTD 509 Query: 132 GAPTDEWQAAANKVFRGEEDKRFAFFSIGVQ-GADMKTLAQIS----VRQPLPLQGLQFR 186 G + + + + S+ V AD L ++ R Sbjct: 510 GQS--AMNSGYQDLTDTMVENKITMSSVAVGMDADTNLLQSLADAAKGRYYFVEDETTLP 567 Query: 187 ELFSWLSSSLRSVSRSTPGTEVVLEA---PKGWTSV 219 +F S +++S + + A P W S+ Sbjct: 568 AVF---SREAVMLAKSYIVDKPFVPAVQNPGDWASL 600 >UniRef50_A4C730 Putative uncharacterized protein n=1 Tax=Pseudoalteromonas tunicata D2 RepID=A4C730_9GAMM Length = 684 Score = 97.0 bits (240), Expect = 4e-19, Method: Composition-based stats. Identities = 34/218 (15%), Positives = 72/218 (33%), Gaps = 30/218 (13%) Query: 5 ITFATSDFASNPEPRCPCILLLDVSGSMNGRPINELNAGLVTFRDELLADPLALKRVELG 64 + + +F + I ++D SGSM+G + + + L L Sbjct: 315 LMPPSDEFIAAQRLPREVIFVIDTSGSMHGESLEQAKSALFFALANLDPQDS------FN 368 Query: 65 IVTFGPVHVEQPFT--SAANFFPPI-------LFAQGDTPMGAAITKALDMVEERKREYR 115 I+ F A +F L A G T +G A + LD Sbjct: 369 IIEFNSKVNALNAQALPANDFNIRRARNFVYGLKADGGTEIGLAFEQVLD---------- 418 Query: 116 ANGISYYRPWIFLITDGAPTDEWQAAANKVFRGEEDKRFAFFSIGVQGA-DMKTLAQISV 174 + + Y I +TDG+ ++E ++ D R F+IG+ A + + + + Sbjct: 419 NSEHADYLRQIVFLTDGSISNE-TEVFAQIKGSLGDSRI--FTIGIGSAPNSYFMTRAAT 475 Query: 175 RQPLPLQ-GLQFRELFSWLSSSLRSVSRSTPGTEVVLE 211 ++ + + ++ + ++ + Sbjct: 476 LGRGTFTFIGDVTDVQRTMKNLFVQLANAALKELIITD 513 >UniRef50_B9GK57 Predicted protein (Fragment) n=2 Tax=Populus trichocarpa RepID=B9GK57_POPTR Length = 595 Score = 97.0 bits (240), Expect = 4e-19, Method: Composition-based stats. Identities = 44/233 (18%), Positives = 73/233 (31%), Gaps = 49/233 (21%) Query: 17 EPRCPCILLLDVSGSMNGRPINELNAGLVTFRDELLADPLALKRVELGIVTF-GPVHVEQ 75 + +LDVSGSM G + L + L L IVTF Sbjct: 152 RAPIDIVNVLDVSGSMAG-KLILLKRAVNFIIQNLGPSD------RLSIVTFSSSARRIL 204 Query: 76 PFTSAAN-------FFPPILFAQGDTPMGAAITKALDMVEERKREYRANGISYYRPWIFL 128 P + + L A G T + A + K + ++EER++ I L Sbjct: 205 PLRTMSGSGREDAISVVNSLSATGGTNIVAGLRKGVRVLEERRQHNSVAS-------IIL 257 Query: 129 ITDGAPTDEWQAA--ANKVF----------RGEEDKRFAFFSIGVQ----GADMKTLAQI 172 ++DG T + F + G A M ++ + Sbjct: 258 LSDGCDTQSHSTHNRLEYLKLIFPSNNASGEESRQPTFPIHTFGFGLDHDSAAMHAISDV 317 Query: 173 SVRQPLPLQGLQ-----FRELFSWLSSSLRS-----VSRSTPGTEVVLEAPKG 215 S ++ + F L+S + V ++PG ++ L P G Sbjct: 318 SGGTFSFIESIDILQDAFARCIGGLTSIVARDVQLKVRSASPGVQI-LSTPSG 369 >UniRef50_Q4SQ43 Chromosome 19 SCAF14535, whole genome shotgun sequence. (Fragment) n=2 Tax=Tetraodon nigroviridis RepID=Q4SQ43_TETNG Length = 1060 Score = 97.0 bits (240), Expect = 4e-19, Method: Composition-based stats. Identities = 42/206 (20%), Positives = 76/206 (36%), Gaps = 22/206 (10%) Query: 20 CPCILLLDVSGSMNGRPINELNAGLVTFRDELLADPLALKRVELGIVTFGPVHVEQPFTS 79 + L+D S S+ + + + + + V G+ FG V + + Sbjct: 461 ADIVFLVDESWSVGQNSFSHVKDFISAIITSFKDSVVGTEGVRFGVTVFGDVPKMRIALT 520 Query: 80 AANFFPPILFA-------QGDTPMGAAITKALDMVEERKREYRANGISYYRPWI-FLITD 131 + +L A +G A+T + V + + P I LIT+ Sbjct: 521 DYSSQEEVLRAIRDLPYEGRSRRIGDALTFLVQHV------FSPVIRRDHGPKIAVLITN 574 Query: 132 GAPTDEWQAAANKVFRGEEDKRFAFFSIGVQGADMKTLAQISV--RQPLPLQGLQFRELF 189 G +D+ AA ++ D + F++GV GAD L ++ R+ L G + L Sbjct: 575 GR-SDDPVDAAARLVA---DSGISLFAVGVGGADASELRRMVSEPREEHLLLGADYSALE 630 Query: 190 SWLSSSLRS--VSRSTPGTEVVLEAP 213 + L+ R V+ S P V + Sbjct: 631 NLLARLSRRVCVTASEPPRPVKMSPT 656 >UniRef50_Q99715 Collagen alpha-1(XII) chain n=75 Tax=Euteleostomi RepID=COCA1_HUMAN Length = 3063 Score = 96.6 bits (239), Expect = 5e-19, Method: Composition-based stats. Identities = 33/202 (16%), Positives = 63/202 (31%), Gaps = 20/202 (9%) Query: 20 CPCILLLDVSGSMNGRPINELNAGLVTFRDELLADPLALKRVELGIVTFGPVHVEQPFTS 79 + L+D S S+ + + + ++ +G+V + + + Sbjct: 139 TDLVFLVDGSWSVGRNNFKYILDFIAALVSAF---DIGEEKTRVGVVQYSSDTRTEFNLN 195 Query: 80 AANFFPPILFA-------QGDTPMGAAITKALDMVEERKREYRANGISYYRPWIFLITDG 132 +L A G+T G AI + R + +ITDG Sbjct: 196 QYYQRDELLAAIKKIPYKGGNTMTGDAIDYLVKNTFTESAGARVG----FPKVAIIITDG 251 Query: 133 APTDEWQAAANKVFRGEEDKRFAFFSIGVQGADMKTLAQISVRQ--PLPLQGLQFRELFS 190 DE + A ++ + FS+G++ AD K L QI+ F + Sbjct: 252 KSQDEVEIPARELR----NVGVEVFSLGIKAADAKELKQIASTPSLNHVFNVANFDAIVD 307 Query: 191 WLSSSLRSVSRSTPGTEVVLEA 212 + + V L + Sbjct: 308 IQNEIISQVCSGVDEQLGELVS 329 Score = 94.7 bits (234), Expect = 2e-18, Method: Composition-based stats. Identities = 38/190 (20%), Positives = 68/190 (35%), Gaps = 21/190 (11%) Query: 19 RCPCILLLDVSGSMNGRPINELNAGLVTFRDELLADPLALKRVELGIVTFG----PVHVE 74 + + L+D S S+ ++ A L P RV++ +V + Sbjct: 438 KADIVFLVDGSYSIGIANFVKVRAFLEVLVKSFEISPN---RVQISLVQYSRDPHTEFTL 494 Query: 75 QPFTSAANFFPPI---LFAQGDTPMGAAITKALDMVEERKREYRANGISYYRPWIFLITD 131 + FT + I + G T G A+T + + + R+N + LITD Sbjct: 495 KKFTKVEDIIEAINTFPYRGGSTNTGKAMTYVREKIFVPSKGSRSNVPK----VMILITD 550 Query: 132 GAPTDEWQAAANKVFRGEEDKRFAFFSIGVQGADMKTLAQISV--RQPLPLQGLQFRELF 189 G +D ++ A K+ + F++GV+ A L I+ + F F Sbjct: 551 GKSSDAFRDPAIKLR----NSDVEIFAVGVKDAVRSELEAIASPPAETHVFTVEDFDA-F 605 Query: 190 SWLSSSLRSV 199 +S L Sbjct: 606 QRISFELTQS 615 Score = 93.9 bits (232), Expect = 4e-18, Method: Composition-based stats. Identities = 41/219 (18%), Positives = 73/219 (33%), Gaps = 21/219 (9%) Query: 5 ITFATSDFASNPEPRCPCILLLDVSGSMNGRPINELNAGLVTFRDELLADPLALKRVELG 64 + +S +LL+D S S+ + + + + + KRV++ Sbjct: 1183 MPILSSGMECLTRAEADIVLLVDGSWSIGRANFRTVRSFISRIVEVF---DIGPKRVQIA 1239 Query: 65 IVTFGPVHVEQPFTSAANFFPPILFA-------QGDTPMGAAITKALDMVEERKREYRAN 117 + + + +A +L A G+T G A+ + R Sbjct: 1240 LAQYSGDPRTEWQLNAHRDKKSLLQAVANLPYKGGNTLTGMALNFIRQQNFRTQAGMRPR 1299 Query: 118 GISYYRPWIFLITDGAPTDEWQAAANKVFRGEEDKRFAFFSIGVQGADMKTLAQISVRQ- 176 LITDG D+ +A + K+ D+ F+IG++ AD L I+ Sbjct: 1300 ARKIG----VLITDGKSQDDVEAPSKKLK----DEGVELFAIGIKNADEVELKMIATDPD 1351 Query: 177 -PLPLQGLQFRELFSWLSSSLRSVSRSTPGTEVVLEAPK 214 F L + ++ S G LEAP Sbjct: 1352 DTHAYNVADFESLSRIVDDLTINLCNSVKG-PGDLEAPS 1389 Score = 78.9 bits (193), Expect = 1e-13, Method: Composition-based stats. Identities = 34/179 (18%), Positives = 62/179 (34%), Gaps = 19/179 (10%) Query: 19 RCPCILLLDVSGSMNGRPINELNAGLVTFRDELLADPLALKRVELGIVTFGPVHVEQPFT 78 + + L D S S+ N++ + D ++ +++ V + + Sbjct: 2321 KADIVFLTDASWSIGDDNFNKVVKFIFNTVGGF--DEISPAGIQVSFVQYSDEVKSEFKL 2378 Query: 79 SAANFFPPILFA-------QGDTPMGAAITKALDMVEERKREYRANGISYYRPWIFLITD 131 + N L A G+T G A+T + V + R N + ++TD Sbjct: 2379 NTYNDKALALGALQNIRYRGGNTRTGKALTFIKEKVLTWESGMRKNVPK----VLVVVTD 2434 Query: 132 GAPTDEWQAAANKVFRGEEDKRFAFFSIGVQGADMKTLAQISVRQP--LPLQGLQFREL 188 G DE + AA + + F+ F +GV D LA I+ + F Sbjct: 2435 GRSQDEVKKAALVIQQ----SGFSVFVVGVADVDYNELANIASKPSERHVFIVDDFESF 2489 >UniRef50_UPI000180BDFB PREDICTED: similar to cartilage matrix protein n=1 Tax=Ciona intestinalis RepID=UPI000180BDFB Length = 272 Score = 96.6 bits (239), Expect = 5e-19, Method: Composition-based stats. Identities = 30/181 (16%), Positives = 63/181 (34%), Gaps = 13/181 (7%) Query: 18 PRCPCILLLDVSGSMNGRPINELNAGLVTFRDELLADPLALKRVELGIVTFGPVHVEQPF 77 + +LD S S+ + + + F D + +++G++ FG E+ Sbjct: 31 RPLDIVFMLDGSRSVRPKNFQTVKDYVKNFTDIF--EAFGPNDMQVGVIQFGSGVREEIL 88 Query: 78 -------TSAANFFPPILFAQGDTPMGAAITKALDMVEERKREYRANGISYYRPWIFLIT 130 I + + T G A+ K + + R + + +IT Sbjct: 89 LNQFYVRHELMEAIDNIRYMETGTMTGLALRKLVTETLTVEHGARVD-NPIVHTVVVIIT 147 Query: 131 DGAPTDEWQAAANKVFRGEEDKRFAFFSIGVQ-GADMKTLAQISVRQP--LPLQGLQFRE 187 DG D + K + + + F F+IG+ A+ K L +++ + F Sbjct: 148 DGKSQDYSRGGVTKWTKEAKARGFEIFAIGIGRKANRKELLEMASEPKELHTFRVQNFNA 207 Query: 188 L 188 + Sbjct: 208 I 208 >UniRef50_Q8PU63 Putative chloride channel n=1 Tax=Methanosarcina mazei RepID=Q8PU63_METMA Length = 1004 Score = 96.6 bits (239), Expect = 5e-19, Method: Composition-based stats. Identities = 37/197 (18%), Positives = 71/197 (36%), Gaps = 28/197 (14%) Query: 13 ASNPEPRCPCILLLDVSGSMNGRPINELNAGLVTFRDELLADPLALKRVELGIVTFGPV- 71 N +L++D SGSM+G PI+ F D + A+ +A G+V+F Sbjct: 306 EDNANANANVMLVIDRSGSMSGSPISSAKNSANLFIDYMEAEDMA------GVVSFSSSA 359 Query: 72 ------HVEQP-FTSAANFFPPILFAQGDTPMGAAITKALDMVEERKREYRANGISYYRP 124 P ++ ++A G T +G+ + L+ + Sbjct: 360 RYDYHLATLTPEVKNSIKQKINSIYASGVTAIGSGMRYGLNDLLNYGDPNNP-------W 412 Query: 125 WIFLITDGAPTDEWQAAANKVFRGEEDKRFAFFSIGVQGA-DMKTLAQIS----VRQPLP 179 I L++DG N V + +++G+ A D K L I+ + Sbjct: 413 AIVLLSDGYQNSGENP--NNVIPSIKASNIQVYTVGLGPAVDQKLLGNIADQTGGKYYYS 470 Query: 180 LQGLQFRELFSWLSSSL 196 Q +E+++ + + Sbjct: 471 PTDSQLQEIYNDIVGKI 487 >UniRef50_O15232 Matrilin-3 n=28 Tax=Euteleostomi RepID=MATN3_HUMAN Length = 486 Score = 96.6 bits (239), Expect = 5e-19, Method: Composition-based stats. Identities = 28/167 (16%), Positives = 52/167 (31%), Gaps = 15/167 (8%) Query: 17 EPRCPCILLLDVSGSMNGRPINELNAGLVTFRDELLADPLALKRVELGIVTFGPVHVE-- 74 + ++D S S+ ++ + D L + + +V + Sbjct: 79 SRPLDLVFIIDSSRSVRPLEFTKVKTFVSRIIDTL---DIGPADTRVAVVNYASTVKIEF 135 Query: 75 -----QPFTSAANFFPPILFAQGDTPMGAAITKALDMVEERKREYRANGISYYRPWIFLI 129 S I T G AI A+D + R + + I + Sbjct: 136 QLQAYTDKQSLKQAVGRITPLSTGTMSGLAIQTAMDEAFTVEAGAREPSSNIPKVAIIV- 194 Query: 130 TDGAPTDEWQAAANKVFRGEEDKRFAFFSIGVQGADMKTLAQISVRQ 176 TDG P D+ A + +++GV ADM +L ++ Sbjct: 195 TDGRPQDQVNEVAARAQA----SGIELYAVGVDRADMASLKMMASEP 237 >UniRef50_B5JSY7 Inter-alpha-trypsin inhibitor domain protein n=1 Tax=gamma proteobacterium HTCC5015 RepID=B5JSY7_9GAMM Length = 670 Score = 96.6 bits (239), Expect = 5e-19, Method: Composition-based stats. Identities = 35/198 (17%), Positives = 59/198 (29%), Gaps = 33/198 (16%) Query: 9 TSDFASNPEPRCPCILLLDVSGSMNGRPINELNAGLVTFRDELLADPLALKRVELGIVTF 68 + S P + ++D SGSM G+ + L + L D +V F Sbjct: 309 PDEMTSGPRMPREVVFVIDTSGSMAGQRMYHAKQALSQAVERLSPDD------RFNVVEF 362 Query: 69 GP--VHVEQPFTSAANFFPP-------ILFAQGDTPMGAAITKALDMVEERKREYRANGI 119 + SA+ L G T M A+ AL Sbjct: 363 NNQHSRLFSSMRSASAINVKQALNWVGRLQGGGGTMMLPAVEDAL----------SVRSD 412 Query: 120 SYYRPWIFLITDGAPTDEWQAAANKVFRGEEDKRFAFFSIGVQ-GADMKTLAQIS----V 174 Y + LITD + +A +V + K F++G+ + L + + Sbjct: 413 PAYLRQVILITD--ASVGNEAEILRVVERQR-KGARLFTVGIGVSPNSYLLRKAAQVGQG 469 Query: 175 RQPLPLQGLQFRELFSWL 192 G + + L Sbjct: 470 DYVYIASGQEVKARMQRL 487 >UniRef50_Q083T9 Vault protein inter-alpha-trypsin domain protein n=1 Tax=Shewanella frigidimarina NCIMB 400 RepID=Q083T9_SHEFN Length = 722 Score = 96.6 bits (239), Expect = 6e-19, Method: Composition-based stats. Identities = 38/215 (17%), Positives = 73/215 (33%), Gaps = 24/215 (11%) Query: 5 ITFATSDFASNPEPRCPCILLLDVSGSMNGRPINELNAGLVTFRDELLADPLALKRVELG 64 + + + + IL++D SGSM+G+ I + L L Sbjct: 329 LMPPSVEVSEQHLIARELILVIDTSGSMSGQSITQAKQALQFALAGLRDIDS------FN 382 Query: 65 IVTFGPVHVEQPFTSA---------ANFFPPILFAQGDTPMGAAITKALDMVEERKREYR 115 I+ F T AN F L A G T M +A+ AL + ++ + Sbjct: 383 IIEFNSDVTMLSATPLSANSRNIGKANRFIQSLDADGGTEMRSALQTAL-VDSVQQDSDQ 441 Query: 116 ANGISYYRPWIFLITDGAPTDEWQAAANKVFRGEEDKRFAFFSIGVQGA-DMKTLAQIS- 173 + S + +TDGA +E + + D R F++G+ A + + + + Sbjct: 442 TDAHSEMLRQVIFMTDGAVGNEHE-LYQLINDQLGDSRL--FTVGIGSAPNSDFMRRAAT 498 Query: 174 ---VRQPLPLQGLQFRELFSWLSSSLRSVSRSTPG 205 + ++ L + + + G Sbjct: 499 MGRGTFTYIGNESEVQQKIEQLLNKIEQPVLTNIG 533 >UniRef50_B9RR85 Inter-alpha-trypsin inhibitor heavy chain, putative n=1 Tax=Ricinus communis RepID=B9RR85_RICCO Length = 755 Score = 96.6 bits (239), Expect = 6e-19, Method: Composition-based stats. Identities = 29/180 (16%), Positives = 59/180 (32%), Gaps = 34/180 (18%) Query: 19 RCPCILLLDVSGSMNGRPINELNAGLVTFRDELLADPLALKRVELGIVTFGPVHVEQPFT 78 R + ++D+SGSM G+P+ + + +L I+ F F+ Sbjct: 324 RKEIVFIVDISGSMEGKPLEGMKNAMSGALAKLNPKDS------FNIIAFNGETYL--FS 375 Query: 79 SAANFFPPI------------LFAQGDTPMGAAITKALDMVEERKREYRANGISYYRPWI 126 S A G T + + +A++MV + P I Sbjct: 376 SLMELATEKTVERAVEWMNLNFIAGGGTNISVPLNQAMEMVSNTQGSL---------PVI 426 Query: 127 FLITDGAPTDEWQAAANKVFRGEEDKRF---AFFSIGVQGA-DMKTLAQISVRQPLPLQG 182 FL+TDG ++ + + + + K ++ G+ + L ++ Sbjct: 427 FLVTDG-AVEDERHICDSMKKYVRGKGAICPRIYTFGIGTYCNHYFLRMLATVCRGQYDA 485 Database: uniref50.fasta Posted date: Mar 8, 2010 10:38 AM Number of letters in database: 1,040,396,356 Number of sequences in database: 3,077,464 Lambda K H 0.312 0.143 0.386 Lambda K H 0.267 0.0434 0.140 Matrix: BLOSUM62 Gap Penalties: Existence: 11, Extension: 1 Number of Hits to DB: 1,239,616,083 Number of Sequences: 3077464 Number of extensions: 44997764 Number of successful extensions: 138291 Number of sequences better than 1.0e-01: 250 Number of HSP's better than 0.1 without gapping: 915 Number of HSP's successfully gapped in prelim test: 3937 Number of HSP's that attempted gapping in prelim test: 130384 Number of HSP's gapped (non-prelim): 6003 length of query: 219 length of database: 1,040,396,356 effective HSP length: 124 effective length of query: 95 effective length of database: 658,790,820 effective search space: 62585127900 effective search space used: 62585127900 T: 11 A: 40 X1: 16 ( 7.2 bits) X2: 38 (14.6 bits) X3: 64 (24.7 bits) S1: 41 (21.3 bits) S2: 90 (39.2 bits)