BLASTP 2.2.22 [Sep-27-2009] Reference: Altschul, Stephen F., Thomas L. Madden, Alejandro A. Schaffer, Jinghui Zhang, Zheng Zhang, Webb Miller, and David J. Lipman (1997), "Gapped BLAST and PSI-BLAST: a new generation of protein database search programs", Nucleic Acids Res. 25:3389-3402. Reference for composition-based statistics: Schaffer, Alejandro A., L. Aravind, Thomas L. Madden, Sergei Shavirin, John L. Spouge, Yuri I. Wolf, Eugene V. Koonin, and Stephen F. Altschul (2001), "Improving the accuracy of PSI-BLAST protein database searches with composition-based statistics and other refinements", Nucleic Acids Res. 29:2994-3005. Query= batch____ (427 letters) Database: uniref50.fasta 3,077,464 sequences; 1,040,396,356 total letters Searching..................................................done Results from round 1 Score E Sequences producing significant alignments: (bits) Value UniRef50_Q1C7U5 UPF0229 protein YPA_1510 n=195 Tax=Proteobacteri... 523 e-147 UniRef50_Q1QPM0 UPF0229 protein Nham_0975 n=24 Tax=Proteobacteri... 463 e-129 UniRef50_B6B8L1 Putative uncharacterized protein n=2 Tax=Rhodoba... 458 e-127 UniRef50_Q1QZ69 UPF0229 protein Csal_0882 n=164 Tax=Proteobacter... 456 e-127 UniRef50_P59348 UPF0229 protein bll6755 n=25 Tax=Alphaproteobact... 455 e-126 UniRef50_A9I2A9 Putative uncharacterized protein n=6 Tax=Burkhol... 449 e-125 UniRef50_B7H9V9 UPF0229 protein BCB4264_A0587 n=93 Tax=Bacteria ... 420 e-116 UniRef50_A7Z2R4 UPF0229 protein RBAM_009260 n=29 Tax=Firmicutes ... 402 e-110 UniRef50_B9DZE4 UPF0229 protein CKR_0568 n=26 Tax=Clostridiales ... 397 e-109 UniRef50_B9L510 Sporulation protein YhbH n=1 Tax=Thermomicrobium... 390 e-107 UniRef50_Q3J885 Putative uncharacterized protein n=2 Tax=Nitroso... 389 e-107 UniRef50_B5EPS6 Putative uncharacterized protein n=2 Tax=Acidith... 375 e-102 UniRef50_A9FJ88 Uncharacterized conserved protein involved in st... 372 e-101 UniRef50_Q896G6 Conserved protein n=1 Tax=Clostridium tetani Rep... 369 e-100 UniRef50_UPI00016C05A6 hypothetical protein Epulo_00085 n=1 Tax=... 363 8e-99 UniRef50_C9RD74 Putative uncharacterized protein n=1 Tax=Ammonif... 359 2e-97 UniRef50_Q67Q87 Putative uncharacterized protein n=1 Tax=Symbiob... 343 8e-93 UniRef50_A4BNQ7 Putative uncharacterized protein n=1 Tax=Nitroco... 330 5e-89 UniRef50_Q9HRD6 UPF0229 protein VNG_0746C n=11 Tax=Halobacteriac... 310 9e-83 UniRef50_C1XT76 Uncharacterized conserved protein n=1 Tax=Meioth... 306 1e-81 UniRef50_Q1GJ99 Putative uncharacterized protein n=1 Tax=Ruegeri... 305 3e-81 UniRef50_Q72KI5 Glycosyltransferase n=3 Tax=Thermus RepID=Q72KI5... 278 2e-73 UniRef50_A0MN74 Conserved bacterial protein n=1 Tax=Thermus phag... 243 7e-63 UniRef50_Q747C5 Conserved domain protein n=11 Tax=Deltaproteobac... 216 1e-54 UniRef50_A4BJU7 Putative uncharacterized protein n=1 Tax=Reineke... 172 3e-41 UniRef50_D2EEA7 Putative uncharacterized protein n=1 Tax=Candida... 156 2e-36 UniRef50_A9DKM0 Putative uncharacterized protein (Fragment) n=1 ... 139 2e-31 UniRef50_B4DA43 von Willebrand factor type A n=1 Tax=Chthoniobac... 117 5e-25 UniRef50_D2VY88 Predicted protein n=1 Tax=Naegleria gruberi RepI... 114 5e-24 UniRef50_A4C5H3 Von Willebrand factor type A domain protein n=1 ... 98 6e-19 UniRef50_C6M483 von Willebrand factor type A domain protein n=1 ... 96 3e-18 UniRef50_A9DBX8 Putative uncharacterized protein n=1 Tax=Hoeflea... 95 5e-18 UniRef50_C5BTM6 von Willebrand factor type A domain protein n=1 ... 91 8e-17 UniRef50_C6BAR1 von Willebrand factor type A n=7 Tax=Alphaproteo... 90 2e-16 UniRef50_A8TM27 Putative Ser protein kinase n=1 Tax=alpha proteo... 88 6e-16 UniRef50_B1KPQ5 von Willebrand factor type A n=8 Tax=Gammaproteo... 87 1e-15 UniRef50_C0YQB8 von Willebrand factor, type A n=2 Tax=Flavobacte... 87 1e-15 UniRef50_A9KS19 von Willebrand factor type A n=1 Tax=Clostridium... 86 2e-15 UniRef50_C8SEV7 von Willebrand factor type A n=4 Tax=Rhizobiales... 86 3e-15 UniRef50_B0T5X0 von Willebrand factor type A n=1 Tax=Caulobacter... 85 4e-15 UniRef50_Q21MJ3 von Willebrand factor, type A n=5 Tax=Proteobact... 84 1e-14 UniRef50_Q2N8R4 Von Willebrand factor type A domain protein n=2 ... 84 1e-14 UniRef50_C6VVX3 von Willebrand factor type A n=17 Tax=Bacteria R... 84 1e-14 UniRef50_Q4KKB4 von Willebrand factor type A domain protein n=20... 83 2e-14 UniRef50_UPI0001C375AE von Willebrand factor type A n=1 Tax=Rumi... 83 2e-14 UniRef50_C7PNZ7 von Willebrand factor type A n=5 Tax=Bacteroidet... 83 2e-14 UniRef50_C0CVB5 Putative uncharacterized protein n=1 Tax=Clostri... 83 3e-14 UniRef50_UPI0001913F8A hypothetical protein Salmonellaentericaen... 81 6e-14 UniRef50_B8F8Z5 von Willebrand factor type A n=1 Tax=Desulfatiba... 79 4e-13 UniRef50_A3PN61 von Willebrand factor, type A n=9 Tax=Rhodobacte... 77 1e-12 UniRef50_UPI00016C54C8 hypothetical protein GobsU_21830 n=1 Tax=... 76 2e-12 UniRef50_A5WCP1 von Willebrand factor, type A n=2 Tax=Moraxellac... 76 2e-12 UniRef50_Q7UP85 Putative uncharacterized protein n=1 Tax=Rhodopi... 76 3e-12 UniRef50_B4WCU1 von Willebrand factor type A domain protein n=2 ... 76 3e-12 UniRef50_C8N8N5 von Willebrand factor type A domain protein n=2 ... 75 5e-12 UniRef50_C0Z5D5 Putative uncharacterized protein n=1 Tax=Breviba... 75 6e-12 UniRef50_A1ZHE2 Von Willebrand factor type A domain protein n=1 ... 74 8e-12 UniRef50_A9IU52 Putative lipoprotein n=3 Tax=Bordetella RepID=A9... 74 9e-12 UniRef50_Q28U54 von Willebrand factor type A n=3 Tax=Rhodobacter... 74 1e-11 UniRef50_P76481 Uncharacterized protein yfbK n=38 Tax=Enterobact... 74 1e-11 UniRef50_A5F9T1 von Willebrand factor, type A n=9 Tax=Bacteria R... 73 2e-11 UniRef50_A6GHE4 von Willebrand factor, type A n=1 Tax=Plesiocyst... 73 2e-11 UniRef50_B0CCM8 von Willebrand factor, type A domain protein n=4... 73 2e-11 UniRef50_C1AC65 Putative uncharacterized protein n=1 Tax=Gemmati... 73 2e-11 UniRef50_C7PR69 von Willebrand factor type A n=1 Tax=Chitinophag... 73 2e-11 UniRef50_A3D1E9 von Willebrand factor, type A n=8 Tax=Shewanella... 71 6e-11 UniRef50_A1ZU67 Von Willebrand factor type A domain protein n=1 ... 71 1e-10 UniRef50_D0XSR4 von Willebrand factor type A n=1 Tax=Brevundimon... 69 2e-10 UniRef50_A3ZT14 Putative uncharacterized protein n=1 Tax=Blastop... 69 2e-10 UniRef50_A0NVX5 Putative uncharacterized protein n=1 Tax=Labrenz... 69 3e-10 UniRef50_B1ZYN3 von Willebrand factor type A n=2 Tax=Bacteria Re... 69 3e-10 UniRef50_C7N770 Uncharacterized protein containing a von Willebr... 68 5e-10 UniRef50_UPI000185CB41 protein containing von Willebrand factor ... 68 6e-10 UniRef50_B0UJ22 von Willebrand factor type A n=1 Tax=Methylobact... 67 8e-10 UniRef50_A7C0I1 von Willebrand factor type A domain protein n=5 ... 67 1e-09 UniRef50_C8WPE6 von Willebrand factor type A n=1 Tax=Eggerthella... 66 2e-09 UniRef50_UPI00006CDDCC von Willebrand factor type A domain conta... 65 6e-09 UniRef50_A3TQT5 Putative secreted protein n=1 Tax=Janibacter sp.... 64 1e-08 UniRef50_A8SXC8 Putative uncharacterized protein n=2 Tax=Clostri... 63 2e-08 UniRef50_A9GVG2 Putative uncharacterized protein n=1 Tax=Sorangi... 63 2e-08 UniRef50_Q1CVN5 von Willebrand factor type A domain protein n=2 ... 63 2e-08 UniRef50_B4D1N7 Autotransporter-associated beta strand repeat pr... 62 5e-08 UniRef50_A9NFD9 Surface-anchored VWFA domain protein n=1 Tax=Ach... 61 7e-08 UniRef50_D2W4Q3 Predicted protein n=1 Tax=Naegleria gruberi RepI... 59 3e-07 UniRef50_D2VDM1 Predicted protein n=1 Tax=Naegleria gruberi RepI... 59 4e-07 UniRef50_A0P2E4 Von Willebrand factor type A like domain n=1 Tax... 57 2e-06 UniRef50_Q55874 Uncharacterized protein sll0103 n=16 Tax=Cyanoba... 54 8e-06 UniRef50_A9AWD1 von Willebrand factor type A n=1 Tax=Herpetosiph... 54 1e-05 UniRef50_A1ZRP2 Von Willebrand factor type A domain protein n=1 ... 54 1e-05 UniRef50_D0MZH7 Putative uncharacterized protein n=1 Tax=Phytoph... 53 2e-05 UniRef50_Q231J4 von Willebrand factor type A domain containing p... 53 2e-05 UniRef50_Q22SJ4 von Willebrand factor type A domain containing p... 53 2e-05 UniRef50_B5JNR2 von Willebrand factor type A domain protein n=1 ... 53 2e-05 UniRef50_C8SCW7 Putative uncharacterized protein n=1 Tax=Ferrogl... 53 2e-05 UniRef50_A1ZFT4 Von Willebrand factor, type A n=1 Tax=Microscill... 53 2e-05 UniRef50_Q9ZGE6 Magnesium-chelatase 67 kDa subunit n=2 Tax=Helio... 53 3e-05 UniRef50_O28828 Putative uncharacterized protein n=1 Tax=Archaeo... 52 3e-05 UniRef50_A8U1E2 Putative uncharacterized protein n=1 Tax=alpha p... 52 5e-05 UniRef50_A0C051 Chromosome undetermined scaffold_14, whole genom... 51 6e-05 UniRef50_B8HNU4 von Willebrand factor type A n=4 Tax=Chroococcal... 51 8e-05 UniRef50_A2SP98 Putative uncharacterized protein n=1 Tax=Methyli... 51 1e-04 UniRef50_B8G546 von Willebrand factor type A n=6 Tax=Chloroflexi... 51 1e-04 UniRef50_D2V0V6 Predicted protein n=1 Tax=Naegleria gruberi RepI... 51 1e-04 UniRef50_D1WZ12 VWA containing CoxE family protein n=13 Tax=Stre... 50 2e-04 UniRef50_D2RGD5 von Willebrand factor type A n=1 Tax=Archaeoglob... 50 2e-04 UniRef50_A0CHZ1 Chromosome undetermined scaffold_185, whole geno... 50 2e-04 UniRef50_Q1D6F9 von Willebrand factor type A domain protein n=2 ... 50 2e-04 UniRef50_A4C730 Putative uncharacterized protein n=1 Tax=Pseudoa... 50 2e-04 UniRef50_A6BYV9 Putative uncharacterized protein n=1 Tax=Plancto... 49 3e-04 UniRef50_Q23KK4 von Willebrand factor type A domain containing p... 49 3e-04 UniRef50_D2S019 ATPase associated with various cellular activiti... 49 3e-04 UniRef50_O26551 Magnesium chelatase subunit ChlI n=1 Tax=Methano... 49 3e-04 UniRef50_Q9U7P4 Putative uncharacterized protein (Fragment) n=1 ... 49 3e-04 UniRef50_Q114A2 von Willebrand factor, type A n=3 Tax=Cyanobacte... 49 3e-04 UniRef50_B9SJS6 Protein binding protein, putative n=6 Tax=Magnol... 49 4e-04 UniRef50_Q23AA2 Putative uncharacterized protein n=1 Tax=Tetrahy... 49 4e-04 UniRef50_A0CDA1 Chromosome undetermined scaffold_17, whole genom... 49 4e-04 UniRef50_A0KNK1 Putative uncharacterized protein n=1 Tax=Aeromon... 48 5e-04 UniRef50_A3PUP3 von Willebrand factor, type A n=1 Tax=Mycobacter... 48 6e-04 UniRef50_B9MS10 von Willebrand factor type A n=2 Tax=Anaerocellu... 48 6e-04 UniRef50_Q97HZ9 Predicted metal-dependent peptidase n=1 Tax=Clos... 48 7e-04 UniRef50_B4X134 Vault protein inter-alpha-trypsin n=1 Tax=Alcani... 48 7e-04 UniRef50_UPI0001C38CFE von Willebrand factor, type A n=1 Tax=Art... 48 7e-04 UniRef50_C9LLI0 Magnesium-chelatase, subunit D/I family n=1 Tax=... 48 8e-04 UniRef50_A6GAI6 Putative uncharacterized protein n=1 Tax=Plesioc... 47 9e-04 UniRef50_B5JCH3 Putative uncharacterized protein (Fragment) n=1 ... 47 0.001 UniRef50_D2BAS2 von Willebrand factor type A domain protein n=12... 47 0.001 UniRef50_Q23FU3 von Willebrand factor type A domain containing p... 47 0.001 UniRef50_A5B5Z1 Putative uncharacterized protein n=2 Tax=Vitis v... 47 0.002 UniRef50_A0EHG0 Chromosome undetermined scaffold_97, whole genom... 47 0.002 UniRef50_A0LHW4 Vault protein inter-alpha-trypsin domain protein... 46 0.002 UniRef50_Q6ABM1 Magnesium-chelatase 67 kDa subunit n=4 Tax=Actin... 46 0.002 UniRef50_UPI0001555D4A PREDICTED: hypothetical protein, partial ... 46 0.003 UniRef50_A0E9B3 Chromosome undetermined scaffold_84, whole genom... 46 0.004 UniRef50_C7DHI4 von Willebrand factor type A n=1 Tax=Candidatus ... 46 0.004 UniRef50_A9B2Y1 VWA containing CoxE family protein n=4 Tax=Bacte... 45 0.004 UniRef50_UPI0000E460BF PREDICTED: similar to inter-alpha-trypsin... 45 0.005 UniRef50_A1AQS2 Protoporphyrin IX magnesium-chelatase n=1 Tax=Pe... 45 0.005 UniRef50_A0CK50 Chromosome undetermined scaffold_2, whole genome... 45 0.006 UniRef50_C4G1K3 Putative uncharacterized protein n=1 Tax=Abiotro... 44 0.008 UniRef50_C6VUB8 von Willebrand factor type A n=1 Tax=Dyadobacter... 44 0.009 UniRef50_C3NM85 von Willebrand factor type A n=14 Tax=Sulfolobac... 44 0.010 UniRef50_A9G8C3 Putative uncharacterized protein n=2 Tax=Sorangi... 44 0.014 UniRef50_A5FL88 Putative uncharacterized protein n=2 Tax=Flavoba... 43 0.020 UniRef50_B4S8S0 Magnesium chelatase ATPase subunit D n=3 Tax=Chl... 43 0.022 UniRef50_B1I3V2 Magnesium chelatase n=4 Tax=cellular organisms R... 42 0.034 UniRef50_C9LTL4 Magnesium-chelatase, subunit D/I family n=1 Tax=... 42 0.048 >UniRef50_Q1C7U5 UPF0229 protein YPA_1510 n=195 Tax=Proteobacteria RepID=Y1510_YERPA Length = 424 Score = 523 bits (1347), Expect = e-147, Method: Composition-based stats. Identities = 364/422 (86%), Positives = 395/422 (93%), Gaps = 1/422 (0%) Query: 1 MTWFIDRRLNGKNKSMVNRQRFLRRYKAQIKQSISEAINKRSVTDVDSGESVSIPTEDIS 60 M +FIDRRLNGKNKSMVNRQRFLRRYK+QIKQSI++AINKRSVTD++SGESVSIP +DI+ Sbjct: 1 MGYFIDRRLNGKNKSMVNRQRFLRRYKSQIKQSIADAINKRSVTDIESGESVSIPIDDIN 60 Query: 61 EPMFHQGRGGLRHRVHPGNDHFVQNDRIERPQGGGGGSGSGQGQASQDGEGQDEFVFQIS 120 EPMFHQG GGLRHRVHPGNDHF+ NDR++RPQGGGGG SGQG A +DGEG+DEFVFQIS Sbjct: 61 EPMFHQGNGGLRHRVHPGNDHFITNDRVDRPQGGGGGG-SGQGNAGKDGEGEDEFVFQIS 119 Query: 121 KDEYLDLLFEDLALPNLKQNQQRQLTEYKTHRAGYTANGVPANISVVRSLQNSLARRTAM 180 KDEYLDLLFEDLALPNLK+NQ +QL E+KTHRAGYT+NGVPANISVVRSLQNSLARRTAM Sbjct: 120 KDEYLDLLFEDLALPNLKRNQYKQLAEFKTHRAGYTSNGVPANISVVRSLQNSLARRTAM 179 Query: 181 TAGKRRELHALEENLAIISNSEPAQLLEEERLRKEIAELRAKIERVPFIDTFDLRYKNYE 240 TA KRREL LE L ++ NSEPAQLLEEERLRK I EL+ KI RVPFIDTFDLRYKNYE Sbjct: 180 TASKRRELRELEAALTVLENSEPAQLLEEERLRKAITELKQKIARVPFIDTFDLRYKNYE 239 Query: 241 KRPDPSSQAVMFCLMDVSGSMDQSTKDMAKRFYILLYLFLSRTYKNVEVVYIRHHTQAKE 300 +RP+PSSQAVMFCLMDVSGSMDQ+TKDMAKRFYILLYLFLSRTYKNV+VVYIRHHTQAKE Sbjct: 240 RRPEPSSQAVMFCLMDVSGSMDQATKDMAKRFYILLYLFLSRTYKNVDVVYIRHHTQAKE 299 Query: 301 VDEHEFFYSQETGGTIVSSALKLMDEVVKERYNPAQWNIYAAQASDGDNWADDSPLCHEI 360 VDE EFFYSQETGGTIVSSALKLMDEVV+ERYNPAQWNIYAAQASDGDNWADDSPLCHE+ Sbjct: 300 VDEQEFFYSQETGGTIVSSALKLMDEVVQERYNPAQWNIYAAQASDGDNWADDSPLCHEL 359 Query: 361 LAKKLLPVVRYYSYIEITRRAHQTLWREYEHLQSTFDNFAMQHIRDQDDIYPVFRELFHK 420 LAKK+LPVVRYYSYIEITRRAHQTLWREYE L+ FDNFA+QHIR+ +DIYPVFRELFHK Sbjct: 360 LAKKILPVVRYYSYIEITRRAHQTLWREYEDLEEKFDNFAIQHIREPEDIYPVFRELFHK 419 Query: 421 QN 422 Q Sbjct: 420 QT 421 >UniRef50_Q1QPM0 UPF0229 protein Nham_0975 n=24 Tax=Proteobacteria RepID=Y975_NITHX Length = 439 Score = 463 bits (1192), Expect = e-129, Method: Composition-based stats. Identities = 182/439 (41%), Positives = 274/439 (62%), Gaps = 18/439 (4%) Query: 1 MTWFIDRRLNGKNKSMVNRQRFLRRYKAQIKQSISEAINKRSVTDVDSGESVSIPTEDIS 60 M FIDRRLN K++S+ NRQRFLRR + ++K+SI + + ++D D ++VSIPT Sbjct: 1 MPIFIDRRLNPKDRSLGNRQRFLRRAREELKRSIRDRVRSGRISDADGEQAVSIPTRSTD 60 Query: 61 EPMFHQGRG-GLRHRVHPGNDHFVQNDRIERPQGGGGGSGSGQGQASQDGEGQDEFVFQI 119 EP F + G R V PGN HFV DR+ +P G G+ + S +D+F F + Sbjct: 61 EPRFEAAKDSGRREHVLPGNKHFVPGDRLRKPGHGAAGTPDPSMKDS-----EDDFRFVL 115 Query: 120 SKDEYLDLLFEDLALPNLKQNQQRQLTEYKTHRAGYTANGVPANISVVRSLQNSLARRTA 179 S++E LDL FEDL LP++ + +++ ++ RAG+ A G P NI+V R+++NS RR A Sbjct: 116 SREEVLDLFFEDLELPDMVKLSLKEILAFRPRRAGFAATGSPTNINVGRTMRNSYGRRIA 175 Query: 180 MTAGKRRELHALEENLAIISNSEPAQLLEEE--RLRKEIAELRAKIERVPFIDTFDLRYK 237 + KR E+ A+ + +A + + + + + L+ E+ L K + ++D D+R+ Sbjct: 176 LKRPKREEVDAIRQEIAELESGSQSPVARQRIAALQAEVERLERKRRLIAYVDPVDIRFN 235 Query: 238 NYEKRPDPSSQAVMFCLMDVSGSMDQSTKDMAKRFYILLYLFLSRTYKNVEVVYIRHHTQ 297 +E +P P+++AVMFCLMDVSGSM + KD+AKRF++LL+LFL Y E+V+I H + Sbjct: 236 RFEAQPIPNAKAVMFCLMDVSGSMGEREKDLAKRFFVLLHLFLKCRYDRTEIVFISHTHE 295 Query: 298 AKEVDEHEFFYSQETGGTIVSSALKLMDEVVKERYNPAQWNIYAAQASDGDNWADDSPLC 357 A+EV+E FFYS ++GGT+VS+AL+ M ++ ERY ++WNIYAAQASDGDN A DS C Sbjct: 296 AQEVNEETFFYSTQSGGTVVSTALEKMHRIIAERYPGSEWNIYAAQASDGDNAAADSHRC 355 Query: 358 HEILAKKLLPVVRYYSYIEI----------TRRAHQTLWREYEHLQSTFDNFAMQHIRDQ 407 +L ++++ + +YY+Y+EI T +LWR Y + + + NF M I D Sbjct: 356 ITLLDEEIMRLCQYYAYVEIIDERERHIFGTTENGTSLWRAYSSVNANWPNFQMTRIADA 415 Query: 408 DDIYPVFRELFHKQNATAK 426 DIYPVFR+LF +Q K Sbjct: 416 ADIYPVFRQLFTRQATAEK 434 >UniRef50_B6B8L1 Putative uncharacterized protein n=2 Tax=Rhodobacterales RepID=B6B8L1_9RHOB Length = 445 Score = 458 bits (1179), Expect = e-127, Method: Composition-based stats. Identities = 194/444 (43%), Positives = 280/444 (63%), Gaps = 21/444 (4%) Query: 1 MTWFIDRRLNGKNKSMVNRQRFLRRYKAQIKQSISEAINKRSVTD----VDSGESVSIPT 56 M FIDRR N K KS+ NRQRFLRR + IK+ + +++ +S+ D GE V+IP Sbjct: 1 MHHFIDRRANPKGKSLGNRQRFLRRARENIKERVDQSVRGKSIQSGSGVPDGGEKVTIPA 60 Query: 57 EDISEPMF-HQGRGGLRHRVHPGNDHFVQNDRIERPQGGGGGSGSGQGQASQDGEGQDEF 115 + EP F H +GGLR V PGN FV D I+RPQ GG+G G +AS++G+G+DEF Sbjct: 61 RGLKEPRFFHSSKGGLRRHVLPGNKDFVVGDTIKRPQ---GGTGQGGRKASEEGDGEDEF 117 Query: 116 VFQISKDEYLDLLFEDLALPNLKQNQQRQLTEYKTHRAGYTANGVPANISVVRSLQNSLA 175 F ++++EYL++LFE L LP+L + + T RAG T G P N+++VR+++NSL Sbjct: 118 SFTLTQEEYLEILFEGLELPDLVEKATVETETIGTRRAGLTTAGTPNNLNLVRTMRNSLG 177 Query: 176 RRTAMTAGKRRELHALEENLAIIS---NSEPAQLLEEERLRKEIAELRAKIERVPFIDTF 232 RR A+ + LEE +A + + P Q E LRK++ + K + V +ID Sbjct: 178 RRIALQRPTTKSQRDLEEQIAELEALDDRTPPQEDFLEALRKKLDGIIRKRKVVGYIDPL 237 Query: 233 DLRYKNYEKRPDPSSQAVMFCLMDVSGSMDQSTKDMAKRFYILLYLFLSRTYKNVEVVYI 292 DLRY + +S+AV+FCLMDVSGSM + KD+AKRF++LL+LFL R Y++ E+V++ Sbjct: 238 DLRYDTFVPEKIRNSRAVVFCLMDVSGSMQEREKDLAKRFFLLLHLFLERCYEHTELVFV 297 Query: 293 RHHTQAKEVDEHEFFYSQETGGTIVSSALKLMDEVVKERYNPAQWNIYAAQASDGDNWAD 352 RH A+EVDE FFY++ETGGTIVS+AL+ M E+++ERY P +WNIY AQASDG+N+ + Sbjct: 298 RHTHHAQEVDEETFFYARETGGTIVSTALEKMKEIIEERYPPDEWNIYGAQASDGENFGN 357 Query: 353 DSPLCHEILAKKLLPVVRYYSYIEI----------TRRAHQTLWREYEHLQSTFDNFAMQ 402 DS C ++L LLPV ++Y+Y+EI A + LW+ Y +++ +F MQ Sbjct: 358 DSARCKKLLLNDLLPVSQFYAYVEIVDEAAEMLLNNPEAGEDLWQNYREVKAQAQHFEMQ 417 Query: 403 HIRDQDDIYPVFRELFHKQNATAK 426 + IYP+FRE F + A+ Sbjct: 418 RVSQPGHIYPIFREFFLPKVKGAQ 441 >UniRef50_Q1QZ69 UPF0229 protein Csal_0882 n=164 Tax=Proteobacteria RepID=Y882_CHRSD Length = 426 Score = 456 bits (1173), Expect = e-127, Method: Composition-based stats. Identities = 238/427 (55%), Positives = 310/427 (72%), Gaps = 4/427 (0%) Query: 1 MTWFIDRRLNGKNKSMVNRQRFLRRYKAQIKQSISEAINKRSVTDVDSGESVSIPTEDIS 60 MT+FIDRR N KNKS VNRQRFL+RY++ IK+++ EA+N+RS+TD++ GE +SIP +DIS Sbjct: 1 MTYFIDRRANAKNKSAVNRQRFLQRYRSHIKRAVEEAVNRRSITDMERGEKISIPAKDIS 60 Query: 61 EPMFHQGRGGLRHRVHPGNDHFVQNDRIERPQGGGGGSGSGQGQASQDGEGQDEFVFQIS 120 EP+F G GG R V PGN FV+ DR+ R GG G GSG+G AS GEG DEF F +S Sbjct: 61 EPVFQHGPGGARTIVSPGNKEFVEGDRLRR-PGGEGRGGSGEGSASNQGEGMDEFAFSLS 119 Query: 121 KDEYLDLLFEDLALPNLKQNQQRQLTEYKTHRAGYTANGVPANISVVRSLQNSLARRTAM 180 ++E+LD +F+ LALP+L++ Q R L E + RAG T +GVP+ I++VRS++ + ARR M Sbjct: 120 REEFLDFVFDGLALPHLERKQLRDLDEVRPVRAGVTRDGVPSRINIVRSMREAQARRIGM 179 Query: 181 TAGKRRELHALEENLAIISNSEP--AQLLEEERLRKEIAELRAKIERVPFIDTFDLRYKN 238 A +R L EE L +P L+ EI L ++E VPFIDT+DLRY N Sbjct: 180 RAPIKRALREAEEALESEERKDPVLRNPARIGELKAEIERLEKRLEAVPFIDTYDLRYNN 239 Query: 239 YEKRPDPSSQAVMFCLMDVSGSMDQSTKDMAKRFYILLYLFLSRTYKNVEVVYIRHHTQA 298 +P PS++AVMFC+MDVSGSM Q KD+AKRF++LLYLFL R Y+ VE+V+IRHHT A Sbjct: 240 LIDQPQPSNKAVMFCVMDVSGSMTQGHKDIAKRFFLLLYLFLERNYEKVELVFIRHHTAA 299 Query: 299 KEVDEHEFFYSQETGGTIVSSALKLMDEVVKERYNPAQWNIYAAQASDGDNWADDSPLCH 358 KEVDE EFFYS+ETGGTIVSSAL L+DE++ +RY+PAQWN+Y AQASDGDNW DDS C Sbjct: 300 KEVDEEEFFYSRETGGTIVSSALTLVDEIIAKRYSPAQWNLYVAQASDGDNWDDDSLTCR 359 Query: 359 EILAKKLLPVVRYYSYIEITRRAHQTLWREYEHLQSTFD-NFAMQHIRDQDDIYPVFREL 417 ++L L+ ++YY+Y+EIT +HQ LW EYE +Q+ FAMQ I + DIYPVFR+L Sbjct: 360 DLLMTSLMAKLQYYTYVEITPHSHQALWEEYERVQAAHPSRFAMQQIVEPGDIYPVFRKL 419 Query: 418 FHKQNAT 424 F K+ A+ Sbjct: 420 FRKRVAS 426 >UniRef50_P59348 UPF0229 protein bll6755 n=25 Tax=Alphaproteobacteria RepID=Y6755_BRAJA Length = 427 Score = 455 bits (1170), Expect = e-126, Method: Composition-based stats. Identities = 186/431 (43%), Positives = 276/431 (64%), Gaps = 18/431 (4%) Query: 3 WFIDRRLNGKNKSMVNRQRFLRRYKAQIKQSISEAINKRSVTDVDSGESVSIPTEDISEP 62 IDRRLN KS+ NRQRFLRR K+ ++ ++ + +R + DV G V+IP + + EP Sbjct: 4 HIIDRRLNPGGKSLENRQRFLRRAKSLVQGAVKKTSQERDIKDVLEGGEVTIPLDGMHEP 63 Query: 63 MFHQGRGGLRHRVHPGNDHFVQNDRIERPQGGGGGSGSGQGQASQDGEGQDEFVFQISKD 122 F + GG R V PGN FV+ D ++R G GS + +G+ +D F F +S+D Sbjct: 64 RFRR-EGGTRDMVLPGNKKFVEGDYLQR-----SGQGSAKDSGPGEGDSEDAFRFVLSRD 117 Query: 123 EYLDLLFEDLALPNLKQNQQRQLTEYKTHRAGYTANGVPANISVVRSLQNSLARRTAMTA 182 E++DL +DL LP+L + + Q RAGYT +G PANISV R+++ +LARR A+ Sbjct: 118 EFVDLFLDDLELPDLAKRKIAQTESEGIQRAGYTTSGSPANISVSRTVKLALARRIALKR 177 Query: 183 GKRRELHALEENLAIISNSEPAQLLEEERLRKEIAELRAKIERVPFIDTFDLRYKNYEKR 242 ++ E+ LE +A ++ + E L E+ +L AK +R+PFID D+RY+ +E Sbjct: 178 PRKDEIEELEAAIAACTDED-----ERVVLLAELEKLMAKTKRIPFIDPLDIRYRRFETV 232 Query: 243 PDPSSQAVMFCLMDVSGSMDQSTKDMAKRFYILLYLFLSRTYKNVEVVYIRHHTQAKEVD 302 P P +QAVMFCLMDVSGSM + KD+AKRFY+LLY+FL R YK+VE+V+IRH +A+EVD Sbjct: 233 PKPVAQAVMFCLMDVSGSMSEHMKDLAKRFYMLLYVFLKRRYKHVEIVFIRHTDRAEEVD 292 Query: 303 EHEFFYSQETGGTIVSSALKLMDEVVKERYNPAQWNIYAAQASDGDNWADDSPLCHEILA 362 E FFY +GGT+VSSAL+ M ++V+ER+NP+ WNIYAAQASDGDN D L +L Sbjct: 293 EQTFFYGPASGGTLVSSALQAMHDIVRERFNPSDWNIYAAQASDGDNSYSDGELTGLLLT 352 Query: 363 KKLLPVVRYYSYIEITRRAH-------QTLWREYEHLQSTFDNFAMQHIRDQDDIYPVFR 415 K+LPV ++++Y+E+ +LW YE L+++ +M+ + ++ +I+PVF Sbjct: 353 DKILPVCQFFAYLEVGESGGSAFDLSDSSLWTLYERLRNSGAPLSMRKVSERSEIFPVFH 412 Query: 416 ELFHKQNATAK 426 +LF ++ + + Sbjct: 413 DLFQRRETSQE 423 >UniRef50_A9I2A9 Putative uncharacterized protein n=6 Tax=Burkholderiales RepID=A9I2A9_BORPD Length = 419 Score = 449 bits (1155), Expect = e-125, Method: Composition-based stats. Identities = 205/426 (48%), Positives = 281/426 (65%), Gaps = 10/426 (2%) Query: 1 MTWFIDRRLNGKNKSMVNRQRFLRRYKAQIKQSISEAINKRSVTDVDSGESVSIPTEDIS 60 M IDRRLNG+NKS VNR+RFLRRYK QI++++ + + +RS+ D+D G +++P DIS Sbjct: 1 MNSLIDRRLNGRNKSAVNRERFLRRYKDQIRRAVQDLVRERSIEDMDQGGEINLPARDIS 60 Query: 61 EPMFHQGRGGLRHRVHPGNDHFVQNDRIERPQGGGGGSGSGQGQASQDGEGQDEFVFQIS 120 EP F G+GG R VHPGN F + D RP G G GS G+ D+F F +S Sbjct: 61 EPHFRHGQGGDRELVHPGNREFAKGDTFPRPSGSDGEGGSEPGEGES----VDQFTFSLS 116 Query: 121 KDEYLDLLFEDLALPNLKQNQQRQLTEYKTHRAGYTANGVPANISVVRSLQNSLARRTAM 180 + E+L+L FEDL LP+L + Q +T+ K RAGYT G P+ +SV R+L+ SL+RR A+ Sbjct: 117 RAEFLNLFFEDLELPHLIRTQLGDVTQKKWQRAGYTTTGSPSLLSVSRTLKASLSRRVAL 176 Query: 181 TAGKRRELHALEENLAIISNSEPAQLLEEERLRKEIAELRAKIERVPFIDTFDLRYKNYE 240 R EL A + L + A E + LR+E+ + ++ R+PF+D DLRY+N Sbjct: 177 GVAARAELEAAQAKLDAAIAAG-APQAEIDALRQEVEDCANRLARLPFLDDLDLRYRNRV 235 Query: 241 KRPDPSSQAVMFCLMDVSGSMDQSTKDMAKRFYILLYLFLSRTYKNVEVVYIRHHTQAKE 300 P ++AVMFCLMDVSGSMD+ KD+AKRF+ LLYLFLSR Y++V+VV+IRH A+E Sbjct: 236 SVAMPMARAVMFCLMDVSGSMDEGKKDLAKRFFTLLYLFLSRKYEHVDVVFIRHTDNAEE 295 Query: 301 VDEHEFFYSQETGGTIVSSALKLMDEVVKERYNPAQWNIYAAQASDGDNWADDSPLCHEI 360 VDE FFY ++GGTIV SAL+LM E+V++RY P+ WN+YAAQASDGD++ D+ Sbjct: 296 VDEQTFFYDPKSGGTIVLSALELMHEIVQQRYPPSAWNVYAAQASDGDSFGADAGKSARF 355 Query: 361 LAKKLLPVVRYYSYIEITRR---AHQTLWREYEHLQSTFDNFAMQHIRDQDDIYPVFREL 417 LA+ LLP RY++YIE+ +LW EYE Q T +F M+ I ++ +IYPVF +L Sbjct: 356 LAENLLPATRYFAYIEVPDSQEARKSSLWAEYE--QETAPHFVMRRICERGEIYPVFHDL 413 Query: 418 FHKQNA 423 F K+ A Sbjct: 414 FKKETA 419 >UniRef50_B7H9V9 UPF0229 protein BCB4264_A0587 n=93 Tax=Bacteria RepID=Y587_BACC4 Length = 391 Score = 420 bits (1079), Expect = e-116, Method: Composition-based stats. Identities = 105/416 (25%), Positives = 179/416 (43%), Gaps = 47/416 (11%) Query: 14 KSMVNRQRFLRRYKAQIKQSISEAINKRSVTDVDSGESVSIPTEDISEPMFHQGRGGLRH 73 K ++QR + + IK ++ + + + S+ + + V IP + E +H Sbjct: 21 KGYDDQQRHQEKVQEAIKNNLPDLVTEESIVMSNGKDVVKIPIRSLDEYKIRYNYDKNKH 80 Query: 74 RVHPGNDHFVQNDRIERPQGGGGGS-GSGQGQASQDGEGQDEFVFQISKDEYLDLLFEDL 132 V GN D + R GG G G+GQ + D G+D + ++S E F +L Sbjct: 81 -VGQGNGDSKVGDVVARDGSGGQKQKGPGKGQGAGDAAGEDYYEAEVSILELEQAFFREL 139 Query: 133 ALPNLKQNQQRQLTEYKTHRAGYTANGVPANISVVRSLQNSLARRTAMTAGKRRELHALE 192 LPNLK+ + + G+ NI R++ ++ R Sbjct: 140 ELPNLKRKEMDENRIEHVEFNDIRKTGLWGNIDKKRTMISAYKRNAMSGKAS-------- 191 Query: 193 ENLAIISNSEPAQLLEEERLRKEIAELRAKIERVPFIDTFDLRYKNYEKRPDPSSQAVMF 252 I DL+++ + + P S+AV+ Sbjct: 192 ---------------------------------FHPIHQEDLKFRTWNEVLKPDSKAVVL 218 Query: 253 CLMDVSGSMDQSTKDMAKRFYILLYLFLSRTYKNVEVVYIRHHTQAKEVDEHEFFYSQET 312 +MD SGSM K MA+ F+ + FL Y+ V++ +I HHT+AK V E EFF E+ Sbjct: 219 AMMDTSGSMGIWEKYMARSFFFWMTRFLRTKYETVDIEFIAHHTEAKVVTEEEFFSKGES 278 Query: 313 GGTIVSSALKLMDEVVKERYNPAQWNIYAAQASDGDNWADDSPLCHEILAKKLLPVVRYY 372 GGTI SS K E++ +Y+P ++NIY SDGDN D+ C + L ++L+ + Sbjct: 279 GGTICSSVYKKALELIDNKYSPDRYNIYPFHFSDGDNLTSDNARCVK-LVEELMKKCNMF 337 Query: 373 SYIEITRRA-HQTLWREYEHLQSTFDNFAMQHIRDQDDIYPVFRELFHKQNATAKG 427 Y E+ + H TL Y++++ DNF ++ + D++ + F +++ Sbjct: 338 GYGEVNQYNRHSTLMSAYKNIK--DDNFRYYILKQKADVFHAMKSFFREESGEKMA 391 >UniRef50_A7Z2R4 UPF0229 protein RBAM_009260 n=29 Tax=Firmicutes RepID=Y926_BACA2 Length = 394 Score = 402 bits (1032), Expect = e-110, Method: Composition-based stats. Identities = 103/412 (25%), Positives = 181/412 (43%), Gaps = 47/412 (11%) Query: 14 KSMVNRQRFLRRYKAQIKQSISEAINKRSVTDVDSGESVSIPTEDISEPMFHQGRGGLRH 73 K ++QR ++ + IK ++ + + + S+ + + V IP + E +H Sbjct: 21 KGFDDQQRHQKKVQEAIKNNLPDLVTEESIIMSNGKDVVKIPIRSLDEYKIRYNYDKNKH 80 Query: 74 RVHPGNDHFVQNDRIERPQGGGGGSGSGQGQASQDGEGQDEFVFQISKDEYLDLLFEDLA 133 V G+ D + R G G+G+GQ + D G+D + ++S + + LF++L Sbjct: 81 -VGQGDGDSEVGDVVAR-DGADKKQGAGKGQGAGDQAGEDYYEAEVSLMDLEEALFQELE 138 Query: 134 LPNLKQNQQRQLTEYKTHRAGYTANGVPANISVVRSLQNSLARRTAMTAGKRRELHALEE 193 LPNL+Q ++ + G+ NI R++ ++ R Sbjct: 139 LPNLQQKERDNIVHTDIEFNDIRKTGLTGNIDKKRTMLSAYKRNAMTGKPS--------- 189 Query: 194 NLAIISNSEPAQLLEEERLRKEIAELRAKIERVPFIDTFDLRYKNYEKRPDPSSQAVMFC 253 I DL+YK + P S+AV+ Sbjct: 190 --------------------------------FYPIYPEDLKYKTWNDVTKPESKAVVLA 217 Query: 254 LMDVSGSMDQSTKDMAKRFYILLYLFLSRTYKNVEVVYIRHHTQAKEVDEHEFFYSQETG 313 +MD SGSM K MA+ F+ + FL Y+ V++ +I HHT+AK V E +FF E+G Sbjct: 218 MMDTSGSMGVWEKYMARSFFFWMTRFLRTKYETVDIEFIAHHTEAKVVSEEDFFSKGESG 277 Query: 314 GTIVSSALKLMDEVVKERYNPAQWNIYAAQASDGDNWADDSPLCHEILAKKLLPVVRYYS 373 GTI SS + E++ E+Y+PA++NIY SDGDN D+ C + L ++ + Sbjct: 278 GTICSSVYRKSLELIDEKYDPARYNIYPFHFSDGDNLTSDNARCVK-LVNDIMKKSNLFC 336 Query: 374 YIEITRRA-HQTLWREYEHLQSTFDNFAMQHIRDQDDIYPVFRELFHKQNAT 424 Y E+ + H TL Y++++ D F ++ + D++ + F + + Sbjct: 337 YGEVNQYNRHSTLMSAYKNVK--DDKFKYYILKQKSDVFQALKSFFKNEESG 386 >UniRef50_B9DZE4 UPF0229 protein CKR_0568 n=26 Tax=Clostridiales RepID=Y568_CLOK1 Length = 403 Score = 397 bits (1021), Expect = e-109, Method: Composition-based stats. Identities = 125/413 (30%), Positives = 208/413 (50%), Gaps = 25/413 (6%) Query: 13 NKSMVNRQRFLRRYKAQIKQSISEAINKRSVTDVDSGESVSIPTEDISEPMFHQGRGGLR 72 ++S+ +R+R + + IK ++++ I++ S+ + V IP + I E F G Sbjct: 14 DRSLEDRRRHRQLVEKSIKDNLADIISEESIIGQSKNKKVKIPIKGIKEYQFIYGDNSSG 73 Query: 73 HRVHPGNDHFVQNDRIERPQGGGGGSGSGQGQASQDGEGQDEFVFQISKDEYLDLLFEDL 132 + DRI + G G+ Q + + EG+D + +++ ++ LD L EDL Sbjct: 74 VGSGD--GSQKKGDRIGKAIKDRDGKGN---QGAGNQEGEDMYEIEVTIEDVLDYLMEDL 128 Query: 133 ALPNLKQNQQRQL-TEYKTHRAGYTANGVPANISVVRSLQNSLARRTAMTAGKRRELHAL 191 LP + + + Q+ + ++GY G+ ++ R++ L R+ G +R L + Sbjct: 129 ELPLMDKKKFSQILSNNSPKKSGYQRKGINPRLAKKRTVVEKLKRQQ----GTKRALREI 184 Query: 192 EENLAIISNSEPAQLLEEERLRKEIAELRAKIERVPFIDTFDLRYKNYEKRPDPSSQAVM 251 L S+P L E K R PF DLRY +++P A + Sbjct: 185 HGEL----ESDPKNKLPENTTIKS---------RFPFKQD-DLRYFRVKRKPKLELNAAI 230 Query: 252 FCLMDVSGSMDQSTKDMAKRFYILLYLFLSRTYKNVEVVYIRHHTQAKEVDEHEFFYSQE 311 C+MD SGSMD + K +A+ F+ +LY F+ Y NVEV +I H T AK V E+EFF+ E Sbjct: 231 ICVMDTSGSMDSTRKFLARSFFFVLYRFIKMKYNNVEVKFISHSTSAKVVTENEFFHKVE 290 Query: 312 TGGTIVSSALKLMDEVVKERYNPAQWNIYAAQASDGDNWADDSPLCHEILAKKLLPVVRY 371 +GGT +SS LK EV++E YNPA WN+Y SDGDNW++D+ L + AK L V Sbjct: 291 SGGTYISSGLKKALEVIEENYNPAYWNVYTFYVSDGDNWSEDNSLALK-CAKDLCKVCNL 349 Query: 372 YSYIEITRRAHQTLWREYEHLQSTFDNFAMQHIRDQDDIYPVFRELFHKQNAT 424 +SY EI + + + + T +NF + I ++ D++ +++ +K+ Sbjct: 350 FSYAEIIPSPYGSSIKHIFQNKITDNNFTVVTIHEKQDLWKSLKKILNKELEE 402 >UniRef50_B9L510 Sporulation protein YhbH n=1 Tax=Thermomicrobium roseum DSM 5159 RepID=B9L510_THERP Length = 389 Score = 390 bits (1001), Expect = e-107, Method: Composition-based stats. Identities = 114/417 (27%), Positives = 182/417 (43%), Gaps = 51/417 (12%) Query: 14 KSMVNRQRFLRRYKAQIKQSISEAINKRSVTDVDSGESVSIPTEDISEPMFHQGRGGLRH 73 K +++ R + + K IK+++++ ++++S+ D V +P + E F R Sbjct: 19 KGAIDQARHMEKVKEAIKRNLADIVSEQSLITSDGKRVVRVPIRVLEEYRFRFDPDSGRQ 78 Query: 74 RVHPGNDHFVQNDRIERPQGGGGGSGSGQGQASQDGEGQDEFVFQISKDEYLDLLFEDLA 133 V G+ + GGG SG G + D G D + +++ +E +L+FEDL Sbjct: 79 -VGQGSGG---THVGDVVGRVGGGQRSGDGPQAGDQPGIDYYEAELTIEELSELIFEDLE 134 Query: 134 LPNLKQNQQRQLTEYKTHRAGYTANGVPANISVVRSLQNSLARRTAMTAGKRRELHALEE 193 LPNL++ + R+L +G AN+ R+L+ +L R Sbjct: 135 LPNLEEKRLRELESEAVRFTEIRRHGPFANLDKRRTLRENLRRNA--------------- 179 Query: 194 NLAIISNSEPAQLLEEERLRKEIAELRAKIERVPFIDTFDLRYKNYEKRPDPSSQAVMFC 253 R+ DLR+K +E+ S AV+ Sbjct: 180 --------------------------WRGRARIGDFANEDLRFKTWERDVKRESNAVVIA 213 Query: 254 LMDVSGSMDQSTKDMAKRFYILLYLFLSRTYKNVEVVYIRHHTQAKEVDEHEFFYSQETG 313 +MDVSGSM K +++ FY + FL Y VE+ +I HH +A+EV E EFF E+G Sbjct: 214 MMDVSGSMGTFEKYVSRAFYYWMVRFLRTKYDRVEIRFIAHHAEAREVSEEEFFSRGESG 273 Query: 314 GTIVSSALKLMDEVVKERYNPAQWNIYAAQASDGDNWADDSPLCHEILAKKLLPVVRYYS 373 GT S+A +L ++++E Y P WNIY SDGDNW D+ C E LA++LL + Sbjct: 274 GTRASTAYELALQLIRESYPPDSWNIYPFHFSDGDNWPSDNERCRE-LAEELLRCANLFG 332 Query: 374 YIEITR---RAHQTLWREYEHLQSTFDNFAMQHIRDQDDIYPVFRELFHKQNATAKG 427 Y EI + TL + + S I ++ D+Y R F + Sbjct: 333 YGEIRQGRYTYQSTLMHTLQRIGS--PKLVTVTITEKADVYQALRRFFGPEVGQEVA 387 >UniRef50_Q3J885 Putative uncharacterized protein n=2 Tax=Nitrosococcus oceani RepID=Q3J885_NITOC Length = 394 Score = 389 bits (999), Expect = e-107, Method: Composition-based stats. Identities = 119/419 (28%), Positives = 201/419 (47%), Gaps = 49/419 (11%) Query: 13 NKSMVNRQRFLRRYKAQIKQSISEAINKRSVTDVDSGESVSIPTEDISEPMFHQGRGGLR 72 ++S +R R ++ + I+ ++++ + + S+ + +P I E F G+ Sbjct: 17 DRSAKDRLRHRQKVRKAIRDNVADIVAEESIIGQSRDRIIKVPIRGIREYRFVYGQNTPG 76 Query: 73 HRVHPGNDHFVQNDRIERPQGGGGGSGSGQGQASQDGEGQDEFVFQISKDEYLDLLFEDL 132 G+ Q G G G + D G D + +I+ +E ++++ EDL Sbjct: 77 VGTGQGDSEPGQTV-------GQVPQGDGGPGHAGDRPGMDYYETEITLEELIEIMLEDL 129 Query: 133 ALPNLKQNQQRQL-TEYKTHRAGYTANGVPANISVVRSLQNSLARRTAMTAGKRRELHAL 191 LP++++ + R++ +E + R G+ GV ++ R+ ++ + RR A Sbjct: 130 ELPDMERKRFREVLSERTSKRKGFRRVGVRVHMDKRRTAKSRIRRRLA------------ 177 Query: 192 EENLAIISNSEPAQLLEEERLRKEIAELRAKIERVPFIDTFDLRYKNYEKRPDPSSQAVM 251 + AE R PF D+RY + P S AV+ Sbjct: 178 ---------------------SDKDAEDNETKHRFPFHRD-DMRYHRLREDMRPQSNAVV 215 Query: 252 FCLMDVSGSMDQSTKDMAKRFYILLYLFLSRTYKNVEVVYIRHHTQAKEVDEHEFFYSQE 311 FC+MD SGSMD K +A+ F+ LLY F+ Y NV+VV+I HHT+A+EV E EFF+ E Sbjct: 216 FCIMDTSGSMDTLKKYLARSFFFLLYQFVRSRYVNVDVVFIAHHTKAREVTEEEFFHKGE 275 Query: 312 TGGTIVSSALKLMDEVVKERYNPAQWNIYAAQASDGDNWADDSPLCHEILAKKLLPVVRY 371 GGT +SS E+++ RY+P+ WNIYA SDGDN+ D+ + A+ L V Sbjct: 276 AGGTFISSGYSKALEIIQNRYHPSLWNIYAFHCSDGDNFDSDNAATLKA-AEVLCQVCNL 334 Query: 372 YSYIEITRR----AHQTLWREYEHLQSTFDNFAMQHIRDQDDIYPVFRELFHKQNATAK 426 + Y EI R T+ + ++ DNF I+ ++DI+P FR+L +++ ++K Sbjct: 335 FGYGEIKPRPSGFYEGTMLDLFRSVR--MDNFQSVLIQRKEDIWPSFRQLLSRESESSK 391 >UniRef50_B5EPS6 Putative uncharacterized protein n=2 Tax=Acidithiobacillus ferrooxidans RepID=B5EPS6_ACIF5 Length = 434 Score = 375 bits (962), Expect = e-102, Method: Composition-based stats. Identities = 166/437 (37%), Positives = 258/437 (59%), Gaps = 17/437 (3%) Query: 1 MTWFIDRRLNGKNKSMVNRQRFLRRYKAQIKQSISEAINKRSVTD-VDSGESVSIPTEDI 59 M+ IDRR +G +S N+ R RR +A++K ++ + S+ D ++ + VSIPT D+ Sbjct: 1 MSMIIDRRSSG-TRSTANQDRLQRRVRARLKVAVEKMARSGSIEDLANTDQPVSIPTRDL 59 Query: 60 SEPMFHQGRGGLR-HRVHPGNDHFVQNDRIERPQGGGGGSGSGQGQASQDGEGQDEFVFQ 118 EP F + RV PGN + + D I +P+GGG G + DG G+DE Sbjct: 60 HEPSFRRDLSDTSWERVLPGNKEYQRGDEINKPEGGGSGK---GRAGAPDGLGEDEVAIV 116 Query: 119 ISKDEYLDLLFEDLALPNLKQNQQRQLTEYKTHRAGYTANGVPANISVVRSLQNSLARRT 178 +S DE+LDLLF+ LALPNL++ Q + + RAG+ +G P+ + V R+++ + ARR Sbjct: 117 LSADEFLDLLFDGLALPNLRKMAQGDIQADQWRRAGFIKDGSPSRMHVGRTMRAARARRL 176 Query: 179 AMTAGKRRELHALEENLAIISNSEPAQLLEE----------ERLRKEIAELRAKIERVPF 228 A+ AGKRREL L + ++ +L ++ L +I L KI+ +PF Sbjct: 177 ALRAGKRRELQDLLDARNVLQEEIQGRLAQKQDVSVEQERLSELNHQIDALERKIKAIPF 236 Query: 229 IDTFDLRYKNYEKRPDPSSQAVMFCLMDVSGSMDQSTKDMAKRFYILLYLFLSRTYKNVE 288 ID DLR+ + +++P P + AVMFC+MDVSGSM + KD+AKRF++LLYLFL R Y+ V+ Sbjct: 237 IDEADLRFAHIDQQPHPITNAVMFCVMDVSGSMGEKEKDLAKRFFLLLYLFLHRHYQAVQ 296 Query: 289 VVYIRHHTQAKEVDEHEFFYSQETGGTIVSSALKLMDEVVKERYNPAQWNIYAAQASDGD 348 +V+I+HH+ A E E FF ++E GGT+VS A+ L +E++++R+ P +WN+Y AQ SDGD Sbjct: 297 MVFIKHHSTASECSEQAFFGAREGGGTLVSPAIILSEEIMRQRFPPDRWNVYLAQVSDGD 356 Query: 349 NWADDSPLCHEILAKKLLPVVRYYSYIEITRRAHQTLWREYEHLQSTFDNFAMQHIRDQD 408 N+ D+ + E L L + + Y+E+ R + L R Y+ + F +++ Sbjct: 357 NYFADNAVVEEHLLNLLPRLRNLF-YLEVNRDSESDLLRLYDAIAQDFPELVTARASERE 415 Query: 409 DIYPVFRELFHKQNATA 425 DIYP+FR LF + + Sbjct: 416 DIYPMFRTLFATEETPS 432 >UniRef50_A9FJ88 Uncharacterized conserved protein involved in stress response n=21 Tax=Bacteria RepID=A9FJ88_SORC5 Length = 405 Score = 372 bits (955), Expect = e-101, Method: Composition-based stats. Identities = 101/405 (24%), Positives = 182/405 (44%), Gaps = 47/405 (11%) Query: 18 NRQRFLRRYKAQIKQSISEAINKRSVTDVDSGESVSIPTEDISEPMFHQGRGGLRHRVHP 77 + RF + + +I++++ + I++ + + VSIP I P F G R V Sbjct: 44 DHGRFRQIVRGRIRENLRKYISQGELIGRKGKDLVSIPIPQIDIPRFRFG-DKQRGGVGQ 102 Query: 78 GNDHFVQNDRIERPQGGGGGSGSGQGQASQDGEGQDEFVFQISKDEYLDLLFEDLALPNL 137 G+ + P GG GQGQ + GEG ++ +E +L E+L LP++ Sbjct: 103 GDGNPGD------PVGGSDDKQPGQGQ-AGSGEGDHLLEVDVTLEELAGILGEELELPDI 155 Query: 138 KQNQQRQLTEYKTHRAGYTANGVPANISVVRSLQNSLARRTAMTAGKRRELHALEENLAI 197 + + +++ +G G + R+ + +L R Sbjct: 156 QDKGKSKISNAHDRYSGIRRVGPESLRHFKRTYREALKR--------------------- 194 Query: 198 ISNSEPAQLLEEERLRKEIAELRAKIERVPFIDTFDLRYKNYEKRPDPSSQAVMFCLMDV 257 ++ R + + D RY++++ +P + AV+ +MDV Sbjct: 195 --------MISSGTFRPSAPVVVPVPD--------DKRYRSWKTITEPVANAVIIYMMDV 238 Query: 258 SGSMDQSTKDMAKRFYILLYLFLSRTYKNVEVVYIRHHTQAKEVDEHEFFYSQETGGTIV 317 SGSM K++ + + +L+R YK +E +I H A+EVD FF+++E+GGT++ Sbjct: 239 SGSMGDEQKEIVRIESFWIDAWLTRQYKGLESRFIIHDAIAREVDRDTFFHTRESGGTMI 298 Query: 318 SSALKLMDEVVKERYNPAQWNIYAAQASDGDNWA-DDSPLCHEILAKKLLPVVRYYSYIE 376 SSA KL +++ Y P +WNIY SDGDNW+ DD+ C ++L ++LP V ++Y + Sbjct: 299 SSAYKLCSQIIDNDYPPDEWNIYPFHFSDGDNWSMDDTLSCVDVLKTQILPRVNMFAYGQ 358 Query: 377 ITRRAHQ-TLWREYEHLQSTFDNFAMQHIRDQDDIYPVFRELFHK 420 + ++ + S D + IRD+D I ++ K Sbjct: 359 VESPYGSGQFIKDLKEHFSQDDRVVVSEIRDKDAIVGSIKDFLGK 403 >UniRef50_Q896G6 Conserved protein n=1 Tax=Clostridium tetani RepID=Q896G6_CLOTE Length = 386 Score = 369 bits (948), Expect = e-100, Method: Composition-based stats. Identities = 109/416 (26%), Positives = 192/416 (46%), Gaps = 43/416 (10%) Query: 13 NKSMVNRQRFLRRYKAQIKQSISEAINKRSVTDVDSGESVSIPTEDISEPMFHQGRGGLR 72 N++ +R+R + IK ++ + + + ++ + +P + + E F + Sbjct: 13 NRAGEDRKRHRELVEKSIKDNLVDVLLQEDISIQKENIKIKVPIKGVKEYEFTYSQNRSF 72 Query: 73 HRVHPGNDHFVQNDRIERPQGGGGGSGSGQGQASQDGEGQDEFVFQISKDEYLDLLFEDL 132 V GN+ Q ++R S G G + + EG+D F +++ +E +F+DL Sbjct: 73 VVVGKGNEKKGQKIALKR------ASEQGGGAGAGEIEGEDIFETEVTIEEIFQSIFDDL 126 Query: 133 ALPNLKQNQQRQL-TEYKTHRAGYTANGVPANISVVRSLQNSLARRTAMTAGKRRELHAL 191 LPNLK+ + ++ + + G+ +G+ ++ R+ + R+ A R++ Sbjct: 127 ELPNLKKKKFNKILNDSFKRKKGFKKHGISPRLAKRRTAIEKVKRKQATQKVLGRDIA-- 184 Query: 192 EENLAIISNSEPAQLLEEERLRKEIAELRAKIERVPFIDTFDLRYKNYEKRPDPSSQAVM 251 ER PF DLRY + + AV+ Sbjct: 185 --------------------------------ERFPFKKD-DLRYSRVKLNKNKEYNAVI 211 Query: 252 FCLMDVSGSMDQSTKDMAKRFYILLYLFLSRTYKNVEVVYIRHHTQAKEVDEHEFFYSQE 311 C+MD S SMDQ K MA+ F+ ++Y F+ Y+ V++ +I H T AKEV E EFF+ E Sbjct: 212 ICIMDTSASMDQMKKYMARSFFFMIYKFIKMKYEEVDICFISHSTTAKEVTEEEFFHKVE 271 Query: 312 TGGTIVSSALKLMDEVVKERYNPAQWNIYAAQASDGDNWADDSPLCHEILAKKLLPVVRY 371 +GGT +SS K E++ RYNP +NIY ASDGDNW +D+ + +AK+L V Sbjct: 272 SGGTYISSGYKKALEIINTRYNPQIYNIYTFHASDGDNWNEDNDRAVK-VAKELSNVCNL 330 Query: 372 YSYIEITRRAHQTLWREYEHLQSTFDNFAMQHIRDQDDIYPVFRELFHKQNATAKG 427 + YIEI + R + +NF I ++D++ +++ ++ +G Sbjct: 331 FGYIEIMGYGYSNGIRNKYLKEIEKENFIPLIIEKKEDLWRALKDILKQEMREERG 386 >UniRef50_UPI00016C05A6 hypothetical protein Epulo_00085 n=1 Tax=Epulopiscium sp. 'N.t. morphotype B' RepID=UPI00016C05A6 Length = 368 Score = 363 bits (931), Expect = 8e-99, Method: Composition-based stats. Identities = 119/401 (29%), Positives = 187/401 (46%), Gaps = 63/401 (15%) Query: 18 NRQRFLRRYKAQIKQSISEAINKRSVTDVDSGESVSIPTEDISEPMFHQGRGGLRHRVHP 77 + +R + + I+++I I S+T+ +G V + +++ E F G V Sbjct: 18 DAKRHRKLVEKSIRENIDMLIVGESITETAAGNIVKVRIQELPEYRFKFGSS--TEYVAI 75 Query: 78 GNDHFVQNDRIERPQGGGGGSGSGQGQASQDGEGQDEFVFQISKDEYLDLLFEDLALPNL 137 G+ V N++ + + +AS + G D + +I D+ L LLFE L LPNL Sbjct: 76 GDGDEVVNEKCDF-----------EMEASNEA-GLDIYESEIVLDDALALLFEQLELPNL 123 Query: 138 KQNQQRQLTEYKTH-RAGYTANGVPANISVVRSLQNSLARRTAMTAGKRRELHALEENLA 196 + + + L + T R+G G+ + R+LQ + R Sbjct: 124 YEKKFKNLEYFSTQKRSGIKKTGIYPRFAKKRTLQEKIIRN------------------- 164 Query: 197 IISNSEPAQLLEEERLRKEIAELRAKIERVPFIDTFDLRYKNYEKRPDPSSQAVMFCLMD 256 + FI+ D+RY++ K+ S AV+ C+MD Sbjct: 165 ---------------------------KNGRFINQ-DIRYQSLAKKQINHSNAVIVCIMD 196 Query: 257 VSGSMDQSTKDMAKRFYILLYLFLSRTYKNVEVVYIRHHTQAKEVDEHEFFYSQETGGTI 316 SGSM + KDMAK FY LLY F+ Y VE+++I H T AKEV E++FF+ E+GGT Sbjct: 197 TSGSMGTTKKDMAKSFYFLLYQFIKIRYAKVEMIFIAHSTIAKEVTENDFFHKGESGGTY 256 Query: 317 VSSALKLMDEVVKERYNPAQWNIYAAQASDGDNWADDSPLCHEILAKKLLPVVRYYSYIE 376 +SS E++KERY+P WN+Y SDGDNW DD+ L LA +L + YIE Sbjct: 257 ISSGYTKALEIIKERYDPRLWNVYTFHCSDGDNWTDDNNLAV-SLANELCSCSNLFGYIE 315 Query: 377 ITRRAHQTLWREYEHLQSTFDNFAMQHIRDQDDIYPVFREL 417 I + ++ + T +NF I + DI+ VF+++ Sbjct: 316 IKTNNYSSVILNEYNAHITSNNFLALKIFKKSDIFEVFKKV 356 >UniRef50_C9RD74 Putative uncharacterized protein n=1 Tax=Ammonifex degensii KC4 RepID=C9RD74_AMMDK Length = 371 Score = 359 bits (920), Expect = 2e-97, Method: Composition-based stats. Identities = 106/407 (26%), Positives = 167/407 (41%), Gaps = 51/407 (12%) Query: 14 KSMVNRQRFLRRYKAQIKQSISEAINKRSVTDVDSGESVSIPTEDISEPMFHQGRGGLRH 73 K + +R + K I+Q + E I + S+ D V IP + E F Sbjct: 15 KGEEDARRHQEKLKEIIRQRLPELITEESLILADDRRKVRIPLRLVEEFRFRFA-SHQEM 73 Query: 74 RVHPGNDHFVQNDRIERPQGGGGGSGSGQGQASQDGEGQDEFVFQISKDEYLDLLFEDLA 133 V ++ I P G GG + G D + ++S +E +++FE+LA Sbjct: 74 LVGQAGSQPGTDETIVFPGIGRGGGAGTE-------PGIDYYEAEVSVEEIAEVVFEELA 126 Query: 134 LPNLKQNQQRQLTEYKTHRAGYTANGVPANISVVRSLQNSLARRTAMTAGKRRELHALEE 193 LP+ K A G+ A + R+L N+L R G++ E Sbjct: 127 LPHYKPKNTAN-RGIAEEWADLRRQGIRACLDRRRTLLNALKRHA--KEGRKGEFR---- 179 Query: 194 NLAIISNSEPAQLLEEERLRKEIAELRAKIERVPFIDTFDLRYKNYEKRPDPSSQAVMFC 253 + DLR++ + P ++AV+ Sbjct: 180 -----------------------------------LCPSDLRFRVWRSIESPEARAVVLA 204 Query: 254 LMDVSGSMDQSTKDMAKRFYILLYLFLSRTYKNVEVVYIRHHTQAKEVDEHEFFYSQETG 313 ++D SGSM K +A+ F+ + FL Y NVEVVY+ HHT+A+E EFF E+G Sbjct: 205 MLDTSGSMGPLEKYLARSFFFWMVRFLEANYANVEVVYLAHHTEARETTASEFFRKGESG 264 Query: 314 GTIVSSALKLMDEVVKERYNPAQWNIYAAQASDGDNWADDSPLCHEILAKKLLPVVRYYS 373 GT SS +L ++++ RY P ++NIYA SDGDN D+ C E L +LL V Sbjct: 265 GTRCSSVYELALDIIETRYPPTEYNIYAFHFSDGDNLPADNERCME-LIGRLLEVANLVG 323 Query: 374 YIEITRRAHQTLWREYEHLQSTFDNFAMQHIRDQDDIYPVFRELFHK 420 Y EI T + + + +R++ D+Y + F + Sbjct: 324 YGEIEGPYFYTSTLKTVYQSIAHPRLVVVTLRERKDVYRALKAFFAR 370 >UniRef50_Q67Q87 Putative uncharacterized protein n=1 Tax=Symbiobacterium thermophilum RepID=Q67Q87_SYMTH Length = 395 Score = 343 bits (879), Expect = 8e-93, Method: Composition-based stats. Identities = 98/422 (23%), Positives = 170/422 (40%), Gaps = 54/422 (12%) Query: 14 KSMVNRQRFLRRYKAQIKQSISEAINKRSVTDVDSGESVSIPTEDISEPMFHQGRGGLRH 73 + ++++R R + I+ ++++ ++ ++ D + V +P + E F + Sbjct: 18 QGQMDQERHQARIREAIRANLADIVSDEAIIASDGRKVVRLPIRVLREYRFRLDWQK-QP 76 Query: 74 RVHPGNDHFVQNDRIERPQGGGGGSGSGQGQASQDGEGQDEFVFQISKDEYLDLLFEDLA 133 RV + + + RP +G +D F + +E +LLF +L+ Sbjct: 77 RVGEADGPVRPGEPVGRPGRAAEAAGGSGAGDEAG---EDWFETDVPLEELEELLFAELS 133 Query: 134 LPNLKQNQQRQLTEYKTHRAGYTANGVPANISVVRSLQNSLARRTAMTAGKRRELHALEE 193 LP+L+ Q+ LT G+ ANI R+L ++ R Sbjct: 134 LPHLEPKQEPHLTVLHHEWRDVRRQGLYANIDKKRTLLEAMKRNRLAGRP---------- 183 Query: 194 NLAIISNSEPAQLLEEERLRKEIAELRAKIERVPFIDTFDLRYKNYEKRPDPSSQAVMFC 253 + I DLR++ +++ P + AV+ Sbjct: 184 -------------------------------PLAGIRREDLRFRTWDEAEIPGASAVLII 212 Query: 254 LMDVSGSMDQSTKDMAKRFYILLYLFLSRTYKNVEVVYIRHHTQAKEVDEHEFFYSQETG 313 +MD SGSM K +A+ + FL Y+ V++ ++ H T+AKE+DE FF E+G Sbjct: 213 MMDTSGSMGTGEKYIARSLCHWMVRFLRTRYERVKLHFVAHTTEAKEMDEESFFTRGESG 272 Query: 314 GTIVSSALKLMDEVVKERYNPAQWNIYAAQASDGDNWADDSPLCHEILAKKLLPVVRYYS 373 GT SSA + +++ RY P + N+YA SDGDN D+P L +KLL Sbjct: 273 GTRCSSAYEYALQLIDRRYPPDRHNLYAFHFSDGDNLISDNPRAV-ALLRKLLERCALVG 331 Query: 374 YIEITRRAHQTLWREYE--------HLQSTFDNFAMQHIRDQDDIYPVFRELFHKQNATA 425 Y +I + Y+ + F IRD+ +IY R F + A Sbjct: 332 YGQIETQPQYLSMPYYQPNTLLTLFREEIDHPRFVTALIRDRSEIYAALRAFFPRPGAGE 391 Query: 426 KG 427 +G Sbjct: 392 RG 393 >UniRef50_A4BNQ7 Putative uncharacterized protein n=1 Tax=Nitrococcus mobilis Nb-231 RepID=A4BNQ7_9GAMM Length = 391 Score = 330 bits (847), Expect = 5e-89, Method: Composition-based stats. Identities = 95/421 (22%), Positives = 177/421 (42%), Gaps = 58/421 (13%) Query: 14 KSMVNRQRFLRRYKAQIKQSISEAINKRSVTDVDSGESVSIPTEDISEPMFHQGRGGLRH 73 + + R + + +++ + + + V +V +P + F +R Sbjct: 21 RGTRDWLRHNEKIREAVREQLPDLVAGSDVLSRPDNRTVKVPVRFMEHYRFRLRNPDVRT 80 Query: 74 RVHPGNDHFVQNDRIERPQGGGGGSGSGQGQASQDGEGQDEFVFQISKDEYLDLLFEDLA 133 G R +P + S +GEGQ F + D+ LD L+++L Sbjct: 81 GAGQGKAKPGDVLRPAQP-----ARPGQGKEGSGEGEGQITFALEFQIDDILDWLWDELE 135 Query: 134 LPNLKQNQQRQLTEYKTHRAGYTANGVPANISVVRSLQNSLARRTAMTAGKRRELHALEE 193 LP+LK ++ E R G+ G + + R+++ ++ RR+A Sbjct: 136 LPHLKPRLGTRIEEDAYIREGWDRRGARSRLDRRRTMKEAIKRRSAQG------------ 183 Query: 194 NLAIISNSEPAQLLEEERLRKEIAELRAKIERVPFIDTFDLRYKNYEKRPDPSSQAVMFC 253 E +P ++ DLR++ +R P++ AV F Sbjct: 184 -----------------------------PEAIPIVND-DLRFRQLARRRRPTTNAVAFF 213 Query: 254 LMDVSGSMDQSTKDMAKRFYILLYLFLSRTYKNVEVVYIRHHTQAKEVDEHEFFYSQETG 313 L+DVS SMD+ + +AK F+ + R + +E+V+I H +A E +E FF G Sbjct: 214 LLDVSSSMDEHCRRLAKTFFFWALQGVRRQFSTIEIVFIAHTVEAWEFEEENFFRIHGQG 273 Query: 314 GTIVSSALKLMDEVVKERYNPAQWNIYAAQASDGDNWADDSPLCHEILAKKLLPVVRYYS 373 GT S+A+ ++++ERY+PA +N Y A+DG N+++D E L +L P++ + Sbjct: 274 GTKSSTAVHKAQQILEERYDPAMYNCYLFYATDGHNFSEDRRRATEALL-RLAPLMNFLG 332 Query: 374 YIEITRRAHQTL-------WREYEHLQSTFDNFAMQHIRDQDDIYPVFRELFHKQNATAK 426 Y E++ + H+ L WR ++++ + DI+ + F Q A A+ Sbjct: 333 YAEVSHQNHRRLDTEVAGIWRGLGAEGWPVGSYSLTR---EADIWLAIKAFFTDQAAEAE 389 Query: 427 G 427 Sbjct: 390 A 390 >UniRef50_Q9HRD6 UPF0229 protein VNG_0746C n=11 Tax=Halobacteriaceae RepID=Y746_HALSA Length = 442 Score = 310 bits (793), Expect = 9e-83, Method: Composition-based stats. Identities = 93/449 (20%), Positives = 173/449 (38%), Gaps = 58/449 (12%) Query: 17 VNRQRFLRRYKAQIKQSISEAINKRSVTDVDSGESVSIPTEDISEPMFHQGRGGLRHRVH 76 +R+RF + + +Q ++E I + ++V +P + + P F + R V Sbjct: 5 EDRERFHEIGEQR-RQDLAEFIQYGDL-GGSGPDAVRVPIKLVDLPAFEYDQ-LDRGGVG 61 Query: 77 PGNDHFVQNDRIERPQGGGGGSGSGQGQASQDGEGQDEFVFQISKDEYLDLLFEDLALPN 136 G+ D++ P +G G + D E+ +++ +E+ L E L L + Sbjct: 62 QGDVDP--GDQVGEP----DEAGEGDDDEAGDESADHEY-YEMDPEEFAAELDERLGL-D 113 Query: 137 LKQNQQRQLTEYKTHRAGYTANGVPANISVVRSLQNSLARRTAM---------------- 180 L ++ + E + G + + L R+ AM Sbjct: 114 LDPKGKKVVAETEGAFNETARRGPRGTLDFAHLYKQGLKRKIAMDFDEAYVTAALRVDGW 173 Query: 181 ---TAGKRRELHALEENLAIISNSEPAQLLEEERLRKEIAELRAKIE------------- 224 + + A I+ + +++ R + A I+ Sbjct: 174 GVDAVYTWAREQHIPVSRAWIAERARSPSPDDDAGRVVDDAVWASIDAMEAAVDVEPTRT 233 Query: 225 --------RVPFIDTFDLRYKNYEKRPDPSSQAVMFCLMDVSGSMDQSTKDMAKRFYILL 276 RVP + D R+++ + V+ + DVSGSM +S +++ +R + L Sbjct: 234 RIRRGGPGRVP-LRREDERFRHPKVVEHRERNVVVVNIRDVSGSMRESKRELVERTFTPL 292 Query: 277 YLFLSRTYKNVEVVYIRHHTQAKEVDEHEFFYSQETGGTIVSSALKLMDEVVKERYNPAQ 336 +L+ Y N E VYI H A EVD +FF Q GGT +S+A +L + V+ E Y ++ Sbjct: 293 DWYLTGKYDNAEFVYIAHDADAWEVDRTDFFGIQSGGGTRISTAYELAENVLDE-YPFSE 351 Query: 337 WNIYAAQASDGDNWADDSPLCHEILAKKLLPVVRYYSYIEITRRAHQTLWREYEHLQSTF 396 WN Y A DG+N DD+ L + ++Y+E ++ F Sbjct: 352 WNRYVFAAGDGENSHDDTEENVIPLMNDI--DANLHAYVETQPTDGVQTGTHAGKVRDAF 409 Query: 397 ---DNFAMQHIRDQDDIYPVFRELFHKQN 422 DN A+ + + DD+ + ++ Sbjct: 410 GDTDNVAVTTVTEPDDVMGAIETILSTED 438 >UniRef50_C1XT76 Uncharacterized conserved protein n=1 Tax=Meiothermus silvanus DSM 9946 RepID=C1XT76_9DEIN Length = 360 Score = 306 bits (783), Expect = 1e-81, Method: Composition-based stats. Identities = 91/404 (22%), Positives = 152/404 (37%), Gaps = 54/404 (13%) Query: 17 VNRQRFLRRYKAQIKQSISEAINKRSVTDVDSGESVSIPTEDISEPMFHQGRGGLRHRVH 76 + QRF + ++K+ E + + G+ VSIP + P G + Sbjct: 6 RDLQRFKEIVRGEVKKRAREFLTREEYLGSLDGQVVSIPLPQLELPRLQYGHNEMGQG-- 63 Query: 77 PGNDHFVQNDRIERPQGGGGGSGSGQGQASQDGEGQDEFVFQISKDEYLDLLFEDLALPN 136 E G G G G V ++ +E+L+L+ E L LP Sbjct: 64 ------------EGEGEGQGQGMGGTAGRGGLGPSGHVPVAEMDLEEFLELIGEALKLPR 111 Query: 137 LKQNQQRQLTEYKTHRAGYTANGVPANISVVRSLQNSLARRTAMTAGKRRELHALEENLA 196 L+ Q + E + G + R+L+ +L R Sbjct: 112 LEPKQGGAVEESSPKYTTLSRRGPESLRHARRTLRQALRRAI------------------ 153 Query: 197 IISNSEPAQLLEEERLRKEIAELRAKIERVPFIDTFDLRYKNYEKRPDPSSQAVMFCLMD 256 + R E L + + D RY+ E +P P +QA + +D Sbjct: 154 -----------QSGIYRPEDPRLVPERD--------DYRYRAPEPKPRPQAQAALVFALD 194 Query: 257 VSGSMDQSTKDMAKRFYILLYLFLSRTYKNVEVVYIRHHTQAKEVDEHEFFYSQETGGTI 316 VSGSM+ + + + ++ + + + Y+ H +A EV E +FF +E GGT Sbjct: 195 VSGSMEGEQLRLVRILSYWITAWVKKHFPRLSRHYLLHDAEAWEVSEEDFFRLREGGGTR 254 Query: 317 VSSALKLMDEVVKERYNPAQWNIYAAQASDGDNWADDSPLCHEILAKKLLPVVRYYSYIE 376 +SS +KL +V+ ERY +N Y +DGDNW DD+ E L K LLP + Y Y + Sbjct: 255 LSSGIKLAQQVL-ERYPAQLYNRYVYHFTDGDNWQDDTAEALETL-KALLPTLSLYGYAQ 312 Query: 377 ITRRAHQ-TLWREYEHLQSTFDNFAMQHIRDQDDIYPVFRELFH 419 + R Q + + A + ++ + + L Sbjct: 313 VRSRYGQGRFIDDLRSHFPSDPALATAELGGRESLPSALKRLLG 356 >UniRef50_Q1GJ99 Putative uncharacterized protein n=1 Tax=Ruegeria sp. TM1040 RepID=Q1GJ99_SILST Length = 300 Score = 305 bits (780), Expect = 3e-81, Method: Composition-based stats. Identities = 123/299 (41%), Positives = 183/299 (61%), Gaps = 13/299 (4%) Query: 139 QNQQRQLTEYKTHRAGYTANGVPANISVVRSLQNSLARRTAMTAG---KRRELHALEENL 195 + + T RAG T G P N+++VR+++NSL RR A+ +R+L A L Sbjct: 2 EKATVETETIGTRRAGLTTAGTPNNLNLVRTMRNSLGRRIALQRPSTQTQRDLEAQVAEL 61 Query: 196 AIISNSEPAQLLEEERLRKEIAELRAKIERVPFIDTFDLRYKNYEKRPDPSSQAVMFCLM 255 I P+Q L ++ ++ K V +ID DLRY + +S+AV+FCLM Sbjct: 62 EEIEARSPSQDELLAELVAKLDGIKRKRRVVGYIDPLDLRYDTFVPEKIRNSRAVVFCLM 121 Query: 256 DVSGSMDQSTKDMAKRFYILLYLFLSRTYKNVEVVYIRHHTQAKEVDEHEFFYSQETGGT 315 DVSGSM + KD+AKRF++LL+LFL+R Y++ E+V++RH A+EVDE FFY++ETGGT Sbjct: 122 DVSGSMQEREKDLAKRFFLLLHLFLTRGYEHTEIVFVRHTHYAQEVDEETFFYARETGGT 181 Query: 316 IVSSALKLMDEVVKERYNPAQWNIYAAQASDGDNWADDSPLCHEILAKKLLPVVRYYSYI 375 IVS+AL+ M E++ ERY P +WNIY AQASDG+N+ +DS C +IL ++LLP+ ++Y+Y+ Sbjct: 182 IVSTALEKMKEIIDERYPPDEWNIYGAQASDGENFGNDSVRCRKILTEQLLPMCQFYAYV 241 Query: 376 EI----------TRRAHQTLWREYEHLQSTFDNFAMQHIRDQDDIYPVFRELFHKQNAT 424 EI A + LW+ Y ++ +F MQ + + IYP+FRE F + Sbjct: 242 EIVEESAQMLLDNTEAGEDLWQNYRQVKEACRHFEMQRVSEPGHIYPIFREFFLPKVKG 300 >UniRef50_Q72KI5 Glycosyltransferase n=3 Tax=Thermus RepID=Q72KI5_THET2 Length = 351 Score = 278 bits (711), Expect = 2e-73, Method: Composition-based stats. Identities = 104/404 (25%), Positives = 161/404 (39%), Gaps = 64/404 (15%) Query: 17 VNRQRFLRRYKAQIKQSISEAINKRSVTDVDSGESVSIPTEDISEPMFHQGRGGLRHRVH 76 + RF + ++K+ + E + + + G VSIP + P G Sbjct: 10 RDLLRFKEIVRGEVKKRVREFLTREELFGQVEGRLVSIPLPQLEIPKIVHGEPLGEGL-- 67 Query: 77 PGNDHFVQNDRIERPQGGGGGSGSGQGQASQDGEGQDEFVFQISKDEYLDLLFEDLALPN 136 G G G G V ++ +E+LDL+ E L LP Sbjct: 68 ----------------------GLGGPGEEALGPGGHIPVAELELEEFLDLVGEALRLPR 105 Query: 137 LKQNQQRQLTEYKTHRAGYTANGVPANISVVRSLQNSLARRTAMTAGKRRELHALEENLA 196 L+ + ++TE G V R+L+ SL R Sbjct: 106 LRPKGEGEVTEEALRHTTIARKGPRGLRHVRRTLKESLKR-------------------- 145 Query: 197 IISNSEPAQLLEEERLRKEIAELRAKIERVPFIDTFDLRYKNYEKRPDPSSQAVMFCLMD 256 L+ R E L + E DLRYK ++P P +QAV+ +D Sbjct: 146 ---------ALQSGEYRPEDPLLVPERE--------DLRYKAPRRKPIPHAQAVVLFALD 188 Query: 257 VSGSMDQSTKDMAKRFYILLYLFLSRTYKNVEVVYIRHHTQAKEVDEHEFFYSQETGGTI 316 VSGSM + + K + L++ R + +E Y+ H +A EV E EFF ++E GGT Sbjct: 189 VSGSMREEELKLVKTLSFWITLWIKRHFPRLERRYLLHDAEAWEVPEEEFFKAREGGGTR 248 Query: 317 VSSALKLMDEVVKERYNPAQWNIYAAQASDGDNWADDSPLCHEILAKKLLPVVRYYSYIE 376 +SSAL L +E++K Y A +N Y SDG+NW D+PL E ++LLP + Y Y + Sbjct: 249 ISSALLLAEEILKA-YPEAFYNRYLFHFSDGENWQGDTPLALEA-LRRLLPSLALYGYAQ 306 Query: 377 ITRRAHQT-LWREYEHLQSTFDNFAMQHIRDQDDIYPVFRELFH 419 + Q E + A+ +R ++D+ R L Sbjct: 307 VEGPYGQGHFLEEVREALGGREGVALAAVRGREDLPVALRRLLG 350 >UniRef50_A0MN74 Conserved bacterial protein n=1 Tax=Thermus phage phiYS40 RepID=A0MN74_9CAUD Length = 340 Score = 243 bits (621), Expect = 7e-63, Method: Composition-based stats. Identities = 86/366 (23%), Positives = 137/366 (37%), Gaps = 75/366 (20%) Query: 16 MVNRQRFLRRYKAQIKQSISEAINKRSVTDVDSGESVSIPTEDISEPMFHQGRGGLRHRV 75 ++ R+L+R + IK + + +N + + + V I + EP F Sbjct: 5 TIDEIRYLKRLENIIKARMQDIVNSNDIIESTPEDKVRIRIPIMDEPYFK---------- 54 Query: 76 HPGNDHFVQNDRIERPQGGGGGSGSGQGQASQDGEGQDEFVFQISKDEYLDLLFEDLALP 135 + G G GSGSG EG E +++ +E +LLFE L LP Sbjct: 55 -----------PVFPGSGAGAGSGSGSEPGEGSEEGDHEIEIELTVEELSELLFEYLGLP 103 Query: 136 NLKQNQQRQLTEYKTHRAGYTANGVPANISVVRSLQNSLARRTAMTAGKRRELHALEENL 195 +K + + + G + G + I RR Sbjct: 104 KIKPKG-SSVEKEEYLIEGISKTGPRSRIH----------RR------------------ 134 Query: 196 AIISNSEPAQLLEEERLRKEIAELRAKIERVPFIDTFDLRYKNYEKRPDPSSQAVMFCLM 255 + +I + + + +RYK+ KR P A+++ Sbjct: 135 ----------------------KTYYEIMKYGYKED-SIRYKHLRKREVPIFDAIVYFAR 171 Query: 256 DVSGSMDQSTKDMAKRFYILLYLFLSRTYKNVEVVYIRHHTQAKEVDEHEFFYSQETGGT 315 D S S+D K K + F+ YKNV + H T+AK V E +FF E G T Sbjct: 172 DYSASVDDKKKFKIKSTAFWINNFIKYNYKNVTTKFAVHDTKAKFVSEQDFFKLSEGGAT 231 Query: 316 IVSSALKLMDEVVKERYNPAQWNIYAAQASDGDNWADDSPLCHEILAKKLLPVVRYYSYI 375 + SS +L+ E + RY+ +N Y SDG+N DD+P E L +KL +Y Sbjct: 232 LCSSVFELIYEDYR-RYSVDDYNFYLFYFSDGENLPDDNPKLRE-LVEKLSEDFNLIAYG 289 Query: 376 EITRRA 381 E+ Sbjct: 290 EVKSTD 295 >UniRef50_Q747C5 Conserved domain protein n=11 Tax=Deltaproteobacteria RepID=Q747C5_GEOSL Length = 447 Score = 216 bits (549), Expect = 1e-54, Method: Composition-based stats. Identities = 76/380 (20%), Positives = 152/380 (40%), Gaps = 53/380 (13%) Query: 23 LRRYKAQIKQSISEAINKRSVTDVDSGES---VSIPTEDISEPMFHQ------GRGGLRH 73 L R + + + + I + +G V +PT + E + H Sbjct: 72 LERDRLREEDGLPRKIRIGKLIKPGAGGKEKIVVVPT-TVEEKLIHDRAPEETEEDESMG 130 Query: 74 RVHPGNDHFVQNDRIERPQGGGGGSGSGQGQASQDGEGQDEFVFQISKDEYLDLLFEDLA 133 G++ + ++ RPQ GG +G G+ +G + + + + +L E Sbjct: 131 GTGDGDEGEIIGEQPVRPQQEGGSGTAGHGE--GEGHELESTAYDLGR-----ILTERFD 183 Query: 134 LPNLKQNQQRQLTEYKTHRAGYTANGVPANISVVRSLQNSLARRTAMTAGKRRELHALEE 193 LPNLK+ ++ + + Y+ + N R L ++ + E + Sbjct: 184 LPNLKEKGKK------SSLSHYSYDLTDRN----RGFGQILEKKQTLRRIL--ETNIALG 231 Query: 194 NLAIISNSEPAQLLEEERLRKEIAELRAKIERVPFIDTFDLRYKNYEKRPDPSSQAVMFC 253 +A ++ +P +L + + +RV I + +L Y SQA++F Sbjct: 232 TVADVAEIDPTRL------------VISPRDRVYRILSRELEY---------ESQALVFF 270 Query: 254 LMDVSGSMDQSTKDMAKRFYILLYLFLSRTY-KNVEVVYIRHHTQAKEVDEHEFFY-SQE 311 + D SGSM+ + ++L+Y +L + + VE +I H A+EV + +Y + Sbjct: 271 IRDYSGSMEGKATEAVCSQHVLIYSWLLYQFARQVETRFILHDNDAREVPDFYTYYNLRV 330 Query: 312 TGGTIVSSALKLMDEVVKERYNPAQWNIYAAQASDGDNWADDSPLCHEILAKKLLPVVRY 371 GGT V++A ++++E+V++ +NIY +DGD+W + L +++L Sbjct: 331 AGGTRVAAAYRMVNEIVEKESLARDYNIYVFHGTDGDDWDTNGEETIPEL-RRMLAYANR 389 Query: 372 YSYIEITRRAHQTLWREYEH 391 + E E Sbjct: 390 IGVTIAEHTYGSSGNTEVER 409 >UniRef50_A4BJU7 Putative uncharacterized protein n=1 Tax=Reinekea blandensis MED297 RepID=A4BJU7_9GAMM Length = 555 Score = 172 bits (435), Expect = 3e-41, Method: Composition-based stats. Identities = 46/263 (17%), Positives = 81/263 (30%), Gaps = 26/263 (9%) Query: 175 ARRTAMTAGKRRELHALEENLAIISNSEPAQLLEEERLRKEIAELRAKIERVPFIDTFDL 234 R + +EE + + PA ++ + + L Sbjct: 124 RRFLNQGMRPPADSIRVEEFINYFDYALPAPDTTNTPIQISTERTQTPWNPQTELVRVSL 183 Query: 235 RYKNYEKRPDPSSQAVMFCLMDVSGSMDQSTK-DMAKRFYILLYLFLSRTYKNVEVVYIR 293 + + + P + L+DVSGSM+ K + +R + LL L + VY Sbjct: 184 QSYRSDFKTLPPLN--LVFLLDVSGSMNSPDKLPLMQRSFNLLVSQLRPQDRVAIAVYAG 241 Query: 294 HHTQAKEVD--------EHEFFYSQETGGTIVSSALKLMDEVVKERYNPAQWNIYAAQAS 345 E + GGT S+ + L ++ + Y P N + Sbjct: 242 QSGVVLEPTSGDQKAQINQAINQLRAGGGTHGSAGIHLAYDLAQANYLPDGINR-IFIGT 300 Query: 346 DGD-NWADDSPLCHEILAKKLLPV---VRYYSYIEITRRAHQTLWREYEHLQSTFDNFAM 401 DGD N S + L ++ + + T + L E + + + Sbjct: 301 DGDFNVGTTSLTELKALIERKREAGVFLSVLGFG--TGNYNDALMEELSNHGNGTAYYL- 357 Query: 402 QHIRDQDDIYPVFRELFHKQNAT 424 D Y R+LF Q A Sbjct: 358 -------DSYQEARKLFATQLAA 373 >UniRef50_D2EEA7 Putative uncharacterized protein n=1 Tax=Candidatus Parvarchaeum acidiphilum ARMAN-4 RepID=D2EEA7_9EURY Length = 373 Score = 156 bits (393), Expect = 2e-36, Method: Composition-based stats. Identities = 46/204 (22%), Positives = 90/204 (44%), Gaps = 23/204 (11%) Query: 231 TFDLRYKNYEKRPDPSSQAVMFCLMDVSGSMDQSTKDMAKRFYILLYLFLSRTYKN-VEV 289 D+RY E++ P+ A + D+SGSM+ + + L+ +L Y++ V++ Sbjct: 172 DEDIRYNLIEEKLIPNLSATFVIMRDISGSMEMYG-EFSATIAGLIEFWLKEKYEHTVKI 230 Query: 290 VYIRHHTQAKEVD---EHEFFYSQETGGTIVSSALKLMDEVVK-----------ERYNPA 335 Y+ H +A E D +FF +GGT + A KL+ ++ ER + Sbjct: 231 RYVAHTDEAFEYDPRKREDFFKLSSSGGTAFNPAYKLVIDMTDGASYKSNSPYKERIDYQ 290 Query: 336 QWNIYAAQASDGDNWADDSPLCHEILAKKLLPVVRYYSYIEITRRAHQTLWREYEHLQST 395 +++ +DGDN+ + E L KKL P + Y+++ + Y ++S Sbjct: 291 SEDVFLLHITDGDNYNGEDEAVRETL-KKLFPRLTKVFYLQVGGYSDSF----YNLIKSV 345 Query: 396 FDNFAMQHIRDQDDI-YPVFRELF 418 + ++ +DI Y +++ Sbjct: 346 DPE-KLSEVKSGNDISYNNVKKVL 368 >UniRef50_A9DKM0 Putative uncharacterized protein (Fragment) n=1 Tax=Shewanella benthica KT99 RepID=A9DKM0_9GAMM Length = 167 Score = 139 bits (351), Expect = 2e-31, Method: Composition-based stats. Identities = 101/153 (66%), Positives = 124/153 (81%), Gaps = 1/153 (0%) Query: 1 MTWFIDRRLNGKNKSMVNRQRFLRRYKAQIKQSISEAINKRSVTDVDSGESVSIPTEDIS 60 M FIDRRLN + +S VNRQRF+ RYK QIK+++S+A+ +RSVTDVD GE +SIPT+DIS Sbjct: 16 MANFIDRRLNARGRSTVNRQRFINRYKQQIKKAVSDAVTRRSVTDVDKGERISIPTKDIS 75 Query: 61 EPMFHQGRGGLRHRVHPGNDHFVQNDRIERPQGGGGGSGSGQGQASQDGEGQDEFVFQIS 120 EP FHQG+GG+R RVHPGND F++ D+IER GGG GSGQG AS GEG D+FVFQIS Sbjct: 76 EPSFHQGQGGIRERVHPGNDQFIKGDKIER-PPGGGSQGSGQGDASNSGEGDDDFVFQIS 134 Query: 121 KDEYLDLLFEDLALPNLKQNQQRQLTEYKTHRA 153 KDEYL+LLFEDL LPNL+ N+ +L EY+ +RA Sbjct: 135 KDEYLELLFEDLELPNLQNNRLNKLVEYQVYRA 167 >UniRef50_B4DA43 von Willebrand factor type A n=1 Tax=Chthoniobacter flavus Ellin428 RepID=B4DA43_9BACT Length = 883 Score = 117 bits (294), Expect = 5e-25, Method: Composition-based stats. Identities = 42/248 (16%), Positives = 77/248 (31%), Gaps = 29/248 (11%) Query: 190 ALEENLAIISNSEPAQLLEEERLRKEIAELRAKIERVPFIDTFDLRYKNYEKRPDPSSQA 249 +EE L P + + L+ + K P S Sbjct: 365 RIEELLNYFPYDYPQPQG-AAPFSATMEVATCPWAPEHRLVRVGLKGREIPKDERPPSN- 422 Query: 250 VMFCLMDVSGSMDQSTKD--MAKRFYILLYLFLSRTYKNVEVV-------YIRHHTQAKE 300 + L+DVSGSM+ K + K F +L+ + V +V + TQ KE Sbjct: 423 -LVFLIDVSGSMNMPNKLPLLQKCFSLLVEQLGPK--DRVSIVTYASGTKLVLEPTQDKE 479 Query: 301 VDEHEFFYSQETGGTIVSSALKLMDEVVKERYNPAQWNIYAAQASDGD-NWADDSPLCHE 359 + GGT SS + L + ++ + P N A+DGD N + Sbjct: 480 AMQTAIDGLHAGGGTHGSSGIDLAYRMAQQSFIPGGTNRVIL-ATDGDWNIGITNQSELL 538 Query: 360 ILAKKLLPV---VRYYSYIEITRRAHQTLWREYEHLQSTFDNFAMQHIRDQDDIYPVFRE 416 + + + + ++ + + + D R+ Sbjct: 539 SMITRKAKSGVFLTVLGFG--LDNLKDSMLVKLADHGNGHYAYI--------DTEQEARK 588 Query: 417 LFHKQNAT 424 +F Q ++ Sbjct: 589 VFVDQLSS 596 >UniRef50_D2VY88 Predicted protein n=1 Tax=Naegleria gruberi RepID=D2VY88_NAEGR Length = 1082 Score = 114 bits (286), Expect = 5e-24, Method: Composition-based stats. Identities = 41/238 (17%), Positives = 84/238 (35%), Gaps = 39/238 (16%) Query: 199 SNSEPAQLLEEERLRKEIAELRAKIERVPFIDTFDLRYKNYEKRPDPSSQAVMFCLMDVS 258 SN + Q+ E+ R E+ L + + + + P P+ + L DVS Sbjct: 82 SNQKVEQVEEKSLFRIEVEHLIDLDDHDIGVAELTIHVNDPSLNPKPT---LFIALADVS 138 Query: 259 GSMDQSTKDMAKRFYILLYLFLSRTYKNVEV--VYIRHHTQAKEVD------------EH 304 GSM + L F +++ N + + + + AKE+D E Sbjct: 139 GSMQGRPWEQVCTS---LKHFAQQSFNNPAIICRMVAYESSAKEIDMKGTLQSIIRNIET 195 Query: 305 EFFYSQETGGTIVSSALKLMDEVVKERYNPAQW-----NIYAAQASDGDNWADDS-PLCH 358 F GGT +SA +L ++ + N+ +DG++++ P Sbjct: 196 AF----TGGGTDFASAFQLACTIITRESGQDRENLPFGNVVITFLTDGEDFSKVGKPGGL 251 Query: 359 EILAKKLLPVVR------YYSYIEITRRAHQTLWREYEHLQSTFDNFAMQHIRDQDDI 410 + L++++ V R + + L + + + + D +D+ Sbjct: 252 QYLSEEINRVYRGDITIHTVGFG---SHHNLELLDNIRKVGTIEGAYRYANYDDNNDV 306 >UniRef50_A4C5H3 Von Willebrand factor type A domain protein n=1 Tax=Pseudoalteromonas tunicata D2 RepID=A4C5H3_9GAMM Length = 608 Score = 97.9 bits (242), Expect = 6e-19, Method: Composition-based stats. Identities = 39/265 (14%), Positives = 78/265 (29%), Gaps = 26/265 (9%) Query: 175 ARRTAMTAGKRRELHALEENLAIISNSEPAQLLEEERLRKEIAELRAKIERVPFIDTFDL 234 R M + E + A E A + + Sbjct: 170 RRMIKMGKRPPADAVREEAFINYFDYHYSAPKSLETPFNVHTEVAPAPWNNQRQLLKIGI 229 Query: 235 RYKNYEKRPDPSSQAVMFCLMDVSGSMDQSTK-DMAKRFYILLYLFLSRTYKNVEVVY-- 291 + + EK ++ + L+DVSGSM+ K + K +L L VVY Sbjct: 230 KGFDIEKAELKAAN--LVFLLDVSGSMNAPDKLPLLKSSLTMLTKQLDENDSVAIVVYAG 287 Query: 292 ----IRHHTQAKE--VDEHEFFYSQETGGTIVSSALKLMDEVVKERYNPAQWNIYAAQAS 345 + T+ E V + G T + ++L ++ + + N A+ Sbjct: 288 AAGLVLPATKGNEYQVISNALNNLSAGGSTNGAQGIELAYQIASQNFKKEGINRVIL-AT 346 Query: 346 DGD-NWADDSPLCHEILAKKLLP---VVRYYSYIEITRRAHQTLWREYEHLQSTFDNFAM 401 DGD N S + L + + + + L + ++ + + Sbjct: 347 DGDFNVGMSSVDALKKLIANKRKTGIALTTLGFGQ--GNYNDGLMEQLANIGNGQHAYI- 403 Query: 402 QHIRDQDDIYPVFRELFHKQNATAK 426 D R++ + ++ Sbjct: 404 -------DTINEARKVLVDELSSTM 421 >UniRef50_C6M483 von Willebrand factor type A domain protein n=1 Tax=Neisseria sicca ATCC 29256 RepID=C6M483_NEISI Length = 538 Score = 95.6 bits (236), Expect = 3e-18, Method: Composition-based stats. Identities = 42/275 (15%), Positives = 91/275 (33%), Gaps = 23/275 (8%) Query: 164 ISVVRSLQNSLARRTAMTAGKRRELHALEENLAIISNSEPAQLLEEERLRKEIAELRAKI 223 I V ++ R ++ +EE + + P + + + Sbjct: 90 IDVDTGSYANVRRFLTNGEQPPKDAVRIEEIVNYFPYNYPLP-TDNRPFAVHTETIDSPW 148 Query: 224 ERVPFIDTFDLRYKNYEKRPDPSSQAVMFCLMDVSGSMDQSTK-DMAKRFYILLYLFLSR 282 + + ++ ++ K+ P + + L+DVSGSMD+ K + ++ +L L Sbjct: 149 QPEAKLIKIGIQAQDTAKKDLPPAN--LVFLVDVSGSMDEENKLPLVQKTLRILTQQLRP 206 Query: 283 TYKNVEVVYIR--------HHTQAKEVDEHEFFYSQETGGTIVSSALKLMDEVVKERYNP 334 K + Y KE + G T SAL++ E ++ + P Sbjct: 207 QDKVTLITYASGEDLVLPPTSGADKETILSAIDKLRAGGATDGESALQMAYEQAQKAFVP 266 Query: 335 AQWNIYAAQASDGD-NWA-DDSPLCHEILAKKLLPVVRYYSYIEITRRAHQTLWREYEHL 392 N A+DGD N D+ ++A+K V + ++ + + Sbjct: 267 NGINRILL-ATDGDFNVGVSDTETLKSMVAEKRKSGVSLSTLGFGMGNYNEDMMEQIADA 325 Query: 393 QSTFDNFAMQHIRDQDDIYPVFRELFHKQNATAKG 427 ++ D +++ +Q + Sbjct: 326 GDGNYSYI--------DNEKEAKKVLQQQLTSTLA 352 >UniRef50_A9DBX8 Putative uncharacterized protein n=1 Tax=Hoeflea phototrophica DFL-43 RepID=A9DBX8_9RHIZ Length = 668 Score = 95.2 bits (235), Expect = 5e-18, Method: Composition-based stats. Identities = 40/302 (13%), Positives = 87/302 (28%), Gaps = 35/302 (11%) Query: 146 TEYKTHRAGYTANGVPA---------NISVVRSLQNSLARRTAMTAGKRRELHALEENLA 196 + G+ +NGV + + V + + R +EE + Sbjct: 195 EAERDRVEGFDSNGVRSVAEYPVSTFSADVDTASYAMVRRALKQGVMPDPRTVRIEEMVN 254 Query: 197 IISNSEPAQLLEEERLRKEIAELRAKIERVPFIDTFDLRYKNYEKRPDPSSQAVMFCLMD 256 + PA E R + + ++ + + P + V+ L+D Sbjct: 255 YFNYDYPAPESVETPFRATVTVTPTPWNANTRLLHIGVKGYDVKPAARPQANLVL--LVD 312 Query: 257 VSGSMDQSTK-DMAKRFYILLYLFLSRTYKNVEVVYIRHHTQAKEVDE--------HEFF 307 VSGSM ++ K + K + LL L V Y E Sbjct: 313 VSGSMQETDKLPLLKSAFRLLIQKLEPEDTVSIVTYAGDAGTVLEPTPASDKAKILDALD 372 Query: 308 YSQETGGTIVSSALKLMDEVVKERYNPAQWNIYAAQASDGD-NWADDSPLCHEILAKKLL 366 + G T ++ ++ + ++ N A+DGD N + L ++ Sbjct: 373 DLRPGGSTAGAAGIEEAYRLAEKARVNGGVNRVLL-ATDGDFNVGASDDDALKSLIEEKR 431 Query: 367 PV---VRYYSYIEITRRAHQTLWREYEHLQSTFDNFAMQHIRDQDDIYPVFRELFHKQNA 423 + + + + + L + + + D + ++ Sbjct: 432 ESGVFLSIFGFGQ--GNYNDQLMQTLAQNGNGVAAYI--------DTLAEAEKTLAQEAT 481 Query: 424 TA 425 + Sbjct: 482 AS 483 >UniRef50_C5BTM6 von Willebrand factor type A domain protein n=1 Tax=Teredinibacter turnerae T7901 RepID=C5BTM6_TERTT Length = 689 Score = 91.0 bits (224), Expect = 8e-17, Method: Composition-based stats. Identities = 47/286 (16%), Positives = 87/286 (30%), Gaps = 23/286 (8%) Query: 131 DLALPN-LKQNQQRQLTEYKTHRAGYTANGVPAN--ISVVRSLQNSLARRTAMTAGKRRE 187 DL P+ L+ + T+ T + I V + + + R+ ++ Sbjct: 210 DLEPPHQLETADRDHFDTVATNPIKVTREEPVSTFSIDVDTASYSFVRRQLNRGQLPQKA 269 Query: 188 LHALEENLAIISNSEPAQLLEEERLRKEIAELRAKIERVPFIDTFDLRYKNYEKRPDPSS 247 LEE + P + I + A + + + K P + Sbjct: 270 AVRLEEMVNYFPYDYPLPSAATAPFKPTITVIPAPWNQAKRLVHIGI--KALPLAHPPKA 327 Query: 248 QAVMFCLMDVSGSMDQSTK-DMAKRFYILLYLFLSRTYKNVEVVYIRHHTQAKEVD---E 303 + L+DVSGSM K + K+ LL L T VVY E E Sbjct: 328 N--LVFLLDVSGSMGSPDKLPLVKQSMELLLSGLQPTDTVSIVVYAGAAGTVLEPTPVAE 385 Query: 304 H-----EFFYSQETGGTIVSSALKLMDEVVKERYNPAQWNIYAAQASDGD-NWADDSPLC 357 G T + ++L ++ + Y N A+DGD N P Sbjct: 386 QQKILAALDRLNAGGSTAGAQGIELAYQLAEANYQRDAVNRIIL-ATDGDFNVGIADPEQ 444 Query: 358 HEILAKKLLP---VVRYYSYIEITRRAHQTLWREYEHLQSTFDNFA 400 + ++ + + + + L ++ + + Sbjct: 445 LKGYVERKRANGIELSILGFG--SGNYNDALMQQLAQNGNGVAAYI 488 >UniRef50_C6BAR1 von Willebrand factor type A n=7 Tax=Alphaproteobacteria RepID=C6BAR1_RHILS Length = 706 Score = 89.8 bits (221), Expect = 2e-16, Method: Composition-based stats. Identities = 39/278 (14%), Positives = 84/278 (30%), Gaps = 19/278 (6%) Query: 137 LKQNQQRQLTEYKTHRAGYTANGVPANISVVRSLQNSLARRTAMTAGKRRELH-ALEENL 195 L N++R + V + V S + RR+ L +EE + Sbjct: 227 LDPNRERFANAAANPIKSVATDPVSTFSADVDSASYAFVRRSLTGGAMPDPLSVRVEEMI 286 Query: 196 AIISNSEPAQLLEEERLRKEIAELRAKIERVPFIDTFDLRYKNYEKRPDPSSQAVMFCLM 255 P ++ + + + R + ++ + P + + L+ Sbjct: 287 NYFPYDWPGPNNADQPFKATVTVMPTPWNRDTELMHVAIKGYDIAPATTPRAN--LVFLI 344 Query: 256 DVSGSMDQSTK-DMAKRFYILLYLFLSRTYKNVEVVYIRHHTQAKEVDE--------HEF 306 DVSGSMD+ K + K + L+ L V Y + Sbjct: 345 DVSGSMDEPDKLPLLKSAFRLMVNRLKADDTVSIVTYAGNAGTVLAPTRVAEKSKILSAI 404 Query: 307 FYSQETGGTIVSSALKLMDEVVKERYNPAQWNIYAAQASDGD-NWADDSPLCHEILAKKL 365 + G T + ++ ++ K+ + N A+DGD N S + + ++ Sbjct: 405 DRLEPGGSTGGAEGIEAAYDLAKQGFVKDGVNRVML-ATDGDFNVGPSSDGDLKRIIEEK 463 Query: 366 LPV---VRYYSYIEITRRAHQTLWREYEHLQSTFDNFA 400 + + + +L + + + Sbjct: 464 RKDGIFLTVLGFG--RGNLNDSLMQTLAQNGNGSAAYI 499 >UniRef50_A8TM27 Putative Ser protein kinase n=1 Tax=alpha proteobacterium BAL199 RepID=A8TM27_9PROT Length = 318 Score = 87.9 bits (216), Expect = 6e-16, Method: Composition-based stats. Identities = 32/73 (43%), Positives = 45/73 (61%) Query: 4 FIDRRLNGKNKSMVNRQRFLRRYKAQIKQSISEAINKRSVTDVDSGESVSIPTEDISEPM 63 IDRR N + KS+ NRQRFLRR K Q+ +++ +A +R + DV GE + IPT+ ++EP Sbjct: 230 IIDRRRNSQGKSLANRQRFLRRAKRQVTEAVRQASAERRIRDVADGEQIVIPTDGLNEPR 289 Query: 64 FHQGRGGLRHRVH 76 F LR H Sbjct: 290 FRHDARRLRLDRH 302 >UniRef50_B1KPQ5 von Willebrand factor type A n=8 Tax=Gammaproteobacteria RepID=B1KPQ5_SHEWM Length = 640 Score = 86.7 bits (213), Expect = 1e-15, Method: Composition-based stats. Identities = 48/309 (15%), Positives = 94/309 (30%), Gaps = 28/309 (9%) Query: 130 EDLALPNLKQN-QQRQLTEYKTHRAGYTANGVPANISVVRSLQNSLARRTAMTAGKRREL 188 +D+ LP L+ + + AG + I V ++L R R Sbjct: 135 QDIYLPELQNRDKFERQVANGIMVAGEIPVSTFS-IDVDTGSYSTLRRSINHGVLPERGT 193 Query: 189 HALEENLAIISNSEPAQLLEEERLRKEIAELRAKIERVPFIDTFDLRYKNYEKRPDPSSQ 248 +EE + + PA E+ + + L+ EK +SQ Sbjct: 194 VRVEELINYFAYQYPAPDAGEQPFSVNTELAPSPYNPHKMLLRIGLKGFEKEKADLGASQ 253 Query: 249 AVMFCLMDVSGSMDQSTK-DMAKRFYILLYLFLSRTYKNVEVVYIRHHTQAKE------- 300 + L+DVSGSM K + K +L L + VVY + Sbjct: 254 --LVFLLDVSGSMSSQDKLPLLKNALKMLSQQLDEGDRISIVVYAGASGVVLDGVKGNDT 311 Query: 301 -VDEHEFFYSQETGGTIVSSALKLMDEVVKERYNPAQWNIYAAQASDGD-NWADDSPLCH 358 + G T + ++L ++ ++ + N A+DGD N Sbjct: 312 LAISQALDKLKAGGSTNGGAGIELAYQLAQKHFIAGGVNRVIL-ATDGDFNVGVSDQQAL 370 Query: 359 EILAKKLLP---VVRYYSYIEITRRAHQTLWREYEHLQSTFDNFAMQHIRDQDDIYPVFR 415 E + ++ + + + + L + + + D R Sbjct: 371 EDMIEEKRKQGIALTTLGFGQ--GNYNDHLMEQLADKGNGHYAYI--------DTLNEAR 420 Query: 416 ELFHKQNAT 424 ++ + + Sbjct: 421 KVLVDEISA 429 >UniRef50_C0YQB8 von Willebrand factor, type A n=2 Tax=Flavobacteriaceae RepID=C0YQB8_9FLAO Length = 800 Score = 86.7 bits (213), Expect = 1e-15, Method: Composition-based stats. Identities = 48/276 (17%), Positives = 82/276 (29%), Gaps = 27/276 (9%) Query: 164 ISVVRSLQNSLARRTAMTAGKRRELHALEENLAIISNSEPAQLLEEERLRKEIAELRAKI 223 I V + +++ R + +EE + P E A Sbjct: 356 IDVDNASYSNVRRMINNGQVVDKNAVRIEEMVNYFKYDYPQPKNEN-PFSINTEYSDAPW 414 Query: 224 ERVPFIDTFDLRYKNYEKRPDPSSQAVMFCLMDVSGSMDQSTK-DMAKRFYILLYLFLSR 282 + L+ KN P+S + L+DVSGSM K + K + +L L Sbjct: 415 NPKHKLLKIGLQGKNLPMDKLPASN--LVFLIDVSGSMSDENKLPLLKSSFKVLLNQLRP 472 Query: 283 TYKNVEVVY------IRHHTQAKEVDE--HEFFYSQETGGTIVSSALKLMDEVVKERYNP 334 K VVY + T A E D+ Q G T + ++L ++ +E + Sbjct: 473 KDKVGIVVYAGSAGMVLPPTSAGEKDKIIEALDRLQAGGSTAGGAGIELAYKLAQENFVK 532 Query: 335 AQWNIYAAQASDGD-NWADDSPLCHEILAKKLLPV---VRYYSYIEITRRAHQTLWREYE 390 N A+DGD N S + L + + + Sbjct: 533 EGNNRVI-IATDGDFNVGTSSISDLKTLIEDRRKSGVFLTCLGFG--MGNYKDNTLETLA 589 Query: 391 HLQSTFDNFAMQHIRDQDDIYPVFRELFHKQNATAK 426 + + D + K+ A + Sbjct: 590 DKGNGNYAYI--------DNMQEANKFLGKEFAGSM 617 >UniRef50_A9KS19 von Willebrand factor type A n=1 Tax=Clostridium phytofermentans ISDg RepID=A9KS19_CLOPH Length = 551 Score = 86.4 bits (212), Expect = 2e-15, Method: Composition-based stats. Identities = 42/273 (15%), Positives = 82/273 (30%), Gaps = 35/273 (12%) Query: 165 SVVRSLQNSLARRTAMTAGKRRELHALEENLAIISNSEPAQLLEEERLRKEIAELRAKIE 224 ++ R L+ RR A + +EE L + + Sbjct: 122 NIRRMLKE--GRRVDTGAVR------IEEMLNYFNYDYKLPEGDS-PFGITTELSDCPWN 172 Query: 225 RVPFIDTFDLRYKNYEKRPDPSSQAVMFCLMDVSGSM-DQSTKDMAKRFYILLYLFLSRT 283 + ++ + + S + L+DVSGSM D+ + +R ++LL L+ Sbjct: 173 PDTKLFLAGIQTEKIDFSKSAPSN--LVFLIDVSGSMMDEDKLPLVQRAFLLLTENLTEK 230 Query: 284 YKNVEVVYIRHHTQA--------KEVDEHEFFYSQETGGTIVSSALKLMDEVVKERYNPA 335 + V Y + T KE ++ + G T S ++ ++ E Y Sbjct: 231 DRISIVTYAGNDTVVLSGAKGNQKEKIQNAITELEAGGSTFGSKGIETAYQLAMENYIEG 290 Query: 336 QWNIYAAQASDGD-NWADDSPLCHEILAKKLLP---VVRYYSYIEITRRAHQTLWREYEH 391 N A+DGD N S L ++ + + T Sbjct: 291 GNNRVIL-ATDGDLNVGVTSESELTNLIEEKRKSGVALSVLGFG--TGNIKDNKMEALAD 347 Query: 392 LQSTFDNFAMQHIRDQDDIYPVFRELFHKQNAT 424 + + D R++ ++ Sbjct: 348 HGNGNYAYI--------DSLMEARKVLVEEMGA 372 >UniRef50_C8SEV7 von Willebrand factor type A n=4 Tax=Rhizobiales RepID=C8SEV7_9RHIZ Length = 718 Score = 85.6 bits (210), Expect = 3e-15, Method: Composition-based stats. Identities = 33/240 (13%), Positives = 75/240 (31%), Gaps = 18/240 (7%) Query: 174 LARRTAMTAGKRRELHALEENLAIISNSEPAQLLEEERLRKEIAELRAKIERVPFIDTFD 233 + R + + +EE + ++ + + Sbjct: 279 VRRSLKEGFVPQADTVRVEEMINYFPYDWKGPDSASTPFNSTVSVMPTPWNTHTKLMHVA 338 Query: 234 LRYKNYEKRPDPSSQAVMFCLMDVSGSMDQSTK-DMAKRFYILLYLFLSRTYKNVEVVY- 291 ++ + + P + + L+DVSGSMD+ K + K + LL L V Y Sbjct: 339 IKGFDVKPTEQPKAN--LVFLIDVSGSMDEPDKLPLLKSAFRLLVSKLKADDTISIVTYA 396 Query: 292 -----IRHHTQAKEVDE--HEFFYSQETGGTIVSSALKLMDEVVKERYNPAQWNIYAAQA 344 + T+ E D+ + Q G T + +K ++ ++ + N A Sbjct: 397 GDAGTVLMPTKIAEKDKILNAIDNLQPGGSTAGEAGIKEAYKLAQQSFIKDGVNRVML-A 455 Query: 345 SDGD-NWADDSPLCHEILAKKLLPV---VRYYSYIEITRRAHQTLWREYEHLQSTFDNFA 400 +DGD N + L ++ + + + + + + + + Sbjct: 456 TDGDFNVGQTDDDDLKRLIEQERKTGVFLSVFGFG--RGNLNDEMMQTIAQNGNGTAAYI 513 >UniRef50_B0T5X0 von Willebrand factor type A n=1 Tax=Caulobacter sp. K31 RepID=B0T5X0_CAUSK Length = 592 Score = 85.2 bits (209), Expect = 4e-15, Method: Composition-based stats. Identities = 38/275 (13%), Positives = 80/275 (29%), Gaps = 26/275 (9%) Query: 164 ISVVRSLQNSLARRTAMTAGKRRELHALEENLAIISNSEPAQLLEEERLRKEIAELRAKI 223 I V + ++ R A + +EE + +E + + + + Sbjct: 147 IDVDTAAYANVRRFLNEGAAPPHDALRVEELINYFDYGYARPTAQEPPFKPTVTVVPSPW 206 Query: 224 ERVPFIDTFDLRYKNYEKRPDPSSQAVMFCLMDVSGSMDQSTK-DMAKRFYILLYLFLSR 282 + + ++ + P + L+D SGSM + +AK+ +L L Sbjct: 207 SQDRQLMHIGVQGYATPRAGQPPLN--LVFLIDTSGSMSGPDRLPLAKKALNVLIDQLRP 264 Query: 283 TYKNVEVVYIRH--------HTQAKEVDEHEFFYSQETGGTIVSSALKLMDEVVKERYNP 334 + V Y ++K + G T L+L + ++ +P Sbjct: 265 QDRVSMVAYAGSAGAVLSPTDGKSKLKMRCALTALRSGGSTAGGQGLELAYALARQNLDP 324 Query: 335 AQWNIYAAQASDGD-NWADDSPLCHEILAKKLLP---VVRYYSYIEITRRAHQTLWREYE 390 N +DGD N P + + Y + + T+ + Sbjct: 325 KAVNRVIL-MTDGDFNVGIADPTRLKDFVADQRKSGVYLSVYGFG--RGNYNDTMMQALA 381 Query: 391 HLQSTFDNFAMQHIRDQDDIYPVFRELFHKQNATA 425 + + D R+L +A Sbjct: 382 QNGNGTAAYV--------DGLQEARKLLRDDFDSA 408 >UniRef50_Q21MJ3 von Willebrand factor, type A n=5 Tax=Proteobacteria RepID=Q21MJ3_SACD2 Length = 708 Score = 84.0 bits (206), Expect = 1e-14, Method: Composition-based stats. Identities = 34/265 (12%), Positives = 82/265 (30%), Gaps = 26/265 (9%) Query: 174 LARRTAMTAGKRRELHALEENLAIISNSEPAQLLEEERLRKEIAELRAKIERVPFIDTFD 233 + R+ ++ EE + + P + I + + + + Sbjct: 271 VRRQLNSGYLPEKDAIRAEELINYFDYNYPLPSDSTAPFKPNITVIDSPWAKGKKLVHIG 330 Query: 234 LRYKNYEKRPDPSSQAVMFCLMDVSGSMDQSTK-DMAKRFYILLYLFLSRTYKNVEVVYI 292 L+ + P + + L+DVSGSM+ K + K+ +L L+ VVY Sbjct: 331 LKGYDIAPDQKPRTN--LVFLLDVSGSMNSQDKLPLVKQSMEMLLSTLNPDDTVAIVVYA 388 Query: 293 RHHTQAKEVDE--------HEFFYSQETGGTIVSSALKLMDEVVKERYNPAQWNIYAAQA 344 E Q G T + + L ++ + ++ N A Sbjct: 389 GAAGTVLEPTPAKDKQKILSAMQRLQAGGSTAGGAGIALAYDLAEANFDKKAVNRVIL-A 447 Query: 345 SDGD-NWADDSPLCHEILAKKLLPV---VRYYSYIEITRRAHQTLWREYEHLQSTFDNFA 400 +DGD N + + ++ + + + + L + + + Sbjct: 448 TDGDFNVGSTNNETLQGFVERKREKGIFLSVLGFGQ--GNYNDHLMQTLAQNGNGVAAYI 505 Query: 401 MQHIRDQDDIYPVFRELFHKQNATA 425 D +++ ++ +++ Sbjct: 506 --------DTVSEAQKVLVQEASSS 522 >UniRef50_Q2N8R4 Von Willebrand factor type A domain protein n=2 Tax=Erythrobacter RepID=Q2N8R4_ERYLH Length = 580 Score = 83.7 bits (205), Expect = 1e-14, Method: Composition-based stats. Identities = 38/261 (14%), Positives = 75/261 (28%), Gaps = 24/261 (9%) Query: 175 ARRTAMTAGKRRELHALEENLAIISNSEPAQLLEEERLRKEIAELRAKIERVPFIDTFDL 234 R + + EE + + R + L Sbjct: 143 RRFLSQGQMPPKAAVRTEEFINYFRYDYDRPQDRSQPFTVNFDAARTPWNEDTRLIRIGL 202 Query: 235 RYKNYEKRPDPSSQAVMFCLMDVSGSMDQSTK-DMAKRFYILLYLFLSRTYKNVEVVYIR 293 + E+ P + + LMDVSGSM + K + K L L K VVY Sbjct: 203 AGYDIERSERPPAN--LVFLMDVSGSMGRPDKLPLVKTALAGLAGELQPQDKVSIVVYAG 260 Query: 294 HHTQAKEVDEH------EFFYSQETGGTIVSSALKLMDEVVKERYNPAQWNIYAAQASDG 347 E Q G T + ++L ++ ++ + N A+DG Sbjct: 261 AAGLVLEPTNDTRKIRAALNQLQAGGSTAGGAGIQLAYQIAEDNFIEGGVNRVIL-ATDG 319 Query: 348 D-NWADDSPLCHEILAKKLLP---VVRYYSYIEITRRAHQTLWREYEHLQSTFDNFAMQH 403 D N S + +K + + T ++ + + + + + Sbjct: 320 DFNVGVSSRDALIEMIEKKRDSGITLTTLGFG--TGNYNEAMMEQIANHGNGNYAYI--- 374 Query: 404 IRDQDDIYPVFRELFHKQNAT 424 D +++ + ++ Sbjct: 375 -----DSALEAKKVLGDEMSS 390 >UniRef50_C6VVX3 von Willebrand factor type A n=17 Tax=Bacteria RepID=C6VVX3_DYAFD Length = 625 Score = 83.7 bits (205), Expect = 1e-14, Method: Composition-based stats. Identities = 41/274 (14%), Positives = 85/274 (31%), Gaps = 27/274 (9%) Query: 164 ISVVRSLQNSLARRTAMTAGKRRELHALEENLAIISNSEPAQLLEEERLRKEIAELRAKI 223 + V R+ +++ R + +EE + P E + + Sbjct: 167 VDVDRAAYSNVRRFLNNGQMPPEDAVRIEEMINYFDYDYPQPRGEH-PVAIVAETTDSPW 225 Query: 224 ERVPFIDTFDLRYKNYEKRPDPSSQAVMFCLMDVSGSMDQSTK-DMAKRFYILLYLFLSR 282 + L+ K +S + L+DVSGSM+++ K + K+ + LL L Sbjct: 226 NPGLKLVHIGLQAKTVSAENLSASN--LVFLIDVSGSMNEANKLPLLKQAFKLLADQLRV 283 Query: 283 TYKNVEVVYIRH--------HTQAKEVDEHEFFYSQETGGTIVSSALKLMDEVVKERYNP 334 K V Y K+ + + G T ++L ++ K+ + P Sbjct: 284 EDKISIVAYAGSAGMVLAPTSGSEKKTIKDALDKLEAGGSTAGGEGIELAYDLAKKHFLP 343 Query: 335 AQWNIYAAQASDGD-NWADDSPLCHEILAKKLLPV---VRYYSYIEITRRAHQTLWREYE 390 N A+DGD N + + L ++ + + + Sbjct: 344 KGNNRVIL-ATDGDFNVGISNESELQKLIEEKRKAGIFLSVMGFG--MGNYKDSHVETLA 400 Query: 391 HLQSTFDNFAMQHIRDQDDIYPVFRELFHKQNAT 424 + + D R++F ++ Sbjct: 401 DKGNGNYAYI--------DNIQEARKVFVQEFGG 426 >UniRef50_Q4KKB4 von Willebrand factor type A domain protein n=20 Tax=Proteobacteria RepID=Q4KKB4_PSEF5 Length = 582 Score = 83.3 bits (204), Expect = 2e-14, Method: Composition-based stats. Identities = 39/265 (14%), Positives = 72/265 (27%), Gaps = 26/265 (9%) Query: 176 RRTAMTAGKRRELHALEENLAIISNSEPAQLLEEERLRKEIAELRAKIERVPFIDTFDLR 235 RR E E L + A + + + ++ Sbjct: 129 RRLLNQGSLPPEGAVRLEELVNYFPYDYALPTDGSPFGVTTELAPSPWNPHTRLLRIGIK 188 Query: 236 YKNYEKRPDPSSQAVMFCLMDVSGSMDQST-KDMAKRFYILLYLFLSRTYKNVEVVYIRH 294 + + + L+DVSGSMD+ + K LL L + VVY Sbjct: 189 ASDRAVAELAPAN--LVFLVDVSGSMDRREGLPLVKSTLKLLVDQLRDQDRVSLVVYAGE 246 Query: 295 HTQAKEVD--------EHEFFYSQETGGTIVSSALKLMDEVVKERYNPAQWNIYAAQASD 346 E G T +S ++L ++ ++ + N A+D Sbjct: 247 SRVVLEPTSGRDKAKIRTAIDQLTAGGSTAGASGIQLAYQMAQQGFIDQGINRILL-ATD 305 Query: 347 GD-NWAD---DSPLCHEILAKKLLPVVRYYSYIEITRRAHQTLWREYEHLQSTFDNFAMQ 402 GD N DS +K + + ++ L + + Sbjct: 306 GDFNVGVSDFDSLKAMAAEKRKSGVSLTTLGFG--VDNYNEHLMEQLADAGDGNYAYI-- 361 Query: 403 HIRDQDDIYPVFRELFHKQNATAKG 427 D R++ Q ++ Sbjct: 362 ------DNLREARKVLVDQLSSTLA 380 >UniRef50_UPI0001C375AE von Willebrand factor type A n=1 Tax=Ruminococcus flavefaciens FD-1 RepID=UPI0001C375AE Length = 550 Score = 82.5 bits (202), Expect = 2e-14, Method: Composition-based stats. Identities = 46/291 (15%), Positives = 87/291 (29%), Gaps = 30/291 (10%) Query: 141 QQRQLTEYKTHRAGYTANGVPANISVVRSLQNS---------LARRTAMTAGKRRELHAL 191 ++ +L GYT G S S ++ + R + + Sbjct: 75 EEPELPSANEEYKGYTEAGFKDTKSEPLSTFSADVDTASYTNVRRLIENRNIVPEDAVRI 134 Query: 192 EENLAIISNSEPAQLLEEERLRKEIAELRAKIERVPFIDTFDLRYKNYEKRPDPSSQAVM 251 EE + P R + R + ++ K +++ P S + Sbjct: 135 EEFINYFDYDYPQPEDGSAFGRY-VEIADCPWNRDHKLMMVGIQGKELQQQETPPSN--L 191 Query: 252 FCLMDVSGSMDQSTK-DMAKRFYILLYLFLSRTYKNVEVVYIRHHT------QAKEVDE- 303 L+D SGSM+ K + + + +L L + + V Y + DE Sbjct: 192 VFLIDSSGSMNSYDKLPLVQSAFSMLAEQLDKNDRISIVTYAGSSAVLLDGEKGSNTDEI 251 Query: 304 -HEFFYSQETGGTIVSSALKLMDEVVKERYNPAQWNIYAAQASDGD-NWADDSPLCHEIL 361 + + +G T +K E+ +E + N A+DGD N S L Sbjct: 252 LEQLYSITASGSTNGEGGIKTAYELAEEHFIKGGNNRVIL-ATDGDLNVGASSEEELTRL 310 Query: 362 AKKLLP---VVRYYSYIEITRRAHQTLWREYEHLQSTFDNFAMQHIRDQDD 409 + + + E + NF+ D+ + Sbjct: 311 IETKRDNGIYLSVLGFGE--GNYKDARMEALADNGNG--NFSYIDSEDEAE 357 >UniRef50_C7PNZ7 von Willebrand factor type A n=5 Tax=Bacteroidetes RepID=C7PNZ7_CHIPD Length = 639 Score = 82.5 bits (202), Expect = 2e-14, Method: Composition-based stats. Identities = 42/274 (15%), Positives = 76/274 (27%), Gaps = 27/274 (9%) Query: 164 ISVVRSLQNSLARRTAMTAGKRRELHALEENLAIISNSEPAQLLEEERLRKEIAELRAKI 223 I V R+ +++ R + +EE + + + Sbjct: 191 IDVDRASYSNVRRFLNEGNMPPVDAVRVEEMINYFDYKY-SNPTGNTPVAVRTDMAICPW 249 Query: 224 ERVPFIDTFDLRYKNYEKRPDPSSQAVMFCLMDVSGSMDQSTK-DMAKRFYILLYLFLSR 282 + L+ K+ K P S + L+DVSGSM + K + K+ + LL L Sbjct: 250 NTAHQLVRIALKGKDVAKDNLPPSN--LVFLIDVSGSMSDAKKLPLVKQAFKLLVNQLRP 307 Query: 283 TYKNVEVVYI--------RHHTQAKEVDEHEFFYSQETGGTIVSSALKLMDEVVKERYNP 334 + VVY K + G T ++L + E Sbjct: 308 VDRVAIVVYAGAAGLVLPSTSGDHKTAILDALDKLEAGGSTAGGEGVQLAYKTATEYLLK 367 Query: 335 AQWNIYAAQASDGD-NWADDSPLCHEILAKKLLPV---VRYYSYIEITRRAHQTLWREYE 390 + N A+DGD N S + + +K + + Sbjct: 368 SGNNRVI-IATDGDFNVGPSSDGELQRIIEKKREKGIFLSVLGFG--MGNYKDNKLELLA 424 Query: 391 HLQSTFDNFAMQHIRDQDDIYPVFRELFHKQNAT 424 + + D + R F + Sbjct: 425 DKGNGNYAYI--------DNFEEARRTFATEFGG 450 >UniRef50_C0CVB5 Putative uncharacterized protein n=1 Tax=Clostridium asparagiforme DSM 15981 RepID=C0CVB5_9CLOT Length = 556 Score = 82.5 bits (202), Expect = 3e-14, Method: Composition-based stats. Identities = 42/266 (15%), Positives = 75/266 (28%), Gaps = 27/266 (10%) Query: 172 NSLARRTAMTAGKRRELHALEENLAIISNSEPAQLLEEERLRKEIAELRAKIERVPFIDT 231 +L R+ + +EE L + P E+E + Sbjct: 105 ANLRRKILEGNEVPADAVRIEEMLNYFTYDYPEP-TEDEPFSVTTYIGDCPWNENHKLLQ 163 Query: 232 FDLRYKNYEKRPDPSSQAVMFCLMDVSGSMDQSTKD-MAKRFYILLYLFLSRTYKNVEVV 290 L+ + + S + L+DVSGSM+ + K + KR ++LL L V Sbjct: 164 IGLQAEKPDLENQKPSN--LVFLIDVSGSMESADKLGLVKRAFLLLTENLRPEDTVSIVT 221 Query: 291 YIRHHT--------QAKEVDEHEFFYSQETGGTIVSSALKLMDEVVKERYNPAQWNIYAA 342 Y T + K G T S ++ + +E + N Sbjct: 222 YASSDTVVLDGVSGEEKAAIMTAIENLTAGGSTDGSKGIETAYRLAEEHFQKDGNNRVIL 281 Query: 343 QASDGD-NWADDSPLCHEILAKKLLPV---VRYYSYIEITRRAHQTLWREYEHLQSTFDN 398 A+DGD N S L +K + + T + Sbjct: 282 -ATDGDLNLGLTSEGDLTRLIQKKKESGVFLSVMGFG--TGNIKDNKMEALADNGNGQYA 338 Query: 399 FAMQHIRDQDDIYPVFRELFHKQNAT 424 + D + + ++ Sbjct: 339 YV--------DSLMEAKRVLVEELGG 356 >UniRef50_UPI0001913F8A hypothetical protein Salmonellaentericaenterica_26029 n=2 Tax=Salmonella enterica subsp. enterica serovar Typhi RepID=UPI0001913F8A Length = 88 Score = 81.4 bits (199), Expect = 6e-14, Method: Composition-based stats. Identities = 78/88 (88%), Positives = 82/88 (93%) Query: 145 LTEYKTHRAGYTANGVPANISVVRSLQNSLARRTAMTAGKRRELHALEENLAIISNSEPA 204 LT KTHRAG+T+NGVPANISVVRSLQNSLARRTAMTAGKRRELHALE L IS+SEPA Sbjct: 1 LTSNKTHRAGFTSNGVPANISVVRSLQNSLARRTAMTAGKRRELHALETELETISHSEPA 60 Query: 205 QLLEEERLRKEIAELRAKIERVPFIDTF 232 QLLEEERLR+EIAELRAKIERVPFIDTF Sbjct: 61 QLLEEERLRREIAELRAKIERVPFIDTF 88 >UniRef50_B8F8Z5 von Willebrand factor type A n=1 Tax=Desulfatibacillum alkenivorans AK-01 RepID=B8F8Z5_DESAA Length = 558 Score = 78.7 bits (192), Expect = 4e-13, Method: Composition-based stats. Identities = 42/319 (13%), Positives = 86/319 (26%), Gaps = 48/319 (15%) Query: 122 DEYLDLLFEDLALPNLKQNQQRQLTEYKTHRAGYTANGVPANISVVRSLQNSLARRTAMT 181 +EY + P + + N+ V +++ Sbjct: 90 EEYAPIREGGFKSPLYDPLSTFSIDVDTASYSNVRRFLSYGNMPPVDAVR---------- 139 Query: 182 AGKRRELHALEENLAIISNSEPAQLLEEERLRKEIAELRAKIERVPFIDTFDLRYKNYEK 241 +EE + P ++ + + R + L+ + + Sbjct: 140 ---------IEEMINYFHYDYPQPKG-QDPFSITMEMSQCPWNRDNMLVHVGLQGRCLDY 189 Query: 242 RPDPSSQAVMFCLMDVSGSMDQSTK-DMAKRFYILLYLFLSRTYKNVEVV-------YIR 293 + S + L+DVSGSM+ K + KR +L L V +V + Sbjct: 190 KDVKPSN--LVFLLDVSGSMNSENKLPLVKRSMEMLVKELGAG-DRVSIVTYAGSAGLVL 246 Query: 294 HHTQAKEVDE--HEFFYSQETGGTIVSSALKLMDEVVKERYNPAQWNIYAAQASDGD-NW 350 T A+ + + G T ++L V E P N +DGD N Sbjct: 247 PSTSARNKRKIITALDRLEAGGSTAGGEGIELAYRVAWENLIPEGNNRVIL-CTDGDFNV 305 Query: 351 A-DDSPLCHEILAKKLLP--VVRYYSYIEITRRAHQTLWREYEHLQSTFDNFAMQHIRDQ 407 +P ++ +K + + + + + Sbjct: 306 GVSSTPELVRMIEEKRRAGIYLTICGFG--MGNYKDEKMEAISNAGNGNFYYI------- 356 Query: 408 DDIYPVFRELFHKQNATAK 426 D ++F + Sbjct: 357 -DSRREAHKVFVQDMRANM 374 >UniRef50_A3PN61 von Willebrand factor, type A n=9 Tax=Rhodobacteraceae RepID=A3PN61_RHOS1 Length = 651 Score = 77.5 bits (189), Expect = 1e-12, Method: Composition-based stats. Identities = 37/250 (14%), Positives = 61/250 (24%), Gaps = 18/250 (7%) Query: 164 ISVVRSLQNSLARRTAMTAGKRRELHALEENLAIISNSEPAQLLEEERLRKEIAELRAKI 223 I V + L RE +EE + PA R ++ R Sbjct: 212 IDVDTASYAILRSSLRAGQLPPREAVRIEEMINYFPYDYPAPENGTPPFRPTLSITRTPW 271 Query: 224 ERVPFIDTFDLRYKNYEKRPDPSSQAVMFCLMDVSGSMDQSTK-DMAKRFYILLYLFLSR 282 + L+ + P + L+D SGSM K + K+ + L+ L Sbjct: 272 NPETRLVHVALQGRMPAIEDRPPLN--LVFLIDTSGSMQDPAKLPLLKQSFGLMLGRLRP 329 Query: 283 TYKNVEVVYIRHHTQAKEVDE--------HEFFYSQETGGTIVSSALKLMDEVVKERYNP 334 + V Y + G T L L E Sbjct: 330 EDQVAIVTYAGSAGEVLAPTAANQRSTILSALDRLDAGGSTAGDEGLALAYRTASEMAGA 389 Query: 335 AQWNIYAAQASDGD-NWADDSPLCHEILA---KKLLPVVRYYSYIEITRRAHQTLWREYE 390 + A+DGD N P L + + + + Sbjct: 390 GEVTRVVL-ATDGDFNLGISDPEELARLVAHERDTGVYLSVLGFG--RGNLDDATMQALA 446 Query: 391 HLQSTFDNFA 400 + + Sbjct: 447 QNGNGQAAYI 456 >UniRef50_UPI00016C54C8 hypothetical protein GobsU_21830 n=1 Tax=Gemmata obscuriglobus UQM 2246 RepID=UPI00016C54C8 Length = 638 Score = 76.3 bits (186), Expect = 2e-12, Method: Composition-based stats. Identities = 34/266 (12%), Positives = 72/266 (27%), Gaps = 26/266 (9%) Query: 172 NSLARRTAMTAGKRRELHALEENLAIISNSEPAQLLEEERLRKEIAELRAKIERVPFIDT 231 ++ R L E + S + + + + Sbjct: 197 ANVRRMLNEGTLPPASAVFLAEFVNYFPYSYAPPPAGADPVAFHVEMGPCPWNAKHHLLR 256 Query: 232 FDLRYKNYEKRPDPSSQAVMFCLMDVSGSMDQSTK-DMAKRFYILLYLFLSRTYKNVEVV 290 ++ P + L+D SGSM Q + + ++ LL L+ + V Sbjct: 257 VGVQAHQIPAEKLPPRN--LVFLVDTSGSMQQENRLPLVQKSLELLVEKLTEKDRVSVVT 314 Query: 291 YIRHHTQAKEVDEHE--------FFYSQETGGTIVSSALKLMDEVVKERYNPAQWNIYAA 342 Y A Q GGT +K + ++ + N Sbjct: 315 YAGDSRVALPPTSGADKKAILDVVTGLQANGGTNGEGGIKKAYQFARDTFLDGGVNRVIL 374 Query: 343 QASDGD-NWA-DDSPLCHEILAKKLLPV--VRYYSYIEITRRAHQTLWREYEHLQSTFDN 398 +DGD N D+ +++ ++ + Y +E + + Sbjct: 375 -CTDGDFNVGVVDNGELVKLIEEQRKSKVFLTVLGYG--MGNYKDDRLKELANHGNGHHA 431 Query: 399 FAMQHIRDQDDIYPVFRELFHKQNAT 424 + D +++F +Q Sbjct: 432 YI--------DTLDEAKKVFVEQGGA 449 >UniRef50_A5WCP1 von Willebrand factor, type A n=2 Tax=Moraxellaceae RepID=A5WCP1_PSYWF Length = 571 Score = 76.3 bits (186), Expect = 2e-12, Method: Composition-based stats. Identities = 40/306 (13%), Positives = 92/306 (30%), Gaps = 31/306 (10%) Query: 137 LKQNQQRQLTEYKTHRAGYTANGVPANISVVRSLQN--SLARRTAMTAGKRRELHALEEN 194 + QQ E + + T+ A +S+ + ++ R ++ +EE Sbjct: 100 MAPKQQENYAEIEPNAVNATSEQAFATLSIDTDTGSYANVRRFLNQGQLPPKDAVRVEEL 159 Query: 195 LAIISNS-EPAQLLEEERLRKEIAELRAKIERVPFIDTFDLRYKNY--EKRPDPSSQAVM 251 + + A+ + + I ++ ++ K+ P + + Sbjct: 160 INYFNYDFTAAKKQANAPFLVSTEVVNSPWHPTNQIVKVGIKAEDLLTAKQKQPPAN--L 217 Query: 252 FCLMDVSGSMDQSTK-DMAKRFYILLYLFLSRTYKNVEVVYIRHHTQAKEVDE------- 303 L+DVSGSMD K +AK +L L + Y + Sbjct: 218 VFLVDVSGSMDTEDKLQLAKSSLKMLTKQLRAQDSITLITYAGNTKVVLPSTPGNQTQKI 277 Query: 304 -HEFFYSQETGGTIVSSALKLMDEVVKERYNPAQWNIYAAQASDGD-NWADDSPLCHEIL 361 + +G T +A+KL + E + N +DGD N S + Sbjct: 278 LNAIDNLTASGSTNGEAAIKLAYQQATEHFKKDGINR-ILMLTDGDFNVGVSSVKDMLQI 336 Query: 362 AKKLLPV---VRYYSYIEITRRAHQTLWREYEHLQSTFDNFAMQHIRDQDDIYPVFRELF 418 + + + + + + + + ++ D +++ Sbjct: 337 IRSNRDKGISLSTLGFGQ--GNYNDHMMEQVADNGNGNYSYI--------DSLSEAKKVL 386 Query: 419 HKQNAT 424 + + Sbjct: 387 IDEMSA 392 >UniRef50_Q7UP85 Putative uncharacterized protein n=1 Tax=Rhodopirellula baltica RepID=Q7UP85_RHOBA Length = 885 Score = 76.0 bits (185), Expect = 3e-12, Method: Composition-based stats. Identities = 55/410 (13%), Positives = 109/410 (26%), Gaps = 61/410 (14%) Query: 31 KQSISEAINKRSVTDVDSGESVSIPTEDISEPMFHQGRGGLRHRVHPGNDHFVQNDRIER 90 K A + S G+ P + R R + + + Sbjct: 316 KDEAPTAPREPSAGKPVVGDFAVAPVPE-QLGRQQFDFRASRGRTLE--RQLGETEELA- 371 Query: 91 PQGGGGGSGSGQGQASQDGEGQ--DEFVFQISKDEYLDLLFEDLALPNLKQNQQRQLTEY 148 P G G D+F P +++N+ R++ + Sbjct: 372 PTSDRLAILPPTPDGEGQGPGMSGDKFE------------------P-IQENEFRRVADD 412 Query: 149 KTHRAGYTANGVPANISVVRSLQNSLARRTAMTAGKRRELHALEENLAIISNSEPAQLLE 208 + ++ + A+ + VRS R + +EE + E Sbjct: 413 A--LSTFSIDVDTASYAKVRSYLQR-------GQLPRPDSVRIEELINYFDYQYTPPSAE 463 Query: 209 EE-RLRKEIAELRAKIERVPFIDTFDLRYKNYEKRPDPSSQAVMFCLMDVSGSMDQSTK- 266 + +A + ++ K+ +++ P + L+D SGSM + K Sbjct: 464 DPVPFSSAMAVASCPWNENNRLVRVGIQAKDIDRKERPRCN--LVFLIDTSGSMKRPNKL 521 Query: 267 DMAKRFYILLYLFLSRTYKNVEVVYIRHHTQAKEVDE--------HEFFYSQETGGTIVS 318 + +L L + VVY + G T Sbjct: 522 PLVIEGMKVLLDQLKNRDRVAIVVYAGSSGLVLDSTPVKQKKKIIRALSALSAGGSTNGG 581 Query: 319 SALKLMDEVVKERYNPAQWNIYAAQASDGD-NW---ADDSPLCHEILAKKLLPVVRYYSY 374 + L+L + +E + N SDGD N D + K + + Sbjct: 582 AGLQLAYQTARENFIEDGVNRVIL-CSDGDFNVGMTGTDQLVAEATRQSKSGTELTVLGF 640 Query: 375 IEITRRAHQTLWREYEHLQSTFDNFAMQHIRDQDDIYPVFRELFHKQNAT 424 + + + + F D +++ Q A Sbjct: 641 G--MGNHNDAMMERISNSGAGNYAFV--------DTIAEAKKVLADQVAG 680 >UniRef50_B4WCU1 von Willebrand factor type A domain protein n=2 Tax=Caulobacteraceae RepID=B4WCU1_9CAUL Length = 613 Score = 76.0 bits (185), Expect = 3e-12, Method: Composition-based stats. Identities = 41/274 (14%), Positives = 75/274 (27%), Gaps = 29/274 (10%) Query: 164 ISVVRSLQNSLARRTAMTAGKRRELHALEENLAIISNSEPAQLLEEERLRKEIAELRAKI 223 I V + +++ R + +EE + A + + Sbjct: 160 IDVDTAAYSNVRRFIDEGRSPPADAVRVEELINAFDYGYARPTSLARPFAITTAVVASPW 219 Query: 224 ERVPF-----IDTFDLRYKNYEKRPDPSSQAVMFCLMDVSGSMDQSTK-DMAKRFYILLY 277 I L+ + + L+DVSGSM K D+AK+ L Sbjct: 220 APRTERGGRQIVHIGLQGYELPQGEQRPLN--LTFLVDVSGSMRSPDKLDLAKQAMNLAI 277 Query: 278 LFLSRTYKNVEVVYIRH---------HTQAKEVDEHEFFYSQETGGTIVSSALKLMDEVV 328 L + V Y K + +GGT ++ + + Sbjct: 278 DRLRPQ-DTLSVTYYAEGAGTTLQPTPGDQKLKMRCAVASLRASGGTAGATGMTNAYDQA 336 Query: 329 KERYNPAQWNIYAAQASDGD-NWA-DDSPLCHEILAKKLLPVVRYYSYIEITRRAHQTLW 386 + + + N +DGD N D+ + +A+K V Y Sbjct: 337 QASFARDKVNR-ILMFTDGDFNVGVTDNKRLEDYVAEKRGTGVYLSVYGFGRGNYQDARM 395 Query: 387 REYEHLQSTFDNFAMQHIRDQDDIYPVFRELFHK 420 + + + D+ R LF Sbjct: 396 QTIAQAGNGVAAYV-------GDL-RDARRLFGP 421 >UniRef50_C8N8N5 von Willebrand factor type A domain protein n=2 Tax=Gammaproteobacteria RepID=C8N8N5_9GAMM Length = 563 Score = 74.8 bits (182), Expect = 5e-12, Method: Composition-based stats. Identities = 38/283 (13%), Positives = 81/283 (28%), Gaps = 36/283 (12%) Query: 158 NGVPANISVVRSLQNSLARRTAMTAGKRRELHALEENLAIISNSEPAQLLEEERLRKEIA 217 G +NI + + +N L + +EE L + P + + Sbjct: 124 TGSYSNIRRMLTRENRL---------PPADAVRVEEILNYFAYGYPLPQ-DGKPFAVHTQ 173 Query: 218 ELRAKIERVPFIDTFDLRYKNYEKRPDPSSQAVMFCLMDVSGSMDQSTK-DMAKRFYILL 276 + + + + ++ + P + + L+D SGSMD K + K+ Sbjct: 174 TVDSPWQADAKLIRIAIQAADLAPEKRPPAN--LVFLIDTSGSMDDPDKLPLVKKTVCHF 231 Query: 277 YLFLSRTYKNVEVVYIRHHTQ--------AKEVDEHEFFYSQETGGTIVSSALKLMDEVV 328 L + + Y + KE + G T AL++ + Sbjct: 232 AEALRADDRISLITYSGSTAEILPPTAGDQKETIIAALKPLRAHGATAGGEALRMAYDAA 291 Query: 329 KERYNPAQWNIYAAQASDGD-NWADDSPLCHEILAKKLLP---VVRYYSYIEITRRAHQT 384 + Y N A+DGD N P + + Y + + Sbjct: 292 AKNYRKDGINRILL-ATDGDFNVGISDPATLKNYVADKRKSGISLTTLGYG--SGNYNDE 348 Query: 385 LWREYEHLQSTFDNFAMQHIRDQDDIYPVFRELFHKQNATAKG 427 + + ++ D +++ +Q + Sbjct: 349 MMEQLADAGDGNYSYI--------DSEAEAKKVLVRQLTSTLA 383 >UniRef50_C0Z5D5 Putative uncharacterized protein n=1 Tax=Brevibacillus brevis NBRC 100599 RepID=C0Z5D5_BREBN Length = 513 Score = 74.8 bits (182), Expect = 6e-12, Method: Composition-based stats. Identities = 39/252 (15%), Positives = 76/252 (30%), Gaps = 28/252 (11%) Query: 186 RELHALEENLAIISNSEPAQLLEEERLRKEIAELRAKIERVPFIDTFDLRYKNYEKRPDP 245 E +EE + S PA + + + ++ I ++ K + Sbjct: 119 AEAVRVEEFINFFPTSYPAP--TNQTFAIQADSGPSPFQKNLQIVRIGIKGKELSPKERK 176 Query: 246 SSQAVMFCLMDVSGSMDQSTK-DMAKRFYILLYLFLSRTYKNVEVVYIRH--------HT 296 + + ++DVSGSM+Q + ++ K+ +L L T VVY T Sbjct: 177 PAN--LVFVIDVSGSMNQENRLELVKKSLHVLVDQLQPTDSVGIVVYGSEGRVLLPPTST 234 Query: 297 QAKEVDEHEFFYSQETGGTIVSSALKLMDEVVKERYNPAQWNIYAAQASDG-DNWADDSP 355 + K+ Q G T L L E+ + P N SDG N + Sbjct: 235 EDKQAILSAIDELQPEGSTNAEQGLVLGYEMAARSFKPPAINRVIL-CSDGVANVGETGA 293 Query: 356 LCHEILAKKLLP---VVRYYSYIEITRRAHQTLWREYEHLQSTFDNFAMQHIRDQDDIYP 412 + + + + + + + + + D + Sbjct: 294 EGILRSIEDYARKDIYLSSFGFG--MGNYNDVMMEQLANKGEGSYAYI--------DTFS 343 Query: 413 VFRELFHKQNAT 424 R +F + Sbjct: 344 EARRIFTESLTG 355 >UniRef50_A1ZHE2 Von Willebrand factor type A domain protein n=1 Tax=Microscilla marina ATCC 23134 RepID=A1ZHE2_9SPHI Length = 704 Score = 74.4 bits (181), Expect = 8e-12, Method: Composition-based stats. Identities = 34/259 (13%), Positives = 77/259 (29%), Gaps = 27/259 (10%) Query: 164 ISVVRSLQNSLARRTAMTAGKRRELHALEENLAIISNSEPAQLLEEE--------RLRKE 215 I V + +++ R + +EE + P ++ Sbjct: 252 IDVDNASYSNVRRFVNDGQPLPKNAVRVEEMINYFEYDYPQPTPTKDKEGKLQTHPFSVN 311 Query: 216 IAELRAKIERVPFIDTFDLRYKNYEKRPDPSSQAVMFCLMDVSGSMDQSTK-DMAKRFYI 274 + L+ +N + + + + L+D SGSMD K + KR + Sbjct: 312 TEYGTCPWNPHHKLLQIGLQGENLQTKNASPAN--LVFLVDASGSMDSEDKLPLLKRSFK 369 Query: 275 LLYLFLSRTYKNVEV---------VYIRHHTQAKEVDEHEFFYSQETGGTIVSSALKLMD 325 +L L+ + + + V +E + G T ++L Sbjct: 370 VLLKQLTDSRTKIAIVAYAGASGLVLPATSVSHREKILTALENIESGGSTAGGEGIELAY 429 Query: 326 EVVKERYNPAQWNIYAAQASDGD-NWADDSPLCHEILAKKLLP---VVRYYSYIEITRRA 381 ++ ++ + N A+DGD N S L + + T Sbjct: 430 KIAQQAFIAGGNNRVIL-ATDGDFNVGLSSDEELMQLISNKRKSGVYLTCLGFG--TGNL 486 Query: 382 HQTLWREYEHLQSTFDNFA 400 + ++ + + + + Sbjct: 487 NDSMMEKLTNAGNGNYYYI 505 >UniRef50_A9IU52 Putative lipoprotein n=3 Tax=Bordetella RepID=A9IU52_BORPD Length = 582 Score = 74.0 bits (180), Expect = 9e-12, Method: Composition-based stats. Identities = 34/262 (12%), Positives = 69/262 (26%), Gaps = 22/262 (8%) Query: 174 LARRTAMTAGKRRELHALEENLAIISNSEPAQLLEEERLRKEIAELRAKIERVPFIDTFD 233 + R + EE + ++ A + Sbjct: 147 VRRLLNEGRLPPPDAVRAEEFINYFDYGYATPDSRQQPFSIITEVSAAPWNPQRQLLKIG 206 Query: 234 LRYKNYEKRPDPSSQAVMFCLMDVSGSMDQSTK-DMAKRFYILLYLFLSRTYKNVEVVYI 292 ++ + P++ + L+D SGSM + K + K L L + V Y Sbjct: 207 IQGYRVAPQDIPAAN--LVFLVDTSGSMAERDKLPLIKGALKQLVAQLRPQDRVAIVTYA 264 Query: 293 RH--------HTQAKEVDEHEFFYSQETGGTIVSSALKLMDEVVKERYNPAQWNIYAAQA 344 K + G T + L L + + N A Sbjct: 265 GQASMTLDSTPGDQKARINAAIDELRAAGSTNGGAGLDLAYAQAAKGFVKGGVNRILL-A 323 Query: 345 SDGD-NWADDSPLCHE-ILAKKLLPVVRYYSYIEITRRAHQTLWREYEHLQSTFDNFAMQ 402 SDGD N + +A++ + + + L + + ++ Sbjct: 324 SDGDFNVGATDLEDLKDKIARQRQGGIALTTLGVGGGNFNDALAMQLADAGNGSYHYL-- 381 Query: 403 HIRDQDDIYPVFRELFHKQNAT 424 D R++ Q ++ Sbjct: 382 ------DSLREARKVLAAQMSS 397 >UniRef50_Q28U54 von Willebrand factor type A n=3 Tax=Rhodobacterales RepID=Q28U54_JANSC Length = 686 Score = 74.0 bits (180), Expect = 1e-11, Method: Composition-based stats. Identities = 40/271 (14%), Positives = 74/271 (27%), Gaps = 33/271 (12%) Query: 170 LQNSLARRTAMTAGKRRELHALEENLAIISNSEPAQLLEE-ERLRKEIAELRAKIERVPF 228 L+++L R A + +EE + PA ++ R + Sbjct: 253 LRSTLNR----GALPAPDAVRIEEMVNYFPYDYPAPTADDISPFRPNVQVFETPWNPDTQ 308 Query: 229 IDTFDLRYKNYEKRPDPSSQAVMFCLMDVSGSMDQSTK--DMAKRFYILLYLFLSRTYKN 286 + ++ P + L+D SGSM+ K + + F ++L LS + Sbjct: 309 LVHIGIQGDLPVVEDRPPLN--LVFLIDTSGSMNDPAKLPLLIQSFRLMLNR-LSPEDEV 365 Query: 287 VEVVYIRHHTQAKEVD--------EHEFFYSQETGGTIVSSALKLMDEVVKERYNPAQWN 338 V Y A E Q G T L+ + E + + Sbjct: 366 AIVTYAGSAGVALEPTAASDTATINAALTTLQAGGSTNGVGGLEEAYRLAGEMMVDGEVS 425 Query: 339 IYAAQASDGD-NWADDSPLCHEILAKKLLP---VVRYYSYIEITRRAHQTLWREYEHLQS 394 A+DGD N E + + + + + Sbjct: 426 RVLL-ATDGDFNVGLSDAGALEDYIAEQRDTGIYLSVLGFG--RGNLQDDTMQALAQNGN 482 Query: 395 TFDNFAMQHIRDQDDIYPVFRELFHKQNATA 425 ++ D + + Q A A Sbjct: 483 GTASYI--------DTLHEAQRVLVDQLAGA 505 >UniRef50_P76481 Uncharacterized protein yfbK n=38 Tax=Enterobacteriaceae RepID=YFBK_ECOLI Length = 575 Score = 73.6 bits (179), Expect = 1e-11, Method: Composition-based stats. Identities = 45/284 (15%), Positives = 79/284 (27%), Gaps = 24/284 (8%) Query: 158 NGVPANISVVRSLQNSLARRTAMTAGKRRELHALEENLA---IISNSEPAQLLEEERLRK 214 G AN V R L L + + + I + + + Sbjct: 130 TGSYAN--VRRFLNQGL-----LPPPDAVRVEEIVNYFPSDWDIKDKQSIPASKPIPFAM 182 Query: 215 EIAELRAKIERVPFIDTFDLRYKNYEKRPDPSSQAVMFCLMDVSGSM-DQSTKDMAKRFY 273 A + D+ K+ + P+S + L+D SGSM + + Sbjct: 183 RYELAPAPWNEQRTLLKVDILAKDRKSEELPASN--LVFLIDTSGSMISDERLPLIQSSL 240 Query: 274 ILLYLFLSRTYKNVEVVYIRHHTQA--------KEVDEHEFFYSQETGGTIVSSALKLMD 325 LL L V Y A K G T + L+L Sbjct: 241 KLLVKELREQDNIAIVTYAGDSRIALPSISGSHKAEINAAIDSLDAEGSTNGGAGLELAY 300 Query: 326 EVVKERYNPAQWNIYAAQASDGD-NWADDSPLCHEILAKKLLPV-VRYYSYIEITRRAHQ 383 + + + N A+DGD N D P E + KK V ++ ++ Sbjct: 301 QQATKGFIKGGINRILL-ATDGDFNVGIDDPKSIESMVKKQRESGVTLSTFGVGNSNYNE 359 Query: 384 TLWREYEHLQSTFDNFAMQHIRDQDDIYPVFRELFHKQNATAKG 427 + + + ++ Q + R++ K Sbjct: 360 AMMVRIADVGNGNYSYIDTLSEAQKVLNSEMRQMLITVAKDVKA 403 >UniRef50_A5F9T1 von Willebrand factor, type A n=9 Tax=Bacteria RepID=A5F9T1_FLAJ1 Length = 709 Score = 73.3 bits (178), Expect = 2e-11, Method: Composition-based stats. Identities = 46/276 (16%), Positives = 78/276 (28%), Gaps = 27/276 (9%) Query: 164 ISVVRSLQNSLARRTAMTAGKRRELHALEENLAIISNSEPAQLLEEERLRKEIAELRAKI 223 I V + ++ R ++ +EE + + P E + Sbjct: 264 IDVDNASYTNIRRFLNSGQEVPKDAVRVEEMVNFFKYNYPQPKNEH-PFSINTEYSDSPW 322 Query: 224 ERVPFIDTFDLRYKNYEKRPDPSSQAVMFCLMDVSGSMDQSTK-DMAKRFYILLYLFLSR 282 I L+ KN PSS + L+DVSGSM+ K + K+ +L L Sbjct: 323 NSQNKILKIGLQGKNIATNDLPSSN--LVFLIDVSGSMEDMNKLPLLKQSMKILVNELRP 380 Query: 283 TYKNVEVVYIRHHTQAKEVDE--------HEFFYSQETGGTIVSSALKLMDEVVKERYNP 334 T K VVY + G T + ++L ++ E + Sbjct: 381 TDKVSIVVYAGAAGMVLPPTSGNEKKTIIKALDQLEAGGSTAGGAGIELAYKIATENFIK 440 Query: 335 AQWNIYAAQASDGD-NWADDSPLCHEILAKKLLPV---VRYYSYIEITRRAHQTLWREYE 390 N A+DGD N S E L ++ + Y + Sbjct: 441 GGNNRVIL-ATDGDFNVGSSSNSDMEKLIEEKRKTGVFLTCLGYG--MGNYKDSKMEILA 497 Query: 391 HLQSTFDNFAMQHIRDQDDIYPVFRELFHKQNATAK 426 + + D K+ + Sbjct: 498 DKGNGNYAYI--------DNIQEANRFLGKEFKGSM 525 >UniRef50_A6GHE4 von Willebrand factor, type A n=1 Tax=Plesiocystis pacifica SIR-1 RepID=A6GHE4_9DELT Length = 785 Score = 73.3 bits (178), Expect = 2e-11, Method: Composition-based stats. Identities = 30/160 (18%), Positives = 48/160 (30%), Gaps = 11/160 (6%) Query: 251 MFCLMDVSGSMDQSTK-DMAKRFYILLYLFLSRTYKNVEVVYIRH--------HTQAKEV 301 + L+DVSGSM K + K + L L VVY KE Sbjct: 357 LVFLLDVSGSMSSRGKLPLIKHGFTQLVEQLGAEDHVSIVVYAGAAGVVLPPTSGDQKET 416 Query: 302 DEHEFFYSQETGGTIVSSALKLMDEVVKERYNPAQWNIYAAQASDGD-NWADDSPLCHEI 360 + GGT S+ + E+ + + N +DGD N Sbjct: 417 ILGALDRLEAGGGTNGSAGIVEAYELAQANFVDGGVNRVIL-GTDGDFNVGLSDHDALVE 475 Query: 361 LAKKLLPVVRYYSYIEITRRAHQTLWREYEHLQSTFDNFA 400 L ++ + S + + L + + F Sbjct: 476 LIEQKRESGVFLSVLGVGGHYDDELMEQLADHGNGNYAFL 515 >UniRef50_B0CCM8 von Willebrand factor, type A domain protein n=4 Tax=Cyanobacteria RepID=B0CCM8_ACAM1 Length = 686 Score = 72.9 bits (177), Expect = 2e-11, Method: Composition-based stats. Identities = 49/304 (16%), Positives = 90/304 (29%), Gaps = 22/304 (7%) Query: 130 EDLALPNLKQNQQRQLTEYKTHRAGYTANGVPANISVVRSLQNSLARRTAMTAGKRRELH 189 + L LP + + +I V + +++ R ++ Sbjct: 196 DRLHLPGTFNTEDYKRINENPFFLPQRTPLSTFSIDVDTASYSNVRRFIRQGQLPPKDAV 255 Query: 190 ALEENLAIISNSEPAQLLEEERLRKEIAELRAKIERVPFIDTFDLRYKNYEKRPDPSSQA 249 LEE + + ++ A + L+ K EK S Sbjct: 256 RLEELINYFDYGYASPKGDQ-PFSVSTEVATAPWNNQHKLVHIGLKGKELEKEQ--PSN- 311 Query: 250 VMFCLMDVSGSMDQSTKD-MAKRFYILLYLFLSRTYKNVEVVYIRH--------HTQAKE 300 + L+DVSGSM + K + K+ LL L + VVY K Sbjct: 312 -LVFLIDVSGSMKRPNKLALVKKSLCLLVHQLKPEDRVSLVVYAGRAGIVLPSTPGTQKA 370 Query: 301 VDEHEFFYSQETGGTIVSSALKLMDEVVKERYNPAQWNIYAAQASDGD-NWADDSPLCHE 359 + + G T ++ +K+ ++ + + N A+DGD N S E Sbjct: 371 TIMNAIDRLEAGGSTAGAAGIKMAYDMAERHFLKNGNNRVIL-ATDGDFNVGQSSDAELE 429 Query: 360 ILAKKLLPV---VRYYSYIEITRRAHQTLWREYEHLQSTFDNFAMQHIRDQDDIYPVFR- 415 L ++ + Y T + + + + Q + R Sbjct: 430 RLIEQKRDRGVFLTVLGYG--TGNYKDNKMELLANKGNGNYAYIDTLLEAQKVLVNDLRG 487 Query: 416 ELFH 419 LF Sbjct: 488 TLFT 491 >UniRef50_C1AC65 Putative uncharacterized protein n=1 Tax=Gemmatimonas aurantiaca T-27 RepID=C1AC65_GEMAT Length = 642 Score = 72.9 bits (177), Expect = 2e-11, Method: Composition-based stats. Identities = 44/274 (16%), Positives = 79/274 (28%), Gaps = 27/274 (9%) Query: 164 ISVVRSLQNSLARRTAMTAGKRRELHALEENLAIISNSEPAQLLEEERLRKEIAELRAKI 223 I V R+ + R + +EE + + + A Sbjct: 192 IDVDRASYGNARRFLQDGQRPPADAVRIEELINYFPYELREPRG-NDPVAITTEVTTAPW 250 Query: 224 ERVPFIDTFDLRYKNYEKRPDPSSQAVMFCLMDVSGSMDQSTK-DMAKRFYILLYLFLSR 282 + + L+ + E P + + L+DVSGSM K + K+ LL + Sbjct: 251 QPRHQLVRIALQSRRIETASLPPNN--LVFLIDVSGSMQSPDKLPLVKQSLRLLVDQMRP 308 Query: 283 TYKNVEVVYI--------RHHTQAKEVDEHEFFYSQETGGTIVSSALKLMDEVVKERYNP 334 + V Y KE + G T + ++L +E + Sbjct: 309 QDRVAIVAYAGAAGLVLPSTSGDEKETIIQAIERLEAGGSTAGGAGIELAYRTAREHFMD 368 Query: 335 AQWNIYAAQASDGD-NWADDSPLCHEILAKKLL---PVVRYYSYIEITRRAHQTLWREYE 390 N ASDGD N S E L ++ + + T + Sbjct: 369 HGNNRVIL-ASDGDFNVGVSSDGELERLIERKRTEGTYLTILGFG--TGNYQDAKMEKLA 425 Query: 391 HLQSTFDNFAMQHIRDQDDIYPVFRELFHKQNAT 424 + + DDI R++ ++ Sbjct: 426 KRGNGNYGYV-------DDIAEA-RKMLVREMGA 451 >UniRef50_C7PR69 von Willebrand factor type A n=1 Tax=Chitinophaga pinensis DSM 2588 RepID=C7PR69_CHIPD Length = 588 Score = 72.9 bits (177), Expect = 2e-11, Method: Composition-based stats. Identities = 48/274 (17%), Positives = 87/274 (31%), Gaps = 27/274 (9%) Query: 164 ISVVRSLQNSLARRTAMTAGKRRELHALEENLAIISNSEPAQLLEEERLRKEIAELRAKI 223 + V R+ +++ R + +EE + S P + + L Sbjct: 160 VDVDRAAYSNIRRFVKLKERIPANAVRIEEMVNYFHYSYPLPPVGQT-LAIYSNYATCPW 218 Query: 224 ERVPFIDTFDLRYKNYEKRPDPSSQAVMFCLMDVSGSMDQSTK-DMAKRFYILLYLFLSR 282 + +R K+ P S + L+DVSGSM K + + + +L L Sbjct: 219 AEDHRLLQIAVRGKSVNLDSLPPSN--LVFLIDVSGSMAMPNKLPLLQAAFRILVNNLRS 276 Query: 283 TYKNVEVVYI--------RHHTQAKEVDEHEFFYSQETGGTIVSSALKLMDEVVKERYNP 334 V Y AK + Y G T +A+KL ++ +E + Sbjct: 277 NDHVAIVAYAGVPGVILPSTPGSAKSKILNAIDYLSAGGATAGEAAIKLAYQIAEENFIK 336 Query: 335 AQWNIYAAQASDGD-NWADDSPLCHEILA---KKLLPVVRYYSYIEITRRAHQTLWREYE 390 N A+DGD N S E L K+ ++ + + + Sbjct: 337 EGNNRVIL-ATDGDFNVGQTSDHDMEQLILGKKETGVLLTCLGFG--MKNYKDSKLETLS 393 Query: 391 HLQSTFDNFAMQHIRDQDDIYPVFRELFHKQNAT 424 + NFA D ++F ++ + Sbjct: 394 SKGNG--NFAYI------DNLEEASKIFAREFGS 419 >UniRef50_A3D1E9 von Willebrand factor, type A n=8 Tax=Shewanella RepID=A3D1E9_SHEB5 Length = 642 Score = 71.3 bits (173), Expect = 6e-11, Method: Composition-based stats. Identities = 41/274 (14%), Positives = 75/274 (27%), Gaps = 26/274 (9%) Query: 164 ISVVRSLQNSLARRTAMTAGKRRELHALEENLAIISNSEPAQLLEEERLRKEIAELRAKI 223 I V +L R + +EE L + P + Sbjct: 155 IDVDTGSYATLRRMLREGRLPEKGTVRVEEMLNYFAYDYPLPAKNAAPFSVTTELAPSPY 214 Query: 224 ERVPFIDTFDLRYKNYEKRPDPSSQAVMFCLMDVSGSMDQSTK-DMAKRFYILLYLFLSR 282 + L+ + K +S + L+DVSGSM + K + + LL LS Sbjct: 215 NDDMMLLRIGLKGYDLPKSQLGASN--LVFLLDVSGSMASADKLPLLQTALKLLTAQLSA 272 Query: 283 TYKNVEVVYIRH--------HTQAKEVDEHEFFYSQETGGTIVSSALKLMDEVVKERYNP 334 K VVY + + G + ++ K+ + P Sbjct: 273 QDKVSIVVYAGAAGVVLDGVSGNDTQTLTYALEQLSAGGSINGGQGITQAYQLAKKHFIP 332 Query: 335 AQWNIYAAQASDGD-NWADDSPLCHEILAKKLLPV---VRYYSYIEITRRAHQTLWREYE 390 N A+DGD N L +K + + + L + Sbjct: 333 NGINRVIL-ATDGDFNVGVTDFDDLIALIEKEKDHGIGLTTLGFG--LGNYNDQLMEQLA 389 Query: 391 HLQSTFDNFAMQHIRDQDDIYPVFRELFHKQNAT 424 + + D R++ + ++ Sbjct: 390 DKGNGNYAYI--------DTLNEARKVLVDELSS 415 >UniRef50_A1ZU67 Von Willebrand factor type A domain protein n=1 Tax=Microscilla marina ATCC 23134 RepID=A1ZU67_9SPHI Length = 552 Score = 70.6 bits (171), Expect = 1e-10, Method: Composition-based stats. Identities = 44/284 (15%), Positives = 90/284 (31%), Gaps = 18/284 (6%) Query: 128 LFEDLALPNLKQNQQRQLTEYKTHRAGYTANGVPANISVVRSLQNSLARRTAMTAGKRRE 187 L ED++ P +K+ + + + + TA +I V + + + Sbjct: 75 LEEDVSPPKIKEKKPANENTFLSVK---TAPLSTFSIDVDNASYSRARKSINNGQLPSTS 131 Query: 188 LHALEENLAIISNSEPAQLLEEERLRKEIAELRAKIERVPFIDTFDLRYKNYEKRPDPSS 247 LEE + + Q + + + L+ K + R S Sbjct: 132 SVRLEEFINYFNYQY-KQPEGQHPFSVNTEVAKCPWNPKNHLVHIGLQGKRLDSRKLKLS 190 Query: 248 QAVMFCLMDVSGSMDQSTK-DMAKRFYILLYLFLSRTYKNVEVVYIRHHTQAKEVDE--- 303 + L+DVSGSM K + ++ + +L L + VVY + + Sbjct: 191 N--LVFLIDVSGSMSAPDKLPLLRKAFKMLVNNLGEEDRVAIVVYAGNAGLVLPATQGTD 248 Query: 304 -----HEFFYSQETGGTIVSSALKLMDEVVKERYNPAQWNIYAAQASDGD-NWADDSPLC 357 Q G T + +KL ++ K+ + N A+DGD N S Sbjct: 249 KQKIMEALDKLQSGGSTAGGAGIKLAYKIAKQNFIKEGNNRIIL-ATDGDFNLGASSDQA 307 Query: 358 HEILAKKLLPVVRYYSYIEIT-RRAHQTLWREYEHLQSTFDNFA 400 + L ++ + + + + + + + Sbjct: 308 MQNLIEEKRKEGVFITVLGLGMGNYRDSKMEIIADKGNGNYYYL 351 >UniRef50_D0XSR4 von Willebrand factor type A n=1 Tax=Brevundimonas subvibrioides ATCC 15264 RepID=D0XSR4_9CAUL Length = 625 Score = 69.4 bits (168), Expect = 2e-10, Method: Composition-based stats. Identities = 35/253 (13%), Positives = 66/253 (26%), Gaps = 19/253 (7%) Query: 164 ISVVRSLQNSLARRTAMTAGKRRELHALEENLAIISNSEPAQLLEEERLRKEIAELRAKI 223 I V + ++ R + R+ +EE + +E A + Sbjct: 169 IDVDTAAYANVRRFISEGQTPPRDAVRVEEMINYFDYGYARPGRADEPFAVSTAVAASPW 228 Query: 224 ERVPF-----IDTFDLRYKNYEKRPDPSSQAVMFCLMDVSGSMDQSTKD-MAKRFYILLY 277 I L+ + ++DVSGSM K +A++ L+ Sbjct: 229 SANAGAGGRQIVHIGLQGYELPAGERRPLN--LTFMVDVSGSMQSPDKLGLAQQTMNLII 286 Query: 278 LFLSRTYKNVEVVYIRHHTQAKEVD--------EHEFFYSQETGGTIVSSALKLMDEVVK 329 L + Y A G T + + E + Sbjct: 287 DRLRPEDRVAVTYYASDVGTAVGPTPGSEKLKLRCAVAALNAGGSTAGAQGMVNAYEQAE 346 Query: 330 ERYNPAQWNIYAAQASDGD-NWA-DDSPLCHEILAKKLLPVVRYYSYIEITRRAHQTLWR 387 ++P + N +DGD N D + +A K + Y + Sbjct: 347 AAFSPDKVNR-ILMFTDGDFNVGVTDDRRLEDYVADKRGTGIYLSVYGFGRGNYQDARMQ 405 Query: 388 EYEHLQSTFDNFA 400 + + Sbjct: 406 TIAQAGNGVAAYV 418 >UniRef50_A3ZT14 Putative uncharacterized protein n=1 Tax=Blastopirellula marina DSM 3645 RepID=A3ZT14_9PLAN Length = 616 Score = 69.4 bits (168), Expect = 2e-10, Method: Composition-based stats. Identities = 54/375 (14%), Positives = 105/375 (28%), Gaps = 54/375 (14%) Query: 63 MFHQGRGGLRHRVHPGNDHFVQNDRIERPQGGGGGSGSGQGQASQDGEGQDEFVFQISKD 122 R P + ++ G G G G D+F + Sbjct: 97 KVRSDARQDRLATLPTESRRLGIEQPNAAPGFMPQLDGIAGHGEGPGVGGDKFAYV---- 152 Query: 123 EYLDLLFEDLALPNLKQNQQRQLTEYKTHRAGYTANGVPANISVVRSLQNSLARRTAMTA 182 + N R + + + ++ + A+ S +RS Sbjct: 153 ---------------ENNPFRAVADEP--LSTFSIDVDTASYSKIRSYLIDYH-----QL 190 Query: 183 GKRRELHALEENLAIISNSEPAQLLEEERLRKEIAELRAKIERVPFIDTFDLRYKNYEKR 242 + + +EE + + + A +++ + + ++ K Sbjct: 191 PPQGAVR-VEELINYFTY-DYATPTDQKPFAANVEAAACPWNAEHRLVRIGIKGKEIANA 248 Query: 243 PDPSSQAVMFCLMDVSGSMDQSTK-DMAKRFYILLYLFLSRTYKNVEVVYI--------R 293 P+S + L+DVSGSM+ + K + K+ LL L K VVY Sbjct: 249 ERPASN--LVFLLDVSGSMNNARKLPLLKQGMKLLVDQLGENDKVAIVVYAGAAGMVLNS 306 Query: 294 HHTQAKEVDEHEFFYSQETGGTIVSSALKLMDEVVKERYNPAQWNIYAAQASDGD-NWAD 352 + K Q G T ++L + E + N +DGD N Sbjct: 307 TNGDDKSTIMEALDRLQAGGSTNGGQGIELAYQAATENFIKGGVNRVIL-CTDGDFNVGV 365 Query: 353 DSPLCHEILAKKLLPV---VRYYSYIEITRRAHQTLWREYEHLQSTFDNFAMQHIRDQDD 409 S +A + + T + + E + F D Sbjct: 366 TSTSDLVTMAADKAKSGVFLSVMGFG--TGNHNDAMMEELSGKANGNYAFI--------D 415 Query: 410 IYPVFRELFHKQNAT 424 +++ +Q + Sbjct: 416 TITEAKKVLVEQMSG 430 >UniRef50_A0NVX5 Putative uncharacterized protein n=1 Tax=Labrenzia aggregata IAM 12614 RepID=A0NVX5_9RHOB Length = 608 Score = 69.4 bits (168), Expect = 3e-10, Method: Composition-based stats. Identities = 32/258 (12%), Positives = 67/258 (25%), Gaps = 26/258 (10%) Query: 181 TAGKRRELHALEENLAIISNSEPAQLLEEERLRKEIAELRAKIERVPFIDTFDLRYKNYE 240 + +EE + + P ++ + + ++ Sbjct: 181 GRLPNPDAVRVEEMVNYFDYNYPVPEKGGHPFSTNVSVVDTPWNEHTKLMQVGIQGYKVP 240 Query: 241 KRPDPSSQAVMFCLMDVSGSMDQSTK-DMAKRFYILLYLFLSRTYKNVEVVYIRHHTQAK 299 PS + L+D SGSM + K + ++ + LL L + V Y Sbjct: 241 LDDLPSQN--LVFLIDTSGSMADANKLPLLQQSFRLLLSSLRDEDEVAIVTYAGSSGVLL 298 Query: 300 EVDE--------HEFFYSQETGGTIVSSALKLMDEVVKERYNPAQWNIYAAQASDGD-NW 350 E + + G T LK + + + A+DGD N Sbjct: 299 EPTKVADKTRILEKINALTSGGSTAGHEGLKGAYALAETMTGDGEQTRIIL-ATDGDFNV 357 Query: 351 ADDSPLCHEILAKKLLP---VVRYYSYIEITRRAHQTLWREYEHLQSTFDNFAMQHIRDQ 407 P + + + + + L + + Sbjct: 358 GLSDPDSLKRYVAEQRENGTALSVLGFG--RGNYNDELMQTLAQNGQGVAAYI------- 408 Query: 408 DDIYPVFRELFHKQNATA 425 D R++ Q ++ Sbjct: 409 -DTLSEARKVLVDQVVSS 425 >UniRef50_B1ZYN3 von Willebrand factor type A n=2 Tax=Bacteria RepID=B1ZYN3_OPITP Length = 792 Score = 69.0 bits (167), Expect = 3e-10, Method: Composition-based stats. Identities = 39/234 (16%), Positives = 70/234 (29%), Gaps = 35/234 (14%) Query: 165 SVVRSLQNSLARRTAMTAGKRRELHALEENLAIISNSEPAQ---------LLEEERLRKE 215 +V R L+ + +EE + A E Sbjct: 335 NVRRFLRE--------GRLPPADAVRIEELVNYFPYRYAAPGRVRDEGVAAPGEAPFAAA 386 Query: 216 IAELRAKIERVPFIDTFDLRYKNYEKRPDPSSQAVMFCLMDVSGSMDQSTK-DMAKRFYI 274 + A + L+ K+ ++ + L+DVSGSMDQ K + + Sbjct: 387 LEVAAAPWAAQHRLVRIGLKAKDAAVSGRAAAN--LVFLLDVSGSMDQPNKLRLVQESMR 444 Query: 275 LLYLFLSRTYKNVEVVYIRHHTQA---------KEVDEHEFFYSQETGGTIVSSALKLMD 325 LL L + V Y + A +E+ + + G T + L+L Sbjct: 445 LLLGRLQPEDRVAIVTYAGNSGLALPSTPVARQREILD-AIDELRAGGSTNGAMGLQLAY 503 Query: 326 EVVKERYNPAQWNIYAAQASDGD-NWADDSPLCHEILAKKLLPV---VRYYSYI 375 ++ K + N +DGD N S L ++ + + Sbjct: 504 DIAKANFVANGVNRVIL-CTDGDFNVGVTSEGELVRLIEEKAKSGVFLTVLGFG 556 >UniRef50_C7N770 Uncharacterized protein containing a von Willebrand factor type A (VWA) domain n=1 Tax=Slackia heliotrinireducens DSM 20476 RepID=C7N770_SLAHD Length = 629 Score = 68.3 bits (165), Expect = 5e-10, Method: Composition-based stats. Identities = 50/345 (14%), Positives = 94/345 (27%), Gaps = 27/345 (7%) Query: 75 VHPGNDHFVQNDRIERPQGGGGGSGSGQGQASQDG---EGQDEFVFQISKDEYLDLLFED 131 + G + E P + S + +G + + + + D L E Sbjct: 99 IGVGVGTNLLGSNAEMPVAETKAASEDTMAGSANSYAPDGGLAYETDEAYETF-DTLDEG 157 Query: 132 LALPNLKQNQQRQLTEYKTHRAGYTANGVPANISVVRSLQNSLARRTAMTAGKRRELH-- 189 + + + + E + T + V + +L R Sbjct: 158 APMEDFNTEEYAAIEENGFV-STVTRPLSTCSADVDTASYCNLRRMINDGYSLDEIPDGA 216 Query: 190 -ALEENLAIISNSEPAQLLEEERLRKEIAELRAKIERVPFIDTFDLRYKNYEKRPDPSSQ 248 +EE L + R + + +K S Sbjct: 217 VRIEEMLNYFHYDSGEP-EGNDLFAVRAESARCPWNDQTQLLVMT--FTASDKAQTASKG 273 Query: 249 AVMFCLMDVSGSMDQSTK-DMAKRFYILLYLFLSRTYKNVEVVYIRHHTQAKE---VDEH 304 + + L+D+SGSMD+ K D+ K + L L + V Y E D+ Sbjct: 274 SNLVFLIDISGSMDEPDKLDLLKDSFGTLLENLGPNDRVSIVTYAAGEDVLLEGASGDDT 333 Query: 305 -----EFFYSQETGGTIVSSALKLMDEVVKERYNPAQWNIYAAQASDGD-NWADDSPLCH 358 + G T + L++ EV + Y N ASDGD N S Sbjct: 334 RKIMRALNRLEADGSTNGEAGLEMAYEVAERNYIEGGVNRIVM-ASDGDLNVGITSESDL 392 Query: 359 EILAKKLLP---VVRYYSYIEITRRAHQTLWREYEHLQSTFDNFA 400 ++ + + + T + ++ Sbjct: 393 YDFVEEKRETGVYLSVLGFG--SGNYKDTKMETLADHGNGTYHYI 435 >UniRef50_UPI000185CB41 protein containing von Willebrand factor n=1 Tax=Capnocytophaga sputigena ATCC 33612 RepID=UPI000185CB41 Length = 550 Score = 68.3 bits (165), Expect = 6e-10, Method: Composition-based stats. Identities = 47/245 (19%), Positives = 76/245 (31%), Gaps = 19/245 (7%) Query: 165 SVVRSLQNSLARRTAMTAGKRRELHALEENLAIISNSEPAQLLE-EERLRKEIAELRAKI 223 V R+ +L R ++ +EE + PA E LR Sbjct: 107 DVDRASYANLRRMLGYGQLPPKDAIRIEEMINYFDYDYPAPTKEATSPLRVTPELAPTPW 166 Query: 224 ERVPFIDTFDLRYKNYEKRPDPSSQAVMFCLMDVSGSMDQSTK-DMAKRFYILLYLFLSR 282 + L+ K + P S + L+DVSGSMD+ K + K + LL L Sbjct: 167 NPEHLLLRIGLQAKKLDLAQAPPSN--IVFLIDVSGSMDEPNKLPLLKSSFKLLLTQLKP 224 Query: 283 TYKNVEVVYIRHHTQAKEVD--------EHEFFYSQETGGTIVSSALKLMDEVVKERYNP 334 T + V Y A E +G T SS ++L + ++ + Sbjct: 225 TDRVAIVTYASGTKVALSSTPVKERQKIEKVLDNLYASGSTSGSSGIQLAYKEAQKNFIK 284 Query: 335 AQWNIYAAQASDGD-NWADDSPLCHEILAKKLLP---VVRYYSYIEITRRAHQTLWREYE 390 N A+DGD N +P E +K + + + Sbjct: 285 NGNNRIIL-ATDGDFNVGISNPRELEKFIEKQRESGIYMSVLGFG--MGNYRDDMAETIA 341 Query: 391 HLQST 395 + Sbjct: 342 DKGNG 346 >UniRef50_B0UJ22 von Willebrand factor type A n=1 Tax=Methylobacterium sp. 4-46 RepID=B0UJ22_METS4 Length = 654 Score = 67.5 bits (163), Expect = 8e-10, Method: Composition-based stats. Identities = 34/268 (12%), Positives = 71/268 (26%), Gaps = 30/268 (11%) Query: 170 LQNSLARRTAMTAGKRRELHALEENLAIISNSEPAQLLEEERLRKEIAELRAKIERVPFI 229 ++++L R + EE + + PA + R + + + Sbjct: 200 VRDALNRN---HLPPPAAVRT-EELINYFPYAYPAPASPDAPFRVTASVFPSPWAEGRKL 255 Query: 230 DTFDLRYKNYEKRPDPSSQAVMFCLMDVSGSMDQSTK-DMAKRFYILLYLFLSRTYKNVE 288 +R P + + L+D SGSM + + K+ +L L + Sbjct: 256 LHIGIRGYAVAPAERPPAN--LVFLVDTSGSMAAPNRLPLVKQSLAMLLTTLDARDRVAL 313 Query: 289 VVYIRHHTQAKEVDE--------HEFFYSQETGGTIVSSALKLMDEVVKERYNPAQWNIY 340 V Y E Q G T ++ + ++P N Sbjct: 314 VAYAGEVGTVLEPTPAGEAGRILAAIETLQAHGSTAGGEGIRQAYALAARHFDPKAVNRV 373 Query: 341 AAQASDGD-NWADDSPLCHEILAKKLLPV---VRYYSYIEITRRAHQTLWREYEHLQSTF 396 A+DGD N + + + + L + + Sbjct: 374 IL-ATDGDFNVGITGRDELTGFVARERRKGIFLSVLGFG--MGNLNDALMQALAKDGNGV 430 Query: 397 DNFAMQHIRDQDDIYPVFRELFHKQNAT 424 D R++ ++ + Sbjct: 431 AAHI--------DTAQEARKVLVEEATS 450 >UniRef50_A7C0I1 von Willebrand factor type A domain protein n=5 Tax=Bacteria RepID=A7C0I1_9GAMM Length = 367 Score = 67.1 bits (162), Expect = 1e-09, Method: Composition-based stats. Identities = 29/165 (17%), Positives = 49/165 (29%), Gaps = 18/165 (10%) Query: 244 DPSSQAVMFCLMDVSGSMDQ-STKDMAKRFYILLYLFLSRTYKNVEVVYIRHHTQAKEVD 302 P + + L+DVSGSM + K LL L+ K VVY E Sbjct: 1 MPPAN--LVFLVDVSGSMRSNHKLALLKSALKLLSNQLTEKDKVSLVVYAGAAGVVLEPT 58 Query: 303 --------EHEFFYSQETGGTIVSSALKLMDEVVKERYNPAQWNIYAAQASDGD-NWADD 353 G T S+ + L + ++ + N A+DGD N Sbjct: 59 PGHQSVKINGALERLTAGGSTHGSAGIHLAYNLAEQAFIKNGINRILL-ATDGDFNVGTV 117 Query: 354 SPLCHEILAKKLLP---VVRYYSYIEITRRAHQTLWREYEHLQST 395 + L ++ + + + L + + Sbjct: 118 DFEALKNLVEEKRKSGISLTTLGFG--RGNYNDQLMEQLADAGNG 160 >UniRef50_C8WPE6 von Willebrand factor type A n=1 Tax=Eggerthella lenta DSM 2243 RepID=C8WPE6_EGGLE Length = 555 Score = 66.3 bits (160), Expect = 2e-09, Method: Composition-based stats. Identities = 38/268 (14%), Positives = 72/268 (26%), Gaps = 28/268 (10%) Query: 170 LQNSLARRTAMTAGKRRELHALEENLAIISNSEPAQLLEEERLRKEIAELRAKIERVPFI 229 L+ +A+R A + EE L + P + + + Sbjct: 112 LRRMVAQRYAPAVVPAGAVRT-EELLNYFDYAYPEPVG-SDLFGVSAQMSDCPWNDQTKL 169 Query: 230 DTFDLRYKNYEKRPDPSSQAVMFCLMDVSGSMDQSTK-DMAKRFYILLYLFLSRTYKNVE 288 + + + + A + L+DVSGSMD K + K + L L+ + Sbjct: 170 --LVMGFATEKDGDASPTGANLVFLIDVSGSMDDPDKLPLVKDSFAALVEGLTERDRVSV 227 Query: 289 VVYIRHHTQAKE-VDEHE-------FFYSQETGGTIVSSALKLMDEVVKERYNPAQWNIY 340 V Y E V + G T + L+ + + + N Sbjct: 228 VTYASGERVLLEGVPGDDKRRIMRAVDSLVAEGSTNGEAGLEQAYRLAESSFIEGGVNRV 287 Query: 341 AAQASDGD-NWADDSPLCHEILAKKLLP---VVRYYSYIEITRRAHQTLWREYEHLQSTF 396 ASDGD N S ++ + + + + Sbjct: 288 VM-ASDGDLNVGISSESELHDFVEQKRETGVYLSVLGFG--SGNYKDNKMETLADHGNGA 344 Query: 397 DNFAMQHIRDQDDIYPVFRELFHKQNAT 424 ++ D R + + Sbjct: 345 YHYI--------DCAEEARRVLGRNLRA 364 >UniRef50_UPI00006CDDCC von Willebrand factor type A domain containing protein n=1 Tax=Tetrahymena thermophila RepID=UPI00006CDDCC Length = 2138 Score = 64.8 bits (156), Expect = 6e-09, Method: Composition-based stats. Identities = 48/290 (16%), Positives = 94/290 (32%), Gaps = 55/290 (18%) Query: 135 PNLKQNQQRQLTEYKTHRAGYTANGVPANISVVRSLQNSLARRTAMTAGKRRELHALEEN 194 P +++ Q + + + + IS + L S+ R+ K ++ E+ Sbjct: 1356 PKIEEKDQSEGQQEEFEQNE---THSLRKISQKKVLIKSIQRKVKTNKEKVQKALNEEDK 1412 Query: 195 LAIISNSEPAQLLEEERLRKEIAELRAKIERVPFIDTFDLRYKNYEKRPDPSSQAVMFCL 254 N +Q K I+ + + P DL C+ Sbjct: 1413 ----ENQTKSQQHRISSNVKNISGQFSLGQLQPMRFPIDL-----------------ICV 1451 Query: 255 MDVSGSMDQSTKDMAKRFYILLYLFLSRTYKNVEVVYIRHHTQAKEVD------------ 302 +D SGSM+ D+ K + L L + + I+ T A+ + Sbjct: 1452 IDTSGSMNGQPLDLLKETLLFLVDLLQTGDR---ICLIQFSTNAQRLTPLLSIESKDNIK 1508 Query: 303 --EHEFFYSQETGGTIVSSALKLMDEVV-KERYNPAQWNIYAAQASDGDNWADDSPLCHE 359 ++E GGT + ++L +V+ + RY +++ SDG N ++ Sbjct: 1509 SIKNEINRLVAKGGTNICQGMQLAFDVLKQRRYKNPITSVFLL--SDGLNDGAENK---- 1562 Query: 360 ILAKKLLPVVRYYSYIEITRRAHQTLWREYEHLQSTFDNFAMQHIRDQDD 409 + LL + +Y + + + N M I D Sbjct: 1563 --IRDLLKQLNFYQ----NYNEENFTIQTFGFGKDHDPNL-MDKISQLMD 1605 >UniRef50_A3TQT5 Putative secreted protein n=1 Tax=Janibacter sp. HTCC2649 RepID=A3TQT5_9MICO Length = 533 Score = 63.6 bits (153), Expect = 1e-08, Method: Composition-based stats. Identities = 42/244 (17%), Positives = 75/244 (30%), Gaps = 23/244 (9%) Query: 192 EENLAIISNSEPAQLLEEERLRKEIAELRAKIERVPFIDTFDLRYKNYEKRPDPSSQAVM 251 EE + + PA ++ L+ + A ++ + + L+ + + R M Sbjct: 131 EEWVNSFDSGFPAPRKDDLELQSDQARASSEDD-GTRLVRIGLQGREVDVREWQPVALTM 189 Query: 252 FCLMDVSGSMDQSTKD-MAKRFYILLYLFLSRTYKNVEVVY------IRHHTQAKEVDE- 303 +D SGSMD + + K LL L V Y + T ++ D Sbjct: 190 V--VDTSGSMDIRERLGLVKSSLALLAENLRPDDTIAIVTYQTDATPLLEPTPVRDTDTI 247 Query: 304 -HEFFYSQETGGTIVSSALKLMDEVVKERYNPAQWNIYAAQASDG-DNWA-DDSPLCHEI 360 + G T + + L L + +E Y N+ ASDG N D Sbjct: 248 LAAIDRLEAGGSTNLEAGLLLGYDQAREAYKQGATNVVLL-ASDGVANVGVTDGGRLATA 306 Query: 361 LAKKLLPVVRYYSYIEITRRAHQTLWREYEHLQSTFDNFAMQHIRDQDDIYPVFRELFHK 420 + + + L + F + D + R+LF + Sbjct: 307 IRDNGRRGIHLVTVGYGMGNYSDHLMEQLADQGDGFYEYI--------DTFEEARKLFVE 358 Query: 421 QNAT 424 Sbjct: 359 DLRA 362 >UniRef50_A8SXC8 Putative uncharacterized protein n=2 Tax=Clostridiales RepID=A8SXC8_9FIRM Length = 612 Score = 63.2 bits (152), Expect = 2e-08, Method: Composition-based stats. Identities = 41/272 (15%), Positives = 81/272 (29%), Gaps = 22/272 (8%) Query: 111 GQDEFVFQISKDEYLDLLFEDLALPNLKQNQQRQLTEYKTHRAGYTANGVPANISVVRSL 170 G V S Y ++ ++ ++ +N ++ + + A+ A+ S VRS Sbjct: 101 GDTAMVTDTSNSMYSEVAYDTREYDSMTENGF--VSTVDRPLSTFAADRDTASYSNVRSY 158 Query: 171 QNSLARRTAMTAGKRRELHALEENLAIISNSEPAQLLEEERLRKEIAELRAKIERVPFID 230 S + +EE L + + + E+ + + Sbjct: 159 IES-------GSLPPDGAVRIEEMLNYFTYDYRKKPEDGEKFSIYTEYSDCPWNKDTKLM 211 Query: 231 TFDLRYKNYEKRPDPSSQAVMFCLMDVSGSM-DQSTKDMAKRFYILLYLFLSRTYKNVEV 289 + + S + L+D SGSM D + + ++ + +L L + V Sbjct: 212 MVGINTDEIDFGDKKPSN--LVFLIDTSGSMYDDNKLPLVQQSFAMLAENLDENDRVSIV 269 Query: 290 VYIRHHTQAKEVD--------EHEFFYSQETGGTIVSSALKLMDEVVKERYNPAQWNIYA 341 Y T G T A+ E+ ++ + N Sbjct: 270 TYAGEDTVVLSGTPGSEQYTISEALSNMTAEGCTNGGDAIITAYELAEKNFINGGNNRVI 329 Query: 342 AQASDGD-NWADDSPLCHEILAKKLLPVVRYY 372 A+DGD N S L + + Sbjct: 330 L-ATDGDLNVGLTSESDLVDLITEEKKENNIF 360 >UniRef50_A9GVG2 Putative uncharacterized protein n=1 Tax=Sorangium cellulosum 'So ce 56' RepID=A9GVG2_SORC5 Length = 656 Score = 62.9 bits (151), Expect = 2e-08, Method: Composition-based stats. Identities = 38/266 (14%), Positives = 69/266 (25%), Gaps = 34/266 (12%) Query: 175 ARRTAMTAGKRRELHALEENLAIISNSEPAQLLEEERLRKEIAELRAKIERVPFIDTFDL 234 R+ A + EE L + +A + + + Sbjct: 229 RRKIMDGALPPYQAVRAEEFLNYFDYGYASPAA--GPFAVHLAAAPSPFTSGHHLVRVAV 286 Query: 235 RYKNYEKRPDPSSQAVMFCLMDVSGSMDQSTK-DMAKRFYILLYLFLSRTYKNVEVVYIR 293 + K + + L+D SGSM K ++AK+ +L L V Sbjct: 287 QGKRVPVKERTPVH--LVYLVDTSGSMQSPDKIELAKKSLKMLTDTLKPGD---TVALCT 341 Query: 294 HHTQAKEVDE-----------HEFFYSQETGGTIVSSALKLMDEVVKERYNPAQWNIYAA 342 + +EV G T +SS + L + + N Sbjct: 342 YAGSVREVLAPTGIESKGKILAALADLTAGGSTAMSSGIDLAYSLAERTLVKGHVNRVIV 401 Query: 343 QASDGD-NWADDSPLCHEILAKKLLPV---VRYYSYIEITRRAHQTLWREYEHLQSTFDN 398 SDGD N S K+ + + + + + + Sbjct: 402 -LSDGDANVGPTSHDEILKTIKRARDKGITLSTVGFGQ--GNYKDLMMEQLANQGDGNYA 458 Query: 399 FAMQHIRDQDDIYPVFRELFHKQNAT 424 + D R +F +Q Sbjct: 459 YI--------DSEAQARRVFSEQVGG 476 >UniRef50_Q1CVN5 von Willebrand factor type A domain protein n=2 Tax=Cystobacterineae RepID=Q1CVN5_MYXXD Length = 700 Score = 62.9 bits (151), Expect = 2e-08, Method: Composition-based stats. Identities = 33/295 (11%), Positives = 82/295 (27%), Gaps = 39/295 (13%) Query: 145 LTEYKTHRAGYTANGVPANISVVRSLQNSLARRTAMTAGKRRELHALEENLAIISNSEPA 204 + + + ++ + A+ ++ R+ + + +EE + Sbjct: 241 INTEEERFSTFSVDTDSASYTLTRAYLER-------GSLPNEQAVRVEEFVNTFDYGYAH 293 Query: 205 QLLEEERLRKEIAELRAKIERVPFIDTFDLRYKNYEKRPDPSSQAVMFCLMDVSGSMDQS 264 Q ++ + + + + ++ + + S + ++DVSGSM+ Sbjct: 294 QG--SAPFSVQVEGFPSPVRKGYHVVHVGVKAREVSRPQRKPSH--LVFVIDVSGSMNLE 349 Query: 265 TKD-MAKRFYILLYLFLSRTYKNVEVVY------IRHHTQA--KEVDEHEFFYSQETGGT 315 + + KR LL L + VVY + T A + G T Sbjct: 350 NRLGLVKRALHLLVNELDERDQVSIVVYGSTARLVLEPTSAVHAHIIRAAIDSLHTEGST 409 Query: 316 IVSSALKLMDEVVKERYNPAQWNIYAAQASDGDNWADDSPLCHEILAKKLLPV------V 369 + L++ + N SDG A+ + +++ + Sbjct: 410 NAQAGLEMGYSLAASHLVEGGINRVIL-CSDG--VANTGLTDANSIWERIRARAAKGITL 466 Query: 370 RYYSYIEITRRAHQTLWREYEHLQSTFDNFAMQHIRDQDDIYPVFRELFHKQNAT 424 + + L + + D +F + Sbjct: 467 STVGFG--MGNYNDVLMERLSQVGEGNYAYV--------DRIEEAHRIFVRDLTG 511 >UniRef50_B4D1N7 Autotransporter-associated beta strand repeat protein n=1 Tax=Chthoniobacter flavus Ellin428 RepID=B4D1N7_9BACT Length = 1545 Score = 61.7 bits (148), Expect = 5e-08, Method: Composition-based stats. Identities = 46/256 (17%), Positives = 74/256 (28%), Gaps = 45/256 (17%) Query: 192 EENLAIISNSEPAQLLEEERLRKEIAELRAKIERVPFIDTFDL-RY--KNYEKRPDPSSQ 248 EE + +P + R PF DL R+ K P Sbjct: 1146 EEFINAFDYRDPEPSPGAPL------AFVTERARYPFAQNRDLLRFAVKTAAAGRQPGRP 1199 Query: 249 AVMFCLMDVSGSMDQSTK-DMAKRFYILLYLFLSRTYKNVEVVYIR-------------- 293 + L+D SGSM+++ + ++ + +L L K V + R Sbjct: 1200 LNIVLLLDRSGSMERADRVNIVREALSVLAKHLQPQDKLSIVTFARTPHLWADAVAGDKV 1259 Query: 294 HHTQAK--EVDEHEFFYSQETGGTIVSSALKLMDEVVKERYNPAQWNIYAAQASDG-DNW 350 H A+ E+ GGT + +AL L E + N +DG N Sbjct: 1260 HDVIARVNEITPE--------GGTNLEAALDLAYETAHHHFAVDSTNRVIL-FTDGAANL 1310 Query: 351 ADDSPLCHEILAKKLLPV-VRYYSYIEITRRAHQTLWREYEHLQSTFDNFAMQHIRDQDD 409 D +P + + + + L + F I +D Sbjct: 1311 GDVNPDALTKKVEAQRKQGIALDCFGIGWEGYNDDLLEQLTRNADGRYGF----INTPED 1366 Query: 410 IYPVFRELFHKQNATA 425 F Q A A Sbjct: 1367 ----AAANFATQIAGA 1378 >UniRef50_A9NFD9 Surface-anchored VWFA domain protein n=1 Tax=Acholeplasma laidlawii PG-8A RepID=A9NFD9_ACHLI Length = 486 Score = 61.3 bits (147), Expect = 7e-08, Method: Composition-based stats. Identities = 49/298 (16%), Positives = 101/298 (33%), Gaps = 25/298 (8%) Query: 122 DEYLDLLFEDLALPNLKQNQQRQLTEYKTHRAGYTANGVPANISVVRSLQNSLARRTAMT 181 DE + +F D + +N ++ ++ + + A+ S +RS NS Sbjct: 31 DENYNYIFNDDEHQEIIENPFIDVSVNN--KSNISLSANTASYSFIRSQINS-------G 81 Query: 182 AGKRRELHALEENLAIISNSEPAQLLEEERLRKEIAELRAKIERVPFIDTFDLRYKNYEK 241 R +EE + + + ++ + ++ + L K + Sbjct: 82 RAVDRNAVRIEEMVNFFNYNYNQPETDKT-FGFKSELIQTPWNNETHLLLIGLETKQVDL 140 Query: 242 RPDPSSQAVMFCLMDVSGSMDQSTKD-MAKRFYILLYLFLSRT-------YKNVE-VVYI 292 PS+ + L+DVSGSM + K +AK+ LL + Y + E VV+ Sbjct: 141 GDIPSN---IVILLDVSGSMSATNKLSLAKKAMELLIEQMKPNDVISLVTYSSGEKVVFK 197 Query: 293 RHHTQAKEVDEHEFFYSQETGGTIVSSALKLMDEVVKERYNPAQWNIYAAQASDGD-NWA 351 + + +G T L + +V +E + N A+DGD N Sbjct: 198 GKSIDDMAYMTSQIRLLKASGSTAGKKGLDMAYKVAEEYFIEGGNNRIIL-ATDGDFNVG 256 Query: 352 -DDSPLCHEILAKKLLPVVRYYSYIEITRRAHQTLWREYEHLQSTFDNFAMQHIRDQD 408 + + E +++K + + +Y + ++ I + Sbjct: 257 ISSTDMLIEYISEKRESGIYFSAYGFGYGNFKDEKLERVAKAGNGTYHYIDDIISARK 314 >UniRef50_D2W4Q3 Predicted protein n=1 Tax=Naegleria gruberi RepID=D2W4Q3_NAEGR Length = 454 Score = 59.0 bits (141), Expect = 3e-07, Method: Composition-based stats. Identities = 38/202 (18%), Positives = 64/202 (31%), Gaps = 30/202 (14%) Query: 234 LRYKNYEKRPDPSSQAVMFCLMDVSGSMDQSTKDMAKRFYILLYLFLSRTYKNVEVVYIR 293 L++ Q + +DVSGSM D AK I + + +VV I Sbjct: 26 LQFDLISNIQRKEKQ--IVIALDVSGSMRGQGIDQAK---IAISNLFEQVVDTPDVVLIT 80 Query: 294 HHTQAK---------EVDEHEFFYSQETGGTIVSSALKLMDEVVKERYNPAQWNI----Y 340 + T A+ E + Q GGT + + + + +N Sbjct: 81 YDTSAELYDLRKKPAETRQSTLEQIQAGGGTDFTCVFEAISNL-------DMFNRQSEVA 133 Query: 341 AAQASDG-DNWADDSPLCHEILAKKLLPVVRYYSYIEI--TRRAHQTLWREYEHLQSTFD 397 +DG D + E + K L + + + I T L + L S Sbjct: 134 ILFFTDGQDGSSHKREKAIEQMKKVLETKTQSFEFHTIGFTSSHDVALLTQITQLGSVQG 193 Query: 398 NFAMQHIRDQDDIYPVFRELFH 419 F ++D ++I L Sbjct: 194 TFQY--VKDANEINQSMENLIG 213 >UniRef50_D2VDM1 Predicted protein n=1 Tax=Naegleria gruberi RepID=D2VDM1_NAEGR Length = 754 Score = 59.0 bits (141), Expect = 4e-07, Method: Composition-based stats. Identities = 38/202 (18%), Positives = 66/202 (32%), Gaps = 30/202 (14%) Query: 234 LRYKNYEKRPDPSSQAVMFCLMDVSGSMDQSTKDMAKRFYILLYLFLSRTYKNVEVVYIR 293 L++ Q + +DVSGSM D AK I + + +VV I Sbjct: 26 LQFDLISNIQRKEKQ--IVIALDVSGSMRGQGIDQAK---IAISNLFEQVVDIPDVVLIA 80 Query: 294 HHTQAK---------EVDEHEFFYSQETGGTIVSSALKLMDEVVKERYNPAQWN----IY 340 + T A+ E + Q GGT + + + ++ +N + Sbjct: 81 YDTSAELYDLRKKPAETRQSTLEQIQAGGGTDFTCVFEAISKL-------DMFNSQSEVA 133 Query: 341 AAQASDG-DNWADDSPLCHEILAKKLLPVVRYYSYIEI--TRRAHQTLWREYEHLQSTFD 397 +DG D + E + K L + + + I T L + L S Sbjct: 134 ILFFTDGQDGSSHKREKAIEQMKKVLETKTQSFEFHTIGFTSSHDVALLTQITQLGSVQG 193 Query: 398 NFAMQHIRDQDDIYPVFRELFH 419 F ++D ++I L Sbjct: 194 TFQY--VKDANEINQSMENLIG 213 >UniRef50_A0P2E4 Von Willebrand factor type A like domain n=1 Tax=Labrenzia aggregata IAM 12614 RepID=A0P2E4_9RHOB Length = 772 Score = 56.7 bits (135), Expect = 2e-06, Method: Composition-based stats. Identities = 23/161 (14%), Positives = 51/161 (31%), Gaps = 19/161 (11%) Query: 251 MFCLMDVSGSMDQSTKDMAKRFYILLYLFLSR-------TYKNVEVVYIRHHTQAKEVDE 303 + ++D SGSM + +K F L + N + A E ++ Sbjct: 366 LVFVLDTSGSMSGQPIEASKTFMTAAIKALRPDDYFRILHFSNDTSQFAGQAVLATERNK 425 Query: 304 HEFFY----SQETGGTIVSSALKLMDEVVKERYNPAQWNIYAAQASDGDNWADDSPLCHE 359 + GGT ++ A+ + + P +DG + D + Sbjct: 426 QKALKFVADLSAGGGTEINQAVNAAFDQAQ----PDNTTRIVVFLTDG--YIGDEATVIK 479 Query: 360 ILAKKLLPVVRYYSYIEITRRAHQTLWREYEHLQSTFDNFA 400 +A ++ R Y++ + ++ L + + Sbjct: 480 SIANRI-GKARIYAFG-VGNSVNRFLLDAMATEGRGYARYV 518 >UniRef50_Q55874 Uncharacterized protein sll0103 n=16 Tax=Cyanobacteria RepID=Y103_SYNY3 Length = 420 Score = 54.4 bits (129), Expect = 8e-06, Method: Composition-based stats. Identities = 35/180 (19%), Positives = 62/180 (34%), Gaps = 25/180 (13%) Query: 254 LMDVSGSMDQSTKDMAKRFYILLYLFLSRTYKNVEVVYIRHHTQAKEVDEH--------- 304 ++D SGSMD + K + L L + V I +AK V E+ Sbjct: 47 VLDHSGSMDGQPLETVKSAALGLIDRLEE-DDRLSV--IAFDHRAKIVIENQQVRNGAAI 103 Query: 305 --EFFYSQETGGTIVSSALKLMDEVVKERYNPAQWNIY--AAQASDGDNWADDSPLCHEI 360 + GGT + LKL + + + + +DG+N D+ C + Sbjct: 104 AKAIERLKAEGGTAIDEGLKLGIQEAAK----GKEDRVSHIFLLTDGENEHGDNDRCLK- 158 Query: 361 LAKKLLPVVRYYSYIEITRRAHQTLWREYEHLQSTFDNFAMQHIRDQDDIYPVFRELFHK 420 L V Y T ++ + ++ +I + + FR+LF + Sbjct: 159 ----LGTVASDYKLTVHTLGFGDHWNQDVLEAIAASAQGSLSYIENPSEALHTFRQLFQR 214 >UniRef50_A9AWD1 von Willebrand factor type A n=1 Tax=Herpetosiphon aurantiacus ATCC 23779 RepID=A9AWD1_HERA2 Length = 610 Score = 54.0 bits (128), Expect = 1e-05, Method: Composition-based stats. Identities = 42/251 (16%), Positives = 77/251 (30%), Gaps = 23/251 (9%) Query: 186 RELHALEENLAIISNSEPAQLLEEERLRKEIAELRAKIERVPFIDTFDLRYKNYEKRPDP 245 + +EE L P + + E+A + ++ ++ E Sbjct: 206 ADSVRVEEYLNAFDYEYPQPEDGDFAIYSEVAP-SPFGGPNYELVQIGIQARSIEVADRK 264 Query: 246 SSQAVMFCLMDVSGSMDQSTK-DMAKRFYILLYLFLSRTYKNVEVVY------IRHHTQA 298 A + ++D SGSM Q + +M K I L L V + + + T Sbjct: 265 P--AALTFVIDTSGSMAQDNRLEMVKNALIYLAGQLEPDDSLAIVAFNDGMRVVLNPTSG 322 Query: 299 KEVDE--HEFFYSQETGGTIVSSALKLMDEVVKERYNPAQWNIYAAQASDG-DNWADDSP 355 + + + G T + L E+ + + P N SDG N P Sbjct: 323 ENQMDIITAINSLEPAGSTNAEAGLYKGFELAWQAFKPEGINRILL-CSDGVANSGMTEP 381 Query: 356 LCHEILAKKLLPV-VRYYSYIEITRRAHQTLWREYEHLQSTFDNFAMQHIRDQDDIYPVF 414 ++ L V+ +Y + L + N+A D Sbjct: 382 SQLLATFQQYLDAGVQLSTYGVGMGNYNDILLEQLADKGDG--NYAYF------DSADEA 433 Query: 415 RELFHKQNATA 425 + LF +Q + Sbjct: 434 QRLFGEQLTGS 444 >UniRef50_A1ZRP2 Von Willebrand factor type A domain protein n=1 Tax=Microscilla marina ATCC 23134 RepID=A1ZRP2_9SPHI Length = 1088 Score = 53.6 bits (127), Expect = 1e-05, Method: Composition-based stats. Identities = 24/107 (22%), Positives = 40/107 (37%), Gaps = 10/107 (9%) Query: 251 MFCLMDVSGSMDQSTK-DMAKRFYILLYLFLSRTYKNVEVVY------IRHHTQA--KEV 301 + L+DVSGSM K + K + L + V+Y + T A +E Sbjct: 914 LMLLLDVSGSMSSKDKLPLLKESFKYLISIMRPQDDVSIVIYAGDAAIVLKPTSASNQEQ 973 Query: 302 DEHEFFYSQETGGTIVSSALKLMDEVVKERYNPAQWNIYAAQASDGD 348 + G T V + KL + + + + N A+DG+ Sbjct: 974 INAVIDKLRSRGKTNVKAGFKLAYKWMSKNFKEGGNNRIIL-ATDGE 1019 Score = 44.0 bits (102), Expect = 0.012, Method: Composition-based stats. Identities = 27/106 (25%), Positives = 40/106 (37%), Gaps = 10/106 (9%) Query: 251 MFCLMDVSGSMDQSTKDMAKRFYILLYLFLSRTYKNVEVVY------IRHHTQAKEVDE- 303 + L+DVSGSM M K L + K VV+ + T AK + Sbjct: 687 LMLLLDVSGSMKN-ELPMLKSALKYLVNIMRPEDKVSVVVFGSEAKLMLRPTSAKYKAQI 745 Query: 304 -HEFFYSQETGGTIVSSALKLMDEVVKERYNPAQWNIYAAQASDGD 348 + +G T + LKL + ++ Y N ASDG+ Sbjct: 746 MQAIDTLKSSGRTNGEAGLKLAYQWIQNNYKNNNNNRIIL-ASDGE 790 >UniRef50_D0MZH7 Putative uncharacterized protein n=1 Tax=Phytophthora infestans T30-4 RepID=D0MZH7_PHYIN Length = 1850 Score = 53.2 bits (126), Expect = 2e-05, Method: Composition-based stats. Identities = 44/294 (14%), Positives = 94/294 (31%), Gaps = 42/294 (14%) Query: 28 AQIKQSISEAINKRSVTDVDSGESV---SIPTEDISEPMFHQGRGGLRHRVHPGNDHFVQ 84 ++ +++E + ++ G V S P + P G+ P ND V Sbjct: 1455 ERMYGTMAELASDGTLELKVDGSRVGGPSTPKTGLDTP--KYGKDD------PNNDPHVG 1506 Query: 85 NDRIERPQGGGGGSGSGQGQASQDGEGQDEFVFQISKDEYLDLLFED------LALPNLK 138 + GG +G G + V Q+S+++ ++ E +A L Sbjct: 1507 GNTWAGGTGGSDTAGLGGRGGPYRLDKGH-PVHQVSQEKKDEVSAEARAKARAMAQEALA 1565 Query: 139 QNQQRQLTEYKTHRAGYTANGVPANISVVRSLQNSLARRTAMTAGKRRELHALEENLAII 198 + R++ + Y + + R +A L A+ + + Sbjct: 1566 EK-LREIDMSEREWETYQ------------TYFKRVERESAQLRAVLANLEAVAQERNWL 1612 Query: 199 SNSEPAQLLEEERLRKEIAELRAKIERVPFIDTFDLRYKNYEKRPDPSSQAVMFCLMDVS 258 + +L + + + + + D ++ M +MDVS Sbjct: 1613 RHQSSGELDDGKL----VDGVAGERLVFKRRGVRDSPFQAPAGHQQEQEPKRMVFVMDVS 1668 Query: 259 GSM------DQSTKDMAKRFYILLYLFLSRTYKNVEVVYIRHHTQAKEVDEHEF 306 GSM D + M + +++ F + ++ H + E+ EF Sbjct: 1669 GSMYRFNGQDSRLERMLETSLMIMESFAGFE-RELDYCIFGHSGDSPEIPFVEF 1721 >UniRef50_Q231J4 von Willebrand factor type A domain containing protein n=1 Tax=Tetrahymena thermophila SB210 RepID=Q231J4_TETTH Length = 520 Score = 53.2 bits (126), Expect = 2e-05, Method: Composition-based stats. Identities = 26/136 (19%), Positives = 51/136 (37%), Gaps = 20/136 (14%) Query: 229 IDTFDLRYKNYEKRPDPSSQAVMFCLMDVSGSMDQSTKDMAKRFYILLYLFLSRTYKNVE 288 ++ D +K + ++D SGSM ++ K+ + + + + Sbjct: 76 VEAKDFDADQVKKDKVRYQPLDLIFVIDTSGSMQGKKIELVKKSILQVLHIIQGDDR--- 132 Query: 289 VVYIRHHTQAKEVDE-------------HEFFYSQETGGTIVSSALKLMDEVVKERYNPA 335 + + ++QAK + E Q GGT + ++ +++KER N Sbjct: 133 ISLVGFNSQAKVLLELTQLTKNSKKKIQKTVDELQAGGGTQIGFGMQKAFDIIKERTNSK 192 Query: 336 QWNIY-AAQASDG-DN 349 N+ SDG DN Sbjct: 193 --NLASIFLLSDGQDN 206 >UniRef50_Q22SJ4 von Willebrand factor type A domain containing protein n=8 Tax=Tetrahymena thermophila RepID=Q22SJ4_TETTH Length = 646 Score = 53.2 bits (126), Expect = 2e-05, Method: Composition-based stats. Identities = 23/115 (20%), Positives = 46/115 (40%), Gaps = 15/115 (13%) Query: 251 MFCLMDVSGSMDQSTKDMAKRFYILLYLFLSRTYKNVEVVYIRHHT---QAKEVDEHE-- 305 + C++D SGSM K + L L+ + +++ + T ++VD+ Sbjct: 210 LVCVIDNSGSMQGEKIQNVKTTLLQLLDMLNSNDRLSLILFNSYPTLLCNLRKVDDENTP 269 Query: 306 -----FFYSQETGGTIVSSALKLMDEVVKER--YNPAQWNIYAAQASDGDNWADD 353 GGT ++S + + ++++R +NP SDG + D Sbjct: 270 NIQSIINSITADGGTDINSGMLMAFNILQKRQFFNPVSS---IFLLSDGQDNGAD 321 >UniRef50_B5JNR2 von Willebrand factor type A domain protein n=1 Tax=Verrucomicrobiae bacterium DG1235 RepID=B5JNR2_9BACT Length = 923 Score = 53.2 bits (126), Expect = 2e-05, Method: Composition-based stats. Identities = 38/246 (15%), Positives = 69/246 (28%), Gaps = 25/246 (10%) Query: 178 TAMTAGKRRELHALEENLAIISNSEPAQLLEEERLRKEIAELRAKIERVPFIDTFD-LRY 236 A EE + +P + + + PF D LR+ Sbjct: 513 LAQNVRPPAGTLRTEEFVNAFDYGDPTPPVARKI------GFTWERAHWPFAHDRDVLRF 566 Query: 237 K-NYEKRPDPSSQAV-MFCLMDVSGSMDQSTK-DMAKRFYILLYLFLSRTYKNVEVVYIR 293 SSQ + + +D SGSM + + D+ L L+ + V + R Sbjct: 567 SLQTAAHGRASSQPLHLTLAIDTSGSMSRPDRVDIVNSLATALQSNLTEKDRLSIVSFDR 626 Query: 294 HHT---QAKEVDEHEFF-----YSQETGGTIVSSALKLMDEVVKERYNPAQWNIYAAQAS 345 + V GGT + SAL+L + + + N + Sbjct: 627 QPRLVLDGQSVTAETNLATLATQLNPQGGTDLESALQLSYQTAQRHFQENAINRVIL-IT 685 Query: 346 DG-DNWADDSPL-CHEILAKKLLPVVRYYSYIEITRRAHQTLWREYEHLQSTFDNFAMQH 403 DG N + + + + + + + T F Sbjct: 686 DGAANLGNTNAEQLRTTVTENRIRGIALDCFGIGFDGHDDTFLESLSRNGDGRYRF---- 741 Query: 404 IRDQDD 409 +R +D Sbjct: 742 LRSPED 747 >UniRef50_C8SCW7 Putative uncharacterized protein n=1 Tax=Ferroglobus placidus DSM 10642 RepID=C8SCW7_FERPL Length = 403 Score = 52.8 bits (125), Expect = 2e-05, Method: Composition-based stats. Identities = 44/242 (18%), Positives = 83/242 (34%), Gaps = 19/242 (7%) Query: 118 QISKDEYLDLLFEDLALPNLKQNQQRQLTEYKTHRAGYTANGVPANISVVRSLQNSLARR 177 I+ E LD FE+ AL L + G T + R + +A++ Sbjct: 110 DINFKELLDYFFEE-ALKELIEMGI---------IEGVTKRFFRRKVKFSRQAERIIAQK 159 Query: 178 TAMTAGKRRELHALEENLAIISNSEPAQLLEEERLRKEIAELRAKIERVPFIDTFDLRYK 237 K + + E +S +L+E + + + D + Sbjct: 160 VMKEVSKEAKGYYAESEGETLSYIPGYELVEYDEYLHSYDLIDIPETMIRAAKNEDFEIR 219 Query: 238 N---YEKRPDPSSQAVMFCLMDVSGSMDQSTKDMAKRFYILLYLFLSRTYKNVEVVYIRH 294 + P + L+DVS SM A + L + + + + ++EV H Sbjct: 220 EKDIVSRNPKKVGKRHFVMLIDVSDSMRGKKIVGAIEAALALKMSIRKGFDDLEVFVFNH 279 Query: 295 HTQAKEVDEHEFFYSQETGGTIVSSALKLMDEVVKERYNPAQWNIYAAQASDGDNWADDS 354 T+ ++ E + G T ++ ALK ++ + Y +DG+ A + Sbjct: 280 RTE--KIREGDIVNVDVEGRTDIALALKTARNALRGKDGAK----YVILITDGEPTASYN 333 Query: 355 PL 356 PL Sbjct: 334 PL 335 >UniRef50_A1ZFT4 Von Willebrand factor, type A n=1 Tax=Microscilla marina ATCC 23134 RepID=A1ZFT4_9SPHI Length = 827 Score = 52.8 bits (125), Expect = 2e-05, Method: Composition-based stats. Identities = 30/183 (16%), Positives = 59/183 (32%), Gaps = 22/183 (12%) Query: 237 KNYEKRPDPSSQAVMFCLMDVSGSMDQSTKDMAKRFYILLYLFLSRT-------YKNVEV 289 K + P + V ++DVSGSM ++KR L L +++ Sbjct: 313 KAPKNSQIPPREYV--FIVDVSGSMHGFPLSVSKRLLKNLIGKLRPKDKFNVMLFESSNQ 370 Query: 290 VYIRHHTQAKEVDEHEFFYS----QETGGTIVSSALKLMDEVVKERYNPAQWNIYAAQAS 345 + +A + + + F + GGT + ALK + + ++ + Sbjct: 371 MMSPESMEATQANIQKAFGVIDQQRGGGGTRLLPALKKALAFKQTK----DYSRSFVVVT 426 Query: 346 DGDNWADDSPLCHEILAKKLLPVVRYYSYIEITRRAHQTLWREYEHLQSTFDNFAMQHIR 405 DG L + L +++ I ++ L F + H Sbjct: 427 DG---YVTVEKEAFDLIRNNLNRANLFAFG-IGSSVNRFLIEGMARAGMGEP-FIVTHGT 481 Query: 406 DQD 408 + D Sbjct: 482 EAD 484 >UniRef50_Q9ZGE6 Magnesium-chelatase 67 kDa subunit n=2 Tax=Heliobacteriaceae RepID=BCHD_HELMO Length = 666 Score = 52.8 bits (125), Expect = 3e-05, Method: Composition-based stats. Identities = 44/277 (15%), Positives = 91/277 (32%), Gaps = 35/277 (12%) Query: 108 DGEGQDEFVFQISKDEYLDLLFEDLALPNLKQNQQRQLTEYKTHRAGYTANGVPANISVV 167 + + + + + F+ +P + Q + R G A+G ++ Sbjct: 356 ETPPDEAPKDEQTLQLPEEFFFDAEEVPMEDELLSLQNKVQRQARGG--AHGKQKSLERG 413 Query: 168 RSLQNSLARRTAMTAGKRRELHALEENLAIISNSEPAQLLEEERLRKEIAELRAKIERVP 227 R + A+ + + + Q E + +R Sbjct: 414 RYAR-------ALLPPPGKNSRVAVDATLRAAAPYQRQRRESGQYG----------DRQV 456 Query: 228 FIDTFDLRYKNYEKRPDPSSQAVMFCLMDVSGSMDQSTKDMAK-RFYILLYLFLSRTYKN 286 + D+R K + ++ S A++ ++D SGSM + AK +LL K Sbjct: 457 IVTNSDIRAKQFVRK----SGALIIFVVDASGSMAFNRMSSAKGAVSVLLNEAYVNRDKV 512 Query: 287 VEVVYIRH-------HTQAKEVDEHEFFYSQETGGTIVSSALKLMDEVVKERYNPAQWNI 339 +++ T++ E+ + F GG+ ++ A+ EV + Sbjct: 513 ALIIFRGQQAETLVPPTRSVELAKKRFDQVPVGGGSPLAGAIAQAIEVGVNSIGSDVGQV 572 Query: 340 YAAQASDGD-NWADD---SPLCHEILAKKLLPVVRYY 372 +DG N D P E L +++L + R Sbjct: 573 IITLITDGRGNVPMDPQAGPKNREQLNEEILALSRLV 609 >UniRef50_O28828 Putative uncharacterized protein n=1 Tax=Archaeoglobus fulgidus RepID=O28828_ARCFU Length = 410 Score = 52.5 bits (124), Expect = 3e-05, Method: Composition-based stats. Identities = 39/247 (15%), Positives = 84/247 (34%), Gaps = 14/247 (5%) Query: 112 QDEFVFQISKDEYLDLLFEDLALPNLKQNQQRQLTEYKTHRAGYTANGVPANISVVRSLQ 171 D ++S + +D F+++ + L++ + E + HR + S+ Sbjct: 108 GDISKDELSMSQVVDNFFDEV-VDELQEMGYVEKVETRFHRKIIHYT------AKAESVL 160 Query: 172 NSLARRTAMTAGKRRELHALEENLAIISNSEPAQLLEEERLRKEIAELRAKIERVPFIDT 231 ++ +R E S ++++ + + + Sbjct: 161 AEKVLSLSLQNLDKRSYGEHETEKLGQSIFSSERIVDYDPFTHSYDNIDLVESLIASAMR 220 Query: 232 FDLRYKN---YEKRPDPSSQAVMFCLMDVSGSMDQSTKDMAKRFYILLYLFLSRTYKNVE 288 ++ ++P + + V L+DVS SM A + L + R E Sbjct: 221 GEIELNENEMVARQPKHTEKCVYVMLIDVSDSMRGRKIVGAIEAALCLRKAIRRAGSGDE 280 Query: 289 VVYIRHHTQAKEVDEHEFFYSQETGGTIVSSALKLMDEVVKERYNPAQWNIYAAQASDGD 348 + I + +A E+ E E + G T + ALK +++K SDG+ Sbjct: 281 LRVIAFNHRAHEIKEGEILNLEARGRTDIGLALKRARKILKGSSGTG----VVFLISDGE 336 Query: 349 NWADDSP 355 + +P Sbjct: 337 PTSSYNP 343 >UniRef50_A8U1E2 Putative uncharacterized protein n=1 Tax=alpha proteobacterium BAL199 RepID=A8U1E2_9PROT Length = 683 Score = 51.7 bits (122), Expect = 5e-05, Method: Composition-based stats. Identities = 33/181 (18%), Positives = 56/181 (30%), Gaps = 23/181 (12%) Query: 251 MFCLMDVSGSMDQSTKDMAKRFYILLYLFLSRT-------YKNVEVVY----IRHHTQAK 299 + ++DVSGSM AK L R + N + +R + Sbjct: 330 VIFVIDVSGSMKGEPLRAAKASLTSGIEGLGRNDTFNVVAFNNKAAAFYDAPVRASGKFH 389 Query: 300 EVDEHEFFYSQETGGTIVSSALKLMDEVVKERYNPAQWNIYAAQASDGDNWADDSPLCHE 359 + GGT +++A +L ++ +P + +DG A + Sbjct: 390 RAALKVIDGLKAGGGTEMAAAFELALQMPG---DPDRLQQVVF-ITDG---AVSNEAALF 442 Query: 360 ILAKKLLPVVRYYSYIEITRRAHQTLWREYEHLQSTFDNFAMQHIRDQDDIYPVFRELFH 419 K L R ++ I + E +I D V R+LF Sbjct: 443 NQIKGELGARRLFTVG-IGSAPNTFFMEEAARFGRGT----YTYIGDTSSAERVMRDLFT 497 Query: 420 K 420 K Sbjct: 498 K 498 >UniRef50_A0C051 Chromosome undetermined scaffold_14, whole genome shotgun sequence n=3 Tax=Paramecium tetraurelia RepID=A0C051_PARTE Length = 636 Score = 51.3 bits (121), Expect = 6e-05, Method: Composition-based stats. Identities = 30/164 (18%), Positives = 57/164 (34%), Gaps = 23/164 (14%) Query: 251 MFCLMDVSGSMDQSTKDMAKRFYILLYLFLSRTYKNVEVVYIRHHTQA------KEVDEH 304 + CL+D+SGSM +M K I+L FL + + I A K V Sbjct: 162 LICLIDISGSMIGVKIEMVKASLIVLLQFLGDNDR---LQLITFDNDAHRLTPLKTVTNQ 218 Query: 305 E-------FFYSQETGGTIVSSALKLM-DEVVKERYNPAQWNIYAAQASDGDNWADDSPL 356 + GG +S A K+ ++ +Y +++ SDG ++ Sbjct: 219 NKSYFTQIIKQIKANGGNRISEATKMAFYQLKSRKYINNVTSVFLL--SDGVDYTYPEVK 276 Query: 357 CHEILAKKLLPVVRYYSYIEITRRAHQTLWREYEHLQSTFDNFA 400 ++ + + + E + + +L+S F Sbjct: 277 NQIQTVNEVF-TLHTFGFGE---DHDAQMMTQLCNLKSGSFYFV 316 >UniRef50_B8HNU4 von Willebrand factor type A n=4 Tax=Chroococcales RepID=B8HNU4_CYAP4 Length = 421 Score = 50.9 bits (120), Expect = 8e-05, Method: Composition-based stats. Identities = 29/180 (16%), Positives = 61/180 (33%), Gaps = 25/180 (13%) Query: 254 LMDVSGSMDQSTKDMAKRFYI-LLYLFLSRT------YKNVEVVYI-RHHTQAKEVDEHE 305 ++D SGSM + KR L+ L + +V V I ++ + Sbjct: 47 ILDHSGSMAGQPLETVKRAAQKLVDRLLPSDRLAVIVFDHVAKVLIPNQPVTDRDKIKTR 106 Query: 306 FFYSQETGGTIVSSALKLMD-EVVKERYNPAQWNIYAAQASDGDNWADDSPLCHEILAKK 364 + GGT + L+L E++ + +DG+N ++ C ++ + Sbjct: 107 ISHLAAMGGTAIDEGLQLGLTELIAAKAGAISQ---IFLLTDGENEHGNNSRCLQLAEEA 163 Query: 365 LLPVVRY----YSYIEITRRAHQTLWREYEHLQSTFDNFAMQHIRDQDDIYPVFRELFHK 420 + + Y +Q + + ++ I D+ F LF++ Sbjct: 164 AKENITLNTLGFGY-----HWNQDVLEQIADAAGG----SLMFIEYPQDVLIGFERLFNQ 214 >UniRef50_A2SP98 Putative uncharacterized protein n=1 Tax=Methylibium petroleiphilum PM1 RepID=A2SP98_METPP Length = 791 Score = 50.9 bits (120), Expect = 1e-04, Method: Composition-based stats. Identities = 72/386 (18%), Positives = 130/386 (33%), Gaps = 48/386 (12%) Query: 22 FLRRYKAQIKQSISEAINKRSVTDVDSGESVSIP------TEDISEPMFHQGRGGLRHRV 75 +L K + ++ I ++ + S P +D+ M G Sbjct: 364 YLDMLKPSERVEMAMKILEKILQPQKSNGMPQQPQNGGLTIKDLERAMGRGGAPN----- 418 Query: 76 HPGNDHFVQNDRIERPQGGGGGSGSGQG-QASQDGEGQDEFVFQISKDEYLDLLFE-DLA 133 PGN + + G GSG+ A GQD +S ++ L + ++ Sbjct: 419 -PGNGNSQSGGQPGDQAGAQDGSGTEDMVPAPTVTHGQDH---VMSTEDLAQALHDAGVS 474 Query: 134 LPNLKQNQQRQLTEYKTHRAGYTANGVPANISVVRSLQNSLARRTAMTAGKRRELHALEE 193 + + L + +GV + I+ Q + R + + Sbjct: 475 SDTMAKLGFDDLKKIPEEVKH-AKDGVVSAINKASEDQMKVGSRYPGGHLLHYAKAQMLD 533 Query: 194 NLAIISNSEPAQLLEEERLRKEIAELRAKIERVPFIDTFDLRYKNYEKRPD-------PS 246 + E A E K + + +D D+ +K+ P Sbjct: 534 FFKPVLTWEMAHKKLLEACGKGSRYDPTEPWTLYHVDAADMGFKHQRDVPFMGSRMPGKE 593 Query: 247 SQAVMFCLMDVSGSMDQSTKDMAKRFYILLYLFLSRTYKNV--EVVYIRHHTQAKEVDEH 304 + +MF ++D SGS+D + M KRF R + V +V+ T + V E Sbjct: 594 QKPLMFDIIDTSGSVDDA---MLKRFVSEALNQARRVSRGVAPDVLISWADTICRGVPEF 650 Query: 305 -------EFFYS----QETGGTIVSSALKLMDEVVK--ERYNPAQWNI-YAAQASD-GDN 349 +F GGT +A++ + E+VK + A+ NI +D GD+ Sbjct: 651 ISEKNYKQFLTKGINYGGRGGTNFQAAIENVLEMVKPGSKSGYAKRNIDAICYMTDSGDS 710 Query: 350 WADDSPLCHEIL---AKKLLPVVRYY 372 D + L + KKL P++ Sbjct: 711 VPDPARLLRKAQECGLKKLPPILFLV 736 >UniRef50_B8G546 von Willebrand factor type A n=6 Tax=Chloroflexi (class) RepID=B8G546_CHLAD Length = 418 Score = 50.5 bits (119), Expect = 1e-04, Method: Composition-based stats. Identities = 29/183 (15%), Positives = 60/183 (32%), Gaps = 19/183 (10%) Query: 253 CLMDVSGSMDQSTKDMAKRFYILLYLFLSRT--------YKNVEVVYIRHHTQAKEVDEH 304 ++D SGSM + + K + L V+ + + Sbjct: 47 FVLDRSGSMQGAKLESMKAATRRVIELLRPHDVAAIVIFDDTVQTLIPATPVGDRSALLA 106 Query: 305 EFFYSQETGGTIVSSALKLMDEVVKERYNPAQWNIYAAQASDGDNWADDSPLCHEILAKK 364 E GGT +S ++ +++ P + + +DG W D P+C + LA+ Sbjct: 107 AVETITEAGGTAMSLGMQAAQTELQKHLGPDRISRMLL-LTDGQTWG-DEPICRD-LART 163 Query: 365 LLPVVRYYSYIEITRRAHQTLWREYEHLQSTFDNFAMQHIRDQDDIYPVFRELFHKQNAT 424 L + + + ++ L + + ++ I D I F + Sbjct: 164 LGQAGVRITALGLGTEWNEQLLDDIAAASDGYSDY----IADPAQI----ETFFQQAVKE 215 Query: 425 AKG 427 A+ Sbjct: 216 AQA 218 >UniRef50_D2V0V6 Predicted protein n=1 Tax=Naegleria gruberi RepID=D2V0V6_NAEGR Length = 502 Score = 50.5 bits (119), Expect = 1e-04, Method: Composition-based stats. Identities = 35/199 (17%), Positives = 67/199 (33%), Gaps = 32/199 (16%) Query: 182 AGKRRELHALEENLAIISNSEPAQLLEEERLRKEIAELRAKIERVPFIDTFDLRYKNYEK 241 L E L+ + ++ + ++ + E +PF+ L + Sbjct: 2 KVSESALQKFETLLSNFAPQSVNLSVQIDAGHLPTDQIGVQCE-IPFLVRL-LSGNLPPQ 59 Query: 242 RPDPSSQAVM-----FCL-MDVSGSMDQSTKDMAKRFYI---------LLYLFLSRTYKN 286 + + V+ CL +D+SGSMD+ K+ +K + L+ FL+ Sbjct: 60 EEEAETTNVLKTPVNICLVLDISGSMDEPLKNRSKGSKLTACKSAIRELVTNFLTYKD-- 117 Query: 287 VEVVYIRHHTQAKEVDEH---------EFFYSQETGGTIVSSALKLMDEVVKERYNPAQW 337 + I + K V + G T ++SAL +++ P Sbjct: 118 -TIHLITYSDSPKTVFTEKNKESVNLNDIDKISTEGSTNIASALHSAVDLLHNSNAPGT- 175 Query: 338 NIYAAQASDGD-NWADDSP 355 A SDG N + + Sbjct: 176 -KLIAFFSDGQCNVGETNL 193 >UniRef50_D1WZ12 VWA containing CoxE family protein n=13 Tax=Streptomyces RepID=D1WZ12_9ACTO Length = 1289 Score = 50.2 bits (118), Expect = 2e-04, Method: Composition-based stats. Identities = 43/291 (14%), Positives = 80/291 (27%), Gaps = 24/291 (8%) Query: 45 DVDSGESVSIPTEDISEPMFHQGRGGLRHRVHPGNDHFVQNDRIERPQGGGGGSGSGQGQ 104 D + + T G G PG P Sbjct: 935 DQLPSGAARLATALDELYGAGHGEGSRGGLSGPGRTGSRGGREPSFPGVREWSEELAALF 994 Query: 105 ASQDGEGQDEFVFQISKDEYLDLLFEDLALPNLKQNQQRQLTEYKTHRAGYTANGVPANI 164 E + + L L A P++ +L AG A + Sbjct: 995 GPGVREEVLAAAAVTGRQDVLAELDPAAATPSV------ELLRTILRYAG---GLPEARL 1045 Query: 165 SVVRSLQNSLARRTAMTAGKRRELHALEENLAIISNSEPAQLLEEERLRKEIAELRAKIE 224 + +R L L R LA + +L LR +A R + Sbjct: 1046 AALRPLVRHLVDELTRQLTTRLRPALTGTMLARPTRRPGGRLDLPRTLRANLATARRTAD 1105 Query: 225 RVPFIDTFDLRYKNYEKRPDPSSQAVMFCLMDVSGSMDQSTKDMAKRFYILLYLFLSRTY 284 + +++ R S+ + + DVSGSM+ A + L + Sbjct: 1106 GTVQVIPQKPVFRS---RARRSADWRLILVTDVSGSME------ASTIWSALTASVLAGV 1156 Query: 285 KNVEVVYIRHHTQAKEVDEHEFFYS------QETGGTIVSSALKLMDEVVK 329 + ++ T+ ++ H GGT +++ L+ +++ Sbjct: 1157 PTLSTHFLAFSTEVVDLTGHVHDPLSLLLEVSVGGGTHIAAGLRHARGLIE 1207 >UniRef50_D2RGD5 von Willebrand factor type A n=1 Tax=Archaeoglobus profundus DSM 5631 RepID=D2RGD5_ARCPR Length = 411 Score = 50.2 bits (118), Expect = 2e-04, Method: Composition-based stats. Identities = 38/235 (16%), Positives = 88/235 (37%), Gaps = 20/235 (8%) Query: 118 QISKDEYLDLLFEDLALPNLKQNQQRQLTEYKTHRAGYTANGVPANISVVRSLQNSLARR 177 +S E ++ +E++ + +LK+ + + GY + R+L + + Sbjct: 114 DLSTSELVNYFYEEI-IEDLKKEGYLE----DDYFRGYKFT-----KNAERALSKKIL-Q 162 Query: 178 TAMTAGKRRELHALEENLAIISNSEPAQLLEEERLRKEIAELRAKIERVP-FIDTFDLRY 236 ++ + E +S+ +++E + LR + + V + LR+ Sbjct: 163 LSLQDLTGEDFGEHETEKTGVSSFLKNEIVEYDELRHSYDSIDLQETLVKCALRDPSLRF 222 Query: 237 KNYEKRPDP---SSQAVMFCLMDVSGSMDQSTKDMAKRFYILLYLFLSRTYKNVEVVYIR 293 + + V L+DVS SM A + L + ++ E+ + Sbjct: 223 DERDLVAREGKHMEKCVYVMLIDVSDSMRGRRIVGALESALALRKVIKKS-NMDELHVVA 281 Query: 294 HHTQAKEVDEHEFFYSQETGGTIVSSALKLMDEVVKERYNPAQWNIYAAQASDGD 348 + + +++ + E + G T + ALK E++K+R +DG+ Sbjct: 282 FNHRVRKIKDEEILNLRTRGRTDIGLALKTAREIIKKRRGSG----VIFLITDGE 332 >UniRef50_A0CHZ1 Chromosome undetermined scaffold_185, whole genome shotgun sequence n=1 Tax=Paramecium tetraurelia RepID=A0CHZ1_PARTE Length = 265 Score = 50.2 bits (118), Expect = 2e-04, Method: Composition-based stats. Identities = 43/207 (20%), Positives = 80/207 (38%), Gaps = 24/207 (11%) Query: 163 NISVVRSLQNSLARRTAMTAGKRRELHALEENLAIISNSEPAQLLEEERLRKEIAELRAK 222 N SL +++ TA + ++ + ++P E+ L EI L+ Sbjct: 55 NFGYQHSLGPKYSQQLPQTAISQEIFDDDDQVQTNLVQAKPNMYDLEKELIFEIKTLQKM 114 Query: 223 IE-------RVPFIDTFDLRYKNYEKRPDPSSQAVMFCLMDVSGSMDQSTKDMAKRFYIL 275 I+ ++P I + + + + CL+D S SM+ S + K+ + Sbjct: 115 IKLSKISTQQLPGIISIKTK-DQLNDQDLNRVGVDLICLIDKSSSMNGSKIETVKQSLKV 173 Query: 276 LYLFLSRTYKNVEVVYIRHH---TQAKEVDEHE-------FFYSQETGGTIVSSALKLMD 325 L FLS + +++ H T K + E + GGT +SSA ++ Sbjct: 174 LLTFLSNQDRLQLIIFNTHAKRLTPLKRITEDNKLYFTQMIDQIKSDGGTQISSATQIAI 233 Query: 326 -EVVKERYNPAQWNI-YAAQASDG-DN 349 ++ +Y + N+ SDG DN Sbjct: 234 SQLKGRKY---RNNVSSVFLLSDGQDN 257 >UniRef50_Q1D6F9 von Willebrand factor type A domain protein n=2 Tax=Cystobacterineae RepID=Q1D6F9_MYXXD Length = 592 Score = 49.8 bits (117), Expect = 2e-04, Method: Composition-based stats. Identities = 35/265 (13%), Positives = 70/265 (26%), Gaps = 31/265 (11%) Query: 175 ARRTAMTAGKRRELHALEENLAIISNSEPAQLLEEERLRKEIAELRAKIERVPFIDTFDL 234 R +EE + E + + + + Sbjct: 174 RRYLVNGQLPPASAVRVEEFVNYFKFRYAPP--ETGAFAVHLEGAPSPFDAKRHFLRVGV 231 Query: 235 RYKNYEKRPDPSSQAVMFCLMDVSGSMDQSTK-DMAKRFYILLYLFLSRTYKNVEVVYIR 293 + K + A + L+D SGSM K +A+ + L+ V Y Sbjct: 232 QGKVVSRSQRKP--AHLVFLVDTSGSMHSEDKLPLAREAIKVAVKNLNENDTVAIVTYAG 289 Query: 294 HH---------TQAKEVDEHEFFYSQETGGTIVSSALKLMDEVVKERYNPAQWNIYAAQA 344 + T AK + GGT + S ++L ++ + + Sbjct: 290 NTRDVLPPTPATDAKSI-HAALDSLTAGGGTAMGSGMELAYRHAVKKASGSVV-SRVVVL 347 Query: 345 SDGD-----NWADDSPLCHEILAKKLLPVVRYYSYIEITRRAHQTLWREYEHLQSTFDNF 399 +DGD N + + + + K V + L + + + Sbjct: 348 TDGDANIGRNVSAN--AMLDSIHKYTAEGVTLTTVGFGMGNYRDDLMEKLADKGNGNCFY 405 Query: 400 AMQHIRDQDDIYPVFRELFHKQNAT 424 D +++F Q Sbjct: 406 V--------DSLREAKKVFETQLTG 422 >UniRef50_A4C730 Putative uncharacterized protein n=1 Tax=Pseudoalteromonas tunicata D2 RepID=A4C730_9GAMM Length = 684 Score = 49.8 bits (117), Expect = 2e-04, Method: Composition-based stats. Identities = 31/179 (17%), Positives = 60/179 (33%), Gaps = 23/179 (12%) Query: 251 MFCLMDVSGSMDQSTKDMAKR--FYILLYLFLSRTYKNVEVVYIRHHTQAKEVDEHEF-- 306 + ++D SGSM + + AK F+ L L ++ +E + A+ + ++F Sbjct: 332 VIFVIDTSGSMHGESLEQAKSALFFALANLDPQDSFNIIEFNSKVNALNAQALPANDFNI 391 Query: 307 -------FYSQETGGTIVSSALKLMDEVVKERYNPAQWNIYAAQASDGDNWADDSPLCHE 359 + + GGT + A + + + A + +DG + + Sbjct: 392 RRARNFVYGLKADGGTEIGLAFEQVLD----NSEHADYLRQIVFLTDG---SISNETEVF 444 Query: 360 ILAKKLLPVVRYYSYIEITRRAHQTLWREYEHLQSTFDNFAMQHIRDQDDIYPVFRELF 418 K L R ++ I + L F I D D+ + LF Sbjct: 445 AQIKGSLGDSRIFTIG-IGSAPNSYFMTRAATLGRGTFTF----IGDVTDVQRTMKNLF 498 >UniRef50_A6BYV9 Putative uncharacterized protein n=1 Tax=Planctomyces maris DSM 8797 RepID=A6BYV9_9PLAN Length = 1197 Score = 49.4 bits (116), Expect = 3e-04, Method: Composition-based stats. Identities = 62/422 (14%), Positives = 120/422 (28%), Gaps = 65/422 (15%) Query: 24 RRYKAQIKQSISEA-INKRSVTDVDSGESVSIPTEDISEP-------MFHQ-------GR 68 R+ + I + + + ++ D + P P + Sbjct: 791 RKGREAIANLFPDFPLRQLNIKDPFAK-----PIRVAEAPGEIPLADRWRLILGVKGCST 845 Query: 69 GGLRHRVHPGNDHFVQNDRIERPQGGG--GGSGSGQGQASQDGEGQDEFVFQISKDEYLD 126 + + + ++R R G G + A E + KD + Sbjct: 846 PKSQQVAGTLDQLYGGSEREGRGLQGDLASDRGGTEAAAPSVREWISDVERLFGKDVCEE 905 Query: 127 LLFEDLA------LPNLKQNQQRQLTEYKTHRAGYTANGVPANISVVRSLQNSLARRTAM 180 +L E L +L R E + ++R L ++ R A Sbjct: 906 VLGEAAVNGRAAVLEHLNHATVRPSVELLEQVLSLRGALSERELGLLRKLARNITERMAK 965 Query: 181 TAGKRRELHALEENLAIISNSEPAQLLEEERLRKEIAELRAKIERVPFIDTFDLRYKNYE 240 R ++A + +L L + K + I L Y+ Sbjct: 966 QLANRLRPALHGLSIARPTRRRSPRLDFARTLNSNLHTAYRKSDGRISIAPTRLVYRLPA 1025 Query: 241 KRPDPSSQAVMFCLMDVSGSMDQSTKD---MAKRFYILLYLFLSRTYKNVEVVYIRHHTQ 297 KR + ++DVSGSM+ S MA F L ++V + TQ Sbjct: 1026 KRQM---DWHLIFVVDVSGSMEASVIYSSMMAAIFSAL---------PAIDVKFFAFSTQ 1073 Query: 298 AKEVD---EHEFFYSQE---TGGTIVSSALKLMDEVVKERYNPAQWNIYAAQASDGDNWA 351 + E E GGT + L+ E + NP++ + +D Sbjct: 1074 VIDFTGRVEDPLSLLMEIQIGGGTHIGLGLRAARESIT---NPSRTLVVL--VTD----F 1124 Query: 352 DDSPLCHEILAKKLL---PVVRYYSYI----EITRRAHQTLWREYEHLQSTFDNFAMQHI 404 ++ E+L++ ++ + E R H + + + Sbjct: 1125 EEGVSVPELLSEVVMLSSSGAKLIGLAALNDEAKPRYHAGTAAAVVQAGMPVAAVSPERL 1184 Query: 405 RD 406 + Sbjct: 1185 AE 1186 >UniRef50_Q23KK4 von Willebrand factor type A domain containing protein n=1 Tax=Tetrahymena thermophila SB210 RepID=Q23KK4_TETTH Length = 1085 Score = 49.4 bits (116), Expect = 3e-04, Method: Composition-based stats. Identities = 38/260 (14%), Positives = 88/260 (33%), Gaps = 46/260 (17%) Query: 180 MTAGKRRELHALEENLAIISNSEPAQLLEEERLRKEIAELRAKIERVPFIDTFDLRYKNY 239 + + + + E A +Q + + I L +K + +Y +Y Sbjct: 382 LKQLEEEKAKLIREKSAFWDKKNSSQEARLSQYSQSINSLNSKYPLGKMCSIVEKKYFHY 441 Query: 240 EKRPDPSSQAVMFCLMDVSGSMDQSTKDMAKRFYILLYLFLSRTYKNVEVVYIRHHTQAK 299 + + D SGS + + L Y + YI+ + + Sbjct: 442 ------------YFIQDESGSFSNDHQYAIQGVAQLFNRIKPNDY----ITYIKFDSSSH 485 Query: 300 E---------VDEHEFFYSQE---TGGTIVSSALKLMDEVVKERYNPAQWNIYAAQASDG 347 + + +F + GGT SA + + + ++ +Y+ ++ + +DG Sbjct: 486 VDIPKTLKSSLSQGDFISKIQKCRGGGTNFQSAFQTLLQQIQSKYDQQEYPVVIF-ITDG 544 Query: 348 -DNWADDSPLCHEILAKKLLPVVRYY--SYIEITRRAHQTLWREYEHLQSTFDNFAMQHI 404 DN DS L + +Y Y + + +++ + F+N + Sbjct: 545 QDNTDLDS---IISQITSLCQDIVFYTIGYGSVNEKY-------LKNITNKFNN----TV 590 Query: 405 RDQDDIYPVFRELFHKQNAT 424 ++ +I +LF+ +N Sbjct: 591 GEKKEINGKPVDLFYVKNTP 610 >UniRef50_D2S019 ATPase associated with various cellular activities AAA_5 n=1 Tax=Haloterrigena turkmenica DSM 5511 RepID=D2S019_9EURY Length = 665 Score = 49.4 bits (116), Expect = 3e-04, Method: Composition-based stats. Identities = 38/191 (19%), Positives = 65/191 (34%), Gaps = 16/191 (8%) Query: 168 RSLQNSLARRTAMTAGKRRELHALEENLAIISNSEPAQLLEEERLRKEIAELRAKIERVP 227 R+++ +LARRT R + + + + L + Sbjct: 385 RTMREALARRTPSKVDVRSGRYVRARDSESVDDVAIDATLRAAAPHQPARRETDDSSSGI 444 Query: 228 FIDTFDLRYKNYEKRPDPSSQAVMFCLMDVSGS-MDQSTKDMAKRFYILLYLFLSRTYKN 286 I+ DLR K E+R ++A++ ++D SGS M KR + L R Sbjct: 445 AIEPKDLRQKIRERR----AEALVVFVVDASGSVMSGRQMFETKRGILSLVEDAYRARDR 500 Query: 287 VEVVY--------IRHHTQAKEVDEHEFFYSQETGGTIVSSALKLMDEVV--KERYNPAQ 336 V VV + T+ G T ++ L E+V + R + Sbjct: 501 VAVVVFREEGAFTLVEPTRNLSAARRAVSKLTVGGNTPLAHGLVEAYELVERERRRDEDL 560 Query: 337 WNIYAAQASDG 347 + + SDG Sbjct: 561 YPLVVL-FSDG 570 >UniRef50_O26551 Magnesium chelatase subunit ChlI n=1 Tax=Methanothermobacter thermautotrophicus str. Delta H RepID=O26551_METTH Length = 591 Score = 49.0 bits (115), Expect = 3e-04, Method: Composition-based stats. Identities = 31/160 (19%), Positives = 54/160 (33%), Gaps = 15/160 (9%) Query: 240 EKRPDPSSQAVMFCLMDVSGSMDQSTK-DMAKRFYILL--YLFLSRT------YKNVEVV 290 EK S+A+ ++D S SM K AK LL + R ++ E Sbjct: 418 EKVRIGKSRALYIIVLDTSSSMRLERKIKFAKTVSWLLLRDSYEKRNRIALIAFRGYEAN 477 Query: 291 YIRHHTQAKEVDEHEFFYSQETGGTIVSSALKLMDEVVKERYNPAQWNIYAAQASDG--D 348 + T E E + G T ++ AL+L EV + A + SDG + Sbjct: 478 LVVEPTSNLETVEEALEGLRSGGRTPLTPALRLAAEVASSSSDEACTAVVI---SDGRCN 534 Query: 349 NWADDSPLCHEILAKKLLPVVRYYSYIEITRRAHQTLWRE 388 + + + + + L + ++ E Sbjct: 535 VFINSNLEEDMNMLETELRNLNLL-FVNAEPEKRSLGILE 573 >UniRef50_Q9U7P4 Putative uncharacterized protein (Fragment) n=1 Tax=Eufolliculina uhligi RepID=Q9U7P4_9CILI Length = 494 Score = 49.0 bits (115), Expect = 3e-04, Method: Composition-based stats. Identities = 29/182 (15%), Positives = 58/182 (31%), Gaps = 29/182 (15%) Query: 251 MFCLMDVSGSMDQSTKDMAKRFYILLYLFLSRTYKNVEVVYIRHHTQAKEVDE------- 303 + C++DVSGSM + + + LS + + I A ++ Sbjct: 91 IVCVIDVSGSMQGEKIQLVQTTLNFMVERLSPADR---ICLISFSNDATKISRLVQMSPK 147 Query: 304 ------HEFFYSQETGGTIVSSALKLMDEVVKERYNPAQWNIYAAQASDG-DNWADDSPL 356 +GGT + L+ + +++R Q + SDG DN Sbjct: 148 GKKQLKSMIPRLVASGGTNIVGGLEYGLQALRQRRTINQLSSIIL-LSDGQDNNGTTVLQ 206 Query: 357 CHEILAKKLLPVVRY----YSYIEITRRAHQTLWREYEHLQSTFDNFAMQHIRDQDDIYP 412 + ++ Y + Y TL ++ + ++D++ I Sbjct: 207 RAKATMDSIVIRDDYSVHTFGYG---HGHDSTLLNALAEPKNGAFYY----VKDEETIAT 259 Query: 413 VF 414 F Sbjct: 260 AF 261 >UniRef50_Q114A2 von Willebrand factor, type A n=3 Tax=Cyanobacteria RepID=Q114A2_TRIEI Length = 1204 Score = 49.0 bits (115), Expect = 3e-04, Method: Composition-based stats. Identities = 30/196 (15%), Positives = 66/196 (33%), Gaps = 26/196 (13%) Query: 191 LEENLAIISNSEPAQLLEEERLRKEIAELRAKIERVPFIDTFDLRYKNYEKRPDPSSQAV 250 L + + Q E + + L F++ + R E +P ++ Sbjct: 577 LPLVVDSYLQEKQKQEAREAQAKAAPERLLEPE----FVENPEQRLPEPEFVENPENRCP 632 Query: 251 MFCLMDVSGSMDQS---TKDMAKRFYILLYLFLSRTYKNVEVVYIRHHTQAKEVDEHEFF 307 + L+D S SM + + + VE+ I +++ + V +F Sbjct: 633 IILLLDTSYSMSGEAITELNQGVKIFQASVKEDELASLRVEIAVITFNSEIEVV--QDFV 690 Query: 308 --------YSQETGGTIVSSALKLMDEVVKER------YNPAQWNIYAAQASDG---DNW 350 + +G T + A++ E++++R + + + +DG D W Sbjct: 691 TVDKFIPKTLEASGVTHMGKAIEKALELLEKRKQDYKNSDIQYYRPWIFLITDGQPTDTW 750 Query: 351 ADDSPLCHEILAKKLL 366 D + E + L Sbjct: 751 QDAAKKIEEAETNRKL 766 >UniRef50_B9SJS6 Protein binding protein, putative n=6 Tax=Magnoliophyta RepID=B9SJS6_RICCO Length = 540 Score = 48.6 bits (114), Expect = 4e-04, Method: Composition-based stats. Identities = 38/209 (18%), Positives = 73/209 (34%), Gaps = 14/209 (6%) Query: 156 TANGVPANISVVRSLQNSLARRTAMTAGKRRELHALEENLAIISNSEPAQLLEEERLRKE 215 G AN+ + L + + + +E + S P + +LR Sbjct: 10 RRRGRRANLLAGGEAEQKLPLVPLLPPPLKMSSNDDDEKIVTRSRPTPPIVPARVKLR-S 68 Query: 216 IAELRAKIERVPFIDTFDLRYKNYEKRPDPSSQAVMFCLMDVSGSMDQSTKDMAKRFYIL 275 I A +E +L + P + ++DVS SM+ + K + Sbjct: 69 INNDMAPLEESKLKVMLELTGGDSSSYGRPGLD--LVAVLDVSRSMEGDKMEKMKTAMLF 126 Query: 276 LYLFLSRTYKNVEVVY---------IRHHT-QAKEVDEHEFFYSQETGGTIVSSALKLMD 325 + L T + V + +R T +++E E+ G T +++ L+ Sbjct: 127 IIKKLGPTDRLSIVTFSGGANRLCPLRQTTGKSQEEFENLINGLNADGATNITAGLQTAL 186 Query: 326 EVVKERYNPAQWNIYAAQASDGD-NWADD 353 +V+K R + + SDG+ N D Sbjct: 187 KVLKGRSFNGERVVGIMLMSDGEQNAGSD 215 >UniRef50_Q23AA2 Putative uncharacterized protein n=1 Tax=Tetrahymena thermophila SB210 RepID=Q23AA2_TETTH Length = 968 Score = 48.6 bits (114), Expect = 4e-04, Method: Composition-based stats. Identities = 34/207 (16%), Positives = 66/207 (31%), Gaps = 25/207 (12%) Query: 222 KIERVPFIDTFDLRYKNYEKRPDPSSQAVMFCLMDVSGSMDQSTKDMAKRFYILLYLFLS 281 + +R+ + R K S + ++D SGSM K +A + S Sbjct: 2 EEKRIYYKIPLK-RVKATTTTEKGGSNLHIVGIIDASGSMSSWWKWIA-------EFWNS 53 Query: 282 RTYKNVEVVYIRHHTQAKEVDEH---EFFYSQETGGTIVSSALKLMDEVVKERYNPAQWN 338 + + I A+ + + G T + A K+ + V+ P + Sbjct: 54 ESIPKENLHTITFDGTARHCQSNVLSTRIHDHGGGMTAIPEAFKMFETVLD--SIPVNES 111 Query: 339 IYAAQASDGDNWADDSPLCHEILAKKLL-----PVVRYYSYIEITRRAHQTLWREYEHLQ 393 + A SDG D++ E KKL + + + R + Sbjct: 112 VTAIFISDG---QDNNLNTLEERMKKLKGNHENRKINFICLGIESGFPTFLSMRLRQLYH 168 Query: 394 STFDNFAMQHIRDQDDIYPVFRELFHK 420 +N ++ + Y + F+K Sbjct: 169 QGDENIPALYLIE----YVSEKAFFNK 191 >UniRef50_A0CDA1 Chromosome undetermined scaffold_17, whole genome shotgun sequence n=1 Tax=Paramecium tetraurelia RepID=A0CDA1_PARTE Length = 604 Score = 48.6 bits (114), Expect = 4e-04, Method: Composition-based stats. Identities = 30/164 (18%), Positives = 59/164 (35%), Gaps = 22/164 (13%) Query: 251 MFCLMDVSGSMDQSTKDMAKRFYILLYLFLSRTYKNVEVVY----IRHHTQAKEVDEHEF 306 + C++D SGSM +M K+ +L FL + + + R + DE++ Sbjct: 187 LLCVIDRSGSMSGEKIEMVKQTLNILLNFLGPKDRLCLIQFDDTCQRLTNLRRVTDENKT 246 Query: 307 FY------SQETGGTIVSSALKLMDEVVKERYNPAQWNIY-AAQASDGDNWADDSPLCHE 359 +Y GGT++ ++ + + +Y + N+ SDG + Sbjct: 247 YYSDIISKIYANGGTVIGLGTQMALKQI--KYRKSVNNVTAIFVLSDGQD-----EAAIS 299 Query: 360 ILAKKLL---PVVRYYSYIEITRRAHQTLWREYEHLQSTFDNFA 400 L K+L + +S+ L + +L F Sbjct: 300 SLQKQLAYYKQTLTIHSFG-FGSDHDAKLMTKISNLGKGSFYFV 342 >UniRef50_A0KNK1 Putative uncharacterized protein n=1 Tax=Aeromonas hydrophila subsp. hydrophila ATCC 7966 RepID=A0KNK1_AERHH Length = 552 Score = 48.2 bits (113), Expect = 5e-04, Method: Composition-based stats. Identities = 35/224 (15%), Positives = 70/224 (31%), Gaps = 22/224 (9%) Query: 167 VRSLQNSLARRTAMTAGKRRELHALEENLAIISNSEPAQLLEEERLRKEIAELRAKIERV 226 R+ + ++ A + + + + L Sbjct: 319 GRAYISEEKKKQA--RIPHASKSEVHGTHRSEDLARVLPTELLNLEDEALETLFYARFLE 376 Query: 227 PFIDTFDLRYKNYEKRPDPSSQ----AVMFCLMDVSGSMDQSTKDMAKRFYILLYLFLSR 282 + T++L+ + + +D SGSM + A+ + + L + Sbjct: 377 RNLMTYELQGTTCTSGEQLELEQKRTGPVVACLDTSGSMSGAPLLKARALLLAVSAVLQQ 436 Query: 283 TYKNVEVVYIRHHTQAKE--VDEHE-------FFYSQETGGTIVSSALKLMDEVVK--ER 331 +++ VV + + +E + E F GGT + L E+++ + Sbjct: 437 EARSLHVVLFGDNGELREYAIHEENSASGLLHFLRQGFGGGTDFETPLNRACEIIRDAKE 496 Query: 332 YNPAQWNIYAAQASDGDNWADDSPLCHEILAKKLLPVVRYYSYI 375 Y A SDGD D + H KK+L YS + Sbjct: 497 YEKAD----ILMISDGDCVLSDDYIEHLQTRKKIL-DCSIYSVL 535 >UniRef50_A3PUP3 von Willebrand factor, type A n=1 Tax=Mycobacterium sp. JLS RepID=A3PUP3_MYCSJ Length = 233 Score = 48.2 bits (113), Expect = 6e-04, Method: Composition-based stats. Identities = 36/172 (20%), Positives = 59/172 (34%), Gaps = 17/172 (9%) Query: 242 RPDPSSQAVMFCLMDVSGSMDQSTKDMAKRFYILLYLFLSRTY---KNVEVVYIRHHTQA 298 +P + L DVSGSM +R + +L K VEV + T A Sbjct: 12 DANPDPRVACVVLADVSGSMQGEPIAALERGFAAFTRYLQNEVLASKRVEVAVVTFGTVA 71 Query: 299 KE-VDEHEFFYSQ-----ETGGTIVSSALKLMDEVVKER---YNPAQWNIY---AAQASD 346 V E Q +G T +++ + L +++++R Y A Y +D Sbjct: 72 TVLVPMQEARTLQPVAFTASGTTNMAAGIHLALDILEDRKHAYKAAGLQYYRPWILLLTD 131 Query: 347 GD-NWADDSPLCHEILAKKLLPVVRYYSYIEITRRAHQTLWREYEHLQSTFD 397 G N + A + V ++ R +Q L R +S Sbjct: 132 GKPNLDGFDEAVARLNAVESARGVTVFAVGAGPRVDYQQLGR-LSLQRSPAP 182 >UniRef50_B9MS10 von Willebrand factor type A n=2 Tax=Anaerocellum thermophilum DSM 6725 RepID=B9MS10_ANATD Length = 1188 Score = 48.2 bits (113), Expect = 6e-04, Method: Composition-based stats. Identities = 21/119 (17%), Positives = 45/119 (37%), Gaps = 18/119 (15%) Query: 251 MFCLMDVSGSM-----DQSTKDMAKRFYILLYLFLSRTYKNVEVV------YIRHH-TQA 298 + ++D SGSM + K AK F L VV Y+ T Sbjct: 500 LVFVLDSSGSMSWNDPNGYRKIAAKSFVDALIQG-----DRAAVVDFDNFGYLLQPLTTD 554 Query: 299 KEVDEHEFFYSQETGGTIVSSALKLMDEVVKERYNPAQWNIYAAQASDGDNWADDSPLC 357 + ++ GGT ++ +++ ++ + R + + + +DG+ + D++ Sbjct: 555 FQAVKNAIDRIDSWGGTNIAEGIRIANQQLISRSSEDRIKVIIL-LTDGEGYYDNNLTT 612 >UniRef50_Q97HZ9 Predicted metal-dependent peptidase n=1 Tax=Clostridium acetobutylicum RepID=Q97HZ9_CLOAB Length = 456 Score = 47.8 bits (112), Expect = 7e-04, Method: Composition-based stats. Identities = 35/215 (16%), Positives = 63/215 (29%), Gaps = 31/215 (14%) Query: 140 NQQRQLTEYKTHRAGYTANGVPANISVVRSLQNSLARRTAMTAGKRRELHALEE--NLAI 197 ++ + G I L+N + R +A +R L++ A Sbjct: 200 KNFDEM-NIHKTWSESYNRGYENQIDE---LKNKIIRNSAKGRIPKRVQEYLDDMNKKAE 255 Query: 198 ISNSEPAQLLEEERLRKEIAELRAKIERVPFIDTFDLRYKNYEKRPDPSSQAVMFCLMDV 257 IS + + + K R P+ DLR K + + +D+ Sbjct: 256 ISWQMYLKKAIGTLPKGYKKTITRKDRRQPY--RMDLRGKLSDHIIK------IVVAIDI 307 Query: 258 SGSMDQSTKDMAKRFYILLYLFLSRTYKNVEVVYIRHHTQAKEV-----DEHEFFYSQET 312 SGSM + D A + KN E+ I + + Sbjct: 308 SGSMTDAEIDAAMTEIFDIL-----KNKNYELTIIECDNIVRRMYRVSKPRDMKKKLDTK 362 Query: 313 GGTIVSSALKLMDEVVKERYNPAQWNIYAAQASDG 347 GGT S + + + + + +DG Sbjct: 363 GGTSFSPVFEYLHK-------NRMEDCFLIYFTDG 390 >UniRef50_B4X134 Vault protein inter-alpha-trypsin n=1 Tax=Alcanivorax sp. DG881 RepID=B4X134_9GAMM Length = 657 Score = 47.8 bits (112), Expect = 7e-04, Method: Composition-based stats. Identities = 51/318 (16%), Positives = 89/318 (27%), Gaps = 36/318 (11%) Query: 131 DLALP-NLKQNQQRQLTEYKTH----RAGYTANG--VPANISVVRSLQNSLARRTAMTAG 183 L LP L T R A G A + V ++ AR + + Sbjct: 168 QLRLPLTLTPRFTPPTEAPHTLDSLLRNTVAAPGGTADAGTASVHIDLDAGARLATLGSP 227 Query: 184 KRRELHALEENLAIISNSEPAQLLEEERLRKEIAELRAKIERVPFIDTFD-LRYKNYEKR 242 + I+ A ++ + L E + F + D Y Sbjct: 228 SHAIHYQRHGRRYTITPKAGAIAMDRDLLLNWELEDTGEPLVTRFHEEIDGEHYALLMVV 287 Query: 243 PDPSSQAVMF-----CLMDVSGSMDQSTKDMAKRFYILLYLFLSRTYKNVEVVYIRHHTQ 297 P + Q ++D SGSM + AK L L + + HT Sbjct: 288 PPKTGQVTALPRETLFIIDSSGSMGGAPMRQAKASLHLALQRLKPGDRFNITDFDSQHTL 347 Query: 298 AKEVD----------EHEF-FYSQETGGTIVSSALKLMDEVVKERYNPAQWNIYAAQASD 346 E +F Q +GGT + A L + + + +D Sbjct: 348 LFETPVTVSDNSRQQAQDFVDGLQASGGTHMLPA--LSATLSQPAS--DGYLRQVIFITD 403 Query: 347 GDNWADDSPLCHEILAKKLLPVVRYYSYIEITRRAHQTLWREYEHLQSTFDNFAMQHIRD 406 G + L ++L R ++ + + F + +I D Sbjct: 404 G--AVGNESGIFRALHQQLGE-ARLFTVGI-----GSAPNSHFMTRAAQFGRGSFTYIND 455 Query: 407 QDDIYPVFRELFHKQNAT 424 Q+ + LF + + Sbjct: 456 QNQVQQGMDTLFRRLESP 473 >UniRef50_UPI0001C38CFE von Willebrand factor, type A n=1 Tax=Arthrospira platensis str. Paraca RepID=UPI0001C38CFE Length = 489 Score = 47.8 bits (112), Expect = 7e-04, Method: Composition-based stats. Identities = 24/143 (16%), Positives = 48/143 (33%), Gaps = 22/143 (15%) Query: 243 PDPSSQAVMFCLMDVSGSMDQSTKDMAKRFYILLYLFLSRTYKNVEVVYIRHHTQAKEVD 302 P + + L+D SGSM S +R + K ++ + ++A V Sbjct: 49 TKPQA---VVMLIDTSGSMSGSKLPEVQRAASEFVS--RQNLKRDDLAVVEFSSRASVVA 103 Query: 303 ---------EHEFFYSQETGGTIVSSALKLMDEVVKERYNPAQWNIYAAQASDGDNWADD 353 + GGT +S L V++ P NI +DG+ + Sbjct: 104 DFTRDERELQQAIARLSAWGGTNLSEGFNLATSVLQNSDRPG--NILLF--TDGE---PN 156 Query: 354 SPLCHEILAKKLLPV-VRYYSYI 375 + +A+++ + + Sbjct: 157 NRRMAASIAQQIRASGINLVAVG 179 >UniRef50_C9LLI0 Magnesium-chelatase, subunit D/I family n=1 Tax=Dialister invisus DSM 15470 RepID=C9LLI0_9FIRM Length = 640 Score = 47.8 bits (112), Expect = 8e-04, Method: Composition-based stats. Identities = 52/357 (14%), Positives = 109/357 (30%), Gaps = 48/357 (13%) Query: 67 GRGGLRHRVHPGNDHFVQNDRIERPQGGGGGSGSGQGQASQDGEGQDEFVFQISKD-EYL 125 +++ D E P G G G + D D F D E + Sbjct: 296 DPDKNNTEKDQESENNDPGDDGEDPSGNHIVEAMGNG-GNNDESSSDMPEFPQGADDEKV 354 Query: 126 DLLFEDLALPNL-KQNQQRQLTEYKTHRAGYTANGVPANISVVRSLQNSLARRTAMTAGK 184 D + LP L QN+++Q T + + T + R K Sbjct: 355 DSADLHVTLPPLWIQNEKKQFTPKGSGKRHITRS------------DERQGRYVKAGIPK 402 Query: 185 RRELHALEENLAIISNSEPAQLLEEERLRKEIAELRAKIERVPFIDTFDLRYKNYEKRPD 244 + A + + P Q + + I D+R ++R Sbjct: 403 GETHDIAID--ATLRAAAPHQKGRQSNGCAVV------------IRHEDIR---RKEREK 445 Query: 245 PSSQAVMFCLMDVSGSMDQSTKDMAKR---FYILLYLFLSRT------YKNVEVVYIRHH 295 + + L+D SGSM + A + F +L + R ++ + Sbjct: 446 RTGN-IFLFLVDASGSMGARERMKAVKGVVFKMLADAYQKRDRVGMIAFRRDRAEVLLPI 504 Query: 296 TQAKEVDEHEFFYSQETGGTIVSSALKLMDEVVKERYNPA-QWNIYAAQASDG---DNWA 351 T++ E + + G T ++ L ++++ Y + +DG ++ Sbjct: 505 TRSIEFAQKKLAALPTGGKTPLAQGLIKAEDMLDRLYKQDPLQDPVLILITDGRATNSLN 564 Query: 352 DDSPLCHEIL--AKKLLPVVRYYSYIEITRRAHQTLWREYEHLQSTFDNFAMQHIRD 406 ++ + L A+++ + I+ + + + F + I + Sbjct: 565 KNTDPVRDALSEAERIGHRHMLAAVIDTESGFIKLGLAKELAQKMGASYFHVDKISE 621 >UniRef50_A6GAI6 Putative uncharacterized protein n=1 Tax=Plesiocystis pacifica SIR-1 RepID=A6GAI6_9DELT Length = 560 Score = 47.5 bits (111), Expect = 9e-04, Method: Composition-based stats. Identities = 21/119 (17%), Positives = 36/119 (30%), Gaps = 17/119 (14%) Query: 241 KRPDPSSQAVMFCLMDVSGSMDQSTKDMAKRFYILLYLFLSRTYKNVEVV---------- 290 P + +D SGSM + ++ + + L V + Sbjct: 213 PEERPPMNVTLV--LDTSGSMAGTPIELLRETSRAIAAQLKLG-DTVSICEWDTSNDWTL 269 Query: 291 --YIRHHTQAKEVDEHEFFYSQETGGTIVSSALKLMDEVVKERYNPAQWNIYAAQASDG 347 Y E+ + GGT + L+ E+ + Y+P N SDG Sbjct: 270 AGYAV-TGPNDELLLEKINDVVHGGGTNLYGGLESGYELAQMVYDPDAINRLVL-ISDG 326 >UniRef50_B5JCH3 Putative uncharacterized protein (Fragment) n=1 Tax=Octadecabacter antarcticus 307 RepID=B5JCH3_9RHOB Length = 197 Score = 47.1 bits (110), Expect = 0.001, Method: Composition-based stats. Identities = 18/97 (18%), Positives = 33/97 (34%), Gaps = 7/97 (7%) Query: 170 LQNSLARRTAMTAGKRRELHALEENLAIISNSEPAQLLEEERLRKEIAELRAKIERVPFI 229 +++SL R + +EE + + PA E R I + Sbjct: 108 IRSSLTR----GQLPPTDAVRIEEMINYFPYAYPAPEGE-APFRPTINVFETPWNADTQL 162 Query: 230 DTFDLRYKNYEKRPDPSSQAVMFCLMDVSGSMDQSTK 266 ++ + P + L+D SGSM+ + K Sbjct: 163 VHIGIQGEMPAIEDRPPLN--LVFLIDTSGSMESADK 197 >UniRef50_D2BAS2 von Willebrand factor type A domain protein n=12 Tax=Actinomycetales RepID=D2BAS2_STRRD Length = 490 Score = 47.1 bits (110), Expect = 0.001, Method: Composition-based stats. Identities = 39/218 (17%), Positives = 70/218 (32%), Gaps = 33/218 (15%) Query: 225 RVPFIDTFDLRYKNYEKRPDPSSQ--AVMFCLMDVSGSMDQSTK-DMAKRFYILLYLFLS 281 R+P T +R ++ +P ++ A + ++DVSGSM + + D+ + L L Sbjct: 122 RMPENGTALIRVGLQTRKAEPEARRPANLTFVVDVSGSMGEPGRLDLVREALHKLVDQLG 181 Query: 282 RTYKNVEVVYIRHHTQAKEV-----------DEHEFFYSQETGGTIVSSALKLMDEVVKE 330 V +V TQA+ V T + + L Sbjct: 182 PG-DQVSIV--AFSTQARLVLSMTPATGRDQLHAAIDRLGVEDSTNLETGLTAGYAEAAR 238 Query: 331 RYNPAQWNIYAAQASDGDNWADDSPLCHEILAKKLLPVV----RYYSYIEITRRAHQTLW 386 + PA N SDG A+ + + ++ + R L Sbjct: 239 AFRPAATNRVIL-LSDG--LANTGDTTWQGILDRVAESAGRQITLLCVG-VGRDYGDQLM 294 Query: 387 REYEHLQSTFDNFAMQHIRDQDDIYPVFRELFHKQNAT 424 + + + DD R++F +Q AT Sbjct: 295 EQLADNGDGAAVY----VSSADD----ARKVFVEQLAT 324 >UniRef50_Q23FU3 von Willebrand factor type A domain containing protein n=1 Tax=Tetrahymena thermophila SB210 RepID=Q23FU3_TETTH Length = 755 Score = 47.1 bits (110), Expect = 0.001, Method: Composition-based stats. Identities = 25/146 (17%), Positives = 52/146 (35%), Gaps = 24/146 (16%) Query: 251 MFCLMDVSGSMDQSTKDMAKRFYILLYLFLSRTYKNVEVVYIRHHTQAK------EVDEH 304 + CL+D SGSM + ++ L L + ++ + + AK +V++ Sbjct: 50 IICLIDNSGSMAGKKAQLVRKSLKYLLKILEKGD---QISLVSFSSTAKTLCPLTQVNDE 106 Query: 305 -------EFFYSQETGGTIVSSALKLMDEVVKERYNPAQWNIYAAQASDGDNWADDSPLC 357 GGT V K + +++ R + + +DG+ DS Sbjct: 107 NKQQIKSAIKQINGQGGTFVIPGFKEVTKILNSR-KEQREQTFILLLTDGEFGDIDSGKV 165 Query: 358 HEILAK-------KLLPVVRYYSYIE 376 + + + + P + Y Y + Sbjct: 166 IQNINRLFTQSEIQKTPYIYTYGYGD 191 >UniRef50_A5B5Z1 Putative uncharacterized protein n=2 Tax=Vitis vinifera RepID=A5B5Z1_VITVI Length = 686 Score = 46.7 bits (109), Expect = 0.002, Method: Composition-based stats. Identities = 31/141 (21%), Positives = 52/141 (36%), Gaps = 22/141 (15%) Query: 251 MFCLMDVSGSMDQSTKDMAKRFYILLYLFLSRTYKNVEVVYI----------RHHTQAKE 300 + ++DVSGSM S + KR L L + + V + R +E Sbjct: 205 LVAVLDVSGSMAGSKLSLLKRAVCFLIQNLGPSDRLSIVSFSSTARRIFPLRRMSDNGRE 264 Query: 301 VDEHEFFYSQETGGTIVSSALKLMDEVVKERYNPAQWNIYAAQA--SDG-DNWADDS--- 354 +GGT + LK V++ER ++ N A+ SDG D + D+ Sbjct: 265 AAGLAINSLXSSGGTNIVEGLKKGVRVLEER---SEQNPVASIILLSDGKDTYNCDNVNR 321 Query: 355 ---PLCHEILAKKLLPVVRYY 372 C +++L + Sbjct: 322 RQTSHCASSNPRQVLEYLNLL 342 >UniRef50_A0EHG0 Chromosome undetermined scaffold_97, whole genome shotgun sequence n=2 Tax=Paramecium tetraurelia RepID=A0EHG0_PARTE Length = 533 Score = 46.7 bits (109), Expect = 0.002, Method: Composition-based stats. Identities = 23/125 (18%), Positives = 46/125 (36%), Gaps = 16/125 (12%) Query: 251 MFCLMDVSGSMDQSTKDMAKRFYILLYLFLSRTYKNVEVVY---IRHHTQAKEVDEH--- 304 + CL+D SGSM + ++ + FL + +++ + T+ V + Sbjct: 124 LVCLIDHSGSMQGEKIKLVRKTLKQMLTFLQPCDRLCLIMFDCKVYRLTRLMRVTQENVQ 183 Query: 305 ----EFFYSQETGGTIVSSALKLMDEVVKERYNPAQWNIY--AAQASDGDNWADDSPLCH 358 Q GGT + + +K+ ++K R N SDG + + Sbjct: 184 KFRVAISSLQARGGTDIGNGMKMALSILKHR---KYKNPVSAIFLLSDGVDEGAE-ERVR 239 Query: 359 EILAK 363 + L + Sbjct: 240 DDLIQ 244 >UniRef50_A0LHW4 Vault protein inter-alpha-trypsin domain protein n=3 Tax=Deltaproteobacteria RepID=A0LHW4_SYNFM Length = 812 Score = 46.3 bits (108), Expect = 0.002, Method: Composition-based stats. Identities = 31/182 (17%), Positives = 54/182 (29%), Gaps = 20/182 (10%) Query: 252 FCLMDVSGSMDQSTKDMAKRFYILLYLFLSRT-------YKNVEVVYIRHH--TQAKEVD 302 ++DVSGSM +++KR L L T + V A V Sbjct: 335 IFIVDVSGSMHGFPLEISKRLLTDLIGGLKPTDCFNVMLFSGDSTVMAERSVPASADNVR 394 Query: 303 E--HEFFYSQETGGTIVSSALKLMDEVVKERYNPAQWNIYAAQASDGDNWADDSPLCHEI 360 Q GGT + ALK + ++ + A+DG + E Sbjct: 395 RAVEMIGRRQGGGGTELLPALKKALSLPRKE----GVSRSMVIATDG--FVTVEEEAFE- 447 Query: 361 LAKKLLPVVRYYSYIEITRRAHQTLWREYEHLQSTFDNFAMQHIRDQDDIYPVFRELFHK 420 L + + ++ + I ++ L + F + + FR Sbjct: 448 LIRSHIGDANFFPFG-IGTSVNRMLIEGMARAGAGEP-FVITRPDEAPAGAEKFRRYIQS 505 Query: 421 QN 422 Sbjct: 506 PL 507 >UniRef50_Q6ABM1 Magnesium-chelatase 67 kDa subunit n=4 Tax=Actinomycetales RepID=Q6ABM1_PROAC Length = 654 Score = 46.3 bits (108), Expect = 0.002, Method: Composition-based stats. Identities = 35/201 (17%), Positives = 67/201 (33%), Gaps = 13/201 (6%) Query: 138 KQNQQRQLTEYKTHRAGYTANGVPANISVVRSLQNSLARRTAMTAGKRRELHALEENLAI 197 +Q + + AG P S R + + RR + RR + Sbjct: 350 DPRKQPSGSGEQVVAAGDPFAVRPLEPSQDRFARRACGRRLRTRSNDRRGRYVSARPTDR 409 Query: 198 ISNSEPAQLLEEERLRKEIAELRAKIERVPFIDTFDLRYKNYEKRPDPSSQAVMFCLMDV 257 + L + ++ + + + D R K R + + + ++D Sbjct: 410 PDDLALDATLRAAAVHQKSRRATERPDLAVHVKPIDWRAKVRAGR----AASCVIFVVDA 465 Query: 258 SGSMDQSTKDMAKR---FYILLYLFLSRT------YKNVEVVYIRHHTQAKEVDEHEFFY 308 SGSM + A + +LL ++ R ++ + T + EV +H Sbjct: 466 SGSMGSRGRMTASKGAVLSLLLDAYVKRDRVCLIGFRRDRAEVLVPVTSSVEVAQHGLAE 525 Query: 309 SQETGGTIVSSALKLMDEVVK 329 G T +S+ L EVV+ Sbjct: 526 LPVGGRTPLSAGLIKACEVVR 546 >UniRef50_UPI0001555D4A PREDICTED: hypothetical protein, partial n=1 Tax=Ornithorhynchus anatinus RepID=UPI0001555D4A Length = 397 Score = 45.9 bits (107), Expect = 0.003, Method: Composition-based stats. Identities = 19/103 (18%), Positives = 39/103 (37%), Gaps = 4/103 (3%) Query: 27 KAQIKQSISEAINKRSVTDVDSGESVSIPTEDISEPMFHQGRGGLRHRVHPGNDHFVQND 86 K +IK + ++ + + ++ V P + +P G + H G Sbjct: 294 KNRIKNNCNDIVTEEDTEELQRKRKV--PRPLLDKPG--DGPVRMSTLTHRGRRPEGSKA 349 Query: 87 RIERPQGGGGGSGSGQGQASQDGEGQDEFVFQISKDEYLDLLF 129 +R +G GS + + D +G D +++ E + LL Sbjct: 350 LEKRVEGEEAGSWAQGPGSGNDIDGGDSKDIRLTLMEEVLLLG 392 >UniRef50_A0E9B3 Chromosome undetermined scaffold_84, whole genome shotgun sequence n=1 Tax=Paramecium tetraurelia RepID=A0E9B3_PARTE Length = 603 Score = 45.5 bits (106), Expect = 0.004, Method: Composition-based stats. Identities = 25/155 (16%), Positives = 52/155 (33%), Gaps = 15/155 (9%) Query: 251 MFCLMDVSGSMDQSTKDMAKRFYILLYLFLSRTYK----------NVEVVYIRHHTQAKE 300 + C++DVSGSM ++ K L L + ++ +IR+ + K Sbjct: 122 LVCVVDVSGSMIGRKINLVKDSLRYLMKILGPEDRICIIVFTTVAHIVTSFIRNTQENKP 181 Query: 301 VDEHEFFYSQETGGTIVSSALKLMDEVVKERYNPAQWNIYAAQASDG-DNWADDSPLCHE 359 + + + T +S + ++K R + SDG D++ + Sbjct: 182 LLKKAILELKGLASTNISDGMNKALWMLKNRKYKNPVSC-IFLLSDGQDDYKGAEQRVFD 240 Query: 360 ILAKKLLP---VVRYYSYIEITRRAHQTLWREYEH 391 L + V+ + Y + +Y Sbjct: 241 QLQLLKIEEKFVIHTFGYGQDHDAYVMNQIAKYRE 275 >UniRef50_C7DHI4 von Willebrand factor type A n=1 Tax=Candidatus Micrarchaeum acidiphilum ARMAN-2 RepID=C7DHI4_9EURY Length = 705 Score = 45.5 bits (106), Expect = 0.004, Method: Composition-based stats. Identities = 29/148 (19%), Positives = 56/148 (37%), Gaps = 7/148 (4%) Query: 223 IERVPFIDTFDLRY-KNYEKRPDPSSQAVMFCLMDVSGSMDQSTKDMAKRFYILLYLFLS 281 + + + T D Y R + A ++ L+D+SGSM + AKR ++ L Sbjct: 499 RKLIRYEVTKDPAYISKPYLRHIKDAGAEIWMLLDISGSMGGQKINAAKRILGSIHDSLD 558 Query: 282 -RTYKNVEV--VYIRHHTQAKEVDEHEFFYSQETGGTIVSSALKLMDEVVKERYNPAQWN 338 Y ++ + Y T E D G T A+ +++K+ + + ++ Sbjct: 559 GSKYVHLRMFGFYGSDGTHVFEFDRKMLMNLAAMGDTPTDIAIYYAMDLMKK--DKSNFD 616 Query: 339 IYAAQASDGD-NWADDSPLCHEILAKKL 365 +DGD N ++ L + Sbjct: 617 KTLFIITDGDPNNGQETKNALNSLKNAM 644 >UniRef50_A9B2Y1 VWA containing CoxE family protein n=4 Tax=Bacteria RepID=A9B2Y1_HERA2 Length = 460 Score = 45.1 bits (105), Expect = 0.004, Method: Composition-based stats. Identities = 52/351 (14%), Positives = 103/351 (29%), Gaps = 49/351 (13%) Query: 85 NDRIERPQGGGGGSGSGQGQASQDGEGQDEFVFQISKDEYLDLLFEDLALPNLKQNQQRQ 144 + + G G G F +S++E ++ + L +K+ R+ Sbjct: 126 GFQPGALRQSQPGQGQASQPGGLQGGQGVGSGFNLSEEELRQVI-QGLEKDLIKRMALRE 184 Query: 145 LTEYKTHRAGYTANGVPA-------NISVVRSLQNSL---ARRTAMTAGKRRELHALEEN 194 + + A P+ N+L R + ++ L+ Sbjct: 185 VLQD----NRLAAQLTPSMAVVEQLLRDKSHLSGNALINAKRLIKQYVDELADVLRLQVM 240 Query: 195 LAIISNSE----PAQLLEEERLRKEIAELRAKIERVPFIDTFD-LRYKNYEKRPDPSSQA 249 A+ + + P ++ L++ I D L Y+ K+ P Sbjct: 241 QAVSAKIDRSVPPKRVFRNLDLKRTIWRNLTNWNSNEGRLYVDRLYYRQTAKKRTPMRMI 300 Query: 250 VMFCLMDVSGSMDQ---STKDMAKRFYILLYLFLSRTYKNVEVVYIRHHTQAKEVDE--- 303 V+ D SGSM +A F L +V++ I T+ ++ Sbjct: 301 VVV---DQSGSMVDAMVQCTILASIFAGL---------PHVDMHLIAFDTRMLDLTPWVH 348 Query: 304 ---HEFFYSQETGGTIVSSALKLMDEVVKERYNPAQWNIYA---AQASDGDNWADDSPLC 357 +Q GGT ++ AL E ++E P + + D D+ Sbjct: 349 DPFEVLLRTQLGGGTSINEALLFASEKIQE---PRKTAVVLITDFYEGGSDQVLLDTIKA 405 Query: 358 HEILAKKLLPVVRYY--SYIEITRRAHQTLWREYEHLQSTFDNFAMQHIRD 406 +PV Y + L + + ++ I+ Sbjct: 406 MIESGVHFIPVGAVTSSGYFSVNDWFRTKLKEMGRPIFAGSPRKLIEQIKQ 456 >UniRef50_UPI0000E460BF PREDICTED: similar to inter-alpha-trypsin inhibitor heavy chain3 n=5 Tax=Strongylocentrotus purpuratus RepID=UPI0000E460BF Length = 1028 Score = 45.1 bits (105), Expect = 0.005, Method: Composition-based stats. Identities = 19/122 (15%), Positives = 45/122 (36%), Gaps = 14/122 (11%) Query: 221 AKIERVPFIDTFDLRYKNYEKRPDPSSQAVMFCLMDVSGSMDQSTKDMAKRFYILLYLFL 280 K + +D + + Y + + P+++ + ++DVSGSM KR + + + Sbjct: 283 KKAGHLEIVDGYFVHYFSPDG--LPNTRKNVIFVIDVSGSMYGQKTRQTKRAFTTILDDV 340 Query: 281 SRTYKNVEVVYIRHHTQAKE------------VDEHEFFYSQETGGTIVSSALKLMDEVV 328 + +++ + +E + GGT + +L E++ Sbjct: 341 RPIDRINIILFSSYAHVWREDQMVEATSDNIAAAKRHVNGLSVGGGTNIYDSLMKAVEIL 400 Query: 329 KE 330 E Sbjct: 401 LE 402 >UniRef50_A1AQS2 Protoporphyrin IX magnesium-chelatase n=1 Tax=Pelobacter propionicus DSM 2379 RepID=A1AQS2_PELPD Length = 617 Score = 45.1 bits (105), Expect = 0.005, Method: Composition-based stats. Identities = 33/195 (16%), Positives = 63/195 (32%), Gaps = 18/195 (9%) Query: 148 YKTHRAGYTANGVPANISVVRSLQNSLARRTAMTAGKRRELHALEENLAIISNSEPAQLL 207 R G P + S + +R A R + S + Sbjct: 332 ESAQREEIMGVGAPFKL-RRLSFRKDRRKRQANGRRTRTRIKGRGGRYVKSLLSSTEHDI 390 Query: 208 EEERLRKEIAELRAKIERVPF--IDTFDLRYKNYEKRPDPSSQAVMFCLMDVSGSMDQST 265 + + A + R I+ DLR++ E+R ++ ++D SGSM Sbjct: 391 AIDATLRACAPFQKARNRQGMLKIEQDDLRFRQRERRM----GHLVLFVVDGSGSMGARQ 446 Query: 266 KDMAKR---FYILLYLFLSRTYKNVEVVY-------IRHHTQAKEVDEHEFFYSQETGGT 315 + M + +LL + + K +V+ + T + E+ G T Sbjct: 447 RMMETKGAVQSLLLDCY-QKRDKVAMIVFRKDRAELVLPPTASVELAARRLAELPVGGKT 505 Query: 316 IVSSALKLMDEVVKE 330 ++S L +V+ Sbjct: 506 PLASGLLKTHRLVRR 520 >UniRef50_A0CK50 Chromosome undetermined scaffold_2, whole genome shotgun sequence n=2 Tax=Paramecium tetraurelia RepID=A0CK50_PARTE Length = 1015 Score = 44.8 bits (104), Expect = 0.006, Method: Composition-based stats. Identities = 31/146 (21%), Positives = 56/146 (38%), Gaps = 18/146 (12%) Query: 225 RVPFIDTFDLRYKNYEKRPDPSSQAV--MFCLMDVSGSMDQSTKDMAKRFYILLYLFLSR 282 ++PFI+ + + A+ + ++D SGSM + L F ++ Sbjct: 10 KIPFIEQSQDQSTKKGDKKGKEGNAILTIIGVIDASGSMSG-CWE-------WLSDFWNQ 61 Query: 283 TYKNVEVVYIRHHTQAKEVDEHEFFYS---QETGGTIVSSALKLMDEVVKERYNPAQWNI 339 + ++ I T+ K E GGT + A + M+ +++ P Q NI Sbjct: 62 SIPKENLITITFDTRQKISAEGVLSKRIKDHGGGGTEIVPAFQTMETELQK--VPIQNNI 119 Query: 340 YAAQASDGDNWADDSPLCHEILAKKL 365 SDG D++ + KKL Sbjct: 120 TVIFISDG---QDNNVRTIDERMKKL 142 >UniRef50_C4G1K3 Putative uncharacterized protein n=1 Tax=Abiotrophia defectiva ATCC 49176 RepID=C4G1K3_ABIDE Length = 1659 Score = 44.4 bits (103), Expect = 0.008, Method: Composition-based stats. Identities = 29/201 (14%), Positives = 62/201 (30%), Gaps = 11/201 (5%) Query: 164 ISVVRSLQNSLARRTAMTAGKRRELHALEENLAIISNSEPAQLLEEERLRKEIAELRAKI 223 S + +++ LAR + + + E++ I++ + Sbjct: 2 RSERKMVRSLLARFMVLMMVINLLGGINPSAVKAGPDEYYKNGSEKQENGVTISKKVTRY 61 Query: 224 ERVPFIDTFDLRYKNYEKRPDPSSQAVMFCLMDVSGSMDQSTKDMAKRFYILLYLFLSRT 283 +L+ K + + + +MD SGSM+ + + AK+ L Sbjct: 62 NAADGTYDIELKVKGSTEVVQNNKILDIVLVMDTSGSMEGKSLENAKKAANNFVDKLLPQ 121 Query: 284 YKNVEVVYIR-------HHTQAKEVD--EHEFFYSQETGGTIVSSALKLMDEVVKERYNP 334 NV + + + V ++ + GGT L+ V+ P Sbjct: 122 NNNVNIGIVSFAEKGEIKSGLTRNVTTLKNAIKGLKADGGTYTQQGLEKAATVLNG--AP 179 Query: 335 AQWNIYAAQASDGDNWADDSP 355 A+ DG+ + Sbjct: 180 AEHKKVMVVIGDGEPTYANGE 200 >UniRef50_C6VUB8 von Willebrand factor type A n=1 Tax=Dyadobacter fermentans DSM 18053 RepID=C6VUB8_DYAFD Length = 935 Score = 44.4 bits (103), Expect = 0.009, Method: Composition-based stats. Identities = 26/108 (24%), Positives = 41/108 (37%), Gaps = 12/108 (11%) Query: 251 MFCLMDVSGSMDQSTK-DMAKRFYILLYLFLSRT-------YKNVEVVYIRHHTQAK--E 300 M L+DVS SM+ K + KR L + Y V ++ + AK E Sbjct: 759 MVLLLDVSSSMNSPYKMPLLKRSIKSLLTLVRPEDMISIVLYSGKARVVLKPTSGAKASE 818 Query: 301 VDEHEFFYSQETGGTIVSSALKLMDEVVKERYNPAQWNIYAAQASDGD 348 + Q G T + +KL + ++Y N A+DG+ Sbjct: 819 ISRM-IDLLQSDGDTDGNEGIKLAYKTANKQYIRGGNNRIVL-ATDGE 864 >UniRef50_C3NM85 von Willebrand factor type A n=14 Tax=Sulfolobaceae RepID=C3NM85_SULIN Length = 452 Score = 44.0 bits (102), Expect = 0.010, Method: Composition-based stats. Identities = 28/121 (23%), Positives = 47/121 (38%), Gaps = 14/121 (11%) Query: 241 KRPDPSSQAVMFCLMDVSGSMDQSTKDMAKRFYILLYL--------FLSRTYKNVEVVYI 292 ++ + ++ L+D SGSMD AK + LY F R + N+ I Sbjct: 280 QKQIRETLGPIYLLLDKSGSMDGEKILWAKAVALALYSRAKRENRDFYLRFFDNIPYPLI 339 Query: 293 RHHTQAKEVD----EHEFFYSQETGGTIVSSALKLMDEVVKERYNPAQWNIYAAQASDGD 348 + AK D + GGT +S ++ E +KE + I +DG+ Sbjct: 340 KVQKNAKSKDIIKMVEYIGKIRGGGGTDISRSIISACEDIKEGHVKGVSEIILL--TDGE 397 Query: 349 N 349 + Sbjct: 398 D 398 >UniRef50_A9G8C3 Putative uncharacterized protein n=2 Tax=Sorangium cellulosum 'So ce 56' RepID=A9G8C3_SORC5 Length = 907 Score = 43.6 bits (101), Expect = 0.014, Method: Composition-based stats. Identities = 34/188 (18%), Positives = 52/188 (27%), Gaps = 17/188 (9%) Query: 172 NSLARRTAMTAGKRRELHALEENLAIISNSEPAQLLEEERLRKEIAELRAKIERVPFIDT 231 + A E + + A + P L + LR ++ F Sbjct: 427 AAFEAALARGVVPAAERELVGDVAARYAPEVPLALDKALGLRADLERAALGPGGGAFHLR 486 Query: 232 FDLRYKNYEKRPDPSSQAVMFCLMDVSGSMDQSTKDMAKRFYILLYLFLSRTYKNVEVVY 291 LR P + +D SGSM + D A+R L L+ + Sbjct: 487 LALRSAAAAAAARPHLSVHLV--LDTSGSMAGAPIDSARRAAQALVDRLAPADDFSLTTF 544 Query: 292 IRHHTQAKEVDEH------------EFFYSQETGGTIVSSALKLMDEVVKERYNPAQWNI 339 + A+ V E +E GGT + + L L P Sbjct: 545 ---SSDAEVVIEDGPVGPRRAAIRRAIEGLREGGGTNIGAGLSLGYAQASRPGIPEDAVR 601 Query: 340 YAAQASDG 347 SDG Sbjct: 602 VVLLVSDG 609 >UniRef50_A5FL88 Putative uncharacterized protein n=2 Tax=Flavobacterium RepID=A5FL88_FLAJ1 Length = 1111 Score = 43.2 bits (100), Expect = 0.020, Method: Composition-based stats. Identities = 34/166 (20%), Positives = 54/166 (32%), Gaps = 9/166 (5%) Query: 65 HQGRGGLRHRVHPG---NDHFVQNDRIERPQGGGGGSGSGQGQASQDGEGQDEFVFQISK 121 G G G N +N + + QG G G +DGEG E + +I K Sbjct: 928 KSGEGKQGQDSGQGKEGNGKEGKNGQGKNGQGSKEGGKGKDGNEGEDGEGDAEKIMEIYK 987 Query: 122 DE--YLDLLFEDLALPNLKQNQQRQLTEYKTHRAGYTANGVPANISVVRSL---QNSLAR 176 ++ + L ++LA L + + + K G N ++ R L Q L Sbjct: 988 EQVKLREALQKELAKKGLDAQGRSAIEQMKASEKQILNKG-FKNENLQRILNIQQELLKL 1046 Query: 177 RTAMTAGKRRELHALEENLAIISNSEPAQLLEEERLRKEIAELRAK 222 A+ + E N SN A + L + Sbjct: 1047 NNAVQEQGQDTKRQSETNKTEFSNRSNALPSSLTDYLNSVEILNRQ 1092 >UniRef50_B4S8S0 Magnesium chelatase ATPase subunit D n=3 Tax=Chlorobiaceae RepID=B4S8S0_PROA2 Length = 619 Score = 42.8 bits (99), Expect = 0.022, Method: Composition-based stats. Identities = 33/185 (17%), Positives = 70/185 (37%), Gaps = 17/185 (9%) Query: 176 RRTAMTAGKRRELHALEENLAIISNSEPAQLLEEERLRKE--IAELRAKIERVP---FID 230 R A+ + R + + + + L+ ++ + LR + I+ Sbjct: 356 RGEALNNRRGRFVRSQPGEIRGGKVALIPTLISAAPWQESRRLERLRKTGKVSTTGLIIN 415 Query: 231 TFDLRYKNYEKRPDPSSQAVMFCLMDVSGSMDQSTKDMAK--RFYILLYLFLSR------ 282 D++ K + + S + ++D SGSM + AK ++L ++ R Sbjct: 416 KEDVKVKKFRDK----SGTLFIFIVDASGSMALNRMRQAKGAVSHLLQNAYVHRDQVALI 471 Query: 283 TYKNVEVVYIRHHTQAKEVDEHEFFYSQETGGTIVSSALKLMDEVVKERYNPAQWNIYAA 342 +++ E + +Q+ + + E GGT ++SA+ L E K+ I Sbjct: 472 SFRGKEAQLLLPPSQSVDRAKRELDVLPTGGGTPLASAIYLAWETAKQARTKGVSQIMFV 531 Query: 343 QASDG 347 +DG Sbjct: 532 LITDG 536 >UniRef50_B1I3V2 Magnesium chelatase n=4 Tax=cellular organisms RepID=B1I3V2_DESAP Length = 670 Score = 42.4 bits (98), Expect = 0.034, Method: Composition-based stats. Identities = 29/174 (16%), Positives = 55/174 (31%), Gaps = 20/174 (11%) Query: 168 RSLQNSLARRTAMTAGKRRELHALEENLAIISNSEPAQLLEEERLRKEIAELRAKIERVP 227 R L+ RR+ + + E L + LR + + R Sbjct: 410 RVLRKGSGRRSRTRTPTKAG-----RYVRATLRRERDDLAFDATLRAAAP-FQKQRARDG 463 Query: 228 FIDTFDLRYKNYEKRPDPSSQAVMFCLMDVSGSMDQSTKDMAKR---FYILLYLFLSRTY 284 + + + R + ++D SGSM + +A + +LL + R Sbjct: 464 VAVAVESQDIREKVREKRIGN-FLVFVVDASGSMGAQQRMVAAKGAVLSLLLDAYQKR-- 520 Query: 285 KNVEVV-YIRHHTQAK-------EVDEHEFFYSQETGGTIVSSALKLMDEVVKE 330 V +V + H + E+ E G T +++ L EV + Sbjct: 521 DRVGMVAFKGEHAEVLLPPTNSVELAERRLAELPTGGRTPLAAGLLKAYEVARA 574 >UniRef50_C9LTL4 Magnesium-chelatase, subunit D/I family n=1 Tax=Selenomonas sputigena ATCC 35185 RepID=C9LTL4_9FIRM Length = 657 Score = 41.7 bits (96), Expect = 0.048, Method: Composition-based stats. Identities = 50/296 (16%), Positives = 90/296 (30%), Gaps = 53/296 (17%) Query: 78 GNDHFVQNDRIERPQGGGGGSGSGQGQASQDGEGQDEFVFQISKDEY--LDLLFEDLALP 135 G ++DR E + A D G+D +DE ++ + L+L Sbjct: 341 GESKEQEDDRGESQEKEAQADEEQASPADGDSGGED-------RDETHSIEAVMARLSL- 392 Query: 136 NLKQNQQRQLTEYKTHRAGYTANGVPANISVVRSLQNSLARRTAMTAGKRRELHALEENL 195 R+ + ++G V ++ R + SL R G+R +L Sbjct: 393 ------LRETVCVRKGKSG-RRAIVQLDVPAGRPWRTSLPR-----TGRRIDLAFAATLR 440 Query: 196 AIISNSEPAQLLEEERLRKEIAELRAKIERVPFIDTFDLRYKNYEKRPDPSSQAVMFCLM 255 A + +R E + + R + A + L+ Sbjct: 441 AAAPYQRQRHGEQAVVIRAEDLRVWIRARR---------------------ASANILFLV 479 Query: 256 DVSGSMDQSTK-DMAKRFYILLYLFLSRTYKNVEVVYIRHH--------TQAKEVDEHEF 306 D SGSM + M K + L + V ++ R T++ E+ E Sbjct: 480 DASGSMGAKERMKMVKGAVLALLREAYQKRDRVGLIAFRRTSAETLLPMTRSVELAEKAL 539 Query: 307 FYSQETGGTIVSSALKLMDEVVKERYNPAQWNIYAAQASDG-DNWADDSPLCHEIL 361 G T ++ L +++ E +DG N + E L Sbjct: 540 RSLPTGGKTPLAEGLAAALKMMDELSRKEGAETVLVLVTDGRTNVSAAGKAKEEAL 595 Searching..................................................done Results from round 2 Score E Sequences producing significant alignments: (bits) Value Sequences used in model and found again: UniRef50_Q1C7U5 UPF0229 protein YPA_1510 n=195 Tax=Proteobacteri... 413 e-114 UniRef50_Q1QPM0 UPF0229 protein Nham_0975 n=24 Tax=Proteobacteri... 365 1e-99 UniRef50_Q1QZ69 UPF0229 protein Csal_0882 n=164 Tax=Proteobacter... 362 2e-98 UniRef50_B6B8L1 Putative uncharacterized protein n=2 Tax=Rhodoba... 358 2e-97 UniRef50_P59348 UPF0229 protein bll6755 n=25 Tax=Alphaproteobact... 353 9e-96 UniRef50_A9I2A9 Putative uncharacterized protein n=6 Tax=Burkhol... 351 3e-95 UniRef50_B7H9V9 UPF0229 protein BCB4264_A0587 n=93 Tax=Bacteria ... 345 2e-93 UniRef50_B9DZE4 UPF0229 protein CKR_0568 n=26 Tax=Clostridiales ... 325 3e-87 UniRef50_A7Z2R4 UPF0229 protein RBAM_009260 n=29 Tax=Firmicutes ... 323 8e-87 UniRef50_B9L510 Sporulation protein YhbH n=1 Tax=Thermomicrobium... 322 2e-86 UniRef50_Q3J885 Putative uncharacterized protein n=2 Tax=Nitroso... 317 6e-85 UniRef50_Q896G6 Conserved protein n=1 Tax=Clostridium tetani Rep... 296 1e-78 UniRef50_C9RD74 Putative uncharacterized protein n=1 Tax=Ammonif... 292 2e-77 UniRef50_B5EPS6 Putative uncharacterized protein n=2 Tax=Acidith... 289 2e-76 UniRef50_UPI00016C05A6 hypothetical protein Epulo_00085 n=1 Tax=... 288 2e-76 UniRef50_A9FJ88 Uncharacterized conserved protein involved in st... 284 4e-75 UniRef50_Q67Q87 Putative uncharacterized protein n=1 Tax=Symbiob... 276 1e-72 UniRef50_A4BNQ7 Putative uncharacterized protein n=1 Tax=Nitroco... 263 9e-69 UniRef50_B1KPQ5 von Willebrand factor type A n=8 Tax=Gammaproteo... 252 2e-65 UniRef50_C6M483 von Willebrand factor type A domain protein n=1 ... 251 4e-65 UniRef50_A4BJU7 Putative uncharacterized protein n=1 Tax=Reineke... 250 6e-65 UniRef50_B8F8Z5 von Willebrand factor type A n=1 Tax=Desulfatiba... 248 3e-64 UniRef50_C7PNZ7 von Willebrand factor type A n=5 Tax=Bacteroidet... 248 3e-64 UniRef50_B4DA43 von Willebrand factor type A n=1 Tax=Chthoniobac... 248 4e-64 UniRef50_A4C5H3 Von Willebrand factor type A domain protein n=1 ... 246 1e-63 UniRef50_C6VVX3 von Willebrand factor type A n=17 Tax=Bacteria R... 245 3e-63 UniRef50_C1XT76 Uncharacterized conserved protein n=1 Tax=Meioth... 244 4e-63 UniRef50_A5WCP1 von Willebrand factor, type A n=2 Tax=Moraxellac... 244 5e-63 UniRef50_C5BTM6 von Willebrand factor type A domain protein n=1 ... 243 8e-63 UniRef50_Q7UP85 Putative uncharacterized protein n=1 Tax=Rhodopi... 241 2e-62 UniRef50_A3ZT14 Putative uncharacterized protein n=1 Tax=Blastop... 240 7e-62 UniRef50_C0CVB5 Putative uncharacterized protein n=1 Tax=Clostri... 240 1e-61 UniRef50_Q4KKB4 von Willebrand factor type A domain protein n=20... 239 1e-61 UniRef50_A1ZHE2 Von Willebrand factor type A domain protein n=1 ... 237 5e-61 UniRef50_UPI0001C375AE von Willebrand factor type A n=1 Tax=Rumi... 237 6e-61 UniRef50_A3D1E9 von Willebrand factor, type A n=8 Tax=Shewanella... 236 8e-61 UniRef50_C7N770 Uncharacterized protein containing a von Willebr... 236 1e-60 UniRef50_B0T5X0 von Willebrand factor type A n=1 Tax=Caulobacter... 236 1e-60 UniRef50_B0CCM8 von Willebrand factor, type A domain protein n=4... 235 2e-60 UniRef50_A9KS19 von Willebrand factor type A n=1 Tax=Clostridium... 234 5e-60 UniRef50_C8SEV7 von Willebrand factor type A n=4 Tax=Rhizobiales... 231 4e-59 UniRef50_C0YQB8 von Willebrand factor, type A n=2 Tax=Flavobacte... 230 7e-59 UniRef50_A5F9T1 von Willebrand factor, type A n=9 Tax=Bacteria R... 230 7e-59 UniRef50_A8SXC8 Putative uncharacterized protein n=2 Tax=Clostri... 230 1e-58 UniRef50_A1ZU67 Von Willebrand factor type A domain protein n=1 ... 229 2e-58 UniRef50_A9DBX8 Putative uncharacterized protein n=1 Tax=Hoeflea... 229 2e-58 UniRef50_C6BAR1 von Willebrand factor type A n=7 Tax=Alphaproteo... 228 3e-58 UniRef50_C8N8N5 von Willebrand factor type A domain protein n=2 ... 224 4e-57 UniRef50_Q2N8R4 Von Willebrand factor type A domain protein n=2 ... 223 9e-57 UniRef50_Q72KI5 Glycosyltransferase n=3 Tax=Thermus RepID=Q72KI5... 223 1e-56 UniRef50_P76481 Uncharacterized protein yfbK n=38 Tax=Enterobact... 223 1e-56 UniRef50_A9IU52 Putative lipoprotein n=3 Tax=Bordetella RepID=A9... 222 2e-56 UniRef50_UPI00016C54C8 hypothetical protein GobsU_21830 n=1 Tax=... 222 2e-56 UniRef50_Q9HRD6 UPF0229 protein VNG_0746C n=11 Tax=Halobacteriac... 221 3e-56 UniRef50_Q1GJ99 Putative uncharacterized protein n=1 Tax=Ruegeri... 220 8e-56 UniRef50_A0NVX5 Putative uncharacterized protein n=1 Tax=Labrenz... 220 1e-55 UniRef50_Q21MJ3 von Willebrand factor, type A n=5 Tax=Proteobact... 219 2e-55 UniRef50_A6GHE4 von Willebrand factor, type A n=1 Tax=Plesiocyst... 218 4e-55 UniRef50_B1ZYN3 von Willebrand factor type A n=2 Tax=Bacteria Re... 216 1e-54 UniRef50_Q28U54 von Willebrand factor type A n=3 Tax=Rhodobacter... 213 1e-53 UniRef50_UPI000185CB41 protein containing von Willebrand factor ... 211 3e-53 UniRef50_A3PN61 von Willebrand factor, type A n=9 Tax=Rhodobacte... 211 5e-53 UniRef50_C0Z5D5 Putative uncharacterized protein n=1 Tax=Breviba... 209 1e-52 UniRef50_C8WPE6 von Willebrand factor type A n=1 Tax=Eggerthella... 208 2e-52 UniRef50_C7PR69 von Willebrand factor type A n=1 Tax=Chitinophag... 207 8e-52 UniRef50_B0UJ22 von Willebrand factor type A n=1 Tax=Methylobact... 205 3e-51 UniRef50_B4WCU1 von Willebrand factor type A domain protein n=2 ... 204 4e-51 UniRef50_A9GVG2 Putative uncharacterized protein n=1 Tax=Sorangi... 204 5e-51 UniRef50_D0XSR4 von Willebrand factor type A n=1 Tax=Brevundimon... 203 7e-51 UniRef50_C1AC65 Putative uncharacterized protein n=1 Tax=Gemmati... 199 1e-49 UniRef50_Q1CVN5 von Willebrand factor type A domain protein n=2 ... 196 9e-49 UniRef50_A0MN74 Conserved bacterial protein n=1 Tax=Thermus phag... 194 4e-48 UniRef50_A9NFD9 Surface-anchored VWFA domain protein n=1 Tax=Ach... 194 7e-48 UniRef50_A9AWD1 von Willebrand factor type A n=1 Tax=Herpetosiph... 187 6e-46 UniRef50_Q1D6F9 von Willebrand factor type A domain protein n=2 ... 183 8e-45 UniRef50_A3TQT5 Putative secreted protein n=1 Tax=Janibacter sp.... 178 3e-43 UniRef50_B4D1N7 Autotransporter-associated beta strand repeat pr... 158 4e-37 UniRef50_Q747C5 Conserved domain protein n=11 Tax=Deltaproteobac... 157 8e-37 UniRef50_A7C0I1 von Willebrand factor type A domain protein n=5 ... 153 1e-35 UniRef50_D2BAS2 von Willebrand factor type A domain protein n=12... 153 1e-35 UniRef50_B5JNR2 von Willebrand factor type A domain protein n=1 ... 142 2e-32 UniRef50_A2SP98 Putative uncharacterized protein n=1 Tax=Methyli... 141 5e-32 UniRef50_A6BYV9 Putative uncharacterized protein n=1 Tax=Plancto... 136 2e-30 UniRef50_O28828 Putative uncharacterized protein n=1 Tax=Archaeo... 129 2e-28 UniRef50_UPI00006CDDCC von Willebrand factor type A domain conta... 127 7e-28 UniRef50_A9DKM0 Putative uncharacterized protein (Fragment) n=1 ... 125 4e-27 UniRef50_D1WZ12 VWA containing CoxE family protein n=13 Tax=Stre... 124 5e-27 UniRef50_B4X134 Vault protein inter-alpha-trypsin n=1 Tax=Alcani... 124 8e-27 UniRef50_C9LLI0 Magnesium-chelatase, subunit D/I family n=1 Tax=... 118 4e-25 UniRef50_C8SCW7 Putative uncharacterized protein n=1 Tax=Ferrogl... 114 5e-24 UniRef50_D2VY88 Predicted protein n=1 Tax=Naegleria gruberi RepI... 114 5e-24 UniRef50_A8U1E2 Putative uncharacterized protein n=1 Tax=alpha p... 112 3e-23 UniRef50_D2RGD5 von Willebrand factor type A n=1 Tax=Archaeoglob... 112 3e-23 UniRef50_A6GAI6 Putative uncharacterized protein n=1 Tax=Plesioc... 111 7e-23 UniRef50_A4C730 Putative uncharacterized protein n=1 Tax=Pseudoa... 111 8e-23 UniRef50_B8HNU4 von Willebrand factor type A n=4 Tax=Chroococcal... 110 1e-22 UniRef50_Q55874 Uncharacterized protein sll0103 n=16 Tax=Cyanoba... 110 1e-22 UniRef50_A1ZRP2 Von Willebrand factor type A domain protein n=1 ... 109 2e-22 UniRef50_Q9ZGE6 Magnesium-chelatase 67 kDa subunit n=2 Tax=Helio... 109 2e-22 UniRef50_Q22SJ4 von Willebrand factor type A domain containing p... 108 3e-22 UniRef50_Q9U7P4 Putative uncharacterized protein (Fragment) n=1 ... 108 4e-22 UniRef50_Q231J4 von Willebrand factor type A domain containing p... 107 9e-22 UniRef50_D2EEA7 Putative uncharacterized protein n=1 Tax=Candida... 107 1e-21 UniRef50_B9SJS6 Protein binding protein, putative n=6 Tax=Magnol... 106 1e-21 UniRef50_D0MZH7 Putative uncharacterized protein n=1 Tax=Phytoph... 106 1e-21 UniRef50_B5JCH3 Putative uncharacterized protein (Fragment) n=1 ... 106 1e-21 UniRef50_A0CDA1 Chromosome undetermined scaffold_17, whole genom... 106 2e-21 UniRef50_A0C051 Chromosome undetermined scaffold_14, whole genom... 104 5e-21 UniRef50_Q23KK4 von Willebrand factor type A domain containing p... 102 3e-20 UniRef50_A0EHG0 Chromosome undetermined scaffold_97, whole genom... 101 4e-20 UniRef50_B8G546 von Willebrand factor type A n=6 Tax=Chloroflexi... 100 1e-19 UniRef50_A1ZFT4 Von Willebrand factor, type A n=1 Tax=Microscill... 100 1e-19 UniRef50_D2V0V6 Predicted protein n=1 Tax=Naegleria gruberi RepI... 100 2e-19 UniRef50_A5B5Z1 Putative uncharacterized protein n=2 Tax=Vitis v... 98 5e-19 UniRef50_A0P2E4 Von Willebrand factor type A like domain n=1 Tax... 98 6e-19 UniRef50_D2VDM1 Predicted protein n=1 Tax=Naegleria gruberi RepI... 94 8e-18 UniRef50_D2W4Q3 Predicted protein n=1 Tax=Naegleria gruberi RepI... 94 9e-18 UniRef50_A0CHZ1 Chromosome undetermined scaffold_185, whole geno... 93 2e-17 UniRef50_Q23FU3 von Willebrand factor type A domain containing p... 92 3e-17 UniRef50_UPI0001C38CFE von Willebrand factor, type A n=1 Tax=Art... 92 5e-17 UniRef50_A0KNK1 Putative uncharacterized protein n=1 Tax=Aeromon... 91 6e-17 UniRef50_B9MS10 von Willebrand factor type A n=2 Tax=Anaerocellu... 91 8e-17 UniRef50_Q114A2 von Willebrand factor, type A n=3 Tax=Cyanobacte... 89 3e-16 UniRef50_Q23AA2 Putative uncharacterized protein n=1 Tax=Tetrahy... 89 4e-16 UniRef50_A3PUP3 von Willebrand factor, type A n=1 Tax=Mycobacter... 87 2e-15 UniRef50_A8TM27 Putative Ser protein kinase n=1 Tax=alpha proteo... 85 5e-15 UniRef50_D2S019 ATPase associated with various cellular activiti... 84 1e-14 UniRef50_Q97HZ9 Predicted metal-dependent peptidase n=1 Tax=Clos... 79 2e-13 UniRef50_O26551 Magnesium chelatase subunit ChlI n=1 Tax=Methano... 74 1e-11 UniRef50_UPI0001913F8A hypothetical protein Salmonellaentericaen... 61 1e-07 Sequences not found previously or not previously below threshold: UniRef50_C1RGW7 Uncharacterized protein containing a von Willebr... 135 2e-30 UniRef50_A9FTM1 Putative uncharacterized protein n=1 Tax=Sorangi... 119 2e-25 UniRef50_A6DLI7 von Willebrand factor type A domain protein n=1 ... 116 1e-24 UniRef50_B1ZQD5 von Willebrand factor type A n=1 Tax=Opitutus te... 111 6e-23 UniRef50_C7PTX8 von Willebrand factor type A n=1 Tax=Chitinophag... 111 8e-23 UniRef50_A9GUK8 Putative uncharacterized protein yfbK n=1 Tax=So... 109 2e-22 UniRef50_C6VUB8 von Willebrand factor type A n=1 Tax=Dyadobacter... 106 2e-21 UniRef50_Q15NW6 Vault protein inter-alpha-trypsin n=1 Tax=Pseudo... 106 2e-21 UniRef50_A6GDG5 Putative lipoprotein n=1 Tax=Plesiocystis pacifi... 106 2e-21 UniRef50_Q1YZ74 Inter-alpha-trypsin inhibitor domain protein n=1... 104 6e-21 UniRef50_UPI0001AEB92D inter-alpha-trypsin inhibitor domain-cont... 104 9e-21 UniRef50_A8H5J9 LPXTG-motif cell wall anchor domain n=1 Tax=Shew... 102 3e-20 UniRef50_D1HHA4 Whole genome shotgun sequence of line PN40024, s... 100 7e-20 UniRef50_A0DZ93 Chromosome undetermined scaffold_7, whole genome... 100 2e-19 UniRef50_Q235T9 von Willebrand factor type A domain containing p... 100 2e-19 UniRef50_Q10RY0 Os03g0142500 protein n=2 Tax=Oryza sativa RepID=... 99 2e-19 UniRef50_B5JSY7 Inter-alpha-trypsin inhibitor domain protein n=1... 99 2e-19 UniRef50_A0E9B3 Chromosome undetermined scaffold_84, whole genom... 99 2e-19 UniRef50_D2VKS7 von Willebrand factor type A domain-containing p... 98 5e-19 UniRef50_A0C9G5 Chromosome undetermined scaffold_16, whole genom... 98 5e-19 UniRef50_UPI00017450FB von Willebrand factor type A domain prote... 98 7e-19 UniRef50_Q67LZ3 Putative uncharacterized protein n=1 Tax=Symbiob... 98 7e-19 UniRef50_Q3IHK0 Putative uncharacterized protein n=2 Tax=Alterom... 98 8e-19 UniRef50_UPI0001744662 Vault protein inter-alpha-trypsin domain ... 97 1e-18 UniRef50_A9V370 Predicted protein n=1 Tax=Monosiga brevicollis R... 96 2e-18 UniRef50_A9G8C3 Putative uncharacterized protein n=2 Tax=Sorangi... 96 2e-18 UniRef50_D2LQW0 von Willebrand factor type A n=1 Tax=Bacillus ce... 96 2e-18 UniRef50_B9XQ17 Vault protein inter-alpha-trypsin domain protein... 96 2e-18 UniRef50_C4WI90 Poly [ADP-ribose] polymerase 4 n=1 Tax=Ochrobact... 96 3e-18 UniRef50_A1ZUW0 Von Willebrand factor, type A n=1 Tax=Microscill... 96 3e-18 UniRef50_C5WYU9 Putative uncharacterized protein Sb01g047460 n=3... 96 3e-18 UniRef50_A0C5K4 Chromosome undetermined scaffold_150, whole geno... 95 5e-18 UniRef50_C1D2W8 Putative uncharacterized protein n=1 Tax=Deinoco... 95 5e-18 UniRef50_B7RYC9 Vault protein inter-alpha-trypsin n=1 Tax=marine... 95 5e-18 UniRef50_A7HVH6 Vault protein inter-alpha-trypsin domain protein... 94 6e-18 UniRef50_C4XPW8 Putative uncharacterized protein n=1 Tax=Desulfo... 94 8e-18 UniRef50_A4J6Q3 von Willebrand factor, type A n=1 Tax=Desulfotom... 94 1e-17 UniRef50_A6WMD3 Vault protein inter-alpha-trypsin domain protein... 94 1e-17 UniRef50_Q1D9B7 von Willebrand factor type A domain protein n=3 ... 94 1e-17 UniRef50_A5UTA6 von Willebrand factor, type A n=6 Tax=Chloroflex... 93 2e-17 UniRef50_B5JPY1 von Willebrand factor type A domain protein n=1 ... 93 2e-17 UniRef50_Q083T9 Vault protein inter-alpha-trypsin domain protein... 93 2e-17 UniRef50_C5EGH1 von Willebrand factor n=2 Tax=Clostridiales RepI... 93 2e-17 UniRef50_C1XMC3 Uncharacterized protein containing a von Willebr... 93 3e-17 UniRef50_Q9LN03 T6D22.13 n=2 Tax=Arabidopsis thaliana RepID=Q9LN... 92 4e-17 UniRef50_A7RKA1 Predicted protein n=2 Tax=Nematostella vectensis... 92 4e-17 UniRef50_A1VI76 Vault protein inter-alpha-trypsin domain protein... 92 4e-17 UniRef50_D0LNY0 von Willebrand factor type A n=1 Tax=Haliangium ... 92 4e-17 UniRef50_Q9M1S2 Putative uncharacterized protein T5N23_140 n=5 T... 91 6e-17 UniRef50_B9GK57 Predicted protein (Fragment) n=2 Tax=Populus tri... 91 6e-17 UniRef50_A0EFJ5 Chromosome undetermined scaffold_93, whole genom... 91 7e-17 UniRef50_B8CPU4 Von Willebrand factor, type A n=1 Tax=Shewanella... 91 7e-17 UniRef50_Q8YP40 Alr4360 protein n=15 Tax=Cyanobacteria RepID=Q8Y... 91 8e-17 UniRef50_A6X8G3 LPXTG-motif cell wall anchor domain protein n=11... 91 9e-17 UniRef50_C5CDX4 von Willebrand factor type A n=1 Tax=Kosmotoga o... 90 1e-16 UniRef50_B4VT64 Vault protein inter-alpha-trypsin n=9 Tax=Cyanob... 90 1e-16 UniRef50_A1S752 Inter-alpha-trypsin inhibitor domain protein n=1... 90 2e-16 UniRef50_A5UYN7 von Willebrand factor, type A n=1 Tax=Roseiflexu... 90 2e-16 UniRef50_Q22ST4 von Willebrand factor type A domain containing p... 89 2e-16 UniRef50_C5WZE3 Putative uncharacterized protein Sb01g019910 n=1... 89 2e-16 UniRef50_Q9FF49 Retroelement pol polyprotein-like n=34 Tax=Magno... 89 2e-16 UniRef50_Q21PJ3 von Willebrand factor, type A n=1 Tax=Saccharoph... 89 2e-16 UniRef50_Q7ULL3 Putative uncharacterized protein n=1 Tax=Rhodopi... 89 3e-16 UniRef50_UPI00006CAF43 von Willebrand factor type A domain conta... 89 4e-16 UniRef50_B5ZY26 Vault protein inter-alpha-trypsin domain protein... 88 5e-16 UniRef50_D0KVI6 Vault protein inter-alpha-trypsin domain protein... 88 5e-16 UniRef50_A6CIG8 Putative uncharacterized protein n=1 Tax=Bacillu... 88 5e-16 UniRef50_Q2SQR4 Uncharacterized protein containing a von Willebr... 88 6e-16 UniRef50_B0TQ23 LPXTG-motif cell wall anchor domain n=1 Tax=Shew... 88 6e-16 UniRef50_A3W9L9 Putative uncharacterized protein n=2 Tax=Erythro... 88 7e-16 UniRef50_Q2BJ22 Putative uncharacterized protein n=1 Tax=Neptuni... 88 8e-16 UniRef50_Q7UNM0 Putative uncharacterized protein n=1 Tax=Rhodopi... 87 1e-15 UniRef50_B9GVZ4 Predicted protein n=4 Tax=rosids RepID=B9GVZ4_POPTR 87 1e-15 UniRef50_D0LTR8 von Willebrand factor type A n=1 Tax=Haliangium ... 87 1e-15 UniRef50_Q3A188 von Willebrand factor type A domain protein n=1 ... 87 1e-15 UniRef50_B1KDL1 LPXTG-motif cell wall anchor domain protein n=1 ... 87 1e-15 UniRef50_A9F1H2 Family membership n=1 Tax=Sorangium cellulosum '... 87 1e-15 UniRef50_B0SHY6 Anti-sigma factor antagonist n=2 Tax=Leptospira ... 87 2e-15 UniRef50_Q8YW34 All1782 protein n=5 Tax=Cyanobacteria RepID=Q8YW... 87 2e-15 UniRef50_A4YGU7 von Willebrand factor, type A n=12 Tax=Sulfoloba... 86 2e-15 UniRef50_Q2QZH3 Os11g0687100 protein n=79 Tax=Eukaryota RepID=Q2... 86 2e-15 UniRef50_A6DLI0 Putative uncharacterized protein n=1 Tax=Lentisp... 86 2e-15 UniRef50_Q1DFU7 von Willebrand factor type A domain protein n=2 ... 86 2e-15 UniRef50_Q24FW2 von Willebrand factor type A domain containing p... 86 3e-15 UniRef50_D2R2I7 von Willebrand factor type A n=2 Tax=Planctomyce... 86 3e-15 UniRef50_UPI000180BC4A PREDICTED: similar to predicted protein n... 86 3e-15 UniRef50_UPI0001926ED6 PREDICTED: similar to inter-alpha trypsin... 86 3e-15 UniRef50_D0LL92 von Willebrand factor type A n=1 Tax=Haliangium ... 86 3e-15 UniRef50_C5WYV0 Putative uncharacterized protein Sb01g047470 n=3... 86 3e-15 UniRef50_A8FW78 Vault protein inter-alpha-trypsin domain protein... 86 3e-15 UniRef50_Q14624 35 kDa inter-alpha-trypsin inhibitor heavy chain... 85 4e-15 UniRef50_Q82LZ6 Putative uncharacterized protein n=1 Tax=Strepto... 85 5e-15 UniRef50_A6G857 Putative uncharacterized protein n=1 Tax=Plesioc... 85 5e-15 UniRef50_P79263 Inter-alpha-trypsin inhibitor heavy chain H4 n=3... 84 9e-15 UniRef50_Q10Z89 von Willebrand factor, type A n=1 Tax=Trichodesm... 84 9e-15 UniRef50_Q8LQ58 Os01g0640200 protein n=9 Tax=Poaceae RepID=Q8LQ5... 84 9e-15 UniRef50_A9QZI4 von Willebrand factor type A domain protein n=26... 84 1e-14 UniRef50_Q22N58 von Willebrand factor type A domain containing p... 83 1e-14 UniRef50_D0LJL4 Myxococcales GC_trans_RRR domain protein n=1 Tax... 83 2e-14 UniRef50_B4W304 von Willebrand factor type A domain protein (Fra... 83 2e-14 UniRef50_B8AE57 Putative uncharacterized protein n=1 Tax=Oryza s... 83 2e-14 UniRef50_UPI00016C377F protein containing a von Willebrand facto... 83 2e-14 UniRef50_B6TZ81 Protein binding protein n=9 Tax=Magnoliophyta Re... 83 3e-14 UniRef50_B8AXM1 Putative uncharacterized protein n=1 Tax=Oryza s... 83 3e-14 UniRef50_UPI0000E105CF vault protein inter-alpha-trypsin n=1 Tax... 82 4e-14 UniRef50_Q24C76 von Willebrand factor type A domain containing p... 82 4e-14 UniRef50_A0LHW4 Vault protein inter-alpha-trypsin domain protein... 82 4e-14 UniRef50_UPI000180CCF8 PREDICTED: similar to PK-120 n=1 Tax=Cion... 82 4e-14 UniRef50_Q8PU63 Putative chloride channel n=1 Tax=Methanosarcina... 82 4e-14 UniRef50_C0PS55 Putative uncharacterized protein n=1 Tax=Picea s... 82 5e-14 UniRef50_A9AXC2 von Willebrand factor type A n=6 Tax=Chloroflexi... 81 6e-14 UniRef50_A0LPD4 von Willebrand factor, type A n=1 Tax=Syntrophob... 81 6e-14 UniRef50_Q0AMP5 Vault protein inter-alpha-trypsin domain protein... 81 6e-14 UniRef50_B9XLE8 Vault protein inter-alpha-trypsin domain protein... 81 6e-14 UniRef50_C9YX20 Putative uncharacterized protein n=1 Tax=Strepto... 81 7e-14 UniRef50_D0LWF9 von Willebrand factor type A n=1 Tax=Haliangium ... 81 7e-14 UniRef50_UPI00006CF36E U-box domain containing protein n=1 Tax=T... 81 7e-14 UniRef50_Q47YR5 Von Willebrand factor type A domain protein n=2 ... 81 8e-14 UniRef50_B9RR85 Inter-alpha-trypsin inhibitor heavy chain, putat... 81 8e-14 UniRef50_B5W7H4 von Willebrand factor type A n=1 Tax=Arthrospira... 81 9e-14 UniRef50_Q498Q0 Inter-alpha (Globulin) inhibitor H3 n=6 Tax=Clup... 81 1e-13 UniRef50_Q9NY47 Voltage-dependent calcium channel subunit delta-... 80 1e-13 UniRef50_B0CG18 von Willebrand factor type A domain protein, put... 80 1e-13 UniRef50_Q09DT2 Inter-alpha-trypsin inhibitor family heavy chain... 80 1e-13 UniRef50_UPI0000E460BF PREDICTED: similar to inter-alpha-trypsin... 80 2e-13 UniRef50_C4RGW7 Putative uncharacterized protein n=1 Tax=Micromo... 80 2e-13 UniRef50_UPI00006A1915 Transmembrane protein 110. n=2 Tax=Xenopu... 80 2e-13 UniRef50_Q6ZED8 Slr7060 protein n=2 Tax=Chroococcales RepID=Q6ZE... 80 2e-13 UniRef50_C7FPD9 Uncharacterized protein n=2 Tax=environmental sa... 79 2e-13 UniRef50_A6G2V8 von Willebrand factor, type A n=1 Tax=Plesiocyst... 79 2e-13 UniRef50_A8M9M1 von Willebrand factor type A n=1 Tax=Caldivirga ... 79 3e-13 UniRef50_Q8YTA2 Alr2822 protein n=11 Tax=Bacteria RepID=Q8YTA2_A... 79 3e-13 UniRef50_UPI0001788F71 von Willebrand factor type A n=1 Tax=Geob... 79 3e-13 UniRef50_C3YPR2 Putative uncharacterized protein (Fragment) n=1 ... 79 3e-13 UniRef50_C0HF51 Putative uncharacterized protein n=1 Tax=Zea may... 79 3e-13 UniRef50_B8BII0 Putative uncharacterized protein n=1 Tax=Oryza s... 79 3e-13 UniRef50_C5Z1W1 Putative uncharacterized protein Sb10g030210 n=1... 79 4e-13 UniRef50_Q6ABM1 Magnesium-chelatase 67 kDa subunit n=4 Tax=Actin... 79 4e-13 UniRef50_C0E6Z8 Putative uncharacterized protein n=2 Tax=Coryneb... 79 4e-13 UniRef50_B6ZDR6 Voltage dependent calcium channel alpha2d/delta ... 79 4e-13 UniRef50_A6GC99 Putative uncharacterized protein n=1 Tax=Plesioc... 79 5e-13 UniRef50_Q0AV90 Putative uncharacterized protein n=1 Tax=Syntrop... 78 5e-13 UniRef50_A6G415 von Willebrand factor, type A n=1 Tax=Plesiocyst... 78 5e-13 UniRef50_Q8H924 Os10g0464900 protein n=3 Tax=Oryza sativa RepID=... 78 5e-13 UniRef50_Q6L2C8 Putative uncharacterized protein n=1 Tax=Picroph... 78 5e-13 UniRef50_B0JR39 von Willebrand factor type A n=1 Tax=Microcystis... 78 6e-13 UniRef50_A2FKC6 von Willebrand factor type A domain containing p... 78 6e-13 UniRef50_A9FM70 Putative uncharacterized protein n=1 Tax=Sorangi... 78 6e-13 UniRef50_B8HSI1 von Willebrand factor type A n=8 Tax=Cyanobacter... 78 7e-13 UniRef50_B0UK93 LPXTG-motif cell wall anchor domain protein n=1 ... 78 7e-13 UniRef50_Q1DE81 von Willebrand factor type A domain protein n=2 ... 77 1e-12 UniRef50_UPI0000F2DDBB PREDICTED: similar to Inter-alpha (globul... 77 1e-12 UniRef50_C1I2R0 von Willebrand factor n=1 Tax=Clostridium sp. 7_... 77 1e-12 UniRef50_UPI0001C1630F hypothetical protein CRD_00534 n=2 Tax=No... 77 1e-12 UniRef50_Q2QSE5 Os12g0431700 protein n=3 Tax=Oryza sativa RepID=... 77 1e-12 UniRef50_C6WL97 VWA containing CoxE family protein n=1 Tax=Actin... 77 1e-12 UniRef50_D2VHB8 Predicted protein n=1 Tax=Naegleria gruberi RepI... 77 1e-12 UniRef50_UPI0000E4A663 PREDICTED: similar to calcium activated c... 77 1e-12 UniRef50_C8NWP6 Putative uncharacterized protein n=1 Tax=Coryneb... 77 1e-12 UniRef50_B1HPN0 Putative uncharacterized protein n=2 Tax=Bacilla... 77 2e-12 UniRef50_A3QDW1 Vault protein inter-alpha-trypsin domain protein... 76 2e-12 UniRef50_C5YMJ6 Putative uncharacterized protein Sb07g002215 (Fr... 76 2e-12 UniRef50_A3U9M7 Putative uncharacterized protein n=1 Tax=Croceib... 76 2e-12 UniRef50_A4YGI9 von Willebrand factor, type A n=1 Tax=Metallosph... 76 2e-12 UniRef50_B4UFP8 von Willebrand factor type A n=1 Tax=Anaeromyxob... 76 2e-12 UniRef50_C7RNW6 von Willebrand factor type A n=1 Tax=Candidatus ... 76 2e-12 UniRef50_B1X316 Putative uncharacterized protein n=1 Tax=Cyanoth... 76 2e-12 UniRef50_Q4SBF6 Chromosome 11 SCAF14674, whole genome shotgun se... 76 3e-12 UniRef50_D1KBY4 Putative uncharacterized protein n=2 Tax=Proteob... 76 3e-12 UniRef50_C4G1K3 Putative uncharacterized protein n=1 Tax=Abiotro... 76 3e-12 UniRef50_B4VXM6 Vault protein inter-alpha-trypsin n=1 Tax=Microc... 76 3e-12 UniRef50_A1U6Y4 Vault protein inter-alpha-trypsin domain protein... 76 4e-12 UniRef50_A6FSG0 Putative uncharacterized protein n=1 Tax=Roseoba... 75 4e-12 UniRef50_UPI000174639B hypothetical protein VspiD_18615 n=1 Tax=... 75 4e-12 UniRef50_B5YMD8 Predicted protein n=1 Tax=Thalassiosira pseudona... 75 5e-12 UniRef50_B7G6X2 Predicted protein n=1 Tax=Phaeodactylum tricornu... 75 5e-12 UniRef50_A0CDA0 Chromosome undetermined scaffold_17, whole genom... 75 5e-12 UniRef50_Q2B7C3 Putative uncharacterized protein n=1 Tax=Bacillu... 75 5e-12 UniRef50_Q7US47 Putative uncharacterized protein n=1 Tax=Rhodopi... 75 6e-12 UniRef50_UPI00016E8A41 UPI00016E8A41 related cluster n=3 Tax=Tak... 75 6e-12 UniRef50_C5YHY2 Putative uncharacterized protein Sb07g005010 n=2... 75 6e-12 UniRef50_Q3M2E0 von Willebrand factor, type A n=2 Tax=Cyanobacte... 74 7e-12 UniRef50_C7N1C6 Uncharacterized protein n=1 Tax=Slackia heliotri... 74 7e-12 UniRef50_B4DPQ4 cDNA FLJ60769, highly similar to Inter-alpha-try... 74 7e-12 UniRef50_Q6PGW2 Zgc:112265 protein (Fragment) n=15 Tax=Clupeocep... 74 8e-12 UniRef50_P19823 Inter-alpha-trypsin inhibitor heavy chain H2 n=4... 74 9e-12 UniRef50_B8G7Y1 von Willebrand factor type A n=3 Tax=Chloroflexu... 74 9e-12 UniRef50_D2AUU0 Putative uncharacterized protein n=1 Tax=Strepto... 74 9e-12 UniRef50_Q9LMB7 F14D16.26 n=5 Tax=rosids RepID=Q9LMB7_ARATH 74 9e-12 UniRef50_A9B057 von Willebrand factor type A n=3 Tax=Chloroflexi... 74 1e-11 UniRef50_A9WI94 von Willebrand factor type A n=2 Tax=Chloroflexu... 74 1e-11 UniRef50_Q7G2L9 Os10g0464500 protein n=3 Tax=Magnoliophyta RepID... 74 1e-11 UniRef50_D0Z403 Putative uncharacterized protein n=1 Tax=Photoba... 74 1e-11 UniRef50_A8N264 Putative uncharacterized protein n=1 Tax=Coprino... 74 1e-11 UniRef50_Q60ED8 Von Willebrand factor type A domain containing p... 74 1e-11 UniRef50_Q10JU7 Von Willebrand factor type A domain containing p... 74 1e-11 UniRef50_Q12VX7 Putative uncharacterized protein n=1 Tax=Methano... 73 1e-11 UniRef50_B7AA98 von Willebrand factor type A n=3 Tax=Thermus Rep... 73 2e-11 UniRef50_UPI000155CC23 PREDICTED: similar to ITI-like protein n=... 73 2e-11 UniRef50_A9SQ90 Predicted protein n=3 Tax=Physcomitrella patens ... 73 2e-11 UniRef50_C0Z595 Putative uncharacterized protein n=1 Tax=Breviba... 73 2e-11 UniRef50_Q86UX2 Inter-alpha-trypsin inhibitor heavy chain H5 n=4... 73 2e-11 UniRef50_D2UZF5 von Willebrand factor type A domain-containing p... 73 2e-11 UniRef50_D1YZJ4 Putative uncharacterized protein n=1 Tax=Methano... 73 2e-11 UniRef50_A8MJ77 Magnesium chelatase n=2 Tax=Clostridiales RepID=... 73 2e-11 UniRef50_A0CN87 Chromosome undetermined scaffold_22, whole genom... 73 2e-11 UniRef50_C9LTL4 Magnesium-chelatase, subunit D/I family n=1 Tax=... 73 2e-11 UniRef50_B9GN58 Predicted protein (Fragment) n=3 Tax=rosids RepI... 73 2e-11 UniRef50_B7KCF7 von Willebrand factor type A n=1 Tax=Cyanothece ... 73 2e-11 UniRef50_C9RRF6 von Willebrand factor type A n=3 Tax=Bacteria Re... 72 3e-11 UniRef50_B3QTN9 Vault protein inter-alpha-trypsin domain protein... 72 3e-11 UniRef50_C3JL94 von Willebrand factor type A domain protein n=1 ... 72 3e-11 UniRef50_C1XFF8 Mg-chelatase subunit ChlD n=1 Tax=Meiothermus ru... 72 3e-11 UniRef50_B4BQC0 von Willebrand factor type A n=2 Tax=Geobacillus... 72 3e-11 UniRef50_A0LPK8 Vault protein inter-alpha-trypsin domain protein... 72 4e-11 UniRef50_Q46AG0 BatA n=3 Tax=Methanomicrobia RepID=Q46AG0_METBF 72 4e-11 UniRef50_Q54CQ8 von Willebrand factor A domain-containing protei... 72 4e-11 UniRef50_Q01UI0 von Willebrand factor, type A n=1 Tax=Candidatus... 72 4e-11 UniRef50_C7P2A9 von Willebrand factor type A n=2 Tax=Halobacteri... 72 4e-11 UniRef50_Q3M1S2 von Willebrand factor, type A n=1 Tax=Anabaena v... 72 5e-11 UniRef50_Q1VY89 Inter-alpha-trypsin inhibitor family heavy chain... 72 5e-11 UniRef50_Q466I6 Putative uncharacterized protein n=3 Tax=Methano... 72 5e-11 UniRef50_B5JQC2 Vault protein inter-alpha-trypsin n=1 Tax=Verruc... 72 5e-11 UniRef50_A8SU73 Putative uncharacterized protein n=1 Tax=Coproco... 72 5e-11 UniRef50_C7R936 Vault protein inter-alpha-trypsin domain protein... 72 5e-11 UniRef50_A0CK50 Chromosome undetermined scaffold_2, whole genome... 72 6e-11 UniRef50_A4XHD9 von Willebrand factor, type A n=2 Tax=Clostridia... 71 6e-11 UniRef50_A8J658 Collagen-related protein n=1 Tax=Chlamydomonas r... 71 6e-11 UniRef50_D0LD98 von Willebrand factor type A n=1 Tax=Gordonia br... 71 6e-11 UniRef50_Q6ZFR4 Zinc finger (C3HC4-type RING finger) protein fam... 71 6e-11 UniRef50_UPI00017B0D26 UPI00017B0D26 related cluster n=2 Tax=Tet... 71 6e-11 UniRef50_D1HBR9 Whole genome shotgun sequence of line PN40024, s... 71 6e-11 UniRef50_B5YKY5 Magnesium-chelatase subunit ChlD n=1 Tax=Thermod... 71 6e-11 UniRef50_A6GBY0 Putative uncharacterized protein n=1 Tax=Plesioc... 71 6e-11 UniRef50_B5W3I3 von Willebrand factor type A n=4 Tax=Cyanobacter... 71 7e-11 UniRef50_C6VU79 Sigma 54 interacting domain protein n=1 Tax=Dyad... 71 7e-11 UniRef50_Q7UL83 Inter-alpha-trypsin inhibitor family heavy chain... 71 9e-11 UniRef50_Q09E12 Inter-alpha-inhibitor H4 heavy chain, putative n... 71 9e-11 UniRef50_C9SWV9 U-box domain containing protein n=1 Tax=Verticil... 71 1e-10 UniRef50_A8J0D9 Flagellar associated protein n=1 Tax=Chlamydomon... 71 1e-10 UniRef50_Q24CQ9 von Willebrand factor type A domain containing p... 71 1e-10 UniRef50_C6PWL8 von Willebrand factor type A n=1 Tax=Clostridium... 70 1e-10 UniRef50_A9U149 Predicted protein n=1 Tax=Physcomitrella patens ... 70 1e-10 UniRef50_UPI000180D2FB PREDICTED: similar to inter-alpha (globul... 70 1e-10 UniRef50_D2V048 Predicted protein n=2 Tax=Naegleria gruberi RepI... 70 1e-10 UniRef50_UPI000180D3E0 PREDICTED: similar to LOC779593 protein n... 70 2e-10 UniRef50_A2E1S5 von Willebrand factor type A domain containing p... 70 2e-10 UniRef50_Q1NTK1 Von Willebrand factor, type A n=2 Tax=delta prot... 70 2e-10 UniRef50_A2E6Y7 von Willebrand factor type A domain containing p... 70 2e-10 UniRef50_A7RNW3 Predicted protein n=3 Tax=Nematostella vectensis... 70 2e-10 UniRef50_D1IDZ7 Whole genome shotgun sequence of line PN40024, s... 70 2e-10 UniRef50_B8CHV3 VCBS n=1 Tax=Shewanella piezotolerans WP3 RepID=... 70 2e-10 UniRef50_Q8YNZ7 Alr4412 protein n=6 Tax=Cyanobacteria RepID=Q8YN... 70 2e-10 UniRef50_Q6UXX5 Inter-alpha-trypsin inhibitor heavy chain H5-lik... 70 2e-10 UniRef50_Q4RV83 Chromosome 15 SCAF14992, whole genome shotgun se... 69 2e-10 UniRef50_A3DLZ3 von Willebrand factor, type A n=1 Tax=Staphyloth... 69 2e-10 UniRef50_Q5UWJ9 Calcium-binding protein-like n=1 Tax=Haloarcula ... 69 2e-10 UniRef50_Q11Y10 Possible outer membrane protein n=1 Tax=Cytophag... 69 3e-10 UniRef50_Q7SGD8 Predicted protein n=4 Tax=Sordariales RepID=Q7SG... 69 3e-10 UniRef50_UPI0001C378BC von Willebrand factor, type A n=1 Tax=Rum... 69 3e-10 UniRef50_A3DK47 von Willebrand factor, type A n=9 Tax=cellular o... 69 3e-10 UniRef50_B8KT14 Magnesium-chelatase 60 kDa subunit n=1 Tax=gamma... 69 3e-10 UniRef50_A7RVQ6 Predicted protein n=1 Tax=Nematostella vectensis... 69 3e-10 UniRef50_B8F8Z6 von Willebrand factor type A n=1 Tax=Desulfatiba... 69 3e-10 UniRef50_Q4RF07 Chromosome 13 SCAF15122, whole genome shotgun se... 69 3e-10 UniRef50_A6Q2J6 von Willebrand factor type A domain protein n=1 ... 69 3e-10 UniRef50_C5FLY1 U-box domain containing protein n=1 Tax=Microspo... 69 3e-10 UniRef50_A7S6T1 Predicted protein n=2 Tax=Nematostella vectensis... 69 4e-10 UniRef50_Q2QZN4 von Willebrand factor type A domain containing p... 69 4e-10 UniRef50_A0CY84 Chromosome undetermined scaffold_307, whole geno... 69 4e-10 UniRef50_Q55G98 von Willebrand factor A domain-containing protei... 69 4e-10 UniRef50_D1CCX6 von Willebrand factor type A n=1 Tax=Thermobacul... 69 4e-10 UniRef50_A5UWS5 von Willebrand factor, type A n=2 Tax=Roseiflexu... 69 4e-10 UniRef50_D1CG77 von Willebrand factor type A; type II secretion ... 69 4e-10 UniRef50_D1BQE7 von Willebrand factor type A n=1 Tax=Veillonella... 69 4e-10 UniRef50_A0DIJ2 Chromosome undetermined scaffold_52, whole genom... 69 4e-10 UniRef50_Q22UB9 von Willebrand factor type A domain containing p... 69 4e-10 UniRef50_Q2W311 Putative uncharacterized protein n=1 Tax=Magneto... 69 4e-10 UniRef50_B8GAZ1 von Willebrand factor type A n=3 Tax=Chloroflexu... 69 4e-10 UniRef50_A3TQW7 Putative membrane protein n=1 Tax=Janibacter sp.... 69 4e-10 UniRef50_B2A702 von Willebrand factor type A n=1 Tax=Natranaerob... 69 4e-10 UniRef50_A7HHW8 Vault protein inter-alpha-trypsin domain protein... 69 4e-10 UniRef50_C1YR26 Uncharacterized protein containing a von Willebr... 68 5e-10 UniRef50_UPI00006CDA4D Glutathionylspermidine synthase family pr... 68 5e-10 UniRef50_Q3M1S4 von Willebrand factor, type A n=8 Tax=Cyanobacte... 68 5e-10 UniRef50_A9UVU8 Predicted protein n=1 Tax=Monosiga brevicollis R... 68 5e-10 UniRef50_A0CCS0 Chromosome undetermined scaffold_168, whole geno... 68 5e-10 UniRef50_B0TSG0 Vault protein inter-alpha-trypsin domain protein... 68 6e-10 UniRef50_Q9XAH6 Putative uncharacterized protein SCO6688 n=2 Tax... 68 6e-10 UniRef50_Q8H923 Putative uncharacterized protein OSJNBa0071K18.1... 68 6e-10 UniRef50_B9R4P7 von Willebrand factor type A domain protein n=1 ... 68 6e-10 UniRef50_Q54DU5 von Willebrand factor A domain-containing protei... 68 7e-10 UniRef50_UPI0000D560E4 PREDICTED: similar to inter-alpha (globul... 68 7e-10 UniRef50_D1YYY2 Putative uncharacterized protein n=1 Tax=Methano... 68 7e-10 UniRef50_C9RU69 von Willebrand factor type A n=2 Tax=Geobacillus... 68 7e-10 UniRef50_A7RTF3 Predicted protein (Fragment) n=1 Tax=Nematostell... 68 7e-10 UniRef50_B9XNU9 von Willebrand factor type A n=1 Tax=bacterium E... 68 7e-10 UniRef50_B6B3S7 Magnesium chelatase ATPase subunit D n=1 Tax=Rho... 68 8e-10 UniRef50_A0BS51 Chromosome undetermined scaffold_124, whole geno... 68 8e-10 UniRef50_B4S8S0 Magnesium chelatase ATPase subunit D n=3 Tax=Chl... 67 8e-10 UniRef50_Q5RHF3 Novel protein similar to vertebrate inter-alpha ... 67 1e-09 UniRef50_B1I3V2 Magnesium chelatase n=4 Tax=cellular organisms R... 67 1e-09 UniRef50_C6Z299 Tellurium resistance protein n=1 Tax=Bacteroides... 67 1e-09 UniRef50_C0M4X9 Inter-alpha-trypsin inhibitor heavy chain H4 (Fr... 67 1e-09 UniRef50_C3ZG18 Putative uncharacterized protein n=1 Tax=Branchi... 67 1e-09 UniRef50_D0LP28 von Willebrand factor type A n=1 Tax=Haliangium ... 67 1e-09 UniRef50_A0CKU6 Chromosome undetermined scaffold_20, whole genom... 67 1e-09 UniRef50_C1XFI8 Mg-chelatase subunit ChlD n=2 Tax=Meiothermus Re... 67 1e-09 UniRef50_A6G9E8 von Willebrand factor type A domain protein n=1 ... 67 1e-09 UniRef50_Q22G03 Putative uncharacterized protein n=1 Tax=Tetrahy... 67 1e-09 UniRef50_A6G7V2 von Willebrand factor, type A n=1 Tax=Plesiocyst... 67 1e-09 UniRef50_C4V3L6 Magnesium chelatase n=2 Tax=Selenomonas RepID=C4... 67 1e-09 UniRef50_D0N9W4 Putative uncharacterized protein n=2 Tax=Phytoph... 67 1e-09 UniRef50_C7YL43 Putative uncharacterized protein n=2 Tax=Nectria... 67 1e-09 UniRef50_C9RJ63 von Willebrand factor type A n=1 Tax=Fibrobacter... 67 1e-09 UniRef50_Q22SJ7 von Willebrand factor type A domain containing p... 67 1e-09 UniRef50_Q237Q6 von Willebrand factor type A domain containing p... 67 1e-09 UniRef50_C0ZKA0 Putative uncharacterized protein n=2 Tax=Bacteri... 67 2e-09 UniRef50_C5PP99 von Willebrand factor, type A n=2 Tax=Bacteroide... 67 2e-09 UniRef50_A2F7N4 von Willebrand factor type A domain containing p... 67 2e-09 UniRef50_C1GWG1 von Willebrand factor type A domain containing p... 67 2e-09 UniRef50_Q7MCW9 Uncharacterized protein n=2 Tax=Vibrio vulnificu... 66 2e-09 UniRef50_UPI0001BC5690 magnesium chelatase n=1 Tax=Fusobacterium... 66 2e-09 UniRef50_Q87W17 von Willebrand factor type A domain protein n=2 ... 66 2e-09 UniRef50_Q8TU27 Putative uncharacterized protein n=1 Tax=Methano... 66 2e-09 UniRef50_Q23JA0 von Willebrand factor type A domain containing p... 66 2e-09 UniRef50_P76396 Uncharacterized protein yegL n=64 Tax=cellular o... 66 2e-09 UniRef50_UPI00006CD16B von Willebrand factor type A domain conta... 66 2e-09 UniRef50_C3YRH3 Putative uncharacterized protein (Fragment) n=1 ... 66 2e-09 UniRef50_A6EQD3 von Willebrand factor type A like domain n=2 Tax... 66 2e-09 UniRef50_Q7UNJ0 Putative uncharacterized protein n=1 Tax=Rhodopi... 66 2e-09 UniRef50_D1YD07 von Willebrand factor type A domain protein n=2 ... 66 3e-09 UniRef50_C8XH18 von Willebrand factor type A n=1 Tax=Nakamurella... 66 3e-09 UniRef50_UPI0001792BA1 PREDICTED: similar to Inter-alpha-trypsin... 66 3e-09 UniRef50_A0PNU3 UPF0353 protein MUL_1490 n=43 Tax=Actinomycetale... 66 3e-09 UniRef50_UPI0001C37785 von Willebrand factor type A n=1 Tax=Rumi... 66 3e-09 UniRef50_UPI0001C31E2D von Willebrand factor type A n=1 Tax=Cone... 66 3e-09 UniRef50_A7T2Z0 Predicted protein n=4 Tax=Nematostella vectensis... 66 3e-09 UniRef50_A5UW94 von Willebrand factor, type A n=2 Tax=Roseiflexu... 66 3e-09 UniRef50_Q2QZN5 Putative uncharacterized protein n=1 Tax=Oryza s... 66 3e-09 UniRef50_C8W3F9 von Willebrand factor type A n=1 Tax=Desulfotoma... 66 3e-09 UniRef50_C0QP91 Putative von Willebrand factor type A domain pro... 66 3e-09 UniRef50_UPI00016DFBC7 UPI00016DFBC7 related cluster n=4 Tax=Tak... 66 4e-09 UniRef50_A6FXN3 Putative uncharacterized protein n=1 Tax=Plesioc... 66 4e-09 UniRef50_B6BJ58 Phage/colicin/tellurite resistance cluster TerY ... 66 4e-09 UniRef50_B2HDT6 Putative uncharacterized protein n=3 Tax=Mycobac... 66 4e-09 UniRef50_C8NPK0 Magnesium chelatase n=4 Tax=Corynebacterium RepI... 66 4e-09 UniRef50_Q54DV3 von Willebrand factor A domain-containing protei... 66 4e-09 UniRef50_Q6SGW6 Magnesium-chelatase, 60 kDa subunit n=1 Tax=uncu... 65 4e-09 UniRef50_A0BWF5 Chromosome undetermined scaffold_132, whole geno... 65 4e-09 UniRef50_B7FTA2 Predicted protein n=3 Tax=Bacillariophyta RepID=... 65 4e-09 UniRef50_C0AYK3 Putative uncharacterized protein n=1 Tax=Proteus... 65 4e-09 UniRef50_O50313 Magnesium-chelatase 67 kDa subunit n=15 Tax=Bact... 65 4e-09 UniRef50_B9ML47 YD repeat protein n=1 Tax=Anaerocellum thermophi... 65 5e-09 UniRef50_UPI000180C2AF PREDICTED: similar to FiBrilliN homolog f... 65 5e-09 UniRef50_UPI0000E47594 PREDICTED: similar to inter-alpha (globul... 65 5e-09 UniRef50_Q0FYU3 Von Willebrand factor, type A n=1 Tax=Fulvimarin... 65 5e-09 UniRef50_D1A557 Vault protein inter-alpha-trypsin domain protein... 65 5e-09 UniRef50_Q19NM5 TerY1 n=54 Tax=root RepID=Q19NM5_ECOK1 65 6e-09 UniRef50_B0VJ57 Putative uncharacterized protein n=1 Tax=Candida... 65 6e-09 UniRef50_Q22HH7 von Willebrand factor type A domain containing p... 65 6e-09 UniRef50_A9AX98 von Willebrand factor type A n=1 Tax=Herpetosiph... 65 6e-09 UniRef50_A6R161 Predicted protein n=3 Tax=Onygenales RepID=A6R16... 65 6e-09 UniRef50_C9ZGQ6 Putative membrane protein n=3 Tax=Streptomyces R... 65 7e-09 UniRef50_P19827 Inter-alpha-trypsin inhibitor heavy chain H1 n=6... 65 7e-09 UniRef50_A2E0T6 von Willebrand factor type A domain containing p... 65 7e-09 UniRef50_B6HQ22 Pc22g19800 protein n=1 Tax=Penicillium chrysogen... 65 7e-09 UniRef50_Q54LJ4 Type A von Willebrand factor domain-containing p... 65 7e-09 UniRef50_A6QBT6 von Willebrand factor type A domain protein n=1 ... 64 7e-09 UniRef50_A7VF89 Putative uncharacterized protein n=1 Tax=Clostri... 64 7e-09 UniRef50_A3ZR58 Putative uncharacterized protein n=1 Tax=Blastop... 64 7e-09 UniRef50_Q503P4 Zgc:110377 n=9 Tax=Clupeocephala RepID=Q503P4_DANRE 64 7e-09 UniRef50_A8VYD1 Extracellular solute-binding protein, family 5 n... 64 7e-09 UniRef50_Q8TYU9 Mg-chelatase subunit ChlI and Chld (MoxR-like AT... 64 7e-09 UniRef50_A6Q208 von Willebrand factor type A domain protein n=1 ... 64 8e-09 UniRef50_C4LHS9 Magnesium-chelatase subunit D n=1 Tax=Corynebact... 64 8e-09 UniRef50_Q9SJE1 Magnesium-chelatase subunit chlD, chloroplastic ... 64 8e-09 UniRef50_A4Y9K4 von Willebrand factor, type A n=3 Tax=Shewanella... 64 8e-09 UniRef50_Q562D1 LOC594926 protein (Fragment) n=18 Tax=Euteleosto... 64 8e-09 UniRef50_A9RSX3 Predicted protein n=1 Tax=Physcomitrella patens ... 64 8e-09 UniRef50_Q0VTG8 Protein containing a von Willebrand factor type ... 64 8e-09 UniRef50_Q4S685 Chromosome 9 SCAF14729, whole genome shotgun seq... 64 9e-09 UniRef50_A2DWC0 von Willebrand factor type A domain containing p... 64 9e-09 UniRef50_A7C4W6 von Willebrand factor, type A n=1 Tax=Beggiatoa ... 64 9e-09 UniRef50_Q23J98 von Willebrand factor type A domain containing p... 64 9e-09 UniRef50_A0M6V9 Membrane protein containing von Willebrand facto... 64 9e-09 UniRef50_A5GQG5 Protoporphyrin IX Mg-chelatase subunit ChlD n=3 ... 64 1e-08 UniRef50_A8DJP2 von Willebrand factor type A n=1 Tax=Candidatus ... 64 1e-08 UniRef50_A8IJ40 Predicted protein n=1 Tax=Chlamydomonas reinhard... 64 1e-08 UniRef50_A5GIB9 Protoporphyrin IX Mg-chelatase subunit ChlD n=22... 64 1e-08 UniRef50_UPI0001760CA2 PREDICTED: inter-alpha (globulin) inhibit... 64 1e-08 UniRef50_C5CEE7 PEGA domain protein n=1 Tax=Kosmotoga olearia TB... 64 1e-08 UniRef50_A0C3R8 Chromosome undetermined scaffold_148, whole geno... 64 1e-08 UniRef50_C2MDE3 BatA protein n=6 Tax=Bacteroidales RepID=C2MDE3_... 64 1e-08 UniRef50_Q47M48 von Willebrand factor, type A n=4 Tax=Streptospo... 64 1e-08 UniRef50_C0CX78 Putative uncharacterized protein n=1 Tax=Clostri... 64 1e-08 UniRef50_B2AQN8 Predicted CDS Pa_4_3600 n=1 Tax=Podospora anseri... 64 1e-08 UniRef50_B9HP09 Predicted protein n=13 Tax=cellular organisms Re... 64 1e-08 UniRef50_UPI0001C161B1 von Willebrand factor, type A Precursor n... 64 1e-08 UniRef50_Q46D40 Putative uncharacterized protein n=3 Tax=Methano... 64 1e-08 UniRef50_D1YP15 von Willebrand factor type A domain protein n=1 ... 63 2e-08 UniRef50_UPI0001BC3853 von Willebrand factor type A n=1 Tax=Buty... 63 2e-08 UniRef50_A6G2Y5 Putative uncharacterized protein n=1 Tax=Plesioc... 63 2e-08 UniRef50_A9BS02 von Willebrand factor type A n=1 Tax=Delftia aci... 63 2e-08 UniRef50_A2AR69 Novel protein similar to vertebrate inter-alpha ... 63 2e-08 UniRef50_Q2JAM5 Protoporphyrin IX magnesium-chelatase n=13 Tax=B... 63 2e-08 UniRef50_A9AUC9 von Willebrand factor type A n=1 Tax=Herpetosiph... 63 2e-08 UniRef50_Q0A603 von Willebrand factor, type A n=1 Tax=Alkalilimn... 63 2e-08 UniRef50_Q2SB91 Uncharacterized protein containing a von Willebr... 63 2e-08 UniRef50_UPI0000F2D28F PREDICTED: hypothetical protein n=1 Tax=M... 63 2e-08 UniRef50_A0LFJ6 Protoporphyrin IX magnesium-chelatase n=1 Tax=Sy... 63 2e-08 UniRef50_C6JMG8 Magnesium chelatase n=1 Tax=Fusobacterium varium... 63 2e-08 UniRef50_UPI000185CA78 von Willebrand factor type A n=1 Tax=Capn... 63 2e-08 UniRef50_UPI000180D2ED PREDICTED: similar to predicted protein n... 63 2e-08 UniRef50_A0D1M1 Chromosome undetermined scaffold_34, whole genom... 63 2e-08 UniRef50_A9F2Q0 Putative uncharacterized protein n=1 Tax=Sorangi... 63 2e-08 UniRef50_A7BVG3 von Willebrand factor type A domain protein n=1 ... 63 2e-08 UniRef50_A9GBN0 Putative uncharacterized protein n=1 Tax=Sorangi... 63 2e-08 UniRef50_B2UZB2 von Willebrand factor type A domain protein n=1 ... 63 2e-08 UniRef50_UPI00005843FB PREDICTED: hypothetical protein n=1 Tax=S... 63 2e-08 UniRef50_UPI00016C09D7 hypothetical protein Epulo_01596 n=1 Tax=... 63 2e-08 UniRef50_A9L314 Outer membrane adhesin like proteiin n=5 Tax=She... 63 2e-08 UniRef50_C1XWJ2 von Willebrand factor type A-like protein n=1 Ta... 63 2e-08 UniRef50_C7G046 von Willebrand factor A domain-containing protei... 63 2e-08 UniRef50_UPI000180B9AB PREDICTED: similar to CLCA family member ... 63 2e-08 UniRef50_C3XQR7 Putative uncharacterized protein n=1 Tax=Branchi... 63 2e-08 UniRef50_UPI000178810F von Willebrand factor type A n=1 Tax=Geob... 63 2e-08 UniRef50_A7C1J8 von Willebrand factor, type A n=1 Tax=Beggiatoa ... 63 2e-08 UniRef50_D2RGP5 von Willebrand factor type A n=1 Tax=Archaeoglob... 62 3e-08 UniRef50_B9EIV3 Cacna2d4 protein n=1 Tax=Mus musculus RepID=B9EI... 62 3e-08 UniRef50_C7NN24 von Willebrand factor type A n=1 Tax=Halorhabdus... 62 3e-08 UniRef50_P33352 Uncharacterized protein yehP n=69 Tax=root RepID... 62 3e-08 UniRef50_C1TQB2 Uncharacterized protein n=1 Tax=Dethiosulfovibri... 62 3e-08 UniRef50_A7RFL6 Predicted protein n=1 Tax=Nematostella vectensis... 62 3e-08 UniRef50_Q64D90 Cell surface protein n=8 Tax=environmental sampl... 62 3e-08 UniRef50_UPI00004D9B6D UPI00004D9B6D related cluster n=2 Tax=Xen... 62 3e-08 UniRef50_B1L6Y8 von Willebrand factor type A n=1 Tax=Candidatus ... 62 3e-08 UniRef50_C1XTM1 Uncharacterized protein containing a von Willebr... 62 3e-08 UniRef50_C1XUY0 Mg-chelatase subunit ChlD n=2 Tax=Meiothermus Re... 62 3e-08 UniRef50_Q6VPP3 Parturition-related protein PRP3 n=6 Tax=Eutheri... 62 3e-08 UniRef50_B8HUC4 von Willebrand factor type A n=7 Tax=Bacteria Re... 62 4e-08 UniRef50_Q54MG4 von Willebrand factor A domain-containing protei... 62 4e-08 UniRef50_B5YCB4 von Willebrand factor type A domain protein n=2 ... 62 4e-08 UniRef50_C0DBA5 Putative uncharacterized protein n=1 Tax=Clostri... 62 4e-08 UniRef50_UPI000186D791 calcium channel, putative n=3 Tax=Neopter... 62 4e-08 UniRef50_B2VYM4 von Willebrand domain containing protein n=3 Tax... 62 4e-08 UniRef50_Q1Q2F5 Putative uncharacterized protein n=1 Tax=Candida... 62 5e-08 UniRef50_A1AQS2 Protoporphyrin IX magnesium-chelatase n=1 Tax=Pe... 62 5e-08 UniRef50_D2VP35 von Willebrand factor, type A domain-containing ... 62 5e-08 UniRef50_UPI0000E488A7 PREDICTED: similar to Clca1 protein n=5 T... 62 5e-08 UniRef50_Q2GTB7 Putative uncharacterized protein n=1 Tax=Chaetom... 62 5e-08 UniRef50_UPI0001757D5D PREDICTED: similar to AGAP009579-PA n=1 T... 62 5e-08 UniRef50_B3RZT6 Putative uncharacterized protein n=2 Tax=Trichop... 62 5e-08 UniRef50_Q6A9M2 Magnesium-chelatase subunit n=3 Tax=Propionibact... 62 6e-08 UniRef50_Q2FNC6 von Willebrand factor, type A n=1 Tax=Methanospi... 62 6e-08 UniRef50_Q56BS9 Putative uncharacterized protein n=1 Tax=Enterob... 62 6e-08 UniRef50_C7DHI4 von Willebrand factor type A n=1 Tax=Candidatus ... 61 6e-08 UniRef50_B3RP11 Putative uncharacterized protein n=1 Tax=Trichop... 61 6e-08 UniRef50_UPI00017B4DF5 UPI00017B4DF5 related cluster n=3 Tax=Tet... 61 6e-08 UniRef50_Q25545 Putative uncharacterized protein (Fragment) n=1 ... 61 7e-08 UniRef50_A6NF34 Anthrax toxin receptor-like n=8 Tax=Catarrhini R... 61 7e-08 UniRef50_C8NJ92 Secreted Mg-chelatase subunit n=3 Tax=Corynebact... 61 7e-08 UniRef50_A9GV55 Putative secreted protein n=1 Tax=Sorangium cell... 61 7e-08 UniRef50_D1VKI5 von Willebrand factor type A n=1 Tax=Frankia sp.... 61 7e-08 UniRef50_C0QXK8 von Willebrand factor type A (VWA) domain contai... 61 7e-08 UniRef50_UPI0001A2C533 UPI0001A2C533 related cluster n=1 Tax=Dan... 61 8e-08 UniRef50_UPI0001A2C532 UPI0001A2C532 related cluster n=2 Tax=Clu... 61 8e-08 UniRef50_UPI00016C400A von Willebrand factor, type A n=1 Tax=Gem... 61 8e-08 UniRef50_UPI00006CCBAF hypothetical protein TTHERM_00437740 n=1 ... 61 8e-08 UniRef50_Q5LDB9 Putative uncharacterized protein n=11 Tax=Bacter... 61 8e-08 UniRef50_B9L896 von Willebrand factor, type A n=1 Tax=Nautilia p... 61 8e-08 UniRef50_B0A9L7 Putative uncharacterized protein n=2 Tax=Clostri... 61 9e-08 UniRef50_Q1ZTY1 Putative uncharacterized protein n=1 Tax=Photoba... 61 9e-08 UniRef50_C7R6G5 von Willebrand factor type A n=1 Tax=Kangiella k... 61 9e-08 UniRef50_A1VWQ4 von Willebrand factor, type A n=20 Tax=Proteobac... 61 9e-08 UniRef50_A1ZDA1 OmpA family protein n=1 Tax=Microscilla marina A... 61 9e-08 UniRef50_C0EZA4 Putative uncharacterized protein n=1 Tax=Eubacte... 61 9e-08 UniRef50_Q10ZP7 von Willebrand factor, type A n=1 Tax=Trichodesm... 61 1e-07 UniRef50_C0CQI8 Putative uncharacterized protein n=1 Tax=Blautia... 61 1e-07 UniRef50_UPI00005A0386 PREDICTED: similar to loss of heterozygos... 61 1e-07 UniRef50_D1A9V6 Cobaltochelatase subunit n=2 Tax=Actinomycetales... 61 1e-07 UniRef50_D2H285 Putative uncharacterized protein (Fragment) n=1 ... 61 1e-07 UniRef50_Q4TBC0 Chromosome undetermined SCAF7164, whole genome s... 61 1e-07 UniRef50_Q0AW90 Conserved putative chloride channel n=1 Tax=Synt... 61 1e-07 UniRef50_D1C680 ATPase associated with various cellular activiti... 61 1e-07 UniRef50_B5HZU2 VWA domain-containing protein n=1 Tax=Streptomyc... 61 1e-07 UniRef50_A1S119 von Willebrand factor, type A n=1 Tax=Thermofilu... 61 1e-07 UniRef50_O00534 von Willebrand factor A domain-containing protei... 61 1e-07 UniRef50_Q28XX9 GA11538 n=5 Tax=Drosophila RepID=Q28XX9_DROPS 61 1e-07 UniRef50_UPI00006CC819 von Willebrand factor type A domain conta... 60 1e-07 UniRef50_B7Q0X7 Calcium activated chlorine channel, putative (Fr... 60 1e-07 UniRef50_Q5SKK6 Putative uncharacterized protein TTHA0637 n=4 Ta... 60 1e-07 UniRef50_Q233P7 von Willebrand factor type A domain containing p... 60 1e-07 UniRef50_B9Z3T2 Cobaltochelatase subunit n=1 Tax=Lutiella nitrof... 60 1e-07 UniRef50_C0D9H7 Putative uncharacterized protein n=1 Tax=Clostri... 60 1e-07 UniRef50_D1VLF3 von Willebrand factor type A n=1 Tax=Frankia sp.... 60 1e-07 UniRef50_UPI000180B5AF PREDICTED: similar to Clca1 protein n=1 T... 60 1e-07 UniRef50_A4YDL0 von Willebrand factor, type A n=1 Tax=Metallosph... 60 2e-07 UniRef50_D2LNF7 von Willebrand factor type A n=3 Tax=Aciduliprof... 60 2e-07 UniRef50_Q73PP7 Magnesium chelatase, subunit D/I family n=1 Tax=... 60 2e-07 UniRef50_A4BKH3 Putative uncharacterized protein n=1 Tax=Reineke... 60 2e-07 UniRef50_C3NI41 von Willebrand factor type A n=9 Tax=Sulfolobus ... 60 2e-07 UniRef50_UPI000180CF38 PREDICTED: similar to calcium activated c... 60 2e-07 UniRef50_UPI000180B52F PREDICTED: similar to predicted protein n... 60 2e-07 UniRef50_A6C7T1 Putative uncharacterized protein n=1 Tax=Plancto... 60 2e-07 UniRef50_D2Q363 von Willebrand factor type A n=1 Tax=Kribbella f... 60 2e-07 UniRef50_Q6IND5 MGC83495 protein n=9 Tax=cellular organisms RepI... 60 2e-07 UniRef50_C1RL29 von Willebrand factor type A-like protein n=1 Ta... 60 2e-07 UniRef50_D1XJQ4 von Willebrand factor type A n=2 Tax=Streptomyce... 60 2e-07 UniRef50_C5YBL3 Putative uncharacterized protein Sb06g000656 n=1... 60 2e-07 UniRef50_UPI0001B52F00 von Willebrand factor type A n=1 Tax=Fuso... 59 2e-07 UniRef50_B0VJ58 BatA protein n=1 Tax=Candidatus Cloacamonas acid... 59 2e-07 UniRef50_C3Y9U4 Putative uncharacterized protein n=1 Tax=Branchi... 59 2e-07 UniRef50_D0LJ27 von Willebrand factor type A n=1 Tax=Haliangium ... 59 2e-07 UniRef50_C3NM85 von Willebrand factor type A n=14 Tax=Sulfolobac... 59 2e-07 UniRef50_B3QXN7 Putative uncharacterized protein n=1 Tax=Chloroh... 59 3e-07 UniRef50_A0D8L3 Chromosome undetermined scaffold_41, whole genom... 59 3e-07 UniRef50_C5D5J8 Ig domain protein group 2 domain protein n=2 Tax... 59 3e-07 UniRef50_C7NQ34 von Willebrand factor type A n=1 Tax=Halorhabdus... 59 3e-07 UniRef50_UPI00016E1D58 UPI00016E1D58 related cluster n=1 Tax=Tak... 59 3e-07 UniRef50_P58335-2 Isoform 2 of Anthrax toxin receptor 2 n=3 Tax=... 59 3e-07 UniRef50_B7Q438 Neurogenic locus notch, putative n=1 Tax=Ixodes ... 59 3e-07 UniRef50_A9UIA7 Hedgling (Fragment) n=4 Tax=Nematostella vectens... 59 3e-07 UniRef50_A9AV55 von Willebrand factor type A n=2 Tax=Bacteria Re... 59 3e-07 UniRef50_Q1AYC2 Protoporphyrin IX magnesium-chelatase n=15 Tax=B... 59 3e-07 UniRef50_D1NA79 von Willebrand factor type A n=1 Tax=Victivallis... 59 3e-07 UniRef50_D1TTW6 von Willebrand factor type A domain protein n=10... 59 3e-07 UniRef50_Q1IGT5 Surface adhesion protein n=1 Tax=Pseudomonas ent... 59 3e-07 UniRef50_D1ZZE6 Putative uncharacterized protein GLEAN_08029 n=1... 59 3e-07 UniRef50_A3JLW1 Putative uncharacterized protein n=1 Tax=Rhodoba... 59 3e-07 UniRef50_Q6VUC2 Putative uncharacterized protein n=1 Tax=Antonos... 59 4e-07 UniRef50_D1JHM1 Putitive magnesium-chelatase subunit n=2 Tax=unc... 59 4e-07 UniRef50_Q2JD81 von Willebrand factor, type A n=4 Tax=Frankineae... 59 4e-07 UniRef50_Q30SV4 von Willebrand factor, type A n=2 Tax=Campylobac... 59 4e-07 UniRef50_A0C946 Chromosome undetermined scaffold_16, whole genom... 59 4e-07 UniRef50_B0S9S4 Putative uncharacterized protein n=2 Tax=Leptosp... 59 4e-07 UniRef50_A1ANG2 Protoporphyrin IX magnesium-chelatase n=6 Tax=Ba... 59 4e-07 UniRef50_Q7Z3S7 Voltage-dependent calcium channel subunit delta-... 59 4e-07 UniRef50_C3Y4Z7 Putative uncharacterized protein n=1 Tax=Branchi... 59 4e-07 UniRef50_UPI00006CC94A von Willebrand factor type A domain conta... 59 4e-07 UniRef50_UPI000179F51C Novel protein. n=1 Tax=Bos taurus RepID=U... 59 4e-07 UniRef50_A0M6V8 Membrane protein containing von Willebrand facto... 59 4e-07 UniRef50_A6TP10 von Willebrand factor, type A n=1 Tax=Alkaliphil... 59 4e-07 UniRef50_C4PY90 Dihydropyridine-sensitive l-type calcium channel... 59 4e-07 UniRef50_Q97LT1 DnaK protein (Heat shock protein), C-terminal re... 59 4e-07 UniRef50_C3YRH6 Putative uncharacterized protein n=1 Tax=Branchi... 59 4e-07 UniRef50_C7PW75 Vault protein inter-alpha-trypsin domain protein... 59 4e-07 UniRef50_UPI0001C37059 hypothetical protein RflaF_04637 n=1 Tax=... 59 4e-07 UniRef50_C7PNX3 von Willebrand factor type A n=2 Tax=Sphingobact... 59 5e-07 UniRef50_C1F3F5 von Willebrand factor type A domain protein n=1 ... 59 5e-07 UniRef50_A8K7I4 Calcium-activated chloride channel regulator 1 n... 59 5e-07 UniRef50_Q3KGA0 Putative secreted protein, hemolysin n=1 Tax=Pse... 59 5e-07 UniRef50_UPI00016E1D1D UPI00016E1D1D related cluster n=9 Tax=Tet... 58 5e-07 UniRef50_C8PMB0 BatA protein n=1 Tax=Treponema vincentii ATCC 35... 58 5e-07 UniRef50_A7C0I2 von Willebrand factor type A domain protein n=1 ... 58 5e-07 UniRef50_C4DQN3 Uncharacterized protein containing a von Willebr... 58 5e-07 UniRef50_D2RSW3 von Willebrand factor type A n=1 Tax=Haloterrige... 58 5e-07 UniRef50_UPI0001789223 von Willebrand factor type A n=1 Tax=Geob... 58 5e-07 UniRef50_A2EHL8 Ubiquitin-conjugating enzyme family protein n=3 ... 58 5e-07 UniRef50_UPI00016C38A3 LPXTG-motif cell wall anchor domain prote... 58 5e-07 UniRef50_Q31JK3 Type A von Willebrand factor-like n=1 Tax=Thiomi... 58 6e-07 UniRef50_C6VXL7 von Willebrand factor type A n=1 Tax=Dyadobacter... 58 6e-07 UniRef50_A0Z5Z1 BatB protein, putative n=2 Tax=unclassified Gamm... 58 6e-07 UniRef50_A8L751 Magnesium chelatase n=8 Tax=cellular organisms R... 58 6e-07 UniRef50_C7ZL29 Putative uncharacterized protein n=2 Tax=Nectria... 58 6e-07 UniRef50_A7SIA9 Predicted protein n=2 Tax=Nematostella vectensis... 58 6e-07 UniRef50_A0C282 Chromosome undetermined scaffold_144, whole geno... 58 6e-07 UniRef50_UPI0001B4AD96 von Willebrand factor type A n=1 Tax=Bact... 58 6e-07 UniRef50_UPI0001AF2DA9 hypothetical protein SrosN1_23653 n=1 Tax... 58 7e-07 UniRef50_A6QCY4 von Willebrand factor type A domain protein n=2 ... 58 7e-07 UniRef50_B0XT03 von Willebrand domain protein n=4 Tax=Trichocoma... 58 7e-07 UniRef50_B0EK65 Putative uncharacterized protein n=7 Tax=Entamoe... 58 7e-07 UniRef50_C3ZT39 Putative uncharacterized protein n=1 Tax=Branchi... 58 7e-07 UniRef50_C9LDM7 BatA protein n=10 Tax=Prevotella RepID=C9LDM7_9BACT 58 7e-07 UniRef50_B1KQC1 Outer membrane adhesin like proteiin n=1 Tax=She... 58 7e-07 UniRef50_C3ZZV2 Putative uncharacterized protein n=1 Tax=Branchi... 58 7e-07 UniRef50_C1YN95 von Willebrand factor type A-like protein n=1 Ta... 58 7e-07 UniRef50_Q7XTB9 OSJNBa0068L06.4 protein n=7 Tax=Oryza sativa Rep... 58 7e-07 UniRef50_B3RWX4 Putative uncharacterized protein n=1 Tax=Trichop... 58 8e-07 UniRef50_B5EUF0 von Willebrand factor, type A n=44 Tax=Vibrionac... 58 8e-07 UniRef50_Q5NWS3 Tellurium resistance protein n=2 Tax=Proteobacte... 58 8e-07 UniRef50_C3WEJ2 BatB protein n=1 Tax=Fusobacterium mortiferum AT... 58 8e-07 UniRef50_Q2BCF0 Possible D-amino acid dehydrogenase, large subun... 57 8e-07 UniRef50_B2HK18 Conserved membrane protein n=3 Tax=Mycobacterium... 57 9e-07 UniRef50_B0TL26 von Willebrand factor type A n=7 Tax=Gammaproteo... 57 9e-07 UniRef50_C8SB00 von Willebrand factor type A n=1 Tax=Ferroglobus... 57 9e-07 UniRef50_C6X1I4 BatB n=2 Tax=Flavobacteriaceae RepID=C6X1I4_FLAB3 57 9e-07 UniRef50_UPI000058940A PREDICTED: similar to inter-alpha (globul... 57 1e-06 UniRef50_B3PKG6 von Willebrand factor type A domain protein n=1 ... 57 1e-06 UniRef50_A9Z1V5 von Willebrand factor A domain-containing protei... 57 1e-06 UniRef50_C1F7S6 Putative uncharacterized protein n=1 Tax=Acidoba... 57 1e-06 UniRef50_A0BYA6 Chromosome undetermined scaffold_136, whole geno... 57 1e-06 UniRef50_A4YIH6 Protoporphyrin IX magnesium-chelatase n=1 Tax=Me... 57 1e-06 UniRef50_C3ZIK8 Putative uncharacterized protein n=1 Tax=Branchi... 57 1e-06 UniRef50_B0MMX9 Putative uncharacterized protein n=1 Tax=Eubacte... 57 1e-06 UniRef50_Q7S708 Predicted protein n=1 Tax=Neurospora crassa RepI... 57 1e-06 UniRef50_C3ZCZ5 Putative uncharacterized protein (Fragment) n=1 ... 57 1e-06 UniRef50_A6DT53 BatB protein n=1 Tax=Lentisphaera araneosa HTCC2... 57 1e-06 UniRef50_UPI00006CCCA4 von Willebrand factor type A domain conta... 57 1e-06 UniRef50_A2SQ24 von Willebrand factor, type A n=1 Tax=Methanocor... 57 1e-06 UniRef50_C0Z8R3 Hypothetical membrane protein n=1 Tax=Brevibacil... 57 1e-06 UniRef50_Q897H0 Membrane-associated protein n=1 Tax=Clostridium ... 57 1e-06 UniRef50_C6QKH4 von Willebrand factor type A n=1 Tax=Geobacillus... 57 1e-06 UniRef50_Q7UG35 Putative uncharacterized protein n=1 Tax=Rhodopi... 57 1e-06 UniRef50_A7RFD8 Predicted protein (Fragment) n=1 Tax=Nematostell... 57 1e-06 UniRef50_B1MX57 Predicted metal-dependent peptidase n=3 Tax=Leuc... 57 1e-06 UniRef50_C0BF89 Putative uncharacterized protein n=1 Tax=Coproco... 57 1e-06 UniRef50_Q6LJM7 Putative uncharacterized protein n=1 Tax=Photoba... 57 1e-06 UniRef50_C3XQQ6 Putative uncharacterized protein n=1 Tax=Branchi... 57 1e-06 UniRef50_B5ZN80 von Willebrand factor type A n=8 Tax=Rhizobiales... 57 1e-06 UniRef50_A1HT91 Von Willebrand factor, type A n=1 Tax=Thermosinu... 57 1e-06 UniRef50_Q0W1N2 Putative uncharacterized protein n=1 Tax=uncultu... 57 1e-06 UniRef50_Q5TIE3-4 Isoform 4 of von Willebrand factor A domain-co... 57 1e-06 UniRef50_A8M5H1 von Willebrand factor type A n=13 Tax=Actinomyce... 57 1e-06 UniRef50_Q5TIE3 von Willebrand factor A domain-containing protei... 57 1e-06 UniRef50_B9ZQD1 von Willebrand factor type A n=1 Tax=Thioalkaliv... 57 1e-06 UniRef50_B3RZ89 Putative uncharacterized protein n=1 Tax=Trichop... 57 1e-06 UniRef50_Q0W729 Putative uncharacterized protein n=1 Tax=uncultu... 57 1e-06 UniRef50_Q14CN2 Calcium-activated chloride channel regulator 4, ... 57 1e-06 UniRef50_A4ACS0 Magnesium-chelatase, 60 kDa subunit n=2 Tax=uncl... 57 2e-06 UniRef50_UPI0001C34E55 hypothetical protein ClM62_13922 n=1 Tax=... 57 2e-06 UniRef50_O76836 Putative uncharacterized protein n=4 Tax=Caenorh... 57 2e-06 UniRef50_Q2FMX9 Protoporphyrin IX magnesium-chelatase n=2 Tax=Me... 57 2e-06 UniRef50_B2UUD5 Phage/colicin/tellurite resistance cluster terY ... 57 2e-06 UniRef50_C8XJ05 Magnesium chelatase n=20 Tax=cellular organisms ... 57 2e-06 UniRef50_B0SI02 BatA n=2 Tax=Leptospira biflexa serovar Patoc Re... 57 2e-06 UniRef50_B9SSC8 Protein binding protein, putative n=1 Tax=Ricinu... 57 2e-06 UniRef50_D1NW11 Putative von Willebrand factor type A domain pro... 57 2e-06 UniRef50_C1EAC0 Predicted protein n=1 Tax=Micromonas sp. RCC299 ... 57 2e-06 UniRef50_Q22NG1 von Willebrand factor type A domain containing p... 57 2e-06 UniRef50_C5E9N8 von Willebrand factor type A n=4 Tax=Bifidobacte... 56 2e-06 UniRef50_C6M593 Putative uncharacterized protein n=1 Tax=Neisser... 56 2e-06 UniRef50_A9B2Y1 VWA containing CoxE family protein n=4 Tax=Bacte... 56 2e-06 UniRef50_B3PJ55 von Willebrand factor type A domain protein n=1 ... 56 2e-06 UniRef50_C3XUV0 Putative uncharacterized protein n=1 Tax=Branchi... 56 2e-06 UniRef50_Q4RTR8 Chromosome 2 SCAF14997, whole genome shotgun seq... 56 2e-06 UniRef50_A9KHQ1 von Willebrand factor type A n=1 Tax=Clostridium... 56 2e-06 UniRef50_A9DAG7 Putative uncharacterized protein n=1 Tax=Hoeflea... 56 2e-06 UniRef50_Q0AZS1 Mg-chelatase subunit ChlD-like protein n=1 Tax=S... 56 2e-06 UniRef50_B0WHU4 Sushi n=3 Tax=Culicini RepID=B0WHU4_CULQU 56 2e-06 UniRef50_Q1Q250 Putative uncharacterized protein n=1 Tax=Candida... 56 2e-06 UniRef50_C1MHV2 Predicted protein n=1 Tax=Micromonas pusilla CCM... 56 2e-06 UniRef50_C1D350 Putative magnesium chelatase, chlD subunit n=1 T... 56 2e-06 UniRef50_Q8BVM2 Anthrax toxin receptor-like n=2 Tax=Mus musculus... 56 2e-06 UniRef50_D2H1L1 Putative uncharacterized protein (Fragment) n=4 ... 56 2e-06 UniRef50_UPI0001C15B95 hypothetical protein CRC_00003 n=1 Tax=Cy... 56 2e-06 UniRef50_B6XSC0 Putative uncharacterized protein n=3 Tax=Bifidob... 56 2e-06 UniRef50_Q22UC0 Putative uncharacterized protein n=1 Tax=Tetrahy... 56 2e-06 UniRef50_C6MF78 von Willebrand factor type A n=1 Tax=Nitrosomona... 56 2e-06 UniRef50_UPI00015B5332 PREDICTED: similar to ENSANGP00000020925 ... 56 2e-06 UniRef50_C7PJ56 von Willebrand factor type A n=1 Tax=Chitinophag... 56 2e-06 UniRef50_B9LQX7 von Willebrand factor type A n=1 Tax=Halorubrum ... 56 3e-06 UniRef50_A0Z4J3 Von Willebrand factor, type A n=2 Tax=Bacteria R... 56 3e-06 UniRef50_C4ZKE8 von Willebrand factor type A n=2 Tax=Thauera sp.... 56 3e-06 UniRef50_Q895E2 Conserved protein, putative metal-dependent pept... 56 3e-06 UniRef50_A7SFM5 Predicted protein n=1 Tax=Nematostella vectensis... 56 3e-06 UniRef50_C5VFZ9 von Willebrand factor type A n=2 Tax=Corynebacte... 56 3e-06 UniRef50_A6M139 von Willebrand factor, type A n=1 Tax=Clostridiu... 56 3e-06 UniRef50_UPI0000F1FEC5 PREDICTED: similar to Clca1 protein n=2 T... 56 3e-06 UniRef50_D0LKC7 von Willebrand factor type A n=1 Tax=Haliangium ... 56 3e-06 UniRef50_Q66HV5 Zgc:92481 n=2 Tax=Danio rerio RepID=Q66HV5_DANRE 56 3e-06 UniRef50_D0MEC0 von Willebrand factor type A n=1 Tax=Rhodothermu... 56 3e-06 UniRef50_A5FBS6 Uncharacterized protein with a von Willebrand fa... 56 3e-06 UniRef50_A7K1D3 Protein BatA n=19 Tax=Vibrionales RepID=A7K1D3_V... 56 3e-06 UniRef50_A1ZZI4 Von Willebrand factor type A domain protein (Fra... 56 3e-06 UniRef50_A0ED74 Chromosome undetermined scaffold_9, whole genome... 56 3e-06 UniRef50_B3RUM1 Putative uncharacterized protein n=1 Tax=Trichop... 56 3e-06 UniRef50_Q5VMI5 Putative uncharacterized protein OSJNBa0085L11.1... 56 3e-06 UniRef50_B3QY78 von Willebrand factor type A n=1 Tax=Chloroherpe... 56 3e-06 UniRef50_B3QUN4 von Willebrand factor type A n=4 Tax=Bacteria Re... 56 3e-06 UniRef50_A0CBH4 Chromosome undetermined scaffold_164, whole geno... 56 3e-06 UniRef50_B1G2X7 Putative uncharacterized protein n=1 Tax=Burkhol... 56 3e-06 UniRef50_D1N5F6 von Willebrand factor type A n=1 Tax=Victivallis... 56 3e-06 UniRef50_B8G7S2 Magnesium chelatase n=9 Tax=cellular organisms R... 56 4e-06 UniRef50_B9KV79 Putative uncharacterized protein n=1 Tax=Rhodoba... 56 4e-06 UniRef50_B9PUS9 Microneme protein, putative n=5 Tax=Sarcocystida... 56 4e-06 UniRef50_D2R6S0 Putative uncharacterized protein n=1 Tax=Pirellu... 56 4e-06 UniRef50_B0NBU8 Putative uncharacterized protein n=1 Tax=Clostri... 56 4e-06 UniRef50_A8LLA0 von Willebrand factor type A domain protein n=7 ... 56 4e-06 UniRef50_C8P229 Putative uncharacterized protein n=1 Tax=Erysipe... 56 4e-06 UniRef50_UPI0001AED79F von Willebrand factor type A n=1 Tax=Stre... 56 4e-06 UniRef50_C3ZZV3 Putative uncharacterized protein (Fragment) n=1 ... 56 4e-06 UniRef50_A8M2F6 von Willebrand factor type A n=3 Tax=Micromonosp... 56 4e-06 UniRef50_Q04NS4 BatA n=4 Tax=Leptospira RepID=Q04NS4_LEPBJ 55 4e-06 UniRef50_D0KDG5 VWA containing CoxE family protein n=9 Tax=Gamma... 55 4e-06 UniRef50_B3SBE5 Putative uncharacterized protein n=2 Tax=Trichop... 55 4e-06 UniRef50_Q2J8W6 von Willebrand factor, type A n=2 Tax=Actinomyce... 55 4e-06 UniRef50_UPI00017F3212 von Willebrand factor, type A n=1 Tax=Esc... 55 4e-06 UniRef50_D0LUP3 Uncharacterized protein containing a von Willebr... 55 4e-06 UniRef50_Q01T75 von Willebrand factor, type A n=2 Tax=Candidatus... 55 5e-06 UniRef50_A9UWH5 Predicted protein n=2 Tax=Monosiga brevicollis R... 55 5e-06 UniRef50_O05809 Uncharacterized protein Rv2850c/MT2916 n=51 Tax=... 55 5e-06 UniRef50_B9XQJ6 von Willebrand factor type A n=1 Tax=bacterium E... 55 5e-06 UniRef50_A6DKL3 Putative uncharacterized protein n=1 Tax=Lentisp... 55 5e-06 UniRef50_UPI0000E472E1 PREDICTED: similar to parturition-related... 55 5e-06 UniRef50_C7DFN0 Magnesium chelatase ATPase subunit D n=1 Tax=Tha... 55 5e-06 UniRef50_Q99KC8 von Willebrand factor A domain-containing protei... 55 5e-06 UniRef50_Q23AF2 von Willebrand factor type A domain containing p... 55 5e-06 UniRef50_Q73UD3 UPF0353 protein MAP_3435c n=4 Tax=Mycobacterium ... 55 5e-06 UniRef50_A6G0N1 Protein containing a von Willebrand factor type ... 55 5e-06 UniRef50_A9B607 von Willebrand factor type A n=6 Tax=Chloroflexi... 55 5e-06 UniRef50_C4DDW8 von Willebrand factor type A-like protein n=1 Ta... 55 5e-06 UniRef50_C6JPK6 BatA protein n=2 Tax=Fusobacterium RepID=C6JPK6_... 55 6e-06 UniRef50_Q74B80 Putative uncharacterized protein n=1 Tax=Geobact... 55 6e-06 UniRef50_A6CAQ4 Putative uncharacterized protein n=1 Tax=Plancto... 55 6e-06 UniRef50_Q9YD81 Putative uncharacterized protein n=1 Tax=Aeropyr... 55 6e-06 UniRef50_A4QDZ6 Putative uncharacterized protein n=1 Tax=Coryneb... 55 6e-06 UniRef50_Q11RQ7 BatA-like protein, aerotolerance-related n=1 Tax... 55 6e-06 UniRef50_A9BLP5 von Willebrand factor type A n=3 Tax=Burkholderi... 55 6e-06 UniRef50_A9B6J8 von Willebrand factor type A n=1 Tax=Herpetosiph... 55 6e-06 UniRef50_Q7K0H4 Straightjacket n=11 Tax=Coelomata RepID=Q7K0H4_D... 55 6e-06 UniRef50_Q055Y9 Putative uncharacterized protein n=4 Tax=Leptosp... 55 7e-06 UniRef50_D2R3Y3 von Willebrand factor type A n=1 Tax=Pirellula s... 55 7e-06 UniRef50_B6H7Y2 Pc16g08660 protein n=1 Tax=Penicillium chrysogen... 55 7e-06 UniRef50_Q2FLV5 Protoporphyrin IX magnesium-chelatase n=1 Tax=Me... 55 7e-06 UniRef50_UPI000180B353 PREDICTED: similar to putative calcium ac... 55 7e-06 UniRef50_Q4J9H5 Conserved protein n=2 Tax=Sulfolobus RepID=Q4J9H... 55 7e-06 UniRef50_B0G4W3 Putative uncharacterized protein n=1 Tax=Dorea f... 55 7e-06 UniRef50_C3YUL3 Putative uncharacterized protein n=1 Tax=Branchi... 55 7e-06 UniRef50_A0BJE7 Chromosome undetermined scaffold_11, whole genom... 55 7e-06 UniRef50_A9CX50 RTX toxin, putative n=1 Tax=Shewanella benthica ... 55 7e-06 UniRef50_Q17A73 Dihydropyridine-sensitive l-type calcium channel... 55 7e-06 UniRef50_Q1N498 Putative uncharacterized protein n=1 Tax=Bermane... 54 7e-06 UniRef50_C6XGA4 Putative uncharacterized protein n=2 Tax=Candida... 54 7e-06 UniRef50_C9KWG4 Putative uncharacterized protein n=1 Tax=Bactero... 54 7e-06 UniRef50_Q4RX89 Chromosome 11 SCAF14979, whole genome shotgun se... 54 7e-06 UniRef50_B3RWW7 Putative uncharacterized protein n=1 Tax=Trichop... 54 7e-06 UniRef50_A5FB87 von Willebrand factor, type A n=1 Tax=Flavobacte... 54 8e-06 UniRef50_A9A4V8 von Willebrand factor type A n=2 Tax=Thaumarchae... 54 8e-06 UniRef50_O00339 Matrilin-2 n=30 Tax=Euteleostomi RepID=MATN2_HUMAN 54 8e-06 UniRef50_A1SAA4 Uncharacterized protein containing a von Willebr... 54 8e-06 UniRef50_D2S6Y8 von Willebrand factor type A n=1 Tax=Geodermatop... 54 8e-06 UniRef50_UPI0000EB12CB UPI0000EB12CB related cluster n=1 Tax=Can... 54 8e-06 UniRef50_Q5NWS4 Tellurium resistance protein n=8 Tax=Bacteria Re... 54 8e-06 UniRef50_D0IVS5 Putative uncharacterized protein n=3 Tax=Bacteri... 54 8e-06 UniRef50_C8PVC3 von Willebrand factor type A domain protein n=1 ... 54 8e-06 UniRef50_C3XQR8 Putative uncharacterized protein n=1 Tax=Branchi... 54 8e-06 UniRef50_A7RPC2 Predicted protein (Fragment) n=2 Tax=Eumetazoa R... 54 8e-06 UniRef50_C1V7M6 von Willebrand factor type A-like protein n=1 Ta... 54 9e-06 UniRef50_Q58221 Uncharacterized protein MJ0811 n=4 Tax=Methanoca... 54 9e-06 UniRef50_A7NJ01 von Willebrand factor type A n=2 Tax=Roseiflexus... 54 9e-06 UniRef50_A2BM85 Conserved archaeal protein n=1 Tax=Hyperthermus ... 54 9e-06 UniRef50_C4IH33 von Willebrand factor type A domain protein n=1 ... 54 9e-06 UniRef50_A8EWS0 Putative uncharacterized protein n=1 Tax=Arcobac... 54 9e-06 UniRef50_A3I2X5 Putative batB protein n=1 Tax=Algoriphagus sp. P... 54 9e-06 UniRef50_A3HZP7 BatA protein n=1 Tax=Algoriphagus sp. PR1 RepID=... 54 9e-06 UniRef50_C3JDN5 von Willebrand factor, type A n=2 Tax=Rhodococcu... 54 1e-05 UniRef50_D2W671 Predicted protein n=1 Tax=Naegleria gruberi RepI... 54 1e-05 UniRef50_A0KS56 Putative outer membrane adhesin like proteiin n=... 54 1e-05 UniRef50_A7RV93 Predicted protein n=2 Tax=Nematostella vectensis... 54 1e-05 UniRef50_B2SKX9 von Willebrand factor type A domain protein n=13... 54 1e-05 UniRef50_B4D6B0 von Willebrand factor type A n=1 Tax=Chthoniobac... 54 1e-05 UniRef50_D0LKC8 von Willebrand factor type A n=1 Tax=Haliangium ... 54 1e-05 UniRef50_C3Y507 Putative uncharacterized protein n=1 Tax=Branchi... 54 1e-05 UniRef50_A9UV27 Predicted protein n=2 Tax=Eukaryota RepID=A9UV27... 54 1e-05 UniRef50_D2QTD7 von Willebrand factor type A n=1 Tax=Spirosoma l... 54 1e-05 UniRef50_UPI0000E48E5C PREDICTED: similar to calcium-activated c... 54 1e-05 UniRef50_Q1RJP8 Putative uncharacterized protein n=9 Tax=Rickett... 54 1e-05 UniRef50_D2A5S3 Putative uncharacterized protein GLEAN_15119 n=3... 54 1e-05 UniRef50_B5JN05 von Willebrand factor type A domain protein n=1 ... 54 1e-05 UniRef50_B7KMY0 Putative uncharacterized protein n=1 Tax=Cyanoth... 54 1e-05 UniRef50_A1S120 von Willebrand factor, type A n=1 Tax=Thermofilu... 54 1e-05 UniRef50_UPI0000E49DB4 PREDICTED: similar to poly (ADP-ribose) p... 54 1e-05 UniRef50_A3J9J6 Putative uncharacterized protein n=3 Tax=Bacteri... 54 1e-05 UniRef50_A0B5M2 von Willebrand factor, type A n=1 Tax=Methanosae... 54 1e-05 UniRef50_B2K3B2 von Willebrand factor type A n=39 Tax=Gammaprote... 54 1e-05 UniRef50_B2KDS9 von Willebrand factor type A n=1 Tax=Elusimicrob... 54 1e-05 UniRef50_C9QHX4 Putative outer membrane adhesin like proteiin n=... 54 1e-05 UniRef50_A3K0S6 Putative uncharacterized protein n=1 Tax=Sagittu... 54 1e-05 UniRef50_C9RK46 von Willebrand factor type A n=1 Tax=Fibrobacter... 54 1e-05 UniRef50_Q54T94 Putative uncharacterized protein n=1 Tax=Dictyos... 54 1e-05 UniRef50_C1V972 Protoporphyrin IX magnesium-chelatase n=1 Tax=Ha... 54 1e-05 UniRef50_Q2B6K0 Putative uncharacterized protein n=1 Tax=Bacillu... 54 1e-05 UniRef50_A6NMZ7 Collagen alpha-6(VI) chain n=2 Tax=Theria RepID=... 54 1e-05 UniRef50_D2RQJ2 von Willebrand factor type A n=1 Tax=Haloterrige... 54 1e-05 UniRef50_A9WKF3 von Willebrand factor type A n=3 Tax=Chloroflexu... 54 1e-05 UniRef50_A9GY82 Putative uncharacterized protein n=1 Tax=Sorangi... 54 1e-05 UniRef50_C9L3E2 BatB protein n=10 Tax=Bacteroidales RepID=C9L3E2... 54 1e-05 UniRef50_C3YP68 Putative uncharacterized protein n=1 Tax=Branchi... 54 1e-05 UniRef50_A1VSB7 von Willebrand factor, type A n=10 Tax=Betaprote... 54 2e-05 UniRef50_C5GK44 U-box domain-containing protein n=2 Tax=Ajellomy... 54 2e-05 UniRef50_B6HQM8 Pc22g11730 protein n=17 Tax=Leotiomyceta RepID=B... 54 2e-05 UniRef50_C3YC17 Putative uncharacterized protein n=1 Tax=Branchi... 54 2e-05 UniRef50_Q5BKJ5 Clca1 protein (Fragment) n=8 Tax=Xenopus (Silura... 53 2e-05 UniRef50_A5UYK7 von Willebrand factor, type A n=2 Tax=Roseiflexu... 53 2e-05 UniRef50_C3XEK2 Putative uncharacterized protein n=1 Tax=Helicob... 53 2e-05 UniRef50_A8J3X6 Flagellar associated protein (Fragment) n=1 Tax=... 53 2e-05 UniRef50_B1JFF2 Na-Ca exchanger/integrin-beta4 n=1 Tax=Pseudomon... 53 2e-05 UniRef50_D0LBG9 ATPase associated with various cellular activiti... 53 2e-05 UniRef50_A6YIP9 Capillary morphogenesis protein 2B n=14 Tax=Eute... 53 2e-05 UniRef50_D0LR75 Putative uncharacterized protein n=1 Tax=Haliang... 53 2e-05 UniRef50_C5BKN1 Matrixin family protein n=1 Tax=Teredinibacter t... 53 2e-05 UniRef50_D2LB55 von Willebrand factor type A n=1 Tax=Rhodomicrob... 53 2e-05 UniRef50_C1F3L6 von Willebrand factor type A domain protein n=1 ... 53 2e-05 UniRef50_A8H5T6 von Willebrand factor type A n=5 Tax=Proteobacte... 53 2e-05 UniRef50_A7RFK1 Predicted protein (Fragment) n=2 Tax=Nematostell... 53 2e-05 UniRef50_Q5LCG5 Aerotolerance-related membrane protein n=25 Tax=... 53 2e-05 UniRef50_C5BKZ8 von Willebrand factor type A domain protein n=5 ... 53 2e-05 UniRef50_C0ZKA7 Putative uncharacterized protein n=1 Tax=Breviba... 53 2e-05 UniRef50_B0C1G4 Putative uncharacterized protein n=1 Tax=Acaryoc... 53 2e-05 UniRef50_UPI0001925847 PREDICTED: similar to polydom n=1 Tax=Hyd... 53 2e-05 UniRef50_C0GKG1 von Willebrand factor type A n=1 Tax=Dethiobacte... 53 2e-05 UniRef50_Q3APD8 von Willebrand factor, type A n=1 Tax=Chlorobium... 53 2e-05 UniRef50_Q021L5 von Willebrand factor, type A n=1 Tax=Candidatus... 53 2e-05 UniRef50_C6XTR0 von Willebrand factor type A n=1 Tax=Pedobacter ... 53 2e-05 UniRef50_C9PFU4 Putative hemolysin n=1 Tax=Vibrio furnissii CIP ... 53 2e-05 UniRef50_UPI000180D155 PREDICTED: similar to integrin alpha Hr1 ... 53 2e-05 UniRef50_UPI00006CD1DE von Willebrand factor type A domain conta... 53 2e-05 UniRef50_Q2SNJ1 Uncharacterized protein encoded in toxicity prot... 53 2e-05 UniRef50_A6L4M8 Putative uncharacterized protein n=9 Tax=Bactero... 53 2e-05 UniRef50_C1XZC4 Uncharacterized conserved protein n=2 Tax=Meioth... 53 2e-05 UniRef50_C7N671 Mg-chelatase subunit ChlD n=1 Tax=Slackia heliot... 53 2e-05 UniRef50_A2DPQ9 von Willebrand factor type A domain containing p... 53 2e-05 UniRef50_UPI000186D9CC conserved hypothetical protein n=1 Tax=Pe... 53 2e-05 UniRef50_UPI0000E47896 PREDICTED: similar to cache domain contai... 53 2e-05 UniRef50_UPI000023F6A9 hypothetical protein FG10431.1 n=2 Tax=Gi... 53 2e-05 UniRef50_A9YWQ9 Zinc finger protein n=1 Tax=Medicago truncatula ... 53 2e-05 UniRef50_UPI000050F951 von Willebrand factor type A n=1 Tax=Brev... 53 2e-05 UniRef50_A6QCW6 von Willebrand factor type A domain protein n=1 ... 53 2e-05 UniRef50_C3JM29 Magnesium-chelatase subunit ChlD n=3 Tax=Actinom... 53 2e-05 UniRef50_B8FBV5 Putative uncharacterized protein n=1 Tax=Desulfa... 53 3e-05 UniRef50_A0BI06 Chromosome undetermined scaffold_109, whole geno... 53 3e-05 UniRef50_A3ZTC3 Putative uncharacterized protein n=1 Tax=Blastop... 53 3e-05 UniRef50_C2HF13 von Willebrand factor type A n=1 Tax=Finegoldia ... 52 3e-05 UniRef50_B7PIU8 Calcium activated chlorine channel, putative n=2... 52 3e-05 UniRef50_Q9WXB0 Magnesium chelatase n=1 Tax=Acidiphilium rubrum ... 52 3e-05 UniRef50_C8VGR0 von Willebrand domain protein (AFU_orthologue; A... 52 3e-05 UniRef50_C1E936 Predicted protein n=3 Tax=Micromonas RepID=C1E93... 52 3e-05 UniRef50_A9GNX2 Putative membrane protein n=1 Tax=Sorangium cell... 52 3e-05 UniRef50_O26655 Magnesium chelatase subunit n=1 Tax=Methanotherm... 52 3e-05 UniRef50_C2GKW9 Putative uncharacterized protein n=1 Tax=Coryneb... 52 3e-05 UniRef50_B3T1G6 Putative von Willebrand factor type A domain pro... 52 3e-05 UniRef50_C0F0K9 Putative uncharacterized protein n=2 Tax=Eubacte... 52 3e-05 UniRef50_UPI0001976EAB hypothetical protein BbifN4_00972 n=1 Tax... 52 3e-05 UniRef50_B9L3Z2 von Willebrand factor type A n=2 Tax=Bacteria Re... 52 3e-05 UniRef50_B3RLQ7 Putative uncharacterized protein n=1 Tax=Trichop... 52 3e-05 UniRef50_Q21JX5 von Willebrand factor, type A n=8 Tax=Gammaprote... 52 3e-05 UniRef50_A4QP95 LOC563828 protein (Fragment) n=4 Tax=Cyprinidae ... 52 3e-05 UniRef50_Q22X70 von Willebrand factor type A domain containing p... 52 3e-05 UniRef50_Q22ML1 von Willebrand factor type A domain containing p... 52 3e-05 UniRef50_Q608G4 Putative MxaC protein n=1 Tax=Methylococcus caps... 52 3e-05 UniRef50_Q9UKK3 Poly [ADP-ribose] polymerase 4 n=14 Tax=Eutheria... 52 3e-05 UniRef50_C0ZE04 Putative uncharacterized protein n=1 Tax=Breviba... 52 3e-05 UniRef50_B9TFA6 Putative uncharacterized protein n=1 Tax=Ricinus... 52 3e-05 UniRef50_B0ACH3 Putative uncharacterized protein n=1 Tax=Clostri... 52 3e-05 UniRef50_A9V9D8 Predicted protein n=1 Tax=Monosiga brevicollis R... 52 3e-05 UniRef50_A7NQL3 von Willebrand factor type A n=2 Tax=Roseiflexus... 52 3e-05 UniRef50_C0Z8R6 Putative uncharacterized protein n=1 Tax=Breviba... 52 3e-05 UniRef50_C1ZLB6 Putative uncharacterized protein n=1 Tax=Plancto... 52 3e-05 UniRef50_P0A5D7 Uncharacterized protein Rv0959/MT0986 n=36 Tax=A... 52 3e-05 UniRef50_UPI0001760236 PREDICTED: similar to mCG140660 n=1 Tax=D... 52 4e-05 UniRef50_A8ULL3 Putative uncharacterized protein n=1 Tax=Flavoba... 52 4e-05 UniRef50_C3XJE4 Putative uncharacterized protein (Fragment) n=1 ... 52 4e-05 UniRef50_B0R5W4 Magnesium chelatase (Protoporphyrin IX magnesium... 52 4e-05 UniRef50_B7Q412 Putative uncharacterized protein n=1 Tax=Ixodes ... 52 4e-05 UniRef50_UPI0001AEDBB6 hypothetical protein SalbJ_01235 n=1 Tax=... 52 4e-05 UniRef50_C6PVI2 Vault protein inter-alpha-trypsin domain protein... 52 4e-05 UniRef50_Q1V424 Putative RTX toxin (Fragment) n=2 Tax=Vibrio alg... 52 4e-05 UniRef50_A0DJ91 Chromosome undetermined scaffold_52, whole genom... 52 4e-05 UniRef50_D1RCP1 von Willebrand factor type A domain protein n=6 ... 52 4e-05 UniRef50_Q4V1D8 Putative uncharacterized protein dadA n=8 Tax=Ba... 52 4e-05 UniRef50_C6WL71 von Willebrand factor type A n=1 Tax=Actinosynne... 52 4e-05 UniRef50_C4N894 Complement factor B-like protein n=1 Tax=Venerup... 52 4e-05 UniRef50_B9X084 Complement factor B n=5 Tax=Eumetazoa RepID=B9X0... 52 4e-05 UniRef50_A5UXM2 von Willebrand factor, type A n=1 Tax=Roseiflexu... 52 4e-05 UniRef50_UPI000180C3F0 PREDICTED: similar to Vwa1 protein n=1 Ta... 52 4e-05 UniRef50_UPI0000ECD6E7 Poly [ADP-ribose] polymerase 4 (EC 2.4.2.... 52 4e-05 UniRef50_Q7JMF9 Protein T24F1.6b, partially confirmed by transcr... 52 4e-05 UniRef50_UPI0000E80A5E PREDICTED: similar to calcium-activated c... 52 4e-05 UniRef50_Q2SCZ7 Uncharacterized protein containing a von Willebr... 52 4e-05 >UniRef50_Q1C7U5 UPF0229 protein YPA_1510 n=195 Tax=Proteobacteria RepID=Y1510_YERPA Length = 424 Score = 413 bits (1062), Expect = e-114, Method: Composition-based stats. Identities = 359/421 (85%), Positives = 390/421 (92%), Gaps = 1/421 (0%) Query: 1 MTWFIDRRLNGKNKSMVNRQRFLRRYKAQIKQSISEAINKRSVTDVDSGESVSIPTEDIS 60 M +FIDRRLNGKNKSMVNRQRFLRRYK+QIKQSI++AINKRSVTD++SGESVSIP +DI+ Sbjct: 1 MGYFIDRRLNGKNKSMVNRQRFLRRYKSQIKQSIADAINKRSVTDIESGESVSIPIDDIN 60 Query: 61 EPMFHQGRGGLRHRVHPGNDHFVQNDRIERPQGGGGGSGSGQGQASQDGEGQDEFVFQIS 120 EPMFHQG GGLRHRVHPGNDHF+ NDR++RPQGGGGG +DGEG+DEFVFQIS Sbjct: 61 EPMFHQGNGGLRHRVHPGNDHFITNDRVDRPQGGGGGGSGQGNA-GKDGEGEDEFVFQIS 119 Query: 121 KDEYLDLLFEDLALPNLKQNQQRQLTEYKTHRAGYTANGVPANISVVRSLQNSLARRTAM 180 KDEYLDLLFEDLALPNLK+NQ +QL E+KTHRAGYT+NGVPANISVVRSLQNSLARRTAM Sbjct: 120 KDEYLDLLFEDLALPNLKRNQYKQLAEFKTHRAGYTSNGVPANISVVRSLQNSLARRTAM 179 Query: 181 TAGKRRELHALEENLAIISNSEPAQLLEEERLRKEIAELRAKIERVPFIDTFDLRYKNYE 240 TA KRREL LE L ++ NSEPAQLLEEERLRK I EL+ KI RVPFIDTFDLRYKNYE Sbjct: 180 TASKRRELRELEAALTVLENSEPAQLLEEERLRKAITELKQKIARVPFIDTFDLRYKNYE 239 Query: 241 KRPDPSSQAVMFCLMDVSGSMDQSTKDMAKRFYILLYLFLSRTYKNVEVVYIRHHTQAKE 300 +RP+PSSQAVMFCLMDVSGSMDQ+TKDMAKRFYILLYLFLSRTYKNV+VVYIRHHTQAKE Sbjct: 240 RRPEPSSQAVMFCLMDVSGSMDQATKDMAKRFYILLYLFLSRTYKNVDVVYIRHHTQAKE 299 Query: 301 VDEHEFFYSQETGGTIVSSALKLMDEVVKERYNPAQWNIYAAQASDGDNWADDSPLCHEI 360 VDE EFFYSQETGGTIVSSALKLMDEVV+ERYNPAQWNIYAAQASDGDNWADDSPLCHE+ Sbjct: 300 VDEQEFFYSQETGGTIVSSALKLMDEVVQERYNPAQWNIYAAQASDGDNWADDSPLCHEL 359 Query: 361 LAKKLLPVVRYYSYIEITRRAHQTLWREYEHLQSTFDNFAMQHIRDQDDIYPVFRELFHK 420 LAKK+LPVVRYYSYIEITRRAHQTLWREYE L+ FDNFA+QHIR+ +DIYPVFRELFHK Sbjct: 360 LAKKILPVVRYYSYIEITRRAHQTLWREYEDLEEKFDNFAIQHIREPEDIYPVFRELFHK 419 Query: 421 Q 421 Q Sbjct: 420 Q 420 >UniRef50_Q1QPM0 UPF0229 protein Nham_0975 n=24 Tax=Proteobacteria RepID=Y975_NITHX Length = 439 Score = 365 bits (937), Expect = 1e-99, Method: Composition-based stats. Identities = 181/439 (41%), Positives = 274/439 (62%), Gaps = 18/439 (4%) Query: 1 MTWFIDRRLNGKNKSMVNRQRFLRRYKAQIKQSISEAINKRSVTDVDSGESVSIPTEDIS 60 M FIDRRLN K++S+ NRQRFLRR + ++K+SI + + ++D D ++VSIPT Sbjct: 1 MPIFIDRRLNPKDRSLGNRQRFLRRAREELKRSIRDRVRSGRISDADGEQAVSIPTRSTD 60 Query: 61 EPMFHQGRG-GLRHRVHPGNDHFVQNDRIERPQGGGGGSGSGQGQASQDGEGQDEFVFQI 119 EP F + G R V PGN HFV DR+ +P G G+ + + +D+F F + Sbjct: 61 EPRFEAAKDSGRREHVLPGNKHFVPGDRLRKPGHGAAGTPDPSMK-----DSEDDFRFVL 115 Query: 120 SKDEYLDLLFEDLALPNLKQNQQRQLTEYKTHRAGYTANGVPANISVVRSLQNSLARRTA 179 S++E LDL FEDL LP++ + +++ ++ RAG+ A G P NI+V R+++NS RR A Sbjct: 116 SREEVLDLFFEDLELPDMVKLSLKEILAFRPRRAGFAATGSPTNINVGRTMRNSYGRRIA 175 Query: 180 MTAGKRRELHALEENLAIISNSEPAQLLEE--ERLRKEIAELRAKIERVPFIDTFDLRYK 237 + KR E+ A+ + +A + + + + + L+ E+ L K + ++D D+R+ Sbjct: 176 LKRPKREEVDAIRQEIAELESGSQSPVARQRIAALQAEVERLERKRRLIAYVDPVDIRFN 235 Query: 238 NYEKRPDPSSQAVMFCLMDVSGSMDQSTKDMAKRFYILLYLFLSRTYKNVEVVYIRHHTQ 297 +E +P P+++AVMFCLMDVSGSM + KD+AKRF++LL+LFL Y E+V+I H + Sbjct: 236 RFEAQPIPNAKAVMFCLMDVSGSMGEREKDLAKRFFVLLHLFLKCRYDRTEIVFISHTHE 295 Query: 298 AKEVDEHEFFYSQETGGTIVSSALKLMDEVVKERYNPAQWNIYAAQASDGDNWADDSPLC 357 A+EV+E FFYS ++GGT+VS+AL+ M ++ ERY ++WNIYAAQASDGDN A DS C Sbjct: 296 AQEVNEETFFYSTQSGGTVVSTALEKMHRIIAERYPGSEWNIYAAQASDGDNAAADSHRC 355 Query: 358 HEILAKKLLPVVRYYSYIEI----------TRRAHQTLWREYEHLQSTFDNFAMQHIRDQ 407 +L ++++ + +YY+Y+EI T +LWR Y + + + NF M I D Sbjct: 356 ITLLDEEIMRLCQYYAYVEIIDERERHIFGTTENGTSLWRAYSSVNANWPNFQMTRIADA 415 Query: 408 DDIYPVFRELFHKQNATAK 426 DIYPVFR+LF +Q K Sbjct: 416 ADIYPVFRQLFTRQATAEK 434 >UniRef50_Q1QZ69 UPF0229 protein Csal_0882 n=164 Tax=Proteobacteria RepID=Y882_CHRSD Length = 426 Score = 362 bits (928), Expect = 2e-98, Method: Composition-based stats. Identities = 238/427 (55%), Positives = 310/427 (72%), Gaps = 4/427 (0%) Query: 1 MTWFIDRRLNGKNKSMVNRQRFLRRYKAQIKQSISEAINKRSVTDVDSGESVSIPTEDIS 60 MT+FIDRR N KNKS VNRQRFL+RY++ IK+++ EA+N+RS+TD++ GE +SIP +DIS Sbjct: 1 MTYFIDRRANAKNKSAVNRQRFLQRYRSHIKRAVEEAVNRRSITDMERGEKISIPAKDIS 60 Query: 61 EPMFHQGRGGLRHRVHPGNDHFVQNDRIERPQGGGGGSGSGQGQASQDGEGQDEFVFQIS 120 EP+F G GG R V PGN FV+ DR+ R GG G GSG+G AS GEG DEF F +S Sbjct: 61 EPVFQHGPGGARTIVSPGNKEFVEGDRLRR-PGGEGRGGSGEGSASNQGEGMDEFAFSLS 119 Query: 121 KDEYLDLLFEDLALPNLKQNQQRQLTEYKTHRAGYTANGVPANISVVRSLQNSLARRTAM 180 ++E+LD +F+ LALP+L++ Q R L E + RAG T +GVP+ I++VRS++ + ARR M Sbjct: 120 REEFLDFVFDGLALPHLERKQLRDLDEVRPVRAGVTRDGVPSRINIVRSMREAQARRIGM 179 Query: 181 TAGKRRELHALEENLAIISNSEP--AQLLEEERLRKEIAELRAKIERVPFIDTFDLRYKN 238 A +R L EE L +P L+ EI L ++E VPFIDT+DLRY N Sbjct: 180 RAPIKRALREAEEALESEERKDPVLRNPARIGELKAEIERLEKRLEAVPFIDTYDLRYNN 239 Query: 239 YEKRPDPSSQAVMFCLMDVSGSMDQSTKDMAKRFYILLYLFLSRTYKNVEVVYIRHHTQA 298 +P PS++AVMFC+MDVSGSM Q KD+AKRF++LLYLFL R Y+ VE+V+IRHHT A Sbjct: 240 LIDQPQPSNKAVMFCVMDVSGSMTQGHKDIAKRFFLLLYLFLERNYEKVELVFIRHHTAA 299 Query: 299 KEVDEHEFFYSQETGGTIVSSALKLMDEVVKERYNPAQWNIYAAQASDGDNWADDSPLCH 358 KEVDE EFFYS+ETGGTIVSSAL L+DE++ +RY+PAQWN+Y AQASDGDNW DDS C Sbjct: 300 KEVDEEEFFYSRETGGTIVSSALTLVDEIIAKRYSPAQWNLYVAQASDGDNWDDDSLTCR 359 Query: 359 EILAKKLLPVVRYYSYIEITRRAHQTLWREYEHLQSTFD-NFAMQHIRDQDDIYPVFREL 417 ++L L+ ++YY+Y+EIT +HQ LW EYE +Q+ FAMQ I + DIYPVFR+L Sbjct: 360 DLLMTSLMAKLQYYTYVEITPHSHQALWEEYERVQAAHPSRFAMQQIVEPGDIYPVFRKL 419 Query: 418 FHKQNAT 424 F K+ A+ Sbjct: 420 FRKRVAS 426 >UniRef50_B6B8L1 Putative uncharacterized protein n=2 Tax=Rhodobacterales RepID=B6B8L1_9RHOB Length = 445 Score = 358 bits (919), Expect = 2e-97, Method: Composition-based stats. Identities = 194/444 (43%), Positives = 278/444 (62%), Gaps = 21/444 (4%) Query: 1 MTWFIDRRLNGKNKSMVNRQRFLRRYKAQIKQSISEAINKRSVTD----VDSGESVSIPT 56 M FIDRR N K KS+ NRQRFLRR + IK+ + +++ +S+ D GE V+IP Sbjct: 1 MHHFIDRRANPKGKSLGNRQRFLRRARENIKERVDQSVRGKSIQSGSGVPDGGEKVTIPA 60 Query: 57 EDISEPMF-HQGRGGLRHRVHPGNDHFVQNDRIERPQGGGGGSGSGQGQASQDGEGQDEF 115 + EP F H +GGLR V PGN FV D I+RPQGG G G +AS++G+G+DEF Sbjct: 61 RGLKEPRFFHSSKGGLRRHVLPGNKDFVVGDTIKRPQGGT---GQGGRKASEEGDGEDEF 117 Query: 116 VFQISKDEYLDLLFEDLALPNLKQNQQRQLTEYKTHRAGYTANGVPANISVVRSLQNSLA 175 F ++++EYL++LFE L LP+L + + T RAG T G P N+++VR+++NSL Sbjct: 118 SFTLTQEEYLEILFEGLELPDLVEKATVETETIGTRRAGLTTAGTPNNLNLVRTMRNSLG 177 Query: 176 RRTAMTAGKRRELHALEE---NLAIISNSEPAQLLEEERLRKEIAELRAKIERVPFIDTF 232 RR A+ + LEE L + + P Q E LRK++ + K + V +ID Sbjct: 178 RRIALQRPTTKSQRDLEEQIAELEALDDRTPPQEDFLEALRKKLDGIIRKRKVVGYIDPL 237 Query: 233 DLRYKNYEKRPDPSSQAVMFCLMDVSGSMDQSTKDMAKRFYILLYLFLSRTYKNVEVVYI 292 DLRY + +S+AV+FCLMDVSGSM + KD+AKRF++LL+LFL R Y++ E+V++ Sbjct: 238 DLRYDTFVPEKIRNSRAVVFCLMDVSGSMQEREKDLAKRFFLLLHLFLERCYEHTELVFV 297 Query: 293 RHHTQAKEVDEHEFFYSQETGGTIVSSALKLMDEVVKERYNPAQWNIYAAQASDGDNWAD 352 RH A+EVDE FFY++ETGGTIVS+AL+ M E+++ERY P +WNIY AQASDG+N+ + Sbjct: 298 RHTHHAQEVDEETFFYARETGGTIVSTALEKMKEIIEERYPPDEWNIYGAQASDGENFGN 357 Query: 353 DSPLCHEILAKKLLPVVRYYSYIEI----------TRRAHQTLWREYEHLQSTFDNFAMQ 402 DS C ++L LLPV ++Y+Y+EI A + LW+ Y +++ +F MQ Sbjct: 358 DSARCKKLLLNDLLPVSQFYAYVEIVDEAAEMLLNNPEAGEDLWQNYREVKAQAQHFEMQ 417 Query: 403 HIRDQDDIYPVFRELFHKQNATAK 426 + IYP+FRE F + A+ Sbjct: 418 RVSQPGHIYPIFREFFLPKVKGAQ 441 >UniRef50_P59348 UPF0229 protein bll6755 n=25 Tax=Alphaproteobacteria RepID=Y6755_BRAJA Length = 427 Score = 353 bits (905), Expect = 9e-96, Method: Composition-based stats. Identities = 186/431 (43%), Positives = 276/431 (64%), Gaps = 18/431 (4%) Query: 3 WFIDRRLNGKNKSMVNRQRFLRRYKAQIKQSISEAINKRSVTDVDSGESVSIPTEDISEP 62 IDRRLN KS+ NRQRFLRR K+ ++ ++ + +R + DV G V+IP + + EP Sbjct: 4 HIIDRRLNPGGKSLENRQRFLRRAKSLVQGAVKKTSQERDIKDVLEGGEVTIPLDGMHEP 63 Query: 63 MFHQGRGGLRHRVHPGNDHFVQNDRIERPQGGGGGSGSGQGQASQDGEGQDEFVFQISKD 122 F + GG R V PGN FV+ D ++R G GS + +G+ +D F F +S+D Sbjct: 64 RFRR-EGGTRDMVLPGNKKFVEGDYLQR-----SGQGSAKDSGPGEGDSEDAFRFVLSRD 117 Query: 123 EYLDLLFEDLALPNLKQNQQRQLTEYKTHRAGYTANGVPANISVVRSLQNSLARRTAMTA 182 E++DL +DL LP+L + + Q RAGYT +G PANISV R+++ +LARR A+ Sbjct: 118 EFVDLFLDDLELPDLAKRKIAQTESEGIQRAGYTTSGSPANISVSRTVKLALARRIALKR 177 Query: 183 GKRRELHALEENLAIISNSEPAQLLEEERLRKEIAELRAKIERVPFIDTFDLRYKNYEKR 242 ++ E+ LE +A ++ E L E+ +L AK +R+PFID D+RY+ +E Sbjct: 178 PRKDEIEELEAAIAACTD-----EDERVVLLAELEKLMAKTKRIPFIDPLDIRYRRFETV 232 Query: 243 PDPSSQAVMFCLMDVSGSMDQSTKDMAKRFYILLYLFLSRTYKNVEVVYIRHHTQAKEVD 302 P P +QAVMFCLMDVSGSM + KD+AKRFY+LLY+FL R YK+VE+V+IRH +A+EVD Sbjct: 233 PKPVAQAVMFCLMDVSGSMSEHMKDLAKRFYMLLYVFLKRRYKHVEIVFIRHTDRAEEVD 292 Query: 303 EHEFFYSQETGGTIVSSALKLMDEVVKERYNPAQWNIYAAQASDGDNWADDSPLCHEILA 362 E FFY +GGT+VSSAL+ M ++V+ER+NP+ WNIYAAQASDGDN D L +L Sbjct: 293 EQTFFYGPASGGTLVSSALQAMHDIVRERFNPSDWNIYAAQASDGDNSYSDGELTGLLLT 352 Query: 363 KKLLPVVRYYSYIEITRR-------AHQTLWREYEHLQSTFDNFAMQHIRDQDDIYPVFR 415 K+LPV ++++Y+E+ + +LW YE L+++ +M+ + ++ +I+PVF Sbjct: 353 DKILPVCQFFAYLEVGESGGSAFDLSDSSLWTLYERLRNSGAPLSMRKVSERSEIFPVFH 412 Query: 416 ELFHKQNATAK 426 +LF ++ + + Sbjct: 413 DLFQRRETSQE 423 >UniRef50_A9I2A9 Putative uncharacterized protein n=6 Tax=Burkholderiales RepID=A9I2A9_BORPD Length = 419 Score = 351 bits (900), Expect = 3e-95, Method: Composition-based stats. Identities = 203/426 (47%), Positives = 279/426 (65%), Gaps = 10/426 (2%) Query: 1 MTWFIDRRLNGKNKSMVNRQRFLRRYKAQIKQSISEAINKRSVTDVDSGESVSIPTEDIS 60 M IDRRLNG+NKS VNR+RFLRRYK QI++++ + + +RS+ D+D G +++P DIS Sbjct: 1 MNSLIDRRLNGRNKSAVNRERFLRRYKDQIRRAVQDLVRERSIEDMDQGGEINLPARDIS 60 Query: 61 EPMFHQGRGGLRHRVHPGNDHFVQNDRIERPQGGGGGSGSGQGQASQDGEGQDEFVFQIS 120 EP F G+GG R VHPGN F + D RP G G GS G+ D+F F +S Sbjct: 61 EPHFRHGQGGDRELVHPGNREFAKGDTFPRPSGSDGEGGSEPGEGESV----DQFTFSLS 116 Query: 121 KDEYLDLLFEDLALPNLKQNQQRQLTEYKTHRAGYTANGVPANISVVRSLQNSLARRTAM 180 + E+L+L FEDL LP+L + Q +T+ K RAGYT G P+ +SV R+L+ SL+RR A+ Sbjct: 117 RAEFLNLFFEDLELPHLIRTQLGDVTQKKWQRAGYTTTGSPSLLSVSRTLKASLSRRVAL 176 Query: 181 TAGKRRELHALEENLAIISNSEPAQLLEEERLRKEIAELRAKIERVPFIDTFDLRYKNYE 240 R EL A + L + E + LR+E+ + ++ R+PF+D DLRY+N Sbjct: 177 GVAARAELEAAQAKLDAAIAAGAP-QAEIDALRQEVEDCANRLARLPFLDDLDLRYRNRV 235 Query: 241 KRPDPSSQAVMFCLMDVSGSMDQSTKDMAKRFYILLYLFLSRTYKNVEVVYIRHHTQAKE 300 P ++AVMFCLMDVSGSMD+ KD+AKRF+ LLYLFLSR Y++V+VV+IRH A+E Sbjct: 236 SVAMPMARAVMFCLMDVSGSMDEGKKDLAKRFFTLLYLFLSRKYEHVDVVFIRHTDNAEE 295 Query: 301 VDEHEFFYSQETGGTIVSSALKLMDEVVKERYNPAQWNIYAAQASDGDNWADDSPLCHEI 360 VDE FFY ++GGTIV SAL+LM E+V++RY P+ WN+YAAQASDGD++ D+ Sbjct: 296 VDEQTFFYDPKSGGTIVLSALELMHEIVQQRYPPSAWNVYAAQASDGDSFGADAGKSARF 355 Query: 361 LAKKLLPVVRYYSYIEITRR---AHQTLWREYEHLQSTFDNFAMQHIRDQDDIYPVFREL 417 LA+ LLP RY++YIE+ +LW EYE T +F M+ I ++ +IYPVF +L Sbjct: 356 LAENLLPATRYFAYIEVPDSQEARKSSLWAEYEQ--ETAPHFVMRRICERGEIYPVFHDL 413 Query: 418 FHKQNA 423 F K+ A Sbjct: 414 FKKETA 419 >UniRef50_B7H9V9 UPF0229 protein BCB4264_A0587 n=93 Tax=Bacteria RepID=Y587_BACC4 Length = 391 Score = 345 bits (885), Expect = 2e-93, Method: Composition-based stats. Identities = 105/416 (25%), Positives = 179/416 (43%), Gaps = 47/416 (11%) Query: 14 KSMVNRQRFLRRYKAQIKQSISEAINKRSVTDVDSGESVSIPTEDISEPMFHQGRGGLRH 73 K ++QR + + IK ++ + + + S+ + + V IP + E +H Sbjct: 21 KGYDDQQRHQEKVQEAIKNNLPDLVTEESIVMSNGKDVVKIPIRSLDEYKIRYNYDKNKH 80 Query: 74 RVHPGNDHFVQNDRIER-PQGGGGGSGSGQGQASQDGEGQDEFVFQISKDEYLDLLFEDL 132 V GN D + R GG G G+GQ + D G+D + ++S E F +L Sbjct: 81 -VGQGNGDSKVGDVVARDGSGGQKQKGPGKGQGAGDAAGEDYYEAEVSILELEQAFFREL 139 Query: 133 ALPNLKQNQQRQLTEYKTHRAGYTANGVPANISVVRSLQNSLARRTAMTAGKRRELHALE 192 LPNLK+ + + G+ NI R++ ++ R Sbjct: 140 ELPNLKRKEMDENRIEHVEFNDIRKTGLWGNIDKKRTMISAYKRNAMSGKAS-------- 191 Query: 193 ENLAIISNSEPAQLLEEERLRKEIAELRAKIERVPFIDTFDLRYKNYEKRPDPSSQAVMF 252 I DL+++ + + P S+AV+ Sbjct: 192 ---------------------------------FHPIHQEDLKFRTWNEVLKPDSKAVVL 218 Query: 253 CLMDVSGSMDQSTKDMAKRFYILLYLFLSRTYKNVEVVYIRHHTQAKEVDEHEFFYSQET 312 +MD SGSM K MA+ F+ + FL Y+ V++ +I HHT+AK V E EFF E+ Sbjct: 219 AMMDTSGSMGIWEKYMARSFFFWMTRFLRTKYETVDIEFIAHHTEAKVVTEEEFFSKGES 278 Query: 313 GGTIVSSALKLMDEVVKERYNPAQWNIYAAQASDGDNWADDSPLCHEILAKKLLPVVRYY 372 GGTI SS K E++ +Y+P ++NIY SDGDN D+ C + L ++L+ + Sbjct: 279 GGTICSSVYKKALELIDNKYSPDRYNIYPFHFSDGDNLTSDNARCVK-LVEELMKKCNMF 337 Query: 373 SYIEITR-RAHQTLWREYEHLQSTFDNFAMQHIRDQDDIYPVFRELFHKQNATAKG 427 Y E+ + H TL Y++++ DNF ++ + D++ + F +++ Sbjct: 338 GYGEVNQYNRHSTLMSAYKNIKD--DNFRYYILKQKADVFHAMKSFFREESGEKMA 391 >UniRef50_B9DZE4 UPF0229 protein CKR_0568 n=26 Tax=Clostridiales RepID=Y568_CLOK1 Length = 403 Score = 325 bits (832), Expect = 3e-87, Method: Composition-based stats. Identities = 119/413 (28%), Positives = 200/413 (48%), Gaps = 25/413 (6%) Query: 13 NKSMVNRQRFLRRYKAQIKQSISEAINKRSVTDVDSGESVSIPTEDISEPMFHQGRGGLR 72 ++S+ +R+R + + IK ++++ I++ S+ + V IP + I E F G Sbjct: 14 DRSLEDRRRHRQLVEKSIKDNLADIISEESIIGQSKNKKVKIPIKGIKEYQFIYGDNSSG 73 Query: 73 HRVHPGNDHFVQNDRIERPQGGGGGSGSGQGQASQDGEGQDEFVFQISKDEYLDLLFEDL 132 + + DRI + G G+ Q + + EG+D + +++ ++ LD L EDL Sbjct: 74 VGSG--DGSQKKGDRIGKAIKDRDGKGN---QGAGNQEGEDMYEIEVTIEDVLDYLMEDL 128 Query: 133 ALPNLKQNQQRQL-TEYKTHRAGYTANGVPANISVVRSLQNSLARRTAMTAGKRRELHAL 191 LP + + + Q+ + ++GY G+ ++ R++ L R+ R L Sbjct: 129 ELPLMDKKKFSQILSNNSPKKSGYQRKGINPRLAKKRTVVEKLKRQQGTKRALREIHGEL 188 Query: 192 EENLAIISNSEPAQLLEEERLRKEIAELRAKIERVPFIDTFDLRYKNYEKRPDPSSQAVM 251 E R PF DLRY +++P A + Sbjct: 189 ES-----DPKNKLPEN------------TTIKSRFPFKQD-DLRYFRVKRKPKLELNAAI 230 Query: 252 FCLMDVSGSMDQSTKDMAKRFYILLYLFLSRTYKNVEVVYIRHHTQAKEVDEHEFFYSQE 311 C+MD SGSMD + K +A+ F+ +LY F+ Y NVEV +I H T AK V E+EFF+ E Sbjct: 231 ICVMDTSGSMDSTRKFLARSFFFVLYRFIKMKYNNVEVKFISHSTSAKVVTENEFFHKVE 290 Query: 312 TGGTIVSSALKLMDEVVKERYNPAQWNIYAAQASDGDNWADDSPLCHEILAKKLLPVVRY 371 +GGT +SS LK EV++E YNPA WN+Y SDGDNW++D+ L + AK L V Sbjct: 291 SGGTYISSGLKKALEVIEENYNPAYWNVYTFYVSDGDNWSEDNSLALK-CAKDLCKVCNL 349 Query: 372 YSYIEITRRAHQTLWREYEHLQSTFDNFAMQHIRDQDDIYPVFRELFHKQNAT 424 +SY EI + + + + T +NF + I ++ D++ +++ +K+ Sbjct: 350 FSYAEIIPSPYGSSIKHIFQNKITDNNFTVVTIHEKQDLWKSLKKILNKELEE 402 >UniRef50_A7Z2R4 UPF0229 protein RBAM_009260 n=29 Tax=Firmicutes RepID=Y926_BACA2 Length = 394 Score = 323 bits (828), Expect = 8e-87, Method: Composition-based stats. Identities = 103/412 (25%), Positives = 181/412 (43%), Gaps = 47/412 (11%) Query: 14 KSMVNRQRFLRRYKAQIKQSISEAINKRSVTDVDSGESVSIPTEDISEPMFHQGRGGLRH 73 K ++QR ++ + IK ++ + + + S+ + + V IP + E +H Sbjct: 21 KGFDDQQRHQKKVQEAIKNNLPDLVTEESIIMSNGKDVVKIPIRSLDEYKIRYNYDKNKH 80 Query: 74 RVHPGNDHFVQNDRIERPQGGGGGSGSGQGQASQDGEGQDEFVFQISKDEYLDLLFEDLA 133 V G+ D + R G G+G+GQ + D G+D + ++S + + LF++L Sbjct: 81 -VGQGDGDSEVGDVVAR-DGADKKQGAGKGQGAGDQAGEDYYEAEVSLMDLEEALFQELE 138 Query: 134 LPNLKQNQQRQLTEYKTHRAGYTANGVPANISVVRSLQNSLARRTAMTAGKRRELHALEE 193 LPNL+Q ++ + G+ NI R++ ++ R Sbjct: 139 LPNLQQKERDNIVHTDIEFNDIRKTGLTGNIDKKRTMLSAYKRNAMTGKPS--------- 189 Query: 194 NLAIISNSEPAQLLEEERLRKEIAELRAKIERVPFIDTFDLRYKNYEKRPDPSSQAVMFC 253 I DL+YK + P S+AV+ Sbjct: 190 --------------------------------FYPIYPEDLKYKTWNDVTKPESKAVVLA 217 Query: 254 LMDVSGSMDQSTKDMAKRFYILLYLFLSRTYKNVEVVYIRHHTQAKEVDEHEFFYSQETG 313 +MD SGSM K MA+ F+ + FL Y+ V++ +I HHT+AK V E +FF E+G Sbjct: 218 MMDTSGSMGVWEKYMARSFFFWMTRFLRTKYETVDIEFIAHHTEAKVVSEEDFFSKGESG 277 Query: 314 GTIVSSALKLMDEVVKERYNPAQWNIYAAQASDGDNWADDSPLCHEILAKKLLPVVRYYS 373 GTI SS + E++ E+Y+PA++NIY SDGDN D+ C + L ++ + Sbjct: 278 GTICSSVYRKSLELIDEKYDPARYNIYPFHFSDGDNLTSDNARCVK-LVNDIMKKSNLFC 336 Query: 374 YIEITR-RAHQTLWREYEHLQSTFDNFAMQHIRDQDDIYPVFRELFHKQNAT 424 Y E+ + H TL Y++++ D F ++ + D++ + F + + Sbjct: 337 YGEVNQYNRHSTLMSAYKNVKD--DKFKYYILKQKSDVFQALKSFFKNEESG 386 >UniRef50_B9L510 Sporulation protein YhbH n=1 Tax=Thermomicrobium roseum DSM 5159 RepID=B9L510_THERP Length = 389 Score = 322 bits (824), Expect = 2e-86, Method: Composition-based stats. Identities = 114/417 (27%), Positives = 182/417 (43%), Gaps = 51/417 (12%) Query: 14 KSMVNRQRFLRRYKAQIKQSISEAINKRSVTDVDSGESVSIPTEDISEPMFHQGRGGLRH 73 K +++ R + + K IK+++++ ++++S+ D V +P + E F R Sbjct: 19 KGAIDQARHMEKVKEAIKRNLADIVSEQSLITSDGKRVVRVPIRVLEEYRFRFDPDSGRQ 78 Query: 74 RVHPGNDHFVQNDRIERPQGGGGGSGSGQGQASQDGEGQDEFVFQISKDEYLDLLFEDLA 133 V G+ + GGG SG G + D G D + +++ +E +L+FEDL Sbjct: 79 -VGQGSG---GTHVGDVVGRVGGGQRSGDGPQAGDQPGIDYYEAELTIEELSELIFEDLE 134 Query: 134 LPNLKQNQQRQLTEYKTHRAGYTANGVPANISVVRSLQNSLARRTAMTAGKRRELHALEE 193 LPNL++ + R+L +G AN+ R+L+ +L R Sbjct: 135 LPNLEEKRLRELESEAVRFTEIRRHGPFANLDKRRTLRENLRRN---------------- 178 Query: 194 NLAIISNSEPAQLLEEERLRKEIAELRAKIERVPFIDTFDLRYKNYEKRPDPSSQAVMFC 253 R+ DLR+K +E+ S AV+ Sbjct: 179 -------------------------AWRGRARIGDFANEDLRFKTWERDVKRESNAVVIA 213 Query: 254 LMDVSGSMDQSTKDMAKRFYILLYLFLSRTYKNVEVVYIRHHTQAKEVDEHEFFYSQETG 313 +MDVSGSM K +++ FY + FL Y VE+ +I HH +A+EV E EFF E+G Sbjct: 214 MMDVSGSMGTFEKYVSRAFYYWMVRFLRTKYDRVEIRFIAHHAEAREVSEEEFFSRGESG 273 Query: 314 GTIVSSALKLMDEVVKERYNPAQWNIYAAQASDGDNWADDSPLCHEILAKKLLPVVRYYS 373 GT S+A +L ++++E Y P WNIY SDGDNW D+ C E LA++LL + Sbjct: 274 GTRASTAYELALQLIRESYPPDSWNIYPFHFSDGDNWPSDNERCRE-LAEELLRCANLFG 332 Query: 374 YIEITRR---AHQTLWREYEHLQSTFDNFAMQHIRDQDDIYPVFRELFHKQNATAKG 427 Y EI + TL + + S I ++ D+Y R F + Sbjct: 333 YGEIRQGRYTYQSTLMHTLQRIGS--PKLVTVTITEKADVYQALRRFFGPEVGQEVA 387 >UniRef50_Q3J885 Putative uncharacterized protein n=2 Tax=Nitrosococcus oceani RepID=Q3J885_NITOC Length = 394 Score = 317 bits (811), Expect = 6e-85, Method: Composition-based stats. Identities = 117/419 (27%), Positives = 199/419 (47%), Gaps = 49/419 (11%) Query: 13 NKSMVNRQRFLRRYKAQIKQSISEAINKRSVTDVDSGESVSIPTEDISEPMFHQGRGGLR 72 ++S +R R ++ + I+ ++++ + + S+ + +P I E F G+ Sbjct: 17 DRSAKDRLRHRQKVRKAIRDNVADIVAEESIIGQSRDRIIKVPIRGIREYRFVYGQNTPG 76 Query: 73 HRVHPGNDHFVQNDRIERPQGGGGGSGSGQGQASQDGEGQDEFVFQISKDEYLDLLFEDL 132 G+ Q G G G + D G D + +I+ +E ++++ EDL Sbjct: 77 VGTGQGDSEPGQ-------TVGQVPQGDGGPGHAGDRPGMDYYETEITLEELIEIMLEDL 129 Query: 133 ALPNLKQNQQRQLTEYKT-HRAGYTANGVPANISVVRSLQNSLARRTAMTAGKRRELHAL 191 LP++++ + R++ +T R G+ GV ++ R+ ++ + RR A ++ Sbjct: 130 ELPDMERKRFREVLSERTSKRKGFRRVGVRVHMDKRRTAKSRIRRRLA----SDKDAEDN 185 Query: 192 EENLAIISNSEPAQLLEEERLRKEIAELRAKIERVPFIDTFDLRYKNYEKRPDPSSQAVM 251 E R PF D+RY + P S AV+ Sbjct: 186 ETK-----------------------------HRFPFHRD-DMRYHRLREDMRPQSNAVV 215 Query: 252 FCLMDVSGSMDQSTKDMAKRFYILLYLFLSRTYKNVEVVYIRHHTQAKEVDEHEFFYSQE 311 FC+MD SGSMD K +A+ F+ LLY F+ Y NV+VV+I HHT+A+EV E EFF+ E Sbjct: 216 FCIMDTSGSMDTLKKYLARSFFFLLYQFVRSRYVNVDVVFIAHHTKAREVTEEEFFHKGE 275 Query: 312 TGGTIVSSALKLMDEVVKERYNPAQWNIYAAQASDGDNWADDSPLCHEILAKKLLPVVRY 371 GGT +SS E+++ RY+P+ WNIYA SDGDN+ D+ + A+ L V Sbjct: 276 AGGTFISSGYSKALEIIQNRYHPSLWNIYAFHCSDGDNFDSDNAATLKA-AEVLCQVCNL 334 Query: 372 YSYIEIT----RRAHQTLWREYEHLQSTFDNFAMQHIRDQDDIYPVFRELFHKQNATAK 426 + Y EI T+ + ++ DNF I+ ++DI+P FR+L +++ ++K Sbjct: 335 FGYGEIKPRPSGFYEGTMLDLFRSVRM--DNFQSVLIQRKEDIWPSFRQLLSRESESSK 391 >UniRef50_Q896G6 Conserved protein n=1 Tax=Clostridium tetani RepID=Q896G6_CLOTE Length = 386 Score = 296 bits (758), Expect = 1e-78, Method: Composition-based stats. Identities = 108/416 (25%), Positives = 193/416 (46%), Gaps = 43/416 (10%) Query: 13 NKSMVNRQRFLRRYKAQIKQSISEAINKRSVTDVDSGESVSIPTEDISEPMFHQGRGGLR 72 N++ +R+R + IK ++ + + + ++ + +P + + E F + Sbjct: 13 NRAGEDRKRHRELVEKSIKDNLVDVLLQEDISIQKENIKIKVPIKGVKEYEFTYSQNRSF 72 Query: 73 HRVHPGNDHFVQNDRIERPQGGGGGSGSGQGQASQDGEGQDEFVFQISKDEYLDLLFEDL 132 V GN+ Q ++R GGG+G+G+ + +D F +++ +E +F+DL Sbjct: 73 VVVGKGNEKKGQKIALKRASEQGGGAGAGEIEG------EDIFETEVTIEEIFQSIFDDL 126 Query: 133 ALPNLKQNQQRQLTEYKTHRA-GYTANGVPANISVVRSLQNSLARRTAMTAGKRRELHAL 191 LPNLK+ + ++ R G+ +G+ ++ R+ + R+ A R+ Sbjct: 127 ELPNLKKKKFNKILNDSFKRKKGFKKHGISPRLAKRRTAIEKVKRKQATQKVLGRD---- 182 Query: 192 EENLAIISNSEPAQLLEEERLRKEIAELRAKIERVPFIDTFDLRYKNYEKRPDPSSQAVM 251 + E +K+ DLRY + + AV+ Sbjct: 183 --------------IAERFPFKKD-----------------DLRYSRVKLNKNKEYNAVI 211 Query: 252 FCLMDVSGSMDQSTKDMAKRFYILLYLFLSRTYKNVEVVYIRHHTQAKEVDEHEFFYSQE 311 C+MD S SMDQ K MA+ F+ ++Y F+ Y+ V++ +I H T AKEV E EFF+ E Sbjct: 212 ICIMDTSASMDQMKKYMARSFFFMIYKFIKMKYEEVDICFISHSTTAKEVTEEEFFHKVE 271 Query: 312 TGGTIVSSALKLMDEVVKERYNPAQWNIYAAQASDGDNWADDSPLCHEILAKKLLPVVRY 371 +GGT +SS K E++ RYNP +NIY ASDGDNW +D+ + +AK+L V Sbjct: 272 SGGTYISSGYKKALEIINTRYNPQIYNIYTFHASDGDNWNEDNDRAVK-VAKELSNVCNL 330 Query: 372 YSYIEITRRAHQTLWREYEHLQSTFDNFAMQHIRDQDDIYPVFRELFHKQNATAKG 427 + YIEI + R + +NF I ++D++ +++ ++ +G Sbjct: 331 FGYIEIMGYGYSNGIRNKYLKEIEKENFIPLIIEKKEDLWRALKDILKQEMREERG 386 >UniRef50_C9RD74 Putative uncharacterized protein n=1 Tax=Ammonifex degensii KC4 RepID=C9RD74_AMMDK Length = 371 Score = 292 bits (746), Expect = 2e-77, Method: Composition-based stats. Identities = 105/407 (25%), Positives = 165/407 (40%), Gaps = 51/407 (12%) Query: 14 KSMVNRQRFLRRYKAQIKQSISEAINKRSVTDVDSGESVSIPTEDISEPMFHQGRGGLRH 73 K + +R + K I+Q + E I + S+ D V IP + E F Sbjct: 15 KGEEDARRHQEKLKEIIRQRLPELITEESLILADDRRKVRIPLRLVEEFRFRF-ASHQEM 73 Query: 74 RVHPGNDHFVQNDRIERPQGGGGGSGSGQGQASQDGEGQDEFVFQISKDEYLDLLFEDLA 133 V ++ I P G GG + G D + ++S +E +++FE+LA Sbjct: 74 LVGQAGSQPGTDETIVFPGIGRGGGAGTE-------PGIDYYEAEVSVEEIAEVVFEELA 126 Query: 134 LPNLKQNQQRQLTEYKTHRAGYTANGVPANISVVRSLQNSLARRTAMTAGKRRELHALEE 193 LP+ K A G+ A + R+L N+L R Sbjct: 127 LPHYKPKNTAN-RGIAEEWADLRRQGIRACLDRRRTLLNALKRHAKEGR----------- 174 Query: 194 NLAIISNSEPAQLLEEERLRKEIAELRAKIERVPFIDTFDLRYKNYEKRPDPSSQAVMFC 253 + R + DLR++ + P ++AV+ Sbjct: 175 ---------------KGEFR---------------LCPSDLRFRVWRSIESPEARAVVLA 204 Query: 254 LMDVSGSMDQSTKDMAKRFYILLYLFLSRTYKNVEVVYIRHHTQAKEVDEHEFFYSQETG 313 ++D SGSM K +A+ F+ + FL Y NVEVVY+ HHT+A+E EFF E+G Sbjct: 205 MLDTSGSMGPLEKYLARSFFFWMVRFLEANYANVEVVYLAHHTEARETTASEFFRKGESG 264 Query: 314 GTIVSSALKLMDEVVKERYNPAQWNIYAAQASDGDNWADDSPLCHEILAKKLLPVVRYYS 373 GT SS +L ++++ RY P ++NIYA SDGDN D+ C E L +LL V Sbjct: 265 GTRCSSVYELALDIIETRYPPTEYNIYAFHFSDGDNLPADNERCME-LIGRLLEVANLVG 323 Query: 374 YIEITRRAHQTLWREYEHLQSTFDNFAMQHIRDQDDIYPVFRELFHK 420 Y EI T + + + +R++ D+Y + F + Sbjct: 324 YGEIEGPYFYTSTLKTVYQSIAHPRLVVVTLRERKDVYRALKAFFAR 370 >UniRef50_B5EPS6 Putative uncharacterized protein n=2 Tax=Acidithiobacillus ferrooxidans RepID=B5EPS6_ACIF5 Length = 434 Score = 289 bits (739), Expect = 2e-76, Method: Composition-based stats. Identities = 166/437 (37%), Positives = 255/437 (58%), Gaps = 17/437 (3%) Query: 1 MTWFIDRRLNGKNKSMVNRQRFLRRYKAQIKQSISEAINKRSVTD-VDSGESVSIPTEDI 59 M+ IDRR +G +S N+ R RR +A++K ++ + S+ D ++ + VSIPT D+ Sbjct: 1 MSMIIDRRSSG-TRSTANQDRLQRRVRARLKVAVEKMARSGSIEDLANTDQPVSIPTRDL 59 Query: 60 SEPMFHQG-RGGLRHRVHPGNDHFVQNDRIERPQGGGGGSGSGQGQASQDGEGQDEFVFQ 118 EP F + RV PGN + + D I +P+GGG G G DG G+DE Sbjct: 60 HEPSFRRDLSDTSWERVLPGNKEYQRGDEINKPEGGGSGKGRAGAP---DGLGEDEVAIV 116 Query: 119 ISKDEYLDLLFEDLALPNLKQNQQRQLTEYKTHRAGYTANGVPANISVVRSLQNSLARRT 178 +S DE+LDLLF+ LALPNL++ Q + + RAG+ +G P+ + V R+++ + ARR Sbjct: 117 LSADEFLDLLFDGLALPNLRKMAQGDIQADQWRRAGFIKDGSPSRMHVGRTMRAARARRL 176 Query: 179 AMTAGKRRELHALEENLAIISNSEPAQ----------LLEEERLRKEIAELRAKIERVPF 228 A+ AGKRREL L + ++ + L +I L KI+ +PF Sbjct: 177 ALRAGKRRELQDLLDARNVLQEEIQGRLAQKQDVSVEQERLSELNHQIDALERKIKAIPF 236 Query: 229 IDTFDLRYKNYEKRPDPSSQAVMFCLMDVSGSMDQSTKDMAKRFYILLYLFLSRTYKNVE 288 ID DLR+ + +++P P + AVMFC+MDVSGSM + KD+AKRF++LLYLFL R Y+ V+ Sbjct: 237 IDEADLRFAHIDQQPHPITNAVMFCVMDVSGSMGEKEKDLAKRFFLLLYLFLHRHYQAVQ 296 Query: 289 VVYIRHHTQAKEVDEHEFFYSQETGGTIVSSALKLMDEVVKERYNPAQWNIYAAQASDGD 348 +V+I+HH+ A E E FF ++E GGT+VS A+ L +E++++R+ P +WN+Y AQ SDGD Sbjct: 297 MVFIKHHSTASECSEQAFFGAREGGGTLVSPAIILSEEIMRQRFPPDRWNVYLAQVSDGD 356 Query: 349 NWADDSPLCHEILAKKLLPVVRYYSYIEITRRAHQTLWREYEHLQSTFDNFAMQHIRDQD 408 N+ D+ + E L L + + Y+E+ R + L R Y+ + F +++ Sbjct: 357 NYFADNAVVEEHLLNLLPRLRNLF-YLEVNRDSESDLLRLYDAIAQDFPELVTARASERE 415 Query: 409 DIYPVFRELFHKQNATA 425 DIYP+FR LF + + Sbjct: 416 DIYPMFRTLFATEETPS 432 >UniRef50_UPI00016C05A6 hypothetical protein Epulo_00085 n=1 Tax=Epulopiscium sp. 'N.t. morphotype B' RepID=UPI00016C05A6 Length = 368 Score = 288 bits (738), Expect = 2e-76, Method: Composition-based stats. Identities = 117/401 (29%), Positives = 184/401 (45%), Gaps = 63/401 (15%) Query: 18 NRQRFLRRYKAQIKQSISEAINKRSVTDVDSGESVSIPTEDISEPMFHQGRGGLRHRVHP 77 + +R + + I+++I I S+T+ +G V + +++ E F G V Sbjct: 18 DAKRHRKLVEKSIRENIDMLIVGESITETAAGNIVKVRIQELPEYRFKFG--SSTEYVAI 75 Query: 78 GNDHFVQNDRIERPQGGGGGSGSGQGQASQDGEGQDEFVFQISKDEYLDLLFEDLALPNL 137 G+ V N++ + + +AS + G D + +I D+ L LLFE L LPNL Sbjct: 76 GDGDEVVNEKCDF-----------EMEASNEA-GLDIYESEIVLDDALALLFEQLELPNL 123 Query: 138 KQNQQRQLTEYKTH-RAGYTANGVPANISVVRSLQNSLARRTAMTAGKRRELHALEENLA 196 + + + L + T R+G G+ + R+LQ + R Sbjct: 124 YEKKFKNLEYFSTQKRSGIKKTGIYPRFAKKRTLQEKIIR-------------------- 163 Query: 197 IISNSEPAQLLEEERLRKEIAELRAKIERVPFIDTFDLRYKNYEKRPDPSSQAVMFCLMD 256 + D+RY++ K+ S AV+ C+MD Sbjct: 164 ---------------------------NKNGRFINQDIRYQSLAKKQINHSNAVIVCIMD 196 Query: 257 VSGSMDQSTKDMAKRFYILLYLFLSRTYKNVEVVYIRHHTQAKEVDEHEFFYSQETGGTI 316 SGSM + KDMAK FY LLY F+ Y VE+++I H T AKEV E++FF+ E+GGT Sbjct: 197 TSGSMGTTKKDMAKSFYFLLYQFIKIRYAKVEMIFIAHSTIAKEVTENDFFHKGESGGTY 256 Query: 317 VSSALKLMDEVVKERYNPAQWNIYAAQASDGDNWADDSPLCHEILAKKLLPVVRYYSYIE 376 +SS E++KERY+P WN+Y SDGDNW DD+ L LA +L + YIE Sbjct: 257 ISSGYTKALEIIKERYDPRLWNVYTFHCSDGDNWTDDNNLAV-SLANELCSCSNLFGYIE 315 Query: 377 ITRRAHQTLWREYEHLQSTFDNFAMQHIRDQDDIYPVFREL 417 I + ++ + T +NF I + DI+ VF+++ Sbjct: 316 IKTNNYSSVILNEYNAHITSNNFLALKIFKKSDIFEVFKKV 356 >UniRef50_A9FJ88 Uncharacterized conserved protein involved in stress response n=21 Tax=Bacteria RepID=A9FJ88_SORC5 Length = 405 Score = 284 bits (727), Expect = 4e-75, Method: Composition-based stats. Identities = 101/405 (24%), Positives = 179/405 (44%), Gaps = 47/405 (11%) Query: 18 NRQRFLRRYKAQIKQSISEAINKRSVTDVDSGESVSIPTEDISEPMFHQGRGGLRHRVHP 77 + RF + + +I++++ + I++ + + VSIP I P F G R V Sbjct: 44 DHGRFRQIVRGRIRENLRKYISQGELIGRKGKDLVSIPIPQIDIPRFRFG-DKQRGGVGQ 102 Query: 78 GNDHFVQNDRIERPQGGGGGSGSGQGQASQDGEGQDEFVFQISKDEYLDLLFEDLALPNL 137 G+ + P GG GQGQ + GEG ++ +E +L E+L LP++ Sbjct: 103 GDGN------PGDPVGGSDDKQPGQGQ-AGSGEGDHLLEVDVTLEELAGILGEELELPDI 155 Query: 138 KQNQQRQLTEYKTHRAGYTANGVPANISVVRSLQNSLARRTAMTAGKRRELHALEENLAI 197 + + +++ +G G + R+ + +L R + Sbjct: 156 QDKGKSKISNAHDRYSGIRRVGPESLRHFKRTYREALKRMISSG---------------- 199 Query: 198 ISNSEPAQLLEEERLRKEIAELRAKIERVPFIDTFDLRYKNYEKRPDPSSQAVMFCLMDV 257 V D RY++++ +P + AV+ +MDV Sbjct: 200 ---------------------TFRPSAPVVVPVPDDKRYRSWKTITEPVANAVIIYMMDV 238 Query: 258 SGSMDQSTKDMAKRFYILLYLFLSRTYKNVEVVYIRHHTQAKEVDEHEFFYSQETGGTIV 317 SGSM K++ + + +L+R YK +E +I H A+EVD FF+++E+GGT++ Sbjct: 239 SGSMGDEQKEIVRIESFWIDAWLTRQYKGLESRFIIHDAIAREVDRDTFFHTRESGGTMI 298 Query: 318 SSALKLMDEVVKERYNPAQWNIYAAQASDGDNWA-DDSPLCHEILAKKLLPVVRYYSYIE 376 SSA KL +++ Y P +WNIY SDGDNW+ DD+ C ++L ++LP V ++Y + Sbjct: 299 SSAYKLCSQIIDNDYPPDEWNIYPFHFSDGDNWSMDDTLSCVDVLKTQILPRVNMFAYGQ 358 Query: 377 ITRRAHQ-TLWREYEHLQSTFDNFAMQHIRDQDDIYPVFRELFHK 420 + ++ + S D + IRD+D I ++ K Sbjct: 359 VESPYGSGQFIKDLKEHFSQDDRVVVSEIRDKDAIVGSIKDFLGK 403 >UniRef50_Q67Q87 Putative uncharacterized protein n=1 Tax=Symbiobacterium thermophilum RepID=Q67Q87_SYMTH Length = 395 Score = 276 bits (705), Expect = 1e-72, Method: Composition-based stats. Identities = 99/422 (23%), Positives = 168/422 (39%), Gaps = 54/422 (12%) Query: 14 KSMVNRQRFLRRYKAQIKQSISEAINKRSVTDVDSGESVSIPTEDISEPMFHQGRGGLRH 73 + ++++R R + I+ ++++ ++ ++ D + V +P + E F + Sbjct: 18 QGQMDQERHQARIREAIRANLADIVSDEAIIASDGRKVVRLPIRVLREYRFRLDWQK-QP 76 Query: 74 RVHPGNDHFVQNDRIERPQGGGGGSGSGQGQASQDGEGQDEFVFQISKDEYLDLLFEDLA 133 RV + + + RP +G D G+D F + +E +LLF +L+ Sbjct: 77 RVGEADGPVRPGEPVGRPGRAAEAAGGSGAG---DEAGEDWFETDVPLEELEELLFAELS 133 Query: 134 LPNLKQNQQRQLTEYKTHRAGYTANGVPANISVVRSLQNSLARRTAMTAGKRRELHALEE 193 LP+L+ Q+ LT G+ ANI R+L ++ R Sbjct: 134 LPHLEPKQEPHLTVLHHEWRDVRRQGLYANIDKKRTLLEAMKRNRLAGRPPLAG------ 187 Query: 194 NLAIISNSEPAQLLEEERLRKEIAELRAKIERVPFIDTFDLRYKNYEKRPDPSSQAVMFC 253 I DLR++ +++ P + AV+ Sbjct: 188 -----------------------------------IRREDLRFRTWDEAEIPGASAVLII 212 Query: 254 LMDVSGSMDQSTKDMAKRFYILLYLFLSRTYKNVEVVYIRHHTQAKEVDEHEFFYSQETG 313 +MD SGSM K +A+ + FL Y+ V++ ++ H T+AKE+DE FF E+G Sbjct: 213 MMDTSGSMGTGEKYIARSLCHWMVRFLRTRYERVKLHFVAHTTEAKEMDEESFFTRGESG 272 Query: 314 GTIVSSALKLMDEVVKERYNPAQWNIYAAQASDGDNWADDSPLCHEILAKKLLPVVRYYS 373 GT SSA + +++ RY P + N+YA SDGDN D+P L +KLL Sbjct: 273 GTRCSSAYEYALQLIDRRYPPDRHNLYAFHFSDGDNLISDNPRAV-ALLRKLLERCALVG 331 Query: 374 YIEI--------TRRAHQTLWREYEHLQSTFDNFAMQHIRDQDDIYPVFRELFHKQNATA 425 Y +I + F IRD+ +IY R F + A Sbjct: 332 YGQIETQPQYLSMPYYQPNTLLTLFREEIDHPRFVTALIRDRSEIYAALRAFFPRPGAGE 391 Query: 426 KG 427 +G Sbjct: 392 RG 393 >UniRef50_A4BNQ7 Putative uncharacterized protein n=1 Tax=Nitrococcus mobilis Nb-231 RepID=A4BNQ7_9GAMM Length = 391 Score = 263 bits (672), Expect = 9e-69, Method: Composition-based stats. Identities = 94/421 (22%), Positives = 173/421 (41%), Gaps = 58/421 (13%) Query: 14 KSMVNRQRFLRRYKAQIKQSISEAINKRSVTDVDSGESVSIPTEDISEPMFHQGRGGLRH 73 + + R + + +++ + + + V +V +P + F +R Sbjct: 21 RGTRDWLRHNEKIREAVREQLPDLVAGSDVLSRPDNRTVKVPVRFMEHYRFRLRNPDVRT 80 Query: 74 RVHPGNDHFVQNDRIERPQGGGGGSGSGQGQASQDGEGQDEFVFQISKDEYLDLLFEDLA 133 G R +P + S +GEGQ F + D+ LD L+++L Sbjct: 81 GAGQGKAKPGDVLRPAQP-----ARPGQGKEGSGEGEGQITFALEFQIDDILDWLWDELE 135 Query: 134 LPNLKQNQQRQLTEYKTHRAGYTANGVPANISVVRSLQNSLARRTAMTAGKRRELHALEE 193 LP+LK ++ E R G+ G + + R+++ ++ RR+A Sbjct: 136 LPHLKPRLGTRIEEDAYIREGWDRRGARSRLDRRRTMKEAIKRRSAQGP----------- 184 Query: 194 NLAIISNSEPAQLLEEERLRKEIAELRAKIERVPFIDTFDLRYKNYEKRPDPSSQAVMFC 253 I DLR++ +R P++ AV F Sbjct: 185 -------------------------------EAIPIVNDDLRFRQLARRRRPTTNAVAFF 213 Query: 254 LMDVSGSMDQSTKDMAKRFYILLYLFLSRTYKNVEVVYIRHHTQAKEVDEHEFFYSQETG 313 L+DVS SMD+ + +AK F+ + R + +E+V+I H +A E +E FF G Sbjct: 214 LLDVSSSMDEHCRRLAKTFFFWALQGVRRQFSTIEIVFIAHTVEAWEFEEENFFRIHGQG 273 Query: 314 GTIVSSALKLMDEVVKERYNPAQWNIYAAQASDGDNWADDSPLCHEILAKKLLPVVRYYS 373 GT S+A+ ++++ERY+PA +N Y A+DG N+++D E L +L P++ + Sbjct: 274 GTKSSTAVHKAQQILEERYDPAMYNCYLFYATDGHNFSEDRRRATEALL-RLAPLMNFLG 332 Query: 374 YIEITRRAHQTL-------WREYEHLQSTFDNFAMQHIRDQDDIYPVFRELFHKQNATAK 426 Y E++ + H+ L WR ++++ + DI+ + F Q A A+ Sbjct: 333 YAEVSHQNHRRLDTEVAGIWRGLGAEGWPVGSYSLTR---EADIWLAIKAFFTDQAAEAE 389 Query: 427 G 427 Sbjct: 390 A 390 >UniRef50_B1KPQ5 von Willebrand factor type A n=8 Tax=Gammaproteobacteria RepID=B1KPQ5_SHEWM Length = 640 Score = 252 bits (644), Expect = 2e-65, Method: Composition-based stats. Identities = 46/310 (14%), Positives = 92/310 (29%), Gaps = 26/310 (8%) Query: 130 EDLALPNLKQNQQRQLTEYKTHRAGYTANGVPANISVVRSLQNSLARRTAMTAGKRRELH 189 +D+ LP L+ + + +I V ++L R R Sbjct: 135 QDIYLPELQNRDKFERQVANGIMVAGEIPVSTFSIDVDTGSYSTLRRSINHGVLPERGTV 194 Query: 190 ALEENLAIISNSEPAQLLEEERLRKEIAELRAKIERVPFIDTFDLRYKNYEKRPDPSSQA 249 +EE + + PA E+ + + L+ EK +SQ Sbjct: 195 RVEELINYFAYQYPAPDAGEQPFSVNTELAPSPYNPHKMLLRIGLKGFEKEKADLGASQ- 253 Query: 250 VMFCLMDVSGSMDQSTK-DMAKRFYILLYLFLSRTYKNVEVVYIRHHTQAKE-------- 300 + L+DVSGSM K + K +L L + VVY + Sbjct: 254 -LVFLLDVSGSMSSQDKLPLLKNALKMLSQQLDEGDRISIVVYAGASGVVLDGVKGNDTL 312 Query: 301 VDEHEFFYSQETGGTIVSSALKLMDEVVKERYNPAQWNIYAAQASDGD-NWADDSPLCHE 359 + G T + ++L ++ ++ + N A+DGD N E Sbjct: 313 AISQALDKLKAGGSTNGGAGIELAYQLAQKHFIAGGVNRVIL-ATDGDFNVGVSDQQALE 371 Query: 360 ILAKKLLP---VVRYYSYIEITRRAHQTLWREYEHLQSTFDNFAMQHIRDQDDIYPVFRE 416 + ++ + + + + L + + + D R+ Sbjct: 372 DMIEEKRKQGIALTTLGFGQ--GNYNDHLMEQLADKGNGHYAYI--------DTLNEARK 421 Query: 417 LFHKQNATAK 426 + + + Sbjct: 422 VLVDEISATL 431 >UniRef50_C6M483 von Willebrand factor type A domain protein n=1 Tax=Neisseria sicca ATCC 29256 RepID=C6M483_NEISI Length = 538 Score = 251 bits (640), Expect = 4e-65, Method: Composition-based stats. Identities = 42/305 (13%), Positives = 97/305 (31%), Gaps = 23/305 (7%) Query: 134 LPNLKQNQQRQLTEYKTHRAGYTANGVPANISVVRSLQNSLARRTAMTAGKRRELHALEE 193 LP + ++ Q + ++ +I V ++ R ++ +EE Sbjct: 60 LPLAENTERYQDQPDQPVKSVAQEPVSTFSIDVDTGSYANVRRFLTNGEQPPKDAVRIEE 119 Query: 194 NLAIISNSEPAQLLEEERLRKEIAELRAKIERVPFIDTFDLRYKNYEKRPDPSSQAVMFC 253 + + P + + + + + ++ ++ K+ P + + Sbjct: 120 IVNYFPYNYPLPT-DNRPFAVHTETIDSPWQPEAKLIKIGIQAQDTAKKDLPPAN--LVF 176 Query: 254 LMDVSGSMDQSTK-DMAKRFYILLYLFLSRTYKNVEVVYIR--------HHTQAKEVDEH 304 L+DVSGSMD+ K + ++ +L L K + Y KE Sbjct: 177 LVDVSGSMDEENKLPLVQKTLRILTQQLRPQDKVTLITYASGEDLVLPPTSGADKETILS 236 Query: 305 EFFYSQETGGTIVSSALKLMDEVVKERYNPAQWNIYAAQASDGD-NWADDSPLCHEILAK 363 + G T SAL++ E ++ + P N A+DGD N + + Sbjct: 237 AIDKLRAGGATDGESALQMAYEQAQKAFVPNGINR-ILLATDGDFNVGVSDTETLKSMVA 295 Query: 364 KLLP-VVRYYSYIEITRRAHQTLWREYEHLQSTFDNFAMQHIRDQDDIYPVFRELFHKQN 422 + V + ++ + + ++ D +++ +Q Sbjct: 296 EKRKSGVSLSTLGFGMGNYNEDMMEQIADAGDGNYSYI--------DNEKEAKKVLQQQL 347 Query: 423 ATAKG 427 + Sbjct: 348 TSTLA 352 >UniRef50_A4BJU7 Putative uncharacterized protein n=1 Tax=Reinekea blandensis MED297 RepID=A4BJU7_9GAMM Length = 555 Score = 250 bits (639), Expect = 6e-65, Method: Composition-based stats. Identities = 52/304 (17%), Positives = 91/304 (29%), Gaps = 22/304 (7%) Query: 134 LPNLKQNQQRQLTEYKTHRAGYTANGVPANISVVRSLQNSLARRTAMTAGKRRELHALEE 193 P + + T R T + V + + R + +EE Sbjct: 83 PPVSENRENYPKTPISPIRQVATDPVSTFSTDVDTASYTNARRFLNQGMRPPADSIRVEE 142 Query: 194 NLAIISNSEPAQLLEEERLRKEIAELRAKIERVPFIDTFDLRYKNYEKRPDPSSQAVMFC 253 + + PA ++ + + L+ + + P + Sbjct: 143 FINYFDYALPAPDTTNTPIQISTERTQTPWNPQTELVRVSLQSYRSDFKTLPPLN--LVF 200 Query: 254 LMDVSGSMDQSTK-DMAKRFYILLYLFLSRTYKNVEVVYIRHHTQAKEVD--------EH 304 L+DVSGSM+ K + +R + LL L + VY E Sbjct: 201 LLDVSGSMNSPDKLPLMQRSFNLLVSQLRPQDRVAIAVYAGQSGVVLEPTSGDQKAQINQ 260 Query: 305 EFFYSQETGGTIVSSALKLMDEVVKERYNPAQWNIYAAQASDGD-NWADDSPLCHEILAK 363 + GGT S+ + L ++ + Y P N +DGD N S + L + Sbjct: 261 AINQLRAGGGTHGSAGIHLAYDLAQANYLPDGINR-IFIGTDGDFNVGTTSLTELKALIE 319 Query: 364 KLLPVVRYYSY-IEITRRAHQTLWREYEHLQSTFDNFAMQHIRDQDDIYPVFRELFHKQN 422 + + S T + L E + + + D Y R+LF Q Sbjct: 320 RKREAGVFLSVLGFGTGNYNDALMEELSNHGNGTAYYL--------DSYQEARKLFATQL 371 Query: 423 ATAK 426 A Sbjct: 372 AATL 375 >UniRef50_B8F8Z5 von Willebrand factor type A n=1 Tax=Desulfatibacillum alkenivorans AK-01 RepID=B8F8Z5_DESAA Length = 558 Score = 248 bits (633), Expect = 3e-64, Method: Composition-based stats. Identities = 40/307 (13%), Positives = 86/307 (28%), Gaps = 28/307 (9%) Query: 133 ALPNLKQNQQRQLTEYKTHRAGYTANGVPANISVVRSLQNSLARRTAMTAGKRRELHALE 192 +P+ + + E ++ +I V + +++ R + + +E Sbjct: 83 RVPDYNTEEYAPIREGGF-KSPLYDPLSTFSIDVDTASYSNVRRFLSYGNMPPVDAVRIE 141 Query: 193 ENLAIISNSEPAQLLEEERLRKEIAELRAKIERVPFIDTFDLRYKNYEKRPDPSSQAVMF 252 E + P ++ + + R + L+ + + + S + Sbjct: 142 EMINYFHYDYPQPK-GQDPFSITMEMSQCPWNRDNMLVHVGLQGRCLDYKDVKPSN--LV 198 Query: 253 CLMDVSGSMDQSTK-DMAKRFYILLYLFLSRTYKNVEVVYIRHHT--------QAKEVDE 303 L+DVSGSM+ K + KR +L L + V Y + K Sbjct: 199 FLLDVSGSMNSENKLPLVKRSMEMLVKELGAGDRVSIVTYAGSAGLVLPSTSARNKRKII 258 Query: 304 HEFFYSQETGGTIVSSALKLMDEVVKERYNPAQWNIYAAQASDGD-NWADDSPLCHEILA 362 + G T ++L V E P N +DGD N S + Sbjct: 259 TALDRLEAGGSTAGGEGIELAYRVAWENLIPEGNNRVIL-CTDGDFNVGVSSTPELVRMI 317 Query: 363 KKLLP---VVRYYSYIEITRRAHQTLWREYEHLQSTFDNFAMQHIRDQDDIYPVFRELFH 419 ++ + + + + + D ++F Sbjct: 318 EEKRRAGIYLTICGFG--MGNYKDEKMEAISNAGNGNFYYI--------DSRREAHKVFV 367 Query: 420 KQNATAK 426 + Sbjct: 368 QDMRANM 374 >UniRef50_C7PNZ7 von Willebrand factor type A n=5 Tax=Bacteroidetes RepID=C7PNZ7_CHIPD Length = 639 Score = 248 bits (632), Expect = 3e-64, Method: Composition-based stats. Identities = 45/305 (14%), Positives = 82/305 (26%), Gaps = 28/305 (9%) Query: 135 PNLKQNQQRQLTEYKTHRAGYTANGVPANISVVRSLQNSLARRTAMTAGKRRELHALEEN 194 P + E + H + +I V R+ +++ R + +EE Sbjct: 163 PQFNTEDYSPVNENRFH-TVASDPLSTFSIDVDRASYSNVRRFLNEGNMPPVDAVRVEEM 221 Query: 195 LAIISNSEPAQLLEEERLRKEIAELRAKIERVPFIDTFDLRYKNYEKRPDPSSQAVMFCL 254 + + + L+ K+ K P S + L Sbjct: 222 INYFDYKYSNPT-GNTPVAVRTDMAICPWNTAHQLVRIALKGKDVAKDNLPPSN--LVFL 278 Query: 255 MDVSGSMDQSTK-DMAKRFYILLYLFLSRTYKNVEVVYIRH--------HTQAKEVDEHE 305 +DVSGSM + K + K+ + LL L + VVY K Sbjct: 279 IDVSGSMSDAKKLPLVKQAFKLLVNQLRPVDRVAIVVYAGAAGLVLPSTSGDHKTAILDA 338 Query: 306 FFYSQETGGTIVSSALKLMDEVVKERYNPAQWNIYAAQASDGD-NWADDSPLCHEILAKK 364 + G T ++L + E + N A+DGD N S + + +K Sbjct: 339 LDKLEAGGSTAGGEGVQLAYKTATEYLLKSGNNRVII-ATDGDFNVGPSSDGELQRIIEK 397 Query: 365 LLPV---VRYYSYIEITRRAHQTLWREYEHLQSTFDNFAMQHIRDQDDIYPVFRELFHKQ 421 + + + + D + R F + Sbjct: 398 KREKGIFLSVLGFG--MGNYKDNKLELLADKGNGNYAYI--------DNFEEARRTFATE 447 Query: 422 NATAK 426 Sbjct: 448 FGGTL 452 >UniRef50_B4DA43 von Willebrand factor type A n=1 Tax=Chthoniobacter flavus Ellin428 RepID=B4DA43_9BACT Length = 883 Score = 248 bits (632), Expect = 4e-64, Method: Composition-based stats. Identities = 48/301 (15%), Positives = 87/301 (28%), Gaps = 30/301 (9%) Query: 135 PNLKQNQQRQLTEYKTHRAGYTANGVPANISVVRSLQNSLARRTAMTAGKRRELHALEEN 194 L +N + E +I V + + R +EE Sbjct: 319 DTLTENAFLNVPEN---------PLSTFSIDVDTASYAIVRRYLNDNHLPPTGAVRIEEL 369 Query: 195 LAIISNSEPAQLLEEERLRKEIAELRAKIERVPFIDTFDLRYKNYEKRPDPSSQAVMFCL 254 L P + + L+ + K P S + L Sbjct: 370 LNYFPYDYPQPQ-GAAPFSATMEVATCPWAPEHRLVRVGLKGREIPKDERPPSN--LVFL 426 Query: 255 MDVSGSMDQSTK-DMAKRFYILLYLFLSRTYKNVEVVYIR------HHTQAKEVDEHEFF 307 +DVSGSM+ K + ++ + LL L + V Y TQ KE + Sbjct: 427 IDVSGSMNMPNKLPLLQKCFSLLVEQLGPKDRVSIVTYASGTKLVLEPTQDKEAMQTAID 486 Query: 308 YSQETGGTIVSSALKLMDEVVKERYNPAQWNIYAAQASDGD-NWADDSPLCHEILAKKLL 366 GGT SS + L + ++ + P N A+DGD N + + + Sbjct: 487 GLHAGGGTHGSSGIDLAYRMAQQSFIPGGTNRVIL-ATDGDWNIGITNQSELLSMITRKA 545 Query: 367 PVVRYYSY-IEITRRAHQTLWREYEHLQSTFDNFAMQHIRDQDDIYPVFRELFHKQNATA 425 + + ++ + + + D R++F Q ++ Sbjct: 546 KSGVFLTVLGFGLDNLKDSMLVKLADHGNGHYAYI--------DTEQEARKVFVDQLSST 597 Query: 426 K 426 Sbjct: 598 L 598 >UniRef50_A4C5H3 Von Willebrand factor type A domain protein n=1 Tax=Pseudoalteromonas tunicata D2 RepID=A4C5H3_9GAMM Length = 608 Score = 246 bits (627), Expect = 1e-63, Method: Composition-based stats. Identities = 39/280 (13%), Positives = 80/280 (28%), Gaps = 26/280 (9%) Query: 160 VPANISVVRSLQNSLARRTAMTAGKRRELHALEENLAIISNSEPAQLLEEERLRKEIAEL 219 +I V ++ R M + E + A E Sbjct: 155 STFSIDVDTGSYSNSRRMIKMGKRPPADAVREEAFINYFDYHYSAPKSLETPFNVHTEVA 214 Query: 220 RAKIERVPFIDTFDLRYKNYEKRPDPSSQAVMFCLMDVSGSMDQSTK-DMAKRFYILLYL 278 A + ++ + EK ++ + L+DVSGSM+ K + K +L Sbjct: 215 PAPWNNQRQLLKIGIKGFDIEKAELKAAN--LVFLLDVSGSMNAPDKLPLLKSSLTMLTK 272 Query: 279 FLSRTYKNVEVVYIRHHTQAK--------EVDEHEFFYSQETGGTIVSSALKLMDEVVKE 330 L VVY +V + G T + ++L ++ + Sbjct: 273 QLDENDSVAIVVYAGAAGLVLPATKGNEYQVISNALNNLSAGGSTNGAQGIELAYQIASQ 332 Query: 331 RYNPAQWNIYAAQASDGD-NWADDSPLCHEILAKKLLP---VVRYYSYIEITRRAHQTLW 386 + N A+DGD N S + L + + + + L Sbjct: 333 NFKKEGINRVIL-ATDGDFNVGMSSVDALKKLIANKRKTGIALTTLGFGQ--GNYNDGLM 389 Query: 387 REYEHLQSTFDNFAMQHIRDQDDIYPVFRELFHKQNATAK 426 + ++ + + D R++ + ++ Sbjct: 390 EQLANIGNGQHAYI--------DTINEARKVLVDELSSTM 421 >UniRef50_C6VVX3 von Willebrand factor type A n=17 Tax=Bacteria RepID=C6VVX3_DYAFD Length = 625 Score = 245 bits (624), Expect = 3e-63, Method: Composition-based stats. Identities = 42/278 (15%), Positives = 86/278 (30%), Gaps = 23/278 (8%) Query: 160 VPANISVVRSLQNSLARRTAMTAGKRRELHALEENLAIISNSEPAQLLEEERLRKEIAEL 219 ++ V R+ +++ R + +EE + P E + Sbjct: 163 TTFSVDVDRAAYSNVRRFLNNGQMPPEDAVRIEEMINYFDYDYPQPRGEH-PVAIVAETT 221 Query: 220 RAKIERVPFIDTFDLRYKNYEKRPDPSSQAVMFCLMDVSGSMDQSTK-DMAKRFYILLYL 278 + + L+ K +S + L+DVSGSM+++ K + K+ + LL Sbjct: 222 DSPWNPGLKLVHIGLQAKTVSAENLSASN--LVFLIDVSGSMNEANKLPLLKQAFKLLAD 279 Query: 279 FLSRTYKNVEVVYIRH--------HTQAKEVDEHEFFYSQETGGTIVSSALKLMDEVVKE 330 L K V Y K+ + + G T ++L ++ K+ Sbjct: 280 QLRVEDKISIVAYAGSAGMVLAPTSGSEKKTIKDALDKLEAGGSTAGGEGIELAYDLAKK 339 Query: 331 RYNPAQWNIYAAQASDGD-NWADDSPLCHEILAKKLLPVVRYYSY-IEITRRAHQTLWRE 388 + P N A+DGD N + + L ++ + S + Sbjct: 340 HFLPKGNNRVIL-ATDGDFNVGISNESELQKLIEEKRKAGIFLSVMGFGMGNYKDSHVET 398 Query: 389 YEHLQSTFDNFAMQHIRDQDDIYPVFRELFHKQNATAK 426 + + D R++F ++ Sbjct: 399 LADKGNGNYAYI--------DNIQEARKVFVQEFGGTL 428 >UniRef50_C1XT76 Uncharacterized conserved protein n=1 Tax=Meiothermus silvanus DSM 9946 RepID=C1XT76_9DEIN Length = 360 Score = 244 bits (623), Expect = 4e-63, Method: Composition-based stats. Identities = 88/404 (21%), Positives = 150/404 (37%), Gaps = 54/404 (13%) Query: 17 VNRQRFLRRYKAQIKQSISEAINKRSVTDVDSGESVSIPTEDISEPMFHQGRGGLRHRVH 76 + QRF + ++K+ E + + G+ VSIP + P G + Sbjct: 6 RDLQRFKEIVRGEVKKRAREFLTREEYLGSLDGQVVSIPLPQLELPRLQYGHNEMGQG-- 63 Query: 77 PGNDHFVQNDRIERPQGGGGGSGSGQGQASQDGEGQDEFVFQISKDEYLDLLFEDLALPN 136 E G G G G V ++ +E+L+L+ E L LP Sbjct: 64 ------------EGEGEGQGQGMGGTAGRGGLGPSGHVPVAEMDLEEFLELIGEALKLPR 111 Query: 137 LKQNQQRQLTEYKTHRAGYTANGVPANISVVRSLQNSLARRTAMTAGKRRELHALEENLA 196 L+ Q + E + G + R+L+ +L R + + + E Sbjct: 112 LEPKQGGAVEESSPKYTTLSRRGPESLRHARRTLRQALRRAIQSGIYRPEDPRLVPE--- 168 Query: 197 IISNSEPAQLLEEERLRKEIAELRAKIERVPFIDTFDLRYKNYEKRPDPSSQAVMFCLMD 256 D RY+ E +P P +QA + +D Sbjct: 169 ----------------------------------RDDYRYRAPEPKPRPQAQAALVFALD 194 Query: 257 VSGSMDQSTKDMAKRFYILLYLFLSRTYKNVEVVYIRHHTQAKEVDEHEFFYSQETGGTI 316 VSGSM+ + + + ++ + + + Y+ H +A EV E +FF +E GGT Sbjct: 195 VSGSMEGEQLRLVRILSYWITAWVKKHFPRLSRHYLLHDAEAWEVSEEDFFRLREGGGTR 254 Query: 317 VSSALKLMDEVVKERYNPAQWNIYAAQASDGDNWADDSPLCHEILAKKLLPVVRYYSYIE 376 +SS +KL +V++ RY +N Y +DGDNW DD+ E L K LLP + Y Y + Sbjct: 255 LSSGIKLAQQVLE-RYPAQLYNRYVYHFTDGDNWQDDTAEALETL-KALLPTLSLYGYAQ 312 Query: 377 ITRRAHQ-TLWREYEHLQSTFDNFAMQHIRDQDDIYPVFRELFH 419 + R Q + + A + ++ + + L Sbjct: 313 VRSRYGQGRFIDDLRSHFPSDPALATAELGGRESLPSALKRLLG 356 >UniRef50_A5WCP1 von Willebrand factor, type A n=2 Tax=Moraxellaceae RepID=A5WCP1_PSYWF Length = 571 Score = 244 bits (622), Expect = 5e-63, Method: Composition-based stats. Identities = 39/303 (12%), Positives = 87/303 (28%), Gaps = 23/303 (7%) Query: 137 LKQNQQRQLTEYKTHRAGYTANGVPA--NISVVRSLQNSLARRTAMTAGKRRELHALEEN 194 + QQ E + + T+ A +I ++ R ++ +EE Sbjct: 100 MAPKQQENYAEIEPNAVNATSEQAFATLSIDTDTGSYANVRRFLNQGQLPPKDAVRVEEL 159 Query: 195 LAIISNS-EPAQLLEEERLRKEIAELRAKIERVPFIDTFDLRYKNYEKRPDPSSQAVMFC 253 + + A+ + + I ++ ++ A + Sbjct: 160 INYFNYDFTAAKKQANAPFLVSTEVVNSPWHPTNQIVKVGIKAEDLLTAKQKQPPANLVF 219 Query: 254 LMDVSGSMDQSTK-DMAKRFYILLYLFLSRTYKNVEVVYIRH--------HTQAKEVDEH 304 L+DVSGSMD K +AK +L L + Y + + + Sbjct: 220 LVDVSGSMDTEDKLQLAKSSLKMLTKQLRAQDSITLITYAGNTKVVLPSTPGNQTQKILN 279 Query: 305 EFFYSQETGGTIVSSALKLMDEVVKERYNPAQWNIYAAQASDGD-NWADDSPLCHEILAK 363 +G T +A+KL + E + N +DGD N S + + Sbjct: 280 AIDNLTASGSTNGEAAIKLAYQQATEHFKKDGINR-ILMLTDGDFNVGVSSVKDMLQIIR 338 Query: 364 KLL-PVVRYYSYIEITRRAHQTLWREYEHLQSTFDNFAMQHIRDQDDIYPVFRELFHKQN 422 + + + + + + ++ D +++ + Sbjct: 339 SNRDKGISLSTLGFGQGNYNDHMMEQVADNGNGNYSYI--------DSLSEAKKVLIDEM 390 Query: 423 ATA 425 + Sbjct: 391 SAT 393 >UniRef50_C5BTM6 von Willebrand factor type A domain protein n=1 Tax=Teredinibacter turnerae T7901 RepID=C5BTM6_TERTT Length = 689 Score = 243 bits (621), Expect = 8e-63, Method: Composition-based stats. Identities = 47/313 (15%), Positives = 93/313 (29%), Gaps = 31/313 (9%) Query: 130 EDLALPN-LKQNQQRQLTEYKTHRAGYTA--NGVPANISVVRSLQNSLARRTAMTAGKRR 186 DL P+ L+ + T+ T +I V + + + R+ ++ Sbjct: 209 PDLEPPHQLETADRDHFDTVATNPIKVTREEPVSTFSIDVDTASYSFVRRQLNRGQLPQK 268 Query: 187 ELHALEENLAIISNSEPAQLLEEERLRKEIAELRAKIERVPFIDTFDLRYKNYEKRPDPS 246 LEE + P + I + A + + ++ Sbjct: 269 AAVRLEEMVNYFPYDYPLPSAATAPFKPTITVIPAPWNQAKRLVHIGIKALPLAH----P 324 Query: 247 SQAVMFCLMDVSGSMDQSTK-DMAKRFYILLYLFLSRTYKNVEVVYIRHHTQAKEVDE-- 303 +A + L+DVSGSM K + K+ LL L T VVY E Sbjct: 325 PKANLVFLLDVSGSMGSPDKLPLVKQSMELLLSGLQPTDTVSIVVYAGAAGTVLEPTPVA 384 Query: 304 ------HEFFYSQETGGTIVSSALKLMDEVVKERYNPAQWNIYAAQASDGD-NWADDSPL 356 G T + ++L ++ + Y N A+DGD N P Sbjct: 385 EQQKILAALDRLNAGGSTAGAQGIELAYQLAEANYQRDAVNRIIL-ATDGDFNVGIADPE 443 Query: 357 CHEILAKKLLPV---VRYYSYIEITRRAHQTLWREYEHLQSTFDNFAMQHIRDQDDIYPV 413 + ++ + + + + L ++ + + D Sbjct: 444 QLKGYVERKRANGIELSILGFG--SGNYNDALMQQLAQNGNGVAAYI--------DTLSE 493 Query: 414 FRELFHKQNATAK 426 +++ +Q + Sbjct: 494 AQKVLVEQASGTL 506 >UniRef50_Q7UP85 Putative uncharacterized protein n=1 Tax=Rhodopirellula baltica RepID=Q7UP85_RHOBA Length = 885 Score = 241 bits (616), Expect = 2e-62, Method: Composition-based stats. Identities = 51/410 (12%), Positives = 102/410 (24%), Gaps = 57/410 (13%) Query: 31 KQSISEAINKRSVTDVDSGESVSIPTEDISEPMFHQGRGGLRHRVHPGNDHFVQNDRIER 90 K A + S G+ P + R R + + + Sbjct: 316 KDEAPTAPREPSAGKPVVGDFAVAPVPE-QLGRQQFDFRASRGRTLE--RQLGETEELA- 371 Query: 91 PQGGGGGSGSGQGQASQDGEGQDEFVFQISKDEYLDLLFEDLALPNLKQNQQRQLTEYKT 150 P G G F+ +++N+ R++ + Sbjct: 372 PTSDRLAILPPTPDGEGQGPGMSGDKFE-----------------PIQENEFRRVADDA- 413 Query: 151 HRAGYTANGVPANISVVRSLQNSLARRTAMTAGKRRELHALEENLAIISNSEPAQLLEEE 210 +I V + + R + +EE + E+ Sbjct: 414 --------LSTFSIDVDTASYAKVRSYLQRGQLPRPDSVRIEELINYFDYQYTPPSAEDP 465 Query: 211 -RLRKEIAELRAKIERVPFIDTFDLRYKNYEKRPDPSSQAVMFCLMDVSGSMDQSTK-DM 268 +A + ++ K+ +++ P + L+D SGSM + K + Sbjct: 466 VPFSSAMAVASCPWNENNRLVRVGIQAKDIDRKERPRCN--LVFLIDTSGSMKRPNKLPL 523 Query: 269 AKRFYILLYLFLSRTYKNVEVVYIRHHTQA--------KEVDEHEFFYSQETGGTIVSSA 320 +L L + VVY K+ G T + Sbjct: 524 VIEGMKVLLDQLKNRDRVAIVVYAGSSGLVLDSTPVKQKKKIIRALSALSAGGSTNGGAG 583 Query: 321 LKLMDEVVKERYNPAQWNIYAAQASDGD-NWADDSPLCHEILAKKLLPV---VRYYSYIE 376 L+L + +E + N SDGD N A + + + Sbjct: 584 LQLAYQTARENFIEDGVNRVIL-CSDGDFNVGMTGTDQLVAEATRQSKSGTELTVLGFG- 641 Query: 377 ITRRAHQTLWREYEHLQSTFDNFAMQHIRDQDDIYPVFRELFHKQNATAK 426 + + + + F D +++ Q A Sbjct: 642 -MGNHNDAMMERISNSGAGNYAFV--------DTIAEAKKVLADQVAGTL 682 >UniRef50_A3ZT14 Putative uncharacterized protein n=1 Tax=Blastopirellula marina DSM 3645 RepID=A3ZT14_9PLAN Length = 616 Score = 240 bits (613), Expect = 7e-62, Method: Composition-based stats. Identities = 53/386 (13%), Positives = 98/386 (25%), Gaps = 52/386 (13%) Query: 53 SIPTEDISEPMFHQGRGGLRHRVHPGNDHFVQNDRIERPQGGGGGSGSGQGQASQDGEGQ 112 + + R P + ++ G G G G Sbjct: 87 RVAGREKEAGKVRSDARQDRLATLPTESRRLGIEQPNAAPGFMPQLDGIAGHGEGPGVGG 146 Query: 113 DEFVFQISKDEYLDLLFEDLALPNLKQNQQRQLTEYKTHRAGYTANGVPANISVVRSLQN 172 D+F + E RA +I V + + Sbjct: 147 DKFAYV----------------------------ENNPFRAVADEPLSTFSIDVDTASYS 178 Query: 173 SLARRTAM-TAGKRRELHALEENLAIISNSEPAQLLEEERLRKEIAELRAKIERVPFIDT 231 + + +EE + + +++ + + Sbjct: 179 KIRSYLIDYHQLPPQGAVRVEELINYFTYDYATPT-DQKPFAANVEAAACPWNAEHRLVR 237 Query: 232 FDLRYKNYEKRPDPSSQAVMFCLMDVSGSMDQSTK-DMAKRFYILLYLFLSRTYKNVEVV 290 ++ K P+S + L+DVSGSM+ + K + K+ LL L K VV Sbjct: 238 IGIKGKEIANAERPASN--LVFLLDVSGSMNNARKLPLLKQGMKLLVDQLGENDKVAIVV 295 Query: 291 YIRH--------HTQAKEVDEHEFFYSQETGGTIVSSALKLMDEVVKERYNPAQWNIYAA 342 Y + K Q G T ++L + E + N Sbjct: 296 YAGAAGMVLNSTNGDDKSTIMEALDRLQAGGSTNGGQGIELAYQAATENFIKGGVNRVIL 355 Query: 343 QASDGD-NWADDSPLCHEILAKKLLPVVRYYSY-IEITRRAHQTLWREYEHLQSTFDNFA 400 +DGD N S +A + S T + + E + F Sbjct: 356 -CTDGDFNVGVTSTSDLVTMAADKAKSGVFLSVMGFGTGNHNDAMMEELSGKANGNYAFI 414 Query: 401 MQHIRDQDDIYPVFRELFHKQNATAK 426 D +++ +Q + Sbjct: 415 --------DTITEAKKVLVEQMSGTL 432 >UniRef50_C0CVB5 Putative uncharacterized protein n=1 Tax=Clostridium asparagiforme DSM 15981 RepID=C0CVB5_9CLOT Length = 556 Score = 240 bits (611), Expect = 1e-61, Method: Composition-based stats. Identities = 53/346 (15%), Positives = 94/346 (27%), Gaps = 23/346 (6%) Query: 92 QGGGGGSGSGQGQASQDGEGQDEFVFQISKDEYLDLLFEDLALPNLKQNQQRQLTEYKTH 151 GGG + S + G ++ ++ + E P ++ Sbjct: 25 GAGGGKTASATEAEVKAEAGSYASETMAAQSQWDGAVMEAEGPPLSHNTEEYNYIAENAF 84 Query: 152 RAGYTANGVPANISVVRSLQNSLARRTAMTAGKRRELHALEENLAIISNSEPAQLLEEER 211 A A V + +L R+ + +EE L + P E+E Sbjct: 85 LAVANAPLSTFAADVDTASYANLRRKILEGNEVPADAVRIEEMLNYFTYDYPEPT-EDEP 143 Query: 212 LRKEIAELRAKIERVPFIDTFDLRYKNYEKRPDPSSQAVMFCLMDVSGSMDQSTKD-MAK 270 + L+ + + S + L+DVSGSM+ + K + K Sbjct: 144 FSVTTYIGDCPWNENHKLLQIGLQAEKPDLENQKPSN--LVFLIDVSGSMESADKLGLVK 201 Query: 271 RFYILLYLFLSRTYKNVEVVYIRHHTQA--------KEVDEHEFFYSQETGGTIVSSALK 322 R ++LL L V Y T K G T S ++ Sbjct: 202 RAFLLLTENLRPEDTVSIVTYASSDTVVLDGVSGEEKAAIMTAIENLTAGGSTDGSKGIE 261 Query: 323 LMDEVVKERYNPAQWNIYAAQASDGD-NWADDSPLCHEILAKKLLPVVRYYSY-IEITRR 380 + +E + N A+DGD N S L +K + S T Sbjct: 262 TAYRLAEEHFQKDGNNRVIL-ATDGDLNLGLTSEGDLTRLIQKKKESGVFLSVMGFGTGN 320 Query: 381 AHQTLWREYEHLQSTFDNFAMQHIRDQDDIYPVFRELFHKQNATAK 426 + + D + + ++ Sbjct: 321 IKDNKMEALADNGNGQYAYV--------DSLMEAKRVLVEELGGTL 358 >UniRef50_Q4KKB4 von Willebrand factor type A domain protein n=20 Tax=Proteobacteria RepID=Q4KKB4_PSEF5 Length = 582 Score = 239 bits (610), Expect = 1e-61, Method: Composition-based stats. Identities = 40/301 (13%), Positives = 81/301 (26%), Gaps = 23/301 (7%) Query: 138 KQNQQRQLTEYKTHRAGYTANGVPANISVVRSLQNSLARRTAMTAGKRRELHALEENLAI 197 + +Q Q + A + V ++ R + LEE + Sbjct: 92 EPREQYQKLPDNPIHSVAEAPVSTFSADVDTGAYANVRRLLNQGSLPPEGAVRLEELVNY 151 Query: 198 ISNSEPAQLLEEERLRKEIAELRAKIERVPFIDTFDLRYKNYEKRPDPSSQAVMFCLMDV 257 + + + ++ + + + L+DV Sbjct: 152 FPYDYALPT-DGSPFGVTTELAPSPWNPHTRLLRIGIKASDRAVAELAPAN--LVFLVDV 208 Query: 258 SGSMDQST-KDMAKRFYILLYLFLSRTYKNVEVVYIRHHTQAKEVD--------EHEFFY 308 SGSMD+ + K LL L + VVY E Sbjct: 209 SGSMDRREGLPLVKSTLKLLVDQLRDQDRVSLVVYAGESRVVLEPTSGRDKAKIRTAIDQ 268 Query: 309 SQETGGTIVSSALKLMDEVVKERYNPAQWNIYAAQASDGD-NWADDSPLCHEILAKKLLP 367 G T +S ++L ++ ++ + N A+DGD N + +A + Sbjct: 269 LTAGGSTAGASGIQLAYQMAQQGFIDQGINR-ILLATDGDFNVGVSDFDSLKAMAAEKRK 327 Query: 368 -VVRYYSYIEITRRAHQTLWREYEHLQSTFDNFAMQHIRDQDDIYPVFRELFHKQNATAK 426 V + ++ L + + D R++ Q ++ Sbjct: 328 SGVSLTTLGFGVDNYNEHLMEQLADAGDGNYAYI--------DNLREARKVLVDQLSSTL 379 Query: 427 G 427 Sbjct: 380 A 380 >UniRef50_A1ZHE2 Von Willebrand factor type A domain protein n=1 Tax=Microscilla marina ATCC 23134 RepID=A1ZHE2_9SPHI Length = 704 Score = 237 bits (605), Expect = 5e-61, Method: Composition-based stats. Identities = 46/342 (13%), Positives = 97/342 (28%), Gaps = 43/342 (12%) Query: 105 ASQDGEGQDEFVFQISKDEYLDLLFEDLALPNLKQNQQRQLTEYKTHRAGYTANGVPANI 164 + D +F K + + E + +NQ Q+ + +I Sbjct: 205 GAGDLSSDLKFDR---KAAFRNAFPEGERYATIYENQFYQVGQN---------PLSTFSI 252 Query: 165 SVVRSLQNSLARRTAMTAGKRRELHALEENLAIISNSEPAQLLEEE--------RLRKEI 216 V + +++ R + +EE + P ++ Sbjct: 253 DVDNASYSNVRRFVNDGQPLPKNAVRVEEMINYFEYDYPQPTPTKDKEGKLQTHPFSVNT 312 Query: 217 AELRAKIERVPFIDTFDLRYKNYEKRPDPSSQAVMFCLMDVSGSMDQSTK-DMAKRFYIL 275 + L+ +N + + + + L+D SGSMD K + KR + + Sbjct: 313 EYGTCPWNPHHKLLQIGLQGENLQTKNASPAN--LVFLVDASGSMDSEDKLPLLKRSFKV 370 Query: 276 LYLFLSRTY-KNVEVVYIRHHTQA--------KEVDEHEFFYSQETGGTIVSSALKLMDE 326 L L+ + K V Y +E + G T ++L + Sbjct: 371 LLKQLTDSRTKIAIVAYAGASGLVLPATSVSHREKILTALENIESGGSTAGGEGIELAYK 430 Query: 327 VVKERYNPAQWNIYAAQASDGD-NWADDSPLCHEILAKKLLP-VVRYYSYIEITRRAHQT 384 + ++ + N A+DGD N S L V T + + Sbjct: 431 IAQQAFIAGGNNRVIL-ATDGDFNVGLSSDEELMQLISNKRKSGVYLTCLGFGTGNLNDS 489 Query: 385 LWREYEHLQSTFDNFAMQHIRDQDDIYPVFRELFHKQNATAK 426 + + + + + D +++ K Sbjct: 490 MMEKLTNAGNGNYYYI--------DGINEAKKVLAKNLTGTL 523 >UniRef50_UPI0001C375AE von Willebrand factor type A n=1 Tax=Ruminococcus flavefaciens FD-1 RepID=UPI0001C375AE Length = 550 Score = 237 bits (604), Expect = 6e-61, Method: Composition-based stats. Identities = 47/360 (13%), Positives = 99/360 (27%), Gaps = 45/360 (12%) Query: 96 GGSGSGQGQASQDGEGQDEFVFQISKDEYLDLLFEDLALPNLKQNQ-------QRQLTEY 148 G + S E ++ S DE + E + + + +L Sbjct: 25 GDNMHDTASGSYKEESSAYEYYEESADE--EFAPEYFSTDDYAPEGDYYYWEEEPELPSA 82 Query: 149 KTHRAGYTANG---------VPANISVVRSLQNSLARRTAMTAGKRRELHALEENLAIIS 199 GYT G + V + ++ R + +EE + Sbjct: 83 NEEYKGYTEAGFKDTKSEPLSTFSADVDTASYTNVRRLIENRNIVPEDAVRIEEFINYFD 142 Query: 200 NSEPAQLLEEERLRKEIAELRAKIERVPFIDTFDLRYKNYEKRPDPSSQAVMFCLMDVSG 259 P + + + R + ++ K +++ P S + L+D SG Sbjct: 143 YDYPQPE-DGSAFGRYVEIADCPWNRDHKLMMVGIQGKELQQQETPPSN--LVFLIDSSG 199 Query: 260 SMDQSTK-DMAKRFYILLYLFLSRTYKNVEVVYIRHH--------TQAKEVDEHEFFYSQ 310 SM+ K + + + +L L + + V Y + + + Sbjct: 200 SMNSYDKLPLVQSAFSMLAEQLDKNDRISIVTYAGSSAVLLDGEKGSNTDEILEQLYSIT 259 Query: 311 ETGGTIVSSALKLMDEVVKERYNPAQWNIYAAQASDGD-NWADDSPLCHEILAKKLLP-- 367 +G T +K E+ +E + N A+DGD N S L + Sbjct: 260 ASGSTNGEGGIKTAYELAEEHFIKGGNNRVIL-ATDGDLNVGASSEEELTRLIETKRDNG 318 Query: 368 -VVRYYSYIEITRRAHQTLWREYEHLQSTFDNFAMQHIRDQDDIYPVFRELFHKQNATAK 426 + + E + ++ D + ++ + Sbjct: 319 IYLSVLGFGE--GNYKDARMEALADNGNGNFSYI--------DSEDEAERVLVQEMSGTL 368 >UniRef50_A3D1E9 von Willebrand factor, type A n=8 Tax=Shewanella RepID=A3D1E9_SHEB5 Length = 642 Score = 236 bits (603), Expect = 8e-61, Method: Composition-based stats. Identities = 49/310 (15%), Positives = 87/310 (28%), Gaps = 28/310 (9%) Query: 130 EDLALPNLKQNQQRQLTEYKTHRAGYTANGVPANISVVRSLQNSLARRTAMTAGKRRELH 189 + A+P L QN+ Q + AG I V +L R + Sbjct: 123 DYAAIP-LAQNKFEQQVQNGIMVAG-EIPVSTFFIDVDTGSYATLRRMLREGRLPEKGTV 180 Query: 190 ALEENLAIISNSEPAQLLEEERLRKEIAELRAKIERVPFIDTFDLRYKNYEKRPDPSSQA 249 +EE L + P + + L+ + K +S Sbjct: 181 RVEEMLNYFAYDYPLPAKNAAPFSVTTELAPSPYNDDMMLLRIGLKGYDLPKSQLGASN- 239 Query: 250 VMFCLMDVSGSMDQSTK-DMAKRFYILLYLFLSRTYKNVEVVYIRH--------HTQAKE 300 + L+DVSGSM + K + + LL LS K VVY + Sbjct: 240 -LVFLLDVSGSMASADKLPLLQTALKLLTAQLSAQDKVSIVVYAGAAGVVLDGVSGNDTQ 298 Query: 301 VDEHEFFYSQETGGTIVSSALKLMDEVVKERYNPAQWNIYAAQASDGD-NWADDSPLCHE 359 + G + ++ K+ + P N A+DGD N Sbjct: 299 TLTYALEQLSAGGSINGGQGITQAYQLAKKHFIPNGINRVIL-ATDGDFNVGVTDFDDLI 357 Query: 360 ILAKKLLP---VVRYYSYIEITRRAHQTLWREYEHLQSTFDNFAMQHIRDQDDIYPVFRE 416 L +K + + + L + + + D R+ Sbjct: 358 ALIEKEKDHGIGLTTLGFGL--GNYNDQLMEQLADKGNGNYAYI--------DTLNEARK 407 Query: 417 LFHKQNATAK 426 + + ++ Sbjct: 408 VLVDELSSTL 417 >UniRef50_C7N770 Uncharacterized protein containing a von Willebrand factor type A (VWA) domain n=1 Tax=Slackia heliotrinireducens DSM 20476 RepID=C7N770_SLAHD Length = 629 Score = 236 bits (602), Expect = 1e-60, Method: Composition-based stats. Identities = 49/370 (13%), Positives = 95/370 (25%), Gaps = 35/370 (9%) Query: 75 VHPGNDHFVQNDRIERPQGGGGGSGSGQGQASQDG---EGQDEFVFQISKDEYLDLLFED 131 + G + E P + S + +G + + + + D L E Sbjct: 99 IGVGVGTNLLGSNAEMPVAETKAASEDTMAGSANSYAPDGGLAYETDEAYETF-DTLDEG 157 Query: 132 LALPNLKQNQQRQLTEYKTHRAGYTANGVPANISVVRSLQNSLARRTAMTAGK---RREL 188 + + + + E + T + V + +L R Sbjct: 158 APMEDFNTEEYAAIEENGF-VSTVTRPLSTCSADVDTASYCNLRRMINDGYSLDEIPDGA 216 Query: 189 HALEENLAIISNSEPAQLLEEERLRKEIAELRAKIERVPFIDTFDLRYKNYEKRPDPSSQ 248 +EE L + R + + + +K S Sbjct: 217 VRIEEMLNYFHYDSGEPE-GNDLFAVRAESARCPWNDQTQLLV--MTFTASDKAQTASKG 273 Query: 249 AVMFCLMDVSGSMDQSTK-DMAKRFYILLYLFLSRTYKNVEVVYI--------RHHTQAK 299 + + L+D+SGSMD+ K D+ K + L L + V Y Sbjct: 274 SNLVFLIDISGSMDEPDKLDLLKDSFGTLLENLGPNDRVSIVTYAAGEDVLLEGASGDDT 333 Query: 300 EVDEHEFFYSQETGGTIVSSALKLMDEVVKERYNPAQWNIYAAQASDGD-NWADDSPLCH 358 + G T + L++ EV + Y N ASDGD N S Sbjct: 334 RKIMRALNRLEADGSTNGEAGLEMAYEVAERNYIEGGVNR-IVMASDGDLNVGITSESDL 392 Query: 359 EILAKKLLP---VVRYYSYIEITRRAHQTLWREYEHLQSTFDNFAMQHIRDQDDIYPVFR 415 ++ + + + T + ++ D Sbjct: 393 YDFVEEKRETGVYLSVLGFG--SGNYKDTKMETLADHGNGTYHYI--------DCVEEAE 442 Query: 416 ELFHKQNATA 425 + + Sbjct: 443 RVLGEDLTAN 452 >UniRef50_B0T5X0 von Willebrand factor type A n=1 Tax=Caulobacter sp. K31 RepID=B0T5X0_CAUSK Length = 592 Score = 236 bits (601), Expect = 1e-60, Method: Composition-based stats. Identities = 40/303 (13%), Positives = 86/303 (28%), Gaps = 22/303 (7%) Query: 135 PNLKQNQQRQLTEYKTHRAGYTANGVPANISVVRSLQNSLARRTAMTAGKRRELHALEEN 194 P ++ ++ + +I V + ++ R A + +EE Sbjct: 118 PPIRDTEKYPGAAANPVKRVAEEPVSTFSIDVDTAAYANVRRFLNEGAAPPHDALRVEEL 177 Query: 195 LAIISNSEPAQLLEEERLRKEIAELRAKIERVPFIDTFDLRYKNYEKRPDPSSQAVMFCL 254 + +E + + + + + + ++ + P + L Sbjct: 178 INYFDYGYARPTAQEPPFKPTVTVVPSPWSQDRQLMHIGVQGYATPRAGQPPLN--LVFL 235 Query: 255 MDVSGSMDQSTK-DMAKRFYILLYLFLSRTYKNVEVVYIRH--------HTQAKEVDEHE 305 +D SGSM + +AK+ +L L + V Y ++K Sbjct: 236 IDTSGSMSGPDRLPLAKKALNVLIDQLRPQDRVSMVAYAGSAGAVLSPTDGKSKLKMRCA 295 Query: 306 FFYSQETGGTIVSSALKLMDEVVKERYNPAQWNIYAAQASDGD-NWADDSPLCHEILAKK 364 + G T L+L + ++ +P N +DGD N P + Sbjct: 296 LTALRSGGSTAGGQGLELAYALARQNLDPKAVNRVILM-TDGDFNVGIADPTRLKDFVAD 354 Query: 365 LLP-VVRYYSYIEITRRAHQTLWREYEHLQSTFDNFAMQHIRDQDDIYPVFRELFHKQNA 423 V Y + T+ + + + D R+L Sbjct: 355 QRKSGVYLSVYGFGRGNYNDTMMQALAQNGNGTAAYV--------DGLQEARKLLRDDFD 406 Query: 424 TAK 426 +A Sbjct: 407 SAL 409 >UniRef50_B0CCM8 von Willebrand factor, type A domain protein n=4 Tax=Cyanobacteria RepID=B0CCM8_ACAM1 Length = 686 Score = 235 bits (600), Expect = 2e-60, Method: Composition-based stats. Identities = 59/386 (15%), Positives = 106/386 (27%), Gaps = 45/386 (11%) Query: 69 GGLRHRVHPGNDHFVQNDRIERPQGGGGGSGSGQGQASQDGEGQDEFVFQISKDEYLDLL 128 +R R P N F + R RP G Q A + Q +F S Sbjct: 121 SQVRQRRKPSNRRFGISPRRPRPTGLPPALTKAQ-PAPAETAAQSQFSRDQSGRMKSVAP 179 Query: 129 F---------------EDLALPNLKQNQQRQLTEYKTHRAGYTANGVPANISVVRSLQNS 173 + L LP + + +I V + ++ Sbjct: 180 PAGLAPPAPEPRFQDKDRLHLPGTFNTEDYKRINENPFFLPQRTPLSTFSIDVDTASYSN 239 Query: 174 LARRTAMTAGKRRELHALEENLAIISNSEPAQLLEEERLRKEIAELRAKIERVPFIDTFD 233 + R ++ LEE + + ++ A + Sbjct: 240 VRRFIRQGQLPPKDAVRLEELINYFDYGYASPKGDQ-PFSVSTEVATAPWNNQHKLVHIG 298 Query: 234 LRYKNYEKRPDPSSQAVMFCLMDVSGSMDQSTKD-MAKRFYILLYLFLSRTYKNVEVVYI 292 L+ K EK + + L+DVSGSM + K + K+ LL L + VVY Sbjct: 299 LKGKELEKEQ----PSNLVFLIDVSGSMKRPNKLALVKKSLCLLVHQLKPEDRVSLVVYA 354 Query: 293 R--------HHTQAKEVDEHEFFYSQETGGTIVSSALKLMDEVVKERYNPAQWNIYAAQA 344 K + + G T ++ +K+ ++ + + N A Sbjct: 355 GRAGIVLPSTPGTQKATIMNAIDRLEAGGSTAGAAGIKMAYDMAERHFLKNGNNRVIL-A 413 Query: 345 SDGD-NWADDSPLCHEILAKKLLPV---VRYYSYIEITRRAHQTLWREYEHLQSTFDNFA 400 +DGD N S E L ++ + Y T + + + Sbjct: 414 TDGDFNVGQSSDAELERLIEQKRDRGVFLTVLGYG--TGNYKDNKMELLANKGNGNYAYI 471 Query: 401 MQHIRDQDDIYPVFRELFHKQNATAK 426 D +++ Sbjct: 472 --------DTLLEAQKVLVNDLRGTL 489 >UniRef50_A9KS19 von Willebrand factor type A n=1 Tax=Clostridium phytofermentans ISDg RepID=A9KS19_CLOPH Length = 551 Score = 234 bits (597), Expect = 5e-60, Method: Composition-based stats. Identities = 39/300 (13%), Positives = 86/300 (28%), Gaps = 27/300 (9%) Query: 140 NQQRQLTEYKTHRAGYTANGVPANISVVRSLQNSLARRTAMTAGKRRELHALEENLAIIS 199 ++ + +++ + V + +++ R +EE L + Sbjct: 89 TEEYNAVIEQGYQSTKNHPLSTFSADVDTASYSNIRRMLKEGRRVDTGAVRIEEMLNYFN 148 Query: 200 NSEPAQLLEEERLRKEIAELRAKIERVPFIDTFDLRYKNYEKRPDPSSQAVMFCLMDVSG 259 + + ++ + + S + L+DVSG Sbjct: 149 YDYKLPEGD-SPFGITTELSDCPWNPDTKLFLAGIQTEKIDFSKSAPSN--LVFLIDVSG 205 Query: 260 SM-DQSTKDMAKRFYILLYLFLSRTYKNVEVVYIRHHTQA--------KEVDEHEFFYSQ 310 SM D+ + +R ++LL L+ + V Y + T KE ++ + Sbjct: 206 SMMDEDKLPLVQRAFLLLTENLTEKDRISIVTYAGNDTVVLSGAKGNQKEKIQNAITELE 265 Query: 311 ETGGTIVSSALKLMDEVVKERYNPAQWNIYAAQASDGD-NWADDSPLCHEILAKKLLPV- 368 G T S ++ ++ E Y N A+DGD N S L ++ Sbjct: 266 AGGSTFGSKGIETAYQLAMENYIEGGNNRVIL-ATDGDLNVGVTSESELTNLIEEKRKSG 324 Query: 369 --VRYYSYIEITRRAHQTLWREYEHLQSTFDNFAMQHIRDQDDIYPVFRELFHKQNATAK 426 + + T + + D R++ ++ Sbjct: 325 VALSVLGFG--TGNIKDNKMEALADHGNGNYAYI--------DSLMEARKVLVEEMGATL 374 >UniRef50_C8SEV7 von Willebrand factor type A n=4 Tax=Rhizobiales RepID=C8SEV7_9RHIZ Length = 718 Score = 231 bits (588), Expect = 4e-59, Method: Composition-based stats. Identities = 38/303 (12%), Positives = 88/303 (29%), Gaps = 22/303 (7%) Query: 135 PNLKQNQQRQLTEYKTHRAGYTANGVPANISVVRSLQNSLARRTAMTAGKRRELHALEEN 194 P + + Q + A +I V + + + R + + +EE Sbjct: 240 PQEENRDRVQDFKTNPVHAALEDPVSTFSIDVDTASYSFVRRSLKEGFVPQADTVRVEEM 299 Query: 195 LAIISNSEPAQLLEEERLRKEIAELRAKIERVPFIDTFDLRYKNYEKRPDPSSQAVMFCL 254 + ++ + + ++ + + P + + L Sbjct: 300 INYFPYDWKGPDSASTPFNSTVSVMPTPWNTHTKLMHVAIKGFDVKPTEQPKAN--LVFL 357 Query: 255 MDVSGSMDQSTK-DMAKRFYILLYLFLSRTYKNVEVVYIRHHTQA--------KEVDEHE 305 +DVSGSMD+ K + K + LL L V Y K+ + Sbjct: 358 IDVSGSMDEPDKLPLLKSAFRLLVSKLKADDTISIVTYAGDAGTVLMPTKIAEKDKILNA 417 Query: 306 FFYSQETGGTIVSSALKLMDEVVKERYNPAQWNIYAAQASDGD-NWADDSPLCHEILAKK 364 Q G T + +K ++ ++ + N A+DGD N + L ++ Sbjct: 418 IDNLQPGGSTAGEAGIKEAYKLAQQSFIKDGVNRVML-ATDGDFNVGQTDDDDLKRLIEQ 476 Query: 365 LLPVVRYYS-YIEITRRAHQTLWREYEHLQSTFDNFAMQHIRDQDDIYPVFRELFHKQNA 423 + S + + + + + + D ++ + + Sbjct: 477 ERKTGVFLSVFGFGRGNLNDEMMQTIAQNGNGTAAYI--------DTLAEAEKVLVEDAS 528 Query: 424 TAK 426 + Sbjct: 529 STL 531 >UniRef50_C0YQB8 von Willebrand factor, type A n=2 Tax=Flavobacteriaceae RepID=C0YQB8_9FLAO Length = 800 Score = 230 bits (587), Expect = 7e-59, Method: Composition-based stats. Identities = 47/303 (15%), Positives = 82/303 (27%), Gaps = 23/303 (7%) Query: 135 PNLKQNQQRQLTEYKTHRAGYTANGVPANISVVRSLQNSLARRTAMTAGKRRELHALEEN 194 P + N+ +I V + +++ R + +EE Sbjct: 327 PVTQNNESYDAFVENPFELTRNQPLSTFSIDVDNASYSNVRRMINNGQVVDKNAVRIEEM 386 Query: 195 LAIISNSEPAQLLEEERLRKEIAELRAKIERVPFIDTFDLRYKNYEKRPDPSSQAVMFCL 254 + P E A + L+ KN P+S + L Sbjct: 387 VNYFKYDYPQPKNEN-PFSINTEYSDAPWNPKHKLLKIGLQGKNLPMDKLPASN--LVFL 443 Query: 255 MDVSGSMDQSTK-DMAKRFYILLYLFLSRTYKNVEVVYIRHHTQA--------KEVDEHE 305 +DVSGSM K + K + +L L K VVY K+ Sbjct: 444 IDVSGSMSDENKLPLLKSSFKVLLNQLRPKDKVGIVVYAGSAGMVLPPTSAGEKDKIIEA 503 Query: 306 FFYSQETGGTIVSSALKLMDEVVKERYNPAQWNIYAAQASDGD-NWADDSPLCHEILAKK 364 Q G T + ++L ++ +E + N A+DGD N S + L + Sbjct: 504 LDRLQAGGSTAGGAGIELAYKLAQENFVKEGNNRVII-ATDGDFNVGTSSISDLKTLIED 562 Query: 365 LLPVVRYY-SYIEITRRAHQTLWREYEHLQSTFDNFAMQHIRDQDDIYPVFRELFHKQNA 423 + + + D + K+ A Sbjct: 563 RRKSGVFLTCLGFGMGNYKDNTLETLADKGNGNYAYI--------DNMQEANKFLGKEFA 614 Query: 424 TAK 426 + Sbjct: 615 GSM 617 >UniRef50_A5F9T1 von Willebrand factor, type A n=9 Tax=Bacteria RepID=A5F9T1_FLAJ1 Length = 709 Score = 230 bits (586), Expect = 7e-59, Method: Composition-based stats. Identities = 55/322 (17%), Positives = 96/322 (29%), Gaps = 30/322 (9%) Query: 118 QISKDEYLDLLFEDLALPNLKQNQQRQLTEYKTHRAGYTANGVPANISVVRSLQNSLARR 177 + D+ L+++ + LP Q E + TA +I V + ++ R Sbjct: 221 EQELDKKLNIIRPNPTLPT--QEDYDTFVENAFE-SPKTAPLSTFSIDVDNASYTNIRRF 277 Query: 178 TAMTAGKRRELHALEENLAIISNSEPAQLLEEERLRKEIAELRAKIERVPFIDTFDLRYK 237 ++ +EE + + P E + I L+ K Sbjct: 278 LNSGQEVPKDAVRVEEMVNFFKYNYPQPKNEH-PFSINTEYSDSPWNSQNKILKIGLQGK 336 Query: 238 NYEKRPDPSSQAVMFCLMDVSGSMDQSTK-DMAKRFYILLYLFLSRTYKNVEVVYIRH-- 294 N PSS + L+DVSGSM+ K + K+ +L L T K VVY Sbjct: 337 NIATNDLPSSN--LVFLIDVSGSMEDMNKLPLLKQSMKILVNELRPTDKVSIVVYAGAAG 394 Query: 295 ------HTQAKEVDEHEFFYSQETGGTIVSSALKLMDEVVKERYNPAQWNIYAAQASDGD 348 K+ + G T + ++L ++ E + N A+DGD Sbjct: 395 MVLPPTSGNEKKTIIKALDQLEAGGSTAGGAGIELAYKIATENFIKGGNNRVIL-ATDGD 453 Query: 349 -NWADDSPLCHEILAKKLLPV---VRYYSYIEITRRAHQTLWREYEHLQSTFDNFAMQHI 404 N S E L ++ + Y + + + Sbjct: 454 FNVGSSSNSDMEKLIEEKRKTGVFLTCLGYG--MGNYKDSKMEILADKGNGNYAYI---- 507 Query: 405 RDQDDIYPVFRELFHKQNATAK 426 D K+ + Sbjct: 508 ----DNIQEANRFLGKEFKGSM 525 >UniRef50_A8SXC8 Putative uncharacterized protein n=2 Tax=Clostridiales RepID=A8SXC8_9FIRM Length = 612 Score = 230 bits (585), Expect = 1e-58, Method: Composition-based stats. Identities = 37/328 (11%), Positives = 80/328 (24%), Gaps = 32/328 (9%) Query: 111 GQDEFVFQISKDEYLDLLFEDLALPNLKQNQQRQLTEYKTHRAGYTANGVPANISVVRSL 170 G V S Y ++ ++ ++ +N + + Sbjct: 101 GDTAMVTDTSNSMYSEVAYDTREYDSMTENGF---------VSTVDRPLSTFAADRDTAS 151 Query: 171 QNSLARRTAMTAGKRRELHALEENLAIISNSEPAQLLEEERLRKEIAELRAKIERVPFID 230 +++ + +EE L + + + E+ + + Sbjct: 152 YSNVRSYIESGSLPPDGAVRIEEMLNYFTYDYRKKPEDGEKFSIYTEYSDCPWNKDTKLM 211 Query: 231 TFDLRYKNYEKRPDPSSQAVMFCLMDVSGSM-DQSTKDMAKRFYILLYLFLSRTYKNVEV 289 + + S + L+D SGSM D + + ++ + +L L + V Sbjct: 212 MVGINTDEIDFGDKKPSN--LVFLIDTSGSMYDDNKLPLVQQSFAMLAENLDENDRVSIV 269 Query: 290 VYIRHHTQAKE--------VDEHEFFYSQETGGTIVSSALKLMDEVVKERYNPAQWNIYA 341 Y T G T A+ E+ ++ + N Sbjct: 270 TYAGEDTVVLSGTPGSEQYTISEALSNMTAEGCTNGGDAIITAYELAEKNFINGGNNRVI 329 Query: 342 AQASDGD-NWADDSPLCHEILAKKLLPVVRYY--SYIEITRRAHQTLWREYEHLQSTFDN 398 A+DGD N S L + + T Sbjct: 330 L-ATDGDLNVGLTSESDLVDLITEEKKENNIFLSVLGFGTDNLKDNKLEALADNGDGSYA 388 Query: 399 FAMQHIRDQDDIYPVFRELFHKQNATAK 426 F D +++ + Sbjct: 389 FI--------DSAYEAKKVLVDEMGGTL 408 >UniRef50_A1ZU67 Von Willebrand factor type A domain protein n=1 Tax=Microscilla marina ATCC 23134 RepID=A1ZU67_9SPHI Length = 552 Score = 229 bits (583), Expect = 2e-58, Method: Composition-based stats. Identities = 47/310 (15%), Positives = 93/310 (30%), Gaps = 26/310 (8%) Query: 128 LFEDLALPNLKQNQQRQLTEYKTHRAGYTANGVPANISVVRSLQNSLARRTAMTAGKRRE 187 L ED++ P +K+ + + + + TA +I V + + + Sbjct: 75 LEEDVSPPKIKEKKPANENTFLSVK---TAPLSTFSIDVDNASYSRARKSINNGQLPSTS 131 Query: 188 LHALEENLAIISNSEPAQLLEEERLRKEIAELRAKIERVPFIDTFDLRYKNYEKRPDPSS 247 LEE + + + + + L+ K + R S Sbjct: 132 SVRLEEFINYFNYQYKQPE-GQHPFSVNTEVAKCPWNPKNHLVHIGLQGKRLDSRKLKLS 190 Query: 248 QAVMFCLMDVSGSMDQSTK-DMAKRFYILLYLFLSRTYKNVEVVYIRHH--------TQA 298 + L+DVSGSM K + ++ + +L L + VVY + Sbjct: 191 N--LVFLIDVSGSMSAPDKLPLLRKAFKMLVNNLGEEDRVAIVVYAGNAGLVLPATQGTD 248 Query: 299 KEVDEHEFFYSQETGGTIVSSALKLMDEVVKERYNPAQWNIYAAQASDGD-NWADDSPLC 357 K+ Q G T + +KL ++ K+ + N A+DGD N S Sbjct: 249 KQKIMEALDKLQSGGSTAGGAGIKLAYKIAKQNFIKEGNNRIIL-ATDGDFNLGASSDQA 307 Query: 358 HEILAKKLLPVVRYYSY-IEITRRAHQTLWREYEHLQSTFDNFAMQHIRDQDDIYPVFRE 416 + L ++ + + + + + D + Sbjct: 308 MQNLIEEKRKEGVFITVLGLGMGNYRDSKMEIIADKGNGNYYYL--------DNLNEAYK 359 Query: 417 LFHKQNATAK 426 +F K Sbjct: 360 VFGKDLKGTL 369 >UniRef50_A9DBX8 Putative uncharacterized protein n=1 Tax=Hoeflea phototrophica DFL-43 RepID=A9DBX8_9RHIZ Length = 668 Score = 229 bits (583), Expect = 2e-58, Method: Composition-based stats. Identities = 40/301 (13%), Positives = 85/301 (28%), Gaps = 31/301 (10%) Query: 146 TEYKTHRAGYTANGVPA---------NISVVRSLQNSLARRTAMTAGKRRELHALEENLA 196 + G+ +NGV + + V + + R +EE + Sbjct: 195 EAERDRVEGFDSNGVRSVAEYPVSTFSADVDTASYAMVRRALKQGVMPDPRTVRIEEMVN 254 Query: 197 IISNSEPAQLLEEERLRKEIAELRAKIERVPFIDTFDLRYKNYEKRPDPSSQAVMFCLMD 256 + PA E R + + ++ + + P + + L+D Sbjct: 255 YFNYDYPAPESVETPFRATVTVTPTPWNANTRLLHIGVKGYDVKPAARPQAN--LVLLVD 312 Query: 257 VSGSMDQSTK-DMAKRFYILLYLFLSRTYKNVEVVYIRHHTQAKEVDE--------HEFF 307 VSGSM ++ K + K + LL L V Y E Sbjct: 313 VSGSMQETDKLPLLKSAFRLLIQKLEPEDTVSIVTYAGDAGTVLEPTPASDKAKILDALD 372 Query: 308 YSQETGGTIVSSALKLMDEVVKERYNPAQWNIYAAQASDGD-NWADDSPLCHEILAKKLL 366 + G T ++ ++ + ++ N A+DGD N + L ++ Sbjct: 373 DLRPGGSTAGAAGIEEAYRLAEKARVNGGVNRVLL-ATDGDFNVGASDDDALKSLIEEKR 431 Query: 367 PVVRYYS-YIEITRRAHQTLWREYEHLQSTFDNFAMQHIRDQDDIYPVFRELFHKQNATA 425 + S + + L + + + D + ++ + Sbjct: 432 ESGVFLSIFGFGQGNYNDQLMQTLAQNGNGVAAYI--------DTLAEAEKTLAQEATAS 483 Query: 426 K 426 Sbjct: 484 L 484 >UniRef50_C6BAR1 von Willebrand factor type A n=7 Tax=Alphaproteobacteria RepID=C6BAR1_RHILS Length = 706 Score = 228 bits (581), Expect = 3e-58, Method: Composition-based stats. Identities = 37/304 (12%), Positives = 87/304 (28%), Gaps = 27/304 (8%) Query: 137 LKQNQQRQLTEYKTHRAGYTANG-VPANISVVRSLQNSLARRTAMTAGKRRELHALEENL 195 L N++R + + V + + R A +EE + Sbjct: 227 LDPNRERFANAAANPIKSVATDPVSTFSADVDSASYAFVRRSLTGGAMPDPLSVRVEEMI 286 Query: 196 AIISNSEPAQLLEEERLRKEIAELRAKIERVPFIDTFDLRYKNYEKRPDPSSQAVMFCLM 255 P ++ + + + R + ++ + P + + L+ Sbjct: 287 NYFPYDWPGPNNADQPFKATVTVMPTPWNRDTELMHVAIKGYDIAPATTPRAN--LVFLI 344 Query: 256 DVSGSMDQSTK-DMAKRFYILLYLFLSRTYKNVEVVYIRHHTQAKEVDE--------HEF 306 DVSGSMD+ K + K + L+ L V Y + Sbjct: 345 DVSGSMDEPDKLPLLKSAFRLMVNRLKADDTVSIVTYAGNAGTVLAPTRVAEKSKILSAI 404 Query: 307 FYSQETGGTIVSSALKLMDEVVKERYNPAQWNIYAAQASDGD-NWADDSPLCHEILAKKL 365 + G T + ++ ++ K+ + N A+DGD N S + + ++ Sbjct: 405 DRLEPGGSTGGAEGIEAAYDLAKQGFVKDGVNRVML-ATDGDFNVGPSSDGDLKRIIEEK 463 Query: 366 LPV---VRYYSYIEITRRAHQTLWREYEHLQSTFDNFAMQHIRDQDDIYPVFRELFHKQN 422 + + + +L + + + D ++ ++ Sbjct: 464 RKDGIFLTVLGFG--RGNLNDSLMQTLAQNGNGSAAYI--------DTLAEAQKTLVEEA 513 Query: 423 ATAK 426 + Sbjct: 514 GSTL 517 >UniRef50_C8N8N5 von Willebrand factor type A domain protein n=2 Tax=Gammaproteobacteria RepID=C8N8N5_9GAMM Length = 563 Score = 224 bits (571), Expect = 4e-57, Method: Composition-based stats. Identities = 36/304 (11%), Positives = 85/304 (27%), Gaps = 24/304 (7%) Query: 136 NLKQNQQRQLTEYKTHRAGYTANGVPANISVVRSLQNSLARRTA-MTAGKRRELHALEEN 194 ++ ++ ++ A +I V +++ R + +EE Sbjct: 92 EVQNRERYAHSDANPVHRVSDAPVSTFSIDVDTGSYSNIRRMLTRENRLPPADAVRVEEI 151 Query: 195 LAIISNSEPAQLLEEERLRKEIAELRAKIERVPFIDTFDLRYKNYEKRPDPSSQAVMFCL 254 L + P + + + + + + ++ + P + + L Sbjct: 152 LNYFAYGYPLPQ-DGKPFAVHTQTVDSPWQADAKLIRIAIQAADLAPEKRPPAN--LVFL 208 Query: 255 MDVSGSMDQSTK-DMAKRFYILLYLFLSRTYKNVEVVYIRHH--------TQAKEVDEHE 305 +D SGSMD K + K+ L + + Y KE Sbjct: 209 IDTSGSMDDPDKLPLVKKTVCHFAEALRADDRISLITYSGSTAEILPPTAGDQKETIIAA 268 Query: 306 FFYSQETGGTIVSSALKLMDEVVKERYNPAQWNIYAAQASDGD-NWADDSPLCHEILAKK 364 + G T AL++ + + Y N A+DGD N P + Sbjct: 269 LKPLRAHGATAGGEALRMAYDAAAKNYRKDGINR-ILLATDGDFNVGISDPATLKNYVAD 327 Query: 365 LLP-VVRYYSYIEITRRAHQTLWREYEHLQSTFDNFAMQHIRDQDDIYPVFRELFHKQNA 423 + + + + + + ++ D +++ +Q Sbjct: 328 KRKSGISLTTLGYGSGNYNDEMMEQLADAGDGNYSYI--------DSEAEAKKVLVRQLT 379 Query: 424 TAKG 427 + Sbjct: 380 STLA 383 >UniRef50_Q2N8R4 Von Willebrand factor type A domain protein n=2 Tax=Erythrobacter RepID=Q2N8R4_ERYLH Length = 580 Score = 223 bits (568), Expect = 9e-57, Method: Composition-based stats. Identities = 41/302 (13%), Positives = 86/302 (28%), Gaps = 20/302 (6%) Query: 134 LPNLKQNQQRQLTEYKTHRAGYTANGVPANISVVRSLQNSLARRTAMTAGKRRELHALEE 193 +P + ++ E + ++ V + R + + EE Sbjct: 102 VPQPEDRERYDGEEVSPVKIAAVEPLSTFSVDVDTGAYANARRFLSQGQMPPKAAVRTEE 161 Query: 194 NLAIISNSEPAQLLEEERLRKEIAELRAKIERVPFIDTFDLRYKNYEKRPDPSSQAVMFC 253 + + R + L + E+ P + + Sbjct: 162 FINYFRYDYDRPQDRSQPFTVNFDAARTPWNEDTRLIRIGLAGYDIERSERPPAN--LVF 219 Query: 254 LMDVSGSMDQSTK-DMAKRFYILLYLFLSRTYKNVEVVYIRHHTQAKEVD------EHEF 306 LMDVSGSM + K + K L L K VVY E Sbjct: 220 LMDVSGSMGRPDKLPLVKTALAGLAGELQPQDKVSIVVYAGAAGLVLEPTNDTRKIRAAL 279 Query: 307 FYSQETGGTIVSSALKLMDEVVKERYNPAQWNIYAAQASDGD-NWADDSPLCHEILAKKL 365 Q G T + ++L ++ ++ + N A+DGD N S + +K Sbjct: 280 NQLQAGGSTAGGAGIQLAYQIAEDNFIEGGVNRVIL-ATDGDFNVGVSSRDALIEMIEKK 338 Query: 366 LP-VVRYYSYIEITRRAHQTLWREYEHLQSTFDNFAMQHIRDQDDIYPVFRELFHKQNAT 424 + + T ++ + + + + + D +++ + ++ Sbjct: 339 RDSGITLTTLGFGTGNYNEAMMEQIANHGNGNYAYI--------DSALEAKKVLGDEMSS 390 Query: 425 AK 426 Sbjct: 391 TL 392 >UniRef50_Q72KI5 Glycosyltransferase n=3 Tax=Thermus RepID=Q72KI5_THET2 Length = 351 Score = 223 bits (567), Expect = 1e-56, Method: Composition-based stats. Identities = 101/404 (25%), Positives = 160/404 (39%), Gaps = 64/404 (15%) Query: 17 VNRQRFLRRYKAQIKQSISEAINKRSVTDVDSGESVSIPTEDISEPMFHQGRGGLRHRVH 76 + RF + ++K+ + E + + + G VSIP + P Sbjct: 10 RDLLRFKEIVRGEVKKRVREFLTREELFGQVEGRLVSIPLPQLEIPKIVH---------- 59 Query: 77 PGNDHFVQNDRIERPQGGGGGSGSGQGQASQDGEGQDEFVFQISKDEYLDLLFEDLALPN 136 + G G G G G G V ++ +E+LDL+ E L LP Sbjct: 60 --------------GEPLGEGLGLGGPGEEALGPGGHIPVAELELEEFLDLVGEALRLPR 105 Query: 137 LKQNQQRQLTEYKTHRAGYTANGVPANISVVRSLQNSLARRTAMTAGKRRELHALEENLA 196 L+ + ++TE G V R+L+ SL R + + + E Sbjct: 106 LRPKGEGEVTEEALRHTTIARKGPRGLRHVRRTLKESLKRALQSGEYRPEDPLLVPE--- 162 Query: 197 IISNSEPAQLLEEERLRKEIAELRAKIERVPFIDTFDLRYKNYEKRPDPSSQAVMFCLMD 256 DLRYK ++P P +QAV+ +D Sbjct: 163 ----------------------------------REDLRYKAPRRKPIPHAQAVVLFALD 188 Query: 257 VSGSMDQSTKDMAKRFYILLYLFLSRTYKNVEVVYIRHHTQAKEVDEHEFFYSQETGGTI 316 VSGSM + + K + L++ R + +E Y+ H +A EV E EFF ++E GGT Sbjct: 189 VSGSMREEELKLVKTLSFWITLWIKRHFPRLERRYLLHDAEAWEVPEEEFFKAREGGGTR 248 Query: 317 VSSALKLMDEVVKERYNPAQWNIYAAQASDGDNWADDSPLCHEILAKKLLPVVRYYSYIE 376 +SSAL L +E++ + Y A +N Y SDG+NW D+PL E L +LLP + Y Y + Sbjct: 249 ISSALLLAEEIL-KAYPEAFYNRYLFHFSDGENWQGDTPLALEALR-RLLPSLALYGYAQ 306 Query: 377 ITRRAHQ-TLWREYEHLQSTFDNFAMQHIRDQDDIYPVFRELFH 419 + Q E + A+ +R ++D+ R L Sbjct: 307 VEGPYGQGHFLEEVREALGGREGVALAAVRGREDLPVALRRLLG 350 >UniRef50_P76481 Uncharacterized protein yfbK n=38 Tax=Enterobacteriaceae RepID=YFBK_ECOLI Length = 575 Score = 223 bits (567), Expect = 1e-56, Method: Composition-based stats. Identities = 41/289 (14%), Positives = 78/289 (26%), Gaps = 20/289 (6%) Query: 156 TANGVPANISVVRSLQNSLARRTAMTAGKRRELHALEENLAIISNSE------PAQLLEE 209 ++ V ++ R + +EE + + + Sbjct: 118 QNPLATFSLDVDTGSYANVRRFLNQGLLPPPDAVRVEEIVNYFPSDWDIKDKQSIPASKP 177 Query: 210 ERLRKEIAELRAKIERVPFIDTFDLRYKNYEKRPDPSSQAVMFCLMDVSGSM-DQSTKDM 268 A + D+ K+ + P+S + L+D SGSM + Sbjct: 178 IPFAMRYELAPAPWNEQRTLLKVDILAKDRKSEELPASN--LVFLIDTSGSMISDERLPL 235 Query: 269 AKRFYILLYLFLSRTYKNVEVVYIRHHTQA--------KEVDEHEFFYSQETGGTIVSSA 320 + LL L V Y A K G T + Sbjct: 236 IQSSLKLLVKELREQDNIAIVTYAGDSRIALPSISGSHKAEINAAIDSLDAEGSTNGGAG 295 Query: 321 LKLMDEVVKERYNPAQWNIYAAQASDGD-NWADDSPLCHEILAKKLLP-VVRYYSYIEIT 378 L+L + + + N A+DGD N D P E + KK V ++ Sbjct: 296 LELAYQQATKGFIKGGINR-ILLATDGDFNVGIDDPKSIESMVKKQRESGVTLSTFGVGN 354 Query: 379 RRAHQTLWREYEHLQSTFDNFAMQHIRDQDDIYPVFRELFHKQNATAKG 427 ++ + + + ++ Q + R++ K Sbjct: 355 SNYNEAMMVRIADVGNGNYSYIDTLSEAQKVLNSEMRQMLITVAKDVKA 403 >UniRef50_A9IU52 Putative lipoprotein n=3 Tax=Bordetella RepID=A9IU52_BORPD Length = 582 Score = 222 bits (566), Expect = 2e-56, Method: Composition-based stats. Identities = 37/305 (12%), Positives = 74/305 (24%), Gaps = 22/305 (7%) Query: 133 ALPNLKQNQQRQLTEYKTHRAGYTANGVPANISVVRSLQNSLARRTAMTAGKRRELHALE 192 A P ++ + A V ++ R + E Sbjct: 106 APPQAEERENYARYRDNPVVAAQEQPVSTFGADVDTGSYTNVRRLLNEGRLPPPDAVRAE 165 Query: 193 ENLAIISNSEPAQLLEEERLRKEIAELRAKIERVPFIDTFDLRYKNYEKRPDPSSQAVMF 252 E + ++ A + ++ + P++ + Sbjct: 166 EFINYFDYGYATPDSRQQPFSIITEVSAAPWNPQRQLLKIGIQGYRVAPQDIPAAN--LV 223 Query: 253 CLMDVSGSMDQSTK-DMAKRFYILLYLFLSRTYKNVEVVYIRH--------HTQAKEVDE 303 L+D SGSM + K + K L L + V Y K Sbjct: 224 FLVDTSGSMAERDKLPLIKGALKQLVAQLRPQDRVAIVTYAGQASMTLDSTPGDQKARIN 283 Query: 304 HEFFYSQETGGTIVSSALKLMDEVVKERYNPAQWNIYAAQASDGD-NWADDSPLCHEILA 362 + G T + L L + + N ASDGD N + Sbjct: 284 AAIDELRAAGSTNGGAGLDLAYAQAAKGFVKGGVNR-ILLASDGDFNVGATDLEDLKDKI 342 Query: 363 KKLLP-VVRYYSYIEITRRAHQTLWREYEHLQSTFDNFAMQHIRDQDDIYPVFRELFHKQ 421 + + + + L + + ++ D R++ Q Sbjct: 343 ARQRQGGIALTTLGVGGGNFNDALAMQLADAGNGSYHYL--------DSLREARKVLAAQ 394 Query: 422 NATAK 426 ++ Sbjct: 395 MSSTL 399 >UniRef50_UPI00016C54C8 hypothetical protein GobsU_21830 n=1 Tax=Gemmata obscuriglobus UQM 2246 RepID=UPI00016C54C8 Length = 638 Score = 222 bits (565), Expect = 2e-56, Method: Composition-based stats. Identities = 37/291 (12%), Positives = 75/291 (25%), Gaps = 26/291 (8%) Query: 147 EYKTHRAGYTANGVPANISVVRSLQNSLARRTAMTAGKRRELHALEENLAIISNSEPAQL 206 + R+ A + V + ++ R L E + S Sbjct: 172 QENEFRSPLVAALSTFSADVNTASYANVRRMLNEGTLPPASAVFLAEFVNYFPYSYAPPP 231 Query: 207 LEEERLRKEIAELRAKIERVPFIDTFDLRYKNYEKRPDPSSQAVMFCLMDVSGSMDQSTK 266 + + + + ++ P + L+D SGSM Q + Sbjct: 232 AGADPVAFHVEMGPCPWNAKHHLLRVGVQAHQIPAEKLPPRN--LVFLVDTSGSMQQENR 289 Query: 267 -DMAKRFYILLYLFLSRTYKNVEVVYIRHHTQAKEVDEHE--------FFYSQETGGTIV 317 + ++ LL L+ + V Y A Q GGT Sbjct: 290 LPLVQKSLELLVEKLTEKDRVSVVTYAGDSRVALPPTSGADKKAILDVVTGLQANGGTNG 349 Query: 318 SSALKLMDEVVKERYNPAQWNIYAAQASDGD-NWADDSPLCHEILAKKLLPV---VRYYS 373 +K + ++ + N +DGD N L ++ + Sbjct: 350 EGGIKKAYQFARDTFLDGGVNRVIL-CTDGDFNVGVVDNGELVKLIEEQRKSKVFLTVLG 408 Query: 374 YIEITRRAHQTLWREYEHLQSTFDNFAMQHIRDQDDIYPVFRELFHKQNAT 424 Y +E + + + D +++F +Q Sbjct: 409 YG--MGNYKDDRLKELANHGNGHHAYI--------DTLDEAKKVFVEQGGA 449 >UniRef50_Q9HRD6 UPF0229 protein VNG_0746C n=11 Tax=Halobacteriaceae RepID=Y746_HALSA Length = 442 Score = 221 bits (564), Expect = 3e-56, Method: Composition-based stats. Identities = 90/447 (20%), Positives = 170/447 (38%), Gaps = 56/447 (12%) Query: 17 VNRQRFLRRYKAQIKQSISEAINKRSVTDVDSGESVSIPTEDISEPMFHQGRGGLRHRVH 76 +R+RF + + +Q ++E I + ++V +P + + P F + R V Sbjct: 5 EDRERFHEIGEQR-RQDLAEFIQYGDLGGS-GPDAVRVPIKLVDLPAFEYDQ-LDRGGVG 61 Query: 77 PGNDHFVQNDRIERPQGGGGGSGSGQGQASQDGEGQDEFVFQISKDEYLDLLFEDLALPN 136 G+ D++ P +G G + D E+ +++ +E+ L E L L + Sbjct: 62 QGDVDP--GDQVGEP----DEAGEGDDDEAGDESADHEY-YEMDPEEFAAELDERLGL-D 113 Query: 137 LKQNQQRQLTEYKTHRAGYTANGVPANISVVRSLQNSLARRTAM---------------- 180 L ++ + E + G + + L R+ AM Sbjct: 114 LDPKGKKVVAETEGAFNETARRGPRGTLDFAHLYKQGLKRKIAMDFDEAYVTAALRVDGW 173 Query: 181 ---TAGKRRELHALEENLAIISNSEPAQLLEEERLRKEIAELRAKIERV----------- 226 + + A I+ + +++ R + A I+ + Sbjct: 174 GVDAVYTWAREQHIPVSRAWIAERARSPSPDDDAGRVVDDAVWASIDAMEAAVDVEPTRT 233 Query: 227 ---------PFIDTFDLRYKNYEKRPDPSSQAVMFCLMDVSGSMDQSTKDMAKRFYILLY 277 + D R+++ + V+ + DVSGSM +S +++ +R + L Sbjct: 234 RIRRGGPGRVPLRREDERFRHPKVVEHRERNVVVVNIRDVSGSMRESKRELVERTFTPLD 293 Query: 278 LFLSRTYKNVEVVYIRHHTQAKEVDEHEFFYSQETGGTIVSSALKLMDEVVKERYNPAQW 337 +L+ Y N E VYI H A EVD +FF Q GGT +S+A +L + V+ E Y ++W Sbjct: 294 WYLTGKYDNAEFVYIAHDADAWEVDRTDFFGIQSGGGTRISTAYELAENVLDE-YPFSEW 352 Query: 338 NIYAAQASDGDNWADDSPLCHEILAKKLLPVVRYYSYIEITRRAHQTLWREYEHLQSTF- 396 N Y A DG+N DD+ L + ++Y+E ++ F Sbjct: 353 NRYVFAAGDGENSHDDTEENVIPLMNDI--DANLHAYVETQPTDGVQTGTHAGKVRDAFG 410 Query: 397 --DNFAMQHIRDQDDIYPVFRELFHKQ 421 DN A+ + + DD+ + + Sbjct: 411 DTDNVAVTTVTEPDDVMGAIETILSTE 437 >UniRef50_Q1GJ99 Putative uncharacterized protein n=1 Tax=Ruegeria sp. TM1040 RepID=Q1GJ99_SILST Length = 300 Score = 220 bits (560), Expect = 8e-56, Method: Composition-based stats. Identities = 123/299 (41%), Positives = 183/299 (61%), Gaps = 13/299 (4%) Query: 139 QNQQRQLTEYKTHRAGYTANGVPANISVVRSLQNSLARRTAMTAG---KRRELHALEENL 195 + + T RAG T G P N+++VR+++NSL RR A+ +R+L A L Sbjct: 2 EKATVETETIGTRRAGLTTAGTPNNLNLVRTMRNSLGRRIALQRPSTQTQRDLEAQVAEL 61 Query: 196 AIISNSEPAQLLEEERLRKEIAELRAKIERVPFIDTFDLRYKNYEKRPDPSSQAVMFCLM 255 I P+Q L ++ ++ K V +ID DLRY + +S+AV+FCLM Sbjct: 62 EEIEARSPSQDELLAELVAKLDGIKRKRRVVGYIDPLDLRYDTFVPEKIRNSRAVVFCLM 121 Query: 256 DVSGSMDQSTKDMAKRFYILLYLFLSRTYKNVEVVYIRHHTQAKEVDEHEFFYSQETGGT 315 DVSGSM + KD+AKRF++LL+LFL+R Y++ E+V++RH A+EVDE FFY++ETGGT Sbjct: 122 DVSGSMQEREKDLAKRFFLLLHLFLTRGYEHTEIVFVRHTHYAQEVDEETFFYARETGGT 181 Query: 316 IVSSALKLMDEVVKERYNPAQWNIYAAQASDGDNWADDSPLCHEILAKKLLPVVRYYSYI 375 IVS+AL+ M E++ ERY P +WNIY AQASDG+N+ +DS C +IL ++LLP+ ++Y+Y+ Sbjct: 182 IVSTALEKMKEIIDERYPPDEWNIYGAQASDGENFGNDSVRCRKILTEQLLPMCQFYAYV 241 Query: 376 EI----------TRRAHQTLWREYEHLQSTFDNFAMQHIRDQDDIYPVFRELFHKQNAT 424 EI A + LW+ Y ++ +F MQ + + IYP+FRE F + Sbjct: 242 EIVEESAQMLLDNTEAGEDLWQNYRQVKEACRHFEMQRVSEPGHIYPIFREFFLPKVKG 300 >UniRef50_A0NVX5 Putative uncharacterized protein n=1 Tax=Labrenzia aggregata IAM 12614 RepID=A0NVX5_9RHOB Length = 608 Score = 220 bits (559), Expect = 1e-55, Method: Composition-based stats. Identities = 36/302 (11%), Positives = 80/302 (26%), Gaps = 26/302 (8%) Query: 137 LKQNQQRQLTEYKTHRAGYTANGVPANISVVRSLQNSLARRTAMTAGKRRELHALEENLA 196 L+ ++ E R ++ V + + + + + +EE + Sbjct: 137 LEDRERFASAEANPLRRTSADPVSTFSVDVDTASYSYVRSTLSGGRLPNPDAVRVEEMVN 196 Query: 197 IISNSEPAQLLEEERLRKEIAELRAKIERVPFIDTFDLRYKNYEKRPDPSSQAVMFCLMD 256 + P ++ + + ++ PS + L+D Sbjct: 197 YFDYNYPVPEKGGHPFSTNVSVVDTPWNEHTKLMQVGIQGYKVPLDDLPSQN--LVFLID 254 Query: 257 VSGSMDQSTK-DMAKRFYILLYLFLSRTYKNVEVVYIRHHTQAKEVDE--------HEFF 307 SGSM + K + ++ + LL L + V Y E + + Sbjct: 255 TSGSMADANKLPLLQQSFRLLLSSLRDEDEVAIVTYAGSSGVLLEPTKVADKTRILEKIN 314 Query: 308 YSQETGGTIVSSALKLMDEVVKERYNPAQWNIYAAQASDGD-NWADDSPLCHEILAKKLL 366 G T LK + + + A+DGD N P + + Sbjct: 315 ALTSGGSTAGHEGLKGAYALAETMTGDGEQTRIIL-ATDGDFNVGLSDPDSLKRYVAEQR 373 Query: 367 P---VVRYYSYIEITRRAHQTLWREYEHLQSTFDNFAMQHIRDQDDIYPVFRELFHKQNA 423 + + + L + + D R++ Q Sbjct: 374 ENGTALSVLGFG--RGNYNDELMQTLAQNGQGVAAYI--------DTLSEARKVLVDQVV 423 Query: 424 TA 425 ++ Sbjct: 424 SS 425 >UniRef50_Q21MJ3 von Willebrand factor, type A n=5 Tax=Proteobacteria RepID=Q21MJ3_SACD2 Length = 708 Score = 219 bits (557), Expect = 2e-55, Method: Composition-based stats. Identities = 40/311 (12%), Positives = 99/311 (31%), Gaps = 26/311 (8%) Query: 129 FEDLALPNLKQNQQRQLTEYKTHRAGYTANGVPANISVVRSLQNSLARRTAMTAGKRREL 188 + + P + N + + E + ++ A +I V + + + R+ ++ Sbjct: 226 GDTIVAPAPQGNDKFEHVEENSVKSVAEAPVSTFSIDVDTASYSFVRRQLNSGYLPEKDA 285 Query: 189 HALEENLAIISNSEPAQLLEEERLRKEIAELRAKIERVPFIDTFDLRYKNYEKRPDPSSQ 248 EE + + P + I + + + + L+ + P + Sbjct: 286 IRAEELINYFDYNYPLPSDSTAPFKPNITVIDSPWAKGKKLVHIGLKGYDIAPDQKPRTN 345 Query: 249 AVMFCLMDVSGSMDQSTK-DMAKRFYILLYLFLSRTYKNVEVVYIRHHTQAKEVDE---- 303 + L+DVSGSM+ K + K+ +L L+ VVY E Sbjct: 346 --LVFLLDVSGSMNSQDKLPLVKQSMEMLLSTLNPDDTVAIVVYAGAAGTVLEPTPAKDK 403 Query: 304 ----HEFFYSQETGGTIVSSALKLMDEVVKERYNPAQWNIYAAQASDGD-NWADDSPLCH 358 Q G T + + L ++ + ++ N A+DGD N + Sbjct: 404 QKILSAMQRLQAGGSTAGGAGIALAYDLAEANFDKKAVNRVIL-ATDGDFNVGSTNNETL 462 Query: 359 EILAKKLLPV---VRYYSYIEITRRAHQTLWREYEHLQSTFDNFAMQHIRDQDDIYPVFR 415 + ++ + + + + L + + + D + Sbjct: 463 QGFVERKREKGIFLSVLGFGQ--GNYNDHLMQTLAQNGNGVAAYI--------DTVSEAQ 512 Query: 416 ELFHKQNATAK 426 ++ ++ +++ Sbjct: 513 KVLVQEASSSL 523 >UniRef50_A6GHE4 von Willebrand factor, type A n=1 Tax=Plesiocystis pacifica SIR-1 RepID=A6GHE4_9DELT Length = 785 Score = 218 bits (554), Expect = 4e-55, Method: Composition-based stats. Identities = 41/301 (13%), Positives = 77/301 (25%), Gaps = 25/301 (8%) Query: 142 QRQLTEYKTHRAGYTANG----VPANISVVRSLQNSLARRTAMTAGKRRELHALEENLAI 197 +R + E + A + A G +I V + S+ + EE + Sbjct: 242 ERSVVEARQVAASFVATGEDRKSTFSIDVDTASYASVRQSLRNGWMPDPGSVRTEEMINY 301 Query: 198 ISNSEPAQLLEE-ERLRKEIAELRAKIERVPFIDTFDLRY-KNYEKRPDPSSQAVMFCLM 255 A + ++ + + + L+ Sbjct: 302 FDYGYVAPSGGAGAPFAVHTEVGPCPWAPDHRLVQIGVQATRELPAQAQELRTRNLVFLL 361 Query: 256 DVSGSMDQS-TKDMAKRFYILLYLFLSRTYKNVEVVYIRH--------HTQAKEVDEHEF 306 DVSGSM + K + L L VVY KE Sbjct: 362 DVSGSMSSRGKLPLIKHGFTQLVEQLGAEDHVSIVVYAGAAGVVLPPTSGDQKETILGAL 421 Query: 307 FYSQETGGTIVSSALKLMDEVVKERYNPAQWNIYAAQASDGD-NWADDSPLCHEILAKKL 365 + GGT S+ + E+ + + N +DGD N L ++ Sbjct: 422 DRLEAGGGTNGSAGIVEAYELAQANFVDGGVNRVILG-TDGDFNVGLSDHDALVELIEQK 480 Query: 366 LPVVRYYSYIEITRRAHQTLWREYEHLQSTFDNFAMQHIRDQDDIYPVFRELFHKQNATA 425 + S + + L + + F D ++ ++ Sbjct: 481 RESGVFLSVLGVGGHYDDELMEQLADHGNGNYAFL--------DGKREAEKVLVEEIGGT 532 Query: 426 K 426 Sbjct: 533 L 533 >UniRef50_B1ZYN3 von Willebrand factor type A n=2 Tax=Bacteria RepID=B1ZYN3_OPITP Length = 792 Score = 216 bits (549), Expect = 1e-54, Method: Composition-based stats. Identities = 40/289 (13%), Positives = 77/289 (26%), Gaps = 31/289 (10%) Query: 158 NGVPANISVVRSLQNSLARRTAMTAGKRRELHALEENLAIISNSEPAQ---------LLE 208 V + ++ R + +EE + A Sbjct: 320 PLSTFAADVDTASYANVRRFLREGRLPPADAVRIEELVNYFPYRYAAPGRVRDEGVAAPG 379 Query: 209 EERLRKEIAELRAKIERVPFIDTFDLRYKNYEKRPDPSSQAVMFCLMDVSGSMDQSTK-D 267 E + A + L+ K+ ++ + L+DVSGSMDQ K Sbjct: 380 EAPFAAALEVAAAPWAAQHRLVRIGLKAKDAAVSGRAAAN--LVFLLDVSGSMDQPNKLR 437 Query: 268 MAKRFYILLYLFLSRTYKNVEVVYIRHHTQAKEVDE--------HEFFYSQETGGTIVSS 319 + + LL L + V Y + A + G T + Sbjct: 438 LVQESMRLLLGRLQPEDRVAIVTYAGNSGLALPSTPVARQREILDAIDELRAGGSTNGAM 497 Query: 320 ALKLMDEVVKERYNPAQWNIYAAQASDGD-NWADDSPLCHEILAKKLLPVVRYYSY-IEI 377 L+L ++ K + N +DGD N S L ++ + + Sbjct: 498 GLQLAYDIAKANFVANGVNRVIL-CTDGDFNVGVTSEGELVRLIEEKAKSGVFLTVLGFG 556 Query: 378 TRRAHQTLWREYEHLQSTFDNFAMQHIRDQDDIYPVFRELFHKQNATAK 426 + ++ + + D +L +Q + Sbjct: 557 MGNLKDAMLQQIADRGNGSYGYI--------DTRREAEKLLVQQVSGTL 597 >UniRef50_Q28U54 von Willebrand factor type A n=3 Tax=Rhodobacterales RepID=Q28U54_JANSC Length = 686 Score = 213 bits (542), Expect = 1e-53, Method: Composition-based stats. Identities = 38/281 (13%), Positives = 71/281 (25%), Gaps = 27/281 (9%) Query: 160 VPANISVVRSLQNSLARRTAMTAGKRRELHALEENLAIISNSEPAQLLEE-ERLRKEIAE 218 +I V + L A + +EE + PA ++ R + Sbjct: 239 STFSIDVDTASYALLRSTLNRGALPAPDAVRIEEMVNYFPYDYPAPTADDISPFRPNVQV 298 Query: 219 LRAKIERVPFIDTFDLRYKNYEKRPDPSSQAVMFCLMDVSGSMDQS-TKDMAKRFYILLY 277 + ++ P + L+D SGSM+ + + + L+ Sbjct: 299 FETPWNPDTQLVHIGIQGDLPVVEDRPPLN--LVFLIDTSGSMNDPAKLPLLIQSFRLML 356 Query: 278 LFLSRTYKNVEVVYIRHHTQAKEVD--------EHEFFYSQETGGTIVSSALKLMDEVVK 329 LS + V Y A E Q G T L+ + Sbjct: 357 NRLSPEDEVAIVTYAGSAGVALEPTAASDTATINAALTTLQAGGSTNGVGGLEEAYRLAG 416 Query: 330 ERYNPAQWNIYAAQASDGD-NWADDSPLCHEILAKKLLP---VVRYYSYIEITRRAHQTL 385 E + + A+DGD N E + + + Sbjct: 417 EMMVDGEVSRVLL-ATDGDFNVGLSDAGALEDYIAEQRDTGIYLSVLGFG--RGNLQDDT 473 Query: 386 WREYEHLQSTFDNFAMQHIRDQDDIYPVFRELFHKQNATAK 426 + + ++ D + + Q A A Sbjct: 474 MQALAQNGNGTASYI--------DTLHEAQRVLVDQLAGAL 506 >UniRef50_UPI000185CB41 protein containing von Willebrand factor n=1 Tax=Capnocytophaga sputigena ATCC 33612 RepID=UPI000185CB41 Length = 550 Score = 211 bits (538), Expect = 3e-53, Method: Composition-based stats. Identities = 53/323 (16%), Positives = 98/323 (30%), Gaps = 27/323 (8%) Query: 116 VFQISKDEYLDLLFEDLALPNLKQNQQRQLTEYKTHRAGYTANGVPANISVVRSLQNSLA 175 + +E E+ + L+ N+ + A + V R+ +L Sbjct: 58 AYDAVVEEMEIANSEEPSQQQLRSNETYKEISENPFVAVAQQPVTTFSADVDRASYANLR 117 Query: 176 RRTAMTAGKRRELHALEENLAIISNSEPAQLLEE-ERLRKEIAELRAKIERVPFIDTFDL 234 R ++ +EE + PA E LR + L Sbjct: 118 RMLGYGQLPPKDAIRIEEMINYFDYDYPAPTKEATSPLRVTPELAPTPWNPEHLLLRIGL 177 Query: 235 RYKNYEKRPDPSSQAVMFCLMDVSGSMDQSTK-DMAKRFYILLYLFLSRTYKNVEVVYIR 293 + K + P S + L+DVSGSMD+ K + K + LL L T + V Y Sbjct: 178 QAKKLDLAQAPPSN--IVFLIDVSGSMDEPNKLPLLKSSFKLLLTQLKPTDRVAIVTYAS 235 Query: 294 HHTQA--------KEVDEHEFFYSQETGGTIVSSALKLMDEVVKERYNPAQWNIYAAQAS 345 A ++ E +G T SS ++L + ++ + N A+ Sbjct: 236 GTKVALSSTPVKERQKIEKVLDNLYASGSTSGSSGIQLAYKEAQKNFIKNGNNRIIL-AT 294 Query: 346 DGD-NWADDSPLCHEILAKKLLP---VVRYYSYIEITRRAHQTLWREYEHLQSTFDNFAM 401 DGD N +P E +K + + + + + Sbjct: 295 DGDFNVGISNPRELEKFIEKQRESGIYMSVLGFG--MGNYRDDMAETIADKGNGNYAYI- 351 Query: 402 QHIRDQDDIYPVFRELFHKQNAT 424 D +++ + + Sbjct: 352 -------DDLTEAKKVLVNEFSG 367 >UniRef50_A3PN61 von Willebrand factor, type A n=9 Tax=Rhodobacteraceae RepID=A3PN61_RHOS1 Length = 651 Score = 211 bits (536), Expect = 5e-53, Method: Composition-based stats. Identities = 53/395 (13%), Positives = 91/395 (23%), Gaps = 33/395 (8%) Query: 51 SVSIP-TEDISEPMFHQGRGGLRHRVHPGNDHFVQNDRIERPQGGGGGSGSGQGQASQDG 109 V +P P R+ + + P + S +G A Q Sbjct: 94 VVVMPNARLAEPPQTAPDAPEADARLTAAPEAGGGAETAGAPVPAEPRARSAEGAAPQTF 153 Query: 110 EGQDEFVFQISKDEYLDLLFEDLA-----LPNLKQNQQRQLTEYKTHRAGYTANGVPANI 164 + L L + LP R +I Sbjct: 154 AADEAMPMAAPPAPDLALSKQAAEAPARALPQGDSEAFAN-APDNPLRVTAEDPVSTFSI 212 Query: 165 SVVRSLQNSLARRTAMTAGKRRELHALEENLAIISNSEPAQLLEEERLRKEIAELRAKIE 224 V + L RE +EE + PA R ++ R Sbjct: 213 DVDTASYAILRSSLRAGQLPPREAVRIEEMINYFPYDYPAPENGTPPFRPTLSITRTPWN 272 Query: 225 RVPFIDTFDLRYKNYEKRPDPSSQAVMFCLMDVSGSMDQS-TKDMAKRFYILLYLFLSRT 283 + L+ + P + L+D SGSM + K+ + L+ L Sbjct: 273 PETRLVHVALQGRMPAIEDRPPLN--LVFLIDTSGSMQDPAKLPLLKQSFGLMLGRLRPE 330 Query: 284 YKNVEVVYIRHHTQA--------KEVDEHEFFYSQETGGTIVSSALKLMDEVVKERYNPA 335 + V Y + + G T L L E Sbjct: 331 DQVAIVTYAGSAGEVLAPTAANQRSTILSALDRLDAGGSTAGDEGLALAYRTASEMAGAG 390 Query: 336 QWNIYAAQASDGD-NWADDSPLCHEILA---KKLLPVVRYYSYIEITRRAHQTLWREYEH 391 + A+DGD N P L + + + + Sbjct: 391 EVTRVVL-ATDGDFNLGISDPEELARLVAHERDTGVYLSVLGFG--RGNLDDATMQALAQ 447 Query: 392 LQSTFDNFAMQHIRDQDDIYPVFRELFHKQNATAK 426 + + D +++ Q + A Sbjct: 448 NGNGQAAYI--------DSLNEAQKVLVDQLSGAL 474 >UniRef50_C0Z5D5 Putative uncharacterized protein n=1 Tax=Brevibacillus brevis NBRC 100599 RepID=C0Z5D5_BREBN Length = 513 Score = 209 bits (532), Expect = 1e-52, Method: Composition-based stats. Identities = 41/290 (14%), Positives = 80/290 (27%), Gaps = 24/290 (8%) Query: 148 YKTHRAGYTANGVPANISVVRSLQNSLARRTAMTAGKRRELHALEENLAIISNSEPAQLL 207 + V + + E +EE + S PA Sbjct: 81 TNQFVSTAKDRLSTFAADVDTASYTIMRHFIKDGNLPPAEAVRVEEFINFFPTSYPAPT- 139 Query: 208 EEERLRKEIAELRAKIERVPFIDTFDLRYKNYEKRPDPSSQAVMFCLMDVSGSMDQSTK- 266 + + + ++ I ++ K + + + ++DVSGSM+Q + Sbjct: 140 -NQTFAIQADSGPSPFQKNLQIVRIGIKGKELSPKERKPAN--LVFVIDVSGSMNQENRL 196 Query: 267 DMAKRFYILLYLFLSRTYKNVEVVYIRH--------HTQAKEVDEHEFFYSQETGGTIVS 318 ++ K+ +L L T VVY T+ K+ Q G T Sbjct: 197 ELVKKSLHVLVDQLQPTDSVGIVVYGSEGRVLLPPTSTEDKQAILSAIDELQPEGSTNAE 256 Query: 319 SALKLMDEVVKERYNPAQWNIYAAQASDG-DNWADDSPLCHEILAKKL-LPVVRYYSYIE 376 L L E+ + P N SDG N + + + S+ Sbjct: 257 QGLVLGYEMAARSFKPPAINRVIL-CSDGVANVGETGAEGILRSIEDYARKDIYLSSFGF 315 Query: 377 ITRRAHQTLWREYEHLQSTFDNFAMQHIRDQDDIYPVFRELFHKQNATAK 426 + + + + + D + R +F + Sbjct: 316 GMGNYNDVMMEQLANKGEGSYAYI--------DTFSEARRIFTESLTGTL 357 >UniRef50_C8WPE6 von Willebrand factor type A n=1 Tax=Eggerthella lenta DSM 2243 RepID=C8WPE6_EGGLE Length = 555 Score = 208 bits (530), Expect = 2e-52, Method: Composition-based stats. Identities = 38/303 (12%), Positives = 75/303 (24%), Gaps = 30/303 (9%) Query: 140 NQQRQLTEYKTHRAGYTANGVPANISVVRSLQNSLARRTAMTAGK---RRELHALEENLA 196 ++ + + + T+ + V + +L R A EE L Sbjct: 78 TEEYRALDEPGFLSPATSPLSTLSADVDTASYCNLRRMVAQRYAPAVVPAGAVRTEELLN 137 Query: 197 IISNSEPAQLLEEERLRKEIAELRAKIERVPFIDTFDLRYKNYEKRPDPSSQAVMFCLMD 256 + P + + + + + + A + L+D Sbjct: 138 YFDYAYPEP-VGSDLFGVSAQMSDCPWNDQTKLLVMG--FATEKDGDASPTGANLVFLID 194 Query: 257 VSGSMDQSTK-DMAKRFYILLYLFLSRTYKNVEVVYI--------RHHTQAKEVDEHEFF 307 VSGSMD K + K + L L+ + V Y K Sbjct: 195 VSGSMDDPDKLPLVKDSFAALVEGLTERDRVSVVTYASGERVLLEGVPGDDKRRIMRAVD 254 Query: 308 YSQETGGTIVSSALKLMDEVVKERYNPAQWNIYAAQASDGD-NWADDSPLCHEILAKKLL 366 G T + L+ + + + N ASDGD N S ++ Sbjct: 255 SLVAEGSTNGEAGLEQAYRLAESSFIEGGVNRVV-MASDGDLNVGISSESELHDFVEQKR 313 Query: 367 P---VVRYYSYIEITRRAHQTLWREYEHLQSTFDNFAMQHIRDQDDIYPVFRELFHKQNA 423 + + + + ++ D R + + Sbjct: 314 ETGVYLSVLGFG--SGNYKDNKMETLADHGNGAYHYI--------DCAEEARRVLGRNLR 363 Query: 424 TAK 426 Sbjct: 364 ANL 366 >UniRef50_C7PR69 von Willebrand factor type A n=1 Tax=Chitinophaga pinensis DSM 2588 RepID=C7PR69_CHIPD Length = 588 Score = 207 bits (526), Expect = 8e-52, Method: Composition-based stats. Identities = 45/276 (16%), Positives = 85/276 (30%), Gaps = 27/276 (9%) Query: 164 ISVVRSLQNSLARRTAMTAGKRRELHALEENLAIISNSEPAQLLEEERLRKEIAELRAKI 223 + V R+ +++ R + +EE + S P + + L Sbjct: 160 VDVDRAAYSNIRRFVKLKERIPANAVRIEEMVNYFHYSYPLPPVGQT-LAIYSNYATCPW 218 Query: 224 ERVPFIDTFDLRYKNYEKRPDPSSQAVMFCLMDVSGSMDQSTK-DMAKRFYILLYLFLSR 282 + +R K+ P S + L+DVSGSM K + + + +L L Sbjct: 219 AEDHRLLQIAVRGKSVNLDSLPPSN--LVFLIDVSGSMAMPNKLPLLQAAFRILVNNLRS 276 Query: 283 TYKNVEVVYIR--------HHTQAKEVDEHEFFYSQETGGTIVSSALKLMDEVVKERYNP 334 V Y AK + Y G T +A+KL ++ +E + Sbjct: 277 NDHVAIVAYAGVPGVILPSTPGSAKSKILNAIDYLSAGGATAGEAAIKLAYQIAEENFIK 336 Query: 335 AQWNIYAAQASDGD-NWADDSPLCHEILA---KKLLPVVRYYSYIEITRRAHQTLWREYE 390 N A+DGD N S E L K+ ++ + + + Sbjct: 337 EGNNRVIL-ATDGDFNVGQTSDHDMEQLILGKKETGVLLTCLGFG--MKNYKDSKLETLS 393 Query: 391 HLQSTFDNFAMQHIRDQDDIYPVFRELFHKQNATAK 426 + + D ++F ++ + Sbjct: 394 SKGNGNFAYI--------DNLEEASKIFAREFGSTL 421 >UniRef50_B0UJ22 von Willebrand factor type A n=1 Tax=Methylobacterium sp. 4-46 RepID=B0UJ22_METS4 Length = 654 Score = 205 bits (521), Expect = 3e-51, Method: Composition-based stats. Identities = 35/298 (11%), Positives = 72/298 (24%), Gaps = 22/298 (7%) Query: 140 NQQRQLTEYKTHRAGYTANGVPANISVVRSLQNSLARRTAMTAGKRRELHALEENLAIIS 199 + R A ++ V + + EE + Sbjct: 166 RDRFANAPEGGFRITREAPVSTVSLGVDTASYGIVRDALNRNHLPPPAAVRTEELINYFP 225 Query: 200 NSEPAQLLEEERLRKEIAELRAKIERVPFIDTFDLRYKNYEKRPDPSSQAVMFCLMDVSG 259 + PA + R + + + +R P + + L+D SG Sbjct: 226 YAYPAPASPDAPFRVTASVFPSPWAEGRKLLHIGIRGYAVAPAERPPAN--LVFLVDTSG 283 Query: 260 SMDQSTK-DMAKRFYILLYLFLSRTYKNVEVVYIRHHTQAKEVDE--------HEFFYSQ 310 SM + + K+ +L L + V Y E Q Sbjct: 284 SMAAPNRLPLVKQSLAMLLTTLDARDRVALVAYAGEVGTVLEPTPAGEAGRILAAIETLQ 343 Query: 311 ETGGTIVSSALKLMDEVVKERYNPAQWNIYAAQASDGD-NWADDSPLCHEILA-KKLLPV 368 G T ++ + ++P N A+DGD N ++ Sbjct: 344 AHGSTAGGEGIRQAYALAARHFDPKAVNRVIL-ATDGDFNVGITGRDELTGFVARERRKG 402 Query: 369 VRYYSYIEITRRAHQTLWREYEHLQSTFDNFAMQHIRDQDDIYPVFRELFHKQNATAK 426 + + L + + D R++ ++ + Sbjct: 403 IFLSVLGFGMGNLNDALMQALAKDGNGVAAHI--------DTAQEARKVLVEEATSTL 452 >UniRef50_B4WCU1 von Willebrand factor type A domain protein n=2 Tax=Caulobacteraceae RepID=B4WCU1_9CAUL Length = 613 Score = 204 bits (520), Expect = 4e-51, Method: Composition-based stats. Identities = 39/297 (13%), Positives = 74/297 (24%), Gaps = 27/297 (9%) Query: 140 NQQRQLTEYKTHRAGYTANGVPANISVVRSLQNSLARRTAMTAGKRRELHALEENLAIIS 199 + + +I V + +++ R + +EE + Sbjct: 136 TETYPDATPNPVKRTADQPVSTFSIDVDTAAYSNVRRFIDEGRSPPADAVRVEELINAFD 195 Query: 200 NSEPAQLLEEERLRKEIAELRAKIER-----VPFIDTFDLRYKNYEKRPDPSSQAVMFCL 254 A + + I L+ + + L Sbjct: 196 YGYARPTSLARPFAITTAVVASPWAPRTERGGRQIVHIGLQGYELPQGEQRPLN--LTFL 253 Query: 255 MDVSGSMDQSTK-DMAKRFYILLYLFLSRTYKNVEVVYIR--------HHTQAKEVDEHE 305 +DVSGSM K D+AK+ L L Y K Sbjct: 254 VDVSGSMRSPDKLDLAKQAMNLAIDRLRPQDTLSVTYYAEGAGTTLQPTPGDQKLKMRCA 313 Query: 306 FFYSQETGGTIVSSALKLMDEVVKERYNPAQWNIYAAQASDGD-NWA-DDSPLCHEILAK 363 + +GGT ++ + + + + + N +DGD N D+ + +A+ Sbjct: 314 VASLRASGGTAGATGMTNAYDQAQASFARDKVNR-ILMFTDGDFNVGVTDNKRLEDYVAE 372 Query: 364 KLLPVVRYYSYIEITRRAHQTLWREYEHLQSTFDNFAMQHIRDQDDIYPVFRELFHK 420 K V Y + + + R LF Sbjct: 373 KRGTGVYLSVYGFGRGNYQDARMQTIAQAGNGVAAYVGD--------LRDARRLFGP 421 >UniRef50_A9GVG2 Putative uncharacterized protein n=1 Tax=Sorangium cellulosum 'So ce 56' RepID=A9GVG2_SORC5 Length = 656 Score = 204 bits (519), Expect = 5e-51, Method: Composition-based stats. Identities = 38/278 (13%), Positives = 68/278 (24%), Gaps = 24/278 (8%) Query: 158 NGVPANISVVRSLQNSLARRTAMTAGKRRELHALEENLAIISNSEPAQLLEEERLRKEIA 217 I V + R+ A + EE L + +A Sbjct: 212 RLSTFAIDVDTASYAIARRKIMDGALPPYQAVRAEEFLNYFDYGYASPAAG--PFAVHLA 269 Query: 218 ELRAKIERVPFIDTFDLRYKNYEKRPDPSSQAVMFCLMDVSGSMDQSTK-DMAKRFYILL 276 + + ++ K + + L+D SGSM K ++AK+ +L Sbjct: 270 AAPSPFTSGHHLVRVAVQGKRVPVKERTP--VHLVYLVDTSGSMQSPDKIELAKKSLKML 327 Query: 277 YLFLSRTYKNVEVVYIRHHTQAKEVD--------EHEFFYSQETGGTIVSSALKLMDEVV 328 L Y + G T +SS + L + Sbjct: 328 TDTLKPGDTVALCTYAGSVREVLAPTGIESKGKILAALADLTAGGSTAMSSGIDLAYSLA 387 Query: 329 KERYNPAQWNIYAAQASDGD-NWADDSPLCHEILAKKLL-PVVRYYSYIEITRRAHQTLW 386 + N SDGD N S K+ + + + Sbjct: 388 ERTLVKGHVNRVIV-LSDGDANVGPTSHDEILKTIKRARDKGITLSTVGFGQGNYKDLMM 446 Query: 387 REYEHLQSTFDNFAMQHIRDQDDIYPVFRELFHKQNAT 424 + + + D R +F +Q Sbjct: 447 EQLANQGDGNYAYI--------DSEAQARRVFSEQVGG 476 >UniRef50_D0XSR4 von Willebrand factor type A n=1 Tax=Brevundimonas subvibrioides ATCC 15264 RepID=D0XSR4_9CAUL Length = 625 Score = 203 bits (517), Expect = 7e-51, Method: Composition-based stats. Identities = 40/307 (13%), Positives = 75/307 (24%), Gaps = 36/307 (11%) Query: 130 EDLALPNLKQNQQRQLTEYKTHRAGYTANGVPANISVVRSLQNSLARRTAMTAGKRRELH 189 + P+ N R++ + +I V + ++ R + R+ Sbjct: 144 DTERYPDATPNPVRRVADE---------PVSTFSIDVDTAAYANVRRFISEGQTPPRDAV 194 Query: 190 ALEENLAIISNSEPAQLLEEERLRKEIAELRAKIER-----VPFIDTFDLRYKNYEKRPD 244 +EE + +E A + I L+ Sbjct: 195 RVEEMINYFDYGYARPGRADEPFAVSTAVAASPWSANAGAGGRQIVHIGLQGYELPAGER 254 Query: 245 PSSQAVMFCLMDVSGSMDQSTKD-MAKRFYILLYLFLSRTYKNVEVVYIRHHTQAKEVD- 302 + ++DVSGSM K +A++ L+ L + Y A Sbjct: 255 RPLN--LTFMVDVSGSMQSPDKLGLAQQTMNLIIDRLRPEDRVAVTYYASDVGTAVGPTP 312 Query: 303 -------EHEFFYSQETGGTIVSSALKLMDEVVKERYNPAQWNIYAAQASDGD-NWADDS 354 G T + + E + ++P + N +DGD N Sbjct: 313 GSEKLKLRCAVAALNAGGSTAGAQGMVNAYEQAEAAFSPDKVNR-ILMFTDGDFNVGVTD 371 Query: 355 PLCHEILAKKLL-PVVRYYSYIEITRRAHQTLWREYEHLQSTFDNFAMQHIRDQDDIYPV 413 E + Y + + + D Sbjct: 372 DRRLEDYVADKRGTGIYLSVYGFGRGNYQDARMQTIAQAGNGVAAYV--------DDLDE 423 Query: 414 FRELFHK 420 R LF Sbjct: 424 ARCLFGP 430 >UniRef50_C1AC65 Putative uncharacterized protein n=1 Tax=Gemmatimonas aurantiaca T-27 RepID=C1AC65_GEMAT Length = 642 Score = 199 bits (506), Expect = 1e-49, Method: Composition-based stats. Identities = 44/301 (14%), Positives = 81/301 (26%), Gaps = 27/301 (8%) Query: 139 QNQQRQLTEYKTHRAGYTANGVPANISVVRSLQNSLARRTAMTAGKRRELHALEENLAII 198 +Q E +I V R+ + R + +EE + Sbjct: 167 NREQYDRIEDNPFLGVTGNPLSTFSIDVDRASYGNARRFLQDGQRPPADAVRIEELINYF 226 Query: 199 SNSEPAQLLEEERLRKEIAELRAKIERVPFIDTFDLRYKNYEKRPDPSSQAVMFCLMDVS 258 + + A + + L+ + E P + + L+DVS Sbjct: 227 PYELREPR-GNDPVAITTEVTTAPWQPRHQLVRIALQSRRIETASLPPNN--LVFLIDVS 283 Query: 259 GSMDQSTK-DMAKRFYILLYLFLSRTYKNVEVVYIRH--------HTQAKEVDEHEFFYS 309 GSM K + K+ LL + + V Y KE Sbjct: 284 GSMQSPDKLPLVKQSLRLLVDQMRPQDRVAIVAYAGAAGLVLPSTSGDEKETIIQAIERL 343 Query: 310 QETGGTIVSSALKLMDEVVKERYNPAQWNIYAAQASDGD-NWADDSPLCHEILAKKLL-- 366 + G T + ++L +E + N ASDGD N S E L ++ Sbjct: 344 EAGGSTAGGAGIELAYRTAREHFMDHGNNRVIL-ASDGDFNVGVSSDGELERLIERKRTE 402 Query: 367 -PVVRYYSYIEITRRAHQTLWREYEHLQSTFDNFAMQHIRDQDDIYPVFRELFHKQNATA 425 + + T + + + D R++ ++ Sbjct: 403 GTYLTILGFG--TGNYQDAKMEKLAKRGNGNYGYV--------DDIAEARKMLVREMGAT 452 Query: 426 K 426 Sbjct: 453 L 453 >UniRef50_Q1CVN5 von Willebrand factor type A domain protein n=2 Tax=Cystobacterineae RepID=Q1CVN5_MYXXD Length = 700 Score = 196 bits (499), Expect = 9e-49, Method: Composition-based stats. Identities = 34/293 (11%), Positives = 80/293 (27%), Gaps = 31/293 (10%) Query: 145 LTEYKTHRAGYTANGVPANISVVRSLQNSLARRTAMTAGKRRELHALEENLAIISNSEPA 204 + + + ++ + A+ ++ R+ + + +EE + Sbjct: 241 INTEEERFSTFSVDTDSASYTLTRAYLER-------GSLPNEQAVRVEEFVNTFDYGYAH 293 Query: 205 QLLEEERLRKEIAELRAKIERVPFIDTFDLRYKNYEKRPDPSSQAVMFCLMDVSGSMDQS 264 Q ++ + + + + ++ + + S + ++DVSGSM+ Sbjct: 294 Q--GSAPFSVQVEGFPSPVRKGYHVVHVGVKAREVSRPQRKPS--HLVFVIDVSGSMNLE 349 Query: 265 TKD-MAKRFYILLYLFLSRTYKNVEVVY------IRHHTQAK--EVDEHEFFYSQETGGT 315 + + KR LL L + VVY + T A + G T Sbjct: 350 NRLGLVKRALHLLVNELDERDQVSIVVYGSTARLVLEPTSAVHAHIIRAAIDSLHTEGST 409 Query: 316 IVSSALKLMDEVVKERYNPAQWNIYAAQASDG-DNWADDSPLCH-EILAKKLLPVVRYYS 373 + L++ + N SDG N E + + + + Sbjct: 410 NAQAGLEMGYSLAASHLVEGGINRVIL-CSDGVANTGLTDANSIWERIRARAAKGITLST 468 Query: 374 YIEITRRAHQTLWREYEHLQSTFDNFAMQHIRDQDDIYPVFRELFHKQNATAK 426 + L + + D +F + Sbjct: 469 VGFGMGNYNDVLMERLSQVGEGNYAYV--------DRIEEAHRIFVRDLTGTL 513 >UniRef50_A0MN74 Conserved bacterial protein n=1 Tax=Thermus phage phiYS40 RepID=A0MN74_9CAUD Length = 340 Score = 194 bits (494), Expect = 4e-48, Method: Composition-based stats. Identities = 83/365 (22%), Positives = 131/365 (35%), Gaps = 75/365 (20%) Query: 16 MVNRQRFLRRYKAQIKQSISEAINKRSVTDVDSGESVSIPTEDISEPMFHQGRGGLRHRV 75 ++ R+L+R + IK + + +N + + + V I + EP F Sbjct: 5 TIDEIRYLKRLENIIKARMQDIVNSNDIIESTPEDKVRIRIPIMDEPYFK---------- 54 Query: 76 HPGNDHFVQNDRIERPQGGGGGSGSGQGQASQDGEGQDEFVFQISKDEYLDLLFEDLALP 135 + G G GSGSG EG E +++ +E +LLFE L LP Sbjct: 55 -----------PVFPGSGAGAGSGSGSEPGEGSEEGDHEIEIELTVEELSELLFEYLGLP 103 Query: 136 NLKQNQQRQLTEYKTHRAGYTANGVPANISVVRSLQNSLARRTAMTAGKRRELHALEENL 195 +K + + + G + G + I ++ + Sbjct: 104 KIKPKG-SSVEKEEYLIEGISKTGPRSRIHRRKTYYEIMK-------------------- 142 Query: 196 AIISNSEPAQLLEEERLRKEIAELRAKIERVPFIDTFDLRYKNYEKRPDPSSQAVMFCLM 255 +RYK+ KR P A+++ Sbjct: 143 ----YGYKE---------------------------DSIRYKHLRKREVPIFDAIVYFAR 171 Query: 256 DVSGSMDQSTKDMAKRFYILLYLFLSRTYKNVEVVYIRHHTQAKEVDEHEFFYSQETGGT 315 D S S+D K K + F+ YKNV + H T+AK V E +FF E G T Sbjct: 172 DYSASVDDKKKFKIKSTAFWINNFIKYNYKNVTTKFAVHDTKAKFVSEQDFFKLSEGGAT 231 Query: 316 IVSSALKLMDEVVKERYNPAQWNIYAAQASDGDNWADDSPLCHEILAKKLLPVVRYYSYI 375 + SS +L+ E RY+ +N Y SDG+N DD+P E L +KL +Y Sbjct: 232 LCSSVFELIYEDY-RRYSVDDYNFYLFYFSDGENLPDDNPKLRE-LVEKLSEDFNLIAYG 289 Query: 376 EITRR 380 E+ Sbjct: 290 EVKST 294 >UniRef50_A9NFD9 Surface-anchored VWFA domain protein n=1 Tax=Acholeplasma laidlawii PG-8A RepID=A9NFD9_ACHLI Length = 486 Score = 194 bits (492), Expect = 7e-48, Method: Composition-based stats. Identities = 43/318 (13%), Positives = 91/318 (28%), Gaps = 25/318 (7%) Query: 121 KDEYLDLLFEDLALPNLKQNQQRQLTEYKTHRAGYTANGVPANISVVRSLQNSLARRTAM 180 +DE + +F D + +N ++ ++S + + + + Sbjct: 30 QDENYNYIFNDDEHQEIIENPFIDVSVNNK---------SNISLSANTASYSFIRSQINS 80 Query: 181 TAGKRRELHALEENLAIISNSEPAQLLEEERLRKEIAELRAKIERVPFIDTFDLRYKNYE 240 R +EE + + + + + + ++ + L K + Sbjct: 81 GRAVDRNAVRIEEMVNFFNYNYNQPETD-KTFGFKSELIQTPWNNETHLLLIGLETKQVD 139 Query: 241 KRPDPSSQAVMFCLMDVSGSMDQSTKD-MAKRFYILLYLFLSRTYKNVEVVYI------- 292 PS + L+DVSGSM + K +AK+ LL + V Y Sbjct: 140 LGDIPS---NIVILLDVSGSMSATNKLSLAKKAMELLIEQMKPNDVISLVTYSSGEKVVF 196 Query: 293 -RHHTQAKEVDEHEFFYSQETGGTIVSSALKLMDEVVKERYNPAQWNIYAAQASDGD-NW 350 + + +G T L + +V +E + N A+DGD N Sbjct: 197 KGKSIDDMAYMTSQIRLLKASGSTAGKKGLDMAYKVAEEYFIEGGNNRIIL-ATDGDFNV 255 Query: 351 ADDSPLCHEILAKKLLP-VVRYYSYIEITRRAHQTLWREYEHLQSTFDNFAMQHIRDQDD 409 S + + + +Y + ++ I + Sbjct: 256 GISSTDMLIEYISEKRESGIYFSAYGFGYGNFKDEKLERVAKAGNGTYHYIDDIISARKA 315 Query: 410 IYPVFRELFHKQNATAKG 427 + + AK Sbjct: 316 FVDNIDGVLYTVARDAKA 333 >UniRef50_A9AWD1 von Willebrand factor type A n=1 Tax=Herpetosiphon aurantiacus ATCC 23779 RepID=A9AWD1_HERA2 Length = 610 Score = 187 bits (475), Expect = 6e-46, Method: Composition-based stats. Identities = 43/349 (12%), Positives = 89/349 (25%), Gaps = 27/349 (7%) Query: 90 RPQGGGGGSGSGQGQASQDGEGQDEFVFQISKDEYLDLLFEDLALPNLKQNQQRQLTEYK 149 P A+ + D + +E + + + Sbjct: 112 MPTQAADAGQPVPNPAAGKPLVDTWELPTQPIDPNPNYAYEQDQ--EIFDSMYFKNYGTN 169 Query: 150 THRAGYTANGVPANISVVRSLQNSLARRTAMTAGKRRELHALEENLAIISNSEPAQLLEE 209 T + + + + + + +EE L P + Sbjct: 170 PFVRTETDPLSTFAMDIDSASYSLMRSSINQGLLPPADSVRVEEYLNAFDYEYPQPEDGD 229 Query: 210 ERLRKEIAELRAKI-ERVPFIDTFDLRYKNYEKRPDPSSQAVMFCLMDVSGSMDQSTK-D 267 + + ++ ++ E A + ++D SGSM Q + + Sbjct: 230 --FAIYSEVAPSPFGGPNYELVQIGIQARSIEVADRKP--AALTFVIDTSGSMAQDNRLE 285 Query: 268 MAKRFYILLYLFLSRTYKNVEVVY------IRHHTQAKEVDE--HEFFYSQETGGTIVSS 319 M K I L L V + + + T + + + G T + Sbjct: 286 MVKNALIYLAGQLEPDDSLAIVAFNDGMRVVLNPTSGENQMDIITAINSLEPAGSTNAEA 345 Query: 320 ALKLMDEVVKERYNPAQWNIYAAQASDG-DNWADDSPLCHEILAKKLLP-VVRYYSYIEI 377 L E+ + + P N SDG N P ++ L V+ +Y Sbjct: 346 GLYKGFELAWQAFKPEGINR-ILLCSDGVANSGMTEPSQLLATFQQYLDAGVQLSTYGVG 404 Query: 378 TRRAHQTLWREYEHLQSTFDNFAMQHIRDQDDIYPVFRELFHKQNATAK 426 + L + + D + LF +Q + Sbjct: 405 MGNYNDILLEQLADKGDGNYAYF--------DSADEAQRLFGEQLTGSL 445 >UniRef50_Q1D6F9 von Willebrand factor type A domain protein n=2 Tax=Cystobacterineae RepID=Q1D6F9_MYXXD Length = 592 Score = 183 bits (465), Expect = 8e-45, Method: Composition-based stats. Identities = 33/287 (11%), Positives = 68/287 (23%), Gaps = 25/287 (8%) Query: 152 RAGYTANGVPANISVVRSLQNSLARRTAMTAGKRRELHALEENLAIISNSEPAQLLEEER 211 V + R +EE + E Sbjct: 151 VETAKDPLSTFAADVDTASYTVSRRYLVNGQLPPASAVRVEEFVNYFKFRYAPP--ETGA 208 Query: 212 LRKEIAELRAKIERVPFIDTFDLRYKNYEKRPDPSSQAVMFCLMDVSGSMDQSTK-DMAK 270 + + + ++ K + A + L+D SGSM K +A+ Sbjct: 209 FAVHLEGAPSPFDAKRHFLRVGVQGKVVSRSQRKP--AHLVFLVDTSGSMHSEDKLPLAR 266 Query: 271 RFYILLYLFLSRTYKNVEVVYIRHHTQAKEVDEH--------EFFYSQETGGTIVSSALK 322 + L+ V Y + GGT + S ++ Sbjct: 267 EAIKVAVKNLNENDTVAIVTYAGNTRDVLPPTPATDAKSIHAALDSLTAGGGTAMGSGME 326 Query: 323 LMDEVVKERYNPAQWNIYAAQASDGD-NWA--DDSPLCHEILAKKLLPVVRYYSYIEITR 379 L ++ + + + +DGD N + + + K V + Sbjct: 327 LAYRHAVKKASGSVVSRV-VVLTDGDANIGRNVSANAMLDSIHKYTAEGVTLTTVGFGMG 385 Query: 380 RAHQTLWREYEHLQSTFDNFAMQHIRDQDDIYPVFRELFHKQNATAK 426 L + + + D +++F Q Sbjct: 386 NYRDDLMEKLADKGNGNCFYV--------DSLREAKKVFETQLTGTL 424 >UniRef50_A3TQT5 Putative secreted protein n=1 Tax=Janibacter sp. HTCC2649 RepID=A3TQT5_9MICO Length = 533 Score = 178 bits (452), Expect = 3e-43, Method: Composition-based stats. Identities = 47/293 (16%), Positives = 85/293 (29%), Gaps = 30/293 (10%) Query: 145 LTEYKTHRAGYTANGVPANISVVRSLQNSLARRTAMTAGKRRELHALEENLAIISNSEPA 204 + + R+ + + + V RSL E EE + + PA Sbjct: 91 IDTRERPRSTFAVDVDGGSFRVARSL-------LHDGHLPPPESVRPEEWVNSFDSGFPA 143 Query: 205 QLLEEERLRKEIAELRAKIERVPFIDTFDLRYKNYEKRPDPSSQAVMFCLMDVSGSMDQS 264 ++ L+ + A ++ + + L+ + + R + ++D SGSMD Sbjct: 144 PRKDDLELQSDQARASSE-DDGTRLVRIGLQGREVDVREWQP--VALTMVVDTSGSMDIR 200 Query: 265 TKD-MAKRFYILLYLFLSRTYKNVEVVYIRHHTQAKEVDE--------HEFFYSQETGGT 315 + + K LL L V Y T E + G T Sbjct: 201 ERLGLVKSSLALLAENLRPDDTIAIVTYQTDATPLLEPTPVRDTDTILAAIDRLEAGGST 260 Query: 316 IVSSALKLMDEVVKERYNPAQWNIYAAQASDG-DNWA-DDSPLCHEILAKKLLPVVRYYS 373 + + L L + +E Y N+ ASDG N D + + + Sbjct: 261 NLEAGLLLGYDQAREAYKQGATNVVLL-ASDGVANVGVTDGGRLATAIRDNGRRGIHLVT 319 Query: 374 YIEITRRAHQTLWREYEHLQSTFDNFAMQHIRDQDDIYPVFRELFHKQNATAK 426 L + F + D + R+LF + Sbjct: 320 VGYGMGNYSDHLMEQLADQGDGFYEYI--------DTFEEARKLFVEDLRATL 364 >UniRef50_B4D1N7 Autotransporter-associated beta strand repeat protein n=1 Tax=Chthoniobacter flavus Ellin428 RepID=B4D1N7_9BACT Length = 1545 Score = 158 bits (399), Expect = 4e-37, Method: Composition-based stats. Identities = 43/290 (14%), Positives = 77/290 (26%), Gaps = 25/290 (8%) Query: 149 KTHRAGYTANGVPANISVVRSLQNSLARRTAMTAGKRRELHALEENLAIISNSEPAQLLE 208 + +++V A EE + +P Sbjct: 1103 QPEVQTSANAFSTFSLNVSDVSFKLAAASLEQGHMPDPASVRSEEFINAFDYRDPEPSPG 1162 Query: 209 EERLRKEIAELRAKIERVPFIDTFDLRYKNYEKRPDPSSQAVMFCLMDVSGSMDQSTK-D 267 L R + + F + K P + L+D SGSM+++ + + Sbjct: 1163 -APLAFVTERARYPFAQNRDLLRFAV--KTAAAGRQPGRPLNIVLLLDRSGSMERADRVN 1219 Query: 268 MAKRFYILLYLFLSRTYKNVEVVYIRHHT---------QAKEVDEHEFFYSQETGGTIVS 318 + + +L L K V + R + +V GGT + Sbjct: 1220 IVREALSVLAKHLQPQDKLSIVTFARTPHLWADAVAGDKVHDVI-ARVNEITPEGGTNLE 1278 Query: 319 SALKLMDEVVKERYNPAQWNIYAAQASDG-DNWADDSPLCHEILAK-KLLPVVRYYSYIE 376 +AL L E + N +DG N D +P + + + + Sbjct: 1279 AALDLAYETAHHHFAVDSTNRVIL-FTDGAANLGDVNPDALTKKVEAQRKQGIALDCFGI 1337 Query: 377 ITRRAHQTLWREYEHLQSTFDNFAMQHIRDQDDIYPVFRELFHKQNATAK 426 + L + F I +D F Q A A Sbjct: 1338 GWEGYNDDLLEQLTRNADGRYGF----INTPED----AAANFATQIAGAL 1379 >UniRef50_Q747C5 Conserved domain protein n=11 Tax=Deltaproteobacteria RepID=Q747C5_GEOSL Length = 447 Score = 157 bits (397), Expect = 8e-37, Method: Composition-based stats. Identities = 73/380 (19%), Positives = 145/380 (38%), Gaps = 53/380 (13%) Query: 23 LRRYKAQIKQSISEAINKRSVTDVDSGES---VSIPTEDISEPMFHQ------GRGGLRH 73 L R + + + + I + +G V +PT + E + H Sbjct: 72 LERDRLREEDGLPRKIRIGKLIKPGAGGKEKIVVVPT-TVEEKLIHDRAPEETEEDESMG 130 Query: 74 RVHPGNDHFVQNDRIERPQGGGGGSGSGQGQASQDGEGQDEFVFQISKDEYLDLLFEDLA 133 G++ + ++ RPQ GG G +G + + + + +L E Sbjct: 131 GTGDGDEGEIIGEQPVRPQQEGGS--GTAGHGEGEGHELESTAYDLGR-----ILTERFD 183 Query: 134 LPNLKQNQQRQLTEYKTHRAGYTANGVPANISVVRSLQNSLARRTAMTAGKRRELHALEE 193 LPNLK+ ++ + + Y+ + N R L ++ + Sbjct: 184 LPNLKEKGKK------SSLSHYSYDLTDRN----RGFGQILEKKQTLRRILE-------- 225 Query: 194 NLAIISNSEPAQLLEEERLRKEIAELRAKIERVPFIDTFDLRYKNYEKRPDPSSQAVMFC 253 I+ A + E + + + +RV I + +L Y SQA++F Sbjct: 226 --TNIALGTVADVAEIDP----TRLVISPRDRVYRILSRELEY---------ESQALVFF 270 Query: 254 LMDVSGSMDQSTKDMAKRFYILLYLFLSRTY-KNVEVVYIRHHTQAKEVDEH-EFFYSQE 311 + D SGSM+ + ++L+Y +L + + VE +I H A+EV + ++ + Sbjct: 271 IRDYSGSMEGKATEAVCSQHVLIYSWLLYQFARQVETRFILHDNDAREVPDFYTYYNLRV 330 Query: 312 TGGTIVSSALKLMDEVVKERYNPAQWNIYAAQASDGDNWADDSPLCHEILAKKLLPVVRY 371 GGT V++A ++++E+V++ +NIY +DGD+W + L +++L Sbjct: 331 AGGTRVAAAYRMVNEIVEKESLARDYNIYVFHGTDGDDWDTNGEETIPEL-RRMLAYANR 389 Query: 372 YSYIEITRRAHQTLWREYEH 391 + E E Sbjct: 390 IGVTIAEHTYGSSGNTEVER 409 >UniRef50_A7C0I1 von Willebrand factor type A domain protein n=5 Tax=Bacteria RepID=A7C0I1_9GAMM Length = 367 Score = 153 bits (387), Expect = 1e-35, Method: Composition-based stats. Identities = 29/194 (14%), Positives = 58/194 (29%), Gaps = 22/194 (11%) Query: 244 DPSSQAVMFCLMDVSGSMDQSTKDMAKRFYILLY-LFLSRTYKNVEVVYIRHHTQAKEVD 302 P + + L+DVSGSM + K + + L L+ K VVY E Sbjct: 1 MPPAN--LVFLVDVSGSMRSNHKLALLKSALKLLSNQLTEKDKVSLVVYAGAAGVVLEPT 58 Query: 303 --------EHEFFYSQETGGTIVSSALKLMDEVVKERYNPAQWNIYAAQASDGD-NWADD 353 G T S+ + L + ++ + N A+DGD N Sbjct: 59 PGHQSVKINGALERLTAGGSTHGSAGIHLAYNLAEQAFIKNGINR-ILLATDGDFNVGTV 117 Query: 354 SPLCHEILAKKLLP-VVRYYSYIEITRRAHQTLWREYEHLQSTFDNFAMQHIRDQDDIYP 412 + L ++ + + + L + + + D Sbjct: 118 DFEALKNLVEEKRKSGISLTTLGFGRGNYNDQLMEQLADAGNGNYAYI--------DTLN 169 Query: 413 VFRELFHKQNATAK 426 +++ + ++ Sbjct: 170 EAQKVLVDEMSSTL 183 >UniRef50_D2BAS2 von Willebrand factor type A domain protein n=12 Tax=Actinomycetales RepID=D2BAS2_STRRD Length = 490 Score = 153 bits (386), Expect = 1e-35, Method: Composition-based stats. Identities = 37/278 (13%), Positives = 71/278 (25%), Gaps = 26/278 (9%) Query: 160 VPANISVVRSLQNSLARRTAMTAGKRRELHALEENLAIISNSEPAQLLEEERLRKEIAEL 219 + V + R EE + + + Sbjct: 64 STFALDVDTASYGYAKRILQEGRLPEPGQIRPEEFVNSFRQDYKEPGDDG--FTVHMDGA 121 Query: 220 RAKIERVPFIDTFDLRYKNYEKRPDPSSQAVMFCLMDVSGSMDQS-TKDMAKRFYILLYL 278 R + L+ + E + + ++DVSGSM + D+ + L Sbjct: 122 RMPEN-GTALIRVGLQTRKAEPEARRPAN--LTFVVDVSGSMGEPGRLDLVREALHKLVD 178 Query: 279 FLSRTYKNVEVVYIRH--------HTQAKEVDEHEFFYSQETGGTIVSSALKLMDEVVKE 330 L + V + ++ T + + L Sbjct: 179 QLGPGDQVSIVAFSTQARLVLSMTPATGRDQLHAAIDRLGVEDSTNLETGLTAGYAEAAR 238 Query: 331 RYNPAQWNIYAAQASDG-DNWADDS-PLCHEILAKKLLPVVRYYSYIEITRRAHQTLWRE 388 + PA N SDG N D + + +A+ + + R L + Sbjct: 239 AFRPAATNRVIL-LSDGLANTGDTTWQGILDRVAESAGRQITLLCVG-VGRDYGDQLMEQ 296 Query: 389 YEHLQSTFDNFAMQHIRDQDDIYPVFRELFHKQNATAK 426 + + DD R++F +Q AT Sbjct: 297 LADNGDGAAVY----VSSADD----ARKVFVEQLATNL 326 >UniRef50_B5JNR2 von Willebrand factor type A domain protein n=1 Tax=Verrucomicrobiae bacterium DG1235 RepID=B5JNR2_9BACT Length = 923 Score = 142 bits (359), Expect = 2e-32, Method: Composition-based stats. Identities = 38/296 (12%), Positives = 71/296 (23%), Gaps = 19/296 (6%) Query: 130 EDLALPNLKQNQQRQLTEYKTHRAGYTANGVPANISVVRSLQNSLARRTAMTAGKRRELH 189 D LP + T +++V A Sbjct: 465 NDEPLPRTQNPPPTTQLSEYPESNTATDPQSTFSLNVSDVSYRLTEAYLAQNVRPPAGTL 524 Query: 190 ALEENLAIISNSEPAQLLEEERLRKEIAELRAKIERVPFIDTFDLRYKNYEKRPDPSSQA 249 EE + +P + + + + F L+ S Sbjct: 525 RTEEFVNAFDYGDPTPPVARK-IGFTWERAHWPFAHDRDVLRFSLQ--TAAHGRASSQPL 581 Query: 250 VMFCLMDVSGSMDQSTK-DMAKRFYILLYLFLSRTYKNVEVVYIRHHT---QAKEVDEHE 305 + +D SGSM + + D+ L L+ + V + R + V Sbjct: 582 HLTLAIDTSGSMSRPDRVDIVNSLATALQSNLTEKDRLSIVSFDRQPRLVLDGQSVTAET 641 Query: 306 -----FFYSQETGGTIVSSALKLMDEVVKERYNPAQWNIYAAQASDG-DNWADDSPLCHE 359 GGT + SAL+L + + + N +DG N + + Sbjct: 642 NLATLATQLNPQGGTDLESALQLSYQTAQRHFQENAINRVIL-ITDGAANLGNTNAEQLR 700 Query: 360 ILAKKLL-PVVRYYSYIEITRRAHQTLWREYEHLQSTFDNFAMQHIRDQDDIYPVF 414 + + + T F +R +D Sbjct: 701 TTVTENRIRGIALDCFGIGFDGHDDTFLESLSRNGDGRYRF----LRSPEDAALEL 752 >UniRef50_A2SP98 Putative uncharacterized protein n=1 Tax=Methylibium petroleiphilum PM1 RepID=A2SP98_METPP Length = 791 Score = 141 bits (355), Expect = 5e-32, Method: Composition-based stats. Identities = 77/432 (17%), Positives = 141/432 (32%), Gaps = 54/432 (12%) Query: 22 FLRRYKAQIKQSISEAINKRSVTDVDSGESVSIP------TEDISEPMFHQGRGGLRHRV 75 +L K + ++ I ++ + S P +D+ M G Sbjct: 364 YLDMLKPSERVEMAMKILEKILQPQKSNGMPQQPQNGGLTIKDLERAMGRGGAPN----- 418 Query: 76 HPGNDHFVQNDRIERPQGGGGGSGSGQG-QASQDGEGQDEFVFQISKDEYLDLLFED-LA 133 PGN + + G GSG+ A GQD +S ++ L + ++ Sbjct: 419 -PGNGNSQSGGQPGDQAGAQDGSGTEDMVPAPTVTHGQD---HVMSTEDLAQALHDAGVS 474 Query: 134 LPNLKQNQQRQLTEYKTHRAGYTANGVPANISVVRSLQNSLARRTAMTAGKRRELHALEE 193 + + L + +GV + I+ Q + R + + Sbjct: 475 SDTMAKLGFDDLKKIPEEVKH-AKDGVVSAINKASEDQMKVGSRYPGGHLLHYAKAQMLD 533 Query: 194 NLAIISNSEPAQLLEEERLRKEIAELRAKIERVPFIDTFDLRYKNYEKRPD-------PS 246 + E A E K + + +D D+ +K+ P Sbjct: 534 FFKPVLTWEMAHKKLLEACGKGSRYDPTEPWTLYHVDAADMGFKHQRDVPFMGSRMPGKE 593 Query: 247 SQAVMFCLMDVSGSMDQSTKDMAKRFYILLYLFLSRTYKNV--EVVYIRHHTQAKEVDE- 303 + +MF ++D SGS+D + M KRF R + V +V+ T + V E Sbjct: 594 QKPLMFDIIDTSGSVDDA---MLKRFVSEALNQARRVSRGVAPDVLISWADTICRGVPEF 650 Query: 304 ------HEFFYSQ----ETGGTIVSSALKLMDEVVK--ERYNPAQWNI-YAAQASD-GDN 349 +F GGT +A++ + E+VK + A+ NI +D GD+ Sbjct: 651 ISEKNYKQFLTKGINYGGRGGTNFQAAIENVLEMVKPGSKSGYAKRNIDAICYMTDSGDS 710 Query: 350 WADDSPLCHEIL---AKKLLPVVRYYSYIEITRRAHQTLWREYEHLQSTFD--NFAMQHI 404 D + L + KKL P++ ++ + +E + Sbjct: 711 VPDPARLLRKAQECGLKKLPPIL----FLVPKSCYDERFAKEASKWATVVYFHAGPGAKH 766 Query: 405 RDQDDIYPVFRE 416 + DI RE Sbjct: 767 TQKVDINAAARE 778 >UniRef50_A6BYV9 Putative uncharacterized protein n=1 Tax=Planctomyces maris DSM 8797 RepID=A6BYV9_9PLAN Length = 1197 Score = 136 bits (341), Expect = 2e-30, Method: Composition-based stats. Identities = 55/416 (13%), Positives = 109/416 (26%), Gaps = 53/416 (12%) Query: 24 RRYKAQIKQSISEA-INKRSVTDVDSGESVSIPTEDISEP-------MFHQ-------GR 68 R+ + I + + + ++ D + P P + Sbjct: 791 RKGREAIANLFPDFPLRQLNIKDPFAK-----PIRVAEAPGEIPLADRWRLILGVKGCST 845 Query: 69 GGLRHRVHPGNDHFVQNDRIERPQGGG--GGSGSGQGQASQDGEGQDEFVFQISKDEYLD 126 + + + ++R R G G + A E + KD + Sbjct: 846 PKSQQVAGTLDQLYGGSEREGRGLQGDLASDRGGTEAAAPSVREWISDVERLFGKDVCEE 905 Query: 127 LLFEDLA------LPNLKQNQQRQLTEYKTHRAGYTANGVPANISVVRSLQNSLARRTAM 180 +L E L +L R E + ++R L ++ R A Sbjct: 906 VLGEAAVNGRAAVLEHLNHATVRPSVELLEQVLSLRGALSERELGLLRKLARNITERMAK 965 Query: 181 TAGKRRELHALEENLAIISNSEPAQLLEEERLRKEIAELRAKIERVPFIDTFDLRYKNYE 240 R ++A + +L L + K + I L Y+ Sbjct: 966 QLANRLRPALHGLSIARPTRRRSPRLDFARTLNSNLHTAYRKSDGRISIAPTRLVYRLPA 1025 Query: 241 KRPDPSSQAVMFCLMDVSGSMDQSTKDMAKRFYILLYLFLSRTYKNVEVVYIRHHTQAKE 300 KR + ++DVSGSM+ S + ++V + TQ + Sbjct: 1026 KRQM---DWHLIFVVDVSGSMEASVIY---SSMMAAIFSALPA---IDVKFFAFSTQVID 1076 Query: 301 VDEHEFFYS------QETGGTIVSSALKLMDEVVKERYNPAQWNIYAAQASDGDNWADDS 354 Q GGT + L+ E + P++ + +D + Sbjct: 1077 FTGRVEDPLSLLMEIQIGGGTHIGLGLRAARESITN---PSRTLVVL--VTDFE-EGVSV 1130 Query: 355 PLCHEILAKKLLPVVRYYSYI----EITRRAHQTLWREYEHLQSTFDNFAMQHIRD 406 P + + E R H + + + + Sbjct: 1131 PELLSEVVMLSSSGAKLIGLAALNDEAKPRYHAGTAAAVVQAGMPVAAVSPERLAE 1186 >UniRef50_C1RGW7 Uncharacterized protein containing a von Willebrand factor type A (VWA) domain n=1 Tax=Cellulomonas flavigena DSM 20109 RepID=C1RGW7_9CELL Length = 500 Score = 135 bits (340), Expect = 2e-30, Method: Composition-based stats. Identities = 37/308 (12%), Positives = 64/308 (20%), Gaps = 29/308 (9%) Query: 130 EDLALPNLKQNQQRQLTEYKTHRAGYTANGVPANISVVRSLQNSLARRTAMTAGKRRELH 189 EDL P R + V Sbjct: 49 EDLPYPEPGPTGPTAAGMTDPAR----DALSTFALDVDTGAYTRFRDAVRQGFSVDPFGV 104 Query: 190 ALEENLAIISNSEPAQLLEEERLRKEIAELRAKIERVPFIDTFDLRYKNYEKRPDPSSQA 249 EE + + L I + + + + A Sbjct: 105 RTEEFVNYFAQDYEPPAEG---LGVSIDATALPFRPDHRLVRVGI--SSAPASAVSRADA 159 Query: 250 VMFCLMDVSGSMDQS-TKDMAKRFYILLYLFLSRTYKNVEVVYIRHHTQAKEVDE----- 303 + ++D SGSMD++ + K L L RT + V Y E Sbjct: 160 DLVLVVDCSGSMDEAGKMETTKYALRTLVSSLRRTDRVAMVCYSTEADVYLEPTPVAERE 219 Query: 304 ---HEFFYSQETGGTIVSSALKLMDEVVKERYNPAQWNIYAAQASDG-DNWADDSPLCHE 359 T ++ L L ++ + SDG N + P Sbjct: 220 GVLAAIDRLAPRDSTNAAAGLALGYDLAMSMRTEGRLTRVVL-VSDGVANVGETDPEGIL 278 Query: 360 ILAKKLLP-VVRYYSYIEITRRAHQTLWREYEHLQSTFDNFAMQHIRDQDDIYPVFRELF 418 + S + L + + + D +F Sbjct: 279 ARISSQAKAGISLISVGVGITTYNDHLLEQLADQGDGWHVYV--------DGEAEAERVF 330 Query: 419 HKQNATAK 426 + Sbjct: 331 ATGLTGSL 338 >UniRef50_O28828 Putative uncharacterized protein n=1 Tax=Archaeoglobus fulgidus RepID=O28828_ARCFU Length = 410 Score = 129 bits (324), Expect = 2e-28, Method: Composition-based stats. Identities = 43/279 (15%), Positives = 94/279 (33%), Gaps = 22/279 (7%) Query: 112 QDEFVFQISKDEYLDLLFEDLALPNLKQNQQRQLTEYKTHRAGYTANGVPANISVVRSLQ 171 D ++S + +D F+++ + L++ + E + HR + S+ Sbjct: 108 GDISKDELSMSQVVDNFFDEV-VDELQEMGYVEKVETRFHRKIIHYT------AKAESVL 160 Query: 172 NSLARRTAMTAGKRRELHALEENLAIISNSEPAQLLEEERLRKEIAELRAKIERVPFIDT 231 ++ +R E S ++++ + + + Sbjct: 161 AEKVLSLSLQNLDKRSYGEHETEKLGQSIFSSERIVDYDPFTHSYDNIDLVESLIASAMR 220 Query: 232 FDLRYKN---YEKRPDPSSQAVMFCLMDVSGSMDQSTKDMAKRFYILLYLFLSRTYKNVE 288 ++ ++P + + V L+DVS SM A + L + R E Sbjct: 221 GEIELNENEMVARQPKHTEKCVYVMLIDVSDSMRGRKIVGAIEAALCLRKAIRRAGSGDE 280 Query: 289 VVYIRHHTQAKEVDEHEFFYSQETGGTIVSSALKLMDEVVKERYNPAQWNIYAAQASDGD 348 + I + +A E+ E E + G T + ALK +++K SDG+ Sbjct: 281 LRVIAFNHRAHEIKEGEILNLEARGRTDIGLALKRARKILKGSSGTG----VVFLISDGE 336 Query: 349 NWADDSP-----LCHEILAKKLL---PVVRYYSYIEITR 379 + +P C A+K+ ++ + + R Sbjct: 337 PTSSYNPYLTPWRCALKEAEKMRNVDARLQIIMFGKEGR 375 >UniRef50_UPI00006CDDCC von Willebrand factor type A domain containing protein n=1 Tax=Tetrahymena thermophila RepID=UPI00006CDDCC Length = 2138 Score = 127 bits (319), Expect = 7e-28, Method: Composition-based stats. Identities = 45/291 (15%), Positives = 89/291 (30%), Gaps = 60/291 (20%) Query: 135 PNLKQNQQRQLTEYKTHRAGYTANGVPANISVVRSLQNSLARRTAMTAGKRRELHALEEN 194 P +++ Q + + + + IS + L S+ R+ K ++ E+ Sbjct: 1356 PKIEEKDQSEGQQEEFEQNE---THSLRKISQKKVLIKSIQRKVKTNKEKVQKALNEEDK 1412 Query: 195 LAIISNSEPAQLLEEERLRKEIAELRAKIERVPFIDTFDLRYKNYEKRPDPSSQAVMFCL 254 +Q K I+ + + P DL C+ Sbjct: 1413 EN----QTKSQQHRISSNVKNISGQFSLGQLQPMRFPIDL-----------------ICV 1451 Query: 255 MDVSGSMDQSTKDMAKRFYILLYLFLSRTYKNVEVVYIRHHTQAKEVD------------ 302 +D SGSM+ D+ K + L L + + I+ T A+ + Sbjct: 1452 IDTSGSMNGQPLDLLKETLLFLVDLLQTGDR---ICLIQFSTNAQRLTPLLSIESKDNIK 1508 Query: 303 --EHEFFYSQETGGTIVSSALKLMDEVVKERYNPAQWNIYAAQASDGDNWADDSPLCHEI 360 ++E GGT + ++L +V+K+R SDG N ++ Sbjct: 1509 SIKNEINRLVAKGGTNICQGMQLAFDVLKQRRYKNPITSV-FLLSDGLNDGAENK----- 1562 Query: 361 LAKKLLPVVRYY-----------SYIEITRRAHQTLWREYEHLQSTFDNFA 400 + LL + +Y ++ L + L + Sbjct: 1563 -IRDLLKQLNFYQNYNEENFTIQTFGFGK-DHDPNLMDKISQLMDGNFYYI 1611 >UniRef50_A9DKM0 Putative uncharacterized protein (Fragment) n=1 Tax=Shewanella benthica KT99 RepID=A9DKM0_9GAMM Length = 167 Score = 125 bits (313), Expect = 4e-27, Method: Composition-based stats. Identities = 101/153 (66%), Positives = 124/153 (81%), Gaps = 1/153 (0%) Query: 1 MTWFIDRRLNGKNKSMVNRQRFLRRYKAQIKQSISEAINKRSVTDVDSGESVSIPTEDIS 60 M FIDRRLN + +S VNRQRF+ RYK QIK+++S+A+ +RSVTDVD GE +SIPT+DIS Sbjct: 16 MANFIDRRLNARGRSTVNRQRFINRYKQQIKKAVSDAVTRRSVTDVDKGERISIPTKDIS 75 Query: 61 EPMFHQGRGGLRHRVHPGNDHFVQNDRIERPQGGGGGSGSGQGQASQDGEGQDEFVFQIS 120 EP FHQG+GG+R RVHPGND F++ D+IER GGG GSGQG AS GEG D+FVFQIS Sbjct: 76 EPSFHQGQGGIRERVHPGNDQFIKGDKIER-PPGGGSQGSGQGDASNSGEGDDDFVFQIS 134 Query: 121 KDEYLDLLFEDLALPNLKQNQQRQLTEYKTHRA 153 KDEYL+LLFEDL LPNL+ N+ +L EY+ +RA Sbjct: 135 KDEYLELLFEDLELPNLQNNRLNKLVEYQVYRA 167 >UniRef50_D1WZ12 VWA containing CoxE family protein n=13 Tax=Streptomyces RepID=D1WZ12_9ACTO Length = 1289 Score = 124 bits (312), Expect = 5e-27, Method: Composition-based stats. Identities = 46/308 (14%), Positives = 85/308 (27%), Gaps = 29/308 (9%) Query: 45 DVDSGESVSIPTEDISEPMFHQGRGGLRHRVHPGNDHFVQNDRIERPQGGGGGSGSGQGQ 104 D + + T G G PG P Sbjct: 935 DQLPSGAARLATALDELYGAGHGEGSRGGLSGPGRTGSRGGREPSFPGVREWSEELAALF 994 Query: 105 ASQDGEGQDEFVFQISKDEYLDLLFEDLALPNLKQNQQRQLTEYKTHRAGYTANGVPANI 164 E + + L L A P++ +L AG A + Sbjct: 995 GPGVREEVLAAAAVTGRQDVLAELDPAAATPSV------ELLRTILRYAG---GLPEARL 1045 Query: 165 SVVRSLQNSLARRTAMTAGKRRELHALEENLAIISNSEPAQLLEEERLRKEIAELRAKIE 224 + +R L L R LA + +L LR +A R + Sbjct: 1046 AALRPLVRHLVDELTRQLTTRLRPALTGTMLARPTRRPGGRLDLPRTLRANLATARRTAD 1105 Query: 225 RVPFIDTFDLRYKNYEKRPDPSSQAVMFCLMDVSGSMDQSTKDMAKRFYILLYLFLSRTY 284 + ++ R S+ + + DVSGSM+ A + L + Sbjct: 1106 GTVQVIPQKPVFR---SRARRSADWRLILVTDVSGSME------ASTIWSALTASVLAGV 1156 Query: 285 KNVEVVYIRHHTQAKEVDEHEFFYS------QETGGTIVSSALKLMDEVVKERYNPAQWN 338 + ++ T+ ++ H GGT +++ L+ +++ P++ Sbjct: 1157 PTLSTHFLAFSTEVVDLTGHVHDPLSLLLEVSVGGGTHIAAGLRHARGLIE---VPSRTL 1213 Query: 339 IYAAQASD 346 + SD Sbjct: 1214 VVV--ISD 1219 >UniRef50_B4X134 Vault protein inter-alpha-trypsin n=1 Tax=Alcanivorax sp. DG881 RepID=B4X134_9GAMM Length = 657 Score = 124 bits (310), Expect = 8e-27, Method: Composition-based stats. Identities = 51/320 (15%), Positives = 84/320 (26%), Gaps = 36/320 (11%) Query: 131 DLALP-NLKQNQQRQLTEYKTH----RAGYTANG--VPANISVVRSLQNSLARRTAMTAG 183 L LP L T R A G A + V ++ AR + + Sbjct: 168 QLRLPLTLTPRFTPPTEAPHTLDSLLRNTVAAPGGTADAGTASVHIDLDAGARLATLGSP 227 Query: 184 KRRELHALEENLAIISNSEPAQLLEEERLRKEIAELRAKIERVPFIDTFD-LRYKNYEKR 242 + I+ A ++ + L E + F + D Y Sbjct: 228 SHAIHYQRHGRRYTITPKAGAIAMDRDLLLNWELEDTGEPLVTRFHEEIDGEHYALLMVV 287 Query: 243 PDPSSQAVMF-----CLMDVSGSMDQSTKDMAKRFYILLYLFLSRTYKNVEVVYIRHHTQ 297 P + Q ++D SGSM + AK L L + + HT Sbjct: 288 PPKTGQVTALPRETLFIIDSSGSMGGAPMRQAKASLHLALQRLKPGDRFNITDFDSQHTL 347 Query: 298 AKEVD----------EHEF-FYSQETGGTIVSSALKLMDEVVKERYNPAQWNIYAAQASD 346 E +F Q +GGT + AL + +D Sbjct: 348 LFETPVTVSDNSRQQAQDFVDGLQASGGTHMLPALSATLSQPAS----DGYLRQVIFITD 403 Query: 347 GDNWADDSPLCHEILAKKLLPVVRYYSYIEITRRAHQTLWREYEHLQSTFDNFAMQHIRD 406 G A + + L R ++ I + + I D Sbjct: 404 G---AVGNESGIFRALHQQLGEARLFTVG-IGSAPNSHFMTRAAQFGRGSFTY----IND 455 Query: 407 QDDIYPVFRELFHKQNATAK 426 Q+ + LF + + Sbjct: 456 QNQVQQGMDTLFRRLESPLM 475 >UniRef50_A9FTM1 Putative uncharacterized protein n=1 Tax=Sorangium cellulosum 'So ce 56' RepID=A9FTM1_SORC5 Length = 535 Score = 119 bits (297), Expect = 2e-25, Method: Composition-based stats. Identities = 37/261 (14%), Positives = 70/261 (26%), Gaps = 24/261 (9%) Query: 173 SLARRTAMTAGKRRELHALEENLAIISNSEPAQLLEEERLRKEIAELRAKIERVPFIDTF 232 + E L +E LR E + + Sbjct: 133 HVRELLRSGRAPEPWQVRTYEFLNYYRIDYAPP--DEGELRVEPQIEPGEEAGSYAL-QI 189 Query: 233 DLRYKNYEKRPDPSSQAVMFCLMDVSGSMDQSTKDMAKRFYILLYLFLSRTYKNVEVVYI 292 +R + P + + ++D SGSMD K + LS V + Sbjct: 190 GVRSYDPPSPRRPIA---VTFVLDTSGSMDGEPMAREKATVRAVAASLSEGDVVNMVTWN 246 Query: 293 RH----------HTQAKEVDEHEFFYSQETGGTIVSSALKLMDEVVKERYNPAQWNIYAA 342 +GGT + S L++ ++ +E + + N Sbjct: 247 TQNSVILSGHVVDGPDDPALLAAADALSASGGTDLESGLRVGYQLAQEHFEEGRINRVIL 306 Query: 343 QASD-GDNWADDSPLCHEILAKKL-LPVVRYYSYIEITR-RAHQTLWREYEHLQSTFDNF 399 SD G N S + A+ + + L + Sbjct: 307 -VSDGGANVGVTSEELIALHAEDADQEAIYLVGVGTGPALGYNDVLMDAVTDKGRGAYVY 365 Query: 400 AMQHIRDQDDIYPVFRELFHK 420 + D+D+ + +FR+ F + Sbjct: 366 ----LDDEDEAFHMFRDRFAE 382 >UniRef50_C9LLI0 Magnesium-chelatase, subunit D/I family n=1 Tax=Dialister invisus DSM 15470 RepID=C9LLI0_9FIRM Length = 640 Score = 118 bits (296), Expect = 4e-25, Method: Composition-based stats. Identities = 54/357 (15%), Positives = 105/357 (29%), Gaps = 48/357 (13%) Query: 67 GRGGLRHRVHPGNDHFVQNDRIERPQGGGGGSGSGQGQASQDGEGQDEFVFQISK-DEYL 125 +++ D E P G G G + D D F DE + Sbjct: 296 DPDKNNTEKDQESENNDPGDDGEDPSGNHIVEAMGNG-GNNDESSSDMPEFPQGADDEKV 354 Query: 126 DLLFEDLALPNL-KQNQQRQLTEYKTHRAGYTANGVPANISVVRSLQNSLARRTAMTAGK 184 D + LP L QN+++Q T + + T R K Sbjct: 355 DSADLHVTLPPLWIQNEKKQFTPKGSGKRHITR------------SDERQGRYVKAGIPK 402 Query: 185 RRELHALEENLAIISNSEPAQLLEEERLRKEIAELRAKIERVPFIDTFDLRYKNYEKRPD 244 + A + + P Q + + I D+R K EKR Sbjct: 403 GETHDIAID--ATLRAAAPHQKGRQSNGCAVV------------IRHEDIRRKEREKR-- 446 Query: 245 PSSQAVMFCLMDVSGSMDQSTKDMAKR--FYILLYLFLSRTYKNVEVVY-------IRHH 295 + + L+D SGSM + A + + +L + + + + + Sbjct: 447 --TGNIFLFLVDASGSMGARERMKAVKGVVFKMLADAYQKRDRVGMIAFRRDRAEVLLPI 504 Query: 296 TQAKEVDEHEFFYSQETGGTIVSSALKLMDEVVKERYNPA-QWNIYAAQASDGD-----N 349 T++ E + + G T ++ L ++++ Y + +DG N Sbjct: 505 TRSIEFAQKKLAALPTGGKTPLAQGLIKAEDMLDRLYKQDPLQDPVLILITDGRATNSLN 564 Query: 350 WADDSPLCHEILAKKLLPVVRYYSYIEITRRAHQTLWREYEHLQSTFDNFAMQHIRD 406 D A+++ + I+ + + + F + I + Sbjct: 565 KNTDPVRDALSEAERIGHRHMLAAVIDTESGFIKLGLAKELAQKMGASYFHVDKISE 621 >UniRef50_A6DLI7 von Willebrand factor type A domain protein n=1 Tax=Lentisphaera araneosa HTCC2155 RepID=A6DLI7_9BACT Length = 1078 Score = 116 bits (291), Expect = 1e-24, Method: Composition-based stats. Identities = 23/277 (8%), Positives = 64/277 (23%), Gaps = 20/277 (7%) Query: 157 ANGVPANISVVRSLQNSLARRTAMTAGKRRELHALEENLAIISNSEPAQLLEEERLRKEI 216 N I V + + +EE + E + + Sbjct: 648 TNLSTFAIDVDTASYTAARSEIRAGRKVEASHVRIEEFINNFDYHYSVPKKE--AFKIDS 705 Query: 217 AELRAKIERVPFIDTFDLRYKNYEKRPDPSSQAVMFCLMDVSGSMDQSTK-DMAKRFYIL 275 K+ + ++ + ++D SGSM + + ++ Sbjct: 706 ELSDHKVYAGVKLLRVGVQGQRLGADSQKPGSYT--FVIDNSGSMAAENRLPLIQKTLPN 763 Query: 276 LYLFLSRTYKNVE------VVYIRH--HTQAKEVDEHEFFYSQETGGTIVSSALKLMDEV 327 ++ +++ + V + + E + +S ++ ++ Sbjct: 764 MFKAMNQDDEVTILSCEGGVTNLANRITASNHSQLETAVKNIEAGTVANLSVGIEEAYKL 823 Query: 328 VKERYNPAQWNIYAAQASDG-DNWADDSPLCHEILAKKLLP---VVRYYSYIEITRRAHQ 383 + + N SDG + + + + Sbjct: 824 AAQNFRSGAVNRVIL-LSDGIASLGEKEAQEVLKTVSQYRKQGIGNTVIGVG--SEDYDD 880 Query: 384 TLWREYEHLQSTFDNFAMQHIRDQDDIYPVFRELFHK 420 + + F + D + F F Sbjct: 881 SFLETLANKGDGVYYFGDSKEQMNDILVNNFEASFKT 917 >UniRef50_C8SCW7 Putative uncharacterized protein n=1 Tax=Ferroglobus placidus DSM 10642 RepID=C8SCW7_FERPL Length = 403 Score = 114 bits (286), Expect = 5e-24, Method: Composition-based stats. Identities = 44/250 (17%), Positives = 86/250 (34%), Gaps = 20/250 (8%) Query: 109 GEGQDEFVFQISKDEYLDLLFEDLALPNLKQNQQRQLTEYKTHRAGYTANGVPANISVVR 168 G+ + I+ E LD FE+ AL L + + G T + R Sbjct: 102 RSGELKVE-DINFKELLDYFFEE-ALKELIEMGIIE---------GVTKRFFRRKVKFSR 150 Query: 169 SLQNSLARRTAMTAGKRRELHALEENLAIISNSEPAQLLEEERLRKEIAELRAKIERVPF 228 + +A++ K + + E +S +L+E + + + Sbjct: 151 QAERIIAQKVMKEVSKEAKGYYAESEGETLSYIPGYELVEYDEYLHSYDLIDIPETMIRA 210 Query: 229 IDTFDLRYKN---YEKRPDPSSQAVMFCLMDVSGSMDQSTKDMAKRFYILLYLFLSRTYK 285 D + + P + L+DVS SM A + L + + + + Sbjct: 211 AKNEDFEIREKDIVSRNPKKVGKRHFVMLIDVSDSMRGKKIVGAIEAALALKMSIRKGFD 270 Query: 286 NVEVVYIRHHTQAKEVDEHEFFYSQETGGTIVSSALKLMDEVVKERYNPAQWNIYAAQAS 345 ++EV H T+ ++ E + G T ++ ALK ++ + Y + Sbjct: 271 DLEVFVFNHRTE--KIREGDIVNVDVEGRTDIALALKTARNALRGKDGAK----YVILIT 324 Query: 346 DGDNWADDSP 355 DG+ A +P Sbjct: 325 DGEPTASYNP 334 >UniRef50_D2VY88 Predicted protein n=1 Tax=Naegleria gruberi RepID=D2VY88_NAEGR Length = 1082 Score = 114 bits (286), Expect = 5e-24, Method: Composition-based stats. Identities = 35/231 (15%), Positives = 78/231 (33%), Gaps = 23/231 (9%) Query: 199 SNSEPAQLLEEERLRKEIAELRAKIERVPFIDTFDLRYKNYEKRPDPSSQAVMFCLMDVS 258 SN + Q+ E+ R E+ L + + + + P P+ + L DVS Sbjct: 82 SNQKVEQVEEKSLFRIEVEHLIDLDDHDIGVAELTIHVNDPSLNPKPT---LFIALADVS 138 Query: 259 GSMDQSTKDMAKRFYILLYLFLSRTYKNVEVVYIRHHTQAKEVDEH--------EFFYSQ 310 GSM + S + + + + AKE+D + Sbjct: 139 GSMQGRPWEQVCTSLKHFAQQ-SFNNPAIICRMVAYESSAKEIDMKGTLQSIIRNIETAF 197 Query: 311 ETGGTIVSSALKLMDEVVKERYNPAQW-----NIYAAQASDGDNWADDS-PLCHEILAKK 364 GGT +SA +L ++ + N+ +DG++++ P + L+++ Sbjct: 198 TGGGTDFASAFQLACTIITRESGQDRENLPFGNVVITFLTDGEDFSKVGKPGGLQYLSEE 257 Query: 365 L----LPVVRYYSYIEITRRAHQTLWREYEHLQSTFDNFAMQHIRDQDDIY 411 + + ++ + L + + + + D +D+ Sbjct: 258 INRVYRGDITIHTVGF-GSHHNLELLDNIRKVGTIEGAYRYANYDDNNDVI 307 >UniRef50_A8U1E2 Putative uncharacterized protein n=1 Tax=alpha proteobacterium BAL199 RepID=A8U1E2_9PROT Length = 683 Score = 112 bits (279), Expect = 3e-23, Method: Composition-based stats. Identities = 33/181 (18%), Positives = 55/181 (30%), Gaps = 23/181 (12%) Query: 251 MFCLMDVSGSMDQSTKDMAKRFYILLYLFLSRTYKNVEVVY-----------IRHHTQAK 299 + ++DVSGSM AK L R V + +R + Sbjct: 330 VIFVIDVSGSMKGEPLRAAKASLTSGIEGLGRNDTFNVVAFNNKAAAFYDAPVRASGKFH 389 Query: 300 EVDEHEFFYSQETGGTIVSSALKLMDEVVKERYNPAQWNIYAAQASDGDNWADDSPLCHE 359 + GGT +++A +L ++ +P + +DG A + Sbjct: 390 RAALKVIDGLKAGGGTEMAAAFELALQMPG---DPDRLQQVVF-ITDG---AVSNEAALF 442 Query: 360 ILAKKLLPVVRYYSYIEITRRAHQTLWREYEHLQSTFDNFAMQHIRDQDDIYPVFRELFH 419 K L R ++ I + E + I D V R+LF Sbjct: 443 NQIKGELGARRLFTVG-IGSAPNTFFMEEAARFGRGTYTY----IGDTSSAERVMRDLFT 497 Query: 420 K 420 K Sbjct: 498 K 498 >UniRef50_D2RGD5 von Willebrand factor type A n=1 Tax=Archaeoglobus profundus DSM 5631 RepID=D2RGD5_ARCPR Length = 411 Score = 112 bits (279), Expect = 3e-23, Method: Composition-based stats. Identities = 42/259 (16%), Positives = 95/259 (36%), Gaps = 25/259 (9%) Query: 118 QISKDEYLDLLFEDLALPNLKQNQQRQLTEYKTHRAGYTANGVPANISVVRSLQNSLARR 177 +S E ++ +E++ + +LK+ + + G + R+L + + Sbjct: 114 DLSTSELVNYFYEEI-IEDLKKEGYLEDDYF---------RGYKFTKNAERALSKKI-LQ 162 Query: 178 TAMTAGKRRELHALEENLAIISNSEPAQLLEEERLRKEIAELRAKIERVP-FIDTFDLRY 236 ++ + E +S+ +++E + LR + + V + LR+ Sbjct: 163 LSLQDLTGEDFGEHETEKTGVSSFLKNEIVEYDELRHSYDSIDLQETLVKCALRDPSLRF 222 Query: 237 KN---YEKRPDPSSQAVMFCLMDVSGSMDQSTKDMAKRFYILLYLFLSRTYKNVEVVYIR 293 + + V L+DVS SM A + L + ++ E+ + Sbjct: 223 DERDLVAREGKHMEKCVYVMLIDVSDSMRGRRIVGALESALALRKVIKKS-NMDELHVVA 281 Query: 294 HHTQAKEVDEHEFFYSQETGGTIVSSALKLMDEVVKERYNPAQWNIYAAQASDGDNWADD 353 + + +++ + E + G T + ALK E++K+R +DG+ + Sbjct: 282 FNHRVRKIKDEEILNLRTRGRTDIGLALKTAREIIKKRRGSG----VIFLITDGEPTSSY 337 Query: 354 SP-----LCHEILAKKLLP 367 P +C A+KL Sbjct: 338 DPYLTPTMCALREAEKLRK 356 >UniRef50_B1ZQD5 von Willebrand factor type A n=1 Tax=Opitutus terrae PB90-1 RepID=B1ZQD5_OPITP Length = 859 Score = 111 bits (277), Expect = 6e-23, Method: Composition-based stats. Identities = 39/292 (13%), Positives = 82/292 (28%), Gaps = 23/292 (7%) Query: 145 LTEYKTHRAGYTANGVPANISVVRSLQNSLARRTAMTAGKRRELHALEENLAIISNSEPA 204 L + + ++ V A + EE +P Sbjct: 413 LDAPQAEVSTAKEPVSTFSLHVSDVSFQLAQAALARGEMPDPQRIRPEEFYNAFDYGDPT 472 Query: 205 QLLEEERLRKEIAELRAKIERVPFIDTFDLRYKNYEKRPDPSSQAVMFCLMDVSGSMDQS 264 +++ I + + + + ++ + + L+D SGSM+++ Sbjct: 473 PAS-ADKIACRIEQAAHPLLQQRNLVRIAMKVPAAGRGAGQPLNLTV--LLDTSGSMERT 529 Query: 265 TKDM-AKRFYILLYLFLSRTYKNVEVVYIRHHTQAKEVDEHE--------FFYSQETGGT 315 + + +L L+ + + + R E + + TGGT Sbjct: 530 DRATSVRAALGVLASLLTPDDRVTLIGFARQPRLLAESLAGDQARQLVDLASTTPFTGGT 589 Query: 316 IVSSALKLMDEVVKERYNPAQWNIYAAQASDG-DNWADDSPLCHEILAKKLL-PVVRYYS 373 + +AL L E+ + +N A N +DG N + P + L + + + Sbjct: 590 NLEAALSLAGELARRHHNAAAQNRIVL-ITDGAANLGNADPAQLATRIETLRQQGIAFDA 648 Query: 374 YIEITRRAHQTLWREYEHLQSTFDNFAMQHIRDQDDIYPVFRELFHKQNATA 425 T + D F +Q A A Sbjct: 649 CGVGTDGLDDAVLEALTRKGDGRYYVL--------DAPENADAGFARQLAGA 692 >UniRef50_A6GAI6 Putative uncharacterized protein n=1 Tax=Plesiocystis pacifica SIR-1 RepID=A6GAI6_9DELT Length = 560 Score = 111 bits (276), Expect = 7e-23, Method: Composition-based stats. Identities = 28/239 (11%), Positives = 59/239 (24%), Gaps = 20/239 (8%) Query: 193 ENLAIISNSEPAQLLEEERLRKEIAELRAKIERVPFIDTFDLRYKNYEKRPDPSSQAVMF 252 E + E + + + + + F + + P + Sbjct: 165 EFMNYYGFDYDPAADGELSVYAAMNPIEGEGDEARFQMQIGVASELMTPEERPPMNVTLV 224 Query: 253 CLMDVSGSMDQSTKDMAKRFYILLYLFLSRTYKNVEV--------VYIRH--HTQAKEVD 302 +D SGSM + ++ + + L + E+ Sbjct: 225 --LDTSGSMAGTPIELLRETSRAIAAQLKLGDTVSICEWDTSNDWTLAGYAVTGPNDELL 282 Query: 303 EHEFFYSQETGGTIVSSALKLMDEVVKERYNPAQWNIYAAQASD-GDNWADDSPLCHEIL 361 + GGT + L+ E+ + Y+P N SD G N Sbjct: 283 LEKINDVVHGGGTNLYGGLESGYELAQMVYDPDAINRLVL-ISDGGANAGITDLDLIAEN 341 Query: 362 AKKLL-PVVRYYSYIEITR-RAHQTLWREYEHLQSTFDNFAMQHIRDQDDIYPVFRELF 418 A + + L F +++++ F + F Sbjct: 342 AAYGGSDGIYLVGVGVDDPDDYNDELMDAVTDAGKGASVFMPS----EEEVWTTFGDNF 396 >UniRef50_C7PTX8 von Willebrand factor type A n=1 Tax=Chitinophaga pinensis DSM 2588 RepID=C7PTX8_CHIPD Length = 462 Score = 111 bits (276), Expect = 8e-23, Method: Composition-based stats. Identities = 30/187 (16%), Positives = 52/187 (27%), Gaps = 13/187 (6%) Query: 244 DPSSQAVMFCLMDVSGSMDQSTKDMAKRFYILLYLFLSRTYKNVEVVY--------IRHH 295 P + ++D SGSM A++ L L+ T V Y Sbjct: 76 KPRVPLNISLVLDRSGSMSGDKIKYARQAAKFLIDQLNSTDHLSIVNYDDRVEVTSPSQS 135 Query: 296 TQAKEVDEHEFFYSQETGGTIVSSALKLMDEVVKERYNPAQWNIYAAQASDG-DNWADDS 354 + KE + + G T +S + VK N +DG N Sbjct: 136 VKNKEALKAAIDKIHDRGSTNLSGGMLEGYTQVKSTRKEGYVNRVLL-LTDGLANQGITD 194 Query: 355 PLCHEILAKKLLP--VVRYYSYIEITRRAHQTLWREYEHLQSTFDNFAMQHIRDQDDIYP 412 PL + LA+ + ++ + ++ L F + Sbjct: 195 PLELKRLAENKYKEDGIALSTFG-VGADYNEDLLTMLAENGRANYYFIDSPDKIPQIFAG 253 Query: 413 VFRELFH 419 + L Sbjct: 254 ELKGLLS 260 >UniRef50_A4C730 Putative uncharacterized protein n=1 Tax=Pseudoalteromonas tunicata D2 RepID=A4C730_9GAMM Length = 684 Score = 111 bits (276), Expect = 8e-23, Method: Composition-based stats. Identities = 29/187 (15%), Positives = 56/187 (29%), Gaps = 23/187 (12%) Query: 251 MFCLMDVSGSMDQSTKDMAKRFYILLYLFLSRTYKNVEVVYIR--HHTQAKEVDEHEFF- 307 + ++D SGSM + + AK L + + + A+ + ++F Sbjct: 332 VIFVIDTSGSMHGESLEQAKSALFFALANLDPQDSFNIIEFNSKVNALNAQALPANDFNI 391 Query: 308 --------YSQETGGTIVSSALKLMDEVVKERYNPAQWNIYAAQASDGDNWADDSPLCHE 359 + GGT + A + + + A + +DG + + Sbjct: 392 RRARNFVYGLKADGGTEIGLAFEQVLD----NSEHADYLRQIVFLTDG---SISNETEVF 444 Query: 360 ILAKKLLPVVRYYSYIEITRRAHQTLWREYEHLQSTFDNFAMQHIRDQDDIYPVFRELFH 419 K L R ++ I + L F I D D+ + LF Sbjct: 445 AQIKGSLGDSRIFTIG-IGSAPNSYFMTRAATLGRGTFTF----IGDVTDVQRTMKNLFV 499 Query: 420 KQNATAK 426 + A Sbjct: 500 QLANAAL 506 >UniRef50_B8HNU4 von Willebrand factor type A n=4 Tax=Chroococcales RepID=B8HNU4_CYAP4 Length = 421 Score = 110 bits (275), Expect = 1e-22, Method: Composition-based stats. Identities = 26/187 (13%), Positives = 62/187 (33%), Gaps = 21/187 (11%) Query: 245 PSSQAVMFCLMDVSGSMDQSTKDMAKRFYILLYLFLSRTYKNVEVVY--------IRHHT 296 ++ + ++D SGSM + KR L L + + +V+ Sbjct: 38 RNAPLNLGLILDHSGSMAGQPLETVKRAAQKLVDRLLPSDRLAVIVFDHVAKVLIPNQPV 97 Query: 297 QAKEVDEHEFFYSQETGGTIVSSALKLMD-EVVKERYNPAQWNIYAAQASDGDNWADDSP 355 ++ + + GGT + L+L E++ + +DG+N ++ Sbjct: 98 TDRDKIKTRISHLAAMGGTAIDEGLQLGLTELIAAKAGAISQ---IFLLTDGENEHGNNS 154 Query: 356 LCHEILAKKLLP--VVRYYSYIEITRRAHQTLWREYEHLQSTFDNFAMQHIRDQDDIYPV 413 C ++ + + + +Q + + ++ I D+ Sbjct: 155 RCLQLAEEAAKENITLNTLGFGY---HWNQDVLEQIADAAGG----SLMFIEYPQDVLIG 207 Query: 414 FRELFHK 420 F LF++ Sbjct: 208 FERLFNQ 214 >UniRef50_Q55874 Uncharacterized protein sll0103 n=16 Tax=Cyanobacteria RepID=Y103_SYNY3 Length = 420 Score = 110 bits (274), Expect = 1e-22, Method: Composition-based stats. Identities = 34/206 (16%), Positives = 64/206 (31%), Gaps = 31/206 (15%) Query: 230 DTFDLRYKNYEKRPDPSSQAVMFCLMDVSGSMDQSTKDMAKRFYILLYLFLSRTYKNVEV 289 + K + + +D SGSMD + K + L L + Sbjct: 25 LRIAVAAKADDHDRRLPLNLCLV--LDHSGSMDGQPLETVKSAALGLIDRLEEDDRLS-- 80 Query: 290 VYIRHHTQAKEVDEH-----------EFFYSQETGGTIVSSALKLMDEVVKERYNPAQWN 338 I +AK V E+ + GGT + LKL + + + + Sbjct: 81 -VIAFDHRAKIVIENQQVRNGAAIAKAIERLKAEGGTAIDEGLKLGIQEAAK----GKED 135 Query: 339 IY--AAQASDGDNWADDSPLCHE--ILAKKLLPVVRYYSYIEITRRAHQTLWREYEHLQS 394 +DG+N D+ C + +A V + + +Q + Sbjct: 136 RVSHIFLLTDGENEHGDNDRCLKLGTVASDYKLTVHTLGFGD---HWNQDVLEAIAASAQ 192 Query: 395 TFDNFAMQHIRDQDDIYPVFRELFHK 420 ++ I + + FR+LF + Sbjct: 193 GSLSY----IENPSEALHTFRQLFQR 214 >UniRef50_A9GUK8 Putative uncharacterized protein yfbK n=1 Tax=Sorangium cellulosum 'So ce 56' RepID=A9GUK8_SORC5 Length = 521 Score = 109 bits (273), Expect = 2e-22, Method: Composition-based stats. Identities = 35/263 (13%), Positives = 62/263 (23%), Gaps = 25/263 (9%) Query: 175 ARRTAMTAGKRRELHALEENLAIISNSEPAQLLEEERLRKE--------IAELRAKIERV 226 + + A PA E+ I+ + ++ Sbjct: 56 RQILEDGEIPGPDTLDDVGFFAEHKLDYPAATCGEDVCMHGLLGIMGNMISGSPCTLIQI 115 Query: 227 PFIDTFDLRYKNYEKRPDPSSQAVMFCLMDVSGSMDQSTKDMAKRFYILLYLFLSRTYKN 286 DL P + +D SGSM+ + + + L T + Sbjct: 116 GMNSPVDLGA-----LERPP--LHLVIAVDTSGSMEGDPIAYVRAGLVEMIDALQPTDRI 168 Query: 287 VEVVYIRHH--------TQAKEVDEHEFFYSQETGGTIVSSALKLMDEVVKERYNPAQWN 338 V Y +E F G T + L + ++ +PA N Sbjct: 169 SLVRYSDAAEVVLEQAEGSDREALTEAFEGLTARGSTNLYEGLFTAYALAEQHLDPAWQN 228 Query: 339 IYAAQASDG-DNWADDSPLCHEILAKKLLPVVRYYSYIEITRRAHQTLWREYEHLQSTFD 397 SDG SP LA + I + R + + Sbjct: 229 RVIF-LSDGVATAGLTSPQRLVSLAAGYAEKGIGLTAIGVGAEFDVDAMRGISEVGAGNF 287 Query: 398 NFAMQHIRDQDDIYPVFRELFHK 420 F ++ + Sbjct: 288 YFLEDPKAVEEVFAEEVKTFLVP 310 >UniRef50_A1ZRP2 Von Willebrand factor type A domain protein n=1 Tax=Microscilla marina ATCC 23134 RepID=A1ZRP2_9SPHI Length = 1088 Score = 109 bits (271), Expect = 2e-22, Method: Composition-based stats. Identities = 26/163 (15%), Positives = 51/163 (31%), Gaps = 18/163 (11%) Query: 247 SQAVMFCLMDVSGSMDQSTK-DMAKRFYILLYLFLSRTYKNVEVVYIRH--------HTQ 297 + + L+DVSGSM K + K + L + V+Y Sbjct: 910 AHNNLMLLLDVSGSMSSKDKLPLLKESFKYLISIMRPQDDVSIVIYAGDAAIVLKPTSAS 969 Query: 298 AKEVDEHEFFYSQETGGTIVSSALKLMDEVVKERYNPAQWNIYAAQASDGDNWADDSPLC 357 +E + G T V + KL + + + + N A+DG+ + Sbjct: 970 NQEQINAVIDKLRSRGKTNVKAGFKLAYKWMSKNFKEGGNNRIIL-ATDGE-FPIS--KY 1025 Query: 358 HEILAKKLLPV---VRYYSYIEITRRAHQTLWREYEHLQSTFD 397 L +K + +S+ +T++ + Sbjct: 1026 IYKLVEKRATKGINLSVFSFGSMTKKF--ETLEKLVAKGKGNY 1066 Score = 91.7 bits (226), Expect = 4e-17, Method: Composition-based stats. Identities = 28/138 (20%), Positives = 45/138 (32%), Gaps = 10/138 (7%) Query: 247 SQAVMFCLMDVSGSMDQSTKDMAKRFYILLYLFLSRTYKNVEVVYIRH------HTQAK- 299 + + L+DVSGSM M K L + K VV+ T AK Sbjct: 683 AHNNLMLLLDVSGSMKNE-LPMLKSALKYLVNIMRPEDKVSVVVFGSEAKLMLRPTSAKY 741 Query: 300 -EVDEHEFFYSQETGGTIVSSALKLMDEVVKERYNPAQWNIYAAQASDGDNWADDSPLCH 358 + +G T + LKL + ++ Y N ASDG+ Sbjct: 742 KAQIMQAIDTLKSSGRTNGEAGLKLAYQWIQNNYKNNNNNRIIL-ASDGEFSISKGLYQM 800 Query: 359 EILAKKLLPVVRYYSYIE 376 + + +S+ + Sbjct: 801 IEQKAEESIALSVFSFAD 818 >UniRef50_Q9ZGE6 Magnesium-chelatase 67 kDa subunit n=2 Tax=Heliobacteriaceae RepID=BCHD_HELMO Length = 666 Score = 109 bits (271), Expect = 2e-22, Method: Composition-based stats. Identities = 43/277 (15%), Positives = 90/277 (32%), Gaps = 35/277 (12%) Query: 108 DGEGQDEFVFQISKDEYLDLLFEDLALPNLKQNQQRQLTEYKTHRAGYTANGVPANISVV 167 + + + + + F+ +P + Q + R G A+G ++ Sbjct: 356 ETPPDEAPKDEQTLQLPEEFFFDAEEVPMEDELLSLQNKVQRQARGG--AHGKQKSLERG 413 Query: 168 RSLQNSLARRTAMTAGKRRELHALEENLAIISNSEPAQLLEEERLRKEIAELRAKIERVP 227 R + A+ + + + Q E + +R Sbjct: 414 RYAR-------ALLPPPGKNSRVAVDATLRAAAPYQRQRRESGQYG----------DRQV 456 Query: 228 FIDTFDLRYKNYEKRPDPSSQAVMFCLMDVSGSMDQSTKDMAKRFYILLYLFLS-RTYKN 286 + D+R K + ++ S A++ ++D SGSM + AK +L K Sbjct: 457 IVTNSDIRAKQFVRK----SGALIIFVVDASGSMAFNRMSSAKGAVSVLLNEAYVNRDKV 512 Query: 287 VEVVYIRH-------HTQAKEVDEHEFFYSQETGGTIVSSALKLMDEVVKERYNPAQWNI 339 +++ T++ E+ + F GG+ ++ A+ EV + Sbjct: 513 ALIIFRGQQAETLVPPTRSVELAKKRFDQVPVGGGSPLAGAIAQAIEVGVNSIGSDVGQV 572 Query: 340 YAAQASDGD-NWADD---SPLCHEILAKKLLPVVRYY 372 +DG N D P E L +++L + R Sbjct: 573 IITLITDGRGNVPMDPQAGPKNREQLNEEILALSRLV 609 >UniRef50_Q22SJ4 von Willebrand factor type A domain containing protein n=8 Tax=Tetrahymena thermophila RepID=Q22SJ4_TETTH Length = 646 Score = 108 bits (270), Expect = 3e-22, Method: Composition-based stats. Identities = 29/211 (13%), Positives = 66/211 (31%), Gaps = 30/211 (14%) Query: 230 DTFDLRYKNYEKRPDPSSQAVMFCLMDVSGSMDQSTKDMAKRFYILLYLFLSRTYKNVEV 289 D + + + ++ + C++D SGSM K + L L+ + + Sbjct: 189 DILEEQKEQVKQVEQSRPSIDLVCVIDNSGSMQGEKIQNVKTTLLQLLDMLNSNDRLSLI 248 Query: 290 VYIRHHT---QAKEVDEHE-------FFYSQETGGTIVSSALKLMDEVVKERYNPAQWNI 339 ++ + T ++VD+ GGT ++S + + ++++R +N Sbjct: 249 LFNSYPTLLCNLRKVDDENTPNIQSIINSITADGGTDINSGMLMAFNILQKR---QFFNP 305 Query: 340 Y--AAQASDGDNWADDSPLCHEILAKKLLP----VVRYYSYIEITRRAHQTLWREYEHLQ 393 SDG + D + I + + L + + + L L+ Sbjct: 306 VSSIFLLSDGQDNGADEKIKKYINSNQSLKNECFSIHSFGFG---SDHDGPLMNRICQLK 362 Query: 394 STFDNFAMQHIRDQDDIYPVFRELFHKQNAT 424 + + E F Sbjct: 363 DGNFYYV--------EKINQVDEFFVDALGG 385 >UniRef50_Q9U7P4 Putative uncharacterized protein (Fragment) n=1 Tax=Eufolliculina uhligi RepID=Q9U7P4_9CILI Length = 494 Score = 108 bits (270), Expect = 4e-22, Method: Composition-based stats. Identities = 28/194 (14%), Positives = 62/194 (31%), Gaps = 19/194 (9%) Query: 243 PDPSSQAVMFCLMDVSGSMDQSTKDMAKRFYILLYLFLSRTYKNVEVVYIRHHTQA---- 298 S + C++DVSGSM + + + LS + + + T+ Sbjct: 83 EASRSGVDIVCVIDVSGSMQGEKIQLVQTTLNFMVERLSPADRICLISFSNDATKISRLV 142 Query: 299 ------KEVDEHEFFYSQETGGTIVSSALKLMDEVVKERYNPAQWNIYAAQASDG-DNWA 351 K+ + +GGT + L+ + +++R Q + SDG DN Sbjct: 143 QMSPKGKKQLKSMIPRLVASGGTNIVGGLEYGLQALRQRRTINQLSSIIL-LSDGQDNNG 201 Query: 352 DDSPLCHEILAKK--LLPVVRYYSYIEITRRAHQTLWREYEHLQSTFDNFAMQHIRDQDD 409 + + +++ TL ++ + ++D++ Sbjct: 202 TTVLQRAKATMDSIVIRDDYSVHTFGYGHG-HDSTLLNALAEPKNGAFYY----VKDEET 256 Query: 410 IYPVFRELFHKQNA 423 I F + + Sbjct: 257 IATAFANCLGELMS 270 >UniRef50_Q231J4 von Willebrand factor type A domain containing protein n=1 Tax=Tetrahymena thermophila SB210 RepID=Q231J4_TETTH Length = 520 Score = 107 bits (267), Expect = 9e-22, Method: Composition-based stats. Identities = 33/208 (15%), Positives = 66/208 (31%), Gaps = 18/208 (8%) Query: 229 IDTFDLRYKNYEKRPDPSSQAVMFCLMDVSGSMDQSTKDMAKRFYILLYLFLSRTYKNVE 288 ++ D +K + ++D SGSM ++ K+ + + + + Sbjct: 76 VEAKDFDADQVKKDKVRYQPLDLIFVIDTSGSMQGKKIELVKKSILQVLHIIQGDDRISL 135 Query: 289 VVYIRHHTQAKEVD----------EHEFFYSQETGGTIVSSALKLMDEVVKERYNPAQWN 338 V + E+ + Q GGT + ++ +++KER ++ Sbjct: 136 VGFNSQAKVLLELTQLTKNSKKKIQKTVDELQAGGGTQIGFGMQKAFDIIKER-TNSKNL 194 Query: 339 IYAAQASDG-DNWADDSPLCHEILAK-KLLPVVRYYSYIEITRRAHQTLWREYEHLQSTF 396 SDG DN +K + + + + + + LQ Sbjct: 195 ASIFLLSDGQDNCGFSQTQHFMNQSKIEYPFCIDCFGFGD---DHDSLTLSKINQLQQGT 251 Query: 397 DNFA--MQHIRDQDDIYPVFRELFHKQN 422 NF + I D I + F QN Sbjct: 252 FNFIRDISQIDDAFTIILAGIKTFVAQN 279 >UniRef50_D2EEA7 Putative uncharacterized protein n=1 Tax=Candidatus Parvarchaeum acidiphilum ARMAN-4 RepID=D2EEA7_9EURY Length = 373 Score = 107 bits (266), Expect = 1e-21, Method: Composition-based stats. Identities = 48/204 (23%), Positives = 91/204 (44%), Gaps = 23/204 (11%) Query: 231 TFDLRYKNYEKRPDPSSQAVMFCLMDVSGSMDQSTKDMAKRFYILLYLFLSRTYKN-VEV 289 D+RY E++ P+ A + D+SGSM+ + A L+ +L Y++ V++ Sbjct: 172 DEDIRYNLIEEKLIPNLSATFVIMRDISGSMEMYGEFSA-TIAGLIEFWLKEKYEHTVKI 230 Query: 290 VYIRHHTQAKEVD---EHEFFYSQETGGTIVSSALKLMDEV-----------VKERYNPA 335 Y+ H +A E D +FF +GGT + A KL+ ++ KER + Sbjct: 231 RYVAHTDEAFEYDPRKREDFFKLSSSGGTAFNPAYKLVIDMTDGASYKSNSPYKERIDYQ 290 Query: 336 QWNIYAAQASDGDNWADDSPLCHEILAKKLLPVVRYYSYIEITRRAHQTLWREYEHLQST 395 +++ +DGDN+ + E L KKL P + Y+++ + Y ++S Sbjct: 291 SEDVFLLHITDGDNYNGEDEAVRETL-KKLFPRLTKVFYLQVGGYSDSF----YNLIKSV 345 Query: 396 FDNFAMQHIRDQDDI-YPVFRELF 418 + ++ +DI Y +++ Sbjct: 346 DPE-KLSEVKSGNDISYNNVKKVL 368 >UniRef50_B9SJS6 Protein binding protein, putative n=6 Tax=Magnoliophyta RepID=B9SJS6_RICCO Length = 540 Score = 106 bits (265), Expect = 1e-21, Method: Composition-based stats. Identities = 37/251 (14%), Positives = 76/251 (30%), Gaps = 22/251 (8%) Query: 156 TANGVPANISVVRSLQNSLARRTAMTAGKRRELHALEENLAIISNSEPAQLLEEERLRKE 215 G AN+ + L + + + +E + S P + +LR Sbjct: 10 RRRGRRANLLAGGEAEQKLPLVPLLPPPLKMSSNDDDEKIVTRSRPTPPIVPARVKLR-S 68 Query: 216 IAELRAKIERVPFIDTFDLRYKNYEKRPDPSSQAVMFCLMDVSGSMDQSTKDMAKRFYIL 275 I A +E +L + P + ++DVS SM+ + K + Sbjct: 69 INNDMAPLEESKLKVMLELTGGDSSSYGRPGLD--LVAVLDVSRSMEGDKMEKMKTAMLF 126 Query: 276 LYLFLSRTYKNVEVVYIR----------HHTQAKEVDEHEFFYSQETGGTIVSSALKLMD 325 + L T + V + +++E E+ G T +++ L+ Sbjct: 127 IIKKLGPTDRLSIVTFSGGANRLCPLRQTTGKSQEEFENLINGLNADGATNITAGLQTAL 186 Query: 326 EVVKERYNPAQWNIYAAQASDGD-NWADDSPLCHEILAKKLLPVVRYYSYIEITRRAHQT 384 +V+K R + + SDG+ N D+ + + + Sbjct: 187 KVLKGRSFNGERVVGIMLMSDGEQNAGSDATGVSVGNV-----PIHTFGFGI---NHEPK 238 Query: 385 LWREYEHLQST 395 + H Sbjct: 239 GLKAIAHNSIG 249 >UniRef50_D0MZH7 Putative uncharacterized protein n=1 Tax=Phytophthora infestans T30-4 RepID=D0MZH7_PHYIN Length = 1850 Score = 106 bits (265), Expect = 1e-21, Method: Composition-based stats. Identities = 44/294 (14%), Positives = 94/294 (31%), Gaps = 42/294 (14%) Query: 28 AQIKQSISEAINKRSVTDVDSGESV---SIPTEDISEPMFHQGRGGLRHRVHPGNDHFVQ 84 ++ +++E + ++ G V S P + P G+ P ND V Sbjct: 1455 ERMYGTMAELASDGTLELKVDGSRVGGPSTPKTGLDTPK--YGKD------DPNNDPHVG 1506 Query: 85 NDRIERPQGGGGGSGSGQGQASQDGEGQDEFVFQISKDEYLDLLFED------LALPNLK 138 + GG +G G + V Q+S+++ ++ E +A L Sbjct: 1507 GNTWAGGTGGSDTAGLGGRGGPYRLDKGH-PVHQVSQEKKDEVSAEARAKARAMAQEALA 1565 Query: 139 QNQQRQLTEYKTHRAGYTANGVPANISVVRSLQNSLARRTAMTAGKRRELHALEENLAII 198 + R++ + Y + + R +A L A+ + + Sbjct: 1566 EK-LREIDMSEREWETYQ------------TYFKRVERESAQLRAVLANLEAVAQERNWL 1612 Query: 199 SNSEPAQLLEEERLRKEIAELRAKIERVPFIDTFDLRYKNYEKRPDPSSQAVMFCLMDVS 258 + +L + + + + + D ++ M +MDVS Sbjct: 1613 RHQSSGELDDGKL----VDGVAGERLVFKRRGVRDSPFQAPAGHQQEQEPKRMVFVMDVS 1668 Query: 259 GSM------DQSTKDMAKRFYILLYLFLSRTYKNVEVVYIRHHTQAKEVDEHEF 306 GSM D + M + +++ F + ++ H + E+ EF Sbjct: 1669 GSMYRFNGQDSRLERMLETSLMIMESFAGFE-RELDYCIFGHSGDSPEIPFVEF 1721 >UniRef50_B5JCH3 Putative uncharacterized protein (Fragment) n=1 Tax=Octadecabacter antarcticus 307 RepID=B5JCH3_9RHOB Length = 197 Score = 106 bits (265), Expect = 1e-21, Method: Composition-based stats. Identities = 23/165 (13%), Positives = 45/165 (27%), Gaps = 4/165 (2%) Query: 102 QGQASQDGEGQDEFVFQISKDEYLDLLFEDLALPNLKQNQQRQLTEYKTHRAGYTANGVP 161 A D G + + S D ++ +L + + Sbjct: 37 AAPAGNDMAGNLRSMAEPSNDGFVHVLRDGSTFYEEYDETFAN-DTPNPLKITSDEPVST 95 Query: 162 ANISVVRSLQNSLARRTAMTAGKRRELHALEENLAIISNSEPAQLLEEERLRKEIAELRA 221 +I V + + + +EE + + PA E R I Sbjct: 96 FSIDVDTAAYALIRSSLTRGQLPPTDAVRIEEMINYFPYAYPAPEGE-APFRPTINVFET 154 Query: 222 KIERVPFIDTFDLRYKNYEKRPDPSSQAVMFCLMDVSGSMDQSTK 266 + ++ + P + L+D SGSM+ + K Sbjct: 155 PWNADTQLVHIGIQGEMPAIEDRPPLN--LVFLIDTSGSMESADK 197 >UniRef50_C6VUB8 von Willebrand factor type A n=1 Tax=Dyadobacter fermentans DSM 18053 RepID=C6VUB8_DYAFD Length = 935 Score = 106 bits (264), Expect = 2e-21, Method: Composition-based stats. Identities = 30/215 (13%), Positives = 55/215 (25%), Gaps = 22/215 (10%) Query: 213 RKEIAELRAKIERVPFIDTFDLRYKNYEKRPDPSSQAVMFCLMDVSGSMDQS-TKDMAKR 271 R + + T L M L+DVS SM+ + KR Sbjct: 730 RVRVDTVYVDRNAQLQNVTRSLDGFAPN---------NMVLLLDVSSSMNSPYKMPLLKR 780 Query: 272 FYILLYLFLSRTYKNVEVVYIRHHTQAKEVDEHE--------FFYSQETGGTIVSSALKL 323 L + V+Y + Q G T + +KL Sbjct: 781 SIKSLLTLVRPEDMISIVLYSGKARVVLKPTSGAKASEISRMIDLLQSDGDTDGNEGIKL 840 Query: 324 MDEVVKERYNPAQWNIYAAQASDGDNWADDSPLCHEILAKKLLPVVRYYSYIEITRRAHQ 383 + ++Y N A+DG+ D + + + +++ Sbjct: 841 AYKTANKQYIRGGNNRIVL-ATDGEFPVSDEVMDMIRQNARQDVYLSIFTFG--RHEHTG 897 Query: 384 TLWREYEHLQSTFDNFAMQHIRDQDDIYPV-FREL 417 ++ L D I ++L Sbjct: 898 QKLKKLSELGMGSYAHVTDASADLQLILEAQAKKL 932 >UniRef50_A0CDA1 Chromosome undetermined scaffold_17, whole genome shotgun sequence n=1 Tax=Paramecium tetraurelia RepID=A0CDA1_PARTE Length = 604 Score = 106 bits (264), Expect = 2e-21, Method: Composition-based stats. Identities = 29/178 (16%), Positives = 59/178 (33%), Gaps = 22/178 (12%) Query: 237 KNYEKRPDPSSQAVMFCLMDVSGSMDQSTKDMAKRFYILLYLFLSRTYKNVEVVY---IR 293 N +++ + C++D SGSM +M K+ +L FL + + + + Sbjct: 173 FNMDQQQHSKVGVDLLCVIDRSGSMSGEKIEMVKQTLNILLNFLGPKDRLCLIQFDDTCQ 232 Query: 294 HHTQAKEVDEHE-------FFYSQETGGTIVSSALKLMDEVVKERYNPAQWNIY-AAQAS 345 T + V + GGT++ ++ + + +Y + N+ S Sbjct: 233 RLTNLRRVTDENKTYYSDIISKIYANGGTVIGLGTQMALKQI--KYRKSVNNVTAIFVLS 290 Query: 346 DGDNWADDSPLCHEILAKKL---LPVVRYYSYIEITRRAHQTLWREYEHLQSTFDNFA 400 DG L K+L + +S+ L + +L F Sbjct: 291 DGQ-----DEAAISSLQKQLAYYKQTLTIHSFGF-GSDHDAKLMTKISNLGKGSFYFV 342 >UniRef50_Q15NW6 Vault protein inter-alpha-trypsin n=1 Tax=Pseudoalteromonas atlantica T6c RepID=Q15NW6_PSEA6 Length = 701 Score = 106 bits (264), Expect = 2e-21, Method: Composition-based stats. Identities = 29/192 (15%), Positives = 50/192 (26%), Gaps = 24/192 (12%) Query: 251 MFCLMDVSGSMDQSTKDMAKRFYILLYLFLSRTYKNVEVVYI---------RHHTQAKEV 301 + L+D SGSM + AKR L + + AK + Sbjct: 305 VVFLLDTSGSMAGESIVQAKRAVDFALTQLRPEDNVNIIQFNDAPQALWKRAMPATAKHI 364 Query: 302 DEHE--FFYSQETGGTIVSSALKLMDEVVKERYNPAQW-----NIYAAQASDGDNWADDS 354 GGT ++ AL L + + +DG + + Sbjct: 365 QRARNWVASLHADGGTEMAPALTLALNKPSLHRDDSDLLGSHKLRQVVFITDG---SVSN 421 Query: 355 PLCHEILAKKLLPVVRYYSYIEITRRAHQTLWREYEHLQSTFDNFAMQHIRDQDDIYPVF 414 L + L R ++ I + + + I D + Sbjct: 422 EDALMSLIESKLADNRLFTIG-IGSAPNSYFMTQAAQAGRGTFTY----IGDIQQVQHKM 476 Query: 415 RELFHKQNATAK 426 LF+K Sbjct: 477 TALFNKLTRPVM 488 >UniRef50_A6GDG5 Putative lipoprotein n=1 Tax=Plesiocystis pacifica SIR-1 RepID=A6GDG5_9DELT Length = 486 Score = 106 bits (263), Expect = 2e-21, Method: Composition-based stats. Identities = 29/248 (11%), Positives = 57/248 (22%), Gaps = 28/248 (11%) Query: 193 ENLAIISNSEPAQLLEEERLRKEIAELRAKIERVPFIDTFDLRYKNYEKRPDPSSQAVMF 252 E L S PA + + + F + + + Sbjct: 106 EFLNYYSFEYPA----ADPGDLSVHVDLRSKDEGRFQLQIGVASEIVSPSERLPMNITLV 161 Query: 253 CLMDVSGSMDQSTKDMAKRFYILLYLFLSRTYKNVEV--------VYIRHH--TQAKEVD 302 +D S SM + K + L V H Sbjct: 162 --LDESTSMTGAPMYAMKATARAIAGSLREGDVISLVSWSNSNNVRLASHAVAGSNDATL 219 Query: 303 EHEFFYSQETGGTIVSSALKLMDEVVKERYNPAQWNIYAAQASD-GDNWADDSPLCHEIL 361 + GGT + + L+ + + ++ + N SD G N + Sbjct: 220 LDTIDAIEPGGGTDLHAGLEQGYALAQANFSADRINRVVL-VSDGGANLGFTDAELIAQM 278 Query: 362 AK-KLLPVVRY-YSYIEITRRAHQTLWREYEHLQSTFDNFAMQHIRDQDDIYPVFRELFH 419 A+ + + + R + L F +F Sbjct: 279 AELEDGEGIYMVGVGVGDVGRYNDELMDTVTDQGKGASVFIP--------NEAEAERMFG 330 Query: 420 KQNATAKG 427 ++ + G Sbjct: 331 ERFMSTMG 338 >UniRef50_A0C051 Chromosome undetermined scaffold_14, whole genome shotgun sequence n=3 Tax=Paramecium tetraurelia RepID=A0C051_PARTE Length = 636 Score = 104 bits (260), Expect = 5e-21, Method: Composition-based stats. Identities = 29/170 (17%), Positives = 58/170 (34%), Gaps = 17/170 (10%) Query: 242 RPDPSSQAVMFCLMDVSGSMDQSTKDMAKRFYILLYLFLSRTYKNVEVVYIRHH---TQA 298 + + + CL+D+SGSM +M K I+L FL + + + T Sbjct: 153 QKNQRVGVDLICLIDISGSMIGVKIEMVKASLIVLLQFLGDNDRLQLITFDNDAHRLTPL 212 Query: 299 KEVDEHE-------FFYSQETGGTIVSSALKLM-DEVVKERYNPAQWNIYAAQASDGDNW 350 K V + GG +S A K+ ++ +Y ++ SDG ++ Sbjct: 213 KTVTNQNKSYFTQIIKQIKANGGNRISEATKMAFYQLKSRKYINNVTSV--FLLSDGVDY 270 Query: 351 ADDSPLCHEILAKKLLPVVRYYSYIEITRRAHQTLWREYEHLQSTFDNFA 400 ++ + + + E + + +L+S F Sbjct: 271 TYPEVKNQIQTVNEVFT-LHTFGFGE---DHDAQMMTQLCNLKSGSFYFV 316 >UniRef50_Q1YZ74 Inter-alpha-trypsin inhibitor domain protein n=1 Tax=Photobacterium profundum 3TCK RepID=Q1YZ74_PHOPR Length = 714 Score = 104 bits (259), Expect = 6e-21, Method: Composition-based stats. Identities = 31/182 (17%), Positives = 58/182 (31%), Gaps = 20/182 (10%) Query: 251 MFCLMDVSGSMDQSTKDMAKRFYILLYLFLSRTYKNVEVVY----IRHHTQAKEVDEHEF 306 + ++D+SGSM + + AK+ L V + + + Q V Sbjct: 335 VTFVLDISGSMYGESIEQAKQALRYGLQQLQPEDSFNIVTFNHEAMLYSEQLLPVTSSTI 394 Query: 307 -------FYSQETGGTIVSSALKLMDEVVKE-RYNPAQWNIYAAQASDGDNWADDSPLCH 358 GGT +++ALK + + N +W +DG + + Sbjct: 395 TRALRFVDGLDADGGTEMAAALKAAFSIKTHDQLNSTRWLNQIVFITDG---SVGNESAL 451 Query: 359 EILAKKLLPVVRYYSYIEITRRAHQTLWREYEHLQSTFDNFAMQHIRDQDDIYPVFRELF 418 L ++ L R ++ I + + I D ++ R LF Sbjct: 452 FDLIEQQLVDRRLFTVG-IGSAPNSYFMTRAAMKGKGTYTY----IGDVKEVNTKMRLLF 506 Query: 419 HK 420 K Sbjct: 507 SK 508 >UniRef50_UPI0001AEB92D inter-alpha-trypsin inhibitor domain-containing protein n=1 Tax=Alteromonas macleodii ATCC 27126 RepID=UPI0001AEB92D Length = 586 Score = 104 bits (258), Expect = 9e-21, Method: Composition-based stats. Identities = 34/231 (14%), Positives = 59/231 (25%), Gaps = 23/231 (9%) Query: 208 EEERLRKEIAELRAKIERVPFIDTFDLRYKNYEKRPDPSSQAVMFCLMDVSGSMDQSTKD 267 + A + + + L + + + ++D SGSM Sbjct: 273 QPSPSSAPQAAIFTESKGQHDYALVMLMPPQVKSQDLQDFDRDITFVIDTSGSMGGRPIV 332 Query: 268 MAKRFYILLYLFLSRTYKNVEVVYIRHHTQAKEVD-----------EHEFFYSQETGGTI 316 AK L LS + V + T+ E + GGT Sbjct: 333 DAKESLQLAIDRLSEKDRFNVVAFNNDTTRLFETSVEGTTRNKQYARDFVKHLNAGGGTE 392 Query: 317 VSSALKLMDEVVKERYNPAQWNIYAAQASDGDNWADDSPLCHEILAKKLLPVVRYYSYIE 376 ++ AL +K +DG A + K L R ++ Sbjct: 393 MAPALNAA---LKRTTTKDFIKQVVF-ITDG---AVGNEAALFSQIKNELGDARLFTVG- 444 Query: 377 ITRRAHQTLWREYEHLQSTFDNFAMQHIRDQDDIYPVFRELFHKQNATAKG 427 I + F +R+ DI L +K + Sbjct: 445 IGSAPNSYFMTRAAQFGLGSYVF----VRNTADIKQQMDSLLYKLESPVLS 491 >UniRef50_Q23KK4 von Willebrand factor type A domain containing protein n=1 Tax=Tetrahymena thermophila SB210 RepID=Q23KK4_TETTH Length = 1085 Score = 102 bits (254), Expect = 3e-20, Method: Composition-based stats. Identities = 43/318 (13%), Positives = 106/318 (33%), Gaps = 52/318 (16%) Query: 129 FEDLALPNLKQNQQRQLTEYKTHRAGYTANGVPANISVVRSLQNSL------ARRTAMTA 182 F++ +L ++N + + + + I + + +N + + Sbjct: 325 FQEASLLITQKNSMISTHQTRQKLLNSDIQQIESKIKKLEADKNKQVEDLEARKDKYLKQ 384 Query: 183 GKRRELHALEENLAIISNSEPAQLLEEERLRKEIAELRAKIERVPFIDTFDLRYKNYEKR 242 + + + E A +Q + + I L +K + +Y +Y Sbjct: 385 LEEEKAKLIREKSAFWDKKNSSQEARLSQYSQSINSLNSKYPLGKMCSIVEKKYFHY--- 441 Query: 243 PDPSSQAVMFCLMDVSGSMDQSTKDMAKRFYILLYLFLSRTYKNVEVVYIRHHTQAKE-- 300 + + D SGS + + L Y + YI+ + + Sbjct: 442 ---------YFIQDESGSFSNDHQYAIQGVAQLFNRIKPNDY----ITYIKFDSSSHVDI 488 Query: 301 -------VDEHEFFYSQE---TGGTIVSSALKLMDEVVKERYNPAQWNIYAAQASDG-DN 349 + + +F + GGT SA + + + ++ +Y+ ++ + +DG DN Sbjct: 489 PKTLKSSLSQGDFISKIQKCRGGGTNFQSAFQTLLQQIQSKYDQQEYPVVIF-ITDGQDN 547 Query: 350 WADDSPLCHEILAKKLLPVVRYY--SYIEITRRAHQTLWREYEHLQSTFDNFAMQHIRDQ 407 DS L + +Y Y + + +++ + F+N + ++ Sbjct: 548 TDLDS---IISQITSLCQDIVFYTIGYGSVNEKY-------LKNITNKFNN----TVGEK 593 Query: 408 DDIYPVFRELFHKQNATA 425 +I +LF+ +N Sbjct: 594 KEINGKPVDLFYVKNTPN 611 >UniRef50_A8H5J9 LPXTG-motif cell wall anchor domain n=1 Tax=Shewanella pealeana ATCC 700345 RepID=A8H5J9_SHEPA Length = 789 Score = 102 bits (253), Expect = 3e-20, Method: Composition-based stats. Identities = 27/206 (13%), Positives = 55/206 (26%), Gaps = 26/206 (12%) Query: 240 EKRPDPSSQAVMFCLMDVSGSMDQSTKDMAKRFYILLYLFLSRTYKNVEVVYIRH----H 295 ++ S + ++D SGSM AK L T K V + Sbjct: 390 AEQQPSSIHRELILVIDTSGSMSGDAIIQAKTALKYALAGLRPTDKFNIVQFNSDVDKWS 449 Query: 296 TQAKEVD-------EHEFFYSQETGGTIVSSALKLMDEV-----------VKERYNPAQW 337 A ++ + GGT +S A+ + + + Sbjct: 450 GMAMSATPYNLAQAQNYINRLEANGGTEMSIAINAALNIETVTDKETGTELDNNDLGSNL 509 Query: 338 NIYAAQASDGDNWADDSPLCHEILAKKLLPVVRYYSYIEITRRAHQTLWREYEHLQSTFD 397 +DG A + L + L R ++ I + + L Sbjct: 510 LRQVLFITDG---AVSNESMLFELIEAQLGDSRLFTIG-IGSAPNAHFMQRAAQLGRGTY 565 Query: 398 NFAMQHIRDQDDIYPVFRELFHKQNA 423 + + + + +++ Q Sbjct: 566 TYIGKLDEVNQKVVSLLKKIEKPQVT 591 >UniRef50_A0EHG0 Chromosome undetermined scaffold_97, whole genome shotgun sequence n=2 Tax=Paramecium tetraurelia RepID=A0EHG0_PARTE Length = 533 Score = 101 bits (252), Expect = 4e-20, Method: Composition-based stats. Identities = 27/207 (13%), Positives = 66/207 (31%), Gaps = 20/207 (9%) Query: 224 ERVPFIDTFDLRYKNYEKRPDPSSQAV-----MFCLMDVSGSMDQSTKDMAKRFYILLYL 278 + ++ + + V + CL+D SGSM + ++ + Sbjct: 92 NQQNAALMITIKSNDILLINQRGQECVRQGVDLVCLIDHSGSMQGEKIKLVRKTLKQMLT 151 Query: 279 FLSRTYKNVEVVY---IRHHTQAKEVDEH-------EFFYSQETGGTIVSSALKLMDEVV 328 FL + +++ + T+ V + Q GGT + + +K+ ++ Sbjct: 152 FLQPCDRLCLIMFDCKVYRLTRLMRVTQENVQKFRVAISSLQARGGTDIGNGMKMALSIL 211 Query: 329 KERYNPAQWNIYAAQASDGDNWADDSPLCHEILAK-KLLPVVRYYSYIEITRRAHQTLWR 387 K R + SDG + + + L + + ++ + Sbjct: 212 KHRKYKNPVS-AIFLLSDGVDEGAE-ERVRDDLIQYNIRDSFTIKTFGFGR-DCCPKIMS 268 Query: 388 EYEHLQSTFDNFAMQHIRDQDDIYPVF 414 E H + F ++ + D+ + Sbjct: 269 EIAHYKEGQFYFVP-NLTNIDECFAEA 294 >UniRef50_D1HHA4 Whole genome shotgun sequence of line PN40024, scaffold_125.assembly12x (Fragment) n=1 Tax=Vitis vinifera RepID=D1HHA4_VITVI Length = 630 Score = 100 bits (250), Expect = 7e-20, Method: Composition-based stats. Identities = 33/220 (15%), Positives = 60/220 (27%), Gaps = 26/220 (11%) Query: 203 PAQLLEEERLRKEIAELRAKIERVPFIDTFDLRYKNY--EKRPDPSSQAVMFCLMDVSGS 260 L + E+ + A F ++ + + + ++DVSGS Sbjct: 157 SRPQLVTVKALPELPAISASESFRTFAVLVGIKAPALLDDAHLLDRAPIDLVAVLDVSGS 216 Query: 261 MDQSTKDMAKRFYILLYLFLSRTYKNVEVVYIRH----------HTQAKEVDEHEFFYSQ 310 M S + KR L L + + V + +E Sbjct: 217 MAGSKLSLLKRAVCFLIQNLGPSDRLSIVSFSSTARRIFPLRRMSDNGREAAGLAINSLT 276 Query: 311 ETGGTIVSSALKLMDEVVKERYNPAQWNIYAAQASDG-DNWADDS------PLCHEILAK 363 +GGT + LK V++ER SDG D + D+ C + Sbjct: 277 SSGGTNIVEGLKKGVRVLEERSEQNPVASIIL-LSDGKDTYNCDNVNRRQTSHCASSNPR 335 Query: 364 KLLPVV---RYYSYIEITRRAHQTLWREYEHLQSTFDNFA 400 + + + + T +F Sbjct: 336 QGRQAIIPVHTFGFG---SDHDSTAMHAISDESGGTFSFI 372 >UniRef50_B8G546 von Willebrand factor type A n=6 Tax=Chloroflexi (class) RepID=B8G546_CHLAD Length = 418 Score = 100 bits (248), Expect = 1e-19, Method: Composition-based stats. Identities = 30/196 (15%), Positives = 63/196 (32%), Gaps = 19/196 (9%) Query: 240 EKRPDPSSQAVMFCLMDVSGSMDQSTKDMAKRFYILLYLFLSRTYKNVEVVY-------- 291 P + + ++D SGSM + + K + L V++ Sbjct: 34 PASPTSALPLNLCFVLDRSGSMQGAKLESMKAATRRVIELLRPHDVAAIVIFDDTVQTLI 93 Query: 292 IRHHTQAKEVDEHEFFYSQETGGTIVSSALKLMDEVVKERYNPAQWNIYAAQASDGDNWA 351 + E GGT +S ++ +++ P + + +DG W Sbjct: 94 PATPVGDRSALLAAVETITEAGGTAMSLGMQAAQTELQKHLGPDRISRMLL-LTDGQTWG 152 Query: 352 DDSPLCHEILAKKLLPVVRYYSYIEITRRAHQTLWREYEHLQSTFDNFAMQHIRDQDDIY 411 D P+C + LA+ L + + + ++ L + + ++ I D I Sbjct: 153 -DEPICRD-LARTLGQAGVRITALGLGTEWNEQLLDDIAAASDGYSDY----IADPAQI- 205 Query: 412 PVFRELFHKQNATAKG 427 F + A+ Sbjct: 206 ---ETFFQQAVKEAQA 218 >UniRef50_A1ZFT4 Von Willebrand factor, type A n=1 Tax=Microscilla marina ATCC 23134 RepID=A1ZFT4_9SPHI Length = 827 Score = 100 bits (248), Expect = 1e-19, Method: Composition-based stats. Identities = 31/191 (16%), Positives = 58/191 (30%), Gaps = 22/191 (11%) Query: 237 KNYEKRPDPSSQAVMFCLMDVSGSMDQSTKDMAKRFYILLYLFLSRTYKNVEVVYIRH-- 294 K + P + ++DVSGSM ++KR L L K +++ Sbjct: 313 KAPKNSQIPPRE--YVFIVDVSGSMHGFPLSVSKRLLKNLIGKLRPKDKFNVMLFESSNQ 370 Query: 295 --HTQAKEVDEHE-------FFYSQETGGTIVSSALKLMDEVVKERYNPAQWNIYAAQAS 345 ++ E + + GGT + ALK + + ++ + Sbjct: 371 MMSPESMEATQANIQKAFGVIDQQRGGGGTRLLPALKKALAFKQTK----DYSRSFVVVT 426 Query: 346 DGDNWADDSPLCHEILAKKLLPVVRYYSYIEITRRAHQTLWREYEHLQSTFDNFAMQHIR 405 DG L + L +++ I ++ L F + H Sbjct: 427 DG---YVTVEKEAFDLIRNNLNRANLFAFG-IGSSVNRFLIEGMARAGMGEP-FIVTHGT 481 Query: 406 DQDDIYPVFRE 416 + D FR Sbjct: 482 EADVKAEKFRN 492 >UniRef50_D2V0V6 Predicted protein n=1 Tax=Naegleria gruberi RepID=D2V0V6_NAEGR Length = 502 Score = 99.8 bits (247), Expect = 2e-19, Method: Composition-based stats. Identities = 37/265 (13%), Positives = 82/265 (30%), Gaps = 35/265 (13%) Query: 182 AGKRRELHALEENLAIISNSEPAQLLEEERLRKEIAELRAKIERVPFIDTFDLRYKNYEK 241 L E L+ + ++ + ++ + E +PF+ L + Sbjct: 2 KVSESALQKFETLLSNFAPQSVNLSVQIDAGHLPTDQIGVQCE-IPFLVRL-LSGNLPPQ 59 Query: 242 RPDPSSQAVM-----FCL-MDVSGSMDQSTKDMAKRFYI---------LLYLFLSRTYKN 286 + + V+ CL +D+SGSMD+ K+ +K + L+ FL+ Sbjct: 60 EEEAETTNVLKTPVNICLVLDISGSMDEPLKNRSKGSKLTACKSAIRELVTNFLTYKDTI 119 Query: 287 VEVVYIRHHTQA------KEVDEHEFFYSQETGGTIVSSALKLMDEVVKERYNPAQWNIY 340 + Y + V+ ++ G T ++SAL +++ P Sbjct: 120 HLITYSDSPKTVFTEKNKESVNLNDIDKISTEGSTNIASALHSAVDLLHNSNAPG--TKL 177 Query: 341 AAQASDGD-NWADDS--------PLCHEILAKKLLPVVRYYSYIEITRRAHQTLWREYEH 391 A SDG N + + + ++ + SY + + + Sbjct: 178 IAFFSDGQCNVGETNLNIFGSGLLKKLKDYSEGKDDQIHISSYG-VGSDYDELWLQAIAR 236 Query: 392 LQSTFDNFAMQHIRDQDDIYPVFRE 416 + +D ++ Sbjct: 237 TGKGEYYYLEDETYAKDAFERSLKK 261 >UniRef50_A0DZ93 Chromosome undetermined scaffold_7, whole genome shotgun sequence n=1 Tax=Paramecium tetraurelia RepID=A0DZ93_PARTE Length = 522 Score = 99.8 bits (247), Expect = 2e-19, Method: Composition-based stats. Identities = 33/263 (12%), Positives = 83/263 (31%), Gaps = 37/263 (14%) Query: 164 ISVVRSLQNSLARRTAMTAGKRRELHALEENLAIISNSEPAQLLEEERLRKEIAELRAKI 223 I VV + +++ R++ ++ + L++N+ + + + + + Sbjct: 47 IDVVITNESNYGRKSLSQNYMKQANYVLQDNVELKLSYSGLPTQGTQAVLLSV------- 99 Query: 224 ERVPFIDTFDLRYKNYEKRPDPSSQAVMFCLMDVSGSMDQSTKDMAKRFYILLYLFLSRT 283 +R + CL+D SGSM + K+ L L Sbjct: 100 --QTKNQAITIR-----------QGIDLICLIDHSGSMSGEKMHLVKKSLKHLLKMLQPN 146 Query: 284 YKNVEVVYIRHH---TQAKEVDEH-------EFFYSQETGGTIVSSALKLMDEVVKERYN 333 + + + + T+ + + G T + +A+K+ ++K R Sbjct: 147 DRLCLIEFDDQNYRLTRLMRATQENMYKFLIAIDTIEANGATDIGNAMKMALSILKHRRF 206 Query: 334 PAQWNIYAAQASDGDNWADDSPLCHEILAKKLLP--VVRYYSYIEITRRAHQTLWREYEH 391 SDG++ + ++I +K + + + + + E H Sbjct: 207 KNPIA-SIFLLSDGEDEGAAGRVWNDIQSKNIKEPFTINTFGFGRDCCP---KIMSEIAH 262 Query: 392 LQSTFDNFAMQHIRDQDDIYPVF 414 + + I D+ + Sbjct: 263 FKEGQFYYI-SEISKIDECFFEA 284 >UniRef50_Q235T9 von Willebrand factor type A domain containing protein n=5 Tax=Tetrahymena thermophila RepID=Q235T9_TETTH Length = 703 Score = 99.8 bits (247), Expect = 2e-19, Method: Composition-based stats. Identities = 33/206 (16%), Positives = 67/206 (32%), Gaps = 26/206 (12%) Query: 233 DLRYKNYEKRPDPSSQAVMFCLMDVSGSMDQ-STKDMAKRFYILLYLFLSRTYKNVEVVY 291 D+ K+ P + C++D SGSM+ S + K + L L+ + + + Sbjct: 197 DMEVKSNPLEGRP--NLDLICVIDNSGSMNDFSKIENVKNTILQLLEMLNENDRLSLITF 254 Query: 292 IRH----------HTQAKEVDEHEFFYSQETGGTIVSSALKLMDEVVKERYNPAQWNIYA 341 + Q K+ + + GGT + +++ ++++ R + Sbjct: 255 NTKAKQLCGLKNVNNQNKKSLQTITKSIKADGGTDIIRGIEIAFQILQSRKQKNSVS-SI 313 Query: 342 AQASDG-DNWADDSPLCHEILAKKLL--PVVRYYSYIEITRRAHQTLWREYEHLQSTFDN 398 SDG DN AD K L +S+ L ++ ++ Sbjct: 314 FLLSDGQDNLADAGIKNLLKTTYKQLQEESFTIHSFGFGN-DHDGPLMQKIAQIKDGSFY 372 Query: 399 FAMQHIRDQDDIYPVFRELFHKQNAT 424 F +++D F Sbjct: 373 FV-----EKNDQVDE---FFIDALGG 390 >UniRef50_Q10RY0 Os03g0142500 protein n=2 Tax=Oryza sativa RepID=Q10RY0_ORYSJ Length = 694 Score = 99.5 bits (246), Expect = 2e-19, Method: Composition-based stats. Identities = 46/319 (14%), Positives = 92/319 (28%), Gaps = 38/319 (11%) Query: 133 ALPNLKQNQQRQLTEYKTHRAGYTANGVPANISVVR----SLQNSLARRTAMTAGKRREL 188 LP + Q + + +SVVR +L ++ A+ + Sbjct: 123 ELP-FQGTQPGDTAYGRARVSTVNWPQDEGQMSVVRRLSHGYSGNLQQQLAVFRTPEASI 181 Query: 189 HALEENLAIISN--SEPAQLLEEERLRKEIAELRAKIERVPFIDTFDLRYKNYEKRPDPS 246 +EN+ S + + ++ + + + K + S Sbjct: 182 FNDDENIDPQSETVDDHNAVTNSVEIKTYSEFPAIQKSERRKVFAILIHLKAPKSLDSVS 241 Query: 247 SQAVM--FCLMDVSGSMDQSTKDMAKRFYILLYLFLSRTYKNVEVVYIRH---------- 294 S+A + ++DVSGSM + KR + L + V + Sbjct: 242 SRAPLDLVTVLDVSGSMSGIKLSLLKRAMSFVIQTLGPNDRLSVVAFSSTAQRLFPLRRM 301 Query: 295 HTQAKEVDEHEFFYSQETGGTIVSSALKLMDEVVKERYNPAQWNIYAAQASDGDNWAD-- 352 ++ +GGT ++ ALK +VVK+R + SDG + Sbjct: 302 TLTGRQQALQAISSLVASGGTNIADALKKGAKVVKDRRRKNPVSSIIL-LSDGQDTHSFL 360 Query: 353 -------DSPLCHEILAKKLLPVVRYYSYIEITRRAHQTLWREYEHLQSTFDNFAMQHIR 405 S L + V+ +++ T + +F Sbjct: 361 SGEADINYSILVPPSILPGTSHHVQIHTFGFGT-DHDSAAMHAIAETSNGTFSFI----- 414 Query: 406 DQDDIYPVFRELFHKQNAT 424 D ++ F + Sbjct: 415 ---DAEGSIQDAFAQCMGG 430 >UniRef50_B5JSY7 Inter-alpha-trypsin inhibitor domain protein n=1 Tax=gamma proteobacterium HTCC5015 RepID=B5JSY7_9GAMM Length = 670 Score = 99.5 bits (246), Expect = 2e-19, Method: Composition-based stats. Identities = 29/187 (15%), Positives = 58/187 (31%), Gaps = 23/187 (12%) Query: 245 PSSQAVMFCLMDVSGSMDQSTKDMAKRFYILLYLFLSRTYKNVEVVYIRHHTQAKEVDEH 304 P + ++D SGSM AK+ LS + V + H++ Sbjct: 316 PRMPREVVFVIDTSGSMAGQRMYHAKQALSQAVERLSPDDRFNVVEFNNQHSRLFSSMRS 375 Query: 305 E-----------FFYSQETGGTIVSSALKLMDEVVKERYNPAQWNIYAAQASDGDNWADD 353 Q GGT++ A++ V R +PA + +D + Sbjct: 376 ASAINVKQALNWVGRLQGGGGTMMLPAVEDALSV---RSDPA-YLRQVILITD---ASVG 428 Query: 354 SPLCHEILAKKLLPVVRYYSYIEITRRAHQTLWREYEHLQSTFDNFAMQHIRDQDDIYPV 413 + + ++ R ++ + L R+ + + I ++ Sbjct: 429 NEAEILRVVERQRKGARLFTVGIGVS-PNSYLLRKAAQVGQGDYVY----IASGQEVKAR 483 Query: 414 FRELFHK 420 + LF K Sbjct: 484 MQRLFAK 490 >UniRef50_A0E9B3 Chromosome undetermined scaffold_84, whole genome shotgun sequence n=1 Tax=Paramecium tetraurelia RepID=A0E9B3_PARTE Length = 603 Score = 99.5 bits (246), Expect = 2e-19, Method: Composition-based stats. Identities = 28/182 (15%), Positives = 59/182 (32%), Gaps = 23/182 (12%) Query: 243 PDPSSQAVMFCLMDVSGSMDQSTKDMAKRFYILLYLFLSRTYKNVEVV----------YI 292 P + C++DVSGSM ++ K L L + +V +I Sbjct: 116 DRPPID--LVCVVDVSGSMIGRKINLVKDSLRYLMKILGPEDRICIIVFTTVAHIVTSFI 173 Query: 293 RHHTQAKEVDEHEFFYSQETGGTIVSSALKLMDEVVKERYNPAQWNIYAAQASDG-DNWA 351 R+ + K + + + T +S + ++K R + SDG D++ Sbjct: 174 RNTQENKPLLKKAILELKGLASTNISDGMNKALWMLKNRKYKNPVSC-IFLLSDGQDDYK 232 Query: 352 DDSPLCHEIL----AKKLLPVVRYYSYIEITRRAHQTLWREYEHLQSTFDNFAMQHIRDQ 407 + L ++ V+ + Y + + + + + +I Sbjct: 233 GAEQRVFDQLQLLKIEEKF-VIHTFGYGQ---DHDAYVMNQIAKYREGNFYYI-DNINKA 287 Query: 408 DD 409 D Sbjct: 288 SD 289 >UniRef50_A5B5Z1 Putative uncharacterized protein n=2 Tax=Vitis vinifera RepID=A5B5Z1_VITVI Length = 686 Score = 98.3 bits (243), Expect = 5e-19, Method: Composition-based stats. Identities = 36/278 (12%), Positives = 73/278 (26%), Gaps = 51/278 (18%) Query: 172 NSLARRTAMTAGKRRELHALEENLAIISNSEPAQLLEEERLRKEIAELRAKIERV----- 226 +L + + R + +++P L+ R + + + + Sbjct: 117 RNLQPXSPQSPEPRHFSDDEPLVVNSAESTDPTSLVSLSRPQLVTVKALPEWPAISASES 176 Query: 227 --PFIDTFDLRYKNY--EKRPDPSSQAVMFCLMDVSGSMDQSTKDMAKRFYILLYLFLSR 282 F ++ + + + ++DVSGSM S + KR L L Sbjct: 177 FRTFAVLVGIKAPALLDDAHLLDRAPIDLVAVLDVSGSMAGSKLSLLKRAVCFLIQNLGP 236 Query: 283 TYKNVEVVYIRH----------HTQAKEVDEHEFFYSQETGGTIVSSALKLMDEVVKERY 332 + + V + +E +GGT + LK V++ER Sbjct: 237 SDRLSIVSFSSTARRIFPLRRMSDNGREAAGLAINSLXSSGGTNIVEGLKKGVRVLEERS 296 Query: 333 NPAQWNIYAAQASDG-DNWADDS------PLCHEILAKKLLPVVRYY------------- 372 SDG D + D+ C +++L + Sbjct: 297 EQNPVASIIL-LSDGKDTYNCDNVNRRQTSHCASSNPRQVLEYLNLLPASICPRNRESGD 355 Query: 373 ----------SYIEITRRAHQTLWREYEHLQSTFDNFA 400 ++ T +F Sbjct: 356 EGRQAIIPVHTFGF-GSDHDSTAMHAISDESGGTFSFI 392 >UniRef50_D2VKS7 von Willebrand factor type A domain-containing protein n=2 Tax=Naegleria gruberi RepID=D2VKS7_NAEGR Length = 923 Score = 98.3 bits (243), Expect = 5e-19, Method: Composition-based stats. Identities = 39/339 (11%), Positives = 93/339 (27%), Gaps = 48/339 (14%) Query: 126 DLLFEDLALPNLKQNQQRQLTEYKTHRAGYTANGV------PANISVVRSLQNSLARRTA 179 + L L + +++ + ++ A + + L+ + Sbjct: 546 ETLMNALEIVTVEEEEDVHSENTDQDQSTILAELTINESSSSNSNLNQEDTISILSEKQL 605 Query: 180 MTAGKRRELHALEENLAIISNSEPAQLLEEERLRKEIAELR---------AKIERVPFID 230 + + + + Q ++E+ + E +PF Sbjct: 606 TSPPTPIDHAVIPSTSNTSLTNSSQQDNQQEQPTIMNTRSTSNKIVFNGHCEYEAIPFET 665 Query: 231 TFDL------RYKNYEKRPDPSSQAV-MFCLMDVSGSMDQSTKDMAKRFYILLYLFLSRT 283 DL + +E++ + V + ++D SGSM DM K + L Sbjct: 666 ECDLYCMATLQGPCFEQQAQKERKGVDLVLVVDKSGSMAGQKLDMVKSTLSFMVDQLKEK 725 Query: 284 YKNVEVVYIRHHTQAKEVDEHEFFY----------SQETGGTIVSSALKLMDEVVKERYN 333 + V + ++ + + T +S AL +++ R Sbjct: 726 DRVAIVEFDTQVKTNLDLTKMDIEGKKKAKQVSSAISPGSCTNLSGALFTSLKLLASRQQ 785 Query: 334 PAQWNIYAAQASDG-DNWA--DDSP--LCHEILAKKLLPVVRY----YSYIEITRRAHQT 384 +DG N + + L +LL + + + T Sbjct: 786 EKNEVTSVILFTDGLANRGLISTNEILQNMQDLMDELLSTSNVTIHTFGFGQDT---DAN 842 Query: 385 LWREYEHLQSTFDNFAMQHIRDQDDIYPVFRELFHKQNA 423 + + ++ + DDI F + + Sbjct: 843 MLTSIAQKGNGLYDY----LETADDIPKAFGNVIGNLVS 877 >UniRef50_A0C9G5 Chromosome undetermined scaffold_16, whole genome shotgun sequence n=2 Tax=Paramecium tetraurelia RepID=A0C9G5_PARTE Length = 648 Score = 98.3 bits (243), Expect = 5e-19, Method: Composition-based stats. Identities = 20/168 (11%), Positives = 54/168 (32%), Gaps = 20/168 (11%) Query: 246 SSQAVMFCLMDVSGSMDQSTKDMAKRFYILLYLFLSRTYKNVEVVYIRHHTQAKEVD--- 302 + + C++D SGSM+ ++ + L FLS + + + A+ + Sbjct: 225 EAGIDLLCVIDKSGSMEGKKIASVQQSLVQLLDFLSEKDRLCLITF---DGSAQRLTPLK 281 Query: 303 ----------EHEFFYSQETGGTIVSSALKLMDEVVKERYNPAQWNIYAAQASDGDNWAD 352 + + + +G T ++ ++ +++R Q SDG + Sbjct: 282 TLTQDNKNYFKKAIYSIRASGQTNIAKGTEIAFNQIQQRKMKNQVT-SIFLLSDGQDQGA 340 Query: 353 DSPLCHEILAKKLLPVVRYYSYIEITRRAHQTLWREYEHLQSTFDNFA 400 + + + + + + Y L + + + Sbjct: 341 AEYIQRQKDVVEDIVTIHSFGYG---SDHDAALMSKICKVGQGSFYYI 385 >UniRef50_A0P2E4 Von Willebrand factor type A like domain n=1 Tax=Labrenzia aggregata IAM 12614 RepID=A0P2E4_9RHOB Length = 772 Score = 97.9 bits (242), Expect = 6e-19, Method: Composition-based stats. Identities = 22/161 (13%), Positives = 49/161 (30%), Gaps = 19/161 (11%) Query: 251 MFCLMDVSGSMDQSTKDMAKRFYILLYLFLSRTYKNVEVV-------YIRHHTQAKEVDE 303 + ++D SGSM + +K F L + + A E ++ Sbjct: 366 LVFVLDTSGSMSGQPIEASKTFMTAAIKALRPDDYFRILHFSNDTSQFAGQAVLATERNK 425 Query: 304 HEFFY----SQETGGTIVSSALKLMDEVVKERYNPAQWNIYAAQASDGDNWADDSPLCHE 359 + GGT ++ A+ + + P +DG + D + Sbjct: 426 QKALKFVADLSAGGGTEINQAVNAAFDQAQ----PDNTTRIVVFLTDG--YIGDEATVIK 479 Query: 360 ILAKKLLPVVRYYSYIEITRRAHQTLWREYEHLQSTFDNFA 400 +A ++ R Y++ ++ L + + Sbjct: 480 SIANRIGK-ARIYAFGVGNS-VNRFLLDAMATEGRGYARYV 518 >UniRef50_UPI00017450FB von Willebrand factor type A domain protein n=1 Tax=Verrucomicrobium spinosum DSM 4136 RepID=UPI00017450FB Length = 424 Score = 97.9 bits (242), Expect = 7e-19, Method: Composition-based stats. Identities = 30/181 (16%), Positives = 47/181 (25%), Gaps = 13/181 (7%) Query: 230 DTFDLRYKNYEKRPDPSSQAVMFCLMDVSGSMDQSTKDMAKRFYILLYLFLSRTYKNVEV 289 L + E + + ++D SGSM A+ L V Sbjct: 22 LKVGLTGQELEASAKR-APVNVTIVIDKSGSMGGDKMVHAREAAKQALDRLGAGDMVSVV 80 Query: 290 VY--------IRHHTQAKEVDEHEFFYSQETGGTIVSSALKLMDEVVKERYNPAQWNIYA 341 Y ++ + Q G T + S + E ++ P Q N Sbjct: 81 AYDDAVSLISPATDLTDRDRVKAAIDRIQAGGSTALFSGISKGAEELRRNKRPNQVNRVV 140 Query: 342 AQASDG-DNWADDSPLCHEILAKKL-LPVVRYYSYIEITRRAHQTLWREYEHLQSTFDNF 399 SDG N SP L L + + ++ L E F Sbjct: 141 L-LSDGMANVGPSSPQDLGRLGASLAKEGITVTTLGLGLG-YNEDLMTELALRSDGNHAF 198 Query: 400 A 400 Sbjct: 199 I 199 >UniRef50_Q67LZ3 Putative uncharacterized protein n=1 Tax=Symbiobacterium thermophilum RepID=Q67LZ3_SYMTH Length = 414 Score = 97.5 bits (241), Expect = 7e-19, Method: Composition-based stats. Identities = 23/200 (11%), Positives = 46/200 (23%), Gaps = 18/200 (9%) Query: 237 KNYEKRPDPSSQAVMFCLMDVSGSMDQSTKDMAKRFYILLYLFLSRTYKNVEVVY----- 291 + P + ++D SGSM + K+ L ++ + V Y Sbjct: 33 RMPAPEGRPPLN--LAAVVDRSGSMAGAALYFTKQALRFLVDQMAEEDRLAIVTYDDQVH 90 Query: 292 ---IRHHTQAKEVDEHEFFYSQETGGTIVSSALKLMDEVVKERYNPAQWNIYAAQASDG- 347 K+ G T +S L + ++ P + + +DG Sbjct: 91 VPFPSQPVVQKDAVRLLVDGITAGGTTNLSGGLATGMQQIRPHAGPGRVSRVLLM-TDGL 149 Query: 348 DNWADDSPLCH---EILAKKLLPVVRYYSYIEITRRAHQTLWREYEHLQSTFDNFAMQHI 404 N P ++ V + L ++ Sbjct: 150 ANVGVTDPDVLAGWARAWREKGLAVSTMGVG---PHFSEDLLVALAEAGGGNFHYIANPD 206 Query: 405 RDQDDIYPVFRELFHKQNAT 424 + L Sbjct: 207 QIPRIFQEELHGLLQVAVQG 226 >UniRef50_Q3IHK0 Putative uncharacterized protein n=2 Tax=Alteromonadales RepID=Q3IHK0_PSEHT Length = 664 Score = 97.5 bits (241), Expect = 8e-19, Method: Composition-based stats. Identities = 28/185 (15%), Positives = 52/185 (28%), Gaps = 23/185 (12%) Query: 252 FCLMDVSGSMDQSTKDMAKRFYILLYLFLSRTYKNVEVVY-----------IRHHTQAKE 300 ++D SGSM + + AK L + + + Sbjct: 325 VFVVDTSGSMHGQSMEQAKNALFYALSLLDSNDSFNIIGFDNVVTLMSDKPLVASGFNLR 384 Query: 301 VDEHEFFYSQETGGTIVSSALKLMDEVVKERYNPAQWNIYAAQASDGDNWADDSPLCHEI 360 E + Q GGT + AL + + + +DG + + Sbjct: 385 RAERFIYGLQADGGTEIQGALDAVLD----GSQFDGFVRQVIFLTDG---SVSNEDALFK 437 Query: 361 LAKKLLPVVRYYSYIEITRRAHQTLWREYEHLQSTFDNFAMQHIRDQDDIYPVFRELFHK 420 + L R ++ I + R + F I ++ P ++LF K Sbjct: 438 SIQAKLGDSRLFTVG-IGSAPNSFFMRRAADVGKGSFTF----IGSTSEVQPKMQQLFDK 492 Query: 421 QNATA 425 A Sbjct: 493 LAHPA 497 >UniRef50_UPI0001744662 Vault protein inter-alpha-trypsin domain protein n=1 Tax=Verrucomicrobium spinosum DSM 4136 RepID=UPI0001744662 Length = 679 Score = 97.1 bits (240), Expect = 1e-18, Method: Composition-based stats. Identities = 34/236 (14%), Positives = 67/236 (28%), Gaps = 23/236 (9%) Query: 198 ISNSEPAQLLEEERLRKEIAELRAKIERVPFIDTFDLRYKNYEKRPDPSSQAVMFCLMDV 257 + + + L + + F+ K +E P ++DV Sbjct: 268 LRYQLSGREVATGLLLHQAPAGSSPEAESFFLLNVQPPAK-WEAGQTPPRD--YLFVLDV 324 Query: 258 SGSMDQSTKDMAKRFYILLYLFLSRTYKNVEVVYIRHHT-----------QAKEVDEHEF 306 SGSM+ + +KR L L+ + + + + + Sbjct: 325 SGSMNGFPIETSKRLMSDLLKGLNPGDTFNILHFASDSAVLSPKPLAATPENIHLATKDL 384 Query: 307 FYSQETGGTIVSSALKLMDEVVKERYNPAQWNIYAAQASDGDNWADDSPLCHEILAKKLL 366 + GGT + AL+ +E + +DG L +K L Sbjct: 385 SRHRGNGGTELLPALQRALATPRE----VGVSRSIVILTDG---YVTIEKEAFRLVRKEL 437 Query: 367 PVVRYYSYIEITRRAHQTLWREYEHLQSTFDNFAMQHIRDQDDIYPVFRELFHKQN 422 +++ I ++ L H F + +D FRE + Sbjct: 438 QNANVFTFG-IGTAVNRWLIEGLAHAGQGDP-FVVLSEKDAAAAAERFREYISRPV 491 >UniRef50_A9V370 Predicted protein n=1 Tax=Monosiga brevicollis RepID=A9V370_MONBE Length = 471 Score = 96.4 bits (238), Expect = 2e-18, Method: Composition-based stats. Identities = 37/230 (16%), Positives = 69/230 (30%), Gaps = 26/230 (11%) Query: 199 SNSEPAQ-LLEEERLRKEIAELRAKIERVPFIDTFDLRYKNYEKRPDPSSQAVMFCLMDV 257 + PA L+ L + + L ++E P+ + ++DV Sbjct: 12 TAEAPAPLQLDVRPLWQYAEIGARESSAYISCR---LTAPDFEPVERPAID--LVAVIDV 66 Query: 258 SGSMDQSTKDMAKRFYILLYLFLSRTYKNVEVVYIRH----------HTQAKEVDEHEFF 307 SGSM M + L L T + V + T KE + Sbjct: 67 SGSMAGQKLKMVQSTLEFLMRNLKDTDRFALVTFDSDVKTVFDLRPMTTAHKEACLADVQ 126 Query: 308 YSQETGGTIVSSALKLMDEVVKERYNPAQWNIYAAQASDG-DNWADDSPLCHEILAKKLL 366 + T +S L E++++R +DG N + L+ Sbjct: 127 KLRAGSCTNLSGGLFRGVELMQQRGATKGAVSSILLMTDGIANEGVRDKDDMCRALRGLM 186 Query: 367 ---PVVRYYSYIEITRRAHQTLWREYEHLQSTFDNFAMQHIRDQDDIYPV 413 P Y++ ++ + R+ + F + +DI P Sbjct: 187 GPAPDYTIYTFGYGK-DHNENMLRQLSETGNGMYYFI-----ESNDIIPE 230 >UniRef50_A9G8C3 Putative uncharacterized protein n=2 Tax=Sorangium cellulosum 'So ce 56' RepID=A9G8C3_SORC5 Length = 907 Score = 96.4 bits (238), Expect = 2e-18, Method: Composition-based stats. Identities = 37/258 (14%), Positives = 63/258 (24%), Gaps = 18/258 (6%) Query: 178 TAMTAGKRRELHALEENLAIISNSEPAQLLEEERLRKEIAELRAKIERVPFIDTFDLRYK 237 A E + + A + P L + LR ++ F LR Sbjct: 433 LARGVVPAAERELVGDVAARYAPEVPLALDKALGLRADLERAALGPGGGAFHLRLALRSA 492 Query: 238 NYEKRPDPSSQAVMFCLMDVSGSMDQSTKDMAKRFYILLYLFLSRTYKNVEVVYIRHHTQ 297 P + +D SGSM + D A+R L L+ + Sbjct: 493 AAAAAARPHLSVHLV--LDTSGSMAGAPIDSARRAAQALVDRLAPADDFSLTTFSSDAEV 550 Query: 298 AKE---------VDEHEFFYSQETGGTIVSSALKLMDEVVKERYNPAQWNIYAAQASDGD 348 E +E GGT + + L L P SDG Sbjct: 551 VIEDGPVGPRRAAIRRAIEGLREGGGTNIGAGLSLGYAQASRPGIPEDAVRVVLLVSDGR 610 Query: 349 NWADDSPLCHEI--LAKKLLPVVRYYSYIEITRRAHQTLWREYEHLQSTFDNFAMQHIRD 406 + + ++ + + L + + +R Sbjct: 611 ATSGLTHSERLAWLALDAFQRGIQTSALG-LGDDFDGQLMSAIASDGAGGYYY----LRH 665 Query: 407 QDDIYPVFRELFHKQNAT 424 + I P K+ Sbjct: 666 PEQIAPALSTELDKRLDP 683 >UniRef50_D2LQW0 von Willebrand factor type A n=1 Tax=Bacillus cellulosilyticus DSM 2522 RepID=D2LQW0_BACS4 Length = 282 Score = 96.0 bits (237), Expect = 2e-18, Method: Composition-based stats. Identities = 30/217 (13%), Positives = 58/217 (26%), Gaps = 13/217 (5%) Query: 212 LRKEIAELRAKIERVPFIDTFDLRYKNYEKRPDPSSQAVMFCLMDVSGSMDQSTKDMAKR 271 + + K + ++ +L + + L+D SGSM K Sbjct: 9 FAHQYENVPCKGKEAAYLL-VELTGAKVKHTERSPINLSL--LLDRSGSMSGEPLRYCKE 65 Query: 272 FYILLYLFLSRTYKNVEVVY------IRHHTQA--KEVDEHEFFYSQETGGTIVSSALKL 323 + L+ VV+ I + K++ + + G T +S L Sbjct: 66 ACNFVINQLTDKDILSVVVFDDQVETIIEPQKVTHKDLLKEYIQRIETRGITNLSGGLIQ 125 Query: 324 MDEVVKERYNPAQWNIYAAQASDGD-NWADDSPLCHEILAKKLLPVVRYYSYIEITRRAH 382 + V ++ N SDG N LA S + ++ Sbjct: 126 GCQHVLKQEVKNYVNRVIL-LSDGQANAGITDKEALVKLADDYQSAGLVISTLGVSEHFD 184 Query: 383 QTLWREYEHLQSTFDNFAMQHIRDQDDIYPVFRELFH 419 + L +F + L + Sbjct: 185 EELLEGVADSGRGNFHFINEVENIPSIFEQELDGLLN 221 >UniRef50_B9XQ17 Vault protein inter-alpha-trypsin domain protein n=1 Tax=bacterium Ellin514 RepID=B9XQ17_9BACT Length = 806 Score = 96.0 bits (237), Expect = 2e-18, Method: Composition-based stats. Identities = 27/202 (13%), Positives = 57/202 (28%), Gaps = 22/202 (10%) Query: 239 YEKRPDPSSQAVMFCLMDVSGSMDQSTKDMAKRFYILLYLFLSRTYKNVEVVYIRH---- 294 + + + ++D SGSM + AK+ L+ + + + Sbjct: 302 VDAKAKQIVSKDVVFVLDTSGSMSGKKMEQAKKALQFCVESLNDGDRFEIIRFSTESEPL 361 Query: 295 -------HTQAKEVDEHEFFYSQETGGTIVSSALKLMDEVVKERYNPAQWNIYAAQASDG 347 + +E + GGT + ALK + + P +DG Sbjct: 362 FDKLAAVSKENREKAGDFIKNLKAMGGTAIDEALKKALSLESKEGRP----FVVVFLTDG 417 Query: 348 DNW--ADDSPLCHEILAKKLLPVVRYYSYIEITRRAHQTLWREYEHLQSTFDNFAMQHIR 405 D + + ++ R + + I + L F + + Sbjct: 418 LPTVGTTDEDQILKGMQERNKEKRRIFCFG-IGTDVNTHLLDRIAEETRAFSQYVLP--- 473 Query: 406 DQDDIYPVFRELFHKQNATAKG 427 ++D+ F K N Sbjct: 474 -EEDLEVKVSSFFSKINEPVLA 494 >UniRef50_C4WI90 Poly [ADP-ribose] polymerase 4 n=1 Tax=Ochrobactrum intermedium LMG 3301 RepID=C4WI90_9RHIZ Length = 777 Score = 96.0 bits (237), Expect = 3e-18, Method: Composition-based stats. Identities = 30/198 (15%), Positives = 52/198 (26%), Gaps = 23/198 (11%) Query: 239 YEKRPDPSSQAVMFCLMDVSGSMDQSTKDMAKRFYILLYLFLSRTYKNVEVVYIRHHTQA 298 +Q + ++D SGSM ++ + AK L + + + T+ Sbjct: 369 PAVASAKKAQREVVFVIDNSGSMGGTSIEQAKASLDYALSHLQPGDRFNVIRFDDTLTRF 428 Query: 299 KEVDEHE-----------FFYSQETGGTIVSSALKLMDEVVKERYNPAQWNIYAAQASDG 347 EV + GGT + AL + + +DG Sbjct: 429 FEVSVEASQQNIASARHFVMSLEAQGGTAMLPALHAALD----DSHQGNGLRQIVFLTDG 484 Query: 348 DNWADDSPLCHEILAKKLLPVVRYYSYIEITRRAHQTLWREYEHLQSTFDNFAMQHIRDQ 407 + + R + I + L L HI Sbjct: 485 E---ISNEQQLLDAIAARRGRSRIFMVG-IGTAPNSYLMNHAAELGRGTF----THIGSA 536 Query: 408 DDIYPVFRELFHKQNATA 425 ++ R LF K A Sbjct: 537 AEVDERMRALFDKLENPA 554 >UniRef50_A1ZUW0 Von Willebrand factor, type A n=1 Tax=Microscilla marina ATCC 23134 RepID=A1ZUW0_9SPHI Length = 425 Score = 95.6 bits (236), Expect = 3e-18, Method: Composition-based stats. Identities = 30/200 (15%), Positives = 58/200 (29%), Gaps = 23/200 (11%) Query: 236 YKNYEKRPDPSSQAVMFCLMDVSGSMDQSTKDMAKRFYILLYLFLSRTYKNVEVVY---- 291 K EK+ + +D SGSM + K+ + L V Y Sbjct: 34 GKAPEKQERIPLNISLV--VDRSGSMSGDKLNYVKKAVDFVIDNLKSDDVLSIVQYDDEI 91 Query: 292 --IRHHTQA--KEVDEHEFFYSQETGGTIVSSALKLMDEVVKERYNPAQWNIYAAQASDG 347 + + K+ + Q T +S + VK + N SDG Sbjct: 92 DVVASSAKVTNKKALHEKVKGIQARNMTNLSGGMMEGYAQVKSTQSNGYVNRVLL-LSDG 150 Query: 348 -DNWADDSPLCHEILAKKLLP--VVRYYSYIEITRRAHQTLWREYEHLQSTFDNFAMQHI 404 N +P + +A+K + ++ + ++ L F Sbjct: 151 LANAGITAPEQLQQIAQKKFREAGIALSTFG-VGSDFNEVLMTNLSEYGGANYYFI---- 205 Query: 405 RDQDDIYPVFRELFHKQNAT 424 D+ ++F ++ Sbjct: 206 ----DMPDKIPQIFAQELEG 221 >UniRef50_C5WYU9 Putative uncharacterized protein Sb01g047460 n=3 Tax=Andropogoneae RepID=C5WYU9_SORBI Length = 698 Score = 95.6 bits (236), Expect = 3e-18, Method: Composition-based stats. Identities = 30/203 (14%), Positives = 58/203 (28%), Gaps = 21/203 (10%) Query: 225 RVPFIDTFDLRYKNYEKRPDPSSQAVMFCLMDVSGSMDQSTKDMAKRFYILLYLFLSRTY 284 + F LR + + ++DVSGSM + + K + L Sbjct: 216 KEIFAILIHLRAPKSSHSASSRAPLDLVTVLDVSGSMAGTKIALLKNAMSFVIQTLGPND 275 Query: 285 KNVEVVYIRHHTQA----------KEVDEHEFFYSQETGGTIVSSALKLMDEVVKERYNP 334 + + + + ++ +GGT ++ LK +V+++R Sbjct: 276 RLSVIAFSSTARRLFPLRRMTLAGRQQALQAVSSLVASGGTNIADGLKKGAKVIEDRRLK 335 Query: 335 AQWNIYAAQASDGD---------NWADDSPLCHEILAKKLLPVVRYYSYIEITRRAHQTL 385 SDG N D S L + V+ +++ Sbjct: 336 NPV-CSIILLSDGQDTYTLPSDRNLLDYSALVPPSILPGTGHHVQIHTFGF-GSDHDSAA 393 Query: 386 WREYEHLQSTFDNFAMQHIRDQD 408 + S +F QD Sbjct: 394 MHAIAEISSGTFSFIDAEGSIQD 416 >UniRef50_A0C5K4 Chromosome undetermined scaffold_150, whole genome shotgun sequence n=1 Tax=Paramecium tetraurelia RepID=A0C5K4_PARTE Length = 611 Score = 94.8 bits (234), Expect = 5e-18, Method: Composition-based stats. Identities = 42/280 (15%), Positives = 89/280 (31%), Gaps = 21/280 (7%) Query: 134 LPNLKQNQQRQLTEYKTHRAGYTANG--VPANISVVRSLQNSLARRTAMTAGKRRELHAL 191 LP LK N + + + + + + R +++ L Sbjct: 76 LPKLKNNLYKDNQSTIQFPDQFQPQMQMLTPQMRMNYKYIQKIPRCDDDEQLIQKQSAKL 135 Query: 192 EENLAIISNSEPAQLLEEERLRKEIAELRAKIERVPFIDTFDLRYKNYEKRPDPSSQAVM 251 + + + + E LR K E +P + + + E + + Sbjct: 136 QNKNKY--DLQSSIAFEINSLRTSCKVSNYKSEYIPAMISIKTKENQTEMTER-TIGIDL 192 Query: 252 FCLMDVSGSMDQSTKDMAKRFYILLYLFLSRTYKNVEVVYIRH----------HTQAKEV 301 CL+D S SM +M K+ +LL FL + + + H + K+ Sbjct: 193 ICLIDKSMSMSGDNINMVKKSLLLLLDFLGEQDRLQIITFNEHAQRLTPLKCLTEKNKQY 252 Query: 302 DEHEFFYSQETGGTIVSSALKLMDEVVKER-YNPAQWNIYAAQASDGDNWADDSPLCHEI 360 + G T +SSA + + +KE+ Y ++ SDG + D+ Sbjct: 253 FQAVISQISAEGLTKISSATYIAFKQLKEKVYRNNVTSV--FLLSDGHD--GDALFEISD 308 Query: 361 LAKKLLPVVRYYSYIEITRRAHQTLWREYEHLQSTFDNFA 400 + + V ++ + +L++ + Sbjct: 309 QIRHVKEVFTISTFGF-GDDHDAQMMTSISNLKNGNFYYV 347 >UniRef50_C1D2W8 Putative uncharacterized protein n=1 Tax=Deinococcus deserti VCD115 RepID=C1D2W8_DEIDV Length = 418 Score = 94.8 bits (234), Expect = 5e-18, Method: Composition-based stats. Identities = 26/196 (13%), Positives = 45/196 (22%), Gaps = 20/196 (10%) Query: 238 NYEKRPDPSSQAVMFCLMDVSGSMDQSTKDMAKRFYILLYLFLSRTYKNVEVVY------ 291 + P + ++D SGSM MAK+ I + V + Sbjct: 35 TTQVSQRPPLN--LAFVIDRSGSMSGLPLQMAKQAAIAAVRQARPDDRVSVVAFDDRVDV 92 Query: 292 --IRHHTQAKEVDEHEFFYSQETGGTIVSSALKLMDEVVKERYNPAQWNIYAAQASDGD- 348 ++E + G T + V + P N SDG Sbjct: 93 IVPSQLATSREAVIQAIGTIDDRGSTNLHGGWLEGATQVAQHLTPGALNRVIL-LSDGQA 151 Query: 349 NWADDSPLCHEILAKKLLPVVRYYSYIEITRRAHQTLWREYEHLQSTFDNFAMQHIRDQD 408 N + L + I + + L + R Sbjct: 152 NVGVTDRREIARQVRGLTERGISTTTIGLGSHYDEELLLAIANAGDGNFEHVEDPSRLP- 210 Query: 409 DIYPVFRELFHKQNAT 424 F ++ Sbjct: 211 -------TFFEEELQG 219 >UniRef50_B7RYC9 Vault protein inter-alpha-trypsin n=1 Tax=marine gamma proteobacterium HTCC2148 RepID=B7RYC9_9GAMM Length = 686 Score = 94.8 bits (234), Expect = 5e-18, Method: Composition-based stats. Identities = 26/181 (14%), Positives = 54/181 (29%), Gaps = 21/181 (11%) Query: 251 MFCLMDVSGSMDQSTKDMAKRFYILLYLFLSRTYKNVEVVY-----------IRHHTQAK 299 + ++D SGSM + AK L + + + + Sbjct: 318 IVFVVDTSGSMGGVSIKQAKGSLTRALRHLGPNDRFNVIEFNSSHRALFQHAVPASHHNL 377 Query: 300 EVDEHEFFYSQETGGTIVSSALKLMDEV--VKERYNPAQWNIYAAQASDGDNWADDSPLC 357 ++ + + +GGT + AL+L ++ ++ P +DG A + Sbjct: 378 QLASEYVRHLEASGGTEMMPALQLALKLPGAQDELRPEPALRQVIFITDG---AVGNESA 434 Query: 358 HEILAKKLLPVVRYYSYIEITRRAHQTLWREYEHLQSTFDNFAMQHIRDQDDIYPVFREL 417 L R ++ I + R+ + I D ++ L Sbjct: 435 LFEHIVDSLGGSRLFTVG-IGSAPNAWFMRKAAEYGRGTFTY----IGDVAEVGEKMDAL 489 Query: 418 F 418 F Sbjct: 490 F 490 >UniRef50_A7HVH6 Vault protein inter-alpha-trypsin domain protein n=1 Tax=Parvibaculum lavamentivorans DS-1 RepID=A7HVH6_PARL1 Length = 755 Score = 94.4 bits (233), Expect = 6e-18, Method: Composition-based stats. Identities = 29/195 (14%), Positives = 53/195 (27%), Gaps = 22/195 (11%) Query: 244 DPSSQAV-MFCLMDVSGSMDQSTKDMAKRFYILLYLFLSRTYKNVEVVY----------- 291 P ++ ++D SGSM + AK + L + + Sbjct: 341 QPEAKPREAIFVIDNSGSMSGPSMVQAKESLLWALDRLKPGDTFNVIRFDDTLTVLFPDA 400 Query: 292 IRHHTQAKEVDEHEFFYSQETGGTIVSSALKLMDEVVKERYNPAQWNIYAAQASDGDNWA 351 + H + V + + GGT + AL+ ++ N +DG A Sbjct: 401 VPAHGENLAVAKKFVKSLEANGGTEMLPALRAS--LIDRNVNDGTRLRQIVFLTDG---A 455 Query: 352 DDSPLCHEILAKKLLPVVRYYSYIEITRRAHQTLWREYEHLQSTFDNFAMQHIRDQDDIY 411 + L R ++ I + HI + ++ Sbjct: 456 ISNEAELFHEITSNLGRSRLFTVG-IGSAPNSYFMTRASEAGRGTF----THIGKETEVT 510 Query: 412 PVFRELFHKQNATAK 426 ELF K Sbjct: 511 ERMAELFEKLQNPVM 525 >UniRef50_D2VDM1 Predicted protein n=1 Tax=Naegleria gruberi RepID=D2VDM1_NAEGR Length = 754 Score = 94.4 bits (233), Expect = 8e-18, Method: Composition-based stats. Identities = 38/198 (19%), Positives = 68/198 (34%), Gaps = 22/198 (11%) Query: 234 LRYKNYEKRPDPSSQAVMFCLMDVSGSMDQSTKDMAKRFYILLYLFLSRTYKNVEVVYIR 293 L++ Q + +DVSGSM D AK L+ + +VV I Sbjct: 26 LQFDLISNIQRKEKQ--IVIALDVSGSMRGQGIDQAKIAISNLFEQV---VDIPDVVLIA 80 Query: 294 HHTQAK---------EVDEHEFFYSQETGGTIVSSALKLMDEVVKERYNPAQWNIYAAQA 344 + T A+ E + Q GGT + + + ++ + +N + Sbjct: 81 YDTSAELYDLRKKPAETRQSTLEQIQAGGGTDFTCVFEAISKL--DMFNSQSE-VAILFF 137 Query: 345 SDG-DNWADDSPLCHEILAKKLLPVVRYYSYIEIT--RRAHQTLWREYEHLQSTFDNFAM 401 +DG D + E + K L + + + I L + L S F Sbjct: 138 TDGQDGSSHKREKAIEQMKKVLETKTQSFEFHTIGFTSSHDVALLTQITQLGSVQGTF-- 195 Query: 402 QHIRDQDDIYPVFRELFH 419 Q+++D ++I L Sbjct: 196 QYVKDANEINQSMENLIG 213 >UniRef50_C4XPW8 Putative uncharacterized protein n=1 Tax=Desulfovibrio magneticus RS-1 RepID=C4XPW8_DESMR Length = 439 Score = 94.1 bits (232), Expect = 8e-18, Method: Composition-based stats. Identities = 32/203 (15%), Positives = 58/203 (28%), Gaps = 17/203 (8%) Query: 237 KNYEKRPDPSSQAVMFCLMDVSGSMDQSTKDMAKRFYILLYLFLSRTYKNVEVVYIRH-- 294 E ++ + +D SGSM + AKR + L T + + Y Sbjct: 33 NTPEGETKERTRLNLALAIDRSGSMAGRPLEEAKRCASFVVDKLKNTDRVSLIAYDSSIE 92 Query: 295 ------HTQAKEVDEHEFFYSQETGGTIVSSALKLMDEVVKERYNPAQWNIYAAQASDGD 348 + K + + G T + E + +P+ + SDG Sbjct: 93 TRVPSVKVEDKAIFHRAIEGIDDGGCTNLHGGWLKGAEQISPYIDPSTISRIIL-LSDGQ 151 Query: 349 -NWADDSPLCHEILAKKLLP-VVRYYSYIEITRRAHQTLWREYEHLQSTFDNFAMQHIRD 406 N ++L V +Y + ++TL + + D Sbjct: 152 ANEGLTDEAEIFKQCRELADAGVTTSTYG-LGSNFNETLMIGMAKNGQGNSYY-GRTADD 209 Query: 407 QDDIYPV----FRELFHKQNATA 425 D + LF KQ + Sbjct: 210 LMDPFQEELSLLEALFAKQVRAS 232 >UniRef50_D2W4Q3 Predicted protein n=1 Tax=Naegleria gruberi RepID=D2W4Q3_NAEGR Length = 454 Score = 94.1 bits (232), Expect = 9e-18, Method: Composition-based stats. Identities = 38/202 (18%), Positives = 65/202 (32%), Gaps = 30/202 (14%) Query: 234 LRYKNYEKRPDPSSQAVMFCLMDVSGSMDQSTKDMAKRFYILLYLFLSRTYKNVEVVYIR 293 L++ Q + +DVSGSM D AK L+ + +VV I Sbjct: 26 LQFDLISNIQRKEKQ--IVIALDVSGSMRGQGIDQAKIAISNLFEQV---VDTPDVVLIT 80 Query: 294 HHTQAK---------EVDEHEFFYSQETGGTIVSSALKLMDEVVKERYNPAQWNI----Y 340 + T A+ E + Q GGT + + + + +N Sbjct: 81 YDTSAELYDLRKKPAETRQSTLEQIQAGGGTDFTCVFEAISNL-------DMFNRQSEVA 133 Query: 341 AAQASDG-DNWADDSPLCHEILAKKLLPVVRYYSYIEIT--RRAHQTLWREYEHLQSTFD 397 +DG D + E + K L + + + I L + L S Sbjct: 134 ILFFTDGQDGSSHKREKAIEQMKKVLETKTQSFEFHTIGFTSSHDVALLTQITQLGSVQG 193 Query: 398 NFAMQHIRDQDDIYPVFRELFH 419 F Q+++D ++I L Sbjct: 194 TF--QYVKDANEINQSMENLIG 213 >UniRef50_A4J6Q3 von Willebrand factor, type A n=1 Tax=Desulfotomaculum reducens MI-1 RepID=A4J6Q3_DESRM Length = 416 Score = 94.1 bits (232), Expect = 1e-17, Method: Composition-based stats. Identities = 30/213 (14%), Positives = 59/213 (27%), Gaps = 22/213 (10%) Query: 222 KIERVPFIDTFDLRYKNYEKRPDPSSQAVMFCLMDVSGSMDQSTKDMAKRFYILLYLFLS 281 + L ++ P + ++D SGSM D K+ LS Sbjct: 17 PGNKQVAYLMVKLTAPKQVEKERPVQN--LSFVIDRSGSMAGEKLDYTKKAVAFAVGHLS 74 Query: 282 RTYKNVEVVY--------IRHHTQAKEVDEHEFFYSQETGGTIVSSALKLMDEVVKERYN 333 V + H K+ + G T +S + L VK + Sbjct: 75 PQDYCSVVAFDDMVTMVASSHQVANKDALKMAVESIYPGGSTNLSGGMLLGVREVKLAHK 134 Query: 334 PAQWNIYAAQASDG-DNWADDSPLCHEILAKKLLP-VVRYYSYIEITRRAHQTLWREYEH 391 Q N +DG N ++++ V ++ + + L + Sbjct: 135 ENQINRVLL-LTDGMANVGVTDHSALVEKSREMAAGGVNLSTFG-LGEDFEEDLLQAMVE 192 Query: 392 LQSTFDNFAMQHIRDQDDIYPVFRELFHKQNAT 424 + ++ D P +F ++ Sbjct: 193 AGGGNFYYI-----EKPDQIPG---IFEQELTG 217 >UniRef50_A6WMD3 Vault protein inter-alpha-trypsin domain protein n=11 Tax=Shewanella RepID=A6WMD3_SHEB8 Length = 772 Score = 93.7 bits (231), Expect = 1e-17, Method: Composition-based stats. Identities = 25/175 (14%), Positives = 47/175 (26%), Gaps = 17/175 (9%) Query: 239 YEKRPDPSSQAVMFCLMDVSGSMDQSTKDMAKRFYILLYLFLSRTYKNVEVVY------- 291 EK PS + ++D SGSM + AK + L + + Sbjct: 382 VEKSTQPSLPRELILVIDTSGSMAGDSIVQAKNALLYALKGLKPEDSFNIIEFNSSLSLL 441 Query: 292 ----IRHHTQAKEVDEHEFFYSQETGGTIVSSALKLMDEVVKERYNPAQWN--IYAAQAS 345 + + Q GGT ++ AL +P + Sbjct: 442 SATPLPATSSNLSRARQFVSRLQADGGTEMALALDAALPKSLGSVSPDAVQPLRQVIFMT 501 Query: 346 DGDNWADDSPLCHEILAKKLLPVVRYYSYIEITRRAHQTLWREYEHLQSTFDNFA 400 DG + + L + + R ++ I + + L + Sbjct: 502 DG---SVGNEQALFDLIRYQIGESRLFTVG-IGSAPNSHFMQRAAELGRGTFTYI 552 >UniRef50_Q1D9B7 von Willebrand factor type A domain protein n=3 Tax=Cystobacterineae RepID=Q1D9B7_MYXXD Length = 476 Score = 93.7 bits (231), Expect = 1e-17, Method: Composition-based stats. Identities = 29/179 (16%), Positives = 51/179 (28%), Gaps = 13/179 (7%) Query: 231 TFDLRYKNYEKRPDPSSQAVMFCLMDVSGSMDQSTKDMAKRFYILLYLFLSRTYKNVEVV 290 TFDL + +D SGSM AK+ L L+ + + Sbjct: 79 TFDLSGAQVPGAQRSPVNLALV--IDRSGSMSGYKLAQAKQAARHLIGLLNDQDRLAIIH 136 Query: 291 Y---------IRHHTQAKEVDEHEFFYSQETGGTIVSSALKLMDEVVKERYNPAQWNIYA 341 Y + +E + GGT + + L + N Sbjct: 137 YGSDVKSLPSLEATAANRERMFQYVDGIWDEGGTNIGAGLSAGRYQLSTAQRTYGVNRLI 196 Query: 342 AQASDGDNW-ADDSPLCHEILAKKLLPVVRYYSYIEITRRAHQTLWREYEHLQSTFDNF 399 SDG + +A++L S I + ++ L + + + F Sbjct: 197 LM-SDGQPTEGLTADEELTRMARELRATGLTLSAIGVGTDFNEDLMQAFAEYGAGAYGF 254 >UniRef50_A5UTA6 von Willebrand factor, type A n=6 Tax=Chloroflexi (class) RepID=A5UTA6_ROSS1 Length = 425 Score = 93.3 bits (230), Expect = 2e-17, Method: Composition-based stats. Identities = 27/192 (14%), Positives = 56/192 (29%), Gaps = 14/192 (7%) Query: 242 RPDPSSQAVMFCLMDVSGSMDQSTKDMAKRFYILLYLFLSRTY--------KNVEVVYIR 293 + P + ++D S SM K + L +VV Sbjct: 38 QQLPKLPLNLCLVLDRSSSMRGERLMQVKEAAARIVDQLGPDDYFSLVVFNDRADVVIPA 97 Query: 294 HHTQAKEVDEHEFFYSQETGGTIVSSALKLMDEVVKERYNPAQWNIYAAQASDGDNWADD 353 K + + GGT ++ L L + V+ + + +DG + D Sbjct: 98 QRAIKKSDLKAAIAQIEAAGGTEMAQGLALALQEVQRPFLTRGISRLIL-LTDGRTYG-D 155 Query: 354 SPLCHEILAKKLLPVVRYYSYIEITRRAHQTLWREYEHLQSTFDNFAMQH---IRDQDDI 410 C EI + + + I ++ L +++ + ++ D Sbjct: 156 ESRCVEIARRGQSRGIGLTALG-IGTEWNEDLLETMTASENSRAQYIATAQDVVKVFADE 214 Query: 411 YPVFRELFHKQN 422 +F +Q Sbjct: 215 VKRLHAIFAQQV 226 >UniRef50_B5JPY1 von Willebrand factor type A domain protein n=1 Tax=Verrucomicrobiae bacterium DG1235 RepID=B5JPY1_9BACT Length = 632 Score = 93.3 bits (230), Expect = 2e-17, Method: Composition-based stats. Identities = 32/307 (10%), Positives = 67/307 (21%), Gaps = 36/307 (11%) Query: 141 QQRQLTEYKTHRAGYTANGVPANISVVRSLQNS---------LARRTAMTAGKRRELHAL 191 + GY A + + L + L Sbjct: 20 ELSPFEVSGPSTGGYRATSTLSATRIRTKLGATVGGAQDIRYLRNLIDEGIIPSPASFTA 79 Query: 192 EENLAIISNSEPAQLLEEERLRKEIAELRAKIERVPFIDTFDLRYKN--YEKRPDPSSQA 249 E + E + P +D + + Sbjct: 80 EGLFSEHDLPIGGDAKEGWLFDIASQATSFESAAQPKVDILAQLGFVSGIDATTFKPAPL 139 Query: 250 VMFCLMDVSGSMDQSTKDMAKRFYILLYLFLSRTYKNVEVVYIRHHTQAKEVDEH----- 304 + ++D SGSM ++ ++ + L + V+Y E + Sbjct: 140 NLVAVVDKSGSMSGDPLELVRKSLRQVVSQLGSDDQLSIVLYGSSTHIHLEPTKTSTENR 199 Query: 305 -----EFFYSQETGGTIVSSALKLMDEVVKERYNPAQWNIYAAQASDGD-NWADDSPLCH 358 Q G T + + L+L +V ++ + +D N Sbjct: 200 DQIIASIDRIQSHGSTAMEAGLELGYQVARQSADAFVGKTRVMLFTDERPNVGRTDATGF 259 Query: 359 EILAKKLLP---VVRYYSYIEITRRAHQTLWREYEHLQSTFDNFAMQHIRDQDDIYPVFR 415 +A+ + L + ++ F D Sbjct: 260 MAMAESGSKSDIGLTTIGVGV---HFGAELAEKISSVRGGNLFFF--------DDDESME 308 Query: 416 ELFHKQN 422 F K+ Sbjct: 309 TTFRKEL 315 >UniRef50_Q083T9 Vault protein inter-alpha-trypsin domain protein n=1 Tax=Shewanella frigidimarina NCIMB 400 RepID=Q083T9_SHEFN Length = 722 Score = 92.9 bits (229), Expect = 2e-17, Method: Composition-based stats. Identities = 26/186 (13%), Positives = 52/186 (27%), Gaps = 24/186 (12%) Query: 251 MFCLMDVSGSMDQSTKDMAKRFYILLYLFLSRTYKNVEVVYIRHHTQAKEVDEHE----- 305 + ++D SGSM + AK+ L + + T Sbjct: 346 LILVIDTSGSMSGQSITQAKQALQFALAGLRDIDSFNIIEFNSDVTMLSATPLSANSRNI 405 Query: 306 ------FFYSQETGGTIVSSALKLMD-----EVVKERYNPAQWNIYAAQASDGDNWADDS 354 GGT + SAL+ + + ++ +DG A + Sbjct: 406 GKANRFIQSLDADGGTEMRSALQTALVDSVQQDSDQTDAHSEMLRQVIFMTDG---AVGN 462 Query: 355 PLCHEILAKKLLPVVRYYSYIEITRRAHQTLWREYEHLQSTFDNFAMQHIRDQDDIYPVF 414 L L R ++ I + R + + I ++ ++ Sbjct: 463 EHELYQLINDQLGDSRLFTVG-IGSAPNSDFMRRAATMGRGTFTY----IGNESEVQQKI 517 Query: 415 RELFHK 420 +L +K Sbjct: 518 EQLLNK 523 >UniRef50_C5EGH1 von Willebrand factor n=2 Tax=Clostridiales RepID=C5EGH1_9FIRM Length = 681 Score = 92.9 bits (229), Expect = 2e-17, Method: Composition-based stats. Identities = 44/395 (11%), Positives = 101/395 (25%), Gaps = 35/395 (8%) Query: 43 VTDVDSGESVSIPTEDISEPMFHQGRGGLRHRVHPGNDHFVQNDRIERPQGGGGGSGSGQ 102 + + V+ ++ E +RP + Sbjct: 105 MKMQVGDKVVTAKIKEKEEAKQEFDAAKSE-------GKSASLLEQQRPNVFTMNVANI- 156 Query: 103 GQASQDGEGQDEFVFQISKDEYLDLLFEDLALPNLK-QNQQRQLTEYKTHRAGYTANG-- 159 + + +F + P + R+ + + Y G Sbjct: 157 MPGDTVNIELHYTEMIALSEGSYEFVFPAVVGPRYSSPSPDREEDGNQWVASPYQEGGAV 216 Query: 160 VPANISVVRSLQNSLARRTAMTAGKRRELHALEENLAIISNSEPAQLLEEERLRKEIAEL 219 + SL + +++ + + ++ A I+ +PA Sbjct: 217 PKGTYDIAVSLSTGVPITGIVSSSHKINIEQSADSSAHITLKDPADYGGNRDFILRYQLA 276 Query: 220 RAKIERVPFIDTFDLRYKNYEKRPDPSS-------QAVMFCLMDVSGSMDQSTKDMAKRF 272 + ++T + P ++DVSGSM D AK Sbjct: 277 GQTVNSGLMLNTGEKENFFLLMVQPPERVPAEAIPPREYIFVLDVSGSMFGYPLDTAKEL 336 Query: 273 YILLYLFLSRTYKNVEVVY----IRHHTQAKEVDEHE-------FFYSQETGGTIVSSAL 321 + L T +++ IR ++ + + GGT ++ AL Sbjct: 337 IRNMVSNLRETDTFNLILFSNDAIRMSARSLPATDENVERAINLINRQKGGGGTELAPAL 396 Query: 322 KLMDEVVKERYNPAQWNIYAAQASDGDNWADDSPLCHEILAKKLLPVVRYYSYIEITRRA 381 + + + + +DG + L ++S+ T Sbjct: 397 EKAVGIPMD-SGAGSVSRSVVVITDG---YMSDEQAIFDIVAGNLDTTSFFSFGIGTS-V 451 Query: 382 HQTLWREYEHLQSTFDNFAMQHIRDQDDIYPVFRE 416 ++ L +F + + D +F Sbjct: 452 NRYLIEGIARTGGGE-SFVVTDSSESADTARLFDT 485 >UniRef50_A0CHZ1 Chromosome undetermined scaffold_185, whole genome shotgun sequence n=1 Tax=Paramecium tetraurelia RepID=A0CHZ1_PARTE Length = 265 Score = 92.5 bits (228), Expect = 2e-17, Method: Composition-based stats. Identities = 42/208 (20%), Positives = 77/208 (37%), Gaps = 19/208 (9%) Query: 163 NISVVRSLQNSLARRTAMTAGKRRELHALEENLAIISNSEPAQLLEEERLRKEIAELR-- 220 N SL +++ TA + ++ + ++P E+ L EI L+ Sbjct: 55 NFGYQHSLGPKYSQQLPQTAISQEIFDDDDQVQTNLVQAKPNMYDLEKELIFEIKTLQKM 114 Query: 221 ---AKIERVPFIDTFDLRYKN-YEKRPDPSSQAVMFCLMDVSGSMDQSTKDMAKRFYILL 276 +KI ++ K+ + + CL+D S SM+ S + K+ +L Sbjct: 115 IKLSKISTQQLPGIISIKTKDQLNDQDLNRVGVDLICLIDKSSSMNGSKIETVKQSLKVL 174 Query: 277 YLFLSRTYKNVEVVYIRHH---TQAKEVDE-------HEFFYSQETGGTIVSSALKLMDE 326 FLS + +++ H T K + E + GGT +SSA ++ Sbjct: 175 LTFLSNQDRLQLIIFNTHAKRLTPLKRITEDNKLYFTQMIDQIKSDGGTQISSATQIAIS 234 Query: 327 VVKERYNPAQWNI-YAAQASDGDNWADD 353 +K R + N+ SDG + Sbjct: 235 QLKGR--KYRNNVSSVFLLSDGQDNDAT 260 >UniRef50_C1XMC3 Uncharacterized protein containing a von Willebrand factor type A (VWA) domain n=1 Tax=Meiothermus ruber DSM 1279 RepID=C1XMC3_MEIRU Length = 412 Score = 92.5 bits (228), Expect = 3e-17, Method: Composition-based stats. Identities = 31/202 (15%), Positives = 56/202 (27%), Gaps = 20/202 (9%) Query: 234 LRYKNYEKRPDPSSQAV-MFCLMDVSGSMDQSTKDMAKRFYILLYLFLSRTYKNVEVVY- 291 LR + P + + ++D SGSM S K I L + V+Y Sbjct: 34 LRIHTPTPQARPERPLLNLALVLDRSGSMGGSKLKYTKEAAIYAVHNLLPEDRVAVVIYD 93 Query: 292 --------IRHHTQAKEVDEHEFFYSQETGGTIVSSALKLMDEVVKERYNPAQWNIYAAQ 343 + + + G T + + V + N Sbjct: 94 DAVEVLVPSTPVADGRAAIANLIRTIRTGGSTALHAGWLEGATQVAAYQEAGRLNRVVL- 152 Query: 344 ASDG-DNWADDSPLCHEILAKKLLPVVRYYSYIEITRRAHQTLWREYEHLQSTFDNFAMQ 402 SDG N + +P ++L S + + ++ L F Sbjct: 153 LSDGLANRGETNPGVIAEQVRELARRGVSTSTLGVGLDYNEDLMTTMADAGEGNYYF--- 209 Query: 403 HIRDQDDIYPVFRELFHKQNAT 424 I D+ +F ++ A Sbjct: 210 -IESPADLP----RIFAQELAG 226 >UniRef50_Q23FU3 von Willebrand factor type A domain containing protein n=1 Tax=Tetrahymena thermophila SB210 RepID=Q23FU3_TETTH Length = 755 Score = 92.1 bits (227), Expect = 3e-17, Method: Composition-based stats. Identities = 25/170 (14%), Positives = 54/170 (31%), Gaps = 21/170 (12%) Query: 248 QAVMFCLMDVSGSMDQSTKDMAKRFYILLYLFLSRTYKNVEVVYIRH----------HTQ 297 + CL+D SGSM + ++ L L + + V + + + Sbjct: 47 PVDIICLIDNSGSMAGKKAQLVRKSLKYLLKILEKGDQISLVSFSSTAKTLCPLTQVNDE 106 Query: 298 AKEVDEHEFFYSQETGGTIVSSALKLMDEVVKERYNPAQWNIYAAQASDGDNWADDSPLC 357 K+ + GGT V K + +++ R + + +DG+ DS Sbjct: 107 NKQQIKSAIKQINGQGGTFVIPGFKEVTKILNSR-KEQREQTFILLLTDGEFGDIDSGKV 165 Query: 358 HEILAK-------KLLPVVRYYSYIEITRRAHQTLWREYEHLQSTFDNFA 400 + + + + P + Y Y + + + +E Sbjct: 166 IQNINRLFTQSEIQKTPYIYTYGYGD---DVNPEILQEIAQKFQGKYCLI 212 >UniRef50_Q9LN03 T6D22.13 n=2 Tax=Arabidopsis thaliana RepID=Q9LN03_ARATH Length = 641 Score = 92.1 bits (227), Expect = 4e-17, Method: Composition-based stats. Identities = 36/219 (16%), Positives = 67/219 (30%), Gaps = 22/219 (10%) Query: 203 PAQLLEEERLRKEIAELRAKIERVPFIDTFDLRYKNYEKRPDP-SSQAVMFCLMDVSGSM 261 E +L E++ L + R F L+ + + + ++DVSGSM Sbjct: 156 SDHQALEIKLFPEVSALAKPVSRADFAVLVHLKAEGVSDDARRARAPLDLITVLDVSGSM 215 Query: 262 DQSTKDMAKRFYILLYLFLSRTYKNVEVVYIRHHTQA----------KEVDEHEFFYSQE 311 D ++ K + L T + + + + K+ Sbjct: 216 DGVKMELMKNAMSFVIQNLGETDRLSVISFSSMARRLFPLRLMSETGKQAAMQAVNSLVA 275 Query: 312 TGGTIVSSALKLMDEVVKERYNPAQWNIYAAQASDG-DNWADDSPLCHEIL-AKKLLPV- 368 GGT ++ LK+ V++ R + SDG DN+ + LLP Sbjct: 276 DGGTNIAEGLKIGARVIEGRRWKNPVS-GMMLLSDGQDNFTFSHAGVRLRTDYESLLPSS 334 Query: 369 ----VRYYSYIEITRRAHQTLWREYEHLQSTFDNFAMQH 403 + + + L + S +F Sbjct: 335 CRIPIHTFGFG---SDHDAELMHTISEVSSGTFSFIETE 370 >UniRef50_A7RKA1 Predicted protein n=2 Tax=Nematostella vectensis RepID=A7RKA1_NEMVE Length = 1128 Score = 91.7 bits (226), Expect = 4e-17, Method: Composition-based stats. Identities = 29/229 (12%), Positives = 58/229 (25%), Gaps = 42/229 (18%) Query: 229 IDTFDLRYKNYEKRPDPSSQAVMFCLMDVSGSMDQSTKDMAKRFYILLYLFLSRTYKNVE 288 ++D RY+ + + ++D SGSM S +AK + L+ + Sbjct: 191 CSSYDPRYRPWYVEAASPQPKDVILVVDYSGSMGGSRLPIAKEAAKTVLDTLNPRDRVAF 250 Query: 289 VVYIRHHTQAKEVDEHE------------------------FFYSQETGGTIVSSALKLM 324 + + + K +GGT+ + A Sbjct: 251 LAFESGVRRVKVTSGDAKDEKCFESSLAKASPVNIDILKKFLDGEYASGGTMYAIAFNAA 310 Query: 325 DEVVKERYNPAQWNI--YAAQASDGDNWADDSPLCHEILAKKLLPVVR------YYSYIE 376 +++ + Y +DG +D P K + + Sbjct: 311 FDILDKYYKEKNTTRRPVILFMTDG--APNDDPGTILNTVKTRNQGLSTKADILTFGMGG 368 Query: 377 ITRRAHQTLWREYEHLQ-STFDNFAMQHIRDQDD-------IYPVFREL 417 A L + F + D + R+L Sbjct: 369 GISPAGVDLLQSLAEQTLDGGARFEVSLTTALRDVSRHLLAVARSARKL 417 >UniRef50_A1VI76 Vault protein inter-alpha-trypsin domain protein n=3 Tax=Burkholderiales RepID=A1VI76_POLNA Length = 701 Score = 91.7 bits (226), Expect = 4e-17, Method: Composition-based stats. Identities = 31/187 (16%), Positives = 51/187 (27%), Gaps = 20/187 (10%) Query: 242 RPDPSSQAVMFCLMDVSGSMDQSTKDMAKRFYILLYLFLSRTYKNVEVVYIRHH---TQA 298 S ++D+SGSM D AK L L + +++ + + A Sbjct: 319 AAQAISPRDYIFVVDISGSMHGFPLDTAKTLMRELIGKLRPSDTFNVLLFSGSNRFLSPA 378 Query: 299 KEVDEHE--------FFYSQETGGTIVSSALKLMDEVVKERYNPAQWNIYAAQASDGDNW 350 GGT + ALK + K A + +DG Sbjct: 379 SVPATQANIEQAVRTIDEMGGGGGTELIPALKRVYAEPK----AADVSRTVVVVTDG--- 431 Query: 351 ADDSPLCHEILAKKLLPVVRYYSYIEITRRAHQTLWREYEHLQSTFDNFAMQHIRDQDDI 410 L ++ L +S+ I ++ L F + Sbjct: 432 FVTVEREAFELVRRNLSQANLFSFG-IGSSVNRHLMEGLARAGMGEP-FIITEPSQARAQ 489 Query: 411 YPVFREL 417 FR L Sbjct: 490 AERFRRL 496 >UniRef50_D0LNY0 von Willebrand factor type A n=1 Tax=Haliangium ochraceum DSM 14365 RepID=D0LNY0_HALO1 Length = 808 Score = 91.7 bits (226), Expect = 4e-17, Method: Composition-based stats. Identities = 38/281 (13%), Positives = 78/281 (27%), Gaps = 40/281 (14%) Query: 165 SVVRSLQNSLARRTAMTAGKRRELHALEENLAIISNSEPAQLLEEERLRKEIAELRAKIE 224 S + + A L + E L + + +R R + + Sbjct: 377 SASTAAVELVKYGVANGERPHPSLARVWEFLNYETFDSASYEELGDRFRVSMGMVSRPSL 436 Query: 225 RVPFIDTFDLRYK----NYEKRPDPSSQAVMFCLMDVSGSMDQS-----------TKDMA 269 + L N + P AV+ L+D+SGSM + D+ Sbjct: 437 TQDGAVDYLLGANVTVPNLTREERP--HAVVTFLVDISGSMAEYSPTVDAGGAPTRMDIV 494 Query: 270 KRFYILLYLFLSRTYKNVEVVYIRHHTQAK-EVDEHEF--------------FYSQETGG 314 + L V + A+ E++ E GG Sbjct: 495 REGLWKAVSALKPGD---IVNVVSFDDAAQIELERGEIRPGAATPRPYLRSVLRLLPRGG 551 Query: 315 TIVSSALKLMDEVVKERYNPAQWNIYAAQASDG-DNWADDSPLCH-EILAKKLLPVVRYY 372 T +S+ +++ V + Y+P + N +D N P + + + + Sbjct: 552 TNLSAGIEVAYRVARRNYDPYRINRVII-LTDAYANRGSIDPSLIGDHVLIGDDEGIHFS 610 Query: 373 SYIEITRRAHQTLWREYEHLQSTFDNFAMQHIRDQDDIYPV 413 + ++ + F++ RD + Sbjct: 611 GLG-VGYDFNEDFLNTLTDVGRGTY-FSLITERDAARAFGE 649 >UniRef50_UPI0001C38CFE von Willebrand factor, type A n=1 Tax=Arthrospira platensis str. Paraca RepID=UPI0001C38CFE Length = 489 Score = 91.7 bits (226), Expect = 5e-17, Method: Composition-based stats. Identities = 25/186 (13%), Positives = 56/186 (30%), Gaps = 25/186 (13%) Query: 251 MFCLMDVSGSMDQSTKDMAKRFYILLYLFLSRTYKNVEVVYIRHHTQAKEVD-------- 302 + L+D SGSM S +R + K ++ + ++A V Sbjct: 54 VVMLIDTSGSMSGSKLPEVQRAASEFVS--RQNLKRDDLAVVEFSSRASVVADFTRDERE 111 Query: 303 -EHEFFYSQETGGTIVSSALKLMDEVVKERYNPAQWNIYAAQASDGDNWADDSPLCHEIL 361 + GGT +S L V++ P +DG+ ++ + Sbjct: 112 LQQAIARLSAWGGTNLSEGFNLATSVLQNSDRPGN----ILLFTDGE---PNNRRMAASI 164 Query: 362 AKKLLP-VVRYYSYIEITRRAHQTLWREYEHLQSTFDNFAMQHIRDQDDIYPVFRE-LFH 419 A+++ + + + F + D D + + ++ Sbjct: 165 AQQIRASGINLVAVGTGDAPVNYLT----ALTGDPDLVF-YANFGDLDSAFRGAEKAIYG 219 Query: 420 KQNATA 425 +Q + Sbjct: 220 QQLVES 225 >UniRef50_Q9M1S2 Putative uncharacterized protein T5N23_140 n=5 Tax=Arabidopsis thaliana RepID=Q9M1S2_ARATH Length = 676 Score = 91.4 bits (225), Expect = 6e-17, Method: Composition-based stats. Identities = 26/249 (10%), Positives = 67/249 (26%), Gaps = 29/249 (11%) Query: 175 ARRTAMTAGKRRELHALE------------ENLAIISNSEPAQLLEEE-RLRKEIAELRA 221 R + E E L + + + + + ++ Sbjct: 158 QRAITQGHPEPATFDDDERLEEQIVFDGETEVLKKENRDYVRMMDMKVYPEVSAVPQSKS 217 Query: 222 KIERVPFIDTFDLRYKNYEKRPDPSSQAVMFCLMDVSGSMDQSTKDMAKRFYILLYLFLS 281 + + + + ++D+SGSM + + KR + L Sbjct: 218 CENFDVLVHLKAVTGDQISQYRRAPID--LVTVLDISGSMGGTKLALLKRAMGFVIQNLG 275 Query: 282 RTYKNVEVVYIRH----------HTQAKEVDEHEFFYSQETGGTIVSSALKLMDEVVKER 331 + + + + +++ GGT + L+ +V+++R Sbjct: 276 SSDRLSVIAFSSTARRLFPLTRMSDAGRQLALQAVNSLVANGGTNIVDGLRKGAKVMEDR 335 Query: 332 YNPAQWNIYAAQASDGDNWADDSPLCHEILAKKLLPVVRYYSYIEITRRAHQTLWREYEH 391 SDG + + K +LP + +S+ ++ Sbjct: 336 LERNSVASIIL-LSDGRDTYTTNHPDPSY--KVMLPQISVHSFGF-GSDHDASVMHSVSE 391 Query: 392 LQSTFDNFA 400 + +F Sbjct: 392 VSGGTFSFI 400 >UniRef50_B9GK57 Predicted protein (Fragment) n=2 Tax=Populus trichocarpa RepID=B9GK57_POPTR Length = 595 Score = 91.4 bits (225), Expect = 6e-17, Method: Composition-based stats. Identities = 39/305 (12%), Positives = 74/305 (24%), Gaps = 51/305 (16%) Query: 134 LPNLKQNQQRQLTEYKTHRAGYTANGVPANISVVRSLQNSLARRTAMTAGKRRELHALEE 193 +P + A Y N P +I + L + HA+ Sbjct: 57 VPFQAPKNVPSFQRSGSLHA-YVPNASPVHIEPDHFSDDELVPDVSQGQPSSSRPHAI-T 114 Query: 194 NLAIISNSEPAQLLEEERLRKEIAELRAKIERVPFIDTFDLRYKNYEKRPDPSSQAVMFC 253 + + + + L ++ P + + Sbjct: 115 VKTLPEYPAVSASESFSKFGVLVRVLAPPLD---------------NTLPHHRAPIDIVN 159 Query: 254 LMDVSGSMDQSTKDMAKRFYILLYLFLSRTYKNVEVVYIRH----------HTQAKEVDE 303 ++DVSGSM + KR + L + + V + +E Sbjct: 160 VLDVSGSMAG-KLILLKRAVNFIIQNLGPSDRLSIVTFSSSARRILPLRTMSGSGREDAI 218 Query: 304 HEFFYSQETGGTIVSSALKLMDEVVKERYNPAQWNIYAAQASDGDNWADDSPLCHEILAK 363 TGGT + + L+ V++ER SDG + S K Sbjct: 219 SVVNSLSATGGTNIVAGLRKGVRVLEERRQHNSVASIIL-LSDGCDTQSHSTHNRLEYLK 277 Query: 364 KLLPV--------------VRYYSYIEITRRAHQTLWREYEHLQSTFDNFAMQHIRDQDD 409 + P + + + + +F + D Sbjct: 278 LIFPSNNASGEESRQPTFPIHTFGFGL---DHDSAAMHAISDVSGGTFSFI-----ESID 329 Query: 410 IYPVF 414 I Sbjct: 330 ILQDA 334 >UniRef50_A0KNK1 Putative uncharacterized protein n=1 Tax=Aeromonas hydrophila subsp. hydrophila ATCC 7966 RepID=A0KNK1_AERHH Length = 552 Score = 91.4 bits (225), Expect = 6e-17, Method: Composition-based stats. Identities = 42/286 (14%), Positives = 85/286 (29%), Gaps = 26/286 (9%) Query: 105 ASQDGEGQDEFVFQISKDEYLDLLFEDLALPNLKQ-NQQRQLTEYKTHRAGYTANGVPAN 163 G + ++ SK L + + +LK + + + Sbjct: 259 GYGQILGDIDTTYEKSKALLLACSGANFSYNDLKLCKDDIEPLAKQLQQNHAIKELTYK- 317 Query: 164 ISVVRSLQNSLARRTAMTAGKRRELHALEENLAIISNSEPAQLLEEERLRKEIAELRAKI 223 + R+ + ++ A + + + + L Sbjct: 318 --MGRAYISEEKKKQA--RIPHASKSEVHGTHRSEDLARVLPTELLNLEDEALETLFYAR 373 Query: 224 ERVPFIDTFDLRYKNYEKRPDPSSQ----AVMFCLMDVSGSMDQSTKDMAKRFYILLYLF 279 + T++L+ + + +D SGSM + A+ + + Sbjct: 374 FLERNLMTYELQGTTCTSGEQLELEQKRTGPVVACLDTSGSMSGAPLLKARALLLAVSAV 433 Query: 280 LSRTYKNVEVVYIRHHTQAKEVDEHE---------FFYSQETGGTIVSSALKLMDEVVK- 329 L + +++ VV + + +E HE F GGT + L E+++ Sbjct: 434 LQQEARSLHVVLFGDNGELREYAIHEENSASGLLHFLRQGFGGGTDFETPLNRACEIIRD 493 Query: 330 -ERYNPAQWNIYAAQASDGDNWADDSPLCHEILAKKLLPVVRYYSY 374 + Y A SDGD D + H KK+L YS Sbjct: 494 AKEYEKAD----ILMISDGDCVLSDDYIEHLQTRKKIL-DCSIYSV 534 >UniRef50_A0EFJ5 Chromosome undetermined scaffold_93, whole genome shotgun sequence n=2 Tax=Paramecium tetraurelia RepID=A0EFJ5_PARTE Length = 610 Score = 91.4 bits (225), Expect = 7e-17, Method: Composition-based stats. Identities = 32/274 (11%), Positives = 79/274 (28%), Gaps = 25/274 (9%) Query: 171 QNSLARRTAMTAGKRRELHALEENLAIISNSEPAQLLEEERLRKEIAELRAKIERVPF-- 228 Q + + G + E E+ + I + L + ++ Sbjct: 43 QEIVKSLLSEQQGMQLEEKVEEQTVITIDQEITDPDALQVELLNSVHLNVLPRQKAIQVQ 102 Query: 229 ----IDTFDLRYKNYEKRPDPS-SQAVMFCLMDVSGSMDQSTKDMAKRFYILLYLFLSRT 283 I L+ ++ + + + + C++DVSGSM+ + + + L T Sbjct: 103 EYSQILPVVLQIQSLKSQLKKQRANIDLMCVVDVSGSMNGEKIKLVQNSLRYIQKILKPT 162 Query: 284 YKNVEVVYIRHHTQAKEVDEH----------EFFYSQETGGTIVSSALKLMDEVVKERYN 333 + V + + + + T ++S + L ++++R Sbjct: 163 DRLALVTFGTQAGINLQWTRNIAENKKKIKKAIKDIKIRDSTNIASGVALGLRMIRDRKF 222 Query: 334 PAQWNIYAAQASDG-DNWADDSPLCHEILAKKLLPV---VRYYSYIEITRRAHQTLWREY 389 SDG D+ C + L + + + + Y + Sbjct: 223 KNPVT-SMFVLSDGVDDDRGADLRCQQALHQYNIQDTLTINTFGYG---SDHDAKVMNNI 278 Query: 390 EHLQSTFDNFAMQHIRDQDDIYPVFRELFHKQNA 423 +L+ + Q R + + + Sbjct: 279 ANLKGGQFVYIDQIQRVSEHFILAMSGMLSVKAK 312 >UniRef50_B8CPU4 Von Willebrand factor, type A n=1 Tax=Shewanella piezotolerans WP3 RepID=B8CPU4_SHEPW Length = 710 Score = 91.4 bits (225), Expect = 7e-17, Method: Composition-based stats. Identities = 32/186 (17%), Positives = 52/186 (27%), Gaps = 22/186 (11%) Query: 251 MFCLMDVSGSMDQSTKDMAKRFYILLYLFLSRTYKNVEV-----VYIRHHT----QAKEV 301 + ++D SGSM + AK I LS + VY T AK + Sbjct: 351 LILVIDTSGSMSGEAIEQAKASIIYALAGLSAQDSFNILQFNSNVYALSDTPLNASAKNI 410 Query: 302 --DEHEFFYSQETGGTIVSSALKLMDEVVKERYNPAQWNIYAAQASDGDNWADDSPLCHE 359 + Q GGT +S AL + +DG A + Sbjct: 411 GRAQAYVQRLQANGGTEMSLALDKALSQQDAN---RERLRQVLFITDG---AVGNEPQLF 464 Query: 360 ILAKKLLPVVRYYSYIEITRRAHQTLWREYEHLQSTFDNFAMQHIRDQDDIYPVFRELFH 419 + L R ++ I + + L + I Q ++ + Sbjct: 465 TQIRNQLQQSRLFTIG-IGDAPNAHFMQRAAELGRGTYTY----IGKQSEVKSKMVAMLD 519 Query: 420 KQNATA 425 K Sbjct: 520 KLEKPT 525 >UniRef50_Q8YP40 Alr4360 protein n=15 Tax=Cyanobacteria RepID=Q8YP40_ANASP Length = 427 Score = 91.0 bits (224), Expect = 8e-17, Method: Composition-based stats. Identities = 34/205 (16%), Positives = 61/205 (29%), Gaps = 31/205 (15%) Query: 238 NYEKRPDPSSQAVMFCLMDVSGSMDQSTKDMAKRFYILLYLFLSRTYKNVEVVYIRHHT- 296 ++ + + + ++D SGSM M L L + V + T Sbjct: 31 AVAEQFEQNLPLNLCLILDQSGSMHGQPLKMVVEAVEKLLDRLQPGDRISVVAFAGSATV 90 Query: 297 -------QAKEVDEHEF-FYSQETGGTIVSSALKLMDEVVKERYNPAQWNIYAAQASDGD 348 + E + + Q +GGT+++ L+ + + A A +DG Sbjct: 91 IIPNQIVENPESIKTQIRKKLQASGGTVIAEGLQQGITELMKGTRGAVSQ--AFLLTDGH 148 Query: 349 NW-----------ADDSPLCHE--ILAKKLLPVVRYYSYIEITRRAHQTLWREYEHLQST 395 DDS C E A K+ + + +Q L Sbjct: 149 GEDSLKIWKWEIGPDDSRRCLEFAKKAAKINLTINTLGFGN---NWNQDLLETIADAGGG 205 Query: 396 FDNFAMQHIRDQDDIYPVFRELFHK 420 + HI + F LF + Sbjct: 206 T----LAHIERPEQAVHHFNRLFTR 226 >UniRef50_B9MS10 von Willebrand factor type A n=2 Tax=Anaerocellum thermophilum DSM 6725 RepID=B9MS10_ANATD Length = 1188 Score = 91.0 bits (224), Expect = 8e-17, Method: Composition-based stats. Identities = 25/178 (14%), Positives = 58/178 (32%), Gaps = 26/178 (14%) Query: 251 MFCLMDVSGSMD-----QSTKDMAKRFYILLYLFLSRTYKNVEVVYIRHH------TQAK 299 + ++D SGSM K AK F L + V + T Sbjct: 500 LVFVLDSSGSMSWNDPNGYRKIAAKSFVDALIQ----GDRAAVVDFDNFGYLLQPLTTDF 555 Query: 300 EVDEHEFFYSQETGGTIVSSALKLMDEVVKERYNPAQWNIYAAQASDGDNWADDSPLCHE 359 + ++ GGT ++ +++ ++ + R + + + +DG+ + D++ Sbjct: 556 QAVKNAIDRIDSWGGTNIAEGIRIANQQLISRSSEDRIKVIIL-LTDGEGYYDNNLTT-- 612 Query: 360 ILAKKLLPVVRYYSYIEITRRAHQTLWREYEHLQSTFDNFAMQHIRDQDDIYPVFREL 417 + + Y+ T + L R+ + + VF+ + Sbjct: 613 ---EAKNNGITIYTIGLGTS-VDENLLRDIATQTGGMY----FPVSSASQLPQVFKRI 662 >UniRef50_A6X8G3 LPXTG-motif cell wall anchor domain protein n=11 Tax=Rhizobiales RepID=A6X8G3_OCHA4 Length = 750 Score = 90.6 bits (223), Expect = 9e-17, Method: Composition-based stats. Identities = 30/198 (15%), Positives = 52/198 (26%), Gaps = 23/198 (11%) Query: 239 YEKRPDPSSQAVMFCLMDVSGSMDQSTKDMAKRFYILLYLFLSRTYKNVEVVYIRHHTQA 298 Q + ++D SGSM ++ + AK L + + + T+ Sbjct: 342 PALASPKKVQREVIFVIDNSGSMGGTSIEQAKASLDYALSQLQPGDRFNVIRFDDTLTKF 401 Query: 299 KE----VDEHEFF-------YSQETGGTIVSSALKLMDEVVKERYNPAQWNIYAAQASDG 347 E ++ + GGT + AL + N +DG Sbjct: 402 FEDSVDANQENIASARRFVTSLEAQGGTEMLPALHAALD----DSNQGNGLRQIVFLTDG 457 Query: 348 DNWADDSPLCHEILAKKLLPVVRYYSYIEITRRAHQTLWREYEHLQSTFDNFAMQHIRDQ 407 + + R + I + L L HI Sbjct: 458 E---ISNEQQLLDAVAARRGRSRIFMVG-IGSAPNSYLMNRAAELGRGTF----THIGSA 509 Query: 408 DDIYPVFRELFHKQNATA 425 ++ R LF K A Sbjct: 510 AEVDERMRALFDKLENPA 527 >UniRef50_C5CDX4 von Willebrand factor type A n=1 Tax=Kosmotoga olearia TBF 19.5.1 RepID=C5CDX4_KOSOT Length = 730 Score = 90.2 bits (222), Expect = 1e-16, Method: Composition-based stats. Identities = 24/189 (12%), Positives = 56/189 (29%), Gaps = 14/189 (7%) Query: 239 YEKRPDPSSQAVMFCLMDVSGSMDQSTKDMAKRFYILLYLFLSRTYKNVEVVYIRHHTQA 298 + P+ + ++D+SGSM + AK + + L + + + Sbjct: 264 PPREPERIIPKDIVFILDISGSMSGQKIEKAKLALLQVLQMLHEGDRFSIITFNNEVNNL 323 Query: 299 KEVDE---------HEFFYSQETGGTIVSSALKLMDEVVKERYNPAQWNIYAAQASDGDN 349 E G T + AL EV+ + ++ + +DG Sbjct: 324 TERLLPFSDRTEWYPAVKQIMAGGMTNIHDALLEGIEVLGTQSTDDRYKVVLF-LTDGAP 382 Query: 350 W-ADDSPLCHEILAKKLL--PVVRYYSYIEITRRAHQTLWREYEHLQSTFDNFAMQHIRD 406 + KL V + + + + L E + +++ Sbjct: 383 TEGITDIGTIIRDSTKLAKVRDVHLFVFG-VGYDVNAELLDELAEKGGGKVKYIVENEEI 441 Query: 407 QDDIYPVFR 415 + + ++R Sbjct: 442 DEKVLELYR 450 >UniRef50_B4VT64 Vault protein inter-alpha-trypsin n=9 Tax=Cyanobacteria RepID=B4VT64_9CYAN Length = 1037 Score = 90.2 bits (222), Expect = 1e-16, Method: Composition-based stats. Identities = 33/225 (14%), Positives = 64/225 (28%), Gaps = 26/225 (11%) Query: 212 LRKEIAELRAKIERVPFIDTFDLRYKN-----YEKRPDPSSQAVMFCLMDVSGSMDQSTK 266 LR ++A + + D + E + + + L+D SGS S Sbjct: 623 LRYQVAGADTQATVLTQADERGGHFATYLIPAIEYQQNEIVPKDVVFLVDTSGSQSGSPI 682 Query: 267 DMAKRFYILLYLFLSRTYKNVEVVYIRHHT-----------QAKEVDEHEFFYSQETGGT 315 +K L+ + + T Q ++ + GGT Sbjct: 683 VQSKELMRQFIQGLNPQDTFTIIDFANSTTQLSDKPLANTPQNRKKALNYINRLDANGGT 742 Query: 316 IVSSALKLMDEVVKERYNPAQWNIYAAQASDGDNWADDSPLCHEILAKKLLPVVRYYSYI 375 + + + + PA +DG D + +L P R YS+ Sbjct: 743 ELMNGIDTVLNFPAA---PAGRLRSVVLLTDG--LIGDDEQIIAEIRDRLKPGNRLYSFG 797 Query: 376 EITRRAHQTLWREYEHLQSTFDNFAMQHIRDQDDIYPVFRELFHK 420 + ++ L L + + V +E F + Sbjct: 798 VGSST-NRFLIERLAELGRGTAEVVPPN----ESAEVVAQEFFQE 837 >UniRef50_A1S752 Inter-alpha-trypsin inhibitor domain protein n=1 Tax=Shewanella amazonensis SB2B RepID=A1S752_SHEAM Length = 753 Score = 89.8 bits (221), Expect = 2e-16, Method: Composition-based stats. Identities = 26/181 (14%), Positives = 49/181 (27%), Gaps = 19/181 (10%) Query: 251 MFCLMDVSGSMDQSTKDMAKRFYILLYLFLSRTYKNVEVVYIRHHTQAK--EVDEHEFF- 307 + ++D SGSM + A+ I L + + F Sbjct: 399 LVLVIDTSGSMAGDSMVQARSALIHALGGLGPQDSFNIIAFSSDARPLWPDAKPATAFNL 458 Query: 308 --------YSQETGGTIVSSALKLMDEVVKERYNPAQWNIYAAQASDGDNWADDSPLCHE 359 + GGT ++SAL+L + + +DG A + Sbjct: 459 GAAQQFVRSLEADGGTEMASALELALKTPSVVDEDTKRLRQVLFITDG---AVNGEDALF 515 Query: 360 ILAKKLLPVVRYYSYIEITRRAHQTLWREYEHLQSTFDNFAMQHIRDQDDIYPVFRELFH 419 L ++ L R + I + F I ++ +L Sbjct: 516 NLIERRLGTSRLFPVA-IGAAPNGYFMSRAAAAGRGSFTF----IGHGGEVAEKMNQLLS 570 Query: 420 K 420 + Sbjct: 571 R 571 >UniRef50_A5UYN7 von Willebrand factor, type A n=1 Tax=Roseiflexus sp. RS-1 RepID=A5UYN7_ROSS1 Length = 459 Score = 89.8 bits (221), Expect = 2e-16, Method: Composition-based stats. Identities = 25/185 (13%), Positives = 54/185 (29%), Gaps = 14/185 (7%) Query: 241 KRPDPSSQAVMFCLMDVSGSMDQSTKDMAKRFYILLYLFLSRTYKNVEVVYI-------- 292 R + ++DVSGSM + AK FL V + Sbjct: 83 PREQHRPPLHLVAVLDVSGSMSGTKLASAKEALRQALHFLQDGDVFSLVTFSDQVQTHLK 142 Query: 293 --RHHTQAKEVDEHEFFYSQETGGTIVSSALKLMDEVVKERYNPAQWNIYAAQASDGD-N 349 + + ++ E+ + +G T + L ++ +++ SDG N Sbjct: 143 AESYAQRKRDKMENLLDEIRASGMTALDGGLAQGIDLGQKKRQATT---LVLLLSDGQAN 199 Query: 350 WADDSPLCHEILAKKLLPVVRYYSYIEITRRAHQTLWREYEHLQSTFDNFAMQHIRDQDD 409 + + A+K S + + ++ L E + + + Sbjct: 200 VGETDLEKIGLRAQKARQSGLIVSTLGVGLDYNEALMVEIANQGGGRFYHIQEGSQIPAA 259 Query: 410 IYPVF 414 + Sbjct: 260 LMQEL 264 >UniRef50_Q22ST4 von Willebrand factor type A domain containing protein n=1 Tax=Tetrahymena thermophila SB210 RepID=Q22ST4_TETTH Length = 648 Score = 89.4 bits (220), Expect = 2e-16, Method: Composition-based stats. Identities = 24/163 (14%), Positives = 50/163 (30%), Gaps = 16/163 (9%) Query: 249 AVMFCLMDVSGSMDQSTKDMAKRFYILLYLFLSRTYKNVEVVYIRH----------HTQA 298 + ++D SGSM+ + K + + +S + V + + Sbjct: 222 VDLVVVIDKSGSMEGEKIQLVKETLVKIINLMSSMDRICIVCFNESGDRPLTFTRVTDEN 281 Query: 299 KEVDEHEFFYSQETGGTIVSSALKLMDEVVKERYNPAQWNIYAAQASDGDNWADDSPLCH 358 K+ + GGT +S + + ++ R SDG + + + Sbjct: 282 KQTLLNLIQQIYAGGGTNISEGINHALKAIQNRKFKNNVT-SILLLSDGQDTKAYTRVKA 340 Query: 359 EILAKKLLPVVRY--YSYIEITRRAHQTLWREYEHLQSTFDNF 399 I ++ + E L R L++ NF Sbjct: 341 YIDKYQIKDAFNIETIGFGE---DHDPKLLRTLSDLRNGTFNF 380 >UniRef50_C5WZE3 Putative uncharacterized protein Sb01g019910 n=1 Tax=Sorghum bicolor RepID=C5WZE3_SORBI Length = 704 Score = 89.4 bits (220), Expect = 2e-16, Method: Composition-based stats. Identities = 35/237 (14%), Positives = 66/237 (27%), Gaps = 33/237 (13%) Query: 202 EPAQLLEEERLRKEIAELRAKIERVPFIDTFDLRYKNYEKRPDPSSQAVMFCLMDVSGSM 261 L E E ++ + + F LR + + P + ++DVS SM Sbjct: 190 YEIPPLLEITTYTEFPAIQESVAQEQFAILIHLRVPTWVRTRAP---LDLVTVLDVSRSM 246 Query: 262 DQSTKDMAKRFYILLYLFLSRTYKNVEVVYIRHHTQA----------KEVDEHEFFYSQE 311 + KR + L + + V + + ++ + Sbjct: 247 SGPKLALLKRAMRFVIENLEPSDRLSVVAFSSSACRLFPLRKMTAFGQQQSQQAVDSLVA 306 Query: 312 TGGTIVSSALKLMDEVVKERYNPAQWNIY--AAQASDGDNW------------ADDSPLC 357 GGT ++ L+ VV++R N SDG + D +PL Sbjct: 307 DGGTNIAEGLRKAARVVEDR---QARNPVCSIILLSDGVDSHNLPPRDGSAPEPDYAPLV 363 Query: 358 HEILAKKLLPVVRYYSYIEITRRAHQT---LWREYEHLQSTFDNFAMQHIRDQDDIY 411 + V +++ H + S +F D Sbjct: 364 PRSILPGSEHHVPIHAFGLGMDHDHDHDSRAMHAVAQMSSGTFSFIDMVGSSIQDAL 420 >UniRef50_Q9FF49 Retroelement pol polyprotein-like n=34 Tax=Magnoliophyta RepID=Q9FF49_ARATH Length = 704 Score = 89.4 bits (220), Expect = 2e-16, Method: Composition-based stats. Identities = 24/175 (13%), Positives = 47/175 (26%), Gaps = 26/175 (14%) Query: 247 SQAVMFCLMDVSGSMDQSTKDMAKRFYILLYLFLSRTYKNVEVVYIRHHTQA-------- 298 + + ++DVSGSM + + KR + L + + + + Sbjct: 249 APVDLVTVLDVSGSMAGTKLALLKRAMGFVIQNLGPFDRLSVISFSSTARRNFPLRLMTE 308 Query: 299 --KEVDEHEFFYSQETGGTIVSSALKL-MDEVVKERYNPAQWNIYAAQASDGDNW----- 350 K+ GGT ++ LK ++ R+ +I SDG + Sbjct: 309 TGKQEALQAVNSLVSNGGTNIAEGLKKGARVLIDRRFKNPVSSIVLL--SDGQDTYTMTS 366 Query: 351 -----ADDSPLCHEILAKKLLPVVRYYSYIEITRRAHQTLWREYEHLQSTFDNFA 400 D V + + +L +F Sbjct: 367 PNGSRGTDYKALLPKEINGNRIPVHAFGFGA---DHDASLMHSIAENSGGTFSFI 418 >UniRef50_Q21PJ3 von Willebrand factor, type A n=1 Tax=Saccharophagus degradans 2-40 RepID=Q21PJ3_SACD2 Length = 763 Score = 89.4 bits (220), Expect = 2e-16, Method: Composition-based stats. Identities = 27/188 (14%), Positives = 55/188 (29%), Gaps = 26/188 (13%) Query: 251 MFCLMDVSGSMDQSTKDMAKRFYILLYLFLSRTYKNVEVVYIRHHTQAKEVDEHE----- 305 + ++D SGSM ++ AKR L+ + + + ++ + Sbjct: 388 IVFVVDTSGSMQGTSIQQAKRSLQFALRGLNPSDTFNIIEFDTSFSRFRSRPVSATASNV 447 Query: 306 ------FFYSQETGGTIVSSALKLMDEVVK-------ERYNPAQWNIYAAQASDGDNWAD 352 GT + +AL+ + + E + +DG A Sbjct: 448 QAAVSWVNNLNADNGTEMYAALEEAFDQLASINPNGTENSKSSNNLQQVVFITDG---AV 504 Query: 353 DSPLCHEILAKKLLPVVRYYSYIEITRRAHQTLWREYEHLQSTFDNFAMQHIRDQDDIYP 412 + L + L R ++ I + R+ + F I D ++ Sbjct: 505 GNEQALLSLIHRRLNNARLFTVA-IGSAPNSYFMRKAAQFGKGANVF----IGDTAEVTH 559 Query: 413 VFRELFHK 420 L K Sbjct: 560 KMNALLSK 567 >UniRef50_Q114A2 von Willebrand factor, type A n=3 Tax=Cyanobacteria RepID=Q114A2_TRIEI Length = 1204 Score = 89.1 bits (219), Expect = 3e-16, Method: Composition-based stats. Identities = 25/221 (11%), Positives = 67/221 (30%), Gaps = 24/221 (10%) Query: 187 ELHALEENLAIISNSEPAQLLEEERLRKEIAELRAKIERVPFIDTFDLRYKNYEKRPDPS 246 + L + + Q E + + L F++ + R E +P Sbjct: 573 DTVNLPLVVDSYLQEKQKQEAREAQAKAAPERLLEPE----FVENPEQRLPEPEFVENPE 628 Query: 247 SQAVMFCLMDVSGSMDQS---TKDMAKRFYILLYLFLSRTYKNVEVVYIRHHTQAKEVDE 303 ++ + L+D S SM + + + VE+ I +++ + V + Sbjct: 629 NRCPIILLLDTSYSMSGEAITELNQGVKIFQASVKEDELASLRVEIAVITFNSEIEVVQD 688 Query: 304 HEF------FYSQETGGTIVSSALKLMDEVVKER------YNPAQWNIYAAQASDGDNWA 351 + +G T + A++ E++++R + + + +DG Sbjct: 689 FVTVDKFIPKTLEASGVTHMGKAIEKALELLEKRKQDYKNSDIQYYRPWIFLITDGQPTD 748 Query: 352 DDSPLCHEILAKKLLPVVRYYSYIEITRRAHQTLWREYEHL 392 +I + + +++ + Sbjct: 749 TWQDAAKKIEEAETNRKLLFFAVGV-----RDADMETLSEI 784 >UniRef50_Q7ULL3 Putative uncharacterized protein n=1 Tax=Rhodopirellula baltica RepID=Q7ULL3_RHOBA Length = 484 Score = 89.1 bits (219), Expect = 3e-16, Method: Composition-based stats. Identities = 34/206 (16%), Positives = 55/206 (26%), Gaps = 13/206 (6%) Query: 224 ERVPFIDTFDLRYKNYEK-RPDPSSQAVMFCLMDVSGSMDQSTKDMAKRFYILLYLFLSR 282 E+ L + P + +D SGSM AK LS Sbjct: 61 EKQTNHLRIALTGFELKSAEERPPVNVCLV--LDHSGSMSGQKLARAKEAAEAAIDRLSD 118 Query: 283 TYKNVEVVYIRHHT--------QAKEVDEHEFFYSQETGGTIVSSALKLMDEVVKERYNP 334 V+Y + T + + + Q T + + + V++ Sbjct: 119 DDIVSVVLYDSNVTVLVPATKATDRSSIKQKIRGIQAGSSTALFAGVSKGAAEVRKFLAD 178 Query: 335 AQWNIYAAQASDG-DNWADDSPLCHEILAKKLLPVVRYYSYIEITRRAHQTLWREYEHLQ 393 Q N SDG N SP E L + L+ S + + ++ L + Sbjct: 179 EQVNRVIL-LSDGLANVGPKSPQELEGLGRSLMKEAISVSTLGLGSGYNEDLMVALASVG 237 Query: 394 STFDNFAMQHIRDQDDIYPVFRELFH 419 F F L Sbjct: 238 GGNHAFIEDADSLVSVFNQEFDGLLS 263 >UniRef50_UPI00006CAF43 von Willebrand factor type A domain containing protein n=1 Tax=Tetrahymena thermophila RepID=UPI00006CAF43 Length = 631 Score = 88.7 bits (218), Expect = 4e-16, Method: Composition-based stats. Identities = 26/187 (13%), Positives = 61/187 (32%), Gaps = 19/187 (10%) Query: 228 FIDTFDLRYKNYEKRPDPSSQAVMFCLMDVSGSMDQSTKDMAKRFYILLYLFLSRTYKNV 287 F+ +L K +++ + + C++D SGSM + ++ L ++ + Sbjct: 123 FVCGVNLHVKQPKEQSER-VPMDLICVIDDSGSMSGKKAQLVRKSLKYLLKIMNENDRIC 181 Query: 288 EVV----------YIRHHTQAKEVDEHEFFYSQETGGTIVSSALKLMDEVVKERYNPAQW 337 + ++R++ + K + G T + + ++ ++K R Sbjct: 182 LISFDSVEKILTPFLRNNLENKSELKKAIKNIVGRGSTNIEAGMEAGLWMIKNRKEKNPI 241 Query: 338 NIYAAQASDGDNWADDSPLCHEILAK----KLLPVVRYYSYIEITRRAHQTLWREYEHLQ 393 SDG + + L + L + + +V Y Y T R Sbjct: 242 TC-MFLLSDGQDDSPQVDLRVQKLIQSYDIQDTFIVNTYGYGA---DHDATQMRNIAETH 297 Query: 394 STFDNFA 400 + Sbjct: 298 KGGYYYI 304 >UniRef50_Q23AA2 Putative uncharacterized protein n=1 Tax=Tetrahymena thermophila SB210 RepID=Q23AA2_TETTH Length = 968 Score = 88.7 bits (218), Expect = 4e-16, Method: Composition-based stats. Identities = 34/207 (16%), Positives = 66/207 (31%), Gaps = 25/207 (12%) Query: 222 KIERVPFIDTFDLRYKNYEKRPDPSSQAVMFCLMDVSGSMDQSTKDMAKRFYILLYLFLS 281 + +R+ + R K S + ++D SGSM K +A + S Sbjct: 2 EEKRIYYKIPLK-RVKATTTTEKGGSNLHIVGIIDASGSMSSWWKWIA-------EFWNS 53 Query: 282 RTYKNVEVVYIRHHTQAKEVDEHEFF---YSQETGGTIVSSALKLMDEVVKERYNPAQWN 338 + + I A+ + + G T + A K+ + V+ P + Sbjct: 54 ESIPKENLHTITFDGTARHCQSNVLSTRIHDHGGGMTAIPEAFKMFETVLD--SIPVNES 111 Query: 339 IYAAQASDGDNWADDSPLCHEILAKKLL-----PVVRYYSYIEITRRAHQTLWREYEHLQ 393 + A SDG D++ E KKL + + + R + Sbjct: 112 VTAIFISDGQ---DNNLNTLEERMKKLKGNHENRKINFICLGIESGFPTFLSMRLRQLYH 168 Query: 394 STFDNFAMQHIRDQDDIYPVFRELFHK 420 +N ++ + Y + F+K Sbjct: 169 QGDENIPALYLIE----YVSEKAFFNK 191 >UniRef50_B5ZY26 Vault protein inter-alpha-trypsin domain protein n=11 Tax=Rhizobium RepID=B5ZY26_RHILW Length = 794 Score = 88.3 bits (217), Expect = 5e-16, Method: Composition-based stats. Identities = 30/230 (13%), Positives = 61/230 (26%), Gaps = 21/230 (9%) Query: 208 EEERLRKEIAELRAKIERVPFIDTFDLRYKNYEKRPDPSSQAVMFCLMDVSGSMDQSTKD 267 + + A L +++ + P ++ + ++D SGSM + + Sbjct: 313 QAAPGKLPSAGLFREVKDGKTCLLAFVTPPTAPDAAAPPAKREVVFVIDNSGSMSGPSIE 372 Query: 268 MAKRFYILLYLFLSRTYKNVEVVY-----------IRHHTQAKEVDEHEFFYSQETGGTI 316 AK+ L L+ + + + + +E GGT Sbjct: 373 QAKQSLALAISRLTPNDRFNVIRFDDTMTDYFKGLVAATPDNREKAIAYVRGLPADGGTE 432 Query: 317 VSSALKLMDEVVKERYNPAQWNIYAAQASDGDNWADDSPLCHEILAKKLLPVVRYYSYIE 376 + AL+ + +DG A + R ++ Sbjct: 433 MLPALEDALR--NQGPVATGALRQVVFLTDG---AIGNEQQLFQEITANRGDARVFTVG- 486 Query: 377 ITRRAHQTLWREYEHLQSTFDNFAMQHIRDQDDIYPVFRELFHKQNATAK 426 I + + + I D + ELF K A Sbjct: 487 IGSAPNTYFMTKAAEIGRGTF----TQIGSTDQVASRMGELFAKLQNPAM 532 >UniRef50_D0KVI6 Vault protein inter-alpha-trypsin domain protein n=1 Tax=Halothiobacillus neapolitanus c2 RepID=D0KVI6_HALNC Length = 671 Score = 88.3 bits (217), Expect = 5e-16, Method: Composition-based stats. Identities = 31/216 (14%), Positives = 56/216 (25%), Gaps = 25/216 (11%) Query: 213 RKEIAELRAKIERVPFIDTFDLRYKNYEKRPDPSSQAVMFCLMDVSGSMDQSTKDMAKRF 272 + + + + K + ++DVSGSM + A Sbjct: 277 KINSGLMTYEWNGEHYFLMMAQPPKRVAPTEV--MKREYLFVVDVSGSMYGFPLNTASDL 334 Query: 273 YILLYLFLSRTYKNVEVVY-----IRHHTQAKEVDEH------EFFYSQETGGTIVSSAL 321 L L + + + T + E+ Q GGT + AL Sbjct: 335 MRELLSSLKPQETFNILFFSGGSRVLSPTPLQATPENLQRAMTMMRSIQGGGGTELLPAL 394 Query: 322 KLMDEVVKERYNPAQWNIYAAQASDGDNWADDSPLCHEILAKKLLPVVRYYSYIEITRRA 381 K + + +DG D L K+ L +++ I Sbjct: 395 KTAFAMPRTE----DTARSIVVITDG---YVDVERQAYDLIKQNLNSTNLFAFG-IGSSV 446 Query: 382 HQTLWREYEHLQSTFDNFAMQHIRDQDDIYPVFREL 417 ++ L H I +D+ V Sbjct: 447 NRYLMESMAHAGQGEP----FIITGPNDVPGVGARF 478 >UniRef50_A6CIG8 Putative uncharacterized protein n=1 Tax=Bacillus sp. SG-1 RepID=A6CIG8_9BACI Length = 931 Score = 88.3 bits (217), Expect = 5e-16, Method: Composition-based stats. Identities = 29/177 (16%), Positives = 49/177 (27%), Gaps = 20/177 (11%) Query: 230 DTFDLRYKNYEKRPDPSSQAVMFCLMDVSGSMDQSTKDMAKRFYILLYLFLSRTYKNVEV 289 DL+ K + PS M ++D SGSM +AK I L + Sbjct: 395 VDMDLKGK----KELPSLG--MVIVLDRSGSMAGYKIQLAKEAAIRSAELLREKDTLGFI 448 Query: 290 VY--------IRHHTQAKEVDEHEFFYSQETGGTIVSSALKLMDEVVKERYNPAQWNIYA 341 + + KE + GGT + +L+L E + + Sbjct: 449 AFDDRPWQIIDTEPIKDKEKVIEKINGLTSGGGTNIFPSLELAYEQLT---PLELQRKHI 505 Query: 342 AQASDGDNWADDSPLCHEILAKKLLPVVRYYSYIEITRRAHQTLWREYEHLQSTFDN 398 +DG + SP + + + + L E Sbjct: 506 ILLTDGQ--SATSPDYLTTIQEGKENNITLSTVAIGEGS-DSVLLEELSDEGGGRFY 559 >UniRef50_Q2SQR4 Uncharacterized protein containing a von Willebrand factor type A (VWA) domain n=1 Tax=Hahella chejuensis KCTC 2396 RepID=Q2SQR4_HAHCH Length = 733 Score = 88.3 bits (217), Expect = 6e-16, Method: Composition-based stats. Identities = 23/181 (12%), Positives = 52/181 (28%), Gaps = 20/181 (11%) Query: 251 MFCLMDVSGSMDQSTKDMAKRFYILLYLFLSRTYKNVEVVYIRHHT-----------QAK 299 + ++D SGSM+ + A+ + L+ + + + H +A Sbjct: 358 LIWVVDTSGSMEGVSIQQARDAVLQALDTLTPRDRFNVIEFNSHARKLFPQAVPAQERAL 417 Query: 300 EVDEHEFFYSQETGGTIVSSALKLMDEVVKERYNPAQWNIYAAQASDGDNWADDSPLCHE 359 + + GGT ++ AL P + +DG + + L Sbjct: 418 QQARRFVRGLKADGGTEIAEALDRALSDAA----PEGYVRQVVFLTDG---SVGNELALF 470 Query: 360 ILAKKLLPVVRYYSYIEITRRAHQTLWREYEHLQSTFDNFAMQHIRDQDDIYPVFRELFH 419 + L R ++ I ++ R+ + + D Sbjct: 471 KQIDQQLGDSRLFTVG-IGPSPNRFFMRKAAQFGRGAYSHINDT-AEVSDKIAELTAALR 528 Query: 420 K 420 + Sbjct: 529 Q 529 >UniRef50_B0TQ23 LPXTG-motif cell wall anchor domain n=1 Tax=Shewanella halifaxensis HAW-EB4 RepID=B0TQ23_SHEHH Length = 850 Score = 87.9 bits (216), Expect = 6e-16, Method: Composition-based stats. Identities = 26/206 (12%), Positives = 53/206 (25%), Gaps = 37/206 (17%) Query: 251 MFCLMDVSGSMDQSTKDMAKRFYILLYLFLSRTYKNVEVVY-------IRHHTQAKEVD- 302 + ++D SGSM AK L + + RH A ++ Sbjct: 457 LVLVIDTSGSMSGDAIIQAKSALKYALAGLRPQDSFNVLQFNSTVERWSRHVMPATAINL 516 Query: 303 ---EHEFFYSQETGGTIVSSALKLMDEVVKERY--------NPAQWN------------- 338 ++ Q GGT +S AL + + ++ Sbjct: 517 GRAQNYINGLQADGGTEMSLALDAALTKLDNDRGHNSKPVHDDDRYQSSNETLEQSAATP 576 Query: 339 -IYAAQASDGDNWADDSPLCHEILAKKLLPVVRYYSYIEITRRAHQTLWREYEHLQSTFD 397 +DG A + K L R ++ I + + + Sbjct: 577 LRQVLFITDG---AVANESRLFEQIKNQLGESRLFTIG-IGSAPNAHFMQRAAEVGRGTY 632 Query: 398 NFAMQHIRDQDDIYPVFRELFHKQNA 423 + + + + ++ Q Sbjct: 633 TYIGKLDEVNQKVVSLLEKIEKPQVT 658 >UniRef50_A3W9L9 Putative uncharacterized protein n=2 Tax=Erythrobacter RepID=A3W9L9_9SPHN Length = 740 Score = 87.9 bits (216), Expect = 7e-16, Method: Composition-based stats. Identities = 25/181 (13%), Positives = 49/181 (27%), Gaps = 23/181 (12%) Query: 251 MFCLMDVSGSMDQSTKDMAKRFYILLYLFLSRTYKNVEVVY----------IRHHTQAKE 300 M ++D SGSM + A+R + L + + + + + Sbjct: 346 MIFVIDNSGSMAGESMPAARRSLLYALETLRPQDRFNVIRFDDTMTELFASAVQASDSNI 405 Query: 301 VDEHEFFY-SQETGGTIVSSALKLMDEVVKERYNPAQWNIYAAQASDGDNWADDSPLCHE 359 F + GGT + AL+ P + +DG A + Sbjct: 406 AAAKTFTHNLMANGGTEMLPALRAAL----RDRAPDERVRQVIFLTDG---ALSNEADMM 458 Query: 360 ILAKKLLPVVRYYSYIEITRRAHQTLWREYEHLQSTFDNFAMQHIRDQDDIYPVFRELFH 419 + R + I + L R H+ ++ + L Sbjct: 459 EEINRNRKDSRVFMVG-IGSAPNTYLMRRMAEAGRGTF----THVGMGEEAEDQMQRLLD 513 Query: 420 K 420 + Sbjct: 514 R 514 >UniRef50_Q2BJ22 Putative uncharacterized protein n=1 Tax=Neptuniibacter caesariensis RepID=Q2BJ22_9GAMM Length = 445 Score = 87.5 bits (215), Expect = 8e-16, Method: Composition-based stats. Identities = 36/213 (16%), Positives = 59/213 (27%), Gaps = 21/213 (9%) Query: 223 IERVPFIDTFDLRYKNYEKRPDPSSQAVMFCLMDVSGSMDQSTKDMAKRFYILLYLFLSR 282 R L E+ A + ++D SGSM AK I+ LS+ Sbjct: 45 ETRHKAFIKISLEGHKLEQTQARI-PANIAIVLDKSGSMQGDKLFRAKEAAIMAINRLSQ 103 Query: 283 T--------YKNVEVVYIRHHTQAKEVDEHEFFYSQETGGTIVSSALKLMDEVVKERYNP 334 V VV Q G T + + + +++ + Sbjct: 104 NDIVSVVSYDSRVNVVVPATKVSDTNTIARAINRIQANGNTALFAGVSKGANELRKFLDL 163 Query: 335 AQWNIYAAQASDG-DNWADDSPLCHEIL---AKKLLPVVRYYSYIEITRRAHQTLWREYE 390 + N SDG N +P L K V ++ L + Sbjct: 164 NKVNRVIL-LSDGLANIGPSTPNELGKLGLSLAKEGMSVTTIGLGLG---YNEDLMTQLA 219 Query: 391 HLQSTFDNFAMQHIRDQDDIYPVFRELFHKQNA 423 F + + DD+ VF+ F + Sbjct: 220 GFSDGNHAF----VENADDLARVFQYEFGDVLS 248 >UniRef50_Q7UNM0 Putative uncharacterized protein n=1 Tax=Rhodopirellula baltica RepID=Q7UNM0_RHOBA Length = 900 Score = 87.1 bits (214), Expect = 1e-15, Method: Composition-based stats. Identities = 25/193 (12%), Positives = 53/193 (27%), Gaps = 23/193 (11%) Query: 241 KRPDPSSQAVMFCLMDVSGSMDQSTKDMAKRFYILLYLFLSRTYKNVEVVYIRH------ 294 ++ M ++D SGSM ++AK L + + Sbjct: 455 EKEREKPSLAMMLVIDKSGSMGGQKIELAKDAAQAAVELLGPKDAIGVIAFDGDSYTVSE 514 Query: 295 --HTQAKEVDEHEFFYSQETGGTIVSSALKLMDEVVKERYNPAQWNIYAAQASDGDNWAD 352 T + + +GGT + A+ E + + +DG + Sbjct: 515 LRSTSDRGAISDAISTIEASGGTNMYPAMADAYEAL---LGATAKLKHVILMTDGVSSPG 571 Query: 353 DSPLCHEILAKKLLPVVRYYSYIEITRRAHQTLWREYEHLQSTFDNFAMQHIRDQDDIYP 412 D ++ + + + + + L E + F D Sbjct: 572 DFQGVAGDMSAS-RITLSTVALGQGSS---EDLLEELAQIGGGRYYF--------CDDPQ 619 Query: 413 VFRELFHKQNATA 425 ++F K+ A Sbjct: 620 SVPQVFAKETVEA 632 >UniRef50_B9GVZ4 Predicted protein n=4 Tax=rosids RepID=B9GVZ4_POPTR Length = 757 Score = 87.1 bits (214), Expect = 1e-15, Method: Composition-based stats. Identities = 29/203 (14%), Positives = 60/203 (29%), Gaps = 32/203 (15%) Query: 240 EKRPDPSSQAVMFCLMDVSGSMDQSTKDMAKRFYILLYLFLSRTYKNVEVVYI----RHH 295 + + + + ++D+SGSM + AK + L+ + + Sbjct: 322 NNQSMKAFRKEVIFIIDISGSMKGGPFESAKNGLLSSLQKLNPEDSFNIIAFKMDTYLFS 381 Query: 296 TQAKEVDEHEF--------FYSQETGGTIVSSALKLMDEVVKE--RYNPAQWNIYAAQAS 345 + ++ E GGT + LK +++ E P + Sbjct: 382 SVMEQATEEAIIEATRWLNDKLTADGGTNILGPLKQAIKLLAETTNSIP-----VIFLIT 436 Query: 346 DGDNWADDSPLCHEILAKKLLP-----VVRYYSYIEITRRAHQTLWREYEHLQSTFDNFA 400 DG A + K LP +R ++ I + R + Sbjct: 437 DG---AVEDERDICNFVKGYLPSGGSISLRISTFG-IGTYCNHHFLRMLAQIGRGHF--- 489 Query: 401 MQHIRDQDDIYPVFRELFHKQNA 423 D D + ++LF ++ Sbjct: 490 -DTAYDADSVDFRMQKLFTTASS 511 >UniRef50_D0LTR8 von Willebrand factor type A n=1 Tax=Haliangium ochraceum DSM 14365 RepID=D0LTR8_HALO1 Length = 903 Score = 87.1 bits (214), Expect = 1e-15, Method: Composition-based stats. Identities = 32/201 (15%), Positives = 55/201 (27%), Gaps = 26/201 (12%) Query: 234 LRYKNYEKRPDPSSQAVMFCLMDVSGSMDQSTKDMAKRFYILLYLFLSRTYKNVEVVYIR 293 +R+ + ++R P + +D SGSM + AK LS + V + Sbjct: 445 VRFDSEKQREQPHVAIALV--VDRSGSMSGLKIEAAKESARATAEVLSPSDLITVVAFDN 502 Query: 294 HHTQAKEVDE--------HEFFYSQETGGTIVSSALKLMDEVVKERYNPAQWNIYAAQAS 345 T + + Q GGT + AL+ E+++ + S Sbjct: 503 QPTTIVRLQRASNRMRIATDIARLQAGGGTNIYPALREAYEILQGANAKVKH---VIVLS 559 Query: 346 DGDNWADDSPLCHEILAKKLLPVVRYYSYIEITRRAHQTLWREYEHLQSTFDNFAMQHIR 405 DG D + V + R L Sbjct: 560 DGQ-APYDGIADLCQEMRSARITVSAVGIGDADRN----LLNLITDNGDGRLY------- 607 Query: 406 DQDDIYPVFRELFHKQNATAK 426 D +F K+ A+ Sbjct: 608 -MTDDLAALPRIFMKETTEAQ 627 >UniRef50_Q3A188 von Willebrand factor type A domain protein n=1 Tax=Pelobacter carbinolicus DSM 2380 RepID=Q3A188_PELCD Length = 442 Score = 86.7 bits (213), Expect = 1e-15, Method: Composition-based stats. Identities = 29/174 (16%), Positives = 46/174 (26%), Gaps = 14/174 (8%) Query: 237 KNYEKRPDPSSQAVMFCLMDVSGSMDQSTKDMAKRFYILLYLFLSRTYKNVEVVY----- 291 + P + +D SGSM + A+ I LS VVY Sbjct: 56 RAPRTAQRPPVNLALV--LDRSGSMSGNKIAKAREAAIEAVRRLSDGDLFSLVVYDDSVE 113 Query: 292 ---IRHHTQAKEVDEHEFFYSQETGGTIVSSALKLMDEVVKERYNPAQWNIYAAQASDG- 347 E + G T + A+ V++ + N SDG Sbjct: 114 TLVPAQPVSDIGDIEARIRRIRPGGSTALFGAVSQGAAEVRKHSDAPYVNRVVL-LSDGL 172 Query: 348 DNWADDSPLCHEIL-AKKLLPVVRYYSYIEITRRAHQTLWREYEHLQSTFDNFA 400 N P L A L + + + ++ L + F Sbjct: 173 ANVGPSRPADLARLGAALLKEGISVTTVG-VGTDFNEDLMTQLAERSDGNHYFV 225 >UniRef50_B1KDL1 LPXTG-motif cell wall anchor domain protein n=1 Tax=Shewanella woodyi ATCC 51908 RepID=B1KDL1_SHEWM Length = 739 Score = 86.7 bits (213), Expect = 1e-15, Method: Composition-based stats. Identities = 26/188 (13%), Positives = 53/188 (28%), Gaps = 20/188 (10%) Query: 251 MFCLMDVSGSMDQSTKDMAKRFYILLYLFLSRTYKNVEVVYIRH---------HTQAKEV 301 + ++D SGSM ++ AKR L + + + AK + Sbjct: 365 LILVIDTSGSMSGASIAQAKRALNYALAGLKAKDTFNVIEFNSNVGSLSPYSLPATAKNI 424 Query: 302 --DEHEFFYSQETGGTIVSSALKLMDEV-VKERYNPAQWNIYAAQASDGDNWADDSPLCH 358 + GGT + AL + + ++ +DG + Sbjct: 425 GLANQYVRSLKANGGTEMQLALNAALDKGTETEALGSERLRQVLFMTDG---SVGDEQSL 481 Query: 359 EILAKKLLPVVRYYSYIEITRRAHQTLWREYEHLQSTFDNFAMQHIRDQDDIYPVFRELF 418 L K+ + R ++ I + R + I D++ L Sbjct: 482 FHLIKQKIGESRLFTLG-IGSAPNSHFMRRAAEFGRGTFTY----IGKLDEVQSKIESLL 536 Query: 419 HKQNATAK 426 ++ Sbjct: 537 YQIERPQL 544 >UniRef50_A9F1H2 Family membership n=1 Tax=Sorangium cellulosum 'So ce 56' RepID=A9F1H2_SORC5 Length = 607 Score = 86.7 bits (213), Expect = 1e-15, Method: Composition-based stats. Identities = 28/220 (12%), Positives = 48/220 (21%), Gaps = 15/220 (6%) Query: 212 LRKEIAELRAKIERVPFIDTFDLRYKNYEKRPDPSSQAVMFCLMDVSGSMDQSTKDMAKR 271 E + ++ E + ++D SGSM +A Sbjct: 2 FNARPDRPWLPAEPSERLLRVEITVPRPEGGQARK-PVHLSLVIDRSGSMSGEKLRLALE 60 Query: 272 FYILLYLFLSRTYKNVEVVYIRH------HTQ----AKEVDEHEFFYSQETGGTIVSSAL 321 L + V + T A+ E G T + Sbjct: 61 AARQAIRTLQPGDRFSVVTFDHQVEVPIPSTDATPGARLRAEAALDTVIARGNTDLGGGW 120 Query: 322 KLMDEVVKERYNPAQWNIYAAQASDGD-NWADDSPLCHEILAK-KLLPVVRYYSYIEITR 379 V +DG N SP A+ + L V + Sbjct: 121 LRGCAEVGAHLPEDAIGRVLL-LTDGQANHGITSPDELTSRARSQRLRRVTTSTIGLGEG 179 Query: 380 RAHQTLWREYEHLQSTFDNFAMQHIRDQDDIYPVFRELFH 419 ++ L FA + + E+ Sbjct: 180 -FNEFLLGRLSEEGGGNFYFAARADELPGFVGREIGEVLS 218 >UniRef50_B0SHY6 Anti-sigma factor antagonist n=2 Tax=Leptospira biflexa serovar Patoc RepID=B0SHY6_LEPBA Length = 550 Score = 86.7 bits (213), Expect = 2e-15, Method: Composition-based stats. Identities = 28/202 (13%), Positives = 59/202 (29%), Gaps = 23/202 (11%) Query: 234 LRYKNYEKRPDPSSQAVMFCL-MDVSGSMDQSTKDMAKRFYILLYLFLSRTYKNVEVVY- 291 LR++ + ++ L +D S SM + L +L+R V Y Sbjct: 26 LRFRTPANPNVEERKPLVIGLAIDKSWSMKGEKMEAVIDASCALVNWLTRHDAVSIVAYS 85 Query: 292 --------IRHHTQAKEVDEHEFFYSQETGGTIVSSALKLMDEVVKERYNPAQWNIYAAQ 343 + H T+ V + Q T +S + + + P + Sbjct: 86 ADVQLIQPVTHLTEKVSVT-DKIRNIQVATSTNLSGGWLSALKSLNQSKIPNAYKRVLL- 143 Query: 344 ASDGDNWAD--DSPLCHEILAKKLLPVVRYYSYIEITRRAHQTLWREYEHLQSTFDNFAM 401 +DG+ + D I A L + + + ++ + E + Sbjct: 144 LTDGNPTSGIKDKEALVTIAADHLSMGISTTTIG-VGNDFNEEMLVEIAKAGGGNFYYI- 201 Query: 402 QHIRDQDDIYPVFRELFHKQNA 423 D ++F ++ Sbjct: 202 -------DNPENASDIFFEEFG 216 >UniRef50_A3PUP3 von Willebrand factor, type A n=1 Tax=Mycobacterium sp. JLS RepID=A3PUP3_MYCSJ Length = 233 Score = 86.7 bits (213), Expect = 2e-15, Method: Composition-based stats. Identities = 33/160 (20%), Positives = 57/160 (35%), Gaps = 16/160 (10%) Query: 242 RPDPSSQAVMFCLMDVSGSMDQSTKDMAKRFYILLYLFLSRTY---KNVEVVYIRHHTQA 298 +P + L DVSGSM +R + +L K VEV + T A Sbjct: 12 DANPDPRVACVVLADVSGSMQGEPIAALERGFAAFTRYLQNEVLASKRVEVAVVTFGTVA 71 Query: 299 KE-VDEHEFFYSQ-----ETGGTIVSSALKLMDEVVKER---YNPAQ---WNIYAAQASD 346 V E Q +G T +++ + L +++++R Y A + + +D Sbjct: 72 TVLVPMQEARTLQPVAFTASGTTNMAAGIHLALDILEDRKHAYKAAGLQYYRPWILLLTD 131 Query: 347 GD-NWADDSPLCHEILAKKLLPVVRYYSYIEITRRAHQTL 385 G N + A + V ++ R +Q L Sbjct: 132 GKPNLDGFDEAVARLNAVESARGVTVFAVGAGPRVDYQQL 171 >UniRef50_Q8YW34 All1782 protein n=5 Tax=Cyanobacteria RepID=Q8YW34_ANASP Length = 615 Score = 86.7 bits (213), Expect = 2e-15, Method: Composition-based stats. Identities = 31/189 (16%), Positives = 57/189 (30%), Gaps = 14/189 (7%) Query: 234 LRYKNYEKRPDPSSQAVMFCLMDVSGSMDQSTKDMAKRFYILLYLFLSRTYKNVEVVY-- 291 LR++ E P + ++D SGSM + A + + L VVY Sbjct: 25 LRFRA-EIPESPRRNLNLSLVIDRSGSMAGAALHHALKAAESVVDQLEPKDILSVVVYDD 83 Query: 292 ------IRHHTQAKEVDEHEFFYSQETGGTIVSSALKLMDEVVKERYNPAQWNIYAAQAS 345 K + + G T +S E VK + +P + N + Sbjct: 84 AVDTVVPPQPVTDKPALKKSIRQVRAGGITNLSGGWLKGCEYVKHQLDPQKINRVLL-LT 142 Query: 346 DGD-NWADDSPLCHEILAKKLL-PVVRYYSYIEITRRAHQTLWREYEHLQSTFDNFAMQH 403 DG N P + + + + ++ L + F Q Sbjct: 143 DGHANMGIQDPKILTATSTQKAEEGITTTTLGFAQG-FNEDLLIGMARAANGNFYFI-QS 200 Query: 404 IRDQDDIYP 412 I + +++ Sbjct: 201 IDEAAEVFS 209 >UniRef50_A4YGU7 von Willebrand factor, type A n=12 Tax=Sulfolobaceae RepID=A4YGU7_METS5 Length = 383 Score = 86.4 bits (212), Expect = 2e-15, Method: Composition-based stats. Identities = 28/164 (17%), Positives = 56/164 (34%), Gaps = 12/164 (7%) Query: 250 VMFCLMDVSGSMDQSTKDMAKRFYILLYLFLSRTYKNVEVVYIRHHTQAKEVDE-----H 304 L+D SGSMD + AK+ I L + + K V + +E + Sbjct: 39 HYIVLLDTSGSMDGLKIESAKKGAIELLKRIPQGNKVSFVTFSSRVNIVREFVDPEDLTA 98 Query: 305 EFFYSQETGGTIVSSALKLMDEVVKERYNPAQWNIYAAQASDGDNWADDSPLCHEILAKK 364 E G T +AL + + P+ Y +DG+ D + ++ +A Sbjct: 99 EISSLSAGGQTAFFTALLTAFNLHNKHGIPS----YVILLTDGNPTDDTNVETYKRIA-- 152 Query: 365 LLPVVRYYSYIEITRRAHQTLWREYEHLQSTFDNFAMQHIRDQD 408 + V+ S+ + ++T+ + + + Sbjct: 153 IPNGVQTISFG-LGDDYNETILKSLADRSGGVFYHVNDAMEIPE 195 >UniRef50_Q2QZH3 Os11g0687100 protein n=79 Tax=Eukaryota RepID=Q2QZH3_ORYSJ Length = 633 Score = 86.4 bits (212), Expect = 2e-15, Method: Composition-based stats. Identities = 25/222 (11%), Positives = 64/222 (28%), Gaps = 36/222 (16%) Query: 214 KEIAELRAKIERVPFIDTFDLRYKNYEKRPDPSS-------QAVMFCLMDVSGSMDQS-- 264 + I + ++ + P + + ++DVSGSM+ Sbjct: 28 APVKVSTTPIFPTIPRGQTNKDFQVLLRVEAPPAADLNSHVPLDVVAVLDVSGSMNDPVA 87 Query: 265 -----------TKDMAKRFYILLYLFLSRTYKNVEVVY------------IRHHTQAKEV 301 D+ K + L+ + V + + + + Sbjct: 88 AASPKSNLQGSRLDVLKASMKFVIRKLADGDRLSIVAFNDGPVKEYSSGLLDVSGDGRSI 147 Query: 302 DEHEFFYSQETGGTIVSSALKLMDEVVKERYNPAQWNI-YAAQASDGDNWADDSPLCHEI 360 + Q GGT + AL+ +++ ER ++ ++ + +DGD+ + Sbjct: 148 AGKKIDRLQARGGTALMPALEEAVKILDERQGSSRNHVGFILLLTDGDDTTGF-RWTRDA 206 Query: 361 LAKKLLPVVRYYSYIEITRRAHQTLWREYEHLQSTFDNFAMQ 402 + + +++ + L +F Sbjct: 207 IHGAVFKY-PVHTFGLGASHDPEALL-HIAQGSRGTYSFVDD 246 >UniRef50_A6DLI0 Putative uncharacterized protein n=1 Tax=Lentisphaera araneosa HTCC2155 RepID=A6DLI0_9BACT Length = 833 Score = 86.4 bits (212), Expect = 2e-15, Method: Composition-based stats. Identities = 24/193 (12%), Positives = 54/193 (27%), Gaps = 23/193 (11%) Query: 241 KRPDPSSQAVMFCLMDVSGSMDQSTKDMAKRFYILLYLFLSRTYKNVEVVYIRHHTQAKE 300 ++ + ++D SGSM+ +A+ LS + + + + Sbjct: 404 EKEKEQPSLALVLVIDKSGSMNGQPIVLAREASKAAAELLSSRDQVGVIAFDGSAKLVTD 463 Query: 301 VDEHE--------FFYSQETGGTIVSSALKLMDEVVKERYNPAQWNIYAAQASDGDNWAD 352 + GGT + A+ + +++ + I SDG + Sbjct: 464 LTSAANKGEVLSQIDGIGAGGGTNLYPAMVMGRDMLGIASAKIKHMIVL---SDGQSQGG 520 Query: 353 DSPLCHEILAKKLLPVVRYYSYIEITRRAHQTLWREYEHLQSTFDNFAMQHIRDQDDIYP 412 D LA ++ + S + L + + + Sbjct: 521 DFEGISSELA-QMGVTISTVSLGQGAA---VDLMAAIAQIGNGRAYV--------TNNAE 568 Query: 413 VFRELFHKQNATA 425 +F K+ A Sbjct: 569 EMPRIFTKETMEA 581 >UniRef50_Q1DFU7 von Willebrand factor type A domain protein n=2 Tax=Cystobacterineae RepID=Q1DFU7_MYXXD Length = 422 Score = 86.4 bits (212), Expect = 2e-15, Method: Composition-based stats. Identities = 37/204 (18%), Positives = 63/204 (30%), Gaps = 21/204 (10%) Query: 232 FDLRYKNYEKRPDPSSQAVMFCLMDVSGSMDQSTKDMAKRFYILLYLFLSRTYKNVEVVY 291 +L+ + E + +D SGSM+ A+R L L + + Y Sbjct: 32 MELKARPAETGQRVPVSLALV--LDRSGSMNGQKLADARRAATELVQRLKPEDRLAFIDY 89 Query: 292 ---------IRHHTQAKEVDEHEFFYSQETGGTIVSSALKLMDEVVKERYNPAQWNIYAA 342 R +A+E Q+ G T +S AL ++ + + Sbjct: 90 GTDVRVQPSRRMTEEAREELLTLISGLQDDGSTNISGALDAAANALRPHMREYRVSRAIL 149 Query: 343 QASDGD-NWADDSPLCHEILAKKLLPVVRYYSYIEITRRAHQTLWREYEHLQSTFDNFAM 401 SDG S ++L S + + R +TL R F F Sbjct: 150 -LSDGQPTTGIVSEPGLLDQVRQLRRDGITVSALGVGRDYQETLMRGMAEQGGGFSGFI- 207 Query: 402 QHIRDQDDIYPVFRELFHKQNATA 425 DD + +F ++ A Sbjct: 208 ------DDSARLAE-VFSRELDQA 224 >UniRef50_Q24FW2 von Willebrand factor type A domain containing protein n=1 Tax=Tetrahymena thermophila SB210 RepID=Q24FW2_TETTH Length = 1074 Score = 86.0 bits (211), Expect = 3e-15, Method: Composition-based stats. Identities = 30/179 (16%), Positives = 56/179 (31%), Gaps = 21/179 (11%) Query: 237 KNYEKRPDPSSQAVMFCLMDVSGSMDQSTKDMAKRFYILLYLFLSRTYKNVEVVYIRH-- 294 + + P + C+MD SGSM +M K + L L + V++ Sbjct: 354 FDAKAYQRPPID--LICVMDNSGSMHGEKINMLKETLLYLIDQLDEKDRLGLVLFNSEVT 411 Query: 295 -------HTQAKEVDEHEFFYSQETGGTIVSSALKLMDEVVKERYNPAQWNIY--AAQAS 345 T K + + GGT ++ + + +K R N S Sbjct: 412 FRPMKSMDTTNKLKLKQYISDIRAQGGTDINLGMTEAFKFIKTR---KYCNPVTSVFLLS 468 Query: 346 DG-DNWADDSPL-CHEILAKKLLPVVRYYSYIEITRRAHQTLWREYEHLQSTFDNFAMQ 402 DG D+ A D + ++ + + + R L + + + F Sbjct: 469 DGLDSKAQDRVAVTLKNMSINEQFSINCFGFG---RDHDPILMNQIKKIDQVDMFFVDA 524 >UniRef50_D2R2I7 von Willebrand factor type A n=2 Tax=Planctomycetaceae RepID=D2R2I7_9PLAN Length = 786 Score = 86.0 bits (211), Expect = 3e-15, Method: Composition-based stats. Identities = 23/167 (13%), Positives = 46/167 (27%), Gaps = 18/167 (10%) Query: 247 SQAVMFCLMDVSGSMDQSTKDMAKRFYILLYLFLSRTYKNVEVVYIR-----------HH 295 ++ + ++D SGSM + A+ + L V Y Sbjct: 305 TKKTVIFVVDRSGSMQGKKIEQAREAMRYVLNNLHEGDTFNIVAYDSTVESFKPELQKFD 364 Query: 296 TQAKEVDEHEFFYSQETGGTIVSSALKLMDEVVKERYNPAQWNIYAAQASDG-DNWADDS 354 ++ G T +S AL ++ P Y +DG + + Sbjct: 365 DATRKSALAYVDGLYAGGSTNISGALDSAFAMLTGSDRPN----YILFLTDGLPTAGETN 420 Query: 355 PLCHEILAKKL-LPVVRYYSYIEITRRAHQTLWREYEHLQSTFDNFA 400 LAK+ + R ++ + + L + Sbjct: 421 EGKIVELAKQKNVHRARMINFG-VGYDVNSRLLDRMSRENFGQSQYV 466 >UniRef50_UPI000180BC4A PREDICTED: similar to predicted protein n=1 Tax=Ciona intestinalis RepID=UPI000180BC4A Length = 1038 Score = 86.0 bits (211), Expect = 3e-15, Method: Composition-based stats. Identities = 30/209 (14%), Positives = 63/209 (30%), Gaps = 35/209 (16%) Query: 222 KIERVPFIDTFDLRYKNYEKRPDPSSQAVMFCLMDVSGSMDQSTKDMAKRFYILLYLFLS 281 R +FD R + + + + + ++D SGSM + ++AK + L+ Sbjct: 164 PSLRATDCGSFDNRCRPWYVQANVPKPKQIVIVIDKSGSMGVTNMNLAKEAAKSVVNTLN 223 Query: 282 RTYKNVEVVYIR-----HHTQA----------------KEVDEHEFFYSQETGGTIVSSA 320 + + + T A K+ E GGT + A Sbjct: 224 PQDRFAVMAFSSIFVPFQSTVASDQCFATTFADASPQNKKKVEDFVDTISSGGGTNYAPA 283 Query: 321 LKLMDEVVK----------ERYNPAQWNIYAAQASDG--DNWADDSPLCHEILAKKLLPV 368 L+ + ++ +P++ + SDG ++ ++L Sbjct: 284 LQKAFSFFQQEPSVSDFNIKKIDPSEIDRVILFMSDGIPNDPGSTILSAQIRANEQLNNS 343 Query: 369 VRYYSYIEITRRAHQTLWREYEHLQSTFD 397 V +Y A + R + Sbjct: 344 VIILTYGL--GNADFGVLRNMATNKGDVY 370 >UniRef50_UPI0001926ED6 PREDICTED: similar to inter-alpha trypsin inhibitor, heavy chain 3, partial n=1 Tax=Hydra magnipapillata RepID=UPI0001926ED6 Length = 464 Score = 86.0 bits (211), Expect = 3e-15, Method: Composition-based stats. Identities = 23/185 (12%), Positives = 45/185 (24%), Gaps = 28/185 (15%) Query: 240 EKRPDPSSQAVMFCLMDVSGSMDQSTKDMAKRFYILLYLFLSRTYKNVEVVYIRHHTQA- 298 EK + + ++D SGSM + K+ + L+ + + + T Sbjct: 42 EKEHRKRAPIDLVVVIDKSGSMAGEKLALVKKTLEFVVSQLNEKDRLCLITF---DTSVY 98 Query: 299 ------------KEVDEHEFFYSQETGGTIVSSALKLMD-EVVKERYNPAQWNIYAAQAS 345 K T + L EV+ + Sbjct: 99 LDFKLTPMTPMNKYQTLKIIKDISPGSMTNLCGGLMKGLCEVIDRADEEKNEVASVLLFT 158 Query: 346 DG-------DNWADDSPLCHEILAKKLLPV---VRYYSYIEITRRAHQTLWREYEHLQST 395 DG N S + + P Y++ + + +E S Sbjct: 159 DGFANKGGLTNIYCSSSQTAKYTIGIVGPKTADASIYTFGF-GSNHNAQMLKEISDAGSG 217 Query: 396 FDNFA 400 + Sbjct: 218 MYYYI 222 >UniRef50_D0LL92 von Willebrand factor type A n=1 Tax=Haliangium ochraceum DSM 14365 RepID=D0LL92_HALO1 Length = 430 Score = 85.6 bits (210), Expect = 3e-15, Method: Composition-based stats. Identities = 31/234 (13%), Positives = 51/234 (21%), Gaps = 14/234 (5%) Query: 202 EPAQLLEEERLRKEIAELRAKIERVPFIDTFDLRYKNYEKRPDPSSQAVMFCLMDVSGSM 261 PAQ + L + +D SGSM Sbjct: 6 TPAQRAGSVAVTVTPQYDLLPSNARELNLMVRLEGTGDAPATRAPLDLALV--IDRSGSM 63 Query: 262 DQSTKDMAKRFYILLYLFLSRTYKNVEVVYI----------RHHTQAKEVDEHEFFYSQE 311 K + L L V Y R + Q Sbjct: 64 SGDKLSDVKTAALELLETLQPEDTITLVSYSSDVSMHLMRTRADDAGQREARRALLALQA 123 Query: 312 TGGTIVSSALKLMDEVVKERYNPAQWNIYAAQASDG-DNWADDSPLCHEILAKKLLPVVR 370 GGT + L E ++ + + + + SDG N + P A Sbjct: 124 RGGTALGPGLFRALEALEGASDRTRMS-HLMLFSDGIANAGEVRPSVLGARAAGAFGAGV 182 Query: 371 YYSYIEITRRAHQTLWREYEHLQSTFDNFAMQHIRDQDDIYPVFRELFHKQNAT 424 S + + ++ L +F + + L Sbjct: 183 SVSTMGVGVDYNEDLMTRLADQGGGRYHFIQDSEAIASILDDEMKGLVATVARG 236 >UniRef50_C5WYV0 Putative uncharacterized protein Sb01g047470 n=3 Tax=Sorghum bicolor RepID=C5WYV0_SORBI Length = 686 Score = 85.6 bits (210), Expect = 3e-15, Method: Composition-based stats. Identities = 28/186 (15%), Positives = 53/186 (28%), Gaps = 28/186 (15%) Query: 241 KRPDPSSQAVMFCLMDVSGSMDQSTKDMAKRFYILLYLFLSRTYKNVEVVY--------- 291 R P + + ++DVSGSM + K+ + L + V + Sbjct: 158 DRDAPRAPLDLVTVLDVSGSMRWDKLALVKQAMGFVIGSLGPHDRLSVVSFSSGARRVTR 217 Query: 292 -IRHHTQAKEVDEHEFFYSQETGGTIVSSALKLMDEVVKERYNPAQWNIYAAQASDG-DN 349 +R K + + GGT ++ L+ +V+ ER + + SDG DN Sbjct: 218 LLRMSHTGKSLATEAVESLRAGGGTNIAEGLRTAAKVLGERRHRNAVSSVIL-LSDGHDN 276 Query: 350 WADDS---------------PLCHEILAKKLLPVVRYYSYIEITRRAHQTLWREYEHLQS 394 ++ P A +++ Sbjct: 277 YSMPRRARGGVPPNYEVLVPPSFVPGTASTGEGSAPIHTFGFGN-DHDAAAMHVVAEATG 335 Query: 395 TFDNFA 400 +F Sbjct: 336 GTFSFI 341 >UniRef50_A8FW78 Vault protein inter-alpha-trypsin domain protein n=1 Tax=Shewanella sediminis HAW-EB3 RepID=A8FW78_SHESH Length = 770 Score = 85.6 bits (210), Expect = 3e-15, Method: Composition-based stats. Identities = 28/196 (14%), Positives = 56/196 (28%), Gaps = 27/196 (13%) Query: 251 MFCLMDVSGSMDQSTKDMAKRFYILLYLFLSRTYKNVEVVY-----------IRHHTQAK 299 + ++D SGSM S + AK+ L + + I T+ Sbjct: 377 LILVIDTSGSMSGSAMEQAKKAMKYALAGLGSDDTFNVIEFNSKVSSLSKGPIPASTKNI 436 Query: 300 EVDEHEFFYSQETGGTIVSSALKLMDEV-----------VKERYNPA-QWNIYAAQASDG 347 E+ GGT ++ AL+ ++ + + +DG Sbjct: 437 EMANRFVHSLTSDGGTEMALALEHALGQESGGSSWQETGLQGKDEESTSRLRQVLFMTDG 496 Query: 348 DNWADDSPLCHEILAKKLLPVVRYYSYIEITRRAHQTLWREYEHLQSTFDNFAMQHIRDQ 407 A + L K + R ++ I + + + Q Sbjct: 497 ---AVGNEAELFKLIKYRIGKSRLFTLG-IGSAPNSHFMQRAAEFGRGTFTYIGDLDEVQ 552 Query: 408 DDIYPVFRELFHKQNA 423 + I + ++ H Q Sbjct: 553 EKIQGLLYKIEHPQIT 568 >UniRef50_Q14624 35 kDa inter-alpha-trypsin inhibitor heavy chain H4 n=38 Tax=Eutheria RepID=ITIH4_HUMAN Length = 930 Score = 85.2 bits (209), Expect = 4e-15, Method: Composition-based stats. Identities = 26/202 (12%), Positives = 63/202 (31%), Gaps = 28/202 (13%) Query: 239 YEKRPDPSSQAVMFCLMDVSGSMDQSTKDMAKRFYILLYLFLSRTYKNVEVVYIRHHT-- 296 + + + ++D SGSM + I + LS + +V+ T Sbjct: 263 FAPEGLTTMPKNVVFVIDKSGSMSGRKIQQTREALIKILDDLSPRDQFNLIVFSTEATQW 322 Query: 297 -------QAKEVDEHEFF--YSQETGGTIVSSALKLMDEVV-----KERYNPAQWNIYAA 342 A+ V++ F Q GGT ++ A+ + +++ +ER ++ Sbjct: 323 RPSLVPASAENVNKARSFAAGIQALGGTNINDAMLMAVQLLDSSNQEERLPEGSVSLIIL 382 Query: 343 QASDGD-NWADDSPLCHEILAKKL---LPVVRYYSYIEITRRAHQTLWREYEHLQSTFDN 398 +DGD + +P + ++ + + + Sbjct: 383 -LTDGDPTVGETNPRSIQNNVREAVSGRYSLFCLGFGFDVSY---AFLEKLALDNGGLAR 438 Query: 399 FAMQHIRDQDDIYPVFRELFHK 420 I + D ++ + + Sbjct: 439 ----RIHEDSDSALQLQDFYQE 456 >UniRef50_Q82LZ6 Putative uncharacterized protein n=1 Tax=Streptomyces avermitilis RepID=Q82LZ6_STRAW Length = 462 Score = 85.2 bits (209), Expect = 5e-15, Method: Composition-based stats. Identities = 23/170 (13%), Positives = 43/170 (25%), Gaps = 15/170 (8%) Query: 224 ERVPFIDTFDLRYKNYEKRPDP---SSQAVMFCLMDVSGSMDQSTKDMAKRFYILLYLFL 280 E L+ P ++D SGSM + D K +Y L Sbjct: 16 EATTHFVEIALKAGRARPEDAPATEPLPVNFVFVVDTSGSMTGTKLDTVKSALQTIYREL 75 Query: 281 SRTYKNVEVVYIRHHTQAK----------EVDEHEFFYSQETGGTIVSSALKLMDEVVKE 330 + + + E GGT + ++ + + Sbjct: 76 RPADCLGIITFDHNVRTVLPAVAKQDLPPERFAEVVSALTTQGGTDIDLGVQYGIDEISR 135 Query: 331 RYNPAQWNIYAAQASDGDNWADDSP--LCHEILAKKLLPVVRYYSYIEIT 378 + SDGD + + +A KL + + + Sbjct: 136 HSVSGRTVNCLYLFSDGDPTSGERDWIKVRANVAAKLRGDLTLSCFGFGS 185 >UniRef50_A6G857 Putative uncharacterized protein n=1 Tax=Plesiocystis pacifica SIR-1 RepID=A6G857_9DELT Length = 540 Score = 85.2 bits (209), Expect = 5e-15, Method: Composition-based stats. Identities = 26/174 (14%), Positives = 51/174 (29%), Gaps = 17/174 (9%) Query: 238 NYEKRPDPSSQAVMFCLMDVSGSMDQSTKDMAKRFYILLYLFLSRTYKNVEVVYIRHHTQ 297 + + P + +D+S SM+ D ++ + + L + V + + Sbjct: 149 DPAELDRPPLNLTI--AVDLSKSMEGEPIDRVRQGLLQMREQLEPEDR---VTLVGFGDE 203 Query: 298 AKEVDEH----------EFFYSQETGGTIVSSALKLMDEVVKERYNPAQWNIYAAQASDG 347 A+ + E+ G T + + L+ E N SDG Sbjct: 204 AQVIVENADKDSVELATAIAALVPWGSTNLYAGLRTAFEQTDLYAQEGWQNRVLL-VSDG 262 Query: 348 D-NWADDSPLCHEILAKKLLPVVRYYSYIEITRRAHQTLWREYEHLQSTFDNFA 400 + E LA+ + + + I L R L S + Sbjct: 263 VPTTGIVNSDKIEGLAEAWSGMGYGLTTVGIGNDFDIELMRNLSELGSGSFYYV 316 >UniRef50_A8TM27 Putative Ser protein kinase n=1 Tax=alpha proteobacterium BAL199 RepID=A8TM27_9PROT Length = 318 Score = 85.2 bits (209), Expect = 5e-15, Method: Composition-based stats. Identities = 32/73 (43%), Positives = 45/73 (61%) Query: 4 FIDRRLNGKNKSMVNRQRFLRRYKAQIKQSISEAINKRSVTDVDSGESVSIPTEDISEPM 63 IDRR N + KS+ NRQRFLRR K Q+ +++ +A +R + DV GE + IPT+ ++EP Sbjct: 230 IIDRRRNSQGKSLANRQRFLRRAKRQVTEAVRQASAERRIRDVADGEQIVIPTDGLNEPR 289 Query: 64 FHQGRGGLRHRVH 76 F LR H Sbjct: 290 FRHDARRLRLDRH 302 >UniRef50_P79263 Inter-alpha-trypsin inhibitor heavy chain H4 n=3 Tax=Theria RepID=ITIH4_PIG Length = 921 Score = 84.0 bits (206), Expect = 9e-15, Method: Composition-based stats. Identities = 22/186 (11%), Positives = 56/186 (30%), Gaps = 20/186 (10%) Query: 250 VMFCLMDVSGSMDQSTKDMAKRFYILLYLFLSRTYKNVEVVY---------IRHHTQAKE 300 + ++D SGSM + I + L + V + + + E Sbjct: 272 NVIFVIDTSGSMRGRKIQQTREALIKILGDLGSRDQFNLVSFSGEAPRRRAVAASAENVE 331 Query: 301 VDEHEFFYSQETGGTIVSSALKLMDEVVK----ERYNPAQWNIYAAQASDGD-NWADDSP 355 + GGT ++ A+ + ++++ E PA+ + +DGD + +P Sbjct: 332 EAKSYAAEIHAQGGTNINDAMLMAVQLLERANREELLPARSVTFIILLTDGDPTVGETNP 391 Query: 356 LCHEILAKK-LLPVVRYYSYIEITRRAHQTLWREYEHLQSTFDNFAMQHIRDQDDIYPVF 414 + ++ + + + + I + D Sbjct: 392 SKIQKNVREAIDGQHSLFCLGFGFDVPY-AFLEKMALENGGLAR----RIYEDSDSALQL 446 Query: 415 RELFHK 420 + + + Sbjct: 447 EDFYQE 452 >UniRef50_Q10Z89 von Willebrand factor, type A n=1 Tax=Trichodesmium erythraeum IMS101 RepID=Q10Z89_TRIEI Length = 477 Score = 84.0 bits (206), Expect = 9e-15, Method: Composition-based stats. Identities = 22/200 (11%), Positives = 53/200 (26%), Gaps = 29/200 (14%) Query: 239 YEKRPDPSSQAVMFCLMDVSGSMDQSTKDMAKRFYILLYLFLSRTYKNVEVVYIRHHTQA 298 P + L+D S SM + + + + + + Sbjct: 36 LSLTRRPPQPQTVVLLIDTSSSMWGGKLPEVQAAATGFVE--RQNLTVNNLAIVEFSSNS 93 Query: 299 KEVD---------EHEFFYSQETGGTIVSSALKLMDEVVKERYNPAQWNIYAAQASDGDN 349 + + + +GGT +S LK + +++ P +DG Sbjct: 94 QVLTNFDADKTELKQAIANLTPSGGTNLSQGLKTVASLLRNSNTPN-----ILLFTDGQ- 147 Query: 350 WADDSPLCHEILAKKLLPV---VRYYSYIEITRRAHQTLWRE-----YEHLQSTFDNFAM 401 + P + +A+++ + + +L + + F Sbjct: 148 --PNDPRASKSIAREIREAGINLVTVGTGDANSNYLTSLTENPDLVFFANSGEIDQAFRA 205 Query: 402 QH--IRDQDDIYPVFRELFH 419 I D + +F Sbjct: 206 AEKAISQLSDTSGNYGLVFG 225 >UniRef50_Q8LQ58 Os01g0640200 protein n=9 Tax=Poaceae RepID=Q8LQ58_ORYSJ Length = 589 Score = 84.0 bits (206), Expect = 9e-15, Method: Composition-based stats. Identities = 33/183 (18%), Positives = 54/183 (29%), Gaps = 23/183 (12%) Query: 232 FDLRYKNYEKRPDPSSQAVMFCLMDVSGSMDQSTKDMAKRFYILLYLFLSRTYKNVEVVY 291 +LR + + + ++DVSGSMD D K + LS + V + Sbjct: 54 LELRGSSSSTDR---AGLDLVAVIDVSGSMDGDRIDKVKTALQFVIRKLSDLDRLCIVTF 110 Query: 292 IRHHTQ----------AKEVDEHEFFYSQETGGTIVSSALKLMDEVVKERYNPAQWNIYA 341 + T+ A+ + + G T + L+ VV R A + Sbjct: 111 CTNATRLCPLRFVTAAAQAELKALVDGLKAYGDTNMKGGLETGMSVVDGRSLAAGRAVSV 170 Query: 342 AQASDG-DNWADDSPLCHEILAKKLLPVVRYYSYIEITRRAHQTLWREYEHLQ-STFDNF 399 SDG N D+ H V +S+ L N+ Sbjct: 171 MLMSDGYQNHGGDARDVHLKNV-----PVYTFSFGA---SHDSNLLEAIARKSLGGTFNY 222 Query: 400 AMQ 402 Sbjct: 223 VAD 225 >UniRef50_A9QZI4 von Willebrand factor type A domain protein n=26 Tax=Gammaproteobacteria RepID=A9QZI4_YERPG Length = 472 Score = 83.7 bits (205), Expect = 1e-14, Method: Composition-based stats. Identities = 35/207 (16%), Positives = 58/207 (28%), Gaps = 23/207 (11%) Query: 230 DTFDLRYKNYEKRPDPSSQAVMFCLMDVSGSMDQSTKDMAKRFYILLYLFLSRTYKNVEV 289 L N + + +D S SM + A+ IL L+ T V Sbjct: 79 LKISLTGFNLDSTRRSPINLALV--IDRSTSMSGERIEKAREEAILAVNMLNITDTLSVV 136 Query: 290 VYIRHHTQA----KEVDEHEFF-----YSQETGGTIVSSALKLMDEVVKERYNPAQWNIY 340 Y H K D+ + G T + + + + V + N Q N Sbjct: 137 AYDNHAEVIIPATKVTDKPALIASIQQHIHPRGMTALFAGVSMGIGQVDKHLNREQVNRI 196 Query: 341 AAQASDGD-NWADDSPLCHEILAK---KLLPVVRYYSYIEITRRAHQTLWREYEHLQSTF 396 SDG N S LA+ K + + ++ L Sbjct: 197 IL-ISDGQANTGPTSISELSDLARMAAKKGIAITTIGLGQ---DYNEDLMTAIAGYSDGN 252 Query: 397 DNFAMQHIRDQDDIYPVFRELFHKQNA 423 F + + D+ F + F + Sbjct: 253 HTF----VANSADLEKAFTKEFQDVMS 275 >UniRef50_D2S019 ATPase associated with various cellular activities AAA_5 n=1 Tax=Haloterrigena turkmenica DSM 5511 RepID=D2S019_9EURY Length = 665 Score = 83.7 bits (205), Expect = 1e-14, Method: Composition-based stats. Identities = 42/240 (17%), Positives = 75/240 (31%), Gaps = 25/240 (10%) Query: 129 FEDLALPNLKQNQQRQLTEYKTHRAGYTANGVPANISVVRS---------LQNSLARRTA 179 E N + + + G G A+ V R+ ++ +LARRT Sbjct: 337 DEAEGSDNETDRESDEDRDAAAAAGGIPIRGDEASYPVDRTAIQPPRDRTMREALARRTP 396 Query: 180 MTAGKRRELHALEENLAIISNSEPAQLLEEERLRKEIAELRAKIERVPFIDTFDLRYKNY 239 R + + + + L + I+ DLR K Sbjct: 397 SKVDVRSGRYVRARDSESVDDVAIDATLRAAAPHQPARRETDDSSSGIAIEPKDLRQKIR 456 Query: 240 EKRPDPSSQAVMFCLMDVSGS-MDQSTKDMAKRFYILLYLFL-SRTYKNVEVVY------ 291 E+R ++A++ ++D SGS M KR + L + VV+ Sbjct: 457 ERR----AEALVVFVVDASGSVMSGRQMFETKRGILSLVEDAYRARDRVAVVVFREEGAF 512 Query: 292 -IRHHTQAKEVDEHEFFYSQETGGTIVSSALKLMDEVV--KERYNPAQWNIYAAQASDGD 348 + T+ G T ++ L E+V + R + + + SDG Sbjct: 513 TLVEPTRNLSAARRAVSKLTVGGNTPLAHGLVEAYELVERERRRDEDLYPLVVL-FSDGQ 571 >UniRef50_Q22N58 von Willebrand factor type A domain containing protein n=1 Tax=Tetrahymena thermophila SB210 RepID=Q22N58_TETTH Length = 669 Score = 83.3 bits (204), Expect = 1e-14, Method: Composition-based stats. Identities = 32/199 (16%), Positives = 56/199 (28%), Gaps = 32/199 (16%) Query: 234 LRYKNYEKRPDPSSQAVMFCLMDVSGSMDQS----------------TKDMAKRFYILLY 277 +R + CL+D S SM T D+ K + Sbjct: 18 VRVSVIPPDDLERHPCNIVCLVDGSLSMGSKLVIHQKNGGKKESDMTTLDLVKHTVKTIA 77 Query: 278 LFLSRTYKNVEVVYIRHHTQAKEVDE----------HEFFYSQETGGTIVSSALKLMDEV 327 L+ + V + H E+ E E G T + L+ EV Sbjct: 78 SSLNPQDRLALVGFSTHSKIYFELTEMDDQGKNVAFTEIDKMWAGGQTNIWGGLQDSLEV 137 Query: 328 VKERYNPAQWNIYAAQASDGDNWADDSPLCHEILAKKLLP----VVRYYSYIEITRRAHQ 383 +K+ + P N+ +DG + E+L + +++ Sbjct: 138 IKKGFRPN-QNVCIFLFTDGRPTMIPAIGHVEMLRRWKEQHPAIQFSIFTFGFGN-DLDT 195 Query: 384 TLWREYEHLQSTFDNFAMQ 402 L E Q+ +F Sbjct: 196 DLMLELSQEQNGIFSFISD 214 >UniRef50_D0LJL4 Myxococcales GC_trans_RRR domain protein n=1 Tax=Haliangium ochraceum DSM 14365 RepID=D0LJL4_HALO1 Length = 602 Score = 83.3 bits (204), Expect = 2e-14, Method: Composition-based stats. Identities = 31/229 (13%), Positives = 56/229 (24%), Gaps = 56/229 (24%) Query: 241 KRPDPSSQAVMFCLMDVSGSMD-QSTKDMAKRFYILLYLFLSRTYKNVEVVYIRHH---- 295 + ++D SGSM + D ++ LL + + V Y Sbjct: 123 PDDIEPRPLDLVVVVDTSGSMATDARMDYVRQGLHLLVDAVDEDDRLALVSYQSFAEVHA 182 Query: 296 --------------------------------------TQAKEVDEHEF-FYSQETGGTI 316 +A + H Q GGT Sbjct: 183 ELPALPVEETPEEPTEPTDPVGEPTDPPADPDEDPVDEREAWRSEMHALVDTLQPGGGTN 242 Query: 317 VSSALKLMDEVVKERYN--PAQWNIYAAQASDG-DNWADDSPLCHEILAKKLLPVVRYYS 373 + L+ E+ KE P + SDG L++ + + Sbjct: 243 IYEGLERGFEIAKEARVNHPDRAQRVIL-LSDGLATEGITDSASIIALSEAFIEGGMGLT 301 Query: 374 YIEITRRAHQTLWREYEHLQSTFDNFAMQHIRDQDDIYPVFRELFHKQN 422 + + + L R + F RE+F ++ Sbjct: 302 TVGVGASFNVELMRGLAERGAGNFYFVED--------PEAVREVFTEEL 342 >UniRef50_B4W304 von Willebrand factor type A domain protein (Fragment) n=2 Tax=Cyanobacteria RepID=B4W304_9CYAN Length = 538 Score = 83.3 bits (204), Expect = 2e-14, Method: Composition-based stats. Identities = 29/196 (14%), Positives = 56/196 (28%), Gaps = 20/196 (10%) Query: 232 FDLRYKNYEKRPDPSSQAVMFCLMDVSGSMDQSTKDMAKRFYILLYLFLSRT-------- 283 + ++ + ++D SGSM + A + L +L+ Sbjct: 24 ITFQGSESSQQTSSRRPLNLSLVLDRSGSMAGAPLRYAIQAAQNLIDYLTADDFVSVVIY 83 Query: 284 YKNVEVVYIRHHTQAKEVDEHEFFYSQETGGTIVSSALKLMDEVVKERYNPAQWNIYAAQ 343 EV+ + + + + G T +S L V+ +P + N Sbjct: 84 DDTAEVIIPPQLVGDQAALKAKIGKIRARGCTNLSGGWLLGCSQVQANQSPERINRVLL- 142 Query: 344 ASDG-DNWADDSPLCHEILAKKLLPV---VRYYSYIEITRRAHQTLWREYEHLQSTFDNF 399 +DG N+ P A + + ++ L + F Sbjct: 143 LTDGLANYGIKDPQVLTKTALEKAEADIVTTTLGFGN---YFNEDLLINMANAARGNFYF 199 Query: 400 AMQHIRDQDDIYPVFR 415 I+ DD VF Sbjct: 200 ----IQSPDDASQVFE 211 >UniRef50_B8AE57 Putative uncharacterized protein n=1 Tax=Oryza sativa Indica Group RepID=B8AE57_ORYSI Length = 585 Score = 82.9 bits (203), Expect = 2e-14, Method: Composition-based stats. Identities = 26/229 (11%), Positives = 63/229 (27%), Gaps = 33/229 (14%) Query: 209 EERLRKEIAEL--RAKIERVPFIDTFDLRYKNYEKRPDP-SSQAVMFCLMDVSGSMDQS- 264 + ++ + LR + + ++DVSGSM Sbjct: 3 ADPVKVSTTTMLPTIPRGHTNKDFRVLLRVEAPPMADLKGHVPIDVVAVLDVSGSMGDPA 62 Query: 265 -------------TKDMAKRFYILLYLFLSRTYKNVEVVY------------IRHHTQAK 299 D+ K + L + V + + + Sbjct: 63 MASSDFEKNKPPSRLDVLKEAMKFIIRKLDDGDRLSIVAFNDRPVKEYSTGLLNISGNGR 122 Query: 300 EVDEHEFFYSQETGGTIVSSALKLMDEVVKERYNPAQWNI-YAAQASDGDNWADDSPLCH 358 + E + + + GGT + AL+ V+ R ++ ++ + +DGD+ Sbjct: 123 RIAEKKVDWLEARGGTALMPALEEAIRVLDCRPGDSRNSVGFILLLTDGDD--TSGFRWS 180 Query: 359 EILAKKLLPVVRYYSYIEITRRAHQTLWREYEHLQSTFDNFAMQHIRDQ 407 + + +++ + + L +F D+ Sbjct: 181 RDVINGAVGKYPVHTFGLGAAHSSEALL-HIAQESRGTYSFVDDENMDK 228 >UniRef50_UPI00016C377F protein containing a von Willebrand factor type A domain n=1 Tax=Gemmata obscuriglobus UQM 2246 RepID=UPI00016C377F Length = 821 Score = 82.5 bits (202), Expect = 2e-14, Method: Composition-based stats. Identities = 22/183 (12%), Positives = 56/183 (30%), Gaps = 21/183 (11%) Query: 241 KRPDPSSQAVMFCLMDVSGSMDQSTKDMAKRFYILLYLFLSRTYKNVEVVY--------- 291 + + ++D S SM AK+ L + V + Sbjct: 263 EAEKKRVARDLVLVLDTSSSMSDIKMQQAKKAVKFCLSQLQPEDRFGVVRFSTTVTKFRS 322 Query: 292 --IRHHTQAKEVDEHEFFYSQETGGTIVSSALKLMDEVVKERYNPAQWNIYAAQASDGD- 348 + +T ++ + +GGT + AL + +P++ +DG Sbjct: 323 ELVAANTDYLDLATKWIDGLKTSGGTAIWPALNDA--LAMRSSDPSR-PFTMVFFTDGQP 379 Query: 349 NWADDS-PLCHEILAKKLLPVVRYYSYIEITRRAHQTLWREYEHLQSTFDNFAMQHIRDQ 407 + + + + K R +++ + + + + + +R+ Sbjct: 380 TVDETNADKIVKNVLAKNTGNTRIFTFG-VGDDVNAAMLDQLADSTRAVSTY----VREA 434 Query: 408 DDI 410 +DI Sbjct: 435 EDI 437 >UniRef50_B6TZ81 Protein binding protein n=9 Tax=Magnoliophyta RepID=B6TZ81_MAIZE Length = 516 Score = 82.5 bits (202), Expect = 3e-14, Method: Composition-based stats. Identities = 30/203 (14%), Positives = 57/203 (28%), Gaps = 20/203 (9%) Query: 203 PAQLLEEERLRKEIAELRAKIERVPFIDTFDLRYKNYEKRPDPSSQAVMFCLMDVSGSMD 262 P ++ A +E +L + S + ++DVSGSM Sbjct: 19 PTPIVPGRVQLVSKNNNMAPLEENTQKVLLELTGGDSTSDR---SGLDLVAVLDVSGSMQ 75 Query: 263 QSTKDMAKRFYILLYLFLSRTYKNVEVVYIRHHTQ---AKEVDEHE-------FFYSQET 312 + K + LS + V ++ + ++V E Q Sbjct: 76 GEKIEKMKTAMKFVVKKLSSIDRLSIVTFLDTANRICPLQQVTEDSQPQLLKLIDALQPG 135 Query: 313 GGTIVSSALKLMDEVVKERYNPAQWNIYAAQASDGDNWADDSPLCHEILAKKLLPVVRYY 372 G T +S L+ +V+ +R + + SDG + K V + Sbjct: 136 GNTNISDGLQTGLKVLADRKLSSGRVVGVMLMSDGQ----QNRGEPAANVKIGNVPVYTF 191 Query: 373 SYIEITRRAHQTLWREYEHLQST 395 + T+ Sbjct: 192 GFGA---DYDPTVLNAVARNSMG 211 >UniRef50_B8AXM1 Putative uncharacterized protein n=1 Tax=Oryza sativa Indica Group RepID=B8AXM1_ORYSI Length = 614 Score = 82.5 bits (202), Expect = 3e-14, Method: Composition-based stats. Identities = 31/192 (16%), Positives = 62/192 (32%), Gaps = 27/192 (14%) Query: 247 SQAVMFCLMDVSGSMDQSTKDMAKRFYILLYLFLSRTYKNVEVVYIRH----------HT 296 + + ++DVSGSM+ + K + L + V + Sbjct: 28 AGVDVVAVLDVSGSMEGERLEHVKEAMEIFIGKLGPDDRLSVVSFATSVRRLTELTYMSE 87 Query: 297 QAKEVDEHEFFYSQETGGTIVSSALKLMDEVVKERYNPAQWN--IY--AAQASDGDNWAD 352 Q + V + G T + +AL ++++R + SDG N Sbjct: 88 QGRAVAKEIVDGLVADGSTNMGAALLEGAMILRDRKGARDESNGRVGCMMFLSDGTN--- 144 Query: 353 DSPLCHEILAKKLLPVVRYYSYIEITRRAHQTLWREYEHLQSTFDNFAMQHIRDQDDIYP 412 EI + + +++ + + + R S +F ++I DI Sbjct: 145 -----DEIYKEDISGEFPAHTFG-LGSDHNPNVMRHIADETSATYSFVNRNI---ADIKG 195 Query: 413 VFRELFHKQNAT 424 F +LF + Sbjct: 196 AF-DLFISGLTS 206 >UniRef50_UPI0000E105CF vault protein inter-alpha-trypsin n=1 Tax=Glaciecola sp. HTCC2999 RepID=UPI0000E105CF Length = 757 Score = 82.1 bits (201), Expect = 4e-14, Method: Composition-based stats. Identities = 29/191 (15%), Positives = 52/191 (27%), Gaps = 22/191 (11%) Query: 245 PSSQAVMFCLMDVSGSMDQSTKDMAKRFYILLYLFLSRTYKNVEVVYIRHHTQAKEVDEH 304 PS Q ++D SGSM + A +L+ V + + Sbjct: 386 PSIQQNTIFVLDSSGSMHGTALTQAIDAIREGVSYLTEHDTFNIVDFDSEARALWRQSQF 445 Query: 305 E-----------FFYSQETGGTIVSSALKLMDEVVKERYNPAQWNIYAAQASDGDNWADD 353 + GGT + AL L + + +DG + + Sbjct: 446 ADEVSKAEAMRFLRHVDSDGGTNMQDALALSLTQLLDSST--GLTQVIF-VTDG---SIN 499 Query: 354 SPLCHEILAKKLLPVVRYYSYIEITRRAHQTLWREYEHLQSTFDNFAMQHIRDQDDIYPV 413 + + L R ++ I + L + I D +I P Sbjct: 500 NERELLKQIAEQLGDKRLFTVG-IGAAPNSHFMEYAAMLGKGTYTY----IDDLTEIQPK 554 Query: 414 FRELFHKQNAT 424 LF + + Sbjct: 555 MAYLFSQLRSP 565 >UniRef50_Q24C76 von Willebrand factor type A domain containing protein n=2 Tax=Tetrahymena thermophila RepID=Q24C76_TETTH Length = 670 Score = 82.1 bits (201), Expect = 4e-14, Method: Composition-based stats. Identities = 27/187 (14%), Positives = 56/187 (29%), Gaps = 35/187 (18%) Query: 247 SQAVMFCLMDVSGSMDQSTK------------------DMAKRFYILLYLFLSRTYKNVE 288 + + + C++DVSGSM K D+ K ++ L Sbjct: 31 TNSNICCVVDVSGSMSSEAKIINQSSQKSDENYSLSILDVVKHSIKMIVNTLGSEDYLSI 90 Query: 289 VVY-----IRHH-----TQAKEVDEHEFFYSQETGGTIVSSALKLMDEVVKERYNPAQWN 338 V + + K + + GGT + L ++ P N Sbjct: 91 VTFSDSANVLFDLLPMNDSNKTMAIEKIENLSTEGGTELWKGLNSALNILLNNKTPN-TN 149 Query: 339 IYAAQASDG---DNWADDSPLCHEILAKKLLPVVRYYSYIEITRRAHQTLWREYEHLQST 395 +DG D+ D + + + KL + + + + ++ L + + Sbjct: 150 QSIFLLTDGQPTDSGIDTNLVKFKQAYPKLNCTINTFGF---SSSSNSELMNKIAMEYNG 206 Query: 396 FDNFAMQ 402 +F Sbjct: 207 MFSFIPD 213 >UniRef50_A0LHW4 Vault protein inter-alpha-trypsin domain protein n=3 Tax=Deltaproteobacteria RepID=A0LHW4_SYNFM Length = 812 Score = 82.1 bits (201), Expect = 4e-14, Method: Composition-based stats. Identities = 29/177 (16%), Positives = 54/177 (30%), Gaps = 20/177 (11%) Query: 251 MFCLMDVSGSMDQSTKDMAKRFYILLYLFLSRTYKNVEVVYIRHHTQ--AKEVDEHE--- 305 ++DVSGSM +++KR L L T +++ T + V Sbjct: 334 YIFIVDVSGSMHGFPLEISKRLLTDLIGGLKPTDCFNVMLFSGDSTVMAERSVPASADNV 393 Query: 306 ------FFYSQETGGTIVSSALKLMDEVVKERYNPAQWNIYAAQASDGDNWADDSPLCHE 359 Q GGT + ALK + ++ + A+DG Sbjct: 394 RRAVEMIGRRQGGGGTELLPALKKALSLPRKE----GVSRSMVIATDG---FVTVEEEAF 446 Query: 360 ILAKKLLPVVRYYSYIEITRRAHQTLWREYEHLQSTFDNFAMQHIRDQDDIYPVFRE 416 L + + ++ + T ++ L + F + + FR Sbjct: 447 ELIRSHIGDANFFPFGIGTS-VNRMLIEGMARAGAGEP-FVITRPDEAPAGAEKFRR 501 >UniRef50_UPI000180CCF8 PREDICTED: similar to PK-120 n=1 Tax=Ciona intestinalis RepID=UPI000180CCF8 Length = 864 Score = 81.7 bits (200), Expect = 4e-14, Method: Composition-based stats. Identities = 30/261 (11%), Positives = 75/261 (28%), Gaps = 28/261 (10%) Query: 190 ALEENLAIISNSE----PAQLLEEERLRKEIAELRAKIERVPFIDTFDLRYKNY-EKRPD 244 E A +S + L + + E I D + ++ Sbjct: 234 RRSETQAYVSYRPTREQQRNIRRRSDLSFLVNYDVTREELGGEILIKDGYFVHFFAPTNL 293 Query: 245 PSSQAVMFCLMDVSGSMDQSTKDMAKRFYILLYLFLSRTYKNVEVVY-----IRHHTQAK 299 P + ++DVSGSM K + L+ + + + + H + Sbjct: 294 PVIPKKVVFVIDVSGSMSGHKIVQTKEALRTILDDLNEIDQFNIITFSSTTNVWHPNEMV 353 Query: 300 EVDEHEFFYSQ-------ETGGTIVSS----ALKLMDEVVKERYNPAQWNIYAAQASDGD 348 +V+ ++ GGT ++ ++L++ + R N + +DG Sbjct: 354 DVNPTNIRNAKKHVRSMYARGGTNFNAAALDGIQLLETISSNRTNTLEEASMMILLTDGQ 413 Query: 349 -NWADD-SPLCHEILAKKLLPVVRYYSYIEITRRAHQTLWREYEHLQSTFDNFAMQHIRD 406 + + +++ + H+ L + + I + Sbjct: 414 PTVGVTGNEAIRRNIRERVNGRYSIFCLGFGQHLDHEFL-DQIASENKG----LSRKIYN 468 Query: 407 QDDIYPVFRELFHKQNATAKG 427 D ++ + + + Sbjct: 469 DADAALQLKDFYDEVASPLLA 489 >UniRef50_Q8PU63 Putative chloride channel n=1 Tax=Methanosarcina mazei RepID=Q8PU63_METMA Length = 1004 Score = 81.7 bits (200), Expect = 4e-14, Method: Composition-based stats. Identities = 27/199 (13%), Positives = 57/199 (28%), Gaps = 22/199 (11%) Query: 242 RPDPSSQAVMFCLMDVSGSMDQSTKDMAKRFYILLYLFLSRTYKNVEVVYIRHHT----- 296 + ++ A + ++D SGSM S AK L ++ V + Sbjct: 306 EDNANANANVMLVIDRSGSMSGSPISSAKNSANLFIDYMEAEDMAGVVSFSSSARYDYHL 365 Query: 297 -----QAKEVDEHEFFYSQETGGTIVSSALKLMDEVVKERYNPAQWNIYAAQASDGDNWA 351 + K + + +G T + S ++ + Y SDG + Sbjct: 366 ATLTPEVKNSIKQKINSIYASGVTAIGSGMRYGLNDL-LNYGDPNNPWAIVLLSDGYQNS 424 Query: 352 DDSPLCHEILAKKLLPVVRYYSYIEITRRAHQTLWREYEHLQSTFDNFAMQHIRDQDDIY 411 ++P K ++ Y+ + Q L ++ +IY Sbjct: 425 GENPNNVIPSIK--ASNIQVYTVG-LGPAVDQKLLGNIADQTGGKYYYSPTD-SQLQEIY 480 Query: 412 PVF-------RELFHKQNA 423 + +F + Sbjct: 481 NDIVGKIIGWKTVFKRNVK 499 >UniRef50_C0PS55 Putative uncharacterized protein n=1 Tax=Picea sitchensis RepID=C0PS55_PICSI Length = 829 Score = 81.7 bits (200), Expect = 5e-14, Method: Composition-based stats. Identities = 47/300 (15%), Positives = 86/300 (28%), Gaps = 46/300 (15%) Query: 127 LLFEDLALP-NLKQN-QQRQLTEYKTHRAGYTANGVPANISVVRSLQNSLARRTAMTAGK 184 + +D L NL+ N Q L + V + R+ N+ R A+ Sbjct: 250 VYDDDEPLDSNLRPNEGQTSLLTDDDREFEFKGLFVDHEGTSDRAAGNARKMRIAL--YP 307 Query: 185 RRELHALEENLAIISNSEPAQLLEEERLRKEIAELRAKIERVPFIDTFDLRYKNYEKRPD 244 E A E + + K+ + V K P Sbjct: 308 EVEAVAAGEACENFTVLVHVKAPSASEASKKQNYEDCEGNMV--------------KDPG 353 Query: 245 PSSQAVMFCLMDVSGSMDQSTKDMAKRFYILLYLFLSRTYKNVEVVYIRHHTQA------ 298 + + ++DVSGSM + + KR + LS + VV+ + Sbjct: 354 CRAPIDLVTVLDVSGSMSGTKLALLKRAMAFVISNLSPEDRLSVVVFSSTAKRVFSLKRM 413 Query: 299 ----KEVDEHEFFYSQETGGTIVSSALKLMDEVVKERYNPAQWNIY--AAQASDG-DNWA 351 + TGGT ++ L+ +V+++R Q N SDG D ++ Sbjct: 414 TPDGQRAANRVVERLLCTGGTNIAEGLRKGAKVLEDR---RQRNPVASIMLLSDGQDTYS 470 Query: 352 DDSPLCHEILAKKLLPVVR-----------YYSYIEITRRAHQTLWREYEHLQSTFDNFA 400 S + + R +++ + +F Sbjct: 471 LSSRGVVLFPSDEQRRSARQSTRYGHVQIPVHAFGFGV-DHDAATMHAISEVSGGTFSFI 529 >UniRef50_A9AXC2 von Willebrand factor type A n=6 Tax=Chloroflexi (class) RepID=A9AXC2_HERA2 Length = 421 Score = 81.3 bits (199), Expect = 6e-14, Method: Composition-based stats. Identities = 22/178 (12%), Positives = 53/178 (29%), Gaps = 16/178 (8%) Query: 246 SSQAVMFCLMDVSGSMDQSTKDMAKRFYILLYLFLSRTYKNVEVVY--------IRHHTQ 297 + ++D SGSM + + V++ + Sbjct: 41 QMPVNVSFVLDHSGSMKGDKMRCVREATQRALGLMGPQDIVSVVIFDHRRETIISAQPVR 100 Query: 298 AKEVDEHEFFYSQETGGTIVSSALKLMDEVVKERYNPAQWNIYAAQASDGDNWADDSPLC 357 + E ++ GGT ++ AL+ ++ N + +DG Sbjct: 101 NVAALQAEVGKIKDAGGTKIAPALEAALNEIRRSQNANTISRIIL-LTDGQTEG---ERD 156 Query: 358 HEILAKKLLPVVRYYSYIEITRRAHQTLWREYEHLQSTFDNFAMQHIRDQDDIYPVFR 415 LA+++ + + + ++ L E + + + +DI F+ Sbjct: 157 CLRLAEEIGKASVPLTALGVGDDWNEDLLIEMANRSGGVAEY----FSNPNDIASFFQ 210 >UniRef50_A0LPD4 von Willebrand factor, type A n=1 Tax=Syntrophobacter fumaroxidans MPOB RepID=A0LPD4_SYNFM Length = 479 Score = 81.3 bits (199), Expect = 6e-14, Method: Composition-based stats. Identities = 36/201 (17%), Positives = 60/201 (29%), Gaps = 21/201 (10%) Query: 237 KNYEKRPDPSSQAVMFCLMDVSGSMDQSTKDM-AKRFYILLYLFLSRTYKNVEVVYIRHH 295 + + + M +MD SGSM + K A++ + L LS T + V Y H Sbjct: 80 RAPGGNVEARRELDMVVVMDRSGSMADAGKLTHARQAVLNLLSRLSETDRFALVSYSDHV 139 Query: 296 ---------TQAKEVDEHEFFY-SQETGGTIVSSALKLMDEVVKERYNPAQWNIYAAQAS 345 T A Q G T + L+ + E + + S Sbjct: 140 QRHGGLLPITPANRATLERIVRGIQPGGATNLGGGLQEGISQLAELQQNGRLSRLIL-IS 198 Query: 346 DG-DNWADDSPLCHEILAKKLLPVVRYYSYIEITRRAHQTLWREYEHLQSTFDNFAMQHI 404 DG N P +A S + + ++ L + F Sbjct: 199 DGLANRGVTDPSALGTMASVAAERGYAVSTVGVGLDFNEHLMTSIADKGAGNYTF----- 253 Query: 405 RDQDDIYPVFRELFHKQNATA 425 + F ++F K+ A Sbjct: 254 ---MESASAFAQVFDKEFRDA 271 >UniRef50_Q0AMP5 Vault protein inter-alpha-trypsin domain protein n=1 Tax=Maricaulis maris MCS10 RepID=Q0AMP5_MARMM Length = 740 Score = 81.3 bits (199), Expect = 6e-14, Method: Composition-based stats. Identities = 29/232 (12%), Positives = 55/232 (23%), Gaps = 28/232 (12%) Query: 209 EERLRKEIAELRAKIERVPFIDTFDLRYKNYEKRPDPSSQAVMFCLMDVSGSMDQSTKDM 268 + A + + ++ L ++ ++D SGSM ++ Sbjct: 305 ADPSEASAALFIEEWQGETYLLAQILPPAELGADTPRRARET-IFVIDNSGSMGGASMRQ 363 Query: 269 AKRFYILLYLFLSRTYKNVEVVYIRHHTQAKEVDEHEFF--------------YSQETGG 314 A+ I L + IR ++V + GG Sbjct: 364 ARAALITALQRLEPGDR---FNVIRFDNTMEQVFPQAVDASPDNVATALTFARRLEAQGG 420 Query: 315 TIVSSALKLMDEVVKERYNPAQWNIYAAQASDGDNWADDSPLCHEILAKKLLPVVRYYSY 374 T++ AL + +DG A + + L R + Sbjct: 421 TVMLPALNAALR--DTSPDDDSRVRQIVFLTDG---AIGNEAELFAAIEAGLGRSRLFPV 475 Query: 375 IEITRRAHQTLWREYEHLQSTFDNFAMQHIRDQDDIYPVFRELFHKQNATAK 426 I + L I ++ ELF Sbjct: 476 G-IGSAPNGYFMSRAARLGRGTS----TQIGQVSEVEARMEELFTALERPVM 522 >UniRef50_B9XLE8 Vault protein inter-alpha-trypsin domain protein n=1 Tax=bacterium Ellin514 RepID=B9XLE8_9BACT Length = 723 Score = 81.3 bits (199), Expect = 6e-14, Method: Composition-based stats. Identities = 27/199 (13%), Positives = 57/199 (28%), Gaps = 26/199 (13%) Query: 240 EKRPDPSSQAVMFCLMDVSGSMDQSTKDMAKRFYILLYLFLSRTYKNVEVVYIRHHTQA- 298 E + M ++D SGSM AK L + + H +Q Sbjct: 389 ELGQLGRAPMEMVFVLDCSGSMSGEPIAQAKAAIRHALKQLQPGDSFQIINFSEHASQLG 448 Query: 299 ---KEVDEHEFFY-------SQETGGTIVSSALKLMDEVVKERYNPA-QWNIYAAQASDG 347 E G T + +K + + + + +DG Sbjct: 449 AKPLEATPENIRKGLAYVEALNSDGPTEMIEGIKAALD-----FPHDPERLRFVCFLTDG 503 Query: 348 DNWADDSPLCHEILAKKLLPVVRYYSYIEITRRAHQTLWREYEHLQSTFDNFAMQHIRDQ 407 + + + +++ R +S+ ++ L + A+ H+ Sbjct: 504 --FIGNEAEILAAVHERIGAS-RIFSFGV--GSCNRYLLDHLAKMGGG----AVAHLGLH 554 Query: 408 DDIYPVFRELFHKQNATAK 426 D+ V + F + + A Sbjct: 555 DNGAKVMDDFFERVSHPAM 573 >UniRef50_C9YX20 Putative uncharacterized protein n=1 Tax=Streptomyces scabiei 87.22 RepID=C9YX20_STRSW Length = 1239 Score = 81.3 bits (199), Expect = 7e-14, Method: Composition-based stats. Identities = 38/293 (12%), Positives = 73/293 (24%), Gaps = 30/293 (10%) Query: 60 SEPMFHQGRGGLRHRVHPGNDHFVQNDRIERPQGGGGGSGSGQGQASQDGEGQDEFVFQI 119 +G G G P ++ E Sbjct: 897 ELYGTGRGEGSSDLGR-EGGRSRAGGQDTSFPTAREWAEELDALFGAEVREEVLARAADQ 955 Query: 120 SKDEYLDLLFEDLALPNLKQNQQRQLTEYKTHRAGYTANGVPANISVVRSLQNSLARRTA 179 + + L L R + T ++ +R L L A Sbjct: 956 GRTDVLA---------ELDPKAVRPSVDLLTSVLSLAGGMPEQQLARLRPLVRRLVDELA 1006 Query: 180 MTAGKRRELHALEENLAIISNSEPAQLLEEERLRKEIAELRAKIERVPFIDTFDLRYKNY 239 R + +L LR +A R + + + Sbjct: 1007 KELATRMRPALTGLATPRPTRRPGGRLDLPRTLRANLAHTRRTADGRTVVVPERPVF--- 1063 Query: 240 EKRPDPSSQAVMFCLMDVSGSMDQSTKDMAKRFYILLYLFLSRTYKNVEVVYIRHHTQAK 299 R + + ++DVSGSM+ S A +L + ++ T Sbjct: 1064 STRSRKEADWRLILVVDVSGSMEASVIWSALTAAVL------GGVPTLSTHFLAFSTDVI 1117 Query: 300 EVDEHEFFYS------QETGGTIVSSALKLMDEVVKERYNPAQWNIYAAQASD 346 ++ + + GGT +++ L +V P++ + SD Sbjct: 1118 DLTDRVDDPLSLLLEVRVGGGTHIAAGLAHARSLVT---VPSRTLVVV--VSD 1165 >UniRef50_D0LWF9 von Willebrand factor type A n=1 Tax=Haliangium ochraceum DSM 14365 RepID=D0LWF9_HALO1 Length = 419 Score = 81.0 bits (198), Expect = 7e-14, Method: Composition-based stats. Identities = 24/205 (11%), Positives = 57/205 (27%), Gaps = 22/205 (10%) Query: 234 LRYKNYEKRPDPSSQAVMFCLMDVSGSMDQSTKDMAKRFYILLYLFLSRTYKNVEVVYIR 293 + + E + +D S SM A + L + + + Sbjct: 22 IEAQATESSARMPVNLALV--IDRSSSMRGPRLASAIVAARQVVEQLDERDRLSVIAFDA 79 Query: 294 H----------HTQAKEVDEHEFFYSQETGGTIVSSALKLMDEVVKERYNPAQWNIYAAQ 343 +A++ E + GT +++ +K E V+ + + Sbjct: 80 TARTIFGPMSVTDEARQTLEQALAGLRTGVGTNLAAGMKKGAEAVRSGFVRGALSRLVL- 138 Query: 344 ASDGD-NWADDSPLCHEILAKKLLPVVRYYSYIEITRRAHQTLWREYEHLQSTFDNFAMQ 402 +DG + LA+K + + + + L + H ++ Sbjct: 139 LTDGQPSLGITDNDRLCALAQKEADRGVTITTMGLGQGFDDELLADLAHSGRGGFHY--- 195 Query: 403 HIRDQDDIYPVFRELFHKQNATAKG 427 + DI F ++ + Sbjct: 196 -LASAADIPGA----FGRELSGVFA 215 >UniRef50_UPI00006CF36E U-box domain containing protein n=1 Tax=Tetrahymena thermophila RepID=UPI00006CF36E Length = 790 Score = 81.0 bits (198), Expect = 7e-14, Method: Composition-based stats. Identities = 34/219 (15%), Positives = 57/219 (26%), Gaps = 53/219 (24%) Query: 227 PFIDTFDLRYKNYEKRPDPSSQAVMFCLMDVSGSMDQSTK----------------DMAK 270 F D + K E + S + C++DVSGSM K D+ K Sbjct: 116 RFNDQVKISIKTPEGQQR--SACDICCVIDVSGSMSDEAKIKNSKGDIESNGLTILDLVK 173 Query: 271 RFYILLYLFLSRTYKNVEVVYIRHHTQAKEVDE-------------HEFFYSQETGGTIV 317 + L + V + HT A ++ + E T + Sbjct: 174 HSVKTIINNLDERDRLSLVAF---HTNAYKITDLTPMNENGRNHAIKELEKLIPLDSTNI 230 Query: 318 SSALKLMDEVV---KERYNPAQWNIY----AAQASDGD-NWADDSPLCHEILAKKLLP-- 367 + EVV +++ +DG N P H + KK Sbjct: 231 WDGIYQALEVVKAGQQQSIQKGEQRVAFSQILLFTDGQPNVIP--PRGHLPMLKKYKEEN 288 Query: 368 ----VVRYYSYIEITRRAHQTLWREYEHLQSTFDNFAMQ 402 + + + L + F Sbjct: 289 DVNCSISTFGFGY---NLDSELLDQLAIEGRGSFAFIPD 324 >UniRef50_Q47YR5 Von Willebrand factor type A domain protein n=2 Tax=cellular organisms RepID=Q47YR5_COLP3 Length = 786 Score = 81.0 bits (198), Expect = 8e-14, Method: Composition-based stats. Identities = 31/204 (15%), Positives = 62/204 (30%), Gaps = 19/204 (9%) Query: 234 LRYKNYEKRPDPSSQAVMFCLMDVSGSMDQSTKDMAKRFYILLYLFLSRTYKNVEVVY-- 291 L + EK + ++D SGSM + + AK L L L+ + + Sbjct: 379 LTFFPPEKAVAQVIARDIIFIIDTSGSMQAGSMEQAKSSLQLALLQLNNKDSFNIIAFDN 438 Query: 292 -------IRHHTQAKEVDEHE--FFYSQETGGTIVSSALKLMDEVVKERYNPAQWNIYAA 342 + H A + + + GGT + L + K++ ++ Sbjct: 439 DTELLFPVTHMASAHNISKAQQFIDGLSANGGTEMYRPLSNALMMKKDKTQSSKAIRQIV 498 Query: 343 QASDGDNWADDSPLCHEILAKKLLPVVRYYSYIEITRRAHQTLWREYEHLQSTFDNFAMQ 402 +DG A + L R Y+ I + ++ F Sbjct: 499 FITDG---AVANEFELMQLLNTAQGDFRLYTVG-IGAAPNGYFMKKAAQFGRGSYVF--- 551 Query: 403 HIRDQDDIYPVFRELFHKQNATAK 426 I+++ ++ K + A Sbjct: 552 -IQNKSEVQRKMSHFMTKISQPAL 574 >UniRef50_B9RR85 Inter-alpha-trypsin inhibitor heavy chain, putative n=1 Tax=Ricinus communis RepID=B9RR85_RICCO Length = 755 Score = 81.0 bits (198), Expect = 8e-14, Method: Composition-based stats. Identities = 28/223 (12%), Positives = 60/223 (26%), Gaps = 35/223 (15%) Query: 227 PFIDTFDLRYKNYEKRPDPSSQA---VMFCLMDVSGSMDQSTKDMAKRFYILLYLFLSRT 283 +D D+ Y P+ + + ++D+SGSM+ + K L+ Sbjct: 300 HDVDQRDMFYLYLFPGDQPNMKVFRKEIVFIVDISGSMEGKPLEGMKNAMSGALAKLNPK 359 Query: 284 YKNVEVVYIR----HHTQAKEVDEHEFFYSQ--------ETGGTIVSSALKLMDEVVKER 331 + + + + E + GGT +S L E+V Sbjct: 360 DSFNIIAFNGETYLFSSLMELATEKTVERAVEWMNLNFIAGGGTNISVPLNQAMEMV--- 416 Query: 332 YNPAQWNIYAAQASDGDNWADDSPLCHEILAKK-------LLPVVRYYSYIEITRRAHQT 384 N +DG A + KK + P + + Sbjct: 417 SNTQGSLPVIFLVTDG---AVEDERHICDSMKKYVRGKGAICPRIYTFGIGTYCNHYFLR 473 Query: 385 LWREYEHLQSTFDNFAMQHIRDQDDIYPVFRELFHKQNATAKG 427 + + + ++ + +I F + + Sbjct: 474 MLATVCR-GQYDAAYDVDSVQARMEI------FFSRGLSAVLA 509 >UniRef50_B5W7H4 von Willebrand factor type A n=1 Tax=Arthrospira maxima CS-328 RepID=B5W7H4_SPIMA Length = 488 Score = 81.0 bits (198), Expect = 9e-14, Method: Composition-based stats. Identities = 23/186 (12%), Positives = 54/186 (29%), Gaps = 25/186 (13%) Query: 251 MFCLMDVSGSMDQSTKDMAKRFYILLYLFLSRTYKNVEVVYIRHHTQAKEVD-------- 302 + L+D SGSM + + K ++ + ++A V Sbjct: 53 VVLLIDTSGSMSGQKLREVQTAASEFVS--RQNLKRHDLAVVEFSSRASVVADFTRNETE 110 Query: 303 -EHEFFYSQETGGTIVSSALKLMDEVVKERYNPAQWNIYAAQASDGDNWADDSPLCHEIL 361 + GGT +S L V++ +DG ++P + Sbjct: 111 LQQAIARLSARGGTNLSEGFNLATSVLQNS----DRTPNILLFTDGV---PNNPPMAASI 163 Query: 362 AKKLLP-VVRYYSYIEITRRAHQTLWREYEHLQSTFDNFAMQHIRDQDDIYPVFRE-LFH 419 A+++ + + + + F + D D + + ++ Sbjct: 164 AQQIRASGINLVAVGTGDAQINYLT----ALTGDPDLVF-YANFGDLDRAFRGAEKAIYG 218 Query: 420 KQNATA 425 +Q + Sbjct: 219 QQLVES 224 >UniRef50_Q498Q0 Inter-alpha (Globulin) inhibitor H3 n=6 Tax=Clupeocephala RepID=Q498Q0_DANRE Length = 892 Score = 80.6 bits (197), Expect = 1e-13, Method: Composition-based stats. Identities = 18/184 (9%), Positives = 51/184 (27%), Gaps = 21/184 (11%) Query: 234 LRYKNYEKRPDPSSQAVMFCLMDVSGSMDQSTKDMAKRFYILLYLFLSRTYKNVEVVYIR 293 + Y + ++D SGSM + + + + + L++ + + Sbjct: 254 VHYFAPTDVQRIPKN--VVFIIDQSGSMQGNKIEQTRMAMLRILSDLAKDDYFGLITFSS 311 Query: 294 H-----------HTQAKEVDEHEFFYSQETGGTIVSSALKLMDEVVKERYNPAQWNIYAA 342 H + E + + G T ++ A+ ++ + +I Sbjct: 312 HIQAWKPELLKATAENVEEAKTFVKQIRSGGATDINGAVLNAVNMINQYTQEGSASILIL 371 Query: 343 QASDGDNW-ADDSPLCHEILAKKLLPV---VRYYSYIEITRRAHQTLWREYEHLQSTFDN 398 +DGD +P+ + K + + + + + Sbjct: 372 -LTDGDPTSGVTNPVTIQQNVKTAIGGKYPLYCLGFGF---NVRFEFLEKMSLENNGAAR 427 Query: 399 FAMQ 402 + Sbjct: 428 RIYE 431 >UniRef50_Q9NY47 Voltage-dependent calcium channel subunit delta-2 n=115 Tax=Euteleostomi RepID=CA2D2_HUMAN Length = 1150 Score = 80.2 bits (196), Expect = 1e-13, Method: Composition-based stats. Identities = 29/216 (13%), Positives = 59/216 (27%), Gaps = 19/216 (8%) Query: 221 AKIERVPFIDTFDLRYKNYEKRPDPSSQAVMFCLMDVSGSMDQSTKDMAKRFYILLYLFL 280 ID +D+R + SS M ++DVSGS+ T + K + L Sbjct: 263 TPWRAPKKIDLYDVR-RRPWYIQGASSPKDMVIIVDVSGSVSGLTLKLMKTSVCEMLDTL 321 Query: 281 SRTYKNVE-------------VVYIRHHTQAKEVDEHEFFYSQETGGTIVSSALKLMDEV 327 S ++ + + K+V + G T + + + Sbjct: 322 SDDDYVNVASFNEKAQPVSCFTHLVQANVRNKKVFKEAVQGMVAKGTTGYKAGFEYAFDQ 381 Query: 328 VKERYNPAQW-NIYAAQASDGDNWADDSPLCHEILAKKLLPVVRYYSYIEITRRAHQTLW 386 ++ N +DG +D VR +++ T Sbjct: 382 LQNSNITRANCNKMIMMFTDG---GEDRVQDVFEKYNWPNRTVRVFTFSVGQHNYDVTPL 438 Query: 387 REYEHLQSTFDNFAMQHIRDQDDIYPVFRELFHKQN 422 + F + I + ++ + Sbjct: 439 QWMACANKG-YYFEIPSIGAIRINTQEYLDVLGRPM 473 >UniRef50_B0CG18 von Willebrand factor type A domain protein, putative n=5 Tax=Cyanobacteria RepID=B0CG18_ACAM1 Length = 708 Score = 80.2 bits (196), Expect = 1e-13, Method: Composition-based stats. Identities = 25/221 (11%), Positives = 57/221 (25%), Gaps = 30/221 (13%) Query: 219 LRAKIERVPFIDTFDLRYKNYEKRPDPSSQA--------VMFCLMDVSGSMDQSTKDMAK 270 + +K + + D R ++ P+ + + L+D SGS ++ Sbjct: 303 VASKQTQATLLTQSDQRGGHFATYLIPALKYKSNQIVPKDVVFLIDTSGSQSGPPIVQSR 362 Query: 271 RFYILLYLFLSRTYKNVEVVYIRHHTQ-----------AKEVDEHEFFYSQETGGTIVSS 319 + L+ + + ++ ++ GGT + + Sbjct: 363 KLMTQFLDKLNPNDTFSIINFSNTTSKLSPKPLANTPANRKKALEYIKKLDANGGTELMN 422 Query: 320 ALKLMDEVVKERYNPAQWNIYAAQASDGDNWADDSPLCHEILAKKLLPVVRYYSYIEITR 379 + V P +DG D + +L P R Y + Sbjct: 423 GINT---VAAFPPAPDGRLRSVVLLTDG--LIGDDETIIAAVRDRLKPGNRIYPFGVGFS 477 Query: 380 RAHQTLWREYEHLQSTFDNFAMQHIRDQDDIYPVFRELFHK 420 ++ L + + D F + Sbjct: 478 T-NRFLLDRLAEVGRGT-----VEVVAPKDSAEKVAAKFVQ 512 >UniRef50_Q09DT2 Inter-alpha-trypsin inhibitor family heavy chain-related protein-hypothetical secreted or membrane-associated protein containing vWFA domain n=1 Tax=Stigmatella aurantiaca DW4/3-1 RepID=Q09DT2_STIAU Length = 843 Score = 80.2 bits (196), Expect = 1e-13, Method: Composition-based stats. Identities = 23/193 (11%), Positives = 48/193 (24%), Gaps = 23/193 (11%) Query: 246 SSQAVMFCLMDVSGSMDQSTKDMAKRFYILLYLFLSRTYKNVEVV-------YIRHHTQA 298 + + ++D SGSM+ + A+ L L + + + Sbjct: 244 PKRQEVVFVVDTSGSMEGESLPQAQGALRLCLRHLREGDRFNIIAFDTSFQSFAPQPAVF 303 Query: 299 KEVDEHEFFY----SQETGGTIVSSALKLMDEVVKERYNPAQWNIYAAQASDGDNWADDS 354 + + + GGT + + + E +DG + Sbjct: 304 TQKTLEQADRWVAALRANGGTELLQPMLAAVQAAPEG--------VVVLLTDGQ---VGN 352 Query: 355 PLCHEILAKKLLPVVRYYSYIEITRRAHQTLWREYEHLQSTFDNFAMQHIRDQDDIYPVF 414 + R YS+ I L ++ F R D + F Sbjct: 353 EAEILQAVLRARKTARIYSFG-IGTNVSDALLKDMARQTDGAVEFIHPGERIDDKVVAQF 411 Query: 415 RELFHKQNATAKG 427 + + Sbjct: 412 SRALAPRITELQA 424 >UniRef50_UPI0000E460BF PREDICTED: similar to inter-alpha-trypsin inhibitor heavy chain3 n=5 Tax=Strongylocentrotus purpuratus RepID=UPI0000E460BF Length = 1028 Score = 79.8 bits (195), Expect = 2e-13, Method: Composition-based stats. Identities = 19/153 (12%), Positives = 47/153 (30%), Gaps = 15/153 (9%) Query: 239 YEKRPDPSSQAVMFCLMDVSGSMDQSTKDMAKRFYILLYLFLSRTYKNVEVVYIRHHTQA 298 + P+++ + ++DVSGSM KR + + + + +++ + Sbjct: 299 FSPDGLPNTRKNVIFVIDVSGSMYGQKTRQTKRAFTTILDDVRPIDRINIILFSSYAHVW 358 Query: 299 K-----EVDEHEF-------FYSQETGGTIVSSALKLMDEVVKERYNPAQWNIYAAQASD 346 + E GGT + +L E++ E + +D Sbjct: 359 REDQMVEATSDNIAAAKRHVNGLSVGGGTNIYDSLMKAVEILLEH-DTGDAMPLIIMLTD 417 Query: 347 GDNWADDSPLCHEILAKKLLPVVRYYSYIEITR 379 G ++ + + + +S Sbjct: 418 GQ--VGNAAAIVRDVTSVIGGRLSLFSIGFGNG 448 >UniRef50_C4RGW7 Putative uncharacterized protein n=1 Tax=Micromonospora sp. ATCC 39149 RepID=C4RGW7_9ACTO Length = 633 Score = 79.8 bits (195), Expect = 2e-13, Method: Composition-based stats. Identities = 35/273 (12%), Positives = 71/273 (26%), Gaps = 21/273 (7%) Query: 68 RGGLRHRVHPGNDHFVQNDRIERPQGGGGGSGSGQGQASQDGEGQDEFVFQISKDEYLDL 127 R ++ + G G + E DE ++ Sbjct: 283 PADARRYARALDELYGAGRGEGGIDLGHAAGGGQEAAFPTAREWADELEALFGTTVREEV 342 Query: 128 LFEDLA------LPNLKQNQQRQLTEYKTHRAGYTANGVPANISVVRSLQNSLARRTAMT 181 + L L R E T A ++ +R L L Sbjct: 343 IARAAESGRTDVLSELDPAAVRPSVELLTSVLSLAGGLPEAKLAGLRPLVRRLVEELTAR 402 Query: 182 AGKRRELHALEENLAIISNSEPAQLLEEERLRKEIAELRAKIERVPFIDTFDLRYKNYEK 241 + + ++ LR +A R + + + Sbjct: 403 LATQVRPALTGLTSPRPTRRPGGRINLARTLRANLAHTRRLDDGRTVVVPQRPVFHT--- 459 Query: 242 RPDPSSQAVMFCLMDVSGSMDQSTKDMAKRFYILLYLFLSRTYKNVEVVYIRHHTQAKEV 301 R + + ++DVSGSM+ S A +L + ++ T+ ++ Sbjct: 460 RTRREADWRLVLVVDVSGSMEASVVWSALTAAVLA------GVPTLSTHFLAFSTEVVDL 513 Query: 302 DEHEFFYS------QETGGTIVSSALKLMDEVV 328 + + GGT +++ L +V Sbjct: 514 TDRVDDPLALLLEVRVGGGTHIAAGLAHARSLV 546 >UniRef50_UPI00006A1915 Transmembrane protein 110. n=2 Tax=Xenopus (Silurana) tropicalis RepID=UPI00006A1915 Length = 728 Score = 79.8 bits (195), Expect = 2e-13, Method: Composition-based stats. Identities = 24/204 (11%), Positives = 52/204 (25%), Gaps = 24/204 (11%) Query: 234 LRYKNYEKRPDPSSQAVMFCLMDVSGSMDQSTKDMAKRFYILLYLFLSRTYKNVEVVY-- 291 + Y + ++D SGSM ++ + L +++ Sbjct: 235 VHYFAPASLQKVPKN--VVFVIDHSGSMHGQKIKQTYEAFLKILADLPEEDHFGILIFDD 292 Query: 292 ---IRHHTQAKEVDEHEF------FYSQETGGTIVSSALKLMDEVV----KERYNPAQWN 338 +T K V ++ GGT ++ AL +++ + + P Sbjct: 293 KVDKWQNTLVKAVPDNIIKAKQFVSKISARGGTDINKALLAAVKMLKNTSRNKLLPKIST 352 Query: 339 IYAAQASDGDNW-ADDSPLCHEILAKKLLPV-VRYYSYIEITRRAHQTLWREYEHLQSTF 396 SDG+ + KK Y + Sbjct: 353 SIILFLSDGEPTSGVTNHNEIINNVKKANERQTTLYCLGFGN-DVDFNFLEKMALENGGL 411 Query: 397 DNFAMQHIRDQDDIYPVFRELFHK 420 I + D + +++ Sbjct: 412 AR----RIYEDSDAALQLQGFYNE 431 >UniRef50_Q6ZED8 Slr7060 protein n=2 Tax=Chroococcales RepID=Q6ZED8_SYNY3 Length = 588 Score = 79.8 bits (195), Expect = 2e-13, Method: Composition-based stats. Identities = 25/204 (12%), Positives = 54/204 (26%), Gaps = 28/204 (13%) Query: 238 NYEKRPDPSSQAVMFCLMDVSGSMDQSTKDM-AKRFYILLYLFLSRTYKNVEVVY----- 291 + P + ++D SGSM+ K A++ LS ++ Sbjct: 33 SPPAMDQPRPSLNLGFVIDRSGSMEGHNKITYARQAVCYAIDQLSPGDHLSVTIFDDQVQ 92 Query: 292 -IRHHTQAKEVDEHE--FFYSQETGGTIVSSA-LKLMDEVVKERYNPAQWNIYAAQASDG 347 + T K+ + + G T + L+ +V + A+ N SDG Sbjct: 93 TLIPSTLVKDKAQFKRLVQGINPGGCTDLHGGWLQGGIQVSQNLS--AELNRIIL-LSDG 149 Query: 348 -DNWADDSPLCHEILA---KKLLPVVRYYSYIEITRRAHQTLWREYEHLQSTFDNFAMQH 403 N + +P + + ++ L + Sbjct: 150 LANRGETNPDIIATDVHGLAQRGASTTTLGLGD---DYNEDLLEAMARSGDGNYYYVADA 206 Query: 404 IRDQDDIYPVFRELFHKQNATAKG 427 + +F ++ Sbjct: 207 EQLP--------TIFERELQGLAA 222 >UniRef50_C7FPD9 Uncharacterized protein n=2 Tax=environmental samples RepID=C7FPD9_9BACT Length = 836 Score = 79.4 bits (194), Expect = 2e-13, Method: Composition-based stats. Identities = 22/173 (12%), Positives = 50/173 (28%), Gaps = 19/173 (10%) Query: 239 YEKRPDPSSQAVMFCLMDVSGSMDQSTKDMAKRFYILLYLFLSRTYKNVEVVYI-RHHTQ 297 + PD + +F ++D SGSM D A+ + + + + Sbjct: 299 LKVDPDQVTPKELFFVVDTSGSMMGEPLDKARAAMRYALERMGPDDTFQIIDFASGVASL 358 Query: 298 AKEVDEHEFFYSQET----------GGTIVSSALKLMDEVVKERYNPAQWNIYAAQASDG 347 A + ++ GGT + + ++ + P A +DG Sbjct: 359 APRPLPNTPENLRKGLAFIEAMTSQGGTEMLAGIRAALD----GPTPPGRLRIVAFMTDG 414 Query: 348 DNWADDSPLCHEILAKKLLPVVRYYSYIEITRRAHQTLWREYEHLQSTFDNFA 400 + + + R +S+ + ++ L E + Sbjct: 415 ---YIGNDGDILDYIDQSVGQARLFSFG-VGEDVNRYLLEEMATRGRGTVQYV 463 >UniRef50_Q97HZ9 Predicted metal-dependent peptidase n=1 Tax=Clostridium acetobutylicum RepID=Q97HZ9_CLOAB Length = 456 Score = 79.4 bits (194), Expect = 2e-13, Method: Composition-based stats. Identities = 35/215 (16%), Positives = 63/215 (29%), Gaps = 31/215 (14%) Query: 140 NQQRQLTEYKTHRAGYTANGVPANISVVRSLQNSLARRTAMTAGKRRELHALEEN--LAI 197 ++ + G I L+N + R +A +R L++ A Sbjct: 200 KNFDEMN-IHKTWSESYNRGYENQID---ELKNKIIRNSAKGRIPKRVQEYLDDMNKKAE 255 Query: 198 ISNSEPAQLLEEERLRKEIAELRAKIERVPFIDTFDLRYKNYEKRPDPSSQAVMFCLMDV 257 IS + + + K R P+ DLR K + + +D+ Sbjct: 256 ISWQMYLKKAIGTLPKGYKKTITRKDRRQPY--RMDLRGKLSDHIIK------IVVAIDI 307 Query: 258 SGSMDQSTKDMAKRFYILLYLFLSRTYKNVEVVYIRHHTQAKEV-----DEHEFFYSQET 312 SGSM + D A + KN E+ I + + Sbjct: 308 SGSMTDAEIDAAMTEIFDILKN-----KNYELTIIECDNIVRRMYRVSKPRDMKKKLDTK 362 Query: 313 GGTIVSSALKLMDEVVKERYNPAQWNIYAAQASDG 347 GGT S + + + + + +DG Sbjct: 363 GGTSFSPVFEYLHK-------NRMEDCFLIYFTDG 390 >UniRef50_A6G2V8 von Willebrand factor, type A n=1 Tax=Plesiocystis pacifica SIR-1 RepID=A6G2V8_9DELT Length = 877 Score = 79.4 bits (194), Expect = 2e-13, Method: Composition-based stats. Identities = 24/187 (12%), Positives = 48/187 (25%), Gaps = 23/187 (12%) Query: 251 MFCLMDVSGSMDQSTKDMAKRFYILLYLFLSRTYKNVEVVY----IRHHTQAKEVDEHEF 306 + ++D SGSM D AK + + + + + Sbjct: 377 LVFVVDNSGSMGGLPMDTAKGLMRKALKDIRPDDTFTVLRFSESASGLSNKLLPATQDNI 436 Query: 307 -------FYSQETGGTIVSSALKLMDEVVKERYNPAQWNIYAAQASDGDNWADDSPLCHE 359 Q GGT ++ +K V +P + +DG + Sbjct: 437 EAGVDYVDAMQGMGGTQMTEGIKAALRVPH---DPDRL-RVVMFLTDG---YIGNEQAIF 489 Query: 360 ILAKKLLPVVRYYSYIEITRRAHQTLWREYEHLQSTFDNFAMQHIRDQDDIYPVFRELFH 419 L + R +S + ++ L + + D PV + Sbjct: 490 ELIDDNIGDARLFSLG-VGGAPNRYLLDGMASVGRG--AVTYAGYDEPAD--PVIERFYE 544 Query: 420 KQNATAK 426 + Sbjct: 545 RVATPVL 551 >UniRef50_A8M9M1 von Willebrand factor type A n=1 Tax=Caldivirga maquilingensis IC-167 RepID=A8M9M1_CALMQ Length = 474 Score = 79.4 bits (194), Expect = 3e-13, Method: Composition-based stats. Identities = 39/247 (15%), Positives = 75/247 (30%), Gaps = 33/247 (13%) Query: 170 LQNSLARRTAMTAGKRRELHALEENLAIISNSEPAQLLEEERLRKEIAELRAKIERVPF- 228 +++AR + R E+ + + I + + V Sbjct: 218 ALDNVAREPTIGNALRVS--RFTEHSNYPTYITGVREYRIGDPAYRIDLDKTSMNMVRKT 275 Query: 229 -----IDTFDLRYKNYEKRPDPSSQAVMFCLMDVSGSMDQ-----STKDMAKRFYILLYL 278 + T D+ + Y + +D SGSM + D+AK + Sbjct: 276 FLNKPMSTRDIVVREYADVKL----MDIVLCLDTSGSMKEFSGAYMKMDIAKEAIVKYIR 331 Query: 279 FLSRTYKNVEVVYIRH-------HTQAKEVD---EHEFFYSQETGGTIVSSALKLMDEVV 328 +LSRT + +V K+ E Y GGT +++AL+ ++ Sbjct: 332 YLSRTNDRLSMVLFNFRADILWGPHSVKKYINEMEEMSRYIYPGGGTNIANALEKARIIL 391 Query: 329 KERYNPAQWNIYAAQASDGDNWADDSPLCHEILAKKLLPVVRYYSYIEITRRAHQTLWRE 388 + P N + +DG S C + K V + + + L Sbjct: 392 SKSNYP---NKHIICITDGRTVNASS--CIKEAVKLRRMGVTLSTVA-VGDNSDFDLLMR 445 Query: 389 YEHLQST 395 + + Sbjct: 446 LSKIGNG 452 >UniRef50_Q8YTA2 Alr2822 protein n=11 Tax=Bacteria RepID=Q8YTA2_ANASP Length = 224 Score = 79.0 bits (193), Expect = 3e-13, Method: Composition-based stats. Identities = 25/161 (15%), Positives = 55/161 (34%), Gaps = 19/161 (11%) Query: 239 YEKRPDPSSQAVMFCLMDVSGSMDQSTKDMAKRFYILLYLFLSRTY---KNVEVVYIRHH 295 E +P + L+D SGSM + + + + L L + + VE+ + Sbjct: 11 VEFAENPEPRCPCVLLLDTSGSMQGAAIEALNQGLLSLKDELMKNSIAARRVEIAIVTFD 70 Query: 296 TQAKE----VDEHEFFY--SQETGGTIVSSALKLMDEVVKER---YNPAQ---WNIYAAQ 343 + V +F G T + + + ++V+ER Y + + Sbjct: 71 SHINVIQDFVTADQFNPPILTAQGLTSMGAGIHKALDMVQERKSLYRANGVAYYRPWVFM 130 Query: 344 ASDGDNWADDSPLCHEILAK----KLLPVVRYYSYIEITRR 380 +DG+ + L + + ++ V ++S Sbjct: 131 ITDGEPQGELDHLVEQAALRLQGDEVNKRVAFFSVGVENAN 171 >UniRef50_UPI0001788F71 von Willebrand factor type A n=1 Tax=Geobacillus sp. Y412MC10 RepID=UPI0001788F71 Length = 1007 Score = 79.0 bits (193), Expect = 3e-13, Method: Composition-based stats. Identities = 26/182 (14%), Positives = 45/182 (24%), Gaps = 16/182 (8%) Query: 241 KRPDPSSQAVMFCLMDVSGSMDQSTKDMAKRFYILLYLFLSRTYKNVEVVYIRHH----- 295 KR PS + ++D SGSMD + ++AK + + V + Sbjct: 401 KREIPSLG--LILVIDRSGSMDGNKIELAKESAMRTVELMRAKDTVGVVAFDDQPWWVVP 458 Query: 296 ---TQAKEVDEHEFFYSQETGGTIVSSALKLMDEVVKERYNPAQWNIYAAQASDGDNWAD 352 KE GGT + A+ E E + +DG + + Sbjct: 459 PQKLGDKEEVLSSIQSIPSAGGTNIYPAVSSALE---EMLKIDAQRRHIILMTDGQSAMN 515 Query: 353 DSPLCHEILAKKLLPVVRYYSYIEITRRAHQTLWREYEHLQSTFDNFAMQHIRDQDDIYP 412 + + + A L + F Sbjct: 516 SGYQDLTDTMVENKITMSSVAVGM---DADTNLLQSLADAAKGRYYFVEDETTLPAVFSR 572 Query: 413 VF 414 Sbjct: 573 EA 574 >UniRef50_C3YPR2 Putative uncharacterized protein (Fragment) n=1 Tax=Branchiostoma floridae RepID=C3YPR2_BRAFL Length = 863 Score = 79.0 bits (193), Expect = 3e-13, Method: Composition-based stats. Identities = 20/205 (9%), Positives = 51/205 (24%), Gaps = 24/205 (11%) Query: 239 YEKRPDPSSQAVMFCLMDVSGSMDQSTKDMAKRFYILLYLFLSRTYKNVEVVYIRHHT-- 296 + P + ++D SGSM + K+ + L + + + T Sbjct: 225 FAPSGLPVVPKNIVFIIDKSGSMGGTKMRQTKQAMNTILKDLRDHDRFNVMPFSYSSTMW 284 Query: 297 ---QAKEVDEHEFF--------YSQETGGTIVSSALKLMDEVVKERYNPAQWNI----YA 341 + GGT ++ A+ ++++ + + Sbjct: 285 RPNEMVLATRENIESARTYVRRSINAGGGTNINQAIIDAADLLRRVTDDQPNSPRSASLI 344 Query: 342 AQASDG-DNWADDSPLCHEILAKK-LLPVVRYYSYIEITRRAHQTLWREYEHLQSTFDNF 399 +DG + + P + K + V + + Sbjct: 345 IFLTDGLPSVGESKPRNIMVNVKNAIREQVSLFCLGFGK-DVDFPFLEKMALENRGLAR- 402 Query: 400 AMQHIRDQDDIYPVFRELFHKQNAT 424 I + D + + + Sbjct: 403 ---RIYEDSDAALQLKGFYDEVATP 424 >UniRef50_C0HF51 Putative uncharacterized protein n=1 Tax=Zea mays RepID=C0HF51_MAIZE Length = 459 Score = 79.0 bits (193), Expect = 3e-13, Method: Composition-based stats. Identities = 25/227 (11%), Positives = 69/227 (30%), Gaps = 24/227 (10%) Query: 203 PAQLLEEERLRKEIAELRAKIERVPFIDTFDLRYKNYEKRPDPSSQAVMFCLMDVSGSMD 262 A + + + + E + ++ ++ + ++D+SGSM Sbjct: 1 MAPHGDIDLAKAYHHVTVSMREHTEKVM---VKLTAPHTGKGDTAPLDIVVVLDISGSMR 57 Query: 263 QSTKDMAKRFY-ILLYLFLS-RTYKNVEVVYIRHHTQAKEVDE----------HEFFYSQ 310 + + K + L R + + + + ++ + Sbjct: 58 GTKLEHMKHAMTRFIIEKLGIRGDRLAIITFESKAHKVFDLSSMLPDQVKKAVAVVEGLK 117 Query: 311 ETGGTIVSSALKLMDEVVKERYNPAQWNIYAAQASDGDNWADDSPLCHEILAKKLLPVVR 370 G T + + L+ +V+K R + SDG ++ +L + V Sbjct: 118 AGGDTNIKAGLEAGLDVLKTRRGHSHNASCIFLMSDG---HENVDKARTLLDRVGEHSVV 174 Query: 371 YYSYIEITRRAHQTLWREYEHLQSTFDNFAMQHIRDQDDIYPVFREL 417 + + E + + + H+R+++D + + Sbjct: 175 TFGFGEKSDE------QLLYDIAYHSHAGTYHHVREKEDENQLMKAF 215 >UniRef50_B8BII0 Putative uncharacterized protein n=1 Tax=Oryza sativa Indica Group RepID=B8BII0_ORYSI Length = 585 Score = 79.0 bits (193), Expect = 3e-13, Method: Composition-based stats. Identities = 34/232 (14%), Positives = 61/232 (26%), Gaps = 34/232 (14%) Query: 206 LLEEERLRKEIAELRAKIERVPFIDTFDLRYKNYEKRPDPSSQAVMFCLMDVSGSMDQS- 264 + LR E E + K + ++DVSGSM ++ Sbjct: 5 AAGKVTLRSEPKEKAIPSNEERKEWPVLVHVVAPAKTERFPID--LVAVLDVSGSMTKAT 62 Query: 265 ------TKDMAKRFYILLYLFLSRTYKNVEV-----VYIRHHTQAKEVD-------EHEF 306 D+ K ++ L + V V T+ E+ + Sbjct: 63 SMHGWTRLDLVKGAMKMVTNKLGAGDRLAIVPFNGKVVAAGATRLMEMTTKGRADANAKV 122 Query: 307 FYSQETGGTIVSSALKLMDEVVKERYNPAQWNI--YAAQASDGDNWADDSPLCHEILAKK 364 + G T ALK ++ R + + SDG + Sbjct: 123 NQLKAGGDTKFLPALKHASGLLDSRPAGDKQYRPGFIFLLSDGQDNGV---------LDD 173 Query: 365 LLPVVRY--YSYIEITRRAHQTLWREYEHLQSTFDNFAMQHIRDQDDIYPVF 414 L VRY +++ R + + + + VF Sbjct: 174 KLGGVRYPAHTFGMCQSRCNPKSMVHIATATKGSYHPIDDKLSNVAQALAVF 225 >UniRef50_C5Z1W1 Putative uncharacterized protein Sb10g030210 n=1 Tax=Sorghum bicolor RepID=C5Z1W1_SORBI Length = 607 Score = 78.7 bits (192), Expect = 4e-13, Method: Composition-based stats. Identities = 40/269 (14%), Positives = 76/269 (28%), Gaps = 26/269 (9%) Query: 168 RSLQNSLARRTAMTA-GKRRELHALEENLAIISNSEPAQLLEEERLRKEIAELRAKIERV 226 R+ +R+ + + + EE +A SN+ + + + K + Sbjct: 23 RAASAGRSRKPGTKSSAPNKMFNDDEEPIAPASNAGKQVRGFSDVGKASVKPYYPKEAPL 82 Query: 227 PFIDTFDLRYKNYEKRPDPSSQAVMFCLMDVSGSMDQ-STKDMAKRFYILLYLFLSRTYK 285 L + + + ++DVSGSM D K + L+ + Sbjct: 83 GASTVRVLLDVSSSSSTAGRAALDLVVVLDVSGSMRDFGRLDKLKSAMRFIIKKLAPMDR 142 Query: 286 NVEVVYIRHHT----------QAKEVDEHEFFYSQETGGTIVSSALKLMDEVVKERYNPA 335 V + T A V GGT + + LK+ +V+ R Sbjct: 143 LSVVTFNGGATRECPLRAMSEDAVPVLTDIVDGLVARGGTNIEAGLKMGLQVLDGRRYTG 202 Query: 336 QWNIYAAQASDGDNWADDSPLCHEILAKKLLPVVRYYSYIEITRRAHQTLWREYEHLQST 395 SDG+ + D+ V S+ A L ++ Sbjct: 203 ARTAGVILMSDGEQNSGDATRVRNPQN----YPVYTLSFG---SNADMNLLQKLA-GGGG 254 Query: 396 FDNFAMQHIRDQDDIYPVFRELFHKQNAT 424 N + ++F + A Sbjct: 255 TYNPVLDSGG------MSMLDVFSQLMAG 277 >UniRef50_Q6ABM1 Magnesium-chelatase 67 kDa subunit n=4 Tax=Actinomycetales RepID=Q6ABM1_PROAC Length = 654 Score = 78.7 bits (192), Expect = 4e-13, Method: Composition-based stats. Identities = 43/311 (13%), Positives = 87/311 (27%), Gaps = 23/311 (7%) Query: 68 RGGLRHRVHPGNDHFVQNDRIERPQGGGGG-SGSGQGQASQDGEGQDEFVFQISKDEYLD 126 +H + + ERP G+ + + + Sbjct: 286 PPRNQHPDDQPDQPEQRPREPERPDPDVEKWQAGENLATPPSSSGEQQPEYHDGPQ---N 342 Query: 127 LLFEDLALPNLKQNQQRQLTEYKTHRAGYTANGVPANISVVRSLQNSLARRTAMTAGKRR 186 + +Q + + AG P S R + + RR + RR Sbjct: 343 QRDDGQH----DPRKQPSGSGEQVVAAGDPFAVRPLEPSQDRFARRACGRRLRTRSNDRR 398 Query: 187 ELHALEENLAIISNSEPAQLLEEERLRKEIAELRAKIERVPFIDTFDLRYKNYEKRPDPS 246 + + L + ++ + + + D R K R Sbjct: 399 GRYVSARPTDRPDDLALDATLRAAAVHQKSRRATERPDLAVHVKPIDWRAKVRAGR---- 454 Query: 247 SQAVMFCLMDVSGSMDQSTKDMAKR--FYILLYLFLSRTYKNVEVVY-------IRHHTQ 297 + + + ++D SGSM + A + LL + + + + + T Sbjct: 455 AASCVIFVVDASGSMGSRGRMTASKGAVLSLLLDAYVKRDRVCLIGFRRDRAEVLVPVTS 514 Query: 298 AKEVDEHEFFYSQETGGTIVSSALKLMDEVVKERYNPA-QWNIYAAQASDGD-NWADDSP 355 + EV +H G T +S+ L EVV+ +DG N + D Sbjct: 515 SVEVAQHGLAELPVGGRTPLSAGLIKACEVVRPLLLKDPGLRPLLILVTDGRGNVSLDGR 574 Query: 356 LCHEILAKKLL 366 + + + Sbjct: 575 PNSQATDEAIR 585 >UniRef50_C0E6Z8 Putative uncharacterized protein n=2 Tax=Corynebacterium matruchotii RepID=C0E6Z8_9CORY Length = 1107 Score = 78.7 bits (192), Expect = 4e-13, Method: Composition-based stats. Identities = 39/296 (13%), Positives = 86/296 (29%), Gaps = 30/296 (10%) Query: 79 NDHFVQNDRIERPQGGGGGSGSGQGQAS---QDGEGQDEFVFQISKDEYLDLLFEDLAL- 134 ++ + + + G G G + Q+E D+ ++ + + Sbjct: 770 DELYGADTVGKDHAGESDGRVHAAGNGPSQLGVRQWQEEITALFGADQLQEIFGKAADMG 829 Query: 135 -----PNLKQNQQRQLTEYKTHRAGYTANGVPANISVVRSLQNSLARRTAMTAGKRRELH 189 +L R E T + + +R L + + + + Sbjct: 830 RSDVITSLDAESVRPSVELLTTVLNLKGALPESRLRQLRPLVSKIVSELSKELASQLSPA 889 Query: 190 ALEENLAIISNSEPAQLLEEERLRKEIAELRAKIERVPFIDTFDLRYKNYEKRPDPSSQA 249 L + + L+ + + + P I ++ S Sbjct: 890 -LGGMANTKPSRRKSPRLDLPATVRNNLKHTVMVNSRPQIIPVTPIFRAP---ERKVSPW 945 Query: 250 VMFCLMDVSGSMDQSTKDMAKRFYILLYLFLSRTYKNVEVVYIRHHTQAKEVD------E 303 + L+DVSGSM+ S + + + +V +I T ++ Sbjct: 946 HIIVLVDVSGSMEPS------TVFAAMTAGILAGVNTFKVSFITFDTSVIDLTGHVEDPL 999 Query: 304 HEFFYSQETGGTIVSSALKLMDEVVKERYNPAQWNIYAAQASDGDNWADDSPLCHE 359 + GGT ++ ++ + NP + SD + W + L HE Sbjct: 1000 ELLLEIKVGGGTNIAQGVRYA---ASQVTNPTKT--ILVLISDFEEWGSVNNLTHE 1050 >UniRef50_B6ZDR6 Voltage dependent calcium channel alpha2d/delta subunit n=3 Tax=Euteleostomi RepID=B6ZDR6_RANCA Length = 1078 Score = 78.7 bits (192), Expect = 4e-13, Method: Composition-based stats. Identities = 31/231 (13%), Positives = 62/231 (26%), Gaps = 24/231 (10%) Query: 206 LLEEERLRKEIAELRAKIERVPFIDTFDLRYKNYEKRPDPSSQAVMFCLMDVSGSMDQST 265 L + + R ID +D+R + +S M L+DVSGS+ T Sbjct: 215 LARYYPASPWVDKSRTP----NKIDLYDVR-RRPWYIQGAASPKDMLILVDVSGSVSGLT 269 Query: 266 KDMAKRFYILLYLFLSRTYKNVEVVYIRHHTQA-------------KEVDEHEFFYSQET 312 + + + LS + + K+V + Sbjct: 270 LKLIRTSVTEMLETLSDDDFVNVAAFNSNAHDVSCFHHLVQANVRNKKVLKEAVNNITAK 329 Query: 313 GGTIVSSALKLMDEVVKERYNPAQW-NIYAAQASDGDNWADDSPLCHEILAKKLLPVVRY 371 G T K + ++ N +DG +D L K VR Sbjct: 330 GTTDYKQGFKFAFDQLRNTNVSRANCNKIIMLFTDG---GEDKATETFKLYNKN-KTVRV 385 Query: 372 YSYIEITRRAHQTLWREYEHLQSTFDNFAMQHIRDQDDIYPVFRELFHKQN 422 +++ + + + + I + ++ + Sbjct: 386 FTFSVGQHNYDKGPIQWMACENKG-YYYEIPSIGAIRINTQEYLDVLGRPM 435 >UniRef50_A6GC99 Putative uncharacterized protein n=1 Tax=Plesiocystis pacifica SIR-1 RepID=A6GC99_9DELT Length = 546 Score = 78.7 bits (192), Expect = 5e-13, Method: Composition-based stats. Identities = 33/209 (15%), Positives = 56/209 (26%), Gaps = 30/209 (14%) Query: 240 EKRPDPSSQAVMFCLMDVSGSMDQSTKDMAKRFYILLYLFLSRTYKNVEVVYI------- 292 E P + ++D SGSM AK+ + L L + + Y Sbjct: 122 EAGQGPRPGLDLAIVLDRSGSMGGDKLRFAKQAGLDLVNRLDEQDRVTLISYDDTVTPLS 181 Query: 293 ---RHHTQAKEVDEHEFFYSQETGGTIVSSALKLMDE--VVKERYNPA-------QWNIY 340 R EV + Q G T + AL + + E + P + Sbjct: 182 NLQRVDDDGIEVLRRQLLDIQVGGTTALGPALFMGLQRLAAPEPFGPQTRTEARHDRLRH 241 Query: 341 AAQASDG-DNWADDSPLCH-EILAKKLLPVVRYYSYIEITRRAHQTLWREYEHLQSTFDN 398 SDG N + P +A+ V + ++ L + Sbjct: 242 VILLSDGIANVGETRPEVIGGRVAEHFGGGVSVSTLGMGL-DYNEDLMTRIADEGGGRYH 300 Query: 399 FAMQHIRDQDDIYPVFRELFHKQNATAKG 427 F + + P + A Sbjct: 301 FI-----EDAESIPAM---LGDELAGLTA 321 >UniRef50_Q0AV90 Putative uncharacterized protein n=1 Tax=Syntrophomonas wolfei subsp. wolfei str. Goettingen RepID=Q0AV90_SYNWW Length = 776 Score = 78.3 bits (191), Expect = 5e-13, Method: Composition-based stats. Identities = 39/340 (11%), Positives = 80/340 (23%), Gaps = 62/340 (18%) Query: 96 GGSGSGQGQASQDGEGQDEFVFQISKDEYLDLLFEDLALPNLKQNQQRQL----TEYKTH 151 G G+ E D + D P + + R T Sbjct: 151 PGKPLGKKIGPGRAEPTDR------------VPDADFISPPIGETGYRATLSLHVHNNTP 198 Query: 152 RAGYTANGVPANISVVRSLQNSLARRTAMTAGKRR---ELHALEENLAIISNSEPAQLLE 208 + + I + ++ + T R L E + I E Sbjct: 199 ISSIKSPSHKIRIDRMDEYSATITLQENNTRMNRDFVLNLKLDGETVPRIIYW-KNPKDE 257 Query: 209 EERLRKEIAELRAKIERVPFIDTFDLRYKNYEKRPDPSSQAVMFCLMDVSGSMDQSTKDM 268 EL +R P L+D+S SM+ + Sbjct: 258 YFACITYTPELPIIEQRQPK---------------------EYIFLIDISRSMEGKKIEH 296 Query: 269 AKRFYILLYLFLSRTYKNVEVVYIRHHT----QAKEVDEHEFFY-------SQETGGTIV 317 A + L + + + ++ ++ GGT + Sbjct: 297 AADAIQICLRNLDEGDSFNLLAFESENHAFAPKSLPYNQENLDKASAWVKNLHAMGGTNI 356 Query: 318 SSALKLMDEVVKERYNPAQWNIYAAQASDGDNWADDSPLCHEILAKKLLPVVRYYSYIEI 377 A++L + A+DG + +K + +S I Sbjct: 357 LPAVQLALKEA------GDQQKVVILATDGQ-VG--NENEIINYVRKRNQNLCLFSLG-I 406 Query: 378 TRRAHQTLWREYEHLQSTFDNFAMQHIRDQDDIYPVFREL 417 + + + F+ ++ + F + Sbjct: 407 DTAVNSYFINQIAEAGNGCAEFSYPGESLEEKMLRHFARI 446 >UniRef50_A6G415 von Willebrand factor, type A n=1 Tax=Plesiocystis pacifica SIR-1 RepID=A6G415_9DELT Length = 877 Score = 78.3 bits (191), Expect = 5e-13, Method: Composition-based stats. Identities = 25/162 (15%), Positives = 41/162 (25%), Gaps = 20/162 (12%) Query: 251 MFCLMDVSGSMDQSTKDMAKRFYILLYLFLSRTYKNVEVVYIRHHTQAKEV--------- 301 M ++D SGSM +AK+ L + + E Sbjct: 375 MIFVIDRSGSMSGVPLALAKQTLREALSHLRPVDTFNVISFESSTAMLYEAAVPANEQNL 434 Query: 302 --DEHEFFYSQETGGTIVSSALKLMDEVVKERYNPAQWNIYAAQASDG----DNWADDSP 355 E Q GGT++S A+ + Y +DG ++ Sbjct: 435 VHAERFIDGLQAGGGTMMSGAVDAAL----SPEIGLGRHRYVFFVTDGFISNEDEIARQA 490 Query: 356 LCHEILAKKLLPVVRYYSYIEITRRAHQTLWREYEHLQSTFD 397 A K R + I ++ L Sbjct: 491 SALVRAADKAGQRARVFGMG-IGSSPNRELLASLSKAGKGRY 531 >UniRef50_Q8H924 Os10g0464900 protein n=3 Tax=Oryza sativa RepID=Q8H924_ORYSJ Length = 646 Score = 78.3 bits (191), Expect = 5e-13, Method: Composition-based stats. Identities = 23/179 (12%), Positives = 50/179 (27%), Gaps = 33/179 (18%) Query: 251 MFCLMDVSGSMDQSTKDMAKRFYILLYLFLSRTYKNVEVVYIRHHTQAKEVDE------- 303 + ++DVSGSM + + K+ + L + + + ++ + Sbjct: 176 LVTVLDVSGSMVGNKLALLKQAMGFVIDNLGPGDRLCVISFSSGASRLMRLSRMTDAGKA 235 Query: 304 ---HEFFYSQETGGTIVSSALKLMDEVVKERYNPAQWNIYAAQASDGDNW-------ADD 353 GGT + +AL+ +V+ +R SDG + D Sbjct: 236 HAKRAVGSLSARGGTNIGAALRKAAKVLDDRLYRNAVESVIL-LSDGQDTYTVPPRGGYD 294 Query: 354 SPLCHEILA------------KKLLPVVRYYSYIEITRRAHQTLWREYEHLQSTFDNFA 400 ++ L P V + + + + +F Sbjct: 295 RDANYDALVPPSLVRADAGGGGGRAPPVHTFGFGK---DHDAAAMHTIAEVTGGTFSFI 350 >UniRef50_Q6L2C8 Putative uncharacterized protein n=1 Tax=Picrophilus torridus RepID=Q6L2C8_PICTO Length = 379 Score = 78.3 bits (191), Expect = 5e-13, Method: Composition-based stats. Identities = 30/185 (16%), Positives = 58/185 (31%), Gaps = 22/185 (11%) Query: 239 YEKRPDPSSQAVMFCLMDVSGSMDQSTKDMAKRFYILLYLFLSRTYKNVEVVYIRHHTQA 298 Y ++ +S +DVS SM + D+AK + L + R V I A Sbjct: 28 YPEKTVKASGFHYIIAIDVSNSMRKGKLDLAKEGAMNLIEKIPRDN---IVSLIAFGDTA 84 Query: 299 KEVDE--------HEFFYSQETGGTIVSSALKLMDEVVKERYNPAQWNIYAAQASDGDNW 350 K + E + G T + +AL ++ + P + +DG Sbjct: 85 KVIVEGKEPTFALEAIPSLKVAGNTAMYTALLTATKLADKYNMPGR----IILLTDGMPT 140 Query: 351 ADDSPLCHEILAKKLLPVVRYYSYIEITRRAHQTLWREYEHLQSTFDNFAMQHIRDQDDI 410 +E L ++ I L + ++ H+ + +++ Sbjct: 141 DVSMNESYENL--QVPEGFTIDCIG-IGDNYRDDLLKLLADKGNS----IFYHLENPEEL 193 Query: 411 YPVFR 415 V Sbjct: 194 PKVME 198 >UniRef50_B0JR39 von Willebrand factor type A n=1 Tax=Microcystis aeruginosa NIES-843 RepID=B0JR39_MICAN Length = 724 Score = 78.3 bits (191), Expect = 6e-13, Method: Composition-based stats. Identities = 32/199 (16%), Positives = 58/199 (29%), Gaps = 35/199 (17%) Query: 239 YEKRPDPS----SQAVMFCLMDVSGSMDQSTKDM-AKRFYILLYLFLSRTYKNVEVVYIR 293 P + + L+D SGSM+ K AK + VV Sbjct: 38 LTVTERPPIVEQNPQSVVMLIDTSGSMNDDNKLQEAKNAAKAFIERQDPSVNRFAVVGFG 97 Query: 294 HH-------TQAKEVDEHEFFYSQETGGTIVSSALKLMDEVVKERYNPAQWNIYAAQASD 346 T + GGT + L E ++ + + + +D Sbjct: 98 SQVQIGTGLTSDLATLNQAIDNLSDGGGTRMDLGLATAIEQLESSSS----DRHILLFTD 153 Query: 347 GDNWADDSPLCHEILAKKLLPVVRYYSYIEITRRAHQTLWREYEHLQSTFDNFAMQHIRD 406 G SP +I+ +++T ++ E +Q NFA + + Sbjct: 154 GQPAPAPSPEQIDIMI-----------VLDVT----SSMNEEIAGVQQGIQNFAQELKKR 198 Query: 407 QDDIYPVF----RELFHKQ 421 + D LF ++ Sbjct: 199 KLDAQIGLIAFGDRLFGEE 217 >UniRef50_A2FKC6 von Willebrand factor type A domain containing protein n=4 Tax=Trichomonas vaginalis RepID=A2FKC6_TRIVA Length = 667 Score = 78.3 bits (191), Expect = 6e-13, Method: Composition-based stats. Identities = 22/163 (13%), Positives = 51/163 (31%), Gaps = 22/163 (13%) Query: 251 MFCLMDVSGSMDQSTKDMAKRFYILLYLFLSRTYKNVEVVY----------IRHHTQAKE 300 F ++D SGSM+ D A + L+ L + + + ++ + Sbjct: 241 YFFVIDCSGSMEGKLIDKAVKCMRLMLQSLPMKCRFSIYCFGYNFRQLLPIVEYNNENVL 300 Query: 301 VDEHEFFYSQET-GGTIVSSALKLMDEVVKERYNPAQWNIYAAQASDGDNWADDSPLCHE 359 + + + GGT + +K+ ++ +DG+ D+ Sbjct: 301 LAMNLIKNIKANMGGTNI-------YNPLKDIFSQDGMLKKIFLLTDGE---VDNSEEII 350 Query: 360 ILAKKLLPVVRYYSYIEITRRAHQTLWREYEHLQSTFDNFAMQ 402 L +K Y+ + L R + + + + Sbjct: 351 NLVEKNKAFGNIYTVGIGSGA-DPGLIRNLAEVTNGKWTYVLD 392 >UniRef50_A9FM70 Putative uncharacterized protein n=1 Tax=Sorangium cellulosum 'So ce 56' RepID=A9FM70_SORC5 Length = 507 Score = 78.3 bits (191), Expect = 6e-13, Method: Composition-based stats. Identities = 33/198 (16%), Positives = 55/198 (27%), Gaps = 15/198 (7%) Query: 238 NYEKRPDPSSQAVMFCLMDVSGSMDQSTKDMAKRFYILLYLFLSRTYKNVEVVYIRHHTQ 297 R P + A + L+D SGSM + A+ L V V Q Sbjct: 111 ARAARGQPRAPAAVVLLVDASGSMQGPKMENARAAAQAFVDRLPDGD-LVSVASFADTAQ 169 Query: 298 AKEVD-----------EHEFFYSQETGGTIVSSALKLMDEVVKERYNPAQWNIYAAQASD 346 A+ G T + + LKL ++ + SD Sbjct: 170 ARVAPTVLGRSTRPAVARAIAALGPDGSTNLFAGLKLAEQHALAAPSTHAVRRVVL-ISD 228 Query: 347 GD-NWADDSPLCHEILAKKLLPVVRYYSYIEITRRAHQTLWREYEHLQSTFDNFAMQHIR 405 G N SP LA++ + I + + + S+ + + R Sbjct: 229 GQANIGPSSPDILGALAQRGAAHGVQITSIGVGADYDERTLNALA-VGSSGRLYHLTEAR 287 Query: 406 DQDDIYPVFRELFHKQNA 423 + + L A Sbjct: 288 EMSSVLERELALLQTTAA 305 >UniRef50_B8HSI1 von Willebrand factor type A n=8 Tax=Cyanobacteria RepID=B8HSI1_CYAP4 Length = 589 Score = 77.9 bits (190), Expect = 7e-13, Method: Composition-based stats. Identities = 27/248 (10%), Positives = 64/248 (25%), Gaps = 25/248 (10%) Query: 169 SLQNSLARRTAMTAGKRRELHALEENLAIISNSEPAQLLEEERLRKEIA-ELRAKIERVP 227 + + + + + A L P + + P Sbjct: 331 TEKAAAEQFIEYLRSPAAQAIATNLGLRSGVPGTPLGAKFTAEFGVNPQPKYDSYRPPQP 390 Query: 228 FIDTFDLRYKNYEKRPDPSSQAVMFCLMDVSGSMDQSTKDMAKRFYILLYLFLSRTYKNV 287 + T L K++ S ++ ++D SGSM + LS + Sbjct: 391 EVVTTML--KSWSDFAKKPS--LVVIVVDTSGSMAGEKLANVQNTLNTYINGLSPQDQVA 446 Query: 288 EVVYIRHHTQAKEVD---------EHEFFYSQETGGTIVSSALKLMDEVVKERYNPAQWN 338 + + VD + G T + A + + N Sbjct: 447 LMRFSSDVGTPVVVDGTPAGRDRGLQFISSLRANGNTHLYDATLAARNWLTQNLRSDAIN 506 Query: 339 IYAAQASDGDNWADDSPLCHEILAKKLLP-------VVRYYSYIEI-TRRAHQTLWREYE 390 +DG++ S + E L +L + +++ ++ Sbjct: 507 -AVLVLTDGEDTG--SAISLEQLGPELQKSGFNSDQRISFFTVGYGEEGEFDPQALQQIA 563 Query: 391 HLQSTFDN 398 ++ + + Sbjct: 564 NVNGGYYS 571 >UniRef50_B0UK93 LPXTG-motif cell wall anchor domain protein n=1 Tax=Methylobacterium sp. 4-46 RepID=B0UK93_METS4 Length = 761 Score = 77.9 bits (190), Expect = 7e-13, Method: Composition-based stats. Identities = 33/241 (13%), Positives = 56/241 (23%), Gaps = 28/241 (11%) Query: 201 SEPAQLLEEERLRKEIAELRAKIERVPFIDTFDLRYKNYEKRPDPSSQAV------MFCL 254 P + L A A F + P +A + + Sbjct: 320 DGPVPADRDFALTWRAAPSAAP-AVGLFRERVGEDEYLLAVVTPPEGRAPARRPREVTFV 378 Query: 255 MDVSGSMDQSTKDMAKRFYILLYLFLSRTYKNVEVVY-------IRHHTQAKEVDEHEFF 307 +D SGSM ++ AK ++ L + + + A E Sbjct: 379 IDNSGSMAGASMRQAKASLLVALDRLGPADRFNVIRFDDTMDLLFPAPVPADEAHRDAAR 438 Query: 308 Y----SQETGGTIVSSALKLMDEVVKERYNPAQWNIYAAQASDGDNWADDSPLCHEILAK 363 + GGT + L+ + +DG A + Sbjct: 439 RFVAALEARGGTEMLPPLRAA--LADPHPEEGDRVRQIVFLTDG---AIGNEEQIFSAIS 493 Query: 364 KLLPVVRYYSYIEITRRAHQTLWREYEHLQSTFDNFAMQHIRDQDDIYPVFRELFHKQNA 423 R + I + L L I D + EL K + Sbjct: 494 AGRGRSRLFMIG-IGSAPNGHLMTHAAELGGGSY----TAIGTIDQVAERTAELLAKLES 548 Query: 424 T 424 Sbjct: 549 P 549 >UniRef50_Q1DE81 von Willebrand factor type A domain protein n=2 Tax=Myxococcales RepID=Q1DE81_MYXXD Length = 860 Score = 77.5 bits (189), Expect = 1e-12, Method: Composition-based stats. Identities = 26/190 (13%), Positives = 50/190 (26%), Gaps = 23/190 (12%) Query: 246 SSQAVMFCLMDVSGSMDQSTKDMAKRFYILLYLFLSRTYKNVEVVY-----------IRH 294 + + ++DVSGSM + A+ L L + + + + Sbjct: 279 PPKQEVVFVVDVSGSMAGESLPQAQAALRLCLRHLREGDRFNVIAFENRFQSFQPEPVPF 338 Query: 295 HTQAKEVDEHEFFYSQETGGTIVSSALKLMDEVVKERYNPAQWNIYAAQASDGDNWADDS 354 + E + GGT + + ++ + P +DG + Sbjct: 339 TQRTLEEADRWVAALNADGGTELLAPMRAAVQAA-----PDG---VIVLLTDGQ---VGN 387 Query: 355 PLCHEILAKKLLPVVRYYSYIEITRRAHQTLWREYEHLQSTFDNFAMQHIRDQDDIYPVF 414 + R YS+ I L R+ F R D + F Sbjct: 388 EAEILRAVLEARKTARVYSFG-IGTNVSDVLLRDMAKQTGGDVEFIHPGERIDDKVVAQF 446 Query: 415 RELFHKQNAT 424 + Sbjct: 447 SRALAPRVTE 456 >UniRef50_UPI0000F2DDBB PREDICTED: similar to Inter-alpha (globulin) inhibitor H4 (plasma Kallikrein-sensitive glycoprotein) n=1 Tax=Monodelphis domestica RepID=UPI0000F2DDBB Length = 819 Score = 77.5 bits (189), Expect = 1e-12, Method: Composition-based stats. Identities = 39/333 (11%), Positives = 86/333 (25%), Gaps = 36/333 (10%) Query: 117 FQISKDEYLDLLFEDLALP-NLKQNQQRQLTEYKTHRAGYTANGVPANISVVRSLQNSLA 175 F++ +E L L ++ Q + + + + G+ + + + + L Sbjct: 130 FELVYEELLKRHLGKYELMLMIQPKQLVKQLQVDIYI--FEPQGISSLENDITFMTKKLE 187 Query: 176 RRTAMTAGKRRELHALEENLAIISNSEPAQ----LLEEERLRKEIAELRAKIERVPFIDT 231 T K ++ + + + + R I+ Sbjct: 188 DALTKTQNK---TEVHIAFKPSLAQQQKEPWKLNTVVDGKFIVRYDVDRVTTAGDIQIEN 244 Query: 232 FDLRYKNYEKRPDPSSQAVMFCLMDVSGSMDQSTKDMAKRFYILLYLFLSRTYKNVEVVY 291 N+ P + L+D SGSM K I + L + + Sbjct: 245 -GYFVHNFAPTQLPMVPKNIVFLIDKSGSMAGRKIKKTKAALIKILDDLKPEDHFNMITF 303 Query: 292 IRHHTQAK--------EVDEHE---FFYSQETGGTIVSSALKLMDEVVKERYN----PAQ 336 H T+ K E + + G T V+ A+ ++ E P Sbjct: 304 SGHVTRWKPELVLALDEHLKEAKTFLSNTPALGVTNVNGAVLAAVSMLDESNKKKELPEG 363 Query: 337 WNIYAAQASDGDNWADDSPLCHEILAKKLLPVVR----YYSYIEITRRAHQTLWREYEHL 392 +DGD + + + + + + +R + + Sbjct: 364 SVSMIILLTDGD--STEGETKLQKIHENVKAAIRGQYHLFCLGFGF-DINYVFLERLALD 420 Query: 393 QSTFDNFAMQHIRDQ---DDIYPVFRELFHKQN 422 + + + D Y Q Sbjct: 421 NGGMARHIFEGLDAELQLQDFYQEVANPLLTQV 453 >UniRef50_C1I2R0 von Willebrand factor n=1 Tax=Clostridium sp. 7_2_43FAA RepID=C1I2R0_9CLOT Length = 960 Score = 77.1 bits (188), Expect = 1e-12, Method: Composition-based stats. Identities = 28/183 (15%), Positives = 52/183 (28%), Gaps = 28/183 (15%) Query: 254 LMDVSGSMDQS-----TKDMAKRFYILLYLFLSRTYKNVEVVY--------IRHHTQAKE 300 ++D SGSM +AK + L + + + KE Sbjct: 411 IIDKSGSMSAEGGGVSKLTLAKEAAMKALENLREVDEISVIAFDDTYDEVVPLQKVGDKE 470 Query: 301 VDEHEFFYSQETGGTIVSSALKLMDEVVKERYNPAQWNIYAAQASDG-DNWADDSPLCHE 359 + Q GGT + AL+ + + + I +DG D + D+ Sbjct: 471 AIKELISGIQIRGGTSIYPALEQGYNMQMQSSAKIKHTI---LLTDGQDGYGLDNYATLL 527 Query: 360 ILAKKLLPVVRYYSYIEITRRAHQTLWREYEHLQSTFDNFAMQHIRDQDDIYPVFRELFH 419 + + E + L + + + DIY +F Sbjct: 528 QNFIDNNITLSTVAVGEGA---NAGLLNQLASIGKGRSYY--------TDIYTDIPRIFA 576 Query: 420 KQN 422 K+ Sbjct: 577 KEV 579 >UniRef50_UPI0001C1630F hypothetical protein CRD_00534 n=2 Tax=Nostocaceae RepID=UPI0001C1630F Length = 587 Score = 77.1 bits (188), Expect = 1e-12, Method: Composition-based stats. Identities = 29/261 (11%), Positives = 66/261 (25%), Gaps = 14/261 (5%) Query: 132 LALPNL---KQNQQRQLTEYKTHRAGYTANGVPANISVVRSLQNSLARRTAMTAGKRREL 188 L L ++ + AN S + + Sbjct: 292 LELKDISIRDPKKLSVFVSEGLTFNSLKKIPEFANSSFIPFGVPHNNPLVKFKWTTPEQQ 351 Query: 189 HALEENLAIISNSEPAQLLEEERLRKEIAELRAKIERVPFIDTFDLRYKNYEKRPDPSSQ 248 LE + L + ++ +P + L ++ + D Sbjct: 352 EGLELFAKFAQSDPMQNLAPRMPPEVAQYLAQKQVPPIPSGEVLSLGQTFWKTQKDAGKT 411 Query: 249 AVMFCLMDVSGSMDQSTKDMAKRFYILLYLFLSRTYKNVEVVY----------IRHHTQA 298 + ++D SGSM + K + ++ V Y Sbjct: 412 VYLMTVIDTSGSMSGGPLEAVKNGLRIASQQINPGNYVGLVSYGDQPINLVKLAPFDDLQ 471 Query: 299 KEVDEHEFFYSQETGGTIVSSALKLMD-EVVKERYNPAQWNIYAAQASDGDNWADDSPLC 357 + + G T + + + E++++R Y +DG + Sbjct: 472 HKRFLAGIDGLEADGATAMYDGVMVGLSELLQQRKTNPNGKFYLLLLTDGQTNQGFNFEQ 531 Query: 358 HEILAKKLLPVVRYYSYIEIT 378 + + + V +Y E+ Sbjct: 532 VKEIIEYSGVRVYPIAYGEVN 552 >UniRef50_Q2QSE5 Os12g0431700 protein n=3 Tax=Oryza sativa RepID=Q2QSE5_ORYSJ Length = 524 Score = 77.1 bits (188), Expect = 1e-12, Method: Composition-based stats. Identities = 25/155 (16%), Positives = 47/155 (30%), Gaps = 15/155 (9%) Query: 251 MFCLMDVSGSMDQSTKDMAKRFYILLYLFLSRTYKNVEVVYIRHH---TQAKEVDEH--- 304 + ++DVSGSM + K+ + + L+ + V + T+ + + + Sbjct: 63 LVAVVDVSGSMRGHKIESVKKALQFVIMKLTPVDRLSIVTFESSAKRLTKLRAMTQDFRG 122 Query: 305 ----EFFYSQETGGTIVSSALKLMDEVVKERYNPAQWNIYAAQASDGDNWADDSPLCHEI 360 GGT + + L L V+ +R SDG S Sbjct: 123 ELDGIVKSLIANGGTDIKAGLDLGLAVLADRVFTESRTANIFLMSDGKLEGKTSGDP--- 179 Query: 361 LAKKLLPVVRYYSYIEITRRAHQTLWREYEHLQST 395 + V Y++ H L + Sbjct: 180 -TQVNPGEVSVYTFGFGHGTDH-QLLTDIAKNSPG 212 >UniRef50_C6WL97 VWA containing CoxE family protein n=1 Tax=Actinosynnema mirum DSM 43827 RepID=C6WL97_ACTMD Length = 1295 Score = 77.1 bits (188), Expect = 1e-12, Method: Composition-based stats. Identities = 49/302 (16%), Positives = 89/302 (29%), Gaps = 29/302 (9%) Query: 47 DSGESVSIPTEDISEPMFHQGRGGLRHRVHPGNDHFVQNDRIERPQGGGGGSGSGQGQA- 105 G +IP E + G R G + D + G G G + Sbjct: 919 AEGAETAIPAELPPVLRWRLLLGARGDRPAGGGRYAAALDELYGHDRGEGADSGSLGGSA 978 Query: 106 -------SQDGEGQDEFVFQIS---KDEYLDLLFEDLALP---NLKQNQQRQLTEYKTHR 152 E DE ++E L E L + R + + Sbjct: 979 GGDGDPFPVVREWSDELKALFGDRVREEVLAAAAEGGRLEAALEIDPTSVRPSVDLLRNV 1038 Query: 153 AGYTANGVPANISVVRSLQNSLARRTAMTAGKRRELHALEENLAIISNSEPAQLLEEERL 212 ++ +R L L R A R L + +L L Sbjct: 1039 LSLAGGLSEDALARLRPLVARLVRELAEQLANRIRPALTGMQLPFPTRRPGGKLDLPRTL 1098 Query: 213 RKEIAELRAKIERVPFIDTFDLRYKNYEKRPDPSSQAVMFCLMDVSGSMDQSTKDMAKRF 272 R +A R + ++ R ++ + ++DVSGSM+ A Sbjct: 1099 RANLATARRDEHGKVVVIPERPVFR---SRGRKANDWRLILVVDVSGSME------ASTV 1149 Query: 273 YILLYLFLSRTYKNVEVVYIRHHTQAKEVDEHEFFYS------QETGGTIVSSALKLMDE 326 + L + +++ ++ TQ ++ E + GGT ++ AL+ + Sbjct: 1150 WAALTASVFAGVRSLTTHFLAFSTQVVDLSERVADPLSLLLEVKVGGGTHIAGALRHARD 1209 Query: 327 VV 328 +V Sbjct: 1210 LV 1211 >UniRef50_D2VHB8 Predicted protein n=1 Tax=Naegleria gruberi RepID=D2VHB8_NAEGR Length = 755 Score = 77.1 bits (188), Expect = 1e-12, Method: Composition-based stats. Identities = 29/196 (14%), Positives = 54/196 (27%), Gaps = 28/196 (14%) Query: 234 LRYKNYEKRPDPSSQAVMFCLMDVSGSMDQS--------------TKDMAKRFYILLYLF 279 + K + C++DVSGSM S D+ K L Sbjct: 116 IHVKVSPPTGGQRQPCNLVCILDVSGSMGSSAEDLSSSNENTGFSRLDLVKHSVRTLIEL 175 Query: 280 LSRTYKNVEVVYI----------RHHTQAKEVDEHEFFYSQETGGTIVSSALKLMDEVVK 329 ++ + + + + K+ + + G T V L+L E Sbjct: 176 MNEKDQISLIPFSDSARMELPLTKMDAVGKKKAIEKLEHLGPEGSTNVWDGLRLGMESSL 235 Query: 330 ERYNPAQWNIYAAQASDGD---NWADDSPLCHEILAKKLLPVVRYYSYIEITRRAHQTLW 386 A+ N +DG+ N E K+ +S+ L Sbjct: 236 NNPLCAKTNTCLILFTDGEPNINPPRGIVPTLEKYIKEHPLNSTIHSFGFGYS-LDSALL 294 Query: 387 REYEHLQSTFDNFAMQ 402 ++ S ++ Sbjct: 295 KDIAMNGSGAYSYIPD 310 >UniRef50_UPI0000E4A663 PREDICTED: similar to calcium activated chloride channel 1 precursor n=2 Tax=Strongylocentrotus purpuratus RepID=UPI0000E4A663 Length = 1245 Score = 77.1 bits (188), Expect = 1e-12, Method: Composition-based stats. Identities = 20/140 (14%), Positives = 44/140 (31%), Gaps = 12/140 (8%) Query: 248 QAVMFCLMDVSGSMD-QSTKDMAKRFYILLYLFLSRTYKNVEVVYIRHHTQAKEVDE--- 303 + + ++D SGSM + D + V + T + + Sbjct: 522 ECRVVLVLDTSGSMGTSNRIDKVNSAATAFVNLVDDGISIGIVTFTGSPTTRHALTQINT 581 Query: 304 -------HEFFYSQETGGTIVSSALKLMDEVVKERYNPAQWNIYAAQASDGDNWADDSPL 356 + F +GGT + L+ EV+ + + +DG + + + Sbjct: 582 QADRDSLRDIFQLTASGGTCIGCGLEQGLEVLMAHPSGSADGGIIVLMTDGQDSGIQNHI 641 Query: 357 CHEILAKKLLPVVRYYSYIE 376 + L + + V + E Sbjct: 642 IRQTL-QDMGVRVNTVAIGE 660 >UniRef50_C8NWP6 Putative uncharacterized protein n=1 Tax=Corynebacterium genitalium ATCC 33030 RepID=C8NWP6_9CORY Length = 1152 Score = 77.1 bits (188), Expect = 1e-12, Method: Composition-based stats. Identities = 41/354 (11%), Positives = 94/354 (26%), Gaps = 31/354 (8%) Query: 68 RGGLRHRVHPGNDHFVQNDRIERPQGGGGGSGSGQGQASQDGEGQDEFVFQISKDEYLDL 127 R + + + + G GG + + DE ++ D+ Sbjct: 805 SPTQRRLARSLDQLYGRGEGEGSTAGSMGGRAGNEPPYPTARDWVDELDALFGEEVREDI 864 Query: 128 LFEDL------ALPNLKQNQQRQLTEYKTHRAGYTANGVPANISVVRSLQNSLARRTAMT 181 + A+ L + Q R E + + + +R + + Sbjct: 865 AATAVDTRHPYAMELLTETQPRASVELLSDVLTLAGGMPESVLDKLRPVLRRMVEELTQV 924 Query: 182 AGKRRELHALEENLAIISNSEPAQLLEEERLRKEIAELRAKIERVPFIDTFDLRYKNYEK 241 + S +L + + + + P + ++ Sbjct: 925 LASQLRPALRGLQGWRPSTRPSPELDPLSTIHRNLRHAVLDSDGAPQLVVATPIFRQPIA 984 Query: 242 RPDPSSQAVMFCLMDVSGSMDQSTKDMAKRFYILLYLFLSRTYKNVEVVYIRHHTQAKEV 301 S+ + ++DVS SM+ S + L + + V ++ T+ ++ Sbjct: 985 ---KRSEWHVIVVVDVSASMEPS------TVFAALTASILSGVDALSVTFLAFSTEVIDL 1035 Query: 302 DEHEFFYS------QETGGTIVSSALKLMDEVVKERYNPAQWNIYAAQASDGDNWADDSP 355 H GGT ++ AL + E V P++ SD D + Sbjct: 1036 SGHVSDPLSLLLEIHIGGGTNIAGALAVAHEHVT---VPSRT--LLITISDFDEYG-SVE 1089 Query: 356 LCHEILAKKLLPVVRYYSYI----EITRRAHQTLWREYEHLQSTFDNFAMQHIR 405 + VR R + + + + + + Sbjct: 1090 RLLARVQALNNAGVRLLGCAALDDTGQARYNVGIAGQLADVGMAVSAVSPTALA 1143 >UniRef50_B1HPN0 Putative uncharacterized protein n=2 Tax=Bacillaceae RepID=B1HPN0_LYSSC Length = 825 Score = 76.7 bits (187), Expect = 2e-12, Method: Composition-based stats. Identities = 26/159 (16%), Positives = 47/159 (29%), Gaps = 21/159 (13%) Query: 251 MFCLMDVSGSMDQSTKDMAKRFYILLYLFLSRTYKNVEVVYIRHHTQAKEVDE------- 303 + ++D SGSM S ++AK L +I + E+ E Sbjct: 369 LVIVLDRSGSMSGSKLELAKEAAARSVEMLRDEDTLG---FIAFDDRPWEIIETGPLNNK 425 Query: 304 ----HEFFYSQETGGTIVSSALKLMDEVVKERYNPAQWNIYAAQASDGDNWADDSPLCHE 359 GGT + +L E + + + +DG P ++ Sbjct: 426 EEAVDTILSVTPGGGTEIYGSLAKAYENLADMKLQRKH---IILLTDGQ----SQPGNYD 478 Query: 360 ILAKKLLPVVRYYSYIEITRRAHQTLWREYEHLQSTFDN 398 L ++ S + I + A L + S Sbjct: 479 DLIEQGKDNGITLSTVAIGQDADANLLEALSEMGSGRFY 517 >UniRef50_A3QDW1 Vault protein inter-alpha-trypsin domain protein n=1 Tax=Shewanella loihica PV-4 RepID=A3QDW1_SHELP Length = 776 Score = 76.3 bits (186), Expect = 2e-12, Method: Composition-based stats. Identities = 23/206 (11%), Positives = 50/206 (24%), Gaps = 29/206 (14%) Query: 239 YEKRPDPSSQAVMFCLMDVSGSMDQSTKDMAKRFYILLYLFLSRTYKNVEVVYIRHHTQA 298 + + + ++D SGSM + AK + L I + Sbjct: 392 PQDKARVRLPRELTLVIDTSGSMTGDSIAQAKSAILNALAGLGSQDT---FNVIAFDSSV 448 Query: 299 KEVDEHEFF--------------YSQETGGTIVSSALKLMDEV-------VKERYNPAQW 337 + + + GGT ++ AL + P + Sbjct: 449 RSLSPVALSATAANLGKANLFVQSLEADGGTEMAPALLRALSQPESGVSSISSAVKPERL 508 Query: 338 NIYAAQASDGDNWADDSPLCHEILAKKLLPVVRYYSYIEITRRAHQTLWREYEHLQSTFD 397 +DG A + L + R ++ I + Sbjct: 509 KQVVF-ITDG---AVGNEASLFALIAANIGRQRLFTVG-IGAAPNGYFMERAARAGRGTY 563 Query: 398 NFAMQHIRDQDDIYPVFRELFHKQNA 423 + + I + ++ Q + Sbjct: 564 TYVGKISEVDAKIGELLEKIESPQIS 589 >UniRef50_C5YMJ6 Putative uncharacterized protein Sb07g002215 (Fragment) n=1 Tax=Sorghum bicolor RepID=C5YMJ6_SORBI Length = 423 Score = 76.3 bits (186), Expect = 2e-12, Method: Composition-based stats. Identities = 20/109 (18%), Positives = 33/109 (30%), Gaps = 11/109 (10%) Query: 251 MFCLMDVSGSMDQSTKDMAKRFYILLYLFLSRTYKNVEVVYIRH----------HTQAKE 300 + ++DVSGSM + KR L L + V + K Sbjct: 127 LVTVLDVSGSMAGKKMERVKRAMGFLIDNLGSDDRLSVVAFSTDARRIIRLTRMSDDGKA 186 Query: 301 VDEHEFFYSQETGGTIVSSALKLMDEVVKERYNPAQWNIYAAQASDGDN 349 + +G T + L + V+ R + SDG + Sbjct: 187 AAKRAVESLAASGSTNIRGGLDVAAMVLDGRRHKNAVASVIL-LSDGQD 234 >UniRef50_A3U9M7 Putative uncharacterized protein n=1 Tax=Croceibacter atlanticus HTCC2559 RepID=A3U9M7_9FLAO Length = 244 Score = 76.3 bits (186), Expect = 2e-12, Method: Composition-based stats. Identities = 12/97 (12%), Positives = 33/97 (34%), Gaps = 1/97 (1%) Query: 137 LKQNQQRQLTEYKTHRAGYTANGVPANISVVRSLQNSLARRTAMTAGKRRELHALEENLA 196 +++ + L E T + + +I V R+ +++ R + + +EE + Sbjct: 149 VREQESYALVEENTFKTASVSPLSTFSIDVDRASYSNIRRMINNSQPIPVDAVKVEEMIN 208 Query: 197 IISNSEPAQLLEEERLRKEIAELRAKIERVPFIDTFD 233 S + P + + +++ I Sbjct: 209 YFSYNYPEPKT-NDPFSEYTELVQSPWNTNTSILRIG 244 >UniRef50_A4YGI9 von Willebrand factor, type A n=1 Tax=Metallosphaera sedula DSM 5348 RepID=A4YGI9_METS5 Length = 363 Score = 76.3 bits (186), Expect = 2e-12, Method: Composition-based stats. Identities = 34/206 (16%), Positives = 61/206 (29%), Gaps = 31/206 (15%) Query: 239 YEKRPDPSSQAVMFCLMDVSGSMDQSTKDMAKRFYILLYLFLSRTYKNVEVVYIRHHTQA 298 + + L+D S SM +MAK LL L + + + + + Sbjct: 7 VPESKFEAKNLHYVILIDRSYSMKGEKLEMAKEGARLLVDNLPKDSRFSLLAFNEKVSII 66 Query: 299 KE-----VDEHEFFYSQETGGTIVSSALKLMDEVVKERYNPAQWNIYAAQASDGDNWADD 353 KE E + GT + AL+ + ++ Y +DG Sbjct: 67 KEHEHPSEMGKELESLKVGSGTAMYKALQEAFNLARKY----GEPTYVILLTDGV---PS 119 Query: 354 SPLCHEILAKK--------------LLPVVRYYSYIEITRRAHQTLWREYEHLQSTFDNF 399 C L++K + V+ S+ I + + E F Sbjct: 120 DMGCMPGLSRKFDLNRCLPVYQGLSVPENVQIISFG-IGDDYSEEILTEVSEKGRGFFY- 177 Query: 400 AMQHIRDQDDIYPVFRELFHKQNATA 425 H+ D I +L + A + Sbjct: 178 ---HVTDPAQIPEKMPKLVKSEVAAS 200 >UniRef50_B4UFP8 von Willebrand factor type A n=1 Tax=Anaeromyxobacter sp. K RepID=B4UFP8_ANASK Length = 480 Score = 76.3 bits (186), Expect = 2e-12, Method: Composition-based stats. Identities = 33/190 (17%), Positives = 56/190 (29%), Gaps = 21/190 (11%) Query: 247 SQAVMFCLMDVSGSMDQSTKDMAKRFYILLYLFLSRTYKNVEVVYIRH----------HT 296 S + ++DVSGSM A + + L L+ VV+ Sbjct: 38 SPVCVIPVLDVSGSMHGEKLHFATQSIMKLVDHLAPGDFCGVVVFSTEVETLAAPTEMTQ 97 Query: 297 QAKEVDEHEFFYSQETGGTIVSSALKLMDEVVKERYNPAQWNIYAAQASDG-DNWA-DDS 354 K+ + + T ++ L + K P + +DG N S Sbjct: 98 DRKDALKVALGRLRPRHNTNLAGGLLAGLDHAKVTKVPDGMPVRVILFTDGLANEGPATS 157 Query: 355 PLCHEILAKKLLPVVRYYSYIEITRRAHQTLWREYEHLQSTFDNFAMQHIRDQDDIYPVF 414 P L + L ++ A Q L RE L + +R +D Sbjct: 158 PEGLCALLEANLGTASVSAFGY-GDDADQELLRELSTLGRGNYAY----VRSPEDALTA- 211 Query: 415 RELFHKQNAT 424 F ++ Sbjct: 212 ---FARELGG 218 >UniRef50_C7RNW6 von Willebrand factor type A n=1 Tax=Candidatus Accumulibacter phosphatis clade IIA str. UW-1 RepID=C7RNW6_9PROT Length = 452 Score = 76.0 bits (185), Expect = 2e-12, Method: Composition-based stats. Identities = 28/194 (14%), Positives = 49/194 (25%), Gaps = 16/194 (8%) Query: 220 RAKIERVPFIDTFDLRYKNYEK---RPDPSSQAVMFCLMDVSGSMDQSTKDMAKRFYILL 276 A I V +R + + + ++D SGSM A R + Sbjct: 15 PALIAGVAQKLPVLIRVQAPDPLATEKKARKPYHLALVIDRSGSMSGPPLAEAVRCAKHI 74 Query: 277 YLFLSRTYKNVEVVY--------IRHHTQAKEVDEHEFFYSQETGGTIVSSALKLMDEVV 328 L T VV+ ++ G T + + + + Sbjct: 75 ADQLEPTDIASLVVFDDRVQTLVPPRPVGDRQALHLALSRVHSGGSTNLHGGWQAGADGL 134 Query: 329 KERYNPAQWNIYAAQASDGD-NWA-DDSPLCHEILAKKLLPV-VRYYSYIEITRRAHQTL 385 A SDG+ N P L + V +Y + ++ L Sbjct: 135 LPAAGQAALARVIL-LSDGNANVGEITDPAGIAALCAQAAERGVSTSTYG-LGSHFNEDL 192 Query: 386 WREYEHLQSTFDNF 399 E + Sbjct: 193 MVEMAKRGGGNHYY 206 >UniRef50_B1X316 Putative uncharacterized protein n=1 Tax=Cyanothece sp. ATCC 51142 RepID=B1X316_CYAA5 Length = 547 Score = 76.0 bits (185), Expect = 2e-12, Method: Composition-based stats. Identities = 32/254 (12%), Positives = 75/254 (29%), Gaps = 34/254 (13%) Query: 187 ELHALEENLAIISNSEPAQLLEEERLRKEI-AELRAKIERVPFIDTFDLRYKNYEKRPDP 245 + +E + P + + ++L + + ++ + + L + R Sbjct: 301 KTQLIEGVFNQLEAENPRKFQQIKQLNPPLPNTVTKEVTPIAEVHRQLLDNFHPSVRQKR 360 Query: 246 SSQAVMFCLMDVSGSMDQSTKDMAKRFYILLY-------LFL-SRTYKNVEVVYIRHHT- 296 + ++D SGSM + + L FL S + ++Y R+ Sbjct: 361 ----WIIGIIDASGSMRGQGYEQLLAAFSELLEPQKAKDNFLYSPDDRFSLIIYQRNDAY 416 Query: 297 -------QAKEVDEHEF-----FYSQETGGTIVSSALKLMDEVVKERYNPAQWNIYAAQA 344 +E+ + GGT V L + E ++ P+ + + Sbjct: 417 QIPLNSQPGEEISRETLWETLQKEVKPGGGTPVDKGLIMGLETAQK--IPSDYKLEIFLF 474 Query: 345 SDGDNWADDSPLCHE--ILAKKLLPVVRYYSYIEITRRAHQTLWREYEHLQSTFDNFAMQ 402 +DG +P + P + + + ++ + Sbjct: 475 TDGQFRDPVTPELLNIYQSIQDKNPELTIVGAGGV----NTQQLQQLSAKLDARPIISQN 530 Query: 403 HIRDQDDIYPVFRE 416 D++ FRE Sbjct: 531 ASETLDELLKAFRE 544 >UniRef50_Q4SBF6 Chromosome 11 SCAF14674, whole genome shotgun sequence. (Fragment) n=16 Tax=Euteleostomi RepID=Q4SBF6_TETNG Length = 1039 Score = 76.0 bits (185), Expect = 3e-12, Method: Composition-based stats. Identities = 25/204 (12%), Positives = 64/204 (31%), Gaps = 25/204 (12%) Query: 239 YEKRPDPSSQAVMFCLMDVSGSMDQSTKDMAKRFYILLYLFLSRTYKNVEVVY---IRH- 294 + + P + ++D+SGSM + + + + L +++ I+ Sbjct: 423 FAPKDLPRLPKNVVFVIDMSGSMSGTKMQQTREAMLKILEDLDPEDHFGIILFDHRIQFW 482 Query: 295 HTQAKEVDEHEFFY-------SQETGGTIVSSALKLMDEVVKE-----RYNPAQWNIYAA 342 +T + + Q GGT +++ + +++KE R ++ Sbjct: 483 NTSLSKATKENIDEAMVYVKAIQSYGGTDINAPVLKAVDMLKEDRKAKRLPEKSIDMIIL 542 Query: 343 QASDGD-NWADDSPLCHEILAK-KLLPVVRYYSYIEITRRAHQTLWREYEHLQSTFDNFA 400 +DGD N + + K + + +S + Sbjct: 543 -LTDGDPNSGESRIPVIQENVKAAIGGQMSLFSLGFGN-DVKYPFLDVMSRENNGLAR-- 598 Query: 401 MQHIRDQDDIYPVFRELFHKQNAT 424 I + D + F+ + ++ Sbjct: 599 --RIYEGSDAALQLQG-FYDEVSS 619 >UniRef50_D1KBY4 Putative uncharacterized protein n=2 Tax=Proteobacteria RepID=D1KBY4_9GAMM Length = 682 Score = 76.0 bits (185), Expect = 3e-12, Method: Composition-based stats. Identities = 30/187 (16%), Positives = 65/187 (34%), Gaps = 21/187 (11%) Query: 251 MFCLMDVSGSMDQSTKDMAKRFYILLYLFLSRTYKNVEVVY-----IRHHTQAKEVDEHE 305 + ++D SGSM S+ + A I L T + + + + T +D ++ Sbjct: 334 VIFIIDSSGSMMGSSMEQATNALIQAINRLKPTDRFNIIDFDSDFEVLFDTAIPAIDMNK 393 Query: 306 ------FFYSQETGGTIVSSALKLMDEVVKERYNPAQWNIYAAQASDGDNWADDSPLCHE 359 + +GGT A+K ++ + + ++ +DG + Sbjct: 394 RHGIRFAKHLVASGGTEPLEAIKFA--LLSKDEDSDKYLRQVIFLTDGQ-VG--NEKELF 448 Query: 360 ILAKKLLPVVRYYSYIEITRRAHQTLWREYEHLQSTFDNFAMQHIRDQDDIYPVFRELFH 419 ++ + R+++ I + L + + I D D++ ELF Sbjct: 449 RAVQQNIDDDRFFTIG-IGSAPNDYLMTKMAEYGKGAFTY----IGDIDEVEVKMGELFS 503 Query: 420 KQNATAK 426 K + A Sbjct: 504 KLESPAM 510 >UniRef50_C4G1K3 Putative uncharacterized protein n=1 Tax=Abiotrophia defectiva ATCC 49176 RepID=C4G1K3_ABIDE Length = 1659 Score = 75.6 bits (184), Expect = 3e-12, Method: Composition-based stats. Identities = 25/166 (15%), Positives = 52/166 (31%), Gaps = 11/166 (6%) Query: 199 SNSEPAQLLEEERLRKEIAELRAKIERVPFIDTFDLRYKNYEKRPDPSSQAVMFCLMDVS 258 + E++ I++ + +L+ K + + + +MD S Sbjct: 37 PDEYYKNGSEKQENGVTISKKVTRYNAADGTYDIELKVKGSTEVVQNNKILDIVLVMDTS 96 Query: 259 GSMDQSTKDMAKRFYILLYLFLSRTYKNVEVVYIRHH---------TQAKEVDEHEFFYS 309 GSM+ + + AK+ L NV + + T+ ++ Sbjct: 97 GSMEGKSLENAKKAANNFVDKLLPQNNNVNIGIVSFAEKGEIKSGLTRNVTTLKNAIKGL 156 Query: 310 QETGGTIVSSALKLMDEVVKERYNPAQWNIYAAQASDGDNWADDSP 355 + GGT L+ V+ PA+ DG+ + Sbjct: 157 KADGGTYTQQGLEKAATVLNGA--PAEHKKVMVVIGDGEPTYANGE 200 >UniRef50_B4VXM6 Vault protein inter-alpha-trypsin n=1 Tax=Microcoleus chthonoplastes PCC 7420 RepID=B4VXM6_9CYAN Length = 928 Score = 75.6 bits (184), Expect = 3e-12, Method: Composition-based stats. Identities = 22/217 (10%), Positives = 59/217 (27%), Gaps = 29/217 (13%) Query: 223 IERVPFIDTFDLRYKNYEKRPDPS--------SQAVMFCLMDVSGSMDQSTKDMAKRFYI 274 + + D R ++ P+ + L+D SGS + + Sbjct: 392 RTQTTVLSQADTRGGHFAVYLIPAIEYNPHQLVPKDVVFLIDTSGSQSGEPLNKCQELMR 451 Query: 275 LLYLFLSRTYKNVEVVY-----------IRHHTQAKEVDEHEFFYSQETGGTIVSSALKL 323 L+ + + + + Q + + +GGT + ++ Sbjct: 452 RFINGLNPHDTFTIIDFSDTTRQLSPVPLANTVQNRNSAMNYINQLNASGGTQLRRGIQA 511 Query: 324 MDEVVKERYNPAQWNIYAAQASDGDNWADDSPLCHEILAKKLLPVVRYYSYIEITRRAHQ 383 + + +DG + + + + L R +S+ ++ Sbjct: 512 VLNFPE---VDPGRLRSIVLLTDG--YIGNENQILAEVQRHLKLGNRLHSFG-AGSSVNR 565 Query: 384 TLWREYEHLQSTFDNFAMQHIRDQDDIYPVFRELFHK 420 L + + +R + V + F + Sbjct: 566 FLLNRIAEIGRG----ISRIVRYDEPTEEVAEQFFGQ 598 >UniRef50_A1U6Y4 Vault protein inter-alpha-trypsin domain protein n=2 Tax=Marinobacter RepID=A1U6Y4_MARAV Length = 712 Score = 75.6 bits (184), Expect = 4e-12, Method: Composition-based stats. Identities = 32/264 (12%), Positives = 68/264 (25%), Gaps = 34/264 (12%) Query: 183 GKRRELHALEENLAIISNSEPAQLLEEERL---------RKEIAELRAKIERVPFIDTFD 233 LE + A +S + L++ + + A R + + F+ Sbjct: 283 PSHPLQVELEGSRATVSPEQGQILMDRDVIVRWRPADNQAPTAALFRQQWQGEDFLMAMV 342 Query: 234 LRYKNYEKRPDPSSQAVMFCLMDVSGSMDQSTKDMAKRFYILLYLFLSRTYKNVEVVYIR 293 + + + ++D SGSM + A+ + L + + + Sbjct: 343 MPPATTGQVLRRE----LLFVIDTSGSMAGESIRQARSALLRGLDTLRPGDRFNVIQFNS 398 Query: 294 HHTQA-----------KEVDEHEFFYSQETGGTIVSSALKLMDEVVKERYNPAQWNIYAA 342 GGT ++ AL L + + + Sbjct: 399 QAHALYTQPVPANGHYLARARDYVQDLTADGGTEMAGALSLA--MGMDGSESSGHVQQMV 456 Query: 343 QASDGDNWADDSPLCHEILAKKLLPVVRYYSYIEITRRAHQTLWREYEHLQSTFDNFAMQ 402 +DG A + + L R ++ I + RE Sbjct: 457 FMTDG---AVGNESALFDQIRTGLGNRRLFTVA-IGSAPNMHFLREAARWGRGQY----T 508 Query: 403 HIRDQDDIYPVFRELFHKQNATAK 426 + ++ +LF A Sbjct: 509 AVHSAAEVDKALGKLFAAMEAPVM 532 >UniRef50_A6FSG0 Putative uncharacterized protein n=1 Tax=Roseobacter sp. AzwK-3b RepID=A6FSG0_9RHOB Length = 444 Score = 75.2 bits (183), Expect = 4e-12, Method: Composition-based stats. Identities = 22/170 (12%), Positives = 39/170 (22%), Gaps = 14/170 (8%) Query: 240 EKRPDPSSQAVMFCLMDVSGSMDQSTKDMAKRFYILLYLFLSRTYKNVEVVY-----IRH 294 E P P + +D S SM AKR + L + + V + + Sbjct: 36 ETEPRPPLNLALV--LDRSSSMRGQPLHEAKRAADQIVAGLRPSDRLAIVAFDNATEVMF 93 Query: 295 HTQAK---EVDEHEFFYSQETGGTIVSSALKLMDEVVKERYNPAQWNIYAAQASDG-DNW 350 + + G T + L E SDG N Sbjct: 94 SGGPRGDGQAARAALSRIHARGMTALHDGWLLGVEQSIAMREAGTPARV-FLLSDGVANV 152 Query: 351 ADDSPLCHEILAKKLLP-VVRYYSYIEITRRAHQTLWREYEHLQSTFDNF 399 ++ + + ++ L E + Sbjct: 153 GLTDASAIAADCTRMAEHGITTSTCGLGMG-FNEDLMAEMARAGRGNAYY 201 >UniRef50_UPI000174639B hypothetical protein VspiD_18615 n=1 Tax=Verrucomicrobium spinosum DSM 4136 RepID=UPI000174639B Length = 868 Score = 75.2 bits (183), Expect = 4e-12, Method: Composition-based stats. Identities = 24/173 (13%), Positives = 44/173 (25%), Gaps = 17/173 (9%) Query: 234 LRYKNYEKRPDPSSQAVMFCLMDVSGSMDQSTKDMAKRFYILLYLFLSRTYKNVEVVYIR 293 +R K ++ SS + +D SGSM +MAK I L+R + Sbjct: 400 VRLKAPDEEEKQSSALALV--IDRSGSMSGEKLEMAKSAAIATAEVLTRNDSIGVYAFDS 457 Query: 294 HHTQAKEVDE--------HEFFYSQETGGTIVSSALKLMDEVVKERYNPAQWNIYAAQAS 345 + + GGT + A ++ + + Sbjct: 458 EAHVVVPMTRLTSSSAVAGQIAGLTSGGGTNLHPAFTEARNALQRTKAKIKH---MIILT 514 Query: 346 DGDNWADDSPLCHEILAKKLLPVVRYYSYIEITRRAHQTLWREYEHLQSTFDN 398 DG + + + + H L + L Sbjct: 515 DGQTSGQ-GYEALASQCRAEGVTISTVAIGDGA---HVGLLQAIASLGGGKSY 563 >UniRef50_B5YMD8 Predicted protein n=1 Tax=Thalassiosira pseudonana CCMP1335 RepID=B5YMD8_THAPS Length = 868 Score = 75.2 bits (183), Expect = 5e-12, Method: Composition-based stats. Identities = 41/258 (15%), Positives = 79/258 (30%), Gaps = 31/258 (12%) Query: 186 RELHALEENLAIISNSEPAQLLEEERLRKEIAELRAKIERVPFIDTFD---------LRY 236 A EE + A+ E++ +R+E +L R ++ Sbjct: 64 EGAVAQEEEAVDFAMEMLAREHEQQGIREETLQLSVAPHRESIGLQSGEFTGQICATIKA 123 Query: 237 KNYEKRP-DPSSQAVMFCLMDVSGSMDQSTKDMAKRFYILLYLFLSRTYKNVEVVYIRHH 295 ++ +R S + +DVSGSM D+ K LL L + + + Sbjct: 124 RDLPQRDSFARSPIDIVVALDVSGSMRVEKLDLCKETLHLLLRELHHDDRFALISFSEDA 183 Query: 296 T----------QAKEVDEHEFFYSQETGGTIVSSALKLMDEVVKERYNPAQWNIYAAQAS 345 + K+ H G T ++SA+ L +VV P + + Sbjct: 184 VIEVPMQKVNERNKQQALHAIDRLSVKGRTNIASAVSLAAQVVNGVAEPNKV-RSVFLLT 242 Query: 346 DGD-NWADDSPLCHEIL----AKKLL----PVVRYYSYIEITRRAHQTLWREYEHLQSTF 396 DG+ N + L + P + +++ Q L R S Sbjct: 243 DGNANTGYTEAIDLVKLTSIFVEANRNPHTPPISLHTFGY-GPEPDQKLLRGMAMATSGG 301 Query: 397 DNFAMQHIRDQDDIYPVF 414 ++++ + Sbjct: 302 SFYSVRDNSQVSSAFGDA 319 >UniRef50_B7G6X2 Predicted protein n=1 Tax=Phaeodactylum tricornutum CCAP 1055/1 RepID=B7G6X2_PHATR Length = 523 Score = 74.8 bits (182), Expect = 5e-12, Method: Composition-based stats. Identities = 30/238 (12%), Positives = 64/238 (26%), Gaps = 61/238 (25%) Query: 218 ELRAKIERVPFIDTFDLRYKNYEKRPDPSSQAVMFCLMDVSGSMDQSTKDMAKRFYILLY 277 + +++ F + R ++ D + + ++DVSGSM + + K+ +L Sbjct: 38 GIDSEVSTNHFCASIHAR-TMPKEDEDCRTPIDLIVVLDVSGSMTGNKLKLCKKTLTMLL 96 Query: 278 LFLSRTYKNVEVVY-----IRHHTQA-----KEVDEHEFFYSQETGGTIVSSALKLMDEV 327 L + + + + QA K + G T +S+AL L + Sbjct: 97 RVLQTQDRFGLISFGSDARVEFPAQAMSKQNKASALQKIQSLTTRGCTNMSAALGLAVQE 156 Query: 328 VK--ERYNPAQWNIYAAQASDG-DNWADDSPLCHEIL----------------------- 361 +K E+ NP +DG N L Sbjct: 157 LKIIEKSNPV---RSLFFLTDGLANEGISDLDGLVSLTRNCLLPSDNPSNVLNSEVMIAE 213 Query: 362 -------------------AKKLLPV-VRYYSYIEITRRAHQTLWREYEHLQSTFDNF 399 + + + +++ + L + Sbjct: 214 CLDDLATSQHQITRLPVAEIESVCRAPITLHTFGYGR-DHNAALLESLADTTQGGAYY 270 >UniRef50_A0CDA0 Chromosome undetermined scaffold_17, whole genome shotgun sequence n=2 Tax=Paramecium tetraurelia RepID=A0CDA0_PARTE Length = 508 Score = 74.8 bits (182), Expect = 5e-12, Method: Composition-based stats. Identities = 33/300 (11%), Positives = 82/300 (27%), Gaps = 37/300 (12%) Query: 136 NLKQNQQRQLTEYKTHRAGYTA--NGVPANISVVRSLQNSLARRTAMTAGKRRELHALEE 193 + + ++ + + T N + S S R ++ Sbjct: 35 SFQPKKKNPIQKSNTMHNLIQQFQTQTQENNDLKNSQSESKNRF----------SQRTQQ 84 Query: 194 NLAIISNSEPAQLLEEERLRK-EIAELRAKIERVPFIDTFDLRYKNYEKRPDPSSQAVMF 252 ++ S+ L ++ + + + L+ + +F Sbjct: 85 SIPSRSSINKQPLDDDIQFDMFSVNPGSNILNLTQHTIPIVLQLRTKTLEELDQIGVDLF 144 Query: 253 CLMDVSGSMDQSTKDMAKRFYILLYLFLSRTYKNVEVVYIRH----------HTQAKEVD 302 CL+D+ M D K+ + L + + + ++ +E Sbjct: 145 CLIDIGNGMQGQKIDYVKQILHSILTNLREQDRLCLISFNNDGKLLTGLQKVTSETQEYF 204 Query: 303 EHEFFYSQETGGTIVSSALKLMDEVVKERYNPAQWNIYAAQASDGDNWADDSPLCHEILA 362 Q G T + ++ +V+ +R N W SDG + + Sbjct: 205 AFVIDGLQCNGTTELWKGTEVAFDVINQRKNKNNWAR-ILIFSDGQ-----DEIALTKIK 258 Query: 363 KKLLPVVRYYS---YIEITRRAHQTLWREYEHLQSTFDNFAMQHIRDQDDIYPVFRELFH 419 K+L ++ + A +L+ + I + ++ + F Sbjct: 259 KQLEYNYDIFTIDSFGFSNSNA-SKRLSSITNLRFGKHH----IINSEQQVFKCLEQTFA 313 >UniRef50_Q2B7C3 Putative uncharacterized protein n=1 Tax=Bacillus sp. NRRL B-14911 RepID=Q2B7C3_9BACI Length = 920 Score = 74.8 bits (182), Expect = 5e-12, Method: Composition-based stats. Identities = 28/180 (15%), Positives = 53/180 (29%), Gaps = 26/180 (14%) Query: 230 DTFDLRYKNYEKRPDPSSQAVMFCLMDVSGSMDQSTKDMAKRFYILLYLFLSRTYKNVEV 289 D++ K + PS ++ MD SGSM S ++AK L Sbjct: 390 VNMDIKGK----KEMPSLGLMIV--MDRSGSMAGSKLELAKEAAARSVELLREKDTLG-- 441 Query: 290 VYIRHHTQAKEVDE-----------HEFFYSQETGGTIVSSALKLMDEVVKERYNPAQWN 338 +I + + E + GGT + ++L+ E ++ + Sbjct: 442 -FIAFDDRPWVIVETGPLEDKKDAVDKIGSVTPGGGTEIFTSLEKAYEELENLKLQRKH- 499 Query: 339 IYAAQASDGDNWADDSPLCHEILAKKLLPVVRYYSYIEITRRAHQTLWREYEHLQSTFDN 398 +DG + K+ + + A + L E L + Sbjct: 500 --IILLTDGQSARSTDYESMIETGKENNITLSTVALG---SDADRNLLEELAGLGAGRFY 554 Score = 43.2 bits (100), Expect = 0.018, Method: Composition-based stats. Identities = 19/132 (14%), Positives = 38/132 (28%), Gaps = 11/132 (8%) Query: 238 NYEKRPDPSSQAVMFCLMDVSGSMDQSTKDMAKRFYILLYLFLSRTYKNVEV----VYIR 293 P+ + + D S S+ ++ F + + Sbjct: 53 AVPSIRLPAPGKTVVFIADRSASVQGREGELL-DFIDAGIQSKGKEDSYAVISAGETAAA 111 Query: 294 HHTQAKEVDE-HEFFYSQETGGTIVSSALKLMDEVVKERYNPAQWNIYAAQASDGDNWAD 352 + A E EF G T + + ++L ++ P + SDG A Sbjct: 112 ESSLASMKGEFREFSTDTGKGETNLEAGIQLASTLM-----PEETPGRIVLLSDGRETAG 166 Query: 353 DSPLCHEILAKK 364 S ++L + Sbjct: 167 SSREAAKLLKNR 178 >UniRef50_Q7US47 Putative uncharacterized protein n=1 Tax=Rhodopirellula baltica RepID=Q7US47_RHOBA Length = 1291 Score = 74.8 bits (182), Expect = 6e-12, Method: Composition-based stats. Identities = 37/258 (14%), Positives = 71/258 (27%), Gaps = 21/258 (8%) Query: 80 DHFVQNDRIERPQGGGGGSGSGQGQASQDGEGQDEFVFQISKDEYLDLLFEDL------A 133 + R G G + + ++ D ++L A Sbjct: 941 RGHGEGSRGGLANAPSGMGGGTEAPEPTTAQWAEDLEALFGSDLCQEVLGTAAGNGRSTA 1000 Query: 134 LPNLKQNQQRQLTEYKTHRAGYTANGVPANISVVRSLQNSLARRTAMTAGKRRELHALEE 193 + L + E + ++ +R L L + A R + Sbjct: 1001 IELLDPDTVTPSLELLQQVLSLAGAMPESKVATLRRLARRLTEQLASELAVRLQPAMNGL 1060 Query: 194 NLAIISNSEPAQLLEEERLRKEIAELRAKIERVPFIDTFDLRYKNYEKRPDPSSQAVMFC 253 + + +L LR +A + + I L + + KR + Sbjct: 1061 SSPRPTRRRARKLNLPRTLRDNLANCHRRADGRATIVAEKLMFHSPSKRQM---DWHVTF 1117 Query: 254 LMDVSGSMDQSTKDMAKRFYILLYLFLSRTYKNVEVVYIRHHTQAKEVDEHEFFYS---- 309 ++DVS SM S A L + + V ++ T+ + E Sbjct: 1118 VVDVSASMSASVIYSA------LVAAVFDALPALSVRFLAFSTEVLDFSEQVADPLSLLL 1171 Query: 310 --QETGGTIVSSALKLMD 325 Q GGT + L+ Sbjct: 1172 EVQVGGGTDIGLGLRAAR 1189 >UniRef50_UPI00016E8A41 UPI00016E8A41 related cluster n=3 Tax=Takifugu rubripes RepID=UPI00016E8A41 Length = 945 Score = 74.8 bits (182), Expect = 6e-12, Method: Composition-based stats. Identities = 21/202 (10%), Positives = 58/202 (28%), Gaps = 22/202 (10%) Query: 239 YEKRPDPSSQAVMFCLMDVSGSMDQSTKDMAKRFYILLYLFLSRTYKNVEVVY-----IR 293 + + P+ + ++D S SM K + L + + + Sbjct: 288 FAPKDLPAVPKNVVFVIDTSASMLGKKIRQTKEALFTILGDLRPGDHFNFISFSSRVKVW 347 Query: 294 HHTQAKEVDEHE-------FFYSQETGGTIVSSALKLMDEVVKERYNPAQWNI----YAA 342 + V + F +GGT ++SA++ ++++ + + Sbjct: 348 QPGRLVPVTPNNVRDAKKFIFMLPTSGGTNINSAIQTGSSLLQDYLSAQDASPNSVSLII 407 Query: 343 QASDGDNWADDSPLCHEILAKKLLPVVRYYSYIEITRRAHQT-LWREYEHLQSTFDNFAM 401 +DG + + + ++ + L M Sbjct: 408 FLTDGQPTVGEVQSVTILGNTRSAVQGKFCIFTIGIGNDVDYRLLERMALDNCG----MM 463 Query: 402 QHIRDQDDIYPVFRELFHKQNA 423 + I ++ D + + F+ + Sbjct: 464 RRIPEEADASSMLKG-FYDEIG 484 >UniRef50_C5YHY2 Putative uncharacterized protein Sb07g005010 n=2 Tax=Sorghum bicolor RepID=C5YHY2_SORBI Length = 567 Score = 74.8 bits (182), Expect = 6e-12, Method: Composition-based stats. Identities = 28/200 (14%), Positives = 50/200 (25%), Gaps = 26/200 (13%) Query: 225 RVPFIDTFDLRYKNYEKRPDPSSQAVMFCLMDVSGSMDQSTKDMAKRFYILLYLFLSRTY 284 R D F + + + ++DVS SM + K+ + L Sbjct: 56 RFTSRDRFAVLVHAKAPSDVSRAPLDLVTVLDVSDSMKGEKLALLKQAMCFVIDQLGPAD 115 Query: 285 KNVEVVY----------IRHHTQAKEVDEHEFFYSQETGGTIVSSALKLMDEVVKERYNP 334 + V + R K + G T + + + EV+ R Sbjct: 116 RLSVVTFSNDASRLTRLARMSDAGKASAKIAVESLAVQGFTNIKQGIHVAAEVLAGRREK 175 Query: 335 AQWNIYAAQASDG-DNWADDS--PLCHEILAKKLLPVVRY-----------YSYIEITRR 380 SDG DN S P + + P + +++ T Sbjct: 176 NVVAGMIL-LSDGHDNCGGTSVRPDGTKSYVNLVPPSLTVAAGSSRPAAPIHTFGFGTS- 233 Query: 381 AHQTLWREYEHLQSTFDNFA 400 +F Sbjct: 234 HDAGAMHAVAEATGGTFSFV 253 >UniRef50_Q3M2E0 von Willebrand factor, type A n=2 Tax=Cyanobacteria RepID=Q3M2E0_ANAVT Length = 218 Score = 74.4 bits (181), Expect = 7e-12, Method: Composition-based stats. Identities = 22/166 (13%), Positives = 50/166 (30%), Gaps = 16/166 (9%) Query: 239 YEKRPDPSSQAVMFCLMDVSGSMDQSTKDMAKRFYILLYLFLSRTYK---NVEVVYIRHH 295 E +P ++ + L+D SGSM R + + + +VEV I Sbjct: 6 PEFVENPENRCPVILLLDTSGSMSGQPIQELNRGLATFKEDVIKDSQASLSVEVAIITFG 65 Query: 296 TQAKEVDEHEFF-----YSQETGGTIVSSALKLMDEVVKER---YNPAQ---WNIYAAQA 344 D + G T + A++ ++++ R Y + + Sbjct: 66 PVRLVQDFVNIDQFTPPQLEAEGVTPMGEAIEYALDLLETRKSAYKENGILYYRPWIFLI 125 Query: 345 SDGDNWADDSPLCHEILAKKLLPVVRYYSYIEITRRAHQTLWREYE 390 +DG + + + +++ + R+ Sbjct: 126 TDGAPTDYYHLAAQRVKEAEANRRLCFFTVGVQGADFN--KLRQIA 169 >UniRef50_C7N1C6 Uncharacterized protein n=1 Tax=Slackia heliotrinireducens DSM 20476 RepID=C7N1C6_SLAHD Length = 744 Score = 74.4 bits (181), Expect = 7e-12, Method: Composition-based stats. Identities = 25/170 (14%), Positives = 51/170 (30%), Gaps = 19/170 (11%) Query: 238 NYEKRPDPSSQAVMFCLMDVSGSMDQSTKDMAKRFYILLYLFLSRTYKNVEVVYIRHHTQ 297 + + P+ +S + +D SGSMD + K + ++ +V + + + Sbjct: 369 DSKVDPNDASSRHVVLALDTSGSMDGEPLNETKTATREFASTIFKSD--ADVCLVSYDSS 426 Query: 298 AKEVDEH---------EFFYSQETGGTIVSSALKLMDEVVKERYNPAQWNIYAAQASDGD 348 A+ V + GGT + AL++ E ++ SDG+ Sbjct: 427 ARNVIDSTDNEYALKAAVRDLSAGGGTNIEDALRVSYERLE---GSGSDKRIIVLMSDGE 483 Query: 349 -NWADDSPLCHEILAKKLLPVVRYYSYIEITRRAH----QTLWREYEHLQ 393 N + V Y+ + Q + Sbjct: 484 ANEGLVGDDLIAYANEIKDDGVTIYTLGFFQSVSDKAECQRVMEGIASPG 533 >UniRef50_B4DPQ4 cDNA FLJ60769, highly similar to Inter-alpha-trypsin inhibitor heavy chain H3 n=11 Tax=Tetrapoda RepID=B4DPQ4_HUMAN Length = 698 Score = 74.4 bits (181), Expect = 7e-12, Method: Composition-based stats. Identities = 24/205 (11%), Positives = 56/205 (27%), Gaps = 18/205 (8%) Query: 239 YEKRPDPSSQAVMFCLMDVSGSMDQSTKDMAKRFYILLYLFLSRTYKNVEVVYIRHHTQA 298 + + P + ++D+SGSM + K + + + ++ + Sbjct: 273 FAPQGLPVVPKNVAFVIDISGSMAGRKLEQTKEALLRILEDMKEEDYLNFTLFSGDVSTW 332 Query: 299 KEVDEHEF-FYSQET----------GGTIVSSALKLMDEVV----KERYNPAQWNIYAAQ 343 KE QE G T ++ L ++ +E P + Sbjct: 333 KEHLVQATPENLQEARTFVKSMEDKGMTNINDGLLRGISMLNKAREEHRIPERSTSIVIM 392 Query: 344 ASDGD-NWADDSPLCHEILAKK-LLPVVRYYSYIEITRRAHQTLWREYEHLQSTFDNFAM 401 +DGD N + P + + + Y+ + F Sbjct: 393 LTDGDANVGESRPEKIQENVRNAIGGKFPLYNLGFGN-NLNYNFLENMALENHGFARRIY 451 Query: 402 QHIRDQDDIYPVFRELFHKQNATAK 426 + + + E+ + + Sbjct: 452 EDSDADLQLQGFYEEVANPLLTGVE 476 >UniRef50_Q6PGW2 Zgc:112265 protein (Fragment) n=15 Tax=Clupeocephala RepID=Q6PGW2_DANRE Length = 927 Score = 74.4 bits (181), Expect = 8e-12, Method: Composition-based stats. Identities = 20/177 (11%), Positives = 50/177 (28%), Gaps = 15/177 (8%) Query: 239 YEKRPDPSSQAVMFCLMDVSGSMDQSTKDMAKRFYILLYLFLSRTYKNVEVVY------- 291 + P + ++D SGSM + + + L + + Sbjct: 265 FAPSDVPHIPKNVVFIIDRSGSMHGRKIRQTRSALLTILKDLDEDDHFGLITFDAEIDFW 324 Query: 292 ---IRHHTQA-KEVDEHEFFYSQETGGTIVSSALKLMDEVVKERYNPAQWNIYAAQASDG 347 + T+A +E E Q+ G T ++ A+ +++ +I +DG Sbjct: 325 RRELLQATKANRENAESFVKRIQDRGATNINDAVLAGVDMINRNPRKGTASILIL-LTDG 383 Query: 348 DNW-ADDS-PLCHEILAKKLLPVVRYYSYIEITRRAHQTLWREYEHLQSTFDNFAMQ 402 D + + + + + Y + + + + Sbjct: 384 DPTAGETNIEKIMANVKEAIGSKFPLYCLGF-GYDVNFDFLTKMSLENNAVARRIYE 439 >UniRef50_P19823 Inter-alpha-trypsin inhibitor heavy chain H2 n=40 Tax=Euteleostomi RepID=ITIH2_HUMAN Length = 946 Score = 74.0 bits (180), Expect = 9e-12, Method: Composition-based stats. Identities = 25/204 (12%), Positives = 57/204 (27%), Gaps = 24/204 (11%) Query: 239 YEKRPDPSSQAVMFCLMDVSGSMDQSTKDMAKRFYILLYLFLSRTYKNVEVVY------- 291 + + ++DVSGSM + L + + Sbjct: 299 FAPDNLDPIPKNILFVIDVSGSMWGVKMKQTVEAMKTILDDLRAEDHFSVIDFNQNIRTW 358 Query: 292 ----IRHHTQAKEVDEHEFFYSQETGGTIVSSALKLMDEVVKER-----YNPAQWNIYAA 342 I + Q +GGT ++ AL ++ E +P ++ Sbjct: 359 RNDLISATKTQVADAKRYIEKIQPSGGTNINEALLRAIFILNEANNLGLLDPNSVSLIIL 418 Query: 343 QASDGDNWADDSP--LCHEILAKKLLPVVRYYSYIEITRRAHQTLWREYEHLQSTFDNFA 400 SDGD + + + + + + +S + L + + Sbjct: 419 -VSDGDPTVGELKLSKIQKNVKENIQDNISLFSLGMGFDVDYDFL-KRLSNENHG----I 472 Query: 401 MQHIRDQDDIYPVFRELFHKQNAT 424 Q I D ++ +++ + Sbjct: 473 AQRIYGNQDTSSQLKKFYNQVSTP 496 >UniRef50_B8G7Y1 von Willebrand factor type A n=3 Tax=Chloroflexus RepID=B8G7Y1_CHLAD Length = 914 Score = 74.0 bits (180), Expect = 9e-12, Method: Composition-based stats. Identities = 21/216 (9%), Positives = 52/216 (24%), Gaps = 24/216 (11%) Query: 224 ERVPFIDTFDLRYKNYEKRPDPSSQAVMFCLMDVSGSMDQ------STKDMAKRFYILLY 277 R ++ + R + + ++D SGSM + + D+A+ Sbjct: 387 WRRTLLEPILPVALDPPLREERP-DLALVLVIDRSGSMRELVDDGRTQLDLAREAVYQAS 445 Query: 278 LFLSRTYKNVEVVYIRHHTQAKE--------VDEHEFFYSQETGGTIVSSALKLMDEVVK 329 L++ + + + E GGT + S + L E + Sbjct: 446 RGLTQRDQIALIAFDSIADTLLPLQPLPGLFTIEDALSRLVAGGGTNIRSGIALAAETIA 505 Query: 330 ERYNPAQWNIYAAQASDGDNWADDSPLCHEILAKKLLPVVRYYSYIEITRRAHQTLWREY 389 + +DG ++ + V + T Sbjct: 506 TS---QARIRHVILLTDG--VSETEYADLVADLRAQGITVSAIAIGLDT----DPALERV 556 Query: 390 EHLQSTFDNFAMQHIRDQDDIYPVFRELFHKQNATA 425 + + + + ++ Sbjct: 557 AQIGGGKYYLVQRVPDLPQVVLEETVRVANRDVIEE 592 >UniRef50_D2AUU0 Putative uncharacterized protein n=1 Tax=Streptosporangium roseum DSM 43021 RepID=D2AUU0_STRRD Length = 1330 Score = 74.0 bits (180), Expect = 9e-12, Method: Composition-based stats. Identities = 37/293 (12%), Positives = 73/293 (24%), Gaps = 29/293 (9%) Query: 60 SEPMFHQGRGGLRHRVHPGNDHFVQNDRIERPQGGGGGSGSGQGQASQDGEGQDEFVFQI 119 +G G PG P ++ E Sbjct: 987 ELYGTGRGEGAGHFGQSPGEGGDSGGRGDSFPTAREWAEELEVLFGTEVREEVLARAADA 1046 Query: 120 SKDEYLDLLFEDLALPNLKQNQQRQLTEYKTHRAGYTANGVPANISVVRSLQNSLARRTA 179 + + L L R E T ++ +R L L Sbjct: 1047 GRTDV---------LTELDPAAVRPSVELLTSVLTLAGGLPEQRLAKLRPLVRRLVAELT 1097 Query: 180 MTAGKRRELHALEENLAIISNSEPAQLLEEERLRKEIAELRAKIERVPFIDTFDLRYKNY 239 R + ++ LR + R + + + Sbjct: 1098 RELATRLRPALTGLATPRPTRRPGGRIDLPRTLRANLQHTRRMGDGRLVVVPERPVF--- 1154 Query: 240 EKRPDPSSQAVMFCLMDVSGSMDQSTKDMAKRFYILLYLFLSRTYKNVEVVYIRHHTQAK 299 R + + ++DVSGSM+ S A +L + ++ T+ Sbjct: 1155 STRARREADWRLILVVDVSGSMEASVVWSALTAAVLA------GVPTLSTHFLSFSTEVI 1208 Query: 300 EVDEHEFFYS------QETGGTIVSSALKLMDEVVKERYNPAQWNIYAAQASD 346 ++ + + GGT +++ L +V P++ + SD Sbjct: 1209 DLTDRVADPLSLLLEVRVGGGTHIAAGLAHARSLVT---VPSRTLVVV--VSD 1256 >UniRef50_Q9LMB7 F14D16.26 n=5 Tax=rosids RepID=Q9LMB7_ARATH Length = 736 Score = 74.0 bits (180), Expect = 9e-12, Method: Composition-based stats. Identities = 28/222 (12%), Positives = 60/222 (27%), Gaps = 35/222 (15%) Query: 224 ERVPFIDTFDL-RYKNYEKRPDPSS--QAVMFCLMDVSGSMDQSTKDMAKRFYILLYLFL 280 V +D D+ + + + + + + ++D+S SM + K L Sbjct: 281 APVHDVDQRDIFSFYLFPGKQQKTKAFKREVVFVVDISKSMTGKPLEDVKNAISTALSKL 340 Query: 281 SRTYKNVEVVY----IRHHTQAKEVDEHEFFYSQET--------GGTIVSSALKLMDEVV 328 + + T + V E GT + L+ E++ Sbjct: 341 DPGDSFNIITFSNDTALFSTSMESVTSDAVERGIEWMNKNFVVADGTNMLPPLEKAVEML 400 Query: 329 KERYNPAQWNIYAAQASDGDNWADDSPLCHEILAKK-------LLPVVRYYSYIEITRRA 381 N +DG + + + KK + P + + Sbjct: 401 ---SNTRGSIPMIFFVTDG---SVEDERHICDVMKKHLASAGSVFPRIHTFGLGVFCNHY 454 Query: 382 HQTLWREYEHLQSTFDNFAMQHIRDQDDIYPVFRELFHKQNA 423 + + HI ++ D +LF K + Sbjct: 455 FLQMLANIS-CGQHESVYNTDHIEERMD------KLFTKALS 489 >UniRef50_A9B057 von Willebrand factor type A n=3 Tax=Chloroflexi (class) RepID=A9B057_HERA2 Length = 562 Score = 74.0 bits (180), Expect = 1e-11, Method: Composition-based stats. Identities = 47/323 (14%), Positives = 94/323 (29%), Gaps = 46/323 (14%) Query: 125 LDLLFEDLALPNLKQNQQRQLTEYKTHRAGYTANGVPANISVVRSLQNSLARRTAMTAGK 184 +L+E+L + + Q L A Y G + + L+ M A K Sbjct: 257 AAILYENLIIESYDQALYPNL--ELPMVAIYPKEGSFWSDHPLVVLETE-----RMNADK 309 Query: 185 RRELHALEENLAIISNSEPAQLLEEERLRKEIAELRAKIERVPFIDTFDLRYKNYEKR-- 242 R +E L A +I+ A I+ +D L+ Sbjct: 310 RAAAQVFQEFLLAQPQQAKAMQYGFRPANVDISLA-APIDTAHGVDPSQLQVALPTPSAE 368 Query: 243 ---------PDPSSQAVMFCLMDVSGSMDQSTK-DMAKRFYILLYLFLSRTYKNVEVVYI 292 Q + ++D SGSM Q + AK + ++ Sbjct: 369 VLQAITQLWQQHKKQVDVALIIDTSGSMRQENRLREAKTALGDFIDIFADQDNVQVTIFS 428 Query: 293 RHHTQAKEVDE---------HEFFYSQETGGTIVSSALKLMDEVVKERYNPAQWNIYAAQ 343 + T+ ++ G T + S + + ++++ + Sbjct: 429 TNATELSDLSPIGPKRADLHTRIDGLVADGETRLYSTIGEVYTDIQQQTEVQRI-RALVV 487 Query: 344 ASDGDNWADDSPLCHEILAKKLLP-------VVRYYSYIEITRRAHQTLWREYEHL-QST 395 +DG++ A L E L +++ + +Y A+Q + + + + Sbjct: 488 LTDGEDTASS--LSLEQLNEQIRQDESGTSIKIFTIAYG---SDANQEVLQRIAEITGAK 542 Query: 396 FDNFAMQHIRDQDDIYPVFRELF 418 IR +Y F Sbjct: 543 SYTGDPATIRQ---VYHEIATFF 562 >UniRef50_O26551 Magnesium chelatase subunit ChlI n=1 Tax=Methanothermobacter thermautotrophicus str. Delta H RepID=O26551_METTH Length = 591 Score = 74.0 bits (180), Expect = 1e-11, Method: Composition-based stats. Identities = 27/144 (18%), Positives = 51/144 (35%), Gaps = 14/144 (9%) Query: 240 EKRPDPSSQAVMFCLMDVSGSMD-QSTKDMAKR-FYILLYLFLSRTYKNVEVVY------ 291 EK S+A+ ++D S SM + AK ++LL + + + + Sbjct: 418 EKVRIGKSRALYIIVLDTSSSMRLERKIKFAKTVSWLLLRDSYEKRNRIALIAFRGYEAN 477 Query: 292 -IRHHTQAKEVDEHEFFYSQETGGTIVSSALKLMDEVVKERYNPAQWNIYAAQASDGDN- 349 + T E E + G T ++ AL+L EV + A + SDG Sbjct: 478 LVVEPTSNLETVEEALEGLRSGGRTPLTPALRLAAEVASSSSDEACTAVVI---SDGRCN 534 Query: 350 -WADDSPLCHEILAKKLLPVVRYY 372 + + + + + L + Sbjct: 535 VFINSNLEEDMNMLETELRNLNLL 558 >UniRef50_A9WI94 von Willebrand factor type A n=2 Tax=Chloroflexus RepID=A9WI94_CHLAA Length = 845 Score = 73.6 bits (179), Expect = 1e-11, Method: Composition-based stats. Identities = 31/192 (16%), Positives = 59/192 (30%), Gaps = 22/192 (11%) Query: 238 NYEKRPDPSSQAVMFCLMDVSGSMDQ----STKDMAKRFYILLYLFLSRTYKNVEVVYIR 293 RP + + ++D S SM S DMAK IL L + + + Sbjct: 385 TPPPRPQR-APVSILFIIDRSASMSATFGISKFDMAKEAAILSLTTLQPGDRVGVLAFDT 443 Query: 294 HH-----------TQAKEVDEHEFFYSQETGGTIVSSALKLMD-EVVKERYNPAQWNIYA 341 + + + GGT + AL + + E Y+ +A Sbjct: 444 ETIWTVPFRTVGEGVSLVELQDQIATMSLGGGTNIERALSVGLPALANEPYS----TRHA 499 Query: 342 AQASDGDNWADDSPLCHEILAKKLLPVVRYYSYIEITRRAHQTLWREYEHLQSTFDNFAM 401 +DG +++++ P ++ L + S I I + L + + F Sbjct: 500 VLLTDGRSYSNNYPR-YQQLVETARAAQITLSTIAIGSDSDTELLNQLASWGNGRYYFVA 558 Query: 402 QHIRDQDDIYPV 413 + Sbjct: 559 DATDLPRITFQE 570 >UniRef50_Q7G2L9 Os10g0464500 protein n=3 Tax=Magnoliophyta RepID=Q7G2L9_ORYSJ Length = 719 Score = 73.6 bits (179), Expect = 1e-11, Method: Composition-based stats. Identities = 32/269 (11%), Positives = 69/269 (25%), Gaps = 41/269 (15%) Query: 185 RRELHALEENLAIISNSEPAQLLEEERLRKEIAELRAKIERVPFIDTFDLRYKNYEKRPD 244 E +++ I E A + + F L+ + Sbjct: 196 PAEDVVGTQDVDSIVADEMAPASVGITTYAAFPAMEESVMVEEFAVLIHLKAPSSPATVT 255 Query: 245 PSSQAVMFCLMDVSGSMDQSTKDMAKRFYILLYLFLSRTYKNVEVVYIRHHTQA------ 298 + + ++DVS SM + + KR + L + V + + Sbjct: 256 SRAPIDLVTVLDVSWSMAGTKLALLKRAMSFVIQALGPGDRLSVVTFSSSARRLFPLRKM 315 Query: 299 ----KEVDEHEFFYSQETGGTIVSSALKLMDEVVKERYNPAQWNIY--AAQASDG-DNWA 351 ++ GGT ++ AL+ V+++R + N SDG D + Sbjct: 316 TESGRQRALQRVSSLVADGGTNIADALRKAARVMEDR---RERNPVCSIVLLSDGRDTYT 372 Query: 352 DDSPLCHEILAKK----------------LLPVVRYYSYIEITRRAHQTLWREYEHLQST 395 P + V+ +++ + Sbjct: 373 VPVPRGGGGGGDQPDYAVLVPSSLLPGGGSARHVQVHAFGF-GADHDSPAMHSIAEMSGG 431 Query: 396 FDNFAMQHIRDQDDIYPVFRELFHKQNAT 424 +F D ++ F + Sbjct: 432 TFSFI--------DAAGSIQDAFAQCIGG 452 >UniRef50_D0Z403 Putative uncharacterized protein n=1 Tax=Photobacterium damselae subsp. damselae CIP 102761 RepID=D0Z403_LISDA Length = 543 Score = 73.6 bits (179), Expect = 1e-11, Method: Composition-based stats. Identities = 32/204 (15%), Positives = 73/204 (35%), Gaps = 18/204 (8%) Query: 165 SVVRSLQNS-LARRTAMTAGKRRELHALEENLAIISNSEPAQLLEEERLRKEIAELRAKI 223 +V +L + ++ +T KR + + S +++ L Sbjct: 302 DIVTTLGRAYISEKTNHKQVKRINTNEVYGTHKSADISRVLPSDLALLENEDLEYLFYAK 361 Query: 224 ERVPFIDTFDLRYKNYE---KRPDPSSQAVMFCLMDVSGSMDQSTKDMAKRFYILLYLFL 280 + T+ L + + + + + +D SGSM A+ + ++ + Sbjct: 362 LLESNLSTYKLLGHHIDFEKENDTEEDKGPIVTCLDTSGSMSGIPILKARALLLAIHSII 421 Query: 281 SRTYKNVEVVYIRHHTQAKEVDEHE--------FFYSQETGGTIVSSALKLMDEVVK--E 330 ++ + + V+ Q KE+ E F + +GGT + LK +++ E Sbjct: 422 TKEKRELYVLLFGSRGQVKELYLSETSSSGLLPFICKEFSGGTDFETPLKRAINIIEHKE 481 Query: 331 RYNPAQWNIYAAQASDGDNWADDS 354 ++N A +DG+ D+ Sbjct: 482 KFNKAD----ILMITDGECNVSDN 501 >UniRef50_A8N264 Putative uncharacterized protein n=1 Tax=Coprinopsis cinerea okayama7#130 RepID=A8N264_COPC7 Length = 885 Score = 73.6 bits (179), Expect = 1e-11, Method: Composition-based stats. Identities = 42/315 (13%), Positives = 78/315 (24%), Gaps = 50/315 (15%) Query: 130 EDLALPNLKQNQQRQLTEYKTHRAGYTANGVPANISVVRSLQNSLARRTAMTAGKRRELH 189 +D P+L + + +T P++ + S T RR Sbjct: 160 QDAVQPSLAETRIS-ITADIQMYGKIQRIVSPSHPDGITESPYS----TPQGRPSRRRT- 213 Query: 190 ALEENLAIISNSEPAQLLEEERLRKEIAELRAKI-----ERVPFIDTFDLRYKNYEKRPD 244 + P L + L L R P D P Sbjct: 214 -------TVRYRSPHYLDHDFVLGIHADGLDKPRCFAEVRRNPERGVPDTLAFQLTMVPR 266 Query: 245 ----PSSQAVMFCLMDVSGSMDQSTKDMAKRFYILLYLFLSRTYKNVEVVYIRHHTQAK- 299 P ++D SGSM+ AK +++ + N + K Sbjct: 267 TKLPPIRSQEYIFIVDCSGSMEGPRIQTAKDSLVMMLQMIPSH--NSIFNIFAFGNECKS 324 Query: 300 ----------EVDEHEFFYSQET----GGTIVSSALKLMDEVVKERYNPAQWNIYAAQAS 345 + E Y++ GGT + +AL++ + Sbjct: 325 WVAHSQNYSGKTLEEAIRYTESMQADLGGTEMRNALRVAL-----SSCSKGLPTVVFLLT 379 Query: 346 DGDNWADDS-PLCHEILAKKLLPVVRYYSYIEITRRAHQTLWREYEHLQSTFDNFAMQHI 404 DG + D + R ++ + L + + + Sbjct: 380 DGGCFDVDGCKQEVKGFVDGCTAPKRIFTLGIGESASSD-LCESIARVGNGES----FMV 434 Query: 405 RDQDDIYPVFRELFH 419 D + +LF Sbjct: 435 IDTSSVVQKCAKLFT 449 >UniRef50_Q60ED8 Von Willebrand factor type A domain containing protein n=6 Tax=Poaceae RepID=Q60ED8_ORYSJ Length = 801 Score = 73.6 bits (179), Expect = 1e-11, Method: Composition-based stats. Identities = 20/177 (11%), Positives = 45/177 (25%), Gaps = 28/177 (15%) Query: 240 EKRPDPSSQAVMFCLMDVSGSMDQSTKDMAKRFYILLYLFLSRTYKNVEVVYI----RHH 295 + + ++D SGSM + K L + + + Sbjct: 323 NNQKRKVFRNASVFIIDTSGSMQGKPLESVKNAMYTTLSELVQGDYFNIITFNDELHSFS 382 Query: 296 TQAKEVDEHEFFYSQ--------ETGGTIVSSALKLMDEVVKERYNPAQWNIYAAQASDG 347 + ++V+E ++ GGT + L ++ +N +DG Sbjct: 383 SCLEQVNEKTIENAREWVNTNFIAEGGTDIMHPLSEAIALLSNSHNA---LPQIFLVTDG 439 Query: 348 DNWADDSPLCHEILAKKLL-------PVVRYYSYIEITRRAHQTLWREYEHLQSTFD 397 + + K+ L P + + R + Sbjct: 440 ---SVEDERNICRTVKEQLATRGSKSPRISTFGLGSYCNHY---FLRMLASIGKGHY 490 >UniRef50_Q10JU7 Von Willebrand factor type A domain containing protein, expressed n=17 Tax=Poaceae RepID=Q10JU7_ORYSJ Length = 680 Score = 73.6 bits (179), Expect = 1e-11, Method: Composition-based stats. Identities = 20/177 (11%), Positives = 45/177 (25%), Gaps = 28/177 (15%) Query: 240 EKRPDPSSQAVMFCLMDVSGSMDQSTKDMAKRFYILLYLFLSRTYKNVEVVYI----RHH 295 + + ++D SGSM + K L + + + Sbjct: 323 NNQKRKVFRNASVFIIDTSGSMQGKPLESVKNAMYTTLSELVQGDYFNIITFNDELHSFS 382 Query: 296 TQAKEVDEHEFFYSQ--------ETGGTIVSSALKLMDEVVKERYNPAQWNIYAAQASDG 347 + ++V+E ++ GGT + L ++ +N +DG Sbjct: 383 SCLEQVNEKTIENAREWVNTNFIAEGGTDIMHPLSEAIALLSNSHNA---LPQIFLVTDG 439 Query: 348 DNWADDSPLCHEILAKKLL-------PVVRYYSYIEITRRAHQTLWREYEHLQSTFD 397 + + K+ L P + + R + Sbjct: 440 ---SVEDERNICRTVKEQLATRGSKSPRISTFGLGSYCNHY---FLRMLASIGKGHY 490 >UniRef50_Q12VX7 Putative uncharacterized protein n=1 Tax=Methanococcoides burtonii DSM 6242 RepID=Q12VX7_METBU Length = 892 Score = 73.3 bits (178), Expect = 1e-11, Method: Composition-based stats. Identities = 23/183 (12%), Positives = 54/183 (29%), Gaps = 19/183 (10%) Query: 251 MFCLMDVSGSM----DQSTKDM--AKRFYILLYLFLSRTYKNVEVVYIRHHTQAKEVDE- 303 + ++D SGSM + + + AK + L + V + T ++ Sbjct: 619 IVLVLDRSGSMKFLGNAPEQPLTDAKSAAKIFMENLLSNTEVGVVSFSSTSTVDRQPVSL 678 Query: 304 ----------HEFFYSQETGGTIVSSALKLMDEVVKERYNPAQWNIYAAQASDGDNWADD 353 + GGT + A+ + ++ A+ +DG A Sbjct: 679 NISGNKDLLHNAIDSMVADGGTAIGDAMADANNLLINGRPDAK--KIMIVLTDGVATAGS 736 Query: 354 SPLCHEILAKKLLPVVRYYSYIEITRRAHQTLWREYEHLQSTFDNFAMQHIRDQDDIYPV 413 + ++ L +R YS + + ++ + + +Y Sbjct: 737 DRDGSDAISTANLNNIRIYSIGLGSSEYIDEPMLKRIASETGGSYYNAPSGSELQTVYNT 796 Query: 414 FRE 416 + Sbjct: 797 ISK 799 >UniRef50_B7AA98 von Willebrand factor type A n=3 Tax=Thermus RepID=B7AA98_THEAQ Length = 706 Score = 73.3 bits (178), Expect = 2e-11, Method: Composition-based stats. Identities = 51/308 (16%), Positives = 89/308 (28%), Gaps = 31/308 (10%) Query: 135 PNLKQNQQRQLTEYKTHRAGYTANGVPANISVVRSLQ---NSLARRT-AMTAGKRRELHA 190 P + + R + E R+ A +PA R+L +LAR A Sbjct: 184 PLTGEAEVRAVAEGSWGRSEAKARLLPA--DRARALVLGDPALARYLEAQGFLVEEAFRR 241 Query: 191 LEEN------LAIISNSEPAQLLEEERLRKEIAELRAKIERVPFIDTFDLRY-KNYEKRP 243 E L ++ A + LR+ L + F +D ++ +P Sbjct: 242 PLEADLVAVGLGVLDLPPGAPEALRDYLRRGGGLLFTATPKGLFFGGWDRALPEDLPLKP 301 Query: 244 DPSSQAVMFCLMDVSGSMDQSTKDMAKRFYILLYLFLSRTYKNVEVVYIRHH-------- 295 A + ++DVSGSM+ MA + L + V++ Sbjct: 302 LGRKGAALVLVLDVSGSMEGEKLAMAVAGALELVRSAAPEDYLGVVLFSSSPRVLFPPRP 361 Query: 296 --TQAKEVDEHEFFYSQETGGTIVSSALKLMDEVVKERYNPAQWNIYAAQASDGDNWADD 353 Q K+ E + GGT++ A + +++ + SDG + Sbjct: 362 MTAQGKKEAESLLLSLRAGGGTVLGGAFREALRLLQ---DVPVERKALLVLSDGIIFDPK 418 Query: 354 SPLCHEILAKKLLPVVRYYSYIEITRRAHQTLWREYEHLQSTFDNFAMQHIRDQDDIYPV 413 P LA V + A A Sbjct: 419 EP--ILALAATAGVEVSALALG---PDADAAFLEALAQRGGGRFYRAATPKELPRLFLKE 473 Query: 414 FRELFHKQ 421 +E+F + Sbjct: 474 GQEVFQGE 481 >UniRef50_UPI000155CC23 PREDICTED: similar to ITI-like protein n=3 Tax=Amniota RepID=UPI000155CC23 Length = 1374 Score = 73.3 bits (178), Expect = 2e-11, Method: Composition-based stats. Identities = 29/206 (14%), Positives = 57/206 (27%), Gaps = 26/206 (12%) Query: 239 YEKRPDPSSQAVMFCLMDVSGSMDQSTKDMAKRFYILLYLFLSRTYKNVEVVYI------ 292 + R P Q + ++DVSGSM + K+ ++ L V + Sbjct: 308 FAPRGLPPVQKNVVFVIDVSGSMFGTKMKQTKKAMHVILNDLHHDDYFNIVTFSDAVSVW 367 Query: 293 RHHTQAKEVDEH------EFFYSQETGGTIVSSALKLMDEVVKERYNPAQWN------IY 340 + + + + G T +++AL + V + Sbjct: 368 KASGSIQATPPNIKSAKVYVNKMEADGWTDINAALLVAASVFNQSTGETGRGKGLKKIPL 427 Query: 341 AAQASDGD-NWADDSPLCHEILAKKLLPV-VRYYSYIEITRRAHQTLWREYEHLQSTFDN 398 +DG+ AK+ L + + A L R Sbjct: 428 IIFLTDGEATAGVTVASRILSNAKQSLKGNISLFGLAF-GDDADYHLMRRLSLENRGVAR 486 Query: 399 FAMQHIRDQDDIYPVFRELFHKQNAT 424 I + D + F+ + A+ Sbjct: 487 ----RIYEDADATLQLKG-FYDEIAS 507 >UniRef50_A9SQ90 Predicted protein n=3 Tax=Physcomitrella patens subsp. patens RepID=A9SQ90_PHYPA Length = 778 Score = 73.3 bits (178), Expect = 2e-11, Method: Composition-based stats. Identities = 28/195 (14%), Positives = 55/195 (28%), Gaps = 26/195 (13%) Query: 239 YEKRPDPSSQAVMFCLMDVSGSMDQSTKDMAKRFYILLYLFLSRTYKNVEVVY----IRH 294 + Q + L+D SGSM + A + L + + Sbjct: 336 PDPSKITVFQRAVVFLLDRSGSMYGDPLNDALQALYSGLESLKPEDSFNIIAFDHETALF 395 Query: 295 HTQAKEVDEHEFFYSQ--------ETGGTIVSSALKLMDEVVKERYNPAQWNIYAAQASD 346 +Q + + ++ GGT + S L+ ++V+ P Y +D Sbjct: 396 SSQMERANSASILRAREWATEKCKARGGTDILSPLQQAFKLVEN--FPGAV-PYVFLITD 452 Query: 347 GDNWADDSPLCHEILAKK-------LLPVVRYYSYIEITRRAHQTLWREYEHLQSTFDNF 399 G A D+ + + P + + + + F Sbjct: 453 G---AVDNEKNICLTMQSRIVELGARAPRISTFGIGHYC-NYYFLKMLAVIGRGLSDVAF 508 Query: 400 AMQHIRDQDDIYPVF 414 A +R Q + V Sbjct: 509 ASGKLRGQMERMLVA 523 >UniRef50_C0Z595 Putative uncharacterized protein n=1 Tax=Brevibacillus brevis NBRC 100599 RepID=C0Z595_BREBN Length = 947 Score = 73.3 bits (178), Expect = 2e-11, Method: Composition-based stats. Identities = 34/207 (16%), Positives = 58/207 (28%), Gaps = 33/207 (15%) Query: 230 DTFDLRYKNYEKRPDPSSQAVMFCLMDVSGSMDQS-----TKDMAKRFYILLYLFLSRTY 284 DL+ K PS + +D SGSM +A+ I ++ Sbjct: 393 VHMDLKGKE----QLPSLGLQLV--IDKSGSMSSDARGADKMALAREAAIRATTMMNAQD 446 Query: 285 KNVEVVY--------IRHHTQAKEVDEHEFFYSQETGGTIVSSALKLMDEVVKERYNPAQ 336 + + + + + Q GGT + AL+L E VK + Sbjct: 447 YIGVIAFDDTPWDVVAPQSVTKLDEIQQQISRIQADGGTDIFPALQLGYERVKAMNTQRK 506 Query: 337 WNIYAAQASDGDNWADDSPLCHEILAKKLLPVVRYYSYIEITRRAHQTLWREYEHLQSTF 396 +DG + DD V + + + + L L Sbjct: 507 H---VILLTDGQSALDDDYEGLLQQMTAENITVSTVALGD---DSDRGLLEMIAELGKGR 560 Query: 397 DNFAMQHIRDQDDIYPVFRELFHKQNA 423 FA ++F K+ A Sbjct: 561 YYFANDAESIP--------KIFSKETA 579 Score = 44.4 bits (103), Expect = 0.008, Method: Composition-based stats. Identities = 13/124 (10%), Positives = 33/124 (26%), Gaps = 11/124 (8%) Query: 245 PSSQAVMFCLMDVSGSMDQSTKDMAKRFYILLYLFLSRTYKNVEVVYIRHHTQAKEVD-E 303 P + ++D S SM + + F K + + + Sbjct: 62 PVQAKTIVFVVDRSASMKDDPRVL--SFLREAVGQKQAADKYAVIAIGAEAAVDQPMTIR 119 Query: 304 HEFFYSQE---TGGTIVSSALKLMDEVVKERYNPAQWNIYAAQASDGDNWADDSPLCHEI 360 E T ++ ++L ++ +DG + D+ + Sbjct: 120 QEVQPLGVDVNRNATNLAEGIRLASAMIPTNARGK-----VVLLTDGLETSGDAARQTRL 174 Query: 361 LAKK 364 ++ Sbjct: 175 ARER 178 >UniRef50_Q86UX2 Inter-alpha-trypsin inhibitor heavy chain H5 n=40 Tax=Euteleostomi RepID=ITIH5_HUMAN Length = 942 Score = 73.3 bits (178), Expect = 2e-11, Method: Composition-based stats. Identities = 36/354 (10%), Positives = 93/354 (26%), Gaps = 29/354 (8%) Query: 80 DHFVQNDRIERPQGGGGGSGSGQGQASQDGEGQDEFVFQISKDEYLDL-LFEDLALPNLK 138 ++ + G G+ +AS +D+ F +S +E L L + +++ Sbjct: 118 SGDRVKEKRNKTTEENGEKGTEIFRASAVIPSKDKAAFFLSYEELLQRRLGKYEHSISVR 177 Query: 139 QNQQRQLTEYKTHRAGYTANGVPANISVVRSLQNSLAR-RTAMTAGKRRELHALEENLAI 197 Q + + + S Q R ++ E I Sbjct: 178 PQQLSGRLSVDVNILESAGIASLEVLPLHNSRQRGSGRGEDDSGPPPSTVINQNETFANI 237 Query: 198 ISNSEPAQLLEEERLRKEIAELRAKIERVPFIDTFDLRY------KNYEKRPDPSSQAVM 251 I Q + + + D++ + + P + Sbjct: 238 IFKPTVVQQARIAQNGILGDFIIR-YDVNREQSIGDIQVLNGYFVHYFAPKDLPPLPKNV 296 Query: 252 FCLMDVSGSMDQSTKDMAKRFYILLYLFLSRTYKNVEVVY-----IRHHTQAKEVDEH-- 304 ++D S SM + K + L + + + + + Sbjct: 297 VFVLDSSASMVGTKLRQTKDALFTILHDLRPQDRFSIIGFSNRIKVWKDHLISVTPDSIR 356 Query: 305 ----EFFYSQETGGTIVSSALKLMDEVVKE---RYNPAQWNI-YAAQASDGDNWADDSPL 356 + TGGT ++ AL+ ++ + ++ +DG ++ Sbjct: 357 DGKVYIHHMSPTGGTDINGALQRAIRLLNKYVAHSGIGDRSVSLIVFLTDGKPTVGETHT 416 Query: 357 CHEILAKKLLPVVRYYSYIEITRRAHQT-LWREYEHLQSTFDNFAMQHIRDQDD 409 + + + + L + + + +++D Sbjct: 417 LKILNNTREAARGQVCIFTIGIGNDVDFRLLEKLSLENCG----LTRRVHEEED 466 >UniRef50_D2UZF5 von Willebrand factor type A domain-containing protein n=1 Tax=Naegleria gruberi RepID=D2UZF5_NAEGR Length = 207 Score = 73.3 bits (178), Expect = 2e-11, Method: Composition-based stats. Identities = 21/172 (12%), Positives = 50/172 (29%), Gaps = 14/172 (8%) Query: 212 LRKEIAELRAKIER-VPFIDTFDLRYKNYEKRPDPSSQAVMFCLMDVSGSMDQSTKDMAK 270 + K + ++ Y ++ + SS + ++D SGSM ++ K Sbjct: 9 FSCTFEYDQVKYNQTQNMFGMASIKAPIYVEKENRSS-LDIIAVLDKSGSMSD-KIELVK 66 Query: 271 RFYILLYLFLSRTYKNVEVVYIRH-HTQAKEVDEHE---------FFYSQETGGTIVSSA 320 + + + + + V + + T K + T +S A Sbjct: 67 KSLLFMIDQMQARDRLGIVEFDANVSTTLKLTSMDNGGKKQAMNCVNNIKLGTTTNISGA 126 Query: 321 LKLMDEVVKERYNPAQWNIYAAQASDG-DNWADDSPLCHEILAKKLLPVVRY 371 + +++ R +DG + +KL + Sbjct: 127 IIEAFDILANRGGNISPTTSILLFTDGLPTVGVQQQDKIVNIVEKLYTKLNL 178 >UniRef50_D1YZJ4 Putative uncharacterized protein n=1 Tax=Methanocella paludicola SANAE RepID=D1YZJ4_METPS Length = 506 Score = 72.9 bits (177), Expect = 2e-11, Method: Composition-based stats. Identities = 29/179 (16%), Positives = 61/179 (34%), Gaps = 16/179 (8%) Query: 224 ERVPFIDTFDLRYKNYEKRPDPSSQAVMFCLMDVSGSMDQSTKDMAKRFYILLYLFLSRT 283 + T++L+ N+ ++ M ++D SGSM + +AK + L + + Sbjct: 308 WMEKKLLTYELKGVNWTDDSKK-NRGPMVAMVDTSGSMHGDPEIVAKSIILALVRRMMKE 366 Query: 284 YKNVEVVYIRHHTQAKEVDEH----------EFFYSQETGGTIVSSALKLMDEVVKERYN 333 ++V+V Q E++ +F GGT +AL+ E +K++ Sbjct: 367 SRDVKVYLFSSEGQTHEIEITDNKKMATEFLDFLSYTFEGGTDFDTALREGVESLKKK-- 424 Query: 334 PAQWNIYAAQASDGDNWADDSPLCHEILAKKLLPVVRYYSYIEITRRAHQTLWREYEHL 392 N +DG + + L + + I + + Sbjct: 425 -QYVNADILFITDGLSV-VNDKYVISGLEQMKRENGTRL-FTIIVGNDNAGGIDRFSDH 480 >UniRef50_A8MJ77 Magnesium chelatase n=2 Tax=Clostridiales RepID=A8MJ77_ALKOO Length = 629 Score = 72.9 bits (177), Expect = 2e-11, Method: Composition-based stats. Identities = 35/198 (17%), Positives = 66/198 (33%), Gaps = 13/198 (6%) Query: 176 RRTAMTAGKRRELHALEENLAIISNSEPAQLLEEERLRKEIAELRA-KIERVPFIDTFDL 234 R +GKR ++ + I P ++ + R + + Sbjct: 369 RFAVKGSGKRNKVKTDSKEGRYIRYRIPKGRPKDIAFDATFRIAACSQGGRNREGLSLVI 428 Query: 235 R-YKNYEKRPDPSSQAVMFCLMDVSGSMDQSTKDMAKR--FYILLYLFLSRTYKNVEVVY 291 R EK + + A + ++D SGSM + A + LL + + + Sbjct: 429 RSGDIREKVREKHTGATILFVVDASGSMGAKRRMGAVKGAVLSLLNDAYQKRDNVGIIAF 488 Query: 292 -------IRHHTQAKEVDEHEFFYSQETGGTIVSSALKLMDEVVK-ERYNPAQWNIYAAQ 343 + + T++ ++ + G T ++S L E++K +R A Y Sbjct: 489 RKDGADTLLNITRSVDLAQKCLTNLPTGGKTPLASGLYKAYELLKIDRIKNADALQYIVL 548 Query: 344 ASDGD-NWADDSPLCHEI 360 SDG N S E Sbjct: 549 VSDGKGNVPLFSENAIED 566 >UniRef50_A0CN87 Chromosome undetermined scaffold_22, whole genome shotgun sequence n=2 Tax=Paramecium tetraurelia RepID=A0CN87_PARTE Length = 951 Score = 72.9 bits (177), Expect = 2e-11, Method: Composition-based stats. Identities = 29/193 (15%), Positives = 54/193 (27%), Gaps = 25/193 (12%) Query: 237 KNYEKRPDPSSQAVMFCLMDVSGSMDQSTKDMAKRFYILLYLFLS--RTYKNVEVVYIRH 294 KN + + ++D SGSM +A + + + + + Sbjct: 19 KNEKTATINKGDLTIIGVIDASGSMSNCWAWLA-------NFWNKSIPKDNLIAITFSNN 71 Query: 295 HTQAKEVDEHEFF-YSQETGGTIVSSALKLMDEVVKERYNPAQWNIYAAQASDGDNWADD 353 T K+ E GGT + A ++ + P Q N+ SDG D+ Sbjct: 72 PTVLKDNKELNLDIGKHGGGGTEIVPAFVEFEKQLAN--VPTQNNVTVIFISDGQ---DN 126 Query: 354 SPLCHEILAKKLLP------VVRYYSYIEITRRAHQTLWREYEHLQSTFDNFAMQHIRDQ 407 S + K L + + E ++ + Sbjct: 127 SVKTLDTRMKNQLKGNLLNHRINFICLGVEKGFPTNLAMNLRELYHRGDPQIPAIYLIE- 185 Query: 408 DDIYPVFRELFHK 420 Y + F+K Sbjct: 186 ---YSSEQAFFNK 195 >UniRef50_C9LTL4 Magnesium-chelatase, subunit D/I family n=1 Tax=Selenomonas sputigena ATCC 35185 RepID=C9LTL4_9FIRM Length = 657 Score = 72.9 bits (177), Expect = 2e-11, Method: Composition-based stats. Identities = 45/368 (12%), Positives = 89/368 (24%), Gaps = 18/368 (4%) Query: 49 GESVSIPTEDISEPMFHQGRGGLRHRVHPGNDHFVQNDRIERPQGGGGGSGSGQGQASQD 108 G +P + +F R + Q + E S A Q+ Sbjct: 268 GRIYVMPQDIEEAALFVLAHRMSRKKEQREESSRRQREPQESQAEPQEESEDEADDAPQE 327 Query: 109 GEGQDEFVFQISKDEYLDLLFEDLALPNLKQNQQRQLTEYKTHRAGYTANGVPANISVVR 168 D + E + + + + +G +I V Sbjct: 328 ERPDDVLTRDATGGESKEQEDDRGESQEKEAQADEEQASPADGDSGGEDRDETHSIEAVM 387 Query: 169 SLQNSLARRTAMTAGK-RRELHALEENLAIISNSEPAQLL--EEERLRKEIAELRAKIER 225 + + L + GK R + A + A +R Sbjct: 388 ARLSLLRETVCVRKGKSGRRAIVQLDVPAGRPWRTSLPRTGRRIDLAFAATLRAAAPYQR 447 Query: 226 VPFIDT-FDLRYKNYEK-RPDPSSQAVMFCLMDVSGSMDQSTK-DMAKRFYILLYLFL-S 281 + +R ++ + A + L+D SGSM + M K + L Sbjct: 448 QRHGEQAVVIRAEDLRVWIRARRASANILFLVDASGSMGAKERMKMVKGAVLALLREAYQ 507 Query: 282 RTYKNVEVVYIRHH-------TQAKEVDEHEFFYSQETGGTIVSSALKLMDEVVKERYNP 334 + + + + R T++ E+ E G T ++ L +++ E Sbjct: 508 KRDRVGLIAFRRTSAETLLPMTRSVELAEKALRSLPTGGKTPLAEGLAAALKMMDELSRK 567 Query: 335 AQWNIYAAQASDGD-NWADDS---PLCHEILAKKLLPVVRYYSYIEITRRAHQTLWREYE 390 +DG N + + L E Sbjct: 568 EGAETVLVLVTDGRTNVSAAGKAKEEALRAAEEIARRDAHCIVLDTEKNFPKVGLAPEIA 627 Query: 391 HLQSTFDN 398 + Sbjct: 628 QRMNAGYA 635 >UniRef50_B9GN58 Predicted protein (Fragment) n=3 Tax=rosids RepID=B9GN58_POPTR Length = 705 Score = 72.9 bits (177), Expect = 2e-11, Method: Composition-based stats. Identities = 24/154 (15%), Positives = 49/154 (31%), Gaps = 21/154 (13%) Query: 242 RPDPSSQAVMFCLMDVSGSMDQSTKDMAKRFYILLYLFLSRTYKNVEVVYIRHHTQAKEV 301 P + + ++DVS SM + M KR L+ L + V + + + Sbjct: 324 DPSRRAPIDLITVLDVSASMTGAKLQMLKRAMRLVISSLGSADRLSIVAFSSSPKRLLPL 383 Query: 302 DEHE----------FFYSQETGGTIVSSALKLMDEVVKERYNPAQWNIY--AAQASDGDN 349 G+ V AL+ +V+++R + N SDG + Sbjct: 384 KRMTPNGQRSARRIIDRLVCGQGSSVGEALRKATKVLEDR---RERNPVASIMLLSDGQD 440 Query: 350 WADDSPLCHEILAKKLLPVVRYYSYIEITRRAHQ 383 + H + V + + + + + Sbjct: 441 ERSSTRFAHIEIP------VHSFGFGQSGGNSQE 468 >UniRef50_B7KCF7 von Willebrand factor type A n=1 Tax=Cyanothece sp. PCC 7424 RepID=B7KCF7_CYAP7 Length = 573 Score = 72.9 bits (177), Expect = 2e-11, Method: Composition-based stats. Identities = 23/195 (11%), Positives = 63/195 (32%), Gaps = 20/195 (10%) Query: 239 YEKRPDPSSQAVMFCLMDVSGSMDQSTKDMAKRFYILLYLFLSR---------TYKNVEV 289 ++ + D + ++D SGSMD + + K+ + ++ + EV Sbjct: 388 WKLQKDAGKTVYLMTVIDTSGSMDGAPLEAVKKGLRIASKEINPGNYVGLVTYGDRAAEV 447 Query: 290 V-YIRHHTQAKEVDEHEFFYSQETGGTIVSSALKLMD-EVVKERYNPAQWNIYAAQASDG 347 V + + G T + + + ++++++ N Y +DG Sbjct: 448 VPLGLFDELQHKRFLAAIDNLRADGATAMYDGMMIGLSKLMEQKKNNPDGRFYLLLLTDG 507 Query: 348 DNWADDSPLCHEILAKKLLPVVRYYSYIEITRRAHQTLWREYEHLQSTFDNFAMQHIRDQ 407 + + + + V +Y ++ +Q L+ + Sbjct: 508 QANMGVTFDEVKEVIEYSGVRVYPIAYGDV----NQEELEAIASLRE-----STVKKGTP 558 Query: 408 DDIYPVFRELFHKQN 422 +++ + + LF Sbjct: 559 ENVEDLLKGLFQTNL 573 >UniRef50_C9RRF6 von Willebrand factor type A n=3 Tax=Bacteria RepID=C9RRF6_FIBSS Length = 228 Score = 72.5 bits (176), Expect = 3e-11, Method: Composition-based stats. Identities = 21/172 (12%), Positives = 51/172 (29%), Gaps = 27/172 (15%) Query: 240 EKRPDPSSQAVMFCLMDVSGSMDQSTKDMAKRFYILLYLFLSRTYK---NVEVVYIRHHT 296 + +PS++ + ++D SGSM+ Y + E+ + Sbjct: 11 DLENNPSTRVPVCLVLDTSGSMEGQPISELNEGINCFYDAVRSDETALYAAEIAVVTFGG 70 Query: 297 QAKEVDEHEFFYSQ---------ETGGTIVSSALKLMDEVVKER------YNPAQWNIYA 341 A V + +F + GGT + A+ + +++++R + + Sbjct: 71 SA--VLKTDFSTLEHQPDSPNFFANGGTPMGEAMNMALDLLEKRKGEYKASGVDYYQPWI 128 Query: 342 AQASDGDNWADDSP--LCHEILAKKLL-PVVRYYSYIEITRRAHQTLWREYE 390 +DG D S + + + + + Sbjct: 129 VLMTDGKPNGDSSEYARAVQRTCEMIKNRKLTIFPIGIGE----DADMNALA 176 >UniRef50_B3QTN9 Vault protein inter-alpha-trypsin domain protein n=2 Tax=Bacteria RepID=B3QTN9_CHLT3 Length = 837 Score = 72.5 bits (176), Expect = 3e-11, Method: Composition-based stats. Identities = 25/169 (14%), Positives = 44/169 (26%), Gaps = 19/169 (11%) Query: 240 EKRPDPSSQAVMFCLMDVSGSMDQSTKDMAKRFYILLYLFLSRTYKNVEVVYIRHHTQAK 299 ++DVSGSM ++K L L T +++ Sbjct: 327 RVTEKMIPNREYIYIVDVSGSMFGQPIAISKELMKKLLGRLRPTETFNLLLFSGGSKLLS 386 Query: 300 EV--------DEHEFFYS---QETGGTIVSSALKLMDEVVKERYNPAQWNIYAAQASDGD 348 E E F+ GGT + AL + K+ + +DG Sbjct: 387 EKSLPATDKNIEKAFYALENEHGGGGTELLRALNRALGLPKKEAG----SRTFVVITDG- 441 Query: 349 NWADDSPLCHEILAKKLLPVVRYYSYIEITRRAHQTLWREYEHLQSTFD 397 + +K L ++ ++ L S Sbjct: 442 --YVSFEVETFETIRKNLNKANLFAVGIGNG-VNRFLIEGMARAGSGEP 487 >UniRef50_C3JL94 von Willebrand factor type A domain protein n=1 Tax=Rhodococcus erythropolis SK121 RepID=C3JL94_RHOER Length = 614 Score = 72.5 bits (176), Expect = 3e-11, Method: Composition-based stats. Identities = 25/189 (13%), Positives = 47/189 (24%), Gaps = 17/189 (8%) Query: 237 KNYEKRPDPSSQAVMFCLMDVSGSMDQSTKDMAKRFYILLYLFLSRTYKNVEVVYIRH-- 294 + + + + ++D SGSM S AK LS + Sbjct: 38 FSTPTAQAEETTSSVLFVVDTSGSMAGSPLAQAKDALRAGIGALSSGQAAGLRSFAGDCG 97 Query: 295 ---------HTQAKEVDEHEFFYSQETGGTIVSSALKLMDEVVKERYNPAQWNIYAAQAS 345 T ++ + G T AL+ P+ + S Sbjct: 98 NGGQLLVPVATDNRDQLNNATNQLTAGGTTPTPDALRAA-----AGDLPSTGDRTIILIS 152 Query: 346 DGDNWADDSPLCHEILAKKLLPVVRYYSYIEITRRAHQTLWREYEHLQSTFDNFAMQHIR 405 DG + D L +L R ++ ++ + F + Sbjct: 153 DGQSTCGDPCAVATELKTQLGIDFRVHAVGFNAPDVAESELSCIANATGGRY-FTATNTT 211 Query: 406 DQDDIYPVF 414 + D Sbjct: 212 ELSDAISAA 220 >UniRef50_C1XFF8 Mg-chelatase subunit ChlD n=1 Tax=Meiothermus ruber DSM 1279 RepID=C1XFF8_MEIRU Length = 298 Score = 72.5 bits (176), Expect = 3e-11, Method: Composition-based stats. Identities = 22/166 (13%), Positives = 42/166 (25%), Gaps = 11/166 (6%) Query: 244 DPSSQAVMFCLMDVSGSMDQSTK---DMAKR--FYILLYLFLSRTYKNVEVVYIRH---- 294 P + A + ++ SM Q+ M L L R K V + + Sbjct: 79 MPENLAGVILAIENGWSMRQTDIAPNRMVATQMAAKALVDKLPRHIKVGVVTFSGYGTLL 138 Query: 295 --HTQAKEVDEHEFFYSQETGGTIVSSALKLMDEVVKERYNPAQWNIYAAQASDGDNWAD 352 T ++ GG + L E + + S G + + Sbjct: 139 LPPTTDRKAIRQAIDNLDLGGGFSFTYGLLAALEALPQTPPEGSRPGVIVLFSHGHDVSG 198 Query: 353 DSPLCHEILAKKLLPVVRYYSYIEITRRAHQTLWREYEHLQSTFDN 398 + PL A + V + + ++ Sbjct: 199 NDPLKIADQALERGIQVHAIGVGTHGHNFDEEMLKKVADRTGGRYY 244 >UniRef50_B4BQC0 von Willebrand factor type A n=2 Tax=Geobacillus RepID=B4BQC0_9BACI Length = 668 Score = 72.5 bits (176), Expect = 3e-11, Method: Composition-based stats. Identities = 43/287 (14%), Positives = 73/287 (25%), Gaps = 59/287 (20%) Query: 98 SGSGQGQASQDGEGQDEFVFQISKDEY------LDLLFEDLALPNLKQNQQRQLTEYKTH 151 G DG+G F++ + E + + +N +++ T Sbjct: 45 EGWATAPGEGDGKGNSIRFFEVRIQLKGQNGTAVSYRTEAVEVEP-DRNGEKKYTFSLDM 103 Query: 152 RAGY-TANGVPANISVVRSLQNSLAR-----RTAMTAGKRRELHALEENLAIIS------ 199 R + +A G A +V L + E + A + Sbjct: 104 RGKWPSAPGTTATYEIVVDAYRVLGNGQEEVYFSFPQPPYEYTRQTETSTAKLDFSLSFS 163 Query: 200 -NSEPAQLLEEERLRKEIAELRAKIERVPFIDTFDLRYKNYEKRPDPSSQAVMFCLMDVS 258 + + R ++ + + P + +MDVS Sbjct: 164 QPEYAKPPDGDAQGRLDVTLIP--------------QGGVPAPVRPP---IDVVFVMDVS 206 Query: 259 GSMDQSTKDMAKRFYILLYLFLSRTY-KNVEVVYIRHHTQAKEVDEHEF----------- 306 GSM AK + Y N I K F Sbjct: 207 GSMTTMKLQSAKSALQAAVNYFKTNYHPNDRFALIPFSDDVKATSVVPFGSKSNVISQLD 266 Query: 307 ------FYSQETGGTIVSSALKLMDEVVKERYNPAQWNIYAAQASDG 347 GGT S+AL L + +N + Y +DG Sbjct: 267 AILDEGNRLTANGGTNYSAALSLA----QSYFNDPERKKYIIFLTDG 309 >UniRef50_A0LPK8 Vault protein inter-alpha-trypsin domain protein n=2 Tax=Deltaproteobacteria RepID=A0LPK8_SYNFM Length = 680 Score = 72.1 bits (175), Expect = 4e-11, Method: Composition-based stats. Identities = 28/184 (15%), Positives = 53/184 (28%), Gaps = 28/184 (15%) Query: 253 CLMDVSGSMDQSTKDMAKRFYILLYLFLSRTYKNVEVV-----------YIRHHTQAKEV 301 ++D+SGSM + +S + V Y+ + + Sbjct: 307 FVLDISGSMTGRKITTLIEGVSRVLGKMSANDRFRIVTFNTTAADFTGGYVPASPENVQT 366 Query: 302 DEHEFFYSQETGGTIVSSALKLMDEVVKERYNPAQWNIYAAQASDGD-NWADDSPLCHEI 360 Q G T + L L ++ +DG N Sbjct: 367 WMQRVKQIQAGGSTALFDGLDLAYRLLDGERTTG-----IVLVTDGVCNVGPTRHDEFLG 421 Query: 361 LAKKLLPVVRYYSYIEITRRAHQTLWREYEHLQSTFDNFAMQHIRDQDDI----YPVFRE 416 L K+ VR ++++ +Q L F ++ + DDI + Sbjct: 422 LLKQ--HDVRLFTFVIGNSA-NQPLMDRLAKESGGFA----MNVSESDDIAGRLIQAKAK 474 Query: 417 LFHK 420 +FH+ Sbjct: 475 VFHE 478 >UniRef50_Q46AG0 BatA n=3 Tax=Methanomicrobia RepID=Q46AG0_METBF Length = 317 Score = 72.1 bits (175), Expect = 4e-11, Method: Composition-based stats. Identities = 34/210 (16%), Positives = 55/210 (26%), Gaps = 31/210 (14%) Query: 243 PDPSSQAVMFCLMDVSGSMDQST-----KDMAKRFYILLYLFLSRTYKNVEVVYIRHHTQ 297 + +MDVSGSM + AK +L L V + T Sbjct: 83 EQTKEGVNVVLVMDVSGSMQAQDYTPSRLEAAKSSAEILINSLKSKDYAGIVTFESGATT 142 Query: 298 A---KEVDEHEFFYSQ----ETGGTIVSSALKLMDEVVKERYNPAQWNIYAAQASDG-DN 349 A E + + G T + L L ++ P + + SDG +N Sbjct: 143 AAYLSPYKEKVIEKLRNVAPKEGSTAIGDGLSLGIDMA--SSIPNKKKVIIL-LSDGVNN 199 Query: 350 WADDSPLCHEILAKKLLPVVRYYSYI-----------EITRRA---HQTLWREYEHLQST 395 SP AK V + + + + Sbjct: 200 AGYISPDEAIQYAKANNIQVYTIGMGSNGNVLLGYDWFGNPQYAELDEATLQAIANDTGG 259 Query: 396 FDNFAMQHIRDQDDIYPVFRELFHKQNATA 425 F + D+IY E ++ Sbjct: 260 KY-FKSIDDKTLDEIYKNISENIKREKEET 288 >UniRef50_Q54CQ8 von Willebrand factor A domain-containing protein DDB_G0292740 n=1 Tax=Dictyostelium discoideum RepID=Y2740_DICDI Length = 910 Score = 72.1 bits (175), Expect = 4e-11, Method: Composition-based stats. Identities = 30/206 (14%), Positives = 60/206 (29%), Gaps = 26/206 (12%) Query: 235 RYKNYEKRPDPSSQAVMFCLMDVSGSMDQSTKDMAKRFYILLYLFLSRTYKNVEVVYIR- 293 ++++ K + L+D SGSM + D A+R ++ L+ K + Sbjct: 332 KFESINKEDI-YQKGEFIFLIDCSGSMSGNPIDSARRALEIIIRSLNEQCKFNIYCFGSG 390 Query: 294 -----------HHTQAKEVDEHEFFYSQET-GGTIVSSALKLMDEVVKERYNPAQWNIYA 341 + + V GGT + +K +++ + +P ++ Sbjct: 391 FNKAFQEGSRKYDDDSLAVVNRYVSNISANLGGTELLQPIK---DILSKEIDP-EYPRQI 446 Query: 342 AQASDGDNWADDSPLCHEILAKKLLPVVRYYSYIEITRRAHQTLWREYEHLQSTFDNFAM 401 +DG A K R ++Y I L + Sbjct: 447 FILTDG---AVSDRSKLIEFVSKESKTTRIFTYG-IGSSVDVELVVGLSKACKGYY---- 498 Query: 402 QHIRDQDDIYPVFRELFHKQNATAKG 427 IR+ D+ +L Sbjct: 499 TLIRNSSDMETEVMKLLSIAFEPTLS 524 >UniRef50_Q01UI0 von Willebrand factor, type A n=1 Tax=Candidatus Solibacter usitatus Ellin6076 RepID=Q01UI0_SOLUE Length = 837 Score = 72.1 bits (175), Expect = 4e-11, Method: Composition-based stats. Identities = 33/318 (10%), Positives = 81/318 (25%), Gaps = 44/318 (13%) Query: 104 QASQDGEGQDEFVFQISKDEYLDLLFEDLALPNLKQNQQRQLTEYKTHRAGYTANGVPAN 163 QA+ + G +I+ + + FED +T + + + Sbjct: 249 QANVNAVGAIALAGKIAAQDLGEARFED------------SVTLRSPRVLLVSRDPAASE 296 Query: 164 ISVVRSLQNSLARRTAMTAGKRRELHALEEN----LAIISNSEPAQLLEEERLRKEIAEL 219 ++R+L + + + L+E + + Sbjct: 297 EHIIRALHAN---QFEVQQAPGGVPAKLDEFQLIVINNWDMESIPLAAKASLEEYVKKGG 353 Query: 220 RAKIERVPFIDTFDLRYKNYEK----------RPDPSSQAVMFCLMDVSGSMDQSTKDMA 269 D + K + P + ++D S SM+ ++A Sbjct: 354 GLVWIAGEHNIYVDKKGKPEDALERSLPAKLAPPRSPEGTAVVLIIDKSSSMEGRKIELA 413 Query: 270 KRFYILLYLFLSRTYKNVEVVYIRH--------HTQAKEVDEHEFFYSQETGGTIVSSAL 321 + I + L +++ + + + GGT ++ AL Sbjct: 414 RLAAIGVVENLRPIDSVGVLIFDNSFQWAVPIRKAEDRATIKKLISGITPDGGTQIAPAL 473 Query: 322 KLMDEVVKERYNPAQWNIYAAQASDGDNWADDSPLCHEILAKKLLPVVRYYSYIEITRRA 381 + + + +DG L K+ S + + + Sbjct: 474 TEAYQRI---LPQTAMYKHIVLLTDG----ISEEGDSMTLTKEAQANHVTISTVGLGQDV 526 Query: 382 HQTLWREYEHLQSTFDNF 399 ++ + F Sbjct: 527 NRAFLEKVASNADGKAYF 544 >UniRef50_C7P2A9 von Willebrand factor type A n=2 Tax=Halobacteriaceae RepID=C7P2A9_HALMD Length = 393 Score = 72.1 bits (175), Expect = 4e-11, Method: Composition-based stats. Identities = 29/194 (14%), Positives = 54/194 (27%), Gaps = 19/194 (9%) Query: 242 RPDPSSQAVMFCLMDVSGSMDQSTKDMAKRFYILLYLFLSRTYKNVEVVYIRHHTQ---- 297 + + + +D SGSM+ A+ ++ L+ V + T Sbjct: 30 EQETDVRRHIALCIDTSGSMEGDNIKRARDGAAWVFGLLADEDYVSIVAFDTEATVILPA 89 Query: 298 ------AKEVDEHEFFYSQETGGTIVSSALKLMDEVVKERYNPAQWNIYAAQASDGDNWA 351 ++ GGT + + LK E + SDG + Sbjct: 90 TRWSDLDRQTAMDHVEELTAGGGTDMYNGLKAAKETLSSSATGPDTVKRLLLLSDGKD-N 148 Query: 352 DDSPLCHEILAKKLLPVVRYYSYIEITRRAHQTLWREYEHLQSTFDNFAMQHIRDQDDIY 411 + +P E LA+ + I ++ R H+ DI Sbjct: 149 ERTPDEFEGLAEAIDDAGIRIQSAGIGTDYNEATIRTLGTAGRGT----WTHLEAPGDI- 203 Query: 412 PVFRELFHKQNATA 425 + F + A Sbjct: 204 ---EDFFGEAVEQA 214 >UniRef50_Q3M1S2 von Willebrand factor, type A n=1 Tax=Anabaena variabilis ATCC 29413 RepID=Q3M1S2_ANAVT Length = 592 Score = 71.7 bits (174), Expect = 5e-11, Method: Composition-based stats. Identities = 24/199 (12%), Positives = 47/199 (23%), Gaps = 25/199 (12%) Query: 238 NYEKRPDPSSQAVMFCLMDVSGSMDQSTKDMAKRFY-ILLYLFLSRTYKNVEVVY----- 291 + P + + L+D S SM K + K V + Sbjct: 42 APSSQQTPQA---IVLLIDASSSMSDGKLTEVKTAATKFVERRNLTQDKLAVVSFGLDIQ 98 Query: 292 -IRHHTQAKEVDEHEFFYSQETGGTIVSSALKLMDEVVKERYNPAQWNIYAAQASDGDNW 350 T + E E GGT ++ L ++ + + +DG Sbjct: 99 TATPLTDNADTLESAIASLSEAGGTPMAQGLDAAIGELQATFL----SRNILLFTDG--V 152 Query: 351 ADDSPLCHEILAKKLLPVVRYYSY--IEITRRAHQTLWREYEHLQSTFDNFAMQHIRDQD 408 D L + + + L + + D Sbjct: 153 PDSQALASLSAQSARSQRINLIAVATGDADTNY-------LAQLTADPSLVFYANSGQFD 205 Query: 409 DIYPVFRELFHKQNATAKG 427 + +KQ ++ Sbjct: 206 QAFRNAEAAIYKQLVESEA 224 >UniRef50_Q1VY89 Inter-alpha-trypsin inhibitor family heavy chain-related protein-hypothetical secreted or membrane-associated n=1 Tax=Psychroflexus torquis ATCC 700755 RepID=Q1VY89_9FLAO Length = 689 Score = 71.7 bits (174), Expect = 5e-11, Method: Composition-based stats. Identities = 47/328 (14%), Positives = 91/328 (27%), Gaps = 41/328 (12%) Query: 114 EFVFQISKDEYLDLLFEDL--ALPNLKQNQQRQLTEYKTHRAGYTANGVPANISVVRSLQ 171 E ++S E L F+++ P+ Q + + + I V L Sbjct: 140 EIEVELSYVELLPYSFDEVTFEYPSDYSAIQSDVIVFAQNLN--FNLFSERTIDDV-ELF 196 Query: 172 NSLARRTAMTAGKRRELHALEE-NLAIISNSEPAQLLEEE--RLRKEIAELRAKIERVPF 228 N++ T + E ++ I E + E V Sbjct: 197 NNVGTMTNDGNVATVLISENEAISINDILIKYQLASDELGVIPFSTLLE------EGVNE 250 Query: 229 IDTFDLRYKNYEKRPDPSSQ-----AVMFCLMDVSGSMDQSTKD-MAKRFYILLYLFLSR 282 D F + P+ ++ ++D SGSM K AK + L+ Sbjct: 251 CDDFGNGFFGLVVEPESNANTEVIEKNFVLIIDSSGSMRGGNKMAQAKEASEFIVNNLNI 310 Query: 283 TYKNVEVVY----IRHHTQAKEVDEHE-------FFYSQETGGTIVSSALKLMDEVVKER 331 + + + + E + G T +S +L + Sbjct: 311 GDNFNVIDFDNNIVLFQPELVEYNIQNSNAALDFIENIVALGATNISESLVTAINQFEAG 370 Query: 332 YNPAQWNIYAAQASD-GDNWADDSPLCHEILAK----KLLPVVRYYSYIEITRRAHQTLW 386 + NI +D G + + LA+ ++ + +++ I L Sbjct: 371 -AEDKANIIVF-FTDGGATEGETNTQNILQLAEDTVNQIETEIFLFTFG-IGEDVTTDLL 427 Query: 387 REYEHLQSTFDNFAMQHIRDQDDIYPVF 414 + F F + DI F Sbjct: 428 TLLAVQNNGFVTFLGD--NEIVDIISNF 453 >UniRef50_Q466I6 Putative uncharacterized protein n=3 Tax=Methanosarcina RepID=Q466I6_METBF Length = 562 Score = 71.7 bits (174), Expect = 5e-11, Method: Composition-based stats. Identities = 42/298 (14%), Positives = 93/298 (31%), Gaps = 29/298 (9%) Query: 116 VFQISKDEYLDLLFEDLAL--P------NLKQNQQRQLTEYKTHRAGYTANGVPANISVV 167 F +E L+ LF+ L L P ++K+ ++ Y+ + + Sbjct: 233 EFVTEMEENLE-LFDTLTLLFPQRNWSYSVKELKKEPFYVQLKMLKNYS-TFFEKSPDLK 290 Query: 168 RSLQNSLARRTAMT----AGKRRELHALEENLAIISNSEPAQLLEEERLRKEIAELRAKI 223 + + R ++ S + + + L + + Sbjct: 291 KIMDFIGRREFDPPSDRIRLSPFGKDRIQTVRFSDSINNLLPMEAAKLLNPSLKKKFYAD 350 Query: 224 ERVPFIDTFDLRYKNYEKRPDPSSQAVMFCLMDVSGSMDQSTKDMAKRFYILLYLFLSRT 283 + ++ K+Y P + M L+D SGSM + + +AK + + + Sbjct: 351 MLEGKLLSYQFLGKHYTGPPRIKPRGPMIVLVDTSGSMHGAPQTLAKSAVLAMAKLMLSQ 410 Query: 284 YKNVEVVYIRHHTQAKEVDEHEFFYSQE----------TGGTIVSSALKLMDEVVKERYN 333 ++++V+ +Q E++ E GGT ++AL + +KE+ Sbjct: 411 QRDMKVILFASTSQHLEIELSSRKKMSEKFLNFLLYTFGGGTDFNTALASGLKSLKEKDF 470 Query: 334 PAQWNIYAAQASDGDNWADDSPLCHEILAKKLLPVVRYYSYIEITRRAHQTLWREYEH 391 +DG ++ S ++ Y I + E Sbjct: 471 QGAD---LLFITDGK--SEVSDELVLARWEEAKKKYNAKVYSLIVGSSGAGGLSEISD 523 >UniRef50_B5JQC2 Vault protein inter-alpha-trypsin n=1 Tax=Verrucomicrobiae bacterium DG1235 RepID=B5JQC2_9BACT Length = 808 Score = 71.7 bits (174), Expect = 5e-11, Method: Composition-based stats. Identities = 29/180 (16%), Positives = 47/180 (26%), Gaps = 25/180 (13%) Query: 246 SSQAVMFCLMDVSGSMDQSTKDMAKRFYILLYLFLSRTYKNVEVVY-----------IRH 294 A +DVSGSM +A L + V + + Sbjct: 279 EGGADFVFALDVSGSMQGKLHTLA-SGVKKAIGQLKPEDRFRVVAFNNTAFDLNRGWVSA 337 Query: 295 HTQAKEVDEHEFFYSQETGGTIVSSALKLMDEVVKERYNPAQWNIYAAQASDGD-NWADD 353 GGT V + + L E R + + +DG N Sbjct: 338 TEANLRETFARLDQLNSNGGTNVYAGVHLALE----RLDADRVATLIL-VTDGVTNQGIV 392 Query: 354 SPLCHEILAKKLLPVVRYYSYIEITRRAHQTLWREYEHLQSTFDNFAMQHIRDQDDIYPV 413 P L K +R+Y ++ + L + + + DDI Sbjct: 393 DPKAFYKLMHK--QDLRFYGFLLGNSS-NWPLMQLMCDASGGSYR----AVSNSDDIIGE 445 >UniRef50_A8SU73 Putative uncharacterized protein n=1 Tax=Coprococcus eutactus ATCC 27759 RepID=A8SU73_9FIRM Length = 550 Score = 71.7 bits (174), Expect = 5e-11, Method: Composition-based stats. Identities = 34/250 (13%), Positives = 65/250 (26%), Gaps = 22/250 (8%) Query: 184 KRRELHALEENLAIISNSEPAQLLEEERLRKEIAELRAKIERVPFIDTFDLRYKNYEKRP 243 EL ++ + + E ++ ++ A+ + YK + Sbjct: 312 TAAELEGVKLVVDYCKSDEMQKIAAQKGFNANDDYTSAEEFSGAQVTQGLKTYKKTKDNG 371 Query: 244 DPSSQAVMFCLMDVSGSMDQSTKDMAKRFYILLYLFLSRTYKNVEVVY----------IR 293 + + D SGSMD + K +++ V Y + Sbjct: 372 K---DIIAVFVADCSGSMDGDPMNQLKNSLTNGAQYINDNNYVGLVSYSNSVTIEVPIAQ 428 Query: 294 HHTQAKEVDEHEFFYSQETGGTIVSSALKLMDEVVKE-RYNPAQWNIYAAQASDGDNWAD 352 + + +GGT A+ + +++ E + SDG Sbjct: 429 FDLNQRSYFQGAVNNLIASGGTASYDAVVVAVKMITEAKAQHPDAKCMLFLLSDGYANNG 488 Query: 353 DSPLCHEILAKKLLPVVRYYSYIEITRRAHQTLWREYEHLQSTFDNFAMQHIRDQDDIYP 412 S + V Y + S D DDI Sbjct: 489 YSMDEITSALRTSGIPVYTIGYGDDADTGELARLSGINEAASINA--------DSDDIIY 540 Query: 413 VFRELFHKQN 422 + LF+ Q Sbjct: 541 KIKSLFNSQL 550 >UniRef50_C7R936 Vault protein inter-alpha-trypsin domain protein n=1 Tax=Kangiella koreensis DSM 16069 RepID=C7R936_KANKD Length = 689 Score = 71.7 bits (174), Expect = 5e-11, Method: Composition-based stats. Identities = 25/192 (13%), Positives = 57/192 (29%), Gaps = 24/192 (12%) Query: 240 EKRPDPSSQAVMFCLMDVSGSMDQSTKDMAKRFYILLYLFLSRTYKNVEVVY-------- 291 + + M ++D SGSM + AK+ LS + + Sbjct: 320 DLVQQKTQPREMIFVIDSSGSMSGESMQQAKQGLYYALSQLSINDTFNIIDFDNDANKLF 379 Query: 292 ---IRHHTQAKEVDEHEFFYSQETGGTIVSSALKLMDEVVKERYNPAQWNIYAAQASDGD 348 + E+ ++ + GGT ++ A+ L + + +DG Sbjct: 380 DEAVPATLSNLEMAKYFVATLEADGGTEIAKAINLALD-----KPDSSLLRQVVFLTDG- 433 Query: 349 NWADDSPLCHEILAKKLLPVVRYYSYIEITRRAHQTLWREYEHLQSTFDNFAMQHIRDQD 408 + + + + L R ++ I + + + + I Sbjct: 434 --SIGNERQIFQMIENQLGNNRLFTIG-IGAAPNSYFMSKAANYGRGTFTY----IGKAS 486 Query: 409 DIYPVFRELFHK 420 ++ +LF K Sbjct: 487 EVQTKLEQLFKK 498 >UniRef50_A0CK50 Chromosome undetermined scaffold_2, whole genome shotgun sequence n=2 Tax=Paramecium tetraurelia RepID=A0CK50_PARTE Length = 1015 Score = 71.7 bits (174), Expect = 6e-11, Method: Composition-based stats. Identities = 28/177 (15%), Positives = 55/177 (31%), Gaps = 21/177 (11%) Query: 249 AVMFCLMDVSGSMDQSTKDMAKRFYILLYLFLSRTYKNVEVVYIRHHTQAKEVDEHEFF- 307 + ++D SGSM + L F +++ ++ I T+ K E Sbjct: 36 LTIIGVIDASGSMSGC--------WEWLSDFWNQSIPKENLITITFDTRQKISAEGVLSK 87 Query: 308 --YSQETGGTIVSSALKLMDEVVKERYNPAQWNIYAAQASDGDNWADDSPLCHEILAKKL 365 GGT + A + M+ +++ P Q NI SDG D++ + KKL Sbjct: 88 RIKDHGGGGTEIVPAFQTMETELQK--VPIQNNITVIFISDGQ---DNNVRTIDERMKKL 142 Query: 366 L-----PVVRYYSYIEITRRAHQTLWREYEHLQSTFDNFAMQHIRDQDDIYPVFREL 417 + + + + ++ + F + Sbjct: 143 GGNTQNRNINFICLGVESGFPTNISMNLRQLYHRGDPQIPAIYLIEHASPQAFFNKF 199 >UniRef50_A4XHD9 von Willebrand factor, type A n=2 Tax=Clostridia RepID=A4XHD9_CALS8 Length = 909 Score = 71.3 bits (173), Expect = 6e-11, Method: Composition-based stats. Identities = 26/196 (13%), Positives = 51/196 (26%), Gaps = 21/196 (10%) Query: 234 LRYKNYEKRPDPSSQAVMFCLMDVSGSMDQS------TKDMAKRFYILLYLFLSRTYKNV 287 L K K + + ++D SGSM S ++AK + L + Sbjct: 391 LPVKMQLKNKEKERNVAVVLVIDHSGSMGGSNLRNINKLEIAKSAAAKMIDHLESSDSVG 450 Query: 288 EVVY------IRHHTQAKEVDE--HEFFYSQETGGTIVSSALKLMDEVVKERYNPAQWNI 339 + + + K +E Q GGT + L ++K+ + Sbjct: 451 VIAFDHNFYWASKFGKLKSKNEVIENISTIQVGGGTAIIPPLTEAVNLLKKSKAKDK--- 507 Query: 340 YAAQASDGDNWADDSPLCHEILAKKLLPVVRYYSYIEITRRAHQTLWREYEHLQSTFDNF 399 +D + +AK+ + + Y S + Sbjct: 508 VIVLLTD-GYGEEGGYEYPASIAKRNNIKITTIGVGSSINAPILSWMAAYT---SGRFYY 563 Query: 400 AMQHIRDQDDIYPVFR 415 D + Sbjct: 564 VKDASNLIDVFLKEAK 579 >UniRef50_A8J658 Collagen-related protein n=1 Tax=Chlamydomonas reinhardtii RepID=A8J658_CHLRE Length = 387 Score = 71.3 bits (173), Expect = 6e-11, Method: Composition-based stats. Identities = 30/205 (14%), Positives = 59/205 (28%), Gaps = 17/205 (8%) Query: 198 ISNSEPAQLLEEERLRKEIAELRAKIERVPFIDTFDLRYKNYEKRPDPS---SQAVMFCL 254 ++ A RL +LR + Q + L Sbjct: 68 LTRRAAAIAGGYRRLYGSKDVTEKAAAPWTKPCPEELRASVEKLATRLVGDVQQVNVVFL 127 Query: 255 MDVSGSMDQSTKDMAKRFYILLYLFLSRTYKNVEVVYIRHHTQAKE------VDEHEFFY 308 +D SGS++ + F + L+ + N++V ++ + +D Sbjct: 128 VDGSGSVNAEEFEAMLGFCVDASNQLAESVPNLQVAVVQFSNDVRVEVGLAPLDSEALRK 187 Query: 309 -----SQETGGTIVSSALKLMDEVVKERYNPAQWNIYAAQASDG--DNWADDSPLCHEIL 361 + GGT V+ AL +++K P + +DG D++ Sbjct: 188 TTREMVRMNGGTNVAVALTKAGQLLKRDAAPDAM-RHVVLLTDGRVDSYQAHEARQVADQ 246 Query: 362 AKKLLPVVRYYSYIEITRRAHQTLW 386 V ++Y L Sbjct: 247 LADEQRHVSLFAYGVGRGVDRAELL 271 >UniRef50_D0LD98 von Willebrand factor type A n=1 Tax=Gordonia bronchialis DSM 43247 RepID=D0LD98_GORB4 Length = 423 Score = 71.3 bits (173), Expect = 6e-11, Method: Composition-based stats. Identities = 31/187 (16%), Positives = 46/187 (24%), Gaps = 16/187 (8%) Query: 247 SQAVMFCLMDVSGSMDQSTKDMAKRFYILLYLFLSRT--------YKNVEVVYIRHHTQA 298 + A + ++D SGSM A+R + L + +VV Sbjct: 37 APAALQVVLDRSGSMSGPPLAGAQRALAGVIGQLDPRDVFGVVTFDDDAQVVLPAAPLAD 96 Query: 299 KEVDEHEFFYSQETGGTIVSSALKLMDEVVKERYNPAQWN-IYAAQASDGD-NWADDSPL 356 K G T +SS + ++ A SDG N Sbjct: 97 KARAVDAVGSIVPGGCTDLSSGYLRGLQELRRATASAGIRGGTVLVISDGHVNRGIRDLD 156 Query: 357 CHEILAKKLLPV---VRYYSYIEITRRAHQTLWREYEHLQSTFDNFAMQHIRDQDDIYPV 413 + K Y R +TL + FA I Sbjct: 157 EFASITAKAAADGIITSTLGYG---RGYDETLLSAIARSGNGNHVFADDPDAAGAAIAGE 213 Query: 414 FRELFHK 420 L K Sbjct: 214 VDGLLSK 220 >UniRef50_Q6ZFR4 Zinc finger (C3HC4-type RING finger) protein family-like n=8 Tax=Oryza sativa RepID=Q6ZFR4_ORYSJ Length = 703 Score = 71.3 bits (173), Expect = 6e-11, Method: Composition-based stats. Identities = 25/174 (14%), Positives = 46/174 (26%), Gaps = 28/174 (16%) Query: 249 AVMFCLMDVSGSMDQSTKDMAKRFYILLYLFLSRTYKNVEVVYIRHHTQAKEVDE----- 303 + ++DVSGSM+ + KR L L + V + + + Sbjct: 269 VDLVTVLDVSGSMEGYKLALLKRAMGL----LGPGDRLAVVSFSYSARRVIRLTRMSEGG 324 Query: 304 -----HEFFYSQETGGTIVSSALKLMDEVVKERYNPAQWNIYAAQASDG-DN------WA 351 G T + L +V R SDG DN W Sbjct: 325 KASAKSAVESLHADGCTNILEGLVEAAKVFDGRRYRNAVASVIL-LSDGQDNYNVNGGWG 383 Query: 352 DDSPLCHEILA-----KKLLPVVRYYSYIEITRRAHQTLWREYEHLQSTFDNFA 400 + + +L + + +++ T + +F Sbjct: 384 ASNSKNYSVLVPPSFKRSGDRRLPVHTFGFGT-DHDASAMHTIAEETGGTFSFI 436 >UniRef50_UPI00017B0D26 UPI00017B0D26 related cluster n=2 Tax=Tetraodon nigroviridis RepID=UPI00017B0D26 Length = 856 Score = 71.3 bits (173), Expect = 6e-11, Method: Composition-based stats. Identities = 21/204 (10%), Positives = 58/204 (28%), Gaps = 26/204 (12%) Query: 239 YEKRPDPSSQAVMFCLMDVSGSMDQSTKDMAKRFYILLYLFLSRTYKNVEVVY-----IR 293 + + P+ + ++D S SM K + + L + + + + Sbjct: 217 FAPKDLPAVPKNVVFVIDTSASMLGKKMRQTKEALLTILGDLRPADRFNFISFSSRIRVW 276 Query: 294 HHTQAKEVDEHEFFY-------SQETGGTIVSSALKLMDEVVKERY-----NPAQWNIYA 341 + +GGT + A++ ++++ P ++ Sbjct: 277 QPGRLVPATPSAVRDAKKFVVMLPTSGGTDIDGAIQTGSSLLRDHLSGRDAGPNSVSLII 336 Query: 342 AQASDGD-NWADDSPLCHEILAK-KLLPVVRYYSYIEITRRAHQTLWREYEHLQSTFDNF 399 +DG + P A+ + ++ + L Sbjct: 337 F-LTDGQPTVGEVRPGAILGNARAAVRDKFCIFTIG-MGDDVDYRLLERMALDNCG---- 390 Query: 400 AMQHIRDQDDIYPVFRELFHKQNA 423 M+ I ++ D + + F+ + Sbjct: 391 MMRRIPEEADASSMLKG-FYDEIG 413 >UniRef50_D1HBR9 Whole genome shotgun sequence of line PN40024, scaffold_205.assembly12x (Fragment) n=1 Tax=Vitis vinifera RepID=D1HBR9_VITVI Length = 656 Score = 71.3 bits (173), Expect = 6e-11, Method: Composition-based stats. Identities = 27/128 (21%), Positives = 41/128 (32%), Gaps = 15/128 (11%) Query: 242 RPDPSSQAVMFCLMDVSGSMDQSTKDMAKRFYILLYLFLSRTYKNVEVVYIRHH------ 295 P + + ++DV G M + M KR L+ LS T + V + Sbjct: 277 NPARRAPIDLVTVLDVGGGMTGAKLQMMKRAMRLVISSLSSTDRLSIVAFSASSKRLMPL 336 Query: 296 ----TQAKEVDEHEFFYSQETGGTIVSSALKLMDEVVKERYNPAQWNIY--AAQASDGDN 349 T + GT ALK +V+++R + N SDG N Sbjct: 337 KRMTTTGRRSARRIIESLIAGQGTSAGEALKKASKVLEDR---RERNPVASIMLLSDGQN 393 Query: 350 WADDSPLC 357 S Sbjct: 394 ERVSSKST 401 >UniRef50_B5YKY5 Magnesium-chelatase subunit ChlD n=1 Tax=Thermodesulfovibrio yellowstonii DSM 11347 RepID=B5YKY5_THEYD Length = 614 Score = 71.3 bits (173), Expect = 6e-11, Method: Composition-based stats. Identities = 33/171 (19%), Positives = 57/171 (33%), Gaps = 17/171 (9%) Query: 229 IDTFDLRYKNYEKRPDPSSQAVMFCLMDVSGSMDQSTKDMAKR--FYILLYLFLSRTYKN 286 I DLRYK E+R + ++D SGSM + A + LL + K Sbjct: 412 IFDDDLRYKEKERR----MSHNVIFVVDGSGSMGVEQRMKATKGAVLSLLIDCYKKRDKV 467 Query: 287 VEVVY-------IRHHTQAKEVDEHEFFYSQETGGTIVSSALKLMDEVVK-ERYNPAQWN 338 +V+ + T + E+ G T +S+ L +++K + + Sbjct: 468 AMIVFRKDKAEILLPLTSSVELALKRLREIPTGGKTPLSAGLMEAYKLMKITHFKYPENR 527 Query: 339 IYAAQASDGD-NWADDSPLCHEILAKK--LLPVVRYYSYIEITRRAHQTLW 386 + +DG N + E L +L +I I Sbjct: 528 LLILIITDGKPNVSLSDKPVLEELKSVCFMLKDFPLTDFIVIDTEKKDKFM 578 >UniRef50_A6GBY0 Putative uncharacterized protein n=1 Tax=Plesiocystis pacifica SIR-1 RepID=A6GBY0_9DELT Length = 996 Score = 71.3 bits (173), Expect = 6e-11, Method: Composition-based stats. Identities = 23/186 (12%), Positives = 47/186 (25%), Gaps = 16/186 (8%) Query: 241 KRPDPSSQAVMFCLMDVSGSMD-QSTKDMAKRFYILLYLFLSRTYKNVEVVYI------- 292 +R + ++D SGSM D+ K L + + + + Sbjct: 520 ERQREQPTLALILVIDKSGSMSSGDRLDLVKEAARATARTLDPSDEIGVIAFDNSPQVLV 579 Query: 293 -RHHTQAKEVDEHEFFYSQETGGTIVSSALKLMDEVVKERYNPAQWNIYAAQASDGDNWA 351 + GGT AL+ ++ + A SDG++ Sbjct: 580 RLQPAANRLRISSSIRRLSAGGGTNAMPALREAY--LQLAGSKALVKHVIL-LSDGES-P 635 Query: 352 DDSPLCHEILAKKLLPVVRYYSYIEITRRAHQTLWREYEHLQSTFDNFAMQHIRDQDDIY 411 ++ ++ + S L R ++ Sbjct: 636 ENGINALLGDMRQ--SDITVSSVGVGDGAGKDFLIR-VAERGRGRYFYSEDGTDVPRIFS 692 Query: 412 PVFREL 417 RE+ Sbjct: 693 REAREV 698 >UniRef50_B5W3I3 von Willebrand factor type A n=4 Tax=Cyanobacteria RepID=B5W3I3_SPIMA Length = 228 Score = 71.3 bits (173), Expect = 7e-11, Method: Composition-based stats. Identities = 25/173 (14%), Positives = 50/173 (28%), Gaps = 24/173 (13%) Query: 238 NYEKRPDPSSQAVMFCLMDVSGSMDQSTKDMAKRFYILLYLFLSRTY---KNVEVVYIRH 294 E +P + L+D S SM D + L + K VE+ I Sbjct: 14 AVEFAENPEPRCPCVLLLDTSASMQGEPLDGLNAGLMTFRENLIKDELAKKRVEIAIITF 73 Query: 295 HTQAK----EVDEHEFFY--SQETGGTIVSSALKLMDEVVKERYNPAQWN------IYAA 342 Q K V F G T + +A+ +++ R + N + Sbjct: 74 DNQVKIIQDFVTADRFEPPLLNAQGQTYMGTAIGEALDMIASRKAEYRNNGITYYRPWVF 133 Query: 343 QASDGDNWADDSPLCHEILA----KKLLPVVRYYSYIEITRRAHQTLWREYEH 391 +DG+ + + + + ++ V +++ Sbjct: 134 MITDGEPQGESDRITEQAIKRIRDEEANKQVAFFAVGVEGAN-----MERLGE 181 >UniRef50_C6VU79 Sigma 54 interacting domain protein n=1 Tax=Dyadobacter fermentans DSM 18053 RepID=C6VU79_DYAFD Length = 614 Score = 71.3 bits (173), Expect = 7e-11, Method: Composition-based stats. Identities = 43/290 (14%), Positives = 85/290 (29%), Gaps = 58/290 (20%) Query: 68 RGGLRHRVHPGNDHFVQNDRIERPQGGGGGSGSGQ-GQASQDGEGQDEFVFQISKDEYLD 126 + R N ++ E P G G+ G+ + + EG E + Sbjct: 298 QPNERPASGDENPQNQKSQTPEAPDKTQNGQQDGECGEGNCESEG----------SERIA 347 Query: 127 LLFEDLALPNLKQNQQRQLTEYKTHRAGYTANGVPANISVVRSLQNSLARRTAMTAGKRR 186 + + +P L + +A AN VR+ R+TA A R Sbjct: 348 NIDFGMTVPELTKKP-------------ASAAPDVANGRDVRA------RQTAKGAAIRA 388 Query: 187 ELHALEENLAIISNSEPAQLLEEERLRKEIAELRAKIERVPFIDTFDLRYKNYEKRPDPS 246 ++A+ L + +L + + Sbjct: 389 VRSETASDIAMTD-------TILHALTRNPDDLTIGKADLHQKVRSGKAGR--------- 432 Query: 247 SQAVMFCLMDVSGSMD-QSTKDMAKRFYILLYLFLSRT--------YKNVEVVYIRHHTQ 297 ++ ++D SGSM + K + L + ++ VE + T+ Sbjct: 433 ---LILFVVDSSGSMAAGKRMEAVKGSVMKLLEDAYQKRNMVAVIAFRGVEATVLLEPTR 489 Query: 298 AKEVDEHEFFYSQETGGTIVSSALKLMDEVVKERYNPAQWNIYAAQASDG 347 + + E G T + AL++ ++++ SDG Sbjct: 490 STGLAEQALEQLPTGGRTPLPHALEMAEKMLASFAGRDTMEPLLVILSDG 539 >UniRef50_Q7UL83 Inter-alpha-trypsin inhibitor family heavy chain-related protein-hypothetical secreted or membrane-associated protein containing vWFA domain n=1 Tax=Rhodopirellula baltica RepID=Q7UL83_RHOBA Length = 764 Score = 70.9 bits (172), Expect = 9e-11, Method: Composition-based stats. Identities = 26/192 (13%), Positives = 57/192 (29%), Gaps = 25/192 (13%) Query: 239 YEKRPDPSSQAVMFCLMDVSGSMDQSTKDMAKRFYILLYLFLSRTYKNVEVVYIRHHT-- 296 + P + + ++D SGSM+ + F + L+ + + + T Sbjct: 333 WSIEPTEITPREVILVLDTSGSMNGPAISQLRLFADHVLDHLNPNDEFRVIAFSNRTTAF 392 Query: 297 --QAKEVDEHEF-------FYSQETGGTIVSSALKLMDEVVKERYNPAQWNIYAAQASDG 347 A + + +GGT + ALKL + + + Y +D Sbjct: 393 QPNAVSATDANIQSAKQFVRGLRASGGTNLLPALKLA---LGGEADESARPRYMILMTD- 448 Query: 348 DNWADDSPLCHEILAKKLLPVVRYY--SYIEITRRAHQTLWREYEHLQSTFDNFAMQHIR 405 + L + R + ++ + L + F + Sbjct: 449 -ALVGNDHSILRYLRQPEFQDARVFPIAFGAA---PNDYLISRAAEMGRGFS----MQVT 500 Query: 406 DQDDIYPVFREL 417 +QD+ + R Sbjct: 501 NQDNTPEIARRF 512 >UniRef50_Q09E12 Inter-alpha-inhibitor H4 heavy chain, putative n=1 Tax=Stigmatella aurantiaca DW4/3-1 RepID=Q09E12_STIAU Length = 540 Score = 70.9 bits (172), Expect = 9e-11, Method: Composition-based stats. Identities = 24/163 (14%), Positives = 46/163 (28%), Gaps = 17/163 (10%) Query: 251 MFCLMDVSGSMDQSTKDMAKRFYILLYLFLSRTYKNVEVVY-----IRHHTQAKEVDEH- 304 + ++D SGSM S +AK L+ V + E+ Sbjct: 72 VTFVIDTSGSMQGSRMQIAKDALKYCVTRLNPQDTFNVVRFSTDVEALFPALKSAQPENI 131 Query: 305 ----EF-FYSQETGGTIVSSALKLMDEVVKERYNPAQWNIYAAQASDGD-NWADDSPLCH 358 F + GGT + AL +++ + +DG + Sbjct: 132 QKAVAFVEQLEAIGGTAIDEALVRG---LQDNDGKSSAPHLLMFITDGQPTIGETDEGAI 188 Query: 359 EILAKKLLP-VVRYYSYIEITRRAHQTLWREYEHLQSTFDNFA 400 AK R +++ + + L + +F Sbjct: 189 AQHAKDGRKAKTRLFTFG-VGEDLNARLLDRLSSDGAGTSDFV 230 >UniRef50_C9SWV9 U-box domain containing protein n=1 Tax=Verticillium albo-atrum VaMs.102 RepID=C9SWV9_VERA1 Length = 662 Score = 70.6 bits (171), Expect = 1e-10, Method: Composition-based stats. Identities = 30/264 (11%), Positives = 67/264 (25%), Gaps = 33/264 (12%) Query: 169 SLQNSLARRTAMTAGKRRELHALEENLAIISNSEPAQLLEEERLRKEIAELRAKIERVPF 228 + +S +R T+ + A + + Q ++ V Sbjct: 11 TTGSSSSRATSNPSATPDSSVADDNMSITSEPATLVQDEIDDLTLSVHPLASRDGLLVKV 70 Query: 229 IDTFDLRYKNYEKRPDPSSQAVMFCLMDVSGSMDQST-----------------KDMAKR 271 R + P + + ++DVSGSMD + D+ K Sbjct: 71 EPPTTPREALQSGKRIPRAPCDIVLVIDVSGSMDDAAPAPVIPGQKDENTGLSILDLTKH 130 Query: 272 FYILLYLFLSRTYKNVEVVY----------IRHHTQAKEVDEHEFFYSQETGGTIVSSAL 321 + L + V + + + K + + Q GT + + Sbjct: 131 AARTILETLDERDRLGIVAFTTNAKVILSLVEMNPDNKVSAKDKIENLQPLNGTNMWHGI 190 Query: 322 KLMDEVVKERYNPAQWNIYAAQASDG-DNWADDSPLCHEILAK--KLLPVVRYYSYIEIT 378 ++ + + + +DG N L +L + + + Sbjct: 191 TEGIKLFSDCDSSSGRVPAMMVLTDGLPNSGCPRLGYIPKLRDMGQLPATIHTFGFGY-- 248 Query: 379 RRAHQTLWREYEHLQSTFDNFAMQ 402 L + + F Sbjct: 249 -HIRSGLLKSIAEIGGGNYAFIPD 271 >UniRef50_A8J0D9 Flagellar associated protein n=1 Tax=Chlamydomonas reinhardtii RepID=A8J0D9_CHLRE Length = 4349 Score = 70.6 bits (171), Expect = 1e-10, Method: Composition-based stats. Identities = 31/250 (12%), Positives = 61/250 (24%), Gaps = 39/250 (15%) Query: 207 LEEERLRKEIAELRAKIERVPFIDTFDLRYKNYEKRPDPSSQAVMFCLMDVSGSMDQSTK 266 E L ++ K + K + + + C++D SGSM Sbjct: 933 EAAEPLTLKVLPEYEKYALGKEACRAVISIKASAEVKQR-AHVALTCVLDRSGSMGGERI 991 Query: 267 DMAKRFYILLYLFLSRTYKNVEVVY----------IRHHTQAKEVDEHEFFYSQETGGTI 316 ++ + L L+ V Y +R +A+ + GGT Sbjct: 992 ELVRETCHFLIDQLTADDYLGIVSYSNTVREDVPLLRMTPEARRLAHTMISSLTLHGGTA 1051 Query: 317 VSSALKLMDEV-----------VKERYNPAQWNIYA---AQASDGD-NWADDSPLCHEIL 361 + + L+ + + + +DG + Sbjct: 1052 LYAGLEAGVKQQMAAASELKALAAAAGGGSDSSRIVHSCFLFTDGQATTGPCTVNEIMGQ 1111 Query: 362 AKKLL----PVVRYYSYIEITRRAHQTLWREYEHLQSTFDNFAMQHIRDQDDIYPVFREL 417 L + +++ L + QS + I DDI Sbjct: 1112 MTSLQSPADQNITVHTFGF-GDDHSVELLQGVAEAQSGVYYY----ISCADDIPSG---- 1162 Query: 418 FHKQNATAKG 427 F Sbjct: 1163 FGDALGGLLA 1172 >UniRef50_Q24CQ9 von Willebrand factor type A domain containing protein n=1 Tax=Tetrahymena thermophila SB210 RepID=Q24CQ9_TETTH Length = 856 Score = 70.6 bits (171), Expect = 1e-10, Method: Composition-based stats. Identities = 31/195 (15%), Positives = 59/195 (30%), Gaps = 27/195 (13%) Query: 246 SSQAVMFCLMDVSGSMDQSTKDMAKRFYILLYLFLSRTYKNVEVVY-----------IRH 294 SS++ L+D SGSM+ A L L + +++ Sbjct: 316 SSRSEFIFLLDRSGSMNGRPIKKATEALNLFLKSLPPNSYFNVYSFGTRYVPMFPNSVQY 375 Query: 295 HTQAKEVDEHEFFYSQET-GGTIVSSALKLMDEVVKERYNPAQWNIYAAQASDGDNWADD 353 ++ E+ + + G T + S L + EV ++ +N +DG Sbjct: 376 TGKSLEIALKKVKNFKANLGRTDILSPLTNIFEVQEK---INGYNKQIFLLTDG---GVK 429 Query: 354 SPLCHEILAKK--LLPVVRYYSYIEITRRAHQTLWREYEHLQSTFDNFAMQHIRDQDDIY 411 + L KK + + A Q L + I ++D+ Sbjct: 430 NRDKVIRLIKKNNKNSRINSIGFG---SGADQHLINTSAIAGKG----ISKIIDMEEDLS 482 Query: 412 PVFRELFHKQNATAK 426 V E+ + Sbjct: 483 EVVIEMLGNCITPSL 497 >UniRef50_C6PWL8 von Willebrand factor type A n=1 Tax=Clostridium carboxidivorans P7 RepID=C6PWL8_9CLOT Length = 422 Score = 70.2 bits (170), Expect = 1e-10, Method: Composition-based stats. Identities = 29/185 (15%), Positives = 58/185 (31%), Gaps = 22/185 (11%) Query: 251 MFCLMDVSGSMDQSTKDMAKRFYILLYLFLSRTYKNVEVVYIRHHTQAK----------- 299 + +D SGSM + + + + + KN + A+ Sbjct: 116 IVFAIDTSGSMKNTDPNNER--FSAALNLIDNMDKNNRFSMYKFDDTAEKIIPMSQVTKQ 173 Query: 300 ---EVDEHEFFYSQETGGTIVSSALKLMDEVVKERYNPAQWNIYAAQASDGDNWADDSPL 356 EV G T + AL+ E +K + N SDG + D S Sbjct: 174 SREEVSGKLKDMQNPKGNTNMRDALEKAYEEIKSSETKDK-NAMVIMLSDGGDTYDLSKK 232 Query: 357 CHEILAKKLLPVVRYYSYIEITRRAHQTLWREYEHLQSTFDNFAMQHIRDQDDIYPVFRE 416 E L + Y+ + ++ +E ++++ D+ VF + Sbjct: 233 FDETLKPFKEKNISIYTIGMSNGN-NFSMLKEIAKESGGNYY----NVKEIKDLKNVFNK 287 Query: 417 LFHKQ 421 ++ + Sbjct: 288 IYRDR 292 >UniRef50_A9U149 Predicted protein n=1 Tax=Physcomitrella patens subsp. patens RepID=A9U149_PHYPA Length = 1185 Score = 70.2 bits (170), Expect = 1e-10, Method: Composition-based stats. Identities = 33/203 (16%), Positives = 62/203 (30%), Gaps = 22/203 (10%) Query: 237 KNYEKRPDPSSQAVMFCLMDVSGSMDQSTKDMAKRFYILLYLFLSRTYKNVEVVYIRHHT 296 P S + + ++D SGSM + A + L + ++ + Sbjct: 453 FLPRFALRPMSSSELIFVVDRSGSMQGTPIKQAGQALELFLRSIPCEDHYFNIIGFGDNH 512 Query: 297 -----QAKEVDEHEFFY-------SQET-GGTIVSSALKLMDEVVKERYNPAQWNIYAAQ 343 ++ +E + GGT + SA + + E + R P Q Sbjct: 513 KTLFPKSTPYNEETLTKGLRYAQALEADMGGTEMMSAFEEIFEH-RRRDVPTQ----IFL 567 Query: 344 ASDGDNWADDSPLCHEILAKKLLPV---VRYYSYIEITRRAHQTLWREYEHLQSTFDNFA 400 +DG+ W DS + AKK VR +S I L + Sbjct: 568 LTDGEIWDVDSLIECIRDAKKEEKSDNFVRVFSLG-IGSNVSHHLVESVGRAADGYAQLI 626 Query: 401 MQHIRDQDDIYPVFRELFHKQNA 423 ++ R + + + + Sbjct: 627 VEGERMEKKVINMLKSALVPAVT 649 >UniRef50_UPI000180D2FB PREDICTED: similar to inter-alpha (globulin) inhibitor H5 n=1 Tax=Ciona intestinalis RepID=UPI000180D2FB Length = 1586 Score = 70.2 bits (170), Expect = 1e-10, Method: Composition-based stats. Identities = 27/231 (11%), Positives = 56/231 (24%), Gaps = 31/231 (13%) Query: 204 AQLLEEERLRKEIAELRAKIERVPFIDTFDLRYKNYEKRPDPSSQAVMFCLMDVSGSMDQ 263 + + E +R + + ++ P + L+DVSGSM Sbjct: 904 SPFGLNGKFVIEYDVF---RDRTTEMVIDQSYFAHFITSNLPPMSKRVVFLIDVSGSMFG 960 Query: 264 STKDMAKRFYILLYLFLSRTYKNVEVVYIRH------HTQAKEVDEHEFFYSQE------ 311 D ++ + L+ T + + A + Sbjct: 961 IKIDQVRQAMNTILHGLAETDFFSVIAFNSSVSRWSPSGTAAVLASGTTANINSAMNFLN 1020 Query: 312 -----TGGTIVSSALKLMDEVVKERYNPAQWNI---YAAQASDGDNWADD-SPLCHEILA 362 GGT + A++ ++ N + +DG S Sbjct: 1021 TTVVTRGGTDILQAVEAAIQLFDSA-ATGGTNTASDFMVLLTDGRPTDGTVSSTAIISAI 1079 Query: 363 KKLLP---VVRYYSYIEITRRAHQTLWREYEHLQSTFDNFAMQHIRDQDDI 410 + L + + + L R+ S + I Sbjct: 1080 RNLNRGRFGINTIGFGTL---VDMNLLRKIAAQNSGTSIQIFIDLNSYAQI 1127 >UniRef50_D2V048 Predicted protein n=2 Tax=Naegleria gruberi RepID=D2V048_NAEGR Length = 1065 Score = 70.2 bits (170), Expect = 1e-10, Method: Composition-based stats. Identities = 27/174 (15%), Positives = 53/174 (30%), Gaps = 22/174 (12%) Query: 245 PSSQAVMFCLMDVSGSMDQSTKDMAKRFYILLYLFLSRTYKNVEVVYIRHHTQAKEVDE- 303 + ++ +D SGSM S AK L + + +++I + ++ +D Sbjct: 80 KNQGKLLIIALDKSGSMAGSGISEAKLALETLLSNVEGCNER--ILFIVFDSNSELIDMT 137 Query: 304 --------HEFFYSQETGGTIVSSALKLMDEVVKERYNPAQW-NIYAAQASDGDNWADDS 354 GGT SS K + Y + + +DG + + Sbjct: 138 NMELENKLQVVKKVSAGGGTDFSSVFK-----IIRNYGGSLNGQVAIIFFTDGQDQYSSN 192 Query: 355 ---PLCHEILAKKLLPVVRYYSYIEIT--RRAHQTLWREYEHLQSTFDNFAMQH 403 + L ++L Y + I L + L + +F Sbjct: 193 STREGSIKSLQERLNTESESYEFHTIGFTSVHDARLLTDITRLGTAQGSFQFAE 246 >UniRef50_UPI000180D3E0 PREDICTED: similar to LOC779593 protein n=1 Tax=Ciona intestinalis RepID=UPI000180D3E0 Length = 1012 Score = 70.2 bits (170), Expect = 2e-10, Method: Composition-based stats. Identities = 27/279 (9%), Positives = 61/279 (21%), Gaps = 63/279 (22%) Query: 203 PAQLLEEERLRKEIAELRAKIERVPFIDTFDLRYKNYEKRPDPSSQAVMFCLMDVSGSMD 262 + L R ++ I ++ + ++ ++D SGSM Sbjct: 282 YSPFGLSGTLVVRYDVERTQMFGDIAIHN-GFFAHHFAPPSLAAFPKLVVFVVDTSGSMF 340 Query: 263 QSTKDMAKRFYILLYLFLSRTYKNVEVVY------------IRHHTQAKEVDEHEFFYSQ 310 K+ L+ VV+ T++ Sbjct: 341 GYKLKQVKQALADSLRSLNNEDHFNIVVFGDTAEPWISGVLSTASTRSINDAITYVDAVS 400 Query: 311 ETGGTIVSSALKLMDEVVK-----------------------------ERYNPAQ----- 336 GGT + AL+ +++ + + Sbjct: 401 ARGGTNMLVALQTAFAIMEPYLPSLPENETMVEDTTPFPTPVPLQPETNHFIRKRATETQ 460 Query: 337 -----WNIYAAQASDG----DNWADDSPLCHEILAKKLLPVVRYYSYIEITRRAHQTLWR 387 + +DG D+ D + + + Sbjct: 461 TELSNYAKMIVFLTDGRPTKDDVGTDDIASRIEKINGGRVNLHTIGFGSL---VDMRFLE 517 Query: 388 EYEHLQSTFDNFAMQHIRDQDDIYPVFRELFHKQNATAK 426 + L + + + D R F + +A Sbjct: 518 KLAALNGG----VSRRVFESLDAATQIRHFFDEVSAPVL 552 >UniRef50_A2E1S5 von Willebrand factor type A domain containing protein n=2 Tax=Trichomonas vaginalis RepID=A2E1S5_TRIVA Length = 688 Score = 70.2 bits (170), Expect = 2e-10, Method: Composition-based stats. Identities = 27/172 (15%), Positives = 49/172 (28%), Gaps = 19/172 (11%) Query: 247 SQAVMFCLMDVSGSMDQSTKDMAKRFYILLYLFLSRTYKNVEVVY------IRHH----T 296 S + + ++D SGSM S AK + L + + + H Sbjct: 232 SNSEFYFIIDCSGSMSGSCIQNAKLCLNIFMHSLPIGCRFSIIKFGSDYEVALHPCDYTD 291 Query: 297 QAKEVDEHEFFYSQET-GGTIVSSALKLMDEVVKERYNPAQWNIYAAQASDGDNWADDSP 355 + + GGT + S LK + E+ + +DG D + Sbjct: 292 ENVSEAMKQLNNIDAEMGGTDILSPLKYVMEL----TPKQGFIKQVFLLTDGQ---DSNT 344 Query: 356 LCHEILAKKLLPVVRYYSYIEITRRAHQTLWREYEHLQSTFDNFAMQHIRDQ 407 LA++ R +S + L F ++ Sbjct: 345 NELCALAQENRTNNRIFSIGIGSGADKD-LIINVSQKSGGNYVFVDDDESEK 395 >UniRef50_Q1NTK1 Von Willebrand factor, type A n=2 Tax=delta proteobacterium MLMS-1 RepID=Q1NTK1_9DELT Length = 771 Score = 70.2 bits (170), Expect = 2e-10, Method: Composition-based stats. Identities = 25/201 (12%), Positives = 59/201 (29%), Gaps = 29/201 (14%) Query: 239 YEKRPDPSSQAVMFCLMDVSGSMDQSTKDMAKRFYILLYLFLSRTYKNVEVVY------- 291 ++P P+S + L+D SGSM + AK+ + L +++ Sbjct: 258 PSEQPIPTS---LAILLDCSGSMAGDSIAQAKQAISDMLNLLRPEDYCNLIMFGSEVKSV 314 Query: 292 ----IRHHTQAKEVDEHEFFYSQET-GGTIVSSALKLMDEVV------KERYNPAQWNIY 340 + GGT + AL ++ + PA+ + Sbjct: 315 FPCQVAADKTNITTLRRAIRAIDADMGGTEMQKALVETLKMSPIYKPPEVEVVPARISRN 374 Query: 341 AAQASDGDNWADDSPLCHEILAKKLLP-VVRYYSYIEITRRAHQTLWREYEHLQSTFDNF 399 +DG W D + + +++ R ++ + + Sbjct: 375 ILLITDGQVWGD------KQILRRMAKSDHRVFTVG-VGGAVCEAFLHGLASQSGGACEL 427 Query: 400 AMQHIRDQDDIYPVFRELFHK 420 + + I + ++ + Sbjct: 428 VAPNEEMGEKIARQSKRVYAE 448 >UniRef50_A2E6Y7 von Willebrand factor type A domain containing protein n=4 Tax=Trichomonas vaginalis RepID=A2E6Y7_TRIVA Length = 720 Score = 69.8 bits (169), Expect = 2e-10, Method: Composition-based stats. Identities = 21/196 (10%), Positives = 53/196 (27%), Gaps = 19/196 (9%) Query: 239 YEKRPDPSSQAVMFCLMDVSGSMDQSTKDMAKRFYILLYLFLSRTYKNVEVVYIR----- 293 + ++ + ++D SGSM S + AK +L L + + + Sbjct: 231 PQFEGKVEQKSEFYFIIDCSGSMSGSRIENAKFCLNILIHSLPIGCRFSIIQFGNSYKEV 290 Query: 294 -----HHTQAKEVDEHEFFYSQET-GGTIVSSALKLMDEVVKERYNPAQWNIYAAQASDG 347 + + + GGT + S L+ + + ++ +DG Sbjct: 291 VSICDYSNKNVKYAMSAIARINADMGGTDILSPLEYVFK---KKLGKGFI-RKIFLLTDG 346 Query: 348 DNWADDSPLCHEILAKKLLPVVRYYSYIEITRRAHQTLWREYEHLQSTFDNFAMQHIRDQ 407 + + +K R ++ + L + Sbjct: 347 E---VHNSDMICSRVQKERENNRIFAIGLGSGA-DPGLIKNISAKSGGNYVLIADDDNMN 402 Query: 408 DDIYPVFRELFHKQNA 423 + I + + + Sbjct: 403 NMIVEIMKSALSPSLS 418 >UniRef50_A7RNW3 Predicted protein n=3 Tax=Nematostella vectensis RepID=A7RNW3_NEMVE Length = 798 Score = 69.8 bits (169), Expect = 2e-10, Method: Composition-based stats. Identities = 32/282 (11%), Positives = 69/282 (24%), Gaps = 30/282 (10%) Query: 155 YTANGVPANISVVRSLQNSLARRTAMTAGKRRELHALEENLAI-ISNSEPAQLLEEERLR 213 + + + V ++ RR K + + + A + L Sbjct: 201 FQPDPSDNCHASVTLAESHTFRRDVEIQIKSEDPFVAHALVEPGLPRPSDAPEDKTRGLA 260 Query: 214 KEIAELRAKIERVPFIDTFDLRYKNYEKRPDPSSQAVMFCLMDVSGSMDQSTKDMAKRFY 273 L+ + V F+ + + ++D SGSM S A R Sbjct: 261 ISTEFLQKPVAMVNFV--PAFKADDLTCGE-------FIFVVDRSGSMSGSRIKDAARTL 311 Query: 274 ILLYLFLSRTYKNVEVVY-----------IRHHTQAKEVDEHEFFYSQET-GGTIVSSAL 321 L L V + ++ + + + + GGT + L Sbjct: 312 QLFLKSLPDGCYFNIVGFGSSYKTLFSKSKTYNDETLKTATNHAAHLAADLGGTEILEPL 371 Query: 322 KLMDEVVKERYNPAQWNIYAAQASDGDNWADDSPLCHEILAKKLLPVVRYYSYIEITRRA 381 + + +DG+ + L + R +S+ + Sbjct: 372 RWVYSQ----SLIEGAPRQLFLLTDGE-VG--NTAQVISLVAENASTARVFSFGIGDGAS 424 Query: 382 HQTLWREYEHLQSTFDNFAMQHIRDQDDIYPVFRELFHKQNA 423 L + F + Q + + Sbjct: 425 -TELIKGVARAGHGSAEFVRGQDKLQVKVIKTLKRALQPALT 465 >UniRef50_D1IDZ7 Whole genome shotgun sequence of line PN40024, scaffold_19.assembly12x (Fragment) n=1 Tax=Vitis vinifera RepID=D1IDZ7_VITVI Length = 478 Score = 69.8 bits (169), Expect = 2e-10, Method: Composition-based stats. Identities = 24/199 (12%), Positives = 53/199 (26%), Gaps = 33/199 (16%) Query: 176 RRTAMTAGKRRELHALEENLAIISNSEPAQLLEEERLRKEIAELRAKIERVPFIDTFDLR 235 R ++ E S + + + A I Sbjct: 123 RNSSNGNAAENNPVRTVEIKTYPEVSAAPRSKSYDNFTVLVHLKAAVANTGQNIQRN--M 180 Query: 236 YKNYEKRPDPSSQAVMFCLMDVSGSMDQSTKDMAKRFYILLYLFLSRTYKNVEVVYIRHH 295 + +P + + ++D+SGSM + + KR Sbjct: 181 SNSPLNSHNPRAPVDLVTVLDISGSMAGTKLALLKRAMGFAL------------------ 222 Query: 296 TQAKEVDEHEFFYSQETGGTIVSSALKLMDEVVKERYNPAQWNIY--AAQASDGDNWADD 353 GGT ++ L+ +V+++R + N SDG + Sbjct: 223 --------QAVNSLVANGGTNIAEGLRKGAKVMEDR---KERNPVSSIILLSDGQDTYTT 271 Query: 354 SPLCHEILAKKLLPVVRYY 372 + + A+ + ++ Sbjct: 272 ESVIQDAFAQCIGGLLSVV 290 >UniRef50_B8CHV3 VCBS n=1 Tax=Shewanella piezotolerans WP3 RepID=B8CHV3_SHEPW Length = 1477 Score = 69.8 bits (169), Expect = 2e-10, Method: Composition-based stats. Identities = 29/165 (17%), Positives = 52/165 (31%), Gaps = 24/165 (14%) Query: 247 SQAVMFCLMDVSGSMDQSTKDMAKR-----FYILLYLFLSRTYKNVEVVYIRHHTQAKEV 301 + + L+D SGSM S AK L+ + V V+ + AK + Sbjct: 1024 ADYNLAFLIDSSGSMGDSAVATAKAQILSVLATLITNANQPSAGTVNVLLVDFDQTAKIL 1083 Query: 302 DE-------------HEFFYSQETGGTIVSSALKLMDEVVKERYNPAQWNIYAAQASDGD 348 G T S+A + Y P N +DG+ Sbjct: 1084 IAIDLSSNDPLASITTALEAMSSGGTTNYSAAFTAAYNWFNDNY-PQGNNRT-FFITDGE 1141 Query: 349 -NWADDSPLCHEILAKKLLPVVRYYSYIEI---TRRAHQTLWREY 389 N + P + A+ ++ SY+E + + +++ Sbjct: 1142 PNTDNGQPGDYFENAQNAFALLNALSYVEAIGLGGNVNSSTLQQF 1186 >UniRef50_Q8YNZ7 Alr4412 protein n=6 Tax=Cyanobacteria RepID=Q8YNZ7_ANASP Length = 820 Score = 69.8 bits (169), Expect = 2e-10, Method: Composition-based stats. Identities = 25/200 (12%), Positives = 58/200 (29%), Gaps = 27/200 (13%) Query: 234 LRYKNYEKRPDPSSQAVMFCLMDVSGSMDQSTKDMAKRFYILLYLFLSRTYKNVEVVY-- 291 ++Y+ + P + L+D SGS + + L+ V + Sbjct: 289 IQYRQDQVVPK-----DVVFLIDTSGSQMGAPLMQCQELMRRFINGLNPDDTFSIVDFSD 343 Query: 292 ---------IRHHTQAKEVDEHEFFYSQETGGTIVSSALKLMDEVVKERYNPAQWNIYAA 342 + ++ Q + + GGT + ++ + +P + Sbjct: 344 TTRQLSPVPLANNAQNRTRAINYINQLSANGGTEMLRGIRAVLNFPVT--DPGRL-RSIV 400 Query: 343 QASDGDNWADDSPLCHEILAKKLLPVVRYYSYIEITRRAHQTLWREYEHLQSTFDNFAMQ 402 +DG + + + + L R YS+ ++ L L Sbjct: 401 LLTDG--YIGNENQILAEVQQHLKSGNRLYSFG-AGSSVNRFLLNRIAELGRGIAQII-- 455 Query: 403 HIRDQDDIYPVFRELFHKQN 422 + D + F++Q Sbjct: 456 RHDEPTD---EIVDKFYRQI 472 >UniRef50_Q6UXX5 Inter-alpha-trypsin inhibitor heavy chain H5-like protein n=3 Tax=Eutheria RepID=ITH5L_HUMAN Length = 1313 Score = 69.8 bits (169), Expect = 2e-10, Method: Composition-based stats. Identities = 41/306 (13%), Positives = 87/306 (28%), Gaps = 35/306 (11%) Query: 141 QQRQLTEYKTHRAGYTANGVPANISVVRSLQNSLARRTAMTAGKRRELHALEENLAIISN 200 ++ + + R G + +P +R+ + + E I+ Sbjct: 172 KRLSIEVTVSERTGISYVHIPP----LRTGRLRTNAHASEVDSPPSTRIERGETCVRITY 227 Query: 201 SEPAQLLEEERLRKEIAELRAKIERVPFIDTFDLRYKN------YEKRPDPSSQAVMFCL 254 Q +A+ + + V D++ + + R P + + + Sbjct: 228 CPTLQDQSSISGSGIMADFLVQYDVVMEDIIGDVQIYDDYFIHYFAPRGLPPMEKNVVFV 287 Query: 255 MDVSGSMDQSTKDMAKRFYILLYLFLSRTYKNVEVVY-----IRHHTQAKEVDEH----E 305 +DVS SM + + K ++ L + + + + + Sbjct: 288 IDVSSSMFGTKMEQTKTAMNVILSDLQANDYFNIISFSDTVNVWKAGGSIQATIQNVHSA 347 Query: 306 FFYSQ---ETGGTIVSSALKLMDEVV-KERYNPAQWNIY-----AAQASDGDNW-ADDSP 355 Y G T V+SAL V+ P + +DG+ +P Sbjct: 348 KDYLHCMEADGWTDVNSALLAAASVLNHSNQEPGRGPSVGRIPLIIFLTDGEPTAGVTTP 407 Query: 356 LCHEILAKK-LLPVVRYYSYIEITRRAHQTLWREYEHLQSTFDNFAMQHIRDQDDIYPVF 414 ++ L V +S A TL R + I + D Sbjct: 408 SVILSNVRQALGHRVSLFSLAF-GDDADFTLLRRLSLENRG----IARRIYEDTDAALQL 462 Query: 415 RELFHK 420 + L+ + Sbjct: 463 KGLYEE 468 >UniRef50_Q4RV83 Chromosome 15 SCAF14992, whole genome shotgun sequence. (Fragment) n=3 Tax=Euteleostomi RepID=Q4RV83_TETNG Length = 1434 Score = 69.4 bits (168), Expect = 2e-10, Method: Composition-based stats. Identities = 43/276 (15%), Positives = 82/276 (29%), Gaps = 26/276 (9%) Query: 164 ISVVRSLQNSLAR-RTAMTAGKRRELHALEENLAIISNSEPAQLLEEERLRKEIAELRAK 222 +S S L R R + + + + +SEP + LE R R L + Sbjct: 334 LSEAHSPLVILERGRFSFGQYEEQISSRRDFIRCTRKDSEPERKLEFVRKRYHKDILSSP 393 Query: 223 IERVPFIDTFDLRYKNYEKRPDPSSQAVMFCLMDVSGSMDQSTKDMAKRFYILLYLFLSR 282 + + F DL E + + L+D SGSM + K ++ L Sbjct: 394 VLMLNFC--PDLLG---EPLELHRATRELLFLVDRSGSMSGTKIQSVKEAMVIALKSLPP 448 Query: 283 TYKNVEVVY-------IRHHTQAKEVDE----HEFFYSQET-GGTIVSSALKLMDEVVKE 330 K V + + +V + GT + AL + + + Sbjct: 449 GTKLNIVGFGTTIKPLFTSSKLSTDVTILQACEYLQRMRADMKGTNLLGALSWVYQQPMQ 508 Query: 331 RYNPAQWNIYAAQASDGDNWADDSPLCHEILAKKLLPVVRYYSYIEITRRAHQTLWREYE 390 R + +DG + L ++ R + + +A + L + Sbjct: 509 RS----YPRQVFIITDG---CVSNVAKVLELVRRNACAGRCFGLG-LGPKACRRLLQGVA 560 Query: 391 HLQSTFDNFAMQHIRDQDDIYPVFRELFHKQNATAK 426 L + R Q + ++ F + Sbjct: 561 KLTGGTAEYLDDEERLQPKVIKSLKKAFEPVLTDVR 596 >UniRef50_A3DLZ3 von Willebrand factor, type A n=1 Tax=Staphylothermus marinus F1 RepID=A3DLZ3_STAMF Length = 416 Score = 69.4 bits (168), Expect = 2e-10, Method: Composition-based stats. Identities = 26/175 (14%), Positives = 48/175 (27%), Gaps = 10/175 (5%) Query: 251 MFCLMDVSGSMDQSTKDMAKRFYILLYLFLSRTYKNVEVVYIRH--------HTQAKEVD 302 ++D S SMD AK+ + L L + + Sbjct: 41 FLIVIDTSYSMDGEKIFRAKQAALRLLDILRDKDYVGVYGFAGKFYKVLEPVPATNRNEV 100 Query: 303 EHEFFYSQETGGTIVSSALKLMDEVVKERYNPAQWNIY-AAQASDGD-NWADDSPLCHEI 360 E + GT + LK + E K+ ++ +DG+ P Sbjct: 101 EKAIIGLKLGSGTNIYDTLKKLVEETKKVLESGAISLVRIIFITDGEPTTGQKKPEKILE 160 Query: 361 LAKKLLPVVRYYSYIEITRRAHQTLWREYEHLQSTFDNFAMQHIRDQDDIYPVFR 415 +AKKL I + ++ L + + + I + Sbjct: 161 MAKKLREAGASALIIGVGTEYNEKLLSRMAMVLNGEFEHVSDPASLEKLISEYAK 215 >UniRef50_Q5UWJ9 Calcium-binding protein-like n=1 Tax=Haloarcula marismortui RepID=Q5UWJ9_HALMA Length = 1562 Score = 69.4 bits (168), Expect = 2e-10, Method: Composition-based stats. Identities = 25/178 (14%), Positives = 45/178 (25%), Gaps = 17/178 (9%) Query: 237 KNYEKRPDPSSQAVMFCLMDVSGSMDQSTKDMAKRFYILLYLFLSRTYKNVEV-----VY 291 + + + MD SGSM S K + L + V Y Sbjct: 490 RQTDDDGLRPVDVTLV--MDTSGSMSSSVK-LRNTAGQRFVAGLLDVDRAAVVDFDSSAY 546 Query: 292 IRHH-TQAKEVDEHEFFYSQETGGTIVSSALKLMDEVVKERYNPAQWNIYAAQASDGDNW 350 + T GGT + S L + N ++ + +DG Sbjct: 547 VAQDLTSDFGAANSTLDNLGSGGGTDIGSGLSTANSQFASNSNDSRAQVMIL-LTDGRGN 605 Query: 351 ADDSPLCHEILAKKLLPVVRYYSYIEITRRAHQTLWREYEHLQSTFDNFAMQHIRDQD 408 S + Y A++ R+ ++ N+ + Sbjct: 606 GGISEA-------QTAANQNTTVYTVGFDNANRDKLRDIANITDGEFNYVTDRSELPN 656 >UniRef50_Q11Y10 Possible outer membrane protein n=1 Tax=Cytophaga hutchinsonii ATCC 33406 RepID=Q11Y10_CYTH3 Length = 1313 Score = 69.4 bits (168), Expect = 3e-10, Method: Composition-based stats. Identities = 26/216 (12%), Positives = 58/216 (26%), Gaps = 21/216 (9%) Query: 210 ERLRKEIAELRAKIERVPFIDTFDLRYKNYEKRPDPSSQAVMFCLMDVSGSMDQSTKDMA 269 ++L +I L + + K + +D+S SM + +A Sbjct: 51 DKLENQITTLTRESVSIHENGIEQQVVKVVNPAAVKPKSISLVLTIDISESMQKQYMPLA 110 Query: 270 KRFYILLYLFLSRTYKNVEVV------YIRHH-TQAKEVDEHEFFYSQETGGTIVSSALK 322 K + L V +I T+ + GGT + Sbjct: 111 KNAAAAIVNKLPLDISECAVTSFNDVSFINTDFTRDRFKLLQSIQTLVPAGGTDYNKGFI 170 Query: 323 L----MDEVVKERYNPAQWNIYAAQASDGDNWADDSPLCHEILAKKLLPVVRYYSYIEIT 378 +++K+ + +DG + D +P AK + V + Sbjct: 171 KSNAGGLDILKKGLHEK----VLIFLTDG--YGDVNPTEIIQQAKSIGAKVYVITLGMSA 224 Query: 379 RRAHQTLWREYEHLQSTFDNFAMQHIRDQDDIYPVF 414 + + + ++ + +Y Sbjct: 225 PEE----LKRIVTATNGSYYENVISEQEINAVYMSI 256 >UniRef50_Q7SGD8 Predicted protein n=4 Tax=Sordariales RepID=Q7SGD8_NEUCR Length = 1086 Score = 69.4 bits (168), Expect = 3e-10, Method: Composition-based stats. Identities = 26/204 (12%), Positives = 58/204 (28%), Gaps = 26/204 (12%) Query: 230 DTFDLRYKNYEKRPDPSSQAVMFCLMDVSGSMDQSTKDMAKRFYILLYLFLSRTYKNVEV 289 L K PS++ + + D SGSM + + K + + K Sbjct: 275 HQRALMATLVPKFNLPSTRPEIVFVCDRSGSMGGARIEGLKSALRIFLKSIPVGAKFNIC 334 Query: 290 VY------------IRHHTQAKEVDEHEFFYSQET-GGTIVSSALKLMDEVVKERYNPAQ 336 + + ++ + GGT + L+ E +RY Sbjct: 335 SFGSTFEFLFSDGSRSYDHESLRLAMDYVSRMDADLGGTEMYQPLEAAFE---KRY--ND 389 Query: 337 WNIYAAQASDGDNWADDSPLCHEILAKKLLP----VVRYYSYIEITRRAHQTLWREYEHL 392 ++ +DG+ W + + K + +R ++ +H L Sbjct: 390 MDLEVFLLTDGEIW---NQEHLFTMINKKVSESQGAIRLFTLGIGNDVSH-ALIEGAARA 445 Query: 393 QSTFDNFAMQHIRDQDDIYPVFRE 416 + F + + + + Sbjct: 446 GNGFAQSVTDSEKMNAKVVRMLKA 469 >UniRef50_UPI0001C378BC von Willebrand factor, type A n=1 Tax=Ruminococcus flavefaciens FD-1 RepID=UPI0001C378BC Length = 565 Score = 69.4 bits (168), Expect = 3e-10, Method: Composition-based stats. Identities = 28/195 (14%), Positives = 57/195 (29%), Gaps = 23/195 (11%) Query: 237 KNYEKRPDPSSQAVMFCLMDVSGSMDQSTKDMAKRFYILLYLFLSRTYKNVEVVYIRHHT 296 K ++ + ++D SGSM + + K +++ + V Y + Sbjct: 379 KLWKTNKNSGKPIAAVFVLDTSGSMSGAPLNSLKASLRNSIKYINSSNYIGVVSYSSNVN 438 Query: 297 QAKEVDE----------HEFFYSQETGGTIVSSALKLMDEVVKERYNPAQWNI--YAAQA 344 E+ + +G T SAL ++ + N+ Sbjct: 439 VDLELAKFDLNQQAYFMGAVDSLTASGNTATFSALSQAM-IMLRDFTKDNPNVSPMVFLL 497 Query: 345 SDGDNWADDSPLCHEILAKKLLPVVRYYSYIEITRRAHQTLWREYEHLQSTFDNFAMQHI 404 SDG + + + + Y A+ + + N A Sbjct: 498 SDGQSNSGSEFSDIDGAIATAQIPIYTIGY-----NANLNELKAISEI-----NEAATIN 547 Query: 405 RDQDDIYPVFRELFH 419 D DD+ + LF+ Sbjct: 548 ADTDDVIYQLKNLFN 562 >UniRef50_A3DK47 von Willebrand factor, type A n=9 Tax=cellular organisms RepID=A3DK47_CLOTH Length = 565 Score = 69.4 bits (168), Expect = 3e-10, Method: Composition-based stats. Identities = 36/257 (14%), Positives = 76/257 (29%), Gaps = 23/257 (8%) Query: 177 RTAMTAGKRRELHALEENLAIISNSEPAQLLEEERLRKEIAELRAKIERVPFIDTFDLRY 236 A+ + + L + + +S+ +L E + L Sbjct: 321 MYAIGNLTQEKKEILNKFVEFCKSSKSQELATEYGFNRLDDYLPEISNFDGEAIMKAQ-- 378 Query: 237 KNYEKRPDPSSQAVMFCLMDVSGSMDQSTKDMAKRFYILLYLFLSRTYKNVEVVY----- 291 K ++++ D ++ V + DVSGSM + K+ I ++S V Y Sbjct: 379 KLWKEKKDVNNDIVAVFVADVSGSMAGEPLNRLKQSLINGSKYISSDVSIGLVSYSTDVN 438 Query: 292 -----IRHHTQAKEVDEHEFFYSQETGGTIVSSALKLMDEVV-KERYNPAQWNIYAAQAS 345 + + + G T A+ + +++ +E+ + S Sbjct: 439 INLPIAKFDLNQRSLFVGAVESLAAGGNTATFDAIIVATKMLKEEKAKNPNAKLMLFVLS 498 Query: 346 DGDNWADDSPLCHEILAKKLLPVVRYYSYIEITRRAHQTLWREYEHLQSTFDNFAMQHIR 405 DG S + + K + Y A+ + + Sbjct: 499 DGVTNYGHSLNDIKDMMKTFGIPIYTIGY-----NANIKALETLSQINEAAN-----INA 548 Query: 406 DQDDIYPVFRELFHKQN 422 D +D+ LF+ Q Sbjct: 549 DTEDVVYQLGSLFNAQM 565 >UniRef50_B8KT14 Magnesium-chelatase 60 kDa subunit n=1 Tax=gamma proteobacterium NOR51-B RepID=B8KT14_9GAMM Length = 610 Score = 69.4 bits (168), Expect = 3e-10, Method: Composition-based stats. Identities = 49/319 (15%), Positives = 93/319 (29%), Gaps = 31/319 (9%) Query: 125 LDLLFEDLALPNLKQNQQRQLTEYKTHRAGYTANGVPANISVVRSLQNSLARRTAMTAGK 184 E+ P + + L + +PA + L+N +RR ++ AGK Sbjct: 295 EQAQDEEAQSPETEPDDPVPLEALSEQLLEASTAVLPAEMLGQLLLRNQPSRRQSLHAGK 354 Query: 185 RRELHALEENLAIISNSEPAQLLEEERLRKEIAELRAKIERVPFIDTFDLR--------- 235 ++ I E+ A +R+ L Sbjct: 355 SGATQRSKQRGRPIGVKPGNPKSGEKLNLVATLRTAAPWQRLRQQQRDGLNTGTNIAIET 414 Query: 236 --YKNYEKRPDPSSQAVMFCLMDVSGSMDQSTKDMAKRFYILLYLF-LSRTYKNVEVVY- 291 ++ R +S ++D SGS AK LL R + + + Sbjct: 415 SDFRVVRYRQRSAS--TTVFVVDASGSAALHRLAEAKGAVELLLAECYVRRDQVALIAFR 472 Query: 292 ------IRHHTQAKEVDEHEFFYSQETGGTIVSSALKLMDEVVKERYNPAQWNIYAAQAS 345 + T++ + GGT ++SAL+L+D V ++ + Y + Sbjct: 473 GTEAELLLPPTRSLVRAKRSLAELPGGGGTPLASALRLLDTVTEQIASHGGTPHYVL-LT 531 Query: 346 DG-DNWADDSPLCHEILAKKL------LPVVRYYSYIEITRRAHQTLWREYEHLQSTFDN 398 DG N E ++ L + + T E Sbjct: 532 DGRANIGLSGEPGREQASQDAEQTASHLRKRQLRGIVIDTSSRPSYRAEELA--GHLGAI 589 Query: 399 FAMQHIRDQDDIYPVFREL 417 +A + D+ V + + Sbjct: 590 YAPLPQANASDLQRVIQAV 608 >UniRef50_A7RVQ6 Predicted protein n=1 Tax=Nematostella vectensis RepID=A7RVQ6_NEMVE Length = 419 Score = 69.4 bits (168), Expect = 3e-10, Method: Composition-based stats. Identities = 32/190 (16%), Positives = 65/190 (34%), Gaps = 21/190 (11%) Query: 247 SQAVMFCLMDVSGSMDQSTKDMAKRFYILLYLFLSRTY---KNVEVVY-------IRHHT 296 +A + L+D SGS+ S + K+F L ++ + + + + + + Sbjct: 42 RKADLLFLLDTSGSLSLSNFNTEKKFIRNLLNVIAVGFDATRVEIITFGSDVNRRVPFIS 101 Query: 297 QAKEVDEHEFFY------SQETGGTIVSSALKLMDEVVKER-YNPAQWNI--YAAQASDG 347 +A E D F E G T + A + EV K + NI +DG Sbjct: 102 EAHEKDTKCTFNEKFANVVHEWGMTNMRGAFEKAYEVCKGTWSGKKRLNIKTTVILITDG 161 Query: 348 D-NWADDSPLCHEILAKKLLPVVRYYSYIEITRRAHQTLWREYEHLQSTF-DNFAMQHIR 405 NW +P + + V ++ + L + ++ F + + Sbjct: 162 HWNWPWQNPDPVPKAQQLIREGVEILAFGVGYGISLSNLQTITANQRAGHTYAFQISNFD 221 Query: 406 DQDDIYPVFR 415 + + + R Sbjct: 222 EFNKLATYLR 231 >UniRef50_B8F8Z6 von Willebrand factor type A n=1 Tax=Desulfatibacillum alkenivorans AK-01 RepID=B8F8Z6_DESAA Length = 480 Score = 69.0 bits (167), Expect = 3e-10, Method: Composition-based stats. Identities = 29/202 (14%), Positives = 52/202 (25%), Gaps = 23/202 (11%) Query: 239 YEKRPDPSSQAVMFCLMDVSGSMDQSTKDMAKRFYILLYLFLSRTYKNVEVVYIRH---- 294 + M ++D SGSM AK L L + V Y Sbjct: 83 LAPEQTKTKPVDMVIVLDRSGSMGGQKVRDAKAAVKGLVEGLRSQDRFSLVTYSNSVNGG 142 Query: 295 ------HTQAKEVDEHEFFYSQETGGTIVSSALKLMDEVVKERYNPAQWNIYAAQASDGD 348 + GGT + L+ V++ P + SDG Sbjct: 143 DGLHYLTADKRNSLNWMVDSIPAGGGTNLGGGLEKGVGVLRAYGAPDRMGKVIL-ISDGQ 201 Query: 349 -NWADDSPLCHEILAKKLLPVV--RYYSYIEITRRAHQTLWREYEHLQSTFDNFAMQHIR 405 N P +A + + I + ++ L + + Sbjct: 202 ANQGVTDPNQLAAMAALRDDGLVYSVTTVG-IGQDFNEQLMATVADGGRGRYYY----LE 256 Query: 406 DQDDIYPVFRELFHKQNATAKG 427 + D F +F ++ + Sbjct: 257 NPGD----FLAVFQEEANWTRA 274 >UniRef50_Q4RF07 Chromosome 13 SCAF15122, whole genome shotgun sequence. (Fragment) n=2 Tax=Tetraodon nigroviridis RepID=Q4RF07_TETNG Length = 983 Score = 69.0 bits (167), Expect = 3e-10, Method: Composition-based stats. Identities = 30/224 (13%), Positives = 66/224 (29%), Gaps = 29/224 (12%) Query: 220 RAKIERVPFIDTFDLRYKNYEKRPD----PSSQAVMFCLMDVSGSMDQSTKDMAKRFYIL 275 A + + +RP +S M L+D SGS+ T + + Sbjct: 44 PASPWMDARKTPSKIDLYDVRRRPWYIQGAASPKDMLILVDASGSVSGLTLKLIRTSVTE 103 Query: 276 LYLFLSRTYKNVEVVY--------------IRHHTQAKEVDEHEFFYSQETGGTIVSSAL 321 + LS V VVY ++ + + K++ + G T + L Sbjct: 104 MLETLS-DDDYVNVVYFNTQVKKTACFDHLVQANVRNKKLLKDAVQNITAKGITNYTKGL 162 Query: 322 KLMDEVVK-ERYNPAQWNIYAAQASDGDNWADDSPLCHEILAKKLL--PVVRYYSYIEIT 378 + E + + A N +DG + + +K VR +++ Sbjct: 163 EFAFEQLSVTNVSRANCNKIIMLFTDG------GEERAQAILEKYNADKKVRIFTFSVGQ 216 Query: 379 RRAHQTLWREYEHLQSTFDNFAMQHIRDQDDIYPVFRELFHKQN 422 + + + + + I + ++ + Sbjct: 217 HNYDKGPIQWMA-CSNKGYFYEIPSIGAIRINTQEYLDVLGRPM 259 >UniRef50_A6Q2J6 von Willebrand factor type A domain protein n=1 Tax=Nitratiruptor sp. SB155-2 RepID=A6Q2J6_NITSB Length = 289 Score = 69.0 bits (167), Expect = 3e-10, Method: Composition-based stats. Identities = 28/197 (14%), Positives = 56/197 (28%), Gaps = 25/197 (12%) Query: 236 YKNYEKRPDPSSQAVMFCLMDVSGSM-----DQSTKDMAKRFYILLYLFLSRTYKNVEVV 290 Y + + + +D SGSM D+ +K + + V+ Sbjct: 66 YSSIKLDDRKGRD--LVLALDASGSMEESLYDEKSKFEVVKSMAQNFFHKRFDDNIGIVI 123 Query: 291 Y---------IRHHTQAKEVDEHEFFYSQETGGTIVSSALKLMDEVVKERYNPAQWNIYA 341 + + + T+A + + S T + L + ++ + Sbjct: 124 FGSFAYIAAPLTYDTKALDFLINYLEPSIAGNNTAIGEGLWQGIKALQADTAKQK---VL 180 Query: 342 AQASDG-DNWADDSPLCHEILAKKLLPVVRYYSYIEITRRAHQTLWREYEHLQSTFDNFA 400 +DG N SP AKKL + + + L + +A Sbjct: 181 ILITDGHHNSGSISPRQAVEKAKKLGIKIYTIGLGDA----DKHLLEQIAKESGGKFFYA 236 Query: 401 MQHIRDQDDIYPVFREL 417 D I+ +L Sbjct: 237 KSE-EDLQSIFSELNKL 252 >UniRef50_C5FLY1 U-box domain containing protein n=1 Tax=Microsporum canis CBS 113480 RepID=C5FLY1_NANOT Length = 748 Score = 69.0 bits (167), Expect = 3e-10, Method: Composition-based stats. Identities = 23/209 (11%), Positives = 49/209 (23%), Gaps = 40/209 (19%) Query: 230 DTFDLRYKNYEKRPDPSSQAVMFCLMDVSGSMDQST----------------KDMAKRFY 273 ++ K P + ++D+S SM+ + D+ K Sbjct: 51 MVVSIQPPLKPKDDVPHVPCDIVLVIDISASMNSAAPIPTGESGGEDTGLSILDLTKHAA 110 Query: 274 ILLYLFLSRTYKNVEVVYIRHHTQAKEV----------DEHEFFYSQETGGTIVSSALKL 323 + L+ + V + A E+ T + +K Sbjct: 111 KTIIQTLNENDRLAVVTFCTEIRVAFELEFMSEENKSKVLAAIDCLHGISSTNLWHGIKE 170 Query: 324 MDEVVKERYNPAQWNIYAAQASDGD-----NWADDSPLCHEILAKKL-----LPVVRYYS 373 +V+ +DG P + L LP++ + Sbjct: 171 GLKVLATNSTQGNVQ-ALLVLTDGAPNHMCPAQGYVPKLRQTLLDHRDLTGSLPLIHTFG 229 Query: 374 YIEITRRAHQTLWREYEHLQSTFDNFAMQ 402 + L + + F Sbjct: 230 FGY---YLRSPLLQSIAEIGGGTFAFIPD 255 >UniRef50_A7S6T1 Predicted protein n=2 Tax=Nematostella vectensis RepID=A7S6T1_NEMVE Length = 1235 Score = 68.6 bits (166), Expect = 4e-10, Method: Composition-based stats. Identities = 27/209 (12%), Positives = 51/209 (24%), Gaps = 40/209 (19%) Query: 222 KIERVPFIDTFDLRYKNYEKRPDPSSQAVMFCLMDVSGSMDQS---------TKDMAKRF 272 + +D R + + S + ++D S SM D+AK Sbjct: 256 PAAPLKECHAYDNRLRPWYTSAAYPSTKKLVIVLDTSSSMASRVELGTKRRTRLDVAKAA 315 Query: 273 YILLYLFLSRTYKN------VEVVYIRHHTQAK--------------EVDEHEFFYSQET 312 + L K +V + + S+ Sbjct: 316 LSTILSTLLPQDKVGVVLFNSKVTLAGSSGVDECYSTRLAPAGRFNVNYLKDFINRSRPG 375 Query: 313 GGTIVSSALKLMDEVVK--ERYNPAQWNIYAAQASDGDNWADDSPLCHEILA-------K 363 GGT +A K ++K + + + +DG D L E L + Sbjct: 376 GGTQYQNAFKAAFTLLKSAKSGDGGGEQSFLLFLTDG--GPKDDALEVERLIAQNKKEME 433 Query: 364 KLLPVVRYYSYIEITRRAHQTLWREYEHL 392 + V + + Sbjct: 434 ESRERVTIMTIGLGKDEHMKDFLGRLSKN 462 >UniRef50_Q2QZN4 von Willebrand factor type A domain containing protein n=2 Tax=Oryza sativa Japonica Group RepID=Q2QZN4_ORYSJ Length = 574 Score = 68.6 bits (166), Expect = 4e-10, Method: Composition-based stats. Identities = 38/291 (13%), Positives = 89/291 (30%), Gaps = 66/291 (22%) Query: 170 LQNSLARRTAMTAGKRRELHALEENLAIISNSEPAQLLEEERLRKEIAELRAKIERVPFI 229 +++A++ A + + E ++ +R IA + K + + Sbjct: 41 YYSTIAKQCNKKATSKAAIDRQEVRVST------------TPIRAAIARDQRKDDFEVLV 88 Query: 230 DTFDLRYKNYEKRPDPSSQAVMFCLMDVSGSMDQST----------KDMAKRFYILLYLF 279 + EKR + + ++DVSGSM++ D+ K + Sbjct: 89 TVEAPKVVAPEKR----APIDLVAVLDVSGSMNKEEFVRGKHMSSRLDLLKIAMKYIIKL 144 Query: 280 LSRTYKNVEVVYI----------RHHTQAKEVDEHEFFYSQETGGTIVSSALKLM----- 324 + + V + R+ +++ E+ + +G T ALK Sbjct: 145 VRDADRLAIVSFNHAVVSEYGLTRNSADSRKKLENLVDKLKASGNTDFRPALKKAVEDMN 204 Query: 325 ------------DEVVKERYNPAQWNIY--AAQASDGDNWADDSPLCHEILAKK------ 364 +++ R + SDG + S + E +AK Sbjct: 205 IQNIKNSSAYNNFQILDGRGKEEKKKRVGFILLLSDGVDQFQYSRINWEKVAKSTDVDHS 264 Query: 365 ----LLPVVRYYSYIEITRRAHQTLWREYEHLQSTFDNFAMQHIRDQDDIY 411 +L +++ R+ L +F +++ + + + Sbjct: 265 EVGAMLRKYAVHTFGFSAS-HDPVPLRQISALSYGLYSFVCKNLDNITEAF 314 >UniRef50_A0CY84 Chromosome undetermined scaffold_307, whole genome shotgun sequence n=1 Tax=Paramecium tetraurelia RepID=A0CY84_PARTE Length = 625 Score = 68.6 bits (166), Expect = 4e-10, Method: Composition-based stats. Identities = 27/194 (13%), Positives = 63/194 (32%), Gaps = 34/194 (17%) Query: 247 SQAVMFCLMDVSGSMDQSTKDMAKRFYILLYLFLSRTYKNVEV------VYIRHHTQAKE 300 ++ ++D SGSM S + AK+ IL L + + + +++ + Q+ Sbjct: 397 NRGNYLFIIDRSGSMSGSRIEKAKQALILFLKSLPQDSEFNIISFGIADIFLFFNHQSVP 456 Query: 301 -------------VDEHEFFYSQE----TGGTIVSSAL-KLMDEVVKERYNPAQWNIYAA 342 V + +E GGT + + L +++ N+ Sbjct: 457 LNNVSSQQQFLGIVQNEAIQHVEEMAANMGGTEILTPLQQMVYNASYGTSKNTTLNV--F 514 Query: 343 QASDGDNWADDSPLCHEILAKKLLPVVRYYSYIEITRRAHQTLWREYEHLQSTFDNFAMQ 402 +DG+ D + + + + + L + + + Q Sbjct: 515 MLTDGE-TDADQIIQLVQSNNQAQTRIYTLGIGQGCSQY---LIQRVAEVGNG----KSQ 566 Query: 403 HIRDQDDIYPVFRE 416 + D++DI + Sbjct: 567 IVSDKEDINEKIHQ 580 >UniRef50_Q55G98 von Willebrand factor A domain-containing protein DDB_G0267758 n=1 Tax=Dictyostelium discoideum RepID=Y7758_DICDI Length = 878 Score = 68.6 bits (166), Expect = 4e-10, Method: Composition-based stats. Identities = 26/189 (13%), Positives = 54/189 (28%), Gaps = 22/189 (11%) Query: 236 YKNYEKRPDPSSQAVMFCLMDVSGSMDQSTKDMAKRFYILLYLFLSRTYKNVEVVYIR-- 293 +K+ + + ++ L+D SGSM KR ++ L+ V VV Sbjct: 303 FKDIKIEDM-NQKSEFIFLIDCSGSMVGEPMRKVKRAMEIIIRSLNENQHRVNVVCFGSS 361 Query: 294 ----------HHTQAKEVDEHEFFYSQET-GGTIVSSALKLMDEVVKERYNPAQWNIYAA 342 ++ + E + GGT + + +K + ++ Sbjct: 362 FKKVFKVSRDYNDETLECLSKYIQSIEANLGGTELLTPIKNIL----SSPPNPEYPRQLF 417 Query: 343 QASDGDNWADDSPLCHEILAKKLLPVVRYYSYIEITRRAHQTLWREYEHLQSTFDNFAMQ 402 +DG+ K R ++Y I L + F Sbjct: 418 ILTDGE---APHRDKIIHYLSKESNTTRIFTYG-IGDSVDIDLIIGLSNACKGHYEFITD 473 Query: 403 HIRDQDDIY 411 + + + Sbjct: 474 NDNFEKQVM 482 >UniRef50_D1CCX6 von Willebrand factor type A n=1 Tax=Thermobaculum terrenum ATCC BAA-798 RepID=D1CCX6_THET1 Length = 918 Score = 68.6 bits (166), Expect = 4e-10, Method: Composition-based stats. Identities = 34/208 (16%), Positives = 55/208 (26%), Gaps = 42/208 (20%) Query: 230 DTFDLRYKNYEKRPDPSSQAVMFCLMDVSGSMDQ----------------STKDMAKRFY 273 D + +N ++ P Q + +D SGSM D+AK Sbjct: 390 LPVDSQIRNPDEEP----QVAVVMAIDKSGSMAACHCEGSKLLEQYPGGIPKVDIAKESA 445 Query: 274 ILLYLFLSRTYKNVEVVYIR--------HHTQAKEVDEHEFFYSQETGGTIVSSALKLMD 325 IL L V + K + Q +GGT + L Sbjct: 446 ILSSETLGPNDIFGVVAFDTAPRWVVRPEPVTDKSSIAEKVAGIQGSGGTNIYGGLAEA- 504 Query: 326 EVVKERYNPAQWNIYAAQASDG-DNWADDSPLCHEILAKKLLPVVRYYSYIEITRRAHQT 384 + N + +DG N + E+++K + + Sbjct: 505 --IDSLIKVKAKNKHVILLTDGWSNVGNYD----ELISKARRHGITISTVSAAGGS--AQ 556 Query: 385 LWREYEHLQSTFDNFAMQHIRDQDDIYP 412 L R + RD DI Sbjct: 557 LLRSIAEKGGGTFY----NTRDSADIPQ 580 Score = 51.3 bits (121), Expect = 8e-05, Method: Composition-based stats. Identities = 26/136 (19%), Positives = 46/136 (33%), Gaps = 17/136 (12%) Query: 243 PDPSSQAVMFCLMDVSGSMDQSTKDMAKRFYILLYLFLSRTYKNVEVVYIRHHTQAKEVD 302 PS + + L+D S S+ AK F Y R VV+ + Sbjct: 58 RLPSHKLGVVFLVDASDSVGPEGIAQAKEFVRKAYQLAGRDVDLGVVVFGKEPLIDSLTS 117 Query: 303 EH----EFFYSQETGGTIVSSALKLMDEVVKERYNPAQWNIYAAQASDGDNWADDSPLCH 358 +F ++ T + SA++L + PA + SDG+N D Sbjct: 118 SDGKLPDFLSRPDSTATDIPSAMRLAFSM-----FPADSSKKIVLLSDGNNNVGD----- 167 Query: 359 EILAKKLLPVVRYYSY 374 +++ + R + Sbjct: 168 ---MQEVSRLARMFGV 180 >UniRef50_A5UWS5 von Willebrand factor, type A n=2 Tax=Roseiflexus RepID=A5UWS5_ROSS1 Length = 851 Score = 68.6 bits (166), Expect = 4e-10, Method: Composition-based stats. Identities = 32/191 (16%), Positives = 62/191 (32%), Gaps = 24/191 (12%) Query: 238 NYEKRPDPS-SQAVMFCLMDVSGSMDQS----TKDMAKRFYILLYLFLSRTYKNVEVVYI 292 P P S + ++D S SM MAK I+ L + + + + Sbjct: 380 EMTPPPRPERSDTTLLLIIDQSASMGPETGISKFTMAKEAAIMATESLRQEDRIGVLAFD 439 Query: 293 RHH-----------TQAKEVDEHEFFYSQETGGTIVSSALKLMDEVVKERYNPAQWNIYA 341 + + GGT + +AL+ + + P + +A Sbjct: 440 VSTRWVVDFQPVGVGLSLADVQRRISTLPLGGGTDIYNALQEGLPALAQ--QPGRV-RHA 496 Query: 342 AQASDGDNWADDSPLCHEILAKKLLPVVRYYSYIEITRRAHQTLWREYEHLQSTFDNFAM 401 +DG ++ DD +L + + + I I A L +E + ++A Sbjct: 497 VLLTDGRSFTDDRQAYRMLLEEARSQNITLST-IAIGTDADINLLQELARWGAGRYHYA- 554 Query: 402 QHIRDQDDIYP 412 + +DI Sbjct: 555 ---AEPNDIPR 562 >UniRef50_D1CG77 von Willebrand factor type A; type II secretion system protein n=1 Tax=Thermobaculum terrenum ATCC BAA-798 RepID=D1CG77_THET1 Length = 643 Score = 68.6 bits (166), Expect = 4e-10, Method: Composition-based stats. Identities = 26/183 (14%), Positives = 48/183 (26%), Gaps = 20/183 (10%) Query: 244 DPSSQAVMFCLMDVSGSMDQSTKDMAKRFYILLYLFLSRTYKNVEVVY------IRHHTQ 297 + +D S SM+ A+ L LS K + + I Q Sbjct: 92 QNPDPIDVVLALDTSASMNDDAFTAAQDAAYGLINGLSPEDKVGLITFDKTARVIEPLAQ 151 Query: 298 AKEVDEHEFFYSQETGGTIVSSALKLMDEVVKERYNPAQWNIYAAQASDGDNWADDSPLC 357 + + GT + L L + V + Q +DG N + + Sbjct: 152 DHARVQESIQKLSRSVGTALYQGLSLAAQEVAK----GQNTKAIVLMTDGFNTSR-NTTL 206 Query: 358 HEILAKKLLPVVRYYSYIEITRRAHQTLWREYEHLQSTFDNFAMQHIRDQDDIYPVFREL 417 E +AK ++ + ++ + A R + Sbjct: 207 EEAVAKAQEVGASVFTVGFGK-KVDTQGLQKIANETGGEYFSAP--------TNAQLRRV 257 Query: 418 FHK 420 F Sbjct: 258 FAD 260 >UniRef50_D1BQE7 von Willebrand factor type A n=1 Tax=Veillonella parvula DSM 2008 RepID=D1BQE7_VEIPT Length = 671 Score = 68.6 bits (166), Expect = 4e-10, Method: Composition-based stats. Identities = 48/382 (12%), Positives = 101/382 (26%), Gaps = 40/382 (10%) Query: 68 RGGLRHRVHPGNDHFVQNDRIERPQGGGGGSG----SGQGQASQDGEGQDEFVFQISKDE 123 + + P + + D S S + S D + E Sbjct: 297 NDSMGNEAAPKDSNANDGDTHSAMNSAEDASSQQDDSSEADGSNQSNDLDATQKESCDSE 356 Query: 124 YLDLLFED------LALPN---------LKQNQQRQLTEYKTHRAGYTANGVPANISVVR 168 +++ +D L LP+ + + T + +R G + Sbjct: 357 AMNVGGDDSQRLSSLCLPDTVARIANQLFQWKLESSKTVDRQYRKGSGRRLMTKTKDTRG 416 Query: 169 SLQNSLARRTAMTAGKRRELHALEENLAIISNSEPAQLLEEERLRKEIAELRAKIERVPF 228 + + A+ +L ++ A + E+ + + K + Sbjct: 417 RMIRAYQDEHAL-----EDLALVDTLRAAAPYQRLRAATKTEQEKLSTQSQQLKHQGGKG 471 Query: 229 IDTFDLRYKNYEKRPDPSSQAVMFCLMDVSGSMDQSTKDMAKRF--YILLYLFLSRTYKN 286 + K + A ++D SGSM + A + LL Sbjct: 472 LAIVIKPQDYRRKAREKRIGAYQLFVVDASGSMAARHRMEATKAAILSLLRDSYIHRDSV 531 Query: 287 VEVVYIRHH-------TQAKEVDEHEFFYSQETGGTIVSSALKLMDEVVKERYNPAQWNI 339 + + + T++ E E G T ++ L++ + + Sbjct: 532 GLIAFRKESAEVLLPFTRSVERAERLLTSMPTGGKTPLAHGLRMAYTLCDRLLRAHRAER 591 Query: 340 Y-AAQASDGDNWADDSPL--CHEILAKKLLPVVRYYSYIE--ITRRAHQTLWREYEHLQS 394 +DG + DS ++L + + T L +E L + Sbjct: 592 IQIICITDGRATSGDSEDPVAESKQWARILGTLPVDCIVIDTETGFIKLGLAKELCKLMN 651 Query: 395 TFDNFAMQHIRDQDDIYPVFRE 416 D+ I V R Sbjct: 652 GSYYAMDTITADR--ILRVSRR 671 >UniRef50_A0DIJ2 Chromosome undetermined scaffold_52, whole genome shotgun sequence n=4 Tax=Eukaryota RepID=A0DIJ2_PARTE Length = 2542 Score = 68.6 bits (166), Expect = 4e-10, Method: Composition-based stats. Identities = 26/135 (19%), Positives = 48/135 (35%), Gaps = 11/135 (8%) Query: 223 IERVPFIDTFDLRY--KNYEKRPDPSSQAVMFCLMDVSGSMDQSTKDMAKRFYILLYLFL 280 + + D+++ K K S + ++D SGSM S + AK + + Sbjct: 2330 EQLKIKLKRQDIKFLQKRQFKEVINSPKIHYIFMIDDSGSMSGSPWNTAKNCCLNCLSTI 2389 Query: 281 SRTYKNVEVVYIRHHTQAKEVDEHEFFYSQE--------TGGTIVSSALKLMDEVVKERY 332 + N V I ++ A+ E E +G T SA + +++ + Sbjct: 2390 EKN-LNARVSVIIFNSTARIAINCEIVNLVEMEKKIQFNSGSTDFGSAFQQAYKLIVQHQ 2448 Query: 333 NPAQWNIYAAQASDG 347 N A +DG Sbjct: 2449 NDAFQKTEVLFYTDG 2463 >UniRef50_Q22UB9 von Willebrand factor type A domain containing protein n=1 Tax=Tetrahymena thermophila SB210 RepID=Q22UB9_TETTH Length = 2269 Score = 68.6 bits (166), Expect = 4e-10, Method: Composition-based stats. Identities = 29/164 (17%), Positives = 54/164 (32%), Gaps = 13/164 (7%) Query: 247 SQAVMFCLMDVSGSMDQSTKDMAKRFYILLYLFLSRTYKNVEVVYIRHHTQAK-----EV 301 + + D SGSM+ ++ + SR + I K E Sbjct: 2097 NPVHFIIVFDESGSMEGEKWITLRKELLNFIDNRSRATAQDFITLIGFAHTVKLYTKVEK 2156 Query: 302 DEHEFFYSQE----TGGTIVSSALKLMDEVV-KERYNPAQWNIYAAQASDGDNWADDSPL 356 + GGT S+ L+ ++ +E+ + N SDGD P Sbjct: 2157 LNEQIKQKVPQEFMDGGTNYSAPLQQALNILSQEQCQTFKKNNVIFFLSDGD---AKEPK 2213 Query: 357 CHEILAKKLLPVVRYYSYIEITRRAHQTLWREYEHLQSTFDNFA 400 +KL +++ ++ QTL LQ+ +++ Sbjct: 2214 TEIQQLQKLGHLIKLIQFVGYGDENFQTLKSMANDLQNVEASYS 2257 >UniRef50_Q2W311 Putative uncharacterized protein n=1 Tax=Magnetospirillum magneticum AMB-1 RepID=Q2W311_MAGSA Length = 1171 Score = 68.6 bits (166), Expect = 4e-10, Method: Composition-based stats. Identities = 34/179 (18%), Positives = 52/179 (29%), Gaps = 24/179 (13%) Query: 251 MFCLMDVSGSMD---QSTKDMAKRFYILLYLFLSRTYKNVEV----------VYIRHHTQ 297 + L+D S SM S M R LS + V Q Sbjct: 65 VVMLLDHSSSMGAAPGSPLQMMLRAAGNFLRQLSPDSRVAVVGFNQVPSVHCTLAATPAQ 124 Query: 298 AKEVDEHEFFYSQETGGTIVSSALKLMDEVVKERYNPAQWNIYAAQASDGDNWADDSPLC 357 A+ G T +++AL E++ + SDG + + Sbjct: 125 AR----SALQAISPGGATSIAAALNQAVELLAHGRP--GMDKVVVLCSDGQDDIAEIADA 178 Query: 358 HEILAKKLLPVVRYYSYIEITRRAHQTLWREYEHLQSTFDNFAMQHIRDQDDIYPVFRE 416 L K +P VR + H T Q F + RD DD++ + Sbjct: 179 LARL--KAIPSVRVLAVGFGDEVIHATFLAMVADRQD---YFHLTRARDMDDVFQRLAK 232 >UniRef50_B8GAZ1 von Willebrand factor type A n=3 Tax=Chloroflexus RepID=B8GAZ1_CHLAD Length = 958 Score = 68.6 bits (166), Expect = 4e-10, Method: Composition-based stats. Identities = 31/217 (14%), Positives = 59/217 (27%), Gaps = 42/217 (19%) Query: 236 YKNYEKRPDPSSQAVMFCLMDVSGSMD-----------------QSTKDMAKRFYILLYL 278 Y + R + ++D SGSMD + D+AK Sbjct: 398 YMDVRNRELRP-DLAIVFVIDKSGSMDACHCANPDRGGPITSSSERKIDIAKDAVAQATA 456 Query: 279 FLSRTYKNVEVVY--IRHHTQA------KEVDEHEFFYSQETGGTIVSSALKLMDEVVKE 330 LS V + T E + G T + + L +E++++ Sbjct: 457 LLSPQDTVGVVTFDGAAFPTFVATRGATVEQVMDAVSGVEPRGPTNIRAGLLRAEEMLQQ 516 Query: 331 RYNPAQWNIYAAQASDGDNWADDSPLCHEILAKKLLPVVRYYSYIEITRRAHQTLWREYE 390 A+ +DG D L ++ + + + ++ Sbjct: 517 --VDARIKHMIL-LTDGWGSGGDQLDIAARLREQ-GITLTVVAAGSGSATY----LQQLA 568 Query: 391 HLQSTFDNFAMQHIRDQDDIYPVFRELFHKQNATAKG 427 D D+ +F ++ TA G Sbjct: 569 AEGGGRYY----PAADMADVPQ----IFVQETITAIG 597 Score = 53.6 bits (127), Expect = 1e-05, Method: Composition-based stats. Identities = 23/168 (13%), Positives = 51/168 (30%), Gaps = 10/168 (5%) Query: 244 DPSSQAVMFCLMDVSGSMDQSTKDMAKRFYILLYLFLSRTYKNVEVVYIRHHTQAKEVDE 303 P + L+D S SM ST+ A+ F + + VV+ + + D Sbjct: 62 RPVDRLTTVFLLDGSDSMPASTRAQAEAFIRAALQEMPPDDQAAIVVFGGNALVERAPDS 121 Query: 304 H----EFFYSQETGGTIVSSALKLMDEVVKERYNPAQWNIYAAQASD-GDNWADDSPLCH 358 T T + +A++L + + SD G+N + Sbjct: 122 DRRLGRITSIPITNRTNIEAAIQLGMAL----FPADSQKRLVL-LSDGGENSGRAIDVAR 176 Query: 359 EILAKKLLPVVRYYSYIEITRRAHQTLWREYEHLQSTFDNFAMQHIRD 406 ++ + + + +E A ++ + + + Sbjct: 177 LAASRGIPIDIVDLALVETDAEALVASVEAPNGVRDGQEALIVATVES 224 >UniRef50_A3TQW7 Putative membrane protein n=1 Tax=Janibacter sp. HTCC2649 RepID=A3TQW7_9MICO Length = 654 Score = 68.6 bits (166), Expect = 4e-10, Method: Composition-based stats. Identities = 28/202 (13%), Positives = 51/202 (25%), Gaps = 13/202 (6%) Query: 228 FIDTFDLRYKNYEKRPDPSSQAVMFCLMDVSGSMDQSTKDMAKRFYILLYLFLSRTYKNV 287 I T D R +P +Q L+D SGSM +S + + + Sbjct: 66 MIATLDGRPSPVTSKPATRAQRTTVLLIDTSGSMGRSGMATVRTAVKDFLASAPKDVRIG 125 Query: 288 EVVYIR------HHTQAKEVDEHEFFYSQETGGTIVSSALKLMDEVVKERYNPAQWNIYA 341 V + T A+ + + G T + S + ++ + + Sbjct: 126 VVSFGNTAGPEIAPTTARAAVQAVVDDLRADGNTALFSGVTQAVRMLG-----STGDRSI 180 Query: 342 AQASDGDNWADDSPLCHEILAKKLLPVVRYYSYIEITRRAHQ-TLWREYEHLQSTFDNFA 400 SDG N D K L + T + + Sbjct: 181 VLLSDGKNTVGDRASGLAAAGKALTASQVRVEVVRFTTGENDPEALAAFAKAGGGS-VVQ 239 Query: 401 MQHIRDQDDIYPVFRELFHKQN 422 + ++ Q Sbjct: 240 ATDAEGVRTAFQTAAKVLESQV 261 >UniRef50_B2A702 von Willebrand factor type A n=1 Tax=Natranaerobius thermophilus JW/NM-WN-LF RepID=B2A702_NATTJ Length = 599 Score = 68.6 bits (166), Expect = 4e-10, Method: Composition-based stats. Identities = 23/187 (12%), Positives = 56/187 (29%), Gaps = 16/187 (8%) Query: 229 IDTFDLRYKNYEKRPDPSSQAVMFCLMDVSGSMDQSTKDMAKRFYILLYLFLSRTYKNVE 288 ++ D+R +N + + L+D SGSM K F + L K Sbjct: 406 LNPQDIRVRN----KKKQTSMNVCFLVDASGSMGGRRMQEVKFFAEHVL--LKGRDKIAI 459 Query: 289 VVYIRHH-------TQAKEVDEHEFFYSQETGGTIVSSALKLMDEVVKERYNPAQWNIYA 341 + + + T+ + + G T +S +++ + ++ + N + Sbjct: 460 LTFREDNVNVEIPFTRNWDKLRSGLNKIKAFGLTPMSKGIEMARKYLESEVGQQK-NTFL 518 Query: 342 AQASDGDNWADDSPLCHEILAKKLLPVV--RYYSYIEITRRAHQTLWREYEHLQSTFDNF 399 +DG D K + ++ I + ++ Sbjct: 519 VLITDGLPTISDGGEDPFKETLKAAQKLSQTSIKFVCIGLEPNVKFLKKLAQASQASLYI 578 Query: 400 AMQHIRD 406 + ++ Sbjct: 579 VEELQKE 585 >UniRef50_A7HHW8 Vault protein inter-alpha-trypsin domain protein n=1 Tax=Anaeromyxobacter sp. Fw109-5 RepID=A7HHW8_ANADF Length = 1362 Score = 68.6 bits (166), Expect = 4e-10, Method: Composition-based stats. Identities = 21/181 (11%), Positives = 44/181 (24%), Gaps = 23/181 (12%) Query: 251 MFCLMDVSGSMDQSTKDMAKRFYILLYLFLSRTYKNVEVVYIRHHTQAKEVDEHE----- 305 + L+D SGSM + D + + V + E Sbjct: 381 LVFLVDKSGSMMGAPFDRVRALVARALDAMGPDDTFQVVAFDGSAQAMSEAPLPATPSAI 440 Query: 306 ------FFYSQETGGTIVSSALKLMDEVVKERYNPAQWNIYAAQASDGDNWADDSPLCHE 359 + GGT + ++ ++ + +DG + Sbjct: 441 ARAKEWLASLEGGGGTEMLEGVRAALSPPED----PRRLRMVVFCTDG---FIGNEPEII 493 Query: 360 ILAKKLLPVVRYYSYIEITRRAHQTLWREYEHLQSTFDNFAMQHIRDQDDIYPVFRELFH 419 + L R + + I ++ L + + D LF Sbjct: 494 EAVEALRGRARVFGFG-IGSSVNRYLVEGVGRAGRGASEVV--SLDEPPDAAVA--RLFA 548 Query: 420 K 420 + Sbjct: 549 R 549 >UniRef50_C1YR26 Uncharacterized protein containing a von Willebrand factor type A (VWA) domain n=1 Tax=Nocardiopsis dassonvillei subsp. dassonvillei DSM 43111 RepID=C1YR26_NOCDA Length = 505 Score = 68.3 bits (165), Expect = 5e-10, Method: Composition-based stats. Identities = 31/184 (16%), Positives = 46/184 (25%), Gaps = 17/184 (9%) Query: 249 AVMFCLMDVSGSMDQSTKDMAKRFYILLYLFLSRTYKNVEVVYIRH--------HTQAKE 300 A + ++D SGSM D A R + L L+ + V + + K Sbjct: 40 ATLQVVLDRSGSMGGGRLDGAVRALLSLVERLAPSDNFGLVSFNDQARVEVPCGPLEDKA 99 Query: 301 VDEHEFFYSQETGGTIVSSALKLMDEVVKERYNPAQWNIYAAQASDGD-NWADDSPLCHE 359 +GGT +SS L + + SDG N Sbjct: 100 RVRRLISGLHASGGTDLSSGLLRGVQEARRAGADRGG--TLLLISDGHANQGVTDHDLLR 157 Query: 360 ILAKKL---LPVVRYYSYIEITRRAHQTLWREYEHLQSTFDNFAMQHIRDQDDIYPVFRE 416 +A Y + L + FA I Sbjct: 158 QVAADAYAHGVTTTSLGYGLG---YDEELLGAVADGGAGSALFAEDPDTAGGLIAREAEY 214 Query: 417 LFHK 420 L K Sbjct: 215 LLAK 218 >UniRef50_UPI00006CDA4D Glutathionylspermidine synthase family protein n=1 Tax=Tetrahymena thermophila RepID=UPI00006CDA4D Length = 1547 Score = 68.3 bits (165), Expect = 5e-10, Method: Composition-based stats. Identities = 32/162 (19%), Positives = 54/162 (33%), Gaps = 20/162 (12%) Query: 246 SSQAVMFCLMDVSGSMDQSTKDMAKRFYILLYLFLSRTYKNVEVVYIR-----------H 294 SS++ L+D SGSM D A + L L + + + Sbjct: 303 SSRSEFIFLLDRSGSMSGQPIDRACQALTLFLKSLPTDSYFNVISFGSSFKLLFPQSEKY 362 Query: 295 HTQAKEVDEHEFFYSQET-GGTIVSSALKLMDEVVKERYNPAQWNIYAAQASDGDNWADD 353 ++Q+ E + GGT + LK + V+ + +N +DG+ D Sbjct: 363 NSQSLEKAISNISKYKADLGGTEIYKPLKNVF--VQNKI--QGYNKQVFLLTDGE---VD 415 Query: 354 SPLCHEILAKKLLPVVRYYSYIEITRRAHQTLWREYEHLQST 395 SP L +K R +S + Q L + Sbjct: 416 SPEQVISLIRKNNKFSRVHSIGFGSGA-DQYLINQSAIAGKG 456 >UniRef50_Q3M1S4 von Willebrand factor, type A n=8 Tax=Cyanobacteria RepID=Q3M1S4_ANAVT Length = 464 Score = 68.3 bits (165), Expect = 5e-10, Method: Composition-based stats. Identities = 26/223 (11%), Positives = 55/223 (24%), Gaps = 46/223 (20%) Query: 217 AELRAKIERVPFIDTFDLRYKNYEKRPDPSSQAVMFCLMDVSGSM--------------- 261 + A+ E LR P + ++D SGSM Sbjct: 14 EFMPAETEGQKLFLMLKLRPTKEVAVSRPPT--TFAFVIDTSGSMYEIVTGETTPTGVTY 71 Query: 262 ------------DQSTKDMAKRFYILLYL--FLSRTYKNVEVVYIRHHTQAKEVD----- 302 +S D+ + L L + + V + +Q ++ Sbjct: 72 TQDAKEYSQVTGGKSKIDIVIESLLALVRSGRLEASDRVAIVQFDDTASQIIDLTPATQV 131 Query: 303 ---EHEFFYSQE-TGGTIVSSALKLMDEVVKERYNPAQWNIYAAQASDGDNWADDSPLCH 358 E+ + +GGT + L+ +++ +DG + +D Sbjct: 132 SQLENAIAQLRSFSGGTRMGLGLRRALDML---SGQDMAVRRTLLFTDGQTFDEDICRAL 188 Query: 359 EILAKKLLPVVRYYSYIEITRRAHQTLWREYEHLQSTFDNFAM 401 + E + L + + Sbjct: 189 ASDFATKNIPITALGVGE---DFKEDLLSHLSDSTGGTLFYVV 228 >UniRef50_A9UVU8 Predicted protein n=1 Tax=Monosiga brevicollis RepID=A9UVU8_MONBE Length = 785 Score = 68.3 bits (165), Expect = 5e-10, Method: Composition-based stats. Identities = 45/316 (14%), Positives = 84/316 (26%), Gaps = 56/316 (17%) Query: 126 DLLFEDLALPNLKQNQQRQLTEYKTHRAGYTANGVPANISVVRSL-QNSLARRTAMTAGK 184 + E + P + G +G N RS+ ++ R AMT Sbjct: 11 QISHELMRDPVAAPDGYVYDRTNILQWIGQGEDGQRNNSPFDRSITISAADLRPAMTIRS 70 Query: 185 RRELH----ALEENLAIISNSEPA---QLLEEERLRKEIAELRAKIERVPFIDTFDLRYK 237 E + LE +A + PA L E+A E I Sbjct: 71 ALEEYIAQHHLEFEVAPLVTGRPALKLPARLASELELEVALHPVPGETRKAILEL----- 125 Query: 238 NYEKRPDPSSQAVMFCLMDVSGSMDQST----------------KDMAKRFYILLYLFLS 281 +R ++ + ++DVSGSM + D+ K + L+ Sbjct: 126 -IPQRQTATTNIHLNLVLDVSGSMGAAVTARDESNTLIEYNLCVMDLVKFASQVAVKCLA 184 Query: 282 RTYKNVEVVYI-----------------RHHTQAKEVDEHEFFYSQETGGTIVSSALKLM 324 V + T +V + G T + + ++ Sbjct: 185 PGDVISIVTFSDAAKIIVEPISVPDPKMGADTTVADVL-GKIDAIYHGGSTNLWAGIETG 243 Query: 325 DEVVKERYNPAQWNIYAAQASDGDNW-----ADDSPLCHEILAKKLLPVVRYYSYIEITR 379 +++ P N+ A +DG+ ++ V+ + Sbjct: 244 LQLLASCAQPHLHNVCVA-LTDGEPNRHPEQGYETAHRRFKQMPNFSYVLHTLPFGF--G 300 Query: 380 RAHQTLWREYEHLQST 395 R L + Sbjct: 301 RIDSALLQSLARTGEG 316 >UniRef50_A0CCS0 Chromosome undetermined scaffold_168, whole genome shotgun sequence n=6 Tax=Paramecium tetraurelia RepID=A0CCS0_PARTE Length = 981 Score = 68.3 bits (165), Expect = 5e-10, Method: Composition-based stats. Identities = 24/196 (12%), Positives = 56/196 (28%), Gaps = 23/196 (11%) Query: 242 RPDPSSQAVMFCLMDVSGSMDQSTKDMAKRFYILLYLFLSRTYKNVEVVY------IRHH 295 ++ +D SGSM AK+ IL L + + + + Sbjct: 341 ENQAINRGTYLFFIDRSGSMSGGRIKKAKQSLILFLRSLPDNCRFNIISFGTMFRSLWSD 400 Query: 296 TQ--AKEVDEHEFFYSQE----TGGTIVSSALKLMDEVVKERYNPAQWNIY-AAQASDGD 348 ++ +++ + + GT + + + + +Y ++ +DG+ Sbjct: 401 SKQYSQDTLDEAIKHVNAMEANMQGTEIFKPFQDV--IYNNQYGKSKTTTLNIFLLTDGE 458 Query: 349 NWADDSPLCHEILAKKLLPVVRYYSYIEITRRAHQTLWREYEHLQSTFDNFAMQHIRDQD 408 + K V E + L + + + F + D + Sbjct: 459 -VDVFPIIDLVKRNNKAETRVYTLGIGEGCSQY---LIKNLADVGNGKFQF----VADDE 510 Query: 409 DIYPVFRELFHKQNAT 424 DI +L Sbjct: 511 DINAKVIDLLEDSMTP 526 >UniRef50_B0TSG0 Vault protein inter-alpha-trypsin domain protein n=1 Tax=Shewanella halifaxensis HAW-EB4 RepID=B0TSG0_SHEHH Length = 761 Score = 68.3 bits (165), Expect = 6e-10, Method: Composition-based stats. Identities = 27/195 (13%), Positives = 53/195 (27%), Gaps = 22/195 (11%) Query: 243 PDPSSQAVMFCLMDVSGSMDQSTKDMAKRFYILLYLFLSRTYKNVEVVYIRHHTQ----- 297 P +S + ++D SGSM + A + L+ +++ HH Sbjct: 256 PQTTSSRCIKMVVDCSGSMLGDSITQAGIALKQILKLLNEDDWFNIILFGSHHKSLFSES 315 Query: 298 ------AKEVDEHEFFYSQET-GGTIVSSALKLMDEVVKERYNPAQWNIYAAQASDGDNW 350 ++ E GGT + SAL + PA+ +DG+ W Sbjct: 316 VKANRANLDIAAKELANLNADLGGTEMLSALNAAYDSAA----PAELASNILLITDGEIW 371 Query: 351 ADDSPLCHEILAKKLLPVVRYYSYIEITRRAHQTLWREYEHLQSTFDNFAMQHIRDQDDI 410 A++ + ++ F + I Sbjct: 372 G---EEQLICKAQESNHRHFVVGVGSAVS---EAFLKQLADKTGGASEFVTPNENMSSRI 425 Query: 411 YPVFRELFHKQNATA 425 F + + + Sbjct: 426 VQHFCRIKQSKLTQS 440 >UniRef50_Q9XAH6 Putative uncharacterized protein SCO6688 n=2 Tax=Streptomyces RepID=Q9XAH6_STRCO Length = 1171 Score = 68.3 bits (165), Expect = 6e-10, Method: Composition-based stats. Identities = 35/267 (13%), Positives = 72/267 (26%), Gaps = 26/267 (9%) Query: 92 QGGGGGSGSGQGQASQDGEGQDEFVFQISKDEYLDLLFEDLA------LPNLKQNQQRQL 145 +G ++ + ++L +A + L R Sbjct: 845 GSDDDRTGGAARSFPSVRHWAEDLRTLFGAEIRQEVLERAVADGRTDVIALLDPASVRPS 904 Query: 146 TEYKTHRAGYTANGVPANISVVRSLQNSLARRTAMTAGKRRELHALEENLAIISNSEPAQ 205 E + ++ +R L L R + Sbjct: 905 VELLSAVLTLARGMPEQRVASLRPLVKRLVEELTKELATRLRPTLTGLTTPRPTRRPGGP 964 Query: 206 LLEEERLRKEIAELRAKIERVPFIDTFDLRYKNYEKRPDPSSQAVMFCLMDVSGSMDQST 265 L LR +A +R + + + ++ R + + ++DVS SM+ S Sbjct: 965 LDLPRTLRANLAHIRRREDGRVEVVPERPVFRTRTARRN---DWRLILVVDVSASMETSV 1021 Query: 266 KDMAKRFYILLYLFLSRTYKNVEVVYIRHHTQAKEVDEHEFFYS------QETGGTIVSS 319 A IL + ++ TQ ++ + GGT +++ Sbjct: 1022 VWSALTAAIL------GGAPTLSTHFLTFSTQVADLTGLVADPLSLLLEVKVGGGTHIAA 1075 Query: 320 ALKLMDEVVKERYNPAQWNIYAAQASD 346 L +V P + + SD Sbjct: 1076 GLAHARSLVT---VPDRTLVVV--VSD 1097 >UniRef50_Q8H923 Putative uncharacterized protein OSJNBa0071K18.17 n=5 Tax=Poaceae RepID=Q8H923_ORYSJ Length = 606 Score = 68.3 bits (165), Expect = 6e-10, Method: Composition-based stats. Identities = 20/173 (11%), Positives = 46/173 (26%), Gaps = 27/173 (15%) Query: 251 MFCLMDVSGSMDQSTKDMAKRFYILLYLFLSRTYKNVEVVYIRHHTQ----------AKE 300 + ++DVSGSM + K+ + L + V + ++ K Sbjct: 145 LVTVLDVSGSMAGRKLALVKKAMGFVIDNLGPADRLCVVSFSTEASRRTRLLRMSEVGKA 204 Query: 301 VDEHEFFYSQETGGTIVSSALKLMDEVVKERYNPAQWNIYAAQASDG-DNWADDSPLCHE 359 + + T + L++ V+ +R + + SDG D++ Sbjct: 205 TAKRAVESLVDDSATNIGDGLRVAGRVLGDRRHKNAVSSVIL-LSDGKDSYVVPRRGNGM 263 Query: 360 ILA------------KKLLPVVRYYSYIEITRRAHQTLWREYEHLQSTFDNFA 400 + L + + + +F Sbjct: 264 SYMDLVPPSFASSGGRGQLAPIHTFGFGA---DHDAAAMNTIAESTGGTFSFV 313 >UniRef50_B9R4P7 von Willebrand factor type A domain protein n=1 Tax=Labrenzia alexandrii DFL-11 RepID=B9R4P7_9RHOB Length = 624 Score = 67.9 bits (164), Expect = 6e-10, Method: Composition-based stats. Identities = 42/292 (14%), Positives = 85/292 (29%), Gaps = 26/292 (8%) Query: 90 RPQGGGGGSGSGQGQASQDGEGQDEFVFQISKDEYLDLLFEDLALPNLKQNQQRQLTEYK 149 +PQ + + + E L L E L + + Sbjct: 281 QPQPAENDQSVSPDAGPKQDPESRDQETSQNAKEDLSALTE-----LLAAVEAGTIDGLP 335 Query: 150 THRAGYTANGVPANISVVRSLQNSLAR-RTAMTAGKRRELHALEENLAIISNSEPAQLLE 208 A T + A +++ R R T+ A LA + + P QL+ Sbjct: 336 EFLADTTRSSPRARSGKSGAVRKDARRGRPVSTSRMPPRPDARPNILATLRAAAPWQLIR 395 Query: 209 E---ERLRKEIAEL---RAKIERVPFIDTFDLRYKNYEKRPDPSSQAVMFCLMDVSGSMD 262 + L ++ + + I D RY+ + + ++D SGS Sbjct: 396 NRNRDLLAAKLERAIAAPPRRKPRTLITRDDYRYQRL----RHETPSTAIFVVDASGSTA 451 Query: 263 QSTKDMAKRFYILLYLF-LSRTYKNVEVVY-------IRHHTQAKEVDEHEFFYSQETGG 314 K L R + + + + T++ + + + G Sbjct: 452 LERLGETKGAIEQLLSRCYVRRDEVAMIAFRGTQAETLLSPTRSLVMAKRKLAGLPGGGP 511 Query: 315 TIVSSALKLMDEVVKERYNPAQWNIYAAQASDGD-NWADDSPLCHEILAKKL 365 T +++ L+ E+ + +DG N A D A+++ Sbjct: 512 TPLAAGLERGLELALSVRRQGSTPVLVFM-TDGRGNIALDGTPDRTRAAEQV 562 >UniRef50_Q54DU5 von Willebrand factor A domain-containing protein DDB_G0292028 n=1 Tax=Dictyostelium discoideum RepID=Y2028_DICDI Length = 932 Score = 67.9 bits (164), Expect = 7e-10, Method: Composition-based stats. Identities = 25/193 (12%), Positives = 58/193 (30%), Gaps = 24/193 (12%) Query: 247 SQAVMFCLMDVSGSMDQSTKDMAKRFYILLYLFLSRTYKNVEVVYIR-----------HH 295 ++ ++D SGSM + +K + L+ K V + ++ Sbjct: 339 QKSEFIFVLDCSGSMSGKPIEKSKMALEICMRSLNENSKFNIVCFGSNFNKLFETSKHYN 398 Query: 296 TQAKEVDEHEFFYSQET-GGTIVSSALKLMDEVVKERYNPAQWNIYAAQASDGDNWADDS 354 + + GGT + L+ + +++ + +P ++ +DG+ + Sbjct: 399 DETLQKASEYINRIDANLGGTEL---LEPIVDILSKESDP-EFPRQVFILTDGE---ISN 451 Query: 355 PLCHEILAKKLLPVVRYYSYIEITRRAHQTLWREYEHLQSTFDNFAMQHIRDQDDIYPVF 414 K R ++Y I + L + + I D D+ Sbjct: 452 RDKLIDYVGKEANTTRIFTYG-IGSYVDKELIVGVSKACKGYY----EMIVDNSDMEEKV 506 Query: 415 RELFHKQNATAKG 427 +L Sbjct: 507 MKLISIAMQPTLS 519 >UniRef50_UPI0000D560E4 PREDICTED: similar to inter-alpha (globulin) inhibitor H4 (plasma Kallikrein-sensitive glycoprotein) n=5 Tax=Tribolium castaneum RepID=UPI0000D560E4 Length = 842 Score = 67.9 bits (164), Expect = 7e-10, Method: Composition-based stats. Identities = 43/381 (11%), Positives = 103/381 (27%), Gaps = 77/381 (20%) Query: 110 EGQDEFVFQISKDEYLDLLFEDLAL--------PNLKQNQQRQLTEYKT---HRAGYTAN 158 E Q + VF ++ +E L E + P N + + E + ++ Sbjct: 157 EPQKKAVFTLTYEELLQRQNEQYEVVINIHPGQPVKDLNVEVHIDESRPLKFVKSPPLRT 216 Query: 159 GVP-ANISVVRSLQNSLARRTAMTAGKRRELHALEENLAIISNSEPAQLLEEERLRKEIA 217 G + + + + + +A + + + + + + Sbjct: 217 GNEISKNDDKTASLAEIKQNNSTSATVKFNPNIERQKQLATGLGTKEENGLAGQFVVQYD 276 Query: 218 ELRAKIERVPFIDT-FDLRYKNYEKRPDPSSQAVMFCLMDVSGSMDQSTKDMAKRFYILL 276 R + + + + + Q + ++D SGSMD + K + Sbjct: 277 VERDPKGGEVLLKDGYFVHFFAPSEVEALPKQ--VIFVLDTSGSMDGNRIKQLKEAMNSI 334 Query: 277 YLFLSRTYKNVEVVYIRHHTQAKEVDEHEFFY---------------------------- 308 L + V + + VD+ + Y Sbjct: 335 LSELKKEDVFNIVEF-SSIVKVWNVDKVQVDYEVGEDPWPLYDSPEAPQKNKTNQVLPPA 393 Query: 309 -----------------SQETGGTIVSSALKLMDEVVKER--YNPAQWNIYAAQASDGDN 349 GGT + SAL++ ++VK+ +DG+ Sbjct: 394 YKATDENKEKAKKVVEKLNAYGGTDIKSALEVGLKLVKKNKENKEDAHQPIIVFLTDGEP 453 Query: 350 W-ADDSPLCHEILAKKL-----LPVVRYYSYIEITRRAHQTLWREYEHLQSTFDNFAMQH 403 + + ++ + S+ + + ++ F Sbjct: 454 TMGETNTEKITSAISEMNSGETRAPIFSLSFGDGA---DREFLQKISLKNLGFARHIY-- 508 Query: 404 IRDQDDIYPVFRELFHKQNAT 424 + D +E F+KQ ++ Sbjct: 509 --EAADASLQLQE-FYKQISS 526 >UniRef50_D1YYY2 Putative uncharacterized protein n=1 Tax=Methanocella paludicola SANAE RepID=D1YYY2_METPS Length = 716 Score = 67.9 bits (164), Expect = 7e-10, Method: Composition-based stats. Identities = 29/197 (14%), Positives = 53/197 (26%), Gaps = 24/197 (12%) Query: 239 YEKRPDPSSQAVMFCLMDVSGSMDQSTKDMAKRFYILLYLFLSRTY-----------KNV 287 + L+D SGSM K+ A+ L L + Sbjct: 157 PASKDVKKISGEYVILIDHSGSMAGPKKEAAEWAVGKFLLGLGPDDWFTLGAFSNNTRWY 216 Query: 288 EVVYIRHHTQ----AKEVDEHEFFYSQETGGTIVSSALKLMDEVVKERYNPAQWNIYAAQ 343 + A E + +F E GGT + AL+ ++ + + + Sbjct: 217 SRLLAGATGDTVKNAVEFMKSKF----EGGGTEMGVALEQALDI---KRLKGDVSRHVLI 269 Query: 344 ASDGDNWADDSPLCHEILAKKLLPVVRYYSYIEITRRAHQTLWREYEHLQSTFDNFAMQH 403 +D + D + + + P R S + I + L F Sbjct: 270 ITDAE-VTDGGRILRLVDRESRRPDRRSISLLCIDAAPNSYLAAAIAERGGGIVKFLTSD 328 Query: 404 IRDQDDIYPVFRELFHK 420 + DI + Sbjct: 329 PSE-GDISSALDAILDD 344 >UniRef50_C9RU69 von Willebrand factor type A n=2 Tax=Geobacillus RepID=C9RU69_GEOSY Length = 1077 Score = 67.9 bits (164), Expect = 7e-10, Method: Composition-based stats. Identities = 28/174 (16%), Positives = 39/174 (22%), Gaps = 31/174 (17%) Query: 199 SNSEPAQLLEEER-------LRKEIAELRAKIERVPFIDTFDLRYKNYEKRPDPSSQAVM 251 Q + + +D + P + Sbjct: 143 PYQYTRQTGVSTAKLDFSLSFSQPEYAKPPNGDAQGRLDVTLVPQGAVSGIIRPPID--V 200 Query: 252 FCLMDVSGSMDQSTKDMAKRFYILLYLFLSRTYK-NVEVVYIRHHTQAKEVDEHEF---- 306 +MDVSGSM AK + Y N I +E F Sbjct: 201 VFVMDVSGSMTAMKLQSAKSALQAAVNYFKSNYNQNDRFALIPFSDGVREASVVPFGKYS 260 Query: 307 -------------FYSQETGGTIVSSALKLMDEVVKERYNPAQWNIYAAQASDG 347 GGT S+AL L K + Y +DG Sbjct: 261 NVASQLDAILNTGNSLTAGGGTNYSAALSLA----KSYFTDPTRKKYIIFLTDG 310 >UniRef50_A7RTF3 Predicted protein (Fragment) n=1 Tax=Nematostella vectensis RepID=A7RTF3_NEMVE Length = 756 Score = 67.9 bits (164), Expect = 7e-10, Method: Composition-based stats. Identities = 37/306 (12%), Positives = 75/306 (24%), Gaps = 32/306 (10%) Query: 132 LALPNLK---QNQQRQLTEYKTHRAGYTANGVPANISVVRSLQNSLARRTAMTAGKRREL 188 +LP + + L + + T+ N+ + + R A + Sbjct: 162 ASLPRVDSAYEFDFELLVQSASEIQEITSPHSKLNVVISSEDKCQATVRLAE---PFKFD 218 Query: 189 HALEENLAIISNSEPAQLLEEERLRKEIAELRAKIERVPFIDTFDLRYKNYEKRPDPSSQ 248 ++ + P E I + + V D + + Sbjct: 219 VDVKVMILNRDPFLPQATFENGVTGSNITQDFLEKPLVTLNFMPDF------GKQEALET 272 Query: 249 AVMFCLMDVSGSMDQSTKDMAKRFYILLYLFLSR--TYKNV------EVVYIRHHTQAKE 300 ++D SGSM A+ L L + V E ++ + Sbjct: 273 GEFIFVIDRSGSMSGDRIKNARETLFLFLKSLPEHCHFNVVGFGSSYEKLFSSSTKYSDS 332 Query: 301 VDEHEFFY---SQET-GGTIVSSALKLMDEVVKERYNPAQWNIYAAQASDGDNWADDSPL 356 + + GGT + LK + +DG+ + Sbjct: 333 SVNKACNHAKNLEANLGGTEILEPLKYVFSQP----VIKGSPRQVFLMTDGE-VG--NTQ 385 Query: 357 CHEILAKKLLPVVRYYSYIEITRRAHQTLWREYEHLQSTFDNFAMQHIRDQDDIYPVFRE 416 L KK R +++ + L + F + Q + R Sbjct: 386 QVITLVKKNSTHARCFTFGIGQGAS-TALIKGVARAGQGTAEFITSSHQMQAKVVKTLRN 444 Query: 417 LFHKQN 422 Sbjct: 445 ALQPSM 450 >UniRef50_B9XNU9 von Willebrand factor type A n=1 Tax=bacterium Ellin514 RepID=B9XNU9_9BACT Length = 229 Score = 67.9 bits (164), Expect = 7e-10, Method: Composition-based stats. Identities = 30/173 (17%), Positives = 58/173 (33%), Gaps = 17/173 (9%) Query: 234 LRYKNYEKRPDPSSQAVMFCLMDVSGSMDQSTKDMAKRFYILLYLFLSRTY---KNVEVV 290 L + + E +P + ++DVS SM + D L L+R+ K VE Sbjct: 5 LPFIDVEFVDNPEPRCPCVLVLDVSSSMRGAAIDFLNLGVDLFAHDLTRSRLACKRVETA 64 Query: 291 YIRHHTQAK----EVDEHEFF--YSQETGGTIVSSALKLMDEVVKER---YNPAQWN--- 338 I V F + G T + A+ E++++R Y A + Sbjct: 65 IITFGDGVHIVQDFVSPSAFVPPRFEAGGKTPMGEAVVQACELLEKRKRKYRAAGVSYFR 124 Query: 339 IYAAQASDGD--NWADDSPLCHEILAKKLLPVVRYYSYIEITRRAHQTLWREY 389 + +DG+ ++ + + + + + A+Q E Sbjct: 125 PWIFLITDGEPTDYETANWRQAVEIVRAGEVDKKLMFFGVAVSDANQGKLNEL 177 >UniRef50_B6B3S7 Magnesium chelatase ATPase subunit D n=1 Tax=Rhodobacterales bacterium HTCC2083 RepID=B6B3S7_9RHOB Length = 547 Score = 67.9 bits (164), Expect = 8e-10, Method: Composition-based stats. Identities = 37/217 (17%), Positives = 66/217 (30%), Gaps = 24/217 (11%) Query: 153 AGYTANGVPANISVVRSLQNSLARRTAMTAGKRRELHALEENLAIISNSEPAQLLEEERL 212 + G N + + N R AG + + + +A + + P Q + E Sbjct: 282 SDTKRKGTTGNGAGAKQNGNRRGRPLPARAGSKANTARV-DLIATLRAAIPYQTIRREA- 339 Query: 213 RKEIAELRAKIERVPFIDTFDLRYKNYEKRPDPSSQAVMFCLMDVSGSMDQSTKDMAKRF 272 P I DLR K Y+ S ++ +D SGS + AK Sbjct: 340 --------QPQRTGPIIHPGDLRRKRYQTL----SDRLLIFTVDASGSAAMARLAEAKGA 387 Query: 273 YILLYLFLS-RTYKNVEVVY-------IRHHTQAKEVDEHEFFYSQETGGTIVSSALKLM 324 +L R + + + T++ + GGT ++S L Sbjct: 388 VEMLLSEAYARRDHVALISFRGLDAEVLLPPTRSLVQTKRRLAALPGGGGTPLASGLTAA 447 Query: 325 DEVVKERYNPAQWNIYAAQASDG-DNWADDSPLCHEI 360 + + + + +DG N A D Sbjct: 448 LSLAETASHKGM-SATIVLLTDGRANIALDGQANRTQ 483 >UniRef50_A0BS51 Chromosome undetermined scaffold_124, whole genome shotgun sequence n=1 Tax=Paramecium tetraurelia RepID=A0BS51_PARTE Length = 947 Score = 67.9 bits (164), Expect = 8e-10, Method: Composition-based stats. Identities = 27/225 (12%), Positives = 63/225 (28%), Gaps = 24/225 (10%) Query: 178 TAMTAGKRRELHALEENLAIISNSEPA-----QLLEEERLRKEIAELRAKIERVPFIDTF 232 K+R L+ + + + A + LE + + + + Sbjct: 671 IPPGILKQRILYEKGLFIKRYDSIKKAAFIFTECLETSKFYDPEIRINCLKQLKEIFQSQ 730 Query: 233 DLRYKNYEKRPDPSSQAV-----MFCLMDVSGSMDQSTKDMAK-RFYILLYLFLSRTYKN 286 +L YK + + + ++D SGSM+ K++A + +L + Sbjct: 731 NLLYKVPKIEQLLELNEIKKNNDIVFVIDHSGSMENIKKELAINGILKIFDNYLQDQDRI 790 Query: 287 V--------EVVYIRHH-TQAKEVDEHEFFY---SQETGGTIVSSALKLMDEVVKERYNP 334 EV++ ++ + G T + SA+ + E+ Sbjct: 791 SYMRFNQNIEVIFDLTSKSENTAYLRSAIERSKNIRAEGMTAMLSAVLHAYSI-HEKAVK 849 Query: 335 AQWNIYAAQASDGDNWADDSPLCHEILAKKLLPVVRYYSYIEITR 379 + DG++ + P + Sbjct: 850 KDNQQWIVVLCDGEDNLSNITYERMKKFTSKRPQISLIVIGIGLS 894 >UniRef50_B4S8S0 Magnesium chelatase ATPase subunit D n=3 Tax=Chlorobiaceae RepID=B4S8S0_PROA2 Length = 619 Score = 67.5 bits (163), Expect = 8e-10, Method: Composition-based stats. Identities = 39/286 (13%), Positives = 92/286 (32%), Gaps = 36/286 (12%) Query: 108 DGEGQDEFVFQISKDEYLDLLFEDLALPNLKQNQQRQLTEYKTHRAGYTANGVPANISVV 167 + + E + + D +L+ + + ++ ++ + G N Sbjct: 308 NSDPDAEEENEETPDMIEELMMDAIETEL--PENLMNISLASKKKSKSGSRGEALNNRRG 365 Query: 168 RSLQNSLARRTAMTAGKRRELHALEENLAIISNSEPAQLLE-EERLRKEIAELRAKIERV 226 R +++ + ++ + ++ E +L + + L Sbjct: 366 RFVRS------QPGEIRGGKVALIPTLISAAPWQESRRLERLRKTGKVSTTGLI------ 413 Query: 227 PFIDTFDLRYKNYEKRPDPSSQAVMFCLMDVSGSMDQSTKDMAKRFYILLYLFLSRT--- 283 I+ D++ K + + S + ++D SGSM + AK L Sbjct: 414 --INKEDVKVKKFRDK----SGTLFIFIVDASGSMALNRMRQAKGAVSHLLQNAYVHRDQ 467 Query: 284 -----YKNVEVVYIRHHTQAKEVDEHEFFYSQETGGTIVSSALKLMDEVVKERYNPAQWN 338 ++ E + +Q+ + + E GGT ++SA+ L E K+ Sbjct: 468 VALISFRGKEAQLLLPPSQSVDRAKRELDVLPTGGGTPLASAIYLAWETAKQARTKGVSQ 527 Query: 339 IYAAQASDGD-NWAD------DSPLCHEILAKKLLPVVRYYSYIEI 377 I +DG N ++P + +K + + Y + Sbjct: 528 IMFVLITDGRGNIGLQSMMDKNAPKAPKEEIEKEVEALAASVYADG 573 >UniRef50_Q5RHF3 Novel protein similar to vertebrate inter-alpha (Globulin) inhibitor H5 (ITIH5) (Fragment) n=2 Tax=Danio rerio RepID=Q5RHF3_DANRE Length = 906 Score = 67.5 bits (163), Expect = 1e-09, Method: Composition-based stats. Identities = 23/204 (11%), Positives = 55/204 (26%), Gaps = 25/204 (12%) Query: 239 YEKRPDPSSQAVMFCLMDVSGSMDQSTKDMAKRFYILLYLFLSRTYKNVEVVY-----IR 293 + R P + ++D S SM + K+ + L V + + Sbjct: 242 FAPRDLPVVPKNVVFVIDTSASMLGTKMKQTKQALFTIINELRPNDNFNFVTFSNRIRVW 301 Query: 294 HHTQAKEVD-------EHEFFYSQETGGTIVSSALKLMDEVVKERYNPAQWN-----IYA 341 + V + + TGGT ++ ++ ++ + + + Sbjct: 302 QPGKLVPVTPISIRDAKKFIYMISVTGGTDINGGIQTGSALLSDYLSSKDESHHHSVSLI 361 Query: 342 AQASDGDNWADD--SPLCHEILAKKLLPVVRYYSYIEITRRAHQTLWREYEHLQSTFDNF 399 +DG SP + ++ + L Sbjct: 362 IFLTDGRPTVGVLQSPTIISNTKTAVQEKFCLFTIG-MGDDVDYRLLERMSLDNCGT--- 417 Query: 400 AMQHIRDQDDIYPVFRELFHKQNA 423 M+ I + D + + F+ + Sbjct: 418 -MRRIPEDADASLMLKG-FYDEIG 439 >UniRef50_B1I3V2 Magnesium chelatase n=4 Tax=cellular organisms RepID=B1I3V2_DESAP Length = 670 Score = 67.5 bits (163), Expect = 1e-09, Method: Composition-based stats. Identities = 31/194 (15%), Positives = 61/194 (31%), Gaps = 19/194 (9%) Query: 164 ISVVRSLQNSLARRTAMTAGKRRELHALEENLAIISN-SEPAQLLEEERLRKEIAELRAK 222 R L+ RR+ + + + + A L +K+ A Sbjct: 406 YERDRVLRKGSGRRSRTRTPTKAGRYVRATLRRERDDLAFDATLRAAAPFQKQRARD--- 462 Query: 223 IERVPFIDTFDLRYKNYEKRPDPSSQAVMFCLMDVSGSMDQSTKDMAKR--FYILLYLFL 280 +++ D+R EK + + ++D SGSM + +A + LL Sbjct: 463 -GVAVAVESQDIR----EKVREKRIGNFLVFVVDASGSMGAQQRMVAAKGAVLSLLLDAY 517 Query: 281 SRTYKNVEVVYIRH-------HTQAKEVDEHEFFYSQETGGTIVSSALKLMDEVVKER-Y 332 + + V + T + E+ E G T +++ L EV + + Sbjct: 518 QKRDRVGMVAFKGEHAEVLLPPTNSVELAERRLAELPTGGRTPLAAGLLKAYEVARAHLF 577 Query: 333 NPAQWNIYAAQASD 346 + SD Sbjct: 578 KDPNLSPLLIVISD 591 >UniRef50_C6Z299 Tellurium resistance protein n=1 Tax=Bacteroides sp. 4_3_47FAA RepID=C6Z299_9BACE Length = 348 Score = 67.5 bits (163), Expect = 1e-09, Method: Composition-based stats. Identities = 32/197 (16%), Positives = 61/197 (30%), Gaps = 29/197 (14%) Query: 247 SQAVMFCLMDVSGSMDQSTKDMAKRFYILLYLFLSRTYKNVEVVY---IRHHTQAK-EVD 302 + ++ L+DVS SM + + ++ L +E Y I +AK Sbjct: 2 RRLPVYFLVDVSESMVGAPIQQVQDGMRMIVQELRTDPYALETAYISVIAFAGKAKCVSP 61 Query: 303 EHEFFYSQE-----TGGTIVSSALKLMDEVVKERYN------PAQWNIYAAQASDGDNWA 351 E + GGT + +AL+ + + + + W +DG+ Sbjct: 62 LTELYKFYPPTFPIGGGTSLGNALEFLMDDMDKTLVRTTTEQKGDWKPIVFLFTDGN--P 119 Query: 352 DDSPLCHEILAKKL-LPVVRYYSYIEITRRAHQTLWREYEHLQSTFDNFAMQHIRDQDDI 410 D+P + I I + L + + + D+I Sbjct: 120 TDNPSNAFTRWNNKYRGKANIVA-ISIGDNVNTQLLGQISDN--------VLRLNKTDEI 170 Query: 411 YPVFRELFHKQNATAKG 427 F+ F A+ K Sbjct: 171 --SFKSFFKWVTASIKA 185 >UniRef50_C0M4X9 Inter-alpha-trypsin inhibitor heavy chain H4 (Fragment) n=1 Tax=Nilaparvata lugens RepID=C0M4X9_NILLU Length = 315 Score = 67.5 bits (163), Expect = 1e-09, Method: Composition-based stats. Identities = 29/240 (12%), Positives = 62/240 (25%), Gaps = 58/240 (24%) Query: 239 YEKRPDPSSQAVMFCLMDVSGSMDQSTKDMAKRFYILLYLFLSRTYKNVEVVY------- 291 + P + + ++D+SGSM K + + L+ V++ Sbjct: 49 FAPAELPPLRKQVVFVLDISGSMFGEKIKQLKDAMLKILSDLNPQDHFSIVLFSDNAYVW 108 Query: 292 ---------------------------------IRHHTQAKEVDEHEFFYS-QETGGTIV 317 I T EF + T T + Sbjct: 109 SKAKTAVMKKILDEGFYNLDNETLAILDDHRNEILQATPDNVKTAKEFVELIKPTTSTNI 168 Query: 318 SSALKLMDEVVKE-----RYNPAQWNIYAAQASDGD-NWADDSPLCH----EILAKKLLP 367 L+ ++VKE +DG+ N P+ L ++L Sbjct: 169 IDGLRKGLKLVKEGKETLDTTKEPSQPIMFFLTDGEPNVDLTDPVEIVNETSSLNEQLKT 228 Query: 368 VVRYYSYIEITRRAHQTLWREYEHLQSTFDNFAMQHIRDQDDIYPVFRELFHKQNATAKG 427 + ++ + T ++ F +I + D + + ++ Sbjct: 229 PIYSLAFGQGA---DITFLKKLSKANHGFAR----NIYEGSDATLQLNNFYKEISSPLLA 281 >UniRef50_C3ZG18 Putative uncharacterized protein n=1 Tax=Branchiostoma floridae RepID=C3ZG18_BRAFL Length = 806 Score = 67.5 bits (163), Expect = 1e-09, Method: Composition-based stats. Identities = 31/256 (12%), Positives = 71/256 (27%), Gaps = 29/256 (11%) Query: 160 VPANISVVRSLQNSLARRT---AMTAGKRRELHALEENLAIISNSEPAQLLEEERLRKEI 216 P +I + S ++S+ + + + + + + + L + Sbjct: 188 SPNSIDKIESPKSSIDVTYGGTSAQVRLKDDHKLDSDVELYVHYKDKHRPFAVTELGQGT 247 Query: 217 AELRAKIERVPFIDTFDLRYKNYEKRPDPSSQAVMFCLMDVSGSMDQSTKDMAKRFYILL 276 A + + + ++ ++D SGSM + A+ +L Sbjct: 248 DGFMADHTVMLTFVP------DLSREDLVANCGEFIFILDRSGSMSGNKIKNARETLLLF 301 Query: 277 YLFLSRTYKNVEVVYIR-----------HHTQAKEVDEHEFFYSQET-GGTIVSSALKLM 324 L V + + ++ + + GGT + L+ + Sbjct: 302 LKSLPIGCYFNIVGFGSTHESLFKGSEKYDNKSLKTACKALGKMEADLGGTEILQPLQYV 361 Query: 325 DEVVKERYNPAQWNIYAAQASDGDNWADDSPLCHEILAKKLLPVVRYYSYIEITRRAHQT 384 + A +DG+ W D+ C +AK R +S + Sbjct: 362 YKQP----PIAGHPRQLFLLTDGEVW--DTQACVREVAKH-ADSARCFSVGIGEGAS-TA 413 Query: 385 LWREYEHLQSTFDNFA 400 L + F Sbjct: 414 LVKGVARAGRGKAEFV 429 >UniRef50_D0LP28 von Willebrand factor type A n=1 Tax=Haliangium ochraceum DSM 14365 RepID=D0LP28_HALO1 Length = 523 Score = 67.5 bits (163), Expect = 1e-09, Method: Composition-based stats. Identities = 36/262 (13%), Positives = 63/262 (24%), Gaps = 32/262 (12%) Query: 175 ARRTAMTAGKRRELHALEENLAIISN-------SEPAQLLEEERLRKEIAELRAKIERVP 227 A E +E + L + + +V Sbjct: 60 RDLIAEGRVPPAEAFLVEAMFSEHDLPVAGDACDSMLCLRSSLAVAPALDGTPTGWLQVG 119 Query: 228 FIDTFDLRYKNYEKRPDPSSQAVMFCLMDVSGSM-------DQSTKDMAKRFYILLYLFL 280 T D PS + +DVSGSM S + + L L Sbjct: 120 MSSTID-----PATFERPS--LTIVATVDVSGSMGWGYADDQVSAGSLTRNLLGALVDQL 172 Query: 281 SRTYKNVEVVYIRHHTQA--------KEVDEHEFFYSQETGGTIVSSALKLMDEVVKERY 332 + V Y A K+ E G T + + L+ + E Sbjct: 173 GPEDRIAIVTYGSRVDTALTLRSAGQKDEIHTAIDKLSEAGSTNMEAGLQRAYAIASEAA 232 Query: 333 NPAQWNIY-AAQASDGD-NWADDSPLCHEILAKKLLPVVRYYSYIEITRRAHQTLWREYE 390 + + +D N E +A + + + Q L Sbjct: 233 ADGETDSTRIMLFTDVQPNVGATGASQFEAMASEGADSGVGLTVFGLGLGLGQELMTAMS 292 Query: 391 HLQSTFDNFAMQHIRDQDDIYP 412 HL+ F++ ++ Sbjct: 293 HLRGGN-AFSLTRHESVGELIE 313 >UniRef50_A0CKU6 Chromosome undetermined scaffold_20, whole genome shotgun sequence n=1 Tax=Paramecium tetraurelia RepID=A0CKU6_PARTE Length = 811 Score = 67.1 bits (162), Expect = 1e-09, Method: Composition-based stats. Identities = 30/201 (14%), Positives = 60/201 (29%), Gaps = 17/201 (8%) Query: 228 FIDTFDLRYKNYEKRPDPSSQAVMFCLMDVSGSMDQSTKDMAKRFYILLYLFLSRTYKNV 287 L Y +Q + ++D SGSM + +AK Y + K Sbjct: 1 MFQEVQLNYFKVPSVKFEETQ--LLAILDCSGSMYNYWQYVAK-----YYNEIRHKVKLH 53 Query: 288 EVVYIRHHTQAKEVDEHEF---FYSQETGGTIVSSALKLMDEVVKERYNPAQWNIYAAQA 344 I T+ K + + E + GGT ++ A + +++++ + N+ Sbjct: 54 --CAITFDTRVKTIPQAELGSNINTYGGGGTNITIAFQELNKLL--YSLKQKKNVTVLFV 109 Query: 345 SDGDNWADDSPLCHEILAKKLLPVVRYYSYIEITRRAHQTLWREYEHLQSTFDNFAMQHI 404 SDG D + + + + E + + + Sbjct: 110 SDGQ---GDFSKEFIDKTRPAIENLNFICIGVGKGFPTFISMDLREMYHNGDKSIPPLFL 166 Query: 405 RDQDDIYPVFRELFHKQNATA 425 D D + F + A Sbjct: 167 VDIQDGDQPLDKRFQIEMLAA 187 >UniRef50_C1XFI8 Mg-chelatase subunit ChlD n=2 Tax=Meiothermus RepID=C1XFI8_MEIRU Length = 722 Score = 67.1 bits (162), Expect = 1e-09, Method: Composition-based stats. Identities = 29/188 (15%), Positives = 57/188 (30%), Gaps = 17/188 (9%) Query: 246 SSQAVMFCLMDVSGSM-DQSTKDMAKRFYILLYLFLSRTYKNVEVVYIRHHT-------- 296 + ++DVSGSM + +A + L VV+ Sbjct: 311 PGGVGIVLVLDVSGSMLEDDKLGLAVTGSLELIRSARPQDYIGVVVFSDRPRWLFRPRPM 370 Query: 297 --QAKEVDEHEFFYSQETGGTIVSSALKLMDEVVKERYNPAQWNIYAAQASDGDNWADDS 354 Q ++ E +Q GGT++ A E +++ ++ +DG A D Sbjct: 371 TEQGRKEAESLLLSTQAGGGTMIRRAYLEALEALEQVPTESKQ---VIALTDG--LAADV 425 Query: 355 PLCHEILAKKLLPVVRYYSYIEITRRAHQTLWREYEHLQSTFDNFAMQHIRDQDDIYPVF 414 A++ P ++ + I A RE + Sbjct: 426 TPDLFDAAREASPRIKTNTVA-IGADADGRFLRELAQAGDGTYWDVPRPEDLPRFFLEEA 484 Query: 415 RELFHKQN 422 + +F ++ Sbjct: 485 QRVFRREA 492 >UniRef50_A6G9E8 von Willebrand factor type A domain protein n=1 Tax=Plesiocystis pacifica SIR-1 RepID=A6G9E8_9DELT Length = 532 Score = 67.1 bits (162), Expect = 1e-09, Method: Composition-based stats. Identities = 30/187 (16%), Positives = 55/187 (29%), Gaps = 18/187 (9%) Query: 231 TFDLRYKNYEKRPDPSSQAV-MFCLMDVSGSMDQSTKDMAKRFYILLYLFLSRTYKNVEV 289 T D R + S++ + + ++D SGSM + A + L V Sbjct: 116 TLDARGNFRDAVARTSAEPLNLAIVIDHSGSMKGQRERNALDAAAGMISRLRDGDTVSVV 175 Query: 290 VYIR-HHTQAKEVDEHEFFYSQE------------TGGTIVSSALKLMDEVVKERYNPAQ 336 Y HT + +G T VS ++ + ++ R Sbjct: 176 SYNTKAHTIVPVTTLDARNRDRVISDLRVGVASRPSGNTCVSCGVEAGLQTLQGRRP--G 233 Query: 337 WNIYAAQASDGD-NWADDSPLCHEILAKKLLPVVRYYSYIEITRRAHQTLWREYEHLQST 395 + SDG+ N LA++ S I + ++ L + Sbjct: 234 IDRMLL-LSDGEANRGVRDEPGIRRLAREARNRGVSISSIGVDVDYNEVLMSAIAREANG 292 Query: 396 FDNFAMQ 402 F+ Sbjct: 293 RHYFSET 299 >UniRef50_Q22G03 Putative uncharacterized protein n=1 Tax=Tetrahymena thermophila SB210 RepID=Q22G03_TETTH Length = 994 Score = 67.1 bits (162), Expect = 1e-09, Method: Composition-based stats. Identities = 34/204 (16%), Positives = 71/204 (34%), Gaps = 21/204 (10%) Query: 231 TFDLRYKNYEKRPDPSSQAVMFCLMDVSGSMDQSTKDMAKRFYILLYLFLSRTYKNVEVV 290 T L+ + S Q ++D SGSMD + + K+ L F+ +T + ++ Sbjct: 27 TLKLKLDETALVQNNSRQKNYQIVIDNSGSMDGTNIQLTKQLCNELVQFVIKTQPHSKIS 86 Query: 291 YIRHHTQAKEV----------DEHEFFYSQETGGTIVSSALKLMDEVVKERYNPAQWNIY 340 + +T V E GGTI + ++ ++ + + Sbjct: 87 LMTFNTSIDHVENLHLKSLKQVEQFISNINANGGTIFHITFDKLRDICQK-FTNQNEELV 145 Query: 341 AAQASDGD---NWADDSPLC----HEILAKKLLPVVRYYSYIEITRRAHQTLWREYEHLQ 393 +DG + + + KK + V ++ T + + LQ Sbjct: 146 IVYLTDGQVQSGQDSTNLKDSFIFLQQVLKKFVNNVEVHALGMGTS-HDPVILDKIISLQ 204 Query: 394 STFDNFAMQHIRDQDDIYPVFREL 417 +T + Q I++ +I F+ + Sbjct: 205 TTQSTY--QFIKESSEIEGAFKNI 226 >UniRef50_A6G7V2 von Willebrand factor, type A n=1 Tax=Plesiocystis pacifica SIR-1 RepID=A6G7V2_9DELT Length = 820 Score = 67.1 bits (162), Expect = 1e-09, Method: Composition-based stats. Identities = 38/302 (12%), Positives = 79/302 (26%), Gaps = 35/302 (11%) Query: 131 DLALPNLKQNQQRQLTEYKTHRAGYTANGVPANISVVRSLQNSLARRTAMTAGKRRELHA 190 DL++ L + L+ G ++V R + + + L Sbjct: 188 DLSVEPLGPRVRVSLSIADALPEGAWPTSPSHKLNVARVEGRA---KVELGGDAGAALDR 244 Query: 191 LEENLAIISNSEPAQLLEEERLRKEIAELRAKIERVPFIDTFD-LRYKNYEKRPDPSSQA 249 ++ A ++ E + R + +++ L P A Sbjct: 245 -----DVVVRWPGAPVVAGEDAGVSLELARPDAAHLGAANSYGRLVLTPPPIEPGREVSA 299 Query: 250 V---MFCLMDVSGSMDQSTKDMAKRFYILLYLFLSRTYKNVEVVY-----------IRHH 295 V + L+D SGSM A+ L L + V + Sbjct: 300 VPRDLIVLLDTSGSMRGEPLAHAQAVTEALIRSLRDRDRLELVEFSSRVRRWSQAPASMS 359 Query: 296 TQAKEVDEHEFFYSQETGGTIVSSALKLMDEVVKERYNPAQWNIYAAQASDGDNWADDSP 355 +E + +GGT + + ++ + +DG Sbjct: 360 AAKREEALRWVGALRASGGTHMRDGILAALASLRP-----EAQRQILLITDG--LIAFES 412 Query: 356 LCHEILAKKLLPVVRYYSYIEITRRAHQTLWREYEHLQSTFDNFAMQHIRDQDDIYPVFR 415 + + P R ++ I +++L R + +D Sbjct: 413 EIVQAARQHRPPGCRVHTLG-IGSSVNRSLTRPVALAGGG----LEVIVAPGEDAEEAAA 467 Query: 416 EL 417 L Sbjct: 468 RL 469 >UniRef50_C4V3L6 Magnesium chelatase n=2 Tax=Selenomonas RepID=C4V3L6_9FIRM Length = 636 Score = 67.1 bits (162), Expect = 1e-09, Method: Composition-based stats. Identities = 56/355 (15%), Positives = 103/355 (29%), Gaps = 54/355 (15%) Query: 37 AINKRSVTDVDSGESVSIPTEDISEPMFHQGRGGLRHRVHP------GNDHFVQND---- 86 I T +G S +P + F R + GN + Sbjct: 247 LIEAACATAALAGRSFVMPADVEEAAEFVLVHRMSRPQEEQQPPPDAGNAPEQGGNDMPD 306 Query: 87 --RIERPQGGGGGSGSGQGQASQDGE---GQDEFVFQISKDEYLDLLFEDLALPN---LK 138 + P GG+ S G +S++ QD +E + + +A P + Sbjct: 307 EPQEAPPPQDDGGAESPHGASSENESQNAPQDGADDPSPPEEAHEDGDDRVAAPLENVMA 366 Query: 139 QNQQRQLTEYKTHRAGYTANGVPANISVVRSLQNSLARRTAMTAGKRRELHALEENLAII 198 + +T R + V + + R L+ L A + Sbjct: 367 RLSLLSMTMRAQGRKSGKRDIVQTHTADGRCLRTEL---------PHSGARLDLALSATL 417 Query: 199 SNSEPAQLLEEERLRKEIAELRAKIERVPFIDTFDLRYKNYEKRPDPSSQAVMFCLMDVS 258 + P Q + + I D+R KR + A + L+D S Sbjct: 418 RAAAPYQRARQ-------------RTQTVVIRPEDVRVWVRAKR----AAANILFLVDAS 460 Query: 259 GSMDQSTK-DMAKRFYILLYLFLSRT--------YKNVEVVYIRHHTQAKEVDEHEFFYS 309 GSM + M K + L + ++ + T++ E+ E + Sbjct: 461 GSMGARERMRMVKGAILALLQEAYQKRDCVGLIAFRRDRAETLLPMTRSVELAEKQLRDL 520 Query: 310 QETGGTIVSSALKLMDEVVKERYNPAQWNIYAAQASDGD-NWADDSPLCHEILAK 363 G T ++ L + ++E +DG N A D + + Sbjct: 521 PTGGRTPLAEGLACAVQTLRELERRGSEKTVLILITDGRTNTARDGDDGVQRALR 575 >UniRef50_D0N9W4 Putative uncharacterized protein n=2 Tax=Phytophthora infestans T30-4 RepID=D0N9W4_PHYIN Length = 2146 Score = 67.1 bits (162), Expect = 1e-09, Method: Composition-based stats. Identities = 20/159 (12%), Positives = 45/159 (28%), Gaps = 9/159 (5%) Query: 252 FCLMDVSGSMDQSTKDMAKRFYI-LLYLFLSRTYKNVEVVYIRHHT------QAKEVDEH 304 ++D SGSM+ + + +Y ++ V + +A+ + Sbjct: 1905 VFVLDCSGSMNGQPWNDLMAAWKEYVYNRIADGATLDLVSVVTFDNSAQIVYEARSITTV 1964 Query: 305 EFFYSQ-ETGGTIVSSALKLMDEVVKERYNPAQWNIYAAQASDGDNWADDSPLCHEILAK 363 Q GGT ++ L+ +EV+ R N + SDG + Sbjct: 1965 TNARIQYRGGGTNYAAGLRSANEVL-SRVNFDMFKPAIVFFSDGHPCDPLQGEELATHIR 2023 Query: 364 KLLPVVRYYSYIEITRRAHQTLWREYEHLQSTFDNFAMQ 402 ++ + + + + Sbjct: 2024 GCYERNGLQAFAVGFGSINLNMLERVAEKLGGTYHHVLT 2062 >UniRef50_C7YL43 Putative uncharacterized protein n=2 Tax=Nectriaceae RepID=C7YL43_NECH7 Length = 764 Score = 67.1 bits (162), Expect = 1e-09, Method: Composition-based stats. Identities = 20/230 (8%), Positives = 53/230 (23%), Gaps = 35/230 (15%) Query: 203 PAQLLEEERLRKEIAELRA--KIERVPFIDTFDLRYKNYEKRPDPSSQAVMFCLMDVSGS 260 + + + + ++ P + ++DVSGS Sbjct: 37 GGPSAIQPPVVIASDDATLHLEPVPDRKGLIVKVQPPTAPSAEIPHVPCDIVLVIDVSGS 96 Query: 261 MDQST---------------KDMAKRFYILLYLFLSRTYKNVEVVYI----------RHH 295 M + D+ K + ++ + + V + Sbjct: 97 MAGAAPVPGEETNESTGLSILDLTKHAARTIIETMNESDRLGIVTFASKAKVVQPLLSMT 156 Query: 296 TQAKEVDEHEFFYSQETGGTIVSSALKLMDEVVKERYNPAQWNIYAAQASDGDNWADDSP 355 ++ KE + T + L ++ K + +DG + Sbjct: 157 SENKERSRGNVTSMRPIDATNLWHGLLEGIKLFKN--VKSSNVPAIMVLTDGMPNHMNPA 214 Query: 356 LCHEILAK---KLLPVVRYYSYIEITRRAHQTLWREYEHLQSTFDNFAMQ 402 + +L + + + L + + F Sbjct: 215 AGFVPKLRAMGQLPASIHTFGFGY---HLRSGLLKSIAEIGGGNYAFIPD 261 >UniRef50_C9RJ63 von Willebrand factor type A n=1 Tax=Fibrobacter succinogenes subsp. succinogenes S85 RepID=C9RJ63_FIBSS Length = 227 Score = 66.7 bits (161), Expect = 1e-09, Method: Composition-based stats. Identities = 22/176 (12%), Positives = 52/176 (29%), Gaps = 33/176 (18%) Query: 240 EKRPDPSSQAVMFCLMDVSGSMDQSTKDMAKRFYILLYLFLSRTY------KNVEVVY-- 291 + +PSS+ + ++D SGSM+ + + L Y + + V + Sbjct: 10 DLENNPSSRVPVCLVLDTSGSMEGDSINELNEGVRLFYDAVRSDETALYAAEISVVTFGG 69 Query: 292 -----IRHHTQAKEVDEHEFFYSQETGGTIVSSALKLMDEVVKER------YNPAQWNIY 340 T + D +F+ GGT + A+ + +++++R + + Sbjct: 70 HASCQAGFSTLEHQPDAPQFY---ADGGTPMGEAMNMALDMLEKRKSEYKASGVDYYQPW 126 Query: 341 AAQASDGDNWADDSPLCHEILAKKLL-----PVVRYYSYIEITRRAHQTLWREYEH 391 +DG S ++ + + Sbjct: 127 IVLMTDGMPNG--SQAELSRSIQRTCDMINDRKLTIFPIGIGE----DADMDVLAR 176 >UniRef50_Q22SJ7 von Willebrand factor type A domain containing protein n=1 Tax=Tetrahymena thermophila SB210 RepID=Q22SJ7_TETTH Length = 642 Score = 66.7 bits (161), Expect = 1e-09, Method: Composition-based stats. Identities = 19/180 (10%), Positives = 53/180 (29%), Gaps = 22/180 (12%) Query: 237 KNYEKRPDPSSQAVMFCLMDVSGSMDQSTKDMAKRFYILLYLFLSRTYKNVEVVYIRHHT 296 K+ + + C+++ S SM K + L L+ + V+ + T Sbjct: 186 KDQVLVKNSRPSIDLVCVINNSESMHGEKILNVKNTLLYLLEMLNSNDRLSLVLSNNNPT 245 Query: 297 QAK------EVDEHE----FFYSQETGGTIVSSALKLMDEVVKERYNPAQWNIYAAQASD 346 E ++ + T T ++ ++ +++ R + + + SD Sbjct: 246 TLFDLKYLDEKNKQDLKRIINNISITQNTNITKSMIKAFNILQFRQSQNKVS-SIFLLSD 304 Query: 347 GDNWADDSPLCHEILA------KKLLPVVRYYSYIEITRRAHQTLWREYEHLQSTFDNFA 400 G + + + + + Y + + L++ + Sbjct: 305 G--VDSSAEKQIQNYISSQQSLQNKNFAIHSFGYGFDQ---DAEMINKICSLKNGNFYYI 359 >UniRef50_Q237Q6 von Willebrand factor type A domain containing protein n=1 Tax=Tetrahymena thermophila SB210 RepID=Q237Q6_TETTH Length = 713 Score = 66.7 bits (161), Expect = 1e-09, Method: Composition-based stats. Identities = 24/201 (11%), Positives = 51/201 (25%), Gaps = 36/201 (17%) Query: 234 LRYKNYEKRPDPSSQAVMFCLMDVSGSMDQST---------------KDMAKRFYILLYL 278 +R + + + C++DVSGSM D+ K + Sbjct: 49 VRIQILSPKGKSKVSNSICCVVDVSGSMGSRAVTKQSGGNSELGYSVLDIVKHSLNTIVQ 108 Query: 279 FLSRTYKNVEVVYIRHHTQA---KEVDEHE-------FFYSQETGGTIVSSALKLMDEVV 328 L + V + + +++ E Q T + + ++ E + Sbjct: 109 NLDEGDEFSMVTFSDNSKLVCNYQQMTESNIKSSVDLINQCQPDASTNIWAGIEQGLEQM 168 Query: 329 KERYNPAQWNIYAAQASDGDNWADDSPLCHEILAK-------KLLPVVRYYSYIEITRRA 381 + N + N +DG + L P + + + Sbjct: 169 QNDSNKNK-NQQLIVLTDGQPNVNPPRGILTTLNNFYNKNIISPKPSINTFGFGY---YL 224 Query: 382 HQTLWREYEHLQSTFDNFAMQ 402 L +F Sbjct: 225 DSHLLFNIAQDCQGIYSFIPD 245 >UniRef50_C0ZKA0 Putative uncharacterized protein n=2 Tax=Bacteria RepID=C0ZKA0_BREBN Length = 477 Score = 66.7 bits (161), Expect = 2e-09, Method: Composition-based stats. Identities = 51/372 (13%), Positives = 101/372 (27%), Gaps = 67/372 (18%) Query: 77 PGNDHFVQNDRIERPQGGGGGSGSGQGQASQDGEGQDEFVFQISKDEYLDLLFE--DLAL 134 P + P D G +++++E + L + Sbjct: 24 PDTSQNAEGQNPSAPP---STETPPTQSQPPDQTGD--PNAEMTQEEKVKALEAMAQEGM 78 Query: 135 PNLKQNQQRQLTEYKTHRAGYTANGVPANISVVRSLQNSLARRTAMTAGKRRELH-ALEE 193 P ++ + + +G + + + + + L + + ++ Sbjct: 79 PLKRETTEDFVNSPPGRFSGVSYD------NNREEVLSELKKFPTVEKPDEEMMNKYYLA 132 Query: 194 NLAIISNSEPAQLLEEERLRKEIAELRAKIERVPFIDTFDLRYKNYEKRPDPSSQAVMFC 253 L + + S P + L+ P ID ++K Sbjct: 133 LLGLFAQSYPDPQQIIDELKMAS-------FGNPDIDDPRFKFKESYNVEI--------- 176 Query: 254 LMDVSGSMDQS-----TKDMAKRFYILLYLFLSRTYKNVEVVYIRHHTQAKEVDE----- 303 ++D SGSM D AK L VY H KE D+ Sbjct: 177 ILDASGSMAAKSNGKTRMDAAKEAIQAFAESLPEQANVALRVY-GHKGSGKESDKTLSCG 235 Query: 304 -----------------HEFFYSQETGGTIVSSALKLMDEVVKERYNPAQWNIYAAQASD 346 Q TG T ++ +L+ + + + N SD Sbjct: 236 SSELVYGMQTYNKEKLTQSLNQFQPTGYTPIAYSLQEAKKDLSKLPGDKNTN-MIFLVSD 294 Query: 347 G-DNWADDSPLCHEILAK-KLLPVVRYYSYIEITRRAHQTLWREYEHLQSTFDNFAMQHI 404 G + D + LA+ ++ P++ + Q +E I Sbjct: 295 GIETCDGDPVEAAKQLAQSEITPIINVIGFGVDGP--GQQQLKEVAKAAGGRY----VLI 348 Query: 405 RDQDDIYPVFRE 416 +DQ ++ F Sbjct: 349 QDQKELQDEFNR 360 >UniRef50_C5PP99 von Willebrand factor, type A n=2 Tax=Bacteroidetes RepID=C5PP99_9SPHI Length = 256 Score = 66.7 bits (161), Expect = 2e-09, Method: Composition-based stats. Identities = 26/161 (16%), Positives = 49/161 (30%), Gaps = 24/161 (14%) Query: 236 YKNYEKRPDPSSQAVMFCLMDVSGSMDQSTKDMAKRFYILLYLFLSRTYKNVE---VVYI 292 +N ++ + ++ L+D SGSM + L + E + I Sbjct: 36 AQNPIRKKITMRRLPVYFLLDTSGSMHGEPIQALNNALSGMINNLRTDAQAAETLWISMI 95 Query: 293 RHHTQAKE-VDEHEFFYSQ-------ETGGTIVSSALKLMDEVVKERYN------PAQWN 338 + KE V Q E+G T AL+++ + W Sbjct: 96 TFDREVKEIVPLTALESFQLPEISCPESGPTFTGKALEILYDTATREVIKGSPEQKGDWR 155 Query: 339 IYAAQASDGDNWADDSPLCHEILAKKLLPVVRYYSYIEITR 379 +DG L +++P +R ++ I Sbjct: 156 PLLFIFTDGKPSD-------LQLYSQMIPKIRSLNFGTIVG 189 >UniRef50_A2F7N4 von Willebrand factor type A domain containing protein n=5 Tax=Trichomonas vaginalis RepID=A2F7N4_TRIVA Length = 722 Score = 66.7 bits (161), Expect = 2e-09, Method: Composition-based stats. Identities = 22/166 (13%), Positives = 47/166 (28%), Gaps = 21/166 (12%) Query: 247 SQAVMFCLMDVSGSMDQSTKDMAKRFYILLYLFLSRTYKNVEVVYIRH----------HT 296 S + + ++D SGSM S + A + L L + + + H Sbjct: 242 SNSEFYFIVDCSGSMSCSRINNAIKCMRLFIQSLPVGCRFSILRFGSHFETVLPPCDYTD 301 Query: 297 QAKEVDEHEFFYSQET-GGTIVSSALKLMDEV-VKERYNPAQWNIYAAQASDGDNWADDS 354 + + GGT + + L+ + ++ E + +DG+ + Sbjct: 302 ENVANAMNLLDNISANMGGTNILAPLQHVSDLQASEGFVKQ-----IFFLTDGE-VDNSD 355 Query: 355 PLCHEILAKKLLPVVRYYSYIEITRRAHQTLWREYEHLQSTFDNFA 400 +C L + + A L + Sbjct: 356 IICATALKNRSTNRIFSIGLG---SGADPGLIKGMARKSGGNYAII 398 >UniRef50_C1GWG1 von Willebrand factor type A domain containing protein n=1 Tax=Paracoccidioides brasiliensis Pb01 RepID=C1GWG1_PARBA Length = 773 Score = 66.7 bits (161), Expect = 2e-09, Method: Composition-based stats. Identities = 28/198 (14%), Positives = 49/198 (24%), Gaps = 38/198 (19%) Query: 240 EKRPDPSSQAVMFCLMDVSGSMDQST------------------KDMAKRFYILLYLFLS 281 ++ + +DVSGSM S D+ K + L+ Sbjct: 65 PEKDIRHVPCDIVLCIDVSGSMQLSAPLPTTDESGKREETGLSVLDLTKHAARTIIETLN 124 Query: 282 RTYKNVEVVY---------IRH-HTQAKEVDEHEFFYSQETGGTIVSSALKLMDEVVKER 331 + V + I H K+ Q T + LKL V+ + Sbjct: 125 ENDRLGVVTFSNDAEVAYKISHMDDTNKKAALEAVEALQPLASTNLWHGLKLGLSVLGKV 184 Query: 332 YNPAQWNIYAAQASDGDNWA-------DDSPLCHEILAKKLLPVVRYYSYIEITRRAHQT 384 Q +DG K LP++ + + Sbjct: 185 DLRPQNVQALYVLTDGQPNHMCPRQGYVPKLRPILERQKDRLPLIHTFGFGY---DIRSG 241 Query: 385 LWREYEHLQSTFDNFAMQ 402 L + + +F Sbjct: 242 LLQSIAEVGGGTYSFIPD 259 >UniRef50_Q7MCW9 Uncharacterized protein n=2 Tax=Vibrio vulnificus RepID=Q7MCW9_VIBVY Length = 688 Score = 66.3 bits (160), Expect = 2e-09, Method: Composition-based stats. Identities = 37/233 (15%), Positives = 65/233 (27%), Gaps = 44/233 (18%) Query: 191 LEENLAIISNSEPAQLLEEERLRKEIAELRAKIERVPFIDTFDLRYKNYEKRPDPSSQAV 250 L L +S +P Q I + + I + +++ Sbjct: 272 LPGRLEAVSYRDPQQSERG-----TIKLTFTPGDDLSAIQ----QGRDW----------- 311 Query: 251 MFCLMDVSGSMDQST---KDMAKRFYILL-----YLFLSRTYKNVEVV--YIRHHTQAKE 300 ++D SGSM + KR L + L + E+ +I + Sbjct: 312 -VFVLDKSGSMSGKHATLTEGVKRGLGKLPSGDRFRILMFDNRVQEITNGFIAVNQNNVT 370 Query: 301 VDEHEFFYSQETGGTIVSSALKLMDEVVKERYNPAQWNIYAAQASDG-DNWADDSPLCHE 359 GGT + AL+ + +DG N Sbjct: 371 QAIETINQIATGGGTNLYDALERAVSGLDSDRTTG-----IILVTDGVANVGVTEKKQFL 425 Query: 360 ILAKKLLPVVRYYSYIEITRRAHQTLWREYEHLQSTFDNFAMQHIRDQDDIYP 412 L ++ VR Y++I + L + + F I + DDI Sbjct: 426 KLMQRY--DVRLYTFIMGNSA-NTPLLEPMTQVSNGFA----TSISNSDDILG 471 >UniRef50_UPI0001BC5690 magnesium chelatase n=1 Tax=Fusobacterium sp. D12 RepID=UPI0001BC5690 Length = 605 Score = 66.3 bits (160), Expect = 2e-09, Method: Composition-based stats. Identities = 50/321 (15%), Positives = 100/321 (31%), Gaps = 44/321 (13%) Query: 110 EGQDEFVFQISKDEYLDLLFEDLALPNLKQNQQRQLTEYKTHRAGYTANGVPANISVVRS 169 E + EFV K+E + + A+ L + L G + S Sbjct: 313 EREQEFVPPKEKEESTFSIGDTFAVKELVHKKTLHLK---------KRRGSGKRLKTTTS 363 Query: 170 LQNSLARRTAMTAGKRRELHALEENLAIISNSEPAQLLEEERLRKEIAELRAKIERVPFI 229 L+ R KR+ +E+ + A + K I Sbjct: 364 LKQ--GRDIKSGFPKRK----MEDFAFAATIRAAAPHQKRRE----------KKFVKISI 407 Query: 230 DTFDLRYKNYEKRPDPSSQAVMFCLMDVSGSMDQSTKDMAKR--FYILLYLFLSRTYKNV 287 D+R K EKR + ++D SGSM + A + + LL + K Sbjct: 408 QKEDIRIKIREKR----IGTHILFVVDSSGSMGAKKRMRAVKGAIFSLLQDAYEKRDKVA 463 Query: 288 EVVYIRHH-------TQAKEVDEHEFFYSQETGGTIVSSALKLMDEVVKERYNPAQ-WNI 339 V + + T++ E+ + + G T ++ L +++++ Sbjct: 464 LVAFRKKSAEELLSMTRSIELAKKQLQNLATGGKTPLAEGLFKAYQLIRQLKKKDGEIYP 523 Query: 340 YAAQASDG-DNW---ADDSPLCHEILAKKLLPVVRYYSYIEITRRAHQTLWREYEHLQST 395 SDG N D +A+K+ I+ + Sbjct: 524 LLVLISDGRANISLHGRDPIEESLEMARKIKKEGISSVVIDTEEGFTLLEMAKNISEAMG 583 Query: 396 FDNFAMQHIRDQDDIYPVFRE 416 + + +++I +D+ + ++ Sbjct: 584 AEYYRLENI-QAEDMLKLLKK 603 >UniRef50_Q87W17 von Willebrand factor type A domain protein n=2 Tax=Proteobacteria RepID=Q87W17_PSESM Length = 224 Score = 66.3 bits (160), Expect = 2e-09, Method: Composition-based stats. Identities = 18/151 (11%), Positives = 53/151 (35%), Gaps = 19/151 (12%) Query: 240 EKRPDPSSQAVMFCLMDVSGSMDQSTKDMAKRFYILLYLFLSRTYKN----VEVVYIRHH 295 + +P+++ + ++DVSGSM + + Y + + E+ + Sbjct: 11 DLVDNPTARVPICLVLDVSGSMAGEPIRELQAGVNMFYQAIRE-DEVAQYAAEISIVTFG 69 Query: 296 TQAKE------VDEHEFFYSQETGGTIVSSALKLMDEVVK------ERYNPAQWNIYAAQ 343 ++AK ++ + G T + + L ++++ +R + + Sbjct: 70 SEAKRTVDFMAIERQDVPALIAEGTTSMGQGVNLALDLLEVRKGDYQRAGVDYYQPWMVV 129 Query: 344 ASDGDNWADDSPLCHEILAKKLLPVVRYYSY 374 +DG+ D +++ + + Sbjct: 130 MTDGE--PTDDITRASERIREMCESKKLTVF 158 >UniRef50_Q8TU27 Putative uncharacterized protein n=1 Tax=Methanosarcina acetivorans RepID=Q8TU27_METAC Length = 589 Score = 66.3 bits (160), Expect = 2e-09, Method: Composition-based stats. Identities = 20/183 (10%), Positives = 48/183 (26%), Gaps = 19/183 (10%) Query: 251 MFCLMDVSGSMDQSTKDMA-KRFYILLYLFLSRTYKNVEVV-------YIRHHTQAKEVD 302 + +D SGSM + K + + VV + T + Sbjct: 83 VVFAIDSSGSMQSNDPSGLRKTAAKSFVDKMDSSRDTAGVVSWDDSIDFSLPLTNDFPLV 142 Query: 303 EHEFFYSQETGGTIVSSALKLMDEVVKERYNPAQWNIYAAQASDGDNWADDSPLCHEILA 362 + +G T ++ L+ +++ +DG A Sbjct: 143 KTNIDSVDSSGSTNLNVGLEEAIDILDANPRTENSVEVIIFLTDGQGTY------LHSTA 196 Query: 363 KKLLPVVRYYSYIEITRRAHQTLWREYEHLQSTFDNFAMQHIRDQDDIYPVFRELFHKQN 422 ++ Y + T ++ + D + +F ++F + Sbjct: 197 QEAADKGYVI-YSIGLGGVNPTPLQDMATTTGGAYYSSP----DATSLQAIFDDIFSEVT 251 Query: 423 ATA 425 + Sbjct: 252 TST 254 >UniRef50_Q23JA0 von Willebrand factor type A domain containing protein n=2 Tax=Tetrahymena thermophila SB210 RepID=Q23JA0_TETTH Length = 1049 Score = 66.3 bits (160), Expect = 2e-09, Method: Composition-based stats. Identities = 37/234 (15%), Positives = 65/234 (27%), Gaps = 27/234 (11%) Query: 182 AGKRRELHALEENLAIISNSEPAQLLEEERLRKEIAELRAKIERVPFIDTFDLRYKNYEK 241 A A+ + + ++ + R + +L Sbjct: 251 AILPASHSAMVSFIPNFNEDITQEIDDSIRAAIN-NGDDIFSDEFQQKLNQEL------I 303 Query: 242 RPDPSSQAVMFCLMDVSGSMDQSTKDMAKRFYILLYLFLSRTYKNVEVVYIR-------- 293 SS++ L+D SGSM A L L + + Sbjct: 304 DHLNSSRSEFIFLLDRSGSMSGQPIRRACEALTLFLKSLPNDSYFNVISFGSSFDKLFPS 363 Query: 294 ---HHTQAKEVDEHEFFYSQET-GGTIVSSALKLMDEVVKERYNPAQWNIYAAQASDGDN 349 + +++ E Q GGT + + L + V+ + +N +DG+ Sbjct: 364 STKYTSESLEKAILLISKYQADLGGTEIYNPLNNVF--VQNKI--QGYNKQIFLLTDGE- 418 Query: 350 WADDSPLCHEILAKKLLPVVRYYSYIEITRRAHQTLWREYEHLQSTFDNFAMQH 403 DSP L KK R +S + Q L +E Q Sbjct: 419 --VDSPQQVVRLIKKNNKYNRVHSIGFGSGA-DQYLIKESAIAGKGISKLVDQK 469 >UniRef50_P76396 Uncharacterized protein yegL n=64 Tax=cellular organisms RepID=YEGL_ECOLI Length = 219 Score = 66.3 bits (160), Expect = 2e-09, Method: Composition-based stats. Identities = 24/173 (13%), Positives = 54/173 (31%), Gaps = 19/173 (10%) Query: 234 LRYKNYEKRPDPSSQAVMFCLMDVSGSMDQSTKDMAKRFYILLYLFL------SRTYKNV 287 + + + +P + L+DVSGSM+ + + L + + Sbjct: 5 ITFATSDFASNPEPRCPCILLLDVSGSMNGRPINELNAGLVTFRDELLADPLALKRVELG 64 Query: 288 EVVYIRHHTQAKEVDEHEFFY--SQETGGTIVSSALKLMDEVVKER---YNPAQWNIY-- 340 V + H + FF G T + +A+ ++V+ER Y + Y Sbjct: 65 IVTFGPVHVEQPFTSAANFFPPILFAQGDTPMGAAITKALDMVEERKREYRANGISYYRP 124 Query: 341 -AAQASDGDNWADDSPLCHEILAKKLLPVVRYYSYIEITRRAHQTLWREYEHL 392 +DG + +++ + ++S + + Sbjct: 125 WIFLITDGAPTDEWQAAANKVFRGEEDKRFAFFSIGVQGAD-----MKTLAQI 172 >UniRef50_UPI00006CD16B von Willebrand factor type A domain containing protein n=2 Tax=Tetrahymena thermophila RepID=UPI00006CD16B Length = 730 Score = 66.3 bits (160), Expect = 2e-09, Method: Composition-based stats. Identities = 33/254 (12%), Positives = 73/254 (28%), Gaps = 24/254 (9%) Query: 180 MTAGKRRELHALEENLAIISNSEPAQLLEEERLRKEIAELRAKIERVPFIDT-FDLRYKN 238 + + A + L ++ +E+ + FI + Sbjct: 194 KGELSKINEGYFSQEKAYLKVKYTKNNLSMLSFDQKNSEISPYCALINFIPPQISTQENL 253 Query: 239 YEKRPDPSSQAVMFCLMDVSGSMDQSTKDMAKRFYILLYLFLSRTYKNVEVVY------- 291 K D ++ ++D SGSM ++AK I L + + Sbjct: 254 LTKTTDQLIKSEFVLIIDRSGSMYGPKMELAKESLIFFLKSLPVGSIYNIISFGSTCEIM 313 Query: 292 ----IRHHTQAKEVDEHEFFYSQET-GGTIVSSALKLMDEVVKERYNPAQWNIYAAQASD 346 ++ + Q + + GGT VS AL+ + ++ +D Sbjct: 314 FDQSVQFNDQNVQNSIQQIDQFSANLGGTNVSKALEHVY---LNLFDQYGLRKKIFIITD 370 Query: 347 GDNWADDSPLCHEILAKKLLPVVRYYSYIEITRRAHQTLWREYEHLQSTFDNFAMQHIRD 406 G+ + + + + +++ + + F QH++D Sbjct: 371 GEFTDRNETQELVNAYQNRC-DINVLCIG------KDSQFQQAIEIANKTGGF-TQHVKD 422 Query: 407 QDDIYPVFRELFHK 420 DI L + Sbjct: 423 HIDIISKVILLLSQ 436 >UniRef50_C3YRH3 Putative uncharacterized protein (Fragment) n=1 Tax=Branchiostoma floridae RepID=C3YRH3_BRAFL Length = 581 Score = 66.3 bits (160), Expect = 2e-09, Method: Composition-based stats. Identities = 36/289 (12%), Positives = 81/289 (28%), Gaps = 40/289 (13%) Query: 115 FVFQISKDEYLDLLFEDLALPNLKQNQQRQLTEYKTHRAGYTANGVPANISVVRSLQNSL 174 + F+ + + L + P ++ R AN V ++ + + Sbjct: 212 YEFEFQLEVKMPCLLAGVESP----THSIRVDADPYARN---ANEVFVTLAEQHTYSEDV 264 Query: 175 ARRTAMTAG-KRRELHAL--------EENLAIISNSEPAQLLEEERLRKEIAELRAKIER 225 ++ K + EE + + Q +EE+ ++ LR ++ + Sbjct: 265 QVLLYLSDPHKPAIILEHGDMSLSGYEEYVKSRRGFKRLQREKEEKPSSKVDYLRGRLHK 324 Query: 226 VPFIDTFDLRYKNYEKRPDPSSQA-----VMFCLMDVSGSMDQSTKDMAKRFYILLYLFL 280 + + P ++D SGSM + A+ +L L Sbjct: 325 DLMHHAAIMLSFFPDFSSIPPRDPSKIPGNFIFILDRSGSMSGANIAGARETLLLFLKSL 384 Query: 281 SRTYKNVEVVYIR-----HHTQA------KEVDEHEFFYSQET-GGTIVSSALKLMDEVV 328 V + T + + + GGT + S L+ + Sbjct: 385 PTCCVFNIVSFGSSYKPMFSTSVPYTQQNVDKASADIKKMRADMGGTNILSPLQWVF--- 441 Query: 329 KERYNPAQWNIYAAQASDGDNWADDSPLCHEILAKKLLPVVRYYSYIEI 377 + + +DG + + L +K R ++ Sbjct: 442 -SAPVTSGYPRQVFLLTDG---SVSNTGTVIDLVRKNAYNTRCFALGIG 486 >UniRef50_A6EQD3 von Willebrand factor type A like domain n=2 Tax=Bacteroidetes RepID=A6EQD3_9BACT Length = 733 Score = 66.3 bits (160), Expect = 2e-09, Method: Composition-based stats. Identities = 24/190 (12%), Positives = 55/190 (28%), Gaps = 27/190 (14%) Query: 242 RPDPSSQAVMFCLMDVSGSMDQSTKDMAKRFYILLYLFLSRTYKNVEVVYIRHHTQAKEV 301 +P + ++DVSGSM+ +++K L L+ ++ T Sbjct: 286 KPKNVTAREYLFIVDVSGSMNGYPLEVSKDLMRNLLCNLNADDTFNVQLFASSSTIFNPT 345 Query: 302 DEHEFFYSQETGGTIV---------------SSALKLMDEVVKERYNPAQWNIYAAQASD 346 + T SAL + E+ + + +D Sbjct: 346 PVEATDENV----TNAIKFLTSGQGGGGTQLLSALNVAYELPRS---QEGSSRSMVIITD 398 Query: 347 GDNWADDSPLCHEILAKKLLPVVRYYSYIEITRRAHQTLWREYEHLQSTFDNFAMQHIRD 406 G ++ L +++ I ++ L + S ++F + Sbjct: 399 G---YVSVEREAFTKIEENLDQASVFTFG-IGSSVNRYLIEGMAAV-SKSESFIATSREE 453 Query: 407 QDDIYPVFRE 416 + F++ Sbjct: 454 ASKVAEDFKK 463 >UniRef50_Q7UNJ0 Putative uncharacterized protein n=1 Tax=Rhodopirellula baltica RepID=Q7UNJ0_RHOBA Length = 327 Score = 65.9 bits (159), Expect = 2e-09, Method: Composition-based stats. Identities = 22/183 (12%), Positives = 52/183 (28%), Gaps = 13/183 (7%) Query: 251 MFCLMDVSGSMDQSTKDMAKRFYILLYLFLSRTYKNVEVVYIRHHTQAKEVDE------- 303 + ++D SGSM S + + + L+ T + ++ ++ +A E + Sbjct: 149 ITLVVDRSGSMAGSRFNDLQAAIRIFTDLLATTPVDEQIGLASYNDRASEDVQLTENFAE 208 Query: 304 --HEFFYSQETGGTIVSSALKLMDEVVKERYNPAQWNIYAAQASDGDNWADDSPLCHEIL 361 + + G T +S ++ E+ P +DG + P Sbjct: 209 VNNAMDRLRTGGFTSISRGMQAGQEIALRGRPPEFVERTMIVMTDGRHNRGPEPRVVATD 268 Query: 362 AKKLLPVVRYYSYIEITRRAHQTLWREYEHLQSTFDNFAMQHIRDQDDIYPVFRELFHKQ 421 + ++ ++ + +F + DIY Sbjct: 269 LAADGVTIHTITFGAGA---DFGRMQDVARIGGGR-HFHATNGDQLRDIYREIALTLGTV 324 Query: 422 NAT 424 Sbjct: 325 LTE 327 >UniRef50_D1YD07 von Willebrand factor type A domain protein n=2 Tax=Propionibacterium acnes RepID=D1YD07_PROAC Length = 318 Score = 65.9 bits (159), Expect = 3e-09, Method: Composition-based stats. Identities = 29/213 (13%), Positives = 56/213 (26%), Gaps = 36/213 (16%) Query: 242 RPDPSSQAVMFCLMDVSGSM-----DQSTKDMAKRFYILLYLFLSRTYKNVEVVYIRH-- 294 P +A + +DVS SM + S AK L + V + Sbjct: 80 HEVPRDRATVVVAIDVSRSMVATDVEPSRLSAAKTAAKDFLGDLPPRFNVSLVKFAASAQ 139 Query: 295 ----HTQAKEVDEHEFFYSQETGGTIVSSALKLMDEVVK-----ERYNPAQWNIYAAQAS 345 T + Q T + + +K ++ + S Sbjct: 140 VVVPPTTDRAAVSTAITNLQVLPSTAIGEGIYSSLNALKLVPDDPKHPGQKPPAAIVLLS 199 Query: 346 DGD-NWADDSPLCHEILAKKLLPVVRYYSYIEITRR-----------AHQTLWREYEHLQ 393 DG N S + ++ +P V +Y + Sbjct: 200 DGATNVGRPSLEAAKEAGRQHVP-VYTIAYGTAGGYVVEGGQRQPVPVNHYELAAIAKA- 257 Query: 394 STFDNFAMQHIRDQDDIYPVF------RELFHK 420 S + F+ + + D+Y ++F + Sbjct: 258 SGGEKFSAESLGQLSDVYKSIAQSVGYEKVFGE 290 >UniRef50_C8XH18 von Willebrand factor type A n=1 Tax=Nakamurella multipartita DSM 44233 RepID=C8XH18_NAKMY Length = 618 Score = 65.9 bits (159), Expect = 3e-09, Method: Composition-based stats. Identities = 41/265 (15%), Positives = 77/265 (29%), Gaps = 42/265 (15%) Query: 183 GKRRELHALEENLAIISNSEPAQLLEEERLRKEIAELRAKIERVPFIDTFDLRYKNYEKR 242 ++ E N + + A P + D+R + + Sbjct: 366 LPEQQKVFTEANFRTADHQPGEPIT-SSPYLIADGVTIALNPPGPSVLR-DVRA-LWTQV 422 Query: 243 PDPSSQAVMFCLMDVSGSM-------DQSTKDMAKRFYILLYLFLSRTYK---------- 285 P A + +MDVSGSM +S D+AK+ L+ T + Sbjct: 423 RKP---ARVLVVMDVSGSMASESGYGSESKLDLAKKAATSALGQLTDTDQMGLWAFTTDL 479 Query: 286 ------NVEVVYIRHHTQAKEVDEHEFFYSQETGGTIVSSALKLMDEVVKERYNPAQWNI 339 ++V + Q ++ GT + +A + + + + +P N Sbjct: 480 PTPDTITADLVGVGPLAQTRQPIIDAISSLTPLNGTPLYAATREAAKAMNAQKDPNSINA 539 Query: 340 YAAQASDGDNWADDSP-----LCHEILAKKLLPVVRYYSYIEITRRAHQTLWREYEHLQS 394 +DG N D+ A++ V +Y A +E Sbjct: 540 VVV-LTDGRNEYTDNDLDGLLRELNASAEEDGVRVFTIAYG---PDADLATLQEISEASR 595 Query: 395 TFDNFAMQHIRDQDDIYPVFRELFH 419 R+ I VF ++ Sbjct: 596 AAAY----DARNPTSIDKVFSDVLS 616 >UniRef50_UPI0001792BA1 PREDICTED: similar to Inter-alpha-trypsin inhibitor heavy chain H4 precursor (ITI heavy chain H4) (Inter-alpha-inhibitor heavy chain 4) (Inter-alpha-trypsin inhibitor family heavy chain-related protein) (IHRP) (Plasma kallikrein sensitive glycoprotein 120) (P... n=1 Tax=Acyrthosiphon pisum RepID=UPI0001792BA1 Length = 821 Score = 65.9 bits (159), Expect = 3e-09, Method: Composition-based stats. Identities = 22/233 (9%), Positives = 55/233 (23%), Gaps = 55/233 (23%) Query: 239 YEKRPDPSSQAVMFCLMDVSGSMDQSTKDMAKRFYILLYLFLSRTYKNVEVVYI------ 292 + + + ++DVSGSM+ K + + +++ Sbjct: 264 FAPTDLKPLKTHVIFILDVSGSMNGQKITQVKGAMSQILSEIDSEDFFTLILFSSLAQIW 323 Query: 293 -------------------------RHHTQAKEVDEHEFFY-------SQETGGTIVSSA 320 + +E Y + T + A Sbjct: 324 TINATQNTSNYWDDRGRNLNNFETMGENHFIFSANEQNIQYAKKFIQALEPDSTTNMEDA 383 Query: 321 LKLMDEVV---KERYNPAQWN--IYAAQASDGD-NWADDSPLCHEILAKKLLPV---VRY 371 L + K R+ + +DG+ N +P + + Sbjct: 384 LNKALSIAKLGKMRFKDSAKTPKPIIVFLTDGEMNEGITNPQALMKYVSDINVDNYPIYS 443 Query: 372 YSYIEITRRAHQTLWREYEHLQSTFDNFAMQHIRDQDDIYPVFRELFHKQNAT 424 + + ++ + F + D F+K+ ++ Sbjct: 444 LGFGKGA---DIEFLKKLSLNNTGFARVIY----EASDASLQLHN-FYKEISS 488 >UniRef50_A0PNU3 UPF0353 protein MUL_1490 n=43 Tax=Actinomycetales RepID=Y1490_MYCUA Length = 335 Score = 65.9 bits (159), Expect = 3e-09, Method: Composition-based stats. Identities = 27/213 (12%), Positives = 57/213 (26%), Gaps = 42/213 (19%) Query: 241 KRPDPSSQAVMFCLMDVSGSMDQSTKD-----MAKRFYILLYLFLSRTYKNVEVVYIR-- 293 P ++AV+ ++DVS SM + + A+ L+ + Y Sbjct: 89 DVRIPRNRAVVMLVIDVSQSMRATDVEPNRMVAAQEAAKQFADELTPGINLGLIAYAGTA 148 Query: 294 ----HHTQAKEVDEHEFFYSQETGGTIVSSALKLMDEVVKE---------RYNPAQWNIY 340 T +E + Q T A+ + + PA+ Sbjct: 149 TVLVSPTTNREATKAALDKLQFADRTATGEAIFTALQAIATVGAVIGGGDTPPPAR---- 204 Query: 341 AAQASDGDNWADDSPLCHE------ILAKKLLPVVRYYSYIEITR-----------RAHQ 383 SDG +P + AK + S+ Sbjct: 205 IVLFSDGKETMPTNPDNPKGAYTAARTAKDQGVPISTISFGTPYGFVEINDQRQPVPVDD 264 Query: 384 TLWREYEHLQSTFDNFAMQHIRDQDDIYPVFRE 416 ++ L ++ + + + +Y ++ Sbjct: 265 ETMKKVAQLSGGN-SYNAATLAELNSVYVSLQQ 296 >UniRef50_UPI0001C37785 von Willebrand factor type A n=1 Tax=Ruminococcus flavefaciens FD-1 RepID=UPI0001C37785 Length = 285 Score = 65.9 bits (159), Expect = 3e-09, Method: Composition-based stats. Identities = 23/153 (15%), Positives = 44/153 (28%), Gaps = 14/153 (9%) Query: 249 AVMFCLMDVSGSMDQSTKDMAKRFYILLYLFLSR---TYKNVEVVYIRHHTQAK-----E 300 V+F L+D SGSM L + R V+V + T + Sbjct: 51 LVIFFLIDTSGSMKGKKMGELNTVMEELIPEIRRVGEADTEVKVAVLTFSTDVRWMYSTP 110 Query: 301 VDEHEFF--YSQETGGTIVSSAL-KLMDEVVKERYNPA---QWNIYAAQASDGDNWADDS 354 + +F + G T + +A +L + + + + + +DG D Sbjct: 111 IPIEDFEWARLRANGVTSMGAAFKELSLRMSRNSFLNSPSLSFAPVIFLMTDGYPSDDYR 170 Query: 355 PLCHEILAKKLLPVVRYYSYIEITRRAHQTLWR 387 E+ + + L Sbjct: 171 EGLKELQSNSWYKFGLKAALGIGNEANDDVLAE 203 >UniRef50_UPI0001C31E2D von Willebrand factor type A n=1 Tax=Conexibacter woesei DSM 14684 RepID=UPI0001C31E2D Length = 319 Score = 65.9 bits (159), Expect = 3e-09, Method: Composition-based stats. Identities = 25/154 (16%), Positives = 47/154 (30%), Gaps = 13/154 (8%) Query: 240 EKRPDPSSQAVMFCLMDVSGSM-----DQSTKDMAKRFYILLYLFLSRTYKNVEVVYIRH 294 P +A + + DVSGSM + AKR + RT + + Sbjct: 77 RTVAVPVERASIALVTDVSGSMLATDVQPNRMIAAKRAARRFVDEVPRTVNLGVISFNNT 136 Query: 295 HTQAKEVDEH------EFFYSQETGGTIVSSALKLMDEVVKERYNPAQWNI--YAAQASD 346 T + + +GGT A+ E+++ + SD Sbjct: 137 ATVLQSPTRNRSDVLTAIDRLAVSGGTATGEAIATATEMLRNQPGENGRRPPSAIVLISD 196 Query: 347 GDNWADDSPLCHEILAKKLLPVVRYYSYIEITRR 380 G + P+ A++L + ++ Sbjct: 197 GTSTNGRDPIEAAAEARRLRIPIYTVAFGTDQGT 230 >UniRef50_A7T2Z0 Predicted protein n=4 Tax=Nematostella vectensis RepID=A7T2Z0_NEMVE Length = 357 Score = 65.9 bits (159), Expect = 3e-09, Method: Composition-based stats. Identities = 33/184 (17%), Positives = 56/184 (30%), Gaps = 16/184 (8%) Query: 245 PSSQAVMFCLMDVSGSMDQSTKDMAKRFYILLYLF--LSRTYKNVEVVYIRHHTQA---- 298 P + + L+D SGS++ + D K F L +S TY +V VY +T A Sbjct: 141 PPLKMNLVFLIDNSGSINDTEFDNFKEFAKKLAESFTISATYTHVAAVY--FNTLANFGF 198 Query: 299 -----KEVDEHEFFYSQE-TGGTIVSSALKLMDEVVKERYNPAQWNIYAAQASDGDNWAD 352 V + GGT + AL + V + +DG + Sbjct: 199 NLKYDINVIKTAIDNLPNIGGGTHIGKALTYTLDNVFKVAPRQNVKNVLVVLTDGK--SH 256 Query: 353 DSPLCHEILAKKLLPVVRYYSYIEITRRAHQTLWREYEHLQSTFDNFAMQHIRDQDDIYP 412 DS + P V ++ + F ++H + Sbjct: 257 DSVTLPAAAVRNYGPGVEVFAVGVGAGDSFVAQLNVIASDPDEDHVFHVEHFSQIESTTG 316 Query: 413 VFRE 416 + Sbjct: 317 AVED 320 >UniRef50_A5UW94 von Willebrand factor, type A n=2 Tax=Roseiflexus RepID=A5UW94_ROSS1 Length = 452 Score = 65.6 bits (158), Expect = 3e-09, Method: Composition-based stats. Identities = 21/176 (11%), Positives = 38/176 (21%), Gaps = 22/176 (12%) Query: 258 SGSMDQSTKDMAKRFY-------ILLYLFLSRTYKNVEVVYIRHH--------TQAKEVD 302 SGS+ Q + A + L R + VV+ H + Sbjct: 91 SGSVPQEVRKAASSALDHVVHALHTVVERLDRNDRLSLVVFADHALLLIPGMVGSDRVTL 150 Query: 303 EHEFFYSQE---TGGTIVSSALKLMDEVVKERYNPAQWNIYAAQASDGDNWADDSPLCHE 359 GT ++ + L ++ + + N +DG + L Sbjct: 151 VRAIERLPGLDLGDGTNLADGIALALNQIRANRDARRANRVLL-LTDGFTRDPAACLTLA 209 Query: 360 ILAKKLLPVVRYYSYIEITRRAHQTLWREYEHLQSTFDNFAMQHIRDQDDIYPVFR 415 A + L F + I Sbjct: 210 DQAADEHIAITTIGLG---GEFQDDLLTGIADRSGGNALFLKRASAIPRAISAELE 262 >UniRef50_Q2QZN5 Putative uncharacterized protein n=1 Tax=Oryza sativa Japonica Group RepID=Q2QZN5_ORYSJ Length = 553 Score = 65.6 bits (158), Expect = 3e-09, Method: Composition-based stats. Identities = 25/172 (14%), Positives = 45/172 (26%), Gaps = 26/172 (15%) Query: 260 SMDQS-TKDMAKRFYILLYLFLSRTYKNVEV-----VYIRHHTQAKEVD-------EHEF 306 SM D+ K ++ L + V V T+ E+ + Sbjct: 6 SMHGWTRLDLVKGAMKMVTNKLGAGDRLAIVPFNGKVVAAGATRLMEMTTKGRADANAKV 65 Query: 307 FYSQETGGTIVSSALKLMDEVVKERYNPAQWNI--YAAQASDGDNWADDSPLCHEILAKK 364 + G T ALK ++ R + + SDG + Sbjct: 66 NQLKAGGDTKFLPALKHASGLLDSRPAGDKQYRPGFIFLLSDGQDNGV---------LDD 116 Query: 365 LLPVVRY--YSYIEITRRAHQTLWREYEHLQSTFDNFAMQHIRDQDDIYPVF 414 L VRY +++ R + + + + VF Sbjct: 117 KLGGVRYPAHTFGMCQSRCNPKSMVHIATATKGSYHPIDDKLSNVAQALAVF 168 >UniRef50_C8W3F9 von Willebrand factor type A n=1 Tax=Desulfotomaculum acetoxidans DSM 771 RepID=C8W3F9_DESAS Length = 219 Score = 65.6 bits (158), Expect = 3e-09, Method: Composition-based stats. Identities = 28/200 (14%), Positives = 63/200 (31%), Gaps = 35/200 (17%) Query: 239 YEKRPDPSSQAVMFCLMDVSGSMDQSTKDMAKRFYILLYLFLSRTYKNVEVVYI------ 292 + + S + ++ L+D SGSM + K+ + L + + +E YI Sbjct: 2 FNEVEGLSRRLPVYLLLDRSGSMFGEPIEAVKQGVKYMISELKKEPQAIETAYISVITFG 61 Query: 293 ---RHHTQAKEVDEHEFFYSQETGGTIVSSALKLMDEVVKERYN----------PAQWNI 339 R Q E+ + + G T + +AL ++ ++ + Sbjct: 62 SDARQDVQLTELAAFKEPQIEANGTTSLGAALH----ILNNCFDNEVRKSTPTQKGDYKP 117 Query: 340 YAAQASDGDNWADDSPLCHEILAKKLLPVVRYYSYIEITRRAHQTLWREYEHLQSTFDNF 399 +DG+ DD + +K V + + ++ + Sbjct: 118 LVFIMTDGEPT-DDWENAAREIKQKSGKVANIVAVG-CGPDVNTDTLKKITDI------V 169 Query: 400 AMQHIRDQDDIYPVFRELFH 419 + +D F++ F Sbjct: 170 LLMSSYQPED----FKQFFR 185 >UniRef50_C0QP91 Putative von Willebrand factor type A domain protein n=1 Tax=Persephonella marina EX-H1 RepID=C0QP91_PERMH Length = 304 Score = 65.6 bits (158), Expect = 3e-09, Method: Composition-based stats. Identities = 23/183 (12%), Positives = 45/183 (24%), Gaps = 29/183 (15%) Query: 250 VMFCLMDVSGSMDQSTKDMAKRFYILLYLFLSRTYKNVEVVYIRHH---------TQAKE 300 + +DVS SM + K K +L FL + + + + T + Sbjct: 85 NIIIALDVSNSMKEKNK--LKISKEILRDFLLKRDEEDRIGILVFDNLPFRLMPLTSDRG 142 Query: 301 VDEHEFFYSQET----GGTIVSSALKLMDEVVKERYNPAQWNIYAAQASD-GDNWADDSP 355 + GGT + L + + + N +D GD + + Sbjct: 143 ALLRVISIIRPAMVDVGGTAMYDGLVEALNM----FMKDRRNKIIILLTDGGDINSKYTL 198 Query: 356 LCHEILAKKLLPVVRYYSYIEITRRAHQTLWREYEHLQSTFDNFAMQHIRDQDDIYPVFR 415 + + + + F D R Sbjct: 199 EDVVRFNQDIGAKIYTIGVSSGMNFY---VLERLSEATGGKAFFVT------KDYQKALR 249 Query: 416 ELF 418 +F Sbjct: 250 SVF 252 >UniRef50_UPI00016DFBC7 UPI00016DFBC7 related cluster n=4 Tax=Takifugu rubripes RepID=UPI00016DFBC7 Length = 883 Score = 65.6 bits (158), Expect = 4e-09, Method: Composition-based stats. Identities = 18/215 (8%), Positives = 51/215 (23%), Gaps = 36/215 (16%) Query: 234 LRYKNYEKRPDPSSQAVMFCLMDVSGSMDQSTKDMAKRFYILLYLFLSRTYKNVEVVYIR 293 + + + + ++D SGSM + + I + L + + Sbjct: 238 VHFFAPKDLTRLPKN--VVFVIDRSGSMSGTKMQQIQEAMIKILEDLHPEDHFGIIQFDS 295 Query: 294 HHTQAKE----VDEHEFF-----------YSQETGGTIVSSALKLMDEVV-----KERYN 333 + E Q T +++A+ +++ +R Sbjct: 296 SVDSWRNSLSLATEENISEAMAYVNQISHKIQA---TNINAAVLKAVDMLVTDREAKRLP 352 Query: 334 PAQWNIYAAQASDGDNWADDSPLCHEILAKKLLPVV----RYYSYIEITRRAHQTLWREY 389 ++ +DGD D ++ + + + Y Sbjct: 353 EKSIDMIIL-LTDGDPTTDIGETRIPVIQENVRNAIGGNMSLYGLGFGN-DVDYGFLDVM 410 Query: 390 EHLQSTFDNFAMQHIRDQDDIYPVFRELFHKQNAT 424 + F+ + ++ Sbjct: 411 SRENKGLARRIYTGADAALQLQG-----FYDEVSS 440 >UniRef50_A6FXN3 Putative uncharacterized protein n=1 Tax=Plesiocystis pacifica SIR-1 RepID=A6FXN3_9DELT Length = 416 Score = 65.6 bits (158), Expect = 4e-09, Method: Composition-based stats. Identities = 25/198 (12%), Positives = 45/198 (22%), Gaps = 38/198 (19%) Query: 251 MFCLMDVSGSMDQSTKDMAKRFYILLYLFLSRTYKNVEVVY-----IRHHTQA-----KE 300 M ++D S SM + AK + L L+ VV+ + + + Sbjct: 1 MVLVVDTSASMKGDAIEGAKAAAMELVDGLAEGDSFALVVFHSRAEVLMPSTVINEDSRA 60 Query: 301 VDEHEFFYSQETGGTIVSSALKLMDEVVK----------------ERYNPAQWNIYAAQA 344 + Q G T ++ L+ ++ +P Sbjct: 61 AARSKIETMQAWGTTDLAGGLQQALAQLQVAQNIVGAGGSTGAQSGAPDPTVLERVVLLG 120 Query: 345 SDGDNWADDSPLCHEILAKKLLPVVRYYSYIEITRRAHQTLWREYEHLQSTFDNFAMQHI 404 DG + + Y +TL F Sbjct: 121 -DGVPNDASTIPSTVGQLAARGTQITALGYGI---EYDETLLASLAEQTHGSFRFV---- 172 Query: 405 RDQDDIYPVFRELFHKQN 422 D LF + Sbjct: 173 ----DDPEAVASLFRDEV 186 >UniRef50_B6BJ58 Phage/colicin/tellurite resistance cluster TerY protein n=1 Tax=Campylobacterales bacterium GD 1 RepID=B6BJ58_9PROT Length = 229 Score = 65.6 bits (158), Expect = 4e-09, Method: Composition-based stats. Identities = 21/130 (16%), Positives = 45/130 (34%), Gaps = 13/130 (10%) Query: 236 YKNYEKRPDPSSQAVMFCLMDVSGSMDQSTKDMAKRFYILLYLFLSRT------YKNVEV 289 + + + + L+DVS SM D + + + K + Sbjct: 3 FNPADFVVEEPKSIPVVLLLDVSYSMQGENIDTLNKAVESMLNSFKKAETMETFIKLSII 62 Query: 290 VYIRH-----HTQAKEVDEHEFFYSQETGGTIVSSALKLMDEVVKER--YNPAQWNIYAA 342 + HT EV + +F +G T + +A K+ +++++ + + Sbjct: 63 TFGSENGVDLHTPLTEVSKIDFKPLTVSGSTPMGAAFKMGKAMIEDKDIFKGRDYRPTIV 122 Query: 343 QASDGDNWAD 352 SDG+ D Sbjct: 123 LLSDGEPNDD 132 >UniRef50_B2HDT6 Putative uncharacterized protein n=3 Tax=Mycobacterium RepID=B2HDT6_MYCMM Length = 772 Score = 65.6 bits (158), Expect = 4e-09, Method: Composition-based stats. Identities = 28/200 (14%), Positives = 56/200 (28%), Gaps = 20/200 (10%) Query: 239 YEKRPDPSSQAVMFCLMDVSGSMDQSTKDMAKRFYILLYLFLSRTYKNVEVVY------- 291 S+ + ++D SGSM A+R + L + + + Sbjct: 298 VPPAEPSSAPRDVVVVLDRSGSMGGWKMVAARRAAGRIVDMLDAGDRFCVLAFDDRIETP 357 Query: 292 -IRHHTQAKEVDEHEFF------YSQETGGTIVSSALKLMDEVVKERYNPAQWNIYAAQA 344 D + F + GGT+++ L E++ + + Sbjct: 358 PAMPDGLVPASDRNRFAASSWLGSLRSRGGTVMAQPLTNAVEMLAD-SGEDRQASVVL-V 415 Query: 345 SDGDNWADDSPLCHEILAKKLLPVVRYYSYIEITRRAHQTLWREYEHLQSTFDNFAMQHI 404 +DG +D LA + R Y + R + L S Sbjct: 416 TDGQISGED--HLLRSLAPVVGRT-RIYCVG-VDRAVNAGFLERLAGLGSGRAELVESED 471 Query: 405 RDQDDIYPVFRELFHKQNAT 424 R + + + R + + Sbjct: 472 RLDEVMARLARTIGRPALTS 491 >UniRef50_C8NPK0 Magnesium chelatase n=4 Tax=Corynebacterium RepID=C8NPK0_COREF Length = 248 Score = 65.6 bits (158), Expect = 4e-09, Method: Composition-based stats. Identities = 24/169 (14%), Positives = 54/169 (31%), Gaps = 14/169 (8%) Query: 210 ERLRKEIAELRAKIERVPFIDTFDLRYKNYEKRPDPSSQAVMFCLMDVSGSMDQSTKDMA 269 L A ++ + DLR R ++ ++D SGSM ++ A Sbjct: 35 GTLMAAADRGAAVVDGMVDFRPEDLRGSLRRGREAN----LIVFVVDTSGSMAARSRVRA 90 Query: 270 KR--FYILLYLFLSRTYKNVEV-------VYIRHHTQAKEVDEHEFFYSQETGGTIVSSA 320 +L R K + + T + ++ + G T ++ Sbjct: 91 VTGAIMSMLTDAYQRRDKVAVIAVNGNKPTLVLAPTSSVDMAQKSLDAMPMGGRTPLAEG 150 Query: 321 LKLMDEVVK-ERYNPAQWNIYAAQASDGDNWADDSPLCHEILAKKLLPV 368 L + ++++ E +DG++ +D A+ ++ Sbjct: 151 LIMARDLMEREHRKEPSRRPLLVVMTDGEDTSDAGETGIATAARAVVKS 199 >UniRef50_Q54DV3 von Willebrand factor A domain-containing protein DDB_G0292016 n=1 Tax=Dictyostelium discoideum RepID=Y2016_DICDI Length = 918 Score = 65.6 bits (158), Expect = 4e-09, Method: Composition-based stats. Identities = 25/188 (13%), Positives = 57/188 (30%), Gaps = 21/188 (11%) Query: 236 YKNYEKRPDPSSQAVMFCLMDVSGSMDQSTKDMAKRFYILLYLFLSRTYKNVEVVYIR-- 293 +KN ++ L+D SGSM + + A+R ++ L+ +K + Sbjct: 286 FKNVNPDEV-YQKSEFIFLIDCSGSMSGQSINKARRAMEIIIRSLNEQHKVNIYCFGSSF 344 Query: 294 ---------HHTQAKEVDEHEFFYSQET-GGTIVSSALKLMDEVVKERYNPAQWNIYAAQ 343 ++ + E+ GGT + + +++ +P ++ Sbjct: 345 NKVFDKSRVYNDETLEIAGSFVEKISANLGGTELLPPM---VDILSSPNDP-EYPRQVFI 400 Query: 344 ASDGDNWADDSPLCHEILAKKLLPVVRYYSYIEITRRAHQTLWREYEHLQSTFDNFAMQH 403 +DG+ K R ++Y I Q L + + Sbjct: 401 LTDGE---ISERDKLIDYVAKEANTTRIFTYG-IGASVDQELVIGLSKACKGYYEMIKET 456 Query: 404 IRDQDDIY 411 + + Sbjct: 457 TNMEKQVM 464 >UniRef50_Q6SGW6 Magnesium-chelatase, 60 kDa subunit n=1 Tax=uncultured marine bacterium 443 RepID=Q6SGW6_9BACT Length = 593 Score = 65.2 bits (157), Expect = 4e-09, Method: Composition-based stats. Identities = 41/198 (20%), Positives = 60/198 (30%), Gaps = 20/198 (10%) Query: 185 RRELHALEENLAIISNSEPAQLLEEERLRKEIAELRAKIERVP--FIDTFDLRYKNYEKR 242 R E PA L + A + +ID D+R K +E+R Sbjct: 348 RPIGVRRGELKRGHRIDIPATLRAAAPFQAVRARWDTSNHQQASLYIDPQDIRVKRFEQR 407 Query: 243 PDPSSQAVMFCLMDVSGSMDQSTKDMAKRFYILLYLFLSRTYKNVEVVYIRH-------- 294 +VM +D SGS AK LL EV I Sbjct: 408 K----SSVMIFAVDASGSSAHQRMAEAKGAIELLLADCYSH--RTEVALISFKGESADLL 461 Query: 295 --HTQAKEVDEHEFFYSQETGGTIVSSALKLMDEVVKERYNPAQWNIYAAQASDGD-NWA 351 T++ + GGT +++AL+ + +V Y +DG N A Sbjct: 462 LPPTRSLVRAKRTLAQLPGGGGTPMAAALQTIYDVATTVEAMGATPTYVL-LTDGASNVA 520 Query: 352 DDSPLCHEILAKKLLPVV 369 D + L V Sbjct: 521 RDGTKSRTAGTEDALRVA 538 >UniRef50_A0BWF5 Chromosome undetermined scaffold_132, whole genome shotgun sequence n=2 Tax=Paramecium tetraurelia RepID=A0BWF5_PARTE Length = 1574 Score = 65.2 bits (157), Expect = 4e-09, Method: Composition-based stats. Identities = 21/178 (11%), Positives = 53/178 (29%), Gaps = 11/178 (6%) Query: 224 ERVPFIDTFDLRYKNYEKRPDPSSQAVMFCLMDVSGSMDQSTKDMAKRFYILLYLFLSRT 283 + + ++ + S ++D SGSM D AK + L Sbjct: 1367 QHLKKSLNHEIIFSQKSDFQISISNIHYILILDDSGSMQGVNWDNAKNGALHCIKSLENV 1426 Query: 284 Y--KNVEVVYIRHHTQAKEVDEHEFFYSQE-----TGGTIVSSALKLMDEVVKERYNPAQ 336 K +++ E + + G T L +++ + Sbjct: 1427 DCAKVSVIIFNGDARIVVECQKPNYIQMSSCISYKGGNTAFDPPFNLALQLIVKY--KGF 1484 Query: 337 WNIYAAQASDGD-NWADDSPLCHEILAKKLLPVVRYY-SYIEITRRAHQTLWREYEHL 392 I +DG+ + + L ++ ++ E + + Q + ++++ Sbjct: 1485 NKIQILFYTDGEAGYPQTTIDKFCQLPPQIRSLINLIACSGEKSSHSLQLMIQKFQQN 1542 >UniRef50_B7FTA2 Predicted protein n=3 Tax=Bacillariophyta RepID=B7FTA2_PHATR Length = 800 Score = 65.2 bits (157), Expect = 4e-09, Method: Composition-based stats. Identities = 40/235 (17%), Positives = 78/235 (33%), Gaps = 31/235 (13%) Query: 122 DEYLDLLFEDLALPNLKQNQQRQLTEYKTHRAGYTANGVPANISVVRSLQNSLARRTAMT 181 + + +F+ A P + + + ++ + G R L S R + Sbjct: 486 EVPQEFMFDIDATP-MDPDLIDFTSRERSGKGG------------GRGLIFSQDRGRYIK 532 Query: 182 AGKRRELHALEENLAIISNSEPAQLLEEERLRKEIAELRAKIERVPFIDTFDLRYKNYEK 241 + A + S P Q ER A +K R I D+R K + Sbjct: 533 PMLPKGKVIRLAVDATLRASAPYQKSRRER-----AVGTSKEGRGVHIQQSDVRIKKMAR 587 Query: 242 RPDPSSQAVMFCLMDVSGSMDQSTKDMAK-RFYILLYLFLSRTYKNVEVVYIRH------ 294 + + +++ ++D SGSM + + AK LL K + + Sbjct: 588 K----AGSLIIFVVDASGSMALNRMNAAKGAAVSLLTEAYQSRDKISLIPFQGEMADVLL 643 Query: 295 -HTQAKEVDEHEFFYSQETGGTIVSSALKLM-DEVVKERYNPAQWNIYAAQASDG 347 T++ + GG+ ++ AL+L + + + + SDG Sbjct: 644 PPTKSITMARQRLEQMPCGGGSPLAHALQLATLTGINAQKSGDVGKVVVVLISDG 698 >UniRef50_C0AYK3 Putative uncharacterized protein n=1 Tax=Proteus penneri ATCC 35198 RepID=C0AYK3_9ENTR Length = 227 Score = 65.2 bits (157), Expect = 4e-09, Method: Composition-based stats. Identities = 23/173 (13%), Positives = 50/173 (28%), Gaps = 19/173 (10%) Query: 238 NYEKRPDPSSQAVMFCLMDVSGSMDQSTKDMAKRFYILLYLFLSRT------YKNVEVVY 291 + + + + ++D SGSM LL L + + + Y Sbjct: 9 DVALVDNSEQRTPLILVLDSSGSMYGQPIQQLNEGLKLLEQELKNDVIAAKRVRILVIEY 68 Query: 292 IRHH-----TQAKEVDEHEFFYSQETGGTIVSSALKLMDEVVK---ERYNPAQW---NIY 340 + K+ + + G T + A+ L E ++ +R+ A + Sbjct: 69 GGYDQCTIHGDWKDAMDFTAPVLEANGTTPMGQAITLALEEIEAEKQRFKQAGVAYTRPW 128 Query: 341 AAQASDGDNWADDSPLCHEILAKKLLPVVRYYSYIEITRRAHQTLWREYEHLQ 393 SDG D L ++ + + + A + + Sbjct: 129 LFLMSDG--VPTDQWEQAAQLCRQAEESQKTAVFPIMVDGASAEVMGSFSRNG 179 >UniRef50_O50313 Magnesium-chelatase 67 kDa subunit n=15 Tax=Bacteria RepID=BCHD_CHLP8 Length = 619 Score = 65.2 bits (157), Expect = 4e-09, Method: Composition-based stats. Identities = 32/250 (12%), Positives = 74/250 (29%), Gaps = 29/250 (11%) Query: 106 SQDGEGQDEFVFQISKDEYLDLLFEDLALPNLKQNQQRQLTEYKTHRAGYTANGVPANIS 165 + + + + + D +L+ + + ++ +A + G N Sbjct: 308 PDETDSDADEEQEETPDMIEELMMDAIETDL--PENILNISLASKKKAKSGSRGEALNNK 365 Query: 166 VVRSLQNSLARRTAMTAGKRRELHALEENLAIISNSEPAQLLEEERLRKEIAELRAKIER 225 R +++ K ++ + ++ + + ++ K A + + + Sbjct: 366 RGRFVRS------QPGEIKSGKVALIPTLISAAPWQAARKAEKAKKGIKTGALVISTDDV 419 Query: 226 VPFIDTFDLRYKNYEKRPDPSSQAVMFCLMDVSGSMDQSTKDMAKRF-YILLYLFLSRTY 284 R+++ S + ++D SGSM + AK LL Sbjct: 420 KIK------RFRD-------KSGTLFIFMVDASGSMALNRMRQAKGAVASLLQNAYVHRD 466 Query: 285 KNVEVVYIRHHTQAKEVDEHEFFY-------SQETGGTIVSSALKLMDEVVKERYNPAQW 337 + + + Q GGT ++SAL E K+ Sbjct: 467 QVSLISFRGKQAQVLLPPSQSVDRAKRELDVLPTGGGTPLASALLTGWETAKQARTKGIT 526 Query: 338 NIYAAQASDG 347 I +DG Sbjct: 527 QIMFVMITDG 536 >UniRef50_B9ML47 YD repeat protein n=1 Tax=Anaerocellum thermophilum DSM 6725 RepID=B9ML47_ANATD Length = 3027 Score = 65.2 bits (157), Expect = 5e-09, Method: Composition-based stats. Identities = 26/164 (15%), Positives = 48/164 (29%), Gaps = 18/164 (10%) Query: 247 SQAVMFCLMDVSGSMDQST-----KDMAKRFYI---LLYLFLSRTYKNVEVVYIRHHTQA 298 S+ + ++D SGSM + + K+F L + + V + T Sbjct: 765 SKVDIVFVLDNSGSMSSNDPNYYRIEATKKFIQNIDELNNRVGLVDFDSSVSVRSNLTSD 824 Query: 299 KEVDEHEFFYSQ-ETGGTIVSSALKLMDEVVKERYNPAQWNIYAAQASDG-DNWADDSPL 356 K + G T + LK + ++ Q SDG N Sbjct: 825 KSKLLQALNAMRWTGGSTNIGGGLKAALGL----FDQEQSKKIIVLLSDGYHNTGIHPND 880 Query: 357 CHEILAKKLLPVVRYYSYIEITRRAHQTLWREYEHLQSTFDNFA 400 L K+ + VV + + + L + + Sbjct: 881 VLPELIKQEI-VVNTIALGK---DCDRELLHDIADKTKGGYFYV 920 >UniRef50_UPI000180C2AF PREDICTED: similar to FiBrilliN homolog family member (fbn-1) n=1 Tax=Ciona intestinalis RepID=UPI000180C2AF Length = 990 Score = 65.2 bits (157), Expect = 5e-09, Method: Composition-based stats. Identities = 28/182 (15%), Positives = 64/182 (35%), Gaps = 22/182 (12%) Query: 245 PSSQAVMFCLMDVSGSMDQSTKDMAKRFYILLYLFLSRTYKNVEVVYIRHHTQA------ 298 P ++ + +MD SGS+ + K+F +Y + + + + + H+ Sbjct: 808 PKARMDLIFIMDSSGSIGEENFKTMKQFVKNVYERFTLSDEFTRIAVVTFHSVVQLANDT 867 Query: 299 -----KEVDEHEFFYSQETG-GTIVSSALKLMDE-VVKERYNPAQWNIYAAQASDGDNWA 351 K ++ Q G GT+ AL E ++ +R N+ A +DG+ + Sbjct: 868 EWFYSKTELDNAIDSLQFAGKGTLTGQALTFTREHLIGKR--EGSTNVVIA-VTDGN--S 922 Query: 352 DDSPLCHEILAKKLLPVVRYYSYIEITRRAHQTLWREYEHLQSTFDNFAMQHIRDQDDIY 411 D+ + + V + ++ + ++ + D +D+ Sbjct: 923 KDNSKAAAAELRNM--NVHVMAVGITGSHLRDLSM--IASKPASENVLSLSQVEDINDVI 978 Query: 412 PV 413 V Sbjct: 979 DV 980 >UniRef50_UPI0000E47594 PREDICTED: similar to inter-alpha (globulin) inhibitor H3 variant n=1 Tax=Strongylocentrotus purpuratus RepID=UPI0000E47594 Length = 902 Score = 65.2 bits (157), Expect = 5e-09, Method: Composition-based stats. Identities = 24/205 (11%), Positives = 55/205 (26%), Gaps = 24/205 (11%) Query: 239 YEKRPDPSSQAVMFCLMDVSGSMDQSTKDMAKRFYILLYLFLSRTYKNVEVVYI------ 292 + P + + ++DVSGSM K + + T K + + Sbjct: 335 FSPSGLPVLRKNVIFIIDVSGSMAGVKLRQVKDALTTILNDMPETDKFNIIPFSDDVNFL 394 Query: 293 -----RHHTQAKEVDEHEF-FYSQETGGTIVSSALKLMDEVVKERYNPA---QWNIY--A 341 T + F QE T + A+ ++++ + N+ Sbjct: 395 DRNKMLFSTSSNVRRAKRFVKSLQERDNTNLHKAIIAGVRMLRDESDQNVRPDENVVSML 454 Query: 342 AQASDGD-NWADDSPLCHEILAKK-LLPVVRYYSYIEITRRAHQTLWREYEHLQSTFDNF 399 SDG+ N + E ++ + ++ + Sbjct: 455 IVLSDGNPNHGEIDKEIIERNVEEAIRGDFSLFNLGFGE-DLDFPFLERMAYQNHGVAR- 512 Query: 400 AMQHIRDQDDIYPVFRELFHKQNAT 424 I ++ D + + + Sbjct: 513 ---QIPERADAGKLLENFYFEVATP 534 >UniRef50_Q0FYU3 Von Willebrand factor, type A n=1 Tax=Fulvimarina pelagi HTCC2506 RepID=Q0FYU3_9RHIZ Length = 584 Score = 65.2 bits (157), Expect = 5e-09, Method: Composition-based stats. Identities = 42/334 (12%), Positives = 96/334 (28%), Gaps = 32/334 (9%) Query: 48 SGESVSIPTEDISEPMFHQGRGGLRHRVHPGNDHFVQNDRIERPQGGGGGSGSGQGQASQ 107 + E V + + P Q P + D + + S Sbjct: 218 ADEDVLLACRLVLAPRARQAPSFDEQ---PPEEDSESEDHPDASNDDTPPQDHDEPDQSD 274 Query: 108 DGEGQDEFVFQISKDEYLDLLFEDLALPNLKQNQQRQLTEYKTHRAGYTANGVPANISVV 167 + + + D+ ED+A+P + + G Sbjct: 275 SEKSETNQEASSQSGQDTDVQSEDVAIPLDILKRLAAMVRNHHTAGKTARKGA------- 327 Query: 168 RSLQNSLARRTAMTAGKRRELHALEENLAIISNSEPAQLLEEERLRKEIAELRAKIERVP 227 + ++ AR + + H+ + A ++ + P Q +R + R Sbjct: 328 -TARSGGARGRPLGSRAGDPRHSSLDISATLTAALPWQRFRRQRFPRLTD-------RPV 379 Query: 228 FIDTFDLRYKNYEKRPDPSSQAVMFCLMDVSGSMDQSTKDMAKRFY-ILLYLFLSRTYKN 286 + D+R + + + + ++ ++D SGS + AK +L R + Sbjct: 380 ILTPSDIRIRRLQAKRETAT----IFVVDASGSAALARLAEAKGAIERILAECYRRRDRV 435 Query: 287 VEVVYIRHH-------TQAKEVDEHEFFYSQETGGTIVSSALKLMDEVVKERYNPAQWNI 339 V + T++ GGT ++S + L ++ ++ + + Sbjct: 436 AMVAFRGTSAETLLPETKSLTRARRALAGLSAGGGTPLASGIALAGDLARQCERQERTPL 495 Query: 340 YAAQASDG-DNWADDSPLCHEILAKKLLPVVRYY 372 +DG N D + + Sbjct: 496 IVF-LTDGKANITLDGMAGRSSAREDVNTQATVL 528 >UniRef50_D1A557 Vault protein inter-alpha-trypsin domain protein n=2 Tax=Streptosporangineae RepID=D1A557_THECD Length = 795 Score = 65.2 bits (157), Expect = 5e-09, Method: Composition-based stats. Identities = 50/414 (12%), Positives = 105/414 (25%), Gaps = 52/414 (12%) Query: 15 SMVNRQRFLRRYKAQIKQSISEAINKRSVTDVDSGESVSIPTEDISEPMFHQGRGGLRHR 74 ++ R R Y+ + A+ + DV + + +IP + + Sbjct: 91 ALKERGRARADYQEAVAAGHRAALAEEDRPDVFTMQVGNIPPGERVTVRLTLDQPLPYED 150 Query: 75 VHPGNDHFVQNDRIERPQGGGGGSGSGQGQASQDGEGQDEFVFQISKDEYLDLLFEDLAL 134 + P +G G A+ D IS L + + L Sbjct: 151 GAATFRFPLVVAPRYIPGTALPDERAGDGIAADTDAVPDASR--ISPPVLLPGFPDPVRL 208 Query: 135 PNLKQNQQRQLTEYKTHRAGYTANGVPANISVVRSLQNSLARRTAMTAGKRRELHALEEN 194 E AG+ + +++ VV + R T + + Sbjct: 209 ----------SLEADIDPAGFPLGEIRSSLHVVAADTRGSGR---TTVRLQPGERLDRDF 255 Query: 195 LAIISNSEPAQLLEEERLRKEIAELRAKIERVPFIDTFDLRYKNYEKRPDPSSQAVMFCL 254 + ++ P Q L + + L Sbjct: 256 VLRLAYGRPEQAAASVTLTPDAEGESGTFTLTV----------LPPSERCAPRPRDVVIL 305 Query: 255 MDVSGSMDQSTKDMAKRFYILLYLFLSRTYKNVEVVY-------IRHHTQAKEVDEHEFF 307 +D SGSM A+R + L+ + + + + F Sbjct: 306 LDRSGSMHGWKMVAARRAAARIVDTLTGRDRFAVLSFDDMVERPAGLDGGLSPATDRNRF 365 Query: 308 Y-------SQETGGTIVSSALKLMDEVVKERYNPAQWNIYAAQASDGDNWADDSPLCHEI 360 Q GGT +++ L+ ++ + A + +DG + Sbjct: 366 RAVEHLAGLQARGGTELAAPLREGAALL----DDAGRDRVLVLITDGQ---VGNEDQLLA 418 Query: 361 LAKKLLPVVRYYSYIEITRRAHQTLWREYEHLQSTFDNFAMQHIRDQDDIYPVF 414 L L +R ++ I + + + + +D Sbjct: 419 LIDPFLNGLRIHAVG-IDQAVNAGFLGRLATAGQGR-----LELVESEDRLDEA 466 >UniRef50_Q19NM5 TerY1 n=54 Tax=root RepID=Q19NM5_ECOK1 Length = 239 Score = 64.8 bits (156), Expect = 6e-09, Method: Composition-based stats. Identities = 26/128 (20%), Positives = 43/128 (33%), Gaps = 18/128 (14%) Query: 241 KRPDPSSQAVMFCLMDVSGSMDQSTKDMAKRFYILLYLFLSRT---YKNVEVVYIRHHTQ 297 K+ + ++ L+D SGSM + K L L + + V I + Sbjct: 23 KKELHLRRLPVYLLLDTSGSMHGEPIEAVKNGVQTLLTTLKQDPYALETAYVSVITFDSS 82 Query: 298 AKE-VDEHEFFY-----SQETGGTIVSSALKL-MDEVVKE-----RYNPAQWNIYAAQAS 345 A++ V + +G T + AL L + KE W + Sbjct: 83 ARQAVPLTDLLSFQMPALTASGTTSLGEALSLTASSIAKEVQKTTADTKGDWRPLVFLMT 142 Query: 346 DG---DNW 350 DG D+W Sbjct: 143 DGSPNDDW 150 >UniRef50_B0VJ57 Putative uncharacterized protein n=1 Tax=Candidatus Cloacamonas acidaminovorans RepID=B0VJ57_9BACT Length = 331 Score = 64.8 bits (156), Expect = 6e-09, Method: Composition-based stats. Identities = 30/214 (14%), Positives = 65/214 (30%), Gaps = 36/214 (16%) Query: 238 NYEKRPDPSSQAVMFCLMDVSGSMDQSTKDMAK--RFYILLYLFLSR--TYKNVEVVYIR 293 +YE + SS + +DVS SMD + ++ R + + FL + T + + + Sbjct: 78 DYENKELQSSGMDIIFALDVSKSMDATDMMPSRLLRAILQIGSFLEQVKTDRIGIIAFAG 137 Query: 294 HHT------QAKEVDEHEFFYSQETG----GTIVSSALKLMDEVVKERYNPAQWNIYAAQ 343 T E GT + SAL+L + + + Sbjct: 138 TATLQCPLTDDYEAVRIVLNGLNSNTVEIPGTDIGSALRLA----ENAFPEGSKSKTLVL 193 Query: 344 ASDGDNWADDSPLCHEILAKKLLPVVRYYSYI--------------EITRRAHQTLWREY 389 SDG++ + + K V E+ + + +E Sbjct: 194 ISDGEDLQHSALREA-RILKTKGIRVYTMGVGSPEGTIIRHPETGEEVKSKLDEATLQEI 252 Query: 390 EHLQSTFDNFAMQHIRDQDDIYPVFRELFHKQNA 423 + ++I + + ++ ++ Sbjct: 253 ARITEGEYYRVTP---GGEEIQLILKRIYESEST 283 >UniRef50_Q22HH7 von Willebrand factor type A domain containing protein n=3 Tax=Tetrahymena thermophila SB210 RepID=Q22HH7_TETTH Length = 796 Score = 64.8 bits (156), Expect = 6e-09, Method: Composition-based stats. Identities = 32/208 (15%), Positives = 61/208 (29%), Gaps = 22/208 (10%) Query: 224 ERVPFIDTFDLRYKNYEKRPDPSSQAVMFCLMDVSGSMDQSTKDMAKRFYILLYLFLSRT 283 VP I + + +A L+D SGSM S + AK+ I L Sbjct: 255 NFVPNIAQVKKQLFRQAQNEIELMKAEFLLLIDRSGSMVGSNIETAKQALIFFLKSLPEG 314 Query: 284 YKNVEVVY-----IRHHTQAKEVDEH------EFFYSQET-GGTIVSSALK-LMDEVVKE 330 + + + + + D++ + Q GGT +S ALK LM + + Sbjct: 315 SIYNIISFGTNYTVMYPQSVQVNDQNLQDSIDKIEKFQANMGGTNISQALKYLMYNLQDQ 374 Query: 331 RYNPAQWNIYAAQASDGDNWADDSPLCHEILAKKLLPVVRYYSYIEITRRAHQTLWREYE 390 +DG+ + D P + K + + Sbjct: 375 Y----GLRKKIYIITDGE-FQDYQPALEIVKKNKFKCDINALCIG----SYEFLYATQIL 425 Query: 391 HLQSTFDNFAMQHIRDQDDIYPVFRELF 418 + + + + ++ F Sbjct: 426 NETGGNFQKVTDTSQIISQVIQLLKDSF 453 >UniRef50_A9AX98 von Willebrand factor type A n=1 Tax=Herpetosiphon aurantiacus ATCC 23779 RepID=A9AX98_HERA2 Length = 828 Score = 64.8 bits (156), Expect = 6e-09, Method: Composition-based stats. Identities = 37/298 (12%), Positives = 81/298 (27%), Gaps = 34/298 (11%) Query: 135 PNLKQNQQRQLTEYKTHRAGYTANGVPANISVVRSLQNSLARRTAMTAGKRRELHALEEN 194 P+ N + L +P ++S +R + + + + + AL E Sbjct: 280 PDSSANLRDALEAANLVTEALRPAALPTSLSQLRVYDSIVLQDISANDLSLDQQLALREF 339 Query: 195 LAIISNSEPAQLLEEERLRKEIAELRAKIERVPFIDTFDLRYKNYEKRPDPSSQAVMFCL 254 + + + A + ++ +R P+ + + Sbjct: 340 VRSLGHGVVVLGGTNSYNLGSYAGTPLEELLPVSMEP-------PPRRERPT--VTLLLI 390 Query: 255 MDVSGSM---DQSTKD-MAKRFYILLYLFLSRTYKNVEVVYIRHHTQAKEVD-------- 302 +D S SM K +AK I L + + T V Sbjct: 391 LDRSASMLGESGKDKFSLAKAAAIAATDSLGADDTIGVLAF--DDTNDWTVTFTKVGQGV 448 Query: 303 -----EHEFFYSQETGGTIVSSALKLMDEVVKERYNPAQWNIYAAQASDGDNWADDSPLC 357 ++ GGT + +AL++ + ++ +A +DG + + S Sbjct: 449 QLSEIQNNIAGLSAGGGTDIYAALEVGMGGLAQQTGKV---RHAVLLTDGRSGGESSYES 505 Query: 358 HEILAKKLLPVVRYYSYIEITRRAHQTLWREYEHLQSTFDNFAMQHIRDQDDIYPVFR 415 + + + A L L + +FA + Sbjct: 506 LIAPLRAQGITLSTIAIG---GDADTVLLESLAKLGAGRYHFASRPDDLPRLTLQEAE 560 >UniRef50_A6R161 Predicted protein n=3 Tax=Onygenales RepID=A6R161_AJECN Length = 759 Score = 64.8 bits (156), Expect = 6e-09, Method: Composition-based stats. Identities = 27/260 (10%), Positives = 60/260 (23%), Gaps = 44/260 (16%) Query: 184 KRRELHALEENLAIISNSEPAQLLEEERLRKEIAELRAK------IERVPFIDTFDLRYK 237 + + E++ II + P + + + Sbjct: 3 PVQPAASSEDDFEIIDDQIPIRPRLSTSTTIAGERSPNEVGVQLHPLPDTNSMILSVHPP 62 Query: 238 NYEKRPDPSSQAVMFCLMDVSGSMDQST------------------KDMAKRFYILLYLF 279 + ++ P + +DVS SM S D+ K + Sbjct: 63 LHPEKEMPHVPCDIVLCIDVSYSMQSSAPLPTTDESGEREETGLSVLDLTKHAARTIIET 122 Query: 280 LSRTYKNVEVVYIRHHTQAKEVDE----------HEFFYSQETGGTIVSSALKLMDEVVK 329 L+ + V + E+ + + T + LKL + + Sbjct: 123 LNENDRLGIVAFSTEAEVVYEISKMNESSKKAALKAVEALKPLSSTNLWHGLKLGLKAFE 182 Query: 330 ERYNPAQWNIYAAQASDGDNWA-------DDSPLCHEILAKKLLPVVRYYSYIEITRRAH 382 + Q +DG L +P++ + + Sbjct: 183 NERHTPQSVQALYVLTDGMPNHMCPKQGYVTKLRPILQLLGHRMPMIHTFGFGY---NIR 239 Query: 383 QTLWREYEHLQSTFDNFAMQ 402 L + + F Sbjct: 240 SGLLQAIAEVGGGTFAFIPD 259 >UniRef50_C9ZGQ6 Putative membrane protein n=3 Tax=Streptomyces RepID=C9ZGQ6_STRSW Length = 534 Score = 64.8 bits (156), Expect = 7e-09, Method: Composition-based stats. Identities = 36/274 (13%), Positives = 74/274 (27%), Gaps = 31/274 (11%) Query: 140 NQQRQLTEYKTHRAGYTANGVPANISVVRSLQNSLARRTAMTAGKRRELHALEENLAIIS 199 + LT + TA+ ++++ + RR + E Sbjct: 253 KSRPDLTVIRPRDGVVTADYPLSSLASTGTDVRDDVRRLTDALRTPDVQRLITERTLR-- 310 Query: 200 NSEPAQLLEEERLRKEIAELRAKIERVPFIDTFDLRY-KNYEKRPDPSSQAVMFCLMDVS 258 ++ + R + P + + YE S+ ++D S Sbjct: 311 ----RPVVASVPPAAGLDTTRRRELPFPGSRSVAVGLLDAYENDLRRPSRT--VYVLDTS 364 Query: 259 GSMDQSTKDMAKRFYILLYLFLSRTYKNVEVVYIRHHTQAKEV----------------D 302 GSM+ D K L EV + + K V Sbjct: 365 GSMEGDRLDRLKTALTELTGDFR---DREEVTLMPFGSDVKSVRTHVVRPADPKAGLDGI 421 Query: 303 EHEFFYSQETGGTIVSSALKLMDEVVKERYNPAQWNIYAAQASDGDNWADDSPLCHEILA 362 + G T + ++L+ E + +I +DG+N SP + Sbjct: 422 RADTRKLSAAGETAIYTSLRRAYEHLGAVDRDTFTSIVL--MTDGENTEGASPADFDDFY 479 Query: 363 KKLLPVVRYY-SYIEITRRAHQTLWREYEHLQST 395 +L R+ + + + + + Sbjct: 480 GRLPDAARHIPVFPILFGDSDRDELEHIAEVTGG 513 >UniRef50_P19827 Inter-alpha-trypsin inhibitor heavy chain H1 n=63 Tax=Mammalia RepID=ITIH1_HUMAN Length = 911 Score = 64.8 bits (156), Expect = 7e-09, Method: Composition-based stats. Identities = 24/205 (11%), Positives = 55/205 (26%), Gaps = 26/205 (12%) Query: 239 YEKRPDPSSQAVMFCLMDVSGSMDQSTKDMAKRFYILLYLFLSRTYKNVEVVY------- 291 + + + + ++D+SGSM K + + + V++ Sbjct: 281 FAPQNLTNMNKNVVFVIDISGSMRGQKVKQTKEALLKILGDMQPGDYFDLVLFGTRVQSW 340 Query: 292 ---IRHHTQAKEVDEHEF---FYSQETGGTIVSSALKLMDEVVK--ERYNPAQWN--IYA 341 + ++A +F F E T ++ L E++ + P N Sbjct: 341 KGSLVQASEANLQAAQDFVRGFSLDEA--TNLNGGLLRGIEILNQVQESLPELSNHASIL 398 Query: 342 AQASDGDNWA--DDSPLCHEILAKKLLPVVRYYSYIEITRRAHQTLWREYEHLQSTFDNF 399 +DGD D + + + Y+ + Sbjct: 399 IMLTDGDPTEGVTDRSQILKNVRNAIRGRFPLYNLGF-GHNVDFNFLEVMSMENNGRA-- 455 Query: 400 AMQHIRDQDDIYPVFRELFHKQNAT 424 Q I + D + + + Sbjct: 456 --QRIYEDHDATQQLQGFYSQVAKP 478 >UniRef50_A2E0T6 von Willebrand factor type A domain containing protein n=1 Tax=Trichomonas vaginalis RepID=A2E0T6_TRIVA Length = 753 Score = 64.8 bits (156), Expect = 7e-09, Method: Composition-based stats. Identities = 27/187 (14%), Positives = 54/187 (28%), Gaps = 28/187 (14%) Query: 246 SSQAVMFCLMDVSGSMDQSTKDMAKRFYILLYLFLSRTYKNVEVVYIRHHTQAKEVDEHE 305 + + ++D SGSM S AK +L L + + + EV Sbjct: 239 QANTEFYFIIDCSGSMYGSRIKNAKSCLNVLLHSLPIGCRFSIIKF----GTKFEVALEP 294 Query: 306 FFYSQETGGTIVSSALKLM---------------DEVVKERYNPAQWNIYAAQASDGDNW 350 Y+ E +S A+ + + + E + +DG+ Sbjct: 295 CDYTDE----NMSKAMHQLDLIDADMCGNDMISPLKYISEHPQKKDYIKQVFLLTDGE-- 348 Query: 351 ADDSPLCHEILAKKLLPVVRYYSYIEITRRAHQTLWREYEHLQSTFDNFAMQHIRDQDDI 410 D + + + R ++ I A + L + S F + ++ Sbjct: 349 --DDRISICAMVQANRDNFRVFTIG-IGSDADRNLIIDVARNGSGRYIFIDDEDENMNEK 405 Query: 411 YPVFREL 417 L Sbjct: 406 VIELLRL 412 >UniRef50_B6HQ22 Pc22g19800 protein n=1 Tax=Penicillium chrysogenum Wisconsin 54-1255 RepID=B6HQ22_PENCW Length = 896 Score = 64.8 bits (156), Expect = 7e-09, Method: Composition-based stats. Identities = 31/192 (16%), Positives = 57/192 (29%), Gaps = 24/192 (12%) Query: 239 YEKRPDPSSQAVMFCLMDVSGSMDQSTKDMAKRFYILLYLFLSRTYKNVEVVY------- 291 K P + + ++D SGSM + L L + + Sbjct: 267 VPKFSLPPDLSEIVFVVDRSGSMTD-NMHTLRSALGLFLKSLPLGVPFNLISFGSSFEAI 325 Query: 292 -IRHHTQAKEVDEHEFFY---SQET-GGTIVSSALKLMDEVVKERYNPAQWNIYAAQASD 346 R +E E + Q GGT + S L+ V++RY + +D Sbjct: 326 WARSKVSTRESLEEALQHTKNIQADLGGTEILSGLEAA---VEKRYQDKVLEVLVL--TD 380 Query: 347 GDNWADDSPLCHEILAKKLLPVVRYYSYIEITRRAHQTLWREYEHLQSTFDNFAMQHIRD 406 G+ W A + R+++ +H L F Q + + Sbjct: 381 GEVWNQSEVFDLVNQANQQ-HSTRFFTLGLGDSVSHS-LINGISRAGKGF----TQTVLN 434 Query: 407 QDDIYPVFRELF 418 +D+ + Sbjct: 435 NEDLNKTVVRML 446 >UniRef50_Q54LJ4 Type A von Willebrand factor domain-containing protein n=3 Tax=Eukaryota RepID=Q54LJ4_DICDI Length = 2563 Score = 64.8 bits (156), Expect = 7e-09, Method: Composition-based stats. Identities = 53/340 (15%), Positives = 107/340 (31%), Gaps = 44/340 (12%) Query: 115 FVFQISKDEYLDLLFEDLALP-NLKQNQQRQLTEYKTH-------RAGYTANGVPANISV 166 +V ++S D LD+ F LP ++ Q+ Q + T +ISV Sbjct: 886 YVTELSIDG-LDISF---VLPRSITPKQRLQSSSSNTQSVTSTVQVTELAQKQSDLSISV 941 Query: 167 VRSLQNSLARRTAMTAGKR--RELHALEENLAIISNSEPAQLLEEERLRKEIAELRAKI- 223 + ++ + + T R R L N L + +L + E + Sbjct: 942 GIEMPYNIVKLISPTHDVRIKRTHTKATIELNNQDNQY---LDKNFQLLIGLEEPYSPRM 998 Query: 224 -----ERVPFIDTFDLRYKNYEKRPDPSSQAVMFCLMDVSGSMDQSTKDMAKRFYILLYL 278 E+ K S ++ L+D+S SM + R + Sbjct: 999 WVEVDEKGHHASMLAFYPKLDIDNTMKDSHTMVTLLIDLSSSMAGDAFEDLLRAVRITIS 1058 Query: 279 FLS--RTYKNVEVVY-----------IRHHTQAKEVDEHEFFYSQET-GGTIVSSALKLM 324 L + V + + ++ + + + GGT++ L+ + Sbjct: 1059 NLRGMQKVLFDVVCFGDTFDWLFGIGVPPTESNLQIAWSHINHLKTSYGGTLLHQPLQSL 1118 Query: 325 DEVVKERYNPAQWNIYAAQASDGDNWADDSPLCHEILAKKLLPVVRYYSYIEITRRAHQT 384 + ++ N +DG N A + ++L KK P R +++ I + Sbjct: 1119 YLLAEKAKPTNPHN--ILLFTDG-NVA--NEELVQMLVKKASPYCRMFAFG-IGEHCSRH 1172 Query: 385 LWREYEHLQSTFDNFAMQHIR-DQDDIYPVFRELFHKQNA 423 + L + F + R + I + L + Sbjct: 1173 FVKSICRLGGGYPEFIQTNKRPNPKKIIDQLQRLTQPAMS 1212 >UniRef50_A6QBT6 von Willebrand factor type A domain protein n=1 Tax=Sulfurovum sp. NBC37-1 RepID=A6QBT6_SULNB Length = 305 Score = 64.4 bits (155), Expect = 7e-09, Method: Composition-based stats. Identities = 24/196 (12%), Positives = 54/196 (27%), Gaps = 26/196 (13%) Query: 251 MFCLMDVSGSMDQ----------STKDMAKRFYILLYLFLSRTYKN-----VEVVYIRHH 295 + ++D S SM Q + D+ K + + +V +I Sbjct: 85 IVLVIDSSDSMRQMGFDPKDPYKNKFDVVKEVVADFIKK-RKNDRIGMVTFADVAFIASP 143 Query: 296 TQAKEVDEHEFFYSQ----ETGGTIVSSALKLMDEVVKERYNPAQWNIYAAQASDG-DNW 350 ++ Q T ++ AL ++ + ++ +DG DN Sbjct: 144 LTFEKDFLTNITEMQKLGMAGKRTAINDALVQAYNLMSKSKAKSK---IIILLTDGRDNM 200 Query: 351 ADDSPLCHEILAKKLLPVVRYYSYIEITRRAHQTLWREYEHLQSTFDNFAMQHIRDQDDI 410 + + + +K + R + +A + I Sbjct: 201 SKIPLSDVKHMIEKRDVKLYTIGIG-GPRDYDAQYLKTLAKAGKGQ-AYAARSAAMLSKI 258 Query: 411 YPVFRELFHKQNATAK 426 Y +L + + K Sbjct: 259 YDEINKLEVTKLDSKK 274 >UniRef50_A7VF89 Putative uncharacterized protein n=1 Tax=Clostridium sp. L2-50 RepID=A7VF89_9CLOT Length = 1391 Score = 64.4 bits (155), Expect = 7e-09, Method: Composition-based stats. Identities = 29/171 (16%), Positives = 53/171 (30%), Gaps = 17/171 (9%) Query: 253 CLMDVSGSMDQSTKDMAKRFYILLYLFLSRTYKNVEVVYIRHH------TQAKEVDEHEF 306 +MD SGSM+ + AK+ ++ K + V Y T ++ Sbjct: 553 IVMDKSGSMEGAAIANAKQAATEAVEHITSE-KMMIVSYDNEAYLEQSLTSRSGTLKNSI 611 Query: 307 FYSQETGGTIVSSALKLMDEVVKERYNPAQWNIYAAQASDGDNWADDSP-LCHEILAKKL 365 + GGT +S+ L L + ++ + SDG + + A KL Sbjct: 612 AAISDGGGTNISAGLNLALDNLEAEKG----SRAVILMSDGQDGGSEEDMQAATDRAAKL 667 Query: 366 LPVVRYYSYIEITRRAHQTLWREYEHLQSTFDNFAMQHIRDQDDIYPVFRE 416 V + E + + + DIY ++ Sbjct: 668 GISVYTVGFGEC----DDAYMQAIAEVTGGKF-VKASASTELSDIYLYLQK 713 >UniRef50_A3ZR58 Putative uncharacterized protein n=1 Tax=Blastopirellula marina DSM 3645 RepID=A3ZR58_9PLAN Length = 1032 Score = 64.4 bits (155), Expect = 7e-09, Method: Composition-based stats. Identities = 25/188 (13%), Positives = 48/188 (25%), Gaps = 21/188 (11%) Query: 251 MFCLMDVSGSMDQSTKDMAKRFYILLYLFLSRTYKNVEVVYIRHHTQAKEVDE------- 303 + ++D SGSM M + + + I +QA+ + Sbjct: 459 LMLVLDKSGSMQGEKMQMTQGAALAAIRAMGAADFAG---VIGFDSQAQRIVPIRKVDNP 515 Query: 304 ----HEFFYSQETGGTIVSSALKLMDEVVKERYNPAQWNIYAAQASDGDNWADDSPLCHE 359 + +GGT ++ + L ++ + SDG P Sbjct: 516 GMFVAQVRKLSASGGTNMTPGVALGFRDLQN---VDAGVKHMIVLSDGQ----TEPGNVA 568 Query: 360 ILAKKLLPVVRYYSYIEITRRAHQTLWREYEHLQSTFDNFAMQHIRDQDDIYPVFRELFH 419 +A + + S + + A Q L R + Sbjct: 569 QIASDMKKMGMTVSAVAVGSDADQKLMATVARNGGGKFYAVNNPKAIPRIFMREARRVAQ 628 Query: 420 KQNATAKG 427 A G Sbjct: 629 PLVKEAPG 636 >UniRef50_Q503P4 Zgc:110377 n=9 Tax=Clupeocephala RepID=Q503P4_DANRE Length = 868 Score = 64.4 bits (155), Expect = 7e-09, Method: Composition-based stats. Identities = 25/203 (12%), Positives = 58/203 (28%), Gaps = 23/203 (11%) Query: 239 YEKRPDPSSQAVMFCLMDVSGSMDQSTKDMAKRFYILLYLFLSRTYKNVEVVYIR----- 293 + P ++ ++D S SM + K + L +V+ Sbjct: 245 FAPANLPRVPKMVVFVIDNSYSMYGNKMAQTKEALGTILGELPEDDYFAIIVFSTTFVVW 304 Query: 294 ------HHTQAKEVDEHEFFYSQETGGTIVSSALKLMDEV--VKERYNPAQWNIYAAQA- 344 + + + + GGT + A E+ +R A N+ Sbjct: 305 RPYLSKATEENVKEAQEYVKTIEVIGGTELHDATIHGVEMLYAAQRNGTAPKNMVLMMIL 364 Query: 345 -SDGDN--WADDSPLCHEILAKKLLPVVRYYSYIEITRRAHQTLWREYEHLQSTFDNFAM 401 +DG + P E + K + + + A + + Sbjct: 365 LTDGQPNQYPRSLPEIQESIRKAIDGNITLFGLAFGN-DADYGFLDTLSKQNNG----IV 419 Query: 402 QHIRDQDDIYPVFRELFHKQNAT 424 + I + D + F+++ ++ Sbjct: 420 RRIYEDSDAPLQLKG-FYEEVSS 441 >UniRef50_A8VYD1 Extracellular solute-binding protein, family 5 n=1 Tax=Bacillus selenitireducens MLS10 RepID=A8VYD1_9BACI Length = 978 Score = 64.4 bits (155), Expect = 7e-09, Method: Composition-based stats. Identities = 14/165 (8%), Positives = 50/165 (30%), Gaps = 13/165 (7%) Query: 244 DPSSQAVMFCLMDVSGSMDQSTKDMAKRF-YILLYLFLSRTYKNVEVVYIRHHTQAKEVD 302 P + + ++D SG+++ D + + + + + +E+ Sbjct: 56 QPGAGLDLMFVLDNSGTVNLDDTDSIRSSTVSDYAENMLPGDRGGIISFNTEADMLQEMS 115 Query: 303 EHEFFYSQE-------TGGTIVSSALKLMDEVVKERYNPAQWNIYAAQASDGDNWADDSP 355 ++ + +GGT +S ++ +E + +DG + + Sbjct: 116 DNRYDLLDALSALPDPSGGTDLSQGMRAANEQFVQ--TKGANKQIMVLITDGADT-INLA 172 Query: 356 LCHEILAKKLLPVVRYY--SYIEITRRAHQTLWREYEHLQSTFDN 398 + + + + + + + + L ++ Sbjct: 173 EVYNQVREARMNGITIFTLGLGSLATGLDEALLQDIADQTRGQYR 217 >UniRef50_Q8TYU9 Mg-chelatase subunit ChlI and Chld (MoxR-like ATPase and vWF domain) n=1 Tax=Methanopyrus kandleri RepID=Q8TYU9_METKA Length = 818 Score = 64.4 bits (155), Expect = 7e-09, Method: Composition-based stats. Identities = 44/307 (14%), Positives = 86/307 (28%), Gaps = 22/307 (7%) Query: 121 KDEYLDLLFEDLALPNLKQNQQRQLTEYKTHRAGYTANGVPANISVVRSLQNSLARRTAM 180 DE + + L + L++ + +++ G + ++ +L L Sbjct: 517 IDELEEEIQRRLEI--LQKLGFVR----PSYQGGVSLTLKGRELAAFSALIEELEAFEGT 570 Query: 181 TAGKRRELHALEENLAIISN-SEPAQLLEEERLRKEIAELRAKIERVPFIDTFDLRYKNY 239 G E S E + L + A I DLR + Sbjct: 571 EFGHHAAKRLSERGTGSRSYSREYRRGDPYANLDVRGSLRTAVRRGRREILPEDLRSFDR 630 Query: 240 EKRPDPSSQAVMFCLMDVSGSMDQSTKDMAKRFYILLYLF-LSRTYKNVEVVY------I 292 E+ + ++D SGSM D AKR I L F + + V + + Sbjct: 631 EE----EVCLDIVYVIDTSGSMSGDRIDAAKRAAIALAHFSVKAGDRVGIVGFNTKAEIV 686 Query: 293 RHHTQAKEVDEHEFFYSQETGGTIVSSALKLMDEVVKERYNPAQWNIYAAQASDGDNWAD 352 T E + + G T + A+++ E+ + P + +DG Sbjct: 687 VDITSDVEEIITKVMSLKPGGATDIGDAIRVGTELFRRCGRPDRDWHMIL-LTDGVPTKG 745 Query: 353 DSPLCHEILAK---KLLPVVRYYSYIEITRRAHQTLWREYEHLQSTFDNFAMQHIRDQDD 409 + + L++ V + L + + Sbjct: 746 EPDPETKALSEATAASRMGVTISTIGIKLPEEGIRLIEHIAGISGGRSHHITDPEELTLV 805 Query: 410 IYPVFRE 416 +R Sbjct: 806 TLNEYRR 812 >UniRef50_A6Q208 von Willebrand factor type A domain protein n=1 Tax=Nitratiruptor sp. SB155-2 RepID=A6Q208_NITSB Length = 305 Score = 64.4 bits (155), Expect = 8e-09, Method: Composition-based stats. Identities = 29/196 (14%), Positives = 52/196 (26%), Gaps = 26/196 (13%) Query: 251 MFCLMDVSGSMDQS----------TKDMAKRFYILLYLFLSRTYKNVEVVYIRHH----- 295 + +D SGSM + D+ + R V++ Sbjct: 85 IVLAIDASGSMQEKGFDPTDPQKTKFDVVRSLVKAFISK-RRNDNIGVVIFGSFAYIASP 143 Query: 296 -TQAKEVDEHEFFYSQ---ETGGTIVSSALKLMDEVVKERYNPAQWNIYAAQASDG-DNW 350 T KE + Y T + AL ++KE + +DG D Sbjct: 144 LTFNKEAVKKILDYLDIGVAGSKTAIDDALIESVRLLKES---QAKSKIVILLTDGIDTA 200 Query: 351 ADDSPLCHEILAKKLLPVVRYYSYIEITRRAHQTLWREYEHLQSTFDNFAMQHIRDQDDI 410 + P +AKK + + + R + +A + I Sbjct: 201 SKTPPDVAVKMAKKYGVKIYTIGIGDKRG-IDEAFLRWLAQQGHGYYFYAKDASMLRK-I 258 Query: 411 YPVFRELFHKQNATAK 426 Y L + + Sbjct: 259 YDEINRLEPSEIRGKE 274 >UniRef50_C4LHS9 Magnesium-chelatase subunit D n=1 Tax=Corynebacterium kroppenstedtii DSM 44385 RepID=C4LHS9_CORK4 Length = 665 Score = 64.4 bits (155), Expect = 8e-09, Method: Composition-based stats. Identities = 44/316 (13%), Positives = 83/316 (26%), Gaps = 30/316 (9%) Query: 61 EPMFHQGRGGLRHRVHPGNDHFVQNDRIERPQGGGGGSGSGQGQASQDGEGQDEFVFQIS 120 P + HP ++ D + + D S Sbjct: 274 RPRLPEDVDDSDFDDHPRDNDENGGDENHDGNSEEENNKLSKNANPAAARPDD--ADTTS 331 Query: 121 KDEYLDLLFEDLALPNL--------KQNQQRQLTEYKTHRAGYTANGVPANISVVRSLQN 172 +E +++ P+ ++ + + A Sbjct: 332 DEEPGADTAPEVSSPSESDSGDTSREERNDPPHNDDASTGAEARNTDPVDKPDDQPLAIP 391 Query: 173 SLARR--TAMTAGKRRELHALEENLAIISNSEPAQLLEEERLRKEIAELRAKIERVPFID 230 + RR T+ T R + + PA+ + + A +R Sbjct: 392 HIPRRGRTSTTVWPGRHGPRGISHRGRVRRVMPARTHDTDLAILPTLAAAAPWQRFRHRH 451 Query: 231 TFDLRYKNYEKRPDPSSQA------VMFCLMDVSGSMDQSTKDMAKR-FYILLYLFLSRT 283 D R + ++Q ++ ++D SGSM + AK +L Sbjct: 452 QDDQRRIILTRDDLRTAQRGSAGGELVIIIVDASGSMGRGAIRTAKSTALEVLQSSYRDR 511 Query: 284 YKNVEVVYIRHH-------TQAKEVDEHEFFYSQETGGTIVSSALKLMDEVVKERYNPAQ 336 K +V H T++ GGT ++SA + R+ P Sbjct: 512 SKVCIIVARGHEAAIGLPVTRSLSRARRCLTSLPTGGGTPLASARLRAASIA-RRFPPEL 570 Query: 337 WNIYAAQASDG-DNWA 351 + SDG N Sbjct: 571 V-RVI-ELSDGRANVG 584 >UniRef50_Q9SJE1 Magnesium-chelatase subunit chlD, chloroplastic n=49 Tax=cellular organisms RepID=CHLD_ARATH Length = 760 Score = 64.4 bits (155), Expect = 8e-09, Method: Composition-based stats. Identities = 30/232 (12%), Positives = 65/232 (28%), Gaps = 31/232 (13%) Query: 125 LDLLFEDLALPNLKQNQQRQLTEYKTHRAGYTANGVPANISVVRSLQNSLARRTAMTAGK 184 + +F+ L + + R G S R + G Sbjct: 455 EEFIFDAEG--GLVDEKLLFFAQQAQKRRGKAGRAKNVIFSEDRGRYIK----PMLPKGP 508 Query: 185 RRELHALEENLAIISNSEPAQLLEEERLRKEIAELRAKIERVPFIDTFDLRYKNYEKRPD 244 + L A + + + R F++ D+R K ++ Sbjct: 509 VKRLAVDATLRAAAPYQKLRREKDIS------------GTRKVFVEKTDMRAKRMARK-- 554 Query: 245 PSSQAVMFCLMDVSGSMDQSTKDMAK-RFYILLYLFLSRTYKNVEVVY-------IRHHT 296 + A++ ++D SGSM + AK LL + + + + + + Sbjct: 555 --AGALVIFVVDASGSMALNRMQNAKGAALKLLAESYTSRDQVSIIPFRGDAAEVLLPPS 612 Query: 297 QAKEVDEHEFFYSQETGGTIVSSALKLMDEVVKERYNPAQWNIY-AAQASDG 347 ++ + + GG+ ++ L V +DG Sbjct: 613 RSIAMARNRLERLPCGGGSPLAHGLTTAVRVGLNAEKSGDVGRIMIVAITDG 664 >UniRef50_A4Y9K4 von Willebrand factor, type A n=3 Tax=Shewanella RepID=A4Y9K4_SHEPC Length = 528 Score = 64.4 bits (155), Expect = 8e-09, Method: Composition-based stats. Identities = 29/144 (20%), Positives = 51/144 (35%), Gaps = 16/144 (11%) Query: 247 SQAVMFCLMDVSGSMDQSTKDMAKRFYILLYLFLSRTYKNVEVVYIRHHTQAKE---VDE 303 ++ M +D SGSM + +++AK + + + V + KE + Sbjct: 363 NRGPMIVCLDTSGSMQGTPENVAKALVLQCISVAKKEKRACFVYLFGSKGEVKEMELTPD 422 Query: 304 HE-------FFYSQETGGTIVSSALKLMDEVVKERYNPAQWNIY-AAQASDGDNWADDSP 355 F GGT V L + E R + QW SDG+ ++ S Sbjct: 423 KAGLEQMILFLSMSFGGGTDVEGPLNMALE----RSDEKQWQQADILLVSDGE-FSVSSG 477 Query: 356 LCHEILAKKLLPVVRYYSYIEITR 379 L +I +K + + + R Sbjct: 478 LSRKISNRKEQRGMSVHGVVIGGR 501 >UniRef50_Q562D1 LOC594926 protein (Fragment) n=18 Tax=Euteleostomi RepID=Q562D1_XENTR Length = 895 Score = 64.4 bits (155), Expect = 8e-09, Method: Composition-based stats. Identities = 24/204 (11%), Positives = 51/204 (25%), Gaps = 27/204 (13%) Query: 239 YEKRPDPSSQAVMFCLMDVSGSMDQSTKDMAKRFYILLYLFLSRTYKNVEVVY-----IR 293 + + ++D S SM K + + + V++ I Sbjct: 265 FAPSKLKEVPKNIIFIIDRSISMIGLKMQQTKEALLKILDDVKEHDHFNFVIFDWGVEIW 324 Query: 294 HHTQAKEVDEH------EFFYSQETGGTIVSSALKLMDEVVKE----RYNPAQWNIYAAQ 343 + K E+ G T ++ AL ++ + R P + Sbjct: 325 EQSLVKATPENLNRAKAYVRNLYPKGWTNINDALLSAISLLDQAHDARSVPKRSASLIIF 384 Query: 344 ASDGDNWADDSPLCHEILAKK----LLPVVRYYSYIEITRRAHQTLWREYEHLQSTFDNF 399 +DG + + + + YS + S Sbjct: 385 MTDGQ--PSTGERNLDKIQENARNAIRGKYSLYSLGFGVG-VDYPFLEKLSLENSGVAR- 440 Query: 400 AMQHIRDQDDIYPVFRELFHKQNA 423 I ++ D F+ + A Sbjct: 441 ---RIYEESDAALQMEG-FYDEVA 460 >UniRef50_A9RSX3 Predicted protein n=1 Tax=Physcomitrella patens subsp. patens RepID=A9RSX3_PHYPA Length = 1068 Score = 64.4 bits (155), Expect = 8e-09, Method: Composition-based stats. Identities = 30/225 (13%), Positives = 57/225 (25%), Gaps = 32/225 (14%) Query: 222 KIERVPFIDTFDLRYKNYEKRPDPSSQAVMFCLMDVSGSMDQSTKDMAKRFYILLYLFLS 281 + L + P + M L+D SGSM + A L + Sbjct: 255 ERHPSHGTHAIALTF-QPRFALQPLRTSEMIFLVDRSGSMMGTQIKQAGEALELFLRSIP 313 Query: 282 RTYKNVEVVYIRHHT-----QAKEVDEHEFFY-------SQET-GGTIVSSALKLMDEVV 328 +V + + E E Q GGT +++A EV Sbjct: 314 FENHYFNIVGFGSNHNFLFPTSVEYTEDSLKKAVHYAQTIQANMGGTEIANAF---FEVF 370 Query: 329 KERYNPAQWNIYAAQASDGDNWADDSPLCHEILAKKLLPVVR--------YYSYIEITRR 380 + R +DG W D+ + + + + R Sbjct: 371 QRRRRN--VPTQIFLLTDGMVW--DAEQLTKSIIEAVDDGARNNSPVRVFTLGVGNAVSH 426 Query: 381 AHQTLWREYEHLQSTFDNFAMQHIRDQDDIYPVFRELFHKQNATA 425 L + +++ R + + + + A Sbjct: 427 H---LIESVARAGGGYAQLVLENERMEKKVLNMLKAGLTPSVTNA 468 >UniRef50_Q0VTG8 Protein containing a von Willebrand factor type A domain n=5 Tax=Gammaproteobacteria RepID=Q0VTG8_ALCBS Length = 698 Score = 64.4 bits (155), Expect = 8e-09, Method: Composition-based stats. Identities = 23/181 (12%), Positives = 49/181 (27%), Gaps = 27/181 (14%) Query: 252 FCLMDVSGSMDQSTKDMAKRFYILLYLFLSRTYKNVEVVYIRHHTQAKEVDE-------- 303 ++D+SGSM+ + L + V++ +A+E+ Sbjct: 321 VFVLDISGSMN-AKLATLGDGVRQALGKLRGNDRFRIVLF---DDRAEELTSGFVDATPN 376 Query: 304 ------HEFFYSQETGGTIVSSALKLMDEVVKERYNPAQWNIYAAQASDG-DNWADDSPL 356 + Q GGT + L L + +DG N Sbjct: 377 NIRQYTQKIMQLQSRGGTNLFGGLSLALTPLDADRPTG-----IVLVTDGVANVGKTRQK 431 Query: 357 CHEILAKKLLPVVRYYSYIEITRRAHQTLWREYEHLQSTFDNFAMQHIRDQDDIYPVFRE 416 L + VR ++++ ++ + + F I + Sbjct: 432 DFIDLLEN--HDVRLFTFVMGNSA-NRPMLTAMTDASNGFAISVSNSDDIAGQILNATSK 488 Query: 417 L 417 + Sbjct: 489 V 489 >UniRef50_Q4S685 Chromosome 9 SCAF14729, whole genome shotgun sequence. (Fragment) n=1 Tax=Tetraodon nigroviridis RepID=Q4S685_TETNG Length = 608 Score = 64.4 bits (155), Expect = 9e-09, Method: Composition-based stats. Identities = 27/232 (11%), Positives = 57/232 (24%), Gaps = 52/232 (22%) Query: 239 YEKRPDPSSQAVMFCLMDVSGSMDQSTKDMAKRFYILLYLFLSRTYKNVEVVY------- 291 + + P + ++DVSGSM + K+ + L + + Sbjct: 238 FAPKGLPVVPKDVIFVIDVSGSMIGTKIQQTKQAMSTILADLREGDHFNIITFSDQVRTW 297 Query: 292 -----IRHHTQAKEVDEHEFFYSQETG-----------------------------GTIV 317 +R Q + G GT + Sbjct: 298 KRGRTVRATRQNVRDAKEFVRRIIAEGCESEATEHHLTASLCLFLLLYEFSFSFPSGTNI 357 Query: 318 SSALKLMDEVVKERYNPAQWN----IYAAQASDGD-NWADDSPLCHEILAKKLLPVVRYY 372 ++AL +++ + + +DG+ + AKK L + Sbjct: 358 NAALLSAAQLINPPSSSRHLSSHRVPLVIFLTDGEATIGVTAGDTILTNAKKALGSASLF 417 Query: 373 SYIEITRRAHQTLWREYEHLQSTFDNFAMQHIRDQDDIYPVFRELFHKQNAT 424 A L + + D + F+ + A+ Sbjct: 418 GLAF-GDDADFLLLKRLALDNRGVARMVY----EDADAALQLKG-FYDEVAS 463 >UniRef50_A2DWC0 von Willebrand factor type A domain containing protein n=1 Tax=Trichomonas vaginalis RepID=A2DWC0_TRIVA Length = 729 Score = 64.0 bits (154), Expect = 9e-09, Method: Composition-based stats. Identities = 26/183 (14%), Positives = 59/183 (32%), Gaps = 22/183 (12%) Query: 247 SQAVMFCLMDVSGSMDQSTKDMAKRFYILLYLFLSRTYKNVEVVYIR----------HHT 296 + + + ++D SGSM++S AK LL L + + + + Sbjct: 231 ANSEFYFIIDCSGSMEESRIKNAKFCLNLLIHSLPVGCRFSIIKFGSMYEVVLPTCDYTD 290 Query: 297 QAKEVDEHEFFYSQET-GGTIVSSALKLMDEVV-KERYNPAQWNIYAAQASDGDNWADDS 354 + + GT + S LK + + KE + +DG++ D Sbjct: 291 ENVAKAMEQINQMDANMEGTDILSPLKFVSDQSTKEGFIKQ-----VFLLTDGEDIHTD- 344 Query: 355 PLCHEILAKKLLPVVRYYSYIEITRRAHQTLWREYEHLQSTFDNFAMQHIRDQDD-IYPV 413 L + R ++ + + L + + + + ++ I + Sbjct: 345 --QIYALVQANRTNNRIFTIGIGSGA-DRNLIKNIARISGGNNALIEDNDEKMNEKIIEL 401 Query: 414 FRE 416 R+ Sbjct: 402 LRK 404 >UniRef50_A7C4W6 von Willebrand factor, type A n=1 Tax=Beggiatoa sp. PS RepID=A7C4W6_9GAMM Length = 305 Score = 64.0 bits (154), Expect = 9e-09, Method: Composition-based stats. Identities = 39/272 (14%), Positives = 85/272 (31%), Gaps = 46/272 (16%) Query: 183 GKRRELHALEENLAIISNSEPAQLLEEERLRKEIAELR--AKIERVPFIDTFDLR----- 235 A + + + + Q + R + + I+ ++ + Sbjct: 48 ITPAHYEAAQFFIDYLLDKSQQQKALQYGFRPANVSVPLASPIDIAHGVNPLAPKMILEM 107 Query: 236 ---------YKNYEKRPDPSSQAVMFCLMDVSGSMDQSTKDMAKRFYILLYLFLSRTYKN 286 K + + P A + ++D SG M A+ + L + Sbjct: 108 PTVDMMEAIIKIWHQYKKP---ANIVLVLDTSGGMRGEKILHARTMALQLLEIVKEADYF 164 Query: 287 VEVVY------IRHHTQAKEVD---EHEFFYSQETGGTIVSSALKLMDEVVKERYNPAQW 337 + + I + Q K + +F Y GGT + A+ +++ P + Sbjct: 165 SLLSFNHSLNWIAKNIQVKSQQKWLKRQFNYQFPGGGTALYDAIFNAYTFLQKNSFPDKI 224 Query: 338 NIYAAQASDGDNWADDSPLCHEILAKKLL-----PVVRYYSYIEITRRAHQTLWREYEHL 392 + SDG + S L + L K+ +R ++ + + L + Sbjct: 225 AVMIV-LSDGGDSH--SELNFKDLLSKIPFNSDTSPIRIFAVGYGSITDKKRLNEIAKMT 281 Query: 393 QSTFDNFAMQHIRDQDDIYPVFRELFHKQNAT 424 Q F + AM + ++F K+ A+ Sbjct: 282 QGKFYDGAMVDVD----------KIFKKEMAS 303 >UniRef50_Q23J98 von Willebrand factor type A domain containing protein n=4 Tax=Tetrahymena thermophila SB210 RepID=Q23J98_TETTH Length = 1633 Score = 64.0 bits (154), Expect = 9e-09, Method: Composition-based stats. Identities = 27/164 (16%), Positives = 49/164 (29%), Gaps = 24/164 (14%) Query: 246 SSQAVMFCLMDVSGSMDQSTKDMAKRFYILLYLFLSRTYKNVEVVYIR-----------H 294 SS++ ++D SGSM A IL L + + + Sbjct: 1043 SSRSEFIFILDRSGSMRGQPIRRACEAIILFLKSLPNDSYFNVISFGSSFEKLFPFSTKY 1102 Query: 295 HTQAKEVDEHEFFYSQET-GGTIVSSALKLMDEVVKERYNPAQWNIYAAQASDGDNWADD 353 +++ E GGT + + L + + + + +N +DG+ D Sbjct: 1103 TSESLEKAVQIINNYDSDLGGTEIYNPLHNVFIMKR----ISGYNRQIFLLTDGE---VD 1155 Query: 354 SPLCHEILAKKLLPVVRY--YSYIEITRRAHQTLWREYEHLQST 395 S L KK R + + L +E Sbjct: 1156 SSEQVIELIKKNNKYNRVHSIGFGFKADQY---LIKESAIAGKG 1196 >UniRef50_A0M6V9 Membrane protein containing von Willebrand factor (VWA) type A domain n=21 Tax=Bacteroidetes RepID=A0M6V9_GRAFK Length = 354 Score = 64.0 bits (154), Expect = 9e-09, Method: Composition-based stats. Identities = 22/157 (14%), Positives = 45/157 (28%), Gaps = 20/157 (12%) Query: 240 EKRPDPSSQAVMFCLMDVSGSMDQSTK-----DMAKRFYILLYLFLSRTYKNVEVVY--- 291 + + +DVS SMD + +K+ + L + + + Y Sbjct: 81 KMETVKREGVDIVFAIDVSKSMDAEDIAPNRLEKSKQLVSQILSSLG-SDRVGIIAYAGG 139 Query: 292 ------IRHHTQAKEVDEHEFFY-SQETGGTIVSSALKLMDEVVKERYNPAQWNIYAAQA 344 I A ++ + GT +S A++L + Q N Sbjct: 140 AYPQLPITTDFSAAKMFLQALNTDMISSQGTAISDAIELATTYYD---DDQQTNRVLFII 196 Query: 345 SDGDNWADDSPLCHEILAKKLLPVVRYYSYIEITRRA 381 SDG++ + + A + + Sbjct: 197 SDGED-HEGNVEDIAEQAAEKGIRIFTIGVGTEKGGP 232 >UniRef50_A5GQG5 Protoporphyrin IX Mg-chelatase subunit ChlD n=3 Tax=cellular organisms RepID=A5GQG5_SYNR3 Length = 653 Score = 64.0 bits (154), Expect = 1e-08, Method: Composition-based stats. Identities = 33/217 (15%), Positives = 64/217 (29%), Gaps = 23/217 (10%) Query: 140 NQQRQLTEYKTHRAGYTANGVPANISVVRSLQNSLARRTAMTAGKRRELHALEENLAIIS 199 + T + + A S R + S +R + R A + Sbjct: 363 MLDAEATAIDPELLQFQSAKNRAASSGSRGIVLSESRGRYVRPVLPRGPVRRIAVDATLR 422 Query: 200 NSEPAQLLEEERLRKEIAELRAKIERVPFIDTFDLRYKNYEKRPDPSSQAVMFCLMDVSG 259 + P Q + R + DLR K +++ + A++ L+D SG Sbjct: 423 AAAPYQKARRA----------REPHRKVIVQDGDLRAKQLQRK----AGALVIFLVDASG 468 Query: 260 SMDQSTKDMAKRF-YILLYLFLSRTYKNVEVVYIRH-------HTQAKEVDEHEFFYSQE 311 SM + AK LL + + + T++ + Sbjct: 469 SMALNRMQSAKGAVLRLLTEAYENRDEVALIPFRGEQAEVLLPPTRSITAAKRRLETMAC 528 Query: 312 TGGTIVSSALKLMDEVVKERYNPAQWN-IYAAQASDG 347 GG+ ++ L V + + + +DG Sbjct: 529 GGGSPLAHGLAQAARVGNNALQTGELSQVVVVAITDG 565 >UniRef50_A8DJP2 von Willebrand factor type A n=1 Tax=Candidatus Chloracidobacterium thermophilum RepID=A8DJP2_9BACT Length = 324 Score = 64.0 bits (154), Expect = 1e-08, Method: Composition-based stats. Identities = 35/272 (12%), Positives = 75/272 (27%), Gaps = 24/272 (8%) Query: 169 SLQNSLARRTAMTAGKRRELHALEENLAIISNSEPAQLLEEERLRKEIAELRAKIERVPF 228 +L + + + + + + I S + K + Sbjct: 19 TLLSPKGQERPSKRPPKADRTTTPDEVFTIDTSLVVLDVAVFDQDNRFVGDLRKENFRVY 78 Query: 229 IDTFDLRYKNYEKRPDPSSQAVMFCLMDVSGSMDQSTKDMAKRFYILLYLFLSRTYKNVE 288 + + + + + + P S + ++D SGSM + L + Sbjct: 79 DEQVEQQIEYFSRDEAPVS---LGFVVDTSGSMR-PRRAKVIEAVKFLARAAKPGDEFFL 134 Query: 289 VVYIRHHTQAKEVD------EHEFFYSQETGGTIVSSALKLMDEVVKERYNPAQWNIYAA 342 V + A+E E GGT + A++L E + + I Sbjct: 135 VDFKNKAELAEEFTPRPADIEEAVDNIVWGGGTALLDAIQLSAEYADKEGKNRRKAIVVF 194 Query: 343 QASDGDNWAD-DSPLCHEILAKKLLPVVRYYS----------YIEITRRAHQTLWREYEH 391 SDGD+ L ++ V + TR+ L ++ + Sbjct: 195 --SDGDDRDSYYDRRQLIKLLQEYQVQVYIVGFPDDDDDGGLFGRSTRKRAVQLIKDIAN 252 Query: 392 LQSTFDNFAMQHIRDQDDIYPVFRELFHKQNA 423 F + + + +I Q + Sbjct: 253 ETGGR-AFFPKSVDELPEIVRTINADLRTQYS 283 >UniRef50_A8IJ40 Predicted protein n=1 Tax=Chlamydomonas reinhardtii RepID=A8IJ40_CHLRE Length = 434 Score = 64.0 bits (154), Expect = 1e-08, Method: Composition-based stats. Identities = 25/213 (11%), Positives = 48/213 (22%), Gaps = 42/213 (19%) Query: 198 ISNSEPAQLLEEERLRKEIAELRAKIERVPFIDTFDLRYKNYEKRPDPSSQAVMFCLMDV 257 S + + L + +L + + K + + C++D Sbjct: 120 FSANTEQPVPPVSDLDGRLDKLLKQYGLGEEAVRAVVSLKAVADVKQR-AHVALTCVLDR 178 Query: 258 SGSMDQSTKDMAKRFYILLYLFLSRTYKNVEVVYIR----------HHTQAKEVDEHEFF 307 SGSM + + L L+ V Y A+ + Sbjct: 179 SGSMSGERIALVRETCHFLIDQLTPDDYLGIVSYSGGVRADVPLLRMTPAARGLAHAMVD 238 Query: 308 YSQETGGTIVSSALKLMDEVVKERYNPAQWNIYAAQASDGDNWADDSPLCHEILAKKLLP 367 + G T + L E P +D Sbjct: 239 ALEADGSTALYDGLVAGVRQQMEAEAP----------TD--------------------Q 268 Query: 368 VVRYYSYIEITRRAHQTLWREYEHLQSTFDNFA 400 V +++ L + QS + Sbjct: 269 HVTVHTFGFGAG-HSVELLQAVADAQSGVYYYI 300 >UniRef50_A5GIB9 Protoporphyrin IX Mg-chelatase subunit ChlD n=22 Tax=Cyanobacteria RepID=A5GIB9_SYNPW Length = 728 Score = 64.0 bits (154), Expect = 1e-08, Method: Composition-based stats. Identities = 41/252 (16%), Positives = 75/252 (29%), Gaps = 26/252 (10%) Query: 130 EDLALPNLKQ--NQQRQLTEYKTHRAGYTANGVPANISVVRSLQNSLARRTAMTAGKRRE 187 +D A P++ + + + A + S RS+ S +R + R Sbjct: 424 QDEAPPSVPEEFMLDPEAVAIDPDLLLFNAAKSKSGNSGSRSVVLSDSRGRYVKPMLPRG 483 Query: 188 LHALEENLAIISNSEPAQLLEEERLRKEIAELRAKIERVPFIDTFDLRYKNYEKRPDPSS 247 A + + P Q + ER ++ DLR K + Sbjct: 484 PVRRIAVDATLRAAAPYQKARRA----------RQPERTVIVEESDLRAKLL----QRQA 529 Query: 248 QAVMFCLMDVSGSMDQSTKDMAKRF-YILLYLFLSRTYKNVEVVY-------IRHHTQAK 299 A++ L+D SGSM + AK LL + + + + T++ Sbjct: 530 GALVIFLVDASGSMALNRMQSAKGAVIRLLTEAYENRDEVALIPFRGDQAEVLLPPTRSI 589 Query: 300 EVDEHEFFYSQETGGTIVSSALKLMDEVVKERYNPAQW-NIYAAQASDGD-NWADDSPLC 357 GG+ ++ L V + +DG N + L Sbjct: 590 TAARRRLESMPCGGGSPLAHGLTQAARVGANALATGDLGQVVVVAITDGRGNVPLSTSLG 649 Query: 358 HEILAKKLLPVV 369 L + P + Sbjct: 650 QPELEGEEKPDL 661 >UniRef50_UPI0001760CA2 PREDICTED: inter-alpha (globulin) inhibitor H5-like n=1 Tax=Danio rerio RepID=UPI0001760CA2 Length = 1157 Score = 64.0 bits (154), Expect = 1e-08, Method: Composition-based stats. Identities = 42/348 (12%), Positives = 90/348 (25%), Gaps = 43/348 (12%) Query: 110 EGQDEFVFQISKDEYLDLLFEDLALPNLKQNQQRQLTEYKTHRAGYTANGVPANISVVRS 169 F +S +E L L +L + + + G IS +R+ Sbjct: 147 PPGARVSFSLSYEELLSRRLGRYEL-SLGLRPGQPVQNLSLEVSISERTG----ISFIRA 201 Query: 170 LQNSLARRTAMTA-----GKRRELHALEENLAIISNSEPAQLLEEERLRKEIAELRAKIE 224 L +R + T A + + Q + A+ + + Sbjct: 202 LPLRTSRLLSNTVQADAEAPPSTKVKQNAYCAHVRYTPSIQQQRNVSPKGLSADFIIQYD 261 Query: 225 RVPFIDTFDLRYKN------YEKRPDPSSQAVMFCLMDVSGSMDQSTKDMAKRFYILLYL 278 D++ + + R P + ++D+SGSM + K + + Sbjct: 262 VELKDPMGDIQVDDGYFVHYFAPRGLPVVPKDVIFVIDISGSMIGTKIKQTKAAMVSILS 321 Query: 279 FLSRTYKNVEVVYI------RHHTQAKEVDEHEFF------YSQETGGTIVSSAL----- 321 L + + + + ++ G T +++AL Sbjct: 322 DLREGDYFNLITFSDDVHTWKKDRTVRATRQNVRDAKEFVRKIIAAGWTNINAALLSAAK 381 Query: 322 ----KLMDEVVKERYNPAQWNIYAAQASDGD-NWADDSPLCHEILAKKLLPVVRYYSYIE 376 R +Q +DG+ + A+K L +V + Sbjct: 382 LLNPSTRSSSSTGRAPSSQRVPMIIFLTDGEATIGETETDVILHNAQKSLGLVSLFGLAF 441 Query: 377 ITRRAHQTLWREYEHLQSTFDNFAMQHIRDQDDIYPVFRELFHKQNAT 424 A + R + DD + + + Sbjct: 442 -GDDADFPMLRRLALENRGVARMVY----EDDDAAIQLKGFYDEVATP 484 >UniRef50_C5CEE7 PEGA domain protein n=1 Tax=Kosmotoga olearia TBF 19.5.1 RepID=C5CEE7_KOSOT Length = 1706 Score = 64.0 bits (154), Expect = 1e-08, Method: Composition-based stats. Identities = 33/217 (15%), Positives = 65/217 (29%), Gaps = 25/217 (11%) Query: 192 EENLAIISNSEPAQLLEEERLRKEIAELRAKIERVPFIDTFDLRYKNYEKRPDPSSQAVM 251 E + + ++ + + EE L A + R DL+ + +PS Sbjct: 246 EIKILFRAYTDINKPISEEVLLHSDAYILEPSGR------IDLQ-SLEALKKEPSLN--F 296 Query: 252 FCLMDVSGSMDQSTKDMAKRFYILLYLFLSRTYKNVEVVYIRHH------TQAKEVDEHE 305 +D SGSM + AK L + + + T+ +E + Sbjct: 297 VLEVDRSGSMK-PVMEKAKDAASYFLDLLPENSELALIAFDTEIEVLKNFTRDREQLKRA 355 Query: 306 FFYSQETGGTIVSSALKLMDEVVKERYNPAQWNIYAAQASDG--DNWADDSPLCHEILAK 363 + G T + + E++ ER P + +DG N+ D +P + L++ Sbjct: 356 LAIIKARGATPLYDTVAKGIELLSERSGP----RFLILVTDGVDANYGDTAPGSEKTLSE 411 Query: 364 --KLLPVVRYYSYIEITRRAHQTL-WREYEHLQSTFD 397 +L + Sbjct: 412 VIRLARENNVVIFAIGLGTRIDEFSLGTLARSTGGMF 448 >UniRef50_A0C3R8 Chromosome undetermined scaffold_148, whole genome shotgun sequence n=1 Tax=Paramecium tetraurelia RepID=A0C3R8_PARTE Length = 618 Score = 63.6 bits (153), Expect = 1e-08, Method: Composition-based stats. Identities = 26/162 (16%), Positives = 48/162 (29%), Gaps = 15/162 (9%) Query: 226 VPFIDTFDLRYK----NYEKRPDPSSQAVMFCLMDVSGSMDQSTKDMAKRFYILLYLFLS 281 +K E + L+D SGSM + A+ I Sbjct: 410 QYRFSREGKEFKFVENQVEIQAAQQPSFHYIILLDDSGSMSGDRFNQAQNGLISSLSSAK 469 Query: 282 RTYKNVEVVYIRHHTQAKEVDEHEFFYSQE--------TGGTIVSSALKLMDE-VVKERY 332 +N+ V I + A+ V + + Q GGT SA +L + + + Sbjct: 470 DN-QNIRVTIIIFNDNARCVVDSQTINMQTIKNAVVCNGGGTSFQSAFQLAYQKIAAVKN 528 Query: 333 NPAQWNIYAAQASD-GDNWADDSPLCHEILAKKLLPVVRYYS 373 +D GD++ + L + + + Sbjct: 529 FEQFNKHVIFFYTDGGDSYPTQALNQFANLPQAQRMKIDLIA 570 >UniRef50_C2MDE3 BatA protein n=6 Tax=Bacteroidales RepID=C2MDE3_9PORP Length = 326 Score = 63.6 bits (153), Expect = 1e-08, Method: Composition-based stats. Identities = 28/216 (12%), Positives = 54/216 (25%), Gaps = 38/216 (17%) Query: 242 RPDPSSQAVMFCLMDVSGSMDQSTK-----DMAKRFYILLYLFLSRTYKNVEVVYIRHH- 295 + MD+SGSM + A+ + VV+ Sbjct: 80 EERSIQGIDLVLAMDLSGSMQALDLKPNRFEAARDVASEMIA-ARPNDNIGLVVFAGESF 138 Query: 296 -----TQAKEVDEHEFFYSQET---GGTIVSSALKLMDEVVKERYNPAQWNIYAAQASDG 347 T +V ++ GT + L ++ + +DG Sbjct: 139 TLCPLTVDHDVILQMLDATEIGQLEDGTAIGLGLATAINTLR---GSDNKSKVIILLTDG 195 Query: 348 DNWADD-SPLCHEILAKKLLPVVRYYS------------------YIEITRRAHQTLWRE 388 N A D +P LA++ + + Y+E + + R Sbjct: 196 SNNAGDITPSMAAELAQQYGIRIYTVAAGTNGVAKFPVQTASGIEYVEADVQIDEGTLRH 255 Query: 389 YEHLQSTFDNFAMQHIRDQDDIYPVFRELFHKQNAT 424 + +IY L + + Sbjct: 256 IAQQTGGKY-YRATDETKLHEIYKEIDSLEKSRLTS 290 >UniRef50_Q47M48 von Willebrand factor, type A n=4 Tax=Streptosporangineae RepID=Q47M48_THEFY Length = 609 Score = 63.6 bits (153), Expect = 1e-08, Method: Composition-based stats. Identities = 31/209 (14%), Positives = 55/209 (26%), Gaps = 45/209 (21%) Query: 244 DPSSQAVMFCLMDVSGSMDQS-------TKDMAKRFYILLYLFLSRTYKNVEVVYIR--- 293 + ++ +D SGSM +S ++AK I S + + ++ Sbjct: 407 RKPANVLLV--IDTSGSMQESVPGTGSTRLELAKEAAITSLDEFSDSDRVGLWMFSTDLE 464 Query: 294 -------------------HHTQAKEVDEHEFFYSQETGGTIVSSALKLMDEVVKERYNP 334 + T +E GGT + +V E P Sbjct: 465 DNGQDWRELVPLGPLGASVNGTPRREELAERISNLPPGGGTGLYDTALAAHTLVAEHSRP 524 Query: 335 AQWNIYAAQASDGDNWADDSPLCHEILAK------KLLPVVRYYSYIEITRRAHQTLWRE 388 N +DG N + ++L + + SY E A + Sbjct: 525 DAINAVVF-LTDGKNEDLNGISLEKLLDSITPEPGQQGVRIFTISYGE---DADLKTMTQ 580 Query: 389 YEHLQSTFDNFAMQHIRDQDDIYPVFREL 417 + D I VF + Sbjct: 581 IAEATNAAAY----DASDPQSIDEVFEAV 605 >UniRef50_C0CX78 Putative uncharacterized protein n=1 Tax=Clostridium asparagiforme DSM 15981 RepID=C0CX78_9CLOT Length = 547 Score = 63.6 bits (153), Expect = 1e-08, Method: Composition-based stats. Identities = 35/218 (16%), Positives = 66/218 (30%), Gaps = 11/218 (5%) Query: 191 LEENLAIISNSEPAQLLEEERLRKEIAELRAKIERVPFIDTFDLRYKNYEKRPDPSSQAV 250 E N A + + AQ + + + + + L + Sbjct: 24 AETNQAFVERAYQAQEGQMDVICAIPGQEGNGENFQAMLGEQSLPVLSVSTAEQSGLPKT 83 Query: 251 MFCLMDVSGSMDQSTKDMAKRFYILLYLFLSRTYKNVEVVYIRHHTQA-----KEVDEHE 305 ++CL+DVSGSM + K + L+ V T + +E + + Sbjct: 84 IYCLVDVSGSMKGR-MEQVKETLTAISGGLNENDNLVIGKMGNQITDSAFLSGQEEIKAQ 142 Query: 306 FFYSQETG-GTIVSSALKLMDEVVKERYNPAQWNIYAAQASDG--DNWADDSPLCHEILA 362 Q TG T + S L + +++ + SDG D A + Sbjct: 143 IDSLQYTGEDTDLYSGLIHGLKFLQQE-PEVKTLRALVVLSDGCDDQGAGSTWKEAYDAV 201 Query: 363 KKLLPVVRYYSYIEITRRAHQTL-WREYEHLQSTFDNF 399 +K V + I + Q + + +F Sbjct: 202 EKADIPVYTVAVILSEKDYEQAKELGSFARNSAGGLHF 239 >UniRef50_B2AQN8 Predicted CDS Pa_4_3600 n=1 Tax=Podospora anserina RepID=B2AQN8_PODAN Length = 648 Score = 63.6 bits (153), Expect = 1e-08, Method: Composition-based stats. Identities = 21/201 (10%), Positives = 46/201 (22%), Gaps = 41/201 (20%) Query: 240 EKRPDPSSQAVMFCLMDVSGSMDQST-----------------KDMAKRFYILLYLFLSR 282 + R + +DVSGSM D+ + + L Sbjct: 62 DLRERNHVPLDLVLSIDVSGSMGADAPVPAKNGTEGEHYGLSVLDLVRHAAKTILETLDD 121 Query: 283 TYKNVEVVYIRHHTQAKEVD----------EHEFFYSQETGGTIVSSALKLMDEVV---- 328 + V + +E+ + Q T + ++ + Sbjct: 122 HDRLGIVTFSTSSKVVRELTYMTPANKAKILKQLDALQPLSMTNLWHGIRDGLSLFNNNL 181 Query: 329 ----KERYNPAQWNIYAAQASDG-DNWADDSPLCHEILAK--KLLPVVRYYSYIEITRRA 381 R + +DG N + L + L + + + Sbjct: 182 KAVNDRRNPGSGRVPALLVLTDGMPNHQCPNQGYVAKLRQWSTLPASIHTFGFGY---SL 238 Query: 382 HQTLWREYEHLQSTFDNFAMQ 402 L + + +F Sbjct: 239 RSGLLKSIAEVGGGNYSFIPD 259 >UniRef50_B9HP09 Predicted protein n=13 Tax=cellular organisms RepID=B9HP09_POPTR Length = 786 Score = 63.6 bits (153), Expect = 1e-08, Method: Composition-based stats. Identities = 31/197 (15%), Positives = 60/197 (30%), Gaps = 25/197 (12%) Query: 170 LQNSLARRTAMTAGKRRELHALEENLAIISNSEP----------AQLLEEERLRKEIAEL 219 Q + RR K E+ I P A L +K E Sbjct: 500 AQQAQRRRGKAGRAKNVIFS--EDRGRYIKPMLPKGPVKRLAVDATLRAAAPYQKLRKEK 557 Query: 220 RAKIERVPFIDTFDLRYKNYEKRPDPSSQAVMFCLMDVSGSMDQSTKDMAK-RFYILLYL 278 + R +++ D+R K ++ + A++ ++D SGSM + AK LL Sbjct: 558 DTQKSRKVYVEKTDMRAKRMARK----AGALVIFVVDASGSMALNRMQNAKGAALKLLAE 613 Query: 279 FLSRTYKNVEVVY-------IRHHTQAKEVDEHEFFYSQETGGTIVSSALKLMDEVVKER 331 + + + + + +++ + GG+ ++ L V Sbjct: 614 SYTSRDQVAIIPFRGDAAEVLLPPSRSISMARKRLERLPCGGGSPLAHGLTTAVRVGLNA 673 Query: 332 YNPAQWNIY-AAQASDG 347 +DG Sbjct: 674 EKSGDVGRIMIVAITDG 690 >UniRef50_UPI0001C161B1 von Willebrand factor, type A Precursor n=2 Tax=Nostocaceae RepID=UPI0001C161B1 Length = 474 Score = 63.6 bits (153), Expect = 1e-08, Method: Composition-based stats. Identities = 19/164 (11%), Positives = 36/164 (21%), Gaps = 20/164 (12%) Query: 251 MFCLMDVSGSMDQSTKDMAKRFYILLYLFLSRTYKNVEVVYIRHH-------TQAKEVDE 303 + L+D S SM K + + VV T Sbjct: 52 IVLLIDTSSSMSDGKLAEVKTAASQFIQRRNLESDQIAVVNFGATVQTPAPLTNDINTLN 111 Query: 304 HEFFYSQETGGTIVSSALKLMDEVVKERYNPAQWNIYAAQASDGDNWADDSPLCHEILAK 363 + E G T + + + ++ N +DG DD + Sbjct: 112 NAIDQLLEIGSTPMGEGINTAQDQLQAT----TLNKNIILFTDG--LPDDPNFAYNSALS 165 Query: 364 KLLPVVRYYSY--IEITRRA-----HQTLWREYEHLQSTFDNFA 400 ++ + Y + F+ Sbjct: 166 VRNAGIKLIAVATGGADTNYLTQITGDRSLVFYANSGQFDQAFS 209 >UniRef50_Q46D40 Putative uncharacterized protein n=3 Tax=Methanosarcina RepID=Q46D40_METBF Length = 612 Score = 63.6 bits (153), Expect = 1e-08, Method: Composition-based stats. Identities = 41/296 (13%), Positives = 86/296 (29%), Gaps = 28/296 (9%) Query: 121 KDEYLDLLFEDLA----LPNLKQNQQRQLTEYKTHRAGYTANGVPANISVVRSLQNSLAR 176 +E + +L + L L L + + HR + A + S + + Sbjct: 292 IEELIPVLEDHLEMLEILSMLFPGRAWDYSLKALHREYFGNLEKYAALLRKSSAIHEILE 351 Query: 177 RT---------AMTAGKRRELHALEENLAIISNSEPAQLLEEERLRKEIAELRAKIERVP 227 + + + + + Sbjct: 352 QVGRIELEYGSKKLSLSPYSKSEVHSVTFSGDLRTLLPAETVKLKNPLLKRKFYADMLEG 411 Query: 228 FIDTFDLRYKNYEKRPD-PSSQAVMFCLMDVSGSMDQSTKDMAKRFYILLYLFLSRTYKN 286 + T+ L+ +N+ + + L+D S SM S + +AK + + + ++ Sbjct: 412 KLLTYQLKGENWNSDSAGKKRKGPVVALVDTSASMRGSPELLAKAVVLAVTRRMLTENRD 471 Query: 287 VEVVYIRH--HTQAKEVDEH--------EFFYSQETGGTIVSSALKLMDEVVKERYNPAQ 336 V+V+ T E+ EF GGT ++AL+ + +K Sbjct: 472 VKVILFSSKWQTVEIELTNKKRMGEEFLEFLKFTFGGGTDFNTALRAGLKAMKNEKAFEG 531 Query: 337 WNIYAAQASDGDNWADDSPLCHEILAKKLLPVVRYYSYIEITRRAHQTLWREYEHL 392 + +DG + + PL E K R +S I ++ Sbjct: 532 AD--LLFLTDGYSELSEKPLIREWNEIKAERRARIFSLII--GNYDAGGLQQISDH 583 >UniRef50_D1YP15 von Willebrand factor type A domain protein n=1 Tax=Veillonella parvula ATCC 17745 RepID=D1YP15_9FIRM Length = 675 Score = 63.2 bits (152), Expect = 2e-08, Method: Composition-based stats. Identities = 45/307 (14%), Positives = 88/307 (28%), Gaps = 37/307 (12%) Query: 140 NQQRQLTEYKTHRAGYTANGVPANISVVRSLQNSLARRTAMTAGKRRELHALEENLAIIS 199 Q T + +R G + + A+ +L ++ A Sbjct: 376 KLQSSKTVDRQYRKGSGRRLMTKTKDTRGRMIRVYQDEHAL-----EDLALIDTLRAAAP 430 Query: 200 NSEPAQLLEEERLRKEIA--------ELRAKIERVPFIDTFDLRYKNYEKRPD------- 244 + L+ + ++ P + DL+ + +P Sbjct: 431 YQRLRAVERVTPLQADTPLNVVERSVTSECLCQKAPQTEHHDLKGLSIVVKPQDYRRKAR 490 Query: 245 -PSSQAVMFCLMDVSGSMDQSTKDMAKR--FYILLYLFLSRTYKNVEVVYIRHH------ 295 A ++D SGSM + A + LL +V+ + Sbjct: 491 EKRIGAYQLFVVDASGSMAARHRMEATKGAILSLLRDSYVHRDSVGLIVFRKDSAEVLLP 550 Query: 296 -TQAKEVDEHEFFYSQETGGTIVSSALKLMDEVVKE-RYNPAQWNIYAAQASDGDNWADD 353 T++ E E G T ++ L++ + + I +DG + D Sbjct: 551 FTRSVERAERLLAKLPTGGKTPLAKGLRVAYTMCDRLLRRHSAERIQMICITDGRATSGD 610 Query: 354 --SPLCHEILAKKLLPVVRYYSYIE--ITRRAHQTLWREYEHLQSTFDNFAMQHIRDQDD 409 +P+ ++L + + T L +E L + +AM I D Sbjct: 611 SENPVAEAKQWARILGTLPVDCIVIDTETGFIKLGLAKELCKLMNGSY-YAMDSITS-DS 668 Query: 410 IYPVFRE 416 I V R Sbjct: 669 ILRVSRR 675 >UniRef50_UPI0001BC3853 von Willebrand factor type A n=1 Tax=Butyrivibrio crossotus DSM 2876 RepID=UPI0001BC3853 Length = 623 Score = 63.2 bits (152), Expect = 2e-08, Method: Composition-based stats. Identities = 25/218 (11%), Positives = 57/218 (26%), Gaps = 39/218 (17%) Query: 227 PFIDTFDLRYKNYEKRPDPSSQAVMFC----LMDVSGSMDQST---KDMAKRFY------ 273 + + ++ + K + + L+D SGSM + + K Sbjct: 71 YMVVDKEEWFEAWNKEIKYPNSNNIVFDTVILIDCSGSMRTNDPDFEYSVKNTLYPGSSY 130 Query: 274 -----------ILLYLFLSRTYKNVEVVYIRHHTQAKEVDE------HEFFYSQETGGTI 316 + V++ E+ + GGT Sbjct: 131 QITTCYRKLASKNYVKAQGNDDRTGIVLFTSEANTVCELTNSEYVLMNAIDKIYSNGGTN 190 Query: 317 VSSALKLMDEVVKERYNPAQWNIYAAQASDGDNWADDSPLCHEILAKKLLPVVRYYSYIE 376 ++A+K ++ SDG++ S ++ + + + Y Sbjct: 191 FNNAIKESIRILT-NTRNDSEKR-ILLVSDGESELSSS--VIDLAIENNIKINTVY---- 242 Query: 377 ITRRAHQTLWREYEHLQSTFDNFAMQHIRDQDDIYPVF 414 I + + L + F + +IY Sbjct: 243 IGGQNNNELLKNVAERTGGKY-FKAVTADELINIYSEI 279 >UniRef50_A6G2Y5 Putative uncharacterized protein n=1 Tax=Plesiocystis pacifica SIR-1 RepID=A6G2Y5_9DELT Length = 516 Score = 63.2 bits (152), Expect = 2e-08, Method: Composition-based stats. Identities = 45/339 (13%), Positives = 86/339 (25%), Gaps = 30/339 (8%) Query: 83 VQNDRIERPQGGGGGSGSGQGQASQDGEGQDEFVFQISKDEYLDLLFEDLALPNLKQNQQ 142 + +D G G G + E D+ E D ED Sbjct: 13 LDDDDSVADGYGDEGETQGADEGEDSTETGDDTGTSDDSSEASDT-GEDDGAGETGPTDP 71 Query: 143 RQLTEYKTHRAGYT--ANGVPANISVVRSLQNSLARRTAMTAGKRRELHALEENLAIISN 200 L E AG + + AN+ + +L R + L Sbjct: 72 GMLAEACDPNAGQSHGYDLAVANLELAPALVREAVLFGDGKV--PRIPLSPRPFLNHFDF 129 Query: 201 SEPAQLLEEERLRKEIAELRAKIERVPFIDTFDLRYKNYEKRPDPSSQAVMFCLMDVSGS 260 L ++ + F K P + ++D+ S Sbjct: 130 GYAPSPGASPSLSGQLWPVGPANAEGVERYRFQFAVKAPALSPQQRPPVDLAIVVDLGPS 189 Query: 261 MDQSTKDMAKRFYILLYLFLSRTYKNVEVVYIRHHTQAKEVDEHEFFYSQETGGTI---- 316 M + + + L + V I +A ++ + G T Sbjct: 190 MAGEPLVLVEEALAAIESALGPGDR---VTLIAAGEEAMDLAAADIEGF---GSTPLTGL 243 Query: 317 -----------VSSALKLMDEVVKERYNPAQW-NIYAAQASDGDNWADDSPLCHEILAKK 364 + +A++L ++ ++ + + S+G + SP + ++ Sbjct: 244 INPEEAFGYAKLEAAIELAYASLESPWSGDEIGHRRVLLLSNGH--FEVSPALADTVSSA 301 Query: 365 LLPVVRYYSYIEITRR-AHQTLWREYEHLQSTFDNFAMQ 402 S T + RE L FA Sbjct: 302 AAEDRHLVSVATGTHELYADSALRELGRLGQGSAVFAPT 340 >UniRef50_A9BS02 von Willebrand factor type A n=1 Tax=Delftia acidovorans SPH-1 RepID=A9BS02_DELAS Length = 536 Score = 63.2 bits (152), Expect = 2e-08, Method: Composition-based stats. Identities = 27/224 (12%), Positives = 49/224 (21%), Gaps = 31/224 (13%) Query: 201 SEPAQLLEEERLRKEIAELRAKIERVPFIDTFDLRYKNYEKRPDPSSQAVMFCLMDVSGS 260 + + + + L + R +S ++DVSGS Sbjct: 295 RPASPEAQASPALPTAPVVELSFPNRLEVIDAVLSAYQSDLRRPATS----IFVLDVSGS 350 Query: 261 MDQSTKDMAKRFYILL-----------YLFLSRTYKNVEVVYIR----------HHTQAK 299 M + K LL Y + + + + + Sbjct: 351 MKGARLAQMKEALKLLSGAEASAASQRYAAFQARERVLLIPFSGLVGQPARVQFAAGDLQ 410 Query: 300 EVDEHEF---FYSQETGGTIVSSALKLMDEVVKERYNPAQWNIY-AAQASDGDNWADDSP 355 GGT + AL L + ++ +DG N A Sbjct: 411 AASAQVLAYADSLVADGGTAIYDALTLAQQQARQELRADPERFVSIVLLTDGANTAGRDW 470 Query: 356 LCHEILAKKLLPVVRYY--SYIEITRRAHQTLWREYEHLQSTFD 397 E + + I A + L Sbjct: 471 AAFEREQRMARDGGAPLVRVFPIIFGEAQSGEMQALAALTGGRA 514 >UniRef50_A2AR69 Novel protein similar to vertebrate inter-alpha (Globulin) inhibitor H family (Plasma Kallikrein-sensitive glycoprotein) (ITIH) (Fragment) n=10 Tax=Clupeocephala RepID=A2AR69_DANRE Length = 860 Score = 63.2 bits (152), Expect = 2e-08, Method: Composition-based stats. Identities = 42/345 (12%), Positives = 91/345 (26%), Gaps = 40/345 (11%) Query: 110 EGQDEFVFQISKDEYLDLLFEDLALPNLKQNQQRQLTEYKTHRAGYTANGVPANISVVRS 169 F +S +E L L +L + + + G IS +R+ Sbjct: 147 PPGARVSFSLSYEELLSRRLGRYEL-SLGLRPGQPVQNLSLEVSISERTG----ISFIRA 201 Query: 170 L--QNSLARRTAMTAGKRRELHALEENLAIISNSEPAQLLEEERLRKEIAELRAKIERVP 227 L L A A + + Q + A+ + + Sbjct: 202 LVLFLFLIDLLADAEAPPSTKVKQNAYCAHVRYTPSIQQQRNVSPKGLSADFIIQYDVEL 261 Query: 228 FIDTFDLRYKN------YEKRPDPSSQAVMFCLMDVSGSMDQSTKDMAKRFYILLYLFLS 281 D++ + + R P + ++D+SGSM + K + + L Sbjct: 262 KDPMGDIQVDDGYFVHYFAPRGLPVVPKDVIFVIDISGSMIGTKIKQTKAAMVSILSDLR 321 Query: 282 RTYKNVEVVYI------RHHTQAKEVDEHEFF------YSQETGGTIVSSALKLMDEVVK 329 + + + + ++ G T +++AL +++ Sbjct: 322 EGDYFNLITFSDDVHTWKKDRTVRATRQNVRDAKEFVRKIIAAGWTNINAALLSAAKLLN 381 Query: 330 ---------ERYNPAQWNIYAAQASDGD-NWADDSPLCHEILAKKLLPVVRYYSYIEITR 379 R +Q +DG+ + A+K L +V + Sbjct: 382 PSTRSSSSTGRAPSSQRVPMIIFLTDGEATIGETETDVILHNAQKSLGLVSLFGLAF-GD 440 Query: 380 RAHQTLWREYEHLQSTFDNFAMQHIRDQDDIYPVFRELFHKQNAT 424 A + R + DD + + + Sbjct: 441 DADFPMLRRLALENRGVARMVY----EDDDAAIQLKGFYDEVATP 481 >UniRef50_Q2JAM5 Protoporphyrin IX magnesium-chelatase n=13 Tax=Bacteria RepID=Q2JAM5_FRASC Length = 788 Score = 63.2 bits (152), Expect = 2e-08, Method: Composition-based stats. Identities = 28/187 (14%), Positives = 50/187 (26%), Gaps = 15/187 (8%) Query: 179 AMTAGKRRELHALEENLAIISNSEPAQLLEEERLRKEIAELRAKIERVPFIDTFDLRYKN 238 A + +++ L R + + D R Sbjct: 520 ATGRRSPARGSR-GALVTTAADAPGLHLPATLLAAAPFQAARGRCGPGLVLVPADRRGAV 578 Query: 239 YEKRPDPSSQAVMFCLMDVSGSMDQSTKDMAKRF--YILLYLFLSRTYKNVEVVY----- 291 R ++ ++D SGSM + LL R + + + Sbjct: 579 RVGRE----GNLVLFVVDASGSMAARARMTLVTTAVLALLVDAYQRRDRIGMITFRGSGA 634 Query: 292 --IRHHTQAKEVDEHEFFYSQETGGTIVSSALKLMDEVVKERYNPAQWNIYAA-QASDGD 348 + T + EV G T +++ L L EV++ +DG Sbjct: 635 EVVLAPTSSVEVGAARLRALPTGGRTPLAAGLGLAGEVLRAERRRDPTRRALLVVVTDGR 694 Query: 349 NWADDSP 355 A D P Sbjct: 695 ATAGDDP 701 >UniRef50_A9AUC9 von Willebrand factor type A n=1 Tax=Herpetosiphon aurantiacus ATCC 23779 RepID=A9AUC9_HERA2 Length = 579 Score = 63.2 bits (152), Expect = 2e-08, Method: Composition-based stats. Identities = 25/171 (14%), Positives = 48/171 (28%), Gaps = 17/171 (9%) Query: 234 LRYKNYEKRPDPSSQAVMFCLMDVSGSMDQST-------KDMAKRFYILLYLFLSRTYKN 286 + ++ + ++D SGSM S D AK + L Sbjct: 344 IEGFSWVNLSRVQDPLNIMLVIDTSGSMGPSKEGLTDGGLDAAKIAALDFIDHLPSNANV 403 Query: 287 VEVVYIRHHTQAKEVD------EHEFFYSQETGGTIVSSALKLMDEVVKERYNPAQWNIY 340 + + T + + G T + AL + ++ A+ + Sbjct: 404 GLIHFGTLVTVDHSLTNDIGAVRQSISELKPEGQTAIYDALAISYTQLRR----AKGQTF 459 Query: 341 AAQASDGDNWADDSPLCHEILAKKLLPVVRYYSYIEITRRAHQTLWREYEH 391 SDG + A I+AK + Y + L + + Sbjct: 460 IVLISDGADTASKGDNYDSIVAKATKANIPTYIIGLTSPEFDGQLLEDLQR 510 >UniRef50_Q0A603 von Willebrand factor, type A n=1 Tax=Alkalilimnicola ehrlichii MLHE-1 RepID=Q0A603_ALHEH Length = 972 Score = 63.2 bits (152), Expect = 2e-08, Method: Composition-based stats. Identities = 36/351 (10%), Positives = 83/351 (23%), Gaps = 30/351 (8%) Query: 88 IERPQGGGGGSGSGQGQASQDGEGQDEFVFQISKDEYLDLLFEDLALPNLKQNQQRQLTE 147 + G + + G + + ++ + + Q + Sbjct: 164 PDGCGNGCDYDMLADAEGAGYTLGHEWGHYVLALYDEYEGRDPAENRDTFPQVGDVPTSP 223 Query: 148 YKTHRAGYTANGVPANISVVRSLQNSLARRTAMTAGKRRELHALEENLAIISNSEPAQLL 207 + G ++ S TA + + E + + Sbjct: 224 AIMNSQWQARGGNYEWLNHSTSDNIGDPEDTAQGRVYGKSGWEVLVQPTTDDPQEGNETV 283 Query: 208 EEERLRKEIAELRAKIERV----PFIDTFDLRYKNYEKRPDPSSQAVMFCLMDVSGSMDQ 263 + +R R E A +D D ++ + + ++D SGSM Sbjct: 284 QPDRTRYTALEAVAPTAADNWVVTQLDQMDHGCRDELEIVWMDDDLEISLIVDTSGSMSG 343 Query: 264 STKDMAKRFYILLYLFLSRTYKNVEVVYIRHHTQAKEVD--------------EHEFFYS 309 + A+ L + + VV + Sbjct: 344 APIINARTAGRTLVDVVEPGRTAMGVVRFSASASVVHPMIAIPDPGTAEKDQLKDAIDSL 403 Query: 310 QETGGTIVSSALKLMDEVVKERYNPAQWN--IYAAQASDGDNWADDSPLCHEILAKKLLP 367 +G T + L L + +++ + A SDG D+S E + Sbjct: 404 PASGLTAMFDGLILGLDELQDYSAANDTDAGQVAFLLSDG---GDNSSAATEPQTVQAYQ 460 Query: 368 VVRY----YSYIEITRRAHQTLWREYEHLQSTFDNFAMQHIRDQDDIYPVF 414 + Y + R + + + + + Sbjct: 461 DANVPIIAFGYGSFAPT---GVLRRLADNTGGEFFASPTTLAEIQEAFLAA 508 >UniRef50_Q2SB91 Uncharacterized protein containing a von Willebrand factor type A (VWA) domain n=1 Tax=Hahella chejuensis KCTC 2396 RepID=Q2SB91_HAHCH Length = 261 Score = 63.2 bits (152), Expect = 2e-08, Method: Composition-based stats. Identities = 19/147 (12%), Positives = 43/147 (29%), Gaps = 24/147 (16%) Query: 248 QAVMFCLMDVSGSMDQ-------STKDMAKRFYILLYLFLSRTYKNVEVVYIRH------ 294 + D SGSMD D+AK L + Sbjct: 73 NYYIVF--DGSGSMDNTDCGDGKRKLDVAKTAVKKFVEQLPADANVGVYAFDGQGVGERT 130 Query: 295 --HTQAKEVDEHEFFYSQETGGTIVSSALK---LMDE-VVKERYNPAQWNIYAAQASDG- 347 TQ + + + GGT +S+ L+ ++ ++++ +DG Sbjct: 131 HLATQNRPLVKQMIDQLVAGGGTPLSAGLEDGKAALTAQAGKQLGYGEYHLVI--ITDGL 188 Query: 348 DNWADDSPLCHEILAKKLLPVVRYYSY 374 + ++ + + + + + Sbjct: 189 ASLGYETDGAVQTILQDTPINIHTIGF 215 >UniRef50_UPI0000F2D28F PREDICTED: hypothetical protein n=1 Tax=Monodelphis domestica RepID=UPI0000F2D28F Length = 998 Score = 63.2 bits (152), Expect = 2e-08, Method: Composition-based stats. Identities = 35/208 (16%), Positives = 61/208 (29%), Gaps = 27/208 (12%) Query: 232 FDLRYKNYEKRPDPSSQAVMFCLMDVSGSMDQSTKDMAKRFYILLYLFLSRTYKNVEVVY 291 DL+ +Y R + L+D SGSM + K IL+ L T + + Sbjct: 386 PDLQQTHYSLRK---THGEFIFLIDRSGSMSGVNMHLVKDAMILILKSLMPTCLFNIIGF 442 Query: 292 IR-----------HHTQAKEVDEHEFFYSQET-GGTIVSSALKLMDEVVKERYNPAQWNI 339 + + + GGT + S LK + + Sbjct: 443 GSTFKTLFPSSQVYSEDNLVSACKNIQHLRADMGGTNILSPLK----WITRQPIHEGHPR 498 Query: 340 YAAQASDGDNWADDSPLCHEILAKKLLPVVRYYSYIEITRRAHQTLWREYEHLQSTFDNF 399 DG + ++ L + R YS+ I +A L + + F Sbjct: 499 LLFLLIDG---SVNNTGKVIELLRNNASTTRCYSFG-IGPKACPRLVQGLAAVSRGSAEF 554 Query: 400 AMQHIRDQDDIYPVFRELFHKQNATAKG 427 +R + + P + K A Sbjct: 555 ----LRQGERLQPKMIKSLKKAMAPVLS 578 >UniRef50_A0LFJ6 Protoporphyrin IX magnesium-chelatase n=1 Tax=Syntrophobacter fumaroxidans MPOB RepID=A0LFJ6_SYNFM Length = 612 Score = 63.2 bits (152), Expect = 2e-08, Method: Composition-based stats. Identities = 50/350 (14%), Positives = 100/350 (28%), Gaps = 49/350 (14%) Query: 53 SIPTEDISEPMFHQGRGGLRHRVHPGNDHFVQNDRIERPQGGGGGSGSGQGQASQDGEGQ 112 +P I + R + G G+ +A DG+G Sbjct: 262 VVPLALIHR---RRSAAPPREEEMREQQSQIPRAEEREQAGDSSRGGAEAREAPADGDGS 318 Query: 113 DEFVFQISKDEYLDLLFEDLALPNLKQNQQRQLTEYKTHRAGYTANGVPANISVVRSLQN 172 S++ R G + +R ++ Sbjct: 319 QAHSSPKSRE--------------------------GHSREEVFPVGDSFKVKRMRFAKD 352 Query: 173 SLARRTAMTAGKRRELHALEENLAIISNSEPAQLLEEERLRKEIA-ELRAKIERVPFIDT 231 + R + R + I ++ + + LR ++ + Sbjct: 353 RIERNASGRRTNTRFSGKAGRYVGSILRAKKLDVAVDATLRAAAPWQILRGRTGNLIVSR 412 Query: 232 FDLRYKNYEKRPDPSSQAVMFCLMDVSGSMDQSTKDMAKR---FYILLYLFLSRTYKNVE 288 DLR+K EK+ ++ +D SGSM + + M + +L + K Sbjct: 413 EDLRFKRREKK----MGHLVVFAVDCSGSM-GARRRMIETKGAVLSMLTDCYHKRDKVSL 467 Query: 289 VVY-------IRHHTQAKEVDEHEFFYSQETGGTIVSSALKLMDEVVKERYNPA-QWNIY 340 + + + T + E+ G T + +AL + +++ I Sbjct: 468 IAFRKDGAEVVLPPTSSVELASRRLAEIPVGGKTPLPAALVAIYNLIRRVLIKEPALRIV 527 Query: 341 AAQASDG-DNWADDSPLCHEILAK--KLLPVVRYYSYIEITRRAHQTLWR 387 AA SDG N E + + LL ++ +I + + R Sbjct: 528 AAVVSDGRANQGITGIPVPEEVERCAGLLKGMKNTDFIVVDTEDKSGIMR 577 >UniRef50_C6JMG8 Magnesium chelatase n=1 Tax=Fusobacterium varium ATCC 27725 RepID=C6JMG8_FUSVA Length = 632 Score = 63.2 bits (152), Expect = 2e-08, Method: Composition-based stats. Identities = 28/228 (12%), Positives = 76/228 (33%), Gaps = 16/228 (7%) Query: 197 IISNSEPAQLLEEERLRKEI-AELRAKIERVPFIDTFDLRYKNYE-KRPDPSSQAVMFCL 254 I ++ P + + I A + + +++ ++ K + + A + + Sbjct: 396 YIKSTLPKGKIRDFAFDATIRAAAPYQKKNKENNLMINIKKEHIRVKVREKRTGASILFV 455 Query: 255 MDVSGSMDQSTK-DMAKRF-YILLYLFLSRTYKNVEVVY-------IRHHTQAKEVDEHE 305 +D SGSM + + K LL + + V + + T++ ++ + + Sbjct: 456 VDSSGSMGVKKRMEAVKGAVMSLLKDAYEKRDRVGMVSFRRDKAEELLPITRSIDLAQKK 515 Query: 306 FFYSQETGGTIVSSALKLMDEVVKERYNPAQWNI-YAAQASDGDN----WADDSPLCHEI 360 G T ++ + ++K + + SDG D Sbjct: 516 LEKLATGGKTPLAEGIAKAYTIIKNEMRKDKEVVPLIVFLSDGKGNFSASGKDPVKESLE 575 Query: 361 LAKKLLPVVRYYSYIEITRRAHQTLWREYEHLQSTFDNFAMQHIRDQD 408 +A+K+ I+ + + + + ++++R +D Sbjct: 576 MAEKIKNEGIRAIVIDTEEGFIKLEMAKTLSEAMKAEYYKLENLRSED 623 >UniRef50_UPI000185CA78 von Willebrand factor type A n=1 Tax=Capnocytophaga sputigena ATCC 33612 RepID=UPI000185CA78 Length = 347 Score = 62.9 bits (151), Expect = 2e-08, Method: Composition-based stats. Identities = 24/162 (14%), Positives = 46/162 (28%), Gaps = 19/162 (11%) Query: 247 SQAVMFCLMDVSGSMDQSTKDMAKRFYILLYLFLSRT---YKNVEVVYIRHHTQAKEVDE 303 + ++ L+DVS SM + + + L + V + I ++K + Sbjct: 2 RRLPIYFLLDVSESMVGDPIEHVQDGMATIIKELKADPFALETVWLSIIGFAGKSKVITP 61 Query: 304 -HEFF-----YSQETGGTIVSSALKLMDEVVKERYN------PAQWNIYAAQASDGDNWA 351 + GGT ++S L + + W +DG Sbjct: 62 LQDIITFYPPKIPIGGGTSLASGLNELMNAIDREVVKTTLERKGDWKPLVFLFTDG--IP 119 Query: 352 DDSPL-CHEILAKKLLPVVRYYSYIEITRRAHQTLWREYEHL 392 D P E V + I + + L + Sbjct: 120 TDDPAQAIERWNAHYRRKVNLVA-ISLGENTNYNLLGQLTDQ 160 >UniRef50_UPI000180D2ED PREDICTED: similar to predicted protein n=1 Tax=Ciona intestinalis RepID=UPI000180D2ED Length = 983 Score = 62.9 bits (151), Expect = 2e-08, Method: Composition-based stats. Identities = 27/190 (14%), Positives = 50/190 (26%), Gaps = 30/190 (15%) Query: 222 KIERVPFIDTFDLRYKNYEKRPDPSSQAVMFCLMDVSGSMDQ-----STKDMAKRFYILL 276 ++P + D R++ + L+D S SM+ D+A+ +L Sbjct: 178 PSHKIPNCTSIDPRFRPWYTETAWPKPKRFLILLDSSRSMENTFNSKPMIDIARELIDIL 237 Query: 277 YLFLSRTYKNVEVVY---------------IRHHTQAKEVDEHEFFYSQETGGTIVSSAL 321 L K + + KE G + + A Sbjct: 238 LETLRPNDKISAIGFRHEALRSQGCFRNQLAFASETNKEKLRSFLRNITPMGESSYTVAF 297 Query: 322 KLMDEVVKERYNP-----AQWNIYAAQASDGDNWAD-----DSPLCHEILAKKLLPVVRY 371 + +++++ Y SDG D E K+ V Sbjct: 298 QSAFQLLEQDYIKYKNKSDTEKYVILLISDGQPKEAYGRMQDVYSIIEQQNLKINNSVSI 357 Query: 372 YSYIEITRRA 381 +SY Sbjct: 358 FSYAIGRNGH 367 >UniRef50_A0D1M1 Chromosome undetermined scaffold_34, whole genome shotgun sequence n=1 Tax=Paramecium tetraurelia RepID=A0D1M1_PARTE Length = 1460 Score = 62.9 bits (151), Expect = 2e-08, Method: Composition-based stats. Identities = 23/135 (17%), Positives = 48/135 (35%), Gaps = 11/135 (8%) Query: 249 AVMFCLMDVSGSMDQSTKDMAKRFYILLYLFLSRTYKNVEVVYIRHHTQAKEVDEHEFFY 308 A ++D SGSM+ + + AK+ + + + V I + QA+ V ++E Sbjct: 1274 AHYILILDDSGSMEGAFFEAAKKGLVAFLQEIQKN-PESRVTIILFNHQARCVVDYEIPD 1332 Query: 309 SQE--------TGGTIVSSALKLMDEVVKERYNPAQWNI-YAAQASDGD-NWADDSPLCH 358 +Q GGT LKL + + + ++ +DG + + Sbjct: 1333 AQVQQKEIQFRGGGTDFDEPLKLAFDKIANNPDFDNFSSHSIFFYTDGQAQYPTKAMEKV 1392 Query: 359 EILAKKLLPVVRYYS 373 + + + Sbjct: 1393 KQFPSDKREKIELVA 1407 >UniRef50_A9F2Q0 Putative uncharacterized protein n=1 Tax=Sorangium cellulosum 'So ce 56' RepID=A9F2Q0_SORC5 Length = 521 Score = 62.9 bits (151), Expect = 2e-08, Method: Composition-based stats. Identities = 27/219 (12%), Positives = 56/219 (25%), Gaps = 16/219 (7%) Query: 219 LRAKIERVPFIDTFDLRYKNYEKRPDPSSQAVMFCLMDVSGSMDQSTKDMAKRFYILLYL 278 R F++R R + + +D SGSM + A + Sbjct: 78 ARLPRSAQETFLMFEVRGDGSPARSLAQANLSLV--IDRSGSMKGTRLTNAVQAATTAVS 135 Query: 279 FLSRTYKNVEVVY------IRHHTQAKEVDEHEFF----YSQETGGTIVSSALKLMDEVV 328 L+ V + + T G T +S ++ ++ Sbjct: 136 RLNDGDVVSVVTFDTRTSVVVPPTTVGPETRGRILASVRGISLGGDTCISCGIEEGLSLL 195 Query: 329 KERYNPAQWNIYAAQASDGD-NWADDSPLCHEILAKKLLPVVRYYSYIEITRRAHQTLWR 387 + A + SDGD N +A++ + I + ++ + Sbjct: 196 GQTS--AGVSR-MLVLSDGDANHGVRDVPGFRAMAQRARDRGVAITTIGVDVDYNEKILS 252 Query: 388 EYEHLQSTFDNFAMQHIRDQDDIYPVFRELFHKQNATAK 426 + F +L + A+ Sbjct: 253 AIALDSNGRHYFVENDAALARIFEAEAEQLTTSVASGAE 291 >UniRef50_A7BVG3 von Willebrand factor type A domain protein n=1 Tax=Beggiatoa sp. PS RepID=A7BVG3_9GAMM Length = 280 Score = 62.9 bits (151), Expect = 2e-08, Method: Composition-based stats. Identities = 27/151 (17%), Positives = 55/151 (36%), Gaps = 23/151 (15%) Query: 251 MFCLMDVSGSMDQSTKDMAKRFYILLYLFLSRTY-KNVEVVYIRHHTQAKEVDE------ 303 +F L+DVS SMD S AK+ F+ ++ + + I ++AK + Sbjct: 91 VFLLIDVSYSMDGSALAEAKQAAQ---EFVRKSDLAHTAIGLIEFGSKAKIISGLTQNAK 147 Query: 304 ---HEFFYSQETGGTIVSSALKLMDEVVKERYNPAQWNIYAAQASDGDNWADDSPLCHEI 360 + G T ++ L +K +P + +DG + P + Sbjct: 148 HLYKAINRLKTNGSTNMTEGLTTAYLKLKNVDDP----RFIILLTDG---LPNHPKNTQQ 200 Query: 361 LAKKLL-PVVRYYSYIEITRRAHQTLWREYE 390 +A+++ + + T A +T + Sbjct: 201 IAQEICADGIELITIG--TGDADKTYLQSLA 229 >UniRef50_A9GBN0 Putative uncharacterized protein n=1 Tax=Sorangium cellulosum 'So ce 56' RepID=A9GBN0_SORC5 Length = 940 Score = 62.9 bits (151), Expect = 2e-08, Method: Composition-based stats. Identities = 29/188 (15%), Positives = 54/188 (28%), Gaps = 17/188 (9%) Query: 223 IERVPFIDTFDLRYKNYEKRPDPSSQAVMFCLMDVSGSMDQSTKDMAKRFYILLYLFLSR 282 R +R N K+ PS + MD SGSM +MAK LS Sbjct: 450 WYRTTIERILPVRMDNERKKDMPSVAMALV--MDRSGSMTGLPLEMAKAAAKATAGVLSS 507 Query: 283 TYKNVEVVYIRHHT--------QAKEVDEHEFFYSQETGGTIVSSALKLMDEVVKERYNP 334 + + T + + E Q GGT + SAL + + Sbjct: 508 DDLIEVIAFDSAPTRYVKMQPARNRSRIAGEIARIQPGGGTEIFSALDAAYQDMT---VT 564 Query: 335 AQWNIYAAQASDGDNWADDSPLCHEILAKKLLPVVRYYSYIEITRRAHQTLWREYEHLQS 394 + +DG + ++++ + + + + L + + Sbjct: 565 QARKKHVILLTDGK---ASTGGIRDLVSAMIAESITVTTVGLGN-DLDEQLLKMIADVGG 620 Query: 395 TFDNFAMQ 402 + Sbjct: 621 GRFHAVPD 628 >UniRef50_B2UZB2 von Willebrand factor type A domain protein n=1 Tax=Clostridium botulinum E3 str. Alaska E43 RepID=B2UZB2_CLOBA Length = 984 Score = 62.9 bits (151), Expect = 2e-08, Method: Composition-based stats. Identities = 25/155 (16%), Positives = 46/155 (29%), Gaps = 17/155 (10%) Query: 201 SEPAQLLEEERLRKEIAELRAKIERVPFIDTFDLRYKNYEKRPDPSSQAVMFCLMDVSGS 260 +P+ +E + E I+ I D + +K + ++D SGS Sbjct: 50 DKPSFDVEITSVTPENPVAGEDIKIDGKIIPSDFKCNIPKKE--------IVLVLDTSGS 101 Query: 261 MDQSTKDMAKRFYILLYLFLS--RTYKNVEVVYIR------HHTQAKEVDEHEFFYSQET 312 M S K + + V Y ++ +E + Sbjct: 102 MKDSKIKKMKNAAMEFVNKIKKIPNLDIDIVTYSTSGYTYLNNGNTEEDLLKIINSIKAD 161 Query: 313 GGTIVSSALKLMDEVVKERYNPAQWNIYAAQASDG 347 GGT L+ + ++ N + SDG Sbjct: 162 GGTNTGEGLRKANYILDLEKNKNA-DKSIVFMSDG 195 >UniRef50_UPI00005843FB PREDICTED: hypothetical protein n=1 Tax=Strongylocentrotus purpuratus RepID=UPI00005843FB Length = 429 Score = 62.9 bits (151), Expect = 2e-08, Method: Composition-based stats. Identities = 20/181 (11%), Positives = 56/181 (30%), Gaps = 24/181 (13%) Query: 251 MFCLMDVSGSMDQSTKDMAKRFYILLYLFLSRTYKNVEVVYI------RHHTQA----KE 300 + ++DVS SM + K + L+ T + + R + + + Sbjct: 165 IVFVIDVSASMYGTKLSQTKEALKTMLDNLNPTDYFNIITFSDGVQYWRENNRLAPAQRR 224 Query: 301 VDEHE---FFYSQETGGTIVSSALKLMDEVV--KERYNPAQWNIY--AAQASDGD-NWAD 352 + ++ T ++ A+ E++ + RYN ++Y +DG + Sbjct: 225 YMDDAMAYVDSLRDDSETNLNEAIVKAGELLDSEARYNRPGDSVYSMMILLTDGRPSVGT 284 Query: 353 DSPLCHEILAKKL---LPVVRYYSYIEITRRAHQTLWREYEHLQSTFDNFAMQHIRDQDD 409 A+++ + + + L + + + + + Sbjct: 285 TDQQEILDNAREVIAGKHSLNILGFGRL---VDFDLLVKLAYENNGTAKMIYEGTTAAEQ 341 Query: 410 I 410 + Sbjct: 342 L 342 >UniRef50_UPI00016C09D7 hypothetical protein Epulo_01596 n=1 Tax=Epulopiscium sp. 'N.t. morphotype B' RepID=UPI00016C09D7 Length = 262 Score = 62.9 bits (151), Expect = 2e-08, Method: Composition-based stats. Identities = 23/156 (14%), Positives = 48/156 (30%), Gaps = 18/156 (11%) Query: 240 EKRPDPSSQAVMFCLMDVSGSMDQSTKDMAKRFYILLYLFL-----SRTYKNVEVVYIRH 294 E P + +F ++D SGSM + L +++ + Sbjct: 6 EIDEMPRKELHVFYVLDTSGSMTGVPIAALNTAMEECTVALKDLAKKNADAKLKIAVLEF 65 Query: 295 HTQAKEVD---------EHEFFYSQETGGTIVSSAL-KLMDEVVKERYNPAQWN---IYA 341 T AK V E E+ + G T + +AL +L ++ + + + Sbjct: 66 STGAKWVTYNGPESLDDEFEWEHLSAGGVTDIGAALRELDIKLSRNGFLKSMTGALMPVI 125 Query: 342 AQASDGDNWADDSPLCHEILAKKLLPVVRYYSYIEI 377 +DG + + E+ + + Sbjct: 126 IFMTDGYPTDEYAAALAELRKNRWYTSSTKIGFAIG 161 >UniRef50_A9L314 Outer membrane adhesin like proteiin n=5 Tax=Shewanella RepID=A9L314_SHEB9 Length = 1215 Score = 62.9 bits (151), Expect = 2e-08, Method: Composition-based stats. Identities = 35/286 (12%), Positives = 69/286 (24%), Gaps = 29/286 (10%) Query: 156 TANGVPANISVVRSLQNSLARRTAMTAGKRRELHALEENLAIISNSEPAQLLEEERLRKE 215 NG ++ + RTA + + + + R R Sbjct: 216 ARNGDMKWLNHSTASNIGDVNRTAQGRVYGKSAWEVLAQDVKDDPKSGRKTAQPTRTRYT 275 Query: 216 IAELRAKIERVPFIDTFDLRYKNYEKRPDP-----SSQAVMFCLMDVSGSMDQSTKDMAK 270 A P +L + R M +MD SGSM S D AK Sbjct: 276 TLANNAPDANNP--VKKELPAAQFSCRDQLDFVWVEGDIDMQIVMDRSGSMFGSPIDNAK 333 Query: 271 RFYILLYLFLSRTYK-NVEVVYIRHHTQ---------------AKEVDEHEFFYSQETGG 314 + +L + V + + K+ + G Sbjct: 334 QAAKILVDATAEGSTAMGLVSFSGRSSVKQDFAMQKMPKPDNGVKQALKGAIDNIYANGS 393 Query: 315 TIVSSALKLMDE--VVKERYNPAQWNIYAAQASDG-DNWADDSPLCHEILAKKLLPVVRY 371 T + +L + + + +DG DN + S + + Sbjct: 394 TALFDGSQLALDNLSAYQASAASGAPGVVFVLADGDDNNSIKSESSVITAYQNANVPIFS 453 Query: 372 YSYIEITRRAHQTLWREYEHLQSTFDNFAMQHIRDQDDIYPVFREL 417 + Y + + + + + D + + Sbjct: 454 FGYGSASPT---GPLVTMANATGGKYFSSPTTLAEIIDAFLQANAI 496 >UniRef50_C1XWJ2 von Willebrand factor type A-like protein n=1 Tax=Meiothermus silvanus DSM 9946 RepID=C1XWJ2_9DEIN Length = 308 Score = 62.9 bits (151), Expect = 2e-08, Method: Composition-based stats. Identities = 31/195 (15%), Positives = 46/195 (23%), Gaps = 28/195 (14%) Query: 243 PDPSSQAVMFCLMDVSGSMDQST-----KDMAKRFYILLYLFLSRTYKNVEVVYIRHH-- 295 P P QA + +DVSGSM D AK + K V + Sbjct: 80 PTPDEQAGVVLAIDVSGSMMADDLKPSRLDAAKAAARSFVERMPAGVKVGLVSFAAGAVL 139 Query: 296 ----TQAKEVDEHEFFYSQETGGTIVSSALKLMDEVVKERYNPAQWNIY-----AAQASD 346 T + + T + L + + + SD Sbjct: 140 ESGLTADHQGVIERIDLLERRANTAIGEGLLESL----KAFPTGANHQVAVPATVILLSD 195 Query: 347 GDNWADDSPLCHEILAKKLLPVVRYYSYIEITRR-------AHQTLWREYEHLQSTFDNF 399 G N +P AK+ V + R + F Sbjct: 196 GRNRIGIAPQEAAQEAKRRGVRVYTIGVGSDDPNASVDWAGFDEAELRGIAEVTGGRY-F 254 Query: 400 AMQHIRDQDDIYPVF 414 A +IY Sbjct: 255 AADSADRLQEIYREL 269 >UniRef50_C7G046 von Willebrand factor A domain-containing protein DDB_G0286969 n=1 Tax=Dictyostelium discoideum RepID=Y6969_DICDI Length = 2079 Score = 62.9 bits (151), Expect = 2e-08, Method: Composition-based stats. Identities = 21/192 (10%), Positives = 52/192 (27%), Gaps = 23/192 (11%) Query: 245 PSSQAVMFCLMDVSGSMDQSTKDMAKRFYILLYLFLSRTYKNVEVVYIR----------- 293 P ++ + L+DVS SM+ AK+ LS+ + + Sbjct: 349 PVVESELIFLVDVSESMEGYNMKHAKKALHRFLHSLSKDTYFNIISFASSHRKLFAQSVK 408 Query: 294 HHTQAKEVDEHEFFYSQE--TGGTIVSSALKLMDEVVKERYNPAQWNIYAAQASDGDNWA 351 ++ + + + G T + LK + A +DG Sbjct: 409 YNDENLKAATAYVESLKAISHGETNLLEPLKDIY------SVDATCPRKIFLLTDGR--- 459 Query: 352 DDSPLCHEILAKKLLPVVRYYSYIEITRRAHQTLWREYEHLQSTFDNFAMQHIRDQDDIY 411 ++ L ++ + + L + S +++ + + Sbjct: 460 VNNIGPIVDLVRQNAHNTSVFPIGMGE-FVSRQLVEYIANAGSGVAELVIENETIESKVM 518 Query: 412 PVFRELFHKQNA 423 + + Sbjct: 519 RQLKRALQPAMS 530 >UniRef50_UPI000180B9AB PREDICTED: similar to CLCA family member 1, chloride channel regulator n=1 Tax=Ciona intestinalis RepID=UPI000180B9AB Length = 1001 Score = 62.9 bits (151), Expect = 2e-08, Method: Composition-based stats. Identities = 21/118 (17%), Positives = 41/118 (34%), Gaps = 12/118 (10%) Query: 241 KRPDPSSQAVMFCLMDVSGSMDQSTKDMAKRF-YILLYLFLSRTYKNVEVVYIRHH---- 295 + P ++DVSGSM M ++ + LS K V + Sbjct: 226 RVVKPPQNRRFVVVLDVSGSMRGKRLLMMRQSTSEFISSLLSDGDKIGIVQFHSFAQTLL 285 Query: 296 ---TQAKEVDEHEF---FYSQETGGTIVSSALKLMDEVVKERYNPAQWNIYAAQASDG 347 + D + F ++ G T + ++ + + ER +P + + +DG Sbjct: 286 PIRHVNSQTDRFDICSRFPNRTGGSTCIGCGIQAAMQEM-ERDDPTEPCGHIVVLTDG 342 >UniRef50_C3XQR7 Putative uncharacterized protein n=1 Tax=Branchiostoma floridae RepID=C3XQR7_BRAFL Length = 1460 Score = 62.9 bits (151), Expect = 2e-08, Method: Composition-based stats. Identities = 30/188 (15%), Positives = 56/188 (29%), Gaps = 38/188 (20%) Query: 223 IERVPFIDTFDLRYKNYEKRPDPSSQAVMFCLMDVSGSMDQST-------KDMAKRFYIL 275 + + D R++N+ + + +MDVSGSM + ++AK+ + Sbjct: 1139 PKNDRSCEGNDHRFRNWYVSAASPKKKNVVIVMDVSGSMREPPGPEEQNRLNLAKQAALT 1198 Query: 276 LYLFLSRTYKNVEVVYIRHHTQAKEVDEHEF-FYSQETGGTIV----------------- 317 + L+ V + E E E T + Sbjct: 1199 VLDTLTPRDWGGVVSFSARA----ETPEGCLGDSLGEANPTNIGIMKDFINQRVPETITM 1254 Query: 318 -SSALKLMDEVVKERYNP------AQWNIYAAQASDGDNWADDSPLCHEILAKKLL-PVV 369 + ++ E N +NI SDG D L ++L+ V Sbjct: 1255 YGVGFRKAFDMFAEARNKKPEQFEDCYNIIIF-LSDGSPTDKDFALNAITQGQELMDRSV 1313 Query: 370 RYYSYIEI 377 ++Y Sbjct: 1314 YIFTYGLG 1321 Score = 59.0 bits (141), Expect = 3e-07, Method: Composition-based stats. Identities = 28/190 (14%), Positives = 55/190 (28%), Gaps = 41/190 (21%) Query: 223 IERVPFIDTFDLRYKNYEKRPDPSSQAVMFCLMDVSGSMDQST--------KDMAKRFYI 274 + + R++N+ + + +MDVSGSM + ++AK+ + Sbjct: 182 PKSDRSCEGNGHRFRNWYVSAASPKKKNVVIVMDVSGSMREPHGVPEEQNRLNLAKQAAL 241 Query: 275 LLYLFLSRTYKNVEVVYIRHHTQAKEVDEHEFFYSQETGGTIVS---------------- 318 + L+ V + +AK + E T + Sbjct: 242 TVLDTLTPRDWAGVVSF---SARAKAPEGCLGDSLGEANPTNIGIMKDFINQRVPETITV 298 Query: 319 --SALKLMDEVVKERYNP------AQWNIYAAQASDGDNWADDSPLCHEILAKK---LLP 367 K + E N NI +DG D+ + + K + Sbjct: 299 YAEGFKKAFNMFFESKNKKPEQFEDCQNIIIF-LTDGQ--PTDTYFTLDDIVKGQDLMER 355 Query: 368 VVRYYSYIEI 377 V ++Y Sbjct: 356 SVHIFTYGLG 365 >UniRef50_UPI000178810F von Willebrand factor type A n=1 Tax=Geobacillus sp. Y412MC10 RepID=UPI000178810F Length = 421 Score = 62.9 bits (151), Expect = 2e-08, Method: Composition-based stats. Identities = 26/183 (14%), Positives = 60/183 (32%), Gaps = 21/183 (11%) Query: 251 MFCLMDVSGSMDQSTKDMAK-RFYILLYLFLSRTYKNVEVVYIRHHTQAKEVDE------ 303 + ++D SGSM+++ + + L + R + +V+ T + Sbjct: 115 IVLVIDNSGSMNETDPNQDRYTAAKNLINRMDRDNRVSVMVFDHATTLLQPFTRVKNQET 174 Query: 304 -----HEFFYSQE-TGGTIVSSALKLMDEVVKERYNPAQWNIYAAQASDGDNWADDSPLC 357 E GGT +S AL+ ++E + + + SDG +++ Sbjct: 175 KDEIIAEIDGLATNDGGTDISLALEDTMSHIQESRDAGRSAMVI-MLSDG--FSETDHDR 231 Query: 358 HEILAKKLLPVVRYYSYIEITRRAHQTLWREYEHLQSTFDNFAMQHIRDQDDIYPVFREL 417 K+ V + L + ++ +D+ VF+++ Sbjct: 232 VLAEYKQQQIAVNTIGLSLVNPD-GAQLLQTIAAETGGQYY----DVQHAEDLSFVFQKI 286 Query: 418 FHK 420 + Sbjct: 287 YDD 289 >UniRef50_A7C1J8 von Willebrand factor, type A n=1 Tax=Beggiatoa sp. PS RepID=A7C1J8_9GAMM Length = 478 Score = 62.9 bits (151), Expect = 2e-08, Method: Composition-based stats. Identities = 32/204 (15%), Positives = 57/204 (27%), Gaps = 29/204 (14%) Query: 234 LRYKNYEKRPDPSSQAVMFCLMDVSGSMDQSTKDMAKRFYILLYLFLSR-----TYKNVE 288 L+ P P + L+D SGSM + TK + F+ R N + Sbjct: 30 LKTTQPPSIPLPPHD--VILLIDTSGSMAEGTK--LQEVQAAAIQFIQRRHGLTHLANNK 85 Query: 289 VVYIRHHTQAKEVD---------EHEFFYSQETGGTIVSSALKLMDEVVKERYNPAQWNI 339 + + +A V E + GGT + L+ + Sbjct: 86 IAVVGFGGRAYLVANLTSDLMNLEQPIQKLRAVGGTPMDRGLQSAMNQLSA--GSDSEQR 143 Query: 340 YAAQASDGDNWADDSPLCHEILAKKLLPVVRYYSYIEITRRAHQTLWREYEHLQSTFDNF 399 +DG D+ A +L+ T A L + Sbjct: 144 SILLFTDGK---PDNQRTTLN-ASQLVKNANIQIVAIATDDADIGLLTQVT----GDAAL 195 Query: 400 A-MQHIRDQDDIYPVFRELFHKQN 422 + + D + + ++QN Sbjct: 196 VFPTSVGNFDQAFQKAEQAIYEQN 219 >UniRef50_D2RGP5 von Willebrand factor type A n=1 Tax=Archaeoglobus profundus DSM 5631 RepID=D2RGP5_ARCPR Length = 430 Score = 62.5 bits (150), Expect = 3e-08, Method: Composition-based stats. Identities = 25/127 (19%), Positives = 48/127 (37%), Gaps = 7/127 (5%) Query: 239 YEKRPDPSSQAVMFCLMDVSGSMDQSTKDMAKRFYILLYLFLSRTYKNVEVVYI---RHH 295 K ++ + L+D SGSM A+ + +Y S + + + HH Sbjct: 262 LTKEKLSIAEGAYYILVDKSGSMVGEKTVWARSVALAIYRMASLKRRRYFLRFFDKKTHH 321 Query: 296 --TQAKEVDEHEFFYSQETGGTIVSSALKLMDEVVKERYNPAQWNIYAAQASDGDNWADD 353 + EV + GGT +++AL+ + + ER N +DG++ +D Sbjct: 322 LLSDPHEVV-DAILKVKSNGGTDITNALRTAVKDLVERGLSDLTN-TIVIITDGEDVVED 379 Query: 354 SPLCHEI 360 + Sbjct: 380 LSKDLKK 386 >UniRef50_B9EIV3 Cacna2d4 protein n=1 Tax=Mus musculus RepID=B9EIV3_MOUSE Length = 1091 Score = 62.5 bits (150), Expect = 3e-08, Method: Composition-based stats. Identities = 39/306 (12%), Positives = 82/306 (26%), Gaps = 26/306 (8%) Query: 136 NLKQNQQRQLTEYKTHRAGYTANGVPANISVVRSLQNSLARRTAMTAGKRR-ELHALEEN 194 N + L E H + N +++ + ++ N +E Sbjct: 158 NYVELGAEFLLESDAHFSNLRVNVSMSSVQLPTNVYNKDPDILNGVYMSEALNPVFVENF 217 Query: 195 LAIISNSEPAQLLEEERLRKEIAELRAKIERVPFIDTFDLRYKNYEKRPDPSSQAVMFCL 254 + + R E + FD R + + +S + L Sbjct: 218 QRDPTLTWQYFGSSTGFFRIYPGIKWMPDENG--VIAFDCRNRGW-YIQAATSPKDIVIL 274 Query: 255 MDVSGSMDQSTKDMAKRFYILLYLFLSRTYKNVEVVY---------------IRHHTQAK 299 +D+SGSM +AK + L + Y ++ + Sbjct: 275 VDISGSMKGLRMAIAKHTITTILDTLGENDFVNIIAYNDYVHYIEPCFKGILVQADRDNR 334 Query: 300 EVDEHEFFYSQETGGTIVSSALKLMDEVV---KERYNPAQWNIYAAQASDGDNWADDSPL 356 E + G +VS AL E++ +E + N +DG A + Sbjct: 335 EHFKQLVDELMVKGVGVVSQALIEAFEILKQFQESKQGSLCNQAIMLITDG---AVEDYE 391 Query: 357 CHEILAKKLLPVVRYYSYIEITRRAHQTLWREYEHLQSTFDNFAMQHIRDQDDIYPVFRE 416 VR ++Y+ + + + + D + + Sbjct: 392 PVFETYNWPDRKVRVFTYLIGREVTFADRMKWIACNNKGYYT-QISTLADAQESVMEYLH 450 Query: 417 LFHKQN 422 + + Sbjct: 451 VLSRPM 456 >UniRef50_C7NN24 von Willebrand factor type A n=1 Tax=Halorhabdus utahensis DSM 12940 RepID=C7NN24_HALUD Length = 592 Score = 62.5 bits (150), Expect = 3e-08, Method: Composition-based stats. Identities = 29/264 (10%), Positives = 59/264 (22%), Gaps = 42/264 (15%) Query: 175 ARRTAMTAGKRRELHALEENLAIISNSEPAQLLEEERLRKEIAEL---RAKIERVPFIDT 231 R + +E A E T Sbjct: 113 RRNVEEEYLPLPDSLPVEGLFYNYYFDTGGTGECSSLFCPSYATAITADPLGESTGRYFT 172 Query: 232 FDLRYKNYEKRPDPSSQAVMFCLMDVSGSMDQS----------------------TKDMA 269 L + + + ++D+SGSM +A Sbjct: 173 VGLN-STLDTSTFERKRLDVVIVLDISGSMGSQFDQYYYDRFGNRHTVEEGDSRSKMAVA 231 Query: 270 KRFYILLYLFLSRTYKNVEVVYIRHHTQAK------EVDEHEF-----FYSQETGGTIVS 318 K + L L + V++ T AK D + GGT ++ Sbjct: 232 KDALVALTEQLHPDDRVGVVLFNNEPTVAKPLRDVETTDMDAIRGHIREDIEAGGGTNIA 291 Query: 319 SALKLMDEVVKERYNPA---QWNIYAAQASDG-DNWADDSPLCHEILAKKLLPVVRYYSY 374 + +++ E + +D N + + S+ Sbjct: 292 DGMAEAADMLGEYADSDPTEAETRQIV-ITDAMPNTGQTDDQALQDRLAGYAEDGIHTSF 350 Query: 375 IEITRRAHQTLWREYEHLQSTFDN 398 + + + L E ++ Sbjct: 351 VGVGVDFNPELVDEITAVRGANYR 374 >UniRef50_P33352 Uncharacterized protein yehP n=69 Tax=root RepID=YEHP_ECOLI Length = 378 Score = 62.5 bits (150), Expect = 3e-08, Method: Composition-based stats. Identities = 43/372 (11%), Positives = 94/372 (25%), Gaps = 48/372 (12%) Query: 56 TEDISEPMFHQGRGGLRHRVHPGNDHFVQNDRIE-----RPQGGGGGSGSGQGQASQDG- 109 T ++ G ++ + +E P+ G SG S Sbjct: 10 TRELQRWRLILGEAAETTLCGLDDNARQIDHALEWLYGRDPERLQRGERSGGLGGSNLTT 69 Query: 110 -EGQDEFVFQISKDEYLDLLFEDLALPNLKQNQQRQLTEYKT--HRAGYTANGVPANISV 166 E + + E L + + + R + + + A + Sbjct: 70 PEWINSIHTLFPQQVI-----ERLESDAVLRYGIEDVVTNLDVLERMQPSESLLRAVLHT 124 Query: 167 VRSLQNSL---ARRTAMTAGKRRELHALEENLAIIS------NSEPAQLLEEERLRKEIA 217 + + ARR + +E S L + + Sbjct: 125 KHLMNPEVLAAARRIVCQVVEEIMARLAKEVRQAFSGVRDRRRRSFIPLARNFDFKSTLR 184 Query: 218 ELRAKIERVP-FIDTFDLRYKNYEKRPDPSSQAVMFCLMDVSGSMDQSTKDMAKRFYILL 276 + R+ + S Q + L+D SGSM S A L Sbjct: 185 ANLQHWHPQHGKLYIESPRFN--SRIKRQSEQWQLVLLVDQSGSMVDSVIHSAVMAACL- 241 Query: 277 YLFLSRTYKNVEVVYIRHHTQAKEVDEHEFF------YSQETGGTIVSSALKLMDEVVKE 330 + + + T ++ Q GGT ++SA++ +++++ Sbjct: 242 --WQLPGIR---THLVAFDTSVVDLTADVADPVELLMKVQLGGGTNIASAVEYGRQLIEQ 296 Query: 331 RYNPAQWNIYAAQASDGDNWADDSPLCHEILAKKLLPVVRYYSYIEITRR----AHQTLW 386 SD S L + K + ++ + + Sbjct: 297 PAKS-----VIILVSD-FYEGGSSSLLTHQVKKCVQSGIKVLGLAALDSTATPCYDRDTA 350 Query: 387 REYEHLQSTFDN 398 + ++ + Sbjct: 351 QALVNVGAQIAA 362 >UniRef50_C1TQB2 Uncharacterized protein n=1 Tax=Dethiosulfovibrio peptidovorans DSM 11002 RepID=C1TQB2_9BACT Length = 225 Score = 62.5 bits (150), Expect = 3e-08, Method: Composition-based stats. Identities = 27/130 (20%), Positives = 51/130 (39%), Gaps = 18/130 (13%) Query: 240 EKRPDPSSQAVMFCLMDVSGSMDQSTKDMAKRFYILLYLFLSRTYK---NVEVVYIRHHT 296 E +P+ + + ++DVSGSM + + R L + L + EV I Sbjct: 12 EMVENPTPRVPVSLVLDVSGSMLGAPIEELNRGVELFFKSLKDDDVARYSAEVSVISFSN 71 Query: 297 QA-KEVD-----EHEFFYSQETGGTIVSSALKLMDEVVKER------YNPAQWNIYAAQA 344 + +EVD + + + G T + A+ L E +++R + + Sbjct: 72 EVTQEVDFGPLEKCDIPELKAIGKTRMGGAVSLALESLEKRKELYRTLGVDYYQPWMVIM 131 Query: 345 SDG---DNWA 351 +DG D+W Sbjct: 132 TDGKPNDDWQ 141 >UniRef50_A7RFL6 Predicted protein n=1 Tax=Nematostella vectensis RepID=A7RFL6_NEMVE Length = 981 Score = 62.5 bits (150), Expect = 3e-08, Method: Composition-based stats. Identities = 26/217 (11%), Positives = 58/217 (26%), Gaps = 52/217 (23%) Query: 232 FDLRYKNYEKRPDPSSQAVMFCLMDVSGSMDQS-TKDMAKRFYILLYLFLSRTYKNVEVV 290 FD R++++ + + ++D S SM + +A++ + + L K V Sbjct: 199 FDPRFQSWYVEAVTRMRTNIVVVIDRSSSMSTAGRMALARQAAVTVLDTLGPNDKVGVVA 258 Query: 291 YIR--------HHTQAKEVDEHEFFYSQET-----------------------GGTIVSS 319 + E + G T Sbjct: 259 FSHFIIKPPGCFGGNVAEALPKNINRIKAWVEALTPRGKVSLQKTNLRYVSFPGATKYVP 318 Query: 320 ALKLMDEVVKERYN-------------PAQWNIYAAQASDGDNWADDSPLCHEILAK--- 363 AL+ E++ +N + N +DGD + + + + Sbjct: 319 ALEAAFEMLGGDFNIKILHHPLIALIKRSAEN-MILFLTDGDPFDRNPDVSIFEAIRIGQ 377 Query: 364 KLL---PVVRYYSYIEITRRAHQTLWREYEHLQSTFD 397 + L + Y E + ++ L + Sbjct: 378 RKLAFPARINVYGLGESLNIDNLNRLKQIASLNNGTF 414 >UniRef50_Q64D90 Cell surface protein n=8 Tax=environmental samples RepID=Q64D90_9ARCH Length = 1359 Score = 62.5 bits (150), Expect = 3e-08, Method: Composition-based stats. Identities = 39/307 (12%), Positives = 81/307 (26%), Gaps = 51/307 (16%) Query: 150 THRAGYTANGVPANISVVRS---LQNSLARRTAMTAGKRRELHALEENLAIISNSEPAQL 206 ++ +GY+ + NI N+ E + Sbjct: 848 SYFSGYSTSFSK-NIGFSTGGAKDVNNFRENIGNDYLPLPTDITYEGLFYDYYFDTGEKA 906 Query: 207 LEEERLRKEIAELRAKI---ERVPFIDTFDLRYKNYEKR-PDPSSQAVMFCLMDVSGSM- 261 + + +K E + + + L E + +D+SGSM Sbjct: 907 ECQNLFCPSYSYALSKDPVSEVLGYYLSVGLNSGIIESDFQRKKLNLALV--LDISGSMG 964 Query: 262 ------------------DQSTKDMAKRFYIL-----LYLFLSRTYKNVEVVYIRHHTQA 298 D + +K L L + V++ A Sbjct: 965 SSFDEYYYDRFGNHVAVNDTEDAEKSKIEIAAAAIVALLDHLEDDDRLGLVLFNTGAELA 1024 Query: 299 KEVD----------EHEFFYSQETGGTIVSSALKLMDEVVKERYNPAQ---WNIYAAQAS 345 + V + + TGGT +S+ +++ E+ E Q N + Sbjct: 1025 EPVSLVGAKNMQKLKGDVLEISATGGTRLSAGMQMATELYDEFLEVNQSEYENRIIF-LT 1083 Query: 346 DG-DNWADDSPLCHEILAKKLLPVVRYYSYIEITRRAHQTLWREYEHLQSTFDN--FAMQ 402 D N S + + Y ++I I + L ++ + Sbjct: 1084 DAMPNSGQTSEESLLGMIEANANKNVYTTFIGIGVDFNTELVEYITKIRGANYYSVHSAT 1143 Query: 403 HIRDQDD 409 +++ D Sbjct: 1144 QFKERMD 1150 >UniRef50_UPI00004D9B6D UPI00004D9B6D related cluster n=2 Tax=Xenopus (Silurana) tropicalis RepID=UPI00004D9B6D Length = 994 Score = 62.5 bits (150), Expect = 3e-08, Method: Composition-based stats. Identities = 31/192 (16%), Positives = 62/192 (32%), Gaps = 14/192 (7%) Query: 238 NYEKRPDPSSQAVMFCLMDVSGSMDQSTK-DMAKRFYILLYLFLSRTYKNVEVVY---IR 293 + +Q L+D SGSM + M ++ L+ + + ++ + Sbjct: 98 QXAQTDLRKAQGEFIFLLDRSGSMSGAALFPMVRQEPPLVLTQVQSSDSLYFLLLGCSLS 157 Query: 294 HHTQAKEVDEHEFFYSQET-GGTIVSSALKLMDEVVKERYNPAQWNIYAAQASDGDNWAD 352 H ++ + + GGT + S L + R + +DG A Sbjct: 158 HGQESVAIACDSIKRLRADMGGTNILSPLNWIFRQPVCR----GYPRLLFLLTDG---AV 210 Query: 353 DSPLCHEILAKKLLPVVRYYSYIEITRRAHQTLWREYEHLQSTFDNFAMQHIRDQDDIYP 412 + L + R YS+ I + A + L + + F + R Q + P Sbjct: 211 SNTGKVIELIRHHSSFTRCYSFG-IGQNACRRLVQGVASVSKGSAEFLSEGERLQPKV-P 268 Query: 413 VFRELFHKQNAT 424 + + N Sbjct: 269 ERERVLAEPNMG 280 >UniRef50_B1L6Y8 von Willebrand factor type A n=1 Tax=Candidatus Korarchaeum cryptofilum OPF8 RepID=B1L6Y8_KORCO Length = 328 Score = 62.5 bits (150), Expect = 3e-08, Method: Composition-based stats. Identities = 44/257 (17%), Positives = 80/257 (31%), Gaps = 27/257 (10%) Query: 170 LQNSLARRTAMTAGKRRELHALEENLAIISNSEPAQLLEEERLRKEIAELRAKIERVPFI 229 ++ R A K + ++ S E L K I + K ++V Sbjct: 90 SRSLFKRLIAKIVIKISSGDSRGFSVNSKEYSVAYSPGMEFDLEKTIERMIEKCKKV--- 146 Query: 230 DTFDLRYKNYEKRPDPSSQAVMFCLMDVSGSMDQSTKDMAKRFYILLYLFLSRTYK---- 285 ++RY++ + ++D SGSM +A + L Sbjct: 147 --DEMRYEDIVASDKRKRDKSLIMILDSSGSMTGKKILIAMMIAAIASHKLRSGRYGVVG 204 Query: 286 -NVEVVYIRHHTQAKEVDE--HEFFYSQETGGTIVSSALKLMDEVVKERYNPAQWNIYAA 342 N I+ + K+ + E G T +S LK E+ NP Sbjct: 205 FNSTAFVIKSPAENKDSVKVIEEILDLVPIGYTNISDGLKKGLEISYHLKNPKY-----L 259 Query: 343 QASDGD-NWADDSPLCHEILAKKLLPVVRYYSYIEITRRAHQTLWREYEHLQSTFDNFAM 401 +DG+ N +D A++ + ++ + R L +E + F Sbjct: 260 LITDGEYNVGEDPRKV----ARRFKNLCVIHTRGKRDSR-GSVLCKEIARIG-GSKYFV- 312 Query: 402 QHIRDQDDIYPVFRELF 418 I D I V + + Sbjct: 313 --IDDIKQIQRVMKSIL 327 >UniRef50_C1XTM1 Uncharacterized protein containing a von Willebrand factor type A (VWA) domain n=2 Tax=Deinococci RepID=C1XTM1_9DEIN Length = 464 Score = 62.5 bits (150), Expect = 3e-08, Method: Composition-based stats. Identities = 24/213 (11%), Positives = 55/213 (25%), Gaps = 45/213 (21%) Query: 234 LRYKNYEKRPDPSSQAVMFCLMDVSGSMDQ---------------------------STK 266 L+ + + Q V+ ++D SGSM + S Sbjct: 29 LKIRPSAEATRSRPQLVVAFVVDTSGSMREVVTEPTERTGQSVRVDGKDYEVVRGAKSKI 88 Query: 267 DMAKRFYILLYL--FLSRTYKNVEVVY-----IRHH-TQAKEVDE--HEFFYSQE-TGGT 315 D+ L L + + V + + T A E + +GGT Sbjct: 89 DLVIEALQNLLSSPQLQPSDRLAIVKFDDVAEVVQPFTPANEKARLVAAAERLTQYSGGT 148 Query: 316 IVSSALKLMDEVVKERYNPAQWNIYAAQASDGDNWADDSPLCHEILAKKLLPVVRYYSYI 375 + + ++ +++ + +DG + + + V Sbjct: 149 QMGAGMREGMRLLEREAG----SRRLILLTDGQTFDEPLVETVAAQLAQARIPVTAIGVG 204 Query: 376 EITRRAHQTLWREYEHLQSTFDNFAMQHIRDQD 408 + + L E + ++ Sbjct: 205 D---EWNDDLLAEITDRTQGKPFHVIPDNQNPQ 234 >UniRef50_C1XUY0 Mg-chelatase subunit ChlD n=2 Tax=Meiothermus RepID=C1XUY0_9DEIN Length = 319 Score = 62.1 bits (149), Expect = 3e-08, Method: Composition-based stats. Identities = 22/202 (10%), Positives = 57/202 (28%), Gaps = 31/202 (15%) Query: 248 QAVMFCLMDVSGSMDQSTK-----DMAKRFYILLYLFLSRTYKNVEVVYIRHHTQAKEVD 302 + + +DVS SM + + A+ + L + + V + R T+ Sbjct: 85 RTTIVLALDVSRSMRATDVLPSRFEAAREALKVFIRELPQGARIGLVTFSRAATEVVAPT 144 Query: 303 EHE---FFYSQETG---GTIVSSALKLMDEVV----KERYNPA-QWNIYAAQASDGDNWA 351 + + G GT + + + + + + +DG + + Sbjct: 145 TNRQRLLDSVELIGLEFGTAIGEGILTSLQALPPLEQRKDAKDPSELATIILLTDGRSIS 204 Query: 352 DDSPLCHEILAKKLLPVVRYYSYIEITRR--------------AHQTLWREYEHLQSTFD 397 PL +A + + +T + + ++ + Sbjct: 205 GIDPLEAARIAAEQKVRIHTIGVGRVTEGPVPGLESVYQWAAYFDEDVLKQIAAITGGKY 264 Query: 398 NFAMQHIRDQDDIYPVFRELFH 419 F + + + Y + F Sbjct: 265 FFVNSAGKLR-ETYQQLSQSFV 285 >UniRef50_Q6VPP3 Parturition-related protein PRP3 n=6 Tax=Eutheria RepID=Q6VPP3_RAT Length = 923 Score = 62.1 bits (149), Expect = 3e-08, Method: Composition-based stats. Identities = 34/191 (17%), Positives = 58/191 (30%), Gaps = 32/191 (16%) Query: 236 YKNYEKRPDPSSQA---------VMFCL-MDVSGSMDQSTK--DMAKRFYILLYLFLSRT 283 +KN P S + CL +DVSGSM + M + L L Sbjct: 283 FKNSTPMEMPPSPPFFSLLRISERIVCLVLDVSGSMGSYDRLNRMNQAAKFFLQQILESR 342 Query: 284 YKNVEVVYIRHHTQAKEVDE--------HEFFYS--QETGGTIVSSALKLMDEVVKER-Y 332 V + T E+ + +GGT + S ++ +V K + Y Sbjct: 343 SWAGMVHFHSSATVKSELIQINSDVERNQLLETLPTSASGGTSICSGIRTAFQVFKNKGY 402 Query: 333 NPAQWNIYAAQASDGDNWADDSPLCHEILAKKLLPVVRYYSYIEITRRAHQTLWREYEHL 392 + SDG++ + C + K VV + + + ++ Sbjct: 403 QTGGND--ILLLSDGED--STAKDCLDE-VKDSGAVVHFIALGKAF----DQSISNMANV 453 Query: 393 QSTFDNFAMQH 403 FA Sbjct: 454 TGGKQLFATDE 464 >UniRef50_B8HUC4 von Willebrand factor type A n=7 Tax=Bacteria RepID=B8HUC4_CYAP4 Length = 254 Score = 62.1 bits (149), Expect = 4e-08, Method: Composition-based stats. Identities = 27/146 (18%), Positives = 51/146 (34%), Gaps = 17/146 (11%) Query: 236 YKNYEKRPDPSSQAVMFCLMDVSGSMDQSTKDMAKRFYILLYLFLSRT---YKNVEVVYI 292 + + +P + + L+D SGSM + L L K VEV + Sbjct: 39 FGTDDFANNPEPRCPVILLLDTSGSMRGTPIQELNAGVELFRDELLADALASKRVEVAIV 98 Query: 293 RH-HTQAKE--VDEHEFF--YSQETGGTIVSSALKLMDEVVKER---YNPAQ---WNIYA 341 Q + V F + T + +A++ ++++ R Y + + Sbjct: 99 GFGPVQVIQDFVTADYFNPPKLRAEADTPLGAAIETALDLLQSRKDTYKANGIAYYRPWV 158 Query: 342 AQASDG---DNWADDSPLCHEILAKK 364 +DG D+W + E +KK Sbjct: 159 FLITDGGPTDHWQTAARRVKEGESKK 184 >UniRef50_Q54MG4 von Willebrand factor A domain-containing protein DDB_G0285975 n=4 Tax=Dictyostelium discoideum RepID=Y5975_DICDI Length = 917 Score = 62.1 bits (149), Expect = 4e-08, Method: Composition-based stats. Identities = 21/177 (11%), Positives = 48/177 (27%), Gaps = 20/177 (11%) Query: 247 SQAVMFCLMDVSGSMDQSTKDMAKRFYILLYLFLSRTYKNVEVVYIRHHTQA-------- 298 ++ L+D SGSM AKR ++ L+ K + T+A Sbjct: 336 QKSEFIFLIDCSGSMSGEPIKKAKRALEIIIRSLNENCKFNIYCFGSRFTKAFDNSKMYN 395 Query: 299 ---KEVDEHEFFYSQET-GGTIVSSALKLMDEVVKERYNPAQWNIYAAQASDGDNWADDS 354 + GGT + ++ +++ + ++ +DG+ Sbjct: 396 DETLKEISGYVEKIDADLGGTELLPPIR---DILSTESDF-EYPRQLFILTDGE---VSE 448 Query: 355 PLCHEILAKKLLPVVRYYSYIEITRRAHQTLWREYEHLQSTFDNFAMQHIRDQDDIY 411 R ++Y L + + ++ + Sbjct: 449 RDSLINYVATESNNTRIFTYGIGNS-VDTELVIGLSKACKGYYEMIKDNSNFEEQVM 504 >UniRef50_B5YCB4 von Willebrand factor type A domain protein n=2 Tax=Dictyoglomus RepID=B5YCB4_DICT6 Length = 890 Score = 62.1 bits (149), Expect = 4e-08, Method: Composition-based stats. Identities = 30/204 (14%), Positives = 55/204 (26%), Gaps = 24/204 (11%) Query: 229 IDTFDLRYKNYEKRPDPSSQAVMFCLMDV-----SGSMDQSTKDMAKRFYILLYLFLSRT 283 I LR ++ S + ++D S S ++AK L+ L Sbjct: 375 ILPVTLR----PEQILKKSNVAIVIVLDASGSMGSYSGGDMKMELAKESAQLVLDLLEEK 430 Query: 284 YKNVEVVY--------IRHHTQAKEVDEHEFFYSQETGGTIVSSALKLMDEVVKERYNPA 335 + + KE GGT + LK E + + + Sbjct: 431 DYFGLIAFDHSYQWIVPLQPLTNKEETASLISKISPGGGTALYPPLKSAGEALIKAPIKS 490 Query: 336 QWNIYAAQASDGDNWADDSPLCHEILAKKLLPVVRYYSYIEITRRAHQTLWREYEHLQST 395 + +DG D + LAK + V E A+ L ++ + + Sbjct: 491 KH---IIAITDGQTEGGDFYNLVKYLAKYKIT-VSTIGIGE---DANIPLLKDIANWGNG 543 Query: 396 FDNFAMQHIRDQDDIYPVFRELFH 419 + + L Sbjct: 544 RFYHTWNIRNLPQLLLSETKALLR 567 >UniRef50_C0DBA5 Putative uncharacterized protein n=1 Tax=Clostridium asparagiforme DSM 15981 RepID=C0DBA5_9CLOT Length = 231 Score = 62.1 bits (149), Expect = 4e-08, Method: Composition-based stats. Identities = 21/168 (12%), Positives = 53/168 (31%), Gaps = 18/168 (10%) Query: 246 SSQAVMFCLMDVSGSMDQSTKDMAKRFYILLYLFLSRTYK---NVEVVYIRHHTQAKEVD 302 L+D SGSM ++ + + + L + +V I ++ + V Sbjct: 20 ERHIACVLLVDTSGSMAGASINELNQGLLEFGNALDQDEHARGVADVCVISFNSNVETVV 79 Query: 303 EH------EFFYSQETGGTIVSSALKLMDEVVKER------YNPAQWNIYAAQASDGDNW 350 G T ++ A+ + ++ER + + + +DG+ Sbjct: 80 PFCPAANYSAPTLSAGGLTSMNEAVIAGLDAIEERKQLYRQLGCSYYRPWMFLLTDGEPT 139 Query: 351 ADDSPLCHEILAKKLL--PVVRYYSYIEITRRAHQTLWREYEHLQSTF 396 + + ++ L V ++ + + L + Y + Sbjct: 140 DQNMEGEAKNRLQQALNDKKVNFFPMGIGSGANYAHL-KSYTKGGNGA 186 >UniRef50_UPI000186D791 calcium channel, putative n=3 Tax=Neoptera RepID=UPI000186D791 Length = 652 Score = 62.1 bits (149), Expect = 4e-08, Method: Composition-based stats. Identities = 29/172 (16%), Positives = 51/172 (29%), Gaps = 23/172 (13%) Query: 246 SSQAVMFCLMDVSGSMDQSTKDMAKRFYILLYLFLSRTYKNVEVVYIRHHTQ-------- 297 +S + L+D S SM K +AK ++ L + T+ Sbjct: 187 TSSKDIVILVDSSSSMGGKKKGIAKAIVNIILDTLGNNDFVNIYRFSESATEIVPCFKDV 246 Query: 298 -AKEVDEH------EFFYSQETGGTIVSSALKLMDEVVKERYNPAQW----NIYAAQASD 346 + E+ F + + G +SAL E++ RYN N +D Sbjct: 247 LVQATAENIRELRIAFDFVKYEGSANFTSALVTGFEIL-HRYNRTGQGCQCNQAIMLITD 305 Query: 347 GDNWADDSPLCHEILAKKLLPVVRYYSYIEITRRAHQTLWREYEHLQSTFDN 398 G S VR ++Y+ ++Q + Sbjct: 306 G---PSSSYKEIFKQYNWPHMPVRMFTYLVGKDGSNQEDMNWMACANKGYFA 354 >UniRef50_B2VYM4 von Willebrand domain containing protein n=3 Tax=Leotiomyceta RepID=B2VYM4_PYRTR Length = 906 Score = 62.1 bits (149), Expect = 4e-08, Method: Composition-based stats. Identities = 30/189 (15%), Positives = 56/189 (29%), Gaps = 24/189 (12%) Query: 223 IERVPFIDTFDLRYKNYEKRPDPSSQAVMFCLMDVSGSMDQSTKDMAKRFYILLYLFLSR 282 + L K + + D SGSM + D+AK+ + L Sbjct: 267 ENHPTIANHRALMATLVPKFSLRPETPEIVFVCDRSGSMQTA-IDLAKQALQVFLKSLPI 325 Query: 283 TYKNVEVVY-IRHH-------TQAKEVDEHEFFYSQET----GGTIVSSALKLMDEVVKE 330 K + H T ++E + + GGT + L+ ++ Sbjct: 326 GVKFNICSFGNTHSFLWPKSVTYSQETLDLAINHVNSMTANYGGTEMLQPLQA---TIEN 382 Query: 331 RYNPAQWNIYAAQASDGDNWADDSPLCHEILAKKLLPV---VRYYSYIEITRRAHQTLWR 387 RY + +DG+ W L + +L +R ++ +H L Sbjct: 383 RYKDMALD--IMLLTDGEIWRQ--QQLFSYLNQSVLESKDPIRVFTLGVGMGVSH-ALIE 437 Query: 388 EYEHLQSTF 396 + F Sbjct: 438 GIAKAGNGF 446 >UniRef50_Q1Q2F5 Putative uncharacterized protein n=1 Tax=Candidatus Kuenenia stuttgartiensis RepID=Q1Q2F5_9BACT Length = 333 Score = 61.7 bits (148), Expect = 5e-08, Method: Composition-based stats. Identities = 24/162 (14%), Positives = 49/162 (30%), Gaps = 24/162 (14%) Query: 236 YKNYEKRPDPSSQAVMFCLMDVSGSMDQST-----KDMAKRFYILLYLFLSRTYKNVEVV 290 ++E+ + +D S SM ++AKR L L + + Sbjct: 78 GYHWEEVEKKGID--IMIAVDTSRSMLADDVKPNRLEVAKREIEDLLKIL-EGDRVGLIA 134 Query: 291 YIR---------HHTQAKEVDEHEFF-YSQETGGTIVSSALKLMDEVVKERYNPAQWN-I 339 + A + ++ GGT ++ A+ + + + N Sbjct: 135 FAGRAFTYCPLTSDYSAFRLFLNDLNVNIIPVGGTAIAEAIYKGID----AFGENENNHK 190 Query: 340 YAAQASDGDNWADDSPLCHEILAKKLLPVVRYYSYIEITRRA 381 +DG+N + PL AK+ V+ + Sbjct: 191 AMIIITDGEN-HETDPLKAASKAKEKGIVIYTVGVGKKEGSY 231 >UniRef50_A1AQS2 Protoporphyrin IX magnesium-chelatase n=1 Tax=Pelobacter propionicus DSM 2379 RepID=A1AQS2_PELPD Length = 617 Score = 61.7 bits (148), Expect = 5e-08, Method: Composition-based stats. Identities = 44/308 (14%), Positives = 92/308 (29%), Gaps = 25/308 (8%) Query: 73 HRVHPGNDHFVQNDRIERPQGGGGGSGSGQGQASQDGEGQDEFVFQISKDEYLDLLFEDL 132 HR + + E+P + Q + + +DE + Sbjct: 267 HRRLQLREEEQNSTEPEQPHDQDQQQNNEQRDDQGEQP----PPPENGQDENDRHPSGEG 322 Query: 133 ALPNLKQNQQRQLTEYKTHRAGYTANGVPANISVVRSLQNSLARRTAMTAGKRRELHALE 192 + R G P + + ++ R+ + R Sbjct: 323 EHESTVSMG------ESAQREEIMGVGAPFKLRRLSFRKDRRKRQANGRRTRTRIKGRGG 376 Query: 193 ENLAIISNSEPAQLLEEERLRKEIAELRAK-IERVPFIDTFDLRYKNYEKRPDPSSQAVM 251 + + +S + + LR +A+ + + I+ DLR++ E+R ++ Sbjct: 377 RYVKSLLSSTEHDIAIDATLRACAPFQKARNRQGMLKIEQDDLRFRQRERR----MGHLV 432 Query: 252 FCLMDVSGSMDQSTKDM-AKRFYI-LLYLFLSRTYKNVEVVY-------IRHHTQAKEVD 302 ++D SGSM + M K LL + K +V+ + T + E+ Sbjct: 433 LFVVDGSGSMGARQRMMETKGAVQSLLLDCYQKRDKVAMIVFRKDRAELVLPPTASVELA 492 Query: 303 EHEFFYSQETGGTIVSSALKLMDEVVKERYNPAQWNIY-AAQASDGDNWADDSPLCHEIL 361 G T ++S L +V+ +DG +P + Sbjct: 493 ARRLAELPVGGKTPLASGLLKTHRLVRRTSMHHPEQRILVVLITDGRGNQHLTPETRKEE 552 Query: 362 AKKLLPVV 369 L ++ Sbjct: 553 VSNLAGLL 560 >UniRef50_D2VP35 von Willebrand factor, type A domain-containing protein n=2 Tax=Naegleria gruberi RepID=D2VP35_NAEGR Length = 286 Score = 61.7 bits (148), Expect = 5e-08, Method: Composition-based stats. Identities = 29/192 (15%), Positives = 58/192 (30%), Gaps = 33/192 (17%) Query: 205 QLLEEERLRKEIAELRAKIERVPFIDTFDLRYKNYEKRPDPS-SQAVMFCLMDVSGSMDQ 263 + EE R+E+ R + + ++ Y + +P+ + ++D SGSM Sbjct: 29 PVSNEEAYRREMEICRQYEIESKMREMHEAKFHQYSENKEPALQNLQLVLIIDHSGSMGS 88 Query: 264 STKDM--------------------AKRFYILLYLFLSRTYKNVEVVYIRHHTQAKEVDE 303 +D + L + + K+ +V I + +EV Sbjct: 89 FDEDATGQNRSKGLVDSNHWTRYDNVIQAAKYLSESIFQYDKDGKVPLIFFDSNVREVIV 148 Query: 304 HEFFYSQE-------TGGTIVSSALKLMDEVVKERYNPAQWNIYAAQASDGDNWADDSPL 356 T + AL+L + + N+ +DG P Sbjct: 149 DSIPRLVAAFEKNQPNSSTNLLGALELAF----KNHVNDHENVLFIVFTDGSPNGGQEPK 204 Query: 357 CHEILAKKLLPV 368 + L + L Sbjct: 205 -IKQLIQDKLTK 215 >UniRef50_UPI0000E488A7 PREDICTED: similar to Clca1 protein n=5 Tax=Strongylocentrotus purpuratus RepID=UPI0000E488A7 Length = 966 Score = 61.7 bits (148), Expect = 5e-08, Method: Composition-based stats. Identities = 22/163 (13%), Positives = 44/163 (26%), Gaps = 17/163 (10%) Query: 244 DPSSQAVMFCLMDVSGSMDQSTKDMAKRFYILLYLFLSRTYKNVEVVYIRHHTQA----- 298 PS + ++D SGSMD D R + V +V + + Sbjct: 309 QPSGSLRIVLVLDTSGSMDGERFDKMIRGAKNFIQSIVPNNSYVAIVEFNYESIVDSYMT 368 Query: 299 ---KEVDEHEFFYS---QETGGTIVSSALKLMDEVVKERYNPAQWNIYAAQASDGDNWAD 352 + + G T + + +V + + +Y SDG+ Sbjct: 369 ELTSVISRKDLASLLPTLADGATCIGCGIVTAIQVAQYN-DMDSRGVYLILLSDGEENHG 427 Query: 353 DSPLCHEILAKKLLPVVRYYSYIEITRRAHQTLWREYEHLQST 395 + +V ++ E T + + Sbjct: 428 TPIADTMDDIEGSGVIVHSIAFYEA-----DTQLEDLAQMTGG 465 >UniRef50_Q2GTB7 Putative uncharacterized protein n=1 Tax=Chaetomium globosum RepID=Q2GTB7_CHAGB Length = 777 Score = 61.7 bits (148), Expect = 5e-08, Method: Composition-based stats. Identities = 23/228 (10%), Positives = 50/228 (21%), Gaps = 34/228 (14%) Query: 204 AQLLEEERLRKEIAELRAKIERVPFIDTFDLRYKNYEKRPDPSSQAVMFCLMDVSGSMDQ 263 L+ ++ ++ ER I + E + +D+SGSM Sbjct: 29 GPKKPSNSLQLQLHPFSSEHERGGLIVKIQPP-REPENADLHHVPCDLVLSIDISGSMAD 87 Query: 264 ST------------------KDMAKRFYILLYLFLSRTYKNVEVVYIRHHT------QAK 299 D+ K + L + V + + K Sbjct: 88 EAPAPSKPGGEAGEDTGLRVIDLVKHAARTIVATLDSRDRLGIVTFTNRSKVGIPPYENK 147 Query: 300 EVDEHEFFYSQETGGTIVSSALKLMDEVVKERYNPAQWNIY--AAQASDGDNWADDSPLC 357 + T + ++ + E +DG P Sbjct: 148 AKTLENIESMEPFSSTNMWHGIRDGLSLFSEAEG-GSTGRVPALLVLTDGMPNYMCPPKG 206 Query: 358 ---HEILAKKLLPVVRYYSYIEITRRAHQTLWREYEHLQSTFDNFAMQ 402 + L + + + L + + +F Sbjct: 207 YVPMLRSMEPLPATIHTFGFGY---ELRSGLLKSIAEVGGGNYSFIPD 251 >UniRef50_UPI0001757D5D PREDICTED: similar to AGAP009579-PA n=1 Tax=Tribolium castaneum RepID=UPI0001757D5D Length = 1056 Score = 61.7 bits (148), Expect = 5e-08, Method: Composition-based stats. Identities = 29/209 (13%), Positives = 57/209 (27%), Gaps = 32/209 (15%) Query: 220 RAKIERVPFIDTFDLRYKNYEKRPDPSSQAVMFCLMDVSGSMDQSTKDMAKRFYILLYLF 279 AK D FD R + + + + L+D SGSMD + +A + Sbjct: 205 PAKKWPNIEKDEFDCRVRTW-YIEAATCTKDVIILVDNSGSMDGMGRHIASLTVNTILDT 263 Query: 280 LSRTYKNVEVVYIRHHT----------------QAKEVDEHEFFYSQETGGTIVSSALKL 323 S + Y T + + + + +G T AL++ Sbjct: 264 FSNNDYINILYYSNQTTNYTIPCFRNLLVQATPENIVLFKEAIRHLGPSGKTDFPQALQM 323 Query: 324 MDEVVKERYNPAQ--------------WNIYAAQASDGDNWADDSPLCHEILAKKLLPV- 368 ++++ N +DG + + + Sbjct: 324 AFDILENYREIRGCNNEEIDEEGKSKACNQAIMLITDGISRNFSDIVMRNNQLDGGKTIP 383 Query: 369 VRYYSYIEITRRAHQTLWREYEHLQSTFD 397 VR ++Y+ + R F Sbjct: 384 VRIFTYLIGKEVTNVEEIRWMACANRGFY 412 >UniRef50_B3RZT6 Putative uncharacterized protein n=2 Tax=Trichoplax adhaerens RepID=B3RZT6_TRIAD Length = 1343 Score = 61.7 bits (148), Expect = 5e-08, Method: Composition-based stats. Identities = 26/189 (13%), Positives = 54/189 (28%), Gaps = 20/189 (10%) Query: 228 FIDTFDLRYKNYEKRPDPSSQAVMFCLMDVSGSMDQSTKDMAKRFYILLYLFLSRTYKNV 287 ++ R + + ++D SGSM S + + L L + Sbjct: 276 QNRPARIKNTAPIFRFVKAMPVRIVMVLDKSGSMRGSNLQQLIQAATNVILQLGQID--G 333 Query: 288 EVVYIRHHTQAKEV-------DEHEFFYS------QETGGTIVSSALKLMDEVVKERYNP 334 + I T A ++ + + +GGT + S + E++ Sbjct: 334 SIGIIIFSTSATVTCPLMAVNNDQDKNKLIGCLPPEASGGTSIGSGILKGIELLLGSVGE 393 Query: 335 AQWN-IYAAQASDGDNWADDSPLCHEILAKKLLPVVRYYSYIEITRRAHQTLWREYEHLQ 393 + + + SDG A+ + VV S+ + + + Sbjct: 394 QKPSGGHLIVMSDGQENANPRIKDVMSNITENDVVVTSISFGQSA----SKVLEDLAKST 449 Query: 394 STFDNFAMQ 402 FA Sbjct: 450 GGSSYFAST 458 >UniRef50_Q6A9M2 Magnesium-chelatase subunit n=3 Tax=Propionibacterium acnes RepID=Q6A9M2_PROAC Length = 600 Score = 61.7 bits (148), Expect = 6e-08, Method: Composition-based stats. Identities = 32/243 (13%), Positives = 69/243 (28%), Gaps = 16/243 (6%) Query: 135 PNLKQNQQRQLTEYKTHRAGYTANGVPANISVVRSLQNSLARRTAMTAGKRRELHALEEN 194 L + T ++ + + RR R Sbjct: 290 EQLDEKGHDDSTAASPRV--IAPEETFNVLTSTPAAYDRTQRRARGRRMVTRSADLRGHV 347 Query: 195 LAIISNSEPAQLLEEERLRKEIAELRAKIERVPFIDTFDLRYKNYE---KRPDPSSQAVM 251 ++ ++P L LR RA+ D + K + K + + ++ Sbjct: 348 ISSQPTTQPVDLSLPATLRAAAPHQRARRASGEGRDGLSVVVKRSDWRRKVRESRTSTLV 407 Query: 252 FCLMDVSGSMDQSTKDMAKRFYI--LLYLFLSRTYKNVEVVYIRHHTQAKEVDEHEFFYS 309 ++D SGSM + A + + LL + + + + ++ + + Sbjct: 408 VFIVDASGSMGARGRMAASKAAVIGLLQDAYVKRDRVAVITFSGNNARVIVPATSSIERA 467 Query: 310 Q-------ETGGTIVSSALKLMDEVVKERYNPAQWNI-YAAQASD-GDNWADDSPLCHEI 360 Q G T ++S L +++ A +D G N D + Sbjct: 468 QRLLTTMAVGGQTPLASGLSCAGRLIETELRRDSTLRPVAIVVTDGGGNVGLDGRVDRTA 527 Query: 361 LAK 363 + Sbjct: 528 TNQ 530 >UniRef50_Q2FNC6 von Willebrand factor, type A n=1 Tax=Methanospirillum hungatei JF-1 RepID=Q2FNC6_METHJ Length = 233 Score = 61.7 bits (148), Expect = 6e-08, Method: Composition-based stats. Identities = 21/162 (12%), Positives = 44/162 (27%), Gaps = 18/162 (11%) Query: 248 QAVMFCLMDVSGSMDQSTKDMAKRFYILLYLFLSRTY---KNVEVVYIRHHTQAKEVDE- 303 ++D S SM + +L L K +++ I + E+ Sbjct: 19 HCATVLVLDTSASMSGNKIAELNEGLRILTDELKEDDLAVKRIDLAVITF-GKGVELVRP 77 Query: 304 ----HEFF--YSQETGGTIVSSALKLMDEVVKER------YNPAQWNIYAAQASDGDNWA 351 F G T + A+ +V+ER + + +DG Sbjct: 78 FTGISAFDPPELSAGGYTPMGQAILEAVRLVEERKAEYRTIGTDYYRPWIFLITDGQPTD 137 Query: 352 D-DSPLCHEILAKKLLPVVRYYSYIEITRRAHQTLWREYEHL 392 E + + + R + ++ Q + Sbjct: 138 MRKGDEIWEKVIEAVHGGERDHKFLFWALGVDQANMTVLREI 179 >UniRef50_Q56BS9 Putative uncharacterized protein n=1 Tax=Enterobacteria phage RB43 RepID=Q56BS9_9CAUD Length = 739 Score = 61.7 bits (148), Expect = 6e-08, Method: Composition-based stats. Identities = 30/184 (16%), Positives = 53/184 (28%), Gaps = 24/184 (13%) Query: 234 LRYKNYEKRPDPSSQAVMFCLMDVSGSMDQSTKDMAKRFYILLYLFLSRTYKNVEVVYIR 293 + +KN +P P++ + DVSGSM ++ L + V ++Y Sbjct: 6 IEFKNAVSKPTPTNH---VFVCDVSGSMYNE-LPKIRKHLKANLASLVKQDDTVSILYFS 61 Query: 294 HHTQAKEVDE--------------HEFFY-SQETGGTIVSSALKLMDEVVKERYNPAQWN 338 V + TG T L L E+ + + Sbjct: 62 SKGDYGTVFRGEKVSNVSDLTNICTAIDRYLKPTGCTGFVEPLNLAAEIATDLQSENGNL 121 Query: 339 IYAAQASDG-DN-WADDSPLCHEILAKKLLPVVRYYSYIEITRRAHQTLWREYEHLQSTF 396 +DG DN W D L +++E ++ L + + Sbjct: 122 NSLIFLTDGYDNCWRTDD---ILKACAVLPLTFNSIAFLEYGYYVNRPLLEKMAEATNAL 178 Query: 397 DNFA 400 F Sbjct: 179 HKFV 182 >UniRef50_C7DHI4 von Willebrand factor type A n=1 Tax=Candidatus Micrarchaeum acidiphilum ARMAN-2 RepID=C7DHI4_9EURY Length = 705 Score = 61.3 bits (147), Expect = 6e-08, Method: Composition-based stats. Identities = 26/136 (19%), Positives = 48/136 (35%), Gaps = 7/136 (5%) Query: 242 RPDPSSQAVMFCLMDVSGSMDQSTKDMAKRFYILLYLFLSRTYKNVEVVYIRH----HTQ 297 R + A ++ L+D+SGSM + AKR ++ L K V + T Sbjct: 519 RHIKDAGAEIWMLLDISGSMGGQKINAAKRILGSIHDSL-DGSKYVHLRMFGFYGSDGTH 577 Query: 298 AKEVDEHEFFYSQETGGTIVSSALKLMDEVVKERYNPAQWNIYAAQASDGDNWADDSPLC 357 E D G T A+ +++K+ + + ++ +DGD Sbjct: 578 VFEFDRKMLMNLAAMGDTPTDIAIYYAMDLMKK--DKSNFDKTLFIITDGDPNNGQETKN 635 Query: 358 HEILAKKLLPVVRYYS 373 K + V ++ Sbjct: 636 ALNSLKNAMKNVNVFT 651 >UniRef50_B3RP11 Putative uncharacterized protein n=1 Tax=Trichoplax adhaerens RepID=B3RP11_TRIAD Length = 356 Score = 61.3 bits (147), Expect = 6e-08, Method: Composition-based stats. Identities = 22/158 (13%), Positives = 43/158 (27%), Gaps = 21/158 (13%) Query: 231 TFDLRYKNYEKRPDPSSQAVMFCLMDVSGSMDQSTKDMAKRFYILLYLFLSRTYKNVEVV 290 T + ++K + S + ++D SGSM K + L + + Sbjct: 200 TTETKFKQWYVNAASPSSKRLVLVLDRSGSMSGDRFLKVKEAATAVLDSLGPNDEIGVIA 259 Query: 291 Y--------------IRHHTQAKEVDEHEF--FYSQET-GGTIVSSALKLMDEVVKER-- 331 + + T + +F Q G T ALK +++ Sbjct: 260 FDDEIRIHGGCKVTTVSPATPQSIIFLKDFINNKIQPEFGSTGYVPALKHAFDMLSTNMT 319 Query: 332 YNPAQWNIYAAQASDGDNWADDSPLCHEILAKKLLPVV 369 +DG D+ + K + Sbjct: 320 SKAKTKTNLIVFLTDGH--PDEPESQILDVIKNRNEAL 355 >UniRef50_UPI00017B4DF5 UPI00017B4DF5 related cluster n=3 Tax=Tetraodontidae RepID=UPI00017B4DF5 Length = 2436 Score = 61.3 bits (147), Expect = 6e-08, Method: Composition-based stats. Identities = 29/159 (18%), Positives = 49/159 (30%), Gaps = 21/159 (13%) Query: 247 SQAVMFCLMDVSGSMDQSTKDMAKRFYILLYLFLSRTYKNVEVVYIRHHTQAKE------ 300 +A + L+D SGS+ K+F I L + V V + + K+ Sbjct: 986 QKADLVFLLDQSGSIQSDDYTTMKKFTIDLINKFQISRDLVHVGLAQFSSTFKDEFYLNK 1045 Query: 301 -VDEHEF-----FYSQETGGTIVSSA---LKLMDEVVKERYNPAQWNIYAAQASDGDNWA 351 DE QE GGT++ A ++ E + +DGD+ Sbjct: 1046 FFDEQAISAHIKDMQQEEGGTLIGLALNSIRKYFEASHGSRKAEGISQNLVLITDGDSQD 1105 Query: 352 DDSPLCHEILAKKLLPVVRYYSYIEITRRAHQTLWREYE 390 D +LL + + H + Sbjct: 1106 DVEEAA------RLLRGLGVEVFAIGIGNVHDLELLQIA 1138 Score = 47.8 bits (112), Expect = 8e-04, Method: Composition-based stats. Identities = 26/184 (14%), Positives = 49/184 (26%), Gaps = 23/184 (12%) Query: 245 PSSQAVMFCLMDVSGSMDQSTKDMAKRFYILLYLFLSRTYKNVEVVYIRHHTQAKEV--- 301 + L+D SGS+ K F L + V V +++ T+ K V Sbjct: 596 KDVPGDLIFLIDSSGSIYPEDYQKMKDFMKSLVQKSNIGKDQVHVGVLQYSTEQKLVFPL 655 Query: 302 --------DEHEFFYSQE-TGGTIVSSALKLM--DEVVKERYNPAQWNIYAAQASDGDNW 350 Q+ GGT A+ ++ + P +DG+ Sbjct: 656 IQYYTKDQLSKAIDDMQQIGGGTHTGEAIAVVSKYFDAQNGGRPDLKQR-LVVVTDGE-- 712 Query: 351 ADDSPLCHEILAKKLLPVVRYYSYIEITRRAHQTLWREYEHLQSTFDNFAMQHIRDQDDI 410 + D + +V + + E +A + D+ Sbjct: 713 SQDDVKLPAEALRAKGVIVYSIGVVAANTS------QLLEISGDADRMYAERDFDALKDL 766 Query: 411 YPVF 414 Sbjct: 767 EKQM 770 Score = 46.3 bits (108), Expect = 0.002, Method: Composition-based stats. Identities = 24/148 (16%), Positives = 44/148 (29%), Gaps = 26/148 (17%) Query: 246 SSQAVMFCLMDVSGSMDQSTKDMAKRFYILLYLFLS---RTYKNVEVVYIRHHT------ 296 + +A +F L+D SGS+ K+F + + V Y T Sbjct: 390 TEEADIFFLIDQSGSIHPPDFYDMKKFILEFLQTFRVGPNHVRIGVVKYADSPTLEFDLH 449 Query: 297 --QAKEVDEHEFFYS-QETGGTIVSSALKLMDEVVKERYNPAQWNI------YAAQASDG 347 + E Q GGT + ++ +++ A Y +DG Sbjct: 450 TYTDVKSLEKAITNIHQVGGGTETG----KALDFMRPQFDRAVTTRGHKVKEYLVVITDG 505 Query: 348 DNWADDSPLCHEILAKKLLPVVRYYSYI 375 + + + K V Y+ Sbjct: 506 ----NSTDKVKDPADKLRAQGVVVYAIG 529 >UniRef50_Q25545 Putative uncharacterized protein (Fragment) n=1 Tax=Naegleria fowleri RepID=Q25545_NAEFO Length = 357 Score = 61.3 bits (147), Expect = 7e-08, Method: Composition-based stats. Identities = 14/127 (11%), Positives = 37/127 (29%), Gaps = 12/127 (9%) Query: 299 KEVDEHEFFYSQETGGTIVSSALKLMDEVVKERYNPAQWNIYAAQASDG-DNWADDSPLC 357 K+ + T +S L ++K+R + +DG N + Sbjct: 2 KQKAKQVAKNIHAGTCTNLSGGLFEGLRLIKQRTTCNEIT-SILLFTDGLANEGITNTSE 60 Query: 358 H-----EILAKKLLPVVRYYSYIEITRRAHQTLWREYEHLQSTFDNFAMQHIRDQDDIYP 412 + +++ + +++ + + + F + + DDI Sbjct: 61 IVSKMNTTIHEEIRKQITCFTFGFGSDT-DANMLTSIAQAGNGLYYF----LNNVDDIPK 115 Query: 413 VFRELFH 419 F + Sbjct: 116 AFGNVIG 122 >UniRef50_A6NF34 Anthrax toxin receptor-like n=8 Tax=Catarrhini RepID=ANTRL_HUMAN Length = 565 Score = 61.3 bits (147), Expect = 7e-08, Method: Composition-based stats. Identities = 18/150 (12%), Positives = 44/150 (29%), Gaps = 11/150 (7%) Query: 251 MFCLMDVSGSMDQSTKDMAKRFYILLYLFLSRTYKNVEVVY------IRHHTQAKEVDEH 304 ++ ++D SGS++ + D+ + F S + + Y + T K ++ Sbjct: 77 LYFILDKSGSVNNNWIDLYMWVEETVARFQSPNIRMCFITYSTDGQTVLPLTSDKNRIKN 136 Query: 305 EFFYSQ---ETGGTIVSSALKLMDEVVKERYNPAQWNIYAAQASDGDNWADDSPLCHEIL 361 Q G T + + + + ++ + + +DG+ A Sbjct: 137 GLDQLQKIVPDGHTFMQAGFRKAIQQIESFNSGNKVPSMIIAMTDGELVAHAFQDTLREA 196 Query: 362 AKKLLPVVRYYSYIEITRRAHQTLWREYEH 391 K Y+ + Sbjct: 197 QKARKLGANVYTLGVA--DYNLDQITAIAD 224 >UniRef50_C8NJ92 Secreted Mg-chelatase subunit n=3 Tax=Corynebacterium RepID=C8NJ92_COREF Length = 530 Score = 61.3 bits (147), Expect = 7e-08, Method: Composition-based stats. Identities = 29/250 (11%), Positives = 66/250 (26%), Gaps = 36/250 (14%) Query: 178 TAMTAGKRRELHALEENLAIISNSEPAQLLEEERLRKEIAELRAKIERVPFIDTFDLRY- 236 +T + ++ A+ + R+ + + + + + Sbjct: 264 YPLTPLATADAETNQQVEALADWMLDHPEHLTDTFRRPVDPMAILPPELAQAFVIEQPFP 323 Query: 237 --KNYEKRPDPSSQAVM------FCLMDVSGSMDQSTKDMAKRFYILLYL---------- 278 + + + ++DVSGSM + ++ + + + Sbjct: 324 GDRAVTDALISAYNNDLRVPGDTTFVLDVSGSMAGTRMELLRSTMLEMISGEASSLTGDV 383 Query: 279 FLSRTYKNVEVVYIRHHTQAKEVDEHE------------FFYSQETGGTIVSSALKLMDE 326 L + + + E Q GGT + AL E Sbjct: 384 SLRERENVTIIPFNFSPGEPITATVDEVGGPQRQELVDGVTALQAEGGTGIYDALLRAYE 443 Query: 327 VVKERYNPAQWNIYAAQASDGDNWADDSPLCHEILAKKLL-PVVRYYSYIEITRRAHQTL 385 V+ P +DG+ + S + L +L R ++ + A+ T Sbjct: 444 QVE----PGASIPSIVLMTDGEQTSGLSFGHFQRLYSELPTEKKRIPVFVILYGEANITE 499 Query: 386 WREYEHLQST 395 L Sbjct: 500 MENLAGLTGG 509 >UniRef50_A9GV55 Putative secreted protein n=1 Tax=Sorangium cellulosum 'So ce 56' RepID=A9GV55_SORC5 Length = 563 Score = 61.3 bits (147), Expect = 7e-08, Method: Composition-based stats. Identities = 21/170 (12%), Positives = 43/170 (25%), Gaps = 21/170 (12%) Query: 240 EKRPDPSSQAVMFCLMDVSGSMDQSTKDMAKRFYILLYLFLSRTY--------------- 284 ++D SGSM ++AK+ + + L + Sbjct: 26 ALAQVAPLDTNHVVIIDRSGSMYGDRLELAKKAAKIYWNTLVSSNVPASQSFTELLGVAS 85 Query: 285 --KNVEVVYIRHHTQAKEVDEHEFFYSQETGGTIVSSALKLMDEVVKERYNPAQWNIYAA 342 V Y A +D G T + + L+ +++ Sbjct: 86 YSDTSSVTYPLTALPASGLD-TAVDALVADGSTSIGAGLEEALDMLISESPTKSARECVI 144 Query: 343 QASDGDNWADDSPLCHEILAKKLLPVVRYYSYIEITRRAHQTLWREYEHL 392 SDG + +P + V + A + + + Sbjct: 145 LLSDGQHNTPPAPSDFYADYFSRVDEVHSIALG---SGADEAMMSDIAAN 191 >UniRef50_D1VKI5 von Willebrand factor type A n=1 Tax=Frankia sp. EuI1c RepID=D1VKI5_9ACTO Length = 560 Score = 61.3 bits (147), Expect = 7e-08, Method: Composition-based stats. Identities = 25/190 (13%), Positives = 46/190 (24%), Gaps = 26/190 (13%) Query: 252 FCLMDVSGSMDQSTKDMAKRFYILLY---LFLSRTY----KNVEVVYIRHHTQAKEVDEH 304 ++D SGSM+ ++ L LS + +V I + + + Sbjct: 368 IYVLDTSGSMEGPRLAALQQALTGLTGADDSLSGRFARFRAREQVTIITFNDKVTATRQF 427 Query: 305 EFF-----------------YSQETGGTIVSSALKLMDEVVKERYNPA-QWNIYAAQASD 346 + G T + SAL +D Sbjct: 428 TVSDPTPGSADLKAISDYGAALRAGGNTAIYSALDAAYTTAAAGMKADPSALTSIVLMTD 487 Query: 347 GDNW-ADDSPLCHEILAKKLLPVVRYYSYIEITRRAHQTLWREYEHLQSTFDNFAMQHIR 405 G+N DS + V ++ A + + A Sbjct: 488 GENNRGLDSAGFLARYNTRPPDVRGVRTFAVDFGDADRAALTQIATSTGGAVFDATAPGV 547 Query: 406 DQDDIYPVFR 415 D++ R Sbjct: 548 SLSDVFREIR 557 >UniRef50_C0QXK8 von Willebrand factor type A (VWA) domain containing protein n=2 Tax=Brachyspira RepID=C0QXK8_BRAHW Length = 289 Score = 61.3 bits (147), Expect = 7e-08, Method: Composition-based stats. Identities = 20/142 (14%), Positives = 39/142 (27%), Gaps = 17/142 (11%) Query: 254 LMDVSGSMDQSTK-----DMAKRFYILLYLFLSRTYKNVEVVYIRHHTQAKEVD------ 302 ++DVS SM + +K+ I K V + + Sbjct: 52 VVDVSPSMMAEDMIPTRLEASKKTMIDFIKK-RNFDKISLVSFALRASVLSPATFDYTSL 110 Query: 303 EHEFFYSQ--ETGGTIVSSALKLMDEVVKERYNPAQWNIYAAQASDGD-NWADDSPLCHE 359 E E + E G T + + +++ R +DG+ N + P Sbjct: 111 EEEIKKIEIDEEGSTSIGLGIATAVDML--RSVKEDNEKIIILLTDGENNSGEIDPKLAS 168 Query: 360 ILAKKLLPVVRYYSYIEITRRA 381 +A + + Sbjct: 169 EIASNFNIKIYTIGIGDANGSH 190 >UniRef50_UPI0001A2C533 UPI0001A2C533 related cluster n=1 Tax=Danio rerio RepID=UPI0001A2C533 Length = 1222 Score = 60.9 bits (146), Expect = 8e-08, Method: Composition-based stats. Identities = 27/186 (14%), Positives = 55/186 (29%), Gaps = 20/186 (10%) Query: 243 PDPSSQAVMFCLMDVSGSMDQSTKDMAKRFYILLYLFLSR-----------TYKNVEVVY 291 S Q L+D SGSM + K +++ L +K + Sbjct: 347 DLRSIQGEFVFLIDRSGSMSGVNINRVKDAMVVILKSLFPACLFNIVGFGSKFKTLFSTS 406 Query: 292 IRHHTQAKEVDEHEFFYSQET-GGTIVSSALKLMDEVVKERYNPAQWNIYAAQASDGDNW 350 + ++ + + GGT + + L + R +DG Sbjct: 407 QSYDEESLALACEYVKKIRADMGGTNILAPLNWILRQPMHR----GHPRLLFLLTDG--- 459 Query: 351 ADDSPLCHEILAKKLLPVVRYYSYIEITRRAHQTLWREYEHLQSTFDNFAMQHIRDQDDI 410 A + L + R +++ I + A + L + F + R Q + Sbjct: 460 AVSNTGKVIELLRSHARFTRCFTFG-IGQAACRRLVSGLSAVSRGTAEFLAEGERLQPKM 518 Query: 411 YPVFRE 416 ++ Sbjct: 519 IKSLKK 524 >UniRef50_UPI0001A2C532 UPI0001A2C532 related cluster n=2 Tax=Clupeocephala RepID=UPI0001A2C532 Length = 1236 Score = 60.9 bits (146), Expect = 8e-08, Method: Composition-based stats. Identities = 27/186 (14%), Positives = 55/186 (29%), Gaps = 20/186 (10%) Query: 243 PDPSSQAVMFCLMDVSGSMDQSTKDMAKRFYILLYLFLSR-----------TYKNVEVVY 291 S Q L+D SGSM + K +++ L +K + Sbjct: 347 DLRSIQGEFVFLIDRSGSMSGVNINRVKDAMVVILKSLFPACLFNIVGFGSKFKTLFSTS 406 Query: 292 IRHHTQAKEVDEHEFFYSQET-GGTIVSSALKLMDEVVKERYNPAQWNIYAAQASDGDNW 350 + ++ + + GGT + + L + R +DG Sbjct: 407 QSYDEESLALACEYVKKIRADMGGTNILAPLNWILRQPMHR----GHPRLLFLLTDG--- 459 Query: 351 ADDSPLCHEILAKKLLPVVRYYSYIEITRRAHQTLWREYEHLQSTFDNFAMQHIRDQDDI 410 A + L + R +++ I + A + L + F + R Q + Sbjct: 460 AVSNTGKVIELLRSHARFTRCFTFG-IGQAACRRLVSGLSAVSRGTAEFLAEGERLQPKM 518 Query: 411 YPVFRE 416 ++ Sbjct: 519 IKSLKK 524 >UniRef50_UPI00016C400A von Willebrand factor, type A n=1 Tax=Gemmata obscuriglobus UQM 2246 RepID=UPI00016C400A Length = 249 Score = 60.9 bits (146), Expect = 8e-08, Method: Composition-based stats. Identities = 23/201 (11%), Positives = 51/201 (25%), Gaps = 47/201 (23%) Query: 234 LRYKNYEKRPDPSSQAVMFCLMDVSGSM----DQSTKDMAKRF----------------Y 273 + + + +P + L+D SGSM + +D+ + Sbjct: 5 IPFSDVALATNPEPRCPCVLLIDTSGSMAEVVSGTGRDLGRTAQVDGKTYRVVSGGTTRI 64 Query: 274 ILLYLFLS------RTYK----NVEVVYIRHHTQAKEVDEHEFFY------SQETGGTIV 317 L+ L VEV + + V G T + Sbjct: 65 DLVNEGLRVYQADVTNDPLAAQRVEVSVVTFGDTVRTVTPFVTTSQFTPPVLTANGETPM 124 Query: 318 SSALKLMDEVVKERYNPAQWN------IYAAQASDGDNWADDSPLCHEILAKKLLPVVRY 371 +A+ + V ER + N + +DG+ + + + Sbjct: 125 GAAILKAIDAVTERKREYRQNGLHFYRPWIFLITDGEPTDAWEAAAARVREGEEKKQFAF 184 Query: 372 YSYIEITRRAHQTLWREYEHL 392 ++ + + Sbjct: 185 FAVGVEGAN-----MDRLKQI 200 >UniRef50_UPI00006CCBAF hypothetical protein TTHERM_00437740 n=1 Tax=Tetrahymena thermophila RepID=UPI00006CCBAF Length = 937 Score = 60.9 bits (146), Expect = 8e-08, Method: Composition-based stats. Identities = 20/174 (11%), Positives = 50/174 (28%), Gaps = 14/174 (8%) Query: 237 KNYEKRPDPSSQAVMFCLMDVSGSMDQSTKDMAKRFYILLYLFLSRTYKNVEVVYIRHHT 296 + + + ++D SGSM A R + + +Y + + Sbjct: 9 FRIPQVSTEKVSSALIGVLDASGSMSSY-WPFAARSWNQISQNHPNSY---CITFSTKAA 64 Query: 297 QAKEVDEHEFFYSQETGGTIVSSALKLMDEVVKERYNPAQWNIYAAQASDGDNWADDSPL 356 + K + T + ++++E+++ N+ SDG DD+ Sbjct: 65 ERKLPLSENINQYDCS-STNIVGGFEVLNEMLQR--PSMGNNVTVVFISDGQ---DDNNS 118 Query: 357 CHEILAKKLL----PVVRYYSYIEITRRAHQTLWREYEHLQSTFDNFAMQHIRD 406 KL + + S + + + + + + Sbjct: 119 TLPNRITKLKITHQKKINFISIGVGSGFPTFVAMNLRDIYHNGESSLPPVFLIE 172 >UniRef50_Q5LDB9 Putative uncharacterized protein n=11 Tax=Bacteroides RepID=Q5LDB9_BACFN Length = 419 Score = 60.9 bits (146), Expect = 8e-08, Method: Composition-based stats. Identities = 22/158 (13%), Positives = 50/158 (31%), Gaps = 15/158 (9%) Query: 197 IISNSEPAQLLEEERLRKEIAELRAKIERVPFIDTFDLRYKNYEKRPDPSSQAVMFCLMD 256 +S+ L E +K++ + + + I ++ + +D Sbjct: 222 YLSDPALQPLFFERFNKKKLQMMDYESKDQHRIKDIKIQGNEIVEEQSGP----FIICVD 277 Query: 257 VSGSMDQSTKDMAKRFYILLYLFLSRTYKNVEVVYIRHHTQAKEVDE--------HEFFY 308 SGSM ++ K + + + + ++ + E++ F Sbjct: 278 TSGSMSGEREEFVKSAILAIAELTEQQDRKCYLINFSNDIACIEIERLGQNIQELANFLC 337 Query: 309 SQETGGTIVSSALKLMDEVVKERYNPAQWNIYAAQASD 346 GGT ++ AL ++K + N SD Sbjct: 338 QSFHGGTDLTPALLHAIYILKTKS---YRNADLVMMSD 372 >UniRef50_B9L896 von Willebrand factor, type A n=1 Tax=Nautilia profundicola AmH RepID=B9L896_NAUPA Length = 288 Score = 60.9 bits (146), Expect = 8e-08, Method: Composition-based stats. Identities = 30/178 (16%), Positives = 54/178 (30%), Gaps = 18/178 (10%) Query: 250 VMFCLMDVSGSMDQSTKDMAKRFYILLYLFLSRTYKNVEVVY---------IRHHTQAKE 300 + +D SGSM + K A + L + + VV+ + + E Sbjct: 77 NIVIDLDTSGSMAEFNKIDAAKAVSLDFAKKRKNDALGLVVFGNIAYIASPLTFDKKTFE 136 Query: 301 VDEHEFFYSQETGGTIVSSALKLMDEVVKERYNPAQWNIYAAQASDG-DNWADDSPLCHE 359 + S G T + AL L + + A +DG DN + Sbjct: 137 DILKRIYVSIAGGKTAIYDALFLSSNL----FKNANGEKIIILLTDGMDNMSITPLDVVI 192 Query: 360 ILAKKLLPVVRYYSYIEITRRAHQTLWREYEHLQSTFDNFAMQHIRDQDDIYPVFREL 417 KK V + A ++ ++ + + + D IY +L Sbjct: 193 KKLKKEHIKVYSIAIG---GDADLSVLKKISKETNGKF-YIASSLEDLKKIYSDINKL 246 >UniRef50_B0A9L7 Putative uncharacterized protein n=2 Tax=Clostridium bartlettii DSM 16795 RepID=B0A9L7_9CLOT Length = 273 Score = 60.9 bits (146), Expect = 9e-08, Method: Composition-based stats. Identities = 26/183 (14%), Positives = 57/183 (31%), Gaps = 22/183 (12%) Query: 236 YKNYEKRPDPSSQAVMFCLMDVSGSMDQSTKDMAKRFYILLYLFLS---RTYKNVEVVYI 292 + E +P V+F L+D SGSM L L ++++ + Sbjct: 21 FDPLEVKPISKKNLVIFFLVDTSGSMSGKKIGTLNTTMEELLPELRGLGGATTDIKLAVM 80 Query: 293 RHHTQAKEVDEHEFF--------YSQETGGTIVSSAL-KLMDEVVKERYNPA---QWNIY 340 + + + + + G T + A +L +++ ++ + A + Sbjct: 81 TFSSGCEWITKEPMSVDDYQYWTRLKAEGLTDLGEAFTELSNKLSRKEFLNAPSLSYAPV 140 Query: 341 AAQASDGDNWADDSPLCHEILAKKLLPVVRYYSYI-EITRRAHQTLWREYEHLQSTFDNF 399 +DG + E L K L +Y Y ++ + E + Sbjct: 141 IFLLTDG----YATDDALEGL--KTLQHNNWYKYGLKVALGLGEKFDEELLKKFTGNPEL 194 Query: 400 AMQ 402 + Sbjct: 195 VVT 197 >UniRef50_Q1ZTY1 Putative uncharacterized protein n=1 Tax=Photobacterium angustum S14 RepID=Q1ZTY1_PHOAS Length = 259 Score = 60.9 bits (146), Expect = 9e-08, Method: Composition-based stats. Identities = 18/204 (8%), Positives = 52/204 (25%), Gaps = 35/204 (17%) Query: 237 KNYEKRPDPSSQAVMFCLMDVSGSMDQ-------STKDMAKRFYILLYLFLSRTYKNVEV 289 ++ ++ D SGSMD AK+ + + Sbjct: 61 NALVSDNWLATNYLLIF--DGSGSMDNTNCGNGQKKIVAAKQAIQTFINDIPNSANVGLY 118 Query: 290 VYIRHHTQAKE--------VDEHEFFYSQETGGTIVSSALKLMD----EVVKERYNPAQW 337 V+ + + + G T + S+L ++ ++ Sbjct: 119 VFDNKDASLRVPLGNNNRATLKQAIYDVTAGGATPLKSSLDSSYSALERQASKQLGYGEY 178 Query: 338 NIYAAQASDGDNWADDSPL-CHEILAKKLLPVVRYYSYIEITRRAHQTLWREYEHLQSTF 396 N+ +DGD ++P + + + + ++ Sbjct: 179 NVVI--VTDGDASQGENPQPAINRIYRDSPVTIHTIGFCIG---------EQHALNAKGI 227 Query: 397 DNFAMQHIRDQDDIYPVFRELFHK 420 + + + + + + + Sbjct: 228 TYYQSA--NNPEKLLAGLKNVLAE 249 >UniRef50_C7R6G5 von Willebrand factor type A n=1 Tax=Kangiella koreensis DSM 16069 RepID=C7R6G5_KANKD Length = 348 Score = 60.9 bits (146), Expect = 9e-08, Method: Composition-based stats. Identities = 22/156 (14%), Positives = 49/156 (31%), Gaps = 23/156 (14%) Query: 240 EKRPDPSSQAVMFCLMDVSGSMDQSTK-------DMAKRFYILLYLFL--SRTYKNVEVV 290 + P++ + +D+SGSM+ D LL F+ + + ++ Sbjct: 77 DTMDLPATGRDLMISIDISGSMEMPDMVIEDKEVDRLVAVKALLTDFIARRKGDRVGMIL 136 Query: 291 Y---------IRHH-TQAKEVDEHEFFYSQETGGTIVSSALKLMDEVVKERYNPAQWNIY 340 + + + + + + T + + L + ++ER N Sbjct: 137 FGEQAYLQTPLTFDLKTVQTMLDETTIGLAGSSRTAIGDGIGLAVKRLRER---DANNRV 193 Query: 341 AAQASDGD-NWADDSPLCHEILAKKLLPVVRYYSYI 375 +DG N +PL LA+ + Sbjct: 194 LILLTDGQNNTGALNPLQAAELAEHAGITIYTIGVG 229 >UniRef50_A1VWQ4 von Willebrand factor, type A n=20 Tax=Proteobacteria RepID=A1VWQ4_POLNA Length = 350 Score = 60.9 bits (146), Expect = 9e-08, Method: Composition-based stats. Identities = 23/149 (15%), Positives = 41/149 (27%), Gaps = 18/149 (12%) Query: 247 SQAVMFCLMDVSGSMDQSTKDMAKRFYILLYLFLSRTYKNVEVVY---IRHHTQAK---- 299 + +F ++D S SM + + + L + +E V+ I A+ Sbjct: 2 RRLPVFFVLDCSESMVGANLKKMEGAVAAIVKSLRTDPQALETVFFSVIAFAGVARTIAP 61 Query: 300 --EVDEHEFFYSQETGGTIVSSALKLMDEVVKERYN------PAQWNIYAAQASDGDNWA 351 E+ GGT + SAL + + W +DG Sbjct: 62 LVEIVSFYPPKLPLGGGTNLGSALDALMGEIDRSVIKTTAERKGDWRPIIYLVTDGR--P 119 Query: 352 DDSP-LCHEILAKKLLPVVRYYSYIEITR 379 D+P E + Sbjct: 120 TDNPSRAIERWNSHYAKKATLIAIGLGRS 148 >UniRef50_A1ZDA1 OmpA family protein n=1 Tax=Microscilla marina ATCC 23134 RepID=A1ZDA1_9SPHI Length = 756 Score = 60.9 bits (146), Expect = 9e-08, Method: Composition-based stats. Identities = 35/198 (17%), Positives = 66/198 (33%), Gaps = 21/198 (10%) Query: 236 YKNYEKRPDPSSQAVMFCLMDVSGSMDQSTKDMAKRFYILLYLFLSRTYKNVEVVY---- 291 +K E S + +MD SGSM + K + + + K V + Sbjct: 390 FKVREIHEMISKPYDISLVMDYSGSMAGNIKKLEEATRKFILTK-HPNDKISVVKFDERL 448 Query: 292 ---IRHHTQAKEVDEHEFFYS-QETGGTIVSSALKLMDEVVKERYNPAQWNIYAAQASDG 347 +R Q + D +F + G T + + E +K AQ N +DG Sbjct: 449 ETELRLTAQGSKTDCVKFDGLTRYGGSTALYAGADEGLESLKN----AQNNKVMLLFTDG 504 Query: 348 DN------WADDSPLCHEILAKKLLPVVRYYSYIEITRRAHQTLWREYEHLQSTFDNFAM 401 + + + E++ K +R ++ T ++TL L F Sbjct: 505 EENSSLQYFGKRAFRASEVVKKAREKGIRVFTIAYGTGVNNKTLN-ALSMLTDGKTYFI- 562 Query: 402 QHIRDQDDIYPVFRELFH 419 ++ + +Y +F Sbjct: 563 ENPDEIKQVYEELPRIFR 580 >UniRef50_C0EZA4 Putative uncharacterized protein n=1 Tax=Eubacterium hallii DSM 3353 RepID=C0EZA4_9FIRM Length = 538 Score = 60.9 bits (146), Expect = 9e-08, Method: Composition-based stats. Identities = 26/195 (13%), Positives = 54/195 (27%), Gaps = 27/195 (13%) Query: 246 SSQAVMFCLMDVSGSMDQSTKDMAKR-FYILLYLFLSRTYKNVEVVYIRHHTQAK----- 299 S + + +D+S SMD D K+ + L++ V Y T Sbjct: 222 SKKRDIVLTLDISASMDGIPLDETKKAAAKFVDSILNKNSNIGLVSYSDEATSLSGICSN 281 Query: 300 -EVDEHEFFYSQETGGTIVSSALKLMDEVVKERYNPAQWNIYAAQASDGDNWADDSPLCH 358 ++ T + L +++ + + SDG Sbjct: 282 DVFLKNTITSLSSAENTNIEDGLSRAYSMLQLGQSKKK---LIVLMSDGLPTLGKDGEEL 338 Query: 359 EILAKKLLPV---VRYYSYIEITRRA---HQTLWREYEHLQSTFDNFAMQHIRDQDDIYP 412 A+K+ + + + T Q L + ++ + +D+ Sbjct: 339 IKYAEKIKDQGVLIYTLGFFQNTEEYKAEGQYLMEKIASEG---CHY---EVSSSEDLV- 391 Query: 413 VFRELFHKQNATAKG 427 F + A G Sbjct: 392 ----FFFEDVAGQIG 402 >UniRef50_Q10ZP7 von Willebrand factor, type A n=1 Tax=Trichodesmium erythraeum IMS101 RepID=Q10ZP7_TRIEI Length = 420 Score = 60.9 bits (146), Expect = 1e-07, Method: Composition-based stats. Identities = 27/207 (13%), Positives = 49/207 (23%), Gaps = 24/207 (11%) Query: 228 FIDTFDLRYKNYEKRPDPSSQAVMFCLMDVSGSMDQSTKDMAKRFYILLYLFLSRTYKNV 287 + ++ K PS M +D S SM AK + + L Sbjct: 26 HVLRVRIQPKT--DANLPSLPIRMAIALDTSQSMKGEKLQRAKEACLAVVSHLRDPDYLS 83 Query: 288 EVVYIRHHTQAKE----------VDEHEFFYSQETGGTIVSSALKLMDEVVKERYNPAQW 337 Y T E E Q G T + L + ++E P + Sbjct: 84 LAGYSTRVTPLLESLAGGGAAAGFAEGAIADLQARGVTRI----DLALDWIEESLLPEKS 139 Query: 338 NIYA-AQASDGD--NWADDSPLCHEILAKKLLP----VVRYYSYIEI-TRRAHQTLWREY 389 +DG N + K + + + + + Sbjct: 140 PPLVGVLITDGHATNAGGTPLDDMKPFIVKARNMKSCGIILCAVGLGDAANFNTSFLTDL 199 Query: 390 EHLQSTFDNFAMQHIRDQDDIYPVFRE 416 +A + D+ + Sbjct: 200 SDQGGGAFIYADTPDKLLSDLQNRLKA 226 >UniRef50_C0CQI8 Putative uncharacterized protein n=1 Tax=Blautia hydrogenotrophica DSM 10507 RepID=C0CQI8_9FIRM Length = 393 Score = 60.9 bits (146), Expect = 1e-07, Method: Composition-based stats. Identities = 30/192 (15%), Positives = 56/192 (29%), Gaps = 19/192 (9%) Query: 247 SQAVMFCLMDVSGSMDQSTKDMAKRFYILLYLFLSRTYKNVEVVYIRH---HTQAKEVDE 303 S M L+D SGSM + + L + + V + +T+ +DE Sbjct: 107 SAVDMVLLLDGSGSMQGKKEPCVQAT-EALLEQMDEQSRAQAVAFASCVLGNTELLPLDE 165 Query: 304 ---HEFFYSQET----GGTIVSSALKLMDEVVKERYNPAQWNIYAAQASDGDNWADDSPL 356 E GGT L ++E+ + SDG+ ++ Sbjct: 166 EGRETLIKFVEGTDIIGGTEFGQPLTFALNSLEEKKETGRIQAVIL-LSDGEGPFPETLE 224 Query: 357 CHEILAKKLLPVVRYYSYIEITRRAHQTLWREYEHLQSTFDNFAMQH-IRDQDDI-YPVF 414 + V Y+ R+ F + + ++ I Sbjct: 225 E-----EYKEKDVVLYTIRMDAGEQETETARQLVQFAQKTGGFDTKIPVDEKGQISTDEL 279 Query: 415 RELFHKQNATAK 426 + F + + AK Sbjct: 280 TKAFRQAFSAAK 291 >UniRef50_UPI00005A0386 PREDICTED: similar to loss of heterozygosity, 11, chromosomal region 2, gene A homolog n=1 Tax=Canis lupus familiaris RepID=UPI00005A0386 Length = 881 Score = 60.9 bits (146), Expect = 1e-07, Method: Composition-based stats. Identities = 30/178 (16%), Positives = 54/178 (30%), Gaps = 20/178 (11%) Query: 251 MFCLMDVSGSMDQSTKDMAKRFYILLYLFLSR-----------TYKNVEVVYIRHHTQAK 299 L+D SGSM + K ++ L T+K + + ++ Sbjct: 217 FIFLIDRSGSMSGTNIHRVKDAMLVALKSLMPACLFNVIGFGSTFKTLFPSSQTYSEESV 276 Query: 300 EVDEHEFFYSQET-GGTIVSSALKLMDEVVKERYNPAQWNIYAAQASDGDNWADDSPLCH 358 + + GGT + S LK + R +DG A ++ Sbjct: 277 AMACDNIQRMRADMGGTNILSPLKWIIRQPVHR----GHPRLLFLITDG---AVNNTGKV 329 Query: 359 EILAKKLLPVVRYYSYIEITRRAHQTLWREYEHLQSTFDNFAMQHIRDQDDIYPVFRE 416 L + R YS+ I L R + F ++ R Q + ++ Sbjct: 330 LELVRNHAFSTRCYSFG-IGPNVCHRLVRGLATVSKGSAEFLVEGERLQPKMIKSLKK 386 >UniRef50_D1A9V6 Cobaltochelatase subunit n=2 Tax=Actinomycetales RepID=D1A9V6_THECD Length = 657 Score = 60.5 bits (145), Expect = 1e-07, Method: Composition-based stats. Identities = 28/145 (19%), Positives = 45/145 (31%), Gaps = 14/145 (9%) Query: 229 IDTFDLRYKNYEKRPDPSSQAVMFCLMDVSGSMDQSTKDMAKR--FYILLYLFLSRTYKN 286 I DLR + R ++ ++D SGSM + A + LL R K Sbjct: 464 IAPEDLRAALRDGRE----GNLVLFVVDASGSMAARRRMGAVKGAVLSLLLDAYQRRDKV 519 Query: 287 VEVVY-------IRHHTQAKEVDEHEFFYSQETGGTIVSSALKLMDEVVK-ERYNPAQWN 338 + + T + E G T +++ L EV++ ER Sbjct: 520 GLITFRGRGAELALPPTSSVEAGARRLERLPTGGRTPLAAGLLRAAEVLRVERLRDPARR 579 Query: 339 IYAAQASDGDNWADDSPLCHEILAK 363 +DG P L + Sbjct: 580 PLLVIVTDGRATTGPDPAQAAALLR 604 >UniRef50_D2H285 Putative uncharacterized protein (Fragment) n=1 Tax=Ailuropoda melanoleuca RepID=D2H285_AILME Length = 1230 Score = 60.5 bits (145), Expect = 1e-07, Method: Composition-based stats. Identities = 30/178 (16%), Positives = 54/178 (30%), Gaps = 20/178 (11%) Query: 251 MFCLMDVSGSMDQSTKDMAKRFYILLYLFLSR-----------TYKNVEVVYIRHHTQAK 299 L+D SGSM + K ++ L T+K + + ++ Sbjct: 362 FIFLIDRSGSMSGTNIHRVKDAMLVALKSLMPACLFNVIGFGSTFKTLFPSSQTYSEESV 421 Query: 300 EVDEHEFFYSQET-GGTIVSSALKLMDEVVKERYNPAQWNIYAAQASDGDNWADDSPLCH 358 + + GGT + S LK + R +DG A ++ Sbjct: 422 AMACDNIQRMRADMGGTNILSPLKWIIRQPVHR----GHPRLLFLITDG---AVNNTGKV 474 Query: 359 EILAKKLLPVVRYYSYIEITRRAHQTLWREYEHLQSTFDNFAMQHIRDQDDIYPVFRE 416 L + R YS+ I L R + F ++ R Q + ++ Sbjct: 475 LELLRNHAFSTRCYSFG-IGPNVCHRLVRGLATVSKGSAEFLVEGERLQPKMIKSLKK 531 >UniRef50_Q4TBC0 Chromosome undetermined SCAF7164, whole genome shotgun sequence n=1 Tax=Tetraodon nigroviridis RepID=Q4TBC0_TETNG Length = 1636 Score = 60.5 bits (145), Expect = 1e-07, Method: Composition-based stats. Identities = 35/202 (17%), Positives = 64/202 (31%), Gaps = 27/202 (13%) Query: 162 ANISVVRSLQNSLARRTAMTAGKRRELHALEENLAIISNSEPAQLLEEERLRKEIAELRA 221 N ++ +++N+ R+ + + ++ + R+ + L Sbjct: 261 TNKDLLTAVKNAQQRKLQPNEPRNLGKALQYAYKNFFTPEAGSRNDQS--FRQYLVVLTG 318 Query: 222 KIERVPFIDTFDLRYKNYEKRPDPSSQAVMFCLMDVSGSMDQSTKDMAKRFYILLYLFLS 281 K P + + N A + ++D SGS+ S + + F L L Sbjct: 319 KDADDPVYEEAHCKAANI---------ADIVFIIDESGSIGSSDFQLVRTFLHSLVSGLE 369 Query: 282 RTYKNVEVVYIRHHTQAK-EVDEHEFFYSQE-----------TGGTIVSSALKLMDEVV- 328 + V V + +H + K EV + F E GGT +AL V Sbjct: 370 VSPNRVRVGIVVYHGEPKAEVFLNTFTDKSELLDFIRILPYHGGGTNTGAALNFTQHQVF 429 Query: 329 ---KERYNPAQWNIYAAQASDG 347 K A +DG Sbjct: 430 VREKGSRIELGVQQVAVVITDG 451 Score = 60.5 bits (145), Expect = 1e-07, Method: Composition-based stats. Identities = 29/159 (18%), Positives = 49/159 (30%), Gaps = 21/159 (13%) Query: 247 SQAVMFCLMDVSGSMDQSTKDMAKRFYILLYLFLSRTYKNVEVVYIRHHTQAKE------ 300 +A + L+D SGS+ K+F I L + V V + + K+ Sbjct: 1043 QKADLVFLLDQSGSIQSDDYTTMKKFTIDLINKFQISRDLVHVGLAQFSSTFKDEFYLNK 1102 Query: 301 -VDEHEF-----FYSQETGGTIVSSA---LKLMDEVVKERYNPAQWNIYAAQASDGDNWA 351 DE QE GGT++ A ++ E + +DGD+ Sbjct: 1103 FFDEQAISAHIKDMQQEEGGTLIGLALNSIRKYFEASHGSRKAEGISQNLVLITDGDSQD 1162 Query: 352 DDSPLCHEILAKKLLPVVRYYSYIEITRRAHQTLWREYE 390 D +LL + + H + Sbjct: 1163 DVEEAA------RLLRGLGVEVFAIGIGNVHDLELLQIA 1195 Score = 47.1 bits (110), Expect = 0.001, Method: Composition-based stats. Identities = 26/184 (14%), Positives = 49/184 (26%), Gaps = 23/184 (12%) Query: 245 PSSQAVMFCLMDVSGSMDQSTKDMAKRFYILLYLFLSRTYKNVEVVYIRHHTQAKEV--- 301 + L+D SGS+ K F L + V V +++ T+ K V Sbjct: 649 KDVPGDLIFLIDSSGSIYPEDYQKMKDFMKSLVQKSNIGKDQVHVGVLQYSTEQKLVFPL 708 Query: 302 --------DEHEFFYSQE-TGGTIVSSALKLM--DEVVKERYNPAQWNIYAAQASDGDNW 350 Q+ GGT A+ ++ + P +DG+ Sbjct: 709 IQYYTKDQLSKAIDDMQQIGGGTHTGEAIAVVSKYFDAQNGGRPDLKQR-LVVVTDGE-- 765 Query: 351 ADDSPLCHEILAKKLLPVVRYYSYIEITRRAHQTLWREYEHLQSTFDNFAMQHIRDQDDI 410 + D + +V + + E +A + D+ Sbjct: 766 SQDDVKLPAEALRAKGVIVYSIGVVAANTS------QLLEISGDADRMYAERDFDALKDL 819 Query: 411 YPVF 414 Sbjct: 820 EKQM 823 Score = 45.5 bits (106), Expect = 0.004, Method: Composition-based stats. Identities = 24/148 (16%), Positives = 44/148 (29%), Gaps = 26/148 (17%) Query: 246 SSQAVMFCLMDVSGSMDQSTKDMAKRFYILLYLFLS---RTYKNVEVVYIRHHT------ 296 + +A +F L+D SGS+ K+F + + V Y T Sbjct: 455 TEEADIFFLIDQSGSIHPPDFYDMKKFILEFLQTFRVGPNHVRIGVVKYADSPTLEFDLH 514 Query: 297 --QAKEVDEHEFFYS-QETGGTIVSSALKLMDEVVKERYNPAQWNI------YAAQASDG 347 + E Q GGT + ++ +++ A Y +DG Sbjct: 515 TYTDVKSLEKAITNIHQVGGGTETG----KALDFMRPQFDRAVTTRGHKVKEYLVVITDG 570 Query: 348 DNWADDSPLCHEILAKKLLPVVRYYSYI 375 + + + K V Y+ Sbjct: 571 ----NSTDKVKDPADKLRAQGVVVYAIG 594 >UniRef50_Q0AW90 Conserved putative chloride channel n=1 Tax=Syntrophomonas wolfei subsp. wolfei str. Goettingen RepID=Q0AW90_SYNWW Length = 951 Score = 60.5 bits (145), Expect = 1e-07, Method: Composition-based stats. Identities = 18/169 (10%), Positives = 43/169 (25%), Gaps = 18/169 (10%) Query: 265 TKDMAKRFYILLYLFLSRTYKNVEVVYIR--------HHTQAKEVDEHEFFYSQETGGTI 316 ++AK I L V + + K+ + + + GGT Sbjct: 428 KVELAKEAAIQATSILGPLDMAGVVAFDDTAQWVVEFQAVKDKDAIQDDIATIRADGGTS 487 Query: 317 VSSALKLMDEVVKERYNPAQWNIYAAQASDGDNWADDSPLCHEILAKKLLPVVRYYSYIE 376 + AL L +K+ + + +DG + + + + E Sbjct: 488 IYPALALAYTALKDAHTKFKH---IILLTDGQSATTGDYYFLSRRMARAGITMSTVAVGE 544 Query: 377 ITRRAHQTLWREYEHLQSTFDNFAMQHIRDQ----DDIYPVFRELFHKQ 421 L + F+ + + + ++ Sbjct: 545 GA---DTLLLEQLAAWGQGRYYFSDEISNIPRIFTKETMKAIKSYLVEE 590 >UniRef50_UPI0001913F8A hypothetical protein Salmonellaentericaenterica_26029 n=2 Tax=Salmonella enterica subsp. enterica serovar Typhi RepID=UPI0001913F8A Length = 88 Score = 60.5 bits (145), Expect = 1e-07, Method: Composition-based stats. Identities = 78/88 (88%), Positives = 82/88 (93%) Query: 145 LTEYKTHRAGYTANGVPANISVVRSLQNSLARRTAMTAGKRRELHALEENLAIISNSEPA 204 LT KTHRAG+T+NGVPANISVVRSLQNSLARRTAMTAGKRRELHALE L IS+SEPA Sbjct: 1 LTSNKTHRAGFTSNGVPANISVVRSLQNSLARRTAMTAGKRRELHALETELETISHSEPA 60 Query: 205 QLLEEERLRKEIAELRAKIERVPFIDTF 232 QLLEEERLR+EIAELRAKIERVPFIDTF Sbjct: 61 QLLEEERLRREIAELRAKIERVPFIDTF 88 >UniRef50_D1C680 ATPase associated with various cellular activities AAA_5 n=2 Tax=Sphaerobacter thermophilus DSM 20745 RepID=D1C680_SPHTD Length = 658 Score = 60.5 bits (145), Expect = 1e-07, Method: Composition-based stats. Identities = 48/340 (14%), Positives = 94/340 (27%), Gaps = 32/340 (9%) Query: 26 YKAQIKQSISEAINKRSVTDVDSG--ESVSIPTEDISEPMFHQGRGGLRHRVHPGNDHFV 83 + ++S +++I + V V G E+ P +R G+ Sbjct: 270 VRESSRRS-ADSIVEEIVLAVLDGQPEAPRYPDPGGPGRRPADHESSIRPGAGRGSGTGR 328 Query: 84 QNDRIERPQGGGGGSGSGQGQASQDGEGQDEFVFQISKDEYLDLLFEDLALPNLKQNQQR 143 R + G+ + + S + F + P++ R Sbjct: 329 ATSRPPDHPVAPSHVNTTGSGRVYLGDEALDAAARRSASQRAVRAFAE-RHPHV-TRALR 386 Query: 144 QLTEYKTHRAGYTANGVPANISV--------VRSLQNSLARRTAMTAGKRRELHALEENL 195 +L +T A+G P + + +L R A R L Sbjct: 387 RLPGVRTALETSLADGPPLPREILGEVHHLAGHADHRALVDRIARDVLVRMAQRDL---- 442 Query: 196 AIISNSEPAQLLEEERLRKEIAELRAKIERVPFIDTFDLRYKNYEKRPDPSSQAVMFCLM 255 L R E AEL ++ ++ ++ + ++ Sbjct: 443 ---DQHPGPGRLTSTPYRGEAAELDLDRSLERVLEQPEVTDEDLVVIERRPRKRAYALML 499 Query: 256 DVSGSMDQSTKDMAKRFYILLYLFLSRTYKNVEVVYIRHHTQAKEVDE--------HEFF 307 DVSGSM + A + + + + + R K +DE Sbjct: 500 DVSGSMQGAAIYEAALALAAVAVRVDP-DPFAVIAFWRDAAVLKRLDEPVDLDHLIDRVL 558 Query: 308 YSQETGGTIVSSALKLMDEVVKERYNPAQWNIYAAQASDG 347 G T ++ L++ + + SDG Sbjct: 559 SLPGRGLTNLALGLRVGLDELSRASTQE---RVGLLFSDG 595 >UniRef50_B5HZU2 VWA domain-containing protein n=1 Tax=Streptomyces sviceus ATCC 29083 RepID=B5HZU2_9ACTO Length = 518 Score = 60.5 bits (145), Expect = 1e-07, Method: Composition-based stats. Identities = 28/258 (10%), Positives = 65/258 (25%), Gaps = 30/258 (11%) Query: 153 AGYTANGVPANISVVRSLQNSLARRTAMTAGKRRELHALEENLAIISNSEPAQLLEEERL 212 A Y + + + R ++ +R ++ Sbjct: 255 ADYPLTSLRSTSARTREDVRRVSEDLRTERIQREITARTHR----------RPVVASVPP 304 Query: 213 RKEIAELRAKIERVPF-IDTFDLRYKNYEKRPDPSSQAVMFCLMDVSGSMDQSTKDMAKR 271 + R + P D +YE S+ ++D SGSM+ D K Sbjct: 305 ASGLDTTRRRELPFPGTRSVADGLLDSYENELRRPSRT--VYVLDTSGSMEGDRLDRLKT 362 Query: 272 FYILLYLFLSRTYKNVEVVYIRHHTQAK-------------EVDEHEFFYSQETGGTIVS 318 L + + + + + + G T + Sbjct: 363 ALADLTGDFREREEVTLMPFGSQVKSVRTHVVKPSDPRAGLDAIRDDTSALSADGDTAIY 422 Query: 319 SALKLMDEVVKERYNPAQWNIYAAQASDGDNWADDSPLCHEILAKKLLPVVR-YYSYIEI 377 ++L+ + + + +DG+N A + +L R + + Sbjct: 423 TSLEKAYDHLGAGRDAFT---SIVLMTDGENTAGAKARDFDAFYARLGRKARDTPVFPIL 479 Query: 378 TRRAHQTLWREYEHLQST 395 + ++ L Sbjct: 480 FGDSDRSELAHIADLTGG 497 >UniRef50_A1S119 von Willebrand factor, type A n=1 Tax=Thermofilum pendens Hrk 5 RepID=A1S119_THEPD Length = 327 Score = 60.5 bits (145), Expect = 1e-07, Method: Composition-based stats. Identities = 27/182 (14%), Positives = 53/182 (29%), Gaps = 17/182 (9%) Query: 247 SQAVMFCLMDVSGSMDQS-----TKDMAKRFYILLYLFLSRTYKN------VEVVYIRHH 295 ++ ++DVSGSM+ S ++A+R LL + +V Sbjct: 98 ARPAAVLVVDVSGSMEDSIPGGVKIEVARRAATLLVERMPGGVDVGLLAFSDRIVLSLPP 157 Query: 296 TQAKEVDEHEFFYSQETGGTIVSSALKLMDEVVKERYNPAQWNIYAAQASDGDNWADDSP 355 T + + GGT+ + L+ +K + SDG + Sbjct: 158 TGDRRRVLDAIESLKPGGGTMYTYPLQAALSWLKPYKLFNASTLVVF-VSDGLPADAATY 216 Query: 356 LCHEILAKKLLPVVRYYSYIEITRRAHQTLWREYEHLQSTFDNFAMQHIRDQDDIYPVFR 415 + L V YI + + + +++ F+ Sbjct: 217 RTLLSEFRSLGIPVYTV-YIGPGGDEGERELKLIAGSTGGEEY----TAGSAEELLKAFK 271 Query: 416 EL 417 L Sbjct: 272 TL 273 >UniRef50_O00534 von Willebrand factor A domain-containing protein 5A n=8 Tax=Theria RepID=VMA5A_HUMAN Length = 786 Score = 60.5 bits (145), Expect = 1e-07, Method: Composition-based stats. Identities = 33/206 (16%), Positives = 55/206 (26%), Gaps = 29/206 (14%) Query: 236 YKNYEKRPDPSSQAVMFCLMDVSGSMDQS---------TKDMAKRFYILLYLFLSRT--- 283 Y N + ++ LMD SGSM AK ILL L Sbjct: 267 YPNIPEDQPSNTCGEFIFLMDRSGSMQSPMSSQDTSQLRIQAAKETLILLLKSLPIGCYF 326 Query: 284 -----YKNVEVVYIRHHTQAKEVDEHEFFYS---QET-GGTIVSSALKLMDEVVKERYNP 334 + E + ++ E Q GGT + + L+ + + P Sbjct: 327 NIYGFGSSYEACFPESVKYTQQTMEEALGRVKLMQADLGGTEILAPLQNIY---RGPSIP 383 Query: 335 AQWNIYAAQASDGDNWADDSPLCHEILAKKLLPVVRYYSYIEITRRAHQTLWREYEHLQS 394 + +DG+ D + E+ + + E T +L + Sbjct: 384 -GHPLQLFVFTDGE-VTDTFSVIKEVRINRQKHRCFSFGIGEGTST---SLIKGIARASG 438 Query: 395 TFDNFAMQHIRDQDDIYPVFRELFHK 420 F R Q + Sbjct: 439 GTSEFITGKDRMQSKALRTLKRSLQP 464 >UniRef50_Q28XX9 GA11538 n=5 Tax=Drosophila RepID=Q28XX9_DROPS Length = 1196 Score = 60.5 bits (145), Expect = 1e-07, Method: Composition-based stats. Identities = 34/185 (18%), Positives = 59/185 (31%), Gaps = 19/185 (10%) Query: 221 AKIERVPFIDTFDLRYKNYEKRPDPSSQAVMFCLMDVSGSMDQSTKDMAKRFYILLYLFL 280 +K ++ +D +D R +++ +S + LMD SGSM D+AK + L Sbjct: 239 SKWKKDVPVDLYDCRLRSW-YMEAATSPKDIVILMDGSGSMLGQRLDIAKHVVNTILDTL 297 Query: 281 SRTYKNVEVVYIRHHTQAKEVDEHEFFYSQET----GGTIVS---SALKLMDEVVKER-- 331 + KE +E G ++ +AL E+++E Sbjct: 298 GTNDFVNIFTF------DKEATLGNIRELKEGIENFGPKSIANYTAALTRAFEILEEAKS 351 Query: 332 -YNPAQWNIYAAQASDGDNWADDSPLCHEILAKKLLPVVRYYSYIEITR--RAHQTLWRE 388 AQ N DG + VR ++Y+ W Sbjct: 352 TSRGAQCNQAIMIIGDGAPENNREVFELHNWRDPPYKPVRVFTYLIGKEVANWDDIRWMA 411 Query: 389 YEHLQ 393 E+ Sbjct: 412 CENQG 416 >UniRef50_UPI00006CC819 von Willebrand factor type A domain containing protein n=1 Tax=Tetrahymena thermophila RepID=UPI00006CC819 Length = 930 Score = 60.2 bits (144), Expect = 1e-07, Method: Composition-based stats. Identities = 26/193 (13%), Positives = 50/193 (25%), Gaps = 20/193 (10%) Query: 236 YKNYEKRPDPSSQAVMFCLMDVSGSMDQSTKDMAKRFYILLYLFLSRTYKNVEVVYIR-- 293 + SS+ L+D SGSM + A IL L + Sbjct: 312 FNQQIIDQTDSSKCEFIFLLDRSGSMSGQSIQNAIEALILFIKSLPLDSYFNIYSFGTEF 371 Query: 294 ---------HHTQAKEVDEHEFFYSQET-GGTIVSSALKLMDEVVKERYNPAQWNIYAAQ 343 + + E+ +E GGT + L E+ + Y + Sbjct: 372 SKLFDQSQKYSNENVELALNEIITYSANYGGTNIYQPLS---EIFNQPYVK-GYGRQIYI 427 Query: 344 ASDGDNWADDSPLCHEILAKKLLPVVRYYSYIEITRRAHQTLWREYEHLQSTFDNFAMQH 403 +DG ++ L + R ++ + + Sbjct: 428 LTDGQ---IENKENVMHLIQSNNISNRVHAIGIGLYVDKDLIIQS-AKSGKGCHAHVTDQ 483 Query: 404 IRDQDDIYPVFRE 416 Q+ I + + Sbjct: 484 SLIQESIINILQN 496 >UniRef50_B7Q0X7 Calcium activated chlorine channel, putative (Fragment) n=1 Tax=Ixodes scapularis RepID=B7Q0X7_IXOSC Length = 704 Score = 60.2 bits (144), Expect = 1e-07, Method: Composition-based stats. Identities = 24/189 (12%), Positives = 52/189 (27%), Gaps = 26/189 (13%) Query: 244 DPSSQAVMFCLMDVSGSMDQSTK--DMAKRFYILLYLFLSRTYKNVEVVYIRHHTQA--- 298 + ++DVSGSM K + + L + + + + T Sbjct: 58 KGQGHIRIVFVLDVSGSMGLENKINMLRQAASRSLEDNVPDGSDVGIITFSDNATVVAGM 117 Query: 299 -------KEVDEHEFFYSQETGGTIVSSAL-----KLMDEVVKERYNPAQWNIYAAQASD 346 ++ ++ G T + AL K + ++ N +D Sbjct: 118 RTLSAATRQAIKNAVPSI-ARGSTAIGKALMTSVQKHGPQELERN-GETAENAALLLMTD 175 Query: 347 G-DNWADDSPLCHEILAKKLLPVVRYYSYIEITRRAHQTLWREYEHLQSTFDNFAMQHIR 405 G +N L +K L V + + R ++ + + + + Sbjct: 176 GEENEPPYINDVLPTLLQKRLRV-----FSVPVGKEADDGLRVLSE-RTGENVYPITNTT 229 Query: 406 DQDDIYPVF 414 D Sbjct: 230 KLADRLAEM 238 >UniRef50_Q5SKK6 Putative uncharacterized protein TTHA0637 n=4 Tax=Thermaceae RepID=Q5SKK6_THET8 Length = 407 Score = 60.2 bits (144), Expect = 1e-07, Method: Composition-based stats. Identities = 32/224 (14%), Positives = 60/224 (26%), Gaps = 19/224 (8%) Query: 132 LALPNLKQNQQRQL----TEYKTHRAGYTANGVPANISVVRSLQNSLARRTAMTAGKRRE 187 L LP Q + + +R L +L R Sbjct: 104 LRLPGEDPTDPAQGGYRGEAGEARFELTEKASDFLGLKSLRELLGALGRNPPGLHPTPHH 163 Query: 188 LHALEENLAIISNSEPAQLLEEERLRKEIAELRAKIERVPFIDTFDLRYKNYEKRPDPSS 247 +E+ L + A + DL + ++ Sbjct: 164 APGVEKTGETKPWEWGDPLELNVPETLKKAMAKGLERLSHEDLVIDL--------AEYTA 215 Query: 248 QAVMFCLMDVSGSM---DQSTKDMAKRFYILLYLFLSRTYKNVEVVYIRHHTQAKEVDEH 304 L+D S SM + AK+ + L + Y V ++ H A+E+ Sbjct: 216 SMSTVVLLDCSHSMILYGEDRFTPAKKVALALAHLIRTQYPGDRVRFVLFHDTAEEIPLA 275 Query: 305 EFFYSQETG-GTIVSSALKLMDEVVKERYNPAQWNIYAAQASDG 347 + Q T + L+L ++++ +DG Sbjct: 276 KLPLVQVGPYHTNTKAGLELARTLLRKM---GGEMKQIILITDG 316 >UniRef50_Q233P7 von Willebrand factor type A domain containing protein n=1 Tax=Tetrahymena thermophila SB210 RepID=Q233P7_TETTH Length = 790 Score = 60.2 bits (144), Expect = 1e-07, Method: Composition-based stats. Identities = 27/171 (15%), Positives = 44/171 (25%), Gaps = 25/171 (14%) Query: 258 SGSMDQSTKDMAKRFYILLYLFLSRTYKNVEVVY-----------IRHHTQAKEVDEHEF 306 S SM A IL L V + I++ T + E Sbjct: 236 SRSMSGQPIQKACEALILFLKSLPIDSYFNVVSFGSSYEKLFQSSIKYDTNSLEKAIKII 295 Query: 307 FYSQET-GGTIVSSALKLMDEVVKERYNPAQWNIYAAQASDGDNWADDSPLCHEILAKK- 364 GGT + L+ + + +N +DG+ SP L +K Sbjct: 296 KNYTADLGGTEIYKPLQSVF----KETKIDGYNKQIFLLTDGE---VKSPKEVVKLIRKN 348 Query: 365 -LLPVVRYYSYIEITRRAHQTLWREYEHLQSTFDNFAMQHIRDQDDIYPVF 414 L + + + L E + D +I Sbjct: 349 NKLNRINSIGFGSGADKY---LIEESAITGKGISK-IVDLKCDLSEIIIEM 395 >UniRef50_B9Z3T2 Cobaltochelatase subunit n=1 Tax=Lutiella nitroferrum 2002 RepID=B9Z3T2_9NEIS Length = 653 Score = 60.2 bits (144), Expect = 1e-07, Method: Composition-based stats. Identities = 37/217 (17%), Positives = 64/217 (29%), Gaps = 19/217 (8%) Query: 140 NQQRQLTEYKTHRAGYTANGVPANISVVRSLQNSLARRTAMTAGKRRELHALEENLAIIS 199 N + + A G PA V R +L A E L + Sbjct: 371 NDTDDNAQPGSAAEQLFAAGTPA--GVGRIEVAALHASEASGRRSSSEGTFHGHTLRAVP 428 Query: 200 NSEPAQLLEEERLRKEIAELRAKIERVPFIDTFDLRYKNYEKRPDPSSQAVMFCLMDVSG 259 + P+QL + L+ + R ++K ++ ++D SG Sbjct: 429 SEAPSQLALDATLKSALLRNPGDFSVT--------RADLHDKVRVGKQANLILLVVDASG 480 Query: 260 SMDQSTKDMAKR--FYILLYLFLSRTYKNVEVVY-------IRHHTQAKEVDEHEFFYSQ 310 SM + A + LL R + + + + T+ + E Sbjct: 481 SMAAKRRMEAVKGCVLGLLQDAYQRRDQVAVIAFRGNAAELVLPPTRQIDQAERVLSELP 540 Query: 311 ETGGTIVSSALKLMDEVVKERYNPAQWNIYAAQASDG 347 G T ++ AL+L E + SDG Sbjct: 541 TGGRTPLAHALQLTAETLARHARADNLTPLLVVLSDG 577 >UniRef50_C0D9H7 Putative uncharacterized protein n=1 Tax=Clostridium asparagiforme DSM 15981 RepID=C0D9H7_9CLOT Length = 1360 Score = 60.2 bits (144), Expect = 1e-07, Method: Composition-based stats. Identities = 23/216 (10%), Positives = 58/216 (26%), Gaps = 11/216 (5%) Query: 195 LAIISNSEPAQLLEEERLRKEIAELRAKIERVPFIDTFDLRYKNYEKRPDPSSQAVMFCL 254 ++ + S +++ ++ + +++ ++ + Y S A M Sbjct: 495 ISSLDASAFSEITAYVQIETPVDYSIDELKSHITVEDCGAQISEYNLEKVEYSSANMLLC 554 Query: 255 MDVSGSMDQSTKDMAKRFYILLYLFLSRTYKNVEVVY------IRHHTQAKEVDEHEFFY 308 DVSGSM + ++ I + +S + +++ + T +V Sbjct: 555 CDVSGSMQGRPIEDSRAAVISMAESMSGNARLGVILFNSSVQGLTDFTVQPDVIRSTAES 614 Query: 309 SQETGGTIVSSALKLMDEVVKERYNPAQWNIYAAQASDGDNWADDSPLCHEILAKKLLPV 368 GGT + + E + + SDG S + + Sbjct: 615 MTANGGTNIFDTVVHGLESFPKNGPEVLNTLVV--MSDGQENNAHSAEEIQTAIGQAAKD 672 Query: 369 VRYY--SYIEITRRAHQTLWREYEHLQSTFDNFAMQ 402 + + + Sbjct: 673 KSILVHCLG-LGSEVDANYLQTIAQSAGGTYQYVTD 707 >UniRef50_D1VLF3 von Willebrand factor type A n=1 Tax=Frankia sp. EuI1c RepID=D1VLF3_9ACTO Length = 372 Score = 60.2 bits (144), Expect = 1e-07, Method: Composition-based stats. Identities = 21/169 (12%), Positives = 40/169 (23%), Gaps = 25/169 (14%) Query: 236 YKNYEKRPDPSSQAVMFCLMDVSGSMDQSTKD-----MAKRFYILLYLFLSRTYKNVEVV 290 + P S+ + +DVSGSM + A++ + V Sbjct: 73 ARPQATVPITSNSTTIMLALDVSGSMCSTDVPPNRITAAEKAATAFIKAQPAGSRIGLVT 132 Query: 291 YIR------HHTQAKEVDEHEFFYSQETGGTIVSSALKLMDEVVKERYN---PAQWNI-- 339 + T + + GT + + + + + P + Sbjct: 133 FSGIAGLLVPPTTDSQKLLDALQNLTTSRGTAIGQGILTSIDAIADADPSVAPTGSAVSG 192 Query: 340 ---------YAAQASDGDNWADDSPLCHEILAKKLLPVVRYYSYIEITR 379 +DG N P A V + T Sbjct: 193 NGTGPYAADVIVVLTDGANTQGVDPQTAAKQAAARRLRVYTIGFGTTTP 241 >UniRef50_UPI000180B5AF PREDICTED: similar to Clca1 protein n=1 Tax=Ciona intestinalis RepID=UPI000180B5AF Length = 1034 Score = 60.2 bits (144), Expect = 1e-07, Method: Composition-based stats. Identities = 33/187 (17%), Positives = 61/187 (32%), Gaps = 30/187 (16%) Query: 241 KRPDPSSQAVMFCLMDVSGSMDQSTKDMAKRFYILLY-LFLSRTYKNVEVVYIRHHTQAK 299 + P ++DVSGSM ++ F+ R ++ T A+ Sbjct: 310 RVVKPFPYRSFVLVLDVSGSMWGGRLTKMRQIMNTFVDDFVQRGDYVGITIF---STIAR 366 Query: 300 EV-------DEHEFFYS------QETGGTIVSSALKLMDEVVKERYNPAQWNIYAAQASD 346 ++ D+ + G T + + +++++ +I +D Sbjct: 367 KLSPLTRIRDQSDRASLIRRLPRSVRGSTCIGCGINSAVQIMEQHSPDLCGDIIVF--TD 424 Query: 347 G-DNWADDSPLCHEILAKKLLPVVRYYSYIEITRRAHQTLWREYEHLQSTFDNFAMQHIR 405 G +N H+ + KK V + T A+Q L R S + Sbjct: 425 GEENVEPSVADVHDKVVKKKCRVSAVF----FTTTANQALVR-LVDATSGTWFY-----G 474 Query: 406 DQDDIYP 412 D DDI P Sbjct: 475 DTDDITP 481 >UniRef50_A4YDL0 von Willebrand factor, type A n=1 Tax=Metallosphaera sedula DSM 5348 RepID=A4YDL0_METS5 Length = 394 Score = 60.2 bits (144), Expect = 2e-07, Method: Composition-based stats. Identities = 27/217 (12%), Positives = 58/217 (26%), Gaps = 49/217 (22%) Query: 238 NYEKRPDPSSQAVMFCLMDVSGSMD---------------------------------QS 264 +F ++DVSGSM + Sbjct: 26 TIVPERVKPVPLDLFIVLDVSGSMGIIDNPPEVDDSLIAGTAEVDGHVVRYLKDDIGVNN 85 Query: 265 TKDMAKRFYILLYLFLSRTYKNVEVVYIRHHTQA--KEVDEHEFFYSQE---TGGTIVSS 319 ++A L + + + + H + + +E G T + S Sbjct: 86 RLEVALEAIRNLLENADTSTRVTIITFSDHVNVLCRRVTPSTALEHLEEIVPDGNTALYS 145 Query: 320 ALKLMDEVVKERYNPAQWNIYAAQASDGDNWADDSPLCHEILAKKLLPVVRYYSYIEITR 379 A+K ++ E PA+ +DG + D E ++ ++ Sbjct: 146 AVKKAISLIDEH--PAR----VLLITDG--YPTDVEDETEYSKLEVPRFSQFIPIGV--G 195 Query: 380 RAHQTLWREYEHLQSTFDNFAMQHIRDQDDIYPVFRE 416 + + R L + + + + + I R Sbjct: 196 EYNAKILRSLADLSNGRF-YHVNDVSEISRIMEEERA 231 >UniRef50_D2LNF7 von Willebrand factor type A n=3 Tax=Aciduliprofundum boonei T469 RepID=D2LNF7_9EURY Length = 2166 Score = 60.2 bits (144), Expect = 2e-07, Method: Composition-based stats. Identities = 21/155 (13%), Positives = 40/155 (25%), Gaps = 37/155 (23%) Query: 240 EKRPDPSSQAVMFCLMDVSGSM-----------------DQSTKDMAKRFYILLYLFLSR 282 R + + ++D SGSM + D+A + I L Sbjct: 1242 APRLNKRKPIDIIFVIDTSGSMNSVVPGATVGDVNGDGRSNTRIDVAIQAAIDAVKELGP 1301 Query: 283 TYKNVEVVYIRHHT------------QAKEVDEHEFFYSQETGGTIVSSALKLM--DEVV 328 + + + + Q GGT + L Sbjct: 1302 QDRVAVFTFDGNSHPEEYMGFTYVTADNLPTIISDLKDIQADGGTPLYDTLSWAVYYMDT 1361 Query: 329 KERYNPAQWN--IYAAQASDG----DNWADDSPLC 357 + NP + + +DG DN+ ++ Sbjct: 1362 QSADNPDREDATRGILVLTDGLSNSDNYGPNNVRN 1396 >UniRef50_Q73PP7 Magnesium chelatase, subunit D/I family n=1 Tax=Treponema denticola RepID=Q73PP7_TREDE Length = 643 Score = 60.2 bits (144), Expect = 2e-07, Method: Composition-based stats. Identities = 31/182 (17%), Positives = 53/182 (29%), Gaps = 20/182 (10%) Query: 233 DLRYKNYEKRPDPSSQAVMFCLMDVSGSMDQST--KDMAKRFYILLYLFLSRTYKNVEVV 290 D ++K + R A + L+D SGSM K+ LL + + + Sbjct: 434 DYKFKRRKTR----IGASIIFLVDASGSMGAMKRMKETKNTILSLLMDSYQKHDEVSMIT 489 Query: 291 YIRHH-------TQAKEVDEHEFFYSQETGGTIVSSALKLMDEVVK-ERYNPAQWNIYAA 342 + T++ + + E G T ++ L E K R Sbjct: 490 FAGTRVEIILPFTRSVLLAKRELQLIPTIGKTPLALGLNKALEYFKIHRLKNKDMIPLLF 549 Query: 343 QASDGDN----WADDSPLCHEILAKKLLPVVRYYSYIEIT--RRAHQTLWREYEHLQSTF 396 +DG D P+ + K + YS + T L E + Sbjct: 550 LITDGRTNHGSVFFDEPIKDALFISKKIKNANIYSVVIDTESGFVKLALAEEVAKNLNAR 609 Query: 397 DN 398 Sbjct: 610 YY 611 >UniRef50_A4BKH3 Putative uncharacterized protein n=1 Tax=Reinekea blandensis MED297 RepID=A4BKH3_9GAMM Length = 553 Score = 60.2 bits (144), Expect = 2e-07, Method: Composition-based stats. Identities = 27/182 (14%), Positives = 50/182 (27%), Gaps = 14/182 (7%) Query: 237 KNYEKRPDPSSQAVMFCLMDVSGSMDQSTKDMAKRFYILLYLFLSRTYKNVEVVYI---- 292 + ++ + + DVSGSMD K F+S V + Sbjct: 367 RIWKDKKSGGRPIAAMFVADVSGSMDGDRIRALKIALDESANFVSSRNSIGLVTFNDRVN 426 Query: 293 ------RHHTQAKEVDEHEFFYSQETGGTIVSSAL-KLMDEVVKERYNPAQWNIYAAQAS 345 Q K GGT + A+ E++ + + S Sbjct: 427 VDLPIREFDLQQKSQFLGAVERMSAGGGTATNDAILVAAHELLNFAKTHPEHKLTIFVLS 486 Query: 346 DGDNWADDSPLCHEILAKKLLPVVRYYSYIEITRRAH--QTLWRE-YEHLQSTFDNFAMQ 402 DG+ E + + L V +Y + L Y + + + Sbjct: 487 DGETRNGLPLGDVEKVIQMLNIPVHSIAYGFESADLKKVSGLVEASYTESSTGSAAYQIG 546 Query: 403 HI 404 ++ Sbjct: 547 NL 548 >UniRef50_C3NI41 von Willebrand factor type A n=9 Tax=Sulfolobus RepID=C3NI41_SULIN Length = 356 Score = 60.2 bits (144), Expect = 2e-07, Method: Composition-based stats. Identities = 23/173 (13%), Positives = 55/173 (31%), Gaps = 10/173 (5%) Query: 246 SSQAVMFCLMDVSGSMDQSTKDMAKRFYILLYLFLSRTYKNVEVVYIRHHTQAKEVDEHE 305 +S ++D S SM + A + L L+ +++ H + Sbjct: 36 TSSIHYIIMIDNSPSMRGEKLNTAVQSAQKLLYNLNEGNYVTLILFSNHPEIKYQGPAKG 95 Query: 306 FFYSQETGG--TIVSSALKLMDEVVKERYNPAQWNIYAAQASDGDNWADDSPLCHEILAK 363 G T + A+ + K+ P + +DG + +E L Sbjct: 96 IITFDVGKGYTTRLHEAVSFTINLAKQSQVPTK----IIMLTDGKPTDKRNVKDYEKL-- 149 Query: 364 KLLPVVRYYSYIEITRRAHQTLWREYEHLQSTFDNFAMQHIRDQDDIYPVFRE 416 + P + + ++ + ++ S + ++ I + +I+ R Sbjct: 150 DIPPNTQIITIGIGN-NYNERILKKLADRSSGKF-YHIKDISELPNIFEGQRT 200 >UniRef50_UPI000180CF38 PREDICTED: similar to calcium activated chloride channel n=1 Tax=Ciona intestinalis RepID=UPI000180CF38 Length = 624 Score = 59.8 bits (143), Expect = 2e-07, Method: Composition-based stats. Identities = 25/139 (17%), Positives = 44/139 (31%), Gaps = 19/139 (13%) Query: 243 PDPSSQAVMFCLMDVSGSMDQST-----KDMAKRFYILLYL------FLS----RTYKNV 287 + ++DVSGSMD K F L +L +V Sbjct: 44 KRKAVCKRYVLVIDVSGSMDSVAGGQTLMQRMKTFARLFINKAATYSWLGIVAFSNDAHV 103 Query: 288 EVVYIRHHTQAKEVDEHEFFYSQETGGTIVSSALKLMDEVVKERYNPAQWNIYAAQASDG 347 + + + Q KE + G T +S+ L L +++K+R + +DG Sbjct: 104 VLRLTQMNKQGKEKATRAVQTLRTEGLTNISAGLFLALDLIKDRSHSRD---SIILFTDG 160 Query: 348 -DNWADDSPLCHEILAKKL 365 N ++ Sbjct: 161 AANCGITEAGPLITEYREK 179 >UniRef50_UPI000180B52F PREDICTED: similar to predicted protein n=1 Tax=Ciona intestinalis RepID=UPI000180B52F Length = 652 Score = 59.8 bits (143), Expect = 2e-07, Method: Composition-based stats. Identities = 28/148 (18%), Positives = 45/148 (30%), Gaps = 23/148 (15%) Query: 238 NYEKRPDPSSQA--VMFCLMDVSGSMDQST-------KDMAKRFYILLYLFLSRTYKNVE 288 E + P + L+DVSGSMD + D K+F L S T Sbjct: 39 ELEAKGKPEAGIFNRFVLLIDVSGSMDHTADGQSCTLLDRMKKFIELFIDNASDTSWIGI 98 Query: 289 VVYIRHHT----------QAKEVDEHEFFYSQETGGTIVSSALKLMDEVVKERYNPAQWN 338 V + +AK + T +S+ L + EV+++ Sbjct: 99 VTFSSTANVIMELKQMTAEAKVFAKTTVLSLTTESRTNISAGLFMALEVIQKLKPSRD-- 156 Query: 339 IYAAQASDG-DNWADDSPLCHEILAKKL 365 +DG N K++ Sbjct: 157 -CIIVFTDGVANEGIVDSGTLIQEYKRI 183 >UniRef50_A6C7T1 Putative uncharacterized protein n=1 Tax=Planctomyces maris DSM 8797 RepID=A6C7T1_9PLAN Length = 313 Score = 59.8 bits (143), Expect = 2e-07, Method: Composition-based stats. Identities = 35/271 (12%), Positives = 75/271 (27%), Gaps = 23/271 (8%) Query: 134 LPNLKQNQQRQLTEYKTHRAGYTANGVPANISVVRSLQNSLARRTAMTAGKRRELHALEE 193 + L + Q + + A N + R +++ LAR A + + + Sbjct: 45 IEELGRLQASEEMDDDPTYAD-AFNSLRRTTEEQREVEHPLARHQAQGIERSADFMRMIP 103 Query: 194 NLAIISNSEPAQLLEEERLRKEIAELRAKIERVPFIDTFDLRYKNYEKRPDPSSQAVMFC 253 + A++ + L +L + L R ++D + + + + + Sbjct: 104 SEAMLRRRPGLKRLWHAKLAE--RGLLTYRVRGTYVDRVSTEVEEQQPQSKKRIRGPIIV 161 Query: 254 LMDVSGSMDQSTKDMAKRFYILLYLFLSRTYKNVEVVYIRHHTQAKEVDEH--------- 304 +D SGSM + +AK + + + V+ Sbjct: 162 CVDTSGSMSGRPEAVAKALTLEACRIAHAEQRPCL--LFSFSGSGQYVEHELSLSPDGLQ 219 Query: 305 ---EFFYSQETGGTIVSSALKLMDEVVKERYNPAQWNIY-AAQASDGDNWADDSPLCHEI 360 EF GGT +S+ + R A+W SDG ++ + Sbjct: 220 SLLEFLTMNFDGGTDISTPFEKAL----ARLRTAEWERADILLVSDGA-FSKSQVDALKP 274 Query: 361 LAKKLLPVVRYYSYIEITRRAHQTLWREYEH 391 + + Sbjct: 275 ALDDAKKRLGLRVSGLLVGNYSSGPMNSLCQ 305 >UniRef50_D2Q363 von Willebrand factor type A n=1 Tax=Kribbella flavida DSM 17836 RepID=D2Q363_9ACTO Length = 837 Score = 59.8 bits (143), Expect = 2e-07, Method: Composition-based stats. Identities = 54/354 (15%), Positives = 81/354 (22%), Gaps = 39/354 (11%) Query: 8 RLNGKNKSMVNRQRFLRRYKAQIKQSISEAINKRSVTDVDSGESVSIPTEDISEPMFHQG 67 R N + +++ R + I S G P D P Sbjct: 435 RRNPFDAPGLDQDR----LEQAISDSTPPDDPDDDPDGGAPGRPDGGPNSDGD-PSGDGS 489 Query: 68 RGGLRHRVHPGNDHFVQND---RIERPQGGGGGSGSGQGQASQDGEGQDEFVFQISKDEY 124 R PG D +R G D Sbjct: 490 PDDAR---DPGARDPDARDSDLTADRDANGSAPGDRAPDGGLSDSPADGNPQRVRGNGSA 546 Query: 125 LDLLFEDLALPNLKQNQQRQLTEYKTHRAGYTANGVPANISVVRS-LQNSLARRTAMTAG 183 + + AL + + ++ RS + R A Sbjct: 547 ENTGDPENALSGDAPTGDVATSSTPYKARLLSVQSAGPGVAGRRSRAITEVGRVVGDRAR 606 Query: 184 KRRELHALEENLAIISNSEPAQLLEEERLRKEIAELRAKIERVPFIDTFDLRYKNYEKRP 243 RE LA + + P Q + D+R R Sbjct: 607 VGRESGRPH-LLATLRAAAPHQHSRGRSG------------PGLAVHPTDVR---LGVRE 650 Query: 244 DPSSQAVMFCLMDVSGSMDQ-STKDMAKRFY-ILLYLFLSRTYKNVEVVYIRH------- 294 V+FC+ D SGSM K LL R K V + R Sbjct: 651 GREGNLVLFCV-DASGSMGTKRRMSEVKTAIVSLLLDAYQRRDKVGLVTFARSQATVALP 709 Query: 295 HTQAKEVDEHEFFYSQETGGTIVSSALKLMDEVVKERYNPAQWNIYAA-QASDG 347 T + E G T ++ L +V++ +DG Sbjct: 710 PTGSVETAVRRLESLPTGGRTPLAEGLVRAADVLRIAAIRDPRRRPLLVLVTDG 763 >UniRef50_Q6IND5 MGC83495 protein n=9 Tax=cellular organisms RepID=Q6IND5_XENLA Length = 861 Score = 59.8 bits (143), Expect = 2e-07, Method: Composition-based stats. Identities = 32/201 (15%), Positives = 60/201 (29%), Gaps = 29/201 (14%) Query: 236 YKNYEKRPDPSSQAVMFCLMDVSGSMDQ---------STKDMAKRFYILL---------- 276 Y N+ + + S+ LMD SGSM + D AK ILL Sbjct: 266 YPNFPETKEKSNFGEFIFLMDRSGSMTEQMTNEQNAPRRIDSAKETLILLLKSLPLGCYF 325 Query: 277 -YLFLSRTYKNVEVVYIRHHTQAKEVDEHEFFYSQET-GGTIVSSALKLMDEVVKERYNP 334 + ++ + + Q+ E GGT + LK + + +P Sbjct: 326 NIFSFGSDFTSIFSESMAYTQQSMEEAVKLVNQMDADMGGTEILEPLKKIYKTAGRPSHP 385 Query: 335 AQWNIYAAQASDGDNWADDSPLCHEILAKKLLPVVRYYSYIEITRRAHQTLWREYEHLQS 394 +DG+ + + ++ R +++ + L + S Sbjct: 386 ----RQLFVFTDGE-VGNTNF--VIDEVRRNAHNHRCFTFGIGEGAS-TALIKGLARAAS 437 Query: 395 TFDNFAMQHIRDQDDIYPVFR 415 F R Q + + Sbjct: 438 GTFEFITGKERMQPKVLQTLK 458 >UniRef50_C1RL29 von Willebrand factor type A-like protein n=1 Tax=Cellulomonas flavigena DSM 20109 RepID=C1RL29_9CELL Length = 425 Score = 59.8 bits (143), Expect = 2e-07, Method: Composition-based stats. Identities = 34/193 (17%), Positives = 60/193 (31%), Gaps = 25/193 (12%) Query: 252 FCLMDVSGSMDQSTKDMAKRFYILLYLFLSRT-----------------YKNVEVVYIRH 294 ++D SGSM AK + + Y N V ++ Sbjct: 45 IIMIDTSGSMTGPMLAAAKHAAQVAVDTIPDGTWFAIVSGSHVAQRVFPYPNAPVAIVQM 104 Query: 295 HTQAKEVDEHEFFYSQETGGTIVSSALKLMDEVVKERYNPAQWNIYAAQASDGDNWADDS 354 A+E + GGT +S+ L+L D++ PA +A +DG N ++ Sbjct: 105 EPGAREEAKRAVARLSAQGGTAMSTWLRLADQIFAT--QPAATQRHAILLTDGKNESEP- 161 Query: 355 PLCHEILAKKLLPVVRYYSYIEITRRAHQTLWREYEHLQSTFDNFAMQHIRDQDDIYPVF 414 + + + + + R RE ++ I D DI F Sbjct: 162 RAQLTSTIQAVTGRFQCDARG-VGERWQVDELREIATALLGG----VELIADPADIAKDF 216 Query: 415 RELFHKQNATAKG 427 + L + Sbjct: 217 QALLATSLSRGVA 229 >UniRef50_D1XJQ4 von Willebrand factor type A n=2 Tax=Streptomyces RepID=D1XJQ4_9ACTO Length = 624 Score = 59.8 bits (143), Expect = 2e-07, Method: Composition-based stats. Identities = 28/194 (14%), Positives = 54/194 (27%), Gaps = 28/194 (14%) Query: 247 SQAVMFCLMDVSGSMDQST------KDMAKRFYILLYLFLSRTYKNVEVVYIR------H 294 + + ++D SGSM + + A+R + L Y VY Sbjct: 25 AGGSLVMVLDSSGSMGEDDGTGSTRMESARRAVGAVVDALPDGYPTGLRVYGADRPQGCA 84 Query: 295 HTQ--------AKEVDEHEFFYSQETGGTIVSSALKLMDEVVKERYNPAQWNIYAAQASD 346 T+ + + + TG T + +L+ E + + A SD Sbjct: 85 DTRLVRPVRPLDRAAVKSAVAGVRPTGDTPIGLSLRKAAEDLPAPRDGAARTRTIVLVSD 144 Query: 347 GDNW-ADDSPLCHEILAKKLLPVVRYYSYIEITRRAHQTLWREYEHLQSTFDNFAMQHIR 405 G++ P +R + + A + + Sbjct: 145 GEDTCGTPPPCEVAARLAGQGAGLRIDTVGFQVKGAAREQLECVAEAGNGRYY------- 197 Query: 406 DQDDIYPVFRELFH 419 D D + R+L Sbjct: 198 DAPDADALARQLLR 211 >UniRef50_C5YBL3 Putative uncharacterized protein Sb06g000656 n=1 Tax=Sorghum bicolor RepID=C5YBL3_SORBI Length = 434 Score = 59.8 bits (143), Expect = 2e-07, Method: Composition-based stats. Identities = 31/209 (14%), Positives = 59/209 (28%), Gaps = 40/209 (19%) Query: 247 SQAVMFCLMDVSGSM----------DQSTKDMAKRFYILLYLFL--------SRTYKNVE 288 + + ++DVSGSM + ++AK L + + Sbjct: 10 APVDVVAVLDVSGSMAWDYGNGTTVENHRLELAKEAMAKAIQSLGPAAAAVAAGGARRNR 69 Query: 289 VVYIRHHTQAKEVD-------------EHEFFYSQETGGTIVSSALKLMDEVVKER-YNP 334 + + K+V ++ + G LK+ +++ ER Sbjct: 70 LAVVPFSNVVKQVTPLTEMDMEGQQTVKNAVDALKPGGQADYLMPLKIAAKILDERKAEE 129 Query: 335 AQWNIYAAQASDG-DNWADDSPLCHEILAKKLLPVVRYYSYIEITRRAHQTL-----WRE 388 SDG D++ D+ E L + L +++ + R Sbjct: 130 KDRLAIIIFVSDGQDHYFRDTDDMKETLTQHKLIKYPIHAFGVSVSEQDSSGGGAKALRA 189 Query: 389 YEHLQSTFDNFAMQHIRDQDDIYPVFREL 417 S Q D D V +L Sbjct: 190 MADATSGSYTSITQ--DDDVDTMAVAEKL 216 >UniRef50_UPI0001B52F00 von Willebrand factor type A n=1 Tax=Fusobacterium sp. D11 RepID=UPI0001B52F00 Length = 218 Score = 59.4 bits (142), Expect = 2e-07, Method: Composition-based stats. Identities = 27/168 (16%), Positives = 48/168 (28%), Gaps = 15/168 (8%) Query: 244 DPSSQAVMFCLMDVSGSMDQSTKDMAKRFYILLYLFLSRTYKN---VEVVYIRH------ 294 P + L D S SM + +++ + L + + +I Sbjct: 6 QPKKVLPLILLADTSSSMREWMREL-NTAIRDMLGTLKEQESLKAEIHISFITFGNGGAN 64 Query: 295 -HTQAKEVDEHEFFYSQETGGTIVSSALKLMDEVVKER--YNPAQWNIYAAQASDGDNWA 351 HT V EF E G T + AL++ E+V+ R + SDG Sbjct: 65 LHTALTPVSNIEFNDFTEGGMTPLGGALRIAKEMVENREIIPSKSYAPIILLLSDGAPND 124 Query: 352 DDSPLCHEILAKKLL-PVVRYYSYIEITRRAHQTLWREYEHLQSTFDN 398 + S I R + + + ++ Sbjct: 125 NGWENEMYRFINDGRSKKCMRMSLG-IGRDYDYDVLKGFSSNGEVYEA 171 >UniRef50_B0VJ58 BatA protein n=1 Tax=Candidatus Cloacamonas acidaminovorans RepID=B0VJ58_9BACT Length = 270 Score = 59.4 bits (142), Expect = 2e-07, Method: Composition-based stats. Identities = 31/223 (13%), Positives = 55/223 (24%), Gaps = 41/223 (18%) Query: 239 YEKRPDPSSQAVMFCLMDVSGS---MDQSTKDMAKRFYILLYLFL--SRTYKNVEVVYIR 293 + R + + +D+SGS MD + K+ + F+ + V + Sbjct: 15 IKTRDLSNKGVDIVMAIDISGSMLAMDFAPKNRLSAAVSVAKDFVKRRPNDRFGLVAFSE 74 Query: 294 HH------TQAKEVDEHEFFYS---QETGGTIVSSALKLMDEVVKERYNPAQWNIYAAQA 344 + T + +E T + L +K ++ Sbjct: 75 YALTQVPLTFDHLAMLNSLDKLKVNEEASATAIGMGLAKAVARLKNSTAKSK---VIILI 131 Query: 345 SDGD-NWADDSPLCHEILAKKLLPVVRYYSYIEIT----------------RRAHQTLWR 387 +DG N + PL +AK+L V Sbjct: 132 TDGVSNTGEIDPLTAAGMAKELGIKVYPIGVGSKGLVPFPYSDPIFGTRYINTYIDLDME 191 Query: 388 EYEHLQSTFDNFAMQHIRDQD---DIYPVF----RELFHKQNA 423 + T D DI + LF Q Sbjct: 192 TLNKIAETTGTGKAALATDAKGLADIMNEIDRLEKTLFTTQFR 234 >UniRef50_C3Y9U4 Putative uncharacterized protein n=1 Tax=Branchiostoma floridae RepID=C3Y9U4_BRAFL Length = 1317 Score = 59.4 bits (142), Expect = 2e-07, Method: Composition-based stats. Identities = 29/246 (11%), Positives = 68/246 (27%), Gaps = 29/246 (11%) Query: 198 ISNSEPAQLLEEERLRKEIAELRAKIERVPFIDTFDLRYKNYEKRPDPSSQAVMFCLMDV 257 +++ ++ +I I + R + + L+D Sbjct: 23 WDARGSDRIVNDDPSALQIGLDPDSPRSKVEILGEIFKKHVQRLRQTENQTVELVFLVDS 82 Query: 258 SGSMDQSTKDMAKRFYILLYLFLSRTYKNVEVVYIRHHTQAKEVD--------------- 302 S S+ + RF L + V + ++ K V+ Sbjct: 83 SASVGNENFNSELRFVKKLLADFTLAENAARVAIVTFSSRNKVVNHVDHLSKPSYHKHKC 142 Query: 303 ---EHEFFYSQ-ETGGTIVSSALKLMDEVVKERYNPAQWNIYAAQASDGDNWADDSPLCH 358 E E + GGT A+ EV++ A +DG + D Sbjct: 143 SLLEEELPRIKYAGGGTYTKGAMIKAQEVLRHARPNA--TKAVFLMTDGYSNGGDPLPEA 200 Query: 359 EILAKKLLPVVRYYSYIEITRRAHQTLWREYEHLQSTFDNFAMQHIRDQDDIYPVFRELF 418 L + V+ +++ + + + + ++ + + + + R Sbjct: 201 RKLKQN---DVQIFTFGIRSGNVKE--LQNMATDPAEEHSYFLDSFAEFEAL---ARRAL 252 Query: 419 HKQNAT 424 H+ Sbjct: 253 HEDLQG 258 >UniRef50_D0LJ27 von Willebrand factor type A n=1 Tax=Haliangium ochraceum DSM 14365 RepID=D0LJ27_HALO1 Length = 775 Score = 59.4 bits (142), Expect = 2e-07, Method: Composition-based stats. Identities = 23/194 (11%), Positives = 51/194 (26%), Gaps = 23/194 (11%) Query: 247 SQAVMFCLMDVSGSMDQSTKDMAKRFYILLYLFLSRTYKNVEVVY-----------IRHH 295 + + ++D SGSM + AK + L + + + + Sbjct: 263 ADKDVTLVLDRSGSMSGAPLARAKDAAKAVVARLGDGDRVNVMAFDDGVDALFLRPVPIS 322 Query: 296 TQAKEVDEHEFFYSQETGGTIVSSALKLMDEV---VKERYNPAQWNIYAAQASDGDNWAD 352 + + + GGT ++ AL + + + +DG Sbjct: 323 AERRSQAVEYIDRLSDGGGTDLAGALAEALDAQHPSESEADTGSRPHVILFLTDGQ---- 378 Query: 353 DSPLCHEILAKKLLPVVRYYSYIEITRRAHQTLWREYEHLQSTFDNFAMQHIRDQDDIYP 412 +A+ R ++ + L + F I +I Sbjct: 379 SDSQATLQVARGDAGDARVFTIGVGDG-VEKPLLARLASEKRGRFTF----IASPSEIER 433 Query: 413 VFRELFHKQNATAK 426 L+ + A Sbjct: 434 KVSRLYSEIAAPVL 447 >UniRef50_C3NM85 von Willebrand factor type A n=14 Tax=Sulfolobaceae RepID=C3NM85_SULIN Length = 452 Score = 59.4 bits (142), Expect = 2e-07, Method: Composition-based stats. Identities = 30/189 (15%), Positives = 66/189 (34%), Gaps = 31/189 (16%) Query: 241 KRPDPSSQAVMFCLMDVSGSMDQSTKDMAKRFYILLYLFLSRTYKNVEVVY--------- 291 ++ + ++ L+D SGSMD AK + LY R ++ + + Sbjct: 280 QKQIRETLGPIYLLLDKSGSMDGEKILWAKAVALALYSRAKRENRDFYLRFFDNIPYPLI 339 Query: 292 -IRHHTQAKEVDE--HEFFYSQETGGTIVSSALKLMDEVVKERYNPAQWNIYAAQASDGD 348 ++ + ++K++ + + GGT +S ++ E +KE + +DG+ Sbjct: 340 KVQKNAKSKDIIKMVEYIGKIRGGGGTDISRSIISACEDIKEGHVKGVSE--IILLTDGE 397 Query: 349 NWADDSPLCHEILAKKLLPVVR--YYSYIEITRRAHQTLWREYEHLQSTFDNFAMQHIRD 406 E ++ L S + L+ D + + + D Sbjct: 398 ------DKIAETTVRRSLKEANSQLISVMIRGDN---------ADLRRVSDEYLITYKLD 442 Query: 407 QDDIYPVFR 415 +D+ V Sbjct: 443 HEDLLKVVE 451 >UniRef50_B3QXN7 Putative uncharacterized protein n=1 Tax=Chloroherpeton thalassium ATCC 35110 RepID=B3QXN7_CHLT3 Length = 368 Score = 59.4 bits (142), Expect = 3e-07, Method: Composition-based stats. Identities = 34/307 (11%), Positives = 91/307 (29%), Gaps = 47/307 (15%) Query: 120 SKDEYLDLLFEDLALPNLKQNQQRQLTEYKTHRAGYTANGVPANISVVRSLQNSLARRTA 179 ++++ L ++ L + + QLT T + + + SL + ++ Sbjct: 61 GLGDFIEWLKKEGYLEDAAEKGSFQLTSKITKKI---------REDSLNQIFTSLKKDSS 111 Query: 180 MTAGKRRELHALEENLAIISNSEPAQLLEEERLRKEIAELRAKIERVPFIDTFDLRYKNY 239 + + K E P + + ++ + + Sbjct: 112 LGSHKIASPGRSVE---------PLPETRAWKFGDNLQQMDITSTINNSFKRNGIDDFSL 162 Query: 240 EKRPDP------SSQAVMFCLMDVSGSM---DQSTKDMAKRFYILLYLFLSRTYKNVEVV 290 + +Q ++DVS SM + AK + L + Y + Sbjct: 163 SEDDIEVYDNEHQTQCATVLMIDVSHSMILYGEDRITPAKTVALALSELILTRYPKDSLE 222 Query: 291 YIRHHTQAKEVDEHEFFYSQETG-GTIVSSALKLMDEVVKERYNPAQWNIYAAQASDG-- 347 + +A ++D E + T + L L ++++ + + N +DG Sbjct: 223 ILLFGDEAWQIDVKELPFISIGPYHTNTKAGLALARQLLRRKRSQ---NKQIFMITDGKP 279 Query: 348 -----------DNWADDSPLCHEILAKK---LLPVVRYYSYIEITRRAHQTLWREYEHLQ 393 +++ D + ++ L + + +++ + + E Sbjct: 280 SAINEGIKIYKNSFGLDRKIVNKTLDEAVICRKEKIVITTFMVTSDPYLKGFVEELTEAN 339 Query: 394 STFDNFA 400 F+ Sbjct: 340 QGRAYFS 346 >UniRef50_A0D8L3 Chromosome undetermined scaffold_41, whole genome shotgun sequence n=2 Tax=root RepID=A0D8L3_PARTE Length = 2732 Score = 59.4 bits (142), Expect = 3e-07, Method: Composition-based stats. Identities = 23/148 (15%), Positives = 45/148 (30%), Gaps = 11/148 (7%) Query: 237 KNYEKRPDPSSQAVMFCLMDVSGSMDQSTKDMAKRFYILLYLFL-SRTYKNVEVVYIRHH 295 + E+ ++D SGSM D AK I + K+ V I + Sbjct: 2529 FDKEQLQRAPKAIHYVIMLDDSGSMQGEKFDNAKAGIIAFLAEIHKMKNKDSRVTIIIFN 2588 Query: 296 TQAK-EVDEHEFFYSQET------G-GTIVSSALKLMDE-VVKERYNPAQWNIYAAQASD 346 A+ VD ++ G GT E +++ +D Sbjct: 2589 DLARVVVDSEVIDSKEQEQKITFKGLGTNFDEPFTKAYEKIIQRPDFDKFHQHSMFFYTD 2648 Query: 347 GD-NWADDSPLCHEILAKKLLPVVRYYS 373 G ++ + + L+ + + + Sbjct: 2649 GQADYPQTALGLFDQLSPEQKQKIELVA 2676 >UniRef50_C5D5J8 Ig domain protein group 2 domain protein n=2 Tax=Geobacillus RepID=C5D5J8_GEOSW Length = 942 Score = 59.4 bits (142), Expect = 3e-07, Method: Composition-based stats. Identities = 21/155 (13%), Positives = 39/155 (25%), Gaps = 26/155 (16%) Query: 213 RKEIAELRAK-IERVPFIDTFDLRYKNYEKRPDPSSQAVMFCLMDVSGSMDQSTKDMAKR 271 ++ + + +D + + P V DVSGSM AK Sbjct: 45 SSQLEYAKPPNGDAQGRLDVTLIPKGRVDNIVRPPIDVVFVF--DVSGSMTPLKLQSAKY 102 Query: 272 FYILLYLFLS----RTYKNVEVVY---------IRHHTQA------KEVDEHEFFYSQET 312 + + + + + T A E + + Sbjct: 103 ALQSAVDYFKANANPNDRFALIPFSSDVQYNKVVPFPTGAYDVKQHLERIANVANDLRAY 162 Query: 313 GGTIVSSALKLMDEVVKERYNPAQWNIYAAQASDG 347 GGT + + +N Y +DG Sbjct: 163 GGTN----YTQSLQQAQSFFNDPTRKKYIIFLTDG 193 >UniRef50_C7NQ34 von Willebrand factor type A n=1 Tax=Halorhabdus utahensis DSM 12940 RepID=C7NQ34_HALUD Length = 1100 Score = 59.4 bits (142), Expect = 3e-07, Method: Composition-based stats. Identities = 22/173 (12%), Positives = 41/173 (23%), Gaps = 15/173 (8%) Query: 251 MFCLMDVSGSMDQSTKDMAKRFYILLYLFLSRTYKNVEVVYIRHHTQAKEVDEH------ 304 + ++D SGSM + AK L + V + T + + Sbjct: 515 LAFVIDESGSMGGARIQDAKASAKRFVGGLYEDDRAALVSFAGGATLGQSLTTDHGAVNA 574 Query: 305 EFFYSQETGGTIVSSALKLMDEVVKERYNPAQWNIYAAQASDGDNWADDSPLCHEILAKK 364 GGT + L+ + + A DG P+ A + Sbjct: 575 SIDQLNAGGGTNTGAGLQKAVDELTSN-GEGDTQEIILLA-DGGTGLGPDPVTIAQTADE 632 Query: 365 LLPVVRYYSYIEITRRAHQTLWREYEHLQSTFDNFAMQHIRDQDDIYPVFREL 417 + + D ++ VF + Sbjct: 633 HRITINTIGMGTGI---DAQELTSIADATGGEFY----QVSDSSELPEVFDRV 678 >UniRef50_UPI00016E1D58 UPI00016E1D58 related cluster n=1 Tax=Takifugu rubripes RepID=UPI00016E1D58 Length = 451 Score = 59.4 bits (142), Expect = 3e-07, Method: Composition-based stats. Identities = 26/191 (13%), Positives = 52/191 (27%), Gaps = 26/191 (13%) Query: 251 MFCLMDVSGSMDQSTKDMAKRFYILLYLFLS---RTYKNVEVVYIRHHTQAKEVDEHEFF 307 + ++D SGS+ + + + F L L + V+Y +V + F Sbjct: 2 IVFIIDESGSIGSANFQLMRSFLHSLISGLQVASNRVRVGIVMYNVEPMA--QVFLNTFK 59 Query: 308 YSQE-----------TGGTIVSSALKLMDE---VVKERYNPA-QWNIYAAQASDGDNWAD 352 E GGT +AL + + + A +DG Sbjct: 60 DKSELLDFIKILPYHGGGTNTGAALNFTLQEVFIKQRGSRKDLGVQQVAVVITDG----K 115 Query: 353 DSPLCHEILAKKLLPVVRYYSYIEITRRAHQTLWREYEHLQSTFDNFAMQHIRDQDDIYP 412 A V Y+ A + + + F + + Sbjct: 116 SQDEVSSPAANLRRAGVTVYAVGVK--DADKAQLDQIASYPTNKHTFIIDSFTKLKTLEA 173 Query: 413 VFRELFHKQNA 423 + + + N+ Sbjct: 174 SLQRILCQNNS 184 >UniRef50_P58335-2 Isoform 2 of Anthrax toxin receptor 2 n=3 Tax=Homininae RepID=P58335-2 Length = 386 Score = 59.4 bits (142), Expect = 3e-07, Method: Composition-based stats. Identities = 24/174 (13%), Positives = 51/174 (29%), Gaps = 16/174 (9%) Query: 236 YKNYEKRPDPSSQAV--MFCLMDVSGSMDQSTKDMAKRFYILLYLFLSRTYKNVEVVYIR 293 + PS + ++ ++D SGS+ + ++ L F+S + +V+ Sbjct: 28 GGLLRAQEQPSCRRAFDLYFVLDKSGSVANNWIEIYNFVQQLAERFVSPEMRLSFIVFSS 87 Query: 294 HHTQAKEVD---------EHEFFYSQETGGTIVSSALKLMDEVVKERYNPAQWNIYAAQA 344 T + + G T + LKL +E +++ + + Sbjct: 88 QATIILPLTGDRGKISKGLEDLKRVSPVGETYIHEGLKLANEQIQKA-GGLKTSSIIIAL 146 Query: 345 SDGDNWA-DDSPLCHEILAKKLLPVVRYYSYIEITRRAHQTLWREYEHLQSTFD 397 +DG S E + L Y + Q + Sbjct: 147 TDGKLDGLVPSYAEKEAKISRSL-GASVYCVGVL--DFEQAQLERIADSKEQVF 197 >UniRef50_B7Q438 Neurogenic locus notch, putative n=1 Tax=Ixodes scapularis RepID=B7Q438_IXOSC Length = 1597 Score = 59.4 bits (142), Expect = 3e-07, Method: Composition-based stats. Identities = 27/203 (13%), Positives = 62/203 (30%), Gaps = 23/203 (11%) Query: 229 IDTFDLRYKNYEKRPDPSSQAVMFCLMDVSGSMDQS--TKDMAKRFYILLYLFLSRTYKN 286 + DL+ +++ + ++D SGS+ ++A + L +S Y Sbjct: 23 VLDADLQLFVSTIERYSTTKNDIVFVLDESGSIGADVFPAELAFTEMVARLLVVSPEYSR 82 Query: 287 VEVVYIRHHTQAK--------EVDEHEFFY-SQE----TGGTIVSSALKLMDEVVKERYN 333 + V+ + + + +F + + +GGT AL E++ Sbjct: 83 LTVMTFSNDNLVHIDQVGSSGDTNMCKFVHELNQIPYRSGGTRTREALGYAGEILWNARQ 142 Query: 334 PAQWNIYAAQASDGDNWADDSPLCHEILAKKLLPVVRYYSYIEITRRAHQTLWREYEHLQ 393 A N SDG + P L + + + + Sbjct: 143 EA--NRIVVLISDGQANSGSEPSEIARLLRVK--GIVIFGVGVAHINKD----ELLDVAS 194 Query: 394 STFDNFAMQHIRDQDDIYPVFRE 416 S + +++ + R+ Sbjct: 195 SPAHTYMLRNFEYIKKVNKDLRK 217 >UniRef50_A9UIA7 Hedgling (Fragment) n=4 Tax=Nematostella vectensis RepID=A9UIA7_NEMVE Length = 3480 Score = 59.4 bits (142), Expect = 3e-07, Method: Composition-based stats. Identities = 27/180 (15%), Positives = 54/180 (30%), Gaps = 22/180 (12%) Query: 246 SSQAVMFCLMDVSGSMDQSTKDMAKRFYILLYLFLSRTYKNVEVVYIRHHTQAK-EVDEH 304 + + ++D SGS+ + K F + F + K V I + T A+ E + Sbjct: 175 QTSVDLVFILDTSGSVGSYNFEKMKTFVKNVVDFFNIGPKGTHVAVITYSTWAQVEFNLK 234 Query: 305 EFFYSQE------------TGGTIVSSALKLM----DEVVKERYNPAQWNIYAAQASDGD 348 S+ +G T + AL L +V A +DG Sbjct: 235 AHHSSKAALKNAVNAIYYRSGWTYTADALDLAGRNIFQVANGMRPDKGIPKIAVLLTDGY 294 Query: 349 NWADDSPLCHEILAKKLLPVVRYYSYIEITRRAHQTLWREYEHLQSTFDNFAMQHIRDQD 408 + ++ L V + ++ + F +++ D + Sbjct: 295 SNGNNPLGPANDL---RAAGVNVFCVGI--GNYYERELNDIATDPDKDHVFKLENFNDLN 349 >UniRef50_A9AV55 von Willebrand factor type A n=2 Tax=Bacteria RepID=A9AV55_HERA2 Length = 222 Score = 59.4 bits (142), Expect = 3e-07, Method: Composition-based stats. Identities = 26/163 (15%), Positives = 60/163 (36%), Gaps = 21/163 (12%) Query: 246 SSQAVMFCLMDVSGSMDQSTKDMAKRFYILLYLFLSRTYK---NVEVVYIRHHTQA---K 299 + + ++D SGSM D + + + +S ++ +E+ + ++QA Sbjct: 12 EQKCLCILVVDTSGSMQGRPIDELNQGLQVFHQDISNSFSTAQRLEICLVEFNSQADCIV 71 Query: 300 EVDEHEFFY---SQETGGTIVSSALKLMDEVVKER------YNPAQWNIYAAQASDGDNW 350 E + F+ G T + ++L V+ER + + +DG+ Sbjct: 72 EPSLVDQFHMPILAVAGTTKLVDGVRLAIHKVQERKSWYRSTGQPYYRPWIILMTDGE-- 129 Query: 351 ADDSPLCHEILAKKLLPVVR---YYSYIEITRRAHQTLWREYE 390 DS LA+++ V + + + A + ++ Sbjct: 130 -PDSDQDVAGLAREIQHGVNNKQFVFFPIGVQGADMRMLQQIS 171 >UniRef50_Q1AYC2 Protoporphyrin IX magnesium-chelatase n=15 Tax=Bacteria RepID=Q1AYC2_RUBXD Length = 616 Score = 59.0 bits (141), Expect = 3e-07, Method: Composition-based stats. Identities = 28/148 (18%), Positives = 50/148 (33%), Gaps = 14/148 (9%) Query: 229 IDTFDLRYKNYEKRPDPSSQAVMFCLMDVSGSMDQSTKDMAKRFY--ILLYLFLSRTYKN 286 + DLR K E R ++ ++D SGSM ++ A + LL R + Sbjct: 432 LRPEDLREKVREGRE----GNLLVLVVDSSGSMAARSRMSAVKGAVRALLEDAYRRRDRA 487 Query: 287 VEVVYIRH-------HTQAKEVDEHEFFYSQETGGTIVSSALKLMDE-VVKERYNPAQWN 338 + + E G T +++ L+L E V++E + Sbjct: 488 AVISFRGEEARLLVPPASGVEAAAARLEELPTGGRTPLAAGLELAAETVLREASREPERR 547 Query: 339 IYAAQASDGDNWADDSPLCHEILAKKLL 366 +DG A + PL ++ Sbjct: 548 PLLVVITDGRATAGEDPLAAARRLRERG 575 >UniRef50_D1NA79 von Willebrand factor type A n=1 Tax=Victivallis vadensis ATCC BAA-548 RepID=D1NA79_9BACT Length = 232 Score = 59.0 bits (141), Expect = 3e-07, Method: Composition-based stats. Identities = 28/180 (15%), Positives = 46/180 (25%), Gaps = 38/180 (21%) Query: 247 SQAVMFCLMDVSGSMD------QSTKDMAKRFYILLY-------LFLSRTYKNVEVVYIR 293 ++ + L+DVSGSM S D+ KR + + Sbjct: 58 AEKRVVFLVDVSGSMGAVTPEGGSRLDVMKRELRRAVGSAVASANRIGAPKEAGNFRVWA 117 Query: 294 HHT----------------QAKEVDEHEFFYSQETGGTIVSSALKLMDEVVKERYNPAQW 337 + A E G T + A + + E+ K Sbjct: 118 FSSGLQLFPDLEPCGFRDRSAVERLNRFVGALGAGGSTNMLMAWRKILELTKH-----GQ 172 Query: 338 NIYAAQASDGDNWADDSPLCHEILAKKLLPVVRYYSYIEITRRAHQTLWREYEHLQSTFD 397 SDGD + L +L V + + +L RE + Sbjct: 173 LDTVYFLSDGDPSDCSAE-ELTRLLTRLPKDVTVHCFAIG---LDSSLLREIAAAHNGNY 228 >UniRef50_D1TTW6 von Willebrand factor type A domain protein n=10 Tax=Enterobacteriaceae RepID=D1TTW6_YERPE Length = 327 Score = 59.0 bits (141), Expect = 3e-07, Method: Composition-based stats. Identities = 25/161 (15%), Positives = 45/161 (27%), Gaps = 17/161 (10%) Query: 247 SQAVMFCLMDVSGSMDQSTKDMAKRFYILLYLFLSRTYKNVE---VVYIRHHTQAK---- 299 + +F ++D S SM ++ L + +E + I AK Sbjct: 2 RRLPIFFVLDCSESMIGENLKKMNDGLQMIINDLKKDPHALETAWISVIAFAGVAKTIVP 61 Query: 300 --EVDEHEFFYSQETGGTIVSSALKLMDEVVK------ERYNPAQWNIYAAQASDGDNWA 351 EV GGT + +AL+ + + W +DG Sbjct: 62 LVEVVSFYPPRLPIGGGTSLGAALQELTRQIDTQVRKTTEERKGDWKPVVYLLTDGRPT- 120 Query: 352 DDSPLCHEILAKKLLPVVRYYSYIEITRRAHQTLWREYEHL 392 DD+ V + + A + R+ Sbjct: 121 DDTTAEITRWKTHYARKVNLIAIG-LGPSADLNILRQLTEN 160 >UniRef50_Q1IGT5 Surface adhesion protein n=1 Tax=Pseudomonas entomophila L48 RepID=Q1IGT5_PSEE4 Length = 5862 Score = 59.0 bits (141), Expect = 3e-07, Method: Composition-based stats. Identities = 24/123 (19%), Positives = 40/123 (32%), Gaps = 22/123 (17%) Query: 245 PSSQAVMFCLMDVSGSMDQSTKDMAKRFYILLYLFLSRTYK-----NVEVVYIRHHTQAK 299 P + ++D SGSM S D AK+ ++ L+ + K V ++ + TQ K Sbjct: 5332 PGQNYNLAFIVDTSGSMGSSGVDAAKKSLESVFKTLAASVKGDQSGTVNILLVDFATQVK 5391 Query: 300 ------------EVDEHEFFYSQETGGTIVSSALKLMDEVVKERYNPAQWNI---YAAQA 344 + + + GGT A K + N Sbjct: 5392 SSVAVTLNDAGLQTLLNALNNLRADGGTNYEDAFKTTANWFQN--LKDGGNTGSNQTFFI 5449 Query: 345 SDG 347 +DG Sbjct: 5450 TDG 5452 >UniRef50_D1ZZE6 Putative uncharacterized protein GLEAN_08029 n=1 Tax=Tribolium castaneum RepID=D1ZZE6_TRICA Length = 1868 Score = 59.0 bits (141), Expect = 3e-07, Method: Composition-based stats. Identities = 21/189 (11%), Positives = 57/189 (30%), Gaps = 24/189 (12%) Query: 244 DPSSQAVMFCLMDVSGSMDQSTKDMAKRFYILLYLFLSRTYKNVEVVYIRHHTQAKEVDE 303 + + + L+D S S+ ++ +F L ++ Y + V + + + Sbjct: 73 KNTQKLELIFLIDGSSSVGETNFRSELKFVKKLLSDVTVDYNHTRVAIATFSSSVSKNID 132 Query: 304 HEFFYSQE-----------------TGGTIVSSALKLMDEVVKERYNPAQWNIYAAQASD 346 +E GGT A ++ E+ + N ++ +D Sbjct: 133 QISDPRKENNKCFLLSKLLSKIEYTGGGTNTLKAFEVAKEIFTQSRNDSE--KVLFLITD 190 Query: 347 GDNWADDSPLCHEILAKKLLPVVRYYSYIEITRRAHQTLWREYEHLQSTFDNFAMQHIRD 406 G + D + A+ V+ ++ + E ++ + + Sbjct: 191 GFSNGGD---PIPLAAELKKDQVKIFTIGIANGNYKE--LYELASTPGEIYSYLLDSFEE 245 Query: 407 QDDIYPVFR 415 + + + Sbjct: 246 FESLARHLK 254 >UniRef50_A3JLW1 Putative uncharacterized protein n=1 Tax=Rhodobacterales bacterium HTCC2150 RepID=A3JLW1_9RHOB Length = 354 Score = 59.0 bits (141), Expect = 3e-07, Method: Composition-based stats. Identities = 21/152 (13%), Positives = 37/152 (24%), Gaps = 18/152 (11%) Query: 251 MFCLMDVSGSMDQSTKDMAKRFYILLYLFLSRTYKNVEVVYIR---HHTQAKE------V 301 L+D SGSM + + L T + + + E Sbjct: 168 FVLLVDRSGSMAEI-MPEVREAAKEFVAALPDTAECSVSSFAGDWDFSHRGPEGALTCKP 226 Query: 302 DEHEFFYSQETGGTIVSSALKLMDEVVKERYNPAQWNIYAAQASDG-DNWADDSPLCHEI 360 + F Q G T + L+ + E +DG N S Sbjct: 227 ENFAFDNIQPGGTTNIYGPLREAYGWLSESERTDHQKAVIL-LTDGRANDDAASESQTLA 285 Query: 361 LAKKLLPVVRYYSYIEITRRAHQTLWREYEHL 392 + Y+++ + R Sbjct: 286 MKDDA------YTFVYYMGDSDDRWLRSLADN 311 >UniRef50_Q6VUC2 Putative uncharacterized protein n=1 Tax=Antonospora locustae RepID=Q6VUC2_ANTLO Length = 824 Score = 59.0 bits (141), Expect = 4e-07, Method: Composition-based stats. Identities = 30/240 (12%), Positives = 71/240 (29%), Gaps = 34/240 (14%) Query: 190 ALEENLAIISNSEPAQLLEEERLRKEIAELRAKIERVPFIDTFDLRYKNYEKRPDPSSQA 249 + + ++ E+ R + R ID ++ + + +A Sbjct: 224 DDHDVKNRVYTIAESKRDEDIVFRMRMEREERVSARHFEIDNHNVVELTIVPKMEVLRRA 283 Query: 250 VM-----FCLMDVSGSM-----DQSTKDMAKRFYILLYLFLSRTYKNVEVVYIRH----- 294 V+ ++D SGSM + + + +V VV+ H Sbjct: 284 VLCSREIILVVDKSGSMGWKCGGTVPHKLVSEAIEVFMSTVCDLNIHVNVVFFDHECCES 343 Query: 295 -----------HTQAKEVDEHEFFYSQET--GGTIVSSALKLMDEVVKERYNPAQWNIYA 341 ++ +D+ E+ + GGT + + L+ ++ K Sbjct: 344 DVLFRGCSERLTSEQGLLDKKEWVREKAGPRGGTCIVAGLQRAVDL-KPAAEDGSIRRNI 402 Query: 342 AQASDGDNWADDSPLCHEILAKKL-LPVVRYYSYIEITRRAHQTLWREYEHLQSTFDNFA 400 +DG D + L ++ R+++ ++ T+ +F Sbjct: 403 ILLTDG---GDSNLREITSLVQREAAKGTRFFAIGIGNGVSYDTVME-VARAGRGTHDFI 458 >UniRef50_D1JHM1 Putitive magnesium-chelatase subunit n=2 Tax=uncultured archaeon RepID=D1JHM1_9ARCH Length = 705 Score = 59.0 bits (141), Expect = 4e-07, Method: Composition-based stats. Identities = 47/319 (14%), Positives = 82/319 (25%), Gaps = 36/319 (11%) Query: 133 ALPNLKQNQQRQLTEYKTHRAGYTAN-GVPANISVVRSLQNS---LARRTAMTAGKRREL 188 P+ Q + + N G P + VR Q R+T+ L Sbjct: 386 EQPDKNPEGQDDKDDVNESKNESVFNIGDPIDTRAVRQKQKKDKTYRRKTSGRRIPTLSL 445 Query: 189 HALEENLAIISNSEPAQLLEEERLRKEIAELRAKIERVPFIDTFDLRYKNYEKRPD---- 244 N I + P + + L I + + K+ + R Sbjct: 446 RN---NGKYIRHGIPKGKITDVALDATIRAAAVYQKERVVDSDLAVVIKSQDIREKIRVG 502 Query: 245 PSSQAVMFCLMDVSGSMD-QSTKDMAKRF-YILLYLFLSRTYKNVEVVYIRHHTQAK--- 299 S A M ++D SGSM + AK LL + K V + Sbjct: 503 KISTATM-FVVDASGSMGANRRMESAKGAVLSLLLDSYQQRDKVGMVAFKGDQADVLLPL 561 Query: 300 ----EVDEHEFFYSQETGGTIVSSALKLMDEV--VKERYNPAQWNIYAAQASDG-DNWAD 352 ++ G T +++ L+ + ++ + SDG N + Sbjct: 562 CSSSDLAVERLRELPTGGRTPLAAGLEQGLNLLMAEKHRDEEAI-PILLLISDGRANVSA 620 Query: 353 DSPLCHEILAKKLLPVVR---YYSYIEITRRAHQTLWR-------EYEHLQSTFDNFAMQ 402 E L R Y + T + + + Sbjct: 621 GGSKELEQELLALAEQARAKGIYVIVIDTEIVSDSFIQMQLGYCRAIANYSGGKYYPIAD 680 Query: 403 HIRDQ-DDIYPVFRELFHK 420 DI R + + Sbjct: 681 LTSGAVRDIVISERNMLND 699 >UniRef50_Q2JD81 von Willebrand factor, type A n=4 Tax=Frankineae RepID=Q2JD81_FRASC Length = 319 Score = 59.0 bits (141), Expect = 4e-07, Method: Composition-based stats. Identities = 25/188 (13%), Positives = 46/188 (24%), Gaps = 30/188 (15%) Query: 240 EKRPDPSSQAVMFCLMDVSGSMDQSTK-----DMAKRFYILLYLFLSRTYKNVEVVYIRH 294 P +A + +DVS SM + AK+ L V + Sbjct: 78 RAERVPRERATIILAIDVSNSMAATDIAPTRLAAAKQGASAFVDQLPPRINLGLVSFAGS 137 Query: 295 HTQ------AKEVDEHEFFYSQETGGTIVSSAL---KLMDEVVKERYNPAQWN---IYAA 342 T +E Q T V + +R++ + Sbjct: 138 ATVLVPASADRESVRAGIRGLQLGPATAVGEGIFASLQAITTAGKRFSDTGQSAPPAAIV 197 Query: 343 QASDGDNW-ADDSPLCHEILAKKLLPVVRYYSYIEITRRA-----------HQTLWREYE 390 SDG+ + E A++ V +Y ++ R+ Sbjct: 198 LLSDGETTRGRPNNQAIEA-ARQARIPVDTIAYGTADGTLDVGGQEVPVPVNEQALRDIA 256 Query: 391 HLQSTFDN 398 + Sbjct: 257 EQTGGSYH 264 >UniRef50_Q30SV4 von Willebrand factor, type A n=2 Tax=Campylobacterales RepID=Q30SV4_SULDN Length = 307 Score = 59.0 bits (141), Expect = 4e-07, Method: Composition-based stats. Identities = 33/203 (16%), Positives = 60/203 (29%), Gaps = 31/203 (15%) Query: 240 EKRPDPSSQAVMFCLMDVSGSMDQ---------------STKDMAKRFYILLYLFLSRTY 284 P + + +D SGSM+ S ++AK + Sbjct: 73 RVDPLNRNGKDIVLAIDASGSMNSTGFDFEGEAALPQKLSRFEIAKIVASEFIQK-RLSD 131 Query: 285 KNVEVVY------IRHHTQAKEVDEHEFFYSQ---ETGGTIVSSALKLMDEVVKERYNPA 335 V+Y T K + Y T + A+ + K + Sbjct: 132 NVGIVLYGDFAFIASPITYEKNIIIEMLSYLNQGMAGQNTAIGEAIAMSLRAFKHSKAKS 191 Query: 336 QWNIYAAQASDGD-NWADDSPLCHEILAKKLLPVVRYYSYIEITRRAHQTLWREYEHLQS 394 + +DG+ N D SP +LAK+ + A + L ++ Sbjct: 192 K---IVVLLTDGEHNSGDISPKDALVLAKEENIKIYTIGMGN-RGEADEALLKKIADESG 247 Query: 395 TFDNFAMQHIRDQDDIYPVFREL 417 +A + ++ +IY EL Sbjct: 248 GEFFYA-TNAKELKEIYEHIDEL 269 >UniRef50_A0C946 Chromosome undetermined scaffold_16, whole genome shotgun sequence n=1 Tax=Paramecium tetraurelia RepID=A0C946_PARTE Length = 1279 Score = 59.0 bits (141), Expect = 4e-07, Method: Composition-based stats. Identities = 27/173 (15%), Positives = 55/173 (31%), Gaps = 15/173 (8%) Query: 235 RYKNYEKRPDPSSQAVMFCLMDVSGSMDQSTKDMAKRFYILLYLFLSRTYKNVEVVYIRH 294 R+ E S ++D SGSM + ++ L Y N + I+ Sbjct: 1085 RFFQKELEQWNQSH-HFILVIDESGSMAGQKWKILMEAIQQCFIELR-KYPNNRISLIQF 1142 Query: 295 HTQAKEV----DEHEFFYSQE-------TGGTIVSSALKLMDEVVKERYNPAQWNIYAAQ 343 A+ V E ++ +GGT +A L+ ++ + + Sbjct: 1143 SDDARFVVGSNQPEEIPQQEQVKQFQMMSGGTNFENAFLLVFYAIQRCISQFDFQTVVFY 1202 Query: 344 ASDGD-NWADDSPLCHEILAKKLLPVVRYYSYIEITRRAHQTLWREYEHLQST 395 +DGD ++ + S + ++L + EI + + Sbjct: 1203 -TDGDADYPNVSMDLFAQVNQELRQKIDILICTEIKESKSLQKVCQVFQQKMG 1254 >UniRef50_B0S9S4 Putative uncharacterized protein n=2 Tax=Leptospira biflexa serovar Patoc RepID=B0S9S4_LEPBA Length = 373 Score = 59.0 bits (141), Expect = 4e-07, Method: Composition-based stats. Identities = 31/209 (14%), Positives = 57/209 (27%), Gaps = 26/209 (12%) Query: 229 IDTFDLRYKNYEKRPDPSSQAVMFCLMDVSGSMDQS-----TKDMAKRFYILLYLFLSRT 283 + L + + P + ++D SGSM + +AK I + L + Sbjct: 19 LIPVVLFFLHLSLLPQSNHNKRYVFILDASGSMSEKWDGKTRMAVAKEKLIQVLGGLPKD 78 Query: 284 YKNVEVVY---IRHHTQAK----------EVDEHEFFYSQETGGTIVSSALKLMDEVVKE 330 V Y I A+ + + G T ++ L +VV E Sbjct: 79 ASVGLVAYGNRIAGCQSARLYHPIQKGGASIVSQKLTTIVPAGSTPIAQTL----QVVGE 134 Query: 331 RYNPAQWNIYAAQASDGDNWADDSPLCHEILAKKLLPVVRYYSYIEITRRAHQTLWREYE 390 Q SDG + P ++ R + + Sbjct: 135 YLLSDQLETEIIFISDGVESCEGDPKSVLYNLRQSGKKFRLQILGIDIDPKGEEDLKRLS 194 Query: 391 HLQSTFDNFAMQHIRDQDDIYPVFRELFH 419 L ++ +D F+ +F Sbjct: 195 ILGDGNY----FPLKTPEDYDRSFQRIFA 219 >UniRef50_A1ANG2 Protoporphyrin IX magnesium-chelatase n=6 Tax=Bacteria RepID=A1ANG2_PELPD Length = 689 Score = 58.6 bits (140), Expect = 4e-07, Method: Composition-based stats. Identities = 23/129 (17%), Positives = 43/129 (33%), Gaps = 14/129 (10%) Query: 229 IDTFDLRYKNYEKRPDPSSQAVMFCLMDVSGSMDQSTKDMAKR--FYILLYLFLSRTYKN 286 + D R K EKR + ++D SGSM + A + LL + + Sbjct: 487 LSDRDFRGKVREKR----VGNFLLFVVDASGSMGARGRMAASKGAVMSLLLDAYQKRDRV 542 Query: 287 VEVVYIRH-------HTQAKEVDEHEFFYSQETGGTIVSSALKLMDEVVKERYNPAQWNI 339 + + ++ T + E+ G T +S+A+ E ++ Sbjct: 543 GMISFRKNEAFVNLPPTTSVELAGKLLEEMPVGGRTPLSAAIAKSYEQLRGVLGRDPTAR 602 Query: 340 -YAAQASDG 347 +DG Sbjct: 603 PIVIFITDG 611 >UniRef50_Q7Z3S7 Voltage-dependent calcium channel subunit delta-4 n=102 Tax=Eumetazoa RepID=CA2D4_HUMAN Length = 1137 Score = 58.6 bits (140), Expect = 4e-07, Method: Composition-based stats. Identities = 49/340 (14%), Positives = 102/340 (30%), Gaps = 29/340 (8%) Query: 104 QASQDGEGQDEFVFQISKDEYLDLLF-EDLALPNLKQNQQRQLTEYKTHRAGYTANGVPA 162 +A+++ + EF + D Y +L E N + L E H + N + Sbjct: 146 EAAEEADLNHEFNESLVFDYYNSVLINERDEKGNFVELGAEFLLESNAHFSNLPVNTSIS 205 Query: 163 NISVVRSLQNSLARRTAMTAGKRR-ELHALEENLAIISNSEPAQLLEEERLRKEIAELRA 221 ++ + ++ N +E + + R Sbjct: 206 SVQLPTNVYNKDPDILNGVYMSEALNAVFVENFQRDPTLTWQYFGSATGFFRIYPGIKWT 265 Query: 222 KIERVPFIDTFDLRYKNYEKRPDPSSQAVMFCLMDVSGSMDQSTKDMAKRFYILLYLFLS 281 E + TFD R + + +S + L+DVSGSM +AK + L Sbjct: 266 PDENG--VITFDCRNRGW-YIQAATSPKDIVILVDVSGSMKGLRMTIAKHTITTILDTLG 322 Query: 282 RTYKNVEVVY---------------IRHHTQAKEVDEHEFFYSQETGGTIVSSALKLMDE 326 + Y ++ +E + G +V AL+ + Sbjct: 323 ENDFVNIIAYNDYVHYIEPCFKGILVQADRDNREHFKLLVEELMVKGVGVVDQALREAFQ 382 Query: 327 VVKERYNPAQW----NIYAAQASDGDNWADDSPLCHEILAKKLLPVVRYYSYIEITRRAH 382 ++K+ + A+ N SDG D P+ + VR ++Y+ + Sbjct: 383 ILKQ-FQEAKQGSLCNQAIMLISDGA-VEDYEPVFEKYNWPDC--KVRVFTYLIGREVSF 438 Query: 383 QTLWREYEHLQSTFDNFAMQHIRDQDDIYPVFRELFHKQN 422 + + + + D + + + + Sbjct: 439 ADRMKWIACNNKGYYT-QISTLADTQENVMEYLHVLSRPM 477 >UniRef50_C3Y4Z7 Putative uncharacterized protein n=1 Tax=Branchiostoma floridae RepID=C3Y4Z7_BRAFL Length = 1236 Score = 58.6 bits (140), Expect = 4e-07, Method: Composition-based stats. Identities = 21/151 (13%), Positives = 39/151 (25%), Gaps = 19/151 (12%) Query: 237 KNYEKRPDPSSQAVMFCLMDVSGSMDQS----TKDMAKR---FYILLYLFLSRTYKNVEV 289 K+ E + ++D S SMD + +R F L+ + V Sbjct: 618 KSLEASGHQRTPLRFVAVIDESYSMDDRIGRDKLTLIQRMQIFAELMAKDFKDEDQMGIV 677 Query: 290 VYI----------RHHTQAKEVDEHEFFYSQETGGTIVSSALKLMDEVVKERYNPAQWNI 339 + R + ++ + G T +S L + K N Sbjct: 678 TFANDAKVVLPMTRMDSSGRDSALEKIQNISTRGQTNLSDGLLSAISMFKGSSGSDFHNG 737 Query: 340 YAAQASDGD-NWADDSPLCHEILAKKLLPVV 369 +DG N + + Sbjct: 738 IIL-FTDGQANQGIIDAAELVQEYNSKMAGL 767 >UniRef50_UPI00006CC94A von Willebrand factor type A domain containing protein n=1 Tax=Tetrahymena thermophila RepID=UPI00006CC94A Length = 901 Score = 58.6 bits (140), Expect = 4e-07, Method: Composition-based stats. Identities = 28/194 (14%), Positives = 58/194 (29%), Gaps = 23/194 (11%) Query: 245 PSSQAVMFCLMDVSGSMDQSTKDMAKRFYILLYLFLSRTYKNVEVVYIR----------- 293 SS++ L+D SGSM A L L V + Sbjct: 317 KSSRSEYIFLIDRSGSMRGKPLTKALEALQLFLQSLPPDSYFNIVSFGSNFKKLYERSQK 376 Query: 294 HHTQAKEVDEHEFFYSQET-GGTIVSSALKLMDEVVKERYNPAQWNIYAAQASDGDNWAD 352 +++Q + ++ GT + S L + + +N +DG + Sbjct: 377 YNSQTLKFACNKIKDYSADMNGTDILSPLNNIFYYGQN---IRGYNRQIFVLTDGA-VQN 432 Query: 353 DSPLCHEILAKKLLPVVRYYSYIEITRRAHQTLWREYEHLQSTFDNFAMQHIRDQDDIYP 412 + EI V + + A + L +E + ++ + D+ Sbjct: 433 RQSVVREIKRNNKKNRVHFIGFG---SSADKILIQESAIAGKG----IHEMVQFEQDLSS 485 Query: 413 VFRELFHKQNATAK 426 + ++ K + Sbjct: 486 IVIKILCKTISATL 499 >UniRef50_UPI000179F51C Novel protein. n=1 Tax=Bos taurus RepID=UPI000179F51C Length = 1156 Score = 58.6 bits (140), Expect = 4e-07, Method: Composition-based stats. Identities = 34/211 (16%), Positives = 57/211 (27%), Gaps = 33/211 (15%) Query: 232 FDLRYKNYEKRPDPSSQAVMFCLMDVSGSMDQSTKDMAKRFYILLYLFLSRTYKNVEVVY 291 DL+ R L+D SGSM+ + K ++ L Sbjct: 339 PDLQSVQPNLRKTRGE---FIFLVDRSGSMNGTNIQCVKDAMLVALKSLVP---TCLFNV 392 Query: 292 IRHHTQAKEVDE--------------HEFFYSQET-GGTIVSSALKLMDEVVKERYNPAQ 336 I + K + + GGT + S LK + R P Sbjct: 393 IGFGSTFKTLFPSSQTYNEENLAMACDSIQKMRADMGGTNILSPLKWVIRQPVLRGCP-- 450 Query: 337 WNIYAAQASDGDNWADDSPLCHEILAKKLLPVVRYYSYIEITRRAHQTLWREYEHLQSTF 396 +DG A ++ L + R YS+ I L + + Sbjct: 451 --RLLFLITDG---AVNNTGKVLELVRNHAFSTRCYSFG-IGPNVCHRLVKGLATVSKGS 504 Query: 397 DNFAMQHIRDQDDIYPVFRELFHKQNATAKG 427 F + + + + P + K A Sbjct: 505 AEF----LEEGERLQPKMVKSLKKAMAPVLS 531 >UniRef50_A0M6V8 Membrane protein containing von Willebrand factor(VWA) type A domain n=55 Tax=cellular organisms RepID=A0M6V8_GRAFK Length = 335 Score = 58.6 bits (140), Expect = 4e-07, Method: Composition-based stats. Identities = 30/224 (13%), Positives = 54/224 (24%), Gaps = 42/224 (18%) Query: 238 NYEKRPDPSSQAVMFCLMDVSGSM-----DQSTKDMAKRFYILLYLFLSRTYKNVEVVYI 292 + + + + +DVS SM + D K + VVY Sbjct: 81 DVSTQTSSTQGIDIVMAIDVSASMLARDLQPNRLDATKNVAEEFIQD-RPGDRIGLVVYA 139 Query: 293 RHH------TQAKEVDEHEFFYSQ----ETGGTIVSSALKLMDEVVKERYNPAQWNIYAA 342 T K + + GT + S L +K + + Sbjct: 140 GESFTKTPITSDKAIVLDALEDIEYNNVLENGTAIGSGLATAVNRIK---DSDAESKVII 196 Query: 343 QASDG-DNWADDSPLCHEILAKKLLPVVRYYS---------------------YIEITRR 380 +DG +N P LA + V + + Sbjct: 197 LLTDGVNNAGFIDPSTASELAVEFGIKVYTIGVGSNGMALSPVGVNPANGRLRFGNVQVE 256 Query: 381 AHQTLWREYEHLQSTFDNFAMQHIRDQDDIYPVFRELFHKQNAT 424 + L +E F + ++IY L + Sbjct: 257 IDEDLLKEIAAATGGKY-FRATNNEKLEEIYAEIDSLEKTEIEE 299 >UniRef50_A6TP10 von Willebrand factor, type A n=1 Tax=Alkaliphilus metalliredigens QYMF RepID=A6TP10_ALKMQ Length = 551 Score = 58.6 bits (140), Expect = 4e-07, Method: Composition-based stats. Identities = 23/177 (12%), Positives = 54/177 (30%), Gaps = 20/177 (11%) Query: 254 LMDVSGSMDQSTKDMAKRFYILLYLFL--SRTYKNVEV-----VYIRHH-TQAKEVDEHE 305 ++D SGSM + AK ++ S + + VYIR + Sbjct: 179 ILDNSGSMSGNPMTQAKSAAKQFLNYVDFSNGDQVEIIEFNSDVYIRIPYGSDIKSLNTA 238 Query: 306 FFYSQETGGTIVSSALKLMDEVVKERYNPAQWNIYAAQASDG-DNWADDSPLCHEILAKK 364 + T + AL + P +DG +N + S L++ Sbjct: 239 IDTMESNSQTALYDALYTGLVRAYSQSGPKC----ILAFTDGEENASIRSVSEVTELSRA 294 Query: 365 LLPVVRYYSYIEITRRAHQTLWREYEHLQSTFDNFAMQHIRDQDDIYPVFRELFHKQ 421 + + + +E ++ + ++ +++ ++ +Q Sbjct: 295 TSIPIFIIGVGSLI---DEESLKEIAEQTGGEYFYSPTAV----ELEQIYKTVYDQQ 344 >UniRef50_C4PY90 Dihydropyridine-sensitive l-type calcium channel, putative n=1 Tax=Schistosoma mansoni RepID=C4PY90_SCHMA Length = 421 Score = 58.6 bits (140), Expect = 4e-07, Method: Composition-based stats. Identities = 30/225 (13%), Positives = 63/225 (28%), Gaps = 39/225 (17%) Query: 212 LRKEIAELRAKIERVPFIDTFDLRYKNYEKRPDPSSQAVMFCLMDVSGSMDQSTKDMAKR 271 + A R +D FD+R +++ S +F L+D SGSM + +A Sbjct: 185 FTGILRVYPAFPWRQQNVDMFDVRRRSW-FIQGSSVPKDLFILLDTSGSMTGQSLKLANL 243 Query: 272 FYILLYLFLSRTYKNVEVVY--------------------IRHHTQAKEVDEH------E 305 L L + I ++ + + + Sbjct: 244 SAQKLIEALDVDDYFTVAHFPGAKDHVAPMIVTANNESEPICFNSFVQATRRNKLRLFYD 303 Query: 306 FFYSQETGGTIVSSALKLMDEV-------VKERYNPAQWNIYAAQASDGDNWADDSPLCH 358 + G + ++LK E+ + N +D N Sbjct: 304 LSTLKARGYSDFPASLKFAYEMFRNLTESARGDRGKELRNKILVLLTD--NAFVFDESVL 361 Query: 359 EILAKKLLPVVRY-YSYIEITRRAHQTLWREYEHLQSTFDNFAMQ 402 L ++ + + YS E A++ + + + + Sbjct: 362 SQLKQQKSNITTFIYSLGEPVGAAYEHKMKACAT--NDYYQYLPT 404 >UniRef50_Q97LT1 DnaK protein (Heat shock protein), C-terminal region has VWA type A domain n=1 Tax=Clostridium acetobutylicum RepID=Q97LT1_CLOAB Length = 698 Score = 58.6 bits (140), Expect = 4e-07, Method: Composition-based stats. Identities = 23/254 (9%), Positives = 70/254 (27%), Gaps = 20/254 (7%) Query: 180 MTAGKRRELHALEENLAIISNSEPAQLLEEERLRKEIAELRAKIERVPFIDTFDLRYKNY 239 + ++ + +II + E K+ + D ++ Sbjct: 443 LGKYVFYDIEYVGRKPSIIDIEYSYNKNGVVDVFATQRETAKKLPLKIEKLSEDFIFEEE 502 Query: 240 EKRPDPSSQAVMFCLMDVSGSMDQSTKDMAKRFYILLYLFLSRT-YKNVEVVYI------ 292 + + + +D+SGSM + A + + + + Sbjct: 503 LNEKEEAVHKNIVIAIDLSGSMRGKPLEEAIEASKTFVDSIDEGSFSLALIGFADKVKTL 562 Query: 293 RHHTQAKEVDEHEFFYS-QETGGTI-VSSALKLMDEVVKERYNPAQWNIYAAQASDGDNW 350 + T+ +E + GT +S ++K+ Y + + +DG + Sbjct: 563 INLTEDREEIFRAIDGLKKADVGTSTMSEPFSEAYNILKDAYG----DCFVVVLTDGQWY 618 Query: 351 ADDSPLCHEILAKKLLPVVRYYSYIEITRRAHQTLWREYEHLQSTFDNFAMQHIRDQDDI 410 + K+ + + A + + +N + + Sbjct: 619 GKKDIMAEVNKCKEYEIEIAAIGFG----NAKKDFLDKIATC---EENSIFTEVSNLKQS 671 Query: 411 YPVFRELFHKQNAT 424 + ++ + + + Sbjct: 672 FSRIAKVISRSDGS 685 >UniRef50_C3YRH6 Putative uncharacterized protein n=1 Tax=Branchiostoma floridae RepID=C3YRH6_BRAFL Length = 495 Score = 58.6 bits (140), Expect = 4e-07, Method: Composition-based stats. Identities = 29/201 (14%), Positives = 52/201 (25%), Gaps = 24/201 (11%) Query: 233 DLRYKNYEKRPDPSSQAVMFCLMDVSGSMDQSTKDMAKRFYILLYLFLSRTYKNVEVVY- 291 D+ + + Q ++D SGSM + A+ +L L V + Sbjct: 275 DILASHPKTSE--GIQGEYIAVIDRSGSMSGAFIATARETLLLFLKSLPAGSAFNIVGFG 332 Query: 292 ----IRHHTQAKEVDEHEFFYSQET--------GGTIVSSALKLMDEVVKERYNPAQWNI 339 V++ + GGT + L E + PA Sbjct: 333 STFKPLFDASV-PVNQENVGTASAWVCKMRADLGGTNLLGPL----EWIFSAPRPAGRPR 387 Query: 340 YAAQASDGDNWADDSPLCHEILAKKLLPVVRYYSYIEITRRAHQTLWREYEHLQSTFDNF 399 +DG A + L + R ++ + L R F Sbjct: 388 EVFILTDG---AVSNTSRVIDLVRANSSHTRCWAVGIGEGASR-VLIRGIAEAGRGRAEF 443 Query: 400 AMQHIRDQDDIYPVFRELFHK 420 + R Q + + Sbjct: 444 VTEVDRMQAKLLLCLKRSLQP 464 >UniRef50_C7PW75 Vault protein inter-alpha-trypsin domain protein n=1 Tax=Catenulispora acidiphila DSM 44928 RepID=C7PW75_CATAD Length = 1033 Score = 58.6 bits (140), Expect = 4e-07, Method: Composition-based stats. Identities = 34/294 (11%), Positives = 73/294 (24%), Gaps = 30/294 (10%) Query: 139 QNQQRQLTEYKTHRAGYTANGVPANISVVRSLQNSLARRTAMTAGKRRELHALEENLAII 198 N R E G G+ +++ V S++ S R + + + + Sbjct: 207 PNPVRLSIEVAVDPVGLPLAGLTSSLHGV-SVEESEGSRYLVRLNPGARADR--DFVLRL 263 Query: 199 SNSEPAQLLEEERLRKEIAELRAKIERVPFIDTFDLRYKNYEKRPDPSSQAV----MFCL 254 + + + D+ P + A + + Sbjct: 264 GYGGSG-AATSLAVAWDSESANEVAKESAKATPTDIGTFLLTVLPPEPTGATRPRDVALI 322 Query: 255 MDVSGSMDQSTKDMAKRFYILLYLFLSRTYKNVEVVY-------IRHHTQAKEVDE---- 303 +D SGSM A+R + L+ + + + T E + Sbjct: 323 LDRSGSMGGWKMTAARRAAARIVDTLTAEDRFAVLTFDDQMETPDGLPTGLSEATDRHRF 382 Query: 304 ---HEFFYSQETGGTIVSSALKLMDEVVKERYNPAQWNIYAAQASDGDNWADDSPLCHEI 360 GGT + L+ ++ + + +DG + Sbjct: 383 RAVQHLATVDARGGTEMEPPLRRAATLL--SDDNPDRDRVLILITDGQ---VGNEDRLLT 437 Query: 361 LAKKLLPVVRYYSYIEITRRAHQTLWREYEHLQSTFDNFA--MQHIRDQDDIYP 412 L +R ++ I + + L + D D Sbjct: 438 TLSPKLTHIRVHTVG-IDTAVNAAFLQRLSTLGGGHCELVESEDRLDDAMDAIH 490 >UniRef50_UPI0001C37059 hypothetical protein RflaF_04637 n=1 Tax=Ruminococcus flavefaciens FD-1 RepID=UPI0001C37059 Length = 453 Score = 58.6 bits (140), Expect = 4e-07, Method: Composition-based stats. Identities = 31/196 (15%), Positives = 64/196 (32%), Gaps = 34/196 (17%) Query: 252 FCLMDVSGSMDQST----KDMAKRFYILLYLFLSR-TYKNVEVVYIRHH------TQAKE 300 ++D SGSM + K+ AK+F +K + T + Sbjct: 185 VLILDASGSMQGAPMTAQKEAAKKFCEDTIGTNKNSNHKFAVITLDSGSKTLTDFTNDID 244 Query: 301 VDEHEFFYSQETGGTIVSSALKLMDEVVKERYNPAQWNIYAAQASDGDNWADDSPLCHE- 359 + + G T S+AL+ E++ + A NI SDG+ + + + Sbjct: 245 ELDSAIAKTTAYGSTNYSAALRNAAELLSKVSADAVRNIVL--CSDGNPYGGEEKSTGKY 302 Query: 360 ---------------ILAKKLLPVVRYYSYIE---ITRRAHQTLWREYEHLQSTFDNFAM 401 +A+++ Y+ ++ + + S N+ Sbjct: 303 TLSDYSDYEYANAAYDIAQEIKKDYEIYTLGFFHSLSGEDLDFGRTYLKDVASYDSNY-- 360 Query: 402 QHIRDQDDIYPVFREL 417 + DD+ VF ++ Sbjct: 361 AEVNKVDDLQKVFADV 376 >UniRef50_C7PNX3 von Willebrand factor type A n=2 Tax=Sphingobacteriales RepID=C7PNX3_CHIPD Length = 352 Score = 58.6 bits (140), Expect = 5e-07, Method: Composition-based stats. Identities = 24/129 (18%), Positives = 38/129 (29%), Gaps = 17/129 (13%) Query: 247 SQAVMFCLMDVSGSMDQSTKDMAKRFYILLYLFLSRTYK-------NVEVVYIRHHTQ-- 297 + ++ L+DVS SM + + L + +V+ Sbjct: 2 RRLPIYFLIDVSESMVGEQIQFVEEGLAAIIKELK-SDPYALETAWVSIIVFAGQAKTIV 60 Query: 298 -AKEVDEHEFFYSQETGGTIVSSAL-KLMDEVVKE-----RYNPAQWNIYAAQASDGDNW 350 +EV GT +S+ L LM E+ K W +DG Sbjct: 61 PLQEVISFYPPKFPIGAGTSLSNGLGHLMYEMRKNTIHTTATQKGDWKPIVFLFTDGTPT 120 Query: 351 ADDSPLCHE 359 D S E Sbjct: 121 DDTSAAVRE 129 >UniRef50_C1F3F5 von Willebrand factor type A domain protein n=1 Tax=Acidobacterium capsulatum ATCC 51196 RepID=C1F3F5_ACIC5 Length = 313 Score = 58.6 bits (140), Expect = 5e-07, Method: Composition-based stats. Identities = 25/166 (15%), Positives = 48/166 (28%), Gaps = 26/166 (15%) Query: 251 MFCLMDVSGSMD---QSTKDMAKRFYILLYLFLSRTYKNVEVVY-------IRHHTQAKE 300 + +D SGS+ K A+ F L + V + + K+ Sbjct: 73 IVLAIDTSGSVRKDLDEEKRAAREFLRA---TLRPEDRVEIVNFNTRVHEVVPFTNNLKK 129 Query: 301 VDEHEFFYSQETGGTIVSSALKLMDEVVKERYNPAQWNIYAAQASDGDNWADDSPLCHEI 360 +D E T + +A+ E + +R P + SDGDN + + Sbjct: 130 IDRG-LNRLSEGPATALYAAIAYGSEELAQR--PGR--KVLVVISDGDNT-VANSSYQQA 183 Query: 361 LAKKLLPVVRYYSYIE-------ITRRAHQTLWREYEHLQSTFDNF 399 L + + +S I+ + + Sbjct: 184 LDRAVRAETMIFSVIDLPVINDAGRDVGGEHAMIALSEATGGEYYY 229 >UniRef50_A8K7I4 Calcium-activated chloride channel regulator 1 n=44 Tax=Eumetazoa RepID=CLCA1_HUMAN Length = 914 Score = 58.6 bits (140), Expect = 5e-07, Method: Composition-based stats. Identities = 29/269 (10%), Positives = 74/269 (27%), Gaps = 31/269 (11%) Query: 161 PANISVVRSLQNSLARRTAMTAGKRREL---HALEENLAII------SNSEPAQLLEEER 211 + V L + + +++ + P + ++ Sbjct: 209 RCTFNKVTGLYEKGCEFVLQSRQTEKASIMFAQHVDSIVEFCTEQNHNKEAPNKQNQKCN 268 Query: 212 LRKEIAELRAKIERVPFIDTFDLRYKNYEKRPDPSSQAVMFCLMDVSGSMDQSTK--DMA 269 LR +R + + N Q ++ ++D SGSM + + Sbjct: 269 LRSTWEVIRDSED-FKKTTPMTTQPPNPTFSLLQIGQRIVCLVLDKSGSMATGNRLNRLN 327 Query: 270 KRFYILLYLFLSRTYKNVEVVYIRHHTQAKEVDE----HEFFYSQ------ETGGTIVSS 319 + + L + V + E+ + + +GGT + S Sbjct: 328 QAGQLFLLQTVELGSWVGMVTFDSAAHVQSELIQINSGSDRDTLAKRLPAAASGGTSICS 387 Query: 320 ALKLMDEVVKERYNPAQWNIYAAQASDGDNWADDSPLCHEILAKKLLPVVRYYSYIEITR 379 L+ V++++Y I +DG++ K+ ++ + Sbjct: 388 GLRSAFTVIRKKYPTDGSEIVLL--TDGEDNTISG---CFNEVKQSGAIIHTVALGPSA- 441 Query: 380 RAHQTLWREYEHLQSTFDNFAMQHIRDQD 408 E + +A +++ Sbjct: 442 ---AQELEELSKMTGGLQTYASDQVQNNG 467 >UniRef50_Q3KGA0 Putative secreted protein, hemolysin n=1 Tax=Pseudomonas fluorescens Pf0-1 RepID=Q3KGA0_PSEPF Length = 2887 Score = 58.6 bits (140), Expect = 5e-07, Method: Composition-based stats. Identities = 25/188 (13%), Positives = 46/188 (24%), Gaps = 22/188 (11%) Query: 246 SSQAVMFCLMDVSGSMDQS-------TKDMAKRFYILLYLFLSR--TYKNVEVVYIRHHT 296 + + ++D+SGSM + ++AK+ L K V + + T Sbjct: 2078 EIDSNILIVLDISGSMADASGVPGLSRLELAKQAISALLDKYDDLGDVKVQLVTFSSNAT 2137 Query: 297 QAKEV------DEHEFFYSQETGGTIVSSALKLMDEVVKERYNPAQWNIYAAQASDGD-- 348 V + GGT +A+ M SDG Sbjct: 2138 DRTSVWVDVATAKTLLAGLSAGGGTNYDAAVATMYNAFNTSGKLTGAQNVGYFFSDGKPN 2197 Query: 349 --NWADDSPLCHEILAKKLLPVVRYYSYIEITRRAHQTLWREYEHLQSTFDNFAMQHIRD 406 + + A+ + T N + D Sbjct: 2198 EGDIGTADEATLKAFLDANNIKNYAIGLGSGVSNAN---LDPLAYDGITHTNTNAVVVTD 2254 Query: 407 QDDIYPVF 414 + + V Sbjct: 2255 LNQLNSVL 2262 >UniRef50_UPI00016E1D1D UPI00016E1D1D related cluster n=9 Tax=Tetraodontidae RepID=UPI00016E1D1D Length = 2191 Score = 58.2 bits (139), Expect = 5e-07, Method: Composition-based stats. Identities = 26/192 (13%), Positives = 51/192 (26%), Gaps = 26/192 (13%) Query: 249 AVMFCLMDVSGSMDQSTKDMAKRFYILLYLFLS---RTYKNVEVVYIRHHTQAKEVDEHE 305 A + ++D SGS+ + + + F L L + V+Y +V + Sbjct: 381 ADIVFIIDESGSIGSANFQLMRSFLHSLISGLQVASNRVRVGIVMYNVEPMA--QVFLNT 438 Query: 306 FFYSQE-----------TGGTIVSSALKLMDE---VVKERYNPA-QWNIYAAQASDGDNW 350 F E GGT +AL + + + A +DG Sbjct: 439 FKDKSELLDFIKILPYHGGGTNTGAALNFTLQEVFIKQRGSRKDLGVQQVAVVITDG--- 495 Query: 351 ADDSPLCHEILAKKLLPVVRYYSYIEITRRAHQTLWREYEHLQSTFDNFAMQHIRDQDDI 410 A V Y+ A + + + F + + Sbjct: 496 -KSQDEVSSPAANLRRAGVTVYAVGVK--DADKAQLDQIASYPTNKHTFIIDSFTKLKTL 552 Query: 411 YPVFRELFHKQN 422 + + + Sbjct: 553 EASLQRILCQNV 564 >UniRef50_C8PMB0 BatA protein n=1 Tax=Treponema vincentii ATCC 35580 RepID=C8PMB0_9SPIO Length = 332 Score = 58.2 bits (139), Expect = 5e-07, Method: Composition-based stats. Identities = 30/214 (14%), Positives = 54/214 (25%), Gaps = 41/214 (19%) Query: 237 KNYEKRPDPSSQAVMFCLMDVSGSMDQSTK------DMAKRFYILLYLF-----LSRTYK 285 + + SS + ++D S SM + AKR L T Sbjct: 78 RQTSEAMYSSSGQALMFVIDTSPSMAAQDMGTETRLEAAKRIIKSFAEKYEGDSLGLTAL 137 Query: 286 NVEVVYIRHHTQAKEVDEHEFFYSQE---TGGTIVSSALKLM-DEVVKERYNPAQWNIYA 341 + T + Q GT + L + + P+ Sbjct: 138 GSSAAVLIPPTIDRHTFLTRLDQLQVGELGDGTAIGMGLASAVLHLTQYSTLPSH----I 193 Query: 342 AQASDGD-NWADDSPLCHEILAKKLLPVVRYYSYI----------------EITRR---- 380 +DGD N + P + K EI+ Sbjct: 194 ILFTDGDNNTGEIHPRAAADIIKHKKIGFYIIGLGKSGYAPVKYIDPIQKKEISGTLNTV 253 Query: 381 AHQTLWREYEHLQSTFDNFAMQHIRDQDDIYPVF 414 ++T ++ + F+ + DI+ F Sbjct: 254 FNETELQKIAGYGNGRY-FSAKSPELLTDIFNRF 286 >UniRef50_A7C0I2 von Willebrand factor type A domain protein n=1 Tax=Beggiatoa sp. PS RepID=A7C0I2_9GAMM Length = 150 Score = 58.2 bits (139), Expect = 5e-07, Method: Composition-based stats. Identities = 18/142 (12%), Positives = 37/142 (26%), Gaps = 21/142 (14%) Query: 66 QGRGGLRHRVHPGNDHFVQNDRIERPQGGGGGSGSGQGQAS--QDGEGQDEFVFQISKDE 123 G R G D+ ++ S + + D I ++ Sbjct: 28 LASSGQRPLAEAGEDNTAYTGKLWAGGINSPTISSEKTTGATTGDLASLRAPSEPIEREN 87 Query: 124 YLDLLFEDLALPNLKQNQQRQLTEYKTHRAGYTANGVPANISVVRSLQNSLARRTAMTAG 183 Y + N +Q++E +I V ++ R Sbjct: 88 YA----------HFDNNPIKQVSEQ---------PVSTFSIDVDTGSYANVRRFINSGRL 128 Query: 184 KRRELHALEENLAIISNSEPAQ 205 + +EE L + + P+ Sbjct: 129 PPHDAVRVEEMLNYFNYNYPSP 150 >UniRef50_C4DQN3 Uncharacterized protein containing a von Willebrand factor type A (VWA) domain n=1 Tax=Stackebrandtia nassauensis DSM 44728 RepID=C4DQN3_9ACTO Length = 831 Score = 58.2 bits (139), Expect = 5e-07, Method: Composition-based stats. Identities = 30/246 (12%), Positives = 65/246 (26%), Gaps = 30/246 (12%) Query: 193 ENLAIISNSEPAQLLEEERLRKEIAELRAKIERVPFIDTFDLRYKNYEKRPDPSSQAVMF 252 + + E + + E + ++ I DL R P + Sbjct: 257 DFILRFDYGESGDVAGSLLTAPDENEPTSGTFQLTAIPPSDL------PRARPR---DVV 307 Query: 253 CLMDVSGSMDQSTKDMAKRFYILLYLFLSRTYKNVEVVYIRHHTQAKEVDEH-------- 304 L+D SGSM A+R + LS + + T + +D + Sbjct: 308 VLLDRSGSMGGWKMVAARRAAARIVDTLSSADRFAVRCFDTAMTSPEGLDPNGLSAGTDR 367 Query: 305 ----EFFYS---QETGGTIVSSALKLMDEVVKERYNPAQWNIYAAQASDGDNWADDSPLC 357 + + GGT + L +++ + +DG ++ + Sbjct: 368 NRFRAVEHLAGTETRGGTDILKPLSTAVDLLTA--GEKGRDRVIILVTDGQ-VGNEDQIL 424 Query: 358 HEILAKKLLPVVRYYSYIEITRRAHQTLWREYEHLQSTFDNFAMQHIRDQDDIYPVFREL 417 E+ + V I + + + R + + R + Sbjct: 425 RELTGRLSGMRVHVVG---IDKAVNAGFLHRLALVGRGRCELVESEDRLDEATAHIHRRI 481 Query: 418 FHKQNA 423 Sbjct: 482 VAPVVT 487 >UniRef50_D2RSW3 von Willebrand factor type A n=1 Tax=Haloterrigena turkmenica DSM 5511 RepID=D2RSW3_9EURY Length = 1446 Score = 58.2 bits (139), Expect = 5e-07, Method: Composition-based stats. Identities = 30/184 (16%), Positives = 53/184 (28%), Gaps = 20/184 (10%) Query: 249 AVMFCLMDVSGSMDQSTKDMAKRFYILLYLFLSRTYKNVEVVYIRHH------TQAKEVD 302 A + D SGSM S A+ L+ + + V Y T + Sbjct: 534 ADFVFVNDESGSMSGSPTHYAELAGKRFVGALTDSERAGRVGYASGANLDQPLTTDHDAV 593 Query: 303 EHEFFYSQETGGTIVSSALKLMDEVVKERYNPAQWNI--YAAQASDGDNWADDSPLCHEI 360 +GGT + L++ ++E N SDG + PL Sbjct: 594 NSSLERLSASGGTNTRAGLRVGLNHLEE---EGWENRSAVMILLSDGK--SGSDPLPVAE 648 Query: 361 LAKKLLPVVRYYSYIEITRRAHQTLWREYEHLQSTFDNFAMQHIRDQDDIYPVFRELFHK 420 A + + ++ RE + H+ ++D+ F + Sbjct: 649 DAAEAGVEISTVGLGN---NINENELREIAAITGGDFY----HVEREEDLPDTFERVAEN 701 Query: 421 QNAT 424 Q Sbjct: 702 QTGP 705 >UniRef50_UPI0001789223 von Willebrand factor type A n=1 Tax=Geobacillus sp. Y412MC10 RepID=UPI0001789223 Length = 968 Score = 58.2 bits (139), Expect = 5e-07, Method: Composition-based stats. Identities = 25/164 (15%), Positives = 46/164 (28%), Gaps = 23/164 (14%) Query: 245 PSSQAVM----FCLMDVSGSM--------DQSTKDMAKRFYILLYLFLSRTYKNVEVV-- 290 P + VM ++D SGSM + AK + T V VV Sbjct: 63 PPANVVMPNDVVLIIDKSGSMAPTYGPNNGEDKMTNAKEAAKGFVDLMDMTKHRVAVVDF 122 Query: 291 ----YIRHHTQAKEVDEHEFFYSQETGGTIVSSALKLMDEVVKERYNPAQWNIYAAQASD 346 T K+ + GGT +A+ ++ + A +D Sbjct: 123 SSSASSFPFTVDKDAAKSYINTINSGGGTATGNAIDAAVALLADHRTEA--QPVIVLMTD 180 Query: 347 G---DNWADDSPLCHEILAKKLLPVVRYYSYIEITRRAHQTLWR 387 G ++ + P + + + Y ++ Sbjct: 181 GAATESPKNTDPFDYALQRAQAAKDAGVIFYTIALLNPNEDPIT 224 >UniRef50_A2EHL8 Ubiquitin-conjugating enzyme family protein n=3 Tax=Trichomonas vaginalis RepID=A2EHL8_TRIVA Length = 967 Score = 58.2 bits (139), Expect = 5e-07, Method: Composition-based stats. Identities = 33/233 (14%), Positives = 74/233 (31%), Gaps = 22/233 (9%) Query: 206 LLEEERLRKEIAELRAKIERVPFIDTFD-----LRYKNYEKRPDPSSQAVMFCLMDVSGS 260 + + + K++ ++ E P D LR + + + + +D SGS Sbjct: 409 KSQNQIVSKKLVKVIDPREGEPRDKPLDEFAQQLREQKIKIETPSEVKQINIICIDTSGS 468 Query: 261 MDQSTKDMAKRFYILLYLFLSRTYKNVEVVYIRHHTQAKEVDE---------HEFFYSQE 311 M + +AK + ++ + ++ + + + Sbjct: 469 MGGTKIQIAKTCFKVIVNRAYEVGPSSLWGLYTFNSTPERKLKLSPIPADFHTQVDKLGV 528 Query: 312 TGGTI----VSSALKLMDEVVKERYNPAQWNIYAAQASD-GDN-WADDSPLCHEILAKKL 365 +G T + +A+ ++E VK + +D GDN ++ ++ L K+L Sbjct: 529 SGCTALYYCIIAAMNEINEKVKSDSSYKNALKRIIALTDGGDNTYSHNTARGIADLTKQL 588 Query: 366 LPVVRYYSYIEITRRAHQTLW-REYEHLQSTFDNFAMQHIRDQDDIYPVFREL 417 + Y YIE+ R + I R++ Sbjct: 589 IDNGIYLDYIEL-GNVGDMKLPRNMAYYTGGDYLKFTSDIFQSSSSRVDIRKI 640 >UniRef50_UPI00016C38A3 LPXTG-motif cell wall anchor domain protein n=1 Tax=Gemmata obscuriglobus UQM 2246 RepID=UPI00016C38A3 Length = 874 Score = 58.2 bits (139), Expect = 5e-07, Method: Composition-based stats. Identities = 41/232 (17%), Positives = 70/232 (30%), Gaps = 25/232 (10%) Query: 202 EPAQLLEEERLRKEIAELRAKIERVPFIDTFDLRYKNYEKRPDPSSQAVMFCLMDVSGSM 261 A+ + LR A + ++ + +K P + L+D SGSM Sbjct: 240 WRAKAEDRSALRVWTQPDPATGKAYFLALCAPPKFADAKKVPRE-----VILLVDHSGSM 294 Query: 262 DQSTKDMAKRFYILLYLFLSRTYKNVEVVYIRHHT-------QAKEVDEH-----EFFYS 309 + + A LS ++ H T K E+ EF Sbjct: 295 SGAKWEAADWAVERFLAGLSEDDAFSLGLF--HSTTKWFGERTRKATPENVRAAVEFLKL 352 Query: 310 -QETGGTIVSSALKLMDEVVKERYNPAQWNIYAAQASDGDNWADDSPLCHEILAKKLLPV 368 ++ GGT + AL+ + PA + +D + D + + P Sbjct: 353 NRDQGGTELGVALEQALARSRSAETPA---RHVLILTDAE-VTDAGRILRLADLESEKPN 408 Query: 369 VRYYSYIEITRRAHQTLWREYEHLQSTFDNFAMQHIRDQDDIYPVFRELFHK 420 R S + I + L E F + D DD+ E+ Sbjct: 409 RRRISVLCIDAAPNAALASELAERGGGVSRFLTSNPDD-DDVTTALDEVLAD 459 >UniRef50_Q31JK3 Type A von Willebrand factor-like n=1 Tax=Thiomicrospira crunogena XCL-2 RepID=Q31JK3_THICR Length = 349 Score = 58.2 bits (139), Expect = 6e-07, Method: Composition-based stats. Identities = 27/220 (12%), Positives = 60/220 (27%), Gaps = 42/220 (19%) Query: 239 YEKRPDPSSQAVMFCLMDVSGSMDQSTKDM----------AKRFYILLYLFLSRTYKNVE 288 P +S + +D+SGSM+++ + K + + Sbjct: 93 LNTTPFQASGKDLMLAVDLSGSMEKTDMPLRGVEVDRLTAVKSVVKNFIQK-RQGDRMGL 151 Query: 289 VVY---------IRHHTQAKEVDEHEFFYSQETGGTIVSSALKLMDEVVKERYNPAQWNI 339 VV+ + + E +E T + A+ + + + + Sbjct: 152 VVFGSQAFLQSPLTYDLNTVETLLNETEIGMAGNNTAIGDAIGIALKHLHQNSEKKA--- 208 Query: 340 YAAQASDGDNW-ADDSPLCHEILAKKLLPVVRYYSYIE------------ITRRAHQTLW 386 +DG N PL A+++ + + R T Sbjct: 209 VLILLTDGSNTAGAVQPLDAAKQAQEMGLKIYTIGIGQNQATGLDAFIFGPNRNMDTTTL 268 Query: 387 REYEHLQSTFDNFAMQHIRDQDDIYPVFRELFHKQNATAK 426 ++ L F + ++IY Q ++ Sbjct: 269 QKIAELTQGRF-FMAKDTNQLNEIYQ-----LIDQLEASQ 302 >UniRef50_C6VXL7 von Willebrand factor type A n=1 Tax=Dyadobacter fermentans DSM 18053 RepID=C6VXL7_DYAFD Length = 339 Score = 58.2 bits (139), Expect = 6e-07, Method: Composition-based stats. Identities = 32/186 (17%), Positives = 55/186 (29%), Gaps = 32/186 (17%) Query: 240 EKRPDPSSQAVMFCLMDVSGSMDQSTK-----DMAKRFYILLYLFLSRTYKNVEVVYIRH 294 E++ S + L+D+S SM + + AKR + +V+ Sbjct: 94 ERKDRFSEGIDIMLLLDISDSMIEKDLSPNRLEAAKRMARQFIKG-RLQDRIGLIVFAGE 152 Query: 295 H------TQAKEVD----EHEFFYSQETGGTIVSSALKLMDEVVKERYNPAQWNIYAAQA 344 T E+ + T GT + SAL + V + A + A Sbjct: 153 AVSLCPLTTDYELLYGFLDEVTPSLIPTPGTAIGSALAVA---VNRMRDTAGESKVAILI 209 Query: 345 SDGDNWADD-SPLCHEILAKKLLPVVRYYSYIEITRRAHQTL------------WREYEH 391 SDGDN + + P LA V S + + + Sbjct: 210 SDGDNTSGNLGPTTSAQLANAFGVKVYTISVGKPKSASKADTTASAGALMDEGELQNIAG 269 Query: 392 LQSTFD 397 + + Sbjct: 270 IGNGKY 275 >UniRef50_A0Z5Z1 BatB protein, putative n=2 Tax=unclassified Gammaproteobacteria (miscellaneous) RepID=A0Z5Z1_9GAMM Length = 332 Score = 58.2 bits (139), Expect = 6e-07, Method: Composition-based stats. Identities = 33/207 (15%), Positives = 60/207 (28%), Gaps = 36/207 (17%) Query: 240 EKRPDPSSQAVMFCLMDVSGSMDQSTKDM----------AKRFYILLYLFLSRTYKNVEV 289 E +S + +D+SGSM + K + + Sbjct: 81 EPIELANSGRDLLLAIDLSGSMQIEDMQIGNSLVSRITAVKAIAADFASR-RTGDRVGLI 139 Query: 290 VY---------IRHHT-QAKEVDEHEFFYSQETG-GTIVSSALKLMDEVVKERYNPAQWN 338 ++ + K+ E G T + AL L + ++ER PA + Sbjct: 140 LFGTRAYVQAPLTFDVKTVKQFIEEA--QLGFAGEDTAIGDALGLAVKRLRER--PAD-S 194 Query: 339 IYAAQASDGDNWADD-SPLCHEILAKKLLPVVRYYSY-------IEITRRAHQTLWREYE 390 +DG + A P+ LA ++ + + + L Sbjct: 195 RVLILLTDGQDTASTVDPMEAAALASEMNVKIYTIGISRRLGTSSNSSGEVDEALLTAIA 254 Query: 391 HLQSTFDNFAMQHIRDQDDIYPVFREL 417 F + ++ DIY V EL Sbjct: 255 QATGGRY-FRARTPKELQDIYQVLDEL 280 >UniRef50_A8L751 Magnesium chelatase n=8 Tax=cellular organisms RepID=A8L751_FRASN Length = 776 Score = 58.2 bits (139), Expect = 6e-07, Method: Composition-based stats. Identities = 24/130 (18%), Positives = 47/130 (36%), Gaps = 16/130 (12%) Query: 229 IDTFDLRYKNYEKRPDPSSQAVMFCLMDVSGSMDQSTK--DMAKRFYILLYLFLSRTYKN 286 +D D R R ++ ++D SGSM T+ ++ LL R + Sbjct: 563 LDPADRRGAQRRGRE----GNLVLFVVDASGSMAARTRLRRVSTAVLSLLVDAYQRRDRI 618 Query: 287 VEVVY-------IRHHTQAKEVDEHEFFYSQETGGTIVSSALKLMDEV--VKERYNPAQW 337 + + + T + EV + G T +++ L+ V + R +P + Sbjct: 619 GMITFRGVGAQVVLAPTSSVEVGAARLVGLRTGGRTPIAAGLECAGVVLRAEARRDPDRR 678 Query: 338 NIYAAQASDG 347 + +DG Sbjct: 679 PLLVL-VTDG 687 >UniRef50_C7ZL29 Putative uncharacterized protein n=2 Tax=Nectriaceae RepID=C7ZL29_NECH7 Length = 923 Score = 58.2 bits (139), Expect = 6e-07, Method: Composition-based stats. Identities = 27/174 (15%), Positives = 50/174 (28%), Gaps = 24/174 (13%) Query: 239 YEKRPDPSSQAVMFCLMDVSGSMDQSTKDMAKRFYILLYLFLSRTYKNVEVVYIRH---- 294 K PS + + + D SGSM + K + L K + H Sbjct: 268 VPKFKLPSEKPEIVFICDRSGSMGGT-IPDLKAALEIFLKSLPVGVKFNICSFGSHFSFL 326 Query: 295 ----HTQAKEVDEHEFFYSQE----TGGTIVSSALKLMDEVVKERYNPAQW--NIYAAQA 344 T +K+ + + + GGT + + V+ + N+ Sbjct: 327 WDRSQTYSKDSLDKALRHIKSFDADFGGTEM-------YQPVEATFKKRYTDMNLEIFLL 379 Query: 345 SDGDNWADDSPLCHEI-LAKKLLPVVRYYSYIEITRRAHQTLWREYEHLQSTFD 397 +DG + D + VR +S + + L + F Sbjct: 380 TDGAIYDQDRLFELINHNVAESKGSVRVFSLGIGSGASTS-LVEGVARAGNGFA 432 >UniRef50_A7SIA9 Predicted protein n=2 Tax=Nematostella vectensis RepID=A7SIA9_NEMVE Length = 484 Score = 57.9 bits (138), Expect = 6e-07, Method: Composition-based stats. Identities = 28/154 (18%), Positives = 47/154 (30%), Gaps = 18/154 (11%) Query: 248 QAVMFCLMDVSGSMDQSTKDMAKRFYILLYLFLSRTYKNVEVVYIRHHTQAKEVDEH--- 304 + + +D SGSM AKRF L + K V IR T+AK + Sbjct: 269 RTDLAFAIDASGSMGDQGFLRAKRFVKALIGSFKVSQKGTHVGIIRFSTRAKVMFTFTEH 328 Query: 305 --------EFFYSQ-ETGGTIVSSALKLMDEVVKERYNPAQWNIYAA----QASDGDNWA 351 + GGT AL+L + + ++ + +DG + Sbjct: 329 FTHEDVNYAIDDIEYTEGGTKTELALRLARTELFSKQGGSRTSPLIFKLFVLMTDGRSEY 388 Query: 352 DDSPLCHEILAKKLLPVVRYYSYIEITRRAHQTL 385 + + K V + + L Sbjct: 389 FHAVARQAKMLK--RSGVHVMAVGIGKYTNQREL 420 >UniRef50_A0C282 Chromosome undetermined scaffold_144, whole genome shotgun sequence n=3 Tax=Paramecium tetraurelia RepID=A0C282_PARTE Length = 825 Score = 57.9 bits (138), Expect = 6e-07, Method: Composition-based stats. Identities = 25/204 (12%), Positives = 60/204 (29%), Gaps = 23/204 (11%) Query: 234 LRYKNYEKRPDPSSQAVMFCLMDVSGSMDQSTKDMAKRFYILLYLFLSRTYKNVEVVYIR 293 ++Y ++ D + ++ ++D SGSM K + + L + I Sbjct: 7 VKYFRLPQKEDGE-KGLLISILDCSGSMSSYWKY-----VSIYHNELIQKASMSIA--IT 58 Query: 294 HHTQAKEVDEHEFFY----SQETGGTIVSSALKLMDEVVKERYNPAQWNIYAAQASDGDN 349 T K + +E + GGT ++ A +++ +K+R + + SDG Sbjct: 59 FDTAVKVLQPNENIHPNINQYGGGGTNITCAFVALNQEIKKRNITSDLTVV--FVSDGQ- 115 Query: 350 WADDSPLCHEILAKKLLPVVRYYSYIEITRRAHQTLWREYEHLQSTFDNFAMQHIRDQDD 409 + + + + + + I + Sbjct: 116 -GSYDEKQIKQDMPTV-QNLNFICIGVGNGFPTHISMSLRSLYHTGNLSIPPVFIVSVQE 173 Query: 410 IYPV------FRELFHKQNATAKG 427 Y +F ++ + Sbjct: 174 QYASKSREDYLDSIFKEEFESVGA 197 >UniRef50_UPI0001B4AD96 von Willebrand factor type A n=1 Tax=Bacteroides sp. 2_1_7 RepID=UPI0001B4AD96 Length = 194 Score = 57.9 bits (138), Expect = 6e-07, Method: Composition-based stats. Identities = 23/120 (19%), Positives = 38/120 (31%), Gaps = 13/120 (10%) Query: 251 MFCLMDVSGSMDQSTKDMAKRFYILLYLFLSR---TYKNVEVVYIRHHTQAKEVDEHEFF 307 ++ L+D SGSMD S + + L KN+++V + + + Sbjct: 3 LYILLDTSGSMDGSKISALNDSMENIIIDLQEKAFNGKNIDIVVLSFARDVTWMHDKPIN 62 Query: 308 -------YSQETGGTIVSSA-LKLMDEVVKERYNPAQWNIYAAQASDGDNWADDSPLCHE 359 +G T + A +L + Y N SDG D E Sbjct: 63 ILDFNWKPLTASGMTSLGKACCELAKNI--STYPANNENTAIVLLSDGCPTDDYDEGIME 120 >UniRef50_UPI0001AF2DA9 hypothetical protein SrosN1_23653 n=1 Tax=Streptomyces roseosporus NRRL 11379 RepID=UPI0001AF2DA9 Length = 527 Score = 57.9 bits (138), Expect = 7e-07, Method: Composition-based stats. Identities = 34/247 (13%), Positives = 67/247 (27%), Gaps = 39/247 (15%) Query: 179 AMTAGKRRELHALEENLAIISNSEPAQLLEEERLRKEIAELRAKIERVPFIDTFDLRY-- 236 A+T A+ ++ + + LR+ + + + +L + Sbjct: 265 ALTGATPEARDAVRTLTEHFRSTAVQREITALTLRRPVVAAARPADPLAPEQRRELPFPG 324 Query: 237 ---------KNYEKRPDPSSQAVMFCLMDVSGSMDQSTKDMAKRFYILLYLFLSRTYKNV 287 +YE R S+ ++D SGSM K L Sbjct: 325 TRSVADGLLSSYEHRLRRPSRT--VYVLDTSGSMKGRRLAQLKSALNGLTGDFRE---RE 379 Query: 288 EVVYIRHHTQAKEVDEHEFF----------------YSQETGGTIVSSALKLMDEVVKER 331 +V + + K+V H G T + S+L + Sbjct: 380 QVTLLPFGSTVKQVRTHTVDPADPKAGPAAIRADAAALSAEGDTAIYSSLAAAYD----H 435 Query: 332 YNPAQWNIY--AAQASDGDNWADDSPLCHEILAKKLLPVVRYY-SYIEITRRAHQTLWRE 388 P + + +DG+N A S + L R + + + ++ Sbjct: 436 LGPDTESAFTSIVLMTDGENTAGRSAAEFGAFYRALPEARRVTPVFPVVFGDSDRSELEA 495 Query: 389 YEHLQST 395 L Sbjct: 496 IAALTGG 502 >UniRef50_A6QCY4 von Willebrand factor type A domain protein n=2 Tax=unclassified Epsilonproteobacteria RepID=A6QCY4_SULNB Length = 307 Score = 57.9 bits (138), Expect = 7e-07, Method: Composition-based stats. Identities = 30/186 (16%), Positives = 51/186 (27%), Gaps = 24/186 (12%) Query: 251 MFCLMDVSGSM-----DQSTKDMAK--RFYILLYLFL--SRTYKNVEVVYIRHH------ 295 + +D SGSM D+ + K L F+ V++ Sbjct: 82 LVLAIDASGSMAQSGFDEKDRFKTKYETTLDLSADFIKHRFDDNMGVVIFGTFAYTASPL 141 Query: 296 TQAKEVDEHEFFYSQ---ETGGTIVSSALKLMDEVVKERYNPAQWNIYAAQASDG-DNWA 351 T E E + T + AL + Y A + +DG N Sbjct: 142 TYDLEAMESMLKMTTVGIAGESTAIGDALMQAMRTL--SYGEA-QSKAIILLTDGYHNAG 198 Query: 352 DDSPLCHEILAKKLLPVVRYYSYIEITRRAHQTLWREYEHLQSTFDNFAMQHIRDQDDIY 411 SP AK+ + + + L S ++A ++Y Sbjct: 199 RSSPKAAVAKAKEKGIKIYTIGVGK-SSDYDAALLDTIAKE-SGGKSYAAASAAQLKEVY 256 Query: 412 PVFREL 417 +L Sbjct: 257 KEIDKL 262 >UniRef50_B0XT03 von Willebrand domain protein n=4 Tax=Trichocomaceae RepID=B0XT03_ASPFC Length = 946 Score = 57.9 bits (138), Expect = 7e-07, Method: Composition-based stats. Identities = 28/228 (12%), Positives = 60/228 (26%), Gaps = 28/228 (12%) Query: 206 LLEEERLRKEIAELRAKIE--RVPFIDT--FDLRYKNYEKRPDPSSQAVMFCLMDVSGSM 261 L + L + L + + K + + ++D SGSM Sbjct: 246 LARDFVLLVKADGLDTPRAMLETHPVIPTQRAIMTTLVPKFGLEPIKPEIIFVIDRSGSM 305 Query: 262 DQSTKDMAKRFYILLYLFLSRTYKNVEVVYIRHHTQAK--------EVDEHEF---FYSQ 310 D K + L + H+ E + + Sbjct: 306 -MDKIDTLKSALRVFLKSLPVGVCFNICSFGSRHSFLWKQSLFYTAESLQEALSFVDGVR 364 Query: 311 ET-GGTIVSSALKLMDEVVKERYNPAQWNIYAAQASDGDNWADDSPLCHEILAKKLLPV- 368 GGT + A++ V+ R + + +DG W + ++ Sbjct: 365 ANMGGTEMQEAVEA---TVRSRMKDKELEVLIL--TDGQIW---NQQTLFGFIRETAADN 416 Query: 369 -VRYYSYIEITRRAHQTLWREYEHLQSTFDNFAMQHIRDQDDIYPVFR 415 R++S +H L + F + + + + + Sbjct: 417 GARFFSLGIGNGASHS-LVEGIARAGNGFSQMVVNYEELDRKVVRMLK 463 >UniRef50_B0EK65 Putative uncharacterized protein n=7 Tax=Entamoeba RepID=B0EK65_ENTDI Length = 720 Score = 57.9 bits (138), Expect = 7e-07, Method: Composition-based stats. Identities = 22/166 (13%), Positives = 45/166 (27%), Gaps = 23/166 (13%) Query: 242 RPDPSSQAVMFCLMDVSGSMDQSTKDMAKRFYILLYLFLSRTYKNVEVVY-IRHHTQAKE 300 + + + D SGSM + + L L K + + ++ KE Sbjct: 205 NKEKEGDINIIFICDRSGSMYGEGINALRNMLQLFLRQLPLKSKFEIISFGSKYDFMFKE 264 Query: 301 VDEHEFFYSQET-----------GGTIVSSALKLMDEVVKERYNPAQWNIYAAQASDGDN 349 + E+ + GGT + + LK ++ + +DG Sbjct: 265 MVEYNEDTLKNASNRISEFEANYGGTSMDAPLKA---LIDNNTEK----CHIILLTDG-- 315 Query: 350 WADDSPLCHEILAKKLLPVVRYYSYIEITRRAHQTLWREYEHLQST 395 D+ + L + L R + + Sbjct: 316 -YVDNKINTIEYIHNLSKKNSLHGVGLGRSC-DIELIRNIGRIGNG 359 >UniRef50_C3ZT39 Putative uncharacterized protein n=1 Tax=Branchiostoma floridae RepID=C3ZT39_BRAFL Length = 1044 Score = 57.9 bits (138), Expect = 7e-07, Method: Composition-based stats. Identities = 21/178 (11%), Positives = 39/178 (21%), Gaps = 26/178 (14%) Query: 252 FCLMDVSGSMDQSTKDMAKRFYILLYLFLSRTYKNVEVVYIRHHTQA-------KEVDEH 304 ++D SGSM + ++ L V + T E Sbjct: 274 VLVLDTSGSMGKKLLFNLRQSLTSHVYNLPIGSSLGIVTFNSEATINAPMTVIGNETTRD 333 Query: 305 EFFY---SQETGGTIVSSALKLMDEVVKERYNPAQWNIYAAQASDGDNWADDSPLCHEIL 361 G T + S L+ ++ SDG Sbjct: 334 ALVGALPMTTGGKTSIGSGLQEALGLLGNDLGR------IILISDGQ-------EDELPH 380 Query: 362 AKKLLPVVRYYSYIEIT---RRAHQTLWREYEHLQSTFDNFAMQHIRDQDDIYPVFRE 416 +LP +R + T + + + + + I Sbjct: 381 IADVLPALRVAGHTVHTVAIGADGDPMLEQLSRDTGGKSFYHTRWSTNFPGILRTIEA 438 >UniRef50_C9LDM7 BatA protein n=10 Tax=Prevotella RepID=C9LDM7_9BACT Length = 334 Score = 57.9 bits (138), Expect = 7e-07, Method: Composition-based stats. Identities = 30/220 (13%), Positives = 55/220 (25%), Gaps = 45/220 (20%) Query: 238 NYEKRPDPSSQAVMFCLMDVSGSMDQSTKD-----MAKRFYILLYLFLSRTYKNVEVVYI 292 + + + +DVS SM AK+ V+ Sbjct: 77 HNALSNKETEGINIMMAIDVSTSMLTPDLPPSRIETAKQVAYEFINN-RPDDNIGLTVFG 135 Query: 293 RHH------TQAKEVDEHEFFY----SQETG----GTIVSSALKLMDEVVKERYNPAQWN 338 T + F Q+ G GT + L +++ + ++ Sbjct: 136 GEAYTQCPLTTDHSALLNMFKQVNCDLQKEGVISPGTAIGMGLSSAVSHLEQSKSKSK-- 193 Query: 339 IYAAQASDGD-NWADDSPLCHEILAKKLLPVVRYYSYIE--------------------I 377 +DG+ N + SPL +AK+L + S I Sbjct: 194 -VIILLTDGENNAGEISPLTAAEMAKRLGIRIYTISVGTDAAVNQTVATLPNGETYEAAI 252 Query: 378 TRRAHQTLWREYEHLQSTFDNFAMQHIRDQDDIYPVFREL 417 + + + + DIY L Sbjct: 253 KQNTDPKTLEAIANSTGGKF-YQARSKAKLRDIYQNIDRL 291 >UniRef50_B1KQC1 Outer membrane adhesin like proteiin n=1 Tax=Shewanella woodyi ATCC 51908 RepID=B1KQC1_SHEWM Length = 3259 Score = 57.9 bits (138), Expect = 7e-07, Method: Composition-based stats. Identities = 18/130 (13%), Positives = 37/130 (28%), Gaps = 19/130 (14%) Query: 244 DPSSQAVMFCLMDVSGSMDQSTKDMAKRFYILLYLFL-----SRTYKNVEVVYIRHHTQ- 297 P + ++D SGSM + + A++ + +Y L + + + T Sbjct: 2831 IPGQNYNIAFIIDTSGSMGSTAVNTAEQQLLTVYSQLLSYAEQNNSGVINTLLVDFDTDA 2890 Query: 298 ------------AKEVDEHEFFYSQETGGTIVSSALKLMDEVVKERYNPAQW-NIYAAQA 344 A ++ + G T A + + N Sbjct: 2891 SLLISINLTDPNALQLLSNALATMTSGGATNYYDAFSTAYDWFQNGVPTTNNGNNITYFI 2950 Query: 345 SDGDNWADDS 354 +DG D+ Sbjct: 2951 TDGQPTTDNG 2960 >UniRef50_C3ZZV2 Putative uncharacterized protein n=1 Tax=Branchiostoma floridae RepID=C3ZZV2_BRAFL Length = 4065 Score = 57.9 bits (138), Expect = 7e-07, Method: Composition-based stats. Identities = 32/190 (16%), Positives = 54/190 (28%), Gaps = 25/190 (13%) Query: 244 DPSSQAVMFCLMDVSGSMDQSTKDMAKRFYILL------------YLFLSRTYK-NVEVV 290 + + + L+D SGS+ + D+ K F + + + + N E V Sbjct: 1583 NRTLNLDVVFLLDGSGSVGSANFDLLKTFTTRIATNFDVSTNLTRVGVVQYSDQTNSEFV 1642 Query: 291 YIRHHTQAKEVDEHEFFYSQETGGTIVSSALKLMDE---VVKERYNPAQWNIYAAQASDG 347 T+A+ + Q GGT +AL + + + P NI +DG Sbjct: 1643 LNTFSTEAEVLAAIAAISYQ-NGGTSTGAALDYVRQNVFISASGDRPDAANILIV-LTDG 1700 Query: 348 DNWADDSPLCHEILAKKLLPVVRYYSYIEITRRAHQTLWREYEHLQSTFDNFAMQHIRDQ 407 S + YS ++ Sbjct: 1701 ----VSSDDVSFPAMAARNAGITIYSVGIGDG-VDYNTLQQIA--GDPNKVLQATGFSSL 1753 Query: 408 DDIYPVFREL 417 DDI EL Sbjct: 1754 DDIGGQLEEL 1763 Score = 53.6 bits (127), Expect = 1e-05, Method: Composition-based stats. Identities = 25/159 (15%), Positives = 48/159 (30%), Gaps = 25/159 (15%) Query: 249 AVMFCLMDVSGSMDQSTKDMAKRFYILLYLFLSRTYKNVEVVYIRH-HTQAKEVDEHEFF 307 + L+D SGS+ + K F L+ +V ++ T KE D ++ Sbjct: 1841 WDLVFLLDGSGSVGSNNFLNVKNFTKLITDLFPVGDNATKVGLVQFSDTIQKEFDLRDYD 1900 Query: 308 YSQE-----------TGGTIVSSALKLMDEVVKERYNPAQWNI-----YAAQASDGDNWA 351 E GGT +A+ + +V +N N +DG+++ Sbjct: 1901 TKAEILSAIDNISYLGGGTYTGNAIDYVRQV---SFNTINGNRGSHPDMLIVLTDGESFD 1957 Query: 352 DDSPLCHEILAKKLLPVVRYYSYIEITRRAHQTLWREYE 390 + + ++ T E Sbjct: 1958 PVTFASQSA----RDQGITIFAIGVGTG-VDYATLEEIA 1991 Score = 51.7 bits (122), Expect = 5e-05, Method: Composition-based stats. Identities = 27/152 (17%), Positives = 49/152 (32%), Gaps = 20/152 (13%) Query: 258 SGSMDQSTKDMAKRFYILLYLF--LSRTYKNVEVVYIRHHTQAKEVDEHEFFYSQE---- 311 SGS+ ++ K+F L +S+T V VV + E + F Q Sbjct: 2421 SGSVGADNFNLVKQFAKRLVDNFEISQTDTKVGVVQYSSSSNV-EFYLNAFSTKQAVLDA 2479 Query: 312 -------TGGTIVSSALKLMDEVVKERYNPAQWNI--YAAQASDGDNWADDSPLCHEILA 362 GGT +A+ + + N A+ N +DG+ + L+ Sbjct: 2480 INAVTYQQGGTNTGAAITYTMQEIFASANGARANYPDVLIVVTDGE---SSDDVAVPALS 2536 Query: 363 KKLLPVVRYYSYIEITRRAHQTLWREYEHLQS 394 + + Y+ TL + + Sbjct: 2537 ARNAGTL-IYAVGVGNGVNQATLLQIAGNAGQ 2567 Score = 49.4 bits (116), Expect = 2e-04, Method: Composition-based stats. Identities = 23/160 (14%), Positives = 47/160 (29%), Gaps = 24/160 (15%) Query: 245 PSSQAVMFCLMDVSGSMDQSTKDMAKRFYILLYLFLSRTYKNVEVVYIRH-HTQAKEVDE 303 + L+D SGS+ + D+ K F L ++ +++ T +E Sbjct: 747 RDVPLDIVFLLDGSGSVGSANFDLVKDFTRTLARNFDIAANMTQIGVVQYSDTVNREFGL 806 Query: 304 HEF----FYSQE-------TGGTIVSSALKLMDEVVKERYNPAQWNI-----YAAQASDG 347 +F GGT+ +A+ + + + + +DG Sbjct: 807 GDFHNRQDVLNAISAVSYQQGGTLTGAAIDFVRQ---TSFTTGDGDRPDVPNMLIVVTDG 863 Query: 348 DNWADDSPLCHEILAKKLLPVVRYYSYIEITRRAHQTLWR 387 + DS A+ + + TL Sbjct: 864 --VSGDSVQGPADAAR--REGITTFGVGIGNGIDFGTLLE 899 Score = 48.2 bits (113), Expect = 6e-04, Method: Composition-based stats. Identities = 29/211 (13%), Positives = 57/211 (27%), Gaps = 27/211 (12%) Query: 197 IISNSEPAQLLEEERLRKEIAELRAKIERVPFIDTFDLRYKNYEKRPDPSSQAVMFCLMD 256 I N +P L + + R ++ Y + + L+D Sbjct: 2079 FIDNCDPNPCLNGAQCFQTADSYRCTCAEGYEGTNCEI-YTALNAQTF-----DLVFLLD 2132 Query: 257 VSGSMDQSTKDMAKRFYILLYLFLSRTYKNVEVVYIRHHTQAKEVDEHEFFY-------- 308 SGS+ S+ D+ K F + + + V +++ +Q E Sbjct: 2133 GSGSVGASSFDLMKSFTNRITTNFDVSPTSTRVGVVQYSSQGSVATEFRLDSYSNKDDVI 2192 Query: 309 ------SQETGGTIVSSALKLMDE--VVKERYNPAQWNIYAAQASDGDNWADDSPLCHEI 360 + G T AL + + A +DG + D + ++ Sbjct: 2193 AAVNGIVYQNGNTYTGEALNYVRQNSFAVANGGRADVANILVVITDGQSVDDVTGPAQDL 2252 Query: 361 LAKKLLPVVRYYSYIEITRRAHQTLWREYEH 391 L V Y+ + Sbjct: 2253 L----REGVTVYALGIGDG-IQYSTLEAIAQ 2278 Score = 47.5 bits (111), Expect = 0.001, Method: Composition-based stats. Identities = 22/166 (13%), Positives = 46/166 (27%), Gaps = 20/166 (12%) Query: 237 KNYEKRPDPSSQAVMFCLMDVSGSMDQSTKDMAKRFYILLYLFLSRTYKNVEVVYIRHHT 296 + P + L+D SGS+ + ++ K F + L + + V +++ Sbjct: 1008 RAPVCANFPYGGLDLVFLLDGSGSVGTTNFELVKDFTSEVVLNFNISADTTNVGVVQYSD 1067 Query: 297 QAKE-----------VDEHEFFYSQ-ETGGTIVSSALKLMDEVVKERYNPAQWNI---YA 341 + TGGT+ A+ + + R N Sbjct: 1068 TVRNEFFLSSYDTKLPLIDAINQISYLTGGTLTGFAIDYVRQSSFSR-PAGARNTFPDVL 1126 Query: 342 AQASDGDNWADDSPLCHEILAKKLLPVVRYYSYIEITRRAHQTLWR 387 +DG A + ++ + TL + Sbjct: 1127 VVLTDGQ----SQDDVVSSAAAARSQGITIFAVGIGSEVDFTTLLQ 1168 Score = 41.7 bits (96), Expect = 0.060, Method: Composition-based stats. Identities = 25/157 (15%), Positives = 46/157 (29%), Gaps = 28/157 (17%) Query: 256 DVSGSMDQSTKDMAKRFYILLYLFLS-------------RTYKNVEVVYIRHHTQAKEVD 302 D SGS+ ++ K F + + N+E + T+A+ + Sbjct: 2705 DGSGSVGSDNFNLLKAFTQNIVGNFDIAVNNTRVGVVQYSDFNNIEFNLNAYATEAEVLA 2764 Query: 303 E-HEFFYSQETGGTIVSSALKLMDEVVKERYNPAQWNI-----YAAQASDGDNWADDSPL 356 Y + GGT +A+ + + V + A N +DG+ S Sbjct: 2765 AIGAISYQR--GGTFTGAAIDFVRQDV---FTTAGGNRADKPDILLVLTDGE----SSDS 2815 Query: 357 CHEILAKKLLPVVRYYSYIEITRRAHQTLWREYEHLQ 393 L + Y+ + TL Sbjct: 2816 VAGPAQNTLNAGITIYAVGIGSGVNADTLQEIAGDPG 2852 >UniRef50_C1YN95 von Willebrand factor type A-like protein n=1 Tax=Nocardiopsis dassonvillei subsp. dassonvillei DSM 43111 RepID=C1YN95_NOCDA Length = 525 Score = 57.9 bits (138), Expect = 7e-07, Method: Composition-based stats. Identities = 35/243 (14%), Positives = 63/243 (25%), Gaps = 31/243 (12%) Query: 199 SNSEPAQLLEEERLRKEIAELRAKIERVPFIDTFDLRYKNYEKRPDPSSQAVMFCLMDVS 258 + + L + L L P A +DVS Sbjct: 285 DRTWRRPVTAGAELSPPVPPLVELPFPASREVVDGLVADYSASLRRP---ARTVYALDVS 341 Query: 259 GSMDQSTKDMAKRFYILL-------YLFLSRTYKNVEVV--------------YIRHHTQ 297 GSM+ + L ++ ++ EVV ++ Sbjct: 342 GSMEGGRLAELQSALGALTGADGGSLARSTQAFQEREVVTLLPFSTWPADPRTFVVEPGS 401 Query: 298 AKEVDEH---EFFYSQETGGTIVSSALKLMDEVVKERYNPAQWNIY-AAQASDGD-NWAD 352 EV+ + G T AL E+++ + +DG+ N Sbjct: 402 VDEVNADLSAAVEGLEAEGDTAAYDALVRAYELLESDTGSDGDPLMSVVLMTDGEVNRGV 461 Query: 353 DSPLCHEILAKKLLPVVRYYSYIEITRRAHQTLWREYEHLQSTFDNFAMQHIRDQDDIYP 412 E LA + PV R + + + E L +D + ++ Sbjct: 462 GLEGFRESLAARSEPVARVPVFTVLFGESDVPEMTELAELTGG--RVFDAREQDLEQVFR 519 Query: 413 VFR 415 R Sbjct: 520 EIR 522 >UniRef50_Q7XTB9 OSJNBa0068L06.4 protein n=7 Tax=Oryza sativa RepID=Q7XTB9_ORYSJ Length = 724 Score = 57.9 bits (138), Expect = 7e-07, Method: Composition-based stats. Identities = 20/143 (13%), Positives = 37/143 (25%), Gaps = 2/143 (1%) Query: 151 HRAGYTANGVPANISVVRSLQNSLARRTAMTAGKRRELHALEENLAIISNSEPAQLLEEE 210 R T + + L A A + E + Sbjct: 217 KRTTTTDDHKRKSYDDDEPLLAPKAAAGAFNPIPEDDEDDATEFRGFFPAR--PRSGLAV 274 Query: 211 RLRKEIAELRAKIERVPFIDTFDLRYKNYEKRPDPSSQAVMFCLMDVSGSMDQSTKDMAK 270 L + A + A ++ ++ P + + ++DVS M M K Sbjct: 275 TLAPDAALVSAGRRHGKYVVAVRVKAPALRSSPSTRAPIDLVTVLDVSQGMMGDKLHMLK 334 Query: 271 RFYILLYLFLSRTYKNVEVVYIR 293 R L+ L + V + Sbjct: 335 RGMRLVIASLGPADRLAIVAFSG 357 >UniRef50_B3RWX4 Putative uncharacterized protein n=1 Tax=Trichoplax adhaerens RepID=B3RWX4_TRIAD Length = 1007 Score = 57.9 bits (138), Expect = 8e-07, Method: Composition-based stats. Identities = 24/162 (14%), Positives = 45/162 (27%), Gaps = 22/162 (13%) Query: 251 MFCLMDVSGSMDQSTKDMAKRFYILLYLFLSRTYKNVEVVY-------------IRHHTQ 297 + L+DVSGSM D+AK L V + + ++ Sbjct: 234 VVILLDVSGSMHGMPLDIAKISIQSLIRTFGENDFLNIVFFNKDINLSIPCFKDVVQTSE 293 Query: 298 AKEVD--EHEFFYSQETGGTIVSSALKLMDEVVKERYNPAQWNI---YAAQASDGDNWAD 352 + + + G S A ++++ + + SDG + Sbjct: 294 SHKYVFGRALAANILDGGIADFSKAYDYAFQMLQRSRSKQEQKRCHQLIMVFSDG---TE 350 Query: 353 DSPLCH-EILAKKLLPVVRYYSYIEITRRAHQTLWREYEHLQ 393 + P + V Y +T R W + Sbjct: 351 ERPKAVFDKYNADKQISVITYGIGTVTSRFEALRWMACYNKG 392 >UniRef50_B5EUF0 von Willebrand factor, type A n=44 Tax=Vibrionaceae RepID=B5EUF0_VIBFM Length = 321 Score = 57.9 bits (138), Expect = 8e-07, Method: Composition-based stats. Identities = 29/215 (13%), Positives = 51/215 (23%), Gaps = 43/215 (20%) Query: 240 EKRPDPSSQAVMFCLMDVSGSMDQSTKD-----------MAKRFYILLYLFLSRTYKNVE 288 + M ++D+SGSM + K+ + + Sbjct: 74 DPVDIQPEHRDMMLVVDLSGSMAEEDMKTSNGDFVDRLTAVKQVVSDFIDQ-RKGDRLGL 132 Query: 289 VVYIRHH----------TQAKEVDEHEFFYSQETGGTIVSSALKLMDEVVKERYNPAQWN 338 V++ H +E + T + L L + E P Sbjct: 133 VLFGDHAYLQTPLTFDRNTVREQLDRTVLRL-VGQMTAMGEGLGLATKTFIESNAPQ--- 188 Query: 339 IYAAQASDGDNW-ADDSPLCHEILAKKLLPVVRYYSYIEIT---------------RRAH 382 SDG N PL LAK + R Sbjct: 189 RTIILLSDGANTAGVLEPLEAAQLAKDNHAKIYTVGIGAGEMQVRGFFGKQTVNTARDLD 248 Query: 383 QTLWREYEHLQSTFDNFAMQHIRDQDDIYPVFREL 417 + + + F ++ + +IY L Sbjct: 249 EDTLTKIATMTGGQY-FRARNADELAEIYQTIDAL 282 >UniRef50_Q5NWS3 Tellurium resistance protein n=2 Tax=Proteobacteria RepID=Q5NWS3_AZOSE Length = 349 Score = 57.9 bits (138), Expect = 8e-07, Method: Composition-based stats. Identities = 25/148 (16%), Positives = 44/148 (29%), Gaps = 16/148 (10%) Query: 247 SQAVMFCLMDVSGSMDQSTKDMAKRFYILLYLFLSRTYKNVEVVY---IRHHTQAK---- 299 + +F L+DVS SM + L L +E V+ I + K Sbjct: 2 RRLPIFFLVDVSESMAGDNLRQLQEGLERLVRSLRADPYALETVFISVIAFAGKPKTLTP 61 Query: 300 --EVDEHEFFYSQETGGTIVSSALKLMDEVVKE---RYNP---AQWNIYAAQASDGDNWA 351 E+ + GT + SA+ + + ++ R P W +DG Sbjct: 62 LVELYQFYAPRLPLGSGTSLGSAMAHLMDEMERTVQRSTPEKKGDWRPVVYLLTDGKPT- 120 Query: 352 DDSPLCHEILAKKLLPVVRYYSYIEITR 379 DD + + + Sbjct: 121 DDIEPAIKRWKRDFEERSNLVAIGVGKH 148 >UniRef50_C3WEJ2 BatB protein n=1 Tax=Fusobacterium mortiferum ATCC 9817 RepID=C3WEJ2_FUSMR Length = 322 Score = 57.9 bits (138), Expect = 8e-07, Method: Composition-based stats. Identities = 30/225 (13%), Positives = 56/225 (24%), Gaps = 54/225 (24%) Query: 237 KNYEKRPDPSSQAVMFCLMDVSGSMDQST-----KDMAKRFYILLYLFLSRTYKN----- 286 K E ++ L+D S SM + KR L L + Sbjct: 68 KEIEDEEIEVKGMNIYVLIDTSRSMLTEDVYPNRLEAGKRVLTNLIQSLK-GDRVGFIPF 126 Query: 287 VEVVYIRHH-TQAKEVDEHEFF----YSQETGGTIVSSALKLMDEVVKERYNP-AQWNIY 340 + YI+ T + ++ GGT + AL+L ++ + N Sbjct: 127 SDSAYIQMPLTDDYNITQNYINAIDTTLISGGGTELYQALELA----EKSFKEIGSENKT 182 Query: 341 AAQASDGDNWADDSPLCHEILAKKLLPVVRYYSYIEITRR-------------------- 380 SDG ++ S K+ V Sbjct: 183 VIVISDGGDFDKKSL----DFVKENKIDVYSIGVGTKEGNVIPEYLNGVKRGFIKDESGS 238 Query: 381 -----AHQTLWREYEHLQSTFD----NFAMQHIRDQDDIYPVFRE 416 + ++ + + N +D + R+ Sbjct: 239 AVISKLNSDFLQKISNENNGKYYEVNNLVDTSKNFVNDTINLERK 283 >UniRef50_Q2BCF0 Possible D-amino acid dehydrogenase, large subunit n=1 Tax=Bacillus sp. NRRL B-14911 RepID=Q2BCF0_9BACI Length = 459 Score = 57.5 bits (137), Expect = 8e-07, Method: Composition-based stats. Identities = 48/358 (13%), Positives = 97/358 (27%), Gaps = 41/358 (11%) Query: 92 QGGGGGSGSGQGQASQDGEGQDEFVFQISKDEYLDLLFEDLALPNLKQNQQRQLTEYKTH 151 G G G+ D ++ + S DE + +D+ + + ++ E K Sbjct: 4 AGCSGEDEKASGEKKNDPPQEEAREQETSSDEKIPEAADDIE--GMVAQKHGKILEGKLE 61 Query: 152 RAGYTANGVPANISVVRSLQNSLARRTAMTAGKRRELHALEENLAIISNSEPAQLLEEER 211 A+ A + N + A + ++ L Sbjct: 62 PEVEIADLWDA---KKYTGFNEETLQPAAEKEMKAYFSEQKDLSGSQVYDYLVYQLGSGL 118 Query: 212 LRKEIAELRAKIERVPFIDTFDLRYKNYEKRPDPSSQAVMFCLMDVSGSM-----DQSTK 266 + EL + + +L E + + ++ + LMD SGSM + Sbjct: 119 YQSYYEELVSFE---HGHEMPELPDGEDEIQQAKNQKSNIVILMDASGSMKADVSGGNKM 175 Query: 267 DMAKRFYILLYLFLSRTYKNVEVVYIRHHTQAKEVDEHE--------------------- 305 +AK L + Y H + D+ E Sbjct: 176 MLAKETIKEFTSSLEDDASVSLMAY-GHVGTGNDEDKAESCSRIDEVFPLGAYEKTAFNK 234 Query: 306 -FFYSQETGGTIVSSALKLMDEVVKERYNPAQWNIYAAQASDGDNWADDSPLCHEILAK- 363 + +G T ++ A+ E++ YN + SDG D P+ + Sbjct: 235 SMDSFEASGWTPLAGAIDKARELL-SAYNSTDYKNTLYIVSDGVETCDGDPVEAAQQLQG 293 Query: 364 -KLLPVVRYYSYIEITRRAHQTLWREYEHLQSTFDNFAMQHIRDQDDIYPVFRELFHK 420 + V + Q +E +D + ++ + Sbjct: 294 SNIEAKVNIIGF--DVDDEGQKQLKEVAEAGGGTYATVRDKDELEDQVLKKWKPSLGQ 349 >UniRef50_B2HK18 Conserved membrane protein n=3 Tax=Mycobacterium RepID=B2HK18_MYCMM Length = 983 Score = 57.5 bits (137), Expect = 9e-07, Method: Composition-based stats. Identities = 20/183 (10%), Positives = 45/183 (24%), Gaps = 23/183 (12%) Query: 246 SSQAVMFCLMDVSGSMDQSTKDMAKRFYILLYLFLSRTYKNVEVVY-------IRHHTQA 298 + ++D S SM A+R + L+ + + + + Sbjct: 296 PRPRHLVLVLDRSRSMAGWKMTAARRAASRIVDALTSDDRFAVLTFDDGIEYPVGLPAGL 355 Query: 299 KEVDE-------HEFFYSQETGGTIVSSALKLMDEVVKERYNPAQWNIYAAQASDGDNWA 351 E + + G T + + L+ ++ + SDG Sbjct: 356 TEASDRHRYRAVEHLARVEARGDTEMLAPLRRALALLGREQVADTDDAVLILISDGQ--- 412 Query: 352 DDSPLCHEILAKKLLPVVRYYSYIEITRRAHQTLWREYEHLQSTFDNFAMQHIRDQDDIY 411 + L VR ++ + + R + D +D Sbjct: 413 VGNEDQLLQELSGDLGRVRLHTIG-VDEAVNAGFLRRLAGVGGGRCVLV-----DNEDRL 466 Query: 412 PVF 414 Sbjct: 467 DEA 469 >UniRef50_B0TL26 von Willebrand factor type A n=7 Tax=Gammaproteobacteria RepID=B0TL26_SHEHH Length = 345 Score = 57.5 bits (137), Expect = 9e-07, Method: Composition-based stats. Identities = 32/213 (15%), Positives = 65/213 (30%), Gaps = 40/213 (18%) Query: 240 EKRPDPSSQAVMFCLMDVSGSMDQSTKDM---AKRFYILLYLFL------SRTYKNVEVV 290 E PS + +D+SGSM + + L+ + + + ++ Sbjct: 75 EAIELPSKGRDLMLSVDLSGSMQIEDMVLDGKVVDRFSLIQHVISDFIERRKGDRIGLIL 134 Query: 291 YIRHH------TQAKEVDEHEFFYSQET--GG-TIVSSALKLMDEVVKERYNPAQW-NIY 340 + H TQ + +Q G T + A+ L +R++ + N Sbjct: 135 FADHAYLQSPLTQDRRTVAQYLKEAQIGLVGKQTAIGEAIALAV----KRFDKVEQSNRV 190 Query: 341 AAQASDG-DNWADDSPLCHEILAKKLLPVVRYYS-----------YIEITRRA----HQT 384 +DG +N SP +A K + + + ++ Sbjct: 191 LILLTDGSNNAGAISPEQATQIAAKRGITIYTIGVGADVMERRTLFGKERVNPSMDLDES 250 Query: 385 LWREYEHLQSTFDNFAMQHIRDQDDIYPVFREL 417 +E F ++ + + IY V L Sbjct: 251 QLQEIAKTTGGQY-FRARNTEELEQIYQVIDTL 282 >UniRef50_C8SB00 von Willebrand factor type A n=1 Tax=Ferroglobus placidus DSM 10642 RepID=C8SB00_FERPL Length = 469 Score = 57.5 bits (137), Expect = 9e-07, Method: Composition-based stats. Identities = 22/118 (18%), Positives = 40/118 (33%), Gaps = 14/118 (11%) Query: 241 KRPDPSSQAVMFCLMDVSGSMDQSTKDMAKR-FYILLYLFLSRTYKNVEVVYIRHHTQAK 299 KR + S+ + +D+SGSM + + AK + + + + + K Sbjct: 295 KRREKESKGPIVIALDLSGSMSGAKEQWAKAVSLATIDIAVKERRPWAIIAFDAGIKDVK 354 Query: 300 EVDEHE-------FFYSQETGGTIVSSALKLMDEVVK--ERYNPAQWNIYAAQASDGD 348 + +GGT LK ++V+ + A SDGD Sbjct: 355 VFRKQPKPEDVLGIMRIGASGGTNFEKPLKEAMKIVEDCREFTKAD----ILFISDGD 408 >UniRef50_C6X1I4 BatB n=2 Tax=Flavobacteriaceae RepID=C6X1I4_FLAB3 Length = 335 Score = 57.5 bits (137), Expect = 9e-07, Method: Composition-based stats. Identities = 25/156 (16%), Positives = 48/156 (30%), Gaps = 22/156 (14%) Query: 240 EKRPDPSSQAVMFCLMDVSGSMDQSTKDM--AKRFYILLYLFLSR--TYKNVEVVYIRHH 295 E+ + L+DVS SM+ + ++ L+ + + K +V+ Sbjct: 81 EEVETKQKMNNVIFLLDVSNSMNAQDVEQNRLQQAKNLIINAMGKMTNDKVGIIVFAGEA 140 Query: 296 TQAKEVDEHEFFYSQE--TG---------GTIVSSALKLMDEVVKERYNPAQWNIYAAQA 344 + + +F + G GT A++ + N A+ + Sbjct: 141 SSIMPLT-TDFTAVETYVGGVETSIVKMQGTDFLKAMQTA---ADKFRNVAKGSRKVVLL 196 Query: 345 SDG-DNWADDSPLCHEILAKKLLPVVRYYSYIEITR 379 SDG DN + LA + V Sbjct: 197 SDGEDNEG--NEKAAAKLANREGIRVISVGIGSEEG 230 >UniRef50_UPI000058940A PREDICTED: similar to inter-alpha (globulin) inhibitor H3 n=1 Tax=Strongylocentrotus purpuratus RepID=UPI000058940A Length = 964 Score = 57.5 bits (137), Expect = 1e-06, Method: Composition-based stats. Identities = 25/195 (12%), Positives = 57/195 (29%), Gaps = 23/195 (11%) Query: 248 QAVMFCLMDVSGSMDQSTKDMAKRFYILLYLFLSRTYK------NVEVVYIRHHTQAKEV 301 + + ++D+SGSM + K + +S T K + +V ++ Sbjct: 348 RKNIIFVIDISGSMSGTKLAQVKDALSTILDDMSETDKFNILPFSDDVHFLESTGMLYST 407 Query: 302 DEHE------FFYSQETGGTIVSSALKLMDEVVKERY--NPAQWNIY--AAQASDGD-NW 350 E+ QE T + A+ +++ +P + I +DG+ N Sbjct: 408 KENVRRAKRFVMGLQEMDNTNLHKAIISGVNMLRAESEQDPQEEEIVSMLIVLTDGNPNH 467 Query: 351 ADDSPLCHE-ILAKKLLPVVRYYSYIEITRRAHQTLWREYEHLQSTFDNFAMQHIRDQDD 409 + E + + + + A R I ++ D Sbjct: 468 GEIDKTIIERNVHEAINGDFSLFCIGF-GADADYPFLRRLSLQNHGVAR----RIPERAD 522 Query: 410 IYPVFRELFHKQNAT 424 +++ Sbjct: 523 AGEHLENFYYEVATP 537 >UniRef50_B3PKG6 von Willebrand factor type A domain protein n=1 Tax=Cellvibrio japonicus Ueda107 RepID=B3PKG6_CELJU Length = 660 Score = 57.5 bits (137), Expect = 1e-06, Method: Composition-based stats. Identities = 31/209 (14%), Positives = 64/209 (30%), Gaps = 26/209 (12%) Query: 238 NYEKRPDPSSQAVMFCLMDVSGSMDQSTKDMAKRFYILLYLFLSRTYKNVEVVYIRHHTQ 297 P A + ++DVSGSM ++ +R + L + L + Sbjct: 31 TSAPLPSKVLPADIRMIIDVSGSMKKTDPHNLRRPAVDLMVRLLPDGSKAGIWTFGQSVN 90 Query: 298 AKEV--------DEHEFFYSQETGG----TIVSSALKLMDEVVKERYNPAQWNIYAAQAS 345 + + T + +AL+ + V + + Sbjct: 91 LLVPYRLVDESWRQQAAKSASAINSVALHTHIGAALEKAAQDVVA--GDDGFRRNLVLLT 148 Query: 346 DG----DNWADDSPLCHEILAKKLLPVVRYYSYIEIT----RRAHQTLWREYEHLQSTFD 397 DG D A + + + +LLP ++ Y+ T + A Q L ++ Sbjct: 149 DGVVDIDPEAVVNIQERKRILTELLPQLKAAGYVVHTIALSQDADQELMKKLALTTDGVF 208 Query: 398 NFAMQHIRDQDDIYPVFRELFHKQNATAK 426 A + D++ F +F + + Sbjct: 209 AVA----QSADELMQAFLTIFDQAVPAER 233 >UniRef50_A9Z1V5 von Willebrand factor A domain-containing protein 5B1 n=8 Tax=Amniota RepID=VW5B1_MOUSE Length = 1215 Score = 57.5 bits (137), Expect = 1e-06, Method: Composition-based stats. Identities = 32/208 (15%), Positives = 58/208 (27%), Gaps = 27/208 (12%) Query: 232 FDLRYKNYEKRPDPSSQAVMFCLMDVSGSMDQSTKDMAKRFYILLYLFLSRTYKNVEV-- 289 DL+ R + L+D S SM ++ K ++ L + Sbjct: 338 PDLQSVQPNPRK---AHGEFIFLIDRSNSMSKTNIQCIKEAMLVALKSLMPACFFNIIGF 394 Query: 290 --VYIRHHTQAKEVDEHE-------FFYSQET-GGTIVSSALKLMDEVVKERYNPAQWNI 339 + ++ +E Q GGT + S LK + R Sbjct: 395 GSTFKAVFASSRIYNEENLTMACDCIQRMQADMGGTNMLSPLKWVLRQPLRR----GHPR 450 Query: 340 YAAQASDGDNWADDSPLCHEILAKKLLPVVRYYSYIEITRRAHQTLWREYEHLQSTFDNF 399 +DG + ++ L + R YS+ I L + + F Sbjct: 451 LLFLITDG---SVNNTGKVLELVRNHASSTRCYSFG-IGPTVCYRLVKGLASVSKGSAEF 506 Query: 400 AMQHIRDQDDIYPVFRELFHKQNATAKG 427 M + + + P + K A Sbjct: 507 LM----EGERLQPKMVKSLKKAMAPVLS 530 >UniRef50_C1F7S6 Putative uncharacterized protein n=1 Tax=Acidobacterium capsulatum ATCC 51196 RepID=C1F7S6_ACIC5 Length = 339 Score = 57.5 bits (137), Expect = 1e-06, Method: Composition-based stats. Identities = 23/173 (13%), Positives = 47/173 (27%), Gaps = 13/173 (7%) Query: 239 YEKRPDPSSQAVMFCLMDVSGSMDQSTKDMAKRFYILLYLFLSRTYKNVEV-------VY 291 Y + L+D S S+ + + + L L V + Sbjct: 105 YSFTQQTQLPLRLGILVDTSTSIRERFQFEQQAVTNFLLQVLRPKTDEAFVEGFDEAPNF 164 Query: 292 IRHHTQAKEVDEHEFFYSQETGGTIVSSALKLMD--EVVKERYNPAQWNIYAAQASDGDN 349 I + + + GGT + A+ +++ P SDGD+ Sbjct: 165 ILNWSNNLDTLSSAIQDLHPGGGTALYDAVYSACRDKLLNAASGPIYVRRAIILVSDGDD 224 Query: 350 WADDSPLCHEILAKKLLPVVRYYSY---IEITRRAHQTLWREYEHLQSTFDNF 399 + L + + + Y+ + T + R+ F Sbjct: 225 NQSHAYLT-DAIKECQRAQTAIYAVSTDTDPTPDPGDDILRKMAEETGGRAFF 276 >UniRef50_A0BYA6 Chromosome undetermined scaffold_136, whole genome shotgun sequence n=2 Tax=Paramecium tetraurelia RepID=A0BYA6_PARTE Length = 608 Score = 57.5 bits (137), Expect = 1e-06, Method: Composition-based stats. Identities = 27/177 (15%), Positives = 50/177 (28%), Gaps = 23/177 (12%) Query: 261 MDQSTKDMAKRFYILLYLFLSRTYKNVEVVYIRHHTQAKEVDEHEF-FYSQET------- 312 MD S AK+ IL L +T + + + E QE Sbjct: 1 MDGSRIQKAKQSLILFLKSLPQTSLFNIISFGTQYVSLWEESRQYTQDNLQEAIQHVKDM 60 Query: 313 ----GGTIVSSALKLMDEVVKERYN-PAQWNIYAAQASDGDNWADDSPLCHEILAKKLLP 367 GGT + + LK ++ Y + +DG++ D + + Sbjct: 61 QADMGGTNIYNPLKN--KIYNSSYGCSKDTTLNVFLLTDGED-NADPIIELVKNNNRAET 117 Query: 368 VVRYYSYIEITRRAHQTLWREYEHLQSTFDNFAMQHIRDQDDIYPVFRELFHKQNAT 424 + E L + + + + + D +DI +L Sbjct: 118 RIYTLGIGERCSFY---LIKRVAEVGNGKFH----IVGDNEDINEKVIDLLEDSLTP 167 >UniRef50_A4YIH6 Protoporphyrin IX magnesium-chelatase n=1 Tax=Metallosphaera sedula DSM 5348 RepID=A4YIH6_METS5 Length = 600 Score = 57.1 bits (136), Expect = 1e-06, Method: Composition-based stats. Identities = 36/185 (19%), Positives = 64/185 (34%), Gaps = 18/185 (9%) Query: 224 ERVPFIDTFDLRYKNYEKRPDPSSQAVMFCLMDVSGSMDQSTKDMAKRFYI--LLYLFLS 281 +R +++ DL K+ E + + L+D S SMD S + + + + LL Sbjct: 408 DRRNWLNPDDLVMKSLETQGAIP----ILLLLDSSRSMDFSRRILVAKAILRELLQKAYQ 463 Query: 282 RTYKNVEVVYIRHH-------TQAKEVDEHEFFYSQETGGTIVSSALKLMDEVV-KERYN 333 K V + T+ E + G T +S AL L ++V +ER + Sbjct: 464 VRSKVGLVTFSGSEARYDVPLTRNLRKVEEFVNGVRPAGKTPMSMALYLALQIVNRERRS 523 Query: 334 PAQWNIYAAQASDGD---NWADDSPLCHEILAKKLLPVVRYYSYIEITRRAHQTLWREYE 390 + N SDG + + E + KL V ++ I+ + Sbjct: 524 RRKLNPLVFLISDGKANVSLGGNILQELEYFSIKLGEVSKFI-VIDTGSPYQPSFNPTLA 582 Query: 391 HLQST 395 Sbjct: 583 ERAHG 587 >UniRef50_C3ZIK8 Putative uncharacterized protein n=1 Tax=Branchiostoma floridae RepID=C3ZIK8_BRAFL Length = 987 Score = 57.1 bits (136), Expect = 1e-06, Method: Composition-based stats. Identities = 31/202 (15%), Positives = 59/202 (29%), Gaps = 26/202 (12%) Query: 238 NYEKRPDPSSQAVMFCLMDVSGSMDQ--------STKDMA-KRFYILLYLFLSRTYKNVE 288 R + + + D+SGSMD + + + Y L+ + Sbjct: 291 EPTFRVVKARKPRFVLVFDISGSMDSIDDVSINTPRRSLLHQTVYKLVREGIPDGSHVGM 350 Query: 289 VVYIRHHTQAKEVDEHEFFYSQ----------ETGGTIVSSALKLMDEVVKER-YNPAQW 337 V + + T+ ++ E + +GGT + L EV+ +PA Sbjct: 351 VKFHQWATRLLDLTEIATEEDRQEIADAVPNEASGGTCIGCGLTEALEVLSMNGADPAGG 410 Query: 338 NIYAAQASDGDNWADDSPLCHEILAKKLLPVVRYYSYIEITRRAHQTLWREYE--HLQST 395 + SDGD + SP + V +S + T E Sbjct: 411 IVIIL--SDGDESLNVSPNLTVATQHLVAAGVTVHSVTYSSSA--DTRMEEVAASTHGRA 466 Query: 396 FDNFAMQHIRDQDDIYPVFREL 417 F + ++ + Sbjct: 467 FFYSGAANSNSLEEGLREAVRV 488 >UniRef50_B0MMX9 Putative uncharacterized protein n=1 Tax=Eubacterium siraeum DSM 15702 RepID=B0MMX9_9FIRM Length = 475 Score = 57.1 bits (136), Expect = 1e-06, Method: Composition-based stats. Identities = 39/294 (13%), Positives = 91/294 (30%), Gaps = 49/294 (16%) Query: 67 GRGGLRHRVHPGNDHFVQNDRIERPQGGGGGSGSGQGQASQDGEGQDEFVFQISKDEYLD 126 + G + N Q+ G +G+ + ++DG G D Q+S + Sbjct: 177 SKDGNSRNSNDMNSDNGQSGNDTDNSNGSNRNGNAENPGNEDGTGNDM---QMSAETASQ 233 Query: 127 LLFEDLALPNLKQNQQRQLTEYKTHRAGYTANGVPANISVVRSLQNSLARRTAMTAGKRR 186 L + + + Q ++ + + + + +V R + + + Sbjct: 234 L---EKDWKEISEKMQIEIEAFSKEKGDTRGSFIQNLNAVNREKYD-YTQFLKKFSVMGE 289 Query: 187 ELHALEENLAIISNSEPAQLLEEERLRKEIAELRAKIERVPFIDTFDLRYKNYEKRPDPS 246 + ++ I + +L E L + + YK ++ + Sbjct: 290 AMRVNDDEFDYIFYTYGLKLYERMPLIEPLE------------------YKEVKRIKE-- 329 Query: 247 SQAVMFCLMDVSGSMDQSTKDMAKRFYILLYLFLSRT---YKNVEVVYIRHHTQAKE--- 300 +D SGS+ ++ ++F Y L + + + I+ T +E Sbjct: 330 ----FVIAIDTSGSVSG---ELVQKFIQKTYNILKNEESFFTKINLHIIQCDTDIQEDRK 382 Query: 301 -VDEHEFF------YSQETGGTIVSSALKLMDEVVKERYNPAQWNIYAAQASDG 347 + EF GGT +D++++ + +DG Sbjct: 383 ITSQEEFDDYLAAMKLHGFGGTDFRPVFAYVDKLIQSK--EFTNLKGLIYLTDG 434 >UniRef50_Q7S708 Predicted protein n=1 Tax=Neurospora crassa RepID=Q7S708_NEUCR Length = 766 Score = 57.1 bits (136), Expect = 1e-06, Method: Composition-based stats. Identities = 25/236 (10%), Positives = 58/236 (24%), Gaps = 51/236 (21%) Query: 208 EEERLRKEIAELRAKIERVPFIDTFDLRYKNYEKRPDPS---SQAVMFCLMDVSGSMDQS 264 E + ++ + + ++ PDP+ + +DVSGSM Sbjct: 26 EIRPVAPKLEIHPLPSHTSGLLLRV-IPPRSPPNLPDPNFHHVPCDIVLAIDVSGSMSAD 84 Query: 265 T----------------------KDMAKRFYILLYLFLSRTYKNVEVVYIRHHTQAKE-- 300 D+ K + L+ + + V + T+AK Sbjct: 85 APVPTTASADYTNEQPEHNGLSVLDLVKHAARTIVSTLNSSDRLGIVTF---STEAKVLQ 141 Query: 301 --VDEHEFFYSQET---GG------TIVSSALKLMDEVVKERYNPAQWNIYAAQASDGDN 349 + + GG T + + ++ + +DG Sbjct: 142 PLMPMTALNKKKTERNLGGMQPFSATNLWGGIVEGLKLFD---GQSGRMPALMVLTDGMP 198 Query: 350 W---ADDSPLCHEILAKKLLPVVRYYSYIEITRRAHQTLWREYEHLQSTFDNFAMQ 402 + + L + + + L + + +F Sbjct: 199 NHMCPAQGYVAKLRAMETLPAAIHTFGFGY---SLRSGLLKSVAEIGGGGYSFIPD 251 >UniRef50_C3ZCZ5 Putative uncharacterized protein (Fragment) n=1 Tax=Branchiostoma floridae RepID=C3ZCZ5_BRAFL Length = 371 Score = 57.1 bits (136), Expect = 1e-06, Method: Composition-based stats. Identities = 24/170 (14%), Positives = 54/170 (31%), Gaps = 25/170 (14%) Query: 236 YKNYEKRPDPSSQAV-MFCLMDVSGSMDQSTKDMAKRFYILLYLFLSRTYKNVEVVYIRH 294 Y EK P + V + ++D SGS+ + + + + + + V +++ Sbjct: 206 YTLLEKVTPPCNNPVDIVFVLDGSGSVGRRNFEKVQAGVKKIVGDFNIALDSTRVGVVQY 265 Query: 295 HTQAK-EVDEHEFFYSQE-----------TGGTIVSSALKLMDEVVKERYNPAQWNI--- 339 + + E F Q GGT +A++ ++ + A Sbjct: 266 SSIVRQEFALDTFSNLQGLESGIQSIPYMAGGTRTGAAMEYA---IQNSFTSANGARPDV 322 Query: 340 --YAAQASDGDNWADDSPLCHEILAKKLLPVVRYYSYIEITRRAHQTLWR 387 +DG ++ D S + K + ++ L + Sbjct: 323 GHVIVLVTDGRSYDDVS----QASQKAKQAGIVVFAVGIGDGAVESQLNQ 368 Score = 47.1 bits (110), Expect = 0.001, Method: Composition-based stats. Identities = 23/154 (14%), Positives = 44/154 (28%), Gaps = 20/154 (12%) Query: 251 MFCLMDVSGSMDQSTKDMAKRFYILLYLFLSRTYKNVEVVYIRHHTQAK-EVDEHEFFYS 309 + ++D SGS+ + K F N V +++ + + E F Sbjct: 4 IIFMLDGSGSVGPDNFNKMKEFVKKTVGGYLIGPSNTRVAVMQYSSSVRQEFALDAFNTL 63 Query: 310 QE-----------TGGTIVSSALKLMDEVVKERYNPAQWNI--YAAQASDGDNWADDSPL 356 ++ GGT AL + N A+ N+ A +DG S Sbjct: 64 EDLLVGIEEIRYMRGGTRTGKALTRLRRQGFLESNGARKNVPHVAVIVTDGR----SSDS 119 Query: 357 CHEILAKKLLPVVRYYSYIEITRRAHQTLWREYE 390 + + + Y+ + Sbjct: 120 VDQAALETRQSGIVLYAVGV--GNYDLGQLTDIA 151 >UniRef50_A6DT53 BatB protein n=1 Tax=Lentisphaera araneosa HTCC2155 RepID=A6DT53_9BACT Length = 621 Score = 57.1 bits (136), Expect = 1e-06, Method: Composition-based stats. Identities = 29/162 (17%), Positives = 50/162 (30%), Gaps = 30/162 (18%) Query: 235 RYKNYEKRPDPSSQAVMFCLMDVSGSMDQST----------KDMAKRFYILLYLFLSRTY 284 + K + SS + L+D+S SM+ K AK+ + Sbjct: 79 QGKEIS-QEKESSSRSILFLVDISKSMNVRDMNEQSRLEYSKWWAKKLMNDI-----PGD 132 Query: 285 KNVEVVYIR----------HHTQAKEVDEHEFFYSQETGGTIVSSALKLMDEVVKERYNP 334 + + + R GGT +++AL + KE Sbjct: 133 RFGLITFSRIANIECPLTSEPDMVLLYLSDLNSSLLPGGGTNIAAALDHAQKQFKEN--- 189 Query: 335 AQWNIYAAQASDGDNWADDSPLCHEILAKKLLPVVRYYSYIE 376 + + SDG+ + E L KK +P V S + Sbjct: 190 ERDSRVVVLLSDGETDGNKWRESLEALQKKKIP-VNVISLGD 230 >UniRef50_UPI00006CCCA4 von Willebrand factor type A domain containing protein n=1 Tax=Tetrahymena thermophila RepID=UPI00006CCCA4 Length = 1082 Score = 57.1 bits (136), Expect = 1e-06, Method: Composition-based stats. Identities = 17/137 (12%), Positives = 46/137 (33%), Gaps = 13/137 (9%) Query: 250 VMFCLMDVSGSMDQSTKDMAKRFYILLYLFLSRTYKNVEVVYIRHHTQA---KEVDEHEF 306 L+D S S + + AK + L + V + ++V + Sbjct: 432 HYIILLDQSSSFQNAYQS-AKNGILQFTKNLQQNDVLTIVSFASSSRIIVQKQKVSQINL 490 Query: 307 FYSQET------GGTIVSSALKLMDEVVKERYNPAQWNIYAAQASDGDNWADDSPLCHEI 360 + GGT SA + + ++++ + N ++ +DG + + + Sbjct: 491 NSLEANLRNMLCGGTCFISAFESLKQIIQNQLNTNEY-PIILFVTDGQD--SSNLSNTIL 547 Query: 361 LAKKLLPVVRYYSYIEI 377 + + + +++ Sbjct: 548 DVRNQVEDLIFFTVGYG 564 >UniRef50_A2SQ24 von Willebrand factor, type A n=1 Tax=Methanocorpusculum labreanum Z RepID=A2SQ24_METLZ Length = 313 Score = 57.1 bits (136), Expect = 1e-06, Method: Composition-based stats. Identities = 33/159 (20%), Positives = 52/159 (32%), Gaps = 17/159 (10%) Query: 246 SSQAVMFCLMDVSGSMDQST-----KDMAKRFYILLYLFLSRTYKNVEVVYIRHHTQAKE 300 S + +DVS SM S + AK +L LS + V++ + A Sbjct: 84 SENVNLVVALDVSASMSASDYSPTRVEAAKGSSEILIRSLSESDTAGVVIFESGASSAAY 143 Query: 301 VDEHE---FFYSQE----TGGTIVSSALKLMDEVVKERYNPAQWNIYAAQASDGD-NWAD 352 + + ++ TG T + L L ++V PA I SDG N Sbjct: 144 LSSDKNRVVSRLEQVSVKTGKTALGDGLALAVDMVTA--IPAGTYIVVL-LSDGVSNSGM 200 Query: 353 DSPLCHEILAKKLLPVVRYYSYIEITR-RAHQTLWREYE 390 +P AK VV + ++Y Sbjct: 201 ITPQEAAEYAKNSGVVVYTIGVGSESPVEVSSDGVQQYA 239 >UniRef50_C0Z8R3 Hypothetical membrane protein n=1 Tax=Brevibacillus brevis NBRC 100599 RepID=C0Z8R3_BREBN Length = 424 Score = 57.1 bits (136), Expect = 1e-06, Method: Composition-based stats. Identities = 24/171 (14%), Positives = 47/171 (27%), Gaps = 25/171 (14%) Query: 243 PDPSSQAVMFCLMDVSGSMDQSTKDMAKRFYILLYLFLSRTYKNVEVVYIRHHT------ 296 S + ++D SGSM S D + + + R ++ + + H Sbjct: 108 QQASGANNIVMVLDTSGSMQSSDPD--NQLFKAAADMVQRMDSDMNIAVVTFHDQTNVLQ 165 Query: 297 -----QAKEVDEHEFFYS----QETGGTIVSSALKLMDEVVKERYNPAQWNIYAAQASDG 347 ++ V + + GGT + AL+ + ++ N SDG Sbjct: 166 PLTELSSQSVKDEVVKKLLQFPRTDGGTRIDLALQAGLDQLQAN---QMANSTVVLMSDG 222 Query: 348 DNWADDSPLCHEILAKKLLPVVRYYSYIEIT-RRAHQTLWREYEHLQSTFD 397 LA V ++ L ++ Sbjct: 223 ----YSDLDVPAALAPYKQNQVIVHTVGMSQIDADGTALLQKIAAETGGSY 269 >UniRef50_Q897H0 Membrane-associated protein n=1 Tax=Clostridium tetani RepID=Q897H0_CLOTE Length = 842 Score = 57.1 bits (136), Expect = 1e-06, Method: Composition-based stats. Identities = 27/173 (15%), Positives = 50/173 (28%), Gaps = 20/173 (11%) Query: 241 KRPDPSSQAVMFCLMDVSGSMDQS-----TKDMAKRFYILLYLFLSRTY--------KNV 287 K A + L+D SGSMD ++AK+ I L + Sbjct: 401 KNKRKQGDAGIVLLIDCSGSMDDESGGVKKIELAKQGAIETIKALESEDYIGILGFSDTI 460 Query: 288 EVVYIRHHTQAKEVDEHEFFYSQETGGTIVSSALKLMDEVVKERYNPAQWNIYAAQASDG 347 + V + KE E + GGT++ L + + + +DG Sbjct: 461 DWVVPFQKAENKEKLIKEVGKLKPKGGTLIIPGLIEGVKTLSSAKTKVKH---MILLTDG 517 Query: 348 DNWADDSPLCHEILAKKLLPVVRYYSYIEITRRAHQTLWREYEHLQSTFDNFA 400 + E + K + + E + + + F+ Sbjct: 518 QAEKNGFDKYLENMKKNNMT-LSTVGLGE---DSDREVLTHLSDFTGGRKYFS 566 >UniRef50_C6QKH4 von Willebrand factor type A n=1 Tax=Geobacillus sp. Y4.1MC1 RepID=C6QKH4_9BACI Length = 932 Score = 57.1 bits (136), Expect = 1e-06, Method: Composition-based stats. Identities = 17/131 (12%), Positives = 30/131 (22%), Gaps = 27/131 (20%) Query: 238 NYEKRPDPSSQAVMFCLMDVSGSMDQST------KDMAKRFYILLYLFLSRTYKNVEVVY 291 + P V DVSGSM + K + + + V + Sbjct: 71 RVDNIIRPPIDVVFVF--DVSGSMVMPSLKLDSAKYALQSAVDYFKANANPNDRFALVPF 128 Query: 292 I--RHHTQAKEVDEHEFF-------------YSQETGGTIVSSALKLMDEVVKERYNPAQ 336 + + + GGT + + +N Sbjct: 129 SDGVQSDKVVPFPSGTYDVKQHLNWIATVANSLRANGGTN----YTQALQQAQSFFNDPA 184 Query: 337 WNIYAAQASDG 347 Y +DG Sbjct: 185 RKKYIIFLTDG 195 >UniRef50_Q7UG35 Putative uncharacterized protein n=1 Tax=Rhodopirellula baltica RepID=Q7UG35_RHOBA Length = 486 Score = 57.1 bits (136), Expect = 1e-06, Method: Composition-based stats. Identities = 42/304 (13%), Positives = 77/304 (25%), Gaps = 43/304 (14%) Query: 79 NDHFVQNDRIERPQGGGGGSGSGQGQASQDGEGQDEFVFQISKDEYLDLLFEDLAL---- 134 + + G S +G E +DE + S +E E Sbjct: 168 DSKPGEGTSHPEETLGDDSSDAGDSGNDAVAEQRDETGDESSVEEDDGSSAESNEPTPDS 227 Query: 135 PNLKQNQQRQLTEYKTHRAGYTANGVPA--NISVVRSLQNSLARRTAMTA--------GK 184 P+L+Q + + G L + A A Sbjct: 228 PSLEQAADSSVGNDLESYVDPSQTGAENADQWDADDLLHEEIKSAVADAAENGGWGTLPG 287 Query: 185 RRELHALEENLAIISNSEPAQLLEEERLRKEIAELRAKIERVPFIDTFDLRYKNYEKRPD 244 + L + + + L + R + R RY + Sbjct: 288 HAQERLLATLRPPLDYRSILRQFRQSVLSVDRRLTRMRPSRRYGFAQMGSRYDFTTR--- 344 Query: 245 PSSQAVMFCLMDVSGSMDQSTKDMAKRFYILLYLFLSRTYKNVEVVYIRHHTQAK----- 299 + +DVSGSM + + ++ F ++++VV TQ + Sbjct: 345 ------LLFAVDVSGSMSHRD---LQNGFSIINRFFQYGIRSIDVV--WFDTQIRCEPLT 393 Query: 300 -EVDEHEFFYSQETGGTIVSSALKLMDEVVKERYNPAQWNIYAAQASDGDNWADDSPLCH 358 + GGT + E + E +DGD+ P Sbjct: 394 FRSARRDV-SITGRGGTDLGCVT----EFIDEHRGYDG----VVVFTDGDSPHPKRPANR 444 Query: 359 EILA 362 + Sbjct: 445 QTRI 448 >UniRef50_A7RFD8 Predicted protein (Fragment) n=1 Tax=Nematostella vectensis RepID=A7RFD8_NEMVE Length = 182 Score = 57.1 bits (136), Expect = 1e-06, Method: Composition-based stats. Identities = 23/152 (15%), Positives = 44/152 (28%), Gaps = 17/152 (11%) Query: 247 SQAVMFCLMDVSGSMDQSTKDMAKRFYILLYLFL---SRTYKNVEVVYIRHHTQAKEVDE 303 + + +D S S+ + + K F L + + +VY A + + Sbjct: 4 TNIDLVFAIDASSSVGKVNFERVKGFIRRLVESFHISRTSTRVAAIVYSSRPRVAFDFNR 63 Query: 304 --------HEFFYSQE-TGGTIVSSALKLMDEVVKERYNPAQWNIYAAQASDGDNWADDS 354 H + GGT AL+L + RY + +DG S Sbjct: 64 YTSARRAAHAVKRLRFLRGGTSTGRALRLASSRLFRRYGRKRR-KVLMLITDG----KSS 118 Query: 355 PLCHEILAKKLLPVVRYYSYIEITRRAHQTLW 386 + V+ ++ + L Sbjct: 119 DDVLKPSKALKRKGVQIFAVGVGMSVSRNELI 150 >UniRef50_B1MX57 Predicted metal-dependent peptidase n=3 Tax=Leuconostoc RepID=B1MX57_LEUCK Length = 389 Score = 57.1 bits (136), Expect = 1e-06, Method: Composition-based stats. Identities = 26/222 (11%), Positives = 61/222 (27%), Gaps = 26/222 (11%) Query: 134 LPNLKQNQQRQLTEYKTHRAGYTANGVPANISVVRSLQNSLAR-------RTAMTAGKRR 186 L L Q + + + GV ++ + ++ R R+ Sbjct: 149 LAELDDKLLPQYAGNYSGHEEWQSAGV--SLDEAEAALETVIRQAQKDAQRSGRGQLPGS 206 Query: 187 ELHALEENLAIISNSEPAQLLEEERLRKEIAELRAKIERVPFIDTFDLRYKNYEKRPDPS 246 ++E + LR ++ + + + Y+ + Sbjct: 207 IQQRIDEI-------MVLKRHWRAILRAGLSAIPNRRKTTRARFNRRQAYRLDLPGELST 259 Query: 247 SQAVMFCLMDVSGSMDQSTKDMAKRFYILLYLFLSRTYKNVEVVYIRHHTQAKEVDEHEF 306 + +D S S+ M + +++ + + T V + Sbjct: 260 YALQLIIFVDNSASISNHQVSM----MLAQVAQIAKQFD-ANIQIFSFDTSVHVVKRIKQ 314 Query: 307 FYSQETGGTIVSSALKLMDEVVKERYNPAQWNIYAAQASDGD 348 + GGT + + + Y P + I +DGD Sbjct: 315 WVRHAGGGTSFQCIFDM---LAAKHYEPMRTVIVIL--TDGD 351 >UniRef50_C0BF89 Putative uncharacterized protein n=1 Tax=Coprococcus comes ATCC 27758 RepID=C0BF89_9FIRM Length = 275 Score = 57.1 bits (136), Expect = 1e-06, Method: Composition-based stats. Identities = 18/167 (10%), Positives = 45/167 (26%), Gaps = 18/167 (10%) Query: 269 AKRFYILLYLFLSRTYKNVEVVYIRHHTQAKE------------VDEHEFFYSQETGGTI 316 K+ L+++ N E+ + + A E + +GGT Sbjct: 80 LKQAATNFTTQLAQSSPNSEIALVTFNKTATEQFDFKNVGKDSAYITETINAMETSGGTH 139 Query: 317 VSSALKLMDEVVKERYNPAQWNIYAAQASDGDNWADDSPLCHEILAKKLLPVVRYYSYIE 376 + L +++ N + Y +DG + K + + Sbjct: 140 QNEGLDRAYKILNNDQNTSNLKRYVVLLTDGCPNGVTYDQITTSINKIKSTNTKLITVGV 199 Query: 377 ITRRAHQTLWRE---YEHLQSTFDNFAMQHIRDQDDIYPVFRELFHK 420 + L + + + D + +F ++ + Sbjct: 200 GLDETNTGLKAAKDYLQANADDNMAY---NANDASHLNTIFTQILGQ 243 >UniRef50_Q6LJM7 Putative uncharacterized protein n=1 Tax=Photobacterium profundum RepID=Q6LJM7_PHOPR Length = 492 Score = 57.1 bits (136), Expect = 1e-06, Method: Composition-based stats. Identities = 28/185 (15%), Positives = 58/185 (31%), Gaps = 21/185 (11%) Query: 243 PDPSSQAVMFCLMDVSGSMDQSTKDMAKRFYILLYLFLSRTYKNVEVVYIRHHTQAKEVD 302 + M +D SGSM S + +AK + L + ++ ++ I + ++ Sbjct: 313 QKEDKKGPMVICIDTSGSMHGSPETIAKALSLYLTTQAKKEQRDCYLINISTSIEILDLS 372 Query: 303 EH-------EFFYSQETGGTIVSSALKLMDEVVKERYNPAQWNIYAAQASDGDNWADDSP 355 + F GGT V+ A++ ++K A N SD P Sbjct: 373 QGYSLSSLLTFLQKSFHGGTDVAPAMRHGINIMKN---DAYENADMLIISD--FVMSSLP 427 Query: 356 LCHEILAKKLLPVVRYYSYIEITRRAHQTLWREYEHLQSTFDNF------AMQHIRDQDD 409 L ++ + Y + + + + + I+ Q+D Sbjct: 428 NDCLELVEQQRIKGNRF-YSLCIGN--AFMTNRLKTHFDSEWVYNPSNSSITELIKFQND 484 Query: 410 IYPVF 414 + Sbjct: 485 VIDSA 489 >UniRef50_C3XQQ6 Putative uncharacterized protein n=1 Tax=Branchiostoma floridae RepID=C3XQQ6_BRAFL Length = 655 Score = 57.1 bits (136), Expect = 1e-06, Method: Composition-based stats. Identities = 29/200 (14%), Positives = 57/200 (28%), Gaps = 26/200 (13%) Query: 202 EPAQLLEEERLRKEIAELRAKIERVPFIDTFDLRYKNYEKRPDPSSQAVMFCLMDVSGSM 261 PA+ ++ R + R D FD R + ++ SS M L+D SGS+ Sbjct: 257 YPARPMQPHRFGSRTLTIADVPRRF---DEFDARMTQWYQQTI-SSPKDMLILLDTSGSV 312 Query: 262 DQSTKDMAKRFYILLYLFLSRTYKNV-------------EVVYIRHHTQAKEVDEHEFFY 308 + + + K L L+ +++ T KEV Sbjct: 313 EGRSLSLMKHTTWFLLDRLTEDDYVATGYFNAYAQAVSCLSSFVQATTHNKEVIHKSLDN 372 Query: 309 SQETGGTIVSSALKLMDEV-----VKERYNPAQW--NIYAAQASDGDNWADDSPLCHEIL 361 + + L+ ++ +++R+ N + +N + Sbjct: 373 LEAADQANYYAGLEYAFKIFNNFEMEDRFENQGAECNKVIVLVT--ENAELYPEAVFQKY 430 Query: 362 AKKLLPVVRYYSYIEITRRA 381 V E Sbjct: 431 NPDRNIRVFVIVVGEPIHDW 450 >UniRef50_B5ZN80 von Willebrand factor type A n=8 Tax=Rhizobiales RepID=B5ZN80_RHILW Length = 522 Score = 56.7 bits (135), Expect = 1e-06, Method: Composition-based stats. Identities = 30/230 (13%), Positives = 58/230 (25%), Gaps = 34/230 (14%) Query: 193 ENLAIISNSEPAQ----LLEEERLRKEIAELRAKIERVPFIDTFDLRYKNYEKRPDP--- 245 + LA + ++ Q L A+ P +R E Sbjct: 274 DLLAYLRSASAQQRIADTGRRIPLSGVAAKPEPGWNFDPARLVTAIRMPEPEVIRQALTL 333 Query: 246 -----SSQAVMFCLMDVSGSMDQSTKDMAKRFYILLY------LFLSRTYKNVEVVYIRH 294 ++ +D SGSM +D ++ L L + ++ I Sbjct: 334 YQAALRKPSLTALCLDFSGSMQGDGEDQLQKAMRFLLTPDEASKVLVQWSPADRIIVIPF 393 Query: 295 HTQAK------------EVDEHEFFYSQETGGTIVSSALKLMDEVVKERYNPAQWNIYAA 342 + E +E + GGT + + + + + + Sbjct: 394 DGSVRNTFMASGNPLEQEGLLNEISRQKAGGGTDMYTCAAQALQQIARSDRLSTYLPAIV 453 Query: 343 QASDGDNWADDSPLCHEILAKKLLPVVRYYSYIEITRRAHQTLWREYEHL 392 +DG +DD P V + A +T Sbjct: 454 IMTDGR--SDDQSQAFMSEWNATEPHVPV--FGITFGDADKTQLDSLAKQ 499 >UniRef50_A1HT91 Von Willebrand factor, type A n=1 Tax=Thermosinus carboxydivorans Nor1 RepID=A1HT91_9FIRM Length = 586 Score = 56.7 bits (135), Expect = 1e-06, Method: Composition-based stats. Identities = 38/268 (14%), Positives = 81/268 (30%), Gaps = 27/268 (10%) Query: 145 LTEYKTHRAGY-TANGVPANISVVRSLQNSLARRTAMTAGKRRELHALEENLAIISNSEP 203 LT Y Y + + R+++ R K R + Sbjct: 305 LTPYGREFKDYLDRHLPDLEAHLRRAIRLLKPRSPDQGRSKARMQAMHGQEQRHAKRWTV 364 Query: 204 AQLLEEERLRKEIAELRAKIERVPF----IDTFDLRYKNYEKRPDPSSQAVMFCLMDVSG 259 L + + + + + P I + D+ + S+ + ++D S Sbjct: 365 GGSLGQLAVAETVIAAAQRCAAGPGGPFTIGSQDIHHF----IRKKKSKTDICLIIDASA 420 Query: 260 SMDQSTKDMAKRFYILLYLFLSRTYKNVEVVYIRHHTQAKEVDEHEFFYSQET------- 312 SM + + +L LS + + +V+ + + + +F ++ + Sbjct: 421 SMSGQR--VGAAKLLAKHLLLSTSDRVAVIVFQENQARVQVPLTRDFAQAESSLAHIESF 478 Query: 313 GGTIVSSALKLMDEVVKERYNPAQWNIYAAQASDGDNWADDSPLCHEILAKKLLPVVRY- 371 G T ++ LK+ E +KE N +DG + + LA L Sbjct: 479 GSTPLALGLKVGIEYLKESRAK---NPLVILITDG--VPTVGDITGDPLADALTAAASIK 533 Query: 372 ---YSYIEITRRAHQTLWREYEHLQSTF 396 Y + I + H+ + Sbjct: 534 SHGYGFTCIGLKPHRDYLTQVAQAAGGN 561 >UniRef50_Q0W1N2 Putative uncharacterized protein n=1 Tax=uncultured methanogenic archaeon RC-I RepID=Q0W1N2_UNCMA Length = 477 Score = 56.7 bits (135), Expect = 1e-06, Method: Composition-based stats. Identities = 38/230 (16%), Positives = 66/230 (28%), Gaps = 15/230 (6%) Query: 171 QNSLARRTAMTAGKRRELHALEENLAIISNSEPAQLLEEERLRKEIAELRAKIERVPFID 230 ++S T + + + + L + Sbjct: 242 RSSEGLDTGSGKVLHSGRLEVHSIVTSSDLYYLLPSELIKLQDSILQYLFFARWIEGKLL 301 Query: 231 TFDLRYKNYEKRPDPSSQAVMFCLMDVSGSMDQSTKDMAKRFYILLYLFLSRTYKNVEVV 290 T+ L D + + L+D SGSMD +A+ + + + + VV Sbjct: 302 TYHLTDPGKSDTGDCKRKGPVIALVDTSGSMDGIPGILARAVTLATVRMFLQRGRKIRVV 361 Query: 291 YIRHHTQAKEVDEH--------EFFYSQETGGTIVSSALKLMDEVVKER-YNPAQWNIYA 341 Q E+D EF S GGT ++ALK +K R Y A Sbjct: 362 LFSSVGQLDEIDLPEGSTPGFLEFLRSSFGGGTDFNTALKAGLGALKARQYASAD----I 417 Query: 342 AQASDGDNWADDSPLCHEILAKKLLPVVRYYSYIEITRRAHQTLWREYEH 391 +DG + D L + K + ++ I + Sbjct: 418 MFVTDGMSRITDEALIEDWRRLKEASGSQIFTV--IVGNDQAGGLEDISD 465 >UniRef50_Q5TIE3-4 Isoform 4 of von Willebrand factor A domain-containing protein 5B1 n=2 Tax=Amniota RepID=Q5TIE3-4 Length = 1016 Score = 56.7 bits (135), Expect = 1e-06, Method: Composition-based stats. Identities = 27/195 (13%), Positives = 52/195 (26%), Gaps = 24/195 (12%) Query: 245 PSSQAVMFCLMDVSGSMDQSTKDMAKRFYILLYLFLSRTYKNVEVVY-----------IR 293 + L+D S SM + K ++ L + + Sbjct: 356 RKAHGEFIFLIDRSSSMSGISMHRVKDAMLVALKSLMPACLFNIIGFGSTFKSLFPSSQT 415 Query: 294 HHTQAKEVDEHEFFYSQET-GGTIVSSALKLMDEVVKERYNPAQWNIYAAQASDGDNWAD 352 + + + + + GGT + S LK + R +DG A Sbjct: 416 YSEDSLAMACDDIQRMKADMGGTNILSPLKWVIRQPVHR----GHPRLLFVITDG---AV 468 Query: 353 DSPLCHEILAKKLLPVVRYYSYIEITRRAHQTLWREYEHLQSTFDNFAMQHIRDQDDIYP 412 ++ L + R YS+ I L + + M + + + P Sbjct: 469 NNTGKVLELVRNHAFSTRCYSFG-IGPNVCHRLVKGLASVSEGSAELLM----EGERLQP 523 Query: 413 VFRELFHKQNATAKG 427 + K A Sbjct: 524 KMVKSLKKAMAPVLS 538 >UniRef50_A8M5H1 von Willebrand factor type A n=13 Tax=Actinomycetales RepID=A8M5H1_SALAI Length = 319 Score = 56.7 bits (135), Expect = 1e-06, Method: Composition-based stats. Identities = 27/221 (12%), Positives = 54/221 (24%), Gaps = 38/221 (17%) Query: 236 YKNYEKRPDPSSQAVMFCLMDVSGSM-----DQSTKDMAKRFYILLYLFLSRTYKNVEVV 290 + + P +A + +DVS SM + AK L + V Sbjct: 73 ARPTAEVRVPRERATVMVAVDVSTSMLAGDVEPDRLTAAKEAARRFVDGLPDEFNVGLVA 132 Query: 291 YIRH------HTQAKEVDEHEFFYSQETG----GTIVSSALKL---MDEVVKERYNPAQW 337 + +E + E GT + A+ + + Sbjct: 133 FAGSAAVLVPPDTDREALDEGIDRLVEGATGVQGTAIGEAINTSLGAVKALDGEAAKDPP 192 Query: 338 NIYAAQASDGDNWADDSPLCHEILAKKLLPVVRYYSYIEITRRAHQ-----------TLW 386 SDG N + P+ A + V ++ + + Sbjct: 193 PARIVLLSDGANTSGMDPMEAATDAVAMDVPVHTIAFGTASGYVDRGGRPIQVPVDGQTL 252 Query: 387 REYEHLQSTFDNFAMQHIRDQDDIYPVFRELFHKQNATAKG 427 E + + D R + + ++ G Sbjct: 253 DEVARETGGQFH--------EADSAKELRAV-YDDIGSSVG 284 >UniRef50_Q5TIE3 von Willebrand factor A domain-containing protein 5B1 n=19 Tax=Euteleostomi RepID=VW5B1_HUMAN Length = 1220 Score = 56.7 bits (135), Expect = 1e-06, Method: Composition-based stats. Identities = 27/195 (13%), Positives = 52/195 (26%), Gaps = 24/195 (12%) Query: 245 PSSQAVMFCLMDVSGSMDQSTKDMAKRFYILLYLFLSRTYKNVEVVY-----------IR 293 + L+D S SM + K ++ L + + Sbjct: 356 RKAHGEFIFLIDRSSSMSGISMHRVKDAMLVALKSLMPACLFNIIGFGSTFKSLFPSSQT 415 Query: 294 HHTQAKEVDEHEFFYSQET-GGTIVSSALKLMDEVVKERYNPAQWNIYAAQASDGDNWAD 352 + + + + + GGT + S LK + R +DG A Sbjct: 416 YSEDSLAMACDDIQRMKADMGGTNILSPLKWVIRQPVHR----GHPRLLFVITDG---AV 468 Query: 353 DSPLCHEILAKKLLPVVRYYSYIEITRRAHQTLWREYEHLQSTFDNFAMQHIRDQDDIYP 412 ++ L + R YS+ I L + + M + + + P Sbjct: 469 NNTGKVLELVRNHAFSTRCYSFG-IGPNVCHRLVKGLASVSEGSAELLM----EGERLQP 523 Query: 413 VFRELFHKQNATAKG 427 + K A Sbjct: 524 KMVKSLKKAMAPVLS 538 >UniRef50_B9ZQD1 von Willebrand factor type A n=1 Tax=Thioalkalivibrio sp. K90mix RepID=B9ZQD1_9GAMM Length = 615 Score = 56.7 bits (135), Expect = 1e-06, Method: Composition-based stats. Identities = 41/325 (12%), Positives = 93/325 (28%), Gaps = 31/325 (9%) Query: 68 RGGLRHRVHPGNDHFVQNDRIERPQGGGGGSGS---GQGQASQDGEGQDEF----VFQIS 120 GG D + GG +G G +A D G DE + Sbjct: 263 AGGDEAGGDEAGGDEAGGDEAGGDEAGGDEAGGDEAGGDEAGGDEAGGDEAWSTRPTEQM 322 Query: 121 KDEYLDLLFEDLALPNLKQNQQRQLTEYKTHRAGYTANGVPANISVVRSLQNSLARRTAM 180 D+ D + E + + + +++++ + G +G+ R Sbjct: 323 LDQEADGMEETFDMGDALKESIQEVSQKEEQEKGRYTDGLCPFTFCGEMPVQGSGDRLVQ 382 Query: 181 TAGKRRELHALEENLAIISNSEPAQLLEEERLRKEIAELRAKIERVPFIDTFDLRYKNYE 240 AG + S + + + + L + + + Sbjct: 383 QAGIASAALRTRLASQLQSVNRERRWASRKGSKLSSRHLSRVVTGDHRVFG--------K 434 Query: 241 KRPDPSSQAVMFCLMDVSGSMDQSTKDMAKRFYILLYLFLSRTYKNVEVVYIRHHTQAK- 299 ++ + + L+D SGSM + A L + + + + + Sbjct: 435 RQESGTPNTAVQILVDRSGSMAGDPIETAMTAA-LAIQLATDSLRGINTQVSAFPASSSG 493 Query: 300 -----------EVDEHEFFYSQETGGTIVSSALKLMDEVVKERYNPAQWNIYAAQASDGD 348 + + F TG T +S+A+ V+ + ++ +DG Sbjct: 494 GLVPITSFGENGRMKADNFGVGSTGATPMSNAI---LGVLPSMFARSESRKVMLVITDGA 550 Query: 349 NWADDSPLCHEILAKKLLPVVRYYS 373 +S + +A+ + + Sbjct: 551 PNDSESAMEAIRMARDVNVEMYAIG 575 >UniRef50_B3RZ89 Putative uncharacterized protein n=1 Tax=Trichoplax adhaerens RepID=B3RZ89_TRIAD Length = 933 Score = 56.7 bits (135), Expect = 1e-06, Method: Composition-based stats. Identities = 15/137 (10%), Positives = 41/137 (29%), Gaps = 19/137 (13%) Query: 245 PSSQAVMFCLMDVSGSMDQSTKDMAKRFYI-LLYLFLSRTYKNVEVVYIRHHT--QAKEV 301 + + ++DVSGSM + ++ L + + + + Sbjct: 309 RAKKPRTVLVLDVSGSMRGKPMEQLQQAATNFLLNVAQNGSFVGIITFSSAASIRSSLVQ 368 Query: 302 DEHEFFYSQ--------ETGGTIVSSALKLMDEVVK----ERYNPAQWNIYAAQASDG-D 348 + + +G T + + ++ +++K + I SDG + Sbjct: 369 INDDADRQRLILLLPSGASGSTSIGAGIQAGVKILKASVGNKSPSGGTLIVL---SDGRE 425 Query: 349 NWADDSPLCHEILAKKL 365 N + + + Sbjct: 426 NRSPTIADVKKQVLDNK 442 >UniRef50_Q0W729 Putative uncharacterized protein n=1 Tax=uncultured methanogenic archaeon RC-I RepID=Q0W729_UNCMA Length = 1310 Score = 56.7 bits (135), Expect = 1e-06, Method: Composition-based stats. Identities = 16/154 (10%), Positives = 37/154 (24%), Gaps = 18/154 (11%) Query: 269 AKRFYILLYLFLSRTYKNVEVVYIR-----------HHTQAKEVDEHEFFYSQETGGTIV 317 AK + + V + + K ++ +GGT + Sbjct: 909 AKTSAVSFVESRGDGDQVGVVSFYTSASLNSALKQMNSGTNKTTVKNAINSLSASGGTDI 968 Query: 318 SSALKLMDEVVKERYNPAQWNIYAAQASDGDNWADDSPLCHEILAKKLLPVVRYYSYIEI 377 SS +K + Y +DG ++ K ++ Sbjct: 969 SSGIKKAIAELDAHKRSTAKQ-YIIVLTDG--YSQYPEFDLIEADKAKAKGYTIFTIG-- 1023 Query: 378 TRRAHQTLWREYEHLQSTFDNFAMQHIRDQDDIY 411 A + ++ + + + Y Sbjct: 1024 MGMADEDTLKKIASK--PEYYYRVLSPEQLEAAY 1055 >UniRef50_Q14CN2 Calcium-activated chloride channel regulator 4, 30 kDa form n=30 Tax=Theria RepID=CLCA4_HUMAN Length = 919 Score = 56.7 bits (135), Expect = 1e-06, Method: Composition-based stats. Identities = 25/171 (14%), Positives = 54/171 (31%), Gaps = 20/171 (11%) Query: 245 PSSQAVMFCLMDVSGSMDQSTK--DMAKRFYILLYLFLSRTYKNVEVVYIRHHTQAKEVD 302 SQ ++ ++D SGSM + M + L + V + T ++ Sbjct: 301 KISQRIVCLVLDKSGSMGGKDRLNRMNQAAKHFLLQTVENGSWVGMVHFDSTATIVNKLI 360 Query: 303 E----HEFFYSQET------GGTIVSSALKLMDEVVKERYNPAQWNIYAAQASDGDNWAD 352 + E GGT + S +K +V+ E ++ + +DG++ Sbjct: 361 QIKSSDERNTLMAGLPTYPLGGTSICSGIKYAFQVIGELHSQLDGSEVLL-LTDGEDNTA 419 Query: 353 DSPLCHEILAKKLLPVVRYYSYIEITRRAHQTLWREYEHLQSTFDNFAMQH 403 S ++ +I + R A + + + + Sbjct: 420 SS------CIDEVKQSGAIVHFIALGRAADEAVIE-MSKITGGSHFYVSDE 463 >UniRef50_A4ACS0 Magnesium-chelatase, 60 kDa subunit n=2 Tax=unclassified Gammaproteobacteria RepID=A4ACS0_9GAMM Length = 615 Score = 56.7 bits (135), Expect = 2e-06, Method: Composition-based stats. Identities = 46/332 (13%), Positives = 88/332 (26%), Gaps = 26/332 (7%) Query: 37 AINKRSVTDVDSGESVSIPTEDISEPMFHQGRGGLRHRVHPGNDHFVQNDRIERPQGGGG 96 + SV+D + V + ++ + V D + + + Sbjct: 236 LAREESVSDEQLEQVVRLVLLPLATRLPGGDEQDAEDDVEESPDDSADDSQPPDNETPDA 295 Query: 97 GSGSGQGQASQDGEGQDEFVFQISKDEYLDLLFEDLALPNLKQNQQRQLTEYKTHRAGYT 156 A ++ E + + L + L L + L + Sbjct: 296 PDSPPPSDAQRESEDDKAPDAEQPNTDPDPLTDDRL-LEAAQAMLPPDLLAKLLSGSMGA 354 Query: 157 ANGVPANISVVRSLQNSLARRTAMTAGKRRELHALEENLAIISNSEPAQLLEEERLRKEI 216 + S R R G R L+ A + + P Q L Sbjct: 355 RRNAASGKSGARQRAKMRGRPMGSLPGDPRSGARLDVM-ATLRTAAPWQRLRG------- 406 Query: 217 AELRAKIERVPFIDTFDLRYKNYEKRPDPSSQAVMFCLMDVSGSMDQSTKDMAKRFYILL 276 + ++ R SQ+V ++D SGS AK LL Sbjct: 407 -----PATGGARLRVQAEDFRIVRFRQR--SQSVTVFVVDASGSSALYRLAEAKGAVELL 459 Query: 277 YLF-LSRTYKNVEVVY-------IRHHTQAKEVDEHEFFYSQETGGTIVSSALKLMDEVV 328 R + + + + T++ + GGT +++ + E++ Sbjct: 460 LADCYVRRDEVALIAFRGDSAELLLPPTRSLVRAKRSLSALPGGGGTPLAAGIDATAELL 519 Query: 329 KERYNPAQWNIYAAQASDGD-NWADDSPLCHE 359 + Y +DG N D E Sbjct: 520 EMLDRRGATPAYV-MLTDGRGNVCRDGSTGRE 550 >UniRef50_UPI0001C34E55 hypothetical protein ClM62_13922 n=1 Tax=Clostridium sp. M62/1 RepID=UPI0001C34E55 Length = 466 Score = 56.7 bits (135), Expect = 2e-06, Method: Composition-based stats. Identities = 24/183 (13%), Positives = 46/183 (25%), Gaps = 23/183 (12%) Query: 251 MFCLMDVSGSMDQSTKDMAKRFYILLYLFLSRTYKNVE-------VVYIRHHTQAK---- 299 + +D S M+ S + AK+ L R E V + A Sbjct: 40 IVFAIDRSAKMEGSALEAAKKGIKAFIETLERESAQPEGYAGEKRVGLVSFSDTATVNSM 99 Query: 300 -----EVDEHEFFYSQETGGTIVSSALKLMDEVVKERYNPAQWNIYAAQASDGDNWADDS 354 E G + + A++ +++ + + +DG Sbjct: 100 LSPVVEQAARAAEGLTAGGKSNQAEAIRAAVKLLDMKTPGEK---MLFLITDGQTPFRSQ 156 Query: 355 PLCHEILAKKLLPVVRYYSYIEITRRAHQTLWREYEHLQSTFDNFAMQHIRDQDDIYPVF 414 A++ V Y S + + IR+ + F Sbjct: 157 TDSAAAEARQ--AGVTVYCIGIAAPDGVNR--EALRSWASGPSDSHIIEIRELGEAQTAF 212 Query: 415 REL 417 L Sbjct: 213 ERL 215 >UniRef50_O76836 Putative uncharacterized protein n=4 Tax=Caenorhabditis RepID=O76836_CAEEL Length = 1028 Score = 56.7 bits (135), Expect = 2e-06, Method: Composition-based stats. Identities = 35/227 (15%), Positives = 56/227 (24%), Gaps = 19/227 (8%) Query: 186 RELHALEENLAIISNSEPAQLLEEERLRKEIAELRAKIERVPFIDTFDLRYKNYEKRPDP 245 + A +E ++ S +L + I LR + P Sbjct: 327 ADARAADEVISKESW---TELANSDPFATLICSFFKLPTTQMSSQLEYLRTERDAVIDTP 383 Query: 246 SSQAV-MFCLMDVSGSMDQSTKDMAKRFYILLYLFLSRTYKNVEVVYIRHHTQAKEVDE- 303 + + M D S S+ F L N V I + EV + Sbjct: 384 KCEVLDMIIAFDTSESLSSLIVPQYVDFAKKLVAQYKYGNDNTRVGIITFSSDVVEVRKL 443 Query: 304 ----------HEFFYSQETGG-TIVSSALKLMDEVVKERYNPAQWNIYAAQASDGDNWAD 352 TGG T V+ A + N + N +DG Sbjct: 444 TDGNTLDAVNAAIDTVHYTGGLTNVTKAQLTAKNLFDTESNANR-NKVLFILTDG--VPT 500 Query: 353 DSPLCHEILAKKLLPVVRYYSYIEITRRAHQTLWREYEHLQSTFDNF 399 E+ A L + S+ + E + F Sbjct: 501 VDTYTDEVAAGDKLKSISVISFFVGYSSYSDEVKTELGKVSEPKYIF 547 >UniRef50_Q2FMX9 Protoporphyrin IX magnesium-chelatase n=2 Tax=Methanomicrobia RepID=Q2FMX9_METHJ Length = 680 Score = 56.7 bits (135), Expect = 2e-06, Method: Composition-based stats. Identities = 23/166 (13%), Positives = 56/166 (33%), Gaps = 15/166 (9%) Query: 201 SEPAQLLEEERLRKEIAELRAKIERVPFIDTFDLRYKN---YEKRPDPSSQAVMFCLMDV 257 +E + L I + +++ D+ + ++ +K+ + ++D Sbjct: 439 AEKRPTGRDIALDATI-RAVSPYQKMRLSDSLAIVIRSDEVLQKKRIGKTATATLFVVDA 497 Query: 258 SGSMD-QSTKDMAKRF-YILLYLFLSRTYKNVEVVYIRHH-------TQAKEVDEHEFFY 308 SGSM + + AK + LL + V + T + ++ Sbjct: 498 SGSMGVEQRMEAAKGAIFSLLEDSYQNRDRVGLVAFRGEGADVVLPLTSSIDLAYQRLSE 557 Query: 309 SQETGGTIVSSALKLMDEVV-KERYNPAQWNIYAAQASDG-DNWAD 352 G T +++ L+ ++ +E+ +DG N + Sbjct: 558 LPTGGKTPLAAGLQKSLTILMREKQKYPSLLPLLVLITDGRANVGN 603 >UniRef50_B2UUD5 Phage/colicin/tellurite resistance cluster terY protein n=5 Tax=Helicobacter pylori RepID=B2UUD5_HELPS Length = 217 Score = 56.7 bits (135), Expect = 2e-06, Method: Composition-based stats. Identities = 22/133 (16%), Positives = 48/133 (36%), Gaps = 19/133 (14%) Query: 238 NYEKRPDPSSQAVMFCLMDVSGSMDQS-----TKDMAKRFYILLYLFLSRT------YKN 286 + K +F L+D SGSM++S + + L + K Sbjct: 4 DLSKYTMEERFIPVFLLLDTSGSMNESLGNCTRIEALNLCIQKMIETLKQEAKKELFSKM 63 Query: 287 VEVVYIRHHTQAKEVDEHE-----FFYSQETGGTIVSSALKLMDEVVKER--YNPAQWNI 339 + + + + F +GGT + A +L ++++++ + + + Sbjct: 64 AIITF-GENGAVLHTPFDDVKNINFKPLSASGGTPLDQAFRLAKDLIEDKDTFPTKFYKL 122 Query: 340 YAAQASDGDNWAD 352 Y+ SDG+ D Sbjct: 123 YSILVSDGEPNDD 135 >UniRef50_C8XJ05 Magnesium chelatase n=20 Tax=cellular organisms RepID=C8XJ05_NAKMY Length = 705 Score = 56.7 bits (135), Expect = 2e-06, Method: Composition-based stats. Identities = 23/120 (19%), Positives = 35/120 (29%), Gaps = 11/120 (9%) Query: 238 NYEKRPDPSSQAVMFCLMDVSGSMDQS-TKDMAKRF-YILLYLFLSRTYKNVEVVY---- 291 + + V+FC+ D SGSM + K LL R K V + Sbjct: 512 RLAVKQGREANLVLFCV-DASGSMAARARMEAVKAAVLSLLTDAYQRRDKVGLVTFRGGA 570 Query: 292 ---IRHHTQAKEVDEHEFFYSQETGGTIVSSALK-LMDEVVKERYNPAQWNIYAAQASDG 347 T + E G T ++ L + ER + +DG Sbjct: 571 ADLALPPTSSVEAAARRLEMLPAGGRTPLAEGLLCAAHTLRVERIRDPRRRPLLVVVTDG 630 >UniRef50_B0SI02 BatA n=2 Tax=Leptospira biflexa serovar Patoc RepID=B0SI02_LEPBA Length = 317 Score = 56.7 bits (135), Expect = 2e-06, Method: Composition-based stats. Identities = 29/210 (13%), Positives = 56/210 (26%), Gaps = 25/210 (11%) Query: 236 YKNYEKRPDPSSQAVMFCLMDVSGSM----DQSTKDMAKRFYILLYLFLSRT--YKNVEV 289 Y+ PD + + +D+SGSM D ++ LL F+ + + V Sbjct: 78 GSKYKLSPDSTKGVDIMIALDISGSMVNSYDFLPRNRLSVSKDLLREFVKKRLYDRIGIV 137 Query: 290 VYIRHH--TQAKEVDEHEFFYSQETG--------GTIVSSALKL-MDEVVKERYNPAQWN 338 V+ D GT V AL L + + Sbjct: 138 VFAGAAYLQSPLSSDRFALDELIAGTSSEDIEEQGTAVGDALVLSSYRLKNSEAK----S 193 Query: 339 IYAAQASDGD-NWADDSPLCHEILAKKLLPVVRYYSYIEITRRA--HQTLWREYEHLQST 395 +DG N P K + V + + + ++ + Sbjct: 194 KVIILLTDGVSNTGKLDPDTAAYTTKTMGIKVYCIGIGKEEGQYEINYESLQKISSNTNG 253 Query: 396 FDNFAMQHIRDQDDIYPVFRELFHKQNATA 425 F + + + +L + + Sbjct: 254 KF-FRAESPEVLESVLNEIDQLEVVELPSK 282 >UniRef50_B9SSC8 Protein binding protein, putative n=1 Tax=Ricinus communis RepID=B9SSC8_RICCO Length = 705 Score = 56.7 bits (135), Expect = 2e-06, Method: Composition-based stats. Identities = 26/167 (15%), Positives = 48/167 (28%), Gaps = 14/167 (8%) Query: 194 NLAIISNSEPAQLLEEERLRKEIAELRAKIERVPFIDTFDLRYKNYEKRPDPSSQAVMFC 253 + + E L E A + A + +R Y P + Sbjct: 256 FVNPTPPVLKPRRNVELSLLPESAVVTAGRTYQTHVVVLRIRAPPYTAARRPPID--LVM 313 Query: 254 LMDVSGSMDQSTKDMAKRFYILLYLFLSRTYKNVEVVYIRHH----------TQAKEVDE 303 ++DVS M + KR ++ L+ + V + + Sbjct: 314 VLDVSQRMCGVKLQVMKRIMRVVMSSLNSNDRLSIVAFSATSKRLSPLKRMTADGRRSAR 373 Query: 304 HEFFYSQETG-GTIVSSALKLMDEVVKERYNPAQWNIYAAQASDGDN 349 TG G + ALK +V+++R S+G + Sbjct: 374 RIIDALGSTGQGMSANDALKKAAKVIEDRRVKNPVASIII-ISNGQD 419 >UniRef50_D1NW11 Putative von Willebrand factor type A domain protein n=1 Tax=Bifidobacterium gallicum DSM 20093 RepID=D1NW11_9BIFI Length = 493 Score = 56.7 bits (135), Expect = 2e-06, Method: Composition-based stats. Identities = 25/120 (20%), Positives = 40/120 (33%), Gaps = 18/120 (15%) Query: 243 PDPSSQAVMFCLMDVSGSMDQSTKDMAKRFYILLYLF-LSRTYKNVEVVYIRHHTQAK-- 299 +++ + LMDVSGSM + +AK L L+ V + +R ++AK Sbjct: 67 AQQQTESDVVVLMDVSGSMTTTDMKVAKNAVNGLANQLLNDENDTVRMSIVRFSSEAKTL 126 Query: 300 ------------EVDEHEFFYSQETGGTIVSSALKLMDEVVKERYNPAQWNIYAAQASDG 347 + G T AL+ +V+ Y SDG Sbjct: 127 EFSNGSEWTHSPALVAQALNTLTSRGNTNWDGALQNASALVQGDSARKS---YVVLMSDG 183 >UniRef50_C1EAC0 Predicted protein n=1 Tax=Micromonas sp. RCC299 RepID=C1EAC0_9CHLO Length = 753 Score = 56.7 bits (135), Expect = 2e-06, Method: Composition-based stats. Identities = 23/227 (10%), Positives = 58/227 (25%), Gaps = 32/227 (14%) Query: 204 AQLLEEERLRKEIAELRAKIERVPFIDTFDLRYKNYEKRPDPSSQAVMFCLMDVSGSMDQ 263 + + +++ + R PFI + + + +F ++D SGSM Sbjct: 255 SPSITAAAVQQPSPKGPTADTRAPFIISISP----PDPKSCAPFARSVFFIVDRSGSMTG 310 Query: 264 STKDMAKRFYILLYLFLSRTYKNVEVVYIR----HHTQAKEVDEHEFF--------YSQE 311 A + + L + + Sbjct: 311 KPMAGANQALLAGLSSLGPQDTFNICAFDNLQEYLSEDMVPASPENINAAKGWIQGHCTA 370 Query: 312 TGGTIVSSALKLMDEVVKERYNPAQWNIYAAQASDGDNWADDSPLCHEILAKKLL----- 366 G T + S L+ ++ +R +DG A ++ ++++ Sbjct: 371 RGTTDILSPLRAAVAILSKRPLLGAV-PLIYVVTDG---AVENEREICRYMQEVMSAPPP 426 Query: 367 ------PVVRYYSYIEITRRAHQTLWREYEHLQSTFDNFAMQHIRDQ 407 P V + + + + + + + Q Sbjct: 427 EGLMTHPRVCTFGIGRYCNHYFLKMLSQIG-KGLSDAAYTDERVGSQ 472 >UniRef50_Q22NG1 von Willebrand factor type A domain containing protein n=1 Tax=Tetrahymena thermophila SB210 RepID=Q22NG1_TETTH Length = 821 Score = 56.7 bits (135), Expect = 2e-06, Method: Composition-based stats. Identities = 28/194 (14%), Positives = 56/194 (28%), Gaps = 26/194 (13%) Query: 242 RPDPSSQAVMFCLMDVSGSMDQSTKDMAKRFYILLYLFLSRTYKNVEVVYIRHHT----- 296 + S+A F L+D SGSM + ++ L FL R + I + Sbjct: 330 NAEKHSKAQFFFLIDRSGSMCT----IFQKARDTLIEFLQRLPDDSYFNVISFGSGYQFL 385 Query: 297 ------QAKEVDEHEFFYSQE----TGGTIVSSALKLMDEVVKERYNPAQWNIYAAQASD 346 + K+ + + GGT + L+ + + + + + +D Sbjct: 386 FEEAKKKNKQSMKSALEQISKFSADMGGTEIYQPLEK---IFQCKNVNDLYQMQIFLLTD 442 Query: 347 GDNWADDSPLCHEILAKKLLPVVRYYSYIEITRRAHQTLWREYEHLQSTFDNFAMQHIRD 406 G P L + R + + + L R + Sbjct: 443 GQ---VSQPDMVVQLIRNNSHKARVHCIGLGSG-VDKQLLRRCSESGRGANRQVDNASEL 498 Query: 407 QDDIYPVFRELFHK 420 ++ + V F Sbjct: 499 KEVVINVLENSFTP 512 >UniRef50_C5E9N8 von Willebrand factor type A n=4 Tax=Bifidobacterium longum RepID=C5E9N8_BIFLO Length = 401 Score = 56.3 bits (134), Expect = 2e-06, Method: Composition-based stats. Identities = 23/148 (15%), Positives = 46/148 (31%), Gaps = 21/148 (14%) Query: 252 FCLMDVSGSMDQSTKDMAKRFYILLYL-------FLSRTYKNVEVVYIRHHTQAKEVDE- 303 ++D SGSM K+ + ++ +V + I T+A + Sbjct: 224 IWVVDYSGSMSGEGKNGVVKGLNAALDPDQAKKSYIEPASGDVNI-LIPFETEAHRPVKA 282 Query: 304 ---------HEFFYSQETGGTIVSSALKLMDEVVKERYNPAQWNIYAAQASDGDNWADDS 354 HE + +GGT + L + + +Q+ +DG + D Sbjct: 283 TGTSTSDLLHEADATDASGGTDIYEGLLSALDELPSESEASQYTTAIVLMTDGRS-NSDH 341 Query: 355 PLCHEILAKKLLPVVRYYS--YIEITRR 380 E K + +S + + Sbjct: 342 QDEFESAYKSRGRDLPIFSIMFGDADPS 369 >UniRef50_C6M593 Putative uncharacterized protein n=1 Tax=Neisseria sicca ATCC 29256 RepID=C6M593_NEISI Length = 482 Score = 56.3 bits (134), Expect = 2e-06, Method: Composition-based stats. Identities = 29/191 (15%), Positives = 61/191 (31%), Gaps = 20/191 (10%) Query: 241 KRPDPSSQAVMFCLMDVSGSMDQSTKDMAKRFYILLYLFLSRTYKNVEVVYIRHHTQAKE 300 K + M +D SGSM+ +++AK + L ++ V+ + E Sbjct: 300 KSQEDEKPGPMILCVDTSGSMNGLPENIAKAMALFLGTKAKSENRSCFVINFSTGIETFE 359 Query: 301 VDEH-------EFFYSQETGGTIVSSALKLMDEVV-KERYNPAQWNIYAAQASDGD-NWA 351 + F GGT + AL+ +++ +E Y A SD N Sbjct: 360 LTSKTGISNLIAFLRQSFHGGTDAAPALRHALKMMEQESYQKAD----LLMISDFVMNGL 415 Query: 352 DDSPLCHEILAKKLLPVVRYYSYIEITRRAHQTLWREYEHLQSTFDNFAMQHIRDQDDIY 411 D L + ++ + + +++ ++ Sbjct: 416 PDDLLASIEIQRETGNQFNSLVIG-------DAFMSKRLKTHFDREWIYNPNVQTIQELV 468 Query: 412 PVFRELFHKQN 422 +++F+KQ Sbjct: 469 QFKKDVFNKQV 479 >UniRef50_A9B2Y1 VWA containing CoxE family protein n=4 Tax=Bacteria RepID=A9B2Y1_HERA2 Length = 460 Score = 56.3 bits (134), Expect = 2e-06, Method: Composition-based stats. Identities = 44/343 (12%), Positives = 106/343 (30%), Gaps = 35/343 (10%) Query: 85 NDRIERPQGGGGGSGSGQGQASQDGEGQDEFVFQISKDEYLDLLFEDLALPNLKQNQQRQ 144 + + G G G F +S++E ++ + L +K+ R+ Sbjct: 126 GFQPGALRQSQPGQGQASQPGGLQGGQGVGSGFNLSEEELRQVI-QGLEKDLIKRMALRE 184 Query: 145 LTEYKTHRAGYTAN------GVPANISVVRSLQNSLARRTAMTAGKRRELHALEENLAI- 197 + + A T + + + + + R + ++ L+ A+ Sbjct: 185 VLQDNRLAAQLTPSMAVVEQLLRDKSHLSGNALINAKRLIKQYVDELADVLRLQVMQAVS 244 Query: 198 --ISNSEPAQLLEEER-LRKEIAELRAKIERVPFIDTFDLRYKNYEKRPDPSSQAVMFCL 254 I S P + + L++ I D Y + + M + Sbjct: 245 AKIDRSVPPKRVFRNLDLKRTIWRNLTNWNSNEGRLYVDRLYYRQTAKKRTPMR--MIVV 302 Query: 255 MDVSGSMDQSTKDMAKRFYILLYLFLSRTYKNVEVVYIRHHTQAKEVDE------HEFFY 308 +D SGSM + M + + +V++ I T+ ++ Sbjct: 303 VDQSGSMVDA---MVQCTILASIF---AGLPHVDMHLIAFDTRMLDLTPWVHDPFEVLLR 356 Query: 309 SQETGGTIVSSALKLMDEVVKERYNPAQWNIYAAQASDGDNWADDSPLCHEILAKKLLPV 368 +Q GGT ++ AL E ++E P + + +D + + + + Sbjct: 357 TQLGGGTSINEALLFASEKIQE---PRKTAVVL--ITD-FYEGGSDQVLLDTIKAMIESG 410 Query: 369 VRYYSYIEITR----RAHQTLWREYEHLQSTFDNFAMQHIRDQ 407 V + +T + + + + + + + +Q Sbjct: 411 VHFIPVGAVTSSGYFSVNDWFRTKLKEMGRPIFAGSPRKLIEQ 453 >UniRef50_B3PJ55 von Willebrand factor type A domain protein n=1 Tax=Cellvibrio japonicus Ueda107 RepID=B3PJ55_CELJU Length = 318 Score = 56.3 bits (134), Expect = 2e-06, Method: Composition-based stats. Identities = 27/191 (14%), Positives = 60/191 (31%), Gaps = 25/191 (13%) Query: 244 DPSSQAVMFCLMDVSGSMDQSTK----------DMAKRFYILLYLFLSRTYKNVEVVY-- 291 P+S + +D+SGSM + K+ ++ + V++ Sbjct: 84 MPNSGRDLLLAVDISGSMREPDMVYNNRRITRLMAVKKVVGDFVAR-RQSDRLGLVLFGT 142 Query: 292 -------IRHHTQAKEVDEHEFFYSQETGGTIVSSALKLMDEVVKERYNPAQWNIYAAQA 344 + + + E T + A+ L + ++E+ N Sbjct: 143 QAFLQAPLTFDVKTVQEMLIEAESGYAGEATAIGDAIALSIKRLREQPNAK---RVIILL 199 Query: 345 SDGDNWADD-SPLCHEILAKKLLPVVRYYSYIEITRRAHQTLWREYEHLQSTFDNFAMQH 403 +DG+N A + LA K + ++ R ++ F ++ Sbjct: 200 TDGENTAGELGIATATDLAVKANTKIYTIAFSPYDREVDSHSMQQIAEQTGGEF-FRARN 258 Query: 404 IRDQDDIYPVF 414 RD ++I+ Sbjct: 259 TRDLEEIHRQL 269 >UniRef50_C3XUV0 Putative uncharacterized protein n=1 Tax=Branchiostoma floridae RepID=C3XUV0_BRAFL Length = 815 Score = 56.3 bits (134), Expect = 2e-06, Method: Composition-based stats. Identities = 47/359 (13%), Positives = 97/359 (27%), Gaps = 50/359 (13%) Query: 69 GGLRHRVHPGNDHFVQNDRIERPQGGGGGSGSGQGQASQDGEGQDEFVFQISKDEYLDLL 128 G D + + G G S++ G+ + L+ L Sbjct: 121 GENGDTDGDAEDTTGEAEDAAGETGDTDGQTKDTTGESKEAAGETPMDTPKT---LLEEL 177 Query: 129 FEDLALPNLKQNQQRQLTEYKTHRAGYTANGVPANISV--------VRSLQNSLARRT-- 178 P+LK+ + + K G + + + V +R+L+ S+ + Sbjct: 178 RRQPETPSLKETPKTMMQRLKKPM-GIPKSLMEKTLKVLMWTLKLLMRTLKTSMEKTLRV 236 Query: 179 ---AMTAGKRRELHALEENLAIISNSE-PAQLLEEERLRKEIAELRAKIERV-PFIDTFD 233 + + L E L + + L K + + + + Sbjct: 237 LMWTLKPLLEKTPETLMETLKPLMEKTLKTLIGTLTPLMKALKITNSVRGAGAHGLSVQN 296 Query: 234 LRYKNYEKRPDPSSQAVMFCLMDVSGSMDQSTKDMAKRFYILLYLF------------LS 281 R P +F L+D SGS+ + K+F + + L Sbjct: 297 PRGSGSSTCEAP---VDLFFLLDGSGSVKAANFAKVKQFAVDMVNSFDVSPAATRVGVLQ 353 Query: 282 RTYKNVEVVYIRHHTQAKEVDEHEFFYSQ-ETGGTIVSSALKLMDEVVKERYNPAQWNIY 340 + +N V + + K + GGT +AL+ + R Sbjct: 354 YSNRNTLV-FNLGNKVNKPTTVSAINSISYQGGGTRTGAALQYIRGNAAWRR--GNVPKV 410 Query: 341 AAQASDG---DNWADDSPLCHEILAKKLLPVVRYYSYIEITRRAHQTLWREYEHLQSTF 396 +DG D+ + S + Y + + + + Sbjct: 411 LIVLTDGKSEDSVSGPSQNLVSDRVEV---------YAIGVSNFDHEELLQIVNNKQSN 460 >UniRef50_Q4RTR8 Chromosome 2 SCAF14997, whole genome shotgun sequence n=3 Tax=Tetraodon nigroviridis RepID=Q4RTR8_TETNG Length = 1406 Score = 56.3 bits (134), Expect = 2e-06, Method: Composition-based stats. Identities = 29/199 (14%), Positives = 57/199 (28%), Gaps = 16/199 (8%) Query: 207 LEEERLRKEIAELRAKIERVPFIDTFDLRYKNYEKRPDPSSQAVMFCLMDVSGSMDQSTK 266 E +L +++ V D + P S V L+D S SM S Sbjct: 888 PEGFQLSVSLSKAHLPRMWVEKHPEKDSQVYFDIDAGSPPSNEV-VLLLDTSESMRDS-L 945 Query: 267 DMAKRFYILLYLFLSRTYKNVEVVYIR--HH--TQAKEVDEHEFFYSQETGGTIVSSALK 322 + + + L K +++ A ++ G T + L+ Sbjct: 946 HTLQEIALRVLKALHPDVKVNIILFGTGQFDPGGLAVTSPLSPQRFTPVGGSTELWRPLR 1005 Query: 323 LMDEVVKERYNPAQWNIYAAQASDGDNWADDSPLCHEILAKKLLPVVRYY--SYIEITRR 380 ++ P++ SDG +P L + + R + + Sbjct: 1006 -ALSLL----PPSRGLRNLLLLSDG---HVQNPELTLQLLRDGVQHSRLFPAASGLHGPT 1057 Query: 381 AHQTLWREYEHLQSTFDNF 399 A++ + R F Sbjct: 1058 ANRHMLRALAQAGGGAFEF 1076 >UniRef50_A9KHQ1 von Willebrand factor type A n=1 Tax=Clostridium phytofermentans ISDg RepID=A9KHQ1_CLOPH Length = 513 Score = 56.3 bits (134), Expect = 2e-06, Method: Composition-based stats. Identities = 19/122 (15%), Positives = 45/122 (36%), Gaps = 18/122 (14%) Query: 248 QAVMFCLMDVSGSMDQSTKDMAKRFY-ILLYLFLSRTY-----KNVEVVYIRHHTQAKEV 301 +V+ +D SGSM + +L L+ K ++ I ++ K V Sbjct: 340 PSVVVFCLDYSGSMFGEGNEQLVAAMEKILDHKLASEDMIQFSKMDKIFVIPFSSELKWV 399 Query: 302 DE-----------HEFFYSQETGGTIVSSALKLMDEVVKERYNPAQWNIYAAQASDGDNW 350 D ++ G T + + ++ E++K+ ++ + +DG++ Sbjct: 400 DSAISGIDTANLISRIKDTEAHGKTNIYAPVEHAIEILKD-FDADVYTKSIVLMTDGESA 458 Query: 351 AD 352 + Sbjct: 459 GN 460 >UniRef50_A9DAG7 Putative uncharacterized protein n=1 Tax=Hoeflea phototrophica DFL-43 RepID=A9DAG7_9RHIZ Length = 363 Score = 56.3 bits (134), Expect = 2e-06, Method: Composition-based stats. Identities = 23/162 (14%), Positives = 43/162 (26%), Gaps = 26/162 (16%) Query: 251 MFCLMDVSGSMD-----QSTKDMAKRFYILLYLFLSRTYKNVEVVYIRHHTQAKEVDEHE 305 M ++D SGSMD Q ++ K L + V + + ++ Sbjct: 163 MALVLDRSGSMDWNLNGQKKINVLKTAVGGLIEQFEEADPERKYVRLGASSYNSKLTGST 222 Query: 306 ------------FFYSQETGGTIVSSALKLMDEVVKERYNPAQWN--------IYAAQAS 345 +GGT + A V + + + + Sbjct: 223 KLRWNPGKTKEFVDALPASGGTDSTDAFDWAYTAVTHKRENNTHDAKSGQVPKKFIVFMT 282 Query: 346 DGDNWADDSPLCHEILAKKLLPV-VRYYSYIEITRRAHQTLW 386 DGDN + + L + Y+ + L Sbjct: 283 DGDNNYSSADSSTKHLCDDAKDDGIEVYTVAFAAPNRGKQLL 324 >UniRef50_Q0AZS1 Mg-chelatase subunit ChlD-like protein n=1 Tax=Syntrophomonas wolfei subsp. wolfei str. Goettingen RepID=Q0AZS1_SYNWW Length = 592 Score = 56.3 bits (134), Expect = 2e-06, Method: Composition-based stats. Identities = 45/309 (14%), Positives = 89/309 (28%), Gaps = 26/309 (8%) Query: 111 GQDEFVFQISKDEYLDLLFEDLALPNLKQNQQRQLTEYKTHRAGYTANGVPANISVVRSL 170 D + +E L +D L + + QL ++ + + N+ + S Sbjct: 276 WGDVEHYVEQLEELG--LIKDTILGKVMTRKGLQLKDFVINHKCELETEIRRNMRKMPSG 333 Query: 171 QNSLARRTAM--TAGKRRELHALEENLAIISNSEPAQLLEEERLRKEIAELRAKIERVPF 228 NS R+ + E + + + L E + + + + + Sbjct: 334 GNSRFRKLGQVDQKQTQVEFTNRNKTVNNPDKNWSGDLAVPETIVQAMKNSFLRNDPHFT 393 Query: 229 IDTFDLRYKNYEKRPDPSSQAVMFCLMDVSGSMDQSTKDMAKRFYILLYLFLSRTYKNVE 288 I DL Y + + L+D SGSM + A ++ L LS K Sbjct: 394 IKKEDLHYYD----KKSYVPIDVCLLIDASGSMAGDKRQAAC--FLAQNLLLSGKEKVAV 447 Query: 289 VVYIRHH-------TQAKEVDEHEFFYSQETGGTIVSSALKLMDEVVKERYNPAQWNIYA 341 V + T+ + + G T ++ + ++K N Sbjct: 448 VTFQERSSEVVVPFTRNQNILNKGLSTISPAGLTPMADGIMTAVNLIKNNRV---RNPLL 504 Query: 342 AQASDGDN----WADDSPLCHEILAKKLLPVVRYYSYIEITRRAHQTLWREYEHLQSTFD 397 SDG W D+ A + +I I +++ + Sbjct: 505 VLISDGIPNIPLWTLDAQADALEAATHIRE--NKIHFICIGLESNRFYLEKLSANAGGAL 562 Query: 398 NFAMQHIRD 406 +D Sbjct: 563 YLVDDLNKD 571 >UniRef50_B0WHU4 Sushi n=3 Tax=Culicini RepID=B0WHU4_CULQU Length = 2239 Score = 56.3 bits (134), Expect = 2e-06, Method: Composition-based stats. Identities = 30/201 (14%), Positives = 55/201 (27%), Gaps = 33/201 (16%) Query: 237 KNYEKRPDPSSQAVMFCLMDVSGSMDQSTKDMAKRFYILLYLFLSRTYKNVEVVYIRHHT 296 K+ EK + + + L+D S S+ + +F L + +Y V I + Sbjct: 123 KSVEKIKTKNKRVDIVFLIDASSSVGRQNFASEIKFVKKLLSDFNVSYNYTRVAVITFSS 182 Query: 297 QAK-----------------------EVDEHEFFYSQETGGTIVSSALKLMDEVVKERYN 333 Q K +V F GGT ALK +E+ K Sbjct: 183 QKKIFRHIDQISQSVEDNDKCLLLNYQVPRIAF----SGGGTYTYGALKEAEEIFKNARL 238 Query: 334 PAQWNIYAAQASDGDNWADDSPLCHEILAKKLLPVVRYYSYIEITRRAHQTLWREYEHLQ 393 ++ +DG + LA +L Y + + Sbjct: 239 DSK--KIIFLITDG----FSNGRDPIPLAGRLKKDNNVVIYSIGIQSGNYAELHAIASAP 292 Query: 394 STFDNFAMQHIRDQDDIYPVF 414 + + + + Sbjct: 293 EGDHCYLLDSFDHFETLARKA 313 >UniRef50_Q1Q250 Putative uncharacterized protein n=1 Tax=Candidatus Kuenenia stuttgartiensis RepID=Q1Q250_9BACT Length = 395 Score = 56.3 bits (134), Expect = 2e-06, Method: Composition-based stats. Identities = 29/226 (12%), Positives = 63/226 (27%), Gaps = 31/226 (13%) Query: 223 IERVPFIDTFDLRYKNYEKRPDPSSQAVMFCLMDVSGSMDQ----------STKDMAKRF 272 E + DL K+ + P ++ ++D S SM+ D+AK Sbjct: 34 PEPLESFVPQDLSAKSLSVKYAPKIES-FVIILDASASMETAYTGIVNKGHPKFDVAKDI 92 Query: 273 YILLYLFLSRTYKNVEVVYIRH---------------HTQAKEVDEHEFF-YSQETGGTI 316 + L N +V H ++++ E+ G + Sbjct: 93 ISRMNKTLPEVDVNSALVTFGHGFFTPLKKTFIVYELTHHSRDLLENALNMAIYPKGSSP 152 Query: 317 VSSALKLMDEVVKERYNPAQWNIYAAQASDGDNWADDSPLCHEILAKKLLPVVRYYSYIE 376 +A+ + N SDG+N + L + +Y+ Sbjct: 153 AGNAIADASNQL---LTSVGQNAVIF-VSDGENLIGTPLEKIKDLKELYGEKTCFYTIHV 208 Query: 377 ITRRAHQTLWREYEHLQSTFDNFAMQHIRDQDDIYPVFRELFHKQN 422 + R+ + + I ++ +++F Sbjct: 209 GNTPEGKEALRKLARSATCGFSVTADEIASSGNMANFVKKVFLTDI 254 >UniRef50_C1MHV2 Predicted protein n=1 Tax=Micromonas pusilla CCMP1545 RepID=C1MHV2_9CHLO Length = 802 Score = 56.3 bits (134), Expect = 2e-06, Method: Composition-based stats. Identities = 20/177 (11%), Positives = 48/177 (27%), Gaps = 25/177 (14%) Query: 251 MFCLMDVSGSMDQSTKDMAKRFYILLYLFLSRTYKNVEVVYI-------------RHHTQ 297 + ++D SGSM+ + A L+ + Sbjct: 303 VVFVIDRSGSMNGEPMEAANEALTTGLRSLTEHDYFNICAFDDGQEYFDANAMTQATPKN 362 Query: 298 AKEVDEHEFFYSQETGGTIVSSALKLMDEVVKERYNPAQWNIYAAQASDGDNWADDSPLC 357 + + T + + L +++ + +DG +D+ +C Sbjct: 363 VERAMAWMNEHCVARYTTDIYTPLSEALKLLAGCAGNGTV-PFVFLITDGA-VSDEKEIC 420 Query: 358 HEILAK-----KLLPVVRYYSYIEITRRAHQTLWREYEHLQST--FDNFAMQHIRDQ 407 ++A+ + LP V + + + ++ F I Q Sbjct: 421 KMLMAESQQKGEALPRVCTFGIGQYCNHY---FLKMLANIGRGLFDAAFTNDKIATQ 474 >UniRef50_C1D350 Putative magnesium chelatase, chlD subunit n=1 Tax=Deinococcus deserti VCD115 RepID=C1D350_DEIDV Length = 589 Score = 56.3 bits (134), Expect = 2e-06, Method: Composition-based stats. Identities = 20/119 (16%), Positives = 38/119 (31%), Gaps = 14/119 (11%) Query: 237 KNYEKRPDPSSQAVMFCLMDVSGSMDQSTKD-MAKRFYILLYLFLSRTYKNVEVVY-IRH 294 + + V+ + D SGSM + K + + +R + + + Sbjct: 412 FHTPVHEKRGGRRVL-FVADTSGSMGAQGRMGAVKGAMLAVLEQQARRDRVALITFRATG 470 Query: 295 HTQAKE------VDEHEFFYSQETGGTIVSSALKLMDEVVKERYNPAQWNIYAAQASDG 347 +A E + E + G T ++ AL L EV+ +DG Sbjct: 471 AVRALEWTADATLAEAAITAAPTGGRTPLAHALVLAREVLATEPGAE-----LVLFTDG 524 >UniRef50_Q8BVM2 Anthrax toxin receptor-like n=2 Tax=Mus musculus RepID=ANTRL_MOUSE Length = 641 Score = 56.3 bits (134), Expect = 2e-06, Method: Composition-based stats. Identities = 19/138 (13%), Positives = 44/138 (31%), Gaps = 10/138 (7%) Query: 251 MFCLMDVSGSMDQSTKDMAKRFYILLYLFLSRTYKNVEVVYIRH-------HTQAKEVDE 303 ++ ++D SGS+ + + L+ F + + + Y + +KE+++ Sbjct: 77 LYLVLDKSGSVADNWIHIYSFAEGLVKKFTNPNLRISIITYSTEAEVILPLTSDSKEINK 136 Query: 304 H--EFFYSQETGGTIVSSALKLMDEVVKERYNPAQW-NIYAAQASDGDNWADDSPLCHEI 360 G T + L+ +E +++ + N +DG E Sbjct: 137 SLLVLKSIVPQGLTHMQKGLRKANEQIRKSTLGGRIVNSVIIALTDGLLLLKPYLDTMEE 196 Query: 361 LAKKLLPVVRYYSYIEIT 378 K Y+ Sbjct: 197 AKKARRMGAIVYTVGVFM 214 >UniRef50_D2H1L1 Putative uncharacterized protein (Fragment) n=4 Tax=Laurasiatheria RepID=D2H1L1_AILME Length = 191 Score = 56.3 bits (134), Expect = 2e-06, Method: Composition-based stats. Identities = 20/145 (13%), Positives = 40/145 (27%), Gaps = 11/145 (7%) Query: 253 CLMDVSGSMDQSTKDMAKRFYILLYLFLSRTYKNVEVVYI--RHH----TQAKEVDEHEF 306 + SGS++ + D+ ++ F + + + Y H T K Sbjct: 4 FFISRSGSVNNNWMDIYNMVEDVVKKFDNPKVRISFITYSTDGHTLMKITSDKNEIRENL 63 Query: 307 FYSQ---ETGGTIVSSALKLMDEVV-KERYNPAQWNIYAAQASDGDNWADDSPLCHEILA 362 Q +G T + L+ +E + +E + I +DG Sbjct: 64 AKLQNVVPSGATHMQEGLRKANEQIEQENAGEKKAPIVILALTDGTLLPFPFEETKMEAE 123 Query: 363 KKLLPVVRYYSYIEITRRAHQTLWR 387 + Y + L Sbjct: 124 ESRRLGATVYCIG-VKDYRKDQLLD 147 >UniRef50_UPI0001C15B95 hypothetical protein CRC_00003 n=1 Tax=Cylindrospermopsis raciborskii CS-505 RepID=UPI0001C15B95 Length = 1499 Score = 56.3 bits (134), Expect = 2e-06, Method: Composition-based stats. Identities = 30/189 (15%), Positives = 54/189 (28%), Gaps = 41/189 (21%) Query: 233 DLRYKNYEKRPDPSSQAVMFCLMDVSGSMDQS---------TKDMAKRF--------YIL 275 D+ + + ++DVSGS KD + +I Sbjct: 834 DINGNGVREDLIQGDSPNIVFVIDVSGSTRGPFQGIPVGDVNKDGIQNTILDAEIAGFIA 893 Query: 276 LYLFLSRT-----YKNVEVVYIRHH----TQAKEVD---------EHEFFYSQETGGTIV 317 L L R K V + T E D E + + G T Sbjct: 894 LNNSLVRKGFGSRAKVSIVSFASDAKTLLTTNPETDSNKNGTKDVEEKLISLKSGGETNF 953 Query: 318 SSALKLMDEVVKERYNPAQWNIYAAQASDGD-NWADDSPLCHEILAKKLLPVVRYYSYIE 376 AL+ + +++ A SDG N + + ++ V+ ++ Sbjct: 954 EIALQEAAKTLRDIGTTAGNGNVIFM-SDGQPNQGNYTDEVLDLQ----KAGVKLSAFGV 1008 Query: 377 ITRRAHQTL 385 T + +L Sbjct: 1009 GTGASIDSL 1017 >UniRef50_B6XSC0 Putative uncharacterized protein n=3 Tax=Bifidobacterium RepID=B6XSC0_9BIFI Length = 1192 Score = 56.3 bits (134), Expect = 2e-06, Method: Composition-based stats. Identities = 27/122 (22%), Positives = 41/122 (33%), Gaps = 19/122 (15%) Query: 248 QAVMFCLMDVSGSMDQSTKDMAKRFYILLYLFL-------SRTYKNVEVVYIRHHTQAKE 300 A + +MD SGSM + AK L L + + V++ + T+A Sbjct: 490 PADIVLVMDKSGSMKGELDNNAKEAANALAKKLLTDKNSTLPSEQQVQMAVVTFSTKATI 549 Query: 301 VDEHEFFYSQ--------ETGGTIVSSALKLMDEVVKERYNPAQWNIYAAQASDGDNWAD 352 + GGT +ALK ++ R N + SDGD Sbjct: 550 EQNFTTDVLKINNAVEGDPDGGTNWEAALKQA-NILSGRSNVKKH---IIFLSDGDPTFR 605 Query: 353 DS 354 S Sbjct: 606 TS 607 >UniRef50_Q22UC0 Putative uncharacterized protein n=1 Tax=Tetrahymena thermophila SB210 RepID=Q22UC0_TETTH Length = 2382 Score = 56.3 bits (134), Expect = 2e-06, Method: Composition-based stats. Identities = 21/126 (16%), Positives = 40/126 (31%), Gaps = 11/126 (8%) Query: 247 SQAVMFCLMDVSGSMDQSTKDMAKRFYILLYLFLSRTYKNVEVVYIRHHTQAK------- 299 + + D SGSM+ ++ + SR + I + K Sbjct: 2233 NPVHFIIVFDESGSMEGDKWMSLRKELLDFIDNRSRYTAQDFITLIGFNDSIKLYTKLEK 2292 Query: 300 --EVDEHEFFYSQETGGTIVSSALKLMDEVVKERYNPAQW-NIYAAQASDGD-NWADDSP 355 E + + G T S+ L+ + +++ + N SDG+ N D Sbjct: 2293 LNEQIKQKVPEKNMNGNTNFSAPLQQVLKILSQDNCKTFNKNNVIFFLSDGEANKPDTDI 2352 Query: 356 LCHEIL 361 L + Sbjct: 2353 LQLQKQ 2358 >UniRef50_C6MF78 von Willebrand factor type A n=1 Tax=Nitrosomonas sp. AL212 RepID=C6MF78_9PROT Length = 333 Score = 55.9 bits (133), Expect = 2e-06, Method: Composition-based stats. Identities = 23/168 (13%), Positives = 42/168 (25%), Gaps = 24/168 (14%) Query: 247 SQAVMFCLMDVSGSMD--QSTKDM---AKRFYILLYLFLSRTYKNVEVVYIRHHTQA--- 298 S + ++D SGSM S K + + L L +V V I + A Sbjct: 63 SGLDLALVLDSSGSMGAVDSGKTLNQWLQEASTALVNALPAASTSVSV--IDFDSSAAIL 120 Query: 299 ---------KEVDEHEFFYSQETGGTIVSSALKLMDEVVKERYNPAQWNIYAAQASDGDN 349 +GGT + + + + + A SDG + Sbjct: 121 QGLTPLSSGSAAVISAINAIDASGGTNIGAGIDSAAAELTGANHTAGSTQMMVVVSDGFS 180 Query: 350 WADDSPLCHEILAKKLLPVVRYYSYIEITRRAHQTLWREYEHLQSTFD 397 D + L + + + + Sbjct: 181 SGDPASSALAALGAGV-DAIHTVGL----PGHDAFTMQNIATSGNGIY 223 >UniRef50_UPI00015B5332 PREDICTED: similar to ENSANGP00000020925 n=1 Tax=Nasonia vitripennis RepID=UPI00015B5332 Length = 2053 Score = 55.9 bits (133), Expect = 2e-06, Method: Composition-based stats. Identities = 31/296 (10%), Positives = 78/296 (26%), Gaps = 38/296 (12%) Query: 147 EYKTHRAGYTANGVPANISVVRSLQNSLARRTAMTAGKRRELHALEENLAIISNSEPAQL 206 E TH + N +++ + ++ + L R MT + L + ++ + Sbjct: 1003 EPDTHFNNISVNTSFSSVHIPTNVYDRLPR-VNMTISWSKRLDRI------FKHNYKSDP 1055 Query: 207 LEEERLRKEIAELRAKIERVPFIDTFDLRYKNYEK----------RPDPSSQAVMFCLMD 256 + + + + + + K + M L+D Sbjct: 1056 ALMWQYFCSTTGVLRQYPAMRWPVSLKKDGKEITDTYDCRVRSWFIEASTCSKDMVILVD 1115 Query: 257 VSGSMDQSTKDMAKRFYILLYLFLSRTYKNVEVVY---------------IRHHTQAKEV 301 SGSM + +AK + LS + ++ + Sbjct: 1116 NSGSMTGMSNAIAKTTVSTIMSTLSNNDFVAVFNFSDSTQQVVSCFQDKLVQATPENIRR 1175 Query: 302 DEHEFFYSQETGGTIVSSALKLMDEVVKERYNPAQ------WNIYAAQASDGDNWADDSP 355 + + G ++ A +++ N ++ N +DG Sbjct: 1176 INDDILTMKPEGVANITEAFLAAFTILENYRNESRCGSDLSCNQMIMLVTDGIASNITEV 1235 Query: 356 LCHEILAKKLLPVVRYYSYIEITRRAHQTLWREYEHLQSTFDNFAMQHIRDQDDIY 411 ++ VR ++Y+ + L + + + Sbjct: 1236 FQEYNWSENGTIPVRVFTYLLGQEVTKVREIQWMACLNRGYYTHIHTQAEVPEQVL 1291 >UniRef50_C7PJ56 von Willebrand factor type A n=1 Tax=Chitinophaga pinensis DSM 2588 RepID=C7PJ56_CHIPD Length = 226 Score = 55.9 bits (133), Expect = 2e-06, Method: Composition-based stats. Identities = 28/154 (18%), Positives = 53/154 (34%), Gaps = 22/154 (14%) Query: 229 IDTFDLRYKNYEKRPDPSSQAVMFCLMDVSGSMDQSTKDMAKRFYILLYLFLSRTYKNVE 288 D +++YK ++ + L+D SGSM K +A F L YK Sbjct: 50 HDKVEIQYKQ----EQAKNELYVCFLLDSSGSM-MKDKQIA--FIKGLVATTIARYKTRR 102 Query: 289 VVYIR---HHTQAKEVDE---------HEFFYSQETGGTIVSSALKLMDEVVKERYNPAQ 336 + Y H+ A+ + + G T + S L+ +++K Sbjct: 103 IKYAAVALHNGTAQILSAPTLHADELLQTIAALKTGGKTNMQSGFVLLHQLMKTNTQQKA 162 Query: 337 WNIYAAQASDGDNWADDSPLCHEILAKKLLPVVR 370 ++Y +DG A ++ E + + Sbjct: 163 -SLYIF--TDGKINAGNTATPFEEAVRYYKQYLS 193 >UniRef50_B9LQX7 von Willebrand factor type A n=1 Tax=Halorubrum lacusprofundi ATCC 49239 RepID=B9LQX7_HALLT Length = 491 Score = 55.9 bits (133), Expect = 3e-06, Method: Composition-based stats. Identities = 31/240 (12%), Positives = 55/240 (22%), Gaps = 39/240 (16%) Query: 191 LEENLAIISNSEPAQLLEEERLRKEIAELRAKIERVPFIDTFDLRYKNYEKRPDPSSQAV 250 + + + + E L I E I D + Sbjct: 176 ADGEGNALGDDSDGPISGEGELADAIDVTVWYDENCNNILDAD--------AEEAGDSVC 227 Query: 251 MFCLMDVSGSMDQSTKDMAKRFYILLYLFLSRTYKNVEVVYIRHH---------TQAKEV 301 + ++D SGSM S K L + + +V R + T + Sbjct: 228 VQLVIDTSGSMGGSRIANTKSGAKQLAETILDANPDNQVGVTRFNNGASTPQQLTDDLDD 287 Query: 302 DEHEFFYSQETGGTIVSSALKLMDEVVKERYNPAQWNIYAAQASDGD-NWADDSPLCHEI 360 E +GGT + + + E N DGD N + Sbjct: 288 VEAAIDGLSASGGTNAQAGVDAGQAEL-ENCPHD--NRVMVVFGDGDINTDGSAAKVA-- 342 Query: 361 LAKKLLPVVRYYSYIEITRRAHQTLWREYEHLQSTFDNFAMQHIRDQDDIYPVFRELFHK 420 ++ + D I D ++F + Sbjct: 343 -------GTEIFAIGVGGASFSD--LEDLAS--DPADEHVFFAIDD-----GAIEQIFGQ 386 >UniRef50_A0Z4J3 Von Willebrand factor, type A n=2 Tax=Bacteria RepID=A0Z4J3_9GAMM Length = 591 Score = 55.9 bits (133), Expect = 3e-06, Method: Composition-based stats. Identities = 29/194 (14%), Positives = 58/194 (29%), Gaps = 20/194 (10%) Query: 219 LRAKIERVPFIDTFDLRYKNYEKRPDPSSQAVMFCLMDVSGSMDQSTKDMAKRFYILLYL 278 + + I D R + ++ ++D SGS AK LL Sbjct: 381 ATSARQGQLHIKPSDFRTTRF----RHQRESTTLFVVDASGSAAMHRMAEAKGAVELLLA 436 Query: 279 F-LSRTYKNVEVVYIRH-------HTQAKEVDEHEFFYSQETGGTIVSSALKLMDEVVKE 330 SR + + + T++ + GGT ++ A++L ++ E Sbjct: 437 DCYSRRDSVALIAFRGNKAELLLPPTRSLVRAKKALAALPGGGGTPLAHAVEL-TTLLSE 495 Query: 331 RYNPAQWNIYAAQASDG-DNWADDSPLCHEILAKKLLPVVRYY------SYIEITRRAHQ 383 + A +DG N A D + ++ + + + + + Q Sbjct: 496 QILRDGATPTAVFLTDGVANIARDGTPGRQRAKEEAMTAAKQFKALKSRALVIDASPRAQ 555 Query: 384 TLWREYEHLQSTFD 397 RE Sbjct: 556 QKARELADALGGQY 569 >UniRef50_C4ZKE8 von Willebrand factor type A n=2 Tax=Thauera sp. MZ1T RepID=C4ZKE8_THASP Length = 840 Score = 55.9 bits (133), Expect = 3e-06, Method: Composition-based stats. Identities = 26/202 (12%), Positives = 56/202 (27%), Gaps = 23/202 (11%) Query: 240 EKRPDPSSQAVMFCLMDVSGSMDQSTKDMAKRFYILLYLFLSRTYKNVEVVY---IRHHT 296 + P + + L+D SGSM + A+R + L + + + H + Sbjct: 258 PRVPAAAHPLAVKILVDCSGSMQGDSIAAARRALQAIIAGLREGERFSLSRFGSTVEHRS 317 Query: 297 QAKEVDEHEFF--------YSQET-GGTIVSSALKLMDEVVKERYN-----PAQWNIYAA 342 +A Q GGT + +AL + + + Sbjct: 318 RALWRTSAATRQAGQRWAMQLQADLGGTEMENALASTLALAGDAEPSPGTEEGAAAVDLL 377 Query: 343 QASDGDNWADDSPLCHEILAKKLLPVVRYYSYIEITRRAHQTLWREYEHLQSTFDNFAMQ 402 +DG + K+ + + I + + R +F Sbjct: 378 LITDGQ------IHAIDRTVKRARALGNRIFVVGIGSAPAEGVLRRLADETGGACDFVAP 431 Query: 403 HIRDQDDIYPVFRELFHKQNAT 424 + + +F L ++ Sbjct: 432 GEAVEPAVLRMFARLRSQRMDA 453 >UniRef50_Q895E2 Conserved protein, putative metal-dependent peptidase n=2 Tax=Clostridium RepID=Q895E2_CLOTE Length = 468 Score = 55.9 bits (133), Expect = 3e-06, Method: Composition-based stats. Identities = 36/256 (14%), Positives = 76/256 (29%), Gaps = 46/256 (17%) Query: 118 QISKDEYLDLLFEDL--ALPNLK-QNQQRQLTEYKTHRAG---------YTANGVPANIS 165 +S + + E + A+ L + ++++ + N++ Sbjct: 163 DLSYERNAEYYAEKIKEAIDKLTSEKGKKKILNKELMINSQDIKIEECKIENAHDVWNLN 222 Query: 166 VVRSLQNSLA---RRTAMTAGKRRELHALEENL------AIISNSEPAQLLEEERLRKEI 216 L ++TA A K + +++ L A I +E + + + Sbjct: 223 KDNFDLEHLKELTKKTANNASKGKISSSIQRALKDLNKKAQIPWNEYLRKVIGTQPMGYK 282 Query: 217 AELRAKIERVPFIDTFDLRYKNYEKRPDPSSQAVMFCLMDVSGSMDQSTKDMAKRFYILL 276 + K R P + D+R K + + + +D+SGSM + Sbjct: 283 KTITRKDRRQP--NRLDIRGKLPDHKIK------LLIALDISGSMSDEDIQKVMVEIFDI 334 Query: 277 YLFLSRTYKNVEVVYIRHHTQAKEV----DEHEF-FYSQETGGTIVSSALKLMDEVVKER 331 + E+ I + V E + GGT S + + + Sbjct: 335 VKK-----HSAEITIIESDNAIRRVYKVKREGDIKKKLDTRGGTSFSPVFRYIYDNKLRD 389 Query: 332 YNPAQWNIYAAQASDG 347 Y +DG Sbjct: 390 Y-------ILIYFTDG 398 >UniRef50_A7SFM5 Predicted protein n=1 Tax=Nematostella vectensis RepID=A7SFM5_NEMVE Length = 417 Score = 55.9 bits (133), Expect = 3e-06, Method: Composition-based stats. Identities = 24/128 (18%), Positives = 38/128 (29%), Gaps = 19/128 (14%) Query: 249 AVMFCLMDVSGSMDQSTKDMAKRFYILLYLFLSRTYKNVEVVYIRHHTQA--------KE 300 A L+D SGSM K IL L + + ++ +E Sbjct: 291 AEFIFLVDRSGSMSGKHIFQVKEMLILFLKSLPANCYFNLIGFGSYYRSVYQETQIYDEE 350 Query: 301 VDEHEFFYSQET----GGTIVSSALKLMDEVVKERYNPAQWNIYAAQASDGDNWADDSPL 356 EH Y Q+ GGT L L E + + + +DG + Sbjct: 351 TAEHACNYVQKMRADLGGTN----LILPLEFIFNQPPKKGIPRFVFMLTDG---GVSNTT 403 Query: 357 CHEILAKK 364 ++ Sbjct: 404 EVIDFVRR 411 >UniRef50_C5VFZ9 von Willebrand factor type A n=2 Tax=Corynebacterium matruchotii RepID=C5VFZ9_9CORY Length = 236 Score = 55.9 bits (133), Expect = 3e-06, Method: Composition-based stats. Identities = 29/204 (14%), Positives = 59/204 (28%), Gaps = 37/204 (18%) Query: 249 AVMFCLMDVSGSMDQSTKDMAKRFYILLYLFLSRT-----------YKNVEVVYIRHHTQ 297 +F L+DVS SM K + L + + + I + Sbjct: 9 LPVFFLIDVSYSM-LEEKPGG-GTLLDAANQLVPGIVEACEKYSVLDQRLRLGLIEFCDE 66 Query: 298 AKEVD--------EHEFFYSQETGGTIVSSAL-----KLMDEVVKERYNPAQWNI-YAAQ 343 A+ V GGT ++A ++ V R + Sbjct: 67 ARVVIPLSEIDAFSENIPQLVAKGGTNFAAAFWAVFNEMGVAVESLRKPEIGIHRPTVFF 126 Query: 344 ASDGDNWADDSPLCH--EILAKKLLPVV-RYYSYIEITRRAHQTLWREYEHLQSTFDNFA 400 +DG++ D L+ + ++++ A+ R + L S F Sbjct: 127 ITDGEDIGDVEERARAWAALSDEGFRYRPNFFTFGV--GNANLEGIRAF-KLGSGFAA-- 181 Query: 401 MQHIRDQDDIYPVFRELFHKQNAT 424 +D +E+ + ++ Sbjct: 182 --ATKDPTRAVQRLQEILNTLVSS 203 >UniRef50_A6M139 von Willebrand factor, type A n=1 Tax=Clostridium beijerinckii NCIMB 8052 RepID=A6M139_CLOB8 Length = 962 Score = 55.9 bits (133), Expect = 3e-06, Method: Composition-based stats. Identities = 28/200 (14%), Positives = 54/200 (27%), Gaps = 22/200 (11%) Query: 175 ARRTAMTAGKRRELHALEENLAIISNSEPAQLLEEERLRKEIAELRAK-IERVPFIDTFD 233 R+ L + ++ A + + + I K + I Sbjct: 7 KRKFLKKVSLIVSLMLILVSINSSVFRVKADIASKPQFTVTIDSYTPKNPKLGEEITING 66 Query: 234 LRYKNYEKRPDPSSQAVMFCLMDVSGSM-DQSTKDMAKRFYILLYLFLS--RTYKNVEVV 290 + K P + + ++D SGSM D K+ +S + K V Sbjct: 67 TIHPQPFKISIPPKE--IVLVLDSSGSMADNYKLTNLKKAATDFITKMSTVKNLKIAIVD 124 Query: 291 YIRHHTQAKEVDE-----------HEFFYSQETGGTIVSSALKLMDEVVKERYNPA-QWN 338 + T ++ + GGT L+ ++ + Sbjct: 125 FDTQATIINKLTDVSSSTNVTALKRSINNLTAGGGTNTGEGLRQAAYLLSNSSEQNPLAS 184 Query: 339 IYAAQASDGD----NWADDS 354 SDG+ NW + Sbjct: 185 KNIIFMSDGEPTYYNWQTAN 204 >UniRef50_UPI0000F1FEC5 PREDICTED: similar to Clca1 protein n=2 Tax=Danio rerio RepID=UPI0000F1FEC5 Length = 903 Score = 55.9 bits (133), Expect = 3e-06, Method: Composition-based stats. Identities = 27/168 (16%), Positives = 53/168 (31%), Gaps = 20/168 (11%) Query: 246 SSQAVMFCLM-DVSGSMDQSTK--DMAKRFYILLYLFLSRTYKNVEVVYIRHHTQAKEVD 302 + CL+ DVSGSM ++ M + LL ++ V + + + Sbjct: 294 QRKKRAVCLILDVSGSMATESRILRMRQAATHLLRNYVEEQASVGIVKFSTAASIVSSLT 353 Query: 303 ----EHEFFYS------QETGGTIVSSALKLMDEVVKERYNPAQWNIYAAQASDGDNWAD 352 + + G T + + L+L +V+ E A + +DG D Sbjct: 354 IIESDATRDHLINLLPETPGGSTNMCNGLRLGLQVLSEDDMDAIGDEIIF-LTDGQ-ATD 411 Query: 353 DSPLCHEILAKKLLPVVRYYSYIEITRRAHQTLWREYEHLQSTFDNFA 400 D LC ++ + + A +E ++ Sbjct: 412 DVTLCIPDAINS-GAIIHTIALSDSAHNA----LQEMADKTGGIFFYS 454 >UniRef50_D0LKC7 von Willebrand factor type A n=1 Tax=Haliangium ochraceum DSM 14365 RepID=D0LKC7_HALO1 Length = 346 Score = 55.9 bits (133), Expect = 3e-06, Method: Composition-based stats. Identities = 28/233 (12%), Positives = 54/233 (23%), Gaps = 51/233 (21%) Query: 241 KRPDPSSQAVMFCLMDVSGSM----------DQSTKDMAKRFYILLYLFL-----SRTYK 285 + + ++D SGSM DQ+ ++ K + Sbjct: 83 ENTIRREGIAIMMVVDTSGSMRALDLADGGLDQTRLEVVKDVFRAFVAGEDGLDGRSNDT 142 Query: 286 NVEVVYIRHHTQAKEVDEH-----------EFFYSQETGGTIVSSALKLMDEVVKERYNP 334 V + + + E + GT + L L E ++E Sbjct: 143 IGLVSFAGFADTRCPLTLNHGSLLTILDDLEIVRERAEDGTAIGDGLGLAVERLRES--- 199 Query: 335 AQWNIYAAQASDG-DNWADDSPLCHEILAKKLLPVVRYYSYIEITRRA------------ 381 + +DG +N ++PL LA +L V Sbjct: 200 EASSRVIILLTDGVNNAGIETPLEAAELASRLGIKVYTIGAGTDGVAPVRVTNPLTGAEE 259 Query: 382 --------HQTLWREYEHLQSTFDNFAMQHIRDQDDIYPVFRELFHKQNATAK 426 + F +Y L + + + Sbjct: 260 LRPMPVEIDEATLEAIAEHTGGRY-FRATDGDGLRQVYEQIDRLERTEISERR 311 >UniRef50_Q66HV5 Zgc:92481 n=2 Tax=Danio rerio RepID=Q66HV5_DANRE Length = 804 Score = 55.9 bits (133), Expect = 3e-06, Method: Composition-based stats. Identities = 38/247 (15%), Positives = 66/247 (26%), Gaps = 33/247 (13%) Query: 190 ALEENLAIISNSEPAQLLEEERLRKEIAELRAKIERVPFIDTFDLRYKNYEKRPDPSSQA 249 +E L + +P+ +LE L A + + +S Sbjct: 222 DVELLLYYVDPHQPSAVLEAGAATAPAGSLMADPLLMLSLYPE----FPAAVMSSLTSHG 277 Query: 250 VMFCLMDVSGSMD---------QSTKDMAKRFYILLYL-------FLSRTYKNVEVVYIR 293 L+D SGSMD Q + A+ +LL F + + + Sbjct: 278 EFIFLVDQSGSMDCPMHHGEGAQMRIESARDTLLLLLKSLPLGCYFNIYGFGSSFQAFFP 337 Query: 294 HHTQAKEVDEHEFFY----SQET-GGTIVSSALKLMDEVVKERYNPAQWNIYAAQASDGD 348 E E + GGT + L+ + + +DG+ Sbjct: 338 QSVLYSEQTLQEALQRVKLMRADLGGTEILQPLQHIYRQA----CIPEHPRQLFIFTDGE 393 Query: 349 NWADDSPLCHEILAKKLLPVVRYYSYIEITRRAHQTLWREYEHLQSTFDNFAMQHIRDQD 408 W + L + R +S+ + L S F R Q Sbjct: 394 VW---NTRELLDLVRAHSSSHRCFSFGIGEGAS-TALITGMAKEGSGHAQFITGSDRMQP 449 Query: 409 DIYPVFR 415 + R Sbjct: 450 KVMQSLR 456 >UniRef50_D0MEC0 von Willebrand factor type A n=1 Tax=Rhodothermus marinus DSM 4252 RepID=D0MEC0_RHOM4 Length = 329 Score = 55.9 bits (133), Expect = 3e-06, Method: Composition-based stats. Identities = 31/209 (14%), Positives = 53/209 (25%), Gaps = 38/209 (18%) Query: 242 RPDPSSQAVMFCLMDVSGSM-----DQSTKDMAKRFYILLYLFLSRTYKNVEVVYIRH-- 294 + ++D+S SM S ++A+R I R + VV+ Sbjct: 81 EKRTVEGRDLMLVLDLSSSMLAQDFSPSRFEVARRTAIQFVQG-RRADRIGLVVFAGQAF 139 Query: 295 ----HTQAKEVDEHEFFYSQET---GGTIVSSALKLMDEVVKERYNPAQWNIYAAQASDG 347 T Q GT + +A+ +K + +DG Sbjct: 140 TQVPPTLDYRFLLTMLQRLQVGRLEDGTAIGTAIATAINRLKNS---EARSKVIILLTDG 196 Query: 348 DNW-ADDSPLCHEILAKKLLPVVRYYSY-IEITRRA-----------------HQTLWRE 388 N + PL LA++ + + + RE Sbjct: 197 QNNRGEIDPLTAAELARQAGIRIYTIGLSGRGEAPYPVQTPFGTRPQPVPVEIDEAMMRE 256 Query: 389 YEHLQSTFDNFAMQHIRDQDDIYPVFREL 417 F R + IY L Sbjct: 257 VAEKTGGRY-FRATDARTLEAIYAEIDRL 284 >UniRef50_A5FBS6 Uncharacterized protein with a von Willebrand factor type A (VWA) domain-like protein n=29 Tax=Bacteroidetes RepID=A5FBS6_FLAJ1 Length = 372 Score = 55.9 bits (133), Expect = 3e-06, Method: Composition-based stats. Identities = 19/115 (16%), Positives = 41/115 (35%), Gaps = 7/115 (6%) Query: 237 KNYEKRPDPSSQAVMFCLMDVSGSM---DQSTKDMAKRFYILLYLFLSRTYKNVEVVYIR 293 + +Q ++D+S SM + AK+ + L ++ Y + + Sbjct: 169 DLVVEETQHKAQMSTVLMIDISHSMILYGEDRITPAKKVAMALAELITTRYPKDTLDILV 228 Query: 294 HHTQAKEVDEHEFFYSQETG-GTIVSSALKLMDEVVKERYNPAQWNIYAAQASDG 347 A + + Y Q T + L+L ++++ + N N +DG Sbjct: 229 FGNDAWTIPIKDLPYLQVGPYHTNTVAGLQLAMDILRRKRN---TNKQIFMITDG 280 >UniRef50_A7K1D3 Protein BatA n=19 Tax=Vibrionales RepID=A7K1D3_VIBSE Length = 334 Score = 55.5 bits (132), Expect = 3e-06, Method: Composition-based stats. Identities = 22/169 (13%), Positives = 45/169 (26%), Gaps = 24/169 (14%) Query: 240 EKRPDPSSQAVMFCLMDVSGSMDQSTKDM----------AKRFYILLYLFLSRTYKNVEV 289 + + ++D+SGSM Q ++ K+ + + V Sbjct: 88 DPVEFQPKYRDLMLVVDLSGSMQQEDMELNGEYIDRLTAVKKVLSDFVAK-RKGDRLGVV 146 Query: 290 VYIRHH------TQAKEVDEHEFFYSQ---ETGGTIVSSALKLMDEVVKERYNPAQWNIY 340 ++ H T ++ + + T + + L + + P Sbjct: 147 LFGDHAYLQTPLTADRKTVMQQINQTVIGLVGQRTAIGDGIGLGTKTFVDSDAPQ---RV 203 Query: 341 AAQASDGDNW-ADDSPLCHEILAKKLLPVVRYYSYIEITRRAHQTLWRE 388 SDG N PL +AKK + + Sbjct: 204 MILLSDGSNTAGVLEPLEAAEIAKKYNATIYTVGVGAGEMMVKEFFMTR 252 >UniRef50_A1ZZI4 Von Willebrand factor type A domain protein (Fragment) n=1 Tax=Microscilla marina ATCC 23134 RepID=A1ZZI4_9SPHI Length = 976 Score = 55.5 bits (132), Expect = 3e-06, Method: Composition-based stats. Identities = 28/192 (14%), Positives = 53/192 (27%), Gaps = 24/192 (12%) Query: 238 NYEKRPDPSSQAVMFCLMDVSGSMDQSTKDMAKRFYILLYLFLSRTYKNVEVVYI----- 292 N K P + + + L+DVS S ++ L K V++ Sbjct: 288 NLPKIPRKNMKKNVVFLLDVSLSSQPDKFNVWLTTLRTLLHENRDVIKRFSVIFFNVEAF 347 Query: 293 --------RHHTQAKEVDEHEFFYSQETGGTIVSSALKLMDEVVKERYNPAQWNIYAAQA 344 + G T + AL + Y Sbjct: 348 WWRKGWTKNSPGNIASFMKFA-NKLSLEGATDLGRALDEVY-----NSKLKSKAKYLFLL 401 Query: 345 SDGD-NWADDSPLCHEILAKKLLPVVRYYSYIEITRRAHQTLWREYEHLQSTFDNFAMQH 403 SDGD +W ++ L+KK+ Y++ + F++ + Sbjct: 402 SDGDLSWGENG---LYQLSKKIASQDELYAFSTGLSGTDARILDHLARSTQG-AVFSILN 457 Query: 404 IRDQDDIYPVFR 415 + ++ FR Sbjct: 458 ESEVYEVSQNFR 469 >UniRef50_A0ED74 Chromosome undetermined scaffold_9, whole genome shotgun sequence n=1 Tax=Paramecium tetraurelia RepID=A0ED74_PARTE Length = 562 Score = 55.5 bits (132), Expect = 3e-06, Method: Composition-based stats. Identities = 20/199 (10%), Positives = 56/199 (28%), Gaps = 21/199 (10%) Query: 188 LHALEENLAIISNSEPAQLLEEERLRKEIAELRAKIERVPFIDTFDLRYKNYEKRPDPSS 247 L +E ++ ++ +L + L+ + + Sbjct: 321 LRRVENFEWDFD-----PKQNIDQKKQLKDKLIKIWQEYGQDICNKLQKSYQNIQQLNHN 375 Query: 248 QAVMFCLMDVSGSMDQSTKDMAKRFYILLYLFLSRTYKNVE----VVYIRHHTQAKEVDE 303 + ++D S SM++ D+ K+ + + + I + + + Sbjct: 376 RVHYIFILDSSESMNKDWTDI-KKGVREFIKKIKEKDQVENKEFWISLILFNKEQTTLIN 434 Query: 304 HEFFYSQE--------TGGTIVSSALKLMDEVVKERYNPAQWNIYAAQASDG-DNWADDS 354 + + GGT +K +VK+ + +DG + Sbjct: 435 SKRARDIKTKFKMNFLGGGTNFGKPIKKAINLVKKDNTSDLF--LILFYTDGKAAIPEQE 492 Query: 355 PLCHEILAKKLLPVVRYYS 373 + L ++ + + Sbjct: 493 LKKMQNLEEEKRKKIHLIA 511 >UniRef50_B3RUM1 Putative uncharacterized protein n=1 Tax=Trichoplax adhaerens RepID=B3RUM1_TRIAD Length = 1173 Score = 55.5 bits (132), Expect = 3e-06, Method: Composition-based stats. Identities = 19/190 (10%), Positives = 46/190 (24%), Gaps = 35/190 (18%) Query: 196 AIISNSEPAQLLEEERLRKEIAELRAKI--ERVPFIDTFDLRYKNYEKRPDPSSQAVMFC 253 + + V F +D R + + S + Sbjct: 192 NYLKWQY-----FGSKFGLSYTFPGRPWTTNFVGFTKDYDPRLRPWYIAAT-SGPKDVVI 245 Query: 254 LMDVSGSMDQSTKDMAKRFYILLYLFLSRTYKNVEVV----------------------- 290 ++D SM + +AK + L+R V Sbjct: 246 VIDCGLSMQGNRFKIAKSVAKTVLATLTRNDYVNIVCTRFSHWDETGKWHFYETTVLGCY 305 Query: 291 ---YIRHHTQAKEVDEHEFFYSQETGGTIVSSALKLMDEVVKERYNPAQWNIYAAQASDG 347 I ++ + + G + + + ++++ + +I +DG Sbjct: 306 KDQLIPASLTNRKSLSNAIDNLKAGGTSEMKKGFQKAFKLLRGSHRTGCQSIMIV-ITDG 364 Query: 348 DNWADDSPLC 357 + C Sbjct: 365 EKTDGPKVRC 374 >UniRef50_Q5VMI5 Putative uncharacterized protein OSJNBa0085L11.1 n=2 Tax=Oryza sativa RepID=Q5VMI5_ORYSJ Length = 393 Score = 55.5 bits (132), Expect = 3e-06, Method: Composition-based stats. Identities = 12/103 (11%), Positives = 33/103 (32%), Gaps = 12/103 (11%) Query: 234 LRYKNYEKRPDPSSQAVMFCLMDVSGSMDQSTKDMAKRFYILLYLFLSRTYKNVEVVYIR 293 +R K + + ++D+ G M + K+ + L++ + V + Sbjct: 273 VRVKAPAYTKQTRAPLDLVMVLDIGGRM--RELEQLKQGAKFIIHNLTQQDRLSIVTFGP 330 Query: 294 HHTQAKEVD----------EHEFFYSQETGGTIVSSALKLMDE 326 + E+ + +GG + + L + + Sbjct: 331 RADRLSELTPMTEQDKRSSNDAVQALEASGGVKIGAGLNVAYQ 373 >UniRef50_B3QY78 von Willebrand factor type A n=1 Tax=Chloroherpeton thalassium ATCC 35110 RepID=B3QY78_CHLT3 Length = 346 Score = 55.5 bits (132), Expect = 3e-06, Method: Composition-based stats. Identities = 29/234 (12%), Positives = 55/234 (23%), Gaps = 48/234 (20%) Query: 240 EKRPDPSSQAVMFCLMDVSGSMDQSTKD--MAKRFYILLYLFLSR--TYKNVEVVYIRHH 295 + + +DVS SM ++ + FL R + VV+ Sbjct: 80 RLKEVKRKGIEVVIALDVSNSMLADDIQPSRLQKSKYTISNFLERLGNDRVGLVVFAGQS 139 Query: 296 ------TQAKEVDEHEFFYSQETG----GTIVSSALKL---MDEVVKERYNPAQWNIY-- 340 T K + GT SSA++ E ++E + N Sbjct: 140 FVQCPITSDKSALKLFMDIVSTDAIPTQGTNFSSAIRESIRALERIEEGAEAEEKNRVRN 199 Query: 341 --AAQASDGDNWADDSPLCHEILAKKLLPVVRYYSYIEITRRA----------------- 381 SDG++ + A + Sbjct: 200 KVILIFSDGED-HEAGIDEVLEEAASKNIRIYTVGVGSAEPTPIPVLNKDGKRVDFKRDS 258 Query: 382 ---------HQTLWREYEHLQSTFDNFAMQHIRDQDDIYPVFRELFHKQNATAK 426 + L R+ D + I +L ++ + + Sbjct: 259 QGSVVTTHLQEALLRKIAEQTKGNYYRIAPQGSDFELIADDINKLEKQELSAKE 312 >UniRef50_B3QUN4 von Willebrand factor type A n=4 Tax=Bacteria RepID=B3QUN4_CHLT3 Length = 340 Score = 55.5 bits (132), Expect = 3e-06, Method: Composition-based stats. Identities = 26/203 (12%), Positives = 53/203 (26%), Gaps = 41/203 (20%) Query: 251 MFCLMDVSGSMDQST------KDMAKRFYILLYLFLSRTYKNVEVVYIRHH------TQA 298 + +D+SGSM + AK + + VV+ T Sbjct: 100 IVLAIDLSGSMLAEDFEPKNRIEAAKSVATDFIHQ-RLSDRIGLVVFSGKSFTQCPLTLD 158 Query: 299 KEVDEHEFFYSQET----GGTIVSSALKLMDEVVKERYNPAQWNIYAAQASDGD-NWADD 353 + + + GT + +A+ ++E ++ +DG N + Sbjct: 159 YRLLTNFISELKAGTIEEDGTAIGTAIATATNRLRESTAKSK---VIILLTDGQNNAGEI 215 Query: 354 SPLCHEILAKKLLPVVRYYS-------------------YIEITRRAHQTLWREYEHLQS 394 P+ LA L + Y+++ + + Sbjct: 216 EPVTAAELAAALGIKIYTVGAGTRGYARYPIPDPLFGKRYVQMKVDVDDSTLTRIARISG 275 Query: 395 TFDNFAMQHIRDQDDIYPVFREL 417 F + Y EL Sbjct: 276 GRY-FRATDLESLKKTYHEIDEL 297 >UniRef50_A0CBH4 Chromosome undetermined scaffold_164, whole genome shotgun sequence n=1 Tax=Paramecium tetraurelia RepID=A0CBH4_PARTE Length = 428 Score = 55.5 bits (132), Expect = 3e-06, Method: Composition-based stats. Identities = 25/144 (17%), Positives = 47/144 (32%), Gaps = 16/144 (11%) Query: 230 DTFDLRYKNYE-KRPDPSSQAVMFCLMDVSGSMDQSTKDMAKRFYILLYLFLSRTYKNVE 288 D ++ Y P + ++D SGSM + +D+ K + L + +Y Sbjct: 1 MNQDKQFYLYTIVDQKPDDNFSVVGVIDGSGSMSECWEDLCKAWNHFLLD-IQSSYC--- 56 Query: 289 VVYIRHHTQAKEVDEHEFFYSQETGGTIVSSALKLMDEVVKERYNPAQWNIYAAQASDGD 348 I+ A + + G T +S + ++VK N+ +DG Sbjct: 57 ---IQFDDTAILQQNPKLNQQKGEG-TNISLGFHELIKLVKSEKLKK--NVIVIFITDG- 109 Query: 349 NWADDSPLCHEILAKKLLPVVRYY 372 +D E L + Sbjct: 110 -VGED---KLEDYIDDLAENFSLF 129 >UniRef50_B1G2X7 Putative uncharacterized protein n=1 Tax=Burkholderia graminis C4D1M RepID=B1G2X7_9BURK Length = 182 Score = 55.5 bits (132), Expect = 3e-06, Method: Composition-based stats. Identities = 23/134 (17%), Positives = 44/134 (32%), Gaps = 14/134 (10%) Query: 224 ERVPFIDTFDLRYKNYEKRPDPSSQAVMFCLMDVSGSM-DQSTKDMAKRFYILLYLFLSR 282 +R + DLR+ R P + ++D SGSM +AK I L+ S Sbjct: 4 QRRRALSAEDLRF----MRDVPRGGVLHCFVLDCSGSMLAGERLALAKGLLIALFDRASA 59 Query: 283 -TYKNVEVVYIRHHTQAKEVD-------EHEFFYSQETGGTIVSSALKLMDEVVKER-YN 333 + V + + E GGT ++ ++ ++++ Sbjct: 60 MRAEAALVCFGGAGADLRFGPAVPRWWNERWLEPVGGGGGTPFAAGVQCATQLLERSARR 119 Query: 334 PAQWNIYAAQASDG 347 + +DG Sbjct: 120 KPAQQRWVWILTDG 133 >UniRef50_D1N5F6 von Willebrand factor type A n=1 Tax=Victivallis vadensis ATCC BAA-548 RepID=D1N5F6_9BACT Length = 783 Score = 55.5 bits (132), Expect = 3e-06, Method: Composition-based stats. Identities = 30/182 (16%), Positives = 54/182 (29%), Gaps = 20/182 (10%) Query: 210 ERLRKEIAELRAKIERVPFIDTFDLRYKNYEKRPDPSSQAVMFCLMDVSGSM-----DQS 264 R+ + + I + + + P + + DVS SM S Sbjct: 51 SPPRRRFRIFLLLLTMLFLIAAAARPFWSSQLVPFEPRGRDLMVIFDVSKSMLATDIAPS 110 Query: 265 TKDMAKRFYILLYLFLSRTYKNVEVVYIRHH------TQAKEVDEHEFFYSQET----GG 314 + AK L + V + T GG Sbjct: 111 RLEHAKFLLRQLVESA-PNDRFGLVAFAGKAYLACPLTSDSLAFTQYIDELNTDTVPLGG 169 Query: 315 TIVSSALKLMDEVVKERYNPAQWNIYAAQASDGDNWADDSPLCHEILAKKLLPVVRYYSY 374 T + +AL++ ++ K A N +DGD A +S + L K+ +P + Sbjct: 170 TNLEAALRVAEQAFKA---AAGGNRGILLFTDGDELAGNSAALVDELRKRQIP-LFIVGL 225 Query: 375 IE 376 + Sbjct: 226 GD 227 >UniRef50_B8G7S2 Magnesium chelatase n=9 Tax=cellular organisms RepID=B8G7S2_CHLAD Length = 696 Score = 55.5 bits (132), Expect = 4e-06, Method: Composition-based stats. Identities = 34/182 (18%), Positives = 55/182 (30%), Gaps = 16/182 (8%) Query: 185 RRELHALEENLAIISNSEPAQLLEEERLRKEIAELRAKIERVPFIDTFDLRYKNYEKRPD 244 R A + + Q L I + I DLR K R Sbjct: 453 RAARVTDLALDATLREAAIYQRKRRMELMHTIDT-PYRRRPKIVIKRSDLRQK---VRVR 508 Query: 245 PSSQAVMFCLMDVSGSMDQSTKDMAKRF--YILLYLFLSRTYKNVEVVY-------IRHH 295 + AV ++D S SM + A + LL R + V + + Sbjct: 509 RTRNAVC-FVVDASWSMAAEERMQATKAAVLSLLRDAYQRRDQVGLVSFQRDYARVLLPL 567 Query: 296 TQAKEVDEHEFFYSQETGGTIVSSALKLMDEVVKE-RYNPAQWNIYAAQASDGD-NWADD 353 T + E+ + G T +S L E+++ R A+ +DG N + Sbjct: 568 TNSVELAQRRLQSMPTGGKTPLSRGLLTAFELLERARRRDAEVVPLMVLLTDGQANVSIS 627 Query: 354 SP 355 Sbjct: 628 DL 629 >UniRef50_B9KV79 Putative uncharacterized protein n=1 Tax=Rhodobacter sphaeroides KD131 RepID=B9KV79_RHOSK Length = 1043 Score = 55.5 bits (132), Expect = 4e-06, Method: Composition-based stats. Identities = 23/156 (14%), Positives = 44/156 (28%), Gaps = 22/156 (14%) Query: 237 KNYEKRPDPSSQAVMFCLMDVSGSMDQSTKDMAKRFYILLYLFLS----RTYKNVEVVYI 292 K S A ++ +DVSGSM + K L + N + + Sbjct: 246 KAPIVPEANISDAAIYITLDVSGSMSGTRMAAQKAGVAALIREIGASVDPDRPNDIRIVL 305 Query: 293 RHHTQA-----KEVDEHEFFYSQ---------ETGGTIVSSALKLMDEVVKERYNPAQWN 338 + A + ++ ++ + +GGT ++A Sbjct: 306 WNAGLAGSIERRNMEPDDYTALEDWMLALSNSTSGGTNFNAAFAEASTFFA---GGGSKR 362 Query: 339 IYAAQASDGDNWADDSPLCHEILAKKLLPVVRYYSY 374 +DG+ S E L P + + Sbjct: 363 RIVIFVTDGEPSPVSSVDAAEATIASLPP-ADIFGF 397 >UniRef50_B9PUS9 Microneme protein, putative n=5 Tax=Sarcocystidae RepID=B9PUS9_TOXGO Length = 769 Score = 55.5 bits (132), Expect = 4e-06, Method: Composition-based stats. Identities = 23/146 (15%), Positives = 49/146 (33%), Gaps = 16/146 (10%) Query: 246 SSQAVMFCLMDVSGSMDQSTKDMAKRFYILLYLFL---SRTYKNVEVVYIRHHTQAKEV- 301 ++Q + L+D SGS+ + K+F + L N V Y ++ Sbjct: 72 TNQLDICFLIDSSGSIGIQNFRLVKQFLHTFLMVLPIGPEEVNNAVVTYSTDVHLQWDLQ 131 Query: 302 DEHEFFYSQET----------GGTIVSSALKLMDEVVKERYNPAQWNI--YAAQASDGDN 349 + G T S LK +++ P + ++ +DG++ Sbjct: 132 SPNAVDKQLAAHAVLEMPYKKGSTNTSDGLKACKQILFTGSRPGREHVPKLVIGMTDGES 191 Query: 350 WADDSPLCHEILAKKLLPVVRYYSYI 375 +D + ++L +V + Sbjct: 192 DSDFRTVRAAKEIRELGGIVTVLAVG 217 >UniRef50_D2R6S0 Putative uncharacterized protein n=1 Tax=Pirellula staleyi DSM 6068 RepID=D2R6S0_9PLAN Length = 324 Score = 55.5 bits (132), Expect = 4e-06, Method: Composition-based stats. Identities = 26/256 (10%), Positives = 54/256 (21%), Gaps = 38/256 (14%) Query: 174 LARRTAMTAGKRRELHALEEN-------LAIISNSEPAQLLEEERLRKEIAELRAKIERV 226 +ARR + E+ LA S + R I + + Sbjct: 72 IARRLPQQTEGPPSVEGTEQEGKPSVSSLASFSRETKSPGGRPLREIPSIDVAAVMADEL 131 Query: 227 PFIDTFDLRYKNYEKRPDPSSQAV---MFCLMDVSGSMDQSTKDMAKRFYILL---YLFL 280 + A ++D S SM L L Sbjct: 132 RRNPPAAKPIGEVASFSMFGATARGRSFVLVIDRSQSMGGEGLGAIAAAMQELQVQLAAL 191 Query: 281 SRTYKNVEVVY-----IRHHTQAKEVDEHEFFYS-------QETGGTIVSSALKLMDEVV 328 + + V Y + + V E +G T + + + Sbjct: 192 TPQQRVQVVAYNDSAALYGDGRLVPVTPAEQEKLVRFVGGIVASGATEHRTGILAALRLA 251 Query: 329 KERYNPAQWNIYAAQASDGDNWADDSPLCHEIL-AKKLLPVVRYYSYIEI---TRRAHQT 384 E +DG + + ++ A + + + Sbjct: 252 PE---------VVFLLTDGGDPPLSNHDLRTLVEAAAGRTQIIVLEFGRGKLSEENSQDE 302 Query: 385 LWREYEHLQSTFDNFA 400 ++ + Sbjct: 303 KLKKLAQATGGMYRYV 318 >UniRef50_B0NBU8 Putative uncharacterized protein n=1 Tax=Clostridium scindens ATCC 35704 RepID=B0NBU8_EUBSP Length = 1865 Score = 55.5 bits (132), Expect = 4e-06, Method: Composition-based stats. Identities = 30/190 (15%), Positives = 61/190 (32%), Gaps = 34/190 (17%) Query: 249 AVMFCLMDVSGSMDQSTKDM--AKRFYILLYLFLSRTYKNVEVVYIRH------------ 294 A + ++D S SM ++ K + + E+ I + Sbjct: 1067 ASIVLVLDASASMQENGKKLKDIQDAAKAFVNTTKEKSPISEIAVIWYQGSEGSSSTITD 1126 Query: 295 -------HTQAKEVDEHEFFYSQETGGTIVSSALKLMDEVVKERYNPAQWNIYAAQASDG 347 + + +GGT + AL+ + ++ R N ++ YA +DG Sbjct: 1127 SGFYTLDTSDNVDAINRFISNKNASGGTPMGDALEEANSILSGRPNSSK---YALLFTDG 1183 Query: 348 D---NWADDSPLCHEIL-----AKKLLPVVRYYSYIEITRRAHQTLWREYEHLQSTFDNF 399 N +++S C AK++ + Y+ W E ST ++ Sbjct: 1184 MPGYNSSNNSFNCMVANHANNEAKEIKEYAKLYTIGYKLS--GSFKWEEGHSQDSTNNHG 1241 Query: 400 AMQHIRDQDD 409 + + D Sbjct: 1242 SHKTETKAAD 1251 Score = 47.8 bits (112), Expect = 8e-04, Method: Composition-based stats. Identities = 45/348 (12%), Positives = 93/348 (26%), Gaps = 55/348 (15%) Query: 38 INKRSVTDVDSGESVSIPTEDISEPMFHQGRGGLRHRVHPGNDHFVQNDRIERPQGGGGG 97 NK + DS V D + + + N+++ + Sbjct: 341 ANKEGIIPADSKLKVIPVLPDDKKTKDQYKEVEDKLKDKAKNENYSIAGFLAYDISFVDE 400 Query: 98 SGSGQGQASQDGEGQDEFVFQISKDEYLDLLFEDLALPNLKQNQQRQLTEYKTHRAGYTA 157 G + I K+ + + + +L++N++ Q+ + A Sbjct: 401 DGKEVEPDGNVKVTMEYKKDVIPKEVEVTEKDLGVTVMHLEENEKGQVKKVVDMVAD--- 457 Query: 158 NGVPANISVVRSLQNSLARRTAMTAGKRRELHALEENLAIISNSEPAQLLEEERLRKEIA 217 +G A++ T K+ E + ++ E LL + Sbjct: 458 DGSKASVET-----------TDNGKVKKAEFVTDSFSTFTLAWQEYEPLLTDYANGSVRT 506 Query: 218 ELRAKIERVPFIDTFDLRYKNYEK----------RPDPSSQAVMFCLMDVSGSMDQST-- 265 + ++Y +K + + + ++D SGSM + Sbjct: 507 SDNSLGAPEH---NKRIKYNEKDKDYTLTLDVTGKRGKKAGVDVLLVIDKSGSMGLNDNG 563 Query: 266 ---------KDMAKRFYILLYL-FLSRTYKNVEVVYIRHHTQA---------------KE 300 K+ L L + V I + K Sbjct: 564 RTDSNYFNLMPTLKKTVPTLVDTILPDSDSVNRVAAISFSSDDYTGNDISTDWVDYNGKS 623 Query: 301 VDEHEFFYSQETGGTIVSSALKLMDEVVKERYNPAQWNIYAAQASDGD 348 + GGT A++ D+ +K R + SDG+ Sbjct: 624 GFNRKIEGLGTKGGTNWQLAMRNADKKLKPRAESQNKKVVVF-LSDGE 670 >UniRef50_A8LLA0 von Willebrand factor type A domain protein n=7 Tax=Proteobacteria RepID=A8LLA0_DINSH Length = 328 Score = 55.5 bits (132), Expect = 4e-06, Method: Composition-based stats. Identities = 27/203 (13%), Positives = 50/203 (24%), Gaps = 30/203 (14%) Query: 240 EKRPDPSSQAVMFCLMDVSGSMDQSTK-----------DMAKRFYILLYLFLSRTYKNVE 288 E S+ + +D+SGSMD K + Sbjct: 86 EPITITSAARDLVLAVDISGSMDDRDMTAPDGTRLQRLQAVKDVVGAFVAE-REGDRISL 144 Query: 289 VVYIRHH------TQAKEVDEHEFFYSQ---ETGGTIVSSALKLMDEVVKERYNPAQWNI 339 +V+ T+ + Q T + A+ L ++ Sbjct: 145 IVFGAKPFIQAPFTEDLDSVVELLNQVQTGMAGPNTAIGDAIGLAIRSFEDSEIEE---R 201 Query: 340 YAAQASDGDNWADD-SPLCHEILAKKLLPVVRYYSYIEITRRAHQTL----WREYEHLQS 394 SDG + A +P+ +A + + + L + Sbjct: 202 LLILLSDGADTASTMTPINAAQIAAQEGITIYTIGVGNPDGSGEERLDPATLEDIATRGG 261 Query: 395 TFDNFAMQHIRDQDDIYPVFREL 417 FA + +IY L Sbjct: 262 GAFYFADD-VEGLSEIYAEIDAL 283 >UniRef50_C8P229 Putative uncharacterized protein n=1 Tax=Erysipelothrix rhusiopathiae ATCC 19414 RepID=C8P229_ERYRH Length = 1466 Score = 55.5 bits (132), Expect = 4e-06, Method: Composition-based stats. Identities = 23/123 (18%), Positives = 38/123 (30%), Gaps = 20/123 (16%) Query: 243 PDPSSQAVMFCLMDVSGSMDQSTKDM-------AKRFYILLYLFLSRTYKNVEVVYIRHH 295 + + ++D SGSMD AKR I + + V + + Sbjct: 87 RLKREPSDIVLVLDTSGSMDPQKNPQGIDRISKAKREAIHFVNEIFERDASARVALVSYG 146 Query: 296 TQAKE----------VDEHEFFYSQETGGTIVSSALKLMDEVVKERYNPAQWNIYAAQAS 345 T+ + +E + GGT AL ++ + P N S Sbjct: 147 TKVSSNSFHTKQESNLLINEIKSLKAEGGTFTQGALYEAKMLLNQSSAP---NKTIVLLS 203 Query: 346 DGD 348 DG Sbjct: 204 DGQ 206 >UniRef50_UPI0001AED79F von Willebrand factor type A n=1 Tax=Streptomyces albus J1074 RepID=UPI0001AED79F Length = 221 Score = 55.5 bits (132), Expect = 4e-06, Method: Composition-based stats. Identities = 27/188 (14%), Positives = 47/188 (25%), Gaps = 22/188 (11%) Query: 249 AVMFCLMDVSGSMDQSTKDMAKRFYILLYLFLSRTYKNVE---VVYIRHHTQAKE----V 301 + L D SGSM D R L+ +S + I A V Sbjct: 4 LPFYLLCDESGSMTGDPIDAINRALPDLHHEISTNPTVADKTRFCLIGFSDDASVLQPLV 63 Query: 302 DEHEFFY---SQETGGTIVSSALKLMDEVVK------ERYNPAQWNIYAAQASDGDNWAD 352 D + G T +A + + V+ + + A SDG + Sbjct: 64 DLSDIDEVPALSAGGLTDYGTAFRTLLRSVEKDVAELKAQGHEVYRPVAFFLSDGIPTDE 123 Query: 353 DSPLCHEILAKKLLPVVRYYSYIEITRRAHQTLWREYEHLQSTFDNFAMQHIRDQDDIYP 412 D P H L + ++ + + + + Sbjct: 124 DWPTAHRELLNS-RYAPKIIAFGI-----GDAEAQIIGQVANFRAFIQKDNSVSPAQALR 177 Query: 413 VFRELFHK 420 F + Sbjct: 178 EFASSLTR 185 >UniRef50_C3ZZV3 Putative uncharacterized protein (Fragment) n=1 Tax=Branchiostoma floridae RepID=C3ZZV3_BRAFL Length = 2692 Score = 55.5 bits (132), Expect = 4e-06, Method: Composition-based stats. Identities = 27/177 (15%), Positives = 46/177 (25%), Gaps = 25/177 (14%) Query: 235 RYKNYEKRPDPSSQAVMFCLMDVSGSMDQSTKDMAKRFYILLYLF--LSRTYKNVEVV-Y 291 +Y +P + L+D SGS+ + K F + +S V VV Y Sbjct: 285 QYYTPCLVRNPEFD--LIFLLDESGSIGTDNFKLVKSFTERMANNFDISPNSTRVGVVQY 342 Query: 292 IRHHTQAKEVDEHEFFYSQE-----------TGGTIVSSALKLMD--EVVKERYNPAQWN 338 E + F G T +A+ + E + Sbjct: 343 SNFPGT--EFSLNAFTDKAAVLDAISKIDYNGGSTFTGAAIDFVRNNEFTSVNGDRDDVP 400 Query: 339 IYAAQASDGDNWADDSPLCHEILAKKLLPVVRYYSYIEITRRAHQTLWREYEHLQST 395 +DG+ D S + Y+ I Q + + Sbjct: 401 NILIVITDGNPNDDVSGPAISANN----AGITTYAVG-IGSNVDQANLVQMTAGRPG 452 Score = 48.6 bits (114), Expect = 5e-04, Method: Composition-based stats. Identities = 24/157 (15%), Positives = 48/157 (30%), Gaps = 25/157 (15%) Query: 251 MFCLMDVSGSMDQSTKDMAKRFYILLYLFLSRTYKNVEVVYIRHHTQ-AKEVDEHEFFYS 309 + L+D SGS+ + D+ K F + + V +++ Q + E + F Sbjct: 1099 LVFLLDGSGSVGSNNFDLVKTFTKNVVQNFDISETATRVAVVQYSDQFSTEFSLNAFSTK 1158 Query: 310 QE-----------TGGTIVSSALKLMDEVVKERYN--PAQWNIYAAQASDG---DNWADD 353 E TGGT A+ + + V + + +DG D+ + Sbjct: 1159 TEVYNAIDNISYLTGGTFTGFAIDFVMQSVFTSISGERDGYPDLLVVVTDGLSTDDVSGP 1218 Query: 354 SPLCHEILAKKLLPVVRYYSYIEITRRAHQTLWREYE 390 + V Y+ + + Sbjct: 1219 ADTAR-------AQGVTIYAVG-VGSDIDFNTLEQIA 1247 Score = 48.2 bits (113), Expect = 6e-04, Method: Composition-based stats. Identities = 24/151 (15%), Positives = 43/151 (28%), Gaps = 18/151 (11%) Query: 251 MFCLMDVSGSMDQSTKDMAKRFYILLYLF--LSRTYKNVEVV---------YIRHHTQAK 299 + L+D SGS+ + D+ K F L +S V VV + + Sbjct: 1381 LVFLLDGSGSVTTANFDIVKEFTRRLANNFDISLADTRVGVVQYSDSPTLEFNLNSFNTN 1440 Query: 300 EVDEHEFFYSQ-ETGGTIVSSALKLMD--EVVKERYNPAQWNIYAAQASDGDNWADDSPL 356 E+ + Q + GGT A+ + + + +DG S Sbjct: 1441 ELVDLAIRNIQYQQGGTNTGQAIDFVRVNSFSANNGDRSDVPNVMIVVTDGQ----SSDD 1496 Query: 357 CHEILAKKLLPVVRYYSYIEITRRAHQTLWR 387 + Y+ L + Sbjct: 1497 VVGPAQTARNAGISMYAVGIGNGVDTNELLQ 1527 >UniRef50_A8M2F6 von Willebrand factor type A n=3 Tax=Micromonosporaceae RepID=A8M2F6_SALAI Length = 583 Score = 55.5 bits (132), Expect = 4e-06, Method: Composition-based stats. Identities = 40/248 (16%), Positives = 67/248 (27%), Gaps = 43/248 (17%) Query: 201 SEPAQLLEEERLRKEIAELRAKIERVPFIDTFDLRYKNYEKRPDPSSQAVMFCLMDVSGS 260 A + + +D + + + M C++DVSGS Sbjct: 337 GFGAPQGAPSPEVGGASTEPGSGDAAGAVDPVAVD-RAVASWSIATQSGRMLCVIDVSGS 395 Query: 261 MDQS------------TKDMAKRFYILLYL-----------FLSRTYKNVEVVYIRHHTQ 297 M S T D A+R L L +V I + Sbjct: 396 MKGSVAGAGGASRQQVTLDAARRGLSLFDDSWQIGLWEFSTNLGSGRDYRRLVEIGPLSN 455 Query: 298 AKEVDEHEFFYSQETGGTIVSSAL----KLMDEVVKERYNPAQWNIYAAQASDGDNWADD 353 + E Q T + L E V+E ++P Q N +DG N D+ Sbjct: 456 QRSRLEQALTQIQP---TRGDTGLFDTVLAAYEAVQEEWDPGQVNSIVL-FTDGKNDDDN 511 Query: 354 SPLCHEILAK-------KLLPVVRYYSYIEITRRAHQTLWREYEHLQSTFDNFAMQHIRD 406 ++LA+ + V +A + +F + Sbjct: 512 GISQQQLLAELERIKDAERPVQVVLIGIGADVSKAELESITKVT----GGGSFVTEDPTK 567 Query: 407 QDDIYPVF 414 DI+ Sbjct: 568 IGDIFLKA 575 >UniRef50_Q04NS4 BatA n=4 Tax=Leptospira RepID=Q04NS4_LEPBJ Length = 312 Score = 55.2 bits (131), Expect = 4e-06, Method: Composition-based stats. Identities = 28/203 (13%), Positives = 58/203 (28%), Gaps = 30/203 (14%) Query: 236 YKNYEKRPDPSSQAVMFCLMDVSGSMDQSTKDMAKR----FYILLYLFL--SRTYKNVEV 289 K P+ + +DVSGSM +S + + LL F+ ++ + V Sbjct: 73 GKKTTFLPNEKKGVDVMIALDVSGSMSRSRDFLPETRLGVSKKLLRKFIDKRKSDRLGLV 132 Query: 290 VYIRHH------TQAKEVDEHEFFYSQ----ETGGTIVSSALKLMDEVVKERYNPAQWNI 339 V+ T +E + GT + A+ L ++ + Sbjct: 133 VFAGAAYLQAPLTGDRESLNEILGTIEEETVAEQGTAIGDAIILSTYRLRAS---QARSK 189 Query: 340 YAAQASDGD-NWADDSPLCHEILAKKLLPVVRYYSYIEITRRA--HQTLWREYEHLQSTF 396 +DG N P+ LA+ + + + + + RE Sbjct: 190 VIVLITDGVSNTGKIDPVTATDLAEHIGVKIYSVGIGKEDGSYEINFEILRELSASTGGK 249 Query: 397 DNFAMQHIRDQDDIYPVFRELFH 419 + + + + Sbjct: 250 FF--------RAEDPEEMKAVLT 264 >UniRef50_D0KDG5 VWA containing CoxE family protein n=9 Tax=Gammaproteobacteria RepID=D0KDG5_PECWW Length = 379 Score = 55.2 bits (131), Expect = 4e-06, Method: Composition-based stats. Identities = 28/246 (11%), Positives = 65/246 (26%), Gaps = 16/246 (6%) Query: 168 RSLQNSLARRTAMTAGKRRELHALEENLAIISNSEPAQLLEEERLRKEIAELRAKIERVP 227 R + R+ + + + + + Sbjct: 136 REAVRDIIRKVVDEILRTLRPTFTNALTGRRNRFRRSPIASSQNFDWRATIAANLKHFDR 195 Query: 228 FIDTFDLRYKNYEKRPDPSSQAVMFCLMDVSGSMDQSTKDMAKRFYILLYLFLSRTYKNV 287 + ++ R + +D S SM S A IL L + Sbjct: 196 EKQRLVIETPHFNSRMQRHMPWDVILCVDQSASMSSSVMYAAVCASILATL---PAVRVS 252 Query: 288 EVVYIRH----HTQAKEVDEHEFFYSQETGGTIVSSALKLMDEVVKERYNPAQWNIYAAQ 343 +V+ A + E Q GGT ++ A++ ++ V+ P + A Sbjct: 253 LIVFDTQVVDLSHLAHDPVE-VLMTVQLGGGTNIAKAMQYCEQRVQN---PKRT--IVAL 306 Query: 344 ASDGDNWADDS--PLCHEILAKKLLPVVRYYSYIEITRR-AHQTLWREYEHLQSTFDNFA 400 SD + + C + + + + ++ + E + ++ Sbjct: 307 ISDFEEGGALNHLLSCVQRMHSQQITLLGLAALDEAAHPVYDAAIGQKLADRGMHVAALT 366 Query: 401 MQHIRD 406 +H Sbjct: 367 PEHFAQ 372 >UniRef50_B3SBE5 Putative uncharacterized protein n=2 Tax=Trichoplax adhaerens RepID=B3SBE5_TRIAD Length = 1262 Score = 55.2 bits (131), Expect = 4e-06, Method: Composition-based stats. Identities = 29/184 (15%), Positives = 64/184 (34%), Gaps = 20/184 (10%) Query: 236 YKNYEKRPDPSSQAVMFCLMDVSGSMDQSTKDMAKRFYILLYLFLSRTYKNVEVVYIRHH 295 Y ++ +P + + +D+S SM Q D K ++L L + VV+ H Sbjct: 884 YPEFDVTENP--RPEIIIALDMSNSMKQCLMDTQKIAALILTN-LPPECRFNIVVFGSAH 940 Query: 296 TQA----KEVDEHEFF-------YSQET-GGTIVSSALKLMDEVVKERYNPAQWNIYAAQ 343 + +EV + + G + + +VKE + N+ Sbjct: 941 NELFPMYQEVSKESVNMAIKFIGSLSASWGNSNFYHVIDNFTHIVKELKANSVSNV--FL 998 Query: 344 ASDGDNWADDSPLCHEILAKKLLPVVRYYSYIEITRRAHQTLWREYEHLQSTFDNFAMQH 403 SDG ++S L + + +R +++ +++ R + + + Sbjct: 999 ISDGHFGDENSITAI--LRRDKIDNLRLFTF-STGDTSNRYFMRTLAKIGAGYHEHFDTK 1055 Query: 404 IRDQ 407 R + Sbjct: 1056 FRSK 1059 >UniRef50_Q2J8W6 von Willebrand factor, type A n=2 Tax=Actinomycetales RepID=Q2J8W6_FRASC Length = 534 Score = 55.2 bits (131), Expect = 4e-06, Method: Composition-based stats. Identities = 28/199 (14%), Positives = 50/199 (25%), Gaps = 28/199 (14%) Query: 243 PDPSSQAVMFCLMDVSGSMDQSTKDMAKRFYILLY---LFLSRTYK----NVEVVYIRHH 295 + L+DVSGSM S + L LS + ++ I Sbjct: 335 DRFRRPSHAIFLLDVSGSMAGSRIAALQAALRGLTGADDTLSGRFARFRGREKITMITFA 394 Query: 296 TQAKEVDEHE-----------------FFYSQETGGTIVSSALKLMDEVVKERYNPA-QW 337 +A + + + GT + SAL+ + Sbjct: 395 GRANDPVDFAVNDPRPGSADLAGVNTFVDGLRLQDGTAIYSALEAGYRAAGAAVEADPGY 454 Query: 338 NIYAAQASDGDNWADDSPLCHEILAKKLLPVVRYY-SYIEITRRAHQTLWREYEHLQSTF 396 +DG+N + S ++L R ++ A R+ Sbjct: 455 LTSIVLMTDGENNSGISAADFRSSYQRLPAAARAVRTFTIAFGEADPAALRDISADTGG- 513 Query: 397 DNFAMQHIRDQDDIYPVFR 415 D + R Sbjct: 514 -AVFDARTSSLADAFKDIR 531 >UniRef50_UPI00017F3212 von Willebrand factor, type A n=1 Tax=Escherichia coli O157:H7 str. EC4024 RepID=UPI00017F3212 Length = 325 Score = 55.2 bits (131), Expect = 4e-06, Method: Composition-based stats. Identities = 29/272 (10%), Positives = 73/272 (26%), Gaps = 26/272 (9%) Query: 174 LARRTAMTAGKRRELHALEENLAIISNSEPAQLLEEERLRKEIAELRAKIERVPFIDTFD 233 L R +E + L + S E + + Sbjct: 24 LGRFLFTRERTVKEYVRVP-FLPGLIESLQLNQQPERSGKVATCMFWLVWALMVCALARP 82 Query: 234 LRYKNYEKRPDPSSQAVMFCLMDVSGSMDQST-------KDMAKRFYILLYLFLSRTYKN 286 + P ++ +DVSGSM+++ ++ ++ + Sbjct: 83 EYLTPPQHIEKPMRNIMLI--LDVSGSMEKNDVAGGLTRLQAVQQSVKKFVA-ARKSDRI 139 Query: 287 VEVVYIRHH------TQAKEVDEHEFFYSQ---ETGGTIVSSALKLMDEVVKERYNPAQW 337 V++ ++ K+ E T + AL + +++ + Sbjct: 140 GLVIFANSAWPFAPVSEDKQALETRISQLTPGMAGQQTAIGDALGVTVKLLDSTGDKEAS 199 Query: 338 NIYAAQASDG-DNWADDSPLCHEILAKKLLPVVRYYSYIEITRRAHQ----TLWREYEHL 392 + +DG D + +P LA + ++ ++ L ++ + Sbjct: 200 KLAIL-LTDGNDTASQLTPRLAAQLAVSHHVQLHTIAFGDVNSSGDDKVDLNLLQDLARM 258 Query: 393 QSTFDNFAMQHIRDQDDIYPVFRELFHKQNAT 424 A D ++ + Q + Sbjct: 259 TGGRSWTAENSGASLDAVWKEIDAITPVQVKS 290 >UniRef50_D0LUP3 Uncharacterized protein containing a von Willebrand factor type A (VWA) domain-like protein n=1 Tax=Haliangium ochraceum DSM 14365 RepID=D0LUP3_HALO1 Length = 536 Score = 55.2 bits (131), Expect = 4e-06, Method: Composition-based stats. Identities = 48/355 (13%), Positives = 87/355 (24%), Gaps = 49/355 (13%) Query: 82 FVQNDRIERPQGGGGGSGSGQGQASQDGEGQDEFV-----FQISKDEYLDLLFEDLALP- 135 + R G + S + G+ S + AL Sbjct: 167 HGLGQQPARASQPGRSQPEQSDEHSDEQTGEHLAPGKHGLADTSSETDARDADLRAALQQ 226 Query: 136 ---NLKQNQQRQLTEYKTHRAGYTANGVPANISVVRSLQNSLARRTAMTAGKRRELHALE 192 L + R + E G R R + + +E Sbjct: 227 ACAELAEA-LRAVDEVAEAMQECDTWGREPG-DFGRLPIEEFQRLSQVLRETPSVRKIVE 284 Query: 193 ENLAIISNSEPAQLLEEER-LRKEIAELRAKIERVPFIDTFDLRYKNYEKR-----PDPS 246 +P R E+ + T ++ ++ R Sbjct: 285 LAGRWSELLKPRLKRGHSPRGRSELVGVTLGGGLERLCATELIKLRHPALRRVLLGQLAE 344 Query: 247 SQAV--------------MFCLMDVSGSMDQSTKDMAKRFYILLYLFLSRTYKNVEVVYI 292 +A+ M ++D SGSM + MAK + L L + + V+ Sbjct: 345 RRALVHELRGPDVLGRGPMILVVDTSGSMHGARMTMAKSLMLALALHCWEQRRPLRVLTF 404 Query: 293 RHHTQAKE----------VDEHEFFYSQETGGTIVSSALKLMDEVVKERYNPAQWNIY-A 341 + E + GGT L + E+V ER W A Sbjct: 405 GAPGEMHESEVAVDEPFWTRLEQCLSVAFGGGTDFDGPLLRVCEIVGER----PWRRADA 460 Query: 342 AQASDGDNWADDSPLCHEILAK-KLLPVVRYYSYIEITRRAHQTLWREYEHLQST 395 +DG+ + LA+ + + + R + + Sbjct: 461 VFLTDGEC--CVAEATRAQLARTRARVALNIIGVLVGRGRGLDGVADIAYRARDG 513 >UniRef50_Q01T75 von Willebrand factor, type A n=2 Tax=Candidatus Solibacter usitatus Ellin6076 RepID=Q01T75_SOLUE Length = 320 Score = 55.2 bits (131), Expect = 5e-06, Method: Composition-based stats. Identities = 25/159 (15%), Positives = 51/159 (32%), Gaps = 17/159 (10%) Query: 237 KNYEKRPDPSSQAVMFCLMDVSGSMDQSTKDMAKRFYILLYLFLSRTYKNVEVVYIRHH- 295 ++ K + ++D SGSM + + L +R + V + Sbjct: 77 QDIRKFKSEDVPVSLGLVIDNSGSMRN-KLQKVEAAALALVKASNRDDEVFIVNFNDTAY 135 Query: 296 ---------TQAKEVDEHEFFYSQETGGTIVSSALKLMDEVVKERYNPAQWNIYAAQASD 346 T E GGT + A+++ + +K+ + + + +D Sbjct: 136 LDNPKDKDFTNDIGELEQALKRIDARGGTAMRDAIQMSIDHLKKGHRDKKVLVVI---TD 192 Query: 347 G-DNWADDSPLCHEILAKKLLPVVRYYSYIEITRRAHQT 384 G DN + + A + V Y +T H+ Sbjct: 193 GNDNSSVINMERIMKNAHQ--SDVLIYGVGLLTEEEHRE 229 >UniRef50_A9UWH5 Predicted protein n=2 Tax=Monosiga brevicollis RepID=A9UWH5_MONBE Length = 2728 Score = 55.2 bits (131), Expect = 5e-06, Method: Composition-based stats. Identities = 27/169 (15%), Positives = 59/169 (34%), Gaps = 24/169 (14%) Query: 234 LRY-KNYEKRPDPSSQAVMFCLMDVSGSMDQSTKDMAKRFYILL------YLFLSRTYKN 286 LRY + Y + + A + ++DVSGSM + A+ F + + ++ Sbjct: 2507 LRYVQQYSSHAEYAEAADVLMVVDVSGSMTDY-MEQARAFVRTIAREGFHLDSTASQHRM 2565 Query: 287 VEVVYIRHH---------TQAKEVDEHEFFYSQETGGTIVSSALKLMDEVVK--ERYNPA 335 + T + G T ALKL+++ ++ + +PA Sbjct: 2566 ALFTFGTTATALGGEPLFTSDWAQLQQRVAQIAVNGATNYLPALKLVEQSLRDLKASDPA 2625 Query: 336 QWN---IYAAQASDGDNWADDSPLCHEILAKKLLPVV--RYYSYIEITR 379 ++N +DG N + A++++ + + + Sbjct: 2626 RYNASRRIVLFQTDGSNSDRNQTRAITATARRIVDELDATLMAVLTGAG 2674 >UniRef50_O05809 Uncharacterized protein Rv2850c/MT2916 n=51 Tax=Bacteria RepID=Y2850_MYCTU Length = 629 Score = 55.2 bits (131), Expect = 5e-06, Method: Composition-based stats. Identities = 21/123 (17%), Positives = 37/123 (30%), Gaps = 10/123 (8%) Query: 250 VMFCLMDVSGSMDQSTKDMAKR--FYILLYLFLSRTYKNVEVVYIRH-------HTQAKE 300 ++ ++D SGSM + A LL R K + + +H T + Sbjct: 451 LVIFVVDASGSMAARDRMAAVSGATLSLLRDAYQRRDKVAVITFRQHEATLLLSPTSSAH 510 Query: 301 VDEHEFFYSQETGGTIVSSALKLMDEVVKERYNPAQWNI-YAAQASDGDNWADDSPLCHE 359 + G T ++ L ++ + +DG A PL Sbjct: 511 IAGRRLARFSTGGKTPLAEGLLAARALIIREKVRDRARRPLVVVLTDGRATAGPDPLGRS 570 Query: 360 ILA 362 A Sbjct: 571 RTA 573 >UniRef50_B9XQJ6 von Willebrand factor type A n=1 Tax=bacterium Ellin514 RepID=B9XQJ6_9BACT Length = 342 Score = 55.2 bits (131), Expect = 5e-06, Method: Composition-based stats. Identities = 24/173 (13%), Positives = 52/173 (30%), Gaps = 21/173 (12%) Query: 251 MFCLMDVSGSMDQSTKDMAK--RFYILLYLFLSRTYK--NVEVVYIRHH------TQAKE 300 + L+D S SM + ++ R + F+ + + V + T + Sbjct: 97 VMFLLDCSKSMLAADVQPSRLSRSKYAILDFVQQHGRGRVGLVAFAGQAFLQCPLTFDYD 156 Query: 301 VDEHEFFYSQET----GGTIVSSALKLMDEVVKERYNPAQWNIYAAQASDGDNWADDSPL 356 E GGT + AL +++ +DG++ Sbjct: 157 AFRDALLAIDEQTIPVGGTDIGRALDEAYRAMEKNDRHK----ILVLITDGEDLEKAGIK 212 Query: 357 CHEILAKKLLPVVRYYSYIEITRRAHQTLWRE--YEHLQSTFDNFAMQHIRDQ 407 + LA+K VV + + ++++ N H+ + Sbjct: 213 TAQALAEK-GIVVYTIGVGTAAGSPIKVMNERGMLDYVKDEQGNVVESHLDEA 264 >UniRef50_A6DKL3 Putative uncharacterized protein n=1 Tax=Lentisphaera araneosa HTCC2155 RepID=A6DKL3_9BACT Length = 890 Score = 55.2 bits (131), Expect = 5e-06, Method: Composition-based stats. Identities = 27/207 (13%), Positives = 50/207 (24%), Gaps = 25/207 (12%) Query: 241 KRPDPSSQAVMFCLMDVSGSMDQ------STKDMAKRFYILLYLFLSRTYKNVEVVYIRH 294 K ++ + +MD SGSM + ++A L + Sbjct: 378 KNEHRKLRSNLSIVMDRSGSMGMTVKGGKTKMELANEGAAQTIELLGAMDSVSVIAVDTE 437 Query: 295 HT---------QAKEVDEHEFFYSQETGGTIVSSALKLMDEVVKERYNPAQWNIYAAQAS 345 A E+ + GG V + L+ ++ R S Sbjct: 438 AHAIVPQTVLKDAPEIASQARRVKSQGGGIYVYTGLEESWRQLEGREGQKH----VILFS 493 Query: 346 DGDNWADDSP-LCHEILAKKLLPVVRYYSYIEITRRAHQTLWREYEHLQSTFDNFAMQHI 404 D ++ + K V + E + + F + Sbjct: 494 DSNDSEEPGRYKELLADMKDEGMTVSVIALGE-RTDVDSPFLIDIANRGRGRIFFTDDPL 552 Query: 405 RDQD----DIYPVFRELFHKQNATAKG 427 + V R F K+ K Sbjct: 553 SLPSIFAQETVTVARSAFLKEVTATKS 579 >UniRef50_UPI0000E472E1 PREDICTED: similar to parturition-related protein PRP3, partial n=1 Tax=Strongylocentrotus purpuratus RepID=UPI0000E472E1 Length = 528 Score = 55.2 bits (131), Expect = 5e-06, Method: Composition-based stats. Identities = 28/238 (11%), Positives = 57/238 (23%), Gaps = 32/238 (13%) Query: 197 IISNSEPAQLLEEERLRKEI--AELRAKIERVPFIDTFDLRYKNYEKRPDPSSQAVM--- 251 + E L + + + D + V+ Sbjct: 44 FCDDDESDPDSLHNPLAPNLMNKLCNYRSAWSVMRNHTDFQTDVPPVSNTTPVFEVLQLS 103 Query: 252 -----FCLMDVSGSMDQSTK--DMAKRFYILLYLFLSRTYKNVEVVYIRHHTQAKEVDE- 303 ++D+S SM + + M + + + + K VV+ + E+ + Sbjct: 104 SVRSVVLVLDISNSMKEKNRFDRMIQSSTVYIMSAIPADSKLGIVVFESASREIAELTDI 163 Query: 304 ----------HEFFYSQETGGTIVSSALKLMDEVVKERYNPAQWNIYAAQASDGDNWADD 353 + G T + +K +V+ Y Y SDG D Sbjct: 164 TDTASRQRLVNALNT-SPKGTTCIGCGIKSGLKVL-GSYAQGG---YILLLSDGVENEDP 218 Query: 354 SPLCHEILAKKLLPVVRYYSYIEITRRAHQTLWREYEHLQSTFDNFAMQHIRDQDDIY 411 S ++ + + S +F IY Sbjct: 219 SITDMYDDINNSGVIIDTITI----SNEADQQMEDLSKNTSGKSSFCSDTGIGPCLIY 272 >UniRef50_C7DFN0 Magnesium chelatase ATPase subunit D n=1 Tax=Thalassiobium sp. R2A62 RepID=C7DFN0_9RHOB Length = 548 Score = 55.2 bits (131), Expect = 5e-06, Method: Composition-based stats. Identities = 62/391 (15%), Positives = 114/391 (29%), Gaps = 50/391 (12%) Query: 30 IKQSISEAINKRSVTDVDSGESVSIPTEDISEPMFHQGRGGLRHRVHPGNDHFVQNDRIE 89 +++S + I +V +VS+ + P F + + + D + Sbjct: 164 LRRSARKVIVPPAVLSDLVTLAVSLGISSLRAPTF--ALNAAQTHAAWQDRDTLTADDVA 221 Query: 90 RPQGGGGGSGSGQGQ--ASQDGEGQDEFVFQISKDEYL----DLLFEDLA--LPNLKQNQ 141 + Q + + QD +D+ D+L + + LP N Sbjct: 222 IAVALVYAHRATQMPHDDATENTPQDANQDSQPQDQPFSIPKDILLDAIKAVLP---PNL 278 Query: 142 QRQLTEYKTHRAGYTANGVPANISVVRSLQNSLARRTAMTAGKRRELHALEENLAIISNS 201 L + RA +G IS R R G + + + +A + + Sbjct: 279 LANLDAQTSKRAAGVGSGAK-RISNRR------GRPLPARTGSKASAARV-DLIATLRAA 330 Query: 202 EPAQLLEEERLRKEIAELRAKIERVPFIDTFDLRYKNYEKRPDPSSQAVMFCLMDVSGSM 261 P Q L + P DLR+K Y+ S ++ +D SGS Sbjct: 331 IPWQTLRKNA---------EPARIGPIFRQSDLRHKCYQTL----SDRLLIFTVDASGSA 377 Query: 262 DQSTKDMAKRFYILLYLFLS-RTYKNVEVVY-------IRHHTQAKEVDEHEFFYSQETG 313 + AK LL R + + I T++ + G Sbjct: 378 AMARLAEAKGAVELLLSEAYARRDHVALIAFRGTDADLILPPTRSLVQTKKRLAALPGGG 437 Query: 314 GTIVSSALKLMDEVVKERYNPAQWNIYAAQASDG-DNWADDSPLCHE------ILAKKLL 366 GT +++ L + + +DG N A D + L K + Sbjct: 438 GTPLAAGLTAALALAQTTTQKGLT-PTIVLLTDGRANVALDGTGNRKLAATDAQLIAKKI 496 Query: 367 PVVRYYSYIEITRRAHQTLWREYEHLQSTFD 397 R + + T + +E Sbjct: 497 NAARVEAIVIDTTMRPERALQELAQTMDATY 527 >UniRef50_Q99KC8 von Willebrand factor A domain-containing protein 5A n=29 Tax=Euteleostomi RepID=VMA5A_MOUSE Length = 793 Score = 55.2 bits (131), Expect = 5e-06, Method: Composition-based stats. Identities = 29/201 (14%), Positives = 56/201 (27%), Gaps = 29/201 (14%) Query: 236 YKNYEKRPDPSSQAVMFCLMDVSGSMDQS---------TKDMAKRFYILLYLFLS----- 281 Y + + + LMD SGSMD + AK +LL L Sbjct: 267 YPDIPEVEASKACGEFVFLMDRSGSMDSPMSTENNSQLRIEAAKETLLLLLKSLPMGCYF 326 Query: 282 --RTYKNVEVVYIRHHTQ-AKEVDEHEFFYSQE----TGGTIVSSALKLMDEVVKERYNP 334 + + + + ++ E + GGT + + L + K P Sbjct: 327 NIYGFGSSYEKFFPESVKYTQDTMEDAVKRVKALKANLGGTEILTPL---CNIYKASSIP 383 Query: 335 AQWNIYAAQASDGDNWADDSPLCHEILAKKLLPVVRYYSYIEITRRAHQTLWREYEHLQS 394 + +DG+ +D + E+ + + +L + + Sbjct: 384 -GHPLQLFVFTDGE-VSDTFSVIREVKLNSKKHRCFSFGIGQGAST---SLIKNIARVSG 438 Query: 395 TFDNFAMQHIRDQDDIYPVFR 415 F R Q + Sbjct: 439 GTAVFITGKDRMQTKALGSLK 459 >UniRef50_Q23AF2 von Willebrand factor type A domain containing protein n=3 Tax=Tetrahymena thermophila SB210 RepID=Q23AF2_TETTH Length = 368 Score = 55.2 bits (131), Expect = 5e-06, Method: Composition-based stats. Identities = 27/185 (14%), Positives = 42/185 (22%), Gaps = 47/185 (25%) Query: 253 CLMDVSGSMD---------QSTKDMAKRFY-ILLYLFLSRTYKNVEVVYIRHHTQAKEVD 302 ++D+SGSMD S K L L Y+ V+ + + D Sbjct: 192 FVIDISGSMDYTFKANGETISRLAFVKSQLTKTLAEQLKP-YQKFNVIIFGNSASQWKTD 250 Query: 303 ------------EHEFFYSQETGGTIVSSALKLMDEVVKERYNPAQWNIYAAQASDGDNW 350 G T +SS L L + SDG Sbjct: 251 YIDATPENIQAAIAYINKLTTNGATNISSGLDLAFNTKQALNG-------IYLLSDG--V 301 Query: 351 ADDSPLCHEILAKKLLPV---------VRYYSYIEIT------RRAHQTLWREYEHLQST 395 + + + K L + S+I R + Sbjct: 302 PNSGVQTVDGIKKYLADKNASRNEKVHINTISFIMGGTETQNDRNLSFQFLNAIADATNG 361 Query: 396 FDNFA 400 Sbjct: 362 SFKGI 366 >UniRef50_Q73UD3 UPF0353 protein MAP_3435c n=4 Tax=Mycobacterium avium complex (MAC) RepID=Y3435_MYCPA Length = 335 Score = 55.2 bits (131), Expect = 5e-06, Method: Composition-based stats. Identities = 28/214 (13%), Positives = 53/214 (24%), Gaps = 42/214 (19%) Query: 240 EKRPDPSSQAVMFCLMDVSGSMDQSTKD-----MAKRFYILLYLFLSRTYKNVEVVYIRH 294 P ++AV+ ++DVS SM + AK L+ V + + Sbjct: 88 SDVRIPLNRAVVMLVIDVSESMASTDVPPNRLAAAKEAGKQFADQLTPAINLGLVEFAAN 147 Query: 295 ------HTQAKEVDEHEFFYSQETGGTIVSSALKLMDEVVK---------ERYNPAQWNI 339 T + + Q T + + + E PA+ Sbjct: 148 ATLLVPPTTNRAAVKAGIDSLQPAPKTATGEGIFTALQAIATVGSVMGGGEGPPPAR--- 204 Query: 340 YAAQASDG-DNWADD-----SPLCHEILAKKLLPVVRYYSYIEITRR-----------AH 382 SDG +N D AK + S+ Sbjct: 205 -IVLESDGAENVPLDPNAPQGAFTAARAAKAEGVQISTISFGTPYGTVDYEGATIPVPVD 263 Query: 383 QTLWREYEHLQSTFDNFAMQHIRDQDDIYPVFRE 416 ++ + F + ++Y + Sbjct: 264 DQTLQKICEITDGQ-AFHADSLDSLKNVYSTLQR 296 >UniRef50_A6G0N1 Protein containing a von Willebrand factor type A domain n=1 Tax=Plesiocystis pacifica SIR-1 RepID=A6G0N1_9DELT Length = 1606 Score = 55.2 bits (131), Expect = 5e-06, Method: Composition-based stats. Identities = 25/176 (14%), Positives = 52/176 (29%), Gaps = 26/176 (14%) Query: 238 NYEKRPDPSSQAVMFCLMDVSGSMDQSTKDMAKRFYILLYLFLSRTYK--------NVEV 289 + E P +A + ++D S D + L LS + EV Sbjct: 532 DIEFDKLPPPRADVVVVVDTSAFGDDAEHQSRLAVAEALLRSLSESDNFAVVSADLTAEV 591 Query: 290 VYIRH-----HTQAKEVDEHEFFYSQETGGTIVSSALKLMDEVVKERYNPAQWNIYAAQA 344 +Y T+ + + G T + + + + V + PA +Y Sbjct: 592 LYPEQGMTPASTENIDAALEALGERRHGGATDLGAIFERALDRVHDADQPAV--VYI--- 646 Query: 345 SDGDNWADDSPLCHEILAKKLLPVVR-----YYSYIEITRRAHQTLWREYEHLQST 395 GD A + L+++L + ++ + +L + Sbjct: 647 --GDGLATSGERGSDDLSERLRRALSGSPARLFTVG-VGPTIDDSLLERLARVGGG 699 >UniRef50_A9B607 von Willebrand factor type A n=6 Tax=Chloroflexi (class) RepID=A9B607_HERA2 Length = 550 Score = 55.2 bits (131), Expect = 5e-06, Method: Composition-based stats. Identities = 23/126 (18%), Positives = 37/126 (29%), Gaps = 14/126 (11%) Query: 237 KNYEKRPDPSSQAVMFCLMDVSGSM-DQSTKDMAKRFYILLYLFLSRTYKNVEVVYIRHH 295 + P A + ++D SGSM D D AK + L + + Sbjct: 367 NAWANNRKP---ANIMLVVDSSGSMRDDDKMDQAKLGVEVFLNRLPSKDNVGMIGFSSSP 423 Query: 296 TQA--KEVDEHEFFYSQ-------ETGGTIVSSALKLMDEVVKERYNPAQWNIYAAQASD 346 Q G T + A+ L + ++ P + N SD Sbjct: 424 AVLVPLATRSENMANLQMQTQGLVPDGNTSLYDAIDLARQELENLKQPDRIN-AIVVLSD 482 Query: 347 GDNWAD 352 G + A Sbjct: 483 GADTAS 488 >UniRef50_C4DDW8 von Willebrand factor type A-like protein n=1 Tax=Stackebrandtia nassauensis DSM 44728 RepID=C4DDW8_9ACTO Length = 425 Score = 55.2 bits (131), Expect = 5e-06, Method: Composition-based stats. Identities = 24/126 (19%), Positives = 39/126 (30%), Gaps = 21/126 (16%) Query: 252 FCLMDVSGSMDQSTKDMAKRFYILLYLFLSRTYKNVEV----------------VYIRHH 295 ++D SGSM S AKR I L + V ++ Sbjct: 47 VIMVDCSGSMTGSRIAEAKRATIAAIESLDEGCRFAIVKGTDEAQMVYPDDETTAVVKSS 106 Query: 296 TQAKEVDEHEFFYSQETGGTIVSSALKLMDEVVKERYNPAQWNIYAAQASDGDNWADDSP 355 T++ V + + GGT + + L ++ Y +DG N + Sbjct: 107 TRSAAVKRVQI--LRAGGGTAMGTWLAKTSRIL---SGTDAAVKYGLLLTDGRNQHETEE 161 Query: 356 LCHEIL 361 E L Sbjct: 162 ELREHL 167 >UniRef50_C6JPK6 BatA protein n=2 Tax=Fusobacterium RepID=C6JPK6_FUSVA Length = 325 Score = 54.8 bits (130), Expect = 6e-06, Method: Composition-based stats. Identities = 27/158 (17%), Positives = 48/158 (30%), Gaps = 27/158 (17%) Query: 237 KNYEKRPDPSSQAVMFCLMDVSGSMDQST-----KDMAKRFYILLYLFLSRTYKN----- 286 K ++ ++ L+D S SM + AKR L L + Sbjct: 71 KLLDEDTVEVKGLNIYALIDTSRSMMAEDVYPNRLEAAKRTLENLLQGLK-GDRIGFIPF 129 Query: 287 VEVVYIRHH-TQAKEVDEHEFFYSQ----ETGGTIVSSALKLMDEVVKERYNP-AQWNIY 340 + YI+ T + ++ GGT + AL+L ++ + N Sbjct: 130 SDSAYIQMPLTDDYSIGKNYINALDTNLISGGGTELYQALELA----EKSFKEINSDNKT 185 Query: 341 AAQASDGDNWADDSPLCHEILAKKLLPVVRYYSYIEIT 378 SDG D + + + +S T Sbjct: 186 IIILSDG---GDFDEKSLKFVKDNKM---NVFSIGIGT 217 >UniRef50_Q74B80 Putative uncharacterized protein n=1 Tax=Geobacter sulfurreducens RepID=Q74B80_GEOSL Length = 575 Score = 54.8 bits (130), Expect = 6e-06, Method: Composition-based stats. Identities = 56/379 (14%), Positives = 109/379 (28%), Gaps = 46/379 (12%) Query: 25 RYKAQIKQSISEAINKRSVTDVDSGESVSIPTEDISEPMFHQGRGG------LRHRVHPG 78 R A ++ + + I + +G+S S+P H + R + G Sbjct: 180 REAAALRDELMKLIRDSAHQQPKNGQS----RPSQSQPGSHGDQNDGTGSDNSRQQNGTG 235 Query: 79 NDHFVQNDRIERPQGGGGGSGSGQGQASQDGEGQDEFVFQI-SKDEYLDLLFEDLALPNL 137 ++ Q ++ Q +GSGQ ++S S++ L E L+ ++ Sbjct: 236 SNGSRQAQNGDKQQISSQPAGSGQDKSSGHSSQGQSGASDSPSQESVRQALAEALSGKSV 295 Query: 138 --------KQNQQRQLTEYKTHRAGYTANGVPANISVVRSLQNSLARRTAMTAGKRRELH 189 K + ++ G A + + + +A+ Sbjct: 296 GSFGNVGEKLAELLCQNATESSFNGTAAAAPRLPTAQLLNQNGGYDDLSALRVHTAALRA 355 Query: 190 ALEENLAIISNSEPAQLLEEERLRKEIAELRAKIERVPFIDTFDLRYKNYEKRPDPSSQA 249 L+ + + RL + + + D R K + Sbjct: 356 RLQGLVQASKQKRSTPVSVGHRLDSRV---------LTRLRICDTRVFT-RKEEKRAVNT 405 Query: 250 VMFCLMDVSGSMDQ----STKDMAKRFYILLYLFL--SRTYKNVEVVYIRHHTQAK---- 299 + L+D SGSM + +A R + L + + H Sbjct: 406 AVCMLLDSSGSMGNTTILNKMGIASRACFVAAEALFSIPGVRTAIATFKGHDNHVFPMVN 465 Query: 300 --EVDEHEFFYSQETGGTIVSSALKLMDEVVKERYNPAQWNIYAAQASDGDNWADDSPLC 357 E +H F +GGT + AL + R + SDGD D P+ Sbjct: 466 FGEKPDHSRFNITGSGGTRLGHALWWAWGELSLR---RETRKICIAFSDGDT--GDGPVT 520 Query: 358 HEILAKKLLPVVRYYSYIE 376 + + + Sbjct: 521 QAAIKRMREEGIEVIGIGI 539 >UniRef50_A6CAQ4 Putative uncharacterized protein n=1 Tax=Planctomyces maris DSM 8797 RepID=A6CAQ4_9PLAN Length = 338 Score = 54.8 bits (130), Expect = 6e-06, Method: Composition-based stats. Identities = 25/218 (11%), Positives = 50/218 (22%), Gaps = 34/218 (15%) Query: 203 PAQLLEEERLRKEIAELRAKIERVPFIDTFDLRYKNYEKRPDPSSQAVMFCLMDVSGSM- 261 P Q +L + + + N S ++D SGSM Sbjct: 124 PTQAPSAPTTDLTTDQLMSPS---TALAPPVMGAGNVNFFDAVDSGKRFVFVLDCSGSMA 180 Query: 262 --DQSTKDMAKRFYILLYLFLSRTYKNVEVVYIR----------------HHTQAKEVDE 303 + A+ I L+ + + Y + + Sbjct: 181 APQGAPIRKARSELISSLAGLNHHQQFQIIFYNTTTRAMQHRGKSAELLYATDINRTLAR 240 Query: 304 HEFFYSQETGGTIVSSALKLMDEVVKERYNPAQWNIYAAQASDGDNWADDSPLCHEILAK 363 + GGT ALK +NP +D + S + + Sbjct: 241 QFIQSVEPDGGTDHLPALKRAL-----SFNPD----VIFFLTDAKHPQLSS-ADLNDIRE 290 Query: 364 KLLPVVRYYS--YIEITRRAHQTLWREYEHLQSTFDNF 399 + + + + E + + Sbjct: 291 QNGGKAKIHCIEFGEGFPVKEGNSLDKLARQNKGSYRY 328 >UniRef50_Q9YD81 Putative uncharacterized protein n=1 Tax=Aeropyrum pernix RepID=Q9YD81_AERPE Length = 463 Score = 54.8 bits (130), Expect = 6e-06, Method: Composition-based stats. Identities = 24/153 (15%), Positives = 50/153 (32%), Gaps = 24/153 (15%) Query: 246 SSQAVMFCLMDVSGSMDQSTKDMAKRFYILLYLFLSRTYKN------VEVVYIRHHTQAK 299 SS+ ++ L+D SGSM + D A+ + L+ + V Y H + + Sbjct: 293 SSRGPIYVLLDKSGSMVGAKIDWARAVAVALFRRSLAENRRFSARFFDSVTYPAIHLRPR 352 Query: 300 EVDEH------EFFYSQETGGTIVSSALKLMDEVVKERYNPAQWNIY--AAQASDGDNWA 351 + GGT +++A+K + + P +DG+ Sbjct: 353 SKPRDFLELVKYLAAVKAGGGTDITAAIKTAADDISR--TPRGEQRISDIVLITDGE--- 407 Query: 352 DDSPLCHEILAKK--LLPVVRYYSYIEITRRAH 382 + + + R ++ I + Sbjct: 408 ---DRLNIDVVEDSLKRSDARLHTVIIQGHNPY 437 >UniRef50_A4QDZ6 Putative uncharacterized protein n=1 Tax=Corynebacterium glutamicum R RepID=A4QDZ6_CORGB Length = 354 Score = 54.8 bits (130), Expect = 6e-06, Method: Composition-based stats. Identities = 24/167 (14%), Positives = 43/167 (25%), Gaps = 29/167 (17%) Query: 253 CLMDVSGSMDQSTKDMAKRFYILLYL----------FLSRTYKNVEVVYIRHHTQAKEVD 302 ++DVSGSM + K L L K + + + Sbjct: 172 FVLDVSGSMLGQRITLLKDTMSDLISGGATTDLANVSLRGREKVSIIPFSFGPHEVISET 231 Query: 303 ------------EHEFFYSQETGGTIVSSALKLMD-EVVKERYNPAQWNIYAAQASDGDN 349 + Q GGT + A+ E Y P+ +DG+ Sbjct: 232 LGAVGSPSRIDLQQRVEALQADGGTGIYDAVLAAYAESAGGDYIPS-----IVLMTDGEL 286 Query: 350 WADDSPLCHEILAKKLLPVVRYY-SYIEITRRAHQTLWREYEHLQST 395 A + L +R ++ + A+ + Sbjct: 287 TAGRTYDQFLTEWNALPSNIRSIPVFVILYGEANVADMEQLAATTGG 333 >UniRef50_Q11RQ7 BatA-like protein, aerotolerance-related n=1 Tax=Cytophaga hutchinsonii ATCC 33406 RepID=Q11RQ7_CYTH3 Length = 351 Score = 54.8 bits (130), Expect = 6e-06, Method: Composition-based stats. Identities = 23/174 (13%), Positives = 49/174 (28%), Gaps = 30/174 (17%) Query: 250 VMFCLMDVSGSMDQSTK-----DMAKRFYILLYLFLSRTYKNVEVVYIRHH------TQA 298 M +DVS SM + D AK+ + + V++ T Sbjct: 113 NMIFAIDVSESMKITDIHPSRFDAAKQICTDIINK-RSNDRIGIVIFSGEAVTLSPLTND 171 Query: 299 KEVDEHEFFYSQE----TGGTIVSSALKLMDEVVKERYNPAQWNIYAAQASDGDNW-ADD 353 + +++ ++ GT + +AL +K SDG+N Sbjct: 172 YVLLKNQLNDLKQNKDLQSGTAIGTALGTAINRLKNAETKE---RIIVLISDGENTSGLM 228 Query: 354 SPLCHEILAKKLLPVVRYYSYI-EITRRAHQTLWREYEHLQSTFDNFAMQHIRD 406 P+ L + + + T + + + + + Sbjct: 229 DPITAADLCLEYNIKIYCIGLGKDGTHQFKDD---------NGTIQYVESKLDE 273 >UniRef50_A9BLP5 von Willebrand factor type A n=3 Tax=Burkholderiales RepID=A9BLP5_DELAS Length = 244 Score = 54.8 bits (130), Expect = 6e-06, Method: Composition-based stats. Identities = 19/124 (15%), Positives = 39/124 (31%), Gaps = 16/124 (12%) Query: 235 RYKNYEKRPDPSSQAVMFCLMDVSGSMDQSTKDMAKRFYILLYLFL---SRTYKNVEVVY 291 ++ + +P P + L+DVSGSM + + V Sbjct: 10 KFTAPKAKPLP-----VVLLLDVSGSMSGEKIRNVNDAVRDMLDTFSDTENGETEIHVAI 64 Query: 292 IRHHTQA--KEVDEHEFF----YSQETGGTIVSSALKLMDEVVKER--YNPAQWNIYAAQ 343 I +Q + G T + +AL++ +++++ + Sbjct: 65 ITFGSQVALHQPLASASDIHWQDLSAGGMTPLGTALQMAKAMIEDKDVIPSRAYRPTVVL 124 Query: 344 ASDG 347 SDG Sbjct: 125 VSDG 128 >UniRef50_A9B6J8 von Willebrand factor type A n=1 Tax=Herpetosiphon aurantiacus ATCC 23779 RepID=A9B6J8_HERA2 Length = 950 Score = 54.8 bits (130), Expect = 6e-06, Method: Composition-based stats. Identities = 32/214 (14%), Positives = 57/214 (26%), Gaps = 41/214 (19%) Query: 238 NYEKRPDPSSQAVMFCLMDVSGSMDQ----------------STKDMAKRFYILLYLFLS 281 + R + ++D SGSMD D+AK L Sbjct: 398 DVRNRQQRP-DIALVFIIDKSGSMDACHCNGGDMAAREGGGTRKIDIAKEAVAQAAAVLG 456 Query: 282 RTYKNVEVVYIRHHTQAKEVDE--------HEFFYSQETGGTIVSSALKLMDEVVKERYN 333 + K V + E+D+ +G T V S + E +++ Sbjct: 457 KDDKLGVVTFDDSAHWTIELDKVPSQDDVVAALAPVPPSGQTNVVSGMNAAYEQLRQSDA 516 Query: 334 PAQWNIYAAQASDGDNWADDSPLCHEILAKKLLPVVRYYSYIEITRRAHQTLWREYEHLQ 393 + A +DG A D E + K + + + Y L Sbjct: 517 KIKH---AILLTDGWGHATDIGSIAENMNKD---GITLSVVAAGNGSDNA--LQRYAELG 568 Query: 394 STFDNFAMQHIRDQDDIYPVFRELFHKQNATAKG 427 + ++F ++ A G Sbjct: 569 GGRYY--------PARVMEEVPQIFLQETIQAVG 594 Score = 47.8 bits (112), Expect = 7e-04, Method: Composition-based stats. Identities = 21/148 (14%), Positives = 42/148 (28%), Gaps = 16/148 (10%) Query: 241 KRPDPSSQAVMFCLMDVSGSMDQSTKDMAKRFYILLYLFLSRTYKNVEVVYIRHHTQAKE 300 + P L+D S S+ + ++F + K VV+ + + Sbjct: 57 QVRQPVQNLTTVFLLDSSDSIAPGQRSNNEQFIAQALETMQEGDKAAVVVF-GENALVER 115 Query: 301 VDEH-----EFFYSQETGGTIVSSALKLMDEVVKERYNPAQWNIYAAQASDGDNWADDSP 355 V T +S A++L + + SDG ++S Sbjct: 116 VPSEIQRLGTIQSVPIAARTDISEAIQLGLAL----FPADTQKRLVL-LSDG---GENSG 167 Query: 356 LCHEI--LAKKLLPVVRYYSYIEITRRA 381 E+ LA++ + Sbjct: 168 RALEMIPLAQRRNVPIDIVPTGIGQGNP 195 >UniRef50_Q7K0H4 Straightjacket n=11 Tax=Coelomata RepID=Q7K0H4_DROME Length = 1218 Score = 54.8 bits (130), Expect = 6e-06, Method: Composition-based stats. Identities = 31/196 (15%), Positives = 57/196 (29%), Gaps = 27/196 (13%) Query: 221 AKIERVPFIDTFDLRYKNYEKRPDPSSQAVMFCLMDVSGSMDQSTKDMAKRFYILLYLFL 280 +K + +D +D R +++ +S + LMD SGSM D+AK + L Sbjct: 239 SKWRKDVPVDLYDCRLRSW-YMEAATSPKDIVILMDGSGSMLGQRLDIAKHVVNTILDTL 297 Query: 281 SRTYKNVEVVYIRHHTQAKEVDEHEFFYSQETGGTIV------------------SSALK 322 + + + E + + ++AL Sbjct: 298 GTNDFVNIFTFDKEVSPVVPCFEDTLIQANLG---NIRELKEGIELFRPKSIANYTAALT 354 Query: 323 LMDEVVKE---RYNPAQWNIYAAQASDGDNWADDSPLCHEILAKKLLPVVRYYSYIEITR 379 E+++E AQ N DG + VR ++Y+ Sbjct: 355 KAFELLEETKLSSRGAQCNQAIMIIGDGAPENNREVFELHNWRDPPYKPVRVFTYLIGKE 414 Query: 380 --RAHQTLWREYEHLQ 393 W E+ Sbjct: 415 VANWDDIRWMACENQG 430 >UniRef50_Q055Y9 Putative uncharacterized protein n=4 Tax=Leptospira RepID=Q055Y9_LEPBL Length = 379 Score = 54.8 bits (130), Expect = 7e-06, Method: Composition-based stats. Identities = 18/168 (10%), Positives = 53/168 (31%), Gaps = 24/168 (14%) Query: 252 FCLMDVSGSMDQ-----STKDMAKRFYILLYLFLSRTYKNVEVVY----IRHHT------ 296 ++D SGSM++ +AK+ L + + Y + Sbjct: 68 LFIVDASGSMNEYLGIYQKIHLAKKHVSRYISTLPTETEIGFIAYGNRIPGCSSSRLYEP 127 Query: 297 ---QAKEVDEHEFFYSQETGGTIVSSALKLMDEVVKERYNPAQWNIYAAQASDG-DNWAD 352 + ++ F +G T ++ ++++ ++ +R + +DG ++ Sbjct: 128 LQRENHGTFKNRLFSLTPSGATPLAESIRIAGNLISQRKKETE----IILITDGVESCYG 183 Query: 353 DSPLCHEILAKKLLPVVRYYSYIEITRRAHQTLWREYEHLQSTFDNFA 400 D + L K+ +++ + + + + Sbjct: 184 DPKKELQAL-KQQGIYFKFHILGLGLKPDEERKMKILAEEGNGKYFGI 230 >UniRef50_D2R3Y3 von Willebrand factor type A n=1 Tax=Pirellula staleyi DSM 6068 RepID=D2R3Y3_9PLAN Length = 776 Score = 54.8 bits (130), Expect = 7e-06, Method: Composition-based stats. Identities = 37/287 (12%), Positives = 76/287 (26%), Gaps = 61/287 (21%) Query: 109 GEGQDEFVFQISKDEYLDLLFEDLA---LPN------LKQNQQRQLTEYKTHRAGYTANG 159 D + + D + ++LA LP ++ R R Sbjct: 219 RVWDDVSLLRQGVLGSADYIDDELAEVSLPRWPVSRGIEPPIARGYDRSFFLRNRVFPPI 278 Query: 160 VPANISVVRSLQNSL----------ARRTAMTAGKRRELHALEENLAIISNSEPAQLLEE 209 P +R+L+ L ++ R A+E+ LA + P Sbjct: 279 PPRASDELRTLEPPLWFAAERAWQASQDLKAARLVRPSQIAVEDFLAAMDYRFPVPQAGL 338 Query: 210 ER-------LRKEIAELRAKIERVPFIDTFDLRYKNYEKRPDPSSQAVMFCLMDVSGSMD 262 + + R + ++ N P + M +DVS SM Sbjct: 339 ALRTAAGPSIASLTDRAASPGPRS-SLLLLGIQGANI---PKTAPSRHMIVAVDVSSSMH 394 Query: 263 QS-TKDMAKRFYILLYLFLSRTYKNVEVVYIRHHTQAKEVDEHEFFYSQET--------- 312 + + + + V + ++V E + + Sbjct: 395 RQGRMQQVRSALDKFTSQMRDGDQLSIVAF-------RDVSEVLVERATASEAQSAVAML 447 Query: 313 ------GGTIVSSALKLMDEVV------KERYNPAQWNIYAAQASDG 347 GT ++S L+ + P+ ++ +DG Sbjct: 448 DLPVVVSGTNLASGLQQSLLLAMQAPGDATATPPSATSVVV--ITDG 492 >UniRef50_B6H7Y2 Pc16g08660 protein n=1 Tax=Penicillium chrysogenum Wisconsin 54-1255 RepID=B6H7Y2_PENCW Length = 944 Score = 54.8 bits (130), Expect = 7e-06, Method: Composition-based stats. Identities = 25/196 (12%), Positives = 51/196 (26%), Gaps = 24/196 (12%) Query: 234 LRYKNYEKRPDPSSQAVMFCLMDVSGSMDQSTKDMAKRFYILLYLFLSRTYKNVEVVYIR 293 L K + + ++D SGSM+ K + L + Sbjct: 273 LMASLVPKFNLRPASPEVVFVIDRSGSMES-KIPTLKSALQVFLKSLPVGICFNICSFGS 331 Query: 294 -----------HHTQAKEVDEHEFFYSQET-GGTIVSSALKLMDEVVKERYNPAQWNIYA 341 + + GGT + A+ V+ R N ++ Sbjct: 332 YYSFMWPTSQVYDASSLNQALAFVDTVYADMGGTEMKQAVVA---TVQNRLNFEDLDVLI 388 Query: 342 AQASDGDNWADDSPLCHEILAKKLLPV--VRYYSYIEITRRAHQTLWREYEHLQSTFDNF 399 +DG + D ++ R++S +H L + F Sbjct: 389 L--TDGQIFDQD---QLFNFVREKAADNTARFFSLGIGEAASHS-LIEGIARAGNGFCQS 442 Query: 400 AMQHIRDQDDIYPVFR 415 ++ I + + Sbjct: 443 VTEYETLDRKIVRMLK 458 >UniRef50_Q2FLV5 Protoporphyrin IX magnesium-chelatase n=1 Tax=Methanospirillum hungatei JF-1 RepID=Q2FLV5_METHJ Length = 619 Score = 54.8 bits (130), Expect = 7e-06, Method: Composition-based stats. Identities = 28/244 (11%), Positives = 66/244 (27%), Gaps = 23/244 (9%) Query: 184 KRRELHALEENLAIISNSEPAQLLEEERLRKEIAELRA---KIERVPFIDTFDLRYKNYE 240 K+ + ++ + P+ + + ++ + Sbjct: 368 KKGSHQKISDSGRYFRSKNPSGKIYDIAFDATFRAAAPHQITRSNGTLALNISVQDIRVK 427 Query: 241 KRPDPSSQAVMFCLMDVSGSMDQSTKDMAKR--FYILLYLFLSRTYKNVEVVYIRH---- 294 +R S + + ++D SGSM + + A + LL + + + Sbjct: 428 ERKRKSGR-TIIFVVDSSGSMGAAKRMSAVKGAVLSLLKDAYINRDQVALISFRGPGAEV 486 Query: 295 ---HTQAKEVDEHEFFYSQETGGTIVSSALKLMDEVVKE-RYNPAQWNIYAAQASDG--- 347 T++ H+ + G T +SS + +++ R + + SDG Sbjct: 487 LLKPTRSGMTAYHQLAHLPTGGQTPLSSGIYTTVSLIRTIRRKNSHDEPFVIIISDGRAN 546 Query: 348 ----DNWADDSPLCHEILAKKLLPVVRYYSYIEITRRAHQTLWREYEHLQSTFDNFAMQH 403 DN A+ YY L ++ Sbjct: 547 HARSDNDPVAEAWMAAAAARN--EKAHYYVIDTECGYPRFFLAKKLAEHIGGTYTLLDTM 604 Query: 404 IRDQ 407 ++ Sbjct: 605 NGEE 608 >UniRef50_UPI000180B353 PREDICTED: similar to putative calcium activated chloride channel-like protein 1; eCLCA1 n=1 Tax=Ciona intestinalis RepID=UPI000180B353 Length = 1580 Score = 54.8 bits (130), Expect = 7e-06, Method: Composition-based stats. Identities = 27/198 (13%), Positives = 53/198 (26%), Gaps = 27/198 (13%) Query: 247 SQAVMFCLMDVSGSMDQSTKDMAKRFYILLYL-FLSRTYKNVEVVYIRHHTQAKEV--DE 303 + ++DVSGSM + M ++ L K V + E+ Sbjct: 333 ASRRFVLVLDVSGSMSGNRLLMMRQSAGDFISTSLPDGDKVGIVQFHSSANLMMEIRQIS 392 Query: 304 HEFFYSQ--------ETGGTIVSSALKLMDEVVKERYNPAQWNIYAAQASDGDNWADDSP 355 + G T + + ++ + + +DG Sbjct: 393 SQLDRVAIAAGIPGIAGGSTCIGCGIYAAMNEMERH-DANETCGNIIVLTDGKENQPPYV 451 Query: 356 LCHEILAKKLLPVVRYYSYIEITRRAHQTLWREYEHLQSTFD------------NFAMQH 403 LA + VV + T + L + +FA+ Sbjct: 452 NDVSQLAIQKNCVVNAILF---TTTENSALVDLVTATGGQWFFAQDRDLKRLMGSFAVIA 508 Query: 404 IRDQDDIYPVFRELFHKQ 421 D D+ + + +KQ Sbjct: 509 ANDDGDVRNLVSTILYKQ 526 >UniRef50_Q4J9H5 Conserved protein n=2 Tax=Sulfolobus RepID=Q4J9H5_SULAC Length = 360 Score = 54.8 bits (130), Expect = 7e-06, Method: Composition-based stats. Identities = 21/163 (12%), Positives = 50/163 (30%), Gaps = 21/163 (12%) Query: 247 SQAVMFCLMDVSGSMDQSTK-DMAKRFYILLYLFLSRTY--------KNVEVVYIRHHTQ 297 S ++D S SM + K ++A L + ++E +Y + Sbjct: 37 SGIHYVVMIDNSPSMKKENKINLALSSASRLVQDIIPGNFISIYLFSNDIETLYEGESGK 96 Query: 298 AKEVDEHEFFYSQETGGTIVSSALKLMDEVVKERYNPAQWNIYAAQASDGDNWADDSPLC 357 E + T + A+ + E ++ ++ + SDG Sbjct: 97 QIE-----LKSIKMGYTTNLHKAITKVLE----KFKSSEIPVKIILLSDGKPTDKRYSRD 147 Query: 358 HEILAKKLLPVVRYYSYIEITRRAHQTLWREYEHLQSTFDNFA 400 +E L ++ V+ + + ++ + + S Sbjct: 148 YESL--QVPKNVQLITIG-LGEDYNEAIMKILADKGSGVFYHI 187 >UniRef50_B0G4W3 Putative uncharacterized protein n=1 Tax=Dorea formicigenerans ATCC 27755 RepID=B0G4W3_9FIRM Length = 685 Score = 54.8 bits (130), Expect = 7e-06, Method: Composition-based stats. Identities = 30/187 (16%), Positives = 56/187 (29%), Gaps = 20/187 (10%) Query: 253 CLMDVSGSMDQSTKDMAKRFYILLYLFLSRTYKNVEVVYIRHHT---------QAKEVDE 303 + DVSGSMD S + AK+ + V T Sbjct: 347 MVADVSGSMDGSPLNEAKQVMSDFVGSVQF-DAGDLVELTSFSTGVCLEQEFSDDAATLT 405 Query: 304 HEFFYSQETGGTIVSSALKLMDEVVKERYNPAQWNIYAAQASDG-DNWADDSPLCHEILA 362 ++ T + AL E R +DG DN+++ + +A Sbjct: 406 NDINNLVTGDMTSLYDALYTAVE----RVAAQNGARCVIAFTDGNDNYSNCTKEDVVNVA 461 Query: 363 KKLLPVVRYYSYIEITRRAHQTLWREYEHLQSTFDNFAMQHIRDQ--DDIYPVFRELFHK 420 + V I + + + N + D+ ++IY + ++L+ Sbjct: 462 NRYHVPVFIIGIGSI---DYADVNDIATQTGGMYYNVSDVTSMDKIYEEIYQMEKQLYLV 518 Query: 421 QNATAKG 427 + G Sbjct: 519 EFEDNTG 525 >UniRef50_C3YUL3 Putative uncharacterized protein n=1 Tax=Branchiostoma floridae RepID=C3YUL3_BRAFL Length = 1201 Score = 54.8 bits (130), Expect = 7e-06, Method: Composition-based stats. Identities = 24/166 (14%), Positives = 53/166 (31%), Gaps = 28/166 (16%) Query: 247 SQAVMFCLMDVSGSMDQSTKDMAKRFYILLYL------------FLSRTYKNVEVVYIRH 294 + +F L+D SGS++ + K+F + + + + +N V + Sbjct: 666 APLDLFFLLDGSGSVNAANFVKVKQFAVNVVNTFDVSLTATRVGVVQYSDRNTLV-FNLG 724 Query: 295 HTQAKEVDEHEFFYSQ-ETGGTIVSSALKLMDEVVKERYNPAQWNIYAAQASDG---DNW 350 + K ++GGT +AL+ + + R +DG D+ Sbjct: 725 NKVNKPSTVSAINNIVYQSGGTNTGAALQYVRQYAAWR--GGNVPKVIIVLTDGKSSDSV 782 Query: 351 ADDSPLCHEILAKKLLPVVRYYSYIEITRRAHQTLWREYEHLQSTF 396 + S + Y+ + Q L + + + Sbjct: 783 SGPSQNLVAAGVE-------VYAIGVGSFDHGQLL--QIANNKQNN 819 >UniRef50_A0BJE7 Chromosome undetermined scaffold_11, whole genome shotgun sequence n=2 Tax=root RepID=A0BJE7_PARTE Length = 2123 Score = 54.8 bits (130), Expect = 7e-06, Method: Composition-based stats. Identities = 25/137 (18%), Positives = 45/137 (32%), Gaps = 10/137 (7%) Query: 244 DPSSQAVMFCLMDVSGSMDQSTKDMAKRFYILLYLFLSRTYKNVEVVYIRHHTQA----- 298 + ++D S SM+ D FL K +V I + +A Sbjct: 1935 REQQEIFYLFILDDSFSMEGKKGDEMMESLRQQLKFLKSN-KYAKVSVISFNYKASLQIE 1993 Query: 299 -KEVDEHEFFYSQ-ETGGTIVSSALKLMDEVVKERYNPAQWNIYAAQASDGD-NWADDSP 355 K+ G T LKL + + +Y Y SDG+ ++ S Sbjct: 1994 FKKPKAKLIKQITLVGGITNFDPPLKLCLDQI-LKYEKKIDQAYILLYSDGEGSYPQQSL 2052 Query: 356 LCHEILAKKLLPVVRYY 372 + L+++L + + Sbjct: 2053 AEYITLSQELRNKISFL 2069 >UniRef50_A9CX50 RTX toxin, putative n=1 Tax=Shewanella benthica KT99 RepID=A9CX50_9GAMM Length = 1830 Score = 54.8 bits (130), Expect = 7e-06, Method: Composition-based stats. Identities = 22/120 (18%), Positives = 40/120 (33%), Gaps = 21/120 (17%) Query: 248 QAVMFCLMDVSGSMDQSTKDMAKRFYILLYLFLSRTYK-----NVEVVYIRHHTQAKEV- 301 + ++D SGSM + D AK + ++ L + V ++ + T K Sbjct: 1301 NYNIAFILDSSGSMGSTNVDTAKDQLLEIFNTLKASATGSHAGVVNILLVDFDTGTKVTV 1360 Query: 302 ------------DEHEFFYSQETGGTIVSSALKLMDEVVK--ERYNPAQWNIYAAQASDG 347 E G T +A +L+ E + E + N+ +DG Sbjct: 1361 SVNLADDNAISNLEDALDTIYSGGRTNYEAAFELVTEWLNNGEAASNDGTNLT-YFITDG 1419 >UniRef50_Q17A73 Dihydropyridine-sensitive l-type calcium channel n=4 Tax=Culicidae RepID=Q17A73_AEDAE Length = 1173 Score = 54.8 bits (130), Expect = 7e-06, Method: Composition-based stats. Identities = 29/214 (13%), Positives = 60/214 (28%), Gaps = 27/214 (12%) Query: 222 KIERVPFIDTFDLRYKNYEKRPDPSSQAVMFCLMDVSGSMDQSTKDMAKRFYILLYLFLS 281 + +R +DTFD R +++ + + L+D SGSM +A+ + S Sbjct: 195 EWDR-RQVDTFDCRKRSW-YIETATCSKDIVILLDNSGSMTGYRNYIAQLTVKSILDTFS 252 Query: 282 RTYKNVEVVY---------------IRHHTQAKEVDEHEFFYSQETGGTIVSSALKLMDE 326 Y I+ + + + G V A E Sbjct: 253 NNDFINIYKYSNDVDPLVDCFADMLIQATPENIRFMNEKVRGLEPDGYANVKKAFVKAFE 312 Query: 327 VVKERYNP--------AQWNIYAAQASDGDNWADDSPLCHEILAKKLLPV-VRYYSYIEI 377 +++ Y + N +DG + + VR ++Y+ Sbjct: 313 LLQ-HYREMRRCNETVSGCNQAIMLITDGVPSNITDVFEQYNWFENGTKIPVRVFTYLLG 371 Query: 378 TRRAHQTLWREYEHLQSTFDNFAMQHIRDQDDIY 411 + L + Q+++ Sbjct: 372 REVTKVREIQWMACLNRGHYSHIQSLDEVQEEVL 405 >UniRef50_Q1N498 Putative uncharacterized protein n=1 Tax=Bermanella marisrubri RepID=Q1N498_9GAMM Length = 731 Score = 54.4 bits (129), Expect = 7e-06, Method: Composition-based stats. Identities = 28/175 (16%), Positives = 59/175 (33%), Gaps = 25/175 (14%) Query: 254 LMDVSGSMDQSTKDMAKRFYILLYLFLSRTYKNVEVVYIRH---------HTQAKEVDE- 303 L+D+SGSM + + + L F + + V++ + K + E Sbjct: 356 LLDISGSMQGKFQTLIEGVKKGLKRF-NPQDRVRVVLFNDYASNLTGGFLPATQKNIAEI 414 Query: 304 -HEFFYSQETGGTIVSSALKLMDEVVKERYNPAQWNIYAAQASDGD-NWADDSPLCHEIL 361 + GGT + ++ + A W +DG N + L Sbjct: 415 IRKLDLVLPNGGTHLMDGVRFALSGLDADRTSAIW-----LVTDGVTNVGETKQRKFVDL 469 Query: 362 AKKLLPVVRYYSYIEITRRAHQTLWREYEHLQSTFDNFAMQHIRDQDDIYPVFRE 416 K+ +R +++I ++ L + + F ++ + DDI + Sbjct: 470 LKQK--DIRVFTFIMGNGA-NRPLLKAITKASNGFA----INVSNSDDIIGQLEK 517 >UniRef50_C6XGA4 Putative uncharacterized protein n=2 Tax=Candidatus Liberibacter asiaticus str. psy62 RepID=C6XGA4_LIBAP Length = 374 Score = 54.4 bits (129), Expect = 7e-06, Method: Composition-based stats. Identities = 22/167 (13%), Positives = 48/167 (28%), Gaps = 28/167 (16%) Query: 238 NYEKRPDPSSQAVMFCLMDVSGSMDQ------STKDMAKRFYILLYLFLSRTYKNVEVV- 290 + + ++ M ++DVS SM+ + DMA + + + VV Sbjct: 161 SVKVNSQTDARLDMMIVLDVSRSMESFFDSSITKIDMAIKSINAMLEEVKLIPDVNNVVQ 220 Query: 291 --YIRHHTQAKEV---------DEHEFFYSQETG-GTIVSSALKLMDEVV--------KE 330 + + +E + + Y + G T + LK + Sbjct: 221 SGLVTFSNKIEEFFLLEWGVSHLQRKIKYLSKFGVSTNSTPGLKYAYNQIFDMQGMRQHC 280 Query: 331 RYNPAQWNIYAAQASDGDNWAD-DSPLCHEILAKKLLPVVRYYSYIE 376 A + +DG+N + + + Y+ Sbjct: 281 NTEDANYKKIIVFMTDGENLSTKEDQQSLYYCNEAKKRGAIVYAIGI 327 >UniRef50_C9KWG4 Putative uncharacterized protein n=1 Tax=Bacteroides finegoldii DSM 17565 RepID=C9KWG4_9BACE Length = 454 Score = 54.4 bits (129), Expect = 7e-06, Method: Composition-based stats. Identities = 30/163 (18%), Positives = 55/163 (33%), Gaps = 18/163 (11%) Query: 238 NYEKRPDPSSQAVMFCLMDVSGSMDQSTKDMAKRFYILLYLFLSRTYKNVEVVYIRHHTQ 297 ++R + M +D SGSM + +AK + + + ++ + + Sbjct: 292 QTKRREPRQIKGPMIVAIDTSGSMSGKAESIAKALLLEITQMAKKQHRKC--FLLSFSVR 349 Query: 298 AKEVDEH---------EFFYSQETGGTIVSSALKLMD-EVVKERYNPAQWNIYAAQASDG 347 A+ +D EF S +GGT LK + +E Y A SD Sbjct: 350 AQALDTAHSGNWKKVREFMVSHFSGGTDGEEMLKTALHTLTQENYLMAD----VLIISDF 405 Query: 348 DNWADDSPLCHEILAKKLLPVVRYYSYIEITR-RAHQTLWREY 389 + P + K+ VR+Y ++ L + Sbjct: 406 EFDFCCKPTE-SRIRKEQERGVRFYGLQIGNGVNVYEELLDKV 447 >UniRef50_Q4RX89 Chromosome 11 SCAF14979, whole genome shotgun sequence. (Fragment) n=1 Tax=Tetraodon nigroviridis RepID=Q4RX89_TETNG Length = 1160 Score = 54.4 bits (129), Expect = 7e-06, Method: Composition-based stats. Identities = 22/179 (12%), Positives = 49/179 (27%), Gaps = 22/179 (12%) Query: 267 DMAKRFYILLYLFLSRTYKNVEVVY--------------IRHHTQAKEVDEHEFFYSQET 312 + K + + LS + ++ + + K++ + + Q Sbjct: 179 RLMKTSVMEMLDTLSDDDYVNVARFNEKADAVVPCFRTLVQANVRNKKIFKEAVMHMQAK 238 Query: 313 GGTIVSSALKLMDE-VVKERYNPAQW-NIYAAQASDGDNWADDSPLCHEILAKKLLPVVR 370 G T S E ++ E P N +DG +D VR Sbjct: 239 GTTDYKSGFTFAFEQLLNESSAPRANCNKMIMMFTDG---GEDRAQEIFEKYNWPNKTVR 295 Query: 371 YYSYIEITRRAHQTLWREYEHLQSTFDNFAMQHIRDQDDIYPVFRELFHKQN--ATAKG 427 +++ T + + F + I + ++ + A +K Sbjct: 296 VFTFSVGQHNYDVTPLQWIA-CSNKGYYFEIPSIGAIRINTQEYLDVLGRPMVLAGSKA 353 >UniRef50_B3RWW7 Putative uncharacterized protein n=1 Tax=Trichoplax adhaerens RepID=B3RWW7_TRIAD Length = 732 Score = 54.4 bits (129), Expect = 7e-06, Method: Composition-based stats. Identities = 27/225 (12%), Positives = 64/225 (28%), Gaps = 28/225 (12%) Query: 196 AIISNSEPA-QLLEEERLRKEIAELRAKIERVPFIDTFDLRYKNYEKRPD----PSSQAV 250 + L + R R D + + +R S Sbjct: 179 DYFIENYNNDPNLRNQYFGGNDGVFRTFPGRPWPKDESGIVLYDCRQRGWYILGSDSPKN 238 Query: 251 MFCLMDVSGSMDQSTKDMAKRFYILLYLFLSRTYKNVEVVY--------------IRHHT 296 + L+D SGSM +AK L L++ + + I+ Sbjct: 239 VIILIDRSGSMRGMPLAIAKWGTSNLLDTLNQNDFFTILTFNESITPVIDCYTNLIQATD 298 Query: 297 QAKEVDEHEFFYSQETGGTIVSSALKLMDEVVKER-----YNPAQWNIYAAQASDGDNWA 351 + K++ + + G + + + ++++ + A+ +DG Sbjct: 299 ENKKLYKTYLEKFTDGGRADFNHSYAMAFDLLQNAKTQSFRHSAKCQEAIVLFTDGA-AQ 357 Query: 352 DDSPLCHEILAKKLLPVVRYYSYIEITRRAHQTLWREYEHLQSTF 396 L E ++K +R +++ + + + F Sbjct: 358 YTEKLLAERNSEKK---IRIITFVVGPQFYDTVPIEKLTCEYNGF 399 >UniRef50_A5FB87 von Willebrand factor, type A n=1 Tax=Flavobacterium johnsoniae UW101 RepID=A5FB87_FLAJ1 Length = 2588 Score = 54.4 bits (129), Expect = 8e-06, Method: Composition-based stats. Identities = 23/164 (14%), Positives = 47/164 (28%), Gaps = 22/164 (13%) Query: 212 LRKEIAELRAKIERVPFIDTFDLRYKNYEKRPDPSSQAVMFCLMDVSGSMDQ-------S 264 L + I + D++ P + + + +D+SGSM + Sbjct: 32 LAQTITTNKTVTANAGNCGIIDVKVDITGANPI-TRNSDVVLAIDISGSMGNTISGDFKT 90 Query: 265 TKDMAKRFYILLYLFLS--RTYKNVEVVYIRHHT--------QAKEVDE--HEFFYSQET 312 + D AK + + V Y + A V + ++ Q T Sbjct: 91 SMDYAKDAALAFLNQAKANPQNRIAIVAYSTTASLKIGLTYLNATGVTQITNQINALQAT 150 Query: 313 GGTIVSSALKLMDEVVKERYNPA-QWNIYAAQASDGD-NWADDS 354 T + + + + ++ +DG N S Sbjct: 151 NSTNIYAGIVRSETELETNGRFDCSTARAIILLTDGVTNVTGTS 194 >UniRef50_A9A4V8 von Willebrand factor type A n=2 Tax=Thaumarchaeota RepID=A9A4V8_NITMS Length = 520 Score = 54.4 bits (129), Expect = 8e-06, Method: Composition-based stats. Identities = 28/278 (10%), Positives = 64/278 (23%), Gaps = 38/278 (13%) Query: 164 ISVVRSLQNSLARRTAMTAGKRRELHALEENLA---IISNSEPAQLLEEERLRKEIAELR 220 I L + + + + + + ++ Sbjct: 256 IDEYNVLLDEDKKTENKGLMPEAIGIQIPTTRNVDETVIYDMSLINGLKTKFKEWKTG-- 313 Query: 221 AKIERVPFIDTFD----LRYKNYEKRPDPSS-QAVMFCLMDVSGSMDQSTKDMAKRFYIL 275 K + V + FD + S + + L+D S S+ + K+ + Sbjct: 314 WKEQHVRSGEEFDEENYIEGNEPFFTDIKKSIKTKIVILLDHSSSISSDAIEY-KKATLA 372 Query: 276 LYLFLSRTYKNVEVVYIRHHTQAKEV---------------DEHEFFYSQETGGTIVSSA 320 L L Y V+ T+ + V G T ++ Sbjct: 373 LCEVL--AYLKVKFAVYAFSTENRSVVCWSIKPDNMKWNNVTAKRLAQIVANGSTPLAEV 430 Query: 321 LKLMDEVVKERYNPAQWNIYAAQASDGDNWADDSPLCHEILAKKLLPVVRYYSYIEITRR 380 M +++ + +DG+ D+ K L + Sbjct: 431 YDKMFPILQSKRPD-----ILLTLTDGEPSDPDAVRNMTKSLKSLGISMVALGLG----- 480 Query: 381 AHQTLWREYEHLQSTFDNFAMQHIRDQDDIYPVFRELF 418 + + + DI ++ Sbjct: 481 PNTVRATTIANNLRHLGYEKTMAVSRLRDIPNKVIKIL 518 >UniRef50_O00339 Matrilin-2 n=30 Tax=Euteleostomi RepID=MATN2_HUMAN Length = 956 Score = 54.4 bits (129), Expect = 8e-06, Method: Composition-based stats. Identities = 26/168 (15%), Positives = 53/168 (31%), Gaps = 27/168 (16%) Query: 235 RYKNYEKRPD--------PSSQAVMFCLMDVSGSMDQSTKDMAKRFYILLYLFLSRTYKN 286 R ++ P + +A + ++D S S++ K F + + FL Sbjct: 34 RGRHARTHPQTALLESSCENKRADLVFIIDSSRSVNTHDYAKVKEFIVDILQFLDIGPDV 93 Query: 287 VEVVYIRHHTQAK-EVDEHEFFYSQE-----------TGGTIVSSALKLMDEVV---KER 331 V +++ + K E F E + GT+ A++ + E Sbjct: 94 TRVGLLQYGSTVKNEFSLKTFKRKSEVERAVKRMRHLSTGTMTGLAIQYALNIAFSEAEG 153 Query: 332 YNPAQWN--IYAAQASDGDNWADDSPLCHEILAKKLLPVVRYYSYIEI 377 P + N +DG DS A+ ++ ++ Sbjct: 154 ARPLRENVPRVIMIVTDGR--PQDSVAEVAAKARDTGILIFAIGVGQV 199 >UniRef50_A1SAA4 Uncharacterized protein containing a von Willebrand factor type A (VWA) domain n=1 Tax=Shewanella amazonensis SB2B RepID=A1SAA4_SHEAM Length = 713 Score = 54.4 bits (129), Expect = 8e-06, Method: Composition-based stats. Identities = 25/171 (14%), Positives = 48/171 (28%), Gaps = 25/171 (14%) Query: 252 FCLMDVSGSMDQSTKDMAKRFYILLYLFLSRTYKNVEVVYI----RHHTQAKEVDEHEFF 307 ++D SGSM+ + + L L + +++ VD + Sbjct: 334 VFVLDKSGSMNGKYATLVEGVRQGL-GKLPAQDRFRIILFDESTQEFSKGFVPVDSNNIN 392 Query: 308 Y-------SQETGGTIVSSALKLMDEVVKERYNPAQWNIYAAQASDG-DNWADDSPLCHE 359 GT + LK + + +DG N Sbjct: 393 QALAWVEGISPGNGTDLYQGLKRALTPLDADRSTG-----VVLITDGVANVGVTEKRRFL 447 Query: 360 ILAKKLLPVVRYYSYIEITRRAHQTLWREYEHLQSTFDNFAMQHIRDQDDI 410 L ++ VR +++I + L L + + + DDI Sbjct: 448 ELMQQ--QDVRLFTFIMGNSA-NTPLLVPMTRLSNG----VATSVSNADDI 491 >UniRef50_D2S6Y8 von Willebrand factor type A n=1 Tax=Geodermatophilus obscurus DSM 43160 RepID=D2S6Y8_9ACTO Length = 248 Score = 54.4 bits (129), Expect = 8e-06, Method: Composition-based stats. Identities = 19/121 (15%), Positives = 33/121 (27%), Gaps = 11/121 (9%) Query: 235 RYKNYEKRPDPSSQAVMFCLMDVSGSMDQSTKDMAKRFYILLYLFLSRTYKNVEVVYIRH 294 R + R S+ L+D SGS+ +A L + + V + Sbjct: 65 RADHLHTRGWRSTGRACVLLVDASGSVAGDELAVAVLTASALAQRMRPGDELAVVAFWSK 124 Query: 295 HTQAKEVDE--------HEFFYSQETGGTIVSSALKLMDEVVKERYNPAQWNIYAAQASD 346 + + F + G T + L+ + A SD Sbjct: 125 AVVLRPLSADPRSDLVVDRLFDLRGGGTTDLDLGLRTAAAQLTRTRAAA---RDVLLLSD 181 Query: 347 G 347 G Sbjct: 182 G 182 >UniRef50_UPI0000EB12CB UPI0000EB12CB related cluster n=1 Tax=Canis lupus familiaris RepID=UPI0000EB12CB Length = 2186 Score = 54.4 bits (129), Expect = 8e-06, Method: Composition-based stats. Identities = 26/174 (14%), Positives = 49/174 (28%), Gaps = 19/174 (10%) Query: 251 MFCLMDVSGSMDQSTKDMAKRFYILLYLFLSRTYKNVEV----------VYIRHHTQAKE 300 + ++D SGS+ ++ I L V V V Sbjct: 794 IVFVLDHSGSIGTQEQESMMNLTIHLVKKADVDSDRVRVGALKYSDYPEVLFYLSGNKSA 853 Query: 301 VDEHEFFYSQETGGTIVSSALKLMDEVVKERY---NPAQWNIYAAQASDGDNWADDSPLC 357 V EH +G T + AL+ + + E Y +DG + D+ Sbjct: 854 VIEHLRRRRYTSGHTYTARALEHANIMFTEEYGSRIQQNVKQMLIIITDGVSHDRDNLSD 913 Query: 358 HEILAKKLLPVVRYYSYIEITRRAHQTLWREYEHLQSTFDNFAMQHIRDQDDIY 411 + + + +Q + + F + + + DIY Sbjct: 914 TASKLRNKGINIYAVGVGQA----NQLELETMA--GNKSNTFHVDNFSNLKDIY 961 >UniRef50_Q5NWS4 Tellurium resistance protein n=8 Tax=Bacteria RepID=Q5NWS4_AZOSE Length = 214 Score = 54.4 bits (129), Expect = 8e-06, Method: Composition-based stats. Identities = 22/149 (14%), Positives = 49/149 (32%), Gaps = 24/149 (16%) Query: 246 SSQAVMFCLMDVSGSMDQSTKDMAKRFYILLYLFLSRT---YKNVEVVYIRHHTQAKEV- 301 S + ++ L+D SGSM + + L + + V + +Q K+V Sbjct: 4 SRRLPVYLLIDTSGSMRGEPVESVNVGLRAMQTSLRQNPYAIETVHLSVTTFDSQIKDVL 63 Query: 302 DEHEFFYSQ-------ETGGTIVSSALKLMDEVVKERYN------PAQWNIYAAQASDGD 348 + +G T++ AL+ + + K+ W +DG Sbjct: 64 PLTALEDATIPEIVCPASGATLLGEALEHILDRAKKEVRQSSAEQKGDWAPLLFIMTDGK 123 Query: 349 NWADDSPLCHEILAKKLLPVVRYYSYIEI 377 + ++ P ++ + + I Sbjct: 124 PTDT-------FVFNQVAPAIKAFKFGSI 145 >UniRef50_D0IVS5 Putative uncharacterized protein n=3 Tax=Bacteria RepID=D0IVS5_COMTE Length = 618 Score = 54.4 bits (129), Expect = 8e-06, Method: Composition-based stats. Identities = 44/263 (16%), Positives = 78/263 (29%), Gaps = 35/263 (13%) Query: 119 ISKDEYLDLLFEDLA-LPNLKQNQQRQLTEYKTHRAGYTANGVPANISV--VRSLQNSLA 175 +S +E D + DL + L+ Q + + G G +I L L Sbjct: 354 LSAEEVYDRIAGDLRRMRKLRTLAGGQGDMLERNVGGQRKAGDYTDIDEFCRTQLGKGLL 413 Query: 176 R--RTAMTAGKRRELHALEENLAIISNSEPAQLLEEERLRKEIAELRAKIERVPFIDTFD 233 R + A +EE A++ + + L + +E Sbjct: 414 RHEQDARGLLP---AGLIEEIRALL----QPPIDWQVELARWFDHHFPPVETRRSYARIS 466 Query: 234 LRYKNYEKRPDPSSQA--------VMFCLMDVSGSMDQSTKDMAKRFYILLYLFLSRTYK 285 R P P QA L+D SGSM++ +AK + + Sbjct: 467 RRQSATPDIPRPRIQADSRWLEGRTFGVLLDTSGSMERH--VLAK-ALGAIASYADAKD- 522 Query: 286 NVEVVYIRHHTQAKE---VDEHEFFY---SQETGGTIVSSALKLMDEVVKERYNPAQWNI 339 V I A + + + + GGT++ + L+ + + + Sbjct: 523 VPAVRLICCDAAAYDLGYLPAADIAQRIALKGRGGTVLQPGVDLLLQ--DDDFPKDGP-- 578 Query: 340 YAAQASDGDNWADDSPLCHEILA 362 +DG P H L Sbjct: 579 -ILIITDGQCDQLRVPREHAYLL 600 >UniRef50_C8PVC3 von Willebrand factor type A domain protein n=1 Tax=Enhydrobacter aerosaccus SK60 RepID=C8PVC3_9GAMM Length = 260 Score = 54.4 bits (129), Expect = 8e-06, Method: Composition-based stats. Identities = 25/179 (13%), Positives = 56/179 (31%), Gaps = 29/179 (16%) Query: 213 RKEIAELRAKIERVPFIDTFDL--RYKNYEKRPDPSSQAVMFCLMDVSGSM-------DQ 263 I + E P L +++ + + + + + ++D+SGSM D+ Sbjct: 4 HNPIPNFQNPAESTPPSIPQALTPQFREIDYGNNVAQRTLCVLVLDLSGSMAIRSGNGDK 63 Query: 264 STKDMAKRFYILLYLFL------SRTYKNVEVVYIRHHTQAKEVDE--HEFF----YSQE 311 DM Y L + V+ + A+ + + +E Sbjct: 64 RRIDMLNEGIEAFYHDLMKDETARNRVRLAIVIVGGVNDTAELMMDWTDAIDFFPIKFRE 123 Query: 312 TGGTIVSSALKLMDEVVKERYNPAQWN------IYAAQASDGDNWADDSPLCHEILAKK 364 G T + + L ++++ + N + +DG DS + + Sbjct: 124 NGMTPLGQGMLLALNLIEQERINLRDNGINYTRPWVIAMTDG--LPTDSQDVWQAAINQ 180 >UniRef50_C3XQR8 Putative uncharacterized protein n=1 Tax=Branchiostoma floridae RepID=C3XQR8_BRAFL Length = 2411 Score = 54.4 bits (129), Expect = 8e-06, Method: Composition-based stats. Identities = 26/176 (14%), Positives = 49/176 (27%), Gaps = 42/176 (23%) Query: 237 KNYEKRPDPSSQAVMFCLMDVSGSMDQST-------KDMAKRFYILLYLFLSRTYKNVEV 289 +N+ + + ++DVSGSM + ++AK+ + + L+ V Sbjct: 127 RNWYVSAASPKKKNVVIVIDVSGSMREPPGPEEQNRLNLAKQAALTVLDTLTPRDWGGVV 186 Query: 290 VYIRHHTQAKEVDEHEF-FYSQETGGTIV------------------SSALKLMDEVVKE 330 + E E E T + + ++ E Sbjct: 187 SFSARA----ETPEGCLGDSLGEANPTNIGIMQDFINQRVPETITMYGVGFRKAFDMFAE 242 Query: 331 RYNP------AQWNIYAAQASDGDNWADDSPLCHEILAKKLL---PVVRYYSYIEI 377 N +NI SDG D + + K V ++Y Sbjct: 243 ARNKKPEQFEDCYNIIIF-LSDG--SPTDKAFALDEITKGQELMDRSVYIFTYGLG 295 >UniRef50_A7RPC2 Predicted protein (Fragment) n=2 Tax=Eumetazoa RepID=A7RPC2_NEMVE Length = 930 Score = 54.4 bits (129), Expect = 8e-06, Method: Composition-based stats. Identities = 20/183 (10%), Positives = 53/183 (28%), Gaps = 29/183 (15%) Query: 239 YEKRPDPSSQAVMFCLMDVSGSMDQSTKDMAKRFYILLYLFLSRTYKNVEVVYIRHH--- 295 E P + ++D S SM AK+ ++ + + VV+ ++ Sbjct: 644 PEFESGPPHHVEVIFVLDASCSMKGKALQEAKKLTLMCLSLMEEEWAFNIVVFGSNYSEL 703 Query: 296 -TQAKEVDEHEFFY--------SQETGGTIVSSALKLMDEVVKERYNPAQWNIYAAQASD 346 TQ+++ D+ G T + L+ + +++ + +D Sbjct: 704 FTQSQKKDKETVARAAKFVKSVKAVKGSTDLWRVLRSLY-LLRCNSTAD-YPSNVFLFTD 761 Query: 347 GDNWADDSPLCHEILAKKLLPVVRYYSYIEITRRAHQTLWREYEHLQSTFDNFAMQHIRD 406 G + + P ++ R + + ++ Sbjct: 762 G---HVTEESTTLAYIRDIRPTC------------NRHFLRSMARVGAGAYELFDSKVKS 806 Query: 407 QDD 409 + + Sbjct: 807 KWE 809 >UniRef50_C1V7M6 von Willebrand factor type A-like protein n=1 Tax=Halogeometricum borinquense DSM 11551 RepID=C1V7M6_9EURY Length = 402 Score = 54.4 bits (129), Expect = 9e-06, Method: Composition-based stats. Identities = 46/310 (14%), Positives = 89/310 (28%), Gaps = 31/310 (10%) Query: 103 GQASQDGEGQDEFVFQISKDEYLDLLFEDLALPNLKQ----------NQQRQLTEYKTHR 152 A E V S E +L D+ P+ + L+ Sbjct: 66 MAADGLTEWTPGPVRTSSSVELGELRSIDVDPPDEDEGAAVIDDVEALDSEDLSSEIGEI 125 Query: 153 AGYTANG-VPANISVVRSLQNSLARRTAMTAG--KRRELHALEENLAIISNSEPAQLLEE 209 G G + ++ RS +N + +A R+ A + + EE Sbjct: 126 GGIHRGGKATSRLADRRSARNHRVDNYSGSAPASDVRDELRAGGTAAAVVREFKRLVAEE 185 Query: 210 ERLRKEIAELRAKIERVPFIDTFDLRYKNY--EKRPDPSSQAVMFCLMDVSGSMDQSTKD 267 R+ + E + F+ D + + + + V+ +D+SGSM + D Sbjct: 186 IRVTDRVGERPDVRNVIRFLAGDDTVFDDLWSRTETNDTGDRVVGIALDMSGSMGNAEHD 245 Query: 268 MAKRFYILLYLFLSRTYKNVEVVYIRHHTQAKEVDE-------HEFFYSQETGGTIVSSA 320 L + V + + ++ V ++ GGT + A Sbjct: 246 AKAAVGALALAASAVDDDVVITAFPQGKRKSALVTGPYESWRWQHLDATEPGGGTPMLPA 305 Query: 321 LKLMDEVVKERYNPAQWNIYAAQASDGDNWADDSPLCHEILAKKLLPVVR-YYSYIEITR 379 L+ +V + +DG D L + L + Sbjct: 306 LR---DVTRLMRPLKGRERVLFAITDGRPSRADRVAE---LVEDLRTYGTAVVGFGF--G 357 Query: 380 RAHQTLWREY 389 R +++ E Sbjct: 358 RVNESTLEEL 367 >UniRef50_Q58221 Uncharacterized protein MJ0811 n=4 Tax=Methanocaldococcus RepID=Y811_METJA Length = 439 Score = 54.4 bits (129), Expect = 9e-06, Method: Composition-based stats. Identities = 29/226 (12%), Positives = 60/226 (26%), Gaps = 49/226 (21%) Query: 137 LKQNQQRQLTEYKTHRAGYTANGVPANIS--VVRSLQNSLARRTAMTAGKRRELHALEEN 194 L+ + R++ + N + I + R + +E+ L + Sbjct: 186 LQNKKIREIVKKLGKLRLLAINEYKSKIKHYSGEIYSTKIGR--DLKHLLPKEIVNLSDE 243 Query: 195 LAIISNSEPAQLLEEERLRKEIAELRAKIERVPFIDTFDLRYKNYEKRPDPSSQAVMFCL 254 + + + +D++ + + L Sbjct: 244 ILYYD--------------------FLRRFVDKKLLIYDIQ------NKLEKQKGPIIIL 277 Query: 255 MDVSGSMDQSTKDMAKRFYILLYLFLSRTYKNVEVVYIRHHTQAK-------------EV 301 +D SGSM + K + + R + ++ YI + E+ Sbjct: 278 LDHSGSMYGDREIWGKAVALSIIEIAKRENR--DIYYIAFDDGVRFEKKINPKTITFDEI 335 Query: 302 DEHEFFYSQETGGTIVSSALKLMDEVVKERYNPAQWNIYAAQASDG 347 E GGT L ++KE N +DG Sbjct: 336 IE--IASLYFGGGTNFIMPLNRAMSIIKEH--ETFKNADILLITDG 377 >UniRef50_A7NJ01 von Willebrand factor type A n=2 Tax=Roseiflexus RepID=A7NJ01_ROSCS Length = 972 Score = 54.4 bits (129), Expect = 9e-06, Method: Composition-based stats. Identities = 23/187 (12%), Positives = 52/187 (27%), Gaps = 25/187 (13%) Query: 240 EKRPDPSSQAVMFCLMDVSGSMDQ-------STKDMAKRFYILLYLFLSRTYKNVEVVYI 292 + ++D SGSM + + D+AK L L+ + VV+ Sbjct: 400 PLDTKQQPDLALVMVIDRSGSMSELVGGSRRNRLDLAKEAVYQASLGLTPIDQVGLVVFD 459 Query: 293 RHHT--------QAKEVDEHEFFYSQETGGTIVSSALKLMDEVVKERYNPAQWNIYAAQA 344 + E GGT + ++ + + + Sbjct: 460 DAANWVLPLQRLPSVVEIERALGSFGIGGGTNIRPGIEQAAQALASADAKVKH---VILL 516 Query: 345 SDGDNWADDSPLCHEILAKKLLPVVRYYSYIEITRRAHQTLWREYEHLQSTFDNFAMQHI 404 +DG A+ + + + + E + + + ++ + I Sbjct: 517 TDG--IAESNYSDLIAQMRAAGVTISTVAIGEDA-NPN---LVDVANAGGGR-SYRVTRI 569 Query: 405 RDQDDIY 411 D I+ Sbjct: 570 EDVPRIF 576 Score = 42.1 bits (97), Expect = 0.039, Method: Composition-based stats. Identities = 21/126 (16%), Positives = 42/126 (33%), Gaps = 10/126 (7%) Query: 245 PSSQAVMFCLMDVSGSMDQSTKDMAKRFYILLYLFLSRTYKNVEVVYIRHHTQAKEV--- 301 P + L+DVS SM + ++ A ++ + + VV+ + + Sbjct: 63 PVRELTTVFLVDVSDSMTPAQRERALQYVNDALAAMPPGDQAAVVVFGDNALVERAPGPI 122 Query: 302 -DEHEFFYSQETGGTIVSSALKLMDEVVKERYNPAQWNIYAAQASD-GDNWADDSPLCHE 359 T T + A++L + PA+ SD G+N + Sbjct: 123 GPLSRLTSVPITTRTNLQEAVQLGLAL-----FPAETQKRLVLISDGGENAGRVADAAQL 177 Query: 360 ILAKKL 365 +K+ Sbjct: 178 AAIRKV 183 >UniRef50_A2BM85 Conserved archaeal protein n=1 Tax=Hyperthermus butylicus DSM 5456 RepID=A2BM85_HYPBU Length = 439 Score = 54.4 bits (129), Expect = 9e-06, Method: Composition-based stats. Identities = 33/194 (17%), Positives = 64/194 (32%), Gaps = 18/194 (9%) Query: 173 SLARRTAMTAGKRR--ELHALEENLAIISNSEPAQLLEEERLRKEIAELRAKIERVP--- 227 +LAR T + + + E + P L+ L +I + A +P Sbjct: 191 NLARNTDVKVLLEALKTIESTEAYIRTRKIRSPRGELDGYELGSDIERVVASELALPTDL 250 Query: 228 FIDTFDLRYKNYEKRPDPSSQAVMFCLMDVSGSMDQSTKDMAKRFYILLYLFLSRTYKNV 287 F+ F R K+ + L+D SGSM AK + L R + Sbjct: 251 FLLKFAERNLLLYKKVVSEEYGKFYVLLDKSGSMMGMKIIWAKAVALALAQRAIREKREF 310 Query: 288 EVVYI------------RHHTQAKEVDEHEFFYSQETGGTIVSSALKLMDEVVKERYNPA 335 + + R H + + GGT ++ A+ + + + + Sbjct: 311 YIRFFDSIPYPPLYIPKRVHGRDVVKLLEYVARIRANGGTDITRAILTAVDDIATKLQRS 370 Query: 336 QWNIYAAQASDGDN 349 + + +DG++ Sbjct: 371 KVSDIIL-ITDGED 383 >UniRef50_C4IH33 von Willebrand factor type A domain protein n=1 Tax=Clostridium butyricum E4 str. BoNT E BL5262 RepID=C4IH33_CLOBU Length = 1336 Score = 54.4 bits (129), Expect = 9e-06, Method: Composition-based stats. Identities = 20/155 (12%), Positives = 47/155 (30%), Gaps = 34/155 (21%) Query: 222 KIERVPFIDTFDLRYKNYEKRPDPSSQAVMFCLMDVSGSMD-------------QSTKDM 268 +IE I DL+ ++ + + + ++D SGSM+ Sbjct: 483 EIEFTYKISAEDLQIEDINE----EVKKDIVLILDTSGSMNFNFYNDSIPYNEKDKRIYS 538 Query: 269 AKRFYILLYLFLSRTY--KNVEVVYIRHHTQA-------------KEVDEHEFFYSQETG 313 K+ + + + Y + A K+ E+ + G Sbjct: 539 LKQSAKQFINKFNNKDNIRIGIIPYSYYSGYANNIKQLTEINDNNKKSYENYIDNIKVEG 598 Query: 314 GTIVSSALKLMDEVVKERYNPAQWNIYAAQASDGD 348 T ++ +++ ++ Y +DG+ Sbjct: 599 ATNQGDGIREAGKMLLNTDGNSK--KYVILITDGE 631 >UniRef50_A8EWS0 Putative uncharacterized protein n=1 Tax=Arcobacter butzleri RM4018 RepID=A8EWS0_ARCB4 Length = 1866 Score = 54.4 bits (129), Expect = 9e-06, Method: Composition-based stats. Identities = 24/158 (15%), Positives = 38/158 (24%), Gaps = 31/158 (19%) Query: 225 RVPFIDTFDLRYKNYEKRPDPSSQAVMFCLMDVSGSMDQS------------TKDMAKRF 272 I D + + + +D SGSM ++ D+ K Sbjct: 1358 AGKDILIGDTGGTQLNVQAGKNYNIALV--VDTSGSMKEASGSKTAWGTTISRIDLLKDA 1415 Query: 273 YILLYLFLSRTYKNVEVVYIRHHTQAKEVDE-------------HEFFYSQETGGTIVSS 319 L L + V I T AKE + + GGT Sbjct: 1416 LKNLADSLKGHDGKINVSIIDFDTNAKEPITFNDLTSKNISDLITKIDALKAEGGTNYED 1475 Query: 320 ALKLMDEVVKERY----NPAQWNIYAAQASDGDNWADD 353 A + + +DGD + Sbjct: 1476 AFLKTTSWFDTQSVTYGKAQGYENLTYFLTDGDPTFSN 1513 >UniRef50_A3I2X5 Putative batB protein n=1 Tax=Algoriphagus sp. PR1 RepID=A3I2X5_9SPHI Length = 321 Score = 54.0 bits (128), Expect = 9e-06, Method: Composition-based stats. Identities = 24/214 (11%), Positives = 60/214 (28%), Gaps = 38/214 (17%) Query: 240 EKRPDPSSQAVMFCLMDVSGSMDQSTK--DMAKRFYILLYLFLSRTYKNVEVVYIRHHTQ 297 + +F +D+S SM+ + +R L L++++ + + I ++ Sbjct: 69 SVKEIKEEGKDIFLAVDLSQSMNATDIGPSRLQRIKFEL-KELTKSFPSDRIGLIIFSSE 127 Query: 298 A---------KEVDEHEFFYSQET----GGTIVSSALKLMDEVVKERYNPAQWNIYAAQA 344 A + V + GT +++ L++ + + + + Sbjct: 128 AFMQCPLTFDQSVLQLYIDGLNTGLVPNFGTDLNAPLRIALDRFQNDESQEVKSKSVILI 187 Query: 345 SDGDNWADDSPLCHEILAKKLLPVVRYYSYIEITRR---------------------AHQ 383 SDG+N+ D K L V + + Sbjct: 188 SDGENFG-DELENIGSELKNLGVKVFALGIGTESGSTIPRGNGIVMDPQTGEPAQTVLDK 246 Query: 384 TLWREYEHLQSTFDNFAMQHIRDQDDIYPVFREL 417 ++ +++ D+ L Sbjct: 247 RPLQQIAAETDGQYFEISDEVQEVADLIKRLERL 280 >UniRef50_A3HZP7 BatA protein n=1 Tax=Algoriphagus sp. PR1 RepID=A3HZP7_9SPHI Length = 347 Score = 54.0 bits (128), Expect = 9e-06, Method: Composition-based stats. Identities = 36/216 (16%), Positives = 65/216 (30%), Gaps = 41/216 (18%) Query: 237 KNYEKRPDPSSQAVMFCLMDVSGSMDQST-----KDMAKRFYILLYLFLSRTYKNVEVVY 291 K+ E+ + + +MD+S SMD + AK I + VV+ Sbjct: 95 KSNERVEQFTEGIDIMLVMDISESMDLQDFKPNRLEAAKATAIDFING-RFGDRIGMVVF 153 Query: 292 IRHHTQAKEVDE---------HEF-FYSQETGGTIVSSALKLMDEVVKERYNPAQWNIYA 341 + + F E GT + SA+ +KE + ++ Sbjct: 154 AGEAYSLAPLTNDYKLLTDLIQDISFNMMEAKGTAIGSAIASATNRMKESESASK---VL 210 Query: 342 AQASDGD-NWADDSPLCHEILAKKLLPVVRYYSYIE----------------ITRRAHQT 384 SDG+ N + PL LA L + + + + +T Sbjct: 211 ILLSDGESNAGNVDPLFAAQLASALDIKIYTIAVGKDGMVPYGTDFFGRPQMVESYLDET 270 Query: 385 LWREYEHLQSTFD-----NFAMQHIRDQDDIYPVFR 415 RE + + + +I D+ D Sbjct: 271 NLREIAKIGNGEFFRASDGGTLNNIFDRIDTMEKAE 306 >UniRef50_C3JDN5 von Willebrand factor, type A n=2 Tax=Rhodococcus erythropolis RepID=C3JDN5_RHOER Length = 551 Score = 54.0 bits (128), Expect = 1e-05, Method: Composition-based stats. Identities = 37/211 (17%), Positives = 61/211 (28%), Gaps = 39/211 (18%) Query: 237 KNYEKRPDPSSQAVMFCLMDVSGSMD---QSTKDMAKRF-----------------YILL 276 K + S + ++DVSGSMD MA Sbjct: 338 KALARYEILSRPSRALAVVDVSGSMDYMQDGVTRMAATAQAGDIAIRMFPANAQLGLWAF 397 Query: 277 YLFLSRTYKNVEVVYIRH------HTQAKEVDEHEFFYSQE--TGGTIVSSALKLMDEVV 328 + L E+ + T + GGT + ++ + Sbjct: 398 SIDLGEGTDYRELEPVARMDATEGDTDHRSKLLSRIDSLSSIVGGGTGLYDSVLAAYRSM 457 Query: 329 KERYNPAQWNIYAAQASDGDNWADDS---PLCHEILAKKLLPV--VRYYSYIEITRRAHQ 383 ++ Y+PA N +DG N S + L ++ P V + +T A Sbjct: 458 QQTYDPASINSVIL-LTDGANDDPSSISLQELLDTLTREQDPTRPVPIITIG-VTDDADT 515 Query: 384 TLWREYEHLQSTFDNFAMQHIRDQDDIYPVF 414 + + L +FA DI VF Sbjct: 516 DVLEQISALTGGNSHFAPT----PADIPKVF 542 >UniRef50_D2W671 Predicted protein n=1 Tax=Naegleria gruberi RepID=D2W671_NAEGR Length = 518 Score = 54.0 bits (128), Expect = 1e-05, Method: Composition-based stats. Identities = 31/195 (15%), Positives = 53/195 (27%), Gaps = 28/195 (14%) Query: 228 FIDTFDLRYKNYEKRPDPSSQA-----VMFCLMDVSGSMDQSTK---------DMAKRFY 273 F + D P P Q + D SGSM S D + Sbjct: 304 FKASVDFERHLINNAPPPGPQISSEVFHFIFVNDKSGSMGGSDARPTSSKYSNDRLGALF 363 Query: 274 ILLYLFLSRTYKNVEVV---------YIRHHTQAKEVDE-HEFFYSQETGGTIVSSALKL 323 FL + ++V Y T GGT ++A++ Sbjct: 364 ESCEKFLEVRDGSSDLVSCIMYDHSAYNCFTTNPLSTSLVSTMSSYVAGGGTSFTNAMQS 423 Query: 324 MDEVVKERYNP-AQWNIYAAQASDGDNWADDSPLCHEILAKKLLPVVRYYSYIEITRRAH 382 + ++ Y + I SDG++ AD++ L ++ + Sbjct: 424 VSSLISSTYPNHQSYKIVVLFMSDGEDSADEAVSITGQLVSSHDIILHTIQLG---GSSD 480 Query: 383 QTLWREYEHLQSTFD 397 T R+ Sbjct: 481 NTGLRQMAATGRGQF 495 >UniRef50_A0KS56 Putative outer membrane adhesin like proteiin n=2 Tax=Proteobacteria RepID=A0KS56_SHESA Length = 5839 Score = 54.0 bits (128), Expect = 1e-05, Method: Composition-based stats. Identities = 34/249 (13%), Positives = 59/249 (23%), Gaps = 58/249 (23%) Query: 153 AGYTANGVPANISVVRSLQNSLARRTAMTAGKRRELHALEENLAIISNSEPAQLLEEERL 212 G T + V + ++AR T L +E + Sbjct: 5208 TGVTIRVPGTSAGQVDLVVEAVARE-KGTDLTNSATGEDTIRLDYFKGTEGEPGDQNVNF 5266 Query: 213 RKEIAELRAKIERVPFIDTFDLRYKNYEKRPDPSSQAVMFCLMDVSGSMDQSTKDMAKRF 272 E I DL P + ++D SGS++ T + K Sbjct: 5267 GSE-----------HNIVVGDLDGSIV----LPGQNYNIAFMVDSSGSINADTLETMKLQ 5311 Query: 273 YILLYLFLS-----RTYKNVEVVYIRHHTQAK-------------EVDEHEFFYSQETGG 314 + L + V++ + T AK ++ + GG Sbjct: 5312 LAQVLETLKDSASGQQSGTVKIFLVDFDTHAKGSISVDLSDPNALDILQDALDDMSSGGG 5371 Query: 315 TIVSSALKL-----MDEVVKERYNPAQWNIYAAQASDG----------------DNWADD 353 T + + N A +DG D W + Sbjct: 5372 TNYEDVFTTTANWFANGDAQNNS--GATN-LAFFITDGKPTYYNAVPGGNPLVYDTWGSN 5428 Query: 354 SPLCHEILA 362 + L Sbjct: 5429 NDRSLSQLI 5437 >UniRef50_A7RV93 Predicted protein n=2 Tax=Nematostella vectensis RepID=A7RV93_NEMVE Length = 1118 Score = 54.0 bits (128), Expect = 1e-05, Method: Composition-based stats. Identities = 21/107 (19%), Positives = 36/107 (33%), Gaps = 18/107 (16%) Query: 251 MFCLMDVSGSMDQSTKDMAKRFYILLYLFLSRTYKNVEVVYIRHHTQAKEVDEHEFFYSQ 310 + ++D S SM MAK + L++ + H + EV Sbjct: 242 IVIILDCSLSMKGKRLRMAKEIAKTVLNTLTKQDFVNVIC--GHASNWDEV--------- 290 Query: 311 ETGGTIVSSALKLMDEVVKERYNPAQWNIYAAQASDGDNWADDSPLC 357 +A K E++K R +I +DG++ D C Sbjct: 291 ------GKAAFKKAFELLKGRAKTGCQSIIIF-VTDGEDNDGDPVRC 330 >UniRef50_B2SKX9 von Willebrand factor type A domain protein n=13 Tax=Xanthomonadaceae RepID=B2SKX9_XANOP Length = 335 Score = 54.0 bits (128), Expect = 1e-05, Method: Composition-based stats. Identities = 38/264 (14%), Positives = 72/264 (27%), Gaps = 39/264 (14%) Query: 189 HALEENLAIISNSEPAQLLEEERLRKEIAELRAKIERVPFIDTFDL-RYKNYEKRPDPSS 247 + + +E Q + + + + R F+ L R + + P Sbjct: 37 RRADAAALRVPYAEQLQAVAQAKTTPSLRMPRWLAWLGWFLLCAALARPQQLGEVIQPPR 96 Query: 248 QAV-MFCLMDVSGSMDQSTKDM-------AKRFYILLYLFLSRTY-KNVEVVYIRHHTQA 298 +A M +D+SGSM++ + +L FL R V ++ A Sbjct: 97 EARQMMLAVDLSGSMNEPDMVLGGKVVDRLTAAKAVLSDFLDRRDGDRVGLLVFGQRAYA 156 Query: 299 KEVDEHEFFYSQ----------ETGGTIVSSALKLMDEVVKERYNPAQWNIYAAQASDGD 348 + + T + A+ L + ++E Q +DG Sbjct: 157 LTPLTADLTSVRDQLRDSVVGLAGRETAIGDAIALSVKRLRE---QKQGQRVVVLLTDGV 213 Query: 349 NW-ADDSPLCHEILAKKLLPVVRYYSY--------------IEITRRAHQTLWREYEHLQ 393 N PL LAK + ++ + R+ Sbjct: 214 NTAGVLDPLKAAELAKAEGVRIYTIAFGGGGGYSLFGVPIPAGGNDDIDEDGLRKIAQQT 273 Query: 394 STFDNFAMQHIRDQDDIYPVFREL 417 F + + IY L Sbjct: 274 GGRF-FRARDTEELAGIYAELDRL 296 >UniRef50_B4D6B0 von Willebrand factor type A n=1 Tax=Chthoniobacter flavus Ellin428 RepID=B4D6B0_9BACT Length = 879 Score = 54.0 bits (128), Expect = 1e-05, Method: Composition-based stats. Identities = 28/182 (15%), Positives = 46/182 (25%), Gaps = 41/182 (22%) Query: 249 AVMFCLMDVSGSM-----DQSTKDMAKRFYILLYLFLSRT--------YKNVEVVYIRHH 295 M ++D SGSM Q+ +A + + L V Sbjct: 421 VAMLVVLDRSGSMTAAVAGQTKISLADQGAVFAMNALQPKDYFGVVAVDTKPHTVVPLAP 480 Query: 296 TQAKEVDEHEFFYSQETGG-----TIVSSALKLMDEVVKERYNPAQWNIYAAQ------- 343 AK E + GG T + A + + ++ PA+ Sbjct: 481 ISAKGAAEQKILSITAGGGGIYIYTSMVEAFQQLRDI------PARVKHLLLFSDAADAE 534 Query: 344 ------ASDGDNWADDSPLCHEILAKKLLPVVRYYSYIEITRRAHQT-LWREYEHLQSTF 396 SDG +S + L + T + T R+ S Sbjct: 535 EKAAGEMSDGIRTGGNSLDLASAM---LAAKITTSVVGLGTEQDKDTPFLRQLAERGSGR 591 Query: 397 DN 398 Sbjct: 592 FY 593 >UniRef50_D0LKC8 von Willebrand factor type A n=1 Tax=Haliangium ochraceum DSM 14365 RepID=D0LKC8_HALO1 Length = 344 Score = 54.0 bits (128), Expect = 1e-05, Method: Composition-based stats. Identities = 27/180 (15%), Positives = 46/180 (25%), Gaps = 26/180 (14%) Query: 238 NYEKRPDPSSQAVMFCLMDVSGSM-----DQSTKDMAKRFYILLYLFLSRTYKNVEVVYI 292 E A + ++DVS SM + AK L L + V + Sbjct: 84 RTETVTSSEVSADIMVVLDVSRSMLADDVAPTRLARAKAEVAELSSALRGH-RIGLVAFA 142 Query: 293 R-HHTQAKEVDEHEFFYS---------QETGGTIVSSALKLMDEVVKERYNPAQWNIYAA 342 A ++ FF GGT + AL+ ++P Sbjct: 143 GRASVLAPLTPDYGFFRMILDGVDTKSVSRGGTEIGQALRKAV----RSFDPGPGAKMIL 198 Query: 343 QASDGDNWADDSPLCHEILAKKLLPVVRYYSYIEITRR-----AHQTLWREYEHLQSTFD 397 +DG++ + + + VV + T R Sbjct: 199 LITDGEDHGGYAEDAAREALEAGVRVVAI-GFGSEQGSQITLVDPDTGARALLTDGDGAP 257 >UniRef50_C3Y507 Putative uncharacterized protein n=1 Tax=Branchiostoma floridae RepID=C3Y507_BRAFL Length = 306 Score = 54.0 bits (128), Expect = 1e-05, Method: Composition-based stats. Identities = 25/159 (15%), Positives = 50/159 (31%), Gaps = 21/159 (13%) Query: 247 SQAVMFCLMDVSGSMDQSTKDMAKRFYILLYLFLSRTY------KNVEVVYIR------- 293 ++ + L+D+SGS+ + K + L ++ + V + Sbjct: 102 NKRDLVLLLDMSGSIGSTDFTSLKTYVAELLSYICPENEMGTFHRVAVVTFSSSVVLNFN 161 Query: 294 -HHTQAKEVDEHEFFYSQ-ETGGTIVSSALKLMDEVV---KERYNPAQWNIYAAQASDGD 348 H + + E G T + A+ + V + ++ +DG Sbjct: 162 FHEATSLGQIQASIHSLPYEGGSTRTADAINFVRTQVFQTGNYRDEPDVDLEVLLITDGH 221 Query: 349 -NWADDSPLCHEILAKKLLPVVRYY--SYIEITRRAHQT 384 N A +SP E+ A+ L + Y A Sbjct: 222 PNGAGNSPQDVELAAEALGERANIFALGYGSAYSSASDF 260 >UniRef50_A9UV27 Predicted protein n=2 Tax=Eukaryota RepID=A9UV27_MONBE Length = 3700 Score = 54.0 bits (128), Expect = 1e-05, Method: Composition-based stats. Identities = 49/319 (15%), Positives = 90/319 (28%), Gaps = 43/319 (13%) Query: 116 VFQISKDEYLDLLFEDLALPNLKQ------NQQRQLTEYKTHRAGYTANGVPANISVVRS 169 +Q++ + + L E LP+ + + L G + I+ R Sbjct: 3258 EYQLAIQDTVADLNE---LPDFIKWSQEVGKEVVALRATADKATGEIKT-RQSTITWQRD 3313 Query: 170 LQNSLARRTAMTAGKRRELHA---LEENLAIISNSEPAQLLEEERLRKEIAELRAKIERV 226 L R A+ + R +EE L E + + +L E Sbjct: 3314 LLVVRDRALALEKVQERVAVLEPMVEERLN---------QAERKAFGA-VDKLTKDYEAG 3363 Query: 227 PFIDTFDLRYKNYEKRPDPSSQAVM------FCLMDVSGSMDQSTKDMAKRFYILL---- 276 + ++ S A + +D SGSM D Y Sbjct: 3364 KIRTPGLFHFCKFQACTRVKSAADVAVGQGSLFGLDESGSMGGDNWDALLNSYSSFMQSR 3423 Query: 277 -YLFLSRTYKNVEVVYI--RHHTQAKEVDEH--EFFYSQETGGTIVSSALKLMDEVVKER 331 + L+ + V Y T AK F GGT + A++ + + + Sbjct: 3424 VHDELNLADRVTVVQYSDRARTTLAKASMREAAAFVPQMNGGGTDFNVAIQELR--GQGK 3481 Query: 332 YNPAQWNIYAAQASDGDNWADDSPLCHEILAKKLLPVVRY-YSYIEITRRAHQTLWREYE 390 + SDG + D E + +L + + + + A E Sbjct: 3482 TMSNIFRPVLIFMSDGQAY--DPRTELERMKAELPHMSSFMVALGPNAQVAVLQGMAELA 3539 Query: 391 HLQSTFDNFAMQHIRDQDD 409 ++ M + D Sbjct: 3540 GVREGVSEPEMLKLLSFGD 3558 >UniRef50_D2QTD7 von Willebrand factor type A n=1 Tax=Spirosoma linguale DSM 74 RepID=D2QTD7_9SPHI Length = 359 Score = 54.0 bits (128), Expect = 1e-05, Method: Composition-based stats. Identities = 30/193 (15%), Positives = 53/193 (27%), Gaps = 36/193 (18%) Query: 240 EKRPDPSSQAVMFCLMDVSGSMDQSTKD-----MAKRFYILLYLFLSRTYKNVEVVYIRH 294 E R + S + MDVS SM +S A+R R + V++ Sbjct: 104 ELREEQSEGIDIMLAMDVSVSMSESDILPTRLAAARRVAQAFVRG-RRNDRIGLVIFAGE 162 Query: 295 H------TQAKEVDEHEFFYSQET----GGTIVSSALKLMDE-------------VVKER 331 T + + GT + AL K Sbjct: 163 AFSLCPLTTDYNLLNQYLNDLNDGMIRTSGTAIGDALARCINRMRDRPAASSDTTQAKTE 222 Query: 332 YNPAQWNIYAAQASDGDNWADD-SPLCHEILAKKLLPVVRYYSYIE------ITRRAHQT 384 ++ + SDGDN A + P+ LAK + + + + Sbjct: 223 QWKSERSKVIILLSDGDNTAGNLDPITAASLAKAFNIKIYTIAVGQPVASASEASTVDEG 282 Query: 385 LWREYEHLQSTFD 397 + ++ + Sbjct: 283 ILKKIATIGKGSF 295 >UniRef50_UPI0000E48E5C PREDICTED: similar to calcium-activated chloride channel-2 n=13 Tax=Strongylocentrotus purpuratus RepID=UPI0000E48E5C Length = 1175 Score = 54.0 bits (128), Expect = 1e-05, Method: Composition-based stats. Identities = 19/162 (11%), Positives = 41/162 (25%), Gaps = 24/162 (14%) Query: 251 MFCLMDVSGSMDQSTKD-MAKRFYILLYLFLSRTYKNVEVVYIRHHTQAKEVDEHEFFYS 309 + ++D+SGSM + D M + + + K + + T + Sbjct: 242 VVLVLDISGSMSGNRFDRMIQSSADYIMNVIPLDSKLGIIGF--ESTSHIRTLLTDITDI 299 Query: 310 ------------QETGGTIVSSALKLMDEVVKERYNPAQWNIYAAQASDGDNWADDSPLC 357 GGT + + +V+ Y Y SDG + + Sbjct: 300 ASRERLVDALPPSAGGGTCIECGILSGIQVL-GSYAQGG---YLLLLSDGQG-SAQNLRD 354 Query: 358 HEILAKKLLPVVRYYSYIEITRRAHQTLWREYEHLQSTFDNF 399 ++ + + +F Sbjct: 355 TYNDIDNAGVIIDTITI----SNSADQEMEHLSTNTFGKSSF 392 >UniRef50_Q1RJP8 Putative uncharacterized protein n=9 Tax=Rickettsia RepID=Q1RJP8_RICBR Length = 516 Score = 54.0 bits (128), Expect = 1e-05, Method: Composition-based stats. Identities = 26/161 (16%), Positives = 49/161 (30%), Gaps = 32/161 (19%) Query: 254 LMDVSGSMDQSTKDMAKRFYILLYLFLSRTYKNVE-------VVYIRHHTQAKEVDEHE- 305 L+D+SGSM+ K F + L K E +V + A+ E Sbjct: 279 LIDISGSME-------KDFSVYKNNILKILDKLAEIPNWQINIVVFNDESTARSFSNQEN 331 Query: 306 --------FFYSQETGGTIVSSALKLMDEVVKERYNPAQWNIYAAQASDGDNWADDSPLC 357 + G T + +K E K + + + I +DG + +S + Sbjct: 332 NIEDIKVYINNLKANGYTKLYGTIKEALESFKGKIDESSTLIV---FTDGKDEGTNSNVT 388 Query: 358 HEILAKK-----LLPVVRYYSYIEITRRAHQTLWREYEHLQ 393 + + P Y+ +Q + + Sbjct: 389 EKDVVDVTSEVIKNPQFNMYTVGFGQ-YYNQEFFEQVATRG 428 >UniRef50_D2A5S3 Putative uncharacterized protein GLEAN_15119 n=3 Tax=Tribolium castaneum RepID=D2A5S3_TRICA Length = 1022 Score = 54.0 bits (128), Expect = 1e-05, Method: Composition-based stats. Identities = 32/213 (15%), Positives = 70/213 (32%), Gaps = 22/213 (10%) Query: 216 IAELRAKIERVPFIDTFDLRYKNYEKRPDPSSQAVMFCLMDVSGSMDQSTKDMAKRFYIL 275 + + A I ID FD R +++ SS + L+D SGSM +++A+ Sbjct: 47 MRQFPAMIWSQEPIDLFDCRTRSW-YIEAASSPKDVVILVDRSGSMTGMRREIARHVVHN 105 Query: 276 LYLFLSRTY-----------KNVEVVYIRHHTQAKEVDEHEFFYSQETGGTIVSSALKLM 324 + L N+ V + + + +F Q +S AL Sbjct: 106 ILDTLGNNDYYTTDPLIECFDNILVQANLANVRVLKEAMADFKTEQIA---NLSLALVTA 162 Query: 325 DEVVKERYNPAQW---NIYAAQASDGDNWADDSPLCHEILAKKLLPVVR--YYSYIEITR 379 ++++ N ++ N +DG D+ LP + ++Y+ Sbjct: 163 FQLLENYRNESKGANCNQAIMLVTDG--VQDNYMEIFRDYNWDNLPFINVRVFTYLIGRE 220 Query: 380 RAHQTLWREYEHLQSTFDNFAMQHIRDQDDIYP 412 + + + + ++++ Sbjct: 221 VSDVRDVKWMACANRGYYVHLSTYAEVREEVLQ 253 >UniRef50_B5JN05 von Willebrand factor type A domain protein n=1 Tax=Verrucomicrobiae bacterium DG1235 RepID=B5JN05_9BACT Length = 257 Score = 54.0 bits (128), Expect = 1e-05, Method: Composition-based stats. Identities = 18/169 (10%), Positives = 49/169 (28%), Gaps = 24/169 (14%) Query: 247 SQAVMFCLMDVSGSMDQ------STKDMAKRFYILLYLFLSRTYKNVEVVYIRHHTQAKE 300 + + D SGSM++ + K+ + L + + +E Sbjct: 71 TNYYVIF--DASGSMNELVAQQLPKIEAGKQALVTFANNLPEDANLGLLTF----DPVRE 124 Query: 301 V------DEHEF----FYSQETGGTIVSSALKLMDEVV-KERYNPAQWNIYAAQA-SDGD 348 + + F + G T + ++ V+ ++ + + Y +DG Sbjct: 125 LLPLGRGNRQAFIGSVSQIRAKGRTPLVESIVTGYRVLTEQAQRQSGYGRYVLVIVTDGA 184 Query: 349 NWADDSPLCHEILAKKLLPVVRYYSYIEITRRAHQTLWREYEHLQSTFD 397 + + + ++ V+ + + +Y S Sbjct: 185 SSDGNPAGVAMEVTRESPIEVQTIGFGVADHALNLPGVTQYVTASSPKA 233 >UniRef50_B7KMY0 Putative uncharacterized protein n=1 Tax=Cyanothece sp. PCC 7424 RepID=B7KMY0_CYAP7 Length = 143 Score = 54.0 bits (128), Expect = 1e-05, Method: Composition-based stats. Identities = 19/100 (19%), Positives = 33/100 (33%), Gaps = 6/100 (6%) Query: 232 FDLRYKNYEKRPDPSSQAVMFCLMDVSGSMDQSTKDMAKRFYILLYLFLSRTYKNVEVVY 291 ++R D + ++D SGSM + AK+ I ++ L VV+ Sbjct: 29 IEIRSNQSIDELDTKKALNLCLVIDRSGSMSGEKLETAKKSCIDIFKQLGEKDLLTVVVF 88 Query: 292 ------IRHHTQAKEVDEHEFFYSQETGGTIVSSALKLMD 325 I + K + + G T +S L Sbjct: 89 DDEAEVIVNPQVPKAEVIKKINQINDRGSTNLSLGWYLGL 128 >UniRef50_A1S120 von Willebrand factor, type A n=1 Tax=Thermofilum pendens Hrk 5 RepID=A1S120_THEPD Length = 305 Score = 54.0 bits (128), Expect = 1e-05, Method: Composition-based stats. Identities = 25/159 (15%), Positives = 45/159 (28%), Gaps = 23/159 (14%) Query: 252 FCLMDVSGSMDQSTKD-----MAKRFYILLYLFLSRTYKNVEVVYIRHHTQAKEVDEHEF 306 ++D S SM + +AKRF L + V V + + Sbjct: 93 VVVLDESKSMLSADVYPDRCTLAKRFAEKYIEGLQPSDLVVLVFFSSSSNATGPLPRD-- 150 Query: 307 FYSQETGG-------TIVSSALKLMDEVVKERYNPAQWNIYAAQASDGD-NWADDSPLCH 358 + G T + AL V+ P + SDG N+ D Sbjct: 151 NALRALDGYTCRYRYTALGDALVSAYSVLAASGLPGAVVVV----SDGGWNYGSD----P 202 Query: 359 EILAKKLLPVVRYYSYIEITRRAHQTLWREYEHLQSTFD 397 +A+ + + + +L R+ + Sbjct: 203 LQVAQSIRASNYSLVLVRVGGDPRGSLMRDVASKANGKY 241 >UniRef50_UPI0000E49DB4 PREDICTED: similar to poly (ADP-ribose) polymerase family, member 4 n=2 Tax=Strongylocentrotus purpuratus RepID=UPI0000E49DB4 Length = 1119 Score = 54.0 bits (128), Expect = 1e-05, Method: Composition-based stats. Identities = 41/297 (13%), Positives = 94/297 (31%), Gaps = 19/297 (6%) Query: 115 FVFQISKDEYLDLLFEDLAL--PNLKQN-QQRQLTEYKTHRAGYTANGVPANISVVRSLQ 171 +V +++++E + F L P L++ + +LTE G +N S+ ++ Sbjct: 609 YVTELAEEEGGKMAFYLLGSVAPWLQEEIRGTKLTEETQALTGVASNETERKESLQITM- 667 Query: 172 NSLARRTAMTAGKRRELHALEENLAIISNSEPAQLLEEERLRKEIAELRAKI---ERVPF 228 L+ + + + +E A ++ + QL + +L I E P Sbjct: 668 EMLSNISLVESPTHVIKVKRKEMKATVTLGDRVQLGDGFQLLISIIHPNTPRFWTEFDPS 727 Query: 229 IDTF-DLRYKNYEKRPDPSSQAVMFCLMDVSGSMDQSTKDMAKRFYILLYLFLSRTYKNV 287 +++ + + + + + L+D S SM K AK+ ++ L + Sbjct: 728 CESYASMAVFYPTIQSELIADPEVVLLLDCSTSMKGEPKQDAKKICKMILQSLPEKSRFN 787 Query: 288 EVVYIR-----HHTQAKEVDEHEFFYSQETGGTIVSSALKLMDEVVKERYNPAQWN--IY 340 + + T + G ++ N Sbjct: 788 VITFGTDFTELFPTVEPVGQRQLLEALEFIEGARSVGGSSEAWRPLRSLSLLPMMNSARN 847 Query: 341 AAQASDGDNWADDSPLCHEILAKKLLPVVRYYSYIEITRRAHQTLWREYEHLQSTFD 397 SDG + +A K V R ++ ++ ++ + R + Sbjct: 848 VLLVSDG---HLTNEKLTLEIASKYKHVNRIFTCA-VSSAGNRHILRALADVSGGAF 900 >UniRef50_A3J9J6 Putative uncharacterized protein n=3 Tax=Bacteria RepID=A3J9J6_9ALTE Length = 341 Score = 54.0 bits (128), Expect = 1e-05, Method: Composition-based stats. Identities = 28/211 (13%), Positives = 60/211 (28%), Gaps = 38/211 (18%) Query: 240 EKRPDPSSQAVMFCLMDVSGSMDQSTK----------DMAKRFYILLYLFLSRTYKNVEV 289 E R P++ + ++D+S SMD+ K+ + + + Sbjct: 79 EARQLPTTGRDLMLVVDISPSMDEPDMVRQGRRINRLQAVKQVLAEFIDQ-RQGDRLGLI 137 Query: 290 VY---------IRHHTQAKEVDEHEFFYSQETGGTIVSSALKLMDEVVKERYNPAQWNIY 340 ++ + + E T + A+ L + ++ER Sbjct: 138 LFGSQAYVQAPLTFDRTTVNILLQEAGLGMAGNATAIGDAVGLAVKRLRER---PLEQRV 194 Query: 341 AAQASDGDNW-ADDSPLCHEILAKKLLPVVRYYSYIEI-------------TRRAHQTLW 386 A +DG N + +P LA+ + +R + L Sbjct: 195 AIVLTDGANTAGEITPDKASELAQASAVRLYTIGIGAGADSAITGLLQRNPSRDLDEALL 254 Query: 387 REYEHLQSTFDNFAMQHIRDQDDIYPVFREL 417 F +++ + IY +L Sbjct: 255 TRMAQQTGGQY-FRARNLAELGGIYTSINQL 284 >UniRef50_A0B5M2 von Willebrand factor, type A n=1 Tax=Methanosaeta thermophila PT RepID=A0B5M2_METTP Length = 795 Score = 53.6 bits (127), Expect = 1e-05, Method: Composition-based stats. Identities = 28/196 (14%), Positives = 48/196 (24%), Gaps = 19/196 (9%) Query: 230 DTFDLRYKNYEKRPDPSSQAVMFCLMDVSGSMD-QSTKDMAKRFYILLYLFLSRTYKNVE 288 T LR S + +D SGSM D+ K L + V Sbjct: 49 VTITLRGGEIPCA----SPVDVVLSIDSSGSMTTSDPGDLRKSAAKEFVTGLDLSMDRVG 104 Query: 289 VVYIRHHTQAKEVD------EHEFFYSQETGGTIVSSALKLMDEVVKERYNPAQWNIYAA 342 VV + + E + G T + + LK +++ E + Sbjct: 105 VVSWNTSAISWPLTNNTKDIESAIDSTGADGNTCLDTGLKSAIDLLSECSG----SKVIV 160 Query: 343 QASDG---DNWADDSPLCHEILAKKLLPVVRYYSYIEITRRAHQTLWREYEHLQSTFDNF 399 +DG D P + + + + + Sbjct: 161 LLTDGISTDGGHYTPPGVPGSPVDEARSKG-ILVFTIGLGPDADARNLTEIAHSTGGEFY 219 Query: 400 AMQHIRDQDDIYPVFR 415 + IY R Sbjct: 220 SAPDANALAGIYKRIR 235 >UniRef50_B2K3B2 von Willebrand factor type A n=39 Tax=Gammaproteobacteria RepID=B2K3B2_YERPB Length = 233 Score = 53.6 bits (127), Expect = 1e-05, Method: Composition-based stats. Identities = 22/137 (16%), Positives = 43/137 (31%), Gaps = 17/137 (12%) Query: 247 SQAVMFCLMDVSGSMDQSTKDMAKRFYILLYLFLSRT---YKNVEVVYIRHHTQAKE-VD 302 + ++ L+D SGSM + + L + ++V + I + QA+E + Sbjct: 23 RRLPVYLLIDTSGSMRGESIHAVNVGIQAMMSALRQDPYALESVHLSIITYDNQAREYIP 82 Query: 303 EHEFFY-------SQETGGTIVSSALK----LMDEVVKERYN--PAQWNIYAAQASDGDN 349 GGT +AL+ ++ ++ W +DG Sbjct: 83 LTALENFQFTDITVPSAGGTFTGAALECLIHCVERDIQRSDGDQKGDWRPLVFLMTDGTP 142 Query: 350 WADDSPLCHEILAKKLL 366 + KK Sbjct: 143 SDVYAYGEAIKEVKKRA 159 >UniRef50_B2KDS9 von Willebrand factor type A n=1 Tax=Elusimicrobium minutum Pei191 RepID=B2KDS9_ELUMP Length = 373 Score = 53.6 bits (127), Expect = 1e-05, Method: Composition-based stats. Identities = 32/203 (15%), Positives = 60/203 (29%), Gaps = 29/203 (14%) Query: 245 PSSQAVMFCLMDVSGSMDQSTKD-----MAKRFYILLYLFLSRTYKNVEVVYIRHH---- 295 P+ + +D SGSM D AK + + VV+ Sbjct: 143 PTEGVDIILAIDTSGSMAAQDFDPNRITAAKVAAANFIAN-RLSDRIGIVVFASDAMLQS 201 Query: 296 --TQAKEVDEHEFFYSQET----GGTIVSSALKLMDEVVKERYNPAQWNIYAAQASDGD- 348 T E + GT + A+ V +PA+ + +DG+ Sbjct: 202 PLTLDYESLLDFLADVRIGMVRTDGTAIGDAI--AVSSVHLERSPAR-SKVIILLTDGES 258 Query: 349 NWADDSPLCHEILAKKLLPVVRYYSYIEITRRAHQTL------WREYEHLQSTFDNFAMQ 402 N SP + L ++ Y+ I++ + +L + L + Sbjct: 259 NSGVISP--LDAAKTAALYGIKVYTIATISKNSRDSLDFKPDDLEQIAKLTGGKY-YRAY 315 Query: 403 HIRDQDDIYPVFRELFHKQNATA 425 + + IY L + + Sbjct: 316 NEAELTKIYAEIDSLEKTEFKNS 338 >UniRef50_C9QHX4 Putative outer membrane adhesin like proteiin n=1 Tax=Vibrio orientalis CIP 102891 RepID=C9QHX4_VIBOR Length = 3332 Score = 53.6 bits (127), Expect = 1e-05, Method: Composition-based stats. Identities = 19/185 (10%), Positives = 38/185 (20%), Gaps = 40/185 (21%) Query: 245 PSSQAVMFCLMDVSGSMDQS------------------TKDMAKRFYILLYLFLSRTYKN 286 P + ++D SGSM DM + L L + Sbjct: 2729 PGVNYNIALVVDASGSMGDYVYNTDGTVMRNPDGSAMTRMDMMQEALTNLVESLVTHDGS 2788 Query: 287 VEVVYIRHHTQAKEVDEHEFF-----------------YSQETGGTIVSSALKLMDEV-V 328 + + I +V GGT + + Sbjct: 2789 INIKLIGFD-DNIDVTFEALDITNSSDVVAELLSKIENNLPVGGGTDYGVGFEEANNWYA 2847 Query: 329 KERYNPAQWNIYAAQASDGD-NWADDSPLCHEILAKKLLPVVRYYSYIEITRRAHQTLWR 387 + + +DG+ N + E + + Sbjct: 2848 SSSISSNGYENMTFFLTDGEPNSGTLNNGLTEYNELVSTHNAKVMAVG--MGNDIDDSVL 2905 Query: 388 EYEHL 392 ++ Sbjct: 2906 KFFDN 2910 >UniRef50_A3K0S6 Putative uncharacterized protein n=1 Tax=Sagittula stellata E-37 RepID=A3K0S6_9RHOB Length = 248 Score = 53.6 bits (127), Expect = 1e-05, Method: Composition-based stats. Identities = 26/181 (14%), Positives = 48/181 (26%), Gaps = 19/181 (10%) Query: 214 KEIAELRAKIERVPFIDTFDLRYKNYEKRPDPSSQAVMFCLMDVSGSMDQSTKDMAKRFY 273 I + L + PDP+ + ++D SGSM S + AK+ Sbjct: 30 VTITHDPVPPDWAAIAAWPSLEADLVDALPDPNRRITAI-VLDDSGSM-GSDMEAAKQAV 87 Query: 274 ILLYLFLSRTYKNVEVVY-------IRHHTQAKEVDEHEFFYSQETGGTIVSSAL--KLM 324 + + T + V A+ ++TG T ++ A+ Sbjct: 88 VDALSAMQDTDRVAVVALNAGVVLPFASVADARRTLPAALAPIRDTGSTPLTRAILDTQA 147 Query: 325 DEVVKERYNPAQWNIYAAQASDGDNWADDSPLCHEILAKKLLPV----VRYYSYIEITRR 380 + +DG A D + + L + I Sbjct: 148 MLEAEASSVRGFGTFRMIVTTDG---AADDGEALQRAIEDLAAKTPIQLTTIGIG-IRGN 203 Query: 381 A 381 Sbjct: 204 H 204 >UniRef50_C9RK46 von Willebrand factor type A n=1 Tax=Fibrobacter succinogenes subsp. succinogenes S85 RepID=C9RK46_FIBSS Length = 236 Score = 53.6 bits (127), Expect = 1e-05, Method: Composition-based stats. Identities = 30/175 (17%), Positives = 54/175 (30%), Gaps = 30/175 (17%) Query: 244 DPSSQAVMFCLMDVSGSMDQ-STKDMAKRFYILLY--------LFLSRTYKNVEVVY--- 291 S + L DVSGSM++ D K + L + + Sbjct: 10 IHSRPLPVIILADVSGSMNEIGKLDSLKHALNNMISSFKDASSSSLEAEIYVSIITFGNQ 69 Query: 292 ----IRHHTQAKEVDE-----HEFFYSQETGGTIVSSALKLMDEVVKER--YNPAQWNIY 340 I A E+ + Q G T + AL + ++++ R Y + + Sbjct: 70 AANIILEPQSASEIANDPSKMNVINKMQAIGNTPLGKALTSLVDLLENREIYPSRAYRPF 129 Query: 341 AAQASDG---DNWADDSPLCHEILAKKLLPVVRYYSYIEITRRAHQTLWREYEHL 392 ASDG D W K + I A +++ +++ + Sbjct: 130 IVLASDGMPNDLWQQPLDRLLNSERSKKANRLAL----AIGADADESMLKKFVNN 180 >UniRef50_Q54T94 Putative uncharacterized protein n=1 Tax=Dictyostelium discoideum RepID=Q54T94_DICDI Length = 509 Score = 53.6 bits (127), Expect = 1e-05, Method: Composition-based stats. Identities = 34/220 (15%), Positives = 69/220 (31%), Gaps = 20/220 (9%) Query: 134 LPNLKQNQQRQLTEYKTHRAGYTANGVPANISVVRSLQNSLARRTAMTAGKRRELHALEE 193 +P K + + + VP+ L R+ +E Sbjct: 205 VPLRKHLDVVERKMSANQWNEISYSKVPSRC-------MKLQRKAFERHEPSLFAEYIES 257 Query: 194 NLAIISNSEPAQLLEEERLRKEIAELRAKIERVPFIDTFDLRYKNYEKRPDPSSQAVMFC 253 + QL E +++ + + I + E R S + + Sbjct: 258 LKKGETKVNAKQLFPHEIVKEYLKGIAKDD-----ILEEQWKVLEQEVRKLGSLKDALV- 311 Query: 254 LMDVSGSMDQSTKDMAKRFYILLYLFLSRTYKNVEVVYIRHHTQAKEV---DEHEFFYSQ 310 L DVSGSM + +++ IL+ ++ +K++ + + T K + Sbjct: 312 LSDVSGSMSGTPMEVSIALGILISSVVAPPFKDLVITFHETPTFHKVTGDSLRDKVSNLA 371 Query: 311 E---TGGTIVSSALKLMDEVVKERYNPAQ-WNIYAAQASD 346 G T + A +++ E K+ P + SD Sbjct: 372 AAPWGGSTNFNRAFEMILEKAKQNKLPQEDMPKKLFVISD 411 >UniRef50_C1V972 Protoporphyrin IX magnesium-chelatase n=1 Tax=Halogeometricum borinquense DSM 11551 RepID=C1V972_9EURY Length = 738 Score = 53.6 bits (127), Expect = 1e-05, Method: Composition-based stats. Identities = 26/169 (15%), Positives = 49/169 (28%), Gaps = 15/169 (8%) Query: 204 AQLLEEERLRKEIAELRAKIERVPFIDTFDLRYKNYEKRPDPSSQAVMFCLMDVSGSMDQ 263 + + + A ID DLR +++ ++ ++D S SM Sbjct: 523 EPATNTDDIDVPASVRAAASNGRTQIDRSDLR----TSVNRGTARTLIVFVVDASASMRP 578 Query: 264 STKDMAKRFYILLYLFLSRTYKNVEVVYIRH-------HTQAKEVDEHEFFYSQETGGTI 316 + ++ LL + + V T + + T Sbjct: 579 AMRETKGTVMSLLEDAYQQRDEVAFVAVAGDEAEVVLPPTDSVSLAARHLKQLPAGDRTP 638 Query: 317 VSSALKLMDEVVKERYNPAQWNIYAAQASDG-DNWADDSPLCHEILAKK 364 + S L +V + + +DG N AD SP A + Sbjct: 639 LPSGLDAAHRLVTRADADSSLVVVV---TDGRANVADGSPTAATRSAAR 684 >UniRef50_Q2B6K0 Putative uncharacterized protein n=1 Tax=Bacillus sp. NRRL B-14911 RepID=Q2B6K0_9BACI Length = 940 Score = 53.6 bits (127), Expect = 1e-05, Method: Composition-based stats. Identities = 22/120 (18%), Positives = 37/120 (30%), Gaps = 26/120 (21%) Query: 251 MFCLMDVSGSMDQSTKD-----MAKRFYILLYLFLSRT------------YKNVE----V 289 + + D SGSM+ S K+ AK F +VE V Sbjct: 80 VVFVFDKSGSMNDSGKNPQKFQSAKDAMTAAVNFFKENAGPNDRFGFVPFDDDVETGKVV 139 Query: 290 VYIRHHTQA-KEVDEHEFFYSQETGGTIVSSALKLMDEVVKERYNPAQWNIYAAQASDGD 348 + + A + GGT + +L + + + N Y +DG+ Sbjct: 140 NFAPENNMASLNLINSNSNSLSALGGTNYTQSLDAALGM----FGNSTNNKYVLFMTDGE 195 >UniRef50_A6NMZ7 Collagen alpha-6(VI) chain n=2 Tax=Theria RepID=CO6A6_HUMAN Length = 2263 Score = 53.6 bits (127), Expect = 1e-05, Method: Composition-based stats. Identities = 30/258 (11%), Positives = 62/258 (24%), Gaps = 31/258 (12%) Query: 185 RRELHALEENLAIISNSEPAQLLEEERLRKEIAELRAKIE-RVPFIDTFDLRYKNYEKRP 243 + E + + + + E + + + DL + R Sbjct: 741 PAVVLRQEGVIIYSVGVFGSNVTQLEEISGRPEMVFYVENFDILQRIEDDLVFGICSPRE 800 Query: 244 --DPSSQAVMFCLMDVSGSMDQSTKDMAKRFYILLYLF------------LSRTYKNVEV 289 + ++D SGS+D ++ K F I L L + Sbjct: 801 ECKRIEVLDVVFVIDSSGSIDYDEYNIMKDFMIGLVKKADVGKNQVRFGALKYADDPEVL 860 Query: 290 VYIRHHTQAKEVDEHEFFYSQETGGTIVSSAL---KLMDEVVKERYNPAQWNIYAAQASD 346 Y+ EV G T + AL M + +D Sbjct: 861 FYLDDFGTKLEVISVLQNDQAMGGSTYTAEALGFSDHMFTEARGSRLNKGVPQVLIVITD 920 Query: 347 GDNWADDSPLCHEILAKKLLPVVRYYSYIEITRRAHQTLWREYEHLQSTFDNFAMQHIRD 406 G++ D + + + S+ F + Sbjct: 921 GESHDADKLNATAKALRDK--GILVLAVGIDGANP----VELLAMAGSSDKYFFV----- 969 Query: 407 QDDIYPVFRELFHKQNAT 424 + + + +F A+ Sbjct: 970 --ETFGGLKGIFSDVTAS 985 Score = 43.6 bits (101), Expect = 0.013, Method: Composition-based stats. Identities = 23/152 (15%), Positives = 45/152 (29%), Gaps = 23/152 (15%) Query: 245 PSSQAVMFCLMDVSGSMDQSTKDMAKRFYILLYLFLSRTYKNVEVVYIRHHTQAKEV--- 301 +A + L+D SGS+ K F L V++ ++ KE Sbjct: 617 KEMKADIMFLVDSSGSIGPENFSKMKTFMKNLVSKSQIGPDRVQIGVVQFSDINKEEFQL 676 Query: 302 --------DEHEFFYSQETGGTIVSSALKLMDEVVKERYNPAQWNI-----YAAQASDGD 348 + G T ++ V + ++P + + +DG+ Sbjct: 677 NRFMSQSDISNAIDQMAHIGQTTLTG---SALSFVSQYFSPTKGARPNIRKFLILITDGE 733 Query: 349 NWADDSPLCHEILAKKLLPVVRYYSYIEITRR 380 A D ++ ++ V YS Sbjct: 734 --AQDIVKEPAVVLRQ--EGVIIYSVGVFGSN 761 >UniRef50_D2RQJ2 von Willebrand factor type A n=1 Tax=Haloterrigena turkmenica DSM 5511 RepID=D2RQJ2_9EURY Length = 853 Score = 53.6 bits (127), Expect = 1e-05, Method: Composition-based stats. Identities = 18/154 (11%), Positives = 40/154 (25%), Gaps = 12/154 (7%) Query: 271 RFYILLYLFLSRTYKNVEVVYIRHHTQAKEVDEHEFFYSQET------GGTIVSSALKLM 324 + L + V V +A + ++E+ GGT +++ L+ Sbjct: 662 EATRNVIDELDPSADRVGVYDFASSGRALHPLSDDLESAKESVVGTAYGGTNMAAGLEAA 721 Query: 325 DEVVKERYNPAQWNIYAAQASDGDNWADDSPLCHEILAKKLLP-VVRYYSYIEITRRAHQ 383 R + I SDG N + + LA + ++ Sbjct: 722 LNDYATRGTDDRERIVIL-LSDGKNSNTANDERMDELADRSDDLDYTLHTVGLDALEHDS 780 Query: 384 ---TLWREYEHLQSTFDNFAMQHIRDQDDIYPVF 414 + + + D++ Sbjct: 781 IPEDKLEGWATETGGNY-YQTADPDELLDLFEEI 813 >UniRef50_A9WKF3 von Willebrand factor type A n=3 Tax=Chloroflexus RepID=A9WKF3_CHLAA Length = 446 Score = 53.6 bits (127), Expect = 1e-05, Method: Composition-based stats. Identities = 15/155 (9%), Positives = 33/155 (21%), Gaps = 16/155 (10%) Query: 272 FYILLYLFLSRTYKN--------VEVVYIRHHTQAKEVDEHEFFYSQE---TGGTIVSSA 320 L L + V+ + T ++ Sbjct: 105 ALHSLIERLDHNDRLGLIACASDAIVLASGIPGSRRAELVAAIARLPALRLGETTNLAQG 164 Query: 321 LKLMDEVVKERYNPAQWNIYAAQASDGDNWADDSPLCHEILAKKLLPVVRYYSYIEITRR 380 L+L + + +DG + D LC + + + + + Sbjct: 165 LQLA--LAQFVAADDATVRRIVLITDG--FTTDQTLCLTLAREAAARGISLSTIG-LGGS 219 Query: 381 AHQTLWREYEHLQSTFDNFAMQHIRDQDDIYPVFR 415 + L + L +F I Sbjct: 220 FEEHLLTQLADLSGGRASFVYDAADIPAIIAAELE 254 >UniRef50_A9GY82 Putative uncharacterized protein n=1 Tax=Sorangium cellulosum 'So ce 56' RepID=A9GY82_SORC5 Length = 1457 Score = 53.6 bits (127), Expect = 1e-05, Method: Composition-based stats. Identities = 25/195 (12%), Positives = 54/195 (27%), Gaps = 26/195 (13%) Query: 238 NYEKRPDPSSQAVMFCLMDVSGSMDQSTKDMAKRFYILLYLFLSRTYKNVEVVYIRHHTQ 297 + + A + ++D S S D+S + + +S + + Sbjct: 496 DVDWAELKELPADLVVVVDTSASADESARQQKAAAAEAILRAMSPSDHFALIALDSAPAV 555 Query: 298 AKEVD----------EHEFFYSQE---TGGTIVSSALKLMDEVVKERYNPAQWNIYAAQA 344 D E G T + + + V + PA IY Sbjct: 556 LHPKDGLAEASDKEISAALTRLAEHATGGATDLGALFESALGRVHGKEQPAV--IYI--- 610 Query: 345 SDGDNWADDSPLCHEILAKKLLPVVR-----YYSYIEITRRAHQTLWREYEHLQSTFDNF 399 GD A + LA++L + +++ + ++ L RE Sbjct: 611 --GDGLATSGEVTGARLAERLRRSLTGSRARFFTVG-VGENSNHALLRELARAGGGQAFR 667 Query: 400 AMQHIRDQDDIYPVF 414 + ++ + Sbjct: 668 IDEAEGSTSEVLRLA 682 >UniRef50_C9L3E2 BatB protein n=10 Tax=Bacteroidales RepID=C9L3E2_9BACE Length = 342 Score = 53.6 bits (127), Expect = 1e-05, Method: Composition-based stats. Identities = 27/215 (12%), Positives = 49/215 (22%), Gaps = 42/215 (19%) Query: 240 EKRPDPSSQAVMFCLMDVSGSM-----DQSTKDMAKRFYILLYLFLSRTYKNVEVVYIRH 294 + + +D+S SM S + AKR L L K +V+ Sbjct: 81 KLETVKRKGVEVIIALDISNSMLAQDVQPSRLEKAKRLISRLVDEL-DNDKVGMIVFAGD 139 Query: 295 HTQAKEVDEHEFF----------YSQETGGTIVSSALKLMDEVVKERYNPAQWNIYAAQA 344 + GT + A+ L + Sbjct: 140 AFTQLPITSDYISAKMFLESINPSLISKQGTAIGEAINLA---ARSFTPQEGVGRAIIVI 196 Query: 345 SDGDNWADDSPLCHEILAKKLLPVVRYYSYI---------EITRRA-------------H 382 +DG+N + + A + V E T + Sbjct: 197 TDGEN-HEGGAVEAAKAAAEKGIQVNVLGVGMPDGAPIPAEGTNDYRRDREGNVIVTRLN 255 Query: 383 QTLWREYEHLQSTFDNFAMQHIRDQDDIYPVFREL 417 +T+ +E Q I ++ Sbjct: 256 ETMCQEIAKEGKGIYVRVDNSNSAQKAINQEVNKM 290 >UniRef50_C3YP68 Putative uncharacterized protein n=1 Tax=Branchiostoma floridae RepID=C3YP68_BRAFL Length = 1386 Score = 53.6 bits (127), Expect = 1e-05, Method: Composition-based stats. Identities = 27/157 (17%), Positives = 49/157 (31%), Gaps = 20/157 (12%) Query: 249 AVMFCLMDVSGSMDQSTKDMAKRFYILLYLFLS---RTYKNVEVVYIRHHTQA----KEV 301 +F ++D SGS+ S + K+F + + + + + Y T A Sbjct: 476 PDLFFVLDGSGSVSVSDFETVKQFVVAVVSAFTIGLAETRVGVLQYSTSSTLACNLGDHP 535 Query: 302 DEHEFFYS------QETGGTIVSSALKLMDEVVKERYNPAQWNIYAAQASDGDNWADDSP 355 DE F + Q+ G T +AL+ + R PA + +DG Sbjct: 536 DEASFVSAINTMTYQKGGSTYTGAALEFARQNAAWR--PAPVSRIMIVLTDGQ----SHD 589 Query: 356 LCHEILAKKLLPVVRYYSYIEITRRAHQTLWREYEHL 392 V ++ + H L + Sbjct: 590 SVVAAAQALAADQVTVFAIG-VGSFDHSELLEITSNK 625 Score = 45.1 bits (105), Expect = 0.005, Method: Composition-based stats. Identities = 25/160 (15%), Positives = 48/160 (30%), Gaps = 25/160 (15%) Query: 251 MFCLMDVSGSMDQSTKDMAKRFYILLYLFLS---RTYKNVEVVYIR----------HHTQ 297 +F ++D SGS+ S + K+F + + + + + Y H + Sbjct: 269 LFFVLDGSGSVSVSDFETVKQFVVAVVSAFTIGLADTRVGVLQYSTSSSLECNLGDHPDE 328 Query: 298 AKEVDEHEFFYS--QETGGTIVSSALKLMDEVVKERYNPAQWNIYAAQASDGDNWADDSP 355 A V Q+ G T +A++ + R PA +DG S Sbjct: 329 ASFVS--AINTLVYQKGGNTYTGAAMEFARQNAAWR--PAPVPKIMIVLTDG----KSSD 380 Query: 356 LCHEILAKKLLPVVRYYSYIEITRRAHQTLWREYEHLQST 395 V ++ + H L + + Sbjct: 381 SVVAAAQALAADQVAVFAIG-VGSFDHSELLE-ITNNKPG 418 >UniRef50_A1VSB7 von Willebrand factor, type A n=10 Tax=Betaproteobacteria RepID=A1VSB7_POLNA Length = 354 Score = 53.6 bits (127), Expect = 2e-05, Method: Composition-based stats. Identities = 28/250 (11%), Positives = 54/250 (21%), Gaps = 61/250 (24%) Query: 237 KNYEKRPDPSSQAVMFCLMDVSGSMDQST-----KDMAKRFYILLYLFLSRTYKNVEVVY 291 + PS + MDVSGSM + ++ L R + V + Sbjct: 78 RPLAVITLPSQNETIILAMDVSGSMRATDVLPNRLVASQNAAKAFLADLPRNVRVGVVAF 137 Query: 292 IRH------HTQAKEVDEHEFFYSQETGGTIVSSALK----------------------- 322 T ++E Q GT + + + Sbjct: 138 AGTAAVVQPPTVSREDLTAAIDKFQLQRGTAIGNGIIVSLAELFPEAGIDLESMENNRER 197 Query: 323 ---LMDEVV-------KERYNP----AQWNIYAAQASDGDNWADDSPLCHEILAKKLLPV 368 L + K+ + P + + +DG L +A Sbjct: 198 KHGLSLDQAGKDDGNGKKAFTPVAPGSYTSAAIILLTDGQRTTGIDSLDAAKVAADRGIR 257 Query: 369 VRYYSYIEITRR------------AHQTLWREYEHLQSTFDNFAMQHIRDQDDIYPVFRE 416 V + + + +A D +Y Sbjct: 258 VYTVGVGTVEGETIGFEGWSMRVKLDEETLKGIARATQAEYFYAGTA-TDLKKVYQTLSS 316 Query: 417 LFHKQNATAK 426 + + Sbjct: 317 RLTVEKKETE 326 >UniRef50_C5GK44 U-box domain-containing protein n=2 Tax=Ajellomyces dermatitidis RepID=C5GK44_AJEDR Length = 766 Score = 53.6 bits (127), Expect = 2e-05, Method: Composition-based stats. Identities = 26/229 (11%), Positives = 52/229 (22%), Gaps = 45/229 (19%) Query: 240 EKRPDPSSQAVMFCLMDVSGSMDQST------------------KDMAKRFYILLYLFLS 281 ++ + +D+S SM S D+ K + L+ Sbjct: 65 PEKELRHVPCDIVLCIDISYSMSSSAPLPTTDDSGKPEDTGLSVLDLTKHAARTIIETLN 124 Query: 282 RTYKNVEVVYIRH----------HTQAKEVDEHEFFYSQETGGTIVSSALKLMDEVVKER 331 + V + + K+ T + LKL E ++E Sbjct: 125 DNDRLGVVAFSTDAEVVYKISNMNEDNKKAALKAVEALWPLSSTNLWHGLKLSLEALEEV 184 Query: 332 YNPAQWNIYAAQASDG------DNWADDSPL--------CHEILAKKLLPVVRYYSYIEI 377 Q +DG + + K LP++ + + Sbjct: 185 TPIPQNVQALYILTDGMYRIVRSRVPHANASKFRHAKSYVSKAGQKDRLPMIHTFGFGY- 243 Query: 378 TRRAHQTLWREYEHLQSTFDNFAMQHIRDQDDIYPVFRELFHKQNATAK 426 L + + +F L+ A Sbjct: 244 --YIRSGLLQAISEVGGGTYSFIPDAGMIGTVFVHAIANLYTTFATQAM 290 >UniRef50_B6HQM8 Pc22g11730 protein n=17 Tax=Leotiomyceta RepID=B6HQM8_PENCW Length = 1029 Score = 53.6 bits (127), Expect = 2e-05, Method: Composition-based stats. Identities = 20/184 (10%), Positives = 47/184 (25%), Gaps = 24/184 (13%) Query: 232 FDLRYKNYEKRPDPSSQAVMFCLMDVSGSMDQSTKDMAKRFYILLYLFLSRTYKNVEVVY 291 D + P + ++ VS SM + + L L + V + Sbjct: 500 PDAEFPAINTVHIP---LDLVVVIPVSSSMQGLKITLLRDALKFLVQNLGPRDRMGLVTF 556 Query: 292 ---------IRHHTQAKEVDEHEFFYSQETG----GTIVSSALKLMDEVVKERYNPAQWN 338 + T++ + G V + +++ +R N Sbjct: 557 GSSGGGVPLVGMTTKSWAGWSKILESIRPVGQKSLRADVVEGANVAMDLLMQR---KFNN 613 Query: 339 IY--AAQASDGDNWADDSPLCHEILAKKLLPVVRYYSYIEITRRAHQTLWREYEHLQSTF 396 SD + D ++++ V +S+ T+ Sbjct: 614 PVSTILLISD--SSISDPESVDFVVSRAEAAKVTIHSFGLGLTHKPDTMIE-LSTRTKGS 670 Query: 397 DNFA 400 ++ Sbjct: 671 YSYV 674 >UniRef50_C3YC17 Putative uncharacterized protein n=1 Tax=Branchiostoma floridae RepID=C3YC17_BRAFL Length = 951 Score = 53.6 bits (127), Expect = 2e-05, Method: Composition-based stats. Identities = 23/186 (12%), Positives = 56/186 (30%), Gaps = 18/186 (9%) Query: 248 QAVMFCLMDVSGSMDQSTKDMAKRFYILLYLFLSRTYKNVEVVYIRHHTQAKEVD----- 302 + + D SGSM + ++ + + ++ +V + + ++ +A + Sbjct: 79 RTHVIVAADKSGSMSGNPWRQVQQALLYMIGDVASVNPSVALDVVIYNDKASLLQYAGSY 138 Query: 303 EHEFFYSQETGGTIVSSALKLMDEVVKERYNPAQWNI-YAAQASDGD---NWADDSPLCH 358 + G T ++A + + +K + +DG N D Sbjct: 139 QDAVNRVNADGMTSFAAAFSCIKDCLKTEIQGTPVSKTVVVFMTDGADTCNRGADIDRSV 198 Query: 359 EILAKKLLPV-----VRYYSYIEITRRAHQTLWREYEHLQSTFDNFAMQHIRDQDDIYPV 413 + L + V + + + + +T F D + Sbjct: 199 RSWKEALARLGHEAIVHVVGF---SAQHDYNFLGRLRNTGTTAGLFRYTEPSDGTEALKA 255 Query: 414 -FRELF 418 +ELF Sbjct: 256 KLQELF 261 >UniRef50_Q5BKJ5 Clca1 protein (Fragment) n=8 Tax=Xenopus (Silurana) tropicalis RepID=Q5BKJ5_XENTR Length = 937 Score = 53.2 bits (126), Expect = 2e-05, Method: Composition-based stats. Identities = 21/122 (17%), Positives = 51/122 (41%), Gaps = 13/122 (10%) Query: 246 SSQAVMFCLMDVSGSMDQSTK--DMAKRFYILLYLFLSRTYKNVEVVY----IRHHTQAK 299 SS+ V+ ++DVSGSM + + + + + + V + + + Sbjct: 303 SSERVVTLVLDVSGSMGGGNRIGRLYQAAEVFVMQIVEMGSYVGIVQFESTASVRSSLLQ 362 Query: 300 EVDEHEFFYSQ------ETGGTIVSSALKLMDEVVKERYNPAQWNIYAAQASDGDNWADD 353 VD+ + + TGGT + + ++ +V ++Y+ + ++ +DG++ Sbjct: 363 IVDDTQRNRLKSLLPKTATGGTNICAGIREGIKV-NKKYDGSSYSTELVLLTDGEDNYAT 421 Query: 354 SP 355 S Sbjct: 422 SL 423 >UniRef50_A5UYK7 von Willebrand factor, type A n=2 Tax=Roseiflexus RepID=A5UYK7_ROSS1 Length = 429 Score = 53.2 bits (126), Expect = 2e-05, Method: Composition-based stats. Identities = 27/214 (12%), Positives = 52/214 (24%), Gaps = 33/214 (15%) Query: 229 IDTFDLRYKNYEKRPDPSSQAVMFCLMDVSGSM-----DQSTKDMAKRFYILLYLFL--- 280 + L + P + + ++D SGSM +A++ I L L Sbjct: 150 VLPGTLGVRMASGERLPGATRNLAIILDASGSMLARIDGAPKTVIARQALIALVERLPAT 209 Query: 281 ----------SRTYKNVEVVYIRHHTQAKEVDEHEFFYSQE-----TGGTIVSSALKLMD 325 R + ++ + D G T ++ +L+ Sbjct: 210 TNVALRTYGHRRADDCSDTELVQAPAPIQRADL--INRINAIRPVNGGRTPIAQSLE--- 264 Query: 326 EVVKERYNPAQWNIYAAQASDGDNW-ADDSPLCHEILAKKLLP-VVRYYSYIEITRRAHQ 383 ++ ++ + SDGD D L V + + Sbjct: 265 DMARDLAGVDGE-VLIVLVSDGDETCGGDPVATAAALHTANPRLRVSVIGFNIEQEEWRR 323 Query: 384 TLWREYEHLQSTFDNFAMQHIRDQDDIYPVFREL 417 L F + D L Sbjct: 324 RL-EGIAAYGGGAY-FDAANAVQLADALEQAVAL 355 >UniRef50_C3XEK2 Putative uncharacterized protein n=1 Tax=Helicobacter bilis ATCC 43879 RepID=C3XEK2_9HELI Length = 493 Score = 53.2 bits (126), Expect = 2e-05, Method: Composition-based stats. Identities = 25/152 (16%), Positives = 52/152 (34%), Gaps = 18/152 (11%) Query: 241 KRPDPSSQAVMFCLMDVSGSMDQSTKDMAKRFYILLYLFLSRTYKNVEVVYIRHHTQAKE 300 ++ ++ + +D SGSM + + +AK + L + + Y+ + + E Sbjct: 320 QKKREKNEGAIIICVDTSGSMYGNPEYIAKALTLFLATKANTQKR---ACYLINFSIGIE 376 Query: 301 VDE----------HEFFYSQETGGTIVSSALKLMDEVVKERYNPAQWNIYAAQASDGDNW 350 E +F GGT V+ ALK + +++ I SDG Sbjct: 377 TMELSGKGGMAKLMQFLEMSFGGGTDVAPALKAGLKTMQQDDFKKSDLIVI---SDGGFG 433 Query: 351 ADDSPLCHEILAKKLLPVVRYYSYIEITRRAH 382 + E + + ++I + Sbjct: 434 YIPND--LEKQMQNQRQKDNKFYLLDINGNSG 463 >UniRef50_A8J3X6 Flagellar associated protein (Fragment) n=1 Tax=Chlamydomonas reinhardtii RepID=A8J3X6_CHLRE Length = 1043 Score = 53.2 bits (126), Expect = 2e-05, Method: Composition-based stats. Identities = 23/175 (13%), Positives = 42/175 (24%), Gaps = 39/175 (22%) Query: 183 GKRRELHALEENLAIISNSEPAQLLEEERLRKEIAELRAKIERVPFID------------ 230 + L + + I+ P + ++ F+ Sbjct: 212 PQGYSHTQLLDVIVTINTGSPTPVA-YALPNADVDVGYRVWGNDMFLALNVSNPRPPHPA 270 Query: 231 TFDLRYKNYEKRPDP------SSQAVMFCLMDVSGSMDQSTKDMAKRFYILLYLFLSRTY 284 D R P + L+D SGSM + AK L+ Sbjct: 271 DPDPRGAFVLSVAPPAPEFTAPFPRSVVFLLDRSGSMSGEPMEFAKAALCFGLRSLTPLD 330 Query: 285 KNVEVVY---------------IRHHTQAKEVDE-----HEFFYSQETGGTIVSS 319 V + +R A+ + + GGT ++S Sbjct: 331 TFTVVAFDHEQLWFTPGGQLSWVRASVDARGLTDIMTPLQTAMRVLSGGGTRIAS 385 >UniRef50_B1JFF2 Na-Ca exchanger/integrin-beta4 n=1 Tax=Pseudomonas putida W619 RepID=B1JFF2_PSEPW Length = 5962 Score = 53.2 bits (126), Expect = 2e-05, Method: Composition-based stats. Identities = 27/175 (15%), Positives = 51/175 (29%), Gaps = 24/175 (13%) Query: 245 PSSQAVMFCLMDVSGSMDQSTKDMAKRFYILLYLFLSRTYK-----NVEVVYIRHHTQAK 299 P + ++D SGSM S D AK+ ++ L+ + K V ++ + TQ K Sbjct: 5443 PGQDYNIAFIVDTSGSMGSSGVDAAKKSLESVFKTLAASVKGAQSGTVNILLVDFATQVK 5502 Query: 300 ------------EVDEHEFFYSQETGGTIVSSALK-LMDEVVKERYNPAQWNIYAAQASD 346 + GGT A K + + + + +D Sbjct: 5503 SSVSVTLNDAGLKTLLSALGTLNSNGGTNYEDAFKTTANWFANLKAAGSTGSNQTFFITD 5562 Query: 347 GD-NWADDSPLCHEILAKKLLPVVRYYS-----YIEITRRAHQTLWREYEHLQST 395 G+ + S + L + + + S + S Sbjct: 5563 GEPTYYQTSEQANPNLGNTNVKLDTFLSSINYKMGDTVTNYQLDTNNRLSIDSSG 5617 >UniRef50_D0LBG9 ATPase associated with various cellular activities AAA_5 n=4 Tax=Bacteria RepID=D0LBG9_GORB4 Length = 674 Score = 53.2 bits (126), Expect = 2e-05, Method: Composition-based stats. Identities = 23/138 (16%), Positives = 45/138 (32%), Gaps = 16/138 (11%) Query: 220 RAKIERVPFIDTFDLRYKNYEKRPDPSSQAVMFCLMDVSGSMDQSTKDMAKRFYI--LLY 277 + DLR + ++ ++D+SGSM + A LL Sbjct: 470 PTPALTGTLVTEDDLR----AAQRIGQESNLVVFVVDLSGSMTARRRLAAVSELCVDLLR 525 Query: 278 LFLSRTYKNVEVV-------YIRHHTQAKEVDEHEFFYSQETGGTIVSSALKLMDEVVK- 329 +R + VV + T++ ++ + G T ++ L EV++ Sbjct: 526 DSYTRRDRVAVVVARGSRASLVVPPTKSVDIAVRCLADVRTGGRTPLAEGLLAAAEVIER 585 Query: 330 -ERYNPAQWNIYAAQASD 346 R P + +D Sbjct: 586 SRRSEPDRR-PLLVVLTD 602 >UniRef50_A6YIP9 Capillary morphogenesis protein 2B n=14 Tax=Euteleostomi RepID=A6YIP9_DANRE Length = 487 Score = 53.2 bits (126), Expect = 2e-05, Method: Composition-based stats. Identities = 20/175 (11%), Positives = 49/175 (28%), Gaps = 13/175 (7%) Query: 251 MFCLMDVSGSMDQSTKDMAKRFYILLYLFLSRTYKNVEVVYIRHHTQAKEVD-------- 302 ++ ++D SGS+ + ++ L F+S + +V+ + Sbjct: 40 LYFVLDRSGSVSDNWLEIYGFVEQLTNRFVSPKMRVSFIVFSSSAEIILPLTGDRVDIDS 99 Query: 303 -EHEFFYSQETGGTIVSSALKLMDEVVKERYNPAQWNIYAAQASDGDNWADDSPLCHEIL 361 + + G T + LK E + A+ + +DG + L + Sbjct: 100 GLQQLSKIRPAGDTYMHEGLKKAIEQMT--SQGARASSIIIALTDGKLEVFMNELAIKEA 157 Query: 362 AKKLLPVVRYYSYIEITRRAHQTLWREYEHLQSTFDNFAMQHIRDQDDIYPVFRE 416 R Y E + ++ + + ++ Sbjct: 158 DLARQYGARVYCVGVK--DFDANQLTEIADNKDQVFPVVDGFQALKNIVNSILQK 210 >UniRef50_D0LR75 Putative uncharacterized protein n=1 Tax=Haliangium ochraceum DSM 14365 RepID=D0LR75_HALO1 Length = 816 Score = 53.2 bits (126), Expect = 2e-05, Method: Composition-based stats. Identities = 27/241 (11%), Positives = 61/241 (25%), Gaps = 43/241 (17%) Query: 212 LRKEIAELRAKIERVPFIDT----FDLRYKNYEKRPDPSSQAVMFCLMDVSGSMDQSTKD 267 + I P +LR P + L+D + + ++ Sbjct: 445 FEIATGLIERPIPTEPGFRDYLWGVNLRGPQVSDEDRPGLSLTV--LIDET--LPGPERE 500 Query: 268 MAKRFYILLYLFLSRTYKNVEVVYIRH----------HTQAK---EVDEHEFFYSQETGG 314 + + L L + + + + T+A+ E+ G Sbjct: 501 LTRLSLDTLATALRPDDRVNVITFAGNPKIELENVSLPTKAEGESELVRLGRRLRGRAGM 560 Query: 315 TIVSSALKLMDEVVKERYNPAQWNIYAAQASDGDNWADDSPLCHEILAKKLLPVV----- 369 + AL +V + +W+ +D D P E + ++ L Sbjct: 561 FDLDGALATAYKVARRHQRSERWSQV-LVLTDSD--IGTLPTLLETIVEESLTSGSSRGI 617 Query: 370 --RYYSYI----EITRRAHQTLWREYEHLQSTFDNFAMQHIRDQDDIYPVFRELFHKQNA 423 + + + R + +L ++ D + LF Sbjct: 618 RWTVLGLHAVDTSVQGNRDEQVLRPFAYLGKGSY----FTVKTNAD----AQRLFAAPLT 669 Query: 424 T 424 Sbjct: 670 G 670 >UniRef50_C5BKN1 Matrixin family protein n=1 Tax=Teredinibacter turnerae T7901 RepID=C5BKN1_TERTT Length = 877 Score = 53.2 bits (126), Expect = 2e-05, Method: Composition-based stats. Identities = 28/162 (17%), Positives = 42/162 (25%), Gaps = 24/162 (14%) Query: 247 SQAVMFCLMDVSGSM--------DQSTKDMAKRFYILLYLFLSR---------TYKNVEV 289 + +MD SGSM S D K + FL + V V Sbjct: 418 QNTDVVLVMDRSGSMNLSSAPDPSVSKMDALKYAANVFMDFLDLDAGHRAGLVQFHEVVV 477 Query: 290 ----VYIRHHTQA--KEVDEHEFFYSQETGGTIVSSALKLMDEVVKERYNPAQWNIYAAQ 343 + A + G T + + + +P+ I Sbjct: 478 PFSPAFNLQPVNAASLSAAQTAINSMTAGGMTNIIDGVNEGIAQLTTAVDPSDRQIMLL- 536 Query: 344 ASDGDNWADDSPLCHEILAKKLLPVVRYYSYIEITRRAHQTL 385 +DG + +I A L V YS T L Sbjct: 537 LTDGLHNRPVGTSVTDITAPLLASEVTLYSVGFGTSTNEAEL 578 >UniRef50_D2LB55 von Willebrand factor type A n=1 Tax=Rhodomicrobium vannielii ATCC 17100 RepID=D2LB55_RHOVA Length = 605 Score = 53.2 bits (126), Expect = 2e-05, Method: Composition-based stats. Identities = 42/308 (13%), Positives = 78/308 (25%), Gaps = 31/308 (10%) Query: 63 MFHQGRGGLRHRVHPGNDHFVQNDRIERPQGGGGGSGSGQGQASQDGEGQDEFVFQISKD 122 MF R + + E G + +D+ D Sbjct: 255 MFVLAPRATRMPEPMEDQAPPEPPPPEPQDDKEEDGKGGGESGENETAPEDDKAL---TD 311 Query: 123 EYLDLLFEDLALPNLKQNQQRQLTEYKTHRAGYTANGVPANISVVRSLQNSLARRTAMTA 182 + + L + + + G G + R + R Sbjct: 312 MIIAAAAAAMPKEMLVKAAAKPFRVPPSAPMG--KAGAERT-NAARGRPAGVRRGMPKPG 368 Query: 183 GKRRELHALEENLAIISNSEPAQLLEEERLRKEIAELRAKIERVPFIDTFDLRYKNYE-K 241 L LE A + + + ++ + +D ++R ++ K Sbjct: 369 SP---LSLLETLRAAAPWQKIRKSMLDKAIGA-----------GGRLDRIEVRSDDFRIK 414 Query: 242 RPDPSSQAVMFCLMDVSGSMDQSTKDMAKRFYILLYLFLS-RTYKNVEVVY-------IR 293 R SQ +D SGS K LL R + V + + Sbjct: 415 RLKARSQTATIFAVDASGSAAMQRLAETKGAVELLLAECYIRRDQVALVAFRGTRAEILL 474 Query: 294 HHTQAKEVDEHEFFYSQETGGTIVSSALKLMDEVVKERYNPAQWNIYAAQASDG-DNWAD 352 T++ + GGT ++S + + + +DG N A Sbjct: 475 PPTRSLTRAKKSLAALAGGGGTPLASGIDAAGALA-LNIRRRGISPLVVFLTDGKANIAA 533 Query: 353 DSPLCHEI 360 D E Sbjct: 534 DGKPGRER 541 >UniRef50_C1F3L6 von Willebrand factor type A domain protein n=1 Tax=Acidobacterium capsulatum ATCC 51196 RepID=C1F3L6_ACIC5 Length = 410 Score = 53.2 bits (126), Expect = 2e-05, Method: Composition-based stats. Identities = 28/178 (15%), Positives = 56/178 (31%), Gaps = 17/178 (9%) Query: 253 CLMDVSGSMDQSTKDMAKRFYILLYLFLSRTYKNVEVVYIRHH------TQAKEVDEHEF 306 L+D SGSM + + + L + + V + T + E Sbjct: 187 ILVDNSGSMQN-KLNAVDKAALDLVRASNPDDEAFIVNFSDQAYLDQGFTSSIAKLEQGL 245 Query: 307 FYSQETGGTIVSSALKL-MDEVVKERYNPAQWNIYAAQASDGDNWADDSPLCHEILAKKL 365 +++ GGT + A+ DE+ K+ +P Q +DG++ A L I + Sbjct: 246 AHTEARGGTALYDAIVASADELSKDARHPKQ---VLLVVTDGEDDASTMNLQQAIQRVQA 302 Query: 366 LPVVRYYSYIEITRRAHQT------LWREYEHLQSTFDNFAMQHIRDQDDIYPVFREL 417 L Y+ + + + F + V +++ Sbjct: 303 LHGPEIYAIGLLYDDSGDEAHRARKALEQLTEQTGGLAYFPRSLENVDEVAAEVAKDI 360 >UniRef50_A8H5T6 von Willebrand factor type A n=5 Tax=Proteobacteria RepID=A8H5T6_SHEPA Length = 336 Score = 53.2 bits (126), Expect = 2e-05, Method: Composition-based stats. Identities = 30/213 (14%), Positives = 66/213 (30%), Gaps = 40/213 (18%) Query: 240 EKRPDPSSQAVMFCLMDVSGSMDQSTKDM---AKRFYILLYLFL------SRTYKNVEVV 290 E PS + +D+SGSM + + L+ + + + ++ Sbjct: 75 EAIELPSKGRDLMLSVDLSGSMQIEDMVIDGKVVDRFTLIQHVISDFIERRKGDRIGLIL 134 Query: 291 YIRHH------TQAKEVDEHEFFYSQET--GG-TIVSSALKLMDEVVKERYNPAQW-NIY 340 + H TQ + +Q G T + A+ L +R++ + N Sbjct: 135 FADHAYLQSPLTQDRRSVAQYLKEAQIGLVGKQTAIGEAIALGV----KRFDKVEQSNRV 190 Query: 341 AAQASDG-DNWADDSPLCHEILAKKLLPVVRYYS-----------YIEITRRA----HQT 384 +DG +N +P +A + + + + ++ Sbjct: 191 LILLTDGSNNAGAITPEQASQIAAQRGITIYTIGVGADVMERRTLFGKERVNPSMDLDES 250 Query: 385 LWREYEHLQSTFDNFAMQHIRDQDDIYPVFREL 417 +E + F ++ + + IY V L Sbjct: 251 QLQEIAKVTGGQY-FRARNTEELEQIYQVIDTL 282 >UniRef50_A7RFK1 Predicted protein (Fragment) n=2 Tax=Nematostella vectensis RepID=A7RFK1_NEMVE Length = 1418 Score = 53.2 bits (126), Expect = 2e-05, Method: Composition-based stats. Identities = 30/194 (15%), Positives = 57/194 (29%), Gaps = 36/194 (18%) Query: 251 MFCLMDVSGSMD----QSTKDMA---KRFYILLYLFLSRTYKNVEVVYIRHHTQAKEVD- 302 + L+D SGS+ K+ K F L + +YK+ V + T A Sbjct: 4 LIFLVDTSGSLQYWSGGGWKNGFDDEKVFVNSLLSHIRVSYKSTYVSVVLFGTSATIDIN 63 Query: 303 --------------EHEFFYSQE-TGGTIVSSALKLMDEVVKERY-----NPAQWNIYAA 342 +F + +G T + A + +++ +Y Q Sbjct: 64 YIFNPHPNNHKCNFRRDFSNLRFRSGMTNMHDAFQAAYDIIFGKYSGHKRPTHQVKTAVF 123 Query: 343 QASDGD-NWADDSPLCHEILAKKLLPVVRYYSYIEITRRAHQTLWREYEHLQSTFDNFAM 401 +DG NW D + L + + ++ + R S + F Sbjct: 124 LLTDGQWNWNGDPWPIAKRLKDR---GIEIFTIGVTNG-VNVNTLRSLA---SPNNYFHY 176 Query: 402 QHIRDQDDIYPVFR 415 ++ R Sbjct: 177 NDFTQFRELATCIR 190 >UniRef50_Q5LCG5 Aerotolerance-related membrane protein n=25 Tax=Bacteroidales RepID=Q5LCG5_BACFN Length = 341 Score = 53.2 bits (126), Expect = 2e-05, Method: Composition-based stats. Identities = 24/215 (11%), Positives = 48/215 (22%), Gaps = 42/215 (19%) Query: 240 EKRPDPSSQAVMFCLMDVSGSM-----DQSTKDMAKRFYILLYLFLSRTYKNVEVVYIRH 294 + + +D+S SM S + AKR L + K +V+ Sbjct: 81 KLETVKRKGVEVMIALDISNSMLAQDVQPSRLEKAKRLISKLVDGM-ENDKVGMIVFAGD 139 Query: 295 HTQAKEVDEHEFF----------YSQETGGTIVSSALKLMDEVVKERYNPAQWNIYAAQA 344 + GT + +A+ L + Sbjct: 140 AFTQLPITSDYISAKMFLESISPSLISKQGTAIGAAINLA---ARSFTPQEGVGRAIVVI 196 Query: 345 SDGDNWADDSPLCHEILAKKLLPVVRYYSYIEITRRA----------------------H 382 +DG+N + + A K V + Sbjct: 197 TDGEN-HEGGAVEAAKEAAKKGIQVNVLGVGLPDGAPIPIEGSNDFRRDREGNVIVTRLN 255 Query: 383 QTLWREYEHLQSTFDNFAMQHIRDQDDIYPVFREL 417 + + +E + Q I ++ Sbjct: 256 EAMCQEIAKEGNGIYIRVDNSNSAQKAINQEINKM 290 >UniRef50_C5BKZ8 von Willebrand factor type A domain protein n=5 Tax=Gammaproteobacteria RepID=C5BKZ8_TERTT Length = 347 Score = 53.2 bits (126), Expect = 2e-05, Method: Composition-based stats. Identities = 27/194 (13%), Positives = 56/194 (28%), Gaps = 29/194 (14%) Query: 240 EKRPDPSSQAVMFCLMDVSGSMDQSTKDMA-KRFYILLY------LFL--SRTYKNVEVV 290 E P++ + +D+SGSM + K+ +L F+ + + ++ Sbjct: 82 EPVTLPATGRDLLLAVDISGSMKTPDMVVQDKQIARILVVKYVVNEFIERRESDRLGLIL 141 Query: 291 YIRHH------TQAKEVDEHEFFYSQ---ETGGTIVSSALKLMDEVVKERYNPAQWNIYA 341 + T ++ +Q T + A+ L + ++ER Sbjct: 142 FGSQAYLQAPLTFDRKTVSTLLDEAQLGFAGEQTAIGDAVGLAIKRLRER---PASQRVL 198 Query: 342 AQASDGDNW-ADDSPLCHEILAKKLLPVVRYYSYIEITRRAHQTLWREYEHLQSTFDNFA 400 +DG N + +P LAK+ + + L F Sbjct: 199 ILLTDGANTAGEVAPRQAADLAKQAGIKIYTVGVG-------ADQMEQRMGLFGGFSRTV 251 Query: 401 MQHIRDQDDIYPVF 414 +D Sbjct: 252 NPSSDLDEDTLRYM 265 >UniRef50_C0ZKA7 Putative uncharacterized protein n=1 Tax=Brevibacillus brevis NBRC 100599 RepID=C0ZKA7_BREBN Length = 437 Score = 53.2 bits (126), Expect = 2e-05, Method: Composition-based stats. Identities = 29/195 (14%), Positives = 59/195 (30%), Gaps = 33/195 (16%) Query: 248 QAVMFCLMDVSGSMDQS-----TKDMAKRFYILLYLFLSRTYKN---------------- 286 + ++D SGSM +AK L + Sbjct: 135 NFNVEIILDASGSMAGKIGDKTKMQLAKEAIQEFAEALPEDARISLRVYGHKGSNADEHK 194 Query: 287 ------VEVVYIRHHTQAKEVDEHEFFYSQETGGTIVSSALKLMDEVVKERYNPAQWNIY 340 E+VY AK + E + TG T ++ +L+L E + + + Sbjct: 195 QLSCGSSEMVYPLQAYDAKRL-EQALDMFEPTGWTSIAHSLRLAQEDLAG-FEADKNTNV 252 Query: 341 AAQASDG-DNWADDSPLCHEILAK-KLLPVVRYYSYIEITRRAHQTLWREYEHLQSTFDN 398 SDG + + + L++ K++P++ + Q +E H Sbjct: 253 IYLVSDGIETCDGNPVAVAKELSQSKIMPLLNVIGFDVNAEGQKQ--LKEIAHASEGLYA 310 Query: 399 FAMQHIRDQDDIYPV 413 + + ++ Sbjct: 311 NVTNREQFKQELERA 325 >UniRef50_B0C1G4 Putative uncharacterized protein n=1 Tax=Acaryochloris marina MBIC11017 RepID=B0C1G4_ACAM1 Length = 971 Score = 53.2 bits (126), Expect = 2e-05, Method: Composition-based stats. Identities = 22/183 (12%), Positives = 53/183 (28%), Gaps = 28/183 (15%) Query: 251 MFCLMDVSGSMDQSTKDMAKRFYILLYLFLSRTY---KNVEVVYIRHHTQAKEVDEHEFF 307 M ++D SGS+ S + + + + + +L + V + + Sbjct: 576 MMFILDKSGSVSLSERRLQRDAVMAMLNYLVDNNITSRVGIVRFDSTSATVIGYTDVTAA 635 Query: 308 YSQE-------------TGGTIVSSALKLMDEVVKERYNPAQWNIYAAQASDGDNWADDS 354 G T + + + +P +DG+ + S Sbjct: 636 NLPTFESALNTNYVNIGGGATNWEAGFQQAISLGVSPGSPD----VVFFFADGNINSGGS 691 Query: 355 PLCHEILAKKLLPVVRYYS--------YIEITRRAHQTLWREYEHLQSTFDNFAMQHIRD 406 P + K+ + +++IT ++ T + ++ D + D Sbjct: 692 PNDEALQFKQAGAHIYGIGIQSLDIDDFLDITDGSNTTQFDAALDNANSADYVEVNSYDD 751 Query: 407 QDD 409 D Sbjct: 752 LAD 754 >UniRef50_UPI0001925847 PREDICTED: similar to polydom n=1 Tax=Hydra magnipapillata RepID=UPI0001925847 Length = 2514 Score = 53.2 bits (126), Expect = 2e-05, Method: Composition-based stats. Identities = 33/236 (13%), Positives = 77/236 (32%), Gaps = 26/236 (11%) Query: 164 ISVVRSLQNSLARRTAMTAGKRRELHALEENLAIISNSEPAQLLEEERLRKEIAELRAKI 223 +SV+R + ++ ++ + + S++ L+ + + Sbjct: 83 LSVMRRSREAIRKKRPE---SWANNSWILHHDNAPSHTALENRLKMDTFFNFVIIFVCIF 139 Query: 224 ERVPFIDTFDLRYKNYEKRPDPSSQAVMFCLMDVSGSMDQSTKDMAKRFYILLYLFLSRT 283 E + ++ N ++ +S+A + L+DVSGS+ + + F L +S Sbjct: 140 ELTHSTRDWRVQRLNQFQQQYNNSKADIIFLIDVSGSISDDGFNTEREFVSSLLSKISVQ 199 Query: 284 YKNVEVVYIRHHTQ---------------AKEVDEHEFFYS--QETGGTIVSSALKLMDE 326 + + K EF ++ G T ++ AL+ Sbjct: 200 PSAARIAVVTFGRDINKDIDYIDYGYLDKNKCTFNEEFKRVKHRKEGWTNINGALQKAKA 259 Query: 327 VV----KERYNPAQWNIYAAQASDGD-NWADDSPLCHEILAKKLLPVVRYYSYIEI 377 ++ ++++ N A +DG N+ L + V +S Sbjct: 260 LLDSANEKKFKRHNVNTVAVLLTDGGWNYGGSPYDTATNL-RTGFHYVDIFSIGVG 314 >UniRef50_C0GKG1 von Willebrand factor type A n=1 Tax=Dethiobacter alkaliphilus AHT 1 RepID=C0GKG1_9FIRM Length = 272 Score = 53.2 bits (126), Expect = 2e-05, Method: Composition-based stats. Identities = 29/221 (13%), Positives = 56/221 (25%), Gaps = 17/221 (7%) Query: 193 ENLAIISNSEPAQLLEEERLRKEIAELRAKIERVPFIDTFDLRYKNYEKRPDPSSQAVMF 252 E + ++P Q + L + A R+ D LR +++ D + + Sbjct: 35 EGFYTLRETQPRQHGCRDSLAVAETMICAARRRLTSGDAQFLRNEDFRVIKDGGAPPLEV 94 Query: 253 CLM-DVSGSMDQSTKDMAKRFYILLYLFLSR-----TYKNVEVVYIRHHTQAKEVDEHEF 306 CL+ D SGSM+ K L + T++ +V T+ + Sbjct: 95 CLLVDTSGSMNGKRIREVKTLADNLVRQMHEPLSLITFQEGDVGVKVRSTRNDLMVRRGL 154 Query: 307 FYSQETGGTIVSSALKLMDEVVKERYNPAQWNIYAAQASDGDNWADDSPL-----CHEIL 361 G T + ++ + R +DG E Sbjct: 155 AAMSAAGLTPMGEGIRTAVNYLCGRRGKKH---LVILITDGLPTWASGDKDPYLDAIEAG 211 Query: 362 AKKLLPVVRYYSYIEITRRAHQTLWREYEHLQSTFDNFAMQ 402 A + + + Sbjct: 212 ALIKKHKMHLICIGL---EPQRKFLEKLAESADASLYIVDD 249 >UniRef50_Q3APD8 von Willebrand factor, type A n=1 Tax=Chlorobium chlorochromatii CaD3 RepID=Q3APD8_CHLCH Length = 334 Score = 53.2 bits (126), Expect = 2e-05, Method: Composition-based stats. Identities = 24/208 (11%), Positives = 56/208 (26%), Gaps = 42/208 (20%) Query: 249 AVMFCLMDVSGSMDQST-----KDMAKRFYILLYLFLSRTYKNVEVVYIRHH------TQ 297 A + ++DVS SM + AK+ + + + + + + T Sbjct: 89 ADVLFILDVSRSMQATDVAPNRLMRAKQEIAAISQNVQGGRRGLLI-FAASPLLHCPLTT 147 Query: 298 AKE----VDEHEFFYSQETGGTIVSSALKLM---DEVVKERYNPAQWNI-YAAQASDGDN 349 ++ + E GT + A L +V E + + SDG++ Sbjct: 148 DRDGFATLLNMAAPELIEEQGTRLQPAFALASTIFDVANESNAASTRGVQVIVLLSDGED 207 Query: 350 WADDSPLCHEILAKKLLPVVRYYSYI---------------------EITRRAHQTLWRE 388 + + LAK+ + + + R + + Sbjct: 208 HDSNVQRAAQQLAKQSVQ-LFVIGVGSLKPSPIPLADGSFKRDASGQVVMSRFRPQMLQA 266 Query: 389 YEHLQSTFDNFAMQHIRDQDDIYPVFRE 416 + + + D+ Sbjct: 267 FARQAKGLYRHSHAEVWASADVVNRINR 294 >UniRef50_Q021L5 von Willebrand factor, type A n=1 Tax=Candidatus Solibacter usitatus Ellin6076 RepID=Q021L5_SOLUE Length = 337 Score = 53.2 bits (126), Expect = 2e-05, Method: Composition-based stats. Identities = 32/214 (14%), Positives = 67/214 (31%), Gaps = 26/214 (12%) Query: 215 EIAELRAKIERVPFIDTFDLRYKNYEKRPDPSSQAVMFCLMDVSGSMDQSTKDMAKRFYI 274 L +E++ F D + + + D SGSM ++ Sbjct: 76 HQDRLITGLEKIHFHLFDDKVEQEITTFASEDVPVSIVIVFDCSGSM-GPKLAKSRAAVA 134 Query: 275 LLYLFLSRTYKNVEVVY-------IRHHTQAKEVDEHEFFYSQETGGTIVSSALKLMDEV 327 + + V++ + Q E+ + + FY+Q G T + A+ L + Sbjct: 135 AFLSSANPEDEFSLVLFNDRAQLVSGFNRQTDEL-QSKLFYAQSKGRTALLDAIYLAMDQ 193 Query: 328 VK--ERYNPAQWNIYAAQASD-GDNWADDSPLCHEILAKKLLPVVRYYSYIEITRRAHQT 384 +K + A SD GDN + S + K+ + +E ++ Sbjct: 194 MKHAKHSRKA-----VLVISDGGDNCSRYSMREVKNRVKEGDAQIYSIGILEAMGFRGRS 248 Query: 385 --------LWREYEHLQSTFDNFAMQHIRDQDDI 410 L + F + ++ + D+ Sbjct: 249 AEELAGPALLDDIASQSGGRL-FEIDNLNELSDV 281 >UniRef50_C6XTR0 von Willebrand factor type A n=1 Tax=Pedobacter heparinus DSM 2366 RepID=C6XTR0_PEDHD Length = 344 Score = 53.2 bits (126), Expect = 2e-05, Method: Composition-based stats. Identities = 24/155 (15%), Positives = 48/155 (30%), Gaps = 20/155 (12%) Query: 240 EKRPDPSSQAVMFCLMDVSGSM-----DQSTKDMAKRFYILLYLFLSRTYKNVEVVYIRH 294 + S + + L+DVS SM + + AKR L L + +++ Sbjct: 81 KIEEAKRSGSDLMILLDVSNSMLAGDLAPNRLENAKRAISQLIDNLH-NDRIGIIIFAGE 139 Query: 295 H----------TQAKEVDEHEFFYSQETGGTIVSSALKLMDEVVKERYNPAQWNIYAAQA 344 + AK + T GT + +A+ + + + + Sbjct: 140 AYVQLPITTDYSAAKLFLNNITTDIVPTQGTAIGAAIDMG--MKSFNFV-NGTSKAMILM 196 Query: 345 SDGDNWADDSPLCHEILAKKLLPVVRYYSYIEITR 379 +DG+N DD+ + A + Sbjct: 197 TDGENHEDDAVSAAKR-ASAKDVAIHVIGVGSEEG 230 >UniRef50_C9PFU4 Putative hemolysin n=1 Tax=Vibrio furnissii CIP 102972 RepID=C9PFU4_VIBFU Length = 1476 Score = 52.8 bits (125), Expect = 2e-05, Method: Composition-based stats. Identities = 23/159 (14%), Positives = 38/159 (23%), Gaps = 36/159 (22%) Query: 229 IDTFDLRYKNYEKRPDPSSQAVMFCLMDVSGSM---------------------DQSTKD 267 + D+ P + ++D SGSM + S Sbjct: 847 VLMGDIGGYVVSV--QPGVNYNIALIIDTSGSMKFDLAGNDNGFSNSYQSQSQYNASRMK 904 Query: 268 MAKRFYILLYLFLSRTYKNVEVVYIRHHTQAKEVDE------------HEFFYSQETGGT 315 + L L + + I + A E GGT Sbjct: 905 LVIDALTNLATDLVNHDGVININLIGFESSAHSALTLQLTADNLQQLLTEIQDMDAEGGT 964 Query: 316 IVSSALKLMDEVVKERYNPAQWNIYAAQASDGDNWADDS 354 +A L + N+ +DGD +S Sbjct: 965 NYEAAFDLASNWFSHQPTEGYENLT-YFLTDGDPTFSNS 1002 >UniRef50_UPI000180D155 PREDICTED: similar to integrin alpha Hr1 n=1 Tax=Ciona intestinalis RepID=UPI000180D155 Length = 1595 Score = 52.8 bits (125), Expect = 2e-05, Method: Composition-based stats. Identities = 32/189 (16%), Positives = 59/189 (31%), Gaps = 25/189 (13%) Query: 248 QAVMFCLMDVSGSMDQSTKDMAKRFYILLYLFLSRTY---KNVEVVYIRHHTQAKEVDEH 304 + + ++D SGS++Q KR+ + + VVY + T + VD Sbjct: 415 KIDIVLVVDQSGSVNQCNFQKVKRWLRDIVRSFNLGVTEQDVGVVVYSKKATTSTVVDLG 474 Query: 305 EFFYSQ------------------ETGGTIVSSALKLMDEVVKERYNPAQWNIYAAQASD 346 Y E G T A KL +E++ + +D Sbjct: 475 FSDYDSDGHTKKQEMTKILKKLAYEGGTTYTGYAFKLANEMLTGNKSRPDAKKMIILLTD 534 Query: 347 GDNWADDSPLCHEILAKKLLPVVRYYSYIEITRRAHQTLWREYEHLQSTFDNFAMQHIRD 406 G A ++ E L V + + +QT + + FA+ + Sbjct: 535 GATTAANTLQLKEELDVSRAANVMILAVGV--GKFNQTELIQIA--GDRKNFFAVTKFSE 590 Query: 407 QDDIYPVFR 415 + + R Sbjct: 591 LEKVRDKLR 599 >UniRef50_UPI00006CD1DE von Willebrand factor type A domain containing protein n=1 Tax=Tetrahymena thermophila RepID=UPI00006CD1DE Length = 938 Score = 52.8 bits (125), Expect = 2e-05, Method: Composition-based stats. Identities = 25/193 (12%), Positives = 52/193 (26%), Gaps = 30/193 (15%) Query: 247 SQAVMFCLMDVSGSMDQSTKDMAKRFYILLYLFLSRTYKNVEVVYIR------------H 294 S++ L+D S SM+ A IL L V+ + Sbjct: 311 SKSEFIFLLDRSASMEGLPIKRACEALILFLKSL-PNDSYFNVLSFGSEFEMLFPSSRKY 369 Query: 295 HTQAKEVDEHEFFYSQET-GGTIVSSALKLMDEVVKERYNPAQWNIYAAQASDGDNWADD 353 + Q E+ GT + + L + + ++N +DG Sbjct: 370 NNQNLEIAIKIISNYTANLLGTEIYNPLSCIYT----KKRIQKYNRQIFLLTDG---HVS 422 Query: 354 SPLCHEILAKK--LLPVVRYYSYIEITRRAHQTLWREYEHLQSTFDNFAMQHIRDQDDIY 411 + L KK V + + L + + + + D+ Sbjct: 423 NRDEVLNLIKKNNQFDRVHTIGFGSDADKY---LVNKSAFYGKG----ISRIVDFKSDLS 475 Query: 412 PVFRELFHKQNAT 424 + ++ + Sbjct: 476 RIVLQMLCQSLTP 488 >UniRef50_Q2SNJ1 Uncharacterized protein encoded in toxicity protection region of plasmid R478, contains von Willebrand factor (VWF) domain n=1 Tax=Hahella chejuensis KCTC 2396 RepID=Q2SNJ1_HAHCH Length = 223 Score = 52.8 bits (125), Expect = 2e-05, Method: Composition-based stats. Identities = 22/149 (14%), Positives = 40/149 (26%), Gaps = 21/149 (14%) Query: 238 NYEKRPDPSSQAVMFCLMDVSGSMDQSTKDMAKRFYILLYLFLSRTY----KNVEVVYIR 293 + + S + ++D S SM LL L + +V Sbjct: 8 DVVFNDNNSQRTPCVLVLDGSSSMFGEPIRQLNEGLKLLERALKEDASTAMRVQLLVIRA 67 Query: 294 HHTQAKEVDEHEFFYSQ-------ETGGTIVSSALKLMDEVVK---ERYNPAQWN---IY 340 + EV G T + A+ L + ++ Y+ + + Sbjct: 68 GNHDQAEVLTDWVDAMDFNAPEVFANGTTPLGGAMNLALDKIEDQKAAYDANGISSTRPW 127 Query: 341 AAQASDGD----NWADDSPLCHEILAKKL 365 SDG NW + C + Sbjct: 128 IILISDGAPTDFNWEAVADRCRHAEQNRK 156 >UniRef50_A6L4M8 Putative uncharacterized protein n=9 Tax=Bacteroides RepID=A6L4M8_BACV8 Length = 453 Score = 52.8 bits (125), Expect = 2e-05, Method: Composition-based stats. Identities = 25/174 (14%), Positives = 53/174 (30%), Gaps = 24/174 (13%) Query: 222 KIERVPFIDTFDLRYKNYEKRP----DPSSQAVMFCLMDVSGSMDQSTKDMAKRFYILLY 277 + + FD + K E S Q +D SGSM + + ++K + + Sbjct: 273 ERYAEKRLQLFDYQSKETEPVKDDKHKVSGQGPYIICVDTSGSMQGNREILSKSAILAIA 332 Query: 278 LFLSRTYKNVEVVYIRHHTQAKEVDEHE----------FFYSQETGGTIVSSALKLMDEV 327 +T++ V I +A + + F + GGT + AL+ + Sbjct: 333 QLTEKTHRKCYV--INFSDEAVSLLIEDLGRDMPRLAEFLNKRFDGGTDIEPALREAAHI 390 Query: 328 VKER-YNPAQWNIYAAQASDGDNWADDSPLCHEILAKKLLPVVRYYSYIEITRR 380 + + + SD + K + + + + Sbjct: 391 INGNDFRESD----IVLISDFEMPPLS--RNLMEQVKVIKRRKTSF-FGLVFGN 437 >UniRef50_C1XZC4 Uncharacterized conserved protein n=2 Tax=Meiothermus silvanus DSM 9946 RepID=C1XZC4_9DEIN Length = 427 Score = 52.8 bits (125), Expect = 2e-05, Method: Composition-based stats. Identities = 31/275 (11%), Positives = 62/275 (22%), Gaps = 36/275 (13%) Query: 76 HPGNDHFVQNDRIERPQGGGGGSGSGQGQASQDGEGQDEFVFQISKDEYLDLLFEDLALP 135 G+ + + + + +L + P Sbjct: 147 AQGSRQPNGQGNSQGDSQDNSQAMPQPNGQGGKKVWMTDTHAPAIERRAWELGHDHAESP 206 Query: 136 NLKQNQQRQLTEYKTHRAGYTANGVPANISVVRSLQNSLARRTAMTAGKRRELHALEENL 195 L + E + R ++ + + + Sbjct: 207 ALSE------AEAEIVRGEVAR---------------AILEHAKTRGSVPAGMVRWAQEI 245 Query: 196 AI--ISNSEPAQLLEEERLRKEIAELRAKIERVPFID-TFDLRYKNYEKRPDPSSQAVMF 252 I +R + R ER+ D R + AV+ Sbjct: 246 VAPVIPWQRVMAHHLRNGVRLTVGRTRPTYERIHRRMGVMDARVRLPGSYSLKPRVAVVV 305 Query: 253 CLMDVSGSMDQSTKDMAKRFYILLYLFLSRTYKNVEVVYIRHHTQAKEVDEHEFFYSQET 312 D SGS+ A + + V V A++V + Sbjct: 306 ---DTSGSVSDRMLGHALGEIQGILRQVGAE--LVVVSTDAQAHAAQKVQRVDQIRLVGG 360 Query: 313 GGTIVSSALKLMDEVVKERYNPAQWNIYAAQASDG 347 GGT +S ++ ++ P +DG Sbjct: 361 GGTDMSVGIEAAMKL---HPRPD----VIVVLTDG 388 >UniRef50_C7N671 Mg-chelatase subunit ChlD n=1 Tax=Slackia heliotrinireducens DSM 20476 RepID=C7N671_SLAHD Length = 257 Score = 52.8 bits (125), Expect = 2e-05, Method: Composition-based stats. Identities = 25/129 (19%), Positives = 44/129 (34%), Gaps = 18/129 (13%) Query: 229 IDTFDLRYKNYEKRPDPSSQAVMFCLMDVSGSMDQSTKDM-AKR-FYILLYLFLSRTYKN 286 I+ +LRY+ R + ++D SGSM + K LL S+ + Sbjct: 67 IEESNLRYQKIAGRGRR----NILFIIDSSGSMVADDRFAKVKGCVISLLESAYSKRVRV 122 Query: 287 VEVVYIR--------HHTQAKEVDEHEFFYSQETGGTIVSSALKLMDEVVKERYNPAQWN 338 + Y T A E+ + G T + AL + +++ + Sbjct: 123 AIISYGGGKARLVLPFTTSA-ELAAERIDELKGGGSTPMVDALGIAGNLLERMRDED--- 178 Query: 339 IYAAQASDG 347 + SDG Sbjct: 179 LSVYLLSDG 187 >UniRef50_A2DPQ9 von Willebrand factor type A domain containing protein n=1 Tax=Trichomonas vaginalis RepID=A2DPQ9_TRIVA Length = 694 Score = 52.8 bits (125), Expect = 2e-05, Method: Composition-based stats. Identities = 24/180 (13%), Positives = 49/180 (27%), Gaps = 28/180 (15%) Query: 251 MFCLMDVSGSMD-QSTKDMAKRFYILLYLFLSRTYKNVEVVYIR------------HHTQ 297 + L+D SGSM + + A + L L K V + ++ Sbjct: 239 IVFLLDCSGSMTIDNRIENAIKAMDLFLHSLEPGVKFEIVRFGSTFNSLFDFKLTEYNDD 298 Query: 298 AKEVDEHEFFYSQET-GGTIVSSALKLMDEVVKERYNPAQWNIYAAQASDGDNWADDSPL 356 + + GGT + + +K + + +P +DG A D+ Sbjct: 299 SLNTALAFIKGTSANLGGTEIFNPIKQIYNEL----SPD----VLFVLTDG---AVDNSQ 347 Query: 357 CHEILAKKLLPVVRYYSYIEITRRAHQTLWREYEHLQSTFDNFAMQHIRDQDDIYPVFRE 416 + + L R + + +D I + + Sbjct: 348 AVLDFVRDSSTKIFSLGLGAGA---DMNLVRNLASFTGGVSEHVLDASQLRDSIIRLLED 404 >UniRef50_UPI000186D9CC conserved hypothetical protein n=1 Tax=Pediculus humanus corporis RepID=UPI000186D9CC Length = 1180 Score = 52.8 bits (125), Expect = 2e-05, Method: Composition-based stats. Identities = 30/212 (14%), Positives = 65/212 (30%), Gaps = 27/212 (12%) Query: 223 IERVPFIDTFDLRYKNYEKRPDPSSQAVMFCLMDVSGSMDQSTKDMAKRFYILLYLFLSR 282 ++ P +D +D R +++ +S + L+D SGSM K++A+ + L Sbjct: 186 WKQDP-VDLYDCRTRSW-FIEAATSPKDIVILVDGSGSMTGIRKEIARHVVNNILDTLGN 243 Query: 283 TYKNVEVVYIRHHTQAKEVDEH-----------EFFYSQETGGT----IVSSALKLMDEV 327 + + T+ + + F E T S AL + Sbjct: 244 NDFVNILSFNETTTEVEPCFKDILVQANLANIRNFKEKMEDITTSNIANFSFALSKAFHL 303 Query: 328 VKER-------YNPAQWNIYAAQASDGDNWADDSPLCHEILAKKLLPVVRYYSYIEITRR 380 +++ Y+ A N +DG + + VR ++Y+ Sbjct: 304 LQKYRENGSDDYSGAHCNQAIMLITDGVPY---NFKEIFAEFNWPNMPVRVFTYLVGREV 360 Query: 381 AHQTLWREYEHLQSTFDNFAMQHIRDQDDIYP 412 A + + ++ + Sbjct: 361 ADVREIKWMACANRGYYVHLSTLAEVREQVLQ 392 >UniRef50_UPI0000E47896 PREDICTED: similar to cache domain containing 1 n=1 Tax=Strongylocentrotus purpuratus RepID=UPI0000E47896 Length = 1395 Score = 52.8 bits (125), Expect = 2e-05, Method: Composition-based stats. Identities = 31/198 (15%), Positives = 58/198 (29%), Gaps = 31/198 (15%) Query: 232 FDLRYKNYEKRPDPSSQAVMFCLMDVSGSMDQSTKDMAKRFYILLYLFLSRTYKN----- 286 D+R KN S + ++D S+ + +A++ LSR + Sbjct: 164 VDVRSKNLYASTVQPSPTNVVIIIDHGSSISPVSLVIAQKAAKTALGALSRKDRVGVLSM 223 Query: 287 -VEVV-----------YIRHHTQAKEVDEHEFFYSQE-TGGTIVSSALKLMDEVVKERYN 333 EVV + KE + G + +SAL+ ++++ + Sbjct: 224 GSEVVTSQPGSCYDDMLAPASAEVKEHLIKFINGIKAMDGPSNHTSALRTAFDLIQRTTS 283 Query: 334 PAQWNI-----YAAQASDGDNWADDSPLCHEILA----KKLLPVVRYYSYIEI----TRR 380 P N S G D +A ++L V +Y + T Sbjct: 284 PMPLNQSKPDSVILYISTGHASNQDEVKAAINIAISENRRLNNRVAIMTYALVEEGRTGL 343 Query: 381 AHQTLWREYEHLQSTFDN 398 R+ + Sbjct: 344 EELAFLRDLAEQDNGTYR 361 >UniRef50_UPI000023F6A9 hypothetical protein FG10431.1 n=2 Tax=Gibberella zeae PH-1 RepID=UPI000023F6A9 Length = 851 Score = 52.8 bits (125), Expect = 2e-05, Method: Composition-based stats. Identities = 29/210 (13%), Positives = 54/210 (25%), Gaps = 28/210 (13%) Query: 221 AKIERVPFIDTFDLRYKNYEKRPDPSSQ--AVMFCLMDVSGSMDQ---------STKDMA 269 + +R + + + L+D SGSM D+ Sbjct: 351 PPDDAGMAAMMVSIRPSDLFRNAIIPQSFSGEILFLLDQSGSMRGGCGSGFNGLRKIDVL 410 Query: 270 KRFYILLYLFLSRTYKNVEVVYIRHHTQAKEVD-----------EHEFFYSQET-GGTIV 317 + +L+ L +T + + E GGT + Sbjct: 411 REAMLLVISGLPKTCSFNIISWGSETRAIWEQSRKHSPDNINEARDYISQIDSNLGGTDL 470 Query: 318 SSALKLMDEVVKERYNPAQWNIYAAQASDGDNWADDSPLCHEILAKKLLPVVRYYSYIEI 377 A K V+ R + + +DG AD + LL +R+++ Sbjct: 471 LRAFK---STVQRRRDES-NPTQIVVLTDGQLNADKPMEFVWKTRQVLLNKIRFFALGIG 526 Query: 378 TRRAHQTLWREYEHLQSTFDNFAMQHIRDQ 407 H+ L L + Sbjct: 527 RNVPHR-LIEGIAELGGGSGEIIDTTQNSR 555 >UniRef50_A9YWQ9 Zinc finger protein n=1 Tax=Medicago truncatula RepID=A9YWQ9_MEDTR Length = 691 Score = 52.8 bits (125), Expect = 2e-05, Method: Composition-based stats. Identities = 24/155 (15%), Positives = 47/155 (30%), Gaps = 25/155 (16%) Query: 212 LRKEIAELRAKIERVPFIDTFDLRYKNYEKRP-DPSSQAVMFCLMDVSGSMDQSTKDMAK 270 L E A + A ++ L+ + + ++DV G+M + K Sbjct: 255 LLPETAVVAANRNYETYVVVLKLKTPAPAPVKVLRRAPVDVVIVLDVGGAMSGQKLRLMK 314 Query: 271 RFYILLYLFLSRTYKNVEVVYIRHH--------------TQAKEVDE--HEFFYSQETGG 314 L+ L+ T + V + A+ + E ++ Sbjct: 315 NTMRLVISSLNATDRLSIVAFSGGSKRLLPLKRMTGGGQRSARRIVEALAAIDQIRD--- 371 Query: 315 TIVSS---ALKLMDEVVKERYNPAQWNIYAAQASD 346 V + ALK +V+++R SD Sbjct: 372 -AVPAKNDALKKAAKVLEDRREKNPVAC-IVVLSD 404 >UniRef50_UPI000050F951 von Willebrand factor type A n=1 Tax=Brevibacterium linens BL2 RepID=UPI000050F951 Length = 358 Score = 52.8 bits (125), Expect = 2e-05, Method: Composition-based stats. Identities = 22/183 (12%), Positives = 49/183 (26%), Gaps = 25/183 (13%) Query: 246 SSQAVMFCLMDVSGSMDQST-------KDMAKRFYILLYLFLSRTYKNVEVVYIRHHTQA 298 + A ++ L+D + SM + K+ + L L + + + + Sbjct: 66 EAAADVYFLVDTTTSMAAEDYDGDKTRLEGVKKDMLDLAKQL-PGTRLSIISFASTASTV 124 Query: 299 KEVDEH--EFF----------YSQETGGTIVSSALKLMDEVVKERYNPAQWNIYAAQASD 346 + F G +I + +L + + + N D Sbjct: 125 MPLTTDHAAFASAVDVLSPEMSLNSNGSSITEAGAELDKRMKSNQEDRPDNNSLVFYFGD 184 Query: 347 GDNWADDSPLCHEILAKKLLPVVRYYSYIEITRRAHQTLWREYEHLQSTFDNFAMQHIRD 406 G+ A+ S A ++ + Y +E + + D Sbjct: 185 GEQTAETSVDSWSSFASRIDDGA-VFGYGTAQGG----KMKEPQPFGYGSSPGDDPGLGD 239 Query: 407 QDD 409 D Sbjct: 240 PQD 242 >UniRef50_A6QCW6 von Willebrand factor type A domain protein n=1 Tax=Sulfurovum sp. NBC37-1 RepID=A6QCW6_SULNB Length = 325 Score = 52.8 bits (125), Expect = 2e-05, Method: Composition-based stats. Identities = 33/208 (15%), Positives = 64/208 (30%), Gaps = 28/208 (13%) Query: 240 EKRPDPSSQAVMFCLMDVSGSM--------DQSTKDMAKRFYILLYLFLS-RTYKNVEVV 290 E SQ + +D+SGSM + D + ++L FL R + + ++ Sbjct: 84 EPVTKDVSQRELLISVDLSGSMMTKDFVNKEGKAIDRLEAVKMVLRDFLKERKGEKIGLI 143 Query: 291 YIRHHTQAKEVDEHEFFYSQ----------ETGGTIVSSALKLMDEVVKERYNPAQWNIY 340 + + + + T + ++ L ++ +E + Sbjct: 144 LFGNAAFVQAPFTQDLDALEHLLDSLRVGMAGPQTAMGDSIGLAVKMFRES---NVTDRM 200 Query: 341 AAQASDGDNWADDS-PLCHEILAKKLLPVVRYYSYIE----ITRRAHQTLWREYEHLQST 395 SDGD+ P LA K V + +E + Sbjct: 201 LIVMSDGDDTGSKVPPKTSAELAAKNGVNVFTIGIGDPKNAGEHPIDTDTLKEIAAITGG 260 Query: 396 FDNFAMQHIRDQDDIYPVFRELFHKQNA 423 +A ++ D DIY +L K+ Sbjct: 261 KFYYAW-NLDDLQDIYKQIDKLKPKEIK 287 >UniRef50_C3JM29 Magnesium-chelatase subunit ChlD n=3 Tax=Actinomycetales RepID=C3JM29_RHOER Length = 649 Score = 52.8 bits (125), Expect = 2e-05, Method: Composition-based stats. Identities = 28/154 (18%), Positives = 47/154 (30%), Gaps = 14/154 (9%) Query: 228 FIDTFDLRYKNYEKRPDPSSQAVMFCLMDVSGSMDQSTKDMAKRFY--ILLYLFLSRTYK 285 + DLR E R ++ ++D SGSM + A LL R K Sbjct: 453 RLAPADLRGAIREGRE----GNLIVFVVDASGSMAARDRLSAVTGAVVSLLRDAYQRRDK 508 Query: 286 NVEVVY-------IRHHTQAKEVDEHEFFYSQETGGTIVSSALKLMDE-VVKERYNPAQW 337 + + T + ++ + G + ++ E V++ER Q Sbjct: 509 VAVITVRGREAELVLPPTSSVDIAVRRLQSMRTGGKSPLAEGFLRAREVVLRERLRDPQR 568 Query: 338 NIYAAQASDGDNWADDSPLCHEILAKKLLPVVRY 371 +DG L +A LL Sbjct: 569 RALVVTLTDGRATGGKDALHRARVAAHLLADASV 602 >UniRef50_B8FBV5 Putative uncharacterized protein n=1 Tax=Desulfatibacillum alkenivorans AK-01 RepID=B8FBV5_DESAA Length = 308 Score = 52.8 bits (125), Expect = 3e-05, Method: Composition-based stats. Identities = 30/196 (15%), Positives = 56/196 (28%), Gaps = 25/196 (12%) Query: 240 EKRPDPSSQAVMFCLMDVSGSMDQST----------KDMAKRFYILLYLFLSRTYKNVEV 289 R + + +D S SM Q K+ T + V Sbjct: 77 ASREIKTPGVDIILCLDASESMAQPDFAIDGQRVNRLTAVKKVVHDFVKR-RDTDRIGLV 135 Query: 290 VYIRHH------TQAKEVDEHEFFYSQ---ETGGTIVSSALKLMDEVVKERYNPAQWNIY 340 V+ + T K + + + T + AL + + +K+ PA + Sbjct: 136 VFGDYAFTQAPLTLDKGLLLNLIENLRIGMAGRKTAIGDALGVAGKRIKD--IPAMSKVV 193 Query: 341 AAQASDGDNWADDSPLCHEILAKKLLPVVRYYSYIEITRRAHQTLWREYEHLQSTFDNFA 400 SDG+N A D ++ Y+ T +A + + + Sbjct: 194 IL-LSDGENTAGD-MTPQGAAEALAALGIKIYTIGMGTEQAGSKELAQIAAIGQGKY-YH 250 Query: 401 MQHIRDQDDIYPVFRE 416 + D IY + Sbjct: 251 ASNTEQLDSIYKEIDK 266 >UniRef50_A0BI06 Chromosome undetermined scaffold_109, whole genome shotgun sequence n=1 Tax=Paramecium tetraurelia RepID=A0BI06_PARTE Length = 2531 Score = 52.8 bits (125), Expect = 3e-05, Method: Composition-based stats. Identities = 29/177 (16%), Positives = 52/177 (29%), Gaps = 19/177 (10%) Query: 250 VMFCLMDVSGSMDQSTKDMAKRFYILLYLFLSRTYKNVEVVYIRHHTQAKEV-------D 302 ++D S SM Q K + + + KN +V I AK V Sbjct: 2345 HYIFILDNSTSMHQ-HWLQVKICMAEQFEQIKQK-KNAKVSVILFGATAKIVINCQAVDI 2402 Query: 303 EHEFFYSQETGG--TIVSSALKLMDEVVKERYNPAQWNIYAAQASDGD-----NWADDSP 355 + + Q G T+ AL E+V + +DG N+ + Sbjct: 2403 DKQIELIQYEGSWFTLFGPALSAARELVLQHS--EFTQTVILFYTDGKPSRFINYQQNDE 2460 Query: 356 LCHEILAKKLLPVVRYYSYIEITRRAHQTLWREYEHLQSTFDNFAMQHIRDQDDIYP 412 + +K Y + +L R + F A++ + + Sbjct: 2461 VDIFCNIEKRFRD-SIYFFACSQPNLSSSLERIIDRFSQAFAQAALRDSIEPFQLNH 2516 >UniRef50_A3ZTC3 Putative uncharacterized protein n=1 Tax=Blastopirellula marina DSM 3645 RepID=A3ZTC3_9PLAN Length = 346 Score = 52.8 bits (125), Expect = 3e-05, Method: Composition-based stats. Identities = 27/161 (16%), Positives = 50/161 (31%), Gaps = 21/161 (13%) Query: 252 FCLM-DVSGSMDQSTKDMAKRFYILLYLFLSRTYKNVEVVY----IRHHTQAKEVDEHEF 306 FC++ D SGSM D K + L R + + + + + + +F Sbjct: 192 FCIIADCSGSMSGVKLDYVKEEILETVSSLPREAQFQVIFFQSQAVPFPQKGWRHPKRDF 251 Query: 307 FYSQET-------GGTIVSSALKLMDEVVKERYNPAQWNIYAAQASDGDNWADDSPLCHE 359 E GGT A ++ + P +DG + D+ + Sbjct: 252 NALSEWLKTVGPAGGTNPLPAFEIALKFSPR---PDA----VFFMTDGL-FDDNVVGEVK 303 Query: 360 ILAKKLLPVVRYYSYIEITRRAHQTLWREYEHLQSTFDNFA 400 P V+ ++ I R+ + L R+ Sbjct: 304 RQNDLSEPKVKVHA-ISFMDRSAEPLMRQIAGESGGEYRHV 343 >UniRef50_C2HF13 von Willebrand factor type A n=1 Tax=Finegoldia magna ATCC 53516 RepID=C2HF13_PEPMA Length = 249 Score = 52.5 bits (124), Expect = 3e-05, Method: Composition-based stats. Identities = 17/124 (13%), Positives = 40/124 (32%), Gaps = 18/124 (14%) Query: 242 RPDPSSQAVMFCLMDVSGSMDQSTKDMAKRFYILLYLFL-----SRTYKNVEVVYIRHHT 296 P V+F ++D SGSM + + L S V++ + ++ Sbjct: 8 DEIPRKMMVLFFVIDTSGSMKGTKIGEVNSAIEEILPELSDISNSNPDAEVKMAILSFNS 67 Query: 297 QAKEVDEHE---------FFYSQETGGTIVSSALKLMDEVVKE----RYNPAQWNIYAAQ 343 + + + + G T + +A + ++ + + + + Sbjct: 68 EIQWITPKTGPVDPGVYLWRDLNANGTTRMGAAFEELESKLHGDKFMKSATSSYAPVIFL 127 Query: 344 ASDG 347 SDG Sbjct: 128 MSDG 131 >UniRef50_B7PIU8 Calcium activated chlorine channel, putative n=2 Tax=Ixodes scapularis RepID=B7PIU8_IXOSC Length = 519 Score = 52.5 bits (124), Expect = 3e-05, Method: Composition-based stats. Identities = 20/179 (11%), Positives = 54/179 (30%), Gaps = 23/179 (12%) Query: 236 YKNYEKRPDPSSQAVMFCLMDVSGSM----DQSTKDMAKRFYILLYLFLSRTYKNV-EVV 290 ++ +++ + S + V+ +DVS SM + + + + Y+ + V Sbjct: 144 FRLFQRSDEKSQRVVLV--LDVSHSMRPRVGEDRLAFLQCATNHMIRHMLHDYQALGIVT 201 Query: 291 Y-----------IRHHTQAKEVDEHEFFYSQETGGTIVSSALKLMDEVVKERYNPAQWNI 339 + + + T A++ GT + L E+++ A+ + Sbjct: 202 FSGRCQVAHPLVVLNTTDARDGIAKVIDGLVLGAGTSIGCGLSKATEMLEGNGTSARGGL 261 Query: 340 YAAQASDGDNWADDSPLCHEILAKKLLPVVRYYSYIEITRRAHQTLWREYEHLQSTFDN 398 +DGD + + V ++ + + + Sbjct: 262 V-FLVTDGDENYKPWIVEQLPILVSSGVKVSTFALGTLA----EKKLEDVALQTGGTAY 315 >UniRef50_Q9WXB0 Magnesium chelatase n=1 Tax=Acidiphilium rubrum RepID=Q9WXB0_ACIRU Length = 558 Score = 52.5 bits (124), Expect = 3e-05, Method: Composition-based stats. Identities = 43/275 (15%), Positives = 73/275 (26%), Gaps = 28/275 (10%) Query: 96 GGSGSGQGQASQDGEGQDEFVFQISKDEYLDLLFEDLALPNLKQNQQRQLTEYKTHRAGY 155 +A D + QDE + +E + + + RA Sbjct: 280 DEPPPEAREAQADDQPQDEPPPGDASEELAERVLDAARSALPLDLLAALAMLGGPRRAAS 339 Query: 156 TANGVPANISVVRSLQNSLARRTAMTAGKRRELHALEENLAIISNSEPAQLLEEERLRKE 215 A+G Q + R A R A + + + P Q L R+E Sbjct: 340 AASGKTP------IAQGGVRGRPAGVRPGRPGNGARLALIDTLRAAAPMQPLR----RRE 389 Query: 216 IAELRAKIERVPFIDTFDLRYKNYEKRPDPSSQAVMFCLMDVSGSMDQSTKDMAKRFYIL 275 A + D R K + + V +D SGS + AK + Sbjct: 390 RAAQNPGPMPRVLVRAEDFRIKRF----IAPVRTVAIFAVDASGSSALNRLAEAKGAIQI 445 Query: 276 LYLF-LSRTYKNVEVVY-------IRHHTQAKEVDEHEFFYSQETGGTIVSSALKLMDE- 326 L R + + + + T A G T ++ + Sbjct: 446 LLADCYVRRDQVALIAFRNRTAELLLPPTGALARARRSLAALPGGGATPLALGIDAACAM 505 Query: 327 -VVKERYNPAQWNIYAAQASD-GDNWADDSPLCHE 359 + + R + +D G N D E Sbjct: 506 GIAERRRGA---SPLIVLLTDGGANIGRDGKPGRE 537 >UniRef50_C8VGR0 von Willebrand domain protein (AFU_orthologue; AFUA_4G01160) n=6 Tax=Trichocomaceae RepID=C8VGR0_EMENI Length = 1109 Score = 52.5 bits (124), Expect = 3e-05, Method: Composition-based stats. Identities = 23/166 (13%), Positives = 47/166 (28%), Gaps = 20/166 (12%) Query: 251 MFCLMDVSGSMDQSTKDMAKRFYILLYLFLSRTYKNVEVVY-----------IRHHTQAK 299 + + D SGSM+ + L + R+ + Sbjct: 342 IIFMADRSGSMES-KISSLINVMNIFIRSLPEACSFNIASFGSEVTWLWPCSKRYSQENL 400 Query: 300 EVDEHEFFYSQET-GGTIVSSALKLMDEVVKERYNP-AQWNIYAAQASDGDNWADDSP-L 356 +V + GGT + AL+ + + +N +DG+ W D+ Sbjct: 401 DVASKHVDSFRANYGGTNIYCALESVLD----HFNKQDDVPTNVILLTDGEVWDVDNVIQ 456 Query: 357 CHEILAKKLLPVVRYYSYIEITRRAHQTLWREYEHLQSTFDNFAMQ 402 +R++S R +H+ L + + Sbjct: 457 LVRRTVSMNGSNIRFFSLGIGDRVSHR-LVEGIGLQGGGYAEVVPE 501 >UniRef50_C1E936 Predicted protein n=3 Tax=Micromonas RepID=C1E936_9CHLO Length = 754 Score = 52.5 bits (124), Expect = 3e-05, Method: Composition-based stats. Identities = 31/193 (16%), Positives = 51/193 (26%), Gaps = 27/193 (13%) Query: 177 RTAMTAGKRRELHALEENLAIISNSEPAQLLEEERLRKEIAELRAKIERVPFIDTFDLRY 236 R + + A + + P Q + R I+ D+R Sbjct: 467 RYVKPVFPKGGVVRRVAIDATLRAAAPHQPSRRK------RRGDPPDGRRVRIERDDIRN 520 Query: 237 KNYEKRPDPSSQAVMFCLMDVSGSMDQSTKDMAK-RFYILLYLFLSRTYKNVEVV----- 290 K + + + L+D SGSM + AK LL ++ V Sbjct: 521 KKMSRA----AGTLTVFLVDASGSMALNRMAAAKGAALTLLAESYTKRDAVALVSARGDA 576 Query: 291 --YIRHHTQAKEVDEHEFFYSQETGGTIVSSALKLMDEVV-------KERYNPAQWNIYA 341 I +++ E GGT ++ L V Sbjct: 577 AEVILPPSRSIARAEARLAALPCGGGTPLAHGLSTAARVAINAARTGGGGGGKCSRTRVV 636 Query: 342 AQASD-GDNWADD 353 +D G N D Sbjct: 637 L-LTDGGANVGLD 648 >UniRef50_A9GNX2 Putative membrane protein n=1 Tax=Sorangium cellulosum 'So ce 56' RepID=A9GNX2_SORC5 Length = 384 Score = 52.5 bits (124), Expect = 3e-05, Method: Composition-based stats. Identities = 23/131 (17%), Positives = 42/131 (32%), Gaps = 15/131 (11%) Query: 245 PSSQAVMFCLMDVSGSM-----DQSTKDMAKRFYILLYLFLSRTYKNVEVVYIRHH---- 295 P++ + ++D S SM + S AK L L + V + Sbjct: 85 PATNVDVVVVLDYSKSMYARDVEPSRIFRAKVEVARLIKDL-EGARFGAVAFAGEPMGFP 143 Query: 296 -TQAKEVDEHEFFYSQET----GGTIVSSALKLMDEVVKERYNPAQWNIYAAQASDGDNW 350 T F GGT ++ AL +E++K A+ +DG++ Sbjct: 144 LTADGAAIAQFFRQLDPNDMPIGGTAIARALDQANELLKRDPKSAEHKRIILLVTDGEDL 203 Query: 351 ADDSPLCHEIL 361 + + Sbjct: 204 EGYPLSVAQAI 214 >UniRef50_O26655 Magnesium chelatase subunit n=1 Tax=Methanothermobacter thermautotrophicus str. Delta H RepID=O26655_METTH Length = 182 Score = 52.5 bits (124), Expect = 3e-05, Method: Composition-based stats. Identities = 26/160 (16%), Positives = 45/160 (28%), Gaps = 16/160 (10%) Query: 254 LMDVSGSM-DQSTKDMAKRFYILLYLFLSRTYKNVEVV--------YIRHHTQAKEVDEH 304 ++D+SGSM K R + VV I T Sbjct: 1 MVDISGSMFSDRKAARVKGLIERFIEDAQRHRDRISVVGFRGRDARVIIPSTAHASSFRD 60 Query: 305 EFFYSQETGGTIVSSALKLMDEVVKERYNPAQWNIYAAQASDG-DNWAD--DSPLCHEIL 361 + G T ++ ++ E+++E ++ + SDG N D Sbjct: 61 AVESIRVGGTTPMAQGIQRGLEILREEKRHGEYVPFMVILSDGMPNVGTGRDPKREAVEA 120 Query: 362 AKKLLPV---VRYYSYIEITRRAHQTLWREYEHLQSTFDN 398 A +L ++ E R + L E Sbjct: 121 ASRLREEEIPSTVINF-ERGSRGGRDLNMEIALASGGSYY 159 >UniRef50_C2GKW9 Putative uncharacterized protein n=1 Tax=Corynebacterium glucuronolyticum ATCC 51866 RepID=C2GKW9_9CORY Length = 174 Score = 52.5 bits (124), Expect = 3e-05, Method: Composition-based stats. Identities = 15/106 (14%), Positives = 37/106 (34%), Gaps = 10/106 (9%) Query: 252 FCLMDVSGSMDQSTKDMAKRFY--ILLYLFLSRTYKNVEV-------VYIRHHTQAKEVD 302 ++D SGS+ + A +L R + + + T++ ++ Sbjct: 1 MFVVDASGSVAAKDRLKAVTGACISILQDAYRRRDRVAVISVRGKKATVLVPPTRSVDIA 60 Query: 303 EHEFFYSQETGGTIVSSALKLMDEVVKER-YNPAQWNIYAAQASDG 347 ++ G T ++S L+ +++ + A +DG Sbjct: 61 VSRLSQARVGGKTPLASGLEETYKLIDREVFKSPGLRSIAVVLTDG 106 >UniRef50_B3T1G6 Putative von Willebrand factor type A domain protein n=1 Tax=uncultured marine microorganism HF4000_009L19 RepID=B3T1G6_9ZZZZ Length = 317 Score = 52.5 bits (124), Expect = 3e-05, Method: Composition-based stats. Identities = 20/144 (13%), Positives = 38/144 (26%), Gaps = 11/144 (7%) Query: 237 KNYEKRPDPSSQAVMFCLMDVSGSMDQSTKDMAKRFYILLYLFLSRTYKNVEVVYIRHHT 296 + E + ++D SGS+ + + L + V + + Sbjct: 73 QQIESVLLEDVPITLLLVLDTSGSVVGAPLAQLLMAAEAVAEALRPDDRVGLVTFSHNVR 132 Query: 297 QAKEVDE------HEFFYSQETGGTIVSSALKLMDEVVKERYNPAQWNIYAAQASDGDNW 350 E + TGGT + A + + SDGD+ Sbjct: 133 VVVEPPSLPASLPDALRRVRATGGTALYDATFAAFALRERTVGRT----LMLVFSDGDDT 188 Query: 351 ADDSPLCHEILAKKLLPVVRYYSY 374 ++L V Y+ Sbjct: 189 -TSWLDPRDVLNTAQRSDVVVYAV 211 >UniRef50_C0F0K9 Putative uncharacterized protein n=2 Tax=Eubacterium hallii DSM 3353 RepID=C0F0K9_9FIRM Length = 291 Score = 52.5 bits (124), Expect = 3e-05, Method: Composition-based stats. Identities = 19/124 (15%), Positives = 39/124 (31%), Gaps = 15/124 (12%) Query: 239 YEKRPDPSSQAVMFCLMDVSGSMDQSTKDMAKRFYILLYLFL---SRTYKNVEVVYIRHH 295 + +F ++DVSGSM + L L +V++ ++ Sbjct: 42 LDTMEPAKKSMTIFFMIDVSGSMKGTKIGSLNSTMEELLPSLIGVGEASTDVKIAIMKFS 101 Query: 296 TQAKEVDEHEF--------FYSQETGGTIVSSAL-KLMDEVVKERYNPA---QWNIYAAQ 343 T + V + G T + A +L ++ + + + + Sbjct: 102 TDVEWVTPEPVKIEEYQYWNRLEADGLTFMGDAFMELSKKLSRSTFLSSPSLSFAPVIFL 161 Query: 344 ASDG 347 SDG Sbjct: 162 LSDG 165 >UniRef50_UPI0001976EAB hypothetical protein BbifN4_00972 n=1 Tax=Bifidobacterium bifidum NCIMB 41171 RepID=UPI0001976EAB Length = 1153 Score = 52.5 bits (124), Expect = 3e-05, Method: Composition-based stats. Identities = 25/127 (19%), Positives = 34/127 (26%), Gaps = 21/127 (16%) Query: 247 SQAVMFCLMDVSGSMDQS-----TKDMAKRFY-----ILLYLFLSRTYKNVEVVYIRHHT 296 S A + + D SGSM ++AK LL N+ + + T Sbjct: 607 SPADIVVVFDTSGSMSNPMGHNSRLEVAKTAVNSMAQHLLTSENQGKDSNIRMALVPFST 666 Query: 297 QAKEVDE---------HEFFYSQETGGTIVSSALKLMDEVVKERYNPAQWNIYAAQASDG 347 V + GGT + L K Y SDG Sbjct: 667 TVGNVSNFTDNAMDIVSAVNGLRADGGTNWEA--ALKAANAKLTSGRKGVKKYIVFMSDG 724 Query: 348 DNWADDS 354 D S Sbjct: 725 DPTFRTS 731 >UniRef50_B9L3Z2 von Willebrand factor type A n=2 Tax=Bacteria RepID=B9L3Z2_THERP Length = 699 Score = 52.5 bits (124), Expect = 3e-05, Method: Composition-based stats. Identities = 26/185 (14%), Positives = 46/185 (24%), Gaps = 11/185 (5%) Query: 168 RSLQNSLARRTAMTAGKRRELHALEENLAIISNSEPAQLLEEERLRKEIAELRAKIERVP 227 R + L R R E + L A R Sbjct: 433 REIFQHLKRNRFGQHQIERSGRGSEPLEETKAYEFGDPFLLHLPRTVMNAIQREGPGTPV 492 Query: 228 FIDTFDLRYKNYEKRPDPSSQAVMFCLMDVSGSM-DQSTKDMAKRFYILLYLFLSRTYKN 286 + D E A ++D+S SM AK+ + L + + Sbjct: 493 RLQPADFEVFRTETVTR----AATVLMVDMSRSMLYNGCFLAAKKVALALDSLIRTQFPR 548 Query: 287 VEVVYIRHHTQAKEVDEHEFFYSQETG---GTIVSSALKLMDEVVKERYNPAQWNIYAAQ 343 + I A ++ E + GT + L +++ Sbjct: 549 DVLYIIGFSYLASVLEPEELPHITWDEYNYGTNMQHGFMLARQLLARH---RGGTRQIIL 605 Query: 344 ASDGD 348 +DG+ Sbjct: 606 ITDGE 610 >UniRef50_B3RLQ7 Putative uncharacterized protein n=1 Tax=Trichoplax adhaerens RepID=B3RLQ7_TRIAD Length = 1828 Score = 52.5 bits (124), Expect = 3e-05, Method: Composition-based stats. Identities = 30/235 (12%), Positives = 60/235 (25%), Gaps = 27/235 (11%) Query: 76 HPGNDHFVQNDRIERPQGGGGGSGSGQGQASQDGEGQDEF--VFQISKDEYLDLLFEDLA 133 P N V + GG +G G + + V K+ + E Sbjct: 1479 DPSNAPHVGGNTWAGGTGGRSTAGLGGRGGPYRLDAGHDVYQVSDTDKEVVPPEISEAAR 1538 Query: 134 LPNLKQNQQRQLTEYKTHRAGYTANGVPANISVVRSLQNSLARRTAMTAGKRRELHALEE 193 + + + + + N++ R+ L A Sbjct: 1539 --KMNRKAFQDRLREIEM--------SEYDADAYDKVSNTVQRQVKALRVILDSLEAKGL 1588 Query: 194 NLAIISNSEPAQLLEEERLRKEIAELRAKIERVPFIDTFDLRYKNYEKRPDPSSQAVMFC 253 N L + I L + + Y + ++ + + Sbjct: 1589 ERTWAKNQTTGDLDDNRL----IEGLTGERTIYKRRADKEPEYNSQDQEHPKRLRLAV-- 1642 Query: 254 LMDVSGSM------DQSTKDMAKRFYILLYLFLSRTYKNVEVVYIRHHTQAKEVD 302 DVSGSM D + + +L+ F + + H + E+ Sbjct: 1643 --DVSGSMYRFNGVDGRLQRTLEATCMLMEAF-ENYDQKFQYEIFGHSGEDPEIP 1694 >UniRef50_Q21JX5 von Willebrand factor, type A n=8 Tax=Gammaproteobacteria RepID=Q21JX5_SACD2 Length = 341 Score = 52.5 bits (124), Expect = 3e-05, Method: Composition-based stats. Identities = 25/156 (16%), Positives = 45/156 (28%), Gaps = 24/156 (15%) Query: 240 EKRPDPSSQAVMFCLMDVSGSMDQSTKD----------MAKRFYILLYLFLSRTYKNVEV 289 E+ P++ + +D+SGSMD + K + V Sbjct: 82 EEVHLPTTGRDLLVAVDISGSMDTKDMVVQNQQIPRIAVVKHIVGDFIER-RVGDRLGLV 140 Query: 290 VY--IRHHTQAKEVDEHEFFYSQ-------ETGGTIVSSALKLMDEVVKERYNPAQWNIY 340 ++ + D T + A+ L + +++R N Sbjct: 141 LFGTSAYLQSPLTFDRTTVKQLLVESQIGFAGPNTAIGDAIGLSIKRLRDR---PAENRV 197 Query: 341 AAQASDGDNW-ADDSPLCHEILAKKLLPVVRYYSYI 375 +DG N + SP LAK+ V Sbjct: 198 VILLTDGQNTAGEVSPRQAADLAKQSGVKVYTIGVG 233 >UniRef50_A4QP95 LOC563828 protein (Fragment) n=4 Tax=Cyprinidae RepID=A4QP95_DANRE Length = 835 Score = 52.5 bits (124), Expect = 3e-05, Method: Composition-based stats. Identities = 32/195 (16%), Positives = 56/195 (28%), Gaps = 27/195 (13%) Query: 197 IISNSEPAQLLEEERLRKEIAELRAKIERVPFIDTFDLRYKNYEKRPDPSSQAVMFCLMD 256 S + + L ++ L + + + +F LMD Sbjct: 296 YAQYSFDQPAIVAQALGGSLSAALDVSLPDFKKKGQSL-GRTIKVEE---GRLNVFILMD 351 Query: 257 VSGSMDQSTKDMAKRFYILLYLFLSR---TYKNVEVVYIRHHTQAKEVDEHE-------F 306 SGS+ Q T AK+ I L L K V Y + + Sbjct: 352 TSGSISQDTFQAAKKAIIELVRKLDSYEVNMKFDIVSYASEPREIVSITSFNSHDVDFVL 411 Query: 307 FYSQE--------TGGTIVSSALKLMDEVVK-----ERYNPAQWNIYAAQASDGDNWADD 353 E GT +S AL+ + + ++ + + A+DG + Sbjct: 412 RKLSEFSDEVHENRRGTDLSKALERVYGQLALLRENKKSHFNETQNILIIATDGHSNMGP 471 Query: 354 SPLCHEILAKKLLPV 368 +P + LL Sbjct: 472 NPQIMLNKIRSLLGY 486 >UniRef50_Q22X70 von Willebrand factor type A domain containing protein n=1 Tax=Tetrahymena thermophila SB210 RepID=Q22X70_TETTH Length = 783 Score = 52.5 bits (124), Expect = 3e-05, Method: Composition-based stats. Identities = 14/98 (14%), Positives = 33/98 (33%), Gaps = 10/98 (10%) Query: 247 SQAVMFCLMDVSGSMDQSTKDMAKRFYILLYLFLSRTYKNVEVVY---------IRHHT- 296 + ++D+S SM + K+ L FL+ + + + + H T Sbjct: 83 QPLDLIFVIDLSISMRGKKMNQLKKTICNLINFLNENDRMALIGFNNSAQNLFPLSHLTQ 142 Query: 297 QAKEVDEHEFFYSQETGGTIVSSALKLMDEVVKERYNP 334 Q K+ G T +++ + + ++ Sbjct: 143 QNKKKVTQILNSILPMGLTNITAGMMEAIKQLESSLIN 180 >UniRef50_Q22ML1 von Willebrand factor type A domain containing protein n=1 Tax=Tetrahymena thermophila SB210 RepID=Q22ML1_TETTH Length = 685 Score = 52.5 bits (124), Expect = 3e-05, Method: Composition-based stats. Identities = 27/196 (13%), Positives = 55/196 (28%), Gaps = 37/196 (18%) Query: 253 CLMDVSGSMDQSTKDM-AKRFYILLYLFLSRTYKNVEVVYIRH----------HTQAKEV 301 L+D S SM K K++ L + + + + + E Sbjct: 91 ILIDRSQSMMSENKLQNVKQYLCNLIEKANTNSQFALISFGSSQKLIFNFTQVTHENLES 150 Query: 302 DEHEFFYSQETGGTIVSSALKLMDEVV---------KERYNPAQWNIY-AAQASDG-DNW 350 + + TG T + AL++ ++ KE + Y A +DG DN Sbjct: 151 IKGQINNIISTGDTNIIQALEVAHNIIKQDQQLENQKEEQTKKRIVRYSAFLLTDGQDNM 210 Query: 351 ADDSPLCHEILAKKLLPV-----VRYYSYIEITRRAHQTLWREYEHLQSTFDNFAMQHIR 405 + + + L + I+ Sbjct: 211 ---KEKAIFKFRENFKNKDMDYSINCLGFGI---DHDPLLLGAITSYTGGKFYY----IK 260 Query: 406 DQDDIYPVFRELFHKQ 421 ++ ++ VF++ Q Sbjct: 261 PEESVFSVFQDYIKNQ 276 >UniRef50_Q608G4 Putative MxaC protein n=1 Tax=Methylococcus capsulatus RepID=Q608G4_METCA Length = 325 Score = 52.5 bits (124), Expect = 3e-05, Method: Composition-based stats. Identities = 26/128 (20%), Positives = 39/128 (30%), Gaps = 26/128 (20%) Query: 242 RPDPSSQAVMFCLMDVSGSMD------------QSTKDMAKRFYILLYLFLSR--TYKNV 287 + A + LMD S SM+ +K+ R LL F+ R Sbjct: 75 EERLGTGAHIVLLMDRSSSMNENFSGRYLGGTAGESKNAVAR--RLLADFVRRRGDDLFA 132 Query: 288 EVVYIRHH------TQAKEVDEHEFFYS--QETGGTIVSSALKLMDEVVKERYNPAQWNI 339 V + TQ +E + G T ++ L + + R PA Sbjct: 133 MVAFSAAPRYVMPLTQDREAVLAAIDSVGDRGHGITNIAPGLAMALDFFNGR--PATGAR 190 Query: 340 YAAQASDG 347 SDG Sbjct: 191 IILLVSDG 198 >UniRef50_Q9UKK3 Poly [ADP-ribose] polymerase 4 n=14 Tax=Eutheria RepID=PARP4_HUMAN Length = 1724 Score = 52.5 bits (124), Expect = 3e-05, Method: Composition-based stats. Identities = 31/175 (17%), Positives = 53/175 (30%), Gaps = 26/175 (14%) Query: 238 NYEKRPDPSSQAVMFCLMDVSGSMDQSTKDMAKRFYILLYLFLSRTYKNVEVVY------ 291 + + S V+ CL D S SM+ T AK+ + + K + + Sbjct: 865 DVDLPDLASESEVIICL-DCSSSMEGVTFLQAKQIALHALSLVGEKQKVNIIQFGTGYKE 923 Query: 292 -------IRHHTQAKEVDEHEFFYSQETGGTIVSSALKLMDEVVKERYNPAQWNIYAAQA 344 I +T A E G T L+ + PA+ + Sbjct: 924 LFSYPKHITSNTAAAEFIMSAT---PTMGNTDFWKTLRYL-----SLLYPARGSRNILLV 975 Query: 345 SDGDNWADDSPLCHEILAKKLLPVVRYYSYIEITRRAHQTLWREYEHLQSTFDNF 399 SDG L K+ P R ++ I A++ + R + + Sbjct: 976 SDG---HLQDESLTLQLVKRSRPHTRLFACG-IGSTANRHVLRILSQCGAGVFEY 1026 >UniRef50_C0ZE04 Putative uncharacterized protein n=1 Tax=Brevibacillus brevis NBRC 100599 RepID=C0ZE04_BREBN Length = 572 Score = 52.5 bits (124), Expect = 3e-05, Method: Composition-based stats. Identities = 31/208 (14%), Positives = 66/208 (31%), Gaps = 21/208 (10%) Query: 169 SLQNSLARRTAMTAGKRRELHALEENLAIISNSEPAQLLEEERLRKEIAELRAKIE--RV 226 + N + R+ KR+E+ + + + + + Sbjct: 312 AFLNEVGRQVHRFRVKRKEIRSKHFPEEYYDLRQSGDIAHMLPGEAVLLADPDFENYFML 371 Query: 227 PFIDTFDLRYKNYEKRPDPSSQAVMFCLMDVSGSMDQSTKDMAKRFY--ILLYLFLSRTY 284 +++ + Y +P + + C++D S SM S +A+ F L + Sbjct: 372 KWLEQKLMTYDTSGWVEEPP-KGPVICMLDTSHSMRGSKLRLAQIFIMTFAALSMLEKRD 430 Query: 285 KNVEVVYIRHHTQAKEVDEH-------EFFYSQE---TGGTIVSSALKLMDEVVKERYNP 334 ++ + KE + F+ + GGT + +K E+V++ Sbjct: 431 --FILLLFGAKGEIKEQPLYHKKPDWPAFYGLAQMAFGGGTHFDAPMKRAIELVEKEQAW 488 Query: 335 AQWNIYAAQASDGDNWADDSPLCHEILA 362 + +DG SP E L Sbjct: 489 RGADFV--MVTDG--IGGISPYVQEKLI 512 >UniRef50_B9TFA6 Putative uncharacterized protein n=1 Tax=Ricinus communis RepID=B9TFA6_RICCO Length = 451 Score = 52.5 bits (124), Expect = 3e-05, Method: Composition-based stats. Identities = 15/119 (12%), Positives = 31/119 (26%), Gaps = 14/119 (11%) Query: 243 PDPSSQAVMFCLMDVSGSMDQSTKDMAKRFYILLYLFLSRTYKNVEVVYIRHHTQAKEVD 302 M +MD SGS+ S ++ L F + + V + + Sbjct: 137 ETKRKDLDMIVVMDTSGSLSPSAANVRSSAITFLNQFNATRDRVGLVHFAFGAIVDDAIR 196 Query: 303 EHE--------FFYSQE---TGGTIVSSALKLMDEVVKE---RYNPAQWNIYAAQASDG 347 + + + +G T + + + + SDG Sbjct: 197 QTARGFDRASMTNHIKAYAFSGSTASAEGMYTARQQINSVPTANLNRSNMRVIVFFSDG 255 >UniRef50_B0ACH3 Putative uncharacterized protein n=1 Tax=Clostridium bartlettii DSM 16795 RepID=B0ACH3_9CLOT Length = 1508 Score = 52.5 bits (124), Expect = 3e-05, Method: Composition-based stats. Identities = 26/214 (12%), Positives = 58/214 (27%), Gaps = 51/214 (23%) Query: 251 MFCLMDVSGSMD------------QSTKDMAKRFYILLYLFL--------SRTYKNVEVV 290 + ++DVSGSMD + +AK L + V+ Sbjct: 403 VVLVIDVSGSMDWDVDGKQTTDNTKKRITIAKDSAKQFVNQLFANNEDGSKSNNRVSVVI 462 Query: 291 YIRH-----------HTQAKEVDEHEFFYSQ--ETGGTIVSSALKLMDEVVKERYNPAQW 337 + + K+ TGGT +A+ + ++V++ + + Sbjct: 463 FSSSGYTNGILCSLKNVDNKQTVIDAIDGISNNPTGGTDYDNAMTMAEQVLETVKDTTR- 521 Query: 338 NIYAAQASDGDNWADDSPLC--------------HEILAKKLLPVVRYYSYIEITRRAH- 382 N SDG + + + S+ + Sbjct: 522 NKAVLFMSDGAPENGYNGKTGYDIYPDAFKAHEKSSEIKNNYGATIYTVSFGLKGSQYKE 581 Query: 383 --QTLWREYEHLQSTFDNFAMQHIRDQDDIYPVF 414 + R+ + ++ ++D+ F Sbjct: 582 LTEDRCRQILRDYMASNENCYKNANSKEDLENAF 615 >UniRef50_A9V9D8 Predicted protein n=1 Tax=Monosiga brevicollis RepID=A9V9D8_MONBE Length = 415 Score = 52.5 bits (124), Expect = 3e-05, Method: Composition-based stats. Identities = 17/137 (12%), Positives = 32/137 (23%), Gaps = 25/137 (18%) Query: 241 KRPDPSSQAVMFCLMDVSGSMDQSTKDMAKRFYILLYLFLSRTYKNVEVVY--------- 291 + + + MD SGSM A+ + LY L + + Sbjct: 15 QNEQHEGRTHLTICMDCSGSMMGPKMTHAREGTLSLYANLHPGDTVELITFSSMVATAIP 74 Query: 292 -IRHHTQAKEVDEHEFFYSQETGGTIVSSALKLMDEVVKERYNPAQWN------------ 338 + + G T A+ E + + Sbjct: 75 RVLKDDSTDDRFAAAVQRMCARGSTAFYDAILKGLESLSRADALRGNDQKAKADQAERTV 134 Query: 339 ---IYAAQASDGDNWAD 352 +DG++ A Sbjct: 135 STKRVLVVVTDGEDTAS 151 >UniRef50_A7NQL3 von Willebrand factor type A n=2 Tax=Roseiflexus RepID=A7NQL3_ROSCS Length = 936 Score = 52.5 bits (124), Expect = 3e-05, Method: Composition-based stats. Identities = 25/187 (13%), Positives = 57/187 (30%), Gaps = 22/187 (11%) Query: 239 YEKRPDPSSQAVMFCLMDVSGSMDQ-----STKDMAKRFYILLYLFLSRTYKNVEV---- 289 E++ P + ++D+SGSM +A + L + + Sbjct: 402 PERQRQPP--VSIVVVIDISGSMAATEDGIPKLSLALEGARRIAALLRDEDELTVIPFDD 459 Query: 290 ---VYIRH-HTQAKEVDEHEFFYSQETG-GTIVSSALKLMDEVVKERYNPAQWNIYAAQA 344 V + ++V + + G G + AL++ + P + Sbjct: 460 RPGVIVGPLPGSRRDVAIEQLNQVRLGGSGINIHDALRVAARYTRASERPV---RHIITI 516 Query: 345 SDGDNWADDSPLCHEILAKKLLPVVRYYSYIEITRRAHQTLWREYEHLQSTFDNFAMQHI 404 +DG++ + + L + + I + H R+ + F + Sbjct: 517 TDGND--TTQQEGALDIVRSLHDEGVTLTSVAIGQGDHVPFIRDMAAVGGGR-TFLTERA 573 Query: 405 RDQDDIY 411 D D+ Sbjct: 574 ADVPDLL 580 Score = 47.1 bits (110), Expect = 0.001, Method: Composition-based stats. Identities = 20/108 (18%), Positives = 37/108 (34%), Gaps = 9/108 (8%) Query: 244 DPSSQAVMFCLMDVSGSMDQSTKDMAKRFYILLYLFLSRTYKNVEVVYIRHHTQAKEVDE 303 P L+D S S+ S + A+ F + + +V+ R + Sbjct: 61 QPPGITTTIFLLDGSDSVAASQRARAEAFIARALAVMPPDDRAGIIVFGREALVERFPAP 120 Query: 304 HEFFYSQE----TGGTIVSSALKLMDEVVKERYNPAQWNIYAAQASDG 347 F + T ++ AL+L ++ PA+ + SDG Sbjct: 121 ERTFGAPVTRPFGSATNIADALQLGLTLL-----PAEGHRRLVLLSDG 163 >UniRef50_C0Z8R6 Putative uncharacterized protein n=1 Tax=Brevibacillus brevis NBRC 100599 RepID=C0Z8R6_BREBN Length = 597 Score = 52.5 bits (124), Expect = 3e-05, Method: Composition-based stats. Identities = 27/200 (13%), Positives = 55/200 (27%), Gaps = 25/200 (12%) Query: 238 NYEKRPDPSSQAVMFCLMDVSGSMDQSTKDMAK-RFYILLYLFLSRTYKNVEVV------ 290 + + ++DVS SM QS K+ + S V VV Sbjct: 28 HPGFAQTSGNNMDAVLVVDVSNSMTQSDKNKVSNEAMKMFVDMTSIQANKVGVVAYTDKI 87 Query: 291 ------YIRHHTQAKEVDEHEFFYSQETGGTIVSSALKLMDEVVKERYNPAQWNIYAAQA 344 + + K + Q+ T ++ + +++ NP I A Sbjct: 88 EREKALLEINSEEDKNDIKAFIDSLQKGAYTDIAVGVTEAVKILDAGRNPNNAPIIVLLA 147 Query: 345 SDGDNW---------ADDSPLCHEILAKKLLPVVRYYSYI-EITRRAHQTLWREYEHLQS 394 DG+N+ A + + + Y+ + ++T ++ + Sbjct: 148 -DGNNFLNKASSRTQAKSDQELQQAVKEAKDKGYPVYTIGLNADGQLNRTTLQQIAAETN 206 Query: 395 TFDNFAMQHIRDQDDIYPVF 414 F I Sbjct: 207 GKF-FETSTADKLPQILSEI 225 >UniRef50_C1ZLB6 Putative uncharacterized protein n=1 Tax=Planctomyces limnophilus DSM 3776 RepID=C1ZLB6_PLALI Length = 365 Score = 52.5 bits (124), Expect = 3e-05, Method: Composition-based stats. Identities = 23/176 (13%), Positives = 47/176 (26%), Gaps = 29/176 (16%) Query: 244 DPSSQAVMFCLMDVSGSMDQST-KDMAKRFYILLYLFLSRTYKNVEVVYIRHHT------ 296 + + ++D SGSM +AK + L + + Y T Sbjct: 190 IKDQGSRVVFVIDCSGSMTNYNAMRVAKTALVSSLQALDTGQQFQIIFYNDSPTFLKGTS 249 Query: 297 ------------QAKEVDEHEFFYSQETGGTIVSSALKLMDEVVKERYNPAQWNIYAAQA 344 K + + Q GT ALKL +++P Sbjct: 250 RDGKASLWFATEINKTLATQQISAVQPDRGTQHLPALKLAL-----KFSPE----VIYFL 300 Query: 345 SDGDNWADDSPLCHEIL-AKKLLPVVRYYSYIEITRRAHQTLWREYEHLQSTFDNF 399 +D D S E++ + + + + + ++ + Sbjct: 301 TDADEPELTSIERKELIRLNQGRSRIHTIEFGQGPELKTENFLKKVARENGGSYRY 356 >UniRef50_P0A5D7 Uncharacterized protein Rv0959/MT0986 n=36 Tax=Actinomycetales RepID=Y959_MYCTU Length = 672 Score = 52.5 bits (124), Expect = 3e-05, Method: Composition-based stats. Identities = 47/292 (16%), Positives = 75/292 (25%), Gaps = 33/292 (11%) Query: 87 RIERPQGGGGGSGSGQGQAS-QDGEGQDEFVFQISKDEYLDLL--------FEDLALPNL 137 + RP GS G GEG ++ + L +D+ L L Sbjct: 293 QAARPGEDWTGSQQFSGDNPFGMGEGTQALADIAELEQLAEQLSQSYPGASMDDVDLDAL 352 Query: 138 KQNQQRQLTEYKTHRAGYTANGVPANISVVRS-----LQNSLARRTAMTAGKRRELHALE 192 + Q A V S L RR TA + Sbjct: 353 ARQLGDQAAVDARTLAELERALVNQGFLDRGSDGQWRLSPKAMRRLGETALRDVAQQLSG 412 Query: 193 ENLAIISNSEPAQLLEEERLRKEIAELRAKIERVPFIDTFDLRYKNYEKRPDP------- 245 + A R + LR Sbjct: 413 RHGERDHRRAGAAGELTGATRPWQFGDTEPWHVARTLTNAVLRQAAAVHDRIRITVEDVE 472 Query: 246 ------SSQAVMFCLMDVSGSMDQSTKDM-AKRFYILLYLFLSRTYKNVEVVYIRHHTQA 298 +QA + L+D S SM + + KR + L+ + +++ + I A Sbjct: 473 VAETETRTQAAVALLVDTSFSMVMENRWLPMKRTALALHHLVCTRFRSDALQIIAFGRYA 532 Query: 299 KEVDEHEFFYSQE--TGGTIVSSALKLMDEVVKERYNPAQWNIYAAQASDGD 348 + V E GT + AL L ++ A +DG+ Sbjct: 533 RTVTAAELTGLAGVYEQGTNLHHALALAGRHLRRH---AGAQPVVLVVTDGE 581 >UniRef50_UPI0001760236 PREDICTED: similar to mCG140660 n=1 Tax=Danio rerio RepID=UPI0001760236 Length = 1753 Score = 52.1 bits (123), Expect = 4e-05, Method: Composition-based stats. Identities = 31/185 (16%), Positives = 61/185 (32%), Gaps = 24/185 (12%) Query: 244 DPSSQAVMFCLMDVSGSMDQSTKDMAKRFYILLYLFLSRTYKNVEVVYIRHHTQAKEV-- 301 +++A ++ L+D SGS+ + K+F + +V + ++ ++A V Sbjct: 812 KQTAKADIYFLLDESGSISYPDFEDMKKFIMECLDVFQIGKDHVRIGVVKFASKATTVFR 871 Query: 302 ---------DEHEFFYSQE-TGGTIVSSALK----LMDEVVKERYNPAQWNIYAAQASDG 347 E + GGT L+ L E V+ R A+ +DG Sbjct: 872 LHDYSTKSDVEKAVKDLEMYGGGTRTDLGLRQMIPLFREAVQTRGEKARE--LLIVITDG 929 Query: 348 DNWADDSPLCHEILAKKLLPVVRYYSYIEITRRAHQTLWREYEHLQSTFDNFAMQHIRDQ 407 ++ P+ + V Y+ L + L D+ + + Sbjct: 930 ESTGTVEPVEVPAKHLRAEQNVSIYAIGCEG------LLADVVFLIDGSDSVSAEDFEKM 983 Query: 408 DDIYP 412 DI Sbjct: 984 KDIME 988 Score = 50.1 bits (118), Expect = 1e-04, Method: Composition-based stats. Identities = 29/171 (16%), Positives = 61/171 (35%), Gaps = 24/171 (14%) Query: 249 AVMFCLMDVSGSMDQSTKDMAKRFYILLYLFLSRTYKNVEVVYIRHHTQ----------- 297 A + L+D S S+ ++F L V + +++ Sbjct: 31 ADIVFLVDGSASIGLDNFQQIRQFLSSLVENFEVAPDKVRIGLVQYSDTPRTEFSLNTYQ 90 Query: 298 AKEVDEHEFFYSQ-ETGGTIVSSALKLMDEV--VKERYNPAQWNI--YAAQASDGDNWAD 352 KE + +TGGT L+ + + ++E + AQ N+ A +DGD Sbjct: 91 NKEEILDYIRNLRYKTGGTHTGQGLEFILKQHFIEEAGSRAQQNVPQIAIVITDGD---- 146 Query: 353 DSPLCHEILAKKLL-PVVRYYSYIEITRRAHQTLWREYEHLQSTFDNFAMQ 402 S ++ A++L ++ ++ A L R+ + +++ Sbjct: 147 -SQDEVDLQAQELRQRGIKIFAIGIK--DADVRLLRQIANEPYDQYVYSVS 194 Score = 44.4 bits (103), Expect = 0.008, Method: Composition-based stats. Identities = 23/163 (14%), Positives = 51/163 (31%), Gaps = 26/163 (15%) Query: 246 SSQAVMFCLMDVSGSMDQSTKDMAKRFYILLYLFLSRTYKNVEVVYIRHHTQ-------- 297 ++QA + L+D SGS+ + + K+F + V + + + Sbjct: 227 TAQADIVLLVDSSGSIGDNDFEEVKKFLHAFVDRFNLRPDLVRLGLAQFSDRPYQEFLLG 286 Query: 298 ---AKEVDEHEFFYSQET-GGTIVSSALKLMDEVVKERYNPAQWNI--YAAQASDGDNWA 351 K+ + GGT AL + E ++ A+ N+ A +DG+ + Sbjct: 287 DYADKKDLHQKLNNLIYRKGGTQTGQALTFIRE---NYFSLARPNVPGIAIVITDGE--S 341 Query: 352 DDSPLCHEILAKKLLPVVRYYSYIEITRRAHQTLWREYEHLQS 394 D + + + + + + Sbjct: 342 RDDVEEPAQRLRNTGVSLFVIRVGKGN-------MEKLRAIAN 377 >UniRef50_A8ULL3 Putative uncharacterized protein n=1 Tax=Flavobacteriales bacterium ALC-1 RepID=A8ULL3_9FLAO Length = 200 Score = 52.1 bits (123), Expect = 4e-05, Method: Composition-based stats. Identities = 15/103 (14%), Positives = 34/103 (33%), Gaps = 9/103 (8%) Query: 250 VMFCLMDV-SGSMDQSTKDMAKRFYILLYLFLSRTYKNVEVVYIRHHTQAKEVDEH---- 304 + L++ + + + K + K+ + LL ++ V Y + A + E Sbjct: 53 NITFLIETYANNFNTEDKVILKQAFKLLSKRVTEDDLISIVAYSNFNGVALKQAEATDVK 112 Query: 305 ----EFFYSQETGGTIVSSALKLMDEVVKERYNPAQWNIYAAQ 343 + + + T ++L E KE + N Sbjct: 113 KLLYAVEHLKSSVKTFEEDGIELAYEFTKENFIEESENSVVMI 155 >UniRef50_C3XJE4 Putative uncharacterized protein (Fragment) n=1 Tax=Helicobacter bilis ATCC 43879 RepID=C3XJE4_9HELI Length = 429 Score = 52.1 bits (123), Expect = 4e-05, Method: Composition-based stats. Identities = 25/153 (16%), Positives = 49/153 (32%), Gaps = 13/153 (8%) Query: 205 QLLEEERLRKEIAELRAKIERVPFIDTFDLRYKNYEKRP---DPSSQAVMFCLMDVSGSM 261 + + L + F+ + ++ ++ M +D S SM Sbjct: 280 PQELAMLKDENLELLFNLKYIQNRLFCFEKQGYETIQKEHYKMAKNEGAMIICVDTSSSM 339 Query: 262 DQSTKDMAKRFYILLYLFLSRTYKNVEVVYIRHHTQAKEVDEHE-------FFYSQETGG 314 + + +AK + L S + ++ + E+ + F GG Sbjct: 340 SGNREYLAKAITLFLATKASMQNRACYLINFSTDIETMELSGKDNARNLINFLAMSFNGG 399 Query: 315 TIVSSALKLMDEVVKERYNPAQWNIYAAQASDG 347 T V+ ALK + ++E I SDG Sbjct: 400 TDVAPALKEGLKKMQEDSFKQSDLIVI---SDG 429 >UniRef50_B0R5W4 Magnesium chelatase (Protoporphyrin IX magnesium-chelatase) n=2 Tax=Halobacterium salinarum RepID=B0R5W4_HALS3 Length = 690 Score = 52.1 bits (123), Expect = 4e-05, Method: Composition-based stats. Identities = 37/276 (13%), Positives = 66/276 (23%), Gaps = 35/276 (12%) Query: 80 DHFVQNDRIERPQGGGGGSGSGQGQASQDGEGQDEFVFQISKDEYLDLLFEDLALPNLKQ 139 + E P GG S S QA D E D+ D+ + + L + Sbjct: 374 EPDGDTGGAEAPTGGPDDSQSEGSQAPADSEAGDDPTDGERGDD--EDTEDAAPLVPGQP 431 Query: 140 NQQRQLTEYKTHRAGYTANGVPANISVVRSLQNSLARRTAMTAGKRRELHALEENLAIIS 199 + A R+ T + A + Sbjct: 432 RGDTVEPGTGVAPDADAPDAERATGDSGRAS---------ATPSPDARGARVRTERATPA 482 Query: 200 NSEPAQLLEEERLRKEIAELRAKIERVPFIDTFDLRYKNYEKRPDPSSQAVMFCLMDVSG 259 + A + + + DLR + S+ A++ +D S Sbjct: 483 DDVDAAASVRAAAARGSDAVES----------RDLR----QSINAGSASALVVFAVDASA 528 Query: 260 SMDQSTKDMAKRFYILLYLFLSRTYKNVEVVYIRHHTQAKEVDEHE-------FFYSQET 312 SM + LL + V + Sbjct: 529 SMRGPMRAAKGVALDLLRDAYQHRDEIAVVAFAGDDADVLVPPTDSVALAARHLKDLPTG 588 Query: 313 GGTIVSSALKLMDEVVKERYNPAQWNIYAAQASDGD 348 T + + L+ D+V+ A + +DG Sbjct: 589 DRTPLPAGLRAADDVLARADPDASVVVVV---TDGQ 621 >UniRef50_B7Q412 Putative uncharacterized protein n=1 Tax=Ixodes scapularis RepID=B7Q412_IXOSC Length = 1021 Score = 52.1 bits (123), Expect = 4e-05, Method: Composition-based stats. Identities = 32/198 (16%), Positives = 58/198 (29%), Gaps = 27/198 (13%) Query: 228 FIDTFDLRYKNYEKRPDPSSQAVMFCLMDVSGSMDQSTKDMAKRFYILLYLFLSRTYKNV 287 ++ + ++ R S+ + L+D SGS+ Q+ ++ F Sbjct: 28 YVQAEEPHGNVFDWRGIESNNTDLVFLLDRSGSVGQAGFEVETGFVHAFLKGFDVAPNTT 87 Query: 288 EVVYIRHHTQAKEVDEHEFFYSQE----------------TGGTIVSSALKLMDEVVKER 331 V I A V +F G T + L+ EV + Sbjct: 88 RVAVISFSEDA--VVHADFLKDPGNKCHLSRKMQGVHSANQGATNTGAGLQAAWEVFQRS 145 Query: 332 YNPAQWNIYAAQASDGDNWADDSPLCHEILAKKLLPVVRYYSYIEITRRAHQTLWREYEH 391 A+ +DG P+ K + + + + L + E Sbjct: 146 RPTAK--KLLILVTDGMATMGPDPVKKAEKLKNMGVDIFVFGIGRM-------LKQHLEQ 196 Query: 392 LQSTFDNFAMQHIRDQDD 409 L ST N + + DD Sbjct: 197 LASTPANVSDPYRGQPDD 214 >UniRef50_UPI0001AEDBB6 hypothetical protein SalbJ_01235 n=1 Tax=Streptomyces albus J1074 RepID=UPI0001AEDBB6 Length = 521 Score = 52.1 bits (123), Expect = 4e-05, Method: Composition-based stats. Identities = 22/121 (18%), Positives = 38/121 (31%), Gaps = 14/121 (11%) Query: 236 YKNYEKRPDPSSQAVMFCLMDVSGSMDQSTK-DMAKRFYILLYLFLSRTYKNVEVV---- 290 Y+ P AV+ D SGSM + D + LSR+ + Sbjct: 47 YRELGVADQPVDYAVLV---DTSGSMRTKGRYDTVRSTLRGFLGGLSRSDHVALITFDDR 103 Query: 291 ----YIRHHTQAKEVDEHEFFYSQETGGTIVSSALKLMDEVVKERYNPAQWNIYAAQASD 346 Y+ ++ GGT + +AL L ++ ++ +D Sbjct: 104 PEARYVGSAGDPGKIVGRLPKSPDPDGGTDIGAALDLALRELERSDAANVASVVML--TD 161 Query: 347 G 347 G Sbjct: 162 G 162 >UniRef50_C6PVI2 Vault protein inter-alpha-trypsin domain protein n=4 Tax=Clostridium RepID=C6PVI2_9CLOT Length = 531 Score = 52.1 bits (123), Expect = 4e-05, Method: Composition-based stats. Identities = 19/184 (10%), Positives = 49/184 (26%), Gaps = 23/184 (12%) Query: 251 MFCLMDVSGSMDQSTKDMAKRFYILLYLFLSRTYKNVEVV------------YIRHHTQA 298 L+D+S +M + AK + L + +++ Sbjct: 274 YVFLIDISDTMKGDKLEQAKNALQMCIRNLEEGDTFDIIAMGETLKYFWDEGMAEFNSET 333 Query: 299 KEVDEHEFFYSQETGGTIVSSALKLMDEVVKERYNPAQWNIYAAQASDGDNWADDSPLCH 358 + + A+K E N N + + + Sbjct: 334 LKKASQWIENLTTEDDADIFGAIKYSLE------NEGGHN-TILIFT---DDEVEEEDEI 383 Query: 359 EILAKKLLPVVRYYSYIEITRRAHQTLWREYEHLQSTFDNFAMQHIRDQDDIYPVFRELF 418 ++ L R +++ + + L + H F + R + + F+ + Sbjct: 384 LDYVRENLGDNRIFTFGFDSETNNYFLN-KLAHESFGKAEFINKGRRIEYVVLRQFKRIQ 442 Query: 419 HKQN 422 + + Sbjct: 443 NPEV 446 >UniRef50_Q1V424 Putative RTX toxin (Fragment) n=2 Tax=Vibrio alginolyticus 12G01 RepID=Q1V424_VIBAL Length = 3397 Score = 52.1 bits (123), Expect = 4e-05, Method: Composition-based stats. Identities = 24/144 (16%), Positives = 42/144 (29%), Gaps = 26/144 (18%) Query: 226 VPFIDTFDLRYKNYEKRPDPSSQAVMFCLMDVSGSMD---QSTKDMAKRFYILLYLFLSR 282 I D++ + ++D SGSM + K + L +++ Sbjct: 2886 DHDIIVGDVQGLEI----IAGQDYNIAFVLDTSGSMGNWVGTAKQEVLDVFDELLSAVNQ 2941 Query: 283 TYK--NVEVVYIRHHTQAKEVDEHEFFYSQE----------------TGGTIVSSALKLM 324 K V + + A V + +GGT + L+ Sbjct: 2942 GEKPGTVNIHLSEFASSASAVISVDLSSLTARKEFVEELNRVIDDEGSGGTNYEAGLQSA 3001 Query: 325 DEVVKERYNPAQWNIYAAQASDGD 348 E + NP NI +DG Sbjct: 3002 VEWFSSQPNPNGQNIT-YFVTDGQ 3024 >UniRef50_A0DJ91 Chromosome undetermined scaffold_52, whole genome shotgun sequence n=2 Tax=Paramecium tetraurelia RepID=A0DJ91_PARTE Length = 2934 Score = 52.1 bits (123), Expect = 4e-05, Method: Composition-based stats. Identities = 21/162 (12%), Positives = 53/162 (32%), Gaps = 14/162 (8%) Query: 223 IERVPFIDTFDLRYKNYEKRPDPSSQAVMFCLMDVSGSMDQSTKD-MAKRFYILLYLFLS 281 + + +++ + + ++D S SM+ + AK + + Sbjct: 2724 EQLKIMLKNPSIKF-VPKSELSGQPKMHYIFMIDDSWSMNFKNRWNSAKFGCLYCIGQIE 2782 Query: 282 RTYKNVEVVYIRHHTQAKEVDEHEFFYSQE--------TGGTIVSSALKLMDEVVKERYN 333 + + N +V I + +A+ V E E G T +A + +++ E Sbjct: 2783 KNF-NAKVSVIIFNYKARVVINCEKVNVAEMENKITISGGLTSFENAFQEAYKLIIEH-Q 2840 Query: 334 PAQWNIY-AAQASDGDN-WADDSPLCHEILAKKLLPVVRYYS 373 ++ +DG + S L ++ + + Sbjct: 2841 NDGFDRTEILFYTDGAGFYPKKSVKLFTNLEDRVKNQIYIHC 2882 >UniRef50_D1RCP1 von Willebrand factor type A domain protein n=6 Tax=Legionella RepID=D1RCP1_LEGLO Length = 342 Score = 52.1 bits (123), Expect = 4e-05, Method: Composition-based stats. Identities = 32/186 (17%), Positives = 53/186 (28%), Gaps = 33/186 (17%) Query: 229 IDTFDLRYKNYEKRPDPSSQ--AVMFCLMDVSGSMDQSTKDM----------AKRFYIL- 275 + F L + P P S+ + +D+SGSM+ + K Sbjct: 68 LLVFALAGPRWVGAPKPVSREGYNIMMALDLSGSMEIPDMILHGRPTSRLNIVKSAAEQF 127 Query: 276 ----------LYLFLSRTYKNVEVVYIRHHTQAKEVDEHEFFYSQETGGTIVSSALKLMD 325 L LF +R Y + Y RH + E T + A+ L Sbjct: 128 VRERSGDKIGLILFGTRAYLQTPLTYDRHSILLR--LEDATAGL-AGKTTSIGDAVGLAV 184 Query: 326 EVVKERYNPAQWNIYAAQASDG-DNWADDSPLCHEILAKKLLPVVRYYSYIEITRR---A 381 + + +DG +N +PL LAK+ + Sbjct: 185 KRLDSAPKKG---RVIILLTDGANNSGVLAPLKAAELAKEEGIKIYTIGLGSEGDSRALV 241 Query: 382 HQTLWR 387 L + Sbjct: 242 GDFLMQ 247 >UniRef50_Q4V1D8 Putative uncharacterized protein dadA n=8 Tax=Bacillus cereus group RepID=Q4V1D8_BACCZ Length = 452 Score = 52.1 bits (123), Expect = 4e-05, Method: Composition-based stats. Identities = 35/238 (14%), Positives = 69/238 (28%), Gaps = 34/238 (14%) Query: 213 RKEIAELRAKIERVPFIDTFDLRYKNYEKRPDPSSQAVMF-CLMDVSGSMDQS-----TK 266 + E +K+ER+ + + P +++ L+D SGSM Sbjct: 115 AQSYKEAVSKLERIIPELAIKEQNNDNVHSIKPKEKSLNVEILLDASGSMAGKVNGQVKM 174 Query: 267 DMAKRFYILLYLFLSRTYKN----------------------VEVVYIRHHTQAKEVDEH 304 + AK+ + EV+Y + KE Sbjct: 175 EAAKKAIYNYLDKIPDNANVMLRVYGHKGSNNENDKSLSCGSSEVMYPLQPYK-KEQFNA 233 Query: 305 EFFYSQETGGTIVSSALKLMDEVVKERYNPAQWNIYAAQASDGDNW-ADDSPLCHEILAK 363 G T ++SA++ +++ KE N+ SDG+ D + L + Sbjct: 234 ALSKFGPKGWTPLASAIESVNDDFKEYTGEENLNVV-YIVSDGEETCGGDPVNAAKNLNQ 292 Query: 364 KLL-PVVRYYSYIEITRRAHQTLWREYEHLQSTFDNFAMQHIRDQDDIYPVFRELFHK 420 VV + ++ Q L + + +L+ + Sbjct: 293 SSTHAVVNIIGF-DVKNSEQQQLMNT-AEAGKGNYATVSNADELYQTLNTEYEKLYKE 348 >UniRef50_C6WL71 von Willebrand factor type A n=1 Tax=Actinosynnema mirum DSM 43827 RepID=C6WL71_ACTMD Length = 559 Score = 52.1 bits (123), Expect = 4e-05, Method: Composition-based stats. Identities = 47/319 (14%), Positives = 81/319 (25%), Gaps = 53/319 (16%) Query: 102 QGQASQDGEGQDEFVFQISKDEYLDLLFEDLALPN-LKQNQQRQLTEYKTHRAGYTANGV 160 A +G + +S L LP L + + Sbjct: 250 GEDAGGPVDGLITYEASLS------SLNSGGELPEPLVPLYPSDGVVSADYPLAVLSGAD 303 Query: 161 PANISVVRSLQNSLARRTAMTAGKRRELHALEENLAIISNSEPAQLLEEERLRKEIAELR 220 R L + L R ++ + + PA + + Sbjct: 304 ERARDAHRRLADHLVR------------AEVQREIVAKTGRRPAVAGLDLPAGARSGLVE 351 Query: 221 AKIERVPFIDTFDLRYKNYEKRPDPSSQAVMFCLMDVSGSMDQSTKDMAKRFYILLY--- 277 + L Y++ PS ++DVSGSM+ KR L Sbjct: 352 IPFPDSRAVVGALLDAY-YDELRRPSRT---VYVLDVSGSMEGDRMAQLKRALSRLTGSD 407 Query: 278 LFLSRTY----KNVEVVY----------------IRHHTQAKEVDEHEFFYSQETGGTIV 317 L+ Y EVV + + E G T V Sbjct: 408 ESLTGQYCRFRSREEVVLLPFNQAPLAPQEFSVDVGAPRETLERIRGAVEGLVAGGDTAV 467 Query: 318 SSALKLMDEVVKERYNPAQWNIYAAQASDGDNWADDSPLCHEILAKKLLPVVRYYSYIEI 377 +L+ VV +P ++ +DG+N + + A+ V + E Sbjct: 468 YDSLERAYGVVG--SSPERFTSVVLM-TDGENRVGRTFAEYREFARGKAVPVFPIVFGEA 524 Query: 378 TRRAHQTLWREYEHLQSTF 396 +R E + Sbjct: 525 SR----AKMGEIAEITGGA 539 >UniRef50_C4N894 Complement factor B-like protein n=1 Tax=Venerupis decussatus RepID=C4N894_9BIVA Length = 697 Score = 52.1 bits (123), Expect = 4e-05, Method: Composition-based stats. Identities = 26/178 (14%), Positives = 55/178 (30%), Gaps = 38/178 (21%) Query: 247 SQAVMFCLMDVSGSMDQSTKDMAKRFYILLYLFLSRTYKNV-----------------EV 289 S + L+DVS S+ + + AK+F LL + + ++ Sbjct: 193 SGLDVVLLVDVSSSIGDRSMESAKKFMKLLVDIFGVSNETSGGKNGTRFALLTFSNEADI 252 Query: 290 VYIRHHTQAK--EVDEHEFFYSQ-ETGGTIVSSALKLM-----DEVVKERYNPAQWN--- 338 V+ + A+ E + Q GGT +AL + V+K+ + N Sbjct: 253 VFNLNDGTARSKEEVKRRIDEIQNTGGGTNFRAALLKVVGGIFFNVIKKES--QRLNHAT 310 Query: 339 IYAAQASDGDNWADDSPLCHEILAKKLLPVVR------YYSYIEITRRAHQTLWREYE 390 +D + ++ ++ + + + +T E Sbjct: 311 RAVFLLTDAE-ETSTLEKDRLPRIRQAANDLKNEGHFEIFCIG-VGQNIDETTLAEIA 366 >UniRef50_B9X084 Complement factor B n=5 Tax=Eumetazoa RepID=B9X084_NEMVE Length = 858 Score = 52.1 bits (123), Expect = 4e-05, Method: Composition-based stats. Identities = 34/233 (14%), Positives = 61/233 (26%), Gaps = 23/233 (9%) Query: 172 NSLARRTAMTAGKRRELHALEENLAIISNSEPAQLLEEERLRKEIAELRAKIERVPFIDT 231 ++ R A E + + +I + + + + + + + Sbjct: 317 SAKRRCQANGKWTGSEAKCMADFEYMIKDVNSTAYQLKRNIDTMLEYTCSGMNSTCNLTE 376 Query: 232 FDLRYKNYEKRPDPSSQAVMFCLMDVSGSMDQSTKDMAKRFYILLYLFLSRTYK-----N 286 D+R + E V D S S+ + F I L L ++K Sbjct: 377 VDMRARAIELNEAGGLDVVFVF--DASSSIKMDDFRLGLDFSIELVKLLGTSWKPGGTHV 434 Query: 287 VEVVYIRHHT-----------QAKEVDEHEFFYSQETGGTIVSSALKLMDEVVKERYNPA 335 + Y AK V + GGT AL V + Sbjct: 435 AAITYGTESHLEFNLGDAGALTAKSVIAKIGKIKRSGGGTASRLALDTTIRQVVP-FTRE 493 Query: 336 QWNIYAAQASDGD-NWADDSPLCHEILAKKLLPVVRYYSYIEITRRAHQTLWR 387 +DG N +IL K + Y+ + + L Sbjct: 494 GSQKALFFITDGHSNIGGSPRKAAKILKDK---GFQIYAIGVGKKVRRRELME 543 >UniRef50_A5UXM2 von Willebrand factor, type A n=1 Tax=Roseiflexus sp. RS-1 RepID=A5UXM2_ROSS1 Length = 774 Score = 52.1 bits (123), Expect = 4e-05, Method: Composition-based stats. Identities = 13/106 (12%), Positives = 21/106 (19%), Gaps = 5/106 (4%) Query: 297 QAKEVDEHEFFYSQETGGTIVSSALKLMDEVVKERYNPAQWNIYAAQASDGDNWADDSPL 356 + + G T + L+ ++ SDG Sbjct: 369 DVRARAQAAIDTLNSRGATSIGGGLQSSQRMLDTANP--DLPRVIILLSDGQENTRPFVA 426 Query: 357 CHEILAKKLLPVVRYYSYIEITRRAHQTLWREYEHLQSTFDNFAMQ 402 + V R A Q L N+A Sbjct: 427 DVLPQIRAAQTTVHTIGLG---RDADQQLMLSIAAQTGGTYNYAPT 469 >UniRef50_UPI000180C3F0 PREDICTED: similar to Vwa1 protein n=1 Tax=Ciona intestinalis RepID=UPI000180C3F0 Length = 1059 Score = 52.1 bits (123), Expect = 4e-05, Method: Composition-based stats. Identities = 27/196 (13%), Positives = 56/196 (28%), Gaps = 31/196 (15%) Query: 251 MFCLMDVSGSMDQSTKDMAKRFYILLYLFL---------------SRTYKNVEVVYIRHH 295 M ++D S S+ D+ K F + L R + +++ H Sbjct: 612 MIFILDSSSSVGSDNWDVMKNFVRDVINLLSITETGTRVSIFRYNRRPDRQSQILLNDHI 671 Query: 296 TQAKEVDEHEFFYSQETG-GTIVSSALKLMDEVVKERYNPAQWNI--YAAQASDG---DN 349 K+ G GT + +AL + + + N + ++ +DG D+ Sbjct: 672 GD-KQGLLSALDQMPYNGRGTWIGNALNHAKDFILQLRNGDRPDVIDVVLTITDGRSKDD 730 Query: 350 WADDSPLCHEILAKKLLPVVRYYSYIEITRRAHQTLWREYEHLQSTFDNFAMQHIRDQDD 409 + S + Y+ I E + + D + + Sbjct: 731 VSVVSEELRKQ-------GALTYAVGVIPPNGKGPREEELLKIAGSTDRVKITTSFEG-- 781 Query: 410 IYPVFRELFHKQNATA 425 F +L + Sbjct: 782 TKDHFEKLMSDDLCGS 797 >UniRef50_UPI0000ECD6E7 Poly [ADP-ribose] polymerase 4 (EC 2.4.2.30) (PARP-4) (Vault poly(ADP- ribose) polymerase) (VPARP) (193 kDa vault protein) (PARP- related/IalphaI-related H5/proline-rich) (PH5P). n=5 Tax=Tetrapoda RepID=UPI0000ECD6E7 Length = 1691 Score = 52.1 bits (123), Expect = 4e-05, Method: Composition-based stats. Identities = 22/179 (12%), Positives = 52/179 (29%), Gaps = 14/179 (7%) Query: 251 MFCLMDVSGSMDQSTKDMAKRFYILLYLFLSRTYKNVEVVYIRHHTQAKEVDEH---EFF 307 + L+D S SM S AK+ + S + + + ++ ++ + Sbjct: 888 IIILLDCSNSMAGSALLQAKQIALHALKQFSSRQNVNLIKFGTNFSEFSSFSKNTSKDLA 947 Query: 308 YSQE----TGGTIVSSALKLMDEVVKERYNPAQWNIYAAQASDGDNWADDSPLCHEILAK 363 E T+ ++ L + + SDG + L K Sbjct: 948 SLTEFITSATATMGNTDLWKTLRYLSLLFPSQGH-RNILLISDG---HIQNESVTFQLVK 1003 Query: 364 KLLPVVRYYSYIEITRRAHQTLWREYEHLQSTFDNFAMQHIRDQDD--IYPVFRELFHK 420 + R ++ + A++ + R + + + + I +F Sbjct: 1004 DNVHHTRLFTCG-VGSTANRHMLRSLSQYGAGAFEYFDSKSKYNWEAKIQNQVSRIFSP 1061 >UniRef50_Q7JMF9 Protein T24F1.6b, partially confirmed by transcript evidence n=4 Tax=Caenorhabditis RepID=Q7JMF9_CAEEL Length = 1067 Score = 52.1 bits (123), Expect = 4e-05, Method: Composition-based stats. Identities = 34/227 (14%), Positives = 69/227 (30%), Gaps = 31/227 (13%) Query: 229 IDTFDLRYKNYEKRPDPSSQAVMFCLMDVSGSMDQSTKDMAKRFYILLYLFLSRTYKNVE 288 ID FD R++ + S + L+D SGS+ T + K + + LS Sbjct: 214 IDLFDPRFRPW-FVNAESVPKDIVFLLDYSGSVKGPTMHLIKITMMYILSTLSPNDYFFG 272 Query: 289 VVYIRH---------------HTQAKEVDEHEFFYSQETGGTIVSSALKLMDEVVKERYN 333 V + H T K+V E +E ++ LK +V++ + Sbjct: 273 VYFNNHFNPIISCANRTFMPATTSNKKVFFEELGMLEEKDQAHFATPLKFSLDVLRGNLD 332 Query: 334 PA---------QWNIYAAQASDG-DNWADDSPLCHEILAKKLLPVVRYYSYIEITRRAHQ 383 + + +DG D W + E + ++R + + + Sbjct: 333 SNQSLFADYRSEGHKLLIIFTDGVDEWPH--QILDEEFQTRNSELIRIFGFSMGYGTSLL 390 Query: 384 TLWREYEHLQSTFDNFAMQHIRDQDD---IYPVFRELFHKQNATAKG 427 L + + + + I V ++ + Sbjct: 391 PLQQYMACKSHGGYSEIDSIMDVKPQSRTIQNVLSQVRGDELKGTNA 437 >UniRef50_UPI0000E80A5E PREDICTED: similar to calcium-activated chloride channel n=2 Tax=Gallus gallus RepID=UPI0000E80A5E Length = 928 Score = 52.1 bits (123), Expect = 4e-05, Method: Composition-based stats. Identities = 18/134 (13%), Positives = 45/134 (33%), Gaps = 16/134 (11%) Query: 254 LMDVSGSMDQSTKDM-AKRFYIL-LYLFLSRTYKNVEVVY---IRHHTQAKEVDEHEFFY 308 ++DVSGSM+ + + + + L + + V + + ++ Sbjct: 311 VLDVSGSMNTNNRITNLRTAAEVFLIQIIEIGSRVGIVTFESSAYEKSPLLQITSVATRQ 370 Query: 309 S-------QETGGTIVSSALKLMDEVVKERYNPAQWNIYAAQASDGDNWADDSPLCHEIL 361 GGT + + ++ E++ + +DG++ LC E Sbjct: 371 RLVQNLPTTAGGGTKICAGIEKGLEIITNAIGTTYGSEIVL-LTDGEDSTMS--LCREK- 426 Query: 362 AKKLLPVVRYYSYI 375 K+ ++ + Sbjct: 427 VKESGAIIHTIALG 440 >UniRef50_Q2SCZ7 Uncharacterized protein containing a von Willebrand factor type A (VWA) domain n=2 Tax=Gammaproteobacteria RepID=Q2SCZ7_HAHCH Length = 345 Score = 52.1 bits (123), Expect = 4e-05, Method: Composition-based stats. Identities = 26/203 (12%), Positives = 56/203 (27%), Gaps = 42/203 (20%) Query: 251 MFCLMDVSGSMDQSTKDM----------AKRFYILLYLFLSRTYKNVEVVYIRHH----- 295 + +D+S SM ++ + K + + + +++ Sbjct: 96 LLLAVDISPSMQETDLQLKGNQATRLDVVKSVVTDFI-QVRQGDRLGLILFGAQPYIQAP 154 Query: 296 -----TQAKEVDEHEFFYSQETGGTIVSSALKLMDEVVKERYNPAQWNIYAAQASDGDNW 350 E+ T + A+ L + ++ER PA + +DG N Sbjct: 155 LTYDLVTVGELLNEATLGI-AGNATAIGDAIGLGIKRLRER--PAD-SRVLVLLTDGANT 210 Query: 351 ADD-SPLCHEILAKKLLPVVRYYS-----------YIEITRRA----HQTLWREYEHLQS 394 + SP LA + + +TL + Sbjct: 211 GGEVSPEQAAKLAADAGIKIYTVGVGADEIIRRGIFGYRKENPSADLDETLLQSIADETD 270 Query: 395 TFDNFAMQHIRDQDDIYPVFREL 417 F ++ + + IY +L Sbjct: 271 GQY-FRARNTGELELIYESINQL 292 Searching..................................................done Results from round 3 Score E Sequences producing significant alignments: (bits) Value Sequences used in model and found again: UniRef50_Q1C7U5 UPF0229 protein YPA_1510 n=195 Tax=Proteobacteri... 291 4e-77 UniRef50_Q1QZ69 UPF0229 protein Csal_0882 n=164 Tax=Proteobacter... 254 6e-66 UniRef50_Q1QPM0 UPF0229 protein Nham_0975 n=24 Tax=Proteobacteri... 253 9e-66 UniRef50_B6B8L1 Putative uncharacterized protein n=2 Tax=Rhodoba... 253 9e-66 UniRef50_P59348 UPF0229 protein bll6755 n=25 Tax=Alphaproteobact... 250 1e-64 UniRef50_A9I2A9 Putative uncharacterized protein n=6 Tax=Burkhol... 245 2e-63 UniRef50_B7H9V9 UPF0229 protein BCB4264_A0587 n=93 Tax=Bacteria ... 242 1e-62 UniRef50_B9DZE4 UPF0229 protein CKR_0568 n=26 Tax=Clostridiales ... 242 2e-62 UniRef50_A7Z2R4 UPF0229 protein RBAM_009260 n=29 Tax=Firmicutes ... 223 7e-57 UniRef50_B9L510 Sporulation protein YhbH n=1 Tax=Thermomicrobium... 222 2e-56 UniRef50_C6M483 von Willebrand factor type A domain protein n=1 ... 214 5e-54 UniRef50_Q3J885 Putative uncharacterized protein n=2 Tax=Nitroso... 213 1e-53 UniRef50_B1KPQ5 von Willebrand factor type A n=8 Tax=Gammaproteo... 212 3e-53 UniRef50_Q896G6 Conserved protein n=1 Tax=Clostridium tetani Rep... 208 4e-52 UniRef50_C7PNZ7 von Willebrand factor type A n=5 Tax=Bacteroidet... 207 8e-52 UniRef50_A4C5H3 Von Willebrand factor type A domain protein n=1 ... 206 1e-51 UniRef50_A5WCP1 von Willebrand factor, type A n=2 Tax=Moraxellac... 205 2e-51 UniRef50_C0CVB5 Putative uncharacterized protein n=1 Tax=Clostri... 205 4e-51 UniRef50_B8F8Z5 von Willebrand factor type A n=1 Tax=Desulfatiba... 204 5e-51 UniRef50_B0T5X0 von Willebrand factor type A n=1 Tax=Caulobacter... 202 2e-50 UniRef50_UPI0001C375AE von Willebrand factor type A n=1 Tax=Rumi... 201 4e-50 UniRef50_B0CCM8 von Willebrand factor, type A domain protein n=4... 200 9e-50 UniRef50_A4BJU7 Putative uncharacterized protein n=1 Tax=Reineke... 199 2e-49 UniRef50_Q4KKB4 von Willebrand factor type A domain protein n=20... 198 2e-49 UniRef50_B5EPS6 Putative uncharacterized protein n=2 Tax=Acidith... 198 2e-49 UniRef50_C8SEV7 von Willebrand factor type A n=4 Tax=Rhizobiales... 198 3e-49 UniRef50_C6VVX3 von Willebrand factor type A n=17 Tax=Bacteria R... 198 4e-49 UniRef50_A3D1E9 von Willebrand factor, type A n=8 Tax=Shewanella... 197 5e-49 UniRef50_C0YQB8 von Willebrand factor, type A n=2 Tax=Flavobacte... 197 7e-49 UniRef50_A3ZT14 Putative uncharacterized protein n=1 Tax=Blastop... 196 1e-48 UniRef50_B4DA43 von Willebrand factor type A n=1 Tax=Chthoniobac... 196 1e-48 UniRef50_A1ZHE2 Von Willebrand factor type A domain protein n=1 ... 195 2e-48 UniRef50_C9RD74 Putative uncharacterized protein n=1 Tax=Ammonif... 194 4e-48 UniRef50_C5BTM6 von Willebrand factor type A domain protein n=1 ... 194 4e-48 UniRef50_C8N8N5 von Willebrand factor type A domain protein n=2 ... 193 1e-47 UniRef50_P76481 Uncharacterized protein yfbK n=38 Tax=Enterobact... 193 1e-47 UniRef50_A9FJ88 Uncharacterized conserved protein involved in st... 193 1e-47 UniRef50_A8SXC8 Putative uncharacterized protein n=2 Tax=Clostri... 191 5e-47 UniRef50_Q2N8R4 Von Willebrand factor type A domain protein n=2 ... 191 5e-47 UniRef50_A9KS19 von Willebrand factor type A n=1 Tax=Clostridium... 190 8e-47 UniRef50_A5F9T1 von Willebrand factor, type A n=9 Tax=Bacteria R... 190 9e-47 UniRef50_A3PN61 von Willebrand factor, type A n=9 Tax=Rhodobacte... 189 1e-46 UniRef50_C6BAR1 von Willebrand factor type A n=7 Tax=Alphaproteo... 188 2e-46 UniRef50_A1ZU67 Von Willebrand factor type A domain protein n=1 ... 188 3e-46 UniRef50_Q67Q87 Putative uncharacterized protein n=1 Tax=Symbiob... 187 5e-46 UniRef50_Q7UP85 Putative uncharacterized protein n=1 Tax=Rhodopi... 187 8e-46 UniRef50_Q21MJ3 von Willebrand factor, type A n=5 Tax=Proteobact... 185 3e-45 UniRef50_A9IU52 Putative lipoprotein n=3 Tax=Bordetella RepID=A9... 184 4e-45 UniRef50_A9DBX8 Putative uncharacterized protein n=1 Tax=Hoeflea... 184 4e-45 UniRef50_UPI00016C54C8 hypothetical protein GobsU_21830 n=1 Tax=... 183 8e-45 UniRef50_C0Z5D5 Putative uncharacterized protein n=1 Tax=Breviba... 183 2e-44 UniRef50_A0NVX5 Putative uncharacterized protein n=1 Tax=Labrenz... 182 2e-44 UniRef50_C7N770 Uncharacterized protein containing a von Willebr... 182 2e-44 UniRef50_UPI000185CB41 protein containing von Willebrand factor ... 181 4e-44 UniRef50_A4BNQ7 Putative uncharacterized protein n=1 Tax=Nitroco... 180 7e-44 UniRef50_UPI00016C05A6 hypothetical protein Epulo_00085 n=1 Tax=... 178 3e-43 UniRef50_A9AWD1 von Willebrand factor type A n=1 Tax=Herpetosiph... 177 1e-42 UniRef50_B1ZYN3 von Willebrand factor type A n=2 Tax=Bacteria Re... 176 1e-42 UniRef50_A6GHE4 von Willebrand factor, type A n=1 Tax=Plesiocyst... 175 2e-42 UniRef50_B4WCU1 von Willebrand factor type A domain protein n=2 ... 172 2e-41 UniRef50_A9NFD9 Surface-anchored VWFA domain protein n=1 Tax=Ach... 172 2e-41 UniRef50_A9GVG2 Putative uncharacterized protein n=1 Tax=Sorangi... 171 4e-41 UniRef50_Q28U54 von Willebrand factor type A n=3 Tax=Rhodobacter... 170 8e-41 UniRef50_Q1CVN5 von Willebrand factor type A domain protein n=2 ... 169 2e-40 UniRef50_C1XT76 Uncharacterized conserved protein n=1 Tax=Meioth... 168 3e-40 UniRef50_C8WPE6 von Willebrand factor type A n=1 Tax=Eggerthella... 167 6e-40 UniRef50_B0UJ22 von Willebrand factor type A n=1 Tax=Methylobact... 166 9e-40 UniRef50_C7PR69 von Willebrand factor type A n=1 Tax=Chitinophag... 166 1e-39 UniRef50_D0XSR4 von Willebrand factor type A n=1 Tax=Brevundimon... 165 2e-39 UniRef50_C1AC65 Putative uncharacterized protein n=1 Tax=Gemmati... 165 2e-39 UniRef50_Q1D6F9 von Willebrand factor type A domain protein n=2 ... 155 3e-36 UniRef50_A3TQT5 Putative secreted protein n=1 Tax=Janibacter sp.... 154 5e-36 UniRef50_Q72KI5 Glycosyltransferase n=3 Tax=Thermus RepID=Q72KI5... 154 6e-36 UniRef50_B4D1N7 Autotransporter-associated beta strand repeat pr... 150 1e-34 UniRef50_B5JNR2 von Willebrand factor type A domain protein n=1 ... 143 2e-32 UniRef50_Q9HRD6 UPF0229 protein VNG_0746C n=11 Tax=Halobacteriac... 141 6e-32 UniRef50_C1RGW7 Uncharacterized protein containing a von Willebr... 141 7e-32 UniRef50_Q1GJ99 Putative uncharacterized protein n=1 Tax=Ruegeri... 140 1e-31 UniRef50_UPI0001AEB92D inter-alpha-trypsin inhibitor domain-cont... 135 3e-30 UniRef50_Q10RY0 Os03g0142500 protein n=2 Tax=Oryza sativa RepID=... 134 8e-30 UniRef50_D2BAS2 von Willebrand factor type A domain protein n=12... 133 9e-30 UniRef50_C5CDX4 von Willebrand factor type A n=1 Tax=Kosmotoga o... 133 1e-29 UniRef50_C7PTX8 von Willebrand factor type A n=1 Tax=Chitinophag... 133 2e-29 UniRef50_A7C0I1 von Willebrand factor type A domain protein n=5 ... 131 6e-29 UniRef50_A8H5J9 LPXTG-motif cell wall anchor domain n=1 Tax=Shew... 130 1e-28 UniRef50_B4X134 Vault protein inter-alpha-trypsin n=1 Tax=Alcani... 130 1e-28 UniRef50_Q22SJ4 von Willebrand factor type A domain containing p... 129 2e-28 UniRef50_UPI00006CDDCC von Willebrand factor type A domain conta... 129 3e-28 UniRef50_A4C730 Putative uncharacterized protein n=1 Tax=Pseudoa... 129 3e-28 UniRef50_C5WYU9 Putative uncharacterized protein Sb01g047460 n=3... 128 3e-28 UniRef50_Q9M1S2 Putative uncharacterized protein T5N23_140 n=5 T... 127 1e-27 UniRef50_B5JSY7 Inter-alpha-trypsin inhibitor domain protein n=1... 126 1e-27 UniRef50_B7RYC9 Vault protein inter-alpha-trypsin n=1 Tax=marine... 126 1e-27 UniRef50_A0MN74 Conserved bacterial protein n=1 Tax=Thermus phag... 126 1e-27 UniRef50_B9XQ17 Vault protein inter-alpha-trypsin domain protein... 126 2e-27 UniRef50_Q083T9 Vault protein inter-alpha-trypsin domain protein... 125 3e-27 UniRef50_Q1YZ74 Inter-alpha-trypsin inhibitor domain protein n=1... 125 3e-27 UniRef50_Q15NW6 Vault protein inter-alpha-trypsin n=1 Tax=Pseudo... 125 4e-27 UniRef50_A8FW78 Vault protein inter-alpha-trypsin domain protein... 124 6e-27 UniRef50_C5EGH1 von Willebrand factor n=2 Tax=Clostridiales RepI... 124 9e-27 UniRef50_A6WMD3 Vault protein inter-alpha-trypsin domain protein... 122 2e-26 UniRef50_Q14624 35 kDa inter-alpha-trypsin inhibitor heavy chain... 122 3e-26 UniRef50_Q3IHK0 Putative uncharacterized protein n=2 Tax=Alterom... 122 3e-26 UniRef50_A6DLI7 von Willebrand factor type A domain protein n=1 ... 122 3e-26 UniRef50_Q9U7P4 Putative uncharacterized protein (Fragment) n=1 ... 121 3e-26 UniRef50_B8CPU4 Von Willebrand factor, type A n=1 Tax=Shewanella... 121 5e-26 UniRef50_Q9FF49 Retroelement pol polyprotein-like n=34 Tax=Magno... 121 5e-26 UniRef50_D1HHA4 Whole genome shotgun sequence of line PN40024, s... 121 6e-26 UniRef50_C4WI90 Poly [ADP-ribose] polymerase 4 n=1 Tax=Ochrobact... 121 7e-26 UniRef50_Q235T9 von Willebrand factor type A domain containing p... 121 7e-26 UniRef50_Q9LN03 T6D22.13 n=2 Tax=Arabidopsis thaliana RepID=Q9LN... 120 8e-26 UniRef50_Q6UXX5 Inter-alpha-trypsin inhibitor heavy chain H5-lik... 120 9e-26 UniRef50_P79263 Inter-alpha-trypsin inhibitor heavy chain H4 n=3... 120 1e-25 UniRef50_A8U1E2 Putative uncharacterized protein n=1 Tax=alpha p... 120 1e-25 UniRef50_C3YPR2 Putative uncharacterized protein (Fragment) n=1 ... 119 1e-25 UniRef50_Q86UX2 Inter-alpha-trypsin inhibitor heavy chain H5 n=4... 119 1e-25 UniRef50_B9MS10 von Willebrand factor type A n=2 Tax=Anaerocellu... 119 1e-25 UniRef50_A7HVH6 Vault protein inter-alpha-trypsin domain protein... 119 1e-25 UniRef50_UPI000180CCF8 PREDICTED: similar to PK-120 n=1 Tax=Cion... 119 2e-25 UniRef50_B4VT64 Vault protein inter-alpha-trypsin n=9 Tax=Cyanob... 119 3e-25 UniRef50_Q231J4 von Willebrand factor type A domain containing p... 118 3e-25 UniRef50_UPI00016C377F protein containing a von Willebrand facto... 118 3e-25 UniRef50_A4J6Q3 von Willebrand factor, type A n=1 Tax=Desulfotom... 118 4e-25 UniRef50_B0TQ23 LPXTG-motif cell wall anchor domain n=1 Tax=Shew... 118 4e-25 UniRef50_C1XFF8 Mg-chelatase subunit ChlD n=1 Tax=Meiothermus ru... 118 6e-25 UniRef50_Q21PJ3 von Willebrand factor, type A n=1 Tax=Saccharoph... 118 6e-25 UniRef50_UPI0000F2DDBB PREDICTED: similar to Inter-alpha (globul... 117 7e-25 UniRef50_B1KDL1 LPXTG-motif cell wall anchor domain protein n=1 ... 117 7e-25 UniRef50_D2R2I7 von Willebrand factor type A n=2 Tax=Planctomyce... 117 8e-25 UniRef50_A6X8G3 LPXTG-motif cell wall anchor domain protein n=11... 117 8e-25 UniRef50_A9G8C3 Putative uncharacterized protein n=2 Tax=Sorangi... 117 1e-24 UniRef50_A3W9L9 Putative uncharacterized protein n=2 Tax=Erythro... 116 1e-24 UniRef50_UPI000155CC23 PREDICTED: similar to ITI-like protein n=... 116 1e-24 UniRef50_B9GK57 Predicted protein (Fragment) n=2 Tax=Populus tri... 116 1e-24 UniRef50_Q2SQR4 Uncharacterized protein containing a von Willebr... 116 2e-24 UniRef50_B8HNU4 von Willebrand factor type A n=4 Tax=Chroococcal... 116 2e-24 UniRef50_D2VKS7 von Willebrand factor type A domain-containing p... 116 2e-24 UniRef50_Q9NY47 Voltage-dependent calcium channel subunit delta-... 116 2e-24 UniRef50_Q7Z3S7 Voltage-dependent calcium channel subunit delta-... 116 2e-24 UniRef50_UPI00006A1915 Transmembrane protein 110. n=2 Tax=Xenopu... 116 2e-24 UniRef50_A4YGU7 von Willebrand factor, type A n=12 Tax=Sulfoloba... 116 2e-24 UniRef50_A0P2E4 Von Willebrand factor type A like domain n=1 Tax... 115 3e-24 UniRef50_D1KBY4 Putative uncharacterized protein n=2 Tax=Proteob... 115 3e-24 UniRef50_B9EIV3 Cacna2d4 protein n=1 Tax=Mus musculus RepID=B9EI... 115 3e-24 UniRef50_A0E9B3 Chromosome undetermined scaffold_84, whole genom... 115 3e-24 UniRef50_B5ZY26 Vault protein inter-alpha-trypsin domain protein... 115 3e-24 UniRef50_A0C9G5 Chromosome undetermined scaffold_16, whole genom... 115 4e-24 UniRef50_Q8PU63 Putative chloride channel n=1 Tax=Methanosarcina... 114 4e-24 UniRef50_Q4RV83 Chromosome 15 SCAF14992, whole genome shotgun se... 114 4e-24 UniRef50_A5B5Z1 Putative uncharacterized protein n=2 Tax=Vitis v... 114 4e-24 UniRef50_A1S752 Inter-alpha-trypsin inhibitor domain protein n=1... 114 5e-24 UniRef50_D1A557 Vault protein inter-alpha-trypsin domain protein... 114 5e-24 UniRef50_B9SJS6 Protein binding protein, putative n=6 Tax=Magnol... 114 5e-24 UniRef50_UPI0000D560E4 PREDICTED: similar to inter-alpha (globul... 114 5e-24 UniRef50_A9V370 Predicted protein n=1 Tax=Monosiga brevicollis R... 114 6e-24 UniRef50_A0C5K4 Chromosome undetermined scaffold_150, whole geno... 114 6e-24 UniRef50_A9FTM1 Putative uncharacterized protein n=1 Tax=Sorangi... 114 6e-24 UniRef50_A9GUK8 Putative uncharacterized protein yfbK n=1 Tax=So... 114 7e-24 UniRef50_Q7G2L9 Os10g0464500 protein n=3 Tax=Magnoliophyta RepID... 114 7e-24 UniRef50_A0CDA1 Chromosome undetermined scaffold_17, whole genom... 114 7e-24 UniRef50_C0PS55 Putative uncharacterized protein n=1 Tax=Picea s... 114 8e-24 UniRef50_Q47YR5 Von Willebrand factor type A domain protein n=2 ... 114 8e-24 UniRef50_A7RKA1 Predicted protein n=2 Tax=Nematostella vectensis... 114 8e-24 UniRef50_A3QDW1 Vault protein inter-alpha-trypsin domain protein... 113 1e-23 UniRef50_Q67LZ3 Putative uncharacterized protein n=1 Tax=Symbiob... 113 1e-23 UniRef50_A2AR69 Novel protein similar to vertebrate inter-alpha ... 113 1e-23 UniRef50_A6BYV9 Putative uncharacterized protein n=1 Tax=Plancto... 113 1e-23 UniRef50_UPI0001760CA2 PREDICTED: inter-alpha (globulin) inhibit... 113 1e-23 UniRef50_C5WZE3 Putative uncharacterized protein Sb01g019910 n=1... 113 2e-23 UniRef50_Q4SBF6 Chromosome 11 SCAF14674, whole genome shotgun se... 113 2e-23 UniRef50_A7RTF3 Predicted protein (Fragment) n=1 Tax=Nematostell... 113 2e-23 UniRef50_C9LTL4 Magnesium-chelatase, subunit D/I family n=1 Tax=... 112 2e-23 UniRef50_A1ZUW0 Von Willebrand factor, type A n=1 Tax=Microscill... 112 2e-23 UniRef50_UPI0001926ED6 PREDICTED: similar to inter-alpha trypsin... 112 3e-23 UniRef50_A7RNW3 Predicted protein n=3 Tax=Nematostella vectensis... 112 3e-23 UniRef50_UPI00006CAF43 von Willebrand factor type A domain conta... 111 4e-23 UniRef50_A0DZ93 Chromosome undetermined scaffold_7, whole genome... 111 4e-23 UniRef50_Q55874 Uncharacterized protein sll0103 n=16 Tax=Cyanoba... 111 4e-23 UniRef50_A5UYN7 von Willebrand factor, type A n=1 Tax=Roseiflexu... 111 5e-23 UniRef50_B0SHY6 Anti-sigma factor antagonist n=2 Tax=Leptospira ... 111 5e-23 UniRef50_C3JL94 von Willebrand factor type A domain protein n=1 ... 111 5e-23 UniRef50_C4XPW8 Putative uncharacterized protein n=1 Tax=Desulfo... 111 5e-23 UniRef50_Q503P4 Zgc:110377 n=9 Tax=Clupeocephala RepID=Q503P4_DANRE 111 5e-23 UniRef50_B4DPQ4 cDNA FLJ60769, highly similar to Inter-alpha-try... 111 5e-23 UniRef50_Q6PGW2 Zgc:112265 protein (Fragment) n=15 Tax=Clupeocep... 111 6e-23 UniRef50_Q498Q0 Inter-alpha (Globulin) inhibitor H3 n=6 Tax=Clup... 111 6e-23 UniRef50_C7R936 Vault protein inter-alpha-trypsin domain protein... 111 6e-23 UniRef50_Q12VX7 Putative uncharacterized protein n=1 Tax=Methano... 111 6e-23 UniRef50_UPI00017B0D26 UPI00017B0D26 related cluster n=2 Tax=Tet... 111 7e-23 UniRef50_UPI00017450FB von Willebrand factor type A domain prote... 111 7e-23 UniRef50_UPI00016E8A41 UPI00016E8A41 related cluster n=3 Tax=Tak... 111 7e-23 UniRef50_A9F1H2 Family membership n=1 Tax=Sorangium cellulosum '... 110 8e-23 UniRef50_A6CIG8 Putative uncharacterized protein n=1 Tax=Bacillu... 110 9e-23 UniRef50_B6ZDR6 Voltage dependent calcium channel alpha2d/delta ... 110 1e-22 UniRef50_C3ZG18 Putative uncharacterized protein n=1 Tax=Branchi... 110 1e-22 UniRef50_B1ZQD5 von Willebrand factor type A n=1 Tax=Opitutus te... 110 1e-22 UniRef50_D2LQW0 von Willebrand factor type A n=1 Tax=Bacillus ce... 110 1e-22 UniRef50_A0EHG0 Chromosome undetermined scaffold_97, whole genom... 110 1e-22 UniRef50_Q09DT2 Inter-alpha-trypsin inhibitor family heavy chain... 110 1e-22 UniRef50_UPI0000E460BF PREDICTED: similar to inter-alpha-trypsin... 110 1e-22 UniRef50_Q0AV90 Putative uncharacterized protein n=1 Tax=Syntrop... 109 1e-22 UniRef50_C5WYV0 Putative uncharacterized protein Sb01g047470 n=3... 109 1e-22 UniRef50_A2E1S5 von Willebrand factor type A domain containing p... 109 2e-22 UniRef50_A1U6Y4 Vault protein inter-alpha-trypsin domain protein... 109 2e-22 UniRef50_P19823 Inter-alpha-trypsin inhibitor heavy chain H2 n=4... 109 2e-22 UniRef50_UPI000180BC4A PREDICTED: similar to predicted protein n... 109 2e-22 UniRef50_Q7UNM0 Putative uncharacterized protein n=1 Tax=Rhodopi... 109 2e-22 UniRef50_UPI0001744662 Vault protein inter-alpha-trypsin domain ... 109 2e-22 UniRef50_B9GVZ4 Predicted protein n=4 Tax=rosids RepID=B9GVZ4_POPTR 109 2e-22 UniRef50_Q22N58 von Willebrand factor type A domain containing p... 109 2e-22 UniRef50_A0C051 Chromosome undetermined scaffold_14, whole genom... 109 3e-22 UniRef50_B0UK93 LPXTG-motif cell wall anchor domain protein n=1 ... 109 3e-22 UniRef50_A6DLI0 Putative uncharacterized protein n=1 Tax=Lentisp... 109 3e-22 UniRef50_Q562D1 LOC594926 protein (Fragment) n=18 Tax=Euteleosto... 109 3e-22 UniRef50_Q1DE81 von Willebrand factor type A domain protein n=2 ... 108 3e-22 UniRef50_UPI0000E105CF vault protein inter-alpha-trypsin n=1 Tax... 108 3e-22 UniRef50_D0KVI6 Vault protein inter-alpha-trypsin domain protein... 108 4e-22 UniRef50_A2E6Y7 von Willebrand factor type A domain containing p... 108 4e-22 UniRef50_A6GAI6 Putative uncharacterized protein n=1 Tax=Plesioc... 108 5e-22 UniRef50_A6Q208 von Willebrand factor type A domain protein n=1 ... 108 5e-22 UniRef50_Q22ST4 von Willebrand factor type A domain containing p... 108 5e-22 UniRef50_A2SP98 Putative uncharacterized protein n=1 Tax=Methyli... 108 5e-22 UniRef50_A1VI76 Vault protein inter-alpha-trypsin domain protein... 108 5e-22 UniRef50_A9QZI4 von Willebrand factor type A domain protein n=26... 107 6e-22 UniRef50_O28828 Putative uncharacterized protein n=1 Tax=Archaeo... 107 7e-22 UniRef50_A1ZFT4 Von Willebrand factor, type A n=1 Tax=Microscill... 107 7e-22 UniRef50_Q8H924 Os10g0464900 protein n=3 Tax=Oryza sativa RepID=... 107 8e-22 UniRef50_C1D2W8 Putative uncharacterized protein n=1 Tax=Deinoco... 107 8e-22 UniRef50_B2HDT6 Putative uncharacterized protein n=3 Tax=Mycobac... 107 9e-22 UniRef50_C5Z1W1 Putative uncharacterized protein Sb10g030210 n=1... 107 9e-22 UniRef50_B0CG18 von Willebrand factor type A domain protein, put... 107 1e-21 UniRef50_B8AE57 Putative uncharacterized protein n=1 Tax=Oryza s... 107 1e-21 UniRef50_A0EFJ5 Chromosome undetermined scaffold_93, whole genom... 107 1e-21 UniRef50_B4VXM6 Vault protein inter-alpha-trypsin n=1 Tax=Microc... 106 1e-21 UniRef50_Q8YP40 Alr4360 protein n=15 Tax=Cyanobacteria RepID=Q8Y... 106 1e-21 UniRef50_Q0AMP5 Vault protein inter-alpha-trypsin domain protein... 106 1e-21 UniRef50_UPI000180D3E0 PREDICTED: similar to LOC779593 protein n... 106 2e-21 UniRef50_B8BII0 Putative uncharacterized protein n=1 Tax=Oryza s... 106 2e-21 UniRef50_A4XHD9 von Willebrand factor, type A n=2 Tax=Clostridia... 106 2e-21 UniRef50_C9SWV9 U-box domain containing protein n=1 Tax=Verticil... 106 2e-21 UniRef50_D0LTR8 von Willebrand factor type A n=1 Tax=Haliangium ... 106 2e-21 UniRef50_Q0A603 von Willebrand factor, type A n=1 Tax=Alkalilimn... 106 2e-21 UniRef50_C1XMC3 Uncharacterized protein containing a von Willebr... 106 2e-21 UniRef50_A6R161 Predicted protein n=3 Tax=Onygenales RepID=A6R16... 106 2e-21 UniRef50_D2H285 Putative uncharacterized protein (Fragment) n=1 ... 106 2e-21 UniRef50_A2F7N4 von Willebrand factor type A domain containing p... 106 2e-21 UniRef50_UPI00005A0386 PREDICTED: similar to loss of heterozygos... 106 2e-21 UniRef50_UPI0001788F71 von Willebrand factor type A n=1 Tax=Geob... 106 2e-21 UniRef50_B9XLE8 Vault protein inter-alpha-trypsin domain protein... 106 2e-21 UniRef50_A2DWC0 von Willebrand factor type A domain containing p... 105 3e-21 UniRef50_B1HPN0 Putative uncharacterized protein n=2 Tax=Bacilla... 105 3e-21 UniRef50_A6G415 von Willebrand factor, type A n=1 Tax=Plesiocyst... 105 4e-21 UniRef50_B8AXM1 Putative uncharacterized protein n=1 Tax=Oryza s... 105 4e-21 UniRef50_A6Q2J6 von Willebrand factor type A domain protein n=1 ... 105 4e-21 UniRef50_Q6ZED8 Slr7060 protein n=2 Tax=Chroococcales RepID=Q6ZE... 105 4e-21 UniRef50_B5JPY1 von Willebrand factor type A domain protein n=1 ... 105 4e-21 UniRef50_A2FKC6 von Willebrand factor type A domain containing p... 105 4e-21 UniRef50_B9RR85 Inter-alpha-trypsin inhibitor heavy chain, putat... 105 4e-21 UniRef50_Q23FU3 von Willebrand factor type A domain containing p... 104 4e-21 UniRef50_Q1D9B7 von Willebrand factor type A domain protein n=3 ... 104 5e-21 UniRef50_Q8LQ58 Os01g0640200 protein n=9 Tax=Poaceae RepID=Q8LQ5... 104 5e-21 UniRef50_Q4RF07 Chromosome 13 SCAF15122, whole genome shotgun se... 104 6e-21 UniRef50_A8M9M1 von Willebrand factor type A n=1 Tax=Caldivirga ... 104 6e-21 UniRef50_C7YL43 Putative uncharacterized protein n=2 Tax=Nectria... 104 6e-21 UniRef50_A6G2V8 von Willebrand factor, type A n=1 Tax=Plesiocyst... 104 6e-21 UniRef50_Q24C76 von Willebrand factor type A domain containing p... 104 6e-21 UniRef50_D1CG77 von Willebrand factor type A; type II secretion ... 104 8e-21 UniRef50_C1GWG1 von Willebrand factor type A domain containing p... 104 8e-21 UniRef50_UPI0001C1630F hypothetical protein CRD_00534 n=2 Tax=No... 104 8e-21 UniRef50_Q2QZH3 Os11g0687100 protein n=79 Tax=Eukaryota RepID=Q2... 104 9e-21 UniRef50_UPI000174639B hypothetical protein VspiD_18615 n=1 Tax=... 104 9e-21 UniRef50_Q1NTK1 Von Willebrand factor, type A n=2 Tax=delta prot... 103 1e-20 UniRef50_Q30SV4 von Willebrand factor, type A n=2 Tax=Campylobac... 103 1e-20 UniRef50_B5YCB4 von Willebrand factor type A domain protein n=2 ... 103 1e-20 UniRef50_UPI0000F2D28F PREDICTED: hypothetical protein n=1 Tax=M... 103 1e-20 UniRef50_B8G546 von Willebrand factor type A n=6 Tax=Chloroflexi... 103 1e-20 UniRef50_Q8YNZ7 Alr4412 protein n=6 Tax=Cyanobacteria RepID=Q8YN... 103 1e-20 UniRef50_Q54CQ8 von Willebrand factor A domain-containing protei... 103 1e-20 UniRef50_Q2BJ22 Putative uncharacterized protein n=1 Tax=Neptuni... 103 1e-20 UniRef50_D0LL92 von Willebrand factor type A n=1 Tax=Haliangium ... 103 1e-20 UniRef50_Q46AG0 BatA n=3 Tax=Methanomicrobia RepID=Q46AG0_METBF 103 1e-20 UniRef50_D2VHB8 Predicted protein n=1 Tax=Naegleria gruberi RepI... 103 2e-20 UniRef50_UPI0001A2C533 UPI0001A2C533 related cluster n=1 Tax=Dan... 103 2e-20 UniRef50_UPI0001A2C532 UPI0001A2C532 related cluster n=2 Tax=Clu... 103 2e-20 UniRef50_C5YMJ6 Putative uncharacterized protein Sb07g002215 (Fr... 103 2e-20 UniRef50_D1WZ12 VWA containing CoxE family protein n=13 Tax=Stre... 103 2e-20 UniRef50_UPI0001C378BC von Willebrand factor, type A n=1 Tax=Rum... 103 2e-20 UniRef50_Q3A188 von Willebrand factor type A domain protein n=1 ... 103 2e-20 UniRef50_A9WI94 von Willebrand factor type A n=2 Tax=Chloroflexu... 103 2e-20 UniRef50_A9AXC2 von Willebrand factor type A n=6 Tax=Chloroflexi... 102 2e-20 UniRef50_D2V0V6 Predicted protein n=1 Tax=Naegleria gruberi RepI... 102 3e-20 UniRef50_UPI000179F51C Novel protein. n=1 Tax=Bos taurus RepID=U... 102 3e-20 UniRef50_Q5RHF3 Novel protein similar to vertebrate inter-alpha ... 102 3e-20 UniRef50_Q23JA0 von Willebrand factor type A domain containing p... 102 3e-20 UniRef50_A9DKM0 Putative uncharacterized protein (Fragment) n=1 ... 102 3e-20 UniRef50_UPI00016DFBC7 UPI00016DFBC7 related cluster n=4 Tax=Tak... 102 3e-20 UniRef50_A5UTA6 von Willebrand factor, type A n=6 Tax=Chloroflex... 102 3e-20 UniRef50_A9Z1V5 von Willebrand factor A domain-containing protei... 101 4e-20 UniRef50_Q2B7C3 Putative uncharacterized protein n=1 Tax=Bacillu... 101 4e-20 UniRef50_A2E0T6 von Willebrand factor type A domain containing p... 101 4e-20 UniRef50_A0LPK8 Vault protein inter-alpha-trypsin domain protein... 101 4e-20 UniRef50_A6G857 Putative uncharacterized protein n=1 Tax=Plesioc... 101 4e-20 UniRef50_Q1VY89 Inter-alpha-trypsin inhibitor family heavy chain... 101 4e-20 UniRef50_Q54DU5 von Willebrand factor A domain-containing protei... 101 4e-20 UniRef50_D0LNY0 von Willebrand factor type A n=1 Tax=Haliangium ... 101 5e-20 UniRef50_UPI00015B5332 PREDICTED: similar to ENSANGP00000020925 ... 101 5e-20 UniRef50_A4YGI9 von Willebrand factor, type A n=1 Tax=Metallosph... 101 5e-20 UniRef50_Q09E12 Inter-alpha-inhibitor H4 heavy chain, putative n... 101 6e-20 UniRef50_B0TSG0 Vault protein inter-alpha-trypsin domain protein... 101 6e-20 UniRef50_C0Z595 Putative uncharacterized protein n=1 Tax=Breviba... 101 7e-20 UniRef50_Q7SGD8 Predicted protein n=4 Tax=Sordariales RepID=Q7SG... 101 7e-20 UniRef50_C6VUB8 von Willebrand factor type A n=1 Tax=Dyadobacter... 101 7e-20 UniRef50_UPI000180D2FB PREDICTED: similar to inter-alpha (globul... 101 7e-20 UniRef50_Q8H923 Putative uncharacterized protein OSJNBa0071K18.1... 101 7e-20 UniRef50_C7G046 von Willebrand factor A domain-containing protei... 100 8e-20 UniRef50_B4W304 von Willebrand factor type A domain protein (Fra... 100 8e-20 UniRef50_C3YRH6 Putative uncharacterized protein n=1 Tax=Branchi... 100 9e-20 UniRef50_C1I2R0 von Willebrand factor n=1 Tax=Clostridium sp. 7_... 100 9e-20 UniRef50_A6GDG5 Putative lipoprotein n=1 Tax=Plesiocystis pacifi... 100 9e-20 UniRef50_C5YHY2 Putative uncharacterized protein Sb07g005010 n=2... 100 9e-20 UniRef50_B6TZ81 Protein binding protein n=9 Tax=Magnoliophyta Re... 100 1e-19 UniRef50_B7AA98 von Willebrand factor type A n=3 Tax=Thermus Rep... 100 1e-19 UniRef50_Q4S685 Chromosome 9 SCAF14729, whole genome shotgun seq... 100 1e-19 UniRef50_Q9LMB7 F14D16.26 n=5 Tax=rosids RepID=Q9LMB7_ARATH 100 1e-19 UniRef50_UPI00006CDA4D Glutathionylspermidine synthase family pr... 100 1e-19 UniRef50_A9U149 Predicted protein n=1 Tax=Physcomitrella patens ... 100 1e-19 UniRef50_C6PWL8 von Willebrand factor type A n=1 Tax=Clostridium... 100 1e-19 UniRef50_A8VYD1 Extracellular solute-binding protein, family 5 n... 100 1e-19 UniRef50_Q28XX9 GA11538 n=5 Tax=Drosophila RepID=Q28XX9_DROPS 99 1e-19 UniRef50_B2VYM4 von Willebrand domain containing protein n=3 Tax... 99 2e-19 UniRef50_C8SCW7 Putative uncharacterized protein n=1 Tax=Ferrogl... 99 2e-19 UniRef50_C0M4X9 Inter-alpha-trypsin inhibitor heavy chain H4 (Fr... 99 2e-19 UniRef50_Q1DFU7 von Willebrand factor type A domain protein n=2 ... 100 2e-19 UniRef50_A6GC99 Putative uncharacterized protein n=1 Tax=Plesioc... 100 2e-19 UniRef50_B8HSI1 von Willebrand factor type A n=8 Tax=Cyanobacter... 100 2e-19 UniRef50_Q24CQ9 von Willebrand factor type A domain containing p... 99 3e-19 UniRef50_A8SU73 Putative uncharacterized protein n=1 Tax=Coproco... 99 3e-19 UniRef50_UPI0001757D5D PREDICTED: similar to AGAP009579-PA n=1 T... 99 3e-19 UniRef50_D0LJL4 Myxococcales GC_trans_RRR domain protein n=1 Tax... 99 3e-19 UniRef50_A0M6V8 Membrane protein containing von Willebrand facto... 99 3e-19 UniRef50_A9L314 Outer membrane adhesin like proteiin n=5 Tax=She... 99 3e-19 UniRef50_B9ML47 YD repeat protein n=1 Tax=Anaerocellum thermophi... 99 3e-19 UniRef50_UPI000186D791 calcium channel, putative n=3 Tax=Neopter... 99 3e-19 UniRef50_Q11Y10 Possible outer membrane protein n=1 Tax=Cytophag... 99 4e-19 UniRef50_C1XWJ2 von Willebrand factor type A-like protein n=1 Ta... 99 4e-19 UniRef50_A0CCS0 Chromosome undetermined scaffold_168, whole geno... 99 4e-19 UniRef50_Q7ULL3 Putative uncharacterized protein n=1 Tax=Rhodopi... 99 4e-19 UniRef50_UPI00006CF36E U-box domain containing protein n=1 Tax=T... 98 4e-19 UniRef50_Q5TIE3-4 Isoform 4 of von Willebrand factor A domain-co... 98 5e-19 UniRef50_Q5TIE3 von Willebrand factor A domain-containing protei... 98 5e-19 UniRef50_UPI0001792BA1 PREDICTED: similar to Inter-alpha-trypsin... 98 5e-19 UniRef50_Q54DV3 von Willebrand factor A domain-containing protei... 98 5e-19 UniRef50_B8F8Z6 von Willebrand factor type A n=1 Tax=Desulfatiba... 98 6e-19 UniRef50_D2QTD7 von Willebrand factor type A n=1 Tax=Spirosoma l... 98 6e-19 UniRef50_B2AQN8 Predicted CDS Pa_4_3600 n=1 Tax=Podospora anseri... 98 7e-19 UniRef50_C0HF51 Putative uncharacterized protein n=1 Tax=Zea may... 98 7e-19 UniRef50_C3YRH3 Putative uncharacterized protein (Fragment) n=1 ... 98 7e-19 UniRef50_D0LWF9 von Willebrand factor type A n=1 Tax=Haliangium ... 98 7e-19 UniRef50_Q60ED8 Von Willebrand factor type A domain containing p... 98 7e-19 UniRef50_Q10JU7 Von Willebrand factor type A domain containing p... 98 7e-19 UniRef50_C6X1I3 BatA (Bacteroides aerotolerance operon) n=1 Tax=... 98 8e-19 UniRef50_A9RSX3 Predicted protein n=1 Tax=Physcomitrella patens ... 98 8e-19 UniRef50_Q8YW34 All1782 protein n=5 Tax=Cyanobacteria RepID=Q8YW... 98 8e-19 UniRef50_Q2BCF0 Possible D-amino acid dehydrogenase, large subun... 98 8e-19 UniRef50_C1XUY0 Mg-chelatase subunit ChlD n=2 Tax=Meiothermus Re... 97 9e-19 UniRef50_Q4RW93 Chromosome 9 SCAF14991, whole genome shotgun seq... 97 1e-18 UniRef50_A0LPD4 von Willebrand factor, type A n=1 Tax=Syntrophob... 97 1e-18 UniRef50_C9LLI0 Magnesium-chelatase, subunit D/I family n=1 Tax=... 97 1e-18 UniRef50_A3DLZ3 von Willebrand factor, type A n=1 Tax=Staphyloth... 97 1e-18 UniRef50_C9LDM7 BatA protein n=10 Tax=Prevotella RepID=C9LDM7_9BACT 97 1e-18 UniRef50_A0CDA0 Chromosome undetermined scaffold_17, whole genom... 97 1e-18 UniRef50_Q747C5 Conserved domain protein n=11 Tax=Deltaproteobac... 97 1e-18 UniRef50_B5YMD8 Predicted protein n=1 Tax=Thalassiosira pseudona... 97 1e-18 UniRef50_Q24FW2 von Willebrand factor type A domain containing p... 97 1e-18 UniRef50_A8N264 Putative uncharacterized protein n=1 Tax=Coprino... 97 1e-18 UniRef50_UPI000058940A PREDICTED: similar to inter-alpha (globul... 97 1e-18 UniRef50_B9L896 von Willebrand factor, type A n=1 Tax=Nautilia p... 97 1e-18 UniRef50_UPI000186D9CC conserved hypothetical protein n=1 Tax=Pe... 96 2e-18 UniRef50_A1ZRP2 Von Willebrand factor type A domain protein n=1 ... 96 2e-18 UniRef50_Q7UNJ0 Putative uncharacterized protein n=1 Tax=Rhodopi... 96 2e-18 UniRef50_Q55G98 von Willebrand factor A domain-containing protei... 96 2e-18 UniRef50_A0LHW4 Vault protein inter-alpha-trypsin domain protein... 96 2e-18 UniRef50_C2MDE3 BatA protein n=6 Tax=Bacteroidales RepID=C2MDE3_... 96 2e-18 UniRef50_Q2QSE5 Os12g0431700 protein n=3 Tax=Oryza sativa RepID=... 96 2e-18 UniRef50_B2SKX9 von Willebrand factor type A domain protein n=13... 96 2e-18 UniRef50_A6GBY0 Putative uncharacterized protein n=1 Tax=Plesioc... 96 2e-18 UniRef50_O00534 von Willebrand factor A domain-containing protei... 96 2e-18 UniRef50_A6QBT6 von Willebrand factor type A domain protein n=1 ... 96 2e-18 UniRef50_A9AX98 von Willebrand factor type A n=1 Tax=Herpetosiph... 96 2e-18 UniRef50_D1BQE7 von Willebrand factor type A n=1 Tax=Veillonella... 96 2e-18 UniRef50_B4UFP8 von Willebrand factor type A n=1 Tax=Anaeromyxob... 96 2e-18 UniRef50_Q23J98 von Willebrand factor type A domain containing p... 96 2e-18 UniRef50_C7FPD9 Uncharacterized protein n=2 Tax=environmental sa... 96 3e-18 UniRef50_Q22SJ7 von Willebrand factor type A domain containing p... 96 3e-18 UniRef50_A0Z5Z1 BatB protein, putative n=2 Tax=unclassified Gamm... 96 3e-18 UniRef50_B7KCF7 von Willebrand factor type A n=1 Tax=Cyanothece ... 96 3e-18 UniRef50_C6VXL7 von Willebrand factor type A n=1 Tax=Dyadobacter... 96 3e-18 UniRef50_A9SQ90 Predicted protein n=3 Tax=Physcomitrella patens ... 95 3e-18 UniRef50_UPI0001C38CFE von Willebrand factor, type A n=1 Tax=Art... 95 3e-18 UniRef50_Q54MG4 von Willebrand factor A domain-containing protei... 95 3e-18 UniRef50_Q114A2 von Willebrand factor, type A n=3 Tax=Cyanobacte... 95 4e-18 UniRef50_B9GN58 Predicted protein (Fragment) n=3 Tax=rosids RepI... 95 4e-18 UniRef50_Q10Z89 von Willebrand factor, type A n=1 Tax=Trichodesm... 95 4e-18 UniRef50_Q01UI0 von Willebrand factor, type A n=1 Tax=Candidatus... 95 4e-18 UniRef50_Q8TYU9 Mg-chelatase subunit ChlI and Chld (MoxR-like AT... 95 4e-18 UniRef50_A7SV91 Predicted protein (Fragment) n=2 Tax=Nematostell... 95 4e-18 UniRef50_Q7K0H4 Straightjacket n=11 Tax=Coelomata RepID=Q7K0H4_D... 95 4e-18 UniRef50_Q8TU27 Putative uncharacterized protein n=1 Tax=Methano... 95 4e-18 UniRef50_C0D9H7 Putative uncharacterized protein n=1 Tax=Clostri... 95 5e-18 UniRef50_C4DQN3 Uncharacterized protein containing a von Willebr... 95 5e-18 UniRef50_Q2GTB7 Putative uncharacterized protein n=1 Tax=Chaetom... 95 5e-18 UniRef50_Q1Q2F4 Putative uncharacterized protein n=1 Tax=Candida... 95 5e-18 UniRef50_C7RNW6 von Willebrand factor type A n=1 Tax=Candidatus ... 95 6e-18 UniRef50_A3ZR58 Putative uncharacterized protein n=1 Tax=Blastop... 95 6e-18 UniRef50_Q6ZFR4 Zinc finger (C3HC4-type RING finger) protein fam... 95 6e-18 UniRef50_B8G7Y1 von Willebrand factor type A n=3 Tax=Chloroflexu... 95 6e-18 UniRef50_A7T2Z0 Predicted protein n=4 Tax=Nematostella vectensis... 95 6e-18 UniRef50_B3RZT6 Putative uncharacterized protein n=2 Tax=Trichop... 95 7e-18 UniRef50_B2KDS9 von Willebrand factor type A n=1 Tax=Elusimicrob... 95 7e-18 UniRef50_A9FM70 Putative uncharacterized protein n=1 Tax=Sorangi... 95 7e-18 UniRef50_UPI0000E47594 PREDICTED: similar to inter-alpha (globul... 94 8e-18 UniRef50_UPI00006CD16B von Willebrand factor type A domain conta... 94 9e-18 UniRef50_A3DK47 von Willebrand factor, type A n=9 Tax=cellular o... 94 9e-18 UniRef50_D0MEC0 von Willebrand factor type A n=1 Tax=Rhodothermu... 94 9e-18 UniRef50_A7C4W6 von Willebrand factor, type A n=1 Tax=Beggiatoa ... 94 1e-17 UniRef50_D1CCX6 von Willebrand factor type A n=1 Tax=Thermobacul... 94 1e-17 UniRef50_B7XGM9 Putative uncharacterized protein n=2 Tax=Pseudom... 94 1e-17 UniRef50_C0E6Z8 Putative uncharacterized protein n=2 Tax=Coryneb... 94 1e-17 UniRef50_D2RGD5 von Willebrand factor type A n=1 Tax=Archaeoglob... 94 1e-17 UniRef50_C7PW75 Vault protein inter-alpha-trypsin domain protein... 94 1e-17 UniRef50_A6FSG0 Putative uncharacterized protein n=1 Tax=Roseoba... 94 1e-17 UniRef50_D2W4Q3 Predicted protein n=1 Tax=Naegleria gruberi RepI... 94 1e-17 UniRef50_A7RFL6 Predicted protein n=1 Tax=Nematostella vectensis... 94 1e-17 UniRef50_Q4RX89 Chromosome 11 SCAF14979, whole genome shotgun se... 93 1e-17 UniRef50_B5EUF0 von Willebrand factor, type A n=44 Tax=Vibrionac... 93 1e-17 UniRef50_C5FLY1 U-box domain containing protein n=1 Tax=Microspo... 93 1e-17 UniRef50_D2VY88 Predicted protein n=1 Tax=Naegleria gruberi RepI... 93 2e-17 UniRef50_B0VJ57 Putative uncharacterized protein n=1 Tax=Candida... 93 2e-17 UniRef50_Q6L2C8 Putative uncharacterized protein n=1 Tax=Picroph... 93 2e-17 UniRef50_B3QUN4 von Willebrand factor type A n=4 Tax=Bacteria Re... 93 2e-17 UniRef50_UPI000178810F von Willebrand factor type A n=1 Tax=Geob... 93 2e-17 UniRef50_C0ZKA0 Putative uncharacterized protein n=2 Tax=Bacteri... 93 2e-17 UniRef50_B7G6X2 Predicted protein n=1 Tax=Phaeodactylum tricornu... 93 2e-17 UniRef50_Q6IND5 MGC83495 protein n=9 Tax=cellular organisms RepI... 93 2e-17 UniRef50_B8GAZ1 von Willebrand factor type A n=3 Tax=Chloroflexu... 93 2e-17 UniRef50_UPI00006CC819 von Willebrand factor type A domain conta... 93 2e-17 UniRef50_P19827 Inter-alpha-trypsin inhibitor heavy chain H1 n=6... 93 3e-17 UniRef50_C3NI41 von Willebrand factor type A n=9 Tax=Sulfolobus ... 93 3e-17 UniRef50_C0QP91 Putative von Willebrand factor type A domain pro... 93 3e-17 UniRef50_D1RCP1 von Willebrand factor type A domain protein n=6 ... 92 3e-17 UniRef50_A5UWS5 von Willebrand factor, type A n=2 Tax=Roseiflexu... 92 3e-17 UniRef50_Q237Q6 von Willebrand factor type A domain containing p... 92 3e-17 UniRef50_UPI00006CC94A von Willebrand factor type A domain conta... 92 3e-17 UniRef50_Q0AZS1 Mg-chelatase subunit ChlD-like protein n=1 Tax=S... 92 3e-17 UniRef50_B2HK18 Conserved membrane protein n=3 Tax=Mycobacterium... 92 3e-17 UniRef50_Q23KK4 von Willebrand factor type A domain containing p... 92 3e-17 UniRef50_Q11RQ7 BatA-like protein, aerotolerance-related n=1 Tax... 92 3e-17 UniRef50_Q17A73 Dihydropyridine-sensitive l-type calcium channel... 92 3e-17 UniRef50_UPI0000E4A663 PREDICTED: similar to calcium activated c... 92 3e-17 UniRef50_UPI0001C34E55 hypothetical protein ClM62_13922 n=1 Tax=... 92 4e-17 UniRef50_D2VDM1 Predicted protein n=1 Tax=Naegleria gruberi RepI... 92 4e-17 UniRef50_A6G7V2 von Willebrand factor, type A n=1 Tax=Plesiocyst... 92 4e-17 UniRef50_Q21JX5 von Willebrand factor, type A n=8 Tax=Gammaprote... 92 4e-17 UniRef50_C1MHV2 Predicted protein n=1 Tax=Micromonas pusilla CCM... 92 4e-17 UniRef50_C1XFI8 Mg-chelatase subunit ChlD n=2 Tax=Meiothermus Re... 92 4e-17 UniRef50_A0CHZ1 Chromosome undetermined scaffold_185, whole geno... 92 4e-17 UniRef50_A6TP10 von Willebrand factor, type A n=1 Tax=Alkaliphil... 92 5e-17 UniRef50_D2Q363 von Willebrand factor type A n=1 Tax=Kribbella f... 92 5e-17 UniRef50_D0LD98 von Willebrand factor type A n=1 Tax=Gordonia br... 91 5e-17 UniRef50_C5BKZ8 von Willebrand factor type A domain protein n=5 ... 91 5e-17 UniRef50_C8NWP6 Putative uncharacterized protein n=1 Tax=Coryneb... 91 5e-17 UniRef50_Q31JK3 Type A von Willebrand factor-like n=1 Tax=Thiomi... 91 6e-17 UniRef50_Q22NG1 von Willebrand factor type A domain containing p... 91 6e-17 UniRef50_C4V3L6 Magnesium chelatase n=2 Tax=Selenomonas RepID=C4... 91 6e-17 UniRef50_B5W7H4 von Willebrand factor type A n=1 Tax=Arthrospira... 91 6e-17 UniRef50_Q0AW90 Conserved putative chloride channel n=1 Tax=Synt... 91 6e-17 UniRef50_B5JCH3 Putative uncharacterized protein (Fragment) n=1 ... 91 7e-17 UniRef50_B5JQC2 Vault protein inter-alpha-trypsin n=1 Tax=Verruc... 91 7e-17 UniRef50_C5GK44 U-box domain-containing protein n=2 Tax=Ajellomy... 91 7e-17 UniRef50_UPI0000E49DB4 PREDICTED: similar to poly (ADP-ribose) p... 91 8e-17 UniRef50_B0VJ58 BatA protein n=1 Tax=Candidatus Cloacamonas acid... 91 8e-17 UniRef50_B6HQ22 Pc22g19800 protein n=1 Tax=Penicillium chrysogen... 91 8e-17 UniRef50_Q9ZGE6 Magnesium-chelatase 67 kDa subunit n=2 Tax=Helio... 91 8e-17 UniRef50_D1ZZE6 Putative uncharacterized protein GLEAN_08029 n=1... 91 9e-17 UniRef50_A6QCY4 von Willebrand factor type A domain protein n=2 ... 91 1e-16 UniRef50_B0WHU4 Sushi n=3 Tax=Culicini RepID=B0WHU4_CULQU 91 1e-16 UniRef50_A3HZP7 BatA protein n=1 Tax=Algoriphagus sp. PR1 RepID=... 90 1e-16 UniRef50_A7HHW8 Vault protein inter-alpha-trypsin domain protein... 90 1e-16 UniRef50_C7ZL29 Putative uncharacterized protein n=2 Tax=Nectria... 90 1e-16 UniRef50_Q2W311 Putative uncharacterized protein n=1 Tax=Magneto... 90 1e-16 UniRef50_C0QXK8 von Willebrand factor type A (VWA) domain contai... 90 1e-16 UniRef50_UPI00005843FB PREDICTED: hypothetical protein n=1 Tax=S... 90 1e-16 UniRef50_A1S119 von Willebrand factor, type A n=1 Tax=Thermofilu... 90 1e-16 UniRef50_C5CEE7 PEGA domain protein n=1 Tax=Kosmotoga olearia TB... 90 1e-16 UniRef50_Q22HH7 von Willebrand factor type A domain containing p... 90 1e-16 UniRef50_D1YYY2 Putative uncharacterized protein n=1 Tax=Methano... 90 1e-16 UniRef50_B8FBV5 Putative uncharacterized protein n=1 Tax=Desulfa... 90 1e-16 UniRef50_B3DVX4 Uncharacterized protein containing a von Willebr... 90 2e-16 UniRef50_C6WL97 VWA containing CoxE family protein n=1 Tax=Actin... 90 2e-16 UniRef50_A6EQD3 von Willebrand factor type A like domain n=2 Tax... 90 2e-16 UniRef50_D1YD07 von Willebrand factor type A domain protein n=2 ... 90 2e-16 UniRef50_A3J9J6 Putative uncharacterized protein n=3 Tax=Bacteri... 90 2e-16 UniRef50_UPI000175F343 PREDICTED: similar to BCSC-1 n=3 Tax=Eute... 90 2e-16 UniRef50_UPI0001A2D396 UPI0001A2D396 related cluster n=5 Tax=Clu... 90 2e-16 UniRef50_A7S6T1 Predicted protein n=2 Tax=Nematostella vectensis... 90 2e-16 UniRef50_C4G1K3 Putative uncharacterized protein n=1 Tax=Abiotro... 90 2e-16 UniRef50_Q0W729 Putative uncharacterized protein n=1 Tax=uncultu... 90 2e-16 UniRef50_C6JMG8 Magnesium chelatase n=1 Tax=Fusobacterium varium... 90 2e-16 UniRef50_C7N1C6 Uncharacterized protein n=1 Tax=Slackia heliotri... 90 2e-16 UniRef50_D0N9W4 Putative uncharacterized protein n=2 Tax=Phytoph... 90 2e-16 UniRef50_A6QCW6 von Willebrand factor type A domain protein n=1 ... 90 2e-16 UniRef50_C3Y9U4 Putative uncharacterized protein n=1 Tax=Branchi... 90 2e-16 UniRef50_A9F2Q0 Putative uncharacterized protein n=1 Tax=Sorangi... 90 2e-16 UniRef50_A3I2X5 Putative batB protein n=1 Tax=Algoriphagus sp. P... 90 2e-16 UniRef50_B4BQC0 von Willebrand factor type A n=2 Tax=Geobacillus... 90 2e-16 UniRef50_Q5QXN1 Uncharacterized protein containing a von Willebr... 89 2e-16 UniRef50_B0XT03 von Willebrand domain protein n=4 Tax=Trichocoma... 89 3e-16 UniRef50_A0CY84 Chromosome undetermined scaffold_307, whole geno... 89 3e-16 UniRef50_A9B057 von Willebrand factor type A n=3 Tax=Chloroflexi... 89 3e-16 UniRef50_A7K1D3 Protein BatA n=19 Tax=Vibrionales RepID=A7K1D3_V... 89 3e-16 UniRef50_Q6ABM1 Magnesium-chelatase 67 kDa subunit n=4 Tax=Actin... 89 3e-16 UniRef50_D2A5S3 Putative uncharacterized protein GLEAN_15119 n=3... 89 3e-16 UniRef50_D0LKC7 von Willebrand factor type A n=1 Tax=Haliangium ... 89 3e-16 UniRef50_A6G2Y5 Putative uncharacterized protein n=1 Tax=Plesioc... 89 3e-16 UniRef50_Q233P7 von Willebrand factor type A domain containing p... 89 3e-16 UniRef50_B3EB10 von Willebrand factor type A n=2 Tax=Desulfuromo... 89 3e-16 UniRef50_A9UIA7 Hedgling (Fragment) n=4 Tax=Nematostella vectens... 89 3e-16 UniRef50_Q82LZ6 Putative uncharacterized protein n=1 Tax=Strepto... 89 3e-16 UniRef50_C4RGW7 Putative uncharacterized protein n=1 Tax=Micromo... 89 3e-16 UniRef50_Q7UL83 Inter-alpha-trypsin inhibitor family heavy chain... 89 4e-16 UniRef50_C3XUV0 Putative uncharacterized protein n=1 Tax=Branchi... 89 4e-16 UniRef50_A0PNU3 UPF0353 protein MUL_1490 n=43 Tax=Actinomycetale... 88 4e-16 UniRef50_A9GBN0 Putative uncharacterized protein n=1 Tax=Sorangi... 88 4e-16 UniRef50_A3TQW7 Putative membrane protein n=1 Tax=Janibacter sp.... 88 4e-16 UniRef50_UPI00006CD1DE von Willebrand factor type A domain conta... 88 5e-16 UniRef50_UPI00017F3212 von Willebrand factor, type A n=1 Tax=Esc... 88 5e-16 UniRef50_B3PJ55 von Willebrand factor type A domain protein n=1 ... 88 5e-16 UniRef50_D0LJ27 von Willebrand factor type A n=1 Tax=Haliangium ... 88 5e-16 UniRef50_A8J658 Collagen-related protein n=1 Tax=Chlamydomonas r... 88 5e-16 UniRef50_Q0VTG8 Protein containing a von Willebrand factor type ... 88 5e-16 UniRef50_Q2QZN4 von Willebrand factor type A domain containing p... 88 5e-16 UniRef50_Q7US47 Putative uncharacterized protein n=1 Tax=Rhodopi... 88 6e-16 UniRef50_B3QY78 von Willebrand factor type A n=1 Tax=Chloroherpe... 88 6e-16 UniRef50_C8VGR0 von Willebrand domain protein (AFU_orthologue; A... 88 7e-16 UniRef50_A0M6V9 Membrane protein containing von Willebrand facto... 88 7e-16 UniRef50_Q6VUC2 Putative uncharacterized protein n=1 Tax=Antonos... 88 8e-16 UniRef50_Q897H0 Membrane-associated protein n=1 Tax=Clostridium ... 88 8e-16 UniRef50_UPI000186D9CB dihydropyridine-sensitive L-type calcium ... 88 8e-16 UniRef50_Q54LJ4 Type A von Willebrand factor domain-containing p... 88 8e-16 UniRef50_A6NF34 Anthrax toxin receptor-like n=8 Tax=Catarrhini R... 88 8e-16 UniRef50_Q3M2E0 von Willebrand factor, type A n=2 Tax=Cyanobacte... 87 9e-16 UniRef50_B0TL26 von Willebrand factor type A n=7 Tax=Gammaproteo... 87 9e-16 UniRef50_D0QYP1 von Willebrand factor A domain containing 5A n=1... 87 9e-16 UniRef50_UPI0000ECD6E7 Poly [ADP-ribose] polymerase 4 (EC 2.4.2.... 87 1e-15 UniRef50_C7R6G5 von Willebrand factor type A n=1 Tax=Kangiella k... 87 1e-15 UniRef50_B6H7Y2 Pc16g08660 protein n=1 Tax=Penicillium chrysogen... 87 1e-15 UniRef50_A8J0D9 Flagellar associated protein n=1 Tax=Chlamydomon... 87 1e-15 UniRef50_Q4RTR8 Chromosome 2 SCAF14997, whole genome shotgun seq... 87 1e-15 UniRef50_Q8YTA2 Alr2822 protein n=11 Tax=Bacteria RepID=Q8YTA2_A... 87 1e-15 UniRef50_A9B607 von Willebrand factor type A n=6 Tax=Chloroflexi... 87 1e-15 UniRef50_C9RU69 von Willebrand factor type A n=2 Tax=Geobacillus... 87 1e-15 UniRef50_Q5UWJ9 Calcium-binding protein-like n=1 Tax=Haloarcula ... 87 1e-15 UniRef50_A9AUC9 von Willebrand factor type A n=1 Tax=Herpetosiph... 87 1e-15 UniRef50_A7C1J8 von Willebrand factor, type A n=1 Tax=Beggiatoa ... 87 1e-15 UniRef50_D1HBR9 Whole genome shotgun sequence of line PN40024, s... 87 1e-15 UniRef50_B3RWX4 Putative uncharacterized protein n=1 Tax=Trichop... 87 1e-15 UniRef50_UPI00016C38A3 LPXTG-motif cell wall anchor domain prote... 87 1e-15 UniRef50_A1ZDA1 OmpA family protein n=1 Tax=Microscilla marina A... 87 1e-15 UniRef50_Q99KC8 von Willebrand factor A domain-containing protei... 87 1e-15 UniRef50_Q04NS4 BatA n=4 Tax=Leptospira RepID=Q04NS4_LEPBJ 87 1e-15 UniRef50_Q66HV5 Zgc:92481 n=2 Tax=Danio rerio RepID=Q66HV5_DANRE 87 2e-15 UniRef50_A6G9E8 von Willebrand factor type A domain protein n=1 ... 86 2e-15 UniRef50_Q2JD81 von Willebrand factor, type A n=4 Tax=Frankineae... 86 2e-15 UniRef50_Q7MCW9 Uncharacterized protein n=2 Tax=Vibrio vulnificu... 86 2e-15 UniRef50_C7NN24 von Willebrand factor type A n=1 Tax=Halorhabdus... 86 2e-15 UniRef50_C1XH40 Mg-chelatase subunit ChlD n=1 Tax=Meiothermus ru... 86 2e-15 UniRef50_Q7S708 Predicted protein n=1 Tax=Neurospora crassa RepI... 86 2e-15 UniRef50_C7P2A9 von Willebrand factor type A n=2 Tax=Halobacteri... 86 2e-15 UniRef50_B3RWW7 Putative uncharacterized protein n=1 Tax=Trichop... 86 2e-15 UniRef50_C9YX20 Putative uncharacterized protein n=1 Tax=Strepto... 86 2e-15 UniRef50_C7NQ34 von Willebrand factor type A n=1 Tax=Halorhabdus... 86 2e-15 UniRef50_C8PMB0 BatA protein n=1 Tax=Treponema vincentii ATCC 35... 86 2e-15 UniRef50_A0B5M2 von Willebrand factor, type A n=1 Tax=Methanosae... 86 2e-15 UniRef50_UPI000180D155 PREDICTED: similar to integrin alpha Hr1 ... 86 3e-15 UniRef50_A7VF89 Putative uncharacterized protein n=1 Tax=Clostri... 86 3e-15 UniRef50_B1L6Y8 von Willebrand factor type A n=1 Tax=Candidatus ... 86 3e-15 UniRef50_Q2QZN5 Putative uncharacterized protein n=1 Tax=Oryza s... 86 3e-15 UniRef50_C6XTR0 von Willebrand factor type A n=1 Tax=Pedobacter ... 86 3e-15 UniRef50_C3XQQ6 Putative uncharacterized protein n=1 Tax=Branchi... 86 3e-15 UniRef50_A0BYA6 Chromosome undetermined scaffold_136, whole geno... 86 3e-15 UniRef50_Q3M1S2 von Willebrand factor, type A n=1 Tax=Anabaena v... 86 3e-15 UniRef50_UPI0001A2BB4D UPI0001A2BB4D related cluster n=1 Tax=Dan... 86 3e-15 UniRef50_Q0FYU3 Von Willebrand factor, type A n=1 Tax=Fulvimarin... 85 4e-15 UniRef50_Q1N642 Putative uncharacterized protein n=1 Tax=Bermane... 85 4e-15 UniRef50_UPI0001C31E2D von Willebrand factor type A n=1 Tax=Cone... 85 4e-15 UniRef50_A6YIP9 Capillary morphogenesis protein 2B n=14 Tax=Eute... 85 4e-15 UniRef50_A8DJP2 von Willebrand factor type A n=1 Tax=Candidatus ... 85 4e-15 UniRef50_D2RSW3 von Willebrand factor type A n=1 Tax=Haloterrige... 85 4e-15 UniRef50_UPI00004D9B6D UPI00004D9B6D related cluster n=2 Tax=Xen... 85 4e-15 UniRef50_Q1D0M2 BatA protein n=2 Tax=Cystobacterineae RepID=Q1D0... 85 5e-15 UniRef50_D0MZH7 Putative uncharacterized protein n=1 Tax=Phytoph... 85 5e-15 UniRef50_Q10ZP7 von Willebrand factor, type A n=1 Tax=Trichodesm... 85 5e-15 UniRef50_UPI0001925847 PREDICTED: similar to polydom n=1 Tax=Hyd... 85 5e-15 UniRef50_Q9UKK3 Poly [ADP-ribose] polymerase 4 n=14 Tax=Eutheria... 85 5e-15 UniRef50_A6DT52 Putative uncharacterized protein n=1 Tax=Lentisp... 85 5e-15 UniRef50_A8M5H1 von Willebrand factor type A n=13 Tax=Actinomyce... 85 6e-15 UniRef50_C6VVE4 von Willebrand factor type A n=1 Tax=Dyadobacter... 85 7e-15 UniRef50_C3ZT39 Putative uncharacterized protein n=1 Tax=Branchi... 85 7e-15 UniRef50_A6DT53 BatB protein n=1 Tax=Lentisphaera araneosa HTCC2... 85 7e-15 UniRef50_B4S8S0 Magnesium chelatase ATPase subunit D n=3 Tax=Chl... 85 7e-15 UniRef50_B0SI02 BatA n=2 Tax=Leptospira biflexa serovar Patoc Re... 85 7e-15 UniRef50_A2DPQ9 von Willebrand factor type A domain containing p... 85 7e-15 UniRef50_A2SQ24 von Willebrand factor, type A n=1 Tax=Methanocor... 84 8e-15 UniRef50_B3QTN9 Vault protein inter-alpha-trypsin domain protein... 84 8e-15 UniRef50_A9GV55 Putative secreted protein n=1 Tax=Sorangium cell... 84 8e-15 UniRef50_C0EZA4 Putative uncharacterized protein n=1 Tax=Eubacte... 84 9e-15 UniRef50_Q2SCZ7 Uncharacterized protein containing a von Willebr... 84 9e-15 UniRef50_D2UZF5 von Willebrand factor type A domain-containing p... 84 1e-14 UniRef50_C4ZKE8 von Willebrand factor type A n=2 Tax=Thauera sp.... 84 1e-14 UniRef50_B1X316 Putative uncharacterized protein n=1 Tax=Cyanoth... 84 1e-14 UniRef50_C7R9E3 von Willebrand factor type A n=1 Tax=Kangiella k... 84 1e-14 UniRef50_D1TTW6 von Willebrand factor type A domain protein n=10... 84 1e-14 UniRef50_C6Z299 Tellurium resistance protein n=1 Tax=Bacteroides... 84 1e-14 UniRef50_UPI0001C31EFE hypothetical protein Cwoe_4905 n=1 Tax=Co... 84 1e-14 UniRef50_C8W3F9 von Willebrand factor type A n=1 Tax=Desulfotoma... 83 1e-14 UniRef50_A8H5T6 von Willebrand factor type A n=5 Tax=Proteobacte... 83 1e-14 UniRef50_C6PWL5 von Willebrand factor type A n=3 Tax=Clostridium... 83 1e-14 UniRef50_C3WEJ2 BatB protein n=1 Tax=Fusobacterium mortiferum AT... 83 1e-14 UniRef50_A4BEC4 Putative uncharacterized protein n=1 Tax=Reineke... 83 1e-14 UniRef50_D1VLF3 von Willebrand factor type A n=1 Tax=Frankia sp.... 83 2e-14 UniRef50_B4RTV3 von Willebrand factor, type A n=20 Tax=Alteromon... 83 2e-14 UniRef50_D1IDZ7 Whole genome shotgun sequence of line PN40024, s... 83 2e-14 UniRef50_UPI0001BC3853 von Willebrand factor type A n=1 Tax=Buty... 83 2e-14 UniRef50_UPI000023F6A9 hypothetical protein FG10431.1 n=2 Tax=Gi... 83 2e-14 UniRef50_A2BM85 Conserved archaeal protein n=1 Tax=Hyperthermus ... 83 2e-14 UniRef50_Q47M48 von Willebrand factor, type A n=4 Tax=Streptospo... 83 2e-14 UniRef50_A7RVQ6 Predicted protein n=1 Tax=Nematostella vectensis... 83 2e-14 UniRef50_A7BVG3 von Willebrand factor type A domain protein n=1 ... 83 3e-14 UniRef50_B2A702 von Willebrand factor type A n=1 Tax=Natranaerob... 83 3e-14 UniRef50_Q1Q2F5 Putative uncharacterized protein n=1 Tax=Candida... 83 3e-14 UniRef50_A4ACS0 Magnesium-chelatase, 60 kDa subunit n=2 Tax=uncl... 82 3e-14 UniRef50_Q80UW6 Parp4 protein (Fragment) n=11 Tax=Eukaryota RepI... 82 3e-14 UniRef50_Q6A0B1 MKIAA0177 protein (Fragment) n=4 Tax=Murinae Rep... 82 3e-14 UniRef50_A8K7I4 Calcium-activated chloride channel regulator 1 n... 82 3e-14 UniRef50_D1XJQ4 von Willebrand factor type A n=2 Tax=Streptomyce... 82 3e-14 UniRef50_B3SBE5 Putative uncharacterized protein n=2 Tax=Trichop... 82 3e-14 UniRef50_B0EK65 Putative uncharacterized protein n=7 Tax=Entamoe... 82 3e-14 UniRef50_Q46D40 Putative uncharacterized protein n=3 Tax=Methano... 82 3e-14 UniRef50_C0ZJ85 Putative uncharacterized protein n=1 Tax=Breviba... 82 3e-14 UniRef50_A1SYF6 von Willebrand factor, type A n=1 Tax=Psychromon... 82 3e-14 UniRef50_B5JVB6 von Willebrand factor, type A n=1 Tax=gamma prot... 82 4e-14 UniRef50_UPI00016E1D58 UPI00016E1D58 related cluster n=1 Tax=Tak... 82 4e-14 UniRef50_C3ZZV2 Putative uncharacterized protein n=1 Tax=Branchi... 82 4e-14 UniRef50_UPI0001C161B1 von Willebrand factor, type A Precursor n... 82 4e-14 UniRef50_Q14CN2 Calcium-activated chloride channel regulator 4, ... 82 4e-14 UniRef50_A2AX52 Collagen alpha-4(VI) chain n=12 Tax=Chordata Rep... 82 4e-14 UniRef50_D2AUU0 Putative uncharacterized protein n=1 Tax=Strepto... 82 5e-14 UniRef50_A0BS51 Chromosome undetermined scaffold_124, whole geno... 82 5e-14 UniRef50_Q6AQK6 Conserved hypothetical membrane protein (BatA) n... 82 5e-14 UniRef50_B1J572 von Willebrand factor type A n=14 Tax=Pseudomona... 82 5e-14 UniRef50_B4D3H1 von Willebrand factor type A n=1 Tax=Chthoniobac... 82 5e-14 UniRef50_Q7JMF9 Protein T24F1.6b, partially confirmed by transcr... 82 5e-14 UniRef50_C0Z8R3 Hypothetical membrane protein n=1 Tax=Brevibacil... 81 5e-14 UniRef50_C8NJ92 Secreted Mg-chelatase subunit n=3 Tax=Corynebact... 81 5e-14 UniRef50_A8MJ77 Magnesium chelatase n=2 Tax=Clostridiales RepID=... 81 5e-14 UniRef50_C3XQR7 Putative uncharacterized protein n=1 Tax=Branchi... 81 6e-14 UniRef50_A1SAA4 Uncharacterized protein containing a von Willebr... 81 6e-14 UniRef50_A1VWQ4 von Willebrand factor, type A n=20 Tax=Proteobac... 81 6e-14 UniRef50_C1F7S6 Putative uncharacterized protein n=1 Tax=Acidoba... 81 6e-14 UniRef50_B8G7S2 Magnesium chelatase n=9 Tax=cellular organisms R... 81 6e-14 UniRef50_C1YR26 Uncharacterized protein containing a von Willebr... 81 7e-14 UniRef50_D2PMF8 von Willebrand factor type A n=1 Tax=Kribbella f... 81 7e-14 UniRef50_P12110 Collagen alpha-2(VI) chain n=30 Tax=Euteleostomi... 81 7e-14 UniRef50_UPI000180C65C PREDICTED: similar to calcium channel, vo... 81 7e-14 UniRef50_A5UW94 von Willebrand factor, type A n=2 Tax=Roseiflexu... 81 7e-14 UniRef50_A9BS02 von Willebrand factor type A n=1 Tax=Delftia aci... 81 8e-14 UniRef50_O50313 Magnesium-chelatase 67 kDa subunit n=15 Tax=Bact... 81 8e-14 UniRef50_C4N894 Complement factor B-like protein n=1 Tax=Venerup... 81 8e-14 UniRef50_A7NJ01 von Willebrand factor type A n=2 Tax=Roseiflexus... 81 8e-14 UniRef50_Q22UB9 von Willebrand factor type A domain containing p... 81 8e-14 UniRef50_A6NMZ7 Collagen alpha-6(VI) chain n=2 Tax=Theria RepID=... 81 8e-14 UniRef50_C9RJ63 von Willebrand factor type A n=1 Tax=Fibrobacter... 81 9e-14 UniRef50_C8XH18 von Willebrand factor type A n=1 Tax=Nakamurella... 81 9e-14 UniRef50_B9HP09 Predicted protein n=13 Tax=cellular organisms Re... 81 9e-14 UniRef50_D1JHM1 Putitive magnesium-chelatase subunit n=2 Tax=unc... 81 9e-14 UniRef50_UPI00016E1D1D UPI00016E1D1D related cluster n=9 Tax=Tet... 81 9e-14 UniRef50_A4BKH3 Putative uncharacterized protein n=1 Tax=Reineke... 81 9e-14 UniRef50_C3Y4Z7 Putative uncharacterized protein n=1 Tax=Branchi... 81 9e-14 UniRef50_A9IZC4 Cobalamin biosynthesis protein CobT n=29 Tax=Rhi... 81 1e-13 UniRef50_B3RP11 Putative uncharacterized protein n=1 Tax=Trichop... 81 1e-13 UniRef50_A0KNK1 Putative uncharacterized protein n=1 Tax=Aeromon... 81 1e-13 UniRef50_A2UW24 Von Willebrand factor, type A n=1 Tax=Shewanella... 81 1e-13 UniRef50_B7Q438 Neurogenic locus notch, putative n=1 Tax=Ixodes ... 81 1e-13 UniRef50_A4YDL0 von Willebrand factor, type A n=1 Tax=Metallosph... 81 1e-13 UniRef50_Q466I6 Putative uncharacterized protein n=3 Tax=Methano... 81 1e-13 UniRef50_Q3KGA0 Putative secreted protein, hemolysin n=1 Tax=Pse... 81 1e-13 UniRef50_Q8C6K9 Collagen alpha-6(VI) chain n=26 Tax=cellular org... 81 1e-13 UniRef50_A6DKL3 Putative uncharacterized protein n=1 Tax=Lentisp... 81 1e-13 UniRef50_B3RZ89 Putative uncharacterized protein n=1 Tax=Trichop... 80 1e-13 UniRef50_C9ZGQ6 Putative membrane protein n=3 Tax=Streptomyces R... 80 1e-13 UniRef50_C6PVI2 Vault protein inter-alpha-trypsin domain protein... 80 1e-13 UniRef50_B9XNU9 von Willebrand factor type A n=1 Tax=bacterium E... 80 1e-13 UniRef50_Q25545 Putative uncharacterized protein (Fragment) n=1 ... 80 1e-13 UniRef50_C7DHI4 von Willebrand factor type A n=1 Tax=Candidatus ... 80 1e-13 UniRef50_Q021L5 von Willebrand factor, type A n=1 Tax=Candidatus... 80 1e-13 UniRef50_Q6MJI4 Putative uncharacterized protein batB n=1 Tax=Bd... 80 1e-13 UniRef50_P33352 Uncharacterized protein yehP n=69 Tax=root RepID... 80 1e-13 UniRef50_C9RRF6 von Willebrand factor type A n=3 Tax=Bacteria Re... 80 1e-13 UniRef50_C3ZEP6 Putative uncharacterized protein n=2 Tax=Branchi... 80 1e-13 UniRef50_B9X084 Complement factor B n=5 Tax=Eumetazoa RepID=B9X0... 80 1e-13 UniRef50_UPI0000E488A7 PREDICTED: similar to Clca1 protein n=5 T... 80 1e-13 UniRef50_B0G4W3 Putative uncharacterized protein n=1 Tax=Dorea f... 80 1e-13 UniRef50_UPI000185CA78 von Willebrand factor type A n=1 Tax=Capn... 80 1e-13 UniRef50_D1YP15 von Willebrand factor type A domain protein n=1 ... 80 1e-13 UniRef50_B0NBU8 Putative uncharacterized protein n=1 Tax=Clostri... 80 2e-13 UniRef50_Q9ZQ46 Copia-like retroelement pol polyprotein n=6 Tax=... 80 2e-13 UniRef50_Q6VPP3 Parturition-related protein PRP3 n=6 Tax=Eutheri... 80 2e-13 UniRef50_Q6AQK5 Hypothetical membrane protein (BatB) n=1 Tax=Des... 80 2e-13 UniRef50_A4QDZ6 Putative uncharacterized protein n=1 Tax=Coryneb... 80 2e-13 UniRef50_C1EAC0 Predicted protein n=1 Tax=Micromonas sp. RCC299 ... 80 2e-13 UniRef50_C5S6K2 von Willebrand factor type A n=1 Tax=Allochromat... 80 2e-13 UniRef50_D1C680 ATPase associated with various cellular activiti... 80 2e-13 UniRef50_UPI0000EB12CB UPI0000EB12CB related cluster n=1 Tax=Can... 80 2e-13 UniRef50_C1XTM1 Uncharacterized protein containing a von Willebr... 80 2e-13 UniRef50_A8LLA0 von Willebrand factor type A domain protein n=7 ... 80 2e-13 UniRef50_Q74B80 Putative uncharacterized protein n=1 Tax=Geobact... 80 2e-13 UniRef50_Q22ML1 von Willebrand factor type A domain containing p... 80 2e-13 UniRef50_C0DBA5 Putative uncharacterized protein n=1 Tax=Clostri... 80 2e-13 UniRef50_A6FXN3 Putative uncharacterized protein n=1 Tax=Plesioc... 80 2e-13 UniRef50_Q1GMB7 von Willebrand factor type A n=3 Tax=Rhodobacter... 80 2e-13 UniRef50_C0QK10 Putative uncharacterized protein n=1 Tax=Desulfo... 80 2e-13 UniRef50_A3PUP3 von Willebrand factor, type A n=1 Tax=Mycobacter... 80 2e-13 UniRef50_C4PY90 Dihydropyridine-sensitive l-type calcium channel... 80 2e-13 UniRef50_A7RNR9 Predicted protein n=1 Tax=Nematostella vectensis... 80 2e-13 UniRef50_B0JR39 von Willebrand factor type A n=1 Tax=Microcystis... 80 2e-13 UniRef50_A7RPC2 Predicted protein (Fragment) n=2 Tax=Eumetazoa R... 80 2e-13 UniRef50_D1VKI5 von Willebrand factor type A n=1 Tax=Frankia sp.... 79 3e-13 UniRef50_D2V048 Predicted protein n=2 Tax=Naegleria gruberi RepI... 79 3e-13 UniRef50_Q5SKK6 Putative uncharacterized protein TTHA0637 n=4 Ta... 79 3e-13 UniRef50_C5YBL3 Putative uncharacterized protein Sb06g000656 n=1... 79 3e-13 UniRef50_O00339 Matrilin-2 n=30 Tax=Euteleostomi RepID=MATN2_HUMAN 79 3e-13 UniRef50_Q9VJM0 CG12455, isoform A n=9 Tax=Drosophila RepID=Q9VJ... 79 3e-13 UniRef50_Q5BIB2 RE14947p n=5 Tax=melanogaster subgroup RepID=Q5B... 79 3e-13 UniRef50_Q1N498 Putative uncharacterized protein n=1 Tax=Bermane... 79 3e-13 UniRef50_D0L4S2 von Willebrand factor type A n=1 Tax=Gordonia br... 79 3e-13 UniRef50_A7C4T6 von Willebrand factor type A domain protein n=2 ... 79 3e-13 UniRef50_Q502W6 von Willebrand factor A domain-containing protei... 79 3e-13 UniRef50_A1HT91 Von Willebrand factor, type A n=1 Tax=Thermosinu... 79 3e-13 UniRef50_B9SSC8 Protein binding protein, putative n=1 Tax=Ricinu... 79 3e-13 UniRef50_Q73PP7 Magnesium chelatase, subunit D/I family n=1 Tax=... 79 3e-13 UniRef50_Q4J9H5 Conserved protein n=2 Tax=Sulfolobus RepID=Q4J9H... 79 3e-13 UniRef50_C0NPX3 von Willebrand domain-containing protein n=5 Tax... 79 4e-13 UniRef50_B5ZN80 von Willebrand factor type A n=8 Tax=Rhizobiales... 79 4e-13 UniRef50_UPI0000F1FEC5 PREDICTED: similar to Clca1 protein n=2 T... 79 4e-13 UniRef50_Q97LT1 DnaK protein (Heat shock protein), C-terminal re... 79 4e-13 UniRef50_UPI0001760236 PREDICTED: similar to mCG140660 n=1 Tax=D... 79 4e-13 UniRef50_C8W898 Cna B domain protein n=1 Tax=Atopobium parvulum ... 79 4e-13 UniRef50_Q7MUE5 BatB protein n=4 Tax=Bacteria RepID=Q7MUE5_PORGI 78 4e-13 UniRef50_Q73NA5 BatA protein, putative n=1 Tax=Treponema dentico... 78 4e-13 UniRef50_C1F3F5 von Willebrand factor type A domain protein n=1 ... 78 4e-13 UniRef50_B5HZU2 VWA domain-containing protein n=1 Tax=Streptomyc... 78 5e-13 UniRef50_A6M139 von Willebrand factor, type A n=1 Tax=Clostridiu... 78 5e-13 UniRef50_Q4V1D8 Putative uncharacterized protein dadA n=8 Tax=Ba... 78 5e-13 UniRef50_UPI00006A1CCF polydom n=1 Tax=Xenopus (Silurana) tropic... 78 5e-13 UniRef50_B9ZQD1 von Willebrand factor type A n=1 Tax=Thioalkaliv... 78 5e-13 UniRef50_A8TX70 Collagen alpha-5(VI) chain n=18 Tax=Eutheria Rep... 78 6e-13 UniRef50_B9R4P7 von Willebrand factor type A domain protein n=1 ... 78 6e-13 UniRef50_D2S019 ATPase associated with various cellular activiti... 78 6e-13 UniRef50_A7RFK1 Predicted protein (Fragment) n=2 Tax=Nematostell... 78 6e-13 UniRef50_C0Z8R6 Putative uncharacterized protein n=1 Tax=Breviba... 78 6e-13 UniRef50_Q4RPQ3 Chromosome 12 SCAF15007, whole genome shotgun se... 78 6e-13 UniRef50_C3XUD0 Putative uncharacterized protein n=1 Tax=Branchi... 78 6e-13 UniRef50_C7DFN0 Magnesium chelatase ATPase subunit D n=1 Tax=Tha... 78 7e-13 UniRef50_C5FHM8 von Willebrand factor type A domain-containing p... 78 7e-13 UniRef50_A1AQS2 Protoporphyrin IX magnesium-chelatase n=1 Tax=Pe... 78 7e-13 UniRef50_B6VQ59 Protein F11C1.5d, partially confirmed by transcr... 78 7e-13 UniRef50_C5GPY5 von Willebrand domain-containing protein n=2 Tax... 78 7e-13 UniRef50_B8KT14 Magnesium-chelatase 60 kDa subunit n=1 Tax=gamma... 78 7e-13 UniRef50_C6JPK6 BatA protein n=2 Tax=Fusobacterium RepID=C6JPK6_... 78 8e-13 UniRef50_B5W3I3 von Willebrand factor type A n=4 Tax=Cyanobacter... 78 8e-13 UniRef50_A8LNP9 von Willebrand factor type A n=3 Tax=Rhodobacter... 78 8e-13 UniRef50_B6HQM8 Pc22g11730 protein n=17 Tax=Leotiomyceta RepID=B... 78 8e-13 UniRef50_C0AYK3 Putative uncharacterized protein n=1 Tax=Proteus... 78 9e-13 UniRef50_A0LFJ6 Protoporphyrin IX magnesium-chelatase n=1 Tax=Sy... 78 9e-13 UniRef50_A8IJ40 Predicted protein n=1 Tax=Chlamydomonas reinhard... 78 9e-13 UniRef50_UPI0000D9E789 PREDICTED: similar to poly (ADP-ribose) p... 77 1e-12 UniRef50_Q3M1S4 von Willebrand factor, type A n=8 Tax=Cyanobacte... 77 1e-12 UniRef50_UPI0000E47896 PREDICTED: similar to cache domain contai... 77 1e-12 UniRef50_A9B2Y1 VWA containing CoxE family protein n=4 Tax=Bacte... 77 1e-12 UniRef50_P58335-2 Isoform 2 of Anthrax toxin receptor 2 n=3 Tax=... 77 1e-12 UniRef50_O76836 Putative uncharacterized protein n=4 Tax=Caenorh... 77 1e-12 UniRef50_Q4SJ97 Chromosome 4 SCAF14575, whole genome shotgun seq... 77 1e-12 UniRef50_Q0I303 Putative uncharacterized protein n=5 Tax=Pasteur... 77 1e-12 UniRef50_Q08X07 Von Willebrand factor type A domain protein n=2 ... 77 1e-12 UniRef50_UPI000180C2AF PREDICTED: similar to FiBrilliN homolog f... 77 1e-12 UniRef50_UPI00017B4DF5 UPI00017B4DF5 related cluster n=3 Tax=Tet... 77 1e-12 UniRef50_C1F3L6 von Willebrand factor type A domain protein n=1 ... 77 1e-12 UniRef50_UPI0001788256 von Willebrand factor type A n=1 Tax=Geob... 77 1e-12 UniRef50_Q19NM5 TerY1 n=54 Tax=root RepID=Q19NM5_ECOK1 77 1e-12 UniRef50_A9GY82 Putative uncharacterized protein n=1 Tax=Sorangi... 77 1e-12 UniRef50_C0CX78 Putative uncharacterized protein n=1 Tax=Clostri... 77 1e-12 UniRef50_UPI0001BC5690 magnesium chelatase n=1 Tax=Fusobacterium... 77 1e-12 UniRef50_A3JLW1 Putative uncharacterized protein n=1 Tax=Rhodoba... 77 2e-12 UniRef50_UPI0000F2E695 PREDICTED: hypothetical protein n=1 Tax=M... 77 2e-12 UniRef50_A9UVU8 Predicted protein n=1 Tax=Monosiga brevicollis R... 77 2e-12 UniRef50_UPI000180B9AB PREDICTED: similar to CLCA family member ... 76 2e-12 UniRef50_Q22G03 Putative uncharacterized protein n=1 Tax=Tetrahy... 76 2e-12 UniRef50_B2B639 Predicted CDS Pa_2_6630 n=1 Tax=Podospora anseri... 76 2e-12 UniRef50_UPI000180C3AB PREDICTED: similar to polydomain protein-... 76 2e-12 UniRef50_C8XDP5 von Willebrand factor type A n=1 Tax=Nakamurella... 76 2e-12 UniRef50_C3YH52 Putative uncharacterized protein n=1 Tax=Branchi... 76 2e-12 UniRef50_UPI0001AF2DA9 hypothetical protein SrosN1_23653 n=1 Tax... 76 2e-12 UniRef50_UPI00006A02BA UPI00006A02BA related cluster n=1 Tax=Xen... 76 2e-12 UniRef50_B5YKY5 Magnesium-chelatase subunit ChlD n=1 Tax=Thermod... 76 2e-12 UniRef50_UPI000180B353 PREDICTED: similar to putative calcium ac... 76 2e-12 UniRef50_Q4TBC0 Chromosome undetermined SCAF7164, whole genome s... 76 2e-12 UniRef50_C3YUG3 Putative uncharacterized protein n=1 Tax=Branchi... 76 2e-12 UniRef50_Q5NWS3 Tellurium resistance protein n=2 Tax=Proteobacte... 76 2e-12 UniRef50_Q2BFM3 Possible D-amino acid dehydrogenase, large subun... 76 2e-12 UniRef50_Q01T75 von Willebrand factor, type A n=2 Tax=Candidatus... 76 2e-12 UniRef50_Q2GWQ0 Putative uncharacterized protein n=1 Tax=Chaetom... 76 2e-12 UniRef50_B2UZB2 von Willebrand factor type A domain protein n=1 ... 76 3e-12 UniRef50_Q7XTB9 OSJNBa0068L06.4 protein n=7 Tax=Oryza sativa Rep... 76 3e-12 UniRef50_Q055Y9 Putative uncharacterized protein n=4 Tax=Leptosp... 76 3e-12 UniRef50_UPI0000E80A5E PREDICTED: similar to calcium-activated c... 76 3e-12 UniRef50_C6X1I4 BatB n=2 Tax=Flavobacteriaceae RepID=C6X1I4_FLAB3 76 3e-12 UniRef50_A8TM27 Putative Ser protein kinase n=1 Tax=alpha proteo... 76 3e-12 UniRef50_D2QI99 von Willebrand factor type A n=1 Tax=Spirosoma l... 76 3e-12 UniRef50_UPI0001789223 von Willebrand factor type A n=1 Tax=Geob... 76 3e-12 UniRef50_UPI00016C09D7 hypothetical protein Epulo_01596 n=1 Tax=... 76 3e-12 UniRef50_UPI00006A1B4F Collagen alpha-3(VI) chain precursor. n=1... 76 3e-12 UniRef50_A7SFM5 Predicted protein n=1 Tax=Nematostella vectensis... 76 3e-12 UniRef50_C0GKG1 von Willebrand factor type A n=1 Tax=Dethiobacte... 76 3e-12 UniRef50_C3YP68 Putative uncharacterized protein n=1 Tax=Branchi... 76 3e-12 UniRef50_UPI000180D2ED PREDICTED: similar to predicted protein n... 75 4e-12 UniRef50_D0KWQ3 von Willebrand factor type A n=1 Tax=Halothiobac... 75 4e-12 UniRef50_B9XQJ6 von Willebrand factor type A n=1 Tax=bacterium E... 75 4e-12 UniRef50_A9YRX2 Hedgling n=1 Tax=Amphimedon queenslandica RepID=... 75 4e-12 UniRef50_Q1LYI5 Novel collagen protein n=2 Tax=Danio rerio RepID... 75 4e-12 UniRef50_C5E9N8 von Willebrand factor type A n=4 Tax=Bifidobacte... 75 4e-12 UniRef50_Q56BS9 Putative uncharacterized protein n=1 Tax=Enterob... 75 4e-12 UniRef50_Q023A9 von Willebrand factor, type A n=1 Tax=Candidatus... 75 4e-12 UniRef50_Q9SJE1 Magnesium-chelatase subunit chlD, chloroplastic ... 75 4e-12 UniRef50_A5UXM2 von Willebrand factor, type A n=1 Tax=Roseiflexu... 75 4e-12 UniRef50_B7Q412 Putative uncharacterized protein n=1 Tax=Ixodes ... 75 4e-12 UniRef50_B3RUM1 Putative uncharacterized protein n=1 Tax=Trichop... 75 4e-12 UniRef50_Q1INP4 von Willebrand factor, type A n=1 Tax=Candidatus... 75 4e-12 UniRef50_B0S9S4 Putative uncharacterized protein n=2 Tax=Leptosp... 75 4e-12 UniRef50_Q1ILA5 von Willebrand factor, type A n=1 Tax=Candidatus... 75 5e-12 UniRef50_Q5LCG5 Aerotolerance-related membrane protein n=25 Tax=... 75 5e-12 UniRef50_C9L3E2 BatB protein n=10 Tax=Bacteroidales RepID=C9L3E2... 75 5e-12 UniRef50_Q02A45 von Willebrand factor, type A n=1 Tax=Candidatus... 75 5e-12 UniRef50_Q498S9 LOC498793 protein (Fragment) n=11 Tax=Euteleosto... 75 5e-12 UniRef50_C6QKH4 von Willebrand factor type A n=1 Tax=Geobacillus... 75 5e-12 UniRef50_B3EP84 von Willebrand factor type A n=2 Tax=Chlorobiace... 75 5e-12 UniRef50_C1XZC4 Uncharacterized conserved protein n=2 Tax=Meioth... 75 6e-12 UniRef50_UPI0001B52F00 von Willebrand factor type A n=1 Tax=Fuso... 75 6e-12 UniRef50_Q9XAH6 Putative uncharacterized protein SCO6688 n=2 Tax... 75 6e-12 UniRef50_C1TQB2 Uncharacterized protein n=1 Tax=Dethiosulfovibri... 75 6e-12 UniRef50_A7RFD8 Predicted protein (Fragment) n=1 Tax=Nematostell... 75 6e-12 UniRef50_C4LHS9 Magnesium-chelatase subunit D n=1 Tax=Corynebact... 75 6e-12 UniRef50_A7SIA9 Predicted protein n=2 Tax=Nematostella vectensis... 75 6e-12 UniRef50_C3ZCZ5 Putative uncharacterized protein (Fragment) n=1 ... 75 6e-12 UniRef50_UPI000155D2F0 PREDICTED: similar to matrilin-3, partial... 75 7e-12 UniRef50_Q5NIW0 Matrilin 3b n=19 Tax=Clupeocephala RepID=Q5NIW0_... 75 7e-12 UniRef50_Q4SNW2 Chromosome 15 SCAF14542, whole genome shotgun se... 75 7e-12 UniRef50_A5GIB9 Protoporphyrin IX Mg-chelatase subunit ChlD n=22... 75 7e-12 UniRef50_D0L145 von Willebrand factor type A n=3 Tax=Gammaproteo... 75 7e-12 UniRef50_C6VU79 Sigma 54 interacting domain protein n=1 Tax=Dyad... 75 7e-12 UniRef50_A1ANG2 Protoporphyrin IX magnesium-chelatase n=6 Tax=Ba... 75 7e-12 UniRef50_C0BF89 Putative uncharacterized protein n=1 Tax=Coproco... 75 7e-12 UniRef50_P76396 Uncharacterized protein yegL n=64 Tax=cellular o... 75 8e-12 UniRef50_UPI00006A1B4A Collagen alpha-3(VI) chain precursor. n=5... 74 8e-12 UniRef50_C3XQR8 Putative uncharacterized protein n=1 Tax=Branchi... 74 8e-12 UniRef50_Q01RP7 von Willebrand factor, type A n=1 Tax=Candidatus... 74 9e-12 UniRef50_A9AV55 von Willebrand factor type A n=2 Tax=Bacteria Re... 74 9e-12 UniRef50_B4D6B0 von Willebrand factor type A n=1 Tax=Chthoniobac... 74 9e-12 UniRef50_D2LNF7 von Willebrand factor type A n=3 Tax=Aciduliprof... 74 9e-12 UniRef50_C5BKN1 Matrixin family protein n=1 Tax=Teredinibacter t... 74 9e-12 UniRef50_Q01W27 von Willebrand factor, type A n=1 Tax=Candidatus... 74 1e-11 UniRef50_B6B3S7 Magnesium chelatase ATPase subunit D n=1 Tax=Rho... 74 1e-11 UniRef50_C3ZY18 Putative uncharacterized protein n=1 Tax=Branchi... 74 1e-11 UniRef50_B7FTA2 Predicted protein n=3 Tax=Bacillariophyta RepID=... 74 1e-11 UniRef50_A5UT43 von Willebrand factor, type A n=1 Tax=Roseiflexu... 74 1e-11 UniRef50_UPI000155C0BC PREDICTED: hypothetical protein n=1 Tax=O... 74 1e-11 UniRef50_A9B6J8 von Willebrand factor type A n=1 Tax=Herpetosiph... 74 1e-11 UniRef50_C3ZZV3 Putative uncharacterized protein (Fragment) n=1 ... 74 1e-11 UniRef50_A1S120 von Willebrand factor, type A n=1 Tax=Thermofilu... 74 1e-11 UniRef50_Q2J8W6 von Willebrand factor, type A n=2 Tax=Actinomyce... 74 1e-11 UniRef50_A1BJ62 von Willebrand factor, type A n=1 Tax=Chlorobium... 74 1e-11 UniRef50_Q2FLV5 Protoporphyrin IX magnesium-chelatase n=1 Tax=Me... 74 1e-11 UniRef50_C3YY34 Putative uncharacterized protein (Fragment) n=2 ... 74 1e-11 UniRef50_C3B4T5 D-amino acid dehydrogenase, large subunit n=3 Ta... 74 1e-11 UniRef50_A3ZTC3 Putative uncharacterized protein n=1 Tax=Blastop... 73 1e-11 Sequences not found previously or not previously below threshold: UniRef50_A4IZW2 Conserved membrane protein with von Willebrand f... 99 4e-19 UniRef50_C0YPQ5 von Willebrand factor(VWA) type A domain-contain... 96 3e-18 UniRef50_C3WEJ1 BatA protein n=3 Tax=Fusobacterium RepID=C3WEJ1_... 95 6e-18 UniRef50_D2A5P0 Putative uncharacterized protein GLEAN_15569 n=5... 94 8e-18 UniRef50_A6VW30 von Willebrand factor type A n=2 Tax=Marinomonas... 93 2e-17 UniRef50_UPI000175837C PREDICTED: similar to inter-alpha-trypsin... 91 6e-17 UniRef50_UPI00015B5333 PREDICTED: similar to ENSANGP00000021218 ... 90 2e-16 UniRef50_B9XFD0 von Willebrand factor type A n=1 Tax=bacterium E... 88 5e-16 UniRef50_C0QK11 Putative uncharacterized protein n=1 Tax=Desulfo... 88 9e-16 UniRef50_B8A860 Putative uncharacterized protein n=1 Tax=Oryza s... 87 1e-15 UniRef50_A0YCE1 BatB protein, putative n=7 Tax=Proteobacteria Re... 87 1e-15 UniRef50_A6DS47 Putative uncharacterized protein n=1 Tax=Lentisp... 86 2e-15 UniRef50_A9A1M2 von Willebrand factor type A n=2 Tax=Thaumarchae... 86 2e-15 UniRef50_A7IDL0 von Willebrand factor type A n=3 Tax=Alphaproteo... 86 3e-15 UniRef50_Q6MJI3 Putative uncharacterized protein batA n=1 Tax=Bd... 86 3e-15 UniRef50_Q2R0C5 von Willebrand factor type A domain containing p... 85 3e-15 UniRef50_C4LFG4 von Willebrand factor type A n=1 Tax=Tolumonas a... 85 3e-15 UniRef50_B1GZR4 Aerotolerance-related cytoplasmic membrane prote... 85 4e-15 UniRef50_Q2R0C4 Expressed protein n=2 Tax=Oryza sativa Japonica ... 85 4e-15 UniRef50_B8CM90 Von Willebrand factor type A domain protein n=28... 85 5e-15 UniRef50_A1ZQV4 Von Willebrand factor type A domain protein n=1 ... 85 6e-15 UniRef50_Q96P44 Collagen alpha-1(XXI) chain n=35 Tax=Euteleostom... 84 1e-14 UniRef50_B2W982 von Willebrand domain containing protein n=1 Tax... 84 1e-14 UniRef50_Q4S5G0 Chromosome 19 SCAF14731, whole genome shotgun se... 83 1e-14 UniRef50_Q47ZS8 Von Willebrand factor type A domain protein n=1 ... 83 2e-14 UniRef50_UPI000186D1CC conserved hypothetical protein n=1 Tax=Pe... 83 2e-14 UniRef50_C3XPW9 Putative uncharacterized protein n=1 Tax=Branchi... 83 3e-14 UniRef50_UPI0001AEBBE9 von Willebrand factor, type A n=1 Tax=Alt... 81 6e-14 UniRef50_Q8T5C2 Proximal thread matrix protein 1 n=3 Tax=Mytilus... 81 7e-14 UniRef50_UPI0001791B4B PREDICTED: similar to AGAP005490-PA n=1 T... 80 1e-13 UniRef50_D2I4K8 Putative uncharacterized protein n=1 Tax=Ailurop... 80 1e-13 UniRef50_B2UMP0 von Willebrand factor type A n=1 Tax=Akkermansia... 80 1e-13 UniRef50_D1N7V1 von Willebrand factor type A n=1 Tax=Victivallis... 80 1e-13 UniRef50_A3UND6 Von Willebrand factor type A domain protein n=1 ... 80 1e-13 UniRef50_C3Y983 Putative uncharacterized protein n=3 Tax=Branchi... 80 2e-13 UniRef50_A8H3C5 von Willebrand factor type A n=3 Tax=Gammaproteo... 80 2e-13 UniRef50_Q30SU6 von Willebrand factor, type A n=3 Tax=Campylobac... 80 2e-13 UniRef50_Q32NR2 MGC130922 protein n=3 Tax=Tetrapoda RepID=Q32NR2... 80 2e-13 UniRef50_P70960 Uncharacterized protein ywmC n=5 Tax=Bacillus Re... 79 3e-13 UniRef50_Q7UML1 BatA n=3 Tax=Planctomycetaceae RepID=Q7UML1_RHOBA 79 3e-13 UniRef50_UPI000155BCFC PREDICTED: similar to hCG1812043, partial... 79 3e-13 UniRef50_B1ZS79 von Willebrand factor type A n=1 Tax=Opitutus te... 79 4e-13 UniRef50_C9Q197 Aerotolerance protein BatB n=11 Tax=Prevotella R... 79 4e-13 UniRef50_UPI0001745E24 hypothetical protein VspiD_17015 n=1 Tax=... 78 4e-13 UniRef50_Q9BPQ8 Integrin alpha Hr1 n=1 Tax=Halocynthia roretzi R... 78 6e-13 UniRef50_C3Z863 Putative uncharacterized protein (Fragment) n=2 ... 78 7e-13 UniRef50_A0YGD2 Von Willebrand factor type A domain protein n=4 ... 78 7e-13 UniRef50_A1ZJV3 Von Willebrand factor, type A, putative n=1 Tax=... 78 8e-13 UniRef50_Q3APD9 von Willebrand factor, type A n=1 Tax=Chlorobium... 78 9e-13 UniRef50_D2R6V4 von Willebrand factor type A n=1 Tax=Pirellula s... 77 1e-12 UniRef50_UPI0001926619 PREDICTED: similar to calcium channel, vo... 77 1e-12 UniRef50_UPI0000584198 PREDICTED: similar to polydom protein n=1... 77 1e-12 UniRef50_P06681 Complement C2a fragment n=39 Tax=Tetrapoda RepID... 77 1e-12 UniRef50_P20702 Integrin alpha-X n=58 Tax=Theria RepID=ITAX_HUMAN 77 1e-12 UniRef50_A3ZZR7 Putative uncharacterized protein n=1 Tax=Blastop... 77 2e-12 UniRef50_UPI00016E6A6D UPI00016E6A6D related cluster n=2 Tax=Tak... 77 2e-12 UniRef50_P21941 Cartilage matrix protein n=35 Tax=Euteleostomi R... 76 2e-12 UniRef50_B3DM79 LOC100170623 protein n=1 Tax=Xenopus (Silurana) ... 76 2e-12 UniRef50_A6DST2 Putative uncharacterized protein n=1 Tax=Lentisp... 76 2e-12 UniRef50_UPI00005A0FAD PREDICTED: similar to Integrin alpha-M pr... 76 2e-12 UniRef50_P15988 Collagen alpha-2(VI) chain n=16 Tax=Euteleostomi... 76 2e-12 UniRef50_A7RT18 Predicted protein (Fragment) n=1 Tax=Nematostell... 76 2e-12 UniRef50_UPI00006A0418 Poly [ADP-ribose] polymerase 4 (EC 2.4.2.... 76 2e-12 UniRef50_UPI0000521DAC PREDICTED: similar to Collagen alpha-1(XI... 76 2e-12 UniRef50_Q1Q3X1 Putative uncharacterized protein n=1 Tax=Candida... 76 2e-12 UniRef50_Q31JH9 Putative uncharacterized protein n=1 Tax=Thiomic... 76 2e-12 UniRef50_C3ZCS4 Putative uncharacterized protein n=1 Tax=Branchi... 76 2e-12 UniRef50_A9B368 Conserved hypothetical membrane protein n=1 Tax=... 76 2e-12 UniRef50_Q1LEM9 von Willebrand factor, type A n=2 Tax=Burkholder... 76 2e-12 UniRef50_UPI0000EB1CF0 UPI0000EB1CF0 related cluster n=1 Tax=Can... 76 2e-12 UniRef50_B2KDS8 von Willebrand factor type A n=1 Tax=Elusimicrob... 76 3e-12 UniRef50_A0C3V7 Chromosome undetermined scaffold_148, whole geno... 76 3e-12 UniRef50_C4DP19 von Willebrand factor type A-like protein n=1 Ta... 76 3e-12 UniRef50_UPI000194D9FE PREDICTED: similar to matrilin 4 n=2 Tax=... 76 3e-12 UniRef50_Q0KB82 von Willebrand factor (VWF) type A domain n=7 Ta... 76 3e-12 UniRef50_B9XQJ5 von Willebrand factor type A n=1 Tax=bacterium E... 76 3e-12 UniRef50_A6G2R7 Aerotolerance-related membrane protein n=1 Tax=P... 75 4e-12 UniRef50_P12111 Collagen alpha-3(VI) chain n=60 Tax=Eumetazoa Re... 75 4e-12 UniRef50_B8FBV4 von Willebrand factor type A n=1 Tax=Desulfatiba... 75 4e-12 UniRef50_A0Z8M6 Von Willebrand factor type A domain protein n=1 ... 75 4e-12 UniRef50_C3JJF1 Putative von Willebrand factor, type A n=1 Tax=R... 75 4e-12 UniRef50_A8SY40 Putative uncharacterized protein n=1 Tax=Coproco... 75 4e-12 UniRef50_B4Q6G9 GD21946 n=2 Tax=Drosophila RepID=B4Q6G9_DROSI 75 4e-12 UniRef50_UPI0000F2E846 PREDICTED: similar to ITI-like protein, p... 75 5e-12 UniRef50_D2QQW6 von Willebrand factor type A n=1 Tax=Spirosoma l... 75 5e-12 UniRef50_O95460 Matrilin-4 n=32 Tax=Amniota RepID=MATN4_HUMAN 75 5e-12 UniRef50_UPI0000DB712F PREDICTED: similar to c12.2 CG12149-PA is... 75 5e-12 UniRef50_Q4S2X7 Chromosome 8 SCAF14759, whole genome shotgun seq... 75 6e-12 UniRef50_A7SV43 Predicted protein n=5 Tax=Nematostella vectensis... 75 6e-12 UniRef50_C7PL60 von Willebrand factor type A n=1 Tax=Chitinophag... 75 7e-12 UniRef50_A0NX68 Von Willebrand factor type A domain protein n=1 ... 75 7e-12 UniRef50_C3PEQ4 Putative membrane protein n=2 Tax=Corynebacteriu... 75 7e-12 UniRef50_UPI0000EB3C0A UPI0000EB3C0A related cluster n=1 Tax=Can... 75 8e-12 UniRef50_D2A0T3 Putative uncharacterized protein GLEAN_08265 n=2... 74 8e-12 UniRef50_A8DZ06 CG4587, isoform C n=21 Tax=Neoptera RepID=A8DZ06... 74 8e-12 UniRef50_B4AJ60 YwmD n=2 Tax=Bacillus pumilus RepID=B4AJ60_BACPU 74 8e-12 UniRef50_UPI000180D0B0 PREDICTED: similar to von Willebrand fact... 74 9e-12 UniRef50_UPI000180BCC9 PREDICTED: similar to calcium activated c... 74 9e-12 UniRef50_Q5NJK1 Matrilin-3a n=5 Tax=Danio rerio RepID=Q5NJK1_DANRE 74 9e-12 UniRef50_C5S895 von Willebrand factor type A n=1 Tax=Allochromat... 74 1e-11 UniRef50_UPI0001792F00 PREDICTED: similar to AGAP009579-PA n=1 T... 74 1e-11 UniRef50_D2V397 Predicted protein n=1 Tax=Naegleria gruberi RepI... 74 1e-11 UniRef50_UPI0000E46E0F PREDICTED: similar to LOC594926 protein n... 74 1e-11 UniRef50_UPI0000E49761 PREDICTED: similar to mKIAA0177 protein n... 74 1e-11 UniRef50_A5UXL9 Peptidase M23B n=2 Tax=Roseiflexus RepID=A5UXL9_... 74 1e-11 UniRef50_UPI00016C44B4 BatA n=2 Tax=Gemmata obscuriglobus UQM 22... 74 1e-11 >UniRef50_Q1C7U5 UPF0229 protein YPA_1510 n=195 Tax=Proteobacteria RepID=Y1510_YERPA Length = 424 Score = 291 bits (744), Expect = 4e-77, Method: Composition-based stats. Identities = 363/421 (86%), Positives = 394/421 (93%), Gaps = 1/421 (0%) Query: 1 MTWFIDRRLNGKNKSMVNRQRFLRRYKAQIKQSISEAINKRSVTDVDSGESVSIPTEDIS 60 M +FIDRRLNGKNKSMVNRQRFLRRYK+QIKQSI++AINKRSVTD++SGESVSIP +DI+ Sbjct: 1 MGYFIDRRLNGKNKSMVNRQRFLRRYKSQIKQSIADAINKRSVTDIESGESVSIPIDDIN 60 Query: 61 EPMFHQGRGGLRHRVHPGNDHFVQNDRIERPQGGGGGSGSGQGQASQDGEGQDEFVFQIS 120 EPMFHQG GGLRHRVHPGNDHF+ NDR++RP GGGG GSGQG A +DGEG+DEFVFQIS Sbjct: 61 EPMFHQGNGGLRHRVHPGNDHFITNDRVDRP-QGGGGGGSGQGNAGKDGEGEDEFVFQIS 119 Query: 121 KDEYLDLLFEDLALPNLKQNQQRQLTEYKTHRAGYTANGVPANISVVRSLQNSLARRTAM 180 KDEYLDLLFEDLALPNLK+NQ +QL E+KTHRAGYT+NGVPANISVVRSLQNSLARRTAM Sbjct: 120 KDEYLDLLFEDLALPNLKRNQYKQLAEFKTHRAGYTSNGVPANISVVRSLQNSLARRTAM 179 Query: 181 TAGKRRELHALEENLAIISNSEPAQLLEEERLRKEIAELRAKIERVPFIDTFDLRYKNYE 240 TA KRREL LE L ++ NSEPAQLLEEERLRK I EL+ KI RVPFIDTFDLRYKNYE Sbjct: 180 TASKRRELRELEAALTVLENSEPAQLLEEERLRKAITELKQKIARVPFIDTFDLRYKNYE 239 Query: 241 KRPDPSSQAVMFCLMDVSGSMDQSTKDMAKRFYILLYLFLSRTYKNVEVVYIRHHTQAKE 300 +RP+PSSQAVMFCLMDVSGSMDQ+TKDMAKRFYILLYLFLSRTYKNV+VVYIRHHTQAKE Sbjct: 240 RRPEPSSQAVMFCLMDVSGSMDQATKDMAKRFYILLYLFLSRTYKNVDVVYIRHHTQAKE 299 Query: 301 VDEHEFFYSQETGGTIVSSALKLMDEVVKERYNPAQWNIYAAQASDGDNWADDSPLCHEI 360 VDE EFFYSQETGGTIVSSALKLMDEVV+ERYNPAQWNIYAAQASDGDNWADDSPLCHE+ Sbjct: 300 VDEQEFFYSQETGGTIVSSALKLMDEVVQERYNPAQWNIYAAQASDGDNWADDSPLCHEL 359 Query: 361 LAKKLLPVVRYYSYIEITRRAHQTLWREYEHLQSTFDNFAMQHIRDQDDIYPVFRELFHK 420 LAKK+LPVVRYYSYIEITRRAHQTLWREYE L+ FDNFA+QHIR+ +DIYPVFRELFHK Sbjct: 360 LAKKILPVVRYYSYIEITRRAHQTLWREYEDLEEKFDNFAIQHIREPEDIYPVFRELFHK 419 Query: 421 Q 421 Q Sbjct: 420 Q 420 >UniRef50_Q1QZ69 UPF0229 protein Csal_0882 n=164 Tax=Proteobacteria RepID=Y882_CHRSD Length = 426 Score = 254 bits (648), Expect = 6e-66, Method: Composition-based stats. Identities = 238/427 (55%), Positives = 310/427 (72%), Gaps = 4/427 (0%) Query: 1 MTWFIDRRLNGKNKSMVNRQRFLRRYKAQIKQSISEAINKRSVTDVDSGESVSIPTEDIS 60 MT+FIDRR N KNKS VNRQRFL+RY++ IK+++ EA+N+RS+TD++ GE +SIP +DIS Sbjct: 1 MTYFIDRRANAKNKSAVNRQRFLQRYRSHIKRAVEEAVNRRSITDMERGEKISIPAKDIS 60 Query: 61 EPMFHQGRGGLRHRVHPGNDHFVQNDRIERPQGGGGGSGSGQGQASQDGEGQDEFVFQIS 120 EP+F G GG R V PGN FV+ DR+ R GG G GSG+G AS GEG DEF F +S Sbjct: 61 EPVFQHGPGGARTIVSPGNKEFVEGDRLRR-PGGEGRGGSGEGSASNQGEGMDEFAFSLS 119 Query: 121 KDEYLDLLFEDLALPNLKQNQQRQLTEYKTHRAGYTANGVPANISVVRSLQNSLARRTAM 180 ++E+LD +F+ LALP+L++ Q R L E + RAG T +GVP+ I++VRS++ + ARR M Sbjct: 120 REEFLDFVFDGLALPHLERKQLRDLDEVRPVRAGVTRDGVPSRINIVRSMREAQARRIGM 179 Query: 181 TAGKRRELHALEENLAIISNSEP--AQLLEEERLRKEIAELRAKIERVPFIDTFDLRYKN 238 A +R L EE L +P L+ EI L ++E VPFIDT+DLRY N Sbjct: 180 RAPIKRALREAEEALESEERKDPVLRNPARIGELKAEIERLEKRLEAVPFIDTYDLRYNN 239 Query: 239 YEKRPDPSSQAVMFCLMDVSGSMDQSTKDMAKRFYILLYLFLSRTYKNVEVVYIRHHTQA 298 +P PS++AVMFC+MDVSGSM Q KD+AKRF++LLYLFL R Y+ VE+V+IRHHT A Sbjct: 240 LIDQPQPSNKAVMFCVMDVSGSMTQGHKDIAKRFFLLLYLFLERNYEKVELVFIRHHTAA 299 Query: 299 KEVDEHEFFYSQETGGTIVSSALKLMDEVVKERYNPAQWNIYAAQASDGDNWADDSPLCH 358 KEVDE EFFYS+ETGGTIVSSAL L+DE++ +RY+PAQWN+Y AQASDGDNW DDS C Sbjct: 300 KEVDEEEFFYSRETGGTIVSSALTLVDEIIAKRYSPAQWNLYVAQASDGDNWDDDSLTCR 359 Query: 359 EILAKKLLPVVRYYSYIEITRRAHQTLWREYEHLQSTF-DNFAMQHIRDQDDIYPVFREL 417 ++L L+ ++YY+Y+EIT +HQ LW EYE +Q+ FAMQ I + DIYPVFR+L Sbjct: 360 DLLMTSLMAKLQYYTYVEITPHSHQALWEEYERVQAAHPSRFAMQQIVEPGDIYPVFRKL 419 Query: 418 FHKQNAT 424 F K+ A+ Sbjct: 420 FRKRVAS 426 >UniRef50_Q1QPM0 UPF0229 protein Nham_0975 n=24 Tax=Proteobacteria RepID=Y975_NITHX Length = 439 Score = 253 bits (646), Expect = 9e-66, Method: Composition-based stats. Identities = 181/439 (41%), Positives = 269/439 (61%), Gaps = 18/439 (4%) Query: 1 MTWFIDRRLNGKNKSMVNRQRFLRRYKAQIKQSISEAINKRSVTDVDSGESVSIPTEDIS 60 M FIDRRLN K++S+ NRQRFLRR + ++K+SI + + ++D D ++VSIPT Sbjct: 1 MPIFIDRRLNPKDRSLGNRQRFLRRAREELKRSIRDRVRSGRISDADGEQAVSIPTRSTD 60 Query: 61 EPMFHQGRG-GLRHRVHPGNDHFVQNDRIERPQGGGGGSGSGQGQASQDGEGQDEFVFQI 119 EP F + G R V PGN HFV DR+ +P G G+ + +D+F F + Sbjct: 61 EPRFEAAKDSGRREHVLPGNKHFVPGDRLRKP-----GHGAAGTPDPSMKDSEDDFRFVL 115 Query: 120 SKDEYLDLLFEDLALPNLKQNQQRQLTEYKTHRAGYTANGVPANISVVRSLQNSLARRTA 179 S++E LDL FEDL LP++ + +++ ++ RAG+ A G P NI+V R+++NS RR A Sbjct: 116 SREEVLDLFFEDLELPDMVKLSLKEILAFRPRRAGFAATGSPTNINVGRTMRNSYGRRIA 175 Query: 180 MTAGKRRELHALEENLAIISN--SEPAQLLEEERLRKEIAELRAKIERVPFIDTFDLRYK 237 + KR E+ A+ + +A + + P L+ E+ L K + ++D D+R+ Sbjct: 176 LKRPKREEVDAIRQEIAELESGSQSPVARQRIAALQAEVERLERKRRLIAYVDPVDIRFN 235 Query: 238 NYEKRPDPSSQAVMFCLMDVSGSMDQSTKDMAKRFYILLYLFLSRTYKNVEVVYIRHHTQ 297 +E +P P+++AVMFCLMDVSGSM + KD+AKRF++LL+LFL Y E+V+I H + Sbjct: 236 RFEAQPIPNAKAVMFCLMDVSGSMGEREKDLAKRFFVLLHLFLKCRYDRTEIVFISHTHE 295 Query: 298 AKEVDEHEFFYSQETGGTIVSSALKLMDEVVKERYNPAQWNIYAAQASDGDNWADDSPLC 357 A+EV+E FFYS ++GGT+VS+AL+ M ++ ERY ++WNIYAAQASDGDN A DS C Sbjct: 296 AQEVNEETFFYSTQSGGTVVSTALEKMHRIIAERYPGSEWNIYAAQASDGDNAAADSHRC 355 Query: 358 HEILAKKLLPVVRYYSY----------IEITRRAHQTLWREYEHLQSTFDNFAMQHIRDQ 407 +L ++++ + +YY+Y I T +LWR Y + + + NF M I D Sbjct: 356 ITLLDEEIMRLCQYYAYVEIIDERERHIFGTTENGTSLWRAYSSVNANWPNFQMTRIADA 415 Query: 408 DDIYPVFRELFHKQNATAK 426 DIYPVFR+LF +Q K Sbjct: 416 ADIYPVFRQLFTRQATAEK 434 >UniRef50_B6B8L1 Putative uncharacterized protein n=2 Tax=Rhodobacterales RepID=B6B8L1_9RHOB Length = 445 Score = 253 bits (646), Expect = 9e-66, Method: Composition-based stats. Identities = 194/444 (43%), Positives = 279/444 (62%), Gaps = 21/444 (4%) Query: 1 MTWFIDRRLNGKNKSMVNRQRFLRRYKAQIKQSISEAINKRSVTD----VDSGESVSIPT 56 M FIDRR N K KS+ NRQRFLRR + IK+ + +++ +S+ D GE V+IP Sbjct: 1 MHHFIDRRANPKGKSLGNRQRFLRRARENIKERVDQSVRGKSIQSGSGVPDGGEKVTIPA 60 Query: 57 EDISEPMF-HQGRGGLRHRVHPGNDHFVQNDRIERPQGGGGGSGSGQGQASQDGEGQDEF 115 + EP F H +GGLR V PGN FV D I+RPQGG G G +AS++G+G+DEF Sbjct: 61 RGLKEPRFFHSSKGGLRRHVLPGNKDFVVGDTIKRPQGGTGQ---GGRKASEEGDGEDEF 117 Query: 116 VFQISKDEYLDLLFEDLALPNLKQNQQRQLTEYKTHRAGYTANGVPANISVVRSLQNSLA 175 F ++++EYL++LFE L LP+L + + T RAG T G P N+++VR+++NSL Sbjct: 118 SFTLTQEEYLEILFEGLELPDLVEKATVETETIGTRRAGLTTAGTPNNLNLVRTMRNSLG 177 Query: 176 RRTAMTAGKRRELHALEENLA---IISNSEPAQLLEEERLRKEIAELRAKIERVPFIDTF 232 RR A+ + LEE +A + + P Q E LRK++ + K + V +ID Sbjct: 178 RRIALQRPTTKSQRDLEEQIAELEALDDRTPPQEDFLEALRKKLDGIIRKRKVVGYIDPL 237 Query: 233 DLRYKNYEKRPDPSSQAVMFCLMDVSGSMDQSTKDMAKRFYILLYLFLSRTYKNVEVVYI 292 DLRY + +S+AV+FCLMDVSGSM + KD+AKRF++LL+LFL R Y++ E+V++ Sbjct: 238 DLRYDTFVPEKIRNSRAVVFCLMDVSGSMQEREKDLAKRFFLLLHLFLERCYEHTELVFV 297 Query: 293 RHHTQAKEVDEHEFFYSQETGGTIVSSALKLMDEVVKERYNPAQWNIYAAQASDGDNWAD 352 RH A+EVDE FFY++ETGGTIVS+AL+ M E+++ERY P +WNIY AQASDG+N+ + Sbjct: 298 RHTHHAQEVDEETFFYARETGGTIVSTALEKMKEIIEERYPPDEWNIYGAQASDGENFGN 357 Query: 353 DSPLCHEILAKKLLPVVRYYSYIEI----------TRRAHQTLWREYEHLQSTFDNFAMQ 402 DS C ++L LLPV ++Y+Y+EI A + LW+ Y +++ +F MQ Sbjct: 358 DSARCKKLLLNDLLPVSQFYAYVEIVDEAAEMLLNNPEAGEDLWQNYREVKAQAQHFEMQ 417 Query: 403 HIRDQDDIYPVFRELFHKQNATAK 426 + IYP+FRE F + A+ Sbjct: 418 RVSQPGHIYPIFREFFLPKVKGAQ 441 >UniRef50_P59348 UPF0229 protein bll6755 n=25 Tax=Alphaproteobacteria RepID=Y6755_BRAJA Length = 427 Score = 250 bits (637), Expect = 1e-64, Method: Composition-based stats. Identities = 186/431 (43%), Positives = 276/431 (64%), Gaps = 18/431 (4%) Query: 3 WFIDRRLNGKNKSMVNRQRFLRRYKAQIKQSISEAINKRSVTDVDSGESVSIPTEDISEP 62 IDRRLN KS+ NRQRFLRR K+ ++ ++ + +R + DV G V+IP + + EP Sbjct: 4 HIIDRRLNPGGKSLENRQRFLRRAKSLVQGAVKKTSQERDIKDVLEGGEVTIPLDGMHEP 63 Query: 63 MFHQGRGGLRHRVHPGNDHFVQNDRIERPQGGGGGSGSGQGQASQDGEGQDEFVFQISKD 122 F + GG R V PGN FV+ D ++R G GS + +G+ +D F F +S+D Sbjct: 64 RFRR-EGGTRDMVLPGNKKFVEGDYLQR-----SGQGSAKDSGPGEGDSEDAFRFVLSRD 117 Query: 123 EYLDLLFEDLALPNLKQNQQRQLTEYKTHRAGYTANGVPANISVVRSLQNSLARRTAMTA 182 E++DL +DL LP+L + + Q RAGYT +G PANISV R+++ +LARR A+ Sbjct: 118 EFVDLFLDDLELPDLAKRKIAQTESEGIQRAGYTTSGSPANISVSRTVKLALARRIALKR 177 Query: 183 GKRRELHALEENLAIISNSEPAQLLEEERLRKEIAELRAKIERVPFIDTFDLRYKNYEKR 242 ++ E+ LE +A ++ E L E+ +L AK +R+PFID D+RY+ +E Sbjct: 178 PRKDEIEELEAAIAACTDE-----DERVVLLAELEKLMAKTKRIPFIDPLDIRYRRFETV 232 Query: 243 PDPSSQAVMFCLMDVSGSMDQSTKDMAKRFYILLYLFLSRTYKNVEVVYIRHHTQAKEVD 302 P P +QAVMFCLMDVSGSM + KD+AKRFY+LLY+FL R YK+VE+V+IRH +A+EVD Sbjct: 233 PKPVAQAVMFCLMDVSGSMSEHMKDLAKRFYMLLYVFLKRRYKHVEIVFIRHTDRAEEVD 292 Query: 303 EHEFFYSQETGGTIVSSALKLMDEVVKERYNPAQWNIYAAQASDGDNWADDSPLCHEILA 362 E FFY +GGT+VSSAL+ M ++V+ER+NP+ WNIYAAQASDGDN D L +L Sbjct: 293 EQTFFYGPASGGTLVSSALQAMHDIVRERFNPSDWNIYAAQASDGDNSYSDGELTGLLLT 352 Query: 363 KKLLPVVRYYSYIEITRR-------AHQTLWREYEHLQSTFDNFAMQHIRDQDDIYPVFR 415 K+LPV ++++Y+E+ + +LW YE L+++ +M+ + ++ +I+PVF Sbjct: 353 DKILPVCQFFAYLEVGESGGSAFDLSDSSLWTLYERLRNSGAPLSMRKVSERSEIFPVFH 412 Query: 416 ELFHKQNATAK 426 +LF ++ + + Sbjct: 413 DLFQRRETSQE 423 >UniRef50_A9I2A9 Putative uncharacterized protein n=6 Tax=Burkholderiales RepID=A9I2A9_BORPD Length = 419 Score = 245 bits (625), Expect = 2e-63, Method: Composition-based stats. Identities = 203/426 (47%), Positives = 280/426 (65%), Gaps = 10/426 (2%) Query: 1 MTWFIDRRLNGKNKSMVNRQRFLRRYKAQIKQSISEAINKRSVTDVDSGESVSIPTEDIS 60 M IDRRLNG+NKS VNR+RFLRRYK QI++++ + + +RS+ D+D G +++P DIS Sbjct: 1 MNSLIDRRLNGRNKSAVNRERFLRRYKDQIRRAVQDLVRERSIEDMDQGGEINLPARDIS 60 Query: 61 EPMFHQGRGGLRHRVHPGNDHFVQNDRIERPQGGGGGSGSGQGQASQDGEGQDEFVFQIS 120 EP F G+GG R VHPGN F + D RP G G GS G+ D+F F +S Sbjct: 61 EPHFRHGQGGDRELVHPGNREFAKGDTFPRPSGSDGEGGSEPGEGESV----DQFTFSLS 116 Query: 121 KDEYLDLLFEDLALPNLKQNQQRQLTEYKTHRAGYTANGVPANISVVRSLQNSLARRTAM 180 + E+L+L FEDL LP+L + Q +T+ K RAGYT G P+ +SV R+L+ SL+RR A+ Sbjct: 117 RAEFLNLFFEDLELPHLIRTQLGDVTQKKWQRAGYTTTGSPSLLSVSRTLKASLSRRVAL 176 Query: 181 TAGKRRELHALEENLAIISNSEPAQLLEEERLRKEIAELRAKIERVPFIDTFDLRYKNYE 240 R EL A + L + + A E + LR+E+ + ++ R+PF+D DLRY+N Sbjct: 177 GVAARAELEAAQAKLDA-AIAAGAPQAEIDALRQEVEDCANRLARLPFLDDLDLRYRNRV 235 Query: 241 KRPDPSSQAVMFCLMDVSGSMDQSTKDMAKRFYILLYLFLSRTYKNVEVVYIRHHTQAKE 300 P ++AVMFCLMDVSGSMD+ KD+AKRF+ LLYLFLSR Y++V+VV+IRH A+E Sbjct: 236 SVAMPMARAVMFCLMDVSGSMDEGKKDLAKRFFTLLYLFLSRKYEHVDVVFIRHTDNAEE 295 Query: 301 VDEHEFFYSQETGGTIVSSALKLMDEVVKERYNPAQWNIYAAQASDGDNWADDSPLCHEI 360 VDE FFY ++GGTIV SAL+LM E+V++RY P+ WN+YAAQASDGD++ D+ Sbjct: 296 VDEQTFFYDPKSGGTIVLSALELMHEIVQQRYPPSAWNVYAAQASDGDSFGADAGKSARF 355 Query: 361 LAKKLLPVVRYYSYIEI---TRRAHQTLWREYEHLQSTFDNFAMQHIRDQDDIYPVFREL 417 LA+ LLP RY++YIE+ +LW EYE +F M+ I ++ +IYPVF +L Sbjct: 356 LAENLLPATRYFAYIEVPDSQEARKSSLWAEYEQET--APHFVMRRICERGEIYPVFHDL 413 Query: 418 FHKQNA 423 F K+ A Sbjct: 414 FKKETA 419 >UniRef50_B7H9V9 UPF0229 protein BCB4264_A0587 n=93 Tax=Bacteria RepID=Y587_BACC4 Length = 391 Score = 242 bits (618), Expect = 1e-62, Method: Composition-based stats. Identities = 102/416 (24%), Positives = 178/416 (42%), Gaps = 47/416 (11%) Query: 14 KSMVNRQRFLRRYKAQIKQSISEAINKRSVTDVDSGESVSIPTEDISEPMFHQGRGGLRH 73 K ++QR + + IK ++ + + + S+ + + V IP + E +H Sbjct: 21 KGYDDQQRHQEKVQEAIKNNLPDLVTEESIVMSNGKDVVKIPIRSLDEYKIRYNYDKNKH 80 Query: 74 RVHPGNDHFVQNDRIERPQ-GGGGGSGSGQGQASQDGEGQDEFVFQISKDEYLDLLFEDL 132 V GN D + R GG G G+GQ + D G+D + ++S E F +L Sbjct: 81 -VGQGNGDSKVGDVVARDGSGGQKQKGPGKGQGAGDAAGEDYYEAEVSILELEQAFFREL 139 Query: 133 ALPNLKQNQQRQLTEYKTHRAGYTANGVPANISVVRSLQNSLARRTAMTAGKRRELHALE 192 LPNLK+ + + G+ NI R++ ++ R Sbjct: 140 ELPNLKRKEMDENRIEHVEFNDIRKTGLWGNIDKKRTMISAYKRNAMSGKAS-------- 191 Query: 193 ENLAIISNSEPAQLLEEERLRKEIAELRAKIERVPFIDTFDLRYKNYEKRPDPSSQAVMF 252 I DL+++ + + P S+AV+ Sbjct: 192 ---------------------------------FHPIHQEDLKFRTWNEVLKPDSKAVVL 218 Query: 253 CLMDVSGSMDQSTKDMAKRFYILLYLFLSRTYKNVEVVYIRHHTQAKEVDEHEFFYSQET 312 +MD SGSM K MA+ F+ + FL Y+ V++ +I HHT+AK V E EFF E+ Sbjct: 219 AMMDTSGSMGIWEKYMARSFFFWMTRFLRTKYETVDIEFIAHHTEAKVVTEEEFFSKGES 278 Query: 313 GGTIVSSALKLMDEVVKERYNPAQWNIYAAQASDGDNWADDSPLCHEILAKKLLPVVRYY 372 GGTI SS K E++ +Y+P ++NIY SDGDN D+ C + L ++L+ + Sbjct: 279 GGTICSSVYKKALELIDNKYSPDRYNIYPFHFSDGDNLTSDNARCVK-LVEELMKKCNMF 337 Query: 373 SYIEITR-RAHQTLWREYEHLQSTFDNFAMQHIRDQDDIYPVFRELFHKQNATAKG 427 Y E+ + H TL Y++++ + + ++ + D++ + F +++ Sbjct: 338 GYGEVNQYNRHSTLMSAYKNIKDDNFRYYI--LKQKADVFHAMKSFFREESGEKMA 391 >UniRef50_B9DZE4 UPF0229 protein CKR_0568 n=26 Tax=Clostridiales RepID=Y568_CLOK1 Length = 403 Score = 242 bits (617), Expect = 2e-62, Method: Composition-based stats. Identities = 120/413 (29%), Positives = 203/413 (49%), Gaps = 25/413 (6%) Query: 13 NKSMVNRQRFLRRYKAQIKQSISEAINKRSVTDVDSGESVSIPTEDISEPMFHQGRGGLR 72 ++S+ +R+R + + IK ++++ I++ S+ + V IP + I E F G Sbjct: 14 DRSLEDRRRHRQLVEKSIKDNLADIISEESIIGQSKNKKVKIPIKGIKEYQFIYG--DNS 71 Query: 73 HRVHPGNDHFVQNDRIERPQGGGGGSGSGQGQASQDGEGQDEFVFQISKDEYLDLLFEDL 132 V G+ + DRI + G G Q + + EG+D + +++ ++ LD L EDL Sbjct: 72 SGVGSGDGSQKKGDRIGKAIKDRDGKG---NQGAGNQEGEDMYEIEVTIEDVLDYLMEDL 128 Query: 133 ALPNLKQNQQRQLTEYK-THRAGYTANGVPANISVVRSLQNSLARRTAMTAGKRRELHAL 191 LP + + + Q+ ++GY G+ ++ R++ L R+ R L Sbjct: 129 ELPLMDKKKFSQILSNNSPKKSGYQRKGINPRLAKKRTVVEKLKRQQGTKRALREIHGEL 188 Query: 192 EENLAIISNSEPAQLLEEERLRKEIAELRAKIERVPFIDTFDLRYKNYEKRPDPSSQAVM 251 E + +L E ++ DLRY +++P A + Sbjct: 189 E-------SDPKNKLPENTTIKSRFP-----------FKQDDLRYFRVKRKPKLELNAAI 230 Query: 252 FCLMDVSGSMDQSTKDMAKRFYILLYLFLSRTYKNVEVVYIRHHTQAKEVDEHEFFYSQE 311 C+MD SGSMD + K +A+ F+ +LY F+ Y NVEV +I H T AK V E+EFF+ E Sbjct: 231 ICVMDTSGSMDSTRKFLARSFFFVLYRFIKMKYNNVEVKFISHSTSAKVVTENEFFHKVE 290 Query: 312 TGGTIVSSALKLMDEVVKERYNPAQWNIYAAQASDGDNWADDSPLCHEILAKKLLPVVRY 371 +GGT +SS LK EV++E YNPA WN+Y SDGDNW++D+ L + AK L V Sbjct: 291 SGGTYISSGLKKALEVIEENYNPAYWNVYTFYVSDGDNWSEDNSLALK-CAKDLCKVCNL 349 Query: 372 YSYIEITRRAHQTLWREYEHLQSTFDNFAMQHIRDQDDIYPVFRELFHKQNAT 424 +SY EI + + + + T +NF + I ++ D++ +++ +K+ Sbjct: 350 FSYAEIIPSPYGSSIKHIFQNKITDNNFTVVTIHEKQDLWKSLKKILNKELEE 402 >UniRef50_A7Z2R4 UPF0229 protein RBAM_009260 n=29 Tax=Firmicutes RepID=Y926_BACA2 Length = 394 Score = 223 bits (569), Expect = 7e-57, Method: Composition-based stats. Identities = 101/413 (24%), Positives = 181/413 (43%), Gaps = 47/413 (11%) Query: 14 KSMVNRQRFLRRYKAQIKQSISEAINKRSVTDVDSGESVSIPTEDISEPMFHQGRGGLRH 73 K ++QR ++ + IK ++ + + + S+ + + V IP + E +H Sbjct: 21 KGFDDQQRHQKKVQEAIKNNLPDLVTEESIIMSNGKDVVKIPIRSLDEYKIRYNYDKNKH 80 Query: 74 RVHPGNDHFVQNDRIERPQGGGGGSGSGQGQASQDGEGQDEFVFQISKDEYLDLLFEDLA 133 V G+ D + R G G+G+GQ + D G+D + ++S + + LF++L Sbjct: 81 -VGQGDGDSEVGDVVARD-GADKKQGAGKGQGAGDQAGEDYYEAEVSLMDLEEALFQELE 138 Query: 134 LPNLKQNQQRQLTEYKTHRAGYTANGVPANISVVRSLQNSLARRTAMTAGKRRELHALEE 193 LPNL+Q ++ + G+ NI R++ ++ R Sbjct: 139 LPNLQQKERDNIVHTDIEFNDIRKTGLTGNIDKKRTMLSAYKRNAMTGKPS--------- 189 Query: 194 NLAIISNSEPAQLLEEERLRKEIAELRAKIERVPFIDTFDLRYKNYEKRPDPSSQAVMFC 253 I DL+YK + P S+AV+ Sbjct: 190 --------------------------------FYPIYPEDLKYKTWNDVTKPESKAVVLA 217 Query: 254 LMDVSGSMDQSTKDMAKRFYILLYLFLSRTYKNVEVVYIRHHTQAKEVDEHEFFYSQETG 313 +MD SGSM K MA+ F+ + FL Y+ V++ +I HHT+AK V E +FF E+G Sbjct: 218 MMDTSGSMGVWEKYMARSFFFWMTRFLRTKYETVDIEFIAHHTEAKVVSEEDFFSKGESG 277 Query: 314 GTIVSSALKLMDEVVKERYNPAQWNIYAAQASDGDNWADDSPLCHEILAKKLLPVVRYYS 373 GTI SS + E++ E+Y+PA++NIY SDGDN D+ C + L ++ + Sbjct: 278 GTICSSVYRKSLELIDEKYDPARYNIYPFHFSDGDNLTSDNARCVK-LVNDIMKKSNLFC 336 Query: 374 YIEITR-RAHQTLWREYEHLQSTFDNFAMQHIRDQDDIYPVFRELFHKQNATA 425 Y E+ + H TL Y++++ + + ++ + D++ + F + + Sbjct: 337 YGEVNQYNRHSTLMSAYKNVKDDKFKYYI--LKQKSDVFQALKSFFKNEESGV 387 >UniRef50_B9L510 Sporulation protein YhbH n=1 Tax=Thermomicrobium roseum DSM 5159 RepID=B9L510_THERP Length = 389 Score = 222 bits (566), Expect = 2e-56, Method: Composition-based stats. Identities = 111/417 (26%), Positives = 178/417 (42%), Gaps = 51/417 (12%) Query: 14 KSMVNRQRFLRRYKAQIKQSISEAINKRSVTDVDSGESVSIPTEDISEPMFHQGRGGLRH 73 K +++ R + + K IK+++++ ++++S+ D V +P + E F R Sbjct: 19 KGAIDQARHMEKVKEAIKRNLADIVSEQSLITSDGKRVVRVPIRVLEEYRFRFDPDSGR- 77 Query: 74 RVHPGNDHFVQNDRIERPQGGGGGSGSGQGQASQDGEGQDEFVFQISKDEYLDLLFEDLA 133 + GGG SG G + D G D + +++ +E +L+FEDL Sbjct: 78 ---QVGQGSGGTHVGDVVGRVGGGQRSGDGPQAGDQPGIDYYEAELTIEELSELIFEDLE 134 Query: 134 LPNLKQNQQRQLTEYKTHRAGYTANGVPANISVVRSLQNSLARRTAMTAGKRRELHALEE 193 LPNL++ + R+L +G AN+ R+L+ +L R Sbjct: 135 LPNLEEKRLRELESEAVRFTEIRRHGPFANLDKRRTLRENLRRN---------------- 178 Query: 194 NLAIISNSEPAQLLEEERLRKEIAELRAKIERVPFIDTFDLRYKNYEKRPDPSSQAVMFC 253 R+ DLR+K +E+ S AV+ Sbjct: 179 -------------------------AWRGRARIGDFANEDLRFKTWERDVKRESNAVVIA 213 Query: 254 LMDVSGSMDQSTKDMAKRFYILLYLFLSRTYKNVEVVYIRHHTQAKEVDEHEFFYSQETG 313 +MDVSGSM K +++ FY + FL Y VE+ +I HH +A+EV E EFF E+G Sbjct: 214 MMDVSGSMGTFEKYVSRAFYYWMVRFLRTKYDRVEIRFIAHHAEAREVSEEEFFSRGESG 273 Query: 314 GTIVSSALKLMDEVVKERYNPAQWNIYAAQASDGDNWADDSPLCHEILAKKLLPVVRYYS 373 GT S+A +L ++++E Y P WNIY SDGDNW D+ C E LA++LL + Sbjct: 274 GTRASTAYELALQLIRESYPPDSWNIYPFHFSDGDNWPSDNERCRE-LAEELLRCANLFG 332 Query: 374 YIEITR---RAHQTLWREYEHLQSTFDNFAMQHIRDQDDIYPVFRELFHKQNATAKG 427 Y EI + TL + + I ++ D+Y R F + Sbjct: 333 YGEIRQGRYTYQSTLMHTLQRI--GSPKLVTVTITEKADVYQALRRFFGPEVGQEVA 387 >UniRef50_C6M483 von Willebrand factor type A domain protein n=1 Tax=Neisseria sicca ATCC 29256 RepID=C6M483_NEISI Length = 538 Score = 214 bits (544), Expect = 5e-54, Method: Composition-based stats. Identities = 42/308 (13%), Positives = 96/308 (31%), Gaps = 23/308 (7%) Query: 131 DLALPNLKQNQQRQLTEYKTHRAGYTANGVPANISVVRSLQNSLARRTAMTAGKRRELHA 190 + LP + ++ Q + ++ +I V ++ R ++ Sbjct: 57 EENLPLAENTERYQDQPDQPVKSVAQEPVSTFSIDVDTGSYANVRRFLTNGEQPPKDAVR 116 Query: 191 LEENLAIISNSEPAQLLEEERLRKEIAELRAKIERVPFIDTFDLRYKNYEKRPDPSSQAV 250 +EE + + P + + + + ++ ++ K+ P A Sbjct: 117 IEEIVNYFPYNYPLPTD-NRPFAVHTETIDSPWQPEAKLIKIGIQAQDTAKKDLP--PAN 173 Query: 251 MFCLMDVSGSMDQS-TKDMAKRFYILLYLFLSRTYKNVEVVYIR--------HHTQAKEV 301 + L+DVSGSMD+ + ++ +L L K + Y KE Sbjct: 174 LVFLVDVSGSMDEENKLPLVQKTLRILTQQLRPQDKVTLITYASGEDLVLPPTSGADKET 233 Query: 302 DEHEFFYSQETGGTIVSSALKLMDEVVKERYNPAQWNIYAAQASDGD-NWADDSPLCHEI 360 + G T SAL++ E ++ + P N A+DGD N + Sbjct: 234 ILSAIDKLRAGGATDGESALQMAYEQAQKAFVPNGINR-ILLATDGDFNVGVSDTETLKS 292 Query: 361 LAKKLLP-VVRYYSYIEITRRAHQTLWREYEHLQSTFDNFAMQHIRDQDDIYPVFRELFH 419 + + V + ++ + + ++ D +++ Sbjct: 293 MVAEKRKSGVSLSTLGFGMGNYNEDMMEQIADAGDGNYSYI--------DNEKEAKKVLQ 344 Query: 420 KQNATAKG 427 +Q + Sbjct: 345 QQLTSTLA 352 >UniRef50_Q3J885 Putative uncharacterized protein n=2 Tax=Nitrosococcus oceani RepID=Q3J885_NITOC Length = 394 Score = 213 bits (542), Expect = 1e-53, Method: Composition-based stats. Identities = 113/419 (26%), Positives = 196/419 (46%), Gaps = 49/419 (11%) Query: 13 NKSMVNRQRFLRRYKAQIKQSISEAINKRSVTDVDSGESVSIPTEDISEPMFHQGRGGLR 72 ++S +R R ++ + I+ ++++ + + S+ + +P I E F G+ Sbjct: 17 DRSAKDRLRHRQKVRKAIRDNVADIVAEESIIGQSRDRIIKVPIRGIREYRFVYGQNTPG 76 Query: 73 HRVHPGNDHFVQNDRIERPQGGGGGSGSGQGQASQDGEGQDEFVFQISKDEYLDLLFEDL 132 G+ G G G + D G D + +I+ +E ++++ EDL Sbjct: 77 VGTGQGDSEPG-------QTVGQVPQGDGGPGHAGDRPGMDYYETEITLEELIEIMLEDL 129 Query: 133 ALPNLKQNQQRQLTEYKT-HRAGYTANGVPANISVVRSLQNSLARRTAMTAGKRRELHAL 191 LP++++ + R++ +T R G+ GV ++ R+ ++ + RR A ++ Sbjct: 130 ELPDMERKRFREVLSERTSKRKGFRRVGVRVHMDKRRTAKSRIRRRLA----SDKDAEDN 185 Query: 192 EENLAIISNSEPAQLLEEERLRKEIAELRAKIERVPFIDTFDLRYKNYEKRPDPSSQAVM 251 E D+RY + P S AV+ Sbjct: 186 ETKHRFP------------------------------FHRDDMRYHRLREDMRPQSNAVV 215 Query: 252 FCLMDVSGSMDQSTKDMAKRFYILLYLFLSRTYKNVEVVYIRHHTQAKEVDEHEFFYSQE 311 FC+MD SGSMD K +A+ F+ LLY F+ Y NV+VV+I HHT+A+EV E EFF+ E Sbjct: 216 FCIMDTSGSMDTLKKYLARSFFFLLYQFVRSRYVNVDVVFIAHHTKAREVTEEEFFHKGE 275 Query: 312 TGGTIVSSALKLMDEVVKERYNPAQWNIYAAQASDGDNWADDSPLCHEILAKKLLPVVRY 371 GGT +SS E+++ RY+P+ WNIYA SDGDN+ D+ + A+ L V Sbjct: 276 AGGTFISSGYSKALEIIQNRYHPSLWNIYAFHCSDGDNFDSDNAATLKA-AEVLCQVCNL 334 Query: 372 YSYIEI----TRRAHQTLWREYEHLQSTFDNFAMQHIRDQDDIYPVFRELFHKQNATAK 426 + Y EI + T+ + ++ DNF I+ ++DI+P FR+L +++ ++K Sbjct: 335 FGYGEIKPRPSGFYEGTMLDLFRSVRM--DNFQSVLIQRKEDIWPSFRQLLSRESESSK 391 >UniRef50_B1KPQ5 von Willebrand factor type A n=8 Tax=Gammaproteobacteria RepID=B1KPQ5_SHEWM Length = 640 Score = 212 bits (538), Expect = 3e-53, Method: Composition-based stats. Identities = 52/388 (13%), Positives = 100/388 (25%), Gaps = 37/388 (9%) Query: 65 HQGRGGLRHRVHPGNDHFVQNDRIERPQGGGGGSGSGQGQASQDGE-------------- 110 G G+ Q+ GG E Sbjct: 55 ELSTGKSESNPIAGSSPNQQSIDTASGIGGTEVQSEASQGEVSVRETYRAAKQASERMKS 114 Query: 111 -GQDEFVFQISKDEYLDLLFEDLALPNLKQNQQRQLTEYKTHRAGYTANGVPANISVVRS 169 + +D+ LP L+ + + +I V Sbjct: 115 MKVHSRPESFALMGLPSRPSQDIYLPELQNRDKFERQVANGIMVAGEIPVSTFSIDVDTG 174 Query: 170 LQNSLARRTAMTAGKRRELHALEENLAIISNSEPAQLLEEERLRKEIAELRAKIERVPFI 229 ++L R R +EE + + PA E+ + + Sbjct: 175 SYSTLRRSINHGVLPERGTVRVEELINYFAYQYPAPDAGEQPFSVNTELAPSPYNPHKML 234 Query: 230 DTFDLRYKNYEKRPDPSSQAVMFCLMDVSGSMDQSTK-DMAKRFYILLYLFLSRTYKNVE 288 L+ EK +SQ + L+DVSGSM K + K +L L + Sbjct: 235 LRIGLKGFEKEKADLGASQ--LVFLLDVSGSMSSQDKLPLLKNALKMLSQQLDEGDRISI 292 Query: 289 VVYIRHHTQAKE--------VDEHEFFYSQETGGTIVSSALKLMDEVVKERYNPAQWNIY 340 VVY + + G T + ++L ++ ++ + N Sbjct: 293 VVYAGASGVVLDGVKGNDTLAISQALDKLKAGGSTNGGAGIELAYQLAQKHFIAGGVNRV 352 Query: 341 AAQASDGD-NWADDSPLCHEILAKKLLP-VVRYYSYIEITRRAHQTLWREYEHLQSTFDN 398 A+DGD N E + ++ + + + L + + Sbjct: 353 I-LATDGDFNVGVSDQQALEDMIEEKRKQGIALTTLGFGQGNYNDHLMEQLADKGNGHYA 411 Query: 399 FAMQHIRDQDDIYPVFRELFHKQNATAK 426 + D R++ + + Sbjct: 412 YI--------DTLNEARKVLVDEISATL 431 >UniRef50_Q896G6 Conserved protein n=1 Tax=Clostridium tetani RepID=Q896G6_CLOTE Length = 386 Score = 208 bits (528), Expect = 4e-52, Method: Composition-based stats. Identities = 109/423 (25%), Positives = 191/423 (45%), Gaps = 53/423 (12%) Query: 11 GKNKSMVNRQRFLRRYKAQIKQSISEAINKRSVTDVDSGESVSIPTEDISEPMFHQGRGG 70 N++ +R+R + IK ++ + + + ++ + +P + + E F + Sbjct: 11 SYNRAGEDRKRHRELVEKSIKDNLVDVLLQEDISIQKENIKIKVPIKGVKEYEFTYSQNR 70 Query: 71 LRHRVHPGNDHFVQNDRIERPQGGGGGSGSGQGQASQDGEGQDEFVFQISKDEYLDLLFE 130 V GN+ Q ++R GGG+G+ + EG+D F +++ +E +F+ Sbjct: 71 SFVVVGKGNEKKGQKIALKRASEQGGGAGA------GEIEGEDIFETEVTIEEIFQSIFD 124 Query: 131 DLALPNLKQNQQRQLTEYKTHRA-GYTANGVPANISVVRSLQNSLARRTAMTAGKRRELH 189 DL LPNLK+ + ++ R G+ +G+ ++ R+ + R+ A R+ Sbjct: 125 DLELPNLKKKKFNKILNDSFKRKKGFKKHGISPRLAKRRTAIEKVKRKQATQKVLGRD-- 182 Query: 190 ALEENLAIISNSEPAQLLEEERLRKEIAELRAKIERVPFIDTFDLRYKNYEKRPDPSSQA 249 + E +K DLRY + + A Sbjct: 183 ----------------IAERFPFKK-----------------DDLRYSRVKLNKNKEYNA 209 Query: 250 VMFCLMDVSGSMDQSTKDMAKRFYILLYLFLSRTYKNVEVVYIRHHTQAKEVDEHEFFYS 309 V+ C+MD S SMDQ K MA+ F+ ++Y F+ Y+ V++ +I H T AKEV E EFF+ Sbjct: 210 VIICIMDTSASMDQMKKYMARSFFFMIYKFIKMKYEEVDICFISHSTTAKEVTEEEFFHK 269 Query: 310 QETGGTIVSSALKLMDEVVKERYNPAQWNIYAAQASDGDNWADDSPLCHEILAKKLLPVV 369 E+GGT +SS K E++ RYNP +NIY ASDGDNW +D+ + +AK+L V Sbjct: 270 VESGGTYISSGYKKALEIINTRYNPQIYNIYTFHASDGDNWNEDNDRAVK-VAKELSNVC 328 Query: 370 RYYSYIEITRRAHQT-----LWREYEHLQSTFDNFAMQHIRDQDDIYPVFRELFHKQNAT 424 + YIEI + +E E F I ++D++ +++ ++ Sbjct: 329 NLFGYIEIMGYGYSNGIRNKYLKEIEKEN-----FIPLIIEKKEDLWRALKDILKQEMRE 383 Query: 425 AKG 427 +G Sbjct: 384 ERG 386 >UniRef50_C7PNZ7 von Willebrand factor type A n=5 Tax=Bacteroidetes RepID=C7PNZ7_CHIPD Length = 639 Score = 207 bits (526), Expect = 8e-52, Method: Composition-based stats. Identities = 42/299 (14%), Positives = 77/299 (25%), Gaps = 23/299 (7%) Query: 139 QNQQRQLTEYKTHRAGYTANGVPANISVVRSLQNSLARRTAMTAGKRRELHALEENLAII 198 + + +I V R+ +++ R + +EE + Sbjct: 166 NTEDYSPVNENRFHTVASDPLSTFSIDVDRASYSNVRRFLNEGNMPPVDAVRVEEMINYF 225 Query: 199 SNSEPAQLLEEERLRKEIAELRAKIERVPFIDTFDLRYKNYEKRPDPSSQAVMFCLMDVS 258 + + L+ K+ K P S + L+DVS Sbjct: 226 DYKYSNPTG-NTPVAVRTDMAICPWNTAHQLVRIALKGKDVAKDNLPPS--NLVFLIDVS 282 Query: 259 GSMDQSTK-DMAKRFYILLYLFLSRTYKNVEVVYIR--------HHTQAKEVDEHEFFYS 309 GSM + K + K+ + LL L + VVY K Sbjct: 283 GSMSDAKKLPLVKQAFKLLVNQLRPVDRVAIVVYAGAAGLVLPSTSGDHKTAILDALDKL 342 Query: 310 QETGGTIVSSALKLMDEVVKERYNPAQWNIYAAQASDGD-NWADDSPLCHEILAKKLL-P 367 + G T ++L + E + N A+DGD N S + + +K Sbjct: 343 EAGGSTAGGEGVQLAYKTATEYLLKSGNNRVII-ATDGDFNVGPSSDGELQRIIEKKREK 401 Query: 368 VVRYYSYIEITRRAHQTLWREYEHLQSTFDNFAMQHIRDQDDIYPVFRELFHKQNATAK 426 + + + D + R F + Sbjct: 402 GIFLSVLGFGMGNYKDNKLELLADKGNGNYAYI--------DNFEEARRTFATEFGGTL 452 >UniRef50_A4C5H3 Von Willebrand factor type A domain protein n=1 Tax=Pseudoalteromonas tunicata D2 RepID=A4C5H3_9GAMM Length = 608 Score = 206 bits (524), Expect = 1e-51, Method: Composition-based stats. Identities = 41/303 (13%), Positives = 83/303 (27%), Gaps = 22/303 (7%) Query: 135 PNLKQNQQRQLTEYKTHRAGYTANGVPANISVVRSLQNSLARRTAMTAGKRRELHALEEN 194 N + + E + +I V ++ R M + E Sbjct: 130 ENEQARENYLKNEQNPVKQVMLEPVSTFSIDVDTGSYSNSRRMIKMGKRPPADAVREEAF 189 Query: 195 LAIISNSEPAQLLEEERLRKEIAELRAKIERVPFIDTFDLRYKNYEKRPDPSSQAVMFCL 254 + A E A + ++ + EK + A + L Sbjct: 190 INYFDYHYSAPKSLETPFNVHTEVAPAPWNNQRQLLKIGIKGFDIEKAELKA--ANLVFL 247 Query: 255 MDVSGSMDQSTK-DMAKRFYILLYLFLSRTYKNVEVVYIRHHTQAKEVDE--------HE 305 +DVSGSM+ K + K +L L VVY + + Sbjct: 248 LDVSGSMNAPDKLPLLKSSLTMLTKQLDENDSVAIVVYAGAAGLVLPATKGNEYQVISNA 307 Query: 306 FFYSQETGGTIVSSALKLMDEVVKERYNPAQWNIYAAQASDGD-NWADDSPLCHEILAKK 364 G T + ++L ++ + + N A+DGD N S + L Sbjct: 308 LNNLSAGGSTNGAQGIELAYQIASQNFKKEGINRVI-LATDGDFNVGMSSVDALKKLIAN 366 Query: 365 LLP-VVRYYSYIEITRRAHQTLWREYEHLQSTFDNFAMQHIRDQDDIYPVFRELFHKQNA 423 + + + L + ++ + + D R++ + + Sbjct: 367 KRKTGIALTTLGFGQGNYNDGLMEQLANIGNGQHAYI--------DTINEARKVLVDELS 418 Query: 424 TAK 426 + Sbjct: 419 STM 421 >UniRef50_A5WCP1 von Willebrand factor, type A n=2 Tax=Moraxellaceae RepID=A5WCP1_PSYWF Length = 571 Score = 205 bits (522), Expect = 2e-51, Method: Composition-based stats. Identities = 37/303 (12%), Positives = 85/303 (28%), Gaps = 23/303 (7%) Query: 137 LKQNQQRQLTEYKTH--RAGYTANGVPANISVVRSLQNSLARRTAMTAGKRRELHALEEN 194 + QQ E + + A +I ++ R ++ +EE Sbjct: 100 MAPKQQENYAEIEPNAVNATSEQAFATLSIDTDTGSYANVRRFLNQGQLPPKDAVRVEEL 159 Query: 195 LAIISNSE-PAQLLEEERLRKEIAELRAKIERVPFIDTFDLRYKNYEKRPDPSSQAVMFC 253 + + A+ + + I ++ ++ A + Sbjct: 160 INYFNYDFTAAKKQANAPFLVSTEVVNSPWHPTNQIVKVGIKAEDLLTAKQKQPPANLVF 219 Query: 254 LMDVSGSMD-QSTKDMAKRFYILLYLFLSRTYKNVEVVYIRH--------HTQAKEVDEH 304 L+DVSGSMD + +AK +L L + Y + + + Sbjct: 220 LVDVSGSMDTEDKLQLAKSSLKMLTKQLRAQDSITLITYAGNTKVVLPSTPGNQTQKILN 279 Query: 305 EFFYSQETGGTIVSSALKLMDEVVKERYNPAQWNIYAAQASDGD-NWADDSPLCHEILAK 363 +G T +A+KL + E + N +DGD N S + + Sbjct: 280 AIDNLTASGSTNGEAAIKLAYQQATEHFKKDGINR-ILMLTDGDFNVGVSSVKDMLQIIR 338 Query: 364 KLL-PVVRYYSYIEITRRAHQTLWREYEHLQSTFDNFAMQHIRDQDDIYPVFRELFHKQN 422 + + + + + + ++ D +++ + Sbjct: 339 SNRDKGISLSTLGFGQGNYNDHMMEQVADNGNGNYSYI--------DSLSEAKKVLIDEM 390 Query: 423 ATA 425 + Sbjct: 391 SAT 393 >UniRef50_C0CVB5 Putative uncharacterized protein n=1 Tax=Clostridium asparagiforme DSM 15981 RepID=C0CVB5_9CLOT Length = 556 Score = 205 bits (520), Expect = 4e-51, Method: Composition-based stats. Identities = 52/348 (14%), Positives = 91/348 (26%), Gaps = 23/348 (6%) Query: 90 RPQGGGGGSGSGQGQASQDGEGQDEFVFQISKDEYLDLLFEDLALPNLKQNQQRQLTEYK 149 GGG + S + G ++ ++ + E P ++ Sbjct: 23 GCGAGGGKTASATEAEVKAEAGSYASETMAAQSQWDGAVMEAEGPPLSHNTEEYNYIAEN 82 Query: 150 THRAGYTANGVPANISVVRSLQNSLARRTAMTAGKRRELHALEENLAIISNSEPAQLLEE 209 A A V + +L R+ + +EE L + P E+ Sbjct: 83 AFLAVANAPLSTFAADVDTASYANLRRKILEGNEVPADAVRIEEMLNYFTYDYPEPT-ED 141 Query: 210 ERLRKEIAELRAKIERVPFIDTFDLRYKNYEKRPDPSSQAVMFCLMDVSGSM-DQSTKDM 268 E + L+ + + S + L+DVSGSM + Sbjct: 142 EPFSVTTYIGDCPWNENHKLLQIGLQAEKPDLENQKPS--NLVFLIDVSGSMESADKLGL 199 Query: 269 AKRFYILLYLFLSRTYKNVEVVYIRHHTQA--------KEVDEHEFFYSQETGGTIVSSA 320 KR ++LL L V Y T K G T S Sbjct: 200 VKRAFLLLTENLRPEDTVSIVTYASSDTVVLDGVSGEEKAAIMTAIENLTAGGSTDGSKG 259 Query: 321 LKLMDEVVKERYNPAQWNIYAAQASDGD-NWADDSPLCHEILAKKLLPVVRYYSY-IEIT 378 ++ + +E + N A+DGD N S L +K + S T Sbjct: 260 IETAYRLAEEHFQKDGNNRVI-LATDGDLNLGLTSEGDLTRLIQKKKESGVFLSVMGFGT 318 Query: 379 RRAHQTLWREYEHLQSTFDNFAMQHIRDQDDIYPVFRELFHKQNATAK 426 + + D + + ++ Sbjct: 319 GNIKDNKMEALADNGNGQYAYV--------DSLMEAKRVLVEELGGTL 358 >UniRef50_B8F8Z5 von Willebrand factor type A n=1 Tax=Desulfatibacillum alkenivorans AK-01 RepID=B8F8Z5_DESAA Length = 558 Score = 204 bits (519), Expect = 5e-51, Method: Composition-based stats. Identities = 38/305 (12%), Positives = 84/305 (27%), Gaps = 24/305 (7%) Query: 133 ALPNLKQNQQRQLTEYKTHRAGYTANGVPANISVVRSLQNSLARRTAMTAGKRRELHALE 192 +P+ + + E ++ +I V + +++ R + + +E Sbjct: 83 RVPDYNTEEYAPIRE-GGFKSPLYDPLSTFSIDVDTASYSNVRRFLSYGNMPPVDAVRIE 141 Query: 193 ENLAIISNSEPAQLLEEERLRKEIAELRAKIERVPFIDTFDLRYKNYEKRPDPSSQAVMF 252 E + P + + + + R + L+ + + + S + Sbjct: 142 EMINYFHYDYPQPKGQ-DPFSITMEMSQCPWNRDNMLVHVGLQGRCLDYKDVKPS--NLV 198 Query: 253 CLMDVSGSM-DQSTKDMAKRFYILLYLFLSRTYKNVEVVYIRHHT--------QAKEVDE 303 L+DVSGSM ++ + KR +L L + V Y + K Sbjct: 199 FLLDVSGSMNSENKLPLVKRSMEMLVKELGAGDRVSIVTYAGSAGLVLPSTSARNKRKII 258 Query: 304 HEFFYSQETGGTIVSSALKLMDEVVKERYNPAQWNIYAAQASDGD-NWADDSPLCHEILA 362 + G T ++L V E P N +DGD N S + Sbjct: 259 TALDRLEAGGSTAGGEGIELAYRVAWENLIPEGNNRVI-LCTDGDFNVGVSSTPELVRMI 317 Query: 363 KKLLP-VVRYYSYIEITRRAHQTLWREYEHLQSTFDNFAMQHIRDQDDIYPVFRELFHKQ 421 ++ + + + + ++F + Sbjct: 318 EEKRRAGIYLTICGFGMGNYKDEKMEAISNAGNGNFYYIDSR--------REAHKVFVQD 369 Query: 422 NATAK 426 Sbjct: 370 MRANM 374 >UniRef50_B0T5X0 von Willebrand factor type A n=1 Tax=Caulobacter sp. K31 RepID=B0T5X0_CAUSK Length = 592 Score = 202 bits (514), Expect = 2e-50, Method: Composition-based stats. Identities = 40/303 (13%), Positives = 86/303 (28%), Gaps = 22/303 (7%) Query: 135 PNLKQNQQRQLTEYKTHRAGYTANGVPANISVVRSLQNSLARRTAMTAGKRRELHALEEN 194 P ++ ++ + +I V + ++ R A + +EE Sbjct: 118 PPIRDTEKYPGAAANPVKRVAEEPVSTFSIDVDTAAYANVRRFLNEGAAPPHDALRVEEL 177 Query: 195 LAIISNSEPAQLLEEERLRKEIAELRAKIERVPFIDTFDLRYKNYEKRPDPSSQAVMFCL 254 + +E + + + + + + ++ + P + L Sbjct: 178 INYFDYGYARPTAQEPPFKPTVTVVPSPWSQDRQLMHIGVQGYATPRAGQP--PLNLVFL 235 Query: 255 MDVSGSMDQSTK-DMAKRFYILLYLFLSRTYKNVEVVYIRH--------HTQAKEVDEHE 305 +D SGSM + +AK+ +L L + V Y ++K Sbjct: 236 IDTSGSMSGPDRLPLAKKALNVLIDQLRPQDRVSMVAYAGSAGAVLSPTDGKSKLKMRCA 295 Query: 306 FFYSQETGGTIVSSALKLMDEVVKERYNPAQWNIYAAQASDGD-NWADDSPLCHEILAKK 364 + G T L+L + ++ +P N +DGD N P + Sbjct: 296 LTALRSGGSTAGGQGLELAYALARQNLDPKAVNRVI-LMTDGDFNVGIADPTRLKDFVAD 354 Query: 365 LLP-VVRYYSYIEITRRAHQTLWREYEHLQSTFDNFAMQHIRDQDDIYPVFRELFHKQNA 423 V Y + T+ + + + D R+L Sbjct: 355 QRKSGVYLSVYGFGRGNYNDTMMQALAQNGNGTAAYV--------DGLQEARKLLRDDFD 406 Query: 424 TAK 426 +A Sbjct: 407 SAL 409 >UniRef50_UPI0001C375AE von Willebrand factor type A n=1 Tax=Ruminococcus flavefaciens FD-1 RepID=UPI0001C375AE Length = 550 Score = 201 bits (511), Expect = 4e-50, Method: Composition-based stats. Identities = 42/356 (11%), Positives = 94/356 (26%), Gaps = 37/356 (10%) Query: 96 GGSGSGQGQASQDGEGQDEFVFQISKDEY--------------LDLLFEDLALPNLKQNQ 141 G + S E ++ S DE D + + N+ Sbjct: 25 GDNMHDTASGSYKEESSAYEYYEESADEEFAPEYFSTDDYAPEGDYYYWEEEPELPSANE 84 Query: 142 QRQLTEYKTHRAGYTANGVPANISVVRSLQNSLARRTAMTAGKRRELHALEENLAIISNS 201 + + + + + V + ++ R + +EE + Sbjct: 85 EYKGYTEAGFKDTKSEPLSTFSADVDTASYTNVRRLIENRNIVPEDAVRIEEFINYFDYD 144 Query: 202 EPAQLLEEERLRKEIAELRAKIERVPFIDTFDLRYKNYEKRPDPSSQAVMFCLMDVSGSM 261 P + + R + ++ K +++ P S + L+D SGSM Sbjct: 145 YPQPEDGS-AFGRYVEIADCPWNRDHKLMMVGIQGKELQQQETPPS--NLVFLIDSSGSM 201 Query: 262 DQSTK-DMAKRFYILLYLFLSRTYKNVEVVYIRHH--------TQAKEVDEHEFFYSQET 312 + K + + + +L L + + V Y + + + + Sbjct: 202 NSYDKLPLVQSAFSMLAEQLDKNDRISIVTYAGSSAVLLDGEKGSNTDEILEQLYSITAS 261 Query: 313 GGTIVSSALKLMDEVVKERYNPAQWNIYAAQASDGD-NWADDSPLCHEILAKKLLP-VVR 370 G T +K E+ +E + N A+DGD N S L + + Sbjct: 262 GSTNGEGGIKTAYELAEEHFIKGGNNRVI-LATDGDLNVGASSEEELTRLIETKRDNGIY 320 Query: 371 YYSYIEITRRAHQTLWREYEHLQSTFDNFAMQHIRDQDDIYPVFRELFHKQNATAK 426 + ++ D + ++ + Sbjct: 321 LSVLGFGEGNYKDARMEALADNGNGNFSYI--------DSEDEAERVLVQEMSGTL 368 >UniRef50_B0CCM8 von Willebrand factor, type A domain protein n=4 Tax=Cyanobacteria RepID=B0CCM8_ACAM1 Length = 686 Score = 200 bits (508), Expect = 9e-50, Method: Composition-based stats. Identities = 62/428 (14%), Positives = 119/428 (27%), Gaps = 45/428 (10%) Query: 27 KAQIKQSISEAINKRSVTDVDSGESVSIPTEDISEPMFHQGR--GGLRHRVHPGNDHFVQ 84 +A+ QS+++ S+ + S+ + +R R P N F Sbjct: 79 RAEFAQSLAQLTR--SLERLQDKSLSSVDAALLKRLQQDYATELSQVRQRRKPSNRRFGI 136 Query: 85 NDRIERPQGGGGGSGSGQGQASQDGEGQDEFVFQISKDEYLDLLF--------------- 129 + R RP G Q A + Q +F S Sbjct: 137 SPRRPRPTGLPPALTKAQ-PAPAETAAQSQFSRDQSGRMKSVAPPAGLAPPAPEPRFQDK 195 Query: 130 EDLALPNLKQNQQRQLTEYKTHRAGYTANGVPANISVVRSLQNSLARRTAMTAGKRRELH 189 + L LP + + +I V + +++ R ++ Sbjct: 196 DRLHLPGTFNTEDYKRINENPFFLPQRTPLSTFSIDVDTASYSNVRRFIRQGQLPPKDAV 255 Query: 190 ALEENLAIISNSEPAQLLEEERLRKEIAELRAKIERVPFIDTFDLRYKNYEKRPDPSSQA 249 LEE + + ++ A + L+ K EK + Sbjct: 256 RLEELINYFDYGYASPKGDQ-PFSVSTEVATAPWNNQHKLVHIGLKGKELEKEQ----PS 310 Query: 250 VMFCLMDVSGSMDQS-TKDMAKRFYILLYLFLSRTYKNVEVVYIR--------HHTQAKE 300 + L+DVSGSM + + K+ LL L + VVY K Sbjct: 311 NLVFLIDVSGSMKRPNKLALVKKSLCLLVHQLKPEDRVSLVVYAGRAGIVLPSTPGTQKA 370 Query: 301 VDEHEFFYSQETGGTIVSSALKLMDEVVKERYNPAQWNIYAAQASDGD-NWADDSPLCHE 359 + + G T ++ +K+ ++ + + N A+DGD N S E Sbjct: 371 TIMNAIDRLEAGGSTAGAAGIKMAYDMAERHFLKNGNNRVI-LATDGDFNVGQSSDAELE 429 Query: 360 ILAKKLLPVVRYYSY-IEITRRAHQTLWREYEHLQSTFDNFAMQHIRDQDDIYPVFRELF 418 L ++ + + T + + + D +++ Sbjct: 430 RLIEQKRDRGVFLTVLGYGTGNYKDNKMELLANKGNGNYAYI--------DTLLEAQKVL 481 Query: 419 HKQNATAK 426 Sbjct: 482 VNDLRGTL 489 >UniRef50_A4BJU7 Putative uncharacterized protein n=1 Tax=Reinekea blandensis MED297 RepID=A4BJU7_9GAMM Length = 555 Score = 199 bits (505), Expect = 2e-49, Method: Composition-based stats. Identities = 52/304 (17%), Positives = 90/304 (29%), Gaps = 22/304 (7%) Query: 134 LPNLKQNQQRQLTEYKTHRAGYTANGVPANISVVRSLQNSLARRTAMTAGKRRELHALEE 193 P + + T R T + V + + R + +EE Sbjct: 83 PPVSENRENYPKTPISPIRQVATDPVSTFSTDVDTASYTNARRFLNQGMRPPADSIRVEE 142 Query: 194 NLAIISNSEPAQLLEEERLRKEIAELRAKIERVPFIDTFDLRYKNYEKRPDPSSQAVMFC 253 + + PA ++ + + L+ + + P + Sbjct: 143 FINYFDYALPAPDTTNTPIQISTERTQTPWNPQTELVRVSLQSYRSDFKTLP--PLNLVF 200 Query: 254 LMDVSGSMDQSTK-DMAKRFYILLYLFLSRTYKNVEVVYIRHHTQAKEVD--------EH 304 L+DVSGSM+ K + +R + LL L + VY E Sbjct: 201 LLDVSGSMNSPDKLPLMQRSFNLLVSQLRPQDRVAIAVYAGQSGVVLEPTSGDQKAQINQ 260 Query: 305 EFFYSQETGGTIVSSALKLMDEVVKERYNPAQWNIYAAQASDGD-NWADDSPLCHEILAK 363 + GGT S+ + L ++ + Y P N +DGD N S + L + Sbjct: 261 AINQLRAGGGTHGSAGIHLAYDLAQANYLPDGINR-IFIGTDGDFNVGTTSLTELKALIE 319 Query: 364 KLLP-VVRYYSYIEITRRAHQTLWREYEHLQSTFDNFAMQHIRDQDDIYPVFRELFHKQN 422 + V T + L E + + + D Y R+LF Q Sbjct: 320 RKREAGVFLSVLGFGTGNYNDALMEELSNHGNGTAYYL--------DSYQEARKLFATQL 371 Query: 423 ATAK 426 A Sbjct: 372 AATL 375 >UniRef50_Q4KKB4 von Willebrand factor type A domain protein n=20 Tax=Proteobacteria RepID=Q4KKB4_PSEF5 Length = 582 Score = 198 bits (504), Expect = 2e-49, Method: Composition-based stats. Identities = 41/301 (13%), Positives = 82/301 (27%), Gaps = 23/301 (7%) Query: 138 KQNQQRQLTEYKTHRAGYTANGVPANISVVRSLQNSLARRTAMTAGKRRELHALEENLAI 197 + +Q Q + A + V ++ R + LEE + Sbjct: 92 EPREQYQKLPDNPIHSVAEAPVSTFSADVDTGAYANVRRLLNQGSLPPEGAVRLEELVNY 151 Query: 198 ISNSEPAQLLEEERLRKEIAELRAKIERVPFIDTFDLRYKNYEKRPDPSSQAVMFCLMDV 257 + + ++ + + A + L+DV Sbjct: 152 FPYDYALPTDGS-PFGVTTELAPSPWNPHTRLLRIGIKASDRAVAEL--APANLVFLVDV 208 Query: 258 SGSMDQST-KDMAKRFYILLYLFLSRTYKNVEVVYIRHH--------TQAKEVDEHEFFY 308 SGSMD+ + K LL L + VVY + K Sbjct: 209 SGSMDRREGLPLVKSTLKLLVDQLRDQDRVSLVVYAGESRVVLEPTSGRDKAKIRTAIDQ 268 Query: 309 SQETGGTIVSSALKLMDEVVKERYNPAQWNIYAAQASDGD-NWADDSPLCHEILAKKLLP 367 G T +S ++L ++ ++ + N A+DGD N + +A + Sbjct: 269 LTAGGSTAGASGIQLAYQMAQQGFIDQGINR-ILLATDGDFNVGVSDFDSLKAMAAEKRK 327 Query: 368 -VVRYYSYIEITRRAHQTLWREYEHLQSTFDNFAMQHIRDQDDIYPVFRELFHKQNATAK 426 V + ++ L + + D R++ Q ++ Sbjct: 328 SGVSLTTLGFGVDNYNEHLMEQLADAGDGNYAYI--------DNLREARKVLVDQLSSTL 379 Query: 427 G 427 Sbjct: 380 A 380 >UniRef50_B5EPS6 Putative uncharacterized protein n=2 Tax=Acidithiobacillus ferrooxidans RepID=B5EPS6_ACIF5 Length = 434 Score = 198 bits (504), Expect = 2e-49, Method: Composition-based stats. Identities = 166/437 (37%), Positives = 253/437 (57%), Gaps = 17/437 (3%) Query: 1 MTWFIDRRLNGKNKSMVNRQRFLRRYKAQIKQSISEAINKRSVTD-VDSGESVSIPTEDI 59 M+ IDRR +G +S N+ R RR +A++K ++ + S+ D ++ + VSIPT D+ Sbjct: 1 MSMIIDRRSSG-TRSTANQDRLQRRVRARLKVAVEKMARSGSIEDLANTDQPVSIPTRDL 59 Query: 60 SEPMFHQG-RGGLRHRVHPGNDHFVQNDRIERPQGGGGGSGSGQGQASQDGEGQDEFVFQ 118 EP F + RV PGN + + D I +P+GGG G G DG G+DE Sbjct: 60 HEPSFRRDLSDTSWERVLPGNKEYQRGDEINKPEGGGSGKGRAGAP---DGLGEDEVAIV 116 Query: 119 ISKDEYLDLLFEDLALPNLKQNQQRQLTEYKTHRAGYTANGVPANISVVRSLQNSLARRT 178 +S DE+LDLLF+ LALPNL++ Q + + RAG+ +G P+ + V R+++ + ARR Sbjct: 117 LSADEFLDLLFDGLALPNLRKMAQGDIQADQWRRAGFIKDGSPSRMHVGRTMRAARARRL 176 Query: 179 AMTAGKRRELHALEENLA----------IISNSEPAQLLEEERLRKEIAELRAKIERVPF 228 A+ AGKRREL L + + L +I L KI+ +PF Sbjct: 177 ALRAGKRRELQDLLDARNVLQEEIQGRLAQKQDVSVEQERLSELNHQIDALERKIKAIPF 236 Query: 229 IDTFDLRYKNYEKRPDPSSQAVMFCLMDVSGSMDQSTKDMAKRFYILLYLFLSRTYKNVE 288 ID DLR+ + +++P P + AVMFC+MDVSGSM + KD+AKRF++LLYLFL R Y+ V+ Sbjct: 237 IDEADLRFAHIDQQPHPITNAVMFCVMDVSGSMGEKEKDLAKRFFLLLYLFLHRHYQAVQ 296 Query: 289 VVYIRHHTQAKEVDEHEFFYSQETGGTIVSSALKLMDEVVKERYNPAQWNIYAAQASDGD 348 +V+I+HH+ A E E FF ++E GGT+VS A+ L +E++++R+ P +WN+Y AQ SDGD Sbjct: 297 MVFIKHHSTASECSEQAFFGAREGGGTLVSPAIILSEEIMRQRFPPDRWNVYLAQVSDGD 356 Query: 349 NWADDSPLCHEILAKKLLPVVRYYSYIEITRRAHQTLWREYEHLQSTFDNFAMQHIRDQD 408 N+ D+ + E L L + + Y+E+ R + L R Y+ + F +++ Sbjct: 357 NYFADNAVVEEHLLNLLPRLRNLF-YLEVNRDSESDLLRLYDAIAQDFPELVTARASERE 415 Query: 409 DIYPVFRELFHKQNATA 425 DIYP+FR LF + + Sbjct: 416 DIYPMFRTLFATEETPS 432 >UniRef50_C8SEV7 von Willebrand factor type A n=4 Tax=Rhizobiales RepID=C8SEV7_9RHIZ Length = 718 Score = 198 bits (503), Expect = 3e-49, Method: Composition-based stats. Identities = 47/404 (11%), Positives = 107/404 (26%), Gaps = 23/404 (5%) Query: 35 SEAINKRSVTDVDSGESVSIPT-EDISEPMFHQGRGGLRHRVHPGNDHFVQNDRIERPQG 93 + V D+ +++ P SEP G + + Sbjct: 139 KDKKADADVESRDATDALVAPASPPKSEPTVAGGLAQQNAQGQVAPAEPAPARSGGQRVI 198 Query: 94 GGGGSGSGQGQASQDGEGQDEFVFQISKDEYLDLLFEDLALPNLKQNQQRQLTEYKTHRA 153 + ++ + D P + + Q + A Sbjct: 199 MSLTPPPQADGTTSRIARMPAAESKLMTPQQPATAPADQIAPQEENRDRVQDFKTNPVHA 258 Query: 154 GYTANGVPANISVVRSLQNSLARRTAMTAGKRRELHALEENLAIISNSEPAQLLEEERLR 213 +I V + + + R + + +EE + Sbjct: 259 ALEDPVSTFSIDVDTASYSFVRRSLKEGFVPQADTVRVEEMINYFPYDWKGPDSASTPFN 318 Query: 214 KEIAELRAKIERVPFIDTFDLRYKNYEKRPDPSSQAVMFCLMDVSGSMDQSTK-DMAKRF 272 ++ + + ++ + + P +A + L+DVSGSMD+ K + K Sbjct: 319 STVSVMPTPWNTHTKLMHVAIKGFDVKPTEQP--KANLVFLIDVSGSMDEPDKLPLLKSA 376 Query: 273 YILLYLFLSRTYKNVEVVYIRHHTQA--------KEVDEHEFFYSQETGGTIVSSALKLM 324 + LL L V Y K+ + Q G T + +K Sbjct: 377 FRLLVSKLKADDTISIVTYAGDAGTVLMPTKIAEKDKILNAIDNLQPGGSTAGEAGIKEA 436 Query: 325 DEVVKERYNPAQWNIYAAQASDGD-NWADDSPLCHEILAKKLLP-VVRYYSYIEITRRAH 382 ++ ++ + N A+DGD N + L ++ V + + Sbjct: 437 YKLAQQSFIKDGVNRV-MLATDGDFNVGQTDDDDLKRLIEQERKTGVFLSVFGFGRGNLN 495 Query: 383 QTLWREYEHLQSTFDNFAMQHIRDQDDIYPVFRELFHKQNATAK 426 + + + + D ++ + ++ Sbjct: 496 DEMMQTIAQNGNGTAAYI--------DTLAEAEKVLVEDASSTL 531 >UniRef50_C6VVX3 von Willebrand factor type A n=17 Tax=Bacteria RepID=C6VVX3_DYAFD Length = 625 Score = 198 bits (502), Expect = 4e-49, Method: Composition-based stats. Identities = 42/304 (13%), Positives = 90/304 (29%), Gaps = 23/304 (7%) Query: 134 LPNLKQNQQRQLTEYKTHRAGYTANGVPANISVVRSLQNSLARRTAMTAGKRRELHALEE 193 L + + + + ++ V R+ +++ R + +EE Sbjct: 137 LAMPQATESYKPINENGFLSVGQQPVTTFSVDVDRAAYSNVRRFLNNGQMPPEDAVRIEE 196 Query: 194 NLAIISNSEPAQLLEEERLRKEIAELRAKIERVPFIDTFDLRYKNYEKRPDPSSQAVMFC 253 + P E + + + L+ K +S + Sbjct: 197 MINYFDYDYPQPRGEH-PVAIVAETTDSPWNPGLKLVHIGLQAKTVSAENLSAS--NLVF 253 Query: 254 LMDVSGSMDQ-STKDMAKRFYILLYLFLSRTYKNVEVVYIRHHTQA--------KEVDEH 304 L+DVSGSM++ + + K+ + LL L K V Y K+ + Sbjct: 254 LIDVSGSMNEANKLPLLKQAFKLLADQLRVEDKISIVAYAGSAGMVLAPTSGSEKKTIKD 313 Query: 305 EFFYSQETGGTIVSSALKLMDEVVKERYNPAQWNIYAAQASDGD-NWADDSPLCHEILAK 363 + G T ++L ++ K+ + P N A+DGD N + + L + Sbjct: 314 ALDKLEAGGSTAGGEGIELAYDLAKKHFLPKGNNRVI-LATDGDFNVGISNESELQKLIE 372 Query: 364 KLLPVVRYYSY-IEITRRAHQTLWREYEHLQSTFDNFAMQHIRDQDDIYPVFRELFHKQN 422 + + S + + + D R++F ++ Sbjct: 373 EKRKAGIFLSVMGFGMGNYKDSHVETLADKGNGNYAYI--------DNIQEARKVFVQEF 424 Query: 423 ATAK 426 Sbjct: 425 GGTL 428 >UniRef50_A3D1E9 von Willebrand factor, type A n=8 Tax=Shewanella RepID=A3D1E9_SHEB5 Length = 642 Score = 197 bits (501), Expect = 5e-49, Method: Composition-based stats. Identities = 45/399 (11%), Positives = 96/399 (24%), Gaps = 28/399 (7%) Query: 39 NKRSVTDVDSGESVSIPTEDISEPMFHQGRGGLRHRVHPGNDHFVQNDRIERPQGGGGGS 98 + + ++ + ++ R + + Sbjct: 36 RGNGIPSASNTAALLLVAVSLTACGGKGAEVEHRQAEQQAEQRHQVASQRQAEMRDAAKV 95 Query: 99 GSGQGQASQDGEGQDEFVFQISKDEYLDLLFEDLALPNLKQNQQRQLTEYKTHRAGYTAN 158 + A + + + A+P + ++Q+ Sbjct: 96 EMARVAAPMQMSSNGAVMGMS----IAPMPRDYAAIPLAQNKFEQQVQNGIMVAGEI--P 149 Query: 159 GVPANISVVRSLQNSLARRTAMTAGKRRELHALEENLAIISNSEPAQLLEEERLRKEIAE 218 I V +L R + +EE L + P Sbjct: 150 VSTFFIDVDTGSYATLRRMLREGRLPEKGTVRVEEMLNYFAYDYPLPAKNAAPFSVTTEL 209 Query: 219 LRAKIERVPFIDTFDLRYKNYEKRPDPSSQAVMFCLMDVSGSM-DQSTKDMAKRFYILLY 277 + + L+ + K +S + L+DVSGSM + + LL Sbjct: 210 APSPYNDDMMLLRIGLKGYDLPKSQLGAS--NLVFLLDVSGSMASADKLPLLQTALKLLT 267 Query: 278 LFLSRTYKNVEVVYIRH--------HTQAKEVDEHEFFYSQETGGTIVSSALKLMDEVVK 329 LS K VVY + + G + ++ K Sbjct: 268 AQLSAQDKVSIVVYAGAAGVVLDGVSGNDTQTLTYALEQLSAGGSINGGQGITQAYQLAK 327 Query: 330 ERYNPAQWNIYAAQASDGD-NWADDSPLCHEILA-KKLLPVVRYYSYIEITRRAHQTLWR 387 + + P N A+DGD N L K+ + + + L Sbjct: 328 KHFIPNGINRVI-LATDGDFNVGVTDFDDLIALIEKEKDHGIGLTTLGFGLGNYNDQLME 386 Query: 388 EYEHLQSTFDNFAMQHIRDQDDIYPVFRELFHKQNATAK 426 + + + D R++ + ++ Sbjct: 387 QLADKGNGNYAYI--------DTLNEARKVLVDELSSTL 417 >UniRef50_C0YQB8 von Willebrand factor, type A n=2 Tax=Flavobacteriaceae RepID=C0YQB8_9FLAO Length = 800 Score = 197 bits (500), Expect = 7e-49, Method: Composition-based stats. Identities = 47/313 (15%), Positives = 84/313 (26%), Gaps = 23/313 (7%) Query: 125 LDLLFEDLALPNLKQNQQRQLTEYKTHRAGYTANGVPANISVVRSLQNSLARRTAMTAGK 184 ++ + P + N+ +I V + +++ R Sbjct: 317 EEINKQQQITPVTQNNESYDAFVENPFELTRNQPLSTFSIDVDNASYSNVRRMINNGQVV 376 Query: 185 RRELHALEENLAIISNSEPAQLLEEERLRKEIAELRAKIERVPFIDTFDLRYKNYEKRPD 244 + +EE + P E A + L+ KN Sbjct: 377 DKNAVRIEEMVNYFKYDYPQPKNEN-PFSINTEYSDAPWNPKHKLLKIGLQGKNLPMDKL 435 Query: 245 PSSQAVMFCLMDVSGSMDQS-TKDMAKRFYILLYLFLSRTYKNVEVVYIRHHTQA----- 298 P+S + L+DVSGSM + K + +L L K VVY Sbjct: 436 PAS--NLVFLIDVSGSMSDENKLPLLKSSFKVLLNQLRPKDKVGIVVYAGSAGMVLPPTS 493 Query: 299 ---KEVDEHEFFYSQETGGTIVSSALKLMDEVVKERYNPAQWNIYAAQASDGD-NWADDS 354 K+ Q G T + ++L ++ +E + N A+DGD N S Sbjct: 494 AGEKDKIIEALDRLQAGGSTAGGAGIELAYKLAQENFVKEGNNRVII-ATDGDFNVGTSS 552 Query: 355 PLCHEILAKKLLP-VVRYYSYIEITRRAHQTLWREYEHLQSTFDNFAMQHIRDQDDIYPV 413 + L + V + + D Sbjct: 553 ISDLKTLIEDRRKSGVFLTCLGFGMGNYKDNTLETLADKGNGNYAYI--------DNMQE 604 Query: 414 FRELFHKQNATAK 426 + K+ A + Sbjct: 605 ANKFLGKEFAGSM 617 >UniRef50_A3ZT14 Putative uncharacterized protein n=1 Tax=Blastopirellula marina DSM 3645 RepID=A3ZT14_9PLAN Length = 616 Score = 196 bits (498), Expect = 1e-48, Method: Composition-based stats. Identities = 52/393 (13%), Positives = 94/393 (23%), Gaps = 52/393 (13%) Query: 46 VDSGESVSIPTEDISEPMFHQGRGGLRHRVHPGNDHFVQNDRIERPQGGGGGSGSGQGQA 105 + + R P + ++ G G Sbjct: 80 AVEMNRDRVAGREKEAGKVRSDARQDRLATLPTESRRLGIEQPNAAPGFMPQLDGIAGHG 139 Query: 106 SQDGEGQDEFVFQISKDEYLDLLFEDLALPNLKQNQQRQLTEYKTHRAGYTANGVPANIS 165 G G D+F + E RA +I Sbjct: 140 EGPGVGGDKFAY----------------------------VENNPFRAVADEPLSTFSID 171 Query: 166 VVRSLQNSLARRTAM-TAGKRRELHALEENLAIISNSEPAQLLEEERLRKEIAELRAKIE 224 V + + + + +EE + + + + + Sbjct: 172 VDTASYSKIRSYLIDYHQLPPQGAVRVEELINYFTYDYATPTDQ-KPFAANVEAAACPWN 230 Query: 225 RVPFIDTFDLRYKNYEKRPDPSSQAVMFCLMDVSGSMDQ-STKDMAKRFYILLYLFLSRT 283 + ++ K P+S + L+DVSGSM+ + K+ LL L Sbjct: 231 AEHRLVRIGIKGKEIANAERPAS--NLVFLLDVSGSMNNARKLPLLKQGMKLLVDQLGEN 288 Query: 284 YKNVEVVYIRHHTQ--------AKEVDEHEFFYSQETGGTIVSSALKLMDEVVKERYNPA 335 K VVY K Q G T ++L + E + Sbjct: 289 DKVAIVVYAGAAGMVLNSTNGDDKSTIMEALDRLQAGGSTNGGQGIELAYQAATENFIKG 348 Query: 336 QWNIYAAQASDGD-NWADDSPLCHEILAKKLLPVVRYYSY-IEITRRAHQTLWREYEHLQ 393 N +DGD N S +A + S T + + E Sbjct: 349 GVNRVI-LCTDGDFNVGVTSTSDLVTMAADKAKSGVFLSVMGFGTGNHNDAMMEELSGKA 407 Query: 394 STFDNFAMQHIRDQDDIYPVFRELFHKQNATAK 426 + F D +++ +Q + Sbjct: 408 NGNYAFI--------DTITEAKKVLVEQMSGTL 432 >UniRef50_B4DA43 von Willebrand factor type A n=1 Tax=Chthoniobacter flavus Ellin428 RepID=B4DA43_9BACT Length = 883 Score = 196 bits (498), Expect = 1e-48, Method: Composition-based stats. Identities = 45/292 (15%), Positives = 82/292 (28%), Gaps = 21/292 (7%) Query: 144 QLTEYKTHRAGYTANGVPANISVVRSLQNSLARRTAMTAGKRRELHALEENLAIISNSEP 203 +I V + + R +EE L P Sbjct: 319 DTLTENAFLNVPENPLSTFSIDVDTASYAIVRRYLNDNHLPPTGAVRIEELLNYFPYDYP 378 Query: 204 AQLLEEERLRKEIAELRAKIERVPFIDTFDLRYKNYEKRPDPSSQAVMFCLMDVSGSMDQ 263 + + L+ + K P S + L+DVSGSM+ Sbjct: 379 QPQG-AAPFSATMEVATCPWAPEHRLVRVGLKGREIPKDERPPS--NLVFLIDVSGSMNM 435 Query: 264 STK-DMAKRFYILLYLFLSRTYKNVEVVYIR------HHTQAKEVDEHEFFYSQETGGTI 316 K + ++ + LL L + V Y TQ KE + GGT Sbjct: 436 PNKLPLLQKCFSLLVEQLGPKDRVSIVTYASGTKLVLEPTQDKEAMQTAIDGLHAGGGTH 495 Query: 317 VSSALKLMDEVVKERYNPAQWNIYAAQASDGD-NWADDSPLCHEILAKKLLPVVRYYSY- 374 SS + L + ++ + P N A+DGD N + + + + + Sbjct: 496 GSSGIDLAYRMAQQSFIPGGTNRVI-LATDGDWNIGITNQSELLSMITRKAKSGVFLTVL 554 Query: 375 IEITRRAHQTLWREYEHLQSTFDNFAMQHIRDQDDIYPVFRELFHKQNATAK 426 ++ + + + D R++F Q ++ Sbjct: 555 GFGLDNLKDSMLVKLADHGNGHYAYI--------DTEQEARKVFVDQLSSTL 598 >UniRef50_A1ZHE2 Von Willebrand factor type A domain protein n=1 Tax=Microscilla marina ATCC 23134 RepID=A1ZHE2_9SPHI Length = 704 Score = 195 bits (496), Expect = 2e-48, Method: Composition-based stats. Identities = 48/342 (14%), Positives = 100/342 (29%), Gaps = 43/342 (12%) Query: 105 ASQDGEGQDEFVFQISKDEYLDLLFEDLALPNLKQNQQRQLTEYKTHRAGYTANGVPANI 164 + D +F + + + + E + +NQ Q+ + +I Sbjct: 205 GAGDLSSDLKFDRKAA---FRNAFPEGERYATIYENQFYQVGQN---------PLSTFSI 252 Query: 165 SVVRSLQNSLARRTAMTAGKRRELHALEENLAIISNSEPAQ--------LLEEERLRKEI 216 V + +++ R + +EE + P L+ Sbjct: 253 DVDNASYSNVRRFVNDGQPLPKNAVRVEEMINYFEYDYPQPTPTKDKEGKLQTHPFSVNT 312 Query: 217 AELRAKIERVPFIDTFDLRYKNYEKRPDPSSQAVMFCLMDVSGSMDQSTK-DMAKRFYIL 275 + L+ +N + + +S A + L+D SGSMD K + KR + + Sbjct: 313 EYGTCPWNPHHKLLQIGLQGENLQTKN--ASPANLVFLVDASGSMDSEDKLPLLKRSFKV 370 Query: 276 LYLFLSRTY-KNVEVVYIRHHT--------QAKEVDEHEFFYSQETGGTIVSSALKLMDE 326 L L+ + K V Y +E + G T ++L + Sbjct: 371 LLKQLTDSRTKIAIVAYAGASGLVLPATSVSHREKILTALENIESGGSTAGGEGIELAYK 430 Query: 327 VVKERYNPAQWNIYAAQASDGD-NWADDSPLCHEILAKKLLP-VVRYYSYIEITRRAHQT 384 + ++ + N A+DGD N S L V T + + Sbjct: 431 IAQQAFIAGGNNRVI-LATDGDFNVGLSSDEELMQLISNKRKSGVYLTCLGFGTGNLNDS 489 Query: 385 LWREYEHLQSTFDNFAMQHIRDQDDIYPVFRELFHKQNATAK 426 + + + + + D +++ K Sbjct: 490 MMEKLTNAGNGNYYYI--------DGINEAKKVLAKNLTGTL 523 >UniRef50_C9RD74 Putative uncharacterized protein n=1 Tax=Ammonifex degensii KC4 RepID=C9RD74_AMMDK Length = 371 Score = 194 bits (493), Expect = 4e-48, Method: Composition-based stats. Identities = 102/410 (24%), Positives = 164/410 (40%), Gaps = 53/410 (12%) Query: 12 KNKSMVNRQRFLRRYKAQIKQSISEAINKRSVTDVDSGESVSIPTEDISEPMFHQGRGGL 71 K + +R + K I+Q + E I + S+ D V IP + E F Sbjct: 13 FRKGEEDARRHQEKLKEIIRQRLPELITEESLILADDRRKVRIPLRLVEEFRFRF-ASHQ 71 Query: 72 RHRVHPGNDHFVQNDRIERPQGGGGGSGSGQGQASQDGEGQDEFVFQISKDEYLDLLFED 131 V ++ I P G GG + G D + ++S +E +++FE+ Sbjct: 72 EMLVGQAGSQPGTDETIVFPGIGRGGGAGTE-------PGIDYYEAEVSVEEIAEVVFEE 124 Query: 132 LALPNLKQNQQRQLTEYKTHRAGYTANGVPANISVVRSLQNSLARRTAMTAGKRRELHAL 191 LALP+ K A G+ A + R+L N+L R Sbjct: 125 LALPHYKPKNTAN-RGIAEEWADLRRQGIRACLDRRRTLLNALKRHAKEGR--------- 174 Query: 192 EENLAIISNSEPAQLLEEERLRKEIAELRAKIERVPFIDTFDLRYKNYEKRPDPSSQAVM 251 + + DLR++ + P ++AV+ Sbjct: 175 --------------------------------KGEFRLCPSDLRFRVWRSIESPEARAVV 202 Query: 252 FCLMDVSGSMDQSTKDMAKRFYILLYLFLSRTYKNVEVVYIRHHTQAKEVDEHEFFYSQE 311 ++D SGSM K +A+ F+ + FL Y NVEVVY+ HHT+A+E EFF E Sbjct: 203 LAMLDTSGSMGPLEKYLARSFFFWMVRFLEANYANVEVVYLAHHTEARETTASEFFRKGE 262 Query: 312 TGGTIVSSALKLMDEVVKERYNPAQWNIYAAQASDGDNWADDSPLCHEILAKKLLPVVRY 371 +GGT SS +L ++++ RY P ++NIYA SDGDN D+ C L +LL V Sbjct: 263 SGGTRCSSVYELALDIIETRYPPTEYNIYAFHFSDGDNLPADNERC-MELIGRLLEVANL 321 Query: 372 YSYIEITRRA-HQTLWREYEHLQSTFDNFAMQHIRDQDDIYPVFRELFHK 420 Y EI + + + + + +R++ D+Y + F + Sbjct: 322 VGYGEIEGPYFYTSTLKTV-YQSIAHPRLVVVTLRERKDVYRALKAFFAR 370 >UniRef50_C5BTM6 von Willebrand factor type A domain protein n=1 Tax=Teredinibacter turnerae T7901 RepID=C5BTM6_TERTT Length = 689 Score = 194 bits (493), Expect = 4e-48, Method: Composition-based stats. Identities = 44/311 (14%), Positives = 86/311 (27%), Gaps = 27/311 (8%) Query: 130 EDLALP---NLKQNQQRQLTEYKTHRAGYTANGVPANISVVRSLQNSLARRTAMTAGKRR 186 DL P + +I V + + + R+ ++ Sbjct: 209 PDLEPPHQLETADRDHFDTVATNPIKVTREEPVSTFSIDVDTASYSFVRRQLNRGQLPQK 268 Query: 187 ELHALEENLAIISNSEPAQLLEEERLRKEIAELRAKIERVPFIDTFDLRYKNYEKRPDPS 246 LEE + P + I + A + + ++ Sbjct: 269 AAVRLEEMVNYFPYDYPLPSAATAPFKPTITVIPAPWNQAKRLVHIGIKALPLA----HP 324 Query: 247 SQAVMFCLMDVSGSMDQSTK-DMAKRFYILLYLFLSRTYKNVEVVYIRHHTQAKEVD--- 302 +A + L+DVSGSM K + K+ LL L T VVY E Sbjct: 325 PKANLVFLLDVSGSMGSPDKLPLVKQSMELLLSGLQPTDTVSIVVYAGAAGTVLEPTPVA 384 Query: 303 -----EHEFFYSQETGGTIVSSALKLMDEVVKERYNPAQWNIYAAQASDGD-NWADDSPL 356 G T + ++L ++ + Y N A+DGD N P Sbjct: 385 EQQKILAALDRLNAGGSTAGAQGIELAYQLAEANYQRDAVNR-IILATDGDFNVGIADPE 443 Query: 357 CHEILAKKLLP-VVRYYSYIEITRRAHQTLWREYEHLQSTFDNFAMQHIRDQDDIYPVFR 415 + ++ + + + L ++ + + D + Sbjct: 444 QLKGYVERKRANGIELSILGFGSGNYNDALMQQLAQNGNGVAAYI--------DTLSEAQ 495 Query: 416 ELFHKQNATAK 426 ++ +Q + Sbjct: 496 KVLVEQASGTL 506 >UniRef50_C8N8N5 von Willebrand factor type A domain protein n=2 Tax=Gammaproteobacteria RepID=C8N8N5_9GAMM Length = 563 Score = 193 bits (489), Expect = 1e-47, Method: Composition-based stats. Identities = 37/304 (12%), Positives = 84/304 (27%), Gaps = 24/304 (7%) Query: 136 NLKQNQQRQLTEYKTHRAGYTANGVPANISVVRSLQNSLARRTA-MTAGKRRELHALEEN 194 ++ ++ ++ A +I V +++ R + +EE Sbjct: 92 EVQNRERYAHSDANPVHRVSDAPVSTFSIDVDTGSYSNIRRMLTRENRLPPADAVRVEEI 151 Query: 195 LAIISNSEPAQLLEEERLRKEIAELRAKIERVPFIDTFDLRYKNYEKRPDPSSQAVMFCL 254 L + P + + + + + ++ + P A + L Sbjct: 152 LNYFAYGYPLPQDG-KPFAVHTQTVDSPWQADAKLIRIAIQAADLAPEKRP--PANLVFL 208 Query: 255 MDVSGSMDQSTK-DMAKRFYILLYLFLSRTYKNVEVVYIRH--------HTQAKEVDEHE 305 +D SGSMD K + K+ L + + Y KE Sbjct: 209 IDTSGSMDDPDKLPLVKKTVCHFAEALRADDRISLITYSGSTAEILPPTAGDQKETIIAA 268 Query: 306 FFYSQETGGTIVSSALKLMDEVVKERYNPAQWNIYAAQASDGD-NWADDSPLCHEILAKK 364 + G T AL++ + + Y N A+DGD N P + Sbjct: 269 LKPLRAHGATAGGEALRMAYDAAAKNYRKDGINR-ILLATDGDFNVGISDPATLKNYVAD 327 Query: 365 LLP-VVRYYSYIEITRRAHQTLWREYEHLQSTFDNFAMQHIRDQDDIYPVFRELFHKQNA 423 + + + + + + ++ D +++ +Q Sbjct: 328 KRKSGISLTTLGYGSGNYNDEMMEQLADAGDGNYSYI--------DSEAEAKKVLVRQLT 379 Query: 424 TAKG 427 + Sbjct: 380 STLA 383 >UniRef50_P76481 Uncharacterized protein yfbK n=38 Tax=Enterobacteriaceae RepID=YFBK_ECOLI Length = 575 Score = 193 bits (489), Expect = 1e-47, Method: Composition-based stats. Identities = 41/305 (13%), Positives = 81/305 (26%), Gaps = 20/305 (6%) Query: 140 NQQRQLTEYKTHRAGYTANGVPANISVVRSLQNSLARRTAMTAGKRRELHALEENLAIIS 199 + Q + + ++ V ++ R + +EE + Sbjct: 102 TARYQQFDDNPVKQVAQNPLATFSLDVDTGSYANVRRFLNQGLLPPPDAVRVEEIVNYFP 161 Query: 200 NSE------PAQLLEEERLRKEIAELRAKIERVPFIDTFDLRYKNYEKRPDPSSQAVMFC 253 + + A + D+ K+ + P+S + Sbjct: 162 SDWDIKDKQSIPASKPIPFAMRYELAPAPWNEQRTLLKVDILAKDRKSEELPAS--NLVF 219 Query: 254 LMDVSGSM-DQSTKDMAKRFYILLYLFLSRTYKNVEVVYIRHH--------TQAKEVDEH 304 L+D SGSM + + LL L V Y K Sbjct: 220 LIDTSGSMISDERLPLIQSSLKLLVKELREQDNIAIVTYAGDSRIALPSISGSHKAEINA 279 Query: 305 EFFYSQETGGTIVSSALKLMDEVVKERYNPAQWNIYAAQASDGD-NWADDSPLCHEILAK 363 G T + L+L + + + N A+DGD N D P E + K Sbjct: 280 AIDSLDAEGSTNGGAGLELAYQQATKGFIKGGINR-ILLATDGDFNVGIDDPKSIESMVK 338 Query: 364 KLLP-VVRYYSYIEITRRAHQTLWREYEHLQSTFDNFAMQHIRDQDDIYPVFRELFHKQN 422 K V ++ ++ + + + ++ Q + R++ Sbjct: 339 KQRESGVTLSTFGVGNSNYNEAMMVRIADVGNGNYSYIDTLSEAQKVLNSEMRQMLITVA 398 Query: 423 ATAKG 427 K Sbjct: 399 KDVKA 403 >UniRef50_A9FJ88 Uncharacterized conserved protein involved in stress response n=21 Tax=Bacteria RepID=A9FJ88_SORC5 Length = 405 Score = 193 bits (489), Expect = 1e-47, Method: Composition-based stats. Identities = 101/405 (24%), Positives = 177/405 (43%), Gaps = 47/405 (11%) Query: 18 NRQRFLRRYKAQIKQSISEAINKRSVTDVDSGESVSIPTEDISEPMFHQGRGGLRHRVHP 77 + RF + + +I++++ + I++ + + VSIP I P F G R V Sbjct: 44 DHGRFRQIVRGRIRENLRKYISQGELIGRKGKDLVSIPIPQIDIPRFRFG-DKQRGGVGQ 102 Query: 78 GNDHFVQNDRIERPQGGGGGSGSGQGQASQDGEGQDEFVFQISKDEYLDLLFEDLALPNL 137 G+ + P GG GQGQA GEG ++ +E +L E+L LP++ Sbjct: 103 GDGN------PGDPVGGSDDKQPGQGQA-GSGEGDHLLEVDVTLEELAGILGEELELPDI 155 Query: 138 KQNQQRQLTEYKTHRAGYTANGVPANISVVRSLQNSLARRTAMTAGKRRELHALEENLAI 197 + + +++ +G G + R+ + +L R + Sbjct: 156 QDKGKSKISNAHDRYSGIRRVGPESLRHFKRTYREALKRMISSG---------------- 199 Query: 198 ISNSEPAQLLEEERLRKEIAELRAKIERVPFIDTFDLRYKNYEKRPDPSSQAVMFCLMDV 257 V D RY++++ +P + AV+ +MDV Sbjct: 200 ---------------------TFRPSAPVVVPVPDDKRYRSWKTITEPVANAVIIYMMDV 238 Query: 258 SGSMDQSTKDMAKRFYILLYLFLSRTYKNVEVVYIRHHTQAKEVDEHEFFYSQETGGTIV 317 SGSM K++ + + +L+R YK +E +I H A+EVD FF+++E+GGT++ Sbjct: 239 SGSMGDEQKEIVRIESFWIDAWLTRQYKGLESRFIIHDAIAREVDRDTFFHTRESGGTMI 298 Query: 318 SSALKLMDEVVKERYNPAQWNIYAAQASDGDNWADDSP-LCHEILAKKLLPVVRYYSYIE 376 SSA KL +++ Y P +WNIY SDGDNW+ D C ++L ++LP V ++Y + Sbjct: 299 SSAYKLCSQIIDNDYPPDEWNIYPFHFSDGDNWSMDDTLSCVDVLKTQILPRVNMFAYGQ 358 Query: 377 ITRRAHQ-TLWREYEHLQSTFDNFAMQHIRDQDDIYPVFRELFHK 420 + ++ + S D + IRD+D I ++ K Sbjct: 359 VESPYGSGQFIKDLKEHFSQDDRVVVSEIRDKDAIVGSIKDFLGK 403 >UniRef50_A8SXC8 Putative uncharacterized protein n=2 Tax=Clostridiales RepID=A8SXC8_9FIRM Length = 612 Score = 191 bits (484), Expect = 5e-47, Method: Composition-based stats. Identities = 37/329 (11%), Positives = 81/329 (24%), Gaps = 32/329 (9%) Query: 110 EGQDEFVFQISKDEYLDLLFEDLALPNLKQNQQRQLTEYKTHRAGYTANGVPANISVVRS 169 G V S Y ++ ++ ++ +N + + Sbjct: 100 AGDTAMVTDTSNSMYSEVAYDTREYDSMTENGF---------VSTVDRPLSTFAADRDTA 150 Query: 170 LQNSLARRTAMTAGKRRELHALEENLAIISNSEPAQLLEEERLRKEIAELRAKIERVPFI 229 +++ + +EE L + + + E+ + + Sbjct: 151 SYSNVRSYIESGSLPPDGAVRIEEMLNYFTYDYRKKPEDGEKFSIYTEYSDCPWNKDTKL 210 Query: 230 DTFDLRYKNYEKRPDPSSQAVMFCLMDVSGSM-DQSTKDMAKRFYILLYLFLSRTYKNVE 288 + + S + L+D SGSM D + + ++ + +L L + Sbjct: 211 MMVGINTDEIDFGDKKPS--NLVFLIDTSGSMYDDNKLPLVQQSFAMLAENLDENDRVSI 268 Query: 289 VVYIRHHTQA--------KEVDEHEFFYSQETGGTIVSSALKLMDEVVKERYNPAQWNIY 340 V Y T + G T A+ E+ ++ + N Sbjct: 269 VTYAGEDTVVLSGTPGSEQYTISEALSNMTAEGCTNGGDAIITAYELAEKNFINGGNNRV 328 Query: 341 AAQASDGD-NWADDSPLCHEILAKKLLPVVRYY--SYIEITRRAHQTLWREYEHLQSTFD 397 A+DGD N S L + + T Sbjct: 329 I-LATDGDLNVGLTSESDLVDLITEEKKENNIFLSVLGFGTDNLKDNKLEALADNGDGSY 387 Query: 398 NFAMQHIRDQDDIYPVFRELFHKQNATAK 426 F D +++ + Sbjct: 388 AFI--------DSAYEAKKVLVDEMGGTL 408 >UniRef50_Q2N8R4 Von Willebrand factor type A domain protein n=2 Tax=Erythrobacter RepID=Q2N8R4_ERYLH Length = 580 Score = 191 bits (484), Expect = 5e-47, Method: Composition-based stats. Identities = 44/336 (13%), Positives = 93/336 (27%), Gaps = 26/336 (7%) Query: 106 SQDGEGQDEFVFQISKDEYLDLLFEDLA------LPNLKQNQQRQLTEYKTHRAGYTANG 159 S + V + + + + +P + ++ E + Sbjct: 68 SGSRIASEAAVAPDTSGQPAEAAGREYRYVMPVIVPQPEDRERYDGEEVSPVKIAAVEPL 127 Query: 160 VPANISVVRSLQNSLARRTAMTAGKRRELHALEENLAIISNSEPAQLLEEERLRKEIAEL 219 ++ V + R + + EE + + Sbjct: 128 STFSVDVDTGAYANARRFLSQGQMPPKAAVRTEEFINYFRYDYDRPQDRSQPFTVNFDAA 187 Query: 220 RAKIERVPFIDTFDLRYKNYEKRPDPSSQAVMFCLMDVSGSMDQSTK-DMAKRFYILLYL 278 R + L + E+ P A + LMDVSGSM + K + K L Sbjct: 188 RTPWNEDTRLIRIGLAGYDIERSERP--PANLVFLMDVSGSMGRPDKLPLVKTALAGLAG 245 Query: 279 FLSRTYKNVEVVYIR------HHTQAKEVDEHEFFYSQETGGTIVSSALKLMDEVVKERY 332 L K VVY T Q G T + ++L ++ ++ + Sbjct: 246 ELQPQDKVSIVVYAGAAGLVLEPTNDTRKIRAALNQLQAGGSTAGGAGIQLAYQIAEDNF 305 Query: 333 NPAQWNIYAAQASDGD-NWADDSPLCHEILAKKLLP-VVRYYSYIEITRRAHQTLWREYE 390 N A+DGD N S + +K + + T ++ + + Sbjct: 306 IEGGVNRVI-LATDGDFNVGVSSRDALIEMIEKKRDSGITLTTLGFGTGNYNEAMMEQIA 364 Query: 391 HLQSTFDNFAMQHIRDQDDIYPVFRELFHKQNATAK 426 + + + D +++ + ++ Sbjct: 365 NHGNGNYAYI--------DSALEAKKVLGDEMSSTL 392 >UniRef50_A9KS19 von Willebrand factor type A n=1 Tax=Clostridium phytofermentans ISDg RepID=A9KS19_CLOPH Length = 551 Score = 190 bits (482), Expect = 8e-47, Method: Composition-based stats. Identities = 40/298 (13%), Positives = 87/298 (29%), Gaps = 23/298 (7%) Query: 140 NQQRQLTEYKTHRAGYTANGVPANISVVRSLQNSLARRTAMTAGKRRELHALEENLAIIS 199 ++ + +++ + V + +++ R +EE L + Sbjct: 89 TEEYNAVIEQGYQSTKNHPLSTFSADVDTASYSNIRRMLKEGRRVDTGAVRIEEMLNYFN 148 Query: 200 NSEPAQLLEEERLRKEIAELRAKIERVPFIDTFDLRYKNYEKRPDPSSQAVMFCLMDVSG 259 + + ++ + + S+ + + L+DVSG Sbjct: 149 YDYKLPEGDS-PFGITTELSDCPWNPDTKLFLAGIQTEKIDFS--KSAPSNLVFLIDVSG 205 Query: 260 SM-DQSTKDMAKRFYILLYLFLSRTYKNVEVVYIRHHTQA--------KEVDEHEFFYSQ 310 SM D+ + +R ++LL L+ + V Y + T KE ++ + Sbjct: 206 SMMDEDKLPLVQRAFLLLTENLTEKDRISIVTYAGNDTVVLSGAKGNQKEKIQNAITELE 265 Query: 311 ETGGTIVSSALKLMDEVVKERYNPAQWNIYAAQASDGD-NWADDSPLCHEILAKKLLP-V 368 G T S ++ ++ E Y N A+DGD N S L ++ Sbjct: 266 AGGSTFGSKGIETAYQLAMENYIEGGNNRVI-LATDGDLNVGVTSESELTNLIEEKRKSG 324 Query: 369 VRYYSYIEITRRAHQTLWREYEHLQSTFDNFAMQHIRDQDDIYPVFRELFHKQNATAK 426 V T + + D R++ ++ Sbjct: 325 VALSVLGFGTGNIKDNKMEALADHGNGNYAYI--------DSLMEARKVLVEEMGATL 374 >UniRef50_A5F9T1 von Willebrand factor, type A n=9 Tax=Bacteria RepID=A5F9T1_FLAJ1 Length = 709 Score = 190 bits (482), Expect = 9e-47, Method: Composition-based stats. Identities = 53/319 (16%), Positives = 90/319 (28%), Gaps = 23/319 (7%) Query: 119 ISKDEYLDLLFEDLALPNLKQNQQRQLTEYKTHRAGYTANGVPANISVVRSLQNSLARRT 178 +S+ E L P L + + TA +I V + ++ R Sbjct: 219 LSEQELDKKLNIIRPNPTLPTQEDYDTFVENAFESPKTAPLSTFSIDVDNASYTNIRRFL 278 Query: 179 AMTAGKRRELHALEENLAIISNSEPAQLLEEERLRKEIAELRAKIERVPFIDTFDLRYKN 238 ++ +EE + + P E + I L+ KN Sbjct: 279 NSGQEVPKDAVRVEEMVNFFKYNYPQPKNEH-PFSINTEYSDSPWNSQNKILKIGLQGKN 337 Query: 239 YEKRPDPSSQAVMFCLMDVSGSMDQS-TKDMAKRFYILLYLFLSRTYKNVEVVYIRHHTQ 297 PSS + L+DVSGSM+ + K+ +L L T K VVY Sbjct: 338 IATNDLPSS--NLVFLIDVSGSMEDMNKLPLLKQSMKILVNELRPTDKVSIVVYAGAAGM 395 Query: 298 A--------KEVDEHEFFYSQETGGTIVSSALKLMDEVVKERYNPAQWNIYAAQASDGD- 348 K+ + G T + ++L ++ E + N A+DGD Sbjct: 396 VLPPTSGNEKKTIIKALDQLEAGGSTAGGAGIELAYKIATENFIKGGNNRVI-LATDGDF 454 Query: 349 NWADDSPLCHEILAKKLLP-VVRYYSYIEITRRAHQTLWREYEHLQSTFDNFAMQHIRDQ 407 N S E L ++ V + + + Sbjct: 455 NVGSSSNSDMEKLIEEKRKTGVFLTCLGYGMGNYKDSKMEILADKGNGNYAYI------- 507 Query: 408 DDIYPVFRELFHKQNATAK 426 D K+ + Sbjct: 508 -DNIQEANRFLGKEFKGSM 525 >UniRef50_A3PN61 von Willebrand factor, type A n=9 Tax=Rhodobacteraceae RepID=A3PN61_RHOS1 Length = 651 Score = 189 bits (480), Expect = 1e-46, Method: Composition-based stats. Identities = 53/392 (13%), Positives = 91/392 (23%), Gaps = 27/392 (6%) Query: 51 SVSIP-TEDISEPMFHQGRGGLRHRVHPGNDHFVQNDRIERPQGGGGGSGSGQGQASQDG 109 V +P P R+ + + P + S +G A Q Sbjct: 94 VVVMPNARLAEPPQTAPDAPEADARLTAAPEAGGGAETAGAPVPAEPRARSAEGAAPQTF 153 Query: 110 EGQDEFVFQISKDEYLDLLFEDLALPNLK----QNQQRQLTEYKTHRAGYTANGVPANIS 165 + L L + P ++ R +I Sbjct: 154 AADEAMPMAAPPAPDLALSKQAAEAPARALPQGDSEAFANAPDNPLRVTAEDPVSTFSID 213 Query: 166 VVRSLQNSLARRTAMTAGKRRELHALEENLAIISNSEPAQLLEEERLRKEIAELRAKIER 225 V + L RE +EE + PA R ++ R Sbjct: 214 VDTASYAILRSSLRAGQLPPREAVRIEEMINYFPYDYPAPENGTPPFRPTLSITRTPWNP 273 Query: 226 VPFIDTFDLRYKNYEKRPDPSSQAVMFCLMDVSGSMDQS-TKDMAKRFYILLYLFLSRTY 284 + L+ + P + L+D SGSM + K+ + L+ L Sbjct: 274 ETRLVHVALQGRMPAIEDRP--PLNLVFLIDTSGSMQDPAKLPLLKQSFGLMLGRLRPED 331 Query: 285 KNVEVVYIRHHTQA--------KEVDEHEFFYSQETGGTIVSSALKLMDEVVKERYNPAQ 336 + V Y + + G T L L E + Sbjct: 332 QVAIVTYAGSAGEVLAPTAANQRSTILSALDRLDAGGSTAGDEGLALAYRTASEMAGAGE 391 Query: 337 WNIYAAQASDGD-NWADDSPLCHEILAK-KLLPVVRYYSYIEITRRAHQTLWREYEHLQS 394 A+DGD N P L + V + + Sbjct: 392 VTRVV-LATDGDFNLGISDPEELARLVAHERDTGVYLSVLGFGRGNLDDATMQALAQNGN 450 Query: 395 TFDNFAMQHIRDQDDIYPVFRELFHKQNATAK 426 + D +++ Q + A Sbjct: 451 GQAAYI--------DSLNEAQKVLVDQLSGAL 474 >UniRef50_C6BAR1 von Willebrand factor type A n=7 Tax=Alphaproteobacteria RepID=C6BAR1_RHILS Length = 706 Score = 188 bits (478), Expect = 2e-46, Method: Composition-based stats. Identities = 50/406 (12%), Positives = 116/406 (28%), Gaps = 26/406 (6%) Query: 33 SISEAINKRSVTDVDSGESVSIPTEDISEPMFHQGRGGLRHRVHPGNDHFVQNDRIERPQ 92 +++ I +V+ + +V+ +D P + + N ++ E Sbjct: 126 AVAPQIPAGNVSLAEQSVAVAQALQDKEAPALA-KPDASQTSEYDANA--ALTNKPEGSA 182 Query: 93 GGGGGSGSGQGQASQDGEGQDEFVFQISKDEYLDLLFEDLALPNLKQNQQR-QLTEYKTH 151 G + A + + E L N++R Sbjct: 183 AALGATKRAAPAAPGIVPQRQFAEPMAAIAPSPVPPAEGRMQMQLDPNRERFANAAANPI 242 Query: 152 RAGYTANGVPANISVVRSLQNSLARRTAMTAGKRRELHALEENLAIISNSEPAQLLEEER 211 ++ T + V + + R A +EE + P ++ Sbjct: 243 KSVATDPVSTFSADVDSASYAFVRRSLTGGAMPDPLSVRVEEMINYFPYDWPGPNNADQP 302 Query: 212 LRKEIAELRAKIERVPFIDTFDLRYKNYEKRPDPSSQAVMFCLMDVSGSMDQSTK-DMAK 270 + + + R + ++ + P A + L+DVSGSMD+ K + K Sbjct: 303 FKATVTVMPTPWNRDTELMHVAIKGYDIAPATTPR--ANLVFLIDVSGSMDEPDKLPLLK 360 Query: 271 RFYILLYLFLSRTYKNVEVVYIRHHTQA--------KEVDEHEFFYSQETGGTIVSSALK 322 + L+ L V Y + K + G T + ++ Sbjct: 361 SAFRLMVNRLKADDTVSIVTYAGNAGTVLAPTRVAEKSKILSAIDRLEPGGSTGGAEGIE 420 Query: 323 LMDEVVKERYNPAQWNIYAAQASDGD-NWADDSPLCHEILAKKLLPVVRYYSY-IEITRR 380 ++ K+ + N A+DGD N S + + ++ + + Sbjct: 421 AAYDLAKQGFVKDGVNRV-MLATDGDFNVGPSSDGDLKRIIEEKRKDGIFLTVLGFGRGN 479 Query: 381 AHQTLWREYEHLQSTFDNFAMQHIRDQDDIYPVFRELFHKQNATAK 426 + +L + + + D ++ ++ + Sbjct: 480 LNDSLMQTLAQNGNGSAAYI--------DTLAEAQKTLVEEAGSTL 517 >UniRef50_A1ZU67 Von Willebrand factor type A domain protein n=1 Tax=Microscilla marina ATCC 23134 RepID=A1ZU67_9SPHI Length = 552 Score = 188 bits (478), Expect = 3e-46, Method: Composition-based stats. Identities = 51/340 (15%), Positives = 100/340 (29%), Gaps = 26/340 (7%) Query: 98 SGSGQGQASQDGEGQDEFVFQISKDEYLDLLFEDLALPNLKQNQQRQLTEYKTHRAGYTA 157 + + +Q G+ Q + L ED++ P +K+ + T + TA Sbjct: 45 QTTPTTKRNQKGKSAAAAKDQADAETNTIALEEDVSPPKIKEKKPAN---ENTFLSVKTA 101 Query: 158 NGVPANISVVRSLQNSLARRTAMTAGKRRELHALEENLAIISNSEPAQLLEEERLRKEIA 217 +I V + + + LEE + + + Sbjct: 102 PLSTFSIDVDNASYSRARKSINNGQLPSTSSVRLEEFINYFNYQYKQPEGQH-PFSVNTE 160 Query: 218 ELRAKIERVPFIDTFDLRYKNYEKRPDPSSQAVMFCLMDVSGSMDQSTK-DMAKRFYILL 276 + + L+ K + R S + L+DVSGSM K + ++ + +L Sbjct: 161 VAKCPWNPKNHLVHIGLQGKRLDSRKLKLS--NLVFLIDVSGSMSAPDKLPLLRKAFKML 218 Query: 277 YLFLSRTYKNVEVVYIRHHT--------QAKEVDEHEFFYSQETGGTIVSSALKLMDEVV 328 L + VVY + K+ Q G T + +KL ++ Sbjct: 219 VNNLGEEDRVAIVVYAGNAGLVLPATQGTDKQKIMEALDKLQSGGSTAGGAGIKLAYKIA 278 Query: 329 KERYNPAQWNIYAAQASDGD-NWADDSPLCHEILAKKLLPVVRYYSY-IEITRRAHQTLW 386 K+ + N A+DGD N S + L ++ + + + Sbjct: 279 KQNFIKEGNNR-IILATDGDFNLGASSDQAMQNLIEEKRKEGVFITVLGLGMGNYRDSKM 337 Query: 387 REYEHLQSTFDNFAMQHIRDQDDIYPVFRELFHKQNATAK 426 + + D ++F K Sbjct: 338 EIIADKGNGNYYYL--------DNLNEAYKVFGKDLKGTL 369 >UniRef50_Q67Q87 Putative uncharacterized protein n=1 Tax=Symbiobacterium thermophilum RepID=Q67Q87_SYMTH Length = 395 Score = 187 bits (475), Expect = 5e-46, Method: Composition-based stats. Identities = 98/422 (23%), Positives = 168/422 (39%), Gaps = 54/422 (12%) Query: 14 KSMVNRQRFLRRYKAQIKQSISEAINKRSVTDVDSGESVSIPTEDISEPMFHQGRGGLRH 73 + ++++R R + I+ ++++ ++ ++ D + V +P + E F + Sbjct: 18 QGQMDQERHQARIREAIRANLADIVSDEAIIASDGRKVVRLPIRVLREYRFRLDWQK-QP 76 Query: 74 RVHPGNDHFVQNDRIERPQGGGGGSGSGQGQASQDGEGQDEFVFQISKDEYLDLLFEDLA 133 RV + + + RP +G D G+D F + +E +LLF +L+ Sbjct: 77 RVGEADGPVRPGEPVGRPGRAAEAAGGSGAG---DEAGEDWFETDVPLEELEELLFAELS 133 Query: 134 LPNLKQNQQRQLTEYKTHRAGYTANGVPANISVVRSLQNSLARRTAMTAGKRRELHALEE 193 LP+L+ Q+ LT G+ ANI R+L ++ R + Sbjct: 134 LPHLEPKQEPHLTVLHHEWRDVRRQGLYANIDKKRTLLEAMKRNRLAGRPPLAGIRR--- 190 Query: 194 NLAIISNSEPAQLLEEERLRKEIAELRAKIERVPFIDTFDLRYKNYEKRPDPSSQAVMFC 253 DLR++ +++ P + AV+ Sbjct: 191 --------------------------------------EDLRFRTWDEAEIPGASAVLII 212 Query: 254 LMDVSGSMDQSTKDMAKRFYILLYLFLSRTYKNVEVVYIRHHTQAKEVDEHEFFYSQETG 313 +MD SGSM K +A+ + FL Y+ V++ ++ H T+AKE+DE FF E+G Sbjct: 213 MMDTSGSMGTGEKYIARSLCHWMVRFLRTRYERVKLHFVAHTTEAKEMDEESFFTRGESG 272 Query: 314 GTIVSSALKLMDEVVKERYNPAQWNIYAAQASDGDNWADDSPLCHEILAKKLLPVVRYYS 373 GT SSA + +++ RY P + N+YA SDGDN D+P L +KLL Sbjct: 273 GTRCSSAYEYALQLIDRRYPPDRHNLYAFHFSDGDNLISDNPRAV-ALLRKLLERCALVG 331 Query: 374 YIEI--------TRRAHQTLWREYEHLQSTFDNFAMQHIRDQDDIYPVFRELFHKQNATA 425 Y +I + F IRD+ +IY R F + A Sbjct: 332 YGQIETQPQYLSMPYYQPNTLLTLFREEIDHPRFVTALIRDRSEIYAALRAFFPRPGAGE 391 Query: 426 KG 427 +G Sbjct: 392 RG 393 >UniRef50_Q7UP85 Putative uncharacterized protein n=1 Tax=Rhodopirellula baltica RepID=Q7UP85_RHOBA Length = 885 Score = 187 bits (474), Expect = 8e-46, Method: Composition-based stats. Identities = 49/408 (12%), Positives = 93/408 (22%), Gaps = 53/408 (12%) Query: 31 KQSISEAINKRSVTDVDSGESVSIPTEDISEPMFHQGRGGLRHRVHPGNDHFVQNDRIER 90 K A + S G+ P + R R + + Sbjct: 316 KDEAPTAPREPSAGKPVVGDFAVAPVPE-QLGRQQFDFRASRGRTL---ERQLGETEELA 371 Query: 91 PQGGGGGSGSGQGQASQDGEGQDEFVFQISKDEYLDLLFEDLALPNLKQNQQRQLTEYKT 150 P G G F+ ++ Sbjct: 372 PTSDRLAILPPTPDGEGQGPGMSGDKFEPIQE--------------------------NE 405 Query: 151 HRAGYTANGVPANISVVRSLQNSLARRTAMTAGKRRELHALEENLAIISNSE-PAQLLEE 209 R +I V + + R + +EE + P + Sbjct: 406 FRRVADDALSTFSIDVDTASYAKVRSYLQRGQLPRPDSVRIEELINYFDYQYTPPSAEDP 465 Query: 210 ERLRKEIAELRAKIERVPFIDTFDLRYKNYEKRPDPSSQAVMFCLMDVSGSMDQS-TKDM 268 +A + ++ K+ +++ P + L+D SGSM + + Sbjct: 466 VPFSSAMAVASCPWNENNRLVRVGIQAKDIDRKERPR--CNLVFLIDTSGSMKRPNKLPL 523 Query: 269 AKRFYILLYLFLSRTYKNVEVVYIRHHTQA--------KEVDEHEFFYSQETGGTIVSSA 320 +L L + VVY K+ G T + Sbjct: 524 VIEGMKVLLDQLKNRDRVAIVVYAGSSGLVLDSTPVKQKKKIIRALSALSAGGSTNGGAG 583 Query: 321 LKLMDEVVKERYNPAQWNIYAAQASDGD-NWADDSPLCHEILA-KKLLPVVRYYSYIEIT 378 L+L + +E + N SDGD N A ++ Sbjct: 584 LQLAYQTARENFIEDGVNRVI-LCSDGDFNVGMTGTDQLVAEATRQSKSGTELTVLGFGM 642 Query: 379 RRAHQTLWREYEHLQSTFDNFAMQHIRDQDDIYPVFRELFHKQNATAK 426 + + + + F D +++ Q A Sbjct: 643 GNHNDAMMERISNSGAGNYAFV--------DTIAEAKKVLADQVAGTL 682 >UniRef50_Q21MJ3 von Willebrand factor, type A n=5 Tax=Proteobacteria RepID=Q21MJ3_SACD2 Length = 708 Score = 185 bits (469), Expect = 3e-45, Method: Composition-based stats. Identities = 51/410 (12%), Positives = 118/410 (28%), Gaps = 34/410 (8%) Query: 39 NKRSVTDVDSGESVSIPTEDISEPMFHQGRGGLRHRVHPGNDHFVQNDRIERPQGGGGGS 98 + + +S + S+ ++ +E V P R Sbjct: 126 SDEQIIVDNSVQVPSVTIKEKAETKIA-DVAESEMAVPPSAALEEVVVTGMRSSAAESAK 184 Query: 99 -----GSGQGQASQDGEGQDEFVFQISKDEYLDLL------FEDLALPNLKQNQQRQLTE 147 + Q Q S + S L + + + P + N + + E Sbjct: 185 LSKKPAASQRQVSAIRAQDIGALPDQSNAVALQRIAGMPVDGDTIVAPAPQGNDKFEHVE 244 Query: 148 YKTHRAGYTANGVPANISVVRSLQNSLARRTAMTAGKRRELHALEENLAIISNSEPAQLL 207 + ++ A +I V + + + R+ ++ EE + + P Sbjct: 245 ENSVKSVAEAPVSTFSIDVDTASYSFVRRQLNSGYLPEKDAIRAEELINYFDYNYPLPSD 304 Query: 208 EEERLRKEIAELRAKIERVPFIDTFDLRYKNYEKRPDPSSQAVMFCLMDVSGSM-DQSTK 266 + I + + + + L+ + P + + L+DVSGSM Q Sbjct: 305 STAPFKPNITVIDSPWAKGKKLVHIGLKGYDIAPDQKPRT--NLVFLLDVSGSMNSQDKL 362 Query: 267 DMAKRFYILLYLFLSRTYKNVEVVYIRHHTQAKEVD--------EHEFFYSQETGGTIVS 318 + K+ +L L+ VVY E Q G T Sbjct: 363 PLVKQSMEMLLSTLNPDDTVAIVVYAGAAGTVLEPTPAKDKQKILSAMQRLQAGGSTAGG 422 Query: 319 SALKLMDEVVKERYNPAQWNIYAAQASDGD-NWADDSPLCHEILAKKLL-PVVRYYSYIE 376 + + L ++ + ++ N A+DGD N + + ++ + Sbjct: 423 AGIALAYDLAEANFDKKAVNRVI-LATDGDFNVGSTNNETLQGFVERKREKGIFLSVLGF 481 Query: 377 ITRRAHQTLWREYEHLQSTFDNFAMQHIRDQDDIYPVFRELFHKQNATAK 426 + L + + + D +++ ++ +++ Sbjct: 482 GQGNYNDHLMQTLAQNGNGVAAYI--------DTVSEAQKVLVQEASSSL 523 >UniRef50_A9IU52 Putative lipoprotein n=3 Tax=Bordetella RepID=A9IU52_BORPD Length = 582 Score = 184 bits (467), Expect = 4e-45, Method: Composition-based stats. Identities = 46/378 (12%), Positives = 90/378 (23%), Gaps = 22/378 (5%) Query: 60 SEPMFHQGRGGLRHRVHPGNDHFVQNDRIERPQGGGGGSGSGQGQASQDGEGQDEFVFQI 119 G G + + G + G A + Q Sbjct: 33 DAARALTGAGKAGNPGTVPPAVPSAPPAPPAAEADAGAPRARPGAALTRQYAPQAYSAQP 92 Query: 120 SKDEYLDLLFEDLALPNLKQNQQRQLTEYKTHRAGYTANGVPANISVVRSLQNSLARRTA 179 + L A P ++ + A V ++ R Sbjct: 93 AAVSLLPAPSGYYAPPQAEERENYARYRDNPVVAAQEQPVSTFGADVDTGSYTNVRRLLN 152 Query: 180 MTAGKRRELHALEENLAIISNSEPAQLLEEERLRKEIAELRAKIERVPFIDTFDLRYKNY 239 + EE + ++ A + ++ Sbjct: 153 EGRLPPPDAVRAEEFINYFDYGYATPDSRQQPFSIITEVSAAPWNPQRQLLKIGIQGYRV 212 Query: 240 EKRPDPSSQAVMFCLMDVSGSMDQSTK-DMAKRFYILLYLFLSRTYKNVEVVYIRHHTQA 298 + P+ A + L+D SGSM + K + K L L + V Y + Sbjct: 213 APQDIPA--ANLVFLVDTSGSMAERDKLPLIKGALKQLVAQLRPQDRVAIVTYAGQASMT 270 Query: 299 --------KEVDEHEFFYSQETGGTIVSSALKLMDEVVKERYNPAQWNIYAAQASDGD-N 349 K + G T + L L + + N ASDGD N Sbjct: 271 LDSTPGDQKARINAAIDELRAAGSTNGGAGLDLAYAQAAKGFVKGGVNR-ILLASDGDFN 329 Query: 350 WADDSPLCHEI-LAKKLLPVVRYYSYIEITRRAHQTLWREYEHLQSTFDNFAMQHIRDQD 408 + +A++ + + + L + + ++ Sbjct: 330 VGATDLEDLKDKIARQRQGGIALTTLGVGGGNFNDALAMQLADAGNGSYHYL-------- 381 Query: 409 DIYPVFRELFHKQNATAK 426 D R++ Q ++ Sbjct: 382 DSLREARKVLAAQMSSTL 399 >UniRef50_A9DBX8 Putative uncharacterized protein n=1 Tax=Hoeflea phototrophica DFL-43 RepID=A9DBX8_9RHIZ Length = 668 Score = 184 bits (467), Expect = 4e-45, Method: Composition-based stats. Identities = 49/417 (11%), Positives = 105/417 (25%), Gaps = 31/417 (7%) Query: 21 RFLRRYKAQIKQSISEAINKRSVTDVDSGESVSIPTEDISEPMFHQGRGGLRHRVHPGND 80 +F + + ++ + RS + + E V P Sbjct: 88 QFAQDQNGFEPKLLAPLDSSRSTDPELAMDKPLPLAPQQEEQELVAAAPLPEVAVSPALK 147 Query: 81 HFVQNDRIERPQGGGGGSGSGQGQASQDGEGQDEFVFQISKDEYLDLLFEDLALPNLKQN 140 Q + R + G QI + + A + Sbjct: 148 SSRQANDAARQRF---------TGQPVGGLAGQGLAGQIDGESLRGADGANPAPGAEAER 198 Query: 141 QQRQLTEYKTHRAGYTANGVPANISVVRSLQNSLARRTAMTAGKRRELHALEENLAIISN 200 + + + R+ + V + + R +EE + + Sbjct: 199 DRVEGFDSNGVRSVAEYPVSTFSADVDTASYAMVRRALKQGVMPDPRTVRIEEMVNYFNY 258 Query: 201 SEPAQLLEEERLRKEIAELRAKIERVPFIDTFDLRYKNYEKRPDPSSQAVMFCLMDVSGS 260 PA E R + + ++ + + P A + L+DVSGS Sbjct: 259 DYPAPESVETPFRATVTVTPTPWNANTRLLHIGVKGYDVKPAARPQ--ANLVLLVDVSGS 316 Query: 261 MDQ-STKDMAKRFYILLYLFLSRTYKNVEVVYIRHHTQAKEVD--------EHEFFYSQE 311 M + + K + LL L V Y E + Sbjct: 317 MQETDKLPLLKSAFRLLIQKLEPEDTVSIVTYAGDAGTVLEPTPASDKAKILDALDDLRP 376 Query: 312 TGGTIVSSALKLMDEVVKERYNPAQWNIYAAQASDGD-NWADDSPLCHEILAKKLLPVVR 370 G T ++ ++ + ++ N A+DGD N + L ++ Sbjct: 377 GGSTAGAAGIEEAYRLAEKARVNGGVNRV-LLATDGDFNVGASDDDALKSLIEEKRESGV 435 Query: 371 YYS-YIEITRRAHQTLWREYEHLQSTFDNFAMQHIRDQDDIYPVFRELFHKQNATAK 426 + S + + L + + + D + ++ + Sbjct: 436 FLSIFGFGQGNYNDQLMQTLAQNGNGVAAYI--------DTLAEAEKTLAQEATASL 484 >UniRef50_UPI00016C54C8 hypothetical protein GobsU_21830 n=1 Tax=Gemmata obscuriglobus UQM 2246 RepID=UPI00016C54C8 Length = 638 Score = 183 bits (465), Expect = 8e-45, Method: Composition-based stats. Identities = 46/390 (11%), Positives = 93/390 (23%), Gaps = 26/390 (6%) Query: 44 TDVDSGESVSIPTEDI-SEPMFHQGRGGLRHRVHPGNDHFVQNDRIERPQGGGGGSGSGQ 102 +G P DI + G P + P G + Sbjct: 71 PAQITGGRDEWPQGDIAPIGKKGEHAPGPALPGLPSPAPLAEPAMPAGPDALGKRIAASS 130 Query: 103 GQASQDGEGQDEFVFQISKDEYLDLLFEDLALPNLKQNQQRQLTEYKTHRAGYTANGVPA 162 + + + N + + + R+ A Sbjct: 131 APFASVVASGGGAFGGVRGAANKPAPRDGY---NAQNAEAYGRYQENEFRSPLVAALSTF 187 Query: 163 NISVVRSLQNSLARRTAMTAGKRRELHALEENLAIISNSEPAQLLEEERLRKEIAELRAK 222 + V + ++ R L E + S + + + Sbjct: 188 SADVNTASYANVRRMLNEGTLPPASAVFLAEFVNYFPYSYAPPPAGADPVAFHVEMGPCP 247 Query: 223 IERVPFIDTFDLRYKNYEKRPDPSSQAVMFCLMDVSGSMDQSTK-DMAKRFYILLYLFLS 281 + ++ P + L+D SGSM Q + + ++ LL L+ Sbjct: 248 WNAKHHLLRVGVQAHQIPAEKLPPR--NLVFLVDTSGSMQQENRLPLVQKSLELLVEKLT 305 Query: 282 RTYKNVEVVYIR--------HHTQAKEVDEHEFFYSQETGGTIVSSALKLMDEVVKERYN 333 + V Y K+ Q GGT +K + ++ + Sbjct: 306 EKDRVSVVTYAGDSRVALPPTSGADKKAILDVVTGLQANGGTNGEGGIKKAYQFARDTFL 365 Query: 334 PAQWNIYAAQASDGD-NWADDSPLCHEILAKKLLP-VVRYYSYIEITRRAHQTLWREYEH 391 N +DGD N L ++ V +E + Sbjct: 366 DGGVNRVI-LCTDGDFNVGVVDNGELVKLIEEQRKSKVFLTVLGYGMGNYKDDRLKELAN 424 Query: 392 LQSTFDNFAMQHIRDQDDIYPVFRELFHKQ 421 + + D +++F +Q Sbjct: 425 HGNGHHAYI--------DTLDEAKKVFVEQ 446 >UniRef50_C0Z5D5 Putative uncharacterized protein n=1 Tax=Brevibacillus brevis NBRC 100599 RepID=C0Z5D5_BREBN Length = 513 Score = 183 bits (463), Expect = 2e-44, Method: Composition-based stats. Identities = 47/343 (13%), Positives = 92/343 (26%), Gaps = 25/343 (7%) Query: 95 GGGSGSGQGQASQDGEGQDEFVFQISKDEYLDLLFEDLALPNLKQNQQRQLTEYKTHRAG 154 SG+ + + G+ + D + P + + Sbjct: 29 RSESGNKPSASVEQGQSNQVASSPSPPSQLADYALKKSGDPLPNDMYFKDY-GTNQFVST 87 Query: 155 YTANGVPANISVVRSLQNSLARRTAMTAGKRRELHALEENLAIISNSEPAQLLEEERLRK 214 V + + E +EE + S PA + Sbjct: 88 AKDRLSTFAADVDTASYTIMRHFIKDGNLPPAEAVRVEEFINFFPTSYPAPT--NQTFAI 145 Query: 215 EIAELRAKIERVPFIDTFDLRYKNYEKRPDPSSQAVMFCLMDVSGSMDQSTK-DMAKRFY 273 + + ++ I ++ K + A + ++DVSGSM+Q + ++ K+ Sbjct: 146 QADSGPSPFQKNLQIVRIGIKGKELSPKERK--PANLVFVIDVSGSMNQENRLELVKKSL 203 Query: 274 ILLYLFLSRTYKNVEVVYIRH--------HTQAKEVDEHEFFYSQETGGTIVSSALKLMD 325 +L L T VVY T+ K+ Q G T L L Sbjct: 204 HVLVDQLQPTDSVGIVVYGSEGRVLLPPTSTEDKQAILSAIDELQPEGSTNAEQGLVLGY 263 Query: 326 EVVKERYNPAQWNIYAAQASDG-DNWADDSPLCHEILAKK-LLPVVRYYSYIEITRRAHQ 383 E+ + P N SDG N + + + S+ + Sbjct: 264 EMAARSFKPPAINRVI-LCSDGVANVGETGAEGILRSIEDYARKDIYLSSFGFGMGNYND 322 Query: 384 TLWREYEHLQSTFDNFAMQHIRDQDDIYPVFRELFHKQNATAK 426 + + + + D + R +F + Sbjct: 323 VMMEQLANKGEGSYAYI--------DTFSEARRIFTESLTGTL 357 >UniRef50_A0NVX5 Putative uncharacterized protein n=1 Tax=Labrenzia aggregata IAM 12614 RepID=A0NVX5_9RHOB Length = 608 Score = 182 bits (461), Expect = 2e-44, Method: Composition-based stats. Identities = 37/313 (11%), Positives = 79/313 (25%), Gaps = 22/313 (7%) Query: 126 DLLFEDLALPNLKQNQQRQLTEYKTHRAGYTANGVPANISVVRSLQNSLARRTAMTAGKR 185 L + L+ ++ E R ++ V + + + + Sbjct: 126 QLEPPAMPAVQLEDRERFASAEANPLRRTSADPVSTFSVDVDTASYSYVRSTLSGGRLPN 185 Query: 186 RELHALEENLAIISNSEPAQLLEEERLRKEIAELRAKIERVPFIDTFDLRYKNYEKRPDP 245 + +EE + + P ++ + + ++ P Sbjct: 186 PDAVRVEEMVNYFDYNYPVPEKGGHPFSTNVSVVDTPWNEHTKLMQVGIQGYKVPLDDLP 245 Query: 246 SSQAVMFCLMDVSGSMDQ-STKDMAKRFYILLYLFLSRTYKNVEVVYIRHHTQAKEVDEH 304 S + L+D SGSM + + ++ + LL L + V Y E + Sbjct: 246 SQ--NLVFLIDTSGSMADANKLPLLQQSFRLLLSSLRDEDEVAIVTYAGSSGVLLEPTKV 303 Query: 305 EF--------FYSQETGGTIVSSALKLMDEVVKERYNPAQWNIYAAQASDGD-NWADDSP 355 G T LK + + + A+DGD N P Sbjct: 304 ADKTRILEKINALTSGGSTAGHEGLKGAYALAETMTGDGEQTR-IILATDGDFNVGLSDP 362 Query: 356 LCHEILAKKLLPVVRYYSY-IEITRRAHQTLWREYEHLQSTFDNFAMQHIRDQDDIYPVF 414 + + S + L + + D Sbjct: 363 DSLKRYVAEQRENGTALSVLGFGRGNYNDELMQTLAQNGQGVAAYI--------DTLSEA 414 Query: 415 RELFHKQNATAKG 427 R++ Q ++ Sbjct: 415 RKVLVDQVVSSIS 427 >UniRef50_C7N770 Uncharacterized protein containing a von Willebrand factor type A (VWA) domain n=1 Tax=Slackia heliotrinireducens DSM 20476 RepID=C7N770_SLAHD Length = 629 Score = 182 bits (461), Expect = 2e-44, Method: Composition-based stats. Identities = 50/364 (13%), Positives = 93/364 (25%), Gaps = 31/364 (8%) Query: 78 GNDHFVQNDRIERPQGGGGGSGSGQGQASQDG---EGQDEFVFQISKDEYLDLLFEDLAL 134 G + E P + S + +G + + + + D L E + Sbjct: 102 GVGTNLLGSNAEMPVAETKAASEDTMAGSANSYAPDGGLAYETDEAYETF-DTLDEGAPM 160 Query: 135 PNLKQNQQRQLTEYKTHRAGYTANGVPANISVVRSLQNSLARRTAMTAGK---RRELHAL 191 + ++ E + T + V + +L R + Sbjct: 161 EDF-NTEEYAAIEENGFVSTVTRPLSTCSADVDTASYCNLRRMINDGYSLDEIPDGAVRI 219 Query: 192 EENLAIISNSEPAQLLEEERLRKEIAELRAKIERVPFIDTFDLRYKNYEKRPDPSSQAVM 251 EE L + + R + + +K S + + Sbjct: 220 EEMLNYFHYDSG-EPEGNDLFAVRAESARCPWNDQTQLLVMT--FTASDKAQTASKGSNL 276 Query: 252 FCLMDVSGSMDQSTK-DMAKRFYILLYLFLSRTYKNVEVVYI--------RHHTQAKEVD 302 L+D+SGSMD+ K D+ K + L L + V Y Sbjct: 277 VFLIDISGSMDEPDKLDLLKDSFGTLLENLGPNDRVSIVTYAAGEDVLLEGASGDDTRKI 336 Query: 303 EHEFFYSQETGGTIVSSALKLMDEVVKERYNPAQWNIYAAQASDGD-NWADDSPLCHEIL 361 + G T + L++ EV + Y N ASDGD N S Sbjct: 337 MRALNRLEADGSTNGEAGLEMAYEVAERNYIEGGVNR-IVMASDGDLNVGITSESDLYDF 395 Query: 362 AKKLLP-VVRYYSYIEITRRAHQTLWREYEHLQSTFDNFAMQHIRDQDDIYPVFRELFHK 420 ++ V + T + ++ D + + Sbjct: 396 VEEKRETGVYLSVLGFGSGNYKDTKMETLADHGNGTYHYI--------DCVEEAERVLGE 447 Query: 421 QNAT 424 Sbjct: 448 DLTA 451 >UniRef50_UPI000185CB41 protein containing von Willebrand factor n=1 Tax=Capnocytophaga sputigena ATCC 33612 RepID=UPI000185CB41 Length = 550 Score = 181 bits (459), Expect = 4e-44, Method: Composition-based stats. Identities = 54/367 (14%), Positives = 103/367 (28%), Gaps = 16/367 (4%) Query: 73 HRVHPGNDHFVQNDRIERPQGGGGGSGSGQGQASQ-DGEGQDEFVFQISKDEYLDLLFED 131 H + + P + + + + + +E E+ Sbjct: 14 PEGHRNSSNSANQLPPPAPATTVEELTANKMASPETPPPPPPPPAYDAVVEEMEIANSEE 73 Query: 132 LALPNLKQNQQRQLTEYKTHRAGYTANGVPANISVVRSLQNSLARRTAMTAGKRRELHAL 191 + L+ N+ + A + V R+ +L R ++ + Sbjct: 74 PSQQQLRSNETYKEISENPFVAVAQQPVTTFSADVDRASYANLRRMLGYGQLPPKDAIRI 133 Query: 192 EENLAIISNSEPAQLLEE-ERLRKEIAELRAKIERVPFIDTFDLRYKNYEKRPDPSSQAV 250 EE + PA E LR + L+ K + P S Sbjct: 134 EEMINYFDYDYPAPTKEATSPLRVTPELAPTPWNPEHLLLRIGLQAKKLDLAQAPPS--N 191 Query: 251 MFCLMDVSGSMDQS-TKDMAKRFYILLYLFLSRTYKNVEVVY--------IRHHTQAKEV 301 + L+DVSGSMD+ + K + LL L T + V Y + ++ Sbjct: 192 IVFLIDVSGSMDEPNKLPLLKSSFKLLLTQLKPTDRVAIVTYASGTKVALSSTPVKERQK 251 Query: 302 DEHEFFYSQETGGTIVSSALKLMDEVVKERYNPAQWNIYAAQASDGD-NWADDSPLCHEI 360 E +G T SS ++L + ++ + N A+DGD N +P E Sbjct: 252 IEKVLDNLYASGSTSGSSGIQLAYKEAQKNFIKNGNNR-IILATDGDFNVGISNPRELEK 310 Query: 361 LAKKLLP-VVRYYSYIEITRRAHQTLWREYEHLQSTFDNFAMQHIRDQDDIYPVFRELFH 419 +K + + + + + + F + Sbjct: 311 FIEKQRESGIYMSVLGFGMGNYRDDMAETIADKGNGNYAYIDDLTEAKKVLVNEFSGMLF 370 Query: 420 KQNATAK 426 K Sbjct: 371 AVAKDVK 377 >UniRef50_A4BNQ7 Putative uncharacterized protein n=1 Tax=Nitrococcus mobilis Nb-231 RepID=A4BNQ7_9GAMM Length = 391 Score = 180 bits (457), Expect = 7e-44, Method: Composition-based stats. Identities = 94/423 (22%), Positives = 173/423 (40%), Gaps = 58/423 (13%) Query: 12 KNKSMVNRQRFLRRYKAQIKQSISEAINKRSVTDVDSGESVSIPTEDISEPMFHQGRGGL 71 ++ + R + + +++ + + + V +V +P + F + Sbjct: 19 FSRGTRDWLRHNEKIREAVREQLPDLVAGSDVLSRPDNRTVKVPVRFMEHYRFRLRNPDV 78 Query: 72 RHRVHPGNDHFVQNDRIERPQGGGGGSGSGQGQASQDGEGQDEFVFQISKDEYLDLLFED 131 R G R +P + S +GEGQ F + D+ LD L+++ Sbjct: 79 RTGAGQGKAKPGDVLRPAQP-----ARPGQGKEGSGEGEGQITFALEFQIDDILDWLWDE 133 Query: 132 LALPNLKQNQQRQLTEYKTHRAGYTANGVPANISVVRSLQNSLARRTAMTAGKRRELHAL 191 L LP+LK ++ E R G+ G + + R+++ ++ RR+A Sbjct: 134 LELPHLKPRLGTRIEEDAYIREGWDRRGARSRLDRRRTMKEAIKRRSAQG--PEAIP--- 188 Query: 192 EENLAIISNSEPAQLLEEERLRKEIAELRAKIERVPFIDTFDLRYKNYEKRPDPSSQAVM 251 I DLR++ +R P++ AV Sbjct: 189 -------------------------------------IVNDDLRFRQLARRRRPTTNAVA 211 Query: 252 FCLMDVSGSMDQSTKDMAKRFYILLYLFLSRTYKNVEVVYIRHHTQAKEVDEHEFFYSQE 311 F L+DVS SMD+ + +AK F+ + R + +E+V+I H +A E +E FF Sbjct: 212 FFLLDVSSSMDEHCRRLAKTFFFWALQGVRRQFSTIEIVFIAHTVEAWEFEEENFFRIHG 271 Query: 312 TGGTIVSSALKLMDEVVKERYNPAQWNIYAAQASDGDNWADDSPLCHEILAKKLLPVVRY 371 GGT S+A+ ++++ERY+PA +N Y A+DG N+++D E L + L P++ + Sbjct: 272 QGGTKSSTAVHKAQQILEERYDPAMYNCYLFYATDGHNFSEDRRRATEALLR-LAPLMNF 330 Query: 372 YSYIEITRRAHQTL-------WREYEHLQSTFDNFAMQHIRDQDDIYPVFRELFHKQNAT 424 Y E++ + H+ L WR ++++ DI+ + F Q A Sbjct: 331 LGYAEVSHQNHRRLDTEVAGIWRGLGAEGWPVGSYSLTRE---ADIWLAIKAFFTDQAAE 387 Query: 425 AKG 427 A+ Sbjct: 388 AEA 390 >UniRef50_UPI00016C05A6 hypothetical protein Epulo_00085 n=1 Tax=Epulopiscium sp. 'N.t. morphotype B' RepID=UPI00016C05A6 Length = 368 Score = 178 bits (452), Expect = 3e-43, Method: Composition-based stats. Identities = 117/403 (29%), Positives = 179/403 (44%), Gaps = 67/403 (16%) Query: 18 NRQRFLRRYKAQIKQSISEAINKRSVTDVDSGESVSIPTEDISEPMFHQGRGGLRHRVHP 77 + +R + + I+++I I S+T+ +G V + +++ E F G V Sbjct: 18 DAKRHRKLVEKSIRENIDMLIVGESITETAAGNIVKVRIQELPEYRFKFG--SSTEYVAI 75 Query: 78 GNDHFVQNDRIERPQGGGGGSGSGQGQASQDGEGQDEFVFQISKDEYLDLLFEDLALPNL 137 G+ V N++ + +G D + +I D+ L LLFE L LPNL Sbjct: 76 GDGDEVVNEKCDFEMEASNEAGL------------DIYESEIVLDDALALLFEQLELPNL 123 Query: 138 KQNQQRQLTE-YKTHRAGYTANGVPANISVVRSLQNSLARRTAMTAGKRRELHALEENLA 196 + + + L R+G G+ + R+LQ + R Sbjct: 124 YEKKFKNLEYFSTQKRSGIKKTGIYPRFAKKRTLQEKIIR-------------------- 163 Query: 197 IISNSEPAQLLEEERLRKEIAELRAKIERVPFIDTFDLRYKNYEKRPDPSSQAVMFCLMD 256 + D+RY++ K+ S AV+ C+MD Sbjct: 164 ---------------------------NKNGRFINQDIRYQSLAKKQINHSNAVIVCIMD 196 Query: 257 VSGSMDQSTKDMAKRFYILLYLFLSRTYKNVEVVYIRHHTQAKEVDEHEFFYSQETGGTI 316 SGSM + KDMAK FY LLY F+ Y VE+++I H T AKEV E++FF+ E+GGT Sbjct: 197 TSGSMGTTKKDMAKSFYFLLYQFIKIRYAKVEMIFIAHSTIAKEVTENDFFHKGESGGTY 256 Query: 317 VSSALKLMDEVVKERYNPAQWNIYAAQASDGDNWADDSPLCHEILAKKLLPVVRYYSYIE 376 +SS E++KERY+P WN+Y SDGDNW DD+ L LA +L + YIE Sbjct: 257 ISSGYTKALEIIKERYDPRLWNVYTFHCSDGDNWTDDNNLAV-SLANELCSCSNLFGYIE 315 Query: 377 I-TRRAHQTLWREY-EHLQSTFDNFAMQHIRDQDDIYPVFREL 417 I T + EY H+ +NF I + DI+ VF+++ Sbjct: 316 IKTNNYSSVILNEYNAHIT--SNNFLALKIFKKSDIFEVFKKV 356 >UniRef50_A9AWD1 von Willebrand factor type A n=1 Tax=Herpetosiphon aurantiacus ATCC 23779 RepID=A9AWD1_HERA2 Length = 610 Score = 177 bits (447), Expect = 1e-42, Method: Composition-based stats. Identities = 42/372 (11%), Positives = 87/372 (23%), Gaps = 27/372 (7%) Query: 67 GRGGLRHRVHPGNDHFVQNDRIERPQGGGGGSGSGQGQASQDGEGQDEFVFQISKDEYLD 126 + V P A+ + D + Sbjct: 89 AADDQSAQWPTAEATSVAPAPQPMPTQAADAGQPVPNPAAGKPLVDTWELPTQPIDPNPN 148 Query: 127 LLFEDLALPNLKQNQQRQLTEYKTHRAGYTANGVPANISVVRSLQNSLARRTAMTAGKRR 186 +E + + + T + + + + + Sbjct: 149 YAYEQDQ--EIFDSMYFKNYGTNPFVRTETDPLSTFAMDIDSASYSLMRSSINQGLLPPA 206 Query: 187 ELHALEENLAIISNSEPAQLLEEERLRKEIAELRAKIE-RVPFIDTFDLRYKNYEKRPDP 245 + +EE L P + + + ++ ++ E Sbjct: 207 DSVRVEEYLNAFDYEYPQPEDGD--FAIYSEVAPSPFGGPNYELVQIGIQARSIEVADRK 264 Query: 246 SSQAVMFCLMDVSGSM-DQSTKDMAKRFYILLYLFLSRTYKNVEVVYI--------RHHT 296 A + ++D SGSM + +M K I L L V + Sbjct: 265 --PAALTFVIDTSGSMAQDNRLEMVKNALIYLAGQLEPDDSLAIVAFNDGMRVVLNPTSG 322 Query: 297 QAKEVDEHEFFYSQETGGTIVSSALKLMDEVVKERYNPAQWNIYAAQASDG-DNWADDSP 355 + + + G T + L E+ + + P N SDG N P Sbjct: 323 ENQMDIITAINSLEPAGSTNAEAGLYKGFELAWQAFKPEGINR-ILLCSDGVANSGMTEP 381 Query: 356 LCHEILAKKLLP-VVRYYSYIEITRRAHQTLWREYEHLQSTFDNFAMQHIRDQDDIYPVF 414 ++ L V+ +Y + L + + D Sbjct: 382 SQLLATFQQYLDAGVQLSTYGVGMGNYNDILLEQLADKGDGNYAYF--------DSADEA 433 Query: 415 RELFHKQNATAK 426 + LF +Q + Sbjct: 434 QRLFGEQLTGSL 445 >UniRef50_B1ZYN3 von Willebrand factor type A n=2 Tax=Bacteria RepID=B1ZYN3_OPITP Length = 792 Score = 176 bits (446), Expect = 1e-42, Method: Composition-based stats. Identities = 40/308 (12%), Positives = 79/308 (25%), Gaps = 31/308 (10%) Query: 139 QNQQRQLTEYKTHRAGYTANGVPANISVVRSLQNSLARRTAMTAGKRRELHALEENLAII 198 + + + V + ++ R + +EE + Sbjct: 301 NTEAYRFLRESDFLSAREHPLSTFAADVDTASYANVRRFLREGRLPPADAVRIEELVNYF 360 Query: 199 SNSEPAQ---------LLEEERLRKEIAELRAKIERVPFIDTFDLRYKNYEKRPDPSSQA 249 A E + A + L+ K+ + A Sbjct: 361 PYRYAAPGRVRDEGVAAPGEAPFAAALEVAAAPWAAQHRLVRIGLKAKDAAVSGR--AAA 418 Query: 250 VMFCLMDVSGSMDQSTK-DMAKRFYILLYLFLSRTYKNVEVVYIRHHTQAKEVD------ 302 + L+DVSGSMDQ K + + LL L + V Y + A Sbjct: 419 NLVFLLDVSGSMDQPNKLRLVQESMRLLLGRLQPEDRVAIVTYAGNSGLALPSTPVARQR 478 Query: 303 --EHEFFYSQETGGTIVSSALKLMDEVVKERYNPAQWNIYAAQASDGD-NWADDSPLCHE 359 + G T + L+L ++ K + N +DGD N S Sbjct: 479 EILDAIDELRAGGSTNGAMGLQLAYDIAKANFVANGVNRVI-LCTDGDFNVGVTSEGELV 537 Query: 360 ILAKKLLPVVRYYSY-IEITRRAHQTLWREYEHLQSTFDNFAMQHIRDQDDIYPVFRELF 418 L ++ + + + ++ + + +L Sbjct: 538 RLIEEKAKSGVFLTVLGFGMGNLKDAMLQQIADRGNGSYGYIDTR--------REAEKLL 589 Query: 419 HKQNATAK 426 +Q + Sbjct: 590 VQQVSGTL 597 >UniRef50_A6GHE4 von Willebrand factor, type A n=1 Tax=Plesiocystis pacifica SIR-1 RepID=A6GHE4_9DELT Length = 785 Score = 175 bits (444), Expect = 2e-42, Method: Composition-based stats. Identities = 37/293 (12%), Positives = 70/293 (23%), Gaps = 21/293 (7%) Query: 146 TEYKTHRAGYTANGVPANISVVRSLQNSLARRTAMTAGKRRELHALEENLAIISNSEPAQ 205 + A +I V + S+ + EE + A Sbjct: 250 QVAASFVATGEDRKSTFSIDVDTASYASVRQSLRNGWMPDPGSVRTEEMINYFDYGYVAP 309 Query: 206 LLEE-ERLRKEIAELRAKIERVPFIDTFDLRY-KNYEKRPDPSSQAVMFCLMDVSGSMDQ 263 + ++ + + + L+DVSGSM Sbjct: 310 SGGAGAPFAVHTEVGPCPWAPDHRLVQIGVQATRELPAQAQELRTRNLVFLLDVSGSMSS 369 Query: 264 S-TKDMAKRFYILLYLFLSRTYKNVEVVYIR--------HHTQAKEVDEHEFFYSQETGG 314 + K + L L VVY KE + GG Sbjct: 370 RGKLPLIKHGFTQLVEQLGAEDHVSIVVYAGAAGVVLPPTSGDQKETILGALDRLEAGGG 429 Query: 315 TIVSSALKLMDEVVKERYNPAQWNIYAAQASDGD-NWADDSPLCHEILAKKLLPVVRYYS 373 T S+ + E+ + + N +DGD N L ++ + S Sbjct: 430 TNGSAGIVEAYELAQANFVDGGVNRVILG-TDGDFNVGLSDHDALVELIEQKRESGVFLS 488 Query: 374 YIEITRRAHQTLWREYEHLQSTFDNFAMQHIRDQDDIYPVFRELFHKQNATAK 426 + + L + + F D ++ ++ Sbjct: 489 VLGVGGHYDDELMEQLADHGNGNYAFL--------DGKREAEKVLVEEIGGTL 533 >UniRef50_B4WCU1 von Willebrand factor type A domain protein n=2 Tax=Caulobacteraceae RepID=B4WCU1_9CAUL Length = 613 Score = 172 bits (435), Expect = 2e-41, Method: Composition-based stats. Identities = 40/302 (13%), Positives = 72/302 (23%), Gaps = 27/302 (8%) Query: 135 PNLKQNQQRQLTEYKTHRAGYTANGVPANISVVRSLQNSLARRTAMTAGKRRELHALEEN 194 P + + +I V + +++ R + +EE Sbjct: 131 PAPSDTETYPDATPNPVKRTADQPVSTFSIDVDTAAYSNVRRFIDEGRSPPADAVRVEEL 190 Query: 195 LAIISNSEPAQLLEEERLRKEIAELRAKIER-----VPFIDTFDLRYKNYEKRPDPSSQA 249 + A + + I L+ + Sbjct: 191 INAFDYGYARPTSLARPFAITTAVVASPWAPRTERGGRQIVHIGLQGYELPQGEQR--PL 248 Query: 250 VMFCLMDVSGSMDQSTK-DMAKRFYILLYLFLSRTYKNVEVVYI--------RHHTQAKE 300 + L+DVSGSM K D+AK+ L L Y K Sbjct: 249 NLTFLVDVSGSMRSPDKLDLAKQAMNLAIDRLRPQDTLSVTYYAEGAGTTLQPTPGDQKL 308 Query: 301 VDEHEFFYSQETGGTIVSSALKLMDEVVKERYNPAQWNIYAAQASDGD-NWADDSPLCHE 359 + +GGT ++ + + + + + N +DGD N E Sbjct: 309 KMRCAVASLRASGGTAGATGMTNAYDQAQASFARDKVNR-ILMFTDGDFNVGVTDNKRLE 367 Query: 360 IL-AKKLLPVVRYYSYIEITRRAHQTLWREYEHLQSTFDNFAMQHIRDQDDIYPVFRELF 418 A+K V Y + + + R LF Sbjct: 368 DYVAEKRGTGVYLSVYGFGRGNYQDARMQTIAQAGNGVAAYVGD--------LRDARRLF 419 Query: 419 HK 420 Sbjct: 420 GP 421 >UniRef50_A9NFD9 Surface-anchored VWFA domain protein n=1 Tax=Acholeplasma laidlawii PG-8A RepID=A9NFD9_ACHLI Length = 486 Score = 172 bits (435), Expect = 2e-41, Method: Composition-based stats. Identities = 43/318 (13%), Positives = 91/318 (28%), Gaps = 25/318 (7%) Query: 121 KDEYLDLLFEDLALPNLKQNQQRQLTEYKTHRAGYTANGVPANISVVRSLQNSLARRTAM 180 +DE + +F D + +N ++ ++S + + + + Sbjct: 30 QDENYNYIFNDDEHQEIIENPFIDVSVNNK---------SNISLSANTASYSFIRSQINS 80 Query: 181 TAGKRRELHALEENLAIISNSEPAQLLEEERLRKEIAELRAKIERVPFIDTFDLRYKNYE 240 R +EE + + + + + + ++ + L K + Sbjct: 81 GRAVDRNAVRIEEMVNFFNYNYNQPETD-KTFGFKSELIQTPWNNETHLLLIGLETKQVD 139 Query: 241 KRPDPSSQAVMFCLMDVSGSMDQSTK-DMAKRFYILLYLFLSRTYKNVEVVYI------- 292 PS + L+DVSGSM + K +AK+ LL + V Y Sbjct: 140 LGDIPS---NIVILLDVSGSMSATNKLSLAKKAMELLIEQMKPNDVISLVTYSSGEKVVF 196 Query: 293 -RHHTQAKEVDEHEFFYSQETGGTIVSSALKLMDEVVKERYNPAQWNIYAAQASDGD-NW 350 + + +G T L + +V +E + N A+DGD N Sbjct: 197 KGKSIDDMAYMTSQIRLLKASGSTAGKKGLDMAYKVAEEYFIEGGNNR-IILATDGDFNV 255 Query: 351 ADDSPLCHEILAKKLLP-VVRYYSYIEITRRAHQTLWREYEHLQSTFDNFAMQHIRDQDD 409 S + + + +Y + ++ I + Sbjct: 256 GISSTDMLIEYISEKRESGIYFSAYGFGYGNFKDEKLERVAKAGNGTYHYIDDIISARKA 315 Query: 410 IYPVFRELFHKQNATAKG 427 + + AK Sbjct: 316 FVDNIDGVLYTVARDAKA 333 >UniRef50_A9GVG2 Putative uncharacterized protein n=1 Tax=Sorangium cellulosum 'So ce 56' RepID=A9GVG2_SORC5 Length = 656 Score = 171 bits (433), Expect = 4e-41, Method: Composition-based stats. Identities = 38/289 (13%), Positives = 69/289 (23%), Gaps = 24/289 (8%) Query: 149 KTHRAGYTANGVPANISVVRSLQNSLARRTAMTAGKRRELHALEENLAIISNSEPAQLLE 208 I V + R+ A + EE L + Sbjct: 203 NPVEDPAKDRLSTFAIDVDTASYAIARRKIMDGALPPYQAVRAEEFLNYFDYGYASPAAG 262 Query: 209 EERLRKEIAELRAKIERVPFIDTFDLRYKNYEKRPDPSSQAVMFCLMDVSGSMDQST-KD 267 +A + + ++ K + + + L+D SGSM + Sbjct: 263 --PFAVHLAAAPSPFTSGHHLVRVAVQGKRVPVKER--TPVHLVYLVDTSGSMQSPDKIE 318 Query: 268 MAKRFYILLYLFLSRTYKNVEVVYIRHHTQA--------KEVDEHEFFYSQETGGTIVSS 319 +AK+ +L L Y + K G T +SS Sbjct: 319 LAKKSLKMLTDTLKPGDTVALCTYAGSVREVLAPTGIESKGKILAALADLTAGGSTAMSS 378 Query: 320 ALKLMDEVVKERYNPAQWNIYAAQASDGD-NWADDSPLCHEILAKKLL-PVVRYYSYIEI 377 + L + + N SDGD N S K+ + + Sbjct: 379 GIDLAYSLAERTLVKGHVNRVIVL-SDGDANVGPTSHDEILKTIKRARDKGITLSTVGFG 437 Query: 378 TRRAHQTLWREYEHLQSTFDNFAMQHIRDQDDIYPVFRELFHKQNATAK 426 + + + + D R +F +Q Sbjct: 438 QGNYKDLMMEQLANQGDGNYAYI--------DSEAQARRVFSEQVGGML 478 >UniRef50_Q28U54 von Willebrand factor type A n=3 Tax=Rhodobacterales RepID=Q28U54_JANSC Length = 686 Score = 170 bits (431), Expect = 8e-41, Method: Composition-based stats. Identities = 39/298 (13%), Positives = 73/298 (24%), Gaps = 23/298 (7%) Query: 141 QQRQLTEYKTHRAGYTANGVPANISVVRSLQNSLARRTAMTAGKRRELHALEENLAIISN 200 + + R +I V + L A + +EE + Sbjct: 220 EDFANADDNPLRVVADDPVSTFSIDVDTASYALLRSTLNRGALPAPDAVRIEEMVNYFPY 279 Query: 201 SEPAQLLEE-ERLRKEIAELRAKIERVPFIDTFDLRYKNYEKRPDPSSQAVMFCLMDVSG 259 PA ++ R + + ++ P + L+D SG Sbjct: 280 DYPAPTADDISPFRPNVQVFETPWNPDTQLVHIGIQGDLPVVEDRP--PLNLVFLIDTSG 337 Query: 260 SMDQS-TKDMAKRFYILLYLFLSRTYKNVEVVYIRHHTQAKE--------VDEHEFFYSQ 310 SM+ + + + L+ LS + V Y A E Q Sbjct: 338 SMNDPAKLPLLIQSFRLMLNRLSPEDEVAIVTYAGSAGVALEPTAASDTATINAALTTLQ 397 Query: 311 ETGGTIVSSALKLMDEVVKERYNPAQWNIYAAQASDGD-NWADDSPLCHEILAKKLLP-V 368 G T L+ + E + + A+DGD N E + Sbjct: 398 AGGSTNGVGGLEEAYRLAGEMMVDGEVSRV-LLATDGDFNVGLSDAGALEDYIAEQRDTG 456 Query: 369 VRYYSYIEITRRAHQTLWREYEHLQSTFDNFAMQHIRDQDDIYPVFRELFHKQNATAK 426 + + + ++ D + + Q A A Sbjct: 457 IYLSVLGFGRGNLQDDTMQALAQNGNGTASYI--------DTLHEAQRVLVDQLAGAL 506 >UniRef50_Q1CVN5 von Willebrand factor type A domain protein n=2 Tax=Cystobacterineae RepID=Q1CVN5_MYXXD Length = 700 Score = 169 bits (427), Expect = 2e-40, Method: Composition-based stats. Identities = 39/382 (10%), Positives = 89/382 (23%), Gaps = 28/382 (7%) Query: 56 TEDISEPMFHQGRGGLRHRVHPGNDHFVQNDRIERPQGGGGGSGSGQGQASQDGEGQDEF 115 + I+ G R P + P S + F Sbjct: 149 IKRIAVARPGGRTGATRVYEPPNAMSRPHGVSLNGPPASTLPSQPLGRPGPPKPQSAPRF 208 Query: 116 VFQISKDEYLDLLFEDLALPNLKQNQQRQLTEYKTHRAGYTANGVPANISVVRSLQNSLA 175 V + + + ++ ++ Q ++ + Sbjct: 209 VGRDVEPPAPAVAPAPVSPFHM----YFQGYGVNPTINTEEERFSTFSVDTDSASYTLTR 264 Query: 176 RRTAMTAGKRRELHALEENLAIISNSEPAQLLEEERLRKEIAELRAKIERVPFIDTFDLR 235 + + +EE + Q ++ + + + + ++ Sbjct: 265 AYLERGSLPNEQAVRVEEFVNTFDYGYAHQ--GSAPFSVQVEGFPSPVRKGYHVVHVGVK 322 Query: 236 YKNYEKRPDPSSQAVMFCLMDVSGSMD-QSTKDMAKRFYILLYLFLSRTYKNVEVVYIRH 294 + + S + ++DVSGSM+ ++ + KR LL L + VVY Sbjct: 323 AREVSRPQRKPS--HLVFVIDVSGSMNLENRLGLVKRALHLLVNELDERDQVSIVVYGST 380 Query: 295 --------HTQAKEVDEHEFFYSQETGGTIVSSALKLMDEVVKERYNPAQWNIYAAQASD 346 + G T + L++ + N SD Sbjct: 381 ARLVLEPTSAVHAHIIRAAIDSLHTEGSTNAQAGLEMGYSLAASHLVEGGINRVI-LCSD 439 Query: 347 G-DNWADDSPLCH-EILAKKLLPVVRYYSYIEITRRAHQTLWREYEHLQSTFDNFAMQHI 404 G N E + + + + + L + + Sbjct: 440 GVANTGLTDANSIWERIRARAAKGITLSTVGFGMGNYNDVLMERLSQVGEGNYAYV---- 495 Query: 405 RDQDDIYPVFRELFHKQNATAK 426 D +F + Sbjct: 496 ----DRIEEAHRIFVRDLTGTL 513 >UniRef50_C1XT76 Uncharacterized conserved protein n=1 Tax=Meiothermus silvanus DSM 9946 RepID=C1XT76_9DEIN Length = 360 Score = 168 bits (426), Expect = 3e-40, Method: Composition-based stats. Identities = 85/404 (21%), Positives = 147/404 (36%), Gaps = 54/404 (13%) Query: 17 VNRQRFLRRYKAQIKQSISEAINKRSVTDVDSGESVSIPTEDISEPMFHQGRGGLRHRVH 76 + QRF + ++K+ E + + G+ VSIP + P G + Sbjct: 6 RDLQRFKEIVRGEVKKRAREFLTREEYLGSLDGQVVSIPLPQLELPRLQYGHNEMGQGEG 65 Query: 77 PGNDHFVQNDRIERPQGGGGGSGSGQGQASQDGEGQDEFVFQISKDEYLDLLFEDLALPN 136 G G G V ++ +E+L+L+ E L LP Sbjct: 66 EGEGQGQGMGGTAGRGG--------------LGPSGHVPVAEMDLEEFLELIGEALKLPR 111 Query: 137 LKQNQQRQLTEYKTHRAGYTANGVPANISVVRSLQNSLARRTAMTAGKRRELHALEENLA 196 L+ Q + E + G + R+L+ +L R + + + E Sbjct: 112 LEPKQGGAVEESSPKYTTLSRRGPESLRHARRTLRQALRRAIQSGIYRPEDPRLVPERDD 171 Query: 197 IISNSEPAQLLEEERLRKEIAELRAKIERVPFIDTFDLRYKNYEKRPDPSSQAVMFCLMD 256 RY+ E +P P +QA + +D Sbjct: 172 Y-------------------------------------RYRAPEPKPRPQAQAALVFALD 194 Query: 257 VSGSMDQSTKDMAKRFYILLYLFLSRTYKNVEVVYIRHHTQAKEVDEHEFFYSQETGGTI 316 VSGSM+ + + + ++ + + + Y+ H +A EV E +FF +E GGT Sbjct: 195 VSGSMEGEQLRLVRILSYWITAWVKKHFPRLSRHYLLHDAEAWEVSEEDFFRLREGGGTR 254 Query: 317 VSSALKLMDEVVKERYNPAQWNIYAAQASDGDNWADDSPLCHEILAKKLLPVVRYYSYIE 376 +SS +KL +V++ RY +N Y +DGDNW DD+ E L K LLP + Y Y + Sbjct: 255 LSSGIKLAQQVLE-RYPAQLYNRYVYHFTDGDNWQDDTAEALETL-KALLPTLSLYGYAQ 312 Query: 377 ITRRAHQ-TLWREYEHLQSTFDNFAMQHIRDQDDIYPVFRELFH 419 + R Q + + A + ++ + + L Sbjct: 313 VRSRYGQGRFIDDLRSHFPSDPALATAELGGRESLPSALKRLLG 356 >UniRef50_C8WPE6 von Willebrand factor type A n=1 Tax=Eggerthella lenta DSM 2243 RepID=C8WPE6_EGGLE Length = 555 Score = 167 bits (423), Expect = 6e-40, Method: Composition-based stats. Identities = 48/377 (12%), Positives = 92/377 (24%), Gaps = 28/377 (7%) Query: 66 QGRGGLRHRVHPGNDHFVQNDRIERPQGGGGGSGSGQGQASQDGEGQDEFVFQISKDEYL 125 R R+ G+ F G +G S E + + + Sbjct: 2 HASNRSRGRLAAGSIAFAVLIAGASLAGCSPDGQAGDQLGSAASESEIMAIGSALSETAS 61 Query: 126 DLLFEDLALPNLKQ--NQQRQLTEYKTHRAGYTANGVPANISVVRSLQNSLARRTAMTAG 183 +P+ ++ + + + T+ + V + +L R A Sbjct: 62 TCPPPYPYVPSPSPGGTEEYRALDEPGFLSPATSPLSTLSADVDTASYCNLRRMVAQRYA 121 Query: 184 K---RRELHALEENLAIISNSEPAQLLEEERLRKEIAELRAKIERVPFIDTFDLRYKNYE 240 EE L + P + + + + + + Sbjct: 122 PAVVPAGAVRTEELLNYFDYAYP-EPVGSDLFGVSAQMSDCPWNDQTKLLVMG--FATEK 178 Query: 241 KRPDPSSQAVMFCLMDVSGSMDQSTK-DMAKRFYILLYLFLSRTYKNVEVVYIRH----- 294 + A + L+DVSGSMD K + K + L L+ + V Y Sbjct: 179 DGDASPTGANLVFLIDVSGSMDDPDKLPLVKDSFAALVEGLTERDRVSVVTYASGERVLL 238 Query: 295 ---HTQAKEVDEHEFFYSQETGGTIVSSALKLMDEVVKERYNPAQWNIYAAQASDGD-NW 350 K G T + L+ + + + N ASDGD N Sbjct: 239 EGVPGDDKRRIMRAVDSLVAEGSTNGEAGLEQAYRLAESSFIEGGVNRVVM-ASDGDLNV 297 Query: 351 ADDSPLCHEILAKKLLP-VVRYYSYIEITRRAHQTLWREYEHLQSTFDNFAMQHIRDQDD 409 S ++ V + + ++ D Sbjct: 298 GISSESELHDFVEQKRETGVYLSVLGFGSGNYKDNKMETLADHGNGAYHYI--------D 349 Query: 410 IYPVFRELFHKQNATAK 426 R + + Sbjct: 350 CAEEARRVLGRNLRANL 366 >UniRef50_B0UJ22 von Willebrand factor type A n=1 Tax=Methylobacterium sp. 4-46 RepID=B0UJ22_METS4 Length = 654 Score = 166 bits (421), Expect = 9e-40, Method: Composition-based stats. Identities = 37/298 (12%), Positives = 74/298 (24%), Gaps = 22/298 (7%) Query: 140 NQQRQLTEYKTHRAGYTANGVPANISVVRSLQNSLARRTAMTAGKRRELHALEENLAIIS 199 + R A ++ V + + EE + Sbjct: 166 RDRFANAPEGGFRITREAPVSTVSLGVDTASYGIVRDALNRNHLPPPAAVRTEELINYFP 225 Query: 200 NSEPAQLLEEERLRKEIAELRAKIERVPFIDTFDLRYKNYEKRPDPSSQAVMFCLMDVSG 259 + PA + R + + + +R P A + L+D SG Sbjct: 226 YAYPAPASPDAPFRVTASVFPSPWAEGRKLLHIGIRGYAVAPAERP--PANLVFLVDTSG 283 Query: 260 SMDQSTK-DMAKRFYILLYLFLSRTYKNVEVVYIRHHTQAKEVD--------EHEFFYSQ 310 SM + + K+ +L L + V Y E Q Sbjct: 284 SMAAPNRLPLVKQSLAMLLTTLDARDRVALVAYAGEVGTVLEPTPAGEAGRILAAIETLQ 343 Query: 311 ETGGTIVSSALKLMDEVVKERYNPAQWNIYAAQASDGD-NWADDSPLCHEI-LAKKLLPV 368 G T ++ + ++P N A+DGD N +A++ Sbjct: 344 AHGSTAGGEGIRQAYALAARHFDPKAVNRVI-LATDGDFNVGITGRDELTGFVARERRKG 402 Query: 369 VRYYSYIEITRRAHQTLWREYEHLQSTFDNFAMQHIRDQDDIYPVFRELFHKQNATAK 426 + + L + + D R++ ++ + Sbjct: 403 IFLSVLGFGMGNLNDALMQALAKDGNGVAAHI--------DTAQEARKVLVEEATSTL 452 >UniRef50_C7PR69 von Willebrand factor type A n=1 Tax=Chitinophaga pinensis DSM 2588 RepID=C7PR69_CHIPD Length = 588 Score = 166 bits (420), Expect = 1e-39, Method: Composition-based stats. Identities = 45/290 (15%), Positives = 82/290 (28%), Gaps = 23/290 (7%) Query: 148 YKTHRAGYTANGVPANISVVRSLQNSLARRTAMTAGKRRELHALEENLAIISNSEPAQLL 207 A T + V R+ +++ R + +EE + S P + Sbjct: 144 ENKFIAAETQIPSLFAVDVDRAAYSNIRRFVKLKERIPANAVRIEEMVNYFHYSYPLPPV 203 Query: 208 EEERLRKEIAELRAKIERVPFIDTFDLRYKNYEKRPDPSSQAVMFCLMDVSGSMDQS-TK 266 + L + +R K+ P S + L+DVSGSM Sbjct: 204 GQ-TLAIYSNYATCPWAEDHRLLQIAVRGKSVNLDSLPPS--NLVFLIDVSGSMAMPNKL 260 Query: 267 DMAKRFYILLYLFLSRTYKNVEVVYIR--------HHTQAKEVDEHEFFYSQETGGTIVS 318 + + + +L L V Y AK + Y G T Sbjct: 261 PLLQAAFRILVNNLRSNDHVAIVAYAGVPGVILPSTPGSAKSKILNAIDYLSAGGATAGE 320 Query: 319 SALKLMDEVVKERYNPAQWNIYAAQASDGD-NWADDSPLCHEILAKKLLPVVRYYS-YIE 376 +A+KL ++ +E + N A+DGD N S E L + Sbjct: 321 AAIKLAYQIAEENFIKEGNNRVI-LATDGDFNVGQTSDHDMEQLILGKKETGVLLTCLGF 379 Query: 377 ITRRAHQTLWREYEHLQSTFDNFAMQHIRDQDDIYPVFRELFHKQNATAK 426 + + + + D ++F ++ + Sbjct: 380 GMKNYKDSKLETLSSKGNGNFAYI--------DNLEEASKIFAREFGSTL 421 >UniRef50_D0XSR4 von Willebrand factor type A n=1 Tax=Brevundimonas subvibrioides ATCC 15264 RepID=D0XSR4_9CAUL Length = 625 Score = 165 bits (418), Expect = 2e-39, Method: Composition-based stats. Identities = 34/283 (12%), Positives = 66/283 (23%), Gaps = 19/283 (6%) Query: 139 QNQQRQLTEYKTHRAGYTANGVPANISVVRSLQNSLARRTAMTAGKRRELHALEENLAII 198 ++ R +I V + ++ R + R+ +EE + Sbjct: 144 DTERYPDATPNPVRRVADEPVSTFSIDVDTAAYANVRRFISEGQTPPRDAVRVEEMINYF 203 Query: 199 SNSEPAQLLEEERLRKEIAELRAKI-----ERVPFIDTFDLRYKNYEKRPDPSSQAVMFC 253 +E A + I L+ + Sbjct: 204 DYGYARPGRADEPFAVSTAVAASPWSANAGAGGRQIVHIGLQGYELPAGERR--PLNLTF 261 Query: 254 LMDVSGSMDQSTKD-MAKRFYILLYLFLSRTYKNVEVVYIR--------HHTQAKEVDEH 304 ++DVSGSM K +A++ L+ L + Y K Sbjct: 262 MVDVSGSMQSPDKLGLAQQTMNLIIDRLRPEDRVAVTYYASDVGTAVGPTPGSEKLKLRC 321 Query: 305 EFFYSQETGGTIVSSALKLMDEVVKERYNPAQWNIYAAQASDGD-NWADDSPLCHEILAK 363 G T + + E + ++P + N +DGD N E Sbjct: 322 AVAALNAGGSTAGAQGMVNAYEQAEAAFSPDKVNR-ILMFTDGDFNVGVTDDRRLEDYVA 380 Query: 364 KLLP-VVRYYSYIEITRRAHQTLWREYEHLQSTFDNFAMQHIR 405 + Y + + + Sbjct: 381 DKRGTGIYLSVYGFGRGNYQDARMQTIAQAGNGVAAYVDDLDE 423 >UniRef50_C1AC65 Putative uncharacterized protein n=1 Tax=Gemmatimonas aurantiaca T-27 RepID=C1AC65_GEMAT Length = 642 Score = 165 bits (418), Expect = 2e-39, Method: Composition-based stats. Identities = 48/369 (13%), Positives = 87/369 (23%), Gaps = 24/369 (6%) Query: 69 GGLRHRVHPGNDHFVQNDRIERPQGGGGGSGSGQGQASQDGEGQDEFVFQISKDEYLDLL 128 R + V G G + Sbjct: 98 NRTRETANASQGAEVTRTAAPAIAPSPAPQTRGVAGGMARSVGMPAPASPRRAA-SDEAR 156 Query: 129 FEDLALPNLKQNQQRQLTEYKTHRAGYTANGVPANISVVRSLQNSLARRTAMTAGKRREL 188 +Q E +I V R+ + R + Sbjct: 157 PPRPYPGQPGNREQYDRIEDNPFLGVTGNPLSTFSIDVDRASYGNARRFLQDGQRPPADA 216 Query: 189 HALEENLAIISNSEPAQLLEEERLRKEIAELRAKIERVPFIDTFDLRYKNYEKRPDPSSQ 248 +EE + + + + A + + L+ + E P + Sbjct: 217 VRIEELINYFPYEL-REPRGNDPVAITTEVTTAPWQPRHQLVRIALQSRRIETASLPPN- 274 Query: 249 AVMFCLMDVSGSMDQSTK-DMAKRFYILLYLFLSRTYKNVEVVYIR--------HHTQAK 299 + L+DVSGSM K + K+ LL + + V Y K Sbjct: 275 -NLVFLIDVSGSMQSPDKLPLVKQSLRLLVDQMRPQDRVAIVAYAGAAGLVLPSTSGDEK 333 Query: 300 EVDEHEFFYSQETGGTIVSSALKLMDEVVKERYNPAQWNIYAAQASDGD-NWADDSPLCH 358 E + G T + ++L +E + N ASDGD N S Sbjct: 334 ETIIQAIERLEAGGSTAGGAGIELAYRTAREHFMDHGNNRVI-LASDGDFNVGVSSDGEL 392 Query: 359 EILAKKLLPVVRYYSY-IEITRRAHQTLWREYEHLQSTFDNFAMQHIRDQDDIYPVFREL 417 E L ++ Y + T + + + R++ Sbjct: 393 ERLIERKRTEGTYLTILGFGTGNYQDAKMEKLAKRGNGNYGYVDD--------IAEARKM 444 Query: 418 FHKQNATAK 426 ++ Sbjct: 445 LVREMGATL 453 >UniRef50_Q1D6F9 von Willebrand factor type A domain protein n=2 Tax=Cystobacterineae RepID=Q1D6F9_MYXXD Length = 592 Score = 155 bits (391), Expect = 3e-36, Method: Composition-based stats. Identities = 31/291 (10%), Positives = 67/291 (23%), Gaps = 25/291 (8%) Query: 148 YKTHRAGYTANGVPANISVVRSLQNSLARRTAMTAGKRRELHALEENLAIISNSEPAQLL 207 V + R +EE + Sbjct: 147 ANAFVETAKDPLSTFAADVDTASYTVSRRYLVNGQLPPASAVRVEEFVNYFKFRYAPP-- 204 Query: 208 EEERLRKEIAELRAKIERVPFIDTFDLRYKNYEKRPDPSSQAVMFCLMDVSGSM-DQSTK 266 E + + + ++ K + A + L+D SGSM + Sbjct: 205 ETGAFAVHLEGAPSPFDAKRHFLRVGVQGKVVSRSQRK--PAHLVFLVDTSGSMHSEDKL 262 Query: 267 DMAKRFYILLYLFLSRTYKNVEVVYIRHHTQAKEVDEH--------EFFYSQETGGTIVS 318 +A+ + L+ V Y + GGT + Sbjct: 263 PLAREAIKVAVKNLNENDTVAIVTYAGNTRDVLPPTPATDAKSIHAALDSLTAGGGTAMG 322 Query: 319 SALKLMDEVVKERYNPAQWNIYAAQASDGDNWADDSPLC---HEILAKKLLPVVRYYSYI 375 S ++L ++ + + + +DGD + + + K V + Sbjct: 323 SGMELAYRHAVKKASGSVVSRVV-VLTDGDANIGRNVSANAMLDSIHKYTAEGVTLTTVG 381 Query: 376 EITRRAHQTLWREYEHLQSTFDNFAMQHIRDQDDIYPVFRELFHKQNATAK 426 L + + + D +++F Q Sbjct: 382 FGMGNYRDDLMEKLADKGNGNCFYV--------DSLREAKKVFETQLTGTL 424 >UniRef50_A3TQT5 Putative secreted protein n=1 Tax=Janibacter sp. HTCC2649 RepID=A3TQT5_9MICO Length = 533 Score = 154 bits (389), Expect = 5e-36, Method: Composition-based stats. Identities = 46/358 (12%), Positives = 86/358 (24%), Gaps = 24/358 (6%) Query: 80 DHFVQNDRIERPQGGGGGSGSGQGQASQDGEGQDEFVFQISKDEYLDLLFEDLALPNLKQ 139 + D + + ++ Q S +P + Sbjct: 20 SASGEGDTSAAHDYFENYPTAQENDSASGTSSSSAVGGQAS-SGVAQPFPAAPNVPGPLE 78 Query: 140 NQQRQLTEYKTHRAGYTANGVPANISVVRSLQNSLARRTAMTAGKRRELHALEENLAIIS 199 + + V E EE + Sbjct: 79 DNTFVDAGTSGFIDTRERPRSTFAVDVDGGSFRVARSLLHDGHLPPPESVRPEEWVNSFD 138 Query: 200 NSEPAQLLEEERLRKEIAELRAKIERVPFIDTFDLRYKNYEKRPDPSSQAVMFCLMDVSG 259 + PA ++ L+ + A ++ + + L+ + + R + ++D SG Sbjct: 139 SGFPAPRKDDLELQSDQARASSEDD-GTRLVRIGLQGREVDVREW--QPVALTMVVDTSG 195 Query: 260 SMD-QSTKDMAKRFYILLYLFLSRTYKNVEVVYIRHHTQAKE--------VDEHEFFYSQ 310 SMD + + K LL L V Y T E + Sbjct: 196 SMDIRERLGLVKSSLALLAENLRPDDTIAIVTYQTDATPLLEPTPVRDTDTILAAIDRLE 255 Query: 311 ETGGTIVSSALKLMDEVVKERYNPAQWNIYAAQASDG-DNWADDSPLCHEILAKKL-LPV 368 G T + + L L + +E Y N ASDG N + Sbjct: 256 AGGSTNLEAGLLLGYDQAREAYKQGATN-VVLLASDGVANVGVTDGGRLATAIRDNGRRG 314 Query: 369 VRYYSYIEITRRAHQTLWREYEHLQSTFDNFAMQHIRDQDDIYPVFRELFHKQNATAK 426 + + L + F + D + R+LF + Sbjct: 315 IHLVTVGYGMGNYSDHLMEQLADQGDGFYEYI--------DTFEEARKLFVEDLRATL 364 >UniRef50_Q72KI5 Glycosyltransferase n=3 Tax=Thermus RepID=Q72KI5_THET2 Length = 351 Score = 154 bits (389), Expect = 6e-36, Method: Composition-based stats. Identities = 99/404 (24%), Positives = 156/404 (38%), Gaps = 64/404 (15%) Query: 17 VNRQRFLRRYKAQIKQSISEAINKRSVTDVDSGESVSIPTEDISEPMFHQGRGGLRHRVH 76 + RF + ++K+ + E + + + G VSIP + P G Sbjct: 10 RDLLRFKEIVRGEVKKRVREFLTREELFGQVEGRLVSIPLPQLEIPKIVHGEPLGEGLGL 69 Query: 77 PGNDHFVQNDRIERPQGGGGGSGSGQGQASQDGEGQDEFVFQISKDEYLDLLFEDLALPN 136 G G V ++ +E+LDL+ E L LP Sbjct: 70 GGPGEEALG------------------------PGGHIPVAELELEEFLDLVGEALRLPR 105 Query: 137 LKQNQQRQLTEYKTHRAGYTANGVPANISVVRSLQNSLARRTAMTAGKRRELHALEENLA 196 L+ + ++TE G V R+L+ SL R + + + E Sbjct: 106 LRPKGEGEVTEEALRHTTIARKGPRGLRHVRRTLKESLKRALQSGEYRPEDPLLVPE--- 162 Query: 197 IISNSEPAQLLEEERLRKEIAELRAKIERVPFIDTFDLRYKNYEKRPDPSSQAVMFCLMD 256 DLRYK ++P P +QAV+ +D Sbjct: 163 ----------------------------------REDLRYKAPRRKPIPHAQAVVLFALD 188 Query: 257 VSGSMDQSTKDMAKRFYILLYLFLSRTYKNVEVVYIRHHTQAKEVDEHEFFYSQETGGTI 316 VSGSM + + K + L++ R + +E Y+ H +A EV E EFF ++E GGT Sbjct: 189 VSGSMREEELKLVKTLSFWITLWIKRHFPRLERRYLLHDAEAWEVPEEEFFKAREGGGTR 248 Query: 317 VSSALKLMDEVVKERYNPAQWNIYAAQASDGDNWADDSPLCHEILAKKLLPVVRYYSYIE 376 +SSAL L +E++K Y A +N Y SDG+NW D+PL E L +LLP + Y Y + Sbjct: 249 ISSALLLAEEILK-AYPEAFYNRYLFHFSDGENWQGDTPLALEALR-RLLPSLALYGYAQ 306 Query: 377 ITRRAHQ-TLWREYEHLQSTFDNFAMQHIRDQDDIYPVFRELFH 419 + Q E + A+ +R ++D+ R L Sbjct: 307 VEGPYGQGHFLEEVREALGGREGVALAAVRGREDLPVALRRLLG 350 >UniRef50_B4D1N7 Autotransporter-associated beta strand repeat protein n=1 Tax=Chthoniobacter flavus Ellin428 RepID=B4D1N7_9BACT Length = 1545 Score = 150 bits (378), Expect = 1e-34, Method: Composition-based stats. Identities = 37/308 (12%), Positives = 74/308 (24%), Gaps = 15/308 (4%) Query: 130 EDLALPNLKQNQQRQLTEYKTHRAGYTANGVPANISVVRSLQNSLARRTAMTAGKRRELH 189 + + + + +++V A Sbjct: 1084 DAKEREDKAKETVPTAPIPQPEVQTSANAFSTFSLNVSDVSFKLAAASLEQGHMPDPASV 1143 Query: 190 ALEENLAIISNSEPAQLLEEERLRKEIAELRAKIERVPFIDTFDLRYKNYEKRPDPSSQA 249 EE + +P L R + + F + K P Sbjct: 1144 RSEEFINAFDYRDPEPSPGA-PLAFVTERARYPFAQNRDLLRFAV--KTAAAGRQPGRPL 1200 Query: 250 VMFCLMDVSGSMDQSTK-DMAKRFYILLYLFLSRTYKNVEVVYIRHHTQAKEVDEHEF-- 306 + L+D SGSM+++ + ++ + +L L K V + R + + Sbjct: 1201 NIVLLLDRSGSMERADRVNIVREALSVLAKHLQPQDKLSIVTFARTPHLWADAVAGDKVH 1260 Query: 307 ------FYSQETGGTIVSSALKLMDEVVKERYNPAQWNIYAAQASDGD-NWADDSPLCHE 359 GGT + +AL L E + N +DG N D +P Sbjct: 1261 DVIARVNEITPEGGTNLEAALDLAYETAHHHFAVDSTNRVI-LFTDGAANLGDVNPDALT 1319 Query: 360 ILAKKLLP-VVRYYSYIEITRRAHQTLWREYEHLQSTFDNFAMQHIRDQDDIYPVFRELF 418 + + + + L + F + Sbjct: 1320 KKVEAQRKQGIALDCFGIGWEGYNDDLLEQLTRNADGRYGFINTPEDAAANFATQIAGAL 1379 Query: 419 HKQNATAK 426 + K Sbjct: 1380 QVAASDVK 1387 >UniRef50_B5JNR2 von Willebrand factor type A domain protein n=1 Tax=Verrucomicrobiae bacterium DG1235 RepID=B5JNR2_9BACT Length = 923 Score = 143 bits (359), Expect = 2e-32, Method: Composition-based stats. Identities = 43/379 (11%), Positives = 81/379 (21%), Gaps = 17/379 (4%) Query: 57 EDISEPMFHQGRGGLRHRVHPGNDHFVQNDRIERPQGGGGGSGSGQGQASQDGEGQDEFV 116 D E + N + + + EG D Sbjct: 390 PDHKELRVTFDARPRLPFQNTANSPTPSSPSLAADTFNPEPITENPPSSFNLAEGLDNPN 449 Query: 117 FQISKDEYLDLLFE--DLALPNLKQNQQRQLTEYKTHRAGYTANGVPANISVVRSLQNSL 174 + D LP + T +++V Sbjct: 450 LVAASTANATQTAPNNDEPLPRTQNPPPTTQLSEYPESNTATDPQSTFSLNVSDVSYRLT 509 Query: 175 ARRTAMTAGKRRELHALEENLAIISNSEPAQLLEEERLRKEIAELRAKIERVPFIDTFDL 234 A EE + +P + + + + F L Sbjct: 510 EAYLAQNVRPPAGTLRTEEFVNAFDYGDPTPPVARK-IGFTWERAHWPFAHDRDVLRFSL 568 Query: 235 RYKNYEKRPDPSSQAVMFCLMDVSGSMDQSTK-DMAKRFYILLYLFLSRTYKNVEVVYI- 292 + S + +D SGSM + + D+ L L+ + V + Sbjct: 569 Q--TAAHGRASSQPLHLTLAIDTSGSMSRPDRVDIVNSLATALQSNLTEKDRLSIVSFDR 626 Query: 293 -------RHHTQAKEVDEHEFFYSQETGGTIVSSALKLMDEVVKERYNPAQWNIYAAQAS 345 A+ GGT + SAL+L + + + N + Sbjct: 627 QPRLVLDGQSVTAETNLATLATQLNPQGGTDLESALQLSYQTAQRHFQENAINRVI-LIT 685 Query: 346 DGD-NWADDSPLCHEILAKKLL-PVVRYYSYIEITRRAHQTLWREYEHLQSTFDNFAMQH 403 DG N + + + + + T F Sbjct: 686 DGAANLGNTNAEQLRTTVTENRIRGIALDCFGIGFDGHDDTFLESLSRNGDGRYRFLRSP 745 Query: 404 IRDQDDIYPVFRELFHKQN 422 ++ P L Sbjct: 746 EDAALELGPKLAGLLRPAA 764 >UniRef50_Q9HRD6 UPF0229 protein VNG_0746C n=11 Tax=Halobacteriaceae RepID=Y746_HALSA Length = 442 Score = 141 bits (354), Expect = 6e-32, Method: Composition-based stats. Identities = 88/447 (19%), Positives = 162/447 (36%), Gaps = 56/447 (12%) Query: 17 VNRQRFLRRYKAQIKQSISEAINKRSVTDVDSGESVSIPTEDISEPMFHQGRGGLRHRVH 76 +R+RF + + +Q ++E I + ++V +P + + P F + R V Sbjct: 5 EDRERFHEIGEQR-RQDLAEFIQYGDLGGSGP-DAVRVPIKLVDLPAFEYDQ-LDRGGVG 61 Query: 77 PGNDHFVQNDRIERPQGGGGGSGSGQGQASQDGEGQDEFVFQISKDEYLDLLFEDLALPN 136 G+ G +G G + D E+ +++ +E+ L E L L + Sbjct: 62 QGDVD------PGDQVGEPDEAGEGDDDEAGDESADHEY-YEMDPEEFAAELDERLGL-D 113 Query: 137 LKQNQQRQLTEYKTHRAGYTANGVPANISVVRSLQNSLARRTAM---------------- 180 L ++ + E + G + + L R+ AM Sbjct: 114 LDPKGKKVVAETEGAFNETARRGPRGTLDFAHLYKQGLKRKIAMDFDEAYVTAALRVDGW 173 Query: 181 ----------TAGKRRELHALEENLAIISNSEPAQLLEEERLRKEIA------------- 217 + E S + A + ++ + I Sbjct: 174 GVDAVYTWAREQHIPVSRAWIAERARSPSPDDDAGRVVDDAVWASIDAMEAAVDVEPTRT 233 Query: 218 ELRAKIERVPFIDTFDLRYKNYEKRPDPSSQAVMFCLMDVSGSMDQSTKDMAKRFYILLY 277 +R + D R+++ + V+ + DVSGSM +S +++ +R + L Sbjct: 234 RIRRGGPGRVPLRREDERFRHPKVVEHRERNVVVVNIRDVSGSMRESKRELVERTFTPLD 293 Query: 278 LFLSRTYKNVEVVYIRHHTQAKEVDEHEFFYSQETGGTIVSSALKLMDEVVKERYNPAQW 337 +L+ Y N E VYI H A EVD +FF Q GGT +S+A +L + V+ E Y ++W Sbjct: 294 WYLTGKYDNAEFVYIAHDADAWEVDRTDFFGIQSGGGTRISTAYELAENVLDE-YPFSEW 352 Query: 338 NIYAAQASDGDNWADDSPLCHEILAKKLLPVVRYYSYIEITRRAHQTL---WREYEHLQS 394 N Y A DG+N DD+ L + ++Y+E + Sbjct: 353 NRYVFAAGDGENSHDDTEENVIPLMNDI--DANLHAYVETQPTDGVQTGTHAGKVRDAFG 410 Query: 395 TFDNFAMQHIRDQDDIYPVFRELFHKQ 421 DN A+ + + DD+ + + Sbjct: 411 DTDNVAVTTVTEPDDVMGAIETILSTE 437 >UniRef50_C1RGW7 Uncharacterized protein containing a von Willebrand factor type A (VWA) domain n=1 Tax=Cellulomonas flavigena DSM 20109 RepID=C1RGW7_9CELL Length = 500 Score = 141 bits (354), Expect = 7e-32, Method: Composition-based stats. Identities = 42/341 (12%), Positives = 77/341 (22%), Gaps = 35/341 (10%) Query: 97 GSGSGQGQASQDGEGQDEFVFQISKDEYLDLLFEDLALPNLKQNQQRQLTEYKTHRAGYT 156 G+ S G A + + +Q + EDL P R Sbjct: 22 GACSAGGSADGTADWEGSGAYQPGPYQ------EDLPYPEPGPTGPTAAGMTDPARD--- 72 Query: 157 ANGVPANISVVRSLQNSLARRTAMTAGKRRELHALEENLAIISNSEPAQLLEEERLRKEI 216 + V EE + + L I Sbjct: 73 -ALSTFALDVDTGAYTRFRDAVRQGFSVDPFGVRTEEFVNYFAQDYEPPAEG---LGVSI 128 Query: 217 AELRAKIERVPFIDTFDLRYKNYEKRPDPSSQAVMFCLMDVSGSMDQS-TKDMAKRFYIL 275 + + + + A + ++D SGSMD++ + K Sbjct: 129 DATALPFRPDHRLVRVGI--SSAPASAVSRADADLVLVVDCSGSMDEAGKMETTKYALRT 186 Query: 276 LYLFLSRTYKNVEVVYIRHHTQAKEVD--------EHEFFYSQETGGTIVSSALKLMDEV 327 L L RT + V Y E T ++ L L ++ Sbjct: 187 LVSSLRRTDRVAMVCYSTEADVYLEPTPVAEREGVLAAIDRLAPRDSTNAAAGLALGYDL 246 Query: 328 VKERYNPAQWNIYAAQASDG-DNWADDSPL-CHEILAKKLLPVVRYYSYIEITRRAHQTL 385 + SDG N + P ++ + + S + L Sbjct: 247 AMSMRTEGRLTRVV-LVSDGVANVGETDPEGILARISSQAKAGISLISVGVGITTYNDHL 305 Query: 386 WREYEHLQSTFDNFAMQHIRDQDDIYPVFRELFHKQNATAK 426 + + + D +F + Sbjct: 306 LEQLADQGDGWHVYV--------DGEAEAERVFATGLTGSL 338 >UniRef50_Q1GJ99 Putative uncharacterized protein n=1 Tax=Ruegeria sp. TM1040 RepID=Q1GJ99_SILST Length = 300 Score = 140 bits (352), Expect = 1e-31, Method: Composition-based stats. Identities = 122/299 (40%), Positives = 182/299 (60%), Gaps = 13/299 (4%) Query: 139 QNQQRQLTEYKTHRAGYTANGVPANISVVRSLQNSLARRTAMTAGKRRELHALEENLAI- 197 + + T RAG T G P N+++VR+++NSL RR A+ + LE +A Sbjct: 2 EKATVETETIGTRRAGLTTAGTPNNLNLVRTMRNSLGRRIALQRPSTQTQRDLEAQVAEL 61 Query: 198 --ISNSEPAQLLEEERLRKEIAELRAKIERVPFIDTFDLRYKNYEKRPDPSSQAVMFCLM 255 I P+Q L ++ ++ K V +ID DLRY + +S+AV+FCLM Sbjct: 62 EEIEARSPSQDELLAELVAKLDGIKRKRRVVGYIDPLDLRYDTFVPEKIRNSRAVVFCLM 121 Query: 256 DVSGSMDQSTKDMAKRFYILLYLFLSRTYKNVEVVYIRHHTQAKEVDEHEFFYSQETGGT 315 DVSGSM + KD+AKRF++LL+LFL+R Y++ E+V++RH A+EVDE FFY++ETGGT Sbjct: 122 DVSGSMQEREKDLAKRFFLLLHLFLTRGYEHTEIVFVRHTHYAQEVDEETFFYARETGGT 181 Query: 316 IVSSALKLMDEVVKERYNPAQWNIYAAQASDGDNWADDSPLCHEILAKKLLPVVRYYSYI 375 IVS+AL+ M E++ ERY P +WNIY AQASDG+N+ +DS C +IL ++LLP+ ++Y+Y+ Sbjct: 182 IVSTALEKMKEIIDERYPPDEWNIYGAQASDGENFGNDSVRCRKILTEQLLPMCQFYAYV 241 Query: 376 EI----------TRRAHQTLWREYEHLQSTFDNFAMQHIRDQDDIYPVFRELFHKQNAT 424 EI A + LW+ Y ++ +F MQ + + IYP+FRE F + Sbjct: 242 EIVEESAQMLLDNTEAGEDLWQNYRQVKEACRHFEMQRVSEPGHIYPIFREFFLPKVKG 300 >UniRef50_UPI0001AEB92D inter-alpha-trypsin inhibitor domain-containing protein n=1 Tax=Alteromonas macleodii ATCC 27126 RepID=UPI0001AEB92D Length = 586 Score = 135 bits (340), Expect = 3e-30, Method: Composition-based stats. Identities = 33/231 (14%), Positives = 58/231 (25%), Gaps = 23/231 (9%) Query: 208 EEERLRKEIAELRAKIERVPFIDTFDLRYKNYEKRPDPSSQAVMFCLMDVSGSMDQSTKD 267 + A + + + L + + + ++D SGSM Sbjct: 273 QPSPSSAPQAAIFTESKGQHDYALVMLMPPQVKSQDLQDFDRDITFVIDTSGSMGGRPIV 332 Query: 268 MAKRFYILLYLFLSRTYKNVEVVYIRHHTQAKEV-----------DEHEFFYSQETGGTI 316 AK L LS + V + T+ E + GGT Sbjct: 333 DAKESLQLAIDRLSEKDRFNVVAFNNDTTRLFETSVEGTTRNKQYARDFVKHLNAGGGTE 392 Query: 317 VSSALKLMDEVVKERYNPAQWNIYAAQASDGDNWADDSPLCHEILAKKLLPVVRYYSYIE 376 ++ AL + R + +DG A + K L R ++ Sbjct: 393 MAPALNAALK----RTTTKDFIKQVVFITDG---AVGNEAALFSQIKNELGDARLFTVGI 445 Query: 377 ITRRAHQTLWREYEHLQSTFDNFAMQHIRDQDDIYPVFRELFHKQNATAKG 427 + F R+ DI L +K + Sbjct: 446 G-SAPNSYFMTRAAQFGLGSYVFV----RNTADIKQQMDSLLYKLESPVLS 491 >UniRef50_Q10RY0 Os03g0142500 protein n=2 Tax=Oryza sativa RepID=Q10RY0_ORYSJ Length = 694 Score = 134 bits (336), Expect = 8e-30, Method: Composition-based stats. Identities = 48/319 (15%), Positives = 91/319 (28%), Gaps = 30/319 (9%) Query: 133 ALPNLKQNQQRQLTEYKTHRAGYTANGVPANISVVRSLQ----NSLARRTAMTAGKRREL 188 LP + Q + + +SVVR L +L ++ A+ + Sbjct: 123 ELP-FQGTQPGDTAYGRARVSTVNWPQDEGQMSVVRRLSHGYSGNLQQQLAVFRTPEASI 181 Query: 189 HALEENLA----IISNSEPAQLLEEERLRKEIAELRAKIERVPFIDTFDLRYKNYEKRPD 244 +EN+ + + E + E ++ R F L+ Sbjct: 182 FNDDENIDPQSETVDDHNAVTNSVEIKTYSEFPAIQKSERRKVFAILIHLKAPKSLDSVS 241 Query: 245 PSSQAVMFCLMDVSGSMDQSTKDMAKRFYILLYLFLSRTYKNVEVVYIRHHTQA------ 298 + + ++DVSGSM + KR + L + V + + Sbjct: 242 SRAPLDLVTVLDVSGSMSGIKLSLLKRAMSFVIQTLGPNDRLSVVAFSSTAQRLFPLRRM 301 Query: 299 ----KEVDEHEFFYSQETGGTIVSSALKLMDEVVKERYNPAQWNIYAAQASDGDNWA--- 351 ++ +GGT ++ ALK +VVK+R + SDG + Sbjct: 302 TLTGRQQALQAISSLVASGGTNIADALKKGAKVVKDRRRKNPVSS-IILLSDGQDTHSFL 360 Query: 352 ------DDSPLCHEILAKKLLPVVRYYSYIEITRRAHQTLWREYEHLQSTFDNFAMQHIR 405 + S L + V+ +++ T + +F Sbjct: 361 SGEADINYSILVPPSILPGTSHHVQIHTFGFGT-DHDSAAMHAIAETSNGTFSFIDAEGS 419 Query: 406 DQDDIYPVFRELFHKQNAT 424 QD L Sbjct: 420 IQDAFAQCMGGLLSVVVKD 438 >UniRef50_D2BAS2 von Willebrand factor type A domain protein n=12 Tax=Actinomycetales RepID=D2BAS2_STRRD Length = 490 Score = 133 bits (335), Expect = 9e-30, Method: Composition-based stats. Identities = 35/278 (12%), Positives = 69/278 (24%), Gaps = 26/278 (9%) Query: 160 VPANISVVRSLQNSLARRTAMTAGKRRELHALEENLAIISNSEPAQLLEEERLRKEIAEL 219 + V + R EE + ++ + Sbjct: 64 STFALDVDTASYGYAKRILQEGRLPEPGQIRPEEFVNSFRQDYKEP--GDDGFTVHMDGA 121 Query: 220 RAKIERVPFIDTFDLRYKNYEKRPDPSSQAVMFCLMDVSGSMDQS-TKDMAKRFYILLYL 278 R + L+ + + P+ A + ++DVSGSM + D+ + L Sbjct: 122 RMPEN-GTALIRVGLQTR--KAEPEARRPANLTFVVDVSGSMGEPGRLDLVREALHKLVD 178 Query: 279 FLSRTYKNVEVVYIRH--------HTQAKEVDEHEFFYSQETGGTIVSSALKLMDEVVKE 330 L + V + ++ T + + L Sbjct: 179 QLGPGDQVSIVAFSTQARLVLSMTPATGRDQLHAAIDRLGVEDSTNLETGLTAGYAEAAR 238 Query: 331 RYNPAQWNIYAAQASDG-DNWADDS-PLCHEILAKKLLPVVRYYSYIEITRRAHQTLWRE 388 + PA N SDG N D + + +A+ + L + Sbjct: 239 AFRPAATNRVI-LLSDGLANTGDTTWQGILDRVAESAGRQITLLCVGVGR-DYGDQLMEQ 296 Query: 389 YEHLQSTFDNFAMQHIRDQDDIYPVFRELFHKQNATAK 426 + R++F +Q AT Sbjct: 297 LADNGDGAAVYVSSADD--------ARKVFVEQLATNL 326 >UniRef50_C5CDX4 von Willebrand factor type A n=1 Tax=Kosmotoga olearia TBF 19.5.1 RepID=C5CDX4_KOSOT Length = 730 Score = 133 bits (333), Expect = 1e-29, Method: Composition-based stats. Identities = 24/197 (12%), Positives = 56/197 (28%), Gaps = 14/197 (7%) Query: 239 YEKRPDPSSQAVMFCLMDVSGSMDQSTKDMAKRFYILLYLFLSRTYKNVEVVYIRHHTQA 298 + P+ + ++D+SGSM + AK + + L + + + Sbjct: 264 PPREPERIIPKDIVFILDISGSMSGQKIEKAKLALLQVLQMLHEGDRFSIITFNNEVNNL 323 Query: 299 KEVDEHEFFY---------SQETGGTIVSSALKLMDEVVKERYNPAQWNIYAAQASDGDN 349 E G T + AL EV+ + ++ +DG Sbjct: 324 TERLLPFSDRTEWYPAVKQIMAGGMTNIHDALLEGIEVLGTQSTDDRY-KVVLFLTDGAP 382 Query: 350 W-ADDSPLCHEILAKKLLP--VVRYYSYIEITRRAHQTLWREYEHLQSTFDNFAMQHIRD 406 + KL V + + + L E + +++ Sbjct: 383 TEGITDIGTIIRDSTKLAKVRDVHLFVFGVGY-DVNAELLDELAEKGGGKVKYIVENEEI 441 Query: 407 QDDIYPVFRELFHKQNA 423 + + ++R + + Sbjct: 442 DEKVLELYRMIETPVMS 458 >UniRef50_C7PTX8 von Willebrand factor type A n=1 Tax=Chitinophaga pinensis DSM 2588 RepID=C7PTX8_CHIPD Length = 462 Score = 133 bits (333), Expect = 2e-29, Method: Composition-based stats. Identities = 30/190 (15%), Positives = 52/190 (27%), Gaps = 13/190 (6%) Query: 241 KRPDPSSQAVMFCLMDVSGSMDQSTKDMAKRFYILLYLFLSRTYKNVEVVYI-------- 292 + P + ++D SGSM A++ L L+ T V Y Sbjct: 73 EASKPRVPLNISLVLDRSGSMSGDKIKYARQAAKFLIDQLNSTDHLSIVNYDDRVEVTSP 132 Query: 293 RHHTQAKEVDEHEFFYSQETGGTIVSSALKLMDEVVKERYNPAQWNIYAAQASDG-DNWA 351 + KE + + G T +S + VK N +DG N Sbjct: 133 SQSVKNKEALKAAIDKIHDRGSTNLSGGMLEGYTQVKSTRKEGYVNRV-LLLTDGLANQG 191 Query: 352 DDSPLCHEILAKKLLP--VVRYYSYIEITRRAHQTLWREYEHLQSTFDNFAMQHIRDQDD 409 PL + LA+ + ++ ++ L F + Sbjct: 192 ITDPLELKRLAENKYKEDGIALSTFGVGA-DYNEDLLTMLAENGRANYYFIDSPDKIPQI 250 Query: 410 IYPVFRELFH 419 + L Sbjct: 251 FAGELKGLLS 260 >UniRef50_A7C0I1 von Willebrand factor type A domain protein n=5 Tax=Bacteria RepID=A7C0I1_9GAMM Length = 367 Score = 131 bits (328), Expect = 6e-29, Method: Composition-based stats. Identities = 30/191 (15%), Positives = 56/191 (29%), Gaps = 20/191 (10%) Query: 247 SQAVMFCLMDVSGSM-DQSTKDMAKRFYILLYLFLSRTYKNVEVVYIRHHTQAKEVD--- 302 A + L+DVSGSM + K LL L+ K VVY E Sbjct: 2 PPANLVFLVDVSGSMRSNHKLALLKSALKLLSNQLTEKDKVSLVVYAGAAGVVLEPTPGH 61 Query: 303 -----EHEFFYSQETGGTIVSSALKLMDEVVKERYNPAQWNIYAAQASDGD-NWADDSPL 356 G T S+ + L + ++ + N A+DGD N Sbjct: 62 QSVKINGALERLTAGGSTHGSAGIHLAYNLAEQAFIKNGINR-ILLATDGDFNVGTVDFE 120 Query: 357 CHEILAKKLLP-VVRYYSYIEITRRAHQTLWREYEHLQSTFDNFAMQHIRDQDDIYPVFR 415 + L ++ + + + L + + + D + Sbjct: 121 ALKNLVEEKRKSGISLTTLGFGRGNYNDQLMEQLADAGNGNYAYI--------DTLNEAQ 172 Query: 416 ELFHKQNATAK 426 ++ + ++ Sbjct: 173 KVLVDEMSSTL 183 >UniRef50_A8H5J9 LPXTG-motif cell wall anchor domain n=1 Tax=Shewanella pealeana ATCC 700345 RepID=A8H5J9_SHEPA Length = 789 Score = 130 bits (325), Expect = 1e-28, Method: Composition-based stats. Identities = 26/235 (11%), Positives = 54/235 (22%), Gaps = 26/235 (11%) Query: 213 RKEIAELRAKIERVPFIDTFDLRYKNYEKRPDPSSQAVMFCLMDVSGSMDQSTKDMAKRF 272 L ++ S + ++D SGSM AK Sbjct: 363 ASSSEASTEPQTASEKYGLVMLMPPQGAEQQPSSIHRELILVIDTSGSMSGDAIIQAKTA 422 Query: 273 YILLYLFLSRTYKNVEVVYIRHHT-----------QAKEVDEHEFFYSQETGGTIVSSAL 321 L T K V + ++ + GGT +S A+ Sbjct: 423 LKYALAGLRPTDKFNIVQFNSDVDKWSGMAMSATPYNLAQAQNYINRLEANGGTEMSIAI 482 Query: 322 KLMDEV-----------VKERYNPAQWNIYAAQASDGDNWADDSPLCHEILAKKLLPVVR 370 + + + +DG A + L + L R Sbjct: 483 NAALNIETVTDKETGTELDNNDLGSNLLRQVLFITDG---AVSNESMLFELIEAQLGDSR 539 Query: 371 YYSYIEITRRAHQTLWREYEHLQSTFDNFAMQHIRDQDDIYPVFRELFHKQNATA 425 ++ + + L + + + + +++ Q Sbjct: 540 LFTIGIG-SAPNAHFMQRAAQLGRGTYTYIGKLDEVNQKVVSLLKKIEKPQVTDV 593 >UniRef50_B4X134 Vault protein inter-alpha-trypsin n=1 Tax=Alcanivorax sp. DG881 RepID=B4X134_9GAMM Length = 657 Score = 130 bits (325), Expect = 1e-28, Method: Composition-based stats. Identities = 43/316 (13%), Positives = 81/316 (25%), Gaps = 32/316 (10%) Query: 128 LFEDLALPNLKQNQQRQLTEYKTHRAGYTANGVPANISVVRSLQNSLARRTAMTAGKRRE 187 L P + L G TA+ A++ + ++ AR + + Sbjct: 175 LTPRFTPPTEAPHTLDSLLRNTVAAPGGTADAGTASVHID---LDAGARLATLGSPSHAI 231 Query: 188 LHALEENLAIISNSEPAQLLEEERLRKEIAELRAKIERVPFIDTFD------LRYKNYEK 241 + I+ A ++ + L E + F + D L + Sbjct: 232 HYQRHGRRYTITPKAGAIAMDRDLLLNWELEDTGEPLVTRFHEEIDGEHYALLMVVPPKT 291 Query: 242 RPDPSSQAVMFCLMDVSGSMDQSTKDMAKRFYILLYLFLSRTYKNVEVVYIRHHTQAKEV 301 + ++D SGSM + AK L L + + HT E Sbjct: 292 GQVTALPRETLFIIDSSGSMGGAPMRQAKASLHLALQRLKPGDRFNITDFDSQHTLLFET 351 Query: 302 -----------DEHEFFYSQETGGTIVSSALKLMDEVVKERYNPAQWNIYAAQASDGDNW 350 + Q +GGT + AL + +DG Sbjct: 352 PVTVSDNSRQQAQDFVDGLQASGGTHMLPALSATLSQ----PASDGYLRQVIFITDG--- 404 Query: 351 ADDSPLCHEILAKKLLPVVRYYSYIEITRRAHQTLWREYEHLQSTFDNFAMQHIRDQDDI 410 A + + L R ++ + + DQ+ + Sbjct: 405 AVGNESGIFRALHQQLGEARLFTVGIG-SAPNSHFMTRAAQFGRGSFTYI----NDQNQV 459 Query: 411 YPVFRELFHKQNATAK 426 LF + + Sbjct: 460 QQGMDTLFRRLESPLM 475 >UniRef50_Q22SJ4 von Willebrand factor type A domain containing protein n=8 Tax=Tetrahymena thermophila RepID=Q22SJ4_TETTH Length = 646 Score = 129 bits (324), Expect = 2e-28, Method: Composition-based stats. Identities = 24/203 (11%), Positives = 56/203 (27%), Gaps = 18/203 (8%) Query: 238 NYEKRPDPSSQAVMFCLMDVSGSMDQSTKDMAKRFYILLYLFLSRTYKNVEVVYIRHHTQ 297 ++ + C++D SGSM K + L L+ + +++ + T Sbjct: 197 QVKQVEQSRPSIDLVCVIDNSGSMQGEKIQNVKTTLLQLLDMLNSNDRLSLILFNSYPTL 256 Query: 298 AKEV----------DEHEFFYSQETGGTIVSSALKLMDEVVKERYNPAQWNIYAAQASDG 347 + + GGT ++S + + ++++R + SDG Sbjct: 257 LCNLRKVDDENTPNIQSIINSITADGGTDINSGMLMAFNILQKRQFFNPVSS-IFLLSDG 315 Query: 348 DNWADDSPLCHEILAKK----LLPVVRYYSYIEITRRAHQTLWREYEHLQSTFDNFAMQH 403 + D + +S+ L L+ + + Sbjct: 316 QDNGAD--EKIKKYINSNQSLKNECFSIHSFGFG-SDHDGPLMNRICQLKDGNFYYVEKI 372 Query: 404 IRDQDDIYPVFRELFHKQNATAK 426 + + LF Sbjct: 373 NQVDEFFVDALGGLFSVVAQEIL 395 >UniRef50_UPI00006CDDCC von Willebrand factor type A domain containing protein n=1 Tax=Tetrahymena thermophila RepID=UPI00006CDDCC Length = 2138 Score = 129 bits (323), Expect = 3e-28, Method: Composition-based stats. Identities = 41/303 (13%), Positives = 90/303 (29%), Gaps = 46/303 (15%) Query: 135 PNLKQNQQRQLTEYKTHRAGYTANGVPANISVVRSLQNSLARRTAMTAGKRRELHALEEN 194 P +++ Q + + + + IS + L S+ R+ K ++ E+ Sbjct: 1356 PKIEEKDQSEGQQEEFEQNE---THSLRKISQKKVLIKSIQRKVKTNKEKVQKALNEEDK 1412 Query: 195 LAIISNSEPAQLLEEERLRKEIAELRAKIERVPFIDTFDLRYKNYEKRPDPSSQAVMFCL 254 +Q K I+ + + P + C+ Sbjct: 1413 EN----QTKSQQHRISSNVKNISGQFSLGQLQPMRF-----------------PIDLICV 1451 Query: 255 MDVSGSMDQSTKDMAKRFYILLYLFLSRTYKNVEVVYIRHHTQAKEV-----------DE 303 +D SGSM+ D+ K + L L + + + + + + + Sbjct: 1452 IDTSGSMNGQPLDLLKETLLFLVDLLQTGDRICLIQFSTNAQRLTPLLSIESKDNIKSIK 1511 Query: 304 HEFFYSQETGGTIVSSALKLMDEVVKERYNPAQWNIYAAQASDGDNWADDS--PLCHEIL 361 +E GGT + ++L +V+K+R SDG N ++ + L Sbjct: 1512 NEINRLVAKGGTNICQGMQLAFDVLKQRRYKNPITSV-FLLSDGLNDGAENKIRDLLKQL 1570 Query: 362 A-----KKLLPVVRYYSYIEITRRAHQTLWREYEHLQSTFDNFAMQHIRDQDDIYPVFRE 416 + ++ + + + L + L + R + Sbjct: 1571 NFYQNYNEENFTIQTFGFGK---DHDPNLMDKISQLMDGNFYYIGDIHRIDECFIDALGG 1627 Query: 417 LFH 419 LF Sbjct: 1628 LFS 1630 >UniRef50_A4C730 Putative uncharacterized protein n=1 Tax=Pseudoalteromonas tunicata D2 RepID=A4C730_9GAMM Length = 684 Score = 129 bits (323), Expect = 3e-28, Method: Composition-based stats. Identities = 29/205 (14%), Positives = 53/205 (25%), Gaps = 23/205 (11%) Query: 233 DLRYKNYEKRPDPSSQAVMFCLMDVSGSMDQSTKDMAKRFYILLYLFLSRTYKNVEVVYI 292 L + E + ++D SGSM + + AK L + + Sbjct: 314 MLMPPSDEFIAAQRLPREVIFVIDTSGSMHGESLEQAKSALFFALANLDPQDSFNIIEFN 373 Query: 293 R-----------HHTQAKEVDEHEFFYSQETGGTIVSSALKLMDEVVKERYNPAQWNIYA 341 + + + + GGT + L E V + A + Sbjct: 374 SKVNALNAQALPANDFNIRRARNFVYGLKADGGTEIG----LAFEQVLDNSEHADYLRQI 429 Query: 342 AQASDGDNWADDSPLCHEILAKKLLPVVRYYSYIEITRRAHQTLWREYEHLQSTFDNFAM 401 +DG + + K L R ++ + L F Sbjct: 430 VFLTDG---SISNETEVFAQIKGSLGDSRIFTIGIG-SAPNSYFMTRAATLGRGTFTFI- 484 Query: 402 QHIRDQDDIYPVFRELFHKQNATAK 426 D D+ + LF + A Sbjct: 485 ---GDVTDVQRTMKNLFVQLANAAL 506 >UniRef50_C5WYU9 Putative uncharacterized protein Sb01g047460 n=3 Tax=Andropogoneae RepID=C5WYU9_SORBI Length = 698 Score = 128 bits (322), Expect = 3e-28, Method: Composition-based stats. Identities = 37/321 (11%), Positives = 83/321 (25%), Gaps = 29/321 (9%) Query: 131 DLALPNLKQNQQRQLTEYKTHRAGYTANGVPANISVVR----SLQNSLARRTAMTAGKRR 186 L Q + +++VVR + +L Sbjct: 114 RAEWKELPGAQPADANYGRARVNPLNWPQDEGHMAVVRRLSHTYSGNLQEHLPFFRTLEA 173 Query: 187 ELHALEENLAIISNSEPAQLLEEERLRK----EIAELRAKIERVPFIDTFDLRYKNYEKR 242 + +E++ + S+ ++ E + + + F LR Sbjct: 174 GIFNDDEHIDLQSDMNDEHNAITGSVKIKAYSEFPAIEQSVTKEIFAILIHLRAPKSSHS 233 Query: 243 PDPSSQAVMFCLMDVSGSMDQSTKDMAKRFYILLYLFLSRTYKNVEVVYIRHHTQA---- 298 + + ++DVSGSM + + K + L + + + + Sbjct: 234 ASSRAPLDLVTVLDVSGSMAGTKIALLKNAMSFVIQTLGPNDRLSVIAFSSTARRLFPLR 293 Query: 299 ------KEVDEHEFFYSQETGGTIVSSALKLMDEVVKERYNPAQWNIYAAQASDGDNWAD 352 ++ +GGT ++ LK +V+++R SDG + Sbjct: 294 RMTLAGRQQALQAVSSLVASGGTNIADGLKKGAKVIEDRRLKNPV-CSIILLSDGQDTYT 352 Query: 353 -DSPLCHEILAKKL--------LPVVRYYSYIEITRRAHQTLWREYEHLQSTFDNFAMQH 403 S + + V+ +++ + S +F Sbjct: 353 LPSDRNLLDYSALVPPSILPGTGHHVQIHTFGFG-SDHDSAAMHAIAEISSGTFSFIDAE 411 Query: 404 IRDQDDIYPVFRELFHKQNAT 424 QD L Sbjct: 412 GSIQDGFAQCIGGLLSVVVKE 432 >UniRef50_Q9M1S2 Putative uncharacterized protein T5N23_140 n=5 Tax=Arabidopsis thaliana RepID=Q9M1S2_ARATH Length = 676 Score = 127 bits (318), Expect = 1e-27, Method: Composition-based stats. Identities = 28/275 (10%), Positives = 67/275 (24%), Gaps = 29/275 (10%) Query: 175 ARRTAMTAGKRRELHALE--ENLAIISNSEPAQLLEEERLRKEIAELRAKI--------- 223 R + E E + E + + Sbjct: 158 QRAITQGHPEPATFDDDERLEEQIVFDGETEVLKKENRDYVRMMDMKVYPEVSAVPQSKS 217 Query: 224 --ERVPFIDTFDLRYKNYEKRPDPSSQAVMFCLMDVSGSMDQSTKDMAKRFYILLYLFLS 281 + + + + ++D+SGSM + + KR + L Sbjct: 218 CENFDVLVHLKAVTGDQIS--QYRRAPIDLVTVLDISGSMGGTKLALLKRAMGFVIQNLG 275 Query: 282 RTYKNVEVVYIRH----------HTQAKEVDEHEFFYSQETGGTIVSSALKLMDEVVKER 331 + + + + +++ GGT + L+ +V+++R Sbjct: 276 SSDRLSVIAFSSTARRLFPLTRMSDAGRQLALQAVNSLVANGGTNIVDGLRKGAKVMEDR 335 Query: 332 YNPAQWNIYAAQASDGDNWADDSPLCHEILAKKLLPVVRYYSYIEITRRAHQTLWREYEH 391 SDG + + +LP + +S+ ++ Sbjct: 336 LERNSVAS-IILLSDGRDTYTTNHPDPSYKV--MLPQISVHSFGFG-SDHDASVMHSVSE 391 Query: 392 LQSTFDNFAMQHIRDQDDIYPVFRELFHKQNATAK 426 + +F QD + L + Sbjct: 392 VSGGTFSFIESESVIQDALAQCIGGLLSVAVQELR 426 >UniRef50_B5JSY7 Inter-alpha-trypsin inhibitor domain protein n=1 Tax=gamma proteobacterium HTCC5015 RepID=B5JSY7_9GAMM Length = 670 Score = 126 bits (317), Expect = 1e-27, Method: Composition-based stats. Identities = 27/228 (11%), Positives = 63/228 (27%), Gaps = 24/228 (10%) Query: 210 ERLRKEIAELRAKIERVPFIDTFDLRYKNYEKRPDPSSQAVMFCLMDVSGSMDQSTKDMA 269 + + + ++ + LR + E P + ++D SGSM A Sbjct: 282 DPVASASGAVFSEEYKGEHYALVMLRTPD-EMTSGPRMPREVVFVIDTSGSMAGQRMYHA 340 Query: 270 KRFYILLYLFLSRTYKNVEVVYI-----------RHHTQAKEVDEHEFFYSQETGGTIVS 318 K+ LS + V + + + Q GGT++ Sbjct: 341 KQALSQAVERLSPDDRFNVVEFNNQHSRLFSSMRSASAINVKQALNWVGRLQGGGGTMML 400 Query: 319 SALKLMDEVVKERYNPAQWNIYAAQASDGDNWADDSPLCHEILAKKLLPVVRYYSYIEIT 378 A++ V + + +D + + + ++ R ++ Sbjct: 401 PAVEDALSV----RSDPAYLRQVILITD---ASVGNEAEILRVVERQRKGARLFTVGIGV 453 Query: 379 RRAHQTLWREYEHLQSTFDNFAMQHIRDQDDIYPVFRELFHKQNATAK 426 + L R+ + + ++ + LF K Sbjct: 454 S-PNSYLLRKAAQVGQGDYVYIASG----QEVKARMQRLFAKLENPVL 496 >UniRef50_B7RYC9 Vault protein inter-alpha-trypsin n=1 Tax=marine gamma proteobacterium HTCC2148 RepID=B7RYC9_9GAMM Length = 686 Score = 126 bits (316), Expect = 1e-27, Method: Composition-based stats. Identities = 47/434 (10%), Positives = 107/434 (24%), Gaps = 39/434 (8%) Query: 6 DRRLNGKNKSMVNRQRFLRRYKAQIKQSISEAINKRSVTDVDSGESVSIPTEDISEPMFH 65 +RR+ GK + R+ R Y+ +KQ + ++ ++ + +I + Sbjct: 89 ERRIVGKIR---ERKEAKRVYQQALKQGKKAVLVEQQRPNLFTNRVANIAPGEKITVRLE 145 Query: 66 QGRGGLRHRVHPGNDHFVQNDRIERPQGGGGGSGSGQGQASQDGEGQDEFVFQISKDEYL 125 + P + + Sbjct: 146 YVQSVEHRSGRFSLRLPTTITPRYMPGVETETANQDMNENVAVTPSHGWAWPT------D 199 Query: 126 DLLFEDLALPNLKQNQQRQLTEYKTHRAGYTANGVPANISVVRSLQN-SLARRTAMTAGK 184 + L P Q + + S+ +L+RR + + K Sbjct: 200 QVTDAHLISPLQYFAQGSDSAPLNRIKISARLDMGMPLASIDSPYHEIALSRRAGVYSVK 259 Query: 185 RRELHALEENLAIISNSEPAQLLEEERLRKEIAELRAKIERVPFIDTFDLRYKNYEKRPD 244 + A + ++ S + L ++ + Sbjct: 260 LAQGSAEMDRDFVLQWSAASGSLPGAAF-----FTERVDDQYYGLLMLVPPASQRAAETV 314 Query: 245 PSSQAVMFCLMDVSGSMDQSTKDMAKRFYILLYLFLSRTYKNVEVVYIRH---------- 294 P + ++D SGSM + AK L + + + Sbjct: 315 PR---EIVFVVDTSGSMGGVSIKQAKGSLTRALRHLGPNDRFNVIEFNSSHRALFQHAVP 371 Query: 295 -HTQAKEVDEHEFFYSQETGGTIVSSALKLMDEV--VKERYNPAQWNIYAAQASDGDNWA 351 ++ + + +GGT + AL+L ++ ++ P +DG A Sbjct: 372 ASHHNLQLASEYVRHLEASGGTEMMPALQLALKLPGAQDELRPEPALRQVIFITDG---A 428 Query: 352 DDSPLCHEILAKKLLPVVRYYSYIEITRRAHQTLWREYEHLQSTFDNFAMQHIRDQDDIY 411 + L R ++ + R+ + D ++ Sbjct: 429 VGNESALFEHIVDSLGGSRLFTVGIG-SAPNAWFMRKAAEYGRGTFTYI----GDVAEVG 483 Query: 412 PVFRELFHKQNATA 425 LF Sbjct: 484 EKMDALFLNLTRPV 497 >UniRef50_A0MN74 Conserved bacterial protein n=1 Tax=Thermus phage phiYS40 RepID=A0MN74_9CAUD Length = 340 Score = 126 bits (316), Expect = 1e-27, Method: Composition-based stats. Identities = 79/364 (21%), Positives = 126/364 (34%), Gaps = 75/364 (20%) Query: 16 MVNRQRFLRRYKAQIKQSISEAINKRSVTDVDSGESVSIPTEDISEPMFHQGRGGLRHRV 75 ++ R+L+R + IK + + +N + + + V I + EP F G Sbjct: 5 TIDEIRYLKRLENIIKARMQDIVNSNDIIESTPEDKVRIRIPIMDEPYFKPVFPGSGAGA 64 Query: 76 HPGNDHFVQNDRIERPQGGGGGSGSGQGQASQDGEGQDEFVFQISKDEYLDLLFEDLALP 135 G+ E E +++ +E +LLFE L LP Sbjct: 65 GSGSGSEPGEGSEEG---------------------DHEIEIELTVEELSELLFEYLGLP 103 Query: 136 NLKQNQQRQLTEYKTHRAGYTANGVPANISVVRSLQNSLARRTAMTAGKRRELHALEENL 195 +K + + + G + G + I R+T K Sbjct: 104 KIKPKG-SSVEKEEYLIEGISKTGPRSRIH---------RRKTYYEIMK----------- 142 Query: 196 AIISNSEPAQLLEEERLRKEIAELRAKIERVPFIDTFDLRYKNYEKRPDPSSQAVMFCLM 255 +RYK+ KR P A+++ Sbjct: 143 ----YGYK---------------------------EDSIRYKHLRKREVPIFDAIVYFAR 171 Query: 256 DVSGSMDQSTKDMAKRFYILLYLFLSRTYKNVEVVYIRHHTQAKEVDEHEFFYSQETGGT 315 D S S+D K K + F+ YKNV + H T+AK V E +FF E G T Sbjct: 172 DYSASVDDKKKFKIKSTAFWINNFIKYNYKNVTTKFAVHDTKAKFVSEQDFFKLSEGGAT 231 Query: 316 IVSSALKLMDEVVKERYNPAQWNIYAAQASDGDNWADDSPLCHEILAKKLLPVVRYYSYI 375 + SS +L+ E + RY+ +N Y SDG+N DD+P L +KL +Y Sbjct: 232 LCSSVFELIYEDYR-RYSVDDYNFYLFYFSDGENLPDDNPK-LRELVEKLSEDFNLIAYG 289 Query: 376 EITR 379 E+ Sbjct: 290 EVKS 293 >UniRef50_B9XQ17 Vault protein inter-alpha-trypsin domain protein n=1 Tax=bacterium Ellin514 RepID=B9XQ17_9BACT Length = 806 Score = 126 bits (316), Expect = 2e-27, Method: Composition-based stats. Identities = 25/202 (12%), Positives = 53/202 (26%), Gaps = 22/202 (10%) Query: 239 YEKRPDPSSQAVMFCLMDVSGSMDQSTKDMAKRFYILLYLFLSRTYKNVEVVYIRH---- 294 + + + ++D SGSM + AK+ L+ + + + Sbjct: 302 VDAKAKQIVSKDVVFVLDTSGSMSGKKMEQAKKALQFCVESLNDGDRFEIIRFSTESEPL 361 Query: 295 -------HTQAKEVDEHEFFYSQETGGTIVSSALKLMDEVVKERYNPAQWNIYAAQASDG 347 + +E + GGT + ALK + + +DG Sbjct: 362 FDKLAAVSKENREKAGDFIKNLKAMGGTAIDEALKKALSL----ESKEGRPFVVVFLTDG 417 Query: 348 DNW-ADDSPLCHEI-LAKKLLPVVRYYSYIEITRRAHQTLWREYEHLQSTFDNFAMQHIR 405 + ++ R + + T + L F + + Sbjct: 418 LPTVGTTDEDQILKGMQERNKEKRRIFCFGIGT-DVNTHLLDRIAEETRAFSQYVLPE-- 474 Query: 406 DQDDIYPVFRELFHKQNATAKG 427 +D+ F K N Sbjct: 475 --EDLEVKVSSFFSKINEPVLA 494 >UniRef50_Q083T9 Vault protein inter-alpha-trypsin domain protein n=1 Tax=Shewanella frigidimarina NCIMB 400 RepID=Q083T9_SHEFN Length = 722 Score = 125 bits (314), Expect = 3e-27, Method: Composition-based stats. Identities = 31/244 (12%), Positives = 59/244 (24%), Gaps = 20/244 (8%) Query: 196 AIISNSEPAQLLEEERLRKEIAELRAKIERVPFIDTFDLRYKNYEKRPDPSSQAVMFCLM 255 A + L + R E A+ + L + E + ++ Sbjct: 291 ATFYQTGKTHLADNSDERSETAQRQPNPVDNNMYSLVMLMPPSVEVSEQHLIARELILVI 350 Query: 256 DVSGSMDQSTKDMAKRFYILLYLFLSRTYKNVEVVYIRHHTQAKEVDEHEFFY------- 308 D SGSM + AK+ L + + T Sbjct: 351 DTSGSMSGQSITQAKQALQFALAGLRDIDSFNIIEFNSDVTMLSATPLSANSRNIGKANR 410 Query: 309 ----SQETGGTIVSSALKLMD-----EVVKERYNPAQWNIYAAQASDGDNWADDSPLCHE 359 GGT + SAL+ + + ++ +DG A + Sbjct: 411 FIQSLDADGGTEMRSALQTALVDSVQQDSDQTDAHSEMLRQVIFMTDG---AVGNEHELY 467 Query: 360 ILAKKLLPVVRYYSYIEITRRAHQTLWREYEHLQSTFDNFAMQHIRDQDDIYPVFRELFH 419 L L R ++ + R + + Q I + ++ Sbjct: 468 QLINDQLGDSRLFTVGIG-SAPNSDFMRRAATMGRGTFTYIGNESEVQQKIEQLLNKIEQ 526 Query: 420 KQNA 423 Sbjct: 527 PVLT 530 >UniRef50_Q1YZ74 Inter-alpha-trypsin inhibitor domain protein n=1 Tax=Photobacterium profundum 3TCK RepID=Q1YZ74_PHOPR Length = 714 Score = 125 bits (313), Expect = 3e-27, Method: Composition-based stats. Identities = 28/190 (14%), Positives = 57/190 (30%), Gaps = 24/190 (12%) Query: 251 MFCLMDVSGSMDQSTKDMAKRFYILLYLFLSRTYKNVEVVY-------------IRHHTQ 297 + ++D+SGSM + + AK+ L V + + T Sbjct: 335 VTFVLDISGSMYGESIEQAKQALRYGLQQLQPEDSFNIVTFNHEAMLYSEQLLPVTSSTI 394 Query: 298 AKEVDEHEFFYSQETGGTIVSSALKLMDEV-VKERYNPAQWNIYAAQASDGDNWADDSPL 356 + GGT +++ALK + ++ N +W +DG + + Sbjct: 395 TR--ALRFVDGLDADGGTEMAAALKAAFSIKTHDQLNSTRWLNQIVFITDG---SVGNES 449 Query: 357 CHEILAKKLLPVVRYYSYIEITRRAHQTLWREYEHLQSTFDNFAMQHIRDQDDIYPVFRE 416 L ++ L R ++ + + D ++ R Sbjct: 450 ALFDLIEQQLVDRRLFTVGIG-SAPNSYFMTRAAMKGKGTYTYI----GDVKEVNTKMRL 504 Query: 417 LFHKQNATAK 426 LF K + Sbjct: 505 LFSKISQPVM 514 >UniRef50_Q15NW6 Vault protein inter-alpha-trypsin n=1 Tax=Pseudoalteromonas atlantica T6c RepID=Q15NW6_PSEA6 Length = 701 Score = 125 bits (313), Expect = 4e-27, Method: Composition-based stats. Identities = 25/201 (12%), Positives = 47/201 (23%), Gaps = 24/201 (11%) Query: 242 RPDPSSQAVMFCLMDVSGSMDQSTKDMAKRFYILLYLFLSRTYKNVEVVYIRHHTQAK-- 299 + L+D SGSM + AKR L + + Sbjct: 296 VAQQMPSREVVFLLDTSGSMAGESIVQAKRAVDFALTQLRPEDNVNIIQFNDAPQALWKR 355 Query: 300 ---------EVDEHEFFYSQETGGTIVSSALKLMDEVVKERYNPAQ-----WNIYAAQAS 345 + + GGT ++ AL L + + + Sbjct: 356 AMPATAKHIQRARNWVASLHADGGTEMAPALTLALNKPSLHRDDSDLLGSHKLRQVVFIT 415 Query: 346 DGDNWADDSPLCHEILAKKLLPVVRYYSYIEITRRAHQTLWREYEHLQSTFDNFAMQHIR 405 DG + + L + L R ++ + + + Sbjct: 416 DG---SVSNEDALMSLIESKLADNRLFTIGIG-SAPNSYFMTQAAQAGRGTFTYI----G 467 Query: 406 DQDDIYPVFRELFHKQNATAK 426 D + LF+K Sbjct: 468 DIQQVQHKMTALFNKLTRPVM 488 >UniRef50_A8FW78 Vault protein inter-alpha-trypsin domain protein n=1 Tax=Shewanella sediminis HAW-EB3 RepID=A8FW78_SHESH Length = 770 Score = 124 bits (311), Expect = 6e-27, Method: Composition-based stats. Identities = 59/447 (13%), Positives = 121/447 (27%), Gaps = 47/447 (10%) Query: 8 RLNGKNKSMVNRQRFLRRYKAQIKQSISEA-----INKRSVTDVDSGESVSIPTEDISEP 62 + GK S+V R +K + + I+ + + D D GE S+ + P Sbjct: 140 KSAGKRASLVEHNRPN-IFKTSVANLGPDELLVVEISYQELVDYDEGE-FSLRFPMVVNP 197 Query: 63 MFHQGRGGLRHRVHPGNDHFVQNDRIERPQGGGGG--SGSGQGQASQDGEGQDEFVFQIS 120 + +GG + N F D R G +AS GE Sbjct: 198 RY-YPQGGSKQPDDFQNQGFQYQDFQYRGLTGDESWLDKLSAMEASLRGESYGVRSHNAP 256 Query: 121 KDEYLDLLFEDLALPNLKQNQQRQLTEYKTHRAGYTANGVPANISVVRSLQNSLARRTAM 180 L + L + + + + ++ R L + Sbjct: 257 TVNIRVNLDAGVELSEI--TSAYHTIDKTPLDNTGYQIHLASQVAANRDFV--LRWKPVA 312 Query: 181 TAGKRRELHALEENLAIISNSEPAQLLEEERLRKEIAELRAKIERVPFIDTFDLRYKNYE 240 + + A + S+++ + + E+ + + L Sbjct: 313 GSEPTAAVFAQKGQTYSSSSTQKNSSEQHVKSDPELNAKPDADYALVMLLPPSLE----- 367 Query: 241 KRPDPSSQAVMFCLMDVSGSMDQSTKDMAKRFYILLYLFLSRTYKNVEVVY--------- 291 + + ++D SGSM S + AK+ L + + Sbjct: 368 -KSRNRVSRELILVIDTSGSMSGSAMEQAKKAMKYALAGLGSDDTFNVIEFNSKVSSLSK 426 Query: 292 --IRHHTQAKEVDEHEFFYSQETGGTIVSSALKLMDEV-----------VKERYNPA-QW 337 I T+ E+ GGT ++ AL+ ++ + + Sbjct: 427 GPIPASTKNIEMANRFVHSLTSDGGTEMALALEHALGQESGGSSWQETGLQGKDEESTSR 486 Query: 338 NIYAAQASDGDNWADDSPLCHEILAKKLLPVVRYYSYIEITRRAHQTLWREYEHLQSTFD 397 +DG A + L K + R ++ + + Sbjct: 487 LRQVLFMTDG---AVGNEAELFKLIKYRIGKSRLFTLGIG-SAPNSHFMQRAAEFGRGTF 542 Query: 398 NFAMQHIRDQDDIYPVFRELFHKQNAT 424 + Q+ I + ++ H Q Sbjct: 543 TYIGDLDEVQEKIQGLLYKIEHPQITD 569 >UniRef50_C5EGH1 von Willebrand factor n=2 Tax=Clostridiales RepID=C5EGH1_9FIRM Length = 681 Score = 124 bits (310), Expect = 9e-27, Method: Composition-based stats. Identities = 43/399 (10%), Positives = 101/399 (25%), Gaps = 35/399 (8%) Query: 38 INKRSVTDVDSGESVSIPTEDISEPMFHQGRGGLRHRVHPGNDHFVQNDRIERPQGGGGG 97 ++ + + V+ ++ E +RP Sbjct: 100 VSVHGMKMQVGDKVVTAKIKEKEEAKQEFDAAKS-------EGKSASLLEQQRPNVFTMN 152 Query: 98 SGSGQGQASQDGEGQDEFVFQISKDEYLDLLFEDLALPNLK-QNQQRQLTEYKTHRAGYT 156 + + + +F + P + R+ + + Y Sbjct: 153 VANI-MPGDTVNIELHYTEMIALSEGSYEFVFPAVVGPRYSSPSPDREEDGNQWVASPYQ 211 Query: 157 ANG--VPANISVVRSLQNSLARRTAMTAGKRRELHALEENLAIISNSEPAQLLEEERLRK 214 G + SL + +++ + + ++ A I+ +PA Sbjct: 212 EGGAVPKGTYDIAVSLSTGVPITGIVSSSHKINIEQSADSSAHITLKDPADYGGNRDFIL 271 Query: 215 EIAELRAKIERVPFIDTFDLRYKNYEKRPDPS-------SQAVMFCLMDVSGSMDQSTKD 267 + ++T + P ++DVSGSM D Sbjct: 272 RYQLAGQTVNSGLMLNTGEKENFFLLMVQPPERVPAEAIPPREYIFVLDVSGSMFGYPLD 331 Query: 268 MAKRFYILLYLFLSRTYKNVEVVYIRHH-----------TQAKEVDEHEFFYSQETGGTI 316 AK + L T +++ + E + + GGT Sbjct: 332 TAKELIRNMVSNLRETDTFNLILFSNDAIRMSARSLPATDENVERAINLINRQKGGGGTE 391 Query: 317 VSSALKLMDEVVKERYNPAQWNIYAAQASDGDNWADDSPLCHEILAKKLLPVVRYYSYIE 376 ++ AL+ + + + +DG + L ++S+ Sbjct: 392 LAPALEKAVGIPMD-SGAGSVSRSVVVITDG---YMSDEQAIFDIVAGNLDTTSFFSFGI 447 Query: 377 ITRRAHQTLWREYEHLQSTFDNFAMQHIRDQDDIYPVFR 415 T ++ L +F + + D +F Sbjct: 448 GTS-VNRYLIEGIARTGGGE-SFVVTDSSESADTARLFD 484 >UniRef50_A6WMD3 Vault protein inter-alpha-trypsin domain protein n=11 Tax=Shewanella RepID=A6WMD3_SHEB8 Length = 772 Score = 122 bits (306), Expect = 2e-26, Method: Composition-based stats. Identities = 25/201 (12%), Positives = 51/201 (25%), Gaps = 17/201 (8%) Query: 237 KNYEKRPDPSSQAVMFCLMDVSGSMDQSTKDMAKRFYILLYLFLSRTYKNVEVVYIRH-- 294 EK PS + ++D SGSM + AK + L + + Sbjct: 380 PKVEKSTQPSLPRELILVIDTSGSMAGDSIVQAKNALLYALKGLKPEDSFNIIEFNSSLS 439 Query: 295 ---------HTQAKEVDEHEFFYSQETGGTIVSSALKLMDEVVKERYNPAQWN--IYAAQ 343 + Q GGT ++ AL +P Sbjct: 440 LLSATPLPATSSNLSRARQFVSRLQADGGTEMALALDAALPKSLGSVSPDAVQPLRQVIF 499 Query: 344 ASDGDNWADDSPLCHEILAKKLLPVVRYYSYIEITRRAHQTLWREYEHLQSTFDNFAMQH 403 +DG + + L + + R ++ + + L + + Sbjct: 500 MTDG---SVGNEQALFDLIRYQIGESRLFTVGIG-SAPNSHFMQRAAELGRGTFTYIGKV 555 Query: 404 IRDQDDIYPVFRELFHKQNAT 424 I + ++ + Sbjct: 556 DEVDAKISALLSKIQYPVLTD 576 >UniRef50_Q14624 35 kDa inter-alpha-trypsin inhibitor heavy chain H4 n=38 Tax=Eutheria RepID=ITIH4_HUMAN Length = 930 Score = 122 bits (306), Expect = 3e-26, Method: Composition-based stats. Identities = 39/335 (11%), Positives = 91/335 (27%), Gaps = 25/335 (7%) Query: 110 EGQDEFVFQISKDEYLDLLFEDLALPNLKQNQQRQLTEYKTHRAGYTANGVPANISVVRS 169 + F++ +E L L LK Q+ + + + G+ + Sbjct: 135 APNAKITFELVYEELLKRRLGVYEL-LLKVRPQQLVKHLQMDIHIFEPQGISFLETESTF 193 Query: 170 LQNSLARRTAM-TAGKRRELHALEENLAIISNSEPAQLLEEERLRKEIAELRAKIERVPF 228 + N L + + + E + + + L RA Sbjct: 194 MTNQLVDALTTWQNKTKAHIRFKPTLSQQQKSPEQQETVLDGNLIIRYDVDRAISGGSIQ 253 Query: 229 IDTFDLRYKNYEKRPDPSSQAVMFCLMDVSGSMDQSTKDMAKRFYILLYLFLSRTYKNVE 288 I+ + + + ++D SGSM + I + LS + Sbjct: 254 IENGYF-VHYFAPEGLTTMPKNVVFVIDKSGSMSGRKIQQTREALIKILDDLSPRDQFNL 312 Query: 289 VVYIRHHTQAKE-----------VDEHEFFYSQETGGTIVSSALKLMDEVV----KERYN 333 +V+ TQ + Q GGT ++ A+ + +++ +E Sbjct: 313 IVFSTEATQWRPSLVPASAENVNKARSFAAGIQALGGTNINDAMLMAVQLLDSSNQEERL 372 Query: 334 PAQWNIYAAQASDGDNW-ADDSPLCHEILAKKLLPV-VRYYSYIEITRRAHQTLWREYEH 391 P +DGD + +P + ++ + + + Sbjct: 373 PEGSVSLIILLTDGDPTVGETNPRSIQNNVREAVSGRYSLFCLGFGF-DVSYAFLEKLAL 431 Query: 392 LQSTFDNFAMQHIRDQDDIYPVFRELFHKQNATAK 426 + I + D ++ + + Sbjct: 432 DNGG----LARRIHEDSDSALQLQDFYQEVANPLL 462 >UniRef50_Q3IHK0 Putative uncharacterized protein n=2 Tax=Alteromonadales RepID=Q3IHK0_PSEHT Length = 664 Score = 122 bits (305), Expect = 3e-26, Method: Composition-based stats. Identities = 29/243 (11%), Positives = 60/243 (24%), Gaps = 23/243 (9%) Query: 194 NLAIISNSEPAQLLEEERLRKEIAELRAKIERVPFIDTFDLRYKNYEKRPDPSSQAVMFC 253 ++ + ++ + + A + E L Sbjct: 267 EQNALNRDFVLEFKPLQKEQAQAAFFTEQFENGERYGLAMLMPPADNFIATQRLARETVF 326 Query: 254 LMDVSGSMDQSTKDMAKRFYILLYLFLSRTYKNVEVVYIR-----------HHTQAKEVD 302 ++D SGSM + + AK L + + Sbjct: 327 VVDTSGSMHGQSMEQAKNALFYALSLLDSNDSFNIIGFDNVVTLMSDKPLVASGFNLRRA 386 Query: 303 EHEFFYSQETGGTIVSSALKLMDEVVKERYNPAQWNIYAAQASDGDNWADDSPLCHEILA 362 E + Q GGT + AL + + + +DG + + Sbjct: 387 ERFIYGLQADGGTEIQGALDAVLD----GSQFDGFVRQVIFLTDG---SVSNEDALFKSI 439 Query: 363 KKLLPVVRYYSYIEITRRAHQTLWREYEHLQSTFDNFAMQHIRDQDDIYPVFRELFHKQN 422 + L R ++ + R + F ++ P ++LF K Sbjct: 440 QAKLGDSRLFTVGIG-SAPNSFFMRRAADVGKGSFTFI----GSTSEVQPKMQQLFDKLA 494 Query: 423 ATA 425 A Sbjct: 495 HPA 497 >UniRef50_A6DLI7 von Willebrand factor type A domain protein n=1 Tax=Lentisphaera araneosa HTCC2155 RepID=A6DLI7_9BACT Length = 1078 Score = 122 bits (305), Expect = 3e-26, Method: Composition-based stats. Identities = 29/348 (8%), Positives = 81/348 (23%), Gaps = 23/348 (6%) Query: 92 QGGGGGSGSGQGQASQDGEGQDEFVFQISKDEYLDLLFEDLALPNLKQNQQRQLTEYKTH 151 + + + + + Q +++ L + LP++ Sbjct: 586 TPEQNSKIASHLEQVNERAQEKKKESQKARERLKQLKSQQSNLPSVALPSPLFFEGMIDA 645 Query: 152 RAGYTANGVPANISVVRSLQNSLARRTAMTAGKRRELHALEENLAIISNSEPAQLLEEER 211 + N I V + + +EE + E Sbjct: 646 KET---NLSTFAIDVDTASYTAARSEIRAGRKVEASHVRIEEFINNFDYHYSVPKKE--A 700 Query: 212 LRKEIAELRAKIERVPFIDTFDLRYKNYEKRPDPSSQAVMFCLMDVSGSMDQSTK-DMAK 270 + + K+ + ++ + ++D SGSM + + + Sbjct: 701 FKIDSELSDHKVYAGVKLLRVGVQGQRLGADSQKPGS--YTFVIDNSGSMAAENRLPLIQ 758 Query: 271 RFYILLYLFLSRTYKNVEVVYIRHHTQ--------AKEVDEHEFFYSQETGGTIVSSALK 322 + ++ +++ + + T E + +S ++ Sbjct: 759 KTLPNMFKAMNQDDEVTILSCEGGVTNLANRITASNHSQLETAVKNIEAGTVANLSVGIE 818 Query: 323 LMDEVVKERYNPAQWNIYAAQASDG-DNWADDSPLCHEILA---KKLLPVVRYYSYIEIT 378 ++ + + N SDG + + +K + Sbjct: 819 EAYKLAAQNFRSGAVNRVI-LLSDGIASLGEKEAQEVLKTVSQYRKQGIGNTVIGVG--S 875 Query: 379 RRAHQTLWREYEHLQSTFDNFAMQHIRDQDDIYPVFRELFHKQNATAK 426 + + F + D + F F K Sbjct: 876 EDYDDSFLETLANKGDGVYYFGDSKEQMNDILVNNFEASFKTIARDVK 923 >UniRef50_Q9U7P4 Putative uncharacterized protein (Fragment) n=1 Tax=Eufolliculina uhligi RepID=Q9U7P4_9CILI Length = 494 Score = 121 bits (304), Expect = 3e-26, Method: Composition-based stats. Identities = 27/196 (13%), Positives = 61/196 (31%), Gaps = 19/196 (9%) Query: 241 KRPDPSSQAVMFCLMDVSGSMDQSTKDMAKRFYILLYLFLSRTYKNVEVVYIRHHTQA-- 298 S + C++DVSGSM + + + LS + + + T+ Sbjct: 81 TSEASRSGVDIVCVIDVSGSMQGEKIQLVQTTLNFMVERLSPADRICLISFSNDATKISR 140 Query: 299 --------KEVDEHEFFYSQETGGTIVSSALKLMDEVVKERYNPAQWNIYAAQASDGDNW 350 K+ + +GGT + L+ + +++R Q + SDG + Sbjct: 141 LVQMSPKGKKQLKSMIPRLVASGGTNIVGGLEYGLQALRQRRTINQLSS-IILLSDGQDN 199 Query: 351 ADDSPLCHEILAKK---LLPVVRYYSYIEITRRAHQTLWREYEHLQSTFDNFAMQHIRDQ 407 + L + +++ TL ++ + +D+ Sbjct: 200 NGTTVLQRAKATMDSIVIRDDYSVHTFGYGHG-HDSTLLNALAEPKNGAFYYV----KDE 254 Query: 408 DDIYPVFRELFHKQNA 423 + I F + + Sbjct: 255 ETIATAFANCLGELMS 270 >UniRef50_B8CPU4 Von Willebrand factor, type A n=1 Tax=Shewanella piezotolerans WP3 RepID=B8CPU4_SHEPW Length = 710 Score = 121 bits (303), Expect = 5e-26, Method: Composition-based stats. Identities = 24/191 (12%), Positives = 53/191 (27%), Gaps = 18/191 (9%) Query: 247 SQAVMFCLMDVSGSMDQSTKDMAKRFYILLYLFLSRTYKNVEVVYIRH-----------H 295 + + ++D SGSM + AK I LS + + + Sbjct: 347 APRELILVIDTSGSMSGEAIEQAKASIIYALAGLSAQDSFNILQFNSNVYALSDTPLNAS 406 Query: 296 TQAKEVDEHEFFYSQETGGTIVSSALKLMDEVVKERYNPAQWNIYAAQASDGDNWADDSP 355 + + Q GGT +S AL + ++ + +DG A + Sbjct: 407 AKNIGRAQAYVQRLQANGGTEMSLALDKA---LSQQDANRERLRQVLFITDG---AVGNE 460 Query: 356 LCHEILAKKLLPVVRYYSYIEITRRAHQTLWREYEHLQSTFDNFAMQHIRDQDDIYPVFR 415 + L R ++ + + L + + + + + Sbjct: 461 PQLFTQIRNQLQQSRLFTIGIG-DAPNAHFMQRAAELGRGTYTYIGKQSEVKSKMVAMLD 519 Query: 416 ELFHKQNATAK 426 +L + Sbjct: 520 KLEKPTVTDVE 530 >UniRef50_Q9FF49 Retroelement pol polyprotein-like n=34 Tax=Magnoliophyta RepID=Q9FF49_ARATH Length = 704 Score = 121 bits (303), Expect = 5e-26, Method: Composition-based stats. Identities = 27/199 (13%), Positives = 49/199 (24%), Gaps = 24/199 (12%) Query: 246 SSQAVMFCLMDVSGSMDQSTKDMAKRFYILLYLFLSRTYKNVEVVYIRHHTQA------- 298 + + ++DVSGSM + + KR + L + + + + Sbjct: 248 RAPVDLVTVLDVSGSMAGTKLALLKRAMGFVIQNLGPFDRLSVISFSSTARRNFPLRLMT 307 Query: 299 ---KEVDEHEFFYSQETGGTIVSSALKLMDEVVKERYNPAQWNIYAAQASDGDNW----- 350 K+ GGT ++ LK V+ +R + SDG + Sbjct: 308 ETGKQEALQAVNSLVSNGGTNIAEGLKKGARVLIDRRFKNPVSS-IVLLSDGQDTYTMTS 366 Query: 351 -----ADDSPLCHEILAKKLLPVVRYYSYIEITRRAHQTLWREYEHLQSTFDNFAMQHIR 405 D V + + +L +F Sbjct: 367 PNGSRGTDYKALLPKEINGNRIPVHAFGFG---ADHDASLMHSIAENSGGTFSFIESETV 423 Query: 406 DQDDIYPVFRELFHKQNAT 424 QD L Sbjct: 424 IQDAFAQCIGGLLSVVVQE 442 >UniRef50_D1HHA4 Whole genome shotgun sequence of line PN40024, scaffold_125.assembly12x (Fragment) n=1 Tax=Vitis vinifera RepID=D1HHA4_VITVI Length = 630 Score = 121 bits (302), Expect = 6e-26, Method: Composition-based stats. Identities = 30/194 (15%), Positives = 53/194 (27%), Gaps = 24/194 (12%) Query: 246 SSQAVMFCLMDVSGSMDQSTKDMAKRFYILLYLFLSRTYKNVEVVYIRH----------H 295 + + ++DVSGSM S + KR L L + + V + Sbjct: 202 RAPIDLVAVLDVSGSMAGSKLSLLKRAVCFLIQNLGPSDRLSIVSFSSTARRIFPLRRMS 261 Query: 296 TQAKEVDEHEFFYSQETGGTIVSSALKLMDEVVKERYNPAQWNIYAAQASDGDNWADDSP 355 +E +GGT + LK V++ER SDG + + Sbjct: 262 DNGREAAGLAINSLTSSGGTNIVEGLKKGVRVLEERSEQNPVAS-IILLSDGKDTYNCDN 320 Query: 356 LCHEILA----------KKLLPVVRYYSYIEITRRAHQTLWREYEHLQSTFDNFAMQHIR 405 + + ++ + V + + T +F Sbjct: 321 VNRRQTSHCASSNPRQGRQAIIPVHTFGFG---SDHDSTAMHAISDESGGTFSFIESVAT 377 Query: 406 DQDDIYPVFRELFH 419 QD L Sbjct: 378 VQDAFAMCIGGLLS 391 >UniRef50_C4WI90 Poly [ADP-ribose] polymerase 4 n=1 Tax=Ochrobactrum intermedium LMG 3301 RepID=C4WI90_9RHIZ Length = 777 Score = 121 bits (302), Expect = 7e-26, Method: Composition-based stats. Identities = 27/200 (13%), Positives = 52/200 (26%), Gaps = 19/200 (9%) Query: 239 YEKRPDPSSQAVMFCLMDVSGSMDQSTKDMAKRFYILLYLFLSRTYKNVEVVYIRHHTQA 298 +Q + ++D SGSM ++ + AK L + + + T+ Sbjct: 369 PAVASAKKAQREVVFVIDNSGSMGGTSIEQAKASLDYALSHLQPGDRFNVIRFDDTLTRF 428 Query: 299 KEV-----------DEHEFFYSQETGGTIVSSALKLMDEVVKERYNPAQWNIYAAQASDG 347 EV H + GGT + AL + + +DG Sbjct: 429 FEVSVEASQQNIASARHFVMSLEAQGGTAMLPALHAALD----DSHQGNGLRQIVFLTDG 484 Query: 348 DNWADDSPLCHEILAKKLLPVVRYYSYIEITRRAHQTLWREYEHLQSTFDNFAMQHIRDQ 407 + + R + T + L L Sbjct: 485 E---ISNEQQLLDAIAARRGRSRIFMVGIGT-APNSYLMNHAAELGRGTFTHIGSAAEVD 540 Query: 408 DDIYPVFRELFHKQNATAKG 427 + + +F +L + K Sbjct: 541 ERMRALFDKLENPAVTDLKA 560 >UniRef50_Q235T9 von Willebrand factor type A domain containing protein n=5 Tax=Tetrahymena thermophila RepID=Q235T9_TETTH Length = 703 Score = 121 bits (302), Expect = 7e-26, Method: Composition-based stats. Identities = 24/190 (12%), Positives = 61/190 (32%), Gaps = 20/190 (10%) Query: 246 SSQAVMFCLMDVSGSMDQ-STKDMAKRFYILLYLFLSRTYKNVEVVYIRHH--------- 295 + C++D SGSM+ S + K + L L+ + + + Sbjct: 208 RPNLDLICVIDNSGSMNDFSKIENVKNTILQLLEMLNENDRLSLITFNTKAKQLCGLKNV 267 Query: 296 -TQAKEVDEHEFFYSQETGGTIVSSALKLMDEVVKERYNPAQWNIYAAQASDGDNWADDS 354 Q K+ + + GGT + +++ ++++ R + SDG + D+ Sbjct: 268 NNQNKKSLQTITKSIKADGGTDIIRGIEIAFQILQSRKQKNSVSS-IFLLSDGQDNLADA 326 Query: 355 -----PLCHEILAKKLLPVVRYYSYIEITRRAHQTLWREYEHLQSTFDNFAMQHIRDQDD 409 ++ + + + L ++ ++ F ++ + + Sbjct: 327 GIKNLLKTTYKQLQEESFTIHSFGFGN---DHDGPLMQKIAQIKDGSFYFVEKNDQVDEF 383 Query: 410 IYPVFRELFH 419 LF Sbjct: 384 FIDALGGLFS 393 >UniRef50_Q9LN03 T6D22.13 n=2 Tax=Arabidopsis thaliana RepID=Q9LN03_ARATH Length = 641 Score = 120 bits (301), Expect = 8e-26, Method: Composition-based stats. Identities = 39/268 (14%), Positives = 74/268 (27%), Gaps = 22/268 (8%) Query: 170 LQNSLARRTAMTAGKRRELHALEENLAIISNSEPAQLLEEERLRKEIAELRAKIERVPFI 229 +S+ A + E + E +L E++ L + R F Sbjct: 123 DVDSVPTFVAQRGFEDDEPLPQGDTQIHSDGHRSDHQALEIKLFPEVSALAKPVSRADFA 182 Query: 230 DTFDLRYKNYEKRPDP-SSQAVMFCLMDVSGSMDQSTKDMAKRFYILLYLFLSRTYKNVE 288 L+ + + + ++DVSGSMD ++ K + L T + Sbjct: 183 VLVHLKAEGVSDDARRARAPLDLITVLDVSGSMDGVKMELMKNAMSFVIQNLGETDRLSV 242 Query: 289 VVYIRHHTQA----------KEVDEHEFFYSQETGGTIVSSALKLMDEVVKERYNPAQWN 338 + + + K+ GGT ++ LK+ V++ R + Sbjct: 243 ISFSSMARRLFPLRLMSETGKQAAMQAVNSLVADGGTNIAEGLKIGARVIEGRRWKNPVS 302 Query: 339 IYAAQASDGDNWADDSPLCHE-------ILAKKLLPVVRYYSYIEITRRAHQTLWREYEH 391 SDG + S +L + + + L Sbjct: 303 G-MMLLSDGQDNFTFSHAGVRLRTDYESLLPSSCRIPIHTFGFG---SDHDAELMHTISE 358 Query: 392 LQSTFDNFAMQHIRDQDDIYPVFRELFH 419 + S +F QD L Sbjct: 359 VSSGTFSFIETETVIQDAFAQCIGGLLS 386 >UniRef50_Q6UXX5 Inter-alpha-trypsin inhibitor heavy chain H5-like protein n=3 Tax=Eutheria RepID=ITH5L_HUMAN Length = 1313 Score = 120 bits (301), Expect = 9e-26, Method: Composition-based stats. Identities = 50/375 (13%), Positives = 98/375 (26%), Gaps = 33/375 (8%) Query: 79 NDHFVQNDRIERPQGGGGGSGSGQGQASQDGEGQDEFVFQISKDEYLDLLFEDLALPNLK 138 + Q G S + + S E F ++ +E L L Sbjct: 108 EEAHQQGKTAA--HVGIRDRESEKFRISTSLAAGTEVTFSLAYEELLQRHQGQYQLVVSL 165 Query: 139 QNQQRQLTEYKTHRAGYTANGVPANISVVRSLQNSLARRTAMTAGKRRELHALEENLAII 198 + Q +I +R+ + + E I Sbjct: 166 RPGQLVKRLSIEVTVSERTGISYVHIPPLRTGRLRTNAHASEVDSPPSTRIERGETCVRI 225 Query: 199 SNSEPAQLLEEERLRKEIAELRAKIERVPFIDTFDLRYKN------YEKRPDPSSQAVMF 252 + Q +A+ + + V D++ + + R P + + Sbjct: 226 TYCPTLQDQSSISGSGIMADFLVQYDVVMEDIIGDVQIYDDYFIHYFAPRGLPPMEKNVV 285 Query: 253 CLMDVSGSMDQSTKDMAKRFYILLYLFLSRTYKNVEVVY------------IRHHTQAKE 300 ++DVS SM + + K ++ L + + I+ Q Sbjct: 286 FVIDVSSSMFGTKMEQTKTAMNVILSDLQANDYFNIISFSDTVNVWKAGGSIQATIQNVH 345 Query: 301 VDEHEFFYSQETGGTIVSSALKLMDEVVKE------RYNPAQWNIYAAQASDGDNW-ADD 353 + + G T V+SAL V+ R +DG+ Sbjct: 346 SAKDYLHCMEADGWTDVNSALLAAASVLNHSNQEPGRGPSVGRIPLIIFLTDGEPTAGVT 405 Query: 354 SPLCHEILAKK-LLPVVRYYSYIEITRRAHQTLWREYEHLQSTFDNFAMQHIRDQDDIYP 412 +P ++ L V +S A TL R + I + D Sbjct: 406 TPSVILSNVRQALGHRVSLFSLAFG-DDADFTLLRRLSLENRG----IARRIYEDTDAAL 460 Query: 413 VFRELFHKQNATAKG 427 + L+ + + Sbjct: 461 QLKGLYEEISMPLLA 475 >UniRef50_P79263 Inter-alpha-trypsin inhibitor heavy chain H4 n=3 Tax=Theria RepID=ITIH4_PIG Length = 921 Score = 120 bits (300), Expect = 1e-25, Method: Composition-based stats. Identities = 39/376 (10%), Positives = 100/376 (26%), Gaps = 25/376 (6%) Query: 68 RGGLRHRVHPGNDHFVQNDRIERPQGGGGGSGSGQGQASQDGEGQDEFVFQISKDEYLDL 127 G ++ + + G + Q Q + + F++ +E L Sbjct: 91 PGNIKEKAAAQEQYSAVARGESAGLVRATGRKTEQFQVAVSVAPAAKVTFELVYEELLAR 150 Query: 128 -LFEDLALPNLKQNQQRQLTEYKTHRAGYTANGVPANISVVRSLQNSLARRTA-MTAGKR 185 L L ++ Q + + H + G+ + + N LA + Sbjct: 151 HLGVYELLLKIQPQQLVKHLQMDIHI--FEPQGISFLETESTFMTNELAEALTISQNKTK 208 Query: 186 RELHALEENLAIISNSEPAQLLEEERLRKEIAELRAKIERVPFIDTFDLRYKNYEKRPDP 245 + + E + + + R I+ + Sbjct: 209 AHIRFKPTLSQQQKSPEQQETVLDGNFIVRYDVNRTVTGGSIQIENGYF-VHYFAPEVWS 267 Query: 246 SSQAVMFCLMDVSGSMDQSTKDMAKRFYILLYLFLSRTYKNVEVVYIRH---------HT 296 + + ++D SGSM + I + L + V + Sbjct: 268 AIPKNVIFVIDTSGSMRGRKIQQTREALIKILGDLGSRDQFNLVSFSGEAPRRRAVAASA 327 Query: 297 QAKEVDEHEFFYSQETGGTIVSSALKLMDEVVKERYN----PAQWNIYAAQASDGDNW-A 351 + E + GGT ++ A+ + ++++ PA+ + +DGD Sbjct: 328 ENVEEAKSYAAEIHAQGGTNINDAMLMAVQLLERANREELLPARSVTFIILLTDGDPTVG 387 Query: 352 DDSPLCHEILAKKLLPV-VRYYSYIEITRRAHQTLWREYEHLQSTFDNFAMQHIRDQDDI 410 + +P + ++ + + + + + I + D Sbjct: 388 ETNPSKIQKNVREAIDGQHSLFCLGFGFDVPY-AFLEKMALENGG----LARRIYEDSDS 442 Query: 411 YPVFRELFHKQNATAK 426 + + + Sbjct: 443 ALQLEDFYQEVANPLL 458 >UniRef50_A8U1E2 Putative uncharacterized protein n=1 Tax=alpha proteobacterium BAL199 RepID=A8U1E2_9PROT Length = 683 Score = 120 bits (300), Expect = 1e-25, Method: Composition-based stats. Identities = 31/188 (16%), Positives = 52/188 (27%), Gaps = 23/188 (12%) Query: 250 VMFCLMDVSGSMDQSTKDMAKRFYILLYLFLSRTYKNVEVVY-----------IRHHTQA 298 + ++DVSGSM AK L R V + +R + Sbjct: 329 EVIFVIDVSGSMKGEPLRAAKASLTSGIEGLGRNDTFNVVAFNNKAAAFYDAPVRASGKF 388 Query: 299 KEVDEHEFFYSQETGGTIVSSALKLMDEVVKERYNPAQWNIYAAQASDGDNWADDSPLCH 358 + GGT +++A +L ++ +DG A + Sbjct: 389 HRAALKVIDGLKAGGGTEMAAAFELALQM----PGDPDRLQQVVFITDG---AVSNEAAL 441 Query: 359 EILAKKLLPVVRYYSYIEITRRAHQTLWREYEHLQSTFDNFAMQHIRDQDDIYPVFRELF 418 K L R ++ + E + D V R+LF Sbjct: 442 FNQIKGELGARRLFTVGIG-SAPNTFFMEEAARFGRGTYTYI----GDTSSAERVMRDLF 496 Query: 419 HKQNATAK 426 K + A Sbjct: 497 TKISFPAL 504 >UniRef50_C3YPR2 Putative uncharacterized protein (Fragment) n=1 Tax=Branchiostoma floridae RepID=C3YPR2_BRAFL Length = 863 Score = 119 bits (299), Expect = 1e-25, Method: Composition-based stats. Identities = 35/338 (10%), Positives = 80/338 (23%), Gaps = 34/338 (10%) Query: 110 EGQDEFVFQISKDEYLDLLFEDLALPNLKQNQQRQLTEYKTHRAGYTAN--GVPANISVV 167 Q + F ++ +E L L + QQ R T + + Sbjct: 102 AAQSKVTFNLTYEELLQRRLGSYELVLSIRPQQVVRHLKIDVRIIETRDIVMLDNTYGSG 161 Query: 168 RSLQNSLARRTAMTAGKRRELHALEENLAIISNSEPAQLLEEERLRKEIAELRAKIERVP 227 +AR + A + +E+ + R Sbjct: 162 ELEGVEIARPSPNRAHIQYRPTDMEQMR-------MSPSGISGDFLVRYDVKRDLSVGDI 214 Query: 228 FIDTFDLRYKNYEKRPDPSSQAVMFCLMDVSGSMDQSTKDMAKRFYILLYLFLSRTYKNV 287 I + P + ++D SGSM + K+ + L + Sbjct: 215 QIVNGYF-VHYFAPSGLPVVPKNIVFIIDKSGSMGGTKMRQTKQAMNTILKDLRDHDRFN 273 Query: 288 EVVYIRHHTQAKE-------------VDEHEFFYSQETGGTIVSSALKLMDEVVKERYNP 334 + + T + + GGT ++ A+ ++++ + Sbjct: 274 VMPFSYSSTMWRPNEMVLATRENIESARTYVRRSINAGGGTNINQAIIDAADLLRRVTDD 333 Query: 335 AQWNI----YAAQASDGDN-WADDSPLCHEILAKK-LLPVVRYYSYIEITRRAHQTLWRE 388 + +DG + P + K + V + + Sbjct: 334 QPNSPRSASLIIFLTDGLPSVGESKPRNIMVNVKNAIREQVSLFCLGFGK-DVDFPFLEK 392 Query: 389 YEHLQSTFDNFAMQHIRDQDDIYPVFRELFHKQNATAK 426 + I + D + + + Sbjct: 393 MALENRG----LARRIYEDSDAALQLKGFYDEVATPLL 426 >UniRef50_Q86UX2 Inter-alpha-trypsin inhibitor heavy chain H5 n=40 Tax=Euteleostomi RepID=ITIH5_HUMAN Length = 942 Score = 119 bits (299), Expect = 1e-25, Method: Composition-based stats. Identities = 36/378 (9%), Positives = 89/378 (23%), Gaps = 31/378 (8%) Query: 75 VHPGNDHFVQNDRIERPQGGGGGSGSGQGQASQDGEGQDEFVFQISKDEYLDL-LFEDLA 133 ++ + G G+ +AS +D+ F +S +E L L + Sbjct: 113 EREKKSGDRVKEKRNKTTEENGEKGTEIFRASAVIPSKDKAAFFLSYEELLQRRLGKYEH 172 Query: 134 LPNLKQNQQRQLTEYKTHRAGYTANGVPANISVVRSLQNSLARRTAMTAGKRRELHALEE 193 +++ Q + + + S Q R + + E Sbjct: 173 SISVRPQQLSGRLSVDVNILESAGIASLEVLPLHNSRQRGSGRGEDDSGPPPSTVINQNE 232 Query: 194 NLAIISNSEPAQLLEEER-------LRKEIAELRAKIERVPFIDTFDLRYKNYEKRPDPS 246 A I R + + + + P Sbjct: 233 TFANIIFKPTVVQQARIAQNGILGDFIIRYDVNREQSIGDIQVLNGYF-VHYFAPKDLPP 291 Query: 247 SQAVMFCLMDVSGSMDQSTKDMAKRFYILLYLFLSRTYKNVEVVYIRH-----HTQAKEV 301 + ++D S SM + K + L + + + Sbjct: 292 LPKNVVFVLDSSASMVGTKLRQTKDALFTILHDLRPQDRFSIIGFSNRIKVWKDHLISVT 351 Query: 302 DEHEFF------YSQETGGTIVSSALKLMDEVVKE----RYNPAQWNIYAAQASDGDNWA 351 + + TGGT ++ AL+ ++ + + +DG Sbjct: 352 PDSIRDGKVYIHHMSPTGGTDINGALQRAIRLLNKYVAHSGIGDRSVSLIVFLTDGKPTV 411 Query: 352 DDSP--LCHEILAKKLLPVVRYYSYIEITRRAHQTLWREYEHLQSTFDNFAMQHIRDQDD 409 ++ + V ++ L + + + +++D Sbjct: 412 GETHTLKILNNTREAARGQVCIFTIGIGN-DVDFRLLEKLSLENCG----LTRRVHEEED 466 Query: 410 IYPVFRELFHKQNATAKG 427 + + Sbjct: 467 AGSQLIGFYDEIRTPLLS 484 >UniRef50_B9MS10 von Willebrand factor type A n=2 Tax=Anaerocellum thermophilum DSM 6725 RepID=B9MS10_ANATD Length = 1188 Score = 119 bits (299), Expect = 1e-25, Method: Composition-based stats. Identities = 22/200 (11%), Positives = 61/200 (30%), Gaps = 18/200 (9%) Query: 225 RVPFIDTFDLRYKNYEKRPDPSSQAVMFCLMDVSGSMDQSTKDMAKR-FYILLYLFLSRT 283 K + + + ++D SGSM + + ++ L + Sbjct: 474 PTWKAIWEVPINKGEREINQQVNYIDLVFVLDSSGSMSWNDPNGYRKIAAKSFVDALIQG 533 Query: 284 YKNVEVVYIRHH------TQAKEVDEHEFFYSQETGGTIVSSALKLMDEVVKERYNPAQW 337 + V + T + ++ GGT ++ +++ ++ + R + + Sbjct: 534 DRAAVVDFDNFGYLLQPLTTDFQAVKNAIDRIDSWGGTNIAEGIRIANQQLISRSSEDRI 593 Query: 338 NIYAAQASDGDNWADDSPLCHEILAKKLLPVVRYYSYIEITRRAHQTLWREYEHLQSTFD 397 +DG+ + D++ + + Y+ T + L R+ Sbjct: 594 -KVIILLTDGEGYYDNNLTT-----EAKNNGITIYTIGLGTS-VDENLLRDIATQTGGMY 646 Query: 398 NFAMQHIRDQDDIYPVFREL 417 + + VF+ + Sbjct: 647 F----PVSSASQLPQVFKRI 662 >UniRef50_A7HVH6 Vault protein inter-alpha-trypsin domain protein n=1 Tax=Parvibaculum lavamentivorans DS-1 RepID=A7HVH6_PARL1 Length = 755 Score = 119 bits (299), Expect = 1e-25, Method: Composition-based stats. Identities = 36/427 (8%), Positives = 85/427 (19%), Gaps = 51/427 (11%) Query: 34 ISEAINKRSVTDVDSGESVSIPTEDISEPMFHQGRGGLRHRVHPGNDHFVQNDRIERPQG 93 + E ++ V + ++ E + + RP Sbjct: 114 LPENSAVDTLKMVIGDRVIEGKIKEKEEARRVYEEAKAQGHKASLVEQQ-------RPNV 166 Query: 94 GGGGSGSGQGQASQDGEGQDEFVFQISKDEYLDLLFEDLALPNLKQNQQRQLTEY---KT 150 + + + L F + P Sbjct: 167 FTNSVANIG-PGETIIVQIEYQQTVRRDGDRFSLRFPMVVAPRYTPKTADPQLVDFAPGG 225 Query: 151 HRAGYTANGVPANISVVRSLQNSLAR--------RTAMTAGKRRELHALEENLAIISNSE 202 + ++ L + + + + Sbjct: 226 GWGEVRQSEPENDLEQPPVLHPAQGQINPVSLALSLDAGFALGDISSTHHKIALNRDGKQ 285 Query: 203 PAQLLEEERLRKEIAELRAKIERVPFIDTFDLRYKNYE---------------KRPDPSS 247 A L E L + + ++ + Sbjct: 286 KATLKLAEELTPANKDFELVWKPAAAKAPAAALFRERVGNEDYLLVMLTPPSGSVQPEAK 345 Query: 248 QAVMFCLMDVSGSMDQSTKDMAKRFYILLYLFLSRTYKNVEVVY-----------IRHHT 296 ++D SGSM + AK + L + + + H Sbjct: 346 PREAIFVIDNSGSMSGPSMVQAKESLLWALDRLKPGDTFNVIRFDDTLTVLFPDAVPAHG 405 Query: 297 QAKEVDEHEFFYSQETGGTIVSSALKLMDEVVKERYNPAQWNIYAAQASDGDNWADDSPL 356 + V + + GGT + AL+ ++ N +DG A + Sbjct: 406 ENLAVAKKFVKSLEANGGTEMLPALRA--SLIDRNVNDGTRLRQIVFLTDG---AISNEA 460 Query: 357 CHEILAKKLLPVVRYYSYIEITRRAHQTLWREYEHLQSTFDNFAMQHIRDQDDIYPVFRE 416 L R ++ + + + + +F + Sbjct: 461 ELFHEITSNLGRSRLFTVGIG-SAPNSYFMTRASEAGRGTFTHIGKETEVTERMAELFEK 519 Query: 417 LFHKQNA 423 L + Sbjct: 520 LQNPVMT 526 >UniRef50_UPI000180CCF8 PREDICTED: similar to PK-120 n=1 Tax=Ciona intestinalis RepID=UPI000180CCF8 Length = 864 Score = 119 bits (298), Expect = 2e-25, Method: Composition-based stats. Identities = 28/285 (9%), Positives = 65/285 (22%), Gaps = 31/285 (10%) Query: 168 RSLQNSLARRTA-MTAGKRRELHALEENLAIISNSEPAQLLEEER------LRKEIAELR 220 L R T E A +S + R R Sbjct: 211 TPPSPKLRRTINPDTFDTGNVEIRRSETQAYVSYRPTREQQRNIRRRSDLSFLVNYDVTR 270 Query: 221 AKIERVPFIDTFDLRYKNYEKRPDPSSQAVMFCLMDVSGSMDQSTKDMAKRFYILLYLFL 280 ++ I + + P + ++DVSGSM K + L Sbjct: 271 EELGGEILIKDGYFVHF-FAPTNLPVIPKKVVFVIDVSGSMSGHKIVQTKEALRTILDDL 329 Query: 281 SRTYKNVEVVYIRHHTQAKE------------VDEHEFFYSQETGGTIVSSALKLMDEVV 328 + + + + + GGT ++A +++ Sbjct: 330 NEIDQFNIITFSSTTNVWHPNEMVDVNPTNIRNAKKHVRSMYARGGTNFNAAALDGIQLL 389 Query: 329 K----ERYNPAQWNIYAAQASDGDNWADD--SPLCHEILAKKLLPVVRYYSYIEITRRAH 382 + R N + +DG + + +++ + Sbjct: 390 ETISSNRTNTLEEASMMILLTDGQPTVGVTGNEAIRRNIRERVNGRYSIFCLGFGQ-HLD 448 Query: 383 QTLWREYEHLQSTFDNFAMQHIRDQDDIYPVFRELFHKQNATAKG 427 + + I + D ++ + + + Sbjct: 449 HEFLDQIASENKG----LSRKIYNDADAALQLKDFYDEVASPLLA 489 >UniRef50_B4VT64 Vault protein inter-alpha-trypsin n=9 Tax=Cyanobacteria RepID=B4VT64_9CYAN Length = 1037 Score = 119 bits (297), Expect = 3e-25, Method: Composition-based stats. Identities = 56/428 (13%), Positives = 116/428 (27%), Gaps = 47/428 (10%) Query: 17 VNRQRFLRRYKAQIKQSISEAINKR--------SVTDVDSGESVSIPTEDISEPMFHQGR 68 RQ + Y+ K + + ++ S+ ++ GES+ + F G Sbjct: 443 KKRQEAKQIYEKAKKAGKTAGLLEQERANVFTQSLANIKPGESIQVTIRYTDSLKFEGGD 502 Query: 69 GGLRHRVHPGNDHFVQNDRIERPQGGGGGSGSGQGQASQDGEGQDEFVFQISKDEYLDLL 128 + V G + + S+ +K ++ Sbjct: 503 YEFAFPMV------VAPRYTAGNSVGSAKAPTTNSVGSKHFSASSAKAPTTNKTLMTNVA 556 Query: 129 FEDLALPNLKQNQQR--QLTEYKTHRAGYTANGVPANISVVRSLQNSLARRTAMTAGKRR 186 + P + + + AG + V + V + T+ R Sbjct: 557 YAAEVNPPIAPPGRSGHDIDVTVEIDAGVPISSVRSPSHPVTT---------QQTSSTVR 607 Query: 187 ELHALEENLAIISNSEPAQLLEEERLRKEIAELRAKIERVPFIDTFDLRYKNYEKRPDPS 246 A +E + Q+ + + L ER T+ + E + + Sbjct: 608 VELADQETIPNKDLILRYQVAGADT---QATVLTQADERGGHFATYLI--PAIEYQQNEI 662 Query: 247 SQAVMFCLMDVSGSMDQSTKDMAKRFYILLYLFLSRTYKNVEVVYIRHHT---------- 296 + L+D SGS S +K L+ + + T Sbjct: 663 VPKDVVFLVDTSGSQSGSPIVQSKELMRQFIQGLNPQDTFTIIDFANSTTQLSDKPLANT 722 Query: 297 -QAKEVDEHEFFYSQETGGTIVSSALKLMDEVVKERYNPAQWNIYAAQASDGDNWADDSP 355 Q ++ + GGT + + + + PA +DG D Sbjct: 723 PQNRKKALNYINRLDANGGTELMNGIDTVLNFPAA---PAGRLRSVVLLTDG--LIGDDE 777 Query: 356 LCHEILAKKLLPVVRYYSYIEITRRAHQTLWREYEHLQSTFDNFAMQHIRDQDDIYPVFR 415 + +L P R YS+ + ++ L L + + F+ Sbjct: 778 QIIAEIRDRLKPGNRLYSFGVGSST-NRFLIERLAELGRGTAEVVPPNESAEVVAQEFFQ 836 Query: 416 ELFHKQNA 423 E+ + Sbjct: 837 EINNPVLT 844 >UniRef50_Q231J4 von Willebrand factor type A domain containing protein n=1 Tax=Tetrahymena thermophila SB210 RepID=Q231J4_TETTH Length = 520 Score = 118 bits (296), Expect = 3e-25, Method: Composition-based stats. Identities = 30/189 (15%), Positives = 60/189 (31%), Gaps = 13/189 (6%) Query: 236 YKNYEKRPDPSSQAVMFCLMDVSGSMDQSTKDMAKRFYILLYLFLSRTYKNVEVVYIRHH 295 +K + ++D SGSM ++ K+ + + + + V + Sbjct: 83 ADQVKKDKVRYQPLDLIFVIDTSGSMQGKKIELVKKSILQVLHIIQGDDRISLVGFNSQA 142 Query: 296 TQAKEVD----------EHEFFYSQETGGTIVSSALKLMDEVVKERYNPAQWNIYAAQAS 345 E+ + Q GGT + ++ +++KER N ++ S Sbjct: 143 KVLLELTQLTKNSKKKIQKTVDELQAGGGTQIGFGMQKAFDIIKERTN-SKNLASIFLLS 201 Query: 346 DGDNWADDSPLCHEILAKKLLPVVRYYSYIEITRRAHQTLWREYEHLQSTFDNFAMQHIR 405 DG + S H + K+ + + LQ NF + I Sbjct: 202 DGQDNCGFSQTQHFMNQSKIEYPFCIDCFGFG-DDHDSLTLSKINQLQQGTFNFI-RDIS 259 Query: 406 DQDDIYPVF 414 DD + + Sbjct: 260 QIDDAFTII 268 >UniRef50_UPI00016C377F protein containing a von Willebrand factor type A domain n=1 Tax=Gemmata obscuriglobus UQM 2246 RepID=UPI00016C377F Length = 821 Score = 118 bits (296), Expect = 3e-25, Method: Composition-based stats. Identities = 31/289 (10%), Positives = 78/289 (26%), Gaps = 26/289 (8%) Query: 156 TANGVPANISVVRSLQNSLARRTAMTAGKRRELHALEENLAIISNSEPAQLLEEERLRKE 215 T ++S+ ++++ + T + +E + + + Sbjct: 173 TRTLEEFSVSLTIKSRHAVQNVYSPTHAVNTVRKSDKEVSVTFERKQALLDKDFQLFYGH 232 Query: 216 IAELRAKIERVPFIDTFDLRYKNY-----EKRPDPSSQAVMFCLMDVSGSMDQSTKDMAK 270 + V + Y + + + ++D S SM AK Sbjct: 233 GDKDIGLSPLVYKPIQTEDGYFMFLISPQVEAEKKRVARDLVLVLDTSSSMSDIKMQQAK 292 Query: 271 RFYILLYLFLSRTYKNVEVVYIRH-----------HTQAKEVDEHEFFYSQETGGTIVSS 319 + L + V + +T ++ + +GGT + Sbjct: 293 KAVKFCLSQLQPEDRFGVVRFSTTVTKFRSELVAANTDYLDLATKWIDGLKTSGGTAIWP 352 Query: 320 ALKLMDEVVKERYNPAQWNIYAAQASDGDNWAD--DSPLCHEILAKKLLPVVRYYSYIEI 377 AL + R + +DG D ++ + + K R +++ Sbjct: 353 ALNDALAM---RSSDPSRPFTMVFFTDGQPTVDETNADKIVKNVLAKNTGNTRIFTFGVG 409 Query: 378 TRRAHQTLWREYEHLQSTFDNFAMQHIRDQDDIYPVFRELFHKQNATAK 426 + + + + R+ +DI L+ K + Sbjct: 410 -DDVNAAMLDQLADSTRAVSTYV----REAEDIEVKVSGLYAKISNPVL 453 >UniRef50_A4J6Q3 von Willebrand factor, type A n=1 Tax=Desulfotomaculum reducens MI-1 RepID=A4J6Q3_DESRM Length = 416 Score = 118 bits (295), Expect = 4e-25, Method: Composition-based stats. Identities = 26/203 (12%), Positives = 52/203 (25%), Gaps = 12/203 (5%) Query: 227 PFIDTFDLRYKNYEKRPDPSSQAVMFCLMDVSGSMDQSTKDMAKRFYILLYLFLSRTYKN 286 + ++ ++ + ++D SGSM D K+ LS Sbjct: 20 KQVAYLMVKLTAPKQVEKERPVQNLSFVIDRSGSMAGEKLDYTKKAVAFAVGHLSPQDYC 79 Query: 287 VEVVY--------IRHHTQAKEVDEHEFFYSQETGGTIVSSALKLMDEVVKERYNPAQWN 338 V + H K+ + G T +S + L VK + Q N Sbjct: 80 SVVAFDDMVTMVASSHQVANKDALKMAVESIYPGGSTNLSGGMLLGVREVKLAHKENQIN 139 Query: 339 IYAAQASDG-DNWADDSPLCHEILAKK-LLPVVRYYSYIEITRRAHQTLWREYEHLQSTF 396 +DG N +++ V ++ + L + Sbjct: 140 RV-LLLTDGMANVGVTDHSALVEKSREMAAGGVNLSTFGLG-EDFEEDLLQAMVEAGGGN 197 Query: 397 DNFAMQHIRDQDDIYPVFRELFH 419 + + + L Sbjct: 198 FYYIEKPDQIPGIFEQELTGLLS 220 >UniRef50_B0TQ23 LPXTG-motif cell wall anchor domain n=1 Tax=Shewanella halifaxensis HAW-EB4 RepID=B0TQ23_SHEHH Length = 850 Score = 118 bits (295), Expect = 4e-25, Method: Composition-based stats. Identities = 24/227 (10%), Positives = 47/227 (20%), Gaps = 37/227 (16%) Query: 233 DLRYKNYEKRPDPSSQAVMFCLMDVSGSMDQSTKDMAKRFYILLYLFLSRTYKNVEVVYI 292 L S + ++D SGSM AK L + + Sbjct: 439 MLMPPQGSDDESSSIARELVLVIDTSGSMSGDAIIQAKSALKYALAGLRPQDSFNVLQFN 498 Query: 293 RH-----------HTQAKEVDEHEFFYSQETGGTIVSSALKLMDEVVKERYNPAQWN--- 338 ++ Q GGT +S AL + Sbjct: 499 STVERWSRHVMPATAINLGRAQNYINGLQADGGTEMSLALDAALTKLDNDRGHNSKPVHD 558 Query: 339 -------------------IYAAQASDGDNWADDSPLCHEILAKKLLPVVRYYSYIEITR 379 +DG A + K L R ++ Sbjct: 559 DDRYQSSNETLEQSAATPLRQVLFITDG---AVANESRLFEQIKNQLGESRLFTIGIG-S 614 Query: 380 RAHQTLWREYEHLQSTFDNFAMQHIRDQDDIYPVFRELFHKQNATAK 426 + + + + + + + ++ Q + Sbjct: 615 APNAHFMQRAAEVGRGTYTYIGKLDEVNQKVVSLLEKIEKPQVTDVE 661 >UniRef50_C1XFF8 Mg-chelatase subunit ChlD n=1 Tax=Meiothermus ruber DSM 1279 RepID=C1XFF8_MEIRU Length = 298 Score = 118 bits (294), Expect = 6e-25, Method: Composition-based stats. Identities = 22/187 (11%), Positives = 47/187 (25%), Gaps = 12/187 (6%) Query: 244 DPSSQAVMFCLMDVSGSMDQSTKD-----MAKRFYILLYLFLSRTYKNVEVVYIRH---- 294 P + A + ++ SM Q+ + L L R K V + + Sbjct: 79 MPENLAGVILAIENGWSMRQTDIAPNRMVATQMAAKALVDKLPRHIKVGVVTFSGYGTLL 138 Query: 295 --HTQAKEVDEHEFFYSQETGGTIVSSALKLMDEVVKERYNPAQWNIYAAQASDGDNWAD 352 T ++ GG + L E + + S G + + Sbjct: 139 LPPTTDRKAIRQAIDNLDLGGGFSFTYGLLAALEALPQTPPEGSRPGVIVLFSHGHDVSG 198 Query: 353 DSPLCHEILAKKLLPVVRYYSYIEITRRAHQTLWREYEHLQSTFDNFAMQHIRDQDDIYP 412 + PL A + V + + ++ + + D + Sbjct: 199 NDPLKIADQALERGIQVHAIGVGTHGHNFDEEMLKKVADRTGGRY-YPIFSASDLSKAHA 257 Query: 413 VFRELFH 419 + Sbjct: 258 DLGRVLA 264 >UniRef50_Q21PJ3 von Willebrand factor, type A n=1 Tax=Saccharophagus degradans 2-40 RepID=Q21PJ3_SACD2 Length = 763 Score = 118 bits (294), Expect = 6e-25, Method: Composition-based stats. Identities = 51/441 (11%), Positives = 113/441 (25%), Gaps = 41/441 (9%) Query: 8 RLNGKNKSMVNRQR---FLRRY-----KAQIKQSISEAINKRSVTDVDSGESVSIPTEDI 59 + GK S+V++QR F ++ + I S++ + + P Sbjct: 140 KAEGKKASLVSQQRPNLFTQKVANIPPRETISVSLTYTQRVEYHSGQFG---LRFPLTLT 196 Query: 60 SEPMFHQGRGGLRHRVHPGN-DHFVQNDRIERPQGGGGGSGSGQGQASQDGEGQDEFVFQ 118 + + + N D + S Q Sbjct: 197 QRYIPNSANLETNVVENTKNWDDERWENSAPDTAEKTPTSIDLAAGGYGWQSFNPIIHTQ 256 Query: 119 ISKDEYLDLLFEDLALPN-LKQNQQRQLTEYKTHRAGYTANGVPANISVVRSLQNSLARR 177 + D ++ P L Q Q +T + + + + SL + Sbjct: 257 KPTPQVPDAHL--ISPPMVLAQGQYGDGQYEQTGKDNRATISIQLDAGFNVANIESLYHQ 314 Query: 178 TAMTAGKRRELHALEENLAIISNSEPAQLLEEERLRKEIAELRAKIERVPFIDTFDLRYK 237 + + N + + + + A + + L Sbjct: 315 ITINKPPSSAYNVELTNGSTLMDRDFVLQWRATASSAPQAAVFKETLAGEDYLLLMLLPP 374 Query: 238 NYEKRPDPSSQAVMFCLMDVSGSMDQSTKDMAKRFYILLYLFLSRTYKNVEVVY------ 291 +++ S + ++D SGSM ++ AKR L+ + + + Sbjct: 375 QGQQQHTQSLSRDIVFVVDTSGSMQGTSIQQAKRSLQFALRGLNPSDTFNIIEFDTSFSR 434 Query: 292 -----IRHHTQAKEVDEHEFFYSQETGGTIVSSALKLMDEVVK-------ERYNPAQWNI 339 + + GT + +AL+ + + E + Sbjct: 435 FRSRPVSATASNVQAAVSWVNNLNADNGTEMYAALEEAFDQLASINPNGTENSKSSNNLQ 494 Query: 340 YAAQASDGDNWADDSPLCHEILAKKLLPVVRYYSYIEITRRAHQTLWREYEHLQSTFDNF 399 +DG A + L + L R ++ + R+ + F Sbjct: 495 QVVFITDG---AVGNEQALLSLIHRRLNNARLFTVAIG-SAPNSYFMRKAAQFGKGANVF 550 Query: 400 AMQHIRDQDDIYPVFRELFHK 420 D ++ L K Sbjct: 551 I----GDTAEVTHKMNALLSK 567 >UniRef50_UPI0000F2DDBB PREDICTED: similar to Inter-alpha (globulin) inhibitor H4 (plasma Kallikrein-sensitive glycoprotein) n=1 Tax=Monodelphis domestica RepID=UPI0000F2DDBB Length = 819 Score = 117 bits (293), Expect = 7e-25, Method: Composition-based stats. Identities = 39/336 (11%), Positives = 87/336 (25%), Gaps = 27/336 (8%) Query: 110 EGQDEFVFQISKDEYLDLLFEDLALP-NLKQNQQRQLTEYKTHRAGYTANGVPANISVVR 168 + F++ +E L L ++ Q + + + + G+ + + + Sbjct: 123 DPAANATFELVYEELLKRHLGKYELMLMIQPKQLVKQLQVDIYI--FEPQGISSLENDIT 180 Query: 169 SLQNSLARRTAMTAGK-RRELHALEENLAIISNSEPAQLLEEERLRKEIAELRAKIERVP 227 + L T K + + + + R Sbjct: 181 FMTKKLEDALTKTQNKTEVHIAFKPSLAQQQKEPWKLNTVVDGKFIVRYDVDRVTTAGDI 240 Query: 228 FIDTFDLRYKNYEKRPDPSSQAVMFCLMDVSGSMDQSTKDMAKRFYILLYLFLSRTYKNV 287 I+ N+ P + L+D SGSM K I + L Sbjct: 241 QIENGYF-VHNFAPTQLPMVPKNIVFLIDKSGSMAGRKIKKTKAALIKILDDLKPEDHFN 299 Query: 288 EVVYIRHHTQAK-EVDEHEFFYSQET----------GGTIVSSALKLMDEVVKERYN--- 333 + + H T+ K E+ + +E G T V+ A+ ++ E Sbjct: 300 MITFSGHVTRWKPELVLALDEHLKEAKTFLSNTPALGVTNVNGAVLAAVSMLDESNKKKE 359 Query: 334 -PAQWNIYAAQASDGDNWADDS--PLCHEILAKKLLPVVRYYSYIEITRRAHQTLWREYE 390 P +DGD+ ++ HE + + + + Sbjct: 360 LPEGSVSMIILLTDGDSTEGETKLQKIHENVKAAIRGQYHLFCLGFGF-DINYVFLERLA 418 Query: 391 HLQSTFDNFAMQHIRDQDDIYPVFRELFHKQNATAK 426 + + D ++ + + Sbjct: 419 LDNGGMARHIFEGL----DAELQLQDFYQEVANPLL 450 >UniRef50_B1KDL1 LPXTG-motif cell wall anchor domain protein n=1 Tax=Shewanella woodyi ATCC 51908 RepID=B1KDL1_SHEWM Length = 739 Score = 117 bits (293), Expect = 7e-25, Method: Composition-based stats. Identities = 25/200 (12%), Positives = 56/200 (28%), Gaps = 16/200 (8%) Query: 237 KNYEKRPDPSSQAVMFCLMDVSGSMDQSTKDMAKRFYILLYLFLSRTYKNVEVVYIRH-- 294 +++ D S + ++D SGSM ++ AKR L + + + Sbjct: 351 PPSDQKQDVSISRELILVIDTSGSMSGASIAQAKRALNYALAGLKAKDTFNVIEFNSNVG 410 Query: 295 ---------HTQAKEVDEHEFFYSQETGGTIVSSALKLMDEV-VKERYNPAQWNIYAAQA 344 + + + GGT + AL + + ++ Sbjct: 411 SLSPYSLPATAKNIGLANQYVRSLKANGGTEMQLALNAALDKGTETEALGSERLRQVLFM 470 Query: 345 SDGDNWADDSPLCHEILAKKLLPVVRYYSYIEITRRAHQTLWREYEHLQSTFDNFAMQHI 404 +DG + L K+ + R ++ + R + + Sbjct: 471 TDG---SVGDEQSLFHLIKQKIGESRLFTLGIG-SAPNSHFMRRAAEFGRGTFTYIGKLD 526 Query: 405 RDQDDIYPVFRELFHKQNAT 424 Q I + ++ Q Sbjct: 527 EVQSKIESLLYQIERPQLTD 546 >UniRef50_D2R2I7 von Willebrand factor type A n=2 Tax=Planctomycetaceae RepID=D2R2I7_9PLAN Length = 786 Score = 117 bits (292), Expect = 8e-25, Method: Composition-based stats. Identities = 29/262 (11%), Positives = 67/262 (25%), Gaps = 18/262 (6%) Query: 178 TAMTAGKRRELHALEENLAIISNSEPAQLLEEERLRKEIAELRAKIERVPFIDTFDLRYK 237 + + E + ++ L + + L + + L Sbjct: 236 VDVKRPDEKHATVKFEASNYLPTTDFRLLYDVGDAPLAASVLSYRPDNSDEGFFLMLASP 295 Query: 238 NYEKRPDPSSQAVMFCLMDVSGSMDQSTKDMAKRFYILLYLFLSRTYKNVEVVYIR---- 293 N+ + ++ + ++D SGSM + A+ + L V Y Sbjct: 296 NHSQGEVDLTKKTVIFVVDRSGSMQGKKIEQAREAMRYVLNNLHEGDTFNIVAYDSTVES 355 Query: 294 -------HHTQAKEVDEHEFFYSQETGGTIVSSALKLMDEVVKERYNPAQWNIYAAQASD 346 ++ G T +S AL ++ P Y +D Sbjct: 356 FKPELQKFDDATRKSALAYVDGLYAGGSTNISGALDSAFAMLTGSDRPN----YILFLTD 411 Query: 347 GDNW-ADDSPLCHEILAKKLL-PVVRYYSYIEITRRAHQTLWREYEHLQSTFDNFAMQHI 404 G + + LAK+ R ++ + L + Sbjct: 412 GLPTAGETNEGKIVELAKQKNVHRARMINFGVGY-DVNSRLLDRMSRENFGQSQYVRPDE 470 Query: 405 RDQDDIYPVFRELFHKQNATAK 426 + + ++ ++ K Sbjct: 471 NLEASVSRLYSKMSSPVLTDVK 492 >UniRef50_A6X8G3 LPXTG-motif cell wall anchor domain protein n=11 Tax=Rhizobiales RepID=A6X8G3_OCHA4 Length = 750 Score = 117 bits (292), Expect = 8e-25, Method: Composition-based stats. Identities = 25/200 (12%), Positives = 48/200 (24%), Gaps = 19/200 (9%) Query: 239 YEKRPDPSSQAVMFCLMDVSGSMDQSTKDMAKRFYILLYLFLSRTYKNVEVVYIRHHTQA 298 Q + ++D SGSM ++ + AK L + + + T+ Sbjct: 342 PALASPKKVQREVIFVIDNSGSMGGTSIEQAKASLDYALSQLQPGDRFNVIRFDDTLTKF 401 Query: 299 KE-----------VDEHEFFYSQETGGTIVSSALKLMDEVVKERYNPAQWNIYAAQASDG 347 E + GGT + AL + N +DG Sbjct: 402 FEDSVDANQENIASARRFVTSLEAQGGTEMLPALHAALD----DSNQGNGLRQIVFLTDG 457 Query: 348 DNWADDSPLCHEILAKKLLPVVRYYSYIEITRRAHQTLWREYEHLQSTFDNFAMQHIRDQ 407 + + R + + L L Sbjct: 458 E---ISNEQQLLDAVAARRGRSRIFMVGIG-SAPNSYLMNRAAELGRGTFTHIGSAAEVD 513 Query: 408 DDIYPVFRELFHKQNATAKG 427 + + +F +L + K Sbjct: 514 ERMRALFDKLENPAVTDLKA 533 >UniRef50_A9G8C3 Putative uncharacterized protein n=2 Tax=Sorangium cellulosum 'So ce 56' RepID=A9G8C3_SORC5 Length = 907 Score = 117 bits (292), Expect = 1e-24, Method: Composition-based stats. Identities = 46/354 (12%), Positives = 75/354 (21%), Gaps = 28/354 (7%) Query: 84 QNDRIERPQGGGGGSGSGQGQASQDGEGQDEFVFQISKDEYLDLLFEDLALPNLKQNQQR 143 D G G G G G S G G S A + Sbjct: 353 AGDGAAPGLGSIGTIGHGAGAGSGQGFGSGHGRAGASSPPVSARSAAAYA-----PEPEV 407 Query: 144 QLTEYKTHRAGYTANGVPANISVVRSLQNSLARRTAMTAGKRRELHALEENLAIISNSEP 203 L Y G + A E + + A + P Sbjct: 408 ALDPNGRFATTYRPGG---------GHLAAFEAALARGVVPAAERELVGDVAARYAPEVP 458 Query: 204 AQLLEEERLRKEIAELRAKIERVPFIDTFDLRYKNYEKRPDPSSQAVMFCLMDVSGSMDQ 263 L + LR ++ F LR P + +D SGSM Sbjct: 459 LALDKALGLRADLERAALGPGGGAFHLRLALRSAAAAAAARPHLSVHLV--LDTSGSMAG 516 Query: 264 STKDMAKRFYILLYLFLSRTYKNVEVVYIRHH---------TQAKEVDEHEFFYSQETGG 314 + D A+R L L+ + + +E GG Sbjct: 517 APIDSARRAAQALVDRLAPADDFSLTTFSSDAEVVIEDGPVGPRRAAIRRAIEGLREGGG 576 Query: 315 TIVSSALKLMDEVVKERYNPAQWNIYAAQASDGDNWADDSPLCHEI--LAKKLLPVVRYY 372 T + + L L P SDG + + ++ Sbjct: 577 TNIGAGLSLGYAQASRPGIPEDAVRVVLLVSDGRATSGLTHSERLAWLALDAFQRGIQTS 636 Query: 373 SYIEITRRAHQTLWREYEHLQSTFDNFAMQHIRDQDDIYPVFRELFHKQNATAK 426 + L + + + + + + Sbjct: 637 ALGLG-DDFDGQLMSAIASDGAGGYYYLRHPEQIAPALSTELDKRLDPVATAVE 689 >UniRef50_A3W9L9 Putative uncharacterized protein n=2 Tax=Erythrobacter RepID=A3W9L9_9SPHN Length = 740 Score = 116 bits (291), Expect = 1e-24, Method: Composition-based stats. Identities = 26/230 (11%), Positives = 51/230 (22%), Gaps = 20/230 (8%) Query: 209 EERLRKEIAELRAKIERVPFIDTFDLRYKNYEKRPDPSSQAVMFCLMDVSGSMDQSTKDM 268 + L + + E+ + M ++D SGSM + Sbjct: 305 SASGDAPMLGLFKQRHGELEYVMATITPPALERVGEAP-PREMIFVIDNSGSMAGESMPA 363 Query: 269 AKRFYILLYLFLSRTYKNVEVVYIRHHTQAKEVDEHEFF-----------YSQETGGTIV 317 A+R + L + + + T+ GGT + Sbjct: 364 ARRSLLYALETLRPQDRFNVIRFDDTMTELFASAVQASDSNIAAAKTFTHNLMANGGTEM 423 Query: 318 SSALKLMDEVVKERYNPAQWNIYAAQASDGDNWADDSPLCHEILAKKLLPVVRYYSYIEI 377 AL+ P + +DG A + + R + Sbjct: 424 LPALRAAL----RDRAPDERVRQVIFLTDG---ALSNEADMMEEINRNRKDSRVFMVGIG 476 Query: 378 TRRAHQTLWREYEHLQSTFDNFAMQHIRDQDDIYPVFRELFHKQNATAKG 427 + L R +D + + L Sbjct: 477 -SAPNTYLMRRMAEAGRGTFTHVGMGEEAEDQMQRLLDRLSLPVATGLTA 525 >UniRef50_UPI000155CC23 PREDICTED: similar to ITI-like protein n=3 Tax=Amniota RepID=UPI000155CC23 Length = 1374 Score = 116 bits (291), Expect = 1e-24, Method: Composition-based stats. Identities = 46/381 (12%), Positives = 95/381 (24%), Gaps = 47/381 (12%) Query: 79 NDHFVQNDRIERPQGGGGGSGSGQGQASQDGEGQDEFVFQISKDEYLDL-LFEDLALPNL 137 ++ Q G + + + S E F +S +E L L + ++ Sbjct: 144 DEARRQGKTAA--HVGVKDRETEKFRVSTSVEAGGTVTFTLSYEELLQRHLGKYQHAVSV 201 Query: 138 KQNQQRQLTEYKTHRAGYTANGVPANISVVRSLQNSLARRTAMT-----AGKRRELHALE 192 + Q + + + I + L +R T Sbjct: 202 RPQQVVKNLSVEVTISE------RTGIDYIHVLPLRTSRLLTNTLRGEADIPPSTKIEKG 255 Query: 193 ENLAII-------SNSEPAQLLEEERLRKEIAELRAKIERVPFIDTFDLRYKNYEKRPDP 245 E A I + + + I I + R P Sbjct: 256 EKCARIIFTPTPQEQAAYSSSGIMGDFVVQYDVSMKDIIGDVQIYNGYF-VHYFAPRGLP 314 Query: 246 SSQAVMFCLMDVSGSMDQSTKDMAKRFYILLYLFLSRTYKNVEVVYIRHHTQAK------ 299 Q + ++DVSGSM + K+ ++ L V + + K Sbjct: 315 PVQKNVVFVIDVSGSMFGTKMKQTKKAMHVILNDLHHDDYFNIVTFSDAVSVWKASGSIQ 374 Query: 300 ------EVDEHEFFYSQETGGTIVSSALKLMDEVVKERYNPAQWNI------YAAQASDG 347 + + + G T +++AL + V + +DG Sbjct: 375 ATPPNIKSAKVYVNKMEADGWTDINAALLVAASVFNQSTGETGRGKGLKKIPLIIFLTDG 434 Query: 348 DNWADDSP--LCHEILAKKLLPVVRYYSYIEITRRAHQTLWREYEHLQSTFDNFAMQHIR 405 + A + + L + + A L R + I Sbjct: 435 EATAGVTVASRILSNAKQSLKGNISLFGLAFG-DDADYHLMRRLSLENRG----VARRIY 489 Query: 406 DQDDIYPVFRELFHKQNATAK 426 + D + + + + Sbjct: 490 EDADATLQLKGFYDEIASPLL 510 >UniRef50_B9GK57 Predicted protein (Fragment) n=2 Tax=Populus trichocarpa RepID=B9GK57_POPTR Length = 595 Score = 116 bits (291), Expect = 1e-24, Method: Composition-based stats. Identities = 39/303 (12%), Positives = 75/303 (24%), Gaps = 47/303 (15%) Query: 133 ALPNLKQNQQRQLTEYKTHRAGYTANGVPANISVVRSLQNSLARRTAMTAGKRRELHALE 192 +P + A Y N P +I + L + HA+ Sbjct: 56 DVPFQAPKNVPSFQRSGSLHA-YVPNASPVHIEPDHFSDDELVPDVSQGQPSSSRPHAI- 113 Query: 193 ENLAIISNSEPAQLLEEERLRKEIAELRAKIERVPFIDTFDLRYKNYEKRPDPSSQAVMF 252 + + + + L ++ P + + Sbjct: 114 TVKTLPEYPAVSASESFSKFGVLVRVLAPPLD---------------NTLPHHRAPIDIV 158 Query: 253 CLMDVSGSMDQSTKDMAKRFYILLYLFLSRTYKNVEVVYIRH----------HTQAKEVD 302 ++DVSGSM + KR + L + + V + +E Sbjct: 159 NVLDVSGSMAGKLI-LLKRAVNFIIQNLGPSDRLSIVTFSSSARRILPLRTMSGSGREDA 217 Query: 303 EHEFFYSQETGGTIVSSALKLMDEVVKERYNPAQWNIYAAQASDGDNWADDSPLCHEILA 362 TGGT + + L+ V++ER SDG + S Sbjct: 218 ISVVNSLSATGGTNIVAGLRKGVRVLEERRQHNSVAS-IILLSDGCDTQSHSTHNRLEYL 276 Query: 363 KKLLPV--------------VRYYSYIEITRRAHQTLWREYEHLQSTFDNFAMQHIRDQD 408 K + P + + + + +F + I Sbjct: 277 KLIFPSNNASGEESRQPTFPIHTFGFGL---DHDSAAMHAISDVSGGTFSFI-ESIDILQ 332 Query: 409 DIY 411 D + Sbjct: 333 DAF 335 >UniRef50_Q2SQR4 Uncharacterized protein containing a von Willebrand factor type A (VWA) domain n=1 Tax=Hahella chejuensis KCTC 2396 RepID=Q2SQR4_HAHCH Length = 733 Score = 116 bits (290), Expect = 2e-24, Method: Composition-based stats. Identities = 34/308 (11%), Positives = 78/308 (25%), Gaps = 27/308 (8%) Query: 130 EDLALPNLKQNQQRQLTEYKTHRAGYTANGVPANISVVRSLQNSLARRTAMTAGKRRELH 189 ++ P + + + R +I V + Sbjct: 245 PEITPPTVAKEDLLTPSHQARIRVHLNPGLPVESIESVTHRIQWTQQTNGYEVSLESNKD 304 Query: 190 ALEENLAIISNSEPAQLLEEERLRKEIAELRAKIERVPFIDTFDLRYKNYEKRPDPSSQA 249 + ++ E L KEI + + ++ + S Sbjct: 305 VPMDKDFTLTWRVRQGSEPEAALFKEI----VGDDVYAQLLLMPPQFSD----EGLSLPR 356 Query: 250 VMFCLMDVSGSMDQSTKDMAKRFYILLYLFLSRTYKNVEVVYIRHHT-----------QA 298 + ++D SGSM+ + A+ + L+ + + + H +A Sbjct: 357 ELIWVVDTSGSMEGVSIQQARDAVLQALDTLTPRDRFNVIEFNSHARKLFPQAVPAQERA 416 Query: 299 KEVDEHEFFYSQETGGTIVSSALKLMDEVVKERYNPAQWNIYAAQASDGDNWADDSPLCH 358 + + GGT ++ AL P + +DG + + L Sbjct: 417 LQQARRFVRGLKADGGTEIAEALDRALSDAA----PEGYVRQVVFLTDG---SVGNELAL 469 Query: 359 EILAKKLLPVVRYYSYIEITRRAHQTLWREYEHLQSTFDNFAMQHIRDQDDIYPVFRELF 418 + L R ++ ++ R+ + D I + L Sbjct: 470 FKQIDQQLGDSRLFTVGIG-PSPNRFFMRKAAQFGRGAYSHINDTAEVSDKIAELTAALR 528 Query: 419 HKQNATAK 426 + Sbjct: 529 QPALRDVR 536 >UniRef50_B8HNU4 von Willebrand factor type A n=4 Tax=Chroococcales RepID=B8HNU4_CYAP4 Length = 421 Score = 116 bits (289), Expect = 2e-24, Method: Composition-based stats. Identities = 26/188 (13%), Positives = 60/188 (31%), Gaps = 15/188 (7%) Query: 244 DPSSQAVMFCLMDVSGSMDQSTKDMAKRFYILLYLFLSRTYKNVEVVYI--------RHH 295 + ++ + ++D SGSM + KR L L + + +V+ Sbjct: 37 ERNAPLNLGLILDHSGSMAGQPLETVKRAAQKLVDRLLPSDRLAVIVFDHVAKVLIPNQP 96 Query: 296 TQAKEVDEHEFFYSQETGGTIVSSALKLMDEVVKERYNPAQWNIYAAQASDGDNWADDSP 355 ++ + + GGT + L+L + A +DG+N ++ Sbjct: 97 VTDRDKIKTRISHLAAMGGTAIDEGLQLGLTELIAA--KAGAISQIFLLTDGENEHGNNS 154 Query: 356 LCHEILAKKLLPVVRYYSYIEITRRAHQTLWREYEHLQSTFDNFAMQHIRDQDDIYPVFR 415 C ++ + + + +Q + + F D+ F Sbjct: 155 RCLQLAEEAAKENITLNTLGFGY-HWNQDVLEQIADAAGGSLMFI----EYPQDVLIGFE 209 Query: 416 ELFHKQNA 423 LF++ + Sbjct: 210 RLFNQIIS 217 >UniRef50_D2VKS7 von Willebrand factor type A domain-containing protein n=2 Tax=Naegleria gruberi RepID=D2VKS7_NAEGR Length = 923 Score = 116 bits (289), Expect = 2e-24, Method: Composition-based stats. Identities = 28/211 (13%), Positives = 57/211 (27%), Gaps = 23/211 (10%) Query: 231 TFDLRYKNYEKRPDP-SSQAVMFCLMDVSGSMDQSTKDMAKRFYILLYLFLSRTYKNVEV 289 L+ +E++ + ++D SGSM DM K + L + V Sbjct: 672 MATLQGPCFEQQAQKERKGVDLVLVVDKSGSMAGQKLDMVKSTLSFMVDQLKEKDRVAIV 731 Query: 290 VYIRH----------HTQAKEVDEHEFFYSQETGGTIVSSALKLMDEVVKERYNPAQWNI 339 + + K+ + T +S AL +++ R Sbjct: 732 EFDTQVKTNLDLTKMDIEGKKKAKQVSSAISPGSCTNLSGALFTSLKLLASRQQEKNEVT 791 Query: 340 YAAQASDG-DNWADDSPLCHEILAKKLLP------VVRYYSYIEITRRAHQTLWREYEHL 392 +DG N S + L+ V +++ + Sbjct: 792 SVILFTDGLANRGLISTNEILQNMQDLMDELLSTSNVTIHTFGFGQDT-DANMLTSIAQK 850 Query: 393 QSTFDNFAMQHIRDQDDIYPVFRELFHKQNA 423 + ++ DDI F + + Sbjct: 851 GNGLYDYLET----ADDIPKAFGNVIGNLVS 877 >UniRef50_Q9NY47 Voltage-dependent calcium channel subunit delta-2 n=115 Tax=Euteleostomi RepID=CA2D2_HUMAN Length = 1150 Score = 116 bits (289), Expect = 2e-24, Method: Composition-based stats. Identities = 38/332 (11%), Positives = 82/332 (24%), Gaps = 23/332 (6%) Query: 105 ASQDGEGQDEFVFQISKDEYLDLLFEDLALPNLKQNQQRQLTEYKTHRAGYTANGVPANI 164 E + + E D ED+ + + E + A + Sbjct: 151 QDNIKEEDIVYYDAKADAELDDPESEDVERGSKASTLRLDFIEDPNFKNKVNY--SYAAV 208 Query: 165 SVVRSLQNSLARRTAMTAGKRRELHALEENLAIISNSEPAQLLEEERLRKEIAELRAKIE 224 + + + E + + Sbjct: 209 QIPTDIYKGSTVILNELNWTEALENVFME--NRRQDPTLLWQVFGSATGVTRYYPATPWR 266 Query: 225 RVPFIDTFDLRYKNYEKRPDPSSQAVMFCLMDVSGSMDQSTKDMAKRFYILLYLFLSRTY 284 ID +D+R + + SS M ++DVSGS+ T + K + LS Sbjct: 267 APKKIDLYDVRRRPW-YIQGASSPKDMVIIVDVSGSVSGLTLKLMKTSVCEMLDTLSDDD 325 Query: 285 KNVEVVY-------------IRHHTQAKEVDEHEFFYSQETGGTIVSSALKLMDEVVKER 331 + ++ + + K+V + G T + + + ++ Sbjct: 326 YVNVASFNEKAQPVSCFTHLVQANVRNKKVFKEAVQGMVAKGTTGYKAGFEYAFDQLQNS 385 Query: 332 YN-PAQWNIYAAQASDGDNWADDSPLCHEILAKKLLPVVRYYSYIEITRRAHQTLWREYE 390 A N +DG +D VR +++ T + Sbjct: 386 NITRANCNKMIMMFTDG---GEDRVQDVFEKYNWPNRTVRVFTFSVGQHNYDVTPLQWMA 442 Query: 391 HLQSTFDNFAMQHIRDQDDIYPVFRELFHKQN 422 F + I + ++ + Sbjct: 443 CANKG-YYFEIPSIGAIRINTQEYLDVLGRPM 473 >UniRef50_Q7Z3S7 Voltage-dependent calcium channel subunit delta-4 n=102 Tax=Eumetazoa RepID=CA2D4_HUMAN Length = 1137 Score = 116 bits (289), Expect = 2e-24, Method: Composition-based stats. Identities = 44/339 (12%), Positives = 91/339 (26%), Gaps = 27/339 (7%) Query: 104 QASQDGEGQDEFVFQISKDEYLDLL-FEDLALPNLKQNQQRQLTEYKTHRAGYTANGVPA 162 +A+++ + EF + D Y +L E N + L E H + N + Sbjct: 146 EAAEEADLNHEFNESLVFDYYNSVLINERDEKGNFVELGAEFLLESNAHFSNLPVNTSIS 205 Query: 163 NISVVRSLQNSLARRTAMTAGKRR-ELHALEENLAIISNSEPAQLLEEERLRKEIAELRA 221 ++ + ++ N +E + + R Sbjct: 206 SVQLPTNVYNKDPDILNGVYMSEALNAVFVENFQRDPTLTWQYFGSATGFFRIYPGIKWT 265 Query: 222 KIERVPFIDTFDLRYKNYEKRPDPSSQAVMFCLMDVSGSMDQSTKDMAKRFYILLYLFLS 281 E R +S + L+DVSGSM +AK + L Sbjct: 266 PDENGVITFDCRNRGWYI---QAATSPKDIVILVDVSGSMKGLRMTIAKHTITTILDTLG 322 Query: 282 RTYKNVEVVY---------------IRHHTQAKEVDEHEFFYSQETGGTIVSSALKLMDE 326 + Y ++ +E + G +V AL+ + Sbjct: 323 ENDFVNIIAYNDYVHYIEPCFKGILVQADRDNREHFKLLVEELMVKGVGVVDQALREAFQ 382 Query: 327 VV---KERYNPAQWNIYAAQASDGDNWADDSPLCHEILAKKLLPVVRYYSYIEITRRAHQ 383 ++ +E + N SDG A + VR ++Y+ + Sbjct: 383 ILKQFQEAKQGSLCNQAIMLISDG---AVEDYEPVFEKYNWPDCKVRVFTYLIGREVSFA 439 Query: 384 TLWREYEHLQSTFDNFAMQHIRDQDDIYPVFRELFHKQN 422 + + Q+++ + + Sbjct: 440 DRMKWIACNNKGYYTQISTLADTQENVMEYLH-VLSRPM 477 >UniRef50_UPI00006A1915 Transmembrane protein 110. n=2 Tax=Xenopus (Silurana) tropicalis RepID=UPI00006A1915 Length = 728 Score = 116 bits (289), Expect = 2e-24, Method: Composition-based stats. Identities = 23/205 (11%), Positives = 49/205 (23%), Gaps = 22/205 (10%) Query: 239 YEKRPDPSSQAVMFCLMDVSGSMDQSTKDMAKRFYILLYLFLSRTYKNVEVVYI-----R 293 + + ++D SGSM ++ + L +++ Sbjct: 238 FAPASLQKVPKNVVFVIDHSGSMHGQKIKQTYEAFLKILADLPEEDHFGILIFDDKVDKW 297 Query: 294 HHTQAKEV------DEHEFFYSQETGGTIVSSALKLMDEVVKERYNPAQWNI----YAAQ 343 +T K V + GGT ++ AL +++K Sbjct: 298 QNTLVKAVPDNIIKAKQFVSKISARGGTDINKALLAAVKMLKNTSRNKLLPKISTSIILF 357 Query: 344 ASDGDNW-ADDSPLCHEILAKKLLPV-VRYYSYIEITRRAHQTLWREYEHLQSTFDNFAM 401 SDG+ + KK Y + Sbjct: 358 LSDGEPTSGVTNHNEIINNVKKANERQTTLYCLGFGN-DVDFNFLEKMALENGG----LA 412 Query: 402 QHIRDQDDIYPVFRELFHKQNATAK 426 + I + D + +++ Sbjct: 413 RRIYEDSDAALQLQGFYNEVANPLL 437 >UniRef50_A4YGU7 von Willebrand factor, type A n=12 Tax=Sulfolobaceae RepID=A4YGU7_METS5 Length = 383 Score = 116 bits (289), Expect = 2e-24, Method: Composition-based stats. Identities = 28/187 (14%), Positives = 61/187 (32%), Gaps = 12/187 (6%) Query: 230 DTFDLRYKNYEKRPDPSSQAVMFCLMDVSGSMDQSTKDMAKRFYILLYLFLSRTYKNVEV 289 + ++ ++ L+D SGSMD + AK+ I L + + K V Sbjct: 19 LKMAFKILLVPEKISTATGFHYIVLLDTSGSMDGLKIESAKKGAIELLKRIPQGNKVSFV 78 Query: 290 VYIRHHTQAKEVDE-----HEFFYSQETGGTIVSSALKLMDEVVKERYNPAQWNIYAAQA 344 + +E + E G T +AL + + P+ Y Sbjct: 79 TFSSRVNIVREFVDPEDLTAEISSLSAGGQTAFFTALLTAFNLHNKHGIPS----YVILL 134 Query: 345 SDGDNWADDSPLCHEILAKKLLPVVRYYSYIEITRRAHQTLWREYEHLQSTFDNFAMQHI 404 +DG+ D + ++ +A + V+ S+ ++T+ + + Sbjct: 135 TDGNPTDDTNVETYKRIA--IPNGVQTISFGLG-DDYNETILKSLADRSGGVFYHVNDAM 191 Query: 405 RDQDDIY 411 + + Sbjct: 192 EIPEKLP 198 >UniRef50_A0P2E4 Von Willebrand factor type A like domain n=1 Tax=Labrenzia aggregata IAM 12614 RepID=A0P2E4_9RHOB Length = 772 Score = 115 bits (288), Expect = 3e-24, Method: Composition-based stats. Identities = 42/371 (11%), Positives = 88/371 (23%), Gaps = 34/371 (9%) Query: 42 SVTDVDSGESVSIPTEDISEPMFHQGRGGLRHRVHPGNDHFVQNDRIERPQGGGGGSGSG 101 SV +D + +P + L+ + + ++D GS Sbjct: 172 SVPKIDGAYELVMPMVVGPRYEGPGAQAVLKPSQNLEDKRGAEDDGPGYGDIEYPRDGSV 231 Query: 102 QGQASQDGEGQDEFVFQISKDEYLDLLFEDLALPNLKQNQQRQLTEYKTHRAGYTANGVP 161 + + + +QI + + A + + K + Sbjct: 232 SAKPAAFTHSETVSGWQIDRLPAYPKVIGQNAPKEIDPKRVSLELALKAPMPVSRLSSDT 291 Query: 162 ANISVVRSLQNSLARRTAMTAGKRRELHALEENLAIISNSEPAQLLEEERLRKEIAELRA 221 + V R + A R+ E A + E + L Sbjct: 292 HALEVSSKGDMQTVRFKSGRAIDNRDFVLRYELAAQSDVAAGVSSRYEADGGGYFSLLIE 351 Query: 222 KIERVPFIDTFDLRYKNYEKRPDPSSQAVMFCLMDVSGSMDQSTKDMAKRFYILLYLFLS 281 + D Q + ++D SGSM + +K F L Sbjct: 352 PPKL---------------PAEDMIGQRELVFVLDTSGSMSGQPIEASKTFMTAAIKALR 396 Query: 282 RTYKNVEVVYIRHHTQ-----------AKEVDEHEFFYSQETGGTIVSSALKLMDEVVKE 330 + + +Q K+ GGT ++ A+ + + Sbjct: 397 PDDYFRILHFSNDTSQFAGQAVLATERNKQKALKFVADLSAGGGTEINQAVNAAFDQAQ- 455 Query: 331 RYNPAQWNIYAAQASDGDNWADDSPLCHEILAKKLLPVVRYYSYIEITRRAHQTLWREYE 390 P +DG + R Y++ ++ L Sbjct: 456 ---PDNTTRIVVFLTDG---YIGDEATVIKSIANRIGKARIYAFGVGNS-VNRFLLDAMA 508 Query: 391 HLQSTFDNFAM 401 + + Sbjct: 509 TEGRGYARYVA 519 >UniRef50_D1KBY4 Putative uncharacterized protein n=2 Tax=Proteobacteria RepID=D1KBY4_9GAMM Length = 682 Score = 115 bits (288), Expect = 3e-24, Method: Composition-based stats. Identities = 31/238 (13%), Positives = 70/238 (29%), Gaps = 22/238 (9%) Query: 200 NSEPAQLLEEERLRKEIAELRAKIERVPFIDTFDLRYKNYEKRPDPSSQAVMFCLMDVSG 259 + L +A + ++ E + + ++D SG Sbjct: 284 RDFELTWQAHKTLTPTLALFTQQKGDDHYLMLMATP-PADEVFKQSHTPREVIFIIDSSG 342 Query: 260 SMDQSTKDMAKRFYILLYLFLSRTYKNVEVVYIR-----HHTQAKEVDEHEF------FY 308 SM S+ + A I L T + + + T +D ++ + Sbjct: 343 SMMGSSMEQATNALIQAINRLKPTDRFNIIDFDSDFEVLFDTAIPAIDMNKRHGIRFAKH 402 Query: 309 SQETGGTIVSSALKLMDEVVKERYNPAQWNIYAAQASDGDNWADDSPLCHEILAKKLLPV 368 +GGT A+K + + + ++ +DG + ++ + Sbjct: 403 LVASGGTEPLEAIKFAL--LSKDEDSDKYLRQVIFLTDGQ---VGNEKELFRAVQQNIDD 457 Query: 369 VRYYSYIEITRRAHQTLWREYEHLQSTFDNFAMQHIRDQDDIYPVFRELFHKQNATAK 426 R+++ + L + + D D++ ELF K + A Sbjct: 458 DRFFTIGIG-SAPNDYLMTKMAEYGKGAFTYI----GDIDEVEVKMGELFSKLESPAM 510 >UniRef50_B9EIV3 Cacna2d4 protein n=1 Tax=Mus musculus RepID=B9EIV3_MOUSE Length = 1091 Score = 115 bits (288), Expect = 3e-24, Method: Composition-based stats. Identities = 43/339 (12%), Positives = 89/339 (26%), Gaps = 27/339 (7%) Query: 104 QASQDGEGQDEFVFQISKDEYLDLL-FEDLALPNLKQNQQRQLTEYKTHRAGYTANGVPA 162 +A+++ + EF + + Y +L E N + L E H + N + Sbjct: 125 EAAEEADLNHEFNASLVFNYYNSVLINEKDDKGNYVELGAEFLLESDAHFSNLRVNVSMS 184 Query: 163 NISVVRSLQNSLARRTAMTAGKRR-ELHALEENLAIISNSEPAQLLEEERLRKEIAELRA 221 ++ + ++ N +E + + R Sbjct: 185 SVQLPTNVYNKDPDILNGVYMSEALNPVFVENFQRDPTLTWQYFGSSTGFFRIYPGIKWM 244 Query: 222 KIERVPFIDTFDLRYKNYEKRPDPSSQAVMFCLMDVSGSMDQSTKDMAKRFYILLYLFLS 281 E R +S + L+D+SGSM +AK + L Sbjct: 245 PDENGVIAFDCRNRGWYI---QAATSPKDIVILVDISGSMKGLRMAIAKHTITTILDTLG 301 Query: 282 RTYKNVEVVY---------------IRHHTQAKEVDEHEFFYSQETGGTIVSSALKLMDE 326 + Y ++ +E + G +VS AL E Sbjct: 302 ENDFVNIIAYNDYVHYIEPCFKGILVQADRDNREHFKQLVDELMVKGVGVVSQALIEAFE 361 Query: 327 VV---KERYNPAQWNIYAAQASDGDNWADDSPLCHEILAKKLLPVVRYYSYIEITRRAHQ 383 ++ +E + N +DG A + VR ++Y+ Sbjct: 362 ILKQFQESKQGSLCNQAIMLITDG---AVEDYEPVFETYNWPDRKVRVFTYLIGREVTFA 418 Query: 384 TLWREYEHLQSTFDNFAMQHIRDQDDIYPVFRELFHKQN 422 + + Q+ + + + Sbjct: 419 DRMKWIACNNKGYYTQISTLADAQESVMEYLH-VLSRPM 456 >UniRef50_A0E9B3 Chromosome undetermined scaffold_84, whole genome shotgun sequence n=1 Tax=Paramecium tetraurelia RepID=A0E9B3_PARTE Length = 603 Score = 115 bits (288), Expect = 3e-24, Method: Composition-based stats. Identities = 24/176 (13%), Positives = 54/176 (30%), Gaps = 15/176 (8%) Query: 246 SSQAVMFCLMDVSGSMDQSTKDMAKRFYILLYLFLSRTYKNVEVV----------YIRHH 295 + C++DVSGSM ++ K L L + +V +IR+ Sbjct: 117 RPPIDLVCVVDVSGSMIGRKINLVKDSLRYLMKILGPEDRICIIVFTTVAHIVTSFIRNT 176 Query: 296 TQAKEVDEHEFFYSQETGGTIVSSALKLMDEVVKERYNPAQWNIYAAQASDGDNWADDSP 355 + K + + + T +S + ++K R + SDG + + Sbjct: 177 QENKPLLKKAILELKGLASTNISDGMNKALWMLKNRKYKNPVSC-IFLLSDGQDDYKGAE 235 Query: 356 LCHEILAK--KLLPVVRYYSYIEITRRAHQTLWREYEHLQSTFDNFAMQHIRDQDD 409 + K+ +++ + + + + +I D Sbjct: 236 QRVFDQLQLLKIEEKFVIHTFGYGQ-DHDAYVMNQIAKYREGNFYYI-DNINKASD 289 >UniRef50_B5ZY26 Vault protein inter-alpha-trypsin domain protein n=11 Tax=Rhizobium RepID=B5ZY26_RHILW Length = 794 Score = 115 bits (288), Expect = 3e-24, Method: Composition-based stats. Identities = 30/230 (13%), Positives = 60/230 (26%), Gaps = 21/230 (9%) Query: 208 EEERLRKEIAELRAKIERVPFIDTFDLRYKNYEKRPDPSSQAVMFCLMDVSGSMDQSTKD 267 + + A L +++ + P ++ + ++D SGSM + + Sbjct: 313 QAAPGKLPSAGLFREVKDGKTCLLAFVTPPTAPDAAAPPAKREVVFVIDNSGSMSGPSIE 372 Query: 268 MAKRFYILLYLFLSRTYKNVEVVYIRHHTQ-----------AKEVDEHEFFYSQETGGTI 316 AK+ L L+ + + + T +E GGT Sbjct: 373 QAKQSLALAISRLTPNDRFNVIRFDDTMTDYFKGLVAATPDNREKAIAYVRGLPADGGTE 432 Query: 317 VSSALKLMDEVVKERYNPAQWNIYAAQASDGDNWADDSPLCHEILAKKLLPVVRYYSYIE 376 + AL+ + +DG A + R ++ Sbjct: 433 MLPALEDALR--NQGPVATGALRQVVFLTDG---AIGNEQQLFQEITANRGDARVFTVGI 487 Query: 377 ITRRAHQTLWREYEHLQSTFDNFAMQHIRDQDDIYPVFRELFHKQNATAK 426 + + + I D + ELF K A Sbjct: 488 G-SAPNTYFMTKAAEIGRGTF----TQIGSTDQVASRMGELFAKLQNPAM 532 >UniRef50_A0C9G5 Chromosome undetermined scaffold_16, whole genome shotgun sequence n=2 Tax=Paramecium tetraurelia RepID=A0C9G5_PARTE Length = 648 Score = 115 bits (287), Expect = 4e-24, Method: Composition-based stats. Identities = 23/195 (11%), Positives = 59/195 (30%), Gaps = 25/195 (12%) Query: 246 SSQAVMFCLMDVSGSMDQSTKDMAKRFYILLYLFLSRTYKNVEVVYIRHHTQ-------- 297 + + C++D SGSM+ ++ + L FLS + + + + Sbjct: 225 EAGIDLLCVIDKSGSMEGKKIASVQQSLVQLLDFLSEKDRLCLITFDGSAQRLTPLKTLT 284 Query: 298 --AKEVDEHEFFYSQETGGTIVSSALKLMDEVVKERYNPAQWNIYAAQASDGDNWADDSP 355 K + + + +G T ++ ++ +++R Q SDG + Sbjct: 285 QDNKNYFKKAIYSIRASGQTNIAKGTEIAFNQIQQRKMKNQVTS-IFLLSDGQDQGAAEY 343 Query: 356 LCHEILAKKLLPVVRYYSYIEITRRAHQTLWREYEHLQSTFDNFAMQHIRDQDDIYPVFR 415 + + + + + + Y L + + + + Sbjct: 344 IQRQKDVVEDIVTIHSFGYG---SDHDAALMSKICKVGQGSFYYIED--------VKLLD 392 Query: 416 ELFHKQ---NATAKG 427 E F ++A Sbjct: 393 EFFADALGRLSSALA 407 >UniRef50_Q8PU63 Putative chloride channel n=1 Tax=Methanosarcina mazei RepID=Q8PU63_METMA Length = 1004 Score = 114 bits (286), Expect = 4e-24, Method: Composition-based stats. Identities = 26/199 (13%), Positives = 53/199 (26%), Gaps = 22/199 (11%) Query: 242 RPDPSSQAVMFCLMDVSGSMDQSTKDMAKRFYILLYLFLSRTYKNVEVVYIRHHTQAKEV 301 + ++ A + ++D SGSM S AK L ++ V + + Sbjct: 306 EDNANANANVMLVIDRSGSMSGSPISSAKNSANLFIDYMEAEDMAGVVSFSSSARYDYHL 365 Query: 302 ----------DEHEFFYSQETGGTIVSSALKLMDEVVKERYNPAQWNIYAAQASDGDNWA 351 + + +G T + S ++ + +P SDG + Sbjct: 366 ATLTPEVKNSIKQKINSIYASGVTAIGSGMRYGLNDLLNYGDPN-NPWAIVLLSDGYQNS 424 Query: 352 DDSPLCHEILAKKLLPVVRYYSYIEITRRAHQTLWREYEHLQSTFDNFAMQHIRDQDDIY 411 ++P K V Q L ++ +IY Sbjct: 425 GENPNNVIPSIKASNIQVYTVGLG---PAVDQKLLGNIADQTGGKYYYSPTD-SQLQEIY 480 Query: 412 PVF-------RELFHKQNA 423 + +F + Sbjct: 481 NDIVGKIIGWKTVFKRNVK 499 >UniRef50_Q4RV83 Chromosome 15 SCAF14992, whole genome shotgun sequence. (Fragment) n=3 Tax=Euteleostomi RepID=Q4RV83_TETNG Length = 1434 Score = 114 bits (286), Expect = 4e-24, Method: Composition-based stats. Identities = 40/277 (14%), Positives = 77/277 (27%), Gaps = 26/277 (9%) Query: 163 NISVVRSLQNSLAR-RTAMTAGKRRELHALEENLAIISNSEPAQLLEEERLRKEIAELRA 221 ++S S L R R + + + + +SEP + LE R R L + Sbjct: 333 HLSEAHSPLVILERGRFSFGQYEEQISSRRDFIRCTRKDSEPERKLEFVRKRYHKDILSS 392 Query: 222 KIERVPFIDTFDLRYKNYEKRPDPSSQAVMFCLMDVSGSMDQSTKDMAKRFYILLYLFLS 281 + + F E + + L+D SGSM + K ++ L Sbjct: 393 PVLMLNFCPDLLG-----EPLELHRATRELLFLVDRSGSMSGTKIQSVKEAMVIALKSLP 447 Query: 282 RTYKNVEVVYIR-----------HHTQAKEVDEHEFFYSQET-GGTIVSSALKLMDEVVK 329 K V + + GT + AL + + Sbjct: 448 PGTKLNIVGFGTTIKPLFTSSKLSTDVTILQACEYLQRMRADMKGTNLLGALSWVYQQPM 507 Query: 330 ERYNPAQWNIYAAQASDGDNWADDSPLCHEILAKKLLPVVRYYSYIEITRRAHQTLWREY 389 +R + +DG + L ++ R + +A + L + Sbjct: 508 QRS----YPRQVFIITDG---CVSNVAKVLELVRRNACAGRCFGLGLG-PKACRRLLQGV 559 Query: 390 EHLQSTFDNFAMQHIRDQDDIYPVFRELFHKQNATAK 426 L + R Q + ++ F + Sbjct: 560 AKLTGGTAEYLDDEERLQPKVIKSLKKAFEPVLTDVR 596 >UniRef50_A5B5Z1 Putative uncharacterized protein n=2 Tax=Vitis vinifera RepID=A5B5Z1_VITVI Length = 686 Score = 114 bits (286), Expect = 4e-24, Method: Composition-based stats. Identities = 32/216 (14%), Positives = 53/216 (24%), Gaps = 46/216 (21%) Query: 246 SSQAVMFCLMDVSGSMDQSTKDMAKRFYILLYLFLSRTYKNVEVVYIRH----------H 295 + + ++DVSGSM S + KR L L + + V + Sbjct: 200 RAPIDLVAVLDVSGSMAGSKLSLLKRAVCFLIQNLGPSDRLSIVSFSSTARRIFPLRRMS 259 Query: 296 TQAKEVDEHEFFYSQETGGTIVSSALKLMDEVVKERYNPAQWNIYAAQASDGDNW----- 350 +E +GGT + LK V++ER SDG + Sbjct: 260 DNGREAAGLAINSLXSSGGTNIVEGLKKGVRVLEERSEQNPVAS-IILLSDGKDTYNCDN 318 Query: 351 ---------ADDSPLCHEILA------------------KKLLPVVRYYSYIEITRRAHQ 383 A +P ++ + V + + Sbjct: 319 VNRRQTSHCASSNPRQVLEYLNLLPASICPRNRESGDEGRQAIIPVHTFGFG---SDHDS 375 Query: 384 TLWREYEHLQSTFDNFAMQHIRDQDDIYPVFRELFH 419 T +F QD L Sbjct: 376 TAMHAISDESGGTFSFIESVAXVQDAFAMCIGGLLS 411 >UniRef50_A1S752 Inter-alpha-trypsin inhibitor domain protein n=1 Tax=Shewanella amazonensis SB2B RepID=A1S752_SHEAM Length = 753 Score = 114 bits (286), Expect = 5e-24, Method: Composition-based stats. Identities = 25/258 (9%), Positives = 58/258 (22%), Gaps = 15/258 (5%) Query: 178 TAMTAGKRRELHALEENLAIISNSEPAQLLEEERLRKEIAELRAKIERVPFIDTFDLRYK 237 + + + + + + E L + Sbjct: 326 INEGSEPVADFLVQPGYSYPAAVESANTQYGTAQDKAKQDEYVRVDFSQGNYSHGLLTFM 385 Query: 238 NYEKRPDPSSQAVMFCLMDVSGSMDQSTKDMAKRFYILLYLFLSRTYKNVEVVYIRHHTQ 297 + + ++D SGSM + A+ I L + + Sbjct: 386 PPQPNLANRLARELVLVIDTSGSMAGDSMVQARSALIHALGGLGPQDSFNIIAFSSDARP 445 Query: 298 AKE-----------VDEHEFFYSQETGGTIVSSALKLMDEVVKERYNPAQWNIYAAQASD 346 + + GGT ++SAL+L + + +D Sbjct: 446 LWPDAKPATAFNLGAAQQFVRSLEADGGTEMASALELALKTPSVVDEDTKRLRQVLFITD 505 Query: 347 GDNWADDSPLCHEILAKKLLPVVRYYSYIEITRRAHQTLWREYEHLQSTFDNFAMQHIRD 406 G +D L ++ L R + + F Sbjct: 506 GAVNGED---ALFNLIERRLGTSRLFPVAIGA-APNGYFMSRAAAAGRGSFTFIGHGGEV 561 Query: 407 QDDIYPVFRELFHKQNAT 424 + + + + H + Sbjct: 562 AEKMNQLLSRIEHPVVSD 579 >UniRef50_D1A557 Vault protein inter-alpha-trypsin domain protein n=2 Tax=Streptosporangineae RepID=D1A557_THECD Length = 795 Score = 114 bits (286), Expect = 5e-24, Method: Composition-based stats. Identities = 48/428 (11%), Positives = 104/428 (24%), Gaps = 47/428 (10%) Query: 11 GKNKSMVNRQRFLRRYKAQIKQSISEAINKRSVTDVDSGESVSIPTEDISEPMFHQGRGG 70 ++ R R Y+ + A+ + DV + + +IP + + Sbjct: 87 TVEAALKERGRARADYQEAVAAGHRAALAEEDRPDVFTMQVGNIPPGERVTVRLTLDQPL 146 Query: 71 LRHRVHPGNDHFVQNDRIERPQGGGGGSGSGQGQASQDGEGQDEFVFQISKDEYLDLLFE 130 + P +G G A+ D IS L + Sbjct: 147 PYEDGAATFRFPLVVAPRYIPGTALPDERAGDGIAADTDAVPDASR--ISPPVLLPGFPD 204 Query: 131 DLALPNLKQNQQRQLTEYKTHRAGYTANGVPANISVVRSLQNSLARRTAMTAGKRRELHA 190 + L E AG+ + +++ VV + R T + Sbjct: 205 PVRL----------SLEADIDPAGFPLGEIRSSLHVVAADTRGSGR---TTVRLQPGERL 251 Query: 191 LEENLAIISNSEPAQLLEEERLRKEIAELRAKIERVPFIDTFDLRYKNYEKRPDPSSQAV 250 + + ++ P Q L + Sbjct: 252 DRDFVLRLAYGRPEQAAASVTLTPDAEGESGTFTLTV----------LPPSERCAPRPRD 301 Query: 251 MFCLMDVSGSMDQSTKDMAKRFYILLYLFLSRTYKNVEVVY--------------IRHHT 296 + L+D SGSM A+R + L+ + + + Sbjct: 302 VVILLDRSGSMHGWKMVAARRAAARIVDTLTGRDRFAVLSFDDMVERPAGLDGGLSPATD 361 Query: 297 QAKEVDEHEFFYSQETGGTIVSSALKLMDEVVKERYNPAQWNIYAAQASDGDNWADDSPL 356 + + Q GGT +++ L+ ++ + A + +DG + Sbjct: 362 RNRFRAVEHLAGLQARGGTELAAPLREGAALL----DDAGRDRVLVLITDGQ---VGNED 414 Query: 357 CHEILAKKLLPVVRYYSYIEITRRAHQTLWREYEHLQSTFDNFAMQHIRDQDDIYPVFRE 416 L L +R ++ + R + + + Sbjct: 415 QLLALIDPFLNGLRIHAVGIDQ-AVNAGFLGRLATAGQGRLELVESEDRLDEAMEHIHHR 473 Query: 417 LFHKQNAT 424 + Sbjct: 474 INAPLLTG 481 >UniRef50_B9SJS6 Protein binding protein, putative n=6 Tax=Magnoliophyta RepID=B9SJS6_RICCO Length = 540 Score = 114 bits (286), Expect = 5e-24, Method: Composition-based stats. Identities = 38/282 (13%), Positives = 78/282 (27%), Gaps = 21/282 (7%) Query: 156 TANGVPANISVVRSLQNSLARRTAMTAGKRRELHALEENLAIISNSEPAQLLEEERLRKE 215 G AN+ + L + + + +E + S P + +LR Sbjct: 10 RRRGRRANLLAGGEAEQKLPLVPLLPPPLKMSSNDDDEKIVTRSRPTPPIVPARVKLR-S 68 Query: 216 IAELRAKIERVPFIDTFDLRYKNYEKRPDPSSQAVMFCLMDVSGSMDQSTKDMAKRFYIL 275 I A +E +L + P + ++DVS SM+ + K + Sbjct: 69 INNDMAPLEESKLKVMLELTGGDSSSYGRP--GLDLVAVLDVSRSMEGDKMEKMKTAMLF 126 Query: 276 LYLFLSRTYKNVEVVYIR----------HHTQAKEVDEHEFFYSQETGGTIVSSALKLMD 325 + L T + V + +++E E+ G T +++ L+ Sbjct: 127 IIKKLGPTDRLSIVTFSGGANRLCPLRQTTGKSQEEFENLINGLNADGATNITAGLQTAL 186 Query: 326 EVVKERYNPAQWNIYAAQASDGDNWADDSPLCHEILAKKLLPVVRYYSYIEITRRAHQTL 385 +V+K R + + SDG+ A + + + + Sbjct: 187 KVLKGRSFNGERVVGIMLMSDGEQNAGSDATGVSV----GNVPIHTFGFGI---NHEPKG 239 Query: 386 WREYEHLQ-STFDNFAMQHIRDQDDIYPVFRELFHKQNATAK 426 + H + L K Sbjct: 240 LKAIAHNSIGGTFSDVQNIDSLTKAFAQCLAGLLTVVVQDLK 281 >UniRef50_UPI0000D560E4 PREDICTED: similar to inter-alpha (globulin) inhibitor H4 (plasma Kallikrein-sensitive glycoprotein) n=5 Tax=Tribolium castaneum RepID=UPI0000D560E4 Length = 842 Score = 114 bits (286), Expect = 5e-24, Method: Composition-based stats. Identities = 46/456 (10%), Positives = 110/456 (24%), Gaps = 84/456 (18%) Query: 34 ISEAINKRSVTDVDSGESVSIPTEDISEPMFHQGRGGLRHRVHPGNDHFVQNDRIERPQG 93 + E G+S ++ E + Sbjct: 95 LPETAYISGFVMEIDGKSYKAYVKEKEEAKQIY--------------NEAVGRGQAAAHV 140 Query: 94 GGGGSGSGQGQASQDGEGQDEFVFQISKDEYLDLLFEDLAL--------PNLKQNQQRQL 145 S + S + E Q + VF ++ +E L E + P N + + Sbjct: 141 EANARDSNRFTVSLNIEPQKKAVFTLTYEELLQRQNEQYEVVINIHPGQPVKDLNVEVHI 200 Query: 146 TEYKTHRAGYTANGVPAN----ISVVRSLQNSLARRTAMTAGKRRELHALEENLAIISNS 201 E + + + N + + + + +A + + + Sbjct: 201 DESRPLKFVKSPPLRTGNEISKNDDKTASLAEIKQNNSTSATVKFNPNIERQKQLATGLG 260 Query: 202 EPAQLLEEERLRKEIAELRAKIERVPFIDT-FDLRYKNYEKRPDPSSQAVMFCLMDVSGS 260 + + + R + + + + + Q + ++D SGS Sbjct: 261 TKEENGLAGQFVVQYDVERDPKGGEVLLKDGYFVHFFAPSEVEALPKQ--VIFVLDTSGS 318 Query: 261 MDQSTKDMAKRFYILLYLFLSRTYKNVEVVYIR--------------------------- 293 MD + K + L + V + Sbjct: 319 MDGNRIKQLKEAMNSILSELKKEDVFNIVEFSSIVKVWNVDKVQVDYEVGEDPWPLYDSP 378 Query: 294 -----------------HHTQAKEVDEHEFFYSQETGGTIVSSALKLMDEVVKERY--NP 334 + KE + GGT + SAL++ ++VK+ Sbjct: 379 EAPQKNKTNQVLPPAYKATDENKEKAKKVVEKLNAYGGTDIKSALEVGLKLVKKNKENKE 438 Query: 335 AQWNIYAAQASDGDNW-ADDSPLCHEILAKKLLPV---VRYYSYIEITRRAHQTLWREYE 390 +DG+ + + ++ +S A + ++ Sbjct: 439 DAHQPIIVFLTDGEPTMGETNTEKITSAISEMNSGETRAPIFSLSFGDG-ADREFLQKIS 497 Query: 391 HLQSTFDNFAMQHIRDQDDIYPVFRELFHKQNATAK 426 F + D +E + + ++ Sbjct: 498 LKNLGFARHIY----EAADASLQLQEFYKQISSPLL 529 >UniRef50_A9V370 Predicted protein n=1 Tax=Monosiga brevicollis RepID=A9V370_MONBE Length = 471 Score = 114 bits (285), Expect = 6e-24, Method: Composition-based stats. Identities = 31/203 (15%), Positives = 58/203 (28%), Gaps = 18/203 (8%) Query: 235 RYKNYEKRPDPSSQAVMFCLMDVSGSMDQSTKDMAKRFYILLYLFLSRTYKNVEVVYIRH 294 R + P + ++DVSGSM M + L L T + V + Sbjct: 44 RLTAPDFEPVERPAIDLVAVIDVSGSMAGQKLKMVQSTLEFLMRNLKDTDRFALVTFDSD 103 Query: 295 ----------HTQAKEVDEHEFFYSQETGGTIVSSALKLMDEVVKERYNPAQWNIYAAQA 344 T KE + + T +S L E++++R Sbjct: 104 VKTVFDLRPMTTAHKEACLADVQKLRAGSCTNLSGGLFRGVELMQQRGATKGAVSSILLM 163 Query: 345 SDG-DNWADDSPLCHEILAKKL---LPVVRYYSYIEITRRAHQTLWREYEHLQSTFDNFA 400 +DG N + L P Y++ ++ + R+ + F Sbjct: 164 TDGIANEGVRDKDDMCRALRGLMGPAPDYTIYTFGYGK-DHNENMLRQLSETGNGMYYFI 222 Query: 401 MQHIRDQD---DIYPVFRELFHK 420 + + D +F + Sbjct: 223 ESNDIIPESFGDCLGGLLSVFAQ 245 >UniRef50_A0C5K4 Chromosome undetermined scaffold_150, whole genome shotgun sequence n=1 Tax=Paramecium tetraurelia RepID=A0C5K4_PARTE Length = 611 Score = 114 bits (285), Expect = 6e-24, Method: Composition-based stats. Identities = 45/293 (15%), Positives = 90/293 (30%), Gaps = 20/293 (6%) Query: 134 LPNLKQNQQRQLTEYKTHRAGYTANG--VPANISVVRSLQNSLARRTAMTAGKRRELHAL 191 LP LK N + + + + + + R +++ L Sbjct: 76 LPKLKNNLYKDNQSTIQFPDQFQPQMQMLTPQMRMNYKYIQKIPRCDDDEQLIQKQSAKL 135 Query: 192 EENLAIISNSEPAQLLEEERLRKEIAELRAKIERVPFIDTFDLRYKNYEKRPDPSSQAVM 251 + + + + E LR K E +P + + + E + + Sbjct: 136 QNKNKY--DLQSSIAFEINSLRTSCKVSNYKSEYIPAMISIKTKENQTEMTER-TIGIDL 192 Query: 252 FCLMDVSGSMDQSTKDMAKRFYILLYLFLSRTYKNVEVVYIRHH---TQAKEVDEHE--- 305 CL+D S SM +M K+ +LL FL + + + H T K + E Sbjct: 193 ICLIDKSMSMSGDNINMVKKSLLLLLDFLGEQDRLQIITFNEHAQRLTPLKCLTEKNKQY 252 Query: 306 ----FFYSQETGGTIVSSALKLMDEVVKERYNPAQWNIYAAQASDGDNWADDSPLCHEIL 361 G T +SSA + + +KE+ SDG + D+ Sbjct: 253 FQAVISQISAEGLTKISSATYIAFKQLKEKVYRNNVTSV-FLLSDGHD--GDALFEISDQ 309 Query: 362 AKKLLPVVRYYSYIEITRRAHQTLWREYEHLQSTFDNFAMQHIRDQDDIYPVF 414 + + V ++ + +L++ + I D+ + Sbjct: 310 IRHVKEVFTISTFGFG-DDHDAQMMTSISNLKNGNFYYVKD-ITLLDEFFAHA 360 >UniRef50_A9FTM1 Putative uncharacterized protein n=1 Tax=Sorangium cellulosum 'So ce 56' RepID=A9FTM1_SORC5 Length = 535 Score = 114 bits (285), Expect = 6e-24, Method: Composition-based stats. Identities = 38/263 (14%), Positives = 70/263 (26%), Gaps = 24/263 (9%) Query: 173 SLARRTAMTAGKRRELHALEENLAIISNSEPAQLLEEERLRKEIAELRAKIERVPFIDTF 232 + E L E R+ +I E + Sbjct: 133 HVRELLRSGRAPEPWQVRTYEFLNYYRIDYAPPDEGELRVEPQIE---PGEEAGSYALQI 189 Query: 233 DLRYKNYEKRPDPSSQAVMFCLMDVSGSMDQSTKDMAKRFYILLYLFLSRTYKNVEVVYI 292 +R + P P + ++D SGSMD K + LS V + Sbjct: 190 GVRSYDP---PSPRRPIAVTFVLDTSGSMDGEPMAREKATVRAVAASLSEGDVVNMVTWN 246 Query: 293 RH----------HTQAKEVDEHEFFYSQETGGTIVSSALKLMDEVVKERYNPAQWNIYAA 342 +GGT + S L++ ++ +E + + N Sbjct: 247 TQNSVILSGHVVDGPDDPALLAAADALSASGGTDLESGLRVGYQLAQEHFEEGRINRVI- 305 Query: 343 QASD-GDNWADDSPLCHEILAKKL-LPVVRYYSYIEITR-RAHQTLWREYEHLQSTFDNF 399 SD G N S + A+ + + L + Sbjct: 306 LVSDGGANVGVTSEELIALHAEDADQEAIYLVGVGTGPALGYNDVLMDAVTDKGRGAYVY 365 Query: 400 AMQHIRDQDDIYPVFRELFHKQN 422 + D+D+ + +FR+ F + Sbjct: 366 ----LDDEDEAFHMFRDRFAEVM 384 >UniRef50_A9GUK8 Putative uncharacterized protein yfbK n=1 Tax=Sorangium cellulosum 'So ce 56' RepID=A9GUK8_SORC5 Length = 521 Score = 114 bits (285), Expect = 7e-24, Method: Composition-based stats. Identities = 30/259 (11%), Positives = 59/259 (22%), Gaps = 17/259 (6%) Query: 175 ARRTAMTAGKRRELHALEENLAIISNSEPAQLLEEERLRKEIAELRAKIERVPFIDTFDL 234 + + A PA E+ + + + + Sbjct: 56 RQILEDGEIPGPDTLDDVGFFAEHKLDYPAATCGEDVCMHGLLGIMGNMISGSPCTLIQI 115 Query: 235 RYKNY-EKRPDPSSQAVMFCLMDVSGSMDQSTKDMAKRFYILLYLFLSRTYKNVEVVYIR 293 + + + +D SGSM+ + + + L T + V Y Sbjct: 116 GMNSPVDLGALERPPLHLVIAVDTSGSMEGDPIAYVRAGLVEMIDALQPTDRISLVRYSD 175 Query: 294 HH--------TQAKEVDEHEFFYSQETGGTIVSSALKLMDEVVKERYNPAQWNIYAAQAS 345 +E F G T + L + ++ +PA N S Sbjct: 176 AAEVVLEQAEGSDREALTEAFEGLTARGSTNLYEGLFTAYALAEQHLDPAWQNRVIFL-S 234 Query: 346 DG-DNWADDSPLCHEILA---KKLLPVVRYYSYIEITRRAHQTLWREYEHLQSTFDNFAM 401 DG SP LA + + R + + F Sbjct: 235 DGVATAGLTSPQRLVSLAAGYAEKGIGLTAIGVG---AEFDVDAMRGISEVGAGNFYFLE 291 Query: 402 QHIRDQDDIYPVFRELFHK 420 ++ + Sbjct: 292 DPKAVEEVFAEEVKTFLVP 310 >UniRef50_Q7G2L9 Os10g0464500 protein n=3 Tax=Magnoliophyta RepID=Q7G2L9_ORYSJ Length = 719 Score = 114 bits (284), Expect = 7e-24, Method: Composition-based stats. Identities = 31/287 (10%), Positives = 68/287 (23%), Gaps = 37/287 (12%) Query: 168 RSLQNSLARRTAMTAGKRRELHALEENLAIISNSEPAQLLEEERLRKEIAELRAKIERVP 227 R+L + E +++ I E A + + Sbjct: 179 RTLDGGIFDDDEQLDLHPAEDVVGTQDVDSIVADEMAPASVGITTYAAFPAMEESVMVEE 238 Query: 228 FIDTFDLRYKNYEKRPDPSSQAVMFCLMDVSGSMDQSTKDMAKRFYILLYLFLSRTYKNV 287 F L+ + + + ++DVS SM + + KR + L + Sbjct: 239 FAVLIHLKAPSSPATVTSRAPIDLVTVLDVSWSMAGTKLALLKRAMSFVIQALGPGDRLS 298 Query: 288 EVVYIRHHTQA----------KEVDEHEFFYSQETGGTIVSSALKLMDEVVKERYNPAQW 337 V + + ++ GGT ++ AL+ V+++R Sbjct: 299 VVTFSSSARRLFPLRKMTESGRQRALQRVSSLVADGGTNIADALRKAARVMEDRRERNPV 358 Query: 338 NIYAAQASDGDNWADDSPLCHEILAKKL-----------------LPVVRYYSYIEITRR 380 SDG + V+ +++ Sbjct: 359 -CSIVLLSDGRDTYTVPVPRGGGGGGDQPDYAVLVPSSLLPGGGSARHVQVHAFGFGA-D 416 Query: 381 AHQTLWREYEHLQSTFDNFAMQHIRDQDDIYPVFRELFHKQNATAKG 427 + +F D ++ F + Sbjct: 417 HDSPAMHSIAEMSGGTFSFI--------DAAGSIQDAFAQCIGGLLS 455 >UniRef50_A0CDA1 Chromosome undetermined scaffold_17, whole genome shotgun sequence n=1 Tax=Paramecium tetraurelia RepID=A0CDA1_PARTE Length = 604 Score = 114 bits (284), Expect = 7e-24, Method: Composition-based stats. Identities = 29/198 (14%), Positives = 57/198 (28%), Gaps = 22/198 (11%) Query: 237 KNYEKRPDPSSQAVMFCLMDVSGSMDQSTKDMAKRFYILLYLFLSRTYKNVEVVYIRH-- 294 N +++ + C++D SGSM +M K+ +L FL + + + Sbjct: 173 FNMDQQQHSKVGVDLLCVIDRSGSMSGEKIEMVKQTLNILLNFLGPKDRLCLIQFDDTCQ 232 Query: 295 --------HTQAKEVDEHEFFYSQETGGTIVSSALKLMDEVVKERYNPAQWNIYAAQASD 346 + K GGT++ ++ + +K R + SD Sbjct: 233 RLTNLRRVTDENKTYYSDIISKIYANGGTVIGLGTQMALKQIKYRKSVNNVT-AIFVLSD 291 Query: 347 GDNWADDSPLCHEILAKKLLPVVRYYSYIEITRRAHQTLWREYEHLQSTFDNFAMQHIRD 406 G + A S + + +S+ L + +L F Sbjct: 292 GQDEAAIS--SLQKQLAYYKQTLTIHSFGFG-SDHDAKLMTKISNLGKGSFYFVNN---- 344 Query: 407 QDDIYPVFRELFHKQNAT 424 + E F Sbjct: 345 ----ISLLDEFFVDALGA 358 >UniRef50_C0PS55 Putative uncharacterized protein n=1 Tax=Picea sitchensis RepID=C0PS55_PICSI Length = 829 Score = 114 bits (284), Expect = 8e-24, Method: Composition-based stats. Identities = 43/325 (13%), Positives = 84/325 (25%), Gaps = 42/325 (12%) Query: 119 ISKDEYLDLLFEDLALPN--LKQNQQRQLTEYKTHRAGYTANGVPANISVVRSLQNSLAR 176 S + + +D L + Q L + V + R+ N+ Sbjct: 242 QSCSQEPTVYDDDEPLDSNLRPNEGQTSLLTDDDREFEFKGLFVDHEGTSDRAAGNARKM 301 Query: 177 RTAMTAGKRRELHALEENLAIISNSEPAQLLEEERLRKEIAELRAKIERVPFIDTFDLRY 236 R A+ E A E + + K+ + V Sbjct: 302 RIAL--YPEVEAVAAGEACENFTVLVHVKAPSASEASKKQNYEDCEGNMV---------- 349 Query: 237 KNYEKRPDPSSQAVMFCLMDVSGSMDQSTKDMAKRFYILLYLFLSRTYKNVEVVYIRHHT 296 K P + + ++DVSGSM + + KR + LS + VV+ Sbjct: 350 ----KDPGCRAPIDLVTVLDVSGSMSGTKLALLKRAMAFVISNLSPEDRLSVVVFSSTAK 405 Query: 297 QAKEVDEHEFFYSQET----------GGTIVSSALKLMDEVVKERYNPAQWNIYAAQASD 346 + + + GGT ++ L+ +V+++R SD Sbjct: 406 RVFSLKRMTPDGQRAANRVVERLLCTGGTNIAEGLRKGAKVLEDRRQRNPVAS-IMLLSD 464 Query: 347 GDNW-----------ADDSPLCHEILAKKLLP-VVRYYSYIEITRRAHQTLWREYEHLQS 394 G + D + + + +++ + Sbjct: 465 GQDTYSLSSRGVVLFPSDEQRRSARQSTRYGHVQIPVHAFGFGV-DHDAATMHAISEVSG 523 Query: 395 TFDNFAMQHIRDQDDIYPVFRELFH 419 +F QD L Sbjct: 524 GTFSFIQAESLVQDAFAQCIGGLLS 548 >UniRef50_Q47YR5 Von Willebrand factor type A domain protein n=2 Tax=cellular organisms RepID=Q47YR5_COLP3 Length = 786 Score = 114 bits (284), Expect = 8e-24, Method: Composition-based stats. Identities = 27/204 (13%), Positives = 55/204 (26%), Gaps = 19/204 (9%) Query: 234 LRYKNYEKRPDPSSQAVMFCLMDVSGSMDQSTKDMAKRFYILLYLFLSRTYKNVEVVYIR 293 L + EK + ++D SGSM + + AK L L L+ + + Sbjct: 379 LTFFPPEKAVAQVIARDIIFIIDTSGSMQAGSMEQAKSSLQLALLQLNNKDSFNIIAFDN 438 Query: 294 -----------HHTQAKEVDEHEFFYSQETGGTIVSSALKLMDEVVKERYNPAQWNIYAA 342 + GGT + L + K++ ++ Sbjct: 439 DTELLFPVTHMASAHNISKAQQFIDGLSANGGTEMYRPLSNALMMKKDKTQSSKAIRQIV 498 Query: 343 QASDGDNWADDSPLCHEILAKKLLPVVRYYSYIEITRRAHQTLWREYEHLQSTFDNFAMQ 402 +DG A + L R Y+ + ++ F Sbjct: 499 FITDG---AVANEFELMQLLNTAQGDFRLYTVGIGA-APNGYFMKKAAQFGRGSYVFI-- 552 Query: 403 HIRDQDDIYPVFRELFHKQNATAK 426 +++ ++ K + A Sbjct: 553 --QNKSEVQRKMSHFMTKISQPAL 574 >UniRef50_A7RKA1 Predicted protein n=2 Tax=Nematostella vectensis RepID=A7RKA1_NEMVE Length = 1128 Score = 114 bits (284), Expect = 8e-24, Method: Composition-based stats. Identities = 26/222 (11%), Positives = 57/222 (25%), Gaps = 35/222 (15%) Query: 229 IDTFDLRYKNYEKRPDPSSQAVMFCLMDVSGSMDQSTKDMAKRFYILLYLFLSRTYKNVE 288 ++D RY+ + + ++D SGSM S +AK + L+ + Sbjct: 191 CSSYDPRYRPWYVEAASPQPKDVILVVDYSGSMGGSRLPIAKEAAKTVLDTLNPRDRVAF 250 Query: 289 VVY------------------------IRHHTQAKEVDEHEFFYSQETGGTIVSSALKLM 324 + + + ++ + +GGT+ + A Sbjct: 251 LAFESGVRRVKVTSGDAKDEKCFESSLAKASPVNIDILKKFLDGEYASGGTMYAIAFNAA 310 Query: 325 DEVVKERYNPAQWNI--YAAQASDGDNWADDSPLCHEILAKKLLPVVR------YYSYIE 376 +++ + Y +DG D P K + + Sbjct: 311 FDILDKYYKEKNTTRRPVILFMTDGAPN--DDPGTILNTVKTRNQGLSTKADILTFGMGG 368 Query: 377 ITRRAHQTLWREYEHLQ-STFDNFAMQHIRDQDDIYPVFREL 417 A L + F + D+ + Sbjct: 369 GISPAGVDLLQSLAEQTLDGGARFEVSLTTALRDVSRHLLAV 410 >UniRef50_A3QDW1 Vault protein inter-alpha-trypsin domain protein n=1 Tax=Shewanella loihica PV-4 RepID=A3QDW1_SHELP Length = 776 Score = 113 bits (283), Expect = 1e-23, Method: Composition-based stats. Identities = 36/395 (9%), Positives = 85/395 (21%), Gaps = 23/395 (5%) Query: 48 SGESVSIPTEDISEPMFHQGRGGLRHRVHPGNDHFVQNDRIERPQGGGGGSGSGQGQASQ 107 GE++ + + + G LR +F + + Q Sbjct: 203 PGEALVVEIAYQQQVRYQDGEFSLRFPTAITPRYFPKGQVPDLEQASVNDIQGLNVLNES 262 Query: 108 DGEGQDEFVFQISKDEYLDLLFEDLALPNLKQNQQRQLTEYKTHRAGYTANGVPANISVV 167 + + + D + + L P + Q + + Sbjct: 263 TQSDEQKLYLDVVLDAGMAIS--RLETPYHQMRQTQLGGTKIGLNLTANLRPDRDFLLKW 320 Query: 168 RSLQNSLARRTAMTAGKRRELHALEENLAIISNSEPAQLLEEERLRKEIAELRAKIERVP 227 R L + E S + ++ ++ Sbjct: 321 RPLLAEQPSAVMFAQLGKTHEFKTHEFKNEESLASSQAHNDQVVSEANHPAEAQASDKEA 380 Query: 228 FIDTFDLRYKNYEKRPDPSSQAVMFCLMDVSGSMDQSTKDMAKRFYILLYLFLSRTYKNV 287 + + + + ++D SGSM + AK + L Sbjct: 381 KDSYALVMLMPPQDKARVRLPRELTLVIDTSGSMTGDSIAQAKSAILNALAGLGSQDTFN 440 Query: 288 EVVYIRHHTQAKEVDEHEF-----------FYSQETGGTIVSSALKLMDEVVK------E 330 + + V + GGT ++ AL + Sbjct: 441 VIAFDSSVRSLSPVALSATAANLGKANLFVQSLEADGGTEMAPALLRALSQPESGVSSIS 500 Query: 331 RYNPAQWNIYAAQASDGDNWADDSPLCHEILAKKLLPVVRYYSYIEITRRAHQTLWREYE 390 + +DG A + L + R ++ + Sbjct: 501 SAVKPERLKQVVFITDG---AVGNEASLFALIAANIGRQRLFTVGIGA-APNGYFMERAA 556 Query: 391 HLQSTFDNFAMQHIRDQDDIYPVFRELFHKQNATA 425 + + I + ++ Q + Sbjct: 557 RAGRGTYTYVGKISEVDAKIGELLEKIESPQISDV 591 >UniRef50_Q67LZ3 Putative uncharacterized protein n=1 Tax=Symbiobacterium thermophilum RepID=Q67LZ3_SYMTH Length = 414 Score = 113 bits (283), Expect = 1e-23, Method: Composition-based stats. Identities = 23/196 (11%), Positives = 46/196 (23%), Gaps = 16/196 (8%) Query: 241 KRPDPSSQAVMFCLMDVSGSMDQSTKDMAKRFYILLYLFLSRTYKNVEVV--------YI 292 P+ + ++D SGSM + K+ L ++ + V + Sbjct: 35 PAPEGRPPLNLAAVVDRSGSMAGAALYFTKQALRFLVDQMAEEDRLAIVTYDDQVHVPFP 94 Query: 293 RHHTQAKEVDEHEFFYSQETGGTIVSSALKLMDEVVKERYNPAQWNIYAAQASDG-DNWA 351 K+ G T +S L + ++ P + + +DG N Sbjct: 95 SQPVVQKDAVRLLVDGITAGGTTNLSGGLATGMQQIRPHAGPGRVSRV-LLMTDGLANVG 153 Query: 352 DDSPLCHEILAKKLLPV---VRYYSYIEITRRAHQTLWREYEHLQSTFDNFAMQHIRDQD 408 P A+ V + L ++ + Sbjct: 154 VTDPDVLAGWARAWREKGLAVSTMGVG---PHFSEDLLVALAEAGGGNFHYIANPDQIPR 210 Query: 409 DIYPVFRELFHKQNAT 424 L Sbjct: 211 IFQEELHGLLQVAVQG 226 >UniRef50_A2AR69 Novel protein similar to vertebrate inter-alpha (Globulin) inhibitor H family (Plasma Kallikrein-sensitive glycoprotein) (ITIH) (Fragment) n=10 Tax=Clupeocephala RepID=A2AR69_DANRE Length = 860 Score = 113 bits (282), Expect = 1e-23, Method: Composition-based stats. Identities = 41/346 (11%), Positives = 90/346 (26%), Gaps = 36/346 (10%) Query: 110 EGQDEFVFQISKDEYLDLLFEDLALPNLKQNQQRQLTEYKTHRAGYTANGVPANISVVRS 169 F +S +E L L +L + + + G+ ++V Sbjct: 147 PPGARVSFSLSYEELLSRRLGRYEL-SLGLRPGQPVQNLSLEVSISERTGISFIRALVLF 205 Query: 170 LQNSLARRTAMTAGKRRELHALEENLAIISNSEPAQLLEEERLRKEIAELRAKIERVPFI 229 L L A A + + Q + A+ + + Sbjct: 206 LF--LIDLLADAEAPPSTKVKQNAYCAHVRYTPSIQQQRNVSPKGLSADFIIQYDVELKD 263 Query: 230 DTFDLRYKN------YEKRPDPSSQAVMFCLMDVSGSMDQSTKDMAKRFYILLYLFLSRT 283 D++ + + R P + ++D+SGSM + K + + L Sbjct: 264 PMGDIQVDDGYFVHYFAPRGLPVVPKDVIFVIDISGSMIGTKIKQTKAAMVSILSDLREG 323 Query: 284 YKNVEVVYIRHHTQAK------------EVDEHEFFYSQETGGTIVSSALKLMDEVVK-- 329 + + K + G T +++AL +++ Sbjct: 324 DYFNLITFSDDVHTWKKDRTVRATRQNVRDAKEFVRKIIAAGWTNINAALLSAAKLLNPS 383 Query: 330 -------ERYNPAQWNIYAAQASDGDNW-ADDSPLCHEILAKKLLPVVRYYSYIEITRRA 381 R +Q +DG+ + A+K L +V + A Sbjct: 384 TRSSSSTGRAPSSQRVPMIIFLTDGEATIGETETDVILHNAQKSLGLVSLFGLAFG-DDA 442 Query: 382 HQTLWREYEHLQSTFDNFAMQHIRDQDDIYPVFRELFHKQNATAKG 427 + R + + + DD + + + Sbjct: 443 DFPMLRRLALENRG----VARMVYEDDDAAIQLKGFYDEVATPLLS 484 >UniRef50_A6BYV9 Putative uncharacterized protein n=1 Tax=Planctomyces maris DSM 8797 RepID=A6BYV9_9PLAN Length = 1197 Score = 113 bits (282), Expect = 1e-23, Method: Composition-based stats. Identities = 53/411 (12%), Positives = 103/411 (25%), Gaps = 43/411 (10%) Query: 24 RRYKAQIKQSISEA-INKRSVTDVDSG--ESVSIP--TEDISEPMFHQG-----RGGLRH 73 R+ + I + + + ++ D + P G + Sbjct: 791 RKGREAIANLFPDFPLRQLNIKDPFAKPIRVAEAPGEIPLADRWRLILGVKGCSTPKSQQ 850 Query: 74 RVHPGNDHFVQNDRIERPQGGG--GGSGSGQGQASQDGEGQDEFVFQISKDEYLDLLFED 131 + + ++R R G G + A E + KD ++L E Sbjct: 851 VAGTLDQLYGGSEREGRGLQGDLASDRGGTEAAAPSVREWISDVERLFGKDVCEEVLGEA 910 Query: 132 LA------LPNLKQNQQRQLTEYKTHRAGYTANGVPANISVVRSLQNSLARRTAMTAGKR 185 L +L R E + ++R L ++ R A R Sbjct: 911 AVNGRAAVLEHLNHATVRPSVELLEQVLSLRGALSERELGLLRKLARNITERMAKQLANR 970 Query: 186 RELHALEENLAIISNSEPAQLLEEERLRKEIAELRAKIERVPFIDTFDLRYKNYEKRPDP 245 ++A + +L L + K + I L Y+ KR Sbjct: 971 LRPALHGLSIARPTRRRSPRLDFARTLNSNLHTAYRKSDGRISIAPTRLVYRLPAKRQM- 1029 Query: 246 SSQAVMFCLMDVSGSMDQSTKDMAKRFYILLYLFLSRTYKNVEVVYIRHHTQAKEVDEHE 305 + ++DVSGSM+ S ++ L V + TQ + Sbjct: 1030 --DWHLIFVVDVSGSMEASVIYS--SMMAAIFSALPAID----VKFFAFSTQVIDFTGRV 1081 Query: 306 FF------YSQETGGTIVSSALKLMDEVVKERYNPAQWNIYAAQASDGDNWADDSPLCHE 359 Q GGT + L+ E + +D + P Sbjct: 1082 EDPLSLLMEIQIGGGTHIGLGLRAARESITNPSRT-----LVVLVTDFE-EGVSVPELLS 1135 Query: 360 ILAKKLLPVVRYYSYIEITRR----AHQTLWREYEHLQSTFDNFAMQHIRD 406 + + + H + + + + Sbjct: 1136 EVVMLSSSGAKLIGLAALNDEAKPRYHAGTAAAVVQAGMPVAAVSPERLAE 1186 >UniRef50_UPI0001760CA2 PREDICTED: inter-alpha (globulin) inhibitor H5-like n=1 Tax=Danio rerio RepID=UPI0001760CA2 Length = 1157 Score = 113 bits (282), Expect = 1e-23, Method: Composition-based stats. Identities = 43/351 (12%), Positives = 93/351 (26%), Gaps = 43/351 (12%) Query: 110 EGQDEFVFQISKDEYLDLLFEDLALPNLKQNQQRQLTEYKTHRAGYTANGVPANISVVRS 169 F +S +E L L +L + + + G IS +R+ Sbjct: 147 PPGARVSFSLSYEELLSRRLGRYEL-SLGLRPGQPVQNLSLEVSISERTG----ISFIRA 201 Query: 170 LQNSLARRTAMTA-----GKRRELHALEENLAIISNSEPAQLLEEERLRKEIAELRAKIE 224 L +R + T A + + Q + A+ + + Sbjct: 202 LPLRTSRLLSNTVQADAEAPPSTKVKQNAYCAHVRYTPSIQQQRNVSPKGLSADFIIQYD 261 Query: 225 RVPFIDTFDLRYKN------YEKRPDPSSQAVMFCLMDVSGSMDQSTKDMAKRFYILLYL 278 D++ + + R P + ++D+SGSM + K + + Sbjct: 262 VELKDPMGDIQVDDGYFVHYFAPRGLPVVPKDVIFVIDISGSMIGTKIKQTKAAMVSILS 321 Query: 279 FLSRTYKNVEVVYIRHHTQAK------------EVDEHEFFYSQETGGTIVSSALKLMDE 326 L + + K + G T +++AL + Sbjct: 322 DLREGDYFNLITFSDDVHTWKKDRTVRATRQNVRDAKEFVRKIIAAGWTNINAALLSAAK 381 Query: 327 VVK---------ERYNPAQWNIYAAQASDGDNW-ADDSPLCHEILAKKLLPVVRYYSYIE 376 ++ R +Q +DG+ + A+K L +V + Sbjct: 382 LLNPSTRSSSSTGRAPSSQRVPMIIFLTDGEATIGETETDVILHNAQKSLGLVSLFGLAF 441 Query: 377 ITRRAHQTLWREYEHLQSTFDNFAMQHIRDQDDIYPVFRELFHKQNATAKG 427 A + R + + + DD + + + Sbjct: 442 G-DDADFPMLRRLALENRG----VARMVYEDDDAAIQLKGFYDEVATPLLS 487 >UniRef50_C5WZE3 Putative uncharacterized protein Sb01g019910 n=1 Tax=Sorghum bicolor RepID=C5WZE3_SORBI Length = 704 Score = 113 bits (281), Expect = 2e-23, Method: Composition-based stats. Identities = 29/211 (13%), Positives = 54/211 (25%), Gaps = 29/211 (13%) Query: 228 FIDTFDLRYKNYEKRPDPSSQAVMFCLMDVSGSMDQSTKDMAKRFYILLYLFLSRTYKNV 287 F LR + + + ++DVS SM + KR + L + + Sbjct: 216 FAILIHLRVPTWV---RTRAPLDLVTVLDVSRSMSGPKLALLKRAMRFVIENLEPSDRLS 272 Query: 288 EVVYIRHH----------TQAKEVDEHEFFYSQETGGTIVSSALKLMDEVVKERYNPAQW 337 V + ++ + GGT ++ L+ VV++R Sbjct: 273 VVAFSSSACRLFPLRKMTAFGQQQSQQAVDSLVADGGTNIAEGLRKAARVVEDRQARNPV 332 Query: 338 NIYAAQASDG------------DNWADDSPLCHEILAKKLLPVVRYYSYIEITR---RAH 382 SDG D +PL + V +++ Sbjct: 333 -CSIILLSDGVDSHNLPPRDGSAPEPDYAPLVPRSILPGSEHHVPIHAFGLGMDHDHDHD 391 Query: 383 QTLWREYEHLQSTFDNFAMQHIRDQDDIYPV 413 + S +F D Sbjct: 392 SRAMHAVAQMSSGTFSFIDMVGSSIQDALAQ 422 >UniRef50_Q4SBF6 Chromosome 11 SCAF14674, whole genome shotgun sequence. (Fragment) n=16 Tax=Euteleostomi RepID=Q4SBF6_TETNG Length = 1039 Score = 113 bits (281), Expect = 2e-23, Method: Composition-based stats. Identities = 32/309 (10%), Positives = 87/309 (28%), Gaps = 28/309 (9%) Query: 137 LKQNQQRQLTEYKTHRAGYTANGVPANI--SVVRSLQNSLARRTAMTAGKRRELHALEEN 194 +L + + + + R+ ++ R T + +E+ Sbjct: 323 FAPKDLPRLPKNVVFVIDMSGSMSGTKMQQEAHRAARSLQKRSTDGGTARISFSPTIEQQ 382 Query: 195 LAIISNSEPAQLLEEERLRKEIAELRAKIERVPFIDTFDLRYKNYEKRPDPSSQAVMFCL 254 L + + R I + + + P + + Sbjct: 383 RKCPDC---PGTLIDGDFIIKYDVNRENDLGDIQIANGYFVHF-FAPKDLPRLPKNVVFV 438 Query: 255 MDVSGSMDQSTKDMAKRFYILLYLFLSRTYKNVEVVY---IRH-HTQAKEVDEHEFFY-- 308 +D+SGSM + + + + L +++ I+ +T + + Sbjct: 439 IDMSGSMSGTKMQQTREAMLKILEDLDPEDHFGIILFDHRIQFWNTSLSKATKENIDEAM 498 Query: 309 -----SQETGGTIVSSALKLMDEVVKERYNPAQWNI----YAAQASDGDNWADDS--PLC 357 Q GGT +++ + +++KE + +DGD + +S P+ Sbjct: 499 VYVKAIQSYGGTDINAPVLKAVDMLKEDRKAKRLPEKSIDMIILLTDGDPNSGESRIPVI 558 Query: 358 HEILAKKLLPVVRYYSYIEITRRAHQTLWREYEHLQSTFDNFAMQHIRDQDDIYPVFREL 417 E + + + +S + + I + D + Sbjct: 559 QENVKAAIGGQMSLFSLGFGN-DVKYPFLDVMSRENNG----LARRIYEGSDAALQLQGF 613 Query: 418 FHKQNATAK 426 + + ++ Sbjct: 614 YDEVSSPLL 622 >UniRef50_A7RTF3 Predicted protein (Fragment) n=1 Tax=Nematostella vectensis RepID=A7RTF3_NEMVE Length = 756 Score = 113 bits (281), Expect = 2e-23, Method: Composition-based stats. Identities = 34/302 (11%), Positives = 70/302 (23%), Gaps = 29/302 (9%) Query: 133 ALPNLKQNQQRQLTEYKTHRAGYTANGVPANISVVRSLQNSLARRTAMTAGKRRELHALE 192 + + + L + + T+ N+ + + R A + ++ Sbjct: 166 RVDSAYEFDFELLVQSASEIQEITSPHSKLNVVISSEDKCQATVRLAE---PFKFDVDVK 222 Query: 193 ENLAIISNSEPAQLLEEERLRKEIAELRAKIERVPFIDTFDLRYKNYEKRPDPSSQAVMF 252 + P E I + + V D + + Sbjct: 223 VMILNRDPFLPQATFENGVTGSNITQDFLEKPLVTLNFMPDFG------KQEALETGEFI 276 Query: 253 CLMDVSGSMDQSTKDMAKRFYILLYLFLSRTYKNVEVVYIRH-----------HTQAKEV 301 ++D SGSM A+ L L V + + Sbjct: 277 FVIDRSGSMSGDRIKNARETLFLFLKSLPEHCHFNVVGFGSSYEKLFSSSTKYSDSSVNK 336 Query: 302 DEHEFFYSQET-GGTIVSSALKLMDEVVKERYNPAQWNIYAAQASDGDNWADDSPLCHEI 360 + + GGT + LK + +DG+ + Sbjct: 337 ACNHAKNLEANLGGTEILEPLKYVFSQ----PVIKGSPRQVFLMTDGE---VGNTQQVIT 389 Query: 361 LAKKLLPVVRYYSYIEITRRAHQTLWREYEHLQSTFDNFAMQHIRDQDDIYPVFRELFHK 420 L KK R +++ + L + F + Q + R Sbjct: 390 LVKKNSTHARCFTFGIGQGAS-TALIKGVARAGQGTAEFITSSHQMQAKVVKTLRNALQP 448 Query: 421 QN 422 Sbjct: 449 SM 450 >UniRef50_C9LTL4 Magnesium-chelatase, subunit D/I family n=1 Tax=Selenomonas sputigena ATCC 35185 RepID=C9LTL4_9FIRM Length = 657 Score = 112 bits (280), Expect = 2e-23, Method: Composition-based stats. Identities = 46/380 (12%), Positives = 94/380 (24%), Gaps = 18/380 (4%) Query: 37 AINKRSVTDVDSGESVSIPTEDISEPMFHQGRGGLRHRVHPGNDHFVQNDRIERPQGGGG 96 + +G +P + +F R + Q + E Sbjct: 256 LVEAARALAALAGRIYVMPQDIEEAALFVLAHRMSRKKEQREESSRRQREPQESQAEPQE 315 Query: 97 GSGSGQGQASQDGEGQDEFVFQISKDEYLDLLFEDLALPNLKQNQQRQLTEYKTHRAGYT 156 S A Q+ D + E + + + + +G Sbjct: 316 ESEDEADDAPQEERPDDVLTRDATGGESKEQEDDRGESQEKEAQADEEQASPADGDSGGE 375 Query: 157 ANGVPANISVVRSLQNSLARRT---AMTAGKRRELHALEENLAIISNSEPAQLLEEERLR 213 +I V + + L +G+R + S P + Sbjct: 376 DRDETHSIEAVMARLSLLRETVCVRKGKSGRRAIVQLDVPAGRPWRTSLPRTGRRIDLAF 435 Query: 214 KEIAELRAKIERVPFIDT-FDLRYKNYEK-RPDPSSQAVMFCLMDVSGSMDQ-STKDMAK 270 A +R + +R ++ + A + L+D SGSM M K Sbjct: 436 AATLRAAAPYQRQRHGEQAVVIRAEDLRVWIRARRASANILFLVDASGSMGAKERMKMVK 495 Query: 271 RFYILLY-LFLSRTYKNVEVVYIRHH-------TQAKEVDEHEFFYSQETGGTIVSSALK 322 + L + + + + R T++ E+ E G T ++ L Sbjct: 496 GAVLALLREAYQKRDRVGLIAFRRTSAETLLPMTRSVELAEKALRSLPTGGKTPLAEGLA 555 Query: 323 LMDEVVKERYNPAQWNIYAAQASDGDNW---ADDSPLCHEILAKKL-LPVVRYYSYIEIT 378 +++ E +DG A + A+++ Sbjct: 556 AALKMMDELSRKEGAETVLVLVTDGRTNVSAAGKAKEEALRAAEEIARRDAHCIVLDTEK 615 Query: 379 RRAHQTLWREYEHLQSTFDN 398 L E + Sbjct: 616 NFPKVGLAPEIAQRMNAGYA 635 >UniRef50_A1ZUW0 Von Willebrand factor, type A n=1 Tax=Microscilla marina ATCC 23134 RepID=A1ZUW0_9SPHI Length = 425 Score = 112 bits (280), Expect = 2e-23, Method: Composition-based stats. Identities = 26/191 (13%), Positives = 47/191 (24%), Gaps = 13/191 (6%) Query: 240 EKRPDPSSQAVMFCLMDVSGSMDQSTKDMAKRFYILLYLFLSRTYKNVEVVY-------- 291 + ++D SGSM + K+ + L V Y Sbjct: 36 APEKQERIPLNISLVVDRSGSMSGDKLNYVKKAVDFVIDNLKSDDVLSIVQYDDEIDVVA 95 Query: 292 IRHHTQAKEVDEHEFFYSQETGGTIVSSALKLMDEVVKERYNPAQWNIYAAQASDG-DNW 350 K+ + Q T +S + VK + N SDG N Sbjct: 96 SSAKVTNKKALHEKVKGIQARNMTNLSGGMMEGYAQVKSTQSNGYVNRV-LLLSDGLANA 154 Query: 351 ADDSPLCHEILAKKL--LPVVRYYSYIEITRRAHQTLWREYEHLQSTFDNFAMQHIRDQD 408 +P + +A+K + ++ ++ L F + Sbjct: 155 GITAPEQLQQIAQKKFREAGIALSTFGVG-SDFNEVLMTNLSEYGGANYYFIDMPDKIPQ 213 Query: 409 DIYPVFRELFH 419 L Sbjct: 214 IFAQELEGLLS 224 >UniRef50_UPI0001926ED6 PREDICTED: similar to inter-alpha trypsin inhibitor, heavy chain 3, partial n=1 Tax=Hydra magnipapillata RepID=UPI0001926ED6 Length = 464 Score = 112 bits (279), Expect = 3e-23, Method: Composition-based stats. Identities = 22/206 (10%), Positives = 45/206 (21%), Gaps = 22/206 (10%) Query: 240 EKRPDPSSQAVMFCLMDVSGSMDQSTKDMAKRFYILLYLFLSRTYKNVEVVYIRH----- 294 EK + + ++D SGSM + K+ + L+ + + + Sbjct: 42 EKEHRKRAPIDLVVVIDKSGSMAGEKLALVKKTLEFVVSQLNEKDRLCLITFDTSVYLDF 101 Query: 295 -----HTQAKEVDEHEFFYSQETGGTIVSSALKLMD-EVVKERYNPAQWNIYAAQASDGD 348 K T + L EV+ +DG Sbjct: 102 KLTPMTPMNKYQTLKIIKDISPGSMTNLCGGLMKGLCEVIDRADEEKNEVASVLLFTDGF 161 Query: 349 NW--ADDSPLCHEILAKKLLPVV--------RYYSYIEITRRAHQTLWREYEHLQSTFDN 398 + C K + Y++ + + +E S Sbjct: 162 ANKGGLTNIYCSSSQTAKYTIGIVGPKTADASIYTFGFG-SNHNAQMLKEISDAGSGMYY 220 Query: 399 FAMQHIRDQDDIYPVFRELFHKQNAT 424 + + L Sbjct: 221 YIENVDMIAEAFGQCLGGLLSTVAQG 246 >UniRef50_A7RNW3 Predicted protein n=3 Tax=Nematostella vectensis RepID=A7RNW3_NEMVE Length = 798 Score = 112 bits (279), Expect = 3e-23, Method: Composition-based stats. Identities = 33/284 (11%), Positives = 70/284 (24%), Gaps = 30/284 (10%) Query: 155 YTANGVPANISVVRSLQNSLARRTAMTAGKRRELHALEENLAI-ISNSEPAQLLEEERLR 213 + + + V ++ RR K + + + A + L Sbjct: 201 FQPDPSDNCHASVTLAESHTFRRDVEIQIKSEDPFVAHALVEPGLPRPSDAPEDKTRGLA 260 Query: 214 KEIAELRAKIERVPFIDTFDLRYKNYEKRPDPSSQAVMFCLMDVSGSMDQSTKDMAKRFY 273 L+ + V F+ + D + ++D SGSM S A R Sbjct: 261 ISTEFLQKPVAMVNFV---------PAFKADDLTCGEFIFVVDRSGSMSGSRIKDAARTL 311 Query: 274 ILLYLFLSRTYKNVEVVYIR-----------HHTQAKEVDEHEFFYSQET-GGTIVSSAL 321 L L V + ++ + + + + GGT + L Sbjct: 312 QLFLKSLPDGCYFNIVGFGSSYKTLFSKSKTYNDETLKTATNHAAHLAADLGGTEILEPL 371 Query: 322 KLMDEVVKERYNPAQWNIYAAQASDGDNWADDSPLCHEILAKKLLPVVRYYSYIEITRRA 381 + + +DG+ + L + R +S+ + Sbjct: 372 RWVYSQ----SLIEGAPRQLFLLTDGE---VGNTAQVISLVAENASTARVFSFGIGDGAS 424 Query: 382 HQTLWREYEHLQSTFDNFAMQHIRDQDDIYPVFRELFHKQNATA 425 L + F + Q + + Sbjct: 425 -TELIKGVARAGHGSAEFVRGQDKLQVKVIKTLKRALQPALTDV 467 >UniRef50_UPI00006CAF43 von Willebrand factor type A domain containing protein n=1 Tax=Tetrahymena thermophila RepID=UPI00006CAF43 Length = 631 Score = 111 bits (278), Expect = 4e-23, Method: Composition-based stats. Identities = 25/189 (13%), Positives = 54/189 (28%), Gaps = 19/189 (10%) Query: 228 FIDTFDLRYKNYEKRPDPSSQAVMFCLMDVSGSMDQSTKDMAKRFYILLYLFLSRTYKNV 287 F+ +L K + + C++D SGSM + ++ L ++ + Sbjct: 123 FVCGVNLHVKQPK-EQSERVPMDLICVIDDSGSMSGKKAQLVRKSLKYLLKIMNENDRIC 181 Query: 288 EVVYIR----------HHTQAKEVDEHEFFYSQETGGTIVSSALKLMDEVVKERYNPAQW 337 + + ++ + K + G T + + ++ ++K R Sbjct: 182 LISFDSVEKILTPFLRNNLENKSELKKAIKNIVGRGSTNIEAGMEAGLWMIKNRKEKNPI 241 Query: 338 NIYAAQASDGDNWADDSPLCHEILAKKLLPV----VRYYSYIEITRRAHQTLWREYEHLQ 393 SDG + + L + L + V Y Y T R Sbjct: 242 TC-MFLLSDGQDDSPQVDLRVQKLIQSYDIQDTFIVNTYGYG---ADHDATQMRNIAETH 297 Query: 394 STFDNFAMQ 402 + Sbjct: 298 KGGYYYIED 306 >UniRef50_A0DZ93 Chromosome undetermined scaffold_7, whole genome shotgun sequence n=1 Tax=Paramecium tetraurelia RepID=A0DZ93_PARTE Length = 522 Score = 111 bits (278), Expect = 4e-23, Method: Composition-based stats. Identities = 24/179 (13%), Positives = 57/179 (31%), Gaps = 12/179 (6%) Query: 246 SSQAVMFCLMDVSGSMDQSTKDMAKRFYILLYLFLSRTYKNVEVVYIRHH---TQAKEVD 302 + CL+D SGSM + K+ L L + + + + T+ Sbjct: 109 RQGIDLICLIDHSGSMSGEKMHLVKKSLKHLLKMLQPNDRLCLIEFDDQNYRLTRLMRAT 168 Query: 303 EH-------EFFYSQETGGTIVSSALKLMDEVVKERYNPAQWNIYAAQASDGDNWADDSP 355 + + G T + +A+K+ ++K R SDG++ Sbjct: 169 QENMYKFLIAIDTIEANGATDIGNAMKMALSILKHRRFKNPIAS-IFLLSDGEDEGAAGR 227 Query: 356 LCHEILAKKLLPVVRYYSYIEITRRAHQTLWREYEHLQSTFDNFAMQHIRDQDDIYPVF 414 + ++I +K + ++ + E H + + + + + + Sbjct: 228 VWNDIQSKNIKEPFTINTFGFGR-DCCPKIMSEIAHFKEGQFYYISEISKIDECFFEAL 285 >UniRef50_Q55874 Uncharacterized protein sll0103 n=16 Tax=Cyanobacteria RepID=Y103_SYNY3 Length = 420 Score = 111 bits (278), Expect = 4e-23, Method: Composition-based stats. Identities = 29/191 (15%), Positives = 56/191 (29%), Gaps = 19/191 (9%) Query: 240 EKRPDPSSQAVMFCLMDVSGSMDQSTKDMAKRFYILLYLFLSRTYKNVEVVYIRHHTQAK 299 D + ++D SGSMD + K + L L + + + Sbjct: 33 ADDHDRRLPLNLCLVLDHSGSMDGQPLETVKSAALGLIDRLEEDDRLSVIAFDHRAKIVI 92 Query: 300 E--------VDEHEFFYSQETGGTIVSSALKLMDEVVKERYNPAQWNIYAAQASDGDNWA 351 E + GGT + LKL + + + +DG+N Sbjct: 93 ENQQVRNGAAIAKAIERLKAEGGTAIDEGLKLGIQ--EAAKGKEDRVSHIFLLTDGENEH 150 Query: 352 DDSPLCHE--ILAKKLLPVVRYYSYIEITRRAHQTLWREYEHLQSTFDNFAMQHIRDQDD 409 D+ C + +A V + +Q + ++ + + Sbjct: 151 GDNDRCLKLGTVASDYKLTVHTLGFG---DHWNQDVLEAIAASAQGSLSYI----ENPSE 203 Query: 410 IYPVFRELFHK 420 FR+LF + Sbjct: 204 ALHTFRQLFQR 214 >UniRef50_A5UYN7 von Willebrand factor, type A n=1 Tax=Roseiflexus sp. RS-1 RepID=A5UYN7_ROSS1 Length = 459 Score = 111 bits (277), Expect = 5e-23, Method: Composition-based stats. Identities = 24/188 (12%), Positives = 52/188 (27%), Gaps = 20/188 (10%) Query: 241 KRPDPSSQAVMFCLMDVSGSMDQSTKDMAKRFYILLYLFLSRTYKNVEVVYI-------- 292 R + ++DVSGSM + AK FL V + Sbjct: 83 PREQHRPPLHLVAVLDVSGSMSGTKLASAKEALRQALHFLQDGDVFSLVTFSDQVQTHLK 142 Query: 293 --RHHTQAKEVDEHEFFYSQETGGTIVSSALKLMDEVVKERYNPAQWNIYAAQASDGD-N 349 + + ++ E+ + +G T + L ++ +++ SDG N Sbjct: 143 AESYAQRKRDKMENLLDEIRASGMTALDGGLAQGIDLGQKKRQA---TTLVLLLSDGQAN 199 Query: 350 WADDSPLCH---EILAKKLLPVVRYYSYIEITRRAHQTLWREYEHLQSTFDNFAMQHIRD 406 + A++ +V ++ L E + + + Sbjct: 200 VGETDLEKIGLRAQKARQSGLIVSTLGVGL---DYNEALMVEIANQGGGRFYHIQEGSQI 256 Query: 407 QDDIYPVF 414 + Sbjct: 257 PAALMQEL 264 >UniRef50_B0SHY6 Anti-sigma factor antagonist n=2 Tax=Leptospira biflexa serovar Patoc RepID=B0SHY6_LEPBA Length = 550 Score = 111 bits (277), Expect = 5e-23, Method: Composition-based stats. Identities = 29/207 (14%), Positives = 59/207 (28%), Gaps = 15/207 (7%) Query: 223 IERVPFIDTFDLRYKNYE-KRPDPSSQAVMFCLMDVSGSMDQSTKDMAKRFYILLYLFLS 281 E+ + LR++ + V+ +D S SM + L +L+ Sbjct: 15 KEKEVQENHLLLRFRTPANPNVEERKPLVIGLAIDKSWSMKGEKMEAVIDASCALVNWLT 74 Query: 282 RTYKNVEVVY---------IRHHTQAKEVDEHEFFYSQETGGTIVSSALKLMDEVVKERY 332 R V Y + H T+ V + Q T +S + + + Sbjct: 75 RHDAVSIVAYSADVQLIQPVTHLTEKVSVTDK-IRNIQVATSTNLSGGWLSALKSLNQSK 133 Query: 333 NPAQWNIYAAQASDGDNWAD--DSPLCHEILAKKLLPVVRYYSYIEITRRAHQTLWREYE 390 P + +DG+ + D I A L + + ++ + E Sbjct: 134 IPNAYKRV-LLLTDGNPTSGIKDKEALVTIAADHLSMGISTTTIGVGN-DFNEEMLVEIA 191 Query: 391 HLQSTFDNFAMQHIRDQDDIYPVFREL 417 + D + F ++ Sbjct: 192 KAGGGNFYYIDNPENASDIFFEEFGDI 218 >UniRef50_C3JL94 von Willebrand factor type A domain protein n=1 Tax=Rhodococcus erythropolis SK121 RepID=C3JL94_RHOER Length = 614 Score = 111 bits (277), Expect = 5e-23, Method: Composition-based stats. Identities = 31/226 (13%), Positives = 56/226 (24%), Gaps = 19/226 (8%) Query: 200 NSEPAQLLEEERLRKEIAELRAKIERVPFIDTFDLRYKNYEKRPDPSSQAVMFCLMDVSG 259 + + L A F TF L + + + + ++D SG Sbjct: 3 YTRGQTVSRAIPLAPRTVGHAALAALAAFALTFGL--FSTPTAQAEETTSSVLFVVDTSG 60 Query: 260 SMDQSTKDMAKRFYILLYLFLSRTYKNVEVVYIRH-----------HTQAKEVDEHEFFY 308 SM S AK LS + T ++ + Sbjct: 61 SMAGSPLAQAKDALRAGIGALSSGQAAGLRSFAGDCGNGGQLLVPVATDNRDQLNNATNQ 120 Query: 309 SQETGGTIVSSALKLMDEVVKERYNPAQWNIYAAQASDGDNWADDSPLCHEILAKKLLPV 368 G T AL+ + P+ + SDG + D L +L Sbjct: 121 LTAGGTTPTPDALRAAAGDL-----PSTGDRTIILISDGQSTCGDPCAVATELKTQLGID 175 Query: 369 VRYYSYIEITRRAHQTLWREYEHLQSTFDNFAMQHIRDQDDIYPVF 414 R ++ ++ + F + + D Sbjct: 176 FRVHAVGFNAPDVAESELSCIANATGGRY-FTATNTTELSDAISAA 220 >UniRef50_C4XPW8 Putative uncharacterized protein n=1 Tax=Desulfovibrio magneticus RS-1 RepID=C4XPW8_DESMR Length = 439 Score = 111 bits (277), Expect = 5e-23, Method: Composition-based stats. Identities = 34/232 (14%), Positives = 62/232 (26%), Gaps = 15/232 (6%) Query: 209 EERLRKEIAELRAKIERVPFIDTFDLRYKNYEKRPDPSSQAVMFCLMDVSGSMDQSTKDM 268 L L A + + E ++ + +D SGSM + Sbjct: 5 NVLLTPRRPALVAGFDNTLDVLVRIQAPNTPEGETKERTRLNLALAIDRSGSMAGRPLEE 64 Query: 269 AKRFYILLYLFLSRTYKNVEVVYIRHH--------TQAKEVDEHEFFYSQETGGTIVSSA 320 AKR + L T + + Y + K + + G T + Sbjct: 65 AKRCASFVVDKLKNTDRVSLIAYDSSIETRVPSVKVEDKAIFHRAIEGIDDGGCTNLHGG 124 Query: 321 LKLMDEVVKERYNPAQWNIYAAQASDGD-NWADDSPLCHEILAKKLLP-VVRYYSYIEIT 378 E + +P+ + SDG N ++L V +Y Sbjct: 125 WLKGAEQISPYIDPSTISR-IILLSDGQANEGLTDEAEIFKQCRELADAGVTTSTYGLG- 182 Query: 379 RRAHQTLWREYEHLQSTFDNF---AMQHIRDQDDIYPVFRELFHKQNATAKG 427 ++TL + A + + + LF KQ + Sbjct: 183 SNFNETLMIGMAKNGQGNSYYGRTADDLMDPFQEELSLLEALFAKQVRASIS 234 >UniRef50_Q503P4 Zgc:110377 n=9 Tax=Clupeocephala RepID=Q503P4_DANRE Length = 868 Score = 111 bits (277), Expect = 5e-23, Method: Composition-based stats. Identities = 38/340 (11%), Positives = 82/340 (24%), Gaps = 33/340 (9%) Query: 110 EGQDEFVFQISKDEYLDLLFEDLALP-NLKQNQQRQLTEYKTHRAGYTANGVPANISVVR 168 + F ++ +E L F L +K Q Q E Y G+ + Sbjct: 117 AAKSSVTFILTHEELLQRRFSKYELMIRVKPKQLVQHFEIVADI--YEPQGIAFVDAYGT 174 Query: 169 SLQNSLARRTAMTAGKRRELHALEENLAIISNSEPAQLLE----EERLRKEIAELRAKIE 224 + N L T ++ + + + R Sbjct: 175 FITNELLPLVDKTVTDKKAHV---SFSPTLDQQRKCTECDGTLIDGDFFITYDVNRPHDI 231 Query: 225 RVPFIDTFDLRYKNYEKRPDPSSQAVMFCLMDVSGSMDQSTKDMAKRFYILLYLFLSRTY 284 I + + P ++ ++D S SM + K + L Sbjct: 232 GDIQIVNGYFVHF-FAPANLPRVPKMVVFVIDNSYSMYGNKMAQTKEALGTILGELPEDD 290 Query: 285 KNVEVVYIRHHTQAKEVDEHEFFY-----------SQETGGTIVSSALKLMDEVV----K 329 +V+ + + GGT + A E++ + Sbjct: 291 YFAIIVFSTTFVVWRPYLSKATEENVKEAQEYVKTIEVIGGTELHDATIHGVEMLYAAQR 350 Query: 330 ERYNPAQWNIYAAQASDGDNW--ADDSPLCHEILAKKLLPVVRYYSYIEITRRAHQTLWR 387 P + +DG P E + K + + + A Sbjct: 351 NGTAPKNMVLMMILLTDGQPNQYPRSLPEIQESIRKAIDGNITLFGLAFGN-DADYGFLD 409 Query: 388 EYEHLQSTFDNFAMQHIRDQDDIYPVFRELFHKQNATAKG 427 + ++ I + D + + + ++ Sbjct: 410 TLSKQNNG----IVRRIYEDSDAPLQLKGFYEEVSSPLLS 445 >UniRef50_B4DPQ4 cDNA FLJ60769, highly similar to Inter-alpha-trypsin inhibitor heavy chain H3 n=11 Tax=Tetrapoda RepID=B4DPQ4_HUMAN Length = 698 Score = 111 bits (277), Expect = 5e-23, Method: Composition-based stats. Identities = 35/340 (10%), Positives = 94/340 (27%), Gaps = 31/340 (9%) Query: 110 EGQDEFVFQISKDEYLDLLFEDLALP-NLKQNQQR---QLTEYKTHRAGYTANGVPANIS 165 + F+++ +E L + ++ Q ++ G + A+ Sbjct: 145 AAGSKVTFELTYEELLKRHKGKYEMYLKVQPKQLVKHFEIEVDIFEPQGISMLDAEASFI 204 Query: 166 VVRSLQNSLARRTA--MTAGKRRELHALEENLAIISNSEPAQLLEEERLRKEIAELRAKI 223 L ++L + + + + + ++S L R Sbjct: 205 TNDLLGSALTKSFSGKKGHVSFKPSLDQQRSCPTCTDS-----LLNGDFTITYDVNRESP 259 Query: 224 ERVPFIDTFDLRYKNYEKRPDPSSQAVMFCLMDVSGSMDQSTKDMAKRFYILLYLFLSRT 283 V ++ + + + + P + ++D+SGSM + K + + + Sbjct: 260 GNVQIVNGYFVHFFAP--QGLPVVPKNVAFVIDISGSMAGRKLEQTKEALLRILEDMKEE 317 Query: 284 YKNVEVVYIRH-HTQAKEVDEHEFFYSQET----------GGTIVSSALKLMDEVV---- 328 ++ T + + + QE G T ++ L ++ Sbjct: 318 DYLNFTLFSGDVSTWKEHLVQATPENLQEARTFVKSMEDKGMTNINDGLLRGISMLNKAR 377 Query: 329 KERYNPAQWNIYAAQASDGD-NWADDSPLCHEILAKK-LLPVVRYYSYIEITRRAHQTLW 386 +E P + +DGD N + P + + + Y+ + Sbjct: 378 EEHRIPERSTSIVIMLTDGDANVGESRPEKIQENVRNAIGGKFPLYNLGFGN-NLNYNFL 436 Query: 387 REYEHLQSTFDNFAMQHIRDQDDIYPVFRELFHKQNATAK 426 F + + + E+ + + Sbjct: 437 ENMALENHGFARRIYEDSDADLQLQGFYEEVANPLLTGVE 476 >UniRef50_Q6PGW2 Zgc:112265 protein (Fragment) n=15 Tax=Clupeocephala RepID=Q6PGW2_DANRE Length = 927 Score = 111 bits (277), Expect = 6e-23, Method: Composition-based stats. Identities = 34/334 (10%), Positives = 76/334 (22%), Gaps = 28/334 (8%) Query: 110 EGQDEFVFQISKDEYLDLLFEDLALPNLKQNQQRQLTEYKTHRAGYTANGVPANISVVRS 169 + F+++ +E L L L Q Q N + + V Sbjct: 137 AANSKVTFELTYEELLKRRLGKYEL--LINAQPMQPVADFKIDVHIQENPGISFLEVKGD 194 Query: 170 LQNSLARRTAMTAGKRRELHALEENLAIISNSEPA----QLLEEERLRKEIAELRAKIER 225 L + + R + A + L R + Sbjct: 195 L--NTGDLASAVKTTRADKDAWVTFYPTRDQQTKCTNCAENGLNGDLIITYDVNRGNPKG 252 Query: 226 VPFIDTFDLRYKNYEKRPDPSSQAVMFCLMDVSGSMDQSTKDMAKRFYILLYLFLSRTYK 285 I + P + ++D SGSM + + + L Sbjct: 253 EVQISNGYF-VHYFAPSDVPHIPKNVVFIIDRSGSMHGRKIRQTRSALLTILKDLDEDDH 311 Query: 286 NVEVVYIRHHTQAKEVDEHEF-----------FYSQETGGTIVSSALKLMDEVVKERYNP 334 + + + Q+ G T ++ A+ +++ R Sbjct: 312 FGLITFDAEIDFWRRELLQATKANRENAESFVKRIQDRGATNINDAVLAGVDMI-NRNPR 370 Query: 335 AQWNIYAAQASDGDNW-ADDSPLCHEILAKK-LLPVVRYYSYIEITRRAHQTLWREYEHL 392 +DGD + + K+ + Y + + Sbjct: 371 KGTASILILLTDGDPTAGETNIEKIMANVKEAIGSKFPLYCLGFGY-DVNFDFLTKMSLE 429 Query: 393 QSTFDNFAMQHIRDQDDIYPVFRELFHKQNATAK 426 + + I + D + + + Sbjct: 430 NNA----VARRIYEDSDADIQLQGFYDEVAVPLL 459 >UniRef50_Q498Q0 Inter-alpha (Globulin) inhibitor H3 n=6 Tax=Clupeocephala RepID=Q498Q0_DANRE Length = 892 Score = 111 bits (276), Expect = 6e-23, Method: Composition-based stats. Identities = 21/201 (10%), Positives = 55/201 (27%), Gaps = 19/201 (9%) Query: 239 YEKRPDPSSQAVMFCLMDVSGSMDQSTKDMAKRFYILLYLFLSRTYKNVEVVYIRH---- 294 + + ++D SGSM + + + + + L++ + + H Sbjct: 257 FAPTDVQRIPKNVVFIIDQSGSMQGNKIEQTRMAMLRILSDLAKDDYFGLITFSSHIQAW 316 Query: 295 -------HTQAKEVDEHEFFYSQETGGTIVSSALKLMDEVVKERYNPAQWNIYAAQASDG 347 + E + + G T ++ A+ ++ + Y +DG Sbjct: 317 KPELLKATAENVEEAKTFVKQIRSGGATDINGAVLNAVNMINQ-YTQEGSASILILLTDG 375 Query: 348 DNW-ADDSPLCHEILAK-KLLPVVRYYSYIEITRRAHQTLWREYEHLQSTFDNFAMQHIR 405 D +P+ + K + Y + + A + I Sbjct: 376 DPTSGVTNPVTIQQNVKTAIGGKYPLYCLGFGF-NVRFEFLEKMSLENNG----AARRIY 430 Query: 406 DQDDIYPVFRELFHKQNATAK 426 + D + + + Sbjct: 431 EDSDADLQLQGFYEEVAIPLL 451 >UniRef50_C7R936 Vault protein inter-alpha-trypsin domain protein n=1 Tax=Kangiella koreensis DSM 16069 RepID=C7R936_KANKD Length = 689 Score = 111 bits (276), Expect = 6e-23, Method: Composition-based stats. Identities = 24/215 (11%), Positives = 63/215 (29%), Gaps = 20/215 (9%) Query: 219 LRAKIERVPFIDTFDLRYKNYEKRPDPSSQAVMFCLMDVSGSMDQSTKDMAKRFYILLYL 278 L ++ + ++ + M ++D SGSM + AK+ Sbjct: 299 LFSESYDDHNYHVLMMLPPTHDLVQQKTQPREMIFVIDSSGSMSGESMQQAKQGLYYALS 358 Query: 279 FLSRTYKNVEVVYIRHHTQ-----------AKEVDEHEFFYSQETGGTIVSSALKLMDEV 327 LS + + + E+ ++ + GGT ++ A+ L + Sbjct: 359 QLSINDTFNIIDFDNDANKLFDEAVPATLSNLEMAKYFVATLEADGGTEIAKAINLALD- 417 Query: 328 VKERYNPAQWNIYAAQASDGDNWADDSPLCHEILAKKLLPVVRYYSYIEITRRAHQTLWR 387 + +DG + + + + L R ++ + Sbjct: 418 ----KPDSSLLRQVVFLTDG---SIGNERQIFQMIENQLGNNRLFTIGIGA-APNSYFMS 469 Query: 388 EYEHLQSTFDNFAMQHIRDQDDIYPVFRELFHKQN 422 + + + + Q + +F++L + Sbjct: 470 KAANYGRGTFTYIGKASEVQTKLEQLFKKLRYPAL 504 >UniRef50_Q12VX7 Putative uncharacterized protein n=1 Tax=Methanococcoides burtonii DSM 6242 RepID=Q12VX7_METBU Length = 892 Score = 111 bits (276), Expect = 6e-23, Method: Composition-based stats. Identities = 23/186 (12%), Positives = 53/186 (28%), Gaps = 21/186 (11%) Query: 249 AVMFCLMDVSGSMD------QSTKDMAKRFYILLYLFLSRTYKNVEVVYIRHHTQAK--- 299 + ++D SGSM + AK + L + V + T + Sbjct: 617 LDIVLVLDRSGSMKFLGNAPEQPLTDAKSAAKIFMENLLSNTEVGVVSFSSTSTVDRQPV 676 Query: 300 --------EVDEHEFFYSQETGGTIVSSALKLMDEVVKERYNPAQWNIYAAQASDGDNWA 351 ++ + GGT + A+ + ++ A+ +DG A Sbjct: 677 SLNISGNKDLLHNAIDSMVADGGTAIGDAMADANNLLINGRPDAK--KIMIVLTDGVATA 734 Query: 352 DDSPLCHEILAKKLLPVVRYYSYIEITRRA-HQTLWREYEHLQSTFDNFAMQHIRDQDDI 410 + ++ L +R YS + + + + + + + Sbjct: 735 GSDRDGSDAISTANLNNIRIYSIGLGSSEYIDEPMLKRIASETGGSY-YNAPSGSELQTV 793 Query: 411 YPVFRE 416 Y + Sbjct: 794 YNTISK 799 >UniRef50_UPI00017B0D26 UPI00017B0D26 related cluster n=2 Tax=Tetraodon nigroviridis RepID=UPI00017B0D26 Length = 856 Score = 111 bits (276), Expect = 7e-23, Method: Composition-based stats. Identities = 36/380 (9%), Positives = 88/380 (23%), Gaps = 32/380 (8%) Query: 66 QGRGGLRHRVHPGNDHFVQNDRIERPQGGGGGSGSGQGQASQDGEGQDEFVFQISKDEYL 125 Q R + D +N + + A G++ +F ++ +E L Sbjct: 53 QSEVRPREKKVKSEDGRGKNKDLRFCSSVEPEMELFKMAA--TIPGRNRAIFMLTYEELL 110 Query: 126 DLLFEDLALPNLKQNQQRQLTEYKTHRAGYTANGVPANISVVRSLQNSLARRTAMTAGKR 185 L+ Q + + + + R Sbjct: 111 RRRLGRYEHVTLRPLQLVSRLTLDLTIVDHAPITQLEVLPLRNGASRASGRTWGGPKAP- 169 Query: 186 RELHALEENLAIISNSEPAQLLEEERLRKEIAELRAKIERVPFIDTFDLRYKNYEKRPDP 245 + + I+ + A R + + + P Sbjct: 170 -----ITFSPNIVQQARIATNGLLGDFVVRYDVQRDMGIGDVQVLDGHF-VHYFAPKDLP 223 Query: 246 SSQAVMFCLMDVSGSMDQSTKDMAKRFYILLYLFLSRTYKNVEVVYIR-----HHTQAKE 300 + + ++D S SM K + + L + + + + Sbjct: 224 AVPKNVVFVIDTSASMLGKKMRQTKEALLTILGDLRPADRFNFISFSSRIRVWQPGRLVP 283 Query: 301 VDEHEFFY-------SQETGGTIVSSALKLMDEVVKERY--NPAQWNIY--AAQASDGDN 349 +GGT + A++ ++++ A N +DG Sbjct: 284 ATPSAVRDAKKFVVMLPTSGGTDIDGAIQTGSSLLRDHLSGRDAGPNSVSLIIFLTDGQP 343 Query: 350 W-ADDSPLCHEILAKKL-LPVVRYYSYIEITRRAHQTLWREYEHLQSTFDNFAMQHIRDQ 407 + P A+ ++ L M+ I ++ Sbjct: 344 TVGEVRPGAILGNARAAVRDKFCIFTIGMG-DDVDYRLLERMALDNCG----MMRRIPEE 398 Query: 408 DDIYPVFRELFHKQNATAKG 427 D + + + + Sbjct: 399 ADASSMLKGFYDEIGTPLLS 418 >UniRef50_UPI00017450FB von Willebrand factor type A domain protein n=1 Tax=Verrucomicrobium spinosum DSM 4136 RepID=UPI00017450FB Length = 424 Score = 111 bits (276), Expect = 7e-23, Method: Composition-based stats. Identities = 31/189 (16%), Positives = 50/189 (26%), Gaps = 12/189 (6%) Query: 241 KRPDPSSQAVMFCLMDVSGSMDQSTKDMAKRFYILLYLFLSRTYKNVEVVY------IRH 294 + + + ++D SGSM A+ L V Y I Sbjct: 32 EASAKRAPVNVTIVIDKSGSMGGDKMVHAREAAKQALDRLGAGDMVSVVAYDDAVSLISP 91 Query: 295 HTQ--AKEVDEHEFFYSQETGGTIVSSALKLMDEVVKERYNPAQWNIYAAQASDG-DNWA 351 T ++ + Q G T + S + E ++ P Q N SDG N Sbjct: 92 ATDLTDRDRVKAAIDRIQAGGSTALFSGISKGAEELRRNKRPNQVNRVV-LLSDGMANVG 150 Query: 352 DDSPLCHEIL-AKKLLPVVRYYSYIEITRRAHQTLWREYEHLQSTFDNFAMQHIRDQDDI 410 SP L A + + ++ L E F Sbjct: 151 PSSPQDLGRLGASLAKEGITVTTLGLGLG-YNEDLMTELALRSDGNHAFIENSQNLAGIF 209 Query: 411 YPVFRELFH 419 F ++ Sbjct: 210 QTEFGDILS 218 >UniRef50_UPI00016E8A41 UPI00016E8A41 related cluster n=3 Tax=Takifugu rubripes RepID=UPI00016E8A41 Length = 945 Score = 111 bits (276), Expect = 7e-23, Method: Composition-based stats. Identities = 40/350 (11%), Positives = 99/350 (28%), Gaps = 33/350 (9%) Query: 106 SQDGEGQDEFVFQISKDEYLDL-LFEDLALPNLKQNQQRQ-LTEYKTHRAGYTANGVPAN 163 + + G++ VF ++ +E L L + +L+ Q LT T G+ Sbjct: 145 AANIPGRNRAVFMLTYEELLQRRLGRYEHVTSLRPLQLVSRLTLDVTIIDHSAITGLEVL 204 Query: 164 I--SVVRSLQNSLARRTAMTAGKRRELHALEENLAIISNSEPAQLLEEERLRKEIAELRA 221 + + +R + E+N+ I+ S + + Sbjct: 205 LLRNSRGGGGAGTSRASGRPQLPVTTEIKKEKNMCRITFSPNIVQQARIATNGLLGDFVI 264 Query: 222 KIERVPFIDTFDLRYKN------YEKRPDPSSQAVMFCLMDVSGSMDQSTKDMAKRFYIL 275 + + + D++ N + + P+ + ++D S SM K Sbjct: 265 RYDVQRDLGIGDIQVLNGHFVHYFAPKDLPAVPKNVVFVIDTSASMLGKKIRQTKEALFT 324 Query: 276 LYLFLSRTYKNVEVVYIR-----HHTQAKEVD-------EHEFFYSQETGGTIVSSALKL 323 + L + + + V + F +GGT ++SA++ Sbjct: 325 ILGDLRPGDHFNFISFSSRVKVWQPGRLVPVTPNNVRDAKKFIFMLPTSGGTNINSAIQT 384 Query: 324 MDEVVKERYNPAQWNI----YAAQASDGDNWADD--SPLCHEILAKKLLPVVRYYSYIEI 377 ++++ + + +DG + S + ++ Sbjct: 385 GSSLLQDYLSAQDASPNSVSLIIFLTDGQPTVGEVQSVTILGNTRSAVQGKFCIFTIGIG 444 Query: 378 TRRAHQTLWREYEHLQSTFDNFAMQHIRDQDDIYPVFRELFHKQNATAKG 427 L M+ I ++ D + + + + Sbjct: 445 N-DVDYRLLERMALDNCG----MMRRIPEEADASSMLKGFYDEIGTPLLS 489 >UniRef50_A9F1H2 Family membership n=1 Tax=Sorangium cellulosum 'So ce 56' RepID=A9F1H2_SORC5 Length = 607 Score = 110 bits (275), Expect = 8e-23, Method: Composition-based stats. Identities = 29/226 (12%), Positives = 49/226 (21%), Gaps = 15/226 (6%) Query: 212 LRKEIAELRAKIERVPFIDTFDLRYKNYEKRPDPSSQAVMFCLMDVSGSMDQSTKDMAKR 271 E + ++ E + ++D SGSM +A Sbjct: 2 FNARPDRPWLPAEPSERLLRVEITVPRPE-GGQARKPVHLSLVIDRSGSMSGEKLRLALE 60 Query: 272 FYILLYLFLSRTYKNVEVVY---------IRHHTQ-AKEVDEHEFFYSQETGGTIVSSAL 321 L + V + T A+ E G T + Sbjct: 61 AARQAIRTLQPGDRFSVVTFDHQVEVPIPSTDATPGARLRAEAALDTVIARGNTDLGGGW 120 Query: 322 KLMDEVVKERYNPAQWNIYAAQASDGD-NWADDSPLCHEILAK-KLLPVVRYYSYIEITR 379 V +DG N SP A+ + L V + Sbjct: 121 LRGCAEVGAHLPEDAIGRV-LLLTDGQANHGITSPDELTSRARSQRLRRVTTSTIGLGEG 179 Query: 380 RAHQTLWREYEHLQSTFDNFAMQHIRDQDDIYPVFRELFHKQNATA 425 ++ L FA + + E+ A Sbjct: 180 -FNEFLLGRLSEEGGGNFYFAARADELPGFVGREIGEVLSVVARDA 224 >UniRef50_A6CIG8 Putative uncharacterized protein n=1 Tax=Bacillus sp. SG-1 RepID=A6CIG8_9BACI Length = 931 Score = 110 bits (275), Expect = 9e-23, Method: Composition-based stats. Identities = 25/179 (13%), Positives = 44/179 (24%), Gaps = 14/179 (7%) Query: 243 PDPSSQAVMFCLMDVSGSMDQSTKDMAKRFYILLYLFLSRTYKNVEVVY--------IRH 294 M ++D SGSM +AK I L + + Sbjct: 402 KKELPSLGMVIVLDRSGSMAGYKIQLAKEAAIRSAELLREKDTLGFIAFDDRPWQIIDTE 461 Query: 295 HTQAKEVDEHEFFYSQETGGTIVSSALKLMDEVVKERYNPAQWNIYAAQASDGDNWADDS 354 + KE + GGT + +L+L E + + +DG + Sbjct: 462 PIKDKEKVIEKINGLTSGGGTNIFPSLELAYEQL---TPLELQRKHIILLTDGQSATSPD 518 Query: 355 PLCHEILAKKLLPVVRYYSYIEITRRAHQTLWREYEHLQSTFDNFAMQHIRDQDDIYPV 413 L K+ + + E + L E + Sbjct: 519 YLTTIQEGKENNITLSTVAIGEG---SDSVLLEELSDEGGGRFYDVNDSSTIPSILSRE 574 Score = 49.1 bits (115), Expect = 3e-04, Method: Composition-based stats. Identities = 22/126 (17%), Positives = 41/126 (32%), Gaps = 11/126 (8%) Query: 244 DPSSQAVMFCLMDVSGSMDQSTKDMAKRFYILLYLFLSRTYKNVEVVYIRHHTQAKEVDE 303 P + D+S S+D ++ ++ + R K V + + E Sbjct: 61 IPLKGNSTVYVADLSDSLDNQ-REHLRQTIDQAVKEMKREDKFGVVSAGGSAVVERPLKE 119 Query: 304 HEFFYSQETG-----GTIVSSALKLMDEVVKERYNPAQWNIYAAQASDGDNWADDSPLCH 358 Q T +SS L+L ++ P+ N SDG+ D+ Sbjct: 120 KSEAGVQFRSQLETYSTDISSGLRLGGSLI-----PSYTNGRVVLLSDGNENTGDAVKQA 174 Query: 359 EILAKK 364 L ++ Sbjct: 175 AYLKQQ 180 >UniRef50_B6ZDR6 Voltage dependent calcium channel alpha2d/delta subunit n=3 Tax=Euteleostomi RepID=B6ZDR6_RANCA Length = 1078 Score = 110 bits (275), Expect = 1e-22, Method: Composition-based stats. Identities = 32/231 (13%), Positives = 66/231 (28%), Gaps = 24/231 (10%) Query: 206 LLEEERLRKEIAELRAKIERVPFIDTFDLRYKNYEKRPDPSSQAVMFCLMDVSGSMDQST 265 L + + R ID +D+R + + +S M L+DVSGS+ T Sbjct: 215 LARYYPASPWVDKSRTP----NKIDLYDVRRRPW-YIQGAASPKDMLILVDVSGSVSGLT 269 Query: 266 KDMAKRFYILLYLFLSRTYKNVEVVYIRHHT-------------QAKEVDEHEFFYSQET 312 + + + LS + + + K+V + Sbjct: 270 LKLIRTSVTEMLETLSDDDFVNVAAFNSNAHDVSCFHHLVQANVRNKKVLKEAVNNITAK 329 Query: 313 GGTIVSSALKLMDEVVKE-RYNPAQWNIYAAQASDGDNWADDSPLCHEILAKKLLPVVRY 371 G T K + ++ + A N +DG +D L K VR Sbjct: 330 GTTDYKQGFKFAFDQLRNTNVSRANCNKIIMLFTDG---GEDKATETFKLYNKN-KTVRV 385 Query: 372 YSYIEITRRAHQTLWREYEHLQSTFDNFAMQHIRDQDDIYPVFRELFHKQN 422 +++ + + + + I + ++ + Sbjct: 386 FTFSVGQHNYDKGPIQWMACENKG-YYYEIPSIGAIRINTQEYLDVLGRPM 435 >UniRef50_C3ZG18 Putative uncharacterized protein n=1 Tax=Branchiostoma floridae RepID=C3ZG18_BRAFL Length = 806 Score = 110 bits (274), Expect = 1e-22, Method: Composition-based stats. Identities = 41/402 (10%), Positives = 91/402 (22%), Gaps = 38/402 (9%) Query: 48 SGESVSIPTEDISEPMFHQGRG-GLRHRVHPGNDHFVQNDRIERPQGGGGGSGSGQGQAS 106 G V ++ + + D + G S Q S Sbjct: 67 EGRRVIGKVKEKQKAREEYDDAIASGEGAFLFEEDDRSGDVFKCSVGNLPPKTSATIQLS 126 Query: 107 QDGE----GQDEFVFQISKDEYLDLLFED--LALPN--LKQNQQRQLTEYKTHRAGYTAN 158 E F + + A + + ++ Sbjct: 127 YVAELPVEADSSLKFVLPGVLNPRYSSDTGTAAYDDTYMAPTGDVVTSDAPYKLKLKVNV 186 Query: 159 GVPANISVVRSLQNSLARR---TAMTAGKRRELHALEENLAIISNSEPAQLLEEERLRKE 215 P +I + S ++S+ T+ + + + + + + L + Sbjct: 187 SSPNSIDKIESPKSSIDVTYGGTSAQVRLKDDHKLDSDVELYVHYKDKHRPFAVTELGQG 246 Query: 216 IAELRAKIERVPFIDTFDLRYKNYEKRPDPSSQAVMFCLMDVSGSMDQSTKDMAKRFYIL 275 A + + ++ ++D SGSM + A+ +L Sbjct: 247 TDGFMAD------HTVMLTFVPDLSREDLVANCGEFIFILDRSGSMSGNKIKNARETLLL 300 Query: 276 LYLFLSRTYKNVEVVYIR-----------HHTQAKEVDEHEFFYSQET-GGTIVSSALKL 323 L V + + ++ + + GGT + L+ Sbjct: 301 FLKSLPIGCYFNIVGFGSTHESLFKGSEKYDNKSLKTACKALGKMEADLGGTEILQPLQY 360 Query: 324 MDEVVKERYNPAQWNIYAAQASDGDNWADDSPLCHEILAKKLLPVVRYYSYIEITRRAHQ 383 + + A +DG+ W K R +S + Sbjct: 361 VYKQ----PPIAGHPRQLFLLTDGEVW---DTQACVREVAKHADSARCFSVGIGEGAS-T 412 Query: 384 TLWREYEHLQSTFDNFAMQHIRDQDDIYPVFRELFHKQNATA 425 L + F R Q + + Sbjct: 413 ALVKGVARAGRGKAEFVSGTDRLQAKVMRLLSCALQPTVTGV 454 >UniRef50_B1ZQD5 von Willebrand factor type A n=1 Tax=Opitutus terrae PB90-1 RepID=B1ZQD5_OPITP Length = 859 Score = 110 bits (274), Expect = 1e-22, Method: Composition-based stats. Identities = 35/294 (11%), Positives = 74/294 (25%), Gaps = 15/294 (5%) Query: 140 NQQRQLTEYKTHRAGYTANGVPANISVVRSLQNSLARRTAMTAGKRRELHALEENLAIIS 199 L + + ++ V A + EE Sbjct: 408 RPATGLDAPQAEVSTAKEPVSTFSLHVSDVSFQLAQAALARGEMPDPQRIRPEEFYNAFD 467 Query: 200 NSEPAQLLEEERLRKEIAELRAKIERVPFIDTFDLRYKNYEKRPDPSSQAVMFCLMDVSG 259 +P +++ I + + + + ++ + L+D SG Sbjct: 468 YGDPTPAS-ADKIACRIEQAAHPLLQQRNLVRIAMKV--PAAGRGAGQPLNLTVLLDTSG 524 Query: 260 SMDQSTKDM-AKRFYILLYLFLSRTYKNVEVVYIRHHTQAKEVDEHEFFYS--------Q 310 SM+++ + + +L L+ + + + R E + Sbjct: 525 SMERTDRATSVRAALGVLASLLTPDDRVTLIGFARQPRLLAESLAGDQARQLVDLASTTP 584 Query: 311 ETGGTIVSSALKLMDEVVKERYNPAQWNIYAAQASDGD-NWADDSPLCHEILAKKLLPVV 369 TGGT + +AL L E+ + +N A N +DG N + P + L Sbjct: 585 FTGGTNLEAALSLAGELARRHHNAAAQNR-IVLITDGAANLGNADPAQLATRIETLRQQG 643 Query: 370 RYY-SYIEITRRAHQTLWREYEHLQSTFDNFAMQHIRDQDDIYPVFRELFHKQN 422 + + T + F Sbjct: 644 IAFDACGVGTDGLDDAVLEALTRKGDGRYYVLDAPENADAGFARQLAGAFRPAA 697 >UniRef50_D2LQW0 von Willebrand factor type A n=1 Tax=Bacillus cellulosilyticus DSM 2522 RepID=D2LQW0_BACS4 Length = 282 Score = 110 bits (274), Expect = 1e-22, Method: Composition-based stats. Identities = 30/217 (13%), Positives = 56/217 (25%), Gaps = 13/217 (5%) Query: 212 LRKEIAELRAKIERVPFIDTFDLRYKNYEKRPDPSSQAVMFCLMDVSGSMDQSTKDMAKR 271 + + K + ++ K + S + L+D SGSM K Sbjct: 9 FAHQYENVPCKGKEAAYLLVELTGAK---VKHTERSPINLSLLLDRSGSMSGEPLRYCKE 65 Query: 272 FYILLYLFLSRTYKNVEVVYIRHH--------TQAKEVDEHEFFYSQETGGTIVSSALKL 323 + L+ VV+ K++ + + G T +S L Sbjct: 66 ACNFVINQLTDKDILSVVVFDDQVETIIEPQKVTHKDLLKEYIQRIETRGITNLSGGLIQ 125 Query: 324 MDEVVKERYNPAQWNIYAAQASDGD-NWADDSPLCHEILAKKLLPVVRYYSYIEITRRAH 382 + V ++ N SDG N LA S + ++ Sbjct: 126 GCQHVLKQEVKNYVNRVI-LLSDGQANAGITDKEALVKLADDYQSAGLVISTLGVSEHFD 184 Query: 383 QTLWREYEHLQSTFDNFAMQHIRDQDDIYPVFRELFH 419 + L +F + L + Sbjct: 185 EELLEGVADSGRGNFHFINEVENIPSIFEQELDGLLN 221 >UniRef50_A0EHG0 Chromosome undetermined scaffold_97, whole genome shotgun sequence n=2 Tax=Paramecium tetraurelia RepID=A0EHG0_PARTE Length = 533 Score = 110 bits (274), Expect = 1e-22, Method: Composition-based stats. Identities = 26/207 (12%), Positives = 63/207 (30%), Gaps = 20/207 (9%) Query: 224 ERVPFIDTFDLRYKNYEKRPDP-----SSQAVMFCLMDVSGSMDQSTKDMAKRFYILLYL 278 + ++ + + CL+D SGSM + ++ + Sbjct: 92 NQQNAALMITIKSNDILLINQRGQECVRQGVDLVCLIDHSGSMQGEKIKLVRKTLKQMLT 151 Query: 279 FLSRTYKNVEVVY---IRHHTQAKEVDEH-------EFFYSQETGGTIVSSALKLMDEVV 328 FL + +++ + T+ V + Q GGT + + +K+ ++ Sbjct: 152 FLQPCDRLCLIMFDCKVYRLTRLMRVTQENVQKFRVAISSLQARGGTDIGNGMKMALSIL 211 Query: 329 KERYNPAQWNIYAAQASDGDNWADDSPLCHEILAK-KLLPVVRYYSYIEITRRAHQTLWR 387 K R + SDG + + L + + ++ + Sbjct: 212 KHRKYKNPVS-AIFLLSDGVDEGA-EERVRDDLIQYNIRDSFTIKTFGFGR-DCCPKIMS 268 Query: 388 EYEHLQSTFDNFAMQHIRDQDDIYPVF 414 E H + F ++ + D+ + Sbjct: 269 EIAHYKEGQFYFVP-NLTNIDECFAEA 294 >UniRef50_Q09DT2 Inter-alpha-trypsin inhibitor family heavy chain-related protein-hypothetical secreted or membrane-associated protein containing vWFA domain n=1 Tax=Stigmatella aurantiaca DW4/3-1 RepID=Q09DT2_STIAU Length = 843 Score = 110 bits (274), Expect = 1e-22, Method: Composition-based stats. Identities = 24/193 (12%), Positives = 49/193 (25%), Gaps = 23/193 (11%) Query: 246 SSQAVMFCLMDVSGSMDQSTKDMAKRFYILLYLFLSRTYKNVEVVYIR-----------H 294 + + ++D SGSM+ + A+ L L + + + Sbjct: 244 PKRQEVVFVVDTSGSMEGESLPQAQGALRLCLRHLREGDRFNIIAFDTSFQSFAPQPAVF 303 Query: 295 HTQAKEVDEHEFFYSQETGGTIVSSALKLMDEVVKERYNPAQWNIYAAQASDGDNWADDS 354 + E + + GGT + + + E +DG + Sbjct: 304 TQKTLEQADRWVAALRANGGTELLQPMLAAVQAAPEG--------VVVLLTDGQ---VGN 352 Query: 355 PLCHEILAKKLLPVVRYYSYIEITRRAHQTLWREYEHLQSTFDNFAMQHIRDQDDIYPVF 414 + R YS+ T L ++ F R D + F Sbjct: 353 EAEILQAVLRARKTARIYSFGIGT-NVSDALLKDMARQTDGAVEFIHPGERIDDKVVAQF 411 Query: 415 RELFHKQNATAKG 427 + + Sbjct: 412 SRALAPRITELQA 424 >UniRef50_UPI0000E460BF PREDICTED: similar to inter-alpha-trypsin inhibitor heavy chain3 n=5 Tax=Strongylocentrotus purpuratus RepID=UPI0000E460BF Length = 1028 Score = 110 bits (274), Expect = 1e-22, Method: Composition-based stats. Identities = 21/227 (9%), Positives = 57/227 (25%), Gaps = 21/227 (9%) Query: 212 LRKEIAELRAKIERVPFIDTFDLRYKNYEKRPDPSSQAVMFCLMDVSGSMDQSTKDMAKR 271 + K I + P+++ + ++DVSGSM KR Sbjct: 273 FIVTYDVIHTKKAGHLEIVDGYF-VHYFSPDGLPNTRKNVIFVIDVSGSMYGQKTRQTKR 331 Query: 272 FYILLYLFLSRTYKNVEVVYIRHHTQAKE------------VDEHEFFYSQETGGTIVSS 319 + + + + +++ + +E + GGT + Sbjct: 332 AFTTILDDVRPIDRINIILFSSYAHVWREDQMVEATSDNIAAAKRHVNGLSVGGGTNIYD 391 Query: 320 ALKLMDEVVKERYNPAQWNIYAAQASDGDNWADDSPLCHEILAKKLLPVVRYYSYIEITR 379 +L E++ E +DG ++ + + + +S Sbjct: 392 SLMKAVEILLEHDTGDAM-PLIIMLTDGQV--GNAAAIVRDVTSVIGGRLSLFSIGFGNG 448 Query: 380 RAHQTLWREYEHLQSTFDNFAMQHIRDQDDIYPVFRELFHKQNATAK 426 + + + + + + + Sbjct: 449 -VDFPFLEKLSLSNQA----LARKVYEDSSASLQMKGFYDEVANPLL 490 >UniRef50_Q0AV90 Putative uncharacterized protein n=1 Tax=Syntrophomonas wolfei subsp. wolfei str. Goettingen RepID=Q0AV90_SYNWW Length = 776 Score = 109 bits (273), Expect = 1e-22, Method: Composition-based stats. Identities = 35/337 (10%), Positives = 78/337 (23%), Gaps = 56/337 (16%) Query: 96 GGSGSGQGQASQDGEGQDEFVFQISKDEYLDLLFEDLALPNLKQNQQRQLTE----YKTH 151 G G+ E D + D P + + R T Sbjct: 151 PGKPLGKKIGPGRAEPTDR------------VPDADFISPPIGETGYRATLSLHVHNNTP 198 Query: 152 RAGYTANGVPANISVVRSLQNSLARRTAMTAGKRRELHALEENLAIISNSEPAQLLEEER 211 + + I + ++ + T + L + Sbjct: 199 ISSIKSPSHKIRIDRMDEYSATITLQENNTRM---------------NRDFVLNLKLDGE 243 Query: 212 LRKEIAELRAKIERVPFIDTFDLRYKNYEKRPDPSSQAVMFCLMDVSGSMDQSTKDMAKR 271 I + + T+ E+R L+D+S SM+ + A Sbjct: 244 TVPRIIYWKNPKDEYFACITYTPELPIIEQRQ----PKEYIFLIDISRSMEGKKIEHAAD 299 Query: 272 FYILLYLFLSRTYKNVEVVYIRHHT-----------QAKEVDEHEFFYSQETGGTIVSSA 320 + L + + + + + GGT + A Sbjct: 300 AIQICLRNLDEGDSFNLLAFESENHAFAPKSLPYNQENLDKASAWVKNLHAMGGTNILPA 359 Query: 321 LKLMDEVVKERYNPAQWNIYAAQASDGDNWADDSPLCHEILAKKLLPVVRYYSYIEITRR 380 ++L + A+DG + +K + +S T Sbjct: 360 VQLALKEA------GDQQKVVILATDGQ---VGNENEIINYVRKRNQNLCLFSLGIDT-A 409 Query: 381 AHQTLWREYEHLQSTFDNFAMQHIRDQDDIYPVFREL 417 + + + F+ ++ + F + Sbjct: 410 VNSYFINQIAEAGNGCAEFSYPGESLEEKMLRHFARI 446 >UniRef50_C5WYV0 Putative uncharacterized protein Sb01g047470 n=3 Tax=Sorghum bicolor RepID=C5WYV0_SORBI Length = 686 Score = 109 bits (273), Expect = 1e-22, Method: Composition-based stats. Identities = 36/327 (11%), Positives = 78/327 (23%), Gaps = 35/327 (10%) Query: 133 ALPNLKQNQQRQLTEYKTHRAGYTANGVPANISVVRSLQNSLARRTAMTAGKRRELHALE 192 LP+++ +Q + + + L + + L Sbjct: 43 ELPSVRPSQPSSMPPTLPRQPLPRMVPMHGVQPPPVPAGQPLPQPAEPEVFDDDDEVELP 102 Query: 193 ENLAIISNSEPAQLLEEERLRKEIAELRAKIERVPFIDTFDLRYKNY-------EKRPDP 245 + + + + E + + + F ++ R P Sbjct: 103 SGEDNQRQATASSGMLAVKTHVEFSAVARDSSQDHFAVLVHVKAPGVIVNEAAAGDRDAP 162 Query: 246 SSQAVMFCLMDVSGSMDQSTKDMAKRFYILLYLFLSRTYKNVEVVYIR----------HH 295 + + ++DVSGSM + K+ + L + V + Sbjct: 163 RAPLDLVTVLDVSGSMRWDKLALVKQAMGFVIGSLGPHDRLSVVSFSSGARRVTRLLRMS 222 Query: 296 TQAKEVDEHEFFYSQETGGTIVSSALKLMDEVVKERYNPAQWNIYAAQASDGDNWAD--- 352 K + + GGT ++ L+ +V+ ER + + SDG + Sbjct: 223 HTGKSLATEAVESLRAGGGTNIAEGLRTAAKVLGERRHRNAVSSVI-LLSDGHDNYSMPR 281 Query: 353 -------------DSPLCHEILAKKLLPVVRYYSYIEITRRAHQTLWREYEHLQSTFDNF 399 P A +++ +F Sbjct: 282 RARGGVPPNYEVLVPPSFVPGTASTGEGSAPIHTFGFGN-DHDAAAMHVVAEATGGTFSF 340 Query: 400 AMQHIRDQDDIYPVFRELFHKQNATAK 426 QD L A+ Sbjct: 341 IENEAVIQDAFAQCIGGLLTVVAQEAR 367 >UniRef50_A2E1S5 von Willebrand factor type A domain containing protein n=2 Tax=Trichomonas vaginalis RepID=A2E1S5_TRIVA Length = 688 Score = 109 bits (273), Expect = 2e-22, Method: Composition-based stats. Identities = 34/257 (13%), Positives = 74/257 (28%), Gaps = 25/257 (9%) Query: 185 RRELHALEENLAIISNSEPAQLLEEERLRKEIAELRAKIERVPFIDTFDLRYKNYEK--- 241 + ++++ + +I +E + I ++ I + Y Sbjct: 166 KEIINSVRGTINVIDPHNVIFATKEFPNDESITIETQIKDKDNNIAIWSDGYIAISTFTY 225 Query: 242 -RPDPSSQAVMFCLMDVSGSMDQSTKDMAKRFYILLYLFLSRTYKNVEVVYIR------- 293 S + + ++D SGSM S AK + L + + + Sbjct: 226 FETKVHSNSEFYFIIDCSGSMSGSCIQNAKLCLNIFMHSLPIGCRFSIIKFGSDYEVALH 285 Query: 294 ---HHTQAKEVDEHEFFYSQE-TGGTIVSSALKLMDEVVKERYNPAQWNIYAAQASDGDN 349 + + + GGT + S LK + E+ + +DG + Sbjct: 286 PCDYTDENVSEAMKQLNNIDAEMGGTDILSPLKYVMEL----TPKQGFIKQVFLLTDGQD 341 Query: 350 WADDSPLCHEILAKKLLPVVRYYSYIEITRRAHQTLWREYEHLQSTFDNFAMQHIRDQDD 409 + LA++ R +S + A + L F + + Sbjct: 342 ---SNTNELCALAQENRTNNRIFSIGIGSG-ADKDLIINVSQKSGGNYVFVDD--DESEK 395 Query: 410 IYPVFRELFHKQNATAK 426 + EL + + A Sbjct: 396 LNEKVIELLNSAISYAL 412 >UniRef50_A1U6Y4 Vault protein inter-alpha-trypsin domain protein n=2 Tax=Marinobacter RepID=A1U6Y4_MARAV Length = 712 Score = 109 bits (273), Expect = 2e-22, Method: Composition-based stats. Identities = 31/262 (11%), Positives = 68/262 (25%), Gaps = 26/262 (9%) Query: 181 TAGKRRELHALEENLAIISNSEPAQLLEEERLRKEIAELRAKIERVPFIDTFDLRYKNYE 240 T+ LE + A +S + L++ + + + F + Sbjct: 281 TSPSHPLQVELEGSRATVSPEQGQILMDRDVIVRWRPADNQAPTAALFRQQWQGEDFLMA 340 Query: 241 KRPDPSS-----QAVMFCLMDVSGSMDQSTKDMAKRFYILLYLFLSRTYKNVEVVYIRHH 295 P++ + + ++D SGSM + A+ + L + + + Sbjct: 341 MVMPPATTGQVLRRELLFVIDTSGSMAGESIRQARSALLRGLDTLRPGDRFNVIQFNSQA 400 Query: 296 T-----------QAKEVDEHEFFYSQETGGTIVSSALKLMDEVVKERYNPAQWNIYAAQA 344 GGT ++ AL L + + + Sbjct: 401 HALYTQPVPANGHYLARARDYVQDLTADGGTEMAGALSLAMGM--DGSESSGHVQQMVFM 458 Query: 345 SDGDNWADDSPLCHEILAKKLLPVVRYYSYIEITRRAHQTLWREYEHLQSTFDNFAMQHI 404 +DG A + + L R ++ + RE + Sbjct: 459 TDG---AVGNESALFDQIRTGLGNRRLFTVAIG-SAPNMHFLREAARWGRGQY----TAV 510 Query: 405 RDQDDIYPVFRELFHKQNATAK 426 ++ +LF A Sbjct: 511 HSAAEVDKALGKLFAAMEAPVM 532 >UniRef50_P19823 Inter-alpha-trypsin inhibitor heavy chain H2 n=40 Tax=Euteleostomi RepID=ITIH2_HUMAN Length = 946 Score = 109 bits (272), Expect = 2e-22, Method: Composition-based stats. Identities = 24/205 (11%), Positives = 52/205 (25%), Gaps = 22/205 (10%) Query: 239 YEKRPDPSSQAVMFCLMDVSGSMDQSTKDMAKRFYILLYLFLSRTYKNVEVVY------- 291 + + ++DVSGSM + L + + Sbjct: 299 FAPDNLDPIPKNILFVIDVSGSMWGVKMKQTVEAMKTILDDLRAEDHFSVIDFNQNIRTW 358 Query: 292 ----IRHHTQAKEVDEHEFFYSQETGGTIVSSALKLMDEVVKERYN----PAQWNIYAAQ 343 I + Q +GGT ++ AL ++ E N Sbjct: 359 RNDLISATKTQVADAKRYIEKIQPSGGTNINEALLRAIFILNEANNLGLLDPNSVSLIIL 418 Query: 344 ASDGDNWADDSP--LCHEILAKKLLPVVRYYSYIEITRRAHQTLWREYEHLQSTFDNFAM 401 SDGD + + + + + + +S + + Sbjct: 419 VSDGDPTVGELKLSKIQKNVKENIQDNISLFSLGMGF-DVDYDFLKRLSNENHG----IA 473 Query: 402 QHIRDQDDIYPVFRELFHKQNATAK 426 Q I D ++ +++ + Sbjct: 474 QRIYGNQDTSSQLKKFYNQVSTPLL 498 >UniRef50_UPI000180BC4A PREDICTED: similar to predicted protein n=1 Tax=Ciona intestinalis RepID=UPI000180BC4A Length = 1038 Score = 109 bits (272), Expect = 2e-22, Method: Composition-based stats. Identities = 29/238 (12%), Positives = 62/238 (26%), Gaps = 43/238 (18%) Query: 193 ENLAIISNSEPAQLLEEERLRKEIAELRAKIERVPFIDTFDLRYKNYEKRPDPSSQAVMF 252 +NL + L + + ++ + + + Sbjct: 143 QNLPSLKWQYFGSEQGVTTLFPSLRATDCGS-FDNRCRPWYVQANVPKPKQ-------IV 194 Query: 253 CLMDVSGSMDQSTKDMAKRFYILLYLFLSRTYKNVEVVYIR--------------HHT-- 296 ++D SGSM + ++AK + L+ + + + T Sbjct: 195 IVIDKSGSMGVTNMNLAKEAAKSVVNTLNPQDRFAVMAFSSIFVPFQSTVASDQCFATTF 254 Query: 297 -----QAKEVDEHEFFYSQETGGTIVSSALKLMDEVVKERY----------NPAQWNIYA 341 Q K+ E GGT + AL+ ++ +P++ + Sbjct: 255 ADASPQNKKKVEDFVDTISSGGGTNYAPALQKAFSFFQQEPSVSDFNIKKIDPSEIDRVI 314 Query: 342 AQASDGDNW--ADDSPLCHEILAKKLLPVVRYYSYIEITRRAHQTLWREYEHLQSTFD 397 SDG ++L V +Y A + R + Sbjct: 315 LFMSDGIPNDPGSTILSAQIRANEQLNNSVIILTYGLG--NADFGVLRNMATNKGDVY 370 >UniRef50_Q7UNM0 Putative uncharacterized protein n=1 Tax=Rhodopirellula baltica RepID=Q7UNM0_RHOBA Length = 900 Score = 109 bits (272), Expect = 2e-22, Method: Composition-based stats. Identities = 22/188 (11%), Positives = 48/188 (25%), Gaps = 15/188 (7%) Query: 234 LRYKNYEKRPDPSSQAVMFCLMDVSGSMDQSTKDMAKRFYILLYLFLSRTYKNVEVVYIR 293 L ++ ++ M ++D SGSM ++AK L + + Sbjct: 448 LPVRSNFEKEREKPSLAMMLVIDKSGSMGGQKIELAKDAAQAAVELLGPKDAIGVIAFDG 507 Query: 294 H--------HTQAKEVDEHEFFYSQETGGTIVSSALKLMDEVVKERYNPAQWNIYAAQAS 345 T + + +GGT + A+ E + + + Sbjct: 508 DSYTVSELRSTSDRGAISDAISTIEASGGTNMYPAMADAYEALLGATAK---LKHVILMT 564 Query: 346 DGDNWADDSPLCHEILAKKLLPVVRYYSYIEITRRAHQTLWREYEHLQSTFDNFAMQHIR 405 DG + D + + + + + + L E + F Sbjct: 565 DGVSSPGDFQGVAGDM-SASRITLSTVALGQGSS---EDLLEELAQIGGGRYYFCDDPQS 620 Query: 406 DQDDIYPV 413 Sbjct: 621 VPQVFAKE 628 >UniRef50_UPI0001744662 Vault protein inter-alpha-trypsin domain protein n=1 Tax=Verrucomicrobium spinosum DSM 4136 RepID=UPI0001744662 Length = 679 Score = 109 bits (272), Expect = 2e-22, Method: Composition-based stats. Identities = 32/236 (13%), Positives = 64/236 (27%), Gaps = 23/236 (9%) Query: 198 ISNSEPAQLLEEERLRKEIAELRAKIERVPFIDTFDLRYKNYEKRPDPSSQAVMFCLMDV 257 + + + L + + F+ K + P ++DV Sbjct: 268 LRYQLSGREVATGLLLHQAPAGSSPEAESFFLLNVQPPAKWEAGQTPPR---DYLFVLDV 324 Query: 258 SGSMDQSTKDMAKRFYILLYLFLSRTYKNVEVVYIRHHT-----------QAKEVDEHEF 306 SGSM+ + +KR L L+ + + + + + Sbjct: 325 SGSMNGFPIETSKRLMSDLLKGLNPGDTFNILHFASDSAVLSPKPLAATPENIHLATKDL 384 Query: 307 FYSQETGGTIVSSALKLMDEVVKERYNPAQWNIYAAQASDGDNWADDSPLCHEILAKKLL 366 + GGT + AL+ + +DG L +K L Sbjct: 385 SRHRGNGGTELLPALQRAL----ATPREVGVSRSIVILTDG---YVTIEKEAFRLVRKEL 437 Query: 367 PVVRYYSYIEITRRAHQTLWREYEHLQSTFDNFAMQHIRDQDDIYPVFRELFHKQN 422 +++ T ++ L H F + +D FRE + Sbjct: 438 QNANVFTFGIGT-AVNRWLIEGLAHAGQGDP-FVVLSEKDAAAAAERFREYISRPV 491 >UniRef50_B9GVZ4 Predicted protein n=4 Tax=rosids RepID=B9GVZ4_POPTR Length = 757 Score = 109 bits (272), Expect = 2e-22, Method: Composition-based stats. Identities = 29/199 (14%), Positives = 60/199 (30%), Gaps = 28/199 (14%) Query: 242 RPDPSSQAVMFCLMDVSGSMDQSTKDMAKRFYILLYLFLSRTYKNVEVVY----IRHHTQ 297 + + + + ++D+SGSM + AK + L+ + + + Sbjct: 324 QSMKAFRKEVIFIIDISGSMKGGPFESAKNGLLSSLQKLNPEDSFNIIAFKMDTYLFSSV 383 Query: 298 AKEVDEHE--------FFYSQETGGTIVSSALKLMDEVVKERYNPAQWNIYAAQASDGDN 349 ++ E GGT + LK +++ E N +DG Sbjct: 384 MEQATEEAIIEATRWLNDKLTADGGTNILGPLKQAIKLLAETTNS---IPVIFLITDG-- 438 Query: 350 WADDSPLCHEILAKKLLP-----VVRYYSYIEITRRAHQTLWREYEHLQSTFDNFAMQHI 404 A + K LP +R ++ T + R + Sbjct: 439 -AVEDERDICNFVKGYLPSGGSISLRISTFGIGT-YCNHHFLRMLAQIGRGHF----DTA 492 Query: 405 RDQDDIYPVFRELFHKQNA 423 D D + ++LF ++ Sbjct: 493 YDADSVDFRMQKLFTTASS 511 >UniRef50_Q22N58 von Willebrand factor type A domain containing protein n=1 Tax=Tetrahymena thermophila SB210 RepID=Q22N58_TETTH Length = 669 Score = 109 bits (271), Expect = 2e-22, Method: Composition-based stats. Identities = 34/230 (14%), Positives = 63/230 (27%), Gaps = 32/230 (13%) Query: 221 AKIERVPFIDTFDLRYKNYEKRPDPSSQAVMFCLMDVSGSM----------------DQS 264 K++ +R + CL+D S SM D + Sbjct: 5 TKLDLQLSQTKNYVRVSVIPPDDLERHPCNIVCLVDGSLSMGSKLVIHQKNGGKKESDMT 64 Query: 265 TKDMAKRFYILLYLFLSRTYKNVEVVYIRHHTQAKEVDEHEF----------FYSQETGG 314 T D+ K + L+ + V + H E+ E + G Sbjct: 65 TLDLVKHTVKTIASSLNPQDRLALVGFSTHSKIYFELTEMDDQGKNVAFTEIDKMWAGGQ 124 Query: 315 TIVSSALKLMDEVVKERYNPAQWNIYAAQASDGDNWADDSPLCHEILAKKL----LPVVR 370 T + L+ EV+K+ + P Q N+ +DG + E+L + Sbjct: 125 TNIWGGLQDSLEVIKKGFRPNQ-NVCIFLFTDGRPTMIPAIGHVEMLRRWKEQHPAIQFS 183 Query: 371 YYSYIEITRRAHQTLWREYEHLQSTFDNFAMQHIRDQDDIYPVFRELFHK 420 +++ L E Q+ +F + Sbjct: 184 IFTFGFGN-DLDTDLMLELSQEQNGIFSFISDSSMLGTVFSNALANILST 232 >UniRef50_A0C051 Chromosome undetermined scaffold_14, whole genome shotgun sequence n=3 Tax=Paramecium tetraurelia RepID=A0C051_PARTE Length = 636 Score = 109 bits (271), Expect = 3e-22, Method: Composition-based stats. Identities = 31/193 (16%), Positives = 59/193 (30%), Gaps = 23/193 (11%) Query: 242 RPDPSSQAVMFCLMDVSGSMDQSTKDMAKRFYILLYLFLSRTYKNVEVVYIRHHT----- 296 + + + CL+D+SGSM +M K I+L FL + + + Sbjct: 153 QKNQRVGVDLICLIDISGSMIGVKIEMVKASLIVLLQFLGDNDRLQLITFDNDAHRLTPL 212 Query: 297 -----QAKEVDEHEFFYSQETGGTIVSSALKLMDEVVKERYNPAQWNIYAAQASDGDNWA 351 Q K + GG +S A K+ +K R SDG ++ Sbjct: 213 KTVTNQNKSYFTQIIKQIKANGGNRISEATKMAFYQLKSRKYINNVTSV-FLLSDGVDY- 270 Query: 352 DDSPLCHEILAKKLLPVVRYYSYIEITRRAHQTLWREYEHLQSTFDNFAMQHIRDQDDIY 411 + + + + V +++ + + +L+S F Sbjct: 271 --TYPEVKNQIQTVNEVFTLHTFGFG-EDHDAQMMTQLCNLKSGSFYFVQD--------V 319 Query: 412 PVFRELFHKQNAT 424 + E F Sbjct: 320 TLLDEFFADALGG 332 >UniRef50_B0UK93 LPXTG-motif cell wall anchor domain protein n=1 Tax=Methylobacterium sp. 4-46 RepID=B0UK93_METS4 Length = 761 Score = 109 bits (271), Expect = 3e-22, Method: Composition-based stats. Identities = 34/422 (8%), Positives = 77/422 (18%), Gaps = 39/422 (9%) Query: 34 ISEAINKRSVTDVDSGESVSIPTEDISEPMFHQGRGGLRHRVHPGNDHF----------- 82 + E ++T V ++ R + Sbjct: 146 LPEDAAVDTMTLVVGDRVIAGEIRAREAARTAYEAARETGRAAALTEQERPNLFTTSVAN 205 Query: 83 -VQNDRIERPQGGGGGSGSGQGQASQDGEGQDEFVFQISKDEYLDLLFEDLALP-----N 136 + + G + + + P Sbjct: 206 IGPGETVLVQIAFQQPVRLSGGTHALRLPLVVAPRYSPAPGLLQPAAEGPARDPVPDRAR 265 Query: 137 LKQNQQRQLTEYKTHRAGYTANGVPANISVVRSLQNSLARRTAMTAGKRRELHALEENLA 196 + + T R RR A A Sbjct: 266 IAPPVLDPAVHGPVNPVTLTVTLRAGFPLGTVESATHAIRVEETGPDSRRVTLADGPVPA 325 Query: 197 IISNSEPAQLLEEERLRKEIAELRAKIERVPFIDTFDLRYKNYEKRPDPSSQAVMFCLMD 256 + + + R + + +RP + ++D Sbjct: 326 DRDFALTWRAAPSAAPAVGLFRERVGEDEYLLAVVTPPEGRAPARRPR-----EVTFVID 380 Query: 257 VSGSMDQSTKDMAKRFYILLYLFLSRTYKNVEVVY-----------IRHHTQAKEVDEHE 305 SGSM ++ AK ++ L + + + + ++ Sbjct: 381 NSGSMAGASMRQAKASLLVALDRLGPADRFNVIRFDDTMDLLFPAPVPADEAHRDAARRF 440 Query: 306 FFYSQETGGTIVSSALKLMDEVVKERYNPAQWNIYAAQASDGDNWADDSPLCHEILAKKL 365 + GGT + L+ +DG A + Sbjct: 441 VAALEARGGTEMLPPLRAAL--ADPHPEEGDRVRQIVFLTDG---AIGNEEQIFSAISAG 495 Query: 366 LPVVRYYSYIEITRRAHQTLWREYEHLQSTFDNFAMQHIRDQDDIYPVFRELFHKQNATA 425 R + + L L + + + +L Sbjct: 496 RGRSRLFMIGIG-SAPNGHLMTHAAELGGGSYTAIGTIDQVAERTAELLAKLESPVVTDL 554 Query: 426 KG 427 Sbjct: 555 AA 556 >UniRef50_A6DLI0 Putative uncharacterized protein n=1 Tax=Lentisphaera araneosa HTCC2155 RepID=A6DLI0_9BACT Length = 833 Score = 109 bits (271), Expect = 3e-22, Method: Composition-based stats. Identities = 24/200 (12%), Positives = 55/200 (27%), Gaps = 23/200 (11%) Query: 234 LRYKNYEKRPDPSSQAVMFCLMDVSGSMDQSTKDMAKRFYILLYLFLSRTYKNVEVVYIR 293 L + ++ + ++D SGSM+ +A+ LS + + + Sbjct: 397 LPVTSRYEKEKEQPSLALVLVIDKSGSMNGQPIVLAREASKAAAELLSSRDQVGVIAFDG 456 Query: 294 HHTQAKEVDEHEF--------FYSQETGGTIVSSALKLMDEVVKERYNPAQWNIYAAQAS 345 ++ GGT + A+ + +++ + + S Sbjct: 457 SAKLVTDLTSAANKGEVLSQIDGIGAGGGTNLYPAMVMGRDMLG---IASAKIKHMIVLS 513 Query: 346 DGDNWADDSPLCHEILAKKLLPVVRYYSYIEITRRAHQTLWREYEHLQSTFDNFAMQHIR 405 DG + D LA ++ + S + L + + Sbjct: 514 DGQSQGGDFEGISSELA-QMGVTISTVSLGQGAA---VDLMAAIAQIGNGRAYVTNN--- 566 Query: 406 DQDDIYPVFRELFHKQNATA 425 +F K+ A Sbjct: 567 -----AEEMPRIFTKETMEA 581 >UniRef50_Q562D1 LOC594926 protein (Fragment) n=18 Tax=Euteleostomi RepID=Q562D1_XENTR Length = 895 Score = 109 bits (271), Expect = 3e-22, Method: Composition-based stats. Identities = 24/240 (10%), Positives = 60/240 (25%), Gaps = 24/240 (10%) Query: 204 AQLLEEERLRKEIAELRAKIERVPFIDTFDLRYKNYEKRPDPSSQAVMFCLMDVSGSMDQ 263 + + R + ++ + + + + ++D S SM Sbjct: 232 STTQLDGDFTVTYDVNRETPGNIQVVNGYFVHFFAPS--KLKEVPKNIIFIIDRSISMIG 289 Query: 264 STKDMAKRFYILLYLFLSRTYKNVEVVY-----IRHHTQAKEVDEH------EFFYSQET 312 K + + + V++ I + K E+ Sbjct: 290 LKMQQTKEALLKILDDVKEHDHFNFVIFDWGVEIWEQSLVKATPENLNRAKAYVRNLYPK 349 Query: 313 GGTIVSSALKLMDEVVKE----RYNPAQWNIYAAQASDGDN-WADDSPLCHEILAKK-LL 366 G T ++ AL ++ + R P + +DG + + + A+ + Sbjct: 350 GWTNINDALLSAISLLDQAHDARSVPKRSASLIIFMTDGQPSTGERNLDKIQENARNAIR 409 Query: 367 PVVRYYSYIEITRRAHQTLWREYEHLQSTFDNFAMQHIRDQDDIYPVFRELFHKQNATAK 426 YS + S + I ++ D + + Sbjct: 410 GKYSLYSLGFGVG-VDYPFLEKLSLENSG----VARRIYEESDAALQMEGFYDEVANPTL 464 >UniRef50_Q1DE81 von Willebrand factor type A domain protein n=2 Tax=Myxococcales RepID=Q1DE81_MYXXD Length = 860 Score = 108 bits (270), Expect = 3e-22, Method: Composition-based stats. Identities = 26/192 (13%), Positives = 51/192 (26%), Gaps = 23/192 (11%) Query: 246 SSQAVMFCLMDVSGSMDQSTKDMAKRFYILLYLFLSRTYKNVEVVY-----------IRH 294 + + ++DVSGSM + A+ L L + + + + Sbjct: 279 PPKQEVVFVVDVSGSMAGESLPQAQAALRLCLRHLREGDRFNVIAFENRFQSFQPEPVPF 338 Query: 295 HTQAKEVDEHEFFYSQETGGTIVSSALKLMDEVVKERYNPAQWNIYAAQASDGDNWADDS 354 + E + GGT + + ++ + P +DG + Sbjct: 339 TQRTLEEADRWVAALNADGGTELLAPMRAAVQAA-----PDG---VIVLLTDGQ---VGN 387 Query: 355 PLCHEILAKKLLPVVRYYSYIEITRRAHQTLWREYEHLQSTFDNFAMQHIRDQDDIYPVF 414 + R YS+ T L R+ F R D + F Sbjct: 388 EAEILRAVLEARKTARVYSFGIGT-NVSDVLLRDMAKQTGGDVEFIHPGERIDDKVVAQF 446 Query: 415 RELFHKQNATAK 426 + + Sbjct: 447 SRALAPRVTELE 458 >UniRef50_UPI0000E105CF vault protein inter-alpha-trypsin n=1 Tax=Glaciecola sp. HTCC2999 RepID=UPI0000E105CF Length = 757 Score = 108 bits (270), Expect = 3e-22, Method: Composition-based stats. Identities = 27/191 (14%), Positives = 51/191 (26%), Gaps = 22/191 (11%) Query: 245 PSSQAVMFCLMDVSGSMDQSTKDMAKRFYILLYLFLSRTYKNVEVVYIRHHTQAKEVDEH 304 PS Q ++D SGSM + A +L+ V + + Sbjct: 386 PSIQQNTIFVLDSSGSMHGTALTQAIDAIREGVSYLTEHDTFNIVDFDSEARALWRQSQF 445 Query: 305 EFF-----------YSQETGGTIVSSALKLMDEVVKERYNPAQWNIYAAQASDGDNWADD 353 + GGT + AL L + + + +DG + + Sbjct: 446 ADEVSKAEAMRFLRHVDSDGGTNMQDALALSLTQL---LDSSTGLTQVIFVTDG---SIN 499 Query: 354 SPLCHEILAKKLLPVVRYYSYIEITRRAHQTLWREYEHLQSTFDNFAMQHIRDQDDIYPV 413 + + L R ++ + L + D +I P Sbjct: 500 NERELLKQIAEQLGDKRLFTVGIGA-APNSHFMEYAAMLGKGTYTYI----DDLTEIQPK 554 Query: 414 FRELFHKQNAT 424 LF + + Sbjct: 555 MAYLFSQLRSP 565 >UniRef50_D0KVI6 Vault protein inter-alpha-trypsin domain protein n=1 Tax=Halothiobacillus neapolitanus c2 RepID=D0KVI6_HALNC Length = 671 Score = 108 bits (269), Expect = 4e-22, Method: Composition-based stats. Identities = 31/215 (14%), Positives = 54/215 (25%), Gaps = 22/215 (10%) Query: 213 RKEIAELRAKIERVPFIDTFDLRYKNYEKRPDPSSQAVMFCLMDVSGSMDQSTKDMAKRF 272 + + + + K + ++DVSGSM + A Sbjct: 277 KINSGLMTYEWNGEHYFLMMAQPPKRVAPTEV--MKREYLFVVDVSGSMYGFPLNTASDL 334 Query: 273 YILLYLFLSRTYKNVEVVYIR-----HHTQAKEVDEH------EFFYSQETGGTIVSSAL 321 L L + + T + E+ Q GGT + AL Sbjct: 335 MRELLSSLKPQETFNILFFSGGSRVLSPTPLQATPENLQRAMTMMRSIQGGGGTELLPAL 394 Query: 322 KLMDEVVKERYNPAQWNIYAAQASDGDNWADDSPLCHEILAKKLLPVVRYYSYIEITRRA 381 K + +DG D L K+ L +++ Sbjct: 395 KTAFAM----PRTEDTARSIVVITDG---YVDVERQAYDLIKQNLNSTNLFAFGIG-SSV 446 Query: 382 HQTLWREYEHLQSTFDNFAMQHIRDQDDIYPVFRE 416 ++ L H F + D + FR Sbjct: 447 NRYLMESMAHAGQGEP-FIITGPNDVPGVGARFRR 480 >UniRef50_A2E6Y7 von Willebrand factor type A domain containing protein n=4 Tax=Trichomonas vaginalis RepID=A2E6Y7_TRIVA Length = 720 Score = 108 bits (269), Expect = 4e-22, Method: Composition-based stats. Identities = 22/196 (11%), Positives = 54/196 (27%), Gaps = 19/196 (9%) Query: 239 YEKRPDPSSQAVMFCLMDVSGSMDQSTKDMAKRFYILLYLFLSRTYKNVEVVYIR----- 293 + ++ + ++D SGSM S + AK +L L + + + Sbjct: 231 PQFEGKVEQKSEFYFIIDCSGSMSGSRIENAKFCLNILIHSLPIGCRFSIIQFGNSYKEV 290 Query: 294 -----HHTQAKEVDEHEFFYSQET-GGTIVSSALKLMDEVVKERYNPAQWNIYAAQASDG 347 + + + GGT + S L+ + + + + +DG Sbjct: 291 VSICDYSNKNVKYAMSAIARINADMGGTDILSPLEYVFK----KKLGKGFIRKIFLLTDG 346 Query: 348 DNWADDSPLCHEILAKKLLPVVRYYSYIEITRRAHQTLWREYEHLQSTFDNFAMQHIRDQ 407 + + +K R ++ + A L + Sbjct: 347 E---VHNSDMICSRVQKERENNRIFAIGLGSG-ADPGLIKNISAKSGGNYVLIADDDNMN 402 Query: 408 DDIYPVFRELFHKQNA 423 + I + + + Sbjct: 403 NMIVEIMKSALSPSLS 418 >UniRef50_A6GAI6 Putative uncharacterized protein n=1 Tax=Plesiocystis pacifica SIR-1 RepID=A6GAI6_9DELT Length = 560 Score = 108 bits (269), Expect = 5e-22, Method: Composition-based stats. Identities = 30/249 (12%), Positives = 60/249 (24%), Gaps = 20/249 (8%) Query: 188 LHALEENLAIISNSEPAQLLEEERLRKEIAELRAKIERVPFIDTFDLRYKNYEKRPDPSS 247 E + + L A + E + + P+ Sbjct: 160 PIRTWEFMNY--YGFDYDPAADGELSVYAAMNPIEGEGDEARFQMQIGVASELMTPEERP 217 Query: 248 QAVMFCLMDVSGSMDQSTKDMAKRFYILLYLFLSRTYKNVEVVYIRHHTQ---------- 297 + ++D SGSM + ++ + + L + + Sbjct: 218 PMNVTLVLDTSGSMAGTPIELLRETSRAIAAQLKLGDTVSICEWDTSNDWTLAGYAVTGP 277 Query: 298 AKEVDEHEFFYSQETGGTIVSSALKLMDEVVKERYNPAQWNIYAAQASD-GDNWADDSPL 356 E+ + GGT + L+ E+ + Y+P N SD G N Sbjct: 278 NDELLLEKINDVVHGGGTNLYGGLESGYELAQMVYDPDAINR-LVLISDGGANAGITDLD 336 Query: 357 CHEILAKKLL-PVVRYYSYIEIT-RRAHQTLWREYEHLQSTFDNFAMQHIRDQDDIYPVF 414 A + + L F ++++ F Sbjct: 337 LIAENAAYGGSDGIYLVGVGVDDPDDYNDELMDAVTDAGKGASVFMPSE----EEVWTTF 392 Query: 415 RELFHKQNA 423 + F A Sbjct: 393 GDNFESVMA 401 >UniRef50_A6Q208 von Willebrand factor type A domain protein n=1 Tax=Nitratiruptor sp. SB155-2 RepID=A6Q208_NITSB Length = 305 Score = 108 bits (269), Expect = 5e-22, Method: Composition-based stats. Identities = 29/205 (14%), Positives = 51/205 (24%), Gaps = 26/205 (12%) Query: 242 RPDPSSQAVMFCLMDVSGSMDQ----------STKDMAKRFYILLYLFLSRTYKNVEVVY 291 + +D SGSM + + D+ + R V++ Sbjct: 76 VHLKKKGYDIVLAIDASGSMQEKGFDPTDPQKTKFDVVRSLVKAFISK-RRNDNIGVVIF 134 Query: 292 IRHH------TQAKEVDEHEFFYSQ---ETGGTIVSSALKLMDEVVKERYNPAQWNIYAA 342 T KE + Y T + AL ++KE + Sbjct: 135 GSFAYIASPLTFNKEAVKKILDYLDIGVAGSKTAIDDALIESVRLLKESQAK---SKIVI 191 Query: 343 QASDG-DNWADDSPLCHEILAKKLLPVVRYYSYIEITRRAHQTLWREYEHLQSTFDNFAM 401 +DG D + P +AKK + + + R F Sbjct: 192 LLTDGIDTASKTPPDVAVKMAKKYGVKIYTIGIGDKRG-IDEAFLRWLAQQGHG-YYFYA 249 Query: 402 QHIRDQDDIYPVFRELFHKQNATAK 426 + IY L + + Sbjct: 250 KDASMLRKIYDEINRLEPSEIRGKE 274 >UniRef50_Q22ST4 von Willebrand factor type A domain containing protein n=1 Tax=Tetrahymena thermophila SB210 RepID=Q22ST4_TETTH Length = 648 Score = 108 bits (269), Expect = 5e-22, Method: Composition-based stats. Identities = 38/333 (11%), Positives = 87/333 (26%), Gaps = 37/333 (11%) Query: 117 FQISKDEYLDLLFEDLALPN-LKQNQQRQLTEYKTHRAGYTANGVPANISVVRSLQNSLA 175 Q ++ E L L + + P ++ Q + + ++ A +A + + Sbjct: 81 DQYTEQEILQLATQSIVSPQQMQLLMQYFMRQQSSNPAPLSAPVQFSRRRDSDFDLYEIQ 140 Query: 176 RRTAMTAGKRRELHALEENLAIISNSEPAQLLEEERLRKEIAELRAKIERVPFIDTFDLR 235 L + + L + R R + F+ + Sbjct: 141 ENYDDDELTESGLESKMG-IEEFQYKLSDNLELDVRCRH---KKILVKNNEKFLMPGMIT 196 Query: 236 YKNYEKRPDP------------SSQAVMFCLMDVSGSMDQSTKDMAKRFYILLYLFLSRT 283 + + + + ++D SGSM+ + K + + +S Sbjct: 197 VRTCDLDYEKLLKHHQHLQTLGRQTVDLVVVIDKSGSMEGEKIQLVKETLVKIINLMSSM 256 Query: 284 YKNVEVVYIRH----------HTQAKEVDEHEFFYSQETGGTIVSSALKLMDEVVKERYN 333 + V + + K+ + GGT +S + + ++ R Sbjct: 257 DRICIVCFNESGDRPLTFTRVTDENKQTLLNLIQQIYAGGGTNISEGINHALKAIQNRKF 316 Query: 334 PAQWNIYAAQASDGDNWADDSPLCHEILAKKLLPV----VRYYSYIEITRRAHQTLWREY 389 SDG + + + K + + E L R Sbjct: 317 KNNVTS-ILLLSDGQDTKAYTR--VKAYIDKYQIKDAFNIETIGFGE---DHDPKLLRTL 370 Query: 390 EHLQSTFDNFAMQHIRDQDDIYPVFRELFHKQN 422 L++ NF +F + Sbjct: 371 SDLRNGTFNFMQDVNYLDTAFINIFAGMISTVA 403 >UniRef50_A2SP98 Putative uncharacterized protein n=1 Tax=Methylibium petroleiphilum PM1 RepID=A2SP98_METPP Length = 791 Score = 108 bits (268), Expect = 5e-22, Method: Composition-based stats. Identities = 69/428 (16%), Positives = 127/428 (29%), Gaps = 46/428 (10%) Query: 22 FLRRYKAQIKQSISEAINKRSVTDVDSGESVSIP------TEDISEPMFHQGRGGLRHRV 75 +L K + ++ I ++ + S P +D+ M G Sbjct: 364 YLDMLKPSERVEMAMKILEKILQPQKSNGMPQQPQNGGLTIKDLERAMGRGGAP------ 417 Query: 76 HPGNDHFVQNDRIERPQGGGGGSGSGQG-QASQDGEGQDEFVFQISKDEYLDLLFEDLAL 134 +PGN + + G GSG+ A GQD +S ++ L + Sbjct: 418 NPGNGNSQSGGQPGDQAGAQDGSGTEDMVPAPTVTHGQD---HVMSTEDLAQALHDAGVS 474 Query: 135 PNLKQNQQRQLTEYKTHRAGYTANGVPANISVVRSLQNSLARRTAMTAGKRRELHALEEN 194 + + + +GV + I+ Q + R + + Sbjct: 475 SDTMAKLGFDDLKKIPEEVKHAKDGVVSAINKASEDQMKVGSRYPGGHLLHYAKAQMLDF 534 Query: 195 LAIISNSEPAQLLEEERLRKEIAELRAKIERVPFIDTFDLRYKNYEKRPD-------PSS 247 + E A E K + + +D D+ +K+ P Sbjct: 535 FKPVLTWEMAHKKLLEACGKGSRYDPTEPWTLYHVDAADMGFKHQRDVPFMGSRMPGKEQ 594 Query: 248 QAVMFCLMDVSGSMDQSTKDMAKRFYILLYLFLSRTYKNV--EVVYIRHHTQAKEVDEHE 305 + +MF ++D SGS+D + M KRF R + V +V+ T + V E Sbjct: 595 KPLMFDIIDTSGSVDDA---MLKRFVSEALNQARRVSRGVAPDVLISWADTICRGVPEFI 651 Query: 306 FF-----------YSQETGGTIVSSALKLMDEVVKERYNPAQWNI---YAAQASD-GDNW 350 GGT +A++ + E+VK +D GD+ Sbjct: 652 SEKNYKQFLTKGINYGGRGGTNFQAAIENVLEMVKPGSKSGYAKRNIDAICYMTDSGDSV 711 Query: 351 ADDSPLCHEILAKKLLPVVRYYSYIEITRRAHQTLWREYEHLQSTFDNFAMQHIRDQD-- 408 D + L + ++ + +E + A + Sbjct: 712 P-DPARLLRKAQECGLKKLPPILFLVPKSCYDERFAKEASKWATVVYFHAGPGAKHTQKV 770 Query: 409 DIYPVFRE 416 DI RE Sbjct: 771 DINAAARE 778 >UniRef50_A1VI76 Vault protein inter-alpha-trypsin domain protein n=3 Tax=Burkholderiales RepID=A1VI76_POLNA Length = 701 Score = 108 bits (268), Expect = 5e-22, Method: Composition-based stats. Identities = 27/197 (13%), Positives = 50/197 (25%), Gaps = 19/197 (9%) Query: 241 KRPDPSSQAVMFCLMDVSGSMDQSTKDMAKRFYILLYLFLSRTYKNVEVVYIRHHTQ--- 297 S ++D+SGSM D AK L L + +++ + Sbjct: 318 VAAQAISPRDYIFVVDISGSMHGFPLDTAKTLMRELIGKLRPSDTFNVLLFSGSNRFLSP 377 Query: 298 --------AKEVDEHEFFYSQETGGTIVSSALKLMDEVVKERYNPAQWNIYAAQASDGDN 349 E GGT + ALK + A + +DG Sbjct: 378 ASVPATQANIEQAVRTIDEMGGGGGTELIPALKRVY----AEPKAADVSRTVVVVTDG-- 431 Query: 350 WADDSPLCHEILAKKLLPVVRYYSYIEITRRAHQTLWREYEHLQSTFDNFAMQHIRDQDD 409 L ++ L +S+ ++ L + + + Sbjct: 432 -FVTVEREAFELVRRNLSQANLFSFGIG-SSVNRHLMEGLARAGMGEPFIITEPSQARAQ 489 Query: 410 IYPVFRELFHKQNATAK 426 R + + K Sbjct: 490 AERFRRLIESPVLTSVK 506 >UniRef50_A9QZI4 von Willebrand factor type A domain protein n=26 Tax=Gammaproteobacteria RepID=A9QZI4_YERPG Length = 472 Score = 107 bits (267), Expect = 6e-22, Method: Composition-based stats. Identities = 30/198 (15%), Positives = 54/198 (27%), Gaps = 21/198 (10%) Query: 239 YEKRPDPSSQAVMFCLMDVSGSMDQSTKDMAKRFYILLYLFLSRTYKNVEVVYIRHH--- 295 + S + ++D S SM + A+ IL L+ T V Y H Sbjct: 86 FNLDSTRRSPINLALVIDRSTSMSGERIEKAREEAILAVNMLNITDTLSVVAYDNHAEVI 145 Query: 296 ------TQAKEVDEHEFFYSQETGGTIVSSALKLMDEVVKERYNPAQWNIYAAQASDGD- 348 T + + G T + + + + V + N Q N SDG Sbjct: 146 IPATKVTDKPALIASIQQHIHPRGMTALFAGVSMGIGQVDKHLNREQVNR-IILISDGQA 204 Query: 349 ---NWADDSPLCHEILAKKLLPVVRYYSYIEITRRAHQTLWREYEHLQSTFDNFAMQHIR 405 + +A K + + ++ L F Sbjct: 205 NTGPTSISELSDLARMAAKKGIAITTIGLGQ---DYNEDLMTAIAGYSDGNHTFVA---- 257 Query: 406 DQDDIYPVFRELFHKQNA 423 + D+ F + F + Sbjct: 258 NSADLEKAFTKEFQDVMS 275 >UniRef50_O28828 Putative uncharacterized protein n=1 Tax=Archaeoglobus fulgidus RepID=O28828_ARCFU Length = 410 Score = 107 bits (267), Expect = 7e-22, Method: Composition-based stats. Identities = 47/316 (14%), Positives = 104/316 (32%), Gaps = 25/316 (7%) Query: 112 QDEFVFQISKDEYLDLLFEDLALPNLKQNQQRQLTEYKTHRAGYTANGVPANISVVRSLQ 171 D ++S + +D F+++ + L++ + E + HR + S+ Sbjct: 108 GDISKDELSMSQVVDNFFDEV-VDELQEMGYVEKVETRFHRKIIHYT------AKAESVL 160 Query: 172 NSLARRTAMTAGKRRELHALEENLAIISNSEPAQLLEEERLRKEIAELRAKIERVPFIDT 231 ++ +R E S ++++ + + + Sbjct: 161 AEKVLSLSLQNLDKRSYGEHETEKLGQSIFSSERIVDYDPFTHSYDNIDLVESLIASAMR 220 Query: 232 FDLRYKN---YEKRPDPSSQAVMFCLMDVSGSMDQSTKDMAKRFYILLYLFLSRTYKNVE 288 ++ ++P + + V L+DVS SM A + L + R E Sbjct: 221 GEIELNENEMVARQPKHTEKCVYVMLIDVSDSMRGRKIVGAIEAALCLRKAIRRAGSGDE 280 Query: 289 VVYIRHHTQAKEVDEHEFFYSQETGGTIVSSALKLMDEVVKERYNPAQWNIYAAQASDGD 348 + I + +A E+ E E + G T + ALK +++K SDG+ Sbjct: 281 LRVIAFNHRAHEIKEGEILNLEARGRTDIGLALKRARKILKGSSGTG----VVFLISDGE 336 Query: 349 NWADDSP-----LCHEILAKKLL---PVVRYYSYIEITRRAHQTLWREYEHL-QSTFDNF 399 + +P C A+K+ ++ + + R L + L + Sbjct: 337 PTSSYNPYLTPWRCALKEAEKMRNVDARLQIIMFGKEGRFL--ELCKNMAKLSGNANLFH 394 Query: 400 AMQHIRDQDDIYPVFR 415 + ++ + FR Sbjct: 395 FSDPLNLKNFVVSRFR 410 >UniRef50_A1ZFT4 Von Willebrand factor, type A n=1 Tax=Microscilla marina ATCC 23134 RepID=A1ZFT4_9SPHI Length = 827 Score = 107 bits (267), Expect = 7e-22, Method: Composition-based stats. Identities = 30/205 (14%), Positives = 52/205 (25%), Gaps = 22/205 (10%) Query: 223 IERVPFIDTFDLRYKNYEKRPDPSSQAVMFCLMDVSGSMDQSTKDMAKRFYILLYLFLSR 282 F K + P ++DVSGSM ++KR L L Sbjct: 299 ENAEKFFLMMMQPPKAPKNSQIPPR--EYVFIVDVSGSMHGFPLSVSKRLLKNLIGKLRP 356 Query: 283 TYKNVEVVYIRHHTQAKEVDEHE-----------FFYSQETGGTIVSSALKLMDEVVKER 331 K +++ + + GGT + ALK Sbjct: 357 KDKFNVMLFESSNQMMSPESMEATQANIQKAFGVIDQQRGGGGTRLLPALKKALAF---- 412 Query: 332 YNPAQWNIYAAQASDGDNWADDSPLCHEILAKKLLPVVRYYSYIEITRRAHQTLWREYEH 391 ++ +DG L + L +++ ++ L Sbjct: 413 KQTKDYSRSFVVVTDG---YVTVEKEAFDLIRNNLNRANLFAFGIG-SSVNRFLIEGMAR 468 Query: 392 LQSTFDNFAMQHIRDQDDIYPVFRE 416 F + H + D FR Sbjct: 469 AGMGEP-FIVTHGTEADVKAEKFRN 492 >UniRef50_Q8H924 Os10g0464900 protein n=3 Tax=Oryza sativa RepID=Q8H924_ORYSJ Length = 646 Score = 107 bits (267), Expect = 8e-22, Method: Composition-based stats. Identities = 25/208 (12%), Positives = 52/208 (25%), Gaps = 33/208 (15%) Query: 248 QAVMFCLMDVSGSMDQSTKDMAKRFYILLYLFLSRTYKNVEVVYIRHHTQAKEV------ 301 + ++DVSGSM + + K+ + L + + + ++ + Sbjct: 173 PLDLVTVLDVSGSMVGNKLALLKQAMGFVIDNLGPGDRLCVISFSSGASRLMRLSRMTDA 232 Query: 302 ----DEHEFFYSQETGGTIVSSALKLMDEVVKERYNPAQWNIYAAQASDGDNWADDSP-- 355 + GGT + +AL+ +V+ +R SDG + P Sbjct: 233 GKAHAKRAVGSLSARGGTNIGAALRKAAKVLDDRLYRNAVESVI-LLSDGQDTYTVPPRG 291 Query: 356 -----------------LCHEILAKKLLPVVRYYSYIEITRRAHQTLWREYEHLQSTFDN 398 P V + + + + + Sbjct: 292 GYDRDANYDALVPPSLVRADAGGGGGRAPPVHTFGFGK---DHDAAAMHTIAEVTGGTFS 348 Query: 399 FAMQHIRDQDDIYPVFRELFHKQNATAK 426 F QD L + Sbjct: 349 FIENEAAIQDGFAQCIGGLLSVAVQELR 376 >UniRef50_C1D2W8 Putative uncharacterized protein n=1 Tax=Deinococcus deserti VCD115 RepID=C1D2W8_DEIDV Length = 418 Score = 107 bits (267), Expect = 8e-22, Method: Composition-based stats. Identities = 24/183 (13%), Positives = 41/183 (22%), Gaps = 12/183 (6%) Query: 245 PSSQAVMFCLMDVSGSMDQSTKDMAKRFYILLYLFLSRTYKNVEVVYI--------RHHT 296 + ++D SGSM MAK+ I + V + Sbjct: 40 QRPPLNLAFVIDRSGSMSGLPLQMAKQAAIAAVRQARPDDRVSVVAFDDRVDVIVPSQLA 99 Query: 297 QAKEVDEHEFFYSQETGGTIVSSALKLMDEVVKERYNPAQWNIYAAQASDGD-NWADDSP 355 ++E + G T + V + P N SDG N Sbjct: 100 TSREAVIQAIGTIDDRGSTNLHGGWLEGATQVAQHLTPGALNRVI-LLSDGQANVGVTDR 158 Query: 356 LCHEILAKKL-LPVVRYYSYIEITRRAHQTLWREYEHLQSTFDNFAMQHIRDQDDIYPVF 414 + L + + + L + R Sbjct: 159 REIARQVRGLTERGISTTTIGLG-SHYDEELLLAIANAGDGNFEHVEDPSRLPTFFEEEL 217 Query: 415 REL 417 + L Sbjct: 218 QGL 220 >UniRef50_B2HDT6 Putative uncharacterized protein n=3 Tax=Mycobacterium RepID=B2HDT6_MYCMM Length = 772 Score = 107 bits (266), Expect = 9e-22, Method: Composition-based stats. Identities = 46/423 (10%), Positives = 96/423 (22%), Gaps = 42/423 (9%) Query: 16 MVNRQRFLRRYKAQIKQSISEAINKRSVTDVDSGESVSIPTEDISEPMFHQGRGGLRHRV 75 + R + Y+ + AI + DV S + G Sbjct: 97 LEERGQAREDYEQALAAGQRAAIVEEDRPDVFS----------VRVGNLGPGEQATIEMC 146 Query: 76 HPGNDHFVQNDRIERPQGGGGGSGSGQGQASQDGEGQDEFVFQISKDEYLDLLFEDLALP 135 G F + R + D G + + + P Sbjct: 147 LTGPLAFEDGEATYRFPLVVAPRYTTGHPVGGDQTGSGVARDTDAVPDAS-----RVTPP 201 Query: 136 NLKQNQQRQLTEYKTHRAGYTANGVPANISVVRSLQNSLARRTAMTAGKRRELHALEENL 195 L +R + G S+ ++ A A A + + Sbjct: 202 LLTDADERPDLQISLSVDGAGLPVSDLRASLPTAVLAPAADGLARLRV-ESGARADRDFV 260 Query: 196 AIISNSEPAQLLEEERLRKEIAELRAKIERVPFIDTFDLRYKNYEKRPDPSSQAVMFCLM 255 + + E + T S+ + ++ Sbjct: 261 LRFRLDQGRLSSSALLVADAAGADATDAEEGTWSLT------LVPPAEPSSAPRDVVVVL 314 Query: 256 DVSGSMDQSTKDMAKRFYILLYLFLSRTYKNVEVVY--------------IRHHTQAKEV 301 D SGSM A+R + L + + + + + + Sbjct: 315 DRSGSMGGWKMVAARRAAGRIVDMLDAGDRFCVLAFDDRIETPPAMPDGLVPASDRNRFA 374 Query: 302 DEHEFFYSQETGGTIVSSALKLMDEVVKERYNPAQWNIYAAQASDGDNWADDSPLCHEIL 361 + GGT+++ L E++ + +DG +D Sbjct: 375 ASSWLGSLRSRGGTVMAQPLTNAVEMLADS--GEDRQASVVLVTDGQISGEDH---LLRS 429 Query: 362 AKKLLPVVRYYSYIEITRRAHQTLWREYEHLQSTFDNFAMQHIRDQDDIYPVFRELFHKQ 421 ++ R Y + L S R + + + R + Sbjct: 430 LAPVVGRTRIYCVGVDR-AVNAGFLERLAGLGSGRAELVESEDRLDEVMARLARTIGRPA 488 Query: 422 NAT 424 + Sbjct: 489 LTS 491 >UniRef50_C5Z1W1 Putative uncharacterized protein Sb10g030210 n=1 Tax=Sorghum bicolor RepID=C5Z1W1_SORBI Length = 607 Score = 107 bits (266), Expect = 9e-22, Method: Composition-based stats. Identities = 40/271 (14%), Positives = 77/271 (28%), Gaps = 26/271 (9%) Query: 168 RSLQNSLARRTAMTA-GKRRELHALEENLAIISNSEPAQLLEEERLRKEIAELRAKIERV 226 R+ +R+ + + + EE +A SN+ + + + K + Sbjct: 23 RAASAGRSRKPGTKSSAPNKMFNDDEEPIAPASNAGKQVRGFSDVGKASVKPYYPKEAPL 82 Query: 227 PFIDTFDLRYKNYEKRPDPSSQAVMFCLMDVSGSMDQ-STKDMAKRFYILLYLFLSRTYK 285 L + + + ++DVSGSM D K + L+ + Sbjct: 83 GASTVRVLLDVSSSSSTAGRAALDLVVVLDVSGSMRDFGRLDKLKSAMRFIIKKLAPMDR 142 Query: 286 NVEVVYIRHHTQ----------AKEVDEHEFFYSQETGGTIVSSALKLMDEVVKERYNPA 335 V + T+ A V GGT + + LK+ +V+ R Sbjct: 143 LSVVTFNGGATRECPLRAMSEDAVPVLTDIVDGLVARGGTNIEAGLKMGLQVLDGRRYTG 202 Query: 336 QWNIYAAQASDGDNWADDSPLCHEILAKKLLPVVRYYSYIEITRRAHQTLWREYEHLQST 395 SDG+ + D + V S+ A L ++ Sbjct: 203 ARTAGVILMSDGEQNSGD----ATRVRNPQNYPVYTLSFG---SNADMNLLQKLAG-GGG 254 Query: 396 FDNFAMQHIRDQDDIYPVFRELFHKQNATAK 426 N + ++F + A Sbjct: 255 TYNPVLDSGGM------SMLDVFSQLMAGLL 279 >UniRef50_B0CG18 von Willebrand factor type A domain protein, putative n=5 Tax=Cyanobacteria RepID=B0CG18_ACAM1 Length = 708 Score = 107 bits (266), Expect = 1e-21, Method: Composition-based stats. Identities = 39/421 (9%), Positives = 100/421 (23%), Gaps = 25/421 (5%) Query: 22 FLRRYKAQIKQSISEAINKRSVTDVDSGESVSIPTEDISEPMFHQGRGGLRHRVHPGNDH 81 + R +A + + E + + ++ E + + Sbjct: 108 YDRPLEAIYQFPLPEDAAVDDMEIRIGNRIIRGVIKERQEAKQIYETAKQEGKTAALLEQ 167 Query: 82 FVQNDRIERPQGGGGGSGSGQ---GQASQDGEGQDEFVFQISKDEYLDLLFEDLALPNLK 138 N + G S + EG D + + + + Sbjct: 168 ERANLFTQSLANIVPGETIEVVIRYTNSLEFEGGDYEFVFPTVVGPRYIPGDQIDAAGNT 227 Query: 139 QNQQRQLTEYKTHRAGYTANGVPANISVVRSLQNSLA--RRTAMTAGKRRELHALEENLA 196 +G +I+V + R + ++ + LA Sbjct: 228 TRVADAAKITPPLLPPSQRSGNDISITVNLDAGVPIRNLRSPSHPILTSKKGEQTQVKLA 287 Query: 197 IISNSEPAQLLEEERL--RKEIAELRAKIERVPFIDTFDLRYKNYEKRPDPSSQAVMFCL 254 + L+ ++ ++ A L + ++ L + + + + L Sbjct: 288 NQTTIPNKDLILRYQVASKQTQATLLTQSDQRGGHFATYL-IPALKYKSNQIVPKDVVFL 346 Query: 255 MDVSGSMDQSTKDMAKRFYILLYLFLSRTYKNVEVVYIRHHTQ-----------AKEVDE 303 +D SGS +++ L+ + + ++ ++ Sbjct: 347 IDTSGSQSGPPIVQSRKLMTQFLDKLNPNDTFSIINFSNTTSKLSPKPLANTPANRKKAL 406 Query: 304 HEFFYSQETGGTIVSSALKLMDEVVKERYNPAQWNIYAAQASDGDNWADDSPLCHEILAK 363 GGT + + + V P +DG D + Sbjct: 407 EYIKKLDANGGTELMNGINT---VAAFPPAPDGRLRSVVLLTDG--LIGDDETIIAAVRD 461 Query: 364 KLLPVVRYYSYIEITRRAHQTLWREYEHLQSTFDNFAMQHIRDQDDIYPVFRELFHKQNA 423 +L P R Y + ++ L + + + + Sbjct: 462 RLKPGNRIYPFGVGFST-NRFLLDRLAEVGRGTVEVVAPKDSAEKVAAKFVQTINKPVLT 520 Query: 424 T 424 Sbjct: 521 D 521 >UniRef50_B8AE57 Putative uncharacterized protein n=1 Tax=Oryza sativa Indica Group RepID=B8AE57_ORYSI Length = 585 Score = 107 bits (266), Expect = 1e-21, Method: Composition-based stats. Identities = 26/229 (11%), Positives = 63/229 (27%), Gaps = 33/229 (14%) Query: 209 EERLRKEIAEL--RAKIERVPFIDTFDLRYKNYEKRPDPS-SQAVMFCLMDVSGSMDQST 265 + ++ + LR + + ++DVSGSM Sbjct: 3 ADPVKVSTTTMLPTIPRGHTNKDFRVLLRVEAPPMADLKGHVPIDVVAVLDVSGSMGDPA 62 Query: 266 K--------------DMAKRFYILLYLFLSRTYKNVEVVYIRH------------HTQAK 299 D+ K + L + V + + Sbjct: 63 MASSDFEKNKPPSRLDVLKEAMKFIIRKLDDGDRLSIVAFNDRPVKEYSTGLLNISGNGR 122 Query: 300 EVDEHEFFYSQETGGTIVSSALKLMDEVVKERYNPAQWNI-YAAQASDGDNWADDSPLCH 358 + E + + + GGT + AL+ V+ R ++ ++ + +DGD+ + Sbjct: 123 RIAEKKVDWLEARGGTALMPALEEAIRVLDCRPGDSRNSVGFILLLTDGDDTSGFRWS-- 180 Query: 359 EILAKKLLPVVRYYSYIEITRRAHQTLWREYEHLQSTFDNFAMQHIRDQ 407 + + +++ + + L +F D+ Sbjct: 181 RDVINGAVGKYPVHTFGLGAAHSSEALL-HIAQESRGTYSFVDDENMDK 228 >UniRef50_A0EFJ5 Chromosome undetermined scaffold_93, whole genome shotgun sequence n=2 Tax=Paramecium tetraurelia RepID=A0EFJ5_PARTE Length = 610 Score = 107 bits (266), Expect = 1e-21, Method: Composition-based stats. Identities = 32/270 (11%), Positives = 78/270 (28%), Gaps = 25/270 (9%) Query: 171 QNSLARRTAMTAGKRRELHALEENLAIISNSEPAQLLEEERLRKEIAELRAKIERVPF-- 228 Q + + G + E E+ + I + L + ++ Sbjct: 43 QEIVKSLLSEQQGMQLEEKVEEQTVITIDQEITDPDALQVELLNSVHLNVLPRQKAIQVQ 102 Query: 229 ----IDTFDLRYKNYEKR-PDPSSQAVMFCLMDVSGSMDQSTKDMAKRFYILLYLFLSRT 283 I L+ ++ + + + + C++DVSGSM+ + + + L T Sbjct: 103 EYSQILPVVLQIQSLKSQLKKQRANIDLMCVVDVSGSMNGEKIKLVQNSLRYIQKILKPT 162 Query: 284 YKNVEVVYIRHHTQAKEVDEHEFFY----------SQETGGTIVSSALKLMDEVVKERYN 333 + V + + + + T ++S + L ++++R Sbjct: 163 DRLALVTFGTQAGINLQWTRNIAENKKKIKKAIKDIKIRDSTNIASGVALGLRMIRDRKF 222 Query: 334 PAQWNIYAAQASDG-DNWADDSPLCHEILAKKLLPV---VRYYSYIEITRRAHQTLWREY 389 SDG D+ C + L + + + + Y + Sbjct: 223 KNPVTS-MFVLSDGVDDDRGADLRCQQALHQYNIQDTLTINTFGYG---SDHDAKVMNNI 278 Query: 390 EHLQSTFDNFAMQHIRDQDDIYPVFRELFH 419 +L+ + Q R + + Sbjct: 279 ANLKGGQFVYIDQIQRVSEHFILAMSGMLS 308 >UniRef50_B4VXM6 Vault protein inter-alpha-trypsin n=1 Tax=Microcoleus chthonoplastes PCC 7420 RepID=B4VXM6_9CYAN Length = 928 Score = 106 bits (265), Expect = 1e-21, Method: Composition-based stats. Identities = 51/445 (11%), Positives = 110/445 (24%), Gaps = 59/445 (13%) Query: 1 MTWFIDRRLNGKNKSMVNRQRFLRRYKAQIKQSISEAINKR--------SVTDVDSGESV 52 M I R N N + RQ + Y+ KQ + + ++ S+ ++ GE + Sbjct: 200 MEIRIGDRTNKGN--IKKRQEAVAIYEQAKKQGRTAGLLEQERANIFTQSLANIQPGEQI 257 Query: 53 SIPTEDISEPMFHQGRGGLRHRVHPGNDHFVQNDRIERPQGGGGGSGSGQGQASQDGEGQ 112 + F G + V I G G A Sbjct: 258 DVIIRYTDSLKFTGGSYEFVFPMV------VGARYIPGTTIDENTLGGGSAPAPMTLNKD 311 Query: 113 DEFVFQISKDEYLDLLFEDLALPNLKQNQQRQLTEYKTHRAGYTANGVPANISVVRSLQN 172 + V S+ L + P + +T P++ Q Sbjct: 312 TDLVPDASR------LNAPILPPGTRSGHNINVTVDIEAGVEIKEVHSPSH-------QI 358 Query: 173 SLARRTAMTAGKRRELHALEENLAIISNSEPAQLLEEERLRKEIAELRAKIERVPFIDTF 232 + R+ + I+ + + ++ Sbjct: 359 QIERQDQGMRVTLSRRDTIPNKDLILRYQ---VAGDRTQTTVLSQADTRGGHFAVYLIP- 414 Query: 233 DLRYKNYEKRPDPSSQAVMFCLMDVSGSMDQSTKDMAKRFYILLYLFLSRTYKNVEVVYI 292 E P + L+D SGS + + L+ + + Sbjct: 415 -----AIEYNPHQLVPKDVVFLIDTSGSQSGEPLNKCQELMRRFINGLNPHDTFTIIDFS 469 Query: 293 -----------RHHTQAKEVDEHEFFYSQETGGTIVSSALKLMDEVVKERYNPAQWNIYA 341 + Q + + +GGT + ++ + + Sbjct: 470 DTTRQLSPVPLANTVQNRNSAMNYINQLNASGGTQLRRGIQAVLNFPE---VDPGRLRSI 526 Query: 342 AQASDGDNWADDSPLCHEILAKKLLPVVRYYSYIEITRRAHQTLWREYEHLQSTFDNFAM 401 +DG + + + + L R +S+ ++ L + Sbjct: 527 VLLTDG--YIGNENQILAEVQRHLKLGNRLHSFGAG-SSVNRFLLNRIAEIGRG----IS 579 Query: 402 QHIRDQDDIYPVFRELFHKQNATAK 426 + +R + V + F + N Sbjct: 580 RIVRYDEPTEEVAEQFFGQINNPVL 604 >UniRef50_Q8YP40 Alr4360 protein n=15 Tax=Cyanobacteria RepID=Q8YP40_ANASP Length = 427 Score = 106 bits (265), Expect = 1e-21, Method: Composition-based stats. Identities = 29/204 (14%), Positives = 53/204 (25%), Gaps = 27/204 (13%) Query: 238 NYEKRPDPSSQAVMFCLMDVSGSMDQSTKDMAKRFYILLYLFLSRTYKNVEVVYIRHHTQ 297 ++ + + + ++D SGSM M L L + V + T Sbjct: 31 AVAEQFEQNLPLNLCLILDQSGSMHGQPLKMVVEAVEKLLDRLQPGDRISVVAFAGSATV 90 Query: 298 AKE---------VDEHEFFYSQETGGTIVSSALKLMDEVVKERYNPAQWNIYAAQASDGD 348 + Q +GGT+++ L+ + + A +DG Sbjct: 91 IIPNQIVENPESIKTQIRKKLQASGGTVIAEGLQQGITELMK--GTRGAVSQAFLLTDGH 148 Query: 349 NWA-----------DDSPLCHEILAKKLLPVVRYYSYIEITRRAHQTLWREYEHLQSTFD 397 DDS C E K + + +Q L Sbjct: 149 GEDSLKIWKWEIGPDDSRRCLEFAKKAAKINLTINTLGFGN-NWNQDLLETIADAGGGTL 207 Query: 398 NFAMQHIRDQDDIYPVFRELFHKQ 421 + + F LF + Sbjct: 208 AHIER----PEQAVHHFNRLFTRV 227 >UniRef50_Q0AMP5 Vault protein inter-alpha-trypsin domain protein n=1 Tax=Maricaulis maris MCS10 RepID=Q0AMP5_MARMM Length = 740 Score = 106 bits (264), Expect = 1e-21, Method: Composition-based stats. Identities = 50/438 (11%), Positives = 97/438 (22%), Gaps = 51/438 (11%) Query: 5 IDR-RLNGKNKSMV----NRQRFLRRYKAQIKQSISEAINKRSVTDVDSGESVSIPTEDI 59 +DR R+ ++ + RQ R Y+A ++ ++ ++ + +I + Sbjct: 120 VDRLRMQVGDRFIEGEIQERQAARRTYEAARANGQRASLVEQERPNMFTTSVANIGPGET 179 Query: 60 SEPMFHQGRGGLRHRVHPGNDHFVQNDRIERPQGGGGGSGSGQGQASQDGEGQDEFVFQI 119 F + P GG S + D V Sbjct: 180 IIVQFEYQDVARFVDGRFQLTQPLGLTPRYIPDGGDFQMVSTDSSSVPDASRITPPVMPA 239 Query: 120 SKDEYLDLLFEDLALPNLKQNQQRQLTEYKTHRAGYTANGVPANISVVRSLQNSLARRTA 179 S + + L LP + Y A V + A Sbjct: 240 SLE-----PRDQLRLPVTITADLDAGYALGEIASLYHATLVERRSD------GTARISLA 288 Query: 180 MTAGKRRELHALEENLAIISNSEPAQLLEEERLRKEIAELRAKIERVPFIDTFDLRYKNY 239 + + A + + ++ L Sbjct: 289 DGPIP-------------ANRDFVLTWRAADPSEASAALFIEEWQGETYLLAQILPPAEL 335 Query: 240 EKRPDPSSQAVMFCLMDVSGSMDQSTKDMAKRFYILLYLFLSRTYKNVEVVY-------- 291 P ++D SGSM ++ A+ I L + + + Sbjct: 336 G-ADTPRRARETIFVIDNSGSMGGASMRQARAALITALQRLEPGDRFNVIRFDNTMEQVF 394 Query: 292 ---IRHHTQAKEVDEHEFFYSQETGGTIVSSALKLMDEVVKERYNPAQWNIYAAQASDGD 348 + + GGT++ AL + +DG Sbjct: 395 PQAVDASPDNVATALTFARRLEAQGGTVMLPALNAALR--DTSPDDDSRVRQIVFLTDG- 451 Query: 349 NWADDSPLCHEILAKKLLPVVRYYSYIEITRRAHQTLWREYEHLQSTFDNFAMQHIRDQD 408 A + + L R + + L I Sbjct: 452 --AIGNEAELFAAIEAGLGRSRLFPVGIG-SAPNGYFMSRAARLGRGT----STQIGQVS 504 Query: 409 DIYPVFRELFHKQNATAK 426 ++ ELF Sbjct: 505 EVEARMEELFTALERPVM 522 >UniRef50_UPI000180D3E0 PREDICTED: similar to LOC779593 protein n=1 Tax=Ciona intestinalis RepID=UPI000180D3E0 Length = 1012 Score = 106 bits (264), Expect = 2e-21, Method: Composition-based stats. Identities = 36/385 (9%), Positives = 87/385 (22%), Gaps = 65/385 (16%) Query: 97 GSGSGQGQASQDGEGQDEFVFQISKDEYLDLLFEDLALPNLKQNQQRQLTEYKTHRAGYT 156 +G G + + F ++ E+ FE L++ + + Sbjct: 178 QTGKTAGHVAARDKSTRAFKTKLFIAEHEHATFELTYQQLLRRVMDVHIVDQNPITFVRV 237 Query: 157 ANGVPANISVVRSLQNSLARRTAMTAGKRRELHALEENLAIISNSEPAQLLEEERLRKEI 216 ++++ + +R ++ + P L Sbjct: 238 PPIRTEQVNLISPVDVPPGSTVTFGRNRRYSHVRYAPSIDHQTEYSPF--GLSGTLVVRY 295 Query: 217 AELRAKIERVPFIDTFDLRYKNYEKRPDPSSQAVMFCLMDVSGSMDQSTKDMAKRFYILL 276 R ++ I ++ + ++ ++D SGSM K+ Sbjct: 296 DVERTQMFGDIAIHNGFF-AHHFAPPSLAAFPKLVVFVVDTSGSMFGYKLKQVKQALADS 354 Query: 277 YLFLSRTYKNVEVVY------------IRHHTQAKEVDEHEFFYSQETGGTIVSSALKLM 324 L+ VV+ T++ GGT + AL+ Sbjct: 355 LRSLNNEDHFNIVVFGDTAEPWISGVLSTASTRSINDAITYVDAVSARGGTNMLVALQTA 414 Query: 325 DEVVKERYNP---------------------------------------AQWNIYAAQAS 345 +++ + + + Sbjct: 415 FAIMEPYLPSLPENETMVEDTTPFPTPVPLQPETNHFIRKRATETQTELSNYAKMIVFLT 474 Query: 346 DG----DNWADDSPLCHEILAKKLLPVVRYYSYIEITRRAHQTLWREYEHLQSTFDNFAM 401 DG D+ D + + + + L Sbjct: 475 DGRPTKDDVGTDDIASRIEKINGGRVNLHTIGFGSL---VDMRFLEKLAALNGG----VS 527 Query: 402 QHIRDQDDIYPVFRELFHKQNATAK 426 + + + D R F + +A Sbjct: 528 RRVFESLDAATQIRHFFDEVSAPVL 552 >UniRef50_B8BII0 Putative uncharacterized protein n=1 Tax=Oryza sativa Indica Group RepID=B8BII0_ORYSI Length = 585 Score = 106 bits (264), Expect = 2e-21, Method: Composition-based stats. Identities = 27/244 (11%), Positives = 57/244 (23%), Gaps = 34/244 (13%) Query: 205 QLLEEERLRKEIAELRAKIERVPFIDTFDLRYKNYEKRPDPSSQAVMFCLMDVSGSM--- 261 + LR E E + + + ++DVSGSM Sbjct: 4 YAAGKVTLRSEPKEKAIPSNEERK--EWPVLVHVVAPAKTERFPIDLVAVLDVSGSMTKA 61 Query: 262 ----DQSTKDMAKRFYILLYLFLSRTYKNVEVVYIR------------HHTQAKEVDEHE 305 + D+ K ++ L + V + T+ + + Sbjct: 62 TSMHGWTRLDLVKGAMKMVTNKLGAGDRLAIVPFNGKVVAAGATRLMEMTTKGRADANAK 121 Query: 306 FFYSQETGGTIVSSALKLMDEVVKERYNPAQWNI--YAAQASDGDNWADDSPLCHEILAK 363 + G T ALK ++ R + + SDG + Sbjct: 122 VNQLKAGGDTKFLPALKHASGLLDSRPAGDKQYRPGFIFLLSDGQDNGVLDDKL------ 175 Query: 364 KLLPVVRYYSYIEITRRAHQTLWREYEHLQSTFDNFAMQHIRDQDDIYPVFRELFHKQNA 423 +++ R + + + + +F Sbjct: 176 -GGVRYPAHTFGMCQSRCNPKSMVHIATATKGSYHPIDDKLSNVAQAL----AVFLSGIT 230 Query: 424 TAKG 427 +A Sbjct: 231 SAVA 234 >UniRef50_A4XHD9 von Willebrand factor, type A n=2 Tax=Clostridia RepID=A4XHD9_CALS8 Length = 909 Score = 106 bits (264), Expect = 2e-21, Method: Composition-based stats. Identities = 25/201 (12%), Positives = 50/201 (24%), Gaps = 21/201 (10%) Query: 229 IDTFDLRYKNYEKRPDPSSQAVMFCLMDVSGSMDQS------TKDMAKRFYILLYLFLSR 282 + L K K + + ++D SGSM S ++AK + L Sbjct: 386 VLEKMLPVKMQLKNKEKERNVAVVLVIDHSGSMGGSNLRNINKLEIAKSAAAKMIDHLES 445 Query: 283 TYKNVEVVYIRHHTQA--------KEVDEHEFFYSQETGGTIVSSALKLMDEVVKERYNP 334 + + + + A K Q GGT + L ++K+ Sbjct: 446 SDSVGVIAFDHNFYWASKFGKLKSKNEVIENISTIQVGGGTAIIPPLTEAVNLLKKSKAK 505 Query: 335 AQWNIYAAQASDGDNWADDSPLCHEILAKKLLPVVRYYSYIEITRRAHQTLWREYEHLQS 394 +D + +AK+ + + + S Sbjct: 506 D---KVIVLLTD-GYGEEGGYEYPASIAKRNNIKITTIGVG---SSINAPILSWMAAYTS 558 Query: 395 TFDNFAMQHIRDQDDIYPVFR 415 + D + Sbjct: 559 GRFYYVKDASNLIDVFLKEAK 579 Score = 46.8 bits (109), Expect = 0.001, Method: Composition-based stats. Identities = 25/141 (17%), Positives = 39/141 (27%), Gaps = 15/141 (10%) Query: 238 NYEKRPDPSSQAVMFCLMDVSGSMDQSTKDMAKRFYILLYLFLSRTYKNVEVVYIRHHTQ 297 + K S++ L D+S S D K F L ++ + VV T Sbjct: 54 SIPKLTISSNKVATIYLADLSDSNA-KNIDKMKDFIQKAIK-LKKSNEMQSVVVFGQDTN 111 Query: 298 AKEVDEHEFFYSQETGG------TIVSSALKLMDEVVKERYNPAQWNIYAAQASDGDNWA 351 E E G T + +A+K + P + +DG Sbjct: 112 V-EFSLKNNPKFSEFGSVVDSSETNIENAIKFAVNLF-----PKDFQKRLVILTDGKET- 164 Query: 352 DDSPLCHEILAKKLLPVVRYY 372 S L V+ Sbjct: 165 VGSAKTTIDLLSNNGIDVKVL 185 >UniRef50_C9SWV9 U-box domain containing protein n=1 Tax=Verticillium albo-atrum VaMs.102 RepID=C9SWV9_VERA1 Length = 662 Score = 106 bits (264), Expect = 2e-21, Method: Composition-based stats. Identities = 30/261 (11%), Positives = 67/261 (25%), Gaps = 33/261 (12%) Query: 172 NSLARRTAMTAGKRRELHALEENLAIISNSEPAQLLEEERLRKEIAELRAKIERVPFIDT 231 +S +R T+ + A + + Q ++ V Sbjct: 14 SSSSRATSNPSATPDSSVADDNMSITSEPATLVQDEIDDLTLSVHPLASRDGLLVKVEPP 73 Query: 232 FDLRYKNYEKRPDPSSQAVMFCLMDVSGSMDQ-----------------STKDMAKRFYI 274 R + P + + ++DVSGSMD S D+ K Sbjct: 74 TTPREALQSGKRIPRAPCDIVLVIDVSGSMDDAAPAPVIPGQKDENTGLSILDLTKHAAR 133 Query: 275 LLYLFLSRTYKNVEVVYIRHHT----------QAKEVDEHEFFYSQETGGTIVSSALKLM 324 + L + V + + K + + Q GT + + Sbjct: 134 TILETLDERDRLGIVAFTTNAKVILSLVEMNPDNKVSAKDKIENLQPLNGTNMWHGITEG 193 Query: 325 DEVVKERYNPAQWNIYAAQASDGDNWADDSPLCHEILAK---KLLPVVRYYSYIEITRRA 381 ++ + + + +DG + L + + +L + + + Sbjct: 194 IKLFSDCDSSSGRVPAMMVLTDGLPNSGCPRLGYIPKLRDMGQLPATIHTFGFGY---HI 250 Query: 382 HQTLWREYEHLQSTFDNFAMQ 402 L + + F Sbjct: 251 RSGLLKSIAEIGGGNYAFIPD 271 >UniRef50_D0LTR8 von Willebrand factor type A n=1 Tax=Haliangium ochraceum DSM 14365 RepID=D0LTR8_HALO1 Length = 903 Score = 106 bits (264), Expect = 2e-21, Method: Composition-based stats. Identities = 31/212 (14%), Positives = 56/212 (26%), Gaps = 26/212 (12%) Query: 223 IERVPFIDTFDLRYKNYEKRPDPSSQAVMFCLMDVSGSMDQSTKDMAKRFYILLYLFLSR 282 + +R+ + ++R P + ++D SGSM + AK LS Sbjct: 434 YQGTRIEKIMPVRFDSEKQREQP--HVAIALVVDRSGSMSGLKIEAAKESARATAEVLSP 491 Query: 283 TYKNVEVVYIRHHTQ--------AKEVDEHEFFYSQETGGTIVSSALKLMDEVVKERYNP 334 + V + T + + Q GGT + AL+ E+++ Sbjct: 492 SDLITVVAFDNQPTTIVRLQRASNRMRIATDIARLQAGGGTNIYPALREAYEILQGA--- 548 Query: 335 AQWNIYAAQASDGDNWADDSPLCHEILAKKLLPVVRYYSYIEITRRAHQTLWREYEHLQS 394 + SDG D + V A + L Sbjct: 549 NAKVKHVIVLSDGQ-APYDGIADLCQEMRSARITVSAVGIG----DADRNLLNLITDNGD 603 Query: 395 TFDNFAMQHIRDQDDIYPVFRELFHKQNATAK 426 +F K+ A+ Sbjct: 604 GRLYMTDD--------LAALPRIFMKETTEAQ 627 >UniRef50_Q0A603 von Willebrand factor, type A n=1 Tax=Alkalilimnicola ehrlichii MLHE-1 RepID=Q0A603_ALHEH Length = 972 Score = 106 bits (264), Expect = 2e-21, Method: Composition-based stats. Identities = 35/356 (9%), Positives = 86/356 (24%), Gaps = 24/356 (6%) Query: 88 IERPQGGGGGSGSGQGQASQDGEGQDEFVFQISKDEYLDLLFEDLALPNLKQNQQRQLTE 147 + G + + G + + ++ + + Q + Sbjct: 164 PDGCGNGCDYDMLADAEGAGYTLGHEWGHYVLALYDEYEGRDPAENRDTFPQVGDVPTSP 223 Query: 148 YKTHRAGYTANGVPANISVVRSLQNSLARRTAMTAGKRRELHALEENLAIISNSEPAQLL 207 + G ++ S TA + + E + + Sbjct: 224 AIMNSQWQARGGNYEWLNHSTSDNIGDPEDTAQGRVYGKSGWEVLVQPTTDDPQEGNETV 283 Query: 208 EEERLRKEIAELRAKIERVPFIDTFDLRYKNYEKRPDP----SSQAVMFCLMDVSGSMDQ 263 + +R R E A ++ T + + + + ++D SGSM Sbjct: 284 QPDRTRYTALEAVAPTAADNWVVTQLDQMDHGCRDELEIVWMDDDLEISLIVDTSGSMSG 343 Query: 264 STKDMAKRFYILLYLFLSRTYK-NVEVVYIRHH-------------TQAKEVDEHEFFYS 309 + A+ L + V + T K+ + Sbjct: 344 APIINARTAGRTLVDVVEPGRTAMGVVRFSASASVVHPMIAIPDPGTAEKDQLKDAIDSL 403 Query: 310 QETGGTIVSSALKLMDEVVKERYNPAQWN--IYAAQASD-GDNWADDSPLCHEILAKKLL 366 +G T + L L + +++ + A SD GDN + + + Sbjct: 404 PASGLTAMFDGLILGLDELQDYSAANDTDAGQVAFLLSDGGDNSSAATEPQTVQAYQDAN 463 Query: 367 PVVRYYSYIEITRRAHQTLWREYEHLQSTFDNFAMQHIRDQDDIYPVFRELFHKQN 422 + + Y + R + + + + + Sbjct: 464 VPIIAFGYGSFAPT---GVLRRLADNTGGEFFASPTTLAEIQEAFLAANAAVSDAV 516 >UniRef50_C1XMC3 Uncharacterized protein containing a von Willebrand factor type A (VWA) domain n=1 Tax=Meiothermus ruber DSM 1279 RepID=C1XMC3_MEIRU Length = 412 Score = 106 bits (264), Expect = 2e-21, Method: Composition-based stats. Identities = 32/224 (14%), Positives = 54/224 (24%), Gaps = 22/224 (9%) Query: 216 IAELRAKIERVPFIDTFDLRYKNYEKRPDPSSQA-VMFCLMDVSGSMDQSTKDMAKRFYI 274 I P LR + P + ++D SGSM S K I Sbjct: 16 IPLKPGVSATRPTRQQVLLRIHTPTPQARPERPLLNLALVLDRSGSMGGSKLKYTKEAAI 75 Query: 275 LLYLFLSRTYKNVEVVY---------IRHHTQAKEVDEHEFFYSQETGGTIVSSALKLMD 325 L + V+Y + + + G T + + Sbjct: 76 YAVHNLLPEDRVAVVIYDDAVEVLVPSTPVADGRAAIANLIRTIRTGGSTALHAGWLEGA 135 Query: 326 EVVKERYNPAQWNIYAAQASDG-DNWADDSPLCHEILAKKL-LPVVRYYSYIEITRRAHQ 383 V + N SDG N + +P ++L V + ++ Sbjct: 136 TQVAAYQEAGRLNRVV-LLSDGLANRGETNPGVIAEQVRELARRGVSTSTLGVGL-DYNE 193 Query: 384 TLWREYEHLQSTFDNFAMQHIRDQDDIYPVFRELFHKQNATAKG 427 L F +F ++ A G Sbjct: 194 DLMTTMADAGEGNYYFIESPADLP--------RIFAQELAGLAG 229 >UniRef50_A6R161 Predicted protein n=3 Tax=Onygenales RepID=A6R161_AJECN Length = 759 Score = 106 bits (264), Expect = 2e-21, Method: Composition-based stats. Identities = 30/280 (10%), Positives = 63/280 (22%), Gaps = 44/280 (15%) Query: 188 LHALEENLAIISNSEPAQLLEEERLRKEIAELRAK------IERVPFIDTFDLRYKNYEK 241 + E++ II + P + + + + + Sbjct: 7 AASSEDDFEIIDDQIPIRPRLSTSTTIAGERSPNEVGVQLHPLPDTNSMILSVHPPLHPE 66 Query: 242 RPDPSSQAVMFCLMDVSGSMDQST------------------KDMAKRFYILLYLFLSRT 283 + P + +DVS SM S D+ K + L+ Sbjct: 67 KEMPHVPCDIVLCIDVSYSMQSSAPLPTTDESGEREETGLSVLDLTKHAARTIIETLNEN 126 Query: 284 YKNVEVVYIRHHTQAKEVDE----------HEFFYSQETGGTIVSSALKLMDEVVKERYN 333 + V + E+ + + T + LKL + + + Sbjct: 127 DRLGIVAFSTEAEVVYEISKMNESSKKAALKAVEALKPLSSTNLWHGLKLGLKAFENERH 186 Query: 334 PAQWNIYAAQASDGDNWA-------DDSPLCHEILAKKLLPVVRYYSYIEITRRAHQTLW 386 Q +DG L +P++ + + L Sbjct: 187 TPQSVQALYVLTDGMPNHMCPKQGYVTKLRPILQLLGHRMPMIHTFGFGY---NIRSGLL 243 Query: 387 REYEHLQSTFDNFAMQHIRDQDDIYPVFRELFHKQNATAK 426 + + F L+ AK Sbjct: 244 QAIAEVGGGTFAFIPDAGMIGTVFVHAIANLYTTFATQAK 283 >UniRef50_D2H285 Putative uncharacterized protein (Fragment) n=1 Tax=Ailuropoda melanoleuca RepID=D2H285_AILME Length = 1230 Score = 106 bits (264), Expect = 2e-21, Method: Composition-based stats. Identities = 30/195 (15%), Positives = 54/195 (27%), Gaps = 20/195 (10%) Query: 243 PDPSSQAVMFCLMDVSGSMDQSTKDMAKRFYILLYLFLSRTYKNVEVVYIR-----HH-- 295 + L+D SGSM + K ++ L + + Sbjct: 354 NLRKTHGEFIFLIDRSGSMSGTNIHRVKDAMLVALKSLMPACLFNVIGFGSTFKTLFPSS 413 Query: 296 -TQAKEVDEHEFFYSQ----ETGGTIVSSALKLMDEVVKERYNPAQWNIYAAQASDGDNW 350 T ++E Q + GGT + S LK + R +DG Sbjct: 414 QTYSEESVAMACDNIQRMRADMGGTNILSPLKWIIRQPVHR----GHPRLLFLITDG--- 466 Query: 351 ADDSPLCHEILAKKLLPVVRYYSYIEITRRAHQTLWREYEHLQSTFDNFAMQHIRDQDDI 410 A ++ L + R YS+ L R + F ++ R Q + Sbjct: 467 AVNNTGKVLELLRNHAFSTRCYSFGIG-PNVCHRLVRGLATVSKGSAEFLVEGERLQPKM 525 Query: 411 YPVFRELFHKQNATA 425 ++ + Sbjct: 526 IKSLKKAMAPVLSDV 540 >UniRef50_A2F7N4 von Willebrand factor type A domain containing protein n=5 Tax=Trichomonas vaginalis RepID=A2F7N4_TRIVA Length = 722 Score = 106 bits (264), Expect = 2e-21, Method: Composition-based stats. Identities = 24/189 (12%), Positives = 52/189 (27%), Gaps = 19/189 (10%) Query: 247 SQAVMFCLMDVSGSMDQSTKDMAKRFYILLYLFLSRTYKNVEVVYIRH----------HT 296 S + + ++D SGSM S + A + L L + + + H Sbjct: 242 SNSEFYFIVDCSGSMSCSRINNAIKCMRLFIQSLPVGCRFSILRFGSHFETVLPPCDYTD 301 Query: 297 QAKEVDEHEFFYSQET-GGTIVSSALKLMDEVVKERYNPAQWNIYAAQASDGDNWADDSP 355 + + GGT + + L+ + ++ + +DG+ D+ Sbjct: 302 ENVANAMNLLDNISANMGGTNILAPLQHVSDL----QASEGFVKQIFFLTDGE---VDNS 354 Query: 356 LCHEILAKKLLPVVRYYSYIEITRRAHQTLWREYEHLQSTFDNFAMQHIRDQDDIYPVFR 415 A K R +S + A L + + + + + Sbjct: 355 DIICATALKNRSTNRIFSIGLGSG-ADPGLIKGMARKSGGNYAIIGDNDNMNEKVIEMLS 413 Query: 416 ELFHKQNAT 424 Sbjct: 414 SAISPALRD 422 >UniRef50_UPI00005A0386 PREDICTED: similar to loss of heterozygosity, 11, chromosomal region 2, gene A homolog n=1 Tax=Canis lupus familiaris RepID=UPI00005A0386 Length = 881 Score = 106 bits (263), Expect = 2e-21, Method: Composition-based stats. Identities = 30/195 (15%), Positives = 54/195 (27%), Gaps = 20/195 (10%) Query: 243 PDPSSQAVMFCLMDVSGSMDQSTKDMAKRFYILLYLFLSRTYKNVEVVYIR-----HH-- 295 + L+D SGSM + K ++ L + + Sbjct: 209 NLRKTHGEFIFLIDRSGSMSGTNIHRVKDAMLVALKSLMPACLFNVIGFGSTFKTLFPSS 268 Query: 296 -TQAKEVDEHEFFYSQ----ETGGTIVSSALKLMDEVVKERYNPAQWNIYAAQASDGDNW 350 T ++E Q + GGT + S LK + R +DG Sbjct: 269 QTYSEESVAMACDNIQRMRADMGGTNILSPLKWIIRQPVHR----GHPRLLFLITDG--- 321 Query: 351 ADDSPLCHEILAKKLLPVVRYYSYIEITRRAHQTLWREYEHLQSTFDNFAMQHIRDQDDI 410 A ++ L + R YS+ L R + F ++ R Q + Sbjct: 322 AVNNTGKVLELVRNHAFSTRCYSFGIG-PNVCHRLVRGLATVSKGSAEFLVEGERLQPKM 380 Query: 411 YPVFRELFHKQNATA 425 ++ + Sbjct: 381 IKSLKKAMAPVLSDV 395 >UniRef50_UPI0001788F71 von Willebrand factor type A n=1 Tax=Geobacillus sp. Y412MC10 RepID=UPI0001788F71 Length = 1007 Score = 106 bits (263), Expect = 2e-21, Method: Composition-based stats. Identities = 21/180 (11%), Positives = 42/180 (23%), Gaps = 14/180 (7%) Query: 243 PDPSSQAVMFCLMDVSGSMDQSTKDMAKRFYILLYLFLSRTYKNVEVVYIRHHTQ----- 297 + ++D SGSMD + ++AK + + V + Sbjct: 401 KREIPSLGLILVIDRSGSMDGNKIELAKESAMRTVELMRAKDTVGVVAFDDQPWWVVPPQ 460 Query: 298 ---AKEVDEHEFFYSQETGGTIVSSALKLMDEVVKERYNPAQWNIYAAQASDGDNWADDS 354 KE GGT + A+ ++E + +DG + + Sbjct: 461 KLGDKEEVLSSIQSIPSAGGTNIYPAVSSA---LEEMLKIDAQRRHIILMTDGQSAMNSG 517 Query: 355 PLCHEILAKKLLPVVRYYSYIEITRRAHQTLWREYEHLQSTFDNFAMQHIRDQDDIYPVF 414 + + + A L + F Sbjct: 518 YQDLTDTMVENKITMSSVAVGM---DADTNLLQSLADAAKGRYYFVEDETTLPAVFSREA 574 >UniRef50_B9XLE8 Vault protein inter-alpha-trypsin domain protein n=1 Tax=bacterium Ellin514 RepID=B9XLE8_9BACT Length = 723 Score = 106 bits (263), Expect = 2e-21, Method: Composition-based stats. Identities = 34/261 (13%), Positives = 70/261 (26%), Gaps = 26/261 (9%) Query: 177 RTAMTAGKRRELHALEENLAIISNSEPAQLLEEERLRKEIAELRAKIERVPFIDTFDLRY 236 + + T+ +L I N + R + + + ER + Y Sbjct: 328 KISKTSTTPEQLSVDLSEGDRIPNKDFVLRYRIAGERIKSNFMVHRDERGGYFTMML--Y 385 Query: 237 KNYEKRPDPSSQAVMFCLMDVSGSMDQSTKDMAKRFYILLYLFLSRTYKNVEVVYIRHHT 296 E + M ++D SGSM AK L + + H + Sbjct: 386 PPKELGQLGRAPMEMVFVLDCSGSMSGEPIAQAKAAIRHALKQLQPGDSFQIINFSEHAS 445 Query: 297 QAKEVDEHEF-----------FYSQETGGTIVSSALKLMDEVVKERYNPAQWNIYAAQAS 345 Q G T + +K + + + + + Sbjct: 446 QLGAKPLEATPENIRKGLAYVEALNSDGPTEMIEGIKAALDF----PHDPERLRFVCFLT 501 Query: 346 DGDNWADDSPLCHEILAKKLLPVVRYYSYIEITRRAHQTLWREYEHLQSTFDNFAMQHIR 405 DG + + + R +S+ ++ L + A+ H+ Sbjct: 502 DG---FIGNEAEILAAVHERIGASRIFSFGVG--SCNRYLLDHLAKMGGG----AVAHLG 552 Query: 406 DQDDIYPVFRELFHKQNATAK 426 D+ V + F + + A Sbjct: 553 LHDNGAKVMDDFFERVSHPAM 573 >UniRef50_A2DWC0 von Willebrand factor type A domain containing protein n=1 Tax=Trichomonas vaginalis RepID=A2DWC0_TRIVA Length = 729 Score = 105 bits (262), Expect = 3e-21, Method: Composition-based stats. Identities = 41/332 (12%), Positives = 93/332 (28%), Gaps = 27/332 (8%) Query: 111 GQDEFVFQISKDEYLDLLFEDLALPNLKQNQQRQLTEYKTHRAGYTANGVPANISVVRSL 170 G + + Y + L + N++ Q+ + A NG I V R Sbjct: 88 GSQSDEYTSTIQNYFGSGYSSFILGCIVPNKEVQIHLKASSNADINENGYFYKIPVNRQY 147 Query: 171 QNSLARRTAMTAGKRRELHA---LEENLAIISNSEPAQLLEEERLRKEIAELRAKIERVP 227 + ++ ++ + I + + +E I ++ Sbjct: 148 IPGKFEFSTKIKTRKEIKELIAPVDGIMNAIDLHNISFVTQELPKENSIFIETRIKDKDK 207 Query: 228 FIDTFDLRYKNYEKRP----DPSSQAVMFCLMDVSGSMDQSTKDMAKRFYILLYLFLSRT 283 I Y + + + + ++D SGSM++S AK LL L Sbjct: 208 SIAVSSDGYISISTYEYFEGKIYANSEFYFIIDCSGSMEESRIKNAKFCLNLLIHSLPVG 267 Query: 284 YKNVEVVYIR----------HHTQAKEVDEHEFFYSQET-GGTIVSSALKLMDEVVKERY 332 + + + + + + GT + S LK + + Sbjct: 268 CRFSIIKFGSMYEVVLPTCDYTDENVAKAMEQINQMDANMEGTDILSPLKFVSDQ----S 323 Query: 333 NPAQWNIYAAQASDGDNWADDSPLCHEILAKKLLPVVRYYSYIEITRRAHQTLWREYEHL 392 + +DG++ D L + R ++ + A + L + + Sbjct: 324 TKEGFIKQVFLLTDGEDIHTD---QIYALVQANRTNNRIFTIGIGSG-ADRNLIKNIARI 379 Query: 393 QSTFDNFAMQHIR-DQDDIYPVFRELFHKQNA 423 + + + I + R+ Sbjct: 380 SGGNNALIEDNDEKMNEKIIELLRKAISYAMT 411 >UniRef50_B1HPN0 Putative uncharacterized protein n=2 Tax=Bacillaceae RepID=B1HPN0_LYSSC Length = 825 Score = 105 bits (262), Expect = 3e-21, Method: Composition-based stats. Identities = 22/179 (12%), Positives = 44/179 (24%), Gaps = 15/179 (8%) Query: 243 PDPSSQAVMFCLMDVSGSMDQSTKDMAKRFYILLYLFLSRTYKNVEVVYI--------RH 294 + + ++D SGSM S ++AK L + + Sbjct: 361 KEQLPSLGLVIVLDRSGSMSGSKLELAKEAAARSVEMLRDEDTLGFIAFDDRPWEIIETG 420 Query: 295 HTQAKEVDEHEFFYSQETGGTIVSSALKLMDEVVKERYNPAQWNIYAAQASDGDNWADDS 354 KE GGT + +L E + + + +DG + + Sbjct: 421 PLNNKEEAVDTILSVTPGGGTEIYGSLAKAYENLADMKLQ---RKHIILLTDGQSQPG-N 476 Query: 355 PLCHEILAKKLLPVVRYYSYIEITRRAHQTLWREYEHLQSTFDNFAMQHIRDQDDIYPV 413 K + + + A L + S + + Sbjct: 477 YDDLIEQGKDNGITLSTVAIGQ---DADANLLEALSEMGSGRFYNVIDEQTIPSILSRE 532 >UniRef50_A6G415 von Willebrand factor, type A n=1 Tax=Plesiocystis pacifica SIR-1 RepID=A6G415_9DELT Length = 877 Score = 105 bits (261), Expect = 4e-21, Method: Composition-based stats. Identities = 51/407 (12%), Positives = 93/407 (22%), Gaps = 37/407 (9%) Query: 19 RQRFLRRYKAQIKQSISEAINKRSVTDVDSGESVSIPTEDISEPMFHQGRGGLRHRVHPG 78 RQRF +K I+ + + + + I I + + + R Sbjct: 134 RQRFHNSFKQPIEVRYVFPLPENA---AVDDMRMIIGERTIESEVETKAKAVERFADARE 190 Query: 79 NDHFVQNDRIERPQGGGGGSGSGQGQASQDGEGQDEFVFQISKDEYLDLLFEDLALPNLK 138 H ERP + + +F + P Sbjct: 191 AGHTAALLEQERPNVFTQSVTN-VAPGESVEVEVQYVQTLTQDGGNYEFVFPMVVGPRFS 249 Query: 139 QNQ---QRQLTEYKTHRAGYTANGVPANISVVRSLQNSLARRTAMTAGKRRELHALEENL 195 + T G ++S+ + + R + T + L Sbjct: 250 PPGTSAEAHAAVSPPIVGEGTRTGHDVSLSMTVAAGGKVQRWDSPTHTVVGSETSDGFAL 309 Query: 196 AIISNSEPAQLLEEERLRKEIAELRAKIERVPFIDTF---DLRYKNYEKRPDPSSQAV-- 250 + R + A+ +A P + P S Sbjct: 310 RLADQKTLPNRDFVVRWQSTAAQAKATAYFGPQLSQSVAGAQPGHFTLVVEPPQSDLDSL 369 Query: 251 -----MFCLMDVSGSMDQSTKDMAKRFYILLYLFLSRTYKNVEVVYIRHHTQAKEV---- 301 M ++D SGSM +AK+ L + + E Sbjct: 370 VGQREMIFVIDRSGSMSGVPLALAKQTLREALSHLRPVDTFNVISFESSTAMLYEAAVPA 429 Query: 302 -------DEHEFFYSQETGGTIVSSALKLMDEVVKERYNPAQWNIYAAQASDG----DNW 350 E Q GGT++S A+ + Y +DG ++ Sbjct: 430 NEQNLVHAERFIDGLQAGGGTMMSGAVDAAL----SPEIGLGRHRYVFFVTDGFISNEDE 485 Query: 351 ADDSPLCHEILAKKLLPVVRYYSYIEITRRAHQTLWREYEHLQSTFD 397 A K R + ++ L Sbjct: 486 IARQASALVRAADKAGQRARVFGMGIG-SSPNRELLASLSKAGKGRY 531 >UniRef50_B8AXM1 Putative uncharacterized protein n=1 Tax=Oryza sativa Indica Group RepID=B8AXM1_ORYSI Length = 614 Score = 105 bits (261), Expect = 4e-21, Method: Composition-based stats. Identities = 28/190 (14%), Positives = 58/190 (30%), Gaps = 23/190 (12%) Query: 239 YEKRPDPSSQAVMFCLMDVSGSMDQSTKDMAKRFYILLYLFLSRTYKNVEVVYIRH---- 294 + + ++DVSGSM+ + K + L + V + Sbjct: 20 PVLEGTARAGVDVVAVLDVSGSMEGERLEHVKEAMEIFIGKLGPDDRLSVVSFATSVRRL 79 Query: 295 ------HTQAKEVDEHEFFYSQETGGTIVSSALKLMDEVVKERYNPAQWN--IY--AAQA 344 Q + V + G T + +AL ++++R + Sbjct: 80 TELTYMSEQGRAVAKEIVDGLVADGSTNMGAALLEGAMILRDRKGARDESNGRVGCMMFL 139 Query: 345 SDGDNWADDSPLCHEILAKKLLPVVRYYSYIEITRRAHQTLWREYEHLQSTFDNFAMQHI 404 SDG N EI + + +++ + + R S +F ++I Sbjct: 140 SDGTND--------EIYKEDISGEFPAHTFGLG-SDHNPNVMRHIADETSATYSFVNRNI 190 Query: 405 RDQDDIYPVF 414 D + +F Sbjct: 191 ADIKGAFDLF 200 >UniRef50_A6Q2J6 von Willebrand factor type A domain protein n=1 Tax=Nitratiruptor sp. SB155-2 RepID=A6Q2J6_NITSB Length = 289 Score = 105 bits (261), Expect = 4e-21, Method: Composition-based stats. Identities = 26/188 (13%), Positives = 50/188 (26%), Gaps = 25/188 (13%) Query: 246 SSQAVMFCLMDVSGSMDQS------TKDMAKRFYILLYLFLSRTYKNVEVVYIRHH---- 295 + +D SGSM++S ++ K + V++ Sbjct: 74 RKGRDLVLALDASGSMEESLYDEKSKFEVVKSMAQNFFHK-RFDDNIGIVIFGSFAYIAA 132 Query: 296 --TQAKEVDEHEFFYSQ---ETGGTIVSSALKLMDEVVKERYNPAQWNIYAAQASDGD-N 349 T + + Y + T + L + ++ +DG N Sbjct: 133 PLTYDTKALDFLINYLEPSIAGNNTAIGEGLWQGIKALQADTAK---QKVLILITDGHHN 189 Query: 350 WADDSPLCHEILAKKLLPVVRYYSYIEITRRAHQTLWREYEHLQSTFDNFAMQHIRDQDD 409 SP AKKL + A + L + + + D Sbjct: 190 SGSISPRQAVEKAKKLGIKIYTIGLG----DADKHLLEQIAKESGGKFFY-AKSEEDLQS 244 Query: 410 IYPVFREL 417 I+ +L Sbjct: 245 IFSELNKL 252 >UniRef50_Q6ZED8 Slr7060 protein n=2 Tax=Chroococcales RepID=Q6ZED8_SYNY3 Length = 588 Score = 105 bits (261), Expect = 4e-21, Method: Composition-based stats. Identities = 24/217 (11%), Positives = 50/217 (23%), Gaps = 18/217 (8%) Query: 214 KEIAELRAKIERVPFIDTFDLRYKNYEKRPDPSSQAVMFCLMDVSGSMDQS-TKDMAKRF 272 + V + + P + ++D SGSM+ A++ Sbjct: 9 IPLKNAVCSERAVTLDLIIRITPPSPPAMDQPRPSLNLGFVIDRSGSMEGHNKITYARQA 68 Query: 273 YILLYLFLSRTYKNVEVVYIRH------HTQAKEVDEHEF--FYSQETGGTIVSSALKLM 324 LS ++ T K+ + + G T + Sbjct: 69 VCYAIDQLSPGDHLSVTIFDDQVQTLIPSTLVKDKAQFKRLVQGINPGGCTDLHGGWLQG 128 Query: 325 DEVVKERYNPAQWNIYAAQASDG-DNWADDSPLCHEILA---KKLLPVVRYYSYIEITRR 380 V + + A+ N SDG N + +P + Sbjct: 129 GIQVSQNLS-AELNR-IILLSDGLANRGETNPDIIATDVHGLAQRGASTTTLGLG---DD 183 Query: 381 AHQTLWREYEHLQSTFDNFAMQHIRDQDDIYPVFREL 417 ++ L + + + L Sbjct: 184 YNEDLLEAMARSGDGNYYYVADAEQLPTIFERELQGL 220 >UniRef50_B5JPY1 von Willebrand factor type A domain protein n=1 Tax=Verrucomicrobiae bacterium DG1235 RepID=B5JPY1_9BACT Length = 632 Score = 105 bits (261), Expect = 4e-21, Method: Composition-based stats. Identities = 30/302 (9%), Positives = 67/302 (22%), Gaps = 28/302 (9%) Query: 141 QQRQLTEYKTHRAGYTANGVPANISVVRSL---------QNSLARRTAMTAGKRRELHAL 191 + GY A + + L L Sbjct: 20 ELSPFEVSGPSTGGYRATSTLSATRIRTKLGATVGGAQDIRYLRNLIDEGIIPSPASFTA 79 Query: 192 EENLAIISNSEPAQLLEEERLRKEIAELRAKIERVPFIDTFDLRYKN--YEKRPDPSSQA 249 E + E + P +D + + Sbjct: 80 EGLFSEHDLPIGGDAKEGWLFDIASQATSFESAAQPKVDILAQLGFVSGIDATTFKPAPL 139 Query: 250 VMFCLMDVSGSMDQSTKDMAKRFYILLYLFLSRTYKNVEVVYIRH----------HTQAK 299 + ++D SGSM ++ ++ + L + V+Y T+ + Sbjct: 140 NLVAVVDKSGSMSGDPLELVRKSLRQVVSQLGSDDQLSIVLYGSSTHIHLEPTKTSTENR 199 Query: 300 EVDEHEFFYSQETGGTIVSSALKLMDEVVKERYNPAQWNIYAAQASDGD-NWADDSPLCH 358 + Q G T + + L+L +V ++ + +D N Sbjct: 200 DQIIASIDRIQSHGSTAMEAGLELGYQVARQSADAFVGKTRVMLFTDERPNVGRTDATGF 259 Query: 359 EILAK---KLLPVVRYYSYIEITRRAHQTLWREYEHLQSTFDNFAMQHIRDQDDIYPVFR 415 +A+ K + L + ++ F + Sbjct: 260 MAMAESGSKSDIGLTTIGVGV---HFGAELAEKISSVRGGNLFFFDDDESMETTFRKELD 316 Query: 416 EL 417 + Sbjct: 317 TM 318 >UniRef50_A2FKC6 von Willebrand factor type A domain containing protein n=4 Tax=Trichomonas vaginalis RepID=A2FKC6_TRIVA Length = 667 Score = 105 bits (261), Expect = 4e-21, Method: Composition-based stats. Identities = 24/168 (14%), Positives = 53/168 (31%), Gaps = 22/168 (13%) Query: 249 AVMFCLMDVSGSMDQSTKDMAKRFYILLYLFLSRTYKNVEVVY----------IRHHTQA 298 + F ++D SGSM+ D A + L+ L + + + ++ + Sbjct: 239 SDYFFVIDCSGSMEGKLIDKAVKCMRLMLQSLPMKCRFSIYCFGYNFRQLLPIVEYNNEN 298 Query: 299 KEVDEHEFFYSQET-GGTIVSSALKLMDEVVKERYNPAQWNIYAAQASDGDNWADDSPLC 357 + + + GGT + + LK + +DG+ D+ Sbjct: 299 VLLAMNLIKNIKANMGGTNIYNPLKDIFSQ-------DGMLKKIFLLTDGE---VDNSEE 348 Query: 358 HEILAKKLLPVVRYYSYIEITRRAHQTLWREYEHLQSTFDNFAMQHIR 405 L +K Y+ + A L R + + + + + Sbjct: 349 IINLVEKNKAFGNIYTVGIGSG-ADPGLIRNLAEVTNGKWTYVLDNEN 395 >UniRef50_B9RR85 Inter-alpha-trypsin inhibitor heavy chain, putative n=1 Tax=Ricinus communis RepID=B9RR85_RICCO Length = 755 Score = 105 bits (261), Expect = 4e-21, Method: Composition-based stats. Identities = 28/214 (13%), Positives = 57/214 (26%), Gaps = 22/214 (10%) Query: 228 FIDTFDLRYKNYEKRPDPSSQAVMFCLMDVSGSMDQSTKDMAKRFYILLYLFLSRTYKNV 287 D F L ++ + + ++D+SGSM+ + K L+ Sbjct: 304 QRDMFYLYLFPGDQPNMKVFRKEIVFIVDISGSMEGKPLEGMKNAMSGALAKLNPKDSFN 363 Query: 288 EVVYIR----HHTQAKEVDEHEFFYSQ--------ETGGTIVSSALKLMDEVVKERYNPA 335 + + + + E + GGT +S L E+V N Sbjct: 364 IIAFNGETYLFSSLMELATEKTVERAVEWMNLNFIAGGGTNISVPLNQAMEMV---SNTQ 420 Query: 336 QWNIYAAQASDGDNWADDSP-LCHEILAKKLLPVV-RYYSYIEITRRAHQTLWREYEHLQ 393 +DG + + + + R Y++ T + R + Sbjct: 421 GSLPVIFLVTDGAVEDERHICDSMKKYVRGKGAICPRIYTFGIGT-YCNHYFLRMLATVC 479 Query: 394 STFDNFAMQHIRDQDDIYPVFRELFHKQNATAKG 427 D D + F + + Sbjct: 480 RGQY----DAAYDVDSVQARMEIFFSRGLSAVLA 509 >UniRef50_Q23FU3 von Willebrand factor type A domain containing protein n=1 Tax=Tetrahymena thermophila SB210 RepID=Q23FU3_TETTH Length = 755 Score = 104 bits (260), Expect = 4e-21, Method: Composition-based stats. Identities = 23/171 (13%), Positives = 51/171 (29%), Gaps = 17/171 (9%) Query: 246 SSQAVMFCLMDVSGSMDQSTKDMAKRFYILLYLFLSRTYKNVEVVYIRHH---------- 295 + CL+D SGSM + ++ L L + + V + Sbjct: 45 RLPVDIICLIDNSGSMAGKKAQLVRKSLKYLLKILEKGDQISLVSFSSTAKTLCPLTQVN 104 Query: 296 TQAKEVDEHEFFYSQETGGTIVSSALKLMDEVVKERYNPAQWNIYAAQASDGDNWADDSP 355 + K+ + GGT V K + +++ + + +DG+ DS Sbjct: 105 DENKQQIKSAIKQINGQGGTFVIPGFKEVTKIL-NSRKEQREQTFILLLTDGEFGDIDSG 163 Query: 356 LCHEILAK-----KLLPVVRYYSYIEITRRAHQTLWREYEHLQSTFDNFAM 401 + + + ++ Y+Y + + +E Sbjct: 164 KVIQNINRLFTQSEIQKTPYIYTYGYG-DDVNPEILQEIAQKFQGKYCLIS 213 >UniRef50_Q1D9B7 von Willebrand factor type A domain protein n=3 Tax=Cystobacterineae RepID=Q1D9B7_MYXXD Length = 476 Score = 104 bits (260), Expect = 5e-21, Method: Composition-based stats. Identities = 25/183 (13%), Positives = 51/183 (27%), Gaps = 13/183 (7%) Query: 245 PSSQAVMFCLMDVSGSMDQSTKDMAKRFYILLYLFLSRTYKNVEVVYIR---------HH 295 S + ++D SGSM AK+ L L+ + + Y Sbjct: 91 QRSPVNLALVIDRSGSMSGYKLAQAKQAARHLIGLLNDQDRLAIIHYGSDVKSLPSLEAT 150 Query: 296 TQAKEVDEHEFFYSQETGGTIVSSALKLMDEVVKERYNPAQWNIYAAQASDGDNW-ADDS 354 +E + GGT + + L + N SDG + Sbjct: 151 AANRERMFQYVDGIWDEGGTNIGAGLSAGRYQLSTAQRTYGVNR-LILMSDGQPTEGLTA 209 Query: 355 PLCHEILAKKLLP-VVRYYSYIEITRRAHQTLWREYEHLQSTFDNFAMQHIRDQDDIYPV 413 +A++L + + T ++ L + + + F + Sbjct: 210 DEELTRMARELRATGLTLSAIGVGT-DFNEDLMQAFAEYGAGAYGFLEDAAQLSTLFQKD 268 Query: 414 FRE 416 ++ Sbjct: 269 LQQ 271 >UniRef50_Q8LQ58 Os01g0640200 protein n=9 Tax=Poaceae RepID=Q8LQ58_ORYSJ Length = 589 Score = 104 bits (259), Expect = 5e-21, Method: Composition-based stats. Identities = 28/185 (15%), Positives = 48/185 (25%), Gaps = 18/185 (9%) Query: 246 SSQAVMFCLMDVSGSMDQSTKDMAKRFYILLYLFLSRTYKNVEVVYIRHHTQAKEV---- 301 + + ++DVSGSMD D K + LS + V + + T+ + Sbjct: 65 RAGLDLVAVIDVSGSMDGDRIDKVKTALQFVIRKLSDLDRLCIVTFCTNATRLCPLRFVT 124 Query: 302 ------DEHEFFYSQETGGTIVSSALKLMDEVVKERYNPAQWNIYAAQASDGDNWADDSP 355 + + G T + L+ VV R A + SDG Sbjct: 125 AAAQAELKALVDGLKAYGDTNMKGGLETGMSVVDGRSLAAGRAVSVMLMSDGYQNHGGDA 184 Query: 356 LCHEILAKKLLPVVRYYSYIEITRRAHQTLWREYEHLQ-STFDNFAMQHIRDQDDIYPVF 414 + V +S+ L N+ + Sbjct: 185 RDVHL----KNVPVYTFSFG---ASHDSNLLEAIARKSLGGTFNYVADSANLTGPFSQLL 237 Query: 415 RELFH 419 L Sbjct: 238 GGLLT 242 >UniRef50_Q4RF07 Chromosome 13 SCAF15122, whole genome shotgun sequence. (Fragment) n=2 Tax=Tetraodon nigroviridis RepID=Q4RF07_TETNG Length = 983 Score = 104 bits (259), Expect = 6e-21, Method: Composition-based stats. Identities = 29/233 (12%), Positives = 66/233 (28%), Gaps = 28/233 (12%) Query: 206 LLEEERLRKEIAELRAKIERVPFIDTFDLRYKNYEKRPDPSSQAVMFCLMDVSGSMDQST 265 L + + ID +D+R + + +S M L+D SGS+ T Sbjct: 39 LARYYPASPWMDARKTPS----KIDLYDVRRRPW-YIQGAASPKDMLILVDASGSVSGLT 93 Query: 266 KDMAKRFYILLYLFLSRTYKNVEVVYIRH-------------HTQAKEVDEHEFFYSQET 312 + + + LS V + + + K++ + Sbjct: 94 LKLIRTSVTEMLETLSDDDYVNVVYFNTQVKKTACFDHLVQANVRNKKLLKDAVQNITAK 153 Query: 313 GGTIVSSALKLMDEVVK-ERYNPAQWNIYAAQASDGDNWADDSPLCHEILAKKLL--PVV 369 G T + L+ E + + A N +DG + + +K V Sbjct: 154 GITNYTKGLEFAFEQLSVTNVSRANCNKIIMLFTDG------GEERAQAILEKYNADKKV 207 Query: 370 RYYSYIEITRRAHQTLWREYEHLQSTFDNFAMQHIRDQDDIYPVFRELFHKQN 422 R +++ + + + + I + ++ + Sbjct: 208 RIFTFSVGQHNYDKGPIQWMACSNKG-YFYEIPSIGAIRINTQEYLDVLGRPM 259 >UniRef50_A8M9M1 von Willebrand factor type A n=1 Tax=Caldivirga maquilingensis IC-167 RepID=A8M9M1_CALMQ Length = 474 Score = 104 bits (259), Expect = 6e-21, Method: Composition-based stats. Identities = 33/259 (12%), Positives = 70/259 (27%), Gaps = 25/259 (9%) Query: 170 LQNSLARRTAMTAGKRRELHALEENLAIISNSEPAQLLEEERLRKEIAELRAKIERVPF- 228 +++AR + R E+ + + I + + V Sbjct: 218 ALDNVAREPTIGNALRVS--RFTEHSNYPTYITGVREYRIGDPAYRIDLDKTSMNMVRKT 275 Query: 229 IDTFDLRYKNYEKRPDPSSQA-VMFCLMDVSGSMDQ-----STKDMAKRFYILLYLFL-S 281 + ++ R + + +D SGSM + D+AK + +L Sbjct: 276 FLNKPMSTRDIVVREYADVKLMDIVLCLDTSGSMKEFSGAYMKMDIAKEAIVKYIRYLSR 335 Query: 282 RTYKNVEVVYIRHHTQAK---------EVDEHEFFYSQETGGTIVSSALKLMDEVVKERY 332 + V++ E Y GGT +++AL+ ++ + Sbjct: 336 TNDRLSMVLFNFRADILWGPHSVKKYINEMEEMSRYIYPGGGTNIANALEKARIILSKSN 395 Query: 333 NPAQWNIYAAQASDGDNWADDSPLCHEILAKKLLPVVRYYSYIEITRRAHQTLWREYEHL 392 P N + +DG S C + K V + + L + Sbjct: 396 YP---NKHIICITDGRTVNASS--CIKEAVKLRRMGVTLSTVAVG-DNSDFDLLMRLSKI 449 Query: 393 QSTFDNFAMQHIRDQDDIY 411 + + Sbjct: 450 GNGLFIKINDISNLDKALI 468 >UniRef50_C7YL43 Putative uncharacterized protein n=2 Tax=Nectriaceae RepID=C7YL43_NECH7 Length = 764 Score = 104 bits (259), Expect = 6e-21, Method: Composition-based stats. Identities = 21/206 (10%), Positives = 48/206 (23%), Gaps = 33/206 (16%) Query: 225 RVPFIDTFDLRYKNYEKRPDPSSQAVMFCLMDVSGSMDQ---------------STKDMA 269 ++ P + ++DVSGSM S D+ Sbjct: 61 PDRKGLIVKVQPPTAPSAEIPHVPCDIVLVIDVSGSMAGAAPVPGEETNESTGLSILDLT 120 Query: 270 KRFYILLYLFLSRTYKNVEVVYI----------RHHTQAKEVDEHEFFYSQETGGTIVSS 319 K + ++ + + V + ++ KE + T + Sbjct: 121 KHAARTIIETMNESDRLGIVTFASKAKVVQPLLSMTSENKERSRGNVTSMRPIDATNLWH 180 Query: 320 ALKLMDEVVKERYNPAQWNIYAAQASDGDNWADDSPLCH---EILAKKLLPVVRYYSYIE 376 L ++ K + +DG + +L + + + Sbjct: 181 GLLEGIKLFKN--VKSSNVPAIMVLTDGMPNHMNPAAGFVPKLRAMGQLPASIHTFGFGY 238 Query: 377 ITRRAHQTLWREYEHLQSTFDNFAMQ 402 L + + F Sbjct: 239 ---HLRSGLLKSIAEIGGGNYAFIPD 261 >UniRef50_A6G2V8 von Willebrand factor, type A n=1 Tax=Plesiocystis pacifica SIR-1 RepID=A6G2V8_9DELT Length = 877 Score = 104 bits (259), Expect = 6e-21, Method: Composition-based stats. Identities = 44/433 (10%), Positives = 106/433 (24%), Gaps = 43/433 (9%) Query: 6 DRRLNGKNKSMVNRQRFLRRYKAQIKQSISEAINKRSVTDVDSGESVSIPTEDISEPMFH 65 +R + G+ K+ + Q + Y+ K + + ++ ++ + +IP E H Sbjct: 150 ERTIRGEMKTREDAQ---QTYEDAKKAGKAAGLLEQERPNIFTQRVANIPPGQTIEVSMH 206 Query: 66 QGRG-GLRHRVHPGNDHFVQNDRIERPQGGGGGSGSGQGQASQDGEGQDEFVFQISKDEY 124 + + V R G+ + + DE Sbjct: 207 VVQPLEQEDGRYELVLPTVVGPRFIPGTPLAQHRQPAPGENTGIAP---------NTDEV 257 Query: 125 LDLLFEDLALPNLKQNQQRQLTEYKTHRAGYTANGVPANISVVRSLQNSLARRTAMTAGK 184 D + + G + + + +++ + A Sbjct: 258 PDASRITAPVVPEGFTTCAHVEASVVIDTGLRPRRIQSKFHGIDIMRSGDVAAIELDADS 317 Query: 185 RRELHALEENLAIISNSEPAQLLEEERLRKEIAELRAKIERVPFIDTFDLRYKNYEKRPD 244 +A L ++ +A+ + + Sbjct: 318 DGAPV-----VANRDFVVSWDLGRDQPKAAIVAQPPTSEGGDGYFTLTVQPPEQVADEQ- 371 Query: 245 PSSQAVMFCLMDVSGSMDQSTKDMAKRFYILLYLFLSRTYKNVEVVYI-RHHTQAKEVDE 303 + + ++D SGSM D AK + + + + ++ Sbjct: 372 -AVARELVFVVDNSGSMGGLPMDTAKGLMRKALKDIRPDDTFTVLRFSESASGLSNKLLP 430 Query: 304 HEFFYSQET----------GGTIVSSALKLMDEVVKERYNPAQWNIYAAQASDGDNWADD 353 + GGT ++ +K V + +DG Sbjct: 431 ATQDNIEAGVDYVDAMQGMGGTQMTEGIKAALRV----PHDPDRLRVVMFLTDG---YIG 483 Query: 354 SPLCHEILAKKLLPVVRYYSYIEITRRAHQTLWREYEHLQSTFDNFAMQHIRDQDDIYPV 413 + L + R +S ++ L + +A + D PV Sbjct: 484 NEQAIFELIDDNIGDARLFSLGVG-GAPNRYLLDGMASVGRGAVTYA--GYDEPAD--PV 538 Query: 414 FRELFHKQNATAK 426 + + Sbjct: 539 IERFYERVATPVL 551 >UniRef50_Q24C76 von Willebrand factor type A domain containing protein n=2 Tax=Tetrahymena thermophila RepID=Q24C76_TETTH Length = 670 Score = 104 bits (259), Expect = 6e-21, Method: Composition-based stats. Identities = 26/205 (12%), Positives = 53/205 (25%), Gaps = 35/205 (17%) Query: 246 SSQAVMFCLMDVSGSMDQSTK------------------DMAKRFYILLYLFLSRTYKNV 287 + + + C++DVSGSM K D+ K ++ L Sbjct: 30 RTNSNICCVVDVSGSMSSEAKIINQSSQKSDENYSLSILDVVKHSIKMIVNTLGSEDYLS 89 Query: 288 EVVYIRHH----------TQAKEVDEHEFFYSQETGGTIVSSALKLMDEVVKERYNPAQW 337 V + K + + GGT + L ++ P Sbjct: 90 IVTFSDSANVLFDLLPMNDSNKTMAIEKIENLSTEGGTELWKGLNSALNILLNNKTPN-T 148 Query: 338 NIYAAQASDGDNWA---DDSPLCHEILAKKLLPVVRYYSYIEITRRAHQTLWREYEHLQS 394 N +DG D + + + KL + + + + ++ L + + Sbjct: 149 NQSIFLLTDGQPTDSGIDTNLVKFKQAYPKLNCTINTFGF---SSSSNSELMNKIAMEYN 205 Query: 395 TFDNFAMQHIRDQDDIYPVFRELFH 419 +F Sbjct: 206 GMFSFIPDASFIATAFANALANTLT 230 >UniRef50_D1CG77 von Willebrand factor type A; type II secretion system protein n=1 Tax=Thermobaculum terrenum ATCC BAA-798 RepID=D1CG77_THET1 Length = 643 Score = 104 bits (258), Expect = 8e-21, Method: Composition-based stats. Identities = 25/184 (13%), Positives = 53/184 (28%), Gaps = 13/184 (7%) Query: 244 DPSSQAVMFCLMDVSGSMDQSTKDMAKRFYILLYLFLSRTYKNVEVVYIRHH------TQ 297 + +D S SM+ A+ L LS K + + + Q Sbjct: 92 QNPDPIDVVLALDTSASMNDDAFTAAQDAAYGLINGLSPEDKVGLITFDKTARVIEPLAQ 151 Query: 298 AKEVDEHEFFYSQETGGTIVSSALKLMDEVVKERYNPAQWNIYAAQASDGDNWADDSPLC 357 + + GT + L L + V Q +DG N + ++ L Sbjct: 152 DHARVQESIQKLSRSVGTALYQGLSLAAQEVA----KGQNTKAIVLMTDGFNTSRNTTLE 207 Query: 358 HEILAKKLLPVVRYYSYIEITRRAHQTLWREYEHLQSTFDNFAMQHIRDQDDIYPVFREL 417 E +AK ++ + ++ + F+ ++ + Sbjct: 208 -EAVAKAQEVGASVFTVGFGK-KVDTQGLQKIANETGGEY-FSAPTNAQLRRVFADISQK 264 Query: 418 FHKQ 421 H++ Sbjct: 265 LHQE 268 >UniRef50_C1GWG1 von Willebrand factor type A domain containing protein n=1 Tax=Paracoccidioides brasiliensis Pb01 RepID=C1GWG1_PARBA Length = 773 Score = 104 bits (258), Expect = 8e-21, Method: Composition-based stats. Identities = 28/228 (12%), Positives = 53/228 (23%), Gaps = 38/228 (16%) Query: 234 LRYKNYEKRPDPSSQAVMFCLMDVSGSM------------------DQSTKDMAKRFYIL 275 + + ++ + +DVSGSM S D+ K Sbjct: 59 IHPPLHPEKDIRHVPCDIVLCIDVSGSMQLSAPLPTTDESGKREETGLSVLDLTKHAART 118 Query: 276 LYLFLSRTYKNVEVVYIRH----------HTQAKEVDEHEFFYSQETGGTIVSSALKLMD 325 + L+ + V + K+ Q T + LKL Sbjct: 119 IIETLNENDRLGVVTFSNDAEVAYKISHMDDTNKKAALEAVEALQPLASTNLWHGLKLGL 178 Query: 326 EVVKERYNPAQWNIYAAQASDGDNWA-------DDSPLCHEILAKKLLPVVRYYSYIEIT 378 V+ + Q +DG K LP++ + + Sbjct: 179 SVLGKVDLRPQNVQALYVLTDGQPNHMCPRQGYVPKLRPILERQKDRLPLIHTFGFGY-- 236 Query: 379 RRAHQTLWREYEHLQSTFDNFAMQHIRDQDDIYPVFRELFHKQNATAK 426 L + + +F L+ A+ Sbjct: 237 -DIRSGLLQSIAEVGGGTYSFIPDAGMIGTVFVHAIANLYTTFATQAR 283 >UniRef50_UPI0001C1630F hypothetical protein CRD_00534 n=2 Tax=Nostocaceae RepID=UPI0001C1630F Length = 587 Score = 104 bits (258), Expect = 8e-21, Method: Composition-based stats. Identities = 30/293 (10%), Positives = 74/293 (25%), Gaps = 20/293 (6%) Query: 138 KQNQQRQLTEYKTHRAGYTANGVPANISVVRSLQNSLARRTAMTAGKRRELHALEENLAI 197 + AN S + + LE Sbjct: 301 DPKKLSVFVSEGLTFNSLKKIPEFANSSFIPFGVPHNNPLVKFKWTTPEQQEGLELFAKF 360 Query: 198 ISNSEPAQLLEEERLRKEIAELRAKIERVPFIDTFDLRYKNYEKRPDPSSQAVMFCLMDV 257 + L + ++ +P + L ++ + D + ++D Sbjct: 361 AQSDPMQNLAPRMPPEVAQYLAQKQVPPIPSGEVLSLGQTFWKTQKDAGKTVYLMTVIDT 420 Query: 258 SGSMDQSTKDMAKRFYILLYLFLSRTYKNVEVVY----------IRHHTQAKEVDEHEFF 307 SGSM + K + ++ V Y + Sbjct: 421 SGSMSGGPLEAVKNGLRIASQQINPGNYVGLVSYGDQPINLVKLAPFDDLQHKRFLAGID 480 Query: 308 YSQETGGTIVSSALKLMD-EVVKERYNPAQWNIYAAQASDGDNWADDSPLCHEILAKKLL 366 + G T + + + E++++R Y +DG + + + + Sbjct: 481 GLEADGATAMYDGVMVGLSELLQQRKTNPNGKFYLLLLTDGQTNQGFNFEQVKEIIEYSG 540 Query: 367 PVVRYYSYIEITRRAHQTLWREYEHLQSTFDNFAMQHIRDQDDIYPVFRELFH 419 V +Y E+ ++ L+ + +++ + + LF Sbjct: 541 VRVYPIAYGEV----NEAELNAIAALRE-----STVKKGTPENVQELLKGLFQ 584 >UniRef50_Q2QZH3 Os11g0687100 protein n=79 Tax=Eukaryota RepID=Q2QZH3_ORYSJ Length = 633 Score = 104 bits (258), Expect = 9e-21, Method: Composition-based stats. Identities = 26/223 (11%), Positives = 62/223 (27%), Gaps = 36/223 (16%) Query: 214 KEIAELRAKIERVPFIDTFDLRYKNYEKRPDPSS-------QAVMFCLMDVSGSMD---- 262 + I + ++ + P + + ++DVSGSM+ Sbjct: 28 APVKVSTTPIFPTIPRGQTNKDFQVLLRVEAPPAADLNSHVPLDVVAVLDVSGSMNDPVA 87 Query: 263 ---------QSTKDMAKRFYILLYLFLSRTYKNVEVVYIRH------------HTQAKEV 301 S D+ K + L+ + V + + + Sbjct: 88 AASPKSNLQGSRLDVLKASMKFVIRKLADGDRLSIVAFNDGPVKEYSSGLLDVSGDGRSI 147 Query: 302 DEHEFFYSQETGGTIVSSALKLMDEVVKERYNPAQWNI-YAAQASDGDNWADDSPLCHEI 360 + Q GGT + AL+ +++ ER ++ ++ + +DGD+ Sbjct: 148 AGKKIDRLQARGGTALMPALEEAVKILDERQGSSRNHVGFILLLTDGDDT--TGFRWTRD 205 Query: 361 LAKKLLPVVRYYSYIEITRRAHQTLWREYEHLQSTFDNFAMQH 403 + +++ + L +F Sbjct: 206 AIHGAVFKYPVHTFGLGASHDPEALL-HIAQGSRGTYSFVDDD 247 >UniRef50_UPI000174639B hypothetical protein VspiD_18615 n=1 Tax=Verrucomicrobium spinosum DSM 4136 RepID=UPI000174639B Length = 868 Score = 104 bits (258), Expect = 9e-21, Method: Composition-based stats. Identities = 22/180 (12%), Positives = 43/180 (23%), Gaps = 15/180 (8%) Query: 234 LRYKNYEKRPDPSSQAVMFCLMDVSGSMDQSTKDMAKRFYILLYLFLSRTYKNVEVVYIR 293 L + + + + ++D SGSM +MAK I L+R + Sbjct: 398 LPVRLKAPDEEEKQSSALALVIDRSGSMSGEKLEMAKSAAIATAEVLTRNDSIGVYAFDS 457 Query: 294 HHTQAKEVDEHEFFY--------SQETGGTIVSSALKLMDEVVKERYNPAQWNIYAAQAS 345 + GGT + A ++ + + Sbjct: 458 EAHVVVPMTRLTSSSAVAGQIAGLTSGGGTNLHPAFTEARNALQRTKAK---IKHMIILT 514 Query: 346 DGDNWADDSPLCHEILAKKLLPVVRYYSYIEITRRAHQTLWREYEHLQSTFDNFAMQHIR 405 DG + + + + AH L + L + Sbjct: 515 DGQTSGQ-GYEALASQCRAEGVTISTVAIGDG---AHVGLLQAIASLGGGKSYTTLDAAN 570 Score = 43.7 bits (101), Expect = 0.013, Method: Composition-based stats. Identities = 18/141 (12%), Positives = 37/141 (26%), Gaps = 15/141 (10%) Query: 246 SSQAVMFCLMDVSGSMDQSTKDMAKRFYILLYLFLSRTYKNVEVVYIRHHTQAKEV---- 301 SSQ + +D S S+ + + L + + E Sbjct: 62 SSQRAVVLALDNSQSLGAEGVKTVLQKADDIRKALPGDVEVYTTGFGDEARLYSEAELKV 121 Query: 302 --DEHEFFYSQETGG-TIVSSALKLMDEVVKERYNPAQWNIYAAQASDGDNWADDSPLCH 358 D+ + G + + AL+ + PA + + DG Sbjct: 122 GGDKLWADALKTHGAQSNYAGALEYARSLF-----PAGTSRHVVFVGDGHETRGS---LM 173 Query: 359 EILAKKLLPVVRYYSYIEITR 379 E ++ V ++ Sbjct: 174 EAARAAMVADVHLHAVPVAGP 194 >UniRef50_Q1NTK1 Von Willebrand factor, type A n=2 Tax=delta proteobacterium MLMS-1 RepID=Q1NTK1_9DELT Length = 771 Score = 103 bits (257), Expect = 1e-20, Method: Composition-based stats. Identities = 23/196 (11%), Positives = 50/196 (25%), Gaps = 24/196 (12%) Query: 244 DPSSQAVMFCLMDVSGSMDQSTKDMAKRFYILLYLFLSRTYKNVEVVY-----------I 292 + + L+D SGSM + AK+ + L +++ + Sbjct: 260 EQPIPTSLAILLDCSGSMAGDSIAQAKQAISDMLNLLRPEDYCNLIMFGSEVKSVFPCQV 319 Query: 293 RHHTQAKEVDEHEFFYSQET-GGTIVSSALKLMDEV------VKERYNPAQWNIYAAQAS 345 GGT + AL ++ + PA+ + + Sbjct: 320 AADKTNITTLRRAIRAIDADMGGTEMQKALVETLKMSPIYKPPEVEVVPARISRNILLIT 379 Query: 346 DGDNWADDSPLCHEILAKKLLPVVRYYSYIEITRRAHQTLWREYEHLQSTFDNFAMQHIR 405 DG W D L + R ++ + + Sbjct: 380 DGQVWGDKQI-----LRRMAKSDHRVFTVGVG-GAVCEAFLHGLASQSGGACELVAPNEE 433 Query: 406 DQDDIYPVFRELFHKQ 421 + I + ++ + Sbjct: 434 MGEKIARQSKRVYAEV 449 >UniRef50_Q30SV4 von Willebrand factor, type A n=2 Tax=Campylobacterales RepID=Q30SV4_SULDN Length = 307 Score = 103 bits (257), Expect = 1e-20, Method: Composition-based stats. Identities = 32/210 (15%), Positives = 60/210 (28%), Gaps = 31/210 (14%) Query: 240 EKRPDPSSQAVMFCLMDVSGSMDQ---------------STKDMAKRFYILLYLFLSRTY 284 P + + +D SGSM+ S ++AK + Sbjct: 73 RVDPLNRNGKDIVLAIDASGSMNSTGFDFEGEAALPQKLSRFEIAKIVASEFIQK-RLSD 131 Query: 285 KNVEVVYIRHH------TQAKEVDEHEFFYS---QETGGTIVSSALKLMDEVVKERYNPA 335 V+Y T K + Y T + A+ + K Sbjct: 132 NVGIVLYGDFAFIASPITYEKNIIIEMLSYLNQGMAGQNTAIGEAIAMSLRAFKHSKAK- 190 Query: 336 QWNIYAAQASDGD-NWADDSPLCHEILAKKLLPVVRYYSYIEITRRAHQTLWREYEHLQS 394 + +DG+ N D SP +LAK+ + A + L ++ Sbjct: 191 --SKIVVLLTDGEHNSGDISPKDALVLAKEENIKIYTIGMGN-RGEADEALLKKIADESG 247 Query: 395 TFDNFAMQHIRDQDDIYPVFRELFHKQNAT 424 + + ++ +IY EL + + Sbjct: 248 GEFFY-ATNAKELKEIYEHIDELESSKIKS 276 >UniRef50_B5YCB4 von Willebrand factor type A domain protein n=2 Tax=Dictyoglomus RepID=B5YCB4_DICT6 Length = 890 Score = 103 bits (257), Expect = 1e-20, Method: Composition-based stats. Identities = 30/216 (13%), Positives = 56/216 (25%), Gaps = 20/216 (9%) Query: 217 AELRAKIERVPFIDTFDLRYKNYEKRPDPSSQAVMFCLMDVSGSMD-----QSTKDMAKR 271 + + L ++ S + ++D SGSM ++AK Sbjct: 359 DKSFSAGNYQGTPLEEILPVTLRPEQILKKSNVAIVIVLDASGSMGSYSGGDMKMELAKE 418 Query: 272 FYILLYLFLSRTYKNVEVVYI--------RHHTQAKEVDEHEFFYSQETGGTIVSSALKL 323 L+ L + + KE GGT + LK Sbjct: 419 SAQLVLDLLEEKDYFGLIAFDHSYQWIVPLQPLTNKEETASLISKISPGGGTALYPPLKS 478 Query: 324 MDEVVKERYNPAQWNIYAAQASDGDNWADDSPLCHEILAKKLLPVVRYYSYIEITRRAHQ 383 E + + + + +DG D + LAK V E A+ Sbjct: 479 AGEALIKAPIK---SKHIIAITDGQTEGGDFYNLVKYLAK-YKITVSTIGIGE---DANI 531 Query: 384 TLWREYEHLQSTFDNFAMQHIRDQDDIYPVFRELFH 419 L ++ + + + + L Sbjct: 532 PLLKDIANWGNGRFYHTWNIRNLPQLLLSETKALLR 567 >UniRef50_UPI0000F2D28F PREDICTED: hypothetical protein n=1 Tax=Monodelphis domestica RepID=UPI0000F2D28F Length = 998 Score = 103 bits (257), Expect = 1e-20, Method: Composition-based stats. Identities = 30/230 (13%), Positives = 66/230 (28%), Gaps = 24/230 (10%) Query: 212 LRKEIAELRAKIERVPFIDTFDLRYKNYEKRP----DPSSQAVMFCLMDVSGSMDQSTKD 267 ++++ +R ++ + + + + + + L+D SGSM Sbjct: 359 FQRKMEVIRKRLHKDIPHHSVLMLNFCPDLQQTHYSLRKTHGEFIFLIDRSGSMSGVNMH 418 Query: 268 MAKRFYILLYLFLSRTYKNVEVVYIR-----------HHTQAKEVDEHEFFYSQET-GGT 315 + K IL+ L T + + + + + GGT Sbjct: 419 LVKDAMILILKSLMPTCLFNIIGFGSTFKTLFPSSQVYSEDNLVSACKNIQHLRADMGGT 478 Query: 316 IVSSALKLMDEVVKERYNPAQWNIYAAQASDGDNWADDSPLCHEILAKKLLPVVRYYSYI 375 + S LK + + DG + ++ L + R YS+ Sbjct: 479 NILSPLK----WITRQPIHEGHPRLLFLLIDG---SVNNTGKVIELLRNNASTTRCYSFG 531 Query: 376 EITRRAHQTLWREYEHLQSTFDNFAMQHIRDQDDIYPVFRELFHKQNATA 425 +A L + + F Q R Q + ++ + Sbjct: 532 IG-PKACPRLVQGLAAVSRGSAEFLRQGERLQPKMIKSLKKAMAPVLSDV 580 >UniRef50_B8G546 von Willebrand factor type A n=6 Tax=Chloroflexi (class) RepID=B8G546_CHLAD Length = 418 Score = 103 bits (257), Expect = 1e-20, Method: Composition-based stats. Identities = 30/229 (13%), Positives = 64/229 (27%), Gaps = 20/229 (8%) Query: 207 LEEERLRKEIAELRAKIERVPFIDTFDLRYKNYEKRPDPSSQAVMFCLMDVSGSMDQSTK 266 LR + P + + P + + ++D SGSM + Sbjct: 2 SASVTLRCQWGRTPVPTSSTPQVVYLLVEAVAPAS-PTSALPLNLCFVLDRSGSMQGAKL 60 Query: 267 DMAKRFYILLYLFLSRTYKNVEVVYI--------RHHTQAKEVDEHEFFYSQETGGTIVS 318 + K + L V++ + E GGT +S Sbjct: 61 ESMKAATRRVIELLRPHDVAAIVIFDDTVQTLIPATPVGDRSALLAAVETITEAGGTAMS 120 Query: 319 SALKLMDEVVKERYNPAQWNIYAAQASDGDNWADDSPLCHEILAKKLLPVVRYYSYIEIT 378 ++ +++ P + + +DG W D P+C ++ VR + T Sbjct: 121 LGMQAAQTELQKHLGPDRISR-MLLLTDGQTWG-DEPICRDLARTLGQAGVRITALGLGT 178 Query: 379 RRAHQTLWREYEHLQSTFDNFAMQHIRDQDDIYPVFRELFHKQNATAKG 427 ++ L + + ++ F + A+ Sbjct: 179 -EWNEQLLDDIAAASDGYSDYIADPA--------QIETFFQQAVKEAQA 218 >UniRef50_Q8YNZ7 Alr4412 protein n=6 Tax=Cyanobacteria RepID=Q8YNZ7_ANASP Length = 820 Score = 103 bits (257), Expect = 1e-20, Method: Composition-based stats. Identities = 23/202 (11%), Positives = 51/202 (25%), Gaps = 21/202 (10%) Query: 237 KNYEKRPDPSSQAVMFCLMDVSGSMDQSTKDMAKRFYILLYLFLSRTYKNVEVVYI---- 292 + R D + L+D SGS + + L+ V + Sbjct: 287 PAIQYRQDQVVPKDVVFLIDTSGSQMGAPLMQCQELMRRFINGLNPDDTFSIVDFSDTTR 346 Query: 293 -------RHHTQAKEVDEHEFFYSQETGGTIVSSALKLMDEVVKERYNPAQWNIYAAQAS 345 ++ Q + + GGT + ++ + + Sbjct: 347 QLSPVPLANNAQNRTRAINYINQLSANGGTEMLRGIRAVLNFP---VTDPGRLRSIVLLT 403 Query: 346 DGDNWADDSPLCHEILAKKLLPVVRYYSYIEITRRAHQTLWREYEHLQSTFDNFAMQHIR 405 DG + + + + L R YS+ ++ L L Sbjct: 404 DG--YIGNENQILAEVQQHLKSGNRLYSFGAG-SSVNRFLLNRIAELGRGIAQII--RHD 458 Query: 406 DQDDIYPVFRELFHKQNATAKG 427 + D + + + + N Sbjct: 459 EPTD--EIVDKFYRQINNPVLA 478 >UniRef50_Q54CQ8 von Willebrand factor A domain-containing protein DDB_G0292740 n=1 Tax=Dictyostelium discoideum RepID=Y2740_DICDI Length = 910 Score = 103 bits (257), Expect = 1e-20, Method: Composition-based stats. Identities = 30/211 (14%), Positives = 60/211 (28%), Gaps = 26/211 (12%) Query: 230 DTFDLRYKNYEKRPDPSSQAVMFCLMDVSGSMDQSTKDMAKRFYILLYLFLSRTYKNVEV 289 F ++++ K + L+D SGSM + D A+R ++ L+ K Sbjct: 327 LNFFPKFESINKEDI-YQKGEFIFLIDCSGSMSGNPIDSARRALEIIIRSLNEQCKFNIY 385 Query: 290 VYIR------------HHTQAKEVDEHEFFYSQET-GGTIVSSALKLMDEVVKERYNPAQ 336 + + + V GGT + +K +++ + +P + Sbjct: 386 CFGSGFNKAFQEGSRKYDDDSLAVVNRYVSNISANLGGTELLQPIK---DILSKEIDP-E 441 Query: 337 WNIYAAQASDGDNWADDSPLCHEILAKKLLPVVRYYSYIEITRRAHQTLWREYEHLQSTF 396 + +DG A K R ++Y L + Sbjct: 442 YPRQIFILTDG---AVSDRSKLIEFVSKESKTTRIFTYGIG-SSVDVELVVGLSKACKGY 497 Query: 397 DNFAMQHIRDQDDIYPVFRELFHKQNATAKG 427 IR+ D+ +L Sbjct: 498 Y----TLIRNSSDMETEVMKLLSIAFEPTLS 524 >UniRef50_Q2BJ22 Putative uncharacterized protein n=1 Tax=Neptuniibacter caesariensis RepID=Q2BJ22_9GAMM Length = 445 Score = 103 bits (256), Expect = 1e-20, Method: Composition-based stats. Identities = 32/195 (16%), Positives = 56/195 (28%), Gaps = 20/195 (10%) Query: 241 KRPDPSSQAVMFCLMDVSGSMDQSTKDMAKRFYILLYLFLSRTYKNVEVVYIRHHTQAKE 300 ++ A + ++D SGSM AK I+ LS+ V Y Sbjct: 62 EQTQARIPANIAIVLDKSGSMQGDKLFRAKEAAIMAINRLSQNDIVSVVSYDSRVNVVVP 121 Query: 301 VDE--------HEFFYSQETGGTIVSSALKLMDEVVKERYNPAQWNIYAAQASDG-DNWA 351 + Q G T + + + +++ + + N SDG N Sbjct: 122 ATKVSDTNTIARAINRIQANGNTALFAGVSKGANELRKFLDLNKVNRVI-LLSDGLANIG 180 Query: 352 DDSPLCHEIL---AKKLLPVVRYYSYIEITRRAHQTLWREYEHLQSTFDNFAMQHIRDQD 408 +P L K V ++ L + F + D Sbjct: 181 PSTPNELGKLGLSLAKEGMSVTTIGLGLG---YNEDLMTQLAGFSDGNHAFV----ENAD 233 Query: 409 DIYPVFRELFHKQNA 423 D+ VF+ F + Sbjct: 234 DLARVFQYEFGDVLS 248 >UniRef50_D0LL92 von Willebrand factor type A n=1 Tax=Haliangium ochraceum DSM 14365 RepID=D0LL92_HALO1 Length = 430 Score = 103 bits (256), Expect = 1e-20, Method: Composition-based stats. Identities = 30/236 (12%), Positives = 51/236 (21%), Gaps = 16/236 (6%) Query: 202 EPAQLLEEERLRKEIAELRAKIERVPFIDTFDLRYKNYEKRPDPSSQAVMFCLMDVSGSM 261 PAQ + L + + ++D SGSM Sbjct: 6 TPAQRAGSVAVTVTPQYDLLPSNARELNLMVRLEGTGDAPATR--APLDLALVIDRSGSM 63 Query: 262 DQSTKDMAKRFYILLYLFLSRTYKNVEVVYIR----------HHTQAKEVDEHEFFYSQE 311 K + L L V Y + Q Sbjct: 64 SGDKLSDVKTAALELLETLQPEDTITLVSYSSDVSMHLMRTRADDAGQREARRALLALQA 123 Query: 312 TGGTIVSSALKLMDEVVKERYNPAQWNIYAAQASDG-DNWADDSPLCH-EILAKKLLPVV 369 GGT + L E ++ + + + + SDG N + P A V Sbjct: 124 RGGTALGPGLFRALEALEGASDRTRMS-HLMLFSDGIANAGEVRPSVLGARAAGAFGAGV 182 Query: 370 RYYSYIEITRRAHQTLWREYEHLQSTFDNFAMQHIRDQDDIYPVFRELFHKQNATA 425 + ++ L +F + + L Sbjct: 183 SVSTMGVGV-DYNEDLMTRLADQGGGRYHFIQDSEAIASILDDEMKGLVATVARGV 237 >UniRef50_Q46AG0 BatA n=3 Tax=Methanomicrobia RepID=Q46AG0_METBF Length = 317 Score = 103 bits (256), Expect = 1e-20, Method: Composition-based stats. Identities = 33/212 (15%), Positives = 51/212 (24%), Gaps = 31/212 (14%) Query: 241 KRPDPSSQAVMFCLMDVSGSMDQST-----KDMAKRFYILLYLFLSRTYKNVEVVY---I 292 + +MDVSGSM + AK +L L V + Sbjct: 81 PLEQTKEGVNVVLVMDVSGSMQAQDYTPSRLEAAKSSAEILINSLKSKDYAGIVTFESGA 140 Query: 293 RHHTQAKEVDEHEFFYSQ----ETGGTIVSSALKLMDEVVKERYNPAQWNIYAAQASDGD 348 E + + G T + L L ++ N SDG Sbjct: 141 TTAAYLSPYKEKVIEKLRNVAPKEGSTAIGDGLSLGIDMASSIPNK---KKVIILLSDGV 197 Query: 349 NWADD-SPLCHEILAKKLLPVVRYYSYI-----------EITRRA---HQTLWREYEHLQ 393 N A SP AK V + + + + Sbjct: 198 NNAGYISPDEAIQYAKANNIQVYTIGMGSNGNVLLGYDWFGNPQYAELDEATLQAIANDT 257 Query: 394 STFDNFAMQHIRDQDDIYPVFRELFHKQNATA 425 F + D+IY E ++ Sbjct: 258 GGKY-FKSIDDKTLDEIYKNISENIKREKEET 288 >UniRef50_D2VHB8 Predicted protein n=1 Tax=Naegleria gruberi RepID=D2VHB8_NAEGR Length = 755 Score = 103 bits (256), Expect = 2e-20, Method: Composition-based stats. Identities = 28/221 (12%), Positives = 57/221 (25%), Gaps = 28/221 (12%) Query: 233 DLRYKNYEKRPDPSSQAVMFCLMDVSGSMDQS--------------TKDMAKRFYILLYL 278 + K + C++DVSGSM S D+ K L Sbjct: 115 RIHVKVSPPTGGQRQPCNLVCILDVSGSMGSSAEDLSSSNENTGFSRLDLVKHSVRTLIE 174 Query: 279 FLSRTYKNVEVVYIRHHTQAKEVDE----------HEFFYSQETGGTIVSSALKLMDEVV 328 ++ + + + + + + + G T V L+L E Sbjct: 175 LMNEKDQISLIPFSDSARMELPLTKMDAVGKKKAIEKLEHLGPEGSTNVWDGLRLGMESS 234 Query: 329 KERYNPAQWNIYAAQASDGDNWADDSPL---CHEILAKKLLPVVRYYSYIEITRRAHQTL 385 A+ N +DG+ + E K+ +S+ L Sbjct: 235 LNNPLCAKTNTCLILFTDGEPNINPPRGIVPTLEKYIKEHPLNSTIHSFGFGYS-LDSAL 293 Query: 386 WREYEHLQSTFDNFAMQHIRDQDDIYPVFRELFHKQNATAK 426 ++ S ++ + + A+ Sbjct: 294 LKDIAMNGSGAYSYIPDCSMVGTTFVNMMSNILCTAVRRAE 334 >UniRef50_UPI0001A2C533 UPI0001A2C533 related cluster n=1 Tax=Danio rerio RepID=UPI0001A2C533 Length = 1222 Score = 103 bits (255), Expect = 2e-20, Method: Composition-based stats. Identities = 26/196 (13%), Positives = 52/196 (26%), Gaps = 20/196 (10%) Query: 241 KRPDPSSQAVMFCLMDVSGSMDQSTKDMAKRFYILLYLFLSRTYKNVEVVYI-------- 292 S Q L+D SGSM + K +++ L V + Sbjct: 345 TSDLRSIQGEFVFLIDRSGSMSGVNINRVKDAMVVILKSLFPACLFNIVGFGSKFKTLFS 404 Query: 293 ---RHHTQAKEVDEHEFFYSQET-GGTIVSSALKLMDEVVKERYNPAQWNIYAAQASDGD 348 + ++ + + GGT + + L + R +DG Sbjct: 405 TSQSYDEESLALACEYVKKIRADMGGTNILAPLNWILRQPMHR----GHPRLLFLLTDG- 459 Query: 349 NWADDSPLCHEILAKKLLPVVRYYSYIEITRRAHQTLWREYEHLQSTFDNFAMQHIRDQD 408 A + L + R +++ A + L + F + R Q Sbjct: 460 --AVSNTGKVIELLRSHARFTRCFTFGIGQ-AACRRLVSGLSAVSRGTAEFLAEGERLQP 516 Query: 409 DIYPVFRELFHKQNAT 424 + ++ Sbjct: 517 KMIKSLKKCMTSVLTD 532 >UniRef50_UPI0001A2C532 UPI0001A2C532 related cluster n=2 Tax=Clupeocephala RepID=UPI0001A2C532 Length = 1236 Score = 103 bits (255), Expect = 2e-20, Method: Composition-based stats. Identities = 26/196 (13%), Positives = 52/196 (26%), Gaps = 20/196 (10%) Query: 241 KRPDPSSQAVMFCLMDVSGSMDQSTKDMAKRFYILLYLFLSRTYKNVEVVYI-------- 292 S Q L+D SGSM + K +++ L V + Sbjct: 345 TSDLRSIQGEFVFLIDRSGSMSGVNINRVKDAMVVILKSLFPACLFNIVGFGSKFKTLFS 404 Query: 293 ---RHHTQAKEVDEHEFFYSQET-GGTIVSSALKLMDEVVKERYNPAQWNIYAAQASDGD 348 + ++ + + GGT + + L + R +DG Sbjct: 405 TSQSYDEESLALACEYVKKIRADMGGTNILAPLNWILRQPMHR----GHPRLLFLLTDG- 459 Query: 349 NWADDSPLCHEILAKKLLPVVRYYSYIEITRRAHQTLWREYEHLQSTFDNFAMQHIRDQD 408 A + L + R +++ A + L + F + R Q Sbjct: 460 --AVSNTGKVIELLRSHARFTRCFTFGIGQ-AACRRLVSGLSAVSRGTAEFLAEGERLQP 516 Query: 409 DIYPVFRELFHKQNAT 424 + ++ Sbjct: 517 KMIKSLKKCMTSVLTD 532 >UniRef50_C5YMJ6 Putative uncharacterized protein Sb07g002215 (Fragment) n=1 Tax=Sorghum bicolor RepID=C5YMJ6_SORBI Length = 423 Score = 103 bits (255), Expect = 2e-20, Method: Composition-based stats. Identities = 28/214 (13%), Positives = 49/214 (22%), Gaps = 37/214 (17%) Query: 248 QAVMFCLMDVSGSMDQSTKDMAKRFYILLYLFLSRTYKNVEVVYIRH----------HTQ 297 + ++DVSGSM + KR L L + V + Sbjct: 124 PLDLVTVLDVSGSMAGKKMERVKRAMGFLIDNLGSDDRLSVVAFSTDARRIIRLTRMSDD 183 Query: 298 AKEVDEHEFFYSQETGGTIVSSALKLMDEVVKERYNPAQWNIYAAQASDGDNW------- 350 K + +G T + L + V+ R + SDG + Sbjct: 184 GKAAAKRAVESLAASGSTNIRGGLDVAAMVLDGRRHKNAVASVI-LLSDGQDNQSMHHEY 242 Query: 351 ---------------ADDSPLCHEILAKKLLPV---VRYYSYIEITRRAHQTLWREYEHL 392 L + V +++ + Sbjct: 243 LPTSWVPKHSPAFSKGGYDVLVPPSFQRTAGGDHRCVTVHTFGFGI-DHDAAAMHYISEV 301 Query: 393 QSTFDNFAMQHIRDQDDIYPVFRELFHKQNATAK 426 + +F H QD L A+ Sbjct: 302 TGSTFSFIENHAVIQDAFARCIGGLLSVAVQKAR 335 >UniRef50_D1WZ12 VWA containing CoxE family protein n=13 Tax=Streptomyces RepID=D1WZ12_9ACTO Length = 1289 Score = 103 bits (255), Expect = 2e-20, Method: Composition-based stats. Identities = 46/339 (13%), Positives = 84/339 (24%), Gaps = 30/339 (8%) Query: 41 RSVTDVDSGESVSIPTEDISEPMFHQGRGGLRHRVHPGNDHFVQNDRIERPQGGGGGSGS 100 D + + T G G PG P Sbjct: 931 GRRGDQLPSGAARLATALDELYGAGHGEGSRGGLSGPGRTGSRGGREPSFPGVREWSEEL 990 Query: 101 GQGQASQDGEGQDEFVFQISKDEYLDLLFEDLALPNLKQNQQRQLTEYKTHRAGYTANGV 160 E + + L L A P++ E Y Sbjct: 991 AALFGPGVREEVLAAAAVTGRQDVLAELDPAAATPSV---------ELLRTILRYAGGLP 1041 Query: 161 PANISVVRSLQNSLARRTAMTAGKRRELHALEENLAIISNSEPAQLLEEERLRKEIAELR 220 A ++ +R L L R LA + +L LR +A R Sbjct: 1042 EARLAALRPLVRHLVDELTRQLTTRLRPALTGTMLARPTRRPGGRLDLPRTLRANLATAR 1101 Query: 221 AKIERVPFIDTFDLRYKNYEKRPDPSSQAVMFCLMDVSGSMDQSTKDMAKRFYILLYLFL 280 + + +++ R S+ + + DVSGSM+ ST A +L Sbjct: 1102 RTADGTVQVIPQKPVFRS---RARRSADWRLILVTDVSGSMEASTIWSALTASVLA---- 1154 Query: 281 SRTYKNVEVVYIRHHTQAKEVDEHEFFYSQ------ETGGTIVSSALKLMDEVVKERYNP 334 + ++ T+ ++ H GGT +++ L+ +++ Sbjct: 1155 --GVPTLSTHFLAFSTEVVDLTGHVHDPLSLLLEVSVGGGTHIAAGLRHARGLIEVPSRT 1212 Query: 335 AQWNIYAAQASDGDNWADDSPLCHEILAKKLLPVVRYYS 373 SD + + + Sbjct: 1213 -----LVVVISDFE-EGAPLAGLLAEVRALVTTGCHVLG 1245 >UniRef50_UPI0001C378BC von Willebrand factor, type A n=1 Tax=Ruminococcus flavefaciens FD-1 RepID=UPI0001C378BC Length = 565 Score = 103 bits (255), Expect = 2e-20, Method: Composition-based stats. Identities = 31/253 (12%), Positives = 72/253 (28%), Gaps = 23/253 (9%) Query: 178 TAMTAGKRRELHALEENLAIISNSEPAQLLEEERLRKEIAELRAKIERVPFIDTFDLRYK 237 A+ E+ LE + + +L + + +++ K Sbjct: 322 VAIGKLSDTEMKTLEMFADFCKSDKAKKLADSYGFNQMNDY--KGDNIKVNGNSWTQMQK 379 Query: 238 NYEKRPDPSSQAVMFCLMDVSGSMDQSTKDMAKRFYILLYLFLSRTYKNVEVVYIRHHTQ 297 ++ + ++D SGSM + + K +++ + V Y + Sbjct: 380 LWKTNKNSGKPIAAVFVLDTSGSMSGAPLNSLKASLRNSIKYINSSNYIGVVSYSSNVNV 439 Query: 298 AKEVDEH----------EFFYSQETGGTIVSSALKLMDEVVKE-RYNPAQWNIYAAQASD 346 E+ + +G T SAL ++++ + + SD Sbjct: 440 DLELAKFDLNQQAYFMGAVDSLTASGNTATFSALSQAMIMLRDFTKDNPNVSPMVFLLSD 499 Query: 347 GDNWADDSPLCHEILAKKLLPVVRYYSYIEITRRAHQTLWREYEHLQSTFDNFAMQHIRD 406 G + + + + Y A+ + + A D Sbjct: 500 GQSNSGSEFSDIDGAIATAQIPIYTIGY-----NANLNELKAISEINE-----AATINAD 549 Query: 407 QDDIYPVFRELFH 419 DD+ + LF+ Sbjct: 550 TDDVIYQLKNLFN 562 >UniRef50_Q3A188 von Willebrand factor type A domain protein n=1 Tax=Pelobacter carbinolicus DSM 2380 RepID=Q3A188_PELCD Length = 442 Score = 103 bits (255), Expect = 2e-20, Method: Composition-based stats. Identities = 30/190 (15%), Positives = 48/190 (25%), Gaps = 12/190 (6%) Query: 240 EKRPDPSSQAVMFCLMDVSGSMDQSTKDMAKRFYILLYLFLSRTYKNVEVVYI------- 292 R + ++D SGSM + A+ I LS VVY Sbjct: 57 APRTAQRPPVNLALVLDRSGSMSGNKIAKAREAAIEAVRRLSDGDLFSLVVYDDSVETLV 116 Query: 293 -RHHTQAKEVDEHEFFYSQETGGTIVSSALKLMDEVVKERYNPAQWNIYAAQASDG-DNW 350 E + G T + A+ V++ + N SDG N Sbjct: 117 PAQPVSDIGDIEARIRRIRPGGSTALFGAVSQGAAEVRKHSDAPYVNRVV-LLSDGLANV 175 Query: 351 ADDSPLCHEIL-AKKLLPVVRYYSYIEITRRAHQTLWREYEHLQSTFDNFAMQHIRDQDD 409 P L A L + + T ++ L + F Sbjct: 176 GPSRPADLARLGAALLKEGISVTTVGVGT-DFNEDLMTQLAERSDGNHYFVESSRDLPRI 234 Query: 410 IYPVFRELFH 419 ++ Sbjct: 235 FAAELGDVLS 244 >UniRef50_A9WI94 von Willebrand factor type A n=2 Tax=Chloroflexus RepID=A9WI94_CHLAA Length = 845 Score = 103 bits (255), Expect = 2e-20, Method: Composition-based stats. Identities = 24/191 (12%), Positives = 50/191 (26%), Gaps = 23/191 (12%) Query: 240 EKRPDPSSQAVMFCLMDVSGSMDQ----STKDMAKRFYILLYLFLSRTYKNVEVVYIRHH 295 + + ++D S SM S DMAK IL L + + + Sbjct: 386 PPPRPQRAPVSILFIIDRSASMSATFGISKFDMAKEAAILSLTTLQPGDRVGVLAFDTET 445 Query: 296 TQAKEV-----------DEHEFFYSQETGGTIVSSALKLMDEVVKERYNPAQWNIYAAQA 344 + + GGT + AL + + +A Sbjct: 446 IWTVPFRTVGEGVSLVELQDQIATMSLGGGTNIERALSVGLPALAN---EPYSTRHAVLL 502 Query: 345 SDGDNWADDSPL--CHEILAKKLLPVVRYYSYIEITRRAHQTLWREYEHLQSTFDNFAMQ 402 +DG +++++ P A+ + + + L + + F Sbjct: 503 TDGRSYSNNYPRYQQLVETARAAQITLSTIAIG---SDSDTELLNQLASWGNGRYYFVAD 559 Query: 403 HIRDQDDIYPV 413 + Sbjct: 560 ATDLPRITFQE 570 >UniRef50_A9AXC2 von Willebrand factor type A n=6 Tax=Chloroflexi (class) RepID=A9AXC2_HERA2 Length = 421 Score = 102 bits (254), Expect = 2e-20, Method: Composition-based stats. Identities = 26/227 (11%), Positives = 59/227 (25%), Gaps = 16/227 (7%) Query: 207 LEEERLRKEIAELRAKIERVPFIDTFDLRYKNYEKRPDPSSQAVMFCLMDVSGSMDQSTK 266 E +L +A + + L + ++D SGSM Sbjct: 2 AGEVQLTGTLARPALPALQTQQVVYLLLDITATPAVAHVQMPVNVSFVLDHSGSMKGDKM 61 Query: 267 DMAKRFYILLYLFLSRTYKNVEVVY--------IRHHTQAKEVDEHEFFYSQETGGTIVS 318 + + V++ + + E ++ GGT ++ Sbjct: 62 RCVREATQRALGLMGPQDIVSVVIFDHRRETIISAQPVRNVAALQAEVGKIKDAGGTKIA 121 Query: 319 SALKLMDEVVKERYNPAQWNIYAAQASDGDNWADDSPLCHEILAKKLLPVVRYYSYIEIT 378 AL+ ++ N + +DG + L K + Sbjct: 122 PALEAALNEIRRSQNANTISR-IILLTDGQTEGERDCLRLAEEIGKASVPLTALGVG--- 177 Query: 379 RRAHQTLWREYEHLQSTFDNFAMQHIRDQDDIYPVFRELFHKQNATA 425 ++ L E + + + +DI F+ + + Sbjct: 178 DDWNEDLLIEMANRSGGVAEY----FSNPNDIASFFQGAVQQAQSAV 220 >UniRef50_D2V0V6 Predicted protein n=1 Tax=Naegleria gruberi RepID=D2V0V6_NAEGR Length = 502 Score = 102 bits (254), Expect = 3e-20, Method: Composition-based stats. Identities = 31/263 (11%), Positives = 68/263 (25%), Gaps = 31/263 (11%) Query: 182 AGKRRELHALEENLAIISNSEPAQLLEEERLRKEIAEL----RAKIERVPFIDTFDLRYK 237 L E L+ + ++ + ++ + + Sbjct: 2 KVSESALQKFETLLSNFAPQSVNLSVQIDAGHLPTDQIGVQCEIPFLVRLLSGNLPPQEE 61 Query: 238 NYEKRPDPSSQAVMFCLMDVSGSMD--------QSTKDMAKRFYILLY-LFLSRTYKNVE 288 E + + ++D+SGSMD S K L FL+ Sbjct: 62 EAETTNVLKTPVNICLVLDISGSMDEPLKNRSKGSKLTACKSAIRELVTNFLTYKDTIHL 121 Query: 289 VVYIRHHTQA------KEVDEHEFFYSQETGGTIVSSALKLMDEVVKERYNPAQWNIYAA 342 + Y + V+ ++ G T ++SAL +++ P A Sbjct: 122 ITYSDSPKTVFTEKNKESVNLNDIDKISTEGSTNIASALHSAVDLLHNSNAPG--TKLIA 179 Query: 343 QASDGD-NWADDSP--------LCHEILAKKLLPVVRYYSYIEITRRAHQTLWREYEHLQ 393 SDG N + + + ++ + SY + + Sbjct: 180 FFSDGQCNVGETNLNIFGSGLLKKLKDYSEGKDDQIHISSYGVG-SDYDELWLQAIARTG 238 Query: 394 STFDNFAMQHIRDQDDIYPVFRE 416 + +D ++ Sbjct: 239 KGEYYYLEDETYAKDAFERSLKK 261 >UniRef50_UPI000179F51C Novel protein. n=1 Tax=Bos taurus RepID=UPI000179F51C Length = 1156 Score = 102 bits (254), Expect = 3e-20, Method: Composition-based stats. Identities = 30/195 (15%), Positives = 55/195 (28%), Gaps = 20/195 (10%) Query: 243 PDPSSQAVMFCLMDVSGSMDQSTKDMAKRFYILLYLFLSRTYKNVEVVYIR-----HH-- 295 ++ L+D SGSM+ + K ++ L T + + Sbjct: 347 NLRKTRGEFIFLVDRSGSMNGTNIQCVKDAMLVALKSLVPTCLFNVIGFGSTFKTLFPSS 406 Query: 296 -TQAKEVDEHEFFYSQ----ETGGTIVSSALKLMDEVVKERYNPAQWNIYAAQASDGDNW 350 T +E Q + GGT + S LK + R +DG Sbjct: 407 QTYNEENLAMACDSIQKMRADMGGTNILSPLKWVIRQPVLR----GCPRLLFLITDG--- 459 Query: 351 ADDSPLCHEILAKKLLPVVRYYSYIEITRRAHQTLWREYEHLQSTFDNFAMQHIRDQDDI 410 A ++ L + R YS+ L + + F + R Q + Sbjct: 460 AVNNTGKVLELVRNHAFSTRCYSFGIG-PNVCHRLVKGLATVSKGSAEFLEEGERLQPKM 518 Query: 411 YPVFRELFHKQNATA 425 ++ + Sbjct: 519 VKSLKKAMAPVLSDV 533 >UniRef50_Q5RHF3 Novel protein similar to vertebrate inter-alpha (Globulin) inhibitor H5 (ITIH5) (Fragment) n=2 Tax=Danio rerio RepID=Q5RHF3_DANRE Length = 906 Score = 102 bits (254), Expect = 3e-20, Method: Composition-based stats. Identities = 22/208 (10%), Positives = 54/208 (25%), Gaps = 24/208 (11%) Query: 239 YEKRPDPSSQAVMFCLMDVSGSMDQSTKDMAKRFYILLYLFLSRTYKNVEVVY-----IR 293 + R P + ++D S SM + K+ + L V + + Sbjct: 242 FAPRDLPVVPKNVVFVIDTSASMLGTKMKQTKQALFTIINELRPNDNFNFVTFSNRIRVW 301 Query: 294 HHTQAKEVD-------EHEFFYSQETGGTIVSSALKLMDEVVKERYNPAQWNIY-----A 341 + V + + TGGT ++ ++ ++ + + + + Sbjct: 302 QPGKLVPVTPISIRDAKKFIYMISVTGGTDINGGIQTGSALLSDYLSSKDESHHHSVSLI 361 Query: 342 AQASDGDNWADD--SPLCHEILAKKLLPVVRYYSYIEITRRAHQTLWREYEHLQSTFDNF 399 +DG SP + ++ L Sbjct: 362 IFLTDGRPTVGVLQSPTIISNTKTAVQEKFCLFTIGMG-DDVDYRLLERMSLDNCGT--- 417 Query: 400 AMQHIRDQDDIYPVFRELFHKQNATAKG 427 M+ I + D + + + + Sbjct: 418 -MRRIPEDADASLMLKGFYDEIGTPLLS 444 >UniRef50_Q23JA0 von Willebrand factor type A domain containing protein n=2 Tax=Tetrahymena thermophila SB210 RepID=Q23JA0_TETTH Length = 1049 Score = 102 bits (253), Expect = 3e-20, Method: Composition-based stats. Identities = 39/213 (18%), Positives = 60/213 (28%), Gaps = 24/213 (11%) Query: 226 VPFIDTFDLRYKNYEKRPDPSSQAVMFCLMDVSGSMDQSTKDMAKRFYILLYLFLSRTYK 285 F D F + SS++ L+D SGSM A L L Sbjct: 288 DIFSDEFQQKLNQELIDHLNSSRSEFIFLLDRSGSMSGQPIRRACEALTLFLKSLPNDSY 347 Query: 286 NVEVVYIR-----HHTQAKEVDEHEFFYS------QET-GGTIVSSALKLMDEVVKERYN 333 + + + K E Q GGT + + L V + Sbjct: 348 FNVISFGSSFDKLFPSSTKYTSESLEKAILLISKYQADLGGTEIYNPLN---NVFVQNKI 404 Query: 334 PAQWNIYAAQASDGDNWADDSPLCHEILAKKLLPVVRYYSYIEITRRAHQTLWREYEHLQ 393 +N +DG+ DSP L KK R +S + A Q L +E Sbjct: 405 -QGYNKQIFLLTDGE---VDSPQQVVRLIKKNNKYNRVHSIGFGSG-ADQYLIKESAIAG 459 Query: 394 STFDNFAMQHIRDQDDIYPVFRELFHKQNATAK 426 + + + D+ V + Sbjct: 460 KG----ISKLVDQKCDLSEVIINMLSLCITPTL 488 >UniRef50_A9DKM0 Putative uncharacterized protein (Fragment) n=1 Tax=Shewanella benthica KT99 RepID=A9DKM0_9GAMM Length = 167 Score = 102 bits (253), Expect = 3e-20, Method: Composition-based stats. Identities = 100/152 (65%), Positives = 123/152 (80%), Gaps = 1/152 (0%) Query: 1 MTWFIDRRLNGKNKSMVNRQRFLRRYKAQIKQSISEAINKRSVTDVDSGESVSIPTEDIS 60 M FIDRRLN + +S VNRQRF+ RYK QIK+++S+A+ +RSVTDVD GE +SIPT+DIS Sbjct: 16 MANFIDRRLNARGRSTVNRQRFINRYKQQIKKAVSDAVTRRSVTDVDKGERISIPTKDIS 75 Query: 61 EPMFHQGRGGLRHRVHPGNDHFVQNDRIERPQGGGGGSGSGQGQASQDGEGQDEFVFQIS 120 EP FHQG+GG+R RVHPGND F++ D+IER GGG GSGQG AS GEG D+FVFQIS Sbjct: 76 EPSFHQGQGGIRERVHPGNDQFIKGDKIER-PPGGGSQGSGQGDASNSGEGDDDFVFQIS 134 Query: 121 KDEYLDLLFEDLALPNLKQNQQRQLTEYKTHR 152 KDEYL+LLFEDL LPNL+ N+ +L EY+ +R Sbjct: 135 KDEYLELLFEDLELPNLQNNRLNKLVEYQVYR 166 >UniRef50_UPI00016DFBC7 UPI00016DFBC7 related cluster n=4 Tax=Takifugu rubripes RepID=UPI00016DFBC7 Length = 883 Score = 102 bits (253), Expect = 3e-20, Method: Composition-based stats. Identities = 18/208 (8%), Positives = 49/208 (23%), Gaps = 25/208 (12%) Query: 239 YEKRPDPSSQAVMFCLMDVSGSMDQSTKDMAKRFYILLYLFLSRTYKNVEVVYIRH-HTQ 297 + + + ++D SGSM + + I + L + + + Sbjct: 241 FAPKDLTRLPKNVVFVIDRSGSMSGTKMQQIQEAMIKILEDLHPEDHFGIIQFDSSVDSW 300 Query: 298 AKEVDEHEFFYSQET-----------GGTIVSSALKLMDEVV----KERYNPAQWNIYAA 342 + E T +++A+ +++ + + P + Sbjct: 301 RNSLSLATEENISEAMAYVNQISHKIQATNINAAVLKAVDMLVTDREAKRLPEKSIDMII 360 Query: 343 QASDGDNWADDSPLCH----EILAKKLLPVVRYYSYIEITRRAHQTLWREYEHLQSTFDN 398 +DGD D E + + + Y Sbjct: 361 LLTDGDPTTDIGETRIPVIQENVRNAIGGNMSLYGLGFGN-DVDYGFLDVMSRENKGLAR 419 Query: 399 FAMQHIRDQDDIYPVFRELFHKQNATAK 426 D + + + ++ Sbjct: 420 RIYTG----ADAALQLQGFYDEVSSPLL 443 >UniRef50_A5UTA6 von Willebrand factor, type A n=6 Tax=Chloroflexi (class) RepID=A5UTA6_ROSS1 Length = 425 Score = 102 bits (253), Expect = 3e-20, Method: Composition-based stats. Identities = 27/192 (14%), Positives = 55/192 (28%), Gaps = 14/192 (7%) Query: 242 RPDPSSQAVMFCLMDVSGSMDQSTKDMAKRFYILLYLFLSRTYKNVEVVYIRHHTQAKEV 301 + P + ++D S SM K + L VV+ Sbjct: 38 QQLPKLPLNLCLVLDRSSSMRGERLMQVKEAAARIVDQLGPDDYFSLVVFNDRADVVIPA 97 Query: 302 --------DEHEFFYSQETGGTIVSSALKLMDEVVKERYNPAQWNIYAAQASDGDNWADD 353 + + GGT ++ L L + V+ + + +DG + D+ Sbjct: 98 QRAIKKSDLKAAIAQIEAAGGTEMAQGLALALQEVQRPFLTRGISR-LILLTDGRTYGDE 156 Query: 354 SPLCHEILAKKLLPVVRYYSYIEITRRAHQTLWREYEHLQSTFDNFAMQHIRDQDDIYPV 413 S C EI + + + T ++ L +++ + Sbjct: 157 S-RCVEIARRGQSRGIGLTALGIGT-EWNEDLLETMTASENSRAQYIATAQDVVKVFADE 214 Query: 414 FREL---FHKQN 422 + L F +Q Sbjct: 215 VKRLHAIFAQQV 226 >UniRef50_A9Z1V5 von Willebrand factor A domain-containing protein 5B1 n=8 Tax=Amniota RepID=VW5B1_MOUSE Length = 1215 Score = 101 bits (252), Expect = 4e-20, Method: Composition-based stats. Identities = 26/193 (13%), Positives = 53/193 (27%), Gaps = 20/193 (10%) Query: 245 PSSQAVMFCLMDVSGSMDQSTKDMAKRFYILLYLFLSRTYKNVEVVYIR----------- 293 + L+D S SM ++ K ++ L + + Sbjct: 348 RKAHGEFIFLIDRSNSMSKTNIQCIKEAMLVALKSLMPACFFNIIGFGSTFKAVFASSRI 407 Query: 294 HHTQAKEVDEHEFFYSQET-GGTIVSSALKLMDEVVKERYNPAQWNIYAAQASDGDNWAD 352 ++ + + Q GGT + S LK + R +DG + Sbjct: 408 YNEENLTMACDCIQRMQADMGGTNMLSPLKWVLRQPLRR----GHPRLLFLITDG---SV 460 Query: 353 DSPLCHEILAKKLLPVVRYYSYIEITRRAHQTLWREYEHLQSTFDNFAMQHIRDQDDIYP 412 ++ L + R YS+ L + + F M+ R Q + Sbjct: 461 NNTGKVLELVRNHASSTRCYSFGIG-PTVCYRLVKGLASVSKGSAEFLMEGERLQPKMVK 519 Query: 413 VFRELFHKQNATA 425 ++ + Sbjct: 520 SLKKAMAPVLSDV 532 >UniRef50_Q2B7C3 Putative uncharacterized protein n=1 Tax=Bacillus sp. NRRL B-14911 RepID=Q2B7C3_9BACI Length = 920 Score = 101 bits (252), Expect = 4e-20, Method: Composition-based stats. Identities = 24/179 (13%), Positives = 46/179 (25%), Gaps = 14/179 (7%) Query: 243 PDPSSQAVMFCLMDVSGSMDQSTKDMAKRFYILLYLFLSRTYKNVEVVYIRHH-----TQ 297 + +MD SGSM S ++AK L + + T Sbjct: 397 KKEMPSLGLMIVMDRSGSMAGSKLELAKEAAARSVELLREKDTLGFIAFDDRPWVIVETG 456 Query: 298 AKEVDEHEFFYS---QETGGTIVSSALKLMDEVVKERYNPAQWNIYAAQASDGDNWADDS 354 E + GGT + ++L+ E ++ + +DG + Sbjct: 457 PLEDKKDAVDKIGSVTPGGGTEIFTSLEKAYEELENLKLQ---RKHIILLTDGQSARSTD 513 Query: 355 PLCHEILAKKLLPVVRYYSYIEITRRAHQTLWREYEHLQSTFDNFAMQHIRDQDDIYPV 413 K+ + + A + L E L + + Sbjct: 514 YESMIETGKENNITLSTVALG---SDADRNLLEELAGLGAGRFYDVTDSSVIPSILSRE 569 Score = 48.7 bits (114), Expect = 4e-04, Method: Composition-based stats. Identities = 18/132 (13%), Positives = 38/132 (28%), Gaps = 11/132 (8%) Query: 238 NYEKRPDPSSQAVMFCLMDVSGSMDQSTKDMAKRFYILLYLFLSRTYKNVEV----VYIR 293 P+ + + D S S+ ++ F + + Sbjct: 53 AVPSIRLPAPGKTVVFIADRSASVQGREGELL-DFIDAGIQSKGKEDSYAVISAGETAAA 111 Query: 294 HHTQAKEVDEHEFFYSQET-GGTIVSSALKLMDEVVKERYNPAQWNIYAAQASDGDNWAD 352 + A E F + G T + + ++L ++ P + SDG A Sbjct: 112 ESSLASMKGEFREFSTDTGKGETNLEAGIQLASTLM-----PEETPGRIVLLSDGRETAG 166 Query: 353 DSPLCHEILAKK 364 S ++L + Sbjct: 167 SSREAAKLLKNR 178 >UniRef50_A2E0T6 von Willebrand factor type A domain containing protein n=1 Tax=Trichomonas vaginalis RepID=A2E0T6_TRIVA Length = 753 Score = 101 bits (252), Expect = 4e-20, Method: Composition-based stats. Identities = 25/183 (13%), Positives = 51/183 (27%), Gaps = 20/183 (10%) Query: 246 SSQAVMFCLMDVSGSMDQSTKDMAKRFYILLYLFLSRTYKNVEVVYIR----------HH 295 + + ++D SGSM S AK +L L + + + + Sbjct: 239 QANTEFYFIIDCSGSMYGSRIKNAKSCLNVLLHSLPIGCRFSIIKFGTKFEVALEPCDYT 298 Query: 296 TQAKEVDEHEFFYSQETG-GTIVSSALKLMDEVVKERYNPAQWNIYAAQASDGDNWADDS 354 + H+ G + S LK + E + +DG+ D Sbjct: 299 DENMSKAMHQLDLIDADMCGNDMISPLKY----ISEHPQKKDYIKQVFLLTDGE----DD 350 Query: 355 PLCHEILAKKLLPVVRYYSYIEITRRAHQTLWREYEHLQSTFDNFAMQHIRDQDDIYPVF 414 + + + R ++ A + L + S F + ++ Sbjct: 351 RISICAMVQANRDNFRVFTIGIG-SDADRNLIIDVARNGSGRYIFIDDEDENMNEKVIEL 409 Query: 415 REL 417 L Sbjct: 410 LRL 412 >UniRef50_A0LPK8 Vault protein inter-alpha-trypsin domain protein n=2 Tax=Deltaproteobacteria RepID=A0LPK8_SYNFM Length = 680 Score = 101 bits (252), Expect = 4e-20, Method: Composition-based stats. Identities = 40/312 (12%), Positives = 80/312 (25%), Gaps = 26/312 (8%) Query: 129 FEDLALPNLKQNQQRQLTEYKTHRAGYTANGVPANISVVRSLQN---SLARRTAMTAGKR 185 E + + ++ + V R++ + ++ R A R Sbjct: 183 DERDRFWTVDNAVRGSFALNVELKSAFPIKDVRLPEHQDRAVVSKESNMGERGAPGDVYR 242 Query: 186 RELHALEENLAIISNSEPAQLLEEERLRKEIAELRAKIERVPFIDTFDLRYKNYEKRPDP 245 L + E +L +E R E+ R + TF + Sbjct: 243 IALESTEGARLDKDVVLYYRLDDEVPARVELIPYRKGPD---SAGTFMVVVTPAASLKRI 299 Query: 246 SSQAVMFCLMDVSGSMDQSTKDMAKRFYILLYLFLSRTYKNVEVVYIRHHTQ-------- 297 + ++D+SGSM + +S + V + Sbjct: 300 AEGVDWTFVLDISGSMTGRKITTLIEGVSRVLGKMSANDRFRIVTFNTTAADFTGGYVPA 359 Query: 298 ---AKEVDEHEFFYSQETGGTIVSSALKLMDEVVKERYNPAQWNIYAAQASDGD-NWADD 353 + Q G T + L L ++ +DG N Sbjct: 360 SPENVQTWMQRVKQIQAGGSTALFDGLDLAYRLLDGERTTG-----IVLVTDGVCNVGPT 414 Query: 354 SPLCHEILAKKLLPVVRYYSYIEITRRAHQTLWREYEHLQSTFDNFAMQHIRDQDDIYPV 413 L K+ VR ++++ A+Q L F + + Sbjct: 415 RHDEFLGLLKQ--HDVRLFTFVIGNS-ANQPLMDRLAKESGGFAMNVSESDDIAGRLIQA 471 Query: 414 FRELFHKQNATA 425 ++FH+ Sbjct: 472 KAKVFHECLHGV 483 >UniRef50_A6G857 Putative uncharacterized protein n=1 Tax=Plesiocystis pacifica SIR-1 RepID=A6G857_9DELT Length = 540 Score = 101 bits (252), Expect = 4e-20, Method: Composition-based stats. Identities = 22/188 (11%), Positives = 45/188 (23%), Gaps = 11/188 (5%) Query: 239 YEKRPDPSSQAVMFCLMDVSGSMDQSTKDMAKRFYILLYLFLSRTYKNVEVVY------- 291 + + +D+S SM+ D ++ + + L + V + Sbjct: 148 IDPAELDRPPLNLTIAVDLSKSMEGEPIDRVRQGLLQMREQLEPEDRVTLVGFGDEAQVI 207 Query: 292 IRHHTQAKEVDEHEFFYSQETGGTIVSSALKLMDEVVKERYNPAQWNIYAAQASDGDN-W 350 + + + G T + + L+ E N SDG Sbjct: 208 VENADKDSVELATAIAALVPWGSTNLYAGLRTAFEQTDLYAQEGWQNRV-LLVSDGVPTT 266 Query: 351 ADDSPLCHEILAKK-LLPVVRYYSYIEITRRAHQTLWREYEHLQSTFDNFAMQHIRDQDD 409 + E LA+ + L R L S + + Sbjct: 267 GIVNSDKIEGLAEAWSGMGYGLTTVGIGN-DFDIELMRNLSELGSGSFYYVEDPDAVIEV 325 Query: 410 IYPVFREL 417 + Sbjct: 326 FSEEVQAF 333 >UniRef50_Q1VY89 Inter-alpha-trypsin inhibitor family heavy chain-related protein-hypothetical secreted or membrane-associated n=1 Tax=Psychroflexus torquis ATCC 700755 RepID=Q1VY89_9FLAO Length = 689 Score = 101 bits (252), Expect = 4e-20, Method: Composition-based stats. Identities = 40/328 (12%), Positives = 83/328 (25%), Gaps = 27/328 (8%) Query: 107 QDGEGQDEFVFQISKDEYLDLLFEDLALPNLKQNQQRQLTEYKTHRAGYTANGVPANISV 166 ++ E ++S E L F+++ Q + I Sbjct: 133 EEIAENSEIEVELSYVELLPYSFDEVTFEYPSDYSAIQSDVIVFAQNLNFNLFSERTIDD 192 Query: 167 VRSLQNSLARRTAMTAGKRRELHALEENLAI---ISNSEPAQLLEEERLRKEIAELRAKI 223 V L N++ T + E I + L + E + Sbjct: 193 V-ELFNNVGTMTNDGNVATVLISENEAISINDILIKYQLASDELGVIPFSTLLEEGVNEC 251 Query: 224 ERVPFIDTFDLRYKNYEKRPDPSSQAVMFCLMDVSGSM-DQSTKDMAKRFYILLYLFLSR 282 + F L + + ++D SGSM + AK + L+ Sbjct: 252 D-DFGNGFFGLVVEPESNANTEVIEKNFVLIIDSSGSMRGGNKMAQAKEASEFIVNNLNI 310 Query: 283 TYKNVEVVY-----------IRHHTQAKEVDEHEFFYSQETGGTIVSSALKLMDEVVKER 331 + + + ++ Q G T +S +L + Sbjct: 311 GDNFNVIDFDNNIVLFQPELVEYNIQNSNAALDFIENIVALGATNISESLVTAINQFEAG 370 Query: 332 YNPAQWNIYAAQASDG-DNWADDSPLCHEILAK----KLLPVVRYYSYIEITRRAHQTLW 386 +DG + + LA+ ++ + +++ L Sbjct: 371 AEDKAN--IIVFFTDGGATEGETNTQNILQLAEDTVNQIETEIFLFTFGIG-EDVTTDLL 427 Query: 387 REYEHLQSTFDNFAMQHIRDQDDIYPVF 414 + F F + + DI F Sbjct: 428 TLLAVQNNGFVTFLGDN--EIVDIISNF 453 >UniRef50_Q54DU5 von Willebrand factor A domain-containing protein DDB_G0292028 n=1 Tax=Dictyostelium discoideum RepID=Y2028_DICDI Length = 932 Score = 101 bits (252), Expect = 4e-20, Method: Composition-based stats. Identities = 23/193 (11%), Positives = 58/193 (30%), Gaps = 24/193 (12%) Query: 247 SQAVMFCLMDVSGSMDQSTKDMAKRFYILLYLFLSRTYKNVEVVYIRHHTQAKEVDEHEF 306 ++ ++D SGSM + +K + L+ K V + + + E +H Sbjct: 339 QKSEFIFVLDCSGSMSGKPIEKSKMALEICMRSLNENSKFNIVCFGSNFNKLFETSKHYN 398 Query: 307 F-----------YSQET-GGTIVSSALKLMDEVVKERYNPAQWNIYAAQASDGDNWADDS 354 GGT + L+ + +++ + +P ++ +DG+ + Sbjct: 399 DETLQKASEYINRIDANLGGTEL---LEPIVDILSKESDP-EFPRQVFILTDGE---ISN 451 Query: 355 PLCHEILAKKLLPVVRYYSYIEITRRAHQTLWREYEHLQSTFDNFAMQHIRDQDDIYPVF 414 K R ++Y + L + + + ++ + Sbjct: 452 RDKLIDYVGKEANTTRIFTYGIG-SYVDKELIVGVSKACKGYYEMIVDNSDMEEKVM--- 507 Query: 415 RELFHKQNATAKG 427 +L Sbjct: 508 -KLISIAMQPTLS 519 >UniRef50_D0LNY0 von Willebrand factor type A n=1 Tax=Haliangium ochraceum DSM 14365 RepID=D0LNY0_HALO1 Length = 808 Score = 101 bits (251), Expect = 5e-20, Method: Composition-based stats. Identities = 33/289 (11%), Positives = 68/289 (23%), Gaps = 29/289 (10%) Query: 165 SVVRSLQNSLARRTAMTAGKRRELHALEENLAIISNSEPAQLLEEERLRKEIAELRAKIE 224 S + + A L + E L + + +R R + + Sbjct: 377 SASTAAVELVKYGVANGERPHPSLARVWEFLNYETFDSASYEELGDRFRVSMGMVSRPSL 436 Query: 225 RVPFIDTFDLRYKN--YEKRPDPSSQAVMFCLMDVSGSMDQS-----------TKDMAKR 271 + L + AV+ L+D+SGSM + D+ + Sbjct: 437 TQDGAVDYLLGANVTVPNLTREERPHAVVTFLVDISGSMAEYSPTVDAGGAPTRMDIVRE 496 Query: 272 FYILLYLFLSRTYKNVEVVYIRHHTQAKE------------VDEHEFFYSQETGGTIVSS 319 L V + E GGT +S+ Sbjct: 497 GLWKAVSALKPGDIVNVVSFDDAAQIELERGEIRPGAATPRPYLRSVLRLLPRGGTNLSA 556 Query: 320 ALKLMDEVVKERYNPAQWNIYAAQASDG-DNWADDSPLCH-EILAKKLLPVVRYYSYIEI 377 +++ V + Y+P + N +D N P + + + + Sbjct: 557 GIEVAYRVARRNYDPYRINRVIIL-TDAYANRGSIDPSLIGDHVLIGDDEGIHFSGLGVG 615 Query: 378 TRRAHQTLWREYEHLQSTFDNFAMQHIRDQDDIYPVFRELFHKQNATAK 426 ++ + + F L + Sbjct: 616 Y-DFNEDFLNTLTDVGRGTYFSLITERDAARAFGERFVSLLAVAARDVR 663 >UniRef50_UPI00015B5332 PREDICTED: similar to ENSANGP00000020925 n=1 Tax=Nasonia vitripennis RepID=UPI00015B5332 Length = 2053 Score = 101 bits (251), Expect = 5e-20, Method: Composition-based stats. Identities = 39/298 (13%), Positives = 83/298 (27%), Gaps = 32/298 (10%) Query: 145 LTEYKTHRAGYTANGVPANISVVRSLQNSLAR-RTAMTAGKRRELHALEENLAIISNSEP 203 E TH + N +++ + ++ + L R ++ KR + + + Sbjct: 1001 YLEPDTHFNNISVNTSFSSVHIPTNVYDRLPRVNMTISWSKRLDRIFKHNYKSDPALMWQ 1060 Query: 204 AQLLEEERLRKEIAELRA---KIERVPFIDTFDLRYKNYEKRPDPSSQAVMFCLMDVSGS 260 LR+ A K + DT+D R +++ + M L+D SGS Sbjct: 1061 YFCSTTGVLRQYPAMRWPVSLKKDGKEITDTYDCRVRSW-FIEASTCSKDMVILVDNSGS 1119 Query: 261 MDQSTKDMAKRFYILLYLFLSRTYKNVEVVYIRHHTQAKEVDEHEFFYS----------- 309 M + +AK + LS V ++V Sbjct: 1120 MTGMSNAIAKTTVSTIMSTLSNND---FVAVFNFSDSTQQVVSCFQDKLVQATPENIRRI 1176 Query: 310 -------QETGGTIVSSALKLMDEVVKERYNPAQW------NIYAAQASDGDNWADDSPL 356 + G ++ A +++ N ++ N +DG Sbjct: 1177 NDDILTMKPEGVANITEAFLAAFTILENYRNESRCGSDLSCNQMIMLVTDGIASNITEVF 1236 Query: 357 CHEILAKKLLPVVRYYSYIEITRRAHQTLWREYEHLQSTFDNFAMQHIRDQDDIYPVF 414 ++ VR ++Y+ + L + + + Sbjct: 1237 QEYNWSENGTIPVRVFTYLLGQEVTKVREIQWMACLNRGYYTHIHTQAEVPEQVLKYI 1294 >UniRef50_A4YGI9 von Willebrand factor, type A n=1 Tax=Metallosphaera sedula DSM 5348 RepID=A4YGI9_METS5 Length = 363 Score = 101 bits (251), Expect = 5e-20, Method: Composition-based stats. Identities = 31/209 (14%), Positives = 56/209 (26%), Gaps = 25/209 (11%) Query: 233 DLRYKNYEKRPDPSSQAVMFCLMDVSGSMDQSTKDMAKRFYILLYLFLSRTYKNVEVVYI 292 R + + L+D S SM +MAK LL L + + + + Sbjct: 1 MFRIVLVPESKFEAKNLHYVILIDRSYSMKGEKLEMAKEGARLLVDNLPKDSRFSLLAFN 60 Query: 293 RHHTQAKEV-----DEHEFFYSQETGGTIVSSALKLMDEVVKERYNPAQWNIYAAQASDG 347 + KE E + GT + AL+ + ++ Y +DG Sbjct: 61 EKVSIIKEHEHPSEMGKELESLKVGSGTAMYKALQEAFNLARKY----GEPTYVILLTDG 116 Query: 348 DNWA---DDSPLCHEILAK--------KLLPVVRYYSYIEITRRAHQTLWREYEHLQSTF 396 L + + V+ S+ + + E F Sbjct: 117 VPSDMGCMPGLSRKFDLNRCLPVYQGLSVPENVQIISFGIG-DDYSEEILTEVSEKGRGF 175 Query: 397 DNFAMQHIRDQDDIYPVFRELFHKQNATA 425 D I +L + A + Sbjct: 176 FYHVT----DPAQIPEKMPKLVKSEVAAS 200 >UniRef50_Q09E12 Inter-alpha-inhibitor H4 heavy chain, putative n=1 Tax=Stigmatella aurantiaca DW4/3-1 RepID=Q09E12_STIAU Length = 540 Score = 101 bits (251), Expect = 6e-20, Method: Composition-based stats. Identities = 23/180 (12%), Positives = 48/180 (26%), Gaps = 17/180 (9%) Query: 251 MFCLMDVSGSMDQSTKDMAKRFYILLYLFLSRTYKNVEVVYIRHHTQAKEVDEHEF---- 306 + ++D SGSM S +AK L+ V + + Sbjct: 72 VTFVIDTSGSMQGSRMQIAKDALKYCVTRLNPQDTFNVVRFSTDVEALFPALKSAQPENI 131 Query: 307 -------FYSQETGGTIVSSALKLMDEVVKERYNPAQWNIYAAQASDGDNW-ADDSPLCH 358 + GGT + AL +++ + +DG + Sbjct: 132 QKAVAFVEQLEAIGGTAIDEALVRG---LQDNDGKSSAPHLLMFITDGQPTIGETDEGAI 188 Query: 359 EILAKKLLP-VVRYYSYIEITRRAHQTLWREYEHLQSTFDNFAMQHIRDQDDIYPVFREL 417 AK R +++ + L + +F + I + ++ Sbjct: 189 AQHAKDGRKAKTRLFTFGVG-EDLNARLLDRLSSDGAGTSDFVRDGKEFETKISSFYDKV 247 >UniRef50_B0TSG0 Vault protein inter-alpha-trypsin domain protein n=1 Tax=Shewanella halifaxensis HAW-EB4 RepID=B0TSG0_SHEHH Length = 761 Score = 101 bits (250), Expect = 6e-20, Method: Composition-based stats. Identities = 26/202 (12%), Positives = 54/202 (26%), Gaps = 22/202 (10%) Query: 236 YKNYEKRPDPSSQAVMFCLMDVSGSMDQSTKDMAKRFYILLYLFLSRTYKNVEVVYIRH- 294 + P +S + ++D SGSM + A + L+ +++ H Sbjct: 249 FYPQIDLPQTTSSRCIKMVVDCSGSMLGDSITQAGIALKQILKLLNEDDWFNIILFGSHH 308 Query: 295 ----------HTQAKEVDEHEFFYSQET-GGTIVSSALKLMDEVVKERYNPAQWNIYAAQ 343 + ++ E GGT + SAL + PA+ Sbjct: 309 KSLFSESVKANRANLDIAAKELANLNADLGGTEMLSALNAAYD----SAAPAELASNILL 364 Query: 344 ASDGDNWADDSPLCHEILAKKLLPVVRYYSYIEITRRAHQTLWREYEHLQSTFDNFAMQH 403 +DG+ W A++ + ++ F + Sbjct: 365 ITDGEIWG---EEQLICKAQESNHRHFVVGVG---SAVSEAFLKQLADKTGGASEFVTPN 418 Query: 404 IRDQDDIYPVFRELFHKQNATA 425 I F + + + Sbjct: 419 ENMSSRIVQHFCRIKQSKLTQS 440 >UniRef50_C0Z595 Putative uncharacterized protein n=1 Tax=Brevibacillus brevis NBRC 100599 RepID=C0Z595_BREBN Length = 947 Score = 101 bits (250), Expect = 7e-20, Method: Composition-based stats. Identities = 33/214 (15%), Positives = 58/214 (27%), Gaps = 29/214 (13%) Query: 223 IERVPFIDTFDLRYKNYEKRPDPSSQAVMFCLMDVSGSMDQS-----TKDMAKRFYILLY 277 + P + + K PS + +D SGSM +A+ I Sbjct: 382 WFQTPIEEALPVHMDLKGKEQLPSLGLQLV--IDKSGSMSSDARGADKMALAREAAIRAT 439 Query: 278 LFLSRTYKNVEVVY--------IRHHTQAKEVDEHEFFYSQETGGTIVSSALKLMDEVVK 329 ++ + + + + + Q GGT + AL+L E VK Sbjct: 440 TMMNAQDYIGVIAFDDTPWDVVAPQSVTKLDEIQQQISRIQADGGTDIFPALQLGYERVK 499 Query: 330 ERYNPAQWNIYAAQASDGDNWADDSPLCHEILAKKLLPVVRYYSYIEITRRAHQTLWREY 389 + +DG + DD V + + + L Sbjct: 500 AM---NTQRKHVILLTDGQSALDDDYEGLLQQMTAENITVSTVALG---DDSDRGLLEMI 553 Query: 390 EHLQSTFDNFAMQHIRDQDDIYPVFRELFHKQNA 423 L FA ++F K+ A Sbjct: 554 AELGKGRYYFANDAESIP--------KIFSKETA 579 Score = 50.3 bits (118), Expect = 1e-04, Method: Composition-based stats. Identities = 18/142 (12%), Positives = 38/142 (26%), Gaps = 16/142 (11%) Query: 245 PSSQAVMFCLMDVSGSMDQSTKDMAKRFYILLYLFLSRTYKNVEVVYIRHHTQAKEVDEH 304 P + ++D S SM + F K + + + Sbjct: 62 PVQAKTIVFVVDRSASMKDDPR--VLSFLREAVGQKQAADKYAVIAIGAEAAVDQPMTIR 119 Query: 305 EFFYSQETG------GTIVSSALKLMDEVVKERYNPAQWNIYAAQASDGDNWADDSPLCH 358 + Q G T ++ ++L ++ +DG + D+ Sbjct: 120 Q--EVQPLGVDVNRNATNLAEGIRLASAMIPTNARGK-----VVLLTDGLETSGDAARQ- 171 Query: 359 EILAKKLLPVVRYYSYIEITRR 380 LA++ V S + Sbjct: 172 TRLARERGIAVEAVSLQQPNGD 193 >UniRef50_Q7SGD8 Predicted protein n=4 Tax=Sordariales RepID=Q7SGD8_NEUCR Length = 1086 Score = 101 bits (250), Expect = 7e-20, Method: Composition-based stats. Identities = 23/209 (11%), Positives = 53/209 (25%), Gaps = 20/209 (9%) Query: 230 DTFDLRYKNYEKRPDPSSQAVMFCLMDVSGSMDQSTKDMAKRFYILLYLFLSRTYKNVEV 289 L K PS++ + + D SGSM + + K + + K Sbjct: 275 HQRALMATLVPKFNLPSTRPEIVFVCDRSGSMGGARIEGLKSALRIFLKSIPVGAKFNIC 334 Query: 290 VYI------------RHHTQAKEVDEHEFFYSQET-GGTIVSSALKLMDEVVKERYNPAQ 336 + + ++ + GGT + L+ E Sbjct: 335 SFGSTFEFLFSDGSRSYDHESLRLAMDYVSRMDADLGGTEMYQPLEAAFE-----KRYND 389 Query: 337 WNIYAAQASDGDNWADDSPLCHEI-LAKKLLPVVRYYSYIEITRRAHQTLWREYEHLQST 395 ++ +DG+ W + + +R ++ +H L + Sbjct: 390 MDLEVFLLTDGEIWNQEHLFTMINKKVSESQGAIRLFTLGIGNDVSH-ALIEGAARAGNG 448 Query: 396 FDNFAMQHIRDQDDIYPVFRELFHKQNAT 424 F + + + + Sbjct: 449 FAQSVTDSEKMNAKVVRMLKAGLTPHIKD 477 >UniRef50_C6VUB8 von Willebrand factor type A n=1 Tax=Dyadobacter fermentans DSM 18053 RepID=C6VUB8_DYAFD Length = 935 Score = 101 bits (250), Expect = 7e-20, Method: Composition-based stats. Identities = 37/261 (14%), Positives = 67/261 (25%), Gaps = 22/261 (8%) Query: 170 LQNSLARRTAMTAGKRRELHALEENLAIISNSEPAQLLEEERLRKEIAELRAKIERVPFI 229 ++ + A A R + L A + + A R + + Sbjct: 687 VKTNSAEMPAAQQSPVRNVRPLTNEAAPSAPAAQATRDTVYVERVRVDTVYVDRNAQLQN 746 Query: 230 DTFDLRYKNYEKRPDPSSQAVMFCLMDVSGSMDQS-TKDMAKRFYILLYLFLSRTYKNVE 288 T L M L+DVS SM+ + KR L + Sbjct: 747 VTRSLDGFAPN---------NMVLLLDVSSSMNSPYKMPLLKRSIKSLLTLVRPEDMISI 797 Query: 289 VVYIR--------HHTQAKEVDEHEFFYSQETGGTIVSSALKLMDEVVKERYNPAQWNIY 340 V+Y Q G T + +KL + ++Y N Sbjct: 798 VLYSGKARVVLKPTSGAKASEISRMIDLLQSDGDTDGNEGIKLAYKTANKQYIRGGNNR- 856 Query: 341 AAQASDGDNWADDSPLCHEILAKKLLPVVRYYSYIEITRRAHQTLWREYEHLQSTFDNFA 400 A+DG+ D + + + +++ ++ L Sbjct: 857 IVLATDGEFPVSDEVMDMIRQNARQDVYLSIFTFG--RHEHTGQKLKKLSELGMGSYAHV 914 Query: 401 MQHIRDQDDIYP-VFRELFHK 420 D I ++L K Sbjct: 915 TDASADLQLILEAQAKKLAGK 935 >UniRef50_UPI000180D2FB PREDICTED: similar to inter-alpha (globulin) inhibitor H5 n=1 Tax=Ciona intestinalis RepID=UPI000180D2FB Length = 1586 Score = 101 bits (250), Expect = 7e-20, Method: Composition-based stats. Identities = 26/234 (11%), Positives = 57/234 (24%), Gaps = 29/234 (12%) Query: 207 LEEERLRKEIAELRAKIERVPFIDTFDLRYKNYEKRPDPSSQAVMFCLMDVSGSMDQSTK 266 + E +R + + ++ P + L+DVSGSM Sbjct: 907 GLNGKFVIEYDVF---RDRTTEMVIDQSYFAHFITSNLPPMSKRVVFLIDVSGSMFGIKI 963 Query: 267 DMAKRFYILLYLFLSRTYKNVEVVYIRHHTQAKE-----------------VDEHEFFYS 309 D ++ + L+ T + + ++ Sbjct: 964 DQVRQAMNTILHGLAETDFFSVIAFNSSVSRWSPSGTAAVLASGTTANINSAMNFLNTTV 1023 Query: 310 QETGGTIVSSALKLMDEVVKERYNPAQWNI--YAAQASDGDNWADD-SPLCHEILAKKLL 366 GGT + A++ ++ + +DG S + L Sbjct: 1024 VTRGGTDILQAVEAAIQLFDSAATGGTNTASDFMVLLTDGRPTDGTVSSTAIISAIRNLN 1083 Query: 367 P---VVRYYSYIEITRRAHQTLWREYEHLQSTFDNFAMQHIRDQDDIYPVFREL 417 + + + L R+ S + I + E+ Sbjct: 1084 RGRFGINTIGFGTL---VDMNLLRKIAAQNSGTSIQIFIDLNSYAQISNFYEEI 1134 Score = 51.4 bits (121), Expect = 7e-05, Method: Composition-based stats. Identities = 18/135 (13%), Positives = 36/135 (26%), Gaps = 14/135 (10%) Query: 248 QAVMFCLMDVSGSMDQSTKDMAKRFYILLYL----FLSRTYKNVEVVYIRHH------TQ 297 + +D +GSM AK+ ++ L V V + T Sbjct: 191 GTTLAFAIDDTGSMSGE-IRAAKQRAKMIIEERQGSLDEPRDFVLVPFNDPTVGPITVTS 249 Query: 298 AKEVDEHEFFYSQETGGTIVSSALKLMDEVVKERYNPAQWNIYAAQASDGDNWADDSPLC 357 + + GG A +L + +Q +D D + Sbjct: 250 NPDTFKASIDRLHAHGG---GDAPELALRGILLAIENSQEGSTVFVITDVDAKDIELQDV 306 Query: 358 HEILAKKLLPVVRYY 372 A++ + + Sbjct: 307 VVAQARQRNIKITFL 321 >UniRef50_Q8H923 Putative uncharacterized protein OSJNBa0071K18.17 n=5 Tax=Poaceae RepID=Q8H923_ORYSJ Length = 606 Score = 101 bits (250), Expect = 7e-20, Method: Composition-based stats. Identities = 27/235 (11%), Positives = 57/235 (24%), Gaps = 27/235 (11%) Query: 215 EIAELRAKIERVPFIDTFDLRYKNYEKRPDPSSQAVMFCLMDVSGSMDQSTKDMAKRFYI 274 E + R F + + + ++DVSGSM + K+ Sbjct: 109 EYPAVARGASRDKFAVLVHAKAAGAAAAAASRAPLDLVTVLDVSGSMAGRKLALVKKAMG 168 Query: 275 LLYLFLSRTYKNVEVVYIRHHTQA----------KEVDEHEFFYSQETGGTIVSSALKLM 324 + L + V + ++ K + + T + L++ Sbjct: 169 FVIDNLGPADRLCVVSFSTEASRRTRLLRMSEVGKATAKRAVESLVDDSATNIGDGLRVA 228 Query: 325 DEVVKERYNPAQWNIYAAQASDGDNWADDSPL----CHEILA---------KKLLPVVRY 371 V+ +R + + SDG + + L + L + Sbjct: 229 GRVLGDRRHKNAVSSVI-LLSDGKDSYVVPRRGNGMSYMDLVPPSFASSGGRGQLAPIHT 287 Query: 372 YSYIEITRRAHQTLWREYEHLQSTFDNFAMQHIRDQDDIYPVFRELFHKQNATAK 426 + + +F QD L A+ Sbjct: 288 FGFG---ADHDAAAMNTIAESTGGTFSFVENEAAIQDSFAQCIGGLLSVAVQDAR 339 >UniRef50_C7G046 von Willebrand factor A domain-containing protein DDB_G0286969 n=1 Tax=Dictyostelium discoideum RepID=Y6969_DICDI Length = 2079 Score = 100 bits (249), Expect = 8e-20, Method: Composition-based stats. Identities = 22/192 (11%), Positives = 53/192 (27%), Gaps = 23/192 (11%) Query: 245 PSSQAVMFCLMDVSGSMDQSTKDMAKRFYILLYLFLSRTYKNVEVVYIR----------- 293 P ++ + L+DVS SM+ AK+ LS+ + + Sbjct: 349 PVVESELIFLVDVSESMEGYNMKHAKKALHRFLHSLSKDTYFNIISFASSHRKLFAQSVK 408 Query: 294 HHTQAKEVDEHEFFYSQE--TGGTIVSSALKLMDEVVKERYNPAQWNIYAAQASDGDNWA 351 ++ + + + G T + LK + V A +DG Sbjct: 409 YNDENLKAATAYVESLKAISHGETNLLEPLKDIYSV------DATCPRKIFLLTDGR--- 459 Query: 352 DDSPLCHEILAKKLLPVVRYYSYIEITRRAHQTLWREYEHLQSTFDNFAMQHIRDQDDIY 411 ++ L ++ + + L + S +++ + + Sbjct: 460 VNNIGPIVDLVRQNAHNTSVFPIGMG-EFVSRQLVEYIANAGSGVAELVIENETIESKVM 518 Query: 412 PVFRELFHKQNA 423 + + Sbjct: 519 RQLKRALQPAMS 530 >UniRef50_B4W304 von Willebrand factor type A domain protein (Fragment) n=2 Tax=Cyanobacteria RepID=B4W304_9CYAN Length = 538 Score = 100 bits (249), Expect = 8e-20, Method: Composition-based stats. Identities = 26/194 (13%), Positives = 53/194 (27%), Gaps = 12/194 (6%) Query: 234 LRYKNYEKRPDPSSQAVMFCLMDVSGSMDQSTKDMAKRFYILLYLFLSRTYKNVEVVYIR 293 + ++ + ++D SGSM + A + L +L+ V+Y Sbjct: 26 FQGSESSQQTSSRRPLNLSLVLDRSGSMAGAPLRYAIQAAQNLIDYLTADDFVSVVIYDD 85 Query: 294 HH--------TQAKEVDEHEFFYSQETGGTIVSSALKLMDEVVKERYNPAQWNIYAAQAS 345 + + + + G T +S L V+ +P + N + Sbjct: 86 TAEVIIPPQLVGDQAALKAKIGKIRARGCTNLSGGWLLGCSQVQANQSPERINRV-LLLT 144 Query: 346 DG-DNWADDSPLCHEILA-KKLLPVVRYYSYIEITRRAHQTLWREYEHLQSTFDNFAMQH 403 DG N+ P A +K + + ++ L + F Sbjct: 145 DGLANYGIKDPQVLTKTALEKAEADIVTTTLGFGN-YFNEDLLINMANAARGNFYFIQSP 203 Query: 404 IRDQDDIYPVFREL 417 L Sbjct: 204 DDASQVFEIEMESL 217 >UniRef50_C3YRH6 Putative uncharacterized protein n=1 Tax=Branchiostoma floridae RepID=C3YRH6_BRAFL Length = 495 Score = 100 bits (249), Expect = 9e-20, Method: Composition-based stats. Identities = 27/197 (13%), Positives = 49/197 (24%), Gaps = 20/197 (10%) Query: 241 KRPDPSSQAVMFCLMDVSGSMDQSTKDMAKRFYILLYLFLSRTYKNVEVVYIR-----HH 295 + Q ++D SGSM + A+ +L L V + Sbjct: 281 PKTSEGIQGEYIAVIDRSGSMSGAFIATARETLLLFLKSLPAGSAFNIVGFGSTFKPLFD 340 Query: 296 TQAKEVDEHEFF------YSQET-GGTIVSSALKLMDEVVKERYNPAQWNIYAAQASDGD 348 E+ + GGT + L+ + PA +DG Sbjct: 341 ASVPVNQENVGTASAWVCKMRADLGGTNLLGPLEWIF----SAPRPAGRPREVFILTDG- 395 Query: 349 NWADDSPLCHEILAKKLLPVVRYYSYIEITRRAHQTLWREYEHLQSTFDNFAMQHIRDQD 408 A + L + R ++ + + L R F + R Q Sbjct: 396 --AVSNTSRVIDLVRANSSHTRCWAVGIGEGAS-RVLIRGIAEAGRGRAEFVTEVDRMQA 452 Query: 409 DIYPVFRELFHKQNATA 425 + + Sbjct: 453 KLLLCLKRSLQPAICDV 469 >UniRef50_C1I2R0 von Willebrand factor n=1 Tax=Clostridium sp. 7_2_43FAA RepID=C1I2R0_9CLOT Length = 960 Score = 100 bits (249), Expect = 9e-20, Method: Composition-based stats. Identities = 28/203 (13%), Positives = 58/203 (28%), Gaps = 28/203 (13%) Query: 234 LRYKNYEKRPDPSSQAVMFCLMDVSGSMDQS-----TKDMAKRFYILLYLFLSRTYKNVE 288 L ++ + + ++D SGSM +AK + L + Sbjct: 391 LPVYMDKRGKNEVPAISINLIIDKSGSMSAEGGGVSKLTLAKEAAMKALENLREVDEISV 450 Query: 289 VVYI-RHH-------TQAKEVDEHEFFYSQETGGTIVSSALKLMDEVVKERYNPAQWNIY 340 + + + KE + Q GGT + AL+ + + + Sbjct: 451 IAFDDTYDEVVPLQKVGDKEAIKELISGIQIRGGTSIYPALEQGYNMQMQSSAKIKHT-- 508 Query: 341 AAQASDGDN-WADDSPLCHEILAKKLLPVVRYYSYIEITRRAHQTLWREYEHLQSTFDNF 399 +DG + + D+ + + E A+ L + + + Sbjct: 509 -ILLTDGQDGYGLDNYATLLQNFIDNNITLSTVAVGEG---ANAGLLNQLASIGKGRSYY 564 Query: 400 AMQHIRDQDDIYPVFRELFHKQN 422 DIY +F K+ Sbjct: 565 T--------DIYTDIPRIFAKEV 579 Score = 57.2 bits (136), Expect = 1e-06, Method: Composition-based stats. Identities = 18/136 (13%), Positives = 38/136 (27%), Gaps = 11/136 (8%) Query: 241 KRPDPSSQAVMFCLMDVSGSMDQSTKDMAKRFYILLYLFLSRTYKNVEVVYIRHHTQAKE 300 L+DVS S + K F + R K V++ + K Sbjct: 57 TINLKGRNISTVFLLDVSESASDFE-ESGKDFISTAIESMPRGNKAGVVLFGDNSKIDKV 115 Query: 301 VDEHE----FFYSQETGGTIVSSALKLMDEVVKERYNPAQWNIYAAQASDGDNWADDSPL 356 +++ + T + A++ + + + +DG+ D Sbjct: 116 LNKKKEYKSIDEKPVVTATNIQEAVESALGLFER-----GGSKRIVLITDGEENQGDILK 170 Query: 357 CHEILAKKLLPVVRYY 372 L + + Y Sbjct: 171 STP-LINEQKIDFKVY 185 >UniRef50_A6GDG5 Putative lipoprotein n=1 Tax=Plesiocystis pacifica SIR-1 RepID=A6GDG5_9DELT Length = 486 Score = 100 bits (249), Expect = 9e-20, Method: Composition-based stats. Identities = 38/335 (11%), Positives = 73/335 (21%), Gaps = 29/335 (8%) Query: 107 QDGEGQDEFVFQISKDEYLDLLFEDLALPNLKQNQQRQLTEYKTHRAGYTANGVPANISV 166 G G D + E +D LP +Q + + Sbjct: 19 GVGCGDDGGSYDDELGECAGCGVDDGELPEPPPEEQELPKPSEVCDEETPVTLHLSPDDS 78 Query: 167 VRSLQNSLARRTAM-TAGKRRELHALEENLAIISNSEPAQLLEEERLRKEIAELRAKIER 225 + R A E E L S PA + + + Sbjct: 79 NSMASPVMTREVAPYGFYTTFEWIRPWEFLNYYSFEYPA----ADPGDLSVHVDLRSKDE 134 Query: 226 VPFIDTFDLRYKNYEKRPDPSSQAVMFCLMDVSGSMDQSTKDMAKRFYILLYLFLSRTYK 285 F + + + ++D S SM + K + L Sbjct: 135 GRFQLQIGVASEIVSPSER--LPMNITLVLDESTSMTGAPMYAMKATARAIAGSLREGDV 192 Query: 286 NVEVVYIRH----------HTQAKEVDEHEFFYSQETGGTIVSSALKLMDEVVKERYNPA 335 V + + GGT + + L+ + + ++ Sbjct: 193 ISLVSWSNSNNVRLASHAVAGSNDATLLDTIDAIEPGGGTDLHAGLEQGYALAQANFSAD 252 Query: 336 QWNIYAAQASD-GDNWADDSPLCHEILAK-KLLPVVRYYSYIEITRR-AHQTLWREYEHL 392 + N SD G N +A+ + + + L Sbjct: 253 RINRVV-LVSDGGANLGFTDAELIAQMAELEDGEGIYMVGVGVGDVGRYNDELMDTVTDQ 311 Query: 393 QSTFDNFAMQHIRDQDDIYPVFRELFHKQNATAKG 427 F +F ++ + G Sbjct: 312 GKGASVFIPNEA--------EAERMFGERFMSTMG 338 >UniRef50_C5YHY2 Putative uncharacterized protein Sb07g005010 n=2 Tax=Sorghum bicolor RepID=C5YHY2_SORBI Length = 567 Score = 100 bits (249), Expect = 9e-20, Method: Composition-based stats. Identities = 31/226 (13%), Positives = 54/226 (23%), Gaps = 26/226 (11%) Query: 225 RVPFIDTFDLRYKNYEKRPDPSSQAVMFCLMDVSGSMDQSTKDMAKRFYILLYLFLSRTY 284 R D F + + + ++DVS SM + K+ + L Sbjct: 56 RFTSRDRFAVLVHAKAPSDVSRAPLDLVTVLDVSDSMKGEKLALLKQAMCFVIDQLGPAD 115 Query: 285 KNVEVVYIRHHTQAKEVDEH----------EFFYSQETGGTIVSSALKLMDEVVKERYNP 334 + V + ++ + G T + + + EV+ R Sbjct: 116 RLSVVTFSNDASRLTRLARMSDAGKASAKIAVESLAVQGFTNIKQGIHVAAEVLAGRREK 175 Query: 335 AQWNIYAAQASDG-DNWADDS-------------PLCHEILAKKLLPVVRYYSYIEITRR 380 SDG DN S P + A P +++ T Sbjct: 176 NVVAGMI-LLSDGHDNCGGTSVRPDGTKSYVNLVPPSLTVAAGSSRPAAPIHTFGFGTS- 233 Query: 381 AHQTLWREYEHLQSTFDNFAMQHIRDQDDIYPVFRELFHKQNATAK 426 +F QD L A+ Sbjct: 234 HDAGAMHAVAEATGGTFSFVGDEAAIQDSFARCVGGLLSVAVQEAR 279 >UniRef50_B6TZ81 Protein binding protein n=9 Tax=Magnoliophyta RepID=B6TZ81_MAIZE Length = 516 Score = 100 bits (249), Expect = 1e-19, Method: Composition-based stats. Identities = 25/188 (13%), Positives = 48/188 (25%), Gaps = 18/188 (9%) Query: 246 SSQAVMFCLMDVSGSMDQSTKDMAKRFYILLYLFLSRTYKNVEVVYIRHHTQAKEVDEHE 305 S + ++DVSGSM + K + LS + V ++ + + + Sbjct: 59 RSGLDLVAVLDVSGSMQGEKIEKMKTAMKFVVKKLSSIDRLSIVTFLDTANRICPLQQVT 118 Query: 306 FFY----------SQETGGTIVSSALKLMDEVVKERYNPAQWNIYAAQASDGDNWADDSP 355 Q G T +S L+ +V+ +R + + SDG + Sbjct: 119 EDSQPQLLKLIDALQPGGNTNISDGLQTGLKVLADRKLSSGRVVGVMLMSDGQ----QNR 174 Query: 356 LCHEILAKKLLPVVRYYSYIEITRRAHQTLWREYEHLQ-STFDNFAMQHIRDQDDIYPVF 414 K V + + T+ + Sbjct: 175 GEPAANVKIGNVPVYTFGFG---ADYDPTVLNAVARNSMGGTFSVVNDVNLLSMAFSQCL 231 Query: 415 RELFHKQN 422 L Sbjct: 232 AGLLTVVV 239 >UniRef50_B7AA98 von Willebrand factor type A n=3 Tax=Thermus RepID=B7AA98_THEAQ Length = 706 Score = 100 bits (249), Expect = 1e-19, Method: Composition-based stats. Identities = 50/311 (16%), Positives = 88/311 (28%), Gaps = 33/311 (10%) Query: 133 ALPNLKQNQQRQLTEYKTHRAGYTANGVPANISVVRSLQ---NSLARRT---------AM 180 A P + + R + E R+ A +PA R+L +LAR A Sbjct: 182 AFPLTGEAEVRAVAEGSWGRSEAKARLLPA--DRARALVLGDPALARYLEAQGFLVEEAF 239 Query: 181 TAGKRRELHALEENLAIISNSEPAQLLEEERLRKEIAELRAKIERVPFIDTFDLRYKNYE 240 +L A+ + + P L + R + L ++ Sbjct: 240 RRPLEADLVAVGLGVLDLPPGAPEALRDYLRRGGGLLFTATPKGLFFGGWDRALP-EDLP 298 Query: 241 KRPDPSSQAVMFCLMDVSGSMDQSTKDMAKRFYILLYLFLSRTYKNVEVVYIRHH----- 295 +P A + ++DVSGSM+ MA + L + V++ Sbjct: 299 LKPLGRKGAALVLVLDVSGSMEGEKLAMAVAGALELVRSAAPEDYLGVVLFSSSPRVLFP 358 Query: 296 -----TQAKEVDEHEFFYSQETGGTIVSSALKLMDEVVKERYNPAQWNIYAAQASDGDNW 350 Q K+ E + GGT++ A + ++++ SDG + Sbjct: 359 PRPMTAQGKKEAESLLLSLRAGGGTVLGGAFREALRLLQDVPVE---RKALLVLSDGIIF 415 Query: 351 ADDSPLCHEILAKKLLPVVRYYSYIEITRRAHQTLWREYEHLQSTFDNFAMQHIRDQDDI 410 P LA V + A A Sbjct: 416 DPKEP--ILALAATAGVEVSALALG---PDADAAFLEALAQRGGGRFYRAATPKELPRLF 470 Query: 411 YPVFRELFHKQ 421 +E+F + Sbjct: 471 LKEGQEVFQGE 481 >UniRef50_Q4S685 Chromosome 9 SCAF14729, whole genome shotgun sequence. (Fragment) n=1 Tax=Tetraodon nigroviridis RepID=Q4S685_TETNG Length = 608 Score = 100 bits (249), Expect = 1e-19, Method: Composition-based stats. Identities = 39/374 (10%), Positives = 86/374 (22%), Gaps = 65/374 (17%) Query: 110 EGQDEFVFQISKDEYLDLLFEDLAL------PNLKQNQQRQLTEYKTHRAGYTANGVPAN 163 E F ++ +E L L L QN +++ + + Sbjct: 103 PAGAEMSFLLTYEELLPRRLGRYELSLGLRPGQLVQNLSVEVSVTEQTGISFIKVLP--- 159 Query: 164 ISVVRSLQNSLARRTAMTAGKRRELHALEENLAIIS----NSEPAQLLEEERLRKEIAEL 219 + R L ++ A R E A + S + + Sbjct: 160 LRTSRLLPSAAQADAEAPASTRVEQSACCARVYYSPTLQQQSSVSSKGLHADFILQYDVA 219 Query: 220 RAKIERVPFIDTFDLRYKNYEKRPDPSSQAVMFCLMDVSGSMDQSTKDMAKRFYILLYLF 279 + + + + P + ++DVSGSM + K+ + Sbjct: 220 LRDLLGEVQVHDGYF-VHYFAPKGLPVVPKDVIFVIDVSGSMIGTKIQQTKQAMSTILAD 278 Query: 280 LSRTYKNVEVVYIRH------------HTQAKEVDEHEFFYSQETG-------------- 313 L + + Q + G Sbjct: 279 LREGDHFNIITFSDQVRTWKRGRTVRATRQNVRDAKEFVRRIIAEGCESEATEHHLTASL 338 Query: 314 ---------------GTIVSSALKLMDEVVKERYNPAQWN----IYAAQASDGDNW-ADD 353 GT +++AL +++ + + +DG+ Sbjct: 339 CLFLLLYEFSFSFPSGTNINAALLSAAQLINPPSSSRHLSSHRVPLVIFLTDGEATIGVT 398 Query: 354 SPLCHEILAKKLLPVVRYYSYIEITRRAHQTLWREYEHLQSTFDNFAMQHIRDQDDIYPV 413 + AKK L + A L + + + + D Sbjct: 399 AGDTILTNAKKALGSASLFGLAFG-DDADFLLLKRLALDNRG----VARMVYEDADAALQ 453 Query: 414 FRELFHKQNATAKG 427 + + + + Sbjct: 454 LKGFYDEVASPLLS 467 >UniRef50_Q9LMB7 F14D16.26 n=5 Tax=rosids RepID=Q9LMB7_ARATH Length = 736 Score = 100 bits (248), Expect = 1e-19, Method: Composition-based stats. Identities = 31/276 (11%), Positives = 71/276 (25%), Gaps = 37/276 (13%) Query: 167 VRSLQNSLARRTAMTAGKRRELHALEENLAIISNSEPAQLLEEERLRKEIAELRAKIERV 226 L+ L + ++ S + + + + A + V Sbjct: 232 SHQLKEKLRSAGKLRFAYEADVLKWSNTDFSFSYTASSSNIVGGLFLQS-----APVHDV 286 Query: 227 PFIDTFDLRYKNYEKRPDPSSQAVMFCLMDVSGSMDQSTKDMAKRFYILLYLFLSRTYKN 286 D F +++ + + + ++D+S SM + K L Sbjct: 287 DQRDIFSFYLFPGKQQKTKAFKREVVFVVDISKSMTGKPLEDVKNAISTALSKLDPGDSF 346 Query: 287 VEVVYIR----HHTQAKEVD--------EHEFFYSQETGGTIVSSALKLMDEVVKERYNP 334 + + T + V E GT + L+ E++ Sbjct: 347 NIITFSNDTALFSTSMESVTSDAVERGIEWMNKNFVVADGTNMLPPLEKAVEMLSNTR-- 404 Query: 335 AQWNIYAAQASDGDNWADDSPLCHEILAKKLL-------PVVRYYSYIEITRRAHQTLWR 387 +DG + + + KK L P + + + Sbjct: 405 -GSIPMIFFVTDG---SVEDERHICDVMKKHLASAGSVFPRIHTFGLGVFCNHY---FLQ 457 Query: 388 EYEHLQSTFDNFAMQHIRDQDDIYPVFRELFHKQNA 423 ++ + + + D I +LF K + Sbjct: 458 MLANISCGQH----ESVYNTDHIEERMDKLFTKALS 489 >UniRef50_UPI00006CDA4D Glutathionylspermidine synthase family protein n=1 Tax=Tetrahymena thermophila RepID=UPI00006CDA4D Length = 1547 Score = 100 bits (248), Expect = 1e-19, Method: Composition-based stats. Identities = 37/208 (17%), Positives = 63/208 (30%), Gaps = 24/208 (11%) Query: 230 DTFDLRYKNYEKRPDPSSQAVMFCLMDVSGSMDQSTKDMAKRFYILLYLFLSRTYKNVEV 289 D F + SS++ L+D SGSM D A + L L + Sbjct: 287 DEFQQKLNQELVDHLNSSRSEFIFLLDRSGSMSGQPIDRACQALTLFLKSLPTDSYFNVI 346 Query: 290 VYIR-----------HHTQAKEVDEHEFFYSQET-GGTIVSSALKLMDEVVKERYNPAQW 337 + +++Q+ E + GGT + LK V + + Sbjct: 347 SFGSSFKLLFPQSEKYNSQSLEKAISNISKYKADLGGTEIYKPLK---NVFVQNKI-QGY 402 Query: 338 NIYAAQASDGDNWADDSPLCHEILAKKLLPVVRYYSYIEITRRAHQTLWREYEHLQSTFD 397 N +DG+ DSP L +K R +S + A Q L + Sbjct: 403 NKQVFLLTDGE---VDSPEQVISLIRKNNKFSRVHSIGFGSG-ADQYLINQSAIAGKG-- 456 Query: 398 NFAMQHIRDQDDIYPVFRELFHKQNATA 425 + + + D+ V + Sbjct: 457 --ISKIVDLKCDLSEVIINMLSMCITPT 482 >UniRef50_A9U149 Predicted protein n=1 Tax=Physcomitrella patens subsp. patens RepID=A9U149_PHYPA Length = 1185 Score = 100 bits (248), Expect = 1e-19, Method: Composition-based stats. Identities = 33/222 (14%), Positives = 58/222 (26%), Gaps = 26/222 (11%) Query: 218 ELRAKIERVPFIDTFDLRYKNYEKRPDPSSQAVMFCLMDVSGSMDQSTKDMAKRFYILLY 277 R + I L P S + + ++D SGSM + A + L Sbjct: 438 VERHPRDGTHAITLTFL----PRFALRPMSSSELIFVVDRSGSMQGTPIKQAGQALELFL 493 Query: 278 LFLSRTYK-NVEVVY-------IRHHTQAKEVD----EHEFFYSQET-GGTIVSSALKLM 324 + + + T E + GGT + SA + + Sbjct: 494 RSIPCEDHYFNIIGFGDNHKTLFPKSTPYNEETLTKGLRYAQALEADMGGTEMMSAFEEI 553 Query: 325 DEVVKERYNPAQWNIYAAQASDGDNWADDSPLCHEILAKKLLPV---VRYYSYIEITRRA 381 E +DG+ W DS + AKK VR +S Sbjct: 554 FEH-----RRRDVPTQIFLLTDGEIWDVDSLIECIRDAKKEEKSDNFVRVFSLGIG-SNV 607 Query: 382 HQTLWREYEHLQSTFDNFAMQHIRDQDDIYPVFRELFHKQNA 423 L + ++ R + + + + Sbjct: 608 SHHLVESVGRAADGYAQLIVEGERMEKKVINMLKSALVPAVT 649 >UniRef50_C6PWL8 von Willebrand factor type A n=1 Tax=Clostridium carboxidivorans P7 RepID=C6PWL8_9CLOT Length = 422 Score = 100 bits (248), Expect = 1e-19, Method: Composition-based stats. Identities = 26/206 (12%), Positives = 62/206 (30%), Gaps = 18/206 (8%) Query: 228 FIDTFDLRYKNYEKRPDPSSQAVMFCLMDVSGSMDQSTKDMAK-RFYILLYLFLSRTYKN 286 F+ ++ + +D SGSM + + + + L + + + Sbjct: 93 FVFGAMFQFLYEISAQKFKKADDIVFAIDTSGSMKNTDPNNERFSAALNLIDNMDKNNRF 152 Query: 287 VEVVYIRHHTQAKEVDE-----------HEFFYSQETGGTIVSSALKLMDEVVKERYNPA 335 + + + + G T + AL+ E +K Sbjct: 153 SMYKFDDTAEKIIPMSQVTKQSREEVSGKLKDMQNPKGNTNMRDALEKAYEEIKSSETKD 212 Query: 336 QWNIYAAQASDGDNWADDSPLCHEILAKKLLPVVRYYSYIEITRRAHQTLWREYEHLQST 395 + N SDG + D S E L + Y+ + ++ +E Sbjct: 213 K-NAMVIMLSDGGDTYDLSKKFDETLKPFKEKNISIYTIGMSNGN-NFSMLKEIAKESGG 270 Query: 396 FDNFAMQHIRDQDDIYPVFRELFHKQ 421 ++++ D+ VF +++ + Sbjct: 271 NYY----NVKEIKDLKNVFNKIYRDR 292 >UniRef50_A8VYD1 Extracellular solute-binding protein, family 5 n=1 Tax=Bacillus selenitireducens MLS10 RepID=A8VYD1_9BACI Length = 978 Score = 100 bits (248), Expect = 1e-19, Method: Composition-based stats. Identities = 17/193 (8%), Positives = 53/193 (27%), Gaps = 18/193 (9%) Query: 244 DPSSQAVMFCLMDVSGSMDQSTKDMAKRF-YILLYLFLSRTYKNVEVVYIRHHTQAKEVD 302 P + + ++D SG+++ D + + + + + +E+ Sbjct: 56 QPGAGLDLMFVLDNSGTVNLDDTDSIRSSTVSDYAENMLPGDRGGIISFNTEADMLQEMS 115 Query: 303 EHEFFYSQE-------TGGTIVSSALKLMDEVVKERYNPAQWNIYAAQASDGDNWADDSP 355 ++ + +GGT +S ++ +E + +DG + + Sbjct: 116 DNRYDLLDALSALPDPSGGTDLSQGMRAANEQFVQ--TKGANKQIMVLITDGADTI--NL 171 Query: 356 LCHEILAKKLL-PVVRYYSYIEIT--RRAHQTLWREYEHLQSTFDNFAMQH---IRDQDD 409 ++ + ++ + + L ++ D Sbjct: 172 AEVYNQVREARMNGITIFTLGLGSLATGLDEALLQDIADQTRGQYRQVPNATVIESVLQD 231 Query: 410 IYPVFRELFHKQN 422 I + Q Sbjct: 232 IRSSLEGMRSPQI 244 >UniRef50_Q28XX9 GA11538 n=5 Tax=Drosophila RepID=Q28XX9_DROPS Length = 1196 Score = 99 bits (247), Expect = 1e-19, Method: Composition-based stats. Identities = 38/297 (12%), Positives = 83/297 (27%), Gaps = 17/297 (5%) Query: 127 LLFEDLALPNLKQNQQRQLTEYKTHRAGYTANGVPANISVVRSLQNSLARRTAMTAGKRR 186 + +D+ P + + + E + N +++ V ++ + Sbjct: 147 DMDKDIGEPLIYVQPKVVVLEPRPEFHNTPVNFSVSSVHVPVNVFDRAPDVIKAIQWS-- 204 Query: 187 ELHALEENLAIISNSEPAQLLEEERLRKEIAEL-----RAKIERVPFIDTFDLRYKNYEK 241 E I ++ + +K ++ +D +D R +++ Sbjct: 205 -----ENLDQIFRDNYKNDPTLSWQFFGSSTGFMRQFPASKWKKDVPVDLYDCRLRSWYM 259 Query: 242 RPDPSSQAVMFCLMDVSGSMDQSTKDMAKRFYILLYLFLSRTYKNVEVVYIRHHT-QAKE 300 +S + LMD SGSM D+AK + L + + T Sbjct: 260 -EAATSPKDIVILMDGSGSMLGQRLDIAKHVVNTILDTLGTNDFVNIFTFDKEATLGNIR 318 Query: 301 VDEHEFFYSQETGGTIVSSALKLMDEVVKER---YNPAQWNIYAAQASDGDNWADDSPLC 357 + ++AL E+++E AQ N DG + Sbjct: 319 ELKEGIENFGPKSIANYTAALTRAFEILEEAKSTSRGAQCNQAIMIIGDGAPENNREVFE 378 Query: 358 HEILAKKLLPVVRYYSYIEITRRAHQTLWREYEHLQSTFDNFAMQHIRDQDDIYPVF 414 VR ++Y+ A+ R + ++ + Sbjct: 379 LHNWRDPPYKPVRVFTYLIGKEVANWDDIRWMACENQGYYVHLSDTAEVREMVLNYI 435 >UniRef50_B2VYM4 von Willebrand domain containing protein n=3 Tax=Leotiomyceta RepID=B2VYM4_PYRTR Length = 906 Score = 99 bits (247), Expect = 2e-19, Method: Composition-based stats. Identities = 26/215 (12%), Positives = 57/215 (26%), Gaps = 20/215 (9%) Query: 223 IERVPFIDTFDLRYKNYEKRPDPSSQAVMFCLMDVSGSMDQSTKDMAKRFYILLYLFLSR 282 + L K + + D SGSM + D+AK+ + L Sbjct: 267 ENHPTIANHRALMATLVPKFSLRPETPEIVFVCDRSGSMQ-TAIDLAKQALQVFLKSLPI 325 Query: 283 TYKNVEVVY-----------IRHHTQAKEVDEHEFFYSQET-GGTIVSSALKLMDEVVKE 330 K + + + + ++ + GGT + L+ ++ Sbjct: 326 GVKFNICSFGNTHSFLWPKSVTYSQETLDLAINHVNSMTANYGGTEMLQPLQA---TIEN 382 Query: 331 RYNPAQWNIYAAQASDGDNWADDSPLCHEIL-AKKLLPVVRYYSYIEITRRAHQTLWREY 389 RY + +DG+ W + + +R ++ +H L Sbjct: 383 RYKDMALD--IMLLTDGEIWRQQQLFSYLNQSVLESKDPIRVFTLGVGMGVSH-ALIEGI 439 Query: 390 EHLQSTFDNFAMQHIRDQDDIYPVFRELFHKQNAT 424 + F + I + + Sbjct: 440 AKAGNGFSQTVGDGEKMDKKIVRMLKGALSPHITD 474 >UniRef50_C8SCW7 Putative uncharacterized protein n=1 Tax=Ferroglobus placidus DSM 10642 RepID=C8SCW7_FERPL Length = 403 Score = 99 bits (247), Expect = 2e-19, Method: Composition-based stats. Identities = 43/323 (13%), Positives = 104/323 (32%), Gaps = 19/323 (5%) Query: 105 ASQDGEGQDEFVFQISKDEYLDLLFEDLALPNLKQNQQRQLTEYKTHRAGYTANGVPANI 164 ++ ++ S + ++ + L + ++L E G T + Sbjct: 88 EAKSKAWEEIKEKLRSGELKVEDINFKELLDYFFEEALKELIEMG-IIEGVTKRFFRRKV 146 Query: 165 SVVRSLQNSLARRTAMTAGKRRELHALEENLAIISNSEPAQLLEEERLRKEIAELRAKIE 224 R + +A++ K + + E +S +L+E + + Sbjct: 147 KFSRQAERIIAQKVMKEVSKEAKGYYAESEGETLSYIPGYELVEYDEYLHSYDLIDIPET 206 Query: 225 RVPFIDTFDLRYKN---YEKRPDPSSQAVMFCLMDVSGSMDQSTKDMAKRFYILLYLFLS 281 + D + + P + L+DVS SM A + L + + Sbjct: 207 MIRAAKNEDFEIREKDIVSRNPKKVGKRHFVMLIDVSDSMRGKKIVGAIEAALALKMSIR 266 Query: 282 RTYKNVEVVYIRHHTQAKEVDEHEFFYSQETGGTIVSSALKLMDEVVKERYNPAQWNIYA 341 + + ++EV + + +++ E + G T ++ ALK ++ + Y Sbjct: 267 KGFDDLEV--FVFNHRTEKIREGDIVNVDVEGRTDIALALKTARNALRGKDGA----KYV 320 Query: 342 AQASDGDNWADDSPL------CHEILAKKLLPVVRYYSYIEITRRAHQTLW--REYEHLQ 393 +DG+ A +PL AK + + + + + R + Sbjct: 321 ILITDGEPTASYNPLIPPWKMAVIESAKLKDEDINL-NVLMLNDDSRFYALCERMLKAAG 379 Query: 394 STFDNFAMQHIRDQDDIYPVFRE 416 + + ++ IY +R+ Sbjct: 380 KGSIFYFPNPLNLKNYIYSKYRK 402 >UniRef50_C0M4X9 Inter-alpha-trypsin inhibitor heavy chain H4 (Fragment) n=1 Tax=Nilaparvata lugens RepID=C0M4X9_NILLU Length = 315 Score = 99 bits (247), Expect = 2e-19, Method: Composition-based stats. Identities = 34/283 (12%), Positives = 76/283 (26%), Gaps = 59/283 (20%) Query: 196 AIISNSEPAQLLEEERLRKEIAELRAKIERVPFIDTFDLRYKNYEKRPDPSSQAVMFCLM 255 E ++ + RL E R K I+ + + P + + ++ Sbjct: 7 DYEQQKEISKDGLKGRLIIEYDVDRKKHPSQILIEDGHFVHF-FAPAELPPLRKQVVFVL 65 Query: 256 DVSGSMDQSTKDMAKRFYILLYLFLSRTYKNVEVVYIRH--------------------- 294 D+SGSM K + + L+ V++ + Sbjct: 66 DISGSMFGEKIKQLKDAMLKILSDLNPQDHFSIVLFSDNAYVWSKAKTAVMKKILDEGFY 125 Query: 295 ----------HTQAKEVDEHEFFY----------SQETGGTIVSSALKLMDEVVKE---- 330 E+ + + T T + L+ ++VKE Sbjct: 126 NLDNETLAILDDHRNEILQATPDNVKTAKEFVELIKPTTSTNIIDGLRKGLKLVKEGKET 185 Query: 331 -RYNPAQWNIYAAQASDGD-NWADDSPLCHEI----LAKKLLPVVRYYSYIEITRRAHQT 384 +DG+ N P+ L ++L + ++ + A T Sbjct: 186 LDTTKEPSQPIMFFLTDGEPNVDLTDPVEIVNETSSLNEQLKTPIYSLAFGQG---ADIT 242 Query: 385 LWREYEHLQSTFDNFAMQHIRDQDDIYPVFRELFHKQNATAKG 427 ++ F ++I + D + + ++ Sbjct: 243 FLKKLSKANHGFA----RNIYEGSDATLQLNNFYKEISSPLLA 281 >UniRef50_Q1DFU7 von Willebrand factor type A domain protein n=2 Tax=Cystobacterineae RepID=Q1DFU7_MYXXD Length = 422 Score = 99.6 bits (246), Expect = 2e-19, Method: Composition-based stats. Identities = 29/194 (14%), Positives = 53/194 (27%), Gaps = 13/194 (6%) Query: 234 LRYKNYEKRPDPSSQAVMFCLMDVSGSMDQSTKDMAKRFYILLYLFLSRTYKNVEVVYIR 293 + K + ++D SGSM+ A+R L L + + Y Sbjct: 32 MELKARPAETGQRVPVSLALVLDRSGSMNGQKLADARRAATELVQRLKPEDRLAFIDYGT 91 Query: 294 H---------HTQAKEVDEHEFFYSQETGGTIVSSALKLMDEVVKERYNPAQWNIYAAQA 344 +A+E Q+ G T +S AL ++ + + Sbjct: 92 DVRVQPSRRMTEEAREELLTLISGLQDDGSTNISGALDAAANALRPHMREYRVSRAI-LL 150 Query: 345 SDGDNWAD--DSPLCHEILAKKLLPVVRYYSYIEITRRAHQTLWREYEHLQSTFDNFAMQ 402 SDG P + + + + + +TL R F F Sbjct: 151 SDGQPTTGIVSEPGLLDQVRQLRRDGITVSALGVGR-DYQETLMRGMAEQGGGFSGFIDD 209 Query: 403 HIRDQDDIYPVFRE 416 R + + Sbjct: 210 SARLAEVFSRELDQ 223 >UniRef50_A6GC99 Putative uncharacterized protein n=1 Tax=Plesiocystis pacifica SIR-1 RepID=A6GC99_9DELT Length = 546 Score = 99.6 bits (246), Expect = 2e-19, Method: Composition-based stats. Identities = 32/227 (14%), Positives = 58/227 (25%), Gaps = 23/227 (10%) Query: 221 AKIERVPFIDTFDLRYKNYEKRPD-PSSQAVMFCLMDVSGSMDQSTKDMAKRFYILLYLF 279 A+ PF +R ++ P + ++D SGSM AK+ + L Sbjct: 102 AEDAGQPFEMPAIIRLSADDEAGQGPRPGLDLAIVLDRSGSMGGDKLRFAKQAGLDLVNR 161 Query: 280 LSRTYKNVEVVYIRH----------HTQAKEVDEHEFFYSQETGGTIVSSALKLMDEVVK 329 L + + Y EV + Q G T + AL + + + Sbjct: 162 LDEQDRVTLISYDDTVTPLSNLQRVDDDGIEVLRRQLLDIQVGGTTALGPALFMGLQRLA 221 Query: 330 ---------ERYNPAQWNIYAAQASDG-DNWADDSPLCH-EILAKKLLPVVRYYSYIEIT 378 + SDG N + P +A+ V + Sbjct: 222 APEPFGPQTRTEARHDRLRHVILLSDGIANVGETRPEVIGGRVAEHFGGGVSVSTLGMGL 281 Query: 379 RRAHQTLWREYEHLQSTFDNFAMQHIRDQDDIYPVFRELFHKQNATA 425 ++ L +F + L + Sbjct: 282 -DYNEDLMTRIADEGGGRYHFIEDAESIPAMLGDELAGLTATVASEV 327 >UniRef50_B8HSI1 von Willebrand factor type A n=8 Tax=Cyanobacteria RepID=B8HSI1_CYAP4 Length = 589 Score = 99.6 bits (246), Expect = 2e-19, Method: Composition-based stats. Identities = 28/260 (10%), Positives = 65/260 (25%), Gaps = 35/260 (13%) Query: 187 ELHALEENLAIISNSEPAQLLEEERLRKEIAELRAKI--------ERVPFIDTFDLRYKN 238 E A E+ + + + + LR + P D++ Sbjct: 332 EKAAAEQFIEYLRSPAAQAIATNLGLRSGVPGTPLGAKFTAEFGVNPQPKYDSYRPPQPE 391 Query: 239 YEKRPDPS------SQAVMFCLMDVSGSMDQSTKDMAKRFYILLYLFLSRTYKNVEVVYI 292 S +++ ++D SGSM + LS + + + Sbjct: 392 VVTTMLKSWSDFAKKPSLVVIVVDTSGSMAGEKLANVQNTLNTYINGLSPQDQVALMRFS 451 Query: 293 RHHTQAKEVD---------EHEFFYSQETGGTIVSSALKLMDEVVKERYNPAQWNIYAAQ 343 VD + G T + A + + N Sbjct: 452 SDVGTPVVVDGTPAGRDRGLQFISSLRANGNTHLYDATLAARNWLTQNLRSDAIN-AVLV 510 Query: 344 ASDGDNWAD-DSPLCHEILAKKLL----PVVRYYSYIEI-TRRAHQTLWREYEHLQSTFD 397 +DG++ S +K + +++ ++ ++ + Sbjct: 511 LTDGEDTGSAISLEQLGPELQKSGFNSDQRISFFTVGYGEEGEFDPQALQQIANVNGGYY 570 Query: 398 NFAMQHIRDQDDIYPVFREL 417 + D I + +L Sbjct: 571 S-----KGDPASIGRLMADL 585 >UniRef50_Q24CQ9 von Willebrand factor type A domain containing protein n=1 Tax=Tetrahymena thermophila SB210 RepID=Q24CQ9_TETTH Length = 856 Score = 99.2 bits (245), Expect = 3e-19, Method: Composition-based stats. Identities = 33/194 (17%), Positives = 60/194 (30%), Gaps = 23/194 (11%) Query: 245 PSSQAVMFCLMDVSGSMDQSTKDMAKRFYILLYLFLSRTYKNVEVVY-----------IR 293 SS++ L+D SGSM+ A L L + ++ Sbjct: 315 ESSRSEFIFLLDRSGSMNGRPIKKATEALNLFLKSLPPNSYFNVYSFGTRYVPMFPNSVQ 374 Query: 294 HHTQAKEVDEHEFFYSQET-GGTIVSSALKLMDEVVKERYNPAQWNIYAAQASDGDNWAD 352 + ++ E+ + + G T + S L + EV ++ +N +DG Sbjct: 375 YTGKSLEIALKKVKNFKANLGRTDILSPLTNIFEVQEK---INGYNKQIFLLTDG---GV 428 Query: 353 DSPLCHEILAKKLLPVVRYYSYIEITRRAHQTLWREYEHLQSTFDNFAMQHIRDQDDIYP 412 + L KK R S + A Q L + I ++D+ Sbjct: 429 KNRDKVIRLIKKNNKNSRINSIGFGSG-ADQHLINTSAIAGKG----ISKIIDMEEDLSE 483 Query: 413 VFRELFHKQNATAK 426 V E+ + Sbjct: 484 VVIEMLGNCITPSL 497 >UniRef50_A8SU73 Putative uncharacterized protein n=1 Tax=Coprococcus eutactus ATCC 27759 RepID=A8SU73_9FIRM Length = 550 Score = 99.2 bits (245), Expect = 3e-19, Method: Composition-based stats. Identities = 36/250 (14%), Positives = 66/250 (26%), Gaps = 22/250 (8%) Query: 184 KRRELHALEENLAIISNSEPAQLLEEERLRKEIAELRAKIERVPFIDTFDLRYKNYEKRP 243 EL ++ + + E ++ ++ A+ + YK K Sbjct: 312 TAAELEGVKLVVDYCKSDEMQKIAAQKGFNANDDYTSAEEFSGAQVTQGLKTYK---KTK 368 Query: 244 DPSSQAVMFCLMDVSGSMDQSTKDMAKRFYILLYLFLSRTYKNVEVVYIR---------- 293 D + + D SGSMD + K +++ V Y Sbjct: 369 DNGKDIIAVFVADCSGSMDGDPMNQLKNSLTNGAQYINDNNYVGLVSYSNSVTIEVPIAQ 428 Query: 294 HHTQAKEVDEHEFFYSQETGGTIVSSALKLMDEVV-KERYNPAQWNIYAAQASDGDNWAD 352 + + +GGT A+ + +++ + + SDG Sbjct: 429 FDLNQRSYFQGAVNNLIASGGTASYDAVVVAVKMITEAKAQHPDAKCMLFLLSDGYANNG 488 Query: 353 DSPLCHEILAKKLLPVVRYYSYIEITRRAHQTLWREYEHLQSTFDNFAMQHIRDQDDIYP 412 S + V Y A + A D DDI Sbjct: 489 YSMDEITSALRTSGIPVYTIGYG---DDADTGELARLSGINE-----AASINADSDDIIY 540 Query: 413 VFRELFHKQN 422 + LF+ Q Sbjct: 541 KIKSLFNSQL 550 >UniRef50_UPI0001757D5D PREDICTED: similar to AGAP009579-PA n=1 Tax=Tribolium castaneum RepID=UPI0001757D5D Length = 1056 Score = 99.2 bits (245), Expect = 3e-19, Method: Composition-based stats. Identities = 36/324 (11%), Positives = 81/324 (25%), Gaps = 35/324 (10%) Query: 122 DEYLDLLFEDLALPNLKQNQQRQLTEYKTHRAGYTANGVPANISVVRSLQNSLARRTAMT 181 + ++L AL + + TH + +++ V ++ + Sbjct: 110 ETKINLAQLSEALKRNENMYLEMVLNDDTHFYNLAVDTSRSSVHVPTNIFDRHEEAAYAI 169 Query: 182 AGKRRELHALEENLAIISNSEPAQLLEEERLRKEIAELRAKIERVPFIDTFDLRYKNYEK 241 + E + ++ + AK D FD R + + Sbjct: 170 QWSEK---LDEIFVRNYNSDPALSWQYFGSTSGIMRHYPAKKWPNIEKDEFDCRVRTW-Y 225 Query: 242 RPDPSSQAVMFCLMDVSGSMDQSTKDMAKRFYILLYLFLSRTYKNVEVVYIRHHTQ---- 297 + + L+D SGSMD + +A + S + Y T Sbjct: 226 IEAATCTKDVIILVDNSGSMDGMGRHIASLTVNTILDTFSNNDYINILYYSNQTTNYTIP 285 Query: 298 ------------AKEVDEHEFFYSQETGGTIVSSALKLMDEVVKERY------------- 332 + + + +G T AL++ ++++ Sbjct: 286 CFRNLLVQATPENIVLFKEAIRHLGPSGKTDFPQALQMAFDILENYREIRGCNNEEIDEE 345 Query: 333 -NPAQWNIYAAQASDGDNWADDSPLCHEILAKKLL-PVVRYYSYIEITRRAHQTLWREYE 390 N +DG + + VR ++Y+ + R Sbjct: 346 GKSKACNQAIMLITDGISRNFSDIVMRNNQLDGGKTIPVRIFTYLIGKEVTNVEEIRWMA 405 Query: 391 HLQSTFDNFAMQHIRDQDDIYPVF 414 F + + Sbjct: 406 CANRGFYTQVQTLEQVTSAVLQYI 429 >UniRef50_D0LJL4 Myxococcales GC_trans_RRR domain protein n=1 Tax=Haliangium ochraceum DSM 14365 RepID=D0LJL4_HALO1 Length = 602 Score = 99.2 bits (245), Expect = 3e-19, Method: Composition-based stats. Identities = 31/237 (13%), Positives = 55/237 (23%), Gaps = 55/237 (23%) Query: 241 KRPDPSSQAVMFCLMDVSGSMD-QSTKDMAKRFYILLYLFLSRTYKNVEVVYIRHHTQAK 299 + ++D SGSM + D ++ LL + + V Y Sbjct: 123 PDDIEPRPLDLVVVVDTSGSMATDARMDYVRQGLHLLVDAVDEDDRLALVSYQSFAEVHA 182 Query: 300 EVD-----------------------------------EHEF--------FYSQETGGTI 316 E+ + Q GGT Sbjct: 183 ELPALPVEETPEEPTEPTDPVGEPTDPPADPDEDPVDEREAWRSEMHALVDTLQPGGGTN 242 Query: 317 VSSALKLMDEVVKERYN--PAQWNIYAAQASDG-DNWADDSPLCHEILAK---KLLPVVR 370 + L+ E+ KE P + SDG L++ + + Sbjct: 243 IYEGLERGFEIAKEARVNHPDRAQRVI-LLSDGLATEGITDSASIIALSEAFIEGGMGLT 301 Query: 371 YYSYIEITRRAHQTLWREYEHLQSTFDNFAMQHIRDQDDIYPVFRELFHKQNATAKG 427 + L R + F ++ + F + ATA Sbjct: 302 TVGVG---ASFNVELMRGLAERGAGNFYFVEDPEAVREVFTEEL-DYFAEPLATAVS 354 >UniRef50_A0M6V8 Membrane protein containing von Willebrand factor(VWA) type A domain n=55 Tax=cellular organisms RepID=A0M6V8_GRAFK Length = 335 Score = 99.2 bits (245), Expect = 3e-19, Method: Composition-based stats. Identities = 35/250 (14%), Positives = 59/250 (23%), Gaps = 42/250 (16%) Query: 212 LRKEIAELRAKIERVPFIDTFDLRYKNYEKRPDPSSQAVMFCLMDVSGSM-----DQSTK 266 L+ + LR + R + + + + +DVS SM + Sbjct: 55 LKPVLFLLRLLALSAIIMAMARPRSVDVSTQTSSTQGIDIVMAIDVSASMLARDLQPNRL 114 Query: 267 DMAKRFYILLYLFLSRTYKNVEVVYIRHH------TQAKEVDEHEFFYSQETG----GTI 316 D K + VVY T K + + GT Sbjct: 115 DATKNVAEEFIQD-RPGDRIGLVVYAGESFTKTPITSDKAIVLDALEDIEYNNVLENGTA 173 Query: 317 VSSALKLMDEVVKERYNPAQWNIYAAQASDGDNWAD-DSPLCHEILAKKLLPVVRYYSYI 375 + S L +K+ + +DG N A P LA + V Sbjct: 174 IGSGLATAVNRIKDS---DAESKVIILLTDGVNNAGFIDPSTASELAVEFGIKVYTIGVG 230 Query: 376 EIT-----------------RRA----HQTLWREYEHLQSTFDNFAMQHIRDQDDIYPVF 414 + L +E F + ++IY Sbjct: 231 SNGMALSPVGVNPANGRLRFGNVQVEIDEDLLKEIAAATGGKY-FRATNNEKLEEIYAEI 289 Query: 415 RELFHKQNAT 424 L + Sbjct: 290 DSLEKTEIEE 299 >UniRef50_A9L314 Outer membrane adhesin like proteiin n=5 Tax=Shewanella RepID=A9L314_SHEB9 Length = 1215 Score = 98.8 bits (244), Expect = 3e-19, Method: Composition-based stats. Identities = 39/354 (11%), Positives = 82/354 (23%), Gaps = 31/354 (8%) Query: 88 IERPQGGGGGSGSGQGQASQDGEGQDEFVFQISKDEYLDLLFEDLALPNLKQNQQRQLTE 147 G + S + + + Y + ++ T+ Sbjct: 150 PNGAGQGKDHNMLTDTLGSGYTLAHEWGHY--AYGVYDEYKGNAVSGAANATLTTDVATD 207 Query: 148 YKTHRAGYTANGVPANISVVRSLQNSLARRTAMTAGKRRELHALEENLAIISNSEPAQLL 207 NG ++ + RTA + + + Sbjct: 208 SIMSNQWQARNGDMKWLNHSTASNIGDVNRTAQGRVYGKSAWEVLAQDVKDDPKSGRKTA 267 Query: 208 EEERLRKEIAELRAKIERVPFIDTFDLRYKNYEKRPDP-----SSQAVMFCLMDVSGSMD 262 + R R A P +L + R M +MD SGSM Sbjct: 268 QPTRTRYTTLANNAPDANNP--VKKELPAAQFSCRDQLDFVWVEGDIDMQIVMDRSGSMF 325 Query: 263 QSTKDMAKRFYILLYLFLSRTYK-NVEVVYIRHHTQ---------------AKEVDEHEF 306 S D AK+ +L + V + + K+ + Sbjct: 326 GSPIDNAKQAAKILVDATAEGSTAMGLVSFSGRSSVKQDFAMQKMPKPDNGVKQALKGAI 385 Query: 307 FYSQETGGTIVSSALKLMDEVVK--ERYNPAQWNIYAAQASDG-DNWADDSPLCHEILAK 363 G T + +L + + + + +DG DN + S + Sbjct: 386 DNIYANGSTALFDGSQLALDNLSAYQASAASGAPGVVFVLADGDDNNSIKSESSVITAYQ 445 Query: 364 KLLPVVRYYSYIEITRRAHQTLWREYEHLQSTFDNFAMQHIRDQDDIYPVFREL 417 + + Y + + + + + D + + Sbjct: 446 NANVPIFSFGYGSASPT---GPLVTMANATGGKYFSSPTTLAEIIDAFLQANAI 496 >UniRef50_B9ML47 YD repeat protein n=1 Tax=Anaerocellum thermophilum DSM 6725 RepID=B9ML47_ANATD Length = 3027 Score = 98.8 bits (244), Expect = 3e-19, Method: Composition-based stats. Identities = 31/247 (12%), Positives = 69/247 (27%), Gaps = 21/247 (8%) Query: 185 RRELHALEENLAIISNSEPAQLLEEERLRKEIAELRAKIERVPFIDTFDLRYKNYEKRPD 244 EL+ +EEN ++ + + E + + ++ + L Sbjct: 705 EAELNGVEENNLMLYYVNYDKKILEPLEDVVVDTVYNRVSGKTEHFSTFLLGDKNMPVDL 764 Query: 245 PSSQAVMFCLMDVSGSMDQSTKDMAK-RFYILLYLFLSR-TYKNVEVVYIRHHTQAKEVD 302 S+ + ++D SGSM + + + + + V + + + Sbjct: 765 --SKVDIVFVLDNSGSMSSNDPNYYRIEATKKFIQNIDELNNRVGLVDFDSSVSVRSNLT 822 Query: 303 EHEFFYSQE-------TGGTIVSSALKLMDEVVKERYNPAQWNIYAAQASDG-DNWADDS 354 + Q G T + LK + + + SDG N Sbjct: 823 SDKSKLLQALNAMRWTGGSTNIGGGLKAALGLFDQEQSK----KIIVLLSDGYHNTGIHP 878 Query: 355 PLCHEILAKKLLPVVRYYSYIEITRRAHQTLWREYEHLQSTFDNFAMQHIR-DQDDIYPV 413 L K+ VV + + + L + + Q+D+ Sbjct: 879 NDVLPELIKQE-IVVNTIALGK---DCDRELLHDIADKTKGGYFYVDNTGGLSQEDVDKQ 934 Query: 414 FRELFHK 420 ++ K Sbjct: 935 IELIYEK 941 >UniRef50_UPI000186D791 calcium channel, putative n=3 Tax=Neoptera RepID=UPI000186D791 Length = 652 Score = 98.8 bits (244), Expect = 3e-19, Method: Composition-based stats. Identities = 29/249 (11%), Positives = 60/249 (24%), Gaps = 26/249 (10%) Query: 193 ENLAIISNSEPAQLLEEERLRKEIAELRAKIERVPFIDTFDLRYKNYE-KRPDPSSQAVM 251 E +S + ++ ++ +S + Sbjct: 136 EMDPSLSWQYFGSSSG---FLRRYPAIKWPPNEGLLEKYQFHDFRTSSWYIDAATSSKDI 192 Query: 252 FCLMDVSGSMDQSTKDMAKRFYILLYLFLSRTYKNVEVVYIRHHTQAKEVDEH------- 304 L+D S SM K +AK ++ L + T+ + Sbjct: 193 VILVDSSSSMGGKKKGIAKAIVNIILDTLGNNDFVNIYRFSESATEIVPCFKDVLVQATA 252 Query: 305 --------EFFYSQETGGTIVSSALKLMDEVVKERYNPA---QWNIYAAQASDGDNWADD 353 F + + G +SAL E++ Q N +DG Sbjct: 253 ENIRELRIAFDFVKYEGSANFTSALVTGFEILHRYNRTGQGCQCNQAIMLITDG---PSS 309 Query: 354 SPLCHEILAKKLLPVVRYYSYIEITRRAHQTLWREYEHLQSTFDNFAMQHIRDQDDIYPV 413 S VR ++Y+ ++Q + Q+ + Sbjct: 310 SYKEIFKQYNWPHMPVRMFTYLVGKDGSNQEDMNWMACANKGYFAKVQNSEDAQEKVLQY 369 Query: 414 FRELFHKQN 422 + + Sbjct: 370 I-AVLARPM 377 >UniRef50_Q11Y10 Possible outer membrane protein n=1 Tax=Cytophaga hutchinsonii ATCC 33406 RepID=Q11Y10_CYTH3 Length = 1313 Score = 98.8 bits (244), Expect = 4e-19, Method: Composition-based stats. Identities = 27/222 (12%), Positives = 62/222 (27%), Gaps = 24/222 (10%) Query: 210 ERLRKEIAELRAKIERVPFIDTFDLRYKNYEKRPDPSSQAVMFCLMDVSGSMDQSTKDMA 269 ++L +I L + + K + +D+S SM + +A Sbjct: 51 DKLENQITTLTRESVSIHENGIEQQVVKVVNPAAVKPKSISLVLTIDISESMQKQYMPLA 110 Query: 270 KRFYILLYLFLSRTY------KNVEVVYIRHH-TQAKEVDEHEFFYSQETGGTIVSSALK 322 K + L +V +I T+ + GGT + Sbjct: 111 KNAAAAIVNKLPLDISECAVTSFNDVSFINTDFTRDRFKLLQSIQTLVPAGGTDYNKGFI 170 Query: 323 L----MDEVVKERYNPAQWNIYAAQASDGDNWADDSPLCHEILAKKLLPVVRYYSYIEIT 378 +++K +DG + D +P AK + V + Sbjct: 171 KSNAGGLDILK----KGLHEKVLIFLTDG--YGDVNPTEIIQQAKSIGAKVYVITLGMSA 224 Query: 379 RRAHQTLWREYEHLQSTFDNFAMQHIRDQDDIYPVFRELFHK 420 + + +++ + +I V+ + ++ Sbjct: 225 P----EELKRIVTATNGSYY---ENVISEQEINAVYMSILYR 259 >UniRef50_C1XWJ2 von Willebrand factor type A-like protein n=1 Tax=Meiothermus silvanus DSM 9946 RepID=C1XWJ2_9DEIN Length = 308 Score = 98.8 bits (244), Expect = 4e-19, Method: Composition-based stats. Identities = 34/209 (16%), Positives = 49/209 (23%), Gaps = 25/209 (11%) Query: 243 PDPSSQAVMFCLMDVSGSMDQSTK-----DMAKRFYILLYLFLSRTYKNVEVVYIRHH-- 295 P P QA + +DVSGSM D AK + K V + Sbjct: 80 PTPDEQAGVVLAIDVSGSMMADDLKPSRLDAAKAAARSFVERMPAGVKVGLVSFAAGAVL 139 Query: 296 ----TQAKEVDEHEFFYSQETGGTIVSSALKLMDEVVKERYNPA-QWNIYAAQASDGDNW 350 T + + T + L + N SDG N Sbjct: 140 ESGLTADHQGVIERIDLLERRANTAIGEGLLESLKAFPTGANHQVAVPATVILLSDGRNR 199 Query: 351 ADDSPLCHEILAKKLLPVVRYYSYIEITRR-------AHQTLWREYEHLQSTFDNFAMQH 403 +P AK+ V + R + FA Sbjct: 200 IGIAPQEAAQEAKRRGVRVYTIGVGSDDPNASVDWAGFDEAELRGIAEVTGGRY-FAADS 258 Query: 404 IRDQDDIYPVFR-----ELFHKQNATAKG 427 +IY +L Q + Sbjct: 259 ADRLQEIYRELGSQIGWKLERTQVSGLVA 287 >UniRef50_A0CCS0 Chromosome undetermined scaffold_168, whole genome shotgun sequence n=6 Tax=Paramecium tetraurelia RepID=A0CCS0_PARTE Length = 981 Score = 98.8 bits (244), Expect = 4e-19, Method: Composition-based stats. Identities = 23/196 (11%), Positives = 49/196 (25%), Gaps = 23/196 (11%) Query: 242 RPDPSSQAVMFCLMDVSGSMDQSTKDMAKRFYILLYLFLSRTYKNVEVVYIR-------- 293 ++ +D SGSM AK+ IL L + + + Sbjct: 341 ENQAINRGTYLFFIDRSGSMSGGRIKKAKQSLILFLRSLPDNCRFNIISFGTMFRSLWSD 400 Query: 294 ---HHTQAKEVDEHEFFYSQET-GGTIVSSALKLMDEVVKERYNPAQWNIY-AAQASDGD 348 + + + GT + + + + +Y ++ +DG+ Sbjct: 401 SKQYSQDTLDEAIKHVNAMEANMQGTEIFKPFQDV--IYNNQYGKSKTTTLNIFLLTDGE 458 Query: 349 NWADDSPLCHEILAKKLLPVVRYYSYIEITRRAHQTLWREYEHLQSTFDNFAMQHIRDQD 408 + K V E + L + + + F + Sbjct: 459 -VDVFPIIDLVKRNNKAETRVYTLGIGEGCSQY---LIKNLADVGNGKFQFVADD----E 510 Query: 409 DIYPVFRELFHKQNAT 424 DI +L Sbjct: 511 DINAKVIDLLEDSMTP 526 >UniRef50_Q7ULL3 Putative uncharacterized protein n=1 Tax=Rhodopirellula baltica RepID=Q7ULL3_RHOBA Length = 484 Score = 98.8 bits (244), Expect = 4e-19, Method: Composition-based stats. Identities = 32/206 (15%), Positives = 55/206 (26%), Gaps = 13/206 (6%) Query: 224 ERVPFIDTFDLRYKNYEKRPDPSSQAVMFCLMDVSGSMDQSTKDMAKRFYILLYLFLSRT 283 E+ L + + + ++D SGSM AK LS Sbjct: 61 EKQTNHLRIALTGFELKSAEERP-PVNVCLVLDHSGSMSGQKLARAKEAAEAAIDRLSDD 119 Query: 284 YKNVEVVYIRHHT--------QAKEVDEHEFFYSQETGGTIVSSALKLMDEVVKERYNPA 335 V+Y + T + + + Q T + + + V++ Sbjct: 120 DIVSVVLYDSNVTVLVPATKATDRSSIKQKIRGIQAGSSTALFAGVSKGAAEVRKFLADE 179 Query: 336 QWNIYAAQASDG-DNWADDSPLCHEILAKKL-LPVVRYYSYIEITRRAHQTLWREYEHLQ 393 Q N SDG N SP E L + L + + + ++ L + Sbjct: 180 QVNRVI-LLSDGLANVGPKSPQELEGLGRSLMKEAISVSTLGLGSG-YNEDLMVALASVG 237 Query: 394 STFDNFAMQHIRDQDDIYPVFRELFH 419 F F L Sbjct: 238 GGNHAFIEDADSLVSVFNQEFDGLLS 263 >UniRef50_A4IZW2 Conserved membrane protein with von Willebrand factor type A domain n=20 Tax=Francisella RepID=A4IZW2_FRATW Length = 333 Score = 98.8 bits (244), Expect = 4e-19, Method: Composition-based stats. Identities = 28/214 (13%), Positives = 60/214 (28%), Gaps = 41/214 (19%) Query: 240 EKRPDPSSQAVMFCLMDVSGSMD-----------QSTKDMAKRFYILLYLFLSRTYKNVE 288 + P S + +D+SGSM +S D+ R + + Sbjct: 83 KPVSLPQSGRDLIMAIDLSGSMAIQDMKKANGQMESRFDLVMRVANQFIDT-RKGDRVGL 141 Query: 289 VVYIRHH------TQAKEVDEHEFFYSQ---ETGGTIVSSALKLMDEVVKERYNPAQWNI 339 +++ T + + T + A+ L + +K+ + Sbjct: 142 ILFGTRAYLQTPLTFDIATVKKMLDDASIALPGPQTAIGDAIGLAVKKLKKYPGD---SK 198 Query: 340 YAAQASDGDNWADD-SPLCHEILAKKLLPVVRYYSYIEI---------------TRRAHQ 383 +DG+N + PL +AK+ + + Sbjct: 199 ALILLTDGENNSGTLQPLQAAEIAKQYHIKIYTIGLGGGQMIVETTFGQRLVNTSEDLDT 258 Query: 384 TLWREYEHLQSTFDNFAMQHIRDQDDIYPVFREL 417 T+ + + F Q+ D +Y +L Sbjct: 259 TVLEKIATMTGGKY-FRAQNSSDLKKVYESIDKL 291 >UniRef50_UPI00006CF36E U-box domain containing protein n=1 Tax=Tetrahymena thermophila RepID=UPI00006CF36E Length = 790 Score = 98.4 bits (243), Expect = 4e-19, Method: Composition-based stats. Identities = 22/214 (10%), Positives = 47/214 (21%), Gaps = 37/214 (17%) Query: 225 RVPFIDTFDLRYKNYEKRPDPSSQAVMFCLMDVSGSMDQSTK----------------DM 268 ++ S + C++DVSGSM K D+ Sbjct: 112 LQINRFNDQVKISIKTPEGQQRSACDICCVIDVSGSMSDEAKIKNSKGDIESNGLTILDL 171 Query: 269 AKRFYILLYLFLSRTYKNVEVVYIRHHTQAKEVDE----------HEFFYSQETGGTIVS 318 K + L + V + + + ++ E T + Sbjct: 172 VKHSVKTIINNLDERDRLSLVAFHTNAYKITDLTPMNENGRNHAIKELEKLIPLDSTNIW 231 Query: 319 SALKLMDEVV---KERYNPAQWNIY----AAQASDGDNWADDSPL---CHEILAKKLLPV 368 + EVV +++ +DG + ++ Sbjct: 232 DGIYQALEVVKAGQQQSIQKGEQRVAFSQILLFTDGQPNVIPPRGHLPMLKKYKEENDVN 291 Query: 369 VRYYSYIEITRRAHQTLWREYEHLQSTFDNFAMQ 402 ++ L + F Sbjct: 292 CSISTFGFGY-NLDSELLDQLAIEGRGSFAFIPD 324 >UniRef50_Q5TIE3-4 Isoform 4 of von Willebrand factor A domain-containing protein 5B1 n=2 Tax=Amniota RepID=Q5TIE3-4 Length = 1016 Score = 98.4 bits (243), Expect = 5e-19, Method: Composition-based stats. Identities = 25/194 (12%), Positives = 51/194 (26%), Gaps = 20/194 (10%) Query: 244 DPSSQAVMFCLMDVSGSMDQSTKDMAKRFYILLYLFLSRTYKNVEVVYIR---------- 293 + L+D S SM + K ++ L + + Sbjct: 355 LRKAHGEFIFLIDRSSSMSGISMHRVKDAMLVALKSLMPACLFNIIGFGSTFKSLFPSSQ 414 Query: 294 -HHTQAKEVDEHEFFYSQET-GGTIVSSALKLMDEVVKERYNPAQWNIYAAQASDGDNWA 351 + + + + + GGT + S LK + R +DG A Sbjct: 415 TYSEDSLAMACDDIQRMKADMGGTNILSPLKWVIRQPVHR----GHPRLLFVITDG---A 467 Query: 352 DDSPLCHEILAKKLLPVVRYYSYIEITRRAHQTLWREYEHLQSTFDNFAMQHIRDQDDIY 411 ++ L + R YS+ L + + M+ R Q + Sbjct: 468 VNNTGKVLELVRNHAFSTRCYSFGIG-PNVCHRLVKGLASVSEGSAELLMEGERLQPKMV 526 Query: 412 PVFRELFHKQNATA 425 ++ + Sbjct: 527 KSLKKAMAPVLSDV 540 >UniRef50_Q5TIE3 von Willebrand factor A domain-containing protein 5B1 n=19 Tax=Euteleostomi RepID=VW5B1_HUMAN Length = 1220 Score = 98.4 bits (243), Expect = 5e-19, Method: Composition-based stats. Identities = 25/194 (12%), Positives = 51/194 (26%), Gaps = 20/194 (10%) Query: 244 DPSSQAVMFCLMDVSGSMDQSTKDMAKRFYILLYLFLSRTYKNVEVVYIR---------- 293 + L+D S SM + K ++ L + + Sbjct: 355 LRKAHGEFIFLIDRSSSMSGISMHRVKDAMLVALKSLMPACLFNIIGFGSTFKSLFPSSQ 414 Query: 294 -HHTQAKEVDEHEFFYSQET-GGTIVSSALKLMDEVVKERYNPAQWNIYAAQASDGDNWA 351 + + + + + GGT + S LK + R +DG A Sbjct: 415 TYSEDSLAMACDDIQRMKADMGGTNILSPLKWVIRQPVHR----GHPRLLFVITDG---A 467 Query: 352 DDSPLCHEILAKKLLPVVRYYSYIEITRRAHQTLWREYEHLQSTFDNFAMQHIRDQDDIY 411 ++ L + R YS+ L + + M+ R Q + Sbjct: 468 VNNTGKVLELVRNHAFSTRCYSFGIG-PNVCHRLVKGLASVSEGSAELLMEGERLQPKMV 526 Query: 412 PVFRELFHKQNATA 425 ++ + Sbjct: 527 KSLKKAMAPVLSDV 540 >UniRef50_UPI0001792BA1 PREDICTED: similar to Inter-alpha-trypsin inhibitor heavy chain H4 precursor (ITI heavy chain H4) (Inter-alpha-inhibitor heavy chain 4) (Inter-alpha-trypsin inhibitor family heavy chain-related protein) (IHRP) (Plasma kallikrein sensitive glycoprotein 120) (P... n=1 Tax=Acyrthosiphon pisum RepID=UPI0001792BA1 Length = 821 Score = 98.4 bits (243), Expect = 5e-19, Method: Composition-based stats. Identities = 22/234 (9%), Positives = 53/234 (22%), Gaps = 50/234 (21%) Query: 239 YEKRPDPSSQAVMFCLMDVSGSMDQSTKDMAKRFYILLYLFLSRTYKNVEVVY------- 291 + + + ++DVSGSM+ K + + +++ Sbjct: 264 FAPTDLKPLKTHVIFILDVSGSMNGQKITQVKGAMSQILSEIDSEDFFTLILFSSLAQIW 323 Query: 292 -------------------------------IRHHTQAKEVDEHEFFYSQETGGTIVSSA 320 + Q + + + T + A Sbjct: 324 TINATQNTSNYWDDRGRNLNNFETMGENHFIFSANEQNIQYAKKFIQALEPDSTTNMEDA 383 Query: 321 LKLMDEVV---KERYNPAQW--NIYAAQASDGD-NWADDSPLCHEILAKKLLPVVR-YYS 373 L + K R+ + +DG+ N +P + YS Sbjct: 384 LNKALSIAKLGKMRFKDSAKTPKPIIVFLTDGEMNEGITNPQALMKYVSDINVDNYPIYS 443 Query: 374 YIEITRRAHQTLWREYEHLQSTFDNFAMQHIRDQDDIYPVFRELFHKQNATAKG 427 A ++ + F + D + + ++ Sbjct: 444 LGFGKG-ADIEFLKKLSLNNTGFARVIY----EASDASLQLHNFYKEISSPVLS 492 >UniRef50_Q54DV3 von Willebrand factor A domain-containing protein DDB_G0292016 n=1 Tax=Dictyostelium discoideum RepID=Y2016_DICDI Length = 918 Score = 98.0 bits (242), Expect = 5e-19, Method: Composition-based stats. Identities = 26/206 (12%), Positives = 60/206 (29%), Gaps = 21/206 (10%) Query: 230 DTFDLRYKNYEKRPDPSSQAVMFCLMDVSGSMDQSTKDMAKRFYILLYLFLSRTYKNVEV 289 F +KN ++ L+D SGSM + + A+R ++ L+ +K Sbjct: 280 INFYPSFKNVNPDEV-YQKSEFIFLIDCSGSMSGQSINKARRAMEIIIRSLNEQHKVNIY 338 Query: 290 VYIR-----------HHTQAKEVDEHEFFYSQET-GGTIVSSALKLMDEVVKERYNPAQW 337 + ++ + E+ GGT + + +++ +P ++ Sbjct: 339 CFGSSFNKVFDKSRVYNDETLEIAGSFVEKISANLGGTELLPPM---VDILSSPNDP-EY 394 Query: 338 NIYAAQASDGDNWADDSPLCHEILAKKLLPVVRYYSYIEITRRAHQTLWREYEHLQSTFD 397 +DG+ K R ++Y Q L + Sbjct: 395 PRQVFILTDGE---ISERDKLIDYVAKEANTTRIFTYGIGAS-VDQELVIGLSKACKGYY 450 Query: 398 NFAMQHIRDQDDIYPVFRELFHKQNA 423 + + + + F + Sbjct: 451 EMIKETTNMEKQVMKLLNVAFEPMLS 476 >UniRef50_B8F8Z6 von Willebrand factor type A n=1 Tax=Desulfatibacillum alkenivorans AK-01 RepID=B8F8Z6_DESAA Length = 480 Score = 98.0 bits (242), Expect = 6e-19, Method: Composition-based stats. Identities = 30/226 (13%), Positives = 58/226 (25%), Gaps = 23/226 (10%) Query: 215 EIAELRAKIERVPFIDTFDLRYKNYEKRPDPSSQAVMFCLMDVSGSMDQSTKDMAKRFYI 274 ++ A +R+ + + + M ++D SGSM AK Sbjct: 59 TQDKIHATGDRMFDLVLTMTADEVLAPEQTKTKPVDMVIVLDRSGSMGGQKVRDAKAAVK 118 Query: 275 LLYLFLSRTYKNVEVVYIRH----------HTQAKEVDEHEFFYSQETGGTIVSSALKLM 324 L L + V Y + GGT + L+ Sbjct: 119 GLVEGLRSQDRFSLVTYSNSVNGGDGLHYLTADKRNSLNWMVDSIPAGGGTNLGGGLEKG 178 Query: 325 DEVVKERYNPAQWNIYAAQASDGD-NWADDSPLCHEILAKKLLPVV--RYYSYIEITRRA 381 V++ P + SDG N P +A + + Sbjct: 179 VGVLRAYGAPDRMGKVI-LISDGQANQGVTDPNQLAAMAALRDDGLVYSVTTVGIGQ-DF 236 Query: 382 HQTLWREYEHLQSTFDNFAMQHIRDQDDIYPVFRELFHKQNATAKG 427 ++ L + + + D F +F ++ + Sbjct: 237 NEQLMATVADGGRGRYYY----LENPGD----FLAVFQEEANWTRA 274 >UniRef50_D2QTD7 von Willebrand factor type A n=1 Tax=Spirosoma linguale DSM 74 RepID=D2QTD7_9SPHI Length = 359 Score = 98.0 bits (242), Expect = 6e-19, Method: Composition-based stats. Identities = 32/213 (15%), Positives = 61/213 (28%), Gaps = 37/213 (17%) Query: 240 EKRPDPSSQAVMFCLMDVSGSMDQSTK-----DMAKRFYILLYLFLSRTYKNVEVVYIRH 294 E R + S + MDVS SM +S A+R R + V++ Sbjct: 104 ELREEQSEGIDIMLAMDVSVSMSESDILPTRLAAARRVAQAFVRG-RRNDRIGLVIFAGE 162 Query: 295 H------TQAKEVDEHEFFYSQET----GGTIVSSALKLMDEVVKERYN----------- 333 T + + GT + AL +++R Sbjct: 163 AFSLCPLTTDYNLLNQYLNDLNDGMIRTSGTAIGDALARCINRMRDRPAASSDTTQAKTE 222 Query: 334 --PAQWNIYAAQASDGDNWADD-SPLCHEILAKKLLPVVRYYSYIEI------TRRAHQT 384 ++ + SDGDN A + P+ LAK + + + + Sbjct: 223 QWKSERSKVIILLSDGDNTAGNLDPITAASLAKAFNIKIYTIAVGQPVASASEASTVDEG 282 Query: 385 LWREYEHLQSTFDNFAMQHIRDQDDIYPVFREL 417 + ++ + F ++ +L Sbjct: 283 ILKKIATIGKGSF-FRAVDSGRLKTVFAQISQL 314 >UniRef50_B2AQN8 Predicted CDS Pa_4_3600 n=1 Tax=Podospora anserina RepID=B2AQN8_PODAN Length = 648 Score = 97.6 bits (241), Expect = 7e-19, Method: Composition-based stats. Identities = 19/199 (9%), Positives = 44/199 (22%), Gaps = 37/199 (18%) Query: 240 EKRPDPSSQAVMFCLMDVSGSMDQST-----------------KDMAKRFYILLYLFLSR 282 + R + +DVSGSM D+ + + L Sbjct: 62 DLRERNHVPLDLVLSIDVSGSMGADAPVPAKNGTEGEHYGLSVLDLVRHAAKTILETLDD 121 Query: 283 TYKNVEVVYIRHHTQAKEVD----------EHEFFYSQETGGTIVSSALKLMDEVV---- 328 + V + +E+ + Q T + ++ + Sbjct: 122 HDRLGIVTFSTSSKVVRELTYMTPANKAKILKQLDALQPLSMTNLWHGIRDGLSLFNNNL 181 Query: 329 ----KERYNPAQWNIYAAQASDGDNWAD-DSPLCHEILAKKLLPVVRYYSYIEITRRAHQ 383 R + +DG + L + +++ Sbjct: 182 KAVNDRRNPGSGRVPALLVLTDGMPNHQCPNQGYVAKLRQWSTLPASIHTFGFGYS-LRS 240 Query: 384 TLWREYEHLQSTFDNFAMQ 402 L + + +F Sbjct: 241 GLLKSIAEVGGGNYSFIPD 259 >UniRef50_C0HF51 Putative uncharacterized protein n=1 Tax=Zea mays RepID=C0HF51_MAIZE Length = 459 Score = 97.6 bits (241), Expect = 7e-19, Method: Composition-based stats. Identities = 23/224 (10%), Positives = 68/224 (30%), Gaps = 21/224 (9%) Query: 203 PAQLLEEERLRKEIAELRAKIERVPFIDTFDLRYKNYEKRPDPSSQAVMFCLMDVSGSMD 262 A + + + + E + ++ ++ + ++D+SGSM Sbjct: 1 MAPHGDIDLAKAYHHVTVSMREHTEKVM---VKLTAPHTGKGDTAPLDIVVVLDISGSMR 57 Query: 263 QSTKDMAKRFY-ILLYLFLS-RTYKNVEVVYIRHHTQAKEVDEHEFFY----------SQ 310 + + K + L R + + + + ++ + Sbjct: 58 GTKLEHMKHAMTRFIIEKLGIRGDRLAIITFESKAHKVFDLSSMLPDQVKKAVAVVEGLK 117 Query: 311 ETGGTIVSSALKLMDEVVKERYNPAQWNIYAAQASDGDNWADDSPLCHEILAKKLLPVVR 370 G T + + L+ +V+K R + SDG ++ L ++ Sbjct: 118 AGGDTNIKAGLEAGLDVLKTRRGHSHNASCIFLMSDGH----ENVDKARTLLDRVGEH-S 172 Query: 371 YYSYIEITRRAHQTLWREYEHLQSTFDNFAMQHIRDQDDIYPVF 414 ++ ++ + L + + ++ D++ + F Sbjct: 173 VVTFGFG-EKSDEQLLYDIAYHSHAGTYHHVREKEDENQLMKAF 215 >UniRef50_C3YRH3 Putative uncharacterized protein (Fragment) n=1 Tax=Branchiostoma floridae RepID=C3YRH3_BRAFL Length = 581 Score = 97.6 bits (241), Expect = 7e-19, Method: Composition-based stats. Identities = 42/329 (12%), Positives = 93/329 (28%), Gaps = 27/329 (8%) Query: 115 FVFQISKDEYLDLLFEDLALP-NLKQNQQRQLTEYKTHRAGYTANGVPANISVVRSLQNS 173 + F+ + + L + P + + A + V L S Sbjct: 212 YEFEFQLEVKMPCLLAGVESPTHSIRVDADPYARNANEVFVTLAEQHTYSEDVQVLLYLS 271 Query: 174 LARRTAMTAGKRRELHA-LEENLAIISNSEPAQLLEEERLRKEIAELRAKIERVPFI--- 229 + A+ + EE + + Q +EE+ ++ LR ++ + Sbjct: 272 DPHKPAIILEHGDMSLSGYEEYVKSRRGFKRLQREKEEKPSSKVDYLRGRLHKDLMHHAA 331 Query: 230 --DTFDLRYKNYEKRPDPSSQAVMFCLMDVSGSMDQSTKDMAKRFYILLYLFLSRTYKNV 287 +F + + R ++D SGSM + A+ +L L Sbjct: 332 IMLSFFPDFSSIPPRDPSKIPGNFIFILDRSGSMSGANIAGARETLLLFLKSLPTCCVFN 391 Query: 288 EVVYIR-----HHT------QAKEVDEHEFFYSQET-GGTIVSSALKLMDEVVKERYNPA 335 V + T Q + + + GGT + S L+ + + Sbjct: 392 IVSFGSSYKPMFSTSVPYTQQNVDKASADIKKMRADMGGTNILSPLQWVF----SAPVTS 447 Query: 336 QWNIYAAQASDGDNWADDSPLCHEILAKKLLPVVRYYSYIEITRRAHQTLWREYEHLQST 395 + +DG + + L +K R ++ +A ++L + Sbjct: 448 GYPRQVFLLTDG---SVSNTGTVIDLVRKNAYNTRCFALGIG-PKASRSLVQGVGSAGGG 503 Query: 396 FDNFAMQHIRDQDDIYPVFRELFHKQNAT 424 + R Q + + + Sbjct: 504 AAELIQEGERIQPKVVACLKRALQPCISD 532 >UniRef50_D0LWF9 von Willebrand factor type A n=1 Tax=Haliangium ochraceum DSM 14365 RepID=D0LWF9_HALO1 Length = 419 Score = 97.6 bits (241), Expect = 7e-19, Method: Composition-based stats. Identities = 22/204 (10%), Positives = 53/204 (25%), Gaps = 12/204 (5%) Query: 234 LRYKNYEKRPDPSSQAVMFCLMDVSGSMDQSTKDMAKRFYILLYLFLSRTYKNVEVVYIR 293 +R + + ++D S SM A + L + + + Sbjct: 20 VRIEAQATESSARMPVNLALVIDRSSSMRGPRLASAIVAARQVVEQLDERDRLSVIAFDA 79 Query: 294 HH----------TQAKEVDEHEFFYSQETGGTIVSSALKLMDEVVKERYNPAQWNIYAAQ 343 +A++ E + GT +++ +K E V+ + + Sbjct: 80 TARTIFGPMSVTDEARQTLEQALAGLRTGVGTNLAAGMKKGAEAVRSGFVRGALSR-LVL 138 Query: 344 ASDGDNW-ADDSPLCHEILAKKLLPVVRYYSYIEITRRAHQTLWREYEHLQSTFDNFAMQ 402 +DG LA+K + + + + L + H ++ Sbjct: 139 LTDGQPSLGITDNDRLCALAQKEADRGVTITTMGLGQGFDDELLADLAHSGRGGFHYLAS 198 Query: 403 HIRDQDDIYPVFRELFHKQNATAK 426 +F + Sbjct: 199 AADIPGAFGRELSGVFAIAATQTE 222 >UniRef50_Q60ED8 Von Willebrand factor type A domain containing protein n=6 Tax=Poaceae RepID=Q60ED8_ORYSJ Length = 801 Score = 97.6 bits (241), Expect = 7e-19, Method: Composition-based stats. Identities = 24/201 (11%), Positives = 53/201 (26%), Gaps = 32/201 (15%) Query: 242 RPDPSSQAVMFCLMDVSGSMDQSTKDMAKRFYILLYLFLSRTYKNVEVVYI----RHHTQ 297 + + ++D SGSM + K L + + + + Sbjct: 325 QKRKVFRNASVFIIDTSGSMQGKPLESVKNAMYTTLSELVQGDYFNIITFNDELHSFSSC 384 Query: 298 AKEVDEHEFFYSQ--------ETGGTIVSSALKLMDEVVKERYNPAQWNIYAAQASDGDN 349 ++V+E ++ GGT + L ++ +N +DG Sbjct: 385 LEQVNEKTIENAREWVNTNFIAEGGTDIMHPLSEAIALLSNSHNA---LPQIFLVTDG-- 439 Query: 350 WADDSPLCHEILAKKLL-------PVVRYYSYIEITRRAHQTLWREYEHLQSTFDNFAMQ 402 + + K+ L P + + + R + Sbjct: 440 -SVEDERNICRTVKEQLATRGSKSPRISTFGLG---SYCNHYFLRMLASIGKGHY----D 491 Query: 403 HIRDQDDIYPVFRELFHKQNA 423 D I + F K ++ Sbjct: 492 AAFDTGSIEGRMVQWFQKASS 512 >UniRef50_Q10JU7 Von Willebrand factor type A domain containing protein, expressed n=17 Tax=Poaceae RepID=Q10JU7_ORYSJ Length = 680 Score = 97.6 bits (241), Expect = 7e-19, Method: Composition-based stats. Identities = 24/201 (11%), Positives = 53/201 (26%), Gaps = 32/201 (15%) Query: 242 RPDPSSQAVMFCLMDVSGSMDQSTKDMAKRFYILLYLFLSRTYKNVEVVYI----RHHTQ 297 + + ++D SGSM + K L + + + + Sbjct: 325 QKRKVFRNASVFIIDTSGSMQGKPLESVKNAMYTTLSELVQGDYFNIITFNDELHSFSSC 384 Query: 298 AKEVDEHEFFYSQ--------ETGGTIVSSALKLMDEVVKERYNPAQWNIYAAQASDGDN 349 ++V+E ++ GGT + L ++ +N +DG Sbjct: 385 LEQVNEKTIENAREWVNTNFIAEGGTDIMHPLSEAIALLSNSHNA---LPQIFLVTDG-- 439 Query: 350 WADDSPLCHEILAKKLL-------PVVRYYSYIEITRRAHQTLWREYEHLQSTFDNFAMQ 402 + + K+ L P + + + R + Sbjct: 440 -SVEDERNICRTVKEQLATRGSKSPRISTFGLG---SYCNHYFLRMLASIGKGHY----D 491 Query: 403 HIRDQDDIYPVFRELFHKQNA 423 D I + F K ++ Sbjct: 492 AAFDTGSIEGRMVQWFQKASS 512 >UniRef50_C6X1I3 BatA (Bacteroides aerotolerance operon) n=1 Tax=Flavobacteriaceae bacterium 3519-10 RepID=C6X1I3_FLAB3 Length = 334 Score = 97.6 bits (241), Expect = 8e-19, Method: Composition-based stats. Identities = 30/234 (12%), Positives = 56/234 (23%), Gaps = 41/234 (17%) Query: 229 IDTFDLRYKNYEKRPDPSSQAVMFCLMDVSGSMDQSTKD-----MAKRFYILLYLFLSRT 283 I R + D + + +DVS SM + K Sbjct: 72 IAMARPRTFTISENNDDTKGIDIMMSVDVSLSMLARDLEPDRLTALKNIAKKFVDK-RPG 130 Query: 284 YKNVEVVYIRHH------TQAKEVDEHEFFYSQE---TGGTIVSSALKLMDEVVKERYNP 334 + V Y T V E GT + L + ++ Sbjct: 131 DRIGLVTYSGEAFTKVPVTSDHAVLLEELENLNPLELQPGTAIGEGLSVAVSHLRHSKAK 190 Query: 335 AQWNIYAAQASDGDNWADDSPLCHEILAKKLLPVVRYYSYIEITRRA------------- 381 + +DG N +++ +R YS T Sbjct: 191 ---SKIIILMTDGVNTIENAMPAQVGAQLAKSNDIRVYSIGIGTNGYALMPTQTDIFGDL 247 Query: 382 ---------HQTLWREYEHLQSTFDNFAMQHIRDQDDIYPVFRELFHKQNATAK 426 + + RE F + +++Y +L + ++K Sbjct: 248 VFTEVEVKIDEPVLREIAQTTGGKY-FRATSNQSLEEVYEEINQLEKSELQSSK 300 >UniRef50_A9RSX3 Predicted protein n=1 Tax=Physcomitrella patens subsp. patens RepID=A9RSX3_PHYPA Length = 1068 Score = 97.6 bits (241), Expect = 8e-19, Method: Composition-based stats. Identities = 33/233 (14%), Positives = 64/233 (27%), Gaps = 28/233 (12%) Query: 212 LRKEIAELRAKIERVPFIDTFDLRYKNYEKRPDPSSQAVMFCLMDVSGSMDQSTKDMAKR 271 L + + L + P + M L+D SGSM + A Sbjct: 245 LGLDQPRALVERHPSHGTHAIALTF-QPRFALQPLRTSEMIFLVDRSGSMMGTQIKQAGE 303 Query: 272 FYILLYLFLS-RTYKNVEVVY-----IRHHTQAKEV------DEHEFFYSQET-GGTIVS 318 L + + V + T + H Q GGT ++ Sbjct: 304 ALELFLRSIPFENHYFNIVGFGSNHNFLFPTSVEYTEDSLKKAVHYAQTIQANMGGTEIA 363 Query: 319 SALKLMDEVVKERYNPAQWNIYAAQASDGDNWADDSPLCHEILAK------KLLPVVRYY 372 +A EV + R +DG W D+ + + + + VR + Sbjct: 364 NAF---FEVFQRRRRN--VPTQIFLLTDGMVW--DAEQLTKSIIEAVDDGARNNSPVRVF 416 Query: 373 SYIEITRRAHQTLWREYEHLQSTFDNFAMQHIRDQDDIYPVFRELFHKQNATA 425 + +H L + +++ R + + + + A Sbjct: 417 TLGVGNAVSH-HLIESVARAGGGYAQLVLENERMEKKVLNMLKAGLTPSVTNA 468 >UniRef50_Q8YW34 All1782 protein n=5 Tax=Cyanobacteria RepID=Q8YW34_ANASP Length = 615 Score = 97.6 bits (241), Expect = 8e-19, Method: Composition-based stats. Identities = 32/206 (15%), Positives = 60/206 (29%), Gaps = 14/206 (6%) Query: 231 TFDLRYKNYEKRPDPSSQAVMFCLMDVSGSMDQSTKDMAKRFYILLYLFLSRTYKNVEVV 290 LR++ E P + ++D SGSM + A + + L VV Sbjct: 22 NILLRFRA-EIPESPRRNLNLSLVIDRSGSMAGAALHHALKAAESVVDQLEPKDILSVVV 80 Query: 291 YI--------RHHTQAKEVDEHEFFYSQETGGTIVSSALKLMDEVVKERYNPAQWNIYAA 342 Y K + + G T +S E VK + +P + N Sbjct: 81 YDDAVDTVVPPQPVTDKPALKKSIRQVRAGGITNLSGGWLKGCEYVKHQLDPQKINRV-L 139 Query: 343 QASDGD-NWADDSPLCHEILA-KKLLPVVRYYSYIEITRRAHQTLWREYEHLQSTFDNFA 400 +DG N P + +K + + ++ L + F Sbjct: 140 LLTDGHANMGIQDPKILTATSTQKAEEGITTTTLGFAQG-FNEDLLIGMARAANGNFYFI 198 Query: 401 MQHIRDQDDIYPVFRELFHKQNATAK 426 Q I + +++ + + Sbjct: 199 -QSIDEAAEVFSIELDSLRSVVGQNL 223 >UniRef50_Q2BCF0 Possible D-amino acid dehydrogenase, large subunit n=1 Tax=Bacillus sp. NRRL B-14911 RepID=Q2BCF0_9BACI Length = 459 Score = 97.6 bits (241), Expect = 8e-19, Method: Composition-based stats. Identities = 45/358 (12%), Positives = 92/358 (25%), Gaps = 35/358 (9%) Query: 92 QGGGGGSGSGQGQASQDGEGQDEFVFQISKDEYLDLLFEDLALPNLKQNQQRQLTEYKTH 151 G G G+ D ++ + S DE + +D+ + + ++ E K Sbjct: 4 AGCSGEDEKASGEKKNDPPQEEAREQETSSDEKIPEAADDIE--GMVAQKHGKILEGKLE 61 Query: 152 RAGYTANGVPANISVVRSLQNSLARRTAMTAGKRRELHALEENLAIISNSEPAQLLEEER 211 A+ A + N + A + ++ L Sbjct: 62 PEVEIADLWDA---KKYTGFNEETLQPAAEKEMKAYFSEQKDLSGSQVYDYLVYQLGSGL 118 Query: 212 LRKEIAELRAKIERVPFIDTFDLRYKNYEKRPDPSSQAVMFCLMDVSGSM-----DQSTK 266 + EL + + +L E + + ++ + LMD SGSM + Sbjct: 119 YQSYYEELVS---FEHGHEMPELPDGEDEIQQAKNQKSNIVILMDASGSMKADVSGGNKM 175 Query: 267 DMAKRFYILLYLFLSRTYKNVEVVY---------------------IRHHTQAKEVDEHE 305 +AK L + Y K Sbjct: 176 MLAKETIKEFTSSLEDDASVSLMAYGHVGTGNDEDKAESCSRIDEVFPLGAYEKTAFNKS 235 Query: 306 FFYSQETGGTIVSSALKLMDEVVKERYNPAQWNIYAAQASDGDNWADDSPLCHEILAKKL 365 + +G T ++ A+ E++ YN + SDG D P+ + Sbjct: 236 MDSFEASGWTPLAGAIDKARELL-SAYNSTDYKNTLYIVSDGVETCDGDPVEAAQQLQGS 294 Query: 366 LPVVRYYSYIEITRRAHQTLWREYEHLQSTFDNFAMQHIRDQDDIYPVFRELFHKQNA 423 + Q +E +D + ++ + + Sbjct: 295 NIEAKVNIIGFDVDDEGQKQLKEVAEAGGGTYATVRDKDELEDQVLKKWKPSLGQIFS 352 >UniRef50_C1XUY0 Mg-chelatase subunit ChlD n=2 Tax=Meiothermus RepID=C1XUY0_9DEIN Length = 319 Score = 97.3 bits (240), Expect = 9e-19, Method: Composition-based stats. Identities = 23/203 (11%), Positives = 55/203 (27%), Gaps = 31/203 (15%) Query: 247 SQAVMFCLMDVSGSMDQ-----STKDMAKRFYILLYLFLSRTYKNVEVVYIRHHTQAKEV 301 + + +DVS SM S + A+ + L + + V + R T+ Sbjct: 84 PRTTIVLALDVSRSMRATDVLPSRFEAAREALKVFIRELPQGARIGLVTFSRAATEVVAP 143 Query: 302 D---EHEFFYSQETG---GTIVSSALKLMDEVV-----KERYNPAQWNIYAAQASDGDNW 350 + + G GT + + + + ++ +DG + Sbjct: 144 TTNRQRLLDSVELIGLEFGTAIGEGILTSLQALPPLEQRKDAKDPSELATIILLTDGRSI 203 Query: 351 ADDSPLCHEILAKKLLPVVRYYSYIEITRR--------------AHQTLWREYEHLQSTF 396 + PL +A + + +T + + ++ + Sbjct: 204 SGIDPLEAARIAAEQKVRIHTIGVGRVTEGPVPGLESVYQWAAYFDEDVLKQIAAITGGK 263 Query: 397 DNFAMQHIRDQDDIYPVFRELFH 419 F + Y + F Sbjct: 264 YFFV-NSAGKLRETYQQLSQSFV 285 >UniRef50_Q4RW93 Chromosome 9 SCAF14991, whole genome shotgun sequence n=1 Tax=Tetraodon nigroviridis RepID=Q4RW93_TETNG Length = 766 Score = 97.3 bits (240), Expect = 1e-18, Method: Composition-based stats. Identities = 29/246 (11%), Positives = 62/246 (25%), Gaps = 24/246 (9%) Query: 187 ELHALEENLAIISNSEPAQLLEEERLRKEIAELRAKIERVPFIDTFDLRYKNYEKRPDPS 246 ++ S + R+ E R + + + Sbjct: 136 NKVFVDNFERDPSLIWQYFGSAKGFFRQYPGIKWRPDENGVIAFDC--RNRKW-YIQAAT 192 Query: 247 SQAVMFCLMDVSGSMDQSTKDMAKRFYILLYLFLSRTYKNVEVVY--------------- 291 S + L+DVSGSM +A++ + L + Y Sbjct: 193 SPKDVVILVDVSGSMKGLRLTIARQTVSSILDTLGDDDFFNIIAYNEELHYVEPCLNGTL 252 Query: 292 IRHHTQAKEVDEHEFFYSQETGGTIVSSALKLMDEVVK---ERYNPAQWNIYAAQASDGD 348 ++ K+ G ++ AL +++ E + + +DG Sbjct: 253 VQADVTNKDHFREHLDKLFAQGIGMLDVALTEAFSLLRDFNETGRGSDCSQAIMLVTDG- 311 Query: 349 NWADDSPLCHEILAKKLLPVVRYYSYIEITRRAHQTLWREYEHLQSTFDNFAMQHIRDQD 408 A D+ VR + Y+ A + + Q+ Sbjct: 312 --AVDTYDTIFAKYNWPERKVRIFPYLIGRESAFAENLKWMACANKGYFTQISTLADVQE 369 Query: 409 DIYPVF 414 ++ Sbjct: 370 NVMEYL 375 >UniRef50_A0LPD4 von Willebrand factor, type A n=1 Tax=Syntrophobacter fumaroxidans MPOB RepID=A0LPD4_SYNFM Length = 479 Score = 97.3 bits (240), Expect = 1e-18, Method: Composition-based stats. Identities = 34/192 (17%), Positives = 57/192 (29%), Gaps = 21/192 (10%) Query: 246 SSQAVMFCLMDVSGSMDQS-TKDMAKRFYILLYLFLSRTYKNVEVVYIRH---------- 294 + M +MD SGSM + A++ + L LS T + V Y H Sbjct: 89 RRELDMVVVMDRSGSMADAGKLTHARQAVLNLLSRLSETDRFALVSYSDHVQRHGGLLPI 148 Query: 295 HTQAKEVDEHEFFYSQETGGTIVSSALKLMDEVVKERYNPAQWNIYAAQASDG-DNWADD 353 + E Q G T + L+ + E + + SDG N Sbjct: 149 TPANRATLERIVRGIQPGGATNLGGGLQEGISQLAELQQNGRLSR-LILISDGLANRGVT 207 Query: 354 SPLCHEILAKKLLPVVRYYSYIEITRRAHQTLWREYEHLQSTFDNFAMQHIRDQDDIYPV 413 P +A S + + ++ L + F + Sbjct: 208 DPSALGTMASVAAERGYAVSTVGVGLDFNEHLMTSIADKGAGNYTFM--------ESASA 259 Query: 414 FRELFHKQNATA 425 F ++F K+ A Sbjct: 260 FAQVFDKEFRDA 271 >UniRef50_C9LLI0 Magnesium-chelatase, subunit D/I family n=1 Tax=Dialister invisus DSM 15470 RepID=C9LLI0_9FIRM Length = 640 Score = 97.3 bits (240), Expect = 1e-18, Method: Composition-based stats. Identities = 47/360 (13%), Positives = 98/360 (27%), Gaps = 53/360 (14%) Query: 67 GRGGLRHRVHPGNDHFVQNDRIERPQGGGGGSGSGQGQASQDGEGQDEFVFQIS-KDEYL 125 +++ D E P G G + D D F DE + Sbjct: 296 DPDKNNTEKDQESENNDPGDDGEDPSGNHIVEAMG-NGGNNDESSSDMPEFPQGADDEKV 354 Query: 126 DLLFEDLALPNLKQNQQRQLTEYKTHRAGYTANGVPANISVVRSLQNSLARRTAMTAGKR 185 D + LP L +QN + T +GKR Sbjct: 355 DSADLHVTLPPLW-------------------------------IQNEKKQFTPKGSGKR 383 Query: 186 RELHALEENLAIISNSEPAQLLEEERLRKEIAELRAKIERVPFIDTFDLRYKN---YEKR 242 + E + P + + + A ++ + + ++ K Sbjct: 384 HITRSDERQGRYVKAGIPKGETHDIAIDATL-RAAAPHQKGRQSNGCAVVIRHEDIRRKE 442 Query: 243 PDPSSQAVMFCLMDVSGSMDQSTKDMAKR--FYILLYLFLSRTYKNVEVVY-------IR 293 + + + L+D SGSM + A + + +L + + + + + Sbjct: 443 REKRTGNIFLFLVDASGSMGARERMKAVKGVVFKMLADAYQKRDRVGMIAFRRDRAEVLL 502 Query: 294 HHTQAKEVDEHEFFYSQETGGTIVSSALKLMDEVVKERYNPAQWN-IYAAQASDGD---- 348 T++ E + + G T ++ L ++++ Y +DG Sbjct: 503 PITRSIEFAQKKLAALPTGGKTPLAQGLIKAEDMLDRLYKQDPLQDPVLILITDGRATNS 562 Query: 349 -NWADDSPLCHEILAKKLLPVVRYYSYIEITRRA-HQTLWREYEHLQSTFDNFAMQHIRD 406 N D A+++ + I+ L +E + D Sbjct: 563 LNKNTDPVRDALSEAERIGHRHMLAAVIDTESGFIKLGLAKELAQKMGASYFHVDKISED 622 >UniRef50_A3DLZ3 von Willebrand factor, type A n=1 Tax=Staphylothermus marinus F1 RepID=A3DLZ3_STAMF Length = 416 Score = 97.3 bits (240), Expect = 1e-18, Method: Composition-based stats. Identities = 26/189 (13%), Positives = 50/189 (26%), Gaps = 14/189 (7%) Query: 247 SQAVMFCLMDVSGSMDQSTKDMAKRFYILLYLFLSRTYKNVEVVYIRH--------HTQA 298 ++D S SMD AK+ + L L + Sbjct: 37 PPIAFLIVIDTSYSMDGEKIFRAKQAALRLLDILRDKDYVGVYGFAGKFYKVLEPVPATN 96 Query: 299 KEVDEHEFFYSQETGGTIVSSALKLMDEVVKERYNPAQWNIY-AAQASDGDN-WADDSPL 356 + E + GT + LK + E K+ ++ +DG+ P Sbjct: 97 RNEVEKAIIGLKLGSGTNIYDTLKKLVEETKKVLESGAISLVRIIFITDGEPTTGQKKPE 156 Query: 357 CHEILAKKLLP-VVRYYSYIEITRRAHQTLWREYEHLQSTFDNFAMQHIRDQDDIYPVFR 415 +AKKL T ++ L + + + I + Sbjct: 157 KILEMAKKLREAGASALIIGVGT-EYNEKLLSRMAMVLNGEFEHVSDPASLEKLISEYAK 215 Query: 416 ELFHKQNAT 424 ++ + Sbjct: 216 S--TQEISA 222 >UniRef50_C9LDM7 BatA protein n=10 Tax=Prevotella RepID=C9LDM7_9BACT Length = 334 Score = 97.3 bits (240), Expect = 1e-18, Method: Composition-based stats. Identities = 30/224 (13%), Positives = 55/224 (24%), Gaps = 45/224 (20%) Query: 240 EKRPDPSSQAVMFCLMDVSGSMDQST-----KDMAKRFYILLYLFLSRTYKNVEVVYIRH 294 + + +DVS SM + AK+ V+ Sbjct: 79 ALSNKETEGINIMMAIDVSTSMLTPDLPPSRIETAKQVAYEFINN-RPDDNIGLTVFGGE 137 Query: 295 H------TQAKEVDEHEFFY----SQETG----GTIVSSALKLMDEVVKERYNPAQWNIY 340 T + F Q+ G GT + L +++ + + Sbjct: 138 AYTQCPLTTDHSALLNMFKQVNCDLQKEGVISPGTAIGMGLSSAVSHLEQSKSK---SKV 194 Query: 341 AAQASDGDNW-ADDSPLCHEILAKKLLPVVRYYSYIE--------------------ITR 379 +DG+N + SPL +AK+L + S I + Sbjct: 195 IILLTDGENNAGEISPLTAAEMAKRLGIRIYTISVGTDAAVNQTVATLPNGETYEAAIKQ 254 Query: 380 RAHQTLWREYEHLQSTFDNFAMQHIRDQDDIYPVFRELFHKQNA 423 + + + DIY L + Sbjct: 255 NTDPKTLEAIANSTGGKF-YQARSKAKLRDIYQNIDRLEKTKLK 297 >UniRef50_A0CDA0 Chromosome undetermined scaffold_17, whole genome shotgun sequence n=2 Tax=Paramecium tetraurelia RepID=A0CDA0_PARTE Length = 508 Score = 96.9 bits (239), Expect = 1e-18, Method: Composition-based stats. Identities = 31/298 (10%), Positives = 83/298 (27%), Gaps = 33/298 (11%) Query: 136 NLKQNQQRQLTEYKTHRAGYTANGVPANISVVRSLQNSLARRTAMTAGKRRELHALEENL 195 + + ++ + + T Q + + + + K R ++++ Sbjct: 35 SFQPKKKNPIQKSNTMHNLIQQ--------FQTQTQENNDLKNSQSESKNRFSQRTQQSI 86 Query: 196 AIISNSEPAQLLEEERLRK-EIAELRAKIERVPFIDTFDLRYKNYEKRPDPSSQAVMFCL 254 S+ L ++ + + + L+ + +FCL Sbjct: 87 PSRSSINKQPLDDDIQFDMFSVNPGSNILNLTQHTIPIVLQLRTKTLEELDQIGVDLFCL 146 Query: 255 MDVSGSMDQSTKDMAKRFYILLYLFLSRTYKNVEVVYIRH----------HTQAKEVDEH 304 +D+ M D K+ + L + + + ++ +E Sbjct: 147 IDIGNGMQGQKIDYVKQILHSILTNLREQDRLCLISFNNDGKLLTGLQKVTSETQEYFAF 206 Query: 305 EFFYSQETGGTIVSSALKLMDEVVKERYNPAQWNIYAAQASDGDNWADDSPLCHEILAKK 364 Q G T + ++ +V+ +R N W SDG + + K+ Sbjct: 207 VIDGLQCNGTTELWKGTEVAFDVINQRKNKNNWAR-ILIFSDGQ-----DEIALTKIKKQ 260 Query: 365 LLPVVRYYS---YIEITRRAHQTLWREYEHLQSTFDNFAMQHIRDQDDIYPVFRELFH 419 L ++ + A +L+ + + ++ + F Sbjct: 261 LEYNYDIFTIDSFGFSNSNA-SKRLSSITNLRFGKHHII----NSEQQVFKCLEQTFA 313 >UniRef50_Q747C5 Conserved domain protein n=11 Tax=Deltaproteobacteria RepID=Q747C5_GEOSL Length = 447 Score = 96.9 bits (239), Expect = 1e-18, Method: Composition-based stats. Identities = 69/371 (18%), Positives = 134/371 (36%), Gaps = 51/371 (13%) Query: 23 LRRYKAQIKQSISEAINKRSVTDVDSGES---VSIPTEDISEPMFHQGRGGLRHRVHPGN 79 L R + + + + I + +G V +PT + + + G Sbjct: 72 LERDRLREEDGLPRKIRIGKLIKPGAGGKEKIVVVPTTVEEKLIHDRAPEETEEDESMGG 131 Query: 80 DHFVQ-----NDRIERPQGGGGGSGSGQGQASQDGEGQDEFVFQISKDEYLDLLFEDLAL 134 ++ RPQ GG +G G+ + + + +L E L Sbjct: 132 TGDGDEGEIIGEQPVRPQQEGGSGTAGHGEGEGH-------ELESTAYDLGRILTERFDL 184 Query: 135 PNLKQNQQRQLTEYKTHRAGYTANGVPANISVVRSLQNSLARRTAMTAGKRRELHALEEN 194 PNLK+ ++ S+ + R ++ Sbjct: 185 PNLKEKGKKS--------------------SLSHYSYDLTDRNRGFGQILEKKQTLRRIL 224 Query: 195 LAIISNSEPAQLLEEERLRKEIAELRAKIERVPFIDTFDLRYKNYEKRPDPSSQAVMFCL 254 I+ A + E + R I + +RV I + +L Y SQA++F + Sbjct: 225 ETNIALGTVADVAEIDPTRLVI----SPRDRVYRILSRELEY---------ESQALVFFI 271 Query: 255 MDVSGSMDQSTKDMAKRFYILLYLFLSRTY-KNVEVVYIRHHTQAKEVDEHE-FFYSQET 312 D SGSM+ + ++L+Y +L + + VE +I H A+EV + ++ + Sbjct: 272 RDYSGSMEGKATEAVCSQHVLIYSWLLYQFARQVETRFILHDNDAREVPDFYTYYNLRVA 331 Query: 313 GGTIVSSALKLMDEVVKERYNPAQWNIYAAQASDGDNWADDSPLCHEILAKKLLPVVRYY 372 GGT V++A ++++E+V++ +NIY +DGD+W + L +++L Sbjct: 332 GGTRVAAAYRMVNEIVEKESLARDYNIYVFHGTDGDDWDTNGEETIPEL-RRMLAYANRI 390 Query: 373 SYIEITRRAHQ 383 Sbjct: 391 GVTIAEHTYGS 401 >UniRef50_B5YMD8 Predicted protein n=1 Tax=Thalassiosira pseudonana CCMP1335 RepID=B5YMD8_THAPS Length = 868 Score = 96.9 bits (239), Expect = 1e-18, Method: Composition-based stats. Identities = 38/304 (12%), Positives = 80/304 (26%), Gaps = 25/304 (8%) Query: 134 LPNLKQNQQRQLTEYKTHRAGYTANGVPANISVVRSLQNSLARRTAMTAGKRRELH---A 190 P + + + + + A A + E A Sbjct: 18 SPGKVAKRSAEDDADNDDALALEQRSKRTRVDDTEEVLYFQEEQEAEGAVAQEEEAVDFA 77 Query: 191 LEENLAIISNSEPAQLLEEERLRKEIAELRAKIERVPFIDTFDLRYKNYEKRPD-PSSQA 249 +E + + + + + ++ ++ +R S Sbjct: 78 MEMLAREHEQQGIREETLQLSVAPHRESIGLQSGEFTGQICATIKARDLPQRDSFARSPI 137 Query: 250 VMFCLMDVSGSMDQSTKDMAKRFYILLYLFLSRTYKNVEVVYIRHHT----------QAK 299 + +DVSGSM D+ K LL L + + + + K Sbjct: 138 DIVVALDVSGSMRVEKLDLCKETLHLLLRELHHDDRFALISFSEDAVIEVPMQKVNERNK 197 Query: 300 EVDEHEFFYSQETGGTIVSSALKLMDEVVKERYNPAQWNIYAAQASDG-DNWADDSPLCH 358 + H G T ++SA+ L +VV P + +DG N + Sbjct: 198 QQALHAIDRLSVKGRTNIASAVSLAAQVVNGVAEPNKV-RSVFLLTDGNANTGYTEAIDL 256 Query: 359 EIL----AKKLL----PVVRYYSYIEITRRAHQTLWREYEHLQSTFDNFAMQHIRDQDDI 410 L + P + +++ Q L R S ++++ Sbjct: 257 VKLTSIFVEANRNPHTPPISLHTFGYG-PEPDQKLLRGMAMATSGGSFYSVRDNSQVSSA 315 Query: 411 YPVF 414 + Sbjct: 316 FGDA 319 >UniRef50_Q24FW2 von Willebrand factor type A domain containing protein n=1 Tax=Tetrahymena thermophila SB210 RepID=Q24FW2_TETTH Length = 1074 Score = 96.9 bits (239), Expect = 1e-18, Method: Composition-based stats. Identities = 27/188 (14%), Positives = 52/188 (27%), Gaps = 15/188 (7%) Query: 238 NYEKRPDPSSQAVMFCLMDVSGSMDQSTKDMAKRFYILLYLFLSRTYKNVEVVYIRH--- 294 N++ + + C+MD SGSM +M K + L L + V++ Sbjct: 353 NFDAKAYQRPPIDLICVMDNSGSMHGEKINMLKETLLYLIDQLDEKDRLGLVLFNSEVTF 412 Query: 295 ------HTQAKEVDEHEFFYSQETGGTIVSSALKLMDEVVKERYNPAQWNIY--AAQASD 346 T K + + GGT ++ + + +K R N SD Sbjct: 413 RPMKSMDTTNKLKLKQYISDIRAQGGTDINLGMTEAFKFIKTR---KYCNPVTSVFLLSD 469 Query: 347 GDNWADDSPLCHEILAKKLLPVVRYYSYIEITRRAHQTLWREYEHLQSTFDNFAMQHIRD 406 G + + + + + L + + + F Sbjct: 470 GLDSKAQDRVAVTLKNMSINEQFSINCFGFGR-DHDPILMNQIKKIDQVDMFFVDALGGL 528 Query: 407 QDDIYPVF 414 I Sbjct: 529 FSVIGQDV 536 >UniRef50_A8N264 Putative uncharacterized protein n=1 Tax=Coprinopsis cinerea okayama7#130 RepID=A8N264_COPC7 Length = 885 Score = 96.9 bits (239), Expect = 1e-18, Method: Composition-based stats. Identities = 35/313 (11%), Positives = 75/313 (23%), Gaps = 46/313 (14%) Query: 130 EDLALPNLKQNQQRQLTEYKTHRAGYTANGVPANISVVRSLQNSLARRTAMTAGKRRELH 189 +D P+L + + +T P++ + S T RR Sbjct: 160 QDAVQPSLAETRI-SITADIQMYGKIQRIVSPSHPDGITESPYS----TPQGRPSRR--- 211 Query: 190 ALEENLAIISNSEPAQLLEEERLRKEIAELRAKI-------ERVPFIDTF-DLRYKNYEK 241 + P L + L L + + + Sbjct: 212 -----RTTVRYRSPHYLDHDFVLGIHADGLDKPRCFAEVRRNPERGVPDTLAFQLTMVPR 266 Query: 242 RPDPSSQA-VMFCLMDVSGSMDQSTKDMAKRFYILLYLFLSRTYK-NVEVVYI------- 292 P ++ ++D SGSM+ AK +++ + + Sbjct: 267 TKLPPIRSQEYIFIVDCSGSMEGPRIQTAKDSLVMMLQMIPSHNSIFNIFAFGNECKSWV 326 Query: 293 ----RHHTQAKEVDEHEFFYSQET-GGTIVSSALKLMDEVVKERYNPAQWNIYAAQASDG 347 + + E Q GGT + +AL++ +DG Sbjct: 327 AHSQNYSGKTLEEAIRYTESMQADLGGTEMRNALRVAL-----SSCSKGLPTVVFLLTDG 381 Query: 348 DNWADDS-PLCHEILAKKLLPVVRYYSYIEITRRAHQTLWREYEHLQSTFDNFAMQHIRD 406 + D + R ++ + L + + + Sbjct: 382 GCFDVDGCKQEVKGFVDGCTAPKRIFTLGIGESAS-SDLCESIARVGNGESFMVIDTSS- 439 Query: 407 QDDIYPVFRELFH 419 + +LF Sbjct: 440 ---VVQKCAKLFT 449 >UniRef50_UPI000058940A PREDICTED: similar to inter-alpha (globulin) inhibitor H3 n=1 Tax=Strongylocentrotus purpuratus RepID=UPI000058940A Length = 964 Score = 96.9 bits (239), Expect = 1e-18, Method: Composition-based stats. Identities = 29/276 (10%), Positives = 71/276 (25%), Gaps = 31/276 (11%) Query: 169 SLQNSLARRTAMTAGKRRELHALEENLAIISNSEPAQLLEEERLRKEIAELRAKIERVPF 228 SL + + RR+ A E+ + + + A + Sbjct: 277 SLAHRVQRRSDNRWEVHYSPRAREQLGVSPMGIMADYTVRYDVVHGNDAGDIQVLN---- 332 Query: 229 IDTFDLRYKNYEKRPDPSSQAVMFCLMDVSGSMDQSTKDMAKRFYILLYLFLSRTYKNVE 288 + ++Y + + + ++D+SGSM + K + +S T K Sbjct: 333 --DYFVQYFSPS--GLSVLRKNIIFVIDISGSMSGTKLAQVKDALSTILDDMSETDKFNI 388 Query: 289 ------VVYIRHHTQAKEVDEHEFF------YSQETGGTIVSSALKLMDEVVKERYNPA- 335 V ++ E+ QE T + A+ +++ Sbjct: 389 LPFSDDVHFLESTGMLYSTKENVRRAKRFVMGLQEMDNTNLHKAIISGVNMLRAESEQDP 448 Query: 336 ---QWNIYAAQASDGDNWADDSPLCHEI--LAKKLLPVVRYYSYIEITRRAHQTLWREYE 390 + +DG+ + + + + + A R Sbjct: 449 QEEEIVSMLIVLTDGNPNHGEIDKTIIERNVHEAINGDFSLFCIGFGA-DADYPFLRRLS 507 Query: 391 HLQSTFDNFAMQHIRDQDDIYPVFRELFHKQNATAK 426 + I ++ D +++ Sbjct: 508 LQNHG----VARRIPERADAGEHLENFYYEVATPLL 539 >UniRef50_B9L896 von Willebrand factor, type A n=1 Tax=Nautilia profundicola AmH RepID=B9L896_NAUPA Length = 288 Score = 96.9 bits (239), Expect = 1e-18, Method: Composition-based stats. Identities = 31/183 (16%), Positives = 52/183 (28%), Gaps = 20/183 (10%) Query: 246 SSQAVMFCLMDVSGSMDQ-STKDMAKRFYILLYLFLSRTYKNVEVVY------IRHHTQA 298 + +D SGSM + + D AK + + VV+ T Sbjct: 73 KKGYNIVIDLDTSGSMAEFNKIDAAKAVSLDFAKK-RKNDALGLVVFGNIAYIASPLTFD 131 Query: 299 KEVDEHEFFYSQ---ETGGTIVSSALKLMDEVVKERYNPAQWNIYAAQASDG-DNWADDS 354 K+ E G T + AL L + K +DG DN + Sbjct: 132 KKTFEDILKRIYVSIAGGKTAIYDALFLSSNLFKNANGE----KIIILLTDGMDNMSITP 187 Query: 355 PLCHEILAKKLLPVVRYYSYIEITRRAHQTLWREYEHLQSTFDNFAMQHIRDQDDIYPVF 414 KK V + A ++ ++ + + + D IY Sbjct: 188 LDVVIKKLKKEHIKVYSIAIG---GDADLSVLKKISKETNGKF-YIASSLEDLKKIYSDI 243 Query: 415 REL 417 +L Sbjct: 244 NKL 246 >UniRef50_UPI000186D9CC conserved hypothetical protein n=1 Tax=Pediculus humanus corporis RepID=UPI000186D9CC Length = 1180 Score = 96.5 bits (238), Expect = 2e-18, Method: Composition-based stats. Identities = 27/221 (12%), Positives = 65/221 (29%), Gaps = 26/221 (11%) Query: 216 IAELRAKIERVPFIDTFDLRYKNYEKRPDPSSQAVMFCLMDVSGSMDQSTKDMAKRFYIL 275 + + A + +D +D R +++ +S + L+D SGSM K++A+ Sbjct: 178 MRQFPAIQWKQDPVDLYDCRTRSW-FIEAATSPKDIVILVDGSGSMTGIRKEIARHVVNN 236 Query: 276 LYLFLSRTYKNVEVVYIRHHTQAKEV---------------DEHEFFYSQETGGTIVSSA 320 + L + + T+ + + + + S A Sbjct: 237 ILDTLGNNDFVNILSFNETTTEVEPCFKDILVQANLANIRNFKEKMEDITTSNIANFSFA 296 Query: 321 LKLMDEVVKER-------YNPAQWNIYAAQASDGDNWADDSPLCHEILAKKLLPVVRYYS 373 L ++++ Y+ A N +DG + + VR ++ Sbjct: 297 LSKAFHLLQKYRENGSDDYSGAHCNQAIMLITDGVPY---NFKEIFAEFNWPNMPVRVFT 353 Query: 374 YIEITRRAHQTLWREYEHLQSTFDNFAMQHIRDQDDIYPVF 414 Y+ A + + ++ + Sbjct: 354 YLVGREVADVREIKWMACANRGYYVHLSTLAEVREQVLQYI 394 >UniRef50_A1ZRP2 Von Willebrand factor type A domain protein n=1 Tax=Microscilla marina ATCC 23134 RepID=A1ZRP2_9SPHI Length = 1088 Score = 96.5 bits (238), Expect = 2e-18, Method: Composition-based stats. Identities = 23/158 (14%), Positives = 45/158 (28%), Gaps = 12/158 (7%) Query: 250 VMFCLMDVSGSMDQ-STKDMAKRFYILLYLFLSRTYKNVEVVYIR--------HHTQAKE 300 + L+DVSGSM + K + L + V+Y +E Sbjct: 913 NLMLLLDVSGSMSSKDKLPLLKESFKYLISIMRPQDDVSIVIYAGDAAIVLKPTSASNQE 972 Query: 301 VDEHEFFYSQETGGTIVSSALKLMDEVVKERYNPAQWNIYAAQASDGDNWADDSPLCHEI 360 + G T V + KL + + + + N A+DG+ Sbjct: 973 QINAVIDKLRSRGKTNVKAGFKLAYKWMSKNFKEGGNNR-IILATDGEFPISKYIYKLVE 1031 Query: 361 LAKKLLPVVRYYSYIEITRRAHQTLWREYEHLQSTFDN 398 + +S+ +T++ + Sbjct: 1032 KRATKGINLSVFSFGSMTKKF--ETLEKLVAKGKGNYE 1067 Score = 83.0 bits (203), Expect = 2e-14, Method: Composition-based stats. Identities = 26/133 (19%), Positives = 42/133 (31%), Gaps = 10/133 (7%) Query: 250 VMFCLMDVSGSMDQSTKDMAKRFYILLYLFLSRTYKNVEVVYIRH--------HTQAKEV 301 + L+DVSGSM M K L + K VV+ + K Sbjct: 686 NLMLLLDVSGSMKNE-LPMLKSALKYLVNIMRPEDKVSVVVFGSEAKLMLRPTSAKYKAQ 744 Query: 302 DEHEFFYSQETGGTIVSSALKLMDEVVKERYNPAQWNIYAAQASDGDNWADDSPLCHEIL 361 + +G T + LKL + ++ Y N ASDG+ Sbjct: 745 IMQAIDTLKSSGRTNGEAGLKLAYQWIQNNYKNNNNNR-IILASDGEFSISKGLYQMIEQ 803 Query: 362 AKKLLPVVRYYSY 374 + + +S+ Sbjct: 804 KAEESIALSVFSF 816 >UniRef50_Q7UNJ0 Putative uncharacterized protein n=1 Tax=Rhodopirellula baltica RepID=Q7UNJ0_RHOBA Length = 327 Score = 96.5 bits (238), Expect = 2e-18, Method: Composition-based stats. Identities = 22/184 (11%), Positives = 51/184 (27%), Gaps = 13/184 (7%) Query: 250 VMFCLMDVSGSMDQSTKDMAKRFYILLYLFLSRTYKNVEVVYIRHH---------TQAKE 300 + ++D SGSM S + + + L+ T + ++ ++ T+ Sbjct: 148 DITLVVDRSGSMAGSRFNDLQAAIRIFTDLLATTPVDEQIGLASYNDRASEDVQLTENFA 207 Query: 301 VDEHEFFYSQETGGTIVSSALKLMDEVVKERYNPAQWNIYAAQASDGDNWADDSPLCHEI 360 + + G T +S ++ E+ P +DG + P Sbjct: 208 EVNNAMDRLRTGGFTSISRGMQAGQEIALRGRPPEFVERTMIVMTDGRHNRGPEPRVVAT 267 Query: 361 LAKKLLPVVRYYSYIEITRRAHQTLWREYEHLQSTFDNFAMQHIRDQDDIYPVFRELFHK 420 + ++ A ++ + +F + DIY Sbjct: 268 DLAADGVTIHTITFGAG---ADFGRMQDVARIGGGR-HFHATNGDQLRDIYREIALTLGT 323 Query: 421 QNAT 424 Sbjct: 324 VLTE 327 >UniRef50_Q55G98 von Willebrand factor A domain-containing protein DDB_G0267758 n=1 Tax=Dictyostelium discoideum RepID=Y7758_DICDI Length = 878 Score = 96.5 bits (238), Expect = 2e-18, Method: Composition-based stats. Identities = 24/192 (12%), Positives = 55/192 (28%), Gaps = 22/192 (11%) Query: 236 YKNYEKRPDPSSQAVMFCLMDVSGSMDQSTKDMAKRFYILLYLFLSRTY-KNVEVVYIR- 293 +K+ + + ++ L+D SGSM KR ++ L+ + V + Sbjct: 303 FKDIKIEDM-NQKSEFIFLIDCSGSMVGEPMRKVKRAMEIIIRSLNENQHRVNVVCFGSS 361 Query: 294 ----------HHTQAKEVDEHEFFYSQET-GGTIVSSALKLMDEVVKERYNPAQWNIYAA 342 ++ + E + GGT + + +K + ++ Sbjct: 362 FKKVFKVSRDYNDETLECLSKYIQSIEANLGGTELLTPIKNIL----SSPPNPEYPRQLF 417 Query: 343 QASDGDNWADDSPLCHEILAKKLLPVVRYYSYIEITRRAHQTLWREYEHLQSTFDNFAMQ 402 +DG+ D K R ++Y L + F Sbjct: 418 ILTDGEAPHRD---KIIHYLSKESNTTRIFTYGIGDS-VDIDLIIGLSNACKGHYEFITD 473 Query: 403 HIRDQDDIYPVF 414 + + + + Sbjct: 474 NDNFEKQVMKLL 485 >UniRef50_A0LHW4 Vault protein inter-alpha-trypsin domain protein n=3 Tax=Deltaproteobacteria RepID=A0LHW4_SYNFM Length = 812 Score = 96.5 bits (238), Expect = 2e-18, Method: Composition-based stats. Identities = 45/412 (10%), Positives = 89/412 (21%), Gaps = 45/412 (10%) Query: 41 RSVTDVDSGESVSIPTEDISEPMFHQGRGGLRHRVHPGNDHFVQNDRIERPQGGGGGSGS 100 ++ +V+ E + + +RP + Sbjct: 122 SAMKMTIGERTVTAVIRKREEARRDYEQAKSQ-------GKSASLLEQQRPNVFQMNVAN 174 Query: 101 GQGQASQDGEGQDEFVFQISKDEYLDLLFEDLALPNLKQNQQRQLTEYKTHRAGYTANGV 160 + + + ++ + P + N Sbjct: 175 I-MPGDEIKTELTYNELLSPTEGVYEFVYPTVVGPRYSNQP----AAGAPASEKWVQNPY 229 Query: 161 PANISVVRSLQNSLARRTAMTAGKRRELHALEENLAII-SNSEPAQLLEEERLRKEIAEL 219 N AR A + + E + L E+ + Sbjct: 230 LHEKEPPTYSFNITARLNAGLPIREITCPSHETAIRYEGQTRASVDLGANEKFGGNRDFV 289 Query: 220 RAKIERVPFIDTFDLRYK-------------NYEKRPDPSSQAVMFCLMDVSGSMDQSTK 266 ID+ L Y+ ++DVSGSM Sbjct: 290 LKYRLSGESIDSGLLLYRGKDENFFLLTVQPPKRVVEAAIPAREYIFIVDVSGSMHGFPL 349 Query: 267 DMAKRFYILLYLFLSRTYKNVEVVYIRHHTQAKE-----------VDEHEFFYSQETGGT 315 +++KR L L T +++ T E Q GGT Sbjct: 350 EISKRLLTDLIGGLKPTDCFNVMLFSGDSTVMAERSVPASADNVRRAVEMIGRRQGGGGT 409 Query: 316 IVSSALKLMDEVVKERYNPAQWNIYAAQASDGDNWADDSPLCHEILAKKLLPVVRYYSYI 375 + ALK + + A+DG L + + ++ + Sbjct: 410 ELLPALKKALSL----PRKEGVSRSMVIATDG---FVTVEEEAFELIRSHIGDANFFPFG 462 Query: 376 EITRRAHQTLWREYEHLQSTFDNFAMQHIRDQDDIYPVFRELFHKQNATAKG 427 T ++ L + + R + K Sbjct: 463 IGTS-VNRMLIEGMARAGAGEPFVITRPDEAPAGAEKFRRYIQSPLLTNVKA 513 >UniRef50_C2MDE3 BatA protein n=6 Tax=Bacteroidales RepID=C2MDE3_9PORP Length = 326 Score = 96.5 bits (238), Expect = 2e-18, Method: Composition-based stats. Identities = 27/216 (12%), Positives = 52/216 (24%), Gaps = 38/216 (17%) Query: 242 RPDPSSQAVMFCLMDVSGSMDQ-----STKDMAKRFYILLYLFLSRTYKNVEVVYIRHH- 295 + MD+SGSM + + A+ + VV+ Sbjct: 80 EERSIQGIDLVLAMDLSGSMQALDLKPNRFEAARDVASEMI-AARPNDNIGLVVFAGESF 138 Query: 296 -----TQAKEVDEHEFFYSQET---GGTIVSSALKLMDEVVKERYNPAQWNIYAAQASDG 347 T +V ++ GT + L ++ N + +DG Sbjct: 139 TLCPLTVDHDVILQMLDATEIGQLEDGTAIGLGLATAINTLRGSDNK---SKVIILLTDG 195 Query: 348 DNWADD-SPLCHEILAKKLLPVVRYYSYIEITRR------------------AHQTLWRE 388 N A D +P LA++ + + + R Sbjct: 196 SNNAGDITPSMAAELAQQYGIRIYTVAAGTNGVAKFPVQTASGIEYVEADVQIDEGTLRH 255 Query: 389 YEHLQSTFDNFAMQHIRDQDDIYPVFRELFHKQNAT 424 + +IY L + + Sbjct: 256 IAQQTGGKY-YRATDETKLHEIYKEIDSLEKSRLTS 290 >UniRef50_Q2QSE5 Os12g0431700 protein n=3 Tax=Oryza sativa RepID=Q2QSE5_ORYSJ Length = 524 Score = 96.5 bits (238), Expect = 2e-18, Method: Composition-based stats. Identities = 23/168 (13%), Positives = 45/168 (26%), Gaps = 16/168 (9%) Query: 246 SSQAVMFCLMDVSGSMDQSTKDMAKRFYILLYLFLSRTYKNVEVVYIRHHTQAKEVDEHE 305 + ++DVSGSM + K+ + + L+ + V + + ++ Sbjct: 58 REGLDLVAVVDVSGSMRGHKIESVKKALQFVIMKLTPVDRLSIVTFESSAKRLTKLRAMT 117 Query: 306 FF----------YSQETGGTIVSSALKLMDEVVKERYNPAQWNIYAAQASDGDNWADDSP 355 GGT + + L L V+ +R SDG S Sbjct: 118 QDFRGELDGIVKSLIANGGTDIKAGLDLGLAVLADRVFTESRTANIFLMSDGKLEGKTSG 177 Query: 356 LCHEILAKKLLPVVRYYSYIEITRRAHQTLWREYEHLQ-STFDNFAMQ 402 + V Y++ L + + Sbjct: 178 DPT----QVNPGEVSVYTFGFGHGT-DHQLLTDIAKNSPGGTYSTVPD 220 >UniRef50_B2SKX9 von Willebrand factor type A domain protein n=13 Tax=Xanthomonadaceae RepID=B2SKX9_XANOP Length = 335 Score = 96.5 bits (238), Expect = 2e-18, Method: Composition-based stats. Identities = 36/266 (13%), Positives = 66/266 (24%), Gaps = 41/266 (15%) Query: 188 LHALEENLAIISNSEPAQLLEEERLRKEIAELRAKIERVPFIDTFDLRYKNY--EKRPDP 245 + + +E Q + + + + R F+ L E P Sbjct: 36 QRRADAAALRVPYAEQLQAVAQAKTTPSLRMPRWLAWLGWFLLCAALARPQQLGEVIQPP 95 Query: 246 SSQAVMFCLMDVSGSMDQSTK----------DMAKRFYILLYLFLSRTYKNVEVVYIRHH 295 M +D+SGSM++ AK + +V+ + Sbjct: 96 REARQMMLAVDLSGSMNEPDMVLGGKVVDRLTAAKAVLSDFLDR-RDGDRVGLLVFGQRA 154 Query: 296 ---TQAKEVDEHEFFYSQ------ETGGTIVSSALKLMDEVVKERYNPAQWNIYAAQASD 346 T + T + A+ L + ++E Q +D Sbjct: 155 YALTPLTADLTSVRDQLRDSVVGLAGRETAIGDAIALSVKRLRE---QKQGQRVVVLLTD 211 Query: 347 GDNWADD-SPLCHEILAKKLLPVVRYYSY--------------IEITRRAHQTLWREYEH 391 G N A PL LAK + ++ + R+ Sbjct: 212 GVNTAGVLDPLKAAELAKAEGVRIYTIAFGGGGGYSLFGVPIPAGGNDDIDEDGLRKIAQ 271 Query: 392 LQSTFDNFAMQHIRDQDDIYPVFREL 417 F + + IY L Sbjct: 272 QTGGRF-FRARDTEELAGIYAELDRL 296 >UniRef50_A6GBY0 Putative uncharacterized protein n=1 Tax=Plesiocystis pacifica SIR-1 RepID=A6GBY0_9DELT Length = 996 Score = 96.1 bits (237), Expect = 2e-18, Method: Composition-based stats. Identities = 28/284 (9%), Positives = 62/284 (21%), Gaps = 25/284 (8%) Query: 143 RQLTEYKTHRAGYTANGVPANISVVRSLQNSLARRTAMTAGKRRELHALEENLAIISNSE 202 R L+ +PA+ +R + + A+ + + Sbjct: 431 RALSANHLSVDTVAPTELPASAEGLRKYDLVIFSDIPSRWVTPAQEAAVVRYVKDLGGGF 490 Query: 203 PAQLLEEERLRKEIAELRAKIERVPFIDTFDLRYKNYEKRPDPSSQAVMFCLMDVSGSMD 262 E L + +R + ++D SGSM Sbjct: 491 IMVGGENS---------FGVGGWGGSTIEQVLPVRFSGERQREQPTLALILVIDKSGSMS 541 Query: 263 -QSTKDMAKRFYILLYLFLSRTYKNVEVVYIR--------HHTQAKEVDEHEFFYSQETG 313 D+ K L + + + + + G Sbjct: 542 SGDRLDLVKEAARATARTLDPSDEIGVIAFDNSPQVLVRLQPAANRLRISSSIRRLSAGG 601 Query: 314 GTIVSSALKLMDEVVKERYNPAQWNIYAAQASDGDNWADDSPLCHEILAKKLLPVVRYYS 373 GT AL+ + + SDG++ ++ ++ V Sbjct: 602 GTNAMPALREAYLQLAGS---KALVKHVILLSDGES-PENGINALLGDMRQSDITVSSVG 657 Query: 374 YIEITRRAHQTLWREYEHLQSTFDNFAMQHIRDQDDIYPVFREL 417 + + ++ RE+ Sbjct: 658 VGDGAG---KDFLIRVAERGRGRYFYSEDGTDVPRIFSREAREV 698 >UniRef50_O00534 von Willebrand factor A domain-containing protein 5A n=8 Tax=Theria RepID=VMA5A_HUMAN Length = 786 Score = 96.1 bits (237), Expect = 2e-18, Method: Composition-based stats. Identities = 30/211 (14%), Positives = 50/211 (23%), Gaps = 29/211 (13%) Query: 236 YKNYEKRPDPSSQAVMFCLMDVSGSMDQS---------TKDMAKRFYILLYLFLSRTYKN 286 Y N + ++ LMD SGSM AK ILL L Sbjct: 267 YPNIPEDQPSNTCGEFIFLMDRSGSMQSPMSSQDTSQLRIQAAKETLILLLKSLPIGCYF 326 Query: 287 VEVVYIR-----HHTQAKEVDEHEFFYS------QET-GGTIVSSALKLMDEVVKERYNP 334 + K + Q GGT + + L+ + + P Sbjct: 327 NIYGFGSSYEACFPESVKYTQQTMEEALGRVKLMQADLGGTEILAPLQ---NIYRGPSIP 383 Query: 335 AQWNIYAAQASDGDNWADDSPLCHEILAKKLLPVVRYYSYIEITRRAHQTLWREYEHLQS 394 + +DG+ + R +S+ + +L + Sbjct: 384 -GHPLQLFVFTDGE---VTDTFSVIKEVRINRQKHRCFSFGIGEGTS-TSLIKGIARASG 438 Query: 395 TFDNFAMQHIRDQDDIYPVFRELFHKQNATA 425 F R Q + Sbjct: 439 GTSEFITGKDRMQSKALRTLKRSLQPVVEDV 469 >UniRef50_A6QBT6 von Willebrand factor type A domain protein n=1 Tax=Sulfurovum sp. NBC37-1 RepID=A6QBT6_SULNB Length = 305 Score = 96.1 bits (237), Expect = 2e-18, Method: Composition-based stats. Identities = 22/202 (10%), Positives = 47/202 (23%), Gaps = 26/202 (12%) Query: 245 PSSQAVMFCLMDVSGSMDQS----------TKDMAKRFYILLYLFLSRTYKNVEVVYI-- 292 + ++D S SM Q D+ K + + V + Sbjct: 79 KKEGRDIVLVIDSSDSMRQMGFDPKDPYKNKFDVVKEVVADFIKK-RKNDRIGMVTFADV 137 Query: 293 -------RHHTQAKEVDEHEFFYSQETGGTIVSSALKLMDEVVKERYNPAQWNIYAAQAS 345 T ++ AL ++ + + + Sbjct: 138 AFIASPLTFEKDFLTNITEMQKLGMAGKRTAINDALVQAYNLMSKSKAK---SKIIILLT 194 Query: 346 DG-DNWADDSPLCHEILAKKLLPVVRYYSYIEITRRAHQTLWREYEHLQSTFDNFAMQHI 404 DG DN + + + +K + R + +A + Sbjct: 195 DGRDNMSKIPLSDVKHMIEKRDVKLYTIGIG-GPRDYDAQYLKTLAKAGKGQA-YAARSA 252 Query: 405 RDQDDIYPVFRELFHKQNATAK 426 IY +L + + K Sbjct: 253 AMLSKIYDEINKLEVTKLDSKK 274 >UniRef50_A9AX98 von Willebrand factor type A n=1 Tax=Herpetosiphon aurantiacus ATCC 23779 RepID=A9AX98_HERA2 Length = 828 Score = 96.1 bits (237), Expect = 2e-18, Method: Composition-based stats. Identities = 33/299 (11%), Positives = 74/299 (24%), Gaps = 30/299 (10%) Query: 132 LALPNLKQNQQRQLTEYKTHRAGYTANGVPANISVVRSLQNSLARRTAMTAGKRRELHAL 191 P+ N + L +P ++S +R + + + + + AL Sbjct: 277 ERSPDSSANLRDALEAANLVTEALRPAALPTSLSQLRVYDSIVLQDISANDLSLDQQLAL 336 Query: 192 EENLAIISNSEPAQLLEEERLRKEIAELRAKIERVPFIDTFDLRYKNYEKRPDPSSQAVM 251 E + + + A + L + Sbjct: 337 REFVRSLGHGVVVLGGTNSYNLGSYAGTPLEE---------LLPVSMEPPPRRERPTVTL 387 Query: 252 FCLMDVSGSMDQ----STKDMAKRFYILLYLFLSRTYKNVEVVYIRHHTQAKEVDE---- 303 ++D S SM +AK I L + + + + Sbjct: 388 LLILDRSASMLGESGKDKFSLAKAAAIAATDSLGADDTIGVLAFDDTNDWTVTFTKVGQG 447 Query: 304 -------HEFFYSQETGGTIVSSALKLMDEVVKERYNPAQWNIYAAQASDGDNWADDSPL 356 + GGT + +AL++ + ++ +A +DG + + S Sbjct: 448 VQLSEIQNNIAGLSAGGGTDIYAALEVGMGGLAQQTGK---VRHAVLLTDGRSGGESSYE 504 Query: 357 CHEILAKKLLPVVRYYSYIEITRRAHQTLWREYEHLQSTFDNFAMQHIRDQDDIYPVFR 415 + + + A L L + +FA + Sbjct: 505 SLIAPLRAQGITLSTIAIG---GDADTVLLESLAKLGAGRYHFASRPDDLPRLTLQEAE 560 Score = 41.8 bits (96), Expect = 0.053, Method: Composition-based stats. Identities = 14/136 (10%), Positives = 39/136 (28%), Gaps = 16/136 (11%) Query: 240 EKRPDPSSQAVMFCLMDVSGSMDQSTKDMA-KRFYILLYLFLSRTYKNVEVVYIRHHTQA 298 + + + L+D S ++ +D A + + V + Sbjct: 45 PRNQANQQASPLILLVDQSANLPSELRDAAWNEAVRFYQQQIEQRP----VRLLAFGADV 100 Query: 299 KEVDEHEFFYSQETGGTIVSSALKLMDEVVKERYNPAQWNIYAAQASDGDNWADDSPLCH 358 + + G ++ AL+ ++ + + SDG + + Sbjct: 101 RVSQTDQRPAIDPNGS-DLAGALQFASGLLPQGGD-------IILLSDGASTTTNGQNQV 152 Query: 359 EILAKKLLPVVRYYSY 374 A++ +R + Sbjct: 153 STFAQR---SIRLHGV 165 >UniRef50_D1BQE7 von Willebrand factor type A n=1 Tax=Veillonella parvula DSM 2008 RepID=D1BQE7_VEIPT Length = 671 Score = 96.1 bits (237), Expect = 2e-18, Method: Composition-based stats. Identities = 44/372 (11%), Positives = 93/372 (25%), Gaps = 38/372 (10%) Query: 68 RGGLRHRVHPGNDHFVQNDRIERPQGGGG----GSGSGQGQASQDGEGQDEFVFQISKDE 123 + + P + + D S + S D + E Sbjct: 297 NDSMGNEAAPKDSNANDGDTHSAMNSAEDASSQQDDSSEADGSNQSNDLDATQKESCDSE 356 Query: 124 YLDLLFED------LALPN---------LKQNQQRQLTEYKTHRAGYTANGVPANISVVR 168 +++ +D L LP+ + + T + +R G + Sbjct: 357 AMNVGGDDSQRLSSLCLPDTVARIANQLFQWKLESSKTVDRQYRKGSGRRLMTKTKDTRG 416 Query: 169 SLQNSLARRTAMTAGKRRELHALEENLAIISNSEPAQLLEEERLRKEIAELRAKIERVPF 228 + + A+ +L ++ A + E+ + + K + Sbjct: 417 RMIRAYQDEHAL-----EDLALVDTLRAAAPYQRLRAATKTEQEKLSTQSQQLKHQGGKG 471 Query: 229 IDTFDLRYKNYEKRPDPSSQAVMFCLMDVSGSMDQSTKDMAKRF--YILLYLFLSRTYKN 286 + K + A ++D SGSM + A + LL Sbjct: 472 LAIVIKPQDYRRKAREKRIGAYQLFVVDASGSMAARHRMEATKAAILSLLRDSYIHRDSV 531 Query: 287 VEVVYIRHH-------TQAKEVDEHEFFYSQETGGTIVSSALKLMDEVVKERYNPAQWNI 339 + + + T++ E E G T ++ L++ + + Sbjct: 532 GLIAFRKESAEVLLPFTRSVERAERLLTSMPTGGKTPLAHGLRMAYTLCDRLLRAHRAER 591 Query: 340 Y-AAQASDGDNWADDSPLCHEILAKKLLP----VVRYYSYIEITRRAHQTLWREYEHLQS 394 +DG + DS + V T L +E L + Sbjct: 592 IQIICITDGRATSGDSEDPVAESKQWARILGTLPVDCIVIDTETGFIKLGLAKELCKLMN 651 Query: 395 TFDNFAMQHIRD 406 D Sbjct: 652 GSYYAMDTITAD 663 >UniRef50_B4UFP8 von Willebrand factor type A n=1 Tax=Anaeromyxobacter sp. K RepID=B4UFP8_ANASK Length = 480 Score = 96.1 bits (237), Expect = 2e-18, Method: Composition-based stats. Identities = 30/191 (15%), Positives = 47/191 (24%), Gaps = 13/191 (6%) Query: 242 RPDPSSQAVMFCLMDVSGSMDQSTKDMAKRFYILLYLFLSRTYKNVEVVYIRHHTQAKEV 301 S + ++DVSGSM A + + L L+ VV+ Sbjct: 33 ARAERSPVCVIPVLDVSGSMHGEKLHFATQSIMKLVDHLAPGDFCGVVVFSTEVETLAAP 92 Query: 302 DEHEFF----------YSQETGGTIVSSALKLMDEVVKERYNPAQWNIYAAQASDGDNW- 350 E + T ++ L + K P + +DG Sbjct: 93 TEMTQDRKDALKVALGRLRPRHNTNLAGGLLAGLDHAKVTKVPDGMPVRVILFTDGLANE 152 Query: 351 -ADDSPLCHEILAKKLLPVVRYYSYIEITRRAHQTLWREYEHLQSTFDNFAMQHIRDQDD 409 SP L + L ++ A Q L RE L + Sbjct: 153 GPATSPEGLCALLEANLGTASVSAFGYG-DDADQELLRELSTLGRGNYAYVRSPEDALTA 211 Query: 410 IYPVFRELFHK 420 L Sbjct: 212 FARELGGLLST 222 >UniRef50_Q23J98 von Willebrand factor type A domain containing protein n=4 Tax=Tetrahymena thermophila SB210 RepID=Q23J98_TETTH Length = 1633 Score = 96.1 bits (237), Expect = 2e-18, Method: Composition-based stats. Identities = 34/201 (16%), Positives = 60/201 (29%), Gaps = 20/201 (9%) Query: 226 VPFIDTFDLRYKNYEKRPDPSSQAVMFCLMDVSGSMDQSTKDMAKRFYILLYLFLSRTYK 285 F D + + SS++ ++D SGSM A IL L Sbjct: 1023 DIFSDEYQQKLNQELIDHLNSSRSEFIFILDRSGSMRGQPIRRACEAIILFLKSLPNDSY 1082 Query: 286 NVEVVYIR-----------HHTQAKEVDEHEFFYSQET-GGTIVSSALKLMDEVVKERYN 333 + + + +++ E GGT + + L + + Sbjct: 1083 FNVISFGSSFEKLFPFSTKYTSESLEKAVQIINNYDSDLGGTEIYNPLHNVFIM----KR 1138 Query: 334 PAQWNIYAAQASDGDNWADDSPLCHEILAKKLLPVVRYYSYIEITRRAHQTLWREYEHLQ 393 + +N +DG+ DS L KK R +S +A Q L +E Sbjct: 1139 ISGYNRQIFLLTDGE---VDSSEQVIELIKKNNKYNRVHSIGFGF-KADQYLIKESAIAG 1194 Query: 394 STFDNFAMQHIRDQDDIYPVF 414 Q+ + I + Sbjct: 1195 KGISKIVDQNCDLSEVIINML 1215 >UniRef50_C0YPQ5 von Willebrand factor(VWA) type A domain-containing protein n=1 Tax=Chryseobacterium gleum ATCC 35910 RepID=C0YPQ5_9FLAO Length = 330 Score = 95.7 bits (236), Expect = 3e-18, Method: Composition-based stats. Identities = 31/231 (13%), Positives = 52/231 (22%), Gaps = 41/231 (17%) Query: 229 IDTFDLRYKNYEKRPDPSSQAVMFCLMDVSGSM-----DQSTKDMAKRFYILLYLFLSRT 283 I R + D + + +DVS SM + K + Sbjct: 68 IAMARPRTFTISQDRDDTKGVDIMLSIDVSLSMLAKDLNPDRITALKDIAVKFVQK-RPN 126 Query: 284 YKNVEVVYIRHH------TQAKEVDEHEFFYSQETG---GTIVSSALKLMDEVVKERYNP 334 + V Y T +V E G GT + L + + + Sbjct: 127 DRIGVVAYAAEAFTKVPVTSDHQVVIDEIKNLNSAGLEPGTAIGEGLSVAVNHLVKSKAK 186 Query: 335 AQWNIYAAQASDGDNWADD--SPLCHEILAKKLLPVVRYYSYIE-----------ITRR- 380 + +DG + + P LAK V I Sbjct: 187 ---SKVVILMTDGVSNIQNAIPPQVAAELAKNNNIKVYAIGIGTNGYALMPTSQDIFGDL 243 Query: 381 --------AHQTLWREYEHLQSTFDNFAMQHIRDQDDIYPVFRELFHKQNA 423 + RE F +++Y +L Sbjct: 244 VFTETEVTIDENTLREIAQTTGGKY-FRATSNSSLEEVYDEINQLEKSDVK 293 >UniRef50_C7FPD9 Uncharacterized protein n=2 Tax=environmental samples RepID=C7FPD9_9BACT Length = 836 Score = 95.7 bits (236), Expect = 3e-18, Method: Composition-based stats. Identities = 22/175 (12%), Positives = 49/175 (28%), Gaps = 19/175 (10%) Query: 237 KNYEKRPDPSSQAVMFCLMDVSGSMDQSTKDMAKRFYILLYLFLSRTYKNVEVVYIRH-H 295 + PD + +F ++D SGSM D A+ + + + Sbjct: 297 PPLKVDPDQVTPKELFFVVDTSGSMMGEPLDKARAAMRYALERMGPDDTFQIIDFASGVA 356 Query: 296 TQAKEVDEHEFFYSQET----------GGTIVSSALKLMDEVVKERYNPAQWNIYAAQAS 345 + A + ++ GGT + + ++ + P A + Sbjct: 357 SLAPRPLPNTPENLRKGLAFIEAMTSQGGTEMLAGIRAALD----GPTPPGRLRIVAFMT 412 Query: 346 DGDNWADDSPLCHEILAKKLLPVVRYYSYIEITRRAHQTLWREYEHLQSTFDNFA 400 DG + + + R +S+ ++ L E + Sbjct: 413 DG---YIGNDGDILDYIDQSVGQARLFSFGVG-EDVNRYLLEEMATRGRGTVQYV 463 >UniRef50_Q22SJ7 von Willebrand factor type A domain containing protein n=1 Tax=Tetrahymena thermophila SB210 RepID=Q22SJ7_TETTH Length = 642 Score = 95.7 bits (236), Expect = 3e-18, Method: Composition-based stats. Identities = 19/206 (9%), Positives = 54/206 (26%), Gaps = 30/206 (14%) Query: 237 KNYEKRPDPSSQAVMFCLMDVSGSMDQSTKDMAKRFYILLYLFLSRTYKNVEVVYIRHHT 296 K+ + + C+++ S SM K + L L+ + V+ + T Sbjct: 186 KDQVLVKNSRPSIDLVCVINNSESMHGEKILNVKNTLLYLLEMLNSNDRLSLVLSNNNPT 245 Query: 297 QAKEV----------DEHEFFYSQETGGTIVSSALKLMDEVVKERYNPAQWNIYAAQASD 346 ++ + T T ++ ++ +++ R + + + SD Sbjct: 246 TLFDLKYLDEKNKQDLKRIINNISITQNTNITKSMIKAFNILQFRQSQNKVSS-IFLLSD 304 Query: 347 GDNWADDSPLCHEILA------KKLLPVVRYYSYIEITRRAHQTLWREYEHLQSTFDNFA 400 G + + + + + Y + + L++ + Sbjct: 305 G--VDSSAEKQIQNYISSQQSLQNKNFAIHSFGYGF---DQDAEMINKICSLKNGNFYYI 359 Query: 401 MQHIRDQDDIYPVFRELFHKQNATAK 426 + F Sbjct: 360 --------QNMNQVDQYFADVLGGTL 377 >UniRef50_A0Z5Z1 BatB protein, putative n=2 Tax=unclassified Gammaproteobacteria (miscellaneous) RepID=A0Z5Z1_9GAMM Length = 332 Score = 95.7 bits (236), Expect = 3e-18, Method: Composition-based stats. Identities = 30/205 (14%), Positives = 58/205 (28%), Gaps = 32/205 (15%) Query: 240 EKRPDPSSQAVMFCLMDVSGSMDQSTKDM----------AKRFYILLYLFLSRTYKNVEV 289 E +S + +D+SGSM + K + + Sbjct: 81 EPIELANSGRDLLLAIDLSGSMQIEDMQIGNSLVSRITAVKAIAADFASR-RTGDRVGLI 139 Query: 290 VYIRHH------TQAKEVDEHEFFYSQ---ETGGTIVSSALKLMDEVVKERYNPAQWNIY 340 ++ T + + +Q T + AL L + ++ER + Sbjct: 140 LFGTRAYVQAPLTFDVKTVKQFIEEAQLGFAGEDTAIGDALGLAVKRLRERPAD---SRV 196 Query: 341 AAQASDGDNWADD-SPLCHEILAKKLLPVVRYYSY----IEIT---RRAHQTLWREYEHL 392 +DG + A P+ LA ++ + + + L Sbjct: 197 LILLTDGQDTASTVDPMEAAALASEMNVKIYTIGISRRLGTSSNSSGEVDEALLTAIAQA 256 Query: 393 QSTFDNFAMQHIRDQDDIYPVFREL 417 F + ++ DIY V EL Sbjct: 257 TGGRY-FRARTPKELQDIYQVLDEL 280 >UniRef50_B7KCF7 von Willebrand factor type A n=1 Tax=Cyanothece sp. PCC 7424 RepID=B7KCF7_CYAP7 Length = 573 Score = 95.7 bits (236), Expect = 3e-18, Method: Composition-based stats. Identities = 22/195 (11%), Positives = 59/195 (30%), Gaps = 20/195 (10%) Query: 239 YEKRPDPSSQAVMFCLMDVSGSMDQSTKDMAKRFYILLYLFLSRTYKNVEVVYIR----- 293 ++ + D + ++D SGSMD + + K+ + ++ V Y Sbjct: 388 WKLQKDAGKTVYLMTVIDTSGSMDGAPLEAVKKGLRIASKEINPGNYVGLVTYGDRAAEV 447 Query: 294 -----HHTQAKEVDEHEFFYSQETGGTIVSSALKLMD-EVVKERYNPAQWNIYAAQASDG 347 + + G T + + + ++++++ N Y +DG Sbjct: 448 VPLGLFDELQHKRFLAAIDNLRADGATAMYDGMMIGLSKLMEQKKNNPDGRFYLLLLTDG 507 Query: 348 DNWADDSPLCHEILAKKLLPVVRYYSYIEITRRAHQTLWREYEHLQSTFDNFAMQHIRDQ 407 + + + + V +Y +Q L+ + Sbjct: 508 QANMGVTFDEVKEVIEYSGVRVYPIAYG----DVNQEELEAIASLRE-----STVKKGTP 558 Query: 408 DDIYPVFRELFHKQN 422 +++ + + LF Sbjct: 559 ENVEDLLKGLFQTNL 573 >UniRef50_C6VXL7 von Willebrand factor type A n=1 Tax=Dyadobacter fermentans DSM 18053 RepID=C6VXL7_DYAFD Length = 339 Score = 95.7 bits (236), Expect = 3e-18, Method: Composition-based stats. Identities = 31/216 (14%), Positives = 63/216 (29%), Gaps = 33/216 (15%) Query: 240 EKRPDPSSQAVMFCLMDVSGSM-----DQSTKDMAKRFYILLYLFLSRTYKNVEVVYIRH 294 E++ S + L+D+S SM + + AKR + +V+ Sbjct: 94 ERKDRFSEGIDIMLLLDISDSMIEKDLSPNRLEAAKRMARQFIKG-RLQDRIGLIVFAGE 152 Query: 295 HTQAKEVD----------EHEFFYSQETGGTIVSSALKLMDEVVKERYNPAQWNIYAAQA 344 + + T GT + SAL + ++ + A + A Sbjct: 153 AVSLCPLTTDYELLYGFLDEVTPSLIPTPGTAIGSALAVAVNRMR---DTAGESKVAILI 209 Query: 345 SDGDNWADD-SPLCHEILAKKLLPVVRYYSYIEITRRA------------HQTLWREYEH 391 SDGDN + + P LA V S + + + + Sbjct: 210 SDGDNTSGNLGPTTSAQLANAFGVKVYTISVGKPKSASKADTTASAGALMDEGELQNIAG 269 Query: 392 LQSTFDNFAMQHIRDQDDIYPVFRELFHKQNATAKG 427 + + F + ++ +L ++ Sbjct: 270 IGNGKY-FRATDNTALESVFKQIDQLEKVKSRDVLS 304 >UniRef50_A9SQ90 Predicted protein n=3 Tax=Physcomitrella patens subsp. patens RepID=A9SQ90_PHYPA Length = 778 Score = 95.3 bits (235), Expect = 3e-18, Method: Composition-based stats. Identities = 22/207 (10%), Positives = 47/207 (22%), Gaps = 32/207 (15%) Query: 239 YEKRPDPSSQAVMFCLMDVSGSMDQSTKDMAKRFYILLYLFLSRTYKNVEVVYI------ 292 + Q + L+D SGSM + A + L + + Sbjct: 336 PDPSKITVFQRAVVFLLDRSGSMYGDPLNDALQALYSGLESLKPEDSFNIIAFDHETALF 395 Query: 293 ------RHHTQAKEVDEHEFFYSQETGGTIVSSALKLMDEVVKERYNPAQWNIYAAQASD 346 + E + GGT + S L+ ++V+ Y +D Sbjct: 396 SSQMERANSASILRAREWATEKCKARGGTDILSPLQQAFKLVEN---FPGAVPYVFLITD 452 Query: 347 GDNWADDSPLCHEILAK-------KLLPVVRYYSYIEITRRAHQTLWREYEHLQSTFDNF 399 G A D+ + + P + + + + + Sbjct: 453 G---AVDNEKNICLTMQSRIVELGARAPRISTFGIG---HYCNYYFLKMLAVIGRG---- 502 Query: 400 AMQHIRDQDDIYPVFRELFHKQNATAK 426 + + Sbjct: 503 LSDVAFASGKLRGQMERMLVAAATPVL 529 >UniRef50_UPI0001C38CFE von Willebrand factor, type A n=1 Tax=Arthrospira platensis str. Paraca RepID=UPI0001C38CFE Length = 489 Score = 95.3 bits (235), Expect = 3e-18, Method: Composition-based stats. Identities = 24/189 (12%), Positives = 55/189 (29%), Gaps = 21/189 (11%) Query: 246 SSQAVMFCLMDVSGSMDQSTKDMAKRFYILLYLFLS-RTYKNVEVVYIRHH------TQA 298 + + L+D SGSM S +R + + V + T+ Sbjct: 49 TKPQAVVMLIDTSGSMSGSKLPEVQRAASEFVSRQNLKRDDLAVVEFSSRASVVADFTRD 108 Query: 299 KEVDEHEFFYSQETGGTIVSSALKLMDEVVKERYNPAQWNIYAAQASDGDNWADDSPLCH 358 + + GGT +S L V++ P +DG+ ++ Sbjct: 109 ERELQQAIARLSAWGGTNLSEGFNLATSVLQNSDRPGN----ILLFTDGEP---NNRRMA 161 Query: 359 EILAKKLLP-VVRYYSYIEITRRAHQTLWREYEHLQSTFDNFAMQHIRDQDDIYPVFRE- 416 +A+++ + + + F + D D + + Sbjct: 162 ASIAQQIRASGINLVAVGTGDAPVNYLT----ALTGDPDLVF-YANFGDLDSAFRGAEKA 216 Query: 417 LFHKQNATA 425 ++ +Q + Sbjct: 217 IYGQQLVES 225 >UniRef50_Q54MG4 von Willebrand factor A domain-containing protein DDB_G0285975 n=4 Tax=Dictyostelium discoideum RepID=Y5975_DICDI Length = 917 Score = 95.3 bits (235), Expect = 3e-18, Method: Composition-based stats. Identities = 22/189 (11%), Positives = 52/189 (27%), Gaps = 20/189 (10%) Query: 247 SQAVMFCLMDVSGSMDQSTKDMAKRFYILLYLFLSRTYKNVEVVYIRHHTQAKEVDEHEF 306 ++ L+D SGSM AKR ++ L+ K + T+A + + Sbjct: 336 QKSEFIFLIDCSGSMSGEPIKKAKRALEIIIRSLNENCKFNIYCFGSRFTKAFDNSKMYN 395 Query: 307 F-----------YSQET-GGTIVSSALKLMDEVVKERYNPAQWNIYAAQASDGDNWADDS 354 GGT + ++ +++ + ++ +DG+ Sbjct: 396 DETLKEISGYVEKIDADLGGTELLPPIR---DILSTESDF-EYPRQLFILTDGE---VSE 448 Query: 355 PLCHEILAKKLLPVVRYYSYIEITRRAHQTLWREYEHLQSTFDNFAMQHIRDQDDIYPVF 414 R ++Y L + + ++ + + Sbjct: 449 RDSLINYVATESNNTRIFTYGIGNS-VDTELVIGLSKACKGYYEMIKDNSNFEEQVMKLV 507 Query: 415 RELFHKQNA 423 F + Sbjct: 508 SIAFEPTLS 516 >UniRef50_Q114A2 von Willebrand factor, type A n=3 Tax=Cyanobacteria RepID=Q114A2_TRIEI Length = 1204 Score = 95.3 bits (235), Expect = 4e-18, Method: Composition-based stats. Identities = 29/219 (13%), Positives = 68/219 (31%), Gaps = 21/219 (9%) Query: 187 ELHALEENLAIISNSEPAQLLEEERLRKEIAELRAKIERVPFIDTFDLRYKNYEKRPDPS 246 + L + + Q E + + L F++ + R E +P Sbjct: 573 DTVNLPLVVDSYLQEKQKQEAREAQAKAAPERLLEPE----FVENPEQRLPEPEFVENPE 628 Query: 247 SQAVMFCLMDVSGSMDQSTKDMAKRFYILLYLFLSRT---YKNVEVVYIRHHTQAKEV-- 301 ++ + L+D S SM + + + VE+ I +++ + V Sbjct: 629 NRCPIILLLDTSYSMSGEAITELNQGVKIFQASVKEDELASLRVEIAVITFNSEIEVVQD 688 Query: 302 ----DEHEFFYSQETGGTIVSSALKLMDEVVKERYNP------AQWNIYAAQASDGDNWA 351 D+ + +G T + A++ E++++R + + +DG Sbjct: 689 FVTVDKFIPKTLEASGVTHMGKAIEKALELLEKRKQDYKNSDIQYYRPWIFLITDGQPT- 747 Query: 352 DDSPLCHEILAKKLLPVVRYYSYIEITRRAHQTLWREYE 390 D+ ++ + + R A E Sbjct: 748 -DTWQDAAKKIEEAETNRKLLFFAVGVRDADMETLSEIS 785 >UniRef50_B9GN58 Predicted protein (Fragment) n=3 Tax=rosids RepID=B9GN58_POPTR Length = 705 Score = 95.3 bits (235), Expect = 4e-18, Method: Composition-based stats. Identities = 25/175 (14%), Positives = 53/175 (30%), Gaps = 17/175 (9%) Query: 241 KRPDPSSQAVMFCLMDVSGSMDQSTKDMAKRFYILLYLFLSRTYKNVEVVYIRHHTQAKE 300 P + + ++DVS SM + M KR L+ L + V + + Sbjct: 323 LDPSRRAPIDLITVLDVSASMTGAKLQMLKRAMRLVISSLGSADRLSIVAFSSSPKRLLP 382 Query: 301 V----------DEHEFFYSQETGGTIVSSALKLMDEVVKERYNPAQWNIYAAQASDGDNW 350 + G+ V AL+ +V+++R SDG + Sbjct: 383 LKRMTPNGQRSARRIIDRLVCGQGSSVGEALRKATKVLEDRRERNPVAS-IMLLSDGQDE 441 Query: 351 ADDSPLCHEILAKKLLPVVRYYSYIEITRRAHQTLWREYEHLQSTFDNFAMQHIR 405 + H + V + + + + + + + +Q +R Sbjct: 442 RSSTRFAHIEI------PVHSFGFGQSGGNSQEPAEDAFAKCVGGLLSVVVQDLR 490 >UniRef50_Q10Z89 von Willebrand factor, type A n=1 Tax=Trichodesmium erythraeum IMS101 RepID=Q10Z89_TRIEI Length = 477 Score = 95.3 bits (235), Expect = 4e-18, Method: Composition-based stats. Identities = 18/191 (9%), Positives = 47/191 (24%), Gaps = 23/191 (12%) Query: 239 YEKRPDPSSQAVMFCLMDVSGSMDQSTKDMAKRFYILLYLFLSRTYKNVEVVYIRHHTQA 298 P + L+D S SM + + + + + + Sbjct: 36 LSLTRRPPQPQTVVLLIDTSSSMWGGKLPEVQAAATGFVE--RQNLTVNNLAIVEFSSNS 93 Query: 299 KEVDEHEFF---------YSQETGGTIVSSALKLMDEVVKERYNPAQWNIYAAQASDGDN 349 + + + +GGT +S LK + +++ P +DG Sbjct: 94 QVLTNFDADKTELKQAIANLTPSGGTNLSQGLKTVASLLRNSNTPN-----ILLFTDGQP 148 Query: 350 WADDSPLCHEILAKKLLPVVRYYSYIEITRRAHQTLWREYEHLQSTFDNFAMQHIRDQDD 409 + ++ + + T A+ + + D Sbjct: 149 NDPRASKSIAREIREA--GINLVTVG--TGDANSNYLTSLTENPDLVFF---ANSGEIDQ 201 Query: 410 IYPVFRELFHK 420 + + + Sbjct: 202 AFRAAEKAISQ 212 >UniRef50_Q01UI0 von Willebrand factor, type A n=1 Tax=Candidatus Solibacter usitatus Ellin6076 RepID=Q01UI0_SOLUE Length = 837 Score = 95.3 bits (235), Expect = 4e-18, Method: Composition-based stats. Identities = 20/182 (10%), Positives = 48/182 (26%), Gaps = 15/182 (8%) Query: 243 PDPSSQAVMFCLMDVSGSMDQSTKDMAKRFYILLYLFLSRTYKNVEVVYIRHHTQ----- 297 P + ++D S SM+ ++A+ I + L +++ Sbjct: 387 PRSPEGTAVVLIIDKSSSMEGRKIELARLAAIGVVENLRPIDSVGVLIFDNSFQWAVPIR 446 Query: 298 ---AKEVDEHEFFYSQETGGTIVSSALKLMDEVVKERYNPAQWNIYAAQASDGDNWADDS 354 + + GGT ++ AL + + + +DG + DS Sbjct: 447 KAEDRATIKKLISGITPDGGTQIAPALTEAYQRI---LPQTAMYKHIVLLTDGISEEGDS 503 Query: 355 PLCHEILAKKLLPVVRYYSYIEITRRAHQTLWREYEHLQSTFDNFAMQHIRDQDDIYPVF 414 A+ + + ++ + F + + Sbjct: 504 M-TLTKEAQANHVTISTVGLGQ---DVNRAFLEKVASNADGKAYFLNDPSGLEQLLLRDV 559 Query: 415 RE 416 E Sbjct: 560 EE 561 >UniRef50_Q8TYU9 Mg-chelatase subunit ChlI and Chld (MoxR-like ATPase and vWF domain) n=1 Tax=Methanopyrus kandleri RepID=Q8TYU9_METKA Length = 818 Score = 95.3 bits (235), Expect = 4e-18, Method: Composition-based stats. Identities = 45/315 (14%), Positives = 89/315 (28%), Gaps = 23/315 (7%) Query: 120 SKDEYLDLLFEDLALPNLKQNQQRQLT-------EYKTHRAGYTANGVPANISVVRSLQN 172 S DE + + L++ QR+L +++ G + ++ +L Sbjct: 503 SYDEIASQQHDYSLIDELEEEIQRRLEILQKLGFVRPSYQGGVSLTLKGRELAAFSALIE 562 Query: 173 SLARRTAMTAGKRRELHALEENLAIISNSEPAQLLE-EERLRKEIAELRAKIERVPFIDT 231 L G E S S + + L + A I Sbjct: 563 ELEAFEGTEFGHHAAKRLSERGTGSRSYSREYRRGDPYANLDVRGSLRTAVRRGRREILP 622 Query: 232 FDLRYKNYEKRPDPSSQAVMFCLMDVSGSMDQSTKDMAKRFYILLYLF-LSRTYKNVEVV 290 DLR + E+ + ++D SGSM D AKR I L F + + V Sbjct: 623 EDLRSFDREE----EVCLDIVYVIDTSGSMSGDRIDAAKRAAIALAHFSVKAGDRVGIVG 678 Query: 291 YIRHHTQAKEVDEHEFF------YSQETGGTIVSSALKLMDEVVKERYNPAQWNIYAAQA 344 + ++ + G T + A+++ E+ + P + + + Sbjct: 679 FNTKAEIVVDITSDVEEIITKVMSLKPGGATDIGDAIRVGTELFRRCGRPDR-DWHMILL 737 Query: 345 SDGDNWADDSPLCHEILAK---KLLPVVRYYSYIEITRRAHQTLWREYEHLQSTFDNFAM 401 +DG + + L++ V + L + + Sbjct: 738 TDGVPTKGEPDPETKALSEATAASRMGVTISTIGIKLPEEGIRLIEHIAGISGGRSHHIT 797 Query: 402 QHIRDQDDIYPVFRE 416 +R Sbjct: 798 DPEELTLVTLNEYRR 812 >UniRef50_A7SV91 Predicted protein (Fragment) n=2 Tax=Nematostella vectensis RepID=A7SV91_NEMVE Length = 614 Score = 95.0 bits (234), Expect = 4e-18, Method: Composition-based stats. Identities = 28/255 (10%), Positives = 60/255 (23%), Gaps = 21/255 (8%) Query: 184 KRRELHALEENLAIISNSEPAQLLEEERL--RKEIAELRAKIERVPFIDTF--DLRYKNY 239 KR+ L ++ ++ E F+ T L Y Sbjct: 207 KRQASVRLTGESYHFNDDVQVNVIRSEPFQLHALTENGVTTKSSDKFLATPIVMLNYFPE 266 Query: 240 EKRPDPSSQAVMFCLMDVSGSMDQSTKDMAKRFYILLYLFLSRTYKNVEVVYIRH----- 294 ++ L+D S +M D A+ +L L + Y Sbjct: 267 FTDCKERTRGEFIFLVDRSKNMKGENIDRAREVVLLFLKSLPDGCHFNVISYGASYIKLF 326 Query: 295 ------HTQAKEVDEHEFFYSQETGGTIVSSALKLMDEVVKERYNPAQWNIYAAQASDGD 348 ++E + + G + + + EV + + G+ Sbjct: 327 TQSEVCDHSSRERASDFVWDLKA--GIDNARPIDPLREVYSHNLIDTGYPRQVFLVTGGE 384 Query: 349 NWADDSPLCHEILAKKLLPVVRYYSYIEITRRAHQTLWREYEHLQSTFDNFAMQHIRDQD 408 + +K R ++ + + R Q R Sbjct: 385 ---VGNAEQVTDEVRKNAHTARCFTLGINVGKHSDHV-RRIARAGRGTCELVSQLDRVAP 440 Query: 409 DIYPVFRELFHKQNA 423 + ++ E Sbjct: 441 KVQSMWDEATQPVVT 455 >UniRef50_Q7K0H4 Straightjacket n=11 Tax=Coelomata RepID=Q7K0H4_DROME Length = 1218 Score = 95.0 bits (234), Expect = 4e-18, Method: Composition-based stats. Identities = 40/313 (12%), Positives = 84/313 (26%), Gaps = 35/313 (11%) Query: 127 LLFEDLALPNLKQNQQRQLTEYKTHRAGYTANGVPANISVVRSLQNSLARRTAMTAGKRR 186 + +D+ P + + + E + N +++ V ++ + Sbjct: 147 DMDKDIGEPLIYVQPKVVVLEPRPEFHNTPVNFSVSSVHVPVNVFDRAPDVIKAIQWS-- 204 Query: 187 ELHALEENLAIISNSEPAQLLEEERLRKEIAEL-----RAKIERVPFIDTFDLRYKNYEK 241 E I ++ + +K + +D +D R +++ Sbjct: 205 -----ENLDQIFRDNYKNDPTLSWQFFGSSTGFMRQFPASKWRKDVPVDLYDCRLRSWYM 259 Query: 242 RPDPSSQAVMFCLMDVSGSMDQSTKDMAKRFYILLYLFLSRTYKNVEVVYIRHHTQAKEV 301 +S + LMD SGSM D+AK + L + + + Sbjct: 260 -EAATSPKDIVILMDGSGSMLGQRLDIAKHVVNTILDTLGTNDFVNIFTFDKEVSPVVPC 318 Query: 302 DEHEFFYSQETGGT-----------------IVSSALKLMDEVVKE---RYNPAQWNIYA 341 E Q G ++AL E+++E AQ N Sbjct: 319 FEDTL--IQANLGNIRELKEGIELFRPKSIANYTAALTKAFELLEETKLSSRGAQCNQAI 376 Query: 342 AQASDGDNWADDSPLCHEILAKKLLPVVRYYSYIEITRRAHQTLWREYEHLQSTFDNFAM 401 DG + VR ++Y+ A+ R + Sbjct: 377 MIIGDGAPENNREVFELHNWRDPPYKPVRVFTYLIGKEVANWDDIRWMACENQGYYVHLS 436 Query: 402 QHIRDQDDIYPVF 414 ++ + Sbjct: 437 DTAEVREMVLNYI 449 >UniRef50_Q8TU27 Putative uncharacterized protein n=1 Tax=Methanosarcina acetivorans RepID=Q8TU27_METAC Length = 589 Score = 95.0 bits (234), Expect = 4e-18, Method: Composition-based stats. Identities = 19/186 (10%), Positives = 46/186 (24%), Gaps = 19/186 (10%) Query: 248 QAVMFCLMDVSGSM-DQSTKDMAKRFYILLYLFLSRTYKNVEVV-------YIRHHTQAK 299 + +D SGSM + K + + VV + T Sbjct: 80 PMDVVFAIDSSGSMQSNDPSGLRKTAAKSFVDKMDSSRDTAGVVSWDDSIDFSLPLTNDF 139 Query: 300 EVDEHEFFYSQETGGTIVSSALKLMDEVVKERYNPAQWNIYAAQASDGDNWADDSPLCHE 359 + + +G T ++ L+ +++ +DG S Sbjct: 140 PLVKTNIDSVDSSGSTNLNVGLEEAIDILDANPRTENSVEVIIFLTDGQGTYLHST---A 196 Query: 360 ILAKKLLPVVRYYSYIEITRRAHQTLWREYEHLQSTFDNFAMQHIRDQDDIYPVFRELFH 419 A V+ + ++ + + +F ++F Sbjct: 197 QEAADKGYVIYSIGLGGVNPTP----LQDMATTTGGAYYSSPDATS----LQAIFDDIFS 248 Query: 420 KQNATA 425 + + Sbjct: 249 EVTTST 254 >UniRef50_C0D9H7 Putative uncharacterized protein n=1 Tax=Clostridium asparagiforme DSM 15981 RepID=C0D9H7_9CLOT Length = 1360 Score = 95.0 bits (234), Expect = 5e-18, Method: Composition-based stats. Identities = 29/248 (11%), Positives = 66/248 (26%), Gaps = 24/248 (9%) Query: 187 ELHALEENLAIISNSEPAQLLEEERLRKEIAELRAKIERVPFIDTFDLRYKNYEKRPDPS 246 + N++ + S +++ ++ + +++ ++ + Y Sbjct: 487 SMQRSAVNISSLDASAFSEITAYVQIETPVDYSIDELKSHITVEDCGAQISEYNLEKVEY 546 Query: 247 SQAVMFCLMDVSGSMDQSTKDMAKRFYILLYLFLSRTYKNVEVVYIRHH------TQAKE 300 S A M DVSGSM + ++ I + +S + +++ T + Sbjct: 547 SSANMLLCCDVSGSMQGRPIEDSRAAVISMAESMSGNARLGVILFNSSVQGLTDFTVQPD 606 Query: 301 VDEHEFFYSQETGGTIVSSALKLMDEVVKERYNPAQWNIYAAQASDGDNWADDSPLCHE- 359 V GGT + + E + SDG S + Sbjct: 607 VIRSTAESMTANGGTNIFDTVVHGLESFPKNGPE--VLNTLVVMSDGQENNAHSAEEIQT 664 Query: 360 ---ILAKKLLPVVRYYSYIEITRRAHQTLWREYEHLQSTFDNFAMQHIRDQDDIYPVFRE 416 AK +V + + Sbjct: 665 AIGQAAKDKSILVHCLGLG---SEVDANYLQTIAQSAGGTYQYVTDSSSL---------A 712 Query: 417 LFHKQNAT 424 +F++ A+ Sbjct: 713 VFYQNLAS 720 >UniRef50_C4DQN3 Uncharacterized protein containing a von Willebrand factor type A (VWA) domain n=1 Tax=Stackebrandtia nassauensis DSM 44728 RepID=C4DQN3_9ACTO Length = 831 Score = 95.0 bits (234), Expect = 5e-18, Method: Composition-based stats. Identities = 49/439 (11%), Positives = 102/439 (23%), Gaps = 48/439 (10%) Query: 1 MTWFIDRRLNGKNKSMVNRQRFLRRYKAQIKQSISEAINKRSVTDVDSGESVSIPTEDIS 60 MT + R + R + + Y I + +I + DV + ++ + + Sbjct: 83 MTMTVAER--TVTAELHERAKARQLYDTAISEGKRASIAEAERADVFTMRVGNLGAGEEA 140 Query: 61 EPMFHQGRGGLRHRVHPGNDHFVQNDRIERPQGGGGGSGSGQGQASQDGEGQDEFVFQIS 120 + P G + G+G A D Sbjct: 141 VVTLTLVGPLAFEDNEATLRLPLVVAPRYIPGQPTGAAPVGEGYAEDTDAVPDASRITPP 200 Query: 121 KDEYLDLLFEDLALPNLKQNQQRQLTEYKTHRAGYTANGVPANISVVRSLQNSLARRTAM 180 L N R E AG + +++ V + Sbjct: 201 V------------LLPGFPNPVRLSIEVTIDPAGLPLRQLRSSLHAVTVDETGEV----T 244 Query: 181 TAGKRRELHALEENLAIISNSEPAQLLEEERLRKEIAELRAKIERVPFIDTFDLRYKNYE 240 + + E + + E + ++ I DL Sbjct: 245 RVRIEPGERVNRDFILRFDYGESGDVAGSLLTAPDENEPTSGTFQLTAIPPSDLP----- 299 Query: 241 KRPDPSSQAVMFCLMDVSGSMDQSTKDMAKRFYILLYLFLSRTYKNVEVVYIRHHTQAKE 300 + + L+D SGSM A+R + LS + + T + Sbjct: 300 ----RARPRDVVVLLDRSGSMGGWKMVAARRAAARIVDTLSSADRFAVRCFDTAMTSPEG 355 Query: 301 VDEH------------EFFYS---QETGGTIVSSALKLMDEVVKERYNPAQWNIYAAQAS 345 +D + + + GGT + L +++ + + Sbjct: 356 LDPNGLSAGTDRNRFRAVEHLAGTETRGGTDILKPLSTAVDLLTA--GEKGRDRVIILVT 413 Query: 346 DGDNWADDSPLCHEILAKKLLPVVRYYSYIEITRRAHQTLWREYEHLQSTFDNFAMQHIR 405 DG + L +R + + + R Sbjct: 414 DGQ---VGNEDQILRELTGRLSGMRVHVVGIDK-AVNAGFLHRLALVGRGRCELVESEDR 469 Query: 406 DQDDIYPVFRELFHKQNAT 424 + + R + Sbjct: 470 LDEATAHIHRRIVAPVVTD 488 >UniRef50_Q2GTB7 Putative uncharacterized protein n=1 Tax=Chaetomium globosum RepID=Q2GTB7_CHAGB Length = 777 Score = 95.0 bits (234), Expect = 5e-18, Method: Composition-based stats. Identities = 19/211 (9%), Positives = 44/211 (20%), Gaps = 31/211 (14%) Query: 220 RAKIERVPFIDTFDLRYKNYEKRPDPSSQAVMFCLMDVSGSMDQST-------------- 265 + + + E + +D+SGSM Sbjct: 44 FSSEHERGGLIVKIQPPREPENADLHHVPCDLVLSIDISGSMADEAPAPSKPGGEAGEDT 103 Query: 266 ----KDMAKRFYILLYLFLSRTYKNVEVVYIRHHT------QAKEVDEHEFFYSQETGGT 315 D+ K + L + V + + K + T Sbjct: 104 GLRVIDLVKHAARTIVATLDSRDRLGIVTFTNRSKVGIPPYENKAKTLENIESMEPFSST 163 Query: 316 IVSSALKLMDEVVKERYNPA-QWNIYAAQASDGDNWADDSPLC---HEILAKKLLPVVRY 371 + ++ + E + +DG P + L + Sbjct: 164 NMWHGIRDGLSLFSEAEGGSTGRVPALLVLTDGMPNYMCPPKGYVPMLRSMEPLPATIHT 223 Query: 372 YSYIEITRRAHQTLWREYEHLQSTFDNFAMQ 402 + + L + + +F Sbjct: 224 FGFGY---ELRSGLLKSIAEVGGGNYSFIPD 251 >UniRef50_Q1Q2F4 Putative uncharacterized protein n=1 Tax=Candidatus Kuenenia stuttgartiensis RepID=Q1Q2F4_9BACT Length = 331 Score = 95.0 bits (234), Expect = 5e-18, Method: Composition-based stats. Identities = 27/221 (12%), Positives = 51/221 (23%), Gaps = 44/221 (19%) Query: 240 EKRPDPSSQAVMFCLMDVSGSMDQSTKDM----------AKRFYILLYLFLSRTYKNVEV 289 E+ + + +D+SGSM +M K+ T V Sbjct: 77 EQTKVKTEGIDIVLAVDISGSMLAEDFEMDGKRQNRLYVVKQVVKDFINK-RSTDPIGLV 135 Query: 290 VYIRHHTQAKEVDEHEFFYSQE---------TGGTIVSSALKLMDEVVKERYNPAQWNIY 340 V+ + + Q GT + SA+ + ++ + Sbjct: 136 VFSANAYTQCPLTLDYGILLQFLEKTEIGLLEDGTAIGSAIASSVDRLRN---TKAQSKV 192 Query: 341 AAQASDGDNW-ADDSPLCHEILAKKLLPVVRYYSYIEITR-------------------R 380 +DG N PL LA+ + Sbjct: 193 IVLLTDGRNNSGQIDPLTAAELAQAFNIKIYTIGAGSKGLVPYPARDLFGNRVMRQVKID 252 Query: 381 AHQTLWREYEHLQSTFDNFAMQHIRDQDDIYPVFRELFHKQ 421 E ++ + +IY L + Sbjct: 253 IDDESLAEIANITGGRY-YRATDTGSLKEIYQQIDALEKTE 292 >UniRef50_C3WEJ1 BatA protein n=3 Tax=Fusobacterium RepID=C3WEJ1_FUSMR Length = 319 Score = 95.0 bits (234), Expect = 6e-18, Method: Composition-based stats. Identities = 25/212 (11%), Positives = 54/212 (25%), Gaps = 36/212 (16%) Query: 244 DPSSQAVMFCLMDVSGSM-----DQSTKDMAKRFYILLYLFLSRTYKNVEVVYIRHH--- 295 + +D+S SM + + AK+ + VV+ Sbjct: 77 IKKEGIDIVVALDLSQSMLQRDFKPNRLETAKKLLEEFIDK-RINDRISLVVFGGDAYTK 135 Query: 296 ---TQAKEVDEH-----EFFYSQETGGTIVSSALKLMDEVVKERYNPAQWNIYAAQASDG 347 T V + T + L + +K+ + +DG Sbjct: 136 VPLTFDHNVVKDITSKLTTDDITSNNRTAIGMGLGVSLNRLKDSEAK---SKVIILMTDG 192 Query: 348 DNWADD-SPLCHEILAKKLLPVVRYYSYI--------------EITRRAHQTLWREYEHL 392 +N + + SP+ +AK+L + + L + Sbjct: 193 ENNSGEMSPMGASEIAKELGIKIYTIGIGAREIQIRVPFGHTTVKNTELDENLLKNIAST 252 Query: 393 QSTFDNFAMQHIRDQDDIYPVFRELFHKQNAT 424 F ++ +I+ L + Sbjct: 253 TGGEY-FRAGSEKEFQEIFNRIDSLEKTKIDG 283 >UniRef50_C7RNW6 von Willebrand factor type A n=1 Tax=Candidatus Accumulibacter phosphatis clade IIA str. UW-1 RepID=C7RNW6_9PROT Length = 452 Score = 94.6 bits (233), Expect = 6e-18, Method: Composition-based stats. Identities = 26/190 (13%), Positives = 44/190 (23%), Gaps = 13/190 (6%) Query: 237 KNYEKRPDPSSQAVMFCLMDVSGSMDQSTKDMAKRFYILLYLFLSRTYKNVEVVYI---- 292 + ++D SGSM A R + L T VV+ Sbjct: 35 DPLATEKKARKPYHLALVIDRSGSMSGPPLAEAVRCAKHIADQLEPTDIASLVVFDDRVQ 94 Query: 293 ----RHHTQAKEVDEHEFFYSQETGGTIVSSALKLMDEVVKERYNPAQWNIYAAQASDG- 347 ++ G T + + + + A SDG Sbjct: 95 TLVPPRPVGDRQALHLALSRVHSGGSTNLHGGWQAGADGLLPAAGQAALARVI-LLSDGN 153 Query: 348 DNWAD-DSPLCHEIL-AKKLLPVVRYYSYIEITRRAHQTLWREYEHLQSTFDNFAMQHIR 405 N + P L A+ V +Y ++ L E + Sbjct: 154 ANVGEITDPAGIAALCAQAAERGVSTSTYGLG-SHFNEDLMVEMAKRGGGNHYYGDTAAD 212 Query: 406 DQDDIYPVFR 415 + F Sbjct: 213 LFEPFAAEFD 222 >UniRef50_A3ZR58 Putative uncharacterized protein n=1 Tax=Blastopirellula marina DSM 3645 RepID=A3ZR58_9PLAN Length = 1032 Score = 94.6 bits (233), Expect = 6e-18, Method: Composition-based stats. Identities = 26/257 (10%), Positives = 61/257 (23%), Gaps = 17/257 (6%) Query: 179 AMTAGKRRELHALEENLAIISNSEPAQLLEEERLRKEIAELRAKIERVPFIDTFDLRYKN 238 A + + + ++ + L + + +R+ Sbjct: 389 ASGSSGEEITSFSDRQIELLVRNTQQLGCGLLILGGPSSFGAGGWANTKLEEASPVRFTI 448 Query: 239 YEKRPDPSSQAVMFCLMDVSGSMDQSTKDMAKRFYILLYLFLSRTYKNVEVVYIRHHTQA 298 + + P + ++D SGSM M + + + + + + Sbjct: 449 RDAKVVPV--GALMLVLDKSGSMQGEKMQMTQGAALAAIRAMGAADFAGVIGFDSQAQRI 506 Query: 299 KEVDEHEF--------FYSQETGGTIVSSALKLMDEVVKERYNPAQWNIYAAQASDGDNW 350 + + + +GGT ++ + L ++ + SDG Sbjct: 507 VPIRKVDNPGMFVAQVRKLSASGGTNMTPGVALGFRDLQNV---DAGVKHMIVLSDGQTE 563 Query: 351 ADDSPLCHEILAKKLLPVVRYYSYIEITRRAHQTLWREYEHLQSTFDNFAMQHIRDQDDI 410 + KK+ V + A Q L Sbjct: 564 PG-NVAQIASDMKKMGMTVSAVAVG---SDADQKLMATVARNGGGKFYAVNNPKAIPRIF 619 Query: 411 YPVFRELFHKQNATAKG 427 R + A G Sbjct: 620 MREARRVAQPLVKEAPG 636 >UniRef50_Q6ZFR4 Zinc finger (C3HC4-type RING finger) protein family-like n=8 Tax=Oryza sativa RepID=Q6ZFR4_ORYSJ Length = 703 Score = 94.6 bits (233), Expect = 6e-18, Method: Composition-based stats. Identities = 27/201 (13%), Positives = 50/201 (24%), Gaps = 28/201 (13%) Query: 248 QAVMFCLMDVSGSMDQSTKDMAKRFYILLYLFLSRTYKNVEVVYIRHHTQAKEVDE---- 303 + ++DVSGSM+ + KR L L + V + + + Sbjct: 268 SVDLVTVLDVSGSMEGYKLALLKRAMGL----LGPGDRLAVVSFSYSARRVIRLTRMSEG 323 Query: 304 ------HEFFYSQETGGTIVSSALKLMDEVVKERYNPAQWNIYAAQASDGDN-------W 350 G T + L +V R SDG + W Sbjct: 324 GKASAKSAVESLHADGCTNILEGLVEAAKVFDGRRYRNAVASVI-LLSDGQDNYNVNGGW 382 Query: 351 ADDSPLCHEILA-----KKLLPVVRYYSYIEITRRAHQTLWREYEHLQSTFDNFAMQHIR 405 + + +L + + +++ T + +F Sbjct: 383 GASNSKNYSVLVPPSFKRSGDRRLPVHTFGFGT-DHDASAMHTIAEETGGTFSFIENQAV 441 Query: 406 DQDDIYPVFRELFHKQNATAK 426 QD L A+ Sbjct: 442 VQDAFAQCIGGLLSVPVQEAR 462 >UniRef50_B8G7Y1 von Willebrand factor type A n=3 Tax=Chloroflexus RepID=B8G7Y1_CHLAD Length = 914 Score = 94.6 bits (233), Expect = 6e-18, Method: Composition-based stats. Identities = 32/300 (10%), Positives = 69/300 (23%), Gaps = 25/300 (8%) Query: 137 LKQNQQRQLTEYKTHRAGYTANGVPANISVVRSLQNSLARRTAMTAGKRRELHALEENLA 196 L Q+T V S V + +L R A + L Sbjct: 301 LLTTTPDQVTALVAAWQATGITVVVQEPSQVPADPRAL-RAFDTIALVDTPADEVPTALQ 359 Query: 197 IISNSEPAQLLEEERLRKEIAELRAKIERVPFIDTFDLRYKNYEKRPDPSSQAVMFCLMD 256 + + A R ++ + R + + ++D Sbjct: 360 RALTTYVRDNGGGLLVIGGPRSFGAGGWRRTLLEPILPVALDPPLREERP-DLALVLVID 418 Query: 257 VSGSMDQ------STKDMAKRFYILLYLFLSRTYKNVEVVYIRHHTQAKE--------VD 302 SGSM + + D+A+ L++ + + + Sbjct: 419 RSGSMRELVDDGRTQLDLAREAVYQASRGLTQRDQIALIAFDSIADTLLPLQPLPGLFTI 478 Query: 303 EHEFFYSQETGGTIVSSALKLMDEVVKERYNPAQWNIYAAQASDGDNWADDSPLCHEILA 362 E GGT + S + L E + + +DG ++ Sbjct: 479 EDALSRLVAGGGTNIRSGIALAAETIATS---QARIRHVILLTDG--VSETEYADLVADL 533 Query: 363 KKLLPVVRYYSYIEITRRAHQTLWREYEHLQSTFDNFAMQHIRDQDDIYPVFRELFHKQN 422 + V + T + + + + ++ Sbjct: 534 RAQGITVSAIAIGLDT----DPALERVAQIGGGKYYLVQRVPDLPQVVLEETVRVANRDV 589 >UniRef50_A7T2Z0 Predicted protein n=4 Tax=Nematostella vectensis RepID=A7T2Z0_NEMVE Length = 357 Score = 94.6 bits (233), Expect = 6e-18, Method: Composition-based stats. Identities = 27/182 (14%), Positives = 51/182 (28%), Gaps = 12/182 (6%) Query: 245 PSSQAVMFCLMDVSGSMDQSTKDMAKRFYILLYLFLSRTYKNVEVVYIRHHT-------- 296 P + + L+D SGS++ + D K F L + + V + +T Sbjct: 141 PPLKMNLVFLIDNSGSINDTEFDNFKEFAKKLAESFTISATYTHVAAVYFNTLANFGFNL 200 Query: 297 -QAKEVDEHEFFYSQ-ETGGTIVSSALKLMDEVVKERYNPAQWNIYAAQASDGDNWADDS 354 V + GGT + AL + V + +DG + DS Sbjct: 201 KYDINVIKTAIDNLPNIGGGTHIGKALTYTLDNVFKVAPRQNVKNVLVVLTDGK--SHDS 258 Query: 355 PLCHEILAKKLLPVVRYYSYIEITRRAHQTLWREYEHLQSTFDNFAMQHIRDQDDIYPVF 414 + P V ++ + F ++H + Sbjct: 259 VTLPAAAVRNYGPGVEVFAVGVGAGDSFVAQLNVIASDPDEDHVFHVEHFSQIESTTGAV 318 Query: 415 RE 416 + Sbjct: 319 ED 320 >UniRef50_B3RZT6 Putative uncharacterized protein n=2 Tax=Trichoplax adhaerens RepID=B3RZT6_TRIAD Length = 1343 Score = 94.6 bits (233), Expect = 7e-18, Method: Composition-based stats. Identities = 25/223 (11%), Positives = 60/223 (26%), Gaps = 18/223 (8%) Query: 215 EIAELRAKIERVPFIDTFDLRYKNYEKRPDPSSQAVMFCLMDVSGSMDQSTKDMAKRFYI 274 ++ + ++ R + + ++D SGSM S + Sbjct: 263 TWTVIKRHQDFTNQNRPARIKNTAPIFRFVKAMPVRIVMVLDKSGSMRGSNLQQLIQAAT 322 Query: 275 LLYLFLSRTY-KNVEVVYIRHHTQAKEVDEHEFFYSQ----------ETGGTIVSSALKL 323 + L L + +++ T + + +GGT + S + Sbjct: 323 NVILQLGQIDGSIGIIIFSTSATVTCPLMAVNNDQDKNKLIGCLPPEASGGTSIGSGILK 382 Query: 324 MDEVVKERYNPAQWN-IYAAQASDGDNWADDSPLCHEILAKKLLPVVRYYSYIEITRRAH 382 E++ + + + SDG A+ + VV S+ + + Sbjct: 383 GIELLLGSVGEQKPSGGHLIVMSDGQENANPRIKDVMSNITENDVVVTSISFGQSASKV- 441 Query: 383 QTLWREYEHLQSTFDNFAMQHIRDQDDIYPVFRELFHKQNATA 425 + FA + + F + + Sbjct: 442 ---LEDLAKSTGGSSYFASTNGTLT--LMNAFTAIISNLVGPS 479 >UniRef50_B2KDS9 von Willebrand factor type A n=1 Tax=Elusimicrobium minutum Pei191 RepID=B2KDS9_ELUMP Length = 373 Score = 94.6 bits (233), Expect = 7e-18, Method: Composition-based stats. Identities = 32/207 (15%), Positives = 53/207 (25%), Gaps = 25/207 (12%) Query: 237 KNYEKRPDPSSQAVMFCLMDVSGSMDQSTKD-----MAKRFYILLYLFLSRTYKNVEVVY 291 + P+ + +D SGSM D AK + + VV+ Sbjct: 135 DAQKTVLPPTEGVDIILAIDTSGSMAAQDFDPNRITAAKVAAANFIAN-RLSDRIGIVVF 193 Query: 292 IRHH------TQAKEVDEHEFFYSQET----GGTIVSSALKLMDEVVKERYNPAQWNIYA 341 T E + GT + A+ V +PA+ + Sbjct: 194 ASDAMLQSPLTLDYESLLDFLADVRIGMVRTDGTAIGDAI--AVSSVHLERSPAR-SKVI 250 Query: 342 AQASDGD-NWADDSPLCHEILAKKLLPVVRYYSYIEITR----RAHQTLWREYEHLQSTF 396 +DG+ N SPL A V + I + L Sbjct: 251 ILLTDGESNSGVISPLDAAKTAALYGIKVYTIATISKNSRDSLDFKPDDLEQIAKLTGGK 310 Query: 397 DNFAMQHIRDQDDIYPVFRELFHKQNA 423 + + + IY L + Sbjct: 311 Y-YRAYNEAELTKIYAEIDSLEKTEFK 336 >UniRef50_A9FM70 Putative uncharacterized protein n=1 Tax=Sorangium cellulosum 'So ce 56' RepID=A9FM70_SORC5 Length = 507 Score = 94.6 bits (233), Expect = 7e-18, Method: Composition-based stats. Identities = 30/225 (13%), Positives = 51/225 (22%), Gaps = 14/225 (6%) Query: 213 RKEIAELRAKIERVPFIDTFDLRYKNYEKRPDPSSQAVMFCLMDVSGSMDQSTKDMAKRF 272 + R + + R P + A + L+D SGSM + A+ Sbjct: 86 AAGASAFALPGTRETQLGVWIDVPAARAARGQPRAPAAVVLLVDASGSMQGPKMENARAA 145 Query: 273 YILLYLFLSRTYKNVEVVYIRHH----------TQAKEVDEHEFFYSQETGGTIVSSALK 322 L + + G T + + LK Sbjct: 146 AQAFVDRLPDGDLVSVASFADTAQARVAPTVLGRSTRPAVARAIAALGPDGSTNLFAGLK 205 Query: 323 LMDEVVKERYNPAQWNIYAAQASDGD-NWADDSPLCHEILAKK-LLPVVRYYSYIEITRR 380 L ++ + SDG N SP LA++ V+ S Sbjct: 206 LAEQHALAAPSTHAVRRVV-LISDGQANIGPSSPDILGALAQRGAAHGVQITSIGVGA-D 263 Query: 381 AHQTLWREYEHLQSTFDNFAMQHIRDQDDIYPVFRELFHKQNATA 425 + S + + L A Sbjct: 264 YDERTLNALAVGSSGRLYHLTEAREMSSVLERELALLQTTAATGA 308 >UniRef50_D2A5P0 Putative uncharacterized protein GLEAN_15569 n=5 Tax=Tribolium castaneum RepID=D2A5P0_TRICA Length = 2194 Score = 94.2 bits (232), Expect = 8e-18, Method: Composition-based stats. Identities = 33/379 (8%), Positives = 83/379 (21%), Gaps = 68/379 (17%) Query: 110 EGQDEFVFQISKDEYLDLLFEDLAL-----------PNLKQNQQRQLTEYKTHRAGYTAN 158 E E +F+++ +E L L + + + + Sbjct: 855 EPSSETIFRLTYEELLQRQNGQYELIINVHPGQIVDDLCVEVKIDETRPLAFVKTPSLRT 914 Query: 159 GVPANISVVRSLQNSLARRTAMTAGKRRELHALEENLAIISNSEPAQLLEEERLRKEIAE 218 G + + + K R E+ + + Sbjct: 915 GNEISDDKPELDPCAKTEMINANSAKVRFNPDKEQQKKYAELLGSKDQGLAGQFVVQYDV 974 Query: 219 LRAKIERVPFIDTFDLRYKNYEKRPDPSSQAVMFCLMDVSGSMDQSTKDMAKRFYILLYL 278 R + + + + + ++D SGSM+ + + + Sbjct: 975 ERDPKGGEVLLRDGYFVHF-FAPSGLQTFPKHVVFVLDHSGSMEGRKYEQLMQAMDKILS 1033 Query: 279 FLSRTYKNVEVVYIRH-------------------------HTQAKEVDEHEFFYSQET- 312 L+ V + + + E + + E Sbjct: 1034 DLNPDDLFHIVRFSENVSVWNFEKNKFDQVSFLQKPEYRNLDSFLAEFNLGDAAQVTEGN 1093 Query: 313 --------------GGTIVSSALKLMDEVVKERYNP-------AQWNIYAAQASDG-DNW 350 G T + L + +V+ + +DG N Sbjct: 1094 IKKAKKIKDHDVDMGCTNIIGGLVVGLYLVRRTLQKFYEKNVETKHQPMIIFLTDGLPNE 1153 Query: 351 ADDSPLCHEILAKKLLPVVR---YYSYIEITRRAHQTLWREYEHLQSTFDNFAMQHIRDQ 407 +P + K+ +S A + ++ F + Sbjct: 1154 GISNPDKITKIVTKINQGTNRAAIFSLSFG-EDADKNFLKKLSAQNLGFSRHIY----EA 1208 Query: 408 DDIYPVFRELFHKQNATAK 426 D + + ++ Sbjct: 1209 ADAALQLQNFYRTVSSPLL 1227 Score = 91.1 bits (224), Expect = 6e-17, Method: Composition-based stats. Identities = 32/380 (8%), Positives = 84/380 (22%), Gaps = 66/380 (17%) Query: 110 EGQDEFVFQISKDEYLDLLFEDLAL------PNLKQNQQRQLTEYKTHRAGYTAN----- 158 E E +F+++ +E L L + + ++ +T + Sbjct: 98 EPSSEIIFRLTYEELLQRQNGQYELIINVHPGQIVDDLCVEVKIDETRPLTFVKTPSLCT 157 Query: 159 GVPANISVVRSLQNSLARRTAMTAGKRRELHALEENLAIISNSEPAQLLEEERLRKEIAE 218 G + + + K + E+ + + Sbjct: 158 GNEISDDKPELDPCAKTEMINANSAKVKFNPDKEQQKKYAELLGSKDQGLAGQFVVQYDV 217 Query: 219 LRAKIERVPFIDTFDLRYKNYEKRPDPSSQAVMFCLMDVSGSMDQSTKDMAKRFYILLYL 278 R + + + + + ++D S SM + + + Sbjct: 218 ERDPKGGEVLLRDGYFVHF-FAPSGLQTFPKHVVLVLDHSASMRGRKHEQLMQAMDKILS 276 Query: 279 FLSRTYKNVEVVYIRHHTQAKEVDEHEFFYSQET-------------------------- 312 L+ V + +++ +F Sbjct: 277 DLNPDDLFHVVCF-SEDVSVWNLEKKQFDLIDFMEKFDYENLDSCLTELNLGNAVQFTED 335 Query: 313 ---------------GGTIVSSALKLMDEVVKERYNPA-------QWNIYAAQASDG-DN 349 G T + L + +V+ + +DG N Sbjct: 336 NIKKAKGIKNDDMHMGCTNIIGGLVVGLFLVRRTLKKNYEQNVETKHQPMIILLTDGLPN 395 Query: 350 WADDSPLCHEILAKKLLPVVR---YYSYIEITRRAHQTLWREYEHLQSTFDNFAMQHIRD 406 P+ + K+ +S A + ++ F + Sbjct: 396 VGLSDPVEITKIVTKINQGTNRAAIFSLSFG-EDADKNFLKKLSAQNLGFSRHIYEAADA 454 Query: 407 QDDIYPVFRELFHKQNATAK 426 + +R +F + Sbjct: 455 ALQLQNFYRTVFSPLLRDVR 474 Score = 82.6 bits (202), Expect = 3e-14, Method: Composition-based stats. Identities = 28/379 (7%), Positives = 77/379 (20%), Gaps = 68/379 (17%) Query: 110 EGQDEFVFQISKDEYLDLLFEDLAL-----------PNLKQNQQRQLTEYKTHRAGYTAN 158 E E +F+++ +E L L + + + + Sbjct: 1643 EPSSETIFRLTYEELLQRQNGQYELIINVHPGQIVHDLCVEVKIDETRPLTFVKTPSLRT 1702 Query: 159 GVPANISVVRSLQNSLARRTAMTAGKRRELHALEENLAIISNSEPAQLLEEERLRKEIAE 218 G + + + K + E+ + + Sbjct: 1703 GNEISDDKPELDPCAKTEMINANSAKVKFNPDKEQQKKYAELLGSKDEGLAGQFVVQYDV 1762 Query: 219 LRAKIERVPFIDTFDLRYKNYEKRPDPSSQAVMFCLMDVSGSMDQSTKDMAKRFYILLYL 278 R + + + + + ++D S SM+ + + + Sbjct: 1763 ERDPKGGEVLLRDGYFVHF-FAPSGLQTLPKHVVFVLDYSASMEGRKHEQLMQAMDKILS 1821 Query: 279 FLSRTYKNVEVVYI----------------------------RH------------HTQA 298 L+ V + Sbjct: 1822 DLNPDDLFHIVRFSVIVSVWNFEKNRFDQIKFAQKPEYENLDSFLAEFNLGDAAQVSEDN 1881 Query: 299 KEVDEHEFFYSQETGGTIVSSAL-------KLMDEVVKERYNPAQWNIYAAQASDG-DNW 350 + + + + T + L + E E+ + +DG N Sbjct: 1882 IKKAKEIKDHDVDMDCTNIIGGLVVGLYLVRQTLEKFYEKNIETKHQPMIIFLTDGLPNV 1941 Query: 351 ADDSPLCHEILAKKLLPVVR---YYSYIEITRRAHQTLWREYEHLQSTFDNFAMQHIRDQ 407 + K+ +S A + ++ F + Sbjct: 1942 GLIIRDEITDVVTKINQGTNRAAIFSLSFG-EDADKNFLKKLSAQNLGFSRHIY----EA 1996 Query: 408 DDIYPVFRELFHKQNATAK 426 D + + ++ Sbjct: 1997 ADAALQLQNFYRTVSSPLL 2015 >UniRef50_UPI0000E47594 PREDICTED: similar to inter-alpha (globulin) inhibitor H3 variant n=1 Tax=Strongylocentrotus purpuratus RepID=UPI0000E47594 Length = 902 Score = 94.2 bits (232), Expect = 8e-18, Method: Composition-based stats. Identities = 20/217 (9%), Positives = 55/217 (25%), Gaps = 26/217 (11%) Query: 229 IDTFDLRYKNYEKRPDPSSQAVMFCLMDVSGSMDQSTKDMAKRFYILLYLFLSRTYKNVE 288 +D +++ + P + + ++DVSGSM K + + T K Sbjct: 327 LDNHFVQFFSPS--GLPVLRKNVIFIIDVSGSMAGVKLRQVKDALTTILNDMPETDKFNI 384 Query: 289 VVYIR------------HHTQAKEVDEHEFFYSQETGGTIVSSALKLMDEVVKERYNPA- 335 + + + + QE T + A+ ++++ + Sbjct: 385 IPFSDDVNFLDRNKMLFSTSSNVRRAKRFVKSLQERDNTNLHKAIIAGVRMLRDESDQNV 444 Query: 336 ----QWNIYAAQASDGDNWADDSPLCHEILA--KKLLPVVRYYSYIEITRRAHQTLWREY 389 SDG+ + + + ++ Sbjct: 445 RPDENVVSMLIVLSDGNPNHGEIDKEIIERNVEEAIRGDFSLFNLGFG-EDLDFPFLERM 503 Query: 390 EHLQSTFDNFAMQHIRDQDDIYPVFRELFHKQNATAK 426 + + I ++ D + + + Sbjct: 504 AYQNHG----VARQIPERADAGKLLENFYFEVATPLL 536 >UniRef50_UPI00006CD16B von Willebrand factor type A domain containing protein n=2 Tax=Tetrahymena thermophila RepID=UPI00006CD16B Length = 730 Score = 94.2 bits (232), Expect = 9e-18, Method: Composition-based stats. Identities = 30/250 (12%), Positives = 67/250 (26%), Gaps = 20/250 (8%) Query: 180 MTAGKRRELHALEENLAIISNSEPAQLLEEERLRKEIAELRAKIERVPFIDTF-DLRYKN 238 + + A + L ++ +E+ + FI + Sbjct: 194 KGELSKINEGYFSQEKAYLKVKYTKNNLSMLSFDQKNSEISPYCALINFIPPQISTQENL 253 Query: 239 YEKRPDPSSQAVMFCLMDVSGSMDQSTKDMAKRFYILLYLFLSRTYKNVEVVY------- 291 K D ++ ++D SGSM ++AK I L + + Sbjct: 254 LTKTTDQLIKSEFVLIIDRSGSMYGPKMELAKESLIFFLKSLPVGSIYNIISFGSTCEIM 313 Query: 292 ----IRHHTQAKEVDEHEFFYSQET-GGTIVSSALKLMDEVVKERYNPAQWNIYAAQASD 346 ++ + Q + + GGT VS AL+ + ++ +D Sbjct: 314 FDQSVQFNDQNVQNSIQQIDQFSANLGGTNVSKALEHVY---LNLFDQYGLRKKIFIITD 370 Query: 347 GDNWADDSPLCHEILAKKLLPVVRYYSYIEITRRAHQTLWREYEHLQSTFDNFAMQHIRD 406 G+ + + + + + E + F HI Sbjct: 371 GEFTDRNETQELVNAY-QNRCDINVLCIGK---DSQFQQAIEIANKTGGFTQHVKDHIDI 426 Query: 407 QDDIYPVFRE 416 + + + Sbjct: 427 ISKVILLLSQ 436 >UniRef50_A3DK47 von Willebrand factor, type A n=9 Tax=cellular organisms RepID=A3DK47_CLOTH Length = 565 Score = 94.2 bits (232), Expect = 9e-18, Method: Composition-based stats. Identities = 38/257 (14%), Positives = 73/257 (28%), Gaps = 23/257 (8%) Query: 177 RTAMTAGKRRELHALEENLAIISNSEPAQLLEEERLRKEIAELRAKIERVPFIDTFDLRY 236 A+ + + L + + +S+ +L E + L Sbjct: 321 MYAIGNLTQEKKEILNKFVEFCKSSKSQELATEYGFNRLDDYLPEISNFDGEAIMKAQ-- 378 Query: 237 KNYEKRPDPSSQAVMFCLMDVSGSMDQSTKDMAKRFYILLYLFLSRTYKNVEVVYIR--- 293 K ++++ D ++ V + DVSGSM + K+ I ++S V Y Sbjct: 379 KLWKEKKDVNNDIVAVFVADVSGSMAGEPLNRLKQSLINGSKYISSDVSIGLVSYSTDVN 438 Query: 294 -------HHTQAKEVDEHEFFYSQETGGTIVSSALKLMDEVVKERYNPAQWNI-YAAQAS 345 + + G T A+ + +++KE S Sbjct: 439 INLPIAKFDLNQRSLFVGAVESLAAGGNTATFDAIIVATKMLKEEKAKNPNAKLMLFVLS 498 Query: 346 DGDNWADDSPLCHEILAKKLLPVVRYYSYIEITRRAHQTLWREYEHLQSTFDNFAMQHIR 405 DG S + + K + Y A+ + A Sbjct: 499 DGVTNYGHSLNDIKDMMKTFGIPIYTIGY-----NANIKALETLSQINE-----AANINA 548 Query: 406 DQDDIYPVFRELFHKQN 422 D +D+ LF+ Q Sbjct: 549 DTEDVVYQLGSLFNAQM 565 >UniRef50_D0MEC0 von Willebrand factor type A n=1 Tax=Rhodothermus marinus DSM 4252 RepID=D0MEC0_RHOM4 Length = 329 Score = 94.2 bits (232), Expect = 9e-18, Method: Composition-based stats. Identities = 32/215 (14%), Positives = 54/215 (25%), Gaps = 38/215 (17%) Query: 242 RPDPSSQAVMFCLMDVSGSM-----DQSTKDMAKRFYILLYLFLSRTYKNVEVVYIRH-- 294 + ++D+S SM S ++A+R I R + VV+ Sbjct: 81 EKRTVEGRDLMLVLDLSSSMLAQDFSPSRFEVARRTAIQFVQG-RRADRIGLVVFAGQAF 139 Query: 295 ----HTQAKEVDEHEFFYSQET---GGTIVSSALKLMDEVVKERYNPAQWNIYAAQASDG 347 T Q GT + +A+ +K + +DG Sbjct: 140 TQVPPTLDYRFLLTMLQRLQVGRLEDGTAIGTAIATAINRLKNS---EARSKVIILLTDG 196 Query: 348 DNW-ADDSPLCHEILAKKLLPVVRYYSYIEITRRA------------------HQTLWRE 388 N + PL LA++ + + + RE Sbjct: 197 QNNRGEIDPLTAAELARQAGIRIYTIGLSGRGEAPYPVQTPFGTRPQPVPVEIDEAMMRE 256 Query: 389 YEHLQSTFDNFAMQHIRDQDDIYPVFRELFHKQNA 423 F R + IY L A Sbjct: 257 VAEKTGGRY-FRATDARTLEAIYAEIDRLEKSPVA 290 >UniRef50_A7C4W6 von Willebrand factor, type A n=1 Tax=Beggiatoa sp. PS RepID=A7C4W6_9GAMM Length = 305 Score = 93.8 bits (231), Expect = 1e-17, Method: Composition-based stats. Identities = 33/269 (12%), Positives = 72/269 (26%), Gaps = 40/269 (14%) Query: 183 GKRRELHALEENLAIISNSEPAQLLEEERLRKEIAELRAKIERVPFIDTFDLRYKNYEKR 242 A + + + + Q + R + L K + Sbjct: 48 ITPAHYEAAQFFIDYLLDKSQQQKALQYGFRPANVSVPLASPIDIAHGVNPLAPKMILEM 107 Query: 243 P-------------DPSSQAVMFCLMDVSGSMDQSTKDMAKRFYILLYLFLSRTYKNVEV 289 P A + ++D SG M A+ + L + + Sbjct: 108 PTVDMMEAIIKIWHQYKKPANIVLVLDTSGGMRGEKILHARTMALQLLEIVKEADYFSLL 167 Query: 290 VYIRHHTQ---------AKEVDEHEFFYSQETGGTIVSSALKLMDEVVKERYNPAQWNIY 340 + ++ + +F Y GGT + A+ +++ P + + Sbjct: 168 SFNHSLNWIAKNIQVKSQQKWLKRQFNYQFPGGGTALYDAIFNAYTFLQKNSFPDKIAVM 227 Query: 341 AAQASDGDNWADDSPLCHEILAKKLL-----PVVRYYSYIEITRRAHQTLWREYEHLQST 395 SDG + S L + L K+ +R ++ + + E + Sbjct: 228 IVL-SDGGDSH--SELNFKDLLSKIPFNSDTSPIRIFAVGYGSIT-DKKRLNEIAKMTQG 283 Query: 396 FDNFAMQHIRDQDDIYPVFRELFHKQNAT 424 D ++F K+ A+ Sbjct: 284 KFY---------DGAMVDVDKIFKKEMAS 303 >UniRef50_D1CCX6 von Willebrand factor type A n=1 Tax=Thermobaculum terrenum ATCC BAA-798 RepID=D1CCX6_THET1 Length = 918 Score = 93.8 bits (231), Expect = 1e-17, Method: Composition-based stats. Identities = 49/401 (12%), Positives = 94/401 (23%), Gaps = 44/401 (10%) Query: 50 ESVSIPTEDISEPMFHQGRGGLRHRVHPGNDHFVQNDRIERPQGGGGGSGSGQG--QASQ 107 +SV P F + D+ +G Sbjct: 198 DSVKAPARIKPGQSFSLQIVVSSTVQQRAKLTVLDGDKSVVSTDVSLKAGKNTFVVPLPG 257 Query: 108 DGEGQDEFVF--QISKDEYLDLLFEDLALPNLKQNQQRQLTEYKTHRAGYTANGVPANIS 165 G+G ++ + ++DE ++ +G A + Sbjct: 258 QGQGVHKWEARIEAAQDEIPQNDRAASFTYVESPSRVLVAEGTPGEASGLVAALKAGKLV 317 Query: 166 VVRSLQNSLARRTAMTAGKRRELHALEENLAII-----SNSEPAQLLEEERLRKEIAELR 220 V N + + + T K + + + + L + + Sbjct: 318 VDTVDSNDIPKDIS-TLAKYDAVVLVNVPANSLQDAGKTLQVYVHDLGKGLVAIGGDRAF 376 Query: 221 AKIERVPFIDTFDLRYKNYEKRPDPSSQAVMFCLMDVSGSMDQ----------------S 264 A L + + PD Q + +D SGSM Sbjct: 377 ALGGYFNTPLEQTLPVDSQIRNPDEEPQVAVVMAIDKSGSMAACHCEGSKLLEQYPGGIP 436 Query: 265 TKDMAKRFYILLYLFLSRTYKNVEVVYIRHHTQ--------AKEVDEHEFFYSQETGGTI 316 D+AK IL L V + K + Q +GGT Sbjct: 437 KVDIAKESAILSSETLGPNDIFGVVAFDTAPRWVVRPEPVTDKSSIAEKVAGIQGSGGTN 496 Query: 317 VSSALKLMDEVVKERYNPAQWNIYAAQASDG-DNWADDSPLCHEILAKKLLPVVRYYSYI 375 + L + + + N + +DG N + L ++K + + Sbjct: 497 IYGGLAEAIDSLIKVKAK---NKHVILLTDGWSNVGNYDEL----ISKARRHGITISTVS 549 Query: 376 EITRRAHQTLWREYEHLQSTFDNFAMQHIRDQDDIYPVFRE 416 A L R + ++ Sbjct: 550 AAGGSA--QLLRSIAEKGGGTFYNTRDSADIPQIVLKETQK 588 Score = 60.3 bits (144), Expect = 1e-07, Method: Composition-based stats. Identities = 27/131 (20%), Positives = 42/131 (32%), Gaps = 10/131 (7%) Query: 243 PDPSSQAVMFCLMDVSGSMDQSTKDMAKRFYILLYLFLSRTYKNVEVVYIRHHTQAKEVD 302 PS + + L+D S S+ AK F Y R VV+ + Sbjct: 58 RLPSHKLGVVFLVDASDSVGPEGIAQAKEFVRKAYQLAGRDVDLGVVVFGKEPLIDSLTS 117 Query: 303 EH----EFFYSQETGGTIVSSALKLMDEVVKERYNPAQWNIYAAQASDGDNWADDSPLCH 358 +F ++ T + SA++L + PA + SDG N Sbjct: 118 SDGKLPDFLSRPDSTATDIPSAMRLAFSMF-----PADSSKKIVLLSDG-NNNVGDMQEV 171 Query: 359 EILAKKLLPVV 369 LA+ V Sbjct: 172 SRLARMFGVTV 182 >UniRef50_B7XGM9 Putative uncharacterized protein n=2 Tax=Pseudomonas RepID=B7XGM9_PSEPU Length = 604 Score = 93.8 bits (231), Expect = 1e-17, Method: Composition-based stats. Identities = 51/395 (12%), Positives = 102/395 (25%), Gaps = 38/395 (9%) Query: 15 SMVNRQRFLRRYKAQIKQSISEAINKRSVTDVDSGESVSIPTEDISEPMFHQGRGGLRHR 74 S + Q + Q + + + D ++ E + Sbjct: 201 SQEDEQEDEQPQPEQGSEGAPQDSQESGGGDQQQEQNQGGGAGAGDEGKEEGQPESKQET 260 Query: 75 VHPGNDHFVQNDRIERPQGGGGGSGSGQGQASQDGEGQDEFVFQISKDEYLDLLFEDLA- 133 G D+ + G G + S +G G+ +++ D DL Sbjct: 261 SDGGQSGSEPQDQSQPSSGSNAGDDGQETSQSSNGSGKPSPEASNLREQAKDATEADLKG 320 Query: 134 -LPNLKQNQQRQLTEYKTHRAGYTANGVPANISVVRSLQNSLARRTAMTAGKRRELHALE 192 + + L RS Q S+ R + Sbjct: 321 LISEVGDKAGELLGRKAVRDGNPIRPFSLDGRGRNRSDQASVRR-----------VQLGI 369 Query: 193 ENLAIISNSEPAQLLEEERLRKEIAELRAKIERVPFIDTFDLRYKNYEKRPD-PSSQAVM 251 E A + L + R + +I+ + + + A + Sbjct: 370 EQSAGLRQCLNGLLQAQVDCRVRLKRQGKRIDTGRIAMMKGGETRVFRSKSRAERQSASI 429 Query: 252 FCLMDVSGSMDQSTKDMAKRFYILLYLFL---------------SRTYKNVEVVYIRHHT 296 L+D SGSM + D A+ + L I+ Sbjct: 430 QILLDKSGSMKSA-MDQAEAAVYAVLSALEGLPLVTTGAMSFPNKANDGVERCALIKSPK 488 Query: 297 -QAKEVDEHEFFYSQETGGTIVSSALK-LMDEVVKERYNPAQWNIYAAQASDGDNWADDS 354 + + F + GGT ++ AL EV++ +DG+ A + Sbjct: 489 ERLIKAVSEGGFGAMSEGGTPLAQALWPAAVEVLRA----KGEKKILFVITDGEPNAGTT 544 Query: 355 PLCHEILAKKLLPVVRYYSYIEITRRAHQTLWREY 389 E L + + + A++ + + Sbjct: 545 HAAKEFLQRCEVSGIEVIGLGFG--SANEHILKAL 577 >UniRef50_C0E6Z8 Putative uncharacterized protein n=2 Tax=Corynebacterium matruchotii RepID=C0E6Z8_9CORY Length = 1107 Score = 93.8 bits (231), Expect = 1e-17, Method: Composition-based stats. Identities = 44/344 (12%), Positives = 90/344 (26%), Gaps = 35/344 (10%) Query: 69 GGLRHRVHPGNDHFVQNDRIERPQGGGGGSGSGQGQAS---QDGEGQDEFVFQISKDEYL 125 ++ + + + G G G + Q+E D+ Sbjct: 760 SAGGRMASALDELYGADTVGKDHAGESDGRVHAAGNGPSQLGVRQWQEEITALFGADQLQ 819 Query: 126 DLLFEDLALP------NLKQNQQRQLTEYKTHRAGYTANGVPANISVVRSLQNSLARRTA 179 ++ + + +L R E T + + +R L + + + Sbjct: 820 EIFGKAADMGRSDVITSLDAESVRPSVELLTTVLNLKGALPESRLRQLRPLVSKIVSELS 879 Query: 180 MTAGKRRELHALEENLAIISNSEPAQLLEEERLRKEIAELRAKIERVPFIDTFDLRYKNY 239 + S + +L +R + P I ++ Sbjct: 880 KELASQLSPALGGMANTKPSRRKSPRLDLPATVRNNLKHTVMV-NSRPQIIPVTPIFRAP 938 Query: 240 EKRPDPSSQAVMFCLMDVSGSMDQSTKDMAKRFYILLYLFLSRTYKNVEVVYIRHHTQAK 299 S + L+DVSGSM+ ST A + +V +I T Sbjct: 939 ---ERKVSPWHIIVLVDVSGSMEPSTVFAAMTA------GILAGVNTFKVSFITFDTSVI 989 Query: 300 EVDEHEFF------YSQETGGTIVSSALKLMDEVVKERYNPAQWNIYAAQASDGDNWADD 353 ++ H + GGT ++ ++ V SD + W Sbjct: 990 DLTGHVEDPLELLLEIKVGGGTNIAQGVRYAASQV-----TNPTKTILVLISDFEEWGSV 1044 Query: 354 SPLCHEILAKKLLPVVRYYSYI----EITRRAHQTLWREYEHLQ 393 + L HE +A V+ + + + Sbjct: 1045 NNLTHE-IAALADSGVKLIGCAALDDDGKAAYNVGIAESVAAAG 1087 >UniRef50_D2RGD5 von Willebrand factor type A n=1 Tax=Archaeoglobus profundus DSM 5631 RepID=D2RGD5_ARCPR Length = 411 Score = 93.8 bits (231), Expect = 1e-17, Method: Composition-based stats. Identities = 46/314 (14%), Positives = 106/314 (33%), Gaps = 35/314 (11%) Query: 118 QISKDEYLDLLFEDLALPNLKQNQQRQLTEYKTHRAGYTANGVPANISVVRSLQNSLARR 177 +S E ++ +E++ + +LK+ + + G + R+L + + Sbjct: 114 DLSTSELVNYFYEEI-IEDLKKEGYLEDDYF---------RGYKFTKNAERALSKKI-LQ 162 Query: 178 TAMTAGKRRELHALEENLAIISNSEPAQLLEEERLRKEIAELRAKIERVP-FIDTFDLRY 236 ++ + E +S+ +++E + LR + + V + LR+ Sbjct: 163 LSLQDLTGEDFGEHETEKTGVSSFLKNEIVEYDELRHSYDSIDLQETLVKCALRDPSLRF 222 Query: 237 KNYEKRPDPSSQAV---MFCLMDVSGSMDQSTKDMAKRFYILLYLFLSRTYKNVEVVYIR 293 + L+DVS SM A + L + ++ E+ + Sbjct: 223 DERDLVAREGKHMEKCVYVMLIDVSDSMRGRRIVGALESALALRKVIKKS-NMDELHVVA 281 Query: 294 HHTQAKEVDEHEFFYSQETGGTIVSSALKLMDEVVKERYNPAQWNIYAAQASDGDNWADD 353 + + +++ + E + G T + ALK E++K+R +DG+ + Sbjct: 282 FNHRVRKIKDEEILNLRTRGRTDIGLALKTAREIIKKRRGSG----VIFLITDGEPTSSY 337 Query: 354 SP-----LCHEILAKKLLPVVRYYSYIEITRRAHQ-TLWREYEHLQS-TFDNFAMQHIRD 406 P +C A+KL V + + ++ L L + + Sbjct: 338 DPYLTPTMCALREAEKLRKVDANLTIVMLSPEKRFLALCERIAKLSRKANLVYIENPLN- 396 Query: 407 QDDIYPVFRELFHK 420 ++ F K Sbjct: 397 -------MKKFFVK 403 >UniRef50_C7PW75 Vault protein inter-alpha-trypsin domain protein n=1 Tax=Catenulispora acidiphila DSM 44928 RepID=C7PW75_CATAD Length = 1033 Score = 93.8 bits (231), Expect = 1e-17, Method: Composition-based stats. Identities = 50/422 (11%), Positives = 97/422 (22%), Gaps = 32/422 (7%) Query: 17 VNRQRFLRRYKAQIKQSISEAINKRSVTDVDSGESVSIPTEDISEPMFHQGRGGLRHRVH 76 R+ Y A I +I + V + +I + Sbjct: 97 KEREAARADYDAAISAGQRASIAEEERPGVFTLRVGNIMPGERVVIRTSLSGRLPYEDGQ 156 Query: 77 PGNDHFVQNDRIERPQGGGGGSGSGQGQASQDGEGQDEFVFQISKDEYLDLLFEDLALPN 136 + P G G G AS + D L Sbjct: 157 ATFRFPLVVAPRYIPGADLPGEQVGSGTASDTDQVPDASRISPPI------------LLP 204 Query: 137 LKQNQQRQLTEYKTHRAGYTANGVPANISVVRSLQNSLARRTAMTAGKRRELHALEENLA 196 N R E G G+ +++ V ++ +R R L Sbjct: 205 GFPNPVRLSIEVAVDPVGLPLAGLTSSLHGVSVEESEGSRYLVRLNPGARADRDFVLRLG 264 Query: 197 IISNSEPAQLLEEERLRKEIAELRAKIERVPFIDTFDLRYKNYEKRPDPSSQAVMFCLMD 256 + L + + P L + + + ++D Sbjct: 265 YGGSGAATSLAVAWDSESANEVAKESAKATPTDIGTFLLTVLPPEPTGATRPRDVALILD 324 Query: 257 VSGSMDQSTKDMAKRFYILLYLFLSRTYKNVEVVYIRH-------HTQAKEVDEHEFFY- 308 SGSM A+R + L+ + + + T E + F Sbjct: 325 RSGSMGGWKMTAARRAAARIVDTLTAEDRFAVLTFDDQMETPDGLPTGLSEATDRHRFRA 384 Query: 309 ------SQETGGTIVSSALKLMDEVVKERYNPAQWNIYAAQASDGDNWADDSPLCHEILA 362 GGT + L+ ++ + + + +DG + Sbjct: 385 VQHLATVDARGGTEMEPPLRRAATLLSD--DNPDRDRVLILITDGQ---VGNEDRLLTTL 439 Query: 363 KKLLPVVRYYSYIEITRRAHQTLWREYEHLQSTFDNFAMQHIRDQDDIYPVFRELFHKQN 422 L +R ++ T + + L R D + + + Sbjct: 440 SPKLTHIRVHTVGIDT-AVNAAFLQRLSTLGGGHCELVESEDRLDDAMDAIHHRIATPLV 498 Query: 423 AT 424 Sbjct: 499 TG 500 >UniRef50_A6FSG0 Putative uncharacterized protein n=1 Tax=Roseobacter sp. AzwK-3b RepID=A6FSG0_9RHOB Length = 444 Score = 93.8 bits (231), Expect = 1e-17, Method: Composition-based stats. Identities = 24/188 (12%), Positives = 41/188 (21%), Gaps = 12/188 (6%) Query: 238 NYEKRPDPSSQAVMFCLMDVSGSMDQSTKDMAKRFYILLYLFLSRTYKNVEVV------- 290 +P + ++D S SM AKR + L + + V Sbjct: 32 APVTETEPRPPLNLALVLDRSSSMRGQPLHEAKRAADQIVAGLRPSDRLAIVAFDNATEV 91 Query: 291 -YIRHHTQAKEVDEHEFFYSQETGGTIVSSALKLMDEVVKERYNPAQWNIYAAQASDG-D 348 + + G T + L E A SDG Sbjct: 92 MFSGGPRGDGQAARAALSRIHARGMTALHDGWLLGVEQ-SIAMREAGTPARVFLLSDGVA 150 Query: 349 NWADDSPLCHEILAKKLL-PVVRYYSYIEITRRAHQTLWREYEHLQSTFDNFAMQHIRDQ 407 N ++ + + ++ L E + Q Sbjct: 151 NVGLTDASAIAADCTRMAEHGITTSTCGLGMG-FNEDLMAEMARAGRGNAYYGETAEDLQ 209 Query: 408 DDIYPVFR 415 D F Sbjct: 210 DPFEQEFD 217 >UniRef50_D2W4Q3 Predicted protein n=1 Tax=Naegleria gruberi RepID=D2W4Q3_NAEGR Length = 454 Score = 93.8 bits (231), Expect = 1e-17, Method: Composition-based stats. Identities = 31/200 (15%), Positives = 59/200 (29%), Gaps = 26/200 (13%) Query: 234 LRYKNYEKRPDPSSQAVMFCLMDVSGSMDQSTKDMAKRFYILLYLFLSRTYKNVEVVYIR 293 L++ Q + +DVSGSM D AK L+ + T V + Y Sbjct: 26 LQFDLISNIQRKEKQ--IVIALDVSGSMRGQGIDQAKIAISNLFEQVVDTPDVVLITYDT 83 Query: 294 HH------TQAKEVDEHEFFYSQETGGTIVSSALKLM--DEVVKERYNPAQWNIYAAQAS 345 + E + Q GGT + + + ++ + + + Sbjct: 84 SAELYDLRKKPAETRQSTLEQIQAGGGTDFTCVFEAISNLDMFNRQSE-----VAILFFT 138 Query: 346 DGDNWADDSPLCHEILAKKLLPVVR----YYSYIEITRRAHQTLWREYEHLQ--STFDNF 399 DG + + KK+L +++ T L + L + Sbjct: 139 DGQDGSSHKREKAIEQMKKVLETKTQSFEFHTIGF-TSSHDVALLTQITQLGSVQGTFQY 197 Query: 400 AMQHIRDQDDIYPVFRELFH 419 +D ++I L Sbjct: 198 V----KDANEINQSMENLIG 213 >UniRef50_A7RFL6 Predicted protein n=1 Tax=Nematostella vectensis RepID=A7RFL6_NEMVE Length = 981 Score = 93.8 bits (231), Expect = 1e-17, Method: Composition-based stats. Identities = 24/225 (10%), Positives = 54/225 (24%), Gaps = 50/225 (22%) Query: 232 FDLRYKNYEKRPDPSSQAVMFCLMDVSGSMDQS-TKDMAKRFYILLYLFLSRTYKNVEVV 290 FD R++++ + + ++D S SM + +A++ + + L K V Sbjct: 199 FDPRFQSWYVEAVTRMRTNIVVVIDRSSSMSTAGRMALARQAAVTVLDTLGPNDKVGVVA 258 Query: 291 YIR--------HHTQAKEVDEHEFFYSQET-----------------------GGTIVSS 319 + E + G T Sbjct: 259 FSHFIIKPPGCFGGNVAEALPKNINRIKAWVEALTPRGKVSLQKTNLRYVSFPGATKYVP 318 Query: 320 ALKLMDEVVKERYN------------PAQWNIYAAQASDGDNWADDSPLCHEILAK---- 363 AL+ E++ +N +DGD + + + + Sbjct: 319 ALEAAFEMLGGDFNIKILHHPLIALIKRSAENMILFLTDGDPFDRNPDVSIFEAIRIGQR 378 Query: 364 --KLLPVVRYYSYIEITRRAHQTLWREYEHLQSTFDNFAMQHIRD 406 + Y E + ++ L + Sbjct: 379 KLAFPARINVYGLGESLNIDNLNRLKQIASLNNGTFTQINDQDAS 423 >UniRef50_Q4RX89 Chromosome 11 SCAF14979, whole genome shotgun sequence. (Fragment) n=1 Tax=Tetraodon nigroviridis RepID=Q4RX89_TETNG Length = 1160 Score = 93.4 bits (230), Expect = 1e-17, Method: Composition-based stats. Identities = 22/189 (11%), Positives = 49/189 (25%), Gaps = 22/189 (11%) Query: 257 VSGSMDQSTKDMAKRFYILLYLFLSRTYKNVEVVY--------------IRHHTQAKEVD 302 +S + K + + LS + ++ + + K++ Sbjct: 169 ISEGHGHHRGRLMKTSVMEMLDTLSDDDYVNVARFNEKADAVVPCFRTLVQANVRNKKIF 228 Query: 303 EHEFFYSQETGGTIVSSALKLMDEVV--KERYNPAQWNIYAAQASDGDNWADDSPLCHEI 360 + + Q G T S E + + A N +DG +D Sbjct: 229 KEAVMHMQAKGTTDYKSGFTFAFEQLLNESSAPRANCNKMIMMFTDG---GEDRAQEIFE 285 Query: 361 LAKKLLPVVRYYSYIEITRRAHQTLWREYEHLQSTFDNFAMQHIRDQDDIYPVFRELFHK 420 VR +++ T + F + I + ++ + Sbjct: 286 KYNWPNKTVRVFTFSVGQHNYDVTPLQWIACSNKG-YYFEIPSIGAIRINTQEYLDVLGR 344 Query: 421 QN--ATAKG 427 A +K Sbjct: 345 PMVLAGSKA 353 >UniRef50_B5EUF0 von Willebrand factor, type A n=44 Tax=Vibrionaceae RepID=B5EUF0_VIBFM Length = 321 Score = 93.4 bits (230), Expect = 1e-17, Method: Composition-based stats. Identities = 30/214 (14%), Positives = 53/214 (24%), Gaps = 41/214 (19%) Query: 240 EKRPDPSSQAVMFCLMDVSGSMDQSTKD-----------MAKRFYILLYLFLSRTYKNVE 288 + M ++D+SGSM + K+ + + Sbjct: 74 DPVDIQPEHRDMMLVVDLSGSMAEEDMKTSNGDFVDRLTAVKQVVSDFIDQ-RKGDRLGL 132 Query: 289 VVYIRHH------TQAKEVDEHEFFYSQ---ETGGTIVSSALKLMDEVVKERYNPAQWNI 339 V++ H T + + + T + L L + E P Sbjct: 133 VLFGDHAYLQTPLTFDRNTVREQLDRTVLRLVGQMTAMGEGLGLATKTFIESNAP---QR 189 Query: 340 YAAQASDGDNWADD-SPLCHEILAKKLLPVVRYYSYIEI---------------TRRAHQ 383 SDG N A PL LAK + R + Sbjct: 190 TIILLSDGANTAGVLEPLEAAQLAKDNHAKIYTVGIGAGEMQVRGFFGKQTVNTARDLDE 249 Query: 384 TLWREYEHLQSTFDNFAMQHIRDQDDIYPVFREL 417 + + F ++ + +IY L Sbjct: 250 DTLTKIATMTGGQY-FRARNADELAEIYQTIDAL 282 >UniRef50_C5FLY1 U-box domain containing protein n=1 Tax=Microsporum canis CBS 113480 RepID=C5FLY1_NANOT Length = 748 Score = 93.4 bits (230), Expect = 1e-17, Method: Composition-based stats. Identities = 23/205 (11%), Positives = 48/205 (23%), Gaps = 40/205 (19%) Query: 234 LRYKNYEKRPDPSSQAVMFCLMDVSGSMDQ----------------STKDMAKRFYILLY 277 ++ K P + ++D+S SM+ S D+ K + Sbjct: 55 IQPPLKPKDDVPHVPCDIVLVIDISASMNSAAPIPTGESGGEDTGLSILDLTKHAAKTII 114 Query: 278 LFLSRTYKNVEVVYIRH----------HTQAKEVDEHEFFYSQETGGTIVSSALKLMDEV 327 L+ + V + + K T + +K +V Sbjct: 115 QTLNENDRLAVVTFCTEIRVAFELEFMSEENKSKVLAAIDCLHGISSTNLWHGIKEGLKV 174 Query: 328 VKERYNPAQWNIYAAQASDGDNWA-----DDSPLCHEILAKKL-----LPVVRYYSYIEI 377 + +DG P + L LP++ + + Sbjct: 175 LATNSTQGNVQ-ALLVLTDGAPNHMCPAQGYVPKLRQTLLDHRDLTGSLPLIHTFGFGY- 232 Query: 378 TRRAHQTLWREYEHLQSTFDNFAMQ 402 L + + F Sbjct: 233 --YLRSPLLQSIAEIGGGTFAFIPD 255 >UniRef50_A6VW30 von Willebrand factor type A n=2 Tax=Marinomonas RepID=A6VW30_MARMS Length = 342 Score = 93.4 bits (230), Expect = 2e-17, Method: Composition-based stats. Identities = 31/249 (12%), Positives = 57/249 (22%), Gaps = 40/249 (16%) Query: 204 AQLLEEERLRKEIAELRAKIERVPFIDTFDLRYKNYEKRPDPSSQAVMFCLMDVSGSMDQ 263 Q + R + + + I E S + +D+SGSM Sbjct: 43 HQDGKAIPKRPTLRYACLLVSWILLIFAMTQPVWLGEPTKVTPSGRDLLIALDLSGSMQV 102 Query: 264 STK----------DMAKRFYILLYLFLSRTYKNVEVVY---------IRHHTQAKEVDEH 304 + + AK R + +V+ + T+ Sbjct: 103 TDMALNGQPANRLEAAKSVLSDFIQE-RRGDRIGIIVFGSKAYLQAPLSFDTKTINQLVQ 161 Query: 305 EFFYSQETGGTIVSSALKLMDEVVKERYNPAQWNIYAAQASDGDNW-ADDSPLCHEILAK 363 E T + A+ L + ++ + +DG N P A Sbjct: 162 EAQIGFAGEQTAIGDAIGLGIKRLE---DKPSDKKVLILMTDGANTAGRVQPQQAATFAA 218 Query: 364 KLLPVVRYYSYIEI---------------TRRAHQTLWREYEHLQSTFDNFAMQHIRDQD 408 + + +TL + F + D Sbjct: 219 SQNVKIHTIGIGADSMIVQSFFGPKAINPSSDLDETLLKNIAAQTGGEY-FRAKSTEDLQ 277 Query: 409 DIYPVFREL 417 IY L Sbjct: 278 AIYQTLDAL 286 >UniRef50_D2VY88 Predicted protein n=1 Tax=Naegleria gruberi RepID=D2VY88_NAEGR Length = 1082 Score = 93.4 bits (230), Expect = 2e-17, Method: Composition-based stats. Identities = 35/231 (15%), Positives = 77/231 (33%), Gaps = 23/231 (9%) Query: 199 SNSEPAQLLEEERLRKEIAELRAKIERVPFIDTFDLRYKNYEKRPDPSSQAVMFCLMDVS 258 SN + Q+ E+ R E+ L + + + + P P+ L DVS Sbjct: 82 SNQKVEQVEEKSLFRIEVEHLIDLDDHDIGVAELTIHVNDPSLNPKPTL---FIALADVS 138 Query: 259 GSMDQSTKDMAKRFYILLYLFLSRTYKNVEVVYIRHHTQAKEVDEH-----EFFYSQE-- 311 GSM + S + + + + AKE+D + Sbjct: 139 GSMQGRPWEQVCTSLKHFAQQ-SFNNPAIICRMVAYESSAKEIDMKGTLQSIIRNIETAF 197 Query: 312 -TGGTIVSSALKLMDEVVKERYNPAQW-----NIYAAQASDGDNWADDS-PLCHEILAKK 364 GGT +SA +L ++ + N+ +DG++++ P + L+++ Sbjct: 198 TGGGTDFASAFQLACTIITRESGQDRENLPFGNVVITFLTDGEDFSKVGKPGGLQYLSEE 257 Query: 365 L----LPVVRYYSYIEITRRAHQTLWREYEHLQSTFDNFAMQHIRDQDDIY 411 + + ++ + L + + + + D +D+ Sbjct: 258 INRVYRGDITIHTVGFG-SHHNLELLDNIRKVGTIEGAYRYANYDDNNDVI 307 >UniRef50_B0VJ57 Putative uncharacterized protein n=1 Tax=Candidatus Cloacamonas acidaminovorans RepID=B0VJ57_9BACT Length = 331 Score = 93.4 bits (230), Expect = 2e-17, Method: Composition-based stats. Identities = 30/214 (14%), Positives = 65/214 (30%), Gaps = 36/214 (16%) Query: 238 NYEKRPDPSSQAVMFCLMDVSGSMDQSTKD--MAKRFYILLYLFLSR--TYKNVEVVYIR 293 +YE + SS + +DVS SMD + R + + FL + T + + + Sbjct: 78 DYENKELQSSGMDIIFALDVSKSMDATDMMPSRLLRAILQIGSFLEQVKTDRIGIIAFAG 137 Query: 294 HHTQAKEVDEHEFF-YSQETG---------GTIVSSALKLMDEVVKERYNPAQWNIYAAQ 343 T + + G GT + SAL+L + + + Sbjct: 138 TATLQCPLTDDYEAVRIVLNGLNSNTVEIPGTDIGSALRLA----ENAFPEGSKSKTLVL 193 Query: 344 ASDGDNWADDSPLCHEILAKKLLPVVRYYSYI--------------EITRRAHQTLWREY 389 SDG++ + + K V E+ + + +E Sbjct: 194 ISDGEDLQHSALREA-RILKTKGIRVYTMGVGSPEGTIIRHPETGEEVKSKLDEATLQEI 252 Query: 390 EHLQSTFDNFAMQHIRDQDDIYPVFRELFHKQNA 423 + ++I + + ++ ++ Sbjct: 253 ARITEGEYYRVTPG---GEEIQLILKRIYESEST 283 >UniRef50_Q6L2C8 Putative uncharacterized protein n=1 Tax=Picrophilus torridus RepID=Q6L2C8_PICTO Length = 379 Score = 93.0 bits (229), Expect = 2e-17, Method: Composition-based stats. Identities = 24/217 (11%), Positives = 58/217 (26%), Gaps = 16/217 (7%) Query: 213 RKEIAELRAKIERVPFIDTFDLRYKNYEKRPDPSSQAVMFCLMDVSGSMDQSTKDMAKRF 272 + + + + Y ++ +S +DVS SM + D+AK Sbjct: 2 SISMRLEYSHGKSTTKDVMTVFKVVLYPEKTVKASGFHYIIAIDVSNSMRKGKLDLAKEG 61 Query: 273 YILLYLFLSRTYKNVEVVYIRHHTQAKE-----VDEHEFFYSQETGGTIVSSALKLMDEV 327 + L + R + + E + G T + +AL ++ Sbjct: 62 AMNLIEKIPRDNIVSLIAFGDTAKVIVEGKEPTFALEAIPSLKVAGNTAMYTALLTATKL 121 Query: 328 VKERYNPAQWNIYAAQASDGDNWADDSPLCHEILAKKLLPVVRYYSYIEITRRAHQTLWR 387 + P + +DG +E L ++ L + Sbjct: 122 ADKYNMPGR----IILLTDGMPTDVSMNESYENL--QVPEGFTIDCIGIG-DNYRDDLLK 174 Query: 388 EYEHLQSTFDNFAMQHIRDQDDIYPVFRELFHKQNAT 424 ++ + + +++ V + Sbjct: 175 LLADKGNSIFYH----LENPEELPKVMESTVSSDISA 207 >UniRef50_B3QUN4 von Willebrand factor type A n=4 Tax=Bacteria RepID=B3QUN4_CHLT3 Length = 340 Score = 93.0 bits (229), Expect = 2e-17, Method: Composition-based stats. Identities = 26/213 (12%), Positives = 50/213 (23%), Gaps = 41/213 (19%) Query: 246 SSQAVMFCLMDVSGSMDQST------KDMAKRFYILLYLFLSRTYKNVEVVYIRHH---- 295 S + +D+SGSM + AK + + VV+ Sbjct: 95 SEGIDIVLAIDLSGSMLAEDFEPKNRIEAAKSVATDFIHQ-RLSDRIGLVVFSGKSFTQC 153 Query: 296 --TQAKEVDEHEFFYSQET----GGTIVSSALKLMDEVVKERYNPAQWNIYAAQASDGDN 349 T + + + GT + +A+ ++E + +DG N Sbjct: 154 PLTLDYRLLTNFISELKAGTIEEDGTAIGTAIATATNRLRESTAK---SKVIILLTDGQN 210 Query: 350 W-ADDSPLCHEILAKKLLPVVRYYSYIEITR-------------------RAHQTLWREY 389 + P+ LA L + + Sbjct: 211 NAGEIEPVTAAELAAALGIKIYTVGAGTRGYARYPIPDPLFGKRYVQMKVDVDDSTLTRI 270 Query: 390 EHLQSTFDNFAMQHIRDQDDIYPVFRELFHKQN 422 + F + Y EL + Sbjct: 271 ARISGGRY-FRATDLESLKKTYHEIDELEKTKV 302 >UniRef50_UPI000178810F von Willebrand factor type A n=1 Tax=Geobacillus sp. Y412MC10 RepID=UPI000178810F Length = 421 Score = 93.0 bits (229), Expect = 2e-17, Method: Composition-based stats. Identities = 26/185 (14%), Positives = 59/185 (31%), Gaps = 21/185 (11%) Query: 250 VMFCLMDVSGSMDQSTKDMAK-RFYILLYLFLSRTYKNVEVVYIRHHTQAKEVDE----H 304 + ++D SGSM+++ + + L + R + +V+ T + Sbjct: 114 DIVLVIDNSGSMNETDPNQDRYTAAKNLINRMDRDNRVSVMVFDHATTLLQPFTRVKNQE 173 Query: 305 EFFYSQE--------TGGTIVSSALKLMDEVVKERYNPAQWNIYAAQASDGDNWADDSPL 356 GGT +S AL+ ++E + A + SDG +++ Sbjct: 174 TKDEIIAEIDGLATNDGGTDISLALEDTMSHIQESRD-AGRSAMVIMLSDG--FSETDHD 230 Query: 357 CHEILAKKLLPVVRYYSYIEITRRAHQTLWREYEHLQSTFDNFAMQHIRDQDDIYPVFRE 416 K+ V + L + ++ +D+ VF++ Sbjct: 231 RVLAEYKQQQIAVNTIGLSLVNPD-GAQLLQTIAAETGGQYY----DVQHAEDLSFVFQK 285 Query: 417 LFHKQ 421 ++ Sbjct: 286 IYDDV 290 >UniRef50_C0ZKA0 Putative uncharacterized protein n=2 Tax=Bacteria RepID=C0ZKA0_BREBN Length = 477 Score = 93.0 bits (229), Expect = 2e-17, Method: Composition-based stats. Identities = 45/368 (12%), Positives = 93/368 (25%), Gaps = 59/368 (16%) Query: 75 VHPGNDHFVQNDRIERPQGGGGGSGSGQGQASQDGEGQDEFVFQISKDEYLDLL--FEDL 132 P + P D G +++++E + L Sbjct: 22 QEPDTSQNAEGQNPSAPP---STETPPTQSQPPDQTGD--PNAEMTQEEKVKALEAMAQE 76 Query: 133 ALPNLKQNQQRQLTEYKTHRAGYTANGVPANISVVRSLQNSLARRTAMTAGKRRELHALE 192 +P ++ + + +G + + + + + L + + E Sbjct: 77 GMPLKRETTEDFVNSPPGRFSGVSYD------NNREEVLSELKKFPTVEKP-------DE 123 Query: 193 ENLAIISNSEPAQLLEEERLRKEIAELRAKIERVPFIDTFDLRYKNYEKRPDPSSQAVMF 252 E + L + + + I+ + + + + S + Sbjct: 124 EMMN------KYYLALLGLFAQSYPDPQQIIDELKMASFGNPDIDDPRFKFKESYNVEII 177 Query: 253 CLMDVSGSMDQS-----TKDMAKRFYILLYLFLSRTYKNVEVVYI--------------- 292 +D SGSM D AK L VY Sbjct: 178 --LDASGSMAAKSNGKTRMDAAKEAIQAFAESLPEQANVALRVYGHKGSGKESDKTLSCG 235 Query: 293 ------RHHTQAKEVDEHEFFYSQETGGTIVSSALKLMDEVVKERYNPAQWNIYAAQASD 346 T KE Q TG T ++ +L+ + + + N SD Sbjct: 236 SSELVYGMQTYNKEKLTQSLNQFQPTGYTPIAYSLQEAKKDLSKLPGDKNTN-MIFLVSD 294 Query: 347 G-DNWADDSPLCHEILAK-KLLPVVRYYSYIEITRRAHQTLWREYEHLQSTFDNFAMQHI 404 G + D + LA+ ++ P++ + Q +E Sbjct: 295 GIETCDGDPVEAAKQLAQSEITPIINVIGFGVDG--PGQQQLKEVAKAAGGRYVLIQDQK 352 Query: 405 RDQDDIYP 412 QD+ Sbjct: 353 ELQDEFNR 360 >UniRef50_B7G6X2 Predicted protein n=1 Tax=Phaeodactylum tricornutum CCAP 1055/1 RepID=B7G6X2_PHATR Length = 523 Score = 93.0 bits (229), Expect = 2e-17, Method: Composition-based stats. Identities = 30/265 (11%), Positives = 65/265 (24%), Gaps = 58/265 (21%) Query: 196 AIISNSEPAQLLEEERLRKEIAELRAKIERVPFIDTFDLRYKNYEKRPDPSSQAVMFCLM 255 + + A L ++I + +++ F + R ++ D + + ++ Sbjct: 16 DSFNPHQDAALDLSILAERDIIGIDSEVSTNHFCASIHART-MPKEDEDCRTPIDLIVVL 74 Query: 256 DVSGSMDQSTKDMAKRFYILLYLFLSRTYKNVEVVYIRH----------HTQAKEVDEHE 305 DVSGSM + + K+ +L L + + + Q K + Sbjct: 75 DVSGSMTGNKLKLCKKTLTMLLRVLQTQDRFGLISFGSDARVEFPAQAMSKQNKASALQK 134 Query: 306 FFYSQETGGTIVSSALKLMDEVVKERYNPAQWNIYAAQASDG-DNWADDSPLCHEILAKK 364 G T +S+AL L + +K + +DG N L + Sbjct: 135 IQSLTTRGCTNMSAALGLAVQELK-IIEKSNPVRSLFFLTDGLANEGISDLDGLVSLTRN 193 Query: 365 -------------------------------------------LLPVVRYYSYIEITRRA 381 + +++ Sbjct: 194 CLLPSDNPSNVLNSEVMIAECLDDLATSQHQITRLPVAEIESVCRAPITLHTFGYGR-DH 252 Query: 382 HQTLWREYEHLQ-STFDNFAMQHIR 405 + L F Sbjct: 253 NAALLESLADTTQGGAYYFIEDDSN 277 >UniRef50_Q6IND5 MGC83495 protein n=9 Tax=cellular organisms RepID=Q6IND5_XENLA Length = 861 Score = 92.6 bits (228), Expect = 2e-17, Method: Composition-based stats. Identities = 45/417 (10%), Positives = 103/417 (24%), Gaps = 47/417 (11%) Query: 42 SVTDVDSGESVSIPTEDISEPMFHQGRGGLRHRVHPGNDHFVQNDRIERPQGGGGGSGSG 101 S G+ + ++ + + G + F+ + G+ Sbjct: 64 SFEATVEGKKIVADIQERQQANKTYDEA-----ISQGQEAFLLQEDESSGDVFSCSVGNL 118 Query: 102 QGQASQDGEGQDEFVFQISKDEYLDLLFEDLALPNLKQNQQRQLTEYKTHRAGYTANGVP 161 + + + D + + + P + + R Sbjct: 119 PPGQEAEVKLSFVRELPVESDGAVRFVLPTVLNPRYTPKEHEESVTATVPRLPADKIPYS 178 Query: 162 ANISVVRSLQNSLARRTAMTAGKRRELHALEENLAIISNSEPAQLLEEERLRKEIAELRA 221 +++ +++ + + ++ A +S ++ + + L + Sbjct: 179 LDLTAHFQSVYGISKIESNCELSPIQYTDGDKLSAKVSLAQGHKFDRDVELLAYYLQANT 238 Query: 222 KIERVPFIDTFDLR-------------YKNYEKRPDPSSQAVMFCLMDVSGSM------- 261 V + Y N+ + + S+ LMD SGSM Sbjct: 239 PSAMVEAGLPNAVEGSIMADPVVMLNFYPNFPETKEKSNFGEFIFLMDRSGSMTEQMTNE 298 Query: 262 --DQSTKDMAKRFYILLYLFLSRTYKNVEVVYIR-----------HHTQAKEVDEHEFFY 308 D AK ILL L + + Q+ E Sbjct: 299 QNAPRRIDSAKETLILLLKSLPLGCYFNIFSFGSDFTSIFSESMAYTQQSMEEAVKLVNQ 358 Query: 309 SQET-GGTIVSSALKLMDEVVKERYNPAQWNIYAAQASDGDNWADDSPLCHEILAKKLLP 367 GGT + LK + + +P +DG+ + ++ Sbjct: 359 MDADMGGTEILEPLKKIYKTAGRPSHP----RQLFVFTDGE---VGNTNFVIDEVRRNAH 411 Query: 368 VVRYYSYIEITRRAHQTLWREYEHLQSTFDNFAMQHIRDQDDIYPVFRELFHKQNAT 424 R +++ + L + S F R Q + + Sbjct: 412 NHRCFTFGIGEGAS-TALIKGLARAASGTFEFITGKERMQPKVLQTLKCSLQPTVKD 467 >UniRef50_B8GAZ1 von Willebrand factor type A n=3 Tax=Chloroflexus RepID=B8GAZ1_CHLAD Length = 958 Score = 92.6 bits (228), Expect = 2e-17, Method: Composition-based stats. Identities = 51/454 (11%), Positives = 106/454 (23%), Gaps = 65/454 (14%) Query: 7 RRLNGKNKSMVNRQRFLRRYKAQIKQSISEAINKRSVTDVDSGESVSIPTEDISEPMFHQ 66 +RL + N R + + + I I ++ + D+ V+ + P + Sbjct: 156 KRLVLLSDGGENSGRAIDVARLAASRGIPIDIVDLALVETDAEALVA----SVEAPNGVR 211 Query: 67 GRGGLRHRVHPGNDHFVQNDRIERPQGGGGGSGSGQGQ-ASQDGEGQDEFVFQISKDEYL 125 G + + V R G + + + S + Sbjct: 212 D--GQEALIVATVESTVAQRATVRLIDDVGVVAERELDLGPGATRVEFVVPIKGSGFQRY 269 Query: 126 DLLFEDLALPNLKQNQQRQLTEYKTHRAGYTANGVPANISVVRSLQNSL--ARRTAMTAG 183 + E ++ N+ L + V N + R L +L A A Sbjct: 270 RVQVEAAQDGRVQNNEAAALIRVQGPPRVL---LVARNAADARPLATALTAADIVAEIIA 326 Query: 184 KRRELHALEENLAIISNSEPAQLLEEERLRKEIAELRAKIERVPFIDTFD---------- 233 +L + A + + A + + Sbjct: 327 PEAAPRSLADLSAYDALVLVNTPARALPVGLMQAIPGYVRDLGRGLLMIGGEESFGVGGY 386 Query: 234 ---------LRYKNYEKRPDPSSQAVMFCLMDVSGSM-----------------DQSTKD 267 Y + R + ++D SGSM + D Sbjct: 387 GRTAVEEALPVYMDVRNRELRP-DLAIVFVIDKSGSMDACHCANPDRGGPITSSSERKID 445 Query: 268 MAKRFYILLYLFLSRTYKNVEVVYIR--HHTQA------KEVDEHEFFYSQETGGTIVSS 319 +AK LS V + T E + G T + + Sbjct: 446 IAKDAVAQATALLSPQDTVGVVTFDGAAFPTFVATRGATVEQVMDAVSGVEPRGPTNIRA 505 Query: 320 ALKLMDEVVKERYNPAQWNIYAAQASDGDNWADDSPLCHEILAKKLLPVVRYYSYIEITR 379 L +E++++ + +DG D L ++ + + + Sbjct: 506 GLLRAEEMLQQV---DARIKHMILLTDGWGSGGDQLDIAARL-REQGITLTVVAAGSGSA 561 Query: 380 RAHQTLWREYEHLQSTFDNFAMQHIRDQDDIYPV 413 ++ A Sbjct: 562 TY----LQQLAAEGGGRYYPAADMADVPQIFVQE 591 Score = 53.7 bits (127), Expect = 1e-05, Method: Composition-based stats. Identities = 23/133 (17%), Positives = 40/133 (30%), Gaps = 10/133 (7%) Query: 244 DPSSQAVMFCLMDVSGSMDQSTKDMAKRFYILLYLFLSRTYKNVEVVYIRHH----TQAK 299 P + L+D S SM ST+ A+ F + + VV+ + Sbjct: 62 RPVDRLTTVFLLDGSDSMPASTRAQAEAFIRAALQEMPPDDQAAIVVFGGNALVERAPDS 121 Query: 300 EVDEHEFFYSQETGGTIVSSALKLMDEVVKERYNPAQWNIYAAQASDGDNWADDSPLCHE 359 + T T + +A++L + PA SDG + + Sbjct: 122 DRRLGRITSIPITNRTNIEAAIQLGMALF-----PADSQKRLVLLSDGGENSGRAID-VA 175 Query: 360 ILAKKLLPVVRYY 372 LA + Sbjct: 176 RLAASRGIPIDIV 188 >UniRef50_UPI00006CC819 von Willebrand factor type A domain containing protein n=1 Tax=Tetrahymena thermophila RepID=UPI00006CC819 Length = 930 Score = 92.6 bits (228), Expect = 2e-17, Method: Composition-based stats. Identities = 26/203 (12%), Positives = 49/203 (24%), Gaps = 20/203 (9%) Query: 236 YKNYEKRPDPSSQAVMFCLMDVSGSMDQSTKDMAKRFYILLYLFLSRTYKNVEVVYIR-- 293 + SS+ L+D SGSM + A IL L + Sbjct: 312 FNQQIIDQTDSSKCEFIFLLDRSGSMSGQSIQNAIEALILFIKSLPLDSYFNIYSFGTEF 371 Query: 294 ---------HHTQAKEVDEHEFFYSQET-GGTIVSSALKLMDEVVKERYNPAQWNIYAAQ 343 + + E+ +E GGT + L + + Sbjct: 372 SKLFDQSQKYSNENVELALNEIITYSANYGGTNIYQPLSEIFNQ----PYVKGYGRQIYI 427 Query: 344 ASDGDNWADDSPLCHEILAKKLLPVVRYYSYIEITRRAHQTLWREYEHLQSTFDNFAMQH 403 +DG ++ L + R ++ + L + Sbjct: 428 LTDGQ---IENKENVMHLIQSNNISNRVHAIGIGL-YVDKDLIIQSAKSGKGCHAHVTDQ 483 Query: 404 IRDQDDIYPVFRELFHKQNATAK 426 Q+ I + + K Sbjct: 484 SLIQESIINILQNSISPILEDVK 506 >UniRef50_P19827 Inter-alpha-trypsin inhibitor heavy chain H1 n=63 Tax=Mammalia RepID=ITIH1_HUMAN Length = 911 Score = 92.6 bits (228), Expect = 3e-17, Method: Composition-based stats. Identities = 17/205 (8%), Positives = 48/205 (23%), Gaps = 22/205 (10%) Query: 239 YEKRPDPSSQAVMFCLMDVSGSMDQSTKDMAKRFYILLYLFLSRTYKNVEVVY------- 291 + + + + ++D+SGSM K + + + V++ Sbjct: 281 FAPQNLTNMNKNVVFVIDISGSMRGQKVKQTKEALLKILGDMQPGDYFDLVLFGTRVQSW 340 Query: 292 ----IRHHTQAKEVDEHEFFYSQETGGTIVSSALKLMDEVVKERYNP----AQWNIYAAQ 343 ++ + + T ++ L E++ + + Sbjct: 341 KGSLVQASEANLQAAQDFVRGFSLDEATNLNGGLLRGIEILNQVQESLPELSNHASILIM 400 Query: 344 ASDGDNW-ADDSPLCHEILAKK-LLPVVRYYSYIEITRRAHQTLWREYEHLQSTFDNFAM 401 +DGD + + Y+ + Sbjct: 401 LTDGDPTEGVTDRSQILKNVRNAIRGRFPLYNLGFG-HNVDFNFLEVMSMENNGRA---- 455 Query: 402 QHIRDQDDIYPVFRELFHKQNATAK 426 Q I + D + + + Sbjct: 456 QRIYEDHDATQQLQGFYSQVAKPLL 480 >UniRef50_C3NI41 von Willebrand factor type A n=9 Tax=Sulfolobus RepID=C3NI41_SULIN Length = 356 Score = 92.6 bits (228), Expect = 3e-17, Method: Composition-based stats. Identities = 20/165 (12%), Positives = 47/165 (28%), Gaps = 9/165 (5%) Query: 246 SSQAVMFCLMDVSGSMDQSTKDMAKRFYILLYLFLSRTYKNVEVVYIRHHTQAKEVDEHE 305 +S ++D S SM + A + L L+ +++ H + Sbjct: 36 TSSIHYIIMIDNSPSMRGEKLNTAVQSAQKLLYNLNEGNYVTLILFSNHPEIKYQGPAKG 95 Query: 306 FFYSQETGG--TIVSSALKLMDEVVKERYNPAQWNIYAAQASDGDNWADDSPLCHEILAK 363 G T + A+ + K+ P + +DG + +E L Sbjct: 96 IITFDVGKGYTTRLHEAVSFTINLAKQSQVPTK----IIMLTDGKPTDKRNVKDYEKL-- 149 Query: 364 KLLPVVRYYSYIEITRRAHQTLWREYEHLQSTFDNFAMQHIRDQD 408 + P + + ++ + ++ S + Sbjct: 150 DIPPNTQIITIGIGN-NYNERILKKLADRSSGKFYHIKDISELPN 193 >UniRef50_C0QP91 Putative von Willebrand factor type A domain protein n=1 Tax=Persephonella marina EX-H1 RepID=C0QP91_PERMH Length = 304 Score = 92.6 bits (228), Expect = 3e-17, Method: Composition-based stats. Identities = 22/183 (12%), Positives = 46/183 (25%), Gaps = 25/183 (13%) Query: 250 VMFCLMDVSGSMDQSTK-DMAKRFYILLYLFLSRTYKNVEVVYIRHH------TQAKEVD 302 + +DVS SM + K ++K L + +V+ T + Sbjct: 85 NIIIALDVSNSMKEKNKLKISKEILRDFLLKRDEEDRIGILVFDNLPFRLMPLTSDRGAL 144 Query: 303 EHEFFYSQET----GGTIVSSALKLMDEVVKERYNPAQWNIYAAQASD-GDNWADDSPLC 357 + GGT + L + + + N +D GD + + Sbjct: 145 LRVISIIRPAMVDVGGTAMYDGLVEALNM----FMKDRRNKIIILLTDGGDINSKYTLED 200 Query: 358 HEILAKKLLPVVRYYSYIEITRRAHQTLWREYEHLQSTFDNFAMQHIRDQDDIYPVFREL 417 + + + + + F D R + Sbjct: 201 VVRFNQDIGAKIYTIGVSSGM---NFYVLERLSEATGGKAFFVT------KDYQKALRSV 251 Query: 418 FHK 420 F + Sbjct: 252 FDE 254 >UniRef50_D1RCP1 von Willebrand factor type A domain protein n=6 Tax=Legionella RepID=D1RCP1_LEGLO Length = 342 Score = 92.3 bits (227), Expect = 3e-17, Method: Composition-based stats. Identities = 33/273 (12%), Positives = 66/273 (24%), Gaps = 44/273 (16%) Query: 183 GKRRELHALEENLAIISNSEPAQLLEEERLRKEIAELRAKIERVPFIDTFDLRYKNYEKR 242 R L L + + + ++E+ + V + F L + Sbjct: 22 LMPRVKVKLPTALRVPFFAAMIDIADQEKSSISVQHSLLIPALVWLLLVFALAGPRWVGA 81 Query: 243 PDP--SSQAVMFCLMDVSGSMDQSTK----------DMAKRFYILLYLFLSRTYKNVEVV 290 P P + +D+SGSM+ ++ K K ++ Sbjct: 82 PKPVSREGYNIMMALDLSGSMEIPDMILHGRPTSRLNIVKSAAEQFVRE-RSGDKIGLIL 140 Query: 291 YIRHH------TQAKE----VDEHEFFYSQETGGTIVSSALKLMDEVVKERYNPAQWNIY 340 + T + E T + A+ L + + Sbjct: 141 FGTRAYLQTPLTYDRHSILLRLEDATAGL-AGKTTSIGDAVGLAVKRLDSAPKKG---RV 196 Query: 341 AAQASDGDNWADD-SPLCHEILAKKLLPVVRYYSYIEITR---------------RAHQT 384 +DG N + +PL LAK+ + + Sbjct: 197 IILLTDGANNSGVLAPLKAAELAKEEGIKIYTIGLGSEGDSRALVGDFLMQSPAADLDEE 256 Query: 385 LWREYEHLQSTFDNFAMQHIRDQDDIYPVFREL 417 ++ + F IY +L Sbjct: 257 TLKKMSDMTGGRY-FRATDTESLHLIYKTINQL 288 >UniRef50_A5UWS5 von Willebrand factor, type A n=2 Tax=Roseiflexus RepID=A5UWS5_ROSS1 Length = 851 Score = 92.3 bits (227), Expect = 3e-17, Method: Composition-based stats. Identities = 37/297 (12%), Positives = 80/297 (26%), Gaps = 24/297 (8%) Query: 132 LALPNLKQNQQRQLTEYKTHRAGYTANGVPANISVVRSLQNSLARR-TAMTAGKRRELHA 190 +P + + R GV A+++ +L ++ + Sbjct: 274 APMPRILLVEGSDGAISAPLRGALREAGVIADVADPTALPAQISALGLYEGIVLIDVPAS 333 Query: 191 LEENLAIISNSEPAQLLEEERLRKEIAELRAKIERVPFIDTFDLRYKNYEKRPDPSSQAV 250 + + E + L L + S Sbjct: 334 ALTFDQMATLREFVRSEGRGLLAIGGRSSFTLGAYKNTPLEETLPVEMTPPPRPERSDTT 393 Query: 251 MFCLMDVSGSMDQS----TKDMAKRFYILLYLFLSRTYKNVEVVYIRHHTQAKEV----- 301 + ++D S SM MAK I+ L + + + + + Sbjct: 394 LLLIIDQSASMGPETGISKFTMAKEAAIMATESLRQEDRIGVLAFDVSTRWVVDFQPVGV 453 Query: 302 ------DEHEFFYSQETGGTIVSSALKLMDEVVKERYNPAQWNIYAAQASDGDNWADDSP 355 + GGT + +AL+ + + +A +DG ++ DD Sbjct: 454 GLSLADVQRRISTLPLGGGTDIYNALQEGLPALAQ---QPGRVRHAVLLTDGRSFTDDRQ 510 Query: 356 LCHEILAKKLLPVVRYYSYIEITRRAHQTLWREYEHLQSTFDNFAMQHIRDQDDIYP 412 +L + + + T A L +E + ++A + +DI Sbjct: 511 AYRMLLEEARSQNITLSTIAIGT-DADINLLQELARWGAGRYHYAA----EPNDIPR 562 >UniRef50_Q237Q6 von Willebrand factor type A domain containing protein n=1 Tax=Tetrahymena thermophila SB210 RepID=Q237Q6_TETTH Length = 713 Score = 92.3 bits (227), Expect = 3e-17, Method: Composition-based stats. Identities = 24/206 (11%), Positives = 49/206 (23%), Gaps = 36/206 (17%) Query: 230 DTFDLRYKNYEKRPDPSSQAVMFCLMDVSGSMDQ---------------STKDMAKRFYI 274 +R + + + C++DVSGSM S D+ K Sbjct: 45 LENQVRIQILSPKGKSKVSNSICCVVDVSGSMGSRAVTKQSGGNSELGYSVLDIVKHSLN 104 Query: 275 LLYLFLSRTYKNVEVVYIRHHTQAKEVDEHEFFYS----------QETGGTIVSSALKLM 324 + L + V + + + Q T + + ++ Sbjct: 105 TIVQNLDEGDEFSMVTFSDNSKLVCNYQQMTESNIKSSVDLINQCQPDASTNIWAGIEQG 164 Query: 325 DEVVKERYNPAQWNIYAAQASDGDNWADDSPLCHEILAK-------KLLPVVRYYSYIEI 377 E ++ N + N +DG + L P + + + Sbjct: 165 LEQMQNDSNKNK-NQQLIVLTDGQPNVNPPRGILTTLNNFYNKNIISPKPSINTFGFGY- 222 Query: 378 TRRAHQTLWREYEHLQSTFDNFAMQH 403 L +F Sbjct: 223 --YLDSHLLFNIAQDCQGIYSFIPDS 246 >UniRef50_UPI00006CC94A von Willebrand factor type A domain containing protein n=1 Tax=Tetrahymena thermophila RepID=UPI00006CC94A Length = 901 Score = 92.3 bits (227), Expect = 3e-17, Method: Composition-based stats. Identities = 28/197 (14%), Positives = 56/197 (28%), Gaps = 23/197 (11%) Query: 242 RPDPSSQAVMFCLMDVSGSMDQSTKDMAKRFYILLYLFLSRTYKNVEVVYIRHHTQAKE- 300 SS++ L+D SGSM A L L V + + + E Sbjct: 314 DHLKSSRSEYIFLIDRSGSMRGKPLTKALEALQLFLQSLPPDSYFNIVSFGSNFKKLYER 373 Query: 301 -------VDEHEFFYSQ----ETGGTIVSSALKLMDEVVKERYNPAQWNIYAAQASDGDN 349 + + + GT + S L + + +N +DG Sbjct: 374 SQKYNSQTLKFACNKIKDYSADMNGTDILSPLNNIFYYGQNIR---GYNRQIFVLTDG-- 428 Query: 350 WADDSPLCHEILAKKLLPVVRYYSYIEITRRAHQTLWREYEHLQSTFDNFAMQHIRDQDD 409 A + K+ R + A + L +E + ++ + D Sbjct: 429 -AVQNRQSVVREIKRNNKKNRVHFIGFG-SSADKILIQESAIAGKG----IHEMVQFEQD 482 Query: 410 IYPVFRELFHKQNATAK 426 + + ++ K + Sbjct: 483 LSSIVIKILCKTISATL 499 >UniRef50_Q0AZS1 Mg-chelatase subunit ChlD-like protein n=1 Tax=Syntrophomonas wolfei subsp. wolfei str. Goettingen RepID=Q0AZS1_SYNWW Length = 592 Score = 92.3 bits (227), Expect = 3e-17, Method: Composition-based stats. Identities = 43/310 (13%), Positives = 86/310 (27%), Gaps = 28/310 (9%) Query: 111 GQDEFVFQISKDEYLDLLFEDLALPNLKQNQQRQLTEYKTHRAGYTANGVPANISVVRSL 170 D + +E L +D L + + QL ++ + + N+ + S Sbjct: 276 WGDVEHYVEQLEELG--LIKDTILGKVMTRKGLQLKDFVINHKCELETEIRRNMRKMPSG 333 Query: 171 QNSLARRTAM--TAGKRRELHALEENLAIISNSEPAQLLEEERLRKEIAELRAKIERVPF 228 NS R+ + E + + + L E + + + + + Sbjct: 334 GNSRFRKLGQVDQKQTQVEFTNRNKTVNNPDKNWSGDLAVPETIVQAMKNSFLRNDPHFT 393 Query: 229 IDTFDLRYKNYEKRPDPSSQAVMFCLMDVSGSMDQSTKDMAKRFYILLYLFLSRTYKNVE 288 I DL Y + + L+D SGSM + A L LS K Sbjct: 394 IKKEDLHYYD----KKSYVPIDVCLLIDASGSMAGDKRQAACFLAQNLL--LSGKEKVAV 447 Query: 289 VVYIRHH-------TQAKEVDEHEFFYSQETGGTIVSSALKLMDEVVKERYNPAQWNIYA 341 V + T+ + + G T ++ + ++K N Sbjct: 448 VTFQERSSEVVVPFTRNQNILNKGLSTISPAGLTPMADGIMTAVNLIKNNRV---RNPLL 504 Query: 342 AQASDGDN----WADDSPLCHEILAKKLLP-VVRYYSYIEITRRAHQTLWREYEHLQSTF 396 SDG W D+ A + + + +++ + Sbjct: 505 VLISDGIPNIPLWTLDAQADALEAATHIRENKIHFICIGL---ESNRFYLEKLSANAGGA 561 Query: 397 DNFAMQHIRD 406 +D Sbjct: 562 LYLVDDLNKD 571 >UniRef50_B2HK18 Conserved membrane protein n=3 Tax=Mycobacterium RepID=B2HK18_MYCMM Length = 983 Score = 92.3 bits (227), Expect = 3e-17, Method: Composition-based stats. Identities = 19/185 (10%), Positives = 45/185 (24%), Gaps = 18/185 (9%) Query: 246 SSQAVMFCLMDVSGSMDQSTKDMAKRFYILLYLFLSRTYKNVEVVYIR------------ 293 + ++D S SM A+R + L+ + + + Sbjct: 296 PRPRHLVLVLDRSRSMAGWKMTAARRAASRIVDALTSDDRFAVLTFDDGIEYPVGLPAGL 355 Query: 294 --HHTQAKEVDEHEFFYSQETGGTIVSSALKLMDEVVKERYNPAQWNIYAAQASDGDNWA 351 + + + G T + + L+ ++ + SDG Sbjct: 356 TEASDRHRYRAVEHLARVEARGDTEMLAPLRRALALLGREQVADTDDAVLILISDGQ--- 412 Query: 352 DDSPLCHEILAKKLLPVVRYYSYIEITRRAHQTLWREYEHLQSTFDNFAMQHIRDQDDIY 411 + L VR ++ + R + R + + Sbjct: 413 VGNEDQLLQELSGDLGRVRLHTIGV-DEAVNAGFLRRLAGVGGGRCVLVDNEDRLDEALP 471 Query: 412 PVFRE 416 V + Sbjct: 472 GVIQR 476 >UniRef50_Q23KK4 von Willebrand factor type A domain containing protein n=1 Tax=Tetrahymena thermophila SB210 RepID=Q23KK4_TETTH Length = 1085 Score = 92.3 bits (227), Expect = 3e-17, Method: Composition-based stats. Identities = 38/346 (10%), Positives = 97/346 (28%), Gaps = 42/346 (12%) Query: 96 GGSGSGQGQASQDGEGQDEFVFQISKDEYLDLL--FEDLALPNLKQNQQRQLTEYKTHRA 153 G S + + Q E D F++ +L ++N + + Sbjct: 290 GQQMSDSQDNLFSLKEDLKKEIQKRDKEQYDASQKFQEASLLITQKNSMISTHQTRQKLL 349 Query: 154 GYTANGVPANISVVRSLQNSLARRTA------MTAGKRRELHALEENLAIISNSEPAQLL 207 + + I + + +N + + + + E A +Q Sbjct: 350 NSDIQQIESKIKKLEADKNKQVEDLEARKDKYLKQLEEEKAKLIREKSAFWDKKNSSQEA 409 Query: 208 EEERLRKEIAELRAKIERVPFIDTFDLRYKNYEKRPDPSSQAVMFCLMDVSGSMDQSTKD 267 + + I L +K + +Y +Y + + D SGS + Sbjct: 410 RLSQYSQSINSLNSKYPLGKMCSIVEKKYFHY------------YFIQDESGSFSNDHQY 457 Query: 268 MAKRFYILLYLFLSRTYKNVEVVYIRHHTQAKEVDEH-------EFFYSQ--ETGGTIVS 318 + L + + + Q GGT Sbjct: 458 AIQGVAQLF-NRIKPNDYITYIKFDSSSHVDIPKTLKSSLSQGDFISKIQKCRGGGTNFQ 516 Query: 319 SALKLMDEVVKERYNPAQWNIYAAQASDGDNWADDSPLCHEILAKKLLPVVRYYSYIEIT 378 SA + + + ++ +Y+ ++ + +DG + L + +Y+ Sbjct: 517 SAFQTLLQQIQSKYDQQEYPVVIF-ITDGQDN--TDLDSIISQITSLCQDIVFYTIGYG- 572 Query: 379 RRAHQTLWREYEHLQSTFDNFAMQHIRDQDDIYPVFRELFHKQNAT 424 ++ + + + + ++ +I +LF+ +N Sbjct: 573 -SVNEKYLKNITNKFN-------NTVGEKKEINGKPVDLFYVKNTP 610 >UniRef50_Q11RQ7 BatA-like protein, aerotolerance-related n=1 Tax=Cytophaga hutchinsonii ATCC 33406 RepID=Q11RQ7_CYTH3 Length = 351 Score = 92.3 bits (227), Expect = 3e-17, Method: Composition-based stats. Identities = 29/212 (13%), Positives = 58/212 (27%), Gaps = 36/212 (16%) Query: 237 KNYEKRPDPSSQAVMFCLMDVSGSM-----DQSTKDMAKRFYILLYLFLSRTYKNVEVVY 291 K+ E + M +DVS SM S D AK+ + + V++ Sbjct: 100 KSNETNTQYTEGINMIFAIDVSESMKITDIHPSRFDAAKQICTDIINK-RSNDRIGIVIF 158 Query: 292 IRHH------TQAKEVDEHEFFYSQET----GGTIVSSALKLMDEVVKERYNPAQWNIYA 341 T + +++ ++ GT + +AL +K Sbjct: 159 SGEAVTLSPLTNDYVLLKNQLNDLKQNKDLQSGTAIGTALGTAINRLKNAETKE---RII 215 Query: 342 AQASDGDNWAD-DSPLCHEILAKKLLPVVRYYSYI---------------EITRRAHQTL 385 SDG+N + P+ L + + + + + Sbjct: 216 VLISDGENTSGLMDPITAADLCLEYNIKIYCIGLGKDGTHQFKDDNGTIQYVESKLDENT 275 Query: 386 WREYEHLQSTFDNFAMQHIRDQDDIYPVFREL 417 + A + DD+ +L Sbjct: 276 LKNISATTKGKFYRAYDK-KSLDDVIANIDQL 306 >UniRef50_Q17A73 Dihydropyridine-sensitive l-type calcium channel n=4 Tax=Culicidae RepID=Q17A73_AEDAE Length = 1173 Score = 92.3 bits (227), Expect = 3e-17, Method: Composition-based stats. Identities = 38/306 (12%), Positives = 74/306 (24%), Gaps = 32/306 (10%) Query: 138 KQNQQRQLTEYKTHRAGYTANGVP--ANISVVRSLQN-SLARRTAMTAG-KRRELHALEE 193 + N L E N NISV S + + + L E Sbjct: 102 EPNIPSSLEENIWMYRNMYLNPDTHFFNISVNTSYSSVHVPQNVYDRYPWVMEALQWSEA 161 Query: 194 NLAIISNSEPAQLLEEERLRKEIAEL----RAKIERVPFIDTFDLRYKNYEKRPDPSSQA 249 + + + + + A +DTFD R +++ + Sbjct: 162 LDDVFMQNYNSDPALSWQYFGSYTGILRHYPALEWDRRQVDTFDCRKRSW-YIETATCSK 220 Query: 250 VMFCLMDVSGSMDQSTKDMAKRFYILLYLFLSRTYKNVEVVYIR--------HHTQAKEV 301 + L+D SGSM +A+ + S Y + Sbjct: 221 DIVILLDNSGSMTGYRNYIAQLTVKSILDTFSNNDFINIYKYSNDVDPLVDCFADMLIQA 280 Query: 302 D-------EHEFFYSQETGGTIVSSALKLMDEVVKERYNP-------AQWNIYAAQASDG 347 + + G V A E+++ + N +DG Sbjct: 281 TPENIRFMNEKVRGLEPDGYANVKKAFVKAFELLQHYREMRRCNETVSGCNQAIMLITDG 340 Query: 348 DNWADDSPLCHEILAKKL-LPVVRYYSYIEITRRAHQTLWREYEHLQSTFDNFAMQHIRD 406 + VR ++Y+ + L + Sbjct: 341 VPSNITDVFEQYNWFENGTKIPVRVFTYLLGREVTKVREIQWMACLNRGHYSHIQSLDEV 400 Query: 407 QDDIYP 412 Q+++ Sbjct: 401 QEEVLK 406 >UniRef50_UPI0000E4A663 PREDICTED: similar to calcium activated chloride channel 1 precursor n=2 Tax=Strongylocentrotus purpuratus RepID=UPI0000E4A663 Length = 1245 Score = 92.3 bits (227), Expect = 3e-17, Method: Composition-based stats. Identities = 19/155 (12%), Positives = 41/155 (26%), Gaps = 16/155 (10%) Query: 251 MFCLMDVSGSMD-QSTKDMAKRFYILLYLFLSRTYKNVEVVYIRHHTQAKEVDE------ 303 + ++D SGSM + D + V + T + + Sbjct: 525 VVLVLDTSGSMGTSNRIDKVNSAATAFVNLVDDGISIGIVTFTGSPTTRHALTQINTQAD 584 Query: 304 ----HEFFYSQETGGTIVSSALKLMDEVVKERYNPAQWNIYAAQASDGDNWADDSPLCHE 359 + F +GGT + L+ EV+ + + +DG + + Sbjct: 585 RDSLRDIFQLTASGGTCIGCGLEQGLEVLMAHPSGSADGGIIVLMTDGQDSGIQNH-IIR 643 Query: 360 ILAKKLLPVVRYYSYIEITRRAHQTLWREYEHLQS 394 + + V + E + Sbjct: 644 QTLQDMGVRVNTVAIGE--DAYGE--LSLIAQETG 674 >UniRef50_UPI0001C34E55 hypothetical protein ClM62_13922 n=1 Tax=Clostridium sp. M62/1 RepID=UPI0001C34E55 Length = 466 Score = 92.3 bits (227), Expect = 4e-17, Method: Composition-based stats. Identities = 23/186 (12%), Positives = 49/186 (26%), Gaps = 23/186 (12%) Query: 250 VMFCLMDVSGSMDQSTKDMAKRFYILLYLFLSRTY----------KNVEVVYIRHHT--- 296 + +D S M+ S + AK+ L R + V + T Sbjct: 39 DIVFAIDRSAKMEGSALEAAKKGIKAFIETLERESAQPEGYAGEKRVGLVSFSDTATVNS 98 Query: 297 ---QAKEVDEHEFFYSQETGGTIVSSALKLMDEVVKERYNPAQWNIYAAQASDGDNWADD 353 E G + + A++ +++ + +DG Sbjct: 99 MLSPVVEQAARAAEGLTAGGKSNQAEAIRAAVKLLDMKTPGE---KMLFLITDGQTPFRS 155 Query: 354 SPLCHEILAKKLLPVVRYYSYIEITRRAHQTLWREYEHLQSTFDNFAMQHIRDQDDIYPV 413 A++ V ++ R + D+ ++ IR+ + Sbjct: 156 QTDSAAAEARQAGVTVYCIGIAAPDG-VNREALRSWA--SGPSDSHIIE-IRELGEAQTA 211 Query: 414 FRELFH 419 F L Sbjct: 212 FERLMK 217 >UniRef50_D2VDM1 Predicted protein n=1 Tax=Naegleria gruberi RepID=D2VDM1_NAEGR Length = 754 Score = 92.3 bits (227), Expect = 4e-17, Method: Composition-based stats. Identities = 31/198 (15%), Positives = 59/198 (29%), Gaps = 22/198 (11%) Query: 234 LRYKNYEKRPDPSSQAVMFCLMDVSGSMDQSTKDMAKRFYILLYLFLSRTYKNVEVVYIR 293 L++ Q + +DVSGSM D AK L+ + V + Y Sbjct: 26 LQFDLISNIQRKEKQ--IVIALDVSGSMRGQGIDQAKIAISNLFEQVVDIPDVVLIAYDT 83 Query: 294 HH------TQAKEVDEHEFFYSQETGGTIVSSALKLMDEVVKERYNPAQWNIYAAQASDG 347 + E + Q GGT + + + ++ +Q + +DG Sbjct: 84 SAELYDLRKKPAETRQSTLEQIQAGGGTDFTCVFEAISKL---DMFNSQSEVAILFFTDG 140 Query: 348 DNWADDSPLCHEILAKKLLPVVR----YYSYIEITRRAHQTLWREYEHLQ--STFDNFAM 401 + + KK+L +++ T L + L + Sbjct: 141 QDGSSHKREKAIEQMKKVLETKTQSFEFHTIGF-TSSHDVALLTQITQLGSVQGTFQYV- 198 Query: 402 QHIRDQDDIYPVFRELFH 419 +D ++I L Sbjct: 199 ---KDANEINQSMENLIG 213 >UniRef50_A6G7V2 von Willebrand factor, type A n=1 Tax=Plesiocystis pacifica SIR-1 RepID=A6G7V2_9DELT Length = 820 Score = 91.9 bits (226), Expect = 4e-17, Method: Composition-based stats. Identities = 40/404 (9%), Positives = 97/404 (24%), Gaps = 42/404 (10%) Query: 20 QRFLRRYKAQIKQSISEAINKRSVTDVDSGESVSIPTEDISEPMFHQGRGGLRHRVHPGN 79 Q+ R++ + + S A+ + + + + E ++P + Sbjct: 96 QQARERFEDALLEGKSAALLEEERSSLFTQELGNVPPRVEVTCELILDQPLKWLAAAGAR 155 Query: 80 DHFVQNDRIERPQGGGGGSGSGQGQASQDGEGQDEFVFQISKDEYLDLLFEDLALPNLKQ 139 P A ++ V +S + + L++ + Sbjct: 156 GFAGW--EWRFPT----TVAPRFSGAPGRVADAEKVVVDLSVEPLGPRVRVSLSIADALP 209 Query: 140 NQQRQLTEYKTHRAGYTANGVPANISVVRSLQNSLARRTAMTAGKRRELHALEENLAIIS 199 +H+ A + + +L R + + A E+ + Sbjct: 210 EGAWP--TSPSHKLNVARVEGRAKVELGGDAGAALDRDVVV-RWPGAPVVAGEDAGVSLE 266 Query: 200 NSEPAQLLEEERLRKEIAELRAKIERVPFIDTFDLRYKNYEKRPDPSSQAVMFCLMDVSG 259 + P L R + + L+D SG Sbjct: 267 LARPDAAHLGAANSYGRLVLTPPPIE--------------PGREVSAVPRDLIVLLDTSG 312 Query: 260 SMDQSTKDMAKRFYILLYLFLSRTYKNVEVVYI-----------RHHTQAKEVDEHEFFY 308 SM A+ L L + V + +E Sbjct: 313 SMRGEPLAHAQAVTEALIRSLRDRDRLELVEFSSRVRRWSQAPASMSAAKREEALRWVGA 372 Query: 309 SQETGGTIVSSALKLMDEVVKERYNPAQWNIYAAQASDGDNWADDSPLCHEILAKKLLPV 368 + +GGT + + ++ + +DG + + + P Sbjct: 373 LRASGGTHMRDGILAALASLR-----PEAQRQILLITDGLIAFES--EIVQAARQHRPPG 425 Query: 369 VRYYSYIEITRRAHQTLWREYEHLQSTFDNFAMQHIRDQDDIYP 412 R ++ +++L R + ++ Sbjct: 426 CRVHTLGIG-SSVNRSLTRPVALAGGGLEVIVAPGEDAEEAAAR 468 >UniRef50_Q21JX5 von Willebrand factor, type A n=8 Tax=Gammaproteobacteria RepID=Q21JX5_SACD2 Length = 341 Score = 91.9 bits (226), Expect = 4e-17, Method: Composition-based stats. Identities = 33/216 (15%), Positives = 60/216 (27%), Gaps = 43/216 (19%) Query: 240 EKRPDPSSQAVMFCLMDVSGSMDQ----------STKDMAKRFYILLYLFLSRTYKNVEV 289 E+ P++ + +D+SGSMD + K + V Sbjct: 82 EEVHLPTTGRDLLVAVDISGSMDTKDMVVQNQQIPRIAVVKHIVGDFIER-RVGDRLGLV 140 Query: 290 VYIRHH------TQAKEVDEHEFFYSQ---ETGGTIVSSALKLMDEVVKERYNPAQWNIY 340 ++ T + + SQ T + A+ L + +++R N Sbjct: 141 LFGTSAYLQSPLTFDRTTVKQLLVESQIGFAGPNTAIGDAIGLSIKRLRDRP---AENRV 197 Query: 341 AAQASDGDNW-ADDSPLCHEILAKKLLPVVRYYSYI------------------EITRRA 381 +DG N + SP LAK+ V +R Sbjct: 198 VILLTDGQNTAGEVSPRQAADLAKQSGVKVYTIGVGANEMIVSDGFFGNFQRKINPSRDL 257 Query: 382 HQTLWREYEHLQSTFDNFAMQHIRDQDDIYPVFREL 417 + F ++ + IY + EL Sbjct: 258 DEDTLTYIAETTGGRY-FRAHSPQELNQIYQLLDEL 292 >UniRef50_C1MHV2 Predicted protein n=1 Tax=Micromonas pusilla CCMP1545 RepID=C1MHV2_9CHLO Length = 802 Score = 91.9 bits (226), Expect = 4e-17, Method: Composition-based stats. Identities = 14/203 (6%), Positives = 46/203 (22%), Gaps = 21/203 (10%) Query: 239 YEKRPDPSSQAVMFCLMDVSGSMDQSTKDMAKRFYILLYLFLSRTYKNVEVVYI------ 292 + + + ++D SGSM+ + A L+ + Sbjct: 291 PKPERCLAFGRSVVFVIDRSGSMNGEPMEAANEALTTGLRSLTEHDYFNICAFDDGQEYF 350 Query: 293 -------RHHTQAKEVDEHEFFYSQETGGTIVSSALKLMDEVVKERYNPAQWNIYAAQAS 345 + + T + + L +++ + + Sbjct: 351 DANAMTQATPKNVERAMAWMNEHCVARYTTDIYTPLSEALKLLAGCAG-NGTVPFVFLIT 409 Query: 346 DGDNWADDSP-LCHEILAKKLLPVV-RYYSYIEITRRAHQTLWREYEHLQSTFDNFAMQH 403 DG + +++ + R ++ + + ++ Sbjct: 410 DGAVSDEKEICKMLMAESQQKGEALPRVCTFGIGQ-YCNHYFLKMLANIGRGLF----DA 464 Query: 404 IRDQDDIYPVFRELFHKQNATAK 426 D I ++ + Sbjct: 465 AFTNDKIATQMSKMLTAARSPVL 487 >UniRef50_C1XFI8 Mg-chelatase subunit ChlD n=2 Tax=Meiothermus RepID=C1XFI8_MEIRU Length = 722 Score = 91.9 bits (226), Expect = 4e-17, Method: Composition-based stats. Identities = 27/190 (14%), Positives = 55/190 (28%), Gaps = 17/190 (8%) Query: 244 DPSSQAVMFCLMDVSGSM-DQSTKDMAKRFYILLYLFLSRTYKNVEVVYIRHHTQA---- 298 + + ++DVSGSM + +A + L VV+ Sbjct: 309 EEPGGVGIVLVLDVSGSMLEDDKLGLAVTGSLELIRSARPQDYIGVVVFSDRPRWLFRPR 368 Query: 299 ------KEVDEHEFFYSQETGGTIVSSALKLMDEVVKERYNPAQWNIYAAQASDGDNWAD 352 ++ E +Q GGT++ A E +++ + +DG A Sbjct: 369 PMTEQGRKEAESLLLSTQAGGGTMIRRAYLEALEALEQVPTE---SKQVIALTDG--LAA 423 Query: 353 DSPLCHEILAKKLLPVVRYYSYIEITRRAHQTLWREYEHLQSTFDNFAMQHIRDQDDIYP 412 D A++ P ++ + A RE + Sbjct: 424 DVTPDLFDAAREASPRIKTNTVAIGA-DADGRFLRELAQAGDGTYWDVPRPEDLPRFFLE 482 Query: 413 VFRELFHKQN 422 + +F ++ Sbjct: 483 EAQRVFRREA 492 >UniRef50_A0CHZ1 Chromosome undetermined scaffold_185, whole genome shotgun sequence n=1 Tax=Paramecium tetraurelia RepID=A0CHZ1_PARTE Length = 265 Score = 91.9 bits (226), Expect = 4e-17, Method: Composition-based stats. Identities = 39/207 (18%), Positives = 70/207 (33%), Gaps = 17/207 (8%) Query: 163 NISVVRSLQNSLARRTAMTAGKRRELHALEENLAIISNSEPAQLLEEERLRKEIAELRAK 222 N SL +++ TA + ++ + ++P E+ L EI L+ Sbjct: 55 NFGYQHSLGPKYSQQLPQTAISQEIFDDDDQVQTNLVQAKPNMYDLEKELIFEIKTLQKM 114 Query: 223 IERVPFIDTFDLRY------KNYEKRPDPSSQAVMFCLMDVSGSMDQSTKDMAKRFYILL 276 I+ + + CL+D S SM+ S + K+ +L Sbjct: 115 IKLSKISTQQLPGIISIKTKDQLNDQDLNRVGVDLICLIDKSSSMNGSKIETVKQSLKVL 174 Query: 277 YLFLSRTYKNVEVVYIRHH---TQAKEVDE-------HEFFYSQETGGTIVSSALKLMDE 326 FLS + +++ H T K + E + GGT +SSA ++ Sbjct: 175 LTFLSNQDRLQLIIFNTHAKRLTPLKRITEDNKLYFTQMIDQIKSDGGTQISSATQIAIS 234 Query: 327 VVKERYNPAQWNIYAAQASDGDNWADD 353 +K R + SDG + Sbjct: 235 QLKGRKYRNNVSSV-FLLSDGQDNDAT 260 >UniRef50_A6TP10 von Willebrand factor, type A n=1 Tax=Alkaliphilus metalliredigens QYMF RepID=A6TP10_ALKMQ Length = 551 Score = 91.9 bits (226), Expect = 5e-17, Method: Composition-based stats. Identities = 19/190 (10%), Positives = 51/190 (26%), Gaps = 20/190 (10%) Query: 241 KRPDPSSQAVMFCLMDVSGSMDQSTKDMAKRFYILLYLF--LSRTYKNVEVVYIRHH--- 295 + ++D SGSM + AK + S + + + Sbjct: 166 LANLQQVPLSVSLILDNSGSMSGNPMTQAKSAAKQFLNYVDFSNGDQVEIIEFNSDVYIR 225 Query: 296 ---TQAKEVDEHEFFYSQETGGTIVSSALKLMDEVVKERYNPAQWNIYAAQASDG-DNWA 351 + + T + AL + P +DG +N + Sbjct: 226 IPYGSDIKSLNTAIDTMESNSQTALYDALYTGLVRAYSQSGP----KCILAFTDGEENAS 281 Query: 352 DDSPLCHEILAKKLLPVVRYYSYIEITRRAHQTLWREYEHLQSTFDNFAMQHIRDQDDIY 411 S L++ + + + +E ++ + ++ Sbjct: 282 IRSVSEVTELSRATSIPIFIIGVGSL---IDEESLKEIAEQTGGEYFYSPTAV----ELE 334 Query: 412 PVFRELFHKQ 421 +++ ++ +Q Sbjct: 335 QIYKTVYDQQ 344 >UniRef50_D2Q363 von Willebrand factor type A n=1 Tax=Kribbella flavida DSM 17836 RepID=D2Q363_9ACTO Length = 837 Score = 91.9 bits (226), Expect = 5e-17, Method: Composition-based stats. Identities = 52/368 (14%), Positives = 84/368 (22%), Gaps = 33/368 (8%) Query: 8 RLNGKNKSMVNRQRFLRRYKAQIKQSISEAINKRSVTDVDSGESVSIPTEDISEPMFHQG 67 R N + +++ R + I S G P D P Sbjct: 435 RRNPFDAPGLDQDR----LEQAISDSTPPDDPDDDPDGGAPGRPDGGPNSDGD-PSGDGS 489 Query: 68 RGGLRHRVHPGNDHFVQNDRIERPQGGGGGSGSGQGQASQDGEGQDEFVFQISKDEYLDL 127 R D + +R G D + Sbjct: 490 PDDARDPGARDPDARDSDLTADRDANGSAPGDRAPDGGLSDSPADGNPQRVRGNGSAENT 549 Query: 128 LFEDLALPNLKQNQQRQLTEYKTHRAGYTANGVPANISVVRS-LQNSLARRTAMTAGKRR 186 + AL + + ++ RS + R A R Sbjct: 550 GDPENALSGDAPTGDVATSSTPYKARLLSVQSAGPGVAGRRSRAITEVGRVVGDRARVGR 609 Query: 187 ELHALEENLAIISNSEPAQLLEEERLRKEIAELRAKIERVPFIDTFDLRYKNYEKRPDPS 246 E LA + + P Q + D+R E R Sbjct: 610 ESGRPH-LLATLRAAAPHQHSRGRSG------------PGLAVHPTDVRLGVREGRE--- 653 Query: 247 SQAVMFCLMDVSGSMDQ-STKDMAKRFY-ILLYLFLSRTYKNVEVVYIRH-------HTQ 297 ++ +D SGSM K LL R K V + R T Sbjct: 654 -GNLVLFCVDASGSMGTKRRMSEVKTAIVSLLLDAYQRRDKVGLVTFARSQATVALPPTG 712 Query: 298 AKEVDEHEFFYSQETGGTIVSSALKLMDEVVK-ERYNPAQWNIYAAQASDGDNWADDSPL 356 + E G T ++ L +V++ + +DG +S Sbjct: 713 SVETAVRRLESLPTGGRTPLAEGLVRAADVLRIAAIRDPRRRPLLVLVTDGRATHGESAF 772 Query: 357 CHEILAKK 364 A + Sbjct: 773 SRARQAAE 780 >UniRef50_D0LD98 von Willebrand factor type A n=1 Tax=Gordonia bronchialis DSM 43247 RepID=D0LD98_GORB4 Length = 423 Score = 91.5 bits (225), Expect = 5e-17, Method: Composition-based stats. Identities = 30/203 (14%), Positives = 47/203 (23%), Gaps = 12/203 (5%) Query: 234 LRYKNYEKRPDPSSQAVMFCLMDVSGSMDQSTKDMAKRFYILLYLFLSRTYKNVEVVYIR 293 L E + + A + ++D SGSM A+R + L V + Sbjct: 24 LEIVAPEGKVTDRAPAALQVVLDRSGSMSGPPLAGAQRALAGVIGQLDPRDVFGVVTFDD 83 Query: 294 HHTQAKEVDEHEFF--------YSQETGGTIVSSALKLMDEVVKERYNPAQWN-IYAAQA 344 G T +SS + ++ A Sbjct: 84 DAQVVLPAAPLADKARAVDAVGSIVPGGCTDLSSGYLRGLQELRRATASAGIRGGTVLVI 143 Query: 345 SDGD-NWADDSPLCHEIL-AKKLLPVVRYYSYIEITRRAHQTLWREYEHLQSTFDNFAMQ 402 SDG N + AK + + +TL + FA Sbjct: 144 SDGHVNRGIRDLDEFASITAKAAADGIITSTLGYGRG-YDETLLSAIARSGNGNHVFADD 202 Query: 403 HIRDQDDIYPVFRELFHKQNATA 425 I L K Sbjct: 203 PDAAGAAIAGEVDGLLSKSAQAV 225 >UniRef50_C5BKZ8 von Willebrand factor type A domain protein n=5 Tax=Gammaproteobacteria RepID=C5BKZ8_TERTT Length = 347 Score = 91.5 bits (225), Expect = 5e-17, Method: Composition-based stats. Identities = 28/215 (13%), Positives = 60/215 (27%), Gaps = 41/215 (19%) Query: 240 EKRPDPSSQAVMFCLMDVSGSMDQSTKD-MAKRFYILLYLFL--------SRTYKNVEVV 290 E P++ + +D+SGSM K+ +L + + + ++ Sbjct: 82 EPVTLPATGRDLLLAVDISGSMKTPDMVVQDKQIARILVVKYVVNEFIERRESDRLGLIL 141 Query: 291 YIRHH------TQAKEVDEHEFFYSQ---ETGGTIVSSALKLMDEVVKERYNPAQWNIYA 341 + T ++ +Q T + A+ L + ++ER Sbjct: 142 FGSQAYLQAPLTFDRKTVSTLLDEAQLGFAGEQTAIGDAVGLAIKRLRERP---ASQRVL 198 Query: 342 AQASDGDNW-ADDSPLCHEILAKKLLPVVRYYSYI----------EIT--------RRAH 382 +DG N + +P LAK+ + Sbjct: 199 ILLTDGANTAGEVAPRQAADLAKQAGIKIYTVGVGADQMEQRMGLFGGFSRTVNPSSDLD 258 Query: 383 QTLWREYEHLQSTFDNFAMQHIRDQDDIYPVFREL 417 + R F ++ ++ IY +L Sbjct: 259 EDTLRYMAETTGGLY-FRARNPQELQAIYEELDKL 292 >UniRef50_C8NWP6 Putative uncharacterized protein n=1 Tax=Corynebacterium genitalium ATCC 33030 RepID=C8NWP6_9CORY Length = 1152 Score = 91.5 bits (225), Expect = 5e-17, Method: Composition-based stats. Identities = 44/419 (10%), Positives = 106/419 (25%), Gaps = 36/419 (8%) Query: 8 RLNGKNKSMVNRQR-FLRRYKAQIKQSISEAINKRSVTDVDSGESVSIPTE----DISEP 62 R + +R R + + + + + + + P + + Sbjct: 740 RERLMDAVGADRARPVNVPVEELGRWAADDIAAWERLQRLGLASAALTPAQRWSLVLGRR 799 Query: 63 MFHQGRGGLRHRVHPGNDHFVQNDRIERPQGGGGGSGSGQGQASQDGEGQDEFVFQISKD 122 R + + + + G GG + + DE ++ Sbjct: 800 REENPSPTQRRLARSLDQLYGRGEGEGSTAGSMGGRAGNEPPYPTARDWVDELDALFGEE 859 Query: 123 EYLDLLFEDL------ALPNLKQNQQRQLTEYKTHRAGYTANGVPANISVVRSLQNSLAR 176 D+ + A+ L + Q R E + + + +R + + Sbjct: 860 VREDIAATAVDTRHPYAMELLTETQPRASVELLSDVLTLAGGMPESVLDKLRPVLRRMVE 919 Query: 177 RTAMTAGKRRELHALEENLAIISNSEPAQLLEEERLRKEIAELRAKIERVPFIDTFDLRY 236 + S +L + + + + P + + Sbjct: 920 ELTQVLASQLRPALRGLQGWRPSTRPSPELDPLSTIHRNLRHAVLDSDGAPQLVVATPIF 979 Query: 237 KNYEKRPDPSSQAVMFCLMDVSGSMDQSTKDMAKRFYILLYLFLSRTYKNVEVVYIRHHT 296 + S+ + ++DVS SM+ S + L + + V ++ T Sbjct: 980 RQPIA---KRSEWHVIVVVDVSASMEPS------TVFAALTASILSGVDALSVTFLAFST 1030 Query: 297 QAKEVDEHEFFYSQE------TGGTIVSSALKLMDEVVKERYNPAQWNIYAAQASDGDNW 350 + ++ H GGT ++ AL + E V SD D + Sbjct: 1031 EVIDLSGHVSDPLSLLLEIHIGGGTNIAGALAVAHEHVTVPSRT-----LLITISDFDEY 1085 Query: 351 ADDSPLCHEILAKKLLPVVRYYSYI----EITRRAHQTLWREYEHLQSTFDNFAMQHIR 405 + VR R + + + + + + Sbjct: 1086 G-SVERLLARVQALNNAGVRLLGCAALDDTGQARYNVGIAGQLADVGMAVSAVSPTALA 1143 >UniRef50_Q31JK3 Type A von Willebrand factor-like n=1 Tax=Thiomicrospira crunogena XCL-2 RepID=Q31JK3_THICR Length = 349 Score = 91.5 bits (225), Expect = 6e-17, Method: Composition-based stats. Identities = 32/284 (11%), Positives = 75/284 (26%), Gaps = 42/284 (14%) Query: 170 LQNSLARRTAMTAGKRRELHALEENLAIISNSEPAQLLEEERLRKEIAELRAKIERVPFI 229 L L + H ++ + ++ + + V Sbjct: 29 LIRILLKPAVQRQTPLLAPHLMQRL--RHTPQPQFVQANRRSIKIPLTGIFL-WSLVVLA 85 Query: 230 DTFDLRYKNYEKRPDPSSQAVMFCLMDVSGSMDQSTKDM----------AKRFYILLYLF 279 + + P +S + +D+SGSM+++ + K Sbjct: 86 AMRPVWF--LNTTPFQASGKDLMLAVDLSGSMEKTDMPLRGVEVDRLTAVKSVVKNFIQK 143 Query: 280 LSRTYKNVEVVY---------IRHHTQAKEVDEHEFFYSQETGGTIVSSALKLMDEVVKE 330 + + VV+ + + E +E T + A+ + + + + Sbjct: 144 -RQGDRMGLVVFGSQAFLQSPLTYDLNTVETLLNETEIGMAGNNTAIGDAIGIALKHLHQ 202 Query: 331 RYNPAQWNIYAAQASDGDNW-ADDSPLCHEILAKKLLPVVRYYSYIE------------I 377 +DG N PL A+++ + + Sbjct: 203 NSEKKA---VLILLTDGSNTAGAVQPLDAAKQAQEMGLKIYTIGIGQNQATGLDAFIFGP 259 Query: 378 TRRAHQTLWREYEHLQSTFDNFAMQHIRDQDDIYPVFRELFHKQ 421 R T ++ L F + ++IY + +L Q Sbjct: 260 NRNMDTTTLQKIAELTQGRF-FMAKDTNQLNEIYQLIDQLEASQ 302 >UniRef50_Q22NG1 von Willebrand factor type A domain containing protein n=1 Tax=Tetrahymena thermophila SB210 RepID=Q22NG1_TETTH Length = 821 Score = 91.5 bits (225), Expect = 6e-17, Method: Composition-based stats. Identities = 26/192 (13%), Positives = 54/192 (28%), Gaps = 20/192 (10%) Query: 244 DPSSQAVMFCLMDVSGSMDQSTKDMAKRFYILLYLFLSRTYKNVEVVYIR---------- 293 + S+A F L+D SGSM + A+ I L + + Sbjct: 332 EKHSKAQFFFLIDRSGSMC-TIFQKARDTLIEFLQRLPDDSYFNVISFGSGYQFLFEEAK 390 Query: 294 -HHTQAKEVDEHEFFYSQET-GGTIVSSALKLMDEVVKERYNPAQWNIYAAQASDGDNWA 351 + Q+ + + GGT + L+ ++ + + + + +DG Sbjct: 391 KKNKQSMKSALEQISKFSADMGGTEIYQPLE---KIFQCKNVNDLYQMQIFLLTDGQ--- 444 Query: 352 DDSPLCHEILAKKLLPVVRYYSYIEITRRAHQTLWREYEHLQSTFDNFAMQHIRDQDDIY 411 P L + R + + + L R + ++ + Sbjct: 445 VSQPDMVVQLIRNNSHKARVHCIGLGSG-VDKQLLRRCSESGRGANRQVDNASELKEVVI 503 Query: 412 PVFRELFHKQNA 423 V F Sbjct: 504 NVLENSFTPSYT 515 >UniRef50_C4V3L6 Magnesium chelatase n=2 Tax=Selenomonas RepID=C4V3L6_9FIRM Length = 636 Score = 91.1 bits (224), Expect = 6e-17, Method: Composition-based stats. Identities = 47/347 (13%), Positives = 92/347 (26%), Gaps = 25/347 (7%) Query: 37 AINKRSVTDVDSGESVSIPTEDISEPMFHQGRGGLRHRVHPGNDHFVQNDRIERPQGGGG 96 I T +G S +P + F R + G Sbjct: 247 LIEAACATAALAGRSFVMPADVEEAAEFVLVHRMSRPQEEQ--QPPPDAGNAPEQGGNDM 304 Query: 97 GSGSGQGQASQDGEGQDEFVFQISKDEYLDLLFEDLALPNLKQNQQRQLTEYKTHRAGYT 156 + QD G + S++E + + P+ + + Sbjct: 305 PDEPQEAPPPQDDGGAESPHGASSENESQNAPQDGADDPSPPEEAHEDGDDRVAAPLE-- 362 Query: 157 ANGVPANISVVRSLQNSLARRTAMTAGKRRELHALEENLAIISNSEPAQLLEEERLRKEI 216 N+ SL + R +GKR + + + P + Sbjct: 363 ------NVMARLSLLSMTMRAQGRKSGKRDIVQTHTADGRCLRTELPHSGARLDLALSAT 416 Query: 217 AELRAKIERVPFIDTFD-LRYKNYEK-RPDPSSQAVMFCLMDVSGSMDQ-STKDMAKRFY 273 A +R +R ++ + A + L+D SGSM M K Sbjct: 417 LRAAAPYQRARQRTQTVVIRPEDVRVWVRAKRAAANILFLVDASGSMGARERMRMVKGAI 476 Query: 274 ILLY-LFLSRTYKNVEVVY-------IRHHTQAKEVDEHEFFYSQETGGTIVSSALKLMD 325 + L + + + + T++ E+ E + G T ++ L Sbjct: 477 LALLQEAYQKRDCVGLIAFRRDRAETLLPMTRSVELAEKQLRDLPTGGRTPLAEGLACAV 536 Query: 326 EVVKERYNPAQWNIYAAQASDGDNW----ADDSPLCHEILAKKLLPV 368 + ++E +DG DD A+++ Sbjct: 537 QTLRELERRGSEKTVLILITDGRTNTARDGDDGVQRALRAAEEIAGT 583 >UniRef50_B5W7H4 von Willebrand factor type A n=1 Tax=Arthrospira maxima CS-328 RepID=B5W7H4_SPIMA Length = 488 Score = 91.1 bits (224), Expect = 6e-17, Method: Composition-based stats. Identities = 23/189 (12%), Positives = 46/189 (24%), Gaps = 21/189 (11%) Query: 246 SSQAVMFCLMDVSGSMDQSTKDMAKRFYILLY--LFLSRTYKNVEVVYIRHH------TQ 297 + + L+D SGSM + L R V + T+ Sbjct: 48 TKPKAVVLLIDTSGSMSGQKLREVQTAASEFVSRQNLKRHD-LAVVEFSSRASVVADFTR 106 Query: 298 AKEVDEHEFFYSQETGGTIVSSALKLMDEVVKERYNPAQWNIYAAQASDGDNWADDSPLC 357 + + GGT +S L V++ + +DG Sbjct: 107 NETELQQAIARLSARGGTNLSEGFNLATSVLQN----SDRTPNILLFTDGVPNNPPMAAS 162 Query: 358 HEILAKKLLPVVRYYSYIEITRRAHQTLWREYEHLQSTFDNFAMQHIRDQDDIYPVFRE- 416 + + + L D + D D + + Sbjct: 163 IAQQIR--ASGINLVAVGTG-----DAQINYLTALTGDPDLVFYANFGDLDRAFRGAEKA 215 Query: 417 LFHKQNATA 425 ++ +Q + Sbjct: 216 IYGQQLVES 224 >UniRef50_UPI000175837C PREDICTED: similar to inter-alpha-trypsin inhibitor family heavy chain-related protein n=2 Tax=Tribolium castaneum RepID=UPI000175837C Length = 750 Score = 91.1 bits (224), Expect = 6e-17, Method: Composition-based stats. Identities = 32/380 (8%), Positives = 84/380 (22%), Gaps = 66/380 (17%) Query: 110 EGQDEFVFQISKDEYLDLLFEDLAL------PNLKQNQQRQLTEYKTHRAGYTAN----- 158 E E +F+++ +E L L + + ++ +T + Sbjct: 143 EPSSEIIFRLTYEELLQRQNGQYELIINVHPGQIVDDLCVEVKIDETRPLTFVKTPSLCT 202 Query: 159 GVPANISVVRSLQNSLARRTAMTAGKRRELHALEENLAIISNSEPAQLLEEERLRKEIAE 218 G + + + K + E+ + + Sbjct: 203 GNEISDDKPELDPCAKTEMINANSAKVKFNPDKEQQKKYAELLGSKDQGLAGQFVVQYDV 262 Query: 219 LRAKIERVPFIDTFDLRYKNYEKRPDPSSQAVMFCLMDVSGSMDQSTKDMAKRFYILLYL 278 R + + + + + ++D S SM + + + Sbjct: 263 ERDPKGGEVLLRDGYFVHF-FAPSGLQTFPKHVVLVLDHSASMRGRKHEQLMQAMDKILS 321 Query: 279 FLSRTYKNVEVVYIRHHTQAKEVDEHEFFYSQET-------------------------- 312 L+ V + +++ +F Sbjct: 322 DLNPDDLFHVVCF-SEDVSVWNLEKKQFDLIDFMEKFDYENLDSCLTELNLGNAVQFTED 380 Query: 313 ---------------GGTIVSSALKLMDEVVKERYNPA-------QWNIYAAQASDG-DN 349 G T + L + +V+ + +DG N Sbjct: 381 NIKKAKGIKNDDMHMGCTNIIGGLVVGLFLVRRTLKKNYEQNVETKHQPMIILLTDGLPN 440 Query: 350 WADDSPLCHEILAKKLLPVVR---YYSYIEITRRAHQTLWREYEHLQSTFDNFAMQHIRD 406 P+ + K+ +S A + ++ F + Sbjct: 441 VGLSDPVEITKIVTKINQGTNRAAIFSLSFG-EDADKNFLKKLSAQNLGFSRHIYEAADA 499 Query: 407 QDDIYPVFRELFHKQNATAK 426 + +R +F + Sbjct: 500 ALQLQNFYRTVFSPLLRDVR 519 >UniRef50_Q0AW90 Conserved putative chloride channel n=1 Tax=Syntrophomonas wolfei subsp. wolfei str. Goettingen RepID=Q0AW90_SYNWW Length = 951 Score = 91.1 bits (224), Expect = 6e-17, Method: Composition-based stats. Identities = 24/196 (12%), Positives = 52/196 (26%), Gaps = 23/196 (11%) Query: 243 PDPSSQAVMFCLMDVSGSMDQS-----TKDMAKRFYILLYLFLSRTYKNVEVVYIRHHTQ 297 + ++D SGSM + ++AK I L V + Sbjct: 401 KKEIPSLGLVLVIDKSGSMSEGSGGYSKVELAKEAAIQATSILGPLDMAGVVAFDDTAQW 460 Query: 298 --------AKEVDEHEFFYSQETGGTIVSSALKLMDEVVKERYNPAQWNIYAAQASDGDN 349 K+ + + + GGT + AL L +K+ + + +DG + Sbjct: 461 VVEFQAVKDKDAIQDDIATIRADGGTSIYPALALAYTALKDAHTKF---KHIILLTDGQS 517 Query: 350 WADDSPLCHEILAKKLLPVVRYYSYIEITRRAHQTLWREYEHLQSTFDNFAMQHIRDQ-- 407 + + + E A L + F+ + Sbjct: 518 ATTGDYYFLSRRMARAGITMSTVAVGEG---ADTLLLEQLAAWGQGRYYFSDEISNIPRI 574 Query: 408 --DDIYPVFRELFHKQ 421 + + ++ Sbjct: 575 FTKETMKAIKSYLVEE 590 Score = 53.4 bits (126), Expect = 2e-05, Method: Composition-based stats. Identities = 23/125 (18%), Positives = 45/125 (36%), Gaps = 10/125 (8%) Query: 245 PSSQAVMFCLMDVSGSMDQSTKDMAKRFYILLYLFLSRTYKNVEVVYIRHHTQAKEVDE- 303 P + + +D+S S + S ++ A+ F + VV+ + + + + Sbjct: 62 PLERQSVVFAVDLSASCENS-REAAENFIREALKHKKADDQAGVVVFGGNALVDQSLSDS 120 Query: 304 ---HEFFYSQETGGTIVSSALKLMDEVVKERYNPAQWNIYAAQASDGDNWADDSPLCHEI 360 H + + ALKL D ++ P ++ SDG D+ Sbjct: 121 SRLHPIESLVDRNYSKPEQALKLADALM-----PDNFSRRVVLLSDGRQNDGDALKEAAY 175 Query: 361 LAKKL 365 LA+K Sbjct: 176 LAEKK 180 >UniRef50_B5JCH3 Putative uncharacterized protein (Fragment) n=1 Tax=Octadecabacter antarcticus 307 RepID=B5JCH3_9RHOB Length = 197 Score = 91.1 bits (224), Expect = 7e-17, Method: Composition-based stats. Identities = 23/185 (12%), Positives = 49/185 (26%), Gaps = 4/185 (2%) Query: 82 FVQNDRIERPQGGGGGSGSGQGQASQDGEGQDEFVFQISKDEYLDLLFEDLALPNLKQNQ 141 ++ + + A D G + + S D ++ +L + Sbjct: 17 RDEDATLVPQAAPAPQQLTRAAPAGNDMAGNLRSMAEPSNDGFVHVLRDGSTFYEEYDET 76 Query: 142 QRQLTEYKTHRAGYTANGVPANISVVRSLQNSLARRTAMTAGKRRELHALEENLAIISNS 201 + +I V + + + +EE + + Sbjct: 77 FAN-DTPNPLKITSDEPVSTFSIDVDTAAYALIRSSLTRGQLPPTDAVRIEEMINYFPYA 135 Query: 202 EPAQLLEEERLRKEIAELRAKIERVPFIDTFDLRYKNYEKRPDPSSQAVMFCLMDVSGSM 261 PA E R I + ++ + P + L+D SGSM Sbjct: 136 YPAPEGEA-PFRPTINVFETPWNADTQLVHIGIQGEMPAIEDRP--PLNLVFLIDTSGSM 192 Query: 262 DQSTK 266 + + K Sbjct: 193 ESADK 197 >UniRef50_B5JQC2 Vault protein inter-alpha-trypsin n=1 Tax=Verrucomicrobiae bacterium DG1235 RepID=B5JQC2_9BACT Length = 808 Score = 91.1 bits (224), Expect = 7e-17, Method: Composition-based stats. Identities = 34/252 (13%), Positives = 63/252 (25%), Gaps = 22/252 (8%) Query: 184 KRRELHALEENLAIISNSEPAQLLEEERLRKEIAELRAKIERVPFIDTFDLRYKNYEKRP 243 L+ E AI+ + EE L + L + + + P Sbjct: 218 TGELLYRFESQGAILDEDFVFYYMLEENLPGRLEVLTYRENEDKPGTFMMVMTPGVDLHP 277 Query: 244 DPSSQAVMFCLMDVSGSMDQSTKDMAKRFYILLYLFLSRTYKNVEVVY-----------I 292 A +DVSGSM +A L + V + + Sbjct: 278 L-EGGADFVFALDVSGSMQGKLHTLA-SGVKKAIGQLKPEDRFRVVAFNNTAFDLNRGWV 335 Query: 293 RHHTQAKEVDEHEFFYSQETGGTIVSSALKLMDEVVKERYNPAQWNIYAAQASDGD-NWA 351 GGT V + + L E + A +DG N Sbjct: 336 SATEANLRETFARLDQLNSNGGTNVYAGVHLALERLD-----ADRVATLILVTDGVTNQG 390 Query: 352 DDSPLCHEILAKKLLPVVRYYSYIEITRRAHQTLWREYEHLQSTFDNFAMQHIRDQDDIY 411 P L +R+Y ++ ++ L + ++ Sbjct: 391 IVDPKAFYKLM--HKQDLRFYGFLLGNS-SNWPLMQLMCDASGGSYRAVSNSDDIIGEVM 447 Query: 412 PVFRELFHKQNA 423 ++ ++ Sbjct: 448 IAKNKIVYESMR 459 >UniRef50_C5GK44 U-box domain-containing protein n=2 Tax=Ajellomyces dermatitidis RepID=C5GK44_AJEDR Length = 766 Score = 91.1 bits (224), Expect = 7e-17, Method: Composition-based stats. Identities = 31/291 (10%), Positives = 68/291 (23%), Gaps = 51/291 (17%) Query: 184 KRRELHALEENLAIISNSEPAQLLEEERLRKEIAELRAK------IERVPFIDTFDLRYK 237 + + + E++ II + P + + + Sbjct: 3 PAQSVASTEDDFEIIDDQIPIRSRPSTGTNIAGERNPNEVAVQLHPLPDTNSMILSVHPP 62 Query: 238 NYEKRPDPSSQAVMFCLMDVSGSMDQST------------------KDMAKRFYILLYLF 279 + ++ + +D+S SM S D+ K + Sbjct: 63 LHPEKELRHVPCDIVLCIDISYSMSSSAPLPTTDDSGKPEDTGLSVLDLTKHAARTIIET 122 Query: 280 LSRTYKNVEVVYIRHHTQAKEVDEHEFFYSQET----------GGTIVSSALKLMDEVVK 329 L+ + V + ++ + T + LKL E ++ Sbjct: 123 LNDNDRLGVVAFSTDAEVVYKISNMNEDNKKAALKAVEALWPLSSTNLWHGLKLSLEALE 182 Query: 330 ERYNPAQWNIYAAQASDGDNWADDSPLCHEILA--------------KKLLPVVRYYSYI 375 E Q +DG S + H + K LP++ + + Sbjct: 183 EVTPIPQNVQALYILTDGMYRIVRSRVPHANASKFRHAKSYVSKAGQKDRLPMIHTFGFG 242 Query: 376 EITRRAHQTLWREYEHLQSTFDNFAMQHIRDQDDIYPVFRELFHKQNATAK 426 L + + +F L+ A Sbjct: 243 Y---YIRSGLLQAISEVGGGTYSFIPDAGMIGTVFVHAIANLYTTFATQAM 290 >UniRef50_UPI0000E49DB4 PREDICTED: similar to poly (ADP-ribose) polymerase family, member 4 n=2 Tax=Strongylocentrotus purpuratus RepID=UPI0000E49DB4 Length = 1119 Score = 91.1 bits (224), Expect = 8e-17, Method: Composition-based stats. Identities = 42/329 (12%), Positives = 97/329 (29%), Gaps = 21/329 (6%) Query: 115 FVFQISKDEYLDLLFEDLAL--PNLKQN-QQRQLTEYKTHRAGYTANGVPANISVVRSLQ 171 +V +++++E + F L P L++ + +LTE G +N S+ ++ Sbjct: 609 YVTELAEEEGGKMAFYLLGSVAPWLQEEIRGTKLTEETQALTGVASNETERKESLQITM- 667 Query: 172 NSLARRTAMTAGKRRELHALEENLAIISNSEPAQLLEEERLRKEIAELRAKIERVPFIDT 231 L+ + + + +E A ++ + QL + +L I F + Sbjct: 668 EMLSNISLVESPTHVIKVKRKEMKATVTLGDRVQLGDGFQLLISIIHPNTPRFWTEFDPS 727 Query: 232 FD----LRYKNYEKRPDPSSQAVMFCLMDVSGSMDQSTKDMAKRFYILLYLFLSRTYKNV 287 + + + + + + L+D S SM K AK+ ++ L + Sbjct: 728 CESYASMAVFYPTIQSELIADPEVVLLLDCSTSMKGEPKQDAKKICKMILQSLPEKSRFN 787 Query: 288 EVVYIR-----HHTQAKEVDEHEFFYSQETGGTIVSSALKLMDEVVKERYNPAQWN--IY 340 + + T + G ++ N Sbjct: 788 VITFGTDFTELFPTVEPVGQRQLLEALEFIEGARSVGGSSEAWRPLRSLSLLPMMNSARN 847 Query: 341 AAQASDGDNWADDSPLCHEILAKKLLPVVRYYSYIEITRRAHQTLWREYEHLQSTFDNFA 400 SDG + +A K V R ++ + ++ + R + Sbjct: 848 VLLVSDGH---LTNEKLTLEIASKYKHVNRIFTCAV-SSAGNRHILRALADVSGGAFESF 903 Query: 401 MQHIRDQDD--IYPVFRELFHKQNATAKG 427 + + + I + K Sbjct: 904 SPKTKSKWEGKIAAQIDRCRQPSVSCVKA 932 >UniRef50_B0VJ58 BatA protein n=1 Tax=Candidatus Cloacamonas acidaminovorans RepID=B0VJ58_9BACT Length = 270 Score = 91.1 bits (224), Expect = 8e-17, Method: Composition-based stats. Identities = 25/220 (11%), Positives = 44/220 (20%), Gaps = 39/220 (17%) Query: 239 YEKRPDPSSQAVMFCLMDVSGSMDQSTKD------MAKRFYILLYLFLSRTYKNVEVVYI 292 + R + + +D+SGSM A + V + Sbjct: 15 IKTRDLSNKGVDIVMAIDISGSMLAMDFAPKNRLSAAVSVAKDFVKR-RPNDRFGLVAFS 73 Query: 293 RHH------TQAKEVDEHEFFYSQET---GGTIVSSALKLMDEVVKERYNPAQWNIYAAQ 343 + T + + T + L +K + Sbjct: 74 EYALTQVPLTFDHLAMLNSLDKLKVNEEASATAIGMGLAKAVARLKNSTAK---SKVIIL 130 Query: 344 ASDGD-NWADDSPLCHEILAKKLLPVVRYYSYIEITR-------------------RAHQ 383 +DG N + PL +AK+L V Sbjct: 131 ITDGVSNTGEIDPLTAAGMAKELGIKVYPIGVGSKGLVPFPYSDPIFGTRYINTYIDLDM 190 Query: 384 TLWREYEHLQSTFDNFAMQHIRDQDDIYPVFRELFHKQNA 423 + T + DI L Sbjct: 191 ETLNKIAETTGTGKAALATDAKGLADIMNEIDRLEKTLFT 230 >UniRef50_B6HQ22 Pc22g19800 protein n=1 Tax=Penicillium chrysogenum Wisconsin 54-1255 RepID=B6HQ22_PENCW Length = 896 Score = 90.7 bits (223), Expect = 8e-17, Method: Composition-based stats. Identities = 34/262 (12%), Positives = 66/262 (25%), Gaps = 24/262 (9%) Query: 177 RTAMTAGKRRELHALEENLAIISNSEPAQLLEEERLRKEIAELRAK----IERVPFIDTF 232 + + E N A I E + E+ + + + Sbjct: 201 QVDLGRTADMPESTFESNYASIKLRENVTIDEDFVITVNADKQDLPFAFLETHPTLPNQK 260 Query: 233 DLRYKNYEKRPDPSSQAVMFCLMDVSGSMDQSTKDMAKRFYILLYLFLSRTYKNVEVVYI 292 L K P + + ++D SGSM + L L + + Sbjct: 261 ALMVSLVPKFSLPPDLSEIVFVVDRSGSM-TDNMHTLRSALGLFLKSLPLGVPFNLISFG 319 Query: 293 R-----------HHTQAKEVDEHEFFYSQET-GGTIVSSALKLMDEVVKERYNPAQWNIY 340 ++ E Q GGT + S L+ E +RY Sbjct: 320 SSFEAIWARSKVSTRESLEEALQHTKNIQADLGGTEILSGLEAAVE---KRYQDKVLE-- 374 Query: 341 AAQASDGDNWADDSPLCHEILAKKLLPVVRYYSYIEITRRAHQTLWREYEHLQSTFDNFA 400 +DG+ W A + R+++ +H L F Sbjct: 375 VLVLTDGEVWNQSEVFDLVNQANQQ-HSTRFFTLGLGDSVSHS-LINGISRAGKGFTQTV 432 Query: 401 MQHIRDQDDIYPVFRELFHKQN 422 + + + + + + Sbjct: 433 LNNEDLNKTVVRMLKGALMPRL 454 >UniRef50_Q9ZGE6 Magnesium-chelatase 67 kDa subunit n=2 Tax=Heliobacteriaceae RepID=BCHD_HELMO Length = 666 Score = 90.7 bits (223), Expect = 8e-17, Method: Composition-based stats. Identities = 54/374 (14%), Positives = 109/374 (29%), Gaps = 48/374 (12%) Query: 48 SGESVSIPTEDISEPMFHQGRGGLRHRVHPGNDHFVQNDRIERPQGGGGGSGSGQGQASQ 107 G + P + F R V ++ + P Sbjct: 301 QGRTAVEPIDLAVAVEFVIKP---RQTVDLPDEEEQM--QPPPPPPPPPPPPEPDKPDDP 355 Query: 108 DGEGQDEFVFQISKDEYLDLLFEDLALPNLKQNQQRQLTEYKTHRAGYTANGVPANISVV 167 + + + + + F+ +P + Q + R G A+G ++ Sbjct: 356 ETPPDEAPKDEQTLQLPEEFFFDAEEVPMEDELLSLQNKVQRQARGG--AHGKQKSLERG 413 Query: 168 RSLQNSLARRTAMTAGKRRELHALEENLAIISNSEPAQLLEEERLRKEIAELRAKIERVP 227 R + L ++ A + + P Q E + +R Sbjct: 414 RYARALL---------PPPGKNSRVAVDATLRAAAPYQRQRRESGQ--------YGDRQV 456 Query: 228 FIDTFDLRYKNYEKRPDPSSQAVMFCLMDVSGSMDQSTKDMAKRFYILLYL-FLSRTYKN 286 + D+R K + S A++ ++D SGSM + AK +L K Sbjct: 457 IVTNSDIRAKQF----VRKSGALIIFVVDASGSMAFNRMSSAKGAVSVLLNEAYVNRDKV 512 Query: 287 VEVVYIRH-------HTQAKEVDEHEFFYSQETGGTIVSSALKLMDEVVKERYNPAQWNI 339 +++ T++ E+ + F GG+ ++ A+ EV + Sbjct: 513 ALIIFRGQQAETLVPPTRSVELAKKRFDQVPVGGGSPLAGAIAQAIEVGVNSIGSDVGQV 572 Query: 340 YAAQASDGD-NWADD---SPLCHEILAKKLLPVVRY-----YSYIEITRRAHQT---LWR 387 +DG N D P E L +++L + R +S + I T + Sbjct: 573 IITLITDGRGNVPMDPQAGPKNREQLNEEILALSRLVPENGFSMLVIDTANKFTSTGFAK 632 Query: 388 EYEHLQSTFDNFAM 401 + + Sbjct: 633 KIADAAFAQYYYLP 646 >UniRef50_D1ZZE6 Putative uncharacterized protein GLEAN_08029 n=1 Tax=Tribolium castaneum RepID=D1ZZE6_TRICA Length = 1868 Score = 90.7 bits (223), Expect = 9e-17, Method: Composition-based stats. Identities = 22/187 (11%), Positives = 56/187 (29%), Gaps = 24/187 (12%) Query: 246 SSQAVMFCLMDVSGSMDQSTKDMAKRFYILLYLFLSRTYKNVEVVYIRHHTQAKEVDEHE 305 + + + L+D S S+ ++ +F L ++ Y + V + + + Sbjct: 75 TQKLELIFLIDGSSSVGETNFRSELKFVKKLLSDVTVDYNHTRVAIATFSSSVSKNIDQI 134 Query: 306 FFYSQE-----------------TGGTIVSSALKLMDEVVKERYNPAQWNIYAAQASDGD 348 +E GGT A ++ E+ + N ++ +DG Sbjct: 135 SDPRKENNKCFLLSKLLSKIEYTGGGTNTLKAFEVAKEIFTQSRNDSE--KVLFLITDGF 192 Query: 349 NWADDSPLCHEILAKKLLPVVRYYSYIEITRRAHQTLWREYEHLQSTFDNFAMQHIRDQD 408 + D L K V+ ++ + E ++ + + + Sbjct: 193 SNGGDPIPLAAELKKDQ---VKIFTIGIANGNYKE--LYELASTPGEIYSYLLDSFEEFE 247 Query: 409 DIYPVFR 415 + + Sbjct: 248 SLARHLK 254 >UniRef50_A6QCY4 von Willebrand factor type A domain protein n=2 Tax=unclassified Epsilonproteobacteria RepID=A6QCY4_SULNB Length = 307 Score = 90.7 bits (223), Expect = 1e-16, Method: Composition-based stats. Identities = 30/207 (14%), Positives = 51/207 (24%), Gaps = 27/207 (13%) Query: 240 EKRPDPSSQAVMFCLMDVSGSMDQSTKDMAKRFYILLYLFL---------SRTYKNVEVV 290 + +D SGSM QS D RF L V+ Sbjct: 71 AAGNQHKKGRDLVLAIDASGSMAQSGFDEKDRFKTKYETTLDLSADFIKHRFDDNMGVVI 130 Query: 291 YIRHH------TQAKEVDEHEFFYSQ---ETGGTIVSSALKLMDEVVKERYNPAQWNIYA 341 + T E E + T + AL + + Sbjct: 131 FGTFAYTASPLTYDLEAMESMLKMTTVGIAGESTAIGDALMQAMRTL---SYGEAQSKAI 187 Query: 342 AQASDG-DNWADDSPLCHEILAKKLLPVVRYYSYIEITRRAHQTLWREYEHLQSTFDNFA 400 +DG N SP AK+ + + + L ++A Sbjct: 188 ILLTDGYHNAGRSSPKAAVAKAKEKGIKIYTIGVGK-SSDYDAALLDTIAKESGGK-SYA 245 Query: 401 MQHIRDQDDIYPVFRELFHKQNATAKG 427 ++Y +L + + +G Sbjct: 246 AASAAQLKEVYKEIDKL---EPSPVRG 269 >UniRef50_B0WHU4 Sushi n=3 Tax=Culicini RepID=B0WHU4_CULQU Length = 2239 Score = 90.7 bits (223), Expect = 1e-16, Method: Composition-based stats. Identities = 30/199 (15%), Positives = 52/199 (26%), Gaps = 29/199 (14%) Query: 237 KNYEKRPDPSSQAVMFCLMDVSGSMDQSTKDMAKRFYILLYLFLSRTYKNVEVVYIRHHT 296 K+ EK + + + L+D S S+ + +F L + +Y V I + Sbjct: 123 KSVEKIKTKNKRVDIVFLIDASSSVGRQNFASEIKFVKKLLSDFNVSYNYTRVAVITFSS 182 Query: 297 QAKEVDEHEFFYSQ---------------------ETGGTIVSSALKLMDEVVKERYNPA 335 Q K GGT ALK +E+ K + Sbjct: 183 QKKIF--RHIDQISQSVEDNDKCLLLNYQVPRIAFSGGGTYTYGALKEAEEIFKNARLDS 240 Query: 336 QWNIYAAQASDGDNWADDSPLCHEILAKKLLPVVRYYSYIEITRRAHQTLWREYEHLQST 395 + +DG + D L K V YS + Sbjct: 241 K--KIIFLITDGFSNGRDPIPLAGRLKKDN--NVVIYSIGIQSGNY--AELHAIASAPEG 294 Query: 396 FDNFAMQHIRDQDDIYPVF 414 + + + + Sbjct: 295 DHCYLLDSFDHFETLARKA 313 >UniRef50_A3HZP7 BatA protein n=1 Tax=Algoriphagus sp. PR1 RepID=A3HZP7_9SPHI Length = 347 Score = 90.3 bits (222), Expect = 1e-16, Method: Composition-based stats. Identities = 35/218 (16%), Positives = 62/218 (28%), Gaps = 37/218 (16%) Query: 237 KNYEKRPDPSSQAVMFCLMDVSGSMDQSTK-----DMAKRFYILLYLFLSRTYKNVEVVY 291 K+ E+ + + +MD+S SMD + AK I + VV+ Sbjct: 95 KSNERVEQFTEGIDIMLVMDISESMDLQDFKPNRLEAAKATAIDFING-RFGDRIGMVVF 153 Query: 292 IRHHTQAKEVDEHEF----------FYSQETGGTIVSSALKLMDEVVKERYNPAQWNIYA 341 + F E GT + SA+ +KE + Sbjct: 154 AGEAYSLAPLTNDYKLLTDLIQDISFNMMEAKGTAIGSAIASATNRMKES---ESASKVL 210 Query: 342 AQASDGD-NWADDSPLCHEILAKKLLPVVRYYSYI----------------EITRRAHQT 384 SDG+ N + PL LA L + + + +T Sbjct: 211 ILLSDGESNAGNVDPLFAAQLASALDIKIYTIAVGKDGMVPYGTDFFGRPQMVESYLDET 270 Query: 385 LWREYEHLQSTFDNFAMQHIRDQDDIYPVFRELFHKQN 422 RE + + F ++I+ + + Sbjct: 271 NLREIAKIGNGEF-FRASDGGTLNNIFDRIDTMEKAEI 307 >UniRef50_A7HHW8 Vault protein inter-alpha-trypsin domain protein n=1 Tax=Anaeromyxobacter sp. Fw109-5 RepID=A7HHW8_ANADF Length = 1362 Score = 90.3 bits (222), Expect = 1e-16, Method: Composition-based stats. Identities = 38/412 (9%), Positives = 93/412 (22%), Gaps = 41/412 (9%) Query: 26 YKAQIKQSISEAINKRSVTDVDSGESVSIPTEDISEPMFHQGRGGLRHRVHPGNDHFVQN 85 Y+A ++ + A+ ++ ++ + ++P + + Sbjct: 178 YEAAKREGRTAALTEQERPNLFTQSVANVPPGETVAVVLRYVHEVPFDGGRYAFHFPTTV 237 Query: 86 DRIERPQGGGGGSGSGQGQASQDGEGQDEFVFQISKDEYLDLLFEDLALPNLKQNQQRQL 145 P G + V S+ P ++ Sbjct: 238 GPRYVPGAALAPDAPGGARGPG-TSPDTGRVPDASRVTPP-------VSPPGTRSGHDVD 289 Query: 146 TEYKTHRAGYTANGVPANISVVRSLQNSLARRTAMTAGKRRELHALEENLAIISNSEPAQ 205 + G + VV L+ + AR A+ E + +++ Sbjct: 290 LLVRLVPGGAFDEVETRSHRVVTGLEPAGARLVAL-----AEDDRIPNKDFVLTWRPAGV 344 Query: 206 LLEEERLRKEIAELRAKIERVPFIDTFDLRYKNYEKRPDPSSQAVMFCLMDVSGSMDQST 265 + L + E+ ++ P + L+D SGSM + Sbjct: 345 VPGAHAL--------VQREKGEDFLMLFVQPPA-GVAPALVRPKELVFLVDKSGSMMGAP 395 Query: 266 KDMAKRFYILLYLFLSRTYKNVEVVYIRHHTQAKEV-----------DEHEFFYSQETGG 314 D + + V + E + + GG Sbjct: 396 FDRVRALVARALDAMGPDDTFQVVAFDGSAQAMSEAPLPATPSAIARAKEWLASLEGGGG 455 Query: 315 TIVSSALKLMDEVVKERYNPAQWNIYAAQASDGDNWADDSPLCHEILAKKLLPVVRYYSY 374 T + ++ + +DG + + L R + + Sbjct: 456 TEMLEGVRAAL----SPPEDPRRLRMVVFCTDG---FIGNEPEIIEAVEALRGRARVFGF 508 Query: 375 IEITRRAHQTLWREYEHLQSTFDNFAMQHIRDQDDIYPVFRELFHKQNATAK 426 ++ L + +F + + Sbjct: 509 GIG-SSVNRYLVEGVGRAGRGASEVVSLDEPPDAAVARLFARIDRPLLTDLE 559 >UniRef50_C7ZL29 Putative uncharacterized protein n=2 Tax=Nectriaceae RepID=C7ZL29_NECH7 Length = 923 Score = 90.3 bits (222), Expect = 1e-16, Method: Composition-based stats. Identities = 27/199 (13%), Positives = 56/199 (28%), Gaps = 20/199 (10%) Query: 239 YEKRPDPSSQAVMFCLMDVSGSMDQSTKDMAKRFYILLYLFLSRTYKNVEVVYIRH---- 294 K PS + + + D SGSM + D+ K + L K + H Sbjct: 268 VPKFKLPSEKPEIVFICDRSGSMGGTIPDL-KAALEIFLKSLPVGVKFNICSFGSHFSFL 326 Query: 295 ----HTQAKEVDEHEFFYSQE----TGGTIVSSALKLMDEVVKERYNPAQWNIYAAQASD 346 T +K+ + + + GGT + ++ + N+ +D Sbjct: 327 WDRSQTYSKDSLDKALRHIKSFDADFGGTEMYQPVEATFK-----KRYTDMNLEIFLLTD 381 Query: 347 GDNWADDSPLCHEI-LAKKLLPVVRYYSYIEITRRAHQTLWREYEHLQSTFDNFAMQHIR 405 G + D + VR +S + + +L + F + Sbjct: 382 GAIYDQDRLFELINHNVAESKGSVRVFSLGIGSGAS-TSLVEGVARAGNGFAQTVGHGEK 440 Query: 406 DQDDIYPVFRELFHKQNAT 424 + + + Sbjct: 441 MDKKVVRMLKGALFPHITD 459 >UniRef50_Q2W311 Putative uncharacterized protein n=1 Tax=Magnetospirillum magneticum AMB-1 RepID=Q2W311_MAGSA Length = 1171 Score = 90.3 bits (222), Expect = 1e-16, Method: Composition-based stats. Identities = 32/177 (18%), Positives = 53/177 (29%), Gaps = 16/177 (9%) Query: 249 AVMFCLMDVSGSMD---QSTKDMAKRFYILLYLFLSRTYKNVEVVYIRHHTQAKEV---- 301 + L+D S SM S M R LS + V + + + + Sbjct: 63 LDVVMLLDHSSSMGAAPGSPLQMMLRAAGNFLRQLSPDSRVAVVGFNQVPSVHCTLAATP 122 Query: 302 --DEHEFFYSQETGGTIVSSALKLMDEVVKERYNPAQWNIYAAQASDGDNWADDSPLCHE 359 G T +++AL E++ + SDG + + Sbjct: 123 AQARSALQAISPGGATSIAAALNQAVELLAHGRP--GMDKVVVLCSDGQDDIAEIADALA 180 Query: 360 ILAKKLLPVVRYYSYIEITRRAHQTLWREYEHLQSTFDNFAMQHIRDQDDIYPVFRE 416 L K +P VR + H T Q F + RD DD++ + Sbjct: 181 RL--KAIPSVRVLAVGFGDEVIHATFLAMVADRQD---YFHLTRARDMDDVFQRLAK 232 >UniRef50_C0QXK8 von Willebrand factor type A (VWA) domain containing protein n=2 Tax=Brachyspira RepID=C0QXK8_BRAHW Length = 289 Score = 90.3 bits (222), Expect = 1e-16, Method: Composition-based stats. Identities = 26/210 (12%), Positives = 53/210 (25%), Gaps = 39/210 (18%) Query: 248 QAVMFCLMDVSGSMDQSTK-----DMAKRFYILLYLFLSRTYKNVEVVYI------RHHT 296 + ++DVS SM + +K+ I K V + T Sbjct: 46 GVYISLVVDVSPSMMAEDMIPTRLEASKKTMIDFIKK-RNFDKISLVSFALRASVLSPAT 104 Query: 297 QAKEVDEHEFFYSQ--ETGGTIVSSALKLMDEVVKERYNPAQWNIYAAQASDGDNWADD- 353 E E + E G T + + +++ R +DG+N + + Sbjct: 105 FDYTSLEEEIKKIEIDEEGSTSIGLGIATAVDML--RSVKEDNEKIIILLTDGENNSGEI 162 Query: 354 SPLCHEILAKKLLPVVRYYSYIEITRRA---------------------HQTLWREYEHL 392 P +A + + ++ + Sbjct: 163 DPKLASEIASNFNIKIYTIGIGDANGSHAWVTYDDPNYGKRRIRADFTLNEESLIDIAAT 222 Query: 393 QSTFDNFAMQHIRDQDDIYPVFRELFHKQN 422 F ++ D++Y L K Sbjct: 223 TGGKY-FNAKNASALDNVYNTIDRLEKKPI 251 >UniRef50_UPI00005843FB PREDICTED: hypothetical protein n=1 Tax=Strongylocentrotus purpuratus RepID=UPI00005843FB Length = 429 Score = 90.3 bits (222), Expect = 1e-16, Method: Composition-based stats. Identities = 18/193 (9%), Positives = 49/193 (25%), Gaps = 24/193 (12%) Query: 251 MFCLMDVSGSMDQSTKDMAKRFYILLYLFLSRTYKNVEVVYIRHHTQAKE---------- 300 + ++DVS SM + K + L+ T + + +E Sbjct: 165 IVFVIDVSASMYGTKLSQTKEALKTMLDNLNPTDYFNIITFSDGVQYWRENNRLAPAQRR 224 Query: 301 ---VDEHEFFYSQETGGTIVSSALKLMDEVVKE----RYNPAQWNIYAAQASDGDN-WAD 352 ++ T ++ A+ E++ +DG Sbjct: 225 YMDDAMAYVDSLRDDSETNLNEAIVKAGELLDSEARYNRPGDSVYSMMILLTDGRPSVGT 284 Query: 353 DSPLCHEILAKKL---LPVVRYYSYIEITRRAHQTLWREYEHLQSTFDNFAMQHIRDQDD 409 A+++ + + + L + + + + + Sbjct: 285 TDQQEILDNAREVIAGKHSLNILGFGRL---VDFDLLVKLAYENNGTAKMIYEGTTAAEQ 341 Query: 410 IYPVFRELFHKQN 422 + + EL+ Sbjct: 342 LREFYFELYRPLL 354 >UniRef50_A1S119 von Willebrand factor, type A n=1 Tax=Thermofilum pendens Hrk 5 RepID=A1S119_THEPD Length = 327 Score = 90.3 bits (222), Expect = 1e-16, Method: Composition-based stats. Identities = 24/183 (13%), Positives = 50/183 (27%), Gaps = 17/183 (9%) Query: 252 FCLMDVSGSM-----DQSTKDMAKRFYILLYLFLSRTYKNVEVVYIR------HHTQAKE 300 ++DVSGSM ++A+R LL + + + T + Sbjct: 103 VLVVDVSGSMEDSIPGGVKIEVARRAATLLVERMPGGVDVGLLAFSDRIVLSLPPTGDRR 162 Query: 301 VDEHEFFYSQETGGTIVSSALKLMDEVVKERYNPAQWNIYAAQASDGDNWADDSPLCHEI 360 + GGT+ + L+ +K + SDG + Sbjct: 163 RVLDAIESLKPGGGTMYTYPLQAALSWLKPYKLFNA-STLVVFVSDGLPADAATYRTLLS 221 Query: 361 LAKKLLPVVRYYSYIEITRRAHQTLWREYEHLQSTFDNFAMQHIRDQDDIYPVFRELFHK 420 + L V + + + +++ F+ L K Sbjct: 222 EFRSLGIPVYTVYIG-PGGDEGERELKLIAGSTGGEEY----TAGSAEELLKAFKTLAEK 276 Query: 421 QNA 423 ++ Sbjct: 277 ASS 279 >UniRef50_C5CEE7 PEGA domain protein n=1 Tax=Kosmotoga olearia TBF 19.5.1 RepID=C5CEE7_KOSOT Length = 1706 Score = 90.3 bits (222), Expect = 1e-16, Method: Composition-based stats. Identities = 29/189 (15%), Positives = 54/189 (28%), Gaps = 23/189 (12%) Query: 244 DPSSQAVMFCLMDVSGSMDQSTKDMAKRFYILLYLFLSRTYKNVEVVYIR------HHTQ 297 +D SGSM + AK L + + + + T+ Sbjct: 289 KKEPSLNFVLEVDRSGSMK-PVMEKAKDAASYFLDLLPENSELALIAFDTEIEVLKNFTR 347 Query: 298 AKEVDEHEFFYSQETGGTIVSSALKLMDEVVKERYNPAQWNIYAAQASDG--DNWADDSP 355 +E + + G T + + E++ ER P + +DG N+ D +P Sbjct: 348 DREQLKRALAIIKARGATPLYDTVAKGIELLSERSGP----RFLILVTDGVDANYGDTAP 403 Query: 356 ------LCHEILAKKLLPVVRYYSYIEITRRAHQTLWREYEHLQSTFDNFAMQHIRDQDD 409 LA++ V ++ TR + I + Sbjct: 404 GSEKTLSEVIRLAREN--NVVIFAIGLGTR-IDEFSLGTLARSTGGMF-LKSPTIDNLKT 459 Query: 410 IYPVFRELF 418 + E F Sbjct: 460 AFNSLLETF 468 >UniRef50_Q22HH7 von Willebrand factor type A domain containing protein n=3 Tax=Tetrahymena thermophila SB210 RepID=Q22HH7_TETTH Length = 796 Score = 90.3 bits (222), Expect = 1e-16, Method: Composition-based stats. Identities = 31/207 (14%), Positives = 59/207 (28%), Gaps = 20/207 (9%) Query: 224 ERVPFIDTFDLRYKNYEKRPDPSSQAVMFCLMDVSGSMDQSTKDMAKRFYILLYLFLSRT 283 VP I + + +A L+D SGSM S + AK+ I L Sbjct: 255 NFVPNIAQVKKQLFRQAQNEIELMKAEFLLLIDRSGSMVGSNIETAKQALIFFLKSLPEG 314 Query: 284 YKNVEVVYIRHHTQAKEVDEHEFFY-----------SQET-GGTIVSSALKLMDEVVKER 331 + + ++T Q GGT +S ALK + ++++ Sbjct: 315 SIYNIISFGTNYTVMYPQSVQVNDQNLQDSIDKIEKFQANMGGTNISQALKYLMYNLQDQ 374 Query: 332 YNPAQWNIYAAQASDGDNWADDSPLCHEILAKKLLPVVRYYSYIEITRRAHQTLWREYEH 391 Y +DG+ + D P + K + + + Sbjct: 375 Y---GLRKKIYIITDGE-FQDYQPALEIVKKNKFKCDINALCIG----SYEFLYATQILN 426 Query: 392 LQSTFDNFAMQHIRDQDDIYPVFRELF 418 + + + ++ F Sbjct: 427 ETGGNFQKVTDTSQIISQVIQLLKDSF 453 >UniRef50_D1YYY2 Putative uncharacterized protein n=1 Tax=Methanocella paludicola SANAE RepID=D1YYY2_METPS Length = 716 Score = 89.9 bits (221), Expect = 1e-16, Method: Composition-based stats. Identities = 26/200 (13%), Positives = 47/200 (23%), Gaps = 16/200 (8%) Query: 239 YEKRPDPSSQAVMFCLMDVSGSMDQSTKDMAKRFYILLYLFLSRTYKNVEVVYIRHHTQA 298 + L+D SGSM K+ A+ L L + + Sbjct: 157 PASKDVKKISGEYVILIDHSGSMAGPKKEAAEWAVGKFLLGLGPDDWFTLGAFSNNTRWY 216 Query: 299 KEVDEHE-----------FFYSQETGGTIVSSALKLMDEVVKERYNPAQWNIYAAQASDG 347 + E GGT + AL+ ++ + + + +D Sbjct: 217 SRLLAGATGDTVKNAVEFMKSKFEGGGTEMGVALEQALDI---KRLKGDVSRHVLIITDA 273 Query: 348 DNWADDSPLCHEILAKKLLPVVRYYSYIEITRRAHQTLWREYEHLQSTFDNFAMQHIRDQ 407 + L + P R S + I + L F + Sbjct: 274 EVTDGGRILRLVD-RESRRPDRRSISLLCIDAAPNSYLAAAIAERGGGIVKFLTSDPSE- 331 Query: 408 DDIYPVFRELFHKQNATAKG 427 DI + Sbjct: 332 GDISSALDAILDDWRQPLLA 351 >UniRef50_B8FBV5 Putative uncharacterized protein n=1 Tax=Desulfatibacillum alkenivorans AK-01 RepID=B8FBV5_DESAA Length = 308 Score = 89.9 bits (221), Expect = 1e-16, Method: Composition-based stats. Identities = 33/253 (13%), Positives = 67/253 (26%), Gaps = 25/253 (9%) Query: 190 ALEENLAIISNSEPAQLLEEERLRKEIAELRAKIERVPFIDTFDLRYKNYEKRPDPSSQA 249 ++ + +R I L + V + R + Sbjct: 27 RPPAVTYSAASCLGRIAGKNAEIRARIPLLVRTLALVLLVAAIARPQTVDASREIKTPGV 86 Query: 250 VMFCLMDVSGSMDQSTKDM----------AKRFYILLYLFLSRTYKNVEVVYIRHH---- 295 + +D S SM Q + K+ T + VV+ + Sbjct: 87 DIILCLDASESMAQPDFAIDGQRVNRLTAVKKVVHDFVKR-RDTDRIGLVVFGDYAFTQA 145 Query: 296 --TQAKEVDEHEFFYSQ---ETGGTIVSSALKLMDEVVKERYNPAQWNIYAAQASDGDNW 350 T K + + + T + AL + + +K + + SDG+N Sbjct: 146 PLTLDKGLLLNLIENLRIGMAGRKTAIGDALGVAGKRIK---DIPAMSKVVILLSDGENT 202 Query: 351 ADDSPLCHEILAKKLLPVVRYYSYIEITRRAHQTLWREYEHLQSTFDNFAMQHIRDQDDI 410 A D A ++ Y+ T +A + + + + D I Sbjct: 203 AGDMTPQGAAEA-LAALGIKIYTIGMGTEQAGSKELAQIAAIGQGKY-YHASNTEQLDSI 260 Query: 411 YPVFRELFHKQNA 423 Y + + Sbjct: 261 YKEIDKAEKTEAK 273 >UniRef50_B3DVX4 Uncharacterized protein containing a von Willebrand factor type A (VWA) domain n=1 Tax=Methylacidiphilum infernorum V4 RepID=B3DVX4_METI4 Length = 334 Score = 89.9 bits (221), Expect = 2e-16, Method: Composition-based stats. Identities = 26/219 (11%), Positives = 50/219 (22%), Gaps = 45/219 (20%) Query: 241 KRPDPSSQAVMFCLMDVSGSMDQSTKDMAKRFYILLYLFL---------SRTYKNVEVVY 291 K P + ++D+SGSM ++ ++ L + L + V + Sbjct: 79 KVPLRKEGYDIILVLDISGSMLAEDYEIDQKRVSRLDIVLEVVKTFLDKRTNDRIGLVAF 138 Query: 292 IRHH------TQAKEVDEHEFFYSQET---GGTIVSSALKLMDEVVKERYNPAQWNIY-- 340 T + + Q GT + AL L ++ + + Sbjct: 139 AGRAYTVCPLTFDHNWLKRKIDQLQAGTIEDGTAIGDALGLALSRLEGKKESGERKKIGS 198 Query: 341 -AAQASDGDNWADDSPLCHEILAKKLLPVVRYYSYIEI---------------------- 377 +DG N + E V ++ Sbjct: 199 FLILLTDGANNCG-NLTPIEAARLAAHAAVPVFTIGAGINGEVTMPVMDEERRKIGSQTV 257 Query: 378 TRRAHQTLWREYEHLQSTFDNFAMQHIRDQDDIYPVFRE 416 + L R L F + Sbjct: 258 VSEVDEGLLRNIAQLTGGEY-FRATDSNAIVSAFQAIDA 295 >UniRef50_C6WL97 VWA containing CoxE family protein n=1 Tax=Actinosynnema mirum DSM 43827 RepID=C6WL97_ACTMD Length = 1295 Score = 89.9 bits (221), Expect = 2e-16, Method: Composition-based stats. Identities = 53/347 (15%), Positives = 96/347 (27%), Gaps = 35/347 (10%) Query: 47 DSGESVSIPTEDISEPMFHQGRGGLRHRVHPGNDHFVQNDRIERPQGGGGGSGSGQG--- 103 G +IP E + G R G + D + G G G Sbjct: 919 AEGAETAIPAELPPVLRWRLLLGARGDRPAGGGRYAAALDELYGHDRGEGADSGSLGGSA 978 Query: 104 -----QASQDGEGQDEFVFQIS---KDEYLDLLFEDLALP---NLKQNQQRQLTEYKTHR 152 E DE ++E L E L + R + + Sbjct: 979 GGDGDPFPVVREWSDELKALFGDRVREEVLAAAAEGGRLEAALEIDPTSVRPSVDLLRNV 1038 Query: 153 AGYTANGVPANISVVRSLQNSLARRTAMTAGKRRELHALEENLAIISNSEPAQLLEEERL 212 ++ +R L L R A R L + +L L Sbjct: 1039 LSLAGGLSEDALARLRPLVARLVRELAEQLANRIRPALTGMQLPFPTRRPGGKLDLPRTL 1098 Query: 213 RKEIAELRAKIERVPFIDTFDLRYKNYEKRPDPSSQAVMFCLMDVSGSMDQSTKDMAKRF 272 R +A R + +++ R ++ + ++DVSGSM+ ST A Sbjct: 1099 RANLATARRDEHGKVVVIPERPVFRS---RGRKANDWRLILVVDVSGSMEASTVWAALTA 1155 Query: 273 YILLYLFLSRTYKNVEVVYIRHHTQAKEVDEHEFFYS------QETGGTIVSSALKLMDE 326 + +++ ++ TQ ++ E + GGT ++ AL+ + Sbjct: 1156 SVFA------GVRSLTTHFLAFSTQVVDLSERVADPLSLLLEVKVGGGTHIAGALRHARD 1209 Query: 327 VVKERYNPAQWNIYAAQASDGDNWADDSPLCHEILAKKLLPVVRYYS 373 +V SD + + + + V Sbjct: 1210 LV-----TVPERTMVVLVSDFE-EGGPVASLTAQVRELVSAGVTVLG 1250 >UniRef50_A6EQD3 von Willebrand factor type A like domain n=2 Tax=Bacteroidetes RepID=A6EQD3_9BACT Length = 733 Score = 89.9 bits (221), Expect = 2e-16, Method: Composition-based stats. Identities = 23/202 (11%), Positives = 56/202 (27%), Gaps = 28/202 (13%) Query: 241 KRPDPSSQAVMFCLMDVSGSMDQSTKDMAKRFYILLYLFLSRTYKNVEVVYIRHHTQAKE 300 +P + ++DVSGSM+ +++K L L+ ++ T Sbjct: 285 VKPKNVTAREYLFIVDVSGSMNGYPLEVSKDLMRNLLCNLNADDTFNVQLFASSSTIFNP 344 Query: 301 VDEHEFFYSQETGGTI---------------VSSALKLMDEVVKERYNPAQWNIYAAQAS 345 + T + SAL + E+ + + + Sbjct: 345 TPVEATDENV----TNAIKFLTSGQGGGGTQLLSALNVAYELPRS---QEGSSRSMVIIT 397 Query: 346 DGDNWADDSPLCHEILAKKLLPVVRYYSYIEITRRAHQTLWREYEHLQSTFDNFAMQHIR 405 DG ++ L +++ ++ L + ++F Sbjct: 398 DG---YVSVEREAFTKIEENLDQASVFTFGIG-SSVNRYLIEGMAAVSK-SESFIATSRE 452 Query: 406 DQDDIYPVFRE-LFHKQNATAK 426 + + F++ + K Sbjct: 453 EASKVAEDFKKYIDSPVMTQVK 474 >UniRef50_D1YD07 von Willebrand factor type A domain protein n=2 Tax=Propionibacterium acnes RepID=D1YD07_PROAC Length = 318 Score = 89.9 bits (221), Expect = 2e-16, Method: Composition-based stats. Identities = 26/212 (12%), Positives = 49/212 (23%), Gaps = 34/212 (16%) Query: 243 PDPSSQAVMFCLMDVSGSMDQSTKD-----MAKRFYILLYLFLSRTYKNVEVVYIRH--- 294 P +A + +DVS SM + + AK L + V + Sbjct: 81 EVPRDRATVVVAIDVSRSMVATDVEPSRLSAAKTAAKDFLGDLPPRFNVSLVKFAASAQV 140 Query: 295 ---HTQAKEVDEHEFFYSQETGGTIVSSALKLMDEVVKERYNPAQWN-----IYAAQASD 346 T + Q T + + +K + + SD Sbjct: 141 VVPPTTDRAAVSTAITNLQVLPSTAIGEGIYSSLNALKLVPDDPKHPGQKPPAAIVLLSD 200 Query: 347 GDNWADDSPLCHEILAKKLLPVVRYYSYIEITRR-----------AHQTLWREYEHLQST 395 G L A + V +Y + Sbjct: 201 GATNVGRPSLEAAKEAGRQHVPVYTIAYGTAGGYVVEGGQRQPVPVNHYELAAIAKASGG 260 Query: 396 FDNFAMQHIRDQDDIYPVF------RELFHKQ 421 F+ + + D+Y ++F + Sbjct: 261 E-KFSAESLGQLSDVYKSIAQSVGYEKVFGEV 291 >UniRef50_A3J9J6 Putative uncharacterized protein n=3 Tax=Bacteria RepID=A3J9J6_9ALTE Length = 341 Score = 89.9 bits (221), Expect = 2e-16, Method: Composition-based stats. Identities = 27/211 (12%), Positives = 58/211 (27%), Gaps = 38/211 (18%) Query: 240 EKRPDPSSQAVMFCLMDVSGSMDQSTK----------DMAKRFYILLYLFLSRTYKNVEV 289 E R P++ + ++D+S SMD+ K+ + + + Sbjct: 79 EARQLPTTGRDLMLVVDISPSMDEPDMVRQGRRINRLQAVKQVLAEFIDQ-RQGDRLGLI 137 Query: 290 VY---------IRHHTQAKEVDEHEFFYSQETGGTIVSSALKLMDEVVKERYNPAQWNIY 340 ++ + + E T + A+ L + ++ER Sbjct: 138 LFGSQAYVQAPLTFDRTTVNILLQEAGLGMAGNATAIGDAVGLAVKRLRERPLE---QRV 194 Query: 341 AAQASDGDNW-ADDSPLCHEILAKKLLPVVRYYSYIEITR-------------RAHQTLW 386 A +DG N + +P LA+ + + L Sbjct: 195 AIVLTDGANTAGEITPDKASELAQASAVRLYTIGIGAGADSAITGLLQRNPSRDLDEALL 254 Query: 387 REYEHLQSTFDNFAMQHIRDQDDIYPVFREL 417 F +++ + IY +L Sbjct: 255 TRMAQQTGGQY-FRARNLAELGGIYTSINQL 284 >UniRef50_UPI000175F343 PREDICTED: similar to BCSC-1 n=3 Tax=Euteleostomi RepID=UPI000175F343 Length = 1292 Score = 89.9 bits (221), Expect = 2e-16, Method: Composition-based stats. Identities = 31/261 (11%), Positives = 61/261 (23%), Gaps = 20/261 (7%) Query: 178 TAMTAGKRRELHALEENLAIISNSEPAQLLEEERLRKEIAELRAKIERVPFIDTFDLRYK 237 RE + E+RL + P + Sbjct: 299 LERGRLTFREYEQQIRARRDFIRCARKEAESEKRLEFVRRRYHKDLLLNPVLMLNFCPDL 358 Query: 238 NYEKRPDPSSQAVMFCLMDVSGSMDQSTKDMAKRFYILLYLFLSRTYKNVEV-------- 289 E + + L+D S +M Q + K ++ L Sbjct: 359 LSEPTELQQASRELIFLIDNSNTMTQHNINKIKEGMLVAIKSLPTRTMLNIAGICSTVRP 418 Query: 290 VYIRHHTQAKEVDEHEFF---YSQET-GGTIVSSALKLMDEVVKERYNPAQWNIYAAQAS 345 ++ T + G + SAL + R P Sbjct: 419 LFNTSKTCTDVTVTQALEFIQKLRSDHGAVNLWSALSWVYRQPVHRSCP----RQLFIIM 474 Query: 346 DGDNWADDSPLCHEILAKKLLPVVRYYSYIEITRRAHQTLWREYEHLQSTFDNFAMQHIR 405 DG + L ++ VR + +A + L + + +F + R Sbjct: 475 DG---NLSNVGRVIELVRRNACNVRCFGLGLG-PQACRRLLQGIAKVTGGSTDFLSEEER 530 Query: 406 DQDDIYPVFRELFHKQNATAK 426 Q + ++ + Sbjct: 531 LQPKLIKCLKKALEPVLTDVR 551 >UniRef50_UPI0001A2D396 UPI0001A2D396 related cluster n=5 Tax=Clupeocephala RepID=UPI0001A2D396 Length = 1227 Score = 89.9 bits (221), Expect = 2e-16, Method: Composition-based stats. Identities = 31/261 (11%), Positives = 61/261 (23%), Gaps = 20/261 (7%) Query: 178 TAMTAGKRRELHALEENLAIISNSEPAQLLEEERLRKEIAELRAKIERVPFIDTFDLRYK 237 RE + E+RL + P + Sbjct: 288 LERGRLTFREYEQQIRARRDFIRCARKEAESEKRLEFVRRRYHKDLLLNPVLMLNFCPDL 347 Query: 238 NYEKRPDPSSQAVMFCLMDVSGSMDQSTKDMAKRFYILLYLFLSRTYKNVEV-------- 289 E + + L+D S +M Q + K ++ L Sbjct: 348 LSEPTELQQASRELIFLIDNSNTMTQHNINKIKEGMLVAIKSLPTRTMLNIAGICSTVRP 407 Query: 290 VYIRHHTQAKEVDEHEFF---YSQET-GGTIVSSALKLMDEVVKERYNPAQWNIYAAQAS 345 ++ T + G + SAL + R P Sbjct: 408 LFNTSKTCTDVTVTQALEFIQKLRSDHGAVNLWSALSWVYRQPVHRSCP----RQLFIIM 463 Query: 346 DGDNWADDSPLCHEILAKKLLPVVRYYSYIEITRRAHQTLWREYEHLQSTFDNFAMQHIR 405 DG + L ++ VR + +A + L + + +F + R Sbjct: 464 DG---NLSNVGRVIELVRRNACNVRCFGLGLG-PQACRRLLQGIAKVTGGSTDFLSEEER 519 Query: 406 DQDDIYPVFRELFHKQNATAK 426 Q + ++ + Sbjct: 520 LQPKLIKCLKKALEPVLTDVR 540 >UniRef50_A7S6T1 Predicted protein n=2 Tax=Nematostella vectensis RepID=A7S6T1_NEMVE Length = 1235 Score = 89.9 bits (221), Expect = 2e-16, Method: Composition-based stats. Identities = 32/263 (12%), Positives = 63/263 (23%), Gaps = 42/263 (15%) Query: 188 LHALEENLAIISNSEPAQLLEEERLRKEIAELRA--KIERVPFIDTFDLRYKNYEKRPDP 245 E + +S + + + L + +D R + + Sbjct: 220 RVLGEGIYSAMSRNLKQNQRLKWQFFGSKEGLCTIYPAAPLKECHAYDNRLRPWYTSAAY 279 Query: 246 SSQAVMFCLMDVSGSMDQS---------TKDMAKRFYILLYLFLSRTYKNVEVVYIR--- 293 S + ++D S SM D+AK + L K V++ Sbjct: 280 PSTKKLVIVLDTSSSMASRVELGTKRRTRLDVAKAALSTILSTLLPQDKVGVVLFNSKVT 339 Query: 294 ----------HHTQAKEVDEHEFFYS-------QETGGTIVSSALKLMDEVVKERYNPA- 335 + T+ Y + GGT +A K ++K + Sbjct: 340 LAGSSGVDECYSTRLAPAGRFNVNYLKDFINRSRPGGGTQYQNAFKAAFTLLKSAKSGDG 399 Query: 336 -QWNIYAAQASDGDNWADDSPLCHEILA-------KKLLPVVRYYSYIEITRRAHQTLWR 387 + +DG D L E L ++ V + + Sbjct: 400 GGEQSFLLFLTDGGP--KDDALEVERLIAQNKKEMEESRERVTIMTIGLGKDEHMKDFLG 457 Query: 388 EYEHLQSTFDNFAMQHIRDQDDI 410 + + I Sbjct: 458 RLSKNVGSKYSQVDNEAHMYSAI 480 >UniRef50_C4G1K3 Putative uncharacterized protein n=1 Tax=Abiotrophia defectiva ATCC 49176 RepID=C4G1K3_ABIDE Length = 1659 Score = 89.9 bits (221), Expect = 2e-16, Method: Composition-based stats. Identities = 25/166 (15%), Positives = 52/166 (31%), Gaps = 11/166 (6%) Query: 199 SNSEPAQLLEEERLRKEIAELRAKIERVPFIDTFDLRYKNYEKRPDPSSQAVMFCLMDVS 258 + E++ I++ + +L+ K + + + +MD S Sbjct: 37 PDEYYKNGSEKQENGVTISKKVTRYNAADGTYDIELKVKGSTEVVQNNKILDIVLVMDTS 96 Query: 259 GSMDQSTKDMAKRFYILLYLFLSRTYKNVEVVYIRHH---------TQAKEVDEHEFFYS 309 GSM+ + + AK+ L NV + + T+ ++ Sbjct: 97 GSMEGKSLENAKKAANNFVDKLLPQNNNVNIGIVSFAEKGEIKSGLTRNVTTLKNAIKGL 156 Query: 310 QETGGTIVSSALKLMDEVVKERYNPAQWNIYAAQASDGDNWADDSP 355 + GGT L+ V+ PA+ DG+ + Sbjct: 157 KADGGTYTQQGLEKAATVLNGA--PAEHKKVMVVIGDGEPTYANGE 200 >UniRef50_Q0W729 Putative uncharacterized protein n=1 Tax=uncultured methanogenic archaeon RC-I RepID=Q0W729_UNCMA Length = 1310 Score = 89.9 bits (221), Expect = 2e-16, Method: Composition-based stats. Identities = 16/170 (9%), Positives = 38/170 (22%), Gaps = 22/170 (12%) Query: 269 AKRFYILLYLFLSRTYKNVEVVYIRH-----------HTQAKEVDEHEFFYSQETGGTIV 317 AK + + V + K ++ +GGT + Sbjct: 909 AKTSAVSFVESRGDGDQVGVVSFYTSASLNSALKQMNSGTNKTTVKNAINSLSASGGTDI 968 Query: 318 SSALKLMDEVVKERYNPAQWNIYAAQASDGDNWADDSPLCHEILAKKLLPVVRYYSYIEI 377 SS +K + Y +DG ++ K ++ Sbjct: 969 SSGIKKAIAELDAHKRSTA-KQYIIVLTDG--YSQYPEFDLIEADKAKAKGYTIFTIGMG 1025 Query: 378 TRRAHQTLWREYEHLQSTFDNFAMQHIRDQDDIYPVFRELFHKQNATAKG 427 A + ++ + + + Y ++ Sbjct: 1026 M--ADEDTLKKIA--SKPEYYYRVLSPEQLEAAYYDI----GQEIGGVIA 1067 >UniRef50_C6JMG8 Magnesium chelatase n=1 Tax=Fusobacterium varium ATCC 27725 RepID=C6JMG8_FUSVA Length = 632 Score = 89.9 bits (221), Expect = 2e-16, Method: Composition-based stats. Identities = 42/372 (11%), Positives = 99/372 (26%), Gaps = 25/372 (6%) Query: 44 TDVDSGESVSIPTEDISEPMFHQGRGGLRHRVHPGNDHFVQNDRIERPQGGGGGSGSGQG 103 G S + +F + + +E + Sbjct: 251 LAALDGRSYLNIDDLKEAAVFVLPHRTNQKHESTSQSK---GNELE----DKNQETEEEK 303 Query: 104 QASQDGEGQDEFVFQISKDEYLDLLFEDLALPNLKQNQQRQLTEYKTHRAGYTANGVPAN 163 S+D Q++ V + E D + + +++++ G Sbjct: 304 NNSEDNITQEKEVPEEPNKENFDNGNDSENIEESDRDEKKNKNNENAESEEEFGIGEIFK 363 Query: 164 ISVVRSLQNSLARRTAMTAGKRRELHALEENLAIISNSEPAQLLEEERLRKEIAELRA-- 221 + + R+ A GKR + + I ++ P + + I Sbjct: 364 VKDILIDDVHDTRKRA-GTGKRCKTKSGSLQGRYIKSTLPKGKIRDFAFDATIRAAAPYQ 422 Query: 222 KIERVPFIDTFDLRYKNYEKRPDPSSQAVMFCLMDVSGSMD-QSTKDMAKRF-YILLYLF 279 K + + + K + + A + ++D SGSM + + K LL Sbjct: 423 KKNKENNLMINIKKEHIRVKVREKRTGASILFVVDSSGSMGVKKRMEAVKGAVMSLLKDA 482 Query: 280 LSRTYKNVEVVY-------IRHHTQAKEVDEHEFFYSQETGGTIVSSALKLMDEVVKERY 332 + + V + + T++ ++ + + G T ++ + ++K Sbjct: 483 YEKRDRVGMVSFRRDKAEELLPITRSIDLAQKKLEKLATGGKTPLAEGIAKAYTIIKNEM 542 Query: 333 NPAQ-WNIYAAQASDGDNW----ADDSPLCHEILAKK-LLPVVRYYSYIEITRRAHQTLW 386 + SDG D +A+K +R + Sbjct: 543 RKDKEVVPLIVFLSDGKGNFSASGKDPVKESLEMAEKIKNEGIRAIVIDTEEGFIKLEMA 602 Query: 387 REYEHLQSTFDN 398 + Sbjct: 603 KTLSEAMKAEYY 614 >UniRef50_C7N1C6 Uncharacterized protein n=1 Tax=Slackia heliotrinireducens DSM 20476 RepID=C7N1C6_SLAHD Length = 744 Score = 89.6 bits (220), Expect = 2e-16, Method: Composition-based stats. Identities = 25/168 (14%), Positives = 47/168 (27%), Gaps = 15/168 (8%) Query: 238 NYEKRPDPSSQAVMFCLMDVSGSMDQSTKDMAKRFYILLYLFLSRTY-KNVEVVYIRH-- 294 + + P+ +S + +D SGSMD + K + ++ V Y Sbjct: 369 DSKVDPNDASSRHVVLALDTSGSMDGEPLNETKTATREFASTIFKSDADVCLVSYDSSAR 428 Query: 295 ----HTQAKEVDEHEFFYSQETGGTIVSSALKLMDEVVKERYNPAQWNIYAAQASDGD-N 349 T + + GGT + AL++ E ++ SDG+ N Sbjct: 429 NVIDSTDNEYALKAAVRDLSAGGGTNIEDALRVSYERLEGS---GSDKRIIVLMSDGEAN 485 Query: 350 WADDSPLCHEILAKKLLPVVRYYSYIEITRRAH----QTLWREYEHLQ 393 + V Y+ + Q + Sbjct: 486 EGLVGDDLIAYANEIKDDGVTIYTLGFFQSVSDKAECQRVMEGIASPG 533 >UniRef50_D0N9W4 Putative uncharacterized protein n=2 Tax=Phytophthora infestans T30-4 RepID=D0N9W4_PHYIN Length = 2146 Score = 89.6 bits (220), Expect = 2e-16, Method: Composition-based stats. Identities = 21/184 (11%), Positives = 49/184 (26%), Gaps = 10/184 (5%) Query: 252 FCLMDVSGSMDQSTKDMAKRFYI-LLYLFLSRTYKNVEVVYIRHHT------QAKEVDEH 304 ++D SGSM+ + + +Y ++ V + +A+ + Sbjct: 1905 VFVLDCSGSMNGQPWNDLMAAWKEYVYNRIADGATLDLVSVVTFDNSAQIVYEARSITTV 1964 Query: 305 EFFYSQE-TGGTIVSSALKLMDEVVKERYNPAQWNIYAAQASDGDNWADDSPLCHEILAK 363 Q GGT ++ L+ +EV+ R N + SDG + Sbjct: 1965 TNARIQYRGGGTNYAAGLRSANEVL-SRVNFDMFKPAIVFFSDGHPCDPLQGEELATHIR 2023 Query: 364 KLLPVVRYYSYIEITRRAHQTLWREYEHLQSTFDNFAMQHIRDQDDIYPVFRELFHKQNA 423 ++ + + + + + + + Sbjct: 2024 GCYERNGLQAFAVGFGSINLNMLERVAEKLGGTYHHVLTG-NELKATFFSISASLSTRAG 2082 Query: 424 TAKG 427 A Sbjct: 2083 LALA 2086 >UniRef50_UPI00015B5333 PREDICTED: similar to ENSANGP00000021218 n=1 Tax=Nasonia vitripennis RepID=UPI00015B5333 Length = 1230 Score = 89.6 bits (220), Expect = 2e-16, Method: Composition-based stats. Identities = 35/313 (11%), Positives = 71/313 (22%), Gaps = 47/313 (15%) Query: 138 KQNQQRQLTEYKTHRAGYTANGVPANISVVRSLQNSLARRTAMTAGKRRELHALEENLAI 197 + G N + + V ++ EE Sbjct: 121 DAKNYTHFNTSLSEHFGGDVNLNFSAVHVPTTVYGRAKEVLRAIRWS-------EELDNT 173 Query: 198 ISNSEPAQLLEEERLRKEIAELRAKIERV---------PFIDTFDLRYKNYEKRPDPSSQ 248 N+ + + + D FD R +++ +S Sbjct: 174 FKNNYLQDPSLSWQYFGSSTGFMRQYPAINWKPNGSDPHDPDLFDCRTRSW-YIEAATSP 232 Query: 249 AVMFCLMDVSGSMDQSTKDMAKRFYILLYLFLSRTYKNVEVVYIR--------HHTQAKE 300 + L+D SGSM K++A+ + L V + + Sbjct: 233 KDVLILVDTSGSMTGMRKEIARHVVNNILDTLGNNDYVNIVKFSNVTELAVPCFGDTLVQ 292 Query: 301 VDEHEFFYSQETGG-------TIVSSALKLMDEVVK---ERYNPAQWNIYAAQASDGDNW 350 + + S L E+++ E A N +DG Sbjct: 293 ANLANIRELKNGISEMNTERIANFSMILTYAFELLEEFREMRRGACCNQAIMLVTDGVP- 351 Query: 351 ADDSPLCHEILAK---------KLLPVVRYYSYIEITRRAHQTLWREYEHLQSTFDNFAM 401 D+ + VR ++Y+ A R + Sbjct: 352 --DNYKEIFQRYNWASNPDNPDQADMPVRIFTYLIGREVADVRDSRWMACANRGYFVHLS 409 Query: 402 QHIRDQDDIYPVF 414 ++ + Sbjct: 410 TLAEVREQVLNYI 422 >UniRef50_A6QCW6 von Willebrand factor type A domain protein n=1 Tax=Sulfurovum sp. NBC37-1 RepID=A6QCW6_SULNB Length = 325 Score = 89.6 bits (220), Expect = 2e-16, Method: Composition-based stats. Identities = 34/209 (16%), Positives = 59/209 (28%), Gaps = 30/209 (14%) Query: 240 EKRPDPSSQAVMFCLMDVSGSMDQSTK-----------DMAKRFYILLYLFLSRTYKNVE 288 E SQ + +D+SGSM + K + K Sbjct: 84 EPVTKDVSQRELLISVDLSGSMMTKDFVNKEGKAIDRLEAVKMVLRDFLKE-RKGEKIGL 142 Query: 289 VVYIRHH------TQAKEVDEHEFFYSQ---ETGGTIVSSALKLMDEVVKERYNPAQWNI 339 +++ TQ + EH + T + ++ L ++ +E Sbjct: 143 ILFGNAAFVQAPFTQDLDALEHLLDSLRVGMAGPQTAMGDSIGLAVKMFRESNVTD---R 199 Query: 340 YAAQASDGDNWAD-DSPLCHEILAKKLLPVVRYYSYIE----ITRRAHQTLWREYEHLQS 394 SDGD+ P LA K V + +E + Sbjct: 200 MLIVMSDGDDTGSKVPPKTSAELAAKNGVNVFTIGIGDPKNAGEHPIDTDTLKEIAAITG 259 Query: 395 TFDNFAMQHIRDQDDIYPVFRELFHKQNA 423 +A ++ D DIY +L K+ Sbjct: 260 GKFYYAW-NLDDLQDIYKQIDKLKPKEIK 287 >UniRef50_C3Y9U4 Putative uncharacterized protein n=1 Tax=Branchiostoma floridae RepID=C3Y9U4_BRAFL Length = 1317 Score = 89.6 bits (220), Expect = 2e-16, Method: Composition-based stats. Identities = 28/246 (11%), Positives = 66/246 (26%), Gaps = 29/246 (11%) Query: 198 ISNSEPAQLLEEERLRKEIAELRAKIERVPFIDTFDLRYKNYEKRPDPSSQAVMFCLMDV 257 +++ ++ +I I + R + + L+D Sbjct: 23 WDARGSDRIVNDDPSALQIGLDPDSPRSKVEILGEIFKKHVQRLRQTENQTVELVFLVDS 82 Query: 258 SGSMDQSTKDMAKRFYILLYLFL---SRTYKNVEVVYI---------------RHHTQAK 299 S S+ + RF L + V + +H Sbjct: 83 SASVGNENFNSELRFVKKLLADFTLAENAARVAIVTFSSRNKVVNHVDHLSKPSYHKHKC 142 Query: 300 EVDEHEFFYSQ-ETGGTIVSSALKLMDEVVKERYNPAQWNIYAAQASDGDNWADDSPLCH 358 + E E + GGT A+ EV++ A +DG + D Sbjct: 143 SLLEEELPRIKYAGGGTYTKGAMIKAQEVLRHARPNA--TKAVFLMTDGYSNGGDPLPEA 200 Query: 359 EILAKKLLPVVRYYSYIEITRRAHQTLWREYEHLQSTFDNFAMQHIRDQDDIYPVFRELF 418 L + V+ +++ + + + + ++ + + + + R Sbjct: 201 RKLKQN---DVQIFTFGIRSGNVKE--LQNMATDPAEEHSYFLDSFAEFEAL---ARRAL 252 Query: 419 HKQNAT 424 H+ Sbjct: 253 HEDLQG 258 >UniRef50_A9F2Q0 Putative uncharacterized protein n=1 Tax=Sorangium cellulosum 'So ce 56' RepID=A9F2Q0_SORC5 Length = 521 Score = 89.6 bits (220), Expect = 2e-16, Method: Composition-based stats. Identities = 31/257 (12%), Positives = 64/257 (24%), Gaps = 17/257 (6%) Query: 181 TAGKRRELHALEENLAIISNSEPAQLLEEERLRKEIAELRAKIERVPFIDTFDLRYKNYE 240 T R+ +A + S L + R F++R Sbjct: 41 TEPPRQVPEVTGGAVAAVDPSRFTTGSRLM-LEGRVGHARLPRSAQETFLMFEVRGDGSP 99 Query: 241 KRPDPSSQAVMFCLMDVSGSMDQSTKDMAKRFYILLYLFLSRTYKNVEVVYIRHHTQAKE 300 R + + +D SGSM + A + L+ V + + Sbjct: 100 ARSLAQANLSLV--IDRSGSMKGTRLTNAVQAATTAVSRLNDGDVVSVVTFDTRTSVVVP 157 Query: 301 VD----------EHEFFYSQETGGTIVSSALKLMDEVVKERYNPAQWNIYAAQASDGD-N 349 G T +S ++ ++ + A + SDGD N Sbjct: 158 PTTVGPETRGRILASVRGISLGGDTCISCGIEEGLSLLGQTS--AGVSRMLVL-SDGDAN 214 Query: 350 WADDSPLCHEILAKKLLPVVRYYSYIEITRRAHQTLWREYEHLQSTFDNFAMQHIRDQDD 409 +A++ + I + ++ + + F Sbjct: 215 HGVRDVPGFRAMAQRARDRGVAITTIGVDVDYNEKILSAIALDSNGRHYFVENDAALARI 274 Query: 410 IYPVFRELFHKQNATAK 426 +L + A+ Sbjct: 275 FEAEAEQLTTSVASGAE 291 >UniRef50_A3I2X5 Putative batB protein n=1 Tax=Algoriphagus sp. PR1 RepID=A3I2X5_9SPHI Length = 321 Score = 89.6 bits (220), Expect = 2e-16, Method: Composition-based stats. Identities = 22/214 (10%), Positives = 54/214 (25%), Gaps = 38/214 (17%) Query: 240 EKRPDPSSQAVMFCLMDVSGSMDQSTK-----DMAKRFYILLYLFLSRTYKNVEVVYIRH 294 + +F +D+S SM+ + K L + + +++ Sbjct: 69 SVKEIKEEGKDIFLAVDLSQSMNATDIGPSRLQRIKFELKELTKSF-PSDRIGLIIFSSE 127 Query: 295 H------TQAKEVDEHEFFYSQET----GGTIVSSALKLMDEVVKERYNPAQWNIYAAQA 344 T + V + GT +++ L++ + + + + Sbjct: 128 AFMQCPLTFDQSVLQLYIDGLNTGLVPNFGTDLNAPLRIALDRFQNDESQEVKSKSVILI 187 Query: 345 SDGDNWADDSPLCHEILAKKLLPVVRYYSYIEITRR---------------------AHQ 383 SDG+N+ D K L V + + Sbjct: 188 SDGENFG-DELENIGSELKNLGVKVFALGIGTESGSTIPRGNGIVMDPQTGEPAQTVLDK 246 Query: 384 TLWREYEHLQSTFDNFAMQHIRDQDDIYPVFREL 417 ++ +++ D+ L Sbjct: 247 RPLQQIAAETDGQYFEISDEVQEVADLIKRLERL 280 >UniRef50_B4BQC0 von Willebrand factor type A n=2 Tax=Geobacillus RepID=B4BQC0_9BACI Length = 668 Score = 89.6 bits (220), Expect = 2e-16, Method: Composition-based stats. Identities = 38/282 (13%), Positives = 68/282 (24%), Gaps = 43/282 (15%) Query: 98 SGSGQGQASQDGEGQDEFVFQISKDEYLD------LLFEDLALPNLKQNQQRQLTEYKTH 151 G DG+G F++ E + + + +++ Sbjct: 45 EGWATAPGEGDGKGNSIRFFEVRIQLKGQNGTAVSYRTEAVEVEPDRNGEKKYTFSLDMR 104 Query: 152 RAGYTANGVPANISVVRSLQNSLAR-----RTAMTAGKRRELHALEENLAIISNSEPAQL 206 +A G A +V L + E + A + S Sbjct: 105 GKWPSAPGTTATYEIVVDAYRVLGNGQEEVYFSFPQPPYEYTRQTETSTAKLDFSLSFSQ 164 Query: 207 LEEERLRKEIAELRAKIERVPFIDTFDLRYKNYEKRPDPSSQAVMFCLMDVSGSMDQSTK 266 E A+ + T + + +MDVSGSM Sbjct: 165 -------PEYAKPPDGDAQGRLDVTLIPQGGVPAPV---RPPIDVVFVMDVSGSMTTMKL 214 Query: 267 DMAKRFYILLYLFLS----RTYKNVEVVYI--------------RHHTQAKEVDEHEFFY 308 AK + + + + + + E Sbjct: 215 QSAKSALQAAVNYFKTNYHPNDRFALIPFSDDVKATSVVPFGSKSNVISQLDAILDEGNR 274 Query: 309 SQETGGTIVSSALKLMDEVVKERYNPAQWNIYAAQASDGDNW 350 GGT S+AL L + +N + Y +DG Sbjct: 275 LTANGGTNYSAALSLA----QSYFNDPERKKYIIFLTDGMPT 312 >UniRef50_Q5QXN1 Uncharacterized protein containing a von Willebrand factor type A (VWA) domain n=3 Tax=Alteromonadales RepID=Q5QXN1_IDILO Length = 327 Score = 89.2 bits (219), Expect = 2e-16, Method: Composition-based stats. Identities = 28/213 (13%), Positives = 57/213 (26%), Gaps = 40/213 (18%) Query: 240 EKRPDPSSQAVMFCLMDVSGSMDQSTKDM----------AKRFYILLYLFLSRTYKNVEV 289 E + + +D+SGSM+ + + K + + Sbjct: 77 EPLAIRAEGREIMLAVDLSGSMEIADMQLEGRSVNRLTMVKHVLSDFIER-REGDRLGLI 135 Query: 290 VYIRHH------TQAKEVDEHEFFYSQ---ETGGTIVSSALKLMDEVVKERYNPAQWNIY 340 ++ T + + S T + A+ L VK + + N Sbjct: 136 LFADTAYLQTPMTYDRNTVKQMLNESVLGLVGERTAIGDAIALS---VKRFRDDEKSNRV 192 Query: 341 AAQASDGDNWADD-SPLCHEILAKKLLPVVRYYSYI---------------EITRRAHQT 384 +DG N A + P LA+ + + +R Sbjct: 193 LVLLTDGQNTAGNLPPEQALELAQAYDVTIYPIAVGAEEVVVDSFFGQRRVNPSRDLDVP 252 Query: 385 LWREYEHLQSTFDNFAMQHIRDQDDIYPVFREL 417 L + F + + ++IY +L Sbjct: 253 LMQSIAKQTGGKY-FRARSTNELEEIYQRLDKL 284 >UniRef50_B0XT03 von Willebrand domain protein n=4 Tax=Trichocomaceae RepID=B0XT03_ASPFC Length = 946 Score = 89.2 bits (219), Expect = 3e-16, Method: Composition-based stats. Identities = 26/233 (11%), Positives = 56/233 (24%), Gaps = 28/233 (12%) Query: 206 LLEEERLRKEIAELRAKIERV----PFIDTFDLRYKNYEKRPDPSSQAVMFCLMDVSGSM 261 L + L + L + + K + + ++D SGSM Sbjct: 246 LARDFVLLVKADGLDTPRAMLETHPVIPTQRAIMTTLVPKFGLEPIKPEIIFVIDRSGSM 305 Query: 262 DQSTKDMAKRFYILLYLFLSRTYKNVEVVYIRHHTQAKEVDEHEF-----------FYSQ 310 D K + L + H+ + + Sbjct: 306 -MDKIDTLKSALRVFLKSLPVGVCFNICSFGSRHSFLWKQSLFYTAESLQEALSFVDGVR 364 Query: 311 ET-GGTIVSSALKLMDEVVKERYNPAQWNIYAAQASDGDNWADDSPLCHEILAKK--LLP 367 GGT + A++ R + +DG W + ++ Sbjct: 365 ANMGGTEMQEAVEATV-----RSRMKDKELEVLILTDGQIW---NQQTLFGFIRETAADN 416 Query: 368 VVRYYSYIEITRRAHQTLWREYEHLQSTFDNFAMQHIRDQDDIYPVFRELFHK 420 R++S +H L + F + + + + + Sbjct: 417 GARFFSLGIGNGASHS-LVEGIARAGNGFSQMVVNYEELDRKVVRMLKGALTP 468 >UniRef50_A0CY84 Chromosome undetermined scaffold_307, whole genome shotgun sequence n=1 Tax=Paramecium tetraurelia RepID=A0CY84_PARTE Length = 625 Score = 89.2 bits (219), Expect = 3e-16, Method: Composition-based stats. Identities = 30/296 (10%), Positives = 71/296 (23%), Gaps = 49/296 (16%) Query: 155 YTANGVPANISVVRSLQNSLARRTAMTAGKRRELHALEENLAIISNSEPAQLLEEERLRK 214 + N V + R + + S+ ++ + Sbjct: 296 LQSTNAKNNSDVQTE--GNQKRIILQLQNIPENYETTRQFTLLFSSD---EINLPRAILS 350 Query: 215 EIAELRAKIERVPFIDTFDLRYKNYEKRP--------------DPSSQAVMFCLMDVSGS 260 + +R TF ++ ++ ++D SGS Sbjct: 351 HTDNDALQYQRYCATLTFIPKFNEVSLDDAYTQYLDGLSIADNQVINRGNYLFIIDRSGS 410 Query: 261 MDQSTKDMAKRFYILLYLFLSRTYKNVEVVYI----------------------RHHTQA 298 M S + AK+ IL L + + + + + Sbjct: 411 MSGSRIEKAKQALILFLKSLPQDSEFNIISFGIADIFLFFNHQSVPLNNVSSQQQFLGIV 470 Query: 299 KEVDEHEFFYSQET-GGTIVSSALKLMDEVVKERYNPAQWNIY-AAQASDGDNWADDSPL 356 + GGT + + L+ M V Y ++ +DG+ D + Sbjct: 471 QNEAIQHVEEMAANMGGTEILTPLQQM--VYNASYGTSKNTTLNVFMLTDGE-TDADQII 527 Query: 357 CHEILAKKLLPVVRYYSYIEITRRAHQTLWREYEHLQSTFDNFAMQHIRDQDDIYP 412 + + + + L + + + + I+ Sbjct: 528 QLVQSNNQAQTRIYTLGIGQGCSQY---LIQRVAEVGNGKSQIVSDKEDINEKIHQ 580 >UniRef50_A9B057 von Willebrand factor type A n=3 Tax=Chloroflexi (class) RepID=A9B057_HERA2 Length = 562 Score = 89.2 bits (219), Expect = 3e-16, Method: Composition-based stats. Identities = 43/318 (13%), Positives = 90/318 (28%), Gaps = 36/318 (11%) Query: 125 LDLLFEDLALPNLKQNQQRQLTEYKTHRAGYTANGVPANISVVRSLQNSLARRTAMTAGK 184 +L+E+L + + Q L A Y G + + L+ M A K Sbjct: 257 AAILYENLIIESYDQALYPNL--ELPMVAIYPKEGSFWSDHPLVVLETE-----RMNADK 309 Query: 185 RRELHALEENLAIISNSEPAQLLEEERLRKEIAELRAKIERVPFIDTFDLRYKNYEKR-- 242 R +E L A +I+ L A I+ +D L+ Sbjct: 310 RAAAQVFQEFLLAQPQQAKAMQYGFRPANVDIS-LAAPIDTAHGVDPSQLQVALPTPSAE 368 Query: 243 ---------PDPSSQAVMFCLMDVSGSMDQSTK-DMAKRFYILLYLFLSRTYKNVEVVYI 292 Q + ++D SGSM Q + AK + ++ Sbjct: 369 VLQAITQLWQQHKKQVDVALIIDTSGSMRQENRLREAKTALGDFIDIFADQDNVQVTIFS 428 Query: 293 RHHTQAKEV---------DEHEFFYSQETGGTIVSSALKLMDEVVKERYNPAQWNIYAAQ 343 + T+ ++ G T + S + + ++++ + Sbjct: 429 TNATELSDLSPIGPKRADLHTRIDGLVADGETRLYSTIGEVYTDIQQQTEVQRI-RALVV 487 Query: 344 ASDGDNWADD-SPLCHEILAK--KLLPVVRYYSYIEITRRAHQTLWREYEHLQSTFDNFA 400 +DG++ A S + + ++ ++ A+Q + + + Sbjct: 488 LTDGEDTASSLSLEQLNEQIRQDESGTSIKIFTIAYG-SDANQEVLQRIAEITGAKSY-- 544 Query: 401 MQHIRDQDDIYPVFRELF 418 +Y F Sbjct: 545 TGDPATIRQVYHEIATFF 562 >UniRef50_A7K1D3 Protein BatA n=19 Tax=Vibrionales RepID=A7K1D3_VIBSE Length = 334 Score = 89.2 bits (219), Expect = 3e-16, Method: Composition-based stats. Identities = 27/213 (12%), Positives = 55/213 (25%), Gaps = 40/213 (18%) Query: 240 EKRPDPSSQAVMFCLMDVSGSMDQSTKDM----------AKRFYILLYLFLSRTYKNVEV 289 + + ++D+SGSM Q ++ K+ + + V Sbjct: 88 DPVEFQPKYRDLMLVVDLSGSMQQEDMELNGEYIDRLTAVKKVLSDFVAK-RKGDRLGVV 146 Query: 290 VYIRHH------TQAKEVDEHEFFYSQ---ETGGTIVSSALKLMDEVVKERYNPAQWNIY 340 ++ H T ++ + + T + + L + + P Sbjct: 147 LFGDHAYLQTPLTADRKTVMQQINQTVIGLVGQRTAIGDGIGLGTKTFVDSDAP---QRV 203 Query: 341 AAQASDGDNWADD-SPLCHEILAKKLLPVVRYYSYIEI---------------TRRAHQT 384 SDG N A PL +AKK + + Sbjct: 204 MILLSDGSNTAGVLEPLEAAEIAKKYNATIYTVGVGAGEMMVKEFFMTRKVNTAADLDEQ 263 Query: 385 LWREYEHLQSTFDNFAMQHIRDQDDIYPVFREL 417 + + F + + IY +L Sbjct: 264 TLTKVAEMTGGQY-FRARDTDQLEKIYDTINQL 295 >UniRef50_Q6ABM1 Magnesium-chelatase 67 kDa subunit n=4 Tax=Actinomycetales RepID=Q6ABM1_PROAC Length = 654 Score = 89.2 bits (219), Expect = 3e-16, Method: Composition-based stats. Identities = 51/372 (13%), Positives = 99/372 (26%), Gaps = 36/372 (9%) Query: 68 RGGLRHRVHPGNDHFVQNDRIERPQGGGGG-SGSGQGQASQDGEGQDEFVFQISKDEYLD 126 +H + + ERP G+ + + + Sbjct: 286 PPRNQHPDDQPDQPEQRPREPERPDPDVEKWQAGENLATPPSSSGEQQPEYHDGPQ---N 342 Query: 127 LLFEDLALPNLKQNQQRQLTEYKTHRAGYTANGVPANISVVRSLQNSLARRTAMTAGKRR 186 + P + + + G P + + Q+ ARR + R Sbjct: 343 QRDDGQHDPRKQPSGSGEQVVAA---------GDPFAVRPLEPSQDRFARRACGRRLRTR 393 Query: 187 ELHALEENLAIISNSEPAQLLEEERLRKEIAELRAKIERVPFIDTFDLRYKNY-EKRPDP 245 ++ P L + LR +++ ++ ++ K Sbjct: 394 SNDRRGRYVSARPTDRPDDLALDATLRAAAVHQKSRRATERPDLAVHVKPIDWRAKVRAG 453 Query: 246 SSQAVMFCLMDVSGSMDQSTKDMAKR--FYILLYLFLSRTYKNVEVVY-------IRHHT 296 + + + ++D SGSM + A + LL + + + + + T Sbjct: 454 RAASCVIFVVDASGSMGSRGRMTASKGAVLSLLLDAYVKRDRVCLIGFRRDRAEVLVPVT 513 Query: 297 QAKEVDEHEFFYSQETGGTIVSSALKLMDEVVKERYNPA-QWNIYAAQASDGD-NWADDS 354 + EV +H G T +S+ L EVV+ +DG N + D Sbjct: 514 SSVEVAQHGLAELPVGGRTPLSAGLIKACEVVRPLLLKDPGLRPLLILVTDGRGNVSLDG 573 Query: 355 ------PLCHEILAKKLLPVVRYYSYIEITRRAHQTLW---REYEHLQSTFDNFAMQHIR 405 +A KL R + T R+ Sbjct: 574 RPNSQATDEAIRVATKLGVDTRLSWVVIDTEDPRGIQLSRARDIATALGGPCLRIDDLRA 633 Query: 406 DQDDIYPVFREL 417 D D+ V L Sbjct: 634 D--DLVNVVSRL 643 >UniRef50_D2A5S3 Putative uncharacterized protein GLEAN_15119 n=3 Tax=Tribolium castaneum RepID=D2A5S3_TRICA Length = 1022 Score = 89.2 bits (219), Expect = 3e-16, Method: Composition-based stats. Identities = 32/260 (12%), Positives = 75/260 (28%), Gaps = 24/260 (9%) Query: 174 LARRTAMTAGKRRELHALEENLAIISNSEPAQLLEEERLRKE----IAELRAKIERVPFI 229 + R + E N+ + + + A I I Sbjct: 1 MVRFVYCAKEVISGIIWSELLDKTFKNNYKQDPTLSWQYFGSSTGFMRQFPAMIWSQEPI 60 Query: 230 DTFDLRYKNYEKRPDPSSQAVMFCLMDVSGSMDQSTKDMAKRFYILLYLFLSRTYK---- 285 D FD R +++ SS + L+D SGSM +++A+ + L Sbjct: 61 DLFDCRTRSW-YIEAASSPKDVVILVDRSGSMTGMRREIARHVVHNILDTLGNNDYYTTD 119 Query: 286 -----NVEVVYIRHHTQAKEVDEHEFFYSQETGGTIVSSALKLMDEVVKERYNPAQ---W 337 + ++ + V + + +S AL ++++ N ++ Sbjct: 120 PLIECFDNI-LVQANLANVRVLKEAMADFKTEQIANLSLALVTAFQLLENYRNESKGANC 178 Query: 338 NIYAAQASDGDNWADDSPLCHEILAKKLL---PVVRYYSYIEITRRAHQTLWREYEHLQS 394 N +DG D+ + VR ++Y+ + + Sbjct: 179 NQAIMLVTDGV---QDNYMEIFRDYNWDNLPFINVRVFTYLIGREVSDVRDVKWMACANR 235 Query: 395 TFDNFAMQHIRDQDDIYPVF 414 + + ++++ Sbjct: 236 GYYVHLSTYAEVREEVLQYI 255 >UniRef50_D0LKC7 von Willebrand factor type A n=1 Tax=Haliangium ochraceum DSM 14365 RepID=D0LKC7_HALO1 Length = 346 Score = 89.2 bits (219), Expect = 3e-16, Method: Composition-based stats. Identities = 28/234 (11%), Positives = 52/234 (22%), Gaps = 51/234 (21%) Query: 238 NYEKRPDPSSQAVMFCLMDVSGSM----------DQSTKDMAKRFYILLY---LFL--SR 282 + + ++D SGSM DQ+ ++ K + L Sbjct: 80 AVGENTIRREGIAIMMVVDTSGSMRALDLADGGLDQTRLEVVKDVFRAFVAGEDGLDGRS 139 Query: 283 TYKNVEVVYIRHHTQAKEVDEHEFFYS-----------QETGGTIVSSALKLMDEVVKER 331 V + + + + GT + L L E ++E Sbjct: 140 NDTIGLVSFAGFADTRCPLTLNHGSLLTILDDLEIVRERAEDGTAIGDGLGLAVERLRES 199 Query: 332 YNPAQWNIYAAQASDGDNW-ADDSPLCHEILAKKLLPVVRYYSYIEITRRA--------- 381 + +DG N ++PL LA +L V Sbjct: 200 ---EASSRVIILLTDGVNNAGIETPLEAAELASRLGIKVYTIGAGTDGVAPVRVTNPLTG 256 Query: 382 -----------HQTLWREYEHLQSTFDNFAMQHIRDQDDIYPVFRELFHKQNAT 424 + F +Y L + + Sbjct: 257 AEELRPMPVEIDEATLEAIAEHTGGRY-FRATDGDGLRQVYEQIDRLERTEISE 309 >UniRef50_A6G2Y5 Putative uncharacterized protein n=1 Tax=Plesiocystis pacifica SIR-1 RepID=A6G2Y5_9DELT Length = 516 Score = 89.2 bits (219), Expect = 3e-16, Method: Composition-based stats. Identities = 44/362 (12%), Positives = 92/362 (25%), Gaps = 34/362 (9%) Query: 80 DHFVQNDRIERPQGGGGGSGSGQGQASQDGEGQDEFVFQISKDEYLDLLFEDLALPNLKQ 139 + +D G G G + E D+ E D +D A Sbjct: 10 SSTLDDDDSVADGYGDEGETQGADEGEDSTETGDDTGTSDDSSEASDTGEDDGA-GETGP 68 Query: 140 NQQRQLTEYKTHRAGYT--ANGVPANISVVRSLQNSLARRTAMTAGKRRELHALEENLAI 197 L E AG + + AN+ + +L R + L Sbjct: 69 TDPGMLAEACDPNAGQSHGYDLAVANLELAPALVREAVLFGDGKV--PRIPLSPRPFLNH 126 Query: 198 ISNSEPAQLLEEERLRKEIAELRAKIERVPFIDTFDLRYKNYEKRPDPSSQAVMFCLMDV 257 L ++ + F K P + ++D+ Sbjct: 127 FDFGYAPSPGASPSLSGQLWPVGPANAEGVERYRFQFAVKAPALSPQQRPPVDLAIVVDL 186 Query: 258 SGSMDQSTKDMAKRFYILLYLFLSRTYKNVEVVYIRHHTQAKEVDEHEFFYSQETGGTIV 317 SM + + + L + + +E + + G T + Sbjct: 187 GPSMAGEPLVLVEEALAAIESALGPGDRVTLIA------AGEEAMDLAAADIEGFGSTPL 240 Query: 318 S---------------SALKLMDEVVKERYNPAQW-NIYAAQASDGDNWADDSPLCHEIL 361 + +A++L ++ ++ + + S+G + SP + + Sbjct: 241 TGLINPEEAFGYAKLEAAIELAYASLESPWSGDEIGHRRVLLLSNGH--FEVSPALADTV 298 Query: 362 AKKLLPVVRYYSYIEITRR-AHQTLWREYEHLQSTFDNFAMQHIRDQDDIYPVFRELFHK 420 + S T + RE L FA ++ + F + Sbjct: 299 SSAAAEDRHLVSVATGTHELYADSALRELGRLGQGSAVFAPTADA----VWAKLADGFTE 354 Query: 421 QN 422 + Sbjct: 355 RM 356 >UniRef50_Q233P7 von Willebrand factor type A domain containing protein n=1 Tax=Tetrahymena thermophila SB210 RepID=Q233P7_TETTH Length = 790 Score = 89.2 bits (219), Expect = 3e-16, Method: Composition-based stats. Identities = 28/181 (15%), Positives = 48/181 (26%), Gaps = 24/181 (13%) Query: 258 SGSMDQSTKDMAKRFYILLYLFLSRTYKNVEVVYIR-----------HHTQAKEVDEHEF 306 S SM A IL L V + + T + E Sbjct: 236 SRSMSGQPIQKACEALILFLKSLPIDSYFNVVSFGSSYEKLFQSSIKYDTNSLEKAIKII 295 Query: 307 FYSQET-GGTIVSSALKLMDEVVKERYNPAQWNIYAAQASDGDNWADDSPLCHEILAKKL 365 GGT + L+ + + +N +DG+ SP L +K Sbjct: 296 KNYTADLGGTEIYKPLQSVFK----ETKIDGYNKQIFLLTDGE---VKSPKEVVKLIRKN 348 Query: 366 LPVVRYYSYIEITRRAHQTLWREYEHLQSTFDNFAMQHIRDQDDIYPVFRELFHKQNATA 425 + R S + A + L E + + + D+ + E+ Sbjct: 349 NKLNRINSIGFGSG-ADKYLIEESAITGKG----ISKIVDLKCDLSEIIIEMLSVCLTPT 403 Query: 426 K 426 Sbjct: 404 L 404 >UniRef50_B3EB10 von Willebrand factor type A n=2 Tax=Desulfuromonadales RepID=B3EB10_GEOLS Length = 572 Score = 89.2 bits (219), Expect = 3e-16, Method: Composition-based stats. Identities = 49/371 (13%), Positives = 97/371 (26%), Gaps = 38/371 (10%) Query: 17 VNRQRFLRRYKAQIKQSISEAINKRSVTDVDSGESVSIPTEDISEPMFHQGRGGLRHRVH 76 Q+ R + + +++ + + S D S + E G + Sbjct: 194 EKHQQEQERKRQEDQKAKTTCSSDGSKPDGSSNDQ--------EEYNQTSGTDQQSGKPK 245 Query: 77 P-GNDHFVQNDRIERPQGGGGGSGSGQGQASQDGEGQDEFVFQISKDEYLDLLFEDLALP 135 +D D G S + + + + + ++L D Sbjct: 246 QDTDDSSPDGDNTGTAAGDDSQSQAAGNPSPPVPKSGSSGELGGNPEALQEMLDCD---- 301 Query: 136 NLKQNQQRQLTEYKTHRAGYTANGVPANISVVRSLQNSLARRTAMTAGKRRELHALEENL 195 + E H V V+ + SL RR L L Sbjct: 302 ---PGGFGDVGEILKHELITEGETVGNRGGVLLPPEASLRTFPIDIHETRRHTALLRARL 358 Query: 196 AIISNSEPAQLLEEERLRKEIAELRAKIERVPFIDTFDLRYKNYEKRPDPSSQAVMFCLM 255 + + + + R +I V + D R + + L+ Sbjct: 359 SGLIQASRLRRRHARRTGSKIDN-----RVVHRLALNDTRLF-LHQEEKTEVNTAIMILV 412 Query: 256 DVSGSMDQSTKDMAKRFYILLYLFLS--RTYKNVEVVY-IRHHTQAKEVDEH------EF 306 D SGSM ++A + ++ L + + + + Sbjct: 413 DRSGSMQHQKIEVASKTAYVVAEALDSIPGCFAAVAAFPVGNSDGVAPLVRFGERPCSSK 472 Query: 307 FYSQETGGTIVSSALKLMDEVVKERYNPAQWNIYAAQASDGDNWADDSPLCHEILAKKLL 366 F GGT ++ AL + +R P +DG+ D + + LL Sbjct: 473 FGMSAGGGTPLAQALYWAGVELLKREEP---RKILLSVTDGEP---DDLRTTKRAIRWLL 526 Query: 367 P-VVRYYSYIE 376 + Sbjct: 527 EQGIEPMGLGI 537 >UniRef50_A9UIA7 Hedgling (Fragment) n=4 Tax=Nematostella vectensis RepID=A9UIA7_NEMVE Length = 3480 Score = 88.8 bits (218), Expect = 3e-16, Method: Composition-based stats. Identities = 28/184 (15%), Positives = 55/184 (29%), Gaps = 22/184 (11%) Query: 242 RPDPSSQAVMFCLMDVSGSMDQSTKDMAKRFYILLYLFLSRTYKNVEVVYIRHHTQAK-E 300 R + + ++D SGS+ + K F + F + K V I + T A+ E Sbjct: 171 RDVCQTSVDLVFILDTSGSVGSYNFEKMKTFVKNVVDFFNIGPKGTHVAVITYSTWAQVE 230 Query: 301 VDEHEFFYSQE------------TGGTIVSSALKLM----DEVVKERYNPAQWNIYAAQA 344 + S+ +G T + AL L +V A Sbjct: 231 FNLKAHHSSKAALKNAVNAIYYRSGWTYTADALDLAGRNIFQVANGMRPDKGIPKIAVLL 290 Query: 345 SDGDNWADDSPLCHEILAKKLLPVVRYYSYIEITRRAHQTLWREYEHLQSTFDNFAMQHI 404 +DG + ++ L V + ++ + F +++ Sbjct: 291 TDGYSNGNNPLGPANDL---RAAGVNVFCVGIG--NYYERELNDIATDPDKDHVFKLENF 345 Query: 405 RDQD 408 D + Sbjct: 346 NDLN 349 >UniRef50_Q82LZ6 Putative uncharacterized protein n=1 Tax=Streptomyces avermitilis RepID=Q82LZ6_STRAW Length = 462 Score = 88.8 bits (218), Expect = 3e-16, Method: Composition-based stats. Identities = 22/158 (13%), Positives = 42/158 (26%), Gaps = 12/158 (7%) Query: 235 RYKNYEKRPDPSSQAVMFCLMDVSGSMDQSTKDMAKRFYILLYLFLSRTYKNVEVVY--- 291 R + + ++D SGSM + D K +Y L + + Sbjct: 30 RARPEDAPATEPLPVNFVFVVDTSGSMTGTKLDTVKSALQTIYRELRPADCLGIITFDHN 89 Query: 292 -------IRHHTQAKEVDEHEFFYSQETGGTIVSSALKLMDEVVKERYNPAQWNIYAAQA 344 + E GGT + ++ + + + Sbjct: 90 VRTVLPAVAKQDLPPERFAEVVSALTTQGGTDIDLGVQYGIDEISRHSVSGRTVNCLYLF 149 Query: 345 SDGDNWAD--DSPLCHEILAKKLLPVVRYYSYIEITRR 380 SDGD + D +A KL + + + Sbjct: 150 SDGDPTSGERDWIKVRANVAAKLRGDLTLSCFGFGSDA 187 >UniRef50_C4RGW7 Putative uncharacterized protein n=1 Tax=Micromonospora sp. ATCC 39149 RepID=C4RGW7_9ACTO Length = 633 Score = 88.8 bits (218), Expect = 3e-16, Method: Composition-based stats. Identities = 35/273 (12%), Positives = 71/273 (26%), Gaps = 21/273 (7%) Query: 68 RGGLRHRVHPGNDHFVQNDRIERPQGGGGGSGSGQGQASQDGEGQDEFVFQISKDEYLDL 127 R ++ + G G + E DE ++ Sbjct: 283 PADARRYARALDELYGAGRGEGGIDLGHAAGGGQEAAFPTAREWADELEALFGTTVREEV 342 Query: 128 LFEDLA------LPNLKQNQQRQLTEYKTHRAGYTANGVPANISVVRSLQNSLARRTAMT 181 + L L R E T A ++ +R L L Sbjct: 343 IARAAESGRTDVLSELDPAAVRPSVELLTSVLSLAGGLPEAKLAGLRPLVRRLVEELTAR 402 Query: 182 AGKRRELHALEENLAIISNSEPAQLLEEERLRKEIAELRAKIERVPFIDTFDLRYKNYEK 241 + + ++ LR +A R + + + Sbjct: 403 LATQVRPALTGLTSPRPTRRPGGRINLARTLRANLAHTRRLDDGRTVVVPQRPVFH---T 459 Query: 242 RPDPSSQAVMFCLMDVSGSMDQSTKDMAKRFYILLYLFLSRTYKNVEVVYIRHHTQAKEV 301 R + + ++DVSGSM+ S A +L + ++ T+ ++ Sbjct: 460 RTRREADWRLVLVVDVSGSMEASVVWSALTAAVLA------GVPTLSTHFLAFSTEVVDL 513 Query: 302 DEHEFFYS------QETGGTIVSSALKLMDEVV 328 + + GGT +++ L +V Sbjct: 514 TDRVDDPLALLLEVRVGGGTHIAAGLAHARSLV 546 >UniRef50_Q7UL83 Inter-alpha-trypsin inhibitor family heavy chain-related protein-hypothetical secreted or membrane-associated protein containing vWFA domain n=1 Tax=Rhodopirellula baltica RepID=Q7UL83_RHOBA Length = 764 Score = 88.8 bits (218), Expect = 4e-16, Method: Composition-based stats. Identities = 25/195 (12%), Positives = 56/195 (28%), Gaps = 18/195 (9%) Query: 237 KNYEKRPDPSSQAVMFCLMDVSGSMDQSTKDMAKRFYILLYLFLSRTYKNVEVVYIRHHT 296 + P + + ++D SGSM+ + F + L+ + + + T Sbjct: 331 PKWSIEPTEITPREVILVLDTSGSMNGPAISQLRLFADHVLDHLNPNDEFRVIAFSNRTT 390 Query: 297 QAKEVDEHEFF-----------YSQETGGTIVSSALKLMDEVVKERYNPAQWNIYAAQAS 345 + + +GGT + ALKL + + + Y + Sbjct: 391 AFQPNAVSATDANIQSAKQFVRGLRASGGTNLLPALKLA---LGGEADESARPRYMILMT 447 Query: 346 DGDNWADDSPLCHEILAKKLLPVVRYYSYIEITRRAHQTLWREYEHLQSTFDNFAMQHIR 405 D + L + R + + L + + + + Sbjct: 448 D--ALVGNDHSILRYLRQPEFQDARVFPIAFGA-APNDYLISRAAEMGRG-FSMQVTNQD 503 Query: 406 DQDDIYPVFRELFHK 420 + +I F EL + Sbjct: 504 NTPEIARRFHELTSQ 518 >UniRef50_C3XUV0 Putative uncharacterized protein n=1 Tax=Branchiostoma floridae RepID=C3XUV0_BRAFL Length = 815 Score = 88.8 bits (218), Expect = 4e-16, Method: Composition-based stats. Identities = 46/379 (12%), Positives = 100/379 (26%), Gaps = 48/379 (12%) Query: 69 GGLRHRVHPGNDHFVQNDRIERPQGGGGGSGSGQGQASQDGEGQDEFVFQISKDEYLDLL 128 G D + + G G S++ G+ + L+ L Sbjct: 121 GENGDTDGDAEDTTGEAEDAAGETGDTDGQTKDTTGESKEAAGETPMDTPKT---LLEEL 177 Query: 129 FEDLALPNLKQNQQRQLTEYKTHRAGYTANGVPANISV--------VRSLQNSLARRT-- 178 P+LK+ + + G + + + V +R+L+ S+ + Sbjct: 178 RRQPETPSLKETPKTMMQR-LKKPMGIPKSLMEKTLKVLMWTLKLLMRTLKTSMEKTLRV 236 Query: 179 ---AMTAGKRRELHALEENLAIISNSE-PAQLLEEERLRKEIAELRAKI-ERVPFIDTFD 233 + + L E L + + L K + + + + Sbjct: 237 LMWTLKPLLEKTPETLMETLKPLMEKTLKTLIGTLTPLMKALKITNSVRGAGAHGLSVQN 296 Query: 234 LRYKNYEKRPDPSSQAVMFCLMDVSGSMDQSTKDMAKRFYILLYLFL---SRTYKNVEVV 290 R P +F L+D SGS+ + K+F + + + + Sbjct: 297 PRGSGSSTCEAP---VDLFFLLDGSGSVKAANFAKVKQFAVDMVNSFDVSPAATRVGVLQ 353 Query: 291 YIRHHT--------QAKEVDEHEFFYSQ-ETGGTIVSSALKLMDEVVKERYNPAQWNIYA 341 Y +T K + GGT +AL+ + R Sbjct: 354 YSNRNTLVFNLGNKVNKPTTVSAINSISYQGGGTRTGAALQYIRGNAAWRR--GNVPKVL 411 Query: 342 AQASDG---DNWADDSPLCHEILAKKLLPVVRYYSYIEITRRAHQTLWREYEHLQSTFDN 398 +DG D+ + S + Y+ + + + + Sbjct: 412 IVLTDGKSEDSVSGPSQNLVSDRVE-------VYAIGV--SNFDHEELLQIVNNKQSNVI 462 Query: 399 FAMQHIRDQDDIYPVFREL 417 I + +++ Sbjct: 463 ELNDFNALATKIDEIAQDV 481 >UniRef50_A0PNU3 UPF0353 protein MUL_1490 n=43 Tax=Actinomycetales RepID=Y1490_MYCUA Length = 335 Score = 88.4 bits (217), Expect = 4e-16, Method: Composition-based stats. Identities = 25/209 (11%), Positives = 53/209 (25%), Gaps = 34/209 (16%) Query: 241 KRPDPSSQAVMFCLMDVSGSMDQSTKD-----MAKRFYILLYLFLSRTYKNVEVVYIR-- 293 P ++AV+ ++DVS SM + + A+ L+ + Y Sbjct: 89 DVRIPRNRAVVMLVIDVSQSMRATDVEPNRMVAAQEAAKQFADELTPGINLGLIAYAGTA 148 Query: 294 ----HHTQAKEVDEHEFFYSQETGGTIVSSALKLMDEVVKE-----RYNPAQWNIYAAQA 344 T +E + Q T A+ + + Sbjct: 149 TVLVSPTTNREATKAALDKLQFADRTATGEAIFTALQAIATVGAVIGGGDTPPPARIVLF 208 Query: 345 SDGDNWADDSPL------CHEILAKKLLPVVRYYSYIEITR-----------RAHQTLWR 387 SDG +P AK + S+ + Sbjct: 209 SDGKETMPTNPDNPKGAYTAARTAKDQGVPISTISFGTPYGFVEINDQRQPVPVDDETMK 268 Query: 388 EYEHLQSTFDNFAMQHIRDQDDIYPVFRE 416 + L ++ + + + +Y ++ Sbjct: 269 KVAQLSGGN-SYNAATLAELNSVYVSLQQ 296 >UniRef50_A9GBN0 Putative uncharacterized protein n=1 Tax=Sorangium cellulosum 'So ce 56' RepID=A9GBN0_SORC5 Length = 940 Score = 88.4 bits (217), Expect = 4e-16, Method: Composition-based stats. Identities = 28/205 (13%), Positives = 51/205 (24%), Gaps = 23/205 (11%) Query: 225 RVPFIDTFDLRYKNYEKRPDPSSQAVMFCLMDVSGSMDQSTKDMAKRFYILLYLFLSRTY 284 L + +R M +MD SGSM +MAK LS Sbjct: 450 WYRTTIERILPVRMDNERKKDMPSVAMALVMDRSGSMTGLPLEMAKAAAKATAGVLSSDD 509 Query: 285 KNVEVVYIRHHT--------QAKEVDEHEFFYSQETGGTIVSSALKLMDEVVKERYNPAQ 336 + + T + + E Q GGT + SAL + + Sbjct: 510 LIEVIAFDSAPTRYVKMQPARNRSRIAGEIARIQPGGGTEIFSALDAAYQDM---TVTQA 566 Query: 337 WNIYAAQASDGDNWADDSPLCHEILAKKLLPVVRYYSYIEITRRAHQTLWREYEHLQSTF 396 + +DG + + + + + L + + Sbjct: 567 RKKHVILLTDGKASTGGIRDLVSAMIAE---SITVTTVGLGN-DLDEQLLKMIADVGGGR 622 Query: 397 DNFAMQHIRDQDDIYPVFRELFHKQ 421 + +F K+ Sbjct: 623 FHAVPDPNNLP--------RIFTKE 639 Score = 41.4 bits (95), Expect = 0.073, Method: Composition-based stats. Identities = 23/148 (15%), Positives = 41/148 (27%), Gaps = 27/148 (18%) Query: 246 SSQAVMFCLMDVSGSMDQSTKDMAKRFYILLYLFLSRTYKNVEVVYIRHHTQAKEVDEHE 305 + + L+DVS S+ D A+ + + + R A+ Sbjct: 110 TEKVATIYLIDVSESVTDEALDDARATLDKAFAEKPEDGVIKVITFARRPRLAERAAGEA 169 Query: 306 F------------FYSQETG-------GTIVSSALKLMDEVVKERYNPAQWNIYAAQASD 346 ++ G G+ + +AL+L + Y P SD Sbjct: 170 AGREEERRAPPIARHADANGQRGDLGAGSNLQAALQLAYGL----YPPGYLKRAV-LMSD 224 Query: 347 GDNWADDSPLCHEILAKKLLPVVRYYSY 374 G D + VR +S Sbjct: 225 GVQTDGD---VLAEANRARDVGVRLFSI 249 >UniRef50_A3TQW7 Putative membrane protein n=1 Tax=Janibacter sp. HTCC2649 RepID=A3TQW7_9MICO Length = 654 Score = 88.4 bits (217), Expect = 4e-16, Method: Composition-based stats. Identities = 27/195 (13%), Positives = 47/195 (24%), Gaps = 13/195 (6%) Query: 235 RYKNYEKRPDPSSQAVMFCLMDVSGSMDQSTKDMAKRFYILLYLFLSRTYKNVEVVYIRH 294 R +P +Q L+D SGSM +S + + + V + Sbjct: 73 RPSPVTSKPATRAQRTTVLLIDTSGSMGRSGMATVRTAVKDFLASAPKDVRIGVVSFGNT 132 Query: 295 ------HTQAKEVDEHEFFYSQETGGTIVSSALKLMDEVVKERYNPAQWNIYAAQASDGD 348 T A+ + + G T + S + ++ + SDG Sbjct: 133 AGPEIAPTTARAAVQAVVDDLRADGNTALFSGVTQAVRMLGSTGD-----RSIVLLSDGK 187 Query: 349 NWADDSPLCHEILAKKL-LPVVRYYSYIEITRRAHQTLWREYEHLQSTFDNFAMQHIRDQ 407 N D K L VR T + Sbjct: 188 NTVGDRASGLAAAGKALTASQVRVEVVRFTTGENDPEALAAFAKAGGGS-VVQATDAEGV 246 Query: 408 DDIYPVFRELFHKQN 422 + ++ Q Sbjct: 247 RTAFQTAAKVLESQV 261 >UniRef50_UPI00006CD1DE von Willebrand factor type A domain containing protein n=1 Tax=Tetrahymena thermophila RepID=UPI00006CD1DE Length = 938 Score = 88.4 bits (217), Expect = 5e-16, Method: Composition-based stats. Identities = 28/261 (10%), Positives = 66/261 (25%), Gaps = 31/261 (11%) Query: 176 RRTAMTAGKRRELHALEENLAIISNSEPAQLLEEERLRKEIAELRAKIERVPFIDTFDLR 235 L + + ++ + + + E ++ + Sbjct: 247 EEQVKKEILGERHSVLISFIPNFNKDISCEIDDSIKAAISSEKDIFSDEFQETVNQELVD 306 Query: 236 YKNYEKRPDPSSQAVMFCLMDVSGSMDQSTKDMAKRFYILLYLFLSRTYKNVEVVYIR-- 293 Y + K L+D S SM+ A IL L + + Sbjct: 307 YLSLSKSE-------FIFLLDRSASMEGLPIKRACEALILFLKSLPNDSYFNVLSFGSEF 359 Query: 294 ---------HHTQAKEVDEHEFFYSQET-GGTIVSSALKLMDEVVKERYNPAQWNIYAAQ 343 ++ Q E+ GT + + L + + ++N Sbjct: 360 EMLFPSSRKYNNQNLEIAIKIISNYTANLLGTEIYNPLSCIY----TKKRIQKYNRQIFL 415 Query: 344 ASDGDNWADDSPLCHEILAKKLLPVVRYYSYIEITRRAHQTLWREYEHLQSTFDNFAMQH 403 +DG + L KK R ++ A + L + + Sbjct: 416 LTDGH---VSNRDEVLNLIKKNNQFDRVHTIGFG-SDADKYLVNKSAFYGKG----ISRI 467 Query: 404 IRDQDDIYPVFRELFHKQNAT 424 + + D+ + ++ + Sbjct: 468 VDFKSDLSRIVLQMLCQSLTP 488 >UniRef50_UPI00017F3212 von Willebrand factor, type A n=1 Tax=Escherichia coli O157:H7 str. EC4024 RepID=UPI00017F3212 Length = 325 Score = 88.4 bits (217), Expect = 5e-16, Method: Composition-based stats. Identities = 30/272 (11%), Positives = 75/272 (27%), Gaps = 26/272 (9%) Query: 174 LARRTAMTAGKRRELHALEENLAIISNSEPAQLLEEERLRKEIAELRAKIERVPFIDTFD 233 L R +E + L + S E + + Sbjct: 24 LGRFLFTRERTVKEYVRVP-FLPGLIESLQLNQQPERSGKVATCMFWLVWALMV-CALAR 81 Query: 234 LRYKNYEKRPDPSSQAVMFCLMDVSGSM-------DQSTKDMAKRFYILLYLFLSRTYKN 286 Y + + + + ++DVSGSM + ++ ++ + Sbjct: 82 PEYLTPPQHIEKPMR-NIMLILDVSGSMEKNDVAGGLTRLQAVQQSVKKFV-AARKSDRI 139 Query: 287 VEVVYIRHH------TQAKEVDEHEFFYSQ---ETGGTIVSSALKLMDEVVKERYNPAQW 337 V++ ++ K+ E T + AL + +++ + Sbjct: 140 GLVIFANSAWPFAPVSEDKQALETRISQLTPGMAGQQTAIGDALGVTVKLLDSTGDKEA- 198 Query: 338 NIYAAQASDGDNWADD-SPLCHEILAKKLLPVVRYYSYIEITRRAHQ----TLWREYEHL 392 + A +DG++ A +P LA + ++ ++ L ++ + Sbjct: 199 SKLAILLTDGNDTASQLTPRLAAQLAVSHHVQLHTIAFGDVNSSGDDKVDLNLLQDLARM 258 Query: 393 QSTFDNFAMQHIRDQDDIYPVFRELFHKQNAT 424 A D ++ + Q + Sbjct: 259 TGGRSWTAENSGASLDAVWKEIDAITPVQVKS 290 >UniRef50_B3PJ55 von Willebrand factor type A domain protein n=1 Tax=Cellvibrio japonicus Ueda107 RepID=B3PJ55_CELJU Length = 318 Score = 88.4 bits (217), Expect = 5e-16, Method: Composition-based stats. Identities = 28/196 (14%), Positives = 61/196 (31%), Gaps = 25/196 (12%) Query: 240 EKRPDPSSQAVMFCLMDVSGSMDQSTK----------DMAKRFYILLYLFLSRTYKNVEV 289 E P+S + +D+SGSM + K+ ++ + V Sbjct: 80 EATSMPNSGRDLLLAVDISGSMREPDMVYNNRRITRLMAVKKVVGDFVAR-RQSDRLGLV 138 Query: 290 VY---------IRHHTQAKEVDEHEFFYSQETGGTIVSSALKLMDEVVKERYNPAQWNIY 340 ++ + + + E T + A+ L + ++E+ N Sbjct: 139 LFGTQAFLQAPLTFDVKTVQEMLIEAESGYAGEATAIGDAIALSIKRLREQPNA---KRV 195 Query: 341 AAQASDGDNWADD-SPLCHEILAKKLLPVVRYYSYIEITRRAHQTLWREYEHLQSTFDNF 399 +DG+N A + LA K + ++ R ++ F Sbjct: 196 IILLTDGENTAGELGIATATDLAVKANTKIYTIAFSPYDREVDSHSMQQIAEQTGGEF-F 254 Query: 400 AMQHIRDQDDIYPVFR 415 ++ RD ++I+ Sbjct: 255 RARNTRDLEEIHRQLD 270 >UniRef50_D0LJ27 von Willebrand factor type A n=1 Tax=Haliangium ochraceum DSM 14365 RepID=D0LJ27_HALO1 Length = 775 Score = 88.4 bits (217), Expect = 5e-16, Method: Composition-based stats. Identities = 22/194 (11%), Positives = 49/194 (25%), Gaps = 23/194 (11%) Query: 247 SQAVMFCLMDVSGSMDQSTKDMAKRFYILLYLFLSRTYKNVEVVYIRH-----------H 295 + + ++D SGSM + AK + L + + + Sbjct: 263 ADKDVTLVLDRSGSMSGAPLARAKDAAKAVVARLGDGDRVNVMAFDDGVDALFLRPVPIS 322 Query: 296 TQAKEVDEHEFFYSQETGGTIVSSALKLMDEV---VKERYNPAQWNIYAAQASDGDNWAD 352 + + + GGT ++ AL + + + +DG Sbjct: 323 AERRSQAVEYIDRLSDGGGTDLAGALAEALDAQHPSESEADTGSRPHVILFLTDGQ---- 378 Query: 353 DSPLCHEILAKKLLPVVRYYSYIEITRRAHQTLWREYEHLQSTFDNFAMQHIRDQDDIYP 412 +A+ R ++ + L + F +I Sbjct: 379 SDSQATLQVARGDAGDARVFTIGVGDG-VEKPLLARLASEKRGRFTFIA----SPSEIER 433 Query: 413 VFRELFHKQNATAK 426 L+ + A Sbjct: 434 KVSRLYSEIAAPVL 447 >UniRef50_B9XFD0 von Willebrand factor type A n=1 Tax=bacterium Ellin514 RepID=B9XFD0_9BACT Length = 338 Score = 88.4 bits (217), Expect = 5e-16, Method: Composition-based stats. Identities = 28/230 (12%), Positives = 54/230 (23%), Gaps = 49/230 (21%) Query: 237 KNYEKRPDPSSQAVMFCLMDVSGSM------------DQSTKDMAKRFYILLYLFLSRTY 284 + +S + +D+SGSM + +A+ ++ Sbjct: 76 FVQSETKVSASGVDIVVALDMSGSMLAEDEGFVLNGQQATRFIIARDVLKKFVDK-RQSD 134 Query: 285 KNVEVVYIRH------HTQAKEVDEHEFFYSQET---GG-TIVSSALKLMDEVVKERYNP 334 + VV+ T E G T + SAL ++E + Sbjct: 135 RIGLVVFGTQAYVAVPPTLDHEFLLKNLERLGIGSINGNQTAIGSALSTSMNRLRELKSK 194 Query: 335 AQWNIYAAQASDGDNWAD-DSPLCHEILAKKLLPVVRYYSYIEIT--------------- 378 + +DG N A PL A+ L + Sbjct: 195 ---SKIIILMTDGQNNAGKVPPLTAAEAARALGIKIYTIGVGTKGVARMAVGTDPFSGQK 251 Query: 379 ------RRAHQTLWREYEHLQSTFDNFAMQHIRDQDDIYPVFRELFHKQN 422 + + + + + IY L + Sbjct: 252 IYQQVPVDIDEGTLTSISKMTNAKY-YRADSTATLEKIYADIDRLEKTEA 300 >UniRef50_A8J658 Collagen-related protein n=1 Tax=Chlamydomonas reinhardtii RepID=A8J658_CHLRE Length = 387 Score = 88.4 bits (217), Expect = 5e-16, Method: Composition-based stats. Identities = 28/208 (13%), Positives = 58/208 (27%), Gaps = 17/208 (8%) Query: 198 ISNSEPAQLLEEERLRKEIAELRAKIERVPFIDTFDLRYKNYEKRPDPSSQA---VMFCL 254 ++ A RL +LR + + L Sbjct: 68 LTRRAAAIAGGYRRLYGSKDVTEKAAAPWTKPCPEELRASVEKLATRLVGDVQQVNVVFL 127 Query: 255 MDVSGSMDQSTKDMAKRFYILLYLFLSRTYKNVEVVYIRHHTQAKE-----------VDE 303 +D SGS++ + F + L+ + N++V ++ + + + Sbjct: 128 VDGSGSVNAEEFEAMLGFCVDASNQLAESVPNLQVAVVQFSNDVRVEVGLAPLDSEALRK 187 Query: 304 HEFFYSQETGGTIVSSALKLMDEVVKERYNPAQWNIYAAQASDG--DNWADDSPLCHEIL 361 + GGT V+ AL +++K P + +DG D++ Sbjct: 188 TTREMVRMNGGTNVAVALTKAGQLLKRDAAPDAM-RHVVLLTDGRVDSYQAHEARQVADQ 246 Query: 362 AKKLLPVVRYYSYIEITRRAHQTLWREY 389 V ++Y L Sbjct: 247 LADEQRHVSLFAYGVGRGVDRAELLHII 274 >UniRef50_Q0VTG8 Protein containing a von Willebrand factor type A domain n=5 Tax=Gammaproteobacteria RepID=Q0VTG8_ALCBS Length = 698 Score = 88.0 bits (216), Expect = 5e-16, Method: Composition-based stats. Identities = 36/312 (11%), Positives = 81/312 (25%), Gaps = 35/312 (11%) Query: 130 EDLALPNLKQNQQRQLTEYKTHRAGYTANGVPANISVVRSLQ--NSLARRTAMTAGKRRE 187 + L LP Q + +QL + N RS + N+ A Sbjct: 207 DALRLPKHPQAKIQQLGSQQWQVTLDNRNASTTIKGKGRSEKGDNNFAPPATNGHPPSAF 266 Query: 188 LHALEENLAIISNSEPAQLLEEERLRKEIAELRAKIERVPFIDTFDLRYKNYEKRPDPSS 247 + + + ++ + +R F+ + ++ Sbjct: 267 TLDQDIVVYWRHQQDLPGSVDLVAYKAP------GKDRGTFMLSITPGDDLPPI----TT 316 Query: 248 QAVMFCLMDVSGSMDQSTKDMAKRFYILLYLFLSRTYKNVEVVYI------------RHH 295 + ++D+SGSM+ + L + V++ Sbjct: 317 GSDWVFVLDISGSMN-AKLATLGDGVRQALGKLRGNDRFRIVLFDDRAEELTSGFVDATP 375 Query: 296 TQAKEVDEHEFFYSQETGGTIVSSALKLMDEVVKERYNPAQWNIYAAQASDG-DNWADDS 354 ++ + Q GGT + L L + +DG N Sbjct: 376 NNIRQYTQKIM-QLQSRGGTNLFGGLSLALTPLDADRPTG-----IVLVTDGVANVGKTR 429 Query: 355 PLCHEILAKKLLPVVRYYSYIEITRRAHQTLWREYEHLQSTFDNFAMQHIRDQDDIYPVF 414 L + VR ++++ A++ + + F I Sbjct: 430 QKDFIDLLEN--HDVRLFTFVMGNS-ANRPMLTAMTDASNGFAISVSNSDDIAGQILNAT 486 Query: 415 RELFHKQNATAK 426 ++ H+ + Sbjct: 487 SKVTHQAMNDVE 498 >UniRef50_Q2QZN4 von Willebrand factor type A domain containing protein n=2 Tax=Oryza sativa Japonica Group RepID=Q2QZN4_ORYSJ Length = 574 Score = 88.0 bits (216), Expect = 5e-16, Method: Composition-based stats. Identities = 37/291 (12%), Positives = 88/291 (30%), Gaps = 66/291 (22%) Query: 170 LQNSLARRTAMTAGKRRELHALEENLAIISNSEPAQLLEEERLRKEIAELRAKIERVPFI 229 +++A++ A + + E ++ +R IA + K + + Sbjct: 41 YYSTIAKQCNKKATSKAAIDRQEVRVST------------TPIRAAIARDQRKDDFEVLV 88 Query: 230 DTFDLRYKNYEKRPDPSSQAVMFCLMDVSGSMD----------QSTKDMAKRFYILLYLF 279 + P+ + + ++DVSGSM+ S D+ K + Sbjct: 89 TVEAPKV----VAPEKRAPIDLVAVLDVSGSMNKEEFVRGKHMSSRLDLLKIAMKYIIKL 144 Query: 280 LSRTYKNVEVVYI----------RHHTQAKEVDEHEFFYSQETGGTIVSSALKLM----- 324 + + V + R+ +++ E+ + +G T ALK Sbjct: 145 VRDADRLAIVSFNHAVVSEYGLTRNSADSRKKLENLVDKLKASGNTDFRPALKKAVEDMN 204 Query: 325 ------------DEVVKERYNPAQWNIY--AAQASDGDNWADDSPLCHEILAK------- 363 +++ R + SDG + S + E +AK Sbjct: 205 IQNIKNSSAYNNFQILDGRGKEEKKKRVGFILLLSDGVDQFQYSRINWEKVAKSTDVDHS 264 Query: 364 ---KLLPVVRYYSYIEITRRAHQTLWREYEHLQSTFDNFAMQHIRDQDDIY 411 +L +++ R+ L +F +++ + + + Sbjct: 265 EVGAMLRKYAVHTFGFSAS-HDPVPLRQISALSYGLYSFVCKNLDNITEAF 314 >UniRef50_Q7US47 Putative uncharacterized protein n=1 Tax=Rhodopirellula baltica RepID=Q7US47_RHOBA Length = 1291 Score = 88.0 bits (216), Expect = 6e-16, Method: Composition-based stats. Identities = 44/343 (12%), Positives = 84/343 (24%), Gaps = 33/343 (9%) Query: 80 DHFVQNDRIERPQGGGGGSGSGQGQASQDGEGQDEFVFQISKDEYLDLLFEDL------A 133 + R G G + + ++ D ++L A Sbjct: 941 RGHGEGSRGGLANAPSGMGGGTEAPEPTTAQWAEDLEALFGSDLCQEVLGTAAGNGRSTA 1000 Query: 134 LPNLKQNQQRQLTEYKTHRAGYTANGVPANISVVRSLQNSLARRTAMTAGKRRELHALEE 193 + L + E + ++ +R L L + A R + Sbjct: 1001 IELLDPDTVTPSLELLQQVLSLAGAMPESKVATLRRLARRLTEQLASELAVRLQPAMNGL 1060 Query: 194 NLAIISNSEPAQLLEEERLRKEIAELRAKIERVPFIDTFDLRYKNYEKRPDPSSQAVMFC 253 + + +L LR +A + + I L + + KR + Sbjct: 1061 SSPRPTRRRARKLNLPRTLRDNLANCHRRADGRATIVAEKLMFHSPSKRQM---DWHVTF 1117 Query: 254 LMDVSGSMDQSTKDMA-KRFYILLYLFLSRTYKNVEVVYIRHHTQAKEVDEHEFFYS--- 309 ++DVS SM S A LS V ++ T+ + E Sbjct: 1118 VVDVSASMSASVIYSALVAAVFDALPALS-------VRFLAFSTEVLDFSEQVADPLSLL 1170 Query: 310 ---QETGGTIVSSALKLMDEVVKERYNPAQWNIYAAQASDGDNWADDSPLCHEILAKKLL 366 Q GGT + L+ SD + + + + Sbjct: 1171 LEVQVGGGTDIGLGLRAA-----RAGVTVPSRSIVILVSDFE-EGVSVGRMIAEVRELVD 1224 Query: 367 PVVRYYSYI----EITRRAHQTLWREYEHLQSTFDNFAMQHIR 405 V+ R HQ + + + Sbjct: 1225 AGVKCLGLASLDDSGVARFHQGYAAMMAGAGMPVAAVSPEKLA 1267 >UniRef50_B3QY78 von Willebrand factor type A n=1 Tax=Chloroherpeton thalassium ATCC 35110 RepID=B3QY78_CHLT3 Length = 346 Score = 88.0 bits (216), Expect = 6e-16, Method: Composition-based stats. Identities = 30/235 (12%), Positives = 55/235 (23%), Gaps = 50/235 (21%) Query: 240 EKRPDPSSQAVMFCLMDVSGSM-----DQSTKDMAKRFYILLYLFLSRTYKNVEVVYIRH 294 + + +DVS SM S +K L + VV+ Sbjct: 80 RLKEVKRKGIEVVIALDVSNSMLADDIQPSRLQKSKYTISNFLERLG-NDRVGLVVFAGQ 138 Query: 295 H------TQAKEVDEHEFFYSQET----GGTIVSSALKL---MDEVVKERYNPAQWNIY- 340 T K + GT SSA++ E ++E + N Sbjct: 139 SFVQCPITSDKSALKLFMDIVSTDAIPTQGTNFSSAIRESIRALERIEEGAEAEEKNRVR 198 Query: 341 ---AAQASDGDNWADDSPLCHEILAKKLLPVVRYYSYI---------------------- 375 SDG++ + A + Sbjct: 199 NKVILIFSDGED-HEAGIDEVLEEAASKNIRIYTVGVGSAEPTPIPVLNKDGKRVDFKRD 257 Query: 376 ----EITRRAHQTLWREYEHLQSTFDNFAMQHIRDQDDIYPVFRELFHKQNATAK 426 +T + L R+ D + I +L ++ + + Sbjct: 258 SQGSVVTTHLQEALLRKIAEQTKGNYYRIAPQGSDFELIADDINKLEKQELSAKE 312 >UniRef50_C8VGR0 von Willebrand domain protein (AFU_orthologue; AFUA_4G01160) n=6 Tax=Trichocomaceae RepID=C8VGR0_EMENI Length = 1109 Score = 87.6 bits (215), Expect = 7e-16, Method: Composition-based stats. Identities = 24/186 (12%), Positives = 50/186 (26%), Gaps = 20/186 (10%) Query: 250 VMFCLMDVSGSMDQSTKDMAKRFYILLYLFLSRTYKNVEVVYIRHHTQAK---------- 299 + + D SGSM+ S + L + T Sbjct: 341 EIIFMADRSGSME-SKISSLINVMNIFIRSLPEACSFNIASFGSEVTWLWPCSKRYSQEN 399 Query: 300 -EVDEHEFFYSQET-GGTIVSSALKLMDEVVKERYNPAQWNIYAAQASDGDNWADDSP-L 356 +V + GGT + AL+ + + + +DG+ W D+ Sbjct: 400 LDVASKHVDSFRANYGGTNIYCALESVLDHFNK---QDDVPTNVILLTDGEVWDVDNVIQ 456 Query: 357 CHEILAKKLLPVVRYYSYIEITRRAHQTLWREYEHLQSTFDNFAMQHI--RDQDDIYPVF 414 +R++S R +H+ L + + Q+ + + Sbjct: 457 LVRRTVSMNGSNIRFFSLGIGDRVSHR-LVEGIGLQGGGYAEVVPESSMGSWQERVIQML 515 Query: 415 RELFHK 420 + Sbjct: 516 KAALSP 521 >UniRef50_A0M6V9 Membrane protein containing von Willebrand factor (VWA) type A domain n=21 Tax=Bacteroidetes RepID=A0M6V9_GRAFK Length = 354 Score = 87.6 bits (215), Expect = 7e-16, Method: Composition-based stats. Identities = 24/225 (10%), Positives = 55/225 (24%), Gaps = 45/225 (20%) Query: 240 EKRPDPSSQAVMFCLMDVSGSMDQSTK-----DMAKRFYILLYLFLSRTYKNVEVVYIRH 294 + + +DVS SMD + +K+ + L + + + Y Sbjct: 81 KMETVKREGVDIVFAIDVSKSMDAEDIAPNRLEKSKQLVSQILSSLG-SDRVGIIAYAGG 139 Query: 295 H----------TQAKEVDEHEFFYSQETGGTIVSSALKLMDEVVKERYNPAQWNIYAAQA 344 + AK + + GT +S A++L + Q N Sbjct: 140 AYPQLPITTDFSAAKMFLQALNTDMISSQGTAISDAIELATTYYD---DDQQTNRVLFII 196 Query: 345 SDGDNWADDSPLCHEILAKKLLPVVRYYSYIEITRRA----------------------- 381 SDG++ + + A + + Sbjct: 197 SDGED-HEGNVEDIAEQAAEKGIRIFTIGVGTEKGGPIPIKRNGVVQNYKKDRQGETVIT 255 Query: 382 --HQTLWREYEHLQSTFDNFAMQHIRDQDDIYPVFRELFHKQNAT 424 + +E + D++ + + + Sbjct: 256 KLNPATLQEIANATDGSYIEGNVTSEVVDNVTEKLQNIEKTEFEA 300 >UniRef50_Q6VUC2 Putative uncharacterized protein n=1 Tax=Antonospora locustae RepID=Q6VUC2_ANTLO Length = 824 Score = 87.6 bits (215), Expect = 8e-16, Method: Composition-based stats. Identities = 41/305 (13%), Positives = 86/305 (28%), Gaps = 41/305 (13%) Query: 148 YKTHRAGYTANGVPANISVVRSLQNSLARR---TAMTAGKRRELHALEENLAIISNSEPA 204 + Y + + S VRS + R + G E + + + + Sbjct: 179 PRKAYFKYGDTVLRSEASFVRSDDEATFRFEIDFSGVRGWSIECNDDHDVKNRVYTIAES 238 Query: 205 QLLEEERLRKEIAELRAKIERVPFIDTFDLRYKNYEKRPDPSSQA-----VMFCLMDVSG 259 + E+ R + R ID ++ + + +A + ++D SG Sbjct: 239 KRDEDIVFRMRMEREERVSARHFEIDNHNVVELTIVPKMEVLRRAVLCSREIILVVDKSG 298 Query: 260 SMDQS-----TKDMAKRFYILLYLFLSRTYKNVEVVYIRHHTQAKEVD------------ 302 SM + + + +V VV+ H +V Sbjct: 299 SMGWKCGGTVPHKLVSEAIEVFMSTVCDLNIHVNVVFFDHECCESDVLFRGCSERLTSEQ 358 Query: 303 ------EHEFFYSQETGGTIVSSALKLMDEVVKERYNPAQWNIYAAQASDGDNWADDSPL 356 E + GGT + + L+ + +K +DG D + Sbjct: 359 GLLDKKEWVREKAGPRGGTCIVAGLQRAVD-LKPAAEDGSIRRNIILLTDG---GDSNLR 414 Query: 357 CHEILA-KKLLPVVRYYSYIEITRRAHQTLWREYEHLQSTFDNFAMQHIRDQDDIYPVFR 415 L ++ R+++ ++ T+ +F RD D+ Sbjct: 415 EITSLVQREAAKGTRFFAIGIGNGVSYDTVME-VARAGRGTHDFI----RDACDVGSCLS 469 Query: 416 ELFHK 420 + K Sbjct: 470 SMLEK 474 >UniRef50_Q897H0 Membrane-associated protein n=1 Tax=Clostridium tetani RepID=Q897H0_CLOTE Length = 842 Score = 87.6 bits (215), Expect = 8e-16, Method: Composition-based stats. Identities = 26/186 (13%), Positives = 47/186 (25%), Gaps = 20/186 (10%) Query: 241 KRPDPSSQAVMFCLMDVSGSMDQS-----TKDMAKRFYILLYLFLSRTYKNVEVVYIRHH 295 K A + L+D SGSMD ++AK+ I L + + Sbjct: 401 KNKRKQGDAGIVLLIDCSGSMDDESGGVKKIELAKQGAIETIKALESEDYIGILGFSDTI 460 Query: 296 TQ--------AKEVDEHEFFYSQETGGTIVSSALKLMDEVVKERYNPAQWNIYAAQASDG 347 KE E + GGT++ L + + + +DG Sbjct: 461 DWVVPFQKAENKEKLIKEVGKLKPKGGTLIIPGLIEGVKTLSSAKTK---VKHMILLTDG 517 Query: 348 DNWADDSPLCHEILAKKLLPVVRYYSYIEITRRAHQTLWREYEHLQSTFDNFAMQHIRDQ 407 + + KK + E + + + F+ Sbjct: 518 QAE-KNGFDKYLENMKKNNMTLSTVGLGE---DSDREVLTHLSDFTGGRKYFSNDFKSVP 573 Query: 408 DDIYPV 413 Sbjct: 574 IIFAKE 579 >UniRef50_UPI000186D9CB dihydropyridine-sensitive L-type calcium channel subunits alpha-2/delta precursor calcium channel subunit, putative n=1 Tax=Pediculus humanus corporis RepID=UPI000186D9CB Length = 1205 Score = 87.6 bits (215), Expect = 8e-16, Method: Composition-based stats. Identities = 25/220 (11%), Positives = 52/220 (23%), Gaps = 25/220 (11%) Query: 217 AELRAKIERVPFIDTFDLRYKNYEKRPDPSSQAVMFCLMDVSGSMDQSTKDMAKRFYILL 276 A + + + +R + + LMD SGSM +AK L Sbjct: 198 AMKWSTSDNDVDLYDCRMR---PWFIEAATCTKDIVILMDNSGSMTGMRNTIAKLVVNSL 254 Query: 277 YLFLSRTYKNVEVVYIRHHTQAKEVDE---------------HEFFYSQETGGTIVSSAL 321 + + + + G +A Sbjct: 255 LKTFGNNDFINVLKFSWKPETVMPCFKDSLVQATPEVLKSFQEAVSLVKPEGNASFPNAF 314 Query: 322 KLMDEVVKERYNP-------AQWNIYAAQASDGDNWADDSPLCHEILAKKLLPVVRYYSY 374 ++K+ N +DG + L + +P+VR ++Y Sbjct: 315 SYSLNLLKKYREDRNATNNLGGCNQAIMLVTDGLPGNVTEVFENLNLDENGMPIVRIFTY 374 Query: 375 IEITRRAHQTLWREYEHLQSTFDNFAMQHIRDQDDIYPVF 414 + T ++ + D + Sbjct: 375 LVGTEVKGVEDLQQMACSNRGYYVHVHDLDEVHDQVLKYI 414 >UniRef50_Q54LJ4 Type A von Willebrand factor domain-containing protein n=3 Tax=Eukaryota RepID=Q54LJ4_DICDI Length = 2563 Score = 87.6 bits (215), Expect = 8e-16, Method: Composition-based stats. Identities = 44/331 (13%), Positives = 101/331 (30%), Gaps = 36/331 (10%) Query: 122 DEYLDLLFEDLALP-NLKQNQQRQLTEYKTH-------RAGYTANGVPANISVVRSLQNS 173 + +D L LP ++ Q+ Q + T +ISV + + Sbjct: 889 ELSIDGLDISFVLPRSITPKQRLQSSSSNTQSVTSTVQVTELAQKQSDLSISVGIEMPYN 948 Query: 174 LARRTAMTAGKRRELHALEENLAIISNSEPAQLLEEERLRKEIAELRAKI------ERVP 227 + + + T R + + + ++N + L + +L + E + E+ Sbjct: 949 IVKLISPTHDVRIKRTHTKATIE-LNNQDNQYLDKNFQLLIGLEEPYSPRMWVEVDEKGH 1007 Query: 228 FIDTFDLRYKNYEKRPDPSSQAVMFCLMDVSGSMDQSTKDMAKRFYILLYLFLS--RTYK 285 K S ++ L+D+S SM + R + L + Sbjct: 1008 HASMLAFYPKLDIDNTMKDSHTMVTLLIDLSSSMAGDAFEDLLRAVRITISNLRGMQKVL 1067 Query: 286 NVEVVYIRHHTQ-----------AKEVDEHEFFYSQET-GGTIVSSALKLMDEVVKERYN 333 V + ++ + + + GGT++ L+ + + ++ Sbjct: 1068 FDVVCFGDTFDWLFGIGVPPTESNLQIAWSHINHLKTSYGGTLLHQPLQSLYLLAEKAKP 1127 Query: 334 PAQWNIYAAQASDGDNWADDSPLCHEILAKKLLPVVRYYSYIEITRRAHQTLWREYEHLQ 393 N +DG + ++L KK P R +++ + + L Sbjct: 1128 TNPHN--ILLFTDG---NVANEELVQMLVKKASPYCRMFAFGIG-EHCSRHFVKSICRLG 1181 Query: 394 STFDNFAMQHIR-DQDDIYPVFRELFHKQNA 423 + F + R + I + L + Sbjct: 1182 GGYPEFIQTNKRPNPKKIIDQLQRLTQPAMS 1212 >UniRef50_A6NF34 Anthrax toxin receptor-like n=8 Tax=Catarrhini RepID=ANTRL_HUMAN Length = 565 Score = 87.6 bits (215), Expect = 8e-16, Method: Composition-based stats. Identities = 18/187 (9%), Positives = 49/187 (26%), Gaps = 11/187 (5%) Query: 221 AKIERVPFIDTFDLRYKNYEKRPDPSSQAVMFCLMDVSGSMDQSTKDMAKRFYILLYLFL 280 ++ + ++ + ++ ++D SGS++ + D+ + F Sbjct: 47 SRRAHHHHGPGWRQHWRQGQAGHRCQGSFDLYFILDKSGSVNNNWIDLYMWVEETVARFQ 106 Query: 281 SRTYKNVEVVYIRHH------TQAKEVDEHEFFYSQ---ETGGTIVSSALKLMDEVVKER 331 S + + Y T K ++ Q G T + + + + ++ Sbjct: 107 SPNIRMCFITYSTDGQTVLPLTSDKNRIKNGLDQLQKIVPDGHTFMQAGFRKAIQQIESF 166 Query: 332 YNPAQWNIYAAQASDGDNWADDSPLCHEILAKKLLPVVRYYSYIEITRRAHQTLWREYEH 391 + + +DG+ A K Y+ + Sbjct: 167 NSGNKVPSMIIAMTDGELVAHAFQDTLREAQKARKLGANVYTLGVA--DYNLDQITAIAD 224 Query: 392 LQSTFDN 398 Sbjct: 225 SPGHVFA 231 >UniRef50_C0QK11 Putative uncharacterized protein n=1 Tax=Desulfobacterium autotrophicum HRM2 RepID=C0QK11_DESAH Length = 332 Score = 87.6 bits (215), Expect = 9e-16, Method: Composition-based stats. Identities = 24/222 (10%), Positives = 55/222 (24%), Gaps = 44/222 (19%) Query: 240 EKRPDPSSQAVMFCLMDVSGSMDQ----------STKDMAKRFYILLYLFLSRTYKNVEV 289 K + + +D+S SM + D K + + V Sbjct: 78 RKMNVKTEGINIILALDLSKSMAALDFKLDGAIVNRLDAVKNVVKDFIMK-RSGDRIGMV 136 Query: 290 VYIRHH------TQAKEVDEHEFFYSQ---ETGGTIVSSALKLMDEVVKERYNPAQWNIY 340 V+ T+ + + T + A+ + + +++ + + Sbjct: 137 VFGSEAFTQMPLTRDYDTIAFVLSRLKIGAAGPSTAIGDAMGISLKRLEDVKSKSN---I 193 Query: 341 AAQASDGD-NWADDSPLCHEILAKKLLPVVRYYSYIEITRRA------------------ 381 +DG N + +P +A++ V + + Sbjct: 194 VILLTDGKSNSGEITPGAAADIARERGVKVYTIGVGQRGKAPFLVNDPLFGQRYVYQMVD 253 Query: 382 -HQTLWREYEHLQSTFDNFAMQHIRDQDDIYPVFRELFHKQN 422 +E FA IY + L + Sbjct: 254 MDHEALKEIADKTGGAF-FAAADTDSLKKIYDMIDSLEKTEV 294 >UniRef50_Q3M2E0 von Willebrand factor, type A n=2 Tax=Cyanobacteria RepID=Q3M2E0_ANAVT Length = 218 Score = 87.2 bits (214), Expect = 9e-16, Method: Composition-based stats. Identities = 20/166 (12%), Positives = 51/166 (30%), Gaps = 16/166 (9%) Query: 239 YEKRPDPSSQAVMFCLMDVSGSMDQSTKDMAKRFYILLYLFLSRTYKNVE---VVYIRHH 295 E +P ++ + L+D SGSM R + + + V I Sbjct: 6 PEFVENPENRCPVILLLDTSGSMSGQPIQELNRGLATFKEDVIKDSQASLSVEVAIITFG 65 Query: 296 -----TQAKEVDEHEFFYSQETGGTIVSSALKLMDEVVKERYNPAQWN------IYAAQA 344 +D+ + G T + A++ ++++ R + + N + Sbjct: 66 PVRLVQDFVNIDQFTPPQLEAEGVTPMGEAIEYALDLLETRKSAYKENGILYYRPWIFLI 125 Query: 345 SDGDNWADDSPLCHEILAKKLLPVVRYYSYIEITRRAHQTLWREYE 390 +DG + + + +++ ++ R+ Sbjct: 126 TDGAPTDYYHLAAQRVKEAEANRRLCFFTVGVQGADFNK--LRQIA 169 >UniRef50_B0TL26 von Willebrand factor type A n=7 Tax=Gammaproteobacteria RepID=B0TL26_SHEHH Length = 345 Score = 87.2 bits (214), Expect = 9e-16, Method: Composition-based stats. Identities = 30/213 (14%), Positives = 56/213 (26%), Gaps = 40/213 (18%) Query: 240 EKRPDPSSQAVMFCLMDVSGSMDQSTK----------DMAKRFYILLYLFLSRTYKNVEV 289 E PS + +D+SGSM + + + + + Sbjct: 75 EAIELPSKGRDLMLSVDLSGSMQIEDMVLDGKVVDRFSLIQHVISDFIER-RKGDRIGLI 133 Query: 290 VYIRHH------TQAKEVDEHEFFYSQ---ETGGTIVSSALKLMDEVVKERYNPAQWNIY 340 ++ H TQ + +Q T + A+ L + + Q N Sbjct: 134 LFADHAYLQSPLTQDRRTVAQYLKEAQIGLVGKQTAIGEAIALAVKRFDKV---EQSNRV 190 Query: 341 AAQASDGDNW-ADDSPLCHEILAKKLLPVVRYYSYI---------EITR------RAHQT 384 +DG N SP +A K + ++ Sbjct: 191 LILLTDGSNNAGAISPEQATQIAAKRGITIYTIGVGADVMERRTLFGKERVNPSMDLDES 250 Query: 385 LWREYEHLQSTFDNFAMQHIRDQDDIYPVFREL 417 +E F ++ + + IY V L Sbjct: 251 QLQEIAKTTGGQY-FRARNTEELEQIYQVIDTL 282 >UniRef50_D0QYP1 von Willebrand factor A domain containing 5A n=12 Tax=Clupeocephala RepID=D0QYP1_SALSA Length = 791 Score = 87.2 bits (214), Expect = 9e-16, Method: Composition-based stats. Identities = 30/255 (11%), Positives = 63/255 (24%), Gaps = 33/255 (12%) Query: 189 HALEENLAIISNSEPAQLLEEERLRKEIAELRAKIERVPFIDTFDLRYKNYEKRPDPSSQ 248 +E L +P ++E + + L + + + + +S Sbjct: 226 RDVELLLYYQDTHQPTAIVEAAQTSAKPGSLMGDPMVMLSMYPEFPK----DVMSLMTSH 281 Query: 249 AVMFCLMDVSGSMDQSTKD---------MAKRFYILLYLFLSRTYKNVEVVYIR------ 293 ++D SGSM + A+ +LL L + + Sbjct: 282 GEFLFVVDCSGSMSCPMHNGSGAQDRIGSARDTLLLLLKSLPMGCYFNIIGFGSRYESFF 341 Query: 294 -----HHTQAKEVDEHEFFYSQET-GGTIVSSALKLMDEVVKERYNPAQWNIYAAQASDG 347 + + + + + GG+ + LK + +DG Sbjct: 342 PKSVEYSQKTMDEALQKVKAMRADQGGSEILQPLKHIYSQ----PCLPDQPRQLFLFTDG 397 Query: 348 DNWADDSPLCHEILAKKLLPVVRYYSYIEITRRAHQTLWREYEHLQSTFDNFAMQHIRDQ 407 + + L K R +S+ + L F R Q Sbjct: 398 E---VRNTKEVLDLVKANAGSHRCFSFGIGEGAS-SALISGVAKQGGGHAQFITGQDRMQ 453 Query: 408 DDIYPVFRELFHKQN 422 + R Sbjct: 454 PKVMQSLRFALQPAV 468 >UniRef50_UPI0000ECD6E7 Poly [ADP-ribose] polymerase 4 (EC 2.4.2.30) (PARP-4) (Vault poly(ADP- ribose) polymerase) (VPARP) (193 kDa vault protein) (PARP- related/IalphaI-related H5/proline-rich) (PH5P). n=5 Tax=Tetrapoda RepID=UPI0000ECD6E7 Length = 1691 Score = 87.2 bits (214), Expect = 1e-15, Method: Composition-based stats. Identities = 22/218 (10%), Positives = 55/218 (25%), Gaps = 16/218 (7%) Query: 214 KEIAELRAKIERVPFIDTFDLRYKNY--EKRPDPSSQAVMFCLMDVSGSMDQSTKDMAKR 271 + + + + L ++ + + + L+D S SM S AK+ Sbjct: 849 AYLPRMWVEKHPNKNSEACMLVFQPEFEAAFDEEQLSSEIIILLDCSNSMAGSALLQAKQ 908 Query: 272 FYILLYLFLSRTYKNVEVV-------YIRHHTQAKEVDEHEFFYSQETGGTIVSSALKLM 324 + S + + + + T+ ++ L Sbjct: 909 IALHALKQFSSRQNVNLIKFGTNFSEFSSFSKNTSKDLASLTEFITSATATMGNTDLWKT 968 Query: 325 DEVVKERYNPAQWNIYAAQASDGDNWADDSPLCHEILAKKLLPVVRYYSYIEITRRAHQT 384 + + SDG + L K + R ++ A++ Sbjct: 969 LRYLSLLFPSQGH-RNILLISDGH---IQNESVTFQLVKDNVHHTRLFTCGVG-STANRH 1023 Query: 385 LWREYEHLQSTFDNFAMQHIRD--QDDIYPVFRELFHK 420 + R + + + + I +F Sbjct: 1024 MLRSLSQYGAGAFEYFDSKSKYNWEAKIQNQVSRIFSP 1061 >UniRef50_C7R6G5 von Willebrand factor type A n=1 Tax=Kangiella koreensis DSM 16069 RepID=C7R6G5_KANKD Length = 348 Score = 87.2 bits (214), Expect = 1e-15, Method: Composition-based stats. Identities = 26/212 (12%), Positives = 57/212 (26%), Gaps = 42/212 (19%) Query: 243 PDPSSQAVMFCLMDVSGSMDQSTKDM----------AKRFYILLYLFLSRTYKNVEVVYI 292 P++ + +D+SGSM+ + K + + +++ Sbjct: 80 DLPATGRDLMISIDISGSMEMPDMVIEDKEVDRLVAVKALLTDFIAR-RKGDRVGMILFG 138 Query: 293 RHH------TQAKEVDEHEFF----YSQETGGTIVSSALKLMDEVVKERYNPAQWNIYAA 342 T + + + T + + L + ++ER N Sbjct: 139 EQAYLQTPLTFDLKTVQTMLDETTIGLAGSSRTAIGDGIGLAVKRLRER---DANNRVLI 195 Query: 343 QASDGD-NWADDSPLCHEILAKKLLPVVRYYSYI----------EITR------RAHQTL 385 +DG N +PL LA+ + R + Sbjct: 196 LLTDGQNNTGALNPLQAAELAEHAGITIYTIGVGADEMIVKNRFFGNRRINPSLELDEES 255 Query: 386 WREYEHLQSTFDNFAMQHIRDQDDIYPVFREL 417 F + ++ ++IY + EL Sbjct: 256 LIAVAEKTGGRY-FRARDTKEMEEIYQIIDEL 286 >UniRef50_B6H7Y2 Pc16g08660 protein n=1 Tax=Penicillium chrysogenum Wisconsin 54-1255 RepID=B6H7Y2_PENCW Length = 944 Score = 87.2 bits (214), Expect = 1e-15, Method: Composition-based stats. Identities = 26/204 (12%), Positives = 53/204 (25%), Gaps = 24/204 (11%) Query: 231 TFDLRYKNYEKRPDPSSQAVMFCLMDVSGSMDQSTKDMAKRFYILLYLFLSRTYKNVEVV 290 L K + + ++D SGSM+ S K + L Sbjct: 270 QCALMASLVPKFNLRPASPEVVFVIDRSGSME-SKIPTLKSALQVFLKSLPVGICFNICS 328 Query: 291 YIRHHTQAKEVDEHEF-----------FYSQET-GGTIVSSALKLMDEVVKERYNPAQWN 338 + +++ + GGT + A+ V+ R N + Sbjct: 329 FGSYYSFMWPTSQVYDASSLNQALAFVDTVYADMGGTEMKQAVVA---TVQNRLNFEDLD 385 Query: 339 IYAAQASDGDNWADDSPLCHEILAKKLLPV--VRYYSYIEITRRAHQTLWREYEHLQSTF 396 +DG + D ++ R++S +H L + F Sbjct: 386 --VLILTDGQIFDQD---QLFNFVREKAADNTARFFSLGIGEAASHS-LIEGIARAGNGF 439 Query: 397 DNFAMQHIRDQDDIYPVFRELFHK 420 ++ I + + Sbjct: 440 CQSVTEYETLDRKIVRMLKGALTP 463 >UniRef50_A8J0D9 Flagellar associated protein n=1 Tax=Chlamydomonas reinhardtii RepID=A8J0D9_CHLRE Length = 4349 Score = 87.2 bits (214), Expect = 1e-15, Method: Composition-based stats. Identities = 34/331 (10%), Positives = 76/331 (22%), Gaps = 32/331 (9%) Query: 126 DLLFEDLALPNLKQNQQRQLTEYKTHRAGYTANGVPANISVVRSLQNSLARRTAMTAGKR 185 +L + LP ++++ A + + R+ A + Sbjct: 851 AILIKAAELPLFRRHELVSAAAADPDLALAFTDPAEPTQRPTPQAVGMVTRKLAGLLQQV 910 Query: 186 RELHALEENLAIISNSEPAQLLEEE-RLRKEIAELRAKIERVPFIDTFDLRYKNYEKRPD 244 R+ + A + P E L ++ K + K + Sbjct: 911 RDSSPPPTDEAHETRQGPVASDEAAEPLTLKVLPEYEKYALGKEACRAVISIKASAEVKQ 970 Query: 245 PSSQAVMFCLMDVSGSMDQSTKDMAKRFYILLYLFLSRTYKNVEVVYIRHHTQAKEVDEH 304 + + C++D SGSM ++ + L L+ V Y + + Sbjct: 971 R-AHVALTCVLDRSGSMGGERIELVRETCHFLIDQLTADDYLGIVSYSNTVREDVPLLRM 1029 Query: 305 EFF----------YSQETGGTIVSSALKLMDEV-----------VKERYNPAQWNIYA-- 341 GGT + + L+ + + + Sbjct: 1030 TPEARRLAHTMISSLTLHGGTALYAGLEAGVKQQMAAASELKALAAAAGGGSDSSRIVHS 1089 Query: 342 -AQASDGD-NWADDSPLCHEILAKKLL----PVVRYYSYIEITRRAHQTLWREYEHLQST 395 +DG + L + +++ L + QS Sbjct: 1090 CFLFTDGQATTGPCTVNEIMGQMTSLQSPADQNITVHTFGFG-DDHSVELLQGVAEAQSG 1148 Query: 396 FDNFAMQHIRDQDDIYPVFRELFHKQNATAK 426 + L + Sbjct: 1149 VYYYISCADDIPSGFGDALGGLLAVVAKDVR 1179 >UniRef50_Q4RTR8 Chromosome 2 SCAF14997, whole genome shotgun sequence n=3 Tax=Tetraodon nigroviridis RepID=Q4RTR8_TETNG Length = 1406 Score = 87.2 bits (214), Expect = 1e-15, Method: Composition-based stats. Identities = 31/274 (11%), Positives = 74/274 (27%), Gaps = 18/274 (6%) Query: 138 KQNQQRQLTEYKTHRAGYTANGVPANISVVRSLQNSLARRTAMTAGKRRELHALEENLAI 197 + Q + + + + ++R T +T + + + +++ Sbjct: 821 QTTQASPAPGTDPVLLVFEMTRREFTLDACVEMPHEISRLTCLTHSVKTKRTDCKAVVSV 880 Query: 198 ISNSEPAQLLEEERLRKEIAELRAKIERVPFIDTFDLRYKNYEKRPDPSSQAVMFCLMDV 257 + + + A L + Y + + PS+ + L+D Sbjct: 881 LPGQILGPEGFQLSVSLSKAHLPRMWVEKHPEKDSQV-YFDIDAGSPPSN--EVVLLLDT 937 Query: 258 SGSMDQSTKDMAKRFYILLYLFLSRTYKNVEVVYIR---HHTQAKEVDEHEFFYSQE-TG 313 S SM S + + + L K +++ G Sbjct: 938 SESMRDS-LHTLQEIALRVLKALHPDVKVNIILFGTGQFDPGGLAVTSPLSPQRFTPVGG 996 Query: 314 GTIVSSALKLMDEVVKERYNPAQWNIYAAQASDGDNWADDSPLCHEILAKKLLPVVRYY- 372 T + L+ ++ P++ SDG +P L + + R + Sbjct: 997 STELWRPLR-ALSLL----PPSRGLRNLLLLSDGH---VQNPELTLQLLRDGVQHSRLFP 1048 Query: 373 -SYIEITRRAHQTLWREYEHLQSTFDNFAMQHIR 405 + A++ + R F R Sbjct: 1049 AASGLHGPTANRHMLRALAQAGGGAFEFFDTKSR 1082 >UniRef50_Q8YTA2 Alr2822 protein n=11 Tax=Bacteria RepID=Q8YTA2_ANASP Length = 224 Score = 87.2 bits (214), Expect = 1e-15, Method: Composition-based stats. Identities = 25/171 (14%), Positives = 58/171 (33%), Gaps = 21/171 (12%) Query: 239 YEKRPDPSSQAVMFCLMDVSGSMDQSTKDMAKRFYILLYLFLSRTY---KNVEVVYIRHH 295 E +P + L+D SGSM + + + + L L + + VE+ + Sbjct: 11 VEFAENPEPRCPCVLLLDTSGSMQGAAIEALNQGLLSLKDELMKNSIAARRVEIAIVTFD 70 Query: 296 TQAKEVDEHEFFY------SQETGGTIVSSALKLMDEVVKERY------NPAQWNIYAAQ 343 + + + G T + + + ++V+ER A + + Sbjct: 71 SHINVIQDFVTADQFNPPILTAQGLTSMGAGIHKALDMVQERKSLYRANGVAYYRPWVFM 130 Query: 344 ASDGDNWADDSPLCHEILAK----KLLPVVRYYSYIEITRRAHQTLWREYE 390 +DG+ + L + + ++ V ++S A+ T + Sbjct: 131 ITDGEPQGELDHLVEQAALRLQGDEVNKRVAFFSVGV--ENANMTRLNQIA 179 >UniRef50_A9B607 von Willebrand factor type A n=6 Tax=Chloroflexi (class) RepID=A9B607_HERA2 Length = 550 Score = 87.2 bits (214), Expect = 1e-15, Method: Composition-based stats. Identities = 31/223 (13%), Positives = 59/223 (26%), Gaps = 17/223 (7%) Query: 207 LEEERLRKEIAELRAKIERVPFIDTFDLRYKNYEKRPDPSSQAVMFCLMDVSGSM-DQST 265 L + + D+ + A + ++D SGSM D Sbjct: 334 ALASPLTAQFGVDPNQPRNSLATPPADVIVAAKNAWANNRKPANIMLVVDSSGSMRDDDK 393 Query: 266 KDMAKRFYILLYLFLSRTYKNVEVVYIRHHTQAKEVDEHEFF---------YSQETGGTI 316 D AK + L + + + G T Sbjct: 394 MDQAKLGVEVFLNRLPSKDNVGMIGFSSSPAVLVPLATRSENMANLQMQTQGLVPDGNTS 453 Query: 317 VSSALKLMDEVVKERYNPAQWNIYAAQASDGDNWADDSPLCHEILAKKLLP-VVRYYSYI 375 + A+ L + ++ P + N SDG + A S L + + ++ + Sbjct: 454 LYDAIDLARQELENLKQPDRIN-AIVVLSDGADTA--SQLSIDQMLGNFGESSIQIFPIA 510 Query: 376 EITRRAHQTLWREYEHLQSTFDNFAMQHIRDQDDIYPVFRELF 418 A ++ ++ T D D I+ F Sbjct: 511 YGA-DAETSILQQIADFSRT--ELVQGSTGDIDKIFENLSRYF 550 >UniRef50_C9RU69 von Willebrand factor type A n=2 Tax=Geobacillus RepID=C9RU69_GEOSY Length = 1077 Score = 87.2 bits (214), Expect = 1e-15, Method: Composition-based stats. Identities = 36/302 (11%), Positives = 67/302 (22%), Gaps = 36/302 (11%) Query: 98 SGSGQGQASQDGEGQDEFVFQISKDEYLD------LLFEDLALPNLKQNQQRQLTEYKTH 151 G S +G+G F I E + P + +++ Sbjct: 46 EGWATAPGSGNGQGNSIRYFMIRIQPKEQSGTAVSYRAEAVEAPADRNGEKKYTFSLDMR 105 Query: 152 RAGYTANGVPANISVVRSLQNSLARRTAMTAGKRRELHALEENLAIISNSEPAQLL-EEE 210 +A G + + GK + + + + Sbjct: 106 GKWPSAPGTTVTYQITVDAYR------VLGNGKEDVYFSFPQTPYQYTRQTGVSTAKLDF 159 Query: 211 RLRKEIAELRAKIERVPFIDTFDLRYKNYEKRPDPSSQAVMFCLMDVSGSMDQSTKDMAK 270 L E + +MDVSGSM AK Sbjct: 160 SLSFSQPEYAKPPNGDAQGRLDVTLVPQGAVSGIIRPPIDVVFVMDVSGSMTAMKLQSAK 219 Query: 271 RFYILLYL----FLSRTYKNVEVVYI--------------RHHTQAKEVDEHEFFYSQET 312 ++ + + + + + + Sbjct: 220 SALQAAVNYFKSNYNQNDRFALIPFSDGVREASVVPFGKYSNVASQLDAILNTGNSLTAG 279 Query: 313 GGTIVSSALKLMDEVVKERYNPAQWNIYAAQASDGDNWADDSPLCHE-ILAKKLLPVVRY 371 GGT S+AL L K + Y +DG ++ K+ Sbjct: 280 GGTNYSAALSLA----KSYFTDPTRKKYIIFLTDGMPTVLNAVDTITYREVKQNFWGGYS 335 Query: 372 YS 373 Y+ Sbjct: 336 YT 337 >UniRef50_Q5UWJ9 Calcium-binding protein-like n=1 Tax=Haloarcula marismortui RepID=Q5UWJ9_HALMA Length = 1562 Score = 87.2 bits (214), Expect = 1e-15, Method: Composition-based stats. Identities = 24/176 (13%), Positives = 41/176 (23%), Gaps = 15/176 (8%) Query: 247 SQAVMFCLMDVSGSMDQSTKDMAKRFYILLYLFLSRTYKNVEVVYIRHH------TQAKE 300 + +MD SGSM S K + L + V + T Sbjct: 498 RPVDVTLVMDTSGSMSSSVK-LRNTAGQRFVAGLLDVDRAAVVDFDSSAYVAQDLTSDFG 556 Query: 301 VDEHEFFYSQETGGTIVSSALKLMDEVVKERYNPAQWNIYAAQASDGDNWADDSPLCHEI 360 GGT + S L + N +DG S Sbjct: 557 AANSTLDNLGSGGGTDIGSGLSTANSQFASNSN-DSRAQVMILLTDGRGNGGISEAQTAA 615 Query: 361 LAKKLLPVVRYYSYIEITRRAHQTLWREYEHLQSTFDNFAMQHIRDQDDIYPVFRE 416 Y+ A++ R+ ++ N+ + + Sbjct: 616 -----NQNTTVYTVGF--DNANRDKLRDIANITDGEFNYVTDRSELPNVFSRIAEN 664 >UniRef50_A9AUC9 von Willebrand factor type A n=1 Tax=Herpetosiphon aurantiacus ATCC 23779 RepID=A9AUC9_HERA2 Length = 579 Score = 86.9 bits (213), Expect = 1e-15, Method: Composition-based stats. Identities = 27/221 (12%), Positives = 52/221 (23%), Gaps = 19/221 (8%) Query: 187 ELHALEENLAIISNSEPAQLLEEERLRKEIAELRAKIERVPFIDTFDLRYKNYEKRPDPS 246 + + L R F + + ++ Sbjct: 299 DGVEVNRIDDTRHPEIDMYLSIMRPTGVVTDVPRQ--NVKVFENNNQIEGFSWVNLSRVQ 356 Query: 247 SQAVMFCLMDVSGSMDQST-------KDMAKRFYILLYLFLSRTYKNVEVVYIRHHTQAK 299 + ++D SGSM S D AK + L + + T Sbjct: 357 DPLNIMLVIDTSGSMGPSKEGLTDGGLDAAKIAALDFIDHLPSNANVGLIHFGTLVTVDH 416 Query: 300 EVD------EHEFFYSQETGGTIVSSALKLMDEVVKERYNPAQWNIYAAQASDGDNWADD 353 + + G T + AL + ++ + SDG + A Sbjct: 417 SLTNDIGAVRQSISELKPEGQTAIYDALAISYTQLRRAKG----QTFIVLISDGADTASK 472 Query: 354 SPLCHEILAKKLLPVVRYYSYIEITRRAHQTLWREYEHLQS 394 I+AK + Y + L + + Sbjct: 473 GDNYDSIVAKATKANIPTYIIGLTSPEFDGQLLEDLQRDTK 513 >UniRef50_A7C1J8 von Willebrand factor, type A n=1 Tax=Beggiatoa sp. PS RepID=A7C1J8_9GAMM Length = 478 Score = 86.9 bits (213), Expect = 1e-15, Method: Composition-based stats. Identities = 27/197 (13%), Positives = 47/197 (23%), Gaps = 21/197 (10%) Query: 237 KNYEKRPDPSSQAVMFCLMDVSGSMD-QSTKDMAKRFYILLYLFLS-----RTYKNVEVV 290 K + P + L+D SGSM + + I K V Sbjct: 31 KTTQPPSIPLPPHDVILLIDTSGSMAEGTKLQEVQAAAIQFIQRRHGLTHLANNKIAVVG 90 Query: 291 YIRHH------TQAKEVDEHEFFYSQETGGTIVSSALKLMDEVVKERYNPAQWNIYAAQA 344 + T E + GGT + L+ + Sbjct: 91 FGGRAYLVANLTSDLMNLEQPIQKLRAVGGTPMDRGLQSAMNQL--SAGSDSEQRSILLF 148 Query: 345 SDGDNWADDSPLCHEILAKKLLPVVRYYSYIEITRRAHQTLWREYEHLQSTFDNFAMQHI 404 +DG + L L K + T A L + + Sbjct: 149 TDGKPDNQRTTLNASQLVKNANIQI----VAIATDDADIGLLTQV---TGDAALVFPTSV 201 Query: 405 RDQDDIYPVFRELFHKQ 421 + D + + ++Q Sbjct: 202 GNFDQAFQKAEQAIYEQ 218 >UniRef50_D1HBR9 Whole genome shotgun sequence of line PN40024, scaffold_205.assembly12x (Fragment) n=1 Tax=Vitis vinifera RepID=D1HBR9_VITVI Length = 656 Score = 86.9 bits (213), Expect = 1e-15, Method: Composition-based stats. Identities = 26/125 (20%), Positives = 39/125 (31%), Gaps = 11/125 (8%) Query: 243 PDPSSQAVMFCLMDVSGSMDQSTKDMAKRFYILLYLFLSRTYKNVEVVYIRHH------- 295 P + + ++DV G M + M KR L+ LS T + V + Sbjct: 278 PARRAPIDLVTVLDVGGGMTGAKLQMMKRAMRLVISSLSSTDRLSIVAFSASSKRLMPLK 337 Query: 296 ---TQAKEVDEHEFFYSQETGGTIVSSALKLMDEVVKERYNPAQWNIYAAQASDGDNWAD 352 T + GT ALK +V+++R SDG N Sbjct: 338 RMTTTGRRSARRIIESLIAGQGTSAGEALKKASKVLEDRRERNPVAS-IMLLSDGQNERV 396 Query: 353 DSPLC 357 S Sbjct: 397 SSKST 401 >UniRef50_B8A860 Putative uncharacterized protein n=1 Tax=Oryza sativa Indica Group RepID=B8A860_ORYSI Length = 1128 Score = 86.9 bits (213), Expect = 1e-15, Method: Composition-based stats. Identities = 28/244 (11%), Positives = 56/244 (22%), Gaps = 26/244 (10%) Query: 186 RELHALEENLAIISNSEPAQLLEEERLRKEIAELRAKIERVPFIDTFDLRYKNYEKRPDP 245 R+ + A ++ +L +R Sbjct: 555 RDSIDRAMFHDVFLAPVSALASDKVQLSTFPRVDAIPRRECHPRLPVLVRVAVPATAARR 614 Query: 246 SSQAVMFCLMDVSGSMD----QSTKDMAKRFYILLYLFLSRTYKNVEVVYIRH------- 294 + + L+D+S D+ ++ L+ L + V + Sbjct: 615 -APVDLVTLLDISCGGGGGAPARRLDLLRKAMDLVIGNLGADDRLAIVPFHSSVVDATGL 673 Query: 295 ---HTQAKEVDEHEFFYSQETGGTIVSSALKLMDEVVKERYNPAQWNIY--AAQASDGDN 349 + + V + GGT + AL E+++ R A+ SDGD+ Sbjct: 674 LEMSVEGRGVASRKVQSLAVAGGTKLFPALNAAVEILEARCWEAKRERVGAVVLISDGDD 733 Query: 350 WADDSPLCHEILAKKLLPVVRYYSYIEITRRAHQTLWREYEHLQSTFDNFAMQHIRDQDD 409 A V + + S D Sbjct: 734 ------RTIFREAINPRYPVHAFGF---RGAHDARAVHHVADHTSGVYGVLDDEHDRVTD 784 Query: 410 IYPV 413 + Sbjct: 785 AFAA 788 Score = 82.6 bits (202), Expect = 2e-14, Method: Composition-based stats. Identities = 23/196 (11%), Positives = 51/196 (26%), Gaps = 26/196 (13%) Query: 233 DLRYKNYEKRPDPSSQAVMFCLMDVSGSMDQ---STKDMAKRFYILLYLFLSRTYKNVEV 289 + + + ++DVS + D+ K+ + L + V Sbjct: 37 RVVAPPPAAASSERAPIDLVAVLDVSCCGGLGPVNRMDLLKKAMGFVIDKLGEHDRLAVV 96 Query: 290 VYIRHHTQA-------------KEVDEHEFFYSQETGGTIVSSALKLMDEVVKERYNPAQ 336 A KE TG +S+ALK +++ R + + Sbjct: 97 PVQASAAIAEKHDLVEMNAEGRKEATRMVQSSLTVTGENKLSTALKKAATILEGRKDHDK 156 Query: 337 WNI-YAAQASDGDNWADDSPLCHEILAKKLLPVVRYYSYIEITRRAHQTLWREYEHLQST 395 + SDGD+ + L + +++ + + + Sbjct: 157 KRPGFIVLISDGDDASV--------LNDAMNLNCSVHAFGF-RDAHNARAMHRIANTSAG 207 Query: 396 FDNFAMQHIRDQDDIY 411 D + Sbjct: 208 TYGILNDGHDGLADAF 223 >UniRef50_A0YCE1 BatB protein, putative n=7 Tax=Proteobacteria RepID=A0YCE1_9GAMM Length = 354 Score = 86.9 bits (213), Expect = 1e-15, Method: Composition-based stats. Identities = 23/218 (10%), Positives = 57/218 (26%), Gaps = 45/218 (20%) Query: 240 EKRPDPSSQAVMFCLMDVSGSMDQSTKD----------MAKRFYILLYLFLSRTYKNVEV 289 + PSS + +D+SGSM + K + + + Sbjct: 81 DPINLPSSGRDLLLAVDLSGSMKIEDMEVNGDRVPRIVAVKTVLNEFIQR-RKGDRLGLI 139 Query: 290 VYIRHH----------TQAKEVDEHEFFYSQETGGTIVSSALKLMDEVVKERYNPAQWNI 339 ++ T + T + A+ L + +++R Sbjct: 140 LFGSQAYVQAPLTFDQTTVQRFMREAQIGFAGEENTAIGDAIGLSVKRLRDRPGDRH--- 196 Query: 340 YAAQASDGDNWADD-SPLCHEILAKKLLPVVRYYSYIEI-------------------TR 379 +DG N +P+ +A ++ + Sbjct: 197 VMILLTDGQNNGGKINPIPASKIAANNGIIIYTIGVGADEMVMPGVLGSSFGSRRVNPSA 256 Query: 380 RAHQTLWREYEHLQSTFDNFAMQHIRDQDDIYPVFREL 417 + ++ F ++ ++ + IY + +L Sbjct: 257 DLDEKTLQQVATATGGQY-FRARNPQELEKIYRLLDQL 293 >UniRef50_B3RWX4 Putative uncharacterized protein n=1 Tax=Trichoplax adhaerens RepID=B3RWX4_TRIAD Length = 1007 Score = 86.9 bits (213), Expect = 1e-15, Method: Composition-based stats. Identities = 26/196 (13%), Positives = 45/196 (22%), Gaps = 22/196 (11%) Query: 237 KNYEKRPDPSSQAVMFCLMDVSGSMDQSTKDMAKRFYILLYLFLSRTYKNVEVVYI---- 292 K S+ + L+DVSGSM D+AK L V + Sbjct: 220 KRPWYISSSSTAKDVVILLDVSGSMHGMPLDIAKISIQSLIRTFGENDFLNIVFFNKDIN 279 Query: 293 -----------RHHTQAKEVDEHEFFYSQETGGTIVSSALKLMDEVVKERYNPAQWNI-- 339 + + G S A ++++ + + Sbjct: 280 LSIPCFKDVVQTSESHKYVFGRALAANILDGGIADFSKAYDYAFQMLQRSRSKQEQKRCH 339 Query: 340 -YAAQASDGDNWADDSPLCHEILAKKLLPVVRYYSYIEITRRAHQTLWREYEHLQSTFDN 398 SDG ++ P + +Y T + R Sbjct: 340 QLIMVFSDG---TEERPKAVFDKYNA-DKQISVITYGIGTVTSRFEALRWMACYNKGKFI 395 Query: 399 FAMQHIRDQDDIYPVF 414 D+I Sbjct: 396 RISNVGAIPDNIQKYL 411 >UniRef50_UPI00016C38A3 LPXTG-motif cell wall anchor domain protein n=1 Tax=Gemmata obscuriglobus UQM 2246 RepID=UPI00016C38A3 Length = 874 Score = 86.9 bits (213), Expect = 1e-15, Method: Composition-based stats. Identities = 43/300 (14%), Positives = 75/300 (25%), Gaps = 24/300 (8%) Query: 141 QQRQLTEYKTHRAGYTANGVPANISVVRSLQNSLARRTAMTAGKRRELHALEENL---AI 197 + A G ++V + + T A +R E + Sbjct: 176 ESTSRHADGQPLALLRDPGHRFALNVTFAEAERVTSATHALAVERDRAQLREGEVIPDRD 235 Query: 198 ISNSEPAQLLEEERLRKEIAELRAKIERVPFIDTFDLRYKNYEKRPDPSSQAVMFCLMDV 257 A+ + LR A + ++ + L+D Sbjct: 236 CVLIWRAKAEDRSALRVWTQPDPATGKAYFLALCAPPKF-----ADAKKVPREVILLVDH 290 Query: 258 SGSMDQSTKDMAKRFYILLYLFLSRTYKNVEVVYIRHHTQAKEVDEHE-FFYSQE----- 311 SGSM + + A LS ++ E + Sbjct: 291 SGSMSGAKWEAADWAVERFLAGLSEDDAFSLGLFHSTTKWFGERTRKATPENVRAAVEFL 350 Query: 312 -----TGGTIVSSALKLMDEVVKERYNPAQWNIYAAQASDGDNWADDSPLCHEILAKKLL 366 GGT + AL+ + PA + +D + L L + Sbjct: 351 KLNRDQGGTELGVALEQALARSRSAETPA---RHVLILTDAEVTDAGRILRLADL-ESEK 406 Query: 367 PVVRYYSYIEITRRAHQTLWREYEHLQSTFDNFAMQHIRDQDDIYPVFRELFHKQNATAK 426 P R S + I + L E F + D DD+ E+ +A Sbjct: 407 PNRRRISVLCIDAAPNAALASELAERGGGVSRFLTSNPDD-DDVTTALDEVLADWSAPVL 465 >UniRef50_A1ZDA1 OmpA family protein n=1 Tax=Microscilla marina ATCC 23134 RepID=A1ZDA1_9SPHI Length = 756 Score = 86.9 bits (213), Expect = 1e-15, Method: Composition-based stats. Identities = 33/244 (13%), Positives = 67/244 (27%), Gaps = 32/244 (13%) Query: 192 EENLAIISNSEPAQLLEEERLRKEIAELRAKIERVPFIDTFDLRYKNYEKRPDPSSQAVM 251 ++ ++ + + RK + ++ + + +K E S + Sbjct: 353 GNFISHLAPPYVSYRESRKFFRKIVESVKGRRYAINN-------FKVREIHEMISKPYDI 405 Query: 252 FCLMDVSGSMDQSTKDMAKRFYILLYLFLSRTYKNVEVVYIRH-------HTQAKEVDEH 304 +MD SGSM + L K V + Q + D Sbjct: 406 SLVMDYSGSMAG-NIKKLEEATRKFILTKHPNDKISVVKFDERLETELRLTAQGSKTDCV 464 Query: 305 EFFYS-QETGGTIVSSALKLMDEVVKERYNPAQWNIYAAQASDGDNWAD--------DSP 355 +F + G T + + E +K AQ N +DG+ + Sbjct: 465 KFDGLTRYGGSTALYAGADEGLESLKN----AQNNKVMLLFTDGEENSSLQYFGKRAFRA 520 Query: 356 LCHEILAKKLLPVVRYYSYIEITRRAHQTLWREYEHLQSTFDNFAMQHIRDQDDIYPVFR 415 A++ V +Y + L F ++ + +Y Sbjct: 521 SEVVKKAREKGIRVFTIAYGTG---VNNKTLNALSMLTDGKTYFI-ENPDEIKQVYEELP 576 Query: 416 ELFH 419 +F Sbjct: 577 RIFR 580 >UniRef50_Q99KC8 von Willebrand factor A domain-containing protein 5A n=29 Tax=Euteleostomi RepID=VMA5A_MOUSE Length = 793 Score = 86.9 bits (213), Expect = 1e-15, Method: Composition-based stats. Identities = 30/201 (14%), Positives = 51/201 (25%), Gaps = 29/201 (14%) Query: 236 YKNYEKRPDPSSQAVMFCLMDVSGSMDQS---------TKDMAKRFYILLYLFLSRTYKN 286 Y + + + LMD SGSMD + AK +LL L Sbjct: 267 YPDIPEVEASKACGEFVFLMDRSGSMDSPMSTENNSQLRIEAAKETLLLLLKSLPMGCYF 326 Query: 287 VEVVYIR-----HHTQAKEVDEHEFFY------SQET-GGTIVSSALKLMDEVVKERYNP 334 + K + + GGT + + L + K P Sbjct: 327 NIYGFGSSYEKFFPESVKYTQDTMEDAVKRVKALKANLGGTEILTPL---CNIYKASSIP 383 Query: 335 AQWNIYAAQASDGDNWADDSPLCHEILAKKLLPVVRYYSYIEITRRAHQTLWREYEHLQS 394 + +DG+ K R +S+ + +L + + Sbjct: 384 -GHPLQLFVFTDGE---VSDTFSVIREVKLNSKKHRCFSFGIGQGAS-TSLIKNIARVSG 438 Query: 395 TFDNFAMQHIRDQDDIYPVFR 415 F R Q + Sbjct: 439 GTAVFITGKDRMQTKALGSLK 459 >UniRef50_Q04NS4 BatA n=4 Tax=Leptospira RepID=Q04NS4_LEPBJ Length = 312 Score = 86.9 bits (213), Expect = 1e-15, Method: Composition-based stats. Identities = 28/210 (13%), Positives = 57/210 (27%), Gaps = 25/210 (11%) Query: 235 RYKNYEKRPDPSSQAVMFCLMDVSGSMDQSTKDM-------AKRFYILLYLFLSRTYKNV 287 K P+ + +DVSGSM +S + +K+ ++ + Sbjct: 72 PGKKTTFLPNEKKGVDVMIALDVSGSMSRSRDFLPETRLGVSKKLLRKFIDK-RKSDRLG 130 Query: 288 EVVYIRHH------TQAKEVDEHEFFYSQ----ETGGTIVSSALKLMDEVVKERYNPAQW 337 VV+ T +E + GT + A+ L ++ Sbjct: 131 LVVFAGAAYLQAPLTGDRESLNEILGTIEEETVAEQGTAIGDAIILSTYRLRAS---QAR 187 Query: 338 NIYAAQASDGD-NWADDSPLCHEILAKKLLPVVRYYSYIEITRRA--HQTLWREYEHLQS 394 + +DG N P+ LA+ + + + + + RE Sbjct: 188 SKVIVLITDGVSNTGKIDPVTATDLAEHIGVKIYSVGIGKEDGSYEINFEILRELSASTG 247 Query: 395 TFDNFAMQHIRDQDDIYPVFRELFHKQNAT 424 F + + + L Sbjct: 248 GKF-FRAEDPEEMKAVLTSIDSLEKDPLQA 276 >UniRef50_Q66HV5 Zgc:92481 n=2 Tax=Danio rerio RepID=Q66HV5_DANRE Length = 804 Score = 86.9 bits (213), Expect = 2e-15, Method: Composition-based stats. Identities = 51/422 (12%), Positives = 108/422 (25%), Gaps = 58/422 (13%) Query: 42 SVTDVDSGESVSIPTEDISEPMFHQGRGGLRHRVHPGNDHFVQNDRIERPQGGGGGSGSG 101 S + G+ + D G R + Q+ + R G +G Sbjct: 63 SFSATVGGQEIRGELRDKQTARDEYDDGVSAGRQVFLLEESDQSADVFRLSVGCLPAGGA 122 Query: 102 QGQASQDGEGQDEFVFQISKDEYLDLLFEDLALPNLKQNQQRQLTEYKTHRAGYTANGVP 161 + +++ +E L + LP + + +A+ +P Sbjct: 123 A-------TVSFTYSTELTVEEDGGLRY---CLPAVLNPRYTPAAAGAGVPQVCSASVIP 172 Query: 162 ANISVVRSLQNSLA------------------RRTAMTAGKRRELHALEENLAIISNSEP 203 ++S+ +++S+ + T ++ ++ +P Sbjct: 173 YSLSLSADVRSSVPIARLDSSCALEPLQFLDPQHTHAQVSLSAGHRFDKDVELLLYYVDP 232 Query: 204 AQLLEEERLRKEIAELRAKIERVPFIDTFDLRYKNYEKRPDPSSQAVMFCLMDVSGSMD- 262 Q A + + + + + +S L+D SGSMD Sbjct: 233 HQPSAVLEAGAATAPAGSLMADPLLMLSLYPEFPAAVMSSL-TSHGEFIFLVDQSGSMDC 291 Query: 263 --------QSTKDMAKRFYILLYLFLSRTYKNVEVVYIR-----------HHTQAKEVDE 303 Q + A+ +LL L + + Q + Sbjct: 292 PMHHGEGAQMRIESARDTLLLLLKSLPLGCYFNIYGFGSSFQAFFPQSVLYSEQTLQEAL 351 Query: 304 HEFFYSQET-GGTIVSSALKLMDEVVKERYNPAQWNIYAAQASDGDNWADDSPLCHEILA 362 + GGT + L+ + + +DG+ W + L Sbjct: 352 QRVKLMRADLGGTEILQPLQHIYRQA----CIPEHPRQLFIFTDGEVW---NTRELLDLV 404 Query: 363 KKLLPVVRYYSYIEITRRAHQTLWREYEHLQSTFDNFAMQHIRDQDDIYPVFRELFHKQN 422 + R +S+ + L S F R Q + R Sbjct: 405 RAHSSSHRCFSFGIGEGAS-TALITGMAKEGSGHAQFITGSDRMQPKVMQSLRFALQPAV 463 Query: 423 AT 424 Sbjct: 464 EE 465 >UniRef50_A6G9E8 von Willebrand factor type A domain protein n=1 Tax=Plesiocystis pacifica SIR-1 RepID=A6G9E8_9DELT Length = 532 Score = 86.5 bits (212), Expect = 2e-15, Method: Composition-based stats. Identities = 26/194 (13%), Positives = 48/194 (24%), Gaps = 17/194 (8%) Query: 240 EKRPDPSSQAVMFCLMDVSGSMDQSTKDMAKRFYILLYLFLSRTYKNVEVVYIRHHTQAK 299 + + ++D SGSM + A + L V Y Sbjct: 126 AVARTSAEPLNLAIVIDHSGSMKGQRERNALDAAAGMISRLRDGDTVSVVSYNTKAHTIV 185 Query: 300 EVDEHEFFYS-------------QETGGTIVSSALKLMDEVVKERYNPAQWNIYAAQASD 346 V + + +G T VS ++ + ++ R SD Sbjct: 186 PVTTLDARNRDRVISDLRVGVASRPSGNTCVSCGVEAGLQTLQGRRPGIDR---MLLLSD 242 Query: 347 GD-NWADDSPLCHEILAKKLLPVVRYYSYIEITRRAHQTLWREYEHLQSTFDNFAMQHIR 405 G+ N LA++ S I + ++ L + F+ Sbjct: 243 GEANRGVRDEPGIRRLAREARNRGVSISSIGVDVDYNEVLMSAIAREANGRHYFSETGSN 302 Query: 406 DQDDIYPVFRELFH 419 L Sbjct: 303 LDAIFDQELDSLIQ 316 >UniRef50_Q2JD81 von Willebrand factor, type A n=4 Tax=Frankineae RepID=Q2JD81_FRASC Length = 319 Score = 86.5 bits (212), Expect = 2e-15, Method: Composition-based stats. Identities = 25/207 (12%), Positives = 47/207 (22%), Gaps = 29/207 (14%) Query: 236 YKNYEKRPDPSSQAVMFCLMDVSGSMDQSTKD-----MAKRFYILLYLFLSRTYKNVEVV 290 + P +A + +DVS SM + AK+ L V Sbjct: 74 ARPARAERVPRERATIILAIDVSNSMAATDIAPTRLAAAKQGASAFVDQLPPRINLGLVS 133 Query: 291 YIRHHTQ------AKEVDEHEFFYSQETGGTIVSSAL---KLMDEVVKERYNPAQW---N 338 + T +E Q T V + +R++ Sbjct: 134 FAGSATVLVPASADRESVRAGIRGLQLGPATAVGEGIFASLQAITTAGKRFSDTGQSAPP 193 Query: 339 IYAAQASDGDNWADDSPLCHEILAKKLLPVVRYYSYIEITRR-----------AHQTLWR 387 SDG+ A++ V +Y ++ R Sbjct: 194 AAIVLLSDGETTRGRPNNQAIEAARQARIPVDTIAYGTADGTLDVGGQEVPVPVNEQALR 253 Query: 388 EYEHLQSTFDNFAMQHIRDQDDIYPVF 414 + + + +Y Sbjct: 254 DIAEQTGGSYH-RATSGDELRSVYRGL 279 >UniRef50_Q7MCW9 Uncharacterized protein n=2 Tax=Vibrio vulnificus RepID=Q7MCW9_VIBVY Length = 688 Score = 86.5 bits (212), Expect = 2e-15, Method: Composition-based stats. Identities = 30/233 (12%), Positives = 57/233 (24%), Gaps = 21/233 (9%) Query: 197 IISNSEPAQLLEEERLRKEIAELRAKIERVPFIDTFDLRYKNYEKRPDPSSQAVMFCLMD 256 + +E L + + + + T L + + ++D Sbjct: 257 TLDKDITVYWRLQEGLPGRLEAVSYRDPQQSERGTIKLTFTPGDDLSAIQQGRDWVFVLD 316 Query: 257 VSGSMDQSTKDMAKRFYILLYLFLSRTYKNVEVVY-----------IRHHTQAKEVDEHE 305 SGSM L + +++ I + Sbjct: 317 KSGSMSG-KHATLTEGVKRGLGKLPSGDRFRILMFDNRVQEITNGFIAVNQNNVTQAIET 375 Query: 306 FFYSQETGGTIVSSALKLMDEVVKERYNPAQWNIYAAQASDG-DNWADDSPLCHEILAKK 364 GGT + AL+ + +DG N L ++ Sbjct: 376 INQIATGGGTNLYDALERAVSGLDSDRTTG-----IILVTDGVANVGVTEKKQFLKLMQR 430 Query: 365 LLPVVRYYSYIEITRRAHQTLWREYEHLQSTFDNFAMQHIRDQDDIYPVFREL 417 VR Y++I A+ L + + F I V +L Sbjct: 431 Y--DVRLYTFIMGNS-ANTPLLEPMTQVSNGFATSISNSDDILGHIMNVTSKL 480 >UniRef50_C7NN24 von Willebrand factor type A n=1 Tax=Halorhabdus utahensis DSM 12940 RepID=C7NN24_HALUD Length = 592 Score = 86.5 bits (212), Expect = 2e-15, Method: Composition-based stats. Identities = 35/391 (8%), Positives = 81/391 (20%), Gaps = 47/391 (12%) Query: 70 GLRHRVHPGNDHFVQNDRIERPQGGGGGSGSGQGQASQDGEGQDEFVFQISKDEYLDLLF 129 R + + + + D+ S D + Sbjct: 3 SNRRQYLRLCAAGAGGALAGCAGLFTDPKSTENDGIPGNTDTDDDPETPGSTDTGELIDD 62 Query: 130 EDLALPNLKQNQQRQLTEYKTHRAGYTANGVPANISVVRSL-----QNSLARRTAMTAGK 184 + ++ A G + + ++ R Sbjct: 63 WQYDPQEQAEVANTSSGVLQSSGGNGAAAGGGGSGDLGFAVGGASNVADFRRNVEEEYLP 122 Query: 185 RRELHALEENLAIISNSEPAQLLEEERLRKEIAELRAKIERVPFIDTFDLRY--KNYEKR 242 + +E A + + Sbjct: 123 LPDSLPVEGLFYNYYFDTGGTGECSSLFCPSYATAITADPLGESTGRYFTVGLNSTLDTS 182 Query: 243 PDPSSQAVMFCLMDVSGSM----------------------DQSTKDMAKRFYILLYLFL 280 + + ++D+SGSM +S +AK + L L Sbjct: 183 TFERKRLDVVIVLDISGSMGSQFDQYYYDRFGNRHTVEEGDSRSKMAVAKDALVALTEQL 242 Query: 281 SRTYKNVEVVYIRHHTQAKEVDE-----------HEFFYSQETGGTIVSSALKLMDEVVK 329 + V++ T AK + + H + GGT ++ + +++ Sbjct: 243 HPDDRVGVVLFNNEPTVAKPLRDVETTDMDAIRGHIREDIEAGGGTNIADGMAEAADMLG 302 Query: 330 ERYNPAQW---NIYAAQASDGDNWAD--DSPLCHEILAKKLLPVVRYYSYIEITRRAHQT 384 E + +D D + LA + + Sbjct: 303 EYADSDPTEAETRQIV-ITDAMPNTGQTDDQALQDRLAGYAEDGIHTSFVGVGV-DFNPE 360 Query: 385 LWREYEHLQSTFDNFAMQHIRDQDDIYPVFR 415 L E ++ + + F Sbjct: 361 LVDEITAVRGANYRSVHSAEDFETYLGEEFE 391 >UniRef50_C1XH40 Mg-chelatase subunit ChlD n=1 Tax=Meiothermus ruber DSM 1279 RepID=C1XH40_MEIRU Length = 313 Score = 86.5 bits (212), Expect = 2e-15, Method: Composition-based stats. Identities = 27/206 (13%), Positives = 47/206 (22%), Gaps = 26/206 (12%) Query: 237 KNYEKRPDPSSQAVMFCLMDVSGSMDQSTK-----DMAKRFYILLYLFLSRTYKNVEVVY 291 + P P A + +D+S SM + AK+ L K V + Sbjct: 73 RPQAVVPVPDPTATVIVTIDISLSMRAQDIQPTRFEAAKQEAKNFVRSLPDGIKVGLVSF 132 Query: 292 IRH------HTQAKEVDEHEFFYSQETGGTIVSSALKLMDEVV-KERYNPAQWNIYAAQA 344 + T + + Q T + L + K+ Sbjct: 133 AGYATLEAEPTTDHQRVIDQIELLQMARRTAIGDGLLESLRAIPKDENGKPLGPSTVVLL 192 Query: 345 SDGDNWADDSPLCHEILAKKLLPVVRYYSYIE----ITRR---------AHQTLWREYEH 391 SDG + P+ A+ + VV + R Sbjct: 193 SDGRTNSGVDPMEVAPFARDMGVVVHTIGLGRRSNPGDPDQYWGGYWMQFDEETLRAIAE 252 Query: 392 LQSTFDNFAMQHIRDQDDIYPVFREL 417 +A Y + Sbjct: 253 ATGGQY-YAAGSAEALRQAYRNLGRM 277 >UniRef50_Q7S708 Predicted protein n=1 Tax=Neurospora crassa RepID=Q7S708_NEUCR Length = 766 Score = 86.5 bits (212), Expect = 2e-15, Method: Composition-based stats. Identities = 23/233 (9%), Positives = 53/233 (22%), Gaps = 45/233 (19%) Query: 208 EEERLRKEIAELRAKIERVPFIDTFDLRYKNYEKRPDPS---SQAVMFCLMDVSGSMDQS 264 E + ++ + + ++ PDP+ + +DVSGSM Sbjct: 26 EIRPVAPKLEIHPLPSHTSGLLLR-VIPPRSPPNLPDPNFHHVPCDIVLAIDVSGSMSAD 84 Query: 265 T----------------------KDMAKRFYILLYLFLSRTYKNVEVVYIRHH------- 295 D+ K + L+ + + V + Sbjct: 85 APVPTTASADYTNEQPEHNGLSVLDLVKHAARTIVSTLNSSDRLGIVTFSTEAKVLQPLM 144 Query: 296 ---TQAKEVDEHEFFYSQETGGTIVSSALKLMDEVVKERYNPAQWNIYAAQASDGDNWAD 352 K+ E Q T + + ++ + +DG Sbjct: 145 PMTALNKKKTERNLGGMQPFSATNLWGGIVEGLKLFD---GQSGRMPALMVLTDGMPNHM 201 Query: 353 DSPL---CHEILAKKLLPVVRYYSYIEITRRAHQTLWREYEHLQSTFDNFAMQ 402 + L + + + L + + +F Sbjct: 202 CPAQGYVAKLRAMETLPAAIHTFGFGY---SLRSGLLKSVAEIGGGGYSFIPD 251 >UniRef50_C7P2A9 von Willebrand factor type A n=2 Tax=Halobacteriaceae RepID=C7P2A9_HALMD Length = 393 Score = 86.5 bits (212), Expect = 2e-15, Method: Composition-based stats. Identities = 24/224 (10%), Positives = 48/224 (21%), Gaps = 23/224 (10%) Query: 214 KEIAELRAKIERVPFIDTFDLRYKNYEKRPDPSSQAVMFCLMDVSGSMDQSTKDMAKRFY 273 I + T + + + +D SGSM+ A+ Sbjct: 2 ASIETSVNRPNVPADGTTVTAEIDVEPGEQETDVRRHIALCIDTSGSMEGDNIKRARDGA 61 Query: 274 ILLYLFLSRTYKNVEVVYIRHHTQ----------AKEVDEHEFFYSQETGGTIVSSALKL 323 ++ L+ V + T ++ GGT + + LK Sbjct: 62 AWVFGLLADEDYVSIVAFDTEATVILPATRWSDLDRQTAMDHVEELTAGGGTDMYNGLKA 121 Query: 324 MDEVVKERYNPAQWNIYAAQASDGDNW--ADDSPLCHEILAKKLLPVVRYYSYIEITRRA 381 E + SDG + D ++ Sbjct: 122 AKETLSSSATGPDTVKRLLLLSDGKDNERTPDEFEGLAEAIDDAGIRIQSAGIGT---DY 178 Query: 382 HQTLWREYEHLQSTFDNFAMQHIRDQDDIYPVFRELFHKQNATA 425 ++ R + + F + A Sbjct: 179 NEATIRTLGTAGRGTWTHL--------EAPGDIEDFFGEAVEQA 214 >UniRef50_B3RWW7 Putative uncharacterized protein n=1 Tax=Trichoplax adhaerens RepID=B3RWW7_TRIAD Length = 732 Score = 86.1 bits (211), Expect = 2e-15, Method: Composition-based stats. Identities = 33/309 (10%), Positives = 87/309 (28%), Gaps = 28/309 (9%) Query: 134 LPNLKQNQQRQLTEYKTHRAGYTANGVPANISVVRSLQNSLARRTAMTAGKR-RELHALE 192 +L N +LT + + N + I + ++ + A + + +E Sbjct: 124 YDSLSSNITSELTMTYSTKFKTLINSSLSVIQIPTNVYSGSRSILATIEYTKNLNDYFIE 183 Query: 193 ENLAIISNSEPAQLLEEERLRKEIAELRAKIERVPFIDTFDLRYKNYEKRPDPSSQAVMF 252 + + R K E + R P + Sbjct: 184 NYNNDPNLRNQYFGGNDGVFRTFPGRPWPKDESGIVLYDCRQRGWYILGSDSPK---NVI 240 Query: 253 CLMDVSGSMDQSTKDMAKRFYILLYLFLSRTYKNVEVVY--------------IRHHTQA 298 L+D SGSM +AK L L++ + + I+ + Sbjct: 241 ILIDRSGSMRGMPLAIAKWGTSNLLDTLNQNDFFTILTFNESITPVIDCYTNLIQATDEN 300 Query: 299 KEVDEHEFFYSQETGGTIVSSALKLMDEVVKER-----YNPAQWNIYAAQASDGDNWADD 353 K++ + + G + + + ++++ + A+ +DG A Sbjct: 301 KKLYKTYLEKFTDGGRADFNHSYAMAFDLLQNAKTQSFRHSAKCQEAIVLFTDGA--AQY 358 Query: 354 SPLCHEILAKKLLPVVRYYSYIEITRRAHQTLWREYEHLQSTFDNFAMQHIRDQDDIYPV 413 + + +R +++ + + + + + + D Sbjct: 359 TEKLLAE--RNSEKKIRIITFVVGPQFYDTVPIEKLTCEYNG-FLGKIPSMGEVGDAIRQ 415 Query: 414 FRELFHKQN 422 + E+ ++ Sbjct: 416 YTEVMNRPL 424 >UniRef50_C9YX20 Putative uncharacterized protein n=1 Tax=Streptomyces scabiei 87.22 RepID=C9YX20_STRSW Length = 1239 Score = 86.1 bits (211), Expect = 2e-15, Method: Composition-based stats. Identities = 40/291 (13%), Positives = 78/291 (26%), Gaps = 28/291 (9%) Query: 68 RGGLRHRVHPGNDHFVQNDRIERPQ----GGGGGSGSGQGQASQDGEGQDEFVFQISKDE 123 G R H ++ + GG +G E +E + Sbjct: 885 PQGARRYAHALDELYGTGRGEGSSDLGREGGRSRAGGQDTSFPTAREWAEELDALFGAEV 944 Query: 124 YLDLLFEDLA------LPNLKQNQQRQLTEYKTHRAGYTANGVPANISVVRSLQNSLARR 177 ++L L L R + T ++ +R L L Sbjct: 945 REEVLARAADQGRTDVLAELDPKAVRPSVDLLTSVLSLAGGMPEQQLARLRPLVRRLVDE 1004 Query: 178 TAMTAGKRRELHALEENLAIISNSEPAQLLEEERLRKEIAELRAKIERVPFIDTFDLRYK 237 A R + +L LR +A R + + + Sbjct: 1005 LAKELATRMRPALTGLATPRPTRRPGGRLDLPRTLRANLAHTRRTADGRTVVVPERPVF- 1063 Query: 238 NYEKRPDPSSQAVMFCLMDVSGSMDQSTKDMAKRFYILLYLFLSRTYKNVEVVYIRHHTQ 297 R + + ++DVSGSM+ S A +L + ++ T Sbjct: 1064 --STRSRKEADWRLILVVDVSGSMEASVIWSALTAAVL------GGVPTLSTHFLAFSTD 1115 Query: 298 AKEVDEHEFFYS------QETGGTIVSSALKLMDEVVKERYNPAQWNIYAA 342 ++ + + GGT +++ L +V P++ + Sbjct: 1116 VIDLTDRVDDPLSLLLEVRVGGGTHIAAGLAHARSLV---TVPSRTLVVVV 1163 >UniRef50_C7NQ34 von Willebrand factor type A n=1 Tax=Halorhabdus utahensis DSM 12940 RepID=C7NQ34_HALUD Length = 1100 Score = 86.1 bits (211), Expect = 2e-15, Method: Composition-based stats. Identities = 22/182 (12%), Positives = 44/182 (24%), Gaps = 15/182 (8%) Query: 248 QAVMFCLMDVSGSMDQSTKDMAKRFYILLYLFLSRTYKNVEVVYIRHHTQAKEVDEHE-- 305 + ++D SGSM + AK L + V + T + + Sbjct: 512 PIDLAFVIDESGSMGGARIQDAKASAKRFVGGLYEDDRAALVSFAGGATLGQSLTTDHGA 571 Query: 306 ----FFYSQETGGTIVSSALKLMDEVVKERYNPAQWNIYAAQASDGDNWADDSPLCHEIL 361 GGT + L+ + + N +DG P+ Sbjct: 572 VNASIDQLNAGGGTNTGAGLQKAVDEL--TSNGEGDTQEIILLADGGTGLGPDPVTIAQT 629 Query: 362 AKKLLPVVRYYSYIEITRRAHQTLWREYEHLQSTFDNFAMQHIRDQDDIYPVFRELFHKQ 421 A + + + D ++ VF + + Sbjct: 630 ADEHRITINTIGMGTG---IDAQELTSIADATGGEFY----QVSDSSELPEVFDRVEQNR 682 Query: 422 NA 423 + Sbjct: 683 IS 684 >UniRef50_C8PMB0 BatA protein n=1 Tax=Treponema vincentii ATCC 35580 RepID=C8PMB0_9SPIO Length = 332 Score = 86.1 bits (211), Expect = 2e-15, Method: Composition-based stats. Identities = 25/216 (11%), Positives = 48/216 (22%), Gaps = 41/216 (18%) Query: 235 RYKNYEKRPDPSSQAVMFCLMDVSGSMDQSTK------DMAKRFYILLYLFLSRTYKNVE 288 + + SS + ++D S SM + AKR Sbjct: 76 PVRQTSEAMYSSSGQALMFVIDTSPSMAAQDMGTETRLEAAKRIIKSFAEKY-EGDSLGL 134 Query: 289 VVYIRH------HTQAKEVDEHEFFYSQET---GGTIVSSALKLMDEVVKERYNPAQWNI 339 T + Q GT + L + + Sbjct: 135 TALGSSAAVLIPPTIDRHTFLTRLDQLQVGELGDGTAIGMGLASAVLHLTQYST---LPS 191 Query: 340 YAAQASDGDN-WADDSPLCHEILAKKLLPVVRYYSYI--------------------EIT 378 + +DGDN + P + K + Sbjct: 192 HIILFTDGDNNTGEIHPRAAADIIKHKKIGFYIIGLGKSGYAPVKYIDPIQKKEISGTLN 251 Query: 379 RRAHQTLWREYEHLQSTFDNFAMQHIRDQDDIYPVF 414 ++T ++ + F+ + DI+ F Sbjct: 252 TVFNETELQKIAGYGNGRY-FSAKSPELLTDIFNRF 286 >UniRef50_A6DS47 Putative uncharacterized protein n=1 Tax=Lentisphaera araneosa HTCC2155 RepID=A6DS47_9BACT Length = 333 Score = 86.1 bits (211), Expect = 2e-15, Method: Composition-based stats. Identities = 24/208 (11%), Positives = 58/208 (27%), Gaps = 30/208 (14%) Query: 240 EKRPDPSSQAVMFCLMDVSGSMDQSTK-----------DMAKRFYILLYLFLSRTYKNVE 288 E + + +D+SGSM+ D K + Sbjct: 84 EPLTKTIASRDLLLAVDLSGSMETKDFKNKSGENVTRLDSVKEVLSEFLAE-REGDRVGL 142 Query: 289 VVYIRHH------TQAKEVDEHEFFYSQ---ETGGTIVSSALKLMDEVVKERYNPAQWNI 339 V + T+ E+ + +Q T++ A+ L + + Sbjct: 143 VFFGSAAFIQMPFTEDLEICQELMDEAQVRMAGPQTMLGDAIGLSISIFDQSELED---K 199 Query: 340 YAAQASDGDNWAD-DSPLCHEILAKKLLPVVRYYSYIE----ITRRAHQTLWREYEHLQS 394 +DG++ +P +A+ V+ + + + + R L Sbjct: 200 VLILLTDGNDTGSLVAPEKAAQIARDKGIVIHTVAVGDPAAAGEQALDEATLRSISSLTK 259 Query: 395 TFDNFAMQHIRDQDDIYPVFRELFHKQN 422 + + + IY ++ ++ Sbjct: 260 GKY-YWAGNREELAGIYDEIDKIGVREL 286 >UniRef50_A9A1M2 von Willebrand factor type A n=2 Tax=Thaumarchaeota RepID=A9A1M2_NITMS Length = 316 Score = 86.1 bits (211), Expect = 2e-15, Method: Composition-based stats. Identities = 26/190 (13%), Positives = 50/190 (26%), Gaps = 30/190 (15%) Query: 246 SSQAVMFCLMDVSGSMDQST-----KDMAKRFYILLYLFLSRTYKNVEVVY------IRH 294 + + ++D S SM + D AK L L + + V++ + + Sbjct: 85 ENGINLSIVLDGSESMAATDYEPTRLDAAKNAINNLILKMGPQHNVGVVLFESGATTVSY 144 Query: 295 HTQAKEVDEHEFFYSQET-GGTIVSSALKLMDEVVKERYNPAQWNIYAAQASDG-DNWAD 352 T KE + ++ G T + L L ++ + SDG N Sbjct: 145 LTPDKEKSVNAISSIEQGLGATAIGDGLALGVDMASSIPDKKG---VVILLSDGVHNSGL 201 Query: 353 DSPLCHEILAKKLLPVVRYYSYIEITRRA--------------HQTLWREYEHLQSTFDN 398 +P AK + I + S Sbjct: 202 VTPEEATEYAKINNVQIHTIGLGSIEPVFLRDDIYGEPQYAELDEETLVIIAQQTSGNYY 261 Query: 399 FAMQHIRDQD 408 ++ + Sbjct: 262 KSLDEQTLNE 271 >UniRef50_A0B5M2 von Willebrand factor, type A n=1 Tax=Methanosaeta thermophila PT RepID=A0B5M2_METTP Length = 795 Score = 86.1 bits (211), Expect = 2e-15, Method: Composition-based stats. Identities = 27/180 (15%), Positives = 46/180 (25%), Gaps = 17/180 (9%) Query: 247 SQAVMFCLMDVSGSMD-QSTKDMAKRFYILLYLFLS-RTYKNVEVVYIRHH-----TQAK 299 S + +D SGSM D+ K L + V + T Sbjct: 62 SPVDVVLSIDSSGSMTTSDPGDLRKSAAKEFVTGLDLSMDRVGVVSWNTSAISWPLTNNT 121 Query: 300 EVDEHEFFYSQETGGTIVSSALKLMDEVVKERYNPAQWNIYAAQASDGDNWAD---DSPL 356 + E + G T + + LK +++ E + +DG + P Sbjct: 122 KDIESAIDSTGADGNTCLDTGLKSAIDLLSECSG----SKVIVLLTDGISTDGGHYTPPG 177 Query: 357 CHEILAKKLL-PVVRYYSYIEITRRAHQTLWREYEHLQSTFDNFAMQHIRDQDDIYPVFR 415 + + ++ A E H A IY R Sbjct: 178 VPGSPVDEARSKGILVFTIGLG-PDADARNLTEIAHSTGGEFYSAPDANALAG-IYKRIR 235 >UniRef50_UPI000180D155 PREDICTED: similar to integrin alpha Hr1 n=1 Tax=Ciona intestinalis RepID=UPI000180D155 Length = 1595 Score = 86.1 bits (211), Expect = 3e-15, Method: Composition-based stats. Identities = 27/191 (14%), Positives = 56/191 (29%), Gaps = 25/191 (13%) Query: 246 SSQAVMFCLMDVSGSMDQSTKDMAKRFYILLYLFLSRTY---KNVEVV------------ 290 + + ++D SGS++Q KR+ + + VV Sbjct: 413 RGKIDIVLVVDQSGSVNQCNFQKVKRWLRDIVRSFNLGVTEQDVGVVVYSKKATTSTVVD 472 Query: 291 --YIRHHTQAKEVDEHEFFYSQ----ETGGTIVSSALKLMDEVVKERYNPAQWNIYAAQA 344 + + + + + E G T A KL +E++ + Sbjct: 473 LGFSDYDSDGHTKKQEMTKILKKLAYEGGTTYTGYAFKLANEMLTGNKSRPDAKKMIILL 532 Query: 345 SDGDNWADDSPLCHEILAKKLLPVVRYYSYIEITRRAHQTLWREYEHLQSTFDNFAMQHI 404 +DG A ++ E L V + +QT + + FA+ Sbjct: 533 TDGATTAANTLQLKEELDVSRAANVMILAVGVGK--FNQTELIQIA--GDRKNFFAVTKF 588 Query: 405 RDQDDIYPVFR 415 + + + R Sbjct: 589 SELEKVRDKLR 599 >UniRef50_A7VF89 Putative uncharacterized protein n=1 Tax=Clostridium sp. L2-50 RepID=A7VF89_9CLOT Length = 1391 Score = 86.1 bits (211), Expect = 3e-15, Method: Composition-based stats. Identities = 30/230 (13%), Positives = 69/230 (30%), Gaps = 16/230 (6%) Query: 193 ENLAIISNSEPAQLLEEERLRKEIAELRAKIERVPFIDTFDLRYKNYEKRPDPSSQAVMF 252 + A S + + ++E+A+ K + +++ + + + Sbjct: 494 DTSAYPSIQAYININGTKDSKEELADQFTKEDFTVIDTQYEITDFTLNSGAESEA-VSIG 552 Query: 253 CLMDVSGSMDQSTKDMAKRFYILLYLFLSRTYKNVEVVYIRHH------TQAKEVDEHEF 306 +MD SGSM+ + AK+ ++ K + V Y T ++ Sbjct: 553 IVMDKSGSMEGAAIANAKQAATEAVEHITSE-KMMIVSYDNEAYLEQSLTSRSGTLKNSI 611 Query: 307 FYSQETGGTIVSSALKLMDEVVKERYNPAQWNIYAAQASDGDNWADDSPLCHEILAKKLL 366 + GGT +S+ L L + ++ + SDG + + Sbjct: 612 AAISDGGGTNISAGLNLALDNLEAEKG----SRAVILMSDGQD-GGSEEDMQAATDRAAK 666 Query: 367 PVVRYYSYIEITRRAHQTLWREYEHLQSTFDNFAMQHIRDQDDIYPVFRE 416 + Y+ + + + DIY ++ Sbjct: 667 LGISVYTVGFG--ECDDAYMQAIAEVTGGKF-VKASASTELSDIYLYLQK 713 >UniRef50_B1L6Y8 von Willebrand factor type A n=1 Tax=Candidatus Korarchaeum cryptofilum OPF8 RepID=B1L6Y8_KORCO Length = 328 Score = 85.7 bits (210), Expect = 3e-15, Method: Composition-based stats. Identities = 45/308 (14%), Positives = 87/308 (28%), Gaps = 29/308 (9%) Query: 119 ISKDEYLDLLFEDLALPNLKQNQQRQLTEYKTHRAGYTANGVPANISVVRSLQNSLARRT 178 ++ Y +L +L + + E A V ++ ++ R Sbjct: 41 LALSNYSQILIAELISESGFPERAILSMERDPEAAVKLYRQVRWKLNER--SRSLFKRLI 98 Query: 179 AMTAGKRRELHALEENLAIISNSEPAQLLEEERLRKEIAELRAKIERVPFIDTFDLRYKN 238 A K + ++ S E L K I + K ++ ++RY++ Sbjct: 99 AKIVIKISSGDSRGFSVNSKEYSVAYSPGMEFDLEKTIERMIEKCKK-----VDEMRYED 153 Query: 239 YEKRPDPSSQAVMFCLMDVSGSMDQSTKDMAKRFYILLYLFLSRTYKNVEVVYIRHHTQA 298 + ++D SGSM +A + L + V + Sbjct: 154 IVASDKRKRDKSLIMILDSSGSMTGKKILIAMMIAAIASHKLRSG-RYGVVGFNSTAFVI 212 Query: 299 KEVDEHEFF--------YSQETGGTIVSSALKLMDEVVKERYNPAQWNIYAAQASDGDNW 350 K E++ G T +S LK E+ NP +DG+ Sbjct: 213 KSPAENKDSVKVIEEILDLVPIGYTNISDGLKKGLEISYHLKNPKY-----LLITDGEYN 267 Query: 351 ADDSPLCHEILAKKLLPVVRYYSYIEITRRAHQTLWREYEHLQSTFDNFAMQHIRDQDDI 410 + P K L ++ + L +E + + D I Sbjct: 268 VGEDPRKVARRFKNL---CVIHTRGK-RDSRGSVLCKEIARIGGSKYFVI----DDIKQI 319 Query: 411 YPVFRELF 418 V + + Sbjct: 320 QRVMKSIL 327 >UniRef50_Q2QZN5 Putative uncharacterized protein n=1 Tax=Oryza sativa Japonica Group RepID=Q2QZN5_ORYSJ Length = 553 Score = 85.7 bits (210), Expect = 3e-15, Method: Composition-based stats. Identities = 17/180 (9%), Positives = 41/180 (22%), Gaps = 25/180 (13%) Query: 262 DQSTKDMAKRFYILLYLFLSRTYKNVEVVYIR------------HHTQAKEVDEHEFFYS 309 + D+ K ++ L + V + T+ + + Sbjct: 9 GWTRLDLVKGAMKMVTNKLGAGDRLAIVPFNGKVVAAGATRLMEMTTKGRADANAKVNQL 68 Query: 310 QETGGTIVSSALKLMDEVVKERYNPAQWNI--YAAQASDGDNWADDSPLCHEILAKKLLP 367 + G T ALK ++ R + + SDG + Sbjct: 69 KAGGDTKFLPALKHASGLLDSRPAGDKQYRPGFIFLLSDGQDNGVLDDKL-------GGV 121 Query: 368 VVRYYSYIEITRRAHQTLWREYEHLQSTFDNFAMQHIRDQDDIYPVFRELFHKQNATAKG 427 +++ R + + + + +F +A Sbjct: 122 RYPAHTFGMCQSRCNPKSMVHIATATKGSYHPIDDKLSNVAQAL----AVFLSGITSAVA 177 >UniRef50_C6XTR0 von Willebrand factor type A n=1 Tax=Pedobacter heparinus DSM 2366 RepID=C6XTR0_PEDHD Length = 344 Score = 85.7 bits (210), Expect = 3e-15, Method: Composition-based stats. Identities = 25/216 (11%), Positives = 55/216 (25%), Gaps = 45/216 (20%) Query: 242 RPDPSSQAVMFCLMDVSGSM-----DQSTKDMAKRFYILLYLFLSRTYKNVEVVYIRHH- 295 S + + L+DVS SM + + AKR L L + +++ Sbjct: 83 EEAKRSGSDLMILLDVSNSMLAGDLAPNRLENAKRAISQLIDNLH-NDRIGIIIFAGEAY 141 Query: 296 ---------TQAKEVDEHEFFYSQETGGTIVSSALKLMDEVVKERYNPAQWNIYAAQASD 346 + AK + T GT + +A+ + + + +D Sbjct: 142 VQLPITTDYSAAKLFLNNITTDIVPTQGTAIGAAIDMGMKSFN---FVNGTSKAMILMTD 198 Query: 347 GDNWADDSPLCHEILAKKLLPVVRYYSYIEITRRA------------------------- 381 G+N +D + A + Sbjct: 199 GEN-HEDDAVSAAKRASAKDVAIHVIGVGSEEGAPVPIYKNGKPVSFHTDEAGKTVVSKL 257 Query: 382 HQTLWREYEHLQSTFDNFAMQHIRDQDDIYPVFREL 417 ++ + +E A + + ++ Sbjct: 258 NEQMCKEISEAGDGVYVRASNANSGLNIVMDQVNKM 293 >UniRef50_C3XQQ6 Putative uncharacterized protein n=1 Tax=Branchiostoma floridae RepID=C3XQQ6_BRAFL Length = 655 Score = 85.7 bits (210), Expect = 3e-15, Method: Composition-based stats. Identities = 30/244 (12%), Positives = 69/244 (28%), Gaps = 29/244 (11%) Query: 200 NSEPAQLLEEERLRKEIAELRAKIERVPFIDTFDLRYKNYEKRPDPSSQAVMFCLMDVSG 259 PA+ ++ R + D FD R + ++ SS M L+D SG Sbjct: 255 RMYPARPMQPHRFGSRTLTIADVP---RRFDEFDARMTQWYQQTI-SSPKDMLILLDTSG 310 Query: 260 SMDQSTKDMAKRFYILLYLFLSRTYKN-------------VEVVYIRHHTQAKEVDEHEF 306 S++ + + K L L+ +++ T KEV Sbjct: 311 SVEGRSLSLMKHTTWFLLDRLTEDDYVATGYFNAYAQAVSCLSSFVQATTHNKEVIHKSL 370 Query: 307 FYSQETGGTIVSSALKLMDEV-----VKERYNPAQ--WNIYAAQASDGDNWADDSPLCHE 359 + + L+ ++ +++R+ N + +N + Sbjct: 371 DNLEAADQANYYAGLEYAFKIFNNFEMEDRFENQGAECNKVIVLVT--ENAELYPEAVFQ 428 Query: 360 ILAKKLLPVVRYYSYIEITRRAHQTLWREYEHLQSTFDNFAMQHIRDQDDIYPVFRELFH 419 V E ++ ++ + + + + F ++ Sbjct: 429 KYNPDRNIRVFVIVVGEPIHDW--SVLQKMACDNRGYFSTV-RSDGAAREASGDFGQVLS 485 Query: 420 KQNA 423 + A Sbjct: 486 RPVA 489 >UniRef50_A0BYA6 Chromosome undetermined scaffold_136, whole genome shotgun sequence n=2 Tax=Paramecium tetraurelia RepID=A0BYA6_PARTE Length = 608 Score = 85.7 bits (210), Expect = 3e-15, Method: Composition-based stats. Identities = 27/177 (15%), Positives = 51/177 (28%), Gaps = 23/177 (12%) Query: 261 MDQSTKDMAKRFYILLYLFLSRTYKNVEVVYIRHHTQAKEVDEHE-FFYSQE-------- 311 MD S AK+ IL L +T + + + E QE Sbjct: 1 MDGSRIQKAKQSLILFLKSLPQTSLFNIISFGTQYVSLWEESRQYTQDNLQEAIQHVKDM 60 Query: 312 ---TGGTIVSSALKLMDEVVKERYN-PAQWNIYAAQASDGDNWADDSPLCHEILAKKLLP 367 GGT + + LK +++ Y + +DG++ A D + + Sbjct: 61 QADMGGTNIYNPLK--NKIYNSSYGCSKDTTLNVFLLTDGEDNA-DPIIELVKNNNRAET 117 Query: 368 VVRYYSYIEITRRAHQTLWREYEHLQSTFDNFAMQHIRDQDDIYPVFRELFHKQNAT 424 + E L + + + + + +DI +L Sbjct: 118 RIYTLGIGERCSFY---LIKRVAEVGNGKFHIVGDN----EDINEKVIDLLEDSLTP 167 >UniRef50_A7IDL0 von Willebrand factor type A n=3 Tax=Alphaproteobacteria RepID=A7IDL0_XANP2 Length = 345 Score = 85.7 bits (210), Expect = 3e-15, Method: Composition-based stats. Identities = 35/278 (12%), Positives = 69/278 (24%), Gaps = 41/278 (14%) Query: 174 LARRTAMTAGKRRELHALEENLAIISNSEPAQLLEEERLRKEIAELRAKIERVPFIDTFD 233 L R A +R+ A ++ + R +A L + Sbjct: 18 LVRLLLPRAPERQAGALRLPFFADLARAGLVAAGRPRFSRLRLATLAFIWTLLVIAAARP 77 Query: 234 LRYKNYEKRPDPSSQAVMFCLMDVSGSMDQSTK----------DMAKRFYILLYLFLSRT 283 + P M +D+S SM + KR Sbjct: 78 VYVGTP--VAIPVEGREMMLAVDLSASMSSPDLVQSGVPANRLQVVKRVADDFIAR-RTG 134 Query: 284 YKNVEVVYIRHH------TQAKEVDEH--EFFYSQETG-GTIVSSALKLMDEVVKERYNP 334 + +++ T + V TG T + A+ L + +++R Sbjct: 135 DRIGLILFSTRAYVQAPLTLDRNVVRQLLAEASIGMTGRNTSIGDAIGLAVKTLRDRPAK 194 Query: 335 AQWNIYAAQASDGDNWADD-SPLCHEILAKKLLPVVRYYSYIEI--------------TR 379 +DG N + P+ +A K + + Sbjct: 195 D---RVLILLTDGANTSGVLDPMEAAAIAAKENVRIHTIGVGADSNFTDIQPGMLMNPSG 251 Query: 380 RAHQTLWREYEHLQSTFDNFAMQHIRDQDDIYPVFREL 417 + ++ L F ++ + IY L Sbjct: 252 DLDEEALKKIAGLTGGQY-FRARNDKGLAAIYADIDRL 288 >UniRef50_Q6MJI3 Putative uncharacterized protein batA n=1 Tax=Bdellovibrio bacteriovorus RepID=Q6MJI3_BDEBA Length = 336 Score = 85.7 bits (210), Expect = 3e-15, Method: Composition-based stats. Identities = 23/212 (10%), Positives = 46/212 (21%), Gaps = 47/212 (22%) Query: 248 QAVMFCLMDVSGSMDQSTK------DMAKRFYILLYLFLSRTYKNVEVVYIRHH------ 295 + +DVS SM + AK + + VV+ Sbjct: 87 GIDIVICLDVSDSMLIEDMKPLNRLEAAKETIAKFISA-RTSDRIGLVVFAGESFTMVPP 145 Query: 296 TQAKEVDEHEFFYSQET------GGTIVSSALKLMDEVVKERYNPAQWNIYAAQASDGDN 349 T ++ GT + A+ +K+ + +DG+N Sbjct: 146 TLDYQMILQRVNEISSASSAKIKDGTALGVAMANAAGRLKDS---QARSRVMIFMTDGEN 202 Query: 350 WADD-SPLCHEILAKKLLPVVRYYSYIEIT-----------------------RRAHQTL 385 + P +AK V + ++ L Sbjct: 203 NSGTIDPETGLEIAKGYGIKVYSIGIGKDGPTRIPVYSRDIFGQKVKTYQPFESTVNEDL 262 Query: 386 WREYEHLQSTFDNFAMQHIRDQDDIYPVFREL 417 + ++ L Sbjct: 263 LGRMASDTGGKY-YRATTEGALQKVFSDIDTL 293 >UniRef50_Q3M1S2 von Willebrand factor, type A n=1 Tax=Anabaena variabilis ATCC 29413 RepID=Q3M1S2_ANAVT Length = 592 Score = 85.7 bits (210), Expect = 3e-15, Method: Composition-based stats. Identities = 23/190 (12%), Positives = 45/190 (23%), Gaps = 18/190 (9%) Query: 245 PSSQAVMFCLMDVSGSMDQSTKDMAKRFYILLYLFLS-RTYKNVEVVY------IRHHTQ 297 + + L+D S SM K + K V + T Sbjct: 46 QQTPQAIVLLIDASSSMSDGKLTEVKTAATKFVERRNLTQDKLAVVSFGLDIQTATPLTD 105 Query: 298 AKEVDEHEFFYSQETGGTIVSSALKLMDEVVKERYNPAQWNIYAAQASDGDNWADDSPLC 357 + E E GGT ++ L ++ + + +DG D L Sbjct: 106 NADTLESAIASLSEAGGTPMAQGLDAAIGELQATFL----SRNILLFTDG--VPDSQALA 159 Query: 358 HEILAKKLLPVVRYYSYIEITRRAHQTLWREYEHLQSTFDNFAMQHIRDQDDIYPVFREL 417 + + + L + + D + Sbjct: 160 SLSAQSARSQRINLIAVATGDADTNY-----LAQLTADPSLVFYANSGQFDQAFRNAEAA 214 Query: 418 FHKQNATAKG 427 +KQ ++ Sbjct: 215 IYKQLVESEA 224 >UniRef50_UPI0001A2BB4D UPI0001A2BB4D related cluster n=1 Tax=Danio rerio RepID=UPI0001A2BB4D Length = 805 Score = 85.7 bits (210), Expect = 3e-15, Method: Composition-based stats. Identities = 37/258 (14%), Positives = 67/258 (25%), Gaps = 36/258 (13%) Query: 189 HALEENLAIISNSEPAQLLEEERLRKEIAELRAKIERVPFIDTFDLRYKNYEKRPDPS-- 246 +E L +P+ ++E +A + + L Y + + S Sbjct: 225 RDVELFLYYQDTHQPSAVVEAG-----VATAPSGSLMRNPVVMITL-YPEFPEEVMSSMA 278 Query: 247 SQAVMFCLMDVSGSMDQS---------TKDMAKRFYILLYLFLSRTYKNVEVVYIR---- 293 +Q +MD SGSMD + AK +LL L + Sbjct: 279 TQGEFVFVMDRSGSMDGMMHRGKEAQHRIESAKDTLLLLLKSLPMGCYFNIYGFGSEFES 338 Query: 294 -------HHTQAKEVDEHEFFYSQETG-GTIVSSALKLMDEVVKERYNPAQWNIYAAQAS 345 + + + GT + L + + P ++ + Sbjct: 339 FFPKSVEYSQDTMDQALKRVMEMRADMCGTEILQPLTHIY---SQPCIPEHPSLQLFIFT 395 Query: 346 DGDNWADDSPLCHEILAKKLLPVVRYYSYIEITRRAHQTLWREYEHLQSTFDNFAMQHIR 405 DG+ + L K R +S+ + L S F R Sbjct: 396 DGE---VGNTKEVLDLVKSHAHSHRCFSFGIGEGAS-TALITGLAREGSGHAQFITGRER 451 Query: 406 DQDDIYPVFRELFHKQNA 423 Q R + Sbjct: 452 MQPKAMESLRFALQPAVS 469 >UniRef50_Q2R0C5 von Willebrand factor type A domain containing protein, expressed n=1 Tax=Oryza sativa Japonica Group RepID=Q2R0C5_ORYSJ Length = 605 Score = 85.3 bits (209), Expect = 3e-15, Method: Composition-based stats. Identities = 23/196 (11%), Positives = 51/196 (26%), Gaps = 26/196 (13%) Query: 233 DLRYKNYEKRPDPSSQAVMFCLMDVSGSMDQ---STKDMAKRFYILLYLFLSRTYKNVEV 289 + + + ++DVS + D+ K+ + L + V Sbjct: 37 RVVAPPPAAASSERAPIDLVAVLDVSCCGGLGPVNRMDLLKKAMGFVIDKLGEHDRLAVV 96 Query: 290 VYIRHHTQA-------------KEVDEHEFFYSQETGGTIVSSALKLMDEVVKERYNPAQ 336 A KE TG +S+ALK +++ R + + Sbjct: 97 PVQASAAIAEKHDLVEMNAEGRKEATRMVQSSLTVTGENKLSTALKKAATILEGRKDHDK 156 Query: 337 WNI-YAAQASDGDNWADDSPLCHEILAKKLLPVVRYYSYIEITRRAHQTLWREYEHLQST 395 + SDGD+ + L + +++ + + + Sbjct: 157 KRPGFIVLISDGDDASV--------LNDAMNLNCSVHAFGF-RDAHNARAMHRIANTSAG 207 Query: 396 FDNFAMQHIRDQDDIY 411 D + Sbjct: 208 TYGILNDGHDGLADAF 223 >UniRef50_C4LFG4 von Willebrand factor type A n=1 Tax=Tolumonas auensis DSM 9187 RepID=C4LFG4_TOLAT Length = 316 Score = 85.3 bits (209), Expect = 3e-15, Method: Composition-based stats. Identities = 27/200 (13%), Positives = 49/200 (24%), Gaps = 29/200 (14%) Query: 241 KRPDPSSQAVMFCLMDVSGSMDQSTKD----------MAKRFYILLYLFLSRTYKNVEVV 290 + +D+S SM M K + + + ++ Sbjct: 79 PVVQSFPSRDLLLAVDISQSMQIKDMTINGEAVDRLSMVKSYLQSFIKQ-RQGDRIGIIL 137 Query: 291 YIRHH------TQAKEVDEHEFFYSQ---ETGGTIVSSALKLMDEVVKERYNPAQWNIYA 341 + H TQ + T + A+ L + N Sbjct: 138 FADHAYLMVPFTQDWQAAGLLLDEVNIGLAGKFTAIGEAITLAVKK-TLHEPKPIQNKTL 196 Query: 342 AQASDGDNWADD-SPLCHEILAKKLLPVVRYYSYI------EITRRAHQTLWREYEHLQS 394 SDG + + P LAK + E +T E ++ Sbjct: 197 ILLSDGKDSINTIQPTDAAALAKASGLKIYTIGIGSDSTDAEAESDLDETTLEEIANMTG 256 Query: 395 TFDNFAMQHIRDQDDIYPVF 414 F + +D +IY Sbjct: 257 GQY-FRARSEQDLSEIYQQI 275 >UniRef50_B1GZR4 Aerotolerance-related cytoplasmic membrane protein BatA n=1 Tax=uncultured Termite group 1 bacterium phylotype Rs-D17 RepID=B1GZR4_UNCTG Length = 333 Score = 85.3 bits (209), Expect = 4e-15, Method: Composition-based stats. Identities = 26/218 (11%), Positives = 59/218 (27%), Gaps = 42/218 (19%) Query: 242 RPDPSSQAVMFCLMDVSGSMDQ------STKDMAKRFYILLYLFLSRTYKNVEVVYIRHH 295 + +D S SM + + AK+ + + V++ Sbjct: 83 EHLSDQGIDIIVALDTSTSMRSLDFRSLNRMEAAKKVIRDFMKE-RKYDRIGLVIFSGLA 141 Query: 296 ------TQAKEVDEHEFFYSQETG----GTIVSSALKLMDEVVKERYNPAQWNIYAAQAS 345 T K+ GT + SA+ +K+ + + Sbjct: 142 FTQCPLTTDKDSLAEFINNINIGDTGLDGTAIGSAIMTSVNRLKDSRAK---SRIIILVT 198 Query: 346 DGDNW-ADDSPLCHEILAKKLLPVVRYYSYIEITR--------------------RAHQT 384 DG+N + PL +A+ + + +++ Sbjct: 199 DGNNNMGEIDPLTASKIARSYDIKIYAVGVGSLDGAIYEVDDPFLGKREIKYRKDAINES 258 Query: 385 LWREYEHLQSTFDNFAMQHIRDQDDIYPVFRELFHKQN 422 + +E + S F Q ++ ++I +L Sbjct: 259 VLKEVAYNTSGGY-FRAQDVKSFENIMKQIDKLEKDDI 295 >UniRef50_Q0FYU3 Von Willebrand factor, type A n=1 Tax=Fulvimarina pelagi HTCC2506 RepID=Q0FYU3_9RHIZ Length = 584 Score = 85.3 bits (209), Expect = 4e-15, Method: Composition-based stats. Identities = 45/334 (13%), Positives = 92/334 (27%), Gaps = 32/334 (9%) Query: 48 SGESVSIPTEDISEPMFHQGRGGLRHRVHPGNDHFVQNDRIERPQGGGGGSGSGQGQASQ 107 + E V + + P Q P + D + + S Sbjct: 218 ADEDVLLACRLVLAPRARQAPS---FDEQPPEEDSESEDHPDASNDDTPPQDHDEPDQSD 274 Query: 108 DGEGQDEFVFQISKDEYLDLLFEDLALPNLKQNQQRQLTEYKTHRAGYTANGVPANISVV 167 + + + D+ ED+A+P + + G A Sbjct: 275 SEKSETNQEASSQSGQDTDVQSEDVAIPLDILKRLAAMVRNHHTAGKTARKGATARSGGA 334 Query: 168 RSLQNSLARRTAMTAGKRRELHALEENLAIISNSEPAQLLEEERLRKEIAELRAKIERVP 227 R R AG R + A ++ + P Q +R + R Sbjct: 335 R------GRPLGSRAGDPRHSSL--DISATLTAALPWQRFRRQRFPRLTD-------RPV 379 Query: 228 FIDTFDLRYKNYEKRPDPSSQAVMFCLMDVSGSMDQSTKDMAKRFY-ILLYLFLSRTYKN 286 + D+R + + + + ++ ++D SGS + AK +L R + Sbjct: 380 ILTPSDIRIRRLQAKRETAT----IFVVDASGSAALARLAEAKGAIERILAECYRRRDRV 435 Query: 287 VEVVYIRHH-------TQAKEVDEHEFFYSQETGGTIVSSALKLMDEVVKERYNPAQWNI 339 V + T++ GGT ++S + L ++ ++ + Sbjct: 436 AMVAFRGTSAETLLPETKSLTRARRALAGLSAGGGTPLASGIALAGDLARQCERQERT-P 494 Query: 340 YAAQASDG-DNWADDSPLCHEILAKKLLPVVRYY 372 +DG N D + + Sbjct: 495 LIVFLTDGKANITLDGMAGRSSAREDVNTQATVL 528 >UniRef50_Q1N642 Putative uncharacterized protein n=1 Tax=Bermanella marisrubri RepID=Q1N642_9GAMM Length = 340 Score = 85.3 bits (209), Expect = 4e-15, Method: Composition-based stats. Identities = 24/214 (11%), Positives = 58/214 (27%), Gaps = 42/214 (19%) Query: 240 EKRPDPSSQAVMFCLMDVSGSMDQSTKD----------MAKRFYILLYLFLSRTYKNVEV 289 E + + M +D+S SM + K + + + Sbjct: 79 EPKALQQTDRNMMLAVDISKSMLEEDMQYQGRLVNRLQTVKAVVTDFVEE-RKGDRLGLI 137 Query: 290 VY---------IRHH-TQAKEVDEHEFFYSQETGGTIVSSALKLMDEVVKERYNPAQWNI 339 ++ + + K + + T + A+ L + +++ + N Sbjct: 138 LFGEQAYIQTPLTFDLSTVKRLLDEAVVGL-AGNKTAIGDAIGLGVKRLQDLP---ESNR 193 Query: 340 YAAQASDGDNW-ADDSPLCHEILAKKLLPVVRYYSYI----EITR-----------RAHQ 383 +DG N + PL LA+K + I + Sbjct: 194 VLILLTDGQNTAGEIEPLKAAELAEKAGVKIYAIGIGADEMVIQGFFGPRRVNPSRDLDE 253 Query: 384 TLWREYEHLQSTFDNFAMQHIRDQDDIYPVFREL 417 + +++ + + IY V ++ Sbjct: 254 DTLTAIAENTGGQY-YRARNVNELEQIYDVLNQI 286 >UniRef50_Q2R0C4 Expressed protein n=2 Tax=Oryza sativa Japonica Group RepID=Q2R0C4_ORYSJ Length = 629 Score = 85.3 bits (209), Expect = 4e-15, Method: Composition-based stats. Identities = 27/227 (11%), Positives = 53/227 (23%), Gaps = 26/227 (11%) Query: 203 PAQLLEEERLRKEIAELRAKIERVPFIDTFDLRYKNYEKRPDPSSQAVMFCLMDVSGSMD 262 A ++ +L +R + + L+D+S Sbjct: 73 SALASDKVQLSTFPRVDAIPRRECHPRLPVLVRVAVPATAARR-APVDLVTLLDISCGGG 131 Query: 263 ----QSTKDMAKRFYILLYLFLSRTYKNVEVVYIRH----------HTQAKEVDEHEFFY 308 D+ ++ L+ L + V + + + V + Sbjct: 132 GGAPARRLDLLRKAMDLVIGNLGADDRLAIVPFHSSVVDATGLLEMSVEGRGVASRKVQS 191 Query: 309 SQETGGTIVSSALKLMDEVVKERYNPAQWNIY--AAQASDGDNWADDSPLCHEILAKKLL 366 GGT + AL E+++ R A+ SDGD+ A Sbjct: 192 LAVAGGTKLFPALNAAVEILEARCWEAKRERVGAVVLISDGDD------RTIFREAINPR 245 Query: 367 PVVRYYSYIEITRRAHQTLWREYEHLQSTFDNFAMQHIRDQDDIYPV 413 V + + S D + Sbjct: 246 YPVHAFGF---RGAHDARAVHHVADHTSGVYGVLDDEHDRVTDAFAA 289 >UniRef50_UPI0001C31E2D von Willebrand factor type A n=1 Tax=Conexibacter woesei DSM 14684 RepID=UPI0001C31E2D Length = 319 Score = 85.3 bits (209), Expect = 4e-15, Method: Composition-based stats. Identities = 28/207 (13%), Positives = 55/207 (26%), Gaps = 30/207 (14%) Query: 237 KNYEKRPDPSSQAVMFCLMDVSGSM-----DQSTKDMAKRFYILLYLFLSRTYKNVEVVY 291 + P +A + + DVSGSM + AKR + RT + + Sbjct: 74 RPERTVAVPVERASIALVTDVSGSMLATDVQPNRMIAAKRAARRFVDEVPRTVNLGVISF 133 Query: 292 IRHHTQAKEVDEH------EFFYSQETGGTIVSSALKLMDEVVKERYNPAQWNI--YAAQ 343 T + + +GGT A+ E+++ + Sbjct: 134 NNTATVLQSPTRNRSDVLTAIDRLAVSGGTATGEAIATATEMLRNQPGENGRRPPSAIVL 193 Query: 344 ASDGDNWADDSPLCHEILAKKLLPVVRYYSYIEITRR----------------AHQTLWR 387 SDG + P+ A++L + ++ T Sbjct: 194 ISDGTSTNGRDPIEAAAEARRLRIPIYTVAFGTDQGTITVPGRDGVERTERVPPDPTALA 253 Query: 388 EYEHLQSTFDNFAMQHIRDQDDIYPVF 414 + + F D ++ Sbjct: 254 QIAEMTGGE-TFTADSADRLDTVFERL 279 >UniRef50_A6YIP9 Capillary morphogenesis protein 2B n=14 Tax=Euteleostomi RepID=A6YIP9_DANRE Length = 487 Score = 85.3 bits (209), Expect = 4e-15, Method: Composition-based stats. Identities = 20/176 (11%), Positives = 48/176 (27%), Gaps = 13/176 (7%) Query: 250 VMFCLMDVSGSMDQSTKDMAKRFYILLYLFLSRTYKNVEVVYIRHHTQAKEVDEHEFF-- 307 ++ ++D SGS+ + ++ L F+S + +V+ + Sbjct: 39 DLYFVLDRSGSVSDNWLEIYGFVEQLTNRFVSPKMRVSFIVFSSSAEIILPLTGDRVDID 98 Query: 308 -------YSQETGGTIVSSALKLMDEVVKERYNPAQWNIYAAQASDGDNWADDSPLCHEI 360 + G T + LK E + A+ + +DG + L + Sbjct: 99 SGLQQLSKIRPAGDTYMHEGLKKAIEQM--TSQGARASSIIIALTDGKLEVFMNELAIKE 156 Query: 361 LAKKLLPVVRYYSYIEITRRAHQTLWREYEHLQSTFDNFAMQHIRDQDDIYPVFRE 416 R Y E + ++ + + ++ Sbjct: 157 ADLARQYGARVYCVGVK--DFDANQLTEIADNKDQVFPVVDGFQALKNIVNSILQK 210 >UniRef50_A8DJP2 von Willebrand factor type A n=1 Tax=Candidatus Chloracidobacterium thermophilum RepID=A8DJP2_9BACT Length = 324 Score = 85.3 bits (209), Expect = 4e-15, Method: Composition-based stats. Identities = 31/272 (11%), Positives = 71/272 (26%), Gaps = 24/272 (8%) Query: 169 SLQNSLARRTAMTAGKRRELHALEENLAIISNSEPAQLLEEERLRKEIAELRAKIERVPF 228 +L + + + + + + I S + K + Sbjct: 19 TLLSPKGQERPSKRPPKADRTTTPDEVFTIDTSLVVLDVAVFDQDNRFVGDLRKENFRVY 78 Query: 229 IDTFDLRYKNYEKRPDPSSQAVMFCLMDVSGSMDQSTKDMAKRFYILLYLFLSRTYKNVE 288 + + + + + + P + ++D SGSM + L + Sbjct: 79 DEQVEQQIEYFSRDEAP---VSLGFVVDTSGSMR-PRRAKVIEAVKFLARAAKPGDEFFL 134 Query: 289 VVYIRHHTQAKEVD------EHEFFYSQETGGTIVSSALKLMDEVVKERYNPAQWNIYAA 342 V + A+E E GGT + A++L E + Sbjct: 135 VDFKNKAELAEEFTPRPADIEEAVDNIVWGGGTALLDAIQLSAEYADKE--GKNRRKAIV 192 Query: 343 QASDGDNWAD-DSPLCHEILAKKLLPVVRYYSY--------IEITRRAHQ--TLWREYEH 391 SDGD+ L ++ V + + + L ++ + Sbjct: 193 VFSDGDDRDSYYDRRQLIKLLQEYQVQVYIVGFPDDDDDGGLFGRSTRKRAVQLIKDIAN 252 Query: 392 LQSTFDNFAMQHIRDQDDIYPVFRELFHKQNA 423 F + + + +I Q + Sbjct: 253 ETGGRAFF-PKSVDELPEIVRTINADLRTQYS 283 >UniRef50_D2RSW3 von Willebrand factor type A n=1 Tax=Haloterrigena turkmenica DSM 5511 RepID=D2RSW3_9EURY Length = 1446 Score = 85.3 bits (209), Expect = 4e-15, Method: Composition-based stats. Identities = 27/174 (15%), Positives = 48/174 (27%), Gaps = 12/174 (6%) Query: 249 AVMFCLMDVSGSMDQSTKDMAKRFYILLYLFLSRTYKNVEVVYIRHH------TQAKEVD 302 A + D SGSM S A+ L+ + + V Y T + Sbjct: 534 ADFVFVNDESGSMSGSPTHYAELAGKRFVGALTDSERAGRVGYASGANLDQPLTTDHDAV 593 Query: 303 EHEFFYSQETGGTIVSSALKLMDEVVKERYNPAQWNIYAAQASDGDNWADDSPLCHEILA 362 +GGT + L++ ++E + + SDG + PL A Sbjct: 594 NSSLERLSASGGTNTRAGLRVGLNHLEEEGWENR-SAVMILLSDGK--SGSDPLPVAEDA 650 Query: 363 KKLLPVVRYYSYIEITRRAHQTLWREYEHLQSTFDNFAMQHIRDQDDIYPVFRE 416 + + ++ RE + + D V Sbjct: 651 AEAGVEISTVGLGN---NINENELREIAAITGGDFYHVEREEDLPDTFERVAEN 701 >UniRef50_UPI00004D9B6D UPI00004D9B6D related cluster n=2 Tax=Xenopus (Silurana) tropicalis RepID=UPI00004D9B6D Length = 994 Score = 84.9 bits (208), Expect = 4e-15, Method: Composition-based stats. Identities = 34/246 (13%), Positives = 73/246 (29%), Gaps = 18/246 (7%) Query: 188 LHALEENLAIISNSEPAQLLEEERLRKEIAELRAKIERVPFIDTFDLRYKNY----EKRP 243 E+ L S+ + +K+ +R ++ + + + + Sbjct: 44 PSEYEQFLRGKSDFIRGTKKDPSPEKKQTEIIRKRLIKDICHNPVXMLNFCPDLQXAQTD 103 Query: 244 DPSSQAVMFCLMDVSGSMDQST-KDMAKRFYILLYLFLSRTYKNVEVVYIRHHTQAKEVD 302 +Q L+D SGSM + M ++ L+ + + ++ + +E Sbjct: 104 LRKAQGEFIFLLDRSGSMSGAALFPMVRQEPPLVLTQVQSSDSLYFLLLGCSLSHGQESV 163 Query: 303 EHEFF---YSQET-GGTIVSSALKLMDEVVKERYNPAQWNIYAAQASDGDNWADDSPLCH 358 + GGT + S L + R + +DG A + Sbjct: 164 AIACDSIKRLRADMGGTNILSPLNWIFRQPVCR----GYPRLLFLLTDG---AVSNTGKV 216 Query: 359 EILAKKLLPVVRYYSYIEITRRAHQTLWREYEHLQSTFDNFAMQHIRDQDDIYPVFRELF 418 L + R YS+ A + L + + F + R Q + + Sbjct: 217 IELIRHHSSFTRCYSFGIGQ-NACRRLVQGVASVSKGSAEFLSEGERLQPKVPER-ERVL 274 Query: 419 HKQNAT 424 + N Sbjct: 275 AEPNMG 280 >UniRef50_Q1D0M2 BatA protein n=2 Tax=Cystobacterineae RepID=Q1D0M2_MYXXD Length = 336 Score = 84.9 bits (208), Expect = 5e-15, Method: Composition-based stats. Identities = 27/221 (12%), Positives = 60/221 (27%), Gaps = 40/221 (18%) Query: 233 DLRYKNYEKRPDPSSQAVMFCLMDVSGSMDQ------STKDMAKRFYILLYLFLSRTYKN 286 + ++ R + +D+S SM+ + +AK + Sbjct: 76 RPQVRDSRVRDLSVEGIDIVVALDLSTSMEAGDFRPQNRMHVAKEVLSEFIAN-RVNDRI 134 Query: 287 VEVVYIRHH------TQAKEVDEHEFFYSQET---GGTIVSSALKLMDEVVKERYNPAQW 337 VV+ T V + + GT + AL +++ Sbjct: 135 GLVVFAGAAYTQAPLTLDYGVLKEVVKQLRTRVLEDGTAIGDALATSLNRLRDSEAK--- 191 Query: 338 NIYAAQASDGDNWADD-SPLCHEILAKKLLPVVRYYSYIEITRRA--------------- 381 + +DGDN + SP+ +A+ L + + + Sbjct: 192 SRVVVLITDGDNNSGKISPMDSANMAQALKVPIYTILVGKGGKVPFPQGTDLFGNTVWRD 251 Query: 382 -----HQTLWREYEHLQSTFDNFAMQHIRDQDDIYPVFREL 417 + L ++ A + ++ + V L Sbjct: 252 TEIPINPELMQDIADRTGGEYYRATDPEQLREGLQKVLDSL 292 >UniRef50_B8CM90 Von Willebrand factor type A domain protein n=28 Tax=Gammaproteobacteria RepID=B8CM90_SHEPW Length = 360 Score = 84.9 bits (208), Expect = 5e-15, Method: Composition-based stats. Identities = 29/221 (13%), Positives = 56/221 (25%), Gaps = 45/221 (20%) Query: 240 EKRPDPSSQAVMFCLMDVSGSMDQSTK----------------------------DMAKR 271 E + + ++D+SGSMD D KR Sbjct: 85 EPQTRLQIGRDLMVVVDLSGSMDTKDFTLHVKQQTADGIANSSGTEISDEYISRLDAVKR 144 Query: 272 FYILLYLFLSRTYKNVEVVY---------IRHHTQAKEVDEHEFFYSQETGGTIVSSALK 322 + + +++ + E + T V AL Sbjct: 145 VLHEFAEQ-RQGDRLGLILFGDAAYLQAPFTADLASWLRLLDESRVAMAGQSTHVGDALG 203 Query: 323 LMDEVVKERYNPAQW-NIYAAQASDGDNWADD-SPLCHEILAKKLLPVVRYYSYI----E 376 L +V+ + N +DG++ PL +A K V + Sbjct: 204 LAIKVMSSDEIKSSQKNKVVLLLTDGNDTDSSVPPLEAAKIAAKKGIRVHVIAIGDPQTV 263 Query: 377 ITRRAHQTLWREYEHLQSTFDNFAMQHIRDQDDIYPVFREL 417 + + L F ++ + +Y +L Sbjct: 264 GEQAMDMEVIEGVAALTGGKA-FKAISTQELNKVYQTISKL 303 >UniRef50_D0MZH7 Putative uncharacterized protein n=1 Tax=Phytophthora infestans T30-4 RepID=D0MZH7_PHYIN Length = 1850 Score = 84.9 bits (208), Expect = 5e-15, Method: Composition-based stats. Identities = 55/444 (12%), Positives = 132/444 (29%), Gaps = 59/444 (13%) Query: 8 RLNGKNKSMVNRQRFLRRYKAQIKQSISEAINKRSVTDVDSGESVSIPTEDISEPMFHQG 67 R+ +++ ++R+ L ++ +++E + ++ G V P S P Sbjct: 1438 RVWQLDQASLDREAALW---ERMYGTMAELASDGTLELKVDGSRVGGP----STPKTGLD 1490 Query: 68 RGGLRHRVHPGNDHFVQNDRIERPQGGGGGSGSGQGQASQDGEGQDEFVFQISKDEYLDL 127 + P ND V + GG +G G + V Q+S+++ ++ Sbjct: 1491 TPK-YGKDDPNNDPHVGGNTWAGGTGGSDTAGLGGRGGPYRLDKGH-PVHQVSQEKKDEV 1548 Query: 128 LFEDLALPNLKQNQQRQLTEYKTHRAGYTANGVPANISVVRSLQNSLARRTAMTAGKRRE 187 E + + R + + + ++ + R +A Sbjct: 1549 SAEA-------RAKARAMAQEALAEKLREIDMSEREWETYQTYFKRVERESAQLRAVLAN 1601 Query: 188 LHALEENLAIISNSEPAQLLEEERLRKEIAELRAKIERVPFIDTFDLRYKNYEKRPDPSS 247 L A+ + + + +L + + + + + D ++ Sbjct: 1602 LEAVAQERNWLRHQSSGELDDGKL----VDGVAGERLVFKRRGVRDSPFQAPAGHQQEQE 1657 Query: 248 QAVMFCLMDVSGSM-----DQSTKDMAKRFYILLYLFLSRTYKNVEVVYIRHHTQAKEV- 301 M +MDVSGSM S + +++ + + ++ H + E+ Sbjct: 1658 PKRMVFVMDVSGSMYRFNGQDSRLERMLETSLMIMESFAGFERELDYCIFGHSGDSPEIP 1717 Query: 302 -------DEHEFFYSQE-----------TGGTIVSSALKLMDEVVKERYNPAQWNIYAAQ 343 + Q G A++ + V + Sbjct: 1718 FVEFGAPPKDRKERLQVLQKMVAHTQYCRSGDHTVEAVERGVQRVAALEGGD---RFVFV 1774 Query: 344 ASDG--DNWADDSPLCHEILAKKLLPVVRYYSYIEITRRAHQTLWREY-EHLQSTFDNFA 400 SD + + + L + P VR ++ I A + L + + Sbjct: 1775 VSDANLERYGIEPRYLGRKLLAE--PGVRAHAL-FIASFADEA--ERIRSELPTGRGHVC 1829 Query: 401 MQHIRDQDDIYPVFRELFHKQNAT 424 + D+ +F+++F Sbjct: 1830 LDT----SDLPRMFKQIFTSAFGD 1849 >UniRef50_Q10ZP7 von Willebrand factor, type A n=1 Tax=Trichodesmium erythraeum IMS101 RepID=Q10ZP7_TRIEI Length = 420 Score = 84.9 bits (208), Expect = 5e-15, Method: Composition-based stats. Identities = 27/202 (13%), Positives = 48/202 (23%), Gaps = 22/202 (10%) Query: 233 DLRYKNYEKRPDPSSQAVMFCLMDVSGSMDQSTKDMAKRFYILLYLFLSRTYKNVEVVYI 292 +R + PS M +D S SM AK + + L Y Sbjct: 29 RVRIQPKTDANLPSLPIRMAIALDTSQSMKGEKLQRAKEACLAVVSHLRDPDYLSLAGYS 88 Query: 293 RHHTQAKE----------VDEHEFFYSQETGGTIVSSALKLMDEVVKERYNPAQWNIYA- 341 T E E Q G T + L + ++E P + Sbjct: 89 TRVTPLLESLAGGGAAAGFAEGAIADLQARGVTRI----DLALDWIEESLLPEKSPPLVG 144 Query: 342 AQASDGD--NWADDSPLCHEILAKKLLP----VVRYYSYIEI-TRRAHQTLWREYEHLQS 394 +DG N + K + + + + + Sbjct: 145 VLITDGHATNAGGTPLDDMKPFIVKARNMKSCGIILCAVGLGDAANFNTSFLTDLSDQGG 204 Query: 395 TFDNFAMQHIRDQDDIYPVFRE 416 +A + D+ + Sbjct: 205 GAFIYADTPDKLLSDLQNRLKA 226 >UniRef50_UPI0001925847 PREDICTED: similar to polydom n=1 Tax=Hydra magnipapillata RepID=UPI0001925847 Length = 2514 Score = 84.9 bits (208), Expect = 5e-15, Method: Composition-based stats. Identities = 33/248 (13%), Positives = 79/248 (31%), Gaps = 25/248 (10%) Query: 164 ISVVRSLQNSLARRTAMTAGKRRELHALEENLAIISNSEPAQLLEEERLRKEIAELRAKI 223 +SV+R + ++ ++ + + S++ L+ + + Sbjct: 83 LSVMRRSREAIRKKRPE---SWANNSWILHHDNAPSHTALENRLKMDTFFNFVIIFVCIF 139 Query: 224 ERVPFIDTFDLRYKNYEKRPDPSSQAVMFCLMDVSGSMDQSTKDMAKRFYILLYLFL--- 280 E + ++ N ++ +S+A + L+DVSGS+ + + F L + Sbjct: 140 ELTHSTRDWRVQRLNQFQQQYNNSKADIIFLIDVSGSISDDGFNTEREFVSSLLSKISVQ 199 Query: 281 SRTYKNVEVVYI------------RHHTQAKEVDEHEFFYSQET--GGTIVSSALKLMDE 326 + V + + + K EF + G T ++ AL+ Sbjct: 200 PSAARIAVVTFGRDINKDIDYIDYGYLDKNKCTFNEEFKRVKHRKEGWTNINGALQKAKA 259 Query: 327 VV----KERYNPAQWNIYAAQASDGDNWADDSPLCHEILAKKLLPVVRYYSYIEITRRAH 382 ++ ++++ N A +DG SP + V +S Sbjct: 260 LLDSANEKKFKRHNVNTVAVLLTDGGWNYGGSPYDTATNLRTGFHYVDIFSIGVG-HWLD 318 Query: 383 QTLWREYE 390 + + Sbjct: 319 RKQLKNIA 326 >UniRef50_Q9UKK3 Poly [ADP-ribose] polymerase 4 n=14 Tax=Eutheria RepID=PARP4_HUMAN Length = 1724 Score = 84.9 bits (208), Expect = 5e-15, Method: Composition-based stats. Identities = 30/219 (13%), Positives = 63/219 (28%), Gaps = 23/219 (10%) Query: 213 RKEIAELRAKIERVPFIDTFDLRYKNYEKRPDP--SSQAVMFCLMDVSGSMDQSTKDMAK 270 + + + + L ++ P +S++ + +D S SM+ T AK Sbjct: 837 AAYLPRMWVEKHPEKESEACMLVFQPDLDVDLPDLASESEVIICLDCSSSMEGVTFLQAK 896 Query: 271 RFYILLYLFLSRTYKNVEVVYIR----------HHTQAKEVDEHEFFYSQETGGTIVSSA 320 + + + K + + H T E + G T Sbjct: 897 QIALHALSLVGEKQKVNIIQFGTGYKELFSYPKHITSNTAAAEFIMSATPTMGNTDFWKT 956 Query: 321 LKLMDEVVKERYNPAQWNIYAAQASDGDNWADDSPLCHEILAKKLLPVVRYYSYIEITRR 380 L+ ++ PA+ + SDG L K+ P R ++ Sbjct: 957 LRY-LSLL----YPARGSRNILLVSDGH---LQDESLTLQLVKRSRPHTRLFACGIG-ST 1007 Query: 381 AHQTLWREYEHLQSTFDNFAMQHIRD--QDDIYPVFREL 417 A++ + R + + + + I L Sbjct: 1008 ANRHVLRILSQCGAGVFEYFNAKSKHSWRKQIEDQMTRL 1046 >UniRef50_A6DT52 Putative uncharacterized protein n=1 Tax=Lentisphaera araneosa HTCC2155 RepID=A6DT52_9BACT Length = 348 Score = 84.9 bits (208), Expect = 5e-15, Method: Composition-based stats. Identities = 23/233 (9%), Positives = 52/233 (22%), Gaps = 60/233 (25%) Query: 246 SSQAVMFCLMDVSGSMDQ------------------------STKDMAKRFYILLYLFLS 281 + +D+SGSM AK+ Sbjct: 84 KDSVDIVFSLDISGSMSSYDQPEDLAVNRRVIAEAINNKELHPRLHYAKKSIADFIDK-R 142 Query: 282 RTYKNVEVVYIRH------HTQAKEVDEHEFFYSQE------TGGTIVSSALKLMDEVVK 329 ++ + VV+ T E ++ T +++A+ ++ Sbjct: 143 KSDRLGLVVFGAEAYSVCPPTNDHEYLQNRLKEISTEYLGDYNRQTNITAAISGGLARLR 202 Query: 330 ERYNPAQWNIYAAQASDGDNWADDSPLCHEILAKKLLPVVRYYSYIEI------------ 377 + P +DG + A+ + Y+ Sbjct: 203 KSKAP---KKIIILVTDGSHTANSNLTPRMAAKAAAKSDAVIYTIGVGNEVAWNVENFFG 259 Query: 378 -------TRRAHQTLWREYEHLQSTFDNFAMQHIRDQDDIYPVFRELFHKQNA 423 + L +E F+++ D+ L + Sbjct: 260 SSRLNASNSDFDEELLKEIAEKTGGLY-FSVREAEQMKDVLKKIDALEKVELK 311 >UniRef50_A8M5H1 von Willebrand factor type A n=13 Tax=Actinomycetales RepID=A8M5H1_SALAI Length = 319 Score = 84.9 bits (208), Expect = 6e-15, Method: Composition-based stats. Identities = 26/219 (11%), Positives = 53/219 (24%), Gaps = 37/219 (16%) Query: 236 YKNYEKRPDPSSQAVMFCLMDVSGSM-----DQSTKDMAKRFYILLYLFLSRTYKNVEVV 290 + + P +A + +DVS SM + AK L + V Sbjct: 73 ARPTAEVRVPRERATVMVAVDVSTSMLAGDVEPDRLTAAKEAARRFVDGLPDEFNVGLVA 132 Query: 291 YIRH------HTQAKEVDEHEFFYSQET----GGTIVSSALKL---MDEVVKERYNPAQW 337 + +E + E GT + A+ + + Sbjct: 133 FAGSAAVLVPPDTDREALDEGIDRLVEGATGVQGTAIGEAINTSLGAVKALDGEAAKDPP 192 Query: 338 NIYAAQASDGDNWADDSPLCHEILAKKLLPVVRYYSYIEITRRAHQ-----------TLW 386 SDG N + P+ A + V ++ + + Sbjct: 193 PARIVLLSDGANTSGMDPMEAATDAVAMDVPVHTIAFGTASGYVDRGGRPIQVPVDGQTL 252 Query: 387 REYEHLQSTFDNFAMQHIRDQDDIYPVFRELFHKQNATA 425 E + + D R ++ ++ Sbjct: 253 DEVARETGGQFH--------EADSAKELRAVYDDIGSSV 283 >UniRef50_A1ZQV4 Von Willebrand factor type A domain protein n=1 Tax=Microscilla marina ATCC 23134 RepID=A1ZQV4_9SPHI Length = 351 Score = 84.9 bits (208), Expect = 6e-15, Method: Composition-based stats. Identities = 30/215 (13%), Positives = 63/215 (29%), Gaps = 37/215 (17%) Query: 244 DPSSQAVMFCLMDVSGSM-----DQSTKDMAKRFYILLYLFLSRTYKNVEVVYIRHH--- 295 S + +D+S SM + + AK + + V++ Sbjct: 106 QTSEGIDILLTLDISESMLIEDFTPNRLEAAKLVAKNFVHG-RKYDRIGLVIFSGEAYSV 164 Query: 296 ---TQAKEVDEHEFFYSQET----GGTIVSSALKLMDEVVKERYNPAQWNIYAAQASDGD 348 T ++ + +E GT + SAL + ++E A + SDGD Sbjct: 165 SPLTTDYKLLKRYIEDIREDMIQENGTAIGSALGMGTIRMQES---ASRSKVVILISDGD 221 Query: 349 NWADD-SPLCHEILAKKLLPVVRYYSYI----------------EITRRAHQTLWREYEH 391 N A + P+ LA + + +++ RE Sbjct: 222 NTAGNLDPITASRLATAHNIKIYTILVGRSGKVPYGRDMFGQPQYVNNTVDESVLREIAK 281 Query: 392 LQSTFDNFAMQHIRDQDDIYPVFRELFHKQNATAK 426 + A + ++ ++ L + + Sbjct: 282 IGEGKFYRASDNQALKN-VFAEINRLEKTEIIENR 315 >UniRef50_C6VVE4 von Willebrand factor type A n=1 Tax=Dyadobacter fermentans DSM 18053 RepID=C6VVE4_DYAFD Length = 320 Score = 84.6 bits (207), Expect = 7e-15, Method: Composition-based stats. Identities = 24/212 (11%), Positives = 52/212 (24%), Gaps = 37/212 (17%) Query: 240 EKRPDPSSQAVMFCLMDVSGSMDQ-----STKDMAKRFYILLYLFLSRTYKNVEVVYIRH 294 +R + +F ++D+S SMD S + K R + +++ Sbjct: 69 AERDIQAKGKDIFMVVDLSKSMDAADVTPSRLEKVKFELNRFIEN-ERANRIGIIIFSND 127 Query: 295 H------TQAKEVDEHEFFYSQET----GGTIVSSALKLMDEVVKERYNPAQWNIYAAQA 344 T E Q GT V A+++ + +P Sbjct: 128 AYIHVPLTYDAAALELFIQSLQTDLLPTNGTNVCGAIEMAYNKLMNSADPTSRAKMMVLF 187 Query: 345 SDGDNWADDSPLCHEILAKKLLPVVRYYSYIE--------------------ITRRAHQT 384 +DG+N + ++ V + + + + Sbjct: 188 TDGENSSSC-TNALFNNLRRFGIGVYSVAVGTKVGISIQENGKPLKDKNDKLVISKLDEN 246 Query: 385 LWREYEHLQSTFDNFAMQHIRDQDDIYPVFRE 416 R + D + + Sbjct: 247 FLRGIANSSRGSYYELNNSKNDIQKLISDINQ 278 >UniRef50_C3ZT39 Putative uncharacterized protein n=1 Tax=Branchiostoma floridae RepID=C3ZT39_BRAFL Length = 1044 Score = 84.6 bits (207), Expect = 7e-15, Method: Composition-based stats. Identities = 18/175 (10%), Positives = 36/175 (20%), Gaps = 20/175 (11%) Query: 252 FCLMDVSGSMDQSTKDMAKRFYILLYLFLSRTYKNVEVVYIRHHTQAKEVD----EHEFF 307 ++D SGSM + ++ L V + T + E Sbjct: 274 VLVLDTSGSMGKKLLFNLRQSLTSHVYNLPIGSSLGIVTFNSEATINAPMTVIGNETTRD 333 Query: 308 YS------QETGGTIVSSALKLMDEVVKERYNPAQWNIYAAQASDGDNWADDSPLCHEIL 361 G T + S L+ ++ SDG Sbjct: 334 ALVGALPMTTGGKTSIGSGLQEALGLLGNDLGR------IILISDGQEDELPHIADVLPA 387 Query: 362 AKKLLPVVRYYSYIEITRRAHQTLWREYEHLQSTFDNFAMQHIRDQDDIYPVFRE 416 + V + + + + + + I Sbjct: 388 LRVAGHTVHTVAIGADG----DPMLEQLSRDTGGKSFYHTRWSTNFPGILRTIEA 438 >UniRef50_A6DT53 BatB protein n=1 Tax=Lentisphaera araneosa HTCC2155 RepID=A6DT53_9BACT Length = 621 Score = 84.6 bits (207), Expect = 7e-15, Method: Composition-based stats. Identities = 25/203 (12%), Positives = 55/203 (27%), Gaps = 44/203 (21%) Query: 233 DLRYKNYEKRPDPSSQAVMFCLMDVSGSMD------QSTKDMAKRFYILLYLFLSRTYKN 286 + K + + SS++ + L+D+S SM+ QS + +K + L + + Sbjct: 77 RPQGKEISQEKESSSRS-ILFLVDISKSMNVRDMNEQSRLEYSKWWAKKLMNDI-PGDRF 134 Query: 287 VEVVYI----------RHHTQAKEVDEHEFFYSQETGGTIVSSALKLMDEVVKERYNPAQ 336 + + GGT +++AL + KE + Sbjct: 135 GLITFSRIANIECPLTSEPDMVLLYLSDLNSSLLPGGGTNIAAALDHAQKQFKEN---ER 191 Query: 337 WNIYAAQASDGDNWADDSPLCHEILAKKLLPVVRYYSYI--------------------- 375 + SDG+ + +K V S Sbjct: 192 DSRVVVLLSDGE-TDGNKWRESLEALQKKKIPVNVISLGDPKREGLVLNEKGHPIRNSKG 250 Query: 376 -EITRRAHQTLWREYEHLQSTFD 397 + + + ++ Sbjct: 251 DYVMSLSDTSTLKQIADETGGTY 273 >UniRef50_B4S8S0 Magnesium chelatase ATPase subunit D n=3 Tax=Chlorobiaceae RepID=B4S8S0_PROA2 Length = 619 Score = 84.6 bits (207), Expect = 7e-15, Method: Composition-based stats. Identities = 38/324 (11%), Positives = 82/324 (25%), Gaps = 43/324 (13%) Query: 108 DGEGQDEFVFQISKDEYLDLLFEDLALPNLKQNQQRQLTEYKTHRAGYTANGVPANISVV 167 + + E + + D +L+ + + + + S Sbjct: 308 NSDPDAEEENEETPDMIEELMMDAIETE----------LPENLMNISLASKKKSKSGSRG 357 Query: 168 RSLQNSLARRTAMTAGKRRELHALEENLAIISNSEPAQLLEEERLRKEIAELRAKIERVP 227 +L N R + + ++ P Q ++ ++ + Sbjct: 358 EALNNRRGRFVRSQ--PGEIRGGKVALIPTLISAAPWQESRRLERLRKTGKVSTTGLIIN 415 Query: 228 FIDTFDLRYKNYEKRPDPSSQAVMFCLMDVSGSMDQSTKDMAKRFYILLY-LFLSRTYKN 286 D ++++ S + ++D SGSM + AK L + Sbjct: 416 KEDVKVKKFRD-------KSGTLFIFIVDASGSMALNRMRQAKGAVSHLLQNAYVHRDQV 468 Query: 287 VEVVYIR-------HHTQAKEVDEHEFFYSQETGGTIVSSALKLMDEVVKERYNPAQWNI 339 + + +Q+ + + E GGT ++SA+ L E K+ I Sbjct: 469 ALISFRGKEAQLLLPPSQSVDRAKRELDVLPTGGGTPLASAIYLAWETAKQARTKGVSQI 528 Query: 340 YAAQASDGD-----------NWADDSPLCHEILAKKLLPVVRYYSYIEITRRAHQTLWR- 387 +DG N E + L V I Sbjct: 529 MFVLITDGRGNIGLQSMMDKNAPKAPKEEIEKEVEALAASVYADGIASIVVDTQMNYLSR 588 Query: 388 ----EYEHLQSTFDNFAMQHIRDQ 407 + + +Q Sbjct: 589 GEAPKLAEKLGGRYFYLPNAKAEQ 612 >UniRef50_B0SI02 BatA n=2 Tax=Leptospira biflexa serovar Patoc RepID=B0SI02_LEPBA Length = 317 Score = 84.6 bits (207), Expect = 7e-15, Method: Composition-based stats. Identities = 28/207 (13%), Positives = 54/207 (26%), Gaps = 23/207 (11%) Query: 235 RYKNYEKRPDPSSQAVMFCLMDVSGSMDQSTKDMAKRFYILLYLFLSR------TYKNVE 288 Y+ PD + + +D+SGSM S + + + L + Sbjct: 77 PGSKYKLSPDSTKGVDIMIALDISGSMVNSYDFLPRNRLSVSKDLLREFVKKRLYDRIGI 136 Query: 289 VVYIRHHTQAKEV--DEHEFFYSQET--------GGTIVSSALKLMDEVVKERYNPAQWN 338 VV+ + D GT V AL L +K + Sbjct: 137 VVFAGAAYLQSPLSSDRFALDELIAGTSSEDIEEQGTAVGDALVLSSYRLKNSEAK---S 193 Query: 339 IYAAQASDGD-NWADDSPLCHEILAKKLLPVVRYYSYIEITRRA--HQTLWREYEHLQST 395 +DG N P K + V + + + ++ + Sbjct: 194 KVIILLTDGVSNTGKLDPDTAAYTTKTMGIKVYCIGIGKEEGQYEINYESLQKISSNTNG 253 Query: 396 FDNFAMQHIRDQDDIYPVFRELFHKQN 422 F + + + +L + Sbjct: 254 KF-FRAESPEVLESVLNEIDQLEVVEL 279 >UniRef50_A2DPQ9 von Willebrand factor type A domain containing protein n=1 Tax=Trichomonas vaginalis RepID=A2DPQ9_TRIVA Length = 694 Score = 84.6 bits (207), Expect = 7e-15, Method: Composition-based stats. Identities = 25/195 (12%), Positives = 53/195 (27%), Gaps = 28/195 (14%) Query: 243 PDPSSQAVMFCLMDVSGSMD-QSTKDMAKRFYILLYLFLSRTYKNVEVVY-IRHHT-QAK 299 + + L+D SGSM + + A + L L K V + ++ Sbjct: 231 NRKTDVKSIVFLLDCSGSMTIDNRIENAIKAMDLFLHSLEPGVKFEIVRFGSTFNSLFDF 290 Query: 300 EVDEHEFFYSQET-----------GGTIVSSALKLMDEVVKERYNPAQWNIYAAQASDGD 348 ++ E+ GGT + + +K + + +DG Sbjct: 291 KLTEYNDDSLNTALAFIKGTSANLGGTEIFNPIKQIYNELS--------PDVLFVLTDG- 341 Query: 349 NWADDSPLCHEILAKKLLPVVRYYSYIEITRRAHQTLWREYEHLQSTFDNFAMQHIRDQD 408 A D+ + + A L R + + +D Sbjct: 342 --AVDNSQAVLDFVRDSSTKIFSLGLGAG---ADMNLVRNLASFTGGVSEHVLDASQLRD 396 Query: 409 DIYPVFRELFHKQNA 423 I + + + + Sbjct: 397 SIIRLLEDSTNPTLS 411 >UniRef50_A2SQ24 von Willebrand factor, type A n=1 Tax=Methanocorpusculum labreanum Z RepID=A2SQ24_METLZ Length = 313 Score = 84.2 bits (206), Expect = 8e-15, Method: Composition-based stats. Identities = 34/202 (16%), Positives = 55/202 (27%), Gaps = 31/202 (15%) Query: 246 SSQAVMFCLMDVSGSMDQSTK-----DMAKRFYILLYLFLSRTYKNVEVVYIR------- 293 S + +DVS SM S + AK +L LS + V++ Sbjct: 84 SENVNLVVALDVSASMSASDYSPTRVEAAKGSSEILIRSLSESDTAGVVIFESGASSAAY 143 Query: 294 -HHTQAKEVDEHEFFYSQETGGTIVSSALKLMDEVVKERYNPAQWNIYAAQASDGD-NWA 351 + + V E + G T + L L ++V PA SDG N Sbjct: 144 LSSDKNRVVSRLEQVSVKT-GKTALGDGLALAVDMVTA--IPAGTY-IVVLLSDGVSNSG 199 Query: 352 DDSPLCHEILAKKLLPVVRYYSYIEITR------------RAHQTLWREYEHLQSTFDNF 399 +P AK VV + + R + F Sbjct: 200 MITPQEAAEYAKNSGVVVYTIGVGSESPVEVSSDGVQQYASLDEETLRSIAEITGGEY-F 258 Query: 400 AMQHIRDQDDIYPVFRELFHKQ 421 + I + ++ Sbjct: 259 RSVDEKTLVQIQNTIQTSIIRE 280 >UniRef50_B3QTN9 Vault protein inter-alpha-trypsin domain protein n=2 Tax=Bacteria RepID=B3QTN9_CHLT3 Length = 837 Score = 84.2 bits (206), Expect = 8e-15, Method: Composition-based stats. Identities = 36/278 (12%), Positives = 70/278 (25%), Gaps = 25/278 (8%) Query: 130 EDLALPNLKQNQQRQLTEYKTHRAGYTANGVPAN-ISVVRSLQNSLARRTAMTAGKRREL 188 + + + E T I + + + A R + L Sbjct: 221 DAGERDKWVKTPYLKEGEAPTSEFDIAVEVSTGVPIDDIACVSHKTAVRLEKKSLAEVSL 280 Query: 189 HALEENLAIISNSEPAQLLEEERLRKEIAELRAKIERVPFIDTFDLRYKNYEKRPDPSSQ 248 E+ +L + + L + E+ F K ++ P Sbjct: 281 DKSEKFGGNRDYILRYRLAGNQ---IQSGLLLFEGEKENFFLATVQPPKRVTEKMIP--N 335 Query: 249 AVMFCLMDVSGSMDQSTKDMAKRFYILLYLFLSRTYKNVEVVYIRHH-----------TQ 297 ++DVSGSM ++K L L T +++ + Sbjct: 336 REYIYIVDVSGSMFGQPIAISKELMKKLLGRLRPTETFNLLLFSGGSKLLSEKSLPATDK 395 Query: 298 AKEVDEHEFFYSQETGGTIVSSALKLMDEVVKERYNPAQWNIYAAQASDGDNWADDSPLC 357 E + GGT + AL + K + +DG + Sbjct: 396 NIEKAFYALENEHGGGGTELLRALNRALGLPK----KEAGSRTFVVITDG---YVSFEVE 448 Query: 358 HEILAKKLLPVVRYYSYIEITRRAHQTLWREYEHLQST 395 +K L ++ ++ L S Sbjct: 449 TFETIRKNLNKANLFAVGIGNG-VNRFLIEGMARAGSG 485 >UniRef50_A9GV55 Putative secreted protein n=1 Tax=Sorangium cellulosum 'So ce 56' RepID=A9GV55_SORC5 Length = 563 Score = 84.2 bits (206), Expect = 8e-15, Method: Composition-based stats. Identities = 18/169 (10%), Positives = 42/169 (24%), Gaps = 19/169 (11%) Query: 240 EKRPDPSSQAVMFCLMDVSGSMDQSTKDMAKRFYILLYLFLSRTYK---------NVEVV 290 ++D SGSM ++AK+ + + L + Sbjct: 26 ALAQVAPLDTNHVVIIDRSGSMYGDRLELAKKAAKIYWNTLVSSNVPASQSFTELLGVAS 85 Query: 291 YIRHHTQAKEVD-------EHEFFYSQETGGTIVSSALKLMDEVVKERYNPAQWNIYAAQ 343 Y + + + G T + + L+ +++ Sbjct: 86 YSDTSSVTYPLTALPASGLDTAVDALVADGSTSIGAGLEEALDMLISESPTKSARECVIL 145 Query: 344 ASDGDNWADDSPLCHEILAKKLLPVVRYYSYIEITRRAHQTLWREYEHL 392 SDG + +P + V + A + + + Sbjct: 146 LSDGQHNTPPAPSDFYADYFSRVDEVHSIALGSG---ADEAMMSDIAAN 191 >UniRef50_C0EZA4 Putative uncharacterized protein n=1 Tax=Eubacterium hallii DSM 3353 RepID=C0EZA4_9FIRM Length = 538 Score = 84.2 bits (206), Expect = 9e-15, Method: Composition-based stats. Identities = 24/203 (11%), Positives = 51/203 (25%), Gaps = 26/203 (12%) Query: 243 PDPSSQAVMFCLMDVSGSMDQSTKDMAKRFYILLYLF-LSRTYKNVEVVYIRHHT----- 296 S + + +D+S SMD D K+ L++ V Y T Sbjct: 219 KVTSKKRDIVLTLDISASMDGIPLDETKKAAAKFVDSILNKNSNIGLVSYSDEATSLSGI 278 Query: 297 -QAKEVDEHEFFYSQETGGTIVSSALKLMDEVVKERYNPAQWNIYAAQASDGDNWADDSP 355 ++ T + L +++ SDG Sbjct: 279 CSNDVFLKNTITSLSSAENTNIEDGLSRAYSMLQ---LGQSKKKLIVLMSDGLPTLGKDG 335 Query: 356 LCHEILAKKLLPV---VRYYSYIEITRRAHQT---LWREYEHLQSTFDNFAMQHIRDQ-- 407 A+K+ + + + T L + ++ + D Sbjct: 336 EELIKYAEKIKDQGVLIYTLGFFQNTEEYKAEGQYLMEKIASEG---CHYEVSSSEDLVF 392 Query: 408 --DDIYPVF---RELFHKQNATA 425 +D+ + ++ K Sbjct: 393 FFEDVAGQIGGQKYIYVKVACPV 415 >UniRef50_Q2SCZ7 Uncharacterized protein containing a von Willebrand factor type A (VWA) domain n=2 Tax=Gammaproteobacteria RepID=Q2SCZ7_HAHCH Length = 345 Score = 84.2 bits (206), Expect = 9e-15, Method: Composition-based stats. Identities = 26/214 (12%), Positives = 56/214 (26%), Gaps = 42/214 (19%) Query: 240 EKRPDPSSQAVMFCLMDVSGSMDQSTKDM----------AKRFYILLYLFLSRTYKNVEV 289 E P + +D+S SM ++ + K + + + + Sbjct: 85 EPIPMDYEARDLLLAVDISPSMQETDLQLKGNQATRLDVVKSVVTDFI-QVRQGDRLGLI 143 Query: 290 VYIRHH----------TQAKEVDEHEFFYSQETGGTIVSSALKLMDEVVKERYNPAQWNI 339 ++ E+ T + A+ L + ++ER + Sbjct: 144 LFGAQPYIQAPLTYDLVTVGELLNEATLGI-AGNATAIGDAIGLGIKRLRERPAD---SR 199 Query: 340 YAAQASDGDNWADD-SPLCHEILAKKLLPVVRYYSYIEI---------------TRRAHQ 383 +DG N + SP LA + + + Sbjct: 200 VLVLLTDGANTGGEVSPEQAAKLAADAGIKIYTVGVGADEIIRRGIFGYRKENPSADLDE 259 Query: 384 TLWREYEHLQSTFDNFAMQHIRDQDDIYPVFREL 417 TL + F ++ + + IY +L Sbjct: 260 TLLQSIADETDGQY-FRARNTGELELIYESINQL 292 >UniRef50_D2UZF5 von Willebrand factor type A domain-containing protein n=1 Tax=Naegleria gruberi RepID=D2UZF5_NAEGR Length = 207 Score = 83.8 bits (205), Expect = 1e-14, Method: Composition-based stats. Identities = 20/175 (11%), Positives = 50/175 (28%), Gaps = 14/175 (8%) Query: 209 EERLRKEIAELRAKIER-VPFIDTFDLRYKNYEKRPDPSSQAVMFCLMDVSGSMDQSTKD 267 + K + ++ Y ++ + SS + ++D SGSM + Sbjct: 6 SVSFSCTFEYDQVKYNQTQNMFGMASIKAPIYVEKENRSS-LDIIAVLDKSGSMS-DKIE 63 Query: 268 MAKRFYILLYLFLSRTYKNVEVVYI----------RHHTQAKEVDEHEFFYSQETGGTIV 317 + K+ + + + + V + K+ + + T + Sbjct: 64 LVKKSLLFMIDQMQARDRLGIVEFDANVSTTLKLTSMDNGGKKQAMNCVNNIKLGTTTNI 123 Query: 318 SSALKLMDEVVKERYNPAQWNIYAAQASDGDNW-ADDSPLCHEILAKKLLPVVRY 371 S A+ +++ R +DG + +KL + Sbjct: 124 SGAIIEAFDILANRGGNISPTTSILLFTDGLPTVGVQQQDKIVNIVEKLYTKLNL 178 >UniRef50_Q96P44 Collagen alpha-1(XXI) chain n=35 Tax=Euteleostomi RepID=COLA1_HUMAN Length = 957 Score = 83.8 bits (205), Expect = 1e-14, Method: Composition-based stats. Identities = 20/189 (10%), Positives = 60/189 (31%), Gaps = 18/189 (9%) Query: 245 PSSQAVMFCLMDVSGSMDQSTKDMAKRFYILLYLFLSRTYKNVEVVYIRHHTQ-AKEVDE 303 ++ + ++D S S+ ++ K++ + + K ++V +++ E+ Sbjct: 32 RTAPTDLVFILDGSYSVGPENFEIVKKWLVNITKNFDIGPKFIQVGVVQYSDYPVLEIPL 91 Query: 304 HEFFY-----------SQETGGTIVSSALKLMDEVVKERYNPAQWNIYAAQASDGDNWAD 352 + G T A++ + + + + A +DG + Sbjct: 92 GSYDSGEHLTAAVESILYLGGNTKTGKAIQFALDYLFAKSSR-FLTKIAVVLTDGK--SQ 148 Query: 353 DSPLCHEILAKKLLPVVRYYSYIEITRRAHQTLWREYEHLQSTFDNFAMQHIRDQDDIYP 412 D A+ + ++ + R + S+ F ++ I Sbjct: 149 DDVKDAAQAARD--SKITLFAIGVGSETEDAE-LRAIANKPSSTYVFYVEDYIAISKIRE 205 Query: 413 VFRELFHKQ 421 V ++ ++ Sbjct: 206 VMKQKLCEE 214 >UniRef50_C4ZKE8 von Willebrand factor type A n=2 Tax=Thauera sp. MZ1T RepID=C4ZKE8_THASP Length = 840 Score = 83.8 bits (205), Expect = 1e-14, Method: Composition-based stats. Identities = 25/202 (12%), Positives = 53/202 (26%), Gaps = 23/202 (11%) Query: 240 EKRPDPSSQAVMFCLMDVSGSMDQSTKDMAKRFYILLYLFLSRTYKNVEVVYIR------ 293 + P + + L+D SGSM + A+R + L + + Sbjct: 258 PRVPAAAHPLAVKILVDCSGSMQGDSIAAARRALQAIIAGLREGERFSLSRFGSTVEHRS 317 Query: 294 -----HHTQAKEVDEHEFFYSQET-GGTIVSSALKLMDEVVKERYNPAQW-----NIYAA 342 ++ + Q GGT + +AL + + + Sbjct: 318 RALWRTSAATRQAGQRWAMQLQADLGGTEMENALASTLALAGDAEPSPGTEEGAAAVDLL 377 Query: 343 QASDGDNWADDSPLCHEILAKKLLPVVRYYSYIEITRRAHQTLWREYEHLQSTFDNFAMQ 402 +DG A D + R + + + R +F Sbjct: 378 LITDGQIHAIDRTVKRARAL-----GNRIFVVGIG-SAPAEGVLRRLADETGGACDFVAP 431 Query: 403 HIRDQDDIYPVFRELFHKQNAT 424 + + +F L ++ Sbjct: 432 GEAVEPAVLRMFARLRSQRMDA 453 >UniRef50_B2W982 von Willebrand domain containing protein n=1 Tax=Pyrenophora tritici-repentis Pt-1C-BFP RepID=B2W982_PYRTR Length = 933 Score = 83.8 bits (205), Expect = 1e-14, Method: Composition-based stats. Identities = 22/196 (11%), Positives = 47/196 (23%), Gaps = 24/196 (12%) Query: 239 YEKRPDPSSQAVMFCLMDVSGSMDQSTKDMAKRFYILLYLFLSRTYKNVEVVYIRHHTQA 298 K + + + ++D SGSM + + + H+ Sbjct: 282 VPKFNLKTEKPEIIFIVDRSGSMSHQ-IPTLVSALKVFLKSIPVGCMFNICSFGSSHSCL 340 Query: 299 K--------EVDEHEFFYSQE----TGGTIVSSALKLMDEVVKERYNPAQWNIYAAQASD 346 E E GGT +A++ ++ + +D Sbjct: 341 WSESKGYNQETLEEAINCVDAFQADMGGTETLAAVQSCFKM-----RNKHCSTEMVLLTD 395 Query: 347 GDNWADDSPLCHEILA--KKLLPVVRYYSYIEITRRAHQTLWREYEHLQSTFDNFAMQHI 404 G+ WA + VR + + L F ++ Sbjct: 396 GNIWA---QQQLFNYIIEETKSRDVRVFPIGIG-GQVSSALIEGVARAGGGFAEMVAENE 451 Query: 405 RDQDDIYPVFRELFHK 420 + I + + Sbjct: 452 KLDRKIIRILKGSLTP 467 >UniRef50_B1X316 Putative uncharacterized protein n=1 Tax=Cyanothece sp. ATCC 51142 RepID=B1X316_CYAA5 Length = 547 Score = 83.8 bits (205), Expect = 1e-14, Method: Composition-based stats. Identities = 30/252 (11%), Positives = 68/252 (26%), Gaps = 34/252 (13%) Query: 189 HALEENLAIISNSEPAQLLEEERLRKEIAELRAKI-ERVPFIDTFDLRYKNYEKRPDPSS 247 +E + P + + ++L + K + + L + R Sbjct: 303 QLIEGVFNQLEAENPRKFQQIKQLNPPLPNTVTKEVTPIAEVHRQLLDNFHPSVRQKR-- 360 Query: 248 QAVMFCLMDVSGSMDQSTKDMAKRFYILLYL--------FLSRTYKNVEVVYIRHHTQAK 299 + ++D SGSM + + L S + ++Y R+ Sbjct: 361 --WIIGIIDASGSMRGQGYEQLLAAFSELLEPQKAKDNFLYSPDDRFSLIIYQRNDAYQI 418 Query: 300 EVDEHEFFYS-------------QETGGTIVSSALKLMDEVVKERYNPAQWNIYAAQASD 346 ++ + GGT V L + E ++ P+ + + +D Sbjct: 419 PLNSQPGEEISRETLWETLQKEVKPGGGTPVDKGLIMGLETAQK--IPSDYKLEIFLFTD 476 Query: 347 GDNWADDSPL--CHEILAKKLLPVVRYYSYIEITRRAHQTLWREYEHLQSTFDNFAMQHI 404 G +P + P + + ++ + Sbjct: 477 GQFRDPVTPELLNIYQSIQDKNPELTI----VGAGGVNTQQLQQLSAKLDARPIISQNAS 532 Query: 405 RDQDDIYPVFRE 416 D++ FRE Sbjct: 533 ETLDELLKAFRE 544 >UniRef50_C7R9E3 von Willebrand factor type A n=1 Tax=Kangiella koreensis DSM 16069 RepID=C7R9E3_KANKD Length = 958 Score = 83.8 bits (205), Expect = 1e-14, Method: Composition-based stats. Identities = 25/211 (11%), Positives = 47/211 (22%), Gaps = 36/211 (17%) Query: 237 KNYEKRPDPSSQAVMFCLMDVSGSMDQS------TKDMAKRFYILLYL--FLSRTYKNVE 288 + P P + ++D SGSM D AK L S ++ Sbjct: 362 NDIATTPVPVPDQDIMLVIDRSGSMSGDAGTGQSKIDEAKDSASLFVQLVEASAGHRMGL 421 Query: 289 VVYIRHHTQAK-----------------EVDEHEFFYSQETGGTIVSSALKLMDEVVKER 331 V + + + G T + + + Sbjct: 422 VSFSTSASIDEGIGNLNPGKKNQLIGPAPYSGGAVGGLIPDGWTSIGDGIDKA----QSE 477 Query: 332 YNPAQWNIYAAQASDG-DNWADDSPLCHEILAKKLLPVVRYYSYIEITR-RAHQTLWREY 389 +DG N A + R ++ T + L + Sbjct: 478 LTGGANPKTILLLTDGLQNTPP-----MIETATNDIGDTRIHAIGLGTEANLNGGLLSDL 532 Query: 390 EHLQSTFDNFAMQHIRDQDDIYPVFRELFHK 420 A + + F ++F Sbjct: 533 TQSTGGAYTRAGDGLELKKFFALAFGDIFED 563 >UniRef50_D1TTW6 von Willebrand factor type A domain protein n=10 Tax=Enterobacteriaceae RepID=D1TTW6_YERPE Length = 327 Score = 83.8 bits (205), Expect = 1e-14, Method: Composition-based stats. Identities = 23/162 (14%), Positives = 40/162 (24%), Gaps = 19/162 (11%) Query: 247 SQAVMFCLMDVSGSMDQSTKDMAKRFYILLYLFLSRTYKNVE----VVYIRHHTQAK--- 299 + +F ++D S SM ++ L + + I AK Sbjct: 2 RRLPIFFVLDCSESMIGENLKKMNDGLQMIINDLK-KDPHALETAWISVIAFAGVAKTIV 60 Query: 300 ---EVDEHEFFYSQETGGTIVSSALKLMDEVVKE------RYNPAQWNIYAAQASDGDNW 350 EV GGT + +AL+ + + W +DG Sbjct: 61 PLVEVVSFYPPRLPIGGGTSLGAALQELTRQIDTQVRKTTEERKGDWKPVVYLLTDGRPT 120 Query: 351 ADDSPLCHEILAKKLLPVVRYYSYIEITRRAHQTLWREYEHL 392 D V + A + R+ Sbjct: 121 DDT-TAEITRWKTHYARKVNLIAIGLG-PSADLNILRQLTEN 160 >UniRef50_C6Z299 Tellurium resistance protein n=1 Tax=Bacteroides sp. 4_3_47FAA RepID=C6Z299_9BACE Length = 348 Score = 83.8 bits (205), Expect = 1e-14, Method: Composition-based stats. Identities = 27/196 (13%), Positives = 56/196 (28%), Gaps = 27/196 (13%) Query: 247 SQAVMFCLMDVSGSMDQSTKDMAKRFYILLYLFLS------RTYKNVEVVYIRHHTQAKE 300 + ++ L+DVS SM + + ++ L T + + Sbjct: 2 RRLPVYFLVDVSESMVGAPIQQVQDGMRMIVQELRTDPYALETAYISVIAFAGKAKCVSP 61 Query: 301 VDE---HEFFYSQETGGTIVSSALKLMDEVVKERY------NPAQWNIYAAQASDGDNWA 351 + E GGT + +AL+ + + + + W +DG+ Sbjct: 62 LTELYKFYPPTFPIGGGTSLGNALEFLMDDMDKTLVRTTTEQKGDWKPIVFLFTDGNPTD 121 Query: 352 DDSPLCHEILAKKLLPVVRYYSYIEITRRAHQTLWREYEHLQSTFDNFAMQHIRDQDDIY 411 + S K + I + L + + + D+I Sbjct: 122 NPS-NAFTRWNNKYRGKANIVAI-SIGDNVNTQLLGQISDN--------VLRLNKTDEI- 170 Query: 412 PVFRELFHKQNATAKG 427 F+ F A+ K Sbjct: 171 -SFKSFFKWVTASIKA 185 >UniRef50_UPI0001C31EFE hypothetical protein Cwoe_4905 n=1 Tax=Conexibacter woesei DSM 14684 RepID=UPI0001C31EFE Length = 317 Score = 83.8 bits (205), Expect = 1e-14, Method: Composition-based stats. Identities = 27/205 (13%), Positives = 49/205 (23%), Gaps = 28/205 (13%) Query: 237 KNYEKRPDPSSQAVMFCLMDVSGSM-----DQSTKDMAKRFYILLYLFLSRTYKNVEVVY 291 K P +A + + D S SM + AKR L + + Sbjct: 74 KPERTVGVPVEKASVMLVTDHSRSMLAEDVEPDRITAAKRAASRFLDQLPPGIRVGVTTF 133 Query: 292 I------RHHTQAKEVDEHEFFYSQETGGTIVSSALKLMDEVVKE-RYNPAQWNIYAAQA 344 + T ++ GGT AL++ + ++ N + Sbjct: 134 SDVPDGTQTPTYDHDLIRRTIEAQIADGGTATGDALQVALDTLERLEQNGERTPAAMVLL 193 Query: 345 SDGDNWADDSPLCHEILAKKLLPVVRYYSYIEITR---------------RAHQTLWREY 389 SDG P+ A + + + + Sbjct: 194 SDGATTTGRDPVMVARAAGEARIPIYTVALGTRDATVPNPGPTGPPLLPVAPDPETLQAI 253 Query: 390 EHLQSTFDNFAMQHIRDQDDIYPVF 414 F Q ++ IY Sbjct: 254 ADASGGRA-FQAQDDQELSSIYETL 277 >UniRef50_C8W3F9 von Willebrand factor type A n=1 Tax=Desulfotomaculum acetoxidans DSM 771 RepID=C8W3F9_DESAS Length = 219 Score = 83.4 bits (204), Expect = 1e-14, Method: Composition-based stats. Identities = 20/169 (11%), Positives = 50/169 (29%), Gaps = 17/169 (10%) Query: 239 YEKRPDPSSQAVMFCLMDVSGSMDQSTKDMAKRFYILLYLFLSRT------YKNVEVVYI 292 + + S + ++ L+D SGSM + K+ + L + + + Sbjct: 2 FNEVEGLSRRLPVYLLLDRSGSMFGEPIEAVKQGVKYMISELKKEPQAIETAYISVITFG 61 Query: 293 R---HHTQAKEVDEHEFFYSQETGGTIVSSALKLMDEVVKER------YNPAQWNIYAAQ 343 Q E+ + + G T + +AL +++ + Sbjct: 62 SDARQDVQLTELAAFKEPQIEANGTTSLGAALHILNNCFDNEVRKSTPTQKGDYKPLVFI 121 Query: 344 ASDGDNWADDSPLCHEILAKKLLPVVRYYSYIEITRRAHQTLWREYEHL 392 +DG+ D + +K V + + ++ + Sbjct: 122 MTDGEPTD-DWENAAREIKQKSGKVANIVAVGCG-PDVNTDTLKKITDI 168 >UniRef50_Q4S5G0 Chromosome 19 SCAF14731, whole genome shotgun sequence. (Fragment) n=1 Tax=Tetraodon nigroviridis RepID=Q4S5G0_TETNG Length = 993 Score = 83.4 bits (204), Expect = 1e-14, Method: Composition-based stats. Identities = 24/282 (8%), Positives = 61/282 (21%), Gaps = 75/282 (26%) Query: 214 KEIAELRAKIERVPFIDTFDLRYKNYEKRPDPSSQAVMFCLMDVSGSMDQSTKDMA---- 269 + + A +D + Y + P+ + ++D S SM Sbjct: 225 PALRHIPASRPLSQVLDGHFVHYFAPK--DLPAVPKNVVFVIDTSASMLGKKMRQVRAGP 282 Query: 270 ---------------------------------------------KRFYILLYLFLSRTY 284 K + + L Sbjct: 283 LPASRPGTPTFPPKSPRRAERCSCQSGAGTSAKRDYFGPPCGRKTKEALLTILGDLRPAD 342 Query: 285 KNVEVVYIR-------------HHTQAKEVDEHEFFYSQETGGTIVSSALKLMDEVVKER 331 + + + + ++ + GGT + A++ ++++ Sbjct: 343 RFNFISFSSRIRVWQPGRLVPATPSAVRDAKKFVVMLPTSGGGTDIDGAIQTGSSLLRDH 402 Query: 332 Y--NPAQWNIY--AAQASDGDNW-ADDSPLCHEILAKKL-LPVVRYYSYIEITRRAHQTL 385 A N +DG + P A+ ++ L Sbjct: 403 LSGRDAGPNSVSLIIFLTDGQPTVGEVRPGAILGNARAAVRDKFCIFTIGMG-DDVDYRL 461 Query: 386 WREYEHLQSTFDNFAMQHIRDQDDIYPVFRELFHKQNATAKG 427 M+ I ++ D + + + + Sbjct: 462 LERMALDNCG----MMRRIPEEADASSMLKGFYDEIGTPLLS 499 >UniRef50_A8H5T6 von Willebrand factor type A n=5 Tax=Proteobacteria RepID=A8H5T6_SHEPA Length = 336 Score = 83.4 bits (204), Expect = 1e-14, Method: Composition-based stats. Identities = 28/213 (13%), Positives = 57/213 (26%), Gaps = 40/213 (18%) Query: 240 EKRPDPSSQAVMFCLMDVSGSMDQSTK----------DMAKRFYILLYLFLSRTYKNVEV 289 E PS + +D+SGSM + + + + + Sbjct: 75 EAIELPSKGRDLMLSVDLSGSMQIEDMVIDGKVVDRFTLIQHVISDFIER-RKGDRIGLI 133 Query: 290 VYIRHH------TQAKEVDEHEFFYSQ---ETGGTIVSSALKLMDEVVKERYNPAQWNIY 340 ++ H TQ + +Q T + A+ L + + Q N Sbjct: 134 LFADHAYLQSPLTQDRRSVAQYLKEAQIGLVGKQTAIGEAIALGVKRFDKV---EQSNRV 190 Query: 341 AAQASDGDNW-ADDSPLCHEILAKKLLPVVRYYSYI---------EITR------RAHQT 384 +DG N +P +A + + ++ Sbjct: 191 LILLTDGSNNAGAITPEQASQIAAQRGITIYTIGVGADVMERRTLFGKERVNPSMDLDES 250 Query: 385 LWREYEHLQSTFDNFAMQHIRDQDDIYPVFREL 417 +E + F ++ + + IY V L Sbjct: 251 QLQEIAKVTGGQY-FRARNTEELEQIYQVIDTL 282 >UniRef50_C6PWL5 von Willebrand factor type A n=3 Tax=Clostridium RepID=C6PWL5_9CLOT Length = 580 Score = 83.4 bits (204), Expect = 1e-14, Method: Composition-based stats. Identities = 28/197 (14%), Positives = 52/197 (26%), Gaps = 27/197 (13%) Query: 240 EKRPDPSSQAVMFCLMDVSGSMDQSTKDMAK-RFYILLYLFLS-RTYKNVEVVYIRH--- 294 + + SS + ++D SGSM +S + + + + K V Y + Sbjct: 27 AESSNTSSNLDVVFVLDSSGSMKESDPEEIRTEAIKMFLDMSQVQGNKFGLVAYSDNVVR 86 Query: 295 --------HTQAKEVDEHEFFYSQETGGTIVSSALKLMDEVVKERYNPAQWNIYAAQASD 346 KE ++ T + + ++ ++ SD Sbjct: 87 EHNLDTINSNDDKERIKNMALNIPLGQKTDTGAGILEAVNLMNSGHDKN-HKPVIILLSD 145 Query: 347 GDNWADDSPLCHEILAKKLLPVVR--------YYSYIE-ITRRAHQTLWREYEHLQSTFD 397 G N D E K L + Y+ +T E + Sbjct: 146 GKN---DPQRKTEDSLKDLKSSISTCKDKGYPVYTIGLNYDGTVDKTQLEEMSNETKGK- 201 Query: 398 NFAMQHIRDQDDIYPVF 414 N+ D I Sbjct: 202 NYITSTAADLPKILTDI 218 >UniRef50_C3WEJ2 BatB protein n=1 Tax=Fusobacterium mortiferum ATCC 9817 RepID=C3WEJ2_FUSMR Length = 322 Score = 83.4 bits (204), Expect = 1e-14, Method: Composition-based stats. Identities = 27/209 (12%), Positives = 49/209 (23%), Gaps = 48/209 (22%) Query: 237 KNYEKRPDPSSQAVMFCLMDVSGSM-----DQSTKDMAKRFYILLYLFLSRTYKNVEVVY 291 K E ++ L+D S SM + + KR L L + + + Sbjct: 68 KEIEDEEIEVKGMNIYVLIDTSRSMLTEDVYPNRLEAGKRVLTNLIQSLK-GDRVGFIPF 126 Query: 292 IRHH------TQAKEVDEHEFFYSQ----ETGGTIVSSALKLMDEVVKERYNPAQWNIYA 341 T + ++ GGT + AL+L ++ KE N Sbjct: 127 SDSAYIQMPLTDDYNITQNYINAIDTTLISGGGTELYQALELAEKSFKE---IGSENKTV 183 Query: 342 AQASDGDNWADDSPLCHEILAKKLLPVVRYYSYIEITRRA-------------------- 381 SDG D K+ V Sbjct: 184 IVISDG----GDFDKKSLDFVKENKIDVYSIGVGTKEGNVIPEYLNGVKRGFIKDESGSA 239 Query: 382 -----HQTLWREYEHLQSTFDNFAMQHIR 405 + ++ + + + Sbjct: 240 VISKLNSDFLQKISNENNGKYYEVNNLVD 268 >UniRef50_A4BEC4 Putative uncharacterized protein n=1 Tax=Reinekea blandensis MED297 RepID=A4BEC4_9GAMM Length = 322 Score = 83.4 bits (204), Expect = 1e-14, Method: Composition-based stats. Identities = 22/210 (10%), Positives = 48/210 (22%), Gaps = 37/210 (17%) Query: 240 EKRPDPSSQAVMFCLMDVSGSMDQSTK----------DMAKRFYILLYLFLSRTYKNVEV 289 + ++ +D+S SM + + + R V Sbjct: 75 DPLNLDQRGRSLYLAVDLSESMLEQDMIWNQRPVSRYEAMQAVISEFVED-RRGDFIGLV 133 Query: 290 VYIRHHTQAKEVD--EHEFFYS-------QETGGTIVSSALKLMDEVVKERYNPAQWNIY 340 V+ + + T + L L ++E Sbjct: 134 VFGSFADVQAPLTPDLNAIQSLLADLRPGMADSRTAIGDGLALAVRQLRESTTED---RV 190 Query: 341 AAQASDGDNWADD-SPLCHEILAKKLLPVVRYYSYIEITRRA------------HQTLWR 387 SDG+N + + P +A V + R + + R Sbjct: 191 VVLLSDGENNSGEIRPDEATAVAAAENIRVYTIGFGSAGRDSLLQSFGLRSSSLDEQTLR 250 Query: 388 EYEHLQSTFDNFAMQHIRDQDDIYPVFREL 417 E + + +++ L Sbjct: 251 EIAEQTQGRY-YRATSSAELAEVFRDIERL 279 >UniRef50_D1VLF3 von Willebrand factor type A n=1 Tax=Frankia sp. EuI1c RepID=D1VLF3_9ACTO Length = 372 Score = 83.4 bits (204), Expect = 2e-14, Method: Composition-based stats. Identities = 24/240 (10%), Positives = 47/240 (19%), Gaps = 60/240 (25%) Query: 236 YKNYEKRPDPSSQAVMFCLMDVSGSMDQST-----KDMAKRFYILLYLFLSRTYKNVEVV 290 + P S+ + +DVSGSM + A++ + V Sbjct: 73 ARPQATVPITSNSTTIMLALDVSGSMCSTDVPPNRITAAEKAATAFIKAQPAGSRIGLVT 132 Query: 291 YIR------HHTQAKEVDEHEFFYSQETGGTIVSSALKLMDEVVKERYN---PAQWNI-- 339 + T + + GT + + + + + P + Sbjct: 133 FSGIAGLLVPPTTDSQKLLDALQNLTTSRGTAIGQGILTSIDAIADADPSVAPTGSAVSG 192 Query: 340 ---------YAAQASDGDNWADDSPLCHEILAKKLLPVVRYYSYIEITRRA--------- 381 +DG N P A V + T Sbjct: 193 NGTGPYAADVIVVLTDGANTQGVDPQTAAKQAAARRLRVYTIGFGTTTPAPMVCGSSQVG 252 Query: 382 -------------------------HQTLWREYEHLQSTFDNFAMQHIRDQDDIYPVFRE 416 + R+ + Q+ D Sbjct: 253 GFGGFGGFGGFGGGGRLGDRSPLVIDEQALRDVAATTGGTY-YRAQNAGQLQDALGTLPR 311 >UniRef50_Q47ZS8 Von Willebrand factor type A domain protein n=1 Tax=Colwellia psychrerythraea 34H RepID=Q47ZS8_COLP3 Length = 364 Score = 83.4 bits (204), Expect = 2e-14, Method: Composition-based stats. Identities = 17/183 (9%), Positives = 44/183 (24%), Gaps = 19/183 (10%) Query: 256 DVSGSMDQSTKDMAKRFYILLYLFLSRTYKNVEVVY---------IRHHTQAKEVDEHEF 306 D + K + +++ + +E Sbjct: 139 DTGKGEKVNRLVAVKHVLNAFVKS-REHDRLGLILFGDAPYLQAPFTDDIATWQALLNES 197 Query: 307 FYSQETGGTIVSSALKLMDEVVKERYNPAQWNIYAAQASDGDNWAD-DSPLCHEILAKKL 365 T A+ L V ++ N +DG++ A P+ +A Sbjct: 198 DIGMAGQSTAFGDAIGLAISVFQQS---DTQNRVLIVLTDGNDTASKVPPVEAAKVAAAR 254 Query: 366 LPVVRYYSYI----EITRRAHQTLWREYEHLQSTFDNFAMQHIRDQDDIYPVFRELFHKQ 421 + + + + + + +F + + +Y L +Q Sbjct: 255 DIKIYTIAIGDPSAVGEEKVDLEVLQAMAEITQGK-SFQALNSEELLKVYAEIDRLEPQQ 313 Query: 422 NAT 424 + Sbjct: 314 FDS 316 >UniRef50_B4RTV3 von Willebrand factor, type A n=20 Tax=Alteromonadales RepID=B4RTV3_ALTMD Length = 349 Score = 83.0 bits (203), Expect = 2e-14, Method: Composition-based stats. Identities = 28/214 (13%), Positives = 56/214 (26%), Gaps = 42/214 (19%) Query: 240 EKRPDPSSQAVMFCLMDVSGSMDQSTKDM----------AKRFYILLYLFLSRTYKNVEV 289 E P+ M +D+SGSM + K + + Sbjct: 79 EPVSIPNEGREMMLAVDLSGSMKIDDMQLNGRQVNRLTMTKSVVYDFIQR-RVGDRLGLI 137 Query: 290 VYIRHH------TQAKEV----DEHEFFYSQETGGTIVSSALKLMDEVVKERYNPAQWNI 339 ++ T ++ T + A+ L + ER + N Sbjct: 138 LFADTAYVQAPLTYDRDTVSTLLSEAVIGL-VGEQTAIGDAIGLAVKRFDER---DESNN 193 Query: 340 YAAQASDGDNWADD-SPLCHEILAKKLLPVVRYYSYIEI---------------TRRAHQ 383 +DG N A + +P + LA V ++ + Sbjct: 194 VLILLTDGQNTAGNITPEQAKELAINKGVKVYTIGVGADKMLIQSFFGSREINPSQELDE 253 Query: 384 TLWREYEHLQSTFDNFAMQHIRDQDDIYPVFREL 417 + + F ++ ++ + IY L Sbjct: 254 GMLTDIATSTGGQY-FRARNAQELEAIYQQLDAL 286 >UniRef50_D1IDZ7 Whole genome shotgun sequence of line PN40024, scaffold_19.assembly12x (Fragment) n=1 Tax=Vitis vinifera RepID=D1IDZ7_VITVI Length = 478 Score = 83.0 bits (203), Expect = 2e-14, Method: Composition-based stats. Identities = 23/197 (11%), Positives = 52/197 (26%), Gaps = 29/197 (14%) Query: 176 RRTAMTAGKRRELHALEENLAIISNSEPAQLLEEERLRKEIAELRAKIERVPFIDTFDLR 235 R ++ E S + + + A I Sbjct: 123 RNSSNGNAAENNPVRTVEIKTYPEVSAAPRSKSYDNFTVLVHLKAAVANTGQNIQRNM-- 180 Query: 236 YKNYEKRPDPSSQAVMFCLMDVSGSMDQSTKDMAKRFYILLYLFLSRTYKNVEVVYIRHH 295 + +P + + ++D+SGSM + + KR Sbjct: 181 SNSPLNSHNPRAPVDLVTVLDISGSMAGTKLALLKRAMGFAL------------------ 222 Query: 296 TQAKEVDEHEFFYSQETGGTIVSSALKLMDEVVKERYNPAQWNIYAAQASDGDNWADDSP 355 GGT ++ L+ +V+++R + SDG + Sbjct: 223 --------QAVNSLVANGGTNIAEGLRKGAKVMEDRKERNPVSS-IILLSDGQDTYTTES 273 Query: 356 LCHEILAKKLLPVVRYY 372 + + A+ + ++ Sbjct: 274 VIQDAFAQCIGGLLSVV 290 >UniRef50_UPI000186D1CC conserved hypothetical protein n=1 Tax=Pediculus humanus corporis RepID=UPI000186D1CC Length = 1003 Score = 83.0 bits (203), Expect = 2e-14, Method: Composition-based stats. Identities = 31/335 (9%), Positives = 78/335 (23%), Gaps = 37/335 (11%) Query: 124 YLDLLFEDLALPNLKQNQQRQLTEYKTHRAGYTANGVPANISVVRSLQNSLARRTAMTAG 183 + L+ + + + + L + AN++ + + ++ + + Sbjct: 96 LIKLVNKAEETYSKYEKINKPLQNAGVNFFNMKDPNNSANMTFSHNFRQKVSYKESGVHI 155 Query: 184 KRRELH--------------ALEENLAIISNSEPAQLLEEERLRKEIAELRAKIERVPFI 229 E + A Sbjct: 156 PLEIFDGYSNILNGVNWTSSLDEVFKENHRMDPNIGWQFFGSFHGFLRVYPAFRWPDHSR 215 Query: 230 DTFDLRYKNYE-KRPDPSSQAVMFCLMDVSGSMDQSTKDMAKRFYILLYLFLSRTYKNVE 288 + +S + L D SGS+ T D+ K L L Sbjct: 216 YPDFFDVRRRSWYIQSSTSPKDVIILFDRSGSVHGPTLDIMKITARALLNSLGENDFVNV 275 Query: 289 VVY--------------IRHHTQAKEVDEHEFFYSQETGGTIVSSALKLMD------EVV 328 + ++ TQ K + E+ T +AL E + Sbjct: 276 AWFNNDVKWVVPCLKTLVQATTQIKNLLADAIERLTESNLTSYVTALDFAYEEFRKFEEI 335 Query: 329 KERYNPAQWNIYAAQASDGDNWADDSPLCHEILAKKLLPVVRYYSYIEITRRAHQTLWRE 388 K+ + + + SDG + ++ +++ + +E Sbjct: 336 KKPWIGSNCHKIVMFLSDGGTEWPTDVINR-HCNNSNNENIKIFTFACGPHPIPTVILKE 394 Query: 389 YEHLQSTFDNFAMQHIRDQDDIYPVFRELFHKQNA 423 + + + + + + + A Sbjct: 395 MACSTGGYFSPITALGSVRIKV-RDYVNVLSRPLA 428 >UniRef50_UPI0001BC3853 von Willebrand factor type A n=1 Tax=Butyrivibrio crossotus DSM 2876 RepID=UPI0001BC3853 Length = 623 Score = 83.0 bits (203), Expect = 2e-14, Method: Composition-based stats. Identities = 28/209 (13%), Positives = 56/209 (26%), Gaps = 39/209 (18%) Query: 236 YKNYEKRPDPSSQAVMFC----LMDVSGSMDQSTKD---MAKRFY--------------- 273 ++ + K + + L+D SGSM + D K Sbjct: 80 FEAWNKEIKYPNSNNIVFDTVILIDCSGSMRTNDPDFEYSVKNTLYPGSSYQITTCYRKL 139 Query: 274 --ILLYLFLSRTYKNVEVVYIRHHTQAKEVDE------HEFFYSQETGGTIVSSALKLMD 325 + V++ E+ + GGT ++A+K Sbjct: 140 ASKNYVKAQGNDDRTGIVLFTSEANTVCELTNSEYVLMNAIDKIYSNGGTNFNNAIKESI 199 Query: 326 EVVKERYNPAQWNIYAAQASDGDNWADDSPLCHEILAKKLLPVVRYYSYIEITRRAHQTL 385 ++ N ++ SDG++ S LA + + I + + L Sbjct: 200 RILTNTRNDSEKR--ILLVSDGESELSSS---VIDLAIENNIKINTV---YIGGQNNNEL 251 Query: 386 WREYEHLQSTFDNFAMQHIRDQDDIYPVF 414 + F + +IY Sbjct: 252 LKNVAERTGGKY-FKAVTADELINIYSEI 279 >UniRef50_UPI000023F6A9 hypothetical protein FG10431.1 n=2 Tax=Gibberella zeae PH-1 RepID=UPI000023F6A9 Length = 851 Score = 83.0 bits (203), Expect = 2e-14, Method: Composition-based stats. Identities = 32/233 (13%), Positives = 55/233 (23%), Gaps = 30/233 (12%) Query: 215 EIAELRAKIERVPFIDTFDLRYKNYEKRPDPSSQ--AVMFCLMDVSGSMDQ--------- 263 A L + +R + + + L+D SGSM Sbjct: 345 SQAILCPPDDAGMAAMMVSIRPSDLFRNAIIPQSFSGEILFLLDQSGSMRGGCGSGFNGL 404 Query: 264 STKDMAKRFYILLYLFLSRTYKNVEVVYIRHHTQAKEVDEHE-FFYSQET---------- 312 D+ + +L+ L +T + + E E Sbjct: 405 RKIDVLREAMLLVISGLPKTCSFNIISWGSETRAIWEQSRKHSPDNINEARDYISQIDSN 464 Query: 313 -GGTIVSSALKLMDEVVKERYNPAQWNIYAAQASDGDNWADDSPLCHEILAKKLLPVVRY 371 GGT + A K + R +DG AD + LL +R+ Sbjct: 465 LGGTDLLRAFKSTVQ----RRRDESNPTQIVVLTDGQLNADKPMEFVWKTRQVLLNKIRF 520 Query: 372 YSYIEITRRAHQTLWREYEHLQSTFDNFAMQHIRDQDDIYPVFRELFHKQNAT 424 ++ H+ L L + L Sbjct: 521 FALGIGRNVPHR-LIEGIAELGGGSGEIIDTTQNSRWH--SRLNRLLKSALEP 570 >UniRef50_A2BM85 Conserved archaeal protein n=1 Tax=Hyperthermus butylicus DSM 5456 RepID=A2BM85_HYPBU Length = 439 Score = 83.0 bits (203), Expect = 2e-14, Method: Composition-based stats. Identities = 51/348 (14%), Positives = 103/348 (29%), Gaps = 48/348 (13%) Query: 86 DRIERPQGGGGGSGSGQGQASQDGEGQDEFVFQISKDEYLDLLFEDLALPNLKQNQQRQL 145 +++ RPQGG S S + + + D+ + L NL Sbjct: 117 NKLPRPQGGETRSKSAADAEQGLNAENIRESVRKALETARDVAQQAKELTNLA------- 169 Query: 146 TEYKTHRAGYTANGVPANISVVRSLQNSLARRTAMTAGKRR--ELHALEENLAIISNSEP 203 N ++ V +LAR T + + + E + P Sbjct: 170 ------MRFTAGNASMLSLDDVIQDVINLARNTDVKVLLEALKTIESTEAYIRTRKIRSP 223 Query: 204 AQLLEEERLRKEIAELRAKIERVP---FIDTFDLRYKNYEKRPDPSSQAVMFCLMDVSGS 260 L+ L +I + A +P F+ F R K+ + L+D SGS Sbjct: 224 RGELDGYELGSDIERVVASELALPTDLFLLKFAERNLLLYKKVVSEEYGKFYVLLDKSGS 283 Query: 261 MDQSTKDMAKRFYILLYLFLSRTYKNVEVVYIRH------------HTQAKEVDEHEFFY 308 M AK + L R + + + H + Sbjct: 284 MMGMKIIWAKAVALALAQRAIREKREFYIRFFDSIPYPPLYIPKRVHGRDVVKLLEYVAR 343 Query: 309 SQETGGTIVSSALKLMDEVVKERYNPAQWNIYAAQASDGDNWADDSPLCHEILAKKLLPV 368 + GGT ++ A+ + + + ++ + +DG++ ++ L Sbjct: 344 IRANGGTDITRAILTAVDDIATKLQRSKVS-DIILITDGED------KIAIDTIRRSLNK 396 Query: 369 V--RYYSYIEITRRAHQTLWREYEHLQSTFDNFAMQHIRDQDDIYPVF 414 V R ++ + R ++ + D+++ V Sbjct: 397 VNARLHTVMISGNNPD---LRAISD------SYMVATKLDREEALRVI 435 >UniRef50_Q47M48 von Willebrand factor, type A n=4 Tax=Streptosporangineae RepID=Q47M48_THEFY Length = 609 Score = 83.0 bits (203), Expect = 2e-14, Method: Composition-based stats. Identities = 37/263 (14%), Positives = 67/263 (25%), Gaps = 41/263 (15%) Query: 191 LEENLAIISNSEPAQLLEEERLRKEIAELRAKIERVPFIDTFDLRYKNYEKRPDPSSQAV 250 E + N+ L E + + + ++ E + A Sbjct: 352 DEAQQVFLDNAFRGHDGRTGDLLTEENGVLPAEPVELSLPSANVLNAMLENWAELRKPAN 411 Query: 251 MFCLMDVSGSMDQS-------TKDMAKRFYILLYLFLSRTYKNVEVVYIR---------- 293 + ++D SGSM +S ++AK I S + + ++ Sbjct: 412 VLLVIDTSGSMQESVPGTGSTRLELAKEAAITSLDEFSDSDRVGLWMFSTDLEDNGQDWR 471 Query: 294 ------------HHTQAKEVDEHEFFYSQETGGTIVSSALKLMDEVVKERYNPAQWNIYA 341 + T +E GGT + +V E P N Sbjct: 472 ELVPLGPLGASVNGTPRREELAERISNLPPGGGTGLYDTALAAHTLVAEHSRPDAINAVV 531 Query: 342 AQASDGDNWADDSPLCHEILAKKL-----LPVVRYYSYIEITRRAHQTLWREYEHLQSTF 396 +DG N + E L + VR ++ A + + Sbjct: 532 FL-TDGKNEDLNGIS-LEKLLDSITPEPGQQGVRIFTISYG-EDADLKTMTQIAEATNAA 588 Query: 397 DNFAMQHIRDQDDIYPVFRELFH 419 D I VF + Sbjct: 589 AY----DASDPQSIDEVFEAVIS 607 >UniRef50_A7RVQ6 Predicted protein n=1 Tax=Nematostella vectensis RepID=A7RVQ6_NEMVE Length = 419 Score = 83.0 bits (203), Expect = 2e-14, Method: Composition-based stats. Identities = 26/190 (13%), Positives = 59/190 (31%), Gaps = 21/190 (11%) Query: 247 SQAVMFCLMDVSGSMDQSTKDMAKRFYILLYLFLSRTY---KNVEVVYIRHHTQAKEVDE 303 +A + L+D SGS+ S + K+F L ++ + + + + + Sbjct: 42 RKADLLFLLDTSGSLSLSNFNTEKKFIRNLLNVIAVGFDATRVEIITFGSDVNRRVPFIS 101 Query: 304 HEFFY-------------SQETGGTIVSSALKLMDEVVKERYNPAQWNIY---AAQASDG 347 E G T + A + EV K ++ + +DG Sbjct: 102 EAHEKDTKCTFNEKFANVVHEWGMTNMRGAFEKAYEVCKGTWSGKKRLNIKTTVILITDG 161 Query: 348 D-NWADDSPLCHEILAKKLLPVVRYYSYIEITRRAHQTLWREYEHLQST-FDNFAMQHIR 405 NW +P + + V ++ + L + ++ F + + Sbjct: 162 HWNWPWQNPDPVPKAQQLIREGVEILAFGVGYGISLSNLQTITANQRAGHTYAFQISNFD 221 Query: 406 DQDDIYPVFR 415 + + + R Sbjct: 222 EFNKLATYLR 231 >UniRef50_A7BVG3 von Willebrand factor type A domain protein n=1 Tax=Beggiatoa sp. PS RepID=A7BVG3_9GAMM Length = 280 Score = 82.6 bits (202), Expect = 3e-14, Method: Composition-based stats. Identities = 23/160 (14%), Positives = 41/160 (25%), Gaps = 15/160 (9%) Query: 239 YEKRPDPSSQAVMFCLMDVSGSMDQSTKDMAKRFYILLYLFLS-RTYKNVEVVYIRHH-- 295 +F L+DVS SMD S AK+ + + Sbjct: 79 PSPESVTLVHQSVFLLIDVSYSMDGSALAEAKQAAQEFVRKSDLAHTAIGLIEFGSKAKI 138 Query: 296 ----TQAKEVDEHEFFYSQETGGTIVSSALKLMDEVVKERYNPAQWNIYAAQASDGDNWA 351 TQ + + G T ++ L +K +P + +DG Sbjct: 139 ISGLTQNAKHLYKAINRLKTNGSTNMTEGLTTAYLKLKNVDDP----RFIILLTDGLPNH 194 Query: 352 DDSPLCHEILAKKLLPVVRYYSYIEITRRAHQTLWREYEH 391 + + + T A +T + Sbjct: 195 PKNTQQIAQEI--CADGIELITIG--TGDADKTYLQSLAC 230 >UniRef50_B2A702 von Willebrand factor type A n=1 Tax=Natranaerobius thermophilus JW/NM-WN-LF RepID=B2A702_NATTJ Length = 599 Score = 82.6 bits (202), Expect = 3e-14, Method: Composition-based stats. Identities = 21/187 (11%), Positives = 55/187 (29%), Gaps = 18/187 (9%) Query: 241 KRPDPSSQAVMFCLMDVSGSMDQSTKDMAKRFYILLYLFLSRTYKNVEVVYIRHHTQAKE 300 + + + L+D SGSM K F + L K + + + + Sbjct: 414 RNKKKQTSMNVCFLVDASGSMGGRRMQEVKFFAEHVL--LKGRDKIAILTFREDNVNVEI 471 Query: 301 VDEHEFFYSQET-------GGTIVSSALKLMDEVVKERYNPAQWNIYAAQASDGDNW--- 350 + + G T +S +++ + ++ + N + +DG Sbjct: 472 PFTRNWDKLRSGLNKIKAFGLTPMSKGIEMARKYLESEVGQQK-NTFLVLITDGLPTISD 530 Query: 351 -ADDSPLCHEILAKKLLPVVRYYSYIEITRRAHQTLWREYEHLQSTFDNFAMQHIRDQDD 409 +D A+KL ++ I + ++ + ++ Sbjct: 531 GGEDPFKETLKAAQKLSQ--TSIKFVCIGLEPNVKFLKKLAQASQASLYIVEEL--QKES 586 Query: 410 IYPVFRE 416 + + + Sbjct: 587 LARIMDK 593 >UniRef50_Q1Q2F5 Putative uncharacterized protein n=1 Tax=Candidatus Kuenenia stuttgartiensis RepID=Q1Q2F5_9BACT Length = 333 Score = 82.6 bits (202), Expect = 3e-14, Method: Composition-based stats. Identities = 27/218 (12%), Positives = 51/218 (23%), Gaps = 47/218 (21%) Query: 236 YKNYEKRPDPSSQAVMFCLMDVSGSM-----DQSTKDMAKRFYILLYLFLSRTYKNVEVV 290 ++E+ + +D S SM + ++AKR L L + + Sbjct: 78 GYHWEEVEKK--GIDIMIAVDTSRSMLADDVKPNRLEVAKREIEDLLKIL-EGDRVGLIA 134 Query: 291 YIRHH------TQAKEVDEHEFFYSQET----GGTIVSSALKLMDEVVKERYNPAQWNIY 340 + T GGT ++ A+ + E + Sbjct: 135 FAGRAFTYCPLTSDYSAFRLFLNDLNVNIIPVGGTAIAEAIYKGIDAFGEN---ENNHKA 191 Query: 341 AAQASDGDNWADDSPLCHEILAKKLLPVVRYYSYIEITRRA------------------- 381 +DG+N + PL AK+ V+ + Sbjct: 192 MIIITDGEN-HETDPLKAASKAKEKGIVIYTVGVGKKEGSYIKIIDEQGKETLLKDAHGQ 250 Query: 382 ------HQTLWREYEHLQSTFDNFAMQHIRDQDDIYPV 413 + + A IY Sbjct: 251 VVKSRLDEITLNKIALETGGLYTPAYGTKWGLAKIYNE 288 >UniRef50_C3XPW9 Putative uncharacterized protein n=1 Tax=Branchiostoma floridae RepID=C3XPW9_BRAFL Length = 2122 Score = 82.6 bits (202), Expect = 3e-14, Method: Composition-based stats. Identities = 26/192 (13%), Positives = 58/192 (30%), Gaps = 21/192 (10%) Query: 241 KRPDPSSQAVMFCLMDVSGSMDQSTKDMAK-RFYILLYLFLSRTYKNVEVVYIRHHTQAK 299 + S+A + L+D SGS+ + + + F L L+ + + V + A+ Sbjct: 37 VKKYQDSRADIVFLLDNSGSVGRYNFEEVEIAFVENLLSQLTISPQASRVAVVSFDDVAR 96 Query: 300 EVD------EHEFFYSQE-------TGGTIVSSALKLMDEVVK---ERYNPAQWNIYAAQ 343 +++ + +E T A +L E+++ N Sbjct: 97 THIDYIKYPKNKCSFLRELKTVKYIGEWTNTEDAFRLAQELLRPPSAFKNERPVKQVVIL 156 Query: 344 ASDGDNWADDSPLCHEILAKKLLPVVRYYSYIEITRRAHQTLWREYEHLQSTFDNFAMQH 403 +DG P+ K + +S ++ + + + Sbjct: 157 LTDGRPTRGGDPVKRANNLKSVY-NAEIFSIGIG-GNLNKQQLEDCA--TDAQHLYLSPN 212 Query: 404 IRDQDDIYPVFR 415 D D+ R Sbjct: 213 FVDFKDLAKRIR 224 >UniRef50_A4ACS0 Magnesium-chelatase, 60 kDa subunit n=2 Tax=unclassified Gammaproteobacteria RepID=A4ACS0_9GAMM Length = 615 Score = 82.2 bits (201), Expect = 3e-14, Method: Composition-based stats. Identities = 44/325 (13%), Positives = 89/325 (27%), Gaps = 25/325 (7%) Query: 34 ISEAINKRSVTDVDSGESVSIPTEDISEPMFHQGRGGLRHRVHPGNDHFVQNDRIERPQG 93 ++ + SV+D + V + ++ + V D + + + Sbjct: 233 LTALAREESVSDEQLEQVVRLVLLPLATRLPGGDEQDAEDDVEESPDDSADDSQPPDNET 292 Query: 94 GGGGSGSGQGQASQDGEGQDEFVFQISKDEYLDLLFEDLALPNLKQNQQRQLTEYKTHRA 153 A ++ E + + L + L L + L + Sbjct: 293 PDAPDSPPPSDAQRESEDDKAPDAEQPNTDPDPLTDDRL-LEAAQAMLPPDLLAKLLSGS 351 Query: 154 GYTANGVPANISVVRSLQNSLARRTAMTAGKRRELHALEENLAIISNSEPAQLLEEERLR 213 + S R R G R L + +A + + P Q L Sbjct: 352 MGARRNAASGKSGARQRAKMRGRPMGSLPGDPRSGARL-DVMATLRTAAPWQRLRGPATG 410 Query: 214 KEIAELRAKIERVPFIDTFDLRYKNYEKRPDPSSQAVMFCLMDVSGSMDQSTKDMAKRFY 273 ++A+ R+ SQ+V ++D SGS AK Sbjct: 411 GARLRVQAEDFRIVRF--------------RQRSQSVTVFVVDASGSSALYRLAEAKGAV 456 Query: 274 ILLY-LFLSRTYKNVEVVYIRHH-------TQAKEVDEHEFFYSQETGGTIVSSALKLMD 325 LL R + + + T++ + GGT +++ + Sbjct: 457 ELLLADCYVRRDEVALIAFRGDSAELLLPPTRSLVRAKRSLSALPGGGGTPLAAGIDATA 516 Query: 326 EVVKERYNPAQWNIYAAQASDGDNW 350 E+++ Y +DG Sbjct: 517 ELLEMLDRRGATPAYVML-TDGRGN 540 >UniRef50_Q80UW6 Parp4 protein (Fragment) n=11 Tax=Eukaryota RepID=Q80UW6_MOUSE Length = 498 Score = 82.2 bits (201), Expect = 3e-14, Method: Composition-based stats. Identities = 38/295 (12%), Positives = 81/295 (27%), Gaps = 23/295 (7%) Query: 137 LKQNQQRQLTEYKTHRAGYTANGVPANISVVRSLQNSLARRTAMTAGKRRELHALEENLA 196 L +N Q + + G + A + + ++ T K + A+ + Sbjct: 132 LNENLQDTVETIRIKEIGAEQSFSLAMSIEMPYMIEFISSDTHELRQKSTDCKAVVSTVE 191 Query: 197 IISNSEPAQLLEEERLRKEIAELRAKIERVPFIDTFDLRYKNYEKRPDP--SSQAVMFCL 254 S L + + + + L ++ P + + Sbjct: 192 GSSLDSGGFSLHIGLRDAYLPRMWVEKHPEKESEACMLVFQPELADVLPDLRGKNEVIIC 251 Query: 255 MDVSGSMDQSTKDMAKRFYILLYLFLSRTYKNVEVVY-------IRHH---TQAKEVDEH 304 +D S SM+ T AK+ + L K + + + T +K E Sbjct: 252 LDCSSSMEGVTFTQAKQVALYALSLLGEEQKVNIMQFGTGYKELFSYPKCITDSKMATEF 311 Query: 305 EFFYSQETGGTIVSSALKLMDEVVKERYNPAQWNIYAAQASDGDNWADDSPLCHEILAKK 364 + G T L+ + Y + SDG S L K+ Sbjct: 312 IMSAAPSMGNTDFWKVLRY----LSLLYPSEGF-RNILLISDGH---LQSESLTLQLVKR 363 Query: 365 LLPVVRYYSYIEITRRAHQTLWREYEHLQSTFDNFAMQHIRD--QDDIYPVFREL 417 + R ++ A++ + R + + + + I + Sbjct: 364 NIQHTRVFTCAVG-STANRHILRTLSQCGAGVFEYFNSKSKHSWKKQIEAQMTRI 417 >UniRef50_Q6A0B1 MKIAA0177 protein (Fragment) n=4 Tax=Murinae RepID=Q6A0B1_MOUSE Length = 1269 Score = 82.2 bits (201), Expect = 3e-14, Method: Composition-based stats. Identities = 38/295 (12%), Positives = 81/295 (27%), Gaps = 23/295 (7%) Query: 137 LKQNQQRQLTEYKTHRAGYTANGVPANISVVRSLQNSLARRTAMTAGKRRELHALEENLA 196 L +N Q + + G + A + + ++ T K + A+ + Sbjct: 769 LNENLQDTVETIRIKEIGAEQSFSLAMSIEMPYMIEFISSDTHELRQKSTDCKAVVSTVE 828 Query: 197 IISNSEPAQLLEEERLRKEIAELRAKIERVPFIDTFDLRYKNYEKRPDP--SSQAVMFCL 254 S L + + + + L ++ P + + Sbjct: 829 GSSLDSGGFSLHIGLRDAYLPRMWVEKHPEKESEACMLVFQPELADVLPDLRGKNEVIIC 888 Query: 255 MDVSGSMDQSTKDMAKRFYILLYLFLSRTYKNVEVVY-------IRHH---TQAKEVDEH 304 +D S SM+ T AK+ + L K + + + T +K E Sbjct: 889 LDCSSSMEGVTFTQAKQVALYALSLLGEEQKVNIMQFGTGYKELFSYPKCITDSKMATEF 948 Query: 305 EFFYSQETGGTIVSSALKLMDEVVKERYNPAQWNIYAAQASDGDNWADDSPLCHEILAKK 364 + G T L+ + Y + SDG S L K+ Sbjct: 949 IMSAAPSMGNTDFWKVLRY----LSLLYPSEGF-RNILLISDGH---LQSESLTLQLVKR 1000 Query: 365 LLPVVRYYSYIEITRRAHQTLWREYEHLQSTFDNFAMQHIRD--QDDIYPVFREL 417 + R ++ A++ + R + + + + I + Sbjct: 1001 NIQHTRVFTCAVG-STANRHILRTLSQCGAGVFEYFNSKSKHSWKKQIEAQMTRI 1054 >UniRef50_A8K7I4 Calcium-activated chloride channel regulator 1 n=44 Tax=Eumetazoa RepID=CLCA1_HUMAN Length = 914 Score = 82.2 bits (201), Expect = 3e-14, Method: Composition-based stats. Identities = 29/267 (10%), Positives = 74/267 (27%), Gaps = 31/267 (11%) Query: 161 PANISVVRSLQNSLARRTAMTAGKRRELHALEENLAIISN---------SEPAQLLEEER 211 + V L + + +++ I P + ++ Sbjct: 209 RCTFNKVTGLYEKGCEFVLQSRQTEKASIMFAQHVDSIVEFCTEQNHNKEAPNKQNQKCN 268 Query: 212 LRKEIAELRAKIERVPFIDTFDLRYKNYEKRPDPSSQAVMFCLMDVSGSMD-QSTKDMAK 270 LR +R + + N Q ++ ++D SGSM + + Sbjct: 269 LRSTWEVIRDSED-FKKTTPMTTQPPNPTFSLLQIGQRIVCLVLDKSGSMATGNRLNRLN 327 Query: 271 RF-YILLYLFLSRTYKNVEVVYIRHHTQAKEVDE----HEFFYSQ------ETGGTIVSS 319 + + L + V + E+ + + +GGT + S Sbjct: 328 QAGQLFLLQTVELGSWVGMVTFDSAAHVQSELIQINSGSDRDTLAKRLPAAASGGTSICS 387 Query: 320 ALKLMDEVVKERYNPAQWNIYAAQASDGDNWADDSPLCHEILAKKLLPVVRYYSYIEITR 379 L+ V++++Y +DG++ K+ ++ + Sbjct: 388 GLRSAFTVIRKKYPTDGSE--IVLLTDGEDNTISG---CFNEVKQSGAIIHTVALG---- 438 Query: 380 RAHQTLWREYEHLQSTFDNFAMQHIRD 406 + E + +A +++ Sbjct: 439 PSAAQELEELSKMTGGLQTYASDQVQN 465 >UniRef50_D1XJQ4 von Willebrand factor type A n=2 Tax=Streptomyces RepID=D1XJQ4_9ACTO Length = 624 Score = 82.2 bits (201), Expect = 3e-14, Method: Composition-based stats. Identities = 25/190 (13%), Positives = 51/190 (26%), Gaps = 21/190 (11%) Query: 247 SQAVMFCLMDVSGSMDQST------KDMAKRFYILLYLFLSRTYKNVEVVYIR------H 294 + + ++D SGSM + + A+R + L Y VY Sbjct: 25 AGGSLVMVLDSSGSMGEDDGTGSTRMESARRAVGAVVDALPDGYPTGLRVYGADRPQGCA 84 Query: 295 HTQ--------AKEVDEHEFFYSQETGGTIVSSALKLMDEVVKERYNPAQWNIYAAQASD 346 T+ + + + TG T + +L+ E + + A SD Sbjct: 85 DTRLVRPVRPLDRAAVKSAVAGVRPTGDTPIGLSLRKAAEDLPAPRDGAARTRTIVLVSD 144 Query: 347 GDNW-ADDSPLCHEILAKKLLPVVRYYSYIEITRRAHQTLWREYEHLQSTFDNFAMQHIR 405 G++ P +R + + A + + A Sbjct: 145 GEDTCGTPPPCEVAARLAGQGAGLRIDTVGFQVKGAAREQLECVAEAGNGRYYDAPDADA 204 Query: 406 DQDDIYPVFR 415 + + Sbjct: 205 LARQLLRSAQ 214 >UniRef50_B3SBE5 Putative uncharacterized protein n=2 Tax=Trichoplax adhaerens RepID=B3SBE5_TRIAD Length = 1262 Score = 82.2 bits (201), Expect = 3e-14, Method: Composition-based stats. Identities = 29/234 (12%), Positives = 68/234 (29%), Gaps = 21/234 (8%) Query: 205 QLLEEERLRKEIAELRAKIERVPFIDTFDLRYKNYEKRPDPSSQAVMFCLMDVSGSMDQS 264 L + + + L + E + + + +D+S SM Q Sbjct: 852 FRLLIRLAEIHVPRMWVEKYPDTDSQACMLTFY-PEFDVTENPRPEIIIALDMSNSMKQC 910 Query: 265 TKDMAKRFYILLYLFLSRTYKNVEVVYIRHHTQAKEVDEHEFF-----------YSQET- 312 D K ++L L + VV+ H + + + + Sbjct: 911 LMDTQKIAALILTN-LPPECRFNIVVFGSAHNELFPMYQEVSKESVNMAIKFIGSLSASW 969 Query: 313 GGTIVSSALKLMDEVVKERYNPAQWNIYAAQASDGDNWADDSPLCHEILAKKLLPVVRYY 372 G + + +VKE + N SDG ++S L + + +R + Sbjct: 970 GNSNFYHVIDNFTHIVKELKANSVSN--VFLISDGHFGDENSITAI--LRRDKIDNLRLF 1025 Query: 373 SYIEITRRAHQTLWREYEHLQSTFDNFAMQHIRDQDDIYPVFRELFHKQNATAK 426 ++ +++ R + + + R + + +K + Sbjct: 1026 TFSTG-DTSNRYFMRTLAKIGAGYHEHFDTKFRSKWQ--QKIQNQIYKASQPTL 1076 >UniRef50_B0EK65 Putative uncharacterized protein n=7 Tax=Entamoeba RepID=B0EK65_ENTDI Length = 720 Score = 82.2 bits (201), Expect = 3e-14, Method: Composition-based stats. Identities = 22/190 (11%), Positives = 51/190 (26%), Gaps = 23/190 (12%) Query: 243 PDPSSQAVMFCLMDVSGSMDQSTKDMAKRFYILLYLFLSRTYKNVEVVYIR-HHTQAKEV 301 + + + D SGSM + + L L K + + + KE+ Sbjct: 206 KEKEGDINIIFICDRSGSMYGEGINALRNMLQLFLRQLPLKSKFEIISFGSKYDFMFKEM 265 Query: 302 DEHEFFYSQ-ET----------GGTIVSSALKLMDEVVKERYNPAQWNIYAAQASDGDNW 350 E+ + + GGT + + LK + + + +DG Sbjct: 266 VEYNEDTLKNASNRISEFEANYGGTSMDAPLKALID-------NNTEKCHIILLTDG--- 315 Query: 351 ADDSPLCHEILAKKLLPVVRYYSYIEITRRAHQTLWREYEHLQSTFDNFAMQHIRDQDDI 410 D+ + L + L R + + + + ++ Sbjct: 316 YVDNKINTIEYIHNLSKKNSLHGVGLGRS-CDIELIRNIGRIGNGISVISKNVNLLKKEV 374 Query: 411 YPVFRELFHK 420 + + Sbjct: 375 SKITERILIP 384 >UniRef50_Q46D40 Putative uncharacterized protein n=3 Tax=Methanosarcina RepID=Q46D40_METBF Length = 612 Score = 82.2 bits (201), Expect = 3e-14, Method: Composition-based stats. Identities = 49/407 (12%), Positives = 109/407 (26%), Gaps = 37/407 (9%) Query: 18 NRQRFLRRYKAQIKQSISEAINKRSVTDVDS---GESVSIPTEDISEPMFHQGRGGLR-H 73 N+Q + + S + S+ S S E F Q Sbjct: 184 NQQATQENKQENPLEQESSLTQENSLPQESSLPQESSFQQENSLEQESSFQQESSLQDPE 243 Query: 74 RVHPGNDHFVQNDRIERPQGGGGGSG-SGQGQASQDGEGQDEFVFQISKDEYLDLLFEDL 132 + N + E + +A + + E +E + +L + L Sbjct: 244 HENWENPDIQAGNTAESERLASMTLNFMSSEKAGEVLDSVIEESIAAKIEELIPVLEDHL 303 Query: 133 A----LPNLKQNQQRQLTEYKTHRAGYTANGVPANISVVRSLQNSLARRT---------A 179 L L + + HR + A + S + + + Sbjct: 304 EMLEILSMLFPGRAWDYSLKALHREYFGNLEKYAALLRKSSAIHEILEQVGRIELEYGSK 363 Query: 180 MTAGKRRELHALEENLAIISNSEPAQLLEEERLRKEIAELRAKIERVPFIDTFDLRYKNY 239 + + + + + T+ L+ +N+ Sbjct: 364 KLSLSPYSKSEVHSVTFSGDLRTLLPAETVKLKNPLLKRKFYADMLEGKLLTYQLKGENW 423 Query: 240 EKRPD-PSSQAVMFCLMDVSGSMDQSTKDMAKRFYILLYLFLSRTYKNVEVVYIRHHTQA 298 + + L+D S SM S + +AK + + + ++V+V+ Q Sbjct: 424 NSDSAGKKRKGPVVALVDTSASMRGSPELLAKAVVLAVTRRMLTENRDVKVILFSSKWQT 483 Query: 299 KEVDEHEFFYS----------QETGGTIVSSALKLMDEVVKERYNPAQWNIYAAQASDGD 348 E++ GGT ++AL+ + +K + +DG Sbjct: 484 VEIELTNKKRMGEEFLEFLKFTFGGGTDFNTALRAGLKAMKNEKAFEGAD--LLFLTDG- 540 Query: 349 NWADDSPLCHEILAKKLL--PVVRYYSYIEITRRAHQTLWREYEHLQ 393 +++ S ++ R +S I ++ Sbjct: 541 -YSELSEKPLIREWNEIKAERRARIFSLIIG--NYDAGGLQQISDHT 584 >UniRef50_C0ZJ85 Putative uncharacterized protein n=1 Tax=Brevibacillus brevis NBRC 100599 RepID=C0ZJ85_BREBN Length = 677 Score = 82.2 bits (201), Expect = 3e-14, Method: Composition-based stats. Identities = 24/207 (11%), Positives = 63/207 (30%), Gaps = 33/207 (15%) Query: 246 SSQAVMFCLMDVSGSMD-QSTKDMAKRFYILLYLFLSRTYKNVEVVYIRHHTQ------- 297 + ++D S SM+ A + + ++ ++ + Sbjct: 42 EAGVDAVFVVDTSNSMNKTDPGKTAAEVMSMFIDM--SEATRTRIGFVAYNDRIVQAQSP 99 Query: 298 -------AKEVDEHEFFYSQETGGTIVSSALKLMDEVVKERYNPAQWNIYAAQASD---- 346 +E + + +G + + L+ E++++ +PA+ + SD Sbjct: 100 ASMAEARNREQLKRTIQGLRYSGYSDLGLGLRRGAEMIEKAKDPAR-KPFLILLSDGGTD 158 Query: 347 ------GDNWADDSPLCHEILAKKLLPVVRYYSYIEIT-RRAHQTLWREYEHLQSTFDNF 399 G + A + +++K Y+ + ++ +F Sbjct: 159 LRQNAGGRSVAASNKDVETVISKAKAQGYPIYTIGLNNDGSVQKEQLKKIAEATGGT-SF 217 Query: 400 AMQHIRDQDDIYPVFRELFHKQNATAK 426 Q D +I F ++F K + Sbjct: 218 VTQSTDDLPEI---FNQIFAKHIQSQL 241 >UniRef50_A1SYF6 von Willebrand factor, type A n=1 Tax=Psychromonas ingrahamii 37 RepID=A1SYF6_PSYIN Length = 327 Score = 82.2 bits (201), Expect = 3e-14, Method: Composition-based stats. Identities = 26/213 (12%), Positives = 57/213 (26%), Gaps = 40/213 (18%) Query: 240 EKRPDPSSQAVMFCLMDVSGSMDQSTKDMAKRFYILLYLFL---------SRTYKNVEVV 290 + M +D+SGSM + + + L L + + ++ Sbjct: 77 DPIRLQQQSRDMIISLDLSGSMQEVDMPLNGQTVDRLTLLKDLLKTFIKQRQGDRLGLIL 136 Query: 291 YIRHH------TQAKEVDEHEFFYSQ---ETGGTIVSSALKLMDEVVKERYNPAQWNIYA 341 + H T + + S+ T + ++ + + E N Sbjct: 137 FADHAYLQTPLTFDLKTIQQMVDESEIGLAGTRTAIGESIAMAIKRFVENKNE---QRVL 193 Query: 342 AQASDGDNWADDSPLCHEILAKKLLPVVRYYSYIEITR-----------------RAHQT 384 SDG N + S + + + Y+ + Sbjct: 194 ILVSDGANNSG-SIEPIQAAKQAAKNNITIYTIGMGAEQMIKRGLFGNQRINPSADLDEK 252 Query: 385 LWREYEHLQSTFDNFAMQHIRDQDDIYPVFREL 417 E +L F ++ + +IY +L Sbjct: 253 TLTEIANLTGGKY-FRARNQTELQNIYQTLNKL 284 >UniRef50_B5JVB6 von Willebrand factor, type A n=1 Tax=gamma proteobacterium HTCC5015 RepID=B5JVB6_9GAMM Length = 336 Score = 81.9 bits (200), Expect = 4e-14, Method: Composition-based stats. Identities = 29/226 (12%), Positives = 61/226 (26%), Gaps = 46/226 (20%) Query: 240 EKRPDPSSQAVMFCLMDVSGSMDQSTKD----------MAKRFYILLYLFLSRTYKNVEV 289 E P + +D+SGSM++ D + K + + V Sbjct: 77 EPVAMPREGRALVVALDISGSMEEQDMDDNGQRRSRIAVTKDVAMDFVKQ-REGDRIALV 135 Query: 290 VYIRHH------TQAKEVDEH--------EFFYSQET-GGTIVSSALKLMDEVVKERYNP 334 ++ H T Q T + A+ L + +++ P Sbjct: 136 LFGTHPYLQTPLTFDHPTVMQHIYEAQLTMADDLQRGIHATAIGDAIGLAVKRLRDIDAP 195 Query: 335 AQWNIYAAQASDG-DNWADDSPLCHEILAKKLLPVVRYYS--------------YIEI-T 378 + +DG DN + +PL +A + + + Sbjct: 196 DKT---LILLTDGSDNASQVAPLKAAQIAAREGLKIYTIGLGAEQRQASLLGFDFGFGKN 252 Query: 379 RRAHQTLWREYEHLQSTFDNFAMQHIRDQDDIYPVFRELFHKQNAT 424 R + ++ F ++ + +IY L + Sbjct: 253 REIDEKTLKDIAKATDGRY-FRARNPEELREIYQHIDRLEPSEAEA 297 >UniRef50_UPI00016E1D58 UPI00016E1D58 related cluster n=1 Tax=Takifugu rubripes RepID=UPI00016E1D58 Length = 451 Score = 81.9 bits (200), Expect = 4e-14, Method: Composition-based stats. Identities = 27/189 (14%), Positives = 53/189 (28%), Gaps = 26/189 (13%) Query: 250 VMFCLMDVSGSMDQSTKDMAKRFYILLYLFLS---RTYKNVEVVYIRHHTQAKEVDEHEF 306 + ++D SGS+ + + + F L L + V+Y +V + F Sbjct: 1 DIVFIIDESGSIGSANFQLMRSFLHSLISGLQVASNRVRVGIVMYNVEP--MAQVFLNTF 58 Query: 307 FYSQE-----------TGGTIVSSALKLMDEVV----KERYNPAQWNIYAAQASDGDNWA 351 E GGT +AL + V + A +DG + Sbjct: 59 KDKSELLDFIKILPYHGGGTNTGAALNFTLQEVFIKQRGSRKDLGVQQVAVVITDGKSQD 118 Query: 352 DDSPLCHEILAKKLLPVVRYYSYIEITRRAHQTLWREYEHLQSTFDNFAMQHIRDQDDIY 411 + S A V Y+ A + + + F + + Sbjct: 119 EVSSPA----ANLRRAGVTVYAVGVK--DADKAQLDQIASYPTNKHTFIIDSFTKLKTLE 172 Query: 412 PVFRELFHK 420 + + + Sbjct: 173 ASLQRILCQ 181 Score = 64.9 bits (156), Expect = 6e-09, Method: Composition-based stats. Identities = 25/183 (13%), Positives = 49/183 (26%), Gaps = 21/183 (11%) Query: 245 PSSQAVMFCLMDVSGSMDQSTKDMAKRFYILLYLFLSRTYKNVEVVYIRHHT-------- 296 + L+D SGS+ K F + V V +++ T Sbjct: 227 KDIPGDLIFLIDSSGSIYPEDYKKMKDFMKSVIKQSIVGKNEVHVGVMQYSTIQKLVFPL 286 Query: 297 ---QAKEVDEHEFFYSQE-TGGTIVSSALKLMDEVVKERYNPAQWNIY-AAQASDGDNWA 351 K+ Q+ GGT A+ + + R +DG+ + Sbjct: 287 NQYYTKDELSKAIDEMQQIGGGTHTGEAITDVSQYFDARNGGRPDLKQRLVVVTDGE--S 344 Query: 352 DDSPLCHEILAKKLLPVVRYYSYIEITRRAHQTLWREYEHLQSTFDNFAMQHIRDQDDIY 411 D + +V + A+ + E + +A + D+ Sbjct: 345 QDEVRQPAEALRAKGVIVYSIGVV----AANTSQLLEIS--GTPNRMYAGRDFDALKDLE 398 Query: 412 PVF 414 Sbjct: 399 KQM 401 >UniRef50_C3ZZV2 Putative uncharacterized protein n=1 Tax=Branchiostoma floridae RepID=C3ZZV2_BRAFL Length = 4065 Score = 81.9 bits (200), Expect = 4e-14, Method: Composition-based stats. Identities = 28/188 (14%), Positives = 49/188 (26%), Gaps = 23/188 (12%) Query: 245 PSSQAVMFCLMDVSGSMDQSTKDMAKRFYILLYLFLSRT---YKNVEVVYI--------- 292 + + L+D SGS+ + D+ K F + + + V Y Sbjct: 1584 RTLNLDVVFLLDGSGSVGSANFDLLKTFTTRIATNFDVSTNLTRVGVVQYSDQTNSEFVL 1643 Query: 293 -RHHTQAKEVDEHEFFYSQETGGTIVSSALKLMDE--VVKERYNPAQWNIYAAQASDGDN 349 T+A+ + Q GGT +AL + + + + +DG Sbjct: 1644 NTFSTEAEVLAAIAAISYQ-NGGTSTGAALDYVRQNVFISASGDRPDAANILIVLTDG-- 1700 Query: 350 WADDSPLCHEILAKKLLPVVRYYSYIEITRRAHQTLWREYEHLQSTFDNFAMQHIRDQDD 409 S + YS ++ DD Sbjct: 1701 --VSSDDVSFPAMAARNAGITIYSVGIGDG-VDYNTLQQIA--GDPNKVLQATGFSSLDD 1755 Query: 410 IYPVFREL 417 I EL Sbjct: 1756 IGGQLEEL 1763 Score = 78.4 bits (191), Expect = 5e-13, Method: Composition-based stats. Identities = 23/182 (12%), Positives = 44/182 (24%), Gaps = 19/182 (10%) Query: 223 IERVPFIDTFDLRYKNYEKRPDPSSQAVMFCLMDVSGSMDQSTKDMAKRFYILLYLFLSR 282 + L+D SGS+ + K F L+ Sbjct: 1815 WSEPVPECMAPTTPPPPGCDELSFGGWDLVFLLDGSGSVGSNNFLNVKNFTKLITDLFPV 1874 Query: 283 TYKNVEVVYIRH-HTQAKEVDEHEFFY----SQE-------TGGTIVSSALKLMDEVVKE 330 +V ++ T KE D ++ GGT +A+ + +V Sbjct: 1875 GDNATKVGLVQFSDTIQKEFDLRDYDTKAEILSAIDNISYLGGGTYTGNAIDYVRQVSFN 1934 Query: 331 --RYNPAQWNIYAAQASDGDNWADDSPLCHEILAKKLLPVVRYYSYIEITRRAHQTLWRE 388 N +DG+++ + ++ T E Sbjct: 1935 TINGNRGSHPDMLIVLTDGESFDP----VTFASQSARDQGITIFAIGVGTG-VDYATLEE 1989 Query: 389 YE 390 Sbjct: 1990 IA 1991 Score = 71.5 bits (173), Expect = 7e-11, Method: Composition-based stats. Identities = 20/187 (10%), Positives = 47/187 (25%), Gaps = 21/187 (11%) Query: 245 PSSQAVMFCLMDVSGSMDQSTKDMAKRFYILLYLFLSRTYKNVEVVYIRHHTQAKEVD-- 302 + L+D SGS+ + D+ K F L ++ +++ Sbjct: 747 RDVPLDIVFLLDGSGSVGSANFDLVKDFTRTLARNFDIAANMTQIGVVQYSDTVNREFGL 806 Query: 303 ------EHEFFYSQE----TGGTIVSSALKLMDEVVKERYNPAQ--WNIYAAQASDGDNW 350 + GGT+ +A+ + + + + +DG Sbjct: 807 GDFHNRQDVLNAISAVSYQQGGTLTGAAIDFVRQTSFTTGDGDRPDVPNMLIVVTDG--V 864 Query: 351 ADDSPLCHEILAKKLLPVVRYYSYIEITRRAHQTLWREYEHLQSTFDNFAMQHIRDQDDI 410 + DS A+ + + E + + Sbjct: 865 SGDSVQGPADAAR--REGITTFGVGIGNG-IDFGTLLEIA--GDSARVLQADDFGALATV 919 Query: 411 YPVFREL 417 +E+ Sbjct: 920 AQRLQEV 926 Score = 69.1 bits (167), Expect = 3e-10, Method: Composition-based stats. Identities = 21/168 (12%), Positives = 45/168 (26%), Gaps = 19/168 (11%) Query: 237 KNYEKRPDPSSQAVMFCLMDVSGSMDQSTKDMAKRFYILLYLFLSRTYKNVEVVYIRHHT 296 + P + L+D SGS+ + ++ K F + L + + V +++ Sbjct: 1008 RAPVCANFPYGGLDLVFLLDGSGSVGTTNFELVKDFTSEVVLNFNISADTTNVGVVQYSD 1067 Query: 297 QAKE-----------VDEHEFFYSQE-TGGTIVSSALKLMDEVVKERYNPAQW--NIYAA 342 + TGGT+ A+ + + R A+ Sbjct: 1068 TVRNEFFLSSYDTKLPLIDAINQISYLTGGTLTGFAIDYVRQSSFSRPAGARNTFPDVLV 1127 Query: 343 QASDGDNWADDSPLCHEILAKKLLPVVRYYSYIEITRRAHQTLWREYE 390 +DG A + ++ T + Sbjct: 1128 VLTDGQ----SQDDVVSSAAAARSQGITIFAVGIG-SEVDFTTLLQIS 1170 Score = 67.2 bits (162), Expect = 1e-09, Method: Composition-based stats. Identities = 25/188 (13%), Positives = 48/188 (25%), Gaps = 23/188 (12%) Query: 250 VMFCLMDVSGSMDQSTKDMAKRFYILLYLFL---SRTYKNVEVVYIRHHTQAKEVDEHEF 306 + L+D SGS+ S+ D+ K F + + + V Y + A E + Sbjct: 2126 DLVFLLDGSGSVGASSFDLMKSFTNRITTNFDVSPTSTRVGVVQYSSQGSVATEFRLDSY 2185 Query: 307 FY-----------SQETGGTIVSSALKLMDE--VVKERYNPAQWNIYAAQASDGDNWADD 353 + G T AL + + A +DG + D Sbjct: 2186 SNKDDVIAAVNGIVYQNGNTYTGEALNYVRQNSFAVANGGRADVANILVVITDGQSVD-D 2244 Query: 354 SPLCHEILAKKLLPVVRYYSYIEITRRAHQTLWREYEHLQSTFDNFAMQHIRDQDDIYPV 413 + L ++ V Y+ + + + Sbjct: 2245 VTGPAQDLLRE---GVTVYALGIGDG-IQYSTLEAIAQ--DQSRVLQANTFTNLSNTAQA 2298 Query: 414 FRELFHKQ 421 +E Sbjct: 2299 LQESLGDA 2306 Score = 66.1 bits (159), Expect = 2e-09, Method: Composition-based stats. Identities = 24/183 (13%), Positives = 53/183 (28%), Gaps = 21/183 (11%) Query: 249 AVMFCLMDVSGSMDQSTKDMAKRFYILLYLFLSRTYKNVEVVYIRHHTQA---------- 298 + L+D SGS+ ++ K F + + + ++ Sbjct: 1302 LDLIFLLDGSGSITAPNFELVKSFTYSVSRNFDVSPNATRIGVAQYSDTNSLEFNLNRYS 1361 Query: 299 -KEVDEHEFFYSQ-ETGGTIVSSALKLMDE--VVKERYNPAQWNIYAAQASDGDNWADDS 354 K+ + + GGT +AL + + +V+ + A+DG+ + D Sbjct: 1362 TKDEVLNAVNGISYQGGGTYTGAALDFVRQTMMVESAGDRTMSPNILVVATDGE--SSDD 1419 Query: 355 PLCHEILAKKLLPVVRYYSYIEITRRAHQTLWREYEHLQSTFDNFAMQHIRDQDDIYPVF 414 + + +V Y+ + TL + I Sbjct: 1420 QRTPAEVLRNAGTLV--YAVGIGAGVSSTTLLD-IAGYN--SRVLQATDFASLEVIGREL 1474 Query: 415 REL 417 +E Sbjct: 1475 QEF 1477 Score = 63.8 bits (153), Expect = 1e-08, Method: Composition-based stats. Identities = 21/181 (11%), Positives = 45/181 (24%), Gaps = 21/181 (11%) Query: 256 DVSGSMDQSTKDMAKRFYILLYLFL---SRTYKNVEVVYIRHHT-----QAKEVDEHEFF 307 D SGS+ ++ K+F L K V Y A + Sbjct: 2419 DGSGSVGADNFNLVKQFAKRLVDNFEISQTDTKVGVVQYSSSSNVEFYLNAFSTKQAVLD 2478 Query: 308 YSQE----TGGTIVSSALKLMDEVV--KERYNPAQWNIYAAQASDGDNWADDSPLCHEIL 361 GGT +A+ + + A + +DG++ + L Sbjct: 2479 AINAVTYQQGGTNTGAAITYTMQEIFASANGARANYPDVLIVVTDGES---SDDVAVPAL 2535 Query: 362 AKKLLPVVRYYSYIEITRRAHQTLWREYEHLQSTFDNFAMQHIRDQDDIYPVFRELFHKQ 421 + + + Y+ +Q + + + ++ Sbjct: 2536 SARNAGTL-IYAVGVGNG-VNQATLLQIA--GNAGQVLQAADFAGLTTVVQSLQQNLCDA 2591 Query: 422 N 422 Sbjct: 2592 A 2592 Score = 61.8 bits (148), Expect = 5e-08, Method: Composition-based stats. Identities = 20/174 (11%), Positives = 42/174 (24%), Gaps = 31/174 (17%) Query: 243 PDPSSQAVMFCLMDVSGSMDQSTKDMAKRFYILLYLFLS-RTYKNVEVVYIRHHTQAKEV 301 P A + L+D S S+ ++ K F + + + +N + Y Sbjct: 544 NLPYPGADLVFLLDGSASITSPNFELVKDFAERVARHFTISSSRNDNMSYRSFTAATNVG 603 Query: 302 DEHEFFYSQET-----------------------GGTIVSSALKLMDEVVKE--RYNPAQ 336 + GGT AL + + Sbjct: 604 AVQYSDTVRSEFFLSSFDTDFEVVRALDGISYLAGGTFTGFALDFVQQSAFSPVAGARDG 663 Query: 337 WNIYAAQASDGDNWADDSPLCHEILAKKLLPVVRYYSYIEITRRAHQTLWREYE 390 + +DG + D + A+K + ++ + Sbjct: 664 YPDILVVVTDG--VSQDDVVAPAESARK--EGIAVFAVGIG-SAVDYATLLQIA 712 Score = 61.1 bits (146), Expect = 9e-08, Method: Composition-based stats. Identities = 22/188 (11%), Positives = 45/188 (23%), Gaps = 23/188 (12%) Query: 249 AVMFCLMDVSGSMDQSTKDMAKRFYILLYLFL---SRTYKNVEVVYIR-----HHTQAKE 300 + L+D S S+ + K F + S + V Y + Sbjct: 135 LDLVFLVDGSSSVGSDNFETIKVFLEAITAGFEVSSSQTRVGVVQYSTGINTEFDLNSFA 194 Query: 301 VDEHEFFYSQE----TGGTIVSSALKLM-DEVVKERY-NPAQWNIYAAQASDGDNWADDS 354 + + G T + + E + +DG + DS Sbjct: 195 TEAEVINAIRGLSHQRGSTFTGAGITFTRLESFTGASGDRPDAPNVLIVITDG--ISADS 252 Query: 355 PLCHEILAKKLLPVVRYYSYIEITR-RAHQTLWREYEHLQSTFDNFAMQHIRDQDDIYPV 413 A+ + YS + D +D+ V Sbjct: 253 VDAPAEAAR--ADNITTYSIGIGDEINY----LTLLSIAGMRERVLNVTTFGDLNDLDEV 306 Query: 414 FRELFHKQ 421 ++ ++ Sbjct: 307 LLQILCER 314 Score = 60.3 bits (144), Expect = 1e-07, Method: Composition-based stats. Identities = 21/177 (11%), Positives = 46/177 (25%), Gaps = 21/177 (11%) Query: 256 DVSGSMDQSTKDMAKRFYILLYLFLS---RTYKNVEVVYIRHHTQAKEVDEHEFFY--SQ 310 D SGS+ ++ K F + + V Y + ++ + Sbjct: 2705 DGSGSVGSDNFNLLKAFTQNIVGNFDIAVNNTRVGVVQYSDFNNIEFNLNAYATEAEVLA 2764 Query: 311 E-------TGGTIVSSALKLMDEVVKERYNPAQWNI--YAAQASDGDNWADDSPLCHEIL 361 GGT +A+ + + V + + +DG+ S Sbjct: 2765 AIGAISYQRGGTFTGAAIDFVRQDVFTTAGGNRADKPDILLVLTDGE----SSDSVAGPA 2820 Query: 362 AKKLLPVVRYYSYIEITRRAHQTLWREYEHLQSTFDNFAMQHIRDQDDIYPVFRELF 418 L + Y+ + + +E + + I +E Sbjct: 2821 QNTLNAGITIYAVGIGSG-VNADTLQEIA--GDPGRVLQVADFQGLAAITNQLQEAL 2874 Score = 56.4 bits (134), Expect = 2e-06, Method: Composition-based stats. Identities = 21/196 (10%), Positives = 48/196 (24%), Gaps = 21/196 (10%) Query: 241 KRPDPSSQAVMFCLMDVSGSMDQSTKDMAKRFYILLYLFLSRTYKNVEVVYIRHHTQAKE 300 P + + L+D S S+ D+ K F + + + + +++ T Sbjct: 345 VEPCERLEIDVIFLIDGSSSISLLNFDLLKTFLQNITMKFDVSSDITRIGVVQYSTDVNT 404 Query: 301 VDE-----------HEFFYS-QETGGTIVSSALKLMDEVVKERYNPAQ--WNIYAAQASD 346 E H ++ G T + + + + + +D Sbjct: 405 EFELKTYATEAEVIHAISNITRQRGSTFIGAGINFVRTNSFTVAAGDRPLAPNILVTITD 464 Query: 347 GDNWADDSPLCHEILAKKLLPVVRYYSYIEITRRAHQTLWREYEHLQSTFDNFAMQHIRD 406 G + D A+ + E + F + + Sbjct: 465 G--ISADDVAGPAQAARDQGILTYSIGIGE---EIQWPTLLSIAGAR--HRVFNVTSFSE 517 Query: 407 QDDIYPVFRELFHKQN 422 I L + Sbjct: 518 LPGIEASLTALLCEVL 533 >UniRef50_UPI0001C161B1 von Willebrand factor, type A Precursor n=2 Tax=Nostocaceae RepID=UPI0001C161B1 Length = 474 Score = 81.9 bits (200), Expect = 4e-14, Method: Composition-based stats. Identities = 17/186 (9%), Positives = 43/186 (23%), Gaps = 18/186 (9%) Query: 247 SQAVMFCLMDVSGSMDQSTKDMAKRFYILLYLFLS-RTYKNVEVVYIRHH------TQAK 299 + L+D S SM K + + + V + T Sbjct: 48 KPQAIVLLIDTSSSMSDGKLAEVKTAASQFIQRRNLESDQIAVVNFGATVQTPAPLTNDI 107 Query: 300 EVDEHEFFYSQETGGTIVSSALKLMDEVVKERYNPAQWNIYAAQASDGDNWADDSPLCHE 359 + E G T + + + ++ N +DG + Sbjct: 108 NTLNNAIDQLLEIGSTPMGEGINTAQDQLQATTL----NKNIILFTDGLPDDPNFAYNSA 163 Query: 360 ILAKKLLPVVRYYSYIEITRRAHQTLWREYEHLQSTFDNFAMQHIRDQDDIYPVFRELFH 419 + + ++ + + + + D + + + Sbjct: 164 LSVRNA--GIKLIAVATGGADTNY-----LTQITGDRSLVFYANSGQFDQAFSQAEAVIY 216 Query: 420 KQNATA 425 KQ + Sbjct: 217 KQLIES 222 >UniRef50_Q14CN2 Calcium-activated chloride channel regulator 4, 30 kDa form n=30 Tax=Theria RepID=CLCA4_HUMAN Length = 919 Score = 81.9 bits (200), Expect = 4e-14, Method: Composition-based stats. Identities = 29/223 (13%), Positives = 65/223 (29%), Gaps = 21/223 (9%) Query: 196 AIISNSEPAQLLEEERLRKEIAELRAKIERVPFIDTFDLRYKNYEKRPDPSSQAVMFCLM 255 + P+ + R + + I SQ ++ ++ Sbjct: 253 KTHNQEAPSLQNIKCNFRSTWEVISNSEDFKNTIPMVTPPPPPV-FSLLKISQRIVCLVL 311 Query: 256 DVSGSMDQSTK--DMAKRFYILLYLFLSRTYKNVEVVYIRHHTQAKEVDE----HEFFYS 309 D SGSM + M + L + V + T ++ + E Sbjct: 312 DKSGSMGGKDRLNRMNQAAKHFLLQTVENGSWVGMVHFDSTATIVNKLIQIKSSDERNTL 371 Query: 310 QET------GGTIVSSALKLMDEVVKERYNPAQWNIYAAQASDGDNWADDSPLCHEILAK 363 GGT + S +K +V+ E ++ + +DG++ S K Sbjct: 372 MAGLPTYPLGGTSICSGIKYAFQVIGELHSQLDGSEV-LLLTDGEDNTASS---CIDEVK 427 Query: 364 KLLPVVRYYSYIEITRRAHQTLWREYEHLQSTFDNFAMQHIRD 406 + +V + + R A + + + + ++ Sbjct: 428 QSGAIVHFIALG---RAADEAVIE-MSKITGGSHFYVSDEAQN 466 >UniRef50_A2AX52 Collagen alpha-4(VI) chain n=12 Tax=Chordata RepID=CO6A4_MOUSE Length = 2309 Score = 81.9 bits (200), Expect = 4e-14, Method: Composition-based stats. Identities = 34/272 (12%), Positives = 73/272 (26%), Gaps = 35/272 (12%) Query: 170 LQNSLARRTAMTAGKRRELHALEENLAIISNSEPAQLLEEERLRKEIAELRAKIERVPFI 229 L ++ + A A R + ++N + P ++LR E+ K + Sbjct: 164 LVYAIGVKDASQAELREISSSPKDNFTFFVPNFPGLPGLAQKLRPELCSTLGKAAQYTER 223 Query: 230 DTFDLRYKNYEKRPDPSSQAVMFCLMDVSGSMDQSTKDMAKRFYILLYLFLSRTYKNVEV 289 + +S A + L+D S S+ K F + L V+V Sbjct: 224 E---------SPACSEASPADIVFLVDSSTSIGLQNFQKVKHFLHSVVSGLDVRSDQVQV 274 Query: 290 VYIRHHTQAKEVDEHEFFYSQ------------ETGGTIVSSALKLM----DEVVKERYN 333 +++ + + GGT SAL+ + + Sbjct: 275 GLVQYSDNIYPAFPLKQSSLKSAVLDRIRNLPYSMGGTSTGSALEFIRANSLTEMSGSRA 334 Query: 334 PAQWNIYAAQASDGDNWADDSPLCHEILAKKLLPVVRYYSYIEITRRAHQTLWREYEHLQ 393 +DG+ + D K V + + + ++ + Sbjct: 335 KDGVPQIVVLVTDGE--SSDEVQDVADQLK--RDGVFVFVVGINIQDVQE--LQKIANEP 388 Query: 394 STFDNFAMQHIRDQDDIYPVFRELFHKQNATA 425 F ++ I + + Sbjct: 389 FEEFLFTTENFS----ILQALSGTLLQALCST 416 Score = 76.1 bits (185), Expect = 3e-12, Method: Composition-based stats. Identities = 47/316 (14%), Positives = 86/316 (27%), Gaps = 39/316 (12%) Query: 125 LDLLFEDLALPNLKQNQQRQLTEYKTHRAGYTANGVPANISVVRSLQNSLARR----TAM 180 LD L +++ LP R + + ++ V + + L R A Sbjct: 719 LDFLRKEVFLPEKGSRPHRGVQQIAVVIIES------PSLDNVSTPASYLRRAGVTIYAA 772 Query: 181 TAGKRRELHALEENLAIISNSEPAQLLEEERLRKEIAELRAKIERVPFIDTFDLRYKNYE 240 E LE+ + +L +L +L+ K+ L E Sbjct: 773 GTQPASESKDLEKIVTYPPWKHAIRLESFLQLSVVGNKLKKKLCPEMLSGMPPLMSFIPE 832 Query: 241 KRPDPSS-------QAVMFCLMDVSGSMDQSTKDMAKRFYILLYLFL---SRTYKNVEV- 289 + +A ++ L+D SGS+ + K F + + V Sbjct: 833 STRQSTQEGCESVEKADIYFLIDGSGSIKPNDFIEMKDFMKEVIKMFHIGPDRVRFGVVQ 892 Query: 290 -------VYIRHHTQAKEVDEHEFFYS-QETGGTIVSSALKLMDEVVKERYNPAQWNIYA 341 + + Q GGT AL M V + Y Sbjct: 893 YSDKIISQFFLTQYASMAGLSAAIDNIQQVGGGTTTGKALSKMVPVFQNTARID-VARYL 951 Query: 342 AQASDGDNWADDSPLCHEILAKKLLPVVRYYSYIEITRRAHQTLWREYEHLQSTFDNFAM 401 +DG + D + + + R A+ T E + F Sbjct: 952 IVITDGQST--DPVAEAAQGLRDIGVNIYAIGV----RDANTTELEEIASKKM---FFIY 1002 Query: 402 QHIRDQDDIYPVFREL 417 + + V R++ Sbjct: 1003 EFDSLKSIHQEVIRDI 1018 Score = 63.8 bits (153), Expect = 1e-08, Method: Composition-based stats. Identities = 21/196 (10%), Positives = 52/196 (26%), Gaps = 25/196 (12%) Query: 245 PSSQAVMFCLMDVSGSMDQSTKDMAKRFYILLYLFLSRTYKNVEVVYIRHHTQAKEVDE- 303 S +A + L+D S S+ + K F + + +++ ++ + +E Sbjct: 1025 KSQKADIIFLIDGSESIAPKDFEKMKDFMERMVNQSNIGADEIQIGLLQFSSNPQEEFRL 1084 Query: 304 -------HEFFYS----QETGGTIVSSALKLMDEVVKERY-NPAQWNIYAAQASDGDNWA 351 Q + GT AL + + Y +DG + Sbjct: 1085 NRYSSKVDMCRAILSVQQMSDGTHTGKALNFTLPFFDSSRGGRPRVHQYLIVITDG--VS 1142 Query: 352 DDSPLCHEILAKKLLPVVRYYSYIEITRRAHQTLWREYEHLQSTFDNFAMQHIRDQDDIY 411 D+ + + + E + Q + + Sbjct: 1143 QDNVAPPAKALRDR----NIIIFAIGVGNVQRAQLLEITNDQDKVFQ------EENFESL 1192 Query: 412 PVFRELFHKQNATAKG 427 + + +++G Sbjct: 1193 QSLEKEILSEVCSSQG 1208 Score = 49.9 bits (117), Expect = 2e-04, Method: Composition-based stats. Identities = 17/158 (10%), Positives = 39/158 (24%), Gaps = 23/158 (14%) Query: 249 AVMFCLMDVSGSMDQSTKDMAKRFYILLYLFLSRTYKNVEVVYIRHHTQAK--------- 299 A + L+D S Q++ + F + L ++ ++ Q Sbjct: 429 ADVVFLIDTSQGTSQASFQWMQNFISRIIGILEVGQDKYQIGLAQYSDQGHTEFLFNTHK 488 Query: 300 ---EVDEHEFFYSQETGGTI-VSSAL----KLMDEVVKERYNPAQWNIYAAQASDGDNWA 351 E+ H GG+ L + + Y + G + Sbjct: 489 TRNEMVAHIHELLVFQGGSRKTGQGLRFLHRTFFQEAAGSRLLQGVPQYVVVITSGK--S 546 Query: 352 DDSPLCHEILAKKLLPVVRYYSYIEITRRAHQTLWREY 389 +D + +K + + + Sbjct: 547 EDEVGEVAQILRKRGVDIVSVGL----QDFDRAELEGI 580 Score = 49.1 bits (115), Expect = 3e-04, Method: Composition-based stats. Identities = 40/341 (11%), Positives = 77/341 (22%), Gaps = 31/341 (9%) Query: 59 ISEPMFHQGRGGLRHRVHPGNDHFVQNDRIERPQGGGGGSGSGQGQASQDGEGQDEFVFQ 118 + E RG H G +G G G+ +G D Sbjct: 1585 LGETGELGSRGEPGHPGPQGPRGRQGPPGFFGQKGDPGTQGNPGLPGPSGSKGPDGPRGL 1644 Query: 119 ISKDEYLDLLFEDLAL-PNLKQNQQRQLTEYKTHRAGYTA-NGVPANISVVRSLQNSLAR 176 + P + R G G P V N Sbjct: 1645 KGEVGPAGERGPRGQQGPRGQPGLFGPDGHGYPGRKGRKGEPGFPGYPGVQGEDGN---- 1700 Query: 177 RTAMTAGKRRELHALEENLAIISNSEPAQLLEEERLRKEIAELRAKIERVPFIDTFDLRY 236 K + + + + + + +R P +R Sbjct: 1701 -PGRGGEKGAKGIRGKRGNSGFPGLAGTPGDQGPPGKMGTKGSKGLADRTPCEIVDFVRG 1759 Query: 237 KNYEKRPD---PSSQAVMFCLMDVSGSMDQSTKDMAKRFYILLYLFL-------SRTYKN 286 P+ + +D+S + S + + + L + L + Sbjct: 1760 NCPCSTGISRCPAFPTEVVFTLDMSNDVAPSDFERMRNILLSLLMKLEMCESNCPTGARV 1819 Query: 287 VEVVYIR--------HHTQAKEVDEHEFFYS---QETGGTIVSSALKL-MDEVVKERYNP 334 V Y + K + +G + + ++ V K + Sbjct: 1820 AIVSYNTRTDYLVRLSDHRGKAALLQAVRKIPLERSSGSRNLGATMRFVARHVFKRVRSG 1879 Query: 335 AQWNIYAAQASDGDNWAD--DSPLCHEILAKKLLPVVRYYS 373 A G N+ S E+ A + V ++ Sbjct: 1880 LLVRKVAVFFQAGRNYDTASVSTATLELHAADIATAVVTFT 1920 >UniRef50_D2AUU0 Putative uncharacterized protein n=1 Tax=Streptosporangium roseum DSM 43021 RepID=D2AUU0_STRRD Length = 1330 Score = 81.9 bits (200), Expect = 5e-14, Method: Composition-based stats. Identities = 42/360 (11%), Positives = 88/360 (24%), Gaps = 35/360 (9%) Query: 6 DRRLNGKNKSMVNRQRFLRRYKAQIKQSISEAINKRSVTDVDSGESVSIPTEDISEPMFH 65 DRR + + + Q + + + ++G Sbjct: 907 DRRPARDGEEVEDTQEPPSPEAGAVTEDPRTPARPETAEVTEAGPGARRRFTPADRWRLL 966 Query: 66 QG------RGGLRHRVHPGNDHFVQNDRIERPQGGGGGSGSGQGQA-----SQDGEGQDE 114 G G R ++ + G G E +E Sbjct: 967 LGRESERLPAGARSYARALDELYGTGRGEGAGHFGQSPGEGGDSGGRGDSFPTAREWAEE 1026 Query: 115 FVFQISKDEYLDLLFEDLA------LPNLKQNQQRQLTEYKTHRAGYTANGVPANISVVR 168 + ++L L L R E T ++ +R Sbjct: 1027 LEVLFGTEVREEVLARAADAGRTDVLTELDPAAVRPSVELLTSVLTLAGGLPEQRLAKLR 1086 Query: 169 SLQNSLARRTAMTAGKRRELHALEENLAIISNSEPAQLLEEERLRKEIAELRAKIERVPF 228 L L R + ++ LR + R + Sbjct: 1087 PLVRRLVAELTRELATRLRPALTGLATPRPTRRPGGRIDLPRTLRANLQHTRRMGDGRLV 1146 Query: 229 IDTFDLRYKNYEKRPDPSSQAVMFCLMDVSGSMDQSTKDMAKRFYILLYLFLSRTYKNVE 288 + + R + + ++DVSGSM+ S A +L + Sbjct: 1147 VVPERPVF---STRARREADWRLILVVDVSGSMEASVVWSALTAAVLA------GVPTLS 1197 Query: 289 VVYIRHHTQAKEVDEHEFFYS------QETGGTIVSSALKLMDEVVKERYNPAQWNIYAA 342 ++ T+ ++ + + GGT +++ L +V P++ + Sbjct: 1198 THFLSFSTEVIDLTDRVADPLSLLLEVRVGGGTHIAAGLAHARSLV---TVPSRTLVVVV 1254 >UniRef50_A0BS51 Chromosome undetermined scaffold_124, whole genome shotgun sequence n=1 Tax=Paramecium tetraurelia RepID=A0BS51_PARTE Length = 947 Score = 81.9 bits (200), Expect = 5e-14, Method: Composition-based stats. Identities = 29/265 (10%), Positives = 67/265 (25%), Gaps = 26/265 (9%) Query: 178 TAMTAGKRRELHALEENLAIISNSEPA-----QLLEEERLRKEIAELRAKIERVPFIDTF 232 K+R L+ + + + A + LE + + + + Sbjct: 671 IPPGILKQRILYEKGLFIKRYDSIKKAAFIFTECLETSKFYDPEIRINCLKQLKEIFQSQ 730 Query: 233 DLRYKNYEKRPDPS-----SQAVMFCLMDVSGSMDQSTKDMA-KRFYILLYLFLSRTYKN 286 +L YK + + ++D SGSM+ K++A + +L + Sbjct: 731 NLLYKVPKIEQLLELNEIKKNNDIVFVIDHSGSMENIKKELAINGILKIFDNYLQDQDRI 790 Query: 287 VEVVYI------------RHHTQAKEVDEHEFFYSQETGGTIVSSALKLMDEVVKERYNP 334 + + +T + G T + SA+ + ++ Sbjct: 791 SYMRFNQNIEVIFDLTSKSENTAYLRSAIERSKNIRAEGMTAMLSAVLHAYSIHEKAVKK 850 Query: 335 AQWNIYAAQASDGDNWADDSPLCHEILAKKLLPVVRYYSYIEITR--RAHQTLWREYEHL 392 + DG++ + P + + L Sbjct: 851 D-NQQWIVVLCDGEDNLSNITYERMKKFTSKRPQISLIVIGIGLSLKPDCLDELYDLCRL 909 Query: 393 QSTFDNFAMQHIRDQDDIYPVFREL 417 + D D + L Sbjct: 910 SQKGFLIESVYSEDLDIAFQSISNL 934 >UniRef50_Q6AQK6 Conserved hypothetical membrane protein (BatA) n=1 Tax=Desulfotalea psychrophila RepID=Q6AQK6_DESPS Length = 328 Score = 81.9 bits (200), Expect = 5e-14, Method: Composition-based stats. Identities = 28/216 (12%), Positives = 47/216 (21%), Gaps = 44/216 (20%) Query: 241 KRPDPSSQAVMFCLMDVSGSMDQSTK----------DMAKRFYILLYLFLSRTYKNVEVV 290 R SS + +DVSGSM ++ K V Sbjct: 79 TREIKSSGIDILLAVDVSGSMQAMDFTLNGKRTNRLEVVKDVMAKFISQ-RPNDSIGLVA 137 Query: 291 YIRHHTQAKEVDEHEFF------YSQET---GGTIVSSALKLMDEVVKERYNPAQWNIYA 341 + + GT + SA+ ++E+ +P + Sbjct: 138 FAGRPYVVCPPTLDHNWLTLRLHSLSIGMIEDGTAIGSAIGTGVNRLREKKSP---SQII 194 Query: 342 AQASDGDNWAD-DSPLCHEILAKKLLPVVRYYSYIEITR-------------------RA 381 +DG N A PL AK V Sbjct: 195 ILLTDGINNAGKVPPLIAAEAAKSFKVKVYTIGAGTRGEAPIPITDAFGRRQLVRARVDI 254 Query: 382 HQTLWREYEHLQSTFDNFAMQHIRDQDDIYPVFREL 417 + + F + +Y + Sbjct: 255 DDKTLSKVAQITGARY-FRATDTESLEKVYAEINSM 289 >UniRef50_B1J572 von Willebrand factor type A n=14 Tax=Pseudomonadaceae RepID=B1J572_PSEPW Length = 358 Score = 81.9 bits (200), Expect = 5e-14, Method: Composition-based stats. Identities = 30/222 (13%), Positives = 56/222 (25%), Gaps = 39/222 (17%) Query: 240 EKRPDPSSQAVMFCLMDVSGSMDQSTK----------DMAKRFYILLYLFLSRTYKNVEV 289 + P +S + +DVSGSMD D+ K + + Sbjct: 81 DPVPVAASGRDLLVAVDVSGSMDFPDMQWKDEEVSRLDLVKALLGDFLQD-REGDRVGLI 139 Query: 290 VYIRHH------TQAKEVDEHEFFYSQ---ETGGTIVSSALKLMDEVVKERYNPAQWNIY 340 ++ T + +Q T + A+ L + +++R + Sbjct: 140 LFGSQAYLQAPLTFDRRTVRTFLDEAQIGIAGKNTAIGDAIGLAVKRLRQRP---AQSRV 196 Query: 341 AAQASDGDNWADD-SPLCHEILAKKLLPVVRYYSYIEITR--------------RAHQTL 385 +DG N PL LA + + + Sbjct: 197 LVLITDGANNGGRIHPLTAARLAAQEDVRIYTIGIGANPEASGTPGLLGLNPSLDLDEAS 256 Query: 386 WREYEHLQSTFDNFAMQHIRDQDDIYPVFRELFHKQNATAKG 427 +E L F + D I +L + Sbjct: 257 LKEIADLTHGAY-FRAHDGAELDAIGDTLDQLEPVAQQPTQA 297 >UniRef50_B4D3H1 von Willebrand factor type A n=1 Tax=Chthoniobacter flavus Ellin428 RepID=B4D3H1_9BACT Length = 341 Score = 81.9 bits (200), Expect = 5e-14, Method: Composition-based stats. Identities = 31/220 (14%), Positives = 53/220 (24%), Gaps = 44/220 (20%) Query: 243 PDPSSQAVMFCLMDVSGSM----------DQSTKDMAKRFYILLYLFLSRTYKNVEVVYI 292 +S + +DVSGSM S D+ K+ + + + Sbjct: 88 QVQASGIDIMLALDVSGSMIAEDFTIGGERASRVDVVKQVTQKFIEA-RPNDRIGMIAFA 146 Query: 293 RHH------TQAKEVDEHEFFYSQET---GGTIVSSALKLMDEVVKERYNPAQWNIYAAQ 343 T + GT + SA+ + ER + Sbjct: 147 ARPYLVSPLTLDHGWLIQNLDRVKLGLVEDGTAIGSAIASCTTRLIER--KDSKSRIVVL 204 Query: 344 ASDGDNWAD-DSPLCHEILAKKLLPVVRYYSYIEIT--------------------RRAH 382 +DGDN A SPL A L V Sbjct: 205 LTDGDNNAGKVSPLTAAEAASALGVKVYTIGAGTKGFAPMPVGRDVFGRKVYQNVKVDVD 264 Query: 383 QTLWREYEHLQSTFDNFAMQHIRDQDDIYPVFRELFHKQN 422 + ++ + + + IY +L + Sbjct: 265 EDTLKKIADMTKAKF-YRATDTKSLTQIYEEIDQLEKTKV 303 >UniRef50_Q7JMF9 Protein T24F1.6b, partially confirmed by transcript evidence n=4 Tax=Caenorhabditis RepID=Q7JMF9_CAEEL Length = 1067 Score = 81.9 bits (200), Expect = 5e-14, Method: Composition-based stats. Identities = 44/338 (13%), Positives = 92/338 (27%), Gaps = 36/338 (10%) Query: 124 YLDLLFEDLALPNLKQNQQRQLTEYKTHRAGYTANGVPAN----ISVVRSLQN---SLAR 176 + + + A+P + + + + + +N V N I + + R Sbjct: 102 FDEYDDQAYAVPQADKRCEAYMKKMNESDMHFVSNMVEHNSKSGIHITVESYQCDPRVMR 161 Query: 177 RTAMTAGKRRELHALEENLAIISNSEPAQLLEEERLRKEIAELRAKIERVPFIDTFDLRY 236 T K E + R ID FD R+ Sbjct: 162 DFDWTGTKHLEKTMSDNKEKAPEMGHQYIGTYSGLTRMYPRRHWKVEPTPITIDLFDPRF 221 Query: 237 KNYEKRPDPSSQAVMFCLMDVSGSMDQSTKDMAKRFYILLYLFLSRTYKNVEVVYIRH-- 294 + + S + L+D SGS+ T + K + + LS V + H Sbjct: 222 RPW-FVNAESVPKDIVFLLDYSGSVKGPTMHLIKITMMYILSTLSPNDYFFGVYFNNHFN 280 Query: 295 -------------HTQAKEVDEHEFFYSQETGGTIVSSALKLMDEVVKERYNPA------ 335 T K+V E +E ++ LK +V++ + Sbjct: 281 PIISCANRTFMPATTSNKKVFFEELGMLEEKDQAHFATPLKFSLDVLRGNLDSNQSLFAD 340 Query: 336 ---QWNIYAAQASDGDNWADDSPLCHEILAKKLLPVVRYYSYIEITRRAHQTLWREYEHL 392 + + +DG + L E + ++R + + + L + Sbjct: 341 YRSEGHKLLIIFTDGVDEWPHQILD-EEFQTRNSELIRIFGFSMGYGTSLLPLQQYMACK 399 Query: 393 QSTFDNFAMQHIRDQDD---IYPVFRELFHKQNATAKG 427 + + + I V ++ + Sbjct: 400 SHGGYSEIDSIMDVKPQSRTIQNVLSQVRGDELKGTNA 437 >UniRef50_C0Z8R3 Hypothetical membrane protein n=1 Tax=Brevibacillus brevis NBRC 100599 RepID=C0Z8R3_BREBN Length = 424 Score = 81.5 bits (199), Expect = 5e-14, Method: Composition-based stats. Identities = 27/186 (14%), Positives = 49/186 (26%), Gaps = 22/186 (11%) Query: 243 PDPSSQAVMFCLMDVSGSMDQSTKD-MAKRFYILLYLFLSRTYKNVEVVYIRHHTQAKEV 301 S + ++D SGSM S D + + + V + + + Sbjct: 108 QQASGANNIVMVLDTSGSMQSSDPDNQLFKAAADMVQRMDSDMNIAVVTFHDQTNVLQPL 167 Query: 302 DEHEFFYS------------QETGGTIVSSALKLMDEVVKERYNPAQWNIYAAQASDGDN 349 E + GGT + AL+ + ++ N SDG Sbjct: 168 TELSSQSVKDEVVKKLLQFPRTDGGTRIDLALQAGLDQLQAN---QMANSTVVLMSDGY- 223 Query: 350 WADDSPLCHEILAKKLLPVVRYYSYIEITRRAH-QTLWREYEHLQSTFDNFAMQHIRDQD 408 LA V ++ A L ++ F ++H Sbjct: 224 ---SDLDVPAALAPYKQNQVIVHTVGMSQIDADGTALLQKIAAETGGSY-FNVEHADQMT 279 Query: 409 DIYPVF 414 I+ Sbjct: 280 GIFGQI 285 >UniRef50_C8NJ92 Secreted Mg-chelatase subunit n=3 Tax=Corynebacterium RepID=C8NJ92_COREF Length = 530 Score = 81.5 bits (199), Expect = 5e-14, Method: Composition-based stats. Identities = 31/269 (11%), Positives = 66/269 (24%), Gaps = 38/269 (14%) Query: 179 AMTAGKRRELHALEENLAIISNSEPAQLLEEERLRKEIAELR--AKIERVPFIDTFDLRY 236 +T + ++ A+ + R+ + + F+ Sbjct: 265 PLTPLATADAETNQQVEALADWMLDHPEHLTDTFRRPVDPMAILPPELAQAFVIEQPFPG 324 Query: 237 KNYEKRPD-------PSSQAVMFCLMDVSGSMDQSTKDMAKRFYILLYL----------F 279 ++DVSGSM + ++ + + + Sbjct: 325 DRAVTDALISAYNNDLRVPGDTTFVLDVSGSMAGTRMELLRSTMLEMISGEASSLTGDVS 384 Query: 280 LSRTYKNVEVVYIRHHTQAKEVDEHE------------FFYSQETGGTIVSSALKLMDEV 327 L + + + E Q GGT + AL E Sbjct: 385 LRERENVTIIPFNFSPGEPITATVDEVGGPQRQELVDGVTALQAEGGTGIYDALLRAYEQ 444 Query: 328 VKERYNPAQWNIYAAQASDGDNWADDSPLCHEILAKKLL-PVVRYYSYIEITRRAHQTLW 386 V+ P +DG+ + S + L +L R ++ + A+ T Sbjct: 445 VE----PGASIPSIVLMTDGEQTSGLSFGHFQRLYSELPTEKKRIPVFVILYGEANITEM 500 Query: 387 REYEHLQSTFDNFAMQHIRDQDDIYPVFR 415 L ++ + R Sbjct: 501 ENLAGLTGGKTF--DAMNGGLEEAFKEIR 527 >UniRef50_A8MJ77 Magnesium chelatase n=2 Tax=Clostridiales RepID=A8MJ77_ALKOO Length = 629 Score = 81.5 bits (199), Expect = 5e-14, Method: Composition-based stats. Identities = 44/342 (12%), Positives = 94/342 (27%), Gaps = 26/342 (7%) Query: 97 GSGSGQGQASQDGEGQDEFVFQISKDEYLDLLFEDLALPNLKQNQQRQLTEYKTHRAGYT 156 + +A + S D+ + D + + + Sbjct: 297 SDSLAENRAEESHHQNHSGEIDSSADKPISNGDYDGDSEHGD--SSAEGSVSSHTVEDIE 354 Query: 157 ANGVPANISVVRSLQNSLARRTAMTAGKRRELHALEENLAIISNSEPAQLLEEERLRKEI 216 A G ++ +V R +GKR ++ + I P ++ Sbjct: 355 AFGEDLSMDMVWQS-----RFAVKGSGKRNKVKTDSKEGRYIRYRIPKGRPKDIAFDATF 409 Query: 217 AELRA-KIERVPFIDTFDLR-YKNYEKRPDPSSQAVMFCLMDVSGSMDQSTKDMAKR--F 272 + R + +R EK + + A + ++D SGSM + A + Sbjct: 410 RIAACSQGGRNREGLSLVIRSGDIREKVREKHTGATILFVVDASGSMGAKRRMGAVKGAV 469 Query: 273 YILLYLFLSRTYKNVEVVY-------IRHHTQAKEVDEHEFFYSQETGGTIVSSALKLMD 325 LL + + + + + T++ ++ + G T ++S L Sbjct: 470 LSLLNDAYQKRDNVGIIAFRKDGADTLLNITRSVDLAQKCLTNLPTGGKTPLASGLYKAY 529 Query: 326 EVVK-ERYNPAQWNIYAAQASDGD-NWADDSPLCHEILA----KKLLPVVRYYSYIEITR 379 E++K +R A Y SDG N S E K ++ Sbjct: 530 ELLKIDRIKNADALQYIVLVSDGKGNVPLFSENAIEDAYHVGEKIRNENIKSMVLDTENG 589 Query: 380 RAHQTLWREYEHLQSTFDNFAMQHIRDQDDIYPVFRELFHKQ 421 ++ + + +I + Sbjct: 590 YIQLGFAKKLAEKMD--SAYIKMNHISSKEIEDNVKGFIRTV 629 >UniRef50_C3XQR7 Putative uncharacterized protein n=1 Tax=Branchiostoma floridae RepID=C3XQR7_BRAFL Length = 1460 Score = 81.5 bits (199), Expect = 6e-14, Method: Composition-based stats. Identities = 26/218 (11%), Positives = 55/218 (25%), Gaps = 46/218 (21%) Query: 223 IERVPFIDTFDLRYKNYEKRPDPSSQAVMFCLMDVSGSMDQST-------KDMAKRFYIL 275 + + D R++N+ + + +MDVSGSM + ++AK+ + Sbjct: 1139 PKNDRSCEGNDHRFRNWYVSAASPKKKNVVIVMDVSGSMREPPGPEEQNRLNLAKQAALT 1198 Query: 276 LYLFLSRTYKNVEVVYIRHHT---------------QAKEVDEHEFFYSQETGGTIVSSA 320 + L+ V + + + T+ Sbjct: 1199 VLDTLTPRDWGGVVSFSARAETPEGCLGDSLGEANPTNIGIMKDFINQRVPETITMYGVG 1258 Query: 321 LKLMDEVVKERYNPAQWN-----IYAAQASDGDNWADDSPLCHEILAKKLLPV-VRYYSY 374 + ++ E N SDG D L ++L+ V ++Y Sbjct: 1259 FRKAFDMFAEARNKKPEQFEDCYNIIIFLSDGSPTDKDFALNAITQGQELMDRSVYIFTY 1318 Query: 375 IEI----------TRRAH--------QTLWREYEHLQS 394 + R + Sbjct: 1319 GLGANLMWASSQWAPDPNNQYVYLPALDFLRTIADQNN 1356 Score = 78.8 bits (192), Expect = 3e-13, Method: Composition-based stats. Identities = 23/222 (10%), Positives = 54/222 (24%), Gaps = 53/222 (23%) Query: 223 IERVPFIDTFDLRYKNYEKRPDPSSQAVMFCLMDVSGSMDQST--------KDMAKRFYI 274 + + R++N+ + + +MDVSGSM + ++AK+ + Sbjct: 182 PKSDRSCEGNGHRFRNWYVSAASPKKKNVVIVMDVSGSMREPHGVPEEQNRLNLAKQAAL 241 Query: 275 LLYLFLSRTYKNVEVVYIRHHT---------------QAKEVDEHEFFYSQETGGTIVSS 319 + L+ V + + + T+ + Sbjct: 242 TVLDTLTPRDWAGVVSFSARAKAPEGCLGDSLGEANPTNIGIMKDFINQRVPETITVYAE 301 Query: 320 ALKLMDEVVKERYNPA-----QWNIYAAQASDGDNWAD----DSPLCHEILAKKLLPVVR 370 K + E N +DG D + + L ++ V Sbjct: 302 GFKKAFNMFFESKNKKPEQFEDCQNIIIFLTDGQPTDTYFTLDDIVKGQDLMER---SVH 358 Query: 371 YYSYIEI-------TRRAHQ-----------TLWREYEHLQS 394 ++Y + + + Sbjct: 359 IFTYGLGANLQWANSGWYNDPSRPWVQLPALDFLATIADQNN 400 >UniRef50_UPI0001AEBBE9 von Willebrand factor, type A n=1 Tax=Alteromonas macleodii ATCC 27126 RepID=UPI0001AEBBE9 Length = 358 Score = 81.5 bits (199), Expect = 6e-14, Method: Composition-based stats. Identities = 28/214 (13%), Positives = 54/214 (25%), Gaps = 42/214 (19%) Query: 240 EKRPDPSSQAVMFCLMDVSGSMDQSTKDM----------AKRFYILLYLFLSRTYKNVEV 289 E P+ M +D+SGSM + K + + Sbjct: 79 EPVSIPNEGREMMLAVDLSGSMKIDDMQLNGRQVNRLTMTKSVVYDFIQR-RVGDRIGLI 137 Query: 290 VYIRHH------TQAKEV----DEHEFFYSQETGGTIVSSALKLMDEVVKERYNPAQWNI 339 ++ T ++ T + A+ L + ER + N Sbjct: 138 LFADTAYVQAPLTYDRDTVSTLLSEAVIGL-VGEQTAIGDAIGLAVKRFDER---EESNN 193 Query: 340 YAAQASDGDNWADD-SPLCHEILAKKLLPVVRYYSYIEI---------------TRRAHQ 383 +DG N A + +P + LA V ++ + Sbjct: 194 VLILLTDGQNTAGNITPEQAKELAISKGVKVYTIGVGADKMLIQSFFGSRQINPSQELDE 253 Query: 384 TLWREYEHLQSTFDNFAMQHIRDQDDIYPVFREL 417 + F ++ ++ IY L Sbjct: 254 GMLTNIATSTGGQY-FRARNAQELQAIYQQLDAL 286 >UniRef50_A1SAA4 Uncharacterized protein containing a von Willebrand factor type A (VWA) domain n=1 Tax=Shewanella amazonensis SB2B RepID=A1SAA4_SHEAM Length = 713 Score = 81.5 bits (199), Expect = 6e-14, Method: Composition-based stats. Identities = 46/417 (11%), Positives = 102/417 (24%), Gaps = 39/417 (9%) Query: 19 RQRFLRRYKAQIKQSISEAINKRSVTDVDSGESVSIPTEDISEPMFHQGRGGLRHRVHPG 78 RQ+ Y+ Q Q + A+ ++ + + + + L Sbjct: 109 RQKAREIYEDQKAQGNATALTEKDDYKRFDMRVFPVQPQQSVKVRLVYMQDAL------- 161 Query: 79 NDHFVQNDRIERPQGGGGGSGSGQGQASQDGEGQDEFVFQISKDEYLDLLFEDLALPNLK 138 + G + E F + + + LP+ Sbjct: 162 -LDHGVGRYLYP---LEEGGVDEARDSFWQRNAVVEQDFSFNVTLRSAYPVDGVRLPSHP 217 Query: 139 QNQQRQLTEYKTHRAGYTANGVPANISVVRSLQNSLARRTAMTAGKRRELHALEENLAI- 197 + + Q + + P ++ S T+ A + Sbjct: 218 EAEIHQEMQAGAALWRASLTSQPESLHTGAIESVSANEDTSQGAESGAAGEGKASHQPRA 277 Query: 198 --ISNSEPAQLLEEERLRKEIAELRAKIERVPFIDTFDLRYKNYEKRPDPSSQAVMFCLM 255 + +E L + + + +V T L + + + ++ Sbjct: 278 MGLDKDIVFYWRLQEGLPGRVDMVTYRDPKVSTKGTVKLTFTPGDDLGPVTQGRDWVFVL 337 Query: 256 DVSGSMDQSTKDMAKRFYILLYLFLSRTYKNVEVVYIRH-----------HTQAKEVDEH 304 D SGSM+ + L + +++ + Sbjct: 338 DKSGSMNGKYATLV-EGVRQGLGKLPAQDRFRIILFDESTQEFSKGFVPVDSNNINQALA 396 Query: 305 EFFYSQETGGTIVSSALKLMDEVVKERYNPAQWNIYAAQASDG-DNWADDSPLCHEILAK 363 GT + LK + + +DG N L + Sbjct: 397 WVEGISPGNGTDLYQGLKRALTPLDADRSTG-----VVLITDGVANVGVTEKRRFLELMQ 451 Query: 364 KLLPVVRYYSYIEITRRAHQTLWREYEHLQSTFDNFAMQHIRDQDDIYPVFRELFHK 420 + VR +++I A+ L L + + + DDI + K Sbjct: 452 QQ--DVRLFTFIMGNS-ANTPLLVPMTRLSNG----VATSVSNADDIVGHLMNITSK 501 >UniRef50_A1VWQ4 von Willebrand factor, type A n=20 Tax=Proteobacteria RepID=A1VWQ4_POLNA Length = 350 Score = 81.5 bits (199), Expect = 6e-14, Method: Composition-based stats. Identities = 23/161 (14%), Positives = 41/161 (25%), Gaps = 17/161 (10%) Query: 247 SQAVMFCLMDVSGSMDQSTKDMAKRFYILLYLFLSRTYKNVEVVYIR---HHTQAK---- 299 + +F ++D S SM + + + L + +E V+ A+ Sbjct: 2 RRLPVFFVLDCSESMVGANLKKMEGAVAAIVKSLRTDPQALETVFFSVIAFAGVARTIAP 61 Query: 300 --EVDEHEFFYSQETGGTIVSSALKLMDEVVKER------YNPAQWNIYAAQASDGDNWA 351 E+ GGT + SAL + + W +DG Sbjct: 62 LVEIVSFYPPKLPLGGGTNLGSALDALMGEIDRSVIKTTAERKGDWRPIIYLVTDGRPTD 121 Query: 352 DDSPLCHEILAKKLLPVVRYYSYIEITRRAHQTLWREYEHL 392 + S E + T R Sbjct: 122 NPS-RAIERWNSHYAKKATLIAIGLGRS-VDFTALRRLTEN 160 >UniRef50_C1F7S6 Putative uncharacterized protein n=1 Tax=Acidobacterium capsulatum ATCC 51196 RepID=C1F7S6_ACIC5 Length = 339 Score = 81.5 bits (199), Expect = 6e-14, Method: Composition-based stats. Identities = 24/202 (11%), Positives = 49/202 (24%), Gaps = 16/202 (7%) Query: 229 IDTFDLRYKNYEKRPDPSSQAVMFCLMDVSGSMDQSTKDMAKRFYILLYLFLSRTYKNVE 288 +D Y + L+D S S+ + + + L L Sbjct: 95 LDDHRAPAAVYSFTQQTQLPLRLGILVDTSTSIRERFQFEQQAVTNFLLQVLRPKTDEAF 154 Query: 289 V-------VYIRHHTQAKEVDEHEFFYSQETGGTIVSSALKLMD--EVVKERYNPAQWNI 339 V +I + + + GGT + A+ +++ P Sbjct: 155 VEGFDEAPNFILNWSNNLDTLSSAIQDLHPGGGTALYDAVYSACRDKLLNAASGPIYVRR 214 Query: 340 YAAQASDGDNWADDSPLCHEILAKKLLPVVRYYSYIEITR---RAHQTLWREYEHLQSTF 396 SDGD+ + + + Y+ T + R+ Sbjct: 215 AIILVSDGDDN-QSHAYLTDAIKECQRAQTAIYAVSTDTDPTPDPGDDILRKMAEETGGR 273 Query: 397 DNF---AMQHIRDQDDIYPVFR 415 F + + R Sbjct: 274 AFFPRVITNLPASFNSVEDELR 295 >UniRef50_B8G7S2 Magnesium chelatase n=9 Tax=cellular organisms RepID=B8G7S2_CHLAD Length = 696 Score = 81.5 bits (199), Expect = 6e-14, Method: Composition-based stats. Identities = 45/385 (11%), Positives = 91/385 (23%), Gaps = 29/385 (7%) Query: 37 AINKRSVTDVDSGESVSIPTEDISEPMFHQGRGGLRHRVHPGNDHFVQNDRIERPQGGGG 96 + ++ + D R + + Sbjct: 314 LAAELALPHRMRRQPFGEVKLDEQRMATILEHCSKRAEEIRAQAEVKKKPDLNDDGSNND 373 Query: 97 GSGSGQGQASQDGEGQDEFVFQISKDEYLDLLFEDLALPNLKQNQQRQLTEYKTHRAGYT 156 G+ QG S + + + + ++ R T Sbjct: 374 EGGNEQGGGSTTVPVGGAGTPETATPANEQATGG---------TEHQAGDVFRPRRLETT 424 Query: 157 ANGVPANISVVRSLQNSLARRTAMTAGKRRELHALEENLAIISNSEPAQLLEEERLRKEI 216 + RS + ++ +R A + + Q L I Sbjct: 425 PDRTQRRAPGRRSRSRTTRKQGRYITSRRAARVTDLALDATLREAAIYQRKRRMELMHTI 484 Query: 217 AELRAKIERVPFIDTFDLRYKNYEKRPDPSSQAVMFCLMDVSGSMDQSTKDMAKRF--YI 274 + I DLR K ++ + ++D S SM + A + Sbjct: 485 D-TPYRRRPKIVIKRSDLRQK----VRVRRTRNAVCFVVDASWSMAAEERMQATKAAVLS 539 Query: 275 LLYLFLSRTYKNVEVVY-------IRHHTQAKEVDEHEFFYSQETGGTIVSSALKLMDEV 327 LL R + V + + T + E+ + G T +S L E+ Sbjct: 540 LLRDAYQRRDQVGLVSFQRDYARVLLPLTNSVELAQRRLQSMPTGGKTPLSRGLLTAFEL 599 Query: 328 VKE-RYNPAQWNIYAAQASDGD-NWADDS---PLCHEILAKKLLPV-VRYYSYIEITRRA 381 ++ R A+ +DG N + +A+ + ++ Sbjct: 600 LERARRRDAEVVPLMVLLTDGQANVSISDLPPQQEAYRIAEMIADRQIQAIVIDTEHPHF 659 Query: 382 HQTLWREYEHLQSTFDNFAMQHIRD 406 L R D Sbjct: 660 DHGLSRRLAERLRGIYYRLEDLQDD 684 >UniRef50_C1YR26 Uncharacterized protein containing a von Willebrand factor type A (VWA) domain n=1 Tax=Nocardiopsis dassonvillei subsp. dassonvillei DSM 43111 RepID=C1YR26_NOCDA Length = 505 Score = 81.1 bits (198), Expect = 7e-14, Method: Composition-based stats. Identities = 32/187 (17%), Positives = 47/187 (25%), Gaps = 13/187 (6%) Query: 249 AVMFCLMDVSGSMDQSTKDMAKRFYILLYLFLSRTYKNVEVVYIRHHT--------QAKE 300 A + ++D SGSM D A R + L L+ + V + + K Sbjct: 40 ATLQVVLDRSGSMGGGRLDGAVRALLSLVERLAPSDNFGLVSFNDQARVEVPCGPLEDKA 99 Query: 301 VDEHEFFYSQETGGTIVSSALKLMDEVVKERYNPAQWNIYAAQASDGD-NWADDSPLCHE 359 +GGT +SS L + + SDG N Sbjct: 100 RVRRLISGLHASGGTDLSSGLLRGVQEARRAGADRGGT--LLLISDGHANQGVTDHDLLR 157 Query: 360 ILAKKL-LPVVRYYSYIEITRRAHQTLWREYEHLQSTFDNFAMQHIRDQDDIYPVFRELF 418 +A V S + L + FA I L Sbjct: 158 QVAADAYAHGVTTTSLGYGLG-YDEELLGAVADGGAGSALFAEDPDTAGGLIAREAEYLL 216 Query: 419 HKQNATA 425 K Sbjct: 217 AKTAQAV 223 >UniRef50_D2PMF8 von Willebrand factor type A n=1 Tax=Kribbella flavida DSM 17836 RepID=D2PMF8_9ACTO Length = 654 Score = 81.1 bits (198), Expect = 7e-14, Method: Composition-based stats. Identities = 27/215 (12%), Positives = 52/215 (24%), Gaps = 34/215 (15%) Query: 234 LRYKNYEKRPDPSSQAVMFCLMDVSGSM-------DQSTKDMAKRFYILLYLFLSRTYKN 286 L + N + + ++D SGSM + D AKR + L + Sbjct: 18 LTFVNASTAAAAGELSPVMVVLDSSGSMTARDAGGSGTRMDAAKRAVGSMVDGLPAGAQV 77 Query: 287 VEVVYIRHHT---------------------QAKEVDEHEFFYSQETGGTIVSSALKLMD 325 +Y K + ++ +G T + AL+ Sbjct: 78 GLAIYGAGTGSSGAEKVAGCKDVRVVQPVGPVNKPALKRAVTATKASGYTPIGQALRTAA 137 Query: 326 EVVKERYNPAQWNIYAAQASDGDNW-ADDSPLCHEILAKKLLPVVRYYSYIEITRRAHQT 384 + P + SDG++ A P K + ++ + Sbjct: 138 AQL-----PKEGQRSIVLVSDGEDTCAPPQPCEVAKELSKQGVDLHVHTIGFRVDAKARA 192 Query: 385 LWREYEHLQSTFDNFAMQHIRDQDDIYPVFRELFH 419 + A + V Sbjct: 193 QLACIAQNTGGTYHDASDADSLLGVLGRVTERALR 227 >UniRef50_Q8T5C2 Proximal thread matrix protein 1 n=3 Tax=Mytilus RepID=Q8T5C2_MYTGA Length = 453 Score = 81.1 bits (198), Expect = 7e-14, Method: Composition-based stats. Identities = 29/190 (15%), Positives = 49/190 (25%), Gaps = 25/190 (13%) Query: 248 QAVMFCLMDVSGSMDQST---KDMAKRFYILLYLFL----SRTYKNVEVVYIRHHTQ--- 297 A + + D S S++ + + K F + + V + T+ Sbjct: 251 HADIAFVFDASSSINANNPNNYQLMKNFMKDIVDRFNKTGPDGTQFAVVTFADRATKQFG 310 Query: 298 -----AKEVDEHEFFYSQET--GGTIVSSALKLM-DEVVKERYNPAQ--WNIYAAQASDG 347 +K + + G T + L+ EV R + +DG Sbjct: 311 LKDYSSKADIKGAIDKVSPSIIGQTAIGDGLENARLEVFPNRNGGGREEVQKVVILLTDG 370 Query: 348 DNWADDSPLCHEILAKKLLPVVRYYSYIEITRRAHQTLWREYEHLQSTFDNFAMQHIRDQ 407 N SP L +K V + T L S F Sbjct: 371 QNNGHKSPEHESSLLRK--EGVVIVAIGVGTGFLKSELI-NIA--SSEEYVFTTSSFDKL 425 Query: 408 DDIYPVFREL 417 I +L Sbjct: 426 SKIMEDVVKL 435 Score = 68.0 bits (164), Expect = 6e-10, Method: Composition-based stats. Identities = 26/201 (12%), Positives = 46/201 (22%), Gaps = 28/201 (13%) Query: 240 EKRPDPSSQAVMFCLMDVSGSMDQSTKD---MAKRFYILLYLFL-------SRTYKNVEV 289 + + QA + L D S S+ K+ M K F L + V Sbjct: 44 KDAEECDVQADIIVLFDDSSSIQYDNKENYQMMKDFVKELVDSFTTVGVNGRNGSQFGVV 103 Query: 290 VYI-----RHHTQAKEVDEHEFFYSQE-----TGGTIVSSALKLMDEVVKERYNPAQWN- 338 + + E Q+ G T + + LK + E Sbjct: 104 QFSQGVKTAFPLNKFKTKEDIKKGIQDMVPRNGGQTEIGTGLKHVRENSFSGAEGGGNPD 163 Query: 339 --IYAAQASDGDNWADDSPLCHEILAKKLLPVVRYYSYIEITRRAHQTLWREYEHLQSTF 396 +DG + A P K V + + Sbjct: 164 KQKIVILMTDGKSNAGAPPQ--HEAHKLKAEGVTVIAIGIGQGFVKTE-LEQIA--TMKN 218 Query: 397 DNFAMQHIRDQDDIYPVFREL 417 + + + +L Sbjct: 219 YVLTTNSFSELSTLLKLVIDL 239 >UniRef50_P12110 Collagen alpha-2(VI) chain n=30 Tax=Euteleostomi RepID=CO6A2_HUMAN Length = 1019 Score = 81.1 bits (198), Expect = 7e-14, Method: Composition-based stats. Identities = 44/342 (12%), Positives = 77/342 (22%), Gaps = 36/342 (10%) Query: 62 PMFHQGRGGLRHRVHPGNDHFVQNDRIERPQGGGGGSGSGQGQASQDGEGQDEFVFQISK 121 P G + G+ + G G G + +G + Sbjct: 429 PGTKGSPGSDGPKGEKGDPGPEGPRGLAGEVGNKGAKGDRGLPGPRGPQGALGEPGKQGS 488 Query: 122 DEYLDLLFEDLALPNLKQNQQRQLTEYKTHRAGYTANGVPANISVVRSLQNSLARRTAMT 181 R G++ G Sbjct: 489 RGDPGDAGPRGDSGQPGPKG-------DPGRPGFSYPGPRGAPGEKGEPGPRGPEGGRGD 541 Query: 182 AGKRRELHALEENLAIISNSEPAQLLEEERLRKEIAELRAKIERVPFIDTFDLRYKNYEK 241 G + E E P + E P + D+ E Sbjct: 542 FGLKGEPGRKGEKGEPADPGPPGEPGPRGPRGVPGPEGEPGPPGDPGLTECDVMTYVRET 601 Query: 242 R-----PDPSSQAVMFCLMDVSGSMDQSTKDMAKRFYILLYLFLSR---------TYKNV 287 + ++D S S+ + + K F I + L + Sbjct: 602 CGCCDCEKRCGALDVVFVIDSSESIGYTNFTLEKNFVINVVNRLGAIAKDPKSETGTRVG 661 Query: 288 EVVYIRHHTQ-AKEVDEHEFFYSQE-----------TGGTIVSSALKLMDEVVKERYNPA 335 V Y T A ++D+ GGT SALK + + + Sbjct: 662 VVQYSHEGTFEAIQLDDERIDSLSSFKEAVKNLEWIAGGTWTPSALKFAYDRLIKESRRQ 721 Query: 336 QWNIYAAQASDGDNWADDSPLCHEILAKKLLPVVRYYSYIEI 377 + ++A +DG + D L L + V + Sbjct: 722 KTRVFAVVITDGRHDPRDDDLNLRALCDR---DVTVTAIGIG 760 Score = 51.0 bits (120), Expect = 8e-05, Method: Composition-based stats. Identities = 18/206 (8%), Positives = 47/206 (22%), Gaps = 28/206 (13%) Query: 243 PDPSSQAVMFCLMDVSGSMDQST-----KDMAKRFYILLYLFLSRTYKNVEVVY------ 291 ++ ++D S S+ + K+F L + +V Sbjct: 39 EKTDCPIHVYFVLDTSESVTMQSPTDILLFHMKQFVPQFISQLQNEFYLDQVALSWRYGG 98 Query: 292 ---------IRHHTQAKEVDEHEFFYSQE-TGGTIVSSALKLMDEVVKERYNPAQWNIYA 341 + GT AL M E +++ + + Sbjct: 99 LHFSDQVEVFSPPGSDRASFIKNLQGISSFRRGTFTDCALANMTEQIRQDRSKGTVHFAV 158 Query: 342 AQASDGDNWADDSPLCHEILAKKLLPVVRYYSYIEITRRAHQTLWREYEH-----LQSTF 396 +DG + +R ++ + + R+ ++ + Sbjct: 159 V-ITDGHVTGSPCGGIKLQAERAREEGIRLFAVA-PNQNLKEQGLRDIASTPHELYRNDY 216 Query: 397 DNFAMQHIRDQDDIYPVFRELFHKQN 422 D ++ + Sbjct: 217 ATMLPDSTEIDQDTINRIIKVMKHEA 242 Score = 44.1 bits (102), Expect = 0.009, Method: Composition-based stats. Identities = 21/196 (10%), Positives = 41/196 (20%), Gaps = 33/196 (16%) Query: 238 NYEKRPDPSSQAVMFCLMDVSGSMDQSTKDMAKRFYILLYLFL---SRTYK--NVEVVYI 292 + L+D S + + A+RF + L R N V + Sbjct: 821 ELSVAQCTQRPVDIVFLLDGSERLGEQNFHKARRFVEQVARRLTLARRDDDPLNARVALL 880 Query: 293 RHHTQAKEVDEHEFFYSQETGGTIVSSALK----------------LMDEVVKERYNPAQ 336 + ++ + T + AL+ + Sbjct: 881 QFGGPGEQQVAFPLSH----NLTAIHEALETTQYLNSFSHVGAGVVHAINAIVRSPRGGA 936 Query: 337 WNIY---AAQASDGDNWADDSPLCHEILAKKLLPVVRYYSYIEITRRAHQTLWREYEHLQ 393 +DG +DS +K V + + L Sbjct: 937 RRHAELSFVFLTDGV-TGNDSLHESAHSMRKQNVVPTVLALG---SDVDMDVLTTLS-LG 991 Query: 394 STFDNFAMQHIRDQDD 409 F + Sbjct: 992 DRAAVFHEKDYDSLAQ 1007 >UniRef50_UPI000180C65C PREDICTED: similar to calcium channel, voltage-dependent, alpha 2/delta subunit 1 n=1 Tax=Ciona intestinalis RepID=UPI000180C65C Length = 1114 Score = 81.1 bits (198), Expect = 7e-14, Method: Composition-based stats. Identities = 24/218 (11%), Positives = 47/218 (21%), Gaps = 26/218 (11%) Query: 222 KIERVPFIDTFDLRYKNYEKRPDPSSQAVMFCLMDVSGSMDQSTKDMAKRFYILLYLFLS 281 K +S + ++D SGS+ T + + L L+ Sbjct: 239 PWNTQCLYHDLHDVTKVSWFVKGMTSPKDVLIMIDTSGSIIGITLSLIQTSVKKLMSTLT 298 Query: 282 RTYKNVEVVYIRHHTQ--------------AKEVDEHEFFYSQETGGTIVSSALKLMDEV 327 V+ K++ + E+ Sbjct: 299 ENDFFNIFVFNNEPKFLQPSCPNLMQATPKHKQMAAGWLSNLTVHNSSAFEKGFDFAFEI 358 Query: 328 VKE--------RYNPAQWNIYAAQASDGDNWADDSPLCHEILAKKLLPVVRYYSYIEITR 379 + + R A N +DG P L VR ++Y Sbjct: 359 LTQSNSLNTTHRPIRAGCNSAILLFTDG---GAAYPSQVFKKW-NLDKEVRVFTYSVGKP 414 Query: 380 RAHQTLWREYEHLQSTFDNFAMQHIRDQDDIYPVFREL 417 + T ++ + +L Sbjct: 415 FSSTTTLKQMACNNRGEFTAIPSYSATNLQTRKYLSKL 452 >UniRef50_A5UW94 von Willebrand factor, type A n=2 Tax=Roseiflexus RepID=A5UW94_ROSS1 Length = 452 Score = 81.1 bits (198), Expect = 7e-14, Method: Composition-based stats. Identities = 18/167 (10%), Positives = 33/167 (19%), Gaps = 15/167 (8%) Query: 260 SMDQSTKDMAKRFYILLYLFLSRTYKNVEVVYIRHH--------TQAKEVDEHEFFYSQE 311 S D + L R + VV+ H + Sbjct: 100 KAASSALDHVVHALHTVVERLDRNDRLSLVVFADHALLLIPGMVGSDRVTLVRAIERLPG 159 Query: 312 ---TGGTIVSSALKLMDEVVKERYNPAQWNIYAAQASDGDNWADDSPLCHEILAKKLLPV 368 GT ++ + L ++ + + N +DG + L A Sbjct: 160 LDLGDGTNLADGIALALNQIRANRDARRANRV-LLLTDGFTRDPAACLTLADQAADEHIA 218 Query: 369 VRYYSYIEITRRAHQTLWREYEHLQSTFDNFAMQHIRDQDDIYPVFR 415 + L F + I Sbjct: 219 ITTIGLG---GEFQDDLLTGIADRSGGNALFLKRASAIPRAISAELE 262 >UniRef50_A9BS02 von Willebrand factor type A n=1 Tax=Delftia acidovorans SPH-1 RepID=A9BS02_DELAS Length = 536 Score = 81.1 bits (198), Expect = 8e-14, Method: Composition-based stats. Identities = 34/244 (13%), Positives = 58/244 (23%), Gaps = 36/244 (14%) Query: 189 HALEENLAIISNSEPAQLLEEERLRKEIAELRAKIERVPFIDTFDLRYKNYEKR-----P 243 + + A +P + + E A +F R + + Sbjct: 274 RLVAQFKAPAFQGQPLRQAQLRPASPEAQASPALPTAPVVELSFPNRLEVIDAVLSAYQS 333 Query: 244 DPSSQAVMFCLMDVSGSMDQSTKDMAKRFYILL-----------YLFLSRTYKNVEVVYI 292 D A ++DVSGSM + K LL Y + + + + Sbjct: 334 DLRRPATSIFVLDVSGSMKGARLAQMKEALKLLSGAEASAASQRYAAFQARERVLLIPFS 393 Query: 293 R---------HHTQAKEVD----EHEFFYSQETGGTIVSSALKLMDEVVKERYNPAQWNI 339 + GGT + AL L + ++ Sbjct: 394 GLVGQPARVQFAAGDLQAASAQVLAYADSLVADGGTAIYDALTLAQQQARQELRADPERF 453 Query: 340 Y-AAQASDGDNWADDSPLCHEILAKKLLPV----VRYYSYIEITRRAHQTLWREYEHLQS 394 +DG N A E + VR + I A + L Sbjct: 454 VSIVLLTDGANTAGRDWAAFEREQRMARDGGAPLVRVFPIIFG--EAQSGEMQALAALTG 511 Query: 395 TFDN 398 Sbjct: 512 GRAF 515 >UniRef50_O50313 Magnesium-chelatase 67 kDa subunit n=15 Tax=Bacteria RepID=BCHD_CHLP8 Length = 619 Score = 81.1 bits (198), Expect = 8e-14, Method: Composition-based stats. Identities = 57/441 (12%), Positives = 124/441 (28%), Gaps = 63/441 (14%) Query: 7 RRLNGKNKSMVNRQRFLRRYKAQIKQSISEAIN---KRSVTDVDS----------GESVS 53 R L G + + + QIK I AI+ + + D+ + G+ Sbjct: 195 RMLRGIIGAAREQLHHVSITNEQIKGLIQTAISLGVEGNRVDIFAIRAALANAALGQRTE 254 Query: 54 IPTEDISEP-MFHQGRGGLRHRVHPGNDHFVQNDRIERPQGGGGGSGSGQGQASQDGEGQ 112 + ED+ R ++ +Q + P+ G + + + Sbjct: 255 VDDEDLKLAVKLVLVPRATRMPEREPSEEEMQQEEPPPPEEQPEQEGEDENAPPDETDSD 314 Query: 113 DEFVFQISKDEYLDLLFEDLALPNLKQNQQRQLTEYKTHRAGYTANGVPANISVVRSLQN 172 + + + D +L+ + + + L K ++G S +L N Sbjct: 315 ADEEQEETPDMIEELMMDAIETDLPENILNISLASKKKAKSG----------SRGEALNN 364 Query: 173 SLARRTAM--TAGKRRELHALEENLAIISNSEPAQLLEEERLRKEIAELRAKIERVPFID 230 R K ++ + ++ + + ++ K A + + + Sbjct: 365 KRGRFVRSQPGEIKSGKVALIPTLISAAPWQAARKAEKAKKGIKTGALVISTDDVKIK-- 422 Query: 231 TFDLRYKNYEKRPDPSSQAVMFCLMDVSGSMDQSTKDMAKRF-YILLYLFLSRTYKNVEV 289 R+++ S + ++D SGSM + AK LL + + Sbjct: 423 ----RFRD-------KSGTLFIFMVDASGSMALNRMRQAKGAVASLLQNAYVHRDQVSLI 471 Query: 290 VYIR-------HHTQAKEVDEHEFFYSQETGGTIVSSALKLMDEVVKERYNPAQWNIYAA 342 + +Q+ + + E GGT ++SAL E K+ I Sbjct: 472 SFRGKQAQVLLPPSQSVDRAKRELDVLPTGGGTPLASALLTGWETAKQARTKGITQIMFV 531 Query: 343 QASDGDNWADDSP-----------LCHEILAKKLLPVVRYYSYIEITRRAHQTLWR---- 387 +DG + E + L ++ I Sbjct: 532 MITDGRGNIPLAAAVDPAAAKAPKEELEKEVEALALSIQSDGIASIVVDTQMNYLSRGEA 591 Query: 388 -EYEHLQSTFDNFAMQHIRDQ 407 + + +Q Sbjct: 592 PKLAQKLGGRYFYLPNAKAEQ 612 >UniRef50_C4N894 Complement factor B-like protein n=1 Tax=Venerupis decussatus RepID=C4N894_9BIVA Length = 697 Score = 81.1 bits (198), Expect = 8e-14, Method: Composition-based stats. Identities = 29/196 (14%), Positives = 51/196 (26%), Gaps = 32/196 (16%) Query: 246 SSQAVMFCLMDVSGSMDQSTKDMAKRFYILLYLFL---------SRTYKNVEVVYIRHHT 296 S + L+DVS S+ + + AK+F LL + + + Sbjct: 192 KSGLDVVLLVDVSSSIGDRSMESAKKFMKLLVDIFGVSNETSGGKNGTRFALLTFSNEAD 251 Query: 297 QA----------KEVDEHEFFYSQ-ETGGTIVSSALKLM-----DEVVKERYNP-AQWNI 339 KE + Q GGT +AL + V+K+ Sbjct: 252 IVFNLNDGTARSKEEVKRRIDEIQNTGGGTNFRAALLKVVGGIFFNVIKKESQRLNHATR 311 Query: 340 YAAQASDGDNWAD---DSPLCHEILAKKLLPVVR--YYSYIEITRRAHQTLWREYEHLQS 394 +D + + D A L + +T E Sbjct: 312 AVFLLTDAEETSTLEKDRLPRIRQAANDLKNEGHFEIFCIGVGQ-NIDETTLAEIASTPH 370 Query: 395 TFDNFAMQHIRDQDDI 410 F + D + + Sbjct: 371 IEHVFTLSKFDDLEKV 386 >UniRef50_A7NJ01 von Willebrand factor type A n=2 Tax=Roseiflexus RepID=A7NJ01_ROSCS Length = 972 Score = 81.1 bits (198), Expect = 8e-14, Method: Composition-based stats. Identities = 42/440 (9%), Positives = 95/440 (21%), Gaps = 49/440 (11%) Query: 7 RRLNGKNKSMVNRQRFLRRYKAQIKQSISEAINKRSVTDVDSGESVSIPTEDISEPMFHQ 66 +RL + N R + + + + V I + + + Sbjct: 156 KRLVLISDGGENAGRVADAAQLAAIRKVPIDV----VYMPGERGPDVIVAGLSAPAVVRE 211 Query: 67 GRGGLRHRVHPGNDHFVQNDRIERPQGGGGGSGSGQGQASQDGEGQDEFVFQISKDEYLD 126 G+ N + G + + + + Sbjct: 212 GQDLTLQANITSNYATSGRLQTFVDGQLIGEQELSIPEGASTID-IRVPSGETGFRRIEV 270 Query: 127 LLFEDLALPNLKQNQQRQLTEYKTHRAGYTANGVPANISVVRSLQNSLARRTAMTAGKRR 186 L D R A+ +++ +L+ + R + Sbjct: 271 RLDADGDTEPQNNRGAAFTEVLGPPRLLLIASNEARAVNLRDALRAAEVR--VDVLPPDQ 328 Query: 187 ELHALEENLAIISNSEPAQLLEEERLRKEIAELRAKIERVPFIDT--------------- 231 L++ A + A E + Sbjct: 329 APATLDQLGAYAGVIIVDTPARDMPRTLMEALPVYVRELGRGLAMVGGIDSFGAGGYRRT 388 Query: 232 ---FDLRYKNYEKRPDPSSQAVMFCLMDVSGSMDQ-------STKDMAKRFYILLYLFLS 281 L + ++D SGSM + + D+AK L L+ Sbjct: 389 PLEPVLPVLLDPLDTKQQPDLALVMVIDRSGSMSELVGGSRRNRLDLAKEAVYQASLGLT 448 Query: 282 RTYKNVEVVYIRHHTQAKEV--------DEHEFFYSQETGGTIVSSALKLMDEVVKERYN 333 + VV+ + E GGT + ++ + + + Sbjct: 449 PIDQVGLVVFDDAANWVLPLQRLPSVVEIERALGSFGIGGGTNIRPGIE---QAAQALAS 505 Query: 334 PAQWNIYAAQASDGDNWADDSPLCHEILAKKLLPVVRYYSYIEITRRAHQTLWREYEHLQ 393 + +DG A+ + + + + E A+ L + Sbjct: 506 ADAKVKHVILLTDG--IAESNYSDLIAQMRAAGVTISTVAIGE---DANPNLVD-VANAG 559 Query: 394 STFDNFAMQHIRDQDDIYPV 413 + Sbjct: 560 GGRSYRVTRIEDVPRIFLQE 579 Score = 47.6 bits (111), Expect = 0.001, Method: Composition-based stats. Identities = 21/133 (15%), Positives = 42/133 (31%), Gaps = 14/133 (10%) Query: 244 DPSSQAVMFCLMDVSGSMDQSTKDMAKRFYILLYLFLSRTYKNVEVVYIRH------HTQ 297 P + L+DVS SM + ++ A ++ + + VV+ + Sbjct: 62 LPVRELTTVFLVDVSDSMTPAQRERALQYVNDALAAMPPGDQAAVVVFGDNALVERAPGP 121 Query: 298 AKEVDEHEFFYSQETGGTIVSSALKLMDEVVKERYNPAQWNIYAAQASD-GDNWADDSPL 356 + T + A++L + PA+ SD G+N + Sbjct: 122 IGPLSRLTSVPITTR--TNLQEAVQLGLALF-----PAETQKRLVLISDGGENAGRVADA 174 Query: 357 CHEILAKKLLPVV 369 +K+ V Sbjct: 175 AQLAAIRKVPIDV 187 >UniRef50_Q22UB9 von Willebrand factor type A domain containing protein n=1 Tax=Tetrahymena thermophila SB210 RepID=Q22UB9_TETTH Length = 2269 Score = 81.1 bits (198), Expect = 8e-14, Method: Composition-based stats. Identities = 24/163 (14%), Positives = 47/163 (28%), Gaps = 14/163 (8%) Query: 239 YEKRPDPSSQAVMFCLMDVSGSMDQSTKDMAKRFYILLYLFLSRTYKNVEVVYIRHHTQA 298 +K + + D SGSM+ ++ + SR + I Sbjct: 2089 IKKCKSQINPVHFIIVFDESGSMEGEKWITLRKELLNFIDNRSRATAQDFITLIGFAHTV 2148 Query: 299 KEVD--EHEFFYSQ-------ETGGTIVSSALKLMDEVVKERYNPA-QWNIYAAQASDGD 348 K E + GGT S+ L+ ++ + + N SDGD Sbjct: 2149 KLYTKVEKLNEQIKQKVPQEFMDGGTNYSAPLQQALNILSQEQCQTFKKNNVIFFLSDGD 2208 Query: 349 NWADDSPLCHEILAKKLLPVVRYYSYIEITRRAHQTLWREYEH 391 P +KL +++ + + + + Sbjct: 2209 ---AKEPKTEIQQLQKLGHLIKLIQF-VGYGDENFQTLKSMAN 2247 >UniRef50_A6NMZ7 Collagen alpha-6(VI) chain n=2 Tax=Theria RepID=CO6A6_HUMAN Length = 2263 Score = 81.1 bits (198), Expect = 8e-14, Method: Composition-based stats. Identities = 28/259 (10%), Positives = 60/259 (23%), Gaps = 31/259 (11%) Query: 185 RRELHALEENLAIISNSEPAQLLEEERLRKEIAELRAKIE---RVPFIDTFDLRYKNYEK 241 + E + + + + E + + D + + Sbjct: 741 PAVVLRQEGVIIYSVGVFGSNVTQLEEISGRPEMVFYVENFDILQRIEDDLVFGICSPRE 800 Query: 242 RPDPSSQAVMFCLMDVSGSMDQSTKDMAKRFYILLYL------------FLSRTYKNVEV 289 + ++D SGS+D ++ K F I L L + Sbjct: 801 ECKRIEVLDVVFVIDSSGSIDYDEYNIMKDFMIGLVKKADVGKNQVRFGALKYADDPEVL 860 Query: 290 VYIRHHTQAKEVDEHEFFYSQETGGTIVSSAL---KLMDEVVKERYNPAQWNIYAAQASD 346 Y+ EV G T + AL M + +D Sbjct: 861 FYLDDFGTKLEVISVLQNDQAMGGSTYTAEALGFSDHMFTEARGSRLNKGVPQVLIVITD 920 Query: 347 GDNWADDSPLCHEILAKKLLPVVRYYSYIEITRRAHQTLWREYEHLQSTFDNFAMQHIRD 406 G++ D + + + S+ F + Sbjct: 921 GESHDADKLNATAKALRDK--GILVLAVGIDGANP----VELLAMAGSSDKYFFV----- 969 Query: 407 QDDIYPVFRELFHKQNATA 425 + + + +F A+ Sbjct: 970 --ETFGGLKGIFSDVTASV 986 Score = 65.7 bits (158), Expect = 3e-09, Method: Composition-based stats. Identities = 34/275 (12%), Positives = 79/275 (28%), Gaps = 27/275 (9%) Query: 161 PANISVVRSLQNSLARRTAMTAGKRRELHALEENLAIISNSEPAQLLEEERLRKEIAELR 220 + V +L R E + + I S+ + + + A + Sbjct: 342 RDSEDNVTKAAVNLRREGVTIFTLGIEGASDTQLEKIASHPAEQYVSKLKTFADLAAHNQ 401 Query: 221 AKIERVPFIDTFDLRYKNYEKRPDPS-----SQAVMFCLMDVSGSMDQSTKDMAKRFYIL 275 ++++ T + + S +A ++ L+D SGS + K F Sbjct: 402 TFLKKLRNQITHTVSVFSERTETLKSGCVDTEEADIYLLIDGSGSTQATDFHEMKTFLSE 461 Query: 276 LY---LFLSRTYKNVEVVYIRHHTQAKEVDE--------HEFFYSQETGG-TIVSSALKL 323 + + V Y E+++ ++ GG T +AL Sbjct: 462 VVGMFNIAPHKVRVGAVQYADSWDLEFEINKYSNKQDLGKAIENIRQMGGNTNTGAALNF 521 Query: 324 MDEVVKERYNPAQWNI--YAAQASDGDNWADDSPLCHEILAKKLLPVVRYYSYIEITRRA 381 ++++ + + + E + +R Y+ A Sbjct: 522 TLSLLQKAKKQRGNKVPCHLVVLT----NGMSKDSILEPANRLREEHIRVYAIGIK--EA 575 Query: 382 HQTLWREYE-HLQSTFDNFAMQHIRDQ-DDIYPVF 414 +QT RE + + ++D + + Sbjct: 576 NQTQLREIAGEEKRVYYVHDFDALKDIRNQVVQEI 610 Score = 63.0 bits (151), Expect = 2e-08, Method: Composition-based stats. Identities = 30/244 (12%), Positives = 58/244 (23%), Gaps = 22/244 (9%) Query: 185 RRELHALEENLAIISNSEPAQLLEEERLRKEIAELRAKIERVPFIDTFDLRYKNYEKRPD 244 E + A + + E + + D + + Sbjct: 556 PANRLREEHIRVYAIGIKEANQTQLREIAGEEKRVYYVHDFDALKDIRNQVVQEICTEEA 615 Query: 245 -PSSQAVMFCLMDVSGSMDQSTKDMAKRFYILLYLFLSRTYKNVEVVYIRHHTQAKEVDE 303 +A + L+D SGS+ K F L V++ ++ KE + Sbjct: 616 CKEMKADIMFLVDSSGSIGPENFSKMKTFMKNLVSKSQIGPDRVQIGVVQFSDINKEEFQ 675 Query: 304 -----------HEFFYSQETGGTIV-SSALKLMDEVVK-ERYNPAQWNIYAAQASDGDNW 350 + G T + SAL + + + + +DG+ Sbjct: 676 LNRFMSQSDISNAIDQMAHIGQTTLTGSALSFVSQYFSPTKGARPNIRKFLILITDGEAQ 735 Query: 351 ADDSPLCHEILAKKLLPVVRYYSYIEITRRAHQTLWREYEHLQSTFDNFAMQHIRDQDDI 410 + V YS T E F +++ I Sbjct: 736 DIVKEPAVVL----RQEGVIIYSVGVFGSNV--TQLEEIS--GRPEMVFYVENFDILQRI 787 Query: 411 YPVF 414 Sbjct: 788 EDDL 791 Score = 63.0 bits (151), Expect = 2e-08, Method: Composition-based stats. Identities = 23/166 (13%), Positives = 42/166 (25%), Gaps = 23/166 (13%) Query: 240 EKRPDPSSQAVMFCLMDVSGSMDQSTKDMAKRFYILLYLFLSRTYKNVEVVYIRHHTQA- 298 K + + LMD S S+ + K F + + V + + Sbjct: 990 SKVDCEIDKVDLVFLMDGSTSIQPNDFKKMKEFLASVVQDFDVSLNRVRIGAAQFSDTYH 1049 Query: 299 -----------KEVDEHEFFYSQETGGTIVSSALKLMDEVVK---ERYNPAQWNIYAAQA 344 KE+ Q G T + +AL+ ++ + Sbjct: 1050 PEFPLGTFIGEKEISFQIENIKQIFGNTHIGAALREVEHYFRPDMGSRINTGTPQVLLVL 1109 Query: 345 SDGDNWADDSPLCHEILAKKLLP-VVRYYSYIEITRRAHQTLWREY 389 +DG S A+ L + YS + Sbjct: 1110 TDGQ-----SQDEVAQAAEALRHRGIDIYSVGIG--DVDDQQLIQI 1148 Score = 57.6 bits (137), Expect = 9e-07, Method: Composition-based stats. Identities = 33/286 (11%), Positives = 69/286 (24%), Gaps = 35/286 (12%) Query: 160 VPANISVVRSLQNSLARRTAMTAGKRRELHALEENLAIISNSEPAQLLEEERLRKEIAEL 219 + V +L + + + E A+ ++ L + + +L Sbjct: 140 SSESEDNVEEASKALRKDGVKIISVGVQKASEENLKAMATSQFHFNL-------RTVRDL 192 Query: 220 RAKIERVPFIDTFDLRYKN------YEKRPDPSSQAVMFCLMDVSGSMDQSTKDMAKRFY 273 + + I ++YK + + S A + L+D+S + + D K F Sbjct: 193 SMFSQNMTHIIKDVIKYKEGAVDDIFVEACQGPSMADVVFLLDMSINGSEENFDYLKGFL 252 Query: 274 ILLYLFLSRT---YKNVEVVYIRHH--------TQAKEVDEHEFFYSQET-GGTIVSSAL 321 L + V Y K G +A+ Sbjct: 253 EESVSALDIKENCMRVGLVAYSNETKVINSLSMGINKSEVLQHIQNLSPRTGKAYTGAAI 312 Query: 322 KLMDEVV----KERYNPAQWNIYAAQASDGDNWADDSPLCHEILAKKLLPVVRYYSYIEI 377 K + + V A + D + V ++ Sbjct: 313 KKLRKEVFSARNGSRKNQGVPQIAVLVT----HRDSEDNVTKAAVNLRREGVTIFTLGIE 368 Query: 378 TRRAHQTLWREYEHLQSTFDNFAMQHIRDQDDIYPVFRELFHKQNA 423 A T + + ++ D F + Q Sbjct: 369 G--ASDTQLEKIASHPAEQYVSKLKTFADLAAHNQTFLKKLRNQIT 412 Score = 50.3 bits (118), Expect = 1e-04, Method: Composition-based stats. Identities = 25/196 (12%), Positives = 50/196 (25%), Gaps = 28/196 (14%) Query: 246 SSQAVMFCLMDVSGSMDQSTKDMAKRFYILLYLFLSRTYKNVEVVYIRHHTQAK-EVDEH 304 A + L+D S + + K F + L V ++ + E Sbjct: 23 PEYADVVFLVDSSDRLGSKSFPFVKMFITKMISSLPIEADKYRVALAQYSDKLHSEFHLS 82 Query: 305 EFFYSQE------------TGGTIVSSALKLMDEV-----VKERYNPAQWNIYAAQASDG 347 F G + AL+ R I AS Sbjct: 83 TFKGRSPMLNHLRKNFGFIGGSLQIGKALQEAHRTYFSAPANGRDKKQFPPILVVLAS-- 140 Query: 348 DNWADDSPLCHEILAKKLLPVVRYYSYIEITRRAHQTLWREYEHLQSTFDNFAMQHIRDQ 407 + ++D+ +K V+ S ++A + + +F ++ +RD Sbjct: 141 -SESEDNVEEASKALRK--DGVKIISVGV--QKASEENLKAMATS---QFHFNLRTVRDL 192 Query: 408 DDIYPVFRELFHKQNA 423 + Sbjct: 193 SMFSQNMTHIIKDVIK 208 Score = 46.4 bits (108), Expect = 0.002, Method: Composition-based stats. Identities = 18/238 (7%), Positives = 53/238 (22%), Gaps = 39/238 (16%) Query: 191 LEENLAIISNSEPAQLLEEERLRKEI-----AELRAKIERVPFIDTFDLRYKNYEKRPDP 245 + + S + + + + + A +D+ + Sbjct: 1897 VITFSNVPSVRRAFAIDDTGTFQVIVVPSGADYIPALERLQRCTFCYDVCKPDASCDQAR 1956 Query: 246 SSQA----VMFCLMDVSGSMDQSTKDMAKRFYILLYLFLS---------RTYKNVEVV-- 290 L+D S +M + + + F L + + Sbjct: 1957 PPPVQSYMDAAFLLDASRNMGSAEFEDIRAFLGALLDHFEITPEPETSVTGDRVALLSHA 2016 Query: 291 ---YIRHHTQAKEVDEHEFFYSQE---------------TGGTIVSSALKLMDEVVKERY 332 ++ + ++ E + G + AL+ + V Sbjct: 2017 PPDFLPNTQKSPVRAEFNLTTYRSKRLMKRHVHESVKQLNGDAFIGHALQWTLDNVFLST 2076 Query: 333 NPAQWNIYAAQASDGDNWADDSPLCHEILAKKLLPVVRYYSYIEITRRAHQTLWREYE 390 + N S G+ D + + + + + + + Sbjct: 2077 PNLRRNKVIFVISAGETSHLDGEILKKESLRAKCQGYALFVF-SLGPIWDDKELEDLA 2133 >UniRef50_C9RJ63 von Willebrand factor type A n=1 Tax=Fibrobacter succinogenes subsp. succinogenes S85 RepID=C9RJ63_FIBSS Length = 227 Score = 80.7 bits (197), Expect = 9e-14, Method: Composition-based stats. Identities = 23/172 (13%), Positives = 55/172 (31%), Gaps = 24/172 (13%) Query: 240 EKRPDPSSQAVMFCLMDVSGSMDQSTKDMAKRFYILLYLFLSRT------YKNVEVVYIR 293 + +PSS+ + ++D SGSM+ + + L Y + + V + Sbjct: 10 DLENNPSSRVPVCLVLDTSGSMEGDSINELNEGVRLFYDAVRSDETALYAAEISVVTFGG 69 Query: 294 HHTQA--KEVDEHEFF--YSQETGGTIVSSALKLMDEVVKERYNP------AQWNIYAAQ 343 H + EH+ GGT + A+ + +++++R + + + Sbjct: 70 HASCQAGFSTLEHQPDAPQFYADGGTPMGEAMNMALDMLEKRKSEYKASGVDYYQPWIVL 129 Query: 344 ASDGDNWADDSPLCHEILAKK-----LLPVVRYYSYIEITRRAHQTLWREYE 390 +DG S ++ + + A + + Sbjct: 130 MTDGMPNG--SQAELSRSIQRTCDMINDRKLTIFPIGIG-EDADMDVLARFS 178 >UniRef50_C8XH18 von Willebrand factor type A n=1 Tax=Nakamurella multipartita DSM 44233 RepID=C8XH18_NAKMY Length = 618 Score = 80.7 bits (197), Expect = 9e-14, Method: Composition-based stats. Identities = 40/263 (15%), Positives = 73/263 (27%), Gaps = 38/263 (14%) Query: 183 GKRRELHALEENLAIISNSEPAQLLEEERLRKEIAELRAKIERVPFIDTFDLRYKNYEKR 242 ++ E N + P + + A P + Sbjct: 366 LPEQQKVFTEANFRTADHQ-PGEPITSSPYLIADGVTIALNPPGPSVLR-----DVRALW 419 Query: 243 PDPSSQAVMFCLMDVSGSM-------DQSTKDMAKRFYILLYLFLSRTYKNVEVVY---- 291 A + +MDVSGSM +S D+AK+ L+ T + + Sbjct: 420 TQVRKPARVLVVMDVSGSMASESGYGSESKLDLAKKAATSALGQLTDTDQMGLWAFTTDL 479 Query: 292 ------------IRHHTQAKEVDEHEFFYSQETGGTIVSSALKLMDEVVKERYNPAQWNI 339 + Q ++ GT + +A + + + + +P N Sbjct: 480 PTPDTITADLVGVGPLAQTRQPIIDAISSLTPLNGTPLYAATREAAKAMNAQKDPNSINA 539 Query: 340 YAAQASDGDNWADDSPLC---HEILAKKLLPVVRYYSYIEITRRAHQTLWREYEHLQSTF 396 +DG N D+ L E+ A VR ++ A +E Sbjct: 540 VVVL-TDGRNEYTDNDLDGLLRELNASAEEDGVRVFTIAYG-PDADLATLQEISEASRAA 597 Query: 397 DNFAMQHIRDQDDIYPVFRELFH 419 R+ I VF ++ Sbjct: 598 AY----DARNPTSIDKVFSDVLS 616 >UniRef50_B9HP09 Predicted protein n=13 Tax=cellular organisms RepID=B9HP09_POPTR Length = 786 Score = 80.7 bits (197), Expect = 9e-14, Method: Composition-based stats. Identities = 29/272 (10%), Positives = 69/272 (25%), Gaps = 38/272 (13%) Query: 170 LQNSLARRTAMTAGKRRELHALEENLAIISNSEPAQLLEEERLRKEIAEL------RAKI 223 Q + RR K + E+ I P ++ + + R + Sbjct: 500 AQQAQRRRGKAGRAK--NVIFSEDRGRYIKPMLPKGPVKRLAVDATLRAAAPYQKLRKEK 557 Query: 224 ERVPFIDTFDLRYKNYEKRPDPSSQAVMFCLMDVSGSMDQSTKDMAK-RFYILLYLFLSR 282 + + + KR + A++ ++D SGSM + AK LL + Sbjct: 558 DTQKSRKVYVEKTDMRAKRMARKAGALVIFVVDASGSMALNRMQNAKGAALKLLAESYTS 617 Query: 283 TYKNVEVVYIR-------HHTQAKEVDEHEFFYSQETGGTIVSSALKLMDEV-VKERYNP 334 + + + +++ + GG+ ++ L V + + Sbjct: 618 RDQVAIIPFRGDAAEVLLPPSRSISMARKRLERLPCGGGSPLAHGLTTAVRVGLNAEKSG 677 Query: 335 AQWNIYAAQASDGD----------------NWADDSPLCHEILAKKLLPVVR-----YYS 373 I +DG + S + ++ + Sbjct: 678 DVGRIMIVAITDGRANISLKRSTDPEAAGPDAPRPSTQELKDEILEVAGKIYKAGMSLLV 737 Query: 374 YIEITRRAHQTLWREYEHLQSTFDNFAMQHIR 405 + +E + + Sbjct: 738 IDTENKFVSTGFAKEIARVAQGKYYYLPNASD 769 >UniRef50_D1JHM1 Putitive magnesium-chelatase subunit n=2 Tax=uncultured archaeon RepID=D1JHM1_9ARCH Length = 705 Score = 80.7 bits (197), Expect = 9e-14, Method: Composition-based stats. Identities = 37/311 (11%), Positives = 73/311 (23%), Gaps = 28/311 (9%) Query: 135 PNLKQNQQRQLTEYKTHRAGYTANGVPANISVVRSLQNSLARRTAMTAGKRRELHALEEN 194 P + ++ + + + R+T+ L Sbjct: 392 PEGQDDKDDVNESKNESVFNIGDPIDTRAVRQKQKKDKTYRRKTSGRRIPTLSLRNNG-- 449 Query: 195 LAIISNSEPAQLLEEERLRKEIAELRAKIERVPFIDTFDLRYKNYEKRPDPSSQ---AVM 251 I + P + + L I + + K+ + R Sbjct: 450 -KYIRHGIPKGKITDVALDATIRAAAVYQKERVVDSDLAVVIKSQDIREKIRVGKISTAT 508 Query: 252 FCLMDVSGSMD-QSTKDMAKRF-YILLYLFLSRTYKNVEVVYIRHHTQAK-------EVD 302 ++D SGSM + AK LL + K V + ++ Sbjct: 509 MFVVDASGSMGANRRMESAKGAVLSLLLDSYQQRDKVGMVAFKGDQADVLLPLCSSSDLA 568 Query: 303 EHEFFYSQETGGTIVSSALKLMDEVVKERYNPAQWN-IYAAQASDGD-NWADDSPLCHEI 360 G T +++ L+ ++ + + SDG N + E Sbjct: 569 VERLRELPTGGRTPLAAGLEQGLNLLMAEKHRDEEAIPILLLISDGRANVSAGGSKELEQ 628 Query: 361 LAKKLLPVVR---YYSYIEITRRAHQTLW-------REYEHLQSTFDNFAMQ-HIRDQDD 409 L R Y + T + R + D Sbjct: 629 ELLALAEQARAKGIYVIVIDTEIVSDSFIQMQLGYCRAIANYSGGKYYPIADLTSGAVRD 688 Query: 410 IYPVFRELFHK 420 I R + + Sbjct: 689 IVISERNMLND 699 >UniRef50_UPI00016E1D1D UPI00016E1D1D related cluster n=9 Tax=Tetraodontidae RepID=UPI00016E1D1D Length = 2191 Score = 80.7 bits (197), Expect = 9e-14, Method: Composition-based stats. Identities = 28/192 (14%), Positives = 54/192 (28%), Gaps = 26/192 (13%) Query: 249 AVMFCLMDVSGSMDQSTKDMAKRFYILLYLFLS---RTYKNVEVVYIRHHTQAKEVDEHE 305 A + ++D SGS+ + + + F L L + V+Y +V + Sbjct: 381 ADIVFIIDESGSIGSANFQLMRSFLHSLISGLQVASNRVRVGIVMYNVEP--MAQVFLNT 438 Query: 306 FFYSQE-----------TGGTIVSSALKLMDEVV----KERYNPAQWNIYAAQASDGDNW 350 F E GGT +AL + V + A +DG + Sbjct: 439 FKDKSELLDFIKILPYHGGGTNTGAALNFTLQEVFIKQRGSRKDLGVQQVAVVITDGKSQ 498 Query: 351 ADDSPLCHEILAKKLLPVVRYYSYIEITRRAHQTLWREYEHLQSTFDNFAMQHIRDQDDI 410 + S A V Y+ A + + + F + + Sbjct: 499 DEVSSPA----ANLRRAGVTVYAVGVK--DADKAQLDQIASYPTNKHTFIIDSFTKLKTL 552 Query: 411 YPVFRELFHKQN 422 + + + Sbjct: 553 EASLQRILCQNV 564 Score = 69.1 bits (167), Expect = 3e-10, Method: Composition-based stats. Identities = 29/223 (13%), Positives = 61/223 (27%), Gaps = 23/223 (10%) Query: 207 LEEERLRKEIAELRAKIERVPFIDTFDLRYKNYEKRPDPSSQAVMFCLMDVSGSMDQSTK 266 + L K++A +R + + +A + L+D S S+ Q Sbjct: 951 DALKDLEKQMAMELCDPDRGYHFIPTPPPLIHEISECKKTEKADIIFLVDGSTSITQPKF 1010 Query: 267 DMAKRFYILLYLFLSRTY---KNVEVVYIRHHTQA---------KEVDEHEFFYSQETGG 314 +F + + + ++Y +EV + G Sbjct: 1011 RSMLKFMASMVNQTTVGSDLTRFGVILYSNDANSMFTLKQYSAKREVLQAIAALKSPLGD 1070 Query: 315 TIVSSALKLMDEVVKERYNPAQ---WNIYAAQASDGDNWADDSPLCHEILAKKLLPVVRY 371 T AL + E + +DG+ + D L + L V Sbjct: 1071 TYTGKALTYSLQFFNEEHGGRAALQVPQILMVITDGE--SQDDVEDAARLLRSL--GVEV 1126 Query: 372 YSYIEITRRAHQTLWREYEHLQSTFDNFAMQHIRDQDDIYPVF 414 ++ L + S F ++ + ++I Sbjct: 1127 FTIGIGN-AHDLELLQ-IA--GSPERVFTVKSFGNLENIKQKV 1165 Score = 67.6 bits (163), Expect = 8e-10, Method: Composition-based stats. Identities = 21/181 (11%), Positives = 45/181 (24%), Gaps = 22/181 (12%) Query: 250 VMFCLMDVSGSMDQSTKDMAKRFYILLYLFLSRTYKNVEVVYIRHH-TQAKEVDEHEFFY 308 + L+D S S+ + F L +++ ++ +E + Sbjct: 1 DIVFLVDGSSSIGTDNFQEVRLFLRNFTSGLDIGPDKIQIGLAQYSNDPHQEFLLKDHME 60 Query: 309 SQE-----------TGGTIVSSALKLMDEVV----KERYNPAQWNIYAAQASDGDNWADD 353 TGGT A+ + + A +DGD+ D Sbjct: 61 KTALLAALDSFPYRTGGTETGKAIDFLRTQYFTKEAGSRANQRVPQIAVVITDGDSTDDV 120 Query: 354 SPLCHEILAKKLLPVVRYYSYIEITRRAHQTLWREYEHLQSTFDNFAMQHIRDQDDIYPV 413 + + L + A+Q + F + + + Sbjct: 121 TVPA------QSLRKHGVIVFAIGVGNANQNELESIANRPPKRFKFTIDSFQALQRLTKG 174 Query: 414 F 414 Sbjct: 175 L 175 Score = 64.5 bits (155), Expect = 7e-09, Method: Composition-based stats. Identities = 32/197 (16%), Positives = 58/197 (29%), Gaps = 28/197 (14%) Query: 235 RYKNYEKRPDPSSQAVMFCLMDVSGSMDQSTKDMAKRFYILLYLFLS---RTYKNVEVVY 291 R + + + +A +F L+D SGS++ + K+F I + + V + Sbjct: 574 RRASIREGCLQTDEADIFFLIDHSGSINPADFHDMKKFMIEFLHTFRVGPQHIRIGVVKF 633 Query: 292 IRHHTQAKEVDEHEFFYSQE-----------TGGTIVSSALKLMDEVVKERYNPAQWNI- 339 + E D + + GGT AL+ M + + Sbjct: 634 A--DSPQLEFDLQAYSDVKSLEDAILNIKQIGGGTETGRALEFMSPQFDQALATHGHKVK 691 Query: 340 -YAAQASDGDNWADDSPLCHEILAKKLL-PVVRYYSYIEITRRAHQTLWREYEHLQSTFD 397 Y +DG + + A KL V Y+ A + E Sbjct: 692 EYLVVITDGKSTD-----KVKAPADKLRSQDVVVYAIGVK--NADENQLLEIS--GDPQR 742 Query: 398 NFAMQHIRDQDDIYPVF 414 F + + I Sbjct: 743 TFFVNNFDALRPIKDDI 759 Score = 63.4 bits (152), Expect = 2e-08, Method: Composition-based stats. Identities = 29/212 (13%), Positives = 61/212 (28%), Gaps = 24/212 (11%) Query: 216 IAELRAKIERVPFIDTFDLRYKNYEKRPDPSSQAVMFCLMDVSGSMDQSTKDMAKRFYIL 275 I ++ ++ + ++ LR + + L+D SGS+ K F Sbjct: 760 ITDICSQDGKQIYVLNVVLRANPQALQGHKR---DLIFLIDSSGSIYPEDYKKMKDFMKS 816 Query: 276 LYLFLSRTYKNVEVVYIRHHTQAK------------EVDEHEFFYSQETGGTIVSSALKL 323 + V V +++ T K E+ + Q GGT A+ Sbjct: 817 VIKQSIVGKNEVHVGVMQYSTIQKLVFPLNQYYTKDELSKAIDEMQQIGGGTHTGEAITD 876 Query: 324 MDEVVKERYNPAQWNIY-AAQASDGDNWADDSPLCHEILAKKLLPVVRYYSYIEITRRAH 382 + + R +DG+ + D + +V + A+ Sbjct: 877 VSQYFDARNGGRPDLKQRLVVVTDGE--SQDEVRQPAEALRAKGVIVYSIGVV----AAN 930 Query: 383 QTLWREYEHLQSTFDNFAMQHIRDQDDIYPVF 414 + E + +A + D+ Sbjct: 931 TSQLLEIS--GTPNRMYAGRDFDALKDLEKQM 960 >UniRef50_A4BKH3 Putative uncharacterized protein n=1 Tax=Reinekea blandensis MED297 RepID=A4BKH3_9GAMM Length = 553 Score = 80.7 bits (197), Expect = 9e-14, Method: Composition-based stats. Identities = 28/182 (15%), Positives = 50/182 (27%), Gaps = 14/182 (7%) Query: 237 KNYEKRPDPSSQAVMFCLMDVSGSMDQSTKDMAKRFYILLYLFLSRTYKNVEVVYIR--- 293 + ++ + + DVSGSMD K F+S V + Sbjct: 367 RIWKDKKSGGRPIAAMFVADVSGSMDGDRIRALKIALDESANFVSSRNSIGLVTFNDRVN 426 Query: 294 -------HHTQAKEVDEHEFFYSQETGGTIVSSA-LKLMDEVVKERYNPAQWNIYAAQAS 345 Q K GGT + A L E++ + + S Sbjct: 427 VDLPIREFDLQQKSQFLGAVERMSAGGGTATNDAILVAAHELLNFAKTHPEHKLTIFVLS 486 Query: 346 DGDNWADDSPLCHEILAKKLLPVVRYYSYIEITRRAH--QTLWRE-YEHLQSTFDNFAMQ 402 DG+ E + + L V +Y + L Y + + + Sbjct: 487 DGETRNGLPLGDVEKVIQMLNIPVHSIAYGFESADLKKVSGLVEASYTESSTGSAAYQIG 546 Query: 403 HI 404 ++ Sbjct: 547 NL 548 >UniRef50_C3Y4Z7 Putative uncharacterized protein n=1 Tax=Branchiostoma floridae RepID=C3Y4Z7_BRAFL Length = 1236 Score = 80.7 bits (197), Expect = 9e-14, Method: Composition-based stats. Identities = 31/319 (9%), Positives = 78/319 (24%), Gaps = 33/319 (10%) Query: 116 VFQISKDEYLDLLFEDLALPNLKQNQQRQLTEYKTHRAGYTANGVPANISVVRSLQNSLA 175 + L + + + ++ Q + + + V + + ++ Q + Sbjct: 499 ARPFAPPGMEAELEDYQKIVDQQKQQLQVPQKPPPKPKTFKDRHVAVSEKLSKASQEKIK 558 Query: 176 RRTAMTAGKRRELHALEENLAIISNSEPAQLLEEERLRKEIAELRAKIERVPFIDTFDLR 235 GK + ++ S + ++R + +++ + Sbjct: 559 ELIIEHGGKPVPFVKKKAAYNMLVCSRAEFNDKSIKVRDALE--NQVRDKMKVLLELSAD 616 Query: 236 YKNYEKRPDPSSQAVMFCLMDVSGSMDQS-------TKDMAKRFYILLYLFLSRTYKNVE 288 K+ E + ++D S SMD + F L+ + Sbjct: 617 EKSLEASGHQRTPLRFVAVIDESYSMDDRIGRDKLTLIQRMQIFAELMAKDFKDEDQMGI 676 Query: 289 VVYIRH----------HTQAKEVDEHEFFYSQETGGTIVSSALKLMDEVVKERYNPAQWN 338 V + + ++ + G T +S L + K N Sbjct: 677 VTFANDAKVVLPMTRMDSSGRDSALEKIQNISTRGQTNLSDGLLSAISMFKGSSGSDFHN 736 Query: 339 IYAAQASDGD-NWADDSPLCHEILAKKLLPV--------VRYYSYIEITRRAHQTLWREY 389 +DG N + + ++ L E Sbjct: 737 G-IILFTDGQANQGIIDAAELVQEYNSKMAGLGEGVCLPISTFTIG----DYRPKLLCEV 791 Query: 390 EHLQSTFDNFAMQHIRDQD 408 + F + D + Sbjct: 792 AQNLGSDAFFWLSDDTDFE 810 Score = 62.6 bits (150), Expect = 3e-08, Method: Composition-based stats. Identities = 16/182 (8%), Positives = 37/182 (20%), Gaps = 29/182 (15%) Query: 243 PDPSSQAVMFCLMDVSGSM-----DQSTKDMAKRFYILLYLFLSRTYKNVEVVYIRHHTQ 297 ++D SGSM +++ K F ++ + V + Sbjct: 48 NKTRLPLRFVAVIDESGSMASTIGNETLIYKMKIFARVMVRKMKAEDMLGIVGFDSDARV 107 Query: 298 AKEVDEHEFF----------YSQETGGTIVSSALKLMDEVVKERYNPAQWNIYAAQASDG 347 + + + T + + ++ A +DG Sbjct: 108 LLPITQMDKDGKKAAMDSIESLSAKTFTNLCEGILTGAKLFDTTEECAHCRNGMVVFTDG 167 Query: 348 -DNWADDSPLCHEILAKKLLPV---------VRYYSYIEITRRAHQTLWREYEHLQSTFD 397 N + + + L E + Sbjct: 168 IANQGITDADGIVSAFNSIAQQTFGSNFCLPISTLTIG----DYKPDLLLEISQRLGSDA 223 Query: 398 NF 399 F Sbjct: 224 FF 225 >UniRef50_A9IZC4 Cobalamin biosynthesis protein CobT n=29 Tax=Rhizobiales RepID=A9IZC4_BART1 Length = 636 Score = 80.7 bits (197), Expect = 1e-13, Method: Composition-based stats. Identities = 53/465 (11%), Positives = 108/465 (23%), Gaps = 89/465 (19%) Query: 9 LNGKNKSMVNRQRFLRRYKAQIKQSISEAINKRSVTDVDSGESVS---IPTEDISE--PM 63 LN + N+Q F + Q+ ++ + + ++D ++ + + P + E Sbjct: 196 LNELTHHIYNQQAFAQIV-CQMLATLKMTVQQDEISDSENNKKLKNACEPQDQAEEENKN 254 Query: 64 FHQGRGGLRHRVHPGNDHFVQNDRIERPQGGGGGSGSGQGQASQDGEGQDEFVFQISKDE 123 + + +D + + Q + + S E Sbjct: 255 KKHAQSEEQTSTEQESDVQDEGKTQATQTNDNDQADGEQKNGPWQEKPS-KPKRLFSDSE 313 Query: 124 YLDLLFEDLALPNLKQNQQRQLTEYKTHRAGYTANGVPANISVVRSLQNSLARRTAMTAG 183 ++ L + + Q ++ E + + + + + ++ R A Sbjct: 314 QMERLGDY----KIFTRQFDEILEATDFCSENELDHLRRCLDKQANHLQNIVGRLA---- 365 Query: 184 KRRELHALEENLAIISNSEPAQLLEEERLRKEIAELRAKIERVPFIDTFDLRYKNYEKRP 243 + AQ + E L ID E Sbjct: 366 ------------NRLQRRLMAQQNRNWKFDLEEGYLDTARLPRLIIDPMQPLSFKMESNT 413 Query: 244 DPSSQAVMFCLMDVSGSMDQSTKDMAKRFYILLYLFLSRT-------------------- 283 V+ L+D SGSM +A +L L R Sbjct: 414 QFR-DTVVSLLIDNSGSMRGRPISVAASCADILAQTLERCGVKVEILGFTTKTWKGGKSR 472 Query: 284 ---------------YKNVEVVYIRHHTQAKEVDEHEFFYSQET--GGTIVSSALKLMDE 326 ++Y T + + QE I AL + Sbjct: 473 EKWLEKHKPHHPGRLNDLCHIIYKSADTPWRRARRNLGLMMQEGLLKENIDGEALIWAHQ 532 Query: 327 VVKERYNPAQWNIYAAQASDGDNWADDS---------PLCHEILAKKLL--PVVRYYSYI 375 + R + SDG D + + +++ + + Sbjct: 533 RLLSRR---EQRRILMVISDGAPVDDSTLSVNSSNYLEKHLRAVIQEIQTHSPIELIAIG 589 Query: 376 EITRRAHQTLWREYEHLQSTFDNFAMQHIRDQDDIYPVFRELFHK 420 M D I LF Sbjct: 590 IGHDVTRYYQ----------RAVTIMNAEELADAITKQLEALFSD 624 >UniRef50_B3RP11 Putative uncharacterized protein n=1 Tax=Trichoplax adhaerens RepID=B3RP11_TRIAD Length = 356 Score = 80.7 bits (197), Expect = 1e-13, Method: Composition-based stats. Identities = 21/152 (13%), Positives = 40/152 (26%), Gaps = 21/152 (13%) Query: 235 RYKNYEKRPDPSSQAVMFCLMDVSGSMDQSTKDMAKRFYILLYLFLSRTYKNVEVVY--- 291 ++K + S + ++D SGSM K + L + + + Sbjct: 204 KFKQWYVNAASPSSKRLVLVLDRSGSMSGDRFLKVKEAATAVLDSLGPNDEIGVIAFDDE 263 Query: 292 -----------IRHHTQAKEVDEHEF--FYSQET-GGTIVSSALKLMDEVVKERYNPAQW 337 + T + +F Q G T ALK +++ Sbjct: 264 IRIHGGCKVTTVSPATPQSIIFLKDFINNKIQPEFGSTGYVPALKHAFDMLSTNMTSKAK 323 Query: 338 NI--YAAQASDGDNWADDSPLCHEILAKKLLP 367 +DG D+ + K Sbjct: 324 TKTNLIVFLTDGHP--DEPESQILDVIKNRNE 353 >UniRef50_A0KNK1 Putative uncharacterized protein n=1 Tax=Aeromonas hydrophila subsp. hydrophila ATCC 7966 RepID=A0KNK1_AERHH Length = 552 Score = 80.7 bits (197), Expect = 1e-13, Method: Composition-based stats. Identities = 40/294 (13%), Positives = 84/294 (28%), Gaps = 20/294 (6%) Query: 105 ASQDGEGQDEFVFQISKDEYLDLLFEDLALPNLKQNQQRQLTEYKTHRAGYTANGVPANI 164 G + ++ SK L + + +L + E + Sbjct: 259 GYGQILGDIDTTYEKSKALLLACSGANFSYNDL--KLCKDDIEPLAKQLQQNHAIKELTY 316 Query: 165 SVVRSLQNSLARRTAMTAGKRRELHALEENLAIISNSEPAQLLEEERLRKEIAELRAKIE 224 + R+ + ++ A + + + + L Sbjct: 317 KMGRAYISEEKKKQA--RIPHASKSEVHGTHRSEDLARVLPTELLNLEDEALETLFYARF 374 Query: 225 RVPFIDTFDLRYKNYEKRPD----PSSQAVMFCLMDVSGSMDQSTKDMAKRFYILLYLFL 280 + T++L+ + +D SGSM + A+ + + L Sbjct: 375 LERNLMTYELQGTTCTSGEQLELEQKRTGPVVACLDTSGSMSGAPLLKARALLLAVSAVL 434 Query: 281 SRTYKNVEVVYIRHHTQAKEVDEHEFFYSQ---------ETGGTIVSSALKLMDEVVKER 331 + +++ VV + + +E HE + GGT + L E++++ Sbjct: 435 QQEARSLHVVLFGDNGELREYAIHEENSASGLLHFLRQGFGGGTDFETPLNRACEIIRDA 494 Query: 332 YNPAQWNIYAAQASDGDNWADDSPLCHEILAKKLLPVVRYYSYIEITRRAHQTL 385 SDGD D + H KK+L YS + +R Sbjct: 495 --KEYEKADILMISDGDCVLSDDYIEHLQTRKKIL-DCSIYSVLCHGQRVADRF 545 >UniRef50_A2UW24 Von Willebrand factor, type A n=1 Tax=Shewanella putrefaciens 200 RepID=A2UW24_SHEPU Length = 599 Score = 80.7 bits (197), Expect = 1e-13, Method: Composition-based stats. Identities = 47/370 (12%), Positives = 100/370 (27%), Gaps = 32/370 (8%) Query: 67 GRGGLRHRVHPGNDHFVQNDRIERPQGGGGGSGSGQGQASQDGEGQDEFVFQISKDEYLD 126 G + +D + + G GS G A + D + + E+LD Sbjct: 246 GSDDDSDDANGSDDDSDDANGSDDDSDGAEGSQGDTGDADSSNDSGDINGLKSALREFLD 305 Query: 127 LLFEDLALPNLKQNQQRQLTEYKTHRAGYTANGVPANISVVRSLQNSLARRTAMTAGKRR 186 DL +L++ + ++ + G P +S S+ + + Sbjct: 306 DESGDLE-DSLEKALKAEIEAE---KEGDDNEPDPREVSYNPSVLPAPIQNQNFAHADSI 361 Query: 187 EL-HALEENLAIISNSEPAQLLEEERLRKEIAELRAKIERVPFIDTFDLR-YKNYEKRPD 244 + E + + +L + + R + D+R +K+ + Sbjct: 362 QKNARKESLFLAQRLHGLVESKAQLQLDRRSSGRRLINNAGLRLMQNDMRVFKHESQVDM 421 Query: 245 PSSQAVMFCLMDVSGSMDQSTKDMAKRFYILLYLFLSR------------TYKNVEVVYI 292 P + L+D S SM + L LS+ + Sbjct: 422 P--NTAVTLLLDQSNSMCGKAYQTSVESTYALVEALSKINLVKTSVLGFGNSTESVIALK 479 Query: 293 RHH-TQAKEVDEHEFFYSQETGG-TIVSSALKLMDEVVKERYNPAQWNIYAAQASDGDNW 350 T AK + + G T +++ L + Y + +DG Sbjct: 480 GFEETPAKCLTKLASSS--ADGYCTPLATGLWAALNQL---YTRTEDRKVVLVVTDGQPH 534 Query: 351 ADDSPLCHEILAKKLLPVVRYYSYIEITRRAHQTLWREYEHLQSTFDNFAMQHIRDQDDI 410 +A+ V Y + + F Q +++ Sbjct: 535 GFQYCKNL--IAEMQASNVEVYGIGIGN-DLNLPTLQSL--FGKQFAIKVDQLSDLGNEV 589 Query: 411 YPVFRELFHK 420 + + + Sbjct: 590 FKIAEGILLD 599 >UniRef50_B7Q438 Neurogenic locus notch, putative n=1 Tax=Ixodes scapularis RepID=B7Q438_IXOSC Length = 1597 Score = 80.7 bits (197), Expect = 1e-13, Method: Composition-based stats. Identities = 28/203 (13%), Positives = 60/203 (29%), Gaps = 23/203 (11%) Query: 229 IDTFDLRYKNYEKRPDPSSQAVMFCLMDVSGSMDQSTK--DMAKRFYILLYLFLSRTY-K 285 + DL+ +++ + ++D SGS+ ++A + L +S Y + Sbjct: 23 VLDADLQLFVSTIERYSTTKNDIVFVLDESGSIGADVFPAELAFTEMVARLLVVSPEYSR 82 Query: 286 NVEVVYI-----------RHHTQAKEVDEHEFFYSQE-TGGTIVSSALKLMDEVVKERYN 333 + + HE +GGT AL E++ Sbjct: 83 LTVMTFSNDNLVHIDQVGSSGDTNMCKFVHELNQIPYRSGGTRTREALGYAGEILWNARQ 142 Query: 334 PAQWNIYAAQASDGDNWADDSPLCHEILAKKLLPVVRYYSYIEITRRAHQTLWREYEHLQ 393 A N SDG + P L + V+ + ++ + Sbjct: 143 EA--NRIVVLISDGQANSGSEPSEIARLLRVKGIVI----FGVGVAHINKDELLDVA--S 194 Query: 394 STFDNFAMQHIRDQDDIYPVFRE 416 S + +++ + R+ Sbjct: 195 SPAHTYMLRNFEYIKKVNKDLRK 217 >UniRef50_A4YDL0 von Willebrand factor, type A n=1 Tax=Metallosphaera sedula DSM 5348 RepID=A4YDL0_METS5 Length = 394 Score = 80.7 bits (197), Expect = 1e-13, Method: Composition-based stats. Identities = 23/217 (10%), Positives = 49/217 (22%), Gaps = 49/217 (22%) Query: 238 NYEKRPDPSSQAVMFCLMDVSGSMD---------------------------------QS 264 +F ++DVSGSM + Sbjct: 26 TIVPERVKPVPLDLFIVLDVSGSMGIIDNPPEVDDSLIAGTAEVDGHVVRYLKDDIGVNN 85 Query: 265 TKDMAKRFYILLYLFLSRTYKNVEVVYIRHHTQAKE-----VDEHEFFYSQETGGTIVSS 319 ++A L + + + + H G T + S Sbjct: 86 RLEVALEAIRNLLENADTSTRVTIITFSDHVNVLCRRVTPSTALEHLEEIVPDGNTALYS 145 Query: 320 ALKLMDEVVKERYNPAQWNIYAAQASDGDNWADDSPLCHEILAKKLLPVVRYYSYIEITR 379 A+K ++ E +DG + + L ++ ++ Sbjct: 146 AVKKAISLIDEHPAR------VLLITDGYPTDVEDETEYSKL--EVPRFSQFIPIGVG-- 195 Query: 380 RAHQTLWREYEHLQSTFDNFAMQHIRDQDDIYPVFRE 416 + + R L + + + I R Sbjct: 196 EYNAKILRSLADLSNGRFYHVND-VSEISRIMEEERA 231 >UniRef50_Q466I6 Putative uncharacterized protein n=3 Tax=Methanosarcina RepID=Q466I6_METBF Length = 562 Score = 80.7 bits (197), Expect = 1e-13, Method: Composition-based stats. Identities = 42/297 (14%), Positives = 91/297 (30%), Gaps = 27/297 (9%) Query: 116 VFQISKDEYLDLLFEDLALPNLKQNQQRQLTEYKTHRAGYT-------ANGVPANISVVR 168 F +E L+ LF+ L L ++N + E K + + + + Sbjct: 233 EFVTEMEENLE-LFDTLTLLFPQRNWSYSVKELKKEPFYVQLKMLKNYSTFFEKSPDLKK 291 Query: 169 SLQNSLARRTAMT----AGKRRELHALEENLAIISNSEPAQLLEEERLRKEIAELRAKIE 224 + R ++ S + + + L + + Sbjct: 292 IMDFIGRREFDPPSDRIRLSPFGKDRIQTVRFSDSINNLLPMEAAKLLNPSLKKKFYADM 351 Query: 225 RVPFIDTFDLRYKNYEKRPDPSSQAVMFCLMDVSGSMDQSTKDMAKRFYILLYLFLSRTY 284 + ++ K+Y P + M L+D SGSM + + +AK + + + Sbjct: 352 LEGKLLSYQFLGKHYTGPPRIKPRGPMIVLVDTSGSMHGAPQTLAKSAVLAMAKLMLSQQ 411 Query: 285 KNVEVVYIRHHTQAKEVDEHEFFYSQE----------TGGTIVSSALKLMDEVVKERYNP 334 ++++V+ +Q E++ E GGT ++AL + +KE+ Sbjct: 412 RDMKVILFASTSQHLEIELSSRKKMSEKFLNFLLYTFGGGTDFNTALASGLKSLKEKDFQ 471 Query: 335 AQWNIYAAQASDGDNWADDSPLCHEILAKKLLPVVRYYSYIEITRRAHQTLWREYEH 391 +DG ++ S ++ Y I + E Sbjct: 472 GAD---LLFITDGK--SEVSDELVLARWEEAKKKYNAKVYSLIVGSSGAGGLSEISD 523 >UniRef50_Q3KGA0 Putative secreted protein, hemolysin n=1 Tax=Pseudomonas fluorescens Pf0-1 RepID=Q3KGA0_PSEPF Length = 2887 Score = 80.7 bits (197), Expect = 1e-13, Method: Composition-based stats. Identities = 22/160 (13%), Positives = 37/160 (23%), Gaps = 19/160 (11%) Query: 246 SSQAVMFCLMDVSGSMDQ-------STKDMAKRFYILLYLFLSR--TYKNVEVVYIRHHT 296 + + ++D+SGSM S ++AK+ L K V + + T Sbjct: 2078 EIDSNILIVLDISGSMADASGVPGLSRLELAKQAISALLDKYDDLGDVKVQLVTFSSNAT 2137 Query: 297 Q------AKEVDEHEFFYSQETGGTIVSSALKLMDEVVKERYNPAQWNIYAAQASDGDNW 350 + GGT +A+ M SDG Sbjct: 2138 DRTSVWVDVATAKTLLAGLSAGGGTNYDAAVATMYNAFNTSGKLTGAQNVGYFFSDGKPN 2197 Query: 351 ADD----SPLCHEILAKKLLPVVRYYSYIEITRRAHQTLW 386 D + A+ Sbjct: 2198 EGDIGTADEATLKAFLDANNIKNYAIGLGSGVSNANLDPL 2237 >UniRef50_Q8C6K9 Collagen alpha-6(VI) chain n=26 Tax=cellular organisms RepID=CO6A6_MOUSE Length = 2265 Score = 80.7 bits (197), Expect = 1e-13, Method: Composition-based stats. Identities = 32/300 (10%), Positives = 71/300 (23%), Gaps = 41/300 (13%) Query: 153 AGYTANGVPANISVVRSLQNSLARRT--------AMTAGKRRELHALEENLAIISNSEPA 204 G V S + + ++ + E + + Sbjct: 700 TGSALTFVSQYFSPDKGARPNVRKFLILITDGEAQDIVRDPAIALRKEGVIIYSVGVFGS 759 Query: 205 QLLEEERLRKEIAELRAKIE---RVPFIDTFDLRYKNYEKRPDPSSQAVMFCLMDVSGSM 261 + + E + + + D L + + + ++D SGS+ Sbjct: 760 NVTQLEEISGKPEMVFYVENFDILQHIEDDLVLGICSPREECKRIEVLDVVFVIDSSGSI 819 Query: 262 DQSTKDMAKRFYILLYL------------FLSRTYKNVEVVYIRHHTQAKEVDEHEFFYS 309 D ++ K F I L L + Y+ EV Sbjct: 820 DYQEYNIMKDFMIGLVKKADVGKNQVRFGALKYADDPEVLFYLDELGTKLEVVSVLQNDH 879 Query: 310 QETGGTIVSSAL---KLMDEVVKERYNPAQWNIYAAQASDGDNWADDSPLCHEILAKKLL 366 G T + AL M + +DG++ + + Sbjct: 880 PMGGNTYTAEALAFSDHMFTEARGSRLHKGVPQVLIVITDGESHDAEKLNTTAKALRDK- 938 Query: 367 PVVRYYSYIEITRRAHQTLWREYEHLQST-FDNFAMQHIRDQDDIYPVFRELFHKQNATA 425 + + + W S+ F + + + +F +A+ Sbjct: 939 -GILVLAVGIAGANS----WELLAMAGSSDKYYFV--------ETFGGLKGIFSDVSASV 985 Score = 64.5 bits (155), Expect = 7e-09, Method: Composition-based stats. Identities = 25/166 (15%), Positives = 43/166 (25%), Gaps = 23/166 (13%) Query: 240 EKRPDPSSQAVMFCLMDVSGSMDQSTKDMAKRFYILLYLFLSRTYKNVEVVYIRHH---- 295 K + + LMD S S+ K F + + + V + + Sbjct: 989 SKVDCEIEKVDLVFLMDGSNSIHPDDFQKMKGFLVSVVQDFDVSLNRVRIGVAQFSDSYR 1048 Query: 296 --------TQAKEVDEHEFFYSQETGGTIVSSALKLMDEVVK---ERYNPAQWNIYAAQA 344 T +E+ Q G T + AL+ + + A Sbjct: 1049 SEFLLGTFTGEREISTQIEGIQQIFGYTHIGDALRKVKYYFQPDMGSRINAGTPQVLLVL 1108 Query: 345 SDGDNWADDSPLCHEILAKKLL-PVVRYYSYIEITRRAHQTLWREY 389 +DG S A++L V YS + Sbjct: 1109 TDGR-----SQDEVAQAAEELRHKGVDIYSVGIG--DVDDQELVQI 1147 Score = 64.5 bits (155), Expect = 7e-09, Method: Composition-based stats. Identities = 25/191 (13%), Positives = 50/191 (26%), Gaps = 27/191 (14%) Query: 240 EKRPDPSSQAVMFCLMDVSGSMDQSTKDMAKRFYILLYLFLSRTYKNVEVVYIRHHTQAK 299 + +A + L+D SGS+ K F L V++ ++ + K Sbjct: 611 AEEACRDMKADIMFLVDSSGSIGPENFSKMKMFMKNLVSKSQIGADRVQIGVVQFSHENK 670 Query: 300 EVDE-----------HEFFYSQETGGTIVSSALKLMDEVVKERYNPAQ-----WNIYAAQ 343 E + + G T ++ V + ++P + + Sbjct: 671 EEFQLNTFMSQSDIANAIDRMTHIGETTLTG---SALTFVSQYFSPDKGARPNVRKFLIL 727 Query: 344 ASDGDNWADDSPLCHEILAKKLLPVVRYYSYIEITRRAHQTLWREYEHLQSTFDNFAMQH 403 +DG+ + V YS T E F +++ Sbjct: 728 ITDGEAQDIVRDPAIAL----RKEGVIIYSVGVFGSNV--TQLEEIS--GKPEMVFYVEN 779 Query: 404 IRDQDDIYPVF 414 I Sbjct: 780 FDILQHIEDDL 790 Score = 63.4 bits (152), Expect = 2e-08, Method: Composition-based stats. Identities = 35/286 (12%), Positives = 75/286 (26%), Gaps = 31/286 (10%) Query: 161 PANISVVRSLQNSLARRTAMTAGKRRELHALEENLAIISNSEPAQLLEEERLRKEIAELR 220 A+ V +L R E +E I S+ + + + Sbjct: 341 RASEDNVTKAAVNLRREGVTIFTMGIEGANPDELEKIASHPAEQFTSKLGNFSELATHNQ 400 Query: 221 AKIERVPFIDTFDLRYKNYEKRPDPSS-----QAVMFCLMDVSGSMDQSTKDMAKRFYIL 275 ++++ T + + S+ +A ++ L+D SGS + K F Sbjct: 401 TFLKKLRNQITHTVSVFSERTETLKSACVDTEEADIYLLIDGSGSTQPTDFHEMKTFLSE 460 Query: 276 LY---LFLSRTYKNVEVVYIRHHTQAKEVDE--------HEFFYSQETGG-TIVSSALKL 323 + + V Y E+ + ++ GG T +AL Sbjct: 461 VVGMFNIAPHKVRVGAVQYADTWDLEFEISKYSNKPDLGKAIENIRQMGGNTNTGAALNF 520 Query: 324 MDEVVKERYNPAQ--WNIYAAQASDGDNWADDSPLCHEILAKKLLPVVRYYSYIEITRRA 381 ++++ + + K +R ++ A Sbjct: 521 TLKLLQRAKKERGSKVPCHLVVLT----NGMSRDSVLGPAHKLREENIRVHAIGVK--EA 574 Query: 382 HQTLWREYEHLQSTFDNFAMQHIRDQDDIYPVFRELFHKQNATAKG 427 +QT RE + D R ++ + Sbjct: 575 NQTQLREIAGEEKRVYYVHEF------DALRNIRNQVVQEICAEEA 614 Score = 58.0 bits (138), Expect = 7e-07, Method: Composition-based stats. Identities = 25/314 (7%), Positives = 65/314 (20%), Gaps = 26/314 (8%) Query: 126 DLLFEDLALPNLKQNQQRQLTEYKTHRAGYTANGVPANISVVRSLQNSLARRTAMTAGKR 185 + L E R ++ + ++ + ++ Sbjct: 108 NALQEAHRTYFSAPTNGRDKKQFPPILVVLASAESEDDVEEAAKALREDGVKIISVGVQK 167 Query: 186 RELHALEENLAIISNSEPAQLLEEERLRKEIAELRAKIERVPFIDTFDLRYKNYEKRPDP 245 L+ + + + E+ + + D+ + Sbjct: 168 ASEENLKAMATSQFHFNLRTARDLSVFAPNMTEIIKDVTQYREGMADDI----IVEACQG 223 Query: 246 SSQAVMFCLMDVSGSMDQSTKDMAKRFYILLYLFLSRT---YKNVEVVYIR--------H 294 S A + L+D++ + Q D K F L + V Y Sbjct: 224 PSVADVVFLLDMAINGSQEDLDHLKAFLGESISALDIKENCMRVGLVTYSNETRVISSLS 283 Query: 295 HTQAKEVDEHEFFYSQET-GGTIVSSALKLMDEVV----KERYNPAQWNIYAAQASDGDN 349 K G +AL+ + + + A + Sbjct: 284 TGNNKTEVLQRIQDLSPQVGQAYTGAALRKTRKEIFSAQRGSRKNQGVPQIAVLVT---- 339 Query: 350 WADDSPLCHEILAKKLLPVVRYYSYIEITRRAHQTLWREYEHLQSTFDNFAMQHIRDQDD 409 + V ++ + + + + + + Sbjct: 340 HRASEDNVTKAAVNLRREGVTIFTMGIEGANPDE--LEKIASHPAEQFTSKLGNFSELAT 397 Query: 410 IYPVFRELFHKQNA 423 F + Q Sbjct: 398 HNQTFLKKLRNQIT 411 Score = 50.3 bits (118), Expect = 1e-04, Method: Composition-based stats. Identities = 15/171 (8%), Positives = 37/171 (21%), Gaps = 30/171 (17%) Query: 249 AVMFCLMDVSGSMDQSTKDMAKRFYILLYLFLS---------RTYKNVEVV--------- 290 L+D S + + + + F L + + Sbjct: 1963 LDTAFLLDGSRHVGSAEFEDMRDFLEALLDHFEITSEPETSVTGDRVALLSHAPLDFLPN 2022 Query: 291 ---------YIRHHTQAKEVDEHEFFY--SQETGGTIVSSALKLMDEVVKERYNPAQWNI 339 + +K + + Q G + AL + V + N Sbjct: 2023 TQRSPVRTEFNLTSYSSKRLMKRHVDQAVQQLHGDAFLGHALGWALDNVFLNTPNLRRNK 2082 Query: 340 YAAQASDGDNWADDSPLCHEILAKKLLPVVRYYSYIEITRRAHQTLWREYE 390 S G+ D+ + + + + + + Sbjct: 2083 VIFVISAGETSHLDAETLKKESLRAKCHGYALFVF-SLGPDWDDKELEDLA 2132 Score = 49.9 bits (117), Expect = 2e-04, Method: Composition-based stats. Identities = 25/194 (12%), Positives = 52/194 (26%), Gaps = 28/194 (14%) Query: 246 SSQAVMFCLMDVSGSMDQSTKDMAKRFYILLYLFLSRTYKNVEVVYIRHHTQAKEVDE-- 303 A + L+D S + + + K F + L V ++ + Sbjct: 22 PEYADVVFLVDSSDHLGLKSFPLVKTFIHKMISSLPIEANKYRVALAQYSDALHNEFQLG 81 Query: 304 ------HEFFYSQ-----ETGGTIVSSALKLM----DEVVKERYNPAQWNIYAAQASDGD 348 + + G + +AL+ + Q+ Sbjct: 82 TFKNRNPMLNHLKKNFGFIGGSLKIGNALQEAHRTYFSAPTNGRDKKQFPPILVVL---- 137 Query: 349 NWADDSPLCHEILAKKLLP-VVRYYSYIEITRRAHQTLWREYEHLQSTFDNFAMQHIRDQ 407 + +S E AK L V+ S ++A + + +F ++ RD Sbjct: 138 -ASAESEDDVEEAAKALREDGVKIISVGV--QKASEENLKAMATS---QFHFNLRTARDL 191 Query: 408 DDIYPVFRELFHKQ 421 P E+ Sbjct: 192 SVFAPNMTEIIKDV 205 >UniRef50_A6DKL3 Putative uncharacterized protein n=1 Tax=Lentisphaera araneosa HTCC2155 RepID=A6DKL3_9BACT Length = 890 Score = 80.7 bits (197), Expect = 1e-13, Method: Composition-based stats. Identities = 27/207 (13%), Positives = 51/207 (24%), Gaps = 25/207 (12%) Query: 241 KRPDPSSQAVMFCLMDVSGSMDQS------TKDMAKRFYILLYLFLSRTYKNVEVVYIRH 294 K ++ + +MD SGSM + ++A L + Sbjct: 378 KNEHRKLRSNLSIVMDRSGSMGMTVKGGKTKMELANEGAAQTIELLGAMDSVSVIAVDTE 437 Query: 295 HT---------QAKEVDEHEFFYSQETGGTIVSSALKLMDEVVKERYNPAQWNIYAAQAS 345 A E+ + GG V + L+ ++ R + S Sbjct: 438 AHAIVPQTVLKDAPEIASQARRVKSQGGGIYVYTGLEESWRQLEGREG----QKHVILFS 493 Query: 346 DGDNWADDSP-LCHEILAKKLLPVVRYYSYIEITRRAHQTLWREYEHLQSTFDNFAMQHI 404 D ++ + K V + E + + F + Sbjct: 494 DSNDSEEPGRYKELLADMKDEGMTVSVIALGE-RTDVDSPFLIDIANRGRGRIFFTDDPL 552 Query: 405 RDQ----DDIYPVFRELFHKQNATAKG 427 + V R F K+ K Sbjct: 553 SLPSIFAQETVTVARSAFLKEVTATKS 579 >UniRef50_B3RZ89 Putative uncharacterized protein n=1 Tax=Trichoplax adhaerens RepID=B3RZ89_TRIAD Length = 933 Score = 80.3 bits (196), Expect = 1e-13, Method: Composition-based stats. Identities = 16/171 (9%), Positives = 45/171 (26%), Gaps = 16/171 (9%) Query: 245 PSSQAVMFCLMDVSGSMDQSTKDMAKRFY-ILLYLFLSRTYKNVEVVYIRHHTQAKEVDE 303 + + ++DVSGSM + ++ L + + + + + Sbjct: 309 RAKKPRTVLVLDVSGSMRGKPMEQLQQAATNFLLNVAQNGSFVGIITFSSAASIRSSLVQ 368 Query: 304 HEFFYSQ----------ETGGTIVSSALKLMDEVVKERYNPAQWN-IYAAQASDGDNWAD 352 + +G T + + ++ +++K + SDG Sbjct: 369 INDDADRQRLILLLPSGASGSTSIGAGIQAGVKILKASVGNKSPSGGTLIVLSDGRENRS 428 Query: 353 DSPLCHEILAKKLLPVVRYYSYIEITRRAHQTLWREYEHLQSTFDNFAMQH 403 + + V+ S+ + + + F Sbjct: 429 PTIADVKKQVLDNKITVQGISFGSLASV----KLQSLCYETGGLSFFVPND 475 >UniRef50_C9ZGQ6 Putative membrane protein n=3 Tax=Streptomyces RepID=C9ZGQ6_STRSW Length = 534 Score = 80.3 bits (196), Expect = 1e-13, Method: Composition-based stats. Identities = 28/271 (10%), Positives = 70/271 (25%), Gaps = 23/271 (8%) Query: 140 NQQRQLTEYKTHRAGYTANGVPANISVVRSLQNSLARRTAMTAGKRRELHALEENLAIIS 199 + LT + TA+ ++++ + RR + E Sbjct: 253 KSRPDLTVIRPRDGVVTADYPLSSLASTGTDVRDDVRRLTDALRTPDVQRLITE------ 306 Query: 200 NSEPAQLLEEERLRKEIAELRAKIERVPFIDTFDLRYKNYEKRPDPSSQAVMFCLMDVSG 259 + ++ + R + P + + + + ++D SG Sbjct: 307 RTLRRPVVASVPPAAGLDTTRRRELPFPGSRSVAVGLLDAYENDLRRPSRT-VYVLDTSG 365 Query: 260 SMDQSTKDMAKRFYILLYLFLSRTYKNVEVVYIR-------------HHTQAKEVDEHEF 306 SM+ D K L + + + + + Sbjct: 366 SMEGDRLDRLKTALTELTGDFRDREEVTLMPFGSDVKSVRTHVVRPADPKAGLDGIRADT 425 Query: 307 FYSQETGGTIVSSALKLMDEVVKERYNPAQWNIYAAQASDGDNWADDSPLCHEILAKKLL 366 G T + ++L+ E + + +DG+N SP + +L Sbjct: 426 RKLSAAGETAIYTSLRRAYEHLGAVDRDTFTS--IVLMTDGENTEGASPADFDDFYGRLP 483 Query: 367 PVVRYY-SYIEITRRAHQTLWREYEHLQSTF 396 R+ + + + + + Sbjct: 484 DAARHIPVFPILFGDSDRDELEHIAEVTGGR 514 >UniRef50_C6PVI2 Vault protein inter-alpha-trypsin domain protein n=4 Tax=Clostridium RepID=C6PVI2_9CLOT Length = 531 Score = 80.3 bits (196), Expect = 1e-13, Method: Composition-based stats. Identities = 22/219 (10%), Positives = 55/219 (25%), Gaps = 27/219 (12%) Query: 216 IAELRAKIERVPFIDTFDLRYKNYEKRPDPSSQAVMFCLMDVSGSMDQSTKDMAKRFYIL 275 +R + +YE S L+D+S +M + AK + Sbjct: 243 YEYKENGEDRRVVYLRMIPKLDDYEMEDKES----YVFLIDISDTMKGDKLEQAKNALQM 298 Query: 276 LYLFLSRTYKNVEVV------------YIRHHTQAKEVDEHEFFYSQETGGTIVSSALKL 323 L + +++ + + A+K Sbjct: 299 CIRNLEEGDTFDIIAMGETLKYFWDEGMAEFNSETLKKASQWIENLTTEDDADIFGAIKY 358 Query: 324 MDEVVKERYNPAQWNIYAAQASDGDNWADDSPLCHEILAKKLLPVVRYYSYIEITRRAHQ 383 E N N + + + ++ L R +++ + + Sbjct: 359 SLE------NEGGHNT-ILIFT---DDEVEEEDEILDYVRENLGDNRIFTFGFDSETNN- 407 Query: 384 TLWREYEHLQSTFDNFAMQHIRDQDDIYPVFRELFHKQN 422 + H F + R + + F+ + + + Sbjct: 408 YFLNKLAHESFGKAEFINKGRRIEYVVLRQFKRIQNPEV 446 >UniRef50_B9XNU9 von Willebrand factor type A n=1 Tax=bacterium Ellin514 RepID=B9XNU9_9BACT Length = 229 Score = 80.3 bits (196), Expect = 1e-13, Method: Composition-based stats. Identities = 27/174 (15%), Positives = 55/174 (31%), Gaps = 17/174 (9%) Query: 233 DLRYKNYEKRPDPSSQAVMFCLMDVSGSMDQSTKDMAKRFYILLYLFLSRT---YKNVEV 289 L + + E +P + ++DVS SM + D L L+R+ K VE Sbjct: 4 QLPFIDVEFVDNPEPRCPCVLVLDVSSSMRGAAIDFLNLGVDLFAHDLTRSRLACKRVET 63 Query: 290 VYIRHHTQAKEVDEHE------FFYSQETGGTIVSSALKLMDEVVKERYNP------AQW 337 I V + + G T + A+ E++++R + + Sbjct: 64 AIITFGDGVHIVQDFVSPSAFVPPRFEAGGKTPMGEAVVQACELLEKRKRKYRAAGVSYF 123 Query: 338 NIYAAQASDGDNWA--DDSPLCHEILAKKLLPVVRYYSYIEITRRAHQTLWREY 389 + +DG+ + + + + + A+Q E Sbjct: 124 RPWIFLITDGEPTDYETANWRQAVEIVRAGEVDKKLMFFGVAVSDANQGKLNEL 177 >UniRef50_Q25545 Putative uncharacterized protein (Fragment) n=1 Tax=Naegleria fowleri RepID=Q25545_NAEFO Length = 357 Score = 80.3 bits (196), Expect = 1e-13, Method: Composition-based stats. Identities = 14/127 (11%), Positives = 36/127 (28%), Gaps = 12/127 (9%) Query: 299 KEVDEHEFFYSQETGGTIVSSALKLMDEVVKERYNPAQWNIYAAQASDG-DNWADDSPLC 357 K+ + T +S L ++K+R + +DG N + Sbjct: 2 KQKAKQVAKNIHAGTCTNLSGGLFEGLRLIKQRTTCNEITS-ILLFTDGLANEGITNTSE 60 Query: 358 H-----EILAKKLLPVVRYYSYIEITRRAHQTLWREYEHLQSTFDNFAMQHIRDQDDIYP 412 + +++ + +++ + + F + + DDI Sbjct: 61 IVSKMNTTIHEEIRKQITCFTFGFG-SDTDANMLTSIAQAGNGLYYF----LNNVDDIPK 115 Query: 413 VFRELFH 419 F + Sbjct: 116 AFGNVIG 122 >UniRef50_C7DHI4 von Willebrand factor type A n=1 Tax=Candidatus Micrarchaeum acidiphilum ARMAN-2 RepID=C7DHI4_9EURY Length = 705 Score = 80.3 bits (196), Expect = 1e-13, Method: Composition-based stats. Identities = 31/210 (14%), Positives = 68/210 (32%), Gaps = 12/210 (5%) Query: 169 SLQNSLARRTAMTAGKRRELHALEENLAIISNSEPAQLLEEERLRKEIAELRAKIERVPF 228 ++ + + ++ + E + +++N +R +I +L Sbjct: 451 TVYDGPVEQYNERIRPNPKIKKMLEEIELLNNPRRDLTGGTSGIRPDIRKLIR-----YE 505 Query: 229 IDTFDLRYKNYEKRPDPSSQAVMFCLMDVSGSMDQSTKDMAKRFYILLYLFLSRTYKNVE 288 + R + A ++ L+D+SGSM + AKR ++ L K V Sbjct: 506 VTKDPAYISKPYLRHIKDAGAEIWMLLDISGSMGGQKINAAKRILGSIHDSL-DGSKYVH 564 Query: 289 VVYIRH----HTQAKEVDEHEFFYSQETGGTIVSSALKLMDEVVKERYNPAQWNIYAAQA 344 + T E D G T A+ +++K+ + + ++ Sbjct: 565 LRMFGFYGSDGTHVFEFDRKMLMNLAAMGDTPTDIAIYYAMDLMKK--DKSNFDKTLFII 622 Query: 345 SDGDNWADDSPLCHEILAKKLLPVVRYYSY 374 +DGD K + V ++ Sbjct: 623 TDGDPNNGQETKNALNSLKNAMKNVNVFTI 652 >UniRef50_Q021L5 von Willebrand factor, type A n=1 Tax=Candidatus Solibacter usitatus Ellin6076 RepID=Q021L5_SOLUE Length = 337 Score = 80.3 bits (196), Expect = 1e-13, Method: Composition-based stats. Identities = 31/214 (14%), Positives = 61/214 (28%), Gaps = 22/214 (10%) Query: 217 AELRAKIERVPFIDTFDLRYKNYEKRPDPSSQAVMFCLMDVSGSMDQSTKDMAKRFYILL 276 L +E++ F D + + + D SGSM ++ Sbjct: 78 DRLITGLEKIHFHLFDDKVEQEITTFASEDVPVSIVIVFDCSGSM-GPKLAKSRAAVAAF 136 Query: 277 YLFLSRTYKNVEVVY-------IRHHTQAKEVDEHEFFYSQETGGTIVSSALKLMDEVVK 329 + + V++ + Q E+ F+ Q G T + A+ L + +K Sbjct: 137 LSSANPEDEFSLVLFNDRAQLVSGFNRQTDELQSKLFYA-QSKGRTALLDAIYLAMDQMK 195 Query: 330 ERYNPAQWNIYAAQASDG-DNWADDSPLCHEILAKKLLPVVRYYSYIEITRR-------- 380 SDG DN + S + K+ + +E Sbjct: 196 HA---KHSRKAVLVISDGGDNCSRYSMREVKNRVKEGDAQIYSIGILEAMGFRGRSAEEL 252 Query: 381 AHQTLWREYEHLQSTFDNFAMQHIRDQDDIYPVF 414 A L + F + ++ + D+ Sbjct: 253 AGPALLDDIASQSGGR-LFEIDNLNELSDVASKI 285 >UniRef50_UPI0001791B4B PREDICTED: similar to AGAP005490-PA n=1 Tax=Acyrthosiphon pisum RepID=UPI0001791B4B Length = 1156 Score = 80.3 bits (196), Expect = 1e-13, Method: Composition-based stats. Identities = 37/297 (12%), Positives = 78/297 (26%), Gaps = 25/297 (8%) Query: 140 NQQRQLTEYKTHRAGYTANGVPANISVVRSLQNSLARRTAMTAGKRRELHALEENLAI-I 198 N + +H N +++ + + + + A K E + I Sbjct: 135 NILLNDLKENSHFYHTPVNATYSSVHIPTYVYDRASDVI--KALKWSENLDVAFRKNYEI 192 Query: 199 SNSEPAQLLEEER-LRKEIAELRAKIERVPFIDTFDLRYKNYEKRPDPSSQAVMFCLMDV 257 Q ++ + + D +D R + + +S + L+D Sbjct: 193 DPRLSWQYFGSTTGFMRQFPAMEWPDKPDNTEDLYDCRMRPW-YVEAAASPKDILILVDN 251 Query: 258 SGSMDQSTKDMAKRFYILLYLFLSRTYKNVEVVYIR--------HHTQAKEVDEHEFFYS 309 SGSM +K +A+ L LS +V+ + Sbjct: 252 SGSMMGQSKIIARHVINTLLDTLSVNDYVNVLVFSNVTNEVVPCFKNLLVQATLANIREL 311 Query: 310 Q--------ETGGTIVSSALKLMDEVVK---ERYNPAQWNIYAAQASDGDNWADDSPLCH 358 + + S AL + E ++ ER A N +DG Sbjct: 312 KLGVENIADPNNISDFSIALTMAFETLEMYRERNMGAMCNQAIMLVTDGVPENFYELFKS 371 Query: 359 EILAK-KLLPVVRYYSYIEITRRAHQTLWREYEHLQSTFDNFAMQHIRDQDDIYPVF 414 + VR ++Y+ + + ++ + Sbjct: 372 HNWKNGTMGMPVRVFTYLIGREVPEIRDMKWMACANHGYFVHLSTVEEVREQVVHYL 428 >UniRef50_D2I4K8 Putative uncharacterized protein n=1 Tax=Ailuropoda melanoleuca RepID=D2I4K8_AILME Length = 1096 Score = 80.3 bits (196), Expect = 1e-13, Method: Composition-based stats. Identities = 27/200 (13%), Positives = 53/200 (26%), Gaps = 23/200 (11%) Query: 245 PSSQAVMFCLMDVSGSMDQSTKDMAKRFYILLYLFL---SRTYKNVEVVYIRHHTQA--- 298 + + L+D S S+ + + +++ L + V Y T A Sbjct: 545 KTVHYDLVFLLDTSSSVGKEDFEKVRQWVANLVDTFEVGPERTRVGVVRYSDQPTTAFEL 604 Query: 299 -----KEVDEHEFFYSQETGG-TIVSSALKLMDE-----VVKERYNPAQWNIYAAQASDG 347 +E + + GG T AL+ + R + A Sbjct: 605 GLFGSREAVKAAARHLAYHGGNTNTGDALRFITRHSFSPQAGGRPGDRAFKQVAILL--- 661 Query: 348 DNWADDSPLCHEILAKKLLPVVRYYSYIEITRRAHQTLWREYEHLQSTFDNFAMQHIRDQ 407 L + A +R ++ A + E + F + Sbjct: 662 -PAGRSQDLVLDAAAAAHRAGIRIFAVGVG--AALKEELEEIASEPKSAHVFHVSDFNAI 718 Query: 408 DDIYPVFRELFHKQNATAKG 427 D I R + ++A Sbjct: 719 DKIRGKLRRRLCESFSSALS 738 >UniRef50_Q6MJI4 Putative uncharacterized protein batB n=1 Tax=Bdellovibrio bacteriovorus RepID=Q6MJI4_BDEBA Length = 354 Score = 80.3 bits (196), Expect = 1e-13, Method: Composition-based stats. Identities = 34/235 (14%), Positives = 52/235 (22%), Gaps = 51/235 (21%) Query: 240 EKRPDPSSQAVMFCLMDVSGSMDQST-----KDMAKRFYILLYLFLSRTYKNVEVVYIRH 294 ++ S + +DVS SM AK L L K V + Sbjct: 81 SQQEVKSEGVEIIFAVDVSESMMAEDVKPSRLAQAKAELSRLVD-LMPGNKVGIVAFAGS 139 Query: 295 H------TQAKEVDEHEFFYSQET----GGTIVSSALKLMDEVVKE----RYNPAQWNIY 340 T + + + GT + ALK+ E + + Sbjct: 140 AALLSPLTNDPGAIKMYLESLEPSSVSSQGTNFTEALKISKEAFERGGVSTDETVKVTRV 199 Query: 341 AAQASDGDNWADDSPLCHEILAKKLLPVVRYYSYIEITR--------------------- 379 ASDG++ + K VR +S T Sbjct: 200 ILIASDGEDH---EQGALDEAKKMAGEGVRIFSLAYGTEKGGAIPVRDGMGFLKGYKKDR 256 Query: 380 -------RAHQTLWREYEHLQSTFDNFAMQHIRDQDDIYPVFRELFHKQNATAKG 427 R FA + +L Q T Sbjct: 257 QGQTILTTVKGDALRALAEAGQGSFYFATFGGEQTKLLVEDISKLEKAQFDTTMA 311 >UniRef50_B2UMP0 von Willebrand factor type A n=1 Tax=Akkermansia muciniphila ATCC BAA-835 RepID=B2UMP0_AKKM8 Length = 328 Score = 80.3 bits (196), Expect = 1e-13, Method: Composition-based stats. Identities = 33/271 (12%), Positives = 68/271 (25%), Gaps = 38/271 (14%) Query: 186 RELHALEENLAIISNSEPAQLLEEERLRKEIAELRAKIERVPFIDTFDLRYKNYEKRPDP 245 R E ++ + A L + + I R ++ E + Sbjct: 26 RRRRGAEGSITYPTVRFIASLARTPQSLAGKIGALCFVLAAAAIAIALARPQHVEDKTFR 85 Query: 246 SSQ-AVMFCLMDVSGSMDQSTKDM----------AKRFYILLYLFLSRTYKNVEVVYIRH 294 + + D+S SM+ + AK + V + Sbjct: 86 TVNGIDIMIAFDLSYSMETPDMVLNRMPINRLVAAKHVITQFVDS-RPDDRIGIVGFAGK 144 Query: 295 HTQAKEVD-----------EHEFFYSQETGGTIVSSALKLMDEVVKERYNPAQWNIYAAQ 343 + + Q G T + SA+ + +R + Sbjct: 145 TKSFCPLTLDHALVNSIIRDFHPRMIQADG-TAIGSAIAAAATRLDDR--KETKSKIIIL 201 Query: 344 ASDGD-NWADDSPLCHEILAKKLLPVVRYYSYI----------EITRRAHQTLWREYEHL 392 +DG N SPL A KL + + + + R+ L Sbjct: 202 VTDGASNSGQISPLVAAENAAKLGIKIYTIAVGTEEGTLANGMVVQSEFDEPTLRKIAQL 261 Query: 393 QSTFDNFAMQHIRDQDDIYPVFRELFHKQNA 423 +F ++ + + +L + Sbjct: 262 TGGE-HFRATNMASFNKAFTSIGKLEKSEAK 291 >UniRef50_P33352 Uncharacterized protein yehP n=69 Tax=root RepID=YEHP_ECOLI Length = 378 Score = 80.3 bits (196), Expect = 1e-13, Method: Composition-based stats. Identities = 43/367 (11%), Positives = 92/367 (25%), Gaps = 48/367 (13%) Query: 56 TEDISEPMFHQGRGGLRHRVHPGNDHFVQNDRIE-----RPQGGGGGSGSGQGQASQDG- 109 T ++ G ++ + +E P+ G SG S Sbjct: 10 TRELQRWRLILGEAAETTLCGLDDNARQIDHALEWLYGRDPERLQRGERSGGLGGSNLTT 69 Query: 110 -EGQDEFVFQISKDEYLDLLFEDLALPNLKQNQQRQLTEYKT--HRAGYTANGVPANISV 166 E + + E L + + + R + + + A + Sbjct: 70 PEWINSIHTLFPQQVI-----ERLESDAVLRYGIEDVVTNLDVLERMQPSESLLRAVLHT 124 Query: 167 VRSLQNSL---ARRTAMTAGKRRELHALEENLAII------SNSEPAQLLEEERLRKEIA 217 + + ARR + +E L + + Sbjct: 125 KHLMNPEVLAAARRIVCQVVEEIMARLAKEVRQAFSGVRDRRRRSFIPLARNFDFKSTLR 184 Query: 218 ELRAKIERVP-FIDTFDLRYKNYEKRPDPSSQAVMFCLMDVSGSMDQSTKDMAKRFYILL 276 + R+ + S Q + L+D SGSM S A Sbjct: 185 ANLQHWHPQHGKLYIESPRFN--SRIKRQSEQWQLVLLVDQSGSMVDSVIHSA--VMAAC 240 Query: 277 YLFLSRTYKNVEVVYIRHHTQAKEVDEHEFF------YSQETGGTIVSSALKLMDEVVKE 330 L + V + T ++ Q GGT ++SA++ +++++ Sbjct: 241 LWQL-PGIRTHLVAF---DTSVVDLTADVADPVELLMKVQLGGGTNIASAVEYGRQLIEQ 296 Query: 331 RYNPAQWNIYAAQASDGDNWADDSPLCHEILAKKLLPVVRYYSYIEITRR----AHQTLW 386 SD S L + K + ++ + + Sbjct: 297 -----PAKSVIILVSDFY-EGGSSSLLTHQVKKCVQSGIKVLGLAALDSTATPCYDRDTA 350 Query: 387 REYEHLQ 393 + ++ Sbjct: 351 QALVNVG 357 >UniRef50_C9RRF6 von Willebrand factor type A n=3 Tax=Bacteria RepID=C9RRF6_FIBSS Length = 228 Score = 80.3 bits (196), Expect = 1e-13, Method: Composition-based stats. Identities = 20/170 (11%), Positives = 47/170 (27%), Gaps = 23/170 (13%) Query: 240 EKRPDPSSQAVMFCLMDVSGSMDQSTKDMAKRFYILLYLFLSRT------YKNVEVVYIR 293 + +PS++ + ++D SGSM+ Y + + V + Sbjct: 11 DLENNPSTRVPVCLVLDTSGSMEGQPISELNEGINCFYDAVRSDETALYAAEIAVVTFGG 70 Query: 294 HHT--QAKEVDEHEFF--YSQETGGTIVSSALKLMDEVVKERYNP------AQWNIYAAQ 343 EH+ GGT + A+ + +++++R + + Sbjct: 71 SAVLKTDFSTLEHQPDSPNFFANGGTPMGEAMNMALDLLEKRKGEYKASGVDYYQPWIVL 130 Query: 344 ASDGDNWADDSP--LCHEILAKK-LLPVVRYYSYIEITRRAHQTLWREYE 390 +DG D S + + + + Sbjct: 131 MTDGKPNGDSSEYARAVQRTCEMIKNRKLTIFPIGIGEDAD----MNALA 176 >UniRef50_C3ZEP6 Putative uncharacterized protein n=2 Tax=Branchiostoma floridae RepID=C3ZEP6_BRAFL Length = 994 Score = 80.3 bits (196), Expect = 1e-13, Method: Composition-based stats. Identities = 22/187 (11%), Positives = 45/187 (24%), Gaps = 17/187 (9%) Query: 239 YEKRPDPSSQAVMFCLMDVSGSMDQSTKDMAKRFYILLYLFLSRTYK-NVEVVYIRHHTQ 297 + + +M ++D SGSM + V + Sbjct: 320 PDFVLLQEKEPMMVLVLDTSGSMRGDPIRRLNQAATHFIRSTVLDDSWLGIVTFSTTANT 379 Query: 298 AKEVDE--HEFFYSQ--------ETGGTIVSSALKLMDEVVKERYNPAQWNIYAAQASDG 347 + + + G T + AL +V++ + +P+ SDG Sbjct: 380 YHPLLQITSAADRTSLINRVPSTVGGTTCIGCALLEGVKVLEAQGDPSGG--ILFLMSDG 437 Query: 348 DNWADDSPLCHEILAKKLLPVVRYYSYIEITRRAHQTLWREYEHLQSTFDNFAMQHIRDQ 407 ++ +Y +R+ L F D Sbjct: 438 QENEAPDIATVTPQVLAKGIIIDTLAY----KRSADPQIESLALLTGGKSYFYSGEQGDS 493 Query: 408 DDIYPVF 414 + F Sbjct: 494 TALNDAF 500 >UniRef50_B9X084 Complement factor B n=5 Tax=Eumetazoa RepID=B9X084_NEMVE Length = 858 Score = 80.3 bits (196), Expect = 1e-13, Method: Composition-based stats. Identities = 31/259 (11%), Positives = 64/259 (24%), Gaps = 22/259 (8%) Query: 172 NSLARRTAMTAGKRRELHALEENLAIISNSEPAQLLEEERLRKEIAELRAKIERVPFIDT 231 ++ R A E + + +I + + + + + + + Sbjct: 317 SAKRRCQANGKWTGSEAKCMADFEYMIKDVNSTAYQLKRNIDTMLEYTCSGMNSTCNLTE 376 Query: 232 FDLRYKNYEKRPDPSSQAVMFCLMDVSGSMDQSTKDMAKRFYILLYLFL-----SRTYKN 286 D+R + E + + + D S S+ + F I L L Sbjct: 377 VDMRARAIELNE--AGGLDVVFVFDASSSIKMDDFRLGLDFSIELVKLLGTSWKPGGTHV 434 Query: 287 VEVVYIRHHT-----------QAKEVDEHEFFYSQETGGTIVSSALKLMDEVVKERYNPA 335 + Y AK V + GGT AL V + Sbjct: 435 AAITYGTESHLEFNLGDAGALTAKSVIAKIGKIKRSGGGTASRLALDTTIRQVV-PFTRE 493 Query: 336 QWNIYAAQASDGDNWADDSPLCHEILAKKLLPVVRYYSYIEITRRAHQTLWREYEHLQST 395 +DG + SP + K + Y+ + + L Sbjct: 494 GSQKALFFITDGHSNIGGSPRKAAKILKDK--GFQIYAIGVGKKVRRRELME-IASEPED 550 Query: 396 FDNFAMQHIRDQDDIYPVF 414 +++ + Sbjct: 551 EYVISVRKYKQLLSAVKKA 569 >UniRef50_UPI0000E488A7 PREDICTED: similar to Clca1 protein n=5 Tax=Strongylocentrotus purpuratus RepID=UPI0000E488A7 Length = 966 Score = 80.3 bits (196), Expect = 1e-13, Method: Composition-based stats. Identities = 24/195 (12%), Positives = 49/195 (25%), Gaps = 18/195 (9%) Query: 244 DPSSQAVMFCLMDVSGSMDQSTKDMAKRFYILLYLFLSRTY-KNVEVVYIRHHTQAKEVD 302 PS + ++D SGSMD D R + V + + Sbjct: 309 QPSGSLRIVLVLDTSGSMDGERFDKMIRGAKNFIQSIVPNNSYVAIVEFNYESIVDSYMT 368 Query: 303 E-------HEFFYS---QETGGTIVSSALKLMDEVVKERYNPAQWNIYAAQASDGDNWAD 352 E + G T + + +V + + +Y SDG+ Sbjct: 369 ELTSVISRKDLASLLPTLADGATCIGCGIVTAIQVAQ-YNDMDSRGVYLILLSDGEENHG 427 Query: 353 DSPLCHEILAKKLLPVVRYYSYIEITRRAHQTLWREYEHLQSTFDNFAMQHIRDQDDIYP 412 + +V ++ E T + + Q + Sbjct: 428 TPIADTMDDIEGSGVIVHSIAFYEA-----DTQLEDLAQMTGGISATCADGGSAQC-VIS 481 Query: 413 VFRELFHKQNATAKG 427 F + ++ + Sbjct: 482 AFVSIIAQRPQSVAA 496 >UniRef50_B0G4W3 Putative uncharacterized protein n=1 Tax=Dorea formicigenerans ATCC 27755 RepID=B0G4W3_9FIRM Length = 685 Score = 80.3 bits (196), Expect = 1e-13, Method: Composition-based stats. Identities = 28/198 (14%), Positives = 52/198 (26%), Gaps = 24/198 (12%) Query: 243 PDPSSQAVMFCLMDVSGSMDQSTKDMAKRFYILLYL--FLSRTYKNVEVVYIR------- 293 + + M DVSGSMD S + AK+ + Sbjct: 339 EKEALKVDMVA--DVSGSMDGSPLNEAKQVMSDFVGSVQFDAGDLVELTSFSTGVCLEQE 396 Query: 294 HHTQAKEVDEHEFFYSQETGGTIVSSALKLMDEVVKERYNPAQWNIYAAQASDG-DNWAD 352 A + ++ T + AL E R +DG DN+++ Sbjct: 397 FSDDAATLT-NDINNLVTGDMTSLYDALYTAVE----RVAAQNGARCVIAFTDGNDNYSN 451 Query: 353 DSPLCHEILAKKLLPVVRYYSYIEITRRAHQTLWREYEHLQSTFDNFAMQHIRDQ---DD 409 + +A + V I + ++ Sbjct: 452 CTKEDVVNVANRYHVPVFIIGIGSI----DYADVNDIATQTGGMYYNVSDVTSMDKIYEE 507 Query: 410 IYPVFRELFHKQNATAKG 427 IY + ++L+ + G Sbjct: 508 IYQMEKQLYLVEFEDNTG 525 >UniRef50_D1N7V1 von Willebrand factor type A n=1 Tax=Victivallis vadensis ATCC BAA-548 RepID=D1N7V1_9BACT Length = 342 Score = 80.3 bits (196), Expect = 1e-13, Method: Composition-based stats. Identities = 28/230 (12%), Positives = 53/230 (23%), Gaps = 57/230 (24%) Query: 240 EKRPDPSSQAVMFCLMDVSGSMDQST-----------------------KDMAKRFYILL 276 EK S + +D+SGSM+ ++AK+ Sbjct: 75 EKVLIRSQGIDIVLALDMSGSMEAYDVPRNINDARTLIAAVKNKEVENRIEVAKKEIRRF 134 Query: 277 YLFLSRTYKNVEVVY------IRHHTQAKEVDEHEFFYSQET---GGTIVSSALKLMDEV 327 + + + T + T +++ L Sbjct: 135 IEQ-RPNDRIGLIGFADQAYSFAPPTLDHAWLLAHLEQLEPGMIGQQTGIAAPLASGVNR 193 Query: 328 VKERYNPAQWNIYAAQASDGDNWADD--SPLCHEILAKKLLPVVRYYSYI---------- 375 +K+ P +DG N D+ +P L K+ V+ Sbjct: 194 LKKSDAP---RRVLVLFTDGRNNVDNRLTPEQAAALGKEFDVVIHTVGIGSRNAFVLVTD 250 Query: 376 --------EITRRAHQTLWREYEHLQSTFDNFAMQHIRDQDDIYPVFREL 417 I + L R + A + +L Sbjct: 251 PFGRQQFQGIEDEFDEKLLRSLAEITGGTYFHAADADGM-KQVMDEINQL 299 >UniRef50_UPI000185CA78 von Willebrand factor type A n=1 Tax=Capnocytophaga sputigena ATCC 33612 RepID=UPI000185CA78 Length = 347 Score = 79.9 bits (195), Expect = 1e-13, Method: Composition-based stats. Identities = 20/161 (12%), Positives = 41/161 (25%), Gaps = 17/161 (10%) Query: 247 SQAVMFCLMDVSGSMDQSTKDMAKRFYILLYLFLSRT------YKNVEVVYIRHH---TQ 297 + ++ L+DVS SM + + + L + + T Sbjct: 2 RRLPIYFLLDVSESMVGDPIEHVQDGMATIIKELKADPFALETVWLSIIGFAGKSKVITP 61 Query: 298 AKEVDEHEFFYSQETGGTIVSSALKLMDEVVKER------YNPAQWNIYAAQASDGDNWA 351 +++ GGT ++S L + + W +DG Sbjct: 62 LQDIITFYPPKIPIGGGTSLASGLNELMNAIDREVVKTTLERKGDWKPLVFLFTDGIPTD 121 Query: 352 DDSPLCHEILAKKLLPVVRYYSYIEITRRAHQTLWREYEHL 392 D E V + + + L + Sbjct: 122 -DPAQAIERWNAHYRRKVNLVAI-SLGENTNYNLLGQLTDQ 160 >UniRef50_A3UND6 Von Willebrand factor type A domain protein n=1 Tax=Vibrio splendidus 12B01 RepID=A3UND6_VIBSP Length = 345 Score = 79.9 bits (195), Expect = 1e-13, Method: Composition-based stats. Identities = 26/208 (12%), Positives = 55/208 (26%), Gaps = 30/208 (14%) Query: 240 EKRPDPSSQAVMFCLMDVSGSMDQSTKD-----------MAKRFYILLYLFLSRTYKNVE 288 E S + +D+SGSM + +AK+ + Sbjct: 85 EPIEQKKSAREIMVALDLSGSMSEEDFADKKGNKHDRLTIAKQVLREFAAQ-REHDRLGL 143 Query: 289 VVY---------IRHHTQAKEVDEHEFFYSQETGGTIVSSALKLMDEVVKERYNPAQWNI 339 +++ + + T A+ L V ++ Sbjct: 144 ILFADSAYVQAPFTEDINVWQSLLEDVELGYAGFKTAFGDAIGLSIAVFEQ---EQSRQR 200 Query: 340 YAAQASDGDNWADD-SPLCHEILAKKLLPVVRYYSYIEIT--RRAHQTL--WREYEHLQS 394 +DGD+ + P+ +A K + + + + R L + Sbjct: 201 VMILLTDGDDTSSKMPPVKAAEIAAKYGVKIYTIAIGDPSTKGRYKMDLPTLEKVSAATG 260 Query: 395 TFDNFAMQHIRDQDDIYPVFRELFHKQN 422 AM + D Y +L ++ Sbjct: 261 GQMFHAMDR-KQLDQAYATIDQLEQQEF 287 >UniRef50_D1YP15 von Willebrand factor type A domain protein n=1 Tax=Veillonella parvula ATCC 17745 RepID=D1YP15_9FIRM Length = 675 Score = 79.9 bits (195), Expect = 1e-13, Method: Composition-based stats. Identities = 50/379 (13%), Positives = 95/379 (25%), Gaps = 42/379 (11%) Query: 65 HQGRGGLRHRVHPGNDHFVQNDRIERPQGGGGGSGSGQGQASQDGEGQDE----FVFQIS 120 H R + +D + I+ G S + S E S Sbjct: 294 HSDELNTNGRDNLNSDGHDGSSDIDFDDGKSWTSNHPETLDSTANEWGTSDSMNTHSDAS 353 Query: 121 KDEYLDLLFEDLAL---PNLKQNQQRQLTEYKTHRAGYTANGVPANISVVRSLQNSLARR 177 L + +A + Q T + +R G + + Sbjct: 354 NGISSLCLPDTVARIANQFFQWKLQSSKTVDRQYRKGSGRRLMTKTKDTRGRMIRVYQDE 413 Query: 178 TAMTAGKRRELHALEENLAIISNSEPAQLLEEERLRKEIA--------ELRAKIERVPFI 229 A+ +L ++ A + L+ + ++ P Sbjct: 414 HAL-----EDLALIDTLRAAAPYQRLRAVERVTPLQADTPLNVVERSVTSECLCQKAPQT 468 Query: 230 DTFDLRYKNYEKRPD--------PSSQAVMFCLMDVSGSMDQSTKDMA-KRFYILLY-LF 279 + DL+ + +P A ++D SGSM + A K + L Sbjct: 469 EHHDLKGLSIVVKPQDYRRKAREKRIGAYQLFVVDASGSMAARHRMEATKGAILSLLRDS 528 Query: 280 LSRTYKNVEVVYIRHH-------TQAKEVDEHEFFYSQETGGTIVSSALKLMDEVVKE-R 331 +V+ + T++ E E G T ++ L++ + Sbjct: 529 YVHRDSVGLIVFRKDSAEVLLPFTRSVERAERLLAKLPTGGKTPLAKGLRVAYTMCDRLL 588 Query: 332 YNPAQWNIYAAQASDGDNWADDSPLCHEILAKKLLP----VVRYYSYIEITRRAHQTLWR 387 + I +DG + DS + V T L + Sbjct: 589 RRHSAERIQMICITDGRATSGDSENPVAEAKQWARILGTLPVDCIVIDTETGFIKLGLAK 648 Query: 388 EYEHLQSTFDNFAMQHIRD 406 E L + D Sbjct: 649 ELCKLMNGSYYAMDSITSD 667 >UniRef50_C3Y983 Putative uncharacterized protein n=3 Tax=Branchiostoma floridae RepID=C3Y983_BRAFL Length = 642 Score = 79.9 bits (195), Expect = 2e-13, Method: Composition-based stats. Identities = 40/461 (8%), Positives = 101/461 (21%), Gaps = 72/461 (15%) Query: 2 TWFIDRRLNGKNKSMVNRQRFLRRY-----------KAQIKQSISEAINKRSVTDVDSGE 50 I RL ++K R R +++ + + + + Sbjct: 40 NVVIAGRLGSRDKVTERRVRRQDDVSDTVDVYWLHVDSRVTSRFATVTVESKLANRRDRG 99 Query: 51 SVSIPTEDISEPMFHQ------------GRGGLRHRVHPGNDHFVQNDRIERPQGGGGGS 98 ++ T + E F G + D V+ Sbjct: 100 REAVFTMQLPETAFISNFSMVISNTLYVGTVMKKGAAREAYDAAVEAGESAGHVAQESPR 159 Query: 99 GSGQGQASQDGEGQDEFVFQISKDEYL-DLLFEDLALPNLKQNQQRQLTEYKTHRAGYTA 157 G + + S + +++ FQ++ E+L L + +++ Q Sbjct: 160 GREEFRISINVGAEEKVTFQLTYQEFLVRRLGIYEHVISIRPEQIIPDMRISVDLRESQG 219 Query: 158 NGVPANISVVRSLQNSLARRTAMTAGKRRELHALEENLAIISNSEPAQLLEEERLRKEIA 217 + S + + + ++ S Q + + Sbjct: 220 FSFLRVPDIRTSDLLTDQ---STDELSSATVSRASPEHYTVTYSPSEQEQAAMSTQGIMG 276 Query: 218 ELRAKIERVPFIDTFDLRYK------NYEKRPDPSSQAVMFCLMDVSGSMDQSTKDMAKR 271 + + + + ++ + P + ++D Sbjct: 277 DFVVQYDVIRQQQIGFIQVYGDYFVHYFAPTGLPVMPKNVIFVID--------------- 321 Query: 272 FYILLYLFLSRTYKNVEVVYIRHHTQAKEVDEHEFF-----------YSQETGGTIVSSA 320 L V + + K + GGT ++ A Sbjct: 322 --------LREMDFFNIVTFSSRTKRWKSELQVANQENIRAADTYVTSMAAHGGTNINDA 373 Query: 321 LKLMDEVV--KERYNPAQWNIYAAQASDGDNWAD-DSPLCHEILAKKLLPVVR-YYSYIE 376 + ++ + R +DG + A+ L + Sbjct: 374 ILEASVLLDPELRSRHDSHASMIVLLTDGQPTGGVTNTNHIIANARDSLAGNHALFCLGF 433 Query: 377 ITRRAHQTLWREYEHLQSTFDNFAMQHIRDQDDIYPVFREL 417 F + + F E+ Sbjct: 434 GY-DVSFEFLERLALQNGGFARRIYPDDDGELQLTSFFDEV 473 >UniRef50_B0NBU8 Putative uncharacterized protein n=1 Tax=Clostridium scindens ATCC 35704 RepID=B0NBU8_EUBSP Length = 1865 Score = 79.9 bits (195), Expect = 2e-13, Method: Composition-based stats. Identities = 46/345 (13%), Positives = 95/345 (27%), Gaps = 35/345 (10%) Query: 33 SISEAINKRSVTDVDSGESVSIPTEDISEPMFHQGRGGLRHRVHPGNDHFVQNDRIERPQ 92 S++ + NK + DS V D + + + N+++ + Sbjct: 336 SVTVSANKEGIIPADSKLKVIPVLPDDKKTKDQYKEVEDKLKDKAKNENYSIAGFLAYDI 395 Query: 93 GGGGGSGSGQGQASQDGEGQDEFVFQISKDEYLDLLFEDLALPNLKQNQQRQLTEYKTHR 152 G + I K+ + + + +L++N++ Q+ + Sbjct: 396 SFVDEDGKEVEPDGNVKVTMEYKKDVIPKEVEVTEKDLGVTVMHLEENEKGQVKKVVDM- 454 Query: 153 AGYTANGVPANISVVRSLQNSLARRTAMTAGKRRELHALEENLAIISNSEPAQLLEEERL 212 +G A++ + + A + E L + + + L Sbjct: 455 --VADDGSKASVETTDNGKVKKAEFVTDSFSTFTLAWQEYEPLLT-DYANGSVRTSDNSL 511 Query: 213 RKEIAELRAKIERVPFIDTFDLRYKNYEKRPDPSSQAVMFCLMDVSGSMDQST------- 265 R K T L + + + + ++D SGSM + Sbjct: 512 GAPEHNKRIKYNEKDKDYTLTL---DVTGKRGKKAGVDVLLVIDKSGSMGLNDNGRTDSN 568 Query: 266 ----KDMAKRFYILLYLF-LSRTYKNVEVVYIRHHTQA---------------KEVDEHE 305 K+ L L + V I + K + Sbjct: 569 YFNLMPTLKKTVPTLVDTILPDSDSVNRVAAISFSSDDYTGNDISTDWVDYNGKSGFNRK 628 Query: 306 FFYSQETGGTIVSSALKLMDEVVKERYNPAQWNIYAAQASDGDNW 350 GGT A++ D+ +K R +Q SDG+ Sbjct: 629 IEGLGTKGGTNWQLAMRNADKKLKPR-AESQNKKVVVFLSDGEPT 672 Score = 61.1 bits (146), Expect = 8e-08, Method: Composition-based stats. Identities = 22/157 (14%), Positives = 44/157 (28%), Gaps = 32/157 (20%) Query: 249 AVMFCLMDVSGSM--DQSTKDMAKRFYILLYLFLSRTYKNVEVVYIRH------------ 294 A + ++D S SM + + E+ I + Sbjct: 1067 ASIVLVLDASASMQENGKKLKDIQDAAKAFVNTTKEKSPISEIAVIWYQGSEGSSSTITD 1126 Query: 295 -------HTQAKEVDEHEFFYSQETGGTIVSSALKLMDEVVKERYNPAQWNIYAAQASDG 347 + + +GGT + AL+ + ++ R N + YA +DG Sbjct: 1127 SGFYTLDTSDNVDAINRFISNKNASGGTPMGDALEEANSILSGRPNS---SKYALLFTDG 1183 Query: 348 DNWADDS--------PLCHEILAKKLLPVVRYYSYIE 376 + S AK++ + Y+ Sbjct: 1184 MPGYNSSNNSFNCMVANHANNEAKEIKEYAKLYTIGY 1220 >UniRef50_Q9ZQ46 Copia-like retroelement pol polyprotein n=6 Tax=Arabidopsis thaliana RepID=Q9ZQ46_ARATH Length = 683 Score = 79.9 bits (195), Expect = 2e-13, Method: Composition-based stats. Identities = 27/266 (10%), Positives = 67/266 (25%), Gaps = 42/266 (15%) Query: 141 QQRQLTEYKTHRAGYTANGVPANISVVRSLQNSLARRTAMTAGKRRELHALEENLAIISN 200 + R + ++ N + +L +S + + E S Sbjct: 183 EIRNYAKPESQIKPEIKNKSLRVYNDDEALISSPISPAGFHTILESDENEDCEEFTGFSV 242 Query: 201 SEPAQLLEEERLRKEIAELRAKIERVPFIDTFDLRYKNYEKRPDPS----------SQAV 250 + P+ L + + + + + Y K P Sbjct: 243 NTPSPLTAKLLTDRNVDVKLSPESAIVASGKGYETYSVVMKVKSPPFPTARGFARRVPVD 302 Query: 251 MFCLMDVSGSMDQSTKDMAKRFYILLYLFLSRTYKNVEVVYIRHHTQAKEVDEHEFFYSQ 310 + ++DVSG +M K+ ++ L + + + + + + Sbjct: 303 LVAVLDVSGRNSGGKLEMLKQTMRIVLSNLREMDRLSIIAFSSSSKRLSPLRRMTANGRR 362 Query: 311 ----------------------ETGGTIVSSALKLMDEVVKERYNPAQWNIYAAQASDGD 348 G V+ ALK +V+ +R + +D Sbjct: 363 SARRIVDIITVPGSVSGVGIDFSGEGMSVNDALKKAVKVLDDRRQKNPFT-AVFVLTD-- 419 Query: 349 NWADDSPLCHEILAKKLLPVVRYYSY 374 +A+ + ++ Sbjct: 420 -------RQAHQVAQLAHSRIPIHTI 438 >UniRef50_Q6VPP3 Parturition-related protein PRP3 n=6 Tax=Eutheria RepID=Q6VPP3_RAT Length = 923 Score = 79.9 bits (195), Expect = 2e-13, Method: Composition-based stats. Identities = 31/261 (11%), Positives = 70/261 (26%), Gaps = 32/261 (12%) Query: 168 RSLQNSLARRTAMTAGKRRELHALEENLAIISNSEPAQLLEEERLRKEIAELRAKIERVP 227 L + + +++ ++ + R + + Sbjct: 217 TQLYEKDCQFFPDKVQTEKSSIMFMQSIDSVTEF--CKKENHNREAPTLHNQKCDYRSTW 274 Query: 228 FIDTFDLRYKNYEKRPDPSSQA----------VMFCLMDVSGSMDQSTK-DMAKRFYILL 276 + + +KN P S ++ ++DVSGSM + + + Sbjct: 275 EVISNSEDFKNSTPMEMPPSPPFFSLLRISERIVCLVLDVSGSMGSYDRLNRMNQAAKFF 334 Query: 277 YLF-LSRTYKNVEVVYIRHHTQAKEVDEHEFF----YS------QETGGTIVSSALKLMD 325 L V + T E+ + +GGT + S ++ Sbjct: 335 LQQILESRSWAGMVHFHSSATVKSELIQINSDVERNQLLETLPTSASGGTSICSGIRTAF 394 Query: 326 EVVKERYNPAQWNIYAAQASDGDNWADDSPLCHEILAKKLLPVVRYYSYIEITRRAHQTL 385 +V K + N SDG++ + K VV + + + Sbjct: 395 QVFKNKGYQTGGN-DILLLSDGED---STAKDCLDEVKDSGAVVHFIALGKAF----DQS 446 Query: 386 WREYEHLQSTFDNFAMQHIRD 406 ++ FA ++ Sbjct: 447 ISNMANVTGGKQLFATDEAQN 467 >UniRef50_Q6AQK5 Hypothetical membrane protein (BatB) n=1 Tax=Desulfotalea psychrophila RepID=Q6AQK5_DESPS Length = 566 Score = 79.9 bits (195), Expect = 2e-13, Method: Composition-based stats. Identities = 24/219 (10%), Positives = 49/219 (22%), Gaps = 43/219 (19%) Query: 239 YEKRPDPSSQAVMFCLMDVSGSM-----DQSTKDMAKRFYILLYLFLSRTYKNVEVVYIR 293 + + +D S SM + + A+ + L + + + Sbjct: 85 FRWVDVQRRGIDILFAIDTSRSMLSQDLKPNRLERARYAVMDFVATLG-GDRVGLIPFAG 143 Query: 294 HHTQAKEVDEHEF---FYSQE-------TGGTIVSSALKLMDEVVKERYNPAQWNIYAAQ 343 + + GT ++ + L ++ V + N Sbjct: 144 SSYLMCPLTLDYQAFTDSLKALDTKIIPRRGTNIAKVIALAEKTVADSSNH----KILII 199 Query: 344 ASDGDNWADDSPLCHEILAKKLLPVVRYYSYIEITRR----------------------A 381 +DG+N D L LAKK + Sbjct: 200 LTDGENLQGD-VLKAADLAKKNGLTIYTIGVGTAAGELIPGGPGGAFIRDSSGKYVKSKL 258 Query: 382 HQTLWREYEHLQSTFDNFAMQHIRDQDDIYPVFRELFHK 420 + +E + + IY K Sbjct: 259 DEETLQEIAEKTGGISVLLGNNNQGLKKIYTDKLRFIPK 297 >UniRef50_A8H3C5 von Willebrand factor type A n=3 Tax=Gammaproteobacteria RepID=A8H3C5_SHEPA Length = 328 Score = 79.9 bits (195), Expect = 2e-13, Method: Composition-based stats. Identities = 26/203 (12%), Positives = 51/203 (25%), Gaps = 30/203 (14%) Query: 240 EKRPDPSSQAVMFCLMDVSGSMDQSTK-----------DMAKRFYILLYLFLSRTYKNVE 288 E M +D+SGSM+ D K L + + Sbjct: 84 EPIQIEQVGREMMIAVDLSGSMEARDFVDPQGEILRRVDGVKALLQSFLLK-RDSDRIGL 142 Query: 289 VVY---------IRHHTQAKEVDEHEFFYSQETGGTIVSSALKLMDEVVKERYNPAQWNI 339 + + Q + GT + A+ + ++ N Sbjct: 143 IAFGENAYLQAPFTQDKQILSQLLQQMDVRMAGAGTAIGDAIGVAVNHFEQSEVE---NK 199 Query: 340 YAAQASDGDNWADD-SPLCHEILAKKLLPVVRYYSYI----EITRRAHQTLWREYEHLQS 394 +DG++ + + PL A + V+ + L Sbjct: 200 VLLLLTDGNDTSSEFPPLDAAHYAGEQGVVIYPIAIGDPKNVGEDSLDIATLERIADLTQ 259 Query: 395 TFDNFAMQHIRDQDDIYPVFREL 417 F + ++Y V +L Sbjct: 260 GR-VFEADDGQSLIEVYKVLEQL 281 >UniRef50_A4QDZ6 Putative uncharacterized protein n=1 Tax=Corynebacterium glutamicum R RepID=A4QDZ6_CORGB Length = 354 Score = 79.9 bits (195), Expect = 2e-13, Method: Composition-based stats. Identities = 32/257 (12%), Positives = 59/257 (22%), Gaps = 35/257 (13%) Query: 189 HALEENLAIISNSEPAQLLEEERLRKEIAELRAKIERVPFIDTFDLRYKNY------EKR 242 + E + A R A L A++ I+ K Sbjct: 102 EQVAELAGWFAEHPDALTDTYRRPTTANATLPAELSSQTIIEAPFPGSKTVTDALIDAYT 161 Query: 243 PDPSSQAVMFCLMDVSGSMDQSTKDMAKRFYILLYL----------FLSRTYKNVEVVYI 292 ++DVSGSM + K L L K + + Sbjct: 162 NQFRVPGETTFVLDVSGSMLGQRITLLKDTMSDLISGGATTDLANVSLRGREKVSIIPFS 221 Query: 293 RHHTQA------------KEVDEHEFFYSQETGGTIVSSALKLMDEVVKERYNPAQWNIY 340 + + + Q GGT + A+ + Sbjct: 222 FGPHEVISETLGAVGSPSRIDLQQRVEALQADGGTGIYDAVLAAY----AESAGGDYIPS 277 Query: 341 AAQASDGDNWADDSPLCHEILAKKLLPVVRYY-SYIEITRRAHQTLWREYEHLQSTFDNF 399 +DG+ A + L +R ++ + A+ + Sbjct: 278 IVLMTDGELTAGRTYDQFLTEWNALPSNIRSIPVFVILYGEANVADMEQLAATTGGKTF- 336 Query: 400 AMQHIRDQDDIYPVFRE 416 D D+ + R Sbjct: 337 -DAINGDLDEAFKEIRA 352 >UniRef50_C1EAC0 Predicted protein n=1 Tax=Micromonas sp. RCC299 RepID=C1EAC0_9CHLO Length = 753 Score = 79.9 bits (195), Expect = 2e-13, Method: Composition-based stats. Identities = 21/219 (9%), Positives = 50/219 (22%), Gaps = 26/219 (11%) Query: 229 IDTFDLRYKNYEKRPDPSSQAVMFCLMDVSGSMDQSTKDMAKRFYILLYLFLSRTYKNVE 288 F + + + +F ++D SGSM A + + L Sbjct: 276 RAPFIISISPPDPKSCAPFARSVFFIVDRSGSMTGKPMAGANQALLAGLSSLGPQDTFNI 335 Query: 289 VVYI------------RHHTQAKEVDEHEFFYSQETGGTIVSSALKLMDEVVKERYNPAQ 336 + + G T + S L+ ++ +R Sbjct: 336 CAFDNLQEYLSEDMVPASPENINAAKGWIQGHCTARGTTDILSPLRAAVAILSKRPLLGA 395 Query: 337 WNIYAAQASDGDNWADDSPLCHEILAKK---------LLPVVRYYSYIEITRRAHQTLWR 387 +DG A ++ ++ L+ R ++ + + Sbjct: 396 V-PLIYVVTDG---AVENEREICRYMQEVMSAPPPEGLMTHPRVCTFGIGR-YCNHYFLK 450 Query: 388 EYEHLQSTFDNFAMQHIRDQDDIYPVFRELFHKQNATAK 426 + + A R + + Sbjct: 451 MLSQIGKGLSDAAYTDERVGSQMIALINASRTPVLTDVM 489 >UniRef50_C5S6K2 von Willebrand factor type A n=1 Tax=Allochromatium vinosum DSM 180 RepID=C5S6K2_CHRVI Length = 341 Score = 79.9 bits (195), Expect = 2e-13, Method: Composition-based stats. Identities = 26/213 (12%), Positives = 50/213 (23%), Gaps = 40/213 (18%) Query: 240 EKRPDPSSQAVMFCLMDVSGSMDQSTKDM----------AKRFYILLYLFLSRTYKNVEV 289 P P + + +DVSGSM Q ++ + + + Sbjct: 82 APVPLPLAGRDLMLAIDVSGSMAQEDYELDGRPVSRLAVVRTVASAFVER-RAGDRLGLI 140 Query: 290 VY---------IRHHTQAKEVDEHEFFYSQETGGTIVSSALKLMDEVVKERYNPAQWNIY 340 ++ + + T + A+ L + ++E + Sbjct: 141 LFGTRAYLQTPLTFDGATVAAMLRDSVVGLAGRETAIGDAIGLAVKRLRE---QPEGQRV 197 Query: 341 AAQASDGDNWADD-SPLCHEILAKKLLPVVRYYSYIEI---------------TRRAHQT 384 +DGDN A PL LA + V Sbjct: 198 LILLTDGDNTAGALDPLEAAELAAQAGVRVYTIGIGGGELGVRSLFGMRLLRQASDFDPA 257 Query: 385 LWREYEHLQSTFDNFAMQHIRDQDDIYPVFREL 417 + F + + +Y L Sbjct: 258 TLERIAEITGGRA-FTADSRQQLEAVYDELDRL 289 >UniRef50_D1C680 ATPase associated with various cellular activities AAA_5 n=2 Tax=Sphaerobacter thermophilus DSM 20745 RepID=D1C680_SPHTD Length = 658 Score = 79.9 bits (195), Expect = 2e-13, Method: Composition-based stats. Identities = 50/397 (12%), Positives = 99/397 (24%), Gaps = 26/397 (6%) Query: 26 YKAQIKQSISEAINKRSVTDVDSGES--VSIPTEDISEPMFHQGRGGLRHRVHPGNDHFV 83 + ++S +++I + V V G+ P +R G+ Sbjct: 270 VRESSRRS-ADSIVEEIVLAVLDGQPEAPRYPDPGGPGRRPADHESSIRPGAGRGSGTGR 328 Query: 84 QNDRIERPQGGGGGSGSGQGQASQDGEGQDEFVFQISKDEYLDLLFEDLALPNLKQ--NQ 141 R + G+ + + S + F P++ + + Sbjct: 329 ATSRPPDHPVAPSHVNTTGSGRVYLGDEALDAAARRSASQRAVRAF-AERHPHVTRALRR 387 Query: 142 QRQLTEYKTHRAGYTANGVPANISVVRSLQNSLARRTAMTAGKRRELHALEENLAIISNS 201 + + V L R + R L + + + Sbjct: 388 LPGVRTALETSLADGPPLPREILGEVHHLAGHADHRALVDRIARDVLVRMAQ--RDLDQH 445 Query: 202 EPAQLLEEERLRKEIAELRAKIERVPFIDTFDLRYKNYEKRPDPSSQAVMFCLMDVSGSM 261 L R E AEL ++ ++ ++ + ++DVSGSM Sbjct: 446 PGPGRLTSTPYRGEAAELDLDRSLERVLEQPEVTDEDLVVIERRPRKRAYALMLDVSGSM 505 Query: 262 DQSTKDMAKRFYILLYLFLSRTYKNVEVVYIRHHTQAKEVDE--------HEFFYSQETG 313 + A + + + + + R K +DE G Sbjct: 506 QGAAIYEAALALAAVAVRVDP-DPFAVIAFWRDAAVLKRLDEPVDLDHLIDRVLSLPGRG 564 Query: 314 GTIVSSALKLMDEVVKERYNPAQWNIYAAQASDGDNWADDSPLCHEILAKKLLPVVRYYS 373 T ++ L++ + + SDG A P + P + S Sbjct: 565 LTNLALGLRVGLDELSRASTQE---RVGLLFSDGLRTAGPPPDAYAAAF----PTLHVIS 617 Query: 374 YIEITRRAHQTLWREYEHLQSTFDNFAMQHIRDQDDI 410 T RE L I Sbjct: 618 --TGRSEQSATACRELAALGDGRYAAITGLASIPTAI 652 >UniRef50_UPI0000EB12CB UPI0000EB12CB related cluster n=1 Tax=Canis lupus familiaris RepID=UPI0000EB12CB Length = 2186 Score = 79.9 bits (195), Expect = 2e-13, Method: Composition-based stats. Identities = 29/181 (16%), Positives = 53/181 (29%), Gaps = 19/181 (10%) Query: 249 AVMFCLMDVSGSMDQSTKDMAKRFYILLYLFLSRTYKNVEV----------VYIRHHTQA 298 + ++D SGS+ ++ I L V V V Sbjct: 792 LDIVFVLDHSGSIGTQEQESMMNLTIHLVKKADVDSDRVRVGALKYSDYPEVLFYLSGNK 851 Query: 299 KEVDEHEFFYSQETGGTIVSSALKLMDEVVKERYN---PAQWNIYAAQASDGDNWADDSP 355 V EH +G T + AL+ + + E Y +DG + D+ Sbjct: 852 SAVIEHLRRRRYTSGHTYTARALEHANIMFTEEYGSRIQQNVKQMLIIITDGVSHDRDNL 911 Query: 356 LCHEILAKKLLPVVRYYSYIEITRRAHQTLWREYEHLQSTFDNFAMQHIRDQDDIYPVFR 415 + + Y+ A+Q + + F + + + DIY + Sbjct: 912 SDTASKLRNK--GINIYAVGVGQ--ANQLELETMA--GNKSNTFHVDNFSNLKDIYLPLQ 965 Query: 416 E 416 E Sbjct: 966 E 966 Score = 68.8 bits (166), Expect = 4e-10, Method: Composition-based stats. Identities = 27/188 (14%), Positives = 57/188 (30%), Gaps = 21/188 (11%) Query: 240 EKRPDPSSQAVMFCLMDVSGSMDQSTKDMAKRFYILLYLFLSRTYKNVEVVYIRHHTQAK 299 K+ +A + L+D SGS+ K F L + + ++ ++ + Sbjct: 577 AKKRCEDMKADIMFLVDSSGSIGHDNFGKMKTFMKNLLAKIQIGPDSTQIGVVQFSDINQ 636 Query: 300 EVDE-----------HEFFYSQE-TGGTIVSSALKLMDEVVK-ERYNPAQWNIYAAQASD 346 E + GT+ SAL + + + + + +D Sbjct: 637 EEFQLNKYFTQNETSDAIDRMSLINRGTLTGSALTFVGQYFTPTKGARTKVKKFLILITD 696 Query: 347 GDNWADDSPLCHEILAKKLLPVVRYYSYIEITRRAHQTLWREYEHLQSTFDNFAMQHIRD 406 G+ A D + V +S A++T E + F +++ D Sbjct: 697 GE--AQDPVRDPAKALRDK--GVVIFSVGVYG--ANRTQLEEIS--GDSSLVFQVENFDD 748 Query: 407 QDDIYPVF 414 + Sbjct: 749 LKTVESKL 756 Score = 66.1 bits (159), Expect = 2e-09, Method: Composition-based stats. Identities = 23/164 (14%), Positives = 52/164 (31%), Gaps = 22/164 (13%) Query: 242 RPDPSSQAVMFCLMDVSGSMDQSTKDMAKRFYILLYLFLSRTYKNV-----------EVV 290 + +A ++ L+D S S++ + K+F + + S V V Sbjct: 374 DKPHTKEADIYFLIDGSTSINTEGFEQIKQFMLAVTGMFSIGSDKVQAGAVQYSDKIRVE 433 Query: 291 YIRHHTQAKEVDEHEFFYS-QETGGTIVSSALKLMDEVVKERYNP--AQWNIYAAQASDG 347 + + + Q G T AL M ++K+ ++ + +DG Sbjct: 434 FYINASSNDMDLRKAILNIEQLQGNTHTGKALDFMLSIIKKDRKHRISEIPCHLIVLTDG 493 Query: 348 DNWADDSPLCHEILAKKLL-PVVRYYSYIEITRRAHQTLWREYE 390 S A++L + ++ A + ++ Sbjct: 494 K-----SQDEVLKPAERLRDEQITIHAVGIG--EADKIQLQQIA 530 Score = 50.7 bits (119), Expect = 1e-04, Method: Composition-based stats. Identities = 19/162 (11%), Positives = 40/162 (24%), Gaps = 25/162 (15%) Query: 250 VMFCLMDVSGSMDQSTKDMAKRFYILLYLFLSRTYKNVEVVYIRHHTQAKEVDE------ 303 + L+D S + + K F + L V ++ Q + Sbjct: 1 DVVFLVDSSNHLGTKSFPFVKTFISKIINSLPIEAHKYRVALAQYSDQLHSEFQLGTFKS 60 Query: 304 --HEFFYSQ-----ETGGTIVSSALKLMDEVVKERYNPAQWNIY-----AAQASDGDNWA 351 + + G + AL+ R + + AS + Sbjct: 61 RNPMLNHLKKNFGFVGGSLRIGQALREAHRTYFSRPDSGRDKKQFPPILVVLAS---AES 117 Query: 352 DDSPLCHEILAKKLLPVVRYYSYIEITRRAHQTLWREYEHLQ 393 +D + VR S + A + + + Sbjct: 118 EDDVEEPSKALR--GDGVRIISVGL--QSASEQELKAMATVS 155 Score = 44.1 bits (102), Expect = 0.011, Method: Composition-based stats. Identities = 30/293 (10%), Positives = 70/293 (23%), Gaps = 31/293 (10%) Query: 71 LRHRVHPGNDHFVQNDRIERPQGGGGGSGSGQGQASQDGEGQDEFVFQISKDEYLDLLFE 130 + G++ P G G GS + G + + + Sbjct: 1600 PGRKGEKGSEGHKGPQGSSGPVGAKGNVGSPGPPGKKGESGILGDPGPMGQAGQRGRQGD 1659 Query: 131 DLALPNLKQNQQRQLTEYKTHRAGYTANGVPANISVVRSLQNSLARRTAMTAGKRRELHA 190 D +P ++ + + G + V R +T G++ E + Sbjct: 1660 D-GIPGYGHMGRKGVKGPRGFTGDMGQKGDVGDPGVPGGPGPKGYRGLTLTVGRKGEEGS 1718 Query: 191 LEENLAIISNSEPAQLLEEERLRKEIAELRAKIERVPFIDTFDLRYKNYEKRPDPSSQAV 250 + + + ++ + P Sbjct: 1719 PGPPGPPGRRGMKGMAG--KPVYSQCDLIQFMRDHSRCCQF---------AEKCPVYPTE 1767 Query: 251 MFCLMDVSGSMDQSTKDMAKRFYILLYLFLSR-------TYKNVEVVYIR--------HH 295 + +D S S+ + + + + L+ + V V Y Sbjct: 1768 LVFALDQSNSVSEQRFNEMRDIITSIVNDLNIRESNCPVGARVVVVSYDTGTSYLIRWSD 1827 Query: 296 TQAKEVDEHEFFYSQETGGT---IVSSALKL-MDEVVKERYNPAQWNIYAAQA 344 +K+ + T + +A++ + K Y A A Sbjct: 1828 YHSKKQLLQLISQIKYRPPTEAQDIGNAMRFVARNIFKRTYGGANVRKVAVFF 1880 >UniRef50_Q30SU6 von Willebrand factor, type A n=3 Tax=Campylobacterales RepID=Q30SU6_SULDN Length = 309 Score = 79.9 bits (195), Expect = 2e-13, Method: Composition-based stats. Identities = 27/241 (11%), Positives = 61/241 (25%), Gaps = 29/241 (12%) Query: 200 NSEPAQLLEEERLRKEIAELRAKIERVPFIDTFDL-RYKNYEKRPDPSSQAVMFCLMDVS 258 + + L K + + + K+ +P + ++D S Sbjct: 34 YFPHVAQFLKGTVSSSKRLLFLKWLGIFMMIVALMSPIKDEPYELEPKDGYEIALILDAS 93 Query: 259 GSMDQ----------STKDMAKRFYILLYLFLSRTYKNVEVVYIRHH------TQAKEVD 302 SM S D+ K + VV+ + T + Sbjct: 94 ESMKAQGFDVQNQHLSRFDVVKEIVSDFISQ-RKNDNMGLVVFGAYSFIASPLTYDVNIL 152 Query: 303 EHEFFYSQ---ETGGTIVSSALKLMDEVVKERYNPAQWNIYAAQASDGDNWADDSPLCHE 359 Q T ++++L ++K+ + A +DG + + + Sbjct: 153 NKILSQLQIGMAGKYTALNTSLAQGANLLKQSKSK---TKIAILLTDGYSTPQVDTITLD 209 Query: 360 ---ILAKKLLPVVRYYSYIEITRRAHQTLWREYEHLQSTFDNFAMQHIRDQDDIYPVFRE 416 + KK V + + + F + ++Y Sbjct: 210 IALDMIKKEGIKVYPIGIGMP-HEYNTEALLKIANESGGVA-FGASSAAELQEVYKKIDS 267 Query: 417 L 417 L Sbjct: 268 L 268 >UniRef50_C1XTM1 Uncharacterized protein containing a von Willebrand factor type A (VWA) domain n=2 Tax=Deinococci RepID=C1XTM1_9DEIN Length = 464 Score = 79.9 bits (195), Expect = 2e-13, Method: Composition-based stats. Identities = 22/213 (10%), Positives = 53/213 (24%), Gaps = 45/213 (21%) Query: 234 LRYKNYEKRPDPSSQAVMFCLMDVSGSMDQ---------------------------STK 266 L+ + + Q V+ ++D SGSM + S Sbjct: 29 LKIRPSAEATRSRPQLVVAFVVDTSGSMREVVTEPTERTGQSVRVDGKDYEVVRGAKSKI 88 Query: 267 DMAKRFYILLYL--FLSRTYKNVEVVY---------IRHHTQAKEVDEHEFFYSQETGGT 315 D+ L L + + V + + + +Q +GGT Sbjct: 89 DLVIEALQNLLSSPQLQPSDRLAIVKFDDVAEVVQPFTPANEKARLVAAAERLTQYSGGT 148 Query: 316 IVSSALKLMDEVVKERYNPAQWNIYAAQASDGDNWADDSPLCHEILAKKLLPVVRYYSYI 375 + + ++ +++ + +DG + + + V Sbjct: 149 QMGAGMREGMRLLEREAG----SRRLILLTDGQTFDEPLVETVAAQLAQARIPVTAIGVG 204 Query: 376 EITRRAHQTLWREYEHLQSTFDNFAMQHIRDQD 408 + L E + ++ Sbjct: 205 ---DEWNDDLLAEITDRTQGKPFHVIPDNQNPQ 234 >UniRef50_A8LLA0 von Willebrand factor type A domain protein n=7 Tax=Proteobacteria RepID=A8LLA0_DINSH Length = 328 Score = 79.9 bits (195), Expect = 2e-13, Method: Composition-based stats. Identities = 23/203 (11%), Positives = 46/203 (22%), Gaps = 30/203 (14%) Query: 240 EKRPDPSSQAVMFCLMDVSGSMDQSTK-----------DMAKRFYILLYLFLSRTYKNVE 288 E S+ + +D+SGSMD K + Sbjct: 86 EPITITSAARDLVLAVDISGSMDDRDMTAPDGTRLQRLQAVKDVVGAFVAE-REGDRISL 144 Query: 289 VVY---------IRHHTQAKEVDEHEFFYSQETGGTIVSSALKLMDEVVKERYNPAQWNI 339 +V+ + ++ T + A+ L ++ Sbjct: 145 IVFGAKPFIQAPFTEDLDSVVELLNQVQTGMAGPNTAIGDAIGLAIRSFEDSEIEE---R 201 Query: 340 YAAQASDGDNWADD-SPLCHEILAKKLLPVVRYYSYIEITRR----AHQTLWREYEHLQS 394 SDG + A +P+ +A + + + Sbjct: 202 LLILLSDGADTASTMTPINAAQIAAQEGITIYTIGVGNPDGSGEERLDPATLEDIATRGG 261 Query: 395 TFDNFAMQHIRDQDDIYPVFREL 417 F + +IY L Sbjct: 262 GAFYF-ADDVEGLSEIYAEIDAL 283 >UniRef50_Q74B80 Putative uncharacterized protein n=1 Tax=Geobacter sulfurreducens RepID=Q74B80_GEOSL Length = 575 Score = 79.5 bits (194), Expect = 2e-13, Method: Composition-based stats. Identities = 56/372 (15%), Positives = 106/372 (28%), Gaps = 32/372 (8%) Query: 25 RYKAQIKQSISEAINKRSVTDVDSGESVSIPTEDISEPMFHQGR------GGLRHRVHPG 78 R A ++ + + I + +G+S S+P H + R + G Sbjct: 180 REAAALRDELMKLIRDSAHQQPKNGQS----RPSQSQPGSHGDQNDGTGSDNSRQQNGTG 235 Query: 79 NDHFVQNDRIERPQGGGGGSGSGQGQASQDGEGQDEFVFQI-SKDEYLDLLFEDLALPNL 137 ++ Q ++ Q +GSGQ ++S S++ L E L+ ++ Sbjct: 236 SNGSRQAQNGDKQQISSQPAGSGQDKSSGHSSQGQSGASDSPSQESVRQALAEALSGKSV 295 Query: 138 KQNQQRQLTEYKTHRAGYTANGVPANISVVRSLQNSLARRTAM-TAGKRRELHALEENLA 196 + E A N + + + A+ + Sbjct: 296 G--SFGNVGEKLAELLCQNATESSFNGTAAAAPRLPTAQLLNQNGGYDDLSALRVHTAAL 353 Query: 197 IISNSEPAQLLEEERLRKEIAELRAKIERVPFIDTFDLRYKNYEKRPDPSSQAVMFCLMD 256 Q +++R R + + D R K + + L+D Sbjct: 354 RARLQGLVQASKQKRSTPVSVGHRLDSRVLTRLRICDTRVFT-RKEEKRAVNTAVCMLLD 412 Query: 257 VSGSMDQS----TKDMAKRFYILLYLFLS--RTYKNVEVVYIRHHTQAK------EVDEH 304 SGSM + +A R + L + + H E +H Sbjct: 413 SSGSMGNTTILNKMGIASRACFVAAEALFSIPGVRTAIATFKGHDNHVFPMVNFGEKPDH 472 Query: 305 EFFYSQETGGTIVSSALKLMDEVVKERYNPAQWNIYAAQASDGDNWADDSPLCHEILAKK 364 F +GGT + AL + R + SDGD D P+ + + Sbjct: 473 SRFNITGSGGTRLGHALWWAWGELSLRR---ETRKICIAFSDGDT--GDGPVTQAAIKRM 527 Query: 365 LLPVVRYYSYIE 376 + Sbjct: 528 REEGIEVIGIGI 539 >UniRef50_Q22ML1 von Willebrand factor type A domain containing protein n=1 Tax=Tetrahymena thermophila SB210 RepID=Q22ML1_TETTH Length = 685 Score = 79.5 bits (194), Expect = 2e-13, Method: Composition-based stats. Identities = 16/190 (8%), Positives = 42/190 (22%), Gaps = 31/190 (16%) Query: 253 CLMDVSGSM-DQSTKDMAKRFYILLYLFLSRTYKNVEVVYIRH----------HTQAKEV 301 L+D S SM ++ K++ L + + + + + E Sbjct: 91 ILIDRSQSMMSENKLQNVKQYLCNLIEKANTNSQFALISFGSSQKLIFNFTQVTHENLES 150 Query: 302 DEHEFFYSQETGGTIVSSALKLMDEV------VKERYNPAQWNIYA----AQASDGDNWA 351 + + TG T + AL++ + ++ + +DG + Sbjct: 151 IKGQINNIISTGDTNIIQALEVAHNIIKQDQQLENQKEEQTKKRIVRYSAFLLTDGQDNM 210 Query: 352 DDSPLCHEILAKKLLPV-----VRYYSYIEITRRAHQTLWREYEHLQSTFDNFAMQHIRD 406 + + + L + Sbjct: 211 K--EKAIFKFRENFKNKDMDYSINCLGFGI---DHDPLLLGAITSYTGGKFYYIKPEESV 265 Query: 407 QDDIYPVFRE 416 + Sbjct: 266 FSVFQDYIKN 275 >UniRef50_Q32NR2 MGC130922 protein n=3 Tax=Tetrapoda RepID=Q32NR2_XENLA Length = 840 Score = 79.5 bits (194), Expect = 2e-13, Method: Composition-based stats. Identities = 29/187 (15%), Positives = 55/187 (29%), Gaps = 23/187 (12%) Query: 246 SSQAVMFCLMDVSGSMDQSTKDMAKRFYILLYLFLSRTYKNVEVVYIRHHTQAKEVD--- 302 + ++D S S+ + ++ K+F + L + K V I++ T + Sbjct: 567 EGPVDLVFVIDGSKSLGEDNFEIVKQFVKGILDSLEISQKAARVGLIQYSTHVRTEFTMA 626 Query: 303 --------EHEFFYSQETG-GTIVSSALKLMDEVV-----KERYNPAQWNIYAAQASDGD 348 + + G G++ ALKLM E R P + A +DG Sbjct: 627 QYSSAKDVKKAVSQIKYMGRGSMTGLALKLMHEKSFSEAQGARARPMRVPRVAIVFTDGR 686 Query: 349 NWADDSPLCHEILAKKLLPVVRYYSYIEITRRAHQTLWREYEHLQSTFDNFAMQHIRDQD 408 A D + AK+ + + +E + Sbjct: 687 --AQDEVSEYAEKAKQSGITIYAIGIGKAI----DEELQEIASAPQEKHVIYAEDFSAMG 740 Query: 409 DIYPVFR 415 I + Sbjct: 741 YIMEKLK 747 Score = 71.8 bits (174), Expect = 5e-11, Method: Composition-based stats. Identities = 26/187 (13%), Positives = 53/187 (28%), Gaps = 23/187 (12%) Query: 247 SQAVMFCLMDVSGSMDQSTKDMAKRFYILLYLFLSRTYKNVEVVYIRH-HTQAKEVDEHE 305 + ++D S S+ + + K F I + FL N V +++ T E Sbjct: 49 KPMDLVFIIDSSRSVRPADFEKVKEFLITMLKFLDIGPDNTRVGLLQYGSTVKNEFSLKT 108 Query: 306 FFY-----------SQETGGTIVSSALKLMDEVV-----KERYNPAQWNIYAAQASDGDN 349 + GT+ A++ + R A +DG Sbjct: 109 YKRKPDIERAVKRMMHLATGTMTGLAIQYAMNIAFSEAEGARPLNQYVPRIAMIVTDGRP 168 Query: 350 WADDSPLCHEILAKKLLPVVRYYSYIEITRRAHQTLWREYEHLQSTFDNFAMQHIRDQDD 409 EI AK + ++ + + + F + + + Sbjct: 169 QDP----VAEIAAKARNSGILIFAIGVGR--VDMSTLKTIGSQPHSEHVFLVANFSQIET 222 Query: 410 IYPVFRE 416 + VF+ Sbjct: 223 LTSVFQN 229 >UniRef50_C0DBA5 Putative uncharacterized protein n=1 Tax=Clostridium asparagiforme DSM 15981 RepID=C0DBA5_9CLOT Length = 231 Score = 79.5 bits (194), Expect = 2e-13, Method: Composition-based stats. Identities = 21/167 (12%), Positives = 53/167 (31%), Gaps = 18/167 (10%) Query: 246 SSQAVMFCLMDVSGSMDQSTKDMAKRFYILLYLFLSRTYK---NVEVVYIRHHTQAKEVD 302 L+D SGSM ++ + + + L + +V I ++ + V Sbjct: 20 ERHIACVLLVDTSGSMAGASINELNQGLLEFGNALDQDEHARGVADVCVISFNSNVETVV 79 Query: 303 EHEFFY------SQETGGTIVSSALKLMDEVVKERY------NPAQWNIYAAQASDGDNW 350 G T ++ A+ + ++ER + + + +DG+ Sbjct: 80 PFCPAANYSAPTLSAGGLTSMNEAVIAGLDAIEERKQLYRQLGCSYYRPWMFLLTDGEPT 139 Query: 351 ADDSPLCHEILAKKLL--PVVRYYSYIEITRRAHQTLWREYEHLQST 395 + + ++ L V ++ + A+ + Y + Sbjct: 140 DQNMEGEAKNRLQQALNDKKVNFFPMGIGSG-ANYAHLKSYTKGGNG 185 >UniRef50_A6FXN3 Putative uncharacterized protein n=1 Tax=Plesiocystis pacifica SIR-1 RepID=A6FXN3_9DELT Length = 416 Score = 79.5 bits (194), Expect = 2e-13, Method: Composition-based stats. Identities = 23/197 (11%), Positives = 41/197 (20%), Gaps = 36/197 (18%) Query: 251 MFCLMDVSGSMDQSTKDMAKRFYILLYLFLSRTYKNVEVVYIRHH----------TQAKE 300 M ++D S SM + AK + L L+ VV+ ++ Sbjct: 1 MVLVVDTSASMKGDAIEGAKAAAMELVDGLAEGDSFALVVFHSRAEVLMPSTVINEDSRA 60 Query: 301 VDEHEFFYSQETGGTIVSSALKLMDEVV---------------KERYNPAQWNIYAAQAS 345 + Q G T ++ L+ + + Sbjct: 61 AARSKIETMQAWGTTDLAGGLQQALAQLQVAQNIVGAGGSTGAQSGAPDPTVLERVVLLG 120 Query: 346 DGDNWADDSPLCHEILAKKLLPVVRYYSYIEITRRAHQTLWREYEHLQSTFDNFAMQHIR 405 DG + + Y +TL F Sbjct: 121 DGVPNDASTIPSTVGQLAARGTQITALGYGI---EYDETLLASLAEQTHGSFRFVDD--- 174 Query: 406 DQDDIYPVFRELFHKQN 422 LF + Sbjct: 175 -----PEAVASLFRDEV 186 >UniRef50_Q1GMB7 von Willebrand factor type A n=3 Tax=Rhodobacteraceae RepID=Q1GMB7_SILST Length = 477 Score = 79.5 bits (194), Expect = 2e-13, Method: Composition-based stats. Identities = 23/182 (12%), Positives = 45/182 (24%), Gaps = 24/182 (13%) Query: 253 CLMDVSGSMDQS-----TKDMAKRFYILLYLFLSRTYKNVEVVY--------------IR 293 ++D SGSM +A+ L L + + Y I Sbjct: 28 LVLDASGSMWGQIDGVAKITIAQDVMQHLLKTLPENQELGLMAYGHRRKGDCNDIEQLIA 87 Query: 294 HHTQAKEVDEHEFFYSQETGGTIVSSALKLMDEVVKERYNPAQWNIYAAQASDGDNWADD 353 +++ G T +S+A+ + ++ A SDG+ Sbjct: 88 PAAGSRQAISQAVTQISPKGKTPLSAAVMQAADALRSSEEKA----TVILISDGEETCGL 143 Query: 354 SPLCHEILAKKLLPVVRYYSYIEI-TRRAHQTLWREYEHLQSTFDNFAMQHIRDQDDIYP 412 P + ++ A + + F A + Sbjct: 144 DPCAVGAELEARGVDFTLHAIGFGIADDAARAQLQCLAENTGGFYRDASSASELTAALAQ 203 Query: 413 VF 414 V Sbjct: 204 VA 205 >UniRef50_C0QK10 Putative uncharacterized protein n=1 Tax=Desulfobacterium autotrophicum HRM2 RepID=C0QK10_DESAH Length = 598 Score = 79.5 bits (194), Expect = 2e-13, Method: Composition-based stats. Identities = 28/217 (12%), Positives = 54/217 (24%), Gaps = 43/217 (19%) Query: 233 DLRYKNYEKRPDPSSQAVMFCLMDVSGSMDQSTK-----DMAKRFYILLYLFLSRTYKNV 287 L ++ + +D S SM + AKR I L + + + Sbjct: 75 PLAGFRWQTVEQK--GVDIMICLDCSRSMLAQDIKPTRLERAKREIIDLMGMIQ-SDRAG 131 Query: 288 EVVYIRHH------TQAKEVDEHEFFYSQET----GGTIVSSALKLMDEVVKERYNPAQW 337 V + T + GGT + A++ ++ Sbjct: 132 LVAFAGRAILQCPLTLDHSAFNLFLNALEPDYLPVGGTDLGGAIETALNGFEKEVESE-- 189 Query: 338 NIYAAQASDGDNWADDSPLCHEILAKKLLPVVRYYSYI---------------------E 376 +DG+N DS + A + + Sbjct: 190 -KAIILITDGENTTGDSIEMAKKAADQ-GVKIFCIGVGSPEGAPVPDSAGGFKKDRSGKI 247 Query: 377 ITRRAHQTLWREYEHLQSTFDNFAMQHIRDQDDIYPV 413 I R + ++ + ++ D D IY Sbjct: 248 IISRVDEPALKKIAAMTQGVYVRSVAGDMDLDRIYDQ 284 >UniRef50_A3PUP3 von Willebrand factor, type A n=1 Tax=Mycobacterium sp. JLS RepID=A3PUP3_MYCSJ Length = 233 Score = 79.5 bits (194), Expect = 2e-13, Method: Composition-based stats. Identities = 26/165 (15%), Positives = 51/165 (30%), Gaps = 17/165 (10%) Query: 242 RPDPSSQAVMFCLMDVSGSMDQSTKDMAKRFYILLYLFLSRT---YKNVEVVYIRHHTQA 298 +P + L DVSGSM +R + +L K VEV + T A Sbjct: 12 DANPDPRVACVVLADVSGSMQGEPIAALERGFAAFTRYLQNEVLASKRVEVAVVTFGTVA 71 Query: 299 KEVDEHEFFY------SQETGGTIVSSALKLMDEVVKERYNP------AQWNIYAAQASD 346 + + +G T +++ + L +++++R + + + +D Sbjct: 72 TVLVPMQEARTLQPVAFTASGTTNMAAGIHLALDILEDRKHAYKAAGLQYYRPWILLLTD 131 Query: 347 GDNWADDSPLCHEILAK-KLLPVVRYYSYIEITRRAHQTLWREYE 390 G D L + V ++ R Sbjct: 132 GKPNLDGFDEAVARLNAVESARGVTVFAVGAG-PRVDYQQLGRLS 175 >UniRef50_C4PY90 Dihydropyridine-sensitive l-type calcium channel, putative n=1 Tax=Schistosoma mansoni RepID=C4PY90_SCHMA Length = 421 Score = 79.5 bits (194), Expect = 2e-13, Method: Composition-based stats. Identities = 29/227 (12%), Positives = 56/227 (24%), Gaps = 39/227 (17%) Query: 210 ERLRKEIAELRAKIERVPFIDTFDLRYKNYEKRPDPSSQAVMFCLMDVSGSMDQSTKDMA 269 + A R +D FD+R +++ S +F L+D SGSM + +A Sbjct: 183 ASFTGILRVYPAFPWRQQNVDMFDVRRRSW-FIQGSSVPKDLFILLDTSGSMTGQSLKLA 241 Query: 270 KRFYILLYLFLSRTYKNVEVVYIRHHTQAKEVDEHEFFY--------------------- 308 L L + + Sbjct: 242 NLSAQKLIEALDVDDYFTVAHFPGAKDHVAPMIVTANNESEPICFNSFVQATRRNKLRLF 301 Query: 309 -----SQETGGTIVSSALKLMDEVVKE-------RYNPAQWNIYAAQASDGDNWADDSPL 356 + G + ++LK E+ + N +D N Sbjct: 302 YDLSTLKARGYSDFPASLKFAYEMFRNLTESARGDRGKELRNKILVLLTD--NAFVFDES 359 Query: 357 CHEILAKKL-LPVVRYYSYIEITRRAHQTLWREYEHLQSTFDNFAMQ 402 L ++ YS E A++ + + + + Sbjct: 360 VLSQLKQQKSNITTFIYSLGEPVGAAYEHKMK--ACATNDYYQYLPT 404 >UniRef50_A7RNR9 Predicted protein n=1 Tax=Nematostella vectensis RepID=A7RNR9_NEMVE Length = 1450 Score = 79.5 bits (194), Expect = 2e-13, Method: Composition-based stats. Identities = 23/208 (11%), Positives = 48/208 (23%), Gaps = 33/208 (15%) Query: 218 ELRAKIERVPFIDTFDLRYKNYEKRPDPSSQAVMFCLMDVSGSMDQ-------STKDMAK 270 ++ ++D R + + + ++D SGSM + + MA Sbjct: 174 TTIYPMQSQAECGSYDNRARPWYVDAAAPKPKNVVLVVDSSGSMAEKHTANGKTWLQMAI 233 Query: 271 RFYILLYLFLSRTYKNVEVVYI-------RHHTQAKEV-------------DEHEFFYSQ 310 + L+ K V + T + + Sbjct: 234 DAAKAVLDTLNPRDKVGVVSLATDANTPGSNDTTWCYANTLAEANSVNINNMKIFLDGMR 293 Query: 311 ETGGTIVSSALKLMDEVVKERYN--PAQWNIYAAQASDGDNWA--DDSPLCHEILAKKLL 366 G T+ AL ++ P + +D + K L Sbjct: 294 SAGFTMYIPALTKAFALLLNSKPESPDDCDQVIIFLTDAKPTELKESVMRTIVESNKLLD 353 Query: 367 PVVRYYSYIEITRRAHQTLWREYEHLQS 394 V +Y + + Sbjct: 354 NRVVILAYGIGAEDF--SFLGDMVRQSG 379 >UniRef50_B0JR39 von Willebrand factor type A n=1 Tax=Microcystis aeruginosa NIES-843 RepID=B0JR39_MICAN Length = 724 Score = 79.5 bits (194), Expect = 2e-13, Method: Composition-based stats. Identities = 30/189 (15%), Positives = 53/189 (28%), Gaps = 31/189 (16%) Query: 245 PSSQAVMFCLMDVSGSMDQSTK-DMAKRFYILLYLFLSRT-YKNVEVVYIRHH------T 296 + + L+D SGSM+ K AK + + V + T Sbjct: 48 EQNPQSVVMLIDTSGSMNDDNKLQEAKNAAKAFIERQDPSVNRFAVVGFGSQVQIGTGLT 107 Query: 297 QAKEVDEHEFFYSQETGGTIVSSALKLMDEVVKERYNPAQWNIYAAQASDGDNWADDSPL 356 + GGT + L E ++ + + +DG SP Sbjct: 108 SDLATLNQAIDNLSDGGGTRMDLGLATAIEQLESSSSD----RHILLFTDGQPAPAPSPE 163 Query: 357 CHEILAKKLLPVVRYYSYIEITRRAHQTLWREYEHLQSTFDNFAMQHIRDQDDIYPVF-- 414 + I I ++ E +Q NFA + + + D Sbjct: 164 QID---------------IMIVLDVTSSMNEEIAGVQQGIQNFAQELKKRKLDAQIGLIA 208 Query: 415 --RELFHKQ 421 LF ++ Sbjct: 209 FGDRLFGEE 217 >UniRef50_A7RPC2 Predicted protein (Fragment) n=2 Tax=Eumetazoa RepID=A7RPC2_NEMVE Length = 930 Score = 79.5 bits (194), Expect = 2e-13, Method: Composition-based stats. Identities = 20/238 (8%), Positives = 62/238 (26%), Gaps = 31/238 (13%) Query: 203 PAQLLEEERLRKEIAELRAKIERVPFIDTFDLRYKNYEKRPDPSSQAVMFCLMDVSGSMD 262 L+ + + + + + E P + ++D S SM Sbjct: 608 SGFQLQIQLAEIHVPRMWVERHDTEADSQACMLAFYPEFESGPPHHVEVIFVLDASCSMK 667 Query: 263 QSTKDMAKRFYILLYLFLSRTYKNVEVVYIRHHTQAKEVDEHEFFYSQET---------- 312 AK+ ++ + + VV+ ++++ + + + Sbjct: 668 GKALQEAKKLTLMCLSLMEEEWAFNIVVFGSNYSELFTQSQKKDKETVARAAKFVKSVKA 727 Query: 313 --GGTIVSSALKLMDEVVKERYNPAQWNIYAAQASDGDNWADDSPLCHEILAKKLLPVVR 370 G T + L+ + + + A + +DG + + P Sbjct: 728 VKGSTDLWRVLRSLY--LLRCNSTADYPSNVFLFTDGH---VTEESTTLAYIRDIRPTC- 781 Query: 371 YYSYIEITRRAHQTLWREYEHLQSTFDNFAMQHIRDQDD--IYPVFRELFHKQNATAK 426 ++ R + + ++ + + + + + K Sbjct: 782 -----------NRHFLRSMARVGAGAYELFDSKVKSKWERKVESQLSKARQPVLTSVK 828 >UniRef50_D1VKI5 von Willebrand factor type A n=1 Tax=Frankia sp. EuI1c RepID=D1VKI5_9ACTO Length = 560 Score = 79.2 bits (193), Expect = 3e-13, Method: Composition-based stats. Identities = 31/273 (11%), Positives = 59/273 (21%), Gaps = 29/273 (10%) Query: 169 SLQNSLARRTAMTAGKRRELHALEENLAIISNSEPAQLLEEERLRKEIAELRAKIERVPF 228 +L R ++ L ++ PA R L Sbjct: 288 ALLADNRRDEYTRVVNWLRSPQVQRRLQDQTSRRPAVPGVPLDSRFGTRALAELPFPATE 347 Query: 229 IDTFDLRYKNYEKRPDPSSQAVMFCLMDVSGSMDQSTKDMAKRFYILLY----------L 278 L ++ P+ ++D SGSM+ ++ L Sbjct: 348 QVADQLLAAYLDQYRRPTRA---IYVLDTSGSMEGPRLAALQQALTGLTGADDSLSGRFA 404 Query: 279 FLSRTYKNVEVVYIRHHTQAKEVD--------------EHEFFYSQETGGTIVSSALKLM 324 + + + T ++ + G T + SAL Sbjct: 405 RFRAREQVTIITFNDKVTATRQFTVSDPTPGSADLKAISDYGAALRAGGNTAIYSALDAA 464 Query: 325 DEVVKERYNPA-QWNIYAAQASDGDNWADDSPLCHEILAKKLLPVVR-YYSYIEITRRAH 382 +DG+N P VR ++ A Sbjct: 465 YTTAAAGMKADPSALTSIVLMTDGENNRGLDSAGFLARYNTRPPDVRGVRTFAVDFGDAD 524 Query: 383 QTLWREYEHLQSTFDNFAMQHIRDQDDIYPVFR 415 + + A D++ R Sbjct: 525 RAALTQIATSTGGAVFDATAPGVSLSDVFREIR 557 >UniRef50_D2V048 Predicted protein n=2 Tax=Naegleria gruberi RepID=D2V048_NAEGR Length = 1065 Score = 79.2 bits (193), Expect = 3e-13, Method: Composition-based stats. Identities = 25/171 (14%), Positives = 52/171 (30%), Gaps = 20/171 (11%) Query: 245 PSSQAVMFCLMDVSGSMDQSTKDMAKRFYILLYLFLSR-TYKNVEVVYIRHHTQAKEVDE 303 + ++ +D SGSM S AK L + + + +V+ + + Sbjct: 80 KNQGKLLIIALDKSGSMAGSGISEAKLALETLLSNVEGCNERILFIVFDSNSELIDMTNM 139 Query: 304 HEFFYSQ------ETGGTIVSSALKLMDEVVKERYNPAQWNIYAAQASDGDNWADDS--- 354 Q GGT SS + ++++ + +DG + + Sbjct: 140 ELENKLQVVKKVSAGGGTDFSS----VFKIIRNYGGSLNGQVAIIFFTDGQDQYSSNSTR 195 Query: 355 PLCHEILAKKLL---PVVRYYSYIEITRRAHQTLWREYEHLQ--STFDNFA 400 + L ++L +++ T L + L FA Sbjct: 196 EGSIKSLQERLNTESESYEFHTIGF-TSVHDARLLTDITRLGTAQGSFQFA 245 >UniRef50_Q5SKK6 Putative uncharacterized protein TTHA0637 n=4 Tax=Thermaceae RepID=Q5SKK6_THET8 Length = 407 Score = 79.2 bits (193), Expect = 3e-13, Method: Composition-based stats. Identities = 39/292 (13%), Positives = 75/292 (25%), Gaps = 36/292 (12%) Query: 132 LALPNLKQNQQRQ---LTEYKTHRAGYTANGVPA-NISVVRSLQNSLARRTAMTAGKRRE 187 L LP Q E R T + +R L +L R Sbjct: 104 LRLPGEDPTDPAQGGYRGEAGEARFELTEKASDFLGLKSLRELLGALGRNPPGLHPTPHH 163 Query: 188 LHALEENLAIISNSEPAQLLEEERLRKEIAELRAKIERVPFIDTFDLRYKNYEKRPDPSS 247 +E+ L + A + DL + ++ Sbjct: 164 APGVEKTGETKPWEWGDPLELNVPETLKKAMAKGLERLSHEDLVIDL--------AEYTA 215 Query: 248 QAVMFCLMDVSGSM---DQSTKDMAKRFYILLYLFLSRTYKNVEVVYIRHHTQAKEVDEH 304 L+D S SM + AK+ + L + Y V ++ H A+E+ Sbjct: 216 SMSTVVLLDCSHSMILYGEDRFTPAKKVALALAHLIRTQYPGDRVRFVLFHDTAEEIPLA 275 Query: 305 EFFYSQETGG-TIVSSALKLMDEVVKERYNPAQWNIYAAQASDGDNWADD---------- 353 + Q T + L+L ++++ +DG A Sbjct: 276 KLPLVQVGPYHTNTKAGLELARTLLRKM---GGEMKQIILITDGKPSALTLPSGEIYKNA 332 Query: 354 ---SPLCHEILAKK----LLPVVRYYSYIEITRRAHQTLWREYEHLQSTFDN 398 PL K+ + ++++ ++ + Sbjct: 333 WGLDPLILAETLKEATLARREGIPIHTFMLAREPELLAFVKKLSQITRGKAY 384 >UniRef50_C5YBL3 Putative uncharacterized protein Sb06g000656 n=1 Tax=Sorghum bicolor RepID=C5YBL3_SORBI Length = 434 Score = 79.2 bits (193), Expect = 3e-13, Method: Composition-based stats. Identities = 28/211 (13%), Positives = 53/211 (25%), Gaps = 40/211 (18%) Query: 245 PSSQAVMFCLMDVSGSMDQS----------TKDMAKRFYILLYLFLSRTY--------KN 286 + + ++DVSGSM ++AK L + Sbjct: 8 ERAPVDVVAVLDVSGSMAWDYGNGTTVENHRLELAKEAMAKAIQSLGPAAAAVAAGGARR 67 Query: 287 VEVVYIRHHTQAKEVD-------------EHEFFYSQETGGTIVSSALKLMDEVVKERYN 333 + + K+V ++ + G LK+ +++ ER Sbjct: 68 NRLAVVPFSNVVKQVTPLTEMDMEGQQTVKNAVDALKPGGQADYLMPLKIAAKILDERKA 127 Query: 334 -PAQWNIYAAQASDGDNWADDSPLCHEI-LAKKLLPVVRYYSYIEITRRAHQT-----LW 386 SDG + + L + L +++ + Sbjct: 128 EEKDRLAIIIFVSDGQDHYFRDTDDMKETLTQHKLIKYPIHAFGVSVSEQDSSGGGAKAL 187 Query: 387 REYEHLQSTFDNFAMQHIRDQDDIYPVFREL 417 R S Q D D V +L Sbjct: 188 RAMADATSGSYTSITQ--DDDVDTMAVAEKL 216 >UniRef50_O00339 Matrilin-2 n=30 Tax=Euteleostomi RepID=MATN2_HUMAN Length = 956 Score = 79.2 bits (193), Expect = 3e-13, Method: Composition-based stats. Identities = 25/198 (12%), Positives = 55/198 (27%), Gaps = 24/198 (12%) Query: 247 SQAVMFCLMDVSGSMDQSTKDMAKRFYILLYLFLSRTYKNVEVVYIRH-HTQAKEVDEHE 305 +A + ++D S S++ K F + + FL V +++ T E Sbjct: 54 KRADLVFIIDSSRSVNTHDYAKVKEFIVDILQFLDIGPDVTRVGLLQYGSTVKNEFSLKT 113 Query: 306 FFYSQE-----------TGGTIVSSALKLMDEVV-----KERYNPAQWNIYAAQASDGDN 349 F E + GT+ A++ + R +DG Sbjct: 114 FKRKSEVERAVKRMRHLSTGTMTGLAIQYALNIAFSEAEGARPLRENVPRVIMIVTDGRP 173 Query: 350 WADDSPLCHEILAKKLLPVVRYYSYIEITRRAHQTLWREYEHLQSTFDNFAMQHIRDQDD 409 DS A+ + ++ + + F + + + Sbjct: 174 --QDSVAEVAAKARD--TGILIFAIGVGQVDFN--TLKSIGSEPHEDHVFLVANFSQIET 227 Query: 410 IYPVFRELFHKQ-NATAK 426 + VF++ + Sbjct: 228 LTSVFQKKLCTAHMCSTL 245 Score = 77.6 bits (189), Expect = 7e-13, Method: Composition-based stats. Identities = 31/199 (15%), Positives = 64/199 (32%), Gaps = 23/199 (11%) Query: 246 SSQAVMFCLMDVSGSMDQSTKDMAKRFYILLYLFLSRTYKNVEVVYIRHHTQAKEV---- 301 + ++D S S+ + ++ K+F + L+ + K V +++ TQ Sbjct: 651 EGPIDLVFVIDGSKSLGEENFEVVKQFVTGIIDSLTISPKAARVGLLQYSTQVHTEFTLR 710 Query: 302 -------DEHEFFYSQETG-GTIVSSALKLMDEVV-----KERYNPAQWNIYAAQASDGD 348 + + + G G++ ALK M E R + A +DG Sbjct: 711 NFNSAKDMKKAVAHMKYMGKGSMTGLALKHMFERSFTQGEGARPLSTRVPRAAIVFTDGR 770 Query: 349 NWADDSPLCHEILAKKLLPVVRYYSYIEITRRAHQTLWREYEHLQSTFDNFAMQHIRDQD 408 D S +K + Y+ A + +E + F + D Sbjct: 771 AQDDVSEWA----SKAKANGITMYAVGVGK--AIEEELQEIASEPTNKHLFYAEDFSTMD 824 Query: 409 DIYPVFRELFHKQNATAKG 427 +I ++ + + G Sbjct: 825 EISEKLKKGICEALEDSDG 843 >UniRef50_Q9VJM0 CG12455, isoform A n=9 Tax=Drosophila RepID=Q9VJM0_DROME Length = 2190 Score = 79.2 bits (193), Expect = 3e-13, Method: Composition-based stats. Identities = 24/235 (10%), Positives = 61/235 (25%), Gaps = 35/235 (14%) Query: 206 LLEEERLRKEIAELRAKIERVPFIDTFDLRYKNYEKRPDPSSQAVMFCLMDVSGSMDQST 265 +L + + + + K + + L+D SGSM Sbjct: 204 ILRHYPAAQWTDTRPNRDDADTYDCR-----KRSWYIETATCSKDIVILLDHSGSMTGFR 258 Query: 266 KDMAKRFYILLYLFLSRTYKNVEVVYIR--------HHT-------QAKEVDEHEFFYS- 309 +AK + S + Y + + EV + Sbjct: 259 HHVAKFTIRSILDTFSNNDFFTILRYSSEVNDIIPCFNGALVQATPENIEVFNQQIEQLD 318 Query: 310 QETGGTIVSSALKLMDEVVKERYN------PAQWNIYAAQASDGDNWADDSPLCHEILAK 363 G ++ A + +++++ Y+ + N +DG A ++ + Sbjct: 319 DPEGYANLTLAYETAFQLLRKYYDSRHCVTNSTCNQAIMLVTDG--VAGNTTEVFQKYNW 376 Query: 364 KLLP------VVRYYSYIEITRRAHQTLWREYEHLQSTFDNFAMQHIRDQDDIYP 412 R ++Y+ + L + + +++ Sbjct: 377 GNGENGTSQMDTRVFTYLLGKEVTKVREIQWMACLNRGYYSHVQTLDEVHEEVLK 431 >UniRef50_Q5BIB2 RE14947p n=5 Tax=melanogaster subgroup RepID=Q5BIB2_DROME Length = 1215 Score = 79.2 bits (193), Expect = 3e-13, Method: Composition-based stats. Identities = 24/235 (10%), Positives = 61/235 (25%), Gaps = 35/235 (14%) Query: 206 LLEEERLRKEIAELRAKIERVPFIDTFDLRYKNYEKRPDPSSQAVMFCLMDVSGSMDQST 265 +L + + + + K + + L+D SGSM Sbjct: 242 ILRHYPAAQWTDTRPNRDDADTYDCR-----KRSWYIETATCSKDIVILLDHSGSMTGFR 296 Query: 266 KDMAKRFYILLYLFLSRTYKNVEVVYIR--------HHT-------QAKEVDEHEFFYS- 309 +AK + S + Y + + EV + Sbjct: 297 HHVAKFTIRSILDTFSNNDFFTILRYSSEVNDIIPCFNGALVQATPENIEVFNQQIEQLD 356 Query: 310 QETGGTIVSSALKLMDEVVKERYN------PAQWNIYAAQASDGDNWADDSPLCHEILAK 363 G ++ A + +++++ Y+ + N +DG A ++ + Sbjct: 357 DPEGYANLTLAYETAFQLLRKYYDSRHCVTNSTCNQAIMLVTDG--VAGNTTEVFQKYNW 414 Query: 364 KLLP------VVRYYSYIEITRRAHQTLWREYEHLQSTFDNFAMQHIRDQDDIYP 412 R ++Y+ + L + + +++ Sbjct: 415 GNGENGTSQMDTRVFTYLLGKEVTKVREIQWMACLNRGYYSHVQTLDEVHEEVLK 469 >UniRef50_Q1N498 Putative uncharacterized protein n=1 Tax=Bermanella marisrubri RepID=Q1N498_9GAMM Length = 731 Score = 79.2 bits (193), Expect = 3e-13, Method: Composition-based stats. Identities = 26/193 (13%), Positives = 54/193 (27%), Gaps = 25/193 (12%) Query: 246 SSQAVMFCLMDVSGSMDQSTKDMAKRFYILLYLFLSRTYKNVEVVYIRHHTQ-------- 297 L+D+SGSM + + V++ + + Sbjct: 348 QRGTDWTLLLDISGSMQG-KFQTLIEGVKKGLKRFNPQDRVRVVLFNDYASNLTGGFLPA 406 Query: 298 ---AKEVDEHEFFYSQETGGTIVSSALKLMDEVVKERYNPAQWNIYAAQASDGD-NWADD 353 + GGT + ++ + A +DG N + Sbjct: 407 TQKNIAEIIRKLDLVLPNGGTHLMDGVRFALSGLD-----ADRTSAIWLVTDGVTNVGET 461 Query: 354 SPLCHEILAKKLLPVVRYYSYIEITRRAHQTLWREYEHLQSTFDNFAMQHIRDQDDIYPV 413 L K+ +R +++I A++ L + + F ++ + DDI Sbjct: 462 KQRKFVDLLKQK--DIRVFTFIMGNG-ANRPLLKAITKASNGFA----INVSNSDDIIGQ 514 Query: 414 FRELFHKQNATAK 426 + K A Sbjct: 515 LEKAASKVTHEAL 527 >UniRef50_D0L4S2 von Willebrand factor type A n=1 Tax=Gordonia bronchialis DSM 43247 RepID=D0L4S2_GORB4 Length = 461 Score = 79.2 bits (193), Expect = 3e-13, Method: Composition-based stats. Identities = 27/208 (12%), Positives = 49/208 (23%), Gaps = 41/208 (19%) Query: 251 MFCLMDVSGSMD-----QSTKDMAKRFYILLYLFLSRTYKNVEVVYIRHHT--------- 296 + ++D S SM D A+ L+ V Y + Sbjct: 45 VVVILDGSESMQIADAPGPRIDAARNAVSTFISDLTSGTPFGLVAYGNTESAKTTPQAVG 104 Query: 297 ------------QAKEVDEHEFFYSQETGGTIVSSALKLMDEVVKERYNPAQWNIYAAQA 344 KE + G T +S+AL E++ Sbjct: 105 CEDVSTLARLGPIDKEAARSAIDGVRAQGWTPLSAALTRAAEMLGTEAGS------VVLV 158 Query: 345 SDGDNWADDSPLCHEILAKKLLPVVRYYSYIEITRRAHQTLWREYEHLQSTFDNFAM--- 401 SDG+ P ++ P + + ++ + A Sbjct: 159 SDGEANCLPDPCATARSLREQNPNLTISTVGF---KSDAAQLQCVAREGGGVFVTADNTA 215 Query: 402 ---QHIRDQDDIYPVFRELFHKQNATAK 426 I D +L + A + Sbjct: 216 QLSARIDAARDAEAASSKLTNTGLAGVE 243 >UniRef50_P70960 Uncharacterized protein ywmC n=5 Tax=Bacillus RepID=YWMC_BACSU Length = 227 Score = 79.2 bits (193), Expect = 3e-13, Method: Composition-based stats. Identities = 25/212 (11%), Positives = 55/212 (25%), Gaps = 38/212 (17%) Query: 236 YKNYEKRPDPSSQAVMFCLMDVSGSMDQ-----STKDMAKRFYILLYLFLSRTYKNVEVV 290 + + + + A + L+D SGSM + S + AK+ L + V Sbjct: 22 FAAEKTETEAKAPANVAVLLDASGSMAKRIDGVSKFNSAKKEISKFASSLPEGTQVKMSV 81 Query: 291 YI---------------------RHHTQAKEVDEHEFFYSQETGGTIVSSALKLMDEVVK 329 + + ++ + TG T ++ AL Sbjct: 82 FGSEGNNKNSGKVQSCEAIRNVYGFQSFNEQSFLNSLNTIGPTGWTPIAKALNEAKSSFD 141 Query: 330 ERYNPAQWNIYAAQASDGDNWADDSPLCHEILAKKLLPVVRYYSYIEITRRAHQTLWREY 389 + + +DG+ +P+ +K V + + Sbjct: 142 QLDAKGE--KVVYLLTDGEETCGGNPIKTAKELQKDNITVNVIGFDYKEGY--KGQLNAI 197 Query: 390 EHLQSTFDNFAMQHIRDQDDIYPVFRELFHKQ 421 + A ++F +Q Sbjct: 198 AKVGGGEYFPAYT--------QKDVEKIFTQQ 221 >UniRef50_Q7UML1 BatA n=3 Tax=Planctomycetaceae RepID=Q7UML1_RHOBA Length = 357 Score = 79.2 bits (193), Expect = 3e-13, Method: Composition-based stats. Identities = 26/228 (11%), Positives = 55/228 (24%), Gaps = 51/228 (22%) Query: 245 PSSQAVMFCLMDVSGSMDQSTKDM----------AKRFYILLY---LFL--SRTYKNVEV 289 + + ++D SGSM ++ K L + + Sbjct: 82 QTEGIAIEMVIDRSGSMQALDFNIDGEPVDRLTAVKNVASKFITGGEDLEGRFSDLVGLI 141 Query: 290 VYIRHHTQAKEVDEHEFF-----------YSQETGGTIVSSALKLM---DEVVKERYNPA 335 + + F ++ GT + A+ L + R Sbjct: 142 TFAAYADAETPPTLDHSFVVSRLNQTEIVSRRDEDGTAIGDAIALSVEKLNALDARQERK 201 Query: 336 QWNIYAAQASDGDNWADD-SPLCHEILAKKLLPVVRYYSYIEITR--------------- 379 + +DG+N A + P+ LA+ L + + Sbjct: 202 VQSKILILLTDGENTAGELDPVQAAELAETLGIKIYAIGVGTTGKAPVPVRDPFTGRQRL 261 Query: 380 -----RAHQTLWREYEHLQSTFDNFAMQHIRDQDDIYPVFRELFHKQN 422 + ++ + F D IY +L + Sbjct: 262 HYMEVNIDEATLQKVAEITGGKY-FRATDTDSLDAIYREIDQLEKTEV 308 >UniRef50_A7C4T6 von Willebrand factor type A domain protein n=2 Tax=Gammaproteobacteria RepID=A7C4T6_9GAMM Length = 180 Score = 78.8 bits (192), Expect = 3e-13, Method: Composition-based stats. Identities = 20/122 (16%), Positives = 43/122 (35%), Gaps = 9/122 (7%) Query: 310 QETGGTIVSSALKLMDEVVKERYNPAQWNIYAAQASDGDNWADD-SPLCHEILAKKLLPV 368 T + A+ L + ++ER + + +DG+N A PL LAK+ Sbjct: 30 MAGRDTAIGDAIGLAVKKLRERP---EGSRILILLTDGENNAGALKPLQAAELAKQYDIR 86 Query: 369 VRYYSYIEITR----RAHQTLWREYEHLQSTFDNFAMQHIRDQDDIYPVFRELFHKQNAT 424 + ++T ++ L + F ++ +++Y + K A Sbjct: 87 IYTIGVGGKGGMFSRGLNETELKKIAQLTNGAY-FPATNLGALNNVYEHIDKTLQKTEAD 145 Query: 425 AK 426 + Sbjct: 146 TR 147 >UniRef50_Q502W6 von Willebrand factor A domain-containing protein 3B n=48 Tax=Amniota RepID=VWA3B_HUMAN Length = 1294 Score = 78.8 bits (192), Expect = 3e-13, Method: Composition-based stats. Identities = 29/192 (15%), Positives = 51/192 (26%), Gaps = 36/192 (18%) Query: 251 MFCLMDVSGSMDQSTKDMAK-RFYILLYLFLSRTYKNVEVVYIRHHTQAKEV-------- 301 ++ L+D S SM S D+ K + + L K V + +E Sbjct: 509 IYILIDTSHSMK-SKLDLVKDKIIQFIQEQLKYKSKFNFVKFDGQAVAWREQLAEVNEDN 567 Query: 302 ---DEHEFFYSQETGGTIVSSALKLMDEVVKERYNPAQWNIYAAQASDGDNWADDSPLCH 358 + + T SALK + +DG D P Sbjct: 568 LEQAQSWIRDIKIGSSTNTLSALKTAF--------ADKETQAIYLLTDGRP--DQPPETV 617 Query: 359 EILAKK-LLPVVRYYSYIEITRRAHQT----LWREYEHLQSTFDNFAMQHIRD---QDDI 410 K+ + S+ + +E L +F +D + + Sbjct: 618 IDQVKRFQEIPIYTISF-----NYNDEIANRFLKEVAALTGGEFHFYNFGCKDPTPPEAV 672 Query: 411 YPVFRELFHKQN 422 L K+ Sbjct: 673 QNEDVTLLVKEM 684 >UniRef50_UPI000155BCFC PREDICTED: similar to hCG1812043, partial n=1 Tax=Ornithorhynchus anatinus RepID=UPI000155BCFC Length = 567 Score = 78.8 bits (192), Expect = 3e-13, Method: Composition-based stats. Identities = 19/167 (11%), Positives = 42/167 (25%), Gaps = 20/167 (11%) Query: 271 RFYILLYLFLSRTYKNVEVVYIR-----------HHTQAKEVDEHEFFYSQET-GGTIVS 318 +++ L + + + ++ + GGT + Sbjct: 1 DALLVILKSLMPACLFNVIGFGSTFKTLFPSSQTYSEESLATACKNIKRLRADMGGTNIL 60 Query: 319 SALKLMDEVVKERYNPAQWNIYAAQASDGDNWADDSPLCHEILAKKLLPVVRYYSYIEIT 378 S LK + +DG A + L + R YS+ Sbjct: 61 SPLKWIVRQ----PIHRGHPRLLFLLTDG---AVSNTGKVLELVRNHSFSTRCYSFGIG- 112 Query: 379 RRAHQTLWREYEHLQSTFDNFAMQHIRDQDDIYPVFRELFHKQNATA 425 A + L + + F + R Q + ++ + Sbjct: 113 PNACRRLVQGLAAVSKGSAEFLAEGERLQPKMIKSLKKAMAPILSDV 159 >UniRef50_A1HT91 Von Willebrand factor, type A n=1 Tax=Thermosinus carboxydivorans Nor1 RepID=A1HT91_9FIRM Length = 586 Score = 78.8 bits (192), Expect = 3e-13, Method: Composition-based stats. Identities = 31/267 (11%), Positives = 71/267 (26%), Gaps = 14/267 (5%) Query: 149 KTHRAGYTANGVPANISVVRSLQNSLARRTAMTAGKRRELHALEENLAIISNSEPAQLLE 208 + + + + R+++ R K R + L Sbjct: 310 REFKDYLDRHLPDLEAHLRRAIRLLKPRSPDQGRSKARMQAMHGQEQRHAKRWTVGGSLG 369 Query: 209 EERLRKEIAELRAKIERVPFIDTFDLRYKNYEKRPDPSSQAVMFCLMDVSGSMDQSTKDM 268 + + + + + P + S+ + ++D S SM Sbjct: 370 QLAVAETVIAAAQRCAAGPGGPFTIGSQDIHHFIRKKKSKTDICLIIDASASMSGQRVGA 429 Query: 269 AKRFYILLYLFLSRTYKNVEVVYIRHH-------TQAKEVDEHEFFYSQETGGTIVSSAL 321 + +L LS + + +V+ + T+ E + + G T ++ L Sbjct: 430 --AKLLAKHLLLSTSDRVAVIVFQENQARVQVPLTRDFAQAESSLAHIESFGSTPLALGL 487 Query: 322 KLMDEVVKERYNPAQWNIYAAQASDGDNW--ADDSPLCHEILAKKLLPVVRYYSYIEITR 379 K+ E +KE N +DG + L Y + I Sbjct: 488 KVGIEYLKESRAK---NPLVILITDGVPTVGDITGDPLADALTAAASIKSHGYGFTCIGL 544 Query: 380 RAHQTLWREYEHLQSTFDNFAMQHIRD 406 + H+ + + + Sbjct: 545 KPHRDYLTQVAQAAGGNIYVLDELEKQ 571 >UniRef50_B9SSC8 Protein binding protein, putative n=1 Tax=Ricinus communis RepID=B9SSC8_RICCO Length = 705 Score = 78.8 bits (192), Expect = 3e-13, Method: Composition-based stats. Identities = 25/175 (14%), Positives = 47/175 (26%), Gaps = 14/175 (8%) Query: 186 RELHALEENLAIISNSEPAQLLEEERLRKEIAELRAKIERVPFIDTFDLRYKNYEKRPDP 245 + + E L E A + A + LR + Sbjct: 248 PAQEFQGFFVNPTPPVLKPRRNVELSLLPESAVVTAGRTYQTHVV--VLRIRAPPYTAAR 305 Query: 246 SSQAVMFCLMDVSGSMDQSTKDMAKRFYILLYLFLSRTYKNVEVVYIRHH---------- 295 + ++DVS M + KR ++ L+ + V + Sbjct: 306 RPPIDLVMVLDVSQRMCGVKLQVMKRIMRVVMSSLNSNDRLSIVAFSATSKRLSPLKRMT 365 Query: 296 TQAKEVDEHEFFYSQETG-GTIVSSALKLMDEVVKERYNPAQWNIYAAQASDGDN 349 + TG G + ALK +V+++R S+G + Sbjct: 366 ADGRRSARRIIDALGSTGQGMSANDALKKAAKVIEDRRVKNPVASIII-ISNGQD 419 >UniRef50_Q73PP7 Magnesium chelatase, subunit D/I family n=1 Tax=Treponema denticola RepID=Q73PP7_TREDE Length = 643 Score = 78.8 bits (192), Expect = 3e-13, Method: Composition-based stats. Identities = 39/323 (12%), Positives = 80/323 (24%), Gaps = 21/323 (6%) Query: 101 GQGQASQDGEGQDEFVFQISKDEYLDLLFEDLALPNLKQNQQRQLTEYKTHRAGYTANGV 160 + + + ++ + + N + + ++ G + Sbjct: 293 SDKDNNDKTKSDKNKTAEKPNEKDNQKIESKSRISNNENDATEHNGNNTNNQNGISYKNS 352 Query: 161 PANISVVRSLQNSL-----ARRTAMTAGKRRELHALEENLAIISNSEPAQLLEEERLRKE 215 ++ +L GKR + E + +Q L + Sbjct: 353 DTELADKIFKIKNLLSINEDNSFRKGMGKRNKTRTNELKGKSFGYTRSSQNLHNLAIIPT 412 Query: 216 IAELRAKIERVPFIDTFDLRYKNYEKRPDPSSQAVMFCLMDVSGSMDQSTK--DMAKRFY 273 I + + KR A + L+D SGSM + + Sbjct: 413 IKSAALHQIKPKKGIIKINKDDYKFKRRKTRIGASIIFLVDASGSMGAMKRMKETKNTIL 472 Query: 274 ILLYLFLSRTYKNVEVVYIRH-------HTQAKEVDEHEFFYSQETGGTIVSSALKLMDE 326 LL + + + + T++ + + E G T ++ L E Sbjct: 473 SLLMDSYQKHDEVSMITFAGTRVEIILPFTRSVLLAKRELQLIPTIGKTPLALGLNKALE 532 Query: 327 VVK-ERYNPAQWNIYAAQASDGDNWAD----DSPLCHEILAKKLLPVVRYYSYIEIT--R 379 K R +DG D P+ + K + YS + T Sbjct: 533 YFKIHRLKNKDMIPLLFLITDGRTNHGSVFFDEPIKDALFISKKIKNANIYSVVIDTESG 592 Query: 380 RAHQTLWREYEHLQSTFDNFAMQ 402 L E + Sbjct: 593 FVKLALAEEVAKNLNARYYQIED 615 >UniRef50_Q4J9H5 Conserved protein n=2 Tax=Sulfolobus RepID=Q4J9H5_SULAC Length = 360 Score = 78.8 bits (192), Expect = 3e-13, Method: Composition-based stats. Identities = 21/171 (12%), Positives = 47/171 (27%), Gaps = 11/171 (6%) Query: 247 SQAVMFCLMDVSGSMD-QSTKDMAKRFYILLYLFLSRTYKNVEVVYIRHHTQAKEV---D 302 S ++D S SM ++ ++A L + ++ E Sbjct: 37 SGIHYVVMIDNSPSMKKENKINLALSSASRLVQDIIPGNFISIYLFSNDIETLYEGESGK 96 Query: 303 EHEFFYSQETGGTIVSSALKLMDEVVKERYNPAQWNIYAAQASDGDNWADDSPLCHEILA 362 + E + T + A+ + E K P + SDG +E L Sbjct: 97 QIELKSIKMGYTTNLHKAITKVLEKFKSSEIPVK----IILLSDGKPTDKRYSRDYESL- 151 Query: 363 KKLLPVVRYYSYIEITRRAHQTLWREYEHLQSTFDNFAMQHIRDQDDIYPV 413 ++ V+ + ++ + + S + + Sbjct: 152 -QVPKNVQLITIGLG-EDYNEAIMKILADKGSGVFYHINDPSQLPTTLVEQ 200 >UniRef50_C0NPX3 von Willebrand domain-containing protein n=5 Tax=Onygenales RepID=C0NPX3_AJECG Length = 1071 Score = 78.8 bits (192), Expect = 4e-13, Method: Composition-based stats. Identities = 22/212 (10%), Positives = 51/212 (24%), Gaps = 24/212 (11%) Query: 223 IERVPFIDTFDLRYKNYEKRPDPSSQAVMFCLMDVSGSMDQSTKDMAKRFYILLYLFLSR 282 + + K P++ + ++D SGSM + + L Sbjct: 265 ETHTTIPNQRAIMATLVPKFNIPNNNPEIVFIIDRSGSMGG-KIQTLQTALRVFLKSLPV 323 Query: 283 TYKNVEVVYIRHHTQAKE-----------VDEHEFFYSQET-GGTIVSSALKLMDEVVKE 330 K + H+ + GGT + ++ E + Sbjct: 324 GVKFNICSFGSSHSFMWKKSQAYDASSLKAALKYVDSVSANLGGTEILGPVRATVERRLK 383 Query: 331 RYNPAQWNIYAAQASDGDNWADDSPLCHEILAKKLLPV--VRYYSYIEITRRAHQTLWRE 388 + + G+ W + K + +R +S + + L Sbjct: 384 DLDLDILLLSD-----GEIWDQN---TLFSYINKAVSDQPIRLFSLGIGSGASQS-LIEG 434 Query: 389 YEHLQSTFDNFAMQHIRDQDDIYPVFRELFHK 420 F F + + + + Sbjct: 435 IARAGDGFAQFVNDNELLDKKVVRMLKGALTP 466 >UniRef50_B5ZN80 von Willebrand factor type A n=8 Tax=Rhizobiales RepID=B5ZN80_RHILW Length = 522 Score = 78.8 bits (192), Expect = 4e-13, Method: Composition-based stats. Identities = 33/269 (12%), Positives = 64/269 (23%), Gaps = 38/269 (14%) Query: 177 RTAMTAGKRRELHALEENLAIIS----NSEPAQLLEEERLRKEIAELRAKIERVPFIDTF 232 G + + LA + A L A+ P Sbjct: 259 FVDHGRGPEAQTFFN-DLLAYLRSASAQQRIADTGRRIPLSGVAAKPEPGWNFDPARLVT 317 Query: 233 DLRYKNYEKRPDP--------SSQAVMFCLMDVSGSMDQSTKDMAKRFYILLYL------ 278 +R E ++ +D SGSM +D ++ L Sbjct: 318 AIRMPEPEVIRQALTLYQAALRKPSLTALCLDFSGSMQGDGEDQLQKAMRFLLTPDEASK 377 Query: 279 ---FLSRTYKNVEVVYIRH--HTQA-------KEVDEHEFFYSQETGGTIVSSALKLMDE 326 S + + + + +T +E +E + GGT + + + Sbjct: 378 VLVQWSPADRIIVIPFDGSVRNTFMASGNPLEQEGLLNEISRQKAGGGTDMYTCAAQALQ 437 Query: 327 VVKERYNPAQWNIYAAQASDGDNWADDSPLCHEILAKKLLPVVRYYSYIEITRRAHQTLW 386 + + + +DG +DD P V + A +T Sbjct: 438 QIARSDRLSTYLPAIVIMTDGR--SDDQSQAFMSEWNATEPHVPV--FGITFGDADKTQL 493 Query: 387 REYEHLQSTFDNFAMQHIRDQDDIYPVFR 415 S + + R Sbjct: 494 DSLAKQTSAR---VFDGGSNLATAFRTAR 519 >UniRef50_B1ZS79 von Willebrand factor type A n=1 Tax=Opitutus terrae PB90-1 RepID=B1ZS79_OPITP Length = 377 Score = 78.8 bits (192), Expect = 4e-13, Method: Composition-based stats. Identities = 26/222 (11%), Positives = 56/222 (25%), Gaps = 43/222 (19%) Query: 237 KNYEKRPDPSSQAVMFCLMDVSGSM-------DQSTKDMAKRFYILLYLFL--SRTYKNV 287 K +KR S + +D+SGSM + + ++ F+ + + Sbjct: 99 KVEDKRDVHSQGYDLMLCIDLSGSMLSEDYERGGDRINRLQAIKPVIQAFIERRPSDRIG 158 Query: 288 EVVYIRHH------TQAKEVDEHEFFYSQET---GGTIVSSALKLMDEVVKERYNPAQWN 338 V++ T + + GT + L + +++ + Sbjct: 159 IVLFSGRAYTMAPLTFDHRWLGSQLERIKVGLIEDGTAIGDGLGVGLTRLEQAQRESGGK 218 Query: 339 IY---AAQASDGDNWADD-SPLCHEILAKKLLPVVRYYSYIE------------------ 376 +DG N +P LAK V + Sbjct: 219 RQGAFVVLLTDGANNRGSLTPQQAAELAKARGIPVYTIGAGQDGIVPFPVFDDKGRKLGY 278 Query: 377 --ITRRAHQTLWREYEHLQSTFDNFAMQHIRDQDDIYPVFRE 416 I + R+ + F + + + Sbjct: 279 RRIMSDLDEGALRDIAEMTGGHF-FRAADVGTVESAFRAIDR 319 >UniRef50_C9Q197 Aerotolerance protein BatB n=11 Tax=Prevotella RepID=C9Q197_9BACT Length = 591 Score = 78.8 bits (192), Expect = 4e-13, Method: Composition-based stats. Identities = 22/209 (10%), Positives = 52/209 (24%), Gaps = 41/209 (19%) Query: 245 PSSQAVMFCLMDVSGSMDQST-----KDMAKRFYILLYLFLSRTYKNVEVVYIRHH---- 295 + +D+S SM + +K L + + VV+ Sbjct: 123 KRNGIEAVIAVDISNSMMAQDVVPSRLEKSKLLIENLVDHFT-HDRIGLVVFAGDAFVQL 181 Query: 296 ------TQAKEVDEHEFFYSQETGGTIVSSALKLMDEVVKERYNPAQWNIYAAQASDGDN 349 AK ++ T GT ++ A+ L ++ +DG++ Sbjct: 182 PITTDYVSAKMFLQNIDPALIATQGTDIAKAINLS---MRSFSQQKDIGKAVIVITDGED 238 Query: 350 WADDSPLCHEILAKKLLPVVRYYSYIEITRRA---------------------HQTLWRE 388 + L A + V ++++ ++ Sbjct: 239 -HEGGALEAAKAANERGIRVFILGIGSTKGSPIPLAEGGYLADRSGQTVLTALNESMCKQ 297 Query: 389 YEHLQSTFDNFAMQHIRDQDDIYPVFREL 417 + Q+ + +L Sbjct: 298 IAQAGNGTYIHVDNTNDAQEKLNNELAKL 326 >UniRef50_UPI0000F1FEC5 PREDICTED: similar to Clca1 protein n=2 Tax=Danio rerio RepID=UPI0000F1FEC5 Length = 903 Score = 78.8 bits (192), Expect = 4e-13, Method: Composition-based stats. Identities = 24/186 (12%), Positives = 53/186 (28%), Gaps = 20/186 (10%) Query: 244 DPSSQAVMFCLMDVSGSMDQSTK--DMAKRFYILLYLFLSRTYKNVEVVYIRHHTQAKEV 301 + + ++DVSGSM ++ M + LL ++ V + + + Sbjct: 293 LQRKKRAVCLILDVSGSMATESRILRMRQAATHLLRNYVEEQASVGIVKFSTAASIVSSL 352 Query: 302 D----EHEFFYS------QETGGTIVSSALKLMDEVVKERYNPAQWNIYAAQASDGDNWA 351 + + G T + + L+L +V+ E + +DG Sbjct: 353 TIIESDATRDHLINLLPETPGGSTNMCNGLRLGLQVLSE-DDMDAIGDEIIFLTDGQATD 411 Query: 352 DDSPLCHEILAKKLLPVVRYYSYIEITRRAHQTLWREYEHLQSTFDNFAMQHIRDQDDIY 411 D LC ++ + +E ++ + + Sbjct: 412 -DVTLCIPDAINSGAI---IHTIALSDSAHNA--LQEMADKTGGIFFYSKDDFTS-NQLM 464 Query: 412 PVFREL 417 F L Sbjct: 465 DAFASL 470 >UniRef50_Q97LT1 DnaK protein (Heat shock protein), C-terminal region has VWA type A domain n=1 Tax=Clostridium acetobutylicum RepID=Q97LT1_CLOAB Length = 698 Score = 78.8 bits (192), Expect = 4e-13, Method: Composition-based stats. Identities = 24/258 (9%), Positives = 70/258 (27%), Gaps = 20/258 (7%) Query: 172 NSLARRTAMTAGKRRELHALEENLAIISNSEPAQLLEEERLRKEIAELRAKIERVPFIDT 231 ++ T + ++ + +II + E K+ + Sbjct: 435 EEISDCTILGKYVFYDIEYVGRKPSIIDIEYSYNKNGVVDVFATQRETAKKLPLKIEKLS 494 Query: 232 FDLRYKNYEKRPDPSSQAVMFCLMDVSGSMDQSTKDMAKRFYILLYLFLSRT-YKNVEVV 290 D ++ + + + +D+SGSM + A + + + Sbjct: 495 EDFIFEEELNEKEEAVHKNIVIAIDLSGSMRGKPLEEAIEASKTFVDSIDEGSFSLALIG 554 Query: 291 YIR------HHTQAKEVDEHEFFYS-QETGGTI-VSSALKLMDEVVKERYNPAQWNIYAA 342 + + T+ +E + GT +S ++K+ Y + Sbjct: 555 FADKVKTLINLTEDREEIFRAIDGLKKADVGTSTMSEPFSEAYNILKDAYGD----CFVV 610 Query: 343 QASDGDNWADDSPLCHEILAKKLLPVVRYYSYIEITRRAHQTLWREYEHLQSTFDNFAMQ 402 +DG + + K+ + + A + + +N Sbjct: 611 VLTDGQWYGKKDIMAEVNKCKEYEIEIAAIGFG----NAKKDFLDKIATC---EENSIFT 663 Query: 403 HIRDQDDIYPVFRELFHK 420 + + + ++ + Sbjct: 664 EVSNLKQSFSRIAKVISR 681 >UniRef50_UPI0001760236 PREDICTED: similar to mCG140660 n=1 Tax=Danio rerio RepID=UPI0001760236 Length = 1753 Score = 78.8 bits (192), Expect = 4e-13, Method: Composition-based stats. Identities = 27/194 (13%), Positives = 53/194 (27%), Gaps = 24/194 (12%) Query: 249 AVMFCLMDVSGSMDQSTKDMAKRFYILLYLFL---SRTYKNVEVV--------YIRHHTQ 297 A + L+D S S+ ++F L + V + + Q Sbjct: 31 ADIVFLVDGSASIGLDNFQQIRQFLSSLVENFEVAPDKVRIGLVQYSDTPRTEFSLNTYQ 90 Query: 298 AKEVDEHEFFYSQE-TGGTIVSSALKLMDEVV----KERYNPAQWNIYAAQASDGDNWAD 352 KE + TGGT L+ + + A +DG Sbjct: 91 NKEEILDYIRNLRYKTGGTHTGQGLEFILKQHFIEEAGSRAQQNVPQIAIVITDG----- 145 Query: 353 DSPLCHEILAKKLL-PVVRYYSYIEITRRAHQTLWREYEHLQSTFDNFAMQHIRDQDDIY 411 DS ++ A++L ++ ++ A L R+ + +++ I Sbjct: 146 DSQDEVDLQAQELRQRGIKIFAIGIK--DADVRLLRQIANEPYDQYVYSVSDFAALQGIS 203 Query: 412 PVFRELFHKQNATA 425 Sbjct: 204 QSVVRELCTSVKDV 217 Score = 74.5 bits (181), Expect = 7e-12, Method: Composition-based stats. Identities = 27/189 (14%), Positives = 54/189 (28%), Gaps = 21/189 (11%) Query: 244 DPSSQAVMFCLMDVSGSMDQSTKDMAKRFYILLYLFL---SRTYKNVEVVYIRHHTQAKE 300 +++A ++ L+D SGS+ + K+F + + V + T Sbjct: 812 KQTAKADIYFLLDESGSISYPDFEDMKKFIMECLDVFQIGKDHVRIGVVKFASKATTVFR 871 Query: 301 V--------DEHEFFYSQE-TGGTIVSSALKLMDEVVKERYNPAQWNI--YAAQASDGDN 349 + E + GGT L+ M + +E +DG++ Sbjct: 872 LHDYSTKSDVEKAVKDLEMYGGGTRTDLGLRQMIPLFREAVQTRGEKARELLIVITDGES 931 Query: 350 WADDSPLCHEILAKKLLPVVRYYSYIEITRRAHQTLWREYEHLQSTFDNFAMQHIRDQDD 409 P+ + V Y+ A D+ + + D Sbjct: 932 TGTVEPVEVPAKHLRAEQNVSIYAIGCEGLLADVVFLI------DGSDSVSAEDFEKMKD 985 Query: 410 IYP-VFREL 417 I V + Sbjct: 986 IMEYVIEKF 994 Score = 68.8 bits (166), Expect = 4e-10, Method: Composition-based stats. Identities = 25/182 (13%), Positives = 56/182 (30%), Gaps = 19/182 (10%) Query: 246 SSQAVMFCLMDVSGSMDQSTKDMAKRFYILLYLFLSRTYKNVEVVYIRHHTQA------- 298 ++QA + L+D SGS+ + + K+F + V + + + Sbjct: 227 TAQADIVLLVDSSGSIGDNDFEEVKKFLHAFVDRFNLRPDLVRLGLAQFSDRPYQEFLLG 286 Query: 299 -----KEVDEHEFFYSQETGGTIVSSALKLMDEVVKERYNPAQWNIYAAQASDGDNWADD 353 K++ + GGT AL + E P A +DG+ + D Sbjct: 287 DYADKKDLHQKLNNLIYRKGGTQTGQALTFIRENYFSLARPN-VPGIAIVITDGE--SRD 343 Query: 354 SPLCHEILAKKLLPVVRYYSYIEITRRAHQTLWREYEHLQSTFDNFAMQHIRDQDDIYPV 413 + + + + R ++ F++ + ++ + Sbjct: 344 DVEEPAQRLRNTGVSLFVIRVGKG----NMEKLRAIANIPHEEFLFSINNYQELQGLKES 399 Query: 414 FR 415 R Sbjct: 400 LR 401 Score = 68.8 bits (166), Expect = 4e-10, Method: Composition-based stats. Identities = 21/193 (10%), Positives = 49/193 (25%), Gaps = 22/193 (11%) Query: 249 AVMFCLMDVSGSMDQSTKDMAKRFYILLYLFLSRTYKNVEVVYIRHHTQ----------- 297 A + ++D S S+ + + F L +++ I + Sbjct: 611 ADIAFIVDQSSSIKSRNFQLVRDFLENTIGRLDVGKDKIQIAVILYSDFPRADVYLNTFS 670 Query: 298 AKEVDEHEFFYSQET-GGTIVSSALKLMDEV----VKERYNPAQWNIYAAQASDGDNWAD 352 K G T +AL+ E + A +DG + Sbjct: 671 NKNDILRYINTLPYGRGKTYTGAALRFAKEHVFTKARGSRRDKYVQQVAVVITDGKSTD- 729 Query: 353 DSPLCHEILAKKLLPVVRYYSYIEITRRAHQTLWREYEHLQSTFDNFAMQHIRDQDDIYP 412 A+ V ++ + RE +++ + + Sbjct: 730 ---DAASAAAELRRSGVSIFALGIKDTKEDD--LREIASYPPKKFVLNVENFDQLNSLAG 784 Query: 413 VFRELFHKQNATA 425 + + + + A Sbjct: 785 ILTKTLCNEISDA 797 Score = 68.0 bits (164), Expect = 7e-10, Method: Composition-based stats. Identities = 23/157 (14%), Positives = 41/157 (26%), Gaps = 19/157 (12%) Query: 249 AVMFCLMDVSGSMDQSTKDMAKRFYILLYLFLSRTYKNVEVVYIRHHTQAKEVD------ 302 A + L+D S S+ + D+ K+ I + L V + Sbjct: 1230 ADLVFLIDGSESIKPPSWDILKQTMIGIVKELDIAKDKWRVGVAQFSDILLHQFYLNTYT 1289 Query: 303 -----EHEFFYSQET-GGTIVSSALKLMDEVVKE---RYNPAQWNIYAAQASDGDNWADD 353 E ++ GT ALKL+ + +DG+ + Sbjct: 1290 SFAEVEEAINNIKQRKQGTNTWDALKLIKYYFTKENGSRIEGGVAQNLLLITDGEANDEK 1349 Query: 354 SPLCHEILAKKLLPVVRYYSYIEITRRAHQTLWREYE 390 L K + ++ RE Sbjct: 1350 DLNALADL-KNKKIAITVIGIGN---EIKKSELREIA 1382 Score = 45.3 bits (105), Expect = 0.005, Method: Composition-based stats. Identities = 19/166 (11%), Positives = 45/166 (27%), Gaps = 29/166 (17%) Query: 249 AVMFCLMDVSGSMDQSTKDMAKRFYILLYLFLSRTYKNVEVVYIRHHTQAKEVDEHEFFY 308 A + L+D S S+ + K + + + V +++ Sbjct: 963 ADVVFLIDGSDSVSAEDFEKMKDIMEYVIEKFAIGSEKERVAVVQY-------------- 1008 Query: 309 SQETGGTIVSSALKLMDEVVKERYNPAQWNIYAAQASDGDNWADDSPLCHEILAKKLLPV 368 GT + + + Q Q +DG+ + D + Sbjct: 1009 -----GTNPNE--EFSLNAFDNKDRLLQEIRNIRQVTDGE--SRDDVALPAKALRDN--S 1057 Query: 369 VRYYSYIEITRRAHQTLWREYEHLQSTFDNFAMQHIRDQDDIYPVF 414 + Y+ R A+++ S + F + ++ Sbjct: 1058 INTYAIGL--RHANRSQI--LAIAGSHGEVFYEDAVASLKELSSEV 1099 >UniRef50_C8W898 Cna B domain protein n=1 Tax=Atopobium parvulum DSM 20469 RepID=C8W898_ATOPD Length = 863 Score = 78.8 bits (192), Expect = 4e-13, Method: Composition-based stats. Identities = 30/271 (11%), Positives = 68/271 (25%), Gaps = 41/271 (15%) Query: 189 HALEENLAIISNSEPAQLLEEERLRKEIAELRAKIERVPFIDTFDLRYKNYEKRPDPSSQ 248 + + +A + + + + T ++ + + + Sbjct: 31 AFVGQFVAGARVAYAEDPQINTP--PVHKKTISPNTDGTYDLTLTIKGETSAASEEQKAN 88 Query: 249 AVMFCLMDVSGSM------DQSTKDMAKRFYILLYLFL-----SRTYKNVEVVYIRHH-- 295 ++ D S SM + D AKR L + + VE+ + + Sbjct: 89 VLVVF--DNSSSMTAQTGGGEMRLDAAKRVVNQLSSTILGINRNAQKDVVEMALLSFNEK 146 Query: 296 -------TQAKEVDEHEFFYSQETGGTIVSSALKLMDEVVKERYNPAQWNIYAAQASDGD 348 T + GT SAL+ +V+ ++ Y +DG Sbjct: 147 PNLECGWTADLNEFQRATNNMGFHTGTNWESALERA-KVLADQKAANGNPTYVIFVTDGL 205 Query: 349 NWADDS---------PLCHEILAKKLL-PVVRYYSYIEITRRAHQTLWREYEHLQ----- 393 D + A+ + +YS A+ Y + Sbjct: 206 PTQDRNGWVRNNQIGYEHALDEARAIGSAGYHFYSVYMYGGHAYLRQLTNYAYTGNPFGN 265 Query: 394 -STFDNFAMQHIRDQDDIYPVFRELFHKQNA 423 + + + + + K Sbjct: 266 PGGTYYYEANNTAQMEQAFKEIASVITKSIT 296 >UniRef50_UPI0001745E24 hypothetical protein VspiD_17015 n=1 Tax=Verrucomicrobium spinosum DSM 4136 RepID=UPI0001745E24 Length = 339 Score = 78.4 bits (191), Expect = 4e-13, Method: Composition-based stats. Identities = 30/222 (13%), Positives = 54/222 (24%), Gaps = 41/222 (18%) Query: 237 KNYEKRPDPSSQAVMFCLMDVSGSM----------DQSTKDMAKRFYILLYLFLSRTYKN 286 K S + DVS SM AKR + + Sbjct: 82 KVISYDELKSEGIGIVVAFDVSLSMRIRDFYIGNRQVDRMTAAKRVLVDFIKG-RPNDRI 140 Query: 287 VEVVYIRHH------TQAKEVDEHEFFYSQET---GGTIVSSALKLMDEVVKERYNPAQW 337 V + T + + Q GT + S + + + Sbjct: 141 GIVAFGGAPYNPCPPTLDHDWLLNNMDRIQTGIMEDGTAIGSGIAAAARRLDQLEVK--- 197 Query: 338 NIYAAQASDGDNWADD-SPLCHEILAKKLLPVVRYYSYIE----------------ITRR 380 + +DG N + SP LA L + S + Sbjct: 198 SKVILLMTDGANNSGKLSPQDAARLAATLGIRIHAISIGTPGMHPIYMPNGPPINSGRQE 257 Query: 381 AHQTLWREYEHLQSTFDNFAMQHIRDQDDIYPVFRELFHKQN 422 +E ++ S F + + + I+ E+ + Sbjct: 258 FDPETLQEVANIGSGSF-FRAEDLSTLERIFKTVDEMERTEI 298 >UniRef50_Q7MUE5 BatB protein n=4 Tax=Bacteria RepID=Q7MUE5_PORGI Length = 339 Score = 78.4 bits (191), Expect = 4e-13, Method: Composition-based stats. Identities = 26/220 (11%), Positives = 53/220 (24%), Gaps = 40/220 (18%) Query: 233 DLRYKNYEKRPDPSSQAVMFCLMDVSGSM-----DQSTKDMAKRFYILLYLFLSRTYKNV 287 + P +D+S SM + AK+ L+ L K Sbjct: 75 RPQISIRVDVPKEEKGIEAMICLDISNSMLCEDVKPNRLSFAKQVLGKLFDGLQ-NDKVG 133 Query: 288 EVVYIRHH----------TQAKEVDEHEFFYSQETGGTIVSSALKLMDEVVKERYNPAQW 337 VV+ + + AK+ GT + +A++L + + + Sbjct: 134 LVVFAGNAYTQIPITTDLSAAKQFLADISPNMVTAQGTAIGAAIELASKSF---SDNKEI 190 Query: 338 NIYAAQASDGDNWADDSPLCHEILAKKLLPVVRYYSYIEITRR----------------- 380 +DG+N + + + A + V Sbjct: 191 GKTIIVLTDGEN-HEGNAIEAAQQAHEAGIRVNVIGLGTALGAPIPIEEGYLKDETGNPV 249 Query: 381 ---AHQTLWREYEHLQSTFDNFAMQHIRDQDDIYPVFREL 417 + + R+ I +L Sbjct: 250 VTKFDEKMCRDIASAGEGTFFSGQSASALVRAIESQLDKL 289 >UniRef50_Q73NA5 BatA protein, putative n=1 Tax=Treponema denticola RepID=Q73NA5_TREDE Length = 332 Score = 78.4 bits (191), Expect = 4e-13, Method: Composition-based stats. Identities = 19/206 (9%), Positives = 45/206 (21%), Gaps = 41/206 (19%) Query: 247 SQAVMFCLMDVSGSMDQSTK------DMAKRFYILLYLFLSRTYKNVEVVYIRH------ 294 + + + L+D+S SM AK+ Sbjct: 88 AGSSIMFLLDISPSMAAKDMSGETRIAAAKKIIRKFVAKY-PGDSFGLTALSSSAALILP 146 Query: 295 HTQAKEVDEHEFFYSQ---ETGGTIVSSALKLMDEVVKERYNPAQWNIYAAQASDGDN-W 350 T +V GT + L + + + Y +DG+N Sbjct: 147 PTIDHKVFLSRLDSLSIGELGDGTAIGMGLAVSSAYMTRTKL---NSSYIVLLTDGENNT 203 Query: 351 ADDSPLCHEILAKKLLPVVRYYSYIE--------------------ITRRAHQTLWREYE 390 + +P + I + + ++ Sbjct: 204 GEINPKTAAKVLVNKNIGFYVIGIGSSGYTTLEYTDRKTGKTYSGSIFSKFDELELKKIA 263 Query: 391 HLQSTFDNFAMQHIRDQDDIYPVFRE 416 + + ++I+ + Sbjct: 264 QYGNGKYA-SASSPEILENIFNTISK 288 >UniRef50_C1F3F5 von Willebrand factor type A domain protein n=1 Tax=Acidobacterium capsulatum ATCC 51196 RepID=C1F3F5_ACIC5 Length = 313 Score = 78.4 bits (191), Expect = 4e-13, Method: Composition-based stats. Identities = 21/194 (10%), Positives = 48/194 (24%), Gaps = 20/194 (10%) Query: 241 KRPDPSSQAVMFCLMDVSGSMDQSTKDMAKRFYILLYLFLSRTYKNVEVVYIRHHTQAKE 300 + +D SGS+ + + + L L + V + + Sbjct: 63 LERQTGLPLSIVLAIDTSGSVRKDLDEEKRAAREFLRATLRPEDRVEIVNFNTRVHEVVP 122 Query: 301 VDEHEF------FYSQETGGTIVSSALKLMDEVVKERYNPAQWNIYAAQASDGDNWADDS 354 + E T + +A+ E + +R SDGDN +S Sbjct: 123 FTNNLKKIDRGLNRLSEGPATALYAAIAYGSEELAQRPG----RKVLVVISDGDNTVANS 178 Query: 355 PLCHEILAKKLLPVVRYYSYIE-------ITRRAHQTLWREYEHLQSTFDNFAMQHIRDQ 407 + L + + +S I+ + + + Sbjct: 179 SYQ-QALDRAVRAETMIFSVIDLPVINDAGRDVGGEHAMIALSEATGGEYYYEAD--GNL 235 Query: 408 DDIYPVFRELFHKQ 421 ++ + Sbjct: 236 QGVFKRLSTALRTE 249 >UniRef50_B5HZU2 VWA domain-containing protein n=1 Tax=Streptomyces sviceus ATCC 29083 RepID=B5HZU2_9ACTO Length = 518 Score = 78.4 bits (191), Expect = 5e-13, Method: Composition-based stats. Identities = 27/266 (10%), Positives = 65/266 (24%), Gaps = 24/266 (9%) Query: 145 LTEYKTHRAGYTANGVPANISVVRSLQNSLARRTAMTAGKRRELHALEENLAIISNSEPA 204 LT + TA+ ++ + RR + R + + Sbjct: 243 LTVIRPRDGVITADYPLTSLRSTSARTREDVRRVSEDLRTERIQREITA------RTHRR 296 Query: 205 QLLEEERLRKEIAELRAKIERVPFIDTFDLRYKNYEKRPDPSSQAVMFCLMDVSGSMDQS 264 ++ + R + P + + ++D SGSM+ Sbjct: 297 PVVASVPPASGLDTTRRRELPFPG-TRSVADGLLDSYENELRRPSRTVYVLDTSGSMEGD 355 Query: 265 TKDMAKRFYILLYLFLSRTYKNVEVVYIRH-------------HTQAKEVDEHEFFYSQE 311 D K L + + + + + Sbjct: 356 RLDRLKTALADLTGDFREREEVTLMPFGSQVKSVRTHVVKPSDPRAGLDAIRDDTSALSA 415 Query: 312 TGGTIVSSALKLMDEVVKERYNPAQWNIYAAQASDGDNWADDSPLCHEILAKKLLPVVR- 370 G T + ++L+ + + + +DG+N A + +L R Sbjct: 416 DGDTAIYTSLEKAYDHLGAGRDAFTS---IVLMTDGENTAGAKARDFDAFYARLGRKARD 472 Query: 371 YYSYIEITRRAHQTLWREYEHLQSTF 396 + + + ++ L Sbjct: 473 TPVFPILFGDSDRSELAHIADLTGGR 498 >UniRef50_A6M139 von Willebrand factor, type A n=1 Tax=Clostridium beijerinckii NCIMB 8052 RepID=A6M139_CLOB8 Length = 962 Score = 78.4 bits (191), Expect = 5e-13, Method: Composition-based stats. Identities = 22/191 (11%), Positives = 47/191 (24%), Gaps = 16/191 (8%) Query: 175 ARRTAMTAGKRRELHALEENLAIISNSEPAQLLEEERLRKEIAELRAKIERVPFIDTFDL 234 R+ L + ++ A + + + I + Sbjct: 7 KRKFLKKVSLIVSLMLILVSINSSVFRVKADIASKPQFTVTID-SYTPKNPKLGEEITIN 65 Query: 235 RYKNYEKRPDPSSQAVMFCLMDVSGSM-DQSTKDMAKRFYILLYLFLS--RTYKNVEVVY 291 + + + ++D SGSM D K+ +S + K V + Sbjct: 66 GTIHPQPFKISIPPKEIVLVLDSSGSMADNYKLTNLKKAATDFITKMSTVKNLKIAIVDF 125 Query: 292 IRHHTQAKEVD-----------EHEFFYSQETGGTIVSSALKLMDEVVKERYNPAQ-WNI 339 T ++ + GGT L+ ++ + Sbjct: 126 DTQATIINKLTDVSSSTNVTALKRSINNLTAGGGTNTGEGLRQAAYLLSNSSEQNPLASK 185 Query: 340 YAAQASDGDNW 350 SDG+ Sbjct: 186 NIIFMSDGEPT 196 >UniRef50_Q4V1D8 Putative uncharacterized protein dadA n=8 Tax=Bacillus cereus group RepID=Q4V1D8_BACCZ Length = 452 Score = 78.4 bits (191), Expect = 5e-13, Method: Composition-based stats. Identities = 38/329 (11%), Positives = 74/329 (22%), Gaps = 41/329 (12%) Query: 119 ISKDEYLDLLFEDLALPNLKQNQQRQLTEYKTHRAGYTANGVPANISVVRSLQNSLARRT 178 K E ++ L L G + R+L + Sbjct: 34 EPKSEVVEKETSKKELEKEDTKTYESLFATNAPLMTQEGPG---KYAKERALSSEEMNEL 90 Query: 179 AMTAGKRRELHALEENLAIISNSEPAQLLEEERLRKEIAELRAKIERVPFIDTFDLRYKN 238 K +S + L + + + +P + + N Sbjct: 91 NQELIKLP---------KDLSAEQIYVQLLNLQAQSYKEAVSKLERIIPELAIKEQNNDN 141 Query: 239 YEKRPDPSSQAVMFCLMDVSGSMDQS-----TKDMAKRFYILLYLFLSRTYKN------- 286 + L+D SGSM + AK+ + Sbjct: 142 VHSIKPKEKSLNVEILLDASGSMAGKVNGQVKMEAAKKAIYNYLDKIPDNANVMLRVYGH 201 Query: 287 ---------------VEVVYIRHHTQAKEVDEHEFFYSQETGGTIVSSALKLMDEVVKER 331 EV+Y KE G T ++SA++ +++ KE Sbjct: 202 KGSNNENDKSLSCGSSEVMYPLQP-YKKEQFNAALSKFGPKGWTPLASAIESVNDDFKEY 260 Query: 332 YNPAQWNIYAAQASDGDNWADDSPLCHEILAKKLLPVVRYYSYIEITRRAHQTLWREYEH 391 N SDG+ P+ + + + Q Sbjct: 261 TGEENLN-VVYIVSDGEETCGGDPVNAAKNLNQSSTHAVVNIIGFDVKNSEQQQLMNTAE 319 Query: 392 LQSTFDNFAMQHIRDQDDIYPVFRELFHK 420 + + +L+ + Sbjct: 320 AGKGNYATVSNADELYQTLNTEYEKLYKE 348 >UniRef50_UPI00006A1CCF polydom n=1 Tax=Xenopus (Silurana) tropicalis RepID=UPI00006A1CCF Length = 376 Score = 78.4 bits (191), Expect = 5e-13, Method: Composition-based stats. Identities = 22/189 (11%), Positives = 47/189 (24%), Gaps = 26/189 (13%) Query: 245 PSSQAVMFCLMDVSGSMDQSTKDMAKRFYILLYLFLS---RTYKNVEVVY---------- 291 S + L+D S S+ S RF L + + + Sbjct: 1 KSLSLDLVFLVDESSSVGHSNFVNELRFVKKLLSDFPVVPSATRVAIITFSSKTNVQTRV 60 Query: 292 --IRHHTQAKEVDEHEFFYSQE----TGGTIVSSALKLMDEVVKERYNPAQWNIYAAQAS 345 I + GGT A + +++ RY+ + + Sbjct: 61 DYISSSEPHQHKCSLLNREIPAITYKGGGTFTKGAFQQAAQIL--RYSRSNSTKVIFLIT 118 Query: 346 DGDNWADDSPLCHEILAKKLLPVVRYYSYIEITRRAHQTLWREYEHLQSTFDNFAMQHIR 405 DG + D P + L V ++ + + + + Sbjct: 119 DGYSNGGD-PRPIAANLRDL--GVEIFTVGIWQGNIRE--LHDMASHPKEEHCYLLHSFA 173 Query: 406 DQDDIYPVF 414 + + + Sbjct: 174 EFEALARRA 182 >UniRef50_B9ZQD1 von Willebrand factor type A n=1 Tax=Thioalkalivibrio sp. K90mix RepID=B9ZQD1_9GAMM Length = 615 Score = 78.4 bits (191), Expect = 5e-13, Method: Composition-based stats. Identities = 44/355 (12%), Positives = 97/355 (27%), Gaps = 34/355 (9%) Query: 40 KRSVTDVDSGESVSIPTEDISEPMFHQGRGGLRHRVHPGNDHFVQNDRIERPQGGGGGSG 99 + D G+ E GG D + GG +G Sbjct: 236 DEAGGDEAGGDEAGGDEAGGDEAG-GDEAGGDEAGGDEAGGDEAGGDEAGGDEAGGDEAG 294 Query: 100 S---GQGQASQDGEGQDEF----VFQISKDEYLDLLFEDLALPNLKQNQQRQLTEYKTHR 152 G +A D G DE + D+ D + E + + + +++++ + Sbjct: 295 GDEAGGDEAGGDEAGGDEAWSTRPTEQMLDQEADGMEETFDMGDALKESIQEVSQKEEQE 354 Query: 153 AGYTANGVPANISVVRSLQNSLARRTAMTAGKRRELHALEENLAIISNSEPAQLLEEERL 212 G +G+ R AG + S + + + Sbjct: 355 KGRYTDGLCPFTFCGEMPVQGSGDRLVQQAGIASAALRTRLASQLQSVNRERRWASRKGS 414 Query: 213 RKEIAELRAKIERVPFIDTFDLRYKNYEKRPDPSSQAVMFCLMDVSGSMDQSTKDMAKRF 272 + L + + +++ + + L+D SGSM + A Sbjct: 415 KLSSRHLSRVVTGDHRVFG--------KRQESGTPNTAVQILVDRSGSMAGDPIETAMTA 466 Query: 273 YILL-------------YLFLSRTYKNVEVVYIRHHTQAKEVDEHEFFYSQETGGTIVSS 319 + + + V + ++ F TG T +S+ Sbjct: 467 ALAIQLATDSLRGINTQVSAFPASSSGGLVPITSFGENGRMKADN--FGVGSTGATPMSN 524 Query: 320 ALKLMDEVVKERYNPAQWNIYAAQASDGDNWADDSPLCHEILAKKLLPVVRYYSY 374 A+ V+ + ++ +DG +S + +A+ + + Sbjct: 525 AI---LGVLPSMFARSESRKVMLVITDGAPNDSESAMEAIRMARDVNVEMYAIGI 576 >UniRef50_A8TX70 Collagen alpha-5(VI) chain n=18 Tax=Eutheria RepID=CO6A5_HUMAN Length = 2615 Score = 78.0 bits (190), Expect = 6e-13, Method: Composition-based stats. Identities = 23/197 (11%), Positives = 58/197 (29%), Gaps = 21/197 (10%) Query: 244 DPSSQAVMFCLMDVSGSMDQSTKDMAKRFYILLYL------------FLSRTYKNVEVVY 291 + + ++D SGS+ + +D I L L + + + Y Sbjct: 808 KRITLLDVVFVLDHSGSIKKQYQDHMINLTIHLVKKADVGRDRVQFGALKYSDQPNILFY 867 Query: 292 IRHHTQAKEVDEHEFFYSQETGGTIVSSALKLMDEVVKE---RYNPAQWNIYAAQASDGD 348 + ++ + E+ G T + ALK + + E +DG+ Sbjct: 868 LNTYSNRSAIIENLRKRRDTGGNTYTAKALKHANALFTEEHGSRIKQNVKQMLIVITDGE 927 Query: 349 NWADDSPLCHEILAKKLLPVVRYYSYIEITRRAHQTLWREYEHLQSTFDNFAMQHIRDQD 408 + D + + ++ A+Q + + + + Sbjct: 928 SHDHDQLNDTA--LELRNKGITIFAVGVGK--ANQKELEGMA--GNKNNTIYVDNFDKLK 981 Query: 409 DIYPVFRELFHKQNATA 425 D++ + +E + Sbjct: 982 DVFTLVQERMCTEAPEV 998 Score = 75.3 bits (183), Expect = 4e-12, Method: Composition-based stats. Identities = 37/259 (14%), Positives = 82/259 (31%), Gaps = 28/259 (10%) Query: 185 RRELHALEENLAIISNSEPAQLLEEERLRKEIAELRAKIERVP--FIDTFDLRYKNYEKR 242 LEE ++ + L L + K++ I T+ + + Sbjct: 375 GANNTQLEEIVSYPPEQTISTLKSYADLETYSTKFLKKLQNEIWSQISTYAEQRNLDKTG 434 Query: 243 PDPSSQAVMFCLMDVSGSMDQSTKDMAKRFYILLYLFLSRTYKNVEVVYIRHH------- 295 + +A + L+D S S+ + + KRF + + S V V +++ Sbjct: 435 CVDTKEADIHFLIDGSSSIQEKQFEQIKRFMLEVTEMFSIGPDKVRVGVVQYSDDTEVEF 494 Query: 296 -----TQAKEVDEHEFFYSQETGGTIVSSALKLMDEVVKE--RYNPAQWNIYAAQASDGD 348 + ++ + F Q TGGT AL + +++K + ++ Y +DG Sbjct: 495 YITDYSNDIDLRKAIFNIKQLTGGTYTGKALDYILQIIKNGMKDRMSKVPCYLIVLTDGM 554 Query: 349 NWADDSPLCHEILAKKLLPVVRYYSYIEITRRAHQTLWREYEHLQSTFDNFAMQHIRDQD 408 + E + + ++ A++ +E + Sbjct: 555 STD----RVVEPAKRLRAEQITVHAVGIG--AANKIELQEIA----GKEERV--SFGQNF 602 Query: 409 DIYPVFRELFHKQNATAKG 427 D + ++ KG Sbjct: 603 DALKSIKNEVVREICAEKG 621 Score = 74.2 bits (180), Expect = 9e-12, Method: Composition-based stats. Identities = 41/327 (12%), Positives = 86/327 (26%), Gaps = 40/327 (12%) Query: 112 QDEFVFQISKDEYLDLLFEDLALPNLKQ-NQQRQLTEYKTHRAGYTANGVPANISVVRSL 170 D+ + +Y + + A+ N+KQ + + NG+ +S V Sbjct: 487 SDDTEVEFYITDYSNDIDLRKAIFNIKQLTGGTYTGKALDYILQIIKNGMKDRMSKVPCY 546 Query: 171 QNSLARRTAMTAGKRRELHALEENL----------AIISNSEPAQLLEEERLRKEIAELR 220 L + E + I E A E + L+ Sbjct: 547 LIVLTDGMSTDRVVEPAKRLRAEQITVHAVGIGAANKIELQEIAGKEERVSFGQNFDALK 606 Query: 221 AKIERVPFIDTFDLRYKNYEKRPDPSSQAVMFCLMDVSGSMDQSTKDMAKRFYILLYLFL 280 + V ++ +A + L+D S S+ K F L + Sbjct: 607 SIKNEVVREIC--------AEKGCEDMKADIMFLVDSSWSIGNENFRKMKIFMKNLLTKI 658 Query: 281 SRTYKNVEVVYIRHHTQAKEVDE-----------HEFFYSQE-TGGTIVSSALKLMDEVV 328 ++ ++ + KE + GT+ AL + + Sbjct: 659 QIGADKTQIGVVQFSDKTKEEFQLNRYFTQQEISDAIDRMSLINEGTLTGKALNFVGQYF 718 Query: 329 KERYNPA-QWNIYAAQASDGDNWADDSPLCHEILAKKLLPVVRYYSYIEITRRAHQTLWR 387 + +DG A D + + V +S A+++ Sbjct: 719 THSKGARLGAKKFLILITDG--VAQDDVRDPARILR--GKDVTIFSVGVY--NANRSQLE 772 Query: 388 EYEHLQSTFDNFAMQHIRDQDDIYPVF 414 E + F +++ + Sbjct: 773 EIS--GDSSLVFHVENFDHLKALERKL 797 Score = 56.4 bits (134), Expect = 2e-06, Method: Composition-based stats. Identities = 46/398 (11%), Positives = 99/398 (24%), Gaps = 40/398 (10%) Query: 40 KRSVTDVDSGESVSIPTEDISEPMFHQGRGGLRHRVHPGNDHFVQNDRIERPQGGGGGSG 99 +R + V PT + +G + G + G G G Sbjct: 1559 QRGLRGVSGEPGNPGPTGTLGAEGLQGPQGSQGNPGRKGEKGSQGQKGPQGSPGLMGAKG 1618 Query: 100 SGQGQA--SQDGEGQDEFVFQISKDEYLDLLFEDLALPNLKQNQQRQLTEYKTHRAGYTA 157 S + GE D +P Q ++ + + Sbjct: 1619 STGRPGLLGKKGEPGLPGDLGPVGQTGQRGRQGDSGIPGYGQMGRKGVKGPRGFPGDAGQ 1678 Query: 158 NGVPANISVVRSLQNSLARRTAMTAGKRRELHALEENLAIISNSEPAQLLEEERLRKEIA 217 G N + R A+T G + E S P + Sbjct: 1679 KGDIGNPGIPGGPGPKGFRGLALTVGLKGEEG---------SRGLPGPPGQRGIKGMAGQ 1729 Query: 218 ELRAKIERVPFIDTFDLRYKNYEKRPDPSSQAVMFCLMDVSGSMDQSTKDMAKRFYILLY 277 + ++ + + F+ K P+ + +D S + + + + + + Sbjct: 1730 PVYSQCDLIRFLREHSP----CWKEKCPAYPTELVFALDNSYDVTEESFNKTRDIITSIV 1785 Query: 278 LFLSR-------TYKNVEVVYIRHHTQA--------KEVDEHEFFYSQETGGT---IVSS 319 L+ + V Y + K+ + + T V + Sbjct: 1786 NDLNIRENNCPVGARVAMVSYNSGTSYLIRWSDYNRKKQLLQQLSQIKYQDTTEPRDVGN 1845 Query: 320 ALKLM-DEVVKERYNPAQWNIYAAQASDGDNWADDSPLCHEILAKKLLPVVRYYSYIEIT 378 A++ + V K Y A A S+G + S + + L +++ Sbjct: 1846 AMRFVTRNVFKRTYAGANVRRVAVFFSNGQTASRSSIITATMEFSALDISPTVFAF---- 1901 Query: 379 RRAHQTLWREYEHLQSTFDNFAMQHIRDQDDIYPVFRE 416 + + + ++ R Sbjct: 1902 --DERVFLEAFGFDNTGTFQVIPVPPNGENQTLERLRR 1937 Score = 55.7 bits (132), Expect = 4e-06, Method: Composition-based stats. Identities = 22/221 (9%), Positives = 47/221 (21%), Gaps = 37/221 (16%) Query: 198 ISNSEPAQLLEEERLRKEIAELRAKIERVPFIDTFDLRYKNYEKRPDPSSQAVMFCLMDV 257 I+ + + E L K E + L+D Sbjct: 2247 INYEKDQKSAEIASLTSGHENYGRKEEPDH--------TYEPGDVSLQEYYMDVAFLIDA 2298 Query: 258 SGSMDQSTKDMAKRFYILLYLFLSR---------TYKNVEVVYIR--------------- 293 S + K F + + + + Y Sbjct: 2299 SQRVGSDEFKEVKAFITSVLDYFHIAPTPLTSTLGDRVAVLSYSPPGYMPNTEECPVYLE 2358 Query: 294 HH----TQAKEVDEHEFFYSQETGGTIVSSALKLMDEVVKERYNPAQWNIYAAQASDGDN 349 ++ H Q G + AL+ + V + N S G+ Sbjct: 2359 FDLVTYNSIHQMKHHLQDSQQLNGDVFIGHALQWTIDNVFVGTPNLRKNKVIFVISAGET 2418 Query: 350 WADDSPLCHEILAKKLLPVVRYYSYIEITRRAHQTLWREYE 390 + D + + + + + + + E Sbjct: 2419 NSLDKDVLRNVSLRAKCQGYSIFVFSFG-PKHNDKELEELA 2458 Score = 52.6 bits (124), Expect = 3e-05, Method: Composition-based stats. Identities = 29/197 (14%), Positives = 54/197 (27%), Gaps = 30/197 (15%) Query: 246 SSQAVMFCLMDVSGSMDQSTKDMAKRFYILLYLFLSRTYKNVEVVYIRHHTQAK------ 299 A + L+D S + + K F + L V ++ + Sbjct: 26 PVYADVVFLVDSSDHLGPKSFPFVKTFINKMINSLPIEANKYRVALAQYSDEFHSEFHLS 85 Query: 300 ------EVDEHEFFYSQETGGT-IVSSALKLMDEV-----VKERYNPAQWNIYAAQASDG 347 + H Q GG+ + AL+ + R I AS Sbjct: 86 TFKGRSPMLNHLKKNFQFIGGSLQIGKALQEAHRTYFSAPINGRDRKQFPPILVVLAS-- 143 Query: 348 DNWADDSPLCHEILAKKL-LPVVRYYSYIEITRRAHQTLWREYEHLQSTFDNFAMQHIRD 406 +S E +K L V+ S ++A + + +F ++ IRD Sbjct: 144 ----AESEDEVEEASKALQKDGVKIISVGV--QKASEENLKAMATS---HFHFNLRTIRD 194 Query: 407 QDDIYPVFRELFHKQNA 423 ++ Sbjct: 195 LSTFSQNMTQIIKDVTK 211 >UniRef50_B9R4P7 von Willebrand factor type A domain protein n=1 Tax=Labrenzia alexandrii DFL-11 RepID=B9R4P7_9RHOB Length = 624 Score = 78.0 bits (190), Expect = 6e-13, Method: Composition-based stats. Identities = 40/308 (12%), Positives = 77/308 (25%), Gaps = 23/308 (7%) Query: 81 HFVQNDRIERPQGGGGGSGSGQGQASQDGEGQDEFVFQISKDEYLDLLFEDLALPNLKQN 140 +PQ + + + E L L E L Sbjct: 272 QPNDPVEETQPQPAENDQSVSPDAGPKQDPESRDQETSQNAKEDLSALTE-----LLAAV 326 Query: 141 QQRQLTEYKTHRAGYTANGVPANISVVRSLQNSLAR-RTAMTAGKRRELHALEENLAIIS 199 + + A T + A +++ R R T+ A LA + Sbjct: 327 EAGTIDGLPEFLADTTRSSPRARSGKSGAVRKDARRGRPVSTSRMPPRPDARPNILATLR 386 Query: 200 NSEPAQLLE---EERLRKEIAELRAKIERVPFIDTFDLRYKNYEKRPDPSSQAVMFCLMD 256 + P QL+ + L ++ A T R +R + + ++D Sbjct: 387 AAAPWQLIRNRNRDLLAAKLERAIAAP-PRRKPRTLITRDDYRYQRLRHETPSTAIFVVD 445 Query: 257 VSGSMDQSTKDMAKRFYILLYLF-LSRTYKNVEVVYIR-------HHTQAKEVDEHEFFY 308 SGS K L R + + + T++ + + + Sbjct: 446 ASGSTALERLGETKGAIEQLLSRCYVRRDEVAMIAFRGTQAETLLSPTRSLVMAKRKLAG 505 Query: 309 SQETGGTIVSSALKLMDEVVKERYNPAQWNIYAAQASDGDNW----ADDSPLCHEILAKK 364 G T +++ L+ E+ +DG Sbjct: 506 LPGGGPTPLAAGLERGLELALSVRRQGST-PVLVFMTDGRGNIALDGTPDRTRAAEQVHA 564 Query: 365 LLPVVRYY 372 L R + Sbjct: 565 LAEQCRTH 572 >UniRef50_D2S019 ATPase associated with various cellular activities AAA_5 n=1 Tax=Haloterrigena turkmenica DSM 5511 RepID=D2S019_9EURY Length = 665 Score = 78.0 bits (190), Expect = 6e-13, Method: Composition-based stats. Identities = 35/237 (14%), Positives = 66/237 (27%), Gaps = 15/237 (6%) Query: 129 FEDLALPNLKQNQQRQLTEYKTHRAGYTANGVPANISVVRSLQNSLA----RRTAMTAGK 184 E N + + + G G A+ V R+ R Sbjct: 337 DEAEGSDNETDRESDEDRDAAAAAGGIPIRGDEASYPVDRTAIQPPRDRTMREALARRTP 396 Query: 185 RRELHALEENLAIISNSEPAQLLEEERLRKEIAELRAKIERVPFIDTFDLRYKNYEKR-P 243 + + + + + LR A+ E + K+ ++ Sbjct: 397 SKVDVRSGRYVRARDSESVDDVAIDATLRAAAPHQPARRETDDSSSGIAIEPKDLRQKIR 456 Query: 244 DPSSQAVMFCLMDVSGS-MDQSTKDMAKRFYILLY-LFLSRTYKNVEVVY-------IRH 294 + ++A++ ++D SGS M KR + L + VV+ + Sbjct: 457 ERRAEALVVFVVDASGSVMSGRQMFETKRGILSLVEDAYRARDRVAVVVFREEGAFTLVE 516 Query: 295 HTQAKEVDEHEFFYSQETGGTIVSSALKLMDEVVK-ERYNPAQWNIYAAQASDGDNW 350 T+ G T ++ L E+V+ ER SDG Sbjct: 517 PTRNLSAARRAVSKLTVGGNTPLAHGLVEAYELVERERRRDEDLYPLVVLFSDGQTN 573 >UniRef50_A7RFK1 Predicted protein (Fragment) n=2 Tax=Nematostella vectensis RepID=A7RFK1_NEMVE Length = 1418 Score = 78.0 bits (190), Expect = 6e-13, Method: Composition-based stats. Identities = 28/196 (14%), Positives = 56/196 (28%), Gaps = 36/196 (18%) Query: 249 AVMFCLMDVSGSMDQST-------KDMAKRFYILLYLFLSRTYKNVEVV----------- 290 + + L+D SGS+ + D K F L + +YK+ V Sbjct: 2 SDLIFLVDTSGSLQYWSGGGWKNGFDDEKVFVNSLLSHIRVSYKSTYVSVVLFGTSATID 61 Query: 291 ----YIRHHTQAKEVDEHEFFYSQE-TGGTIVSSALKLMDEVV-----KERYNPAQWNIY 340 + H K +F + +G T + A + +++ + Q Sbjct: 62 INYIFNPHPNNHKCNFRRDFSNLRFRSGMTNMHDAFQAAYDIIFGKYSGHKRPTHQVKTA 121 Query: 341 AAQASDGD-NWADDSPLCHEILAKKLLPVVRYYSYIEITRRAHQTLWREYEHLQSTFDNF 399 +DG NW D + L + + ++ + R + F Sbjct: 122 VFLLTDGQWNWNGDPWPIAKRLKDR---GIEIFTIGVTNG-VNVNTLRSLASPNN---YF 174 Query: 400 AMQHIRDQDDIYPVFR 415 ++ R Sbjct: 175 HYNDFTQFRELATCIR 190 >UniRef50_C0Z8R6 Putative uncharacterized protein n=1 Tax=Brevibacillus brevis NBRC 100599 RepID=C0Z8R6_BREBN Length = 597 Score = 78.0 bits (190), Expect = 6e-13, Method: Composition-based stats. Identities = 26/200 (13%), Positives = 53/200 (26%), Gaps = 25/200 (12%) Query: 238 NYEKRPDPSSQAVMFCLMDVSGSMDQSTKDMA-KRFYILLYLFLSRT-YKNVEVVYIR-- 293 + + ++DVS SM QS K+ + S K V Y Sbjct: 28 HPGFAQTSGNNMDAVLVVDVSNSMTQSDKNKVSNEAMKMFVDMTSIQANKVGVVAYTDKI 87 Query: 294 ---------HHTQAKEVDEHEFFYSQETGGTIVSSALKLMDEVVKERYNPAQWNIYAAQA 344 + + K + Q+ T ++ + +++ NP Sbjct: 88 EREKALLEINSEEDKNDIKAFIDSLQKGAYTDIAVGVTEAVKILDAGRNPN-NAPIIVLL 146 Query: 345 SDGDN--------WADDSPLCHEILAKKLLPVVR-YYSYIE-ITRRAHQTLWREYEHLQS 394 +DG+N S + K+ Y+ + ++T ++ + Sbjct: 147 ADGNNFLNKASSRTQAKSDQELQQAVKEAKDKGYPVYTIGLNADGQLNRTTLQQIAAETN 206 Query: 395 TFDNFAMQHIRDQDDIYPVF 414 F I Sbjct: 207 GKF-FETSTADKLPQILSEI 225 >UniRef50_Q4RPQ3 Chromosome 12 SCAF15007, whole genome shotgun sequence n=1 Tax=Tetraodon nigroviridis RepID=Q4RPQ3_TETNG Length = 457 Score = 78.0 bits (190), Expect = 6e-13, Method: Composition-based stats. Identities = 18/188 (9%), Positives = 49/188 (26%), Gaps = 13/188 (6%) Query: 238 NYEKRPDPSSQAVMFCLMDVSGSMDQSTKDMAKRFYILLYLFLSRTYKNVEVVYIRHHTQ 297 ++ + ++ ++D SGS+ ++ L F+S + +V+ Sbjct: 38 SWAEEASCHGAYDLYFVLDKSGSVAGDWGEIYSFVKNLTDRFVSPRMRVSFIVFSAQAKV 97 Query: 298 AKEVDEHEFF---------YSQETGGTIVSSALKLMDEVVKERYNPAQWNIYAAQASDGD 348 + + + G T + +K + R P+ + +DG Sbjct: 98 LLPLTGDSYKIKEGLRKLYDVKPAGETFMHVGIKEASVQI--RAQPSPTSSIILALTDGK 155 Query: 349 NWADDSPLCHEILAKKLLPVVRYYSYIEITRRAHQTLWREYEHLQSTFDNFAMQHIRDQD 408 L + + R Y + + + Sbjct: 156 LEVYVHDLTVKEANEARKYGARVYCVGIK--DFDEQQLANIADTKDQVFPVKDGFHALKG 213 Query: 409 DIYPVFRE 416 + + + Sbjct: 214 IVNSILKR 221 >UniRef50_Q9BPQ8 Integrin alpha Hr1 n=1 Tax=Halocynthia roretzi RepID=Q9BPQ8_HALRO Length = 1332 Score = 78.0 bits (190), Expect = 6e-13, Method: Composition-based stats. Identities = 28/200 (14%), Positives = 56/200 (28%), Gaps = 31/200 (15%) Query: 240 EKRPDPSSQAVMFCLMDVSGSMDQSTKDMAKRFYILLYLFLSRTYKNVEVVYIRHHTQAK 299 PSS + ++D SGS+ D K + + L + V V +++ + Sbjct: 196 RNTECPSSGVDVLFVLDGSGSV-GKNFDKVKDWVKNITAKLDIGKEIVRVGVVQYSHYVE 254 Query: 300 ----------------------EVDEHEFFYSQETGGT-IVSSALKLMDEVVKERYNPAQ 336 + E+ Q G T AL+ + + Y Sbjct: 255 GKSINKQKYITTEISIGEFKLLDNFENAVDRIQLQGYTTYTGRALQKVIRDFDDAYI--G 312 Query: 337 WNIYAAQASDGDNWADDSPLCHEILAKKLLPVVRYYSYIEITRRAHQTLWREYEHLQSTF 396 +DG A D+ L + + ++ + + + Sbjct: 313 NKQVLLLLTDGQ--AKDNKLILPNANRLRNKGIATFAVGVG--EYDISELKLIASGTDST 368 Query: 397 DN-FAMQHIRDQDDIYPVFR 415 D F + + D I + Sbjct: 369 DRVFTVTDFGELDSIVKSLQ 388 >UniRef50_C3XUD0 Putative uncharacterized protein n=1 Tax=Branchiostoma floridae RepID=C3XUD0_BRAFL Length = 443 Score = 78.0 bits (190), Expect = 6e-13, Method: Composition-based stats. Identities = 22/207 (10%), Positives = 49/207 (23%), Gaps = 31/207 (14%) Query: 247 SQAVMFCLMDVSGSMDQSTKDMAKRFYILLYLFLSR-------TYKNVEVVYIRHH---- 295 +D SGSM+ K+ L + V + Sbjct: 1 MPLDTVLCLDTSGSMNGRGMAELKKGVRHFLLGVQETANKMSLRENVAVVEFGGGARIIQ 60 Query: 296 --TQAKEVDEHEFFYSQETGGTIVSSALKLMDEVVKER------YNPAQWNIYAAQASDG 347 + + G T + L + + +R + +DG Sbjct: 61 PLSGNYGTVMQSVDNLKAGGTTPMFEGLMEAMKEILQRGGVLTLPGGRKMTPRVILMTDG 120 Query: 348 DNWADDSPLCHEI-------LAKKLLPVVRYYSYIEITRRAHQTLWREYEHLQSTFDNFA 400 ++ L + A L + + L + L + Sbjct: 121 YPDDKENVLKAALSFGPAGWQAVGLPHPIPIACVGCG-DDVDKDLLQAIAKLTNGMY--I 177 Query: 401 MQHIRDQDDIYPVFRELFHKQNATAKG 427 + + + + R++ + A G Sbjct: 178 LGDVSQLSEFFR--RQVLLIRFAAQFG 202 >UniRef50_C7DFN0 Magnesium chelatase ATPase subunit D n=1 Tax=Thalassiobium sp. R2A62 RepID=C7DFN0_9RHOB Length = 548 Score = 78.0 bits (190), Expect = 7e-13, Method: Composition-based stats. Identities = 53/385 (13%), Positives = 106/385 (27%), Gaps = 38/385 (9%) Query: 30 IKQSISEAINKRSVTDVDSGESVSIPTEDISEPMFHQGRGGLRHRVHPGNDHFVQNDRIE 89 +++S + I +V +VS+ + P F + + + D + Sbjct: 164 LRRSARKVIVPPAVLSDLVTLAVSLGISSLRAPTFAL--NAAQTHAAWQDRDTLTADDVA 221 Query: 90 RPQGGGGGSGSGQGQASQDGE--GQDEFVFQISKDEYLDLLFEDLALPNLKQNQQRQLTE 147 + Q E QD +D+ + + L L +K L Sbjct: 222 IAVALVYAHRATQMPHDDATENTPQDANQDSQPQDQPFSIPKDIL-LDAIKAVLPPNLLA 280 Query: 148 YKTHRAGYTANGVPANISVVRSLQNSLARRTAMTAGKRRELHALEENLAIISNSEPAQLL 207 + A GV + + N R G + + + +A + + P Q L Sbjct: 281 NLDAQTSKRAAGVGSGAKR---ISNRRGRPLPARTGSKASAARV-DLIATLRAAIPWQTL 336 Query: 208 EEERLRKEIAELRAKIERVPFIDTFDLRYKNYEKRPDPSSQAVMFCLMDVSGSMDQSTKD 267 + I + + + S ++ +D SGS + Sbjct: 337 RKNAEPARIGPIFRQSDLRHKCYQTL-------------SDRLLIFTVDASGSAAMARLA 383 Query: 268 MAKRFYILLY-LFLSRTYKNVEVVY-------IRHHTQAKEVDEHEFFYSQETGGTIVSS 319 AK LL +R + + I T++ + GGT +++ Sbjct: 384 EAKGAVELLLSEAYARRDHVALIAFRGTDADLILPPTRSLVQTKKRLAALPGGGGTPLAA 443 Query: 320 ALKLMDEVVKERYNPAQWNIYAAQASDGD-NWADDSPLCHE------ILAKKLLPVVRYY 372 + + +DG N A D + L K + R Sbjct: 444 G-LTAALALAQTTTQKGLTPTIVLLTDGRANVALDGTGNRKLAATDAQLIAKKINAARVE 502 Query: 373 SYIEITRRAHQTLWREYEHLQSTFD 397 + + T + +E Sbjct: 503 AIVIDTTMRPERALQELAQTMDATY 527 >UniRef50_C3Z863 Putative uncharacterized protein (Fragment) n=2 Tax=Branchiostoma floridae RepID=C3Z863_BRAFL Length = 286 Score = 78.0 bits (190), Expect = 7e-13, Method: Composition-based stats. Identities = 24/192 (12%), Positives = 57/192 (29%), Gaps = 21/192 (10%) Query: 250 VMFCLMDVSGSMDQSTKDMAKRFYILLYLFLSRTYKNVEVVYIRHHTQAK-EVDEHEFFY 308 + ++D +GS+ + K F L+ +L V V +++ QA+ E +++ Sbjct: 1 DIVFIVDGTGSVGLENFERMKTFIRQLFSYLDIGENAVRVSIVQYAAQARTEFFLDQYYD 60 Query: 309 SQE-----------TGGTIVSSALKLMDEVVKERYN--PAQWNIYAAQASDGDNWADDSP 355 QE G T+ A+ + + A A +DG ++ D + Sbjct: 61 LQEAQDAVDDIEYMGGYTLTGKAIDFATNLHFDLRKGARADVTKIAVVITDGRSYDDVNR 120 Query: 356 LCHEILAKKLLPVVRYYSYIEITRRAHQTLWREYEHLQSTFDNFAMQHIRDQDDIYPVFR 415 + + + + ++ D+ Sbjct: 121 PAR----RMRQAGIVTIAVGVGN-NLDRDQLTAIA--GDPKTLLSLDGFDRLQDLTTSLP 173 Query: 416 ELFHKQNATAKG 427 + ++ G Sbjct: 174 TMLCDVVSSTVG 185 >UniRef50_C5FHM8 von Willebrand factor type A domain-containing protein n=1 Tax=Microsporum canis CBS 113480 RepID=C5FHM8_NANOT Length = 1002 Score = 78.0 bits (190), Expect = 7e-13, Method: Composition-based stats. Identities = 20/244 (8%), Positives = 54/244 (22%), Gaps = 31/244 (12%) Query: 198 ISNSEPAQLLEEERLRKEIAELRAKIERVPFIDTFDLRYKNYEKRPDPSSQAVMFCLMDV 257 + ++ + + ++ + +P + L+D Sbjct: 272 LDRDFVLRISTIASAKPRASLETHPSIEGHSAIMIEIPPDFMLESQEPVDDKEIIFLVDR 331 Query: 258 SGSMDQSTKDMAKRFYILLYLFLSRTYKNVEVVYIRHHTQAKEVDEHEFFYSQET----- 312 SGSM L + + Q + Sbjct: 332 SGSMAG-KIHGLISSMQFYLRSLPMSTLFNICSF-GSSYQLLWEQSRAYSEITLNEALYY 389 Query: 313 --------GGTIVSSALKLMDEVVKERYNPAQWNIYAAQASDGDNWADDSPLCHEILAKK 364 GGT + AL+ + + + + +DG+ W + Sbjct: 390 VSSFSSNLGGTDLLPALEHVV-LQQNHSSKD-----IIVLTDGEVW---RLEETIRFVRL 440 Query: 365 ----LLPVVRYYSYIEITRRAHQTLWREYEHLQSTFDNFAMQHIRDQDDIYPVFRELFHK 420 +R+++ +H+ L + + + + Sbjct: 441 THIVSKKAIRFFALGIGNAVSHE-LVEGIANSGGGYAEIIPATSSGNWE--DRLVAVLRA 497 Query: 421 QNAT 424 + Sbjct: 498 ALSG 501 >UniRef50_A1AQS2 Protoporphyrin IX magnesium-chelatase n=1 Tax=Pelobacter propionicus DSM 2379 RepID=A1AQS2_PELPD Length = 617 Score = 78.0 bits (190), Expect = 7e-13, Method: Composition-based stats. Identities = 41/305 (13%), Positives = 84/305 (27%), Gaps = 25/305 (8%) Query: 73 HRVHPGNDHFVQNDRIERPQGGGGGSGSGQGQASQDGEGQDEFVFQISKDEYLDLLFEDL 132 HR + + E+P + Q + + +DE + Sbjct: 267 HRRLQLREEEQNSTEPEQPHDQDQQQNNEQRDDQGEQP----PPPENGQDENDRHPSGEG 322 Query: 133 ALPNLKQNQQRQLTEYKTHRAGYTANGVPANISVVRSLQNSLARRTAMTAGKRRELHALE 192 + R G P + + ++ R+ + R Sbjct: 323 EHESTVSMG------ESAQREEIMGVGAPFKLRRLSFRKDRRKRQANGRRTRTRIKGRGG 376 Query: 193 ENLAIISNSEPAQLLEEERLRKEIA-ELRAKIERVPFIDTFDLRYKNYEKRPDPSSQAVM 251 + + +S + + LR + + + I+ DLR++ E ++ Sbjct: 377 RYVKSLLSSTEHDIAIDATLRACAPFQKARNRQGMLKIEQDDLRFRQRE----RRMGHLV 432 Query: 252 FCLMDVSGSMDQ-STKDMAKRFYI-LLYLFLSRTYKNVEVVY-------IRHHTQAKEVD 302 ++D SGSM K LL + K +V+ + T + E+ Sbjct: 433 LFVVDGSGSMGARQRMMETKGAVQSLLLDCYQKRDKVAMIVFRKDRAELVLPPTASVELA 492 Query: 303 EHEFFYSQETGGTIVSSALKLMDEVVKERYNPAQWNIY-AAQASDGDNWADDSPLCHEIL 361 G T ++S L +V+ +DG +P + Sbjct: 493 ARRLAELPVGGKTPLASGLLKTHRLVRRTSMHHPEQRILVVLITDGRGNQHLTPETRKEE 552 Query: 362 AKKLL 366 L Sbjct: 553 VSNLA 557 >UniRef50_A0YGD2 Von Willebrand factor type A domain protein n=4 Tax=Bacteria RepID=A0YGD2_9GAMM Length = 341 Score = 78.0 bits (190), Expect = 7e-13, Method: Composition-based stats. Identities = 19/189 (10%), Positives = 42/189 (22%), Gaps = 29/189 (15%) Query: 240 EKRPDPSSQAVMFCLMDVSGSMDQSTKD-----------MAKRFYILLYLFLSRTYKNVE 288 E S + +D+SGSM+ K+ + Sbjct: 95 EPIEINRSARDLMVAVDLSGSMEAQDFTTEQGEKIDRLTAVKQVLTEFSQR-RDGDRLGL 153 Query: 289 VVY---------IRHHTQAKEVDEHEFFYSQETGGTIVSSALKLMDEVVKERYNPAQWNI 339 +V+ E + T + A+ L + N Sbjct: 154 IVFGSAAYLQAPFTADKDTWLTLLQETEIAMAGASTSIGDAIGLSISTFEHS---DTDNR 210 Query: 340 YAAQASDGDNWAD-DSPLCHEILAKKLLPVVRYYSYI----EITRRAHQTLWREYEHLQS 394 +DG++ P+ +A + + ++ + Sbjct: 211 VLIVLTDGNDTGSRVPPVDAARVANARDVKIYTIAIGDPETIGEDAMDVDTLKQVSDITG 270 Query: 395 TFDNFAMQH 403 A+ Sbjct: 271 GAYFEALDR 279 >UniRef50_B6VQ59 Protein F11C1.5d, partially confirmed by transcript evidence n=6 Tax=Caenorhabditis RepID=B6VQ59_CAEEL Length = 1818 Score = 77.6 bits (189), Expect = 7e-13, Method: Composition-based stats. Identities = 46/443 (10%), Positives = 112/443 (25%), Gaps = 74/443 (16%) Query: 8 RLNGKNKSMVNRQRFLRRYKAQIKQSISEAINKRSVTDVDSGESVSIPTEDISEPMFHQG 67 R++ KS + +R + + + + + + EP Sbjct: 1422 RMSTLEKSFDDWKR--------MTGGADD----EKLRMEFDRKPDDVDFNKLDEPKL--- 1466 Query: 68 RGGLRHRVHPGNDHFVQNDRIERPQGGGGGSGSGQGQASQDGEGQDEFVFQISKDEYLDL 127 ++ P N ++ GG +G G + + Sbjct: 1467 -----GKIDPNNAPHHGGNQWMGGTGGYNTAGLGGIGGPFRLDAGHDVHQMPD------- 1514 Query: 128 LFEDLALPNLKQNQQRQLTEYKTHRAGYTANGVPANISVVRSLQNSLARRTAMTAGKRRE 187 F +P + R++ + + + N + + + + K Sbjct: 1515 -FAKQQVPKHILKKAREIAKVEYAKKLREINMSEYDAD----AYEKVWNKVQAPSRKLAS 1569 Query: 188 LHALEENLAIISNSEPAQLLEEERLRKEIAELRAKIERVPFIDTFDLRYKNYEKRPDPSS 247 + E Q + K I + + K + + Sbjct: 1570 VIDQLEAKKKEREWTKHQTSGDLDDGKLIEGVTGEQNIYRRRID-----KVPDPGAPQTK 1624 Query: 248 QAVMFCLMDVSGSM---DQSTKDMAK--RFYILLYLFLSRTYKNVEVVYIRHHTQAKEV- 301 + +DVSGSM + + + K ++ L V+ I H + V Sbjct: 1625 PKRLKVCLDVSGSMYRFNGYDQRLVKSLEAALMTMTALDGKTDKVQYDIIGHSGDSPCVS 1684 Query: 302 -------DEHEFFYSQE------------TGGTIVSSALKLMDEVVKERYNPAQWNIYAA 342 ++ +G +L+ + + + + Sbjct: 1685 FVKTDHHPKNNKERLDTLKRMIAHTQYCVSGD-NTVESLQFAIKELAAKKDDFDET-VVI 1742 Query: 343 QASDG--DNWADDSPLCHEILAKKLLPVVRYYSYIEITRRAHQTLWREYEHLQSTFDNFA 400 SD + + + +AK+ ++ A Q + L Sbjct: 1743 LVSDANLERYGIQPKELKDAMAKEPNINSFVIFIGSLSDEADQ--LQR--ELPVGKAFV- 1797 Query: 401 MQHIRDQDDIYPVFRELFHKQNA 423 ++D ++ + +F A Sbjct: 1798 ---LKDTSELPKIMETIFSSTIA 1817 >UniRef50_C5GPY5 von Willebrand domain-containing protein n=2 Tax=Ajellomyces RepID=C5GPY5_AJEDR Length = 1108 Score = 77.6 bits (189), Expect = 7e-13, Method: Composition-based stats. Identities = 32/310 (10%), Positives = 82/310 (26%), Gaps = 34/310 (10%) Query: 133 ALPNLKQNQQRQLTEYKTHRAGYTANG--VPANISVVRSLQNSLARRTAMTAGKRRELHA 190 +L + + +T + G T G P + + + S+A + H Sbjct: 178 SLASFVKKGAINITVDVSVDRGSTIRGLHSPTHPVAITLGRTSVAAQDLFEPNLASAAHT 237 Query: 191 LEENLAIISNSEPA--QLLEEERLRKEIAELRAKIERVPFIDTFDLRYKNYEKRPDPSSQ 248 +++ A +++ + + + + T ++ P++ Sbjct: 238 MQQGNAFFDTDFVLIVNAKDQDVPSAFVEKHPTIPNQRAVMATLVPKFN------IPNNN 291 Query: 249 AVMFCLMDVSGSMDQSTKDMAKRFYILLYLFLSRTYKNVEVVYIRHHTQAK--------- 299 + ++D SGSM + + L K + H+ Sbjct: 292 PEIVFIIDRSGSMTG-NIKTLQSALRVFLKSLPVGVKFNICSFGSRHSFMWNKSKTYDAS 350 Query: 300 --EVDEHEFFYSQET-GGTIVSSALKLMDEVVKERYNPAQWNIYAAQASDGDNWADDSPL 356 + GGT + +K + + + + G+ W Sbjct: 351 SLKAALQYVDSIAADFGGTEMLEPVKATVKNRLKDLDLDVLLLSD-----GEIW---DQK 402 Query: 357 CHEILAKK--LLPVVRYYSYIEITRRAHQTLWREYEHLQSTFDNFAMQHIRDQDDIYPVF 414 + +R +S + + Q+L F F + + + + Sbjct: 403 TLFAYLNEVVSEQPIRLFSLGIGSGAS-QSLIEGIARAGDGFAQFVGDNEQLDKKVVRML 461 Query: 415 RELFHKQNAT 424 + Sbjct: 462 KGALTPHIKD 471 >UniRef50_B8KT14 Magnesium-chelatase 60 kDa subunit n=1 Tax=gamma proteobacterium NOR51-B RepID=B8KT14_9GAMM Length = 610 Score = 77.6 bits (189), Expect = 7e-13, Method: Composition-based stats. Identities = 48/317 (15%), Positives = 90/317 (28%), Gaps = 27/317 (8%) Query: 125 LDLLFEDLALPNLKQNQQRQLTEYKTHRAGYTANGVPANISVVRSLQNSLARRTAMTAGK 184 E+ P + + L + +PA + L+N +RR ++ AGK Sbjct: 295 EQAQDEEAQSPETEPDDPVPLEALSEQLLEASTAVLPAEMLGQLLLRNQPSRRQSLHAGK 354 Query: 185 RRELHALEENLAIISNSEPAQLLEEERLRKEIAELRAKIERVPFIDTFDLRY-------- 236 ++ I E+ A +R+ L Sbjct: 355 SGATQRSKQRGRPIGVKPGNPKSGEKLNLVATLRTAAPWQRLRQQQRDGLNTGTNIAIET 414 Query: 237 -KNYEKRPDPSSQAVMFCLMDVSGSMDQSTKDMAKRFYILLY-LFLSRTYKNVEVVYIRH 294 R S + ++D SGS AK LL R + + + Sbjct: 415 SDFRVVRYRQRSASTTVFVVDASGSAALHRLAEAKGAVELLLAECYVRRDQVALIAFRGT 474 Query: 295 -------HTQAKEVDEHEFFYSQETGGTIVSSALKLMDEVVKERYNPAQWNIYAAQASDG 347 T++ + GGT ++SAL+L+D V ++ + Y +DG Sbjct: 475 EAELLLPPTRSLVRAKRSLAELPGGGGTPLASALRLLDTVTEQIASHGGTPHYV-LLTDG 533 Query: 348 DNW----ADDSPLCHEILAKKLLPVVR---YYSYIEITRRAHQTLWREYEHLQSTFDNFA 400 + A++ +R + T E +A Sbjct: 534 RANIGLSGEPGREQASQDAEQTASHLRKRQLRGIVIDTSSRPSYRAEELAGHLG--AIYA 591 Query: 401 MQHIRDQDDIYPVFREL 417 + D+ V + + Sbjct: 592 PLPQANASDLQRVIQAV 608 >UniRef50_C6JPK6 BatA protein n=2 Tax=Fusobacterium RepID=C6JPK6_FUSVA Length = 325 Score = 77.6 bits (189), Expect = 8e-13, Method: Composition-based stats. Identities = 27/218 (12%), Positives = 54/218 (24%), Gaps = 52/218 (23%) Query: 237 KNYEKRPDPSSQAVMFCLMDVSGSMDQST-----KDMAKRFYILLYLFLSRTYKNVEVVY 291 K ++ ++ L+D S SM + AKR L L + + + Sbjct: 71 KLLDEDTVEVKGLNIYALIDTSRSMMAEDVYPNRLEAAKRTLENLLQGLK-GDRIGFIPF 129 Query: 292 IRHHTQAKEVDEHE----------FFYSQETGGTIVSSALKLMDEVVKERYNPAQWNIYA 341 + + GGT + AL+L ++ KE N Sbjct: 130 SDSAYIQMPLTDDYSIGKNYINALDTNLISGGGTELYQALELAEKSFKE---INSDNKTI 186 Query: 342 AQASDGDNWADDSPLCHEILAKKLLPVVRYYSYIEIT----------------------- 378 SDG ++ + S + + +S T Sbjct: 187 IILSDGGDFDEKSLKFVKD------NKMNVFSIGIGTEEGTIIPEYVNGKKVGFIKDQNG 240 Query: 379 ----RRAHQTLWREYEHLQSTFDNFAMQHIRDQDDIYP 412 + + ++ D + + Sbjct: 241 SAVISKLNSDFLKKLSSESDGKYYEVNNLKDDSSNFFK 278 >UniRef50_B5W3I3 von Willebrand factor type A n=4 Tax=Cyanobacteria RepID=B5W3I3_SPIMA Length = 228 Score = 77.6 bits (189), Expect = 8e-13, Method: Composition-based stats. Identities = 25/175 (14%), Positives = 53/175 (30%), Gaps = 21/175 (12%) Query: 238 NYEKRPDPSSQAVMFCLMDVSGSMDQSTKDMAKRFYILLYLFLSRTY---KNVEVVYIRH 294 E +P + L+D S SM D + L + K VE+ I Sbjct: 14 AVEFAENPEPRCPCVLLLDTSASMQGEPLDGLNAGLMTFRENLIKDELAKKRVEIAIITF 73 Query: 295 HTQAKEVDEHEFFY------SQETGGTIVSSALKLMDEVVKERYNPAQWN------IYAA 342 Q K + + G T + +A+ +++ R + N + Sbjct: 74 DNQVKIIQDFVTADRFEPPLLNAQGQTYMGTAIGEALDMIASRKAEYRNNGITYYRPWVF 133 Query: 343 QASDGDNWADDSPLCHEILAK----KLLPVVRYYSYIEITRRAHQTLWREYEHLQ 393 +DG+ + + + + + + V +++ A+ E Sbjct: 134 MITDGEPQGESDRITEQAIKRIRDEEANKQVAFFAVGVEG--ANMERLGEMAQRT 186 >UniRef50_A8LNP9 von Willebrand factor type A n=3 Tax=Rhodobacteraceae RepID=A8LNP9_DINSH Length = 320 Score = 77.6 bits (189), Expect = 8e-13, Method: Composition-based stats. Identities = 27/216 (12%), Positives = 53/216 (24%), Gaps = 32/216 (14%) Query: 240 EKRPDPSSQAVMFCLMDVSGSMDQSTKDM---------AKRFYILLYLFLSRTYKNVEVV 290 S + ++D+SGSM + ++ A + + + VV Sbjct: 82 PVSALKVSGRDLAIVLDLSGSMVRDDFNLDDRAVTRLEAVKAVGADFARRRAGDRLALVV 141 Query: 291 YIR---HHTQAKEVDEHEFFYSQE------TGGTIVSSALKLMDEVVKERYNPAQWNIYA 341 + + E +E T +S L L + + + Sbjct: 142 FGSEAYFASPFTFDTESVARRIEEATIGISGRATSISDGLGLALKRLSTST---ATSRVV 198 Query: 342 AQASDG-DNWADDSPLCHEILAKKLLPVVRYYSYI---------EITRRAHQTLWREYEH 391 SDG +N +P LA + V + R Sbjct: 199 ILLSDGINNAGATNPRGVAELAARYGVRVHTIALGPKDLTTAEVGERGVVDAATLRAISQ 258 Query: 392 LQSTFDNFAMQHIRDQDDIYPVFRELFHKQNATAKG 427 + +F ++ D + L Sbjct: 259 ISGGE-SFRVRTTEDLVAVTEALDRLEATDGDGLAA 293 >UniRef50_A1ZJV3 Von Willebrand factor, type A, putative n=1 Tax=Microscilla marina ATCC 23134 RepID=A1ZJV3_9SPHI Length = 354 Score = 77.6 bits (189), Expect = 8e-13, Method: Composition-based stats. Identities = 24/224 (10%), Positives = 58/224 (25%), Gaps = 44/224 (19%) Query: 241 KRPDPSSQAVMFCLMDVSGSMDQSTK-----DMAKRFYILLYLFLSRTYKNVEVVYIRHH 295 K+ +F +D+S SM + + K + L + + V++ Sbjct: 97 KKSIAVVGKDIFIAVDLSLSMKATDIPPSRLEKIKYELSNIINTLK-SDRIGLVIFSSSA 155 Query: 296 TQAKEVDEHE----------FFYSQETG--GTIVSSALKLMDEVVKE-----RYNPAQWN 338 + + G GT + L+L+ + +E R ++ Sbjct: 156 FMHCPLTYDKGALNLFTQILNTNLMPIGNAGTDFYAPLELVLKKYQEANKSNRKQQNEYA 215 Query: 339 IYAAQASDGDNWADDSPLCHEILAKKLLPVVRYYSYIEITRR------------------ 380 SDG+ + D K+ V + Sbjct: 216 KVVVLFSDGEEFG-DRYTAIVDQYKQNNIRVFTVGVGSLQGGKIPTSLGFKKDKKGKVVL 274 Query: 381 --AHQTLWREYEHLQSTFDNFAMQHIRDQDDIYPVFRELFHKQN 422 T + + + + ++ +E+ ++ Sbjct: 275 SKLSTTSLQTIAEQTNGRFFEVSETKNEIPELINTIQEIKGQKL 318 >UniRef50_B6HQM8 Pc22g11730 protein n=17 Tax=Leotiomyceta RepID=B6HQM8_PENCW Length = 1029 Score = 77.6 bits (189), Expect = 8e-13, Method: Composition-based stats. Identities = 16/168 (9%), Positives = 39/168 (23%), Gaps = 17/168 (10%) Query: 248 QAVMFCLMDVSGSMDQSTKDMAKRFYILLYLFLSRTYKNVEVVY---------IRHHTQA 298 + ++ VS SM + + L L + V + + T++ Sbjct: 513 PLDLVVVIPVSSSMQGLKITLLRDALKFLVQNLGPRDRMGLVTFGSSGGGVPLVGMTTKS 572 Query: 299 KEVDEHEFFYSQETG----GTIVSSALKLMDEVVKERYNPAQWNIYAAQASDGDNWADDS 354 + G V + +++ +R + SD +S Sbjct: 573 WAGWSKILESIRPVGQKSLRADVVEGANVAMDLLMQRKFNNPVST-ILLISDSSISDPES 631 Query: 355 PLCHEILAKKLLPVVRYYSYIEITRRAHQTLWREYEHLQSTFDNFAMQ 402 A+ + + E ++ Sbjct: 632 VDFVVSRAEAAKVTIHSFGLGL---THKPDTMIELSTRTKGSYSYVKD 676 >UniRef50_C0AYK3 Putative uncharacterized protein n=1 Tax=Proteus penneri ATCC 35198 RepID=C0AYK3_9ENTR Length = 227 Score = 77.6 bits (189), Expect = 9e-13, Method: Composition-based stats. Identities = 24/181 (13%), Positives = 52/181 (28%), Gaps = 19/181 (10%) Query: 230 DTFDLRYKNYEKRPDPSSQAVMFCLMDVSGSMDQSTKDMAKRFYILLYLFLS------RT 283 L + + + + ++D SGSM LL L + Sbjct: 1 MMEHLMIPDVALVDNSEQRTPLILVLDSSGSMYGQPIQQLNEGLKLLEQELKNDVIAAKR 60 Query: 284 YKNVEVVYIRHH-----TQAKEVDEHEFFYSQETGGTIVSSALKLMDEVVK---ERYNPA 335 + + + Y + K+ + + G T + A+ L E ++ +R+ A Sbjct: 61 VRILVIEYGGYDQCTIHGDWKDAMDFTAPVLEANGTTPMGQAITLALEEIEAEKQRFKQA 120 Query: 336 Q---WNIYAAQASDGDNWADDSPLCHEILAKKLLPVVRYYSYIEITRRAHQTLWREYEHL 392 + SDG D L ++ + + + A + + Sbjct: 121 GVAYTRPWLFLMSDGVPT--DQWEQAAQLCRQAEESQKTAVFPIMVDGASAEVMGSFSRN 178 Query: 393 Q 393 Sbjct: 179 G 179 >UniRef50_A0LFJ6 Protoporphyrin IX magnesium-chelatase n=1 Tax=Syntrophobacter fumaroxidans MPOB RepID=A0LFJ6_SYNFM Length = 612 Score = 77.6 bits (189), Expect = 9e-13, Method: Composition-based stats. Identities = 40/311 (12%), Positives = 81/311 (26%), Gaps = 44/311 (14%) Query: 53 SIPTEDISEPMFHQGRGGLRHRVHPGNDHFVQNDRIERPQGGGGGSGSGQGQASQDGEGQ 112 +P I + R + G G+ +A DG+G Sbjct: 262 VVPLALIHR---RRSAAPPREEEMREQQSQIPRAEEREQAGDSSRGGAEAREAPADGDGS 318 Query: 113 DEFVFQISKDEYLDLLFEDLALPNLKQNQQRQLTEYKTHRAGYTANGVPANISVVRSLQN 172 S++ R G + +R ++ Sbjct: 319 QAHSSPKSRE--------------------------GHSREEVFPVGDSFKVKRMRFAKD 352 Query: 173 SLARRTAMTAGKRRELHALEENLAIISNSEPAQLLEEERLRKEIA-ELRAKIERVPFIDT 231 + R + R + I ++ + + LR ++ + Sbjct: 353 RIERNASGRRTNTRFSGKAGRYVGSILRAKKLDVAVDATLRAAAPWQILRGRTGNLIVSR 412 Query: 232 FDLRYKNYEKRPDPSSQAVMFCLMDVSGSMDQST--KDMAKRFYILLYLFLSRTYKNVEV 289 DLR+K R + ++ +D SGSM + +L + K + Sbjct: 413 EDLRFK----RREKKMGHLVVFAVDCSGSMGARRRMIETKGAVLSMLTDCYHKRDKVSLI 468 Query: 290 VY-------IRHHTQAKEVDEHEFFYSQETGGTIVSSALKLMDEVVKERYNPAQWNIYA- 341 + + T + E+ G T + +AL + +++ Sbjct: 469 AFRKDGAEVVLPPTSSVELASRRLAEIPVGGKTPLPAALVAIYNLIRRVLIKEPALRIVA 528 Query: 342 AQASDGDNWAD 352 A SDG Sbjct: 529 AVVSDGRANQG 539 >UniRef50_A8IJ40 Predicted protein n=1 Tax=Chlamydomonas reinhardtii RepID=A8IJ40_CHLRE Length = 434 Score = 77.6 bits (189), Expect = 9e-13, Method: Composition-based stats. Identities = 26/239 (10%), Positives = 51/239 (21%), Gaps = 42/239 (17%) Query: 198 ISNSEPAQLLEEERLRKEIAELRAKIERVPFIDTFDLRYKNYEKRPDPSSQAVMFCLMDV 257 S + + L + +L + + K + + C++D Sbjct: 120 FSANTEQPVPPVSDLDGRLDKLLKQYGLGEEAVRAVVSLKAVADVKQR-AHVALTCVLDR 178 Query: 258 SGSMDQSTKDMAKRFYILLYLFLSRTYKNVEVVYIR----------HHTQAKEVDEHEFF 307 SGSM + + L L+ V Y A+ + Sbjct: 179 SGSMSGERIALVRETCHFLIDQLTPDDYLGIVSYSGGVRADVPLLRMTPAARGLAHAMVD 238 Query: 308 YSQETGGTIVSSALKLMDEVVKERYNPAQWNIYAAQASDGDNWADDSPLCHEILAKKLLP 367 + G T + L E P +D Sbjct: 239 ALEADGSTALYDGLVAGVRQQMEAEAP----------TD--------------------Q 268 Query: 368 VVRYYSYIEITRRAHQTLWREYEHLQSTFDNFAMQHIRDQDDIYPVFRELFHKQNATAK 426 V +++ + L + QS + L + Sbjct: 269 HVTVHTFGFGAGHS-VELLQAVADAQSGVYYYISCVDDIPSGFGDALGGLLAVVAKDVR 326 >UniRef50_Q3APD9 von Willebrand factor, type A n=1 Tax=Chlorobium chlorochromatii CaD3 RepID=Q3APD9_CHLCH Length = 329 Score = 77.6 bits (189), Expect = 9e-13, Method: Composition-based stats. Identities = 31/258 (12%), Positives = 66/258 (25%), Gaps = 33/258 (12%) Query: 188 LHALEENLAIISNSEPAQLLEEERLRKEIAELRAKIERVPFIDTFDLRYKNY----EKRP 243 + +A IS + + + + + R + L Sbjct: 32 KERWQRQVAAISFPDVQRFERAKLVAPRWMVRMPQWFRWAALAVGMLLLAEPHLTLRSTT 91 Query: 244 DPSSQAVMFCLMDVSGSMDQS------TKDMAKRFYILLYLFLSRTYKNVEVVYIRHH-- 295 + M +D+S SM QS ++A++ + + VV+ Sbjct: 92 AAARGIDMVLAIDISESMMQSQTDTQSRFEIARQAARNVVEQ-RSNDRIGLVVFRGEAYT 150 Query: 296 --------TQAKEVDEHEFFYSQETGGTIVSSALKLMDEVVKERYNPAQWNIYAAQASDG 347 T + ++ + GT + SAL + ++ +DG Sbjct: 151 LSPLTRDHTVLSLLLDNLSSRIIQDDGTAIGSALLVALNRLQAS---ESELQMVILLTDG 207 Query: 348 DNW-ADDSPLCHEILAKKLLPVVRYYSYIE--------ITRRAHQTLWREYEHLQSTFDN 398 +N + SPL LA + + + +E Sbjct: 208 ENNAGEVSPLTAAALAARRGVRFYVLNVAFESVKDENAPRSALYAAELQEVARRTGGSYF 267 Query: 399 FAMQHIRDQDDIYPVFRE 416 + I + Sbjct: 268 TVNNKTELETTIASIAAR 285 >UniRef50_D2R6V4 von Willebrand factor type A n=1 Tax=Pirellula staleyi DSM 6068 RepID=D2R6V4_9PLAN Length = 395 Score = 77.2 bits (188), Expect = 1e-12, Method: Composition-based stats. Identities = 11/102 (10%), Positives = 31/102 (30%), Gaps = 4/102 (3%) Query: 313 GGTIVSSALKLMDEVVKERYNPAQWNIYAAQASDGDNWADDSPLCHEILAKKLLPVVRYY 372 GGT +S+ + ++ A +DG P+ A + Sbjct: 288 GGTNMSAGIDAGRTLLNGNTVRALAKKTMILMTDGQWNQGRDPIDAAEDAADEGIQIHTI 347 Query: 373 SYIEITRRAHQTLWREYEHLQSTFDNFAMQHIRDQDDIYPVF 414 +++ + Q R+ + + + + ++ + Sbjct: 348 TFLSGSA---QNTMRQVAEITGGKY-YVSSNQAELEEAFRDL 385 >UniRef50_UPI0000D9E789 PREDICTED: similar to poly (ADP-ribose) polymerase family, member 4, partial n=7 Tax=Euteleostomi RepID=UPI0000D9E789 Length = 741 Score = 77.2 bits (188), Expect = 1e-12, Method: Composition-based stats. Identities = 27/221 (12%), Positives = 63/221 (28%), Gaps = 25/221 (11%) Query: 213 RKEIAELRAKIERVPFIDTFDLRYKNYEKRPDP--SSQAVMFCLMDVSGSMDQSTKDMAK 270 + + + + L ++ P ++++ + +D S SM+ T AK Sbjct: 386 DAYLPRMWVEKHPEKESEACMLVFQPDLDVHLPDLANESEVIICLDCSSSMEGVTFLQAK 445 Query: 271 RFYILLYLFLSRTYKNVEVVYIRHH---TQAKEVDEHEFFYSQ---------ETGGTIVS 318 + + + +K + + T + + G T Sbjct: 446 QIALHALSLVGEKHKVNIIQFGTDVSVMTANGDCIQQGEHNLSLHFLQSATPTMGNTDFW 505 Query: 319 SALKLMDEVVKERYNPAQWNIYAAQASDGDNWADDSPLCHEILAKKLLPVVRYYSYIEIT 378 L+ ++ PA+ + SDG L K+ P R ++ Sbjct: 506 KTLRY-LSLL----YPARGSRNILLVSDGH---LQDESLTLQLVKRSRPHTRLFACGIG- 556 Query: 379 RRAHQTLWREYEHLQSTFDNFAMQHIRD--QDDIYPVFREL 417 A++ + R + + + + I L Sbjct: 557 PTANRHILRILSQCGAGVFEYFNAKSKHSWRKQIEDQMTRL 597 >UniRef50_Q3M1S4 von Willebrand factor, type A n=8 Tax=Cyanobacteria RepID=Q3M1S4_ANAVT Length = 464 Score = 77.2 bits (188), Expect = 1e-12, Method: Composition-based stats. Identities = 23/227 (10%), Positives = 53/227 (23%), Gaps = 44/227 (19%) Query: 213 RKEIAELRAKIERVPFIDTFDLRYKNYEKRPDPSSQAVMFCLMDVSGSM----------- 261 E L+ + ++ ++D SGSM Sbjct: 8 SITPHREFMPAETEGQKLFLMLKLRPTKEVAVSRPPTTFAFVIDTSGSMYEIVTGETTPT 67 Query: 262 ----------------DQSTKDMAKRFYILLYLF--LSRTYKNVEVVYIRHHTQAKEVD- 302 +S D+ + L L + + V + +Q ++ Sbjct: 68 GVTYTQDAKEYSQVTGGKSKIDIVIESLLALVRSGRLEASDRVAIVQFDDTASQIIDLTP 127 Query: 303 -------EHEFFYSQE-TGGTIVSSALKLMDEVVKERYNPAQWNIYAAQASDGDNWADDS 354 E+ + +GGT + L+ +++ +DG + +D Sbjct: 128 ATQVSQLENAIAQLRSFSGGTRMGLGLRRALDML---SGQDMAVRRTLLFTDGQTFDEDI 184 Query: 355 PLCHEILAKKLLPVVRYYSYIEITRRAHQTLWREYEHLQSTFDNFAM 401 + E + L + + Sbjct: 185 CRALASDFATKNIPITALGVGE---DFKEDLLSHLSDSTGGTLFYVV 228 >UniRef50_UPI0000E47896 PREDICTED: similar to cache domain containing 1 n=1 Tax=Strongylocentrotus purpuratus RepID=UPI0000E47896 Length = 1395 Score = 77.2 bits (188), Expect = 1e-12, Method: Composition-based stats. Identities = 29/242 (11%), Positives = 61/242 (25%), Gaps = 33/242 (13%) Query: 197 IISNSEPAQLLEEERLRKEIAELRAKIERVPFIDTFDLRYKNYEKRPDPSSQAVMFCLMD 256 + + + L + D+R KN S + ++D Sbjct: 131 TLKWQYFSSEAGIHAIFPATQYLATHKYADQ--CSVDVRSKNLYASTVQPSPTNVVIIID 188 Query: 257 VSGSMDQSTKDMAKRFYILLYLFLSRTYKNVEVV-----------------YIRHHTQAK 299 S+ + +A++ LSR + + + K Sbjct: 189 HGSSISPVSLVIAQKAAKTALGALSRKDRVGVLSMGSEVVTSQPGSCYDDMLAPASAEVK 248 Query: 300 EVDEHEFFYSQE-TGGTIVSSALKLMDEVVKERYNPAQWNI-----YAAQASDGDNWADD 353 E + G + +SAL+ ++++ +P N S G D Sbjct: 249 EHLIKFINGIKAMDGPSNHTSALRTAFDLIQRTTSPMPLNQSKPDSVILYISTGHASNQD 308 Query: 354 SPLCHEILA----KKLLPVVRYYSYIEI----TRRAHQTLWREYEHLQSTFDNFAMQHIR 405 +A ++L V +Y + T R+ + Sbjct: 309 EVKAAINIAISENRRLNNRVAIMTYALVEEGRTGLEELAFLRDLAEQDNGTYRAVASDSD 368 Query: 406 DQ 407 Sbjct: 369 LP 370 >UniRef50_A9B2Y1 VWA containing CoxE family protein n=4 Tax=Bacteria RepID=A9B2Y1_HERA2 Length = 460 Score = 77.2 bits (188), Expect = 1e-12, Method: Composition-based stats. Identities = 42/356 (11%), Positives = 100/356 (28%), Gaps = 43/356 (12%) Query: 85 NDRIERPQGGGGGSGSGQGQASQDGEGQDEFVFQISKDEYLDLLFEDLALPNLKQNQQRQ 144 + + G G G F +S++E ++ + L +K+ R+ Sbjct: 126 GFQPGALRQSQPGQGQASQPGGLQGGQGVGSGFNLSEEELRQVI-QGLEKDLIKRMALRE 184 Query: 145 LTEYKTHRAGYTANGV------PANISVVRSLQNSLARRTAMTAGKRRELHALE---ENL 195 + + A T + + + + R + ++ L+ Sbjct: 185 VLQDNRLAAQLTPSMAVVEQLLRDKSHLSGNALINAKRLIKQYVDELADVLRLQVMQAVS 244 Query: 196 AIISNSEPAQLL-EEERLRKEIAELRAKIE-RVPFIDTFDLRYKNYEKRPDPSSQAVMFC 253 A I S P + + L++ I + L Y+ + M Sbjct: 245 AKIDRSVPPKRVFRNLDLKRTIWRNLTNWNSNEGRLYVDRLYYRQTA---KKRTPMRMIV 301 Query: 254 LMDVSGSMDQSTKDMAKRFYILLYLFLSRTYKNVEVVYIRHHTQAKEVDEHEFF------ 307 ++D SGSM + + + +V++ I T+ ++ Sbjct: 302 VVDQSGSMVDAMVQ------CTILASIFAGLPHVDMHLIAFDTRMLDLTPWVHDPFEVLL 355 Query: 308 YSQETGGTIVSSALKLMDEVVKERYNPAQWNIYAAQASDGDNWADDSPLCHEILAKKLLP 367 +Q GGT ++ AL E ++E A +D + + + + Sbjct: 356 RTQLGGGTSINEALLFASEKIQEPRKTA-----VVLITDFY-EGGSDQVLLDTIKAMIES 409 Query: 368 VVRYYSYIEITR----RAHQTLWREYEHLQSTFDNFAMQHIRDQDDIYPVFRELFH 419 V + +T + + + + + ++ Sbjct: 410 GVHFIPVGAVTSSGYFSVNDWFRTKLKEMGRPIFA------GSPRKLIEQIKQFIT 459 >UniRef50_UPI0001926619 PREDICTED: similar to calcium channel, voltage-dependent, alpha 2/delta subunit 1a, partial n=1 Tax=Hydra magnipapillata RepID=UPI0001926619 Length = 1260 Score = 77.2 bits (188), Expect = 1e-12, Method: Composition-based stats. Identities = 27/240 (11%), Positives = 61/240 (25%), Gaps = 51/240 (21%) Query: 237 KNYEKRPDPSSQAVMFCLMDVSGSMDQSTKDMAKRFYILLYLFLSRTYKNVEVVY----- 291 + + +S + L+D SGSM + +AK + L V++ Sbjct: 293 RRLWYQQSAASPKDVVILIDNSGSMTGTNIVIAKIAAKSIIDTLDENDYFN-VIFAGEDP 351 Query: 292 --------------------------IRHHTQAKEV--------------DEHEFFYSQE 311 + +++ Sbjct: 352 SMCSVKKSSSFVLEAKEWTVEENKVQVSFDVINLYPSVPLDKVILDITYSLDNDKNNLNL 411 Query: 312 TGG---TIVSSALKLMDEVVKERYNPAQWNIYAAQASDGDNWADDSPLCHEILAKKLLPV 368 T + ++L + R ++ N SDG + Sbjct: 412 RTKLSLTDIHKIIELCLRLEYNRTYSSRCNKIILFFSDGIEGDYSNTAKSFFDKWNNDKS 471 Query: 369 VRYYSYIEITR-RAHQTLWREYEHLQSTFDNFAMQHIRDQDDIYPVFRELFHKQNATAKG 427 VR ++Y+ + + +E + +Q + + D + E+ + KG Sbjct: 472 VRVFTYLVGRTKNPNDRVLKEMACNNRGHF-YQIQTLGNVWDTVIQYLEVLSRPIVQHKG 530 >UniRef50_P58335-2 Isoform 2 of Anthrax toxin receptor 2 n=3 Tax=Homininae RepID=P58335-2 Length = 386 Score = 77.2 bits (188), Expect = 1e-12, Method: Composition-based stats. Identities = 21/164 (12%), Positives = 47/164 (28%), Gaps = 14/164 (8%) Query: 239 YEKRPDPSSQA--VMFCLMDVSGSMDQSTKDMAKRFYILLYLFLSRTYKNVEVVYIRHHT 296 + PS + ++ ++D SGS+ + ++ L F+S + +V+ T Sbjct: 31 LRAQEQPSCRRAFDLYFVLDKSGSVANNWIEIYNFVQQLAERFVSPEMRLSFIVFSSQAT 90 Query: 297 QAKEVD---------EHEFFYSQETGGTIVSSALKLMDEVVKERYNPAQWNIYAAQASDG 347 + + G T + LKL +E +++ + + +DG Sbjct: 91 IILPLTGDRGKISKGLEDLKRVSPVGETYIHEGLKLANEQIQKA-GGLKTSSIIIALTDG 149 Query: 348 DNWADDSPLCHEILAKKLLPVVRYYSYIEITRRAHQTLWREYEH 391 + Y + Q Sbjct: 150 KLDGLVPSYAEKEAKISRSLGASVYCVGVL--DFEQAQLERIAD 191 >UniRef50_O76836 Putative uncharacterized protein n=4 Tax=Caenorhabditis RepID=O76836_CAEEL Length = 1028 Score = 77.2 bits (188), Expect = 1e-12, Method: Composition-based stats. Identities = 35/227 (15%), Positives = 54/227 (23%), Gaps = 19/227 (8%) Query: 186 RELHALEENLAIISNSEPAQLLEEERLRKEIAELRAKIERVPFIDTFDLRYKNYEKRPDP 245 + A +E ++ S + +L + I LR + P Sbjct: 327 ADARAADEVISKESWT---ELANSDPFATLICSFFKLPTTQMSSQLEYLRTERDAVIDTP 383 Query: 246 SSQA-VMFCLMDVSGSMDQSTKDMAKRFYILLYLFLSRTYKNVEVVYIRHHTQAKEV--- 301 + M D S S+ F L N V I + EV Sbjct: 384 KCEVLDMIIAFDTSESLSSLIVPQYVDFAKKLVAQYKYGNDNTRVGIITFSSDVVEVRKL 443 Query: 302 --------DEHEFFYSQETGG-TIVSSALKLMDEVVKERYNPAQWNIYAAQASDGDNWAD 352 TGG T V+ A + N + N +DG D Sbjct: 444 TDGNTLDAVNAAIDTVHYTGGLTNVTKAQLTAKNLFDTESNANR-NKVLFILTDGVPTVD 502 Query: 353 DSPLCHEILAKKLLPVVRYYSYIEITRRAHQTLWREYEHLQSTFDNF 399 A L + S+ + E + F Sbjct: 503 TYTDEVA--AGDKLKSISVISFFVGYSSYSDEVKTELGKVSEPKYIF 547 >UniRef50_Q4SJ97 Chromosome 4 SCAF14575, whole genome shotgun sequence. (Fragment) n=1 Tax=Tetraodon nigroviridis RepID=Q4SJ97_TETNG Length = 628 Score = 77.2 bits (188), Expect = 1e-12, Method: Composition-based stats. Identities = 19/178 (10%), Positives = 46/178 (25%), Gaps = 14/178 (7%) Query: 250 VMFCLMDVSGSMDQSTKDMAKRFYILLYLFLSRTYKNVEVVYIRHHTQAKEVDEHEFFYS 309 ++ ++D SGS+ ++ + L + F+S + +V+ + + Sbjct: 45 DLYFVLDKSGSVQHYWNEIFYFVHHLAHKFISPQMRMSFIVFST-DGRTLMALTEDRDKI 103 Query: 310 QE----------TGGTIVSSALKLMDEVVKERYNPA-QWNIYAAQASDGDNWADDSPLCH 358 + G T + L E + + +DG+ D Sbjct: 104 RAGLEELRMVQPGGDTYMDRGLHRASEQIYYAAGDGYRAASVIIALTDGELREDQFDTAQ 163 Query: 359 EILAKKLLPVVRYYSYIEITRRAHQTLWREYEHLQSTFDNFAMQHIRDQDDIYPVFRE 416 + Y ++T + Q I + + Sbjct: 164 REAGRARQLGASVYCVGLK--DFNETQLSTIADSKDHVFPVHDGFEALQSVIDSILKR 219 >UniRef50_UPI0000584198 PREDICTED: similar to polydom protein n=1 Tax=Strongylocentrotus purpuratus RepID=UPI0000584198 Length = 1500 Score = 77.2 bits (188), Expect = 1e-12, Method: Composition-based stats. Identities = 26/185 (14%), Positives = 52/185 (28%), Gaps = 23/185 (12%) Query: 249 AVMFCLMDVSGSMDQSTKDMAKRFYILLYLFLSRTYKNVEVVYIRHHTQAKEVDEHEFFY 308 + + ++D SGS+ QS ++ F +S + V I + + + + Sbjct: 42 SELVFILDSSGSVAQSDFIISVEFLKFASKIISVSASTTRVAVISYSSCNQIHIRVNYIS 101 Query: 309 SQET-----------------GGTIVSSALKLMDEVVKERYNPAQWNIYAAQASDGDNWA 351 S E GGT + AL+ V P +DG + Sbjct: 102 SPENKNKCTFDNDLTSVNYHPGGTCTAGALEAAGRDVLSHGRPGA-QRVVMLLTDGASND 160 Query: 352 DDSPLCHEILAKKLLPVVRYYSYIEITRRAHQTLWREYEHLQSTFDNFAMQHIRDQDDIY 411 P + K + I + + + D ++ Sbjct: 161 GGPPHANAQKLKSEGVKIFTIGIGSI----KLSELNAIATSVD-EYVYILADFGDVRNLA 215 Query: 412 PVFRE 416 V ++ Sbjct: 216 TVVKD 220 >UniRef50_Q0I303 Putative uncharacterized protein n=5 Tax=Pasteurellaceae RepID=Q0I303_HAES1 Length = 343 Score = 77.2 bits (188), Expect = 1e-12, Method: Composition-based stats. Identities = 20/162 (12%), Positives = 43/162 (26%), Gaps = 17/162 (10%) Query: 247 SQAVMFCLMDVSGSMDQSTKDMAKRFYILLYLFLS------RTYKNVEVVYIRHHT---Q 297 + +F ++DVS SM + + L L + + + Sbjct: 2 RRLPIFLVVDVSESMAGDSHRQMQEAINRLVQRLRCDPYALESVYISVIAFAGAAGVIAP 61 Query: 298 AKEVDEHEFFYSQETGGTIVSSALKLMDEVVKER------YNPAQWNIYAAQASDGDNWA 351 E+ GT + +AL L + ++ + SDG Sbjct: 62 LTELMSFYAPRLPMGSGTSLGAALNLTMDEIQRNVVRSSGDQKGDFKPLVYILSDGVATD 121 Query: 352 DDSPLCHEILAKKLLPVVRYYSYIEITRRAHQTLWREYEHLQ 393 D + ++ + + A + + L Sbjct: 122 -DPTSAIQRWQQEFKSRTKLIAVGLGN-FADLSALNQIAELT 161 >UniRef50_Q08X07 Von Willebrand factor type A domain protein n=2 Tax=Cystobacterineae RepID=Q08X07_STIAU Length = 884 Score = 77.2 bits (188), Expect = 1e-12, Method: Composition-based stats. Identities = 28/221 (12%), Positives = 53/221 (23%), Gaps = 23/221 (10%) Query: 226 VPFIDTFDLRYKNYEKRPDPSSQAVMFCLMDVSGSMD------QSTKDMAKRFYILLYLF 279 + L + + + LMD S SM ++ ++A + Sbjct: 343 RRSVLEPVLPVSLELREEQRRTSVALSVLMDCSCSMGVTVPDGRTKMEVAAEGVVGALTL 402 Query: 280 LSRTYKNVEVVYIRHHTQAKEV--------DEHEFFYSQETGGTIVSSALKLMDEVVKER 331 L+ + + + GG V AL+ + Sbjct: 403 LNEKDDASVHMVDTEPHEIFSLSSVGEGLPLNKVARGFSGGGGIFVGEALREGKTQI--- 459 Query: 332 YNPAQWNIYAAQASDG-DNWADDSPLCHEILAKKLLPVVRYYSYIEITRRAHQTLWREYE 390 + + SD D+ D ++ V + L RE Sbjct: 460 LRSDKATRHVLLFSDAADSEEPDDYRATLAALRRENVTVSVIGLGTPK-DSDADLLREVA 518 Query: 391 HLQSTFDNFAMQHIRDQ----DDIYPVFRELFHKQNATAKG 427 L FA + + V R F + + Sbjct: 519 QLGGGRIYFAEDALSLPRIFSQETITVARAAFVDVPTSLEA 559 >UniRef50_P06681 Complement C2a fragment n=39 Tax=Tetrapoda RepID=CO2_HUMAN Length = 752 Score = 77.2 bits (188), Expect = 1e-12, Method: Composition-based stats. Identities = 22/208 (10%), Positives = 52/208 (25%), Gaps = 34/208 (16%) Query: 244 DPSSQAVMFCLMDVSGSMDQSTKDMAKRFYILLYLFL---SRTYKNVEVVYIRHHTQAKE 300 S ++ L+D S S+ ++ + K L+ + + + Sbjct: 248 QRSGHLNLYLLLDCSQSVSENDFLIFKESASLMVDRIFSFEINVSVAIITFASEPKVLMS 307 Query: 301 VDEHEFFYSQE---------------TGGTIVSSALKLMDEVVKERYN--------PAQW 337 V E GT +AL + ++ + + Sbjct: 308 VLNDNSRDMTEVISSLENANYKDHENGTGTNTYAALNSVYLMMNNQMRLLGMETMAWQEI 367 Query: 338 NIYAAQASDGDNWADDSP-------LCHEILAKKLLPVVRYYSYIEITRRAHQTLWREYE 390 +DG + SP + +K + Y+ E Sbjct: 368 RHAIILLTDGKSNMGGSPKTAVDHIREILNINQKRNDYLDIYAIGVGKLDVDWRELNELG 427 Query: 391 HLQST-FDNFAMQHIRDQDDIYPVFREL 417 + F +Q + ++ ++ Sbjct: 428 SKKDGERHAFILQDTKALHQVFEHMLDV 455 >UniRef50_UPI000180C2AF PREDICTED: similar to FiBrilliN homolog family member (fbn-1) n=1 Tax=Ciona intestinalis RepID=UPI000180C2AF Length = 990 Score = 76.8 bits (187), Expect = 1e-12, Method: Composition-based stats. Identities = 25/184 (13%), Positives = 59/184 (32%), Gaps = 20/184 (10%) Query: 245 PSSQAVMFCLMDVSGSMDQSTKDMAKRFYILLYLFLSRTYKNVEVVYIRHHT-------- 296 P ++ + +MD SGS+ + K+F +Y + + + + + H+ Sbjct: 808 PKARMDLIFIMDSSGSIGEENFKTMKQFVKNVYERFTLSDEFTRIAVVTFHSVVQLANDT 867 Query: 297 ---QAKEVDEHEFFYSQETG-GTIVSSALKLMDEVVKERYNPAQWNIYAAQASDGDNWAD 352 +K ++ Q G GT+ AL E + + +DG + Sbjct: 868 EWFYSKTELDNAIDSLQFAGKGTLTGQALTFTREHLIGKR--EGSTNVVIAVTDG--NSK 923 Query: 353 DSPLCHEILAKKLLPVVRYYSYIEITRRAHQTLWREYEHLQSTFDNFAMQHIRDQDDIYP 412 D+ + + V + ++ + ++ + D +D+ Sbjct: 924 DNSKAAAAELRNM--NVHVMAVGITGSHLRD--LSMIASKPASENVLSLSQVEDINDVID 979 Query: 413 VFRE 416 V Sbjct: 980 VVGR 983 >UniRef50_UPI00017B4DF5 UPI00017B4DF5 related cluster n=3 Tax=Tetraodontidae RepID=UPI00017B4DF5 Length = 2436 Score = 76.8 bits (187), Expect = 1e-12, Method: Composition-based stats. Identities = 31/191 (16%), Positives = 62/191 (32%), Gaps = 23/191 (12%) Query: 245 PSSQAVMFCLMDVSGSMDQSTKDMAKRFYILLYLFLSRTYKNVEVVYIRHHTQAKE---- 300 +A + L+D SGS+ K+F I L + V V + + K+ Sbjct: 984 EKQKADLVFLLDQSGSIQSDDYTTMKKFTIDLINKFQISRDLVHVGLAQFSSTFKDEFYL 1043 Query: 301 --------VDEHEFFYSQETGGTIVSSALKLMDEVVKE---RYNPAQWNIYAAQASDGDN 349 + H QE GGT++ AL + + + + +DGD Sbjct: 1044 NKFFDEQAISAHIKDMQQEEGGTLIGLALNSIRKYFEASHGSRKAEGISQNLVLITDGD- 1102 Query: 350 WADDSPLCHEILAKKLLPVVRYYSYIEITRRAHQTLWREYEHLQSTFDNFAMQHIRDQDD 409 + D L + L V ++ H + + + F +++ + Sbjct: 1103 -SQDDVEEAARLLRGL--GVEVFAIGIG--NVHDLELLQIA--GTPENVFTVKNFDKLEG 1155 Query: 410 IYPVFRELFHK 420 I+ + + Sbjct: 1156 IHQKVVDTICQ 1166 Score = 73.0 bits (177), Expect = 2e-11, Method: Composition-based stats. Identities = 25/181 (13%), Positives = 50/181 (27%), Gaps = 21/181 (11%) Query: 249 AVMFCLMDVSGSMDQSTKDMAKRFYILLYLFLSRTYKNVEVVYIRHHTQAKEVDEHEF-- 306 A + L+D S S+ S + F L L+ + N+ + + +E Sbjct: 3 ADIVFLVDGSSSIGPSNFQEVRLFLRSLASGLNVSPDNIRIGLAQFDEPHQEFLLKYHIE 62 Query: 307 --FYSQE-------TGGTIVSSALKLMDEVV----KERYNPAQWNIYAAQASDGDNWADD 353 GGT A+ + + + A +DGD+ D Sbjct: 63 KMNLLAAFESFPYRNGGTETGKAINFLRKQYFTKKAGSRADQRVPQIAVVITDGDST--D 120 Query: 354 SPLCHEILAKKLLPVVRYYSYIEITRRAHQTLWREYEHLQSTFDNFAMQHIRDQDDIYPV 413 + +K + A+Q + + S F + + + Sbjct: 121 DVVVPARELRKHG----VIVFAIGVGNANQGELKSIANRPSERFKFTIDSFQALKRLTER 176 Query: 414 F 414 Sbjct: 177 L 177 Score = 69.5 bits (168), Expect = 2e-10, Method: Composition-based stats. Identities = 25/183 (13%), Positives = 50/183 (27%), Gaps = 21/183 (11%) Query: 245 PSSQAVMFCLMDVSGSMDQSTKDMAKRFYILLYLFLSRTYKNVEVVYIRHHT-------- 296 + L+D SGS+ K F L + V V +++ T Sbjct: 596 KDVPGDLIFLIDSSGSIYPEDYQKMKDFMKSLVQKSNIGKDQVHVGVLQYSTEQKLVFPL 655 Query: 297 ---QAKEVDEHEFFYSQE-TGGTIVSSALKLMDEVVK-ERYNPAQWNIYAAQASDGDNWA 351 K+ Q+ GGT A+ ++ + + +DG+ + Sbjct: 656 IQYYTKDQLSKAIDDMQQIGGGTHTGEAIAVVSKYFDAQNGGRPDLKQRLVVVTDGE--S 713 Query: 352 DDSPLCHEILAKKLLPVVRYYSYIEITRRAHQTLWREYEHLQSTFDNFAMQHIRDQDDIY 411 D + +V + A+ + E +A + D+ Sbjct: 714 QDDVKLPAEALRAKGVIVYSIGVV----AANTSQLLEIS--GDADRMYAERDFDALKDLE 767 Query: 412 PVF 414 Sbjct: 768 KQM 770 Score = 66.8 bits (161), Expect = 2e-09, Method: Composition-based stats. Identities = 30/202 (14%), Positives = 53/202 (26%), Gaps = 26/202 (12%) Query: 238 NYEKRPDPSSQAVMFCLMDVSGSMDQSTKDMAKRFYILLYLFLS-------------RTY 284 A +F L+D G + Q+ + F L L+ Sbjct: 190 QVFPSALKEKFADVFILLD--GGITQAEFRQIRTFLGSLVNQLNFSPSTYRLGLAQYGQD 247 Query: 285 KNVEVVYIRHHTQAKEVDEHEFFYSQETGGTIVSSALKLMDEVV----KERYNPAQWNIY 340 V+ ++ H T + + GGT +AL V K Sbjct: 248 IKVDFLFKDHQTNKDLLTAVKNAQQHHGGGTNTGAALNFTQHQVFVREKGSRIELGVQQV 307 Query: 341 AAQASDGDNWADDSPLCHEILAKKLLPVVRYYSYIEITRRAHQTLWREYEHLQSTFDNFA 400 A +DG + D S A L V Y+ A + + + F Sbjct: 308 AVVITDGRSQDDVSTP-----AANLRAGVTVYAVGVK--DADEAQLHQIASYPTKEHTFT 360 Query: 401 MQHIRDQDDIYPVFRELFHKQN 422 + + + + + Sbjct: 361 VDSFSKLKTLETSLQRVMCQNV 382 Score = 63.8 bits (153), Expect = 1e-08, Method: Composition-based stats. Identities = 27/153 (17%), Positives = 40/153 (26%), Gaps = 19/153 (12%) Query: 249 AVMFCLMDVSGSMDQSTKDMAKRFYILLYLFLS---RTYKNVEVVYIRHHTQAKEV---- 301 A +F L+D SGS+ K+F + + V Y T ++ Sbjct: 393 ADIFFLIDQSGSIHPPDFYDMKKFILEFLQTFRVGPNHVRIGVVKYADSPTLEFDLHTYT 452 Query: 302 ----DEHEFFYS-QETGGTIVSSALKLMDEVVKERYNPAQWNI--YAAQASDGDNWADDS 354 E Q GGT AL M + Y +DG + D Sbjct: 453 DVKSLEKAITNIHQVGGGTETGKALDFMRPQFDRAVTTRGHKVKEYLVVITDG--NSTDK 510 Query: 355 PLCHEILAKKLLPVVRYYSYIEITRRAHQTLWR 387 + VV + L Sbjct: 511 VKDPADKLRAQGVVVYAIGV---KDAVEKELLE 540 Score = 63.0 bits (151), Expect = 2e-08, Method: Composition-based stats. Identities = 25/216 (11%), Positives = 51/216 (23%), Gaps = 27/216 (12%) Query: 229 IDTFDLRYKNYEKRPDPSSQAVMFCLMDVSGSM-DQSTKDMAKRFYILLYLFLSRTYKNV 287 D + + ++QA + L+DVS S+ + F + S + Sbjct: 775 CDPDRVEVRPVISDCKKTAQADIIFLVDVSTSILKEKAFPSVTVFMESVVNQSSVGPELT 834 Query: 288 EVVYIRHHTQAKEVD-----EHEFFYSQE-------TGGTIVSSALKLMDEVVKERYNPA 335 I T + + + Q G T AL + + + Sbjct: 835 RFGVITFSTGVQSIFTLKQYSSKRDVLQAVGAVTAPGGNTNTGDALDYSLQYFGKEHGGR 894 Query: 336 ---QWNIYAAQASDG---DNWADDSPLCHEILAKKLLPVVRYYSYIEITRRAHQTLWREY 389 + +DG + P V +S A + Sbjct: 895 AALKVPQILMVITDGAAQEPSKLPGPSEALR-----KQGVSVFSIGVK--NASREQLDIM 947 Query: 390 EHLQSTFDNFAMQHIRDQDDIYPVFRELFHKQNATA 425 F + + +Y ++ Sbjct: 948 AG-NDPSRVFFVDTFDALETLYKNISKVLCNHTKPV 982 >UniRef50_C1F3L6 von Willebrand factor type A domain protein n=1 Tax=Acidobacterium capsulatum ATCC 51196 RepID=C1F3L6_ACIC5 Length = 410 Score = 76.8 bits (187), Expect = 1e-12, Method: Composition-based stats. Identities = 31/291 (10%), Positives = 78/291 (26%), Gaps = 17/291 (5%) Query: 141 QQRQLTEYKTHRAGYTANGVPANISVVRSLQNSLARRTAMTAGKRRELHALEENL--AII 198 Q + + + V+ +A ++ ++ + Sbjct: 73 APAQTVLANPKQPETQPSLLVDRDPVLSPDYEDDQPVSASRPLSQQRAREIQHLRGNEYM 132 Query: 199 SNSEPAQLLEEERLRKEIAELRAKIERVPFIDTFDLRYKNYEKRPDPSSQAVMFCLMDVS 258 +++ + + L + R F D + + M L+D S Sbjct: 133 IRRNVDEVVLNCTVVDKKGRLVTDLTRQDFHVWEDNKPQVIASFSHEDLPVSMGILVDNS 192 Query: 259 GSMDQSTKDMAKRFYILLYLFLSRTYKNVEVVYIRHH------TQAKEVDEHEFFYSQET 312 GSM + + + L + + V + T + E +++ Sbjct: 193 GSMQN-KLNAVDKAALDLVRASNPDDEAFIVNFSDQAYLDQGFTSSIAKLEQGLAHTEAR 251 Query: 313 GGTIVSSALKLMDEVVKERYNPAQWNIYAAQASDGDNWADDSPLCHEILAKKLLPVVRYY 372 GGT + A+ + + + + +DG++ A L I + L Y Sbjct: 252 GGTALYDAIVASADELSKDARHPK--QVLLVVTDGEDDASTMNLQQAIQRVQALHGPEIY 309 Query: 373 SYIEITRRAHQT------LWREYEHLQSTFDNFAMQHIRDQDDIYPVFREL 417 + + + + F + V +++ Sbjct: 310 AIGLLYDDSGDEAHRARKALEQLTEQTGGLAYFPRSLENVDEVAAEVAKDI 360 >UniRef50_UPI0001788256 von Willebrand factor type A n=1 Tax=Geobacillus sp. Y412MC10 RepID=UPI0001788256 Length = 595 Score = 76.8 bits (187), Expect = 1e-12, Method: Composition-based stats. Identities = 30/198 (15%), Positives = 49/198 (24%), Gaps = 25/198 (12%) Query: 240 EKRPDPSSQAVMFCLMDVSGSMDQSTKDMAK-RFYILLYLFL-SRTYKNVEVVYIR---- 293 S +MD S SM S + + L + K V Y Sbjct: 29 AAAASQGSNIDAVLVMDASNSMKNSDPERISGEAMKMFIDMLATTGDKVGIVSYTDRIQR 88 Query: 294 -------HHTQAKEVDEHEFFYSQETGGTIVSSALKLMDEVVKERYNPAQWNIYAAQASD 346 K + T +S L +V+K+ +PA +D Sbjct: 89 EKALLEIQSEADKTALKEFIDQLDRGPYTDMSVGLDEAVKVLKQGMDPA-HAPMIVVLAD 147 Query: 347 GDN-----WADDSPLCHEIL----AKKLLPVVRYYSYIE-ITRRAHQTLWREYEHLQSTF 396 G+N S E L + + Y+ + ++ E Sbjct: 148 GNNDLDPNTGRTSKEASEQLNQAVKEAKGSGIPIYTIGLNADGKLNKETLAELAKQTGGK 207 Query: 397 DNFAMQHIRDQDDIYPVF 414 +F D I Sbjct: 208 -SFTTSSADDLPQILSEI 224 >UniRef50_Q19NM5 TerY1 n=54 Tax=root RepID=Q19NM5_ECOK1 Length = 239 Score = 76.8 bits (187), Expect = 1e-12, Method: Composition-based stats. Identities = 27/167 (16%), Positives = 47/167 (28%), Gaps = 19/167 (11%) Query: 241 KRPDPSSQAVMFCLMDVSGSMDQSTKDMAKRFYILLYLFLSRT------YKNVEVVYIRH 294 K+ + ++ L+D SGSM + K L L + + + Sbjct: 23 KKELHLRRLPVYLLLDTSGSMHGEPIEAVKNGVQTLLTTLKQDPYALETAYVSVITFDSS 82 Query: 295 HTQAKEVDE---HEFFYSQETGGTIVSSALKL-MDEVVKE-----RYNPAQWNIYAAQAS 345 QA + + + +G T + AL L + KE W + Sbjct: 83 ARQAVPLTDLLSFQMPALTASGTTSLGEALSLTASSIAKEVQKTTADTKGDWRPLVFLMT 142 Query: 346 DGDNWADDSPLCHEILAKKLLPVVRYYSYIEITRRAHQTLWREYEHL 392 DG D K V A ++ +E + Sbjct: 143 DGSPN--DDWRKGLNDFKAARTGVVVAC--AAGHDADTSVLKEITEI 185 >UniRef50_A9GY82 Putative uncharacterized protein n=1 Tax=Sorangium cellulosum 'So ce 56' RepID=A9GY82_SORC5 Length = 1457 Score = 76.8 bits (187), Expect = 1e-12, Method: Composition-based stats. Identities = 46/429 (10%), Positives = 100/429 (23%), Gaps = 33/429 (7%) Query: 19 RQRFLRRYKAQIKQSISEAINKRSVTDVDSGESVSIPTEDISEPMFHQGRGGL--RHRVH 76 + R L + ++ + + I + V S + + Sbjct: 274 QARTLEISRQAVRAVLRDGIAETEVDQTFSN----PGGRQVEGWYWFTVPERAIVSSFAV 329 Query: 77 PGNDHFVQNDRIERPQGGGGGSGS-GQGQASQDGEGQDEFVFQISKDEYLDLLFED--LA 133 N V+ + IER + + G E D ++ L Sbjct: 330 ETNGVLVEGEVIERKEAAQRYQTAVQTGHEPALLEWVDGHSYRARIFPVPASGSRRVVLR 389 Query: 134 LPNLKQNQQRQLTEYKTHRAGYTANGVPANISVV-RSLQNSLARRTAMTAGKRRELHALE 192 + +L+ RA ++SV L T A + Sbjct: 390 YVEMLSAPSGKLSYVYPMRASDPVQIGEFSLSVDLGQAGQGLQIATLADAVVEDGGRRVS 449 Query: 193 ENLAIISNSEPAQLLEEERLRKEIAELRAKIERVPFIDTFDLRY-KNYEKRPDPSSQAVM 251 + QL +++ + D RY + + A + Sbjct: 450 MRRSGYEPRADFQLEASVKVKPSPLRVSRFAAGEDRADYVMARYVPDVDWAELKELPADL 509 Query: 252 FCLMDVSGSMDQSTKDMAKRFYILLYLFLSRTYKNVEVVYIRHHTQAKE---VDEHEFFY 308 ++D S S D+S + + +S + + + E Sbjct: 510 VVVVDTSASADESARQQKAAAAEAILRAMSPSDHFALIALDSAPAVLHPKDGLAEASDKE 569 Query: 309 SQE----------TGGTIVSSALKLMDEVVKERYNPAQWNIYAAQASDGDNWAD--DSPL 356 G T + + + + DG + Sbjct: 570 ISAALTRLAEHATGGATDLGALFESAL-----GRVHGKEQPAVIYIGDGLATSGEVTGAR 624 Query: 357 CHEILAKKL-LPVVRYYSYIEITRRAHQTLWREYEHLQSTFDNFAMQHIRDQDDIYPVFR 415 E L + L R+++ ++ L RE + ++ + Sbjct: 625 LAERLRRSLTGSRARFFTVGVG-ENSNHALLRELARAGGGQAFRIDEAEGSTSEVLRLAG 683 Query: 416 ELFHKQNAT 424 + Sbjct: 684 AIKTPTITD 692 >UniRef50_C0CX78 Putative uncharacterized protein n=1 Tax=Clostridium asparagiforme DSM 15981 RepID=C0CX78_9CLOT Length = 547 Score = 76.8 bits (187), Expect = 1e-12, Method: Composition-based stats. Identities = 34/221 (15%), Positives = 67/221 (30%), Gaps = 13/221 (5%) Query: 189 HALEENLAIISNSEPAQLLEEERLRKEIAELRAKIERVPFIDTFDLRYKNYEKRPDPSSQ 248 E N A + + AQ + + + + + L + Sbjct: 22 AFAETNQAFVERAYQAQEGQMDVICAIPGQEGNGENFQAMLGEQSLPVLSVSTAEQSGLP 81 Query: 249 AVMFCLMDVSGSMDQSTKDMAKRFYILLYLFLSRTYKNVE------VVYIRHHTQAKEVD 302 ++CL+DVSGSM + K + L+ V + + +E+ Sbjct: 82 KTIYCLVDVSGSMKG-RMEQVKETLTAISGGLNENDNLVIGKMGNQITDSAFLSGQEEI- 139 Query: 303 EHEFFYSQETG-GTIVSSALKLMDEVVKERYNPAQWNIYAAQASDG--DNWADDSPLCHE 359 + + Q TG T + S L + +++ + SDG D A + Sbjct: 140 KAQIDSLQYTGEDTDLYSGLIHGLKFLQQE-PEVKTLRALVVLSDGCDDQGAGSTWKEAY 198 Query: 360 ILAKKLLPVVRYYSYIEITRRAHQT-LWREYEHLQSTFDNF 399 +K V + I + Q + + +F Sbjct: 199 DAVEKADIPVYTVAVILSEKDYEQAKELGSFARNSAGGLHF 239 >UniRef50_P20702 Integrin alpha-X n=58 Tax=Theria RepID=ITAX_HUMAN Length = 1163 Score = 76.8 bits (187), Expect = 1e-12, Method: Composition-based stats. Identities = 22/204 (10%), Positives = 53/204 (25%), Gaps = 15/204 (7%) Query: 239 YEKRPDPSSQAVMFCLMDVSGSMDQSTKDMAKRFYILLYLFL-SRTYKNVEVVYIR--HH 295 ++ P + + L+D SGS+ F + + + + + Sbjct: 140 VSRQECPRQEQDIVFLIDGSGSISSRNFATMMNFVRAVISQFQRPSTQFSLMQFSNKFQT 199 Query: 296 TQAKEVDEHEFFYS-------QETGGTIVSSALKLMDEVVKERYNPAQWN--IYAAQASD 346 E Q G T ++A++ + + A+ + +D Sbjct: 200 HFTFEEFRRSSNPLSLLASVHQLQGFTYTATAIQNVVHRLFHASYGARRDAAKILIVITD 259 Query: 347 GDNWADD-SPLCHEILAKKLLPVVRYYSYIEITRRAHQ-TLWREYEHLQSTFDNFAMQHI 404 G D +A + + + + S F ++ Sbjct: 260 GKKEGDSLDYKDVIPMADAAGIIRYAIGVGLAFQNRNSWKELNDIASKPSQEHIFKVEDF 319 Query: 405 RDQDDIYPVFR-ELFHKQNATAKG 427 DI + ++F + Sbjct: 320 DALKDIQNQLKEKIFAIEGTETTS 343 >UniRef50_UPI0001BC5690 magnesium chelatase n=1 Tax=Fusobacterium sp. D12 RepID=UPI0001BC5690 Length = 605 Score = 76.8 bits (187), Expect = 1e-12, Method: Composition-based stats. Identities = 41/323 (12%), Positives = 90/323 (27%), Gaps = 46/323 (14%) Query: 109 GEGQDEFVFQISKDEYLDLLFEDLALPNLKQNQQRQLTEYKTHRAGYTANGVPANISVVR 168 E + EFV K+E + + A+ L + L G + Sbjct: 312 MEREQEFVPPKEKEESTFSIGDTFAVKELVHKKTLHLK---------KRRGSGKRLKTTT 362 Query: 169 SLQNSLARRTAMTAGKRRELHALEENLAIISNSEPAQLLEEERLRKEIAELRAKIERVPF 228 SL+ R KR+ + K F Sbjct: 363 SLKQ--GRDIKSGFPKRKMEDFAFAATIRAAAPHQ------------------KRREKKF 402 Query: 229 IDTFDLRYKNYEKRPDPSSQAVMFCLMDVSGSMDQSTKDMAKRFY--ILLYLFLSRTYKN 286 + + K + + ++D SGSM + A + LL + K Sbjct: 403 VKISIQKEDIRIKIREKRIGTHILFVVDSSGSMGAKKRMRAVKGAIFSLLQDAYEKRDKV 462 Query: 287 VEVVY-------IRHHTQAKEVDEHEFFYSQETGGTIVSSALKLMDEVVKERYNPAQ-WN 338 V + + T++ E+ + + G T ++ L +++++ Sbjct: 463 ALVAFRKKSAEELLSMTRSIELAKKQLQNLATGGKTPLAEGLFKAYQLIRQLKKKDGEIY 522 Query: 339 IYAAQASDGDNW----ADDSPLCHEILAKKLLP-VVRYYSYIEITRRAHQTLWREYEHLQ 393 SDG D +A+K+ + + + Sbjct: 523 PLLVLISDGRANISLHGRDPIEESLEMARKIKKEGISSVVIDTEEGFTLLEMAKNISEA- 581 Query: 394 STFDNFAMQHIRDQDDIYPVFRE 416 + + +++I +D+ + ++ Sbjct: 582 MGAEYYRLENI-QAEDMLKLLKK 603 >UniRef50_A3ZZR7 Putative uncharacterized protein n=1 Tax=Blastopirellula marina DSM 3645 RepID=A3ZZR7_9PLAN Length = 373 Score = 76.8 bits (187), Expect = 2e-12, Method: Composition-based stats. Identities = 29/214 (13%), Positives = 62/214 (28%), Gaps = 37/214 (17%) Query: 236 YKNYEKRPDPSSQAVMFCLMDVSGSM--------------------DQSTKDMAKRFYIL 275 ++ + ++D SGSM ++ Sbjct: 156 FQPVADATASQIDRDIALVVDRSGSMTFRINRNSYESGWRNNDPVPSRARWWALVDSVDG 215 Query: 276 LYLFLSRTYKNVEVVYIRHHTQAK------------EVDEHEFFYSQETGGTIVSSALKL 323 L T + V +++ AK E ++ G T +++ + Sbjct: 216 FLTELGSTPQLELVSLSTYNSSAKIDEQLTDKYSRIEDALDDYSRRYPDGSTNITAGMDR 275 Query: 324 MDEVVKER-YNPAQWNIYAAQASDGDNWADDSPLCHEILAKKLLPVVRYYSYIEITRRAH 382 ++ + Y + +DG++ SP A VV +Y + A+ Sbjct: 276 GISTLQNKKYARPYASKTMVVMTDGNHNYGSSPTNAAYDAASDDIVVHTITY---SDGAN 332 Query: 383 QTLWREYEHLQSTFDNFAMQHIRDQDDIYPVFRE 416 Q+L RE + A + ++I+ Sbjct: 333 QSLMREVARIGGGQHWHAPDG-DELEEIFREIAR 365 >UniRef50_A3JLW1 Putative uncharacterized protein n=1 Tax=Rhodobacterales bacterium HTCC2150 RepID=A3JLW1_9RHOB Length = 354 Score = 76.8 bits (187), Expect = 2e-12, Method: Composition-based stats. Identities = 23/180 (12%), Positives = 39/180 (21%), Gaps = 22/180 (12%) Query: 248 QAVMFCLMDVSGSMDQSTKDMAKRFYILLYLFLSRTYKNVEVVYIRHHTQAKEVDE---- 303 L+D SGSM + L T + + + E Sbjct: 165 PMSFVLLVDRSGSMA-EIMPEVREAAKEFVAALPDTAECSVSSFAGDWDFSHRGPEGALT 223 Query: 304 -----HEFFYSQETGGTIVSSALKLMDEVVKERYNPAQWNIYAAQASDGD-NWADDSPLC 357 F Q G T + L+ + E +DG N S Sbjct: 224 CKPENFAFDNIQPGGTTNIYGPLREAYGWLSESERTD-HQKAVILLTDGRANDDAASESQ 282 Query: 358 HEILAKKLLPVVRYYSYIEITRRAHQTLWREYEHL----QSTFDNFAMQHIRDQDDIYPV 413 + Y+++ + R ++ D Y Sbjct: 283 TLAMKDDA------YTFVYYMGDSDDRWLRSLADNYFSGGGHVSAQLERYFNVVSDAYSA 336 >UniRef50_UPI0000F2E695 PREDICTED: hypothetical protein n=1 Tax=Monodelphis domestica RepID=UPI0000F2E695 Length = 2439 Score = 76.8 bits (187), Expect = 2e-12, Method: Composition-based stats. Identities = 47/307 (15%), Positives = 83/307 (27%), Gaps = 26/307 (8%) Query: 138 KQNQQRQLTEYKTHRAGYTANGVPANISVVRSLQNSLARRTAMTAGKRRELHALEENLAI 197 +Q R + T + SL + +A Sbjct: 508 QQKGSRYRQGVQQLAVVITEGYSQDEVDRPASLLRRAGVTVFAVGTLKASGSRDLNKIAS 567 Query: 198 ISNSEPAQLLEEERLRKEIAELRAKI---ERVPFIDTFDLRYKNYEKRPDPSSQAVMFCL 254 + A LE I E K E V + + + ++ + +A + L Sbjct: 568 HPPRKHAIYLESFLQLSVITEKIKKRVCTEIVQKTFSVPVMTRTLQEGCVSTEEADFYFL 627 Query: 255 MDVSGSMDQSTKDMAKRFYILLYLFLS---RTYKNVEVVYIRHHTQAKEV--------DE 303 +D SGS++ K F I L + V Y T ++ + Sbjct: 628 IDGSGSINHDDFAEMKTFMIELISTFRVGADHVRFGVVQYSDSPTVEFDIRQHSSVAQLK 687 Query: 304 HEFFYS-QETGGTIVSSALKLMDEVVKERYNPAQWNIYAAQASDGDNWADDSPLCHEILA 362 Q GGT AL M + E + + +DG S A Sbjct: 688 SAITKIWQTGGGTRTGEALTFMKRLFSEVAR-DKVLRFLIVITDGQ-----SQDQVAQAA 741 Query: 363 KKLL-PVVRYYSYIEITRRAHQTLWREYEHLQSTFDNFAMQHIRDQDDIYPVFREL-FHK 420 ++L + Y+ + L Q+ F + V +++ F + Sbjct: 742 EELRQENITIYAIGV-KSAVTKELLE-ISGSQN-RMFFVNDFDSLKPIQQEVIQDICFLE 798 Query: 421 QNATAKG 427 K Sbjct: 799 VCKGMKA 805 Score = 74.9 bits (182), Expect = 5e-12, Method: Composition-based stats. Identities = 26/186 (13%), Positives = 46/186 (24%), Gaps = 22/186 (11%) Query: 239 YEKRPDPSSQAVMFCLMDVSGSMDQSTKDMAKRFYILLYLFLSRTYKNVEVVYIRHHTQA 298 ++ A + L+D S S+ K F L L L V V + + Sbjct: 6 LTTACRKAAVADLVFLVDSSTSIGPENFQKVKSFLYSLVLGLEIGRDQVRVGLAQFNDNI 65 Query: 299 KEVDEHEFFYSQET------------GGTIVSSALK----LMDEVVKERYNPAQWNIYAA 342 + F + GGT SAL Sbjct: 66 YKAFLLNQFPRKSDVLEQILSLPYRTGGTRTGSALNFLRTEFFTESAGSRAKDNVPQIVI 125 Query: 343 QASDGDNWADDSPLCHEILAKKLLPVVRYYSYIEITRRAHQTLWREYEHLQSTFDNFAMQ 402 +DG+ + E +K V Y + + + F+++ Sbjct: 126 LVTDGE----SNDEVAEAASKLKGQGVSIYVVGINVQDVQE--LKTIASKPLEKFLFSIE 179 Query: 403 HIRDQD 408 + Sbjct: 180 DFNILE 185 Score = 64.5 bits (155), Expect = 6e-09, Method: Composition-based stats. Identities = 17/183 (9%), Positives = 40/183 (21%), Gaps = 23/183 (12%) Query: 249 AVMFCLMDVSGSMDQSTKDMAKRFYILLYLFLSRTYKNVEVVYIRHHTQAKE-------- 300 + L+D S S+ K F + + V + ++ T ++ Sbjct: 1185 IDLVFLIDGSSSIHPRNFTAMKTFMKQIVNSFTIGKDRVRIGVAQYSTNPQKEFYLNTFY 1244 Query: 301 ---VDEHEFFYS-QETGGTIVSSALKLMDEVVKERYNPAQWNIY---AAQASDGDNWADD 353 Q T L+ + + + +DG Sbjct: 1245 SGAEINQHIDKITQLRTQTYTGKGLRFVKSFFEPANGSRKNLHVLQSLVVITDGM----S 1300 Query: 354 SPLCHEILAKKLLPVVRYYSYIEITRRAHQTLWREYEHLQSTFDNFAMQHIRDQDDIYPV 413 + E ++ +S + + + F + I Sbjct: 1301 NDSVVEAANDLRNEKIQIFSIGIGV--INLFELQLIA--GNVKRVFVVGDFGQLGSIERK 1356 Query: 414 FRE 416 Sbjct: 1357 VVR 1359 Score = 47.2 bits (110), Expect = 0.001, Method: Composition-based stats. Identities = 20/159 (12%), Positives = 39/159 (24%), Gaps = 23/159 (14%) Query: 249 AVMFCLMDVSGSMDQSTKDMAKRFYILLYLFLSRTYKNVEVVYIRHHTQAK--------- 299 A + L+D S + + K F + L ++ ++ Q Sbjct: 211 ADVVFLVDTSEGTSSVSFNQMKDFICRVIDTLEVGRDKDQIGLAQYGNQGHVEFLLNAYQ 270 Query: 300 ---EVDEHEFFYSQETGGTI-VSSALKLMDEVVKERYNPA----QWNIYAAQASDGDNWA 351 E+ H GG + L+ + E + + YA + Sbjct: 271 NPVEMISHIQQNFLPRGGARKTGNGLQYIQETFFQEEAGSRFLQGIPQYAVVIT----SG 326 Query: 352 DDSPLCHEILAKKLLPVVRYYSYIEITRRAHQTLWREYE 390 L E K V+ + + Sbjct: 327 QSEDLVLEKAQKLKERGVKIMVVGI--QDFDSRELKAMA 363 >UniRef50_UPI00016E6A6D UPI00016E6A6D related cluster n=2 Tax=Takifugu rubripes RepID=UPI00016E6A6D Length = 832 Score = 76.8 bits (187), Expect = 2e-12, Method: Composition-based stats. Identities = 31/189 (16%), Positives = 61/189 (32%), Gaps = 20/189 (10%) Query: 249 AVMFCLMDVSGSMDQSTKDMAKRFYILLYLFLSRTYKNVEVVYIRHHTQAK--------- 299 + ++D S SM + ++ K F I + L+ + V +++ T+ + Sbjct: 562 MDLVFVIDGSKSMGPANFELVKHFVISIVESLNVSQMGSHVGLLQYSTKVRTEFTLRQHT 621 Query: 300 --EVDEHEFFYSQETG-GTIVSSALKLMDEVVKERYNPAQWNI--YAAQASDGDNWADDS 354 + Q G G++ SAL+ M + A+ N+ +DG + D S Sbjct: 622 SAQSIRQAVSRMQYMGRGSMTGSALRHMFQFSFSAKEGARPNVPHVGIVFTDGRSQDDVS 681 Query: 355 PLCHEILAKKLLPVVRYYSYIEITRRAHQTLWREYEHLQSTFDNFAMQHIRDQDDIYPVF 414 K V Y+ A + RE + + +D +I Sbjct: 682 EWA----NKAKKSGVTIYAVGVGK--AIEQELREIASEPDEKHLYYAEDFQDMGEITKKL 735 Query: 415 RELFHKQNA 423 + Sbjct: 736 KSRMCTALK 744 Score = 68.4 bits (165), Expect = 5e-10, Method: Composition-based stats. Identities = 28/216 (12%), Positives = 65/216 (30%), Gaps = 24/216 (11%) Query: 216 IAELRAKIERVPFIDTFDLRYKNYEKRPDPSSQAVMFCLMDVSGSMDQSTKDMAKRFYIL 275 + + R + + ++ K E P + ++D S S+ + + K F + Sbjct: 17 VNKHRPRSAAARGQNETTVQNKAVE-NPCKAVPLDFVFVIDSSRSIRPNDYEKVKTFIVN 75 Query: 276 LYLFLSRTYKNVEVVYIRHHTQAKEVDEHEFFYSQET------------GGTIVSSALKL 323 L FL V +++ + + F S+ GT+ A+K Sbjct: 76 LIQFLEIGPDATRVGLLQYGSVVQPEFSLNTFTSKAEVEQAVRNMRHLATGTMTGLAIKY 135 Query: 324 MDEVVKERYNPAQ-----WNIYAAQASDGDNWADDSPLCHEILAKKLLPVVRYYSYIEIT 378 E + A+ A +DG D+ A++ ++ ++ Sbjct: 136 AMETSFTEEDGARAAHLHIPRIAVIVTDGRP--QDTVEQVAAQARQA--GIQIFAIGVGR 191 Query: 379 RRAHQTLWREYEHLQSTFDNFAMQHIRDQDDIYPVF 414 + + + + + + VF Sbjct: 192 --VDMNTLKTIGSEPHSEHVHLVANFSQIETLISVF 225 >UniRef50_A9UVU8 Predicted protein n=1 Tax=Monosiga brevicollis RepID=A9UVU8_MONBE Length = 785 Score = 76.8 bits (187), Expect = 2e-12, Method: Composition-based stats. Identities = 37/313 (11%), Positives = 70/313 (22%), Gaps = 50/313 (15%) Query: 126 DLLFEDLALPNLKQNQQRQLTEYKTHRAGYTANGVPANISVVRS---LQNSLA-----RR 177 + E + P + G +G N RS L R Sbjct: 11 QISHELMRDPVAAPDGYVYDRTNILQWIGQGEDGQRNNSPFDRSITISAADLRPAMTIRS 70 Query: 178 TAMTAGKRRELHALEENLAIISNSEPAQLLEEERLRKEIAELRAKIERVPFIDTFDLRYK 237 + L L + L E+A E I + Sbjct: 71 ALEEYIAQHHLEFEVAPLVTGRPALKLPARLASELELEVALHPVPGETRKAILELIPQ-- 128 Query: 238 NYEKRPDPSSQAVMFCLMDVSGSMDQST----------------KDMAKRFYILLYLFLS 281 R ++ + ++DVSGSM + D+ K + L+ Sbjct: 129 ----RQTATTNIHLNLVLDVSGSMGAAVTARDESNTLIEYNLCVMDLVKFASQVAVKCLA 184 Query: 282 RTYKNVEVVYIRHHTQAKEVDEHEFFYSQE----------------TGGTIVSSALKLMD 325 V + E G T + + ++ Sbjct: 185 PGDVISIVTFSDAAKIIVEPISVPDPKMGADTTVADVLGKIDAIYHGGSTNLWAGIETGL 244 Query: 326 EVVKERYNPAQWNIYAAQASDGDNW--ADDSPLCHEILAKKLLPVVRY-YSYIEITRRAH 382 +++ P N +DG+ + K++ ++ R Sbjct: 245 QLLASCAQPHLHN-VCVALTDGEPNRHPEQGYETAHRRFKQMPNFSYVLHTLPFGFGRID 303 Query: 383 QTLWREYEHLQST 395 L + Sbjct: 304 SALLQSLARTGEG 316 >UniRef50_P21941 Cartilage matrix protein n=35 Tax=Euteleostomi RepID=MATN1_HUMAN Length = 496 Score = 76.5 bits (186), Expect = 2e-12, Method: Composition-based stats. Identities = 22/191 (11%), Positives = 48/191 (25%), Gaps = 23/191 (12%) Query: 245 PSSQAVMFCLMDVSGSMDQSTKDMAKRFYILLYLFL---SRTYKNVEVVYIRH------- 294 + + ++D S S+ + K F + L + V Y Sbjct: 36 RTRPTDLVFVVDSSRSVRPVEFEKVKVFLSQVIESLDVGPNATRVGMVNYASTVKQEFSL 95 Query: 295 -HTQAKEVDEHEFFYSQE-TGGTIVSSALKLMDEVV-----KERYNPAQWNIYAAQASDG 347 +K Q + GT+ A++ R + +DG Sbjct: 96 RAHVSKAALLQAVRRIQPLSTGTMTGLAIQFAITKAFGDAEGGRSRSPDISKVVIVVTDG 155 Query: 348 DNWADDSPLCHEILAKKLLPVVRYYSYIEITRRAHQTLWREYEHLQSTFDNFAMQHIRDQ 407 + A+ V ++ + R+ ++ Sbjct: 156 RPQDSVQDVS----ARARASGVELFAIGVG--SVDKATLRQIASEPQDEHVDYVESYSVI 209 Query: 408 DDIYPVFRELF 418 + + F+E F Sbjct: 210 EKLSRKFQEAF 220 Score = 73.8 bits (179), Expect = 1e-11, Method: Composition-based stats. Identities = 24/185 (12%), Positives = 55/185 (29%), Gaps = 20/185 (10%) Query: 246 SSQAVMFCLMDVSGSMDQSTKDMAKRFYILLYLFLSRTYKNVEVVYIRHHTQAKEVDE-- 303 SS + L+D S S+ ++ K+F + L + K +V +++ + ++ Sbjct: 271 SSATDLVFLIDGSKSVRPENFELVKKFISQIVDTLDVSDKLAQVGLVQYSSSVRQEFPLG 330 Query: 304 --HEFFYSQE--------TGGTIVSSALKLMDE--VVKERYNPAQWNIYAAQASDGDNWA 351 H + GT+ +ALK + + +DG + Sbjct: 331 RFHTKKDIKAAVRNMSYMEKGTMTGAALKYLIDNSFTVSSGARPGAQKVGIVFTDGRSQD 390 Query: 352 DDSPLCHEILAKKLLPVVRYYSYIEITRRAHQTLWREYEHLQSTFDNFAMQHIRDQDDIY 411 + K + ++ + RE F + + I Sbjct: 391 YIND----AAKKAKDLGFKMFAVGVGNAVEDE--LREIASEPVAEHYFYTADFKTINQIG 444 Query: 412 PVFRE 416 ++ Sbjct: 445 KKLQK 449 >UniRef50_B3DM79 LOC100170623 protein n=1 Tax=Xenopus (Silurana) tropicalis RepID=B3DM79_XENTR Length = 1380 Score = 76.5 bits (186), Expect = 2e-12, Method: Composition-based stats. Identities = 28/192 (14%), Positives = 56/192 (29%), Gaps = 23/192 (11%) Query: 247 SQAVMFCLMDVSGSMDQSTKDMAKRFYILLYLFLSRTYKNVEVVYIRHHTQAKEVDEHEF 306 + L+D S SM+ + AKR +L L+ + + + H + + Sbjct: 871 QSWDLTILLDCSNSMEST-FQSAKRIALLAASSLNPWHNINIISFGTGHKEFSIRPKESQ 929 Query: 307 FYSQE-----------TGGTIVSSALKLMDEVVKERYNPAQWNIYAAQASDGDNWADDSP 355 E G T + L+ + + N SDG + Sbjct: 930 NLIPELEQFIKMAKPNMGNTELWKPLQSLCLLA---PPSDMHN--VLLISDGH---IQNE 981 Query: 356 LCHEILAKKLLPVVRYYSYIEITRRAHQTLWREYEHLQSTFDNFAMQHIRD--QDDIYPV 413 + KK VR ++ A++ + R + F F + + + Sbjct: 982 SLVFQILKKNAGKVRLFTCGVGA-TANRHMLRCLAQYGAGFFEFFEDKSKTSWKKKMEAQ 1040 Query: 414 FRELFHKQNATA 425 + +A Sbjct: 1041 LERMDSPACTSA 1052 >UniRef50_UPI000180B9AB PREDICTED: similar to CLCA family member 1, chloride channel regulator n=1 Tax=Ciona intestinalis RepID=UPI000180B9AB Length = 1001 Score = 76.5 bits (186), Expect = 2e-12, Method: Composition-based stats. Identities = 32/255 (12%), Positives = 76/255 (29%), Gaps = 31/255 (12%) Query: 193 ENLAIISNSEPAQLLEEERLRKEIAELRAKIERVPFIDTFDLRYKNYEK---RPDPSSQA 249 E + ++ + + L+ + A + + + +N P Sbjct: 175 EPINFHNSEADNEQNAKCNLKSLWEVIGASPDFREGANPPNPNLRNLTPTFRVVKPPQNR 234 Query: 250 VMFCLMDVSGSMDQSTKDMAKRFYILLYLF-LSRTYKNVEVVYIRHHTQAKEV--DEHEF 306 ++DVSGSM M ++ LS K V + + + Sbjct: 235 RFVVVLDVSGSMRGKRLLMMRQSTSEFISSLLSDGDKIGIVQFHSFAQTLLPIRHVNSQT 294 Query: 307 FYS--------QETGGTIVSSALKLMDEVVKERYNPAQWNIYAAQASDGDNWADDSPLCH 358 + G T + ++ + ++ R +P + + +DG SP Sbjct: 295 DRFDICSRFPNRTGGSTCIGCGIQAAMQEME-RDDPTEPCGHIVVLTDGMEN--RSPYTV 351 Query: 359 EILAKKLLPVVRYYSYIEITRRAHQTLWREYEHLQSTFDNFAMQH------------IRD 406 ++ ++ + + +T + L + + S FA + Sbjct: 352 DVSSRAVNRGCTIDAI-FLTTTQNTALVQ-LVNRTSGRWFFAQDRDLRRLTGAFAVIADE 409 Query: 407 QDDIYPVFRELFHKQ 421 DI + + ++Q Sbjct: 410 DGDIRNLVSTILYRQ 424 >UniRef50_A6DST2 Putative uncharacterized protein n=1 Tax=Lentisphaera araneosa HTCC2155 RepID=A6DST2_9BACT Length = 307 Score = 76.5 bits (186), Expect = 2e-12, Method: Composition-based stats. Identities = 19/186 (10%), Positives = 50/186 (26%), Gaps = 25/186 (13%) Query: 249 AVMFCLMDVSGSMDQS-----TKDMAKRFYILLYLFLSRTYKNVEVVYIRHHTQAKEVDE 303 + + +D SGSM ++A + V+ + V + Sbjct: 88 SNIQICLDSSGSMRADFGGKNRYEVAMQAVKEFTE-YREGDAFGLTVFGTEYINWVPVTK 146 Query: 304 H-----------EFFYSQET-GGTIVSSALKLMDEVVKERYNPAQWNIYAAQASDGDNWA 351 + GGT ++ AL+ + + ++ + SDG + + Sbjct: 147 DTSAIALATPFLAPDRMSKWFGGTNIAKALRGSQQQLLQQEDGD---RMIILVSDGVSGS 203 Query: 352 DDSPLCHEILAKKLLPVVRYYSYIEITRRAHQTLWREYEHLQSTFDNFAMQHIRDQDDIY 411 + + + V + F + + + D+ + Sbjct: 204 PNDTVDMAQELRNNKIVAYCIYIGSGNGSP---EMNALAAITGGQ-VFGVNNPKALDETF 259 Query: 412 PVFREL 417 ++ Sbjct: 260 RFIDKM 265 >UniRef50_Q22G03 Putative uncharacterized protein n=1 Tax=Tetrahymena thermophila SB210 RepID=Q22G03_TETTH Length = 994 Score = 76.5 bits (186), Expect = 2e-12, Method: Composition-based stats. Identities = 29/204 (14%), Positives = 65/204 (31%), Gaps = 25/204 (12%) Query: 233 DLRYKNYEKRPDPSSQAVMFCLMDVSGSMDQSTKDMAKRFYILLYLFLSRTYKNVEVVYI 292 L+ + S Q ++D SGSMD + + K+ L F+ +T + ++ + Sbjct: 29 KLKLDETALVQNNSRQKNYQIVIDNSGSMDGTNIQLTKQLCNELVQFVIKTQPHSKISLM 88 Query: 293 RHHTQAKEV----------DEHEFFYSQETGGTIVSSALKLMDEVVKERYNPAQWNIYAA 342 +T V E GGTI + ++ ++ + + Sbjct: 89 TFNTSIDHVENLHLKSLKQVEQFISNINANGGTIFHITFDKLRDICQK-FTNQNEELVIV 147 Query: 343 QASDGDNWADDSPLCHE-------ILAKKLLPVVRYYSYIEITRRAHQTLWREYE--HLQ 393 +DG + + + KK + V ++ T + + Sbjct: 148 YLTDGQVQSGQDSTNLKDSFIFLQQVLKKFVNNVEVHALGMGTS-HDPVILDKIISLQTT 206 Query: 394 STFDNFAMQHIRDQDDIYPVFREL 417 + F ++ +I F+ + Sbjct: 207 QSTYQFI----KESSEIEGAFKNI 226 >UniRef50_UPI00005A0FAD PREDICTED: similar to Integrin alpha-M precursor (Cell surface glycoprotein MAC-1 alpha subunit) (CR-3 alpha chain) (CD11b) (Leukocyte adhesion receptor MO1) (Neutrophil adherence receptor) isoform 4 n=1 Tax=Canis lupus familiaris RepID=UPI00005A0FAD Length = 1036 Score = 76.5 bits (186), Expect = 2e-12, Method: Composition-based stats. Identities = 21/211 (9%), Positives = 56/211 (26%), Gaps = 15/211 (7%) Query: 230 DTFDLRYKNYEKRPDPSSQAVMFCLMDVSGSMDQSTKDMAKRFYILLYLFLSRTYKNVEV 289 + + R P + + L+D SGS++ + K F + + + Sbjct: 131 LWQPPQSFPEKLRECPRQDSDIAFLIDGSGSINPTDFQRMKEFVSTVMDQFKNSKTLFSL 190 Query: 290 VYIRHHTQAKEVDEHEFFYSQET----------GGTIVSSALKLMDEVVKERYNPAQWN- 338 + Q + + G T ++ ++ + + + A+ N Sbjct: 191 MQFSEDFQIHFTFNEFKKNPKPSFLVKSIKQLLGRTHTATGIRKVVRELFHSSSGARENA 250 Query: 339 -IYAAQASDGDNWADD-SPLCHEILAKKLLPVVRYYSYIEITRRA-HQTLWREYEHLQST 395 +DG+ + D A + + + ++ Sbjct: 251 LKILVVITDGEKYGDPLDYKDVIPEADREGIIRYVIGVGDAFNHLKNREELNIIASKPPR 310 Query: 396 FDNFAMQHIRDQDDIYPVF-RELFHKQNATA 425 F + + I ++F + Sbjct: 311 DHVFRVNNFEALKTIQNQLQEKIFAIEGTQT 341 >UniRef50_B2B639 Predicted CDS Pa_2_6630 n=1 Tax=Podospora anserina RepID=B2B639_PODAN Length = 1378 Score = 76.5 bits (186), Expect = 2e-12, Method: Composition-based stats. Identities = 40/311 (12%), Positives = 86/311 (27%), Gaps = 34/311 (10%) Query: 107 QDGEGQDEFVFQISKDEYLDLLFEDLALPNLKQNQQRQLTEYKTHRAGYTANGVPANI-- 164 D S E L + E + + + G A A++ Sbjct: 224 GDKPQDYNNAASTSIPEGLTIQIEVVEAGKIASIVSPTHKVTVENWLGTRAASSFADLVG 283 Query: 165 -SVVRSLQNSLARRTAMTAGKRRELHALEENLAIISNSEPAQLLEEERLRKEIAELRAKI 223 S++ +L + +A R+ + + E E + I + Sbjct: 284 EDTRSSVETALVKLETGSAFLDRDFV--------LDIATGGPNDEAESPQAWIEKHPTLP 335 Query: 224 ERVPFIDTFDLRYKNYEKRPDPSSQAVMFCLMDVSGSMDQSTKDMAKRFYILLYLFLSRT 283 + + T + + +P+ Q + L D+SGSMD + + Sbjct: 336 NQQALMVTIPPGFTT--RTSNPTDQTEILLLADLSGSMD-DKLTSLRAAMQFFLKGIPNG 392 Query: 284 YKNVEVVY-IRHHTQAKEVDEHEFFYSQE------------TGGTIVSSALKLMDEVVKE 330 K + + + ++ Q GGT + A++ + + Sbjct: 393 RKFNVWCFGSSYKSWQPHSVDYGEASYQSASSWVDTNFHANMGGTELLPAVQAIVTARDK 452 Query: 331 RYNPAQWNIYAAQASDGDNWADDSPLCHEILAKKL-LPVVRYYSYIEITRRAHQTLWREY 389 R +DG+ W D L + + L +R+++ L Sbjct: 453 RLPTD-----IIILTDGETWRLDETLEYIRKQRDLTEGGIRFFALGIG-PAVSHALVEGI 506 Query: 390 EHLQSTFDNFA 400 + + Sbjct: 507 AKVGGGYAEVV 517 >UniRef50_P15988 Collagen alpha-2(VI) chain n=16 Tax=Euteleostomi RepID=CO6A2_CHICK Length = 1022 Score = 76.5 bits (186), Expect = 2e-12, Method: Composition-based stats. Identities = 44/355 (12%), Positives = 84/355 (23%), Gaps = 29/355 (8%) Query: 65 HQGRGGLRHRVHPGNDHFVQNDRIERPQGGGGGSGSGQGQASQDGEGQDEFVFQISKDEY 124 G + PG + E P+G G G+ + Q G + + Sbjct: 426 RGDPGTKGSKGGPGAKGERGDPGPEGPRGLPGEVGNKGARGDQGLPGPRGPTGAVGEPGN 485 Query: 125 LDLLFEDLALPNLKQNQQRQLTEYKTHRAGYTANGVPANISVVRSLQNSLARRTAMTAGK 184 + + L + R G++ G + G Sbjct: 486 IGSRGDPGDLGPRGDAGPPGPKGDRG-RPGFSYPGPRGPQGDKGEKGQPGPKGGRGELGP 544 Query: 185 RRELHALEENLAIISNSEPAQLLEEERLRKEIAELRAKIERVPFIDTFDLRYKNYEKR-- 242 + E + + E P + D+ E Sbjct: 545 KGTQGTKGEKGEPGDPGPRGEPGTRGPPGEAGPEGTPGPPGDPGLTDCDVMTYVRETCGC 604 Query: 243 ---PDPSSQAVMFCLMDVSGSMDQSTKDMAKRFYILLYLFLSR---------TYKNVEVV 290 + ++D S S+ + + K F + + L + V Sbjct: 605 CDCEKRCGALDIMFVIDSSESIGYTNFTLEKNFVVNVVSRLGSIAKDPKSETGARVGVVQ 664 Query: 291 YIRHHTQ-AKEVDEHEFFYSQE-----------TGGTIVSSALKLMDEVVKERYNPAQWN 338 Y T A ++D+ GGT SAL+ + + + Sbjct: 665 YSHEGTFEAIKLDDERINSLSSFKEAVKRLEWIAGGTWTPSALQFAYNKLIKESRREKAQ 724 Query: 339 IYAAQASDGDNWADDSPLCHEILAKKLLPVVRYYSYIEITRRA-HQTLWREYEHL 392 ++A +DG D L +V +I + Sbjct: 725 VFAVVITDGRYDPRDDDKNLGALC-GRDVLVNTIGIGDIFDQPEQSETLVSIACN 778 Score = 41.4 bits (95), Expect = 0.068, Method: Composition-based stats. Identities = 17/177 (9%), Positives = 39/177 (22%), Gaps = 31/177 (17%) Query: 238 NYEKRPDPSSQAVMFCLMDVSGSMDQSTKDMAKRFYILLYLFL---SRTYK--NVEVVYI 292 + L+D S + + A F + L R N + + Sbjct: 821 ELAVAQCTQRPVDIVFLLDGSERIGEQNFHRAHHFVEQVAQQLTLARRNDDNMNARIALL 880 Query: 293 RHH-----------TQAKEVDEHEFFYSQE-TGGTIVSSALKLMDEVVKERYNPAQWNIY 340 ++ T + + + + SA+ + +P Sbjct: 881 QYGSEREQNVVFPLTYNLTEISNALAQIKYLDSSSNIGSAIIHAINNI--VLSPGNGQRV 938 Query: 341 A--------AQASDGDNWADDSPLCHEILAKKLLPVVRYYSYIEITRRAHQTLWREY 389 A +DG + KK + + + + Sbjct: 939 ARRNAELSFVFITDG-ITGSKNLEEAINSMKKQDVMPTVVALG---SDVDMDVLLKL 991 >UniRef50_A7RT18 Predicted protein (Fragment) n=1 Tax=Nematostella vectensis RepID=A7RT18_NEMVE Length = 177 Score = 76.5 bits (186), Expect = 2e-12, Method: Composition-based stats. Identities = 22/178 (12%), Positives = 51/178 (28%), Gaps = 20/178 (11%) Query: 250 VMFCLMDVSGSMDQSTKDMAKRFYILLYLFL---SRTYKNVEVVYIRHHT--------QA 298 + ++D SGS+ + M F L + V Y + + Sbjct: 1 DIGFVLDASGSVRANRFKMCLNFINKLVNSFHIGPHNTRIGIVRYSTRPSGIFRFTSYRN 60 Query: 299 KEVDEHEFFYSQ-ETGGTIVSSALKLMDEVVKERYNPAQWNIYAAQASDGDNWADDSPLC 357 K +H + G T +A+ + + +DG + DS + Sbjct: 61 KHSTKHRVNRIRYTGGWTRTGAAINYARRYLYQHNRRRGVRKVLIVMTDGK--SQDSVVG 118 Query: 358 HEILAKKLLPVVRYYSYIEITRRAHQTLWREYEHLQSTFDNFAMQHIRDQDDIYPVFR 415 K++ + ++ ++ + ++ + RD I + Sbjct: 119 ASRSVKRM--GIEVFAIGIGRG-YRRSELNQMATDRN---HVLTARFRDLHKIIGKIK 170 >UniRef50_UPI000180C3AB PREDICTED: similar to polydomain protein-like n=1 Tax=Ciona intestinalis RepID=UPI000180C3AB Length = 721 Score = 76.5 bits (186), Expect = 2e-12, Method: Composition-based stats. Identities = 25/174 (14%), Positives = 51/174 (29%), Gaps = 21/174 (12%) Query: 232 FDLRYKNYEKRPDPSSQAVMFCLMDVSGSMDQSTKDMAKRFYILLYLFLSRTYKNVEVVY 291 +DL K+ PS+ + ++D S S+ + + K+F + + + Sbjct: 503 WDLAPPCCAKKCPPSAPMDLVLILDSSSSVKRPNWNTMKQFVRSIITTFNFGENEARMAV 562 Query: 292 IRHHTQ--------------AKEVDEHEFFYSQETG-GTIVSSALKLMDEVVKERYNPAQ 336 R++ Q K + G GT AL+ V+ N + Sbjct: 563 FRYNRQVDTRNQILLSDHINNKTTFLEAYDKLPYNGFGTFTGRALRHAKNVILANRNGNR 622 Query: 337 WNI--YAAQASDGDNWADDSPLCHEILAKKLLPVVRYYSY--IEITRRAHQTLW 386 N+ +DG + D+ +++ L Sbjct: 623 PNVKDVILTITDGR--SQDNVATISTELREMGVTTFVIGIQPGNGAGLDQDQLL 674 >UniRef50_UPI00006A0418 Poly [ADP-ribose] polymerase 4 (EC 2.4.2.30) (PARP-4) (Vault poly(ADP- ribose) polymerase) (VPARP) (193 kDa vault protein) (PARP- related/IalphaI-related H5/proline-rich) (PH5P). n=1 Tax=Xenopus (Silurana) tropicalis RepID=UPI00006A0418 Length = 1230 Score = 76.5 bits (186), Expect = 2e-12, Method: Composition-based stats. Identities = 45/311 (14%), Positives = 86/311 (27%), Gaps = 39/311 (12%) Query: 138 KQNQQRQLTEYKTHRAGYTANGVPANISVVRSLQN--SLARRTAMTAGKRRELHALEENL 195 K ++ T+ + +GVP S VRS S+ + K LE Sbjct: 755 KDKALKENTQDTVEKVCVEESGVPQGYSFVRSPPKTCSVCSFLQLNDLKG-IADRLECFF 813 Query: 196 AIISNSEPAQLLEEERLRKEIAELRAKIERVPFIDTFDLRYKNYEKR---------PDPS 246 + E +LR + + + K + Sbjct: 814 PLQRLD-----DEGFQLRVYLESSFLESSWNQPRMWVEKHPKEDSEACMLVFLPSFKTSI 868 Query: 247 SQAVMFCLMDVSGSMDQSTKDMAKRFYILLYLFLSRTYKNVEVVYIR---HHTQAKEVDE 303 + L+D S SM+ + AKR +L L+ + + + H +E Sbjct: 869 QSWDLTILLDCSNSMEST-FQSAKRIALLAASSLNPWHNINIISFGTGKYKHYVQRERAL 927 Query: 304 HEFFYSQE-------TGGTIVSSALKLMDEVVKERYNPAQWNIYAAQASDGDNWADDSPL 356 + G T + L+ + + N SDG + Sbjct: 928 LPQSSIKPLLMAKPNMGNTELWKPLQSLCLLA---PPSDMHN--VLLISDGH---IQNES 979 Query: 357 CHEILAKKLLPVVRYYSYIEITRRAHQTLWREYEHLQSTFDNFAMQHIRD--QDDIYPVF 414 + KK VR ++ A++ + R + F F + + + Sbjct: 980 LVFQILKKNAGKVRLFTCGVGA-TANRHMLRCLAQYGAGFFEFFEDKSKTSWKKKMEAQL 1038 Query: 415 RELFHKQNATA 425 + +A Sbjct: 1039 ERMDSPACTSA 1049 >UniRef50_C8XDP5 von Willebrand factor type A n=1 Tax=Nakamurella multipartita DSM 44233 RepID=C8XDP5_NAKMY Length = 681 Score = 76.5 bits (186), Expect = 2e-12, Method: Composition-based stats. Identities = 23/174 (13%), Positives = 45/174 (25%), Gaps = 32/174 (18%) Query: 252 FCLMDVSGSMD-----QSTKDMAKRFYILLYLFLSRTYKNVEVVYIRHH----------- 295 ++D SGSM+ D AK L L + +VY Sbjct: 60 MIVLDASGSMNQDDAPGLRIDAAKAAVTDLLGTLPAPTQVGLMVYGTSTGSTDAERAAGC 119 Query: 296 ----------TQAKEVDEHEFFYSQETGGTIVSSALKLMDEVVKERYNPAQWNIYAAQAS 345 T + +G T + +AL+ + + P + S Sbjct: 120 QDIKTLAPVGTLNAATLTSQVAGITASGYTPIGNALRAAAQAL-----PNEGPRSIVLVS 174 Query: 346 DGDNW-ADDSPLCHEILAKKLLPVVRYYSYIEITRRAHQTLWREYEHLQSTFDN 398 DG++ A +P + + ++ + + Sbjct: 175 DGEDTCAPPAPCDVARELHEQGVDLTVHTVGFKVDATARDQLSCVAQATGGTYS 228 >UniRef50_C3YH52 Putative uncharacterized protein n=1 Tax=Branchiostoma floridae RepID=C3YH52_BRAFL Length = 1119 Score = 76.5 bits (186), Expect = 2e-12, Method: Composition-based stats. Identities = 27/208 (12%), Positives = 64/208 (30%), Gaps = 23/208 (11%) Query: 224 ERVPFIDTFDLRYKNYEKRPDPSSQAVMFCLMDVSGSMDQSTKDMAKRFYILLYLFLS-- 281 + +I DL + + +F ++D S S+ + D+ K F + + + Sbjct: 726 NEIRWIGLNDLINEAFPTVEPCDESVDLFFVLDGSDSVSLADFDIVKEFVVAVVSGFTIS 785 Query: 282 -RTYKNVEVVYIRHHT---------QAKEVDEHEFFYSQETGGTIVSSALKLMDEVVKER 331 + + Y T +++ GGT +AL+ + R Sbjct: 786 LTDTRVGVLQYSDGSTLECNLGDHPDWSSFVNSMNTMARQGGGTSTGAALEFARLIAAWR 845 Query: 332 YNPAQWNIYAAQASDGDNWADDSPLCHEILAKKLLPVVRYYSYIEITRRAHQTLWREYEH 391 P +DGD+ + LA + V ++ +++ + + Sbjct: 846 PAP-VVPRIMIVLTDGDSED-SVVTPAQALATEQ---VTVFAIGVG--SFNRSELLQITN 898 Query: 392 LQSTFDNFAMQHIRDQDDIYPVFRELFH 419 + + D + I + + Sbjct: 899 NNQDR----VFELADFNAIANIMNRIIQ 922 Score = 68.0 bits (164), Expect = 7e-10, Method: Composition-based stats. Identities = 25/210 (11%), Positives = 54/210 (25%), Gaps = 34/210 (16%) Query: 224 ERVPFIDTFDLRYKNYEKRPDPSSQAVMFCLMDVSGSMDQSTKDMAKRFYILLYLFLS-- 281 R+ ++ + E + +F ++D SGS+ + K+F + L + Sbjct: 918 NRIIQAACINIVFPTVEPCDVTT---DLFFVLDGSGSVGLYNFNTVKQFVVTLVSAFTIG 974 Query: 282 ----RTYKNVEVVYIRHHT-----QAKEVDEHEFFYSQE-----TGGTIVSSALKLMDEV 327 + + Y +T T +AL+ ++ Sbjct: 975 LNDVNDTRVGVLQYSSSNTLGCNLGDHPDLSSFVNAMNAMRYHYGPSTQTGAALQAAGQI 1034 Query: 328 VKERYNPAQWNIYAAQASDG---DNWADDSPLCHEILAKKLLPVVRYYSYIEITRRAHQT 384 R PA +DG D+ S V ++ ++ Sbjct: 1035 AAWR--PAPVPRIMVVVTDGMAHDSVVAPSQGLAADQ-------VNVFAIGVG--NYVRS 1083 Query: 385 LWREYEHLQSTFDNFAMQHIRDQDDIYPVF 414 + + F + D Sbjct: 1084 ELLQIANNNQAR-VFELADFNAIRDNINDI 1112 >UniRef50_UPI0001AF2DA9 hypothetical protein SrosN1_23653 n=1 Tax=Streptomyces roseosporus NRRL 11379 RepID=UPI0001AF2DA9 Length = 527 Score = 76.5 bits (186), Expect = 2e-12, Method: Composition-based stats. Identities = 30/262 (11%), Positives = 64/262 (24%), Gaps = 26/262 (9%) Query: 179 AMTAGKRRELHALEENLAIISNSEPAQLLEEERLRKEIAELRAKIERVPFIDTFDLRYKN 238 A+T A+ ++ + + LR+ + + + +L + Sbjct: 265 ALTGATPEARDAVRTLTEHFRSTAVQREITALTLRRPVVAAARPADPLAPEQRRELPFPG 324 Query: 239 YEKR---------PDPSSQAVMFCLMDVSGSMDQSTKDMAKRFYILLYLFLSRTYKNVEV 289 + ++D SGSM K L + + Sbjct: 325 TRSVADGLLSSYEHRLRRPSRTVYVLDTSGSMKGRRLAQLKSALNGLTGDFREREQVTLL 384 Query: 290 VYIRHHTQAKEVDEHEFF-------------YSQETGGTIVSSALKLMDEVVKERYNPAQ 336 + Q + G T + S+L + + A Sbjct: 385 PFGSTVKQVRTHTVDPADPKAGPAAIRADAAALSAEGDTAIYSSLAAAYDHLGPDTESAF 444 Query: 337 WNIYAAQASDGDNWADDSPLCHEILAKKLLPVVRYY-SYIEITRRAHQTLWREYEHLQST 395 + +DG+N A S + L R + + + ++ L Sbjct: 445 TS--IVLMTDGENTAGRSAAEFGAFYRALPEARRVTPVFPVVFGDSDRSELEAIAALTGG 502 Query: 396 FDNFAMQHIRDQDDIYPVFREL 417 F + F E+ Sbjct: 503 R-LFDGTKEEGPGSLDGAFEEI 523 >UniRef50_UPI00006A02BA UPI00006A02BA related cluster n=1 Tax=Xenopus (Silurana) tropicalis RepID=UPI00006A02BA Length = 865 Score = 76.5 bits (186), Expect = 2e-12, Method: Composition-based stats. Identities = 27/187 (14%), Positives = 55/187 (29%), Gaps = 21/187 (11%) Query: 240 EKRPDPSSQAVMFCLMDVSGSMDQSTKDMAKRFYILLYL------------FLSRTYKNV 287 A + ++D SGS+D + + F + L L + N Sbjct: 179 AAECKRIEVADIVFVIDSSGSIDYTEYKEMQNFMVSLVNKSAVGPDNVQFGALKYSDYNT 238 Query: 288 EVVYIRHHTQAKEVDEHEFFYSQETGGTIVSSALKLMDEVVKE---RYNPAQWNIYAAQA 344 E+ Y+ +T ++ H + + G T + A++ E E Sbjct: 239 ELFYLNRYTNKVDIINHINKDTTQGGNTYTAGAVRFSKEFFTEKHGSRKARGVPQIVMVI 298 Query: 345 SDGDNWADDSPLCHEILAKKLLPVVRYYSYIEITRRAHQTLWREYEHLQSTFDNFAMQHI 404 +DGD+ D ++ + Y+ ++ F + + Sbjct: 299 TDGDSHDKDKLNETARQLEQ--EGIIIYAIGIDQANTNE--LETLAG-TEGK-WFMVANF 352 Query: 405 RDQDDIY 411 DI Sbjct: 353 SGLQDIL 359 >UniRef50_B5YKY5 Magnesium-chelatase subunit ChlD n=1 Tax=Thermodesulfovibrio yellowstonii DSM 11347 RepID=B5YKY5_THEYD Length = 614 Score = 76.5 bits (186), Expect = 2e-12, Method: Composition-based stats. Identities = 36/277 (12%), Positives = 76/277 (27%), Gaps = 22/277 (7%) Query: 148 YKTHRAGYTANGVPANISVVRSLQNSLARRTAMTAGKRRELHALEENLAIISNSEPAQLL 207 + + G + ++ + R+T K + + + P + Sbjct: 331 PSSEKEEIFPIGEKFKVKRFIFKKDRIVRQTTGRRTKTKTKGRGGRYVRSLMQKRPDIAI 390 Query: 208 EEERLRKEIAELRAKIERVPFIDTFDLRYKNYEKRPDPSSQAVMFCLMDVSGSMD-QSTK 266 + + ++ I DLRYK + + ++D SGSM + Sbjct: 391 DATLRAAAPFQKLRAMKDNVVIFDDDLRYK----EKERRMSHNVIFVVDGSGSMGVEQRM 446 Query: 267 DMAKRF-YILLYLFLSRTYKNVEVVY-------IRHHTQAKEVDEHEFFYSQETGGTIVS 318 K LL + K +V+ + T + E+ G T +S Sbjct: 447 KATKGAVLSLLIDCYKKRDKVAMIVFRKDKAEILLPLTSSVELALKRLREIPTGGKTPLS 506 Query: 319 SALKLMDEVVK-ERYNPAQWNIYAAQASDGDNWADDSPLCHEILAKK---LLPVVRYYSY 374 + L +++K + + + +DG S K +L + Sbjct: 507 AGLMEAYKLMKITHFKYPENRLLILIITDGKPNVSLSDKPVLEELKSVCFMLKDFPLTDF 566 Query: 375 IEITRRAHQTLWR-----EYEHLQSTFDNFAMQHIRD 406 I I + E + + Sbjct: 567 IVIDTEKKDKFMKMDLAIEIAEWLQATYHSIDSLKSE 603 >UniRef50_UPI0000521DAC PREDICTED: similar to Collagen alpha-1(XIV) chain n=1 Tax=Ciona intestinalis RepID=UPI0000521DAC Length = 725 Score = 76.5 bits (186), Expect = 2e-12, Method: Composition-based stats. Identities = 24/184 (13%), Positives = 49/184 (26%), Gaps = 22/184 (11%) Query: 248 QAVMFCLMDVSGSMDQSTKDMAKRFYILLYL--FLSRTY-KNVEVVYIRHHTQAKEVDEH 304 + ++ ++D SGS+ S + RF L + + VY T ++H Sbjct: 192 KLDLWFVIDGSGSVGFSNFQDSLRFLASLTKRFTIGPDDVRVGFSVYSSTSTIHSHFNQH 251 Query: 305 EFFY-SQE--------TGGTIVSSALKLMDE--VVKE---RYNPAQWNIYAAQASDGDNW 350 + GGT A+ + V+ R +DG Sbjct: 252 MNNSALEAEILGTSYTGGGTSTGRAINDVLNNGFVERNGARPASEGVPRILVVMTDGQ-- 309 Query: 351 ADDSPLCHEILAKKLLPVVRYYSYIEITRRAHQTLWREYEHLQSTFDNFAMQHIRDQDDI 410 + DS K + + + E + + + + + Sbjct: 310 SGDSVKTPSDNVKAA--GITVFGVGIGSG-IDIAEVNEIASNPDSRYAYELTGFNLLNVL 366 Query: 411 YPVF 414 Sbjct: 367 SQRL 370 >UniRef50_Q1Q3X1 Putative uncharacterized protein n=1 Tax=Candidatus Kuenenia stuttgartiensis RepID=Q1Q3X1_9BACT Length = 336 Score = 76.5 bits (186), Expect = 2e-12, Method: Composition-based stats. Identities = 31/233 (13%), Positives = 51/233 (21%), Gaps = 57/233 (24%) Query: 234 LRYKNYEKRPD-PSSQAVMFCLMDVSGSM-----DQSTKDMAKRFYILLYLFLSRTYKNV 287 L+ K + ++DVS SM + + AK L L + Sbjct: 75 LQPKWQTTEEQYEKEGLEIVFVLDVSMSMLAEDVKPNRLECAKMEIANLVRGL-EDDRVG 133 Query: 288 EVVYIR-------HHTQAKEVD--------EHEFFYSQETGGTIVSSALKLMDEVVKERY 332 VV+ + T+ E+ + G T + +AL E Sbjct: 134 LVVFAARAFSLLPYPTKDYEMVFLRILNMVNEHYVRFVPYG-TNIGNALIAAMETFSNEA 192 Query: 333 NPAQWNIYAAQASDGDNWADDSPLCHEI---LAKKLLPVVRYYSYI-------------- 375 +DG+ E L +K Sbjct: 193 GK----KIIILLTDGEEQLLRRSQVVEAIRLLLEKNDISTYIIGIGDPNNSTSIPKRDRL 248 Query: 376 -------------EITRRAHQTLWREYEHLQSTFDNFAMQHIRDQDDIYPVFR 415 I + L RE + Q+ V Sbjct: 249 GNKSGYERTEDGEIIYTKPDPKLLREIAEMAGGSYQHDATGAELQNIFKQVIE 301 >UniRef50_UPI000180B353 PREDICTED: similar to putative calcium activated chloride channel-like protein 1; eCLCA1 n=1 Tax=Ciona intestinalis RepID=UPI000180B353 Length = 1580 Score = 76.1 bits (185), Expect = 2e-12, Method: Composition-based stats. Identities = 30/261 (11%), Positives = 62/261 (23%), Gaps = 42/261 (16%) Query: 193 ENLAIISNSEPAQLLEEERLRKEIAELRAKIE-------RVPFIDTFDLRYKNYEKRPDP 245 + + + + + LR + + + P + ++ Sbjct: 276 DPVNQHNTEADNEQNAKCNLRSTWDVITSTSDFSGGSNPPNPTLTNLAPTFRVVRVAASR 335 Query: 246 SSQAVMFCLMDVSGSMDQSTKDMAKRFYILLYL-FLSRTYKNVEVVYIRHHTQAKEVDEH 304 ++DVSGSM + M ++ L K V + E+ Sbjct: 336 R----FVLVLDVSGSMSGNRLLMMRQSAGDFISTSLPDGDKVGIVQFHSSANLMMEI-RQ 390 Query: 305 EFFYSQ-----------ETGGTIVSSALKLMDEVVKERYNPAQWNIYAAQASDGDNWADD 353 G T + + ++ + + +DG Sbjct: 391 ISSQLDRVAIAAGIPGIAGGSTCIGCGIYAAMNEMERH-DANETCGNIIVLTDGKENQPP 449 Query: 354 SPLCHEILAKKLLPVVRYYSYIEITRRAHQTLWREYEHLQSTFDNFAMQH---------- 403 LA + VV + T + L FA Sbjct: 450 YVNDVSQLAIQKNCVVNAILF---TTTENSALVD-LVTATGGQWFFAQDRDLKRLMGSFA 505 Query: 404 ---IRDQDDIYPVFRELFHKQ 421 D D+ + + +KQ Sbjct: 506 VIAANDDGDVRNLVSTILYKQ 526 >UniRef50_Q4TBC0 Chromosome undetermined SCAF7164, whole genome shotgun sequence n=1 Tax=Tetraodon nigroviridis RepID=Q4TBC0_TETNG Length = 1636 Score = 76.1 bits (185), Expect = 2e-12, Method: Composition-based stats. Identities = 31/191 (16%), Positives = 62/191 (32%), Gaps = 23/191 (12%) Query: 245 PSSQAVMFCLMDVSGSMDQSTKDMAKRFYILLYLFLSRTYKNVEVVYIRHHTQAKE---- 300 +A + L+D SGS+ K+F I L + V V + + K+ Sbjct: 1041 EKQKADLVFLLDQSGSIQSDDYTTMKKFTIDLINKFQISRDLVHVGLAQFSSTFKDEFYL 1100 Query: 301 --------VDEHEFFYSQETGGTIVSSALKLMDEVVKE---RYNPAQWNIYAAQASDGDN 349 + H QE GGT++ AL + + + + +DGD Sbjct: 1101 NKFFDEQAISAHIKDMQQEEGGTLIGLALNSIRKYFEASHGSRKAEGISQNLVLITDGD- 1159 Query: 350 WADDSPLCHEILAKKLLPVVRYYSYIEITRRAHQTLWREYEHLQSTFDNFAMQHIRDQDD 409 + D L + L V ++ H + + + F +++ + Sbjct: 1160 -SQDDVEEAARLLRGL--GVEVFAIGIG--NVHDLELLQIA--GTPENVFTVKNFDKLEG 1212 Query: 410 IYPVFRELFHK 420 I+ + + Sbjct: 1213 IHQKVVDTICQ 1223 Score = 68.8 bits (166), Expect = 4e-10, Method: Composition-based stats. Identities = 25/183 (13%), Positives = 50/183 (27%), Gaps = 21/183 (11%) Query: 245 PSSQAVMFCLMDVSGSMDQSTKDMAKRFYILLYLFLSRTYKNVEVVYIRHHT-------- 296 + L+D SGS+ K F L + V V +++ T Sbjct: 649 KDVPGDLIFLIDSSGSIYPEDYQKMKDFMKSLVQKSNIGKDQVHVGVLQYSTEQKLVFPL 708 Query: 297 ---QAKEVDEHEFFYSQE-TGGTIVSSALKLMDEVVK-ERYNPAQWNIYAAQASDGDNWA 351 K+ Q+ GGT A+ ++ + + +DG+ + Sbjct: 709 IQYYTKDQLSKAIDDMQQIGGGTHTGEAIAVVSKYFDAQNGGRPDLKQRLVVVTDGE--S 766 Query: 352 DDSPLCHEILAKKLLPVVRYYSYIEITRRAHQTLWREYEHLQSTFDNFAMQHIRDQDDIY 411 D + +V + A+ + E +A + D+ Sbjct: 767 QDDVKLPAEALRAKGVIVYSIGVV----AANTSQLLEIS--GDADRMYAERDFDALKDLE 820 Query: 412 PVF 414 Sbjct: 821 KQM 823 Score = 66.8 bits (161), Expect = 2e-09, Method: Composition-based stats. Identities = 34/249 (13%), Positives = 68/249 (27%), Gaps = 31/249 (12%) Query: 134 LPNLKQNQQRQLTEYKTHRAGYTANGVPANISVVRSLQNSLARRTAMTAGKRRELHALEE 193 P+ + Q + + + N ++ +++N+ R+ + Sbjct: 235 SPSTYRLGLAQYGQDIKVDFLFKDHQT--NKDLLTAVKNAQQRKLQPNEPRNLGKALQYA 292 Query: 194 NLAIISNSEPAQLLEEERLRKEIAELRAKIERVPFIDTFDLRYKNYEKRPDPSSQAVMFC 253 + ++ + R+ + L K P + + N + Sbjct: 293 YKNFFTPEAGSRNDQS--FRQYLVVLTGKDADDPVYEEAHCKAANIA---------DIVF 341 Query: 254 LMDVSGSMDQSTKDMAKRFYILLYLFL---SRTYKNVEVVYIRHH---------TQAKEV 301 ++D SGS+ S + + F L L + VVY T E+ Sbjct: 342 IIDESGSIGSSDFQLVRTFLHSLVSGLEVSPNRVRVGIVVYHGEPKAEVFLNTFTDKSEL 401 Query: 302 DEHEFFYSQETGGTIVSSALKLMDEVV----KERYNPAQWNIYAAQASDGDNWADDSPLC 357 + GGT +AL V K A +DG + Sbjct: 402 LDFIRILPYHGGGTNTGAALNFTQHQVFVREKGSRIELGVQQVAVVITDG--CVETEEAD 459 Query: 358 HEILAKKLL 366 L + Sbjct: 460 IFFLIDQSG 468 Score = 63.0 bits (151), Expect = 2e-08, Method: Composition-based stats. Identities = 29/215 (13%), Positives = 52/215 (24%), Gaps = 20/215 (9%) Query: 187 ELHALEENLAIISNSEPAQLLEEERLRKEIAELRAKIERVPFIDTFDLRYKNYEKRPDPS 246 + L + + I+ + + + + + Sbjct: 397 DKSELLDFIRILPYHGGGTNTGAALNFTQHQVFVREKGSRIELGVQQVAV-VITDGCVET 455 Query: 247 SQAVMFCLMDVSGSMDQSTKDMAKRFYILLYLFLS---RTYKNVEVVYIRHHTQAKEV-- 301 +A +F L+D SGS+ K+F + + V Y T ++ Sbjct: 456 EEADIFFLIDQSGSIHPPDFYDMKKFILEFLQTFRVGPNHVRIGVVKYADSPTLEFDLHT 515 Query: 302 ------DEHEFFYS-QETGGTIVSSALKLMDEVVKERYNPAQWNI--YAAQASDGDNWAD 352 E Q GGT AL M + Y +DG + Sbjct: 516 YTDVKSLEKAITNIHQVGGGTETGKALDFMRPQFDRAVTTRGHKVKEYLVVITDG--NST 573 Query: 353 DSPLCHEILAKKLLPVVRYYSYIEITRRAHQTLWR 387 D + VV + L Sbjct: 574 DKVKDPADKLRAQGVVVYAIGV---KDAVEKELLE 605 Score = 61.4 bits (147), Expect = 5e-08, Method: Composition-based stats. Identities = 25/219 (11%), Positives = 50/219 (22%), Gaps = 27/219 (12%) Query: 226 VPFIDTFDLRYKNYEKRPDPSSQAVMFCLMDVSGSM-DQSTKDMAKRFYILLYLFLSRTY 284 P + ++QA + L+DVS S+ + F + S Sbjct: 829 DPDRGNPFIPAPAPLSYCKKTAQADIIFLVDVSTSILKEKAFPSVTVFMESVVNQSSVGP 888 Query: 285 KNVEVVYIRHHTQAKEVD-----EHEFFYSQE-------TGGTIVSSALKLMDEVVKERY 332 + I T + + + Q G T AL + + + Sbjct: 889 ELTRFGVITFSTGVQSIFTLKQYSSKRDVLQAVGAVTAPGGNTNTGDALDYSLQYFGKEH 948 Query: 333 NPA---QWNIYAAQASDG---DNWADDSPLCHEILAKKLLPVVRYYSYIEITRRAHQTLW 386 + +DG + P V +S A + Sbjct: 949 GGRAALKVPQILMVITDGAAQEPSKLPGPSEALR-----KQGVSVFSIGVK--NASREQL 1001 Query: 387 REYEHLQSTFDNFAMQHIRDQDDIYPVFRELFHKQNATA 425 F + + +Y ++ Sbjct: 1002 DIMAG-NDPSRVFFVDTFDALETLYKNISKVLCNHTKPV 1039 Score = 51.4 bits (121), Expect = 6e-05, Method: Composition-based stats. Identities = 14/107 (13%), Positives = 29/107 (27%), Gaps = 10/107 (9%) Query: 312 TGGTIVSSALKLMDEVV----KERYNPAQWNIYAAQASDGDNWADDSPLCHEILAKKLLP 367 GGT A+ + + + A +DGD+ D + +K Sbjct: 83 NGGTETGKAINFLRKQYFTKKAGSRADQRVPQIAVVITDGDST--DDVVVPARELRKHG- 139 Query: 368 VVRYYSYIEITRRAHQTLWREYEHLQSTFDNFAMQHIRDQDDIYPVF 414 + A+Q + + S F + + + Sbjct: 140 ---VIVFAIGVGNANQGELKSIANRPSERFKFTIDSFQALKRLTERL 183 >UniRef50_C3YUG3 Putative uncharacterized protein n=1 Tax=Branchiostoma floridae RepID=C3YUG3_BRAFL Length = 1096 Score = 76.1 bits (185), Expect = 2e-12, Method: Composition-based stats. Identities = 26/248 (10%), Positives = 67/248 (27%), Gaps = 26/248 (10%) Query: 190 ALEENLAIISNSEPAQLLEEERLRKEIAELRAKIERVPFIDTFDL-RYKNYEKRPDPSSQ 248 + L+ ++ P+ R + + + DL Y S+ Sbjct: 105 ESGDPLSYHNSQAPSPQNLRCGGRSAWDVMLEHPDFAGGANRPDLSPYAEPNFSVVRSTD 164 Query: 249 AVMFCLMDVSGSMDQS-TKDMAKRFYI-LLYLFLSRTYKNVEVVYIRHHTQAKEVDEHEF 306 + ++DVSGSM + +R + + V + + ++++ Sbjct: 165 RRVVLVLDVSGSMTGQGRMERLRRSVSTYILSTIEDGAWLGIVTFRGTSHKICDLEQLNG 224 Query: 307 FYSQE-------TG--------GTIVSSALKLMDEVV-----KERYNPAQWNIYAAQASD 346 +E G GT ++ A+ L +++ + + +D Sbjct: 225 DSVREEILNMTLDGLTNRTGKVGTNITRAVTLAVQILGPAVQDRKLGDSTGPRQMILITD 284 Query: 347 GDNWADDSPLCHEILAKKLLPVVRYYSYIEITRRAHQTLWREYEHLQSTFDNFAMQHIRD 406 G + ++ +L V + + + F+ Sbjct: 285 GRDRRLNN-SVIFMLQNDTAKGVVIDTIALGDGA--EEGLPLLSEVTGGQFFFSPDSDAG 341 Query: 407 QDDIYPVF 414 + Sbjct: 342 GSALDDAL 349 >UniRef50_Q5NWS3 Tellurium resistance protein n=2 Tax=Proteobacteria RepID=Q5NWS3_AZOSE Length = 349 Score = 76.1 bits (185), Expect = 2e-12, Method: Composition-based stats. Identities = 21/154 (13%), Positives = 40/154 (25%), Gaps = 16/154 (10%) Query: 247 SQAVMFCLMDVSGSMDQSTKDMAKRFYILLYLFLSRT------YKNVEVVYIRHH---TQ 297 + +F L+DVS SM + L L + + T Sbjct: 2 RRLPIFFLVDVSESMAGDNLRQLQEGLERLVRSLRADPYALETVFISVIAFAGKPKTLTP 61 Query: 298 AKEVDEHEFFYSQETGGTIVSSALKLMDEVVKER------YNPAQWNIYAAQASDGDNWA 351 E+ + GT + SA+ + + ++ W +DG Sbjct: 62 LVELYQFYAPRLPLGSGTSLGSAMAHLMDEMERTVQRSTPEKKGDWRPVVYLLTDGKPTD 121 Query: 352 DDSPLCHEILAKKLLPVVRYYSYIEITRRAHQTL 385 D P + + + + L Sbjct: 122 DIEP-AIKRWKRDFEERSNLVAIGVGKHASLSAL 154 >UniRef50_Q31JH9 Putative uncharacterized protein n=1 Tax=Thiomicrospira crunogena XCL-2 RepID=Q31JH9_THICR Length = 363 Score = 76.1 bits (185), Expect = 2e-12, Method: Composition-based stats. Identities = 25/217 (11%), Positives = 51/217 (23%), Gaps = 40/217 (18%) Query: 239 YEKRPDPSSQAVMFCLMDVSGSM-------DQSTKDMAK---RFYILLYLFLSRTYKNVE 288 P + + +++ S SM D + K L + Sbjct: 106 LPPEPQTKTVRDIVFVVETSVSMVLEDYQIDGEPQSRIKVIQTVLDQFISGL-AGNRFGF 164 Query: 289 VVYIRHH------TQAKEVDEHEFFYSQE--TGGTI--VSSALKLMDEVVKERYNPAQWN 338 ++Y T + G T AL L + ++ + + N Sbjct: 165 ILYADDAYTLMPLTSDATTARLMLKRLKPYLAGRTDEATGEALGLALQQAEKSTD-STEN 223 Query: 339 IYAAQASDGDNWADDSP-LCHEILAKKLLPVVRYYSYIEITRRAH-------------QT 384 SDG P A+ L + ++ A + Sbjct: 224 RIVVLISDGSTRDSRLPIAEAINYAQGLNIPIYTIGVGANSKDADKREFRGLLYEALESS 283 Query: 385 LWREYEHLQSTFDNFAMQHIRDQDDIYPVFRELFHKQ 421 ++ I D+ V + + + Sbjct: 284 SLKQIADQTQGRYY----QIGSGQDLQKVLQAIDQTE 316 >UniRef50_Q2BFM3 Possible D-amino acid dehydrogenase, large subunit n=1 Tax=Bacillus sp. NRRL B-14911 RepID=Q2BFM3_9BACI Length = 432 Score = 76.1 bits (185), Expect = 2e-12, Method: Composition-based stats. Identities = 37/320 (11%), Positives = 86/320 (26%), Gaps = 39/320 (12%) Query: 133 ALPNLKQNQQRQLTEYKTHRAGYTANGVPANISVVRSLQNSLARRTAMTAGKRRELHALE 192 P+ + + + + Y A V + + + + + E Sbjct: 22 EDPSAADSSDVKSVKIEEEPVQYPEAATSAEEIVKQQMGEKIEKAMDEG--SEEANMSEE 79 Query: 193 ENLAIISNSEPAQLLEEERLRKEIAELRAKIERVPFIDTFDLRYKNYEKRPDPSSQAVMF 252 E +A L A +++ + ++ + Y + + Sbjct: 80 EVMADFEAEGMTAEEVYNGLVHWFAVDYSEVNDA--LVNYEPAFGEYGAEEEEQQTKNIS 137 Query: 253 CLMDVSGSMDQS-----TKDMAKRFYILLYLFLSRTYKNVEVVYI--------------- 292 +D SGSM+ ++AK Y Sbjct: 138 IQIDSSGSMNGQVSGGVKMNLAKEAVENFAAGFPEDTIMTLRTYGHKGTGDDKDKAMSCA 197 Query: 293 ------RHHTQAKEVDEHEFFYSQETGGTIVSSALKLMDEVVKERYNPAQWNIYAAQASD 346 +T + + + +G T +++++K E +K++ N SD Sbjct: 198 STEVMYDANTYDQAAFKAALEKFKPSGWTPLAASIKAGYEDLKKKAGEDTEN-ILYIVSD 256 Query: 347 G-DNWADDSPLCHEILAK-KLLPVVRYYSYIEITRRAHQTLWREYEHLQSTFDNFAMQHI 404 G + D + LA L V + A Q ++ + + Sbjct: 257 GIETCEGDPVKEAKALADSDLNMKVHIIGF--DVDDAGQDQLKKTAEAGNGKYYTVNSKL 314 Query: 405 RDQDDIYPVFRELFHKQNAT 424 ++ EL + ++ Sbjct: 315 ----ELTNTLNELMGEAISS 330 >UniRef50_Q01T75 von Willebrand factor, type A n=2 Tax=Candidatus Solibacter usitatus Ellin6076 RepID=Q01T75_SOLUE Length = 320 Score = 76.1 bits (185), Expect = 2e-12, Method: Composition-based stats. Identities = 27/210 (12%), Positives = 56/210 (26%), Gaps = 26/210 (12%) Query: 238 NYEKRPDPSSQAVMFCLMDVSGSMDQSTKDMAKRFYILLYLFLSRTYKNVEVVYIRHH-- 295 + K + ++D SGSM + + L +R + V + Sbjct: 78 DIRKFKSEDVPVSLGLVIDNSGSMRN-KLQKVEAAALALVKASNRDDEVFIVNFNDTAYL 136 Query: 296 --------TQAKEVDEHEFFYSQETGGTIVSSALKLMDEVVKERYNPAQWNIYAAQASDG 347 T E GGT + A+++ + +K+ + +DG Sbjct: 137 DNPKDKDFTNDIGELEQALKRIDARGGTAMRDAIQMSIDHLKKGHRD---KKVLVVITDG 193 Query: 348 -DNWADDSPLCHEILAKKLLPVVRYYSYIEITRRAHQT------LWREYEHLQSTFDNFA 400 DN + + A + V Y +T H+ + F Sbjct: 194 NDNSSVINMERIMKNAHQ--SDVLIYGVGLLTEEEHREAARAKRALNDLAEATGGKTFFP 251 Query: 401 MQHIRDQDDIYPVFRELFHK---QNATAKG 427 V ++ + + + Sbjct: 252 KDLEEVDAIASQVAHDIRSQYTIEYSPTNA 281 >UniRef50_Q2GWQ0 Putative uncharacterized protein n=1 Tax=Chaetomium globosum RepID=Q2GWQ0_CHAGB Length = 1045 Score = 76.1 bits (185), Expect = 2e-12, Method: Composition-based stats. Identities = 32/288 (11%), Positives = 74/288 (25%), Gaps = 24/288 (8%) Query: 132 LALPNLKQNQQRQLTEYKTHRAGYTANGVPANISVVRSLQNSLARRTAMTAGKRRELHAL 191 L + ++ + + A G S TA E + Sbjct: 268 LEVEIVETEKIASIVSPSHAIAVTRRRGARTAQSFADLAGEDDRSNV-ETASVALESGRV 326 Query: 192 EENLAIISNSEPAQLLEEERLRKEIAELRAKIERVPFIDTFDLRYKNYEKRPDPSSQAVM 251 + + + E + + E + L ++ P Q+ + Sbjct: 327 FLDKDFVLDIVTTPDGNNENPQAWLEEHPSLPN--HKTLMLTLPSGFLARKAPPVQQSEI 384 Query: 252 FCLMDVSGSMDQSTKDMAKRFYILLYLFLSRTYKNVEVVYIR-HHTQAKEVDEHEFFYSQ 310 L D SGSM + + + + K + + + + ++ Sbjct: 385 LFLADCSGSMKDKIRPL-RSAMQFFLKGIPEGRKFNIWCFGTKYASWQPQSVDYTEESLN 443 Query: 311 E------------TGGTIVSSALKLMDEVVKERYNPAQWNIYAAQASDGDNWADDSPLCH 358 GGT + A++ + ++ +DG W + L Sbjct: 444 SALSWVERDMRANMGGTELLPAVQAIVAAREKALMTD-----VIVLTDGQTWRLEQTLDL 498 Query: 359 EILAKKL-LPVVRYYSYIEITRRAHQTLWREYEHLQSTFDNFAMQHIR 405 + L VR+++ +H L + + + Sbjct: 499 IQKTRGLTEGRVRFFALGIGRAVSH-ALVDGIAKAGGGYAEVVQEASQ 545 >UniRef50_C3ZCS4 Putative uncharacterized protein n=1 Tax=Branchiostoma floridae RepID=C3ZCS4_BRAFL Length = 949 Score = 76.1 bits (185), Expect = 2e-12, Method: Composition-based stats. Identities = 25/161 (15%), Positives = 46/161 (28%), Gaps = 20/161 (12%) Query: 246 SSQAVMFCLMDVSGSMDQSTKDMAKRFYILLYLFLSRT---YKNVEVVYIRHHTQ----- 297 S++ + ++D SGS+ + F + + V Y T Sbjct: 24 SAKLDLMLVLDGSGSVGDADFAKTLEFAENVVNAFDIGTDLTRVGVVQYSDTPTMEFNLG 83 Query: 298 ---AKEVDEHEFFYSQ-ETGGTIVSSALKLMDEVVKERYNPAQWNIYAAQASDGDNWADD 353 K Q + GGT +AL+ + A +DG + DD Sbjct: 84 VHADKGSTIAAVNNIQYQNGGTATGAALEFAR--ANANWRGAPVPKVMIVVTDGKS-GDD 140 Query: 354 SPLCHEILAKKLLPVVRYYSYIEITRRAHQTLWREYEHLQS 394 + LA V Y+ ++ + + Sbjct: 141 VTAAAQALA---GEGVAVYAIGVG--NYDLPELQQIANGNN 176 Score = 58.4 bits (139), Expect = 5e-07, Method: Composition-based stats. Identities = 28/230 (12%), Positives = 58/230 (25%), Gaps = 30/230 (13%) Query: 197 IISNSEPAQLLEEERLRKEIAELRAKIERVPFIDTFDLRYKNYEKRPDPSSQAVMFCLMD 256 + + L R ++ L K + + + P+ A + + Sbjct: 640 YSPPALSSAALASVLCRVQMEVLSTKAAVLASVPMAGMDLPAQTCVRIPALSAELTLV-- 697 Query: 257 VSGSMDQSTKDMAKRFYILLYLFLSRTYKNVEVVYIR--------HHTQAKEVDEHEFFY 308 + K L + LS + Y + K + Sbjct: 698 ------GLQWNPVKWTMYRLLVTLSVASAVGVIQYSSTVQEEFSLNAHFTKTAVLNAIDN 751 Query: 309 SQ-ETGGTIVSSALKLMDEVVKERYNPAQWNIYAAQASDGDNWADDSPLCHEILAKKLLP 367 GGT+ +A+ M + + R A +DG + D P Sbjct: 752 IVYMGGGTLTGAAITYMKDNSQWRP---GVAKIAIVVTDGKSSDDVGPPSSAAQ----QT 804 Query: 368 VVRYYSYIEITRRAHQTLWREYEHLQSTFDNFAMQHIRDQDDIYPVFREL 417 + ++ QT + + + D D + +L Sbjct: 805 GITMHAIGVGA-NVDQTELSQIAS----TSQYVTT-VADYDALDAQMAQL 848 Score = 49.9 bits (117), Expect = 2e-04, Method: Composition-based stats. Identities = 19/97 (19%), Positives = 33/97 (34%), Gaps = 12/97 (12%) Query: 241 KRPDPSSQAVMFCLMDVSGSMDQSTKDMAKRFYILLYLFLSRTY---KNVEVVYIRHHT- 296 P ++ +F ++D SGS+ + D K+F + + + V Y +T Sbjct: 505 PAPTCNAPLDLFFVLDGSGSVTGANFDKVKQFTKNVVNAFDISATATRVGVVQYSDSNTL 564 Query: 297 -------QAKEVDEHEFFYSQ-ETGGTIVSSALKLMD 325 K + GGT SAL+ Sbjct: 565 EFNLGDHADKPSTLAAIDSIVYQGGGTTTGSALEFAR 601 >UniRef50_A9B368 Conserved hypothetical membrane protein n=1 Tax=Herpetosiphon aurantiacus ATCC 23779 RepID=A9B368_HERA2 Length = 330 Score = 76.1 bits (185), Expect = 2e-12, Method: Composition-based stats. Identities = 34/247 (13%), Positives = 61/247 (24%), Gaps = 43/247 (17%) Query: 218 ELRAKIERVPFIDTFDLRYKNYEKRPDP--SSQAVMFCLMDVSGSMDQSTKD------MA 269 L + + L Y + + + +D+S SM D +A Sbjct: 55 VLISLRAAAVGLLVVVLTRPQYAQSSERVVREGIDIQLALDISLSMKAGDFDPKDRITVA 114 Query: 270 KRFYILLYLFLSRTYKNVEVVYIRHHTQAKEVDEHEFFYSQETG---------GTIVSSA 320 K + + VV+ H + F G GT + A Sbjct: 115 KEVIAEFVKG-RKDDRIGLVVFSGHAFTQVPLTLDYDFLQNLLGQVQTVRRPDGTAIGLA 173 Query: 321 LKLMDEVVKERYNPAQWNIYAAQASDGDNWADD-SPLCHEILAKKLLPVVRYYSYIE-IT 378 L ++ + +DG N D P +A+ L V + Sbjct: 174 LAHSVNGLRNSTTK---SKVVILLTDGSNNRGDIEPAQAAEIARALDVRVYTILVGKPGN 230 Query: 379 RRA-------------------HQTLWREYEHLQSTFDNFAMQHIRDQDDIYPVFRELFH 419 + R+ F + D+Y ++ Sbjct: 231 GEYPVHDPWRDETYLIPAPTAEDEVALRDIAEQTGGIF-FRAGDEQGLRDVYDTIDKMER 289 Query: 420 KQNATAK 426 Q A+ K Sbjct: 290 SQVASEK 296 >UniRef50_Q1LEM9 von Willebrand factor, type A n=2 Tax=Burkholderiaceae RepID=Q1LEM9_RALME Length = 334 Score = 76.1 bits (185), Expect = 2e-12, Method: Composition-based stats. Identities = 23/210 (10%), Positives = 49/210 (23%), Gaps = 31/210 (14%) Query: 233 DLRYKNYEKRPDPSSQAVMFCLMDVSGSMDQSTK-----------DMAKRFYILLYLFLS 281 +Y + + + +D+S SMD D ++ Sbjct: 79 RPQYLEPPLQRTQPVR-DLLLALDLSQSMDTRDFKTPSGVLEPRVDAVRQVVADFVAR-R 136 Query: 282 RTYKNVEVVYIRHH------TQAKEVDEHEFFYS---QETGGTIVSSALKLMDEVVKERY 332 + +V+ T + T + A+ L ++ + + Sbjct: 137 TGDRIGLIVFGDAPYPLAPFTLDHALVRELLADMVPGMAGASTSLGDAIGLGIKMFDQSH 196 Query: 333 NPAQWNIYAAQASDGDNWADD-SPLCHEILAKKLLPVVRYYSYI----EITRRAHQTLWR 387 +DG++ A P +AK VV ++ + Sbjct: 197 AQE---KVMILLTDGNDTASRMPPAQAADIAKTRGVVVHTVGIGDPATTGEQKVDLDALK 253 Query: 388 EYEHLQSTFDNFAMQHIRDQDDIYPVFREL 417 F IY + Sbjct: 254 HIASTTGGRYFFGADQTS-LASIYATLDRV 282 >UniRef50_UPI0000EB1CF0 UPI0000EB1CF0 related cluster n=1 Tax=Canis lupus familiaris RepID=UPI0000EB1CF0 Length = 440 Score = 76.1 bits (185), Expect = 2e-12, Method: Composition-based stats. Identities = 23/193 (11%), Positives = 55/193 (28%), Gaps = 21/193 (10%) Query: 245 PSSQAVMFCLMDVSGSMDQSTKDMAKRFYILLYLFLSRTYKNVEVVYIRHHTQAKEVD-- 302 + + ++D S S+ ++ K L ++ + I + + ++V Sbjct: 7 KETPLELMFVIDSSESVGLENFEIIKSLVKTLSDQVALDLATARIGIINYSHKVEKVAHL 66 Query: 303 ---------EHEFFYSQETG-GTIVSSALKLMDEVVKERYNPAQWNIYAAQASDGDNWAD 352 + Q G GT ++AL + + + A +DG + Sbjct: 67 TQFSNKDDFKLAVDNMQYLGEGTYTATALHEANHMFEAARP--GVKKVALVITDGQTDSR 124 Query: 353 DSPLCHEILAKKLLPVVRYYSYIEI-TRRAHQTL----WREYEHLQSTFDNFAMQHIRDQ 407 D E++ + V + + + + + Sbjct: 125 DEKNLTEVVKRASDINVEIFVIGVAKKNDPNFEMFHKEMNLIATDPDSEHVYQFDDFITL 184 Query: 408 DDIYPVFRELFHK 420 D ++LF K Sbjct: 185 QDTLK--QKLFKK 195 >UniRef50_B2KDS8 von Willebrand factor type A n=1 Tax=Elusimicrobium minutum Pei191 RepID=B2KDS8_ELUMP Length = 335 Score = 76.1 bits (185), Expect = 3e-12, Method: Composition-based stats. Identities = 25/220 (11%), Positives = 51/220 (23%), Gaps = 46/220 (20%) Query: 239 YEKRPDPSSQAVMFCLMDVSGSMDQSTK-----DMAKRFYILLYLFLSRTYKNVEVVYIR 293 EK + + +DVS SM + AK +L + V + Sbjct: 81 VEKINVTAQSSHSVIAVDVSDSMKARDLKPTRLENAKTMLKMLISAKGEQ-RTGIVAFTS 139 Query: 294 HH------TQAKEVDEHEFFYSQET----GGTIVSSALKLMDEVVKERYNPAQWNIYAAQ 343 T E ++ + GT ++ A++ E++ + Sbjct: 140 KAYTQCPITNDVEALKYFVNQLRPEMLNAKGTALAPAVQRAAEMLSKYPGK----KALIL 195 Query: 344 ASDGDNWADDSPLCHEILAKKLLPVVRYYSYIEITRRA---------------------- 381 +DG++ + A+K + Sbjct: 196 LTDGEDHEPEQIEEAIKTAQKEGIKIIAVGIGTEEGEPIPEKIEGGRVLEYKKDADGKTV 255 Query: 382 ----HQTLWREYEHLQSTFDNFAMQHIRDQDDIYPVFREL 417 + +E I V +L Sbjct: 256 ITKLDEKSLKELASKTGGVYIKYKNAQTVAAQIAQVLEDL 295 >UniRef50_B2UZB2 von Willebrand factor type A domain protein n=1 Tax=Clostridium botulinum E3 str. Alaska E43 RepID=B2UZB2_CLOBA Length = 984 Score = 76.1 bits (185), Expect = 3e-12, Method: Composition-based stats. Identities = 18/112 (16%), Positives = 32/112 (28%), Gaps = 9/112 (8%) Query: 247 SQAVMFCLMDVSGSMDQSTKDMAKRFYILLYLFLS--RTYKNVEVVYIRH------HTQA 298 + + ++D SGSM S K + + V Y + Sbjct: 88 PKKEIVLVLDTSGSMKDSKIKKMKNAAMEFVNKIKKIPNLDIDIVTYSTSGYTYLNNGNT 147 Query: 299 KEVDEHEFFYSQETGGTIVSSALKLMDEVVKERYNPAQWNIYAAQASDGDNW 350 +E + GGT L+ + ++ N + SDG Sbjct: 148 EEDLLKIINSIKADGGTNTGEGLRKANYILDLEKNKNA-DKSIVFMSDGMPT 198 >UniRef50_A0C3V7 Chromosome undetermined scaffold_148, whole genome shotgun sequence n=1 Tax=Paramecium tetraurelia RepID=A0C3V7_PARTE Length = 802 Score = 76.1 bits (185), Expect = 3e-12, Method: Composition-based stats. Identities = 24/180 (13%), Positives = 47/180 (26%), Gaps = 22/180 (12%) Query: 248 QAVMFCLMDVSGSMDQSTKDMAKRFYILLYLFLSRTYKNVEVVYIR-----HHTQAK--- 299 Q + +D S SM+ + +AK+ L L + T K Sbjct: 310 QRNITFFLDQSNSMEGTKIMLAKQGLQLFIRSLPSECYFNIYSFGSKWRKIFQTYQKICE 369 Query: 300 EVDEHEFFYSQE----TGGTIVSSALKLMDEVVKERYNPAQWNIYAAQASDGDNWADDSP 355 V + + G T + A+ + + +DG Sbjct: 370 PVLKQCEKEIKMMEGNMGSTFLLGAMNDAL--LNSVKSSDATW---FILTDGR---IAEI 421 Query: 356 LCHEILAKKLLPVVRYYSYIEITRRAHQTLWREYEHLQSTFDNFAMQHIRDQDDIYPVFR 415 L K P +R +S Q + + + + F + + + Sbjct: 422 EEIFALLKSN-PHIRVFSLGFGL-EFDQEIVEQLANQTNGSCIFCQNVQSLNSQMIQLLQ 479 >UniRef50_Q7XTB9 OSJNBa0068L06.4 protein n=7 Tax=Oryza sativa RepID=Q7XTB9_ORYSJ Length = 724 Score = 75.7 bits (184), Expect = 3e-12, Method: Composition-based stats. Identities = 28/215 (13%), Positives = 53/215 (24%), Gaps = 22/215 (10%) Query: 151 HRAGYTANGVPANISVVRSLQNSLARRTAMTAGKRRELHALEENLAIISNSEPAQLLEEE 210 R T + + L A A + E + Sbjct: 217 KRTTTTDDHKRKSYDDDEPLLAPKAAAGAFNPIPEDDEDDATEFRGFFPAR--PRSGLAV 274 Query: 211 RLRKEIAELRAKIERVPFIDTFDLRYKNYEKRPDPSSQAVMFCLMDVSGSMDQSTKDMAK 270 L + A + A ++ ++ P + + ++DVS M M K Sbjct: 275 TLAPDAALVSAGRRHGKYVVAVRVKAPALRSSPSTRAPIDLVTVLDVSQGMMGDKLHMLK 334 Query: 271 RFYILLYLFLSRTYKNVEVVYIRHHTQAKEV----------DEHEFFYSQ----ETG--- 313 R L+ L + V + + + G Sbjct: 335 RGMRLVIASLGPADRLAIVAFSGAAKRLLPLRRMTRQGQRSARQIVDRLVVCAAAQGQEQ 394 Query: 314 --GTIVSSALKLMDEVVKERYNPAQWNIYAAQASD 346 V AL+ +V+++R + SD Sbjct: 395 PQAVCVGDALRKATKVLEDRRDRNPVATV-MLLSD 428 >UniRef50_Q055Y9 Putative uncharacterized protein n=4 Tax=Leptospira RepID=Q055Y9_LEPBL Length = 379 Score = 75.7 bits (184), Expect = 3e-12, Method: Composition-based stats. Identities = 19/171 (11%), Positives = 50/171 (29%), Gaps = 22/171 (12%) Query: 253 CLMDVSGSMDQ-----STKDMAKRFYILLYLFLSRTYKNVEVVYIR-----HHTQAKEVD 302 ++D SGSM++ +AK+ L + + Y ++ E Sbjct: 69 FIVDASGSMNEYLGIYQKIHLAKKHVSRYISTLPTETEIGFIAYGNRIPGCSSSRLYEPL 128 Query: 303 EHEF--------FYSQETGGTIVSSALKLMDEVVKERYNPAQWNIYAAQASDGDNWADDS 354 + E F +G T ++ ++++ ++ +R + +DG Sbjct: 129 QRENHGTFKNRLFSLTPSGATPLAESIRIAGNLISQRKKETE----IILITDGVESCYGD 184 Query: 355 PLCHEILAKKLLPVVRYYSYIEITRRAHQTLWREYEHLQSTFDNFAMQHIR 405 P K+ +++ + + + + Sbjct: 185 PKKELQALKQQGIYFKFHILGLGLKPDEERKMKILAEEGNGKYFGIEDDSS 235 >UniRef50_C4DP19 von Willebrand factor type A-like protein n=1 Tax=Stackebrandtia nassauensis DSM 44728 RepID=C4DP19_9ACTO Length = 626 Score = 75.7 bits (184), Expect = 3e-12, Method: Composition-based stats. Identities = 23/209 (11%), Positives = 46/209 (22%), Gaps = 26/209 (12%) Query: 240 EKRPDPSSQAVMFCLMDVSGSM------DQSTKDMAKRFYILLYLFLSRTYKNVEVVYIR 293 + + +D SGSM + + A+ I + + K Y Sbjct: 29 APASAADNDGELLMALDASGSMEESDGAGNTKMETARDAVIDVAEAMPGHAKVGLRAYGP 88 Query: 294 HHTQAKEVDEH---------------EFFYSQETGGTIVSSALKLMDEVVKERYNPAQWN 338 T + + G T ++ +L+ + A+ Sbjct: 89 ASTGSGCKASKELVPIDKIDADAITTAATELKPEGDTPIAYSLEKA----AGDFTEAKGP 144 Query: 339 IYAAQASDGDNWADDSPLCHEILAKKLLPVVRYYSYIEITRRAHQTLWREYEHLQSTFDN 398 SDG+ P+ +R + A + E Sbjct: 145 KTILLVSDGEETCGGDPVKVAEKIASQGVDLRVHVIGFQVDDATRKQLTEIAKAGKGSYY 204 Query: 399 FAMQHIRDQDDIYPVFRELFHK-QNATAK 426 A + Q + Sbjct: 205 DAQDGPALASRLKRASESALRPYQTTGTQ 233 >UniRef50_UPI0000E80A5E PREDICTED: similar to calcium-activated chloride channel n=2 Tax=Gallus gallus RepID=UPI0000E80A5E Length = 928 Score = 75.7 bits (184), Expect = 3e-12, Method: Composition-based stats. Identities = 22/243 (9%), Positives = 67/243 (27%), Gaps = 26/243 (10%) Query: 196 AIISNSEPAQLLEEERLRKEIAEL-RAKIERVPFIDTFDLRYKNYEKRPDPSSQAVMFCL 254 ++ P + + + + R + + + + + Sbjct: 252 NTHNSEAPNMQNKMCNYKSTWEIIMESDDFRNSSVVNSLVPPFETTFELLQTQDRAVSLV 311 Query: 255 MDVSGSMD-QSTKDMAKRFYI-LLYLFLSRTYKNVEVVYIRHHTQAKEVDE----HEFFY 308 +DVSGSM+ + + L + + V + + + + Sbjct: 312 LDVSGSMNTNNRITNLRTAAEVFLIQIIEIGSRVGIVTFESSAYEKSPLLQITSVATRQR 371 Query: 309 S------QETGGTIVSSALKLMDEVVKERYNPAQWNIYAAQASDGDNWADDSPLCHEILA 362 GGT + + ++ E++ + +DG++ + Sbjct: 372 LVQNLPTTAGGGTKICAGIEKGLEIITNAIGTTYGSE-IVLLTDGED---STMSLCREKV 427 Query: 363 KKLLPVVRYYSYIEITRRAHQTLWREYEHLQSTFDNFAMQHIRDQDDIYPVFRELFHKQN 422 K+ ++ + + E+ ++ +A+ D+ E F + Sbjct: 428 KESGAIIHTIALG----PSAAKELEEFSNITGGLQLYAVDV-----DVPSKLVEAFSEIT 478 Query: 423 ATA 425 + Sbjct: 479 TGS 481 >UniRef50_C6X1I4 BatB n=2 Tax=Flavobacteriaceae RepID=C6X1I4_FLAB3 Length = 335 Score = 75.7 bits (184), Expect = 3e-12, Method: Composition-based stats. Identities = 22/199 (11%), Positives = 48/199 (24%), Gaps = 47/199 (23%) Query: 240 EKRPDPSSQAVMFCLMDVSGSMDQSTKDM--AKRFYILLYLFLSR--TYKNVEVVYIRHH 295 E+ + L+DVS SM+ + ++ L+ + + K +V+ Sbjct: 81 EEVETKQKMNNVIFLLDVSNSMNAQDVEQNRLQQAKNLIINAMGKMTNDKVGIIVFAGEA 140 Query: 296 TQAKEVDEHEFF----------YSQETGGTIVSSALKLMDEVVKERYNPAQWNIYAAQAS 345 + + + GT A++ + + A+ + S Sbjct: 141 SSIMPLTTDFTAVETYVGGVETSIVKMQGTDFLKAMQTAADKFRNV---AKGSRKVVLLS 197 Query: 346 DGDNWADDSPLCHEILAKKLLPVVRYYSYIEITRR--------------AHQTL------ 385 DG++ + LA +R S + L Sbjct: 198 DGEDNEG-NEKAAAKLAN--REGIRVISVGIGSEEGAPIPEYVFGQLMGYKTDLSGQTVI 254 Query: 386 -------WREYEHLQSTFD 397 Sbjct: 255 SKRQTAALSNIADNTGGTY 273 >UniRef50_UPI000194D9FE PREDICTED: similar to matrilin 4 n=2 Tax=Neognathae RepID=UPI000194D9FE Length = 580 Score = 75.7 bits (184), Expect = 3e-12, Method: Composition-based stats. Identities = 24/197 (12%), Positives = 51/197 (25%), Gaps = 23/197 (11%) Query: 237 KNYEKRPDPSSQAVMFCLMDVSGSMDQSTKDMAKRFYILLYLFLSRTYKNVEVVYIRHHT 296 + + ++D S S+ + +RF + + L V I++ + Sbjct: 21 PKPAALKCRTGPLDIVFVIDSSRSVRPFEFETMRRFMMDIIGNLDVGPNATRVGVIQYSS 80 Query: 297 QAKEV-----------DEHEFFYSQE-TGGTIVSSALKLMDEVV-----KERYNPAQWNI 339 Q + + E GT+ A++ V R + Sbjct: 81 QVQNIFSLKTFFTRADMERAINSIIPLAQGTMTGLAIQYAMNVAFTTQEGARPLHKRIPR 140 Query: 340 YAAQASDGDNWADDSPLCHEILAKKLLPVVRYYSYIEITRRAHQTLWREYEHLQSTFDNF 399 A +DG D A+ + Y+ + R F Sbjct: 141 IAIVVTDGRP--QDRVTEVATQARNA--GIEIYAVGIQRADMNS--LRAMASPPLEEHVF 194 Query: 400 AMQHIRDQDDIYPVFRE 416 ++ F++ Sbjct: 195 LVESFELIQQFAKQFQD 211 Score = 60.3 bits (144), Expect = 1e-07, Method: Composition-based stats. Identities = 23/162 (14%), Positives = 45/162 (27%), Gaps = 23/162 (14%) Query: 248 QAVMFCLMDVSGSMDQSTKDMAKRFYILLYLFL---SRTYKNVEVVYIR-----HH---- 295 + ++D S S+ ++ K+F + L + V Y Sbjct: 343 HVDLVMVIDGSKSVRPQNFELVKQFVNRIVDLLEVSPHGTRVGLVQYSSRVRTEFPLNKY 402 Query: 296 TQAKEVDEHEFFYSQETGGTIVSSALKLMDE----VVKERYNPA-QWNIYAAQASDGDNW 350 A E+ + GT+ ALK M E ++ + +DG Sbjct: 403 HSADEIKKAVMDVEYMEKGTMTGLALKHMVEHSFSELEGARPLSYNIPRIGLVFTDGR-- 460 Query: 351 ADDSPLCHEILAKKLLPVVRYYSYIEITRRAHQTLWREYEHL 392 + D AK+ + ++ + R Sbjct: 461 SQDDISEWARRAKE--SGIVMFAVGVGKAV--EEELRAIASE 498 >UniRef50_A8TM27 Putative Ser protein kinase n=1 Tax=alpha proteobacterium BAL199 RepID=A8TM27_9PROT Length = 318 Score = 75.7 bits (184), Expect = 3e-12, Method: Composition-based stats. Identities = 32/75 (42%), Positives = 45/75 (60%) Query: 4 FIDRRLNGKNKSMVNRQRFLRRYKAQIKQSISEAINKRSVTDVDSGESVSIPTEDISEPM 63 IDRR N + KS+ NRQRFLRR K Q+ +++ +A +R + DV GE + IPT+ ++EP Sbjct: 230 IIDRRRNSQGKSLANRQRFLRRAKRQVTEAVRQASAERRIRDVADGEQIVIPTDGLNEPR 289 Query: 64 FHQGRGGLRHRVHPG 78 F LR H Sbjct: 290 FRHDARRLRLDRHAA 304 >UniRef50_D2QI99 von Willebrand factor type A n=1 Tax=Spirosoma linguale DSM 74 RepID=D2QI99_9SPHI Length = 320 Score = 75.7 bits (184), Expect = 3e-12, Method: Composition-based stats. Identities = 27/213 (12%), Positives = 58/213 (27%), Gaps = 37/213 (17%) Query: 250 VMFCLMDVSGSMDQSTK-----DMAKRFYILLYLFLSRTYKNVEVVYIRHH--------- 295 F L+DVS SMD + K L L + ++ Sbjct: 79 DTFLLVDVSRSMDAGDIVPTRLERVKYDIQQLCDTL-PADRFGLILAAPQSILLSPLTAD 137 Query: 296 -TQAKEVDEHEFFYSQETGGTIVSSALKLMDEVVKERYNPAQWNIYAAQASDGDNWADDS 354 K+ TG T + +A+ + + + + + Q SDG+N++ Sbjct: 138 HDALKQFIREVHTSISPTGETDLCNAIAMARQKLIDDSSTHQSVRAIVLFSDGENFSSCE 197 Query: 355 PLCHEILAKKLLPVVRYYSYI--------------------EITRRAHQTLWREYEHLQS 394 L + + + + ++ +E Sbjct: 198 QTELARL-RSFGLPLVTVGVGTEAGASIRKGSDFVRDDNGQIVNSQLNRPFLQELARDSR 256 Query: 395 TFDNFAMQHIRDQDDIYPVFRELFHKQNATAKG 427 A + R +++ + R L + + Sbjct: 257 GQYIEADANGRYVNELAAILRLLKGRAIDQHRA 289 >UniRef50_UPI0001789223 von Willebrand factor type A n=1 Tax=Geobacillus sp. Y412MC10 RepID=UPI0001789223 Length = 968 Score = 75.7 bits (184), Expect = 3e-12, Method: Composition-based stats. Identities = 22/195 (11%), Positives = 50/195 (25%), Gaps = 29/195 (14%) Query: 247 SQAVMFCLMDVSGSM--------DQSTKDMAKRFYILLYLFLS-RTYKNVEV-----VYI 292 + ++D SGSM + AK + ++ V Sbjct: 69 MPNDVVLIIDKSGSMAPTYGPNNGEDKMTNAKEAAKGFVDLMDMTKHRVAVVDFSSSASS 128 Query: 293 RHHTQAKEVDEHEFFYSQETGGTIVSSALKLMDEVVKERYNPAQWNIYAAQASDGDNWA- 351 T K+ + GGT +A+ ++ + A +DG Sbjct: 129 FPFTVDKDAAKSYINTINSGGGTATGNAIDAAVALLADHRTEA--QPVIVLMTDGAATES 186 Query: 352 --DDSPLCHEILAKKLLPVVRYYSYIEITRRAHQT--------LWREYEHLQSTFDNFAM 401 + P + + + Y ++ L + + + + Sbjct: 187 PKNTDPFDYALQRAQAAKDAGVIFYTIALLNPNEDPITSAPNVLMKNMA--TTATHHHFV 244 Query: 402 QHIRDQDDIYPVFRE 416 + + IY + Sbjct: 245 LGSKGLNQIYAAIVK 259 >UniRef50_UPI00016C09D7 hypothetical protein Epulo_01596 n=1 Tax=Epulopiscium sp. 'N.t. morphotype B' RepID=UPI00016C09D7 Length = 262 Score = 75.7 bits (184), Expect = 3e-12, Method: Composition-based stats. Identities = 23/168 (13%), Positives = 50/168 (29%), Gaps = 19/168 (11%) Query: 240 EKRPDPSSQAVMFCLMDVSGSMDQSTKDMAKRFYILLYLFLSR-----TYKNVEVVYIRH 294 E P + +F ++D SGSM + L +++ + Sbjct: 6 EIDEMPRKELHVFYVLDTSGSMTGVPIAALNTAMEECTVALKDLAKKNADAKLKIAVLEF 65 Query: 295 HTQAKEVD---------EHEFFYSQETGGTIVSSALKLM-DEVVKERYNPAQW---NIYA 341 T AK V E E+ + G T + +AL+ + ++ + + + Sbjct: 66 STGAKWVTYNGPESLDDEFEWEHLSAGGVTDIGAALRELDIKLSRNGFLKSMTGALMPVI 125 Query: 342 AQASDGDNWADDSPLCHEILAKKLLPVVRYYSYIEITRRAHQTLWREY 389 +DG + + E+ + + A + Sbjct: 126 IFMTDGYPTDEYAAALAELRKNRWYTSSTKIGFAIG-DDADAAIISSI 172 >UniRef50_UPI00006A1B4F Collagen alpha-3(VI) chain precursor. n=1 Tax=Xenopus (Silurana) tropicalis RepID=UPI00006A1B4F Length = 556 Score = 75.7 bits (184), Expect = 3e-12, Method: Composition-based stats. Identities = 26/198 (13%), Positives = 58/198 (29%), Gaps = 26/198 (13%) Query: 249 AVMFCLMDVSGSMDQSTKDMAKRFYILLYLFLSRTYKNVEVVYIRHHTQAKEV------- 301 A + L+D S S++ + K F + V++ I+ ++ KE Sbjct: 1 ADIVFLVDSSASINSDDYETMKEFMESMVKQAEIGPDRVQIGLIQFSSETKEEFPLNRYK 60 Query: 302 -------------DEHEFFYS-QETGGTIVSSALKLMDEVVKERY-NPAQWNIYAAQASD 346 F Q +G T + AL+ Y + +D Sbjct: 61 KQFSLNTYSTKLDILKAVFSLPQVSGYTYTAKALEYTRIRFGTSYGGRPGISHILILVTD 120 Query: 347 GDNWADDSPLCHEILAKKLLPVVRYYSYIEITRRAHQTLWREYEHLQSTFDNFAMQHIRD 406 G D P + + ++ + ++ F +Q+ + Sbjct: 121 GATTEADRPNLPIVSKALKDDGIIVFAVGVGKAVPQE--LQQIA--GYPDRWFLVQNYKG 176 Query: 407 QDDIYPVFRELFHKQNAT 424 D+I+ ++ ++ Sbjct: 177 LDNIHDNITQVVCDESKP 194 >UniRef50_A7SFM5 Predicted protein n=1 Tax=Nematostella vectensis RepID=A7SFM5_NEMVE Length = 417 Score = 75.7 bits (184), Expect = 3e-12, Method: Composition-based stats. Identities = 21/134 (15%), Positives = 36/134 (26%), Gaps = 19/134 (14%) Query: 249 AVMFCLMDVSGSMDQSTKDMAKRFYILLYLFLSRTYKNVEVVYIRHHT--------QAKE 300 A L+D SGSM K IL L + + ++ +E Sbjct: 291 AEFIFLVDRSGSMSGKHIFQVKEMLILFLKSLPANCYFNLIGFGSYYRSVYQETQIYDEE 350 Query: 301 VDEHEFF---YSQET-GGTIVSSALKLMDEVVKERYNPAQWNIYAAQASDGDNWADDSPL 356 EH + GGT + L+ + + +DG + Sbjct: 351 TAEHACNYVQKMRADLGGTNLILPLEFIFNQ----PPKKGIPRFVFMLTDG---GVSNTT 403 Query: 357 CHEILAKKLLPVVR 370 ++ R Sbjct: 404 EVIDFVRRNAYSTR 417 >UniRef50_Q0KB82 von Willebrand factor (VWF) type A domain n=7 Tax=Proteobacteria RepID=Q0KB82_RALEH Length = 345 Score = 75.7 bits (184), Expect = 3e-12, Method: Composition-based stats. Identities = 22/203 (10%), Positives = 49/203 (24%), Gaps = 30/203 (14%) Query: 240 EKRPDPSSQAVMFCLMDVSGSMDQSTK-----------DMAKRFYILLYLFLSRTYKNVE 288 + +D+S SMD ++ + Sbjct: 85 APIQKVQPARDLLIALDLSQSMDTRDFGDPSGALIPRVQAVRQVVSGFVAR-RPGDRIGL 143 Query: 289 VVYIRHH------TQAKEVDEHEFFYS---QETGGTIVSSALKLMDEVVKERYNPAQWNI 339 +V+ T ++ + T + A+ L ++ + P Sbjct: 144 IVFGDAPYPLAPFTLDHQLVQTLITGLLPGMAGPSTALGDAIGLGIKMFEHSEAPE---K 200 Query: 340 YAAQASDGDNWADDSPLCHEI-LAKKLLPVVRYYSYIE----ITRRAHQTLWREYEHLQS 394 +DG++ A P +AK+ VV + + + + Sbjct: 201 VLIVLTDGNDTASRMPPERAGGIAKERKVVVHTIGIGDPNASGEEKVDLGVLQRLAAQTG 260 Query: 395 TFDNFAMQHIRDQDDIYPVFREL 417 F + IY + Sbjct: 261 GRYFFGADQAG-LETIYATLDRI 282 >UniRef50_C0GKG1 von Willebrand factor type A n=1 Tax=Dethiobacter alkaliphilus AHT 1 RepID=C0GKG1_9FIRM Length = 272 Score = 75.7 bits (184), Expect = 3e-12, Method: Composition-based stats. Identities = 26/223 (11%), Positives = 52/223 (23%), Gaps = 21/223 (9%) Query: 193 ENLAIISNSEPAQLLEEERLRKEIAELRAKIERVPFIDTFDLRYKNYEKRPDPSSQ-AVM 251 E + ++P Q + L + A R+ D LR +++ D + + Sbjct: 35 EGFYTLRETQPRQHGCRDSLAVAETMICAARRRLTSGDAQFLRNEDFRVIKDGGAPPLEV 94 Query: 252 FCLMDVSGSMDQSTKDMAKRFYILLYLFLSRTYKNVEVVYIR-------HHTQAKEVDEH 304 L+D SGSM+ K L + + + T+ + Sbjct: 95 CLLVDTSGSMNGKRIREVKTLADNLVRQMHE--PLSLITFQEGDVGVKVRSTRNDLMVRR 152 Query: 305 EFFYSQETGGTIVSSALKLMDEVVKERYNPAQWNIYAAQASDGDNWADDSP-----LCHE 359 G T + ++ + R +DG E Sbjct: 153 GLAAMSAAGLTPMGEGIRTAVNYLCGRRGKKH---LVILITDGLPTWASGDKDPYLDAIE 209 Query: 360 ILAKKLLPVVRYYSYIEITRRAHQTLWREYEHLQSTFDNFAMQ 402 A + + + Sbjct: 210 AGALIKKHKMHLICIGL---EPQRKFLEKLAESADASLYIVDD 249 >UniRef50_B9XQJ5 von Willebrand factor type A n=1 Tax=bacterium Ellin514 RepID=B9XQJ5_9BACT Length = 346 Score = 75.7 bits (184), Expect = 3e-12, Method: Composition-based stats. Identities = 25/233 (10%), Positives = 56/233 (24%), Gaps = 53/233 (22%) Query: 240 EKRPDPSSQAVMFCLMDVSGSMD----------QSTKDMAKRFYILLYLFLSRTYKNVEV 289 ++ +S + + D+S SM + +A + + Sbjct: 81 DRTETQASGVDIMLVFDLSWSMMVLDMGGHDETGTRFGIASAVLEDFVNK-RPNDRIGLI 139 Query: 290 VYIR----------HHTQAKEVDEHEFFYSQETGGTIVSSALKLMDEVVKERYNPAQWNI 339 V+ +H E GT + A + ++ + Sbjct: 140 VFSGVPYLASPLTLNHDWLVENLHRLHIGIIRELGTAIGDATAAATKRLQMS--KDSKSR 197 Query: 340 YAAQASDGDNW-ADDSPLCHEILAKKLLPVVRYYSYIEITRRA----------------- 381 +DGDN + P+ LA + + Sbjct: 198 IIILLTDGDNNQGEIEPVPAAQLAAAIGAKIYTIGLGIEEPSHLPAFDVDTGKFKHGPGG 257 Query: 382 -----------HQTLWREYEHLQSTFDNFAMQHIRDQDDIYPVFRELFHKQNA 423 + ++ + L + + RD ++IY L + Sbjct: 258 ELIPTIMLQPANYSVLGQMSRLAHGKF-YRATNRRDLENIYNEIDRLEKTEVK 309 >UniRef50_C3YP68 Putative uncharacterized protein n=1 Tax=Branchiostoma floridae RepID=C3YP68_BRAFL Length = 1386 Score = 75.7 bits (184), Expect = 3e-12, Method: Composition-based stats. Identities = 27/201 (13%), Positives = 56/201 (27%), Gaps = 24/201 (11%) Query: 230 DTFDLRYKNYEKRPDPSSQAVMFCLMDVSGSMDQSTKDMAKRFYILLYLFLS---RTYKN 286 + + E +F ++D SGS+ S + K+F + + + + Sbjct: 460 EDPMPTFPTPEPCHLTP---DLFFVLDGSGSVSVSDFETVKQFVVAVVSAFTIGLAETRV 516 Query: 287 VEVVYIRHHTQAKEVDEHEFF----------YSQETGGTIVSSALKLMDEVVKERYNPAQ 336 + Y T A + +H Q+ G T +AL+ + R PA Sbjct: 517 GVLQYSTSSTLACNLGDHPDEASFVSAINTMTYQKGGSTYTGAALEFARQNAAWR--PAP 574 Query: 337 WNIYAAQASDGDNWADDSPLCHEILAKKLLPVVRYYSYIEITRRAHQTLWREYEHLQSTF 396 + +DG + V ++ + L + Sbjct: 575 VSRIMIVLTDGQSHD----SVVAAAQALAADQVTVFAIGVGSFDH-SELLE-ITSNKLGH 628 Query: 397 DNFAMQHIRDQDDIYPVFREL 417 +I + R + Sbjct: 629 VFELDDFNAMAQNITQIVRAV 649 Score = 64.9 bits (156), Expect = 5e-09, Method: Composition-based stats. Identities = 23/192 (11%), Positives = 54/192 (28%), Gaps = 21/192 (10%) Query: 239 YEKRPDPSSQAVMFCLMDVSGSMDQSTKDMAKRFYILLYLFLS---RTYKNVEVVYIRHH 295 + + +F ++D SGS+ S + K+F + + + + + Y Sbjct: 257 FPTAEPCVVTSDLFFVLDGSGSVSVSDFETVKQFVVAVVSAFTIGLADTRVGVLQYSTSS 316 Query: 296 TQAKEVDEH--------EFFYS--QETGGTIVSSALKLMDEVVKERYNPAQWNIYAAQAS 345 + + +H Q+ G T +A++ + R PA + Sbjct: 317 SLECNLGDHPDEASFVSAINTLVYQKGGNTYTGAAMEFARQNAAWR--PAPVPKIMIVLT 374 Query: 346 DGDNWADDSPLCHEILAKKLLPVVRYYSYIEITRRAHQTLWREYEHLQSTFDNFAMQHIR 405 DG + + V ++ + L + + Sbjct: 375 DGKSSDSVVAAAQAL----AADQVAVFAIGVGSFDH-SELLE-ITNNKPGRVFELDDFDV 428 Query: 406 DQDDIYPVFREL 417 I + R + Sbjct: 429 LAQSINRIVRAV 440 >UniRef50_UPI000180D2ED PREDICTED: similar to predicted protein n=1 Tax=Ciona intestinalis RepID=UPI000180D2ED Length = 983 Score = 75.3 bits (183), Expect = 4e-12, Method: Composition-based stats. Identities = 27/194 (13%), Positives = 52/194 (26%), Gaps = 30/194 (15%) Query: 218 ELRAKIERVPFIDTFDLRYKNYEKRPDPSSQAVMFCLMDVSGSMDQ-----STKDMAKRF 272 ++P + D R++ + L+D S SM+ D+A+ Sbjct: 174 MTVYPSHKIPNCTSIDPRFRPWYTETAWPKPKRFLILLDSSRSMENTFNSKPMIDIAREL 233 Query: 273 YILLYLFLSRTYKNVEVVY--------------IRHHT-QAKEVDEHEFFYSQETGGTIV 317 +L L K + + + + KE G + Sbjct: 234 IDILLETLRPNDKISAIGFRHEALRSQGCFRNQLAFASETNKEKLRSFLRNITPMGESSY 293 Query: 318 SSALKLMDEVVKERYNP-----AQWNIYAAQASDGDNWA-----DDSPLCHEILAKKLLP 367 + A + +++++ Y SDG D E K+ Sbjct: 294 TVAFQSAFQLLEQDYIKYKNKSDTEKYVILLISDGQPKEAYGRMQDVYSIIEQQNLKINN 353 Query: 368 VVRYYSYIEITRRA 381 V +SY Sbjct: 354 SVSIFSYAIGRNGH 367 >UniRef50_D0KWQ3 von Willebrand factor type A n=1 Tax=Halothiobacillus neapolitanus c2 RepID=D0KWQ3_HALNC Length = 339 Score = 75.3 bits (183), Expect = 4e-12, Method: Composition-based stats. Identities = 26/210 (12%), Positives = 49/210 (23%), Gaps = 24/210 (11%) Query: 240 EKRPDPSSQAVMFCLMDVSGSMDQST-----------KDMAKRFYILLYLFLSRTYKNVE 288 E P P + L+D S SM D K+ + + Sbjct: 98 EFLPAPPPARSILLLVDASPSMQAEDFPAGKDRFIARIDAMKQGLLRFI-AARPQDRFSV 156 Query: 289 VVYIRHH------TQAKEVDEHEFFYSQET--GG-TIVSSALKLMDEVVKERYNPAQWNI 339 +V T V ++ + G T + L + + + Q Sbjct: 157 IVVTNSAGTLVPMTTDHAVLDYWIRQLRAGINGSDTALGDGLAMAIRSIAAQSQAGQPAP 216 Query: 340 YAAQASDGDNWAD-DSPLCHEILAKKLLPVVRYYSYIEITRRAHQTL--WREYEHLQSTF 396 +DG + +P LA+ + + Q + L Sbjct: 217 LLVVWTDGFSTGGLMTPAEALALARAYGIKLFTVNLAPKGSPPDQGQPSLAQLADLTGGK 276 Query: 397 DNFAMQHIRDQDDIYPVFRELFHKQNATAK 426 A + + K + Sbjct: 277 PILASDLAAMNAVTDQIAASVAPKSATPTE 306 >UniRef50_B9XQJ6 von Willebrand factor type A n=1 Tax=bacterium Ellin514 RepID=B9XQJ6_9BACT Length = 342 Score = 75.3 bits (183), Expect = 4e-12, Method: Composition-based stats. Identities = 24/211 (11%), Positives = 44/211 (20%), Gaps = 46/211 (21%) Query: 245 PSSQAVMFCLMDVSGSM-----DQSTKDMAKRFYILLYLFLSRTYKNVEVVYIRHH---- 295 + + L+D S SM S +K + R + V + Sbjct: 91 KALGEDVMFLLDCSKSMLAADVQPSRLSRSKYAILDFVQQHGRG-RVGLVAFAGQAFLQC 149 Query: 296 --TQAKEVDEHEFFYSQET----GGTIVSSALKLMDEVVKERYNPAQWNIYAAQASDGDN 349 T + E GGT + AL ++ + +DG++ Sbjct: 150 PLTFDYDAFRDALLAIDEQTIPVGGTDIGRALDEAYRAME----KNDRHKILVLITDGED 205 Query: 350 WADDSPLCHEILAKKLLPVVRYYSYI-------------------------EITRRAHQT 384 + LA+K VV + + Sbjct: 206 LEKAGIKTAQALAEK-GIVVYTIGVGTAAGSPIKVMNERGMLDYVKDEQGNVVESHLDEA 264 Query: 385 LWREYEHLQSTFDNFAMQHIRDQDDIYPVFR 415 + I Sbjct: 265 TLTAIAQMTHGSYQPLGPLGEGLGRIRRALE 295 >UniRef50_A9YRX2 Hedgling n=1 Tax=Amphimedon queenslandica RepID=A9YRX2_9METZ Length = 2416 Score = 75.3 bits (183), Expect = 4e-12, Method: Composition-based stats. Identities = 22/180 (12%), Positives = 44/180 (24%), Gaps = 23/180 (12%) Query: 247 SQAVMFCLMDVSGSMDQSTKDMAKRFYILLYLFLSRTYKNVEVVYIRHHTQA-------- 298 + + ++D SGS+ +A F + F +V I + T A Sbjct: 182 TNLDVVFVLDQSGSIGYYNHQLALNFLSKVVEFFKIGANKTQVGLITYSTHAYVQFDLND 241 Query: 299 ---KEVDEHEFFYS-QETGGTIVSSALKLMDEVVK------ERYNPAQWNIYAAQASDGD 348 K + G T + L ++ R +DG Sbjct: 242 YHSKSTILNRISRIYYTGGWTATALGLFQAGVILNPQQMRGARPISQGVPRVVILLTDGR 301 Query: 349 NWADDSPLCHEILAKKLLPVVRYYSYIEITRRAHQTLWREYEHLQSTFDNFAMQHIRDQD 408 + L ++ Y+ + + + F + D Sbjct: 302 SNRVPIDEVAPSLHD---FGIQVYTVGVG--NIYLPELKFIASDPDPYHIFLLDSFSDAS 356 >UniRef50_Q1LYI5 Novel collagen protein n=2 Tax=Danio rerio RepID=Q1LYI5_DANRE Length = 873 Score = 75.3 bits (183), Expect = 4e-12, Method: Composition-based stats. Identities = 26/191 (13%), Positives = 66/191 (34%), Gaps = 20/191 (10%) Query: 246 SSQAVMFCLMDVSGSMDQSTKDMAKRFYILLYLFLSRTYKNVEVVYIRH-HTQAKEVDEH 304 ++ + ++D S S+ + AKR+ I + + ++ +V +++ T E+ Sbjct: 10 TTPNDLAFIIDGSSSLGVPNFETAKRWLINITKGFDVSSRHTQVAVVQYSDTPRLEIPLG 69 Query: 305 EFFYSQE-----------TGGTIVSSALKLMDEV---VKERYNPAQWNIYAAQASDGDNW 350 + SQE G T A+K + + + + N A +DG + Sbjct: 70 KHQNSQELVEAVGSVSYLGGNTRTGRAIKFATDHVFGMPNHTSQSPRNRIAVVLTDGRSQ 129 Query: 351 ADDSPLCHEILAKKLLPVVRYYSYIEITRRAHQTLWREYEHLQSTFDNFAMQHIRDQDDI 410 D E A+ + ++ + L + ++ ++ I Sbjct: 130 DDVEDAAMEARAQN----IVLFAVGVGNEITNSELV-SMANKPASTYVLHVEDYNSIASI 184 Query: 411 YPVFRELFHKQ 421 + + + ++ Sbjct: 185 WDLMEQKLCEE 195 >UniRef50_C5E9N8 von Willebrand factor type A n=4 Tax=Bifidobacterium longum RepID=C5E9N8_BIFLO Length = 401 Score = 75.3 bits (183), Expect = 4e-12, Method: Composition-based stats. Identities = 23/188 (12%), Positives = 54/188 (28%), Gaps = 21/188 (11%) Query: 246 SSQAVMFCLMDVSGSMDQSTKDMAKRFYILLYL-------FLSR--TYKNVEVVYIRHHT 296 + ++D SGSM K+ + ++ N+ + + Sbjct: 218 RKPSWTIWVVDYSGSMSGEGKNGVVKGLNAALDPDQAKKSYIEPASGDVNILIPFETEAH 277 Query: 297 QAKEVD-------EHEFFYSQETGGTIVSSALKLMDEVVKERYNPAQWNIYAAQASDGDN 349 + + HE + +GGT + L + + +Q+ +DG + Sbjct: 278 RPVKATGTSTSDLLHEADATDASGGTDIYEGLLSALDELPSESEASQYTTAIVLMTDGRS 337 Query: 350 WADDSPLCHEILAKKLLPVVRYYSYIEITRRAHQTLWREYEHLQSTFDNFAMQHIRDQDD 409 + D E K + +S + A + + L + D Sbjct: 338 NS-DHQDEFESAYKSRGRDLPIFSIMFG--DADPSQLKSLATLSN--AKVFDGRSGDLAA 392 Query: 410 IYPVFREL 417 ++ + Sbjct: 393 VFRQAKGF 400 >UniRef50_A6G2R7 Aerotolerance-related membrane protein n=1 Tax=Plesiocystis pacifica SIR-1 RepID=A6G2R7_9DELT Length = 350 Score = 75.3 bits (183), Expect = 4e-12, Method: Composition-based stats. Identities = 21/208 (10%), Positives = 43/208 (20%), Gaps = 28/208 (13%) Query: 241 KRPDPSSQAVMFCLMDVSGSMDQS----------TKDMAKRFYILLYLFLSRTYKNVEVV 290 + +D+S SM +AK+ + V Sbjct: 113 TERVEHEGIDIVIALDLSDSMSNPMDGRRGLGLDRLTVAKQVIDEFIRR-RPHDRIALVG 171 Query: 291 YIRHHTQAKEVDE----------HEFFYSQETGGTIVSSALKLMDEVVKERYNPAQWNIY 340 + H + + + T + + L + +KE Sbjct: 172 FGAHASTIAPLTLDHAVLRNLIVQVRLGVVDGQETAIGAGLGVSLNRLKES---QAATKI 228 Query: 341 AAQASDG-DNWADDSPLCHEILAKKLLPVVRYYSYIEITRR---AHQTLWREYEHLQSTF 396 +DG N P A + V+ + T + Sbjct: 229 IVLLTDGVHNADGMDPDTVAQTAAERGVVIYTVLMGQQTGDRSSVDAGQLERLAGATDGY 288 Query: 397 DNFAMQHIRDQDDIYPVFRELFHKQNAT 424 A + + +L Sbjct: 289 AYLAEDTQTLETSFQDLLDKLEKSSIEG 316 >UniRef50_Q56BS9 Putative uncharacterized protein n=1 Tax=Enterobacteria phage RB43 RepID=Q56BS9_9CAUD Length = 739 Score = 75.3 bits (183), Expect = 4e-12, Method: Composition-based stats. Identities = 21/177 (11%), Positives = 47/177 (26%), Gaps = 15/177 (8%) Query: 238 NYEKRPDPSSQAVMFCLMDVSGSMDQSTKDMAKRFYILLYLFLSRTYKNVEVVY------ 291 ++ + + DVSGSM + K L + + + + Sbjct: 7 EFKNAVSKPTPTNHVFVCDVSGSMYNELPKIRKHLKANLASLVKQDDTVSILYFSSKGDY 66 Query: 292 --------IRHHTQAKEVDEHEFFYSQETGGTIVSSALKLMDEVVKERYNPAQWNIYAAQ 343 + + + + Y + TG T L L E+ + + Sbjct: 67 GTVFRGEKVSNVSDLTNICTAIDRYLKPTGCTGFVEPLNLAAEIATDLQSENGNLNSLIF 126 Query: 344 ASDGDNWADDSPLCHEILAKKLLPVVRYYSYIEITRRAHQTLWREYEHLQSTFDNFA 400 +DG + L +++E ++ L + + F Sbjct: 127 LTDGYDN-CWRTDDILKACAVLPLTFNSIAFLEYGYYVNRPLLEKMAEATNALHKFV 182 >UniRef50_P12111 Collagen alpha-3(VI) chain n=60 Tax=Eumetazoa RepID=CO6A3_HUMAN Length = 3177 Score = 75.3 bits (183), Expect = 4e-12, Method: Composition-based stats. Identities = 22/195 (11%), Positives = 50/195 (25%), Gaps = 22/195 (11%) Query: 249 AVMFCLMDVSGSMDQSTKDMAKRFYILLYLFL---SRTYKNVEVVYIRHHT--------Q 297 A + L+D S ++ + + + F + L + V + + + Sbjct: 38 ADIIFLVDSSWTIGEEHFQLVREFLYDVVKSLAVGENDFHFALVQFNGNPHTEFLLNTYR 97 Query: 298 AKEVDEHEFFYSQETGGTI-VSSALKLMDEV----VKERYNPAQWNIYAAQASDGDNWAD 352 K+ GGT L+ + + +DG + Sbjct: 98 TKQEVLSHISNMSYIGGTNQTGKGLEYIMQSHLTKAAGSRAGDGVPQVIVVLTDGHSKDG 157 Query: 353 DSPLCHEILAKKLLPVVRYYSYIEITRRAHQTLWREYEHLQSTFDNFAMQHIRDQDDIYP 412 A+ V ++ A + +E F +++ DI Sbjct: 158 ----LALPSAELKSADVNVFAIGV--EDADEGALKEIASEPLNMHMFNLENFTSLHDIVG 211 Query: 413 VFRELFHKQNATAKG 427 H + + Sbjct: 212 NLVSCVHSSVSPERA 226 Score = 59.9 bits (143), Expect = 2e-07, Method: Composition-based stats. Identities = 22/199 (11%), Positives = 53/199 (26%), Gaps = 25/199 (12%) Query: 243 PDPSSQAVMFCLMDVSGSMDQSTKDMAKRFYILLYLFLSRTYKNVEVVYIRH-HTQAKEV 301 S++ + L+D S ++ ++ + F + L L N+ V ++ T E Sbjct: 632 EVHSNKRDIIFLLDGSANVGKTNFPYVRDFVMNLVNSLDIGNDNIRVGLVQFSDTPVTEF 691 Query: 302 DEHEFF------------YSQETGGTIVSSALKLMD----EVVKERYNPAQWNIYAAQAS 345 + + Q G SAL + + Sbjct: 692 SLNTYQTKSDILGHLRQLQLQGGSGLNTGSALSYVYANHFTEAGGSRIREHVPQLLLLLT 751 Query: 346 DGDNWADDSPLCHEILAKKLLPVVRYYSYIEITRRAHQTLWREYEHLQSTFDNFAMQHIR 405 G ++DS L + ++ +A++ + + + M Sbjct: 752 AGQ--SEDSYLQAANALTRAG----ILTFCVGASQANKAELEQIA--FNPSLVYLMDDFS 803 Query: 406 DQDDIYPVFRELFHKQNAT 424 + + + Sbjct: 804 SLPALPQQLIQPLTTYVSG 822 Score = 59.9 bits (143), Expect = 2e-07, Method: Composition-based stats. Identities = 40/274 (14%), Positives = 78/274 (28%), Gaps = 33/274 (12%) Query: 162 ANISVVRSLQNSLARRTAMTAGKRRELHALEENLAIISNSEPAQLLEEERLRKEIAELRA 221 + VR+ + R A+ G + E + P + R+ + Sbjct: 1143 RSGDDVRNPSVVVKRGGAVPIGIGIGNADITEMQT--ISFIPDFAVAIPTFRQLGTVQQV 1200 Query: 222 KIERVPFIDTFDLRYKNYEKRPDPSSQA----VMFCLMDVSGSMDQSTKDMAKRFYILLY 277 ERV + +L +P PS + L+D S S + L Sbjct: 1201 ISERVTQLTREELSRLQPVLQPLPSPGVGGKRDVVFLIDGSQS-AGPEFQYVRTLIERLV 1259 Query: 278 LFLSRTYKNVEVVYIRHH-----------TQAKEVDEHEFFYSQETGG--TIVSSALKLM 324 +L + V I+ +K+ ++ + GG V +AL+ + Sbjct: 1260 DYLDVGFDTTRVAVIQFSDDPKVEFLLNAHSSKDEVQNAVQRLRPKGGRQINVGNALEYV 1319 Query: 325 DEVVKER----YNPAQWNIYAAQASDGDNWADDSPLCHEILAKKLLPVVRYYSYIEITRR 380 + +R + S G + + E K+ + R Sbjct: 1320 SRNIFKRPLGSRIEEGVPQFLVLISSGKSDDEVDDPAVE--LKQFGVAPFTIA-----RN 1372 Query: 381 AHQTLWREYEHLQSTFDNFAMQHIRDQDDIYPVF 414 A Q + S F++ R+ + Sbjct: 1373 ADQEELVKISL--SPEYVFSVSTFRELPSLEQKL 1404 Score = 58.0 bits (138), Expect = 7e-07, Method: Composition-based stats. Identities = 25/225 (11%), Positives = 51/225 (22%), Gaps = 28/225 (12%) Query: 216 IAELRAKIERVPFIDTFDLRYKNYEKRPDPSSQAVMFCLMDVSGSMDQSTKDMAKRFYIL 275 + + RY S A + L+D S + + F Sbjct: 1405 LTPITTLTSEQIQKLLASTRYPPPAVE---SDAADIVFLIDSSEGVRPDGFAHIRDFVSR 1461 Query: 276 LYLFLSRTYKNVEVVYIRHHTQA-----------KEVDEHEFFYSQETGGT--IVSSALK 322 + L+ V V ++ + + GG+ AL+ Sbjct: 1462 IVRRLNIGPSKVRVGVVQFSNDVFPEFYLKTYRSQAPVLDAIRRLRLRGGSPLNTGKALE 1521 Query: 323 LMDEVV----KERYNPAQWNIYAAQASDGDNWADDSPLCHEILAKKLLPVVRYYSYIEIT 378 + + + S A+ + S Sbjct: 1522 FVARNLFVKSAGSRIEDGVPQHLVLV-----LGGKSQDDVSRFAQVIRSSG-IVSLGVGD 1575 Query: 379 RRAHQTLWREYEHLQSTFDNFAMQHIRDQDDIYPVFRELFHKQNA 423 R +T + + F ++ R+ +I F A Sbjct: 1576 RNIDRTELQTITN--DPRLVFTVREFRELPNIEERIMNSFGPSAA 1618 Score = 57.6 bits (137), Expect = 1e-06, Method: Composition-based stats. Identities = 22/186 (11%), Positives = 41/186 (22%), Gaps = 25/186 (13%) Query: 246 SSQAVMFCLMDVSGSMDQSTKDMAKRFYILLYLFLS---RTYKNVEVVYIRHHTQA---- 298 A + L+D S + + F + L L + + V + Sbjct: 238 QDSADIIFLIDGSNNTGSVNFAVILDFLVNLLEKLPIGTQQIRVGVVQFSDEPRTMFSLD 297 Query: 299 ----KEVDEHEFFYSQETGG--TIVSSALKLMDE----VVKERYNPAQWNIYAAQASDGD 348 K GG + AL + E S G Sbjct: 298 TYSTKAQVLGAVKALGFAGGELANIGLALDFVVENHFTRAGGSRVEEGVPQVLVLISAGP 357 Query: 349 NWADDSPLCHEILAKKLLPVVRYYSYIEITRRAHQTLWREYEHLQSTFDNFAMQHIRDQD 408 + + L +S+ + A + + F + R Sbjct: 358 SSDEIRYGVVA------LKQASVFSFGLGAQAASRAELQHIA--TDDNLVFTVPEFRSFG 409 Query: 409 DIYPVF 414 D+ Sbjct: 410 DLQEKL 415 Score = 52.2 bits (123), Expect = 4e-05, Method: Composition-based stats. Identities = 27/276 (9%), Positives = 76/276 (27%), Gaps = 33/276 (11%) Query: 136 NLKQNQQRQLTEYKTHRAGYTANGVPANISVVRSLQNSLARRTAMTAGKRRELHALEENL 195 + + ++ E + G + +R +L + + + G + + E Sbjct: 333 HFTRAGGSRVEEGVPQVLVLISAGPSS--DEIRYGVVALKQASVFSFGLGAQAASRAELQ 390 Query: 196 AIISNSEPAQLLEEERLRKEIAELRAKIERVPFIDTFDLRYKNYEKRPDPSSQAVMFCLM 255 I ++ + E R ++ E L+ + ++ + L+ Sbjct: 391 HIATDDNLVFTVPEFRSFGDLQEKLLPYIVGVAQRHIVLKPPTIVTQVIEVNKRDIVFLV 450 Query: 256 DVSGSMDQSTKDMAKRFYILLYLFLSRTYKNVEVV-------------YIRHHTQAKEVD 302 D S ++ + + + F + L ++V + H T+ + + Sbjct: 451 DGSSALGLANFNAIRDFIAKVIQRLEIGQDLIQVAVAQYADTVRPEFYFNTHPTKREVIT 510 Query: 303 EHEFFYSQETGGTIVSSALKLMDEVVKER--------YNPAQWNIYAAQASDGDNWADDS 354 + G+ + + + V+ + G + + S Sbjct: 511 --AVRKMKPLDGSALYTG--SALDFVRNNLFTSSAGYRAAEGIPKLLVLITGGKSLDEIS 566 Query: 355 PLCHEILAKKLLPVVRYYSYIEITRRAHQTLWREYE 390 + L ++ + A Q E Sbjct: 567 QPA------QELKRSSIMAFAIGNKGADQAELEEIA 596 Score = 49.1 bits (115), Expect = 3e-04, Method: Composition-based stats. Identities = 22/213 (10%), Positives = 51/213 (23%), Gaps = 25/213 (11%) Query: 231 TFDLRYKNYEKRPDPSSQAVMFCLMDVSGSMDQSTKDMAKRFYILLYLFLSRT---YKNV 287 +A + L+D S + + + RF + + + Sbjct: 1620 PAPPGVDTPPPSRPEKKKADIVFLLDGSINFRRDSFQEVLRFVSEIVDTVYEDGDSIQVG 1679 Query: 288 EVVYIRHHTQ--------AKEVDEHEFFYSQETGG--TIVSSALKLM----DEVVKERYN 333 V Y T K GG L+ + Sbjct: 1680 LVQYNSDPTDEFFLKDFSTKRQIIDAINKVVYKGGRHANTKVGLEHLRVNHFVPEAGSRL 1739 Query: 334 PAQWNIYAAQASDGDNWADDSPLCHEILAKKLLPVVRYYSYIEITRRAHQTLWREYEHLQ 393 + A + G + +D+ L ++ V+ ++ R + Sbjct: 1740 DQRVPQIAFVITGGKSV-EDAQDVSLALTQR---GVKVFAVGV--RNIDSEEVGKIASNS 1793 Query: 394 STFDNFAMQHIRDQDDIYPVFRELFHKQNATAK 426 F + ++++ ++ E H Sbjct: 1794 --ATAFRVGNVQELSELSEQVLETLHDAMHETL 1824 >UniRef50_B8FBV4 von Willebrand factor type A n=1 Tax=Desulfatibacillum alkenivorans AK-01 RepID=B8FBV4_DESAA Length = 336 Score = 75.3 bits (183), Expect = 4e-12, Method: Composition-based stats. Identities = 20/211 (9%), Positives = 51/211 (24%), Gaps = 48/211 (22%) Query: 242 RPDPSSQAVMFCLMDVSGSM-----DQSTKDMAKRFYILLYLFLSRTYKNVEVVYIRHH- 295 + + +D+S SM + + AKR L + + V + Sbjct: 82 QEVSRKGVDIMVCVDISNSMMVEDAQPNRLERAKREVADLIRVAT-GDRLGLVAFSGVAF 140 Query: 296 -----TQAKEVDEHEFFYSQET------GGTIVSSALKLMDEVVKERYNPAQWNIYAAQA 344 T + + GT + +A+++ + + Sbjct: 141 TQCPLTLDYQAIQMFLDQLTVDLLPLRFQGTDLGAAIEMGMTAFD---PKSSTDKVILLI 197 Query: 345 SDGDNWADDSPLCHEILAKKLLPVVRYYSYIEITRR------------------------ 380 +DG++ + + K +R + Sbjct: 198 TDGED---NEEAGLKAAEKASDEGIRIFVLGIGDPAGGPVPSLDGSGFEKDAGGKIILSK 254 Query: 381 AHQTLWREYEHLQSTFDNFAMQHIRDQDDIY 411 ++ + + + D D +Y Sbjct: 255 PDESTLQAIANETGGDYIRSEAGDFDLDQLY 285 >UniRef50_Q023A9 von Willebrand factor, type A n=1 Tax=Candidatus Solibacter usitatus Ellin6076 RepID=Q023A9_SOLUE Length = 309 Score = 75.3 bits (183), Expect = 4e-12, Method: Composition-based stats. Identities = 28/215 (13%), Positives = 57/215 (26%), Gaps = 24/215 (11%) Query: 217 AELRAKIERVPFIDTFDLRYKNYEKRPDPSSQAVMFCLMDVSGSMDQSTKDMAKRFYILL 276 E+ I +R ++ + ++D SGSM Sbjct: 47 RVASYLREQDFEIFEDGVR-QSIRLFSHEDIPVTVGLVIDHSGSMR-PKMASVIAAARTF 104 Query: 277 YLFLSRTYKNVEVVY-----------IRHHTQAKEVDEHEFFYSQETGGTIVSSALKLMD 325 S + V + I + +++ + +S TG T + A+ Sbjct: 105 IQSSSPEDQMFVVNFNEDVTLGLSTEIPFTNRPEDLT-YAISHSPPTGKTALYDAVWKAR 163 Query: 326 EVVKERYNPAQWNIYAAQASDG-DNWADDSPLCHEILAKKLLPVVRYYSYIEITRRA--- 381 E V SDG DN + + A K ++ ++ Sbjct: 164 EWVARGSRD---KKVLVVVSDGGDNASTHTLSEILEAANK--SNIQVFTIGIFDPDDPDK 218 Query: 382 HQTLWREYEHLQSTFDNFAMQHIRDQDDIYPVFRE 416 + + R+ F + + I + Sbjct: 219 NPGVLRQLARATGGEA-FVPDELSEVVAICESIAK 252 >UniRef50_Q9SJE1 Magnesium-chelatase subunit chlD, chloroplastic n=49 Tax=cellular organisms RepID=CHLD_ARATH Length = 760 Score = 75.3 bits (183), Expect = 4e-12, Method: Composition-based stats. Identities = 32/272 (11%), Positives = 72/272 (26%), Gaps = 38/272 (13%) Query: 170 LQNSLARRTAMTAGKRRELHALEENLAIISNSEPAQLLEEERLRKEIAELRAKIERVPFI 229 Q + RR K + E+ I P ++ + + + Sbjct: 474 AQQAQKRRGKAGRAK--NVIFSEDRGRYIKPMLPKGPVKRLAVDATLRAAAPYQKLRREK 531 Query: 230 DTFDLR------YKNYEKRPDPSSQAVMFCLMDVSGSMDQSTKDMAK-RFYILLYLFLSR 282 D R KR + A++ ++D SGSM + AK LL + Sbjct: 532 DISGTRKVFVEKTDMRAKRMARKAGALVIFVVDASGSMALNRMQNAKGAALKLLAESYTS 591 Query: 283 TYKNVEVVYIR-------HHTQAKEVDEHEFFYSQETGGTIVSSALKLMDEV-VKERYNP 334 + + + +++ + + GG+ ++ L V + + Sbjct: 592 RDQVSIIPFRGDAAEVLLPPSRSIAMARNRLERLPCGGGSPLAHGLTTAVRVGLNAEKSG 651 Query: 335 AQWNIYAAQASDG------------DNWADDSPL----CHEILAKKLLPVVR-----YYS 373 I +DG ++ A D+P + ++ + Sbjct: 652 DVGRIMIVAITDGRANITLKRSTDPESIAPDAPRPTSKELKDEILEVAGKIYKAGMSLLV 711 Query: 374 YIEITRRAHQTLWREYEHLQSTFDNFAMQHIR 405 + +E + + Sbjct: 712 IDTENKFVSTGFAKEIARVAQGKYYYLPNASD 743 >UniRef50_A5UXM2 von Willebrand factor, type A n=1 Tax=Roseiflexus sp. RS-1 RepID=A5UXM2_ROSS1 Length = 774 Score = 75.3 bits (183), Expect = 4e-12, Method: Composition-based stats. Identities = 16/148 (10%), Positives = 30/148 (20%), Gaps = 7/148 (4%) Query: 260 SMDQSTKDMAKRFYILLYLFLSRTY--KNVEVVYIRHHTQAKEVDEHEFFYSQETGGTIV 317 +D + + L+ +V + + G T + Sbjct: 330 RLDDYRGQTIRMRFRLVTDAFGVRDGWYIDDVTIGPEWDDVRARAQAAIDTLNSRGATSI 389 Query: 318 SSALKLMDEVVKERYNPAQWNIYAAQASDGDNWADDSPLCHEILAKKLLPVVRYYSYIEI 377 L+ ++ SDG + V Sbjct: 390 GGGLQSSQRMLDTANP--DLPRVIILLSDGQENTRPFVADVLPQIRAAQTTVHTIGLG-- 445 Query: 378 TRRAHQTLWREYEHLQSTFDNFAMQHIR 405 R A Q L N+A + Sbjct: 446 -RDADQQLMLSIAAQTGGTYNYAPTPEQ 472 >UniRef50_A0Z8M6 Von Willebrand factor type A domain protein n=1 Tax=marine gamma proteobacterium HTCC2080 RepID=A0Z8M6_9GAMM Length = 316 Score = 75.3 bits (183), Expect = 4e-12, Method: Composition-based stats. Identities = 22/203 (10%), Positives = 50/203 (24%), Gaps = 30/203 (14%) Query: 240 EKRPDPSSQAVMFCLMDVSGSMDQSTKDMA-----------KRFYILLYLFLSRTYKNVE 288 E + + +D+SGSM+ A K L + Sbjct: 72 EPIEQQKAGRDLMIAVDLSGSMETEDFSQADGKPADRLTAVKTVLRQLANE-RAGDRLGL 130 Query: 289 VVY---------IRHHTQAKEVDEHEFFYSQETGGTIVSSALKLMDEVVKERYNPAQWNI 339 +V+ + + +E T + A+ L ++ K+ + Sbjct: 131 IVFGSSAYLQSPFTEDHRTWLLLLNETRIRMAGPSTALGDAVGLAIKLFKDAETE---HR 187 Query: 340 YAAQASDGDNWAD-DSPLCHEILAKKLLPVVRYYSYI----EITRRAHQTLWREYEHLQS 394 +DG++ P+ +A + + + Sbjct: 188 VLLLLTDGNDTGSLVPPVDAARVAATEDIRIYPIAVGDPTAVGEEAIDLDTLARMAEVTG 247 Query: 395 TFDNFAMQHIRDQDDIYPVFREL 417 F D ++ + L Sbjct: 248 GQA-FEALSSEDLIAVFKLLDTL 269 >UniRef50_B7Q412 Putative uncharacterized protein n=1 Tax=Ixodes scapularis RepID=B7Q412_IXOSC Length = 1021 Score = 75.3 bits (183), Expect = 4e-12, Method: Composition-based stats. Identities = 20/177 (11%), Positives = 51/177 (28%), Gaps = 20/177 (11%) Query: 228 FIDTFDLRYKNYEKRPDPSSQAVMFCLMDVSGSMDQSTKDMAKRFYILLY---LFLSRTY 284 ++ + ++ R S+ + L+D SGS+ Q+ ++ F T Sbjct: 28 YVQAEEPHGNVFDWRGIESNNTDLVFLLDRSGSVGQAGFEVETGFVHAFLKGFDVAPNTT 87 Query: 285 KNVEVVYIRHHTQAKEVDEHEFFYSQET-----------GGTIVSSALKLMDEVVKERYN 333 + + + + + + G T + L+ EV + Sbjct: 88 RVAVISFSEDAVVHADFLKDPGNKCHLSRKMQGVHSANQGATNTGAGLQAAWEVFQRSRP 147 Query: 334 PAQWNIYAAQASDGDNWADDSPLCHEILAKKLLPVVRYYSYIEITRRAHQTLWREYE 390 A+ +DG P+ K + + + + ++ + Sbjct: 148 TAK--KLLILVTDGMATMGPDPVKKAEKLKNMGVDIFVFGIGRMLKQH----LEQLA 198 >UniRef50_B3RUM1 Putative uncharacterized protein n=1 Tax=Trichoplax adhaerens RepID=B3RUM1_TRIAD Length = 1173 Score = 75.3 bits (183), Expect = 4e-12, Method: Composition-based stats. Identities = 19/188 (10%), Positives = 44/188 (23%), Gaps = 31/188 (16%) Query: 196 AIISNSEPAQLLEEERLRKEIAELRAKIERVPFIDTFDLRYKNYEKRPDPSSQAVMFCLM 255 + L V F +D R + + S + ++ Sbjct: 192 NYLKWQYFGSKFG---LSYTFPGRPWTTNFVGFTKDYDPRLRPWYIAAT-SGPKDVVIVI 247 Query: 256 DVSGSMDQSTKDMAKRFYILLYLFLSRTYKNVEVV------------------------- 290 D SM + +AK + L+R V Sbjct: 248 DCGLSMQGNRFKIAKSVAKTVLATLTRNDYVNIVCTRFSHWDETGKWHFYETTVLGCYKD 307 Query: 291 -YIRHHTQAKEVDEHEFFYSQETGGTIVSSALKLMDEVVKERYNPAQWNIYAAQASDGDN 349 I ++ + + G + + + ++++ + +DG+ Sbjct: 308 QLIPASLTNRKSLSNAIDNLKAGGTSEMKKGFQKAFKLLRG-SHRTGCQSIMIVITDGEK 366 Query: 350 WADDSPLC 357 C Sbjct: 367 TDGPKVRC 374 >UniRef50_C3JJF1 Putative von Willebrand factor, type A n=1 Tax=Rhodococcus erythropolis SK121 RepID=C3JJF1_RHOER Length = 684 Score = 75.3 bits (183), Expect = 4e-12, Method: Composition-based stats. Identities = 30/209 (14%), Positives = 58/209 (27%), Gaps = 30/209 (14%) Query: 238 NYEKRPDPSSQAVMFCLMDVSGSMDQSTKDM------AKRFYILLY-LFLSRTYKNVEVV 290 R ++ A + +MD+SGSM+ + + AK+ + S + Sbjct: 32 TPPTRAVSAAPAALLMVMDLSGSMNDNDANGKNKLTGAKQSLSRIVGDTASSSTPLGLWT 91 Query: 291 YIRHHTQ----------------AKEVDEHEFFYSQETGGTIVSSALKLMDEVVKERYNP 334 Y + + + + GGT AL+ + +K Sbjct: 92 YPTAGSNCDPGSFLAGADGGVRKDTDTLMAQVSGLKADGGTPTGPALRASVDSLKANGIT 151 Query: 335 AQWNIYAAQASDGDNWADDSPLCHEILAKKLLPVVRYYSYIEITRRAHQTLWREYEHLQS 394 SDG++ +P V + +T Sbjct: 152 TAT---VVLISDGESNCGQAPCDTAKQIVAEGFDVTVEALGFQLSGQGRTELECIASTTG 208 Query: 395 TFDNFAMQHIRDQDDIYPVFRELFHKQNA 423 + I D D++ +EL + A Sbjct: 209 GRYS----DIADVDEMQKRLKELMIPELA 233 >UniRef50_A8SY40 Putative uncharacterized protein n=1 Tax=Coprococcus eutactus ATCC 27759 RepID=A8SY40_9FIRM Length = 465 Score = 75.3 bits (183), Expect = 4e-12, Method: Composition-based stats. Identities = 22/157 (14%), Positives = 43/157 (27%), Gaps = 14/157 (8%) Query: 252 FCLMDVSGSMDQSTKDMAK-RFYILLYLFLSRTYKNVEVVYIRHHTQ---------AKEV 301 ++D S SM S K+ + L + K + + + K Sbjct: 148 MLVIDDSSSMKTSDKNDRRLTAANELLEHIDGNRKVGLIRFSKDIHCYIPMDYLKVNKST 207 Query: 302 DEHEFFYSQETGGTIVSSALKLMDEVVKERYNPAQWNIYAAQASDGDNWADDSPLCHEIL 361 HE + GGT ++ AL + + A + +DG + + Sbjct: 208 LNHELENKAKEGGTDINDALYAVLNAFDK-VGTATGSRSVILLTDGKSTTNVDEEYLINR 266 Query: 362 AKKLLPVVRYYSYIEITRRAHQTLWREYEHLQSTFDN 398 A + + S T + + Sbjct: 267 ANSMNIQINVISLGNHT---DKAFIKRITSSTGGKAA 300 >UniRef50_B4Q6G9 GD21946 n=2 Tax=Drosophila RepID=B4Q6G9_DROSI Length = 1100 Score = 75.3 bits (183), Expect = 4e-12, Method: Composition-based stats. Identities = 37/333 (11%), Positives = 87/333 (26%), Gaps = 31/333 (9%) Query: 101 GQGQASQDGEGQDEFVFQISKDEYLDL-LFEDLALPNLKQNQQRQLTEYKTHRAGYTANG 159 GQGQA GQ + + + D L + +++ + E ++ Sbjct: 14 GQGQAESPMGGQQHYDARRINEYNADGKLADGARHMDIR---FMRRFERLPVNLSLSSIL 70 Query: 160 VPANISVVRSLQNSLARRTAMTAGKRRELHALEENLAIISNSEPAQLLEEERLRKEIAEL 219 VP + + S A+ + + S LR+ Sbjct: 71 VPHGVDLDEPDVKS-----ALQWSGHLDPLFQNNLEQDPALSWQYFGSSTGFLRRFPGTA 125 Query: 220 RAKIERVPFIDTFDLRYKNYEKRPDPSSQAVMFCLMDVSGSMDQSTKDMAKRFYILLYLF 279 D R N+ SS + L+D S SM + + D+ + Sbjct: 126 WPPEGSKGSKLIHDFRTHNW-FVQAASSPKDIMILLDASSSMTEKSFDLGMATAFNILDT 184 Query: 280 LSRTYKNVEVVY---------------IRHHTQAKEVDEHEFFYSQETGGTIVSSALKLM 324 L + + +R + + + ++ L+ Sbjct: 185 LGEDDFVNLITFSEVVKTPVPCFKDRMVRATPDNIQEIKSAVKAIKLQDTANFTAGLEYA 244 Query: 325 DEVVKERYNPAQ---WNIYAAQASDGDNWADDSPLCHEILAKKLLPVVRYYSYIEITRRA 381 ++ + N + ++ ++ S VR ++Y+ + Sbjct: 245 FSLLHKYNQSGAGSQCNQAIMLIT--ESTSE-SHKDVIKQYNWPHMPVRIFTYLIGSDSG 301 Query: 382 HQTLWREYEHLQSTFDNFAMQHIRDQDDIYPVF 414 ++ + F + + + Sbjct: 302 SRSNLHDMACSNKGFFVQINDYDEARRKVIDYA 334 >UniRef50_Q1INP4 von Willebrand factor, type A n=1 Tax=Candidatus Koribacter versatilis Ellin345 RepID=Q1INP4_ACIBL Length = 430 Score = 75.3 bits (183), Expect = 4e-12, Method: Composition-based stats. Identities = 22/181 (12%), Positives = 53/181 (29%), Gaps = 14/181 (7%) Query: 248 QAVMFCLMDVSGSMDQSTKDMAKRFYILLYLFLSRTYKNVEVVY------IRHHTQAKEV 301 + ++D SGSM + I L + + V + + +T + Sbjct: 188 PVALGVVIDNSGSMR-DKRPAVNAATINLVKASNPEDEVFVVNFNDDYYLDQDYTDSVAK 246 Query: 302 DEHEFFYSQETGGTIVSSALKLMDEVVKERYNPAQWNIYAAQASDGDNWADDS--PLCHE 359 + + GGT + A+ + + + P +DG++ A + Sbjct: 247 LKEALEKYETRGGTALYDAVLASNAHLMKA--PKLEKKVLFIVTDGEDDASLNTLEQTIR 304 Query: 360 ILAKKLLPVVRYYSYIEITRRAHQ---TLWREYEHLQSTFDNFAMQHIRDQDDIYPVFRE 416 + ++ P + ++ T + RE F + + Sbjct: 305 KVQQENGPTIYTIGILDETGGHKRRAQRALREMAESTGGVAFFPQSLDEVSRITQQIAHD 364 Query: 417 L 417 + Sbjct: 365 I 365 >UniRef50_B0S9S4 Putative uncharacterized protein n=2 Tax=Leptospira biflexa serovar Patoc RepID=B0S9S4_LEPBA Length = 373 Score = 75.3 bits (183), Expect = 4e-12, Method: Composition-based stats. Identities = 26/209 (12%), Positives = 55/209 (26%), Gaps = 26/209 (12%) Query: 229 IDTFDLRYKNYEKRPDPSSQAVMFCLMDVSGSMDQST-----KDMAKRFYILLYLFLSRT 283 + L + + P + ++D SGSM + +AK I + L + Sbjct: 19 LIPVVLFFLHLSLLPQSNHNKRYVFILDASGSMSEKWDGKTRMAVAKEKLIQVLGGLPKD 78 Query: 284 YKNVEVVYIR-----HHTQAKEVDEHEFFYSQ--------ETGGTIVSSALKLMDEVVKE 330 V Y + + G T ++ L+++ E + Sbjct: 79 ASVGLVAYGNRIAGCQSARLYHPIQKGGASIVSQKLTTIVPAGSTPIAQTLQVVGEYLLS 138 Query: 331 RYNPAQWNIYAAQASDGDNWADDSPLCHEILAKKLLPVVRYYSYIEITRRAHQTLWREYE 390 + SDG + P ++ R + + Sbjct: 139 DQLETE----IIFISDGVESCEGDPKSVLYNLRQSGKKFRLQILGIDIDPKGEEDLKRLS 194 Query: 391 HLQSTFDNFAMQHIRDQDDIYPVFRELFH 419 L ++ +D F+ +F Sbjct: 195 ILGDGNYF----PLKTPEDYDRSFQRIFA 219 >UniRef50_UPI0000F2E846 PREDICTED: similar to ITI-like protein, partial n=1 Tax=Monodelphis domestica RepID=UPI0000F2E846 Length = 1002 Score = 75.3 bits (183), Expect = 5e-12, Method: Composition-based stats. Identities = 19/173 (10%), Positives = 40/173 (23%), Gaps = 20/173 (11%) Query: 270 KRFYILLYLFLSRTYKNVEVVYIRHHTQAKEV------------DEHEFFYSQETGGTIV 317 K+ ++ L V + K + + T + Sbjct: 329 KKAMHVILGDLCPKDHFNIVTFSDTVHIWKAAGSIQAIPPNIQRAKAYVSRMKAARWTDM 388 Query: 318 SSALKLMDEVVKER-YNPAQWNIYAAQASDGDNWAD--DSPLCHEILAKKLLPVVRYYSY 374 ++AL ++ + P +DG+ A + L V + Sbjct: 389 NAALLAAASILNQSIAGPLGEARLIIFLTDGEPTAGVTSPARILANAQRALAGQVALFGL 448 Query: 375 IEITRRAHQTLWREYEHLQSTFDNFAMQHIRDQDDIYPVFRELFHKQNATAKG 427 A L R + IR+ D + + + Sbjct: 449 ALG-DDADLPLLRRLSLENRGTAH----RIREDHDAASQLKGFYDRIAYPLLS 496 >UniRef50_D2QQW6 von Willebrand factor type A n=1 Tax=Spirosoma linguale DSM 74 RepID=D2QQW6_9SPHI Length = 316 Score = 75.3 bits (183), Expect = 5e-12, Method: Composition-based stats. Identities = 18/177 (10%), Positives = 49/177 (27%), Gaps = 17/177 (9%) Query: 245 PSSQAVMFCLMDVSGSMDQSTKDMAK-RFYILLYLFLSRTYKNVEVVYIRHHTQAKEVDE 303 L+D +GS+ + + + L + +T ++ Sbjct: 86 KPGGYSAMLLLDQTGSISTTDPYNLRIEASKIFLNNLGTDDYTGLTSFTSSYTSVVKLHS 145 Query: 304 HEFFY------------SQETGGTIVSSALKLMDEVVKERYNPAQWNIYAAQASDGDNWA 351 +GGT + ++ ++ A N +DG+N Sbjct: 146 GFTNKTEQMKKSLDTLALNVSGGTPLYTSTIQSVTYTAQKGPTA--NKAVIVFTDGENNV 203 Query: 352 DDSPLCHEILAKKLLPVVRYYSYIEITRRAHQTLWREYEHLQSTFDNFAMQHIRDQD 408 + L + AK + + ++ + + + + + +A + Sbjct: 204 TTNTLE-DATAKAIQQKIPLFTVGL-STDVNVNVLAQMANETGGAFFYAKDAGQLIS 258 >UniRef50_O95460 Matrilin-4 n=32 Tax=Amniota RepID=MATN4_HUMAN Length = 622 Score = 75.3 bits (183), Expect = 5e-12, Method: Composition-based stats. Identities = 25/185 (13%), Positives = 51/185 (27%), Gaps = 23/185 (12%) Query: 248 QAVMFCLMDVSGSMDQSTKDMAKRFYILLYLFLSRTYKNVEVVYIRHHTQA--------- 298 + ++D S S+ + ++F + L L+ V I++ +Q Sbjct: 32 PLDLVFVIDSSRSVRPFEFETMRQFLMGLLRGLNVGPNATRVGVIQYSSQVQSVFPLRAF 91 Query: 299 --KEVDEHEFFYSQE-TGGTIVSSALKLMDEVV-----KERYNPAQWNIYAAQASDGDNW 350 +E E GT+ A++ V R + A +DG Sbjct: 92 SRREDMERAIRDLVPLAQGTMTGLAIQYAMNVAFSVAEGARPPEERVPRVAVIVTDGRPQ 151 Query: 351 ADDSPLCHEILAKKLLPVVRYYSYIEITRRAHQTLWREYEHLQSTFDNFAMQHIRDQDDI 410 E+ A+ + Y+ R F ++ + Sbjct: 152 D----RVAEVAAQARARGIEIYAVGVQRADVGS--LRAMASPPLDEHVFLVESFDLIQEF 205 Query: 411 YPVFR 415 F+ Sbjct: 206 GLQFQ 210 Score = 64.9 bits (156), Expect = 6e-09, Method: Composition-based stats. Identities = 26/177 (14%), Positives = 44/177 (24%), Gaps = 23/177 (12%) Query: 233 DLRYKNYEKRPDPSSQAVMFCLMDVSGSMDQSTKDMAKRFYILLYLFL---SRTYKNVEV 289 L+ + L+D S S+ ++ KRF + FL + V Sbjct: 369 QLQADGKSCNRCREGHVDLVLLVDGSKSVRPQNFELVKRFVNQIVDFLDVSPEGTRVGLV 428 Query: 290 VYIR-----HHTQAKEVDEHEFFYSQE----TGGTIVSSALKLMDE----VVKERYNPA- 335 + GT+ AL+ M E + A Sbjct: 429 QFSSRVRTEFPLGRYGTAAEVKQAVLAVEYMERGTMTGLALRHMVEHSFSEAQGARPRAL 488 Query: 336 QWNIYAAQASDGDNWADDSPLCHEILAKKLLPVVRYYSYIEITRRAHQTLWREYEHL 392 +DG + D S A+ + Y+ + RE Sbjct: 489 NVPRVGLVFTDGRSQDDISVWA----ARAKEEGIVMYAVGVGKAV--EAELREIASE 539 >UniRef50_Q1ILA5 von Willebrand factor, type A n=1 Tax=Candidatus Koribacter versatilis Ellin345 RepID=Q1ILA5_ACIBL Length = 356 Score = 74.9 bits (182), Expect = 5e-12, Method: Composition-based stats. Identities = 27/208 (12%), Positives = 57/208 (27%), Gaps = 17/208 (8%) Query: 227 PFIDTFDLRYKNYEKRPDPSSQAVMFCLMDVSGSMDQSTKDMAKRFYILLYLFLSRTYKN 286 F+D + R + + + L+D S S+ K + L + + Sbjct: 112 KFVDDGKPVASIRDFRKETNLPLRVGLLIDSSNSIRDRFKFEQESAIEFLNQIIRPKFDK 171 Query: 287 VEVVYIRHHTQAKEVDEHEFFY---------SQETGGTIVSSALKLM-DEVVKERYNPAQ 336 V I T A+ + + GGT + A+ + + + Sbjct: 172 AFV--IGFDTTAEVTQDFTDDTDLLGKGVRMLRPGGGTAMYDAIYYACRDKLLKENGNTA 229 Query: 337 WNIYAAQASDGDNWADDSPLCHEILAKKLLPVVRYYSYIEITRRAH---QTLWREYEHLQ 393 SDG++ + + V+ Y+ T + + Sbjct: 230 MRKAMILLSDGEDNQSRVTREEAVEMAQRAEVI-IYAISTNTSGLKLRGDKVLERFAEAT 288 Query: 394 STFDNFAMQHIRDQDDIYPVFRELFHKQ 421 F I D + + ++ Q Sbjct: 289 GGRAFF-PFKISDVANAFSEIQDELRSQ 315 >UniRef50_Q5LCG5 Aerotolerance-related membrane protein n=25 Tax=Bacteroidales RepID=Q5LCG5_BACFN Length = 341 Score = 74.9 bits (182), Expect = 5e-12, Method: Composition-based stats. Identities = 25/215 (11%), Positives = 50/215 (23%), Gaps = 42/215 (19%) Query: 240 EKRPDPSSQAVMFCLMDVSGSM-----DQSTKDMAKRFYILLYLFLSRTYKNVEVVYIRH 294 + + +D+S SM S + AKR L + K +V+ Sbjct: 81 KLETVKRKGVEVMIALDISNSMLAQDVQPSRLEKAKRLISKLVDGM-ENDKVGMIVFAGD 139 Query: 295 H------TQAKEVDEHEFFYSQET----GGTIVSSALKLMDEVVKERYNPAQWNIYAAQA 344 T + + GT + +A+ L + Sbjct: 140 AFTQLPITSDYISAKMFLESISPSLISKQGTAIGAAINLA---ARSFTPQEGVGRAIVVI 196 Query: 345 SDGDNWADDSPLCHEILAKKLLPVVRYYSYIEITRRA----------------------H 382 +DG+N + + A K V + Sbjct: 197 TDGEN-HEGGAVEAAKEAAKKGIQVNVLGVGLPDGAPIPIEGSNDFRRDREGNVIVTRLN 255 Query: 383 QTLWREYEHLQSTFDNFAMQHIRDQDDIYPVFREL 417 + + +E + Q I ++ Sbjct: 256 EAMCQEIAKEGNGIYIRVDNSNSAQKAINQEINKM 290 >UniRef50_UPI0000DB712F PREDICTED: similar to c12.2 CG12149-PA isoform 1 n=1 Tax=Apis mellifera RepID=UPI0000DB712F Length = 1748 Score = 74.9 bits (182), Expect = 5e-12, Method: Composition-based stats. Identities = 49/436 (11%), Positives = 106/436 (24%), Gaps = 58/436 (13%) Query: 16 MVNRQRFLRRYKAQIKQSISEAINKRSVTDVDSGESVSIPTEDISEPMFHQGRGGLRHRV 75 V+ +R ++ + R + D+ ++ Sbjct: 1341 TVDAGGCVRLWETSLVNIEKSLGAWRRMVGDDNENLQIT----KERYSGLDVSSPKHGKI 1396 Query: 76 HPGNDHFVQNDRIERPQGGGGGSGSGQGQASQDGEGQDEFVFQISKDEYLDLLFEDLALP 135 N+ V + GG +G G + V Q+S +E A+P Sbjct: 1397 DEKNEPHVGGNTWAGGTGGRDTAGLGGKGGPYRLDAGH-TVHQLSDEEKN-------AVP 1448 Query: 136 NLKQNQQRQLTEYKTHRAGYTANGVPANISVVRSLQNSLARRTAMTAGKRRELHALEENL 195 + R++ + N + + N++ + L A + Sbjct: 1449 EHIKKAAREMGLKVFQQRLREINMSEYDHQLYSQFSNAVQKEVQALRTIIDSLQAKSKER 1508 Query: 196 AIISNSEPAQLLEEERLRKEIAELRAKIERVPFIDTFDLRYKNYEKRPDPSSQAVMFCLM 255 + +L + + I L + K E PS + ++ Sbjct: 1509 QWCRHQTSGELDDTKL----IEGLTGEKTIYRRRAE-----KEPELGTPPSKSKRLKLVV 1559 Query: 256 DVSGSM---DQSTKDMAKR--FYILLYLFLS-------------RTYKNVEVVYIRHHTQ 297 DVSGSM + + + I++ + V ++ H Sbjct: 1560 DVSGSMYRFNGYDGRLDREMEACIMVMEAFNGYEGKFQYDIVGHSGDDYSIV-FVNHTHP 1618 Query: 298 AKE--------VDEHEFFYSQETGGTIVSSALKLMDEVVKERYNPAQWNIYAAQASDGDN 349 H +G A + + SD + Sbjct: 1619 PVNNKRRLEIIKTMHAHSQFCMSGD-NTLEATQHA---IANLAKEDADECIVVVLSDANF 1674 Query: 350 WADDSPLCHEILAKKLLPVVRYYSYIEITRRAHQTLWREYEHLQSTFDNFAMQHIRDQDD 409 P V ++ + + + + M D D Sbjct: 1675 ERYGIRPEIFAKILTSNPNVNAFAIFIGSLGNQAFNLTK--KITAGRAFVCM----DLKD 1728 Query: 410 IYPVFRELFHKQNATA 425 I + +++F + Sbjct: 1729 IPRILQQIFAASLLST 1744 >UniRef50_C9L3E2 BatB protein n=10 Tax=Bacteroidales RepID=C9L3E2_9BACE Length = 342 Score = 74.9 bits (182), Expect = 5e-12, Method: Composition-based stats. Identities = 28/215 (13%), Positives = 49/215 (22%), Gaps = 42/215 (19%) Query: 240 EKRPDPSSQAVMFCLMDVSGSM-----DQSTKDMAKRFYILLYLFLSRTYKNVEVVYIRH 294 + + +D+S SM S + AKR L L K +V+ Sbjct: 81 KLETVKRKGVEVIIALDISNSMLAQDVQPSRLEKAKRLISRLVDEL-DNDKVGMIVFAGD 139 Query: 295 H----------TQAKEVDEHEFFYSQETGGTIVSSALKLMDEVVKERYNPAQWNIYAAQA 344 AK E GT + A+ L + Sbjct: 140 AFTQLPITSDYISAKMFLESINPSLISKQGTAIGEAINLA---ARSFTPQEGVGRAIIVI 196 Query: 345 SDGDNWADDSPLCHEILAKKLLPVVRYYSYIEITRRA----------------------H 382 +DG+N + + A + V + Sbjct: 197 TDGEN-HEGGAVEAAKAAAEKGIQVNVLGVGMPDGAPIPAEGTNDYRRDREGNVIVTRLN 255 Query: 383 QTLWREYEHLQSTFDNFAMQHIRDQDDIYPVFREL 417 +T+ +E Q I ++ Sbjct: 256 ETMCQEIAKEGKGIYVRVDNSNSAQKAINQEVNKM 290 >UniRef50_Q02A45 von Willebrand factor, type A n=1 Tax=Candidatus Solibacter usitatus Ellin6076 RepID=Q02A45_SOLUE Length = 311 Score = 74.9 bits (182), Expect = 5e-12, Method: Composition-based stats. Identities = 27/257 (10%), Positives = 69/257 (26%), Gaps = 21/257 (8%) Query: 169 SLQNSLARRTAMTAGKRRELHALEENLAIISNSEPAQLLEEERLRKEIAELRAKIERVPF 228 ++ + + ++ N+ + + + + L + ++ L + F Sbjct: 13 AMLSGQVKIEQRPRPAPKQEPRPGANIRVDTTLILVPVSVNDPLNRPVSGLERE-NFRVF 71 Query: 229 IDTFDLRYKNYEKRPDPSSQAVMFCLMDVSGSMDQSTKDMAKRFYILLYLFLSRTYKNVE 288 D + + + +P + + D SGSM ++ + + + Sbjct: 72 EDKVEQKIVQFAMDDEP---VAVGLVFDTSGSM-GEKLQRSRMAAREFFHISNPEDEFFL 127 Query: 289 VVYIRHHTQAKEVDEHE---FFYS---QETGGTIVSSALKLMDEVVKERYNPAQWNIYAA 342 V + + + + G T + A+ L +K + Sbjct: 128 VEFDSSPRLVVPLTSDTGTIEDHLTFSRSHGSTALLDAIFLALHEMKHS---KKNKKALL 184 Query: 343 QASDG-DNWADDSPLCHEILAKKLLPVVRYYSYIEITRRAHQT----LWREYEHLQSTFD 397 SDG DN + S + K+ ++ + L + Sbjct: 185 IISDGGDNHSRYSEKEVSSVVKESDVLIYSIGVFGGGGSPEEAGGPGLLSKVSEQTGGR- 243 Query: 398 NFAMQHIRDQDDIYPVF 414 + DI Sbjct: 244 -LFEASAVELPDIAKKI 259 >UniRef50_Q498S9 LOC498793 protein (Fragment) n=11 Tax=Euteleostomi RepID=Q498S9_RAT Length = 568 Score = 74.9 bits (182), Expect = 5e-12, Method: Composition-based stats. Identities = 18/125 (14%), Positives = 37/125 (29%), Gaps = 11/125 (8%) Query: 308 YSQETGGTIVSSALKLMDEVVKERYNPAQWNI----YAAQASDGDNW-ADDSPLCHEILA 362 Q +GGT ++ AL ++ E N N SDGD + + Sbjct: 1 KIQPSGGTNINEALLRAIFILNEASNMGLLNPDSVSLIILVSDGDPTVGELKLSKIQKNV 60 Query: 363 KK-LLPVVRYYSYIEITRRAHQTLWREYEHLQSTFDNFAMQHIRDQDDIYPVFRELFHKQ 421 K+ + + +S + + Q I D ++ +++ Sbjct: 61 KQNIQDNISLFSLGIGF-DVDYDFLKRLSNENRG----IAQRIYGNRDTSSQLKKFYNQV 115 Query: 422 NATAK 426 + Sbjct: 116 STPLL 120 >UniRef50_C6QKH4 von Willebrand factor type A n=1 Tax=Geobacillus sp. Y4.1MC1 RepID=C6QKH4_9BACI Length = 932 Score = 74.9 bits (182), Expect = 5e-12, Method: Composition-based stats. Identities = 22/169 (13%), Positives = 39/169 (23%), Gaps = 25/169 (14%) Query: 203 PAQLLEEERLRKEIAELRAKIERVPFIDTFDLRYKNYEKRPDPSSQAVMFCLMDVSGSMD 262 + + E + + DVSGSM Sbjct: 34 SSTAALNFTITSSQTEYAKPPNADAQGRLDITLTPQGRVDNIIRPPIDVVFVFDVSGSMV 93 Query: 263 QS--TKDMAKRFYILLYLFLS----RTYKNVEVVY---------IRHHTQAKEVDEH--- 304 D AK + + V + + + +V +H Sbjct: 94 MPSLKLDSAKYALQSAVDYFKANANPNDRFALVPFSDGVQSDKVVPFPSGTYDVKQHLNW 153 Query: 305 ---EFFYSQETGGTIVSSALKLMDEVVKERYNPAQWNIYAAQASDGDNW 350 + GGT + AL+ + +N Y +DG Sbjct: 154 IATVANSLRANGGTNYTQALQQA----QSFFNDPARKKYIIFLTDGMPT 198 >UniRef50_B3EP84 von Willebrand factor type A n=2 Tax=Chlorobiaceae RepID=B3EP84_CHLPB Length = 331 Score = 74.9 bits (182), Expect = 5e-12, Method: Composition-based stats. Identities = 22/206 (10%), Positives = 49/206 (23%), Gaps = 33/206 (16%) Query: 240 EKRPDPSSQAVMFCLMDVSGSM-----DQSTKDMAKRFYILLYLFLSRTYKNVEVVYIRH 294 + +D+S SM + S D AK+ + + V++ Sbjct: 87 AVTEASEKGIDIVFALDISESMLEEDFEGSRLDAAKKIALRFIRE-RPQDRFGLVLFRGK 145 Query: 295 H------TQAKEVDEHEFFYSQ----ETGGTIVSSALKLMDEVVKERYNPAQWNIYAAQA 344 T + GT + SA+ + ++ + Sbjct: 146 SFTLCPLTLDHRLLGMLVRQVSVDAISDKGTAIGSAILVGTNRLRASVSKE---RVLLLL 202 Query: 345 SDGD-NWADDSPLCHEILAKKLLPVVRYYSY------IEITR------RAHQTLWREYEH 391 +DG+ N + P+ +A+ + + + Sbjct: 203 TDGEHNSGEVGPVTASEIAQSEGIRIYVIGVRNEEEAGSPESMDAEREGVDEQVLGTVAG 262 Query: 392 LQSTFDNFAMQHIRDQDDIYPVFREL 417 + F D + L Sbjct: 263 MTGGRY-FRASDENSLKDAFGEIDAL 287 >UniRef50_C1XZC4 Uncharacterized conserved protein n=2 Tax=Meiothermus silvanus DSM 9946 RepID=C1XZC4_9DEIN Length = 427 Score = 74.9 bits (182), Expect = 6e-12, Method: Composition-based stats. Identities = 25/275 (9%), Positives = 60/275 (21%), Gaps = 30/275 (10%) Query: 73 HRVHPGNDHFVQNDRIERPQGGGGGSGSGQGQASQDGEGQDEFVFQISKDEYLDLLFEDL 132 G+ + + + + +L + Sbjct: 144 QANAQGSRQPNGQGNSQGDSQDNSQAMPQPNGQGGKKVWMTDTHAPAIERRAWELGHDHA 203 Query: 133 ALPNLKQNQQRQLTEYKTHRAGYTANGVPANISVVRSLQNSLARRTAMTAGKRRELHALE 192 P L + + + V R++ + Sbjct: 204 ESPALSEAEAEIVRGE-----------------VARAILEHAK----TRGSVPAGMVRWA 242 Query: 193 ENLAIISNSEPAQLLEEERLRKEIAELRAKIERVPFIDTFDLRYKNYEKRPDPSSQAVMF 252 + + + R + R + + S + + Sbjct: 243 QEIVAPVIPWQRVMAHHLRNGVRLTVGRTRPTYERIHRRMGVMDARVRLPGSYSLKPRVA 302 Query: 253 CLMDVSGSMDQSTKDMAKRFYILLYLFLSRTYKNVEVVYIRHHTQAKEVDEHEFFYSQET 312 ++D SGS+ A + + V V A++V + Sbjct: 303 VVVDTSGSVSDRMLGHALGEIQGILRQVGAE--LVVVSTDAQAHAAQKVQRVDQIRLVGG 360 Query: 313 GGTIVSSALKLMDEVVKERYNPAQWNIYAAQASDG 347 GGT +S ++ ++ P +DG Sbjct: 361 GGTDMSVGIEAAMKL---HPRPD----VIVVLTDG 388 >UniRef50_UPI0001B52F00 von Willebrand factor type A n=1 Tax=Fusobacterium sp. D11 RepID=UPI0001B52F00 Length = 218 Score = 74.9 bits (182), Expect = 6e-12, Method: Composition-based stats. Identities = 26/167 (15%), Positives = 41/167 (24%), Gaps = 15/167 (8%) Query: 240 EKRPDPSSQAVMFCLMDVSGSMDQSTKDMAKRFYILLYLFLSRTYKN---VEVVYIRH-- 294 E P + L D S SM + L + + +I Sbjct: 2 EFTSQPKKVLPLILLADTSSSMR-EWMRELNTAIRDMLGTLKEQESLKAEIHISFITFGN 60 Query: 295 -----HTQAKEVDEHEFFYSQETGGTIVSSALKLMDEVVKERY--NPAQWNIYAAQASDG 347 HT V EF E G T + AL++ E+V+ R + SDG Sbjct: 61 GGANLHTALTPVSNIEFNDFTEGGMTPLGGALRIAKEMVENREIIPSKSYAPIILLLSDG 120 Query: 348 DNWADDSPLCHEILAKKLL-PVVRYYSYIEITRRAHQTLWREYEHLQ 393 + S + + + Sbjct: 121 APNDNGWENEMYRFINDGRSKKCMRMSLGIGR-DYDYDVLKGFSSNG 166 >UniRef50_Q4S2X7 Chromosome 8 SCAF14759, whole genome shotgun sequence n=3 Tax=Euteleostomi RepID=Q4S2X7_TETNG Length = 647 Score = 74.9 bits (182), Expect = 6e-12, Method: Composition-based stats. Identities = 25/203 (12%), Positives = 55/203 (27%), Gaps = 27/203 (13%) Query: 243 PDPSSQAVMFCLMDVSGSMDQSTKDMAKRFYILLYLFL---SRTYKNVEVVY-------- 291 P + ++D S S+ + K F + L FL + + Y Sbjct: 47 PCKAVPLDFVFVIDSSRSIRPRDYEKVKTFIVNLVQFLEVGPEATRVGLLQYGSVVQPEF 106 Query: 292 --IRHHTQA-KEVDEHEFFYSQETGGTIVSSALKLMDEVVKERYNPAQ-----WNIYAAQ 343 T+A E + GT+ A++ E + A+ A Sbjct: 107 SLSTFSTKAEVEQAVRNMKHLAT--GTMTGLAIQYAAETSFTEADGARPAHLHIPRIAVV 164 Query: 344 ASDGDNWADDSPLCHEILAKKLLPVVRYYSYIEITRRAHQTLWREYEHLQSTFDNFAMQH 403 +DG D A++ ++ ++ + + + Sbjct: 165 VTDGRP--QDRVEEVAAQARQA--GIQIFAIGVGR--VDMKTLKTIGSEPHSEHVHLVAS 218 Query: 404 IRDQDDIYPVFRELFHKQNATAK 426 + + VF+ ++ Sbjct: 219 FSQMETLVSVFQSKLCREMCELL 241 Score = 67.6 bits (163), Expect = 9e-10, Method: Composition-based stats. Identities = 31/203 (15%), Positives = 57/203 (28%), Gaps = 40/203 (19%) Query: 249 AVMFCLMDVSGSMDQSTKDMAKRFYILLYLFLSRTYKNVEVVYIRHHTQAK--------- 299 + ++D S SM + + K+F I + L + V +++ T + Sbjct: 406 MDLVFMIDGSKSMGPANFERVKQFVISIVESLDVSPTGAHVGLLQYSTNVRTEFTLSQHT 465 Query: 300 --EVDEHEFFYSQETG-GTIVSSALKLMDE--VVKERYNPAQWNIYAAQASDGDNWADDS 354 + Q G G++ SAL+ M + E + +DG + D Sbjct: 466 SAQGIRQAVSRMQYMGRGSMTGSALRRMFQSSFSAEEGARPNVPRVSVVFTDGR--SQDD 523 Query: 355 PLCHEILAKKLLP----------------------VVRYYSYIEITRRAHQTLWREYEHL 392 AK V Y+ A + RE Sbjct: 524 ASEWAKKAKNSGIPGSFSYFGGTGRFLSCSFLLVLGVTIYAVGVGK--AIEQELREIASE 581 Query: 393 QSTFDNFAMQHIRDQDDIYPVFR 415 + Q +D +I + Sbjct: 582 PEEKHLYYAQEFKDVGEITEKLK 604 >UniRef50_A7SV43 Predicted protein n=5 Tax=Nematostella vectensis RepID=A7SV43_NEMVE Length = 1323 Score = 74.9 bits (182), Expect = 6e-12, Method: Composition-based stats. Identities = 25/230 (10%), Positives = 59/230 (25%), Gaps = 27/230 (11%) Query: 210 ERLRKEIAELRAKIERVPFIDTFDLRYKN-----YEKRPDPSSQAVMFCLMDVSGSMDQS 264 + + LR + L + + + +D S S+ + Sbjct: 1091 DPPITNVRMLRLLPLTWHSHISLRLEFYSCPGEIPHLAKPCPKSLDIGIALDRSTSVGPT 1150 Query: 265 TKDMAKRFYILLYLFLS---RTYKNVEVVYIRHHTQAKEV--------DEHEFFYSQETG 313 ++AK F +L + + Y ++ + + TG Sbjct: 1151 NFNIAKTFLKILVERMKISTNGSHFGLIAYSSSASRVISFRFSQKAADINRQIDAIEFTG 1210 Query: 314 G-TIVSSALKLMDEVVKERYNPAQWNI--YAAQASDGDNWADDSP-LCHEILAKKLLPVV 369 G T AL++ + + N+ ++G P K+ V Sbjct: 1211 GKTRTDFALQVAITDLFTNSAGDRENVTDVLIVMTNGRTSQGSLPYKDVMKPLKEK--KV 1268 Query: 370 RYYSYIEITRRAHQTLWREYEHLQSTFDNFAMQHIRDQDDIYPVFRELFH 419 + ++ E D+ + D + + + Sbjct: 1269 DVIAVGIG-PDVNEAELLEIAE--GGLDHVI--RVDDYEALATKLNSILA 1313 >UniRef50_Q9XAH6 Putative uncharacterized protein SCO6688 n=2 Tax=Streptomyces RepID=Q9XAH6_STRCO Length = 1171 Score = 74.9 bits (182), Expect = 6e-12, Method: Composition-based stats. Identities = 37/319 (11%), Positives = 77/319 (24%), Gaps = 27/319 (8%) Query: 67 GRGGLRHRVHPGNDHFVQNDRIERPQGGGGGSGSGQGQASQDGEGQDEFVFQISKDEYLD 126 +G ++ + + Sbjct: 820 DEESRETNEGGDGGEPEPGAGGTSEGSDDDRTGGAARSFPSVRHWAEDLRTLFGAEIRQE 879 Query: 127 LLFEDLA------LPNLKQNQQRQLTEYKTHRAGYTANGVPANISVVRSLQNSLARRTAM 180 +L +A + L R E + ++ +R L L Sbjct: 880 VLERAVADGRTDVIALLDPASVRPSVELLSAVLTLARGMPEQRVASLRPLVKRLVEELTK 939 Query: 181 TAGKRRELHALEENLAIISNSEPAQLLEEERLRKEIAELRAKIERVPFIDTFDLRYKNYE 240 R + L LR +A +R + + + ++ Sbjct: 940 ELATRLRPTLTGLTTPRPTRRPGGPLDLPRTLRANLAHIRRREDGRVEVVPERPVFR--- 996 Query: 241 KRPDPSSQAVMFCLMDVSGSMDQSTKDMAKRFYILLYLFLSRTYKNVEVVYIRHHTQAKE 300 R + + ++DVS SM+ S A IL + ++ TQ + Sbjct: 997 TRTARRNDWRLILVVDVSASMETSVVWSALTAAIL------GGAPTLSTHFLTFSTQVAD 1050 Query: 301 VDEHEFFYS------QETGGTIVSSALKLMDEVVKERYNPAQWNIYAAQASDGDNWADDS 354 + + GGT +++ L +V P + + SD + Sbjct: 1051 LTGLVADPLSLLLEVKVGGGTHIAAGLAHARSLV---TVPDRTLVVVV--SDFE-EGAAV 1104 Query: 355 PLCHEILAKKLLPVVRYYS 373 + + VR Sbjct: 1105 EGLLAEVGALVSAGVRLLG 1123 >UniRef50_C1TQB2 Uncharacterized protein n=1 Tax=Dethiosulfovibrio peptidovorans DSM 11002 RepID=C1TQB2_9BACT Length = 225 Score = 74.9 bits (182), Expect = 6e-12, Method: Composition-based stats. Identities = 25/166 (15%), Positives = 56/166 (33%), Gaps = 16/166 (9%) Query: 240 EKRPDPSSQAVMFCLMDVSGSMDQSTKDMAKRFYILLYLFLSRTY------KNVEVVYIR 293 E +P+ + + ++DVSGSM + + R L + L + + + Sbjct: 12 EMVENPTPRVPVSLVLDVSGSMLGAPIEELNRGVELFFKSLKDDDVARYSAEVSVISFSN 71 Query: 294 HHTQAKE---VDEHEFFYSQETGGTIVSSALKLMDEVVKERY------NPAQWNIYAAQA 344 TQ + +++ + + G T + A+ L E +++R + + Sbjct: 72 EVTQEVDFGPLEKCDIPELKAIGKTRMGGAVSLALESLEKRKELYRTLGVDYYQPWMVIM 131 Query: 345 SDGDNWADDSPLCHEILAKKLLPVVRYYSYIEITRRAHQTLWREYE 390 +DG D + A + + A +E+ Sbjct: 132 TDGKPNDDWQLAAAKTSALVDKGKLTVFPIAIG-DNACTDTLKEFS 176 >UniRef50_A7RFD8 Predicted protein (Fragment) n=1 Tax=Nematostella vectensis RepID=A7RFD8_NEMVE Length = 182 Score = 74.5 bits (181), Expect = 6e-12, Method: Composition-based stats. Identities = 23/152 (15%), Positives = 44/152 (28%), Gaps = 17/152 (11%) Query: 247 SQAVMFCLMDVSGSMDQSTKDMAKRFYILLYLFL---SRTYKNVEVVYIRHHTQAKEVDE 303 + + +D S S+ + + K F L + + +VY A + + Sbjct: 4 TNIDLVFAIDASSSVGKVNFERVKGFIRRLVESFHISRTSTRVAAIVYSSRPRVAFDFNR 63 Query: 304 --------HEFFYSQE-TGGTIVSSALKLMDEVVKERYNPAQWNIYAAQASDGDNWADDS 354 H + GGT AL+L + RY + +DG S Sbjct: 64 YTSARRAAHAVKRLRFLRGGTSTGRALRLASSRLFRRYGRKRR-KVLMLITDGK----SS 118 Query: 355 PLCHEILAKKLLPVVRYYSYIEITRRAHQTLW 386 + V+ ++ + L Sbjct: 119 DDVLKPSKALKRKGVQIFAVGVGMSVSRNELI 150 >UniRef50_C4LHS9 Magnesium-chelatase subunit D n=1 Tax=Corynebacterium kroppenstedtii DSM 44385 RepID=C4LHS9_CORK4 Length = 665 Score = 74.5 bits (181), Expect = 6e-12, Method: Composition-based stats. Identities = 45/347 (12%), Positives = 86/347 (24%), Gaps = 38/347 (10%) Query: 61 EPMFHQGRGGLRHRVHPGNDHFVQNDRIERPQGGGGGSGSGQGQASQDGEGQDEFVFQIS 120 P + HP ++ D + + D S Sbjct: 274 RPRLPEDVDDSDFDDHPRDNDENGGDENHDGNSEEENNKLSKNANPAAARPDDA--DTTS 331 Query: 121 KDEYLDLLFEDLALPNL--------KQNQQRQLTEYKTHRAGYTANGVPANISVVRSLQN 172 +E +++ P+ ++ + + A Sbjct: 332 DEEPGADTAPEVSSPSESDSGDTSREERNDPPHNDDASTGAEARNTDPVDKPDDQPLAIP 391 Query: 173 SLARR--TAMTAGKRRELHALEENLAIISNSEPAQLLEEERLRKEIAELRAKIERVPFID 230 + RR T+ T R + + PA+ + + A +R Sbjct: 392 HIPRRGRTSTTVWPGRHGPRGISHRGRVRRVMPARTHDTDLAILPTLAAAAPWQRFRHRH 451 Query: 231 TFDLRY-----KNYEKRPDPSSQAVMFCLM-DVSGSMDQSTKDMAK-RFYILLYLFLSRT 283 D R + S+ + ++ D SGSM + AK +L Sbjct: 452 QDDQRRIILTRDDLRTAQRGSAGGELVIIIVDASGSMGRGAIRTAKSTALEVLQSSYRDR 511 Query: 284 YKNVEVVYIRH-------HTQAKEVDEHEFFYSQETGGTIVSSALKLMDEVVKERYNPAQ 336 K +V H T++ GGT ++SA + + R+ P Sbjct: 512 SKVCIIVARGHEAAIGLPVTRSLSRARRCLTSLPTGGGTPLASARLRAASIAR-RFPPEL 570 Query: 337 WNIYAAQASDG---------DNWADDSPLCHEILAKKLLPVVRYYSY 374 + SDG D + ++ V Sbjct: 571 VR--VIELSDGRANVGLPHSHGNPADDADRAKYELDAMVSSVTCIPV 615 >UniRef50_A7SIA9 Predicted protein n=2 Tax=Nematostella vectensis RepID=A7SIA9_NEMVE Length = 484 Score = 74.5 bits (181), Expect = 6e-12, Method: Composition-based stats. Identities = 38/238 (15%), Positives = 68/238 (28%), Gaps = 23/238 (9%) Query: 191 LEENLAIISNSEPAQLLEEERLRKEIAELRAKIERVPFIDTFDLRY-KNYEKRPDPSSQA 249 + E ++ A + E + D + L+ K + Sbjct: 211 DGRRFNYTNWQEGEPSGMRRGQEQDCAAIAMYPELGKWEDHYCLQNMPFICKIKMCGERT 270 Query: 250 VMFCLMDVSGSMDQSTKDMAKRFYILLYLFLSRTYKNVEVVYIRHHTQAKEVD------- 302 + +D SGSM AKRF L + K V IR T+AK + Sbjct: 271 DLAFAIDASGSMGDQGFLRAKRFVKALIGSFKVSQKGTHVGIIRFSTRAKVMFTFTEHFT 330 Query: 303 ----EHEFFYSQ-ETGGTIVSSALKLMDEVVKERYNPAQWNIYAA----QASDGDNWADD 353 + + GGT AL+L + + ++ + +DG + Sbjct: 331 HEDVNYAIDDIEYTEGGTKTELALRLARTELFSKQGGSRTSPLIFKLFVLMTDGRSEYFH 390 Query: 354 SPLCHEILAKKLLPVVRYYSYIEITRRAHQTLWREYEHLQS---TFDNFAMQHIRDQD 408 + + K V + +Q +S +F IR + Sbjct: 391 AVARQAKMLK--RSGVHVMAVGIGKYT-NQRELEVIASSKSDVIGVVSFRDLMIRMNE 445 >UniRef50_C3ZCZ5 Putative uncharacterized protein (Fragment) n=1 Tax=Branchiostoma floridae RepID=C3ZCZ5_BRAFL Length = 371 Score = 74.5 bits (181), Expect = 6e-12, Method: Composition-based stats. Identities = 19/203 (9%), Positives = 54/203 (26%), Gaps = 21/203 (10%) Query: 202 EPAQLLEEERLRKEIAELRAKIERVPFIDTFDLRYKNYEKRPDPSSQAVMFCLMDVSGSM 261 + + + T + P ++ + ++D SGS+ Sbjct: 175 SSVCSVLNDNFLTTLTIQDC--NSDHISITMPCYTLLEKVTPPCNNPVDIVFVLDGSGSV 232 Query: 262 DQSTKDMAKRFYILLYLFLSRTYKNVEVVYIRHHTQAK-EVDEHEFFYSQE--------- 311 + + + + + + V +++ + + E F Q Sbjct: 233 GRRNFEKVQAGVKKIVGDFNIALDSTRVGVVQYSSIVRQEFALDTFSNLQGLESGIQSIP 292 Query: 312 --TGGTIVSSALKLMDE--VVKERYNPAQWNIYAAQASDGDNWADDSPLCHEILAKKLLP 367 GGT +A++ + +DG ++ D AK+ Sbjct: 293 YMAGGTRTGAAMEYAIQNSFTSANGARPDVGHVIVLVTDGRSY--DDVSQASQKAKQA-- 348 Query: 368 VVRYYSYIEITRRAHQTLWREYE 390 + ++ ++ + Sbjct: 349 GIVVFAVGIGDGAV-ESQLNQIA 370 Score = 69.5 bits (168), Expect = 2e-10, Method: Composition-based stats. Identities = 22/183 (12%), Positives = 45/183 (24%), Gaps = 26/183 (14%) Query: 248 QAVMFCLMDVSGSMDQSTKDMAKRFYILLYLFLSRTYKNVEVVYIRHHTQAK-EVDEHEF 306 + ++D SGS+ + K F N V +++ + + E F Sbjct: 1 PIDIIFMLDGSGSVGPDNFNKMKEFVKKTVGGYLIGPSNTRVAVMQYSSSVRQEFALDAF 60 Query: 307 FYSQ-----------ETGGTIVSSALKLMDEV--VKERYNPAQWNIYAAQASDGDNWADD 353 + GGT AL + ++ A +DG + Sbjct: 61 NTLEDLLVGIEEIRYMRGGTRTGKALTRLRRQGFLESNGARKNVPHVAVIVTDGRSSDSV 120 Query: 354 SPLCHEILAKKLLPVVRYYSYIEITRRAHQTLWREYE--HLQSTFDNFAMQHIRDQDDIY 411 E + Y+ + + + + DD+ Sbjct: 121 DQAALET----RQSGIVLYAVGVG--NYDLGQLTDIASTNETLG----VVDNFNLLDDVR 170 Query: 412 PVF 414 Sbjct: 171 NSL 173 >UniRef50_UPI000155D2F0 PREDICTED: similar to matrilin-3, partial n=1 Tax=Ornithorhynchus anatinus RepID=UPI000155D2F0 Length = 354 Score = 74.5 bits (181), Expect = 7e-12, Method: Composition-based stats. Identities = 37/320 (11%), Positives = 72/320 (22%), Gaps = 46/320 (14%) Query: 137 LKQNQQRQLTEYKTHRAGYTANGVPANIS-VVRSLQNSLARRTAMTAGKRRELHAL---- 191 L+++++R + + S V R S R A + + Sbjct: 19 LRESRKRNGRSKHPNTFAASRRRSTDGFSTVGRCYVAS--RNLAGPPDRTESPVRVKVTT 76 Query: 192 --------EENLAIISNSEPAQLLEEERLRKEIAELRAKIERVPFIDTFDLRYKNYEKRP 243 E + + LR + Sbjct: 77 AGSLVPGEEAEQSGRERWIQGRAKVAGPLRPRADSPGSGWTLARAPRVLRTGGSQSPGAG 136 Query: 244 D--------PSSQAVMFCLMDVSGSMDQSTKDMAKRFYILLYLFLSRT---YKNVEVVYI 292 S + ++D S S+ + K F + L + V Y Sbjct: 137 GRGPGADICRSRPLDLVFIVDSSRSVRPREFEKVKTFLSQVIDTLDIGETATRVAVVNYA 196 Query: 293 R--------HHTQAKEVDEHEFFYSQE-TGGTIVSSALKLMDE-----VVKERYNPAQWN 338 KE + GT+ A++ + R Sbjct: 197 STVKVEFHLQTHSDKESLKQAVSRIAPLATGTMSGLAIRTAMDEVFTVEAGARAPAFNIP 256 Query: 339 IYAAQASDGDNWADDSPLCHEILAKKLLPVVRYYSYIEITRRAHQTLWREYEHLQSTFDN 398 +DG E +A+ + Y+ A R+ Sbjct: 257 KVVVIVTDGRPQD----QVQEAVAQAQASGIEIYAVGVGR--ADMQSLRQLASEPVETHA 310 Query: 399 FAMQHIRDQDDIYPVFRELF 418 F ++ + + FR+ F Sbjct: 311 FYVETYGVIEKLTSTFRKTF 330 >UniRef50_Q5NIW0 Matrilin 3b n=19 Tax=Clupeocephala RepID=Q5NIW0_DANRE Length = 478 Score = 74.5 bits (181), Expect = 7e-12, Method: Composition-based stats. Identities = 25/196 (12%), Positives = 50/196 (25%), Gaps = 23/196 (11%) Query: 240 EKRPDPSSQAVMFCLMDVSGSMDQSTKDMAKRFYILLYLFLSRTYKNVEVVYIRHHT--- 296 P S + ++D S S+ + + K F + L V + + + Sbjct: 194 PAEPCKSRPLDLVFIIDSSRSVRPAEFEKVKIFLSEMVNSLDIGSDATRVALVNYASTVN 253 Query: 297 --------QAKEVDEHEFFYSQE-TGGTIVSSALKLMDEV-----VKERYNPAQWNIYAA 342 +K + F + GT+ A+K E R A Sbjct: 254 IEFHLKKYFSKAEVKQAFSRIDPLSTGTMTGMAIKTAMEQVFTENAGARPLKKGIGKVAI 313 Query: 343 QASDGDNWADDSPLCHEILAKKLLPVVRYYSYIEITRRAHQTLWREYEHLQSTFDNFAMQ 402 +DG + + Y+ ++ F ++ Sbjct: 314 IVTDGRPQDKVEEVSAAA----RASGIEIYAVGVDRAEMRS--LKQMASQPLDDHVFYVE 367 Query: 403 HIRDQDDIYPVFRELF 418 + + FRE Sbjct: 368 TYGVIEKLTSKFRETL 383 >UniRef50_Q4SNW2 Chromosome 15 SCAF14542, whole genome shotgun sequence. (Fragment) n=1 Tax=Tetraodon nigroviridis RepID=Q4SNW2_TETNG Length = 1009 Score = 74.5 bits (181), Expect = 7e-12, Method: Composition-based stats. Identities = 28/203 (13%), Positives = 54/203 (26%), Gaps = 25/203 (12%) Query: 243 PDPSSQAVMFCLMDVSGSMDQSTKDMAKRFYILLYLFLSR---------TYKNVEVVYIR 293 + ++D S S+ + + K F I L + V Y Sbjct: 702 EKRCGALDIVFVIDSSESVGLTNFTLEKNFVINTINRLGSLAKDPKSETGTRVGVVQYSH 761 Query: 294 HHTQ-AKEVDEHEFFYSQE-----------TGGTIVSSALKLMDEVVKERYNPAQWNIYA 341 T A D+ + GGT SALK + + A+ ++ Sbjct: 762 SGTFQAIRPDDPKIDSLTSFKEAVKQMEWIAGGTWTPSALKYAYDNLIRDSRRAKASVSV 821 Query: 342 AQASDGDNWADDSPLCHEILAKKLLPVVRYYSYIEITRRAHQ-TLWREYEHLQSTFDNFA 400 +DG D L V ++ + + + R + Sbjct: 822 VVITDGRFDPRDDDSLLTYLCSDPKVDVNAIGIGDMFYQVEENEILRSIACQKDGK-VLG 880 Query: 401 MQHIRDQ--DDIYPVFRELFHKQ 421 M+ D ++ + + Sbjct: 881 MRRFADLVAEEFIDKIETVLCPE 903 Score = 49.5 bits (116), Expect = 3e-04, Method: Composition-based stats. Identities = 23/165 (13%), Positives = 48/165 (29%), Gaps = 22/165 (13%) Query: 248 QAVMFCLMDVSGSMD------QSTKDMAKRFYILLYLFLSRTYKNVEVVYIR------HH 295 ++ +D S ++ S + K F I L+ V Sbjct: 49 NIEVYFTIDTSETIALQESPPGSLVESIKDFTIEFVKRLADEEYRGAVRLSWKMGGLHFS 108 Query: 296 TQAKEVDEHEFFYSQETG---------GTIVSSALKLMDEVVKERYNPAQWNIYAAQASD 346 + + G GT + AL M + + + +P + +A +D Sbjct: 109 QEQRVFSRLGTKAQFINGISGIRYLGKGTYIDCALTNMTQEMTQSPSPFKPLRFAVVITD 168 Query: 347 GDNWADDSPLCHEILAKKLLPVVRYYSYIEITRRAHQTLWREYEH 391 G + + +R ++ +R +T RE + Sbjct: 169 GHVTGNPCGGIKVSAERARDAGIRIFAVA-ASRNIDETGMREIAN 212 >UniRef50_A5GIB9 Protoporphyrin IX Mg-chelatase subunit ChlD n=22 Tax=Cyanobacteria RepID=A5GIB9_SYNPW Length = 728 Score = 74.5 bits (181), Expect = 7e-12, Method: Composition-based stats. Identities = 41/280 (14%), Positives = 73/280 (26%), Gaps = 27/280 (9%) Query: 80 DHFVQNDRIERPQGGGGGSGSGQGQASQDGEGQDEFVFQISKDEYLDLLFEDLALPNLKQ 139 +D P+G +D ++ S D+ E + Sbjct: 380 GEQNSDDTPPPPEGSADDD----NDPPEDSSDDNDSEDDDSNDDEDSEQDEAPPSVPEEF 435 Query: 140 NQQRQLTEYKTHRAGYTANGVPANISVVRSLQNSLARRTAMTAGKRRELHALEENLAIIS 199 + + A + S RS+ S +R + R A + Sbjct: 436 MLDPEAVAIDPDLLLFNAAKSKSGNSGSRSVVLSDSRGRYVKPMLPRGPVRRIAVDATLR 495 Query: 200 NSEPAQLLEEERLRKEIAELRAKIERVPFIDTFDLRYKNYEKRPDPSSQAVMFCLMDVSG 259 + P Q + ER ++ DLR K + A++ L+D SG Sbjct: 496 AAAPYQKARRA----------RQPERTVIVEESDLRAKLL----QRQAGALVIFLVDASG 541 Query: 260 SMDQSTKDMAKRF-YILLYLFLSRTYKNVEVVYIR-------HHTQAKEVDEHEFFYSQE 311 SM + AK LL + + + T++ Sbjct: 542 SMALNRMQSAKGAVIRLLTEAYENRDEVALIPFRGDQAEVLLPPTRSITAARRRLESMPC 601 Query: 312 TGGTIVSSALKLMDEVVKERYNPAQWNIYAAQ-ASDGDNW 350 GG+ ++ L V +DG Sbjct: 602 GGGSPLAHGLTQAARVGANALATGDLGQVVVVAITDGRGN 641 >UniRef50_C7PL60 von Willebrand factor type A n=1 Tax=Chitinophaga pinensis DSM 2588 RepID=C7PL60_CHIPD Length = 345 Score = 74.5 bits (181), Expect = 7e-12, Method: Composition-based stats. Identities = 21/223 (9%), Positives = 52/223 (23%), Gaps = 43/223 (19%) Query: 240 EKRPDPSSQAVMFCLMDVSGSM-----DQSTKDMAKRFYILLYLFLSRTYKNVEVVYIRH 294 + +DVS SM AK+ L L + VV+ + Sbjct: 82 RMEKITRKGVDVVIALDVSKSMLAGDVKPDRLTRAKQLISKLADKL-DNDRVGLVVFAGN 140 Query: 295 H------TQAKEVDEHEFFYSQET----GGTIVSSALKLMDEVVKERYNPAQWNIYAAQA 344 T + GT + A+++ ++ + + + Sbjct: 141 AYLQMPLTIDYSAAKMYLTTVSPDMIPTQGTAIGQAIQVANDAFNK---KERKHKSLIII 197 Query: 345 SDGDNWADDSPLCHEILAKKLLPVVRYYSYIEITRRA----------------------H 382 SDG++ + + + V+ T + Sbjct: 198 SDGEDHDEAAISKARAAFED-GVVINTIGIGSPTGSPLPDPETGTYKKDKEGNTVISKLN 256 Query: 383 QTLWREYEHLQSTFDNFAMQHIRD-QDDIYPVFRELFHKQNAT 424 + + + + + + + K+ Sbjct: 257 EDALKSIAAAGKGIYEHLDNNTEEVVNSLTQKIDSMEQKEFGE 299 >UniRef50_D0L145 von Willebrand factor type A n=3 Tax=Gammaproteobacteria RepID=D0L145_HALNC Length = 610 Score = 74.5 bits (181), Expect = 7e-12, Method: Composition-based stats. Identities = 47/445 (10%), Positives = 107/445 (24%), Gaps = 64/445 (14%) Query: 19 RQRFLRRYKAQIKQSISEAINKRSVTDVDSGESVSIPT------EDISEPMFHQGRGGLR 72 ++R A+ + I + S + S+P ++ Sbjct: 187 QERISEIDDARTAREIGLRLAHDIGQMRLSMDEGSVPPMSIYRDDNQHLWYEEHPLPEET 246 Query: 73 HRVHPGNDHFVQNDRIERPQGGGGGSGSGQGQASQDGEGQDEFV--FQISKDEYLDLLFE 130 + + E G + + Q S E + ++ + + Sbjct: 247 KAADQATEGVQSGLQFEEATEGRQLAFTDVPQVSSGAEAGYRIEEVTEDAQLTFFQTSED 306 Query: 131 DLALPNLKQNQQRQLTEYKTHRAGY--------TANGVPANISVVRSLQNSLARRTAMTA 182 + P+ + + + + PA ++ R L L R Sbjct: 307 EETHPSRYPEWFARFGVERENWCAVHERKAQEGDKDWAPAVLAANRPLLAHLRRVVGALR 366 Query: 183 GKRRELHALEENLAIISNSEPAQLLEEERLRKEIAELRAKIERVPFIDTFDLRYKNYEKR 242 + R L +E + + LR + + Sbjct: 367 SRHRVLLRKQETGDELD--------LDAALRAMADIRNGREPDARVFIRTQPQDDQVLAL 418 Query: 243 PDPSSQAVMFCLMD-VSGSMDQSTKDMAKRFYILLYLFLSRTYKNVEVVYIRHHTQAKEV 301 + D VSG D+A+ +LL + E+ H+Q + + Sbjct: 419 SLLLDLSESVNNPDPVSGR---PILDLAREAVLLLAETAT--DLGSELSLAGFHSQGRHL 473 Query: 302 DEHEFFYS--------------QETG--GTIVSSALKLMDEVVKERYNPAQWNIYAAQAS 345 +++ G T + +A++ EV+ R N + Sbjct: 474 VDYQSAKRFDEPFDAAAKARLAAFDGRYSTRMGAAIRHATEVLLTR---GAHNRIILIMT 530 Query: 346 DGDNWADDS-------PLCHEILAKKLLPVVRYYSYIEITRRAHQTLWREYEHLQSTFDN 398 DG+ D + + + V + Sbjct: 531 DGEPSDIDVFDSEHLLLDTRQAVIEARQRGVPVFCVSLDPGA--DRYVERI--FGEGHFL 586 Query: 399 FAMQHIRDQDDIYPVFRELFHKQNA 423 + + + LF + + Sbjct: 587 V----LDNLRHLPERLSRLFLRLVS 607 >UniRef50_C6VU79 Sigma 54 interacting domain protein n=1 Tax=Dyadobacter fermentans DSM 18053 RepID=C6VU79_DYAFD Length = 614 Score = 74.5 bits (181), Expect = 7e-12, Method: Composition-based stats. Identities = 39/384 (10%), Positives = 90/384 (23%), Gaps = 52/384 (13%) Query: 49 GESVSIPTEDISEPMFHQGRGGLRHRVHPGNDHFVQNDRIERPQGGGGGSGSGQGQASQD 108 G ++ P + + + P + +R S +A Sbjct: 266 GRTLVTPDDIRESATLVLTHRKGKKALSPNSQQPN--ERPASGDENPQNQKSQTPEAP-- 321 Query: 109 GEGQDEFVFQISKDEYLDLLFEDLALPNLKQNQQRQLTEYKTHRAGYTANGVPANISVVR 168 D+ E + E + T + A Sbjct: 322 ----DKTQNGQQDGECGEGNCESEGSERIANIDFGMTVPELTKKPASAAP---------- 367 Query: 169 SLQNSLARRTAMTAGKRRELHALEENLAIISNSEPAQLLEEERLRKEIAELRAKIERVPF 228 A R A + + + + + + + + + Sbjct: 368 --------DVANGRDVRARQTAKGAAIRAVRSETASDIAMTDTILHALTRNPDDLTIGKA 419 Query: 229 IDTFDLRYKNYEKRPDPSSQAVMFCLMDVSGSMD-QSTKDMAK-RFYILLYLFLSRTYKN 286 +R + ++ ++D SGSM + K LL + Sbjct: 420 DLHQKVRSG--------KAGRLILFVVDSSGSMAAGKRMEAVKGSVMKLLEDAYQKRNMV 471 Query: 287 VEVVYIR-------HHTQAKEVDEHEFFYSQETGGTIVSSALKLMDEVVKERYNPAQWNI 339 + + T++ + E G T + AL++ ++++ Sbjct: 472 AVIAFRGVEATVLLEPTRSTGLAEQALEQLPTGGRTPLPHALEMAEKMLASFAGRDTMEP 531 Query: 340 YAAQASDGDNW------ADDSPLCHEILAKKLLPVVRYYSYIEITRRAHQTLWREYEHLQ 393 SDG D+ LA+ L + RE Sbjct: 532 LLVILSDGKANVPLPGSGGDAWRQSLQLAQNLHCDA--LVLDTESGYVRYGKARELATAL 589 Query: 394 STFDNFAMQHIRDQDDIYPVFREL 417 ++Q + + + + + Sbjct: 590 GAEYR-SLQELSADEITDTIAKRI 612 >UniRef50_A0NX68 Von Willebrand factor type A domain protein n=1 Tax=Labrenzia aggregata IAM 12614 RepID=A0NX68_9RHOB Length = 657 Score = 74.5 bits (181), Expect = 7e-12, Method: Composition-based stats. Identities = 24/202 (11%), Positives = 49/202 (24%), Gaps = 24/202 (11%) Query: 240 EKRPDPSSQAVMFCLMDVSGSMDQS-----TKDMAKRFYILLYLFLSRTYKNVEVVY--- 291 R + + ++D S SM ++A+ + L + + Y Sbjct: 15 PARAQTDTSPDLLFVLDSSNSMWGQIDGTAKAEIARSAFEGFVAGLPDGTRAGVMAYGHR 74 Query: 292 -----------IRHHTQAKEVDEHEFFYSQETGGTIVSSALKLMDEVVKERYNPAQWNIY 340 + + G T ++ L+ E++ + P + Sbjct: 75 RKADCGDVETLVPVSDLDRAKLVESVKALTPRGKTPITETLRQAAELLAQNDRPGR---- 130 Query: 341 AAQASDGDNWADDSPLCHEILAKKLLPVVRYYSYIE-ITRRAHQTLWREYEHLQSTFDNF 399 SDG P + + I +A Q HL Sbjct: 131 LILISDGIETCGGDPCALAEALASSGVDFKAHVIGFDIASKADQAKIACIAHLTGGTYWN 190 Query: 400 AMQHIRDQDDIYPVFRELFHKQ 421 A + + + K Sbjct: 191 ARDADGLNEALKESVAAVEEKA 212 >UniRef50_C3PEQ4 Putative membrane protein n=2 Tax=Corynebacterium aurimucosum ATCC 700975 RepID=C3PEQ4_CORA7 Length = 693 Score = 74.5 bits (181), Expect = 7e-12, Method: Composition-based stats. Identities = 27/196 (13%), Positives = 44/196 (22%), Gaps = 33/196 (16%) Query: 252 FCLMDVSGSM------DQSTKDMAKRFYILLYLFLSRTYKNVEVVYIRHHTQAKE----- 300 + D SGSM Q+ D AK V Y + +A E Sbjct: 90 MVVFDSSGSMITNDAGGQTRIDAAKDAARTFITEAGDDAPLGLVTYGGNTGEAPEDEAAG 149 Query: 301 ----------------VDEHEFFYSQETGGTIVSSALKLMDEVVKERYNPAQWNIYAAQA 344 Q G T + +L+ + P + Sbjct: 150 CQDITVVTPPEAGNSEKMIAHMDGLQPRGFTPIGESLRKAAAEL-----PKEGQRSIILV 204 Query: 345 SDGDNWADDSPLC-HEILAKKLLPVVRYYSYIEITRRAHQTLWREYEHLQSTFDNFAMQH 403 SDG P+C K+ + + Q + A Sbjct: 205 SDGVATCTPPPVCDVAKELKEQGIDLVINTVGFNVEPEAQQELQCIADATGGTYANASDA 264 Query: 404 IRDQDDIYPVFRELFH 419 ++ F+ Sbjct: 265 DSLAKELNRAAPRTFN 280 >UniRef50_A1ANG2 Protoporphyrin IX magnesium-chelatase n=6 Tax=Bacteria RepID=A1ANG2_PELPD Length = 689 Score = 74.5 bits (181), Expect = 7e-12, Method: Composition-based stats. Identities = 42/320 (13%), Positives = 80/320 (25%), Gaps = 20/320 (6%) Query: 101 GQGQASQDGEGQDEFVFQISKDEYLDLLFEDLALPNLKQNQQRQLTEYKTHRAGYTANGV 160 + + EGQ++ Q E D E+ + + R G Sbjct: 357 SPDRQPEQAEGQEQRE-QREDRENRDDAGEEEEQSEQENSDMPHPPSQGEERESVFQVGA 415 Query: 161 PANISVVRSLQNSLARRTAMTAGKRRELHALEENLAIISNSEPAQLLEEERLRKEIAELR 220 + +RS ++ + RR + R + + + LR Sbjct: 416 TFKVKTIRSDKDRVVRRGSGRRSSTRVSRKQGRYVKSSYACSNNDIALDATLRAAAPHQL 475 Query: 221 AKIERVPFIDTFDLRYKNYEKRPDPSSQAVMFCLMDVSGSMDQSTKDMAKR--FYILLYL 278 + + R + R + ++D SGSM + A + LL Sbjct: 476 FRGTKEGMCVNLSDRDFRGKVREKRVGNF-LLFVVDASGSMGARGRMAASKGAVMSLLLD 534 Query: 279 FLSRTYKNVEVVY-------IRHHTQAKEVDEHEFFYSQETGGTIVSSALKLMDEVVKER 331 + + + + T + E+ G T +S+A+ E ++ Sbjct: 535 AYQKRDRVGMISFRKNEAFVNLPPTTSVELAGKLLEEMPVGGRTPLSAAIAKSYEQLRGV 594 Query: 332 YNPAQWNI-YAAQASDGDNW-------ADDSPLCHEILAKKLLPVVRYYSY-IEITRRAH 382 +DG + D + RY E Sbjct: 595 LGRDPTARPIVIFITDGKSNVALGDGRPVDEAMGLARAMAVKEERARYIVVDTEEEGMVS 654 Query: 383 QTLWREYEHLQSTFDNFAMQ 402 L R Q Sbjct: 655 FGLARRLADAMEAEYFRIDQ 674 >UniRef50_C0BF89 Putative uncharacterized protein n=1 Tax=Coprococcus comes ATCC 27758 RepID=C0BF89_9FIRM Length = 275 Score = 74.5 bits (181), Expect = 7e-12, Method: Composition-based stats. Identities = 16/136 (11%), Positives = 36/136 (26%), Gaps = 12/136 (8%) Query: 262 DQSTKDMAKRFYILLYLFLSRTYKNVEVVYIRHHTQAKEVDE------------HEFFYS 309 K+ L+++ N E+ + + A E + Sbjct: 73 ANDRFYYLKQAATNFTTQLAQSSPNSEIALVTFNKTATEQFDFKNVGKDSAYITETINAM 132 Query: 310 QETGGTIVSSALKLMDEVVKERYNPAQWNIYAAQASDGDNWADDSPLCHEILAKKLLPVV 369 + +GGT + L +++ N + Y +DG + K Sbjct: 133 ETSGGTHQNEGLDRAYKILNNDQNTSNLKRYVVLLTDGCPNGVTYDQITTSINKIKSTNT 192 Query: 370 RYYSYIEITRRAHQTL 385 + + + L Sbjct: 193 KLITVGVGLDETNTGL 208 >UniRef50_UPI0000EB3C0A UPI0000EB3C0A related cluster n=1 Tax=Canis lupus familiaris RepID=UPI0000EB3C0A Length = 563 Score = 74.5 bits (181), Expect = 8e-12, Method: Composition-based stats. Identities = 23/186 (12%), Positives = 53/186 (28%), Gaps = 20/186 (10%) Query: 250 VMFCLMDVSGSMDQSTKDMAKRFYILLYLFLSRTYKNVEVVYIRHHTQAKEVDE----HE 305 + L+D S S+ ++ K+F + L + K +V +++ + ++ H Sbjct: 346 DLVFLIDGSKSVRPENFELVKKFINQIVDTLDVSDKLAQVGLVQYSSSVRQEFPLGRFHT 405 Query: 306 FFYSQE--------TGGTIVSSALKLMDE--VVKERYNPAQWNIYAAQASDGDNWADDSP 355 + GT+ +ALK + + +DG + + Sbjct: 406 KKDIKAAVRNMSYMEKGTMTGAALKYLIDNSFTVSSGARPGAQKVGIVFTDGRSQDYIND 465 Query: 356 LCHEILAKKLLPVVRYYSYIEITRRAHQTLWREYEHLQSTFDNFAMQHIRDQDDIYPVFR 415 K + ++ + RE F + + I R Sbjct: 466 ----AAKKAKDLGFKMFAVGVGNAVEDE--LREIASEPVAEHYFYTADFKTINQIEEDLR 519 Query: 416 ELFHKQ 421 + Sbjct: 520 GSVRPE 525 >UniRef50_P76396 Uncharacterized protein yegL n=64 Tax=cellular organisms RepID=YEGL_ECOLI Length = 219 Score = 74.5 bits (181), Expect = 8e-12, Method: Composition-based stats. Identities = 22/175 (12%), Positives = 53/175 (30%), Gaps = 19/175 (10%) Query: 233 DLRYKNYEKRPDPSSQAVMFCLMDVSGSMDQSTKDMAKRFYILLYLFLSRT------YKN 286 + + + +P + L+DVSGSM+ + + L + Sbjct: 4 QITFATSDFASNPEPRCPCILLLDVSGSMNGRPINELNAGLVTFRDELLADPLALKRVEL 63 Query: 287 VEVVYIRHHTQAKEVDEHEFFY--SQETGGTIVSSALKLMDEVVKERYNP------AQWN 338 V + H + FF G T + +A+ ++V+ER + + Sbjct: 64 GIVTFGPVHVEQPFTSAANFFPPILFAQGDTPMGAAITKALDMVEERKREYRANGISYYR 123 Query: 339 IYAAQASDGDNWADDSPLCHEILAKKLLPVVRYYSYIEITRRAHQTLWREYEHLQ 393 + +DG + +++ + ++S + + Sbjct: 124 PWIFLITDGAPTDEWQAAANKVFRGEEDKRFAFFSIGVQGAD-----MKTLAQIS 173 >UniRef50_UPI00006A1B4A Collagen alpha-3(VI) chain precursor. n=5 Tax=Xenopus (Silurana) tropicalis RepID=UPI00006A1B4A Length = 2535 Score = 74.2 bits (180), Expect = 8e-12, Method: Composition-based stats. Identities = 28/193 (14%), Positives = 58/193 (30%), Gaps = 26/193 (13%) Query: 249 AVMFCLMDVSGSMDQSTKDMAKRFYILLYLFLS---RTYKNVEVV--------YIRHHTQ 297 A ++ L+D SGS+ + K+F I L + V + Sbjct: 795 ADIYFLIDGSGSIYPEDFEDMKKFMIELISMFQVGANRVRFGVVQYSDVRRTEFFISEHN 854 Query: 298 AKEVDEHEFFYS-QETGGTIVSSALKLMDEVV--KERYNPAQWNIYAAQASDGDNWADDS 354 +++ + Q GGT+ AL M ++ + P + +DG+ Sbjct: 855 TQKMLKDAISQIEQLGGGTLTGEALTSMKQLFVNAAKDRPHKVPQSLVVITDGE----SQ 910 Query: 355 PLCHEILAKKLLPVVRYYSYIEITRRAHQTLWREYEHLQSTFDNFAMQHIRDQDDIYPVF 414 E A+ + ++ A + R+ + D V Sbjct: 911 DRVTEAAAEIRNDGITIFAIGVK--NAVEEEIRDIAGSNEKMFFV------NNFDSLKVI 962 Query: 415 RELFHKQNATAKG 427 + ++ T + Sbjct: 963 KNDLARELCTPEA 975 Score = 67.6 bits (163), Expect = 9e-10, Method: Composition-based stats. Identities = 28/195 (14%), Positives = 50/195 (25%), Gaps = 19/195 (9%) Query: 217 AELRAKIERVPFIDTFDLRYKNYEKRPDPSSQAVMFCLMDVSGSMDQSTKDMAKRFYILL 276 R L + +A + L+D S S++ + K F + Sbjct: 965 DLARELCTPEATRAVPKLWLSLPLNTACKNMKADIVFLVDSSASINSDDYETMKEFMESM 1024 Query: 277 YLFLSRTYKNVEVVYIRHHTQAKEVDE----HEFFYSQE--------TGGTIVSSALKLM 324 V++ I+ ++ KE Q + GT++ ALK Sbjct: 1025 VKQAEIGPDRVQIGLIQFSSETKEEFPLNRYKRKDEIQSAIRGIQQLSQGTLMGEALKYT 1084 Query: 325 DEVVKERYNPA-QWNIYAAQASDGDNWADDSPLCHEILAKKLLPVVRYYSYIEITRRAHQ 383 Y +DG+ A D+ + V Y+ + Sbjct: 1085 LPYFSASKGGRVNTKQYLIVITDGE--AQDAVGNPAKAIRD--HGVIIYAIGVQQAN-NT 1139 Query: 384 TLWREYEHLQSTFDN 398 L Q Sbjct: 1140 QLLE-IAGKQEQVYY 1153 Score = 67.2 bits (162), Expect = 1e-09, Method: Composition-based stats. Identities = 24/193 (12%), Positives = 48/193 (24%), Gaps = 22/193 (11%) Query: 249 AVMFCLMDVSGSMDQSTKDMAKRFYILLYLFLSRTYKNVEVVYIRHHTQAK--------- 299 A + LMD S S+ K F L +++ I++ A+ Sbjct: 1 ADIVFLMDGSWSIGTENFITMKNFLYTLVNGFDVGLDKIQIGLIQYSDNARTEFFLNSYS 60 Query: 300 ---EVDEHEFFYSQETGGTIVSSALKLMDEV----VKERYNPAQWNIYAAQASDGDNWAD 352 +V ++ + GGT +L+ M A +DG Sbjct: 61 NKEDVLKYIQNLKYKGGGTKTGLSLEFMLTQHFSEAAGSRAAEGVPQIAVVITDGQAQDS 120 Query: 353 DSPLCHEILAKKLLPVVRYYSYIEITRRAHQTLWREYEHLQSTFDNFAMQHIRDQDDIYP 412 + + Y+ + E +++ I Sbjct: 121 IREPAIAV----KNAGIILYAIGIKDAVLSE--LNEIASDPDDKHVYSVADFNALQSISQ 174 Query: 413 VFRELFHKQNATA 425 ++ A Sbjct: 175 NMIQVLCTTVEEA 187 Score = 66.4 bits (160), Expect = 2e-09, Method: Composition-based stats. Identities = 27/200 (13%), Positives = 61/200 (30%), Gaps = 17/200 (8%) Query: 238 NYEKRPDPSSQAVMFCLMDVSGSMDQSTKDMAKRFYILLY-LFL--SRTYKNVEVVYIRH 294 Y + + A + L+D S S+ + + +RF + L + VVY + Sbjct: 1193 RYTQACKKTEVADIIFLLDASASITRGEFRLMQRFVEAVVNDSLVGKDNVQFGAVVYGTN 1252 Query: 295 HTQAKEV--------DEHEFFYS-QETGGTIVSSALKLMDEVVKERYN-PAQWNIYAAQA 344 + + F Q +G T + AL+ Y + Sbjct: 1253 PAEQFSLNTYSTKLDILKAVFSLPQVSGYTYTAKALEYTRIRFGTSYGGRPGISHILILV 1312 Query: 345 SDGDNWADDSPLCHEILAKKLLPVVRYYSYIEITRRAHQTLWREYEHLQSTFDNFAMQHI 404 +DG D P + + ++ + ++ F +Q+ Sbjct: 1313 TDGATTEADRPNLPIVSKALKDDGIIVFAVGVGKAVPQE--LQQIA--GYPDRWFLVQNY 1368 Query: 405 RDQDDIYPVFRELFHKQNAT 424 + D+I+ ++ ++ Sbjct: 1369 KGLDNIHDNITQVVCDESKP 1388 Score = 64.9 bits (156), Expect = 6e-09, Method: Composition-based stats. Identities = 23/180 (12%), Positives = 52/180 (28%), Gaps = 21/180 (11%) Query: 248 QAVMFCLMDVSGSMDQSTKDMAKRFYILLYLFLSRTYKNVEVVYIRHHTQAKEV------ 301 Q + L+D S S+ S AK F + + + V + ++ K+ Sbjct: 1406 QLDLVFLIDGSASITSSNFTSAKTFMKEIVDSFTISENRVRIGVAQYSANPKKEFFLNEY 1465 Query: 302 -----DEHEFFYS-QETGGTIVSSALKLMDEVVK-ERYNPAQWNIYAAQASDGDNWADDS 354 + + Q T L+ + + Y +DG + + Sbjct: 1466 YSSSDMKKQIDSISQLKATTYTGKGLRFVKQFFDPANGGRKNVPQYLIVMTDGMSNDSVN 1525 Query: 355 PLCHEILAKKLLPVVRYYSYIEITRRAHQTLWREYEHLQSTFDNFAMQHIRDQDDIYPVF 414 + V+ +S R + + + S + + ++ + D I Sbjct: 1526 EDAAAL----RSSGVKIFSIGIGLRNSFELVM--IA--GSPKNVYEVETFQALDSIKRQI 1577 Score = 59.5 bits (142), Expect = 2e-07, Method: Composition-based stats. Identities = 22/200 (11%), Positives = 53/200 (26%), Gaps = 22/200 (11%) Query: 235 RYKNYEKRPDPSSQAVMFCLMDVSGSMDQSTKDMAKRFYILLYLFLSRTYKNVEVVYIRH 294 + + ++QA + L++ + M +T + AK F L L + + + + Sbjct: 190 QTGQIAQVCRTANQADIVLLVESTTRMGDATFEKAKNFLYDLVSNLDVGINKIRIGLVTY 249 Query: 295 HTQA------------KEVDEHEFFYSQETGGTIVSSALKLM----DEVVKERYNPAQWN 338 + + E+ E G T AL+ + Sbjct: 250 NDETNPEFLLNSYSSKTEILESIQNMKYVEGYTYTGRALEYVNTTYFTQAAGSRFEESVA 309 Query: 339 IYAAQASDGDNWADDSPLCHEILAKKLLPVVRYYSYIEITRRAHQTLWREYEHLQSTFDN 398 + D S E + + Y + + +E Sbjct: 310 QILIIVT----EGDSSDTLTEPAKELKSRGISVYVVGT-NIKYDRQ-LQEASSKPDEKFF 363 Query: 399 FAMQHIRDQDDIYPVFRELF 418 + + D +++ + Sbjct: 364 YQLDDFDDSENVTEQLLKNL 383 Score = 54.9 bits (130), Expect = 5e-06, Method: Composition-based stats. Identities = 25/249 (10%), Positives = 61/249 (24%), Gaps = 28/249 (11%) Query: 191 LEENLAIISNSEPAQLLEEERLRKEIAELRAKIERVPFIDTFDLRYKNYEKRPDPSSQAV 250 + N+ + A +E+ ++ + + + + A Sbjct: 341 VGTNIKYDRQLQEASSKPDEKFFYQLDDFDDSENVTEQLLKNLCFSIDLNIQVYSKRYAD 400 Query: 251 MFCLMDVSGSMDQSTKDMAKRFYILLYLFLSRTYKNVEVVYIRHHT--------QAKEVD 302 + L+D S SM K F I + L+ + ++ E Sbjct: 401 VVFLVDSSTSMGTIFFQKMKDFIIHIINQLNVGINKHRIGLAQYSGLPQTEFLLNHYETK 460 Query: 303 EHEFFYSQE-----TGGTIVSSALKLMDEVV----KERYNPAQWNIYAAQASDGDNWADD 353 E + +E G AL+ + + + + Sbjct: 461 EEILKHIKETFTYRGGPLKTGHALEFVRSTFFIEEAGSRINYGNPQFLVVIT-----SSK 515 Query: 354 SPLCHEILAKKLL-PVVRYYSYIEITRRAHQTLWREYEHLQSTFDNFAMQHIRDQDDIYP 412 S A++L V S + + + F ++ ++ Sbjct: 516 SEDAVRRHAEELKSVGVTTISVGIG--NSDRKELEKIATD---PFVFQTTGLQHISNLQQ 570 Query: 413 VFRELFHKQ 421 + + Sbjct: 571 DVANVIIAE 579 Score = 52.2 bits (123), Expect = 4e-05, Method: Composition-based stats. Identities = 23/186 (12%), Positives = 49/186 (26%), Gaps = 31/186 (16%) Query: 227 PFIDTFDLRYKNYEKRPD---PSSQAVMFCLMDVSGSMDQSTKDMAKRFYILLYLFLS-- 281 I L++ P +S A + L+D S S+ + + F + L Sbjct: 575 VIIAEDMLQFSPVSVVPAVCSSASVADIVFLIDESSSIGPINFQLTRVFLHKVVSALDIS 634 Query: 282 -RTYKNVEVVYIRHHT--------QAKEVDEHEFFYSQETGGTIVSSALKLMDEVVKER- 331 + V+Y K GG + A + ++++ Sbjct: 635 LSNVRVGLVLYSDEPRLELKLNTFNEKYEILDFITKLPYRGGKAHTGA---ALDFLRKKM 691 Query: 332 -------YNPAQWNIYAAQASDGDNWADDSPLCHEILAKKLLPVVRYYSYIEITRRAHQT 384 A ++G + + + AK V ++ + + T Sbjct: 692 FTKQNGGRPHQGVQQIAVVMTNGQSMDN----FTKPAAKLRRSGVEVFAVGF--QNINDT 745 Query: 385 LWREYE 390 Sbjct: 746 ELDIIA 751 >UniRef50_D2A0T3 Putative uncharacterized protein GLEAN_08265 n=2 Tax=Tribolium castaneum RepID=D2A0T3_TRICA Length = 1091 Score = 74.2 bits (180), Expect = 8e-12, Method: Composition-based stats. Identities = 35/322 (10%), Positives = 82/322 (25%), Gaps = 22/322 (6%) Query: 113 DEFVFQISKDEYLDLLFEDLALPNLKQNQQRQLTEYKTHRAGYTANGVPANISVVRSLQN 172 + + +E + + + Q+ Y V N+SVV+ N Sbjct: 117 HNYFTKRRIEELFAPACDCSRPIDRTRPQENFNLLPLKKHPNYGDTPVNLNLSVVKIATN 176 Query: 173 SLARRTAMTAGKRRELHALEENLAIISNSEPAQLLEEERLRKEIAELRA-KIERVPFIDT 231 R + G R + + + A K + T Sbjct: 177 IYEREQEVLQGVRWSEPLDMIFKENLDKDPTLKYQYFASPHGYMRHFPAVKWSDERYDQT 236 Query: 232 FDLRYKNYEKRPDPSSQAVMFCLMDVSGSMDQSTKDMAKRFYILLYLFLSRTYKNVEVVY 291 +D R +++ +S +F L+D SGS+ + + +A + L+ ++ Sbjct: 237 YDPRTRSWYTEAM-TSPKDVFILLDSSGSVCKLKRKIAAHIVNNILDTLNDNDFVNIYLF 295 Query: 292 IRHHTQAKE---------------VDEHEFFYSQETGGTIVSSALKLMDEVVKERYNPA- 335 + + V+ L+ ++ E Sbjct: 296 ANSTRPLVPCFKDTLVQANEENLRLLRETLDNYKPDFQANVAVGLEKAFTLLAEFREKGI 355 Query: 336 --QWNIYAAQASDGDNWADDSPLCHEILAK-KLLPVVRYYSYIEITRRAHQTLWREYEHL 392 N + + + + + + VR ++Y + L Sbjct: 356 GSLCNQAIMLIT-EEAFFREDEKNFFNRSNWQYGTPVRVFTYQLERSESDARLLEWIACS 414 Query: 393 QSTFDNFAMQHIRDQDDIYPVF 414 + ++ P Sbjct: 415 NKGYFVNISLMQEVREKALPYL 436 >UniRef50_A8DZ06 CG4587, isoform C n=21 Tax=Neoptera RepID=A8DZ06_DROME Length = 1243 Score = 74.2 bits (180), Expect = 8e-12, Method: Composition-based stats. Identities = 37/333 (11%), Positives = 87/333 (26%), Gaps = 31/333 (9%) Query: 101 GQGQASQDGEGQDEFVFQISKDEYLDL-LFEDLALPNLKQNQQRQLTEYKTHRAGYTANG 159 GQGQA GQ + + + D L + +++ + E ++ Sbjct: 118 GQGQAESPMGGQQHYDARRINEYNADGKLADGARHMDIR---FMRRFERLPVNLSLSSIL 174 Query: 160 VPANISVVRSLQNSLARRTAMTAGKRRELHALEENLAIISNSEPAQLLEEERLRKEIAEL 219 VP + + S A+ + + S LR+ Sbjct: 175 VPHGVDLDEPDVKS-----ALQWSGHLDPLFQNNLEQDPALSWQYFGSSTGFLRRFPGTA 229 Query: 220 RAKIERVPFIDTFDLRYKNYEKRPDPSSQAVMFCLMDVSGSMDQSTKDMAKRFYILLYLF 279 D R N+ SS + L+D S SM + + D+ + Sbjct: 230 WPPEGSKGSKLIHDFRTHNW-FVQAASSPKDIMILLDASSSMTEKSFDLGMATAFNILDT 288 Query: 280 LSRTYKNVEVVY---------------IRHHTQAKEVDEHEFFYSQETGGTIVSSALKLM 324 L + + +R + + + ++ L+ Sbjct: 289 LGEDDFVNLITFSEVVKTPVPCFKDRMVRATPDNIQEIKSAVKAIKLQDTANFTAGLEYA 348 Query: 325 DEVVKERYNPAQ---WNIYAAQASDGDNWADDSPLCHEILAKKLLPVVRYYSYIEITRRA 381 ++ + N + ++ ++ S VR ++Y+ + Sbjct: 349 FSLLHKYNQSGAGSQCNQAIMLIT--ESTSE-SHKDVIKQYNWPHMPVRIFTYLIGSDSG 405 Query: 382 HQTLWREYEHLQSTFDNFAMQHIRDQDDIYPVF 414 ++ + F + + + Sbjct: 406 SRSNLHDMACSNKGFFVQINDYDEARRKVIDYA 438 >UniRef50_B4AJ60 YwmD n=2 Tax=Bacillus pumilus RepID=B4AJ60_BACPU Length = 225 Score = 74.2 bits (180), Expect = 8e-12, Method: Composition-based stats. Identities = 26/205 (12%), Positives = 49/205 (23%), Gaps = 29/205 (14%) Query: 242 RPDPSSQAVMFCLMDVSGSM-----DQSTKDMAKRFYILLYLFLSRTYKNVEVVY----- 291 A + L+D+SGSM + D+AKR LS + + V+ Sbjct: 24 VTKKQEPAHVVILLDLSGSMAQSVEGEKKIDIAKRSIQSFASILSDDTQVLLRVFGHEGT 83 Query: 292 ----------------IRHHTQAKEVDEHEFFYSQETGGTIVSSALKLMDEVVKERYNPA 335 + + + TG T ++ AL + ++ Sbjct: 84 NKNAGKAISCESSEAVFGFGSYESSTFQQALNVYKPTGWTPLAKALTDTKQDFEDHQAEG 143 Query: 336 QWNIYAAQASDGDNWADDSPLCHEILAKKLLPVVRYYSYIEITRRAHQTLWREYEHLQST 395 + SDG SP + + Sbjct: 144 KN--IVYVVSDGQETCGGSPSQAAKELHEGGIDTIVNIIGFDVNEKEARSLKSVAKAGGG 201 Query: 396 FDNFAMQHIRDQDDIYPVFRELFHK 420 + + + I F + Sbjct: 202 QYQ-PAANAEELNYILQNAASTFSQ 225 >UniRef50_C3XQR8 Putative uncharacterized protein n=1 Tax=Branchiostoma floridae RepID=C3XQR8_BRAFL Length = 2411 Score = 74.2 bits (180), Expect = 8e-12, Method: Composition-based stats. Identities = 20/204 (9%), Positives = 47/204 (23%), Gaps = 46/204 (22%) Query: 237 KNYEKRPDPSSQAVMFCLMDVSGSMDQST-------KDMAKRFYILLYLFLSRTYKNVEV 289 +N+ + + ++DVSGSM + ++AK+ + + L+ V Sbjct: 127 RNWYVSAASPKKKNVVIVIDVSGSMREPPGPEEQNRLNLAKQAALTVLDTLTPRDWGGVV 186 Query: 290 VYIRHHT---------------QAKEVDEHEFFYSQETGGTIVSSALKLMDEVVKERYNP 334 + + + T+ + ++ E N Sbjct: 187 SFSARAETPEGCLGDSLGEANPTNIGIMQDFINQRVPETITMYGVGFRKAFDMFAEARNK 246 Query: 335 AQWN-----IYAAQASDGDNWA-DDSPLCHEILAKKLLPVVRYYSYIEI----------T 378 SDG + + + V ++Y Sbjct: 247 KPEQFEDCYNIIIFLSDGSPTDKAFALDEITKGQELMDRSVYIFTYGLGANLMWASSQWA 306 Query: 379 RRAH--------QTLWREYEHLQS 394 + R + Sbjct: 307 PDPNNPFVYLPALDFLRTIADQNN 330 >UniRef50_Q01RP7 von Willebrand factor, type A n=1 Tax=Candidatus Solibacter usitatus Ellin6076 RepID=Q01RP7_SOLUE Length = 326 Score = 74.2 bits (180), Expect = 9e-12, Method: Composition-based stats. Identities = 25/186 (13%), Positives = 50/186 (26%), Gaps = 25/186 (13%) Query: 247 SQAVMFCLMDVSGSMDQSTKDMAKRFYILLYLFLSRTYKNVEVVYIRHH------TQAKE 300 + A + + D S SM + A L + + V + T+ E Sbjct: 96 APASVGLVFDTSDSMQ-PRMNKAHEAVEALLKNANPADEFFLVQFSDRARLVAGMTKDSE 154 Query: 301 VDEHEFFYSQETGGTIVSSALKLMDEVVKERYNPAQWNIYAAQASDGDNWADD-SPLCHE 359 + G T + A+ + E +K + SDGD+ + + Sbjct: 155 EISRRAASMRIGGSTALLDAVAMAMEEMKSAH---YLRKVMVIISDGDDNSSRCPVNDLK 211 Query: 360 ILAKKLLPVVRYYSYIEITRRA-----------HQTLWREYEHLQSTFDNFAMQHIRDQD 408 + ++ V Y+ L E F + ++ Sbjct: 212 RIVRE--GDVTIYAIGITDDNVPLAYPQRDRLTGAALLNEIATQTGGR-LFEVHKLKQLP 268 Query: 409 DIYPVF 414 +I Sbjct: 269 EIAAKI 274 >UniRef50_UPI000180D0B0 PREDICTED: similar to von Willebrand factor A domain containing 3A n=1 Tax=Ciona intestinalis RepID=UPI000180D0B0 Length = 1107 Score = 74.2 bits (180), Expect = 9e-12, Method: Composition-based stats. Identities = 24/186 (12%), Positives = 51/186 (27%), Gaps = 26/186 (13%) Query: 249 AVMFCLMDVSGSMDQSTKDMAKRFYILLYLFLSRTYK-NVEVVYIRHHTQAKEVDE---- 303 + + L+DVSGSM + ++ K L++ L+ V + T+ ++ Sbjct: 865 SNVVILIDVSGSMSYNMDELKKEITSLIWEQLNGNKTAFNIVAFSNTSTKWQDSITESNQ 924 Query: 304 -------HEFFYSQETGGTIVSSALKLMDEVVKERYNPAQWNIYAAQASDGDNWADDSPL 356 GG+ A+++ + +DG + Sbjct: 925 SACHDAVQWVSALTAHGGSATLKAIQVAL--------ADEEAEAIYLLTDGKPDSSIKLT 976 Query: 357 CHEI-LAKKLLPVVRYYSYIEITRRAHQTLWREYEHLQSTFDNFAMQHIRDQDDIYPVFR 415 E K + S+ R A+ + + + D Sbjct: 977 LSEASNLNKKNIPIHTISFNCDNREAND-FLKSLSSNSGGRFH----RCHGEADAQFAIH 1031 Query: 416 ELFHKQ 421 L + Sbjct: 1032 RLMQDE 1037 >UniRef50_A9AV55 von Willebrand factor type A n=2 Tax=Bacteria RepID=A9AV55_HERA2 Length = 222 Score = 74.2 bits (180), Expect = 9e-12, Method: Composition-based stats. Identities = 24/163 (14%), Positives = 55/163 (33%), Gaps = 21/163 (12%) Query: 246 SSQAVMFCLMDVSGSMDQSTKDMAKRFYILLYLFLSRTY------KNVEVVYIRHHTQAK 299 + + ++D SGSM D + + + +S ++ + V + Sbjct: 12 EQKCLCILVVDTSGSMQGRPIDELNQGLQVFHQDISNSFSTAQRLEICLVEFNSQADCIV 71 Query: 300 EVDEHEFFYSQ---ETGGTIVSSALKLMDEVVKERY------NPAQWNIYAAQASDGDNW 350 E + F+ G T + ++L V+ER + + +DG+ Sbjct: 72 EPSLVDQFHMPILAVAGTTKLVDGVRLAIHKVQERKSWYRSTGQPYYRPWIILMTDGEP- 130 Query: 351 ADDSPLCHEILAKKLLPVVR---YYSYIEITRRAHQTLWREYE 390 DS LA+++ V + + + A + ++ Sbjct: 131 --DSDQDVAGLAREIQHGVNNKQFVFFPIGVQGADMRMLQQIS 171 >UniRef50_B4D6B0 von Willebrand factor type A n=1 Tax=Chthoniobacter flavus Ellin428 RepID=B4D6B0_9BACT Length = 879 Score = 74.2 bits (180), Expect = 9e-12, Method: Composition-based stats. Identities = 26/224 (11%), Positives = 51/224 (22%), Gaps = 37/224 (16%) Query: 230 DTFDLRYKNYEKRPDPSSQAVMFCLMDVSGSM-----DQSTKDMAKRFYILLYLFLSRTY 284 L + + M ++D SGSM Q+ +A + + L Sbjct: 402 IEQMLPVRMEHDDRLDTPTVAMLVVLDRSGSMTAAVAGQTKISLADQGAVFAMNALQPKD 461 Query: 285 KNVEVVYIRHHTQAKEVD--------EHEFFYSQETGG-TIVSSALKLMDEVVKERYNPA 335 V + E + GG + +++ + + R PA Sbjct: 462 YFGVVAVDTKPHTVVPLAPISAKGAAEQKILSITAGGGGIYIYTSMVEAFQQL--RDIPA 519 Query: 336 QWNIYAAQ-------------ASDGDNWADDSPLCHEILAKKLLPVVRYYSYIEITR-RA 381 + SDG +S + L + T Sbjct: 520 RVKHLLLFSDAADAEEKAAGEMSDGIRTGGNSLDLASAM---LAAKITTSVVGLGTEQDK 576 Query: 382 HQTLWREYEHLQSTFDNFAMQHIRDQD----DIYPVFRELFHKQ 421 R+ S + V + ++ Sbjct: 577 DTPFLRQLAERGSGRFYLTDDATTLPQIFSTETMKVAQSSLIEE 620 Score = 49.9 bits (117), Expect = 2e-04, Method: Composition-based stats. Identities = 16/118 (13%), Positives = 34/118 (28%), Gaps = 18/118 (15%) Query: 244 DPSSQAVMFCLMDVSGSMDQSTKDMAKRFYILLYLFLSRTYKNVEVVYIRHHTQAKEVDE 303 PS + + ++D S S+ + A+ F + + + E+ + Sbjct: 73 LPSQELSVLFVVDHSASISAPAQKEARNFVSTSLAAQHTSDTAGVIGFAAKP----ELWQ 128 Query: 304 HEFFYSQE---------TGGTIVSSALKLMDEVVKERYNPAQWNIYAAQASDGDNWAD 352 + Q T + AL + PA +DG++ Sbjct: 129 APAVHLQPAAQWPEPTDRKATDIGGALDFASAIF-----PAGKARRVVLLTDGNDTGG 181 >UniRef50_D2LNF7 von Willebrand factor type A n=3 Tax=Aciduliprofundum boonei T469 RepID=D2LNF7_9EURY Length = 2166 Score = 74.2 bits (180), Expect = 9e-12, Method: Composition-based stats. Identities = 19/154 (12%), Positives = 38/154 (24%), Gaps = 37/154 (24%) Query: 240 EKRPDPSSQAVMFCLMDVSGSM-----------------DQSTKDMAKRFYILLYLFLSR 282 R + + ++D SGSM + D+A + I L Sbjct: 1242 APRLNKRKPIDIIFVIDTSGSMNSVVPGATVGDVNGDGRSNTRIDVAIQAAIDAVKELGP 1301 Query: 283 TYKNVEVVYIRHHTQ------------AKEVDEHEFFYSQETGGTIVSSALKLMDEVVK- 329 + + + + Q GGT + L + Sbjct: 1302 QDRVAVFTFDGNSHPEEYMGFTYVTADNLPTIISDLKDIQADGGTPLYDTLSWAVYYMDT 1361 Query: 330 ---ERYNPAQWNIYAAQASDG----DNWADDSPL 356 + + +DG DN+ ++ Sbjct: 1362 QSADNPDREDATRGILVLTDGLSNSDNYGPNNVR 1395 >UniRef50_UPI000180BCC9 PREDICTED: similar to calcium activated chloride channel 4 n=2 Tax=Ciona intestinalis RepID=UPI000180BCC9 Length = 1075 Score = 74.2 bits (180), Expect = 9e-12, Method: Composition-based stats. Identities = 25/191 (13%), Positives = 56/191 (29%), Gaps = 21/191 (10%) Query: 237 KNYEKRPDPSSQAVMFCLMDVSGSMDQSTKDMAKRFYILLYL-FLSRTYKNVEVVYIRHH 295 R P+++ ++D SGSM S + + + ++ + + V + Sbjct: 312 PTINVRQQPTNR--YVLVLDTSGSMSGSNYEYMMQAATDFIMTYIPKGAEAGIVEFSYTA 369 Query: 296 TQAKEVD----EHEFFYS------QETGGTIVSSALKLMDEVVKER-YNPAQWNIYAAQA 344 T ++ + + Y Q G T + + EV+ + +PA Sbjct: 370 TTLSQLVSIENKADREYLASRLPGQPDGSTCIGCGILNGIEVLSNQGRDPAGG--QLIVL 427 Query: 345 SDGDNWADDSPLCHEILAKKLLPVVRYYSYIEITRRAHQTLWREYEHLQSTFDNFAM-QH 403 +DG+ A + VV + A ++ + Sbjct: 428 TDGEENYSPYVNDVRDNAIEAHVVVDSIFFGASGNGA----LQQLTEDTKGTMYYNDVTD 483 Query: 404 IRDQDDIYPVF 414 I + + Sbjct: 484 ITGLKETFKQL 494 >UniRef50_Q5NJK1 Matrilin-3a n=5 Tax=Danio rerio RepID=Q5NJK1_DANRE Length = 460 Score = 74.2 bits (180), Expect = 9e-12, Method: Composition-based stats. Identities = 30/238 (12%), Positives = 58/238 (24%), Gaps = 23/238 (9%) Query: 207 LEEERLRKEIAELRAKIERVPFIDTFDLRYKNYEKRPDPSSQAVMFCLMDVSGSMDQSTK 266 + ++ +R S + ++D S S+ Sbjct: 21 DSYDLNNPSYILAQSYAQRRNNGLPHRTLNPAATDSQCRSRPLDLVFIIDSSRSVRPGEF 80 Query: 267 DMAKRFYILLYLFL---SRTYKNVEVVYIRH--------HTQAKEVDEHEFFYSQE-TGG 314 + K F + L + V Y K+ + + G Sbjct: 81 EKVKIFLADMVDTLDVGPDATRVAVVNYASTVKIESLLKSHLTKDTIKQAITRIEPLAAG 140 Query: 315 TIVSSALKLMDEVV-----KERYNPAQWNIYAAQASDGDNWADDSPLCHEILAKKLLPVV 369 T+ A+K + R + A +DG D A+ + Sbjct: 141 TMTGMAIKKAMDEAFTEKSGARPKSKNISKVAIIVTDGRP--QDQVEEVSAAAR--ASGI 196 Query: 370 RYYSYIEITRRAHQTLWREYEHLQSTFDNFAMQHIRDQDDIYPVFRELFHKQNATAKG 427 Y+ RA + F ++ + + FRE +A G Sbjct: 197 EIYAVGV--DRADMRSLKLMASNPLEDHVFYVETYGVIEKLTSKFRETLCDVDACEMG 252 >UniRef50_C5BKN1 Matrixin family protein n=1 Tax=Teredinibacter turnerae T7901 RepID=C5BKN1_TERTT Length = 877 Score = 74.2 bits (180), Expect = 9e-12, Method: Composition-based stats. Identities = 24/174 (13%), Positives = 44/174 (25%), Gaps = 25/174 (14%) Query: 247 SQAVMFCLMDVSGSMDQS--------TKDMAKRFYILLYLFLS--RTYKNVEVVY----I 292 + +MD SGSM+ S D K + FL ++ V + + Sbjct: 418 QNTDVVLVMDRSGSMNLSSAPDPSVSKMDALKYAANVFMDFLDLDAGHRAGLVQFHEVVV 477 Query: 293 RHH---------TQAKEVDEHEFFYSQETGGTIVSSALKLMDEVVKERYNPAQWNIYAAQ 343 + + G T + + + +P+ Sbjct: 478 PFSPAFNLQPVNAASLSAAQTAINSMTAGGMTNIIDGVNEGIAQLTTAVDPSDRQ-IMLL 536 Query: 344 ASDGDNWADDSPLCHEILAKKLLPVVRYYSYIEITRRAHQTLWREYEHLQSTFD 397 +DG + +I A L V YS T ++ Sbjct: 537 LTDGLHNRPVGTSVTDITAPLLASEVTLYSVGFGTST-NEAELTPLALSTGGVH 589 >UniRef50_Q01W27 von Willebrand factor, type A n=1 Tax=Candidatus Solibacter usitatus Ellin6076 RepID=Q01W27_SOLUE Length = 306 Score = 74.2 bits (180), Expect = 1e-11, Method: Composition-based stats. Identities = 24/193 (12%), Positives = 53/193 (27%), Gaps = 19/193 (9%) Query: 235 RYKNYEKRPDPSSQAVMFCLMDVSGSMDQSTKDMAKRFYILLYLFLSRTYKNVEVVYIRH 294 + + + + ++D S SM +D + + + + + + Sbjct: 72 QKQEIKVFRQEDVPISLGLVIDTSASMSN-KRDRVNSAALAMVKASNPEDEVFVISFSEE 130 Query: 295 H------TQAKEVDEHEFFYSQETGGTIVSSALKLMDEVVKERYNPAQWNIYAAQASDGD 348 T + E G T + AL L + ++ + +DG+ Sbjct: 131 AFITQDFTSDVKQLESSLRKLGSKGETAMRDALSLGLDHLRAPARKDK--KVLVVITDGE 188 Query: 349 NWADDSPLCHEILAKKL-LPVVRYYSYIEITRRAHQ------TLWREYEHLQSTFDNFAM 401 + S E L + L V Y + A + Sbjct: 189 DN--SSIQKQENLIRAAHLSNVIIYGIGLLAAEAPASAQRAKASLDVLTLATGGRSWY-P 245 Query: 402 QHIRDQDDIYPVF 414 +++ D + I P Sbjct: 246 ENVADIEKITPEI 258 >UniRef50_B6B3S7 Magnesium chelatase ATPase subunit D n=1 Tax=Rhodobacterales bacterium HTCC2083 RepID=B6B3S7_9RHOB Length = 547 Score = 74.2 bits (180), Expect = 1e-11, Method: Composition-based stats. Identities = 44/323 (13%), Positives = 84/323 (26%), Gaps = 41/323 (12%) Query: 90 RPQGGGGGSGSGQGQASQDGEGQDEFVFQISKDEYLDLLFEDLALPNLKQNQQRQLTEYK 149 R Q + ++ I +D LD + L L++ Sbjct: 230 RATRLPETQDEQPKQPQNETRQNEDDTLSIPQDLLLDAVKAALPADVLEKLA-------- 281 Query: 150 THRAGYTANGVPANISVVRSLQNSLARRTAMTAGKRRELHALEENLAIISNSEPAQLLEE 209 + G N + + N R AG + + + +A + + P Q Sbjct: 282 ---SDTKRKGTTGNGAGAKQNGNRRGRPLPARAGSKANTARV-DLIATLRAAIPYQ---- 333 Query: 210 ERLRKEIAELRAKIERVPFIDTFDLRYKNYEKRPDPSSQAVMFCLMDVSGSMDQSTKDMA 269 I P I DLR K Y S ++ +D SGS + A Sbjct: 334 -----TIRREAQPQRTGPIIHPGDLRRKRY----QTLSDRLLIFTVDASGSAAMARLAEA 384 Query: 270 KRFYILLY-LFLSRTYKNVEVVYIR-------HHTQAKEVDEHEFFYSQETGGTIVSSAL 321 K +L +R + + T++ + GGT ++S L Sbjct: 385 KGAVEMLLSEAYARRDHVALISFRGLDAEVLLPPTRSLVQTKRRLAALPGGGGTPLASGL 444 Query: 322 KLMDEVVKERYNPAQWNIYAAQASDGDNWADDSPLCHEILAKKLLPVV--RYYSYIEITR 379 + + + + +DG + A + S Sbjct: 445 TAALSLAETASHK-GMSATIVLLTDGRANIALDGQANRTQAGDDAQTIARNILSAGVDAL 503 Query: 380 RAH-----QTLWREYEHLQSTFD 397 + ++ + Sbjct: 504 VIDTTIRPERSLKQLADMMHANY 526 >UniRef50_C3ZY18 Putative uncharacterized protein n=1 Tax=Branchiostoma floridae RepID=C3ZY18_BRAFL Length = 534 Score = 74.2 bits (180), Expect = 1e-11, Method: Composition-based stats. Identities = 30/181 (16%), Positives = 55/181 (30%), Gaps = 23/181 (12%) Query: 246 SSQAVMFCLMDVSGSMDQSTKDMAKRFYILLYLFLSRTY---KNVEVVYIRHHTQAKEVD 302 ++ + ++D SGS+ + +F + L L + + V Y + T + Sbjct: 143 KTELDLVFVIDGSGSISSVSFGSVMKFAADMSLRLDISATTTRVGMVQYSTNVTPEFMLK 202 Query: 303 EHEFFY---------SQETGGTIVSSALKLMDEVVKERYNPAQWNIYAAQASDGDNWADD 353 EH + GGT ALK + + R P A +DG + D Sbjct: 203 EHTTKKSVEKAIGDVKRLGGGTNTGKALKFVRTEMDWRDPP--TKRVAIVVTDGK--SQD 258 Query: 354 SPLCHEILAKKLLPVVRYYSYIEITRRAHQTLWREYEHLQSTFDNFAMQHIRDQDDIYPV 413 ++ V+ T +E +A+ + DI Sbjct: 259 DVGTPATALRQAGVVLYAVGVGLPTDE-----LKEIT--GDPTKVYALNSYDELQDIIQD 311 Query: 414 F 414 Sbjct: 312 I 312 >UniRef50_C5S895 von Willebrand factor type A n=1 Tax=Allochromatium vinosum DSM 180 RepID=C5S895_CHRVI Length = 346 Score = 73.8 bits (179), Expect = 1e-11, Method: Composition-based stats. Identities = 32/294 (10%), Positives = 68/294 (23%), Gaps = 46/294 (15%) Query: 170 LQNSLARRTAMTAGKRRELHALEENLAIISNSEPAQLLEEERLRKEIAELRAKIERVPFI 229 + + T + L++ + P L R + L + Sbjct: 25 DRRTQVDETLEGRRQTLLHPRLDDLRTAFTARRPRLQLAGRLYRALLYLLWIALVVALMR 84 Query: 230 DTFDLRYKNYEKRPDPSSQAVMFCLMDVSGSMDQ----------STKDMAKRFYILLYLF 279 + Y + +D S SM+ + + K Sbjct: 85 PQWLTPYTEVST-----PGYDLMIAVDASHSMEALDFTVEGRQVNRMAVVKGVMGRFIDA 139 Query: 280 LSRTYKNVEVVYIRHH------TQAKEVDEHEFFYSQ---ETGGTIVSSALKLMDEVVKE 330 + + +++ T + T + A+ L ++E Sbjct: 140 -RQGDRVGLILFGSQAFILSPLTLDRHAARQLLDGVVPSIAGPATALGDAIALGVSKLRE 198 Query: 331 RYNPAQWNIYAAQASDGDNWADD-SPLCHEILAKKLLPVVRYYSYIEITRRA-------- 381 R + + +DGDN A +P LA+ + Sbjct: 199 RP---EGSRVMIVIADGDNNAGSFAPKEAARLARATGTRIYVIGVGSKQPSIPILEEGSV 255 Query: 382 --------HQTLWREYEHLQSTFDNFAMQHIRDQDDIYPVFRELFHKQNATAKG 427 + +E L F R ++I +L + Sbjct: 256 RYRDDLTMDEGTLQEIADLTGGGY-FRATDTRALEEISSRIGQLEKTEAEARTA 308 >UniRef50_UPI0001792F00 PREDICTED: similar to AGAP009579-PA n=1 Tax=Acyrthosiphon pisum RepID=UPI0001792F00 Length = 1209 Score = 73.8 bits (179), Expect = 1e-11, Method: Composition-based stats. Identities = 25/275 (9%), Positives = 57/275 (20%), Gaps = 41/275 (14%) Query: 180 MTAGKRRELHALEENLAIISNSEPAQLLEEERLRKEIAELRAKIERVPFIDTFDLRYKNY 239 + E S+ + + P + LR Sbjct: 214 QKPLQWSEKMDEVFLQNYRSDPSLSWQYFGSTAGFMRHYPAIRWSAKPSVFDTRLR---P 270 Query: 240 EKRPDPSSQAVMFCLMDVSGSMDQSTKDMAKRFYILLYLFLSRTYKNVEVVYIRHHTQAK 299 + + L+D+SGSM + +++ L + + + T Sbjct: 271 WYIQAATCSKDVVILLDISGSMSGMHQSISQLVIKSLLKTFNNDDSINIIFFNTGFTYLV 330 Query: 300 EVDEH------------------EFFYSQETGGTIVSSALKLMDEVVKERY-----NPAQ 336 + E + S A +++ + Sbjct: 331 TCFKELLVQATPENLYTFHKAIIEDPRLLPSSTANYSKAFIEAFQLLSNNRNNNRCSEET 390 Query: 337 WNIYAAQASDGDNWADDSPLCHEILAKKLLP---------VVRYYSYIEITRRAHQTLWR 387 N +D DD + +K VR ++Y+ Sbjct: 391 CNQMIMLVTD-----DDPNEELFDVVQKHNRIDDNKFTNIPVRIFTYMMGRDITTTPELE 445 Query: 388 EYEHLQSTFDNFAMQHIRDQDDIYPVFRELFHKQN 422 F + + + + Sbjct: 446 RLACENRGFFAHVHSKDEVLESVLKYI-AVLARPL 479 >UniRef50_D2V397 Predicted protein n=1 Tax=Naegleria gruberi RepID=D2V397_NAEGR Length = 452 Score = 73.8 bits (179), Expect = 1e-11, Method: Composition-based stats. Identities = 18/175 (10%), Positives = 46/175 (26%), Gaps = 19/175 (10%) Query: 246 SSQAVMFCLMDVSGSMDQSTK-DMAKRFYILLYLFLSRTYKNVEVVYIR----------- 293 S+ + ++DVS SM + + A + +L+ V++ Sbjct: 75 STSKSIIIILDVSSSMGAYHRLENAIYATRSVINYLTEKDYVGIVLFNAGAFTCKKQTEF 134 Query: 294 ---HHTQAKEVDEHEFFYSQETGGTIVSSALKLMDEVVKERY--NPAQWNIYAAQASDGD 348 Q K+ G T +A + + + +DG Sbjct: 135 LLKATAQNKKTLIDCIENMMPFGSTNFEAAFNETFNLFDRSEEIASSTCDRVVLFLTDGT 194 Query: 349 NWADDSPLCHEILAKKLLPVVRYYSYIEITRRAHQTLWREYEHLQSTFDNFAMQH 403 +P+ + + + + + + A + + + Sbjct: 195 ITKGANPIPLIR-KRNIEYQAKIFGF-SLGSVADTEIPKRIACENRGLWSVIEDK 247 >UniRef50_UPI0000E46E0F PREDICTED: similar to LOC594926 protein n=1 Tax=Strongylocentrotus purpuratus RepID=UPI0000E46E0F Length = 870 Score = 73.8 bits (179), Expect = 1e-11, Method: Composition-based stats. Identities = 17/209 (8%), Positives = 49/209 (23%), Gaps = 25/209 (11%) Query: 238 NYEKRPDPSSQAVMFCLMDVSGSMDQSTKDMAKRFYILLYLFLSRTYKNVEVVYIRH--- 294 + + + + ++D S SM + K + ++ + + + + + Sbjct: 316 RFAPANFGAVKKRVVFVLDFSASMYGNKIKQTKEAMYTILDEMNDSDRFNVLPFSDYVYS 375 Query: 295 ---HTQAKEV-------DEHEFFYSQETGGTIVSSALKLMDEVVK-----ERYNPAQWNI 339 Q +V + GT ++ AL +++ + Sbjct: 376 GWNSGQMVDVNPYNIRDAKDFIRQLDIQRGTNLNDALLGGLSLLESTGSMNSTSSNPMVC 435 Query: 340 YAAQASDGDNW-ADDSPLCHEILAKK-LLPVVRYYSYIEITRRAHQTLWREYEHLQSTFD 397 +DG S E + + + Sbjct: 436 ILFVLTDGKPSEGVTSLSEIERNVRNANNQRCSIVTLGFGRL-VNYNFLVRLALQNRG-- 492 Query: 398 NFAMQHIRDQDDIYPVFRELFHKQNATAK 426 + I + R ++ + Sbjct: 493 --MARKIYEDSSAAGQLRGVYSEVATPLL 519 >UniRef50_B7FTA2 Predicted protein n=3 Tax=Bacillariophyta RepID=B7FTA2_PHATR Length = 800 Score = 73.8 bits (179), Expect = 1e-11, Method: Composition-based stats. Identities = 38/230 (16%), Positives = 68/230 (29%), Gaps = 19/230 (8%) Query: 131 DLALPNLKQNQQRQLTEYKTHRAGYTANGVPAN-ISVVRSLQNSLARRTAMTAGKRRELH 189 +P + Q + + R L S R + + Sbjct: 481 APDVPEVPQEFMFDIDATPMDPDLIDFTSRERSGKGGGRGLIFSQDRGRYIKPMLPKGKV 540 Query: 190 ALEENLAIISNSEPAQLLEEERLRKEIAELRAKIERVPFIDTFDLRYKNYEKRPDPSSQA 249 A + S P Q ER A +K R I D+R K + + Sbjct: 541 IRLAVDATLRASAPYQKSRRER-----AVGTSKEGRGVHIQQSDVRIKKMA----RKAGS 591 Query: 250 VMFCLMDVSGSMDQSTKDMAK-RFYILLYLFLSRTYKNVEVVYIRH-------HTQAKEV 301 ++ ++D SGSM + + AK LL K + + T++ + Sbjct: 592 LIIFVVDASGSMALNRMNAAKGAAVSLLTEAYQSRDKISLIPFQGEMADVLLPPTKSITM 651 Query: 302 DEHEFFYSQETGGTIVSSALKLM-DEVVKERYNPAQWNIYAAQASDGDNW 350 GG+ ++ AL+L + + + + SDG Sbjct: 652 ARQRLEQMPCGGGSPLAHALQLATLTGINAQKSGDVGKVVVVLISDGRAN 701 >UniRef50_A5UT43 von Willebrand factor, type A n=1 Tax=Roseiflexus sp. RS-1 RepID=A5UT43_ROSS1 Length = 504 Score = 73.8 bits (179), Expect = 1e-11, Method: Composition-based stats. Identities = 29/254 (11%), Positives = 52/254 (20%), Gaps = 40/254 (15%) Query: 195 LAIISNSEPAQLLEEERLRKEIAELRAKIERVPFIDTFDLRYKNYEKRPDPSSQAVMFCL 254 +A + + P + + F + + + Sbjct: 83 VAPFTPATPTATGADAPTTVPESSSPPTDTTTIFR---------PAEGEAAQVTTNIQLV 133 Query: 255 MDVSGSM-----DQSTKDMAKRFYILLYLFLSRTYKNVE-VVYIRHHTQ----------- 297 D SGSM ++ A+R + L H Sbjct: 134 FDASGSMAQRIGGETKIQAARRAMERIIDTLPDNPDLNVGFRVFGHEGDSSEAQKARSCQ 193 Query: 298 -----------AKEVDEHEFFYSQETGGTIVSSALKLMDEVVKERYNPAQWNIYAAQASD 346 K + + Q TG T +S AL+ E + +D Sbjct: 194 STALLVPMQGVNKALLRQQAQAWQPTGWTPISLALQRAGEDFQA---GENVRNVIIMVTD 250 Query: 347 GDNWADDSPLCHEILAKKLLPVVRYYSYIEITRRAHQTLWREYEHLQSTFDNFAMQHIRD 406 G+ P + VR T R A Sbjct: 251 GEETCGGDPCAVAKALAESQAEVRIDVVGFGTTPDVAKTLRCIAENSGGVYTDAQNGDAL 310 Query: 407 QDDIYPVFRELFHK 420 + + + Sbjct: 311 VQTLEELIAATLKR 324 >UniRef50_UPI0000E49761 PREDICTED: similar to mKIAA0177 protein n=1 Tax=Strongylocentrotus purpuratus RepID=UPI0000E49761 Length = 585 Score = 73.8 bits (179), Expect = 1e-11, Method: Composition-based stats. Identities = 21/228 (9%), Positives = 57/228 (25%), Gaps = 25/228 (10%) Query: 213 RKEIAELRAKIERVPFIDTFDLRYKNYEKRPDPSSQAVMFCLMDVSGSMDQSTKDMAKRF 272 + + + + + + L+D S SM + AK+ Sbjct: 108 DIHRPRMWVEQHPENKHSQACMVTFYPDFEASNVEKPEIILLLDCSNSMKEGALKQAKQI 167 Query: 273 YILLYLFLSRTYKNVEVVYIR-----HH-------TQAKEVDEHEFFYSQETGGTIVSSA 320 +L +S V + T G T Sbjct: 168 LLLTLHHMSDDSIFNVVTFGTGFEELFASSQAKTETTILAATTFINQACATQGNTDAWRP 227 Query: 321 LKLMDEVVKERYNPAQWNIYAAQASDGDNWADDSPLCHEILAKKLLPVVRYYSYIEITRR 380 L+ +++ SDG ++ + R +++ + Sbjct: 228 LQS-YFLLRSEGLQN-----LFLVSDGH---INNEQSTLHAIAQNTHS-RVFTFGV-SSS 276 Query: 381 AHQTLWREYEHLQSTFDNFAMQHIRD--QDDIYPVFRELFHKQNATAK 426 A++ L + + F + + ++ + + ++ + Sbjct: 277 ANRHLLKGMARVGCGAYEFFDNNAKSRWENKVKAQLSKAKQPSVSSLE 324 >UniRef50_UPI000155C0BC PREDICTED: hypothetical protein n=1 Tax=Ornithorhynchus anatinus RepID=UPI000155C0BC Length = 2392 Score = 73.8 bits (179), Expect = 1e-11, Method: Composition-based stats. Identities = 41/311 (13%), Positives = 84/311 (27%), Gaps = 44/311 (14%) Query: 139 QNQQRQLTEYKTHRAGYTANGVPANISVVRSLQNSLARRTAMTAGKR---------RELH 189 +N + E G T G A + + L ++ A + + Sbjct: 494 KNDLVKAVENIRQLGGNTDTG--AALDKMLPLFQRARQQRARKVPQHLVVLTDGLSHDSV 551 Query: 190 ALEENLAIISNSEPAQLLEEERLRKEIAELRAKIERVPFIDTFD----LRYKNYE----K 241 N + +E ++ E+ RV ++ FD ++ + + Sbjct: 552 REPAGRLRGDNINVYAIGVKEANHTQLEEIAGSDSRVYYVHNFDSLKDIKNRVVRSICSE 611 Query: 242 RPDPSSQAVMFCLMDVSGSMDQSTKDMAKRFYILLYLFLSRTYKNVEVVYIRHHTQAKE- 300 A + L+D SGS+ + K F + V+V ++ KE Sbjct: 612 EACKEMSADIMFLVDSSGSIGGDNFEKMKTFMKNVVNRTKIGANQVQVGLVQFSDINKEG 671 Query: 301 ----------VDEHEFFYSQETG-GTIVSSALKLMDEVVK-ERYNPAQWNIYAAQASDGD 348 G GT++ AL + + + + +DG Sbjct: 672 FQLNQYDTKTKISDAIDGLSLIGRGTLIGGALTFVSDYFSVSKGARPNVKKFLVLLTDGK 731 Query: 349 NWADDSPLCHEILAKKLLPVVRYYSYIEITRRAHQTLWREYEHLQSTFDNFAMQHIRDQD 408 + D+ + ++ V YS + E + Sbjct: 732 --SQDAVKEAAVALRQ--DGVIIYSVGVFGSEY--SQLEEISGRSDMVFYV------ENF 779 Query: 409 DIYPVFRELFH 419 DI ++ Sbjct: 780 DILKPVEDVLV 790 Score = 73.8 bits (179), Expect = 1e-11, Method: Composition-based stats. Identities = 24/258 (9%), Positives = 60/258 (23%), Gaps = 31/258 (12%) Query: 186 RELHALEENLAIISNSEPAQLLEEERLRKEIAELRAKIERVPFIDTFDLRYK---NYEKR 242 + + ++ + E + + D+ + + Sbjct: 740 AVALRQDGVIIYSVGVFGSEYSQLEEISGRSDMVFYVENFDILKPVEDVLVFGICSPYEV 799 Query: 243 PDPSSQAVMFCLMDVSGSMDQSTKDMAKRFYILLYLFL---SRTYKNVEVVYIRHH---- 295 + ++D SGS+D + ++ K F I L + + Y Sbjct: 800 CKRIEVLDIVFVIDSSGSIDSNEYNIMKAFMIDLVKKADVGKNQVQFGALKYSDFPEVLF 859 Query: 296 -----TQAKEVDEHEFFYSQETGGTIVSSALKLMDEVVKERYN---PAQWNIYAAQASDG 347 + E+ G T + AL + E +DG Sbjct: 860 NLNEFSSKSEIISFIQNDHPRGGSTYTAKALAHSAHLFSESLGSRMHRGVPQVLIVITDG 919 Query: 348 DNWADDSPLCHEILAKKLLPVVRYYSYIEITRRAHQTLWREYEHLQSTFDNFAMQHIRDQ 407 ++ + + + + L F Sbjct: 920 ESHDAHLLNATARALRDK--GILVLAVGIEGANH-EELL-SMAGSTD-RYFFV------- 967 Query: 408 DDIYPVFRELFHKQNATA 425 + + + +F +A+ Sbjct: 968 -ENFEGLKGIFENVSASV 984 Score = 65.7 bits (158), Expect = 3e-09, Method: Composition-based stats. Identities = 32/282 (11%), Positives = 73/282 (25%), Gaps = 27/282 (9%) Query: 161 PANISVVRSLQNSLARRTAMTAGKR---RELHALEENLAIISNSEPAQLLEEERLRKEIA 217 ++ V SL R L++ + + + + Sbjct: 340 RSSEDNVSEAALSLRREGVTVFAVGIEGANETQLDQIASYPREQYVSMVKSYSDMGAYYR 399 Query: 218 ELRAKIER--VPFIDTFDLRYKNYEKRPDPSSQAVMFCLMDVSGSMDQSTKDMAKRFYIL 275 + K+ + + + + + A ++ L+D SGS+ + KRF Sbjct: 400 IFQKKLRNEIQNKVSVASEQTERLKSGCADTEAADIYLLIDGSGSIQVADFQEMKRFLAE 459 Query: 276 LYLFLSRTYKNVEVVYIRHHTQAKEVDE-----------HEFFYSQE-TGGTIVSSALKL 323 + + V +++ + E ++ G T +AL Sbjct: 460 VIGMFNIGPHKVRFGAVQYSHLWEWEFEMDRYSNKNDLVKAVENIRQLGGNTDTGAALDK 519 Query: 324 MDEVVKERY--NPAQWNIYAAQASDGDNWADDSPLCHEILAKKLLPVVRYYSYIEITRRA 381 M + + + + +DG E + + Y+ Sbjct: 520 MLPLFQRARQQRARKVPQHLVVLTDG----LSHDSVREPAGRLRGDNINVYAIGVKEANH 575 Query: 382 HQTLWREYEHLQSTFDNFAMQHIRDQDDIYPVFRELFHKQNA 423 T E S + + + DI + A Sbjct: 576 --TQLEEIA--GSDSRVYYVHNFDSLKDIKNRVVRSICSEEA 613 Score = 62.6 bits (150), Expect = 3e-08, Method: Composition-based stats. Identities = 22/183 (12%), Positives = 47/183 (25%), Gaps = 23/183 (12%) Query: 249 AVMFCLMDVSGSMDQSTKDMAKRFYILLYLFLSRTYKNVEVVYIRHHTQAKEVDEHEFFY 308 A + L+D S S+ + K F + + V V + + + F Sbjct: 997 ADLVFLIDGSTSILEEDFKKMKDFLVTIVNDFDIRPGKVHVGLAQFSHEYRPEFSLIPFR 1056 Query: 309 ------------SQETGGTIVSSALKLMDEVVK---ERYNPAQWNIYAAQASDGDNWADD 353 Q G T++ +AL+ + A +DG + D Sbjct: 1057 DKIEVKNQIGRIQQIFGNTLIGAALRNVGSYFWPDFGSRINAGVQQVLLVLTDGQ--SQD 1114 Query: 354 SPLCHEILAKKLLPVVRYYSYIEITRRAHQTLWREYEHLQSTFDNFAMQHIRDQDDIYPV 413 + + ++ + + S + + + D I Sbjct: 1115 EVAQAAEDLRNKGIDIYSLGVGQV----NDQQLIQIS--GSAKKKLTVDNFSELDKIKKR 1168 Query: 414 FRE 416 Sbjct: 1169 VVR 1171 Score = 54.9 bits (130), Expect = 6e-06, Method: Composition-based stats. Identities = 25/205 (12%), Positives = 53/205 (25%), Gaps = 28/205 (13%) Query: 237 KNYEKRPDPSSQAVMFCLMDVSGSMDQSTKDMAKRFYILLYLFLSRTYKNVEVVYIRHHT 296 + + A + L+D S ++ K F + L + ++ Sbjct: 4 RTSVNQDAAPEFADVVFLVDSSDNLGNKAFPFVKTFVNKMINALPIEASKYRIALAQYSD 63 Query: 297 QAK------------EVDEHEFFYSQETGGTI-VSSALKLMDEV----VKERYNPAQWNI 339 + H GG+ + AL+ + + +P ++ Sbjct: 64 DLHSEFQLNTFKSKNPMLNHVKKNFAFRGGSPRLGLALQKAHKTYFSGLTNGRDPKRFPP 123 Query: 340 YAAQASDGDNWADDSPLCHEILAKKL-LPVVRYYSYIEITRRAHQTLWREYEHLQSTFDN 398 + S E AK L V+ S + A + + Sbjct: 124 VLVVL-----ASGPSEDDVEAPAKALQRDRVKIISLG--MQAASDRDLKAMAT---PQFD 173 Query: 399 FAMQHIRDQDDIYPVFRELFHKQNA 423 F ++ IR+ P + Sbjct: 174 FLLRTIREVSMFSPNMTRIIEDVVK 198 Score = 54.1 bits (128), Expect = 9e-06, Method: Composition-based stats. Identities = 18/190 (9%), Positives = 49/190 (25%), Gaps = 22/190 (11%) Query: 249 AVMFCLMDVSGSMDQSTKDMAKRFYILLYLFLSRT---YKNVEVVYIRH--------HTQ 297 + L+D S + + K F + + V+Y Sbjct: 226 VDIVFLVDESVNGTDENFEHLKGFLVETIDSFDVKENCMRIGLVMYSNETKLVSRLGTGT 285 Query: 298 AKEVDEHEFFYSQE-TGGTIVSSALKLMDEVVKER----YNPAQWNIYAAQASDGDNWAD 352 K + G + +A+ + + + R + ++ Sbjct: 286 NKSDILQQIDGLSPKAGRALTGAAINVTRKEIFSRGAGSRKSQGVLQITVLIT--HRSSE 343 Query: 353 DSPLCHEILAKKLLPVVRYYSYIEITRRAHQTLWREYEHLQSTFDNFAMQHIRDQDDIYP 412 D+ + + V ++ A++T + ++ D Y Sbjct: 344 DNVSEAALSLR--REGVTVFAVGIEG--ANETQLDQIASYPREQYVSMVKSYSDMGAYYR 399 Query: 413 VFRELFHKQN 422 +F++ + Sbjct: 400 IFQKKLRNEI 409 >UniRef50_A9B6J8 von Willebrand factor type A n=1 Tax=Herpetosiphon aurantiacus ATCC 23779 RepID=A9B6J8_HERA2 Length = 950 Score = 73.8 bits (179), Expect = 1e-11, Method: Composition-based stats. Identities = 30/218 (13%), Positives = 55/218 (25%), Gaps = 40/218 (18%) Query: 234 LRYKNYEKRPDPSSQAVMFCLMDVSGSMDQ----------------STKDMAKRFYILLY 277 L + + ++D SGSMD D+AK Sbjct: 393 LPVNMDVRNRQQRPDIALVFIIDKSGSMDACHCNGGDMAAREGGGTRKIDIAKEAVAQAA 452 Query: 278 LFLSRTYKNVEVVYIRHHTQAKEVDE--------HEFFYSQETGGTIVSSALKLMDEVVK 329 L + K V + E+D+ +G T V S + E ++ Sbjct: 453 AVLGKDDKLGVVTFDDSAHWTIELDKVPSQDDVVAALAPVPPSGQTNVVSGMNAAYEQLR 512 Query: 330 ERYNPAQWNIYAAQASDGDNWADDSPLCHEILAKKLLPVVRYYSYIEITRRAHQTLWREY 389 + +A +DG K + + + A + Y Sbjct: 513 QS---DAKIKHAILLTDGWG-HATDIGSIAENMNKDGITLSVVAAGNGSDNA----LQRY 564 Query: 390 EHLQSTFDNFAMQHIRDQDDIYPVFRELFHKQNATAKG 427 L + ++F ++ A G Sbjct: 565 AELGGGRYY--------PARVMEEVPQIFLQETIQAVG 594 Score = 54.9 bits (130), Expect = 6e-06, Method: Composition-based stats. Identities = 22/145 (15%), Positives = 40/145 (27%), Gaps = 10/145 (6%) Query: 241 KRPDPSSQAVMFCLMDVSGSMDQSTKDMAKRFYILLYLFLSRTYKNVEVVYIRHHTQAKE 300 + P L+D S S+ + ++F + K VV+ + + Sbjct: 57 QVRQPVQNLTTVFLLDSSDSIAPGQRSNNEQFIAQALETMQEGDKAAVVVFGENALVERV 116 Query: 301 VDE----HEFFYSQETGGTIVSSALKLMDEVVKERYNPAQWNIYAAQASDGDNWADDSPL 356 E T +S A++L + PA SDG + L Sbjct: 117 PSEIQRLGTIQSVPIAARTDISEAIQLGLALF-----PADTQKRLVLLSDGGENSG-RAL 170 Query: 357 CHEILAKKLLPVVRYYSYIEITRRA 381 LA++ + Sbjct: 171 EMIPLAQRRNVPIDIVPTGIGQGNP 195 >UniRef50_C3ZZV3 Putative uncharacterized protein (Fragment) n=1 Tax=Branchiostoma floridae RepID=C3ZZV3_BRAFL Length = 2692 Score = 73.8 bits (179), Expect = 1e-11, Method: Composition-based stats. Identities = 20/160 (12%), Positives = 39/160 (24%), Gaps = 19/160 (11%) Query: 245 PSSQAVMFCLMDVSGSMDQSTKDMAKRFYILLYLFLS---RTYKNVEVVYIRHHTQAKEV 301 + + + L+D SGS+ + D+ K F L + V Y T + Sbjct: 1375 RNIELDLVFLLDGSGSVTTANFDIVKEFTRRLANNFDISLADTRVGVVQYSDSPTLEFNL 1434 Query: 302 DEHEFFYSQ---------ETGGTIVSSALKLMD--EVVKERYNPAQWNIYAAQASDGDNW 350 + + GGT A+ + + + +DG Sbjct: 1435 NSFNTNELVDLAIRNIQYQQGGTNTGQAIDFVRVNSFSANNGDRSDVPNVMIVVTDGQ-- 1492 Query: 351 ADDSPLCHEILAKKLLPVVRYYSYIEITRRAHQTLWREYE 390 S + Y+ + Sbjct: 1493 --SSDDVVGPAQTARNAGISMYAVGIGNG-VDTNELLQIA 1529 Score = 73.0 bits (177), Expect = 2e-11, Method: Composition-based stats. Identities = 25/196 (12%), Positives = 45/196 (22%), Gaps = 22/196 (11%) Query: 235 RYKNYEKRPDPSSQAVMFCLMDVSGSMDQSTKDMAKRFYILLYLFL---SRTYKNVEVVY 291 +Y +P + L+D SGS+ + K F + + + V Y Sbjct: 285 QYYTPCLVRNPE--FDLIFLLDESGSIGTDNFKLVKSFTERMANNFDISPNSTRVGVVQY 342 Query: 292 IRHHT--------QAKEVDEHEFFYSQETGG-TIVSSALKLMD--EVVKERYNPAQWNIY 340 K GG T +A+ + E + Sbjct: 343 SNFPGTEFSLNAFTDKAAVLDAISKIDYNGGSTFTGAAIDFVRNNEFTSVNGDRDDVPNI 402 Query: 341 AAQASDGDNWADDSPLCHEILAKKLLPVVRYYSYIEITRRAHQTLWREYEHLQSTFDNFA 400 +DG+ D S + Y+ Q + Sbjct: 403 LIVITDGNPNDDVSGPA----ISANNAGITTYAVGIG-SNVDQANLVQMT-AGRPGRVLQ 456 Query: 401 MQHIRDQDDIYPVFRE 416 D + +E Sbjct: 457 AADFTDLTTVVGTLQE 472 Score = 66.8 bits (161), Expect = 1e-09, Method: Composition-based stats. Identities = 26/162 (16%), Positives = 48/162 (29%), Gaps = 19/162 (11%) Query: 247 SQAVMFCLMDVSGSMDQSTKDMAKRFYILLYLFLSRTYKNVEVVYIRHHTQ--------- 297 + + L+D SGS+ + D+ K F + + V +++ Q Sbjct: 1095 NGTDLVFLLDGSGSVGSNNFDLVKTFTKNVVQNFDISETATRVAVVQYSDQFSTEFSLNA 1154 Query: 298 --AKEVDEHEFFYSQE-TGGTIVSSALKLMDEVVKERYN--PAQWNIYAAQASDGDNWAD 352 K + TGGT A+ + + V + + +DG + D Sbjct: 1155 FSTKTEVYNAIDNISYLTGGTFTGFAIDFVMQSVFTSISGERDGYPDLLVVVTDGLSTDD 1214 Query: 353 DSPLCHEILAKKLLPVVRYYSYIEITRRAHQTLWREYEHLQS 394 S A+ V Y+ + L S Sbjct: 1215 VSGPADTARAQ----GVTIYAVGVG-SDIDFNTLEQIAGLTS 1251 Score = 65.3 bits (157), Expect = 4e-09, Method: Composition-based stats. Identities = 19/174 (10%), Positives = 46/174 (26%), Gaps = 20/174 (11%) Query: 236 YKNYEKRPDPSSQAVMFCLMDVSGSMDQSTKDMAKRFYILLYLFLSRTY---KNVE---- 288 + L+D S S+ S ++ K F + + Sbjct: 1 APTPAPECTQFGGVDLVFLLDGSASVGASNFELVKDFTQQTTAKFDISDGSTRVAVAQYS 60 Query: 289 ----VVYIRHHTQAKEVDEHEFFYSQE--TGGTIVSSALKLMDEVVKERYNPAQWNI--Y 340 V + + + + T A++ + + +N A+ + Sbjct: 61 STPQVEFNLNTNSDVDTLSNAIEQITYMNGDSTFTGFAIEFVRQSAFSSFNGARDDKPDI 120 Query: 341 AAQASDGDNWADDSPLCHEILAKKLLPVVRYYSYIEITRRAHQTLWREYEHLQS 394 +DG + DS A++ V ++ T + ++ Sbjct: 121 MVVVTDGQ--SADSVTSSAATAREQ--GVTMFAVGVGTG-VGLSELQDIAGYTD 169 Score = 61.1 bits (146), Expect = 8e-08, Method: Composition-based stats. Identities = 24/155 (15%), Positives = 49/155 (31%), Gaps = 18/155 (11%) Query: 247 SQAVMFCLMDVSGSMDQSTKDMAKRFYILLYLFLSRTYKNVEVVYIRH-HTQAKEVDEHE 305 A + L+D SGS+ + K F + + + V +++ T E +E Sbjct: 552 PGADLVFLLDGSGSIGTDNFQLVKAFTKEVIRNFAISPTATRVGLLQYSDTIDNEFFMNE 611 Query: 306 FFYSQE-----------TGGTIVSSALKLMDEVVKERYN--PAQWNIYAAQASDGDNWAD 352 F E TGGT A++ ++ + +DG++ Sbjct: 612 FNTRDELYTAVDNVVYKTGGTFTGFAVEFTRQIAFRTSAGTRDNYPDILIVVTDGNSEDV 671 Query: 353 DSPLCHEILAKKLLPVVRYYSYIEITRRAHQTLWR 387 +A + + Y+ + +L Sbjct: 672 ----VTSAVASAIDQGILIYAVGVGSNVDFASLLE 702 Score = 59.9 bits (143), Expect = 2e-07, Method: Composition-based stats. Identities = 24/196 (12%), Positives = 53/196 (27%), Gaps = 34/196 (17%) Query: 237 KNYEKRPDPSSQAVMFCLMDVSGSMDQSTKDMAKRFYILLYLFLS---RTYKNVEVVY-- 291 + + L+D SGS+ D+ K F + + V + Sbjct: 1623 RAPVCTDLSFGGLDLVFLLDGSGSVTAVNFDLVKDFASGVVSEFQISTTETRVGVVQFSD 1682 Query: 292 --------IRHHTQAKEV-DEHEFFYSQETGGTIVSSALKLMDEVVKERYNPAQWNIY-- 340 T+ + + + Y Q G T+ +A+ ++ N Sbjct: 1683 TLRTEFFMSSFSTKQQVLQAISDIDYIQ--GNTLTGAAITFAT---ASSFSTPAGNRANF 1737 Query: 341 ---AAQASDGDNWADDSPLCHEILAKKLLPVVRYYSYIEITRRAHQTLWREYEHLQSTFD 397 +DG + DS + A+ + ++ + Sbjct: 1738 PDFMIVVTDG--LSQDSVVQPAQSARDQ--GITIFAVGVGN-EVDFATLLQIT----GVP 1788 Query: 398 NFAMQHIRDQDDIYPV 413 + +Q + D D+ Sbjct: 1789 EYILQ-VTDFSDLLAA 1803 Score = 59.1 bits (141), Expect = 3e-07, Method: Composition-based stats. Identities = 19/149 (12%), Positives = 46/149 (30%), Gaps = 16/149 (10%) Query: 247 SQAVMFCLMDVSGSMDQSTKDMAKRFYILLYLFLSRTYKNVEVVYIRHHTQAKEVDEHEF 306 + M L+D SGS+ Q ++ K+F + + + + V +++ + F Sbjct: 1891 TPVDMVFLLDGSGSVTQPNFELVKQFTQNVVVNFNISSATTRVGLVQYSDTIRT---EFF 1947 Query: 307 FYSQETGGTIVSSALKLMDEVVKERYNPAQWNI-----YAAQASDGDNWADDSPLCHEIL 361 S + T+ +A+ + ++ N +DG + D + Sbjct: 1948 LNSHPSRNTLTGAAIDFVRT---SSFSIPAGNRLTLPDVLVVVTDG--LSQDDVAGPAQI 2002 Query: 362 AKKLLPVVRYYSYIEITRRAHQTLWREYE 390 A+ + + Sbjct: 2003 ARDNGIAIYAVGIG---SEVDFATLLDIA 2028 Score = 56.8 bits (135), Expect = 2e-06, Method: Composition-based stats. Identities = 17/158 (10%), Positives = 42/158 (26%), Gaps = 19/158 (12%) Query: 247 SQAVMFCLMDVSGSMDQSTKDMAKRFYILLYLFLSRTYKNVEVVYIRHHTQAKEVDEHEF 306 + + L+D S S+ + K F + + + V +++ + Sbjct: 2247 TPVDLVFLLDGSSSITSPNFQIVKDFTADVVRTFNVSSAATNVGLVQYSDTIRTEFFLNS 2306 Query: 307 FYSQET------------GGTIVSSALKLMDEVVKERYNPAQWNI--YAAQASDGDNWAD 352 F ++ G T +A+ + + N Y +DG + Sbjct: 2307 FDTKSGVLNAIGNIGYLQGNTRTGAAIDFVRISSFSVPAGNRGNQPDYLIVVTDG--LSQ 2364 Query: 353 DSPLCHEILAKKLLPVVRYYSYIEITRRAHQTLWREYE 390 D + A+ + ++ Sbjct: 2365 DDVVVPAQTARN--DGISIFAVGIG-SEIDFATLLNIA 2399 Score = 54.1 bits (128), Expect = 9e-06, Method: Composition-based stats. Identities = 19/158 (12%), Positives = 39/158 (24%), Gaps = 19/158 (12%) Query: 247 SQAVMFCLMDVSGSMDQSTKDMAKRFYILLYLFLSRTYKNVEVVYIRH-HTQAKEVDEHE 305 + + L+D S S+ + K F + + + V +++ T E + Sbjct: 2506 TPVDLVFLLDGSSSITSPNFQIVKDFTADVVRTFNVSSAATNVGLVQYSDTIRTEFFLNS 2565 Query: 306 FFYSQE-----------TGGTIVSSALKLMDEVVKERYNPAQWNI--YAAQASDGDNWAD 352 F E G T +A+ + + N Y +DG Sbjct: 2566 FDTKSEVLNAIGNIGYLQGNTRTGAAIDFVRISSFSVPAGNRGNQPDYLIVVTDG----L 2621 Query: 353 DSPLCHEILAKKLLPVVRYYSYIEITRRAHQTLWREYE 390 + ++ T Sbjct: 2622 SQDEVLGPAQTARFEGINIFAVGIGN-EIDFTTLLHIA 2658 Score = 52.6 bits (124), Expect = 3e-05, Method: Composition-based stats. Identities = 21/158 (13%), Positives = 48/158 (30%), Gaps = 21/158 (13%) Query: 245 PSSQAVMFCLMDVSGSMDQSTKDMAKRFYILLYLFLS---RTYKNVEVVYIRHHTQAKEV 301 S + + L+D S S+ + ++ K F S + V + T+ ++ Sbjct: 833 RSVEFDLVFLVDKSSSVGPANFELVKEFMYDFTNTFSVGLSDTRIGAVQFADAQTKDFDM 892 Query: 302 DEHEFFYSQ-ET-----------GGTIVSSALKLMDEVVKERYNPAQWNI--YAAQASDG 347 D GG +A+ + + R N + ++ + Sbjct: 893 DTFATKEQTLAGIQNIVYTDNQVGGVATGAAIDFVRQNSYTRGNGDRTSVPDLLVVVT-- 950 Query: 348 DNWADDSPLCHEILAKKLLPVVRYYSYIEITRRAHQTL 385 + + D + A+K + Y+ + L Sbjct: 951 SSASTDDVASAQETAEK--EGITIYTVGVTNSVSFAEL 986 >UniRef50_A1S120 von Willebrand factor, type A n=1 Tax=Thermofilum pendens Hrk 5 RepID=A1S120_THEPD Length = 305 Score = 73.8 bits (179), Expect = 1e-11, Method: Composition-based stats. Identities = 30/225 (13%), Positives = 51/225 (22%), Gaps = 21/225 (9%) Query: 185 RRELHALEENLAIISNSEPAQLLEEERLRKEIAELRAKIERVPFIDTFDLRYKNYEKRPD 244 RR E + P L ++ + E Sbjct: 26 RRAKRVQEGIRRALPGELPGLTPRLALASAIALLLVLAASEPRVVEVKRVELPLEEAYKL 85 Query: 245 PSSQAVMFCLMDVSGSMDQSTKD-----MAKRFYILLYLFLSRTYKNVEVVYIRHHTQAK 299 + ++D S SM + +AKRF L + V V + Sbjct: 86 SGVPTLHVVVLDESKSMLSADVYPDRCTLAKRFAEKYIEGLQPSDLVVLVFFSSSSNATG 145 Query: 300 EVDEHEFFYSQETGG-------TIVSSALKLMDEVVKERYNPAQWNIYAAQASDGDNWAD 352 + + G T + AL V+ P SDG Sbjct: 146 PLPRD--NALRALDGYTCRYRYTALGDALVSAYSVLAASGLPGA----VVVVSDGGWNYG 199 Query: 353 DSPLCHEILAKKLLPVVRYYSYIEITRRAHQTLWREYEHLQSTFD 397 PL + + + +L R+ + Sbjct: 200 SDPLQVAQSIRASN---YSLVLVRVGGDPRGSLMRDVASKANGKY 241 >UniRef50_A5UXL9 Peptidase M23B n=2 Tax=Roseiflexus RepID=A5UXL9_ROSS1 Length = 982 Score = 73.8 bits (179), Expect = 1e-11, Method: Composition-based stats. Identities = 12/97 (12%), Positives = 21/97 (21%), Gaps = 5/97 (5%) Query: 309 SQETGGTIVSSALKLMDEVVKERYNPAQWNIYAAQASDGDNWADDSPLCHEILAKKLLPV 368 G T + L+ +++ SDG + Sbjct: 570 LTPRGSTSIGGGLQRSQQLLSASAP--GRTRAIVLLSDGQENTAPYVSDVLPQIRASQIT 627 Query: 369 VRYYSYIEITRRAHQTLWREYEHLQSTFDNFAMQHIR 405 V A Q L N+A + + Sbjct: 628 VHTIGLGT---DADQQLMLSIAAQTGGTYNYAPRPDQ 661 >UniRef50_UPI00016C44B4 BatA n=2 Tax=Gemmata obscuriglobus UQM 2246 RepID=UPI00016C44B4 Length = 317 Score = 73.8 bits (179), Expect = 1e-11, Method: Composition-based stats. Identities = 23/204 (11%), Positives = 48/204 (23%), Gaps = 28/204 (13%) Query: 243 PDPSSQAVMFCLMDVSGSM-----DQSTKDMAKRFYILLYLFLSRTYKNVEVVYIRHHTQ 297 S + +DVSGSM D + D + + F + + Sbjct: 91 QQKRSLTNIQFAVDVSGSMLAPFGDGNRYDASMKAIDTFLDF-RKGDAFGL-TFFGDAFV 148 Query: 298 AKEVDEHEFFYSQET-------------GGTIVSSALKLMDEVVKERYNPAQWNIYAAQA 344 + + + GGT ++ AL ++ R + + Sbjct: 149 HWVPLTTDVTAIRCSPPFMRPETVPPPFGGTAIAKALNGCKTELRRR---DEGDKMIVLI 205 Query: 345 SDGDNWADD-SPLCHEILAKKLLPVVRYYSYIEITRRAHQTLWREYEHLQSTFDNFAMQH 403 +DG ++ + V + I + L F Sbjct: 206 TDGFSYDLTGNDEEIARTL--SAEGVAVFCIIVGGFEPQAEIV-NICRLTGGEA-FRADD 261 Query: 404 IRDQDDIYPVFRELFHKQNATAKG 427 ++ + Q Sbjct: 262 PDALPAVFKKIDTMKQAQLKPTIA 285 >UniRef50_Q2J8W6 von Willebrand factor, type A n=2 Tax=Actinomycetales RepID=Q2J8W6_FRASC Length = 534 Score = 73.8 bits (179), Expect = 1e-11, Method: Composition-based stats. Identities = 35/276 (12%), Positives = 60/276 (21%), Gaps = 39/276 (14%) Query: 168 RSLQNSLARRTAMTAGKRRELHALEENLAIISNSEPAQLLEEERLRKEIAELRAKIERVP 227 R + L R R H L+ + L R Sbjct: 267 RPDKRDLYDRVVSWLRLPRTQHRLQTATSRRPALPGVALDARFPTRTLTEL---PFPASR 323 Query: 228 FIDTFDLRYKNYEKRPDPSSQAVMFCLMDVSGSMDQSTKDMAKRFYILLY---------- 277 + L + L+DVSGSM S + L Sbjct: 324 QVADQLL----SAYLDRFRRPSHAIFLLDVSGSMAGSRIAALQAALRGLTGADDTLSGRF 379 Query: 278 LFLSRTYKNVEVVYIRHHTQAKEVDEHEF--------------FYSQETGGTIVSSALKL 323 K + + + ++ + GT + SAL+ Sbjct: 380 ARFRGREKITMITFAGRANDPVDFAVNDPRPGSADLAGVNTFVDGLRLQDGTAIYSALEA 439 Query: 324 MDEVVKERYNPA-QWNIYAAQASDGDNWADDSPLCHEILAKKL---LPVVRYYSYIEITR 379 + +DG+N + S ++L VR ++ Sbjct: 440 GYRAAGAAVEADPGYLTSIVLMTDGENNSGISAADFRSSYQRLPAAARAVRTFTIAFG-- 497 Query: 380 RAHQTLWREYEHLQSTFDNFAMQHIRDQDDIYPVFR 415 A R+ D + R Sbjct: 498 EADPAALRDISADTGG--AVFDARTSSLADAFKDIR 531 >UniRef50_A1BJ62 von Willebrand factor, type A n=1 Tax=Chlorobium phaeobacteroides DSM 266 RepID=A1BJ62_CHLPD Length = 337 Score = 73.8 bits (179), Expect = 1e-11, Method: Composition-based stats. Identities = 32/278 (11%), Positives = 59/278 (21%), Gaps = 47/278 (16%) Query: 182 AGKRRELHALEENLAIISNSEPAQLLEEERLRKEIAELRAKIERVPFIDTFDLRYKNYEK 241 R+ A +++ + + A + L Sbjct: 25 RVTRQRQARELLADADLADRIMPGGDLRILFVRWLLLFLATTLFLFAFCGPQL---CRGS 81 Query: 242 RPDPSSQAVMFCLMDVSGSM-----DQSTKDMAKRFYILLYLFLSRTYKNVEVVYIRHHT 296 RP + ++DVS SM AK + + L + +++ Sbjct: 82 RPVERKGVDVLFMLDVSNSMLVADVSPDRLTRAKSGILRISKGLRDG-RQALLLFAGSPL 140 Query: 297 QAKEVDEHEFF----------YSQETGGTIVSSALKLMDEVVKERYNPAQWN-----IYA 341 + GT SAL L + + P Sbjct: 141 VQCPMTTDHAAFEALLGMVSTELVSDQGTAFDSALNLAMRLFERTEPPGDVKEVQGEKVI 200 Query: 342 AQASDGDNWADDSPLCHEILAKKLLPVVRYYSYI----------------------EITR 379 SDG+N + + K+ V + Sbjct: 201 VLLSDGENHSG-NFRAVADALKQSGVSVFTIVLGKPLPAAIPLGQSSGVKKDAAGKIVKT 259 Query: 380 RAHQTLWREYEHLQSTFDNFAMQHIRDQDDIYPVFREL 417 R+ R A + D + L Sbjct: 260 RSSPETMRRLAGDSGGTFFDASEDDAVYDRVAERISTL 297 >UniRef50_Q2FLV5 Protoporphyrin IX magnesium-chelatase n=1 Tax=Methanospirillum hungatei JF-1 RepID=Q2FLV5_METHJ Length = 619 Score = 73.8 bits (179), Expect = 1e-11, Method: Composition-based stats. Identities = 38/383 (9%), Positives = 89/383 (23%), Gaps = 39/383 (10%) Query: 49 GESVSIPTEDISEPMFHQGRGGLRHRVHPGNDHFVQNDRIERPQGGGGGSGSGQGQASQD 108 E + +DIS + + R + P Sbjct: 257 NERDQVSVDDISATLQVSLQHRRRDYQKEPQSSHDTDKNKSSPPEPENKKDGDGLPQDNR 316 Query: 109 GEGQDEFVFQISKDEYLDLLFEDLALPNLKQNQQRQLTEYKTHRAGYTANGVPANISVVR 168 E D+ Q + + + +++ + + + + G + Sbjct: 317 HEEGDQNESQQKQIPQAPAPDQVFTIGDIQFTKDLIVQKKNKNFHSMVKTGKKGS----H 372 Query: 169 SLQNSLARRTAMTAGKRRELHALEENLAIISNSEPAQLLEEERLRKEIAELRAKIERVPF 228 + R + + + R + Sbjct: 373 QKISDSGRYFRSKNPSGKI----------------YDIAFDATFRAAAPHQITRSNGTLA 416 Query: 229 IDTFDLRYKNYEKRPDPSSQAVMFCLMDVSGSMDQSTKDMAKR--FYILLYLFLSRTYKN 286 ++ + K S + ++D SGSM + + A + LL + Sbjct: 417 LNISVQDIRV--KERKRKSGRTIIFVVDSSGSMGAAKRMSAVKGAVLSLLKDAYINRDQV 474 Query: 287 VEVVYIRH-------HTQAKEVDEHEFFYSQETGGTIVSSALKLMDEVVKERYNPAQWN- 338 + + T++ H+ + G T +SS + +++ + Sbjct: 475 ALISFRGPGAEVLLKPTRSGMTAYHQLAHLPTGGQTPLSSGIYTTVSLIRTIRRKNSHDE 534 Query: 339 IYAAQASDGDNWADDSPLC-----HEILAKKLLPVVRYYSYIEITRRAHQTLWREYEHLQ 393 + SDG S A YY L ++ Sbjct: 535 PFVIIISDGRANHARSDNDPVAEAWMAAAAARNEKAHYYVIDTECGYPRFFLAKKLAEHI 594 Query: 394 STFDNFAMQHIRDQDDIYPVFRE 416 + +I + R+ Sbjct: 595 GGTYTLLDTMNGE--EIVQLVRQ 615 >UniRef50_C3YY34 Putative uncharacterized protein (Fragment) n=2 Tax=Branchiostoma floridae RepID=C3YY34_BRAFL Length = 193 Score = 73.8 bits (179), Expect = 1e-11, Method: Composition-based stats. Identities = 28/190 (14%), Positives = 54/190 (28%), Gaps = 26/190 (13%) Query: 245 PSSQAVMFCLMDVSGSMDQSTKDMAKRFYILLYLFLSRT---YKNVEVVYIRHHTQAKEV 301 Q M ++D SGS+ K+F + L L+ + + + Y T ++ Sbjct: 1 KHQQLDMVFVLDGSGSIQAVNFAKVKKFAVDLSDGLNISPTATRVGLIEYTDSPTVEFKL 60 Query: 302 DEHEFFYSQE---------TGGTIVSSALKLMDEVVKERYNPAQWN-----IYAAQASDG 347 +H S +GGT AL + R P A +DG Sbjct: 61 ADHTNKASLATAINNVSYQSGGTQTGRALDAARTQMDWRQPPVPNVCFSLLQAAIVVTDG 120 Query: 348 DNWADDSPLCHEILAKKLLPVVRYYSYIEITRRAHQTLWREYEHLQSTFDNFAMQHIRDQ 407 + D+ + + Y + E + + + Sbjct: 121 M--SGDNVQQPAKALRDN--DISAYGVGIG-PAINANELNEIAGGDAGHVFYIPNY---- 171 Query: 408 DDIYPVFREL 417 D + ++ Sbjct: 172 DKLEKEMEKI 181 >UniRef50_C3B4T5 D-amino acid dehydrogenase, large subunit n=3 Tax=Bacillus RepID=C3B4T5_BACMY Length = 474 Score = 73.8 bits (179), Expect = 1e-11, Method: Composition-based stats. Identities = 30/271 (11%), Positives = 60/271 (22%), Gaps = 35/271 (12%) Query: 186 RELHALEENLAIISNSEPAQLLEEERL-RKEIAELRAKIERVPFIDTFDLRYKNYEKRPD 244 + E+ + E + + L + + + K Sbjct: 116 KNKSNSEQADLYVHMMYSLLKQEIIPFDKMPLQILEIGRVEDEEKKSNGTKGEQKNKEDR 175 Query: 245 PSSQAVMFCLMDVSGSMDQ-----STKDMAKRFYILLYLFLSRTYKNVEVVYI--RHHT- 296 + L+D SGSM D+AK L VY + Sbjct: 176 --GNYNIEILLDASGSMAGKIDGKMKMDIAKEAIQQFVSDLPEAVNVSLRVYGHKGSNDE 233 Query: 297 ------------------QAKEVDEHEFFYSQETGGTIVSSALKLMDEVVKERYNPAQWN 338 + Q G T ++ A+K E + + Sbjct: 234 KDKTASCGAIENIYTLQKYDQTTFRQSLDGFQPVGWTPLAEAIKKSTETFQSAKENDKN- 292 Query: 339 IYAAQASDGDNWADDSPLCHEILAKKLLPV--VRYYSYIEITRRAHQTLWREYEHLQSTF 396 SDG +P+ + + + +E + Sbjct: 293 -IIYIVSDGVETCGGNPVEEAQKVSNSNIKPIMNIIGFQV--DHEAEKQLKEIAEVSKGK 349 Query: 397 DNFAMQHIRDQDDIYPVFRELFHKQNATAKG 427 A QD +++ ++ Sbjct: 350 YVLANSAKELQDQFKETGKDITSRRLKAGGA 380 >UniRef50_A3ZTC3 Putative uncharacterized protein n=1 Tax=Blastopirellula marina DSM 3645 RepID=A3ZTC3_9PLAN Length = 346 Score = 73.4 bits (178), Expect = 1e-11, Method: Composition-based stats. Identities = 23/168 (13%), Positives = 47/168 (27%), Gaps = 20/168 (11%) Query: 244 DPSSQAVMFCLMDVSGSMDQSTKDMAKRFYILLYLFLSRTYKNVEVVY----IRHHTQAK 299 ++ + D SGSM D K + L R + + + + + Sbjct: 185 LRAAGRRFCIIADCSGSMSGVKLDYVKEEILETVSSLPREAQFQVIFFQSQAVPFPQKGW 244 Query: 300 EVDEHEFFYSQ-------ETGGTIVSSALKLMDEVVKERYNPAQWNIYAAQASDGDNWAD 352 + +F GGT A ++ + P +DG + D Sbjct: 245 RHPKRDFNALSEWLKTVGPAGGTNPLPAFEIALKF---SPRPDA----VFFMTDG-LFDD 296 Query: 353 DSPLCHEILAKKLLPVVRYYSYIEITRRAHQTLWREYEHLQSTFDNFA 400 + + P V+ ++ R+ + L R+ Sbjct: 297 NVVGEVKRQNDLSEPKVKVHAISF-MDRSAEPLMRQIAGESGGEYRHV 343 Database: uniref50.fasta Posted date: Mar 8, 2010 10:38 AM Number of letters in database: 1,040,396,356 Number of sequences in database: 3,077,464 Lambda K H 0.307 0.116 0.256 Lambda K H 0.267 0.0357 0.140 Matrix: BLOSUM62 Gap Penalties: Existence: 11, Extension: 1 Number of Hits to DB: 1,366,496,962 Number of Sequences: 3077464 Number of extensions: 37577030 Number of successful extensions: 455339 Number of sequences better than 1.0e-01: 1000 Number of HSP's better than 0.1 without gapping: 954 Number of HSP's successfully gapped in prelim test: 4355 Number of HSP's that attempted gapping in prelim test: 443455 Number of HSP's gapped (non-prelim): 8889 length of query: 427 length of database: 1,040,396,356 effective HSP length: 131 effective length of query: 296 effective length of database: 637,248,572 effective search space: 188625577312 effective search space used: 188625577312 T: 11 A: 40 X1: 16 ( 7.1 bits) X2: 38 (14.6 bits) X3: 64 (24.7 bits) S1: 41 (21.3 bits) S2: 94 (41.0 bits)