BLASTP 2.2.22 [Sep-27-2009] Reference: Altschul, Stephen F., Thomas L. Madden, Alejandro A. Schaffer, Jinghui Zhang, Zheng Zhang, Webb Miller, and David J. Lipman (1997), "Gapped BLAST and PSI-BLAST: a new generation of protein database search programs", Nucleic Acids Res. 25:3389-3402. Reference for compositional score matrix adjustment: Altschul, Stephen F., John C. Wootton, E. Michael Gertz, Richa Agarwala, Aleksandr Morgulis, Alejandro A. Schaffer, and Yi-Kuo Yu (2005) "Protein database searches using compositionally adjusted substitution matrices", FEBS J. 272:5101-5109. Reference for composition-based statistics starting in round 2: Schaffer, Alejandro A., L. Aravind, Thomas L. Madden, Sergei Shavirin, John L. Spouge, Yuri I. Wolf, Eugene V. Koonin, and Stephen F. Altschul (2001), "Improving the accuracy of PSI-BLAST protein database searches with composition-based statistics and other refinements", Nucleic Acids Res. 29:2994-3005. Query= batch____ (575 letters) Database: uniref50.fasta 3,077,464 sequences; 1,040,396,356 total letters Searching..................................................done Results from round 1 Score E Sequences producing significant alignments: (bits) Value UniRef50_P76481 Uncharacterized protein yfbK n=38 Tax=Enterobact... 1178 0.0 UniRef50_Q4KKB4 von Willebrand factor type A domain protein n=20... 496 e-138 UniRef50_A9IU52 Putative lipoprotein n=3 Tax=Bordetella RepID=A9... 474 e-132 UniRef50_A5WCP1 von Willebrand factor, type A n=2 Tax=Moraxellac... 473 e-132 UniRef50_C6M483 von Willebrand factor type A domain protein n=1 ... 460 e-128 UniRef50_B1KPQ5 von Willebrand factor type A n=8 Tax=Gammaproteo... 441 e-122 UniRef50_A3D1E9 von Willebrand factor, type A n=8 Tax=Shewanella... 440 e-122 UniRef50_A4C5H3 Von Willebrand factor type A domain protein n=1 ... 427 e-118 UniRef50_B0T5X0 von Willebrand factor type A n=1 Tax=Caulobacter... 422 e-116 UniRef50_A9DBX8 Putative uncharacterized protein n=1 Tax=Hoeflea... 421 e-116 UniRef50_C8SEV7 von Willebrand factor type A n=4 Tax=Rhizobiales... 414 e-114 UniRef50_C8N8N5 von Willebrand factor type A domain protein n=2 ... 413 e-113 UniRef50_C5BTM6 von Willebrand factor type A domain protein n=1 ... 412 e-113 UniRef50_C6VVX3 von Willebrand factor type A n=17 Tax=Bacteria R... 412 e-113 UniRef50_A0NVX5 Putative uncharacterized protein n=1 Tax=Labrenz... 412 e-113 UniRef50_C7PNZ7 von Willebrand factor type A n=5 Tax=Bacteroidet... 409 e-112 UniRef50_C6BAR1 von Willebrand factor type A n=7 Tax=Alphaproteo... 405 e-111 UniRef50_C1AC65 Putative uncharacterized protein n=1 Tax=Gemmati... 402 e-110 UniRef50_Q2N8R4 Von Willebrand factor type A domain protein n=2 ... 400 e-109 UniRef50_Q7UP85 Putative uncharacterized protein n=1 Tax=Rhodopi... 399 e-109 UniRef50_B0CCM8 von Willebrand factor, type A domain protein n=4... 397 e-109 UniRef50_Q28U54 von Willebrand factor type A n=3 Tax=Rhodobacter... 397 e-109 UniRef50_A1ZHE2 Von Willebrand factor type A domain protein n=1 ... 396 e-108 UniRef50_B1ZYN3 von Willebrand factor type A n=2 Tax=Bacteria Re... 395 e-108 UniRef50_UPI000185CB41 protein containing von Willebrand factor ... 392 e-107 UniRef50_B8F8Z5 von Willebrand factor type A n=1 Tax=Desulfatiba... 388 e-106 UniRef50_A3ZT14 Putative uncharacterized protein n=1 Tax=Blastop... 387 e-106 UniRef50_A5F9T1 von Willebrand factor, type A n=9 Tax=Bacteria R... 384 e-105 UniRef50_A3PN61 von Willebrand factor, type A n=9 Tax=Rhodobacte... 383 e-104 UniRef50_UPI00016C54C8 hypothetical protein GobsU_21830 n=1 Tax=... 381 e-104 UniRef50_Q21MJ3 von Willebrand factor, type A n=5 Tax=Proteobact... 381 e-104 UniRef50_A4BJU7 Putative uncharacterized protein n=1 Tax=Reineke... 366 1e-99 UniRef50_C0YQB8 von Willebrand factor, type A n=2 Tax=Flavobacte... 366 1e-99 UniRef50_A1ZU67 Von Willebrand factor type A domain protein n=1 ... 354 4e-96 UniRef50_D0XSR4 von Willebrand factor type A n=1 Tax=Brevundimon... 354 7e-96 UniRef50_A9KS19 von Willebrand factor type A n=1 Tax=Clostridium... 351 3e-95 UniRef50_B0UJ22 von Willebrand factor type A n=1 Tax=Methylobact... 351 4e-95 UniRef50_B4WCU1 von Willebrand factor type A domain protein n=2 ... 345 4e-93 UniRef50_C0CVB5 Putative uncharacterized protein n=1 Tax=Clostri... 343 9e-93 UniRef50_UPI0001C375AE von Willebrand factor type A n=1 Tax=Rumi... 337 8e-91 UniRef50_B4DA43 von Willebrand factor type A n=1 Tax=Chthoniobac... 333 1e-89 UniRef50_A6GHE4 von Willebrand factor, type A n=1 Tax=Plesiocyst... 329 2e-88 UniRef50_A7C0I1 von Willebrand factor type A domain protein n=5 ... 326 2e-87 UniRef50_A8SXC8 Putative uncharacterized protein n=2 Tax=Clostri... 317 1e-84 UniRef50_Q1CVN5 von Willebrand factor type A domain protein n=2 ... 310 1e-82 UniRef50_Q1D6F9 von Willebrand factor type A domain protein n=2 ... 308 3e-82 UniRef50_C0Z5D5 Putative uncharacterized protein n=1 Tax=Breviba... 306 1e-81 UniRef50_A9GVG2 Putative uncharacterized protein n=1 Tax=Sorangi... 306 2e-81 UniRef50_C7N770 Uncharacterized protein containing a von Willebr... 301 5e-80 UniRef50_C8WPE6 von Willebrand factor type A n=1 Tax=Eggerthella... 295 2e-78 UniRef50_C7PR69 von Willebrand factor type A n=1 Tax=Chitinophag... 274 7e-72 UniRef50_A9AWD1 von Willebrand factor type A n=1 Tax=Herpetosiph... 268 3e-70 UniRef50_A9NFD9 Surface-anchored VWFA domain protein n=1 Tax=Ach... 258 6e-67 UniRef50_A3TQT5 Putative secreted protein n=1 Tax=Janibacter sp.... 238 4e-61 UniRef50_C1RGW7 Uncharacterized protein containing a von Willebr... 213 2e-53 UniRef50_B4D1N7 Autotransporter-associated beta strand repeat pr... 205 4e-51 UniRef50_D2BAS2 von Willebrand factor type A domain protein n=12... 199 2e-49 UniRef50_A6DLI7 von Willebrand factor type A domain protein n=1 ... 199 2e-49 UniRef50_B5JNR2 von Willebrand factor type A domain protein n=1 ... 169 2e-40 UniRef50_UPI0001912300 conserved protein n=1 Tax=Salmonella ente... 154 1e-35 UniRef50_B1ZQD5 von Willebrand factor type A n=1 Tax=Opitutus te... 154 1e-35 UniRef50_UPI00016986EC hypothetical protein Epers_34925 n=1 Tax=... 153 2e-35 UniRef50_A1ZRP2 Von Willebrand factor type A domain protein n=1 ... 135 4e-30 UniRef50_C6VUB8 von Willebrand factor type A n=1 Tax=Dyadobacter... 126 3e-27 UniRef50_C7PTX8 von Willebrand factor type A n=1 Tax=Chitinophag... 116 3e-24 UniRef50_Q67LZ3 Putative uncharacterized protein n=1 Tax=Symbiob... 110 1e-22 UniRef50_A1ZUW0 Von Willebrand factor, type A n=1 Tax=Microscill... 107 1e-21 UniRef50_B5JCH3 Putative uncharacterized protein (Fragment) n=1 ... 103 2e-20 UniRef50_C1D2W8 Putative uncharacterized protein n=1 Tax=Deinoco... 102 3e-20 UniRef50_A4J6Q3 von Willebrand factor, type A n=1 Tax=Desulfotom... 102 4e-20 UniRef50_Q7ULL3 Putative uncharacterized protein n=1 Tax=Rhodopi... 102 5e-20 UniRef50_Q1D9B7 von Willebrand factor type A domain protein n=3 ... 100 2e-19 UniRef50_A0LPD4 von Willebrand factor, type A n=1 Tax=Syntrophob... 99 5e-19 UniRef50_A9GUK8 Putative uncharacterized protein yfbK n=1 Tax=So... 99 5e-19 UniRef50_B5JPY1 von Willebrand factor type A domain protein n=1 ... 97 1e-18 UniRef50_D2LQW0 von Willebrand factor type A n=1 Tax=Bacillus ce... 97 2e-18 UniRef50_A9FTM1 Putative uncharacterized protein n=1 Tax=Sorangi... 96 3e-18 UniRef50_A7C0I2 von Willebrand factor type A domain protein n=1 ... 96 4e-18 UniRef50_A9F1H2 Family membership n=1 Tax=Sorangium cellulosum '... 94 2e-17 UniRef50_A9V370 Predicted protein n=1 Tax=Monosiga brevicollis R... 93 3e-17 UniRef50_A6GDG5 Putative lipoprotein n=1 Tax=Plesiocystis pacifi... 92 4e-17 UniRef50_C4XPW8 Putative uncharacterized protein n=1 Tax=Desulfo... 89 3e-16 UniRef50_B4W304 von Willebrand factor type A domain protein (Fra... 88 1e-15 UniRef50_A9F2Q0 Putative uncharacterized protein n=1 Tax=Sorangi... 87 2e-15 UniRef50_D0LL92 von Willebrand factor type A n=1 Tax=Haliangium ... 87 2e-15 UniRef50_C1XMC3 Uncharacterized protein containing a von Willebr... 87 2e-15 UniRef50_Q8YW34 All1782 protein n=5 Tax=Cyanobacteria RepID=Q8YW... 86 4e-15 UniRef50_B0SHY6 Anti-sigma factor antagonist n=2 Tax=Leptospira ... 84 1e-14 UniRef50_A9QZI4 von Willebrand factor type A domain protein n=26... 84 1e-14 UniRef50_A6G9E8 von Willebrand factor type A domain protein n=1 ... 84 1e-14 UniRef50_A5UTA6 von Willebrand factor, type A n=6 Tax=Chloroflex... 84 1e-14 UniRef50_D0LNY0 von Willebrand factor type A n=1 Tax=Haliangium ... 82 5e-14 UniRef50_D0LWF9 von Willebrand factor type A n=1 Tax=Haliangium ... 82 5e-14 UniRef50_C7RNW6 von Willebrand factor type A n=1 Tax=Candidatus ... 81 1e-13 UniRef50_A6GAI6 Putative uncharacterized protein n=1 Tax=Plesioc... 81 1e-13 UniRef50_Q2BJ22 Putative uncharacterized protein n=1 Tax=Neptuni... 80 3e-13 UniRef50_Q3A188 von Willebrand factor type A domain protein n=1 ... 79 4e-13 UniRef50_UPI00017450FB von Willebrand factor type A domain prote... 79 4e-13 UniRef50_Q235T9 von Willebrand factor type A domain containing p... 79 5e-13 UniRef50_A3DLZ3 von Willebrand factor, type A n=1 Tax=Staphyloth... 79 6e-13 UniRef50_A0C9G5 Chromosome undetermined scaffold_16, whole genom... 78 9e-13 UniRef50_D0LJL4 Myxococcales GC_trans_RRR domain protein n=1 Tax... 78 9e-13 UniRef50_Q1DFU7 von Willebrand factor type A domain protein n=2 ... 77 1e-12 UniRef50_A6GBY0 Putative uncharacterized protein n=1 Tax=Plesioc... 77 2e-12 UniRef50_A5UYN7 von Willebrand factor, type A n=1 Tax=Roseiflexu... 77 2e-12 UniRef50_A9FM70 Putative uncharacterized protein n=1 Tax=Sorangi... 76 3e-12 UniRef50_A9G8C3 Putative uncharacterized protein n=2 Tax=Sorangi... 76 3e-12 UniRef50_A6G857 Putative uncharacterized protein n=1 Tax=Plesioc... 75 5e-12 UniRef50_A6GC99 Putative uncharacterized protein n=1 Tax=Plesioc... 75 1e-11 UniRef50_A3U9M7 Putative uncharacterized protein n=1 Tax=Croceib... 74 1e-11 UniRef50_UPI00006CAF43 von Willebrand factor type A domain conta... 73 3e-11 UniRef50_Q6ZED8 Slr7060 protein n=2 Tax=Chroococcales RepID=Q6ZE... 73 3e-11 UniRef50_Q8YP40 Alr4360 protein n=15 Tax=Cyanobacteria RepID=Q8Y... 72 5e-11 UniRef50_A0E9B3 Chromosome undetermined scaffold_84, whole genom... 72 6e-11 UniRef50_D2VHB8 Predicted protein n=1 Tax=Naegleria gruberi RepI... 71 1e-10 UniRef50_A6FSG0 Putative uncharacterized protein n=1 Tax=Roseoba... 71 1e-10 UniRef50_Q55874 Uncharacterized protein sll0103 n=16 Tax=Cyanoba... 71 1e-10 UniRef50_D0LP28 von Willebrand factor type A n=1 Tax=Haliangium ... 70 2e-10 UniRef50_Q10RY0 Os03g0142500 protein n=2 Tax=Oryza sativa RepID=... 70 2e-10 UniRef50_B8G546 von Willebrand factor type A n=6 Tax=Chloroflexi... 70 3e-10 UniRef50_Q82LZ6 Putative uncharacterized protein n=1 Tax=Strepto... 70 3e-10 UniRef50_D2VKS7 von Willebrand factor type A domain-containing p... 69 4e-10 UniRef50_D0LD98 von Willebrand factor type A n=1 Tax=Gordonia br... 69 4e-10 UniRef50_C5WYU9 Putative uncharacterized protein Sb01g047460 n=3... 69 5e-10 UniRef50_B9GK57 Predicted protein (Fragment) n=2 Tax=Populus tri... 68 7e-10 UniRef50_A9WKF3 von Willebrand factor type A n=3 Tax=Chloroflexu... 68 8e-10 UniRef50_C1YR26 Uncharacterized protein containing a von Willebr... 68 9e-10 UniRef50_B8F8Z6 von Willebrand factor type A n=1 Tax=Desulfatiba... 67 1e-09 UniRef50_B9SJS6 Protein binding protein, putative n=6 Tax=Magnol... 67 2e-09 UniRef50_Q2GTB7 Putative uncharacterized protein n=1 Tax=Chaetom... 67 2e-09 UniRef50_C5WYV0 Putative uncharacterized protein Sb01g047470 n=3... 67 2e-09 UniRef50_UPI0001926ED6 PREDICTED: similar to inter-alpha trypsin... 67 2e-09 UniRef50_Q7UNM0 Putative uncharacterized protein n=1 Tax=Rhodopi... 66 3e-09 UniRef50_UPI00006CDDCC von Willebrand factor type A domain conta... 66 3e-09 UniRef50_D1HHA4 Whole genome shotgun sequence of line PN40024, s... 65 8e-09 UniRef50_C5YMJ6 Putative uncharacterized protein Sb07g002215 (Fr... 65 9e-09 UniRef50_Q231J4 von Willebrand factor type A domain containing p... 65 9e-09 UniRef50_Q22SJ4 von Willebrand factor type A domain containing p... 64 1e-08 UniRef50_A5B5Z1 Putative uncharacterized protein n=2 Tax=Vitis v... 64 1e-08 UniRef50_A0C5K4 Chromosome undetermined scaffold_150, whole geno... 64 2e-08 UniRef50_A8IJ40 Predicted protein n=1 Tax=Chlamydomonas reinhard... 64 2e-08 UniRef50_Q22ST4 von Willebrand factor type A domain containing p... 64 2e-08 UniRef50_Q9M1S2 Putative uncharacterized protein T5N23_140 n=5 T... 64 2e-08 UniRef50_Q9LN03 T6D22.13 n=2 Tax=Arabidopsis thaliana RepID=Q9LN... 63 2e-08 UniRef50_Q9U7P4 Putative uncharacterized protein (Fragment) n=1 ... 63 2e-08 UniRef50_B7G6X2 Predicted protein n=1 Tax=Phaeodactylum tricornu... 62 7e-08 UniRef50_A8J0D9 Flagellar associated protein n=1 Tax=Chlamydomon... 62 8e-08 UniRef50_A0EHG0 Chromosome undetermined scaffold_97, whole genom... 62 8e-08 UniRef50_Q1YZ74 Inter-alpha-trypsin inhibitor domain protein n=1... 62 8e-08 UniRef50_A6FXN3 Putative uncharacterized protein n=1 Tax=Plesioc... 62 9e-08 UniRef50_B5JQC2 Vault protein inter-alpha-trypsin n=1 Tax=Verruc... 61 9e-08 UniRef50_Q15NW6 Vault protein inter-alpha-trypsin n=1 Tax=Pseudo... 60 2e-07 UniRef50_Q09E12 Inter-alpha-inhibitor H4 heavy chain, putative n... 60 2e-07 UniRef50_Q8LQ58 Os01g0640200 protein n=9 Tax=Poaceae RepID=Q8LQ5... 60 3e-07 UniRef50_Q8H924 Os10g0464900 protein n=3 Tax=Oryza sativa RepID=... 60 3e-07 UniRef50_A0CDA1 Chromosome undetermined scaffold_17, whole genom... 59 5e-07 UniRef50_A5UW94 von Willebrand factor, type A n=2 Tax=Roseiflexu... 59 6e-07 UniRef50_A7RTF3 Predicted protein (Fragment) n=1 Tax=Nematostell... 59 7e-07 UniRef50_A0EFJ5 Chromosome undetermined scaffold_93, whole genom... 58 1e-06 UniRef50_B8HNU4 von Willebrand factor type A n=4 Tax=Chroococcal... 58 1e-06 UniRef50_A6CIG8 Putative uncharacterized protein n=1 Tax=Bacillu... 58 1e-06 UniRef50_A0DZ93 Chromosome undetermined scaffold_7, whole genome... 58 1e-06 UniRef50_B0CG18 von Willebrand factor type A domain protein, put... 57 1e-06 UniRef50_B4VT64 Vault protein inter-alpha-trypsin n=9 Tax=Cyanob... 57 2e-06 UniRef50_C7NQ34 von Willebrand factor type A n=1 Tax=Halorhabdus... 57 2e-06 UniRef50_A0C051 Chromosome undetermined scaffold_14, whole genom... 57 2e-06 UniRef50_D0LR75 Putative uncharacterized protein n=1 Tax=Haliang... 57 2e-06 UniRef50_UPI000174639B hypothetical protein VspiD_18615 n=1 Tax=... 57 2e-06 UniRef50_Q9FF49 Retroelement pol polyprotein-like n=34 Tax=Magno... 57 2e-06 UniRef50_B8BII0 Putative uncharacterized protein n=1 Tax=Oryza s... 57 2e-06 UniRef50_C5FLY1 U-box domain containing protein n=1 Tax=Microspo... 57 3e-06 UniRef50_A6DLI0 Putative uncharacterized protein n=1 Tax=Lentisp... 57 3e-06 UniRef50_B1KDL1 LPXTG-motif cell wall anchor domain protein n=1 ... 56 3e-06 UniRef50_Q113J8 Putative uncharacterized protein n=1 Tax=Trichod... 56 4e-06 UniRef50_A6C9I8 BatB n=2 Tax=Planctomycetaceae RepID=A6C9I8_9PLAN 56 4e-06 UniRef50_C5YHY2 Putative uncharacterized protein Sb07g005010 n=2... 55 5e-06 UniRef50_UPI0000E105CF vault protein inter-alpha-trypsin n=1 Tax... 55 7e-06 UniRef50_B4VXM6 Vault protein inter-alpha-trypsin n=1 Tax=Microc... 55 8e-06 UniRef50_B2AQN8 Predicted CDS Pa_4_3600 n=1 Tax=Podospora anseri... 55 1e-05 UniRef50_Q6ZFR4 Zinc finger (C3HC4-type RING finger) protein fam... 55 1e-05 UniRef50_C7P2A9 von Willebrand factor type A n=2 Tax=Halobacteri... 55 1e-05 UniRef50_C9Q197 Aerotolerance protein BatB n=11 Tax=Prevotella R... 54 1e-05 UniRef50_UPI0001788F71 von Willebrand factor type A n=1 Tax=Geob... 54 1e-05 UniRef50_UPI0001AEB92D inter-alpha-trypsin inhibitor domain-cont... 54 1e-05 UniRef50_C0Z595 Putative uncharacterized protein n=1 Tax=Breviba... 54 1e-05 UniRef50_Q73UD3 UPF0353 protein MAP_3435c n=4 Tax=Mycobacterium ... 54 1e-05 UniRef50_Q21PJ3 von Willebrand factor, type A n=1 Tax=Saccharoph... 54 2e-05 UniRef50_C1I2R0 von Willebrand factor n=1 Tax=Clostridium sp. 7_... 54 2e-05 UniRef50_A0CDA0 Chromosome undetermined scaffold_17, whole genom... 54 2e-05 UniRef50_B4UFP8 von Willebrand factor type A n=1 Tax=Anaeromyxob... 54 2e-05 UniRef50_Q2B7C3 Putative uncharacterized protein n=1 Tax=Bacillu... 54 2e-05 UniRef50_Q10Z89 von Willebrand factor, type A n=1 Tax=Trichodesm... 54 2e-05 UniRef50_B6HQM8 Pc22g11730 protein n=17 Tax=Leotiomyceta RepID=B... 54 2e-05 UniRef50_Q22N58 von Willebrand factor type A domain containing p... 54 2e-05 UniRef50_UPI00006CF36E U-box domain containing protein n=1 Tax=T... 54 2e-05 UniRef50_Q14624 35 kDa inter-alpha-trypsin inhibitor heavy chain... 53 3e-05 UniRef50_A1S752 Inter-alpha-trypsin inhibitor domain protein n=1... 53 3e-05 UniRef50_C7YL43 Putative uncharacterized protein n=2 Tax=Nectria... 53 3e-05 UniRef50_Q24C76 von Willebrand factor type A domain containing p... 53 3e-05 UniRef50_C5WZE3 Putative uncharacterized protein Sb01g019910 n=1... 53 4e-05 UniRef50_A3W9L9 Putative uncharacterized protein n=2 Tax=Erythro... 53 4e-05 UniRef50_Q5LCG5 Aerotolerance-related membrane protein n=25 Tax=... 53 4e-05 UniRef50_B6ZDR6 Voltage dependent calcium channel alpha2d/delta ... 53 4e-05 UniRef50_A3ZR58 Putative uncharacterized protein n=1 Tax=Blastop... 53 4e-05 UniRef50_B4X134 Vault protein inter-alpha-trypsin n=1 Tax=Alcani... 53 4e-05 UniRef50_Q1AYC2 Protoporphyrin IX magnesium-chelatase n=15 Tax=B... 52 5e-05 UniRef50_C1XFF8 Mg-chelatase subunit ChlD n=1 Tax=Meiothermus ru... 52 5e-05 UniRef50_C5Z1W1 Putative uncharacterized protein Sb10g030210 n=1... 52 5e-05 UniRef50_B1HPN0 Putative uncharacterized protein n=2 Tax=Bacilla... 52 5e-05 UniRef50_Q4RF07 Chromosome 13 SCAF15122, whole genome shotgun se... 52 6e-05 UniRef50_A9EV77 Putative uncharacterized protein n=1 Tax=Sorangi... 52 6e-05 UniRef50_B9MS10 von Willebrand factor type A n=2 Tax=Anaerocellu... 52 6e-05 UniRef50_B8AXM1 Putative uncharacterized protein n=1 Tax=Oryza s... 52 7e-05 UniRef50_Q083T9 Vault protein inter-alpha-trypsin domain protein... 52 7e-05 UniRef50_B9XQ17 Vault protein inter-alpha-trypsin domain protein... 51 1e-04 UniRef50_B5JSY7 Inter-alpha-trypsin inhibitor domain protein n=1... 51 1e-04 UniRef50_Q8YNZ7 Alr4412 protein n=6 Tax=Cyanobacteria RepID=Q8YN... 51 1e-04 UniRef50_Q1DE81 von Willebrand factor type A domain protein n=2 ... 51 1e-04 UniRef50_D2V0V6 Predicted protein n=1 Tax=Naegleria gruberi RepI... 51 1e-04 UniRef50_A6WMD3 Vault protein inter-alpha-trypsin domain protein... 50 2e-04 UniRef50_B0TQ23 LPXTG-motif cell wall anchor domain n=1 Tax=Shew... 50 2e-04 UniRef50_C3Y4Z7 Putative uncharacterized protein n=1 Tax=Branchi... 50 3e-04 UniRef50_C1GWG1 von Willebrand factor type A domain containing p... 50 3e-04 UniRef50_B8G7Y1 von Willebrand factor type A n=3 Tax=Chloroflexu... 50 3e-04 UniRef50_B5YMD8 Predicted protein n=1 Tax=Thalassiosira pseudona... 50 3e-04 UniRef50_D2R2I7 von Willebrand factor type A n=2 Tax=Planctomyce... 50 3e-04 UniRef50_B9ML47 YD repeat protein n=1 Tax=Anaerocellum thermophi... 50 3e-04 UniRef50_Q2QSE5 Os12g0431700 protein n=3 Tax=Oryza sativa RepID=... 49 4e-04 UniRef50_B9RR85 Inter-alpha-trypsin inhibitor heavy chain, putat... 49 4e-04 UniRef50_UPI0001745E25 hypothetical protein VspiD_17020 n=1 Tax=... 49 4e-04 UniRef50_UPI0000E4A663 PREDICTED: similar to calcium activated c... 49 5e-04 UniRef50_A9B057 von Willebrand factor type A n=3 Tax=Chloroflexi... 49 5e-04 UniRef50_A3QDW1 Vault protein inter-alpha-trypsin domain protein... 49 5e-04 UniRef50_A9AXC2 von Willebrand factor type A n=6 Tax=Chloroflexi... 49 5e-04 UniRef50_Q25545 Putative uncharacterized protein (Fragment) n=1 ... 49 5e-04 UniRef50_B8HSI1 von Willebrand factor type A n=8 Tax=Cyanobacter... 49 5e-04 UniRef50_Q237Q6 von Willebrand factor type A domain containing p... 49 6e-04 UniRef50_C5CDX4 von Willebrand factor type A n=1 Tax=Kosmotoga o... 49 6e-04 UniRef50_A8ULL3 Putative uncharacterized protein n=1 Tax=Flavoba... 49 6e-04 UniRef50_B4DPQ4 cDNA FLJ60769, highly similar to Inter-alpha-try... 49 6e-04 UniRef50_D2R3Y3 von Willebrand factor type A n=1 Tax=Pirellula s... 49 7e-04 UniRef50_UPI000180C65C PREDICTED: similar to calcium channel, vo... 49 7e-04 UniRef50_Q7S708 Predicted protein n=1 Tax=Neurospora crassa RepI... 49 7e-04 UniRef50_D2RSW3 von Willebrand factor type A n=1 Tax=Haloterrige... 49 7e-04 UniRef50_A8H5J9 LPXTG-motif cell wall anchor domain n=1 Tax=Shew... 48 8e-04 UniRef50_Q09DT2 Inter-alpha-trypsin inhibitor family heavy chain... 48 8e-04 UniRef50_A7RVQ6 Predicted protein n=1 Tax=Nematostella vectensis... 48 0.001 UniRef50_A9B6J8 von Willebrand factor type A n=1 Tax=Herpetosiph... 48 0.001 UniRef50_P79263 Inter-alpha-trypsin inhibitor heavy chain H4 n=3... 48 0.001 UniRef50_Q22SJ7 von Willebrand factor type A domain containing p... 48 0.001 >UniRef50_P76481 Uncharacterized protein yfbK n=38 Tax=Enterobacteriaceae RepID=YFBK_ECOLI Length = 575 Score = 1178 bits (3047), Expect = 0.0, Method: Compositional matrix adjust. Identities = 575/575 (100%), Positives = 575/575 (100%) Query: 1 MRNKNIIMLLMSSLILSGCGPQPENKESQQQQPSTPTEQQVLAAQQAAIKEAEQSAAAAK 60 MRNKNIIMLLMSSLILSGCGPQPENKESQQQQPSTPTEQQVLAAQQAAIKEAEQSAAAAK Sbjct: 1 MRNKNIIMLLMSSLILSGCGPQPENKESQQQQPSTPTEQQVLAAQQAAIKEAEQSAAAAK 60 Query: 61 ALAQQEVQQYSDKQALQGRLQEAPTFARAAKAKATHIANPGTARYQQFDDNPVKQVAQNP 120 ALAQQEVQQYSDKQALQGRLQEAPTFARAAKAKATHIANPGTARYQQFDDNPVKQVAQNP Sbjct: 61 ALAQQEVQQYSDKQALQGRLQEAPTFARAAKAKATHIANPGTARYQQFDDNPVKQVAQNP 120 Query: 121 LATFSLDVDTGSYANVRRFLNQGLLPPPDAVRVEEIVNYFPSDWDIKDKQSIPASKPIPF 180 LATFSLDVDTGSYANVRRFLNQGLLPPPDAVRVEEIVNYFPSDWDIKDKQSIPASKPIPF Sbjct: 121 LATFSLDVDTGSYANVRRFLNQGLLPPPDAVRVEEIVNYFPSDWDIKDKQSIPASKPIPF 180 Query: 181 AMRYELAPAPWNEQRTLLKVDILAKDRKSEELPASNLVFLIDTSGSMISDERLPLIQSSL 240 AMRYELAPAPWNEQRTLLKVDILAKDRKSEELPASNLVFLIDTSGSMISDERLPLIQSSL Sbjct: 181 AMRYELAPAPWNEQRTLLKVDILAKDRKSEELPASNLVFLIDTSGSMISDERLPLIQSSL 240 Query: 241 KLLVKELREQDNIAIVTYAGDSRIALPSISGSHKAEINAAIDSLDAEGSTNGGAGLELAY 300 KLLVKELREQDNIAIVTYAGDSRIALPSISGSHKAEINAAIDSLDAEGSTNGGAGLELAY Sbjct: 241 KLLVKELREQDNIAIVTYAGDSRIALPSISGSHKAEINAAIDSLDAEGSTNGGAGLELAY 300 Query: 301 QQATKGFIKGGINRILLATDGDFNVGIDDPKSIESMVKKQRESGVTLSTFGVGNSNYNEA 360 QQATKGFIKGGINRILLATDGDFNVGIDDPKSIESMVKKQRESGVTLSTFGVGNSNYNEA Sbjct: 301 QQATKGFIKGGINRILLATDGDFNVGIDDPKSIESMVKKQRESGVTLSTFGVGNSNYNEA 360 Query: 361 MMVRIADVGNGNYSYIDTLSEAQKVLNSEMRQMLITVAKDVKAQIEFNPAWVTEYRQIGY 420 MMVRIADVGNGNYSYIDTLSEAQKVLNSEMRQMLITVAKDVKAQIEFNPAWVTEYRQIGY Sbjct: 361 MMVRIADVGNGNYSYIDTLSEAQKVLNSEMRQMLITVAKDVKAQIEFNPAWVTEYRQIGY 420 Query: 421 EKRQLRVEHFNNDNVDAGDIGAGKHITLLFELTLNGQKASIDKLRYAPDNKLAKSDKTKE 480 EKRQLRVEHFNNDNVDAGDIGAGKHITLLFELTLNGQKASIDKLRYAPDNKLAKSDKTKE Sbjct: 421 EKRQLRVEHFNNDNVDAGDIGAGKHITLLFELTLNGQKASIDKLRYAPDNKLAKSDKTKE 480 Query: 481 LAWLKIRWKYPQGKESQLVEFPLGPTINAPSEDMRFRAAVAAYGQKLRGSEYLNNTSWQQ 540 LAWLKIRWKYPQGKESQLVEFPLGPTINAPSEDMRFRAAVAAYGQKLRGSEYLNNTSWQQ Sbjct: 481 LAWLKIRWKYPQGKESQLVEFPLGPTINAPSEDMRFRAAVAAYGQKLRGSEYLNNTSWQQ 540 Query: 541 IKQWAQQAKGEDPQGYRAEFIRLIELADGVTDISQ 575 IKQWAQQAKGEDPQGYRAEFIRLIELADGVTDISQ Sbjct: 541 IKQWAQQAKGEDPQGYRAEFIRLIELADGVTDISQ 575 >UniRef50_Q4KKB4 von Willebrand factor type A domain protein n=20 Tax=Proteobacteria RepID=Q4KKB4_PSEF5 Length = 582 Score = 496 bits (1276), Expect = e-138, Method: Compositional matrix adjust. Identities = 246/468 (52%), Positives = 325/468 (69%), Gaps = 11/468 (2%) Query: 104 RYQQFDDNPVKQVAQNPLATFSLDVDTGSYANVRRFLNQGLLPPPDAVRVEEIVNYFPSD 163 +YQ+ DNP+ VA+ P++TFS DVDTG+YANVRR LNQG LPP AVR+EE+VNYFP D Sbjct: 96 QYQKLPDNPIHSVAEAPVSTFSADVDTGAYANVRRLLNQGSLPPEGAVRLEELVNYFPYD 155 Query: 164 WDIKDKQSIPASKPIPFAMRYELAPAPWNEQRTLLKVDILAKDRKSEELPASNLVFLIDT 223 + + S PF + ELAP+PWN LL++ I A DR EL +NLVFL+D Sbjct: 156 YALPTDGS-------PFGVTTELAPSPWNPHTRLLRIGIKASDRAVAELAPANLVFLVDV 208 Query: 224 SGSMISDERLPLIQSSLKLLVKELREQDNIAIVTYAGDSRIALPSISGSHKAEINAAIDS 283 SGSM E LPL++S+LKLLV +LR+QD +++V YAG+SR+ L SG KA+I AID Sbjct: 209 SGSMDRREGLPLVKSTLKLLVDQLRDQDRVSLVVYAGESRVVLEPTSGRDKAKIRTAIDQ 268 Query: 284 LDAEGSTNGGAGLELAYQQATKGFIKGGINRILLATDGDFNVGIDDPKSIESMVKKQRES 343 L A GST G +G++LAYQ A +GFI GINRILLATDGDFNVG+ D S+++M ++R+S Sbjct: 269 LTAGGSTAGASGIQLAYQMAQQGFIDQGINRILLATDGDFNVGVSDFDSLKAMAAEKRKS 328 Query: 344 GVTLSTFGVGNSNYNEAMMVRIADVGNGNYSYIDTLSEAQKVLNSEMRQMLITVAKDVKA 403 GV+L+T G G NYNE +M ++AD G+GNY+YID L EA+KVL ++ L VAKDVK Sbjct: 329 GVSLTTLGFGVDNYNEHLMEQLADAGDGNYAYIDNLREARKVLVDQLSSTLAVVAKDVKL 388 Query: 404 QIEFNPAWVTEYRQIGYEKRQLRVEHFNNDNVDAGDIGAGKHITLLFELTLNGQKASIDK 463 Q+EFNPA V+EYR +GYE R L+ E F+ND VDAG+IGAG +T L+E+ G+K ++ Sbjct: 389 QVEFNPAQVSEYRLLGYENRALKREDFSNDKVDAGEIGAGHTVTALYEIVPAGEKGWLEP 448 Query: 464 LRYAPDNKLAKSDKTKELAWLKIRWKYPQGKESQLVEFPLGP----TINAPSEDMRFRAA 519 LRYA +S K ELA L++R+K P+G S+L+E P+ ++ A S D+RF AA Sbjct: 449 LRYAQAKAPQQSGKQGELAMLRLRYKAPEGGSSRLIERPISAQQPGSLAAASPDLRFAAA 508 Query: 520 VAAYGQKLRGSEYLNNTSWQQIKQWAQQAKGEDPQGYRAEFIRLIELA 567 VAA+ Q+L+ Y N S + AQ AKG DP G R EF++L+ELA Sbjct: 509 VAAFSQQLKDGRYTGNFSLADTVKLAQGAKGADPYGLRGEFVQLVELA 556 >UniRef50_A9IU52 Putative lipoprotein n=3 Tax=Bordetella RepID=A9IU52_BORPD Length = 582 Score = 474 bits (1221), Expect = e-132, Method: Compositional matrix adjust. Identities = 242/469 (51%), Positives = 323/469 (68%), Gaps = 11/469 (2%) Query: 105 YQQFDDNPVKQVAQNPLATFSLDVDTGSYANVRRFLNQGLLPPPDAVRVEEIVNYFPSDW 164 Y ++ DNPV + P++TF DVDTGSY NVRR LN+G LPPPDAVR EE +NYF + Sbjct: 116 YARYRDNPVVAAQEQPVSTFGADVDTGSYTNVRRLLNEGRLPPPDAVRAEEFINYFDYGY 175 Query: 165 DIKDKQSIPASKPIPFAMRYELAPAPWNEQRTLLKVDILAKDRKSEELPASNLVFLIDTS 224 + P S+ PF++ E++ APWN QR LLK+ I +++PA+NLVFL+DTS Sbjct: 176 ------ATPDSRQQPFSIITEVSAAPWNPQRQLLKIGIQGYRVAPQDIPAANLVFLVDTS 229 Query: 225 GSMISDERLPLIQSSLKLLVKELREQDNIAIVTYAGDSRIALPSISGSHKAEINAAIDSL 284 GSM ++LPLI+ +LK LV +LR QD +AIVTYAG + + L S G KA INAAID L Sbjct: 230 GSMAERDKLPLIKGALKQLVAQLRPQDRVAIVTYAGQASMTLDSTPGDQKARINAAIDEL 289 Query: 285 DAEGSTNGGAGLELAYQQATKGFIKGGINRILLATDGDFNVGIDDPKSIESMVKKQRESG 344 A GSTNGGAGL+LAY QA KGF+KGG+NRILLA+DGDFNVG D + ++ + +QR+ G Sbjct: 290 RAAGSTNGGAGLDLAYAQAAKGFVKGGVNRILLASDGDFNVGATDLEDLKDKIARQRQGG 349 Query: 345 VTLSTFGVGNSNYNEAMMVRIADVGNGNYSYIDTLSEAQKVLNSEMRQMLITVAKDVKAQ 404 + L+T GVG N+N+A+ +++AD GNG+Y Y+D+L EA+KVL ++M L+T+A+DVK Q Sbjct: 350 IALTTLGVGGGNFNDALAMQLADAGNGSYHYLDSLREARKVLAAQMSSTLLTIARDVKIQ 409 Query: 405 IEFNPAWVTEYRQIGYEKRQLRVEHFNNDNVDAGDIGAGKHITLLFELT-LNGQKASIDK 463 +EFNPA V EYR IGYEKR L E FNND VDAG+IGAG ++T L+E+T L A +D Sbjct: 410 VEFNPAVVAEYRLIGYEKRALAREDFNNDRVDAGEIGAGANVTALYEITPLAAGGARLDP 469 Query: 464 LRYAPDNKLAKSDKTKELAWLKIRWKYPQGKESQLVE--FPLGPTINAPSEDMRFRAAVA 521 LRY A + ELA++++R+K P +SQLVE P A ++ MR AA A Sbjct: 470 LRYG--KPAADAGPADELAFVRVRYKLPGASDSQLVEQAVPRADARAAGTDGMRRAAAAA 527 Query: 522 AYGQKLRGSEYLNNTSWQQIKQWAQQAKGEDPQGYRAEFIRLIELADGV 570 A+ Q LRG +YL++ S QI A+ A+G+DP G AE L+E+A G+ Sbjct: 528 AFAQWLRGGKYLDDYSPAQIAALARGARGDDPHGLNAELAALVEMAAGL 576 >UniRef50_A5WCP1 von Willebrand factor, type A n=2 Tax=Moraxellaceae RepID=A5WCP1_PSYWF Length = 571 Score = 473 bits (1217), Expect = e-132, Method: Compositional matrix adjust. Identities = 254/568 (44%), Positives = 357/568 (62%), Gaps = 24/568 (4%) Query: 7 IMLLMSSLILSGCGPQPENKESQQQQPSTPTEQQVLAAQQAAIKEAEQSAAAAKALAQQE 66 ++ ++S +++ C PQ + + + +T E A+Q+ +A A E Sbjct: 23 LLSVLSLSVITACAPQSKITPASDKASTTTLE----TAEQSIQADAAAPVVVMATPAMAE 78 Query: 67 VQQYSDKQALQGRLQEAPTFARAAKAKATHIANPGTARYQQFDDNPVKQVAQNPLATFSL 126 +Q S K R+ P+ ++A Y + + N V ++ AT S+ Sbjct: 79 SRQLS-KMTTNARIMPPPSQG--------YMAPKQQENYAEIEPNAVNATSEQAFATLSI 129 Query: 127 DVDTGSYANVRRFLNQGLLPPPDAVRVEEIVNYFPSDWDIKDKQSIPASKPIPFAMRYEL 186 D DTGSYANVRRFLNQG LPP DAVRVEE++NYF D+ KQ+ PF + E+ Sbjct: 130 DTDTGSYANVRRFLNQGQLPPKDAVRVEELINYFNYDFTAAKKQA-----NAPFLVSTEV 184 Query: 187 APAPWNEQRTLLKVDILAKD--RKSEELPASNLVFLIDTSGSMISDERLPLIQSSLKLLV 244 +PW+ ++KV I A+D ++ P +NLVFL+D SGSM ++++L L +SSLK+L Sbjct: 185 VNSPWHPTNQIVKVGIKAEDLLTAKQKQPPANLVFLVDVSGSMDTEDKLQLAKSSLKMLT 244 Query: 245 KELREQDNIAIVTYAGDSRIALPSISGSHKAEINAAIDSLDAEGSTNGGAGLELAYQQAT 304 K+LR QD+I ++TYAG++++ LPS G+ +I AID+L A GSTNG A ++LAYQQAT Sbjct: 245 KQLRAQDSITLITYAGNTKVVLPSTPGNQTQKILNAIDNLTASGSTNGEAAIKLAYQQAT 304 Query: 305 KGFIKGGINRILLATDGDFNVGIDDPKSIESMVKKQRESGVTLSTFGVGNSNYNEAMMVR 364 + F K GINRIL+ TDGDFNVG+ K + +++ R+ G++LST G G NYN+ MM + Sbjct: 305 EHFKKDGINRILMLTDGDFNVGVSSVKDMLQIIRSNRDKGISLSTLGFGQGNYNDHMMEQ 364 Query: 365 IADVGNGNYSYIDTLSEAQKVLNSEMRQMLITVAKDVKAQIEFNPAWVTEYRQIGYEKRQ 424 +AD GNGNYSYID+LSEA+KVL EM TVAKDVK Q+EFNPA V+E+R IGYE R Sbjct: 365 VADNGNGNYSYIDSLSEAKKVLIDEMSATFNTVAKDVKIQLEFNPAAVSEWRLIGYENRV 424 Query: 425 LRVEHFNNDNVDAGDIGAGKHITLLFELTLNGQKASIDKLRYAPDNKLAKSDKTKELAWL 484 L E FNNDNVDAG++GAGK + LFE+T GQK ++ RY N A S +EL +L Sbjct: 425 LAKEDFNNDNVDAGELGAGKSVVALFEVTPVGQKGLLEPSRY--QNSAAVSGNNRELGFL 482 Query: 485 KIRWKYPQGKESQLVEFPLGPTINAPSEDMRFRAAVAAYGQKLRGSEYLNNTSWQQIKQW 544 KIR+K PQ ++SQL+ FP+ + A S D F AVA YGQ L GS+Y+N+ S+ Q+++ Sbjct: 483 KIRYKAPQAEKSQLLSFPIANRVTAASADTNFALAVAGYGQLLTGSKYVNDLSYSQLQRL 542 Query: 545 AQQAKGE--DPQGYRAEFIRLIELADGV 570 A+ D G R+EFI+L+ LA+ + Sbjct: 543 AKSGAQSPIDSSGSRSEFIKLVSLAEAL 570 >UniRef50_C6M483 von Willebrand factor type A domain protein n=1 Tax=Neisseria sicca ATCC 29256 RepID=C6M483_NEISI Length = 538 Score = 460 bits (1183), Expect = e-128, Method: Compositional matrix adjust. Identities = 233/473 (49%), Positives = 320/473 (67%), Gaps = 16/473 (3%) Query: 102 TARYQQFDDNPVKQVAQNPLATFSLDVDTGSYANVRRFLNQGLLPPPDAVRVEEIVNYFP 161 T RYQ D PVK VAQ P++TFS+DVDTGSYANVRRFL G PP DAVR+EEIVNYFP Sbjct: 66 TERYQDQPDQPVKSVAQEPVSTFSIDVDTGSYANVRRFLTNGEQPPKDAVRIEEIVNYFP 125 Query: 162 SDWDIKDKQSIPASKPIPFAMRYELAPAPWNEQRTLLKVDILAKDRKSEELPASNLVFLI 221 ++ + PFA+ E +PW + L+K+ I A+D ++LP +NLVFL+ Sbjct: 126 YNYPLPTDNR-------PFAVHTETIDSPWQPEAKLIKIGIQAQDTAKKDLPPANLVFLV 178 Query: 222 DTSGSMISDERLPLIQSSLKLLVKELREQDNIAIVTYAGDSRIALPSISGSHKAEINAAI 281 D SGSM + +LPL+Q +L++L ++LR QD + ++TYA + LP SG+ K I +AI Sbjct: 179 DVSGSMDEENKLPLVQKTLRILTQQLRPQDKVTLITYASGEDLVLPPTSGADKETILSAI 238 Query: 282 DSLDAEGSTNGGAGLELAYQQATKGFIKGGINRILLATDGDFNVGIDDPKSIESMVKKQR 341 D L A G+T+G + L++AY+QA K F+ GINRILLATDGDFNVG+ D ++++SMV ++R Sbjct: 239 DKLRAGGATDGESALQMAYEQAQKAFVPNGINRILLATDGDFNVGVSDTETLKSMVAEKR 298 Query: 342 ESGVTLSTFGVGNSNYNEAMMVRIADVGNGNYSYIDTLSEAQKVLNSEMRQMLITVAKDV 401 +SGV+LST G G NYNE MM +IAD G+GNYSYID EA+KVL ++ L TVA+DV Sbjct: 299 KSGVSLSTLGFGMGNYNEDMMEQIADAGDGNYSYIDNEKEAKKVLQQQLTSTLATVAQDV 358 Query: 402 KAQIEFNPAWVTEYRQIGYEKRQLRVEHFNNDNVDAGDIGAGKHITLLFELTLNGQKASI 461 K Q+EFNPA V EYR +GY R LR E FNND VDAGDIGAG +T L+E+ G++ + Sbjct: 359 KIQVEFNPATVKEYRLVGYTNRTLRNEDFNNDRVDAGDIGAGHSVTALYEIIPQGKQGWL 418 Query: 462 DKLRY--APDNKLAKSDKTKELAWLKIRWKYPQGKESQLVE--FPLGPT-INAPSEDMRF 516 ++ RY AP K +K+ E A++K+R+K P K+SQL++ P+G ++ +D Sbjct: 419 EESRYQKAPAAKGSKN----EYAFVKVRYKLPGQKDSQLMQQAVPVGSKPLDQADKDTLL 474 Query: 517 RAAVAAYGQKLRGSEYLNNTSWQQIKQWAQQAKGEDPQGYRAEFIRLIELADG 569 A+Y Q LRG EY SW+ I+ AQ+ +G+DP G ++EF++L+ A G Sbjct: 475 ALTAASYAQALRGGEYNGKLSWRDIENMAQKVQGDDPFGLKSEFLQLVRTAAG 527 >UniRef50_B1KPQ5 von Willebrand factor type A n=8 Tax=Gammaproteobacteria RepID=B1KPQ5_SHEWM Length = 640 Score = 441 bits (1134), Expect = e-122, Method: Compositional matrix adjust. Identities = 223/465 (47%), Positives = 319/465 (68%), Gaps = 15/465 (3%) Query: 111 NPVKQVAQNPLATFSLDVDTGSYANVRRFLNQGLLPPPDAVRVEEIVNYFPSDWDIKDKQ 170 N + + P++TFS+DVDTGSY+ +RR +N G+LP VRVEE++NYF + Sbjct: 154 NGIMVAGEIPVSTFSIDVDTGSYSTLRRSINHGVLPERGTVRVEELINYFAYQY------ 207 Query: 171 SIPASKPIPFAMRYELAPAPWNEQRTLLKVDILAKDRKSEELPASNLVFLIDTSGSMISD 230 P + PF++ ELAP+P+N + LL++ + +++ +L AS LVFL+D SGSM S Sbjct: 208 PAPDAGEQPFSVNTELAPSPYNPHKMLLRIGLKGFEKEKADLGASQLVFLLDVSGSMSSQ 267 Query: 231 ERLPLIQSSLKLLVKELREQDNIAIVTYAGDSRIALPSISGSHKAEINAAIDSLDAEGST 290 ++LPL++++LK+L ++L E D I+IV YAG S + L + G+ I+ A+D L A GST Sbjct: 268 DKLPLLKNALKMLSQQLDEGDRISIVVYAGASGVVLDGVKGNDTLAISQALDKLKAGGST 327 Query: 291 NGGAGLELAYQQATKGFIKGGINRILLATDGDFNVGIDDPKSIESMVKKQRESGVTLSTF 350 NGGAG+ELAYQ A K FI GG+NR++LATDGDFNVG+ D +++E M++++R+ G+ L+T Sbjct: 328 NGGAGIELAYQLAQKHFIAGGVNRVILATDGDFNVGVSDQQALEDMIEEKRKQGIALTTL 387 Query: 351 GVGNSNYNEAMMVRIADVGNGNYSYIDTLSEAQKVLNSEMRQMLITVAKDVKAQIEFNPA 410 G G NYN+ +M ++AD GNG+Y+YIDTL+EA+KVL E+ L+T+AKDVK QIEFNPA Sbjct: 388 GFGQGNYNDHLMEQLADKGNGHYAYIDTLNEARKVLVDEISATLLTIAKDVKVQIEFNPA 447 Query: 411 WVTEYRQIGYEKRQLRVEHFNNDNVDAGDIGAGKHITLLFELT-LNGQKASIDKLRYAPD 469 V+EYR IGYE R L E FNND VDAG+IGAG +T L+EL+ ++ + D LRY D Sbjct: 448 LVSEYRLIGYENRALNREDFNNDKVDAGEIGAGHRVTALYELSFVDSPNQANDVLRYGLD 507 Query: 470 NKLAKSDKTK-ELAWLKIRWKYPQGKE-SQLVEFPL--GPTINA---PSEDMRFRAAVAA 522 K K ++ ELA+LK+R+K P G+E S+L+ +P+ IN S+D RF AAVA Sbjct: 508 IKTGKEKYSRDELAYLKLRYK-PIGQEKSKLISYPVLTSTAINEFAQASDDFRFAAAVAG 566 Query: 523 YGQKLRGSEYLNNTSWQQIKQWAQQAKGEDPQGYRAEFIRLIELA 567 +GQ + S YL++ + ++ AQ A GED GYR EF++L + A Sbjct: 567 FGQLINHSHYLHDMDYAKVSDIAQAAMGEDSFGYRHEFVQLTKTA 611 >UniRef50_A3D1E9 von Willebrand factor, type A n=8 Tax=Shewanella RepID=A3D1E9_SHEB5 Length = 642 Score = 440 bits (1132), Expect = e-122, Method: Compositional matrix adjust. Identities = 244/580 (42%), Positives = 352/580 (60%), Gaps = 28/580 (4%) Query: 5 NIIMLLMSSLILSGCGPQPENKESQQQQPSTPTEQQVLAAQQAAIKEAEQSAAAAKALAQ 64 N LL+ ++ L+ CG + E +Q + QV + +QA +++A + A A Sbjct: 45 NTAALLLVAVSLTACGGKGAEVEHRQAEQQAEQRHQVASQRQAEMRDAAKVEMARVAAPM 104 Query: 65 QEVQQYSDKQALQGRLQEAPTFARAAKAKATHIANP-GTARYQQFDDNPVKQVAQNPLAT 123 Q S A+ G + AP + A P +++Q N + + P++T Sbjct: 105 Q----MSSNGAVMG-MSIAPM-------PRDYAAIPLAQNKFEQQVQNGIMVAGEIPVST 152 Query: 124 FSLDVDTGSYANVRRFLNQGLLPPPDAVRVEEIVNYFPSDWDIKDKQSIPASKPIPFAMR 183 F +DVDTGSYA +RR L +G LP VRVEE++NYF D+ +PA PF++ Sbjct: 153 FFIDVDTGSYATLRRMLREGRLPEKGTVRVEEMLNYFAYDY------PLPAKNAAPFSVT 206 Query: 184 YELAPAPWNEQRTLLKVDILAKDRKSEELPASNLVFLIDTSGSMISDERLPLIQSSLKLL 243 ELAP+P+N+ LL++ + D +L ASNLVFL+D SGSM S ++LPL+Q++LKLL Sbjct: 207 TELAPSPYNDDMMLLRIGLKGYDLPKSQLGASNLVFLLDVSGSMASADKLPLLQTALKLL 266 Query: 244 VKELREQDNIAIVTYAGDSRIALPSISGSHKAEINAAIDSLDAEGSTNGGAGLELAYQQA 303 +L QD ++IV YAG + + L +SG+ + A++ L A GS NGG G+ AYQ A Sbjct: 267 TAQLSAQDKVSIVVYAGAAGVVLDGVSGNDTQTLTYALEQLSAGGSINGGQGITQAYQLA 326 Query: 304 TKGFIKGGINRILLATDGDFNVGIDDPKSIESMVKKQRESGVTLSTFGVGNSNYNEAMMV 363 K FI GINR++LATDGDFNVG+ D + ++++K+++ G+ L+T G G NYN+ +M Sbjct: 327 KKHFIPNGINRVILATDGDFNVGVTDFDDLIALIEKEKDHGIGLTTLGFGLGNYNDQLME 386 Query: 364 RIADVGNGNYSYIDTLSEAQKVLNSEMRQMLITVAKDVKAQIEFNPAWVTEYRQIGYEKR 423 ++AD GNGNY+YIDTL+EA+KVL E+ L T+AKDVK Q+EFNPA V+EYR IGYE R Sbjct: 387 QLADKGNGNYAYIDTLNEARKVLVDELSSTLFTIAKDVKVQVEFNPALVSEYRLIGYENR 446 Query: 424 QLRVEHFNNDNVDAGDIGAGKHITLLFELTL--NGQKASIDKLRYAPDNKLAKSDKT-KE 480 L E FNN VDAG+IGAG +T L+EL G + + DKLRY D + K + KE Sbjct: 447 ALAREDFNNYKVDAGEIGAGHTVTALYELRYVEAGNRMN-DKLRYGVDAQTGKEKYSRKE 505 Query: 481 LAWLKIRWKYPQGKESQLVEFPLG-----PTINAPSEDMRFRAAVAAYGQKLRGSEYLNN 535 +A+LK+R+K P +SQL+ +P+ + S+D RF AAVA GQ L GS YL+ Sbjct: 506 IAFLKLRYKLPAQTQSQLLSYPIRLDQSVKQLEQASDDFRFAAAVAGLGQLLNGSHYLHQ 565 Query: 536 TSWQQIKQWAQQAKGEDPQGYRAEFIRLIELADGVTDISQ 575 + ++ A+ A G+DP GYR EF++L+E A + +Q Sbjct: 566 FDYTKLSLLARSALGDDPFGYRHEFVQLMETAAAIEQSNQ 605 >UniRef50_A4C5H3 Von Willebrand factor type A domain protein n=1 Tax=Pseudoalteromonas tunicata D2 RepID=A4C5H3_9GAMM Length = 608 Score = 427 bits (1099), Expect = e-118, Method: Compositional matrix adjust. Identities = 218/473 (46%), Positives = 308/473 (65%), Gaps = 16/473 (3%) Query: 105 YQQFDDNPVKQVAQNPLATFSLDVDTGSYANVRRFLNQGLLPPPDAVRVEEIVNYFPSDW 164 Y + + NPVKQV P++TFS+DVDTGSY+N RR + G PP DAVR E +NYF + Sbjct: 138 YLKNEQNPVKQVMLEPVSTFSIDVDTGSYSNSRRMIKMGKRPPADAVREEAFINYFDYHY 197 Query: 165 DIKDKQSIPASKPIPFAMRYELAPAPWNEQRTLLKVDILAKDRKSEELPASNLVFLIDTS 224 S P S PF + E+APAPWN QR LLK+ I D + EL A+NLVFL+D S Sbjct: 198 ------SAPKSLETPFNVHTEVAPAPWNNQRQLLKIGIKGFDIEKAELKAANLVFLLDVS 251 Query: 225 GSMISDERLPLIQSSLKLLVKELREQDNIAIVTYAGDSRIALPSISGSHKAEINAAIDSL 284 GSM + ++LPL++SSL +L K+L E D++AIV YAG + + LP+ G+ I+ A+++L Sbjct: 252 GSMNAPDKLPLLKSSLTMLTKQLDENDSVAIVVYAGAAGLVLPATKGNEYQVISNALNNL 311 Query: 285 DAEGSTNGGAGLELAYQQATKGFIKGGINRILLATDGDFNVGIDDPKSIESMVKKQRESG 344 A GSTNG G+ELAYQ A++ F K GINR++LATDGDFNVG+ +++ ++ +R++G Sbjct: 312 SAGGSTNGAQGIELAYQIASQNFKKEGINRVILATDGDFNVGMSSVDALKKLIANKRKTG 371 Query: 345 VTLSTFGVGNSNYNEAMMVRIADVGNGNYSYIDTLSEAQKVLNSEMRQMLITVAKDVKAQ 404 + L+T G G NYN+ +M ++A++GNG ++YIDT++EA+KVL E+ + +AKDVK Q Sbjct: 372 IALTTLGFGQGNYNDGLMEQLANIGNGQHAYIDTINEARKVLVDELSSTMQIIAKDVKIQ 431 Query: 405 IEFNPAWVTEYRQIGYEKRQLRVEHFNNDNVDAGDIGAGKHITLLFELTL-NGQKASIDK 463 +EFNPA V EYR IGY+ R L+ E FNND VDAG++GAG +T L+E+TL N ID Sbjct: 432 VEFNPAQVAEYRLIGYQNRLLKQEDFNNDTVDAGELGAGHTVTALYEITLANSPAKQIDD 491 Query: 464 LRYAPDNKLAK----SDKTKELAWLKIRWKYPQGKESQLVEFPLGPT-----INAPSEDM 514 LRY ++ S ELA++K+R+K P S+L+ + + S+D Sbjct: 492 LRYQTPQQMPTNSNFSSAQDELAYVKLRYKAPNSDVSKLMSQAIFASETQSQFAQASQDF 551 Query: 515 RFRAAVAAYGQKLRGSEYLNNTSWQQIKQWAQQAKGEDPQGYRAEFIRLIELA 567 +F A VA + KL+G +Y +QQ+ A KG+DP GYR EFI+L+ A Sbjct: 552 QFAATVAGFADKLKGEKYTGLWQYQQLIDVAVANKGDDPFGYRNEFIQLLRTA 604 >UniRef50_B0T5X0 von Willebrand factor type A n=1 Tax=Caulobacter sp. K31 RepID=B0T5X0_CAUSK Length = 592 Score = 422 bits (1086), Expect = e-116, Method: Compositional matrix adjust. Identities = 224/494 (45%), Positives = 305/494 (61%), Gaps = 16/494 (3%) Query: 85 TFARAAKAKATHIANP--GTARYQQFDDNPVKQVAQNPLATFSLDVDTGSYANVRRFLNQ 142 +FA + A + A P T +Y NPVK+VA+ P++TFS+DVDT +YANVRRFLN+ Sbjct: 104 SFAAPSPVVAPNFAPPIRDTEKYPGAAANPVKRVAEEPVSTFSIDVDTAAYANVRRFLNE 163 Query: 143 GLLPPPDAVRVEEIVNYFPSDWDIKDKQSIPASKPIPFAMRYELAPAPWNEQRTLLKVDI 202 G PP DA+RVEE++NYF + + P ++ PF + P+PW++ R L+ + + Sbjct: 164 GAAPPHDALRVEELINYFDYGY------ARPTAQEPPFKPTVTVVPSPWSQDRQLMHIGV 217 Query: 203 LAKDRKSEELPASNLVFLIDTSGSMISDERLPLIQSSLKLLVKELREQDNIAIVTYAGDS 262 P NLVFLIDTSGSM +RLPL + +L +L+ +LR QD +++V YAG + Sbjct: 218 QGYATPRAGQPPLNLVFLIDTSGSMSGPDRLPLAKKALNVLIDQLRPQDRVSMVAYAGSA 277 Query: 263 RIALPSISGSHKAEINAAIDSLDAEGSTNGGAGLELAYQQATKGFIKGGINRILLATDGD 322 L G K ++ A+ +L + GST GG GLELAY A + +NR++L TDGD Sbjct: 278 GAVLSPTDGKSKLKMRCALTALRSGGSTAGGQGLELAYALARQNLDPKAVNRVILMTDGD 337 Query: 323 FNVGIDDPKSIESMVKKQRESGVTLSTFGVGNSNYNEAMMVRIADVGNGNYSYIDTLSEA 382 FNVGI DP ++ V QR+SGV LS +G G NYN+ MM +A GNG +Y+D L EA Sbjct: 338 FNVGIADPTRLKDFVADQRKSGVYLSVYGFGRGNYNDTMMQALAQNGNGTAAYVDGLQEA 397 Query: 383 QKVLNSEMRQMLITVAKDVKAQIEFNPAWVTEYRQIGYEKRQLRVEHFNNDNVDAGDIGA 442 +K+L + L +A DVK Q+EFNPA V+EYR IGYE R L E FNND VDAG+IG+ Sbjct: 398 RKLLRDDFDSALFPIADDVKIQVEFNPAKVSEYRLIGYETRLLNREDFNNDQVDAGEIGS 457 Query: 443 GKHITLLFELTLNGQKASIDKLRYAPDNKLAKSDKTKELAWLKIRWKYPQGKESQLVEFP 502 G +T ++E+T G K S D LRY K + + ELA+LKIR+K P G S+L+E P Sbjct: 458 GAAVTAIYEITPVGAKPSSDPLRYG--AKPSPATGGSELAFLKIRYKPPGGSTSKLIERP 515 Query: 503 LG-----PTINAPSEDMRFRAAVAAYGQKLRGSEYLNNT-SWQQIKQWAQQAKGEDPQGY 556 +G ++ A E RF AVAAYGQKLRG +++ + W + AQ A+GEDP G Sbjct: 516 IGAGDMHASLAAAPEATRFAVAVAAYGQKLRGDPWVDASFDWDAVTALAQGARGEDPYGL 575 Query: 557 RAEFIRLIELADGV 570 RAEF++L A V Sbjct: 576 RAEFVQLTRAAKDV 589 >UniRef50_A9DBX8 Putative uncharacterized protein n=1 Tax=Hoeflea phototrophica DFL-43 RepID=A9DBX8_9RHIZ Length = 668 Score = 421 bits (1083), Expect = e-116, Method: Compositional matrix adjust. Identities = 218/473 (46%), Positives = 300/473 (63%), Gaps = 12/473 (2%) Query: 104 RYQQFDDNPVKQVAQNPLATFSLDVDTGSYANVRRFLNQGLLPPPDAVRVEEIVNYFPSD 163 R + FD N V+ VA+ P++TFS DVDT SYA VRR L QG++P P VR+EE+VNYF D Sbjct: 200 RVEGFDSNGVRSVAEYPVSTFSADVDTASYAMVRRALKQGVMPDPRTVRIEEMVNYFNYD 259 Query: 164 WDIKDKQSIPASKPIPFAMRYELAPAPWNEQRTLLKVDILAKDRKSEELPASNLVFLIDT 223 + P S PF + P PWN LL + + D K P +NLV L+D Sbjct: 260 Y------PAPESVETPFRATVTVTPTPWNANTRLLHIGVKGYDVKPAARPQANLVLLVDV 313 Query: 224 SGSMISDERLPLIQSSLKLLVKELREQDNIAIVTYAGDSRIALPSISGSHKAEINAAIDS 283 SGSM ++LPL++S+ +LL+++L +D ++IVTYAGD+ L S KA+I A+D Sbjct: 314 SGSMQETDKLPLLKSAFRLLIQKLEPEDTVSIVTYAGDAGTVLEPTPASDKAKILDALDD 373 Query: 284 LDAEGSTNGGAGLELAYQQATKGFIKGGINRILLATDGDFNVGIDDPKSIESMVKKQRES 343 L GST G AG+E AY+ A K + GG+NR+LLATDGDFNVG D +++S+++++RES Sbjct: 374 LRPGGSTAGAAGIEEAYRLAEKARVNGGVNRVLLATDGDFNVGASDDDALKSLIEEKRES 433 Query: 344 GVTLSTFGVGNSNYNEAMMVRIADVGNGNYSYIDTLSEAQKVLNSEMRQMLITVAKDVKA 403 GV LS FG G NYN+ +M +A GNG +YIDTL+EA+K L E L +A DVK Sbjct: 434 GVFLSIFGFGQGNYNDQLMQTLAQNGNGVAAYIDTLAEAEKTLAQEATASLFPIASDVKF 493 Query: 404 QIEFNPAWVTEYRQIGYEKRQLRVEHFNNDNVDAGDIGAGKHITLLFELTLNGQKASIDK 463 QIEFNP + EYRQIG+E R L E FNND VDAG+IG+G +T ++E+T G A ++ Sbjct: 494 QIEFNPETIAEYRQIGFETRALSREDFNNDQVDAGEIGSGHTVTAIYEVTPVGSPAILNS 553 Query: 464 -LRYAPDNKLAKSDKTKELAWLKIRWKYPQGKESQLVEFPLG-----PTINAPSEDMRFR 517 LRY + ++ E A+LKIR K P +ES L E P+ + +A +D+RF Sbjct: 554 DLRYGAETPVSDVAHGDEFAFLKIRAKVPGEEESSLTEIPVMKDAELTSFSAAPQDVRFS 613 Query: 518 AAVAAYGQKLRGSEYLNNTSWQQIKQWAQQAKGEDPQGYRAEFIRLIELADGV 570 AVAA+ QKLR + +N + I+ A A+GEDP GYR+EF++L+ LA+G+ Sbjct: 614 IAVAAFAQKLRRIQQVNGFGFDAIESIASDARGEDPFGYRSEFLQLVRLANGL 666 >UniRef50_C8SEV7 von Willebrand factor type A n=4 Tax=Rhizobiales RepID=C8SEV7_9RHIZ Length = 718 Score = 414 bits (1065), Expect = e-114, Method: Compositional matrix adjust. Identities = 240/560 (42%), Positives = 324/560 (57%), Gaps = 39/560 (6%) Query: 34 STPTEQQVLAAQQAAIKEAEQSAAAAKALAQQEVQQYSDKQALQG---RLQEAPTFARAA 90 S PT LA Q A + A A A++ Q+ + + G R+ P AA Sbjct: 165 SEPTVAGGLAQQNAQGQVAPAEPAPARSGGQRVIMSLTPPPQADGTTSRIARMP----AA 220 Query: 91 KAKATHIANPGTA-------------RYQQFDDNPVKQVAQNPLATFSLDVDTGSYANVR 137 ++K P TA R Q F NPV ++P++TFS+DVDT SY+ VR Sbjct: 221 ESKLMTPQQPATAPADQIAPQEENRDRVQDFKTNPVHAALEDPVSTFSIDVDTASYSFVR 280 Query: 138 RFLNQGLLPPPDAVRVEEIVNYFPSDWDIKDKQSIPASKPIPFAMRYELAPAPWNEQRTL 197 R L +G +P D VRVEE++NYFP DW D S P F + P PWN L Sbjct: 281 RSLKEGFVPQADTVRVEEMINYFPYDWKGPDSASTP------FNSTVSVMPTPWNTHTKL 334 Query: 198 LKVDILAKDRKSEELPASNLVFLIDTSGSMISDERLPLIQSSLKLLVKELREQDNIAIVT 257 + V I D K E P +NLVFLID SGSM ++LPL++S+ +LLV +L+ D I+IVT Sbjct: 335 MHVAIKGFDVKPTEQPKANLVFLIDVSGSMDEPDKLPLLKSAFRLLVSKLKADDTISIVT 394 Query: 258 YAGDSRIALPSISGSHKAEINAAIDSLDAEGSTNGGAGLELAYQQATKGFIKGGINRILL 317 YAGD+ L + K +I AID+L GST G AG++ AY+ A + FIK G+NR++L Sbjct: 395 YAGDAGTVLMPTKIAEKDKILNAIDNLQPGGSTAGEAGIKEAYKLAQQSFIKDGVNRVML 454 Query: 318 ATDGDFNVGIDDPKSIESMVKKQRESGVTLSTFGVGNSNYNEAMMVRIADVGNGNYSYID 377 ATDGDFNVG D ++ +++++R++GV LS FG G N N+ MM IA GNG +YID Sbjct: 455 ATDGDFNVGQTDDDDLKRLIEQERKTGVFLSVFGFGRGNLNDEMMQTIAQNGNGTAAYID 514 Query: 378 TLSEAQKVLNSEMRQMLITVAKDVKAQIEFNPAWVTEYRQIGYEKRQLRVEHFNNDNVDA 437 TL+EA+KVL + L T+AKDVK Q+EFNP V+EYR IGYE R L E FNND VDA Sbjct: 515 TLAEAEKVLVEDASSTLFTIAKDVKIQVEFNPDKVSEYRLIGYETRALNREDFNNDRVDA 574 Query: 438 GDIGAGKHITLLFELTLNGQKAS-IDKLRYAP----DNKLAKSDKTKELAWLKIRWKYPQ 492 GDIG+G +T ++E+T G ID LRY + +A +D E A++KIR+K P Sbjct: 575 GDIGSGHSVTAIYEITPKGGGGEQIDPLRYGQAGVNNGGVANAD---EYAFVKIRYKLPN 631 Query: 493 GKESQLVEFPLGP-----TINAPSEDMRFRAAVAAYGQKLRGSEYLNNTSWQQIKQWAQQ 547 S+L+ P+ + + S D RF AVAA+GQKLR + + +I + A Sbjct: 632 EDVSKLITTPVTSANEVASFDQASTDQRFSVAVAAFGQKLRDEDATAKFGYDKIMEIATA 691 Query: 548 AKGEDPQGYRAEFIRLIELA 567 A+G DP GYR+EF+ L+ LA Sbjct: 692 ARGADPFGYRSEFLSLVRLA 711 >UniRef50_C8N8N5 von Willebrand factor type A domain protein n=2 Tax=Gammaproteobacteria RepID=C8N8N5_9GAMM Length = 563 Score = 413 bits (1061), Expect = e-113, Method: Compositional matrix adjust. Identities = 220/469 (46%), Positives = 299/469 (63%), Gaps = 15/469 (3%) Query: 104 RYQQFDDNPVKQVAQNPLATFSLDVDTGSYANVRRFLN-QGLLPPPDAVRVEEIVNYFPS 162 RY D NPV +V+ P++TFS+DVDTGSY+N+RR L + LPP DAVRVEEI+NYF Sbjct: 98 RYAHSDANPVHRVSDAPVSTFSIDVDTGSYSNIRRMLTRENRLPPADAVRVEEILNYFAY 157 Query: 163 DWDIKDKQSIPASKPIPFAMRYELAPAPWNEQRTLLKVDILAKDRKSEELPASNLVFLID 222 + + PFA+ + +PW L+++ I A D E+ P +NLVFLID Sbjct: 158 GYPLPQDGK-------PFAVHTQTVDSPWQADAKLIRIAIQAADLAPEKRPPANLVFLID 210 Query: 223 TSGSMISDERLPLIQSSLKLLVKELREQDNIAIVTYAGDSRIALPSISGSHKAEINAAID 282 TSGSM ++LPL++ ++ + LR D I+++TY+G + LP +G K I AA+ Sbjct: 211 TSGSMDDPDKLPLVKKTVCHFAEALRADDRISLITYSGSTAEILPPTAGDQKETIIAALK 270 Query: 283 SLDAEGSTNGGAGLELAYQQATKGFIKGGINRILLATDGDFNVGIDDPKSIESMVKKQRE 342 L A G+T GG L +AY A K + K GINRILLATDGDFNVGI DP ++++ V +R+ Sbjct: 271 PLRAHGATAGGEALRMAYDAAAKNYRKDGINRILLATDGDFNVGISDPATLKNYVADKRK 330 Query: 343 SGVTLSTFGVGNSNYNEAMMVRIADVGNGNYSYIDTLSEAQKVLNSEMRQMLITVAKDVK 402 SG++L+T G G+ NYN+ MM ++AD G+GNYSYID+ +EA+KVL ++ L TVA+D+K Sbjct: 331 SGISLTTLGYGSGNYNDEMMEQLADAGDGNYSYIDSEAEAKKVLVRQLTSTLATVARDIK 390 Query: 403 AQIEFNPAWVTEYRQIGYEKRQLRVEHFNNDNVDAGDIGAGKHITLLFELTLNGQKASID 462 Q+EFNPA V EYR +GYE R LR E FNND VDAGDIGAG +IT L+E+ G+ +D Sbjct: 391 IQLEFNPAAVKEYRLVGYENRLLREEDFNNDRVDAGDIGAGHNITALYEIIPQGKTGWLD 450 Query: 463 KLRYAPDNKLAKSDKTKELAWLKIRWKYPQGKESQLVEFPLG----PTINAPSEDMRFRA 518 Y N A S K E WLK+R+K P+ ++SQL+E P+ P +A E RF Sbjct: 451 ARHY--QNAPAASGKADEYGWLKLRYKAPESEQSQLIEQPIAAKSIPLADA-EEATRFAI 507 Query: 519 AVAAYGQKLRGSEYLNNTSWQQIKQWAQQAKGEDPQGYRAEFIRLIELA 567 A A+Y Q L+G +Y W I + AQ A+G DP RA ++LIE A Sbjct: 508 AAASYAQALKGGKYNGALDWAGILRLAQAAQGSDPYDERAGLLQLIEKA 556 >UniRef50_C5BTM6 von Willebrand factor type A domain protein n=1 Tax=Teredinibacter turnerae T7901 RepID=C5BTM6_TERTT Length = 689 Score = 412 bits (1060), Expect = e-113, Method: Compositional matrix adjust. Identities = 221/473 (46%), Positives = 301/473 (63%), Gaps = 15/473 (3%) Query: 102 TARYQQFDD---NPVKQVAQNPLATFSLDVDTGSYANVRRFLNQGLLPPPDAVRVEEIVN 158 TA FD NP+K + P++TFS+DVDT SY+ VRR LN+G LP AVR+EE+VN Sbjct: 219 TADRDHFDTVATNPIKVTREEPVSTFSIDVDTASYSFVRRQLNRGQLPQKAAVRLEEMVN 278 Query: 159 YFPSDWDIKDKQSIPASKPIPFAMRYELAPAPWNEQRTLLKVDILAKDRKSEELPASNLV 218 YFP D+ +P++ PF + PAPWN+ + L+ + I K P +NLV Sbjct: 279 YFPYDY------PLPSAATAPFKPTITVIPAPWNQAKRLVHIGI--KALPLAHPPKANLV 330 Query: 219 FLIDTSGSMISDERLPLIQSSLKLLVKELREQDNIAIVTYAGDSRIALPSISGSHKAEIN 278 FL+D SGSM S ++LPL++ S++LL+ L+ D ++IV YAG + L + + +I Sbjct: 331 FLLDVSGSMGSPDKLPLVKQSMELLLSGLQPTDTVSIVVYAGAAGTVLEPTPVAEQQKIL 390 Query: 279 AAIDSLDAEGSTNGGAGLELAYQQATKGFIKGGINRILLATDGDFNVGIDDPKSIESMVK 338 AA+D L+A GST G G+ELAYQ A + + +NRI+LATDGDFNVGI DP+ ++ V+ Sbjct: 391 AALDRLNAGGSTAGAQGIELAYQLAEANYQRDAVNRIILATDGDFNVGIADPEQLKGYVE 450 Query: 339 KQRESGVTLSTFGVGNSNYNEAMMVRIADVGNGNYSYIDTLSEAQKVLNSEMRQMLITVA 398 ++R +G+ LS G G+ NYN+A+M ++A GNG +YIDTLSEAQKVL + L TVA Sbjct: 451 RKRANGIELSILGFGSGNYNDALMQQLAQNGNGVAAYIDTLSEAQKVLVEQASGTLFTVA 510 Query: 399 KDVKAQIEFNPAWVTEYRQIGYEKRQLRVEHFNNDNVDAGDIGAGKHITLLFELTLNGQK 458 KDVK Q+EFNPA V EYR +GYE R L+ E FNND VDAG+IGAG +T ++E+T G K Sbjct: 511 KDVKIQVEFNPATVAEYRLLGYETRALKREDFNNDAVDAGEIGAGHTVTAIYEITPAGSK 570 Query: 459 AS-IDKLRYAPDNKLAKSDKTKELAWLKIRWKYPQGKESQLVEFPLGPTINAPSEDMR-- 515 A+ ID RYA +DK E +LKIR+K P G ESQL+ P+ ++ +R Sbjct: 571 AALIDSQRYAAAKIATDTDKASEYGFLKIRYKQPGGSESQLISAPIPVAMDTTQTQLREA 630 Query: 516 -FRAAVAAYGQKLRGSEYLNNTSWQQIKQWAQQAKGEDPQGYRAEFIRLIELA 567 F AAVA + Q L+ YL N S + AQ KG+DP GYR EF++L+ A Sbjct: 631 QFGAAVAGFAQWLKDPRYLGNWSLDDALKLAQANKGDDPYGYRTEFVQLVRKA 683 >UniRef50_C6VVX3 von Willebrand factor type A n=17 Tax=Bacteria RepID=C6VVX3_DYAFD Length = 625 Score = 412 bits (1060), Expect = e-113, Method: Compositional matrix adjust. Identities = 220/495 (44%), Positives = 313/495 (63%), Gaps = 21/495 (4%) Query: 85 TFARAAKAKATH----IANP-GTARYQQFDDNPVKQVAQNPLATFSLDVDTGSYANVRRF 139 TF AA A +TH +A P T Y+ ++N V Q P+ TFS+DVD +Y+NVRRF Sbjct: 122 TFDMAA-APSTHSETILAMPQATESYKPINENGFLSVGQQPVTTFSVDVDRAAYSNVRRF 180 Query: 140 LNQGLLPPPDAVRVEEIVNYFPSDWDIKDKQSIPASKPIPFAMRYELAPAPWNEQRTLLK 199 LN G +PP DAVR+EE++NYF D+ + P A+ E +PWN L+ Sbjct: 181 LNNGQMPPEDAVRIEEMINYFDYDYPQPRGEH-------PVAIVAETTDSPWNPGLKLVH 233 Query: 200 VDILAKDRKSEELPASNLVFLIDTSGSMISDERLPLIQSSLKLLVKELREQDNIAIVTYA 259 + + AK +E L ASNLVFLID SGSM +LPL++ + KLL +LR +D I+IV YA Sbjct: 234 IGLQAKTVSAENLSASNLVFLIDVSGSMNEANKLPLLKQAFKLLADQLRVEDKISIVAYA 293 Query: 260 GDSRIALPSISGSHKAEINAAIDSLDAEGSTNGGAGLELAYQQATKGFIKGGINRILLAT 319 G + + L SGS K I A+D L+A GST GG G+ELAY A K F+ G NR++LAT Sbjct: 294 GSAGMVLAPTSGSEKKTIKDALDKLEAGGSTAGGEGIELAYDLAKKHFLPKGNNRVILAT 353 Query: 320 DGDFNVGIDDPKSIESMVKKQRESGVTLSTFGVGNSNYNEAMMVRIADVGNGNYSYIDTL 379 DGDFNVGI + ++ +++++R++G+ LS G G NY ++ + +AD GNGNY+YID + Sbjct: 354 DGDFNVGISNESELQKLIEEKRKAGIFLSVMGFGMGNYKDSHVETLADKGNGNYAYIDNI 413 Query: 380 SEAQKVLNSEMRQMLITVAKDVKAQIEFNPAWVTEYRQIGYEKRQLRVEHFNNDNVDAGD 439 EA+KV E L T+AKDVK QIEFNPA V YR IGYE R LR + FN+D DAGD Sbjct: 414 QEARKVFVQEFGGTLFTIAKDVKIQIEFNPAHVQAYRLIGYENRALRNDEFNDDRKDAGD 473 Query: 440 IGAGKHITLLFELTLNGQK----ASIDKLRYAPDNKLAKSDKTKELAWLKIRWKYPQGKE 495 +G+G +T ++E+ +G K A+ D L+Y P N A KE+ +K+R+K P ++ Sbjct: 474 MGSGHTVTAIYEIVPSGVKSPYVATTDALKYQPGNA-ATGSSNKEMMTIKVRYKQPDSEK 532 Query: 496 SQLVEFPLGPTINA---PSEDMRFRAAVAAYGQKLRGSEYLNNTSWQQIKQWAQQAKGED 552 S+L + P+ T A S ++RF +AVA +G LRGSE+ + S+ + + A+ A G+D Sbjct: 533 SKLFDLPVPATTVAFDQCSANLRFASAVAEFGLLLRGSEFKGSASYADVIRRARAAFGKD 592 Query: 553 PQGYRAEFIRLIELA 567 +GYR+EF++L+++A Sbjct: 593 EEGYRSEFVQLVKVA 607 >UniRef50_A0NVX5 Putative uncharacterized protein n=1 Tax=Labrenzia aggregata IAM 12614 RepID=A0NVX5_9RHOB Length = 608 Score = 412 bits (1059), Expect = e-113, Method: Compositional matrix adjust. Identities = 217/469 (46%), Positives = 297/469 (63%), Gaps = 10/469 (2%) Query: 104 RYQQFDDNPVKQVAQNPLATFSLDVDTGSYANVRRFLNQGLLPPPDAVRVEEIVNYFPSD 163 R+ + NP+++ + +P++TFS+DVDT SY+ VR L+ G LP PDAVRVEE+VNYF + Sbjct: 142 RFASAEANPLRRTSADPVSTFSVDVDTASYSYVRSTLSGGRLPNPDAVRVEEMVNYFDYN 201 Query: 164 WDIKDKQSIPASKPIPFAMRYELAPAPWNEQRTLLKVDILAKDRKSEELPASNLVFLIDT 223 + +P PF+ + PWNE L++V I ++LP+ NLVFLIDT Sbjct: 202 Y------PVPEKGGHPFSTNVSVVDTPWNEHTKLMQVGIQGYKVPLDDLPSQNLVFLIDT 255 Query: 224 SGSMISDERLPLIQSSLKLLVKELREQDNIAIVTYAGDSRIALPSISGSHKAEINAAIDS 283 SGSM +LPL+Q S +LL+ LR++D +AIVTYAG S + L + K I I++ Sbjct: 256 SGSMADANKLPLLQQSFRLLLSSLRDEDEVAIVTYAGSSGVLLEPTKVADKTRILEKINA 315 Query: 284 LDAEGSTNGGAGLELAYQQATKGFIKGGINRILLATDGDFNVGIDDPKSIESMVKKQRES 343 L + GST G GL+ AY A G RI+LATDGDFNVG+ DP S++ V +QRE+ Sbjct: 316 LTSGGSTAGHEGLKGAYALAETMTGDGEQTRIILATDGDFNVGLSDPDSLKRYVAEQREN 375 Query: 344 GVTLSTFGVGNSNYNEAMMVRIADVGNGNYSYIDTLSEAQKVLNSEMRQMLITVAKDVKA 403 G LS G G NYN+ +M +A G G +YIDTLSEA+KVL ++ + +A+DVK Sbjct: 376 GTALSVLGFGRGNYNDELMQTLAQNGQGVAAYIDTLSEARKVLVDQVVSSISMIAQDVKI 435 Query: 404 QIEFNPAWVTEYRQIGYEKRQLRVEHFNNDNVDAGDIGAGKHITLLFELTLNGQKAS-ID 462 Q+EFNP V EYR IGYE R LR E F ND VDAGDIGAG ++T L+E+T G A Sbjct: 436 QVEFNPETVAEYRLIGYETRALRTEDFKNDKVDAGDIGAGHNVTALYEITPVGSPAEKFS 495 Query: 463 KLRYAPDNKL--AKSDKTKELAWLKIRWKYPQGKESQLVEFP-LGPTINAPSEDMRFRAA 519 LRY P K+ ++ ELA++K+R+K P KES LVE P + T+ P + F A+ Sbjct: 496 DLRYGPKEKIEAVRTSYAGELAFVKLRYKLPGDKESTLVETPVMEDTVGIPKSETLFAAS 555 Query: 520 VAAYGQKLRGSEYLNNTSWQQIKQWAQQAKGEDPQGYRAEFIRLIELAD 568 VAA+GQKL+G++YL + ++ I++ A KG DP GYR+EF+ L+ LAD Sbjct: 556 VAAFGQKLKGTDYLGDWDFKAIEKLASDNKGTDPFGYRSEFLTLVRLAD 604 >UniRef50_C7PNZ7 von Willebrand factor type A n=5 Tax=Bacteroidetes RepID=C7PNZ7_CHIPD Length = 639 Score = 409 bits (1051), Expect = e-112, Method: Compositional matrix adjust. Identities = 217/472 (45%), Positives = 300/472 (63%), Gaps = 15/472 (3%) Query: 102 TARYQQFDDNPVKQVAQNPLATFSLDVDTGSYANVRRFLNQGLLPPPDAVRVEEIVNYFP 161 T Y ++N VA +PL+TFS+DVD SY+NVRRFLN+G +PP DAVRVEE++NYF Sbjct: 167 TEDYSPVNENRFHTVASDPLSTFSIDVDRASYSNVRRFLNEGNMPPVDAVRVEEMINYF- 225 Query: 162 SDWDIKDKQSIPASKPIPFAMRYELAPAPWNEQRTLLKVDILAKDRKSEELPASNLVFLI 221 D+ K S P P A+R ++A PWN L+++ + KD + LP SNLVFLI Sbjct: 226 -DY----KYSNPTGN-TPVAVRTDMAICPWNTAHQLVRIALKGKDVAKDNLPPSNLVFLI 279 Query: 222 DTSGSMISDERLPLIQSSLKLLVKELREQDNIAIVTYAGDSRIALPSISGSHKAEINAAI 281 D SGSM ++LPL++ + KLLV +LR D +AIV YAG + + LPS SG HK I A+ Sbjct: 280 DVSGSMSDAKKLPLVKQAFKLLVNQLRPVDRVAIVVYAGAAGLVLPSTSGDHKTAILDAL 339 Query: 282 DSLDAEGSTNGGAGLELAYQQATKGFIKGGINRILLATDGDFNVGIDDPKSIESMVKKQR 341 D L+A GST GG G++LAY+ AT+ +K G NR+++ATDGDFNVG ++ +++K+R Sbjct: 340 DKLEAGGSTAGGEGVQLAYKTATEYLLKSGNNRVIIATDGDFNVGPSSDGELQRIIEKKR 399 Query: 342 ESGVTLSTFGVGNSNYNEAMMVRIADVGNGNYSYIDTLSEAQKVLNSEMRQMLITVAKDV 401 E G+ LS G G NY + + +AD GNGNY+YID EA++ +E L T+AKDV Sbjct: 400 EKGIFLSVLGFGMGNYKDNKLELLADKGNGNYAYIDNFEEARRTFATEFGGTLFTIAKDV 459 Query: 402 KAQIEFNPAWVTEYRQIGYEKRQLRVEHFNNDNVDAGDIGAGKHITLLFELTLNGQKA-- 459 K Q+EFNP +V YR +GYE R L E FN+D DAGD+GAG +T L+E+ G + Sbjct: 460 KLQVEFNPKYVQSYRLVGYENRLLNNEDFNDDKKDAGDMGAGHTVTALYEVVPVGVQTGQ 519 Query: 460 -SIDKLRYAPDNKLAKSDKTKELAWLKIRWKYPQGKESQLVEFPLG---PTINAPSEDMR 515 ++D L+Y N+ D T E+ +K+R+K P SQL+ L I+A ED R Sbjct: 520 PAVDPLKYQ-QNQPVSGDNT-EVLTVKLRYKNPADTSSQLISQVLHWKRQDISAAPEDFR 577 Query: 516 FRAAVAAYGQKLRGSEYLNNTSWQQIKQWAQQAKGEDPQGYRAEFIRLIELA 567 AVA +G LR SE+ N S++Q+ + A A+G D +GYRAEFI+L++ A Sbjct: 578 MATAVADFGLLLRNSEHKGNASYEQVLKLAGNARGTDEEGYRAEFIQLVKKA 629 >UniRef50_C6BAR1 von Willebrand factor type A n=7 Tax=Alphaproteobacteria RepID=C6BAR1_RHILS Length = 706 Score = 405 bits (1040), Expect = e-111, Method: Compositional matrix adjust. Identities = 219/535 (40%), Positives = 321/535 (60%), Gaps = 23/535 (4%) Query: 47 AAIKEAEQSAAAAKALAQQEVQQYSDKQALQGRLQEAPTFARAAKAKATHIANPGTARYQ 106 AA+ +++A AA + Q +Q+++ A AP+ A+ + +P R+ Sbjct: 183 AALGATKRAAPAAPGIVPQ--RQFAEPMAAI-----APSPVPPAEGRMQMQLDPNRERFA 235 Query: 107 QFDDNPVKQVAQNPLATFSLDVDTGSYANVRRFLNQGLLPPPDAVRVEEIVNYFPSDWDI 166 NP+K VA +P++TFS DVD+ SYA VRR L G +P P +VRVEE++NYFP DW Sbjct: 236 NAAANPIKSVATDPVSTFSADVDSASYAFVRRSLTGGAMPDPLSVRVEEMINYFPYDW-- 293 Query: 167 KDKQSIPASKPIPFAMRYELAPAPWNEQRTLLKVDILAKDRKSEELPASNLVFLIDTSGS 226 P + PF + P PWN L+ V I D P +NLVFLID SGS Sbjct: 294 ----PGPNNADQPFKATVTVMPTPWNRDTELMHVAIKGYDIAPATTPRANLVFLIDVSGS 349 Query: 227 MISDERLPLIQSSLKLLVKELREQDNIAIVTYAGDSRIALPSISGSHKAEINAAIDSLDA 286 M ++LPL++S+ +L+V L+ D ++IVTYAG++ L + K++I +AID L+ Sbjct: 350 MDEPDKLPLLKSAFRLMVNRLKADDTVSIVTYAGNAGTVLAPTRVAEKSKILSAIDRLEP 409 Query: 287 EGSTNGGAGLELAYQQATKGFIKGGINRILLATDGDFNVGIDDPKSIESMVKKQRESGVT 346 GST G G+E AY A +GF+K G+NR++LATDGDFNVG ++ +++++R+ G+ Sbjct: 410 GGSTGGAEGIEAAYDLAKQGFVKDGVNRVMLATDGDFNVGPSSDGDLKRIIEEKRKDGIF 469 Query: 347 LSTFGVGNSNYNEAMMVRIADVGNGNYSYIDTLSEAQKVLNSEMRQMLITVAKDVKAQIE 406 L+ G G N N+++M +A GNG+ +YIDTL+EAQK L E L +A DVK Q+E Sbjct: 470 LTVLGFGRGNLNDSLMQTLAQNGNGSAAYIDTLAEAQKTLVEEAGSTLFPIASDVKFQVE 529 Query: 407 FNPAWVTEYRQIGYEKRQLRVEHFNNDNVDAGDIGAGKHITLLFELTLNGQKASI-DKLR 465 FNP + EYR IGYE R L E FNND VDAGDIG+G +T ++E+T G A + D LR Sbjct: 530 FNPERIAEYRLIGYETRALNREDFNNDRVDAGDIGSGHSVTAIYEITPKGSPAVMNDDLR 589 Query: 466 YAPDNKL----AKSDKTKELAWLKIRWKYPQGKESQLVEFPLG-----PTINAPSEDMRF 516 Y +K+ + S ELA++K+R+K P +S L+ P+ T++A +D+RF Sbjct: 590 YGAADKVPAEASDSAHHGELAFVKMRYKRPGEDKSALITTPVNDGNAVATVDAAPQDVRF 649 Query: 517 RAAVAAYGQKLRGSEYLNNTSWQQIKQWAQQAKGEDPQGYRAEFIRLIELADGVT 571 AVAA+GQKL ++ S+Q I A ++G D GYR++F+ L+ LADG++ Sbjct: 650 SVAVAAFGQKLSHVAAVDTYSYQAIADLAAASRGTDTFGYRSDFLGLVRLADGLS 704 >UniRef50_C1AC65 Putative uncharacterized protein n=1 Tax=Gemmatimonas aurantiaca T-27 RepID=C1AC65_GEMAT Length = 642 Score = 402 bits (1034), Expect = e-110, Method: Compositional matrix adjust. Identities = 214/468 (45%), Positives = 302/468 (64%), Gaps = 14/468 (2%) Query: 105 YQQFDDNPVKQVAQNPLATFSLDVDTGSYANVRRFLNQGLLPPPDAVRVEEIVNYFPSDW 164 Y + +DNP V NPL+TFS+DVD SY N RRFL G PP DAVR+EE++NYFP + Sbjct: 171 YDRIEDNPFLGVTGNPLSTFSIDVDRASYGNARRFLQDGQRPPADAVRIEELINYFP--Y 228 Query: 165 DIKDKQSIPASKPIPFAMRYELAPAPWNEQRTLLKVDILAKDRKSEELPASNLVFLIDTS 224 ++++ + P A+ E+ APW + L+++ + ++ ++ LP +NLVFLID S Sbjct: 229 ELREPRGND-----PVAITTEVTTAPWQPRHQLVRIALQSRRIETASLPPNNLVFLIDVS 283 Query: 225 GSMISDERLPLIQSSLKLLVKELREQDNIAIVTYAGDSRIALPSISGSHKAEINAAIDSL 284 GSM S ++LPL++ SL+LLV ++R QD +AIV YAG + + LPS SG K I AI+ L Sbjct: 284 GSMQSPDKLPLVKQSLRLLVDQMRPQDRVAIVAYAGAAGLVLPSTSGDEKETIIQAIERL 343 Query: 285 DAEGSTNGGAGLELAYQQATKGFIKGGINRILLATDGDFNVGIDDPKSIESMVKKQRESG 344 +A GST GGAG+ELAY+ A + F+ G NR++LA+DGDFNVG+ +E +++++R G Sbjct: 344 EAGGSTAGGAGIELAYRTAREHFMDHGNNRVILASDGDFNVGVSSDGELERLIERKRTEG 403 Query: 345 VTLSTFGVGNSNYNEAMMVRIADVGNGNYSYIDTLSEAQKVLNSEMRQMLITVAKDVKAQ 404 L+ G G NY +A M ++A GNGNY Y+D ++EA+K+L EM L+TVA DVK Q Sbjct: 404 TYLTILGFGTGNYQDAKMEKLAKRGNGNYGYVDDIAEARKMLVREMGATLLTVANDVKLQ 463 Query: 405 IEFNPAWVTEYRQIGYEKRQLRVEHFNNDNVDAGDIGAGKHITLLFELTLNGQKASI--- 461 +EFNP V YR IGYE R LR E F +D DAGD+GAG +T L+E+ G + ++ Sbjct: 464 VEFNPRRVQAYRLIGYEDRLLRTEDFTDDRKDAGDLGAGHQVTALYEIVPVGVQGTVRLQ 523 Query: 462 --DKLRYAPDNKLAKSD--KTKELAWLKIRWKYPQGKESQLVEFPLGPTINAPSEDMRFR 517 + RY P A+S + EL ++K+R+K P S+L+ P+ S+DMRF Sbjct: 524 DTEARRYEPVTGEARSSTATSDELLFVKLRYKRPGESTSRLITHPVPARTVRGSDDMRFA 583 Query: 518 AAVAAYGQKLRGSEYLNNTSWQQIKQWAQQAKGEDPQGYRAEFIRLIE 565 ++VAA+G LR S Y NTS Q+ + A+ A GED GYRAEFIRL+E Sbjct: 584 SSVAAFGMLLRESPYAGNTSAAQVLEQARAALGEDDGGYRAEFIRLVE 631 >UniRef50_Q2N8R4 Von Willebrand factor type A domain protein n=2 Tax=Erythrobacter RepID=Q2N8R4_ERYLH Length = 580 Score = 400 bits (1027), Expect = e-109, Method: Compositional matrix adjust. Identities = 219/468 (46%), Positives = 306/468 (65%), Gaps = 16/468 (3%) Query: 104 RYQQFDDNPVKQVAQNPLATFSLDVDTGSYANVRRFLNQGLLPPPDAVRVEEIVNYFPSD 163 RY + +PVK A PL+TFS+DVDTG+YAN RRFL+QG +PP AVR EE +NYF D Sbjct: 110 RYDGEEVSPVKIAAVEPLSTFSVDVDTGAYANARRFLSQGQMPPKAAVRTEEFINYFRYD 169 Query: 164 WDIKDKQSIPASKPIPFAMRYELAPAPWNEQRTLLKVDILAKDRKSEELPASNLVFLIDT 223 +D P + PF + ++ A PWNE L+++ + D + E P +NLVFL+D Sbjct: 170 YDR------PQDRSQPFTVNFDAARTPWNEDTRLIRIGLAGYDIERSERPPANLVFLMDV 223 Query: 224 SGSMISDERLPLIQSSLKLLVKELREQDNIAIVTYAGDSRIALPSISGSHKAEINAAIDS 283 SGSM ++LPL++++L L EL+ QD ++IV YAG + + L + + K I AA++ Sbjct: 224 SGSMGRPDKLPLVKTALAGLAGELQPQDKVSIVVYAGAAGLVLEPTNDTRK--IRAALNQ 281 Query: 284 LDAEGSTNGGAGLELAYQQATKGFIKGGINRILLATDGDFNVGIDDPKSIESMVKKQRES 343 L A GST GGAG++LAYQ A FI+GG+NR++LATDGDFNVG+ ++ M++K+R+S Sbjct: 282 LQAGGSTAGGAGIQLAYQIAEDNFIEGGVNRVILATDGDFNVGVSSRDALIEMIEKKRDS 341 Query: 344 GVTLSTFGVGNSNYNEAMMVRIADVGNGNYSYIDTLSEAQKVLNSEMRQMLITVAKDVKA 403 G+TL+T G G NYNEAMM +IA+ GNGNY+YID+ EA+KVL EM L T+AKDVK Sbjct: 342 GITLTTLGFGTGNYNEAMMEQIANHGNGNYAYIDSALEAKKVLGDEMSSTLFTIAKDVKI 401 Query: 404 QIEFNPAWVTEYRQIGYEKRQLRVEHFNNDNVDAGDIGAGKHITLLFELTLNGQKASIDK 463 Q+EFNPA +++YR IGYE R LR E F+ND VDAGDIGAG +T ++E+ G K I Sbjct: 402 QVEFNPAVISQYRLIGYENRALRDEDFDNDAVDAGDIGAGHQVTAIYEVVPVGTKGWIPP 461 Query: 464 LRYAPDNKLAKSDKTKELAWLKIRWKYPQGKESQLVEFPLGP----TINAPSEDMRFRAA 519 LRY A S++ +E A++K+R+K P G+ S+L+++ L T P D F +A Sbjct: 462 LRYGDRPAQAASERAEEAAYVKLRYKMPDGETSKLIDYVLPASTLRTATMPRGDFAFASA 521 Query: 520 VAAYGQKLRGSEYLNNTSWQQIKQWAQQAKGEDPQGYRAEFIRLIELA 567 VA +GQKLRG L + ++ + + A G +R EF++L LA Sbjct: 522 VAGFGQKLRGDPMLGDFAYDDLARLA----GTQQDFWRQEFVKLTSLA 565 >UniRef50_Q7UP85 Putative uncharacterized protein n=1 Tax=Rhodopirellula baltica RepID=Q7UP85_RHOBA Length = 885 Score = 399 bits (1025), Expect = e-109, Method: Compositional matrix adjust. Identities = 230/550 (41%), Positives = 317/550 (57%), Gaps = 46/550 (8%) Query: 58 AAKALAQQEVQQYSDKQALQGRLQE---------APTFARAAKAKATHIA---NPGTA-- 103 A + +Q +Q D +A +GR E APT R A T PG + Sbjct: 337 AVAPVPEQLGRQQFDFRASRGRTLERQLGETEELAPTSDRLAILPPTPDGEGQGPGMSGD 396 Query: 104 RYQQFDDNPVKQVAQNPLATFSLDVDTGSYANVRRFLNQGLLPPPDAVRVEEIVNYFPSD 163 +++ +N ++VA + L+TFS+DVDT SYA VR +L +G LP PD+VR+EE++NYF Sbjct: 397 KFEPIQENEFRRVADDALSTFSIDVDTASYAKVRSYLQRGQLPRPDSVRIEELINYF--- 453 Query: 164 WDIKDKQSIP--ASKPIPFAMRYELAPAPWNEQRTLLKVDILAKDRKSEELPASNLVFLI 221 D Q P A P+PF+ +A PWNE L++V I AKD +E P NLVFLI Sbjct: 454 ----DYQYTPPSAEDPVPFSSAMAVASCPWNENNRLVRVGIQAKDIDRKERPRCNLVFLI 509 Query: 222 DTSGSMISDERLPLIQSSLKLLVKELREQDNIAIVTYAGDSRIALPSISGSHKAEINAAI 281 DTSGSM +LPL+ +K+L+ +L+ +D +AIV YAG S + L S K +I A+ Sbjct: 510 DTSGSMKRPNKLPLVIEGMKVLLDQLKNRDRVAIVVYAGSSGLVLDSTPVKQKKKIIRAL 569 Query: 282 DSLDAEGSTNGGAGLELAYQQATKGFIKGGINRILLATDGDFNVGIDDPKSIESMVKKQR 341 +L A GSTNGGAGL+LAYQ A + FI+ G+NR++L +DGDFNVG+ + + +Q Sbjct: 570 SALSAGGSTNGGAGLQLAYQTARENFIEDGVNRVILCSDGDFNVGMTGTDQLVAEATRQS 629 Query: 342 ESGVTLSTFGVGNSNYNEAMMVRIADVGNGNYSYIDTLSEAQKVLNSEMRQMLITVAKDV 401 +SG L+ G G N+N+AMM RI++ G GNY+++DT++EA+KVL ++ L TVAKDV Sbjct: 630 KSGTELTVLGFGMGNHNDAMMERISNSGAGNYAFVDTIAEAKKVLADQVAGTLFTVAKDV 689 Query: 402 KAQIEFNPAWVTEYRQIGYEKRQLRVEHFNNDNVDAGDIGAGKHITLLFELTLNGQ---- 457 K QIEFNPA V+ YR IGYE R L E FN+D VDAG+IGAG +T L+E+ G+ Sbjct: 690 KIQIEFNPAVVSAYRLIGYENRVLAKEDFNDDKVDAGEIGAGHRVTALYEIAPVGKLPDS 749 Query: 458 -KASIDKLRYAPDN---------------KLAKSDKTKELAWLKIRWKYPQGKESQLVEF 501 +D L+Y P K + TKE+ LKIR K PQG S+ + F Sbjct: 750 IAPDVDPLKYQPSGEENPDSQEANEPRVPKDSDESATKEILTLKIRHKPPQGDVSEKLAF 809 Query: 502 PL---GPTINAPSEDMRFRAAVAAYGQKLRGSEYLNNTSWQQIKQWAQQAKGEDPQGYRA 558 PL D +F AVA +G +LR S + + + A AKG+D G RA Sbjct: 810 PLVNESVPFQEADTDFQFAVAVAVFGMQLRNSTHAGTWTMDDVIATATNAKGDDEHGLRA 869 Query: 559 EFIRLIELAD 568 EF+ L A+ Sbjct: 870 EFLELARTAE 879 >UniRef50_B0CCM8 von Willebrand factor, type A domain protein n=4 Tax=Cyanobacteria RepID=B0CCM8_ACAM1 Length = 686 Score = 397 bits (1021), Expect = e-109, Method: Compositional matrix adjust. Identities = 219/478 (45%), Positives = 308/478 (64%), Gaps = 25/478 (5%) Query: 100 PGTAR---YQQFDDNPVKQVAQNPLATFSLDVDTGSYANVRRFLNQGLLPPPDAVRVEEI 156 PGT Y++ ++NP + PL+TFS+DVDT SY+NVRRF+ QG LPP DAVR+EE+ Sbjct: 201 PGTFNTEDYKRINENPFFLPQRTPLSTFSIDVDTASYSNVRRFIRQGQLPPKDAVRLEEL 260 Query: 157 VNYFPSDW-DIKDKQSIPASKPIPFAMRYELAPAPWNEQRTLLKVDILAKDRKSEELPAS 215 +NYF + K Q PF++ E+A APWN Q L+ + + K+ + E+ S Sbjct: 261 INYFDYGYASPKGDQ--------PFSVSTEVATAPWNNQHKLVHIGLKGKELEKEQ--PS 310 Query: 216 NLVFLIDTSGSMISDERLPLIQSSLKLLVKELREQDNIAIVTYAGDSRIALPSISGSHKA 275 NLVFLID SGSM +L L++ SL LLV +L+ +D +++V YAG + I LPS G+ KA Sbjct: 311 NLVFLIDVSGSMKRPNKLALVKKSLCLLVHQLKPEDRVSLVVYAGRAGIVLPSTPGTQKA 370 Query: 276 EINAAIDSLDAEGSTNGGAGLELAYQQATKGFIKGGINRILLATDGDFNVGIDDPKSIES 335 I AID L+A GST G AG+++AY A + F+K G NR++LATDGDFNVG +E Sbjct: 371 TIMNAIDRLEAGGSTAGAAGIKMAYDMAERHFLKNGNNRVILATDGDFNVGQSSDAELER 430 Query: 336 MVKKQRESGVTLSTFGVGNSNYNEAMMVRIADVGNGNYSYIDTLSEAQKVLNSEMRQMLI 395 +++++R+ GV L+ G G NY + M +A+ GNGNY+YIDTL EAQKVL +++R L Sbjct: 431 LIEQKRDRGVFLTVLGYGTGNYKDNKMELLANKGNGNYAYIDTLLEAQKVLVNDLRGTLF 490 Query: 396 TVAKDVKAQIEFNPAWVTEYRQIGYEKRQLRVEHFNNDNVDAGDIGAGKHITLLFELTLN 455 T+AKDVK Q+EFNP V YR IGYE R LR + FN+D DAG+IG+G IT L+E+ Sbjct: 491 TIAKDVKIQVEFNPGKVQAYRLIGYENRLLRDQDFNDDRKDAGEIGSGHTITALYEVIPT 550 Query: 456 GQKA-----SIDKLRYAPDNKLAKSDKTKELAWLKIRWKYPQGKESQLVEFPLGP---TI 507 G K+ ID L++ K S + +L LK+R+K P G +SQL+ + +I Sbjct: 551 GVKSDVELPDIDPLKF---QKPTASSNSSDLMNLKLRYKQPTGSKSQLISTAIADKNRSI 607 Query: 508 NAPSEDMRFRAAVAAYGQKLRGSEYLNNTSWQQIKQWAQQAKGEDPQGYRAEFIRLIE 565 + +++++F AAVA YG LR S+Y ++ Q+ A QAKG+DPQGYR F++L+E Sbjct: 608 QSATDNLKFSAAVAMYGMVLRDSDYKGKATFNQVLDLADQAKGKDPQGYRMAFMQLVE 665 >UniRef50_Q28U54 von Willebrand factor type A n=3 Tax=Rhodobacterales RepID=Q28U54_JANSC Length = 686 Score = 397 bits (1020), Expect = e-109, Method: Compositional matrix adjust. Identities = 205/464 (44%), Positives = 284/464 (61%), Gaps = 6/464 (1%) Query: 105 YQQFDDNPVKQVAQNPLATFSLDVDTGSYANVRRFLNQGLLPPPDAVRVEEIVNYFPSDW 164 + DDNP++ VA +P++TFS+DVDT SYA +R LN+G LP PDAVR+EE+VNYFP D+ Sbjct: 222 FANADDNPLRVVADDPVSTFSIDVDTASYALLRSTLNRGALPAPDAVRIEEMVNYFPYDY 281 Query: 165 DIKDKQSIPASKPIPFAMRYELAPAPWNEQRTLLKVDILAKDRKSEELPASNLVFLIDTS 224 I PF ++ PWN L+ + I E+ P NLVFLIDTS Sbjct: 282 PAPTADDIS-----PFRPNVQVFETPWNPDTQLVHIGIQGDLPVVEDRPPLNLVFLIDTS 336 Query: 225 GSMISDERLPLIQSSLKLLVKELREQDNIAIVTYAGDSRIALPSISGSHKAEINAAIDSL 284 GSM +LPL+ S +L++ L +D +AIVTYAG + +AL + S A INAA+ +L Sbjct: 337 GSMNDPAKLPLLIQSFRLMLNRLSPEDEVAIVTYAGSAGVALEPTAASDTATINAALTTL 396 Query: 285 DAEGSTNGGAGLELAYQQATKGFIKGGINRILLATDGDFNVGIDDPKSIESMVKKQRESG 344 A GSTNG GLE AY+ A + + G ++R+LLATDGDFNVG+ D ++E + +QR++G Sbjct: 397 QAGGSTNGVGGLEEAYRLAGEMMVDGEVSRVLLATDGDFNVGLSDAGALEDYIAEQRDTG 456 Query: 345 VTLSTFGVGNSNYNEAMMVRIADVGNGNYSYIDTLSEAQKVLNSEMRQMLITVAKDVKAQ 404 + LS G G N + M +A GNG SYIDTL EAQ+VL ++ L +A D+K Q Sbjct: 457 IYLSVLGFGRGNLQDDTMQALAQNGNGTASYIDTLHEAQRVLVDQLAGALYPIADDLKVQ 516 Query: 405 IEFNPAWVTEYRQIGYEKRQLRVEHFNNDNVDAGDIGAGKHITLLFELTLNGQKAS-IDK 463 +EFNP + EYR IGYE R L E F ND VDAGDIGAG +T ++E+T G A + Sbjct: 517 VEFNPDVIAEYRLIGYETRALAREDFANDAVDAGDIGAGHSVTAIYEVTPVGSPAVLVAP 576 Query: 464 LRYAPDNKLAKSDKTKELAWLKIRWKYPQGKESQLVEFPLGPTINAPSEDMRFRAAVAAY 523 LRY D ++ EL ++ +RWK P ESQL++FP+ + P + +F AA+A + Sbjct: 577 LRYTADEGAPEAAFGDELGFISLRWKEPGADESQLIDFPIANAVADPGTEAQFAAAIAGF 636 Query: 524 GQKLRGSEYLNNTSWQQIKQWAQQAKGEDPQGYRAEFIRLIELA 567 GQ LRGS+++ + + A +G D GYR E ++L+ LA Sbjct: 637 GQLLRGSDFVADWDYADAIALANANRGMDEFGYRTEAVQLMRLA 680 >UniRef50_A1ZHE2 Von Willebrand factor type A domain protein n=1 Tax=Microscilla marina ATCC 23134 RepID=A1ZHE2_9SPHI Length = 704 Score = 396 bits (1017), Expect = e-108, Method: Compositional matrix adjust. Identities = 210/498 (42%), Positives = 308/498 (61%), Gaps = 15/498 (3%) Query: 78 GRLQEAPTFARAAKAKATHIANPGTARYQQFDDNPVKQVAQNPLATFSLDVDTGSYANVR 137 G L F R A A A P RY +N QV QNPL+TFS+DVD SY+NVR Sbjct: 207 GDLSSDLKFDRKA---AFRNAFPEGERYATIYENQFYQVGQNPLSTFSIDVDNASYSNVR 263 Query: 138 RFLNQGLLPPPDAVRVEEIVNYFPSDWD----IKDKQSIPASKPIPFAMRYELAPAPWNE 193 RF+N G P +AVRVEE++NYF D+ KDK+ + P F++ E PWN Sbjct: 264 RFVNDGQPLPKNAVRVEEMINYFEYDYPQPTPTKDKEGKLQTHP--FSVNTEYGTCPWNP 321 Query: 194 QRTLLKVDILAKDRKSEELPASNLVFLIDTSGSMISDERLPLIQSSLKLLVKELRE-QDN 252 LL++ + ++ +++ +NLVFL+D SGSM S+++LPL++ S K+L+K+L + + Sbjct: 322 HHKLLQIGLQGENLQTKNASPANLVFLVDASGSMDSEDKLPLLKRSFKVLLKQLTDSRTK 381 Query: 253 IAIVTYAGDSRIALPSISGSHKAEINAAIDSLDAEGSTNGGAGLELAYQQATKGFIKGGI 312 IAIV YAG S + LP+ S SH+ +I A++++++ GST GG G+ELAY+ A + FI GG Sbjct: 382 IAIVAYAGASGLVLPATSVSHREKILTALENIESGGSTAGGEGIELAYKIAQQAFIAGGN 441 Query: 313 NRILLATDGDFNVGIDDPKSIESMVKKQRESGVTLSTFGVGNSNYNEAMMVRIADVGNGN 372 NR++LATDGDFNVG+ + + ++ +R+SGV L+ G G N N++MM ++ + GNGN Sbjct: 442 NRVILATDGDFNVGLSSDEELMQLISNKRKSGVYLTCLGFGTGNLNDSMMEKLTNAGNGN 501 Query: 373 YSYIDTLSEAQKVLNSEMRQMLITVAKDVKAQIEFNPAWVTEYRQIGYEKRQLRVEHFNN 432 Y YID ++EA+KVL + L +AKDVK Q+EFNPA V YR +GYE R L+ F N Sbjct: 502 YYYIDGINEAKKVLAKNLTGTLYAIAKDVKIQLEFNPARVKSYRLVGYENRVLKHRDFKN 561 Query: 433 DNVDAGDIGAGKHITLLFELT-LNGQKASIDK---LRYAPDNKLAKSDKTKELAWLKIRW 488 D VDAG++G G +T L+E+ +N + + L+Y + + EL +K+R+ Sbjct: 562 DQVDAGELGVGHTVTALYEIVPVNRTQPMLADEIPLKYQTTQIDSAALANNELVTIKLRY 621 Query: 489 KYPQGKESQLVEFPL-GPTINAPSEDMRFRAAVAAYGQKLRGSEYLNNTSWQQIKQWAQQ 547 K P+ +S+L+E + + S + +F VAA+G +LR S Y+ NTS+QQI W Q Sbjct: 622 KRPKENKSRLIEKVVKNKLVTQTSNNFKFATTVAAFGMRLRNSPYVGNTSYQQIYSWGQY 681 Query: 548 AKGEDPQGYRAEFIRLIE 565 AK D GYR EF+ L++ Sbjct: 682 AKSVDSNGYRREFLELVK 699 >UniRef50_B1ZYN3 von Willebrand factor type A n=2 Tax=Bacteria RepID=B1ZYN3_OPITP Length = 792 Score = 395 bits (1015), Expect = e-108, Method: Compositional matrix adjust. Identities = 224/511 (43%), Positives = 306/511 (59%), Gaps = 28/511 (5%) Query: 85 TFAR-AAKAKATHIANPGTARYQQFDDNPVKQVAQNPLATFSLDVDTGSYANVRRFLNQG 143 TFA + + H T Y+ ++ ++PL+TF+ DVDT SYANVRRFL +G Sbjct: 284 TFAGIGTRVRGDHRQAMNTEAYRFLRESDFLSAREHPLSTFAADVDTASYANVRRFLREG 343 Query: 144 LLPPPDAVRVEEIVNYFPSDW----DIKDKQSIPASKPIPFAMRYELAPAPWNEQRTLLK 199 LPP DAVR+EE+VNYFP + ++D + + A PFA E+A APW Q L++ Sbjct: 344 RLPPADAVRIEELVNYFPYRYAAPGRVRD-EGVAAPGEAPFAAALEVAAAPWAAQHRLVR 402 Query: 200 VDILAKDRKSEELPASNLVFLIDTSGSMISDERLPLIQSSLKLLVKELREQDNIAIVTYA 259 + + AKD A+NLVFL+D SGSM +L L+Q S++LL+ L+ +D +AIVTYA Sbjct: 403 IGLKAKDAAVSGRAAANLVFLLDVSGSMDQPNKLRLVQESMRLLLGRLQPEDRVAIVTYA 462 Query: 260 GDSRIALPSISGSHKAEINAAIDSLDAEGSTNGGAGLELAYQQATKGFIKGGINRILLAT 319 G+S +ALPS + + EI AID L A GSTNG GL+LAY A F+ G+NR++L T Sbjct: 463 GNSGLALPSTPVARQREILDAIDELRAGGSTNGAMGLQLAYDIAKANFVANGVNRVILCT 522 Query: 320 DGDFNVGIDDPKSIESMVKKQRESGVTLSTFGVGNSNYNEAMMVRIADVGNGNYSYIDTL 379 DGDFNVG+ + +++++ +SGV L+ G G N +AM+ +IAD GNG+Y YIDT Sbjct: 523 DGDFNVGVTSEGELVRLIEEKAKSGVFLTVLGFGMGNLKDAMLQQIADRGNGSYGYIDTR 582 Query: 380 SEAQKVLNSEMRQMLITVAKDVKAQIEFNPAWVTEYRQIGYEKRQLRVEHFNNDNVDAGD 439 EA+K+L ++ L+TVAKDVK Q+EFNPA V YR IGYEKR L E F ND +DAG+ Sbjct: 583 REAEKLLVQQVSGTLLTVAKDVKLQVEFNPAKVARYRLIGYEKRLLNQEDFANDKIDAGE 642 Query: 440 IGAGKHITLLFELTLNGQKASIDKLRYAPDNK----------------LAKSDKTKELAW 483 IGAG +T L+E+ G K + P+++ LA +D EL Sbjct: 643 IGAGHTVTALYEIIPVGAKDAEVTEETEPEDRRYTYSSAAPSAVEKRTLAHAD---ELLT 699 Query: 484 LKIRWKYPQGKESQLVEFPL---GPTINAPSEDMRFRAAVAAYGQKLRGSEYLNNTSWQQ 540 LK+R+K P S +EFPL G SED RF +AVAA+G LR S Y + Sbjct: 700 LKVRYKQPTALLSTRLEFPLKDDGGNFAQASEDFRFASAVAAFGMILRDSPYKGVATLDD 759 Query: 541 IKQWAQQAKGEDPQGYRAEFIRLIELADGVT 571 + WA A +DP GYRAEF+ L++ A +T Sbjct: 760 VIAWANAATSDDPGGYRAEFVELVKQARLLT 790 >UniRef50_UPI000185CB41 protein containing von Willebrand factor n=1 Tax=Capnocytophaga sputigena ATCC 33612 RepID=UPI000185CB41 Length = 550 Score = 392 bits (1007), Expect = e-107, Method: Compositional matrix adjust. Identities = 222/468 (47%), Positives = 304/468 (64%), Gaps = 12/468 (2%) Query: 105 YQQFDDNPVKQVAQNPLATFSLDVDTGSYANVRRFLNQGLLPPPDAVRVEEIVNYFPSDW 164 Y++ +NP VAQ P+ TFS DVD SYAN+RR L G LPP DA+R+EE++NYF D+ Sbjct: 85 YKEISENPFVAVAQQPVTTFSADVDRASYANLRRMLGYGQLPPKDAIRIEEMINYFDYDY 144 Query: 165 DIKDKQSIPASKPIPFAMRYELAPAPWNEQRTLLKVDILAKDRKSEELPASNLVFLIDTS 224 K++ P + ELAP PWN + LL++ + AK + P SN+VFLID S Sbjct: 145 PAPTKEATS-----PLRVTPELAPTPWNPEHLLLRIGLQAKKLDLAQAPPSNIVFLIDVS 199 Query: 225 GSMISDERLPLIQSSLKLLVKELREQDNIAIVTYAGDSRIALPSISGSHKAEINAAIDSL 284 GSM +LPL++SS KLL+ +L+ D +AIVTYA +++AL S + +I +D+L Sbjct: 200 GSMDEPNKLPLLKSSFKLLLTQLKPTDRVAIVTYASGTKVALSSTPVKERQKIEKVLDNL 259 Query: 285 DAEGSTNGGAGLELAYQQATKGFIKGGINRILLATDGDFNVGIDDPKSIESMVKKQRESG 344 A GST+G +G++LAY++A K FIK G NRI+LATDGDFNVGI +P+ +E ++KQRESG Sbjct: 260 YASGSTSGSSGIQLAYKEAQKNFIKNGNNRIILATDGDFNVGISNPRELEKFIEKQRESG 319 Query: 345 VTLSTFGVGNSNYNEAMMVRIADVGNGNYSYIDTLSEAQKVLNSEMRQMLITVAKDVKAQ 404 + +S G G NY + M IAD GNGNY+YID L+EA+KVL +E ML VAKDVK Q Sbjct: 320 IYMSVLGFGMGNYRDDMAETIADKGNGNYAYIDDLTEAKKVLVNEFSGMLFAVAKDVKLQ 379 Query: 405 IEFNPAWVTEYRQIGYEKRQLRVEHFNNDNVDAGDIGAGKHITLLFELTLNGQKASIDKL 464 IEFNP +V EY+ IGYE R L E F +D DAG+IGAG +T L+EL + K + L Sbjct: 380 IEFNPKYVKEYKLIGYENRMLANEDFTDDKKDAGEIGAGHTVTALYELIPSEGKVA-QNL 438 Query: 465 RYAPDNKLAKSDKTKELAWLKIRWKYPQGKESQLVEF--PL---GPTINAPSEDMRFRAA 519 RY +L + K EL +LKIR+K P+ K+++ VE PL ++N S D RF A+ Sbjct: 439 RYQ-TKELNEKGKGNELGFLKIRYKDPKVKDAKSVEVTEPLLFAKKSLNETSVDFRFAAS 497 Query: 520 VAAYGQKLRGSEYLNNTSWQQIKQWAQQAKGEDPQGYRAEFIRLIELA 567 VA +G LRG+ ++ Q+ + A A G+D +GYR EF+RL++ A Sbjct: 498 VAEFGILLRGNSNKAQATYDQVVELANGAIGKDEEGYRKEFVRLVKSA 545 >UniRef50_B8F8Z5 von Willebrand factor type A n=1 Tax=Desulfatibacillum alkenivorans AK-01 RepID=B8F8Z5_DESAA Length = 558 Score = 388 bits (997), Expect = e-106, Method: Compositional matrix adjust. Identities = 200/482 (41%), Positives = 292/482 (60%), Gaps = 13/482 (2%) Query: 92 AKATHIANPGTARYQQFDDNPVKQVAQNPLATFSLDVDTGSYANVRRFLNQGLLPPPDAV 151 A + + T Y + K +PL+TFS+DVDT SY+NVRRFL+ G +PP DAV Sbjct: 79 AYYCRVPDYNTEEYAPIREGGFKSPLYDPLSTFSIDVDTASYSNVRRFLSYGNMPPVDAV 138 Query: 152 RVEEIVNYFPSDWDIKDKQSIPASKPIPFAMRYELAPAPWNEQRTLLKVDILAKDRKSEE 211 R+EE++NYF D+ Q PF++ E++ PWN L+ V + + ++ Sbjct: 139 RIEEMINYFHYDYPQPKGQD-------PFSITMEMSQCPWNRDNMLVHVGLQGRCLDYKD 191 Query: 212 LPASNLVFLIDTSGSMISDERLPLIQSSLKLLVKELREQDNIAIVTYAGDSRIALPSISG 271 + SNLVFL+D SGSM S+ +LPL++ S+++LVKEL D ++IVTYAG + + LPS S Sbjct: 192 VKPSNLVFLLDVSGSMNSENKLPLVKRSMEMLVKELGAGDRVSIVTYAGSAGLVLPSTSA 251 Query: 272 SHKAEINAAIDSLDAEGSTNGGAGLELAYQQATKGFIKGGINRILLATDGDFNVGIDDPK 331 +K +I A+D L+A GST GG G+ELAY+ A + I G NR++L TDGDFNVG+ Sbjct: 252 RNKRKIITALDRLEAGGSTAGGEGIELAYRVAWENLIPEGNNRVILCTDGDFNVGVSSTP 311 Query: 332 SIESMVKKQRESGVTLSTFGVGNSNYNEAMMVRIADVGNGNYSYIDTLSEAQKVLNSEMR 391 + M++++R +G+ L+ G G NY + M I++ GNGN+ YID+ EA KV +MR Sbjct: 312 ELVRMIEEKRRAGIYLTICGFGMGNYKDEKMEAISNAGNGNFYYIDSRREAHKVFVQDMR 371 Query: 392 QMLITVAKDVKAQIEFNPAWVTEYRQIGYEKRQLRVEHFNNDNVDAGDIGAGKHITLLFE 451 + T+AKDVK Q+EFNP V++YR +GYE R L E FNND DAG+IG G +T L+E Sbjct: 372 ANMFTLAKDVKIQVEFNPGRVSQYRLVGYENRLLAAEDFNNDLKDAGEIGPGHSVTALYE 431 Query: 452 LTLNGQKAS---IDKLRYAPDNKLAKSDKTKELAWLKIRWKYPQGKESQLVEFPLGPT-- 506 + G +D L+Y + + + E+ +K R+K P+ S+L+ L + Sbjct: 432 IVPAGLGMGAQRVDPLKYQESEPVPELRNSNEILTIKFRYKNPEENRSRLITRVLDESSM 491 Query: 507 -INAPSEDMRFRAAVAAYGQKLRGSEYLNNTSWQQIKQWAQQAKGEDPQGYRAEFIRLIE 565 S+D RF AAVA +G LR S Y + +W QI+ A+++ G D GYRAEFI+L++ Sbjct: 492 EFGDTSDDFRFSAAVAGWGMLLRNSSYADRLTWGQIQSMAEESVGPDEMGYRAEFIKLVK 551 Query: 566 LA 567 Sbjct: 552 TC 553 >UniRef50_A3ZT14 Putative uncharacterized protein n=1 Tax=Blastopirellula marina DSM 3645 RepID=A3ZT14_9PLAN Length = 616 Score = 387 bits (995), Expect = e-106, Method: Compositional matrix adjust. Identities = 204/473 (43%), Positives = 284/473 (60%), Gaps = 18/473 (3%) Query: 104 RYQQFDDNPVKQVAQNPLATFSLDVDTGSYANVRRFL-NQGLLPPPDAVRVEEIVNYFPS 162 ++ ++NP + VA PL+TFS+DVDT SY+ +R +L + LPP AVRVEE++NYF Sbjct: 148 KFAYVENNPFRAVADEPLSTFSIDVDTASYSKIRSYLIDYHQLPPQGAVRVEELINYFTY 207 Query: 163 DWDIKDKQSIPASKPIPFAMRYELAPAPWNEQRTLLKVDILAKDRKSEELPASNLVFLID 222 D+ Q PFA E A PWN + L+++ I K+ + E PASNLVFL+D Sbjct: 208 DYATPTDQK-------PFAANVEAAACPWNAEHRLVRIGIKGKEIANAERPASNLVFLLD 260 Query: 223 TSGSMISDERLPLIQSSLKLLVKELREQDNIAIVTYAGDSRIALPSISGSHKAEINAAID 282 SGSM + +LPL++ +KLLV +L E D +AIV YAG + + L S +G K+ I A+D Sbjct: 261 VSGSMNNARKLPLLKQGMKLLVDQLGENDKVAIVVYAGAAGMVLNSTNGDDKSTIMEALD 320 Query: 283 SLDAEGSTNGGAGLELAYQQATKGFIKGGINRILLATDGDFNVGIDDPKSIESMVKKQRE 342 L A GSTNGG G+ELAYQ AT+ FIKGG+NR++L TDGDFNVG+ + +M + + Sbjct: 321 RLQAGGSTNGGQGIELAYQAATENFIKGGVNRVILCTDGDFNVGVTSTSDLVTMAADKAK 380 Query: 343 SGVTLSTFGVGNSNYNEAMMVRIADVGNGNYSYIDTLSEAQKVLNSEMRQMLITVAKDVK 402 SGV LS G G N+N+AMM ++ NGNY++IDT++EA+KVL +M L T+AKDVK Sbjct: 381 SGVFLSVMGFGTGNHNDAMMEELSGKANGNYAFIDTITEAKKVLVEQMSGTLTTIAKDVK 440 Query: 403 AQIEFNPAWVTEYRQIGYEKRQLRVEHFNNDNVDAGDIGAGKHITLLFELTLNG-----Q 457 QIEFNP V YR +GYE R L E FN+D DAG+IGAG +T +E+ Sbjct: 441 IQIEFNPTKVAAYRLVGYENRLLANEDFNDDKKDAGEIGAGHCVTAFYEIVPASVESPVT 500 Query: 458 KASIDKLRYAPDNKLAKSDKTKELAWLKIRWKYPQGKESQLVEFPL---GPTINAPSEDM 514 A +D L+Y + + + EL LKIR+K P ES L+ + G S D Sbjct: 501 TAKVDDLKYQATRDVTPAADSDELLTLKIRYKQPDEDESSLISVGVKDSGNRFAQASGDF 560 Query: 515 RFRAAVAAYGQKLRGSEYLNNTSWQQIKQWAQQAKGEDPQGYRAEFIRLIELA 567 +F A VA +G LR + + +I + G+D YR EF+++++ A Sbjct: 561 QFAAGVAMFGMLLRAGDQDAKVNLDEITELVSNNVGDD--SYRGEFLKIVQAA 611 >UniRef50_A5F9T1 von Willebrand factor, type A n=9 Tax=Bacteria RepID=A5F9T1_FLAJ1 Length = 709 Score = 384 bits (986), Expect = e-105, Method: Compositional matrix adjust. Identities = 214/479 (44%), Positives = 303/479 (63%), Gaps = 20/479 (4%) Query: 100 PGTARYQQFDDNPVKQVAQNPLATFSLDVDTGSYANVRRFLNQGLLPPPDAVRVEEIVNY 159 P Y F +N + PL+TFS+DVD SY N+RRFLN G P DAVRVEE+VN+ Sbjct: 238 PTQEDYDTFVENAFESPKTAPLSTFSIDVDNASYTNIRRFLNSGQEVPKDAVRVEEMVNF 297 Query: 160 FPSDWDIKDKQSIPASK-PIPFAMRYELAPAPWNEQRTLLKVDILAKDRKSEELPASNLV 218 F ++ P K PF++ E + +PWN Q +LK+ + K+ + +LP+SNLV Sbjct: 298 FKYNY--------PQPKNEHPFSINTEYSDSPWNSQNKILKIGLQGKNIATNDLPSSNLV 349 Query: 219 FLIDTSGSMISDERLPLIQSSLKLLVKELREQDNIAIVTYAGDSRIALPSISGSHKAEIN 278 FLID SGSM +LPL++ S+K+LV ELR D ++IV YAG + + LP SG+ K I Sbjct: 350 FLIDVSGSMEDMNKLPLLKQSMKILVNELRPTDKVSIVVYAGAAGMVLPPTSGNEKKTII 409 Query: 279 AAIDSLDAEGSTNGGAGLELAYQQATKGFIKGGINRILLATDGDFNVGIDDPKSIESMVK 338 A+D L+A GST GGAG+ELAY+ AT+ FIKGG NR++LATDGDFNVG +E +++ Sbjct: 410 KALDQLEAGGSTAGGAGIELAYKIATENFIKGGNNRVILATDGDFNVGSSSNSDMEKLIE 469 Query: 339 KQRESGVTLSTFGVGNSNYNEAMMVRIADVGNGNYSYIDTLSEAQKVLNSEMRQMLITVA 398 ++R++GV L+ G G NY ++ M +AD GNGNY+YID + EA + L E + + +A Sbjct: 470 EKRKTGVFLTCLGYGMGNYKDSKMEILADKGNGNYAYIDNIQEANRFLGKEFKGSMFAIA 529 Query: 399 KDVKAQIEFNPAWVTEYRQIGYEKRQLRVEHFNNDNVDAGDIGAGKHITLLFELTLNGQK 458 KDVK QIEFNP V YR IGYE R+LR E F ND +DAG++G+ +T L+E+ G K Sbjct: 530 KDVKIQIEFNPKQVQAYRLIGYENRKLRPEDFKNDAIDAGELGSNHTVTALYEIIPAGVK 589 Query: 459 ASIDKLRYAPDN-KLAKSDK-----TKELAWLKIRWKYPQGKES-QLVEF--PLGPTINA 509 + D L PD+ K K++ + ELA +K R+K P G +S ++V+ +++ Sbjct: 590 S--DFLNVQPDDLKYTKTETNSANYSNELATIKFRYKKPDGDKSIEMVQVINTKSVSLDQ 647 Query: 510 PSEDMRFRAAVAAYGQKLRGSEYLNNTSWQQIKQWAQQAKGEDPQGYRAEFIRLIELAD 568 S+D +F AVA +G KLR S+ + + S + I + AQQ D GY+AEFIRL+E ++ Sbjct: 648 ASDDFKFSTAVAWFGLKLRDSKLITDKSSESIAELAQQGMSFDKGGYKAEFIRLVETSE 706 >UniRef50_A3PN61 von Willebrand factor, type A n=9 Tax=Rhodobacteraceae RepID=A3PN61_RHOS1 Length = 651 Score = 383 bits (983), Expect = e-104, Method: Compositional matrix adjust. Identities = 203/484 (41%), Positives = 295/484 (60%), Gaps = 9/484 (1%) Query: 89 AAKAKATHIANPGTARYQQFDDNPVKQVAQNPLATFSLDVDTGSYANVRRFLNQGLLPPP 148 AA+A A + + + DNP++ A++P++TFS+DVDT SYA +R L G LPP Sbjct: 175 AAEAPARALPQGDSEAFANAPDNPLRVTAEDPVSTFSIDVDTASYAILRSSLRAGQLPPR 234 Query: 149 DAVRVEEIVNYFPSDWDIKDKQSIPASKPIPFAMRYELAPAPWNEQRTLLKVDILAKDRK 208 +AVR+EE++NYFP D+ P + PF + PWN + L+ V + + Sbjct: 235 EAVRIEEMINYFPYDY------PAPENGTPPFRPTLSITRTPWNPETRLVHVALQGRMPA 288 Query: 209 SEELPASNLVFLIDTSGSMISDERLPLIQSSLKLLVKELREQDNIAIVTYAGDSRIALPS 268 E+ P NLVFLIDTSGSM +LPL++ S L++ LR +D +AIVTYAG + L Sbjct: 289 IEDRPPLNLVFLIDTSGSMQDPAKLPLLKQSFGLMLGRLRPEDQVAIVTYAGSAGEVLAP 348 Query: 269 ISGSHKAEINAAIDSLDAEGSTNGGAGLELAYQQATKGFIKGGINRILLATDGDFNVGID 328 + + ++ I +A+D LDA GST G GL LAY+ A++ G + R++LATDGDFN+GI Sbjct: 349 TAANQRSTILSALDRLDAGGSTAGDEGLALAYRTASEMAGAGEVTRVVLATDGDFNLGIS 408 Query: 329 DPKSIESMVKKQRESGVTLSTFGVGNSNYNEAMMVRIADVGNGNYSYIDTLSEAQKVLNS 388 DP+ + +V +R++GV LS G G N ++A M +A GNG +YID+L+EAQKVL Sbjct: 409 DPEELARLVAHERDTGVYLSVLGFGRGNLDDATMQALAQNGNGQAAYIDSLNEAQKVLVD 468 Query: 389 EMRQMLITVAKDVKAQIEFNPAWVTEYRQIGYEKRQLRVEHFNNDNVDAGDIGAGKHITL 448 ++ L +A DVK Q+E++PA V EYR IGYE R LR E F ND VDAG+IGAG +T Sbjct: 469 QLSGALFPIADDVKVQVEWSPARVAEYRLIGYETRGLRREDFANDRVDAGEIGAGHSVTA 528 Query: 449 LFELTLNGQKASI-DKLRYAPDNKLAKSDKTKELAWLKIRWKYPQGKESQLVEFPLGPTI 507 ++E+T A + D LRY + + EL +L++R+K P S L++ P+ + Sbjct: 529 IYEITPVDSPARLTDPLRYGAEPP--EGAHGDELGFLRLRYKAPGESTSTLIDTPIPDML 586 Query: 508 NAPSEDMRFRAAVAAYGQKLRGSEYLNNTSWQQIKQWAQQAKGEDPQGYRAEFIRLIELA 567 SED+RF A+A +G+ LRGS+ L W + A A+G DP GYR E ++L+ LA Sbjct: 587 TEASEDVRFSTAIAGFGELLRGSDKLGAWGWDEAIALADGARGADPFGYRVEAVQLMRLA 646 Query: 568 DGVT 571 + ++ Sbjct: 647 ESLS 650 >UniRef50_UPI00016C54C8 hypothetical protein GobsU_21830 n=1 Tax=Gemmata obscuriglobus UQM 2246 RepID=UPI00016C54C8 Length = 638 Score = 381 bits (978), Expect = e-104, Method: Compositional matrix adjust. Identities = 202/475 (42%), Positives = 292/475 (61%), Gaps = 13/475 (2%) Query: 105 YQQFDDNPVKQVAQNPLATFSLDVDTGSYANVRRFLNQGLLPPPDAVRVEEIVNYFPSDW 164 Y ++ +N + L+TFS DV+T SYANVRR LN+G LPP AV + E VNYFP + Sbjct: 168 YGRYQENEFRSPLVAALSTFSADVNTASYANVRRMLNEGTLPPASAVFLAEFVNYFPYSY 227 Query: 165 DIKDKQSIPASKPIPFAMRYELAPAPWNEQRTLLKVDILAKDRKSEELPASNLVFLIDTS 224 + P + P A E+ P PWN + LL+V + A +E+LP NLVFL+DTS Sbjct: 228 ------APPPAGADPVAFHVEMGPCPWNAKHHLLRVGVQAHQIPAEKLPPRNLVFLVDTS 281 Query: 225 GSMISDERLPLIQSSLKLLVKELREQDNIAIVTYAGDSRIALPSISGSHKAEINAAIDSL 284 GSM + RLPL+Q SL+LLV++L E+D +++VTYAGDSR+ALP SG+ K I + L Sbjct: 282 GSMQQENRLPLVQKSLELLVEKLTEKDRVSVVTYAGDSRVALPPTSGADKKAILDVVTGL 341 Query: 285 DAEGSTNGGAGLELAYQQATKGFIKGGINRILLATDGDFNVGIDDPKSIESMVKKQRESG 344 A G TNG G++ AYQ A F+ GG+NR++L TDGDFNVG+ D + ++++QR+S Sbjct: 342 QANGGTNGEGGIKKAYQFARDTFLDGGVNRVILCTDGDFNVGVVDNGELVKLIEEQRKSK 401 Query: 345 VTLSTFGVGNSNYNEAMMVRIADVGNGNYSYIDTLSEAQKVLNSEMRQMLITVAKDVKAQ 404 V L+ G G NY + + +A+ GNG+++YIDTL EA+KV E L+ VAKDVK Q Sbjct: 402 VFLTVLGYGMGNYKDDRLKELANHGNGHHAYIDTLDEAKKVF-VEQGGALVCVAKDVKFQ 460 Query: 405 IEFNPAWVTEYRQIGYEKRQLRVEHFNNDNVDAGDIGAGKHITLLFELTLNGQKAS---I 461 I+FNPA V YR +GYE R L+ E F ND DAGD+G+G +T+L+E+ G K + Sbjct: 461 IDFNPAKVNAYRLVGYENRLLKDEDFKNDAKDAGDVGSGHQVTVLYEIVPPGVKVDLPEV 520 Query: 462 DKLRYAPDNKLAKSDKTKELAWLKIRWKYPQGKES-QLVEFPLGPTINAPSEDMRFRAAV 520 D +Y K ++ + E +K+R+K+P S +L G S+D RF AAV Sbjct: 521 DASKY--QKKDVPANASDEWLTVKMRYKHPDEDVSKELTAAHKGAVAKELSDDFRFAAAV 578 Query: 521 AAYGQKLRGSEYLNNTSWQQIKQWAQQAKGEDPQGYRAEFIRLIELADGVTDISQ 575 A++G LR S++ ++ + + AQ A G DP +R +F+ L+ A ++++ + Sbjct: 579 ASFGMLLRDSKFKGAMTYAGVLEEAQGALGADPNNHRKQFLELVRRAKELSNVQK 633 >UniRef50_Q21MJ3 von Willebrand factor, type A n=5 Tax=Proteobacteria RepID=Q21MJ3_SACD2 Length = 708 Score = 381 bits (978), Expect = e-104, Method: Compositional matrix adjust. Identities = 215/478 (44%), Positives = 298/478 (62%), Gaps = 19/478 (3%) Query: 101 GTARYQQFDDNPVKQVAQNPLATFSLDVDTGSYANVRRFLNQGLLPPPDAVRVEEIVNYF 160 G +++ ++N VK VA+ P++TFS+DVDT SY+ VRR LN G LP DA+R EE++NYF Sbjct: 236 GNDKFEHVEENSVKSVAEAPVSTFSIDVDTASYSFVRRQLNSGYLPEKDAIRAEELINYF 295 Query: 161 PSDWDIKDKQSIPASKPIPFAMRYELAPAPWNEQRTLLKVDILAKDRKSEELPASNLVFL 220 ++ +P+ PF + +PW + + L+ + + D ++ P +NLVFL Sbjct: 296 DYNY------PLPSDSTAPFKPNITVIDSPWAKGKKLVHIGLKGYDIAPDQKPRTNLVFL 349 Query: 221 IDTSGSMISDERLPLIQSSLKLLVKELREQDNIAIVTYAGDSRIALPSISGSHKAEINAA 280 +D SGSM S ++LPL++ S+++L+ L D +AIV YAG + L K +I +A Sbjct: 350 LDVSGSMNSQDKLPLVKQSMEMLLSTLNPDDTVAIVVYAGAAGTVLEPTPAKDKQKILSA 409 Query: 281 IDSLDAEGSTNGGAGLELAYQQATKGFIKGGINRILLATDGDFNVGIDDPKSIESMVKKQ 340 + L A GST GGAG+ LAY A F K +NR++LATDGDFNVG + ++++ V+++ Sbjct: 410 MQRLQAGGSTAGGAGIALAYDLAEANFDKKAVNRVILATDGDFNVGSTNNETLQGFVERK 469 Query: 341 RESGVTLSTFGVGNSNYNEAMMVRIADVGNGNYSYIDTLSEAQKVLNSEMRQMLITVAKD 400 RE G+ LS G G NYN+ +M +A GNG +YIDT+SEAQKVL E L +AKD Sbjct: 470 REKGIFLSVLGFGQGNYNDHLMQTLAQNGNGVAAYIDTVSEAQKVLVQEASSSLFPIAKD 529 Query: 401 VKAQIEFNPAWVTEYRQIGYEKRQLRVEHFNNDNVDAGDIGAGKHITLLFELTLNGQKAS 460 VK Q+EFNPA V EYR IGYE R L E FNND VDAGDIGAG +T ++E+T G A Sbjct: 530 VKIQVEFNPATVAEYRLIGYETRALNREDFNNDAVDAGDIGAGHTVTAIYEITPVGSSAV 589 Query: 461 -IDKLRYAPDNKLAKSDKTKELAWLKIRWKYPQGKESQLVEFPL---GPTINAPSEDMR- 515 ID+ RYA K AK+ E + K+R+K P S+L+E P+ P + P+E M+ Sbjct: 590 LIDESRYAQKEK-AKAPTNAEYGFFKLRYKLPSEDTSRLIEAPILQQQPLV--PAELMQE 646 Query: 516 --FRAAVAAYGQKLRGSEYLNNTSWQQIKQWAQQAKGEDPQGYRAEFIRLI---ELAD 568 F AVAAY QKL+GS +LN S+ I AQ +KG D GYR EF++L+ ELAD Sbjct: 647 VNFSVAVAAYAQKLKGSNFLNKYSYHDIIALAQASKGSDEYGYRTEFVQLVRKAELAD 704 >UniRef50_A4BJU7 Putative uncharacterized protein n=1 Tax=Reinekea blandensis MED297 RepID=A4BJU7_9GAMM Length = 555 Score = 366 bits (940), Expect = 1e-99, Method: Compositional matrix adjust. Identities = 197/457 (43%), Positives = 278/457 (60%), Gaps = 12/457 (2%) Query: 111 NPVKQVAQNPLATFSLDVDTGSYANVRRFLNQGLLPPPDAVRVEEIVNYFPSDWDIKDKQ 170 +P++QVA +P++TFS DVDT SY N RRFLNQG+ PP D++RVEE +NYF D+ + Sbjct: 98 SPIRQVATDPVSTFSTDVDTASYTNARRFLNQGMRPPADSIRVEEFINYF--DYALP--- 152 Query: 171 SIPASKPIPFAMRYELAPAPWNEQRTLLKVDILAKDRKSEELPASNLVFLIDTSGSMISD 230 P + P + E PWN Q L++V + + + LP NLVFL+D SGSM S Sbjct: 153 -APDTTNTPIQISTERTQTPWNPQTELVRVSLQSYRSDFKTLPPLNLVFLLDVSGSMNSP 211 Query: 231 ERLPLIQSSLKLLVKELREQDNIAIVTYAGDSRIALPSISGSHKAEINAAIDSLDAEGST 290 ++LPL+Q S LLV +LR QD +AI YAG S + L SG KA+IN AI+ L A G T Sbjct: 212 DKLPLMQRSFNLLVSQLRPQDRVAIAVYAGQSGVVLEPTSGDQKAQINQAINQLRAGGGT 271 Query: 291 NGGAGLELAYQQATKGFIKGGINRILLATDGDFNVGIDDPKSIESMVKKQRESGVTLSTF 350 +G AG+ LAY A ++ GINRI + TDGDFNVG ++++++++RE+GV LS Sbjct: 272 HGSAGIHLAYDLAQANYLPDGINRIFIGTDGDFNVGTTSLTELKALIERKREAGVFLSVL 331 Query: 351 GVGNSNYNEAMMVRIADVGNGNYSYIDTLSEAQKVLNSEMRQMLITVAKDVKAQIEFNPA 410 G G NYN+A+M +++ GNG Y+D+ EA+K+ +++ L TVAKDVK QIEFNPA Sbjct: 332 GFGTGNYNDALMEELSNHGNGTAYYLDSYQEARKLFATQLAATLQTVAKDVKIQIEFNPA 391 Query: 411 WVTEYRQIGYEKRQLRVEHFNNDNVDAGDIGAGKHITLLFELTLNGQKASI-DKLRYAPD 469 V EYR IGY+ R L E FNND +DAG++G+G +T L+E+ + D LRY D Sbjct: 392 QVAEYRLIGYDNRLLAREDFNNDAIDAGEMGSGHAVTALYEIVRRDSEFRFSDPLRYQDD 451 Query: 470 NKLAKSDKT-KELAWLKIRWKYPQGKESQLVEFPLGPT-INAPSEDMRFRAAVAAYGQKL 527 + SD E+A++K R+K P S+L+ + T + + S+ VA + + L Sbjct: 452 D---LSDTVGGEIAFVKARYKLPDEAHSRLLSQAITDTPMQSSSQRQALAIGVAGFAEIL 508 Query: 528 RGSEYLNNTSWQQIKQWAQQAKGEDPQGYRAEFIRLI 564 RGS YL + S + + ED GYR E + L+ Sbjct: 509 RGSPYLRDWSINDAIDYIGPSLQEDRWGYRQELVTLM 545 >UniRef50_C0YQB8 von Willebrand factor, type A n=2 Tax=Flavobacteriaceae RepID=C0YQB8_9FLAO Length = 800 Score = 366 bits (939), Expect = 1e-99, Method: Compositional matrix adjust. Identities = 202/471 (42%), Positives = 297/471 (63%), Gaps = 22/471 (4%) Query: 105 YQQFDDNPVKQVAQNPLATFSLDVDTGSYANVRRFLNQGLLPPPDAVRVEEIVNYFPSDW 164 Y F +NP + PL+TFS+DVD SY+NVRR +N G + +AVR+EE+VNYF D+ Sbjct: 335 YDAFVENPFELTRNQPLSTFSIDVDNASYSNVRRMINNGQVVDKNAVRIEEMVNYFKYDY 394 Query: 165 DIKDKQSIPASKPIPFAMRYELAPAPWNEQRTLLKVDILAKDRKSEELPASNLVFLIDTS 224 ++ PF++ E + APWN + LLK+ + K+ ++LPASNLVFLID S Sbjct: 395 PQPKNEN-------PFSINTEYSDAPWNPKHKLLKIGLQGKNLPMDKLPASNLVFLIDVS 447 Query: 225 GSMISDERLPLIQSSLKLLVKELREQDNIAIVTYAGDSRIALPSISGSHKAEINAAIDSL 284 GSM + +LPL++SS K+L+ +LR +D + IV YAG + + LP S K +I A+D L Sbjct: 448 GSMSDENKLPLLKSSFKVLLNQLRPKDKVGIVVYAGSAGMVLPPTSAGEKDKIIEALDRL 507 Query: 285 DAEGSTNGGAGLELAYQQATKGFIKGGINRILLATDGDFNVGIDDPKSIESMVKKQRESG 344 A GST GGAG+ELAY+ A + F+K G NR+++ATDGDFNVG ++++++ +R+SG Sbjct: 508 QAGGSTAGGAGIELAYKLAQENFVKEGNNRVIIATDGDFNVGTSSISDLKTLIEDRRKSG 567 Query: 345 VTLSTFGVGNSNYNEAMMVRIADVGNGNYSYIDTLSEAQKVLNSEMRQMLITVAKDVKAQ 404 V L+ G G NY + + +AD GNGNY+YID + EA K L E + +AKD+K Q Sbjct: 568 VFLTCLGFGMGNYKDNTLETLADKGNGNYAYIDNMQEANKFLGKEFAGSMYAIAKDMKIQ 627 Query: 405 IEFNPAWVTEYRQIGYEKRQLRVEHFNNDNVDAGDIGAGKHITLLFELTLNGQKASIDKL 464 IEFNP +V YR IGYE R+L+ E F ND +DAG++G+G +T L+E+ A+++ Sbjct: 628 IEFNPEYVKSYRLIGYENRKLKNEDFTNDKIDAGELGSGHTVTALYEVI----PANVNS- 682 Query: 465 RYAP---DNKLAKSDKTK----ELAWLKIRWKYPQGKESQLVEFPLGPT---INAPSEDM 514 +AP D K +++ +K ELA +K R+K P G S+ + + + I++ S D Sbjct: 683 DFAPKESDLKYSQNTSSKGFGDELATIKFRYKKPDGDTSREITQVVKNSDNRISSASPDF 742 Query: 515 RFRAAVAAYGQKLRGSEYLNNTSWQQIKQWAQQAKGEDPQGYRAEFIRLIE 565 +F ++VA +G LR SE + I+ A+Q K +D +GYR+EFIRLIE Sbjct: 743 KFASSVAWFGLVLRNSELITKKDLSDIENLAKQGKNKDEEGYRSEFIRLIE 793 >UniRef50_A1ZU67 Von Willebrand factor type A domain protein n=1 Tax=Microscilla marina ATCC 23134 RepID=A1ZU67_9SPHI Length = 552 Score = 354 bits (909), Expect = 4e-96, Method: Compositional matrix adjust. Identities = 194/468 (41%), Positives = 295/468 (63%), Gaps = 13/468 (2%) Query: 109 DDNPVKQVAQNPLATFSLDVDTGSYANVRRFLNQGLLPPPDAVRVEEIVNYFPSDWDIKD 168 ++N V PL+TFS+DVD SY+ R+ +N G LP +VR+EE +NYF + + Sbjct: 91 NENTFLSVKTAPLSTFSIDVDNASYSRARKSINNGQLPSTSSVRLEEFINYFNYQYKQPE 150 Query: 169 KQSIPASKPIPFAMRYELAPAPWNEQRTLLKVDILAKDRKSEELPASNLVFLIDTSGSMI 228 Q PF++ E+A PWN + L+ + + K S +L SNLVFLID SGSM Sbjct: 151 GQH-------PFSVNTEVAKCPWNPKNHLVHIGLQGKRLDSRKLKLSNLVFLIDVSGSMS 203 Query: 229 SDERLPLIQSSLKLLVKELREQDNIAIVTYAGDSRIALPSISGSHKAEINAAIDSLDAEG 288 + ++LPL++ + K+LV L E+D +AIV YAG++ + LP+ G+ K +I A+D L + G Sbjct: 204 APDKLPLLRKAFKMLVNNLGEEDRVAIVVYAGNAGLVLPATQGTDKQKIMEALDKLQSGG 263 Query: 289 STNGGAGLELAYQQATKGFIKGGINRILLATDGDFNVGIDDPKSIESMVKKQRESGVTLS 348 ST GGAG++LAY+ A + FIK G NRI+LATDGDFN+G ++++++++++R+ GV ++ Sbjct: 264 STAGGAGIKLAYKIAKQNFIKEGNNRIILATDGDFNLGASSDQAMQNLIEEKRKEGVFIT 323 Query: 349 TFGVGNSNYNEAMMVRIADVGNGNYSYIDTLSEAQKVLNSEMRQMLITVAKDVKAQIEFN 408 G+G NY ++ M IAD GNGNY Y+D L+EA KV +++ L T+AKDVK Q+EFN Sbjct: 324 VLGLGMGNYRDSKMEIIADKGNGNYYYLDNLNEAYKVFGKDLKGTLFTIAKDVKIQVEFN 383 Query: 409 PAWVTEYRQIGYEKRQLRVEHFNNDNVDAGDIGAGKHITLLFELTL--NGQKASIDKLRY 466 A V YR IGYE R L F +D DAG+IGAG +T L+E+TL N Q ++D+ Sbjct: 384 SAVVKSYRLIGYENRLLANRDFRDDTKDAGEIGAGHTVTALYEVTLHSNPQTVAVDQ-NQ 442 Query: 467 APDNKLAKSDKTKELAWLKIRWKYPQGK---ESQLVEFPLGPTINAPSEDMRFRAAVAAY 523 P N A ++L +++R+K P+G E+ + +++ S + RF AAVA++ Sbjct: 443 IPANFQATQFNNQQLMNVRLRYKKPEGSTGIETSQIIAANHQSVDETSHNFRFSAAVASF 502 Query: 524 GQKLRGSEYLNNTSWQQIKQWAQQAKGEDPQGYRAEFIRLIELADGVT 571 G L+ S+Y +T++Q + A+ +KG+D YRAEFI L++ A +T Sbjct: 503 GMLLKNSQYKGSTTFQTVLTLAKGSKGKDMNQYRAEFIDLVQKASQIT 550 >UniRef50_D0XSR4 von Willebrand factor type A n=1 Tax=Brevundimonas subvibrioides ATCC 15264 RepID=D0XSR4_9CAUL Length = 625 Score = 354 bits (908), Expect = 7e-96, Method: Compositional matrix adjust. Identities = 191/479 (39%), Positives = 280/479 (58%), Gaps = 18/479 (3%) Query: 102 TARYQQFDDNPVKQVAQNPLATFSLDVDTGSYANVRRFLNQGLLPPPDAVRVEEIVNYFP 161 T RY NPV++VA P++TFS+DVDT +YANVRRF+++G PP DAVRVEE++NYF Sbjct: 145 TERYPDATPNPVRRVADEPVSTFSIDVDTAAYANVRRFISEGQTPPRDAVRVEEMINYFD 204 Query: 162 SDWDIKDKQSIPASKPIPFAMRYELAPAPWNEQRTLLKVDILAKDRKSEELPAS-----N 216 + + P PFA+ +A +PW+ I+ + ELPA N Sbjct: 205 YGY------ARPGRADEPFAVSTAVAASPWSANAGAGGRQIVHIGLQGYELPAGERRPLN 258 Query: 217 LVFLIDTSGSMISDERLPLIQSSLKLLVKELREQDNIAIVTYAGDSRIALPSISGSHKAE 276 L F++D SGSM S ++L L Q ++ L++ LR +D +A+ YA D A+ GS K + Sbjct: 259 LTFMVDVSGSMQSPDKLGLAQQTMNLIIDRLRPEDRVAVTYYASDVGTAVGPTPGSEKLK 318 Query: 277 INAAIDSLDAEGSTNGGAGLELAYQQATKGFIKGGINRILLATDGDFNVGIDDPKSIESM 336 + A+ +L+A GST G G+ AY+QA F +NRIL+ TDGDFNVG+ D + +E Sbjct: 319 LRCAVAALNAGGSTAGAQGMVNAYEQAEAAFSPDKVNRILMFTDGDFNVGVTDDRRLEDY 378 Query: 337 VKKQRESGVTLSTFGVGNSNYNEAMMVRIADVGNGNYSYIDTLSEAQKVLNSEMRQMLIT 396 V +R +G+ LS +G G NY +A M IA GNG +Y+D L EA+ + + Sbjct: 379 VADKRGTGIYLSVYGFGRGNYQDARMQTIAQAGNGVAAYVDDLDEARCLFGPAFDRGAFP 438 Query: 397 VAKDVKAQIEFNPAWVTEYRQIGYEKRQLRVEHFNNDNVDAGDIGAGKHITLLFELTLNG 456 +A DVK Q+EFNPA V+EYR IGYE R L E F ND +DAG++G+G +T L+E+T G Sbjct: 439 IADDVKIQVEFNPARVSEYRLIGYETRLLNEEDFANDAIDAGEVGSGASVTALYEITPVG 498 Query: 457 QKASIDKLRYAPDNKL-AKSDKTKELAWLKIRWKYPQGKESQLVEFPLGPTINAPS---- 511 + I + RY + A D T E+ ++++R+K P S+L++ P+ T + P Sbjct: 499 GASQIPERRYEANRAGDAGGDPTGEIGFVQVRYKLPGQPTSRLIQQPISGTTDGPGSARL 558 Query: 512 -EDMRFRAAVAAYGQKLRGSEYLN-NTSWQQIKQWAQQAKGEDPQGYRAEFIRLIELAD 568 E R+ AVA +GQ+LRG ++ + I AQ +GEDP G RA F++++ A+ Sbjct: 559 PEATRWAMAVAGFGQRLRGDPWMGADFDTAAILDLAQGVRGEDPYGDRAAFVQMVRAAE 617 >UniRef50_A9KS19 von Willebrand factor type A n=1 Tax=Clostridium phytofermentans ISDg RepID=A9KS19_CLOPH Length = 551 Score = 351 bits (901), Expect = 3e-95, Method: Compositional matrix adjust. Identities = 191/470 (40%), Positives = 280/470 (59%), Gaps = 15/470 (3%) Query: 102 TARYQQFDDNPVKQVAQNPLATFSLDVDTGSYANVRRFLNQGLLPPPDAVRVEEIVNYFP 161 T Y + + +PL+TFS DVDT SY+N+RR L +G AVR+EE++NYF Sbjct: 89 TEEYNAVIEQGYQSTKNHPLSTFSADVDTASYSNIRRMLKEGRRVDTGAVRIEEMLNYFN 148 Query: 162 SDWDIKDKQSIPASKPIPFAMRYELAPAPWNEQRTLLKVDILAKDRKSEELPASNLVFLI 221 D+ + + S PF + EL+ PWN L I + + SNLVFLI Sbjct: 149 YDYKLPEGDS-------PFGITTELSDCPWNPDTKLFLAGIQTEKIDFSKSAPSNLVFLI 201 Query: 222 DTSGSMISDERLPLIQSSLKLLVKELREQDNIAIVTYAGDSRIALPSISGSHKAEINAAI 281 D SGSM+ +++LPL+Q + LL + L E+D I+IVTYAG+ + L G+ K +I AI Sbjct: 202 DVSGSMMDEDKLPLVQRAFLLLTENLTEKDRISIVTYAGNDTVVLSGAKGNQKEKIQNAI 261 Query: 282 DSLDAEGSTNGGAGLELAYQQATKGFIKGGINRILLATDGDFNVGIDDPKSIESMVKKQR 341 L+A GST G G+E AYQ A + +I+GG NR++LATDGD NVG+ + ++++++R Sbjct: 262 TELEAGGSTFGSKGIETAYQLAMENYIEGGNNRVILATDGDLNVGVTSESELTNLIEEKR 321 Query: 342 ESGVTLSTFGVGNSNYNEAMMVRIADVGNGNYSYIDTLSEAQKVLNSEMRQMLITVAKDV 401 +SGV LS G G N + M +AD GNGNY+YID+L EA+KVL EM L+TVA DV Sbjct: 322 KSGVALSVLGFGTGNIKDNKMEALADHGNGNYAYIDSLMEARKVLVEEMGATLVTVAGDV 381 Query: 402 KAQIEFNPAWVTEYRQIGYEKRQLRVEHFNNDNVDAGDIGAGKHITLLFELTLNGQKASI 461 K Q+EFNPA V YR +GY+ R L E FN+D DAG++GAG +T+L+EL L K I Sbjct: 382 KFQVEFNPAKVKGYRLLGYDNRLLATEDFNDDTKDAGEVGAGHSVTVLYELVLEDSKMEI 441 Query: 462 --DKLRYAPDNKLAKSDKTKELAWLKIRWKYPQGKESQLVEFPLGPT--INAPSEDMRFR 517 +L+Y ++ EL + IR+K P +S L+ P+G + ++++ F Sbjct: 442 PETELKYTT---TEPTNMVDELLTVNIRYKKPGKDKSILMSEPVGINQLADTRTDNLAFA 498 Query: 518 AAVAAYGQKLRGSEYLNNTSWQQIKQWAQQAKGEDPQGYRAEFIRLIELA 567 AVA +G L+ SEY + ++ ++ ++ + + YRAEF +L++LA Sbjct: 499 TAVAEFGLLLKDSEYKGDATFSKVLSRLEETNYKQDE-YRAEFYQLVKLA 547 >UniRef50_B0UJ22 von Willebrand factor type A n=1 Tax=Methylobacterium sp. 4-46 RepID=B0UJ22_METS4 Length = 654 Score = 351 bits (901), Expect = 4e-95, Method: Compositional matrix adjust. Identities = 211/524 (40%), Positives = 290/524 (55%), Gaps = 27/524 (5%) Query: 58 AAKALAQQEVQQYSDKQALQGRLQEAP----TFARAAKAKATHIANP-GTARYQQFDDNP 112 A +A A + +Q + R + +P ARAA A + P G R+ + Sbjct: 119 AGEADAGRTLQAFRSSGGF--RFEASPRGPAAMARAAGETAPVPSEPVGRDRFANAPEGG 176 Query: 113 VKQVAQNPLATFSLDVDTGSYANVRRFLNQGLLPPPDAVRVEEIVNYFPSDWDIKDKQSI 172 + + P++T SL VDT SY VR LN+ LPPP AVR EE++NYFP + Sbjct: 177 FRITREAPVSTVSLGVDTASYGIVRDALNRNHLPPPAAVRTEELINYFPYAY------PA 230 Query: 173 PASKPIPFAMRYELAPAPWNEQRTLLKVDILAKDRKSEELPASNLVFLIDTSGSMISDER 232 PAS PF + + P+PW E R LL + I E P +NLVFL+DTSGSM + R Sbjct: 231 PASPDAPFRVTASVFPSPWAEGRKLLHIGIRGYAVAPAERPPANLVFLVDTSGSMAAPNR 290 Query: 233 LPLIQSSLKLLVKELREQDNIAIVTYAGDSRIALPSISGSHKAEINAAIDSLDAEGSTNG 292 LPL++ SL +L+ L +D +A+V YAG+ L I AAI++L A GST G Sbjct: 291 LPLVKQSLAMLLTTLDARDRVALVAYAGEVGTVLEPTPAGEAGRILAAIETLQAHGSTAG 350 Query: 293 GAGLELAYQQATKGFIKGGINRILLATDGDFNVGIDDPKSIESMVKKQRESGVTLSTFGV 352 G G+ AY A + F +NR++LATDGDFNVGI + V ++R G+ LS G Sbjct: 351 GEGIRQAYALAARHFDPKAVNRVILATDGDFNVGITGRDELTGFVARERRKGIFLSVLGF 410 Query: 353 GNSNYNEAMMVRIADVGNGNYSYIDTLSEAQKVLNSEMRQMLITVAKDVKAQIEFNPAWV 412 G N N+A+M +A GNG ++IDT EA+KVL E LI +A+DVK Q+EFNPA V Sbjct: 411 GMGNLNDALMQALAKDGNGVAAHIDTAQEARKVLVEEATSTLIPIARDVKIQVEFNPATV 470 Query: 413 TEYRQIGYEKRQLRVEHFNNDNVDAGDIGAGKHITLLFELTLNGQKASIDKLRYAPDNKL 472 EYR IGYE R L F ND DAG++G+G+ +T L+E+ K LRYA ++ Sbjct: 471 AEYRLIGYETRPLDRADFANDEADAGEVGSGQTVTALYEIVPADGKRVTGDLRYA-PHEA 529 Query: 473 AKSDKTKELAWLKIRWKYPQGKESQLVEFPLGPTINAPSEDMRFRAA---------VAAY 523 A + +++ A + IR+K P +ES L+E P+GP E RF A VAA+ Sbjct: 530 APAPASRDYAHVAIRFKRPDARESTLIETPVGPE----GEAARFAEAPQEARFAAAVAAF 585 Query: 524 GQKLRGSEYLNNTSWQQIKQWAQQAKGEDPQGYRAEFIRLIELA 567 GQ LRG ++ S + + A A+G+DP GYRAEF+ L+ A Sbjct: 586 GQILRGGKHTGRFSLDDVIRIAAPARGDDPFGYRAEFLGLVRAA 629 >UniRef50_B4WCU1 von Willebrand factor type A domain protein n=2 Tax=Caulobacteraceae RepID=B4WCU1_9CAUL Length = 613 Score = 345 bits (884), Expect = 4e-93, Method: Compositional matrix adjust. Identities = 191/478 (39%), Positives = 275/478 (57%), Gaps = 21/478 (4%) Query: 102 TARYQQFDDNPVKQVAQNPLATFSLDVDTGSYANVRRFLNQGLLPPPDAVRVEEIVNYFP 161 T Y NPVK+ A P++TFS+DVDT +Y+NVRRF+++G PP DAVRVEE++N F Sbjct: 136 TETYPDATPNPVKRTADQPVSTFSIDVDTAAYSNVRRFIDEGRSPPADAVRVEELINAFD 195 Query: 162 SDWDIKDKQSIPASKPIPFAMRYELAPAPWNEQRTLLKVDILAKDRKSEELPAS-----N 216 + + P S PFA+ + +PW + I+ + ELP N Sbjct: 196 YGY------ARPTSLARPFAITTAVVASPWAPRTERGGRQIVHIGLQGYELPQGEQRPLN 249 Query: 217 LVFLIDTSGSMISDERLPLIQSSLKLLVKELREQDNIAIVTYAGDSRIALPSISGSHKAE 276 L FL+D SGSM S ++L L + ++ L + LR QD +++ YA + L G K + Sbjct: 250 LTFLVDVSGSMRSPDKLDLAKQAMNLAIDRLRPQDTLSVTYYAEGAGTTLQPTPGDQKLK 309 Query: 277 INAAIDSLDAEGSTNGGAGLELAYQQATKGFIKGGINRILLATDGDFNVGIDDPKSIESM 336 + A+ SL A G T G G+ AY QA F + +NRIL+ TDGDFNVG+ D K +E Sbjct: 310 MRCAVASLRASGGTAGATGMTNAYDQAQASFARDKVNRILMFTDGDFNVGVTDNKRLEDY 369 Query: 337 VKKQRESGVTLSTFGVGNSNYNEAMMVRIADVGNGNYSYIDTLSEAQKVLNSEMRQMLIT 396 V ++R +GV LS +G G NY +A M IA GNG +Y+ L +A+++ + Sbjct: 370 VAEKRGTGVYLSVYGFGRGNYQDARMQTIAQAGNGVAAYVGDLRDARRLFGPMFDKGAFP 429 Query: 397 VAKDVKAQIEFNPAWVTEYRQIGYEKRQLRVEHFNNDNVDAGDIGAGKHITLLFELTLNG 456 +A DVK Q+EFNPA V E+R IGYE R L F ND +DAG++G+G +T L+E+T G Sbjct: 430 IADDVKIQVEFNPARVAEWRLIGYETRLLNEADFANDRIDAGEVGSGASVTALYEITPVG 489 Query: 457 QKASIDKLRYAPDNKL--AKSDKTKELAWLKIRWKYPQGKESQLVEFPLGPTINA----P 510 + + RY PDN++ D E+ ++++R+K P G S L++ PL T A P Sbjct: 490 GPTQVPERRY-PDNRIGVGGGDPNGEIGFIQVRYKQPGGSRSDLIQQPL--TSRAAGAQP 546 Query: 511 SEDMRFRAAVAAYGQKLRGSEYLN-NTSWQQIKQWAQQAKGEDPQGYRAEFIRLIELA 567 E R+ AVAA+GQKLR +++ + W Q+ AQ A+GEDP G RAEF++L+ A Sbjct: 547 PEATRWALAVAAFGQKLRNDPWMSADYGWDQVLAQAQGARGEDPWGDRAEFVQLVRAA 604 >UniRef50_C0CVB5 Putative uncharacterized protein n=1 Tax=Clostridium asparagiforme DSM 15981 RepID=C0CVB5_9CLOT Length = 556 Score = 343 bits (881), Expect = 9e-93, Method: Compositional matrix adjust. Identities = 215/590 (36%), Positives = 314/590 (53%), Gaps = 67/590 (11%) Query: 2 RNKNI-IMLLMSSLI---LSGCGPQPENKESQQQQPSTPTEQQVLAAQQAAIKEAEQSAA 57 R K + I LLM +L+ LSGCG + + ++ TE +V +AE + Sbjct: 3 RGKQLTIGLLMCALLAGLLSGCG-------AGGGKTASATEAEV---------KAEAGSY 46 Query: 58 AAKALAQQEVQQYSDKQALQGRLQEAPTFARAAKAKATHIANPGTARYQQFDDNPVKQVA 117 A++ +A Q + +A E P + T Y +N VA Sbjct: 47 ASETMAAQSQWDGAVMEA------EGPPLSH------------NTEEYNYIAENAFLAVA 88 Query: 118 QNPLATFSLDVDTGSYANVRRFLNQGLLPPPDAVRVEEIVNYFPSDWDIKDKQSIPASKP 177 PL+TF+ DVDT SYAN+RR + +G P DAVR+EE++NYF D+ ++ Sbjct: 89 NAPLSTFAADVDTASYANLRRKILEGNEVPADAVRIEEMLNYFTYDYP-------EPTED 141 Query: 178 IPFAMRYELAPAPWNEQRTLLKVDILAKDRKSEELPASNLVFLIDTSGSMISDERLPLIQ 237 PF++ + PWNE LL++ + A+ E SNLVFLID SGSM S ++L L++ Sbjct: 142 EPFSVTTYIGDCPWNENHKLLQIGLQAEKPDLENQKPSNLVFLIDVSGSMESADKLGLVK 201 Query: 238 SSLKLLVKELREQDNIAIVTYAGDSRIALPSISGSHKAEINAAIDSLDAEGSTNGGAGLE 297 + LL + LR +D ++IVTYA + L +SG KA I AI++L A GST+G G+E Sbjct: 202 RAFLLLTENLRPEDTVSIVTYASSDTVVLDGVSGEEKAAIMTAIENLTAGGSTDGSKGIE 261 Query: 298 LAYQQATKGFIKGGINRILLATDGDFNVGIDDPKSIESMVKKQRESGVTLSTFGVGNSNY 357 AY+ A + F K G NR++LATDGD N+G+ + +++K++ESGV LS G G N Sbjct: 262 TAYRLAEEHFQKDGNNRVILATDGDLNLGLTSEGDLTRLIQKKKESGVFLSVMGFGTGNI 321 Query: 358 NEAMMVRIADVGNGNYSYIDTLSEAQKVLNSEMRQMLITVAKDVKAQIEFNPAWVTEYRQ 417 + M +AD GNG Y+Y+D+L EA++VL E+ L TVAKDVK Q+EFNPA V YR Sbjct: 322 KDNKMEALADNGNGQYAYVDSLMEAKRVLVEELGGTLFTVAKDVKLQVEFNPAKVKGYRL 381 Query: 418 IGYEKRQLRVEHFNNDNVDAGDIGAGKHITLLFELTLNGQKASID--KLRYAPDNKLAKS 475 IGYE R + F++D D G+IGAG +T L+EL G + +L+Y N A Sbjct: 382 IGYENRLMEARDFDDDAKDGGEIGAGHRVTALYELVPAGSDEDLGEVELKYGAGNVAAAE 441 Query: 476 D------------------KTKELAWLKIRWKYPQGKESQLVEFPL--GPTINAPSEDMR 515 + E LK+R+K P G++S+L+E+P+ DMR Sbjct: 442 NGENGGAEARPAEGAPAPGADSEWLTLKVRYKEPDGEQSRLLEYPVDDSAVCRELPPDMR 501 Query: 516 FRAAVAAYGQKLRGSEYLNNTSWQQIKQWAQQAKGEDPQGYRAEFIRLIE 565 F + VA G LR SEY +S++ I ++ G Y+ EF+ L++ Sbjct: 502 FASCVAQTGMLLRDSEYAGGSSYKAIAAELERIDGLRGDPYKEEFLYLVK 551 >UniRef50_UPI0001C375AE von Willebrand factor type A n=1 Tax=Ruminococcus flavefaciens FD-1 RepID=UPI0001C375AE Length = 550 Score = 337 bits (864), Expect = 8e-91, Method: Compositional matrix adjust. Identities = 185/471 (39%), Positives = 272/471 (57%), Gaps = 13/471 (2%) Query: 104 RYQQFDDNPVKQVAQNPLATFSLDVDTGSYANVRRFLNQGLLPPPDAVRVEEIVNYFPSD 163 Y+ + + K PL+TFS DVDT SY NVRR + + P DAVR+EE +NYF D Sbjct: 85 EYKGYTEAGFKDTKSEPLSTFSADVDTASYTNVRRLIENRNIVPEDAVRIEEFINYFDYD 144 Query: 164 WDIKDKQSIPASKPIPFAMRYELAPAPWNEQRTLLKVDILAKDRKSEELPASNLVFLIDT 223 + + S F E+A PWN L+ V I K+ + +E P SNLVFLID+ Sbjct: 145 YPQPEDGS-------AFGRYVEIADCPWNRDHKLMMVGIQGKELQQQETPPSNLVFLIDS 197 Query: 224 SGSMISDERLPLIQSSLKLLVKELREQDNIAIVTYAGDSRIALPSISGSHKAEINAAIDS 283 SGSM S ++LPL+QS+ +L ++L + D I+IVTYAG S + L GS+ EI + S Sbjct: 198 SGSMNSYDKLPLVQSAFSMLAEQLDKNDRISIVTYAGSSAVLLDGEKGSNTDEILEQLYS 257 Query: 284 LDAEGSTNGGAGLELAYQQATKGFIKGGINRILLATDGDFNVGIDDPKSIESMVKKQRES 343 + A GSTNG G++ AY+ A + FIKGG NR++LATDGD NVG + + +++ +R++ Sbjct: 258 ITASGSTNGEGGIKTAYELAEEHFIKGGNNRVILATDGDLNVGASSEEELTRLIETKRDN 317 Query: 344 GVTLSTFGVGNSNYNEAMMVRIADVGNGNYSYIDTLSEAQKVLNSEMRQMLITVAKDVKA 403 G+ LS G G NY +A M +AD GNGN+SYID+ EA++VL EM L T+AKDVK Sbjct: 318 GIYLSVLGFGEGNYKDARMEALADNGNGNFSYIDSEDEAERVLVQEMSGTLYTIAKDVKI 377 Query: 404 QIEFNPAWVTEYRQIGYEKRQLRVEHFNNDNVDAGDIGAGKHITLLFELTL--NGQKASI 461 Q+EFNP+ V+ YR IGY+ R + E F +D DAG++G+G +T L+E+ + G Sbjct: 378 QVEFNPSQVSSYRLIGYDNRLMNAEDFLDDTKDAGEVGSGHSVTALYEIEMADTGDSYHG 437 Query: 462 DKLRYAP--DNKLAKSDKTKELAWLKIRWKYPQGKESQLVE--FPLGPTINAPSEDMRFR 517 L +A D+ A+++ E+ L I +K P G E++ + + ++PS M+ Sbjct: 438 VPLEFASEHDSIPAENNGRSEICKLSIAYKTPVGNENRNTSDLYSMENYSSSPSNSMKLA 497 Query: 518 AAVAAYGQKLRGSEYLNNTSWQQIKQWAQQAKGEDPQGYRAEFIRLIELAD 568 A A +G LR S+Y + + + Q K D + +++ AD Sbjct: 498 QAAAGFGMVLRNSDYKGDADFDTVLDILDQLKVNDNDKINELYGLILDAAD 548 >UniRef50_B4DA43 von Willebrand factor type A n=1 Tax=Chthoniobacter flavus Ellin428 RepID=B4DA43_9BACT Length = 883 Score = 333 bits (854), Expect = 1e-89, Method: Compositional matrix adjust. Identities = 165/366 (45%), Positives = 237/366 (64%), Gaps = 13/366 (3%) Query: 105 YQQFDDNPVKQVAQNPLATFSLDVDTGSYANVRRFLNQGLLPPPDAVRVEEIVNYFPSDW 164 + +N V +NPL+TFS+DVDT SYA VRR+LN LPP AVR+EE++NYFP D+ Sbjct: 318 FDTLTENAFLNVPENPLSTFSIDVDTASYAIVRRYLNDNHLPPTGAVRIEELLNYFPYDY 377 Query: 165 DIKDKQSIPASKPIPFAMRYELAPAPWNEQRTLLKVDILAKDRKSEELPASNLVFLIDTS 224 + PF+ E+A PW + L++V + ++ +E P SNLVFLID S Sbjct: 378 PQPQGAA-------PFSATMEVATCPWAPEHRLVRVGLKGREIPKDERPPSNLVFLIDVS 430 Query: 225 GSMISDERLPLIQSSLKLLVKELREQDNIAIVTYAGDSRIALPSISGSHKAEINAAIDSL 284 GSM +LPL+Q LLV++L +D ++IVTYA +++ L K + AID L Sbjct: 431 GSMNMPNKLPLLQKCFSLLVEQLGPKDRVSIVTYASGTKLVLEPTQ--DKEAMQTAIDGL 488 Query: 285 DAEGSTNGGAGLELAYQQATKGFIKGGINRILLATDGDFNVGIDDPKSIESMVKKQRESG 344 A G T+G +G++LAY+ A + FI GG NR++LATDGD+N+GI + + SM+ ++ +SG Sbjct: 489 HAGGGTHGSSGIDLAYRMAQQSFIPGGTNRVILATDGDWNIGITNQSELLSMITRKAKSG 548 Query: 345 VTLSTFGVGNSNYNEAMMVRIADVGNGNYSYIDTLSEAQKVLNSEMRQMLITVAKDVKAQ 404 V L+ G G N ++M+V++AD GNG+Y+YIDT EA+KV ++ L+T+AKDVK Q Sbjct: 549 VFLTVLGFGLDNLKDSMLVKLADHGNGHYAYIDTEQEARKVFVDQLSSTLVTIAKDVKIQ 608 Query: 405 IEFNPAWVTEYRQIGYEKRQLRVEHFNNDNVDAGDIGAGKHITLLFELTLNGQK----AS 460 +EFNP V+ YR +GYEKR L E FNND DAG+IGAG +T L+E+ G++ A Sbjct: 609 VEFNPVQVSSYRLVGYEKRLLAKEDFNNDKKDAGEIGAGHTVTALYEVVPVGKERPEIAK 668 Query: 461 IDKLRY 466 +D+L+Y Sbjct: 669 VDELKY 674 Score = 91.3 bits (225), Expect = 1e-16, Method: Compositional matrix adjust. Identities = 43/92 (46%), Positives = 60/92 (65%), Gaps = 3/92 (3%) Query: 479 KELAWLKIRWKYPQGKESQLVEFPL---GPTINAPSEDMRFRAAVAAYGQKLRGSEYLNN 535 KE+ LK+R+K P G++S+L+EFPL G T S D F AAVA+YG LR S+YL Sbjct: 787 KEMLTLKLRYKEPDGEKSKLLEFPLTDPGTTWEKSSPDFHFAAAVASYGMLLRDSKYLGE 846 Query: 536 TSWQQIKQWAQQAKGEDPQGYRAEFIRLIELA 567 +WQ + +WA++ G D GYR EF+ L++ A Sbjct: 847 ATWQSVVEWAREGLGADKHGYRTEFLSLLDRA 878 >UniRef50_A6GHE4 von Willebrand factor, type A n=1 Tax=Plesiocystis pacifica SIR-1 RepID=A6GHE4_9DELT Length = 785 Score = 329 bits (843), Expect = 2e-88, Method: Compositional matrix adjust. Identities = 178/448 (39%), Positives = 268/448 (59%), Gaps = 29/448 (6%) Query: 122 ATFSLDVDTGSYANVRRFLNQGLLPPPDAVRVEEIVNYFPSDWDIKDKQSIPASKPIPFA 181 +TFS+DVDT SYA+VR+ L G +P P +VR EE++NYF + + PFA Sbjct: 264 STFSIDVDTASYASVRQSLRNGWMPDPGSVRTEEMINYFDYGY-----VAPSGGAGAPFA 318 Query: 182 MRYELAPAPWNEQRTLLKVDILAKDR---KSEELPASNLVFLIDTSGSMISDERLPLIQS 238 + E+ P PW L+++ + A +++EL NLVFL+D SGSM S +LPLI+ Sbjct: 319 VHTEVGPCPWAPDHRLVQIGVQATRELPAQAQELRTRNLVFLLDVSGSMSSRGKLPLIKH 378 Query: 239 SLKLLVKELREQDNIAIVTYAGDSRIALPSISGSHKAEINAAIDSLDAEGSTNGGAGLEL 298 LV++L +D+++IV YAG + + LP SG K I A+D L+A G TNG AG+ Sbjct: 379 GFTQLVEQLGAEDHVSIVVYAGAAGVVLPPTSGDQKETILGALDRLEAGGGTNGSAGIVE 438 Query: 299 AYQQATKGFIKGGINRILLATDGDFNVGIDDPKSIESMVKKQRESGVTLSTFGVGNSNYN 358 AY+ A F+ GG+NR++L TDGDFNVG+ D ++ +++++RESGV LS GVG +Y+ Sbjct: 439 AYELAQANFVDGGVNRVILGTDGDFNVGLSDHDALVELIEQKRESGVFLSVLGVGG-HYD 497 Query: 359 EAMMVRIADVGNGNYSYIDTLSEAQKVLNSEMRQMLITVAKDVKAQIEFNPAWVTEYRQI 418 + +M ++AD GNGNY+++D EA+KVL E+ L T+AKDVK Q+ FNP VT++R I Sbjct: 498 DELMEQLADHGNGNYAFLDGKREAEKVLVEEIGGTLTTIAKDVKVQVAFNPEQVTKHRLI 557 Query: 419 GYEKRQLRVEHFNNDNVDAGDIGAGKHITLLFELTLNGQKASIDKLRYAPDNKLAKSDKT 478 Y+ R+L FN+D DAG+IG G ++T L+E+ +++ + Sbjct: 558 AYQNRRLAHRDFNDDTKDAGEIGVGHNVTALYEII-----------------PADEAEAS 600 Query: 479 KELAWLKIRWKYPQGKESQLVEFPL---GPTINAPSEDMRFRAAVAAYGQKLRGSEYLNN 535 + L L++R+K P G S V + G +++ S+D RF AAVA +G+ L G + Sbjct: 601 EALMSLELRYKKPDGHRSTKVTTSVRDAGRSLDQNSDDFRFAAAVAGFGESLAGRRPDAS 660 Query: 536 TSWQQIKQWAQQAKGEDPQGYRAEFIRL 563 ++ + AQ A GED + R EF+ L Sbjct: 661 WNYADTLELAQGALGEDARCLRHEFLEL 688 >UniRef50_A7C0I1 von Willebrand factor type A domain protein n=5 Tax=Bacteria RepID=A7C0I1_9GAMM Length = 367 Score = 326 bits (835), Expect = 2e-87, Method: Compositional matrix adjust. Identities = 184/364 (50%), Positives = 256/364 (70%), Gaps = 9/364 (2%) Query: 212 LPASNLVFLIDTSGSMISDERLPLIQSSLKLLVKELREQDNIAIVTYAGDSRIALPSISG 271 +P +NLVFL+D SGSM S+ +L L++S+LKLL +L E+D +++V YAG + + L G Sbjct: 1 MPPANLVFLVDVSGSMRSNHKLALLKSALKLLSNQLTEKDKVSLVVYAGAAGVVLEPTPG 60 Query: 272 SHKAEINAAIDSLDAEGSTNGGAGLELAYQQATKGFIKGGINRILLATDGDFNVGIDDPK 331 +IN A++ L A GST+G AG+ LAY A + FIK GINRILLATDGDFNVG D + Sbjct: 61 HQSVKINGALERLTAGGSTHGSAGIHLAYNLAEQAFIKNGINRILLATDGDFNVGTVDFE 120 Query: 332 SIESMVKKQRESGVTLSTFGVGNSNYNEAMMVRIADVGNGNYSYIDTLSEAQKVLNSEMR 391 +++++V+++R+SG++L+T G G NYN+ +M ++AD GNGNY+YIDTL+EAQKVL EM Sbjct: 121 ALKNLVEEKRKSGISLTTLGFGRGNYNDQLMEQLADAGNGNYAYIDTLNEAQKVLVDEMS 180 Query: 392 QMLITVAKDVKAQIEFNPAWVTEYRQIGYEKRQLRVEHFNNDNVDAGDIGAGKHITLLFE 451 L T+AKDVK QIEFNPA V EYR IGYE R L+ E F+ND VDAG+IGAG +T L+E Sbjct: 181 STLNTIAKDVKIQIEFNPAIVAEYRLIGYENRLLKREDFSNDKVDAGEIGAGHTVTALYE 240 Query: 452 LTLNGQKAS-IDKLRYAPDNKLAKS--DKTKELAWLKIRWKYPQGKESQLVEFPLG---- 504 + L G ++ LRY+ + + KS ++ ELA+L++R+K P SQL+E+P+ Sbjct: 241 MALVGSGGQRLESLRYSQNQDVPKSNDNQNNELAFLRLRYKAPNSDTSQLLEWPMMRQDI 300 Query: 505 -PTINAPSEDMRFRAAVAAYGQKLRGSEYLNNTSWQQIKQWAQQAKGEDPQGYRAEFIRL 563 T++ +E RF AAVAA+GQ+LRG +YL S+ I A+ A+G DP GYR E I+L Sbjct: 301 LETVDT-NERFRFAAAVAAFGQQLRGGKYLEQFSYDNILNLARDARGNDPFGYRGELIKL 359 Query: 564 IELA 567 + LA Sbjct: 360 VNLA 363 >UniRef50_A8SXC8 Putative uncharacterized protein n=2 Tax=Clostridiales RepID=A8SXC8_9FIRM Length = 612 Score = 317 bits (811), Expect = 1e-84, Method: Compositional matrix adjust. Identities = 184/452 (40%), Positives = 255/452 (56%), Gaps = 27/452 (5%) Query: 102 TARYQQFDDNPVKQVAQNPLATFSLDVDTGSYANVRRFLNQGLLPPPDAVRVEEIVNYFP 161 T Y +N PL+TF+ D DT SY+NVR ++ G LPP AVR+EE++NYF Sbjct: 121 TREYDSMTENGFVSTVDRPLSTFAADRDTASYSNVRSYIESGSLPPDGAVRIEEMLNYFT 180 Query: 162 SDWDIKDKQSIPASKPIPFAMRYELAPAPWNEQRTLLKVDILAKDRKSEELPASNLVFLI 221 D+ K P F++ E + PWN+ L+ V I + + SNLVFLI Sbjct: 181 YDYRKK-----PEDGE-KFSIYTEYSDCPWNKDTKLMMVGINTDEIDFGDKKPSNLVFLI 234 Query: 222 DTSGSMISDERLPLIQSSLKLLVKELREQDNIAIVTYAGDSRIALPSISGSHKAEINAAI 281 DTSGSM D +LPL+Q S +L + L E D ++IVTYAG+ + L GS + I+ A+ Sbjct: 235 DTSGSMYDDNKLPLVQQSFAMLAENLDENDRVSIVTYAGEDTVVLSGTPGSEQYTISEAL 294 Query: 282 DSLDAEGSTNGGAGLELAYQQATKGFIKGGINRILLATDGDFNVGIDDPKSIESMV-KKQ 340 ++ AEG TNGG + AY+ A K FI GG NR++LATDGD NVG+ + ++ +++ Sbjct: 295 SNMTAEGCTNGGDAIITAYELAEKNFINGGNNRVILATDGDLNVGLTSESDLVDLITEEK 354 Query: 341 RESGVTLSTFGVGNSNYNEAMMVRIADVGNGNYSYIDTLSEAQKVLNSEMRQMLITVAKD 400 +E+ + LS G G N + + +AD G+G+Y++ID+ EA+KVL EM L TVAKD Sbjct: 355 KENNIFLSVLGFGTDNLKDNKLEALADNGDGSYAFIDSAYEAKKVLVDEMGGTLNTVAKD 414 Query: 401 VKAQIEFNPAWVTEYRQIGYEKRQLRVEHFNNDNVDAGDIGAGKHITLLFELTLNGQK-- 458 VK Q+EFNP V YRQIGYE R L F ND VD G+IGAG +T+L+E+ G Sbjct: 415 VKFQLEFNPTNVKGYRQIGYENRALADADFANDAVDGGEIGAGHMVTVLYEIVPAGSDFE 474 Query: 459 ---------ASIDKLRYAPDNKLAKSDKTK-------ELAWLKIRWKYPQGKESQLVEFP 502 +I+++ A DK+ ELA + IR+K P G +S LV Sbjct: 475 VPAANHKYGENINQVNTAESTSQDLRDKSDSAENYAGELATVNIRYKDPDGDKSNLVSCV 534 Query: 503 L-GPTINAP-SEDMRFRAAVAAYGQKLRGSEY 532 + + N S DM +AVAAYG L+ SEY Sbjct: 535 VKTDSYNGGMSADMSAASAVAAYGMLLKNSEY 566 >UniRef50_Q1CVN5 von Willebrand factor type A domain protein n=2 Tax=Cystobacterineae RepID=Q1CVN5_MYXXD Length = 700 Score = 310 bits (794), Expect = 1e-82, Method: Compositional matrix adjust. Identities = 174/457 (38%), Positives = 253/457 (55%), Gaps = 40/457 (8%) Query: 99 NPGTARYQQFDDNPVKQVAQNPLATFSLDVDTGSYANVRRFLNQGLLPPPDAVRVEEIVN 158 +P +Q + NP + +TFS+D D+ SY R +L +G LP AVRVEE VN Sbjct: 226 SPFHMYFQGYGVNPTINTEEERFSTFSVDTDSASYTLTRAYLERGSLPNEQAVRVEEFVN 285 Query: 159 YFPSDWDIKDKQSIPASKPIPFAMRYELAPAPWNEQRTLLKVDILAKDRKSEELPASNLV 218 F D+ + S P F+++ E P+P + ++ V + A++ + S+LV Sbjct: 286 TF--DYGYAHQGSAP------FSVQVEGFPSPVRKGYHVVHVGVKAREVSRPQRKPSHLV 337 Query: 219 FLIDTSGSMISDERLPLIQSSLKLLVKELREQDNIAIVTYAGDSRIALPSISGSHKAEIN 278 F+ID SGSM + RL L++ +L LLV EL E+D ++IV Y +R+ L S H I Sbjct: 338 FVIDVSGSMNLENRLGLVKRALHLLVNELDERDQVSIVVYGSTARLVLEPTSAVHAHIIR 397 Query: 279 AAIDSLDAEGSTNGGAGLELAYQQATKGFIKGGINRILLATDGDFNVGIDDPKSIESMVK 338 AAIDSL EGSTN AGLE+ Y A ++GGINR++L +DG N G+ D SI ++ Sbjct: 398 AAIDSLHTEGSTNAQAGLEMGYSLAASHLVEGGINRVILCSDGVANTGLTDANSIWERIR 457 Query: 339 KQRESGVTLSTFGVGNSNYNEAMMVRIADVGNGNYSYIDTLSEAQKVLNSEMRQMLITVA 398 + G+TLST G G NYN+ +M R++ VG GNY+Y+D + EA ++ ++ L VA Sbjct: 458 ARAAKGITLSTVGFGMGNYNDVLMERLSQVGEGNYAYVDRIEEAHRIFVRDLTGTLQVVA 517 Query: 399 KDVKAQIEFNPAWVTEYRQIGYEKRQLRVEHFNNDNVDAGDIGAGKHITLLFELTLNGQK 458 KDVK Q+EF+P V+ YR +GYE R L E F +D VDAG+IGAG +T L+E+ L Sbjct: 518 KDVKLQMEFDPKAVSHYRLLGYENRMLTKEQFADDRVDAGEIGAGHAVTALYEVKLTEPS 577 Query: 459 ASIDKLRYAPDNKLAKSDKTKELAWLKIRWKYPQGKESQLVEFPLGPTINAP-----SED 513 AS LR IR+K P+G +S+L+E PL ++ P + Sbjct: 578 ASFGTLR--------------------IRYKAPEGGDSKLIEKPLPSSVLRPAYGRAAPP 617 Query: 514 MRFRAAVAAYGQKLRGSEYLNNTS-------WQQIKQ 543 R AA+ +KLRGS ++ + W++I Q Sbjct: 618 TRLSYVAAAFAEKLRGSYWVRPLTYDALFSFWEEIGQ 654 >UniRef50_Q1D6F9 von Willebrand factor type A domain protein n=2 Tax=Cystobacterineae RepID=Q1D6F9_MYXXD Length = 592 Score = 308 bits (790), Expect = 3e-82, Method: Compositional matrix adjust. Identities = 186/470 (39%), Positives = 268/470 (57%), Gaps = 35/470 (7%) Query: 101 GTARYQQFDDNPVKQVAQNPLATFSLDVDTGSYANVRRFLNQGLLPPPDAVRVEEIVNYF 160 G ++ + N + A++PL+TF+ DVDT SY RR+L G LPP AVRVEE VNYF Sbjct: 138 GGNTFEAWKANAFVETAKDPLSTFAADVDTASYTVSRRYLVNGQLPPASAVRVEEFVNYF 197 Query: 161 PSDWDIKDKQSIPASKPIPFAMRYELAPAPWNEQRTLLKVDILAKDRKSEELPASNLVFL 220 K + + P + FA+ E AP+P++ +R L+V + K + ++LVFL Sbjct: 198 ------KFRYAPPETGA--FAVHLEGAPSPFDAKRHFLRVGVQGKVVSRSQRKPAHLVFL 249 Query: 221 IDTSGSMISDERLPLIQSSLKLLVKELREQDNIAIVTYAGDSRIALPSISGSHKAEINAA 280 +DTSGSM S+++LPL + ++K+ VK L E D +AIVTYAG++R LP + I+AA Sbjct: 250 VDTSGSMHSEDKLPLAREAIKVAVKNLNENDTVAIVTYAGNTRDVLPPTPATDAKSIHAA 309 Query: 281 IDSLDAEGSTNGGAGLELAYQQATKGFIKGGINRILLATDGDFNVGID-DPKSIESMVKK 339 +DSL A G T G+G+ELAY+ A K ++R+++ TDGD N+G + ++ + K Sbjct: 310 LDSLTAGGGTAMGSGMELAYRHAVKKASGSVVSRVVVLTDGDANIGRNVSANAMLDSIHK 369 Query: 340 QRESGVTLSTFGVGNSNYNEAMMVRIADVGNGNYSYIDTLSEAQKVLNSEMRQMLITVAK 399 GVTL+T G G NY + +M ++AD GNGN Y+D+L EA+KV +++ L +AK Sbjct: 370 YTAEGVTLTTVGFGMGNYRDDLMEKLADKGNGNCFYVDSLREAKKVFETQLTGTLEVIAK 429 Query: 400 DVKAQIEFNPAWVTEYRQIGYEKRQLRVEHFNNDNVDAGDIGAGKHITLLFELTLNGQKA 459 DVK Q+EFNPA V YR +GYE R + F ND VDAG+IGAG ++T ++E+ L G+ Sbjct: 430 DVKFQVEFNPAAVRRYRLVGYENRDVADHDFRNDKVDAGEIGAGHNVTAVYEVELTGE-- 487 Query: 460 SIDKLRYAPDNKLAKSDKTKELAWLKIRWKYPQGKESQLVEF-----PLGPTINAPSEDM 514 T+ LA +++R K P G E+ EF L T+ S D Sbjct: 488 -----------------ATEALATVRVRAKAPNGTEASEREFRFERTKLRDTLAQASPDF 530 Query: 515 RFRAAVAAYGQKLRGSEYLNNTSWQQIKQWAQQAKGEDPQGYRAEFIRLI 564 RF AVAA LR S S ++ A+ A D R EF+RL+ Sbjct: 531 RFAVAVAATADVLRDSPSAEGWSLATAEKLAEGATEGDAD--RKEFVRLV 578 >UniRef50_C0Z5D5 Putative uncharacterized protein n=1 Tax=Brevibacillus brevis NBRC 100599 RepID=C0Z5D5_BREBN Length = 513 Score = 306 bits (785), Expect = 1e-81, Method: Compositional matrix adjust. Identities = 171/467 (36%), Positives = 261/467 (55%), Gaps = 33/467 (7%) Query: 100 PGTARYQQFDDNPVKQVAQNPLATFSLDVDTGSYANVRRFLNQGLLPPPDAVRVEEIVNY 159 P ++ + N A++ L+TF+ DVDT SY +R F+ G LPP +AVRVEE +N+ Sbjct: 71 PNDMYFKDYGTNQFVSTAKDRLSTFAADVDTASYTIMRHFIKDGNLPPAEAVRVEEFINF 130 Query: 160 FPSDWDIKDKQSIPASKPIPFAMRYELAPAPWNEQRTLLKVDILAKDRKSEELPASNLVF 219 FP+ + PA FA++ + P+P+ + ++++ I K+ +E +NLVF Sbjct: 131 FPTSY--------PAPTNQTFAIQADSGPSPFQKNLQIVRIGIKGKELSPKERKPANLVF 182 Query: 220 LIDTSGSMISDERLPLIQSSLKLLVKELREQDNIAIVTYAGDSRIALPSISGSHKAEINA 279 +ID SGSM + RL L++ SL +LV +L+ D++ IV Y + R+ LP S K I + Sbjct: 183 VIDVSGSMNQENRLELVKKSLHVLVDQLQPTDSVGIVVYGSEGRVLLPPTSTEDKQAILS 242 Query: 280 AIDSLDAEGSTNGGAGLELAYQQATKGFIKGGINRILLATDGDFNVGIDDPKSIESMVKK 339 AID L EGSTN GL L Y+ A + F INR++L +DG NVG + I ++ Sbjct: 243 AIDELQPEGSTNAEQGLVLGYEMAARSFKPPAINRVILCSDGVANVGETGAEGILRSIED 302 Query: 340 QRESGVTLSTFGVGNSNYNEAMMVRIADVGNGNYSYIDTLSEAQKVLNSEMRQMLITVAK 399 + LS+FG G NYN+ MM ++A+ G G+Y+YIDT SEA+++ + L T+A+ Sbjct: 303 YARKDIYLSSFGFGMGNYNDVMMEQLANKGEGSYAYIDTFSEARRIFTESLTGTLQTIAR 362 Query: 400 DVKAQIEFNPAWVTEYRQIGYEKRQLRVEHFNNDNVDAGDIGAGKHITLLFELTLNGQKA 459 DVK Q+EF+P V YR IGYE R +R E F ND DAG+IGAG +T L+E+ L Sbjct: 363 DVKIQVEFDPKKVDSYRLIGYENRDVRDEDFRNDKTDAGEIGAGHSVTALYEVKL----- 417 Query: 460 SIDKLRYAPDNKLAKSDKTKELAWLKIRWKYPQGKESQLVEFPLGPTINAPSEDMRFRAA 519 S EL +++R+ + ++ + + P+ + S D+ F AA Sbjct: 418 --------------ASPVHAELGTVRVRYHHASTQKVEEISEPV-KVQSTLSPDVTFLAA 462 Query: 520 VAAYGQKLRGSEYLNNTSWQQIKQWAQ-QAKGEDPQGYRAEFIRLIE 565 VA YG+ LR S Y +S + + A+ A GE+ + EF+RL++ Sbjct: 463 VAEYGEILRESPYAERSSLADVLKLAEATATGEE----QLEFVRLVK 505 >UniRef50_A9GVG2 Putative uncharacterized protein n=1 Tax=Sorangium cellulosum 'So ce 56' RepID=A9GVG2_SORC5 Length = 656 Score = 306 bits (783), Expect = 2e-81, Method: Compositional matrix adjust. Identities = 181/482 (37%), Positives = 274/482 (56%), Gaps = 35/482 (7%) Query: 101 GTARYQQFDDNPVKQVAQNPLATFSLDVDTGSYANVRRFLNQGLLPPPDAVRVEEIVNYF 160 G+ Y+ + NPV+ A++ L+TF++DVDT SYA RR + G LPP AVR EE +NYF Sbjct: 193 GSETYRDYGVNPVEDPAKDRLSTFAIDVDTASYAIARRKIMDGALPPYQAVRAEEFLNYF 252 Query: 161 PSDWDIKDKQSIPASKPIPFAMRYELAPAPWNEQRTLLKVDILAKDRKSEELPASNLVFL 220 + + PA+ P FA+ AP+P+ L++V + K +E +LV+L Sbjct: 253 DYGY------ASPAAGP--FAVHLAAAPSPFTSGHHLVRVAVQGKRVPVKERTPVHLVYL 304 Query: 221 IDTSGSMISDERLPLIQSSLKLLVKELREQDNIAIVTYAGDSRIALPSISGSHKAEINAA 280 +DTSGSM S +++ L + SLK+L L+ D +A+ TYAG R L K +I AA Sbjct: 305 VDTSGSMQSPDKIELAKKSLKMLTDTLKPGDTVALCTYAGSVREVLAPTGIESKGKILAA 364 Query: 281 IDSLDAEGSTNGGAGLELAYQQATKGFIKGGINRILLATDGDFNVGIDDPKSIESMVKKQ 340 + L A GST +G++LAY A + +KG +NR+++ +DGD NVG I +K+ Sbjct: 365 LADLTAGGSTAMSSGIDLAYSLAERTLVKGHVNRVIVLSDGDANVGPTSHDEILKTIKRA 424 Query: 341 RESGVTLSTFGVGNSNYNEAMMVRIADVGNGNYSYIDTLSEAQKVLNSEMRQMLITVAKD 400 R+ G+TLST G G NY + MM ++A+ G+GNY+YID+ ++A++V + ++ ML +A+D Sbjct: 425 RDKGITLSTVGFGQGNYKDLMMEQLANQGDGNYAYIDSEAQARRVFSEQVGGMLQVIARD 484 Query: 401 VKAQIEFNPAWVTEYRQIGYEKRQLRVEHFNNDNVDAGDIGAGKHITLLFELTLNGQKAS 460 VK Q+EF+P++V YR IGYE R + F ND VDAG+IGAG +T ++++ L Sbjct: 485 VKIQVEFDPSFVKSYRLIGYENRDVADRDFRNDKVDAGEIGAGHSVTAIYDVELKAP--- 541 Query: 461 IDKLRYAPDNKLAKSDKTKELAWLKIRWKYPQGK---ESQLVEFPLG---PTINAPSEDM 514 AP + A +++R K P G E LV+ G PT +A D Sbjct: 542 ------APKGEGAAP------IVVRLRHKAPLGSNTAEETLVKMAPGAIAPTFDAAPADF 589 Query: 515 RFRAAVAAYGQKLRGSEYLNNTSWQQIKQWAQQAKGEDPQGYRAEFIRLIE----LADGV 570 RF +AVA + + LR S + + I++ A+ A +G + EFI +I LA+G Sbjct: 590 RFASAVAGFAEVLRHSPHARSWRLADIEKIARAAASS--KGDQQEFIGIIRRAGALANGK 647 Query: 571 TD 572 TD Sbjct: 648 TD 649 >UniRef50_C7N770 Uncharacterized protein containing a von Willebrand factor type A (VWA) domain n=1 Tax=Slackia heliotrinireducens DSM 20476 RepID=C7N770_SLAHD Length = 629 Score = 301 bits (771), Expect = 5e-80, Method: Compositional matrix adjust. Identities = 175/442 (39%), Positives = 247/442 (55%), Gaps = 21/442 (4%) Query: 102 TARYQQFDDNPVKQVAQNPLATFSLDVDTGSYANVRRFLNQG--LLPPPD-AVRVEEIVN 158 T Y ++N PL+T S DVDT SY N+RR +N G L PD AVR+EE++N Sbjct: 165 TEEYAAIEENGFVSTVTRPLSTCSADVDTASYCNLRRMINDGYSLDEIPDGAVRIEEMLN 224 Query: 159 YFPSDWDIKDKQSIPASKPIPFAMRYELAPAPWNEQRTLLKVDILAKDRKSEELPASNLV 218 YF D P + FA+R E A PWN+Q LL + A D+ SNLV Sbjct: 225 YFHYD------SGEPEGNDL-FAVRAESARCPWNDQTQLLVMTFTASDKAQTASKGSNLV 277 Query: 219 FLIDTSGSMISDERLPLIQSSLKLLVKELREQDNIAIVTYAGDSRIALPSISGSHKAEIN 278 FLID SGSM ++L L++ S L++ L D ++IVTYA + L SG +I Sbjct: 278 FLIDISGSMDEPDKLDLLKDSFGTLLENLGPNDRVSIVTYAAGEDVLLEGASGDDTRKIM 337 Query: 279 AAIDSLDAEGSTNGGAGLELAYQQATKGFIKGGINRILLATDGDFNVGIDDPKSIESMVK 338 A++ L+A+GSTNG AGLE+AY+ A + +I+GG+NRI++A+DGD NVGI + V+ Sbjct: 338 RALNRLEADGSTNGEAGLEMAYEVAERNYIEGGVNRIVMASDGDLNVGITSESDLYDFVE 397 Query: 339 KQRESGVTLSTFGVGNSNYNEAMMVRIADVGNGNYSYIDTLSEAQKVLNSEMRQMLITVA 398 ++RE+GV LS G G+ NY + M +AD GNG Y YID + EA++VL ++ + VA Sbjct: 398 EKRETGVYLSVLGFGSGNYKDTKMETLADHGNGTYHYIDCVEEAERVLGEDLTANFVPVA 457 Query: 399 KDVKAQIEFNPAWVTEYRQIGYEKRQLRVEHFNNDNVDAGDIGAGKHITLLFELTL---N 455 DVK Q+EFNPA V YR IGYE R + E F N+ DA ++GAG T+ +EL L + Sbjct: 458 DDVKLQVEFNPAQVKAYRLIGYENRAMADEDFLNEAADAAEVGAGAQFTVAYELVLADSD 517 Query: 456 GQKASIDKLRYAPDNKLAKSDKTKELAWLKIRWKYPQGKESQLV---EFPLGPT--INAP 510 A + L+Y + A T E WL +Y + + V + +G +P Sbjct: 518 YDVADVPDLKYGS-GEAAGDSSTDE--WLTCSMRYKAVDDDKAVRSQDLVVGADSQTESP 574 Query: 511 SEDMRFRAAVAAYGQKLRGSEY 532 S+D F ++V +G SE+ Sbjct: 575 SDDWVFASSVIEFGMIASDSEF 596 >UniRef50_C8WPE6 von Willebrand factor type A n=1 Tax=Eggerthella lenta DSM 2243 RepID=C8WPE6_EGGLE Length = 555 Score = 295 bits (756), Expect = 2e-78, Method: Compositional matrix adjust. Identities = 181/496 (36%), Positives = 255/496 (51%), Gaps = 22/496 (4%) Query: 71 SDKQALQGRLQEAPTFARAAKAKATHIANPGTARYQQFDDNPVKQVAQNPLATFSLDVDT 130 S+ A+ L E + + GT Y+ D+ A +PL+T S DVDT Sbjct: 47 SEIMAIGSALSETASTCPPPYPYVPSPSPGGTEEYRALDEPGFLSPATSPLSTLSADVDT 106 Query: 131 GSYANVRRFLNQGLLP---PPDAVRVEEIVNYFPSDWDIKDKQSIPASKPIPFAMRYELA 187 SY N+RR + Q P P AVR EE++NYF D+ + P + F + +++ Sbjct: 107 ASYCNLRRMVAQRYAPAVVPAGAVRTEELLNYF--DYAYPE----PVGSDL-FGVSAQMS 159 Query: 188 PAPWNEQRTLLKVDILAKDRKSEELPASNLVFLIDTSGSMISDERLPLIQSSLKLLVKEL 247 PWN+Q LL + + +NLVFLID SGSM ++LPL++ S LV+ L Sbjct: 160 DCPWNDQTKLLVMGFATEKDGDASPTGANLVFLIDVSGSMDDPDKLPLVKDSFAALVEGL 219 Query: 248 REQDNIAIVTYAGDSRIALPSISGSHKAEINAAIDSLDAEGSTNGGAGLELAYQQATKGF 307 E+D +++VTYA R+ L + G K I A+DSL AEGSTNG AGLE AY+ A F Sbjct: 220 TERDRVSVVTYASGERVLLEGVPGDDKRRIMRAVDSLVAEGSTNGEAGLEQAYRLAESSF 279 Query: 308 IKGGINRILLATDGDFNVGIDDPKSIESMVKKQRESGVTLSTFGVGNSNYNEAMMVRIAD 367 I+GG+NR+++A+DGD NVGI + V+++RE+GV LS G G+ NY + M +AD Sbjct: 280 IEGGVNRVVMASDGDLNVGISSESELHDFVEQKRETGVYLSVLGFGSGNYKDNKMETLAD 339 Query: 368 VGNGNYSYIDTLSEAQKVLNSEMRQMLITVAKDVKAQIEFNPAWVTEYRQIGYEKRQLRV 427 GNG Y YID EA++VL +R L+ +A DVK Q+EFNP V YR IGYE R L Sbjct: 340 HGNGAYHYIDCAEEARRVLGRNLRANLVPLADDVKIQVEFNPDRVKGYRLIGYENRALAD 399 Query: 428 EHFNNDNVDAGDIGAGKHITLLFELTLNGQK----ASIDKLRY-APDNKLAKSDKTKELA 482 E F + DAG++GAG T+ +E+ G AS K A D + + + Sbjct: 400 EEFRD---DAGEVGAGHAFTVAYEIVPAGSAFEVGASASKYGSDADDRQDGRRSEANGGE 456 Query: 483 WLKIRWKYPQGKESQLVEFPL----GPTINAPSEDMRFRAAVAAYGQKLRGSEYLNNTSW 538 WL +Y + VE L + P+ D F AAV G L S + + Sbjct: 457 WLTCTMRYRPAGTVEAVEQALVVDDESCTDDPNGDWTFAAAVIECGMALHRSPHAGAATL 516 Query: 539 QQIKQWAQQAKGEDPQ 554 + + + D Q Sbjct: 517 ESARDLLASCELTDQQ 532 >UniRef50_C7PR69 von Willebrand factor type A n=1 Tax=Chitinophaga pinensis DSM 2588 RepID=C7PR69_CHIPD Length = 588 Score = 274 bits (701), Expect = 7e-72, Method: Compositional matrix adjust. Identities = 168/443 (37%), Positives = 240/443 (54%), Gaps = 23/443 (5%) Query: 124 FSLDVDTGSYANVRRFLNQGLLPPPDAVRVEEIVNYFPSDWDIKDKQSIPASKPIPFAMR 183 F++DVD +Y+N+RRF+ P +AVR+EE+VNYF + + P + + Sbjct: 158 FAVDVDRAAYSNIRRFVKLKERIPANAVRIEEMVNYFHYSYPLP-----PVGQTLAIYSN 212 Query: 184 YELAPAPWNEQRTLLKVDILAKDRKSEELPASNLVFLIDTSGSMISDERLPLIQSSLKLL 243 Y A PW E LL++ + K + LP SNLVFLID SGSM +LPL+Q++ ++L Sbjct: 213 Y--ATCPWAEDHRLLQIAVRGKSVNLDSLPPSNLVFLIDVSGSMAMPNKLPLLQAAFRIL 270 Query: 244 VKELREQDNIAIVTYAGDSRIALPSISGSHKAEINAAIDSLDAEGSTNGGAGLELAYQQA 303 V LR D++AIV YAG + LPS GS K++I AID L A G+T G A ++LAYQ A Sbjct: 271 VNNLRSNDHVAIVAYAGVPGVILPSTPGSAKSKILNAIDYLSAGGATAGEAAIKLAYQIA 330 Query: 304 TKGFIKGGINRILLATDGDFNVGIDDPKSIESMVKKQRESGVTLSTFGVGNSNYNEAMMV 363 + FIK G NR++LATDGDFNVG +E ++ ++E+GV L+ G G NY ++ + Sbjct: 331 EENFIKEGNNRVILATDGDFNVGQTSDHDMEQLILGKKETGVLLTCLGFGMKNYKDSKLE 390 Query: 364 RIADVGNGNYSYIDTLSEAQKVLNSEMRQMLITVAKDVKAQIEFNPAWVTEYRQIGYEKR 423 ++ GNGN++YID L EA K+ E L TVA+DV+A + FNP V YR IGYE + Sbjct: 391 TLSSKGNGNFAYIDNLEEASKIFAREFGSTLFTVARDVQADVVFNPRTVKSYRLIGYENK 450 Query: 424 QLRVEHFNNDNVDAGDIGAGKHITLLFELTLNGQKASIDKLRYAPDNKLAKSDKTKELAW 483 ++ D+ A IG G ++ P L +D Sbjct: 451 VIK------DDDSASQIGGGIIGA---------GHCAVAIYEIVPQKGLMPADSMLAAVH 495 Query: 484 LKIRWKYPQGKESQLVEFP-LGPTINAPSEDMRFRAAVAAYGQKLRGSEYLNNTSWQQIK 542 L R + + P + T S+D RF +AVA G LR S Y + S + Sbjct: 496 LAYRETTDTTIKRLFYKVPDIFTTFQQSSDDFRFASAVALMGMLLRKSGYKGSGSCDMVM 555 Query: 543 QWAQQAKGEDPQGYRAEFIRLIE 565 A+++ G+DP GYR EFI L++ Sbjct: 556 DIARRSLGDDPGGYRREFITLLK 578 >UniRef50_A9AWD1 von Willebrand factor type A n=1 Tax=Herpetosiphon aurantiacus ATCC 23779 RepID=A9AWD1_HERA2 Length = 610 Score = 268 bits (686), Expect = 3e-70, Method: Compositional matrix adjust. Identities = 157/470 (33%), Positives = 262/470 (55%), Gaps = 32/470 (6%) Query: 105 YQQFDDNPVKQVAQNPLATFSLDVDTGSYANVRRFLNQGLLPPPDAVRVEEIVNYFPSDW 164 ++ + NP + +PL+TF++D+D+ SY+ +R +NQGLLPP D+VRVEE +N F ++ Sbjct: 163 FKNYGTNPFVRTETDPLSTFAMDIDSASYSLMRSSINQGLLPPADSVRVEEYLNAFDYEY 222 Query: 165 DIKDKQSIPASKPIPFAMRYELAPAPWN-EQRTLLKVDILAKDRKSEELPASNLVFLIDT 223 P + FA+ E+AP+P+ L+++ I A+ + + + L F+IDT Sbjct: 223 --------PQPEDGDFAIYSEVAPSPFGGPNYELVQIGIQARSIEVADRKPAALTFVIDT 274 Query: 224 SGSMISDERLPLIQSSLKLLVKELREQDNIAIVTYAGDSRIALPSISGSHKAEINAAIDS 283 SGSM D RL +++++L L +L D++AIV + R+ L SG ++ +I AI+S Sbjct: 275 SGSMAQDNRLEMVKNALIYLAGQLEPDDSLAIVAFNDGMRVVLNPTSGENQMDIITAINS 334 Query: 284 LDAEGSTNGGAGLELAYQQATKGFIKGGINRILLATDGDFNVGIDDPKSIESMVKKQRES 343 L+ GSTN AGL ++ A + F GINRILL +DG N G+ +P + + ++ ++ Sbjct: 335 LEPAGSTNAEAGLYKGFELAWQAFKPEGINRILLCSDGVANSGMTEPSQLLATFQQYLDA 394 Query: 344 GVTLSTFGVGNSNYNEAMMVRIADVGNGNYSYIDTLSEAQKVLNSEMRQMLITVAKDVKA 403 GV LST+GVG NYN+ ++ ++AD G+GNY+Y D+ EAQ++ ++ L T+ ++ K Sbjct: 395 GVQLSTYGVGMGNYNDILLEQLADKGDGNYAYFDSADEAQRLFGEQLTGSLQTIGREAKI 454 Query: 404 QIEFNPAWVTEYRQIGYEKRQLRVEHFNNDNVDAGDIGAGKHITLLFELTLN----GQKA 459 Q+ F+P V YR IGYE R + F ND+VD G++GAG +T L+E+ + G A Sbjct: 455 QVNFDPNVVKRYRLIGYENRAVADSDFRNDSVDGGEVGAGHSVTALYEIKRHPDAQGPIA 514 Query: 460 SIDKLRYAPDNKLAKSDKTKELAWLKIRWKYPQGKESQLVEFPLGPTINAPSEDMRFRAA 519 ++ +RY + A +++ ++ +I + + S M + Sbjct: 515 QVN-IRYISMDTNAPVEESLNISTAQIHSSFDRA-----------------SARMHLATS 556 Query: 520 VAAYGQKLRGSEYLNNTSWQQIKQWAQQAKGEDPQGYRA-EFIRLIELAD 568 VA Y + LR S + N T + A++A + P A EF+ L+ A+ Sbjct: 557 VAEYAELLRHSRWNNGTDILDVLDLAEEAALDLPNNQSAVEFVTLLRRAE 606 >UniRef50_A9NFD9 Surface-anchored VWFA domain protein n=1 Tax=Acholeplasma laidlawii PG-8A RepID=A9NFD9_ACHLI Length = 486 Score = 258 bits (658), Expect = 6e-67, Method: Compositional matrix adjust. Identities = 157/463 (33%), Positives = 250/463 (53%), Gaps = 31/463 (6%) Query: 105 YQQFDDNPVKQVAQNPLATFSLDVDTGSYANVRRFLNQGLLPPPDAVRVEEIVNYFPSDW 164 +Q+ +NP V+ N + SL +T SY+ +R +N G +AVR+EE+VN+F ++ Sbjct: 43 HQEIIENPFIDVSVNNKSNISLSANTASYSFIRSQINSGRAVDRNAVRIEEMVNFFNYNY 102 Query: 165 DIKDKQSIPASKPIPFAMRYELAPAPWNEQRTLLKVDILAKDRKSEELPASNLVFLIDTS 224 + P + F + EL PWN + LL + + K ++P SN+V L+D S Sbjct: 103 NQ------PETDK-TFGFKSELIQTPWNNETHLLLIGLETKQVDLGDIP-SNIVILLDVS 154 Query: 225 GSMISDERLPLIQSSLKLLVKELREQDNIAIVTYAGDSRIALPSISGSHKAEINAAIDSL 284 GSM + +L L + +++LL+++++ D I++VTY+ ++ S A + + I L Sbjct: 155 GSMSATNKLSLAKKAMELLIEQMKPNDVISLVTYSSGEKVVFKGKSIDDMAYMTSQIRLL 214 Query: 285 DAEGSTNGGAGLELAYQQATKGFIKGGINRILLATDGDFNVGIDDPKSIESMVKKQRESG 344 A GST G GL++AY+ A + FI+GG NRI+LATDGDFNVGI + + ++RESG Sbjct: 215 KASGSTAGKKGLDMAYKVAEEYFIEGGNNRIILATDGDFNVGISSTDMLIEYISEKRESG 274 Query: 345 VTLSTFGVGNSNYNEAMMVRIADVGNGNYSYIDTLSEAQKVLNSEMRQMLITVAKDVKAQ 404 + S +G G N+ + + R+A GNG Y YID + A+K + +L TVA+D KAQ Sbjct: 275 IYFSAYGFGYGNFKDEKLERVAKAGNGTYHYIDDIISARKAFVDNIDGVLYTVARDAKAQ 334 Query: 405 IEFNPAWVTEYRQIGYEKRQLRVEHFNNDNVDAGDIGAGKHITLLFELTLNGQKASIDKL 464 I F+ + V EYR IGYE RQL + F++ DAG+IG G +T ++EL LN Sbjct: 335 IVFDASAVLEYRLIGYENRQLTDDEFDDGTTDAGEIGTGLQVTAIYELKLN--------- 385 Query: 465 RYAPDNKLAKSDKTKELAWLKIRWK-YPQGKESQLVE-FPLGPTINA-PSEDMRFRAAVA 521 + ++ L IR+K + ++QL E F + IN PS D +F ++V Sbjct: 386 -----------EGASDVGSLTIRYKNHDITDDTQLEEAFTVLNAINENPSVDAKFISSVV 434 Query: 522 AYGQKLRGSEYLNNTSWQQIKQWAQQAKGEDPQGYRAEFIRLI 564 +G L S+Y + + + + YR +FI ++ Sbjct: 435 EFGLILMDSKYKVDADLGAVLERIETETYNLEDYYRNDFIDVL 477 >UniRef50_A3TQT5 Putative secreted protein n=1 Tax=Janibacter sp. HTCC2649 RepID=A3TQT5_9MICO Length = 533 Score = 238 bits (608), Expect = 4e-61, Method: Compositional matrix adjust. Identities = 148/387 (38%), Positives = 215/387 (55%), Gaps = 26/387 (6%) Query: 118 QNPLATFSLDVDTGSYANVRRFLNQGLLPPPDAVRVEEIVNYFPSDWDIKDKQSIPASKP 177 + P +TF++DVD GS+ R L+ G LPPP++VR EE VN F S + PA + Sbjct: 95 ERPRSTFAVDVDGGSFRVARSLLHDGHLPPPESVRPEEWVNSFDSGF--------PAPRK 146 Query: 178 IPFAMRYELAPAPWNEQRT-LLKVDILAKDRKSEELPASNLVFLIDTSGSMISDERLPLI 236 ++ + A A + T L+++ + ++ E L ++DTSGSM ERL L+ Sbjct: 147 DDLELQSDQARASSEDDGTRLVRIGLQGREVDVREWQPVALTMVVDTSGSMDIRERLGLV 206 Query: 237 QSSLKLLVKELREQDNIAIVTYAGDSRIALPSISGSHKAEINAAIDSLDAEGSTNGGAGL 296 +SSL LL + LR D IAIVTY D+ L I AAID L+A GSTN AGL Sbjct: 207 KSSLALLAENLRPDDTIAIVTYQTDATPLLEPTPVRDTDTILAAIDRLEAGGSTNLEAGL 266 Query: 297 ELAYQQATKGFIKGGINRILLATDGDFNVGIDDPKSIESMVKKQRESGVTLSTFGVGNSN 356 L Y QA + + +G N +LLA+DG NVG+ D + + ++ G+ L T G G N Sbjct: 267 LLGYDQAREAYKQGATNVVLLASDGVANVGVTDGGRLATAIRDNGRRGIHLVTVGYGMGN 326 Query: 357 YNEAMMVRIADVGNGNYSYIDTLSEAQKVLNSEMRQMLITVAKDVKAQIEFNPAWVTEYR 416 Y++ +M ++AD G+G Y YIDT EA+K+ ++R L VAKD K Q+EF+P V+ YR Sbjct: 327 YSDHLMEQLADQGDGFYEYIDTFEEARKLFVEDLRATLTPVAKDAKIQVEFDPRTVSAYR 386 Query: 417 QIGYEKRQLRVEHFNNDNVDAGDIGAGKHITLLFELTLNGQKASIDKLRYAPDNKLAKSD 476 IGYE R L + F+ND VDAG++GAG +T L+E+ A++D Sbjct: 387 LIGYENRALSDDDFDNDAVDAGEVGAGHKVTALYEV-----------------RPTAQAD 429 Query: 477 KTKELAWLKIRWKYPQGKESQLVEFPL 503 + L +++RW+ G+E + PL Sbjct: 430 EGDALGTVRVRWRSVDGEEQREDSLPL 456 >UniRef50_C1RGW7 Uncharacterized protein containing a von Willebrand factor type A (VWA) domain n=1 Tax=Cellulomonas flavigena DSM 20109 RepID=C1RGW7_9CELL Length = 500 Score = 213 bits (542), Expect = 2e-53, Method: Compositional matrix adjust. Identities = 115/338 (34%), Positives = 191/338 (56%), Gaps = 9/338 (2%) Query: 117 AQNPLATFSLDVDTGSYANVRRFLNQGLLPPPDAVRVEEIVNYFPSDWDIKDKQSIPASK 176 A++ L+TF+LDVDTG+Y R + QG P VR EE VNYF D++ P ++ Sbjct: 70 ARDALSTFALDVDTGAYTRFRDAVRQGFSVDPFGVRTEEFVNYFAQDYE-------PPAE 122 Query: 177 PIPFAMRYELAPAPWNEQRTLLKVDILAKDRKSEELPASNLVFLIDTSGSMISDERLPLI 236 + ++ + P+ L++V I + + ++LV ++D SGSM ++ Sbjct: 123 GLGVSI--DATALPFRPDHRLVRVGISSAPASAVSRADADLVLVVDCSGSMDEAGKMETT 180 Query: 237 QSSLKLLVKELREQDNIAIVTYAGDSRIALPSISGSHKAEINAAIDSLDAEGSTNGGAGL 296 + +L+ LV LR D +A+V Y+ ++ + L + + + AAID L STN AGL Sbjct: 181 KYALRTLVSSLRRTDRVAMVCYSTEADVYLEPTPVAEREGVLAAIDRLAPRDSTNAAAGL 240 Query: 297 ELAYQQATKGFIKGGINRILLATDGDFNVGIDDPKSIESMVKKQRESGVTLSTFGVGNSN 356 L Y A +G + R++L +DG NVG DP+ I + + Q ++G++L + GVG + Sbjct: 241 ALGYDLAMSMRTEGRLTRVVLVSDGVANVGETDPEGILARISSQAKAGISLISVGVGITT 300 Query: 357 YNEAMMVRIADVGNGNYSYIDTLSEAQKVLNSEMRQMLITVAKDVKAQIEFNPAWVTEYR 416 YN+ ++ ++AD G+G + Y+D +EA++V + + L+ D +AQ+EF+PA V YR Sbjct: 301 YNDHLLEQLADQGDGWHVYVDGEAEAERVFATGLTGSLVVAGTDARAQVEFDPAQVAGYR 360 Query: 417 QIGYEKRQLRVEHFNNDNVDAGDIGAGKHITLLFELTL 454 +GYE R + E F ND VD G++ AG+ T L+E+ + Sbjct: 361 LLGYENRAVADEDFRNDAVDGGEVFAGRSTTALYEVAM 398 >UniRef50_B4D1N7 Autotransporter-associated beta strand repeat protein n=1 Tax=Chthoniobacter flavus Ellin428 RepID=B4D1N7_9BACT Length = 1545 Score = 205 bits (521), Expect = 4e-51, Method: Compositional matrix adjust. Identities = 135/460 (29%), Positives = 226/460 (49%), Gaps = 38/460 (8%) Query: 112 PVKQVAQNPLATFSLDVDTGSYANVRRFLNQGLLPPPDAVRVEEIVNYFPSDWDIKDKQS 171 P Q + N +TFSL+V S+ L QG +P P +VR EE +N F D +D + Sbjct: 1104 PEVQTSANAFSTFSLNVSDVSFKLAAASLEQGHMPDPASVRSEEFINAF----DYRDPEP 1159 Query: 172 IPASKPIPFAMRYELAPAPWNEQRTLLKVDILAKDRKSEELPASNLVFLIDTSGSMISDE 231 P + P A E A P+ + R LL+ + + N+V L+D SGSM + Sbjct: 1160 SPGA---PLAFVTERARYPFAQNRDLLRFAVKTAAAGRQPGRPLNIVLLLDRSGSMERAD 1216 Query: 232 RLPLIQSSLKLLVKELREQDNIAIVTYAGDSRIALPSISGSHKAEINAAIDSLDAEGSTN 291 R+ +++ +L +L K L+ QD ++IVT+A + +++G ++ A ++ + EG TN Sbjct: 1217 RVNIVREALSVLAKHLQPQDKLSIVTFARTPHLWADAVAGDKVHDVIARVNEITPEGGTN 1276 Query: 292 GGAGLELAYQQATKGFIKGGINRILLATDGDFNVGIDDPKSIESMVKKQRESGVTLSTFG 351 A L+LAY+ A F NR++L TDG N+G +P ++ V+ QR+ G+ L FG Sbjct: 1277 LEAALDLAYETAHHHFAVDSTNRVILFTDGAANLGDVNPDALTKKVEAQRKQGIALDCFG 1336 Query: 352 VGNSNYNEAMMVRIADVGNGNYSYIDTLSEAQKVLNSEMRQMLITVAKDVKAQIEFNPAW 411 +G YN+ ++ ++ +G Y +I+T +A +++ L A DVK Q+EFNP Sbjct: 1337 IGWEGYNDDLLEQLTRNADGRYGFINTPEDAAANFATQIAGALQVAASDVKVQVEFNPHR 1396 Query: 412 VTEYRQIGYEKRQLRVEHFNNDNVDAGDIGAGKHITLLF--ELTLNGQKASIDKLRYAPD 469 V YRQIGY QL E F +++V+A IGA + L+ E+ +G+ Sbjct: 1397 VKTYRQIGYATHQLTKEQFRDNSVNAAQIGAAESGNALYVVEVDPHGE------------ 1444 Query: 470 NKLAKSDKTKELAWLKIRWKYPQGKESQLVEFPLG-----PTINAPSEDMRFRAAVAAYG 524 +LA + +R++ P + + E+P+ P + S +R A +A+ Sbjct: 1445 ---------GDLATVHVRFRVPGTSDYREHEWPVPFAGEVPPLEQASSALRLAGAASAFS 1495 Query: 525 QKLRGSEYLNNTSWQQ---IKQWAQQAKGEDPQGYRAEFI 561 + L S Y + + I G DP+ + E++ Sbjct: 1496 EMLAASPYATEVTSDRLLNILNGVPPIYGADPRPTKLEWM 1535 >UniRef50_D2BAS2 von Willebrand factor type A domain protein n=12 Tax=Actinomycetales RepID=D2BAS2_STRRD Length = 490 Score = 199 bits (507), Expect = 2e-49, Method: Compositional matrix adjust. Identities = 122/371 (32%), Positives = 188/371 (50%), Gaps = 29/371 (7%) Query: 121 LATFSLDVDTGSYANVRRFLNQGLLPPPDAVRVEEIVNYFPSDWDIKDKQSIPASKPIPF 180 ++TF+LDVDT SY +R L +G LP P +R EE VN F D+ F Sbjct: 63 ISTFALDVDTASYGYAKRILQEGRLPEPGQIRPEEFVNSFRQDYKEPGDDG--------F 114 Query: 181 AMRYELAPAPWNEQRTLLKVDILAKDRKSEELPASNLVFLIDTSGSMISDERLPLIQSSL 240 + + A P N L++V + + + E +NL F++D SGSM RL L++ +L Sbjct: 115 TVHMDGARMPENGT-ALIRVGLQTRKAEPEARRPANLTFVVDVSGSMGEPGRLDLVREAL 173 Query: 241 KLLVKELREQDNIAIVTYAGDSRIALPSISGSHKAEINAAIDSLDAEGSTNGGAGLELAY 300 LV +L D ++IV ++ +R+ L + + +++AAID L E STN GL Y Sbjct: 174 HKLVDQLGPGDQVSIVAFSTQARLVLSMTPATGRDQLHAAIDRLGVEDSTNLETGLTAGY 233 Query: 301 QQATKGFIKGGINRILLATDGDFNVGIDDPKSIESMVKKQRESGVTLSTFGVGNSNYNEA 360 +A + F NR++L +DG N G + I V + +TL GVG +Y + Sbjct: 234 AEAARAFRPAATNRVILLSDGLANTGDTTWQGILDRVAESAGRQITLLCVGVGR-DYGDQ 292 Query: 361 MMVRIADVGNGNYSYIDTLSEAQKVLNSEMRQMLITVAKDVKAQIEFNPAWVTEYRQIGY 420 +M ++AD G+G Y+ + +A+KV ++ L A+D KAQ+ FNP+ V YR IGY Sbjct: 293 LMEQLADNGDGAAVYVSSADDARKVFVEQLATNLDLRARDAKAQVVFNPSAVESYRLIGY 352 Query: 421 EKRQLRVEHFNNDNVDAGDIGAGKHITLLFELTLNGQKASIDKLRYAPDNKLAKSDKTKE 480 E RQ+ E F +D D G+IG G +T L+ + L +S + + Sbjct: 353 ENRQIAAEDFRDDTKDGGEIGPGHSVTALYGVRL-------------------RSGASGQ 393 Query: 481 LAWLKIRWKYP 491 LA +RW+ P Sbjct: 394 LATATVRWQDP 404 >UniRef50_A6DLI7 von Willebrand factor type A domain protein n=1 Tax=Lentisphaera araneosa HTCC2155 RepID=A6DLI7_9BACT Length = 1078 Score = 199 bits (507), Expect = 2e-49, Method: Compositional matrix adjust. Identities = 108/335 (32%), Positives = 186/335 (55%), Gaps = 8/335 (2%) Query: 121 LATFSLDVDTGSYANVRRFLNQGLLPPPDAVRVEEIVNYFPSDWDIKDKQSIPASKPIPF 180 L+TF++DVDT SY R + G VR+EE +N F + + K++ F Sbjct: 650 LSTFAIDVDTASYTAARSEIRAGRKVEASHVRIEEFINNFDYHYSVPKKEA--------F 701 Query: 181 AMRYELAPAPWNEQRTLLKVDILAKDRKSEELPASNLVFLIDTSGSMISDERLPLIQSSL 240 + EL+ LL+V + + ++ + F+ID SGSM ++ RLPLIQ +L Sbjct: 702 KIDSELSDHKVYAGVKLLRVGVQGQRLGADSQKPGSYTFVIDNSGSMAAENRLPLIQKTL 761 Query: 241 KLLVKELREQDNIAIVTYAGDSRIALPSISGSHKAEINAAIDSLDAEGSTNGGAGLELAY 300 + K + + D + I++ G I+ S+ +++ A+ +++A N G+E AY Sbjct: 762 PNMFKAMNQDDEVTILSCEGGVTNLANRITASNHSQLETAVKNIEAGTVANLSVGIEEAY 821 Query: 301 QQATKGFIKGGINRILLATDGDFNVGIDDPKSIESMVKKQRESGVTLSTFGVGNSNYNEA 360 + A + F G +NR++L +DG ++G + + + V + R+ G+ + GVG+ +Y+++ Sbjct: 822 KLAAQNFRSGAVNRVILLSDGIASLGEKEAQEVLKTVSQYRKQGIGNTVIGVGSEDYDDS 881 Query: 361 MMVRIADVGNGNYSYIDTLSEAQKVLNSEMRQMLITVAKDVKAQIEFNPAWVTEYRQIGY 420 + +A+ G+G Y + D+ + +L + T+A+DVK Q+EFNP V YR +GY Sbjct: 882 FLETLANKGDGVYYFGDSKEQMNDILVNNFEASFKTIARDVKIQLEFNPQAVRSYRLLGY 941 Query: 421 EKRQLRVEHFNNDNVDAGDIGAGKHITLLFELTLN 455 EKR+L + F ND VDAG+IGAG+ +T L+EL +N Sbjct: 942 EKRRLANKDFRNDKVDAGEIGAGQSVTALYELVVN 976 >UniRef50_B5JNR2 von Willebrand factor type A domain protein n=1 Tax=Verrucomicrobiae bacterium DG1235 RepID=B5JNR2_9BACT Length = 923 Score = 169 bits (429), Expect = 2e-40, Method: Compositional matrix adjust. Identities = 114/382 (29%), Positives = 195/382 (51%), Gaps = 18/382 (4%) Query: 107 QFDDNPVKQVAQNPLATFSLDVDTGSYANVRRFLNQGLLPPPDAVRVEEIVNYFPSDWDI 166 Q + P A +P +TFSL+V SY +L Q + PP +R EE VN F D Sbjct: 480 QLSEYPESNTATDPQSTFSLNVSDVSYRLTEAYLAQNVRPPAGTLRTEEFVNAF----DY 535 Query: 167 KDKQSIPASKPIPFAMRYELAPAPWNEQRTLLKVDI--LAKDRKSEELPASNLVFLIDTS 224 D + P ++ I F +E A P+ R +L+ + A R S + +L IDTS Sbjct: 536 GDP-TPPVARKIGFT--WERAHWPFAHDRDVLRFSLQTAAHGRASSQ--PLHLTLAIDTS 590 Query: 225 GSMISDERLPLIQSSLKLLVKELREQDNIAIVTYAGDSRIALPSISGSHKAEINAAIDSL 284 GSM +R+ ++ S L L E+D ++IV++ R+ L S + + + L Sbjct: 591 GSMSRPDRVDIVNSLATALQSNLTEKDRLSIVSFDRQPRLVLDGQSVTAETNLATLATQL 650 Query: 285 DAEGSTNGGAGLELAYQQATKGFIKGGINRILLATDGDFNVGIDDPKSIESMVKKQRESG 344 + +G T+ + L+L+YQ A + F + INR++L TDG N+G + + + + V + R G Sbjct: 651 NPQGGTDLESALQLSYQTAQRHFQENAINRVILITDGAANLGNTNAEQLRTTVTENRIRG 710 Query: 345 VTLSTFGVGNSNYNEAMMVRIADVGNGNYSYIDTLSEAQKVLNSEMRQMLITVAKDVKAQ 404 + L FG+G +++ + ++ G+G Y ++ + +A L ++ +L A DVK Q Sbjct: 711 IALDCFGIGFDGHDDTFLESLSRNGDGRYRFLRSPEDAALELGPKLAGLLRPAAYDVKVQ 770 Query: 405 IEFNPAWVTEYRQIGYEKRQLRVEHFNNDNVDAGDIGA---GKHITLLFELTLNGQKASI 461 +EFNP V Y+Q+GY++ Q+ + F N+ VDA ++ A G + L L + Sbjct: 771 VEFNPTRVETYQQLGYQQHQIADQDFRNNAVDAAELAATESGNALYLAKVLPDGRGDLGL 830 Query: 462 DKLRYAPDNKLAKSDKTKELAW 483 ++R+ + A+S +EL+W Sbjct: 831 VRVRF----RDAESGAYEELSW 848 >UniRef50_UPI0001912300 conserved protein n=1 Tax=Salmonella enterica subsp. enterica serovar Typhi str. M223 RepID=UPI0001912300 Length = 79 Score = 154 bits (388), Expect = 1e-35, Method: Compositional matrix adjust. Identities = 79/79 (100%), Positives = 79/79 (100%) Query: 8 MLLMSSLILSGCGPQPENKESQQQQPSTPTEQQVLAAQQAAIKEAEQSAAAAKALAQQEV 67 MLLMSSLILSGCGPQPENKESQQQQPSTPTEQQVLAAQQAAIKEAEQSAAAAKALAQQEV Sbjct: 1 MLLMSSLILSGCGPQPENKESQQQQPSTPTEQQVLAAQQAAIKEAEQSAAAAKALAQQEV 60 Query: 68 QQYSDKQALQGRLQEAPTF 86 QQYSDKQALQGRLQEAPTF Sbjct: 61 QQYSDKQALQGRLQEAPTF 79 >UniRef50_B1ZQD5 von Willebrand factor type A n=1 Tax=Opitutus terrae PB90-1 RepID=B1ZQD5_OPITP Length = 859 Score = 154 bits (388), Expect = 1e-35, Method: Compositional matrix adjust. Identities = 125/468 (26%), Positives = 208/468 (44%), Gaps = 39/468 (8%) Query: 110 DNPVKQV--AQNPLATFSLDVDTGSYANVRRFLNQGLLPPPDAVRVEEIVNYFPSDWDIK 167 D P +V A+ P++TFSL V S+ + L +G +P P +R EE N F D Sbjct: 414 DAPQAEVSTAKEPVSTFSLHVSDVSFQLAQAALARGEMPDPQRIRPEEFYNAF----DYG 469 Query: 168 DKQSIPASKPIPFAMRYELAPAPWNEQRTLLKV--DILAKDRKSEELPASNLVFLIDTSG 225 D A K A R E A P +QR L+++ + A R + + NL L+DTSG Sbjct: 470 DPTPASADK---IACRIEQAAHPLLQQRNLVRIAMKVPAAGRGAGQ--PLNLTVLLDTSG 524 Query: 226 SMISDERLPLIQSSLKLLVKELREQDNIAIVTYAGDSRIALPSISGSHKAEINAAIDSLD 285 SM +R ++++L +L L D + ++ +A R+ S++G ++ + Sbjct: 525 SMERTDRATSVRAALGVLASLLTPDDRVTLIGFARQPRLLAESLAGDQARQLVDLASTTP 584 Query: 286 AEGSTNGGAGLELAYQQATKGFIKGGINRILLATDGDFNVGIDDPKSIESMVKKQRESGV 345 G TN A L LA + A + NRI+L TDG N+G DP + + ++ R+ G+ Sbjct: 585 FTGGTNLEAALSLAGELARRHHNAAAQNRIVLITDGAANLGNADPAQLATRIETLRQQGI 644 Query: 346 TLSTFGVGNSNYNEAMMVRIADVGNGNYSYIDTLSEAQKVLNSEMRQMLITVAKDVKAQI 405 GVG ++A++ + G+G Y +D A ++ A+++K Q+ Sbjct: 645 AFDACGVGTDGLDDAVLEALTRKGDGRYYVLDAPENADAGFARQLAGAFRPAAENIKVQV 704 Query: 406 EFNPAWVTEYRQIGYEKRQLRVEHFNNDNVDAGDIGAGKHITLLFELTLNGQKASIDKLR 465 FNPA V YR IG+E+ +LR + F ND VDA ++ A + L+++ + Q Sbjct: 705 RFNPARVASYRLIGFEQHRLREQDFRNDQVDAAELAAEESAVALYQVEVLPQGEG----- 759 Query: 466 YAPDNKLAKSDKTKELAWLKIRWKYPQG-----KESQLVEFPLGPTINAPSEDMRFRAAV 520 EL + R++ P + ++ P P S ++ Sbjct: 760 --------------ELGDVFARFRDPATGAMIERSWTMLHEPRAPAFERASPSLQLAGVA 805 Query: 521 AAYGQKLRGSEYLNNTSWQQIKQWAQQAKGEDPQGYRAEFIRLIELAD 568 A +KLRG E ++ + +G Q R + +LI + D Sbjct: 806 ALVAEKLRGGEAAGQIHLNELAGVVNRLRGHYGQNLRVQ--QLIAMFD 851 >UniRef50_UPI00016986EC hypothetical protein Epers_34925 n=1 Tax=Endoriftia persephone 'Hot96_1+Hot96_2' RepID=UPI00016986EC Length = 196 Score = 153 bits (386), Expect = 2e-35, Method: Compositional matrix adjust. Identities = 85/174 (48%), Positives = 122/174 (70%), Gaps = 7/174 (4%) Query: 397 VAKDVKAQIEFNPAWVTEYRQIGYEKRQLRVEHFNNDNVDAGDIGAGKHITLLFELTLNG 456 +AKDVK QIEFNP +VTEYR +GYE R L E FNND VDAG+IGAG I+ L+E++L+G Sbjct: 1 MAKDVKIQIEFNPEYVTEYRLVGYENRLLAREDFNNDKVDAGEIGAGHSISALYEISLSG 60 Query: 457 QKAS-IDKLRYAPDNKLAKSDKTKELAWLKIRWKYPQGKESQLVEFPLGPT-----INAP 510 I+ LRYA +++ K+ ELA+LK+R+K+P ES+L+E P+ + +++ Sbjct: 61 STGQRIEPLRYA-QGQVSSVGKSDELAFLKLRFKHPGETESELIETPILASQIVQDLSSS 119 Query: 511 SEDMRFRAAVAAYGQKLRGSEYLNNTSWQQIKQWAQQAKGEDPQGYRAEFIRLI 564 S+D RF AAVAA+GQ LRG +YL + + ++ + A+ A+G DP G R EF+RL+ Sbjct: 120 SDDFRFAAAVAAFGQSLRGGKYLKDMGYDEMIELARGARGNDPDGERVEFVRLL 173 >UniRef50_A1ZRP2 Von Willebrand factor type A domain protein n=1 Tax=Microscilla marina ATCC 23134 RepID=A1ZRP2_9SPHI Length = 1088 Score = 135 bits (340), Expect = 4e-30, Method: Compositional matrix adjust. Identities = 68/163 (41%), Positives = 102/163 (62%), Gaps = 4/163 (2%) Query: 215 SNLVFLIDTSGSMISDERLPLIQSSLKLLVKELREQDNIAIVTYAGDSRIALPSISGSHK 274 +NL+ L+D SGSM S ++LPL++ S K L+ +R QD+++IV YAGD+ I L S S++ Sbjct: 912 NNLMLLLDVSGSMSSKDKLPLLKESFKYLISIMRPQDDVSIVIYAGDAAIVLKPTSASNQ 971 Query: 275 AEINAAIDSLDAEGSTNGGAGLELAYQQATKGFIKGGINRILLATDGDFNVGIDDPKSIE 334 +INA ID L + G TN AG +LAY+ +K F +GG NRI+LATDG+F + K I Sbjct: 972 EQINAVIDKLRSRGKTNVKAGFKLAYKWMSKNFKEGGNNRIILATDGEFPIS----KYIY 1027 Query: 335 SMVKKQRESGVTLSTFGVGNSNYNEAMMVRIADVGNGNYSYID 377 +V+K+ G+ LS F G+ + ++ G GNY ++ Sbjct: 1028 KLVEKRATKGINLSVFSFGSMTKKFETLEKLVAKGKGNYEQVN 1070 Score = 104 bits (259), Expect = 1e-20, Method: Compositional matrix adjust. Identities = 55/158 (34%), Positives = 96/158 (60%), Gaps = 6/158 (3%) Query: 215 SNLVFLIDTSGSMISDERLPLIQSSLKLLVKELREQDNIAIVTYAGDSRIALPSISGSHK 274 +NL+ L+D SGSM ++ LP+++S+LK LV +R +D +++V + ++++ L S +K Sbjct: 685 NNLMLLLDVSGSMKNE--LPMLKSALKYLVNIMRPEDKVSVVVFGSEAKLMLRPTSAKYK 742 Query: 275 AEINAAIDSLDAEGSTNGGAGLELAYQQATKGFIKGGINRILLATDGDFNVGIDDPKSIE 334 A+I AID+L + G TNG AGL+LAYQ + NRI+LA+DG+F++ K + Sbjct: 743 AQIMQAIDTLKSSGRTNGEAGLKLAYQWIQNNYKNNNNNRIILASDGEFSIS----KGLY 798 Query: 335 SMVKKQRESGVTLSTFGVGNSNYNEAMMVRIADVGNGN 372 M++++ E + LS F + + ++ +G GN Sbjct: 799 QMIEQKAEESIALSVFSFADQLKAYKKLKKLVSLGKGN 836 >UniRef50_C6VUB8 von Willebrand factor type A n=1 Tax=Dyadobacter fermentans DSM 18053 RepID=C6VUB8_DYAFD Length = 935 Score = 126 bits (316), Expect = 3e-27, Method: Compositional matrix adjust. Identities = 63/180 (35%), Positives = 114/180 (63%), Gaps = 5/180 (2%) Query: 215 SNLVFLIDTSGSMISDERLPLIQSSLKLLVKELREQDNIAIVTYAGDSRIALPSISGSHK 274 +N+V L+D S SM S ++PL++ S+K L+ +R +D I+IV Y+G +R+ L SG+ Sbjct: 757 NNMVLLLDVSSSMNSPYKMPLLKRSIKSLLTLVRPEDMISIVLYSGKARVVLKPTSGAKA 816 Query: 275 AEINAAIDSLDAEGSTNGGAGLELAYQQATKGFIKGGINRILLATDGDFNVGIDDPKSIE 334 +EI+ ID L ++G T+G G++LAY+ A K +I+GG NRI+LATDG+F V + Sbjct: 817 SEISRMIDLLQSDGDTDGNEGIKLAYKTANKQYIRGGNNRIVLATDGEFPVS----DEVM 872 Query: 335 SMVKKQRESGVTLSTFGVGNSNYNEAMMVRIADVGNGNYSYI-DTLSEAQKVLNSEMRQM 393 M+++ V LS F G + + +++++G G+Y+++ D ++ Q +L ++ +++ Sbjct: 873 DMIRQNARQDVYLSIFTFGRHEHTGQKLKKLSELGMGSYAHVTDASADLQLILEAQAKKL 932 >UniRef50_C7PTX8 von Willebrand factor type A n=1 Tax=Chitinophaga pinensis DSM 2588 RepID=C7PTX8_CHIPD Length = 462 Score = 116 bits (290), Expect = 3e-24, Method: Compositional matrix adjust. Identities = 67/191 (35%), Positives = 111/191 (58%), Gaps = 3/191 (1%) Query: 216 NLVFLIDTSGSMISDERLPLIQSSLKLLVKELREQDNIAIVTYAGDSRIALPSISGSHKA 275 N+ ++D SGSM S +++ + + K L+ +L D+++IV Y + PS S +K Sbjct: 82 NISLVLDRSGSM-SGDKIKYARQAAKFLIDQLNSTDHLSIVNYDDRVEVTSPSQSVKNKE 140 Query: 276 EINAAIDSLDAEGSTNGGAGLELAYQQATKGFIKGGINRILLATDGDFNVGIDDPKSIES 335 + AAID + GSTN G+ Y Q +G +NR+LL TDG N GI DP ++ Sbjct: 141 ALKAAIDKIHDRGSTNLSGGMLEGYTQVKSTRKEGYVNRVLLLTDGLANQGITDPLELKR 200 Query: 336 MVK-KQRESGVTLSTFGVGNSNYNEAMMVRIADVGNGNYSYIDTLSEAQKVLNSEMRQML 394 + + K +E G+ LSTFGVG ++YNE ++ +A+ G NY +ID+ + ++ E++ +L Sbjct: 201 LAENKYKEDGIALSTFGVG-ADYNEDLLTMLAENGRANYYFIDSPDKIPQIFAGELKGLL 259 Query: 395 ITVAKDVKAQI 405 VA++ A+I Sbjct: 260 SVVAQNAWAEI 270 >UniRef50_Q67LZ3 Putative uncharacterized protein n=1 Tax=Symbiobacterium thermophilum RepID=Q67LZ3_SYMTH Length = 414 Score = 110 bits (275), Expect = 1e-22, Method: Compositional matrix adjust. Identities = 64/197 (32%), Positives = 102/197 (51%), Gaps = 2/197 (1%) Query: 210 EELPASNLVFLIDTSGSMISDERLPLIQSSLKLLVKELREQDNIAIVTYAGDSRIALPSI 269 E P NL ++D SGSM + L + +L+ LV ++ E+D +AIVTY + PS Sbjct: 38 EGRPPLNLAAVVDRSGSM-AGAALYFTKQALRFLVDQMAEEDRLAIVTYDDQVHVPFPSQ 96 Query: 270 SGSHKAEINAAIDSLDAEGSTNGGAGLELAYQQATKGFIKGGINRILLATDGDFNVGIDD 329 K + +D + A G+TN GL QQ G ++R+LL TDG NVG+ D Sbjct: 97 PVVQKDAVRLLVDGITAGGTTNLSGGLATGMQQIRPHAGPGRVSRVLLMTDGLANVGVTD 156 Query: 330 PKSIESMVKKQRESGVTLSTFGVGNSNYNEAMMVRIADVGNGNYSYIDTLSEAQKVLNSE 389 P + + RE G+ +ST GVG +++E ++V +A+ G GN+ YI + ++ E Sbjct: 157 PDVLAGWARAWREKGLAVSTMGVG-PHFSEDLLVALAEAGGGNFHYIANPDQIPRIFQEE 215 Query: 390 MRQMLITVAKDVKAQIE 406 + +L + + IE Sbjct: 216 LHGLLQVAVQGLHLIIE 232 >UniRef50_A1ZUW0 Von Willebrand factor, type A n=1 Tax=Microscilla marina ATCC 23134 RepID=A1ZUW0_9SPHI Length = 425 Score = 107 bits (267), Expect = 1e-21, Method: Compositional matrix adjust. Identities = 62/214 (28%), Positives = 116/214 (54%), Gaps = 4/214 (1%) Query: 208 KSEELPASNLVFLIDTSGSMISDERLPLIQSSLKLLVKELREQDNIAIVTYAGDSRIALP 267 K E +P N+ ++D SGSM S ++L ++ ++ ++ L+ D ++IV Y + + Sbjct: 39 KQERIPL-NISLVVDRSGSM-SGDKLNYVKKAVDFVIDNLKSDDVLSIVQYDDEIDVVAS 96 Query: 268 SISGSHKAEINAAIDSLDAEGSTNGGAGLELAYQQATKGFIKGGINRILLATDGDFNVGI 327 S ++K ++ + + A TN G+ Y Q G +NR+LL +DG N GI Sbjct: 97 SAKVTNKKALHEKVKGIQARNMTNLSGGMMEGYAQVKSTQSNGYVNRVLLLSDGLANAGI 156 Query: 328 DDPKSIESMV-KKQRESGVTLSTFGVGNSNYNEAMMVRIADVGNGNYSYIDTLSEAQKVL 386 P+ ++ + KK RE+G+ LSTFGVG S++NE +M +++ G NY +ID + ++ Sbjct: 157 TAPEQLQQIAQKKFREAGIALSTFGVG-SDFNEVLMTNLSEYGGANYYFIDMPDKIPQIF 215 Query: 387 NSEMRQMLITVAKDVKAQIEFNPAWVTEYRQIGY 420 E+ +L VA++ ++ F +++ + G+ Sbjct: 216 AQELEGLLSVVAQNTTLEVVFPQSYLKCTQVYGF 249 >UniRef50_B5JCH3 Putative uncharacterized protein (Fragment) n=1 Tax=Octadecabacter antarcticus 307 RepID=B5JCH3_9RHOB Length = 197 Score = 103 bits (257), Expect = 2e-20, Method: Compositional matrix adjust. Identities = 55/144 (38%), Positives = 83/144 (57%), Gaps = 14/144 (9%) Query: 96 HIANPGTARYQQFDD-------NPVKQVAQNPLATFSLDVDTGSYANVRRFLNQGLLPPP 148 H+ G+ Y+++D+ NP+K + P++TFS+DVDT +YA +R L +G LPP Sbjct: 61 HVLRDGSTFYEEYDETFANDTPNPLKITSDEPVSTFSIDVDTAAYALIRSSLTRGQLPPT 120 Query: 149 DAVRVEEIVNYFPSDWDIKDKQSIPASKPIPFAMRYELAPAPWNEQRTLLKVDILAKDRK 208 DAVR+EE++NYFP + + ++ PF + PWN L+ + I + Sbjct: 121 DAVRIEEMINYFPYAYPAPEGEA-------PFRPTINVFETPWNADTQLVHIGIQGEMPA 173 Query: 209 SEELPASNLVFLIDTSGSMISDER 232 E+ P NLVFLIDTSGSM S ++ Sbjct: 174 IEDRPPLNLVFLIDTSGSMESADK 197 >UniRef50_C1D2W8 Putative uncharacterized protein n=1 Tax=Deinococcus deserti VCD115 RepID=C1D2W8_DEIDV Length = 418 Score = 102 bits (255), Expect = 3e-20, Method: Compositional matrix adjust. Identities = 59/197 (29%), Positives = 104/197 (52%), Gaps = 2/197 (1%) Query: 213 PASNLVFLIDTSGSMISDERLPLIQSSLKLLVKELREQDNIAIVTYAGDSRIALPSISGS 272 P NL F+ID SGSM S L + + + V++ R D +++V + + +PS + Sbjct: 42 PPLNLAFVIDRSGSM-SGLPLQMAKQAAIAAVRQARPDDRVSVVAFDDRVDVIVPSQLAT 100 Query: 273 HKAEINAAIDSLDAEGSTNGGAGLELAYQQATKGFIKGGINRILLATDGDFNVGIDDPKS 332 + + AI ++D GSTN G Q + G +NR++L +DG NVG+ D + Sbjct: 101 SREAVIQAIGTIDDRGSTNLHGGWLEGATQVAQHLTPGALNRVILLSDGQANVGVTDRRE 160 Query: 333 IESMVKKQRESGVTLSTFGVGNSNYNEAMMVRIADVGNGNYSYIDTLSEAQKVLNSEMRQ 392 I V+ E G++ +T G+G S+Y+E +++ IA+ G+GN+ +++ S E++ Sbjct: 161 IARQVRGLTERGISTTTIGLG-SHYDEELLLAIANAGDGNFEHVEDPSRLPTFFEEELQG 219 Query: 393 MLITVAKDVKAQIEFNP 409 + T + V +E NP Sbjct: 220 LTRTTGRIVSLGLEPNP 236 >UniRef50_A4J6Q3 von Willebrand factor, type A n=1 Tax=Desulfotomaculum reducens MI-1 RepID=A4J6Q3_DESRM Length = 416 Score = 102 bits (254), Expect = 4e-20, Method: Compositional matrix adjust. Identities = 68/220 (30%), Positives = 116/220 (52%), Gaps = 8/220 (3%) Query: 190 PWNEQRTLLKVDILAKDRKSEELPASNLVFLIDTSGSMISDERLPLIQSSLKLLVKELRE 249 P N+Q L V + A + +E P NL F+ID SGSM + E+L + ++ V L Sbjct: 17 PGNKQVAYLMVKLTAPKQVEKERPVQNLSFVIDRSGSM-AGEKLDYTKKAVAFAVGHLSP 75 Query: 250 QDNIAIVTYAGDSRIALPSISGSHKAEINAAIDSLDAEGSTNGGAGLELAYQQATKGFIK 309 QD ++V + + S ++K + A++S+ GSTN G+ L ++ + Sbjct: 76 QDYCSVVAFDDMVTMVASSHQVANKDALKMAVESIYPGGSTNLSGGMLLGVREVKLAHKE 135 Query: 310 GGINRILLATDGDFNVGIDDPKSIESMVKKQRE---SGVTLSTFGVGNSNYNEAMMVRIA 366 INR+LL TDG NVG+ D + +V+K RE GV LSTFG+G ++ E ++ + Sbjct: 136 NQINRVLLLTDGMANVGVTDHSA---LVEKSREMAAGGVNLSTFGLG-EDFEEDLLQAMV 191 Query: 367 DVGNGNYSYIDTLSEAQKVLNSEMRQMLITVAKDVKAQIE 406 + G GN+ YI+ + + E+ +L VA+++ +++ Sbjct: 192 EAGGGNFYYIEKPDQIPGIFEQELTGLLSIVAQNLSVKVK 231 >UniRef50_Q7ULL3 Putative uncharacterized protein n=1 Tax=Rhodopirellula baltica RepID=Q7ULL3_RHOBA Length = 484 Score = 102 bits (253), Expect = 5e-20, Method: Compositional matrix adjust. Identities = 76/287 (26%), Positives = 142/287 (49%), Gaps = 25/287 (8%) Query: 207 RKSEELPASNLVFLIDTSGSMISDERLPLIQSSLKLLVKELREQDNIAIVTYAGDSRIAL 266 + +EE P N+ ++D SGSM S ++L + + + + L + D +++V Y + + + Sbjct: 77 KSAEERPPVNVCLVLDHSGSM-SGQKLARAKEAAEAAIDRLSDDDIVSVVLYDSNVTVLV 135 Query: 267 PSISGSHKAEINAAIDSLDAEGSTNGGAGLELAYQQATKGFIKGGINRILLATDGDFNVG 326 P+ + ++ I I + A ST AG+ + K +NR++L +DG NVG Sbjct: 136 PATKATDRSSIKQKIRGIQAGSSTALFAGVSKGAAEVRKFLADEQVNRVILLSDGLANVG 195 Query: 327 IDDPKSIESMVKKQRESGVTLSTFGVGNSNYNEAMMVRIADVGNGNYSYIDTLSEAQKVL 386 P+ +E + + + +++ST G+G S YNE +MV +A VG GN+++I+ V Sbjct: 196 PKSPQELEGLGRSLMKEAISVSTLGLG-SGYNEDLMVALASVGGGNHAFIEDADSLVSVF 254 Query: 387 NSEMRQMLITVAKDVKAQIEFNPAWVTEYRQIGYEKRQLRVEHFNNDNVDAGDIGAGKHI 446 N E +L VA + + ++ + + V R IG E GDI G+ I Sbjct: 255 NQEFDGLLSVVANEFEIVVKLDES-VRPVRMIGSE----------------GDI-EGQTI 296 Query: 447 TL-LFELTLNGQKASIDKLRYAPDNKLAKSDKTKELAWLKIRWKYPQ 492 + L +L N ++ I + +P + D T++LA + ++++ Q Sbjct: 297 RIPLAQLYANQERYFIVETEVSPGTE----DSTRDLAEVTVQYRNLQ 339 >UniRef50_Q1D9B7 von Willebrand factor type A domain protein n=3 Tax=Cystobacterineae RepID=Q1D9B7_MYXXD Length = 476 Score = 100 bits (249), Expect = 2e-19, Method: Compositional matrix adjust. Identities = 76/267 (28%), Positives = 134/267 (50%), Gaps = 12/267 (4%) Query: 216 NLVFLIDTSGSMISDERLPLIQSSLKLLVKELREQDNIAIVTYAGDSRIALPSI--SGSH 273 NL +ID SGSM S +L + + + L+ L +QD +AI+ Y D + +LPS+ + ++ Sbjct: 96 NLALVIDRSGSM-SGYKLAQAKQAARHLIGLLNDQDRLAIIHYGSDVK-SLPSLEATAAN 153 Query: 274 KAEINAAIDSLDAEGSTNGGAGLELAYQQATKGFIKGGINRILLATDGDFNVGIDDPKSI 333 + + +D + EG TN GAGL Q + G+NR++L +DG G+ + + Sbjct: 154 RERMFQYVDGIWDEGGTNIGAGLSAGRYQLSTAQRTYGVNRLILMSDGQPTEGLTADEEL 213 Query: 334 ESMVKKQRESGVTLSTFGVGNSNYNEAMMVRIADVGNGNYSYIDTLSEAQKVLNSEMRQM 393 M ++ R +G+TLS GVG +++NE +M A+ G G Y +++ ++ + +++Q Sbjct: 214 TRMARELRATGLTLSAIGVG-TDFNEDLMQAFAEYGAGAYGFLEDAAQLSTLFQKDLQQA 272 Query: 394 LITVAKDVKAQIEFNPAWVTEYRQIGYEKRQLRVEHFNNDNVDAGDIGAGKHITLLFELT 453 TVA+ V P + +GY Q N +V D AG+ ++ L Sbjct: 273 GTTVARGVTMTFTLPPG-TSLGEVLGYRASQ----SGNQVHVSLPDFSAGQLERVVVRLN 327 Query: 454 LNGQKASIDKLRYAPDNKLAKSDKTKE 480 + G S+ + D KLA +D ++ Sbjct: 328 VTGD--SVGRTARVLDLKLAYTDLIRD 352 >UniRef50_A0LPD4 von Willebrand factor, type A n=1 Tax=Syntrophobacter fumaroxidans MPOB RepID=A0LPD4_SYNFM Length = 479 Score = 98.6 bits (244), Expect = 5e-19, Method: Compositional matrix adjust. Identities = 76/280 (27%), Positives = 128/280 (45%), Gaps = 21/280 (7%) Query: 143 GLLPPPDAVRVEEIVNYFPSDWDIKDKQSIPASKPIPFAMRYELAPAPWNEQRTLLKVDI 202 G PP + VN S I+DK + + A+ E AP ++D Sbjct: 37 GRKPPDPGLARAGTVNL--SGRLIQDKVHMGGDGTVTVALTLECDRAPGGNVEARRELD- 93 Query: 203 LAKDRKSEELPASNLVFLIDTSGSMISDERLPLIQSSLKLLVKELREQDNIAIVTYAGD- 261 +V ++D SGSM +L + ++ L+ L E D A+V+Y+ Sbjct: 94 --------------MVVVMDRSGSMADAGKLTHARQAVLNLLSRLSETDRFALVSYSDHV 139 Query: 262 -SRIALPSISGSHKAEINAAIDSLDAEGSTNGGAGLELAYQQATKGFIKGGINRILLATD 320 L I+ +++A + + + G+TN G GL+ Q + G ++R++L +D Sbjct: 140 QRHGGLLPITPANRATLERIVRGIQPGGATNLGGGLQEGISQLAELQQNGRLSRLILISD 199 Query: 321 GDFNVGIDDPKSIESMVKKQRESGVTLSTFGVGNSNYNEAMMVRIADVGNGNYSYIDTLS 380 G N G+ DP ++ +M E G +ST GVG ++NE +M IAD G GNY+++++ S Sbjct: 200 GLANRGVTDPSALGTMASVAAERGYAVSTVGVG-LDFNEHLMTSIADKGAGNYTFMESAS 258 Query: 381 EAQKVLNSEMRQMLITVAKDVKAQIEFNPAWVTEYRQIGY 420 +V + E R VA V+ + +P +T GY Sbjct: 259 AFAQVFDKEFRDAGTVVASSVEVHVPLSPG-MTLVHAAGY 297 >UniRef50_A9GUK8 Putative uncharacterized protein yfbK n=1 Tax=Sorangium cellulosum 'So ce 56' RepID=A9GUK8_SORC5 Length = 521 Score = 98.6 bits (244), Expect = 5e-19, Method: Compositional matrix adjust. Identities = 80/328 (24%), Positives = 142/328 (43%), Gaps = 37/328 (11%) Query: 206 DRKSEELPASNLVFLIDTSGSMISDERLPLIQSSLKLLVKELREQDNIAIVTYAGDSRIA 265 D + E P +LV +DTSGSM D + +++ L ++ L+ D I++V Y+ + + Sbjct: 122 DLGALERPPLHLVIAVDTSGSMEGDP-IAYVRAGLVEMIDALQPTDRISLVRYSDAAEVV 180 Query: 266 LPSISGSHKAEINAAIDSLDAEGSTNGGAGLELAYQQATKGFIKGGINRILLATDGDFNV 325 L GS + + A + L A GSTN GL AY A + NR++ +DG Sbjct: 181 LEQAEGSDREALTEAFEGLTARGSTNLYEGLFTAYALAEQHLDPAWQNRVIFLSDGVATA 240 Query: 326 GIDDPKSIESMVKKQRESGVTLSTFGVGNSNYNEAMMVRIADVGNGNYSYIDTLSEAQKV 385 G+ P+ + S+ E G+ L+ GVG + ++ M I++VG GN+ +++ ++V Sbjct: 241 GLTSPQRLVSLAAGYAEKGIGLTAIGVG-AEFDVDAMRGISEVGAGNFYFLEDPKAVEEV 299 Query: 386 LNSEMRQMLITVAKDVKAQIEFNPAWVTEYRQIGYEKRQLRVEHFNNDNVDAGDIGAGKH 445 E++ L+ +A DV+ + +V + + G+ G H Sbjct: 300 FAEEVKTFLVPLALDVELDVAVGDGYVVR-------------GAYGTNGWQGGERGGAVH 346 Query: 446 ITLLFELTLNGQKASIDK-------------LRYAPDNKLAKSDKTKELAWLKIRWKYPQ 492 I LF L G+ ++ + L P + + + L + W++P Sbjct: 347 IPSLF---LAGRTSAAEPVGSGRRGGGGAILLELVPKPDQRGVEDPRAVGSLALSWRHPL 403 Query: 493 GKESQL----VEFPLGPTINAPSEDMRF 516 E+ +E P P +AP E F Sbjct: 404 TGEAHAQEVDIEAPSAP--DAPPEAGYF 429 >UniRef50_B5JPY1 von Willebrand factor type A domain protein n=1 Tax=Verrucomicrobiae bacterium DG1235 RepID=B5JPY1_9BACT Length = 632 Score = 97.4 bits (241), Expect = 1e-18, Method: Compositional matrix adjust. Identities = 82/284 (28%), Positives = 136/284 (47%), Gaps = 29/284 (10%) Query: 136 VRRFLNQGLLPPPDAVRVEEIVNY--FPSDWDIKDKQSIPASKPIPFAMRYELAPAPWNE 193 +R +++G++P P + E + + P D K+ + A +E A P Sbjct: 62 LRNLIDEGIIPSPASFTAEGLFSEHDLPIGGDAKEGWLFDIASQ---ATSFESAAQP--- 115 Query: 194 QRTLLKVDILAK-------DRKSEELPASNLVFLIDTSGSMISDERLPLIQSSLKLLVKE 246 KVDILA+ D + + NLV ++D SGSM D L L++ SL+ +V + Sbjct: 116 -----KVDILAQLGFVSGIDATTFKPAPLNLVAVVDKSGSMSGDP-LELVRKSLRQVVSQ 169 Query: 247 LREQDNIAIVTYAGDSRIAL--PSISGSHKAEINAAIDSLDAEGSTNGGAGLELAYQ--- 301 L D ++IV Y + I L S ++ +I A+ID + + GST AGLEL YQ Sbjct: 170 LGSDDQLSIVLYGSSTHIHLEPTKTSTENRDQIIASIDRIQSHGSTAMEAGLELGYQVAR 229 Query: 302 QATKGFIKGGINRILLATDGDFNVGIDDPKSIESMVKKQRESGVTLSTFGVGNSNYNEAM 361 Q+ F+ G R++L TD NVG D +M + +S + L+T GVG ++ + Sbjct: 230 QSADAFV--GKTRVMLFTDERPNVGRTDATGFMAMAESGSKSDIGLTTIGVG-VHFGAEL 286 Query: 362 MVRIADVGNGNYSYIDTLSEAQKVLNSEMRQMLITVAKDVKAQI 405 +I+ V GN + D + E+ M++ +A D+ ++ Sbjct: 287 AEKISSVRGGNLFFFDDDESMETTFRKELDTMVLELAYDMSLKV 330 >UniRef50_D2LQW0 von Willebrand factor type A n=1 Tax=Bacillus cellulosilyticus DSM 2522 RepID=D2LQW0_BACS4 Length = 282 Score = 97.1 bits (240), Expect = 2e-18, Method: Compositional matrix adjust. Identities = 61/228 (26%), Positives = 113/228 (49%), Gaps = 3/228 (1%) Query: 178 IPFAMRYELAPAPWNEQRTLLKVDILAKDRKSEELPASNLVFLIDTSGSMISDERLPLIQ 237 + FA +YE P E LL AK + +E P NL L+D SGSM S E L + Sbjct: 7 LSFAHQYENVPCKGKEAAYLLVELTGAKVKHTERSPI-NLSLLLDRSGSM-SGEPLRYCK 64 Query: 238 SSLKLLVKELREQDNIAIVTYAGDSRIALPSISGSHKAEINAAIDSLDAEGSTNGGAGLE 297 + ++ +L ++D +++V + + +HK + I ++ G TN GL Sbjct: 65 EACNFVINQLTDKDILSVVVFDDQVETIIEPQKVTHKDLLKEYIQRIETRGITNLSGGLI 124 Query: 298 LAYQQATKGFIKGGINRILLATDGDFNVGIDDPKSIESMVKKQRESGVTLSTFGVGNSNY 357 Q K +K +NR++L +DG N GI D +++ + + +G+ +ST GV + ++ Sbjct: 125 QGCQHVLKQEVKNYVNRVILLSDGQANAGITDKEALVKLADDYQSAGLVISTLGV-SEHF 183 Query: 358 NEAMMVRIADVGNGNYSYIDTLSEAQKVLNSEMRQMLITVAKDVKAQI 405 +E ++ +AD G GN+ +I+ + + E+ +L + +++ I Sbjct: 184 DEELLEGVADSGRGNFHFINEVENIPSIFEQELDGLLNVIGQNITLNI 231 >UniRef50_A9FTM1 Putative uncharacterized protein n=1 Tax=Sorangium cellulosum 'So ce 56' RepID=A9FTM1_SORC5 Length = 535 Score = 96.3 bits (238), Expect = 3e-18, Method: Compositional matrix adjust. Identities = 107/431 (24%), Positives = 183/431 (42%), Gaps = 47/431 (10%) Query: 132 SYANVRRFLNQGLLPPPDAVRVEEIVNYFPSDWDIKDKQSIPASKPIPFAMRYELAPAPW 191 S +VR L G P P VR E +NY+ D+ D+ +R E P Sbjct: 130 SPVHVRELLRSGRAPEPWQVRTYEFLNYYRIDYAPPDEGE----------LRVEPQIEPG 179 Query: 192 NEQRTL-LKVDILAKDRKSEELPASNLVFLIDTSGSMISDERLPLIQSSLKLLVKELREQ 250 E + L++ + + D S P + + F++DTSGSM E + +++++ + L E Sbjct: 180 EEAGSYALQIGVRSYDPPSPRRPIA-VTFVLDTSGSM-DGEPMAREKATVRAVAASLSEG 237 Query: 251 DNIAIVTYAGDSRIALPS--ISGSHKAEINAAIDSLDAEGSTNGGAGLELAYQQATKGFI 308 D + +VT+ + + L + G + AA D+L A G T+ +GL + YQ A + F Sbjct: 238 DVVNMVTWNTQNSVILSGHVVDGPDDPALLAAADALSASGGTDLESGLRVGYQLAQEHFE 297 Query: 309 KGGINRILLATDGDFNVGIDDPKSIESMVKKQRESGVTLSTFGVGNS-NYNEAMMVRIAD 367 +G INR++L +DG NVG+ + I + + + L G G + YN+ +M + D Sbjct: 298 EGRINRVILVSDGGANVGVTSEELIALHAEDADQEAIYLVGVGTGPALGYNDVLMDAVTD 357 Query: 368 VGNGNYSYIDTLSEAQKVLNSEMRQMLITVAKDVKAQIEFNPAWVTEYRQIGYEKRQLRV 427 G G Y Y+D EA + +++ A+ V Q+E W + + E+ Sbjct: 358 KGRGAYVYLDDEDEAFHMFRDRFAEVMEVAARGV--QVELTMPWYFKMEKFYGEEYSTNP 415 Query: 428 EHFNNDNVDAGDIGAGKHITLLFELTLNGQKASIDKLRYAPDNKLAKSDKTKELAWLKIR 487 E ++ GD ++F + G + ++ ++ R Sbjct: 416 EEVEPQHLAPGD-------AMIFSQLVRGCDPGV--------------INDEDTLTVRAR 454 Query: 488 WKYPQGKESQLV--EFPLGPTINAPSEDMRFRAAVAAYGQKLRG--SEYLNNTSWQQIKQ 543 W+ P E++ V E L E + A+ AY + L+ SE L+ Q I Sbjct: 455 WQTPLTHEAKEVSREATLAELAAGSKEQLVKGKAIVAYAEALKAGTSEALHAAREQVIAA 514 Query: 544 WAQQAKGEDPQ 554 A G DP+ Sbjct: 515 NA----GGDPE 521 >UniRef50_A7C0I2 von Willebrand factor type A domain protein n=1 Tax=Beggiatoa sp. PS RepID=A7C0I2_9GAMM Length = 150 Score = 95.5 bits (236), Expect = 4e-18, Method: Compositional matrix adjust. Identities = 40/56 (71%), Positives = 50/56 (89%) Query: 105 YQQFDDNPVKQVAQNPLATFSLDVDTGSYANVRRFLNQGLLPPPDAVRVEEIVNYF 160 Y FD+NP+KQV++ P++TFS+DVDTGSYANVRRF+N G LPP DAVRVEE++NYF Sbjct: 88 YAHFDNNPIKQVSEQPVSTFSIDVDTGSYANVRRFINSGRLPPHDAVRVEEMLNYF 143 >UniRef50_A9F1H2 Family membership n=1 Tax=Sorangium cellulosum 'So ce 56' RepID=A9F1H2_SORC5 Length = 607 Score = 93.6 bits (231), Expect = 2e-17, Method: Compositional matrix adjust. Identities = 65/222 (29%), Positives = 109/222 (49%), Gaps = 8/222 (3%) Query: 190 PW---NEQRTLLKVDI-LAKDRKSEELPASNLVFLIDTSGSMISDERLPLIQSSLKLLVK 245 PW LL+V+I + + + +L +ID SGSM S E+L L + + ++ Sbjct: 9 PWLPAEPSERLLRVEITVPRPEGGQARKPVHLSLVIDRSGSM-SGEKLRLALEAARQAIR 67 Query: 246 ELREQDNIAIVTYAGDSRIALPSISGSHKAEINA--AIDSLDAEGSTNGGAGLELAYQQA 303 L+ D ++VT+ + +PS + A + A A+D++ A G+T+ G G + Sbjct: 68 TLQPGDRFSVVTFDHQVEVPIPSTDATPGARLRAEAALDTVIARGNTDLGGGWLRGCAEV 127 Query: 304 TKGFIKGGINRILLATDGDFNVGIDDPKSIESMVKKQRESGVTLSTFGVGNSNYNEAMMV 363 + I R+LL TDG N GI P + S + QR VT ST G+G +NE ++ Sbjct: 128 GAHLPEDAIGRVLLLTDGQANHGITSPDELTSRARSQRLRRVTTSTIGLGE-GFNEFLLG 186 Query: 364 RIADVGNGNYSYIDTLSEAQKVLNSEMRQMLITVAKDVKAQI 405 R+++ G GN+ + E + E+ ++L VA+D I Sbjct: 187 RLSEEGGGNFYFAARADELPGFVGREIGEVLSVVARDAALVI 228 >UniRef50_A9V370 Predicted protein n=1 Tax=Monosiga brevicollis RepID=A9V370_MONBE Length = 471 Score = 92.8 bits (229), Expect = 3e-17, Method: Compositional matrix adjust. Identities = 66/197 (33%), Positives = 107/197 (54%), Gaps = 19/197 (9%) Query: 193 EQRTLLKVDILAKDRKSEELPASNLVFLIDTSGSMISDERLPLIQSSLKLLVKELREQDN 252 E + + A D + E PA +LV +ID SGSM + ++L ++QS+L+ L++ L++ D Sbjct: 36 ESSAYISCRLTAPDFEPVERPAIDLVAVIDVSGSM-AGQKLKMVQSTLEFLMRNLKDTDR 94 Query: 253 IAIVTYAGDSRIA--LPSISGSHKAEINAAIDSLDAEGSTN--GGA--GLELAYQQATKG 306 A+VT+ D + L ++ +HK A + L A TN GG G+EL Q +G Sbjct: 95 FALVTFDSDVKTVFDLRPMTTAHKEACLADVQKLRAGSCTNLSGGLFRGVELMQQ---RG 151 Query: 307 FIKGGINRILLATDGDFNVGIDDPKSIESMVKKQR-----ESGVTLSTFGVGNSNYNEAM 361 KG ++ ILL TDG N G+ D + M + R T+ TFG G ++NE M Sbjct: 152 ATKGAVSSILLMTDGIANEGVRDK---DDMCRALRGLMGPAPDYTIYTFGYGK-DHNENM 207 Query: 362 MVRIADVGNGNYSYIDT 378 + ++++ GNG Y +I++ Sbjct: 208 LRQLSETGNGMYYFIES 224 >UniRef50_A6GDG5 Putative lipoprotein n=1 Tax=Plesiocystis pacifica SIR-1 RepID=A6GDG5_9DELT Length = 486 Score = 92.4 bits (228), Expect = 4e-17, Method: Compositional matrix adjust. Identities = 90/381 (23%), Positives = 175/381 (45%), Gaps = 41/381 (10%) Query: 151 VRVEEIVNYFPSDWDIKDKQSIPASKPIPFAMRYELAPAPWNEQRTLLKVDILAK-DRKS 209 +R E +NY+ ++ PA+ P ++ +L +E R L++ + ++ S Sbjct: 102 IRPWEFLNYYSFEY--------PAADPGDLSVHVDLRSK--DEGRFQLQIGVASEIVSPS 151 Query: 210 EELPASNLVFLIDTSGSMISDERLPLIQSSLKLLVKELREQDNIAIVTYAGDS--RIALP 267 E LP N+ ++D S SM + + ++++ + + LRE D I++V+++ + R+A Sbjct: 152 ERLPM-NITLVLDESTSM-TGAPMYAMKATARAIAGSLREGDVISLVSWSNSNNVRLASH 209 Query: 268 SISGSHKAEINAAIDSLDAEGSTNGGAGLELAYQQATKGFIKGGINRILLATDGDFNVGI 327 +++GS+ A + ID+++ G T+ AGLE Y A F INR++L +DG N+G Sbjct: 210 AVAGSNDATLLDTIDAIEPGGGTDLHAGLEQGYALAQANFSADRINRVVLVSDGGANLGF 269 Query: 328 DDPKSIESMVKKQRESGVTL-STFGVGNSNYNEAMMVRIADVGNGNYSYIDTLSEAQKVL 386 D + I M + + G+ + YN+ +M + D G G +I +EA+++ Sbjct: 270 TDAELIAQMAELEDGEGIYMVGVGVGDVGRYNDELMDTVTDQGKGASVFIPNEAEAERMF 329 Query: 387 NSEMRQMLITVAKDVKAQIEFNPAWVTEYRQIGYEKRQLRVEHFNNDNVDAGDIGAGKHI 446 + A+DV+ ++ P G+E + E F++D + + Sbjct: 330 GERFMSTMGVAARDVRVELSLPP---------GFEIVRFSGEEFSDDPSEIEPQHLAPND 380 Query: 447 TLLFELTLNGQKASIDKLRYAPDNKLAKSDKTKELAWLKIRWKYPQGKES--QLVEFPLG 504 ++F L AP +LA D L + +RW+ P K++ + VE+ Sbjct: 381 AMVFYQELE---------TCAP--ELATEDA---LLGVVVRWREPFSKQARERAVEYAFA 426 Query: 505 PTINAPSEDMRFRAAVAAYGQ 525 + A S + A+ +Y + Sbjct: 427 DLLGAESPMLDKGQAILSYAE 447 >UniRef50_C4XPW8 Putative uncharacterized protein n=1 Tax=Desulfovibrio magneticus RS-1 RepID=C4XPW8_DESMR Length = 439 Score = 89.4 bits (220), Expect = 3e-16, Method: Compositional matrix adjust. Identities = 59/190 (31%), Positives = 93/190 (48%), Gaps = 2/190 (1%) Query: 216 NLVFLIDTSGSMISDERLPLIQSSLKLLVKELREQDNIAIVTYAGDSRIALPSISGSHKA 275 NL ID SGSM + L + +V +L+ D ++++ Y +PS+ KA Sbjct: 46 NLALAIDRSGSM-AGRPLEEAKRCASFVVDKLKNTDRVSLIAYDSSIETRVPSVKVEDKA 104 Query: 276 EINAAIDSLDAEGSTNGGAGLELAYQQATKGFIKGGINRILLATDGDFNVGIDDPKSIES 335 + AI+ +D G TN G +Q + I+RI+L +DG N G+ D I Sbjct: 105 IFHRAIEGIDDGGCTNLHGGWLKGAEQISPYIDPSTISRIILLSDGQANEGLTDEAEIFK 164 Query: 336 MVKKQRESGVTLSTFGVGNSNYNEAMMVRIADVGNGNYSYIDTLSEAQKVLNSEMRQMLI 395 ++ ++GVT ST+G+G SN+NE +M+ +A G GN Y T + E+ + Sbjct: 165 QCRELADAGVTTSTYGLG-SNFNETLMIGMAKNGQGNSYYGRTADDLMDPFQEELSLLEA 223 Query: 396 TVAKDVKAQI 405 AK V+A I Sbjct: 224 LFAKQVRASI 233 >UniRef50_B4W304 von Willebrand factor type A domain protein (Fragment) n=2 Tax=Cyanobacteria RepID=B4W304_9CYAN Length = 538 Score = 87.8 bits (216), Expect = 1e-15, Method: Compositional matrix adjust. Identities = 55/191 (28%), Positives = 98/191 (51%), Gaps = 2/191 (1%) Query: 216 NLVFLIDTSGSMISDERLPLIQSSLKLLVKELREQDNIAIVTYAGDSRIALPSISGSHKA 275 NL ++D SGSM IQ++ + L+ L D +++V Y + + +P +A Sbjct: 42 NLSLVLDRSGSMAGAPLRYAIQAA-QNLIDYLTADDFVSVVIYDDTAEVIIPPQLVGDQA 100 Query: 276 EINAAIDSLDAEGSTNGGAGLELAYQQATKGFIKGGINRILLATDGDFNVGIDDPKSIES 335 + A I + A G TN G L Q INR+LL TDG N GI DP+ + Sbjct: 101 ALKAKIGKIRARGCTNLSGGWLLGCSQVQANQSPERINRVLLLTDGLANYGIKDPQVLTK 160 Query: 336 MVKKQRESGVTLSTFGVGNSNYNEAMMVRIADVGNGNYSYIDTLSEAQKVLNSEMRQMLI 395 ++ E+ + +T G GN +NE +++ +A+ GN+ +I + +A +V EM ++ Sbjct: 161 TALEKAEADIVTTTLGFGNY-FNEDLLINMANAARGNFYFIQSPDDASQVFEIEMESLVS 219 Query: 396 TVAKDVKAQIE 406 VA++++ +++ Sbjct: 220 VVAQNLRVRLQ 230 >UniRef50_A9F2Q0 Putative uncharacterized protein n=1 Tax=Sorangium cellulosum 'So ce 56' RepID=A9F2Q0_SORC5 Length = 521 Score = 87.0 bits (214), Expect = 2e-15, Method: Compositional matrix adjust. Identities = 69/280 (24%), Positives = 128/280 (45%), Gaps = 19/280 (6%) Query: 183 RYELAPAPWNEQRTLLKVDILAKDRKSEELPASNLVFLIDTSGSMISDERLPLIQSSLKL 242 R A P + Q T L ++ + L +NL +ID SGSM +Q++ Sbjct: 74 RVGHARLPRSAQETFLMFEVRGDGSPARSLAQANLSLVIDRSGSMKGTRLTNAVQAATTA 133 Query: 243 LVKELREQDNIAIVTYAGDSRIALP--SISGSHKAEINAAIDSLDAEGSTNGGAGLELAY 300 V L + D +++VT+ + + +P ++ + I A++ + G T G+E Sbjct: 134 -VSRLNDGDVVSVVTFDTRTSVVVPPTTVGPETRGRILASVRGISLGGDTCISCGIEEGL 192 Query: 301 QQATKGFIKGGINRILLATDGDFNVGIDDPKSIESMVKKQRESGVTLSTFGVGNSNYNEA 360 + G G++R+L+ +DGD N G+ D +M ++ R+ GV ++T GV + +YNE Sbjct: 193 --SLLGQTSAGVSRMLVLSDGDANHGVRDVPGFRAMAQRARDRGVAITTIGV-DVDYNEK 249 Query: 361 MMVRIADVGNGNYSYIDTLSEAQKVLNSEMRQMLITVAKDVKAQIEFNPAWVTEYRQIGY 420 ++ IA NG + +++ + ++ +E Q+ +VA + I+ P G Sbjct: 250 ILSAIALDSNGRHYFVENDAALARIFEAEAEQLTTSVASGAELAIDLAP---------GV 300 Query: 421 EKRQLRVEHFNNDN----VDAGDIGAGKHITLLFELTLNG 456 E ++ F V G AG+ T+L ++ L G Sbjct: 301 ELDRVFDRSFRRAGDQVIVPLGAFAAGEVKTVLLKVRLGG 340 >UniRef50_D0LL92 von Willebrand factor type A n=1 Tax=Haliangium ochraceum DSM 14365 RepID=D0LL92_HALO1 Length = 430 Score = 86.7 bits (213), Expect = 2e-15, Method: Compositional matrix adjust. Identities = 80/295 (27%), Positives = 139/295 (47%), Gaps = 23/295 (7%) Query: 170 QSIPASKPIPFAM----RYELAPAPWNEQRTLLKVDILAKDRKSEELPAS----NLVFLI 221 Q PA + A+ +Y+L P+ E +++++ + + PA+ +L +I Sbjct: 4 QPTPAQRAGSVAVTVTPQYDLLPSNARELNLMVRLE------GTGDAPATRAPLDLALVI 57 Query: 222 DTSGSMISDERLPLIQSSLKLLVKELREQDNIAIVTYAGDSRIALPSISG--SHKAEINA 279 D SGSM D+ + ++L+LL + L+ +D I +V+Y+ D + L + + E Sbjct: 58 DRSGSMSGDKLSDVKTAALELL-ETLQPEDTITLVSYSSDVSMHLMRTRADDAGQREARR 116 Query: 280 AIDSLDAEGSTNGGAGLELAYQQATKGFIKGGINRILLATDGDFNVGIDDPKSIESMVKK 339 A+ +L A G T G GL A + + ++ ++L +DG N G P + + Sbjct: 117 ALLALQARGGTALGPGLFRALEALEGASDRTRMSHLMLFSDGIANAGEVRPSVLGARAAG 176 Query: 340 QRESGVTLSTFGVGNSNYNEAMMVRIADVGNGNYSYIDTLSEAQKVLNSEMRQMLITVAK 399 +GV++ST GVG +YNE +M R+AD G G Y +I +L+ EM+ ++ TVA+ Sbjct: 177 AFGAGVSVSTMGVG-VDYNEDLMTRLADQGGGRYHFIQDSEAIASILDDEMKGLVATVAR 235 Query: 400 DVKAQIEFNPAWVTEYRQIGYEKRQLRVEHFNNDNVDAGDIGAGKHITLLFELTL 454 V + V R GY E + G +GAG+ +L + L Sbjct: 236 GVTMDLT-RAEGVGTVRVFGYASE----ESAGRVHTRVGSLGAGQTRAILVRIDL 285 >UniRef50_C1XMC3 Uncharacterized protein containing a von Willebrand factor type A (VWA) domain n=1 Tax=Meiothermus ruber DSM 1279 RepID=C1XMC3_MEIRU Length = 412 Score = 86.7 bits (213), Expect = 2e-15, Method: Compositional matrix adjust. Identities = 67/218 (30%), Positives = 107/218 (49%), Gaps = 16/218 (7%) Query: 192 NEQRTLLKVDILAKDRKSEELPASNLVFLIDTSGSMISDERLPLIQSSLKLLVKELREQD 251 Q+ LL++ + E P NL ++D SGSM +L + + V L +D Sbjct: 28 TRQQVLLRIHTPTPQARPER-PLLNLALVLDRSGSM-GGSKLKYTKEAAIYAVHNLLPED 85 Query: 252 NIAIVTYAGDSRIALPSIS-GSHKAEINAAIDSLDAEGSTNGGAG-LE-----LAYQQAT 304 +A+V Y + +PS +A I I ++ GST AG LE AYQ+A Sbjct: 86 RVAVVIYDDAVEVLVPSTPVADGRAAIANLIRTIRTGGSTALHAGWLEGATQVAAYQEA- 144 Query: 305 KGFIKGGINRILLATDGDFNVGIDDPKSIESMVKKQRESGVTLSTFGVGNSNYNEAMMVR 364 G +NR++L +DG N G +P I V++ GV+ ST GVG +YNE +M Sbjct: 145 -----GRLNRVVLLSDGLANRGETNPGVIAEQVRELARRGVSTSTLGVG-LDYNEDLMTT 198 Query: 365 IADVGNGNYSYIDTLSEAQKVLNSEMRQMLITVAKDVK 402 +AD G GNY +I++ ++ ++ E+ + T+ V+ Sbjct: 199 MADAGEGNYYFIESPADLPRIFAQELAGLAGTLGTRVR 236 >UniRef50_Q8YW34 All1782 protein n=5 Tax=Cyanobacteria RepID=Q8YW34_ANASP Length = 615 Score = 85.9 bits (211), Expect = 4e-15, Method: Compositional matrix adjust. Identities = 63/215 (29%), Positives = 107/215 (49%), Gaps = 9/215 (4%) Query: 198 LKVDILAKDRKSEELPAS-----NLVFLIDTSGSMISDERLPLIQSSLKLLVKELREQDN 252 LK +IL + R E+P S NL +ID SGSM + L + + +V +L +D Sbjct: 19 LKANILLRFRA--EIPESPRRNLNLSLVIDRSGSM-AGAALHHALKAAESVVDQLEPKDI 75 Query: 253 IAIVTYAGDSRIALPSISGSHKAEINAAIDSLDAEGSTNGGAGLELAYQQATKGFIKGGI 312 +++V Y +P + K + +I + A G TN G + I Sbjct: 76 LSVVVYDDAVDTVVPPQPVTDKPALKKSIRQVRAGGITNLSGGWLKGCEYVKHQLDPQKI 135 Query: 313 NRILLATDGDFNVGIDDPKSIESMVKKQRESGVTLSTFGVGNSNYNEAMMVRIADVGNGN 372 NR+LL TDG N+GI DPK + + ++ E G+T +T G +NE +++ +A NGN Sbjct: 136 NRVLLLTDGHANMGIQDPKILTATSTQKAEEGITTTTLGFAQ-GFNEDLLIGMARAANGN 194 Query: 373 YSYIDTLSEAQKVLNSEMRQMLITVAKDVKAQIEF 407 + +I ++ EA +V + E+ + V +++K +E Sbjct: 195 FYFIQSIDEAAEVFSIELDSLRSVVGQNLKVTLEL 229 >UniRef50_B0SHY6 Anti-sigma factor antagonist n=2 Tax=Leptospira biflexa serovar Patoc RepID=B0SHY6_LEPBA Length = 550 Score = 84.3 bits (207), Expect = 1e-14, Method: Compositional matrix adjust. Identities = 63/224 (28%), Positives = 103/224 (45%), Gaps = 4/224 (1%) Query: 221 IDTSGSMISDERLPLIQSSLKLLVKELREQDNIAIVTYAGDSRIALPSISGSHKAEINAA 280 ID S SM ++ +I +S L V L D ++IV Y+ D ++ P + K + Sbjct: 48 IDKSWSMKGEKMEAVIDASCAL-VNWLTRHDAVSIVAYSADVQLIQPVTHLTEKVSVTDK 106 Query: 281 IDSLDAEGSTNGGAGLELAYQQATKGFIKGGINRILLATDGDFNVGIDDPKSIESMVKKQ 340 I ++ STN G A + + I R+LL TDG+ GI D +++ ++ Sbjct: 107 IRNIQVATSTNLSGGWLSALKSLNQSKIPNAYKRVLLLTDGNPTSGIKDKEALVTIAADH 166 Query: 341 RESGVTLSTFGVGNSNYNEAMMVRIADVGNGNYSYIDTLSEAQKVLNSEMRQMLITVAKD 400 G++ +T GVGN ++NE M+V IA G GN+ YID A + E + A+ Sbjct: 167 LSMGISTTTIGVGN-DFNEEMLVEIAKAGGGNFYYIDNPENASDIFFEEFGDIGALYAQA 225 Query: 401 VKAQIEFNPAWVTEYRQIGYEKRQLRVEHFNNDNVDAGDIGAGK 444 + +++ P +Q+ E +E F+ DA I K Sbjct: 226 IDVELQLAPG--VRLKQVLSETSHQVMEEFDEFLGDAKTISRQK 267 >UniRef50_A9QZI4 von Willebrand factor type A domain protein n=26 Tax=Gammaproteobacteria RepID=A9QZI4_YERPG Length = 472 Score = 84.0 bits (206), Expect = 1e-14, Method: Compositional matrix adjust. Identities = 62/230 (26%), Positives = 113/230 (49%), Gaps = 8/230 (3%) Query: 182 MRYELAPAPW----NEQRTLLKVDILAKDRKSEELPASNLVFLIDTSGSMISDERLPLIQ 237 ++ ELA AP +E + LK+ + + S NL +ID S SM S ER+ + Sbjct: 60 VKSELA-APVMLANSEDKNYLKISLTGFNLDSTRRSPINLALVIDRSTSM-SGERIEKAR 117 Query: 238 SSLKLLVKELREQDNIAIVTYAGDSRIALPSISGSHKAEINAAID-SLDAEGSTNGGAGL 296 L V L D +++V Y + + +P+ + K + A+I + G T AG+ Sbjct: 118 EEAILAVNMLNITDTLSVVAYDNHAEVIIPATKVTDKPALIASIQQHIHPRGMTALFAGV 177 Query: 297 ELAYQQATKGFIKGGINRILLATDGDFNVGIDDPKSIESMVKKQRESGVTLSTFGVGNSN 356 + Q K + +NRI+L +DG N G + + + + G+ ++T G+G + Sbjct: 178 SMGIGQVDKHLNREQVNRIILISDGQANTGPTSISELSDLARMAAKKGIAITTIGLGQ-D 236 Query: 357 YNEAMMVRIADVGNGNYSYIDTLSEAQKVLNSEMRQMLITVAKDVKAQIE 406 YNE +M IA +GN++++ ++ +K E + ++ VA+D+ QI+ Sbjct: 237 YNEDLMTAIAGYSDGNHTFVANSADLEKAFTKEFQDVMSVVAQDIVVQIK 286 >UniRef50_A6G9E8 von Willebrand factor type A domain protein n=1 Tax=Plesiocystis pacifica SIR-1 RepID=A6G9E8_9DELT Length = 532 Score = 84.0 bits (206), Expect = 1e-14, Method: Compositional matrix adjust. Identities = 67/244 (27%), Positives = 118/244 (48%), Gaps = 18/244 (7%) Query: 216 NLVFLIDTSGSMISDERLPLIQSSLKLLVKELREQDNIAIVTYAGDSRIALPSISGSHK- 274 NL +ID SGSM + ++ + + LR+ D +++V+Y + +P + + Sbjct: 136 NLAIVIDHSGSMKGQRERNALDAAAGM-ISRLRDGDTVSVVSYNTKAHTIVPVTTLDARN 194 Query: 275 -----AEINAAIDSLDAEGSTNGGAGLELAYQQATKGFIKGGINRILLATDGDFNVGIDD 329 +++ + S + G+T G+E Q T + GI+R+LL +DG+ N G+ D Sbjct: 195 RDRVISDLRVGVASRPS-GNTCVSCGVEAGLQ--TLQGRRPGIDRMLLLSDGEANRGVRD 251 Query: 330 PKSIESMVKKQRESGVTLSTFGVGNSNYNEAMMVRIADVGNGNYSYIDTLSEAQKVLNSE 389 I + ++ R GV++S+ GV + +YNE +M IA NG + + +T S + + E Sbjct: 252 EPGIRRLAREARNRGVSISSIGV-DVDYNEVLMSAIAREANGRHYFSETGSNLDAIFDQE 310 Query: 390 MRQMLITVAKDVKAQIEFNPAW-VTEYRQIGYEKRQLRVEHFNNDNVDAGDIGAGKHITL 448 + ++ +AKD + +E P V E Y++ RV V G AG+ TL Sbjct: 311 LDSLIQAIAKDGQVIVELAPGVRVVEVFDRSYQQVDRRV------IVPMGTFSAGEDKTL 364 Query: 449 LFEL 452 L L Sbjct: 365 LMRL 368 >UniRef50_A5UTA6 von Willebrand factor, type A n=6 Tax=Chloroflexi (class) RepID=A5UTA6_ROSS1 Length = 425 Score = 84.0 bits (206), Expect = 1e-14, Method: Compositional matrix adjust. Identities = 54/194 (27%), Positives = 101/194 (52%), Gaps = 5/194 (2%) Query: 216 NLVFLIDTSGSMISDERLPLIQSSLKLLVKELREQDNIAIVTYAGDSRIALPSISGSHKA 275 NL ++D S SM ERL ++ + +V +L D ++V + + + +P+ K+ Sbjct: 46 NLCLVLDRSSSM-RGERLMQVKEAAARIVDQLGPDDYFSLVVFNDRADVVIPAQRAIKKS 104 Query: 276 EINAAIDSLDAEGSTNGGAGLELAYQQATKGFIKGGINRILLATDGDFNVGIDDPKSIES 335 ++ AAI ++A G T GL LA Q+ + F+ GI+R++L TDG G D+ + +E Sbjct: 105 DLKAAIAQIEAAGGTEMAQGLALALQEVQRPFLTRGISRLILLTDGR-TYG-DESRCVE- 161 Query: 336 MVKKQRESGVTLSTFGVGNSNYNEAMMVRIADVGNGNYSYIDTLSEAQKVLNSEMRQMLI 395 + ++ + G+ L+ G+G + +NE ++ + N YI T + KV E++++ Sbjct: 162 IARRGQSRGIGLTALGIG-TEWNEDLLETMTASENSRAQYIATAQDVVKVFADEVKRLHA 220 Query: 396 TVAKDVKAQIEFNP 409 A+ V +E P Sbjct: 221 IFAQQVHLSLETRP 234 >UniRef50_D0LNY0 von Willebrand factor type A n=1 Tax=Haliangium ochraceum DSM 14365 RepID=D0LNY0_HALO1 Length = 808 Score = 82.4 bits (202), Expect = 5e-14, Method: Compositional matrix adjust. Identities = 79/311 (25%), Positives = 133/311 (42%), Gaps = 33/311 (10%) Query: 120 PLATFSLDVDTGSYANV---RRFLNQGLLPPPDAVRVEEIVNYFPSDWDIKDKQSIPASK 176 P FS D D+ S A V + + G P P RV E +NY +D + + Sbjct: 368 PYFYFSYD-DSASTAAVELVKYGVANGERPHPSLARVWEFLNY--ETFDSASYEELGDRF 424 Query: 177 PIPFAMRYELAPAPWNEQRTLLKVDILAKDRKSEELPASNLVFLIDTSGSMI-------- 228 + M + LL ++ + EE P + + FL+D SGSM Sbjct: 425 RVSMGMVSRPSLTQDGAVDYLLGANVTVPNLTREERPHAVVTFLVDISGSMAEYSPTVDA 484 Query: 229 --SDERLPLIQSSLKLLVKELREQDNIAIVTYAGDSRIALPSISGSHKAEINA------- 279 + R+ +++ L V L+ D + +V++ ++I L + EI Sbjct: 485 GGAPTRMDIVREGLWKAVSALKPGDIVNVVSFDDAAQIEL------ERGEIRPGAATPRP 538 Query: 280 ---AIDSLDAEGSTNGGAGLELAYQQATKGFIKGGINRILLATDGDFNVGIDDPKSIESM 336 ++ L G TN AG+E+AY+ A + + INR+++ TD N G DP I Sbjct: 539 YLRSVLRLLPRGGTNLSAGIEVAYRVARRNYDPYRINRVIILTDAYANRGSIDPSLIGDH 598 Query: 337 VKKQRESGVTLSTFGVGNSNYNEAMMVRIADVGNGNYSYIDTLSEAQKVLNSEMRQMLIT 396 V + G+ S GVG ++NE + + DVG G Y + T +A + +L Sbjct: 599 VLIGDDEGIHFSGLGVG-YDFNEDFLNTLTDVGRGTYFSLITERDAARAFGERFVSLLAV 657 Query: 397 VAKDVKAQIEF 407 A+DV+ ++++ Sbjct: 658 AARDVRFRLDY 668 >UniRef50_D0LWF9 von Willebrand factor type A n=1 Tax=Haliangium ochraceum DSM 14365 RepID=D0LWF9_HALO1 Length = 419 Score = 82.0 bits (201), Expect = 5e-14, Method: Compositional matrix adjust. Identities = 66/266 (24%), Positives = 132/266 (49%), Gaps = 20/266 (7%) Query: 196 TLLKVDILAKDRKSEELPASNLVFLIDTSGSMISDERLPLIQSSL---KLLVKELREQDN 252 L V I A+ +S NL +ID S SM R P + S++ + +V++L E+D Sbjct: 16 VFLLVRIEAQATESSARMPVNLALVIDRSSSM----RGPRLASAIVAARQVVEQLDERDR 71 Query: 253 IAIVTYAGDSRIALPSISGSHKAE--INAAIDSLDAEGSTNGGAGLELAYQQATKGFIKG 310 ++++ + +R +S + +A + A+ L TN AG++ + GF++G Sbjct: 72 LSVIAFDATARTIFGPMSVTDEARQTLEQALAGLRTGVGTNLAAGMKKGAEAVRSGFVRG 131 Query: 311 GINRILLATDGDFNVGIDDPKSIESMVKKQRESGVTLSTFGVGNSNYNEAMMVRIADVGN 370 ++R++L TDG ++GI D + ++ +K+ + GVT++T G+G +++ ++ +A G Sbjct: 132 ALSRLVLLTDGQPSLGITDNDRLCALAQKEADRGVTITTMGLGQ-GFDDELLADLAHSGR 190 Query: 371 GNYSYIDTLSEAQKVLNSEMRQMLITVAKDVKAQIEFNPAWVTEYRQIGYEKRQLRVEHF 430 G + Y+ + ++ E+ + A + +I PA +QI + R+ Sbjct: 191 GGFHYLASAADIPGAFGRELSGVFAIAA--TQTEIGLRPA-----QQIDAAEVLHRLPSR 243 Query: 431 NNDN---VDAGDIGAGKHITLLFELT 453 D+ V+ G++ AG +LF L+ Sbjct: 244 PLDDGLAVELGELAAGTPRQVLFRLS 269 >UniRef50_C7RNW6 von Willebrand factor type A n=1 Tax=Candidatus Accumulibacter phosphatis clade IIA str. UW-1 RepID=C7RNW6_9PROT Length = 452 Score = 81.3 bits (199), Expect = 1e-13, Method: Compositional matrix adjust. Identities = 60/216 (27%), Positives = 108/216 (50%), Gaps = 17/216 (7%) Query: 201 DILAKDRKSEELPASNLVFLIDTSGSMISDERLPLIQS--SLKLLVKELREQDNIAIVTY 258 D LA ++K+ + +L +ID SGSM PL ++ K + +L D ++V + Sbjct: 35 DPLATEKKARK--PYHLALVIDRSGSMSGP---PLAEAVRCAKHIADQLEPTDIASLVVF 89 Query: 259 AGDSRIALPSISGSHKAEINAAIDSLDAEGSTNGGAGLELAYQQATKGFI----KGGINR 314 + +P + ++ A+ + + GSTN L +Q G + + + R Sbjct: 90 DDRVQTLVPPRPVGDRQALHLALSRVHSGGSTN----LHGGWQAGADGLLPAAGQAALAR 145 Query: 315 ILLATDGDFNVG-IDDPKSIESMVKKQRESGVTLSTFGVGNSNYNEAMMVRIADVGNGNY 373 ++L +DG+ NVG I DP I ++ + E GV+ ST+G+G S++NE +MV +A G GN+ Sbjct: 146 VILLSDGNANVGEITDPAGIAALCAQAAERGVSTSTYGLG-SHFNEDLMVEMAKRGGGNH 204 Query: 374 SYIDTLSEAQKVLNSEMRQMLITVAKDVKAQIEFNP 409 Y DT ++ + +E + A+ V+ + P Sbjct: 205 YYGDTAADLFEPFAAEFDFISALCARHVRLSLAAAP 240 >UniRef50_A6GAI6 Putative uncharacterized protein n=1 Tax=Plesiocystis pacifica SIR-1 RepID=A6GAI6_9DELT Length = 560 Score = 80.9 bits (198), Expect = 1e-13, Method: Compositional matrix adjust. Identities = 67/288 (23%), Positives = 130/288 (45%), Gaps = 23/288 (7%) Query: 151 VRVEEIVNYFPSDWDIKDKQSIPASKPIPFAMRYELAP--APWNEQRTLLKVDILAKDRK 208 +R E +NY+ D+D PA+ ++ + P +E R +++ + ++ Sbjct: 161 IRTWEFMNYYGFDYD-------PAADG-ELSVYAAMNPIEGEGDEARFQMQIGVASELMT 212 Query: 209 SEELPASNLVFLIDTSGSMISDERLPLIQSSLKLLVKELREQDNIAIVTY--AGDSRIAL 266 EE P N+ ++DTSGSM + + L++ + + + +L+ D ++I + + D +A Sbjct: 213 PEERPPMNVTLVLDTSGSM-AGTPIELLRETSRAIAAQLKLGDTVSICEWDTSNDWTLAG 271 Query: 267 PSISGSHKAEINAAIDSLDAEGSTNGGAGLELAYQQATKGFIKGGINRILLATDGDFNVG 326 +++G + + I+ + G TN GLE Y+ A + INR++L +DG N G Sbjct: 272 YAVTGPNDELLLEKINDVVHGGGTNLYGGLESGYELAQMVYDPDAINRLVLISDGGANAG 331 Query: 327 IDDPKSIESMVKKQRESGVTLSTFGVGN-SNYNEAMMVRIADVGNGNYSYIDTLSEAQKV 385 I D I G+ L GV + +YN+ +M + D G G ++ + E Sbjct: 332 ITDLDLIAENAAYGGSDGIYLVGVGVDDPDDYNDELMDAVTDAGKGASVFMPSEEEVWTT 391 Query: 386 LNSEMRQMLITVAKDVKAQIEFNPAWVTEYRQIGYEKRQLRVEHFNND 433 ++ A++V+ Q++ P G+E + E + D Sbjct: 392 FGDNFESVMAIAAREVQVQLDMPP---------GFEVVKFSGEEISGD 430 >UniRef50_Q2BJ22 Putative uncharacterized protein n=1 Tax=Neptuniibacter caesariensis RepID=Q2BJ22_9GAMM Length = 445 Score = 79.7 bits (195), Expect = 3e-13, Method: Compositional matrix adjust. Identities = 55/210 (26%), Positives = 106/210 (50%), Gaps = 4/210 (1%) Query: 212 LPASNLVFLIDTSGSMISDERLPLIQSSLKLLVKELREQDNIAIVTYAGDSRIALPSISG 271 +PA N+ ++D SGSM D+ ++++ + + L + D +++V+Y + +P+ Sbjct: 68 IPA-NIAIVLDKSGSMQGDKLFRAKEAAI-MAINRLSQNDIVSVVSYDSRVNVVVPATKV 125 Query: 272 SHKAEINAAIDSLDAEGSTNGGAGLELAYQQATKGFIKGGINRILLATDGDFNVGIDDPK 331 S I AI+ + A G+T AG+ + K +NR++L +DG N+G P Sbjct: 126 SDTNTIARAINRIQANGNTALFAGVSKGANELRKFLDLNKVNRVILLSDGLANIGPSTPN 185 Query: 332 SIESMVKKQRESGVTLSTFGVGNSNYNEAMMVRIADVGNGNYSYIDTLSEAQKVLNSEMR 391 + + + G++++T G+G YNE +M ++A +GN+++++ + +V E Sbjct: 186 ELGKLGLSLAKEGMSVTTIGLG-LGYNEDLMTQLAGFSDGNHAFVENADDLARVFQYEFG 244 Query: 392 QMLITVAKDVKAQIEFNPAWVTEYRQIGYE 421 +L VA+ V I VT R +G E Sbjct: 245 DVLSVVAQGVDIHIRC-LNGVTPLRVLGRE 273 >UniRef50_Q3A188 von Willebrand factor type A domain protein n=1 Tax=Pelobacter carbinolicus DSM 2380 RepID=Q3A188_PELCD Length = 442 Score = 79.3 bits (194), Expect = 4e-13, Method: Compositional matrix adjust. Identities = 62/233 (26%), Positives = 117/233 (50%), Gaps = 12/233 (5%) Query: 194 QRTLLKVDILA-KDRKSEELPASNLVFLIDTSGSM----ISDERLPLIQSSLKLLVKELR 248 Q+T++K+ + A + ++ + P NL ++D SGSM I+ R I++ V+ L Sbjct: 44 QKTVIKIALDAPRAPRTAQRPPVNLALVLDRSGSMSGNKIAKAREAAIEA-----VRRLS 98 Query: 249 EQDNIAIVTYAGDSRIALPSISGSHKAEINAAIDSLDAEGSTNGGAGLELAYQQATKGFI 308 + D ++V Y +P+ S +I A I + GST + + K Sbjct: 99 DGDLFSLVVYDDSVETLVPAQPVSDIGDIEARIRRIRPGGSTALFGAVSQGAAEVRKHSD 158 Query: 309 KGGINRILLATDGDFNVGIDDPKSIESMVKKQRESGVTLSTFGVGNSNYNEAMMVRIADV 368 +NR++L +DG NVG P + + + G++++T GVG +++NE +M ++A+ Sbjct: 159 APYVNRVVLLSDGLANVGPSRPADLARLGAALLKEGISVTTVGVG-TDFNEDLMTQLAER 217 Query: 369 GNGNYSYIDTLSEAQKVLNSEMRQMLITVAKDVKAQIEFNPAWVTEYRQIGYE 421 +GN+ ++++ + ++ +E+ +L VA+ V IE P V R IG E Sbjct: 218 SDGNHYFVESSRDLPRIFAAELGDVLSVVARKVVISIEC-PQGVKPLRVIGRE 269 >UniRef50_UPI00017450FB von Willebrand factor type A domain protein n=1 Tax=Verrucomicrobium spinosum DSM 4136 RepID=UPI00017450FB Length = 424 Score = 79.3 bits (194), Expect = 4e-13, Method: Compositional matrix adjust. Identities = 68/265 (25%), Positives = 126/265 (47%), Gaps = 18/265 (6%) Query: 196 TLLKVDILAKDRKSEELPAS------NLVFLIDTSGSMISDERLPLIQSSLKLLVKELRE 249 T LKV + +EL AS N+ +ID SGSM D ++ + + K + L Sbjct: 20 TYLKVGL-----TGQELEASAKRAPVNVTIVIDKSGSMGGD-KMVHAREAAKQALDRLGA 73 Query: 250 QDNIAIVTYAGDSRIALPSISGSHKAEINAAIDSLDAEGSTNGGAGLELAYQQATKGFIK 309 D +++V Y + P+ + + + AAID + A GST +G+ ++ + Sbjct: 74 GDMVSVVAYDDAVSLISPATDLTDRDRVKAAIDRIQAGGSTALFSGISKGAEELRRNKRP 133 Query: 310 GGINRILLATDGDFNVGIDDPKSIESMVKKQRESGVTLSTFGVGNSNYNEAMMVRIADVG 369 +NR++L +DG NVG P+ + + + G+T++T G+G YNE +M +A Sbjct: 134 NQVNRVVLLSDGMANVGPSSPQDLGRLGASLAKEGITVTTLGLG-LGYNEDLMTELALRS 192 Query: 370 NGNYSYIDTLSEAQKVLNSEMRQMLITVAKDVKAQIEFNPAWVTEYRQIGYEKRQLRVEH 429 +GN+++I+ + +E +L VA+ ++ +++ V R +G E H Sbjct: 193 DGNHAFIENSQNLAGIFQTEFGDILSVVAQRIRVRVQC-AEGVRPVRVLGREADI----H 247 Query: 430 FNNDNVDAGDIGAGKHITLLFELTL 454 + ++ I A +H LL E+ + Sbjct: 248 GQDVELEMNQIYARQHKYLLLEVEI 272 >UniRef50_Q235T9 von Willebrand factor type A domain containing protein n=5 Tax=Tetrahymena thermophila RepID=Q235T9_TETTH Length = 703 Score = 79.0 bits (193), Expect = 5e-13, Method: Compositional matrix adjust. Identities = 51/211 (24%), Positives = 110/211 (52%), Gaps = 4/211 (1%) Query: 201 DILAKDRKSEELPASNLVFLIDTSGSMISDERLPLIQSSLKLLVKELREQDNIAIVTYAG 260 D+ K E P +L+ +ID SGSM ++ +++++ L++ L E D ++++T+ Sbjct: 197 DMEVKSNPLEGRPNLDLICVIDNSGSMNDFSKIENVKNTILQLLEMLNENDRLSLITFNT 256 Query: 261 DSR--IALPSISGSHKAEINAAIDSLDAEGSTNGGAGLELAYQQATKGFIKGGINRILLA 318 ++ L +++ +K + S+ A+G T+ G+E+A+Q K ++ I L Sbjct: 257 KAKQLCGLKNVNNQNKKSLQTITKSIKADGGTDIIRGIEIAFQILQSRKQKNSVSSIFLL 316 Query: 319 TDGDFNVGIDDPKSIESMVKKQ-RESGVTLSTFGVGNSNYNEAMMVRIADVGNGNYSYID 377 +DG N+ K++ KQ +E T+ +FG GN +++ +M +IA + +G++ +++ Sbjct: 317 SDGQDNLADAGIKNLLKTTYKQLQEESFTIHSFGFGN-DHDGPLMQKIAQIKDGSFYFVE 375 Query: 378 TLSEAQKVLNSEMRQMLITVAKDVKAQIEFN 408 + + + + VA+D+ +IE N Sbjct: 376 KNDQVDEFFIDALGGLFSVVAQDLTIKIEIN 406 >UniRef50_A3DLZ3 von Willebrand factor, type A n=1 Tax=Staphylothermus marinus F1 RepID=A3DLZ3_STAMF Length = 416 Score = 78.6 bits (192), Expect = 6e-13, Method: Compositional matrix adjust. Identities = 57/201 (28%), Positives = 99/201 (49%), Gaps = 4/201 (1%) Query: 213 PASNLVFLIDTSGSMISDERLPLIQSSLKLLVKELREQDNIAIVTYAGDSRIALPSISGS 272 P + +IDTS SM ++ Q++L+LL LR++D + + +AG L + + Sbjct: 37 PPIAFLIVIDTSYSMDGEKIFRAKQAALRLL-DILRDKDYVGVYGFAGKFYKVLEPVPAT 95 Query: 273 HKAEINAAIDSLDAEGSTNGGAGLELAYQQATKGFIKGGIN--RILLATDGDFNVGIDDP 330 ++ E+ AI L TN L+ ++ K G I+ RI+ TDG+ G P Sbjct: 96 NRNEVEKAIIGLKLGSGTNIYDTLKKLVEETKKVLESGAISLVRIIFITDGEPTTGQKKP 155 Query: 331 KSIESMVKKQRESGVTLSTFGVGNSNYNEAMMVRIADVGNGNYSYIDTLSEAQKVLNSEM 390 + I M KK RE+G + GVG + YNE ++ R+A V NG + ++ + +K+++ Sbjct: 156 EKILEMAKKLREAGASALIIGVG-TEYNEKLLSRMAMVLNGEFEHVSDPASLEKLISEYA 214 Query: 391 RQMLITVAKDVKAQIEFNPAW 411 + AK+V +P + Sbjct: 215 KSTQEISAKNVAVLFRLSPGF 235 >UniRef50_A0C9G5 Chromosome undetermined scaffold_16, whole genome shotgun sequence n=2 Tax=Paramecium tetraurelia RepID=A0C9G5_PARTE Length = 648 Score = 78.2 bits (191), Expect = 9e-13, Method: Compositional matrix adjust. Identities = 56/199 (28%), Positives = 106/199 (53%), Gaps = 14/199 (7%) Query: 216 NLVFLIDTSGSMISDERLPLIQSSLKLLVKELREQDNIAIVTYAGDSRIALP--SISGSH 273 +L+ +ID SGSM +++ +Q SL L+ L E+D + ++T+ G ++ P +++ + Sbjct: 229 DLLCVIDKSGSM-EGKKIASVQQSLVQLLDFLSEKDRLCLITFDGSAQRLTPLKTLTQDN 287 Query: 274 KAEINAAIDSLDAEGSTNGGAGLELAYQQATKGFIKGGINRILLATDGDFNVGIDDPKSI 333 K AI S+ A G TN G E+A+ Q + +K + I L +DG D + Sbjct: 288 KNYFKKAIYSIRASGQTNIAKGTEIAFNQIQQRKMKNQVTSIFLLSDG------QDQGAA 341 Query: 334 ESMVKKQR---ESGVTLSTFGVGNSNYNEAMMVRIADVGNGNYSYIDTLSEAQKVLNSEM 390 E +++Q+ E VT+ +FG G S+++ A+M +I VG G++ YI+ + + + Sbjct: 342 E-YIQRQKDVVEDIVTIHSFGYG-SDHDAALMSKICKVGQGSFYYIEDVKLLDEFFADAL 399 Query: 391 RQMLITVAKDVKAQIEFNP 409 ++ +A+ V+ I+ P Sbjct: 400 GRLSSALAEKVQIDIKCAP 418 >UniRef50_D0LJL4 Myxococcales GC_trans_RRR domain protein n=1 Tax=Haliangium ochraceum DSM 14365 RepID=D0LJL4_HALO1 Length = 602 Score = 78.2 bits (191), Expect = 9e-13, Method: Compositional matrix adjust. Identities = 60/227 (26%), Positives = 106/227 (46%), Gaps = 38/227 (16%) Query: 216 NLVFLIDTSGSMISDERLPLIQSSLKLLVKELREQDNIAIVTYAGDSRI-----ALPSIS 270 +LV ++DTSGSM +D R+ ++ L LLV + E D +A+V+Y + + ALP Sbjct: 132 DLVVVVDTSGSMATDARMDYVRQGLHLLVDAVDEDDRLALVSYQSFAEVHAELPALPVEE 191 Query: 271 G------------------------------SHKAEINAAIDSLDAEGSTNGGAGLELAY 300 + ++E++A +D+L G TN GLE + Sbjct: 192 TPEEPTEPTDPVGEPTDPPADPDEDPVDEREAWRSEMHALVDTLQPGGGTNIYEGLERGF 251 Query: 301 QQATKGFIKGG--INRILLATDGDFNVGIDDPKSIESMVKKQRESGVTLSTFGVGNSNYN 358 + A + + R++L +DG GI D SI ++ + E G+ L+T GVG S +N Sbjct: 252 EIAKEARVNHPDRAQRVILLSDGLATEGITDSASIIALSEAFIEGGMGLTTVGVGAS-FN 310 Query: 359 EAMMVRIADVGNGNYSYIDTLSEAQKVLNSEMRQMLITVAKDVKAQI 405 +M +A+ G GN+ +++ ++V E+ +A V ++ Sbjct: 311 VELMRGLAERGAGNFYFVEDPEAVREVFTEELDYFAEPLATAVSIEV 357 >UniRef50_Q1DFU7 von Willebrand factor type A domain protein n=2 Tax=Cystobacterineae RepID=Q1DFU7_MYXXD Length = 422 Score = 77.4 bits (189), Expect = 1e-12, Method: Compositional matrix adjust. Identities = 68/257 (26%), Positives = 126/257 (49%), Gaps = 19/257 (7%) Query: 210 EELPASNLVFLIDTSGSMISDERLPLIQSSLKLLVKELREQDNIAIVTYAGDSRIALPS- 268 + +P S L ++D SGSM + ++L + + LV+ L+ +D +A + Y D R+ PS Sbjct: 43 QRVPVS-LALVLDRSGSM-NGQKLADARRAATELVQRLKPEDRLAFIDYGTDVRVQ-PSR 99 Query: 269 -ISGSHKAEINAAIDSLDAEGSTNGGAGLELAYQQATKGFIKGGINRILLATDGDFNVGI 327 ++ + E+ I L +GSTN L+ A + ++R +L +DG GI Sbjct: 100 RMTEEAREELLTLISGLQDDGSTNISGALDAAANALRPHMREYRVSRAILLSDGQPTTGI 159 Query: 328 DDPKSIESMVKKQRESGVTLSTFGVGNSNYNEAMMVRIADVGNGNYSYIDTLSEAQKVLN 387 + V++ R G+T+S GVG +Y E +M +A+ G G +ID + +V + Sbjct: 160 VSEPGLLDQVRQLRRDGITVSALGVGR-DYQETLMRGMAEQGGGFSGFIDDSARLAEVFS 218 Query: 388 SEMRQMLITVAKDVKAQIEFNPAWVTEYRQIGY----EKRQLRVEHFNNDNVDAGDIGAG 443 E+ Q TVA+ V+ +++ P V + +G E L+V + D+ + Sbjct: 219 RELDQATSTVARMVELRLDV-PPEVQDVEVMGMASFREGNVLKVPLY--------DMASA 269 Query: 444 KHITLLFELTLNGQKAS 460 + + ++ +LTLN + + Sbjct: 270 QTVRVMAKLTLNTSRTA 286 >UniRef50_A6GBY0 Putative uncharacterized protein n=1 Tax=Plesiocystis pacifica SIR-1 RepID=A6GBY0_9DELT Length = 996 Score = 77.4 bits (189), Expect = 2e-12, Method: Compositional matrix adjust. Identities = 48/188 (25%), Positives = 96/188 (51%), Gaps = 12/188 (6%) Query: 208 KSEELPASNLVFLIDTSGSMISDERLPLIQSSLKLLVKELREQDNIAIVTYAGDSRIALP 267 + E P L+ +ID SGSM S +RL L++ + + + L D I ++ + ++ + Sbjct: 521 RQREQPTLALILVIDKSGSMSSGDRLDLVKEAARATARTLDPSDEIGVIAFDNSPQVLVR 580 Query: 268 SISGSHKAEINAAIDSLDAEGSTNGGAGLELAYQQ--ATKGFIKGGINRILLATDGDFNV 325 +++ I+++I L A G TN L AY Q +K +K ++L +DG+ Sbjct: 581 LQPAANRLRISSSIRRLSAGGGTNAMPALREAYLQLAGSKALVK----HVILLSDGE--- 633 Query: 326 GIDDPKSIESMVKKQRESGVTLSTFGVGNSNYNEAMMVRIADVGNGNYSYIDTLSEAQKV 385 I +++ R+S +T+S+ GVG+ + ++R+A+ G G Y Y + ++ ++ Sbjct: 634 --SPENGINALLGDMRQSDITVSSVGVGDGAGKD-FLIRVAERGRGRYFYSEDGTDVPRI 690 Query: 386 LNSEMRQM 393 + E R++ Sbjct: 691 FSREAREV 698 >UniRef50_A5UYN7 von Willebrand factor, type A n=1 Tax=Roseiflexus sp. RS-1 RepID=A5UYN7_ROSS1 Length = 459 Score = 76.6 bits (187), Expect = 2e-12, Method: Compositional matrix adjust. Identities = 60/203 (29%), Positives = 104/203 (51%), Gaps = 8/203 (3%) Query: 208 KSEELPASNLVFLIDTSGSMISDERLPLIQSSLKLLVKELREQDNIAIVTYAGDSRIALP 267 + + P +LV ++D SGSM S +L + +L+ + L++ D ++VT++ + L Sbjct: 84 REQHRPPLHLVAVLDVSGSM-SGTKLASAKEALRQALHFLQDGDVFSLVTFSDQVQTHLK 142 Query: 268 SISGSHKA--EINAAIDSLDAEGSTNGGAGLELAYQQATKGFIKGGINRILLATDGDFNV 325 + S + + ++ +D + A G T GL K + +LL +DG NV Sbjct: 143 AESYAQRKRDKMENLLDEIRASGMTALDGGLAQGIDLGQKK--RQATTLVLLLSDGQANV 200 Query: 326 GIDDPKSIESMVKKQRESGVTLSTFGVGNSNYNEAMMVRIADVGNGNYSYIDTLSEAQKV 385 G D + I +K R+SG+ +ST GVG +YNEA+MV IA+ G G + +I S+ Sbjct: 201 GETDLEKIGLRAQKARQSGLIVSTLGVG-LDYNEALMVEIANQGGGRFYHIQEGSQIPAA 259 Query: 386 LNSEMRQMLITVAKDVKAQIEFN 408 L E+ + A+ V ++EF+ Sbjct: 260 LMQELGSAAMLAARQV--EVEFD 280 >UniRef50_A9FM70 Putative uncharacterized protein n=1 Tax=Sorangium cellulosum 'So ce 56' RepID=A9FM70_SORC5 Length = 507 Score = 76.3 bits (186), Expect = 3e-12, Method: Compositional matrix adjust. Identities = 66/260 (25%), Positives = 121/260 (46%), Gaps = 10/260 (3%) Query: 207 RKSEELPASNLVFLIDTSGSMISDERLPLIQSSLKLLVKELREQDNIAIVTYA--GDSRI 264 R PA+ +V L+D SGSM ++ +++ + V L + D +++ ++A +R+ Sbjct: 115 RGQPRAPAA-VVLLVDASGSM-QGPKMENARAAAQAFVDRLPDGDLVSVASFADTAQARV 172 Query: 265 ALPSISGSHKAEINAAIDSLDAEGSTNGGAGLELAYQQATKGFIKGGINRILLATDGDFN 324 A + S + + AI +L +GSTN AGL+LA Q A + R++L +DG N Sbjct: 173 APTVLGRSTRPAVARAIAALGPDGSTNLFAGLKLAEQHALAAPSTHAVRRVVLISDGQAN 232 Query: 325 VGIDDPKSIESMVKKQRESGVTLSTFGVGNSNYNEAMMVRIADVGNGNYSYIDTLSEAQK 384 +G P + ++ ++ GV +++ GVG ++Y+E + +A +G ++ E Sbjct: 233 IGPSSPDILGALAQRGAAHGVQITSIGVG-ADYDERTLNALAVGSSGRLYHLTEAREMSS 291 Query: 385 VLNSEMRQMLITVAKDVKAQIEFNPAWVTEYRQIGYEKRQLRVEHFNNDNVDAGDIGAGK 444 VL E+ + T A A +E PA E + E+ + + V G + G+ Sbjct: 292 VLERELALLQTTAA--TGAFVEIVPAPGVELLDVPNERTERSGDAL---RVLLGTMFGGQ 346 Query: 445 HITLLFELTLNGQKASIDKL 464 H +L + A L Sbjct: 347 HREMLVRARVTAPAAGSHPL 366 >UniRef50_A9G8C3 Putative uncharacterized protein n=2 Tax=Sorangium cellulosum 'So ce 56' RepID=A9G8C3_SORC5 Length = 907 Score = 76.3 bits (186), Expect = 3e-12, Method: Compositional matrix adjust. Identities = 57/212 (26%), Positives = 104/212 (49%), Gaps = 5/212 (2%) Query: 213 PASNLVFLIDTSGSMISDERLPLIQSSLKLLVKELREQDNIAIVTYAGDSRIALP-SISG 271 P ++ ++DTSGSM + + + + + LV L D+ ++ T++ D+ + + G Sbjct: 500 PHLSVHLVLDTSGSM-AGAPIDSARRAAQALVDRLAPADDFSLTTFSSDAEVVIEDGPVG 558 Query: 272 SHKAEINAAIDSLDAEGSTNGGAGLELAYQQATK-GFIKGGINRILLATDGDFNVGIDDP 330 +A I AI+ L G TN GAGL L Y QA++ G + + +LL +DG G+ Sbjct: 559 PRRAAIRRAIEGLREGGGTNIGAGLSLGYAQASRPGIPEDAVRVVLLVSDGRATSGLTHS 618 Query: 331 KSIESMVKKQRESGVTLSTFGVGNSNYNEAMMVRIADVGNGNYSYIDTLSEAQKVLNSEM 390 + + + + G+ S G+G+ +++ +M IA G G Y Y+ + L++E+ Sbjct: 619 ERLAWLALDAFQRGIQTSALGLGD-DFDGQLMSAIASDGAGGYYYLRHPEQIAPALSTEL 677 Query: 391 RQMLITVAKDVKAQIEFNPAWVTEYRQIGYEK 422 + L VA V+ ++ A V R G + Sbjct: 678 DKRLDPVATAVEVRVRLK-AGVDLLRAYGSRR 708 >UniRef50_A6G857 Putative uncharacterized protein n=1 Tax=Plesiocystis pacifica SIR-1 RepID=A6G857_9DELT Length = 540 Score = 75.5 bits (184), Expect = 5e-12, Method: Compositional matrix adjust. Identities = 51/189 (26%), Positives = 101/189 (53%), Gaps = 3/189 (1%) Query: 213 PASNLVFLIDTSGSMISDERLPLIQSSLKLLVKELREQDNIAIVTYAGDSRIALPSISGS 272 P NL +D S SM E + ++ L + ++L +D + +V + ++++ + + + Sbjct: 156 PPLNLTIAVDLSKSM-EGEPIDRVRQGLLQMREQLEPEDRVTLVGFGDEAQVIVEN-ADK 213 Query: 273 HKAEINAAIDSLDAEGSTNGGAGLELAYQQATKGFIKGGINRILLATDGDFNVGIDDPKS 332 E+ AI +L GSTN AGL A++Q +G NR+LL +DG GI + Sbjct: 214 DSVELATAIAALVPWGSTNLYAGLRTAFEQTDLYAQEGWQNRVLLVSDGVPTTGIVNSDK 273 Query: 333 IESMVKKQRESGVTLSTFGVGNSNYNEAMMVRIADVGNGNYSYIDTLSEAQKVLNSEMRQ 392 IE + + G L+T G+GN +++ +M ++++G+G++ Y++ +V + E++ Sbjct: 274 IEGLAEAWSGMGYGLTTVGIGN-DFDIELMRNLSELGSGSFYYVEDPDAVIEVFSEEVQA 332 Query: 393 MLITVAKDV 401 + +A+DV Sbjct: 333 FTVPLAEDV 341 >UniRef50_A6GC99 Putative uncharacterized protein n=1 Tax=Plesiocystis pacifica SIR-1 RepID=A6GC99_9DELT Length = 546 Score = 74.7 bits (182), Expect = 1e-11, Method: Compositional matrix adjust. Identities = 78/295 (26%), Positives = 131/295 (44%), Gaps = 34/295 (11%) Query: 213 PASNLVFLIDTSGSMISDERLPLIQSSLKLLVKELREQDNIAIVTYAGDSRIALPSISGS 272 P +L ++D SGSM D+ Q+ L L V L EQD + +++Y D+ L ++ Sbjct: 129 PGLDLAIVLDRSGSMGGDKLRFAKQAGLDL-VNRLDEQDRVTLISY-DDTVTPLSNLQRV 186 Query: 273 HKAEINAA---IDSLDAEGSTNGGAGLELAYQQATKGFIKGGINR----------ILLAT 319 I + + G+T G L + Q+ G R ++L + Sbjct: 187 DDDGIEVLRRQLLDIQVGGTTALGPALFMGLQRLAAPEPFGPQTRTEARHDRLRHVILLS 246 Query: 320 DGDFNVGIDDPKSIESMVKKQRESGVTLSTFGVGNSNYNEAMMVRIADVGNGNYSYIDTL 379 DG NVG P+ I V + GV++ST G+G +YNE +M RIAD G G Y +I+ Sbjct: 247 DGIANVGETRPEVIGGRVAEHFGGGVSVSTLGMG-LDYNEDLMTRIADEGGGRYHFIEDA 305 Query: 380 SEAQKVLNSEMRQMLITVAKDVKAQIEFNPAW-VTEYRQIGYEKRQLRVEHFNNDNVDAG 438 +L E+ + TVA +V + P VTE GY + ++ + G Sbjct: 306 ESIPAMLGDELAGLTATVASEVDSVFATLPGTDVTEV--YGYTQ----TVAGSDTTIRVG 359 Query: 439 DIGAGKHITLLFELTLNGQKASIDKLRYAPDNKLAKSDKTKELAWLKIRWKYPQG 493 +GAG+ ++ L L+ ++ +AP ++ EL +++R++ G Sbjct: 360 FLGAGQSREIVVNLRLDPEQVR----GWAPGERV-------ELGEVEVRYRLVTG 403 >UniRef50_A3U9M7 Putative uncharacterized protein n=1 Tax=Croceibacter atlanticus HTCC2559 RepID=A3U9M7_9FLAO Length = 244 Score = 74.3 bits (181), Expect = 1e-11, Method: Compositional matrix adjust. Identities = 37/97 (38%), Positives = 57/97 (58%), Gaps = 9/97 (9%) Query: 105 YQQFDDNPVKQVAQNPLATFSLDVDTGSYANVRRFLNQGLLPPPDAVRVEEIVNYFPSDW 164 Y ++N K + +PL+TFS+DVD SY+N+RR +N P DAV+VEE++NYF ++ Sbjct: 155 YALVEENTFKTASVSPLSTFSIDVDRASYSNIRRMINNSQPIPVDAVKVEEMINYFSYNY 214 Query: 165 DIKDKQSIPASKPI-PFAMRYELAPAPWNEQRTLLKV 200 P K PF+ EL +PWN ++L++ Sbjct: 215 --------PEPKTNDPFSEYTELVQSPWNTNTSILRI 243 >UniRef50_UPI00006CAF43 von Willebrand factor type A domain containing protein n=1 Tax=Tetrahymena thermophila RepID=UPI00006CAF43 Length = 631 Score = 73.2 bits (178), Expect = 3e-11, Method: Compositional matrix adjust. Identities = 58/206 (28%), Positives = 106/206 (51%), Gaps = 15/206 (7%) Query: 208 KSEELPASNLVFLIDTSGSMISDERLPLIQSSLKLLVKELREQDNIAIVTYAGDSRIALP 267 +SE +P +L+ +ID SGSM S ++ L++ SLK L+K + E D I ++++ +I P Sbjct: 137 QSERVPM-DLICVIDDSGSM-SGKKAQLVRKSLKYLLKIMNENDRICLISFDSVEKILTP 194 Query: 268 SISGS--HKAEINAAIDSLDAEGSTNGGAGLELAYQQATKGFIKGGINRILLATDGDFNV 325 + + +K+E+ AI ++ GSTN AG+E K I + L +DG Sbjct: 195 FLRNNLENKSELKKAIKNIVGRGSTNIEAGMEAGLWMIKNRKEKNPITCMFLLSDGQ--- 251 Query: 326 GIDDPKSIESMVKKQRES-----GVTLSTFGVGNSNYNEAMMVRIADVGNGNYSYIDTLS 380 DD ++ V+K +S ++T+G G ++++ M IA+ G Y YI+ + Sbjct: 252 --DDSPQVDLRVQKLIQSYDIQDTFIVNTYGYG-ADHDATQMRNIAETHKGGYYYIEDVK 308 Query: 381 EAQKVLNSEMRQMLITVAKDVKAQIE 406 + + + +L V +DV+ +I+ Sbjct: 309 KVSEWFVLSISGLLSAVGEDVRIRIK 334 >UniRef50_Q6ZED8 Slr7060 protein n=2 Tax=Chroococcales RepID=Q6ZED8_SYNY3 Length = 588 Score = 73.2 bits (178), Expect = 3e-11, Method: Compositional matrix adjust. Identities = 47/189 (24%), Positives = 92/189 (48%), Gaps = 2/189 (1%) Query: 213 PASNLVFLIDTSGSMISDERLPLIQSSLKLLVKELREQDNIAIVTYAGDSRIALPSISGS 272 P+ NL F+ID SGSM ++ + ++ + +L D++++ + + +PS Sbjct: 42 PSLNLGFVIDRSGSMEGHNKITYARQAVCYAIDQLSPGDHLSVTIFDDQVQTLIPSTLVK 101 Query: 273 HKAEINAAIDSLDAEGSTNGGAGLELAYQQATKGFIKGGINRILLATDGDFNVGIDDPKS 332 KA+ + ++ G T+ G Q ++ + +NRI+L +DG N G +P Sbjct: 102 DKAQFKRLVQGINPGGCTDLHGGWLQGGIQVSQN-LSAELNRIILLSDGLANRGETNPDI 160 Query: 333 IESMVKKQRESGVTLSTFGVGNSNYNEAMMVRIADVGNGNYSYIDTLSEAQKVLNSEMRQ 392 I + V + G + +T G+G+ +YNE ++ +A G+GNY Y+ + + E++ Sbjct: 161 IATDVHGLAQRGASTTTLGLGD-DYNEDLLEAMARSGDGNYYYVADAEQLPTIFERELQG 219 Query: 393 MLITVAKDV 401 + T V Sbjct: 220 LAATYGNGV 228 >UniRef50_Q8YP40 Alr4360 protein n=15 Tax=Cyanobacteria RepID=Q8YP40_ANASP Length = 427 Score = 72.4 bits (176), Expect = 5e-11, Method: Compositional matrix adjust. Identities = 53/190 (27%), Positives = 94/190 (49%), Gaps = 13/190 (6%) Query: 198 LKVDILAKDRKSEELPASNLVFLIDTSGSMISDERLPLIQSSLKLLVKELREQDNIAIVT 257 L + I A + E+ NL ++D SGSM + L ++ +++ L+ L+ D I++V Sbjct: 25 LAISISAVAEQFEQNLPLNLCLILDQSGSM-HGQPLKMVVEAVEKLLDRLQPGDRISVVA 83 Query: 258 YAGDSRIALPSISGSHKAEINAAI-DSLDAEGSTNGGAGLELAYQQATKGFIKGGINRIL 316 +AG + + +P+ + I I L A G T GL+ + KG +G +++ Sbjct: 84 FAGSATVIIPNQIVENPESIKTQIRKKLQASGGTVIAEGLQQGITELMKG-TRGAVSQAF 142 Query: 317 LATDGD---------FNVGIDDPKSIESMVKKQRESGVTLSTFGVGNSNYNEAMMVRIAD 367 L TDG + +G DD + KK + +T++T G GN N+N+ ++ IAD Sbjct: 143 LLTDGHGEDSLKIWKWEIGPDDSRRCLEFAKKAAKINLTINTLGFGN-NWNQDLLETIAD 201 Query: 368 VGNGNYSYID 377 G G ++I+ Sbjct: 202 AGGGTLAHIE 211 >UniRef50_A0E9B3 Chromosome undetermined scaffold_84, whole genome shotgun sequence n=1 Tax=Paramecium tetraurelia RepID=A0E9B3_PARTE Length = 603 Score = 72.0 bits (175), Expect = 6e-11, Method: Compositional matrix adjust. Identities = 60/209 (28%), Positives = 96/209 (45%), Gaps = 14/209 (6%) Query: 213 PASNLVFLIDTSGSMISDERLPLIQSSLKLLVKELREQDNIAIVTYAGDSRIALPSISGS 272 P +LV ++D SGSMI ++ L++ SL+ L+K L +D I I+ + + I I + Sbjct: 118 PPIDLVCVVDVSGSMIG-RKINLVKDSLRYLMKILGPEDRICIIVFTTVAHIVTSFIRNT 176 Query: 273 --HKAEINAAIDSLDAEGSTNGGAGLELAYQQATKGFIKGGINRILLATDGDFNVGIDDP 330 +K + AI L STN G+ A K ++ I L +DG DD Sbjct: 177 QENKPLLKKAILELKGLASTNISDGMNKALWMLKNRKYKNPVSCIFLLSDGQ-----DDY 231 Query: 331 KSIESMVKKQR-----ESGVTLSTFGVGNSNYNEAMMVRIADVGNGNYSYIDTLSEAQKV 385 K E V Q E + TFG G +++ +M +IA GN+ YID +++A Sbjct: 232 KGAEQRVFDQLQLLKIEEKFVIHTFGYG-QDHDAYVMNQIAKYREGNFYYIDNINKASDY 290 Query: 386 LNSEMRQMLITVAKDVKAQIEFNPAWVTE 414 M ML A++V ++ N + + Sbjct: 291 FILAMSGMLSIYAQNVSINLKSNDCEIVK 319 >UniRef50_D2VHB8 Predicted protein n=1 Tax=Naegleria gruberi RepID=D2VHB8_NAEGR Length = 755 Score = 71.2 bits (173), Expect = 1e-10, Method: Compositional matrix adjust. Identities = 59/204 (28%), Positives = 98/204 (48%), Gaps = 22/204 (10%) Query: 215 SNLVFLIDTSGSMISD-------------ERLPLIQSSLKLLVKELREQDNIAIVTYAGD 261 NLV ++D SGSM S RL L++ S++ L++ + E+D I+++ ++ Sbjct: 131 CNLVCILDVSGSMGSSAEDLSSSNENTGFSRLDLVKHSVRTLIELMNEKDQISLIPFSDS 190 Query: 262 SRIALP--SISGSHKAEINAAIDSLDAEGSTNGGAGLELAYQQATKGFIKGGINRIL-LA 318 +R+ LP + K + ++ L EGSTN GL L + + + N L L Sbjct: 191 ARMELPLTKMDAVGKKKAIEKLEHLGPEGSTNVWDGLRLGMESSLNNPLCAKTNTCLILF 250 Query: 319 TDGDFNVGIDDPKSIESMVKK---QRESGVTLSTFGVGNSNYNEAMMVRIADVGNGNYSY 375 TDG+ N I+ P+ I ++K + T+ +FG G S + A++ IA G+G YSY Sbjct: 251 TDGEPN--INPPRGIVPTLEKYIKEHPLNSTIHSFGFGYS-LDSALLKDIAMNGSGAYSY 307 Query: 376 IDTLSEAQKVLNSEMRQMLITVAK 399 I S + M +L T + Sbjct: 308 IPDCSMVGTTFVNMMSNILCTAVR 331 >UniRef50_A6FSG0 Putative uncharacterized protein n=1 Tax=Roseobacter sp. AzwK-3b RepID=A6FSG0_9RHOB Length = 444 Score = 70.9 bits (172), Expect = 1e-10, Method: Compositional matrix adjust. Identities = 53/198 (26%), Positives = 88/198 (44%), Gaps = 2/198 (1%) Query: 208 KSEELPASNLVFLIDTSGSMISDERLPLIQSSLKLLVKELREQDNIAIVTYAGDSRIALP 267 ++E P NL ++D S SM + L + + +V LR D +AIV + + + Sbjct: 36 ETEPRPPLNLALVLDRSSSM-RGQPLHEAKRAADQIVAGLRPSDRLAIVAFDNATEVMFS 94 Query: 268 SISGSHKAEINAAIDSLDAEGSTNGGAGLELAYQQATKGFIKGGINRILLATDGDFNVGI 327 AA+ + A G T G L +Q+ G R+ L +DG NVG+ Sbjct: 95 GGPRGDGQAARAALSRIHARGMTALHDGWLLGVEQSIAMREAGTPARVFLLSDGVANVGL 154 Query: 328 DDPKSIESMVKKQRESGVTLSTFGVGNSNYNEAMMVRIADVGNGNYSYIDTLSEAQKVLN 387 D +I + + E G+T ST G+G +NE +M +A G GN Y +T + Q Sbjct: 155 TDASAIAADCTRMAEHGITTSTCGLG-MGFNEDLMAEMARAGRGNAYYGETAEDLQDPFE 213 Query: 388 SEMRQMLITVAKDVKAQI 405 E + A+ ++ ++ Sbjct: 214 QEFDLLRNICARGLRLRL 231 >UniRef50_Q55874 Uncharacterized protein sll0103 n=16 Tax=Cyanobacteria RepID=Y103_SYNY3 Length = 420 Score = 70.9 bits (172), Expect = 1e-10, Method: Compositional matrix adjust. Identities = 69/247 (27%), Positives = 122/247 (49%), Gaps = 13/247 (5%) Query: 187 APAPWNEQRTLLKVDILAK-DRKSEELPASNLVFLIDTSGSMISDERLPLIQSSLKLLVK 245 A AP ++++ L++ + AK D LP NL ++D SGSM + L ++S+ L+ Sbjct: 16 AGAPTSQRQ--LRIAVAAKADDHDRRLPL-NLCLVLDHSGSM-DGQPLETVKSAALGLID 71 Query: 246 ELREQDNIAIVTYAGDSRIALPSISGSHKAEINAAIDSLDAEGSTNGGAGLELAYQQATK 305 L E D ++++ + ++I + + + A I AI+ L AEG T GL+L Q+A K Sbjct: 72 RLEEDDRLSVIAFDHRAKIVIENQQVRNGAAIAKAIERLKAEGGTAIDEGLKLGIQEAAK 131 Query: 306 GFIKGGINRILLATDGDFNVGIDDPKSIESMVKKQRESGVTLSTFGVGNSNYNEAMMVRI 365 G + ++ I L TDG+ G D+ + ++ + + +T+ T G G+ ++N+ ++ I Sbjct: 132 GK-EDRVSHIFLLTDGENEHG-DNDRCLK-LGTVASDYKLTVHTLGFGD-HWNQDVLEAI 187 Query: 366 ADVGNGNYSYIDTLSEAQKVLNSEMRQMLITVAKDVKAQIEFNP----AWVTEYRQIGYE 421 A G+ SYI+ SEA ++M + +E P A V Q+ E Sbjct: 188 AASAQGSLSYIENPSEALHTFRQLFQRMSNVGLTNAHLLLELAPQAHLAIVKPVAQVSPE 247 Query: 422 KRQLRVE 428 L V+ Sbjct: 248 TMDLTVQ 254 >UniRef50_D0LP28 von Willebrand factor type A n=1 Tax=Haliangium ochraceum DSM 14365 RepID=D0LP28_HALO1 Length = 523 Score = 70.5 bits (171), Expect = 2e-10, Method: Compositional matrix adjust. Identities = 73/222 (32%), Positives = 104/222 (46%), Gaps = 22/222 (9%) Query: 137 RRFLNQGLLPPPDAVRVEEIVNY--FPSDWDIKDKQSIPASKPIPFAMRYELAPAPW--N 192 R + +G +PP +A VE + + P D D +R LA AP Sbjct: 60 RDLIAEGRVPPAEAFLVEAMFSEHDLPVAGDACDSM---------LCLRSSLAVAPALDG 110 Query: 193 EQRTLLKVDILAK-DRKSEELPASNLVFLIDTSGSM---ISDERLP---LIQSSLKLLVK 245 L+V + + D + E P+ +V +D SGSM +D+++ L ++ L LV Sbjct: 111 TPTGWLQVGMSSTIDPATFERPSLTIVATVDVSGSMGWGYADDQVSAGSLTRNLLGALVD 170 Query: 246 ELREQDNIAIVTYAGDSRIALPSISGSHKAEINAAIDSLDAEGSTNGGAGLELAYQQATK 305 +L +D IAIVTY AL S K EI+ AID L GSTN AGL+ AY A++ Sbjct: 171 QLGPEDRIAIVTYGSRVDTALTLRSAGQKDEIHTAIDKLSEAGSTNMEAGLQRAYAIASE 230 Query: 306 GFIKGGIN--RILLATDGDFNVGIDDPKSIESMVKKQRESGV 345 G + RI+L TD NVG E+M + +SGV Sbjct: 231 AAADGETDSTRIMLFTDVQPNVGATGASQFEAMASEGADSGV 272 >UniRef50_Q10RY0 Os03g0142500 protein n=2 Tax=Oryza sativa RepID=Q10RY0_ORYSJ Length = 694 Score = 70.1 bits (170), Expect = 2e-10, Method: Compositional matrix adjust. Identities = 65/234 (27%), Positives = 110/234 (47%), Gaps = 27/234 (11%) Query: 192 NEQRTLLKVDILAKDRKSEELPAS----NLVFLIDTSGSMISDERLPLIQSSLKLLVKEL 247 +E+R + + I K KS + +S +LV ++D SGSM S +L L++ ++ +++ L Sbjct: 219 SERRKVFAILIHLKAPKSLDSVSSRAPLDLVTVLDVSGSM-SGIKLSLLKRAMSFVIQTL 277 Query: 248 REQDNIAIVTYAGDSRIALP----SISGSHKAEINAAIDSLDAEGSTNGGAGLELAYQQA 303 D +++V ++ ++ P +++G +A AI SL A G TN L+ + Sbjct: 278 GPNDRLSVVAFSSTAQRLFPLRRMTLTGRQQAL--QAISSLVASGGTNIADALKKGAKVV 335 Query: 304 TKGFIKGGINRILLATDG-----------DFNVGIDDPKSIESMVKKQRESGVTLSTFGV 352 K ++ I+L +DG D N I P SI V + TFG Sbjct: 336 KDRRRKNPVSSIILLSDGQDTHSFLSGEADINYSILVPPSILPGTSHH----VQIHTFGF 391 Query: 353 GNSNYNEAMMVRIADVGNGNYSYIDTLSEAQKVLNSEMRQMLITVAKDVKAQIE 406 G ++++ A M IA+ NG +S+ID Q M +L V KD++ IE Sbjct: 392 G-TDHDSAAMHAIAETSNGTFSFIDAEGSIQDAFAQCMGGLLSVVVKDMRLCIE 444 >UniRef50_B8G546 von Willebrand factor type A n=6 Tax=Chloroflexi (class) RepID=B8G546_CHLAD Length = 418 Score = 69.7 bits (169), Expect = 3e-10, Method: Compositional matrix adjust. Identities = 69/281 (24%), Positives = 129/281 (45%), Gaps = 31/281 (11%) Query: 188 PAPWNEQRTLLKVDILAKDRKSEELPASNLVFLIDTSGSMISDERLPLIQSSLKLLVKEL 247 P Q L V+ +A + LP NL F++D SGSM +L ++++ + +++ L Sbjct: 17 PTSSTPQVVYLLVEAVAPASPTSALPL-NLCFVLDRSGSM-QGAKLESMKAATRRVIELL 74 Query: 248 REQDNIAIVTYAGDSRIALPSISGSHKAEINAAIDSLDAEGSTNGGAGLELAYQQATKGF 307 R D AIV + + +P+ ++ + AA++++ G T G++ A + K Sbjct: 75 RPHDVAAIVIFDDTVQTLIPATPVGDRSALLAAVETITEAGGTAMSLGMQAAQTELQKHL 134 Query: 308 IKGGINRILLATDGDFNVGIDDPKSIESMVKKQRESGVTLSTFGVGNSNYNEAMMVRIAD 367 I+R+LL TDG D+P + + ++GV ++ G+G + +NE ++ IA Sbjct: 135 GPDRISRMLLLTDG--QTWGDEPIC-RDLARTLGQAGVRITALGLG-TEWNEQLLDDIAA 190 Query: 368 VGNGNYSYI-----------DTLSEAQKVLNSEMRQMLITVAKDVKAQIEFNPAWVTE-- 414 +G YI + EAQ V+ ++ R +L+ + +DV + + V Sbjct: 191 ASDGYSDYIADPAQIETFFQQAVKEAQAVVATDAR-LLLRLVRDVTPRAIYRVKPVIANL 249 Query: 415 -YRQIGYEKRQLRVEHFNNDNVDAGDIGAGKHITLLFELTL 454 Y+ IG +R+ GD+ G+ +L +L L Sbjct: 250 GYQPIGDAAVAVRL----------GDLVGGQPAAVLLDLML 280 >UniRef50_Q82LZ6 Putative uncharacterized protein n=1 Tax=Streptomyces avermitilis RepID=Q82LZ6_STRAW Length = 462 Score = 69.7 bits (169), Expect = 3e-10, Method: Compositional matrix adjust. Identities = 51/186 (27%), Positives = 94/186 (50%), Gaps = 14/186 (7%) Query: 182 MRYELAPAPW----NEQRT-LLKVDILAKDRKSEELPAS-----NLVFLIDTSGSMISDE 231 M + LA P NE T +++ + A + E+ PA+ N VF++DTSGSM + Sbjct: 1 MSHGLAAVPGEFKANEATTHFVEIALKAGRARPEDAPATEPLPVNFVFVVDTSGSM-TGT 59 Query: 232 RLPLIQSSLKLLVKELREQDNIAIVTYAGDSRIALPSISGSH--KAEINAAIDSLDAEGS 289 +L ++S+L+ + +ELR D + I+T+ + R LP+++ + +L +G Sbjct: 60 KLDTVKSALQTIYRELRPADCLGIITFDHNVRTVLPAVAKQDLPPERFAEVVSALTTQGG 119 Query: 290 TNGGAGLELAYQQATKGFIKG-GINRILLATDGDFNVGIDDPKSIESMVKKQRESGVTLS 348 T+ G++ + ++ + G +N + L +DGD G D + + V + +TLS Sbjct: 120 TDIDLGVQYGIDEISRHSVSGRTVNCLYLFSDGDPTSGERDWIKVRANVAAKLRGDLTLS 179 Query: 349 TFGVGN 354 FG G+ Sbjct: 180 CFGFGS 185 >UniRef50_D2VKS7 von Willebrand factor type A domain-containing protein n=2 Tax=Naegleria gruberi RepID=D2VKS7_NAEGR Length = 923 Score = 69.3 bits (168), Expect = 4e-10, Method: Compositional matrix adjust. Identities = 57/203 (28%), Positives = 105/203 (51%), Gaps = 18/203 (8%) Query: 216 NLVFLIDTSGSMISDERLPLIQSSLKLLVKELREQDNIAIVTYAGDSRIALP----SISG 271 +LV ++D SGSM + ++L +++S+L +V +L+E+D +AIV + + L I G Sbjct: 692 DLVLVVDKSGSM-AGQKLDMVKSTLSFMVDQLKEKDRVAIVEFDTQVKTNLDLTKMDIEG 750 Query: 272 SHKA-EINAAIDSLDAEGSTNGGAGLELAYQQ--ATKGFIKGGINRILLATDGDFNVGID 328 KA ++++AI + GS +G + A++ K + ++L TDG N G+ Sbjct: 751 KKKAKQVSSAI----SPGSCTNLSGALFTSLKLLASRQQEKNEVTSVILFTDGLANRGLI 806 Query: 329 DPKSI-----ESMVKKQRESGVTLSTFGVGNSNYNEAMMVRIADVGNGNYSYIDTLSEAQ 383 I + M + S VT+ TFG G + + M+ IA GNG Y Y++T + Sbjct: 807 STNEILQNMQDLMDELLSTSNVTIHTFGFG-QDTDANMLTSIAQKGNGLYDYLETADDIP 865 Query: 384 KVLNSEMRQMLITVAKDVKAQIE 406 K + + ++ V +++K +I+ Sbjct: 866 KAFGNVIGNLVSVVGQNIKIRIQ 888 >UniRef50_D0LD98 von Willebrand factor type A n=1 Tax=Gordonia bronchialis DSM 43247 RepID=D0LD98_GORB4 Length = 423 Score = 69.3 bits (168), Expect = 4e-10, Method: Compositional matrix adjust. Identities = 58/210 (27%), Positives = 100/210 (47%), Gaps = 4/210 (1%) Query: 200 VDILAKDRKSEELPASNLVFLIDTSGSMISDERLPLIQSSLKLLVKELREQDNIAIVTYA 259 ++I+A + K + + L ++D SGSM S L Q +L ++ +L +D +VT+ Sbjct: 24 LEIVAPEGKVTDRAPAALQVVLDRSGSM-SGPPLAGAQRALAGVIGQLDPRDVFGVVTFD 82 Query: 260 GDSRIALPSISGSHKAEINAAIDSLDAEGSTNGGAGLELAYQQATKGFIKGGI--NRILL 317 D+++ LP+ + KA A+ S+ G T+ +G Q+ + GI +L+ Sbjct: 83 DDAQVVLPAAPLADKARAVDAVGSIVPGGCTDLSSGYLRGLQELRRATASAGIRGGTVLV 142 Query: 318 ATDGDFNVGIDDPKSIESMVKKQRESGVTLSTFGVGNSNYNEAMMVRIADVGNGNYSYID 377 +DG N GI D S+ K G+ ST G G Y+E ++ IA GNGN+ + D Sbjct: 143 ISDGHVNRGIRDLDEFASITAKAAADGIITSTLGYGR-GYDETLLSAIARSGNGNHVFAD 201 Query: 378 TLSEAQKVLNSEMRQMLITVAKDVKAQIEF 407 A + E+ +L A+ V + + Sbjct: 202 DPDAAGAAIAGEVDGLLSKSAQAVTLTVRY 231 >UniRef50_C5WYU9 Putative uncharacterized protein Sb01g047460 n=3 Tax=Andropogoneae RepID=C5WYU9_SORBI Length = 698 Score = 68.9 bits (167), Expect = 5e-10, Method: Compositional matrix adjust. Identities = 63/251 (25%), Positives = 120/251 (47%), Gaps = 21/251 (8%) Query: 216 NLVFLIDTSGSMISDERLPLIQSSLKLLVKELREQDNIAIVTYAGDSRIALP----SISG 271 +LV ++D SGSM + ++ L+++++ +++ L D ++++ ++ +R P +++G Sbjct: 241 DLVTVLDVSGSM-AGTKIALLKNAMSFVIQTLGPNDRLSVIAFSSTARRLFPLRRMTLAG 299 Query: 272 SHKAEINAAIDSLDAEGSTNGGAGLELAYQQATKGFIKGGINRILLATDG--------DF 323 +A A+ SL A G TN GL+ + +K + I+L +DG D Sbjct: 300 RQQAL--QAVSSLVASGGTNIADGLKKGAKVIEDRRLKNPVCSIILLSDGQDTYTLPSDR 357 Query: 324 NVGIDDPKSIESMVKKQRESGVTLSTFGVGNSNYNEAMMVRIADVGNGNYSYIDTLSEAQ 383 N+ +D + + V + TFG G S+++ A M IA++ +G +S+ID Q Sbjct: 358 NL-LDYSALVPPSILPGTGHHVQIHTFGFG-SDHDSAAMHAIAEISSGTFSFIDAEGSIQ 415 Query: 384 KVLNSEMRQMLITVAKDVKAQIEF--NPAWVTEYRQIGYEKRQLRVEHFNNDNVDAGDIG 441 + +L V K+++ IE N +T + GY + E+ +VD GD+ Sbjct: 416 DGFAQCIGGLLSVVVKEMRLGIECVDNGVLLTSIKSGGYTSQV--AENGRGGSVDIGDLY 473 Query: 442 AGKHITLLFEL 452 A + L L Sbjct: 474 ADEERGFLLTL 484 >UniRef50_B9GK57 Predicted protein (Fragment) n=2 Tax=Populus trichocarpa RepID=B9GK57_POPTR Length = 595 Score = 68.2 bits (165), Expect = 7e-10, Method: Compositional matrix adjust. Identities = 51/202 (25%), Positives = 101/202 (50%), Gaps = 15/202 (7%) Query: 216 NLVFLIDTSGSMISDERLPLIQSSLKLLVKELREQDNIAIVTYAGDSRIALP--SISGSH 273 ++V ++D SGSM +L L++ ++ +++ L D ++IVT++ +R LP ++SGS Sbjct: 156 DIVNVLDVSGSMAG--KLILLKRAVNFIIQNLGPSDRLSIVTFSSSARRILPLRTMSGSG 213 Query: 274 KAEINAAIDSLDAEGSTNGGAGLELAYQQATKGFIKGGINRILLATDGDFNVGIDDPKSI 333 + + + ++SL A G TN AGL + + + I+L +DG + Sbjct: 214 REDAISVVNSLSATGGTNIVAGLRKGVRVLEERRQHNSVASIILLSDGCDTQSHSTHNRL 273 Query: 334 ESMV----------KKQRESGVTLSTFGVGNSNYNEAMMVRIADVGNGNYSYIDTLSEAQ 383 E + ++ R+ + TFG G +++ A M I+DV G +S+I+++ Q Sbjct: 274 EYLKLIFPSNNASGEESRQPTFPIHTFGFG-LDHDSAAMHAISDVSGGTFSFIESIDILQ 332 Query: 384 KVLNSEMRQMLITVAKDVKAQI 405 + + VA+DV+ ++ Sbjct: 333 DAFARCIGGLTSIVARDVQLKV 354 >UniRef50_A9WKF3 von Willebrand factor type A n=3 Tax=Chloroflexus RepID=A9WKF3_CHLAA Length = 446 Score = 68.2 bits (165), Expect = 8e-10, Method: Compositional matrix adjust. Identities = 61/225 (27%), Positives = 104/225 (46%), Gaps = 21/225 (9%) Query: 239 SLKLLVKELREQDNIAIVTYAGDSRIALPSISGSHKAEINAAIDSLDA---EGSTNGGAG 295 +L L++ L D + ++ A D+ + I GS +AE+ AAI L A +TN G Sbjct: 105 ALHSLIERLDHNDRLGLIACASDAIVLASGIPGSRRAELVAAIARLPALRLGETTNLAQG 164 Query: 296 LELAYQQATKGFIK---GGINRILLATDGDFNVGIDDPKSIESMVKKQRESGVTLSTFGV 352 L+LA Q F+ + RI+L TDG F D ++ ++ G++LST G+ Sbjct: 165 LQLALAQ----FVAADDATVRRIVLITDG-FTT---DQTLCLTLAREAAARGISLSTIGL 216 Query: 353 GNSNYNEAMMVRIADVGNGNYSYIDTLSEAQKVLNSEMRQMLITVAKDVKAQIEFNPAWV 412 G S + E ++ ++AD+ G S++ ++ ++ +E+ T A+ + Q P V Sbjct: 217 GGS-FEEHLLTQLADLSGGRASFVYDAADIPAIIAAELESARQTTAQALTLQCNL-PQTV 274 Query: 413 TEYRQIGYEK-----RQLRVEHFNNDNVDAGDIGAGKHITLLFEL 452 + R I L EH + GD+ G+ + LL E Sbjct: 275 SLRRIIRLTPALTVLNPLSTEHGRRLTIHLGDLRHGEEVRLLVEF 319 >UniRef50_C1YR26 Uncharacterized protein containing a von Willebrand factor type A (VWA) domain n=1 Tax=Nocardiopsis dassonvillei subsp. dassonvillei DSM 43111 RepID=C1YR26_NOCDA Length = 505 Score = 68.2 bits (165), Expect = 9e-10, Method: Compositional matrix adjust. Identities = 55/196 (28%), Positives = 92/196 (46%), Gaps = 5/196 (2%) Query: 215 SNLVFLIDTSGSMISDERLPLIQSSLKLLVKELREQDNIAIVTYAGDSRIALPSISGSHK 274 + L ++D SGSM RL +L LV+ L DN +V++ +R+ +P K Sbjct: 40 ATLQVVLDRSGSM-GGGRLDGAVRALLSLVERLAPSDNFGLVSFNDQARVEVPCGPLEDK 98 Query: 275 AEINAAIDSLDAEGSTNGGAGLELAYQQATK-GFIKGGINRILLATDGDFNVGIDDPKSI 333 A + I L A G T+ +GL Q+A + G +GG +LL +DG N G+ D + Sbjct: 99 ARVRRLISGLHASGGTDLSSGLLRGVQEARRAGADRGGT--LLLISDGHANQGVTDHDLL 156 Query: 334 ESMVKKQRESGVTLSTFGVGNSNYNEAMMVRIADVGNGNYSYIDTLSEAQKVLNSEMRQM 393 + GVT ++ G G Y+E ++ +AD G G+ + + A ++ E + Sbjct: 157 RQVAADAYAHGVTTTSLGYG-LGYDEELLGAVADGGAGSALFAEDPDTAGGLIAREAEYL 215 Query: 394 LITVAKDVKAQIEFNP 409 L A+ V ++ P Sbjct: 216 LAKTAQAVSLRVPSGP 231 >UniRef50_B8F8Z6 von Willebrand factor type A n=1 Tax=Desulfatibacillum alkenivorans AK-01 RepID=B8F8Z6_DESAA Length = 480 Score = 67.4 bits (163), Expect = 1e-09, Method: Compositional matrix adjust. Identities = 69/259 (26%), Positives = 120/259 (46%), Gaps = 30/259 (11%) Query: 216 NLVFLIDTSGSMISDERLPLIQSSLKLLVKELREQDNIAIVTYA-----GDSRIALPSIS 270 ++V ++D SGSM +++ ++++K LV+ LR QD ++VTY+ GD L ++ Sbjct: 94 DMVIVLDRSGSM-GGQKVRDAKAAVKGLVEGLRSQDRFSLVTYSNSVNGGD---GLHYLT 149 Query: 271 GSHKAEINAAIDSLDAEGSTNG------GAGLELAYQQATKGFIKGGINRILLATDGDFN 324 + +N +DS+ A G TN G G+ AY + + +++L +DG N Sbjct: 150 ADKRNSLNWMVDSIPAGGGTNLGGGLEKGVGVLRAYGAPDR------MGKVILISDGQAN 203 Query: 325 VGIDDPKSIESMVKKQRESGV--TLSTFGVGNSNYNEAMMVRIADVGNGNYSYIDTLSEA 382 G+ DP + +M R+ G+ +++T G+G ++NE +M +AD G G Y Y++ + Sbjct: 204 QGVTDPNQLAAMA-ALRDDGLVYSVTTVGIGQ-DFNEQLMATVADGGRGRYYYLENPGDF 261 Query: 383 QKVLNSEMRQMLITVAKDVKAQIEFNPAWVTEYRQIGYEKRQLRVEHFNNDNVDAGDIGA 442 V E A + + P VT GY + N + G + + Sbjct: 262 LAVFQEEANWTRAVAASALSIHLPL-PKGVTAVSANGYPV----INKENGAFISPGALLS 316 Query: 443 GKHITLLFELTLNGQKASI 461 G+ TL L N A I Sbjct: 317 GQSRTLYIRLHANDDAAEI 335 >UniRef50_B9SJS6 Protein binding protein, putative n=6 Tax=Magnoliophyta RepID=B9SJS6_RICCO Length = 540 Score = 67.4 bits (163), Expect = 2e-09, Method: Compositional matrix adjust. Identities = 62/249 (24%), Positives = 115/249 (46%), Gaps = 21/249 (8%) Query: 168 DKQSIPASKPIP--FAMRYELAP-----APWNEQRTLLKVDILAKDRKSEELPASNLVFL 220 D++ + S+P P R +L AP E + + +++ D S P +LV + Sbjct: 46 DEKIVTRSRPTPPIVPARVKLRSINNDMAPLEESKLKVMLELTGGDSSSYGRPGLDLVAV 105 Query: 221 IDTSGSMISDERLPLIQSSLKLLVKELREQDNIAIVTYAGDSRIALP--SISGSHKAEIN 278 +D S SM D ++ +++++ ++K+L D ++IVT++G + P +G + E Sbjct: 106 LDVSRSMEGD-KMEKMKTAMLFIIKKLGPTDRLSIVTFSGGANRLCPLRQTTGKSQEEFE 164 Query: 279 AAIDSLDAEGSTNGGAGLELAYQQATKGFIKGG--INRILLATDGDFNVGIDDPKSIESM 336 I+ L+A+G+TN AGL+ A + KG G + I+L +DG+ N G D Sbjct: 165 NLINGLNADGATNITAGLQTAL-KVLKGRSFNGERVVGIMLMSDGEQNAGSD-------- 215 Query: 337 VKKQRESGVTLSTFGVGNSNYNEAMMVRIADVGNGNYSYIDTLSEAQKVLNSEMRQMLIT 396 V + TFG G ++ + + + G +S + + K + +L Sbjct: 216 ATGVSVGNVPIHTFGFGINHEPKGLKAIAHNSIGGTFSDVQNIDSLTKAFAQCLAGLLTV 275 Query: 397 VAKDVKAQI 405 V +D+K I Sbjct: 276 VVQDLKMTI 284 >UniRef50_Q2GTB7 Putative uncharacterized protein n=1 Tax=Chaetomium globosum RepID=Q2GTB7_CHAGB Length = 777 Score = 67.0 bits (162), Expect = 2e-09, Method: Compositional matrix adjust. Identities = 55/212 (25%), Positives = 93/212 (43%), Gaps = 23/212 (10%) Query: 215 SNLVFLIDTSGSMISDERLP-----------------LIQSSLKLLVKELREQDNIAIVT 257 +LV ID SGSM + P L++ + + +V L +D + IVT Sbjct: 73 CDLVLSIDISGSMADEAPAPSKPGGEAGEDTGLRVIDLVKHAARTIVATLDSRDRLGIVT 132 Query: 258 YAGDSRIALPSISGSHKAEINAAIDSLDAEGSTNGGAGLE--LAYQQATKGFIKGGINRI 315 + S++ +P +KA+ I+S++ STN G+ L+ +G G + + Sbjct: 133 FTNRSKVGIPPYE--NKAKTLENIESMEPFSSTNMWHGIRDGLSLFSEAEGGSTGRVPAL 190 Query: 316 LLATDGDFNVGIDDPKSIESMVKKQRESGVTLSTFGVGNSNYNEAMMVRIADVGNGNYSY 375 L+ TDG N + PK M++ T+ TFG G ++ IA+VG GNYS+ Sbjct: 191 LVLTDGMPNY-MCPPKGYVPMLRSMEPLPATIHTFGFGY-ELRSGLLKSIAEVGGGNYSF 248 Query: 376 IDTLSEAQKVLNSEMRQMLITVAKDVKAQIEF 407 I V + + T A + K ++ + Sbjct: 249 IPDAGMLGTVFIHAVAHLQSTFANNAKLRLTY 280 >UniRef50_C5WYV0 Putative uncharacterized protein Sb01g047470 n=3 Tax=Sorghum bicolor RepID=C5WYV0_SORBI Length = 686 Score = 67.0 bits (162), Expect = 2e-09, Method: Compositional matrix adjust. Identities = 55/218 (25%), Positives = 100/218 (45%), Gaps = 19/218 (8%) Query: 204 AKDRKSEELPASNLVFLIDTSGSMISDERLPLIQSSLKLLVKELREQDNIAIVTYAGDSR 263 A DR + P +LV ++D SGSM D +L L++ ++ ++ L D +++V+++ +R Sbjct: 156 AGDRDAPRAPL-DLVTVLDVSGSMRWD-KLALVKQAMGFVIGSLGPHDRLSVVSFSSGAR 213 Query: 264 --IALPSISGSHKAEINAAIDSLDAEGSTNGGAGLELAYQQATKGFIKGGINRILLATDG 321 L +S + K+ A++SL A G TN GL A + + + ++ ++L +DG Sbjct: 214 RVTRLLRMSHTGKSLATEAVESLRAGGGTNIAEGLRTAAKVLGERRHRNAVSSVILLSDG 273 Query: 322 DFNVGIDD--------------PKSIESMVKKQRESGVTLSTFGVGNSNYNEAMMVRIAD 367 N + P S E + TFG GN + AM V +A+ Sbjct: 274 HDNYSMPRRARGGVPPNYEVLVPPSFVPGTASTGEGSAPIHTFGFGNDHDAAAMHV-VAE 332 Query: 368 VGNGNYSYIDTLSEAQKVLNSEMRQMLITVAKDVKAQI 405 G +S+I+ + Q + +L VA++ + I Sbjct: 333 ATGGTFSFIENEAVIQDAFAQCIGGLLTVVAQEARVAI 370 >UniRef50_UPI0001926ED6 PREDICTED: similar to inter-alpha trypsin inhibitor, heavy chain 3, partial n=1 Tax=Hydra magnipapillata RepID=UPI0001926ED6 Length = 464 Score = 66.6 bits (161), Expect = 2e-09, Method: Compositional matrix adjust. Identities = 56/214 (26%), Positives = 102/214 (47%), Gaps = 38/214 (17%) Query: 216 NLVFLIDTSGSMISDERLPLIQSSLKLLVKELREQDNIAIVTYAGDSRI----ALPSISG 271 +LV +ID SGSM + E+L L++ +L+ +V +L E+D + ++T+ D+ + L ++ Sbjct: 52 DLVVVIDKSGSM-AGEKLALVKKTLEFVVSQLNEKDRLCLITF--DTSVYLDFKLTPMTP 108 Query: 272 SHKAEINAAIDSLDAEGSTNGGAGLELAYQQA-----------------TKGFI-KGGIN 313 +K + I + TN GL + T GF KGG+ Sbjct: 109 MNKYQTLKIIKDISPGSMTNLCGGLMKGLCEVIDRADEEKNEVASVLLFTDGFANKGGLT 168 Query: 314 RILLATD--GDFNVGIDDPKSIESMVKKQRESGVTLSTFGVGNSNYNEAMMVRIADVGNG 371 I ++ + +GI PK+ ++ ++ TFG G SN+N M+ I+D G+G Sbjct: 169 NIYCSSSQTAKYTIGIVGPKTADA----------SIYTFGFG-SNHNAQMLKEISDAGSG 217 Query: 372 NYSYIDTLSEAQKVLNSEMRQMLITVAKDVKAQI 405 Y YI+ + + + +L TVA+ ++ +I Sbjct: 218 MYYYIENVDMIAEAFGQCLGGLLSTVAQGIQVEI 251 >UniRef50_Q7UNM0 Putative uncharacterized protein n=1 Tax=Rhodopirellula baltica RepID=Q7UNM0_RHOBA Length = 900 Score = 66.2 bits (160), Expect = 3e-09, Method: Compositional matrix adjust. Identities = 48/182 (26%), Positives = 86/182 (47%), Gaps = 9/182 (4%) Query: 208 KSEELPASNLVFLIDTSGSMISDERLPLIQSSLKLLVKELREQDNIAIVTYAGDSRIALP 267 K E P+ ++ +ID SGSM +++ L + + + V+ L +D I ++ + GDS Sbjct: 456 KEREKPSLAMMLVIDKSGSM-GGQKIELAKDAAQAAVELLGPKDAIGVIAFDGDSYTVSE 514 Query: 268 SISGSHKAEINAAIDSLDAEGSTNGGAGLELAYQQATKGFIKGGINRILLATDGDFNVGI 327 S S + I+ AI +++A G TN + AY+ K + ++L TD G+ Sbjct: 515 LRSTSDRGAISDAISTIEASGGTNMYPAMADAYEALLGATAK--LKHVILMTD-----GV 567 Query: 328 DDPKSIESMVKKQRESGVTLSTFGVGNSNYNEAMMVRIADVGNGNYSYIDTLSEAQKVLN 387 P + + S +TLST +G + +E ++ +A +G G Y + D +V Sbjct: 568 SSPGDFQGVAGDMSASRITLSTVALGQGS-SEDLLEELAQIGGGRYYFCDDPQSVPQVFA 626 Query: 388 SE 389 E Sbjct: 627 KE 628 >UniRef50_UPI00006CDDCC von Willebrand factor type A domain containing protein n=1 Tax=Tetrahymena thermophila RepID=UPI00006CDDCC Length = 2138 Score = 66.2 bits (160), Expect = 3e-09, Method: Compositional matrix adjust. Identities = 50/170 (29%), Positives = 91/170 (53%), Gaps = 14/170 (8%) Query: 216 NLVFLIDTSGSMISDERLPLIQSSLKLLVKELREQDNIAIVTYAGDSRIALPSISGSHKA 275 +L+ +IDTSGSM + + L L++ +L LV L+ D I ++ ++ +++ P +S K Sbjct: 1447 DLICVIDTSGSM-NGQPLDLLKETLLFLVDLLQTGDRICLIQFSTNAQRLTPLLSIESKD 1505 Query: 276 EINA---AIDSLDAEGSTNGGAGLELAYQQATKGFIKGGINRILLATDGDFNVGIDDPKS 332 I + I+ L A+G TN G++LA+ + K I + L +DG N G ++ Sbjct: 1506 NIKSIKNEINRLVAKGGTNICQGMQLAFDVLKQRRYKNPITSVFLLSDG-LNDGAENK-- 1562 Query: 333 IESMVKK------QRESGVTLSTFGVGNSNYNEAMMVRIADVGNGNYSYI 376 I ++K+ E T+ TFG G +++ +M +I+ + +GN+ YI Sbjct: 1563 IRDLLKQLNFYQNYNEENFTIQTFGFG-KDHDPNLMDKISQLMDGNFYYI 1611 >UniRef50_D1HHA4 Whole genome shotgun sequence of line PN40024, scaffold_125.assembly12x (Fragment) n=1 Tax=Vitis vinifera RepID=D1HHA4_VITVI Length = 630 Score = 64.7 bits (156), Expect = 8e-09, Method: Compositional matrix adjust. Identities = 48/199 (24%), Positives = 102/199 (51%), Gaps = 10/199 (5%) Query: 216 NLVFLIDTSGSMISDERLPLIQSSLKLLVKELREQDNIAIVTYAGDSRIALP--SISGSH 273 +LV ++D SGSM + +L L++ ++ L++ L D ++IV+++ +R P +S + Sbjct: 206 DLVAVLDVSGSM-AGSKLSLLKRAVCFLIQNLGPSDRLSIVSFSSTARRIFPLRRMSDNG 264 Query: 274 KAEINAAIDSLDAEGSTNGGAGLELAYQQATKGFIKGGINRILLATDGDFNVGIDD---- 329 + AI+SL + G TN GL+ + + + + I+L +DG D+ Sbjct: 265 REAAGLAINSLTSSGGTNIVEGLKKGVRVLEERSEQNPVASIILLSDGKDTYNCDNVNRR 324 Query: 330 --PKSIESMVKKQRESGVTLSTFGVGNSNYNEAMMVRIADVGNGNYSYIDTLSEAQKVLN 387 S ++ R++ + + TFG G S+++ M I+D G +S+I++++ Q Sbjct: 325 QTSHCASSNPRQGRQAIIPVHTFGFG-SDHDSTAMHAISDESGGTFSFIESVATVQDAFA 383 Query: 388 SEMRQMLITVAKDVKAQIE 406 + +L VA++++ ++ Sbjct: 384 MCIGGLLSVVAQELRLTVK 402 >UniRef50_C5YMJ6 Putative uncharacterized protein Sb07g002215 (Fragment) n=1 Tax=Sorghum bicolor RepID=C5YMJ6_SORBI Length = 423 Score = 64.7 bits (156), Expect = 9e-09, Method: Compositional matrix adjust. Identities = 59/238 (24%), Positives = 109/238 (45%), Gaps = 27/238 (11%) Query: 216 NLVFLIDTSGSMISDERLPLIQSSLKLLVKELREQDNIAIVTYAGDSR--IALPSISGSH 273 +LV ++D SGSM + +++ ++ ++ L+ L D +++V ++ D+R I L +S Sbjct: 126 DLVTVLDVSGSM-AGKKMERVKRAMGFLIDNLGSDDRLSVVAFSTDARRIIRLTRMSDDG 184 Query: 274 KAEINAAIDSLDAEGSTNGGAGLELAYQQATKGFIKGGINRILLATDGDFNVGIDD---- 329 KA A++SL A GSTN GL++A K + ++L +DG N + Sbjct: 185 KAAAKRAVESLAASGSTNIRGGLDVAAMVLDGRRHKNAVASVILLSDGQDNQSMHHEYLP 244 Query: 330 ----PKSIESMVKK----------QRESG-----VTLSTFGVGNSNYNEAMMVRIADVGN 370 PK + K QR +G VT+ TFG G +++ A M I++V Sbjct: 245 TSWVPKHSPAFSKGGYDVLVPPSFQRTAGGDHRCVTVHTFGFG-IDHDAAAMHYISEVTG 303 Query: 371 GNYSYIDTLSEAQKVLNSEMRQMLITVAKDVKAQIEFNPAWVTEYRQIGYEKRQLRVE 428 +S+I+ + Q + +L + + +E + +IG+ R + V+ Sbjct: 304 STFSFIENHAVIQDAFARCIGGLLSVAVQKARISLECGASTPAYESRIGWGGRAVTVD 361 >UniRef50_Q231J4 von Willebrand factor type A domain containing protein n=1 Tax=Tetrahymena thermophila SB210 RepID=Q231J4_TETTH Length = 520 Score = 64.7 bits (156), Expect = 9e-09, Method: Compositional matrix adjust. Identities = 53/252 (21%), Positives = 120/252 (47%), Gaps = 27/252 (10%) Query: 216 NLVFLIDTSGSMISDERLPLIQSSLKLLVKELREQDNIAIVTYAGDSRI--ALPSISGSH 273 +L+F+IDTSGSM +++ L++ S+ ++ ++ D I++V + +++ L ++ + Sbjct: 97 DLIFVIDTSGSM-QGKKIELVKKSILQVLHIIQGDDRISLVGFNSQAKVLLELTQLTKNS 155 Query: 274 KAEINAAIDSLDAEGSTNGGAGLELAYQQATKGFIKGGINRILLATDGDFNVGIDDPKSI 333 K +I +D L A G T G G++ A+ + + I L +DG N G + Sbjct: 156 KKKIQKTVDELQAGGGTQIGFGMQKAFDIIKERTNSKNLASIFLLSDGQDNCGFSQTQHF 215 Query: 334 ESMVKKQRESGVTLSTFGVGNSNYNEAMMVRIADVGNGNYSYIDTLSEAQKVLNSEMRQM 393 M + + E + FG G+ +++ + +I + G +++I +S+ + + Sbjct: 216 --MNQSKIEYPFCIDCFGFGD-DHDSLTLSKINQLQQGTFNFIRDISQIDDAFTIILAGI 272 Query: 394 LITVAKDVKAQIEF-----------NPAWVTEYRQIGYEKRQLRVEHFNNDNVDAGDIGA 442 VA++VK + F + + +E+++I ++ ++++ H + A Sbjct: 273 KTFVAQNVKISVNFGNTELMNGITVSKTYGSEWKKIQDKQYEIQLNH----------LMA 322 Query: 443 GKHITLLFELTL 454 G+ +FEL + Sbjct: 323 GRSKDFVFELQI 334 >UniRef50_Q22SJ4 von Willebrand factor type A domain containing protein n=8 Tax=Tetrahymena thermophila RepID=Q22SJ4_TETTH Length = 646 Score = 64.3 bits (155), Expect = 1e-08, Method: Compositional matrix adjust. Identities = 44/198 (22%), Positives = 100/198 (50%), Gaps = 4/198 (2%) Query: 213 PASNLVFLIDTSGSMISDERLPLIQSSLKLLVKELREQDNIAIVTYAGDSRIA--LPSIS 270 P+ +LV +ID SGSM E++ ++++L L+ L D ++++ + + L + Sbjct: 206 PSIDLVCVIDNSGSM-QGEKIQNVKTTLLQLLDMLNSNDRLSLILFNSYPTLLCNLRKVD 264 Query: 271 GSHKAEINAAIDSLDAEGSTNGGAGLELAYQQATKGFIKGGINRILLATDGDFNVGIDDP 330 + I + I+S+ A+G T+ +G+ +A+ K ++ I L +DG N + Sbjct: 265 DENTPNIQSIINSITADGGTDINSGMLMAFNILQKRQFFNPVSSIFLLSDGQDNGADEKI 324 Query: 331 KSIESMVKKQRESGVTLSTFGVGNSNYNEAMMVRIADVGNGNYSYIDTLSEAQKVLNSEM 390 K + + + ++ +FG G S+++ +M RI + +GN+ Y++ +++ + + Sbjct: 325 KKYINSNQSLKNECFSIHSFGFG-SDHDGPLMNRICQLKDGNFYYVEKINQVDEFFVDAL 383 Query: 391 RQMLITVAKDVKAQIEFN 408 + VA+++ +I N Sbjct: 384 GGLFSVVAQEILIEINLN 401 >UniRef50_A5B5Z1 Putative uncharacterized protein n=2 Tax=Vitis vinifera RepID=A5B5Z1_VITVI Length = 686 Score = 64.3 bits (155), Expect = 1e-08, Method: Compositional matrix adjust. Identities = 62/270 (22%), Positives = 124/270 (45%), Gaps = 34/270 (12%) Query: 216 NLVFLIDTSGSMISDERLPLIQSSLKLLVKELREQDNIAIVTYAGDSRIALP--SISGSH 273 +LV ++D SGSM + +L L++ ++ L++ L D ++IV+++ +R P +S + Sbjct: 204 DLVAVLDVSGSM-AGSKLSLLKRAVCFLIQNLGPSDRLSIVSFSSTARRIFPLRRMSDNG 262 Query: 274 KAEINAAIDSLDAEGSTNGGAGLELAYQQATKGFIKGGINRILLATDGDFNVGIDD---- 329 + AI+SL + G TN GL+ + + + + I+L +DG D+ Sbjct: 263 REAAGLAINSLXSSGGTNIVEGLKKGVRVLEERSEQNPVASIILLSDGKDTYNCDNVNRR 322 Query: 330 ---------PKSI--------ESMVKKQRESG-------VTLSTFGVGNSNYNEAMMVRI 365 P+ + S+ + RESG + + TFG G S+++ M I Sbjct: 323 QTSHCASSNPRQVLEYLNLLPASICPRNRESGDEGRQAIIPVHTFGFG-SDHDSTAMHAI 381 Query: 366 ADVGNGNYSYIDTLSEAQKVLNSEMRQMLITVAKDVKAQIE-FNPAWVTEYRQIGYEKRQ 424 +D G +S+I++++ Q + +L VA++++ ++ +P E G + Sbjct: 382 SDESGGTFSFIESVAXVQDAFAMCIGGLLSVVAQELRLTVKSVSPGVHIESIPSGKYLSE 441 Query: 425 LRVEHFNNDNVDAGDIGAGKHITLLFELTL 454 + + +D GD+ A + L LT+ Sbjct: 442 I-CDQGQQGVIDVGDLYAEEGKEFLIYLTV 470 >UniRef50_A0C5K4 Chromosome undetermined scaffold_150, whole genome shotgun sequence n=1 Tax=Paramecium tetraurelia RepID=A0C5K4_PARTE Length = 611 Score = 63.9 bits (154), Expect = 2e-08, Method: Compositional matrix adjust. Identities = 48/174 (27%), Positives = 90/174 (51%), Gaps = 8/174 (4%) Query: 209 SEELPASNLVFLIDTSGSMISDERLPLIQSSLKLLVKELREQDNIAIVTYAGDSRIALP- 267 +E +L+ LID S SM S + + +++ SL LL+ L EQD + I+T+ ++ P Sbjct: 184 TERTIGIDLICLIDKSMSM-SGDNINMVKKSLLLLLDFLGEQDRLQIITFNEHAQRLTPL 242 Query: 268 -SISGSHKAEINAAIDSLDAEGSTNGGAGLELAYQQATKGFIKGGINRILLATDGDFNVG 326 ++ +K A I + AEG T + +A++Q + + + + L +DG Sbjct: 243 KCLTEKNKQYFQAVISQISAEGLTKISSATYIAFKQLKEKVYRNNVTSVFLLSDGH---D 299 Query: 327 IDDPKSIESMVKKQRESGVTLSTFGVGNSNYNEAMMVRIADVGNGNYSYIDTLS 380 D I ++ +E T+STFG G+ +++ MM I+++ NGN+ Y+ ++ Sbjct: 300 GDALFEISDQIRHVKEV-FTISTFGFGD-DHDAQMMTSISNLKNGNFYYVKDIT 351 >UniRef50_A8IJ40 Predicted protein n=1 Tax=Chlamydomonas reinhardtii RepID=A8IJ40_CHLRE Length = 434 Score = 63.5 bits (153), Expect = 2e-08, Method: Compositional matrix adjust. Identities = 54/212 (25%), Positives = 93/212 (43%), Gaps = 48/212 (22%) Query: 217 LVFLIDTSGSMISDERLPLIQSSLKLLVKELREQDNIAIVTYAGDSRIALP--SISGSHK 274 L ++D SGSM S ER+ L++ + L+ +L D + IV+Y+G R +P ++ + + Sbjct: 172 LTCVLDRSGSM-SGERIALVRETCHFLIDQLTPDDYLGIVSYSGGVRADVPLLRMTPAAR 230 Query: 275 AEINAAIDSLDAEGSTNGGAGLELAYQQATKGFIKGGINRILLATDGDFNVGIDDPKSIE 334 +A +D+L+A+GST G + G Sbjct: 231 GLAHAMVDALEADGST-----------ALYDGLVAG------------------------ 255 Query: 335 SMVKKQRES------GVTLSTFGVGNSNYNEAMMVRIADVGNGNYSYIDTLSEAQKVLNS 388 V++Q E+ VT+ TFG G + E ++ +AD +G Y YI + + Sbjct: 256 --VRQQMEAEAPTDQHVTVHTFGFGAGHSVE-LLQAVADAQSGVYYYISCVDDIPSGFGD 312 Query: 389 EMRQMLITVAKDVKAQIEFNPAW-VTEYRQIG 419 + +L VAKDV+ + P +T +R G Sbjct: 313 ALGGLLAVVAKDVRVGVRAAPGINLTAFRSGG 344 >UniRef50_Q22ST4 von Willebrand factor type A domain containing protein n=1 Tax=Tetrahymena thermophila SB210 RepID=Q22ST4_TETTH Length = 648 Score = 63.5 bits (153), Expect = 2e-08, Method: Compositional matrix adjust. Identities = 48/200 (24%), Positives = 98/200 (49%), Gaps = 14/200 (7%) Query: 216 NLVFLIDTSGSMISDERLPLIQSSLKLLVKELREQDNIAIVTY--AGDSRIALPSISGSH 273 +LV +ID SGSM E++ L++ +L ++ + D I IV + +GD + ++ + Sbjct: 223 DLVVVIDKSGSM-EGEKIQLVKETLVKIINLMSSMDRICIVCFNESGDRPLTFTRVTDEN 281 Query: 274 KAEINAAIDSLDAEGSTNGGAGLELAYQQATKGFIKGGINRILLATDGDFNVGIDDPKS- 332 K + I + A G TN G+ A + K + ILL +DG D K+ Sbjct: 282 KQTLLNLIQQIYAGGGTNISEGINHALKAIQNRKFKNNVTSILLLSDG------QDTKAY 335 Query: 333 --IESMVKK-QRESGVTLSTFGVGNSNYNEAMMVRIADVGNGNYSYIDTLSEAQKVLNSE 389 +++ + K Q + + T G G +++ ++ ++D+ NG ++++ ++ + Sbjct: 336 TRVKAYIDKYQIKDAFNIETIGFG-EDHDPKLLRTLSDLRNGTFNFMQDVNYLDTAFINI 394 Query: 390 MRQMLITVAKDVKAQIEFNP 409 M+ TVA+++K ++F P Sbjct: 395 FAGMISTVAQNIKVGVKFTP 414 >UniRef50_Q9M1S2 Putative uncharacterized protein T5N23_140 n=5 Tax=Arabidopsis thaliana RepID=Q9M1S2_ARATH Length = 676 Score = 63.5 bits (153), Expect = 2e-08, Method: Compositional matrix adjust. Identities = 64/279 (22%), Positives = 132/279 (47%), Gaps = 15/279 (5%) Query: 216 NLVFLIDTSGSMISDERLPLIQSSLKLLVKELREQDNIAIVTYAGDSRIALP--SISGSH 273 +LV ++D SGSM +L L++ ++ +++ L D ++++ ++ +R P +S + Sbjct: 244 DLVTVLDISGSM-GGTKLALLKRAMGFVIQNLGSSDRLSVIAFSSTARRLFPLTRMSDAG 302 Query: 274 KAEINAAIDSLDAEGSTNGGAGLELAYQQATKGFIKGGINRILLATDGDFNVGIDDPKSI 333 + A++SL A G TN GL + + + I+L +DG + P Sbjct: 303 RQLALQAVNSLVANGGTNIVDGLRKGAKVMEDRLERNSVASIILLSDGRDTYTTNHPDPS 362 Query: 334 ESMVKKQRESGVTLSTFGVGNSNYNEAMMVRIADVGNGNYSYIDTLSEAQKVLNSEMRQM 393 ++ Q +++ +FG G S+++ ++M +++V G +S+I++ S Q L + + Sbjct: 363 YKVMLPQ----ISVHSFGFG-SDHDASVMHSVSEVSGGTFSFIESESVIQDALAQCIGGL 417 Query: 394 LITVAKDVKAQIE-FNP-AWVTEYRQIGYEKRQLRVEHFNNDNVDAGDIGAGKHITLLFE 451 L ++++ +IE +P ++ + Y L ++ VD GD+ A + L Sbjct: 418 LSVAVQELRVEIEGVSPNVRLSSIKAGSYS--SLVTGDGHSGLVDLGDLYADEERDFLVS 475 Query: 452 LTLNGQK---ASIDKLRYAPDNKLAKSDKTKELAWLKIR 487 + + ++ + KLR N L K T E L+IR Sbjct: 476 INIPVEEDGHTPLLKLRCLYINPLTKEITTLESHVLQIR 514 >UniRef50_Q9LN03 T6D22.13 n=2 Tax=Arabidopsis thaliana RepID=Q9LN03_ARATH Length = 641 Score = 63.2 bits (152), Expect = 2e-08, Method: Compositional matrix adjust. Identities = 73/313 (23%), Positives = 144/313 (46%), Gaps = 29/313 (9%) Query: 198 LKVDILAKDRKSEELPASNLVFLIDTSGSMISDERLPLIQSSLKLLVKELREQDNIAIVT 257 LK + ++ D + P +L+ ++D SGSM ++ L+++++ +++ L E D +++++ Sbjct: 187 LKAEGVSDDARRARAPL-DLITVLDVSGSM-DGVKMELMKNAMSFVIQNLGETDRLSVIS 244 Query: 258 YAGDSRIALPS--ISGSHKAEINAAIDSLDAEGSTNGGAGLELAYQQATKGFIKGGINRI 315 ++ +R P +S + K A++SL A+G TN GL++ + K ++ + Sbjct: 245 FSSMARRLFPLRLMSETGKQAAMQAVNSLVADGGTNIAEGLKIGARVIEGRRWKNPVSGM 304 Query: 316 LLATDGDFN-----VGIDDPKSIESMVKKQRESGVTLSTFGVGNSNYNEAMMVRIADVGN 370 +L +DG N G+ ES++ + + TFG G S+++ +M I++V + Sbjct: 305 MLLSDGQDNFTFSHAGVRLRTDYESLLPSS--CRIPIHTFGFG-SDHDAELMHTISEVSS 361 Query: 371 GNYSYIDTLSEAQKVLNSEMRQMLITVAKDVKAQIEFNPAWVTEYRQIGYEKRQLRVE-H 429 G +S+I+T + Q + +L V + +IE + I + R+ Sbjct: 362 GTFSFIETETVIQDAFAQCIGGLLSVVILEQVVEIECIHEQGLKISSIKAGSYRSRIAPD 421 Query: 430 FNNDNVDAGDIGAGKHITLLFELTLNGQKASIDKLRYAPDNKLAKSDKTKELAWLKIRWK 489 +D GD+ A + L L + DN S +++ L+ LK+R Sbjct: 422 ARTATIDVGDMYAEEERDFLVLLEIP-----------CCDN---GSGESESLSLLKVRCV 467 Query: 490 Y--PQGKESQLVE 500 Y P KE VE Sbjct: 468 YKDPVTKEIVHVE 480 >UniRef50_Q9U7P4 Putative uncharacterized protein (Fragment) n=1 Tax=Eufolliculina uhligi RepID=Q9U7P4_9CILI Length = 494 Score = 63.2 bits (152), Expect = 2e-08, Method: Compositional matrix adjust. Identities = 58/240 (24%), Positives = 116/240 (48%), Gaps = 13/240 (5%) Query: 180 FAMRYELAPAPWNEQRTLLKVDILAKDRKSE-ELPASNLVFLIDTSGSMISDERLPLIQS 238 FA Y L +P Q +++ + + SE ++V +ID SGSM E++ L+Q+ Sbjct: 54 FAFNY-LQLSPEKAQEIPCTINLESPAQTSEASRSGVDIVCVIDVSGSM-QGEKIQLVQT 111 Query: 239 SLKLLVKELREQDNIAIVTYAGD-SRIA-LPSISGSHKAEINAAIDSLDAEGSTNGGAGL 296 +L +V+ L D I +++++ D ++I+ L +S K ++ + I L A G TN GL Sbjct: 112 TLNFMVERLSPADRICLISFSNDATKISRLVQMSPKGKKQLKSMIPRLVASGGTNIVGGL 171 Query: 297 ELAYQQATKGFIKGGINRILLATDGDFNVGID----DPKSIESMVKKQRESGVTLSTFGV 352 E Q + ++ I+L +DG N G +++S+V + S + TFG Sbjct: 172 EYGLQALRQRRTINQLSSIILLSDGQDNNGTTVLQRAKATMDSIVIRDDYS---VHTFGY 228 Query: 353 GNSNYNEAMMVRIADVGNGNYSYIDTLSEAQKVLNSEMRQMLITVAKDVKAQIEFNPAWV 412 G+ ++ ++ +A+ NG + Y+ + + +++ VA ++ ++ P + Sbjct: 229 GHG-HDSTLLNALAEPKNGAFYYVKDEETIATAFANCLGELMSVVADQIEVKLMTQPTEI 287 >UniRef50_B7G6X2 Predicted protein n=1 Tax=Phaeodactylum tricornutum CCAP 1055/1 RepID=B7G6X2_PHATR Length = 523 Score = 61.6 bits (148), Expect = 7e-08, Method: Compositional matrix adjust. Identities = 38/125 (30%), Positives = 65/125 (52%), Gaps = 3/125 (2%) Query: 216 NLVFLIDTSGSMISDERLPLIQSSLKLLVKELREQDNIAIVTYAGDSRIALP--SISGSH 273 +L+ ++D SGSM + +L L + +L +L++ L+ QD ++++ D+R+ P ++S + Sbjct: 69 DLIVVLDVSGSMTGN-KLKLCKKTLTMLLRVLQTQDRFGLISFGSDARVEFPAQAMSKQN 127 Query: 274 KAEINAAIDSLDAEGSTNGGAGLELAYQQATKGFIKGGINRILLATDGDFNVGIDDPKSI 333 KA I SL G TN A L LA Q+ + + TDG N GI D + Sbjct: 128 KASALQKIQSLTTRGCTNMSAALGLAVQELKIIEKSNPVRSLFFLTDGLANEGISDLDGL 187 Query: 334 ESMVK 338 S+ + Sbjct: 188 VSLTR 192 >UniRef50_A8J0D9 Flagellar associated protein n=1 Tax=Chlamydomonas reinhardtii RepID=A8J0D9_CHLRE Length = 4349 Score = 61.6 bits (148), Expect = 8e-08, Method: Compositional matrix adjust. Identities = 52/214 (24%), Positives = 92/214 (42%), Gaps = 22/214 (10%) Query: 217 LVFLIDTSGSMISDERLPLIQSSLKLLVKELREQDNIAIVTYAGDSRIALPSISGSHKAE 276 L ++D SGSM ER+ L++ + L+ +L D + IV+Y+ R +P + + +A Sbjct: 976 LTCVLDRSGSM-GGERIELVRETCHFLIDQLTADDYLGIVSYSNTVREDVPLLRMTPEAR 1034 Query: 277 --INAAIDSLDAEGSTNGGAGLELAYQQATKGFIKGG---------------INRILLAT 319 + I SL G T AGLE +Q + ++ L T Sbjct: 1035 RLAHTMISSLTLHGGTALYAGLEAGVKQQMAAASELKALAAAAGGGSDSSRIVHSCFLFT 1094 Query: 320 DGDFNVG---IDDPKSIESMVKKQRESGVTLSTFGVGNSNYNEAMMVRIADVGNGNYSYI 376 DG G +++ + ++ + +T+ TFG G+ + E ++ +A+ +G Y YI Sbjct: 1095 DGQATTGPCTVNEIMGQMTSLQSPADQNITVHTFGFGDDHSVE-LLQGVAEAQSGVYYYI 1153 Query: 377 DTLSEAQKVLNSEMRQMLITVAKDVKAQIEFNPA 410 + + +L VAKDV+ I P Sbjct: 1154 SCADDIPSGFGDALGGLLAVVAKDVRVSIRTKPG 1187 >UniRef50_A0EHG0 Chromosome undetermined scaffold_97, whole genome shotgun sequence n=2 Tax=Paramecium tetraurelia RepID=A0EHG0_PARTE Length = 533 Score = 61.6 bits (148), Expect = 8e-08, Method: Compositional matrix adjust. Identities = 59/257 (22%), Positives = 116/257 (45%), Gaps = 34/257 (13%) Query: 164 WDIKDKQSIPASKPIPFAMRYELAPAPWNEQRTLLKV-----DILAKDRKSEEL--PASN 216 +++KD SI AS + N+Q L + DIL +++ +E + Sbjct: 75 YNLKDNISIQAS-----------SHTLMNQQNAALMITIKSNDILLINQRGQECVRQGVD 123 Query: 217 LVFLIDTSGSMISDERLPLIQSSLKLLVKELREQDNIAIVTYAGDSRIA----LPSISGS 272 LV LID SGSM E++ L++ +LK ++ L+ D + ++ + D ++ L ++ Sbjct: 124 LVCLIDHSGSM-QGEKIKLVRKTLKQMLTFLQPCDRLCLIMF--DCKVYRLTRLMRVTQE 180 Query: 273 HKAEINAAIDSLDAEGSTNGGAGLELAYQQATKGFIKGGINRILLATDGDFNVGID---D 329 + + AI SL A G T+ G G+++A K ++ I L +D G+D + Sbjct: 181 NVQKFRVAISSLQARGGTDIGNGMKMALSILKHRKYKNPVSAIFLLSD-----GVDEGAE 235 Query: 330 PKSIESMVKKQRESGVTLSTFGVGNSNYNEAMMVRIADVGNGNYSYIDTLSEAQKVLNSE 389 + + +++ T+ TFG G + +M IA G + ++ L+ + Sbjct: 236 ERVRDDLIQYNIRDSFTIKTFGFG-RDCCPKIMSEIAHYKEGQFYFVPNLTNIDECFAEA 294 Query: 390 MRQMLITVAKDVKAQIE 406 + ++ VA V+ ++ Sbjct: 295 LGGLVSVVANHVQLSVQ 311 >UniRef50_Q1YZ74 Inter-alpha-trypsin inhibitor domain protein n=1 Tax=Photobacterium profundum 3TCK RepID=Q1YZ74_PHOPR Length = 714 Score = 61.6 bits (148), Expect = 8e-08, Method: Compositional matrix adjust. Identities = 56/217 (25%), Positives = 108/217 (49%), Gaps = 19/217 (8%) Query: 209 SEELPASNLVFLIDTSGSMISDERLPLIQSSLKLLVKELREQDNIAIVTYAGDSRIA--- 265 S L ++ F++D SGSM E + + +L+ +++L+ +D+ IVT+ ++ + Sbjct: 327 SSALFHQSVTFVLDISGSMYG-ESIEQAKQALRYGLQQLQPEDSFNIVTFNHEAMLYSEQ 385 Query: 266 LPSISGSHKAEINAAIDSLDAEGSTNGGAGLELAYQQATKGFIKGG--INRILLATDGDF 323 L ++ S +D LDA+G T A L+ A+ T + +N+I+ TDG Sbjct: 386 LLPVTSSTITRALRFVDGLDADGGTEMAAALKAAFSIKTHDQLNSTRWLNQIVFITDG-- 443 Query: 324 NVGIDDPKSIESMVKKQRESGVTLSTFGVGNSNYNEAMMVRIADVGNGNYSYIDTLSEAQ 383 + + ++ ++++Q L T G+G++ N M R A G G Y+YI + E Sbjct: 444 --SVGNESALFDLIEQQLVDR-RLFTVGIGSAP-NSYFMTRAAMKGKGTYTYIGDVKE-- 497 Query: 384 KVLNSEMRQMLITVAKDVKAQIEFNPAWVTEYRQIGY 420 +N++MR + +++ V I+ AW ++ R + Y Sbjct: 498 --VNTKMRLLFSKISQPVMRDIKL--AW-SDGRSVDY 529 >UniRef50_A6FXN3 Putative uncharacterized protein n=1 Tax=Plesiocystis pacifica SIR-1 RepID=A6FXN3_9DELT Length = 416 Score = 61.6 bits (148), Expect = 9e-08, Method: Compositional matrix adjust. Identities = 51/211 (24%), Positives = 98/211 (46%), Gaps = 24/211 (11%) Query: 217 LVFLIDTSGSMISDERLPLIQSSLKLLVKELREQDNIAIVTYAGDSRIALPS--ISGSHK 274 +V ++DTS SM D + +++ LV L E D+ A+V + + + +PS I+ + Sbjct: 1 MVLVVDTSASMKGDA-IEGAKAAAMELVDGLAEGDSFALVVFHSRAEVLMPSTVINEDSR 59 Query: 275 AEINAAIDSLDAEGSTNGGAGLE--LAYQQATKGFIKGG--------------INRILLA 318 A + I+++ A G+T+ GL+ LA Q + + G + R++L Sbjct: 60 AAARSKIETMQAWGTTDLAGGLQQALAQLQVAQNIVGAGGSTGAQSGAPDPTVLERVVLL 119 Query: 319 TDGDFNVGIDDPKSIESMVKKQRESGVTLSTFGVGNSNYNEAMMVRIADVGNGNYSYIDT 378 DG N D +I S V + G ++ G G Y+E ++ +A+ +G++ ++D Sbjct: 120 GDGVPN----DASTIPSTVGQLAARGTQITALGYGI-EYDETLLASLAEQTHGSFRFVDD 174 Query: 379 LSEAQKVLNSEMRQMLITVAKDVKAQIEFNP 409 + E+ + TVA D++ + P Sbjct: 175 PEAVASLFRDEVLDIERTVANDLRLSVGLGP 205 >UniRef50_B5JQC2 Vault protein inter-alpha-trypsin n=1 Tax=Verrucomicrobiae bacterium DG1235 RepID=B5JQC2_9BACT Length = 808 Score = 61.2 bits (147), Expect = 9e-08, Method: Compositional matrix adjust. Identities = 54/195 (27%), Positives = 93/195 (47%), Gaps = 21/195 (10%) Query: 214 ASNLVFLIDTSGSMISDERLPLIQSSLKLLVKELREQDNIAIVTY---AGDSRIALPSIS 270 ++ VF +D SGSM +L + S +K + +L+ +D +V + A D S + Sbjct: 281 GADFVFALDVSGSM--QGKLHTLASGVKKAIGQLKPEDRFRVVAFNNTAFDLNRGWVSAT 338 Query: 271 GSHKAEINAAIDSLDAEGSTNGGAGLELAYQQATKGFIKGGINRILLATDGDFNVGIDDP 330 ++ E A +D L++ G TN AG+ LA ++ + ++L TDG N GI DP Sbjct: 339 EANLRETFARLDQLNSNGGTNVYAGVHLALERLDADRVA----TLILVTDGVTNQGIVDP 394 Query: 331 KSIESMVKKQRESGVTLSTFGVGNSNYNEAMMVRIADVGNGNYSYI----DTLSEA---- 382 K+ ++ KQ + F +GNS+ N +M + D G+Y + D + E Sbjct: 395 KAFYKLMHKQD---LRFYGFLLGNSS-NWPLMQLMCDASGGSYRAVSNSDDIIGEVMIAK 450 Query: 383 QKVLNSEMRQMLITV 397 K++ MR I++ Sbjct: 451 NKIVYESMRHAEISI 465 >UniRef50_Q15NW6 Vault protein inter-alpha-trypsin n=1 Tax=Pseudoalteromonas atlantica T6c RepID=Q15NW6_PSEA6 Length = 701 Score = 60.1 bits (144), Expect = 2e-07, Method: Compositional matrix adjust. Identities = 53/210 (25%), Positives = 99/210 (47%), Gaps = 22/210 (10%) Query: 209 SEELPASNLVFLIDTSGSMISDERLPLIQSSLKLLVKELREQDNIAIVTYAGDS-----R 263 ++++P+ +VFL+DTSGSM + E + + ++ + +LR +DN+ I+ + D+ + Sbjct: 297 AQQMPSREVVFLLDTSGSM-AGESIVQAKRAVDFALTQLRPEDNVNIIQF-NDAPQALWK 354 Query: 264 IALPSISGSHKAEINAAIDSLDAEGSTNGGAGLELA------YQQATKGFIKGGINRILL 317 A+P+ + H + SL A+G T L LA ++ + + +++ Sbjct: 355 RAMPA-TAKHIQRARNWVASLHADGGTEMAPALTLALNKPSLHRDDSDLLGSHKLRQVVF 413 Query: 318 ATDGDFNVGIDDPKSIESMVKKQRESGVTLSTFGVGNSNYNEAMMVRIADVGNGNYSYID 377 TDG + IES + R L T G+G S N M + A G G ++YI Sbjct: 414 ITDGSVSNEDALMSLIESKLADNR-----LFTIGIG-SAPNSYFMTQAAQAGRGTFTYIG 467 Query: 378 TLSEAQKVLNSEMRQMLITVAKDVKAQIEF 407 + + Q + + ++ V +D+ IEF Sbjct: 468 DIQQVQHKMTALFNKLTRPVMQDI--HIEF 495 >UniRef50_Q09E12 Inter-alpha-inhibitor H4 heavy chain, putative n=1 Tax=Stigmatella aurantiaca DW4/3-1 RepID=Q09E12_STIAU Length = 540 Score = 60.1 bits (144), Expect = 2e-07, Method: Compositional matrix adjust. Identities = 48/183 (26%), Positives = 82/183 (44%), Gaps = 7/183 (3%) Query: 209 SEELPASNLVFLIDTSGSMISDERLPLIQSSLKLLVKELREQDNIAIVTYAGDSRIALPS 268 + E+ A + F+IDTSGSM R+ + + +LK V L QD +V ++ D P+ Sbjct: 64 ASEIAAKRVTFVIDTSGSM-QGSRMQIAKDALKYCVTRLNPQDTFNVVRFSTDVEALFPA 122 Query: 269 ISGSHKAEINAAI---DSLDAEGSTNGGAGLELAYQQATKGFIKGGINRILLATDGDFNV 325 + + I A+ + L+A G T L Q + ++ TDG + Sbjct: 123 LKSAQPENIQKAVAFVEQLEAIGGTAIDEALVRGLQDNDGK--SSAPHLLMFITDGQPTI 180 Query: 326 GIDDPKSIESMVKKQRESGVTLSTFGVGNSNYNEAMMVRIADVGNGNYSYIDTLSEAQKV 385 G D +I K R++ L TFGVG + N ++ R++ G G ++ E + Sbjct: 181 GETDEGAIAQHAKDGRKAKTRLFTFGVGE-DLNARLLDRLSSDGAGTSDFVRDGKEFETK 239 Query: 386 LNS 388 ++S Sbjct: 240 ISS 242 >UniRef50_Q8LQ58 Os01g0640200 protein n=9 Tax=Poaceae RepID=Q8LQ58_ORYSJ Length = 589 Score = 59.7 bits (143), Expect = 3e-07, Method: Compositional matrix adjust. Identities = 46/193 (23%), Positives = 90/193 (46%), Gaps = 12/193 (6%) Query: 216 NLVFLIDTSGSMISDERLPLIQSSLKLLVKELREQDNIAIVTYAGDSRIALP--SISGSH 273 +LV +ID SGSM D R+ ++++L+ ++++L + D + IVT+ ++ P ++ + Sbjct: 69 DLVAVIDVSGSMDGD-RIDKVKTALQFVIRKLSDLDRLCIVTFCTNATRLCPLRFVTAAA 127 Query: 274 KAEINAAIDSLDAEGSTNGGAGLELAYQQAT-KGFIKGGINRILLATDGDFNVGIDDPKS 332 +AE+ A +D L A G TN GLE + G ++L +DG N G D Sbjct: 128 QAELKALVDGLKAYGDTNMKGGLETGMSVVDGRSLAAGRAVSVMLMSDGYQNHGGD---- 183 Query: 333 IESMVKKQRESGVTLSTFGVGNSNYNEAMMVRIADVGNGNYSYIDTLSEAQKVLNSEMRQ 392 + V + TF G S+ + + G ++Y+ + + + Sbjct: 184 ----ARDVHLKNVPVYTFSFGASHDSNLLEAIARKSLGGTFNYVADSANLTGPFSQLLGG 239 Query: 393 MLITVAKDVKAQI 405 +L +A+D++ + Sbjct: 240 LLTIIAQDLELTV 252 >UniRef50_Q8H924 Os10g0464900 protein n=3 Tax=Oryza sativa RepID=Q8H924_ORYSJ Length = 646 Score = 59.7 bits (143), Expect = 3e-07, Method: Compositional matrix adjust. Identities = 47/179 (26%), Positives = 89/179 (49%), Gaps = 19/179 (10%) Query: 216 NLVFLIDTSGSMISDERLPLIQSSLKLLVKELREQDNIAIVTYA-GDSRIA-LPSISGSH 273 +LV ++D SGSM+ + +L L++ ++ ++ L D + +++++ G SR+ L ++ + Sbjct: 175 DLVTVLDVSGSMVGN-KLALLKQAMGFVIDNLGPGDRLCVISFSSGASRLMRLSRMTDAG 233 Query: 274 KAEINAAIDSLDAEGSTNGGAGLELAYQQATKGFIKGGINRILLATDG--DFNV----GI 327 KA A+ SL A G TN GA L A + + + ++L +DG + V G Sbjct: 234 KAHAKRAVGSLSARGGTNIGAALRKAAKVLDDRLYRNAVESVILLSDGQDTYTVPPRGGY 293 Query: 328 DDPKSIESMV---------KKQRESGVTLSTFGVGNSNYNEAMMVRIADVGNGNYSYID 377 D + +++V + TFG G +++ A M IA+V G +S+I+ Sbjct: 294 DRDANYDALVPPSLVRADAGGGGGRAPPVHTFGFGK-DHDAAAMHTIAEVTGGTFSFIE 351 >UniRef50_A0CDA1 Chromosome undetermined scaffold_17, whole genome shotgun sequence n=1 Tax=Paramecium tetraurelia RepID=A0CDA1_PARTE Length = 604 Score = 58.9 bits (141), Expect = 5e-07, Method: Compositional matrix adjust. Identities = 44/205 (21%), Positives = 100/205 (48%), Gaps = 12/205 (5%) Query: 206 DRKSEELPASNLVFLIDTSGSMISDERLPLIQSSLKLLVKELREQDNIAIVTYAGDSR-- 263 D++ +L+ +ID SGSM S E++ +++ +L +L+ L +D + ++ + + Sbjct: 176 DQQQHSKVGVDLLCVIDRSGSM-SGEKIEMVKQTLNILLNFLGPKDRLCLIQFDDTCQRL 234 Query: 264 IALPSISGSHKAEINAAIDSLDAEGSTNGGAGLELAYQQATKGFIKGGINRILLATDGDF 323 L ++ +K + I + A G T G G ++A +Q + I + +DG Sbjct: 235 TNLRRVTDENKTYYSDIISKIYANGGTVIGLGTQMALKQIKYRKSVNNVTAIFVLSDG-- 292 Query: 324 NVGIDDPKSIESMVKK--QRESGVTLSTFGVGNSNYNEAMMVRIADVGNGNYSYIDTLSE 381 D +I S+ K+ + +T+ +FG G S+++ +M +I+++G G++ +++ +S Sbjct: 293 ----QDEAAISSLQKQLAYYKQTLTIHSFGFG-SDHDAKLMTKISNLGKGSFYFVNNISL 347 Query: 382 AQKVLNSEMRQMLITVAKDVKAQIE 406 + + + V D+ +E Sbjct: 348 LDEFFVDALGALTSMVVTDISINLE 372 >UniRef50_A5UW94 von Willebrand factor, type A n=2 Tax=Roseiflexus RepID=A5UW94_ROSS1 Length = 452 Score = 58.5 bits (140), Expect = 6e-07, Method: Compositional matrix adjust. Identities = 54/227 (23%), Positives = 97/227 (42%), Gaps = 17/227 (7%) Query: 236 IQSSLKLLVKELREQDNIAIVTYAGDSRIALPSISGSHKAEINAAID---SLDAEGSTNG 292 + +L +V+ L D +++V +A + + +P + GS + + AI+ LD TN Sbjct: 109 VVHALHTVVERLDRNDRLSLVVFADHALLLIPGMVGSDRVTLVRAIERLPGLDLGDGTNL 168 Query: 293 GAGLELAYQQATKGFIKGGINRILLATDGDFNVGIDDPKSIESMVKKQRESGVTLSTFGV 352 G+ LA Q NR+LL TDG F DP + ++ + + + ++T G+ Sbjct: 169 ADGIALALNQIRANRDARRANRVLLLTDG-FT---RDPAACLTLADQAADEHIAITTIGL 224 Query: 353 GNSNYNEAMMVRIADVGNGNYSYIDTLSEAQKVLNSEMRQMLITVAKDVKAQIEFNPAWV 412 G + + ++ IAD GN ++ S + +++E+ V I P Sbjct: 225 GG-EFQDDLLTGIADRSGGNALFLKRASAIPRAISAELESARAAALPGV--DIAIAPMRG 281 Query: 413 TEYRQIGYEKRQLRV--EHFNNDNVDA-----GDIGAGKHITLLFEL 452 R++ + L + E D GD+ AG +TLL E Sbjct: 282 VMLRRVTRTRPVLAILAEPTGTGASDVVSVLLGDLPAGSPVTLLLEF 328 >UniRef50_A7RTF3 Predicted protein (Fragment) n=1 Tax=Nematostella vectensis RepID=A7RTF3_NEMVE Length = 756 Score = 58.5 bits (140), Expect = 7e-07, Method: Compositional matrix adjust. Identities = 51/219 (23%), Positives = 98/219 (44%), Gaps = 8/219 (3%) Query: 194 QRTLLKVDILAKDRKSEELPASNLVFLIDTSGSMISDERLPLIQSSLKLLVKELREQDNI 253 ++ L+ ++ + K E L +F+ID SGSM S +R+ + +L L +K L E + Sbjct: 252 EKPLVTLNFMPDFGKQEALETGEFIFVIDRSGSM-SGDRIKNARETLFLFLKSLPEHCHF 310 Query: 254 AIVTYAGDSRIALPSISGSHKAEINAAID-SLDAEGSTNGGAGLELAYQQATKGFIKGGI 312 +V + S + + +N A + + + E + G LE ++ IKG Sbjct: 311 NVVGFGSSYEKLFSSSTKYSDSSVNKACNHAKNLEANLGGTEILEPLKYVFSQPVIKGSP 370 Query: 313 NRILLATDGDFNVGIDDPKSIESMVKKQRESGVTLSTFGVGNSNYNEAMMVRIADVGNGN 372 ++ L TDG+ + + + + ++VKK TFG+G + A++ +A G G Sbjct: 371 RQVFLMTDGE----VGNTQQVITLVKKNSTHARCF-TFGIGQ-GASTALIKGVARAGQGT 424 Query: 373 YSYIDTLSEAQKVLNSEMRQMLITVAKDVKAQIEFNPAW 411 +I + + Q + +R L +D+K + W Sbjct: 425 AEFITSSHQMQAKVVKTLRNALQPSMEDIKVTWDLPHGW 463 >UniRef50_A0EFJ5 Chromosome undetermined scaffold_93, whole genome shotgun sequence n=2 Tax=Paramecium tetraurelia RepID=A0EFJ5_PARTE Length = 610 Score = 57.8 bits (138), Expect = 1e-06, Method: Compositional matrix adjust. Identities = 66/264 (25%), Positives = 125/264 (47%), Gaps = 30/264 (11%) Query: 145 LPPPDAVRVEEI----VNYFPSDWDIKDKQSIPASKPIPFAMRYELAPAPWNEQRTLLKV 200 + PDA++VE + +N P I Q S+ +P ++ + + +QR + Sbjct: 74 ITDPDALQVELLNSVHLNVLPRQKAI---QVQEYSQILPVVLQIQSLKSQLKKQRA--NI 128 Query: 201 DILAKDRKSEELPASNLVFLIDTSGSMISDERLPLIQSSLKLLVKELREQDNIAIVTYAG 260 D++ ++D SGSM + E++ L+Q+SL+ + K L+ D +A+VT+ Sbjct: 129 DLMC---------------VVDVSGSM-NGEKIKLVQNSLRYIQKILKPTDRLALVTFGT 172 Query: 261 DSRIALPSIS--GSHKAEINAAIDSLDAEGSTNGGAGLELAYQQATKGFIKGGINRILLA 318 + I L +K +I AI + STN +G+ L + K + + + Sbjct: 173 QAGINLQWTRNIAENKKKIKKAIKDIKIRDSTNIASGVALGLRMIRDRKFKNPVTSMFVL 232 Query: 319 TDG-DFNVGIDDPKSIESMVKKQRESGVTLSTFGVGNSNYNEAMMVRIADVGNGNYSYID 377 +DG D + G D + +++ + + +T++TFG G S+++ +M IA++ G + YID Sbjct: 233 SDGVDDDRGA-DLRCQQALHQYNIQDTLTINTFGYG-SDHDAKVMNNIANLKGGQFVYID 290 Query: 378 TLSEAQKVLNSEMRQMLITVAKDV 401 + + M ML AK+V Sbjct: 291 QIQRVSEHFILAMSGMLSVKAKNV 314 >UniRef50_B8HNU4 von Willebrand factor type A n=4 Tax=Chroococcales RepID=B8HNU4_CYAP4 Length = 421 Score = 57.8 bits (138), Expect = 1e-06, Method: Compositional matrix adjust. Identities = 49/194 (25%), Positives = 102/194 (52%), Gaps = 10/194 (5%) Query: 186 LAPAPWNEQRTLLKVDILAKDRKSEELPASNLVFLIDTSGSMISDERLPLIQSSLKLLVK 245 L+P ++++ + V +A+ P NL ++D SGSM + + L ++ + + LV Sbjct: 14 LSPTRVSQRQLEISVAAIAQASGERNAPL-NLGLILDHSGSM-AGQPLETVKRAAQKLVD 71 Query: 246 ELREQDNIAIVTYAGDSRIALPSISGSHKAEINAAIDSLDAEGSTNGGAGLELAYQQ--A 303 L D +A++ + +++ +P+ + + +I I L A G T GL+L + A Sbjct: 72 RLLPSDRLAVIVFDHVAKVLIPNQPVTDRDKIKTRISHLAAMGGTAIDEGLQLGLTELIA 131 Query: 304 TKGFIKGGINRILLATDGDFNVGIDDPKSIESMVKKQRESGVTLSTFGVGNSNYNEAMMV 363 K G I++I L TDG+ G ++ + ++ + ++ + +TL+T G G ++N+ ++ Sbjct: 132 AKA---GAISQIFLLTDGENEHG-NNSRCLQ-LAEEAAKENITLNTLGFG-YHWNQDVLE 185 Query: 364 RIADVGNGNYSYID 377 +IAD G+ +I+ Sbjct: 186 QIADAAGGSLMFIE 199 >UniRef50_A6CIG8 Putative uncharacterized protein n=1 Tax=Bacillus sp. SG-1 RepID=A6CIG8_9BACI Length = 931 Score = 57.8 bits (138), Expect = 1e-06, Method: Compositional matrix adjust. Identities = 57/224 (25%), Positives = 109/224 (48%), Gaps = 17/224 (7%) Query: 197 LLKVDILAKDRKSEELPASNLVFLIDTSGSMISDERLPLIQSSLKLLVKELREQDNIAIV 256 LL VD+ K +K ELP+ +V ++D SGSM + ++ L + + + LRE+D + + Sbjct: 392 LLPVDMDLKGKK--ELPSLGMVIVLDRSGSM-AGYKIQLAKEAAIRSAELLREKDTLGFI 448 Query: 257 TYAGDSRIALPSISGSHKAEINAAIDSLDAEGSTNGGAGLELAYQQATKGFIKGGINRIL 316 + + + K ++ I+ L + G TN LELAY+Q T ++ I+ Sbjct: 449 AFDDRPWQIIDTEPIKDKEKVIEKINGLTSGGGTNIFPSLELAYEQLTP--LELQRKHII 506 Query: 317 LATDGDFNVGIDDPKSIESMVKKQRESGVTLSTFGVGNSNYNEAMMVRIADVGNGNYSYI 376 L TDG D + +++ +E+ +TLST +G + + ++ ++D G G + + Sbjct: 507 LLTDGQSATSPD----YLTTIQEGKENNITLSTVAIGEGS-DSVLLEELSDEGGGRFYDV 561 Query: 377 DTLSEAQKVLNSEMRQMLITVAKDVKAQIEFNPAWVTEYRQIGY 420 + S +L+ R+ ++T + IE +P + T G+ Sbjct: 562 NDSSTIPSILS---RETVLT----TRTYIEDDPFYPTVIDASGF 598 >UniRef50_A0DZ93 Chromosome undetermined scaffold_7, whole genome shotgun sequence n=1 Tax=Paramecium tetraurelia RepID=A0DZ93_PARTE Length = 522 Score = 57.8 bits (138), Expect = 1e-06, Method: Compositional matrix adjust. Identities = 49/178 (27%), Positives = 87/178 (48%), Gaps = 16/178 (8%) Query: 216 NLVFLIDTSGSMISDERLPLIQSSLKLLVKELREQDNIAIVTYAGD----SRIALPSISG 271 +L+ LID SGSM S E++ L++ SLK L+K L+ D + ++ + +R+ + Sbjct: 113 DLICLIDHSGSM-SGEKMHLVKKSLKHLLKMLQPNDRLCLIEFDDQNYRLTRLMRATQEN 171 Query: 272 SHKAEINAAIDSLDAEGSTNGGAGLELAYQQATKGFIKGGINRILLATDGDFNVGIDD-- 329 +K I AID+++A G+T+ G +++A K I I L +DG+ D+ Sbjct: 172 MYKFLI--AIDTIEANGATDIGNAMKMALSILKHRRFKNPIASIFLLSDGE-----DEGA 224 Query: 330 -PKSIESMVKKQRESGVTLSTFGVGNSNYNEAMMVRIADVGNGNYSYIDTLSEAQKVL 386 + + K + T++TFG G + +M IA G + YI +S+ + Sbjct: 225 AGRVWNDIQSKNIKEPFTINTFGFG-RDCCPKIMSEIAHFKEGQFYYISEISKIDECF 281 >UniRef50_B0CG18 von Willebrand factor type A domain protein, putative n=5 Tax=Cyanobacteria RepID=B0CG18_ACAM1 Length = 708 Score = 57.4 bits (137), Expect = 1e-06, Method: Compositional matrix adjust. Identities = 62/218 (28%), Positives = 99/218 (45%), Gaps = 25/218 (11%) Query: 202 ILAKDRKSEELPASNLVFLIDTSGSMISDERLPLIQSS--LKLLVKELREQDNIAIVTYA 259 I A KS ++ ++VFLIDTSGS P++QS + + +L D +I+ ++ Sbjct: 328 IPALKYKSNQIVPKDVVFLIDTSGSQSGP---PIVQSRKLMTQFLDKLNPNDTFSIINFS 384 Query: 260 GDSRIALPSISGSHKAEINAA---IDSLDAEGSTN--GGAGLELAYQQATKGFIKGGINR 314 + P + A A I LDA G T G A+ A G ++ Sbjct: 385 NTTSKLSPKPLANTPANRKKALEYIKKLDANGGTELMNGINTVAAFPPAPDGRLRS---- 440 Query: 315 ILLATDGDFNVGIDDPKSIESMVKKQRESGVTLSTFGVGNSNYNEAMMVRIADVGNGNYS 374 ++L TDG I D ++I + V+ + + G + FGVG S N ++ R+A+VG G Sbjct: 441 VVLLTDGL----IGDDETIIAAVRDRLKPGNRIYPFGVGFST-NRFLLDRLAEVGRGTVE 495 Query: 375 YIDTLSEAQKVLNSEMRQMLITVAKDVKAQIEFNPAWV 412 + A+KV + + T+ K V IE +WV Sbjct: 496 VVAPKDSAEKV----AAKFVQTINKPVLTDIEV--SWV 527 >UniRef50_B4VT64 Vault protein inter-alpha-trypsin n=9 Tax=Cyanobacteria RepID=B4VT64_9CYAN Length = 1037 Score = 57.4 bits (137), Expect = 2e-06, Method: Compositional matrix adjust. Identities = 68/247 (27%), Positives = 122/247 (49%), Gaps = 38/247 (15%) Query: 165 DIKDKQSIPASKPIPFAMRYELAPAPWNEQRTLL-KVD----------ILAKDRKSEELP 213 ++ D+++IP I +RY++A A + Q T+L + D I A + + E+ Sbjct: 609 ELADQETIPNKDLI---LRYQVAGA--DTQATVLTQADERGGHFATYLIPAIEYQQNEIV 663 Query: 214 ASNLVFLIDTSGSMISDERLPLIQSS--LKLLVKELREQDNIAIVTYAGD----SRIALP 267 ++VFL+DTSGS P++QS ++ ++ L QD I+ +A S L Sbjct: 664 PKDVVFLVDTSGSQSGS---PIVQSKELMRQFIQGLNPQDTFTIIDFANSTTQLSDKPLA 720 Query: 268 SISGSHKAEINAAIDSLDAEGSTNGGAGLE--LAYQQATKGFIKGGINRILLATDGDFNV 325 + + K +N I+ LDA G T G++ L + A G ++ ++L TDG Sbjct: 721 NTPQNRKKALN-YINRLDANGGTELMNGIDTVLNFPAAPAGRLRS----VVLLTDGL--- 772 Query: 326 GIDDPKSIESMVKKQRESGVTLSTFGVGNSNYNEAMMVRIADVGNGNYSYIDTLSEAQKV 385 I D + I + ++ + + G L +FGVG+S N ++ R+A++G G + +E+ +V Sbjct: 773 -IGDDEQIIAEIRDRLKPGNRLYSFGVGSST-NRFLIERLAELGRGTAEVVPP-NESAEV 829 Query: 386 LNSEMRQ 392 + E Q Sbjct: 830 VAQEFFQ 836 >UniRef50_C7NQ34 von Willebrand factor type A n=1 Tax=Halorhabdus utahensis DSM 12940 RepID=C7NQ34_HALUD Length = 1100 Score = 57.4 bits (137), Expect = 2e-06, Method: Compositional matrix adjust. Identities = 58/172 (33%), Positives = 91/172 (52%), Gaps = 9/172 (5%) Query: 216 NLVFLIDTSGSMISDERLPLIQSSLKLLVKELREQDNIAIVTYAGDSRIALPSISGSHKA 275 +L F+ID SGSM R+ ++S K V L E D A+V++AG + + S++ H A Sbjct: 514 DLAFVIDESGSM-GGARIQDAKASAKRFVGGLYEDDRAALVSFAGGATLG-QSLTTDHGA 571 Query: 276 EINAAIDSLDAEGSTNGGAGLELAYQQATKGFIKGGINRILLATDGDFNVGIDDPKSIES 335 +NA+ID L+A G TN GAGL+ A + T +G I+L DG +G DP +I Sbjct: 572 -VNASIDQLNAGGGTNTGAGLQKAVDELTSNG-EGDTQEIILLADGGTGLG-PDPVTIAQ 628 Query: 336 MVKKQRESGVTLSTFGVGNSNYNEAMMVRIADVGNGNYSYIDTLSEAQKVLN 387 + R +T++T G+G + + + IAD G + + SE +V + Sbjct: 629 TADEHR---ITINTIGMG-TGIDAQELTSIADATGGEFYQVSDSSELPEVFD 676 >UniRef50_A0C051 Chromosome undetermined scaffold_14, whole genome shotgun sequence n=3 Tax=Paramecium tetraurelia RepID=A0C051_PARTE Length = 636 Score = 57.4 bits (137), Expect = 2e-06, Method: Compositional matrix adjust. Identities = 50/218 (22%), Positives = 106/218 (48%), Gaps = 33/218 (15%) Query: 183 RYELAPAPWNEQRTLLKVDILAKDRKSEELPA---------------------SNLVFLI 221 +Y+L + E R+L K+ L R ++ LP +L+ LI Sbjct: 108 KYDLNEKLYFEVRSLYKMGKLLNSR-TQYLPGIVSIKALDQAVTQNQKNQRVGVDLICLI 166 Query: 222 DTSGSMISDERLPLIQSSLKLLVKELREQDNIAIVTYAGDSRIALP--SISGSHKAEINA 279 D SGSMI ++ ++++SL +L++ L + D + ++T+ D+ P +++ +K+ Sbjct: 167 DISGSMIG-VKIEMVKASLIVLLQFLGDNDRLQLITFDNDAHRLTPLKTVTNQNKSYFTQ 225 Query: 280 AIDSLDAEGSTNGGAGLELA-YQQATKGFIKGGINRILLATDGDFNVGIDDPKSIESMVK 338 I + A G ++A YQ ++ +I + + L +DG V P+ +++ ++ Sbjct: 226 IIKQIKANGGNRISEATKMAFYQLKSRKYI-NNVTSVFLLSDG---VDYTYPE-VKNQIQ 280 Query: 339 KQRESGVTLSTFGVGNSNYNEAMMVRIADVGNGNYSYI 376 E TL TFG G +++ MM ++ ++ +G++ ++ Sbjct: 281 TVNEV-FTLHTFGFG-EDHDAQMMTQLCNLKSGSFYFV 316 >UniRef50_D0LR75 Putative uncharacterized protein n=1 Tax=Haliangium ochraceum DSM 14365 RepID=D0LR75_HALO1 Length = 816 Score = 57.4 bits (137), Expect = 2e-06, Method: Compositional matrix adjust. Identities = 56/226 (24%), Positives = 103/226 (45%), Gaps = 21/226 (9%) Query: 197 LLKVDILAKDRKSEELPASNLVFLIDTSGSMISDERLPLIQSSLKLLVKELREQDNIAIV 256 L V++ E+ P +L LID + + ER L + SL L LR D + ++ Sbjct: 466 LWGVNLRGPQVSDEDRPGLSLTVLIDET--LPGPER-ELTRLSLDTLATALRPDDRVNVI 522 Query: 257 TYAGDSRIALPSISGSHKAE-----INAAIDSLDAEGSTNGGAGLELAYQQATKGFIKGG 311 T+AG+ +I L ++S KAE + G + L AY+ A + Sbjct: 523 TFAGNPKIELENVSLPTKAEGESELVRLGRRLRGRAGMFDLDGALATAYKVARRHQRSER 582 Query: 312 INRILLATDGDFNVGIDDPKSIESMVKKQRESG-------VTLSTFGVGNS---NYNEAM 361 +++L+ TD D P +E++V++ SG L V S N +E + Sbjct: 583 WSQVLVLTDSDIGT---LPTLLETIVEESLTSGSSRGIRWTVLGLHAVDTSVQGNRDEQV 639 Query: 362 MVRIADVGNGNYSYIDTLSEAQKVLNSEMRQMLITVAKDVKAQIEF 407 + A +G G+Y + T ++AQ++ + + +L A++++ ++EF Sbjct: 640 LRPFAYLGKGSYFTVKTNADAQRLFAAPLTGLLEPAARNIRFRLEF 685 >UniRef50_UPI000174639B hypothetical protein VspiD_18615 n=1 Tax=Verrucomicrobium spinosum DSM 4136 RepID=UPI000174639B Length = 868 Score = 57.0 bits (136), Expect = 2e-06, Method: Compositional matrix adjust. Identities = 49/184 (26%), Positives = 89/184 (48%), Gaps = 16/184 (8%) Query: 197 LLKVDILAKDRKSEELPASNLVFLIDTSGSMISDERLPLIQSSLKLLVKELREQDNIAIV 256 +L V + A D EE +S L +ID SGSM S E+L + +S+ + L D+I + Sbjct: 397 VLPVRLKAPDE--EEKQSSALALVIDRSGSM-SGEKLEMAKSAAIATAEVLTRNDSIGVY 453 Query: 257 TYAGDSRIALPSISGSHKAEINAAIDSLDAEGSTNGGAGLELAYQQATKGF--IKGGINR 314 + ++ + +P + + + I L + G TN L A+ +A K I Sbjct: 454 AFDSEAHVVVPMTRLTSSSAVAGQIAGLTSGGGTN----LHPAFTEARNALQRTKAKIKH 509 Query: 315 ILLATDGDFNVGIDDPKSIESMVKKQRESGVTLSTFGVGNSNYNEAMMVRIADVGNG-NY 373 +++ TDG + + E++ + R GVT+ST +G+ + ++ IA +G G +Y Sbjct: 510 MIILTDGQTS-----GQGYEALASQCRAEGVTISTVAIGDGAHV-GLLQAIASLGGGKSY 563 Query: 374 SYID 377 + +D Sbjct: 564 TTLD 567 >UniRef50_Q9FF49 Retroelement pol polyprotein-like n=34 Tax=Magnoliophyta RepID=Q9FF49_ARATH Length = 704 Score = 56.6 bits (135), Expect = 2e-06, Method: Compositional matrix adjust. Identities = 44/199 (22%), Positives = 99/199 (49%), Gaps = 10/199 (5%) Query: 216 NLVFLIDTSGSMISDERLPLIQSSLKLLVKELREQDNIAIVTYAGDSRIALPS--ISGSH 273 +LV ++D SGSM + +L L++ ++ +++ L D +++++++ +R P ++ + Sbjct: 252 DLVTVLDVSGSM-AGTKLALLKRAMGFVIQNLGPFDRLSVISFSSTARRNFPLRLMTETG 310 Query: 274 KAEINAAIDSLDAEGSTNGGAGLELAYQQATKGFIKGGINRILLATDGDFNVGIDDPKSI 333 K E A++SL + G TN GL+ + K ++ I+L +DG + P Sbjct: 311 KQEALQAVNSLVSNGGTNIAEGLKKGARVLIDRRFKNPVSSIVLLSDGQDTYTMTSPNGS 370 Query: 334 ES------MVKKQRESGVTLSTFGVGNSNYNEAMMVRIADVGNGNYSYIDTLSEAQKVLN 387 + K+ + + + FG G ++++ ++M IA+ G +S+I++ + Q Sbjct: 371 RGTDYKALLPKEINGNRIPVHAFGFG-ADHDASLMHSIAENSGGTFSFIESETVIQDAFA 429 Query: 388 SEMRQMLITVAKDVKAQIE 406 + +L V +++ IE Sbjct: 430 QCIGGLLSVVVQELCVTIE 448 >UniRef50_B8BII0 Putative uncharacterized protein n=1 Tax=Oryza sativa Indica Group RepID=B8BII0_ORYSI Length = 585 Score = 56.6 bits (135), Expect = 2e-06, Method: Compositional matrix adjust. Identities = 68/244 (27%), Positives = 104/244 (42%), Gaps = 47/244 (19%) Query: 190 PWNEQRTLLKVDI-LAKDRKSEELPASNLVFLIDTSGSMISDE------RLPLIQSSLKL 242 P NE+R V + + K+E P +LV ++D SGSM RL L++ ++K+ Sbjct: 21 PSNEERKEWPVLVHVVAPAKTERFPI-DLVAVLDVSGSMTKATSMHGWTRLDLVKGAMKM 79 Query: 243 LVKELREQDNIAIVTY------AGDSRIALPSISGSHKAEINAAIDSLDAEGSTNGGAGL 296 + +L D +AIV + AG +R+ + G +A+ NA ++ L A G T L Sbjct: 80 VTNKLGAGDRLAIVPFNGKVVAAGATRLMEMTTKG--RADANAKVNQLKAGGDTKFLPAL 137 Query: 297 ELAY----------QQATKGFIKGGINRILLATDGDFNVGIDDPKSIESMVKKQRESGVT 346 + A +Q GFI L +DG N +DD + GV Sbjct: 138 KHASGLLDSRPAGDKQYRPGFI-------FLLSDGQDNGVLDD-----------KLGGVR 179 Query: 347 L--STFGVGNSNYNEAMMVRIADVGNGNYSYI-DTLSEAQKVLNSEMRQMLITVAKDVKA 403 TFG+ S N MV IA G+Y I D LS + L + + VA + + Sbjct: 180 YPAHTFGMCQSRCNPKSMVHIATATKGSYHPIDDKLSNVAQALAVFLSGITSAVAVNARV 239 Query: 404 QIEF 407 Q+ Sbjct: 240 QLHV 243 >UniRef50_C5FLY1 U-box domain containing protein n=1 Tax=Microsporum canis CBS 113480 RepID=C5FLY1_NANOT Length = 748 Score = 56.6 bits (135), Expect = 3e-06, Method: Compositional matrix adjust. Identities = 48/185 (25%), Positives = 88/185 (47%), Gaps = 24/185 (12%) Query: 215 SNLVFLIDTSGSMISDERLP---------------LIQSSLKLLVKELREQDNIAIVTYA 259 ++V +ID S SM S +P L + + K +++ L E D +A+VT+ Sbjct: 70 CDIVLVIDISASMNSAAPIPTGESGGEDTGLSILDLTKHAAKTIIQTLNENDRLAVVTFC 129 Query: 260 GDSRIA--LPSISGSHKAEINAAIDSLDAEGSTNGGAGLELAYQQATKGFIKGGINRILL 317 + R+A L +S +K+++ AAID L STN G++ + +G + +L+ Sbjct: 130 TEIRVAFELEFMSEENKSKVLAAIDCLHGISSTNLWHGIKEGLKVLATNSTQGNVQALLV 189 Query: 318 ATDGDFNVGIDD----PKSIESMVKKQRESGV--TLSTFGVGNSNYNEAMMVRIADVGNG 371 TDG N PK ++++ + +G + TFG G ++ IA++G G Sbjct: 190 LTDGAPNHMCPAQGYVPKLRQTLLDHRDLTGSLPLIHTFGFG-YYLRSPLLQSIAEIGGG 248 Query: 372 NYSYI 376 +++I Sbjct: 249 TFAFI 253 >UniRef50_A6DLI0 Putative uncharacterized protein n=1 Tax=Lentisphaera araneosa HTCC2155 RepID=A6DLI0_9BACT Length = 833 Score = 56.6 bits (135), Expect = 3e-06, Method: Compositional matrix adjust. Identities = 45/188 (23%), Positives = 87/188 (46%), Gaps = 9/188 (4%) Query: 202 ILAKDRKSEELPASNLVFLIDTSGSMISDERLPLIQSSLKLLVKELREQDNIAIVTYAGD 261 + ++ K +E P+ LV +ID SGSM + + + L + + K + L +D + ++ + G Sbjct: 399 VTSRYEKEKEQPSLALVLVIDKSGSM-NGQPIVLAREASKAAAELLSSRDQVGVIAFDGS 457 Query: 262 SRIALPSISGSHKAEINAAIDSLDAEGSTNGGAGLELAYQQATKGFIKGGINRILLATDG 321 +++ S ++K E+ + ID + A G TN + + G I +++ +DG Sbjct: 458 AKLVTDLTSAANKGEVLSQIDGIGAGGGTNLYPAMVMGRDML--GIASAKIKHMIVLSDG 515 Query: 322 DFNVGIDDPKSIESMVKKQRESGVTLSTFGVGNSNYNEAMMVRIADVGNGNYSYIDTLSE 381 G E + + + GVT+ST +G + +M IA +GNG + E Sbjct: 516 QSQGG-----DFEGISSELAQMGVTISTVSLGQGAAVD-LMAAIAQIGNGRAYVTNNAEE 569 Query: 382 AQKVLNSE 389 ++ E Sbjct: 570 MPRIFTKE 577 >UniRef50_B1KDL1 LPXTG-motif cell wall anchor domain protein n=1 Tax=Shewanella woodyi ATCC 51908 RepID=B1KDL1_SHEWM Length = 739 Score = 56.2 bits (134), Expect = 3e-06, Method: Compositional matrix adjust. Identities = 52/194 (26%), Positives = 92/194 (47%), Gaps = 14/194 (7%) Query: 206 DRKSEELPASNLVFLIDTSGSMISDERLPLIQSSLKLLVKELREQDNIAIVTYAGD---- 261 D+K + + L+ +IDTSGSM S + + +L + L+ +D ++ + + Sbjct: 354 DQKQDVSISRELILVIDTSGSM-SGASIAQAKRALNYALAGLKAKDTFNVIEFNSNVGSL 412 Query: 262 SRIALPSISGSHKAEINAAIDSLDAEGSTNGGAGLELAYQQATKGFIKGG--INRILLAT 319 S +LP+ + + N + SL A G T L A + T+ G + ++L T Sbjct: 413 SPYSLPA-TAKNIGLANQYVRSLKANGGTEMQLALNAALDKGTETEALGSERLRQVLFMT 471 Query: 320 DGDFNVGIDDPKSIESMVKKQRESGVTLSTFGVGNSNYNEAMMVRIADVGNGNYSYIDTL 379 DG + D +S+ ++K Q+ L T G+G++ N M R A+ G G ++YI L Sbjct: 472 DGS----VGDEQSLFHLIK-QKIGESRLFTLGIGSAP-NSHFMRRAAEFGRGTFTYIGKL 525 Query: 380 SEAQKVLNSEMRQM 393 E Q + S + Q+ Sbjct: 526 DEVQSKIESLLYQI 539 >UniRef50_Q113J8 Putative uncharacterized protein n=1 Tax=Trichodesmium erythraeum IMS101 RepID=Q113J8_TRIEI Length = 92 Score = 56.2 bits (134), Expect = 4e-06, Method: Compositional matrix adjust. Identities = 30/81 (37%), Positives = 46/81 (56%), Gaps = 3/81 (3%) Query: 488 WKYPQGKESQLVEFP---LGPTINAPSEDMRFRAAVAAYGQKLRGSEYLNNTSWQQIKQW 544 +K GK S+L++ P G + N S +F AAVA YG LR SE + S ++ + Sbjct: 3 YKEQNGKNSELIQQPDVNKGVSFNNASNGFKFAAAVAEYGMILRDSENQIDASLNKVLKL 62 Query: 545 AQQAKGEDPQGYRAEFIRLIE 565 A ++ G D + YR+EFI ++E Sbjct: 63 ANESNGLDLESYRSEFINMVE 83 >UniRef50_A6C9I8 BatB n=2 Tax=Planctomycetaceae RepID=A6C9I8_9PLAN Length = 798 Score = 55.8 bits (133), Expect = 4e-06, Method: Compositional matrix adjust. Identities = 42/167 (25%), Positives = 87/167 (52%), Gaps = 17/167 (10%) Query: 217 LVFLIDTSGSMISDE----RLPLIQSSLKLLVKELREQDNIAIVTYAGDSRIALPSISGS 272 ++FL+D S SM++++ RL + +K +V E+ D + +V +AG++R ++P S Sbjct: 93 VMFLLDVSRSMLAEDVSPSRLDRAKQQIKDMVDEM-SGDRVGLVVFAGETRQSVPLTS-- 149 Query: 273 HKAEINAAIDSLDAEGSTNGGAGLELAYQQATKGFIKGGINR--ILLATDGDFNVGIDDP 330 H + ++D++ GG+ L A + AT GFI + I++ TDG+ + Sbjct: 150 HYEDFKQSLDAVGPHSVRRGGSLLGDAIRSATAGFIDKTNDHKAIVVFTDGEDQ----ES 205 Query: 331 KSIESMVKKQRESGVTLSTFGVGNSNYNEAMMVRIADVGNGNYSYID 377 K +E+ + ++G+ + T G+G+ + RI + G ++++ Sbjct: 206 KPVEAAKEAFTKNGIRIFTVGLGDMDQG----ARIPETEQGGQAFVE 248 >UniRef50_C5YHY2 Putative uncharacterized protein Sb07g005010 n=2 Tax=Sorghum bicolor RepID=C5YHY2_SORBI Length = 567 Score = 55.5 bits (132), Expect = 5e-06, Method: Compositional matrix adjust. Identities = 80/363 (22%), Positives = 153/363 (42%), Gaps = 32/363 (8%) Query: 216 NLVFLIDTSGSMISDERLPLIQSSLKLLVKELREQDNIAIVTYAGD-SRIA-LPSISGSH 273 +LV ++D S SM E+L L++ ++ ++ +L D +++VT++ D SR+ L +S + Sbjct: 81 DLVTVLDVSDSM-KGEKLALLKQAMCFVIDQLGPADRLSVVTFSNDASRLTRLARMSDAG 139 Query: 274 KAEINAAIDSLDAEGSTNGGAGLELAYQQATKGFIKGGINRILLATDGDFNVG-----ID 328 KA A++SL +G TN G+ +A + K + ++L +DG N G D Sbjct: 140 KASAKIAVESLAVQGFTNIKQGIHVAAEVLAGRREKNVVAGMILLSDGHDNCGGTSVRPD 199 Query: 329 DPKSIESMV-------KKQRESGVTLSTFGVGNSNYNEAMMVRIADVGNGNYSYIDTLSE 381 KS ++V + TFG G S+ AM +A+ G +S++ + Sbjct: 200 GTKSYVNLVPPSLTVAAGSSRPAAPIHTFGFGTSHDAGAMHA-VAEATGGTFSFVGDEAA 258 Query: 382 AQKVLNSEMRQMLITVAKDVKAQIEFNPAWVTEYRQIGYEKRQLRVEHFNND----NVDA 437 Q + +L ++ + + V +Q+ K V H D +D Sbjct: 259 IQDSFARCVGGLLSVAVQEARVAVTCLHRGV-HVQQV---KSGAYVSHVGADGHAATIDV 314 Query: 438 GDIGAGKHITLLFELTLNGQKASIDKLRYAPDNKLAKSDKTKELAWLKIRWKYPQGKESQ 497 G++ G+ L + + +++ + R + + T + A + + Sbjct: 315 GELYDGEERRFLVLVHVPRARSTEEVTRLIKASCTYREAATGQAARKVAAPAAVVQRPLE 374 Query: 498 LVEFPLGPTINAPSEDMRFRAA-------VAAYGQKLRGSEYLNNTSWQQIKQWAQQAKG 550 L P P+++ E +R AA AA G + G+ + + + ++Q A A G Sbjct: 375 LATLP-APSLDVERERVRLAAAEDIAAARTAADGGQNAGAARILESRLKAVEQSAPGAAG 433 Query: 551 EDP 553 DP Sbjct: 434 NDP 436 >UniRef50_UPI0000E105CF vault protein inter-alpha-trypsin n=1 Tax=Glaciecola sp. HTCC2999 RepID=UPI0000E105CF Length = 757 Score = 55.1 bits (131), Expect = 7e-06, Method: Compositional matrix adjust. Identities = 57/188 (30%), Positives = 87/188 (46%), Gaps = 17/188 (9%) Query: 216 NLVFLIDTSGSMISDERLPLIQSSLKLLVKELREQDNIAIVTYAGDSRIALPSISG---- 271 N +F++D+SGSM I + ++ V L E D IV + ++R AL S Sbjct: 391 NTIFVLDSSGSMHGTALTQAIDA-IREGVSYLTEHDTFNIVDFDSEAR-ALWRQSQFADE 448 Query: 272 SHKAEINAAIDSLDAEGSTNGGAGLELAYQQATKGFIKGGINRILLATDGDFNVGIDDPK 331 KAE + +D++G TN L L+ Q G+ +++ TDG N + K Sbjct: 449 VSKAEAMRFLRHVDSDGGTNMQDALALSLTQLLDS--STGLTQVIFVTDGSINNERELLK 506 Query: 332 SIESMVKKQRESGVTLSTFGVGNSNYNEAMMVRIADVGNGNYSYIDTLSEAQ---KVLNS 388 I + +R L T G+G + N M A +G G Y+YID L+E Q L S Sbjct: 507 QIAEQLGDKR-----LFTVGIGAAP-NSHFMEYAAMLGKGTYTYIDDLTEIQPKMAYLFS 560 Query: 389 EMRQMLIT 396 ++R +IT Sbjct: 561 QLRSPMIT 568 >UniRef50_B4VXM6 Vault protein inter-alpha-trypsin n=1 Tax=Microcoleus chthonoplastes PCC 7420 RepID=B4VXM6_9CYAN Length = 928 Score = 55.1 bits (131), Expect = 8e-06, Method: Compositional matrix adjust. Identities = 53/176 (30%), Positives = 83/176 (47%), Gaps = 17/176 (9%) Query: 202 ILAKDRKSEELPASNLVFLIDTSGSMISDERLPLIQSSLKLLVKELREQDNIAIVTYAGD 261 I A + +L ++VFLIDTSGS S E L Q ++ + L D I+ ++ Sbjct: 413 IPAIEYNPHQLVPKDVVFLIDTSGSQ-SGEPLNKCQELMRRFINGLNPHDTFTIIDFSDT 471 Query: 262 SRIALPSISGSHKAEINAA---IDSLDAEGSTNGGAGLELAYQQATKGFIK---GGINRI 315 +R P + N+A I+ L+A G T G+ QA F + G + I Sbjct: 472 TRQLSPVPLANTVQNRNSAMNYINQLNASGGTQLRRGI-----QAVLNFPEVDPGRLRSI 526 Query: 316 LLATDGDFNVGIDDPKSIESMVKKQRESGVTLSTFGVGNSNYNEAMMVRIADVGNG 371 +L TDG I + I + V++ + G L +FG G S+ N ++ RIA++G G Sbjct: 527 VLLTDG----YIGNENQILAEVQRHLKLGNRLHSFGAG-SSVNRFLLNRIAEIGRG 577 >UniRef50_B2AQN8 Predicted CDS Pa_4_3600 n=1 Tax=Podospora anserina RepID=B2AQN8_PODAN Length = 648 Score = 54.7 bits (130), Expect = 1e-05, Method: Compositional matrix adjust. Identities = 64/254 (25%), Positives = 108/254 (42%), Gaps = 54/254 (21%) Query: 192 NEQRTLLKVDI-----LAKDRKSEELPASNLVFLIDTSGSMISDERLP------------ 234 E L+K+D L R+ +P +LV ID SGSM +D +P Sbjct: 44 TEDGVLIKIDPPKEPELEDLRERNHVPL-DLVLSIDVSGSMGADAPVPAKNGTEGEHYGL 102 Query: 235 ----LIQSSLKLLVKELREQDNIAIVTYAGDSRIA--LPSISGSHKAEINAAIDSLDAEG 288 L++ + K +++ L + D + IVT++ S++ L ++ ++KA+I +D+L Sbjct: 103 SVLDLVRHAAKTILETLDDHDRLGIVTFSTSSKVVRELTYMTPANKAKILKQLDALQPLS 162 Query: 289 STNGGAGLE---------LAYQQATKGFIKGGINRILLATDGDFNVGIDDPKSIESMVKK 339 TN G+ L + G + +L+ TDG N + + V K Sbjct: 163 MTNLWHGIRDGLSLFNNNLKAVNDRRNPGSGRVPALLVLTDGMPNHQCPN----QGYVAK 218 Query: 340 QRESGV---TLSTFGVGNSNYNEAMMVRIADVGNGNYSYIDTLSEAQKVLNSEMRQMLIT 396 R+ ++ TFG G S ++ IA+VG GNYS+I +IT Sbjct: 219 LRQWSTLPASIHTFGFGYS-LRSGLLKSIAEVGGGNYSFIPDAG-------------MIT 264 Query: 397 VAKDVKAQIEFNPA 410 V+ Q F+P+ Sbjct: 265 TGDAVEKQQPFSPS 278 >UniRef50_Q6ZFR4 Zinc finger (C3HC4-type RING finger) protein family-like n=8 Tax=Oryza sativa RepID=Q6ZFR4_ORYSJ Length = 703 Score = 54.7 bits (130), Expect = 1e-05, Method: Compositional matrix adjust. Identities = 49/174 (28%), Positives = 92/174 (52%), Gaps = 18/174 (10%) Query: 216 NLVFLIDTSGSMISDERLPLIQSSLKLLVKELREQDNIAIVTYAGDSR--IALPSISGSH 273 +LV ++D SGSM +L L++ ++ LL D +A+V+++ +R I L +S Sbjct: 270 DLVTVLDVSGSM-EGYKLALLKRAMGLL----GPGDRLAVVSFSYSARRVIRLTRMSEGG 324 Query: 274 KAEINAAIDSLDAEGSTNGGAGLELAYQQATKGFIKGGINRILLATDG--DFNV----GI 327 KA +A++SL A+G TN GL A + + + ++L +DG ++NV G Sbjct: 325 KASAKSAVESLHADGCTNILEGLVEAAKVFDGRRYRNAVASVILLSDGQDNYNVNGGWGA 384 Query: 328 DDPKSIESMV----KKQRESGVTLSTFGVGNSNYNEAMMVRIADVGNGNYSYID 377 + K+ +V K+ + + + TFG G ++++ + M IA+ G +S+I+ Sbjct: 385 SNSKNYSVLVPPSFKRSGDRRLPVHTFGFG-TDHDASAMHTIAEETGGTFSFIE 437 >UniRef50_C7P2A9 von Willebrand factor type A n=2 Tax=Halobacteriaceae RepID=C7P2A9_HALMD Length = 393 Score = 54.7 bits (130), Expect = 1e-05, Method: Compositional matrix adjust. Identities = 45/198 (22%), Positives = 91/198 (45%), Gaps = 7/198 (3%) Query: 216 NLVFLIDTSGSMISDERLPLIQSSLKLLVKELREQDNIAIVTYAGDSRIALPSISGSHKA 275 ++ IDTSGSM D + + + L ++D ++IV + ++ + LP+ S Sbjct: 38 HIALCIDTSGSMEGD-NIKRARDGAAWVFGLLADEDYVSIVAFDTEATVILPATRWSDLD 96 Query: 276 EINAA--IDSLDAEGSTNGGAGLELAYQQ-ATKGFIKGGINRILLATDGDFNVGIDDPKS 332 A ++ L A G T+ GL+ A + ++ + R+LL +DG N P Sbjct: 97 RQTAMDHVEELTAGGGTDMYNGLKAAKETLSSSATGPDTVKRLLLLSDGKDNERT--PDE 154 Query: 333 IESMVKKQRESGVTLSTFGVGNSNYNEAMMVRIADVGNGNYSYIDTLSEAQKVLNSEMRQ 392 E + + ++G+ + + G+G ++YNEA + + G G +++++ + + + Q Sbjct: 155 FEGLAEAIDDAGIRIQSAGIG-TDYNEATIRTLGTAGRGTWTHLEAPGDIEDFFGEAVEQ 213 Query: 393 MLITVAKDVKAQIEFNPA 410 VA D ++ P Sbjct: 214 AGSVVAPDAHLDLDVAPG 231 >UniRef50_C9Q197 Aerotolerance protein BatB n=11 Tax=Prevotella RepID=C9Q197_9BACT Length = 591 Score = 54.3 bits (129), Expect = 1e-05, Method: Compositional matrix adjust. Identities = 49/204 (24%), Positives = 94/204 (46%), Gaps = 35/204 (17%) Query: 218 VFLIDTSGSMISDERLPLIQSSLKLLVKELREQ---DNIAIVTYAGDSRIALPSISGSHK 274 V +D S SM++ + +P KLL++ L + D I +V +AGD+ + LP + Sbjct: 130 VIAVDISNSMMAQDVVPSRLEKSKLLIENLVDHFTHDRIGLVVFAGDAFVQLPITTDYVS 189 Query: 275 AEINAAIDSLDAEGSTNGGAGLELAYQQATKGFI--KGGINRILLATDGDFNVGIDDPKS 332 A++ + ++D G + A + + F K +++ TDG+ + G + Sbjct: 190 AKM--FLQNIDPALIATQGTDIAKAINLSMRSFSQQKDIGKAVIVITDGEDHEG----GA 243 Query: 333 IESMVKKQRESGVTLSTFGVGNSN-----------------------YNEAMMVRIADVG 369 +E+ K E G+ + G+G++ NE+M +IA G Sbjct: 244 LEA-AKAANERGIRVFILGIGSTKGSPIPLAEGGYLADRSGQTVLTALNESMCKQIAQAG 302 Query: 370 NGNYSYIDTLSEAQKVLNSEMRQM 393 NG Y ++D ++AQ+ LN+E+ ++ Sbjct: 303 NGTYIHVDNTNDAQEKLNNELAKL 326 >UniRef50_UPI0001788F71 von Willebrand factor type A n=1 Tax=Geobacillus sp. Y412MC10 RepID=UPI0001788F71 Length = 1007 Score = 54.3 bits (129), Expect = 1e-05, Method: Compositional matrix adjust. Identities = 54/215 (25%), Positives = 99/215 (46%), Gaps = 22/215 (10%) Query: 211 ELPASNLVFLIDTSGSMISDERLPLIQSSLKLLVKELREQDNIAIVTYAGDSRIALPSIS 270 E+P+ L+ +ID SGSM + ++ L + S V+ +R +D + +V + +P Sbjct: 403 EIPSLGLILVIDRSGSMDGN-KIELAKESAMRTVELMRAKDTVGVVAFDDQPWWVVPPQK 461 Query: 271 GSHKAEINAAIDSLDAEGSTNGGAGLELAYQQATKGFIKGGINRILLATDGD--FNVGID 328 K E+ ++I S+ + G TN + A ++ K I I+L TDG N G Sbjct: 462 LGDKEEVLSSIQSIPSAGGTNIYPAVSSALEEMLK--IDAQRRHIILMTDGQSAMNSGYQ 519 Query: 329 DPKSIESMVKKQRESGVTLSTFGVGNSNYNEAMMVRIADVGNGNYSYIDTLSEAQKVLNS 388 D ++MV E+ +T+S+ VG + + ++ +AD G Y +++ + V + Sbjct: 520 D--LTDTMV----ENKITMSSVAVG-MDADTNLLQSLADAAKGRYYFVEDETTLPAVFSR 572 Query: 389 EMRQMLITVAKDVKAQIEFNPA------WVTEYRQ 417 E + +AK F PA W + ++Q Sbjct: 573 EA----VMLAKSYIVDKPFVPAVQNPGDWASLFQQ 603 >UniRef50_UPI0001AEB92D inter-alpha-trypsin inhibitor domain-containing protein n=1 Tax=Alteromonas macleodii ATCC 27126 RepID=UPI0001AEB92D Length = 586 Score = 54.3 bits (129), Expect = 1e-05, Method: Compositional matrix adjust. Identities = 52/192 (27%), Positives = 96/192 (50%), Gaps = 19/192 (9%) Query: 216 NLVFLIDTSGSMISDERLPLI--QSSLKLLVKELREQDNIAIVTYAGD-SRIALPSISGS 272 ++ F+IDTSGSM P++ + SL+L + L E+D +V + D +R+ S+ G+ Sbjct: 315 DITFVIDTSGSMGGR---PIVDAKESLQLAIDRLSEKDRFNVVAFNNDTTRLFETSVEGT 371 Query: 273 HKAEINA--AIDSLDAEGSTNGGAGLELAYQQ-ATKGFIKGGINRILLATDGDFNVGIDD 329 + + A + L+A G T L A ++ TK FIK +++ TDG + + Sbjct: 372 TRNKQYARDFVKHLNAGGGTEMAPALNAALKRTTTKDFIK----QVVFITDG----AVGN 423 Query: 330 PKSIESMVKKQRESGVTLSTFGVGNSNYNEAMMVRIADVGNGNYSYIDTLSEAQKVLNSE 389 ++ S +K + L T G+G S N M R A G G+Y ++ ++ ++ ++S Sbjct: 424 EAALFSQIKNEL-GDARLFTVGIG-SAPNSYFMTRAAQFGLGSYVFVRNTADIKQQMDSL 481 Query: 390 MRQMLITVAKDV 401 + ++ V D+ Sbjct: 482 LYKLESPVLSDL 493 >UniRef50_C0Z595 Putative uncharacterized protein n=1 Tax=Brevibacillus brevis NBRC 100599 RepID=C0Z595_BREBN Length = 947 Score = 54.3 bits (129), Expect = 1e-05, Method: Compositional matrix adjust. Identities = 45/184 (24%), Positives = 85/184 (46%), Gaps = 11/184 (5%) Query: 210 EELPASNLVFLIDTSGSMISDER----LPLIQSSLKLLVKELREQDNIAIVTYAGDSRIA 265 E+LP+ L +ID SGSM SD R + L + + + QD I ++ + Sbjct: 401 EQLPSLGLQLVIDKSGSMSSDARGADKMALAREAAIRATTMMNAQDYIGVIAFDDTPWDV 460 Query: 266 LPSISGSHKAEINAAIDSLDAEGSTNGGAGLELAYQQATKGFIKGGINRILLATDGDFNV 325 + S + EI I + A+G T+ L+L Y++ + ++L TDG Sbjct: 461 VAPQSVTKLDEIQQQISRIQADGGTDIFPALQLGYERVKA--MNTQRKHVILLTDG--QS 516 Query: 326 GIDDPKSIESMVKKQRESGVTLSTFGVGNSNYNEAMMVRIADVGNGNYSYIDTLSEAQKV 385 +DD E ++++ +T+ST +G+ + + ++ IA++G G Y + + K+ Sbjct: 517 ALDD--DYEGLLQQMTAENITVSTVALGDDS-DRGLLEMIAELGKGRYYFANDAESIPKI 573 Query: 386 LNSE 389 + E Sbjct: 574 FSKE 577 >UniRef50_Q73UD3 UPF0353 protein MAP_3435c n=4 Tax=Mycobacterium avium complex (MAC) RepID=Y3435_MYCPA Length = 335 Score = 53.9 bits (128), Expect = 1e-05, Method: Compositional matrix adjust. Identities = 62/240 (25%), Positives = 106/240 (44%), Gaps = 36/240 (15%) Query: 188 PAPWNEQRTLLKVDILAKDRKSEELPASNL---------VFLIDTSGSMISDE----RLP 234 P+ W T+L L + P S++ + +ID S SM S + RL Sbjct: 61 PSRWRHVPTILLATSLVLLTTAMAGPTSDVRIPLNRAVVMLVIDVSESMASTDVPPNRLA 120 Query: 235 LIQSSLKLLVKELREQDNIAIVTYAGDSRIALPSISGSHKAEINAAIDSLDAEGSTNGGA 294 + + K +L N+ +V +A ++ + +P + ++A + A IDSL T G Sbjct: 121 AAKEAGKQFADQLTPAINLGLVEFAANATLLVPPTT--NRAAVKAGIDSLQPAPKTATGE 178 Query: 295 GLELAYQQ-ATKGFIKGG-----INRILLATDGDFNVGIDD--PKSIESMVKKQRESGVT 346 G+ A Q AT G + GG RI+L +DG NV +D P+ + + + GV Sbjct: 179 GIFTALQAIATVGSVMGGGEGPPPARIVLESDGAENVPLDPNAPQGAFTAARAAKAEGVQ 238 Query: 347 LST--FGV--GNSNYNEAMMV---------RIADVGNGNYSYIDTLSEAQKVLNSEMRQM 393 +ST FG G +Y A + +I ++ +G + D+L + V ++ RQ+ Sbjct: 239 ISTISFGTPYGTVDYEGATIPVPVDDQTLQKICEITDGQAFHADSLDSLKNVYSTLQRQI 298 >UniRef50_Q21PJ3 von Willebrand factor, type A n=1 Tax=Saccharophagus degradans 2-40 RepID=Q21PJ3_SACD2 Length = 763 Score = 53.9 bits (128), Expect = 2e-05, Method: Compositional matrix adjust. Identities = 75/326 (23%), Positives = 142/326 (43%), Gaps = 29/326 (8%) Query: 96 HIANPGTARYQ-QFDDNPVKQVAQNPLATFSLDVDTG-SYANVRRFLNQGLL--PPPDAV 151 H+ +P Q Q+ D +Q ++ AT S+ +D G + AN+ +Q + PP A Sbjct: 266 HLISPPMVLAQGQYGDGQYEQTGKDNRATISIQLDAGFNVANIESLYHQITINKPPSSAY 325 Query: 152 RVEEIVNYFPSDWD-IKDKQSIPASKPIPFAMRYELAPAPWNEQRTLLKVDILAKDRKSE 210 VE D D + ++ +S P + LA E LL + ++ Sbjct: 326 NVELTNGSTLMDRDFVLQWRATASSAPQAAVFKETLA----GEDYLLLMLLPPQGQQQHT 381 Query: 211 ELPASNLVFLIDTSGSMISDERLPLIQSSLKLLVKELREQDNIAIVTY-AGDSRIALPSI 269 + + ++VF++DTSGSM + + SL+ ++ L D I+ + SR + Sbjct: 382 QSLSRDIVFVVDTSGSM-QGTSIQQAKRSLQFALRGLNPSDTFNIIEFDTSFSRFRSRPV 440 Query: 270 SGSHKAEINAA---IDSLDAEGSTNGGAGLELAYQQATKGFIKG--------GINRILLA 318 S + + + AA +++L+A+ T A LE A+ Q G + +++ Sbjct: 441 SAT-ASNVQAAVSWVNNLNADNGTEMYAALEEAFDQLASINPNGTENSKSSNNLQQVVFI 499 Query: 319 TDGDFNVGIDDPKSIESMVKKQRESGVTLSTFGVGNSNYNEAMMVRIADVGNGNYSYIDT 378 TDG + + +++ S++ + R + L T +G S N M + A G G +I Sbjct: 500 TDG----AVGNEQALLSLIHR-RLNNARLFTVAIG-SAPNSYFMRKAAQFGKGANVFIGD 553 Query: 379 LSEAQKVLNSEMRQMLITVAKDVKAQ 404 +E +N+ + ++ T+ D+ Q Sbjct: 554 TAEVTHKMNALLSKLKTTLVSDINVQ 579 >UniRef50_C1I2R0 von Willebrand factor n=1 Tax=Clostridium sp. 7_2_43FAA RepID=C1I2R0_9CLOT Length = 960 Score = 53.9 bits (128), Expect = 2e-05, Method: Compositional matrix adjust. Identities = 50/213 (23%), Positives = 98/213 (46%), Gaps = 14/213 (6%) Query: 207 RKSEELPASNLVFLIDTSGSMISD----ERLPLIQSSLKLLVKELREQDNIAIVTYAGDS 262 R E+PA ++ +ID SGSM ++ +L L + + ++ LRE D I+++ + Sbjct: 398 RGKNEVPAISINLIIDKSGSMSAEGGGVSKLTLAKEAAMKALENLREVDEISVIAFDDTY 457 Query: 263 RIALPSISGSHKAEINAAIDSLDAEGSTNGGAGLELAYQQATKGFIKGGINRILLATDGD 322 +P K I I + G T+ LE Y + K I +L TDG Sbjct: 458 DEVVPLQKVGDKEAIKELISGIQIRGGTSIYPALEQGYNMQMQSSAK--IKHTILLTDGQ 515 Query: 323 FNVGIDDPKSIESMVKKQRESGVTLSTFGVGNSNYNEAMMVRIADVGNGNYSYIDTLSEA 382 G+D+ ++++ ++ +TLST VG N ++ ++A +G G Y D ++ Sbjct: 516 DGYGLDN---YATLLQNFIDNNITLSTVAVGEGA-NAGLLNQLASIGKGRSYYTDIYTDI 571 Query: 383 QKVLNSEMRQMLITVAKDVKAQIEFNPAWVTEY 415 ++ +++L++ + + EF P ++ + Sbjct: 572 PRIF---AKEVLLSAGTYIINE-EFTPKILSNH 600 >UniRef50_A0CDA0 Chromosome undetermined scaffold_17, whole genome shotgun sequence n=2 Tax=Paramecium tetraurelia RepID=A0CDA0_PARTE Length = 508 Score = 53.9 bits (128), Expect = 2e-05, Method: Compositional matrix adjust. Identities = 55/245 (22%), Positives = 105/245 (42%), Gaps = 19/245 (7%) Query: 173 PASKPIPFAMRYELAPAP--WNEQRTLLKVDILAKDRKSEELP--ASNLVFLIDTSGSMI 228 P I F M + + P N + + + + + + EEL +L LID G+ + Sbjct: 96 PLDDDIQFDM-FSVNPGSNILNLTQHTIPIVLQLRTKTLEELDQIGVDLFCLIDI-GNGM 153 Query: 229 SDERLPLIQSSLKLLVKELREQDNIAIVTYAGDSRI--ALPSISGSHKAEINAAIDSLDA 286 +++ ++ L ++ LREQD + ++++ D ++ L ++ + ID L Sbjct: 154 QGQKIDYVKQILHSILTNLREQDRLCLISFNNDGKLLTGLQKVTSETQEYFAFVIDGLQC 213 Query: 287 EGSTNGGAGLELAYQQATKGFIKGGINRILLATDGDFNVGIDDPKSIESMVKKQRESG-- 344 G+T G E+A+ + K RIL+ +DG + + + +KKQ E Sbjct: 214 NGTTELWKGTEVAFDVINQRKNKNNWARILIFSDGQDEIAL-------TKIKKQLEYNYD 266 Query: 345 -VTLSTFGVGNSNYNEAMMVRIADVGNGNYSYIDTLSEAQKVLNSEMRQMLITVAKDVKA 403 T+ +FG NSN ++ + I ++ G + I++ + K L + DV Sbjct: 267 IFTIDSFGFSNSNASKRLS-SITNLRFGKHHIINSEQQVFKCLEQTFANFPFNLWDDVTI 325 Query: 404 QIEFN 408 I N Sbjct: 326 TISTN 330 >UniRef50_B4UFP8 von Willebrand factor type A n=1 Tax=Anaeromyxobacter sp. K RepID=B4UFP8_ANASK Length = 480 Score = 53.9 bits (128), Expect = 2e-05, Method: Compositional matrix adjust. Identities = 48/202 (23%), Positives = 100/202 (49%), Gaps = 8/202 (3%) Query: 208 KSEELPASNLVFLIDTSGSMISDERLPLIQSSLKLLVKELREQDNIAIVTYAGD-SRIAL 266 ++E P ++ ++D SGSM E+L S+ LV L D +V ++ + +A Sbjct: 34 RAERSPVC-VIPVLDVSGSM-HGEKLHFATQSIMKLVDHLAPGDFCGVVVFSTEVETLAA 91 Query: 267 PS-ISGSHKAEINAAIDSLDAEGSTNGGAGLELAYQQATKGFIKGGIN-RILLATDGDFN 324 P+ ++ K + A+ L +TN GL A + G+ R++L TDG N Sbjct: 92 PTEMTQDRKDALKVALGRLRPRHNTNLAGGLLAGLDHAKVTKVPDGMPVRVILFTDGLAN 151 Query: 325 VG-IDDPKSIESMVKKQRESGVTLSTFGVGNSNYNEAMMVRIADVGNGNYSYIDTLSEAQ 383 G P+ + ++++ + ++S FG G+ + ++ ++ ++ +G GNY+Y+ + +A Sbjct: 152 EGPATSPEGLCALLEANLGTA-SVSAFGYGD-DADQELLRELSTLGRGNYAYVRSPEDAL 209 Query: 384 KVLNSEMRQMLITVAKDVKAQI 405 E+ +L T A+ ++ ++ Sbjct: 210 TAFARELGGLLSTYAQRIEVRV 231 >UniRef50_Q2B7C3 Putative uncharacterized protein n=1 Tax=Bacillus sp. NRRL B-14911 RepID=Q2B7C3_9BACI Length = 920 Score = 53.9 bits (128), Expect = 2e-05, Method: Compositional matrix adjust. Identities = 54/204 (26%), Positives = 97/204 (47%), Gaps = 10/204 (4%) Query: 197 LLKVDILAKDRKSEELPASNLVFLIDTSGSMISDERLPLIQSSLKLLVKELREQDNIAIV 256 LL V++ K +K E+P+ L+ ++D SGSM + +L L + + V+ LRE+D + + Sbjct: 387 LLPVNMDIKGKK--EMPSLGLMIVMDRSGSM-AGSKLELAKEAAARSVELLREKDTLGFI 443 Query: 257 TYAGDSRIALPSISGSHKAEINAAIDSLDAEGSTNGGAGLELAYQQATKGFIKGGINRIL 316 + + + + K + I S+ G T LE AY++ +K I+ Sbjct: 444 AFDDRPWVIVETGPLEDKKDAVDKIGSVTPGGGTEIFTSLEKAYEELEN--LKLQRKHII 501 Query: 317 LATDGDFNVGIDDPKSIESMVKKQRESGVTLSTFGVGNSNYNEAMMVRIADVGNGNYSYI 376 L TDG D ESM++ +E+ +TLST +G S+ + ++ +A +G G + + Sbjct: 502 LLTDGQSARSTD----YESMIETGKENNITLSTVALG-SDADRNLLEELAGLGAGRFYDV 556 Query: 377 DTLSEAQKVLNSEMRQMLITVAKD 400 S +L+ E T +D Sbjct: 557 TDSSVIPSILSRETVMATRTYIED 580 >UniRef50_Q10Z89 von Willebrand factor, type A n=1 Tax=Trichodesmium erythraeum IMS101 RepID=Q10Z89_TRIEI Length = 477 Score = 53.9 bits (128), Expect = 2e-05, Method: Compositional matrix adjust. Identities = 46/143 (32%), Positives = 75/143 (52%), Gaps = 12/143 (8%) Query: 217 LVFLIDTSGSMISDERLPLIQSSLKLLV-KELREQDNIAIVTYAGDSRIALPSISGSHKA 275 +V LIDTS SM +LP +Q++ V ++ +N+AIV ++ +S++ + + K Sbjct: 48 VVLLIDTSSSMWGG-KLPEVQAAATGFVERQNLTVNNLAIVEFSSNSQVL--TNFDADKT 104 Query: 276 EINAAIDSLDAEGSTNGGAGLELAYQQATKGFIKGGINRILLATDGDFNVGIDDPKSIES 335 E+ AI +L G TN GL + ILL TDG N DP++ +S Sbjct: 105 ELKQAIANLTPSGGTNLSQGL----KTVASLLRNSNTPNILLFTDGQPN----DPRASKS 156 Query: 336 MVKKQRESGVTLSTFGVGNSNYN 358 + ++ RE+G+ L T G G++N N Sbjct: 157 IAREIREAGINLVTVGTGDANSN 179 >UniRef50_B6HQM8 Pc22g11730 protein n=17 Tax=Leotiomyceta RepID=B6HQM8_PENCW Length = 1029 Score = 53.5 bits (127), Expect = 2e-05, Method: Compositional matrix adjust. Identities = 61/275 (22%), Positives = 121/275 (44%), Gaps = 28/275 (10%) Query: 211 ELPASN-------LVFLIDTSGSMISDERLPLIQSSLKLLVKELREQDNIAIVTYAGDSR 263 E PA N LV +I S SM ++ L++ +LK LV+ L +D + +VT+ G S Sbjct: 503 EFPAINTVHIPLDLVVVIPVSSSM-QGLKITLLRDALKFLVQNLGPRDRMGLVTF-GSSG 560 Query: 264 IALPSISGSHKA-----EINAAI-----DSLDAEGSTNGGAGLELAYQQATKGFIKGGIN 313 +P + + K+ +I +I SL A+ ++L Q+ ++ Sbjct: 561 GGVPLVGMTTKSWAGWSKILESIRPVGQKSLRADVVEGANVAMDLLMQRK----FNNPVS 616 Query: 314 RILLATDGDFNVGIDDPKSIESMVKKQRESGVTLSTFGVGNSNYNEAMMVRIADVGNGNY 373 ILL +D I DP+S++ +V + + VT+ +FG+G + + M+ ++ G+Y Sbjct: 617 TILLISDS----SISDPESVDFVVSRAEAAKVTIHSFGLGLT-HKPDTMIELSTRTKGSY 671 Query: 374 SYIDTLSEAQKVLNSEMRQMLITVAKDVKAQIEFNPAWVTEYRQIGYEKRQLRVEHFNND 433 SY+ ++ + + + T ++VK ++ ++ +I + + Sbjct: 672 SYVKDWMMLRECVAGCLGALQTTSHQNVKLKLRLPEGSPAKFVKISGALHTTKRATGKDA 731 Query: 434 NVDAGDIGAGKHITLLFELTLNGQKASIDKLRYAP 468 GD+ G +L +L + A+ D + P Sbjct: 732 EAALGDLRFGDKRDVLVQLVIQPDNATQDNMPQDP 766 >UniRef50_Q22N58 von Willebrand factor type A domain containing protein n=1 Tax=Tetrahymena thermophila SB210 RepID=Q22N58_TETTH Length = 669 Score = 53.5 bits (127), Expect = 2e-05, Method: Compositional matrix adjust. Identities = 56/223 (25%), Positives = 98/223 (43%), Gaps = 34/223 (15%) Query: 215 SNLVFLIDTSGSMISD---------------ERLPLIQSSLKLLVKELREQDNIAIVTYA 259 N+V L+D S SM S L L++ ++K + L QD +A+V ++ Sbjct: 33 CNIVCLVDGSLSMGSKLVIHQKNGGKKESDMTTLDLVKHTVKTIASSLNPQDRLALVGFS 92 Query: 260 GDSRI--ALPSISGSHKAEINAAIDSLDAEGSTNGGAGLELAYQQATKGFIKGGINRILL 317 S+I L + K ID + A G TN GL+ + + KGF I L Sbjct: 93 THSKIYFELTEMDDQGKNVAFTEIDKMWAGGQTNIWGGLQDSLEVIKKGFRPNQNVCIFL 152 Query: 318 ATDGDFNVGIDDPKSIES-----MVKKQRESG----VTLSTFGVGNSNYNEAMMVRIADV 368 TDG P I + M+++ +E ++ TFG GN + + +M+ ++ Sbjct: 153 FTDG-------RPTMIPAIGHVEMLRRWKEQHPAIQFSIFTFGFGN-DLDTDLMLELSQE 204 Query: 369 GNGNYSYIDTLSEAQKVLNSEMRQMLITVAKDVKAQIEFNPAW 411 NG +S+I S V ++ + +L T+A +V ++ + + Sbjct: 205 QNGIFSFISDSSMLGTVFSNALANILSTMANNVHLNLQLSEGY 247 >UniRef50_UPI00006CF36E U-box domain containing protein n=1 Tax=Tetrahymena thermophila RepID=UPI00006CF36E Length = 790 Score = 53.5 bits (127), Expect = 2e-05, Method: Compositional matrix adjust. Identities = 62/238 (26%), Positives = 110/238 (46%), Gaps = 35/238 (14%) Query: 199 KVDILAKDRKSEELPASNLVFLIDTSGSMISDER----------------LPLIQSSLKL 242 +V I K + ++ A ++ +ID SGSM SDE L L++ S+K Sbjct: 120 QVKISIKTPEGQQRSACDICCVIDVSGSM-SDEAKIKNSKGDIESNGLTILDLVKHSVKT 178 Query: 243 LVKELREQDNIAIVTYAGDSRIALPSISGSHKAEINAAIDSLDA---EGSTNGGAGLELA 299 ++ L E+D +++V + ++ + ++ ++ N AI L+ STN G+ A Sbjct: 179 IINNLDERDRLSLVAFHTNAY-KITDLTPMNENGRNHAIKELEKLIPLDSTNIWDGIYQA 237 Query: 300 Y--------QQATKGFIKGGINRILLATDGDFNVGIDDPKSIESMVKKQRES---GVTLS 348 Q KG + ++ILL TDG NV P+ M+KK +E ++S Sbjct: 238 LEVVKAGQQQSIQKGEQRVAFSQILLFTDGQPNV--IPPRGHLPMLKKYKEENDVNCSIS 295 Query: 349 TFGVGNSNYNEAMMVRIADVGNGNYSYIDTLSEAQKVLNSEMRQMLITVAKDVKAQIE 406 TFG G N + ++ ++A G G++++I V + + ++ T+A D IE Sbjct: 296 TFGFG-YNLDSELLDQLAIEGRGSFAFIPDGQFVGTVFVNALSNLMTTLAVDAVLCIE 352 >UniRef50_Q14624 35 kDa inter-alpha-trypsin inhibitor heavy chain H4 n=38 Tax=Eutheria RepID=ITIH4_HUMAN Length = 930 Score = 53.1 bits (126), Expect = 3e-05, Method: Compositional matrix adjust. Identities = 39/146 (26%), Positives = 73/146 (50%), Gaps = 9/146 (6%) Query: 216 NLVFLIDTSGSMISDERLPLIQSSLKLLVKELREQDNIAIVTYAGDSRIALPSISGSHKA 275 N+VF+ID SGSM S ++ + +L ++ +L +D ++ ++ ++ PS+ + Sbjct: 274 NVVFVIDKSGSM-SGRKIQQTREALIKILDDLSPRDQFNLIVFSTEATQWRPSLVPASAE 332 Query: 276 EINAAID---SLDAEGSTNGGAGLELAYQ-----QATKGFIKGGINRILLATDGDFNVGI 327 +N A + A G TN + +A Q + +G ++ I+L TDGD VG Sbjct: 333 NVNKARSFAAGIQALGGTNINDAMLMAVQLLDSSNQEERLPEGSVSLIILLTDGDPTVGE 392 Query: 328 DDPKSIESMVKKQRESGVTLSTFGVG 353 +P+SI++ V++ +L G G Sbjct: 393 TNPRSIQNNVREAVSGRYSLFCLGFG 418 >UniRef50_A1S752 Inter-alpha-trypsin inhibitor domain protein n=1 Tax=Shewanella amazonensis SB2B RepID=A1S752_SHEAM Length = 753 Score = 52.8 bits (125), Expect = 3e-05, Method: Compositional matrix adjust. Identities = 54/194 (27%), Positives = 92/194 (47%), Gaps = 15/194 (7%) Query: 214 ASNLVFLIDTSGSMISDERLPLIQSSLKLLVKELREQDNIAIVTYAGDSRIALPSISGSH 273 A LV +IDTSGSM D + +S+L + L QD+ I+ ++ D+R P + Sbjct: 396 ARELVLVIDTSGSMAGDSMV-QARSALIHALGGLGPQDSFNIIAFSSDARPLWPDAKPAT 454 Query: 274 KAEINAA---IDSLDAEGSTNGGAGLELAYQQATKGFIKGGINR---ILLATDGDFNVGI 327 + AA + SL+A+G T + LELA + T + R +L TDG N G Sbjct: 455 AFNLGAAQQFVRSLEADGGTEMASALELALK--TPSVVDEDTKRLRQVLFITDGAVN-GE 511 Query: 328 DDPKSIESMVKKQRESGVTLSTFGVGNSNYNEAMMVRIADVGNGNYSYIDTLSEAQKVLN 387 D ++ +++++ R L +G + N M R A G G++++I E + +N Sbjct: 512 D---ALFNLIER-RLGTSRLFPVAIGAAP-NGYFMSRAAAAGRGSFTFIGHGGEVAEKMN 566 Query: 388 SEMRQMLITVAKDV 401 + ++ V D+ Sbjct: 567 QLLSRIEHPVVSDL 580 >UniRef50_C7YL43 Putative uncharacterized protein n=2 Tax=Nectriaceae RepID=C7YL43_NECH7 Length = 764 Score = 52.8 bits (125), Expect = 3e-05, Method: Compositional matrix adjust. Identities = 54/213 (25%), Positives = 91/213 (42%), Gaps = 30/213 (14%) Query: 185 ELAPAPWNEQRTLLKVDILAKDRKSEELP--ASNLVFLIDTSGSMISDERLP-------- 234 L P P R L V + S E+P ++V +ID SGSM +P Sbjct: 56 HLEPVP---DRKGLIVKVQPPTAPSAEIPHVPCDIVLVIDVSGSMAGAAPVPGEETNEST 112 Query: 235 ------LIQSSLKLLVKELREQDNIAIVTYAGDSRIALP--SISGSHKAEINAAIDSLDA 286 L + + + +++ + E D + IVT+A +++ P S++ +K + S+ Sbjct: 113 GLSILDLTKHAARTIIETMNESDRLGIVTFASKAKVVQPLLSMTSENKERSRGNVTSMRP 172 Query: 287 EGSTNGGAGLELAYQQATKGFIKGGINRILLATDGDFNVGIDDPKSIESMVKKQRESG-- 344 +TN GL L + K + I++ TDG N V K R G Sbjct: 173 IDATNLWHGL-LEGIKLFKNVKSSNVPAIMVLTDGMPN----HMNPAAGFVPKLRAMGQL 227 Query: 345 -VTLSTFGVGNSNYNEAMMVRIADVGNGNYSYI 376 ++ TFG G + ++ IA++G GNY++I Sbjct: 228 PASIHTFGFG-YHLRSGLLKSIAEIGGGNYAFI 259 >UniRef50_Q24C76 von Willebrand factor type A domain containing protein n=2 Tax=Tetrahymena thermophila RepID=Q24C76_TETTH Length = 670 Score = 52.8 bits (125), Expect = 3e-05, Method: Compositional matrix adjust. Identities = 51/185 (27%), Positives = 88/185 (47%), Gaps = 29/185 (15%) Query: 215 SNLVFLIDTSGSMISDER-----------------LPLIQSSLKLLVKELREQDNIAIVT 257 SN+ ++D SGSM S+ + L +++ S+K++V L +D ++IVT Sbjct: 33 SNICCVVDVSGSMSSEAKIINQSSQKSDENYSLSILDVVKHSIKMIVNTLGSEDYLSIVT 92 Query: 258 YAGDSRIA--LPSISGSHKAEINAAIDSLDAEGSTNGGAGLELAYQQATKGFIKGGINRI 315 ++ + + L ++ S+K I++L EG T GL A I Sbjct: 93 FSDSANVLFDLLPMNDSNKTMAIEKIENLSTEGGTELWKGLNSALNILLNNKTPNTNQSI 152 Query: 316 LLATDGD-FNVGIDDPKSIESMVKKQR---ESGVTLSTFGVGNSNYNEAMMVRIADVGNG 371 L TDG + GID ++VK ++ + T++TFG +S+ N +M +IA NG Sbjct: 153 FLLTDGQPTDSGID-----TNLVKFKQAYPKLNCTINTFGFSSSS-NSELMNKIAMEYNG 206 Query: 372 NYSYI 376 +S+I Sbjct: 207 MFSFI 211 >UniRef50_C5WZE3 Putative uncharacterized protein Sb01g019910 n=1 Tax=Sorghum bicolor RepID=C5WZE3_SORBI Length = 704 Score = 52.8 bits (125), Expect = 4e-05, Method: Compositional matrix adjust. Identities = 50/209 (23%), Positives = 101/209 (48%), Gaps = 19/209 (9%) Query: 216 NLVFLIDTSGSMISDERLPLIQSSLKLLVKELREQDNIAIVTYAGDSRIALP--SISGSH 273 +LV ++D S SM S +L L++ +++ +++ L D +++V ++ + P ++ Sbjct: 235 DLVTVLDVSRSM-SGPKLALLKRAMRFVIENLEPSDRLSVVAFSSSACRLFPLRKMTAFG 293 Query: 274 KAEINAAIDSLDAEGSTNGGAGLELAYQQATKGFIKGGINRILLATDG--DFNVGIDDPK 331 + + A+DSL A+G TN GL A + + + I+L +DG N+ D Sbjct: 294 QQQSQQAVDSLVADGGTNIAEGLRKAARVVEDRQARNPVCSIILLSDGVDSHNLPPRDGS 353 Query: 332 SIE----SMVKKQ----RESGVTLSTFGVG---NSNYNEAMMVRIADVGNGNYSYIDTL- 379 + E +V + E V + FG+G + +++ M +A + +G +S+ID + Sbjct: 354 APEPDYAPLVPRSILPGSEHHVPIHAFGLGMDHDHDHDSRAMHAVAQMSSGTFSFIDMVG 413 Query: 380 SEAQKVLNSEMRQML--ITVAKDVKAQIE 406 S Q L + +L VA++ + +E Sbjct: 414 SSIQDALAQCIGGLLSVSVVAQETRLSVE 442 >UniRef50_A3W9L9 Putative uncharacterized protein n=2 Tax=Erythrobacter RepID=A3W9L9_9SPHN Length = 740 Score = 52.8 bits (125), Expect = 4e-05, Method: Compositional matrix adjust. Identities = 46/198 (23%), Positives = 94/198 (47%), Gaps = 11/198 (5%) Query: 211 ELPASNLVFLIDTSGSMISDERLPLIQSSLKLLVKELREQDNIAIVTYAGDSRIALPSIS 270 E P ++F+ID SGSM + E +P + SL ++ LR QD ++ + D + S Sbjct: 340 EAPPREMIFVIDNSGSM-AGESMPAARRSLLYALETLRPQDRFNVIRF--DDTMTELFAS 396 Query: 271 GSHKAEIN-AAIDSLDAEGSTNGGAGLELAYQQATKGFIKG-GINRILLATDGDFNVGID 328 ++ N AA + NGG + A + A + + +++ TDG + + Sbjct: 397 AVQASDSNIAAAKTFTHNLMANGGTEMLPALRAALRDRAPDERVRQVIFLTDGALS---N 453 Query: 329 DPKSIESMVKKQRESGVTLSTFGVGNSNYNEAMMVRIADVGNGNYSYIDTLSEAQKVLNS 388 + +E + + +++S V + G + Y +M R+A+ G G ++++ EA+ + Sbjct: 454 EADMMEEINRNRKDSRVFMVGIGSAPNTY---LMRRMAEAGRGTFTHVGMGEEAEDQMQR 510 Query: 389 EMRQMLITVAKDVKAQIE 406 + ++ + VA + A +E Sbjct: 511 LLDRLSLPVATGLTANVE 528 >UniRef50_Q5LCG5 Aerotolerance-related membrane protein n=25 Tax=Bacteroidales RepID=Q5LCG5_BACFN Length = 341 Score = 52.8 bits (125), Expect = 4e-05, Method: Compositional matrix adjust. Identities = 58/205 (28%), Positives = 90/205 (43%), Gaps = 42/205 (20%) Query: 221 IDTSGSMISDERLPLIQSSLKLLVKEL---REQDNIAIVTYAGDSRIALPSISG--SHKA 275 +D S SM++ + P K L+ +L E D + ++ +AGD+ LP S S K Sbjct: 96 LDISNSMLAQDVQPSRLEKAKRLISKLVDGMENDKVGMIVFAGDAFTQLPITSDYISAKM 155 Query: 276 EINAAIDSLDAEGSTNGGAGLELAYQQATKGFIKGGINR-ILLATDGD------------ 322 + + SL ++ T GA + LA + T + G+ R I++ TDG+ Sbjct: 156 FLESISPSLISKQGTAIGAAINLAARSFTP---QEGVGRAIVVITDGENHEGGAVEAAKE 212 Query: 323 ----------FNVGIDD--PKSIESM--VKKQRESGVTLSTFGVGNSNYNEAMMVRIADV 368 VG+ D P IE ++ RE V ++ NEAM IA Sbjct: 213 AAKKGIQVNVLGVGLPDGAPIPIEGSNDFRRDREGNVIVTRL-------NEAMCQEIAKE 265 Query: 369 GNGNYSYIDTLSEAQKVLNSEMRQM 393 GNG Y +D + AQK +N E+ +M Sbjct: 266 GNGIYIRVDNSNSAQKAINQEINKM 290 >UniRef50_B6ZDR6 Voltage dependent calcium channel alpha2d/delta subunit n=3 Tax=Euteleostomi RepID=B6ZDR6_RANCA Length = 1078 Score = 52.8 bits (125), Expect = 4e-05, Method: Compositional matrix adjust. Identities = 56/244 (22%), Positives = 117/244 (47%), Gaps = 33/244 (13%) Query: 189 APW-NEQRTLLKVDILAKDRKSEELPAS----NLVFLIDTSGSMISDERLPLIQSSLKLL 243 +PW ++ RT K+D+ R+ + + +++ L+D SGS +S L LI++S+ + Sbjct: 222 SPWVDKSRTPNKIDLYDVRRRPWYIQGAASPKDMLILVDVSGS-VSGLTLKLIRTSVTEM 280 Query: 244 VKELREQDNIAIVTYAGDSRIALPSISGSH---------KAEINAAIDSLDAEGSTNGGA 294 ++ L + D + + + ++ +S H K + A++++ A+G+T+ Sbjct: 281 LETLSDDDFVNVAAFNSNAH----DVSCFHHLVQANVRNKKVLKEAVNNITAKGTTDYKQ 336 Query: 295 GLELAYQQATKGFI-KGGINRI-LLATDGDFNVGIDDPKSIESMVKKQRESGVTLSTFGV 352 G + A+ Q + + N+I +L TDG G D K+ E+ + V + TF V Sbjct: 337 GFKFAFDQLRNTNVSRANCNKIIMLFTDG----GED--KATETFKLYNKNKTVRVFTFSV 390 Query: 353 GNSNYNEAMMVRIADVGNGNYSYIDTLS----EAQKVLNSEMRQMLITVAKDVKAQIEFN 408 G NY++ + +A G Y I ++ Q+ L+ R M++ A++ Q+++ Sbjct: 391 GQHNYDKGPIQWMACENKGYYYEIPSIGAIRINTQEYLDVLGRPMVL--AREKAKQVQWT 448 Query: 409 PAWV 412 ++ Sbjct: 449 NVYL 452 >UniRef50_A3ZR58 Putative uncharacterized protein n=1 Tax=Blastopirellula marina DSM 3645 RepID=A3ZR58_9PLAN Length = 1032 Score = 52.8 bits (125), Expect = 4e-05, Method: Compositional matrix adjust. Identities = 36/187 (19%), Positives = 86/187 (45%), Gaps = 9/187 (4%) Query: 207 RKSEELPASNLVFLIDTSGSMISDERLPLIQSSLKLLVKELREQDNIAIVTYAGDSRIAL 266 R ++ +P L+ ++D SGSM E++ + Q + ++ + D ++ + ++ + Sbjct: 449 RDAKVVPVGALMLVLDKSGSM-QGEKMQMTQGAALAAIRAMGAADFAGVIGFDSQAQRIV 507 Query: 267 PSISGSHKAEINAAIDSLDAEGSTNGGAGLELAYQQATKGFIKGGINRILLATDGDFNVG 326 P + A + L A G TN G+ L ++ + G+ +++ +DG Sbjct: 508 PIRKVDNPGMFVAQVRKLSASGGTNMTPGVALGFRDLQN--VDAGVKHMIVLSDGQ---- 561 Query: 327 IDDPKSIESMVKKQRESGVTLSTFGVGNSNYNEAMMVRIADVGNGNYSYIDTLSEAQKVL 386 +P ++ + ++ G+T+S VG S+ ++ +M +A G G + ++ ++ Sbjct: 562 -TEPGNVAQIASDMKKMGMTVSAVAVG-SDADQKLMATVARNGGGKFYAVNNPKAIPRIF 619 Query: 387 NSEMRQM 393 E R++ Sbjct: 620 MREARRV 626 >UniRef50_B4X134 Vault protein inter-alpha-trypsin n=1 Tax=Alcanivorax sp. DG881 RepID=B4X134_9GAMM Length = 657 Score = 52.8 bits (125), Expect = 4e-05, Method: Compositional matrix adjust. Identities = 53/220 (24%), Positives = 105/220 (47%), Gaps = 21/220 (9%) Query: 192 NEQRTLLKVDILAKDRKSEELPASNLVFLIDTSGSMISDERLPLIQSSLKLLVKELREQD 251 E LL V + K + LP L F+ID+SGSM + ++SL L ++ L+ D Sbjct: 278 GEHYALLMV-VPPKTGQVTALPRETL-FIIDSSGSM-GGAPMRQAKASLHLALQRLKPGD 334 Query: 252 NIAIVTYAGDSRIAL-----PSISGSHKAEINAAIDSLDAEGSTNGGAGLELAYQQ-ATK 305 I + DS+ L ++S + + + +D L A G T+ L Q A+ Sbjct: 335 RFNITDF--DSQHTLLFETPVTVSDNSRQQAQDFVDGLQASGGTHMLPALSATLSQPASD 392 Query: 306 GFIKGGINRILLATDGDFNVGIDDPKSIESMVKKQRESGVTLSTFGVGNSNYNEAMMVRI 365 G+++ +++ TDG +++ + + R L T G+G++ N M R Sbjct: 393 GYLR----QVIFITDGAVGNESGIFRALHQQLGEAR-----LFTVGIGSAP-NSHFMTRA 442 Query: 366 ADVGNGNYSYIDTLSEAQKVLNSEMRQMLITVAKDVKAQI 405 A G G+++YI+ ++ Q+ +++ R++ + ++++ Q+ Sbjct: 443 AQFGRGSFTYINDQNQVQQGMDTLFRRLESPLMRNLQVQL 482 >UniRef50_Q1AYC2 Protoporphyrin IX magnesium-chelatase n=15 Tax=Bacteria RepID=Q1AYC2_RUBXD Length = 616 Score = 52.4 bits (124), Expect = 5e-05, Method: Compositional matrix adjust. Identities = 45/149 (30%), Positives = 79/149 (53%), Gaps = 12/149 (8%) Query: 201 DILAKDRKSEELPASNLVFLIDTSGSMISDERLPLIQSSLKLLVKE-LREQDNIAIVTYA 259 D+ K R+ E + LV ++D+SGSM + R+ ++ +++ L+++ R +D A++++ Sbjct: 436 DLREKVREGRE--GNLLVLVVDSSGSMAARSRMSAVKGAVRALLEDAYRRRDRAAVISFR 493 Query: 260 G-DSRIALPSISGSHKAEINAAIDSLDAEGSTNGGAGLELAYQQATKGFIKGGINRILLA 318 G ++R+ +P SG A A ++ L G T AGLELA + + + R LL Sbjct: 494 GEEARLLVPPASGVEAA--AARLEELPTGGRTPLAAGLELAAETVLREASREPERRPLLV 551 Query: 319 --TDGDFNVGIDDPKSIESMVKKQRESGV 345 TDG G +DP + ++ RE GV Sbjct: 552 VITDGRATAG-EDPL---AAARRLRERGV 576 >UniRef50_C1XFF8 Mg-chelatase subunit ChlD n=1 Tax=Meiothermus ruber DSM 1279 RepID=C1XFF8_MEIRU Length = 298 Score = 52.4 bits (124), Expect = 5e-05, Method: Compositional matrix adjust. Identities = 45/176 (25%), Positives = 80/176 (45%), Gaps = 11/176 (6%) Query: 215 SNLVFLIDTSGSM----ISDERLPLIQSSLKLLVKELREQDNIAIVTYAGDSRIALPSIS 270 + ++ I+ SM I+ R+ Q + K LV +L + +VT++G + LP + Sbjct: 84 AGVILAIENGWSMRQTDIAPNRMVATQMAAKALVDKLPRHIKVGVVTFSGYGTLLLPPTT 143 Query: 271 GSHKAEINAAIDSLDAEGSTNGGAGLELAYQQATKGFIKGGINRILLATDGDFNVGIDDP 330 + I AID+LD G + GL A + + +G +++ +V +DP Sbjct: 144 --DRKAIRQAIDNLDLGGGFSFTYGLLAALEALPQTPPEGSRPGVIVLFSHGHDVSGNDP 201 Query: 331 KSIESMVKKQRESGVTLSTFGVGNS--NYNEAMMVRIADVGNGNYSYIDTLSEAQK 384 I + E G+ + GVG N++E M+ ++AD G Y I + S+ K Sbjct: 202 LKIAD---QALERGIQVHAIGVGTHGHNFDEEMLKKVADRTGGRYYPIFSASDLSK 254 >UniRef50_C5Z1W1 Putative uncharacterized protein Sb10g030210 n=1 Tax=Sorghum bicolor RepID=C5Z1W1_SORBI Length = 607 Score = 52.4 bits (124), Expect = 5e-05, Method: Compositional matrix adjust. Identities = 51/193 (26%), Positives = 95/193 (49%), Gaps = 15/193 (7%) Query: 214 ASNLVFLIDTSGSMISDERLPLIQSSLKLLVKELREQDNIAIVTYAGDSRIALPSISGSH 273 A +LV ++D SGSM RL ++S+++ ++K+L D +++VT+ G + P + S Sbjct: 104 ALDLVVVLDVSGSMRDFGRLDKLKSAMRFIIKKLAPMDRLSVVTFNGGATRECPLRAMSE 163 Query: 274 KA--EINAAIDSLDAEGSTNGGAGLELAYQQAT-KGFIKGGINRILLATDGDFNVGIDDP 330 A + +D L A G TN AGL++ Q + + ++L +DG+ N G Sbjct: 164 DAVPVLTDIVDGLVARGGTNIEAGLKMGLQVLDGRRYTGARTAGVILMSDGEQNSG---- 219 Query: 331 KSIESMVKKQRESGVTLSTFGVGNSNYNEAMMVRIADVGNGNYS-YIDTLSEAQ-KVLNS 388 + V+ + V +FG SN + ++ ++A G G Y+ +D+ + V + Sbjct: 220 --DATRVRNPQNYPVYTLSFG---SNADMNLLQKLAG-GGGTYNPVLDSGGMSMLDVFSQ 273 Query: 389 EMRQMLITVAKDV 401 M +L V +D+ Sbjct: 274 LMAGLLTVVVRDL 286 >UniRef50_B1HPN0 Putative uncharacterized protein n=2 Tax=Bacillaceae RepID=B1HPN0_LYSSC Length = 825 Score = 52.4 bits (124), Expect = 5e-05, Method: Compositional matrix adjust. Identities = 49/184 (26%), Positives = 94/184 (51%), Gaps = 12/184 (6%) Query: 195 RTLLKVDILAKDRKSEELPASNLVFLIDTSGSMISDERLPLIQSSLKLLVKELREQDNIA 254 TLL V++ K + E+LP+ LV ++D SGSM S +L L + + V+ LR++D + Sbjct: 349 ETLLPVEMEIKGK--EQLPSLGLVIVLDRSGSM-SGSKLELAKEAAARSVEMLRDEDTLG 405 Query: 255 IVTYAGDSRIALPSISGSHKAEINAAIDSLDAEGSTNGGAGLELAYQQATKGFIKGGINR 314 + + + + ++K E I S+ G T L AY+ +K Sbjct: 406 FIAFDDRPWEIIETGPLNNKEEAVDTILSVTPGGGTEIYGSLAKAYENLAD--MKLQRKH 463 Query: 315 ILLATDGDFNVGIDDPKSIESMVKKQRESGVTLSTFGVGNSNYNEAMMVRIADVGNGN-Y 373 I+L TDG P + + ++++ +++G+TLST +G + + ++ ++++G+G Y Sbjct: 464 IILLTDGQ-----SQPGNYDDLIEQGKDNGITLSTVAIGQ-DADANLLEALSEMGSGRFY 517 Query: 374 SYID 377 + ID Sbjct: 518 NVID 521 >UniRef50_Q4RF07 Chromosome 13 SCAF15122, whole genome shotgun sequence. (Fragment) n=2 Tax=Tetraodon nigroviridis RepID=Q4RF07_TETNG Length = 983 Score = 52.0 bits (123), Expect = 6e-05, Method: Compositional matrix adjust. Identities = 52/199 (26%), Positives = 98/199 (49%), Gaps = 20/199 (10%) Query: 189 APWNEQR-TLLKVDILAKDRKSEELPAS----NLVFLIDTSGSMISDERLPLIQSSLKLL 243 +PW + R T K+D+ R+ + + +++ L+D SGS +S L LI++S+ + Sbjct: 46 SPWMDARKTPSKIDLYDVRRRPWYIQGAASPKDMLILVDASGS-VSGLTLKLIRTSVTEM 104 Query: 244 VKELREQDNIAIVTYAGDSRIA-----LPSISGSHKAEINAAIDSLDAEGSTNGGAGLEL 298 ++ L + D + +V + + L + +K + A+ ++ A+G TN GLE Sbjct: 105 LETLSDDDYVNVVYFNTQVKKTACFDHLVQANVRNKKLLKDAVQNITAKGITNYTKGLEF 164 Query: 299 AYQQ-ATKGFIKGGINR-ILLATDGDFNVGIDDPKSIESMVKKQRESGVTLSTFGVGNSN 356 A++Q + + N+ I+L TDG G + ++I + K + V + TF VG N Sbjct: 165 AFEQLSVTNVSRANCNKIIMLFTDG----GEERAQAI--LEKYNADKKVRIFTFSVGQHN 218 Query: 357 YNEAMMVRIADVGNGNYSY 375 Y++ + +A N Y Y Sbjct: 219 YDKGPIQWMA-CSNKGYFY 236 >UniRef50_A9EV77 Putative uncharacterized protein n=1 Tax=Sorangium cellulosum 'So ce 56' RepID=A9EV77_SORC5 Length = 524 Score = 52.0 bits (123), Expect = 6e-05, Method: Compositional matrix adjust. Identities = 55/216 (25%), Positives = 93/216 (43%), Gaps = 26/216 (12%) Query: 204 AKDRKSEELPASNLVFLIDTSGSMISDERLPLIQSSLKLLVKELREQDNIAIVTYAGDSR 263 A D S + P +L ++ S DE + + + L++ LR +D ++++ Y G+ Sbjct: 113 AVDPSSLDRPPVHLAIAVENSRFAGIDESA--LDAGMGGLLESLRPEDRVSVIRY-GERV 169 Query: 264 IALPSISGSHKAEINAAIDSLDAEGSTNGGAG----LELAYQQATKGFIKG------GIN 313 ++ AE+ I A+ GGAG L A K F +G G + Sbjct: 170 ERRAFLAAPESAELARLI----ADERLGGGAGELVGLYEGIAAAEKAFDEGDAAGFEGAH 225 Query: 314 RILLATDGDFNVGIDDPKSIESMVKKQRESGVTLSTFGVGNSNYNEAMMVRI----ADVG 369 R+LL T G GI DP I + + E+G GVG E+ +V+I +G Sbjct: 226 RVLLLTSGHATSGITDPDRILGLGEALVENGTAFGVIGVG-----ESFLVKIPSALGSMG 280 Query: 370 NGNYSYIDTLSEAQKVLNSEMRQMLITVAKDVKAQI 405 G Y+Y + + +L+ E + L +A D ++ Sbjct: 281 AGTYAYALSPGDLGGLLSEEGKTTLFPLATDFSLEV 316 >UniRef50_B9MS10 von Willebrand factor type A n=2 Tax=Anaerocellum thermophilum DSM 6725 RepID=B9MS10_ANATD Length = 1188 Score = 52.0 bits (123), Expect = 6e-05, Method: Compositional matrix adjust. Identities = 48/171 (28%), Positives = 86/171 (50%), Gaps = 10/171 (5%) Query: 216 NLVFLIDTSGSMISDERLPLIQSSLKLLVKELREQDNIAIVTYAGDSRIALPSISGSHKA 275 +LVF++D+SGSM ++ + + K V L + D A+V + + P ++ +A Sbjct: 499 DLVFVLDSSGSMSWNDPNGYRKIAAKSFVDALIQGDRAAVVDFDNFGYLLQP-LTTDFQA 557 Query: 276 EINAAIDSLDAEGSTNGGAGLELAYQQATKGFIKGGINRILLATDGDFNVGIDDPKSIES 335 + AID +D+ G TN G+ +A QQ + I I+L TDG+ G D + Sbjct: 558 -VKNAIDRIDSWGGTNIAEGIRIANQQLISRSSEDRIKVIILLTDGE---GYYD----NN 609 Query: 336 MVKKQRESGVTLSTFGVGNSNYNEAMMVRIADVGNGNYSYIDTLSEAQKVL 386 + + + +G+T+ T G+G S +E ++ IA G Y + + S+ +V Sbjct: 610 LTTEAKNNGITIYTIGLGTS-VDENLLRDIATQTGGMYFPVSSASQLPQVF 659 >UniRef50_B8AXM1 Putative uncharacterized protein n=1 Tax=Oryza sativa Indica Group RepID=B8AXM1_ORYSI Length = 614 Score = 52.0 bits (123), Expect = 7e-05, Method: Compositional matrix adjust. Identities = 44/169 (26%), Positives = 84/169 (49%), Gaps = 19/169 (11%) Query: 216 NLVFLIDTSGSMISDERLPLIQSSLKLLVKELREQDNIAIVTYAGDSR--IALPSISGSH 273 ++V ++D SGSM ERL ++ ++++ + +L D +++V++A R L +S Sbjct: 31 DVVAVLDVSGSM-EGERLEHVKEAMEIFIGKLGPDDRLSVVSFATSVRRLTELTYMSEQG 89 Query: 274 KAEINAAIDSLDAEGSTNGGAGLE-----LAYQQATKGFIKGGINRILLATDGDFNVGID 328 +A +D L A+GSTN GA L L ++ + G + ++ +DG Sbjct: 90 RAVAKEIVDGLVADGSTNMGAALLEGAMILRDRKGARDESNGRVGCMMFLSDG------- 142 Query: 329 DPKSIESMVKKQRESGVTLSTFGVGNSNYNEAMMVRIADVGNGNYSYID 377 + + + K+ TFG+G S++N +M IAD + YS+++ Sbjct: 143 ---TNDEIYKEDISGEFPAHTFGLG-SDHNPNVMRHIADETSATYSFVN 187 >UniRef50_Q083T9 Vault protein inter-alpha-trypsin domain protein n=1 Tax=Shewanella frigidimarina NCIMB 400 RepID=Q083T9_SHEFN Length = 722 Score = 52.0 bits (123), Expect = 7e-05, Method: Compositional matrix adjust. Identities = 53/205 (25%), Positives = 96/205 (46%), Gaps = 20/205 (9%) Query: 210 EELPASNLVFLIDTSGSMISDERLPLIQSSLKLLVKELREQDNIAIVTYAGDSRI--ALP 267 + L A L+ +IDTSGSM S + + + +L+ + LR+ D+ I+ + D + A P Sbjct: 339 QHLIARELILVIDTSGSM-SGQSITQAKQALQFALAGLRDIDSFNIIEFNSDVTMLSATP 397 Query: 268 -SISGSHKAEINAAIDSLDAEGSTNGGAGLELAYQQATK------GFIKGGINRILLATD 320 S + + + N I SLDA+G T + L+ A + + + +++ TD Sbjct: 398 LSANSRNIGKANRFIQSLDADGGTEMRSALQTALVDSVQQDSDQTDAHSEMLRQVIFMTD 457 Query: 321 GDFNVGIDDPKSIESMVKKQRESGVTLSTFGVGNSNYNEAMMVRIADVGNGNYSYIDTLS 380 G + + + ++ Q L T G+G++ N M R A +G G ++YI S Sbjct: 458 G----AVGNEHELYQLINDQLGDS-RLFTVGIGSAP-NSDFMRRAATMGRGTFTYIGNES 511 Query: 381 EAQKVLNSEMRQMLITVAKDVKAQI 405 E Q+ ++ Q+L + + V I Sbjct: 512 EVQQ----KIEQLLNKIEQPVLTNI 532 >UniRef50_B9XQ17 Vault protein inter-alpha-trypsin domain protein n=1 Tax=bacterium Ellin514 RepID=B9XQ17_9BACT Length = 806 Score = 51.2 bits (121), Expect = 1e-04, Method: Compositional matrix adjust. Identities = 42/165 (25%), Positives = 86/165 (52%), Gaps = 8/165 (4%) Query: 206 DRKSEELPASNLVFLIDTSGSMISDERLPLIQSSLKLLVKELREQDNIAIVTYAGDSRI- 264 D K++++ + ++VF++DTSGSM S +++ + +L+ V+ L + D I+ ++ +S Sbjct: 303 DAKAKQIVSKDVVFVLDTSGSM-SGKKMEQAKKALQFCVESLNDGDRFEIIRFSTESEPL 361 Query: 265 --ALPSISGSHKAEINAAIDSLDAEGSTNGGAGLELAYQQATKGFIKGGINRILLATDGD 322 L ++S ++ + I +L A G T L+ A +K +G ++ TDG Sbjct: 362 FDKLAAVSKENREKAGDFIKNLKAMGGTAIDEALKKALSLESK---EGRPFVVVFLTDGL 418 Query: 323 FNVGIDDPKSIESMVKKQRESGVTLSTFGVGNSNYNEAMMVRIAD 367 VG D I ++++ + + FG+G ++ N ++ RIA+ Sbjct: 419 PTVGTTDEDQILKGMQERNKEKRRIFCFGIG-TDVNTHLLDRIAE 462 >UniRef50_B5JSY7 Inter-alpha-trypsin inhibitor domain protein n=1 Tax=gamma proteobacterium HTCC5015 RepID=B5JSY7_9GAMM Length = 670 Score = 51.2 bits (121), Expect = 1e-04, Method: Compositional matrix adjust. Identities = 50/204 (24%), Positives = 95/204 (46%), Gaps = 21/204 (10%) Query: 217 LVFLIDTSGSMISDERLPLIQSSLKLLVKELREQDNIAIVTYAGDSRIALPSISGSHKAE 276 +VF+IDTSGSM + +R+ + +L V+ L D +V + S+ + Sbjct: 322 VVFVIDTSGSM-AGQRMYHAKQALSQAVERLSPDDRFNVVEFNNQHSRLFSSMRSASAIN 380 Query: 277 INAAIDSLDAEGSTNGGAGLELAYQQATKGFIKGG---INRILLATDGDFNVGIDDPKSI 333 + A++ + G GG G + ++ + +++L TD + + I Sbjct: 381 VKQALNWV---GRLQGGGGTMMLPAVEDALSVRSDPAYLRQVILITDAS----VGNEAEI 433 Query: 334 ESMVKKQRESGVTLSTFGVGNSNYNEAMMVRIADVGNGNYSYIDTLSEAQ--------KV 385 +V++QR+ G L T G+G S N ++ + A VG G+Y YI + E + K+ Sbjct: 434 LRVVERQRK-GARLFTVGIGVSP-NSYLLRKAAQVGQGDYVYIASGQEVKARMQRLFAKL 491 Query: 386 LNSEMRQMLITVAKDVKAQIEFNP 409 N ++Q+ I + + +A++ NP Sbjct: 492 ENPVLKQLNIDLPEGAEAEVWPNP 515 >UniRef50_Q8YNZ7 Alr4412 protein n=6 Tax=Cyanobacteria RepID=Q8YNZ7_ANASP Length = 820 Score = 50.8 bits (120), Expect = 1e-04, Method: Compositional matrix adjust. Identities = 57/210 (27%), Positives = 100/210 (47%), Gaps = 15/210 (7%) Query: 202 ILAKDRKSEELPASNLVFLIDTSGSMISDERLPLIQSS--LKLLVKELREQDNIAIVTYA 259 I A + +++ ++VFLIDTSGS + PL+Q ++ + L D +IV ++ Sbjct: 286 IPAIQYRQDQVVPKDVVFLIDTSGSQMG---APLMQCQELMRRFINGLNPDDTFSIVDFS 342 Query: 260 GDSRIALPSISGSHKAEINAAIDSLDAEGSTNGGAGLELAYQQATKGFI---KGGINRIL 316 +R P ++ AI+ ++ + S NGG + L +A F G + I+ Sbjct: 343 DTTRQLSPVPLANNAQNRTRAINYIN-QLSANGGTEM-LRGIRAVLNFPVTDPGRLRSIV 400 Query: 317 LATDGDFNVGIDDPKSIESMVKKQRESGVTLSTFGVGNSNYNEAMMVRIADVGNGNYSYI 376 L TDG I + I + V++ +SG L +FG G S+ N ++ RIA++G G I Sbjct: 401 LLTDGY----IGNENQILAEVQQHLKSGNRLYSFGAG-SSVNRFLLNRIAELGRGIAQII 455 Query: 377 DTLSEAQKVLNSEMRQMLITVAKDVKAQIE 406 ++++ RQ+ V ++ Q E Sbjct: 456 RHDEPTDEIVDKFYRQINNPVLANINLQWE 485 >UniRef50_Q1DE81 von Willebrand factor type A domain protein n=2 Tax=Myxococcales RepID=Q1DE81_MYXXD Length = 860 Score = 50.8 bits (120), Expect = 1e-04, Method: Compositional matrix adjust. Identities = 45/167 (26%), Positives = 80/167 (47%), Gaps = 17/167 (10%) Query: 213 PASNLVFLIDTSGSMISDERLPLIQSSLKLLVKELREQDNIAIVTYAGDSRIALPS---I 269 P +VF++D SGSM + E LP Q++L+L ++ LRE D ++ + + P Sbjct: 280 PKQEVVFVVDVSGSM-AGESLPQAQAALRLCLRHLREGDRFNVIAFENRFQSFQPEPVPF 338 Query: 270 SGSHKAEINAAIDSLDAEGSTNGGAGLELAYQQATKGFIKGGINRILLATDGDFNVGIDD 329 + E + + +L+A+G T A + A Q A G I+L TDG + + Sbjct: 339 TQRTLEEADRWVAALNADGGTELLAPMRAAVQAAPDGV-------IVLLTDGQ----VGN 387 Query: 330 PKSIESMVKKQRESGVTLSTFGVGNSNYNEAMMVRIADVGNGNYSYI 376 I V + R++ S FG+G +N ++ ++ +A G+ +I Sbjct: 388 EAEILRAVLEARKTARVYS-FGIG-TNVSDVLLRDMAKQTGGDVEFI 432 >UniRef50_D2V0V6 Predicted protein n=1 Tax=Naegleria gruberi RepID=D2V0V6_NAEGR Length = 502 Score = 50.8 bits (120), Expect = 1e-04, Method: Compositional matrix adjust. Identities = 52/208 (25%), Positives = 91/208 (43%), Gaps = 25/208 (12%) Query: 216 NLVFLIDTSGSMISDE---------RLPLIQSSLKLLVKE-LREQDNIAIVTYAGDSRIA 265 N+ ++D SGSM DE +L +S+++ LV L +D I ++TY+ + Sbjct: 74 NICLVLDISGSM--DEPLKNRSKGSKLTACKSAIRELVTNFLTYKDTIHLITYSDSPKTV 131 Query: 266 LPSISGSHKAEINAA-IDSLDAEGSTNGGAGLELAYQQATKGFIKGGINRILLATDGDFN 324 +K +N ID + EGSTN + L A G I +DG N Sbjct: 132 FTE---KNKESVNLNDIDKISTEGSTNIASALHSAVDLLHNSNAPG-TKLIAFFSDGQCN 187 Query: 325 VGIDDPKSIESMVKKQ-------RESGVTLSTFGVGNSNYNEAMMVRIADVGNGNYSYID 377 VG + S + K+ ++ + +S++GVG S+Y+E + IA G G Y Y++ Sbjct: 188 VGETNLNIFGSGLLKKLKDYSEGKDDQIHISSYGVG-SDYDELWLQAIARTGKGEYYYLE 246 Query: 378 TLSEAQKVLNSEMRQMLITVAKDVKAQI 405 + A+ +++ + K + Sbjct: 247 DETYAKDAFERSLKKYKYQIGKKFNVTV 274 >UniRef50_A6WMD3 Vault protein inter-alpha-trypsin domain protein n=11 Tax=Shewanella RepID=A6WMD3_SHEB8 Length = 772 Score = 50.4 bits (119), Expect = 2e-04, Method: Compositional matrix adjust. Identities = 50/208 (24%), Positives = 98/208 (47%), Gaps = 21/208 (10%) Query: 217 LVFLIDTSGSMISDERLPLIQSSLKLL--VKELREQDNIAIVTYAGD----SRIALPSIS 270 L+ +IDTSGSM D ++Q+ LL +K L+ +D+ I+ + S LP+ S Sbjct: 394 LILVIDTSGSMAGDS---IVQAKNALLYALKGLKPEDSFNIIEFNSSLSLLSATPLPATS 450 Query: 271 GSHKAEINAAIDSLDAEGSTNGGAGLELAYQQATKGFIKGGI---NRILLATDGDFNVGI 327 S+ + + L A+G T L+ A ++ + +++ TDG + Sbjct: 451 -SNLSRARQFVSRLQADGGTEMALALDAALPKSLGSVSPDAVQPLRQVIFMTDGS----V 505 Query: 328 DDPKSIESMVKKQRESGVTLSTFGVGNSNYNEAMMVRIADVGNGNYSYIDTLSEAQKVLN 387 + +++ +++ Q L T G+G S N M R A++G G ++YI + E ++ Sbjct: 506 GNEQALFDLIRYQIGES-RLFTVGIG-SAPNSHFMQRAAELGRGTFTYIGKVDEVDAKIS 563 Query: 388 SEMRQMLITVAKDVKAQIEFNPAWVTEY 415 + + ++ V D+ Q+ ++ V +Y Sbjct: 564 ALLSKIQYPVLTDI--QVRYDDGSVPDY 589 >UniRef50_B0TQ23 LPXTG-motif cell wall anchor domain n=1 Tax=Shewanella halifaxensis HAW-EB4 RepID=B0TQ23_SHEHH Length = 850 Score = 50.4 bits (119), Expect = 2e-04, Method: Compositional matrix adjust. Identities = 61/231 (26%), Positives = 101/231 (43%), Gaps = 41/231 (17%) Query: 214 ASNLVFLIDTSGSMISDERLPLIQ--SSLKLLVKELREQDNIAIVTYAGD----SRIALP 267 A LV +IDTSGSM D +IQ S+LK + LR QD+ ++ + SR +P Sbjct: 454 ARELVLVIDTSGSMSGDA---IIQAKSALKYALAGLRPQDSFNVLQFNSTVERWSRHVMP 510 Query: 268 SISGSHKAEINAAIDSLDAEGSTNGGAGLELA--------------------YQQATKGF 307 + + + I+ L A+G T L+ A YQ + + Sbjct: 511 A-TAINLGRAQNYINGLQADGGTEMSLALDAALTKLDNDRGHNSKPVHDDDRYQSSNETL 569 Query: 308 IKGG---INRILLATDGDFNVGIDDPKSIESMVKKQRESGVTLSTFGVGNSNYNEAMMVR 364 + + ++L TDG ++ + E + + ES L T G+G++ N M R Sbjct: 570 EQSAATPLRQVLFITDGAV---ANESRLFEQIKNQLGES--RLFTIGIGSAP-NAHFMQR 623 Query: 365 IADVGNGNYSYIDTLSEAQKVLNSEMRQMLITVAKDVKAQIEFNPAWVTEY 415 A+VG G Y+YI L E + + S + ++ DV ++ F+ V +Y Sbjct: 624 AAEVGRGTYTYIGKLDEVNQKVVSLLEKIEKPQVTDV--ELHFSDGSVPDY 672 >UniRef50_C3Y4Z7 Putative uncharacterized protein n=1 Tax=Branchiostoma floridae RepID=C3Y4Z7_BRAFL Length = 1236 Score = 50.1 bits (118), Expect = 3e-04, Method: Compositional matrix adjust. Identities = 57/194 (29%), Positives = 87/194 (44%), Gaps = 21/194 (10%) Query: 192 NEQRTLLKVDI-LAKDRKSEELPAS-----NLVFLIDTSGSM---ISDERLPLIQSS--- 239 N+ R +KV + L+ D KS E V +ID S SM I ++L LIQ Sbjct: 601 NQVRDKMKVLLELSADEKSLEASGHQRTPLRFVAVIDESYSMDDRIGRDKLTLIQRMQIF 660 Query: 240 LKLLVKELREQDNIAIVTYAGDSRIALP--SISGSHKAEINAAIDSLDAEGSTNGGAGLE 297 +L+ K+ +++D + IVT+A D+++ LP + S + I ++ G TN GL Sbjct: 661 AELMAKDFKDEDQMGIVTFANDAKVVLPMTRMDSSGRDSALEKIQNISTRGQTNLSDGLL 720 Query: 298 LAYQQATKGFIKGGINRILLATDGDFNVGIDDPKSIESMVKKQRES---GVTL--STFGV 352 A N I+L TDG N GI D + + GV L STF + Sbjct: 721 SAISMFKGSSGSDFHNGIILFTDGQANQGIIDAAELVQEYNSKMAGLGEGVCLPISTFTI 780 Query: 353 GNSNYNEAMMVRIA 366 G +Y ++ +A Sbjct: 781 G--DYRPKLLCEVA 792 >UniRef50_C1GWG1 von Willebrand factor type A domain containing protein n=1 Tax=Paracoccidioides brasiliensis Pb01 RepID=C1GWG1_PARBA Length = 773 Score = 49.7 bits (117), Expect = 3e-04, Method: Compositional matrix adjust. Identities = 49/198 (24%), Positives = 93/198 (46%), Gaps = 27/198 (13%) Query: 203 LAKDRKSEELPASNLVFLIDTSGSM-------ISDER----------LPLIQSSLKLLVK 245 L ++ +P ++V ID SGSM +DE L L + + + +++ Sbjct: 63 LHPEKDIRHVPC-DIVLCIDVSGSMQLSAPLPTTDESGKREETGLSVLDLTKHAARTIIE 121 Query: 246 ELREQDNIAIVTYAGDSRIA--LPSISGSHKAEINAAIDSLDAEGSTNGGAGLELAYQQA 303 L E D + +VT++ D+ +A + + ++K A+++L STN GL+L Sbjct: 122 TLNENDRLGVVTFSNDAEVAYKISHMDDTNKKAALEAVEALQPLASTNLWHGLKLGLSVL 181 Query: 304 TKGFIKG-GINRILLATDGDFNVGIDD----PKSIESMVKKQRESGVTLSTFGVGNSNYN 358 K ++ + + + TDG N PK + ++++Q++ + TFG G + Sbjct: 182 GKVDLRPQNVQALYVLTDGQPNHMCPRQGYVPK-LRPILERQKDRLPLIHTFGFG-YDIR 239 Query: 359 EAMMVRIADVGNGNYSYI 376 ++ IA+VG G YS+I Sbjct: 240 SGLLQSIAEVGGGTYSFI 257 >UniRef50_B8G7Y1 von Willebrand factor type A n=3 Tax=Chloroflexus RepID=B8G7Y1_CHLAD Length = 914 Score = 49.7 bits (117), Expect = 3e-04, Method: Compositional matrix adjust. Identities = 51/185 (27%), Positives = 83/185 (44%), Gaps = 15/185 (8%) Query: 210 EELPASNLVFLIDTSGSM---ISDER--LPLIQSSLKLLVKELREQDNIAIVTYAGDSRI 264 EE P LV +ID SGSM + D R L L + ++ + L ++D IA++ + + Sbjct: 406 EERPDLALVLVIDRSGSMRELVDDGRTQLDLAREAVYQASRGLTQRDQIALIAFDSIADT 465 Query: 265 ALPSISGSHKAEINAAIDSLDAEGSTNGGAGLELAYQQATKGFIKGGINRILLATDGDFN 324 LP I A+ L A G TN +G+ LA + T + I ++L TDG Sbjct: 466 LLPLQPLPGLFTIEDALSRLVAGGGTNIRSGIALAAE--TIATSQARIRHVILLTDGVSE 523 Query: 325 VGIDDPKSIESMVKKQRESGVTLSTFGVGNSNYNEAMMVRIADVGNGNYSYIDTLSEAQK 384 D +V R G+T+S +G + + R+A +G G Y + + + + Sbjct: 524 TEYAD------LVADLRAQGITVSAIAIGLD--TDPALERVAQIGGGKYYLVQRVPDLPQ 575 Query: 385 VLNSE 389 V+ E Sbjct: 576 VVLEE 580 >UniRef50_B5YMD8 Predicted protein n=1 Tax=Thalassiosira pseudonana CCMP1335 RepID=B5YMD8_THAPS Length = 868 Score = 49.7 bits (117), Expect = 3e-04, Method: Compositional matrix adjust. Identities = 46/147 (31%), Positives = 74/147 (50%), Gaps = 10/147 (6%) Query: 216 NLVFLIDTSGSMISDERLPLIQSSLKLLVKELREQDNIAIVTYAGDSRIALP--SISGSH 273 ++V +D SGSM E+L L + +L LL++EL D A+++++ D+ I +P ++ + Sbjct: 138 DIVVALDVSGSM-RVEKLDLCKETLHLLLRELHHDDRFALISFSEDAVIEVPMQKVNERN 196 Query: 274 KAEINAAIDSLDAEGSTNGGAGLELAYQQATKGFIKGGINRILLATDGDFNVG----IDD 329 K + AID L +G TN + + LA Q + + L TDG+ N G ID Sbjct: 197 KQQALHAIDRLSVKGRTNIASAVSLAAQVVNGVAEPNKVRSVFLLTDGNANTGYTEAIDL 256 Query: 330 PKSIESMVKKQRESG---VTLSTFGVG 353 K V+ R ++L TFG G Sbjct: 257 VKLTSIFVEANRNPHTPPISLHTFGYG 283 >UniRef50_D2R2I7 von Willebrand factor type A n=2 Tax=Planctomycetaceae RepID=D2R2I7_9PLAN Length = 786 Score = 49.7 bits (117), Expect = 3e-04, Method: Compositional matrix adjust. Identities = 54/208 (25%), Positives = 91/208 (43%), Gaps = 18/208 (8%) Query: 211 ELPASNLVFLIDTSGSMISDERLPLIQSSLKLLVKELREQDNIAIVTYAGDSRIA----- 265 +L ++F++D SGSM +++ + +++ ++ L E D IV Y DS + Sbjct: 303 DLTKKTVIFVVDRSGSM-QGKKIEQAREAMRYVLNNLHEGDTFNIVAY--DSTVESFKPE 359 Query: 266 LPSISGSHKAEINAAIDSLDAEGSTNGGAGLELAYQQATKGFIKGGINRILLATDGDFNV 325 L + + A +D L A GSTN L+ A+ T N IL TDG Sbjct: 360 LQKFDDATRKSALAYVDGLYAGGSTNISGALDSAFAMLTGS---DRPNYILFLTDGLPTA 416 Query: 326 GIDDPKSIESMVKKQRESGVTLSTFGVGNSNYNEAMMVRIADVGNGNYSYI---DTLSEA 382 G + I + K++ + FGVG + N ++ R++ G Y+ + L + Sbjct: 417 GETNEGKIVELAKQKNVHRARMINFGVG-YDVNSRLLDRMSRENFGQSQYVRPDENLEAS 475 Query: 383 QKVLNSEMRQMLITVAKDVKAQIEFNPA 410 L S+M ++T DVK I+ A Sbjct: 476 VSRLYSKMSSPVLT---DVKVSIDIEGA 500 >UniRef50_B9ML47 YD repeat protein n=1 Tax=Anaerocellum thermophilum DSM 6725 RepID=B9ML47_ANATD Length = 3027 Score = 49.7 bits (117), Expect = 3e-04, Method: Compositional matrix adjust. Identities = 55/229 (24%), Positives = 108/229 (47%), Gaps = 27/229 (11%) Query: 211 ELPASNLVFLIDTSGSMISDERLPLIQSSLKLLVKELREQDN-IAIVTYAGDSRIALPSI 269 +L ++VF++D SGSM S++ + K ++ + E +N + +V + DS +++ S Sbjct: 763 DLSKVDIVFVLDNSGSMSSNDPNYYRIEATKKFIQNIDELNNRVGLVDF--DSSVSVRSN 820 Query: 270 SGSHKAEINAAIDSLD-AEGSTNGGAGLELAY----QQATKGFIKGGINRILLATDGDFN 324 S K+++ A++++ GSTN G GL+ A Q+ +K I+L +DG N Sbjct: 821 LTSDKSKLLQALNAMRWTGGSTNIGGGLKAALGLFDQEQSKKI-------IVLLSDGYHN 873 Query: 325 VGIDDPKSIESMVKKQRESGVTLSTFGVGNSNYNEAMMVRIADVGNGNYSYIDTLS---- 380 GI + ++K++ + ++T +G + + ++ IAD G Y Y+D Sbjct: 874 TGIHPNDVLPELIKQE----IVVNTIALG-KDCDRELLHDIADKTKGGYFYVDNTGGLSQ 928 Query: 381 ---EAQKVLNSEMRQMLITVAKDVKAQIEFNPAWVTEYRQIGYEKRQLR 426 + Q L E IT+ K+ + ++ EY +G + + Sbjct: 929 EDVDKQIELIYEKLTKWITLQKEAEKNLKPQEVLSIEYNDVGLDNEEFH 977 >UniRef50_Q2QSE5 Os12g0431700 protein n=3 Tax=Oryza sativa RepID=Q2QSE5_ORYSJ Length = 524 Score = 49.3 bits (116), Expect = 4e-04, Method: Compositional matrix adjust. Identities = 44/191 (23%), Positives = 89/191 (46%), Gaps = 12/191 (6%) Query: 216 NLVFLIDTSGSMISDERLPLIQSSLKLLVKELREQDNIAIVTYAGDSR--IALPSISGSH 273 +LV ++D SGSM ++ ++ +L+ ++ +L D ++IVT+ ++ L +++ Sbjct: 62 DLVAVVDVSGSM-RGHKIESVKKALQFVIMKLTPVDRLSIVTFESSAKRLTKLRAMTQDF 120 Query: 274 KAEINAAIDSLDAEGSTNGGAGLELAYQQ-ATKGFIKGGINRILLATDGDFNVGID-DPK 331 + E++ + SL A G T+ AGL+L A + F + I L +DG DP Sbjct: 121 RGELDGIVKSLIANGGTDIKAGLDLGLAVLADRVFTESRTANIFLMSDGKLEGKTSGDPT 180 Query: 332 SIESMVKKQRESGVTLSTFGVGNSNYNEAMMVRIADVGNGNYSYIDTLSEAQKVLNSEMR 391 + V++ TFG G+ ++ + + G YS + + + + Sbjct: 181 QVNP-------GEVSVYTFGFGHGTDHQLLTDIAKNSPGGTYSTVPDGTNLSAPFATLLG 233 Query: 392 QMLITVAKDVK 402 ++ VA+DV+ Sbjct: 234 GLVTVVAQDVR 244 >UniRef50_B9RR85 Inter-alpha-trypsin inhibitor heavy chain, putative n=1 Tax=Ricinus communis RepID=B9RR85_RICCO Length = 755 Score = 49.3 bits (116), Expect = 4e-04, Method: Compositional matrix adjust. Identities = 44/189 (23%), Positives = 91/189 (48%), Gaps = 16/189 (8%) Query: 193 EQRTLLKVDILAKDRKSEELPASNLVFLIDTSGSMISDERLPL--IQSSLKLLVKELREQ 250 +QR + + + D+ + ++ +VF++D SGSM E PL +++++ + +L + Sbjct: 303 DQRDMFYLYLFPGDQPNMKVFRKEIVFIVDISGSM---EGKPLEGMKNAMSGALAKLNPK 359 Query: 251 DNIAIVTYAGDSRIALPSISGSHKAEINAAIDSLDAEGSTNGGAGLELAYQQATKGF--I 308 D+ I+ + G++ + + + + + A++ ++ GG + + QA + Sbjct: 360 DSFNIIAFNGETYLFSSLMELATEKTVERAVEWMNLNFIAGGGTNISVPLNQAMEMVSNT 419 Query: 309 KGGINRILLATDGDFNVGIDDPKSI-ESMVKKQRESGVT---LSTFGVGNSNYNEAMMVR 364 +G + I L TDG ++D + I +SM K R G + TFG+G + N + Sbjct: 420 QGSLPVIFLVTDG----AVEDERHICDSMKKYVRGKGAICPRIYTFGIG-TYCNHYFLRM 474 Query: 365 IADVGNGNY 373 +A V G Y Sbjct: 475 LATVCRGQY 483 >UniRef50_UPI0001745E25 hypothetical protein VspiD_17020 n=1 Tax=Verrucomicrobium spinosum DSM 4136 RepID=UPI0001745E25 Length = 652 Score = 49.3 bits (116), Expect = 4e-04, Method: Compositional matrix adjust. Identities = 54/199 (27%), Positives = 93/199 (46%), Gaps = 21/199 (10%) Query: 208 KSEELPASNLVFLIDTSGSMISDERLPLIQSSLKLLVKELREQ---DNIAIVTYAGDSRI 264 + +E N+ IDTS SM++D+ P KL ++L E+ D + ++ +AG S + Sbjct: 83 RVDERSGRNIFIAIDTSKSMLADDVSPNRLGRAKLAAQDLLERLPNDRVGVIAFAGRSYL 142 Query: 265 ALPSISGSHKAEINAAIDSLDAEGSTNGGAGLELAYQQATKGF--IKGGINRILLATDGD 322 P ++ H+A I I SLD GG+ + A Q A + +KG + ++L TDG Sbjct: 143 QAP-LTNDHEAVIE-CIQSLDHTTIPRGGSSIASAIQLAVETIDKVKGREHGMVLFTDGQ 200 Query: 323 FNVGIDDPKSIESMVKKQRESGVTLSTFGVGNSNYNEAMMVRIADVGN-GNYSYIDTLSE 381 + ++ + + + G+ + GVG + E ++ D N G Y E Sbjct: 201 -----ETDEATLTAARMAAQKGLIVIPVGVGTT---EGALIPDPDEQNTGGY----LRDE 248 Query: 382 AQKVLNSEMRQ-MLITVAK 399 V+NS + +L+ VAK Sbjct: 249 NGNVINSRLEAPLLLEVAK 267 >UniRef50_UPI0000E4A663 PREDICTED: similar to calcium activated chloride channel 1 precursor n=2 Tax=Strongylocentrotus purpuratus RepID=UPI0000E4A663 Length = 1245 Score = 49.3 bits (116), Expect = 5e-04, Method: Compositional matrix adjust. Identities = 45/154 (29%), Positives = 72/154 (46%), Gaps = 16/154 (10%) Query: 217 LVFLIDTSGSMISDERLPLIQSSLKLLVKELREQDNIAIVTYAGD--SRIALPSISGSHK 274 +V ++DTSGSM + R+ + S+ V + + +I IVT+ G +R AL I+ Sbjct: 525 VVLVLDTSGSMGTSNRIDKVNSAATAFVNLVDDGISIGIVTFTGSPTTRHALTQINTQAD 584 Query: 275 AEINAAIDSLDAEGSTNGGAGLELAYQ---QATKGFIKGGINRILLATDGDFNVGIDDPK 331 + I L A G T G GLE + G GGI I+L TDG Sbjct: 585 RDSLRDIFQLTASGGTCIGCGLEQGLEVLMAHPSGSADGGI--IVLMTDG-------QDS 635 Query: 332 SIESMVKKQ--RESGVTLSTFGVGNSNYNEAMMV 363 I++ + +Q ++ GV ++T +G Y E ++ Sbjct: 636 GIQNHIIRQTLQDMGVRVNTVAIGEDAYGELSLI 669 >UniRef50_A9B057 von Willebrand factor type A n=3 Tax=Chloroflexi (class) RepID=A9B057_HERA2 Length = 562 Score = 48.9 bits (115), Expect = 5e-04, Method: Compositional matrix adjust. Identities = 40/162 (24%), Positives = 82/162 (50%), Gaps = 4/162 (2%) Query: 216 NLVFLIDTSGSMISDERLPLIQSSLKLLVKELREQDNIAIVTYAGD-SRIALPSISGSHK 274 ++ +IDTSGSM + RL +++L + +QDN+ + ++ + + ++ S G + Sbjct: 385 DVALIIDTSGSMRQENRLREAKTALGDFIDIFADQDNVQVTIFSTNATELSDLSPIGPKR 444 Query: 275 AEINAAIDSLDAEGSTNGGAGLELAYQQATKGFIKGGINRILLATDGDFNVGIDDPKSIE 334 A+++ ID L A+G T + + Y + I +++ TDG+ + + Sbjct: 445 ADLHTRIDGLVADGETRLYSTIGEVYTDIQQQTEVQRIRALVVLTDGEDTASSLSLEQLN 504 Query: 335 SMVKKQRESGVTLSTFGVG-NSNYNEAMMVRIADVGNGNYSY 375 + +Q ESG ++ F + S+ N+ ++ RIA++ G SY Sbjct: 505 EQI-RQDESGTSIKIFTIAYGSDANQEVLQRIAEI-TGAKSY 544 >UniRef50_A3QDW1 Vault protein inter-alpha-trypsin domain protein n=1 Tax=Shewanella loihica PV-4 RepID=A3QDW1_SHELP Length = 776 Score = 48.9 bits (115), Expect = 5e-04, Method: Compositional matrix adjust. Identities = 50/187 (26%), Positives = 80/187 (42%), Gaps = 18/187 (9%) Query: 205 KDRKSEELPASNLVFLIDTSGSMISDERLPLIQSSLKLLVKELREQDNIAIVTYAGDSRI 264 +D+ LP L +IDTSGSM D + +S++ + L QD ++ + R Sbjct: 393 QDKARVRLP-RELTLVIDTSGSMTGDS-IAQAKSAILNALAGLGSQDTFNVIAFDSSVRS 450 Query: 265 ALP---SISGSHKAEINAAIDSLDAEGSTNGGAGLELAYQQATKGF--IKGGIN-----R 314 P S + ++ + N + SL+A+G T L A Q G I + + Sbjct: 451 LSPVALSATAANLGKANLFVQSLEADGGTEMAPALLRALSQPESGVSSISSAVKPERLKQ 510 Query: 315 ILLATDGDFNVGIDDPKSIESMVKKQRESGVTLSTFGVGNSNYNEAMMVRIADVGNGNYS 374 ++ TDG I + + +QR L T G+G + N M R A G G Y+ Sbjct: 511 VVFITDGAVGNEASLFALIAANIGRQR-----LFTVGIGAAP-NGYFMERAARAGRGTYT 564 Query: 375 YIDTLSE 381 Y+ +SE Sbjct: 565 YVGKISE 571 >UniRef50_A9AXC2 von Willebrand factor type A n=6 Tax=Chloroflexi (class) RepID=A9AXC2_HERA2 Length = 421 Score = 48.9 bits (115), Expect = 5e-04, Method: Compositional matrix adjust. Identities = 61/277 (22%), Positives = 119/277 (42%), Gaps = 13/277 (4%) Query: 188 PAPWNEQRTLLKVDILAKDRKSEELPASNLVFLIDTSGSMISDERLPLIQSSLKLLVKEL 247 PA +Q L +DI A + N+ F++D SGSM D ++ ++ + + + + Sbjct: 17 PALQTQQVVYLLLDITATPAVAHVQMPVNVSFVLDHSGSMKGD-KMRCVREATQRALGLM 75 Query: 248 REQDNIAIVTYAGDSRIALPSISGSHKAEINAAIDSLDAEGSTNGGAGLELAYQQATKGF 307 QD +++V + + + + A + A + + G T LE A + + Sbjct: 76 GPQDIVSVVIFDHRRETIISAQPVRNVAALQAEVGKIKDAGGTKIAPALEAALNEIRRSQ 135 Query: 308 IKGGINRILLATDGDFNVGIDDPKSIESMVKKQRESGVTLSTFGVGNSNYNEAMMVRIAD 367 I+RI+L TDG D + E + K + V L+ GVG+ ++NE +++ +A+ Sbjct: 136 NANTISRIILLTDGQTEGERDCLRLAEEIGK----ASVPLTALGVGD-DWNEDLLIEMAN 190 Query: 368 VGNGNYSYIDTLSEAQKVLNSEMRQMLITVAKDVKAQIEFNPAWVTEYR---QIGYEKRQ 424 G Y ++ ++Q V ++ + F E R Q+ +Q Sbjct: 191 RSGGVAEYFSNPNDIASFFQGAVQQAQSAVVQNSALTLRFVQG--VEPRALWQVTPLIQQ 248 Query: 425 LRVEHFNND--NVDAGDIGAGKHITLLFELTLNGQKA 459 L ++ V GDI +H +L E+ ++ ++A Sbjct: 249 LPYRPISDRAVGVSLGDISKDEHRMVLIEMLVDPKQA 285 >UniRef50_Q25545 Putative uncharacterized protein (Fragment) n=1 Tax=Naegleria fowleri RepID=Q25545_NAEFO Length = 357 Score = 48.9 bits (115), Expect = 5e-04, Method: Compositional matrix adjust. Identities = 47/169 (27%), Positives = 79/169 (46%), Gaps = 14/169 (8%) Query: 295 GLELAYQQATKGFIKGGINRILLATDGDFNVGIDDPKSIES----MVKKQRESGVTLSTF 350 GL L Q+ T I ILL TDG N GI + I S + ++ +T TF Sbjct: 27 GLRLIKQRTTCN----EITSILLFTDGLANEGITNTSEIVSKMNTTIHEEIRKQITCFTF 82 Query: 351 GVGNSNYNEAMMVRIADVGNGNYSYIDTLSEAQKVLNSEMRQMLITVAKDVKAQIEFNPA 410 G G S+ + M+ IA GNG Y +++ + + K + + ++ VA+++K +I P Sbjct: 83 GFG-SDTDANMLTSIAQAGNGLYYFLNNVDDIPKAFGNVIGGLVSVVAQNIKVKIM--PN 139 Query: 411 WVTEYRQIGYEKRQLRVEHFNNDNVDAGDIGAGKHITLLFELT---LNG 456 + +++ R+ + + GDI + + L+F LT LNG Sbjct: 140 SNVKLKKVFTTFRKTDLSGGTGCEIAVGDIYSEEKKDLVFVLTVPALNG 188 >UniRef50_B8HSI1 von Willebrand factor type A n=8 Tax=Cyanobacteria RepID=B8HSI1_CYAP4 Length = 589 Score = 48.9 bits (115), Expect = 5e-04, Method: Compositional matrix adjust. Identities = 51/177 (28%), Positives = 84/177 (47%), Gaps = 18/177 (10%) Query: 215 SNLVFLIDTSGSMISDERLPLIQSSLKLLVKELREQDNIAIVTYAGDSRIALPSISGSHK 274 S +V ++DTSGSM + E+L +Q++L + L QD +A++ ++ D + P + Sbjct: 408 SLVVIVVDTSGSM-AGEKLANVQNTLNTYINGLSPQDQVALMRFSSD--VGTPVVVDGTP 464 Query: 275 AEINAA---IDSLDAEGSTNGGAGLELAYQQATKGFIKGGINRILLATDG-DFNVGIDDP 330 A + I SL A G+T+ A T+ IN +L+ TDG D I Sbjct: 465 AGRDRGLQFISSLRANGNTHLYDATLAARNWLTQNLRSDAINAVLVLTDGEDTGSAI--- 521 Query: 331 KSIESMVKKQRESGVT----LSTFGVG---NSNYNEAMMVRIADVGNGNYSYIDTLS 380 S+E + + ++SG +S F VG ++ + +IA+V G YS D S Sbjct: 522 -SLEQLGPELQKSGFNSDQRISFFTVGYGEEGEFDPQALQQIANVNGGYYSKGDPAS 577 >UniRef50_Q237Q6 von Willebrand factor type A domain containing protein n=1 Tax=Tetrahymena thermophila SB210 RepID=Q237Q6_TETTH Length = 713 Score = 48.9 bits (115), Expect = 6e-04, Method: Compositional matrix adjust. Identities = 53/234 (22%), Positives = 109/234 (46%), Gaps = 30/234 (12%) Query: 198 LKVDILAKDRKSEELPASNLVFLIDTSGSM--------------ISDERLPLIQSSLKLL 243 +++ IL+ KS+ ++++ ++D SGSM + L +++ SL + Sbjct: 49 VRIQILSPKGKSK--VSNSICCVVDVSGSMGSRAVTKQSGGNSELGYSVLDIVKHSLNTI 106 Query: 244 VKELREQDNIAIVTYAGDSRIALPSISGSHKAEINAAIDSLDA---EGSTNGGAGLELAY 300 V+ L E D ++VT++ +S++ + ++ I +++D ++ + STN AG+E Sbjct: 107 VQNLDEGDEFSMVTFSDNSKLVC-NYQQMTESNIKSSVDLINQCQPDASTNIWAGIEQGL 165 Query: 301 QQATKGFIKGGINRILLATDGDFNVGIDDPKSIESMVKKQRESGV-----TLSTFGVGNS 355 +Q K ++++ TDG NV + P+ I + + + +++TFG G Sbjct: 166 EQMQNDSNKNKNQQLIVLTDGQPNV--NPPRGILTTLNNFYNKNIISPKPSINTFGFG-Y 222 Query: 356 NYNEAMMVRIADVGNGNYSYIDTLSEAQKVLNSEMRQMLITVAKDVKAQIEFNP 409 + ++ IA G YS+I S + + + M T A + A + F P Sbjct: 223 YLDSHLLFNIAQDCQGIYSFIPDSSFVGTIFTNSIASMQSTFATN--AVLVFKP 274 >UniRef50_C5CDX4 von Willebrand factor type A n=1 Tax=Kosmotoga olearia TBF 19.5.1 RepID=C5CDX4_KOSOT Length = 730 Score = 48.5 bits (114), Expect = 6e-04, Method: Compositional matrix adjust. Identities = 60/211 (28%), Positives = 101/211 (47%), Gaps = 16/211 (7%) Query: 203 LAKDRKSEELPASNLVFLIDTSGSMISDERLPLIQSSLKLLVKELREQDNIAIVTYAGD- 261 L R+ E + ++VF++D SGSM S +++ + +L +++ L E D +I+T+ + Sbjct: 262 LVPPREPERIIPKDIVFILDISGSM-SGQKIEKAKLALLQVLQMLHEGDRFSIITFNNEV 320 Query: 262 SRIALPSISGSHKAEINAAIDSLDAEGSTNGG----AGLELAYQQATKGFIKGGINRILL 317 + + + S + E A+ + A G TN G+E+ Q+T K +L Sbjct: 321 NNLTERLLPFSDRTEWYPAVKQIMAGGMTNIHDALLEGIEVLGTQSTDDRYK----VVLF 376 Query: 318 ATDGDFNVGIDDPKS-IESMVKKQRESGVTLSTFGVGNSNYNEAMMVRIADVGNGNYSYI 376 TDG GI D + I K + V L FGVG + N ++ +A+ G G YI Sbjct: 377 LTDGAPTEGITDIGTIIRDSTKLAKVRDVHLFVFGVG-YDVNAELLDELAEKGGGKVKYI 435 Query: 377 DTLSEA-QKVLNSEMRQMLIT-VAKDVKAQI 405 E +KVL E+ +M+ T V +V +I Sbjct: 436 VENEEIDEKVL--ELYRMIETPVMSNVHLEI 464 >UniRef50_A8ULL3 Putative uncharacterized protein n=1 Tax=Flavobacteriales bacterium ALC-1 RepID=A8ULL3_9FLAO Length = 200 Score = 48.5 bits (114), Expect = 6e-04, Method: Compositional matrix adjust. Identities = 29/102 (28%), Positives = 55/102 (53%) Query: 216 NLVFLIDTSGSMISDERLPLIQSSLKLLVKELREQDNIAIVTYAGDSRIALPSISGSHKA 275 N+ FLI+T + + E +++ + KLL K + E D I+IV Y+ + +AL + Sbjct: 53 NITFLIETYANNFNTEDKVILKQAFKLLSKRVTEDDLISIVAYSNFNGVALKQAEATDVK 112 Query: 276 EINAAIDSLDAEGSTNGGAGLELAYQQATKGFIKGGINRILL 317 ++ A++ L + T G+ELAY+ + FI+ N +++ Sbjct: 113 KLLYAVEHLKSSVKTFEEDGIELAYEFTKENFIEESENSVVM 154 >UniRef50_B4DPQ4 cDNA FLJ60769, highly similar to Inter-alpha-trypsin inhibitor heavy chain H3 n=11 Tax=Tetrapoda RepID=B4DPQ4_HUMAN Length = 698 Score = 48.5 bits (114), Expect = 6e-04, Method: Compositional matrix adjust. Identities = 41/152 (26%), Positives = 72/152 (47%), Gaps = 10/152 (6%) Query: 216 NLVFLIDTSGSMISDERLPLIQSSLKLLVKELREQDNIAIVTYAGDSRI---ALPSISGS 272 N+ F+ID SGSM + +L + +L ++++++E+D + ++GD L + Sbjct: 284 NVAFVIDISGSM-AGRKLEQTKEALLRILEDMKEEDYLNFTLFSGDVSTWKEHLVQATPE 342 Query: 273 HKAEINAAIDSLDAEGSTNGGAGLELAYQQATKGFIKGGI-----NRILLATDGDFNVGI 327 + E + S++ +G TN GL K + I + +++ TDGD NVG Sbjct: 343 NLQEARTFVKSMEDKGMTNINDGLLRGISMLNKAREEHRIPERSTSIVIMLTDGDANVGE 402 Query: 328 DDPKSIESMVKKQRESGVTLSTFGVGNS-NYN 358 P+ I+ V+ L G GN+ NYN Sbjct: 403 SRPEKIQENVRNAIGGKFPLYNLGFGNNLNYN 434 >UniRef50_D2R3Y3 von Willebrand factor type A n=1 Tax=Pirellula staleyi DSM 6068 RepID=D2R3Y3_9PLAN Length = 776 Score = 48.5 bits (114), Expect = 7e-04, Method: Compositional matrix adjust. Identities = 56/268 (20%), Positives = 110/268 (41%), Gaps = 21/268 (7%) Query: 213 PASNLVFLIDTSGSMISDERLPLIQSSLKLLVKELREQDNIAIVTYAGDSRIALPSISGS 272 P+ +++ +D S SM R+ ++S+L ++R+ D ++IV + S + + + S Sbjct: 379 PSRHMIVAVDVSSSMHRQGRMQQVRSALDKFTSQMRDGDQLSIVAFRDVSEVLVERATAS 438 Query: 273 HKAEINAAIDSLDAEGSTNGGAGLE----LAYQQATKGF-IKGGINRILLATDGDFNVGI 327 A +D TN +GL+ LA Q +++ TDG Sbjct: 439 EAQSAVAMLDLPVVVSGTNLASGLQQSLLLAMQAPGDATATPPSATSVVVITDGTPEWSH 498 Query: 328 DDPKSIESMVKKQRESGVTLSTFGVGNS----------NYNEAMMVRIADVGNGNYSYID 377 + + ++ + + + VGN+ + + +++ + G ID Sbjct: 499 ATVQQLHALAADAAQQRIEMHVALVGNNARAQLAAERDSLGTLALDKLSSLLAGEVHAID 558 Query: 378 TLSEAQKVLNSEMRQMLITVAKDVKAQIEFNPAWVTEYRQIGYEKRQLRVEHFNNDNVDA 437 + +L + +A + + +I+FNP V+ YR +G+ L V A Sbjct: 559 SSRNLYALLTDTLAGGSAVLASEARLRIDFNPQVVSAYRLLGHGATAL--ADVRPAEVSA 616 Query: 438 GDIGAGKHITLLFELTL---NGQKASID 462 +I AG+ +L EL L +GQ +S D Sbjct: 617 -EIRAGETAVVLVELWLAVDSGQSSSDD 643 >UniRef50_UPI000180C65C PREDICTED: similar to calcium channel, voltage-dependent, alpha 2/delta subunit 1 n=1 Tax=Ciona intestinalis RepID=UPI000180C65C Length = 1114 Score = 48.5 bits (114), Expect = 7e-04, Method: Compositional matrix adjust. Identities = 57/219 (26%), Positives = 91/219 (41%), Gaps = 33/219 (15%) Query: 184 YELAPA-PWNEQ------RTLLKVDILAKDRKSEELPASNLVFLIDTSGSMISDERLPLI 236 + PA PWN Q + KV K S + +++ +IDTSGS+I L LI Sbjct: 232 FRYFPAKPWNTQCLYHDLHDVTKVSWFVKGMTSPK----DVLIMIDTSGSIIGIT-LSLI 286 Query: 237 QSSLKLLVKELREQDNIAIVTYAGDSRIALPSI------SGSHKAEINAAIDSLDAEGST 290 Q+S+K L+ L E D I + + + PS + HK + +L S+ Sbjct: 287 QTSVKKLMSTLTENDFFNIFVFNNEPKFLQPSCPNLMQATPKHKQMAAGWLSNLTVHNSS 346 Query: 291 NGGAGLELAYQQATKGF--------IKGGINR-ILLATDGDFNVGIDDPKSIESMVKKQR 341 G + A++ T+ I+ G N ILL TDG G P + K Sbjct: 347 AFEKGFDFAFEILTQSNSLNTTHRPIRAGCNSAILLFTDG----GAAYPSQV--FKKWNL 400 Query: 342 ESGVTLSTFGVGNSNYNEAMMVRIADVGNGNYSYIDTLS 380 + V + T+ VG + + ++A G ++ I + S Sbjct: 401 DKEVRVFTYSVGKPFSSTTTLKQMACNNRGEFTAIPSYS 439 >UniRef50_Q7S708 Predicted protein n=1 Tax=Neurospora crassa RepID=Q7S708_NEUCR Length = 766 Score = 48.5 bits (114), Expect = 7e-04, Method: Compositional matrix adjust. Identities = 58/263 (22%), Positives = 108/263 (41%), Gaps = 45/263 (17%) Query: 179 PFAMRYELAPAPWNEQRTLLKVDILAKDRKSEELPASN-------LVFLIDTSGSMISDE 231 P A + E+ P P + LL+V R LP N +V ID SGSM +D Sbjct: 29 PVAPKLEIHPLPSHTSGLLLRV---IPPRSPPNLPDPNFHHVPCDIVLAIDVSGSMSADA 85 Query: 232 RLP---------------------LIQSSLKLLVKELREQDNIAIVTYAGDSRIALP--S 268 +P L++ + + +V L D + IVT++ ++++ P Sbjct: 86 PVPTTASADYTNEQPEHNGLSVLDLVKHAARTIVSTLNSSDRLGIVTFSTEAKVLQPLMP 145 Query: 269 ISGSHKAEINAAIDSLDAEGSTN--GGA--GLELAYQQATKGFIKGGINRILLATDGDFN 324 ++ +K + + + +TN GG GL+L Q+ G + +++ TDG N Sbjct: 146 MTALNKKKTERNLGGMQPFSATNLWGGIVEGLKLFDGQS------GRMPALMVLTDGMPN 199 Query: 325 VGIDDPKSIESMVKKQRESGVTLSTFGVGNSNYNEAMMVRIADVGNGNYSYIDTLSEAQK 384 + + + ++ + TFG G S ++ +A++G G YS+I Sbjct: 200 -HMCPAQGYVAKLRAMETLPAAIHTFGFGYS-LRSGLLKSVAEIGGGGYSFIPDAGMIGT 257 Query: 385 VLNSEMRQMLITVAKDVKAQIEF 407 V + + T A +V ++ + Sbjct: 258 VFVHSVANLQSTFANNVVLRLTY 280 >UniRef50_D2RSW3 von Willebrand factor type A n=1 Tax=Haloterrigena turkmenica DSM 5511 RepID=D2RSW3_9EURY Length = 1446 Score = 48.5 bits (114), Expect = 7e-04, Method: Compositional matrix adjust. Identities = 48/169 (28%), Positives = 83/169 (49%), Gaps = 16/169 (9%) Query: 212 LPASNLVFLIDTSGSMISDERLPLIQSSLKLLVKELREQDNIAIVTYAGDSRIALPSISG 271 + ++ VF+ D SGSM S + + K V L + + V YA + + P ++ Sbjct: 531 IETADFVFVNDESGSM-SGSPTHYAELAGKRFVGALTDSERAGRVGYASGANLDQP-LTT 588 Query: 272 SHKAEINAAIDSLDAEGSTNGGAGLELAYQQATKGFIKGGINR---ILLATDGDFNVGID 328 H A +N++++ L A G TN AGL + + +G NR ++L +DG Sbjct: 589 DHDA-VNSSLERLSASGGTNTRAGLRVGLNHLEE---EGWENRSAVMILLSDGKSG---S 641 Query: 329 DPKSIESMVKKQRESGVTLSTFGVGNSNYNEAMMVRIADVGNGNYSYID 377 DP + + E+GV +ST G+GN N NE + IA + G++ +++ Sbjct: 642 DPLPV---AEDAAEAGVEISTVGLGN-NINENELREIAAITGGDFYHVE 686 >UniRef50_A8H5J9 LPXTG-motif cell wall anchor domain n=1 Tax=Shewanella pealeana ATCC 700345 RepID=A8H5J9_SHEPA Length = 789 Score = 48.1 bits (113), Expect = 8e-04, Method: Compositional matrix adjust. Identities = 62/211 (29%), Positives = 98/211 (46%), Gaps = 38/211 (18%) Query: 202 ILAKDRKSEELPAS---NLVFLIDTSGSMISDERLPLIQ--SSLKLLVKELREQDNIAIV 256 +L + +E+ P+S L+ +IDTSGSM D +IQ ++LK + LR D IV Sbjct: 383 MLMPPQGAEQQPSSIHRELILVIDTSGSMSGDA---IIQAKTALKYALAGLRPTDKFNIV 439 Query: 257 TYAGD----SRIALPSISGSHKAEINAAIDSLDAEGST------NGGAGLELAYQQATKG 306 + D S +A+ S + + A+ I+ L+A G T N +E + T Sbjct: 440 QFNSDVDKWSGMAM-SATPYNLAQAQNYINRLEANGGTEMSIAINAALNIETVTDKETGT 498 Query: 307 FIKGG------INRILLATDGDFNVGIDDPKSIESMVKKQRESGV---TLSTFGVGNSNY 357 + + ++L TDG S ESM+ + E+ + L T G+G++ Sbjct: 499 ELDNNDLGSNLLRQVLFITDGAV--------SNESMLFELIEAQLGDSRLFTIGIGSAP- 549 Query: 358 NEAMMVRIADVGNGNYSYIDTLSEA-QKVLN 387 N M R A +G G Y+YI L E QKV++ Sbjct: 550 NAHFMQRAAQLGRGTYTYIGKLDEVNQKVVS 580 >UniRef50_Q09DT2 Inter-alpha-trypsin inhibitor family heavy chain-related protein-hypothetical secreted or membrane-associated protein containing vWFA domain n=1 Tax=Stigmatella aurantiaca DW4/3-1 RepID=Q09DT2_STIAU Length = 843 Score = 48.1 bits (113), Expect = 8e-04, Method: Compositional matrix adjust. Identities = 44/163 (26%), Positives = 79/163 (48%), Gaps = 17/163 (10%) Query: 217 LVFLIDTSGSMISDERLPLIQSSLKLLVKELREQDNIAIVTYAGDSRIALPSISGSHKAE 276 +VF++DTSGSM E LP Q +L+L ++ LRE D I+ + + P + + Sbjct: 249 VVFVVDTSGSM-EGESLPQAQGALRLCLRHLREGDRFNIIAFDTSFQSFAPQPAVFTQKT 307 Query: 277 INAA---IDSLDAEGSTNGGAGLELAYQQATKGFIKGGINRILLATDGDFNVGIDDPKSI 333 + A + +L A G T + A Q A +G ++L TDG + + I Sbjct: 308 LEQADRWVAALRANGGTELLQPMLAAVQAAPEGV-------VVLLTDGQ----VGNEAEI 356 Query: 334 ESMVKKQRESGVTLSTFGVGNSNYNEAMMVRIADVGNGNYSYI 376 V + R++ + +FG+G +N ++A++ +A +G +I Sbjct: 357 LQAVLRARKT-ARIYSFGIG-TNVSDALLKDMARQTDGAVEFI 397 >UniRef50_A7RVQ6 Predicted protein n=1 Tax=Nematostella vectensis RepID=A7RVQ6_NEMVE Length = 419 Score = 48.1 bits (113), Expect = 0.001, Method: Compositional matrix adjust. Identities = 51/184 (27%), Positives = 84/184 (45%), Gaps = 22/184 (11%) Query: 184 YELAPAPWNEQRTLLKVDILAKDRKSEELPASNLVFLIDTSGSM----ISDERLPLIQSS 239 ++ A A W ++ +D L K R S E ++L+FL+DTSGS+ + E+ I++ Sbjct: 17 HQAAAASWLDK----SLDELKKLRGSVENRKADLLFLLDTSGSLSLSNFNTEK-KFIRNL 71 Query: 240 LKLLVKELREQDNIAIVTYAGDSRIALPSISGSHKAEINAAIDSLDAE-----GSTNGGA 294 L ++ + + I+T+ D +P IS +H+ + + A G TN Sbjct: 72 LNVIAVGF-DATRVEIITFGSDVNRRVPFISEAHEKDTKCTFNEKFANVVHEWGMTNMRG 130 Query: 295 GLELAYQQATKGFIKGG-----INRILLATDGDFNVGIDDPKSIESMVKKQRESGVTLST 349 E AY + KG G ++L TDG +N +P + + RE GV + Sbjct: 131 AFEKAY-EVCKGTWSGKKRLNIKTTVILITDGHWNWPWQNPDPVPKAQQLIRE-GVEILA 188 Query: 350 FGVG 353 FGVG Sbjct: 189 FGVG 192 >UniRef50_A9B6J8 von Willebrand factor type A n=1 Tax=Herpetosiphon aurantiacus ATCC 23779 RepID=A9B6J8_HERA2 Length = 950 Score = 48.1 bits (113), Expect = 0.001, Method: Compositional matrix adjust. Identities = 51/220 (23%), Positives = 93/220 (42%), Gaps = 28/220 (12%) Query: 207 RKSEELPASNLVFLIDTSGSMIS---------------DERLPLIQSSLKLLVKELREQD 251 R ++ P LVF+ID SGSM + ++ + + ++ L + D Sbjct: 400 RNRQQRPDIALVFIIDKSGSMDACHCNGGDMAAREGGGTRKIDIAKEAVAQAAAVLGKDD 459 Query: 252 NIAIVTYAGDSRIALPSISGSHKAEINAAIDSLDAEGSTNGGAGLELAYQQATKGFIKGG 311 + +VT+ + + + ++ AA+ + G TN +G+ AY+Q + K Sbjct: 460 KLGVVTFDDSAHWTIELDKVPSQDDVVAALAPVPPSGQTNVVSGMNAAYEQLRQSDAK-- 517 Query: 312 INRILLATDGDFNVGIDDPKSIESMVKKQRESGVTLSTFGVGNSNYNEAMMVRIADVGNG 371 I +L TDG + I S+ + + G+TLS GN + N + R A++G G Sbjct: 518 IKHAILLTDGWGHA-----TDIGSIAENMNKDGITLSVVAAGNGSDN--ALQRYAELGGG 570 Query: 372 NYSYIDTLSEAQKVLNSEMRQMLITVAKDVKAQIEFNPAW 411 Y + E ++ E Q + T + +F PA+ Sbjct: 571 RYYPARVMEEVPQIFLQETIQAVGTYI----VEEQFTPAY 606 >UniRef50_P79263 Inter-alpha-trypsin inhibitor heavy chain H4 n=3 Tax=Theria RepID=ITIH4_PIG Length = 921 Score = 47.8 bits (112), Expect = 0.001, Method: Compositional matrix adjust. Identities = 41/147 (27%), Positives = 71/147 (48%), Gaps = 13/147 (8%) Query: 216 NLVFLIDTSGSMISDERLPLIQSSLKLLVK---ELREQDNIAIVTYAGDS-RIALPSISG 271 N++F+IDTSGSM R IQ + + L+K +L +D +V+++G++ R + S Sbjct: 272 NVIFVIDTSGSM----RGRKIQQTREALIKILGDLGSRDQFNLVSFSGEAPRRRAVAASA 327 Query: 272 SHKAEINAAIDSLDAEGSTNGGAGLELAYQQATKG-----FIKGGINRILLATDGDFNVG 326 + E + + A+G TN + +A Q + + I+L TDGD VG Sbjct: 328 ENVEEAKSYAAEIHAQGGTNINDAMLMAVQLLERANREELLPARSVTFIILLTDGDPTVG 387 Query: 327 IDDPKSIESMVKKQRESGVTLSTFGVG 353 +P I+ V++ + +L G G Sbjct: 388 ETNPSKIQKNVREAIDGQHSLFCLGFG 414 >UniRef50_Q22SJ7 von Willebrand factor type A domain containing protein n=1 Tax=Tetrahymena thermophila SB210 RepID=Q22SJ7_TETTH Length = 642 Score = 47.8 bits (112), Expect = 0.001, Method: Compositional matrix adjust. Identities = 43/201 (21%), Positives = 94/201 (46%), Gaps = 10/201 (4%) Query: 213 PASNLVFLIDTSGSMISDERLPLIQSSLKLLVKELREQDNIAIVTYAGD--SRIALPSIS 270 P+ +LV +I+ S SM E++ ++++L L++ L D +++V + + L + Sbjct: 196 PSIDLVCVINNSESM-HGEKILNVKNTLLYLLEMLNSNDRLSLVLSNNNPTTLFDLKYLD 254 Query: 271 GSHKAEINAAIDSLDAEGSTNGGAGLELAYQQATKGFIKGGINRILLATDGDFNVGIDDP 330 +K ++ I+++ +TN + A+ + ++ I L +DG V Sbjct: 255 EKNKQDLKRIINNISITQNTNITKSMIKAFNILQFRQSQNKVSSIFLLSDG---VDSSAE 311 Query: 331 KSIESMVKKQR---ESGVTLSTFGVGNSNYNEAMMVRIADVGNGNYSYIDTLSEAQKVLN 387 K I++ + Q+ + +FG G + + M+ +I + NGN+ YI +++ + Sbjct: 312 KQIQNYISSQQSLQNKNFAIHSFGYG-FDQDAEMINKICSLKNGNFYYIQNMNQVDQYFA 370 Query: 388 SEMRQMLITVAKDVKAQIEFN 408 + L VA+D+ +I N Sbjct: 371 DVLGGTLTAVAQDITIEISLN 391 Searching..................................................done Results from round 2 Score E Sequences producing significant alignments: (bits) Value Sequences used in model and found again: UniRef50_P76481 Uncharacterized protein yfbK n=38 Tax=Enterobact... 783 0.0 UniRef50_A3D1E9 von Willebrand factor, type A n=8 Tax=Shewanella... 582 e-164 UniRef50_A5WCP1 von Willebrand factor, type A n=2 Tax=Moraxellac... 564 e-159 UniRef50_Q4KKB4 von Willebrand factor type A domain protein n=20... 560 e-158 UniRef50_B8F8Z5 von Willebrand factor type A n=1 Tax=Desulfatiba... 559 e-157 UniRef50_C7PNZ7 von Willebrand factor type A n=5 Tax=Bacteroidet... 554 e-156 UniRef50_C6M483 von Willebrand factor type A domain protein n=1 ... 550 e-155 UniRef50_C6VVX3 von Willebrand factor type A n=17 Tax=Bacteria R... 547 e-154 UniRef50_C8SEV7 von Willebrand factor type A n=4 Tax=Rhizobiales... 545 e-153 UniRef50_A4C5H3 Von Willebrand factor type A domain protein n=1 ... 542 e-152 UniRef50_B1KPQ5 von Willebrand factor type A n=8 Tax=Gammaproteo... 536 e-151 UniRef50_A1ZHE2 Von Willebrand factor type A domain protein n=1 ... 536 e-150 UniRef50_C5BTM6 von Willebrand factor type A domain protein n=1 ... 532 e-149 UniRef50_UPI00016C54C8 hypothetical protein GobsU_21830 n=1 Tax=... 531 e-149 UniRef50_B0T5X0 von Willebrand factor type A n=1 Tax=Caulobacter... 530 e-149 UniRef50_B0CCM8 von Willebrand factor, type A domain protein n=4... 523 e-147 UniRef50_Q7UP85 Putative uncharacterized protein n=1 Tax=Rhodopi... 521 e-146 UniRef50_C0CVB5 Putative uncharacterized protein n=1 Tax=Clostri... 520 e-146 UniRef50_C6BAR1 von Willebrand factor type A n=7 Tax=Alphaproteo... 520 e-146 UniRef50_A1ZU67 Von Willebrand factor type A domain protein n=1 ... 515 e-144 UniRef50_B1ZYN3 von Willebrand factor type A n=2 Tax=Bacteria Re... 514 e-144 UniRef50_A0NVX5 Putative uncharacterized protein n=1 Tax=Labrenz... 512 e-143 UniRef50_C0YQB8 von Willebrand factor, type A n=2 Tax=Flavobacte... 512 e-143 UniRef50_A3PN61 von Willebrand factor, type A n=9 Tax=Rhodobacte... 511 e-143 UniRef50_UPI000185CB41 protein containing von Willebrand factor ... 508 e-142 UniRef50_A5F9T1 von Willebrand factor, type A n=9 Tax=Bacteria R... 506 e-141 UniRef50_C1AC65 Putative uncharacterized protein n=1 Tax=Gemmati... 506 e-141 UniRef50_A9DBX8 Putative uncharacterized protein n=1 Tax=Hoeflea... 504 e-141 UniRef50_A9KS19 von Willebrand factor type A n=1 Tax=Clostridium... 503 e-140 UniRef50_Q28U54 von Willebrand factor type A n=3 Tax=Rhodobacter... 502 e-140 UniRef50_A3ZT14 Putative uncharacterized protein n=1 Tax=Blastop... 501 e-140 UniRef50_Q2N8R4 Von Willebrand factor type A domain protein n=2 ... 501 e-140 UniRef50_C8N8N5 von Willebrand factor type A domain protein n=2 ... 500 e-140 UniRef50_Q21MJ3 von Willebrand factor, type A n=5 Tax=Proteobact... 498 e-139 UniRef50_A9IU52 Putative lipoprotein n=3 Tax=Bordetella RepID=A9... 494 e-138 UniRef50_A4BJU7 Putative uncharacterized protein n=1 Tax=Reineke... 494 e-138 UniRef50_UPI0001C375AE von Willebrand factor type A n=1 Tax=Rumi... 481 e-134 UniRef50_D0XSR4 von Willebrand factor type A n=1 Tax=Brevundimon... 479 e-133 UniRef50_C0Z5D5 Putative uncharacterized protein n=1 Tax=Breviba... 474 e-132 UniRef50_B4WCU1 von Willebrand factor type A domain protein n=2 ... 474 e-132 UniRef50_B0UJ22 von Willebrand factor type A n=1 Tax=Methylobact... 470 e-131 UniRef50_A9GVG2 Putative uncharacterized protein n=1 Tax=Sorangi... 465 e-129 UniRef50_A9AWD1 von Willebrand factor type A n=1 Tax=Herpetosiph... 462 e-128 UniRef50_A6GHE4 von Willebrand factor, type A n=1 Tax=Plesiocyst... 459 e-127 UniRef50_A8SXC8 Putative uncharacterized protein n=2 Tax=Clostri... 456 e-126 UniRef50_Q1CVN5 von Willebrand factor type A domain protein n=2 ... 438 e-121 UniRef50_C8WPE6 von Willebrand factor type A n=1 Tax=Eggerthella... 437 e-121 UniRef50_Q1D6F9 von Willebrand factor type A domain protein n=2 ... 435 e-120 UniRef50_C7N770 Uncharacterized protein containing a von Willebr... 433 e-119 UniRef50_C7PR69 von Willebrand factor type A n=1 Tax=Chitinophag... 418 e-115 UniRef50_B4DA43 von Willebrand factor type A n=1 Tax=Chthoniobac... 418 e-115 UniRef50_A9NFD9 Surface-anchored VWFA domain protein n=1 Tax=Ach... 413 e-113 UniRef50_A7C0I1 von Willebrand factor type A domain protein n=5 ... 377 e-103 UniRef50_B4D1N7 Autotransporter-associated beta strand repeat pr... 353 9e-96 UniRef50_A3TQT5 Putative secreted protein n=1 Tax=Janibacter sp.... 350 8e-95 UniRef50_D2BAS2 von Willebrand factor type A domain protein n=12... 343 7e-93 UniRef50_A6DLI7 von Willebrand factor type A domain protein n=1 ... 332 2e-89 UniRef50_C1RGW7 Uncharacterized protein containing a von Willebr... 322 3e-86 UniRef50_B5JNR2 von Willebrand factor type A domain protein n=1 ... 310 8e-83 UniRef50_B1ZQD5 von Willebrand factor type A n=1 Tax=Opitutus te... 300 9e-80 UniRef50_Q9M1S2 Putative uncharacterized protein T5N23_140 n=5 T... 262 2e-68 UniRef50_C5WYU9 Putative uncharacterized protein Sb01g047460 n=3... 249 2e-64 UniRef50_Q10RY0 Os03g0142500 protein n=2 Tax=Oryza sativa RepID=... 247 7e-64 UniRef50_Q9LN03 T6D22.13 n=2 Tax=Arabidopsis thaliana RepID=Q9LN... 244 1e-62 UniRef50_C5YHY2 Putative uncharacterized protein Sb07g005010 n=2... 241 6e-62 UniRef50_C7PTX8 von Willebrand factor type A n=1 Tax=Chitinophag... 240 9e-62 UniRef50_Q3A188 von Willebrand factor type A domain protein n=1 ... 240 1e-61 UniRef50_A1ZUW0 Von Willebrand factor, type A n=1 Tax=Microscill... 238 5e-61 UniRef50_A4J6Q3 von Willebrand factor, type A n=1 Tax=Desulfotom... 236 2e-60 UniRef50_D1HHA4 Whole genome shotgun sequence of line PN40024, s... 236 2e-60 UniRef50_Q9FF49 Retroelement pol polyprotein-like n=34 Tax=Magno... 235 3e-60 UniRef50_Q67LZ3 Putative uncharacterized protein n=1 Tax=Symbiob... 233 1e-59 UniRef50_UPI00017450FB von Willebrand factor type A domain prote... 232 4e-59 UniRef50_A9FTM1 Putative uncharacterized protein n=1 Tax=Sorangi... 231 8e-59 UniRef50_D2LQW0 von Willebrand factor type A n=1 Tax=Bacillus ce... 229 3e-58 UniRef50_B9GK57 Predicted protein (Fragment) n=2 Tax=Populus tri... 229 3e-58 UniRef50_A5B5Z1 Putative uncharacterized protein n=2 Tax=Vitis v... 228 6e-58 UniRef50_Q8H924 Os10g0464900 protein n=3 Tax=Oryza sativa RepID=... 228 7e-58 UniRef50_A9GUK8 Putative uncharacterized protein yfbK n=1 Tax=So... 227 9e-58 UniRef50_A9V370 Predicted protein n=1 Tax=Monosiga brevicollis R... 226 3e-57 UniRef50_Q7ULL3 Putative uncharacterized protein n=1 Tax=Rhodopi... 225 4e-57 UniRef50_C5WYV0 Putative uncharacterized protein Sb01g047470 n=3... 221 5e-56 UniRef50_C1D2W8 Putative uncharacterized protein n=1 Tax=Deinoco... 219 3e-55 UniRef50_Q9U7P4 Putative uncharacterized protein (Fragment) n=1 ... 219 3e-55 UniRef50_Q235T9 von Willebrand factor type A domain containing p... 218 3e-55 UniRef50_UPI0001926ED6 PREDICTED: similar to inter-alpha trypsin... 218 5e-55 UniRef50_A9QZI4 von Willebrand factor type A domain protein n=26... 216 1e-54 UniRef50_A0LPD4 von Willebrand factor, type A n=1 Tax=Syntrophob... 215 4e-54 UniRef50_C5YMJ6 Putative uncharacterized protein Sb07g002215 (Fr... 214 5e-54 UniRef50_Q6ZFR4 Zinc finger (C3HC4-type RING finger) protein fam... 214 6e-54 UniRef50_D0LNY0 von Willebrand factor type A n=1 Tax=Haliangium ... 213 2e-53 UniRef50_A6GDG5 Putative lipoprotein n=1 Tax=Plesiocystis pacifi... 213 2e-53 UniRef50_C0PS55 Putative uncharacterized protein n=1 Tax=Picea s... 212 4e-53 UniRef50_C1XMC3 Uncharacterized protein containing a von Willebr... 211 5e-53 UniRef50_C5WZE3 Putative uncharacterized protein Sb01g019910 n=1... 211 6e-53 UniRef50_Q6ZED8 Slr7060 protein n=2 Tax=Chroococcales RepID=Q6ZE... 211 8e-53 UniRef50_Q2BJ22 Putative uncharacterized protein n=1 Tax=Neptuni... 211 9e-53 UniRef50_Q21PJ3 von Willebrand factor, type A n=1 Tax=Saccharoph... 210 1e-52 UniRef50_UPI00006CDDCC von Willebrand factor type A domain conta... 209 3e-52 UniRef50_Q1D9B7 von Willebrand factor type A domain protein n=3 ... 206 2e-51 UniRef50_A6GAI6 Putative uncharacterized protein n=1 Tax=Plesioc... 206 2e-51 UniRef50_D0LL92 von Willebrand factor type A n=1 Tax=Haliangium ... 206 2e-51 UniRef50_Q22SJ4 von Willebrand factor type A domain containing p... 206 2e-51 UniRef50_A0C9G5 Chromosome undetermined scaffold_16, whole genom... 205 3e-51 UniRef50_A0E9B3 Chromosome undetermined scaffold_84, whole genom... 205 3e-51 UniRef50_B0SHY6 Anti-sigma factor antagonist n=2 Tax=Leptospira ... 204 6e-51 UniRef50_B4W304 von Willebrand factor type A domain protein (Fra... 204 8e-51 UniRef50_B9SJS6 Protein binding protein, putative n=6 Tax=Magnol... 202 3e-50 UniRef50_Q1DFU7 von Willebrand factor type A domain protein n=2 ... 201 8e-50 UniRef50_A9F1H2 Family membership n=1 Tax=Sorangium cellulosum '... 200 1e-49 UniRef50_D2VKS7 von Willebrand factor type A domain-containing p... 200 2e-49 UniRef50_C7RNW6 von Willebrand factor type A n=1 Tax=Candidatus ... 199 2e-49 UniRef50_B8HNU4 von Willebrand factor type A n=4 Tax=Chroococcal... 197 1e-48 UniRef50_C7YL43 Putative uncharacterized protein n=2 Tax=Nectria... 197 1e-48 UniRef50_Q8YW34 All1782 protein n=5 Tax=Cyanobacteria RepID=Q8YW... 196 3e-48 UniRef50_C4XPW8 Putative uncharacterized protein n=1 Tax=Desulfo... 195 3e-48 UniRef50_B7G6X2 Predicted protein n=1 Tax=Phaeodactylum tricornu... 195 4e-48 UniRef50_Q55874 Uncharacterized protein sll0103 n=16 Tax=Cyanoba... 194 8e-48 UniRef50_Q231J4 von Willebrand factor type A domain containing p... 193 2e-47 UniRef50_A0CDA1 Chromosome undetermined scaffold_17, whole genom... 193 2e-47 UniRef50_D0LWF9 von Willebrand factor type A n=1 Tax=Haliangium ... 192 3e-47 UniRef50_A6GC99 Putative uncharacterized protein n=1 Tax=Plesioc... 192 3e-47 UniRef50_Q22ST4 von Willebrand factor type A domain containing p... 192 4e-47 UniRef50_B5JPY1 von Willebrand factor type A domain protein n=1 ... 191 5e-47 UniRef50_A5UYN7 von Willebrand factor, type A n=1 Tax=Roseiflexu... 191 8e-47 UniRef50_A6FSG0 Putative uncharacterized protein n=1 Tax=Roseoba... 191 9e-47 UniRef50_C1GWG1 von Willebrand factor type A domain containing p... 190 2e-46 UniRef50_A9F2Q0 Putative uncharacterized protein n=1 Tax=Sorangi... 188 5e-46 UniRef50_A0EHG0 Chromosome undetermined scaffold_97, whole genom... 188 6e-46 UniRef50_C5FLY1 U-box domain containing protein n=1 Tax=Microspo... 186 2e-45 UniRef50_Q22N58 von Willebrand factor type A domain containing p... 186 2e-45 UniRef50_Q8LQ58 Os01g0640200 protein n=9 Tax=Poaceae RepID=Q8LQ5... 185 4e-45 UniRef50_Q8YP40 Alr4360 protein n=15 Tax=Cyanobacteria RepID=Q8Y... 184 6e-45 UniRef50_A0DZ93 Chromosome undetermined scaffold_7, whole genome... 184 1e-44 UniRef50_A0C051 Chromosome undetermined scaffold_14, whole genom... 184 1e-44 UniRef50_D0LJL4 Myxococcales GC_trans_RRR domain protein n=1 Tax... 182 3e-44 UniRef50_A0C5K4 Chromosome undetermined scaffold_150, whole geno... 182 3e-44 UniRef50_UPI00006CAF43 von Willebrand factor type A domain conta... 181 7e-44 UniRef50_Q7S708 Predicted protein n=1 Tax=Neurospora crassa RepI... 181 9e-44 UniRef50_A6G857 Putative uncharacterized protein n=1 Tax=Plesioc... 181 1e-43 UniRef50_D2VHB8 Predicted protein n=1 Tax=Naegleria gruberi RepI... 180 1e-43 UniRef50_D0LD98 von Willebrand factor type A n=1 Tax=Gordonia br... 180 1e-43 UniRef50_A5UTA6 von Willebrand factor, type A n=6 Tax=Chloroflex... 180 2e-43 UniRef50_Q24C76 von Willebrand factor type A domain containing p... 180 2e-43 UniRef50_A6G9E8 von Willebrand factor type A domain protein n=1 ... 179 2e-43 UniRef50_Q2GTB7 Putative uncharacterized protein n=1 Tax=Chaetom... 178 4e-43 UniRef50_B8F8Z6 von Willebrand factor type A n=1 Tax=Desulfatiba... 178 6e-43 UniRef50_B8G546 von Willebrand factor type A n=6 Tax=Chloroflexi... 178 6e-43 UniRef50_A9AXC2 von Willebrand factor type A n=6 Tax=Chloroflexi... 177 1e-42 UniRef50_A3DLZ3 von Willebrand factor, type A n=1 Tax=Staphyloth... 177 1e-42 UniRef50_UPI00006CF36E U-box domain containing protein n=1 Tax=T... 176 2e-42 UniRef50_B5YMD8 Predicted protein n=1 Tax=Thalassiosira pseudona... 176 2e-42 UniRef50_A8J0D9 Flagellar associated protein n=1 Tax=Chlamydomon... 176 3e-42 UniRef50_B8BII0 Putative uncharacterized protein n=1 Tax=Oryza s... 176 3e-42 UniRef50_A9FM70 Putative uncharacterized protein n=1 Tax=Sorangi... 175 4e-42 UniRef50_B6HQM8 Pc22g11730 protein n=17 Tax=Leotiomyceta RepID=B... 174 6e-42 UniRef50_B8AXM1 Putative uncharacterized protein n=1 Tax=Oryza s... 173 2e-41 UniRef50_Q237Q6 von Willebrand factor type A domain containing p... 171 7e-41 UniRef50_C5CDX4 von Willebrand factor type A n=1 Tax=Kosmotoga o... 171 7e-41 UniRef50_A0EFJ5 Chromosome undetermined scaffold_93, whole genom... 171 8e-41 UniRef50_Q22SJ7 von Willebrand factor type A domain containing p... 169 2e-40 UniRef50_A9G8C3 Putative uncharacterized protein n=2 Tax=Sorangi... 169 2e-40 UniRef50_C1I2R0 von Willebrand factor n=1 Tax=Clostridium sp. 7_... 169 3e-40 UniRef50_B2AQN8 Predicted CDS Pa_4_3600 n=1 Tax=Podospora anseri... 169 3e-40 UniRef50_B1HPN0 Putative uncharacterized protein n=2 Tax=Bacilla... 169 3e-40 UniRef50_A6CIG8 Putative uncharacterized protein n=1 Tax=Bacillu... 169 3e-40 UniRef50_Q7UNM0 Putative uncharacterized protein n=1 Tax=Rhodopi... 169 4e-40 UniRef50_B4UFP8 von Willebrand factor type A n=1 Tax=Anaeromyxob... 168 6e-40 UniRef50_A6DLI0 Putative uncharacterized protein n=1 Tax=Lentisp... 167 8e-40 UniRef50_Q2QSE5 Os12g0431700 protein n=3 Tax=Oryza sativa RepID=... 167 1e-39 UniRef50_C5Z1W1 Putative uncharacterized protein Sb10g030210 n=1... 166 2e-39 UniRef50_C6VUB8 von Willebrand factor type A n=1 Tax=Dyadobacter... 165 4e-39 UniRef50_D2R3Y3 von Willebrand factor type A n=1 Tax=Pirellula s... 164 9e-39 UniRef50_A6GBY0 Putative uncharacterized protein n=1 Tax=Plesioc... 164 1e-38 UniRef50_UPI00016986EC hypothetical protein Epers_34925 n=1 Tax=... 163 2e-38 UniRef50_Q25545 Putative uncharacterized protein (Fragment) n=1 ... 162 2e-38 UniRef50_A1ZRP2 Von Willebrand factor type A domain protein n=1 ... 162 3e-38 UniRef50_B9XQ17 Vault protein inter-alpha-trypsin domain protein... 162 3e-38 UniRef50_C7P2A9 von Willebrand factor type A n=2 Tax=Halobacteri... 162 4e-38 UniRef50_C0Z595 Putative uncharacterized protein n=1 Tax=Breviba... 161 7e-38 UniRef50_Q15NW6 Vault protein inter-alpha-trypsin n=1 Tax=Pseudo... 160 1e-37 UniRef50_UPI0001AEB92D inter-alpha-trypsin inhibitor domain-cont... 159 2e-37 UniRef50_A6WMD3 Vault protein inter-alpha-trypsin domain protein... 159 3e-37 UniRef50_UPI0001788F71 von Willebrand factor type A n=1 Tax=Geob... 159 3e-37 UniRef50_A3QDW1 Vault protein inter-alpha-trypsin domain protein... 159 3e-37 UniRef50_B5JSY7 Inter-alpha-trypsin inhibitor domain protein n=1... 158 4e-37 UniRef50_Q1YZ74 Inter-alpha-trypsin inhibitor domain protein n=1... 158 6e-37 UniRef50_Q2B7C3 Putative uncharacterized protein n=1 Tax=Bacillu... 157 7e-37 UniRef50_B1KDL1 LPXTG-motif cell wall anchor domain protein n=1 ... 157 9e-37 UniRef50_B4X134 Vault protein inter-alpha-trypsin n=1 Tax=Alcani... 157 1e-36 UniRef50_B9MS10 von Willebrand factor type A n=2 Tax=Anaerocellu... 156 3e-36 UniRef50_A8H5J9 LPXTG-motif cell wall anchor domain n=1 Tax=Shew... 156 3e-36 UniRef50_B5JCH3 Putative uncharacterized protein (Fragment) n=1 ... 155 3e-36 UniRef50_B8CPU4 Von Willebrand factor, type A n=1 Tax=Shewanella... 155 3e-36 UniRef50_A8IJ40 Predicted protein n=1 Tax=Chlamydomonas reinhard... 155 3e-36 UniRef50_Q083T9 Vault protein inter-alpha-trypsin domain protein... 154 8e-36 UniRef50_D2R2I7 von Willebrand factor type A n=2 Tax=Planctomyce... 154 1e-35 UniRef50_C1YR26 Uncharacterized protein containing a von Willebr... 153 2e-35 UniRef50_Q09E12 Inter-alpha-inhibitor H4 heavy chain, putative n... 151 8e-35 UniRef50_Q2QZN5 Putative uncharacterized protein n=1 Tax=Oryza s... 150 1e-34 UniRef50_Q14624 35 kDa inter-alpha-trypsin inhibitor heavy chain... 148 7e-34 UniRef50_UPI000174639B hypothetical protein VspiD_18615 n=1 Tax=... 147 9e-34 UniRef50_B0TQ23 LPXTG-motif cell wall anchor domain n=1 Tax=Shew... 147 9e-34 UniRef50_A5UW94 von Willebrand factor, type A n=2 Tax=Roseiflexu... 147 1e-33 UniRef50_A6FXN3 Putative uncharacterized protein n=1 Tax=Plesioc... 147 1e-33 UniRef50_A9WKF3 von Willebrand factor type A n=3 Tax=Chloroflexu... 147 1e-33 UniRef50_D2V0V6 Predicted protein n=1 Tax=Naegleria gruberi RepI... 147 1e-33 UniRef50_A4C730 Putative uncharacterized protein n=1 Tax=Pseudoa... 147 1e-33 UniRef50_D0LP28 von Willebrand factor type A n=1 Tax=Haliangium ... 146 2e-33 UniRef50_B4VT64 Vault protein inter-alpha-trypsin n=9 Tax=Cyanob... 146 2e-33 UniRef50_B4VXM6 Vault protein inter-alpha-trypsin n=1 Tax=Microc... 145 5e-33 UniRef50_A3W9L9 Putative uncharacterized protein n=2 Tax=Erythro... 145 6e-33 UniRef50_P79263 Inter-alpha-trypsin inhibitor heavy chain H4 n=3... 144 7e-33 UniRef50_Q8YNZ7 Alr4412 protein n=6 Tax=Cyanobacteria RepID=Q8YN... 144 1e-32 UniRef50_B4DPQ4 cDNA FLJ60769, highly similar to Inter-alpha-try... 143 2e-32 UniRef50_A3ZR58 Putative uncharacterized protein n=1 Tax=Blastop... 142 3e-32 UniRef50_A1S752 Inter-alpha-trypsin inhibitor domain protein n=1... 142 3e-32 UniRef50_Q01UI0 von Willebrand factor, type A n=1 Tax=Candidatus... 142 3e-32 UniRef50_UPI0000E105CF vault protein inter-alpha-trypsin n=1 Tax... 142 4e-32 UniRef50_B0CG18 von Willebrand factor type A domain protein, put... 142 4e-32 Sequences not found previously or not previously below threshold: UniRef50_Q7G2L9 Os10g0464500 protein n=3 Tax=Magnoliophyta RepID... 228 6e-58 UniRef50_Q8H923 Putative uncharacterized protein OSJNBa0071K18.1... 209 2e-52 UniRef50_B6TZ81 Protein binding protein n=9 Tax=Magnoliophyta Re... 186 3e-45 UniRef50_C9SWV9 U-box domain containing protein n=1 Tax=Verticil... 185 4e-45 UniRef50_A6R161 Predicted protein n=3 Tax=Onygenales RepID=A6R16... 184 8e-45 UniRef50_B8AE57 Putative uncharacterized protein n=1 Tax=Oryza s... 179 3e-43 UniRef50_C5GK44 U-box domain-containing protein n=2 Tax=Ajellomy... 174 9e-42 UniRef50_Q2QZH3 Os11g0687100 protein n=79 Tax=Eukaryota RepID=Q2... 166 2e-39 UniRef50_Q24FW2 von Willebrand factor type A domain containing p... 162 3e-38 UniRef50_Q2SQR4 Uncharacterized protein containing a von Willebr... 160 2e-37 UniRef50_C0HF51 Putative uncharacterized protein n=1 Tax=Zea may... 156 2e-36 UniRef50_UPI00016C377F protein containing a von Willebrand facto... 155 3e-36 UniRef50_C7NN24 von Willebrand factor type A n=1 Tax=Halorhabdus... 155 4e-36 UniRef50_Q2QW82 Zinc finger family protein, putative, expressed ... 155 5e-36 UniRef50_C1HBZ8 von Willebrand and RING finger domain-containing... 154 7e-36 UniRef50_B9GN58 Predicted protein (Fragment) n=3 Tax=rosids RepI... 152 2e-35 UniRef50_A7HVH6 Vault protein inter-alpha-trypsin domain protein... 152 4e-35 UniRef50_A8FW78 Vault protein inter-alpha-trypsin domain protein... 150 9e-35 UniRef50_D0LTR8 von Willebrand factor type A n=1 Tax=Haliangium ... 150 1e-34 UniRef50_A1U6Y4 Vault protein inter-alpha-trypsin domain protein... 150 2e-34 UniRef50_B7RYC9 Vault protein inter-alpha-trypsin n=1 Tax=marine... 149 2e-34 UniRef50_C4WI90 Poly [ADP-ribose] polymerase 4 n=1 Tax=Ochrobact... 149 4e-34 UniRef50_B5ZY26 Vault protein inter-alpha-trypsin domain protein... 149 4e-34 UniRef50_A6X8G3 LPXTG-motif cell wall anchor domain protein n=11... 148 4e-34 UniRef50_B0UK93 LPXTG-motif cell wall anchor domain protein n=1 ... 147 1e-33 UniRef50_Q2QZN4 von Willebrand factor type A domain containing p... 144 1e-32 UniRef50_Q2R0C4 Expressed protein n=2 Tax=Oryza sativa Japonica ... 144 1e-32 UniRef50_Q6PGW2 Zgc:112265 protein (Fragment) n=15 Tax=Clupeocep... 143 2e-32 UniRef50_Q3IHK0 Putative uncharacterized protein n=2 Tax=Alterom... 143 2e-32 UniRef50_Q8PU63 Putative chloride channel n=1 Tax=Methanosarcina... 142 4e-32 >UniRef50_P76481 Uncharacterized protein yfbK n=38 Tax=Enterobacteriaceae RepID=YFBK_ECOLI Length = 575 Score = 783 bits (2021), Expect = 0.0, Method: Composition-based stats. Identities = 575/575 (100%), Positives = 575/575 (100%) Query: 1 MRNKNIIMLLMSSLILSGCGPQPENKESQQQQPSTPTEQQVLAAQQAAIKEAEQSAAAAK 60 MRNKNIIMLLMSSLILSGCGPQPENKESQQQQPSTPTEQQVLAAQQAAIKEAEQSAAAAK Sbjct: 1 MRNKNIIMLLMSSLILSGCGPQPENKESQQQQPSTPTEQQVLAAQQAAIKEAEQSAAAAK 60 Query: 61 ALAQQEVQQYSDKQALQGRLQEAPTFARAAKAKATHIANPGTARYQQFDDNPVKQVAQNP 120 ALAQQEVQQYSDKQALQGRLQEAPTFARAAKAKATHIANPGTARYQQFDDNPVKQVAQNP Sbjct: 61 ALAQQEVQQYSDKQALQGRLQEAPTFARAAKAKATHIANPGTARYQQFDDNPVKQVAQNP 120 Query: 121 LATFSLDVDTGSYANVRRFLNQGLLPPPDAVRVEEIVNYFPSDWDIKDKQSIPASKPIPF 180 LATFSLDVDTGSYANVRRFLNQGLLPPPDAVRVEEIVNYFPSDWDIKDKQSIPASKPIPF Sbjct: 121 LATFSLDVDTGSYANVRRFLNQGLLPPPDAVRVEEIVNYFPSDWDIKDKQSIPASKPIPF 180 Query: 181 AMRYELAPAPWNEQRTLLKVDILAKDRKSEELPASNLVFLIDTSGSMISDERLPLIQSSL 240 AMRYELAPAPWNEQRTLLKVDILAKDRKSEELPASNLVFLIDTSGSMISDERLPLIQSSL Sbjct: 181 AMRYELAPAPWNEQRTLLKVDILAKDRKSEELPASNLVFLIDTSGSMISDERLPLIQSSL 240 Query: 241 KLLVKELREQDNIAIVTYAGDSRIALPSISGSHKAEINAAIDSLDAEGSTNGGAGLELAY 300 KLLVKELREQDNIAIVTYAGDSRIALPSISGSHKAEINAAIDSLDAEGSTNGGAGLELAY Sbjct: 241 KLLVKELREQDNIAIVTYAGDSRIALPSISGSHKAEINAAIDSLDAEGSTNGGAGLELAY 300 Query: 301 QQATKGFIKGGINRILLATDGDFNVGIDDPKSIESMVKKQRESGVTLSTFGVGNSNYNEA 360 QQATKGFIKGGINRILLATDGDFNVGIDDPKSIESMVKKQRESGVTLSTFGVGNSNYNEA Sbjct: 301 QQATKGFIKGGINRILLATDGDFNVGIDDPKSIESMVKKQRESGVTLSTFGVGNSNYNEA 360 Query: 361 MMVRIADVGNGNYSYIDTLSEAQKVLNSEMRQMLITVAKDVKAQIEFNPAWVTEYRQIGY 420 MMVRIADVGNGNYSYIDTLSEAQKVLNSEMRQMLITVAKDVKAQIEFNPAWVTEYRQIGY Sbjct: 361 MMVRIADVGNGNYSYIDTLSEAQKVLNSEMRQMLITVAKDVKAQIEFNPAWVTEYRQIGY 420 Query: 421 EKRQLRVEHFNNDNVDAGDIGAGKHITLLFELTLNGQKASIDKLRYAPDNKLAKSDKTKE 480 EKRQLRVEHFNNDNVDAGDIGAGKHITLLFELTLNGQKASIDKLRYAPDNKLAKSDKTKE Sbjct: 421 EKRQLRVEHFNNDNVDAGDIGAGKHITLLFELTLNGQKASIDKLRYAPDNKLAKSDKTKE 480 Query: 481 LAWLKIRWKYPQGKESQLVEFPLGPTINAPSEDMRFRAAVAAYGQKLRGSEYLNNTSWQQ 540 LAWLKIRWKYPQGKESQLVEFPLGPTINAPSEDMRFRAAVAAYGQKLRGSEYLNNTSWQQ Sbjct: 481 LAWLKIRWKYPQGKESQLVEFPLGPTINAPSEDMRFRAAVAAYGQKLRGSEYLNNTSWQQ 540 Query: 541 IKQWAQQAKGEDPQGYRAEFIRLIELADGVTDISQ 575 IKQWAQQAKGEDPQGYRAEFIRLIELADGVTDISQ Sbjct: 541 IKQWAQQAKGEDPQGYRAEFIRLIELADGVTDISQ 575 >UniRef50_A3D1E9 von Willebrand factor, type A n=8 Tax=Shewanella RepID=A3D1E9_SHEB5 Length = 642 Score = 582 bits (1499), Expect = e-164, Method: Composition-based stats. Identities = 242/579 (41%), Positives = 350/579 (60%), Gaps = 26/579 (4%) Query: 5 NIIMLLMSSLILSGCGPQPENKESQQQQPSTPTEQQVLAAQQAAIKEAEQSAAAAKALAQ 64 N LL+ ++ L+ CG + E +Q + QV + +QA +++A + A A Sbjct: 45 NTAALLLVAVSLTACGGKGAEVEHRQAEQQAEQRHQVASQRQAEMRDAAKVEMARVAAPM 104 Query: 65 QEVQQYSDKQALQGRLQEAPTFARAAKAKATHIANP-GTARYQQFDDNPVKQVAQNPLAT 123 Q S A+ G + AP + A P +++Q N + + P++T Sbjct: 105 Q----MSSNGAVMG-MSIAPM-------PRDYAAIPLAQNKFEQQVQNGIMVAGEIPVST 152 Query: 124 FSLDVDTGSYANVRRFLNQGLLPPPDAVRVEEIVNYFPSDWDIKDKQSIPASKPIPFAMR 183 F +DVDTGSYA +RR L +G LP VRVEE++NYF D+ + PA PF++ Sbjct: 153 FFIDVDTGSYATLRRMLREGRLPEKGTVRVEEMLNYFAYDYPL------PAKNAAPFSVT 206 Query: 184 YELAPAPWNEQRTLLKVDILAKDRKSEELPASNLVFLIDTSGSMISDERLPLIQSSLKLL 243 ELAP+P+N+ LL++ + D +L ASNLVFL+D SGSM S ++LPL+Q++LKLL Sbjct: 207 TELAPSPYNDDMMLLRIGLKGYDLPKSQLGASNLVFLLDVSGSMASADKLPLLQTALKLL 266 Query: 244 VKELREQDNIAIVTYAGDSRIALPSISGSHKAEINAAIDSLDAEGSTNGGAGLELAYQQA 303 +L QD ++IV YAG + + L +SG+ + A++ L A GS NGG G+ AYQ A Sbjct: 267 TAQLSAQDKVSIVVYAGAAGVVLDGVSGNDTQTLTYALEQLSAGGSINGGQGITQAYQLA 326 Query: 304 TKGFIKGGINRILLATDGDFNVGIDDPKSIESMVKKQRESGVTLSTFGVGNSNYNEAMMV 363 K FI GINR++LATDGDFNVG+ D + ++++K+++ G+ L+T G G NYN+ +M Sbjct: 327 KKHFIPNGINRVILATDGDFNVGVTDFDDLIALIEKEKDHGIGLTTLGFGLGNYNDQLME 386 Query: 364 RIADVGNGNYSYIDTLSEAQKVLNSEMRQMLITVAKDVKAQIEFNPAWVTEYRQIGYEKR 423 ++AD GNGNY+YIDTL+EA+KVL E+ L T+AKDVK Q+EFNPA V+EYR IGYE R Sbjct: 387 QLADKGNGNYAYIDTLNEARKVLVDELSSTLFTIAKDVKVQVEFNPALVSEYRLIGYENR 446 Query: 424 QLRVEHFNNDNVDAGDIGAGKHITLLFELT-LNGQKASIDKLRYAPDNKLAKSDKTK-EL 481 L E FNN VDAG+IGAG +T L+EL + DKLRY D + K ++ E+ Sbjct: 447 ALAREDFNNYKVDAGEIGAGHTVTALYELRYVEAGNRMNDKLRYGVDAQTGKEKYSRKEI 506 Query: 482 AWLKIRWKYPQGKESQLVEFPLG-----PTINAPSEDMRFRAAVAAYGQKLRGSEYLNNT 536 A+LK+R+K P +SQL+ +P+ + S+D RF AAVA GQ L GS YL+ Sbjct: 507 AFLKLRYKLPAQTQSQLLSYPIRLDQSVKQLEQASDDFRFAAAVAGLGQLLNGSHYLHQF 566 Query: 537 SWQQIKQWAQQAKGEDPQGYRAEFIRLIELADGVTDISQ 575 + ++ A+ A G+DP GYR EF++L+E A + +Q Sbjct: 567 DYTKLSLLARSALGDDPFGYRHEFVQLMETAAAIEQSNQ 605 >UniRef50_A5WCP1 von Willebrand factor, type A n=2 Tax=Moraxellaceae RepID=A5WCP1_PSYWF Length = 571 Score = 564 bits (1454), Expect = e-159, Method: Composition-based stats. Identities = 254/568 (44%), Positives = 357/568 (62%), Gaps = 24/568 (4%) Query: 7 IMLLMSSLILSGCGPQPENKESQQQQPSTPTEQQVLAAQQAAIKEAEQSAAAAKALAQQE 66 ++ ++S +++ C PQ + + + +T E A+Q+ +A A E Sbjct: 23 LLSVLSLSVITACAPQSKITPASDKASTTTLE----TAEQSIQADAAAPVVVMATPAMAE 78 Query: 67 VQQYSDKQALQGRLQEAPTFARAAKAKATHIANPGTARYQQFDDNPVKQVAQNPLATFSL 126 +Q S K R+ P+ ++A Y + + N V ++ AT S+ Sbjct: 79 SRQLS-KMTTNARIMPPPSQG--------YMAPKQQENYAEIEPNAVNATSEQAFATLSI 129 Query: 127 DVDTGSYANVRRFLNQGLLPPPDAVRVEEIVNYFPSDWDIKDKQSIPASKPIPFAMRYEL 186 D DTGSYANVRRFLNQG LPP DAVRVEE++NYF D+ KQ+ PF + E+ Sbjct: 130 DTDTGSYANVRRFLNQGQLPPKDAVRVEELINYFNYDFTAAKKQA-----NAPFLVSTEV 184 Query: 187 APAPWNEQRTLLKVDILAKDR--KSEELPASNLVFLIDTSGSMISDERLPLIQSSLKLLV 244 +PW+ ++KV I A+D ++ P +NLVFL+D SGSM ++++L L +SSLK+L Sbjct: 185 VNSPWHPTNQIVKVGIKAEDLLTAKQKQPPANLVFLVDVSGSMDTEDKLQLAKSSLKMLT 244 Query: 245 KELREQDNIAIVTYAGDSRIALPSISGSHKAEINAAIDSLDAEGSTNGGAGLELAYQQAT 304 K+LR QD+I ++TYAG++++ LPS G+ +I AID+L A GSTNG A ++LAYQQAT Sbjct: 245 KQLRAQDSITLITYAGNTKVVLPSTPGNQTQKILNAIDNLTASGSTNGEAAIKLAYQQAT 304 Query: 305 KGFIKGGINRILLATDGDFNVGIDDPKSIESMVKKQRESGVTLSTFGVGNSNYNEAMMVR 364 + F K GINRIL+ TDGDFNVG+ K + +++ R+ G++LST G G NYN+ MM + Sbjct: 305 EHFKKDGINRILMLTDGDFNVGVSSVKDMLQIIRSNRDKGISLSTLGFGQGNYNDHMMEQ 364 Query: 365 IADVGNGNYSYIDTLSEAQKVLNSEMRQMLITVAKDVKAQIEFNPAWVTEYRQIGYEKRQ 424 +AD GNGNYSYID+LSEA+KVL EM TVAKDVK Q+EFNPA V+E+R IGYE R Sbjct: 365 VADNGNGNYSYIDSLSEAKKVLIDEMSATFNTVAKDVKIQLEFNPAAVSEWRLIGYENRV 424 Query: 425 LRVEHFNNDNVDAGDIGAGKHITLLFELTLNGQKASIDKLRYAPDNKLAKSDKTKELAWL 484 L E FNNDNVDAG++GAGK + LFE+T GQK ++ RY N A S +EL +L Sbjct: 425 LAKEDFNNDNVDAGELGAGKSVVALFEVTPVGQKGLLEPSRY--QNSAAVSGNNRELGFL 482 Query: 485 KIRWKYPQGKESQLVEFPLGPTINAPSEDMRFRAAVAAYGQKLRGSEYLNNTSWQQIKQW 544 KIR+K PQ ++SQL+ FP+ + A S D F AVA YGQ L GS+Y+N+ S+ Q+++ Sbjct: 483 KIRYKAPQAEKSQLLSFPIANRVTAASADTNFALAVAGYGQLLTGSKYVNDLSYSQLQRL 542 Query: 545 AQQAKGE--DPQGYRAEFIRLIELADGV 570 A+ D G R+EFI+L+ LA+ + Sbjct: 543 AKSGAQSPIDSSGSRSEFIKLVSLAEAL 570 >UniRef50_Q4KKB4 von Willebrand factor type A domain protein n=20 Tax=Proteobacteria RepID=Q4KKB4_PSEF5 Length = 582 Score = 560 bits (1443), Expect = e-158, Method: Composition-based stats. Identities = 257/562 (45%), Positives = 350/562 (62%), Gaps = 26/562 (4%) Query: 13 SLILSGCGPQPENKESQQQQPSTPTEQQVLAAQQAAIKEAEQSAAAAKALAQQEVQQYSD 72 L ++GCG + + + + A + +A + A + + S Sbjct: 20 LLAVAGCGVSSKPESAAGSSTQGALQAAPQAQYEVQHADATMAKRAVHPM------RLSA 73 Query: 73 KQALQGRLQEAPTFARAAKAKATHIANPGTARYQQFDDNPVKQVAQNPLATFSLDVDTGS 132 AP +R + +YQ+ DNP+ VA+ P++TFS DVDTG+ Sbjct: 74 PM-------PAPISSRDSLVAGYRDEP--REQYQKLPDNPIHSVAEAPVSTFSADVDTGA 124 Query: 133 YANVRRFLNQGLLPPPDAVRVEEIVNYFPSDWDIKDKQSIPASKPIPFAMRYELAPAPWN 192 YANVRR LNQG LPP AVR+EE+VNYFP D+ + S PF + ELAP+PWN Sbjct: 125 YANVRRLLNQGSLPPEGAVRLEELVNYFPYDYALPTDGS-------PFGVTTELAPSPWN 177 Query: 193 EQRTLLKVDILAKDRKSEELPASNLVFLIDTSGSMISDERLPLIQSSLKLLVKELREQDN 252 LL++ I A DR EL +NLVFL+D SGSM E LPL++S+LKLLV +LR+QD Sbjct: 178 PHTRLLRIGIKASDRAVAELAPANLVFLVDVSGSMDRREGLPLVKSTLKLLVDQLRDQDR 237 Query: 253 IAIVTYAGDSRIALPSISGSHKAEINAAIDSLDAEGSTNGGAGLELAYQQATKGFIKGGI 312 +++V YAG+SR+ L SG KA+I AID L A GST G +G++LAYQ A +GFI GI Sbjct: 238 VSLVVYAGESRVVLEPTSGRDKAKIRTAIDQLTAGGSTAGASGIQLAYQMAQQGFIDQGI 297 Query: 313 NRILLATDGDFNVGIDDPKSIESMVKKQRESGVTLSTFGVGNSNYNEAMMVRIADVGNGN 372 NRILLATDGDFNVG+ D S+++M ++R+SGV+L+T G G NYNE +M ++AD G+GN Sbjct: 298 NRILLATDGDFNVGVSDFDSLKAMAAEKRKSGVSLTTLGFGVDNYNEHLMEQLADAGDGN 357 Query: 373 YSYIDTLSEAQKVLNSEMRQMLITVAKDVKAQIEFNPAWVTEYRQIGYEKRQLRVEHFNN 432 Y+YID L EA+KVL ++ L VAKDVK Q+EFNPA V+EYR +GYE R L+ E F+N Sbjct: 358 YAYIDNLREARKVLVDQLSSTLAVVAKDVKLQVEFNPAQVSEYRLLGYENRALKREDFSN 417 Query: 433 DNVDAGDIGAGKHITLLFELTLNGQKASIDKLRYAPDNKLAKSDKTKELAWLKIRWKYPQ 492 D VDAG+IGAG +T L+E+ G+K ++ LRYA +S K ELA L++R+K P+ Sbjct: 418 DKVDAGEIGAGHTVTALYEIVPAGEKGWLEPLRYAQAKAPQQSGKQGELAMLRLRYKAPE 477 Query: 493 GKESQLVEFPLGP----TINAPSEDMRFRAAVAAYGQKLRGSEYLNNTSWQQIKQWAQQA 548 G S+L+E P+ ++ A S D+RF AAVAA+ Q+L+ Y N S + AQ A Sbjct: 478 GGSSRLIERPISAQQPGSLAAASPDLRFAAAVAAFSQQLKDGRYTGNFSLADTVKLAQGA 537 Query: 549 KGEDPQGYRAEFIRLIELADGV 570 KG DP G R EF++L+ELA + Sbjct: 538 KGADPYGLRGEFVQLVELAQSL 559 >UniRef50_B8F8Z5 von Willebrand factor type A n=1 Tax=Desulfatibacillum alkenivorans AK-01 RepID=B8F8Z5_DESAA Length = 558 Score = 559 bits (1441), Expect = e-157, Method: Composition-based stats. Identities = 210/541 (38%), Positives = 305/541 (56%), Gaps = 18/541 (3%) Query: 41 VLAAQQAAIKEAEQSAAAAKALAQQEVQQYSDKQALQGRLQEAPTFARAA-----KAKAT 95 V A AA+ S A Y+ K A A A Sbjct: 23 VCLAGSAAVSFTSCSPKTVNEDAMTGYSGYTSKSTSAEPSMSAAKPCPAPKSEQRYAYYC 82 Query: 96 HIANPGTARYQQFDDNPVKQVAQNPLATFSLDVDTGSYANVRRFLNQGLLPPPDAVRVEE 155 + + T Y + K +PL+TFS+DVDT SY+NVRRFL+ G +PP DAVR+EE Sbjct: 83 RVPDYNTEEYAPIREGGFKSPLYDPLSTFSIDVDTASYSNVRRFLSYGNMPPVDAVRIEE 142 Query: 156 IVNYFPSDWDIKDKQSIPASKPIPFAMRYELAPAPWNEQRTLLKVDILAKDRKSEELPAS 215 ++NYF D+ Q PF++ E++ PWN L+ V + + +++ S Sbjct: 143 MINYFHYDYPQPKGQ-------DPFSITMEMSQCPWNRDNMLVHVGLQGRCLDYKDVKPS 195 Query: 216 NLVFLIDTSGSMISDERLPLIQSSLKLLVKELREQDNIAIVTYAGDSRIALPSISGSHKA 275 NLVFL+D SGSM S+ +LPL++ S+++LVKEL D ++IVTYAG + + LPS S +K Sbjct: 196 NLVFLLDVSGSMNSENKLPLVKRSMEMLVKELGAGDRVSIVTYAGSAGLVLPSTSARNKR 255 Query: 276 EINAAIDSLDAEGSTNGGAGLELAYQQATKGFIKGGINRILLATDGDFNVGIDDPKSIES 335 +I A+D L+A GST GG G+ELAY+ A + I G NR++L TDGDFNVG+ + Sbjct: 256 KIITALDRLEAGGSTAGGEGIELAYRVAWENLIPEGNNRVILCTDGDFNVGVSSTPELVR 315 Query: 336 MVKKQRESGVTLSTFGVGNSNYNEAMMVRIADVGNGNYSYIDTLSEAQKVLNSEMRQMLI 395 M++++R +G+ L+ G G NY + M I++ GNGN+ YID+ EA KV +MR + Sbjct: 316 MIEEKRRAGIYLTICGFGMGNYKDEKMEAISNAGNGNFYYIDSRREAHKVFVQDMRANMF 375 Query: 396 TVAKDVKAQIEFNPAWVTEYRQIGYEKRQLRVEHFNNDNVDAGDIGAGKHITLLFELTLN 455 T+AKDVK Q+EFNP V++YR +GYE R L E FNND DAG+IG G +T L+E+ Sbjct: 376 TLAKDVKIQVEFNPGRVSQYRLVGYENRLLAAEDFNNDLKDAGEIGPGHSVTALYEIVPA 435 Query: 456 G---QKASIDKLRYAPDNKLAKSDKTKELAWLKIRWKYPQGKESQLVEFPLGPT---INA 509 G +D L+Y + + + E+ +K R+K P+ S+L+ L + Sbjct: 436 GLGMGAQRVDPLKYQESEPVPELRNSNEILTIKFRYKNPEENRSRLITRVLDESSMEFGD 495 Query: 510 PSEDMRFRAAVAAYGQKLRGSEYLNNTSWQQIKQWAQQAKGEDPQGYRAEFIRLIELADG 569 S+D RF AAVA +G LR S Y + +W QI+ A+++ G D GYRAEFI+L++ Sbjct: 496 TSDDFRFSAAVAGWGMLLRNSSYADRLTWGQIQSMAEESVGPDEMGYRAEFIKLVKTCRE 555 Query: 570 V 570 + Sbjct: 556 L 556 >UniRef50_C7PNZ7 von Willebrand factor type A n=5 Tax=Bacteroidetes RepID=C7PNZ7_CHIPD Length = 639 Score = 554 bits (1428), Expect = e-156, Method: Composition-based stats. Identities = 213/508 (41%), Positives = 306/508 (60%), Gaps = 15/508 (2%) Query: 71 SDKQALQGRLQEAPTFARAAKAKATHIANPGTARYQQFDDNPVKQVAQNPLATFSLDVDT 130 + K + A + + + T Y ++N VA +PL+TFS+DVD Sbjct: 136 APKTVAGSPVVNAYMKSASPAFYGSRAPQFNTEDYSPVNENRFHTVASDPLSTFSIDVDR 195 Query: 131 GSYANVRRFLNQGLLPPPDAVRVEEIVNYFPSDWDIKDKQSIPASKPIPFAMRYELAPAP 190 SY+NVRRFLN+G +PP DAVRVEE++NYF + + P A+R ++A P Sbjct: 196 ASYSNVRRFLNEGNMPPVDAVRVEEMINYFDYKYSNP-------TGNTPVAVRTDMAICP 248 Query: 191 WNEQRTLLKVDILAKDRKSEELPASNLVFLIDTSGSMISDERLPLIQSSLKLLVKELREQ 250 WN L+++ + KD + LP SNLVFLID SGSM ++LPL++ + KLLV +LR Sbjct: 249 WNTAHQLVRIALKGKDVAKDNLPPSNLVFLIDVSGSMSDAKKLPLVKQAFKLLVNQLRPV 308 Query: 251 DNIAIVTYAGDSRIALPSISGSHKAEINAAIDSLDAEGSTNGGAGLELAYQQATKGFIKG 310 D +AIV YAG + + LPS SG HK I A+D L+A GST GG G++LAY+ AT+ +K Sbjct: 309 DRVAIVVYAGAAGLVLPSTSGDHKTAILDALDKLEAGGSTAGGEGVQLAYKTATEYLLKS 368 Query: 311 GINRILLATDGDFNVGIDDPKSIESMVKKQRESGVTLSTFGVGNSNYNEAMMVRIADVGN 370 G NR+++ATDGDFNVG ++ +++K+RE G+ LS G G NY + + +AD GN Sbjct: 369 GNNRVIIATDGDFNVGPSSDGELQRIIEKKREKGIFLSVLGFGMGNYKDNKLELLADKGN 428 Query: 371 GNYSYIDTLSEAQKVLNSEMRQMLITVAKDVKAQIEFNPAWVTEYRQIGYEKRQLRVEHF 430 GNY+YID EA++ +E L T+AKDVK Q+EFNP +V YR +GYE R L E F Sbjct: 429 GNYAYIDNFEEARRTFATEFGGTLFTIAKDVKLQVEFNPKYVQSYRLVGYENRLLNNEDF 488 Query: 431 NNDNVDAGDIGAGKHITLLFELTLNG---QKASIDKLRYAPDNKLAKSDKTKELAWLKIR 487 N+D DAGD+GAG +T L+E+ G + ++D L+Y + + S E+ +K+R Sbjct: 489 NDDKKDAGDMGAGHTVTALYEVVPVGVQTGQPAVDPLKYQQNQPV--SGDNTEVLTVKLR 546 Query: 488 WKYPQGKESQLVEFPLG---PTINAPSEDMRFRAAVAAYGQKLRGSEYLNNTSWQQIKQW 544 +K P SQL+ L I+A ED R AVA +G LR SE+ N S++Q+ + Sbjct: 547 YKNPADTSSQLISQVLHWKRQDISAAPEDFRMATAVADFGLLLRNSEHKGNASYEQVLKL 606 Query: 545 AQQAKGEDPQGYRAEFIRLIELADGVTD 572 A A+G D +GYRAEFI+L++ A +++ Sbjct: 607 AGNARGTDEEGYRAEFIQLVKKAQLISN 634 >UniRef50_C6M483 von Willebrand factor type A domain protein n=1 Tax=Neisseria sicca ATCC 29256 RepID=C6M483_NEISI Length = 538 Score = 550 bits (1416), Expect = e-155, Method: Composition-based stats. Identities = 237/516 (45%), Positives = 327/516 (63%), Gaps = 14/516 (2%) Query: 57 AAAKALAQQEVQQYSDKQALQGRLQEAPTFARAAKAKATHIANPGTARYQQFDDNPVKQV 116 AA A + S + LQ A + A A+ T RYQ D PVK V Sbjct: 23 AALAACSGPLEHSSSSPEGLQSPPNAALSTAAVAEENLP--LAENTERYQDQPDQPVKSV 80 Query: 117 AQNPLATFSLDVDTGSYANVRRFLNQGLLPPPDAVRVEEIVNYFPSDWDIKDKQSIPASK 176 AQ P++TFS+DVDTGSYANVRRFL G PP DAVR+EEIVNYFP ++ + + Sbjct: 81 AQEPVSTFSIDVDTGSYANVRRFLTNGEQPPKDAVRIEEIVNYFPYNYPLP-------TD 133 Query: 177 PIPFAMRYELAPAPWNEQRTLLKVDILAKDRKSEELPASNLVFLIDTSGSMISDERLPLI 236 PFA+ E +PW + L+K+ I A+D ++LP +NLVFL+D SGSM + +LPL+ Sbjct: 134 NRPFAVHTETIDSPWQPEAKLIKIGIQAQDTAKKDLPPANLVFLVDVSGSMDEENKLPLV 193 Query: 237 QSSLKLLVKELREQDNIAIVTYAGDSRIALPSISGSHKAEINAAIDSLDAEGSTNGGAGL 296 Q +L++L ++LR QD + ++TYA + LP SG+ K I +AID L A G+T+G + L Sbjct: 194 QKTLRILTQQLRPQDKVTLITYASGEDLVLPPTSGADKETILSAIDKLRAGGATDGESAL 253 Query: 297 ELAYQQATKGFIKGGINRILLATDGDFNVGIDDPKSIESMVKKQRESGVTLSTFGVGNSN 356 ++AY+QA K F+ GINRILLATDGDFNVG+ D ++++SMV ++R+SGV+LST G G N Sbjct: 254 QMAYEQAQKAFVPNGINRILLATDGDFNVGVSDTETLKSMVAEKRKSGVSLSTLGFGMGN 313 Query: 357 YNEAMMVRIADVGNGNYSYIDTLSEAQKVLNSEMRQMLITVAKDVKAQIEFNPAWVTEYR 416 YNE MM +IAD G+GNYSYID EA+KVL ++ L TVA+DVK Q+EFNPA V EYR Sbjct: 314 YNEDMMEQIADAGDGNYSYIDNEKEAKKVLQQQLTSTLATVAQDVKIQVEFNPATVKEYR 373 Query: 417 QIGYEKRQLRVEHFNNDNVDAGDIGAGKHITLLFELTLNGQKASIDKLRYAPDNKLAKSD 476 +GY R LR E FNND VDAGDIGAG +T L+E+ G++ +++ RY A Sbjct: 374 LVGYTNRTLRNEDFNNDRVDAGDIGAGHSVTALYEIIPQGKQGWLEESRY--QKAPAAKG 431 Query: 477 KTKELAWLKIRWKYPQGKESQLVEFPLG---PTINAPSEDMRFRAAVAAYGQKLRGSEYL 533 E A++K+R+K P K+SQL++ + ++ +D A+Y Q LRG EY Sbjct: 432 SKNEYAFVKVRYKLPGQKDSQLMQQAVPVGSKPLDQADKDTLLALTAASYAQALRGGEYN 491 Query: 534 NNTSWQQIKQWAQQAKGEDPQGYRAEFIRLIELADG 569 SW+ I+ AQ+ +G+DP G ++EF++L+ A G Sbjct: 492 GKLSWRDIENMAQKVQGDDPFGLKSEFLQLVRTAAG 527 >UniRef50_C6VVX3 von Willebrand factor type A n=17 Tax=Bacteria RepID=C6VVX3_DYAFD Length = 625 Score = 547 bits (1410), Expect = e-154, Method: Composition-based stats. Identities = 215/529 (40%), Positives = 321/529 (60%), Gaps = 15/529 (2%) Query: 54 QSAAAAKALAQQEVQQYSDKQALQGRLQEAPTFARAAKAKATHIANPGTARYQQFDDNPV 113 Q++ K +A++E++ ++ A + ++ T Y+ ++N Sbjct: 95 QASVKKKDIAREELKDSEQPVMVRKMFTFDMAAAPSTHSETILAMPQATESYKPINENGF 154 Query: 114 KQVAQNPLATFSLDVDTGSYANVRRFLNQGLLPPPDAVRVEEIVNYFPSDWDIKDKQSIP 173 V Q P+ TFS+DVD +Y+NVRRFLN G +PP DAVR+EE++NYF D+ + Sbjct: 155 LSVGQQPVTTFSVDVDRAAYSNVRRFLNNGQMPPEDAVRIEEMINYFDYDYPQPRGEH-- 212 Query: 174 ASKPIPFAMRYELAPAPWNEQRTLLKVDILAKDRKSEELPASNLVFLIDTSGSMISDERL 233 P A+ E +PWN L+ + + AK +E L ASNLVFLID SGSM +L Sbjct: 213 -----PVAIVAETTDSPWNPGLKLVHIGLQAKTVSAENLSASNLVFLIDVSGSMNEANKL 267 Query: 234 PLIQSSLKLLVKELREQDNIAIVTYAGDSRIALPSISGSHKAEINAAIDSLDAEGSTNGG 293 PL++ + KLL +LR +D I+IV YAG + + L SGS K I A+D L+A GST GG Sbjct: 268 PLLKQAFKLLADQLRVEDKISIVAYAGSAGMVLAPTSGSEKKTIKDALDKLEAGGSTAGG 327 Query: 294 AGLELAYQQATKGFIKGGINRILLATDGDFNVGIDDPKSIESMVKKQRESGVTLSTFGVG 353 G+ELAY A K F+ G NR++LATDGDFNVGI + ++ +++++R++G+ LS G G Sbjct: 328 EGIELAYDLAKKHFLPKGNNRVILATDGDFNVGISNESELQKLIEEKRKAGIFLSVMGFG 387 Query: 354 NSNYNEAMMVRIADVGNGNYSYIDTLSEAQKVLNSEMRQMLITVAKDVKAQIEFNPAWVT 413 NY ++ + +AD GNGNY+YID + EA+KV E L T+AKDVK QIEFNPA V Sbjct: 388 MGNYKDSHVETLADKGNGNYAYIDNIQEARKVFVQEFGGTLFTIAKDVKIQIEFNPAHVQ 447 Query: 414 EYRQIGYEKRQLRVEHFNNDNVDAGDIGAGKHITLLFELTLNGQK----ASIDKLRYAPD 469 YR IGYE R LR + FN+D DAGD+G+G +T ++E+ +G K A+ D L+Y P Sbjct: 448 AYRLIGYENRALRNDEFNDDRKDAGDMGSGHTVTAIYEIVPSGVKSPYVATTDALKYQPG 507 Query: 470 NKLAKSDKTKELAWLKIRWKYPQGKESQLVEFPLGPT---INAPSEDMRFRAAVAAYGQK 526 N S KE+ +K+R+K P ++S+L + P+ T + S ++RF +AVA +G Sbjct: 508 NAATGS-SNKEMMTIKVRYKQPDSEKSKLFDLPVPATTVAFDQCSANLRFASAVAEFGLL 566 Query: 527 LRGSEYLNNTSWQQIKQWAQQAKGEDPQGYRAEFIRLIELADGVTDISQ 575 LRGSE+ + S+ + + A+ A G+D +GYR+EF++L+++A + + Sbjct: 567 LRGSEFKGSASYADVIRRARAAFGKDEEGYRSEFVQLVKVAQSLDGSQE 615 >UniRef50_C8SEV7 von Willebrand factor type A n=4 Tax=Rhizobiales RepID=C8SEV7_9RHIZ Length = 718 Score = 545 bits (1405), Expect = e-153, Method: Composition-based stats. Identities = 238/556 (42%), Positives = 319/556 (57%), Gaps = 25/556 (4%) Query: 34 STPTEQQVLAAQQAAIKEAEQSAAAAKALAQQEVQQYSDKQALQG---RLQEAP------ 84 S PT LA Q A + A A A++ Q+ + + G R+ P Sbjct: 165 SEPTVAGGLAQQNAQGQVAPAEPAPARSGGQRVIMSLTPPPQADGTTSRIARMPAAESKL 224 Query: 85 -TFARAAKAKATHIAN--PGTARYQQFDDNPVKQVAQNPLATFSLDVDTGSYANVRRFLN 141 T + A A A IA R Q F NPV ++P++TFS+DVDT SY+ VRR L Sbjct: 225 MTPQQPATAPADQIAPQEENRDRVQDFKTNPVHAALEDPVSTFSIDVDTASYSFVRRSLK 284 Query: 142 QGLLPPPDAVRVEEIVNYFPSDWDIKDKQSIPASKPIPFAMRYELAPAPWNEQRTLLKVD 201 +G +P D VRVEE++NYFP DW D S P F + P PWN L+ V Sbjct: 285 EGFVPQADTVRVEEMINYFPYDWKGPDSASTP------FNSTVSVMPTPWNTHTKLMHVA 338 Query: 202 ILAKDRKSEELPASNLVFLIDTSGSMISDERLPLIQSSLKLLVKELREQDNIAIVTYAGD 261 I D K E P +NLVFLID SGSM ++LPL++S+ +LLV +L+ D I+IVTYAGD Sbjct: 339 IKGFDVKPTEQPKANLVFLIDVSGSMDEPDKLPLLKSAFRLLVSKLKADDTISIVTYAGD 398 Query: 262 SRIALPSISGSHKAEINAAIDSLDAEGSTNGGAGLELAYQQATKGFIKGGINRILLATDG 321 + L + K +I AID+L GST G AG++ AY+ A + FIK G+NR++LATDG Sbjct: 399 AGTVLMPTKIAEKDKILNAIDNLQPGGSTAGEAGIKEAYKLAQQSFIKDGVNRVMLATDG 458 Query: 322 DFNVGIDDPKSIESMVKKQRESGVTLSTFGVGNSNYNEAMMVRIADVGNGNYSYIDTLSE 381 DFNVG D ++ +++++R++GV LS FG G N N+ MM IA GNG +YIDTL+E Sbjct: 459 DFNVGQTDDDDLKRLIEQERKTGVFLSVFGFGRGNLNDEMMQTIAQNGNGTAAYIDTLAE 518 Query: 382 AQKVLNSEMRQMLITVAKDVKAQIEFNPAWVTEYRQIGYEKRQLRVEHFNNDNVDAGDIG 441 A+KVL + L T+AKDVK Q+EFNP V+EYR IGYE R L E FNND VDAGDIG Sbjct: 519 AEKVLVEDASSTLFTIAKDVKIQVEFNPDKVSEYRLIGYETRALNREDFNNDRVDAGDIG 578 Query: 442 AGKHITLLFELTLNGQKA-SIDKLRYAPDN-KLAKSDKTKELAWLKIRWKYPQGKESQLV 499 +G +T ++E+T G ID LRY E A++KIR+K P S+L+ Sbjct: 579 SGHSVTAIYEITPKGGGGEQIDPLRYGQAGVNNGGVANADEYAFVKIRYKLPNEDVSKLI 638 Query: 500 EFPLG-----PTINAPSEDMRFRAAVAAYGQKLRGSEYLNNTSWQQIKQWAQQAKGEDPQ 554 P+ + + S D RF AVAA+GQKLR + + +I + A A+G DP Sbjct: 639 TTPVTSANEVASFDQASTDQRFSVAVAAFGQKLRDEDATAKFGYDKIMEIATAARGADPF 698 Query: 555 GYRAEFIRLIELADGV 570 GYR+EF+ L+ LA + Sbjct: 699 GYRSEFLSLVRLASAL 714 >UniRef50_A4C5H3 Von Willebrand factor type A domain protein n=1 Tax=Pseudoalteromonas tunicata D2 RepID=A4C5H3_9GAMM Length = 608 Score = 542 bits (1397), Expect = e-152, Method: Composition-based stats. Identities = 232/599 (38%), Positives = 340/599 (56%), Gaps = 40/599 (6%) Query: 7 IMLLMSSLILSGCGPQPENKESQQQQPSTPTEQQVLAAQQAAIKEAEQSAAAAKALAQQE 66 I+ L +L+GC Q +N +Q + + EQ+ + +Q+ Sbjct: 16 ILTLTIISLLAGCQGQQQNTSAQSDEQAAVIEQKNTQREAQTQTNTSAEVQTKSKSSQEN 75 Query: 67 VQQYSDKQALQGRLQEAPTFARAAKAKATHIA------------------------NPGT 102 +++ + + A A + H Sbjct: 76 EIRFNHPRIENVAGLNHESLAPAQMQQVRHRIGAVYPPMPMPPILPPKPMPPQFENEQAR 135 Query: 103 ARYQQFDDNPVKQVAQNPLATFSLDVDTGSYANVRRFLNQGLLPPPDAVRVEEIVNYFPS 162 Y + + NPVKQV P++TFS+DVDTGSY+N RR + G PP DAVR E +NYF Sbjct: 136 ENYLKNEQNPVKQVMLEPVSTFSIDVDTGSYSNSRRMIKMGKRPPADAVREEAFINYFDY 195 Query: 163 DWDIKDKQSIPASKPIPFAMRYELAPAPWNEQRTLLKVDILAKDRKSEELPASNLVFLID 222 + S P S PF + E+APAPWN QR LLK+ I D + EL A+NLVFL+D Sbjct: 196 HY------SAPKSLETPFNVHTEVAPAPWNNQRQLLKIGIKGFDIEKAELKAANLVFLLD 249 Query: 223 TSGSMISDERLPLIQSSLKLLVKELREQDNIAIVTYAGDSRIALPSISGSHKAEINAAID 282 SGSM + ++LPL++SSL +L K+L E D++AIV YAG + + LP+ G+ I+ A++ Sbjct: 250 VSGSMNAPDKLPLLKSSLTMLTKQLDENDSVAIVVYAGAAGLVLPATKGNEYQVISNALN 309 Query: 283 SLDAEGSTNGGAGLELAYQQATKGFIKGGINRILLATDGDFNVGIDDPKSIESMVKKQRE 342 +L A GSTNG G+ELAYQ A++ F K GINR++LATDGDFNVG+ +++ ++ +R+ Sbjct: 310 NLSAGGSTNGAQGIELAYQIASQNFKKEGINRVILATDGDFNVGMSSVDALKKLIANKRK 369 Query: 343 SGVTLSTFGVGNSNYNEAMMVRIADVGNGNYSYIDTLSEAQKVLNSEMRQMLITVAKDVK 402 +G+ L+T G G NYN+ +M ++A++GNG ++YIDT++EA+KVL E+ + +AKDVK Sbjct: 370 TGIALTTLGFGQGNYNDGLMEQLANIGNGQHAYIDTINEARKVLVDELSSTMQIIAKDVK 429 Query: 403 AQIEFNPAWVTEYRQIGYEKRQLRVEHFNNDNVDAGDIGAGKHITLLFELTL-NGQKASI 461 Q+EFNPA V EYR IGY+ R L+ E FNND VDAG++GAG +T L+E+TL N I Sbjct: 430 IQVEFNPAQVAEYRLIGYQNRLLKQEDFNNDTVDAGELGAGHTVTALYEITLANSPAKQI 489 Query: 462 DKLRYAPDNKLAK----SDKTKELAWLKIRWKYPQGKESQLVEFPLGPT-----INAPSE 512 D LRY ++ S ELA++K+R+K P S+L+ + + S+ Sbjct: 490 DDLRYQTPQQMPTNSNFSSAQDELAYVKLRYKAPNSDVSKLMSQAIFASETQSQFAQASQ 549 Query: 513 DMRFRAAVAAYGQKLRGSEYLNNTSWQQIKQWAQQAKGEDPQGYRAEFIRLIELADGVT 571 D +F A VA + KL+G +Y +QQ+ A KG+DP GYR EFI+L+ A + Sbjct: 550 DFQFAATVAGFADKLKGEKYTGLWQYQQLIDVAVANKGDDPFGYRNEFIQLLRTAAELE 608 >UniRef50_B1KPQ5 von Willebrand factor type A n=8 Tax=Gammaproteobacteria RepID=B1KPQ5_SHEWM Length = 640 Score = 536 bits (1381), Expect = e-151, Method: Composition-based stats. Identities = 230/561 (40%), Positives = 344/561 (61%), Gaps = 21/561 (3%) Query: 26 KESQQQQPSTPTEQQVLAAQQAAIKEAEQSAAAAKALAQQEVQ---QYSDKQALQGRLQE 82 ES S+P +Q + A E + A+ + ++ + Q S++ Sbjct: 61 SESNPIAGSSPNQQSIDTASGIGGTEVQSEASQGEVSVRETYRAAKQASERMKSMKVHSR 120 Query: 83 APTFARAAKAKATH-----IANPGTARYQQFDDNPVKQVAQNPLATFSLDVDTGSYANVR 137 +FA ++++ N + + P++TFS+DVDTGSY+ +R Sbjct: 121 PESFALMGLPSRPSQDIYLPELQNRDKFERQVANGIMVAGEIPVSTFSIDVDTGSYSTLR 180 Query: 138 RFLNQGLLPPPDAVRVEEIVNYFPSDWDIKDKQSIPASKPIPFAMRYELAPAPWNEQRTL 197 R +N G+LP VRVEE++NYF + D + PF++ ELAP+P+N + L Sbjct: 181 RSINHGVLPERGTVRVEELINYFAYQYPAPD------AGEQPFSVNTELAPSPYNPHKML 234 Query: 198 LKVDILAKDRKSEELPASNLVFLIDTSGSMISDERLPLIQSSLKLLVKELREQDNIAIVT 257 L++ + +++ +L AS LVFL+D SGSM S ++LPL++++LK+L ++L E D I+IV Sbjct: 235 LRIGLKGFEKEKADLGASQLVFLLDVSGSMSSQDKLPLLKNALKMLSQQLDEGDRISIVV 294 Query: 258 YAGDSRIALPSISGSHKAEINAAIDSLDAEGSTNGGAGLELAYQQATKGFIKGGINRILL 317 YAG S + L + G+ I+ A+D L A GSTNGGAG+ELAYQ A K FI GG+NR++L Sbjct: 295 YAGASGVVLDGVKGNDTLAISQALDKLKAGGSTNGGAGIELAYQLAQKHFIAGGVNRVIL 354 Query: 318 ATDGDFNVGIDDPKSIESMVKKQRESGVTLSTFGVGNSNYNEAMMVRIADVGNGNYSYID 377 ATDGDFNVG+ D +++E M++++R+ G+ L+T G G NYN+ +M ++AD GNG+Y+YID Sbjct: 355 ATDGDFNVGVSDQQALEDMIEEKRKQGIALTTLGFGQGNYNDHLMEQLADKGNGHYAYID 414 Query: 378 TLSEAQKVLNSEMRQMLITVAKDVKAQIEFNPAWVTEYRQIGYEKRQLRVEHFNNDNVDA 437 TL+EA+KVL E+ L+T+AKDVK QIEFNPA V+EYR IGYE R L E FNND VDA Sbjct: 415 TLNEARKVLVDEISATLLTIAKDVKVQIEFNPALVSEYRLIGYENRALNREDFNNDKVDA 474 Query: 438 GDIGAGKHITLLFELT-LNGQKASIDKLRYAPDNKLAKSDKT-KELAWLKIRWKYPQGKE 495 G+IGAG +T L+EL+ ++ + D LRY D K K + ELA+LK+R+K ++ Sbjct: 475 GEIGAGHRVTALYELSFVDSPNQANDVLRYGLDIKTGKEKYSRDELAYLKLRYKPIGQEK 534 Query: 496 SQLVEFPLGPT-----INAPSEDMRFRAAVAAYGQKLRGSEYLNNTSWQQIKQWAQQAKG 550 S+L+ +P+ + S+D RF AAVA +GQ + S YL++ + ++ AQ A G Sbjct: 535 SKLISYPVLTSTAINEFAQASDDFRFAAAVAGFGQLINHSHYLHDMDYAKVSDIAQAAMG 594 Query: 551 EDPQGYRAEFIRLIELADGVT 571 ED GYR EF++L + A + Sbjct: 595 EDSFGYRHEFVQLTKTAGLLA 615 >UniRef50_A1ZHE2 Von Willebrand factor type A domain protein n=1 Tax=Microscilla marina ATCC 23134 RepID=A1ZHE2_9SPHI Length = 704 Score = 536 bits (1380), Expect = e-150, Method: Composition-based stats. Identities = 207/497 (41%), Positives = 301/497 (60%), Gaps = 11/497 (2%) Query: 78 GRLQEAPTFARAAKAKATHIANPGTARYQQFDDNPVKQVAQNPLATFSLDVDTGSYANVR 137 G L F R A A A P RY +N QV QNPL+TFS+DVD SY+NVR Sbjct: 207 GDLSSDLKFDRKA---AFRNAFPEGERYATIYENQFYQVGQNPLSTFSIDVDNASYSNVR 263 Query: 138 RFLNQGLLPPPDAVRVEEIVNYFPSDWDIKDKQSIPAS--KPIPFAMRYELAPAPWNEQR 195 RF+N G P +AVRVEE++NYF D+ + PF++ E PWN Sbjct: 264 RFVNDGQPLPKNAVRVEEMINYFEYDYPQPTPTKDKEGKLQTHPFSVNTEYGTCPWNPHH 323 Query: 196 TLLKVDILAKDRKSEELPASNLVFLIDTSGSMISDERLPLIQSSLKLLVKELRE-QDNIA 254 LL++ + ++ +++ +NLVFL+D SGSM S+++LPL++ S K+L+K+L + + IA Sbjct: 324 KLLQIGLQGENLQTKNASPANLVFLVDASGSMDSEDKLPLLKRSFKVLLKQLTDSRTKIA 383 Query: 255 IVTYAGDSRIALPSISGSHKAEINAAIDSLDAEGSTNGGAGLELAYQQATKGFIKGGINR 314 IV YAG S + LP+ S SH+ +I A++++++ GST GG G+ELAY+ A + FI GG NR Sbjct: 384 IVAYAGASGLVLPATSVSHREKILTALENIESGGSTAGGEGIELAYKIAQQAFIAGGNNR 443 Query: 315 ILLATDGDFNVGIDDPKSIESMVKKQRESGVTLSTFGVGNSNYNEAMMVRIADVGNGNYS 374 ++LATDGDFNVG+ + + ++ +R+SGV L+ G G N N++MM ++ + GNGNY Sbjct: 444 VILATDGDFNVGLSSDEELMQLISNKRKSGVYLTCLGFGTGNLNDSMMEKLTNAGNGNYY 503 Query: 375 YIDTLSEAQKVLNSEMRQMLITVAKDVKAQIEFNPAWVTEYRQIGYEKRQLRVEHFNNDN 434 YID ++EA+KVL + L +AKDVK Q+EFNPA V YR +GYE R L+ F ND Sbjct: 504 YIDGINEAKKVLAKNLTGTLYAIAKDVKIQLEFNPARVKSYRLVGYENRVLKHRDFKNDQ 563 Query: 435 VDAGDIGAGKHITLLFELTL----NGQKASIDKLRYAPDNKLAKSDKTKELAWLKIRWKY 490 VDAG++G G +T L+E+ A L+Y + + EL +K+R+K Sbjct: 564 VDAGELGVGHTVTALYEIVPVNRTQPMLADEIPLKYQTTQIDSAALANNELVTIKLRYKR 623 Query: 491 PQGKESQLVEFPLGP-TINAPSEDMRFRAAVAAYGQKLRGSEYLNNTSWQQIKQWAQQAK 549 P+ +S+L+E + + S + +F VAA+G +LR S Y+ NTS+QQI W Q AK Sbjct: 624 PKENKSRLIEKVVKNKLVTQTSNNFKFATTVAAFGMRLRNSPYVGNTSYQQIYSWGQYAK 683 Query: 550 GEDPQGYRAEFIRLIEL 566 D GYR EF+ L++ Sbjct: 684 SVDSNGYRREFLELVKK 700 >UniRef50_C5BTM6 von Willebrand factor type A domain protein n=1 Tax=Teredinibacter turnerae T7901 RepID=C5BTM6_TERTT Length = 689 Score = 532 bits (1370), Expect = e-149, Method: Composition-based stats. Identities = 231/572 (40%), Positives = 333/572 (58%), Gaps = 22/572 (3%) Query: 3 NKNIIMLLMSSLILSGCGPQPENKESQQQQ--PSTPTEQQVLAAQQAAIKEAEQSAAAAK 60 K ++ + + + P+PE + Q+ + + L + +AA ++ AA Sbjct: 129 GKEVVEEVKVTGMRESLQPEPETQRHALQEYDAISAKDIGALPSHEAARNLQRFASPAAS 188 Query: 61 ALAQQEVQQYSDKQALQGRLQEAPTFARAAKAKATHIANPGTARYQQFDDNPVKQVAQNP 120 + A++EV + + R + + + NP+K + P Sbjct: 189 SEAKREVLMKREASSFMPR--------KPDLEPPHQLETADRDHFDTVATNPIKVTREEP 240 Query: 121 LATFSLDVDTGSYANVRRFLNQGLLPPPDAVRVEEIVNYFPSDWDIKDKQSIPASKPIPF 180 ++TFS+DVDT SY+ VRR LN+G LP AVR+EE+VNYFP D+ + P++ PF Sbjct: 241 VSTFSIDVDTASYSFVRRQLNRGQLPQKAAVRLEEMVNYFPYDYPL------PSAATAPF 294 Query: 181 AMRYELAPAPWNEQRTLLKVDILAKDRKSEELPASNLVFLIDTSGSMISDERLPLIQSSL 240 + PAPWN+ + L+ + I A P +NLVFL+D SGSM S ++LPL++ S+ Sbjct: 295 KPTITVIPAPWNQAKRLVHIGIKA--LPLAHPPKANLVFLLDVSGSMGSPDKLPLVKQSM 352 Query: 241 KLLVKELREQDNIAIVTYAGDSRIALPSISGSHKAEINAAIDSLDAEGSTNGGAGLELAY 300 +LL+ L+ D ++IV YAG + L + + +I AA+D L+A GST G G+ELAY Sbjct: 353 ELLLSGLQPTDTVSIVVYAGAAGTVLEPTPVAEQQKILAALDRLNAGGSTAGAQGIELAY 412 Query: 301 QQATKGFIKGGINRILLATDGDFNVGIDDPKSIESMVKKQRESGVTLSTFGVGNSNYNEA 360 Q A + + +NRI+LATDGDFNVGI DP+ ++ V+++R +G+ LS G G+ NYN+A Sbjct: 413 QLAEANYQRDAVNRIILATDGDFNVGIADPEQLKGYVERKRANGIELSILGFGSGNYNDA 472 Query: 361 MMVRIADVGNGNYSYIDTLSEAQKVLNSEMRQMLITVAKDVKAQIEFNPAWVTEYRQIGY 420 +M ++A GNG +YIDTLSEAQKVL + L TVAKDVK Q+EFNPA V EYR +GY Sbjct: 473 LMQQLAQNGNGVAAYIDTLSEAQKVLVEQASGTLFTVAKDVKIQVEFNPATVAEYRLLGY 532 Query: 421 EKRQLRVEHFNNDNVDAGDIGAGKHITLLFELTLNGQKAS-IDKLRYAPDNKLAKSDKTK 479 E R L+ E FNND VDAG+IGAG +T ++E+T G KA+ ID RYA +DK Sbjct: 533 ETRALKREDFNNDAVDAGEIGAGHTVTAIYEITPAGSKAALIDSQRYAAAKIATDTDKAS 592 Query: 480 ELAWLKIRWKYPQGKESQLVEFPLGPTINAPSEDMR---FRAAVAAYGQKLRGSEYLNNT 536 E +LKIR+K P G ESQL+ P+ ++ +R F AAVA + Q L+ YL N Sbjct: 593 EYGFLKIRYKQPGGSESQLISAPIPVAMDTTQTQLREAQFGAAVAGFAQWLKDPRYLGNW 652 Query: 537 SWQQIKQWAQQAKGEDPQGYRAEFIRLIELAD 568 S + AQ KG+DP GYR EF++L+ A Sbjct: 653 SLDDALKLAQANKGDDPYGYRTEFVQLVRKAK 684 >UniRef50_UPI00016C54C8 hypothetical protein GobsU_21830 n=1 Tax=Gemmata obscuriglobus UQM 2246 RepID=UPI00016C54C8 Length = 638 Score = 531 bits (1368), Expect = e-149, Method: Composition-based stats. Identities = 209/564 (37%), Positives = 313/564 (55%), Gaps = 21/564 (3%) Query: 21 PQPENKESQQQQPSTPTEQQVLAAQQAAIKEAEQSAAAAKALAQQEVQQYSDKQALQGRL 80 PQ + ++ P A A + A A ++ + A + Sbjct: 82 PQGDIAPIGKKGEHAP-----GPALPGLPSPAPLAEPAMPAGPDALGKRIAASSAPFASV 136 Query: 81 QEAPTFARAAKAKATHIANP-------GTARYQQFDDNPVKQVAQNPLATFSLDVDTGSY 133 + A A + P Y ++ +N + L+TFS DV+T SY Sbjct: 137 VASGGGAFGGVRGAANKPAPRDGYNAQNAEAYGRYQENEFRSPLVAALSTFSADVNTASY 196 Query: 134 ANVRRFLNQGLLPPPDAVRVEEIVNYFPSDWDIKDKQSIPASKPIPFAMRYELAPAPWNE 193 ANVRR LN+G LPP AV + E VNYFP + + P + P A E+ P PWN Sbjct: 197 ANVRRMLNEGTLPPASAVFLAEFVNYFPYSY------APPPAGADPVAFHVEMGPCPWNA 250 Query: 194 QRTLLKVDILAKDRKSEELPASNLVFLIDTSGSMISDERLPLIQSSLKLLVKELREQDNI 253 + LL+V + A +E+LP NLVFL+DTSGSM + RLPL+Q SL+LLV++L E+D + Sbjct: 251 KHHLLRVGVQAHQIPAEKLPPRNLVFLVDTSGSMQQENRLPLVQKSLELLVEKLTEKDRV 310 Query: 254 AIVTYAGDSRIALPSISGSHKAEINAAIDSLDAEGSTNGGAGLELAYQQATKGFIKGGIN 313 ++VTYAGDSR+ALP SG+ K I + L A G TNG G++ AYQ A F+ GG+N Sbjct: 311 SVVTYAGDSRVALPPTSGADKKAILDVVTGLQANGGTNGEGGIKKAYQFARDTFLDGGVN 370 Query: 314 RILLATDGDFNVGIDDPKSIESMVKKQRESGVTLSTFGVGNSNYNEAMMVRIADVGNGNY 373 R++L TDGDFNVG+ D + ++++QR+S V L+ G G NY + + +A+ GNG++ Sbjct: 371 RVILCTDGDFNVGVVDNGELVKLIEEQRKSKVFLTVLGYGMGNYKDDRLKELANHGNGHH 430 Query: 374 SYIDTLSEAQKVLNSEMRQMLITVAKDVKAQIEFNPAWVTEYRQIGYEKRQLRVEHFNND 433 +YIDTL EA+KV + L+ VAKDVK QI+FNPA V YR +GYE R L+ E F ND Sbjct: 431 AYIDTLDEAKKVFVEQ-GGALVCVAKDVKFQIDFNPAKVNAYRLVGYENRLLKDEDFKND 489 Query: 434 NVDAGDIGAGKHITLLFELTLNGQKASIDKLRYA-PDNKLAKSDKTKELAWLKIRWKYPQ 492 DAGD+G+G +T+L+E+ G K + ++ + K ++ + E +K+R+K+P Sbjct: 490 AKDAGDVGSGHQVTVLYEIVPPGVKVDLPEVDASKYQKKDVPANASDEWLTVKMRYKHPD 549 Query: 493 GKESQLVEFP-LGPTINAPSEDMRFRAAVAAYGQKLRGSEYLNNTSWQQIKQWAQQAKGE 551 S+ + G S+D RF AAVA++G LR S++ ++ + + AQ A G Sbjct: 550 EDVSKELTAAHKGAVAKELSDDFRFAAAVASFGMLLRDSKFKGAMTYAGVLEEAQGALGA 609 Query: 552 DPQGYRAEFIRLIELADGVTDISQ 575 DP +R +F+ L+ A ++++ + Sbjct: 610 DPNNHRKQFLELVRRAKELSNVQK 633 >UniRef50_B0T5X0 von Willebrand factor type A n=1 Tax=Caulobacter sp. K31 RepID=B0T5X0_CAUSK Length = 592 Score = 530 bits (1366), Expect = e-149, Method: Composition-based stats. Identities = 228/561 (40%), Positives = 313/561 (55%), Gaps = 20/561 (3%) Query: 18 GCGPQPENKESQQQQPSTPTEQQVLAAQQAAIKEAEQSAAAAKALAQQEVQQYSDKQALQ 77 C + S +++ + + A L Sbjct: 41 ACAALGYAAPAPDGYDSVVVTATKRTSREQRLTSRARPVIATPGLTPPPPPPPPSPPPPP 100 Query: 78 GRLQEAPTFARAAKAKATHIANP--GTARYQQFDDNPVKQVAQNPLATFSLDVDTGSYAN 135 +FA + A + A P T +Y NPVK+VA+ P++TFS+DVDT +YAN Sbjct: 101 AAY----SFAAPSPVVAPNFAPPIRDTEKYPGAAANPVKRVAEEPVSTFSIDVDTAAYAN 156 Query: 136 VRRFLNQGLLPPPDAVRVEEIVNYFPSDWDIKDKQSIPASKPIPFAMRYELAPAPWNEQR 195 VRRFLN+G PP DA+RVEE++NYF + + P ++ PF + P+PW++ R Sbjct: 157 VRRFLNEGAAPPHDALRVEELINYFDYGY------ARPTAQEPPFKPTVTVVPSPWSQDR 210 Query: 196 TLLKVDILAKDRKSEELPASNLVFLIDTSGSMISDERLPLIQSSLKLLVKELREQDNIAI 255 L+ + + P NLVFLIDTSGSM +RLPL + +L +L+ +LR QD +++ Sbjct: 211 QLMHIGVQGYATPRAGQPPLNLVFLIDTSGSMSGPDRLPLAKKALNVLIDQLRPQDRVSM 270 Query: 256 VTYAGDSRIALPSISGSHKAEINAAIDSLDAEGSTNGGAGLELAYQQATKGFIKGGINRI 315 V YAG + L G K ++ A+ +L + GST GG GLELAY A + +NR+ Sbjct: 271 VAYAGSAGAVLSPTDGKSKLKMRCALTALRSGGSTAGGQGLELAYALARQNLDPKAVNRV 330 Query: 316 LLATDGDFNVGIDDPKSIESMVKKQRESGVTLSTFGVGNSNYNEAMMVRIADVGNGNYSY 375 +L TDGDFNVGI DP ++ V QR+SGV LS +G G NYN+ MM +A GNG +Y Sbjct: 331 ILMTDGDFNVGIADPTRLKDFVADQRKSGVYLSVYGFGRGNYNDTMMQALAQNGNGTAAY 390 Query: 376 IDTLSEAQKVLNSEMRQMLITVAKDVKAQIEFNPAWVTEYRQIGYEKRQLRVEHFNNDNV 435 +D L EA+K+L + L +A DVK Q+EFNPA V+EYR IGYE R L E FNND V Sbjct: 391 VDGLQEARKLLRDDFDSALFPIADDVKIQVEFNPAKVSEYRLIGYETRLLNREDFNNDQV 450 Query: 436 DAGDIGAGKHITLLFELTLNGQKASIDKLRYAPDNKLAKSDKTKELAWLKIRWKYPQGKE 495 DAG+IG+G +T ++E+T G K S D LRY A ELA+LKIR+K P G Sbjct: 451 DAGEIGSGAAVTAIYEITPVGAKPSSDPLRYGAKPSPATGGS--ELAFLKIRYKPPGGST 508 Query: 496 SQLVEFPLG-----PTINAPSEDMRFRAAVAAYGQKLRGSEYLN-NTSWQQIKQWAQQAK 549 S+L+E P+G ++ A E RF AVAAYGQKLRG +++ + W + AQ A+ Sbjct: 509 SKLIERPIGAGDMHASLAAAPEATRFAVAVAAYGQKLRGDPWVDASFDWDAVTALAQGAR 568 Query: 550 GEDPQGYRAEFIRLIELADGV 570 GEDP G RAEF++L A V Sbjct: 569 GEDPYGLRAEFVQLTRAAKDV 589 >UniRef50_B0CCM8 von Willebrand factor, type A domain protein n=4 Tax=Cyanobacteria RepID=B0CCM8_ACAM1 Length = 686 Score = 523 bits (1348), Expect = e-147, Method: Composition-based stats. Identities = 223/565 (39%), Positives = 324/565 (57%), Gaps = 23/565 (4%) Query: 24 ENKESQQQQPSTPTEQQVLAAQQAAIKEAEQSAAAAKALAQQEVQQYSDKQALQGRLQEA 83 E + +Q++ + + + A A A+ Q + A Sbjct: 119 ELSQVRQRRKPSNRRFGISPRRPRPTGLPPALTKAQPAPAETAAQSQFSRDQSGRMKSVA 178 Query: 84 PTFARAAKAKATHIANP---------GTARYQQFDDNPVKQVAQNPLATFSLDVDTGSYA 134 P A A + T Y++ ++NP + PL+TFS+DVDT SY+ Sbjct: 179 PPAGLAPPAPEPRFQDKDRLHLPGTFNTEDYKRINENPFFLPQRTPLSTFSIDVDTASYS 238 Query: 135 NVRRFLNQGLLPPPDAVRVEEIVNYFPSDWDIKDKQSIPASKPIPFAMRYELAPAPWNEQ 194 NVRRF+ QG LPP DAVR+EE++NYF + PF++ E+A APWN Q Sbjct: 239 NVRRFIRQGQLPPKDAVRLEELINYFDYGYASPK-------GDQPFSVSTEVATAPWNNQ 291 Query: 195 RTLLKVDILAKDRKSEELPASNLVFLIDTSGSMISDERLPLIQSSLKLLVKELREQDNIA 254 L+ + + K+ + E+ SNLVFLID SGSM +L L++ SL LLV +L+ +D ++ Sbjct: 292 HKLVHIGLKGKELEKEQ--PSNLVFLIDVSGSMKRPNKLALVKKSLCLLVHQLKPEDRVS 349 Query: 255 IVTYAGDSRIALPSISGSHKAEINAAIDSLDAEGSTNGGAGLELAYQQATKGFIKGGINR 314 +V YAG + I LPS G+ KA I AID L+A GST G AG+++AY A + F+K G NR Sbjct: 350 LVVYAGRAGIVLPSTPGTQKATIMNAIDRLEAGGSTAGAAGIKMAYDMAERHFLKNGNNR 409 Query: 315 ILLATDGDFNVGIDDPKSIESMVKKQRESGVTLSTFGVGNSNYNEAMMVRIADVGNGNYS 374 ++LATDGDFNVG +E +++++R+ GV L+ G G NY + M +A+ GNGNY+ Sbjct: 410 VILATDGDFNVGQSSDAELERLIEQKRDRGVFLTVLGYGTGNYKDNKMELLANKGNGNYA 469 Query: 375 YIDTLSEAQKVLNSEMRQMLITVAKDVKAQIEFNPAWVTEYRQIGYEKRQLRVEHFNNDN 434 YIDTL EAQKVL +++R L T+AKDVK Q+EFNP V YR IGYE R LR + FN+D Sbjct: 470 YIDTLLEAQKVLVNDLRGTLFTIAKDVKIQVEFNPGKVQAYRLIGYENRLLRDQDFNDDR 529 Query: 435 VDAGDIGAGKHITLLFELTLNGQKASIDKLRYAPDN--KLAKSDKTKELAWLKIRWKYPQ 492 DAG+IG+G IT L+E+ G K+ ++ P K S + +L LK+R+K P Sbjct: 530 KDAGEIGSGHTITALYEVIPTGVKSDVELPDIDPLKFQKPTASSNSSDLMNLKLRYKQPT 589 Query: 493 GKESQLVEFPLGP---TINAPSEDMRFRAAVAAYGQKLRGSEYLNNTSWQQIKQWAQQAK 549 G +SQL+ + +I + +++++F AAVA YG LR S+Y ++ Q+ A QAK Sbjct: 590 GSKSQLISTAIADKNRSIQSATDNLKFSAAVAMYGMVLRDSDYKGKATFNQVLDLADQAK 649 Query: 550 GEDPQGYRAEFIRLIELADGVTDIS 574 G+DPQGYR F++L+E + + Sbjct: 650 GKDPQGYRMAFMQLVERSQTLQQAK 674 >UniRef50_Q7UP85 Putative uncharacterized protein n=1 Tax=Rhodopirellula baltica RepID=Q7UP85_RHOBA Length = 885 Score = 521 bits (1343), Expect = e-146, Method: Composition-based stats. Identities = 228/574 (39%), Positives = 324/574 (56%), Gaps = 42/574 (7%) Query: 34 STPTEQQVLAAQQAAIKEAEQSAAAAKALAQQEVQQYSDKQALQGRLQE---------AP 84 + P ++ A ++ + + A + +Q +Q D +A +GR E AP Sbjct: 313 TAPKDEAPTAPREPSAGKPVVGDFAVAPVPEQLGRQQFDFRASRGRTLERQLGETEELAP 372 Query: 85 TFARAAKAKATHI---ANPGT--ARYQQFDDNPVKQVAQNPLATFSLDVDTGSYANVRRF 139 T R A T PG +++ +N ++VA + L+TFS+DVDT SYA VR + Sbjct: 373 TSDRLAILPPTPDGEGQGPGMSGDKFEPIQENEFRRVADDALSTFSIDVDTASYAKVRSY 432 Query: 140 LNQGLLPPPDAVRVEEIVNYFPSDWDIKDKQSIPASKPIPFAMRYELAPAPWNEQRTLLK 199 L +G LP PD+VR+EE++NYF + + P+PF+ +A PWNE L++ Sbjct: 433 LQRGQLPRPDSVRIEELINYFDYQYTPPSAE-----DPVPFSSAMAVASCPWNENNRLVR 487 Query: 200 VDILAKDRKSEELPASNLVFLIDTSGSMISDERLPLIQSSLKLLVKELREQDNIAIVTYA 259 V I AKD +E P NLVFLIDTSGSM +LPL+ +K+L+ +L+ +D +AIV YA Sbjct: 488 VGIQAKDIDRKERPRCNLVFLIDTSGSMKRPNKLPLVIEGMKVLLDQLKNRDRVAIVVYA 547 Query: 260 GDSRIALPSISGSHKAEINAAIDSLDAEGSTNGGAGLELAYQQATKGFIKGGINRILLAT 319 G S + L S K +I A+ +L A GSTNGGAGL+LAYQ A + FI+ G+NR++L + Sbjct: 548 GSSGLVLDSTPVKQKKKIIRALSALSAGGSTNGGAGLQLAYQTARENFIEDGVNRVILCS 607 Query: 320 DGDFNVGIDDPKSIESMVKKQRESGVTLSTFGVGNSNYNEAMMVRIADVGNGNYSYIDTL 379 DGDFNVG+ + + +Q +SG L+ G G N+N+AMM RI++ G GNY+++DT+ Sbjct: 608 DGDFNVGMTGTDQLVAEATRQSKSGTELTVLGFGMGNHNDAMMERISNSGAGNYAFVDTI 667 Query: 380 SEAQKVLNSEMRQMLITVAKDVKAQIEFNPAWVTEYRQIGYEKRQLRVEHFNNDNVDAGD 439 +EA+KVL ++ L TVAKDVK QIEFNPA V+ YR IGYE R L E FN+D VDAG+ Sbjct: 668 AEAKKVLADQVAGTLFTVAKDVKIQIEFNPAVVSAYRLIGYENRVLAKEDFNDDKVDAGE 727 Query: 440 IGAGKHITLLFELTLNGQ-----KASIDKLRYAPD---------------NKLAKSDKTK 479 IGAG +T L+E+ G+ +D L+Y P K + TK Sbjct: 728 IGAGHRVTALYEIAPVGKLPDSIAPDVDPLKYQPSGEENPDSQEANEPRVPKDSDESATK 787 Query: 480 ELAWLKIRWKYPQGKESQLVEFPL---GPTINAPSEDMRFRAAVAAYGQKLRGSEYLNNT 536 E+ LKIR K PQG S+ + FPL D +F AVA +G +LR S + Sbjct: 788 EILTLKIRHKPPQGDVSEKLAFPLVNESVPFQEADTDFQFAVAVAVFGMQLRNSTHAGTW 847 Query: 537 SWQQIKQWAQQAKGEDPQGYRAEFIRLIELADGV 570 + + A AKG+D G RAEF+ L A+ + Sbjct: 848 TMDDVIATATNAKGDDEHGLRAEFLELARTAERL 881 >UniRef50_C0CVB5 Putative uncharacterized protein n=1 Tax=Clostridium asparagiforme DSM 15981 RepID=C0CVB5_9CLOT Length = 556 Score = 520 bits (1340), Expect = e-146, Method: Composition-based stats. Identities = 212/591 (35%), Positives = 311/591 (52%), Gaps = 67/591 (11%) Query: 2 RNKNI-IMLLMSSLI---LSGCGPQPENKESQQQQPSTPTEQQVLAAQQAAIKEAEQSAA 57 R K + I LLM +L+ LSGCG + + ++ TE +V +AE + Sbjct: 3 RGKQLTIGLLMCALLAGLLSGCG-------AGGGKTASATEAEV---------KAEAGSY 46 Query: 58 AAKALAQQEVQQYSDKQALQGRLQEAPTFARAAKAKATHIANPGTARYQQFDDNPVKQVA 117 A++ +A Q + +A E P + T Y +N VA Sbjct: 47 ASETMAAQSQWDGAVMEA------EGPPLSH------------NTEEYNYIAENAFLAVA 88 Query: 118 QNPLATFSLDVDTGSYANVRRFLNQGLLPPPDAVRVEEIVNYFPSDWDIKDKQSIPASKP 177 PL+TF+ DVDT SYAN+RR + +G P DAVR+EE++NYF D+ ++ Sbjct: 89 NAPLSTFAADVDTASYANLRRKILEGNEVPADAVRIEEMLNYFTYDYPEP-------TED 141 Query: 178 IPFAMRYELAPAPWNEQRTLLKVDILAKDRKSEELPASNLVFLIDTSGSMISDERLPLIQ 237 PF++ + PWNE LL++ + A+ E SNLVFLID SGSM S ++L L++ Sbjct: 142 EPFSVTTYIGDCPWNENHKLLQIGLQAEKPDLENQKPSNLVFLIDVSGSMESADKLGLVK 201 Query: 238 SSLKLLVKELREQDNIAIVTYAGDSRIALPSISGSHKAEINAAIDSLDAEGSTNGGAGLE 297 + LL + LR +D ++IVTYA + L +SG KA I AI++L A GST+G G+E Sbjct: 202 RAFLLLTENLRPEDTVSIVTYASSDTVVLDGVSGEEKAAIMTAIENLTAGGSTDGSKGIE 261 Query: 298 LAYQQATKGFIKGGINRILLATDGDFNVGIDDPKSIESMVKKQRESGVTLSTFGVGNSNY 357 AY+ A + F K G NR++LATDGD N+G+ + +++K++ESGV LS G G N Sbjct: 262 TAYRLAEEHFQKDGNNRVILATDGDLNLGLTSEGDLTRLIQKKKESGVFLSVMGFGTGNI 321 Query: 358 NEAMMVRIADVGNGNYSYIDTLSEAQKVLNSEMRQMLITVAKDVKAQIEFNPAWVTEYRQ 417 + M +AD GNG Y+Y+D+L EA++VL E+ L TVAKDVK Q+EFNPA V YR Sbjct: 322 KDNKMEALADNGNGQYAYVDSLMEAKRVLVEELGGTLFTVAKDVKLQVEFNPAKVKGYRL 381 Query: 418 IGYEKRQLRVEHFNNDNVDAGDIGAGKHITLLFELTLNGQKASIDKLRYA---------- 467 IGYE R + F++D D G+IGAG +T L+EL G + ++ Sbjct: 382 IGYENRLMEARDFDDDAKDGGEIGAGHRVTALYELVPAGSDEDLGEVELKYGAGNVAAAE 441 Query: 468 ----------PDNKLAKSDKTKELAWLKIRWKYPQGKESQLVEFPLGPTI--NAPSEDMR 515 P E LK+R+K P G++S+L+E+P+ + DMR Sbjct: 442 NGENGGAEARPAEGAPAPGADSEWLTLKVRYKEPDGEQSRLLEYPVDDSAVCRELPPDMR 501 Query: 516 FRAAVAAYGQKLRGSEYLNNTSWQQIKQWAQQAKGEDPQGYRAEFIRLIEL 566 F + VA G LR SEY +S++ I ++ G Y+ EF+ L++ Sbjct: 502 FASCVAQTGMLLRDSEYAGGSSYKAIAAELERIDGLRGDPYKEEFLYLVKR 552 >UniRef50_C6BAR1 von Willebrand factor type A n=7 Tax=Alphaproteobacteria RepID=C6BAR1_RHILS Length = 706 Score = 520 bits (1339), Expect = e-146, Method: Composition-based stats. Identities = 220/562 (39%), Positives = 324/562 (57%), Gaps = 26/562 (4%) Query: 21 PQPENKESQQQQPSTPTEQQVLAAQQAAIKEAEQSAAAAKALAQQEVQQYSDKQALQGRL 80 P + + + A AA+ +++A AA + Q +Q+++ A Sbjct: 160 PDASQTSEYDANAALTNKPEGSA---AALGATKRAAPAAPGIVPQ--RQFAEPMAAI--- 211 Query: 81 QEAPTFARAAKAKATHIANPGTARYQQFDDNPVKQVAQNPLATFSLDVDTGSYANVRRFL 140 AP+ A+ + +P R+ NP+K VA +P++TFS DVD+ SYA VRR L Sbjct: 212 --APSPVPPAEGRMQMQLDPNRERFANAAANPIKSVATDPVSTFSADVDSASYAFVRRSL 269 Query: 141 NQGLLPPPDAVRVEEIVNYFPSDWDIKDKQSIPASKPIPFAMRYELAPAPWNEQRTLLKV 200 G +P P +VRVEE++NYFP DW P + PF + P PWN L+ V Sbjct: 270 TGGAMPDPLSVRVEEMINYFPYDW------PGPNNADQPFKATVTVMPTPWNRDTELMHV 323 Query: 201 DILAKDRKSEELPASNLVFLIDTSGSMISDERLPLIQSSLKLLVKELREQDNIAIVTYAG 260 I D P +NLVFLID SGSM ++LPL++S+ +L+V L+ D ++IVTYAG Sbjct: 324 AIKGYDIAPATTPRANLVFLIDVSGSMDEPDKLPLLKSAFRLMVNRLKADDTVSIVTYAG 383 Query: 261 DSRIALPSISGSHKAEINAAIDSLDAEGSTNGGAGLELAYQQATKGFIKGGINRILLATD 320 ++ L + K++I +AID L+ GST G G+E AY A +GF+K G+NR++LATD Sbjct: 384 NAGTVLAPTRVAEKSKILSAIDRLEPGGSTGGAEGIEAAYDLAKQGFVKDGVNRVMLATD 443 Query: 321 GDFNVGIDDPKSIESMVKKQRESGVTLSTFGVGNSNYNEAMMVRIADVGNGNYSYIDTLS 380 GDFNVG ++ +++++R+ G+ L+ G G N N+++M +A GNG+ +YIDTL+ Sbjct: 444 GDFNVGPSSDGDLKRIIEEKRKDGIFLTVLGFGRGNLNDSLMQTLAQNGNGSAAYIDTLA 503 Query: 381 EAQKVLNSEMRQMLITVAKDVKAQIEFNPAWVTEYRQIGYEKRQLRVEHFNNDNVDAGDI 440 EAQK L E L +A DVK Q+EFNP + EYR IGYE R L E FNND VDAGDI Sbjct: 504 EAQKTLVEEAGSTLFPIASDVKFQVEFNPERIAEYRLIGYETRALNREDFNNDRVDAGDI 563 Query: 441 GAGKHITLLFELTLNGQKASI-DKLRYAPDNKLAKSDKT----KELAWLKIRWKYPQGKE 495 G+G +T ++E+T G A + D LRY +K+ ELA++K+R+K P + Sbjct: 564 GSGHSVTAIYEITPKGSPAVMNDDLRYGAADKVPAEASDSAHHGELAFVKMRYKRPGEDK 623 Query: 496 SQLVEFPLGP-----TINAPSEDMRFRAAVAAYGQKLRGSEYLNNTSWQQIKQWAQQAKG 550 S L+ P+ T++A +D+RF AVAA+GQKL ++ S+Q I A ++G Sbjct: 624 SALITTPVNDGNAVATVDAAPQDVRFSVAVAAFGQKLSHVAAVDTYSYQAIADLAAASRG 683 Query: 551 EDPQGYRAEFIRLIELADGVTD 572 D GYR++F+ L+ LADG++ Sbjct: 684 TDTFGYRSDFLGLVRLADGLSQ 705 >UniRef50_A1ZU67 Von Willebrand factor type A domain protein n=1 Tax=Microscilla marina ATCC 23134 RepID=A1ZU67_9SPHI Length = 552 Score = 515 bits (1327), Expect = e-144, Method: Composition-based stats. Identities = 192/479 (40%), Positives = 295/479 (61%), Gaps = 11/479 (2%) Query: 97 IANPGTARYQQFDDNPVKQVAQNPLATFSLDVDTGSYANVRRFLNQGLLPPPDAVRVEEI 156 ++ P + ++N V PL+TFS+DVD SY+ R+ +N G LP +VR+EE Sbjct: 79 VSPPKIKEKKPANENTFLSVKTAPLSTFSIDVDNASYSRARKSINNGQLPSTSSVRLEEF 138 Query: 157 VNYFPSDWDIKDKQSIPASKPIPFAMRYELAPAPWNEQRTLLKVDILAKDRKSEELPASN 216 +NYF + + Q PF++ E+A PWN + L+ + + K S +L SN Sbjct: 139 INYFNYQYKQPEGQH-------PFSVNTEVAKCPWNPKNHLVHIGLQGKRLDSRKLKLSN 191 Query: 217 LVFLIDTSGSMISDERLPLIQSSLKLLVKELREQDNIAIVTYAGDSRIALPSISGSHKAE 276 LVFLID SGSM + ++LPL++ + K+LV L E+D +AIV YAG++ + LP+ G+ K + Sbjct: 192 LVFLIDVSGSMSAPDKLPLLRKAFKMLVNNLGEEDRVAIVVYAGNAGLVLPATQGTDKQK 251 Query: 277 INAAIDSLDAEGSTNGGAGLELAYQQATKGFIKGGINRILLATDGDFNVGIDDPKSIESM 336 I A+D L + GST GGAG++LAY+ A + FIK G NRI+LATDGDFN+G ++++++ Sbjct: 252 IMEALDKLQSGGSTAGGAGIKLAYKIAKQNFIKEGNNRIILATDGDFNLGASSDQAMQNL 311 Query: 337 VKKQRESGVTLSTFGVGNSNYNEAMMVRIADVGNGNYSYIDTLSEAQKVLNSEMRQMLIT 396 ++++R+ GV ++ G+G NY ++ M IAD GNGNY Y+D L+EA KV +++ L T Sbjct: 312 IEEKRKEGVFITVLGLGMGNYRDSKMEIIADKGNGNYYYLDNLNEAYKVFGKDLKGTLFT 371 Query: 397 VAKDVKAQIEFNPAWVTEYRQIGYEKRQLRVEHFNNDNVDAGDIGAGKHITLLFELTLNG 456 +AKDVK Q+EFN A V YR IGYE R L F +D DAG+IGAG +T L+E+TL+ Sbjct: 372 IAKDVKIQVEFNSAVVKSYRLIGYENRLLANRDFRDDTKDAGEIGAGHTVTALYEVTLHS 431 Query: 457 -QKASIDKLRYAPDNKLAKSDKTKELAWLKIRWKYPQGK---ESQLVEFPLGPTINAPSE 512 + P N A ++L +++R+K P+G E+ + +++ S Sbjct: 432 NPQTVAVDQNQIPANFQATQFNNQQLMNVRLRYKKPEGSTGIETSQIIAANHQSVDETSH 491 Query: 513 DMRFRAAVAAYGQKLRGSEYLNNTSWQQIKQWAQQAKGEDPQGYRAEFIRLIELADGVT 571 + RF AAVA++G L+ S+Y +T++Q + A+ +KG+D YRAEFI L++ A +T Sbjct: 492 NFRFSAAVASFGMLLKNSQYKGSTTFQTVLTLAKGSKGKDMNQYRAEFIDLVQKASQIT 550 >UniRef50_B1ZYN3 von Willebrand factor type A n=2 Tax=Bacteria RepID=B1ZYN3_OPITP Length = 792 Score = 514 bits (1324), Expect = e-144, Method: Composition-based stats. Identities = 227/572 (39%), Positives = 315/572 (55%), Gaps = 19/572 (3%) Query: 20 GPQPENKESQQQQPSTPTEQQVLAAQQAAIKEAEQSAAAAKALAQQEVQQYSDKQALQGR 79 P+ + + T++QV A+ A Q+ A A+ G Sbjct: 220 APESTAFGALARAELRETQRQVRQARAQKKDAAMQALLVANEEPAALSSFPGQAPAMDGY 279 Query: 80 LQEAPTFARAAKAKATHIANPGTARYQQFDDNPVKQVAQNPLATFSLDVDTGSYANVRRF 139 + + + H T Y+ ++ ++PL+TF+ DVDT SYANVRRF Sbjct: 280 IASTTFAGIGTRVRGDHRQAMNTEAYRFLRESDFLSAREHPLSTFAADVDTASYANVRRF 339 Query: 140 LNQGLLPPPDAVRVEEIVNYFPSDWDIK---DKQSIPASKPIPFAMRYELAPAPWNEQRT 196 L +G LPP DAVR+EE+VNYFP + + + A PFA E+A APW Q Sbjct: 340 LREGRLPPADAVRIEELVNYFPYRYAAPGRVRDEGVAAPGEAPFAAALEVAAAPWAAQHR 399 Query: 197 LLKVDILAKDRKSEELPASNLVFLIDTSGSMISDERLPLIQSSLKLLVKELREQDNIAIV 256 L+++ + AKD A+NLVFL+D SGSM +L L+Q S++LL+ L+ +D +AIV Sbjct: 400 LVRIGLKAKDAAVSGRAAANLVFLLDVSGSMDQPNKLRLVQESMRLLLGRLQPEDRVAIV 459 Query: 257 TYAGDSRIALPSISGSHKAEINAAIDSLDAEGSTNGGAGLELAYQQATKGFIKGGINRIL 316 TYAG+S +ALPS + + EI AID L A GSTNG GL+LAY A F+ G+NR++ Sbjct: 460 TYAGNSGLALPSTPVARQREILDAIDELRAGGSTNGAMGLQLAYDIAKANFVANGVNRVI 519 Query: 317 LATDGDFNVGIDDPKSIESMVKKQRESGVTLSTFGVGNSNYNEAMMVRIADVGNGNYSYI 376 L TDGDFNVG+ + +++++ +SGV L+ G G N +AM+ +IAD GNG+Y YI Sbjct: 520 LCTDGDFNVGVTSEGELVRLIEEKAKSGVFLTVLGFGMGNLKDAMLQQIADRGNGSYGYI 579 Query: 377 DTLSEAQKVLNSEMRQMLITVAKDVKAQIEFNPAWVTEYRQIGYEKRQLRVEHFNNDNVD 436 DT EA+K+L ++ L+TVAKDVK Q+EFNPA V YR IGYEKR L E F ND +D Sbjct: 580 DTRREAEKLLVQQVSGTLLTVAKDVKLQVEFNPAKVARYRLIGYEKRLLNQEDFANDKID 639 Query: 437 AGDIGAGKHITLLFELTLNGQKASIDKLRYAPDNKLAKSDK-------------TKELAW 483 AG+IGAG +T L+E+ G K + P+++ EL Sbjct: 640 AGEIGAGHTVTALYEIIPVGAKDAEVTEETEPEDRRYTYSSAAPSAVEKRTLAHADELLT 699 Query: 484 LKIRWKYPQGKESQLVEFPL---GPTINAPSEDMRFRAAVAAYGQKLRGSEYLNNTSWQQ 540 LK+R+K P S +EFPL G SED RF +AVAA+G LR S Y + Sbjct: 700 LKVRYKQPTALLSTRLEFPLKDDGGNFAQASEDFRFASAVAAFGMILRDSPYKGVATLDD 759 Query: 541 IKQWAQQAKGEDPQGYRAEFIRLIELADGVTD 572 + WA A +DP GYRAEF+ L++ A +T Sbjct: 760 VIAWANAATSDDPGGYRAEFVELVKQARLLTQ 791 >UniRef50_A0NVX5 Putative uncharacterized protein n=1 Tax=Labrenzia aggregata IAM 12614 RepID=A0NVX5_9RHOB Length = 608 Score = 512 bits (1319), Expect = e-143, Method: Composition-based stats. Identities = 230/575 (40%), Positives = 322/575 (56%), Gaps = 15/575 (2%) Query: 3 NKNIIMLLMSSLILSGCGPQPENKESQQQQPSTPTEQQVLAAQQAAIKEAEQSAAAAKAL 62 K + ++ ++ G + + E + A+ A Q A A Sbjct: 36 GKKLTDTTEAAGRMTSSGKPDGDASVSTAELPKQEETHTVVAEIARPVATPQPAPAPALP 95 Query: 63 AQQEVQQYSDKQALQGRLQEAPTFARAAKAKATHIANP-----GTARYQQFDDNPVKQVA 117 +Q + L A + + A P R+ + NP+++ + Sbjct: 96 QKQRSRSDGAGGGLMTFSSGAGGAVLNSGIQLEPPAMPAVQLEDRERFASAEANPLRRTS 155 Query: 118 QNPLATFSLDVDTGSYANVRRFLNQGLLPPPDAVRVEEIVNYFPSDWDIKDKQSIPASKP 177 +P++TFS+DVDT SY+ VR L+ G LP PDAVRVEE+VNYF ++ + P Sbjct: 156 ADPVSTFSVDVDTASYSYVRSTLSGGRLPNPDAVRVEEMVNYFDYNYPV------PEKGG 209 Query: 178 IPFAMRYELAPAPWNEQRTLLKVDILAKDRKSEELPASNLVFLIDTSGSMISDERLPLIQ 237 PF+ + PWNE L++V I ++LP+ NLVFLIDTSGSM +LPL+Q Sbjct: 210 HPFSTNVSVVDTPWNEHTKLMQVGIQGYKVPLDDLPSQNLVFLIDTSGSMADANKLPLLQ 269 Query: 238 SSLKLLVKELREQDNIAIVTYAGDSRIALPSISGSHKAEINAAIDSLDAEGSTNGGAGLE 297 S +LL+ LR++D +AIVTYAG S + L + K I I++L + GST G GL+ Sbjct: 270 QSFRLLLSSLRDEDEVAIVTYAGSSGVLLEPTKVADKTRILEKINALTSGGSTAGHEGLK 329 Query: 298 LAYQQATKGFIKGGINRILLATDGDFNVGIDDPKSIESMVKKQRESGVTLSTFGVGNSNY 357 AY A G RI+LATDGDFNVG+ DP S++ V +QRE+G LS G G NY Sbjct: 330 GAYALAETMTGDGEQTRIILATDGDFNVGLSDPDSLKRYVAEQRENGTALSVLGFGRGNY 389 Query: 358 NEAMMVRIADVGNGNYSYIDTLSEAQKVLNSEMRQMLITVAKDVKAQIEFNPAWVTEYRQ 417 N+ +M +A G G +YIDTLSEA+KVL ++ + +A+DVK Q+EFNP V EYR Sbjct: 390 NDELMQTLAQNGQGVAAYIDTLSEARKVLVDQVVSSISMIAQDVKIQVEFNPETVAEYRL 449 Query: 418 IGYEKRQLRVEHFNNDNVDAGDIGAGKHITLLFELTLNGQKAS-IDKLRYAPDNK--LAK 474 IGYE R LR E F ND VDAGDIGAG ++T L+E+T G A LRY P K + Sbjct: 450 IGYETRALRTEDFKNDKVDAGDIGAGHNVTALYEITPVGSPAEKFSDLRYGPKEKIEAVR 509 Query: 475 SDKTKELAWLKIRWKYPQGKESQLVEFPL-GPTINAPSEDMRFRAAVAAYGQKLRGSEYL 533 + ELA++K+R+K P KES LVE P+ T+ P + F A+VAA+GQKL+G++YL Sbjct: 510 TSYAGELAFVKLRYKLPGDKESTLVETPVMEDTVGIPKSETLFAASVAAFGQKLKGTDYL 569 Query: 534 NNTSWQQIKQWAQQAKGEDPQGYRAEFIRLIELAD 568 + ++ I++ A KG DP GYR+EF+ L+ LAD Sbjct: 570 GDWDFKAIEKLASDNKGTDPFGYRSEFLTLVRLAD 604 >UniRef50_C0YQB8 von Willebrand factor, type A n=2 Tax=Flavobacteriaceae RepID=C0YQB8_9FLAO Length = 800 Score = 512 bits (1319), Expect = e-143, Method: Composition-based stats. Identities = 196/474 (41%), Positives = 284/474 (59%), Gaps = 12/474 (2%) Query: 97 IANPGTARYQQFDDNPVKQVAQNPLATFSLDVDTGSYANVRRFLNQGLLPPPDAVRVEEI 156 Y F +NP + PL+TFS+DVD SY+NVRR +N G + +AVR+EE+ Sbjct: 327 PVTQNNESYDAFVENPFELTRNQPLSTFSIDVDNASYSNVRRMINNGQVVDKNAVRIEEM 386 Query: 157 VNYFPSDWDIKDKQSIPASKPIPFAMRYELAPAPWNEQRTLLKVDILAKDRKSEELPASN 216 VNYF D+ + PF++ E + APWN + LLK+ + K+ ++LPASN Sbjct: 387 VNYFKYDYPQPKNE-------NPFSINTEYSDAPWNPKHKLLKIGLQGKNLPMDKLPASN 439 Query: 217 LVFLIDTSGSMISDERLPLIQSSLKLLVKELREQDNIAIVTYAGDSRIALPSISGSHKAE 276 LVFLID SGSM + +LPL++SS K+L+ +LR +D + IV YAG + + LP S K + Sbjct: 440 LVFLIDVSGSMSDENKLPLLKSSFKVLLNQLRPKDKVGIVVYAGSAGMVLPPTSAGEKDK 499 Query: 277 INAAIDSLDAEGSTNGGAGLELAYQQATKGFIKGGINRILLATDGDFNVGIDDPKSIESM 336 I A+D L A GST GGAG+ELAY+ A + F+K G NR+++ATDGDFNVG ++++ Sbjct: 500 IIEALDRLQAGGSTAGGAGIELAYKLAQENFVKEGNNRVIIATDGDFNVGTSSISDLKTL 559 Query: 337 VKKQRESGVTLSTFGVGNSNYNEAMMVRIADVGNGNYSYIDTLSEAQKVLNSEMRQMLIT 396 ++ +R+SGV L+ G G NY + + +AD GNGNY+YID + EA K L E + Sbjct: 560 IEDRRKSGVFLTCLGFGMGNYKDNTLETLADKGNGNYAYIDNMQEANKFLGKEFAGSMYA 619 Query: 397 VAKDVKAQIEFNPAWVTEYRQIGYEKRQLRVEHFNNDNVDAGDIGAGKHITLLFELTLNG 456 +AKD+K QIEFNP +V YR IGYE R+L+ E F ND +DAG++G+G +T L+E+ Sbjct: 620 IAKDMKIQIEFNPEYVKSYRLIGYENRKLKNEDFTNDKIDAGELGSGHTVTALYEVIPAN 679 Query: 457 QKASIDKLR--YAPDNKLAKSDKTKELAWLKIRWKYPQGKESQLVEFPLGPT---INAPS 511 + + ELA +K R+K P G S+ + + + I++ S Sbjct: 680 VNSDFAPKESDLKYSQNTSSKGFGDELATIKFRYKKPDGDTSREITQVVKNSDNRISSAS 739 Query: 512 EDMRFRAAVAAYGQKLRGSEYLNNTSWQQIKQWAQQAKGEDPQGYRAEFIRLIE 565 D +F ++VA +G LR SE + I+ A+Q K +D +GYR+EFIRLIE Sbjct: 740 PDFKFASSVAWFGLVLRNSELITKKDLSDIENLAKQGKNKDEEGYRSEFIRLIE 793 >UniRef50_A3PN61 von Willebrand factor, type A n=9 Tax=Rhodobacteraceae RepID=A3PN61_RHOS1 Length = 651 Score = 511 bits (1317), Expect = e-143, Method: Composition-based stats. Identities = 218/568 (38%), Positives = 314/568 (55%), Gaps = 18/568 (3%) Query: 14 LILSGCGPQPENKESQQQQPSTPTEQQVLAAQQAAIKEAEQSAAAAKALAQQEVQQYSDK 73 L L P E Q P P L A A AE + A A + + + Sbjct: 91 LALVVVMPNARLAEPPQTAPDAPEADARLTAAPEAGGGAETAGAPVPAEPRARSAEGAAP 150 Query: 74 QALQGRLQEAPTFAR---------AAKAKATHIANPGTARYQQFDDNPVKQVAQNPLATF 124 Q AA+A A + + + DNP++ A++P++TF Sbjct: 151 QTFAADEAMPMAAPPAPDLALSKQAAEAPARALPQGDSEAFANAPDNPLRVTAEDPVSTF 210 Query: 125 SLDVDTGSYANVRRFLNQGLLPPPDAVRVEEIVNYFPSDWDIKDKQSIPASKPIPFAMRY 184 S+DVDT SYA +R L G LPP +AVR+EE++NYFP D+ P + PF Sbjct: 211 SIDVDTASYAILRSSLRAGQLPPREAVRIEEMINYFPYDYPA------PENGTPPFRPTL 264 Query: 185 ELAPAPWNEQRTLLKVDILAKDRKSEELPASNLVFLIDTSGSMISDERLPLIQSSLKLLV 244 + PWN + L+ V + + E+ P NLVFLIDTSGSM +LPL++ S L++ Sbjct: 265 SITRTPWNPETRLVHVALQGRMPAIEDRPPLNLVFLIDTSGSMQDPAKLPLLKQSFGLML 324 Query: 245 KELREQDNIAIVTYAGDSRIALPSISGSHKAEINAAIDSLDAEGSTNGGAGLELAYQQAT 304 LR +D +AIVTYAG + L + + ++ I +A+D LDA GST G GL LAY+ A+ Sbjct: 325 GRLRPEDQVAIVTYAGSAGEVLAPTAANQRSTILSALDRLDAGGSTAGDEGLALAYRTAS 384 Query: 305 KGFIKGGINRILLATDGDFNVGIDDPKSIESMVKKQRESGVTLSTFGVGNSNYNEAMMVR 364 + G + R++LATDGDFN+GI DP+ + +V +R++GV LS G G N ++A M Sbjct: 385 EMAGAGEVTRVVLATDGDFNLGISDPEELARLVAHERDTGVYLSVLGFGRGNLDDATMQA 444 Query: 365 IADVGNGNYSYIDTLSEAQKVLNSEMRQMLITVAKDVKAQIEFNPAWVTEYRQIGYEKRQ 424 +A GNG +YID+L+EAQKVL ++ L +A DVK Q+E++PA V EYR IGYE R Sbjct: 445 LAQNGNGQAAYIDSLNEAQKVLVDQLSGALFPIADDVKVQVEWSPARVAEYRLIGYETRG 504 Query: 425 LRVEHFNNDNVDAGDIGAGKHITLLFELTLNGQKASI-DKLRYAPDNKLAKSDKTKELAW 483 LR E F ND VDAG+IGAG +T ++E+T A + D LRY + + EL + Sbjct: 505 LRREDFANDRVDAGEIGAGHSVTAIYEITPVDSPARLTDPLRYGAEPP--EGAHGDELGF 562 Query: 484 LKIRWKYPQGKESQLVEFPLGPTINAPSEDMRFRAAVAAYGQKLRGSEYLNNTSWQQIKQ 543 L++R+K P S L++ P+ + SED+RF A+A +G+ LRGS+ L W + Sbjct: 563 LRLRYKAPGESTSTLIDTPIPDMLTEASEDVRFSTAIAGFGELLRGSDKLGAWGWDEAIA 622 Query: 544 WAQQAKGEDPQGYRAEFIRLIELADGVT 571 A A+G DP GYR E ++L+ LA+ ++ Sbjct: 623 LADGARGADPFGYRVEAVQLMRLAESLS 650 >UniRef50_UPI000185CB41 protein containing von Willebrand factor n=1 Tax=Capnocytophaga sputigena ATCC 33612 RepID=UPI000185CB41 Length = 550 Score = 508 bits (1309), Expect = e-142, Method: Composition-based stats. Identities = 224/528 (42%), Positives = 315/528 (59%), Gaps = 18/528 (3%) Query: 55 SAAAAKALAQQEVQQYSDKQALQGRLQEAPTFARAAKAKATHIANPGT------ARYQQF 108 A A + + + + + A + + + P Y++ Sbjct: 29 PPAPATTVEELTANKMASPETPPPPPPPPAYDAVVEEMEIANSEEPSQQQLRSNETYKEI 88 Query: 109 DDNPVKQVAQNPLATFSLDVDTGSYANVRRFLNQGLLPPPDAVRVEEIVNYFPSDWDIKD 168 +NP VAQ P+ TFS DVD SYAN+RR L G LPP DA+R+EE++NYF D+ Sbjct: 89 SENPFVAVAQQPVTTFSADVDRASYANLRRMLGYGQLPPKDAIRIEEMINYFDYDYPAPT 148 Query: 169 KQSIPASKPIPFAMRYELAPAPWNEQRTLLKVDILAKDRKSEELPASNLVFLIDTSGSMI 228 K++ P + ELAP PWN + LL++ + AK + P SN+VFLID SGSM Sbjct: 149 KEATS-----PLRVTPELAPTPWNPEHLLLRIGLQAKKLDLAQAPPSNIVFLIDVSGSMD 203 Query: 229 SDERLPLIQSSLKLLVKELREQDNIAIVTYAGDSRIALPSISGSHKAEINAAIDSLDAEG 288 +LPL++SS KLL+ +L+ D +AIVTYA +++AL S + +I +D+L A G Sbjct: 204 EPNKLPLLKSSFKLLLTQLKPTDRVAIVTYASGTKVALSSTPVKERQKIEKVLDNLYASG 263 Query: 289 STNGGAGLELAYQQATKGFIKGGINRILLATDGDFNVGIDDPKSIESMVKKQRESGVTLS 348 ST+G +G++LAY++A K FIK G NRI+LATDGDFNVGI +P+ +E ++KQRESG+ +S Sbjct: 264 STSGSSGIQLAYKEAQKNFIKNGNNRIILATDGDFNVGISNPRELEKFIEKQRESGIYMS 323 Query: 349 TFGVGNSNYNEAMMVRIADVGNGNYSYIDTLSEAQKVLNSEMRQMLITVAKDVKAQIEFN 408 G G NY + M IAD GNGNY+YID L+EA+KVL +E ML VAKDVK QIEFN Sbjct: 324 VLGFGMGNYRDDMAETIADKGNGNYAYIDDLTEAKKVLVNEFSGMLFAVAKDVKLQIEFN 383 Query: 409 PAWVTEYRQIGYEKRQLRVEHFNNDNVDAGDIGAGKHITLLFELTLNGQKASIDKLRYAP 468 P +V EY+ IGYE R L E F +D DAG+IGAG +T L+EL + K + LRY Sbjct: 384 PKYVKEYKLIGYENRMLANEDFTDDKKDAGEIGAGHTVTALYELIPSEGKVAQ-NLRYQ- 441 Query: 469 DNKLAKSDKTKELAWLKIRWKYPQGKESQLVEFPLG-----PTINAPSEDMRFRAAVAAY 523 +L + K EL +LKIR+K P+ K+++ VE ++N S D RF A+VA + Sbjct: 442 TKELNEKGKGNELGFLKIRYKDPKVKDAKSVEVTEPLLFAKKSLNETSVDFRFAASVAEF 501 Query: 524 GQKLRGSEYLNNTSWQQIKQWAQQAKGEDPQGYRAEFIRLIELADGVT 571 G LRG+ ++ Q+ + A A G+D +GYR EF+RL++ A + Sbjct: 502 GILLRGNSNKAQATYDQVVELANGAIGKDEEGYRKEFVRLVKSAKLLA 549 >UniRef50_A5F9T1 von Willebrand factor, type A n=9 Tax=Bacteria RepID=A5F9T1_FLAJ1 Length = 709 Score = 506 bits (1303), Expect = e-141, Method: Composition-based stats. Identities = 216/527 (40%), Positives = 313/527 (59%), Gaps = 19/527 (3%) Query: 51 EAEQSAAAAKALAQQEVQQYSDKQALQGRLQEAPTFARAAKAKATHIANPGTARYQQFDD 110 E+ S +KA + + K+ L L E + + P Y F + Sbjct: 192 ESAASIYGSKAANGAVI--IATKKGLYKNLSEQ-ELDKKLNIIRPNPTLPTQEDYDTFVE 248 Query: 111 NPVKQVAQNPLATFSLDVDTGSYANVRRFLNQGLLPPPDAVRVEEIVNYFPSDWDIKDKQ 170 N + PL+TFS+DVD SY N+RRFLN G P DAVRVEE+VN+F ++ + Sbjct: 249 NAFESPKTAPLSTFSIDVDNASYTNIRRFLNSGQEVPKDAVRVEEMVNFFKYNYPQPKNE 308 Query: 171 SIPASKPIPFAMRYELAPAPWNEQRTLLKVDILAKDRKSEELPASNLVFLIDTSGSMISD 230 PF++ E + +PWN Q +LK+ + K+ + +LP+SNLVFLID SGSM Sbjct: 309 H-------PFSINTEYSDSPWNSQNKILKIGLQGKNIATNDLPSSNLVFLIDVSGSMEDM 361 Query: 231 ERLPLIQSSLKLLVKELREQDNIAIVTYAGDSRIALPSISGSHKAEINAAIDSLDAEGST 290 +LPL++ S+K+LV ELR D ++IV YAG + + LP SG+ K I A+D L+A GST Sbjct: 362 NKLPLLKQSMKILVNELRPTDKVSIVVYAGAAGMVLPPTSGNEKKTIIKALDQLEAGGST 421 Query: 291 NGGAGLELAYQQATKGFIKGGINRILLATDGDFNVGIDDPKSIESMVKKQRESGVTLSTF 350 GGAG+ELAY+ AT+ FIKGG NR++LATDGDFNVG +E +++++R++GV L+ Sbjct: 422 AGGAGIELAYKIATENFIKGGNNRVILATDGDFNVGSSSNSDMEKLIEEKRKTGVFLTCL 481 Query: 351 GVGNSNYNEAMMVRIADVGNGNYSYIDTLSEAQKVLNSEMRQMLITVAKDVKAQIEFNPA 410 G G NY ++ M +AD GNGNY+YID + EA + L E + + +AKDVK QIEFNP Sbjct: 482 GYGMGNYKDSKMEILADKGNGNYAYIDNIQEANRFLGKEFKGSMFAIAKDVKIQIEFNPK 541 Query: 411 WVTEYRQIGYEKRQLRVEHFNNDNVDAGDIGAGKHITLLFELTLNGQKASI-----DKLR 465 V YR IGYE R+LR E F ND +DAG++G+ +T L+E+ G K+ D L+ Sbjct: 542 QVQAYRLIGYENRKLRPEDFKNDAIDAGELGSNHTVTALYEIIPAGVKSDFLNVQPDDLK 601 Query: 466 YAPDNKLAKSDKTKELAWLKIRWKYPQGKESQLVEFPLGP---TINAPSEDMRFRAAVAA 522 Y + ++ + ELA +K R+K P G +S + + +++ S+D +F AVA Sbjct: 602 YT-KTETNSANYSNELATIKFRYKKPDGDKSIEMVQVINTKSVSLDQASDDFKFSTAVAW 660 Query: 523 YGQKLRGSEYLNNTSWQQIKQWAQQAKGEDPQGYRAEFIRLIELADG 569 +G KLR S+ + + S + I + AQQ D GY+AEFIRL+E ++ Sbjct: 661 FGLKLRDSKLITDKSSESIAELAQQGMSFDKGGYKAEFIRLVETSEQ 707 >UniRef50_C1AC65 Putative uncharacterized protein n=1 Tax=Gemmatimonas aurantiaca T-27 RepID=C1AC65_GEMAT Length = 642 Score = 506 bits (1302), Expect = e-141, Method: Composition-based stats. Identities = 233/579 (40%), Positives = 334/579 (57%), Gaps = 19/579 (3%) Query: 2 RNKNIIMLLMSSLILSGCGPQPENKESQQQQPSTPTEQQVLAAQQAAIKEAEQSAAAAKA 61 R K ++ L S +++G P Q+ T + A+Q A + A A Sbjct: 67 RLKQSVIAL--SGVVTGMKPTVTGVAEQKADNFNRTRETANASQGAEVTRTAAPAIAPSP 124 Query: 62 LAQQEVQQYSDKQALQGRLQEAPTFARA--AKAKATHIANPG-TARYQQFDDNPVKQVAQ 118 Q +++ +P A + A+ + PG +Y + +DNP V Sbjct: 125 APQTRGVAGGMARSVGMPAPASPRRAASDEARPPRPYPGQPGNREQYDRIEDNPFLGVTG 184 Query: 119 NPLATFSLDVDTGSYANVRRFLNQGLLPPPDAVRVEEIVNYFPSDWDIKDKQSIPASKPI 178 NPL+TFS+DVD SY N RRFL G PP DAVR+EE++NYFP + Sbjct: 185 NPLSTFSIDVDRASYGNARRFLQDGQRPPADAVRIEELINYFPY-------ELREPRGND 237 Query: 179 PFAMRYELAPAPWNEQRTLLKVDILAKDRKSEELPASNLVFLIDTSGSMISDERLPLIQS 238 P A+ E+ APW + L+++ + ++ ++ LP +NLVFLID SGSM S ++LPL++ Sbjct: 238 PVAITTEVTTAPWQPRHQLVRIALQSRRIETASLPPNNLVFLIDVSGSMQSPDKLPLVKQ 297 Query: 239 SLKLLVKELREQDNIAIVTYAGDSRIALPSISGSHKAEINAAIDSLDAEGSTNGGAGLEL 298 SL+LLV ++R QD +AIV YAG + + LPS SG K I AI+ L+A GST GGAG+EL Sbjct: 298 SLRLLVDQMRPQDRVAIVAYAGAAGLVLPSTSGDEKETIIQAIERLEAGGSTAGGAGIEL 357 Query: 299 AYQQATKGFIKGGINRILLATDGDFNVGIDDPKSIESMVKKQRESGVTLSTFGVGNSNYN 358 AY+ A + F+ G NR++LA+DGDFNVG+ +E +++++R G L+ G G NY Sbjct: 358 AYRTAREHFMDHGNNRVILASDGDFNVGVSSDGELERLIERKRTEGTYLTILGFGTGNYQ 417 Query: 359 EAMMVRIADVGNGNYSYIDTLSEAQKVLNSEMRQMLITVAKDVKAQIEFNPAWVTEYRQI 418 +A M ++A GNGNY Y+D ++EA+K+L EM L+TVA DVK Q+EFNP V YR I Sbjct: 418 DAKMEKLAKRGNGNYGYVDDIAEARKMLVREMGATLLTVANDVKLQVEFNPRRVQAYRLI 477 Query: 419 GYEKRQLRVEHFNNDNVDAGDIGAGKHITLLFELTLNGQKASI-----DKLRYAPDNKLA 473 GYE R LR E F +D DAGD+GAG +T L+E+ G + ++ + RY P A Sbjct: 478 GYEDRLLRTEDFTDDRKDAGDLGAGHQVTALYEIVPVGVQGTVRLQDTEARRYEPVTGEA 537 Query: 474 KSD--KTKELAWLKIRWKYPQGKESQLVEFPLGPTINAPSEDMRFRAAVAAYGQKLRGSE 531 +S + EL ++K+R+K P S+L+ P+ S+DMRF ++VAA+G LR S Sbjct: 538 RSSTATSDELLFVKLRYKRPGESTSRLITHPVPARTVRGSDDMRFASSVAAFGMLLRESP 597 Query: 532 YLNNTSWQQIKQWAQQAKGEDPQGYRAEFIRLIELADGV 570 Y NTS Q+ + A+ A GED GYRAEFIRL+E + Sbjct: 598 YAGNTSAAQVLEQARAALGEDDGGYRAEFIRLVERYRSI 636 >UniRef50_A9DBX8 Putative uncharacterized protein n=1 Tax=Hoeflea phototrophica DFL-43 RepID=A9DBX8_9RHIZ Length = 668 Score = 504 bits (1299), Expect = e-141, Method: Composition-based stats. Identities = 229/551 (41%), Positives = 317/551 (57%), Gaps = 28/551 (5%) Query: 42 LAAQQAAIKEAEQSAAAAKALAQQEVQQYSDKQALQGRLQEAP----------------T 85 LA QQ + + A++ A + R P + Sbjct: 122 LAPQQEEQELVAAAPLPEVAVSPALKSSRQANDAARQRFTGQPVGGLAGQGLAGQIDGES 181 Query: 86 FARAAKAKATHIANPGTARYQQFDDNPVKQVAQNPLATFSLDVDTGSYANVRRFLNQGLL 145 A A A R + FD N V+ VA+ P++TFS DVDT SYA VRR L QG++ Sbjct: 182 LRGADGANPAPGAEAERDRVEGFDSNGVRSVAEYPVSTFSADVDTASYAMVRRALKQGVM 241 Query: 146 PPPDAVRVEEIVNYFPSDWDIKDKQSIPASKPIPFAMRYELAPAPWNEQRTLLKVDILAK 205 P P VR+EE+VNYF D+ P S PF + P PWN LL + + Sbjct: 242 PDPRTVRIEEMVNYFNYDYPA------PESVETPFRATVTVTPTPWNANTRLLHIGVKGY 295 Query: 206 DRKSEELPASNLVFLIDTSGSMISDERLPLIQSSLKLLVKELREQDNIAIVTYAGDSRIA 265 D K P +NLV L+D SGSM ++LPL++S+ +LL+++L +D ++IVTYAGD+ Sbjct: 296 DVKPAARPQANLVLLVDVSGSMQETDKLPLLKSAFRLLIQKLEPEDTVSIVTYAGDAGTV 355 Query: 266 LPSISGSHKAEINAAIDSLDAEGSTNGGAGLELAYQQATKGFIKGGINRILLATDGDFNV 325 L S KA+I A+D L GST G AG+E AY+ A K + GG+NR+LLATDGDFNV Sbjct: 356 LEPTPASDKAKILDALDDLRPGGSTAGAAGIEEAYRLAEKARVNGGVNRVLLATDGDFNV 415 Query: 326 GIDDPKSIESMVKKQRESGVTLSTFGVGNSNYNEAMMVRIADVGNGNYSYIDTLSEAQKV 385 G D +++S+++++RESGV LS FG G NYN+ +M +A GNG +YIDTL+EA+K Sbjct: 416 GASDDDALKSLIEEKRESGVFLSIFGFGQGNYNDQLMQTLAQNGNGVAAYIDTLAEAEKT 475 Query: 386 LNSEMRQMLITVAKDVKAQIEFNPAWVTEYRQIGYEKRQLRVEHFNNDNVDAGDIGAGKH 445 L E L +A DVK QIEFNP + EYRQIG+E R L E FNND VDAG+IG+G Sbjct: 476 LAQEATASLFPIASDVKFQIEFNPETIAEYRQIGFETRALSREDFNNDQVDAGEIGSGHT 535 Query: 446 ITLLFELTLNGQKASID-KLRYAPDNKLAKSDKTKELAWLKIRWKYPQGKESQLVEFPLG 504 +T ++E+T G A ++ LRY + ++ E A+LKIR K P +ES L E P+ Sbjct: 536 VTAIYEVTPVGSPAILNSDLRYGAETPVSDVAHGDEFAFLKIRAKVPGEEESSLTEIPVM 595 Query: 505 -----PTINAPSEDMRFRAAVAAYGQKLRGSEYLNNTSWQQIKQWAQQAKGEDPQGYRAE 559 + +A +D+RF AVAA+ QKLR + +N + I+ A A+GEDP GYR+E Sbjct: 596 KDAELTSFSAAPQDVRFSIAVAAFAQKLRRIQQVNGFGFDAIESIASDARGEDPFGYRSE 655 Query: 560 FIRLIELADGV 570 F++L+ LA+G+ Sbjct: 656 FLQLVRLANGL 666 >UniRef50_A9KS19 von Willebrand factor type A n=1 Tax=Clostridium phytofermentans ISDg RepID=A9KS19_CLOPH Length = 551 Score = 503 bits (1294), Expect = e-140, Method: Composition-based stats. Identities = 188/471 (39%), Positives = 277/471 (58%), Gaps = 11/471 (2%) Query: 102 TARYQQFDDNPVKQVAQNPLATFSLDVDTGSYANVRRFLNQGLLPPPDAVRVEEIVNYFP 161 T Y + + +PL+TFS DVDT SY+N+RR L +G AVR+EE++NYF Sbjct: 89 TEEYNAVIEQGYQSTKNHPLSTFSADVDTASYSNIRRMLKEGRRVDTGAVRIEEMLNYFN 148 Query: 162 SDWDIKDKQSIPASKPIPFAMRYELAPAPWNEQRTLLKVDILAKDRKSEELPASNLVFLI 221 D+ + + PF + EL+ PWN L I + + SNLVFLI Sbjct: 149 YDYKLPE-------GDSPFGITTELSDCPWNPDTKLFLAGIQTEKIDFSKSAPSNLVFLI 201 Query: 222 DTSGSMISDERLPLIQSSLKLLVKELREQDNIAIVTYAGDSRIALPSISGSHKAEINAAI 281 D SGSM+ +++LPL+Q + LL + L E+D I+IVTYAG+ + L G+ K +I AI Sbjct: 202 DVSGSMMDEDKLPLVQRAFLLLTENLTEKDRISIVTYAGNDTVVLSGAKGNQKEKIQNAI 261 Query: 282 DSLDAEGSTNGGAGLELAYQQATKGFIKGGINRILLATDGDFNVGIDDPKSIESMVKKQR 341 L+A GST G G+E AYQ A + +I+GG NR++LATDGD NVG+ + ++++++R Sbjct: 262 TELEAGGSTFGSKGIETAYQLAMENYIEGGNNRVILATDGDLNVGVTSESELTNLIEEKR 321 Query: 342 ESGVTLSTFGVGNSNYNEAMMVRIADVGNGNYSYIDTLSEAQKVLNSEMRQMLITVAKDV 401 +SGV LS G G N + M +AD GNGNY+YID+L EA+KVL EM L+TVA DV Sbjct: 322 KSGVALSVLGFGTGNIKDNKMEALADHGNGNYAYIDSLMEARKVLVEEMGATLVTVAGDV 381 Query: 402 KAQIEFNPAWVTEYRQIGYEKRQLRVEHFNNDNVDAGDIGAGKHITLLFELTLNGQKASI 461 K Q+EFNPA V YR +GY+ R L E FN+D DAG++GAG +T+L+EL L K I Sbjct: 382 KFQVEFNPAKVKGYRLLGYDNRLLATEDFNDDTKDAGEVGAGHSVTVLYELVLEDSKMEI 441 Query: 462 DKLRYAPDNKLAKSDKTKELAWLKIRWKYPQGKESQLVEFPLG--PTINAPSEDMRFRAA 519 + ++ EL + IR+K P +S L+ P+G + ++++ F A Sbjct: 442 PETELKYTT-TEPTNMVDELLTVNIRYKKPGKDKSILMSEPVGINQLADTRTDNLAFATA 500 Query: 520 VAAYGQKLRGSEYLNNTSWQQIKQWAQQAKGEDPQGYRAEFIRLIELADGV 570 VA +G L+ SEY + ++ ++ ++ + + YRAEF +L++LA + Sbjct: 501 VAEFGLLLKDSEYKGDATFSKVLSRLEETNYKQDE-YRAEFYQLVKLAKDI 550 >UniRef50_Q28U54 von Willebrand factor type A n=3 Tax=Rhodobacterales RepID=Q28U54_JANSC Length = 686 Score = 502 bits (1292), Expect = e-140, Method: Composition-based stats. Identities = 215/547 (39%), Positives = 302/547 (55%), Gaps = 12/547 (2%) Query: 32 QPSTPTEQQVLAAQQAAIKEAEQSAAAAKALAQQEVQQYSDKQALQGRLQEAPTFARAAK 91 P A A + + +AL ++ Q R + Sbjct: 143 APEADVVADAEAPSLAPMVLPAPAPMVREALGGLADLDHAGDGVAQIRRPIQGLTLYSDG 202 Query: 92 AKATHIA------NPGTARYQQFDDNPVKQVAQNPLATFSLDVDTGSYANVRRFLNQGLL 145 HI P + DDNP++ VA +P++TFS+DVDT SYA +R LN+G L Sbjct: 203 GPQNHIGTGDLALAPLPEDFANADDNPLRVVADDPVSTFSIDVDTASYALLRSTLNRGAL 262 Query: 146 PPPDAVRVEEIVNYFPSDWDIKDKQSIPASKPIPFAMRYELAPAPWNEQRTLLKVDILAK 205 P PDAVR+EE+VNYFP D+ A PF ++ PWN L+ + I Sbjct: 263 PAPDAVRIEEMVNYFPYDYPAPT-----ADDISPFRPNVQVFETPWNPDTQLVHIGIQGD 317 Query: 206 DRKSEELPASNLVFLIDTSGSMISDERLPLIQSSLKLLVKELREQDNIAIVTYAGDSRIA 265 E+ P NLVFLIDTSGSM +LPL+ S +L++ L +D +AIVTYAG + +A Sbjct: 318 LPVVEDRPPLNLVFLIDTSGSMNDPAKLPLLIQSFRLMLNRLSPEDEVAIVTYAGSAGVA 377 Query: 266 LPSISGSHKAEINAAIDSLDAEGSTNGGAGLELAYQQATKGFIKGGINRILLATDGDFNV 325 L + S A INAA+ +L A GSTNG GLE AY+ A + + G ++R+LLATDGDFNV Sbjct: 378 LEPTAASDTATINAALTTLQAGGSTNGVGGLEEAYRLAGEMMVDGEVSRVLLATDGDFNV 437 Query: 326 GIDDPKSIESMVKKQRESGVTLSTFGVGNSNYNEAMMVRIADVGNGNYSYIDTLSEAQKV 385 G+ D ++E + +QR++G+ LS G G N + M +A GNG SYIDTL EAQ+V Sbjct: 438 GLSDAGALEDYIAEQRDTGIYLSVLGFGRGNLQDDTMQALAQNGNGTASYIDTLHEAQRV 497 Query: 386 LNSEMRQMLITVAKDVKAQIEFNPAWVTEYRQIGYEKRQLRVEHFNNDNVDAGDIGAGKH 445 L ++ L +A D+K Q+EFNP + EYR IGYE R L E F ND VDAGDIGAG Sbjct: 498 LVDQLAGALYPIADDLKVQVEFNPDVIAEYRLIGYETRALAREDFANDAVDAGDIGAGHS 557 Query: 446 ITLLFELTLNGQKAS-IDKLRYAPDNKLAKSDKTKELAWLKIRWKYPQGKESQLVEFPLG 504 +T ++E+T G A + LRY D ++ EL ++ +RWK P ESQL++FP+ Sbjct: 558 VTAIYEVTPVGSPAVLVAPLRYTADEGAPEAAFGDELGFISLRWKEPGADESQLIDFPIA 617 Query: 505 PTINAPSEDMRFRAAVAAYGQKLRGSEYLNNTSWQQIKQWAQQAKGEDPQGYRAEFIRLI 564 + P + +F AA+A +GQ LRGS+++ + + A +G D GYR E ++L+ Sbjct: 618 NAVADPGTEAQFAAAIAGFGQLLRGSDFVADWDYADAIALANANRGMDEFGYRTEAVQLM 677 Query: 565 ELADGVT 571 LA ++ Sbjct: 678 RLAQSLS 684 >UniRef50_A3ZT14 Putative uncharacterized protein n=1 Tax=Blastopirellula marina DSM 3645 RepID=A3ZT14_9PLAN Length = 616 Score = 501 bits (1290), Expect = e-140, Method: Composition-based stats. Identities = 214/566 (37%), Positives = 307/566 (54%), Gaps = 36/566 (6%) Query: 32 QPSTPTEQQVLAAQQAAIKEAEQSAAAAKALAQQEVQQYSD-KQALQGRLQEAPTFARAA 90 + ++ A + A +A +E + A Q RL PT +R Sbjct: 58 KQEAKPASELSAVASQPVPAARAVEMNRDRVAGREKEAGKVRSDARQDRLATLPTESRRL 117 Query: 91 KAKATHIANPGT-----------------ARYQQFDDNPVKQVAQNPLATFSLDVDTGSY 133 + + A ++ ++NP + VA PL+TFS+DVDT SY Sbjct: 118 GIEQPNAAPGFMPQLDGIAGHGEGPGVGGDKFAYVENNPFRAVADEPLSTFSIDVDTASY 177 Query: 134 ANVRRF-LNQGLLPPPDAVRVEEIVNYFPSDWDIKDKQSIPASKPIPFAMRYELAPAPWN 192 + +R + ++ LPP AVRVEE++NYF D+ Q PFA E A PWN Sbjct: 178 SKIRSYLIDYHQLPPQGAVRVEELINYFTYDYATPTDQ-------KPFAANVEAAACPWN 230 Query: 193 EQRTLLKVDILAKDRKSEELPASNLVFLIDTSGSMISDERLPLIQSSLKLLVKELREQDN 252 + L+++ I K+ + E PASNLVFL+D SGSM + +LPL++ +KLLV +L E D Sbjct: 231 AEHRLVRIGIKGKEIANAERPASNLVFLLDVSGSMNNARKLPLLKQGMKLLVDQLGENDK 290 Query: 253 IAIVTYAGDSRIALPSISGSHKAEINAAIDSLDAEGSTNGGAGLELAYQQATKGFIKGGI 312 +AIV YAG + + L S +G K+ I A+D L A GSTNGG G+ELAYQ AT+ FIKGG+ Sbjct: 291 VAIVVYAGAAGMVLNSTNGDDKSTIMEALDRLQAGGSTNGGQGIELAYQAATENFIKGGV 350 Query: 313 NRILLATDGDFNVGIDDPKSIESMVKKQRESGVTLSTFGVGNSNYNEAMMVRIADVGNGN 372 NR++L TDGDFNVG+ + +M + +SGV LS G G N+N+AMM ++ NGN Sbjct: 351 NRVILCTDGDFNVGVTSTSDLVTMAADKAKSGVFLSVMGFGTGNHNDAMMEELSGKANGN 410 Query: 373 YSYIDTLSEAQKVLNSEMRQMLITVAKDVKAQIEFNPAWVTEYRQIGYEKRQLRVEHFNN 432 Y++IDT++EA+KVL +M L T+AKDVK QIEFNP V YR +GYE R L E FN+ Sbjct: 411 YAFIDTITEAKKVLVEQMSGTLTTIAKDVKIQIEFNPTKVAAYRLVGYENRLLANEDFND 470 Query: 433 DNVDAGDIGAGKHITLLFELTL-----NGQKASIDKLRYAPDNKLAKSDKTKELAWLKIR 487 D DAG+IGAG +T +E+ A +D L+Y + + + EL LKIR Sbjct: 471 DKKDAGEIGAGHCVTAFYEIVPASVESPVTTAKVDDLKYQATRDVTPAADSDELLTLKIR 530 Query: 488 WKYPQGKESQLVEFPLGPT---INAPSEDMRFRAAVAAYGQKLRGSEYLNNTSWQQIKQW 544 +K P ES L+ + + S D +F A VA +G LR + + +I + Sbjct: 531 YKQPDEDESSLISVGVKDSGNRFAQASGDFQFAAGVAMFGMLLRAGDQDAKVNLDEITEL 590 Query: 545 AQQAKGEDPQGYRAEFIRLIELADGV 570 G+D YR EF+++++ A + Sbjct: 591 VSNNVGDDS--YRGEFLKIVQAAKTL 614 >UniRef50_Q2N8R4 Von Willebrand factor type A domain protein n=2 Tax=Erythrobacter RepID=Q2N8R4_ERYLH Length = 580 Score = 501 bits (1290), Expect = e-140, Method: Composition-based stats. Identities = 232/577 (40%), Positives = 332/577 (57%), Gaps = 22/577 (3%) Query: 1 MRNKNIIMLLMSSLILSGCGPQPENKE------SQQQQPSTPTEQQVLAAQQAAIKEAEQ 54 MR + M ++ L+ C Q E S+ + S + A Sbjct: 1 MRIIRFATVSMLAVALASCASQSSEGERIVVTGSKADRSSDASPPPPPPPPPPPPSPAYA 60 Query: 55 SAAAAKALAQQEVQQYSDKQALQGRLQEAPTFARAAKAKATHIANPGTARYQQFDDNPVK 114 + A + + + G+ EA RY + +PVK Sbjct: 61 AQQAVVVSGSRIASEAAVAPDTSGQPAEAAGREYRYVMPVIVPQPEDRERYDGEEVSPVK 120 Query: 115 QVAQNPLATFSLDVDTGSYANVRRFLNQGLLPPPDAVRVEEIVNYFPSDWDIKDKQSIPA 174 A PL+TFS+DVDTG+YAN RRFL+QG +PP AVR EE +NYF D+D P Sbjct: 121 IAAVEPLSTFSVDVDTGAYANARRFLSQGQMPPKAAVRTEEFINYFRYDYD------RPQ 174 Query: 175 SKPIPFAMRYELAPAPWNEQRTLLKVDILAKDRKSEELPASNLVFLIDTSGSMISDERLP 234 + PF + ++ A PWNE L+++ + D + E P +NLVFL+D SGSM ++LP Sbjct: 175 DRSQPFTVNFDAARTPWNEDTRLIRIGLAGYDIERSERPPANLVFLMDVSGSMGRPDKLP 234 Query: 235 LIQSSLKLLVKELREQDNIAIVTYAGDSRIALPSISGSHKAEINAAIDSLDAEGSTNGGA 294 L++++L L EL+ QD ++IV YAG + + L + + K I AA++ L A GST GGA Sbjct: 235 LVKTALAGLAGELQPQDKVSIVVYAGAAGLVLEPTNDTRK--IRAALNQLQAGGSTAGGA 292 Query: 295 GLELAYQQATKGFIKGGINRILLATDGDFNVGIDDPKSIESMVKKQRESGVTLSTFGVGN 354 G++LAYQ A FI+GG+NR++LATDGDFNVG+ ++ M++K+R+SG+TL+T G G Sbjct: 293 GIQLAYQIAEDNFIEGGVNRVILATDGDFNVGVSSRDALIEMIEKKRDSGITLTTLGFGT 352 Query: 355 SNYNEAMMVRIADVGNGNYSYIDTLSEAQKVLNSEMRQMLITVAKDVKAQIEFNPAWVTE 414 NYNEAMM +IA+ GNGNY+YID+ EA+KVL EM L T+AKDVK Q+EFNPA +++ Sbjct: 353 GNYNEAMMEQIANHGNGNYAYIDSALEAKKVLGDEMSSTLFTIAKDVKIQVEFNPAVISQ 412 Query: 415 YRQIGYEKRQLRVEHFNNDNVDAGDIGAGKHITLLFELTLNGQKASIDKLRYAPDNKLAK 474 YR IGYE R LR E F+ND VDAGDIGAG +T ++E+ G K I LRY A Sbjct: 413 YRLIGYENRALRDEDFDNDAVDAGDIGAGHQVTAIYEVVPVGTKGWIPPLRYGDRPAQAA 472 Query: 475 SDKTKELAWLKIRWKYPQGKESQLVEFPLGPTI----NAPSEDMRFRAAVAAYGQKLRGS 530 S++ +E A++K+R+K P G+ S+L+++ L + P D F +AVA +GQKLRG Sbjct: 473 SERAEEAAYVKLRYKMPDGETSKLIDYVLPASTLRTATMPRGDFAFASAVAGFGQKLRGD 532 Query: 531 EYLNNTSWQQIKQWAQQAKGEDPQGYRAEFIRLIELA 567 L + ++ + + A G +R EF++L LA Sbjct: 533 PMLGDFAYDDLARLA----GTQQDFWRQEFVKLTSLA 565 >UniRef50_C8N8N5 von Willebrand factor type A domain protein n=2 Tax=Gammaproteobacteria RepID=C8N8N5_9GAMM Length = 563 Score = 500 bits (1287), Expect = e-140, Method: Composition-based stats. Identities = 231/570 (40%), Positives = 328/570 (57%), Gaps = 19/570 (3%) Query: 6 IIMLLMSSLILSGCGPQPENKESQQQQPSTPTEQQVLAAQQAAIKEAEQSAAAAKALAQQ 65 +I+ L+++ +S ++ +S + P E +A A A A L+ + Sbjct: 6 LILALLAASGISHAAGLCDDLDS----DAPPVEYAARSAPVLQKAAASPQAQAIPDLSNR 61 Query: 66 EVQQYSDKQALQGRLQEAPTFARAAKAKATHIANPGTARYQQFDDNPVKQVAQNPLATFS 125 + + AR + RY D NPV +V+ P++TFS Sbjct: 62 LGVNMPVPAGANPQWALNKSVARMGIRGTVEVQ--NRERYAHSDANPVHRVSDAPVSTFS 119 Query: 126 LDVDTGSYANVRRFL-NQGLLPPPDAVRVEEIVNYFPSDWDIKDKQSIPASKPIPFAMRY 184 +DVDTGSY+N+RR L + LPP DAVRVEEI+NYF + + PFA+ Sbjct: 120 IDVDTGSYSNIRRMLTRENRLPPADAVRVEEILNYFAYGYPLPQ-------DGKPFAVHT 172 Query: 185 ELAPAPWNEQRTLLKVDILAKDRKSEELPASNLVFLIDTSGSMISDERLPLIQSSLKLLV 244 + +PW L+++ I A D E+ P +NLVFLIDTSGSM ++LPL++ ++ Sbjct: 173 QTVDSPWQADAKLIRIAIQAADLAPEKRPPANLVFLIDTSGSMDDPDKLPLVKKTVCHFA 232 Query: 245 KELREQDNIAIVTYAGDSRIALPSISGSHKAEINAAIDSLDAEGSTNGGAGLELAYQQAT 304 + LR D I+++TY+G + LP +G K I AA+ L A G+T GG L +AY A Sbjct: 233 EALRADDRISLITYSGSTAEILPPTAGDQKETIIAALKPLRAHGATAGGEALRMAYDAAA 292 Query: 305 KGFIKGGINRILLATDGDFNVGIDDPKSIESMVKKQRESGVTLSTFGVGNSNYNEAMMVR 364 K + K GINRILLATDGDFNVGI DP ++++ V +R+SG++L+T G G+ NYN+ MM + Sbjct: 293 KNYRKDGINRILLATDGDFNVGISDPATLKNYVADKRKSGISLTTLGYGSGNYNDEMMEQ 352 Query: 365 IADVGNGNYSYIDTLSEAQKVLNSEMRQMLITVAKDVKAQIEFNPAWVTEYRQIGYEKRQ 424 +AD G+GNYSYID+ +EA+KVL ++ L TVA+D+K Q+EFNPA V EYR +GYE R Sbjct: 353 LADAGDGNYSYIDSEAEAKKVLVRQLTSTLATVARDIKIQLEFNPAAVKEYRLVGYENRL 412 Query: 425 LRVEHFNNDNVDAGDIGAGKHITLLFELTLNGQKASIDKLRYAPDNKLAKSDKTKELAWL 484 LR E FNND VDAGDIGAG +IT L+E+ G+ +D Y N A S K E WL Sbjct: 413 LREEDFNNDRVDAGDIGAGHNITALYEIIPQGKTGWLDARHY--QNAPAASGKADEYGWL 470 Query: 485 KIRWKYPQGKESQLVEFPLGP---TINAPSEDMRFRAAVAAYGQKLRGSEYLNNTSWQQI 541 K+R+K P+ ++SQL+E P+ + E RF A A+Y Q L+G +Y W I Sbjct: 471 KLRYKAPESEQSQLIEQPIAAKSIPLADAEEATRFAIAAASYAQALKGGKYNGALDWAGI 530 Query: 542 KQWAQQAKGEDPQGYRAEFIRLIELADGVT 571 + AQ A+G DP RA ++LIE A ++ Sbjct: 531 LRLAQAAQGSDPYDERAGLLQLIEKARELS 560 >UniRef50_Q21MJ3 von Willebrand factor, type A n=5 Tax=Proteobacteria RepID=Q21MJ3_SACD2 Length = 708 Score = 498 bits (1282), Expect = e-139, Method: Composition-based stats. Identities = 214/550 (38%), Positives = 310/550 (56%), Gaps = 15/550 (2%) Query: 33 PSTPTEQQVLAAQQAAIKEAEQSAAAAKALAQQEVQQYSDKQALQGRLQEAPTFARAAKA 92 PS E+ V+ +++ E+ + + A +Q + A R A Sbjct: 163 PSAALEEVVVTGMRSSAAESAKLSKKPAASQRQVSAIRAQDIGALPDQSNAVALQRIAGM 222 Query: 93 KAT-----HIANPGTARYQQFDDNPVKQVAQNPLATFSLDVDTGSYANVRRFLNQGLLPP 147 A G +++ ++N VK VA+ P++TFS+DVDT SY+ VRR LN G LP Sbjct: 223 PVDGDTIVAPAPQGNDKFEHVEENSVKSVAEAPVSTFSIDVDTASYSFVRRQLNSGYLPE 282 Query: 148 PDAVRVEEIVNYFPSDWDIKDKQSIPASKPIPFAMRYELAPAPWNEQRTLLKVDILAKDR 207 DA+R EE++NYF ++ + P+ PF + +PW + + L+ + + D Sbjct: 283 KDAIRAEELINYFDYNYPL------PSDSTAPFKPNITVIDSPWAKGKKLVHIGLKGYDI 336 Query: 208 KSEELPASNLVFLIDTSGSMISDERLPLIQSSLKLLVKELREQDNIAIVTYAGDSRIALP 267 ++ P +NLVFL+D SGSM S ++LPL++ S+++L+ L D +AIV YAG + L Sbjct: 337 APDQKPRTNLVFLLDVSGSMNSQDKLPLVKQSMEMLLSTLNPDDTVAIVVYAGAAGTVLE 396 Query: 268 SISGSHKAEINAAIDSLDAEGSTNGGAGLELAYQQATKGFIKGGINRILLATDGDFNVGI 327 K +I +A+ L A GST GGAG+ LAY A F K +NR++LATDGDFNVG Sbjct: 397 PTPAKDKQKILSAMQRLQAGGSTAGGAGIALAYDLAEANFDKKAVNRVILATDGDFNVGS 456 Query: 328 DDPKSIESMVKKQRESGVTLSTFGVGNSNYNEAMMVRIADVGNGNYSYIDTLSEAQKVLN 387 + ++++ V+++RE G+ LS G G NYN+ +M +A GNG +YIDT+SEAQKVL Sbjct: 457 TNNETLQGFVERKREKGIFLSVLGFGQGNYNDHLMQTLAQNGNGVAAYIDTVSEAQKVLV 516 Query: 388 SEMRQMLITVAKDVKAQIEFNPAWVTEYRQIGYEKRQLRVEHFNNDNVDAGDIGAGKHIT 447 E L +AKDVK Q+EFNPA V EYR IGYE R L E FNND VDAGDIGAG +T Sbjct: 517 QEASSSLFPIAKDVKIQVEFNPATVAEYRLIGYETRALNREDFNNDAVDAGDIGAGHTVT 576 Query: 448 LLFELTLNGQKASIDKLRYAPDNKLAKSDKTKELAWLKIRWKYPQGKESQLVEFPLGPTI 507 ++E+T G A + + AK+ E + K+R+K P S+L+E P+ Sbjct: 577 AIYEITPVGSSAVLIDESRYAQKEKAKAPTNAEYGFFKLRYKLPSEDTSRLIEAPILQQQ 636 Query: 508 ----NAPSEDMRFRAAVAAYGQKLRGSEYLNNTSWQQIKQWAQQAKGEDPQGYRAEFIRL 563 +++ F AVAAY QKL+GS +LN S+ I AQ +KG D GYR EF++L Sbjct: 637 PLVPAELMQEVNFSVAVAAYAQKLKGSNFLNKYSYHDIIALAQASKGSDEYGYRTEFVQL 696 Query: 564 IELADGVTDI 573 + A+ D+ Sbjct: 697 VRKAELADDL 706 >UniRef50_A9IU52 Putative lipoprotein n=3 Tax=Bordetella RepID=A9IU52_BORPD Length = 582 Score = 494 bits (1272), Expect = e-138, Method: Composition-based stats. Identities = 250/557 (44%), Positives = 338/557 (60%), Gaps = 13/557 (2%) Query: 17 SGCGPQPENKESQQQQPSTPTEQQVLAAQQAAIKEAEQSAAAAKALAQQEVQQYSDKQAL 76 S + + V +A A + A +QY+ Sbjct: 30 SAADAARALTGAGKAGNPGTVPPAVPSAPPAPPAAEADAGAPRARPGAALTRQYAP--QA 87 Query: 77 QGRLQEAPTFARAAKAKATHIANPGTARYQQFDDNPVKQVAQNPLATFSLDVDTGSYANV 136 A + A Y ++ DNPV + P++TF DVDTGSY NV Sbjct: 88 YSAQPAAVSLLPAPSGYYAPPQAEERENYARYRDNPVVAAQEQPVSTFGADVDTGSYTNV 147 Query: 137 RRFLNQGLLPPPDAVRVEEIVNYFPSDWDIKDKQSIPASKPIPFAMRYELAPAPWNEQRT 196 RR LN+G LPPPDAVR EE +NYF + + P S+ PF++ E++ APWN QR Sbjct: 148 RRLLNEGRLPPPDAVRAEEFINYFDYGY------ATPDSRQQPFSIITEVSAAPWNPQRQ 201 Query: 197 LLKVDILAKDRKSEELPASNLVFLIDTSGSMISDERLPLIQSSLKLLVKELREQDNIAIV 256 LLK+ I +++PA+NLVFL+DTSGSM ++LPLI+ +LK LV +LR QD +AIV Sbjct: 202 LLKIGIQGYRVAPQDIPAANLVFLVDTSGSMAERDKLPLIKGALKQLVAQLRPQDRVAIV 261 Query: 257 TYAGDSRIALPSISGSHKAEINAAIDSLDAEGSTNGGAGLELAYQQATKGFIKGGINRIL 316 TYAG + + L S G KA INAAID L A GSTNGGAGL+LAY QA KGF+KGG+NRIL Sbjct: 262 TYAGQASMTLDSTPGDQKARINAAIDELRAAGSTNGGAGLDLAYAQAAKGFVKGGVNRIL 321 Query: 317 LATDGDFNVGIDDPKSIESMVKKQRESGVTLSTFGVGNSNYNEAMMVRIADVGNGNYSYI 376 LA+DGDFNVG D + ++ + +QR+ G+ L+T GVG N+N+A+ +++AD GNG+Y Y+ Sbjct: 322 LASDGDFNVGATDLEDLKDKIARQRQGGIALTTLGVGGGNFNDALAMQLADAGNGSYHYL 381 Query: 377 DTLSEAQKVLNSEMRQMLITVAKDVKAQIEFNPAWVTEYRQIGYEKRQLRVEHFNNDNVD 436 D+L EA+KVL ++M L+T+A+DVK Q+EFNPA V EYR IGYEKR L E FNND VD Sbjct: 382 DSLREARKVLAAQMSSTLLTIARDVKIQVEFNPAVVAEYRLIGYEKRALAREDFNNDRVD 441 Query: 437 AGDIGAGKHITLLFELTL-NGQKASIDKLRYAPDNKLAKSDKTKELAWLKIRWKYPQGKE 495 AG+IGAG ++T L+E+T A +D LRY A + ELA++++R+K P + Sbjct: 442 AGEIGAGANVTALYEITPLAAGGARLDPLRY--GKPAADAGPADELAFVRVRYKLPGASD 499 Query: 496 SQLVEF--PLGPTINAPSEDMRFRAAVAAYGQKLRGSEYLNNTSWQQIKQWAQQAKGEDP 553 SQLVE P A ++ MR AA AA+ Q LRG +YL++ S QI A+ A+G+DP Sbjct: 500 SQLVEQAVPRADARAAGTDGMRRAAAAAAFAQWLRGGKYLDDYSPAQIAALARGARGDDP 559 Query: 554 QGYRAEFIRLIELADGV 570 G AE L+E+A G+ Sbjct: 560 HGLNAELAALVEMAAGL 576 >UniRef50_A4BJU7 Putative uncharacterized protein n=1 Tax=Reinekea blandensis MED297 RepID=A4BJU7_9GAMM Length = 555 Score = 494 bits (1271), Expect = e-138, Method: Composition-based stats. Identities = 207/559 (37%), Positives = 303/559 (54%), Gaps = 23/559 (4%) Query: 9 LLMSSLILSGCGPQPENKESQQQQPSTPTEQQVLAAQQAAIKEAEQSAAAAKALAQQEVQ 68 ++++L+L CG Q + P+ E +A ++ A + ++ EVQ Sbjct: 9 SVVATLMLLSCGTQTTDGALTD--PAVLQEPVHTEETRAIETDSADQVFLAASKSRVEVQ 66 Query: 69 QYSDKQALQGRLQEAPTFARAAKAKATHIANPGTARYQQFDDNPVKQVAQNPLATFSLDV 128 + + + Y + +P++QVA +P++TFS DV Sbjct: 67 E-----------SYVLPSSTPIIPMPNPPVSENRENYPKTPISPIRQVATDPVSTFSTDV 115 Query: 129 DTGSYANVRRFLNQGLLPPPDAVRVEEIVNYFPSDWDIKDKQSIPASKPIPFAMRYELAP 188 DT SY N RRFLNQG+ PP D++RVEE +NYF P + P + E Sbjct: 116 DTASYTNARRFLNQGMRPPADSIRVEEFINYFDYA------LPAPDTTNTPIQISTERTQ 169 Query: 189 APWNEQRTLLKVDILAKDRKSEELPASNLVFLIDTSGSMISDERLPLIQSSLKLLVKELR 248 PWN Q L++V + + + LP NLVFL+D SGSM S ++LPL+Q S LLV +LR Sbjct: 170 TPWNPQTELVRVSLQSYRSDFKTLPPLNLVFLLDVSGSMNSPDKLPLMQRSFNLLVSQLR 229 Query: 249 EQDNIAIVTYAGDSRIALPSISGSHKAEINAAIDSLDAEGSTNGGAGLELAYQQATKGFI 308 QD +AI YAG S + L SG KA+IN AI+ L A G T+G AG+ LAY A ++ Sbjct: 230 PQDRVAIAVYAGQSGVVLEPTSGDQKAQINQAINQLRAGGGTHGSAGIHLAYDLAQANYL 289 Query: 309 KGGINRILLATDGDFNVGIDDPKSIESMVKKQRESGVTLSTFGVGNSNYNEAMMVRIADV 368 GINRI + TDGDFNVG ++++++++RE+GV LS G G NYN+A+M +++ Sbjct: 290 PDGINRIFIGTDGDFNVGTTSLTELKALIERKREAGVFLSVLGFGTGNYNDALMEELSNH 349 Query: 369 GNGNYSYIDTLSEAQKVLNSEMRQMLITVAKDVKAQIEFNPAWVTEYRQIGYEKRQLRVE 428 GNG Y+D+ EA+K+ +++ L TVAKDVK QIEFNPA V EYR IGY+ R L E Sbjct: 350 GNGTAYYLDSYQEARKLFATQLAATLQTVAKDVKIQIEFNPAQVAEYRLIGYDNRLLARE 409 Query: 429 HFNNDNVDAGDIGAGKHITLLFELTLNGQKAS-IDKLRYAPDNKLAKSDKTKELAWLKIR 487 FNND +DAG++G+G +T L+E+ + D LRY D+ E+A++K R Sbjct: 410 DFNNDAIDAGEMGSGHAVTALYEIVRRDSEFRFSDPLRYQDDDLSDTVG--GEIAFVKAR 467 Query: 488 WKYPQGKESQLVEFPLGPT-INAPSEDMRFRAAVAAYGQKLRGSEYLNNTSWQQIKQWAQ 546 +K P S+L+ + T + + S+ VA + + LRGS YL + S + Sbjct: 468 YKLPDEAHSRLLSQAITDTPMQSSSQRQALAIGVAGFAEILRGSPYLRDWSINDAIDYIG 527 Query: 547 QAKGEDPQGYRAEFIRLIE 565 + ED GYR E + L+ Sbjct: 528 PSLQEDRWGYRQELVTLMR 546 >UniRef50_UPI0001C375AE von Willebrand factor type A n=1 Tax=Ruminococcus flavefaciens FD-1 RepID=UPI0001C375AE Length = 550 Score = 481 bits (1239), Expect = e-134, Method: Composition-based stats. Identities = 183/477 (38%), Positives = 270/477 (56%), Gaps = 13/477 (2%) Query: 98 ANPGTARYQQFDDNPVKQVAQNPLATFSLDVDTGSYANVRRFLNQGLLPPPDAVRVEEIV 157 Y+ + + K PL+TFS DVDT SY NVRR + + P DAVR+EE + Sbjct: 79 LPSANEEYKGYTEAGFKDTKSEPLSTFSADVDTASYTNVRRLIENRNIVPEDAVRIEEFI 138 Query: 158 NYFPSDWDIKDKQSIPASKPIPFAMRYELAPAPWNEQRTLLKVDILAKDRKSEELPASNL 217 NYF D+ + S F E+A PWN L+ V I K+ + +E P SNL Sbjct: 139 NYFDYDYPQPEDGSA-------FGRYVEIADCPWNRDHKLMMVGIQGKELQQQETPPSNL 191 Query: 218 VFLIDTSGSMISDERLPLIQSSLKLLVKELREQDNIAIVTYAGDSRIALPSISGSHKAEI 277 VFLID+SGSM S ++LPL+QS+ +L ++L + D I+IVTYAG S + L GS+ EI Sbjct: 192 VFLIDSSGSMNSYDKLPLVQSAFSMLAEQLDKNDRISIVTYAGSSAVLLDGEKGSNTDEI 251 Query: 278 NAAIDSLDAEGSTNGGAGLELAYQQATKGFIKGGINRILLATDGDFNVGIDDPKSIESMV 337 + S+ A GSTNG G++ AY+ A + FIKGG NR++LATDGD NVG + + ++ Sbjct: 252 LEQLYSITASGSTNGEGGIKTAYELAEEHFIKGGNNRVILATDGDLNVGASSEEELTRLI 311 Query: 338 KKQRESGVTLSTFGVGNSNYNEAMMVRIADVGNGNYSYIDTLSEAQKVLNSEMRQMLITV 397 + +R++G+ LS G G NY +A M +AD GNGN+SYID+ EA++VL EM L T+ Sbjct: 312 ETKRDNGIYLSVLGFGEGNYKDARMEALADNGNGNFSYIDSEDEAERVLVQEMSGTLYTI 371 Query: 398 AKDVKAQIEFNPAWVTEYRQIGYEKRQLRVEHFNNDNVDAGDIGAGKHITLLFELTL--N 455 AKDVK Q+EFNP+ V+ YR IGY+ R + E F +D DAG++G+G +T L+E+ + Sbjct: 372 AKDVKIQVEFNPSQVSSYRLIGYDNRLMNAEDFLDDTKDAGEVGSGHSVTALYEIEMADT 431 Query: 456 GQKASIDKLRYAPDNKLAKSDKTK--ELAWLKIRWKYPQGKESQLVE--FPLGPTINAPS 511 G L +A ++ ++ E+ L I +K P G E++ + + ++PS Sbjct: 432 GDSYHGVPLEFASEHDSIPAENNGRSEICKLSIAYKTPVGNENRNTSDLYSMENYSSSPS 491 Query: 512 EDMRFRAAVAAYGQKLRGSEYLNNTSWQQIKQWAQQAKGEDPQGYRAEFIRLIELAD 568 M+ A A +G LR S+Y + + + Q K D + +++ AD Sbjct: 492 NSMKLAQAAAGFGMVLRNSDYKGDADFDTVLDILDQLKVNDNDKINELYGLILDAAD 548 >UniRef50_D0XSR4 von Willebrand factor type A n=1 Tax=Brevundimonas subvibrioides ATCC 15264 RepID=D0XSR4_9CAUL Length = 625 Score = 479 bits (1233), Expect = e-133, Method: Composition-based stats. Identities = 189/502 (37%), Positives = 283/502 (56%), Gaps = 18/502 (3%) Query: 81 QEAPTFARAAKAKATHIANPGTARYQQFDDNPVKQVAQNPLATFSLDVDTGSYANVRRFL 140 T T RY NPV++VA P++TFS+DVDT +YANVRRF+ Sbjct: 124 AAGQTTVDGVVVPGRPGTRVDTERYPDATPNPVRRVADEPVSTFSIDVDTAAYANVRRFI 183 Query: 141 NQGLLPPPDAVRVEEIVNYFPSDWDIKDKQSIPASKPIPFAMRYELAPAPWNEQ-----R 195 ++G PP DAVRVEE++NYF + + P PFA+ +A +PW+ R Sbjct: 184 SEGQTPPRDAVRVEEMINYFDYGY------ARPGRADEPFAVSTAVAASPWSANAGAGGR 237 Query: 196 TLLKVDILAKDRKSEELPASNLVFLIDTSGSMISDERLPLIQSSLKLLVKELREQDNIAI 255 ++ + + + + E NL F++D SGSM S ++L L Q ++ L++ LR +D +A+ Sbjct: 238 QIVHIGLQGYELPAGERRPLNLTFMVDVSGSMQSPDKLGLAQQTMNLIIDRLRPEDRVAV 297 Query: 256 VTYAGDSRIALPSISGSHKAEINAAIDSLDAEGSTNGGAGLELAYQQATKGFIKGGINRI 315 YA D A+ GS K ++ A+ +L+A GST G G+ AY+QA F +NRI Sbjct: 298 TYYASDVGTAVGPTPGSEKLKLRCAVAALNAGGSTAGAQGMVNAYEQAEAAFSPDKVNRI 357 Query: 316 LLATDGDFNVGIDDPKSIESMVKKQRESGVTLSTFGVGNSNYNEAMMVRIADVGNGNYSY 375 L+ TDGDFNVG+ D + +E V +R +G+ LS +G G NY +A M IA GNG +Y Sbjct: 358 LMFTDGDFNVGVTDDRRLEDYVADKRGTGIYLSVYGFGRGNYQDARMQTIAQAGNGVAAY 417 Query: 376 IDTLSEAQKVLNSEMRQMLITVAKDVKAQIEFNPAWVTEYRQIGYEKRQLRVEHFNNDNV 435 +D L EA+ + + +A DVK Q+EFNPA V+EYR IGYE R L E F ND + Sbjct: 418 VDDLDEARCLFGPAFDRGAFPIADDVKIQVEFNPARVSEYRLIGYETRLLNEEDFANDAI 477 Query: 436 DAGDIGAGKHITLLFELTLNGQKASIDKLRYAPDN-KLAKSDKTKELAWLKIRWKYPQGK 494 DAG++G+G +T L+E+T G + I + RY + A D T E+ ++++R+K P Sbjct: 478 DAGEVGSGASVTALYEITPVGGASQIPERRYEANRAGDAGGDPTGEIGFVQVRYKLPGQP 537 Query: 495 ESQLVEFPLGPTINAP-----SEDMRFRAAVAAYGQKLRGSEYLN-NTSWQQIKQWAQQA 548 S+L++ P+ T + P E R+ AVA +GQ+LRG ++ + I AQ Sbjct: 538 TSRLIQQPISGTTDGPGSARLPEATRWAMAVAGFGQRLRGDPWMGADFDTAAILDLAQGV 597 Query: 549 KGEDPQGYRAEFIRLIELADGV 570 +GEDP G RA F++++ A+ + Sbjct: 598 RGEDPYGDRAAFVQMVRAAESL 619 >UniRef50_C0Z5D5 Putative uncharacterized protein n=1 Tax=Brevibacillus brevis NBRC 100599 RepID=C0Z5D5_BREBN Length = 513 Score = 474 bits (1221), Expect = e-132, Method: Composition-based stats. Identities = 176/526 (33%), Positives = 279/526 (53%), Gaps = 40/526 (7%) Query: 41 VLAAQQAAIKEAEQSAAAAKALAQQEVQQYSDKQALQGRLQEAPTFARAAKAKATHIANP 100 ++ + + +E + ++ Q + Q + + P+ K + P Sbjct: 19 GCSSSEQFVSRSESGNKPSASVEQGQSNQVASSPS-------PPSQLADYALKKSGDPLP 71 Query: 101 GTARYQQFDDNPVKQVAQNPLATFSLDVDTGSYANVRRFLNQGLLPPPDAVRVEEIVNYF 160 ++ + N A++ L+TF+ DVDT SY +R F+ G LPP +AVRVEE +N+F Sbjct: 72 NDMYFKDYGTNQFVSTAKDRLSTFAADVDTASYTIMRHFIKDGNLPPAEAVRVEEFINFF 131 Query: 161 PSDWDIKDKQSIPASKPIPFAMRYELAPAPWNEQRTLLKVDILAKDRKSEELPASNLVFL 220 P+ + PA FA++ + P+P+ + ++++ I K+ +E +NLVF+ Sbjct: 132 PTSY--------PAPTNQTFAIQADSGPSPFQKNLQIVRIGIKGKELSPKERKPANLVFV 183 Query: 221 IDTSGSMISDERLPLIQSSLKLLVKELREQDNIAIVTYAGDSRIALPSISGSHKAEINAA 280 ID SGSM + RL L++ SL +LV +L+ D++ IV Y + R+ LP S K I +A Sbjct: 184 IDVSGSMNQENRLELVKKSLHVLVDQLQPTDSVGIVVYGSEGRVLLPPTSTEDKQAILSA 243 Query: 281 IDSLDAEGSTNGGAGLELAYQQATKGFIKGGINRILLATDGDFNVGIDDPKSIESMVKKQ 340 ID L EGSTN GL L Y+ A + F INR++L +DG NVG + I ++ Sbjct: 244 IDELQPEGSTNAEQGLVLGYEMAARSFKPPAINRVILCSDGVANVGETGAEGILRSIEDY 303 Query: 341 RESGVTLSTFGVGNSNYNEAMMVRIADVGNGNYSYIDTLSEAQKVLNSEMRQMLITVAKD 400 + LS+FG G NYN+ MM ++A+ G G+Y+YIDT SEA+++ + L T+A+D Sbjct: 304 ARKDIYLSSFGFGMGNYNDVMMEQLANKGEGSYAYIDTFSEARRIFTESLTGTLQTIARD 363 Query: 401 VKAQIEFNPAWVTEYRQIGYEKRQLRVEHFNNDNVDAGDIGAGKHITLLFELTLNGQKAS 460 VK Q+EF+P V YR IGYE R +R E F ND DAG+IGAG +T L+E+ L Sbjct: 364 VKIQVEFDPKKVDSYRLIGYENRDVRDEDFRNDKTDAGEIGAGHSVTALYEVKLA----- 418 Query: 461 IDKLRYAPDNKLAKSDKTKELAWLKIRWKYPQGKESQLVEFPLGPTINAPSEDMRFRAAV 520 S EL +++R+ + ++ + + P+ + S D+ F AAV Sbjct: 419 --------------SPVHAELGTVRVRYHHASTQKVEEISEPV-KVQSTLSPDVTFLAAV 463 Query: 521 AAYGQKLRGSEYLNNTSWQQIKQWAQQ-AKGEDPQGYRAEFIRLIE 565 A YG+ LR S Y +S + + A+ A GE+ + EF+RL++ Sbjct: 464 AEYGEILRESPYAERSSLADVLKLAEATATGEE----QLEFVRLVK 505 >UniRef50_B4WCU1 von Willebrand factor type A domain protein n=2 Tax=Caulobacteraceae RepID=B4WCU1_9CAUL Length = 613 Score = 474 bits (1220), Expect = e-132, Method: Composition-based stats. Identities = 187/505 (37%), Positives = 277/505 (54%), Gaps = 15/505 (2%) Query: 75 ALQGRLQEAPTFARAAKAKATHIANPGTARYQQFDDNPVKQVAQNPLATFSLDVDTGSYA 134 A G+ ++ A T Y NPVK+ A P++TFS+DVDT +Y+ Sbjct: 109 AAPGQAPLNAVVVTGSRIMPGAPAPSDTETYPDATPNPVKRTADQPVSTFSIDVDTAAYS 168 Query: 135 NVRRFLNQGLLPPPDAVRVEEIVNYFPSDWDIKDKQSIPASKPIPFAMRYELAPAPWNEQ 194 NVRRF+++G PP DAVRVEE++N F + + P S PFA+ + +PW + Sbjct: 169 NVRRFIDEGRSPPADAVRVEELINAFDYGY------ARPTSLARPFAITTAVVASPWAPR 222 Query: 195 -----RTLLKVDILAKDRKSEELPASNLVFLIDTSGSMISDERLPLIQSSLKLLVKELRE 249 R ++ + + + E NL FL+D SGSM S ++L L + ++ L + LR Sbjct: 223 TERGGRQIVHIGLQGYELPQGEQRPLNLTFLVDVSGSMRSPDKLDLAKQAMNLAIDRLRP 282 Query: 250 QDNIAIVTYAGDSRIALPSISGSHKAEINAAIDSLDAEGSTNGGAGLELAYQQATKGFIK 309 QD +++ YA + L G K ++ A+ SL A G T G G+ AY QA F + Sbjct: 283 QDTLSVTYYAEGAGTTLQPTPGDQKLKMRCAVASLRASGGTAGATGMTNAYDQAQASFAR 342 Query: 310 GGINRILLATDGDFNVGIDDPKSIESMVKKQRESGVTLSTFGVGNSNYNEAMMVRIADVG 369 +NRIL+ TDGDFNVG+ D K +E V ++R +GV LS +G G NY +A M IA G Sbjct: 343 DKVNRILMFTDGDFNVGVTDNKRLEDYVAEKRGTGVYLSVYGFGRGNYQDARMQTIAQAG 402 Query: 370 NGNYSYIDTLSEAQKVLNSEMRQMLITVAKDVKAQIEFNPAWVTEYRQIGYEKRQLRVEH 429 NG +Y+ L +A+++ + +A DVK Q+EFNPA V E+R IGYE R L Sbjct: 403 NGVAAYVGDLRDARRLFGPMFDKGAFPIADDVKIQVEFNPARVAEWRLIGYETRLLNEAD 462 Query: 430 FNNDNVDAGDIGAGKHITLLFELTLNGQKASIDKLRYAPDN-KLAKSDKTKELAWLKIRW 488 F ND +DAG++G+G +T L+E+T G + + RY + + D E+ ++++R+ Sbjct: 463 FANDRIDAGEVGSGASVTALYEITPVGGPTQVPERRYPDNRIGVGGGDPNGEIGFIQVRY 522 Query: 489 KYPQGKESQLVEFPLGPTI--NAPSEDMRFRAAVAAYGQKLRGSEYL-NNTSWQQIKQWA 545 K P G S L++ PL P E R+ AVAA+GQKLR ++ + W Q+ A Sbjct: 523 KQPGGSRSDLIQQPLTSRAAGAQPPEATRWALAVAAFGQKLRNDPWMSADYGWDQVLAQA 582 Query: 546 QQAKGEDPQGYRAEFIRLIELADGV 570 Q A+GEDP G RAEF++L+ A + Sbjct: 583 QGARGEDPWGDRAEFVQLVRAARDL 607 >UniRef50_B0UJ22 von Willebrand factor type A n=1 Tax=Methylobacterium sp. 4-46 RepID=B0UJ22_METS4 Length = 654 Score = 470 bits (1210), Expect = e-131, Method: Composition-based stats. Identities = 209/523 (39%), Positives = 287/523 (54%), Gaps = 23/523 (4%) Query: 58 AAKALAQQEVQQYSDKQALQGRLQE--APTFARAAKAKATHIANP-GTARYQQFDDNPVK 114 A +A A + +Q + + ARAA A + P G R+ + + Sbjct: 119 AGEADAGRTLQAFRSSGGFRFEASPRGPAAMARAAGETAPVPSEPVGRDRFANAPEGGFR 178 Query: 115 QVAQNPLATFSLDVDTGSYANVRRFLNQGLLPPPDAVRVEEIVNYFPSDWDIKDKQSIPA 174 + P++T SL VDT SY VR LN+ LPPP AVR EE++NYFP + PA Sbjct: 179 ITREAPVSTVSLGVDTASYGIVRDALNRNHLPPPAAVRTEELINYFPYAYPA------PA 232 Query: 175 SKPIPFAMRYELAPAPWNEQRTLLKVDILAKDRKSEELPASNLVFLIDTSGSMISDERLP 234 S PF + + P+PW E R LL + I E P +NLVFL+DTSGSM + RLP Sbjct: 233 SPDAPFRVTASVFPSPWAEGRKLLHIGIRGYAVAPAERPPANLVFLVDTSGSMAAPNRLP 292 Query: 235 LIQSSLKLLVKELREQDNIAIVTYAGDSRIALPSISGSHKAEINAAIDSLDAEGSTNGGA 294 L++ SL +L+ L +D +A+V YAG+ L I AAI++L A GST GG Sbjct: 293 LVKQSLAMLLTTLDARDRVALVAYAGEVGTVLEPTPAGEAGRILAAIETLQAHGSTAGGE 352 Query: 295 GLELAYQQATKGFIKGGINRILLATDGDFNVGIDDPKSIESMVKKQRESGVTLSTFGVGN 354 G+ AY A + F +NR++LATDGDFNVGI + V ++R G+ LS G G Sbjct: 353 GIRQAYALAARHFDPKAVNRVILATDGDFNVGITGRDELTGFVARERRKGIFLSVLGFGM 412 Query: 355 SNYNEAMMVRIADVGNGNYSYIDTLSEAQKVLNSEMRQMLITVAKDVKAQIEFNPAWVTE 414 N N+A+M +A GNG ++IDT EA+KVL E LI +A+DVK Q+EFNPA V E Sbjct: 413 GNLNDALMQALAKDGNGVAAHIDTAQEARKVLVEEATSTLIPIARDVKIQVEFNPATVAE 472 Query: 415 YRQIGYEKRQLRVEHFNNDNVDAGDIGAGKHITLLFELTLNGQKASIDKLRYAPDNKLAK 474 YR IGYE R L F ND DAG++G+G+ +T L+E+ K LRYA ++ A Sbjct: 473 YRLIGYETRPLDRADFANDEADAGEVGSGQTVTALYEIVPADGKRVTGDLRYA-PHEAAP 531 Query: 475 SDKTKELAWLKIRWKYPQGKESQLVEFPLGPTINAPSEDMRFRAA---------VAAYGQ 525 + +++ A + IR+K P +ES L+E P+GP A RF A VAA+GQ Sbjct: 532 APASRDYAHVAIRFKRPDARESTLIETPVGPEGEAA----RFAEAPQEARFAAAVAAFGQ 587 Query: 526 KLRGSEYLNNTSWQQIKQWAQQAKGEDPQGYRAEFIRLIELAD 568 LRG ++ S + + A A+G+DP GYRAEF+ L+ A Sbjct: 588 ILRGGKHTGRFSLDDVIRIAAPARGDDPFGYRAEFLGLVRAAK 630 >UniRef50_A9GVG2 Putative uncharacterized protein n=1 Tax=Sorangium cellulosum 'So ce 56' RepID=A9GVG2_SORC5 Length = 656 Score = 465 bits (1196), Expect = e-129, Method: Composition-based stats. Identities = 169/480 (35%), Positives = 262/480 (54%), Gaps = 31/480 (6%) Query: 99 NPGTARYQQFDDNPVKQVAQNPLATFSLDVDTGSYANVRRFLNQGLLPPPDAVRVEEIVN 158 G+ Y+ + NPV+ A++ L+TF++DVDT SYA RR + G LPP AVR EE +N Sbjct: 191 PQGSETYRDYGVNPVEDPAKDRLSTFAIDVDTASYAIARRKIMDGALPPYQAVRAEEFLN 250 Query: 159 YFPSDWDIKDKQSIPASKPIPFAMRYELAPAPWNEQRTLLKVDILAKDRKSEELPASNLV 218 YF + PFA+ AP+P+ L++V + K +E +LV Sbjct: 251 YFDYGYASP--------AAGPFAVHLAAAPSPFTSGHHLVRVAVQGKRVPVKERTPVHLV 302 Query: 219 FLIDTSGSMISDERLPLIQSSLKLLVKELREQDNIAIVTYAGDSRIALPSISGSHKAEIN 278 +L+DTSGSM S +++ L + SLK+L L+ D +A+ TYAG R L K +I Sbjct: 303 YLVDTSGSMQSPDKIELAKKSLKMLTDTLKPGDTVALCTYAGSVREVLAPTGIESKGKIL 362 Query: 279 AAIDSLDAEGSTNGGAGLELAYQQATKGFIKGGINRILLATDGDFNVGIDDPKSIESMVK 338 AA+ L A GST +G++LAY A + +KG +NR+++ +DGD NVG I +K Sbjct: 363 AALADLTAGGSTAMSSGIDLAYSLAERTLVKGHVNRVIVLSDGDANVGPTSHDEILKTIK 422 Query: 339 KQRESGVTLSTFGVGNSNYNEAMMVRIADVGNGNYSYIDTLSEAQKVLNSEMRQMLITVA 398 + R+ G+TLST G G NY + MM ++A+ G+GNY+YID+ ++A++V + ++ ML +A Sbjct: 423 RARDKGITLSTVGFGQGNYKDLMMEQLANQGDGNYAYIDSEAQARRVFSEQVGGMLQVIA 482 Query: 399 KDVKAQIEFNPAWVTEYRQIGYEKRQLRVEHFNNDNVDAGDIGAGKHITLLFELTLNGQK 458 +DVK Q+EF+P++V YR IGYE R + F ND VDAG+IGAG +T ++++ L Sbjct: 483 RDVKIQVEFDPSFVKSYRLIGYENRDVADRDFRNDKVDAGEIGAGHSVTAIYDVELKAP- 541 Query: 459 ASIDKLRYAPDNKLAKSDKTKELAWLKIRWKYPQGKESQLVEF------PLGPTINAPSE 512 A + +++R K P G + + PT +A Sbjct: 542 --------------APKGEGAAPIVVRLRHKAPLGSNTAEETLVKMAPGAIAPTFDAAPA 587 Query: 513 DMRFRAAVAAYGQKLRGSEYLNNTSWQQIKQWAQQAKGEDPQGYRAEFIRLIELADGVTD 572 D RF +AVA + + LR S + + I++ A+ A +G + EFI +I A + + Sbjct: 588 DFRFASAVAGFAEVLRHSPHARSWRLADIEKIARAAASS--KGDQQEFIGIIRRAGALAN 645 >UniRef50_A9AWD1 von Willebrand factor type A n=1 Tax=Herpetosiphon aurantiacus ATCC 23779 RepID=A9AWD1_HERA2 Length = 610 Score = 462 bits (1188), Expect = e-128, Method: Composition-based stats. Identities = 171/589 (29%), Positives = 289/589 (49%), Gaps = 57/589 (9%) Query: 9 LLMSSLILSGCGPQPENKESQQQQPSTPTEQQVLAAQQAAIKEAEQSAAAAKALAQQEVQ 68 +++ +LI+S CG + Q + + +A + + A + A A Q Sbjct: 54 IVLIALIISACGGEASLPTINPQPQRPAPQPRPTSAADDQSAQWPTAEATSVAPAPQ--- 110 Query: 69 QYSDKQALQGRLQEAPTFARAAKAK-------ATHIANPG-------------TARYQQF 108 Q P AA T +P + ++ + Sbjct: 111 ----PMPTQAADAGQPVPNPAAGKPLVDTWELPTQPIDPNPNYAYEQDQEIFDSMYFKNY 166 Query: 109 DDNPVKQVAQNPLATFSLDVDTGSYANVRRFLNQGLLPPPDAVRVEEIVNYFPSDWDIKD 168 NP + +PL+TF++D+D+ SY+ +R +NQGLLPP D+VRVEE +N F ++ + Sbjct: 167 GTNPFVRTETDPLSTFAMDIDSASYSLMRSSINQGLLPPADSVRVEEYLNAFDYEYPQPE 226 Query: 169 KQSIPASKPIPFAMRYELAPAPWN-EQRTLLKVDILAKDRKSEELPASNLVFLIDTSGSM 227 FA+ E+AP+P+ L+++ I A+ + + + L F+IDTSGSM Sbjct: 227 --------DGDFAIYSEVAPSPFGGPNYELVQIGIQARSIEVADRKPAALTFVIDTSGSM 278 Query: 228 ISDERLPLIQSSLKLLVKELREQDNIAIVTYAGDSRIALPSISGSHKAEINAAIDSLDAE 287 D RL +++++L L +L D++AIV + R+ L SG ++ +I AI+SL+ Sbjct: 279 AQDNRLEMVKNALIYLAGQLEPDDSLAIVAFNDGMRVVLNPTSGENQMDIITAINSLEPA 338 Query: 288 GSTNGGAGLELAYQQATKGFIKGGINRILLATDGDFNVGIDDPKSIESMVKKQRESGVTL 347 GSTN AGL ++ A + F GINRILL +DG N G+ +P + + ++ ++GV L Sbjct: 339 GSTNAEAGLYKGFELAWQAFKPEGINRILLCSDGVANSGMTEPSQLLATFQQYLDAGVQL 398 Query: 348 STFGVGNSNYNEAMMVRIADVGNGNYSYIDTLSEAQKVLNSEMRQMLITVAKDVKAQIEF 407 ST+GVG NYN+ ++ ++AD G+GNY+Y D+ EAQ++ ++ L T+ ++ K Q+ F Sbjct: 399 STYGVGMGNYNDILLEQLADKGDGNYAYFDSADEAQRLFGEQLTGSLQTIGREAKIQVNF 458 Query: 408 NPAWVTEYRQIGYEKRQLRVEHFNNDNVDAGDIGAGKHITLLFELTLNGQKAS---IDKL 464 +P V YR IGYE R + F ND+VD G++GAG +T L+E+ + + Sbjct: 459 DPNVVKRYRLIGYENRAVADSDFRNDSVDGGEVGAGHSVTALYEIKRHPDAQGPIAQVNI 518 Query: 465 RYAPDNKLAKSDKTKELAWLKIRWKYPQGKESQLVEFPLGPTINAPSEDMRFRAAVAAYG 524 RY + A +++ ++ +I + + S M +VA Y Sbjct: 519 RYISMDTNAPVEESLNISTAQIH-----------------SSFDRASARMHLATSVAEYA 561 Query: 525 QKLRGSEYLNNTSWQQIKQWAQQAKGEDPQGYRA-EFIRLIELADGVTD 572 + LR S + N T + A++A + P A EF+ L+ A+ + Sbjct: 562 ELLRHSRWNNGTDILDVLDLAEEAALDLPNNQSAVEFVTLLRRAEQMHQ 610 >UniRef50_A6GHE4 von Willebrand factor, type A n=1 Tax=Plesiocystis pacifica SIR-1 RepID=A6GHE4_9DELT Length = 785 Score = 459 bits (1182), Expect = e-127, Method: Composition-based stats. Identities = 179/461 (38%), Positives = 271/461 (58%), Gaps = 29/461 (6%) Query: 113 VKQVAQNPLATFSLDVDTGSYANVRRFLNQGLLPPPDAVRVEEIVNYFPSDWDIKDKQSI 172 ++ +TFS+DVDT SYA+VR+ L G +P P +VR EE++NYF + + Sbjct: 255 FVATGEDRKSTFSIDVDTASYASVRQSLRNGWMPDPGSVRTEEMINYFDYGYVAPSGGA- 313 Query: 173 PASKPIPFAMRYELAPAPWNEQRTLLKVDILAKDR---KSEELPASNLVFLIDTSGSMIS 229 PFA+ E+ P PW L+++ + A +++EL NLVFL+D SGSM S Sbjct: 314 ----GAPFAVHTEVGPCPWAPDHRLVQIGVQATRELPAQAQELRTRNLVFLLDVSGSMSS 369 Query: 230 DERLPLIQSSLKLLVKELREQDNIAIVTYAGDSRIALPSISGSHKAEINAAIDSLDAEGS 289 +LPLI+ LV++L +D+++IV YAG + + LP SG K I A+D L+A G Sbjct: 370 RGKLPLIKHGFTQLVEQLGAEDHVSIVVYAGAAGVVLPPTSGDQKETILGALDRLEAGGG 429 Query: 290 TNGGAGLELAYQQATKGFIKGGINRILLATDGDFNVGIDDPKSIESMVKKQRESGVTLST 349 TNG AG+ AY+ A F+ GG+NR++L TDGDFNVG+ D ++ +++++RESGV LS Sbjct: 430 TNGSAGIVEAYELAQANFVDGGVNRVILGTDGDFNVGLSDHDALVELIEQKRESGVFLSV 489 Query: 350 FGVGNSNYNEAMMVRIADVGNGNYSYIDTLSEAQKVLNSEMRQMLITVAKDVKAQIEFNP 409 GVG +Y++ +M ++AD GNGNY+++D EA+KVL E+ L T+AKDVK Q+ FNP Sbjct: 490 LGVG-GHYDDELMEQLADHGNGNYAFLDGKREAEKVLVEEIGGTLTTIAKDVKVQVAFNP 548 Query: 410 AWVTEYRQIGYEKRQLRVEHFNNDNVDAGDIGAGKHITLLFELTLNGQKASIDKLRYAPD 469 VT++R I Y+ R+L FN+D DAG+IG G ++T L+E+ + Sbjct: 549 EQVTKHRLIAYQNRRLAHRDFNDDTKDAGEIGVGHNVTALYEIIPADE------------ 596 Query: 470 NKLAKSDKTKELAWLKIRWKYPQGKESQLVEFPL---GPTINAPSEDMRFRAAVAAYGQK 526 ++ ++ L L++R+K P G S V + G +++ S+D RF AAVA +G+ Sbjct: 597 -----AEASEALMSLELRYKKPDGHRSTKVTTSVRDAGRSLDQNSDDFRFAAAVAGFGES 651 Query: 527 LRGSEYLNNTSWQQIKQWAQQAKGEDPQGYRAEFIRLIELA 567 L G + ++ + AQ A GED + R EF+ L A Sbjct: 652 LAGRRPDASWNYADTLELAQGALGEDARCLRHEFLELAWRA 692 >UniRef50_A8SXC8 Putative uncharacterized protein n=2 Tax=Clostridiales RepID=A8SXC8_9FIRM Length = 612 Score = 456 bits (1172), Expect = e-126, Method: Composition-based stats. Identities = 183/496 (36%), Positives = 257/496 (51%), Gaps = 27/496 (5%) Query: 101 GTARYQQFDDNPVKQVAQNPLATFSLDVDTGSYANVRRFLNQGLLPPPDAVRVEEIVNYF 160 T Y +N PL+TF+ D DT SY+NVR ++ G LPP AVR+EE++NYF Sbjct: 120 DTREYDSMTENGFVSTVDRPLSTFAADRDTASYSNVRSYIESGSLPPDGAVRIEEMLNYF 179 Query: 161 PSDWDIKDKQSIPASKPIPFAMRYELAPAPWNEQRTLLKVDILAKDRKSEELPASNLVFL 220 D+ K F++ E + PWN+ L+ V I + + SNLVFL Sbjct: 180 TYDYRKK------PEDGEKFSIYTEYSDCPWNKDTKLMMVGINTDEIDFGDKKPSNLVFL 233 Query: 221 IDTSGSMISDERLPLIQSSLKLLVKELREQDNIAIVTYAGDSRIALPSISGSHKAEINAA 280 IDTSGSM D +LPL+Q S +L + L E D ++IVTYAG+ + L GS + I+ A Sbjct: 234 IDTSGSMYDDNKLPLVQQSFAMLAENLDENDRVSIVTYAGEDTVVLSGTPGSEQYTISEA 293 Query: 281 IDSLDAEGSTNGGAGLELAYQQATKGFIKGGINRILLATDGDFNVGIDDPKSIESMV-KK 339 + ++ AEG TNGG + AY+ A K FI GG NR++LATDGD NVG+ + ++ ++ Sbjct: 294 LSNMTAEGCTNGGDAIITAYELAEKNFINGGNNRVILATDGDLNVGLTSESDLVDLITEE 353 Query: 340 QRESGVTLSTFGVGNSNYNEAMMVRIADVGNGNYSYIDTLSEAQKVLNSEMRQMLITVAK 399 ++E+ + LS G G N + + +AD G+G+Y++ID+ EA+KVL EM L TVAK Sbjct: 354 KKENNIFLSVLGFGTDNLKDNKLEALADNGDGSYAFIDSAYEAKKVLVDEMGGTLNTVAK 413 Query: 400 DVKAQIEFNPAWVTEYRQIGYEKRQLRVEHFNNDNVDAGDIGAGKHITLLFELTLNGQKA 459 DVK Q+EFNP V YRQIGYE R L F ND VD G+IGAG +T+L+E+ G Sbjct: 414 DVKFQLEFNPTNVKGYRQIGYENRALADADFANDAVDGGEIGAGHMVTVLYEIVPAGSDF 473 Query: 460 SIDKLRYAP------------------DNKLAKSDKTKELAWLKIRWKYPQGKESQLVEF 501 + + D + + ELA + IR+K P G +S LV Sbjct: 474 EVPAANHKYGENINQVNTAESTSQDLRDKSDSAENYAGELATVNIRYKDPDGDKSNLVSC 533 Query: 502 PLGPTI--NAPSEDMRFRAAVAAYGQKLRGSEYLNNTSWQQIKQWAQQAKGEDPQGYRAE 559 + S DM +AVAAYG L+ SEY + G Sbjct: 534 VVKTDSYNGGMSADMSAASAVAAYGMLLKNSEYAGAADLDMVLSLVSGKTGSSSDSDDII 593 Query: 560 FIRLIELADGVTDISQ 575 ++ + AD V + Sbjct: 594 DMQWQDFADMVRQTQK 609 >UniRef50_Q1CVN5 von Willebrand factor type A domain protein n=2 Tax=Cystobacterineae RepID=Q1CVN5_MYXXD Length = 700 Score = 438 bits (1127), Expect = e-121, Method: Composition-based stats. Identities = 184/550 (33%), Positives = 272/550 (49%), Gaps = 41/550 (7%) Query: 30 QQQPSTPTEQQVLAAQQAAIKEAEQSAAAAKALAQQEVQQYSDKQALQGRLQEAPTFARA 89 + + A + A+ +Q + K R Sbjct: 158 GGRTGATRVYEPPNAMSRPHGVSLNGPPASTLPSQPLGRPGPPKPQSAPRFV-GRDVEPP 216 Query: 90 AKAKATHIANPGTARYQQFDDNPVKQVAQNPLATFSLDVDTGSYANVRRFLNQGLLPPPD 149 A A A +P +Q + NP + +TFS+D D+ SY R +L +G LP Sbjct: 217 APAVAPAPVSPFHMYFQGYGVNPTINTEEERFSTFSVDTDSASYTLTRAYLERGSLPNEQ 276 Query: 150 AVRVEEIVNYFPSDWDIKDKQSIPASKPIPFAMRYELAPAPWNEQRTLLKVDILAKDRKS 209 AVRVEE VN F + + PF+++ E P+P + ++ V + A++ Sbjct: 277 AVRVEEFVNTFDYGYAHQ--------GSAPFSVQVEGFPSPVRKGYHVVHVGVKAREVSR 328 Query: 210 EELPASNLVFLIDTSGSMISDERLPLIQSSLKLLVKELREQDNIAIVTYAGDSRIALPSI 269 + S+LVF+ID SGSM + RL L++ +L LLV EL E+D ++IV Y +R+ L Sbjct: 329 PQRKPSHLVFVIDVSGSMNLENRLGLVKRALHLLVNELDERDQVSIVVYGSTARLVLEPT 388 Query: 270 SGSHKAEINAAIDSLDAEGSTNGGAGLELAYQQATKGFIKGGINRILLATDGDFNVGIDD 329 S H I AAIDSL EGSTN AGLE+ Y A ++GGINR++L +DG N G+ D Sbjct: 389 SAVHAHIIRAAIDSLHTEGSTNAQAGLEMGYSLAASHLVEGGINRVILCSDGVANTGLTD 448 Query: 330 PKSIESMVKKQRESGVTLSTFGVGNSNYNEAMMVRIADVGNGNYSYIDTLSEAQKVLNSE 389 SI ++ + G+TLST G G NYN+ +M R++ VG GNY+Y+D + EA ++ + Sbjct: 449 ANSIWERIRARAAKGITLSTVGFGMGNYNDVLMERLSQVGEGNYAYVDRIEEAHRIFVRD 508 Query: 390 MRQMLITVAKDVKAQIEFNPAWVTEYRQIGYEKRQLRVEHFNNDNVDAGDIGAGKHITLL 449 + L VAKDVK Q+EF+P V+ YR +GYE R L E F +D VDAG+IGAG +T L Sbjct: 509 LTGTLQVVAKDVKLQMEFDPKAVSHYRLLGYENRMLTKEQFADDRVDAGEIGAGHAVTAL 568 Query: 450 FELTLNGQKASIDKLRYAPDNKLAKSDKTKELAWLKIRWKYPQGKESQLVEFPLG----- 504 +E+ L AS L+IR+K P+G +S+L+E PL Sbjct: 569 YEVKLTEPSAS--------------------FGTLRIRYKAPEGGDSKLIEKPLPSSVLR 608 Query: 505 PTINAPSEDMRFRAAVAAYGQKLRGSEYLNNTSWQQI----KQWAQQAKGEDPQGYRAEF 560 P + R AA+ +KLRGS ++ ++ + ++ Q K D AE Sbjct: 609 PAYGRAAPPTRLSYVAAAFAEKLRGSYWVRPLTYDALFSFWEEIGQPLKARDDV---AEL 665 Query: 561 IRLIELADGV 570 LI+ A + Sbjct: 666 GALIQKARAL 675 >UniRef50_C8WPE6 von Willebrand factor type A n=1 Tax=Eggerthella lenta DSM 2243 RepID=C8WPE6_EGGLE Length = 555 Score = 437 bits (1123), Expect = e-121, Method: Composition-based stats. Identities = 179/520 (34%), Positives = 256/520 (49%), Gaps = 28/520 (5%) Query: 53 EQSAAAAKALAQQEVQQY------SDKQALQGRLQEAPTFARAAKAKATHIANPGTARYQ 106 ++ A + Q Q S+ A+ L E + + GT Y+ Sbjct: 23 AGASLAGCSPDGQAGDQLGSAASESEIMAIGSALSETASTCPPPYPYVPSPSPGGTEEYR 82 Query: 107 QFDDNPVKQVAQNPLATFSLDVDTGSYANVRRFLNQGLLP---PPDAVRVEEIVNYFPSD 163 D+ A +PL+T S DVDT SY N+RR + Q P P AVR EE++NYF Sbjct: 83 ALDEPGFLSPATSPLSTLSADVDTASYCNLRRMVAQRYAPAVVPAGAVRTEELLNYFDYA 142 Query: 164 WDIKDKQSIPASKPIPFAMRYELAPAPWNEQRTLLKVDILAKDRKSEELPASNLVFLIDT 223 + + F + +++ PWN+Q LL + + +NLVFLID Sbjct: 143 YPEPVGSDL-------FGVSAQMSDCPWNDQTKLLVMGFATEKDGDASPTGANLVFLIDV 195 Query: 224 SGSMISDERLPLIQSSLKLLVKELREQDNIAIVTYAGDSRIALPSISGSHKAEINAAIDS 283 SGSM ++LPL++ S LV+ L E+D +++VTYA R+ L + G K I A+DS Sbjct: 196 SGSMDDPDKLPLVKDSFAALVEGLTERDRVSVVTYASGERVLLEGVPGDDKRRIMRAVDS 255 Query: 284 LDAEGSTNGGAGLELAYQQATKGFIKGGINRILLATDGDFNVGIDDPKSIESMVKKQRES 343 L AEGSTNG AGLE AY+ A FI+GG+NR+++A+DGD NVGI + V+++RE+ Sbjct: 256 LVAEGSTNGEAGLEQAYRLAESSFIEGGVNRVVMASDGDLNVGISSESELHDFVEQKRET 315 Query: 344 GVTLSTFGVGNSNYNEAMMVRIADVGNGNYSYIDTLSEAQKVLNSEMRQMLITVAKDVKA 403 GV LS G G+ NY + M +AD GNG Y YID EA++VL +R L+ +A DVK Sbjct: 316 GVYLSVLGFGSGNYKDNKMETLADHGNGAYHYIDCAEEARRVLGRNLRANLVPLADDVKI 375 Query: 404 QIEFNPAWVTEYRQIGYEKRQLRVEHFNNDNVDAGDIGAGKHITLLFELTLNGQKASIDK 463 Q+EFNP V YR IGYE R L E F + DAG++GAG T+ +E+ G + Sbjct: 376 QVEFNPDRVKGYRLIGYENRALADEEFRD---DAGEVGAGHAFTVAYEIVPAGSAFEVGA 432 Query: 464 LRY-----APDNKLAKSDKTKELAWLKIRWKYPQGKESQLVEFPL----GPTINAPSEDM 514 A D + + + WL +Y + VE L + P+ D Sbjct: 433 SASKYGSDADDRQDGRRSEANGGEWLTCTMRYRPAGTVEAVEQALVVDDESCTDDPNGDW 492 Query: 515 RFRAAVAAYGQKLRGSEYLNNTSWQQIKQWAQQAKGEDPQ 554 F AAV G L S + + + + + D Q Sbjct: 493 TFAAAVIECGMALHRSPHAGAATLESARDLLASCELTDQQ 532 >UniRef50_Q1D6F9 von Willebrand factor type A domain protein n=2 Tax=Cystobacterineae RepID=Q1D6F9_MYXXD Length = 592 Score = 435 bits (1119), Expect = e-120, Method: Composition-based stats. Identities = 205/601 (34%), Positives = 305/601 (50%), Gaps = 65/601 (10%) Query: 10 LMSSLILSGC---GPQPENKESQQQQPSTPTEQQVLAAQQAAIKEAEQSAA--------- 57 L+ + L C P + + S S + A + A++++A Sbjct: 17 LLVASTLPACHNRSPAADERPSLGAAQSVARDDDAAHAPEREEYVADRASAEHSVAAPAP 76 Query: 58 -----------AAKALAQQEVQQYSDKQALQGRLQEAPT-------FARAAKAKATHIAN 99 A+A A Q ++ S +A R + P ++K A Sbjct: 77 AAPPASALAGPVARAPAPQAAKKVSLGKAELHRREPRPMKPSADALAGAPLESKPQDAAP 136 Query: 100 PGTARYQQFDDNPVKQVAQNPLATFSLDVDTGSYANVRRFLNQGLLPPPDAVRVEEIVNY 159 G ++ + N + A++PL+TF+ DVDT SY RR+L G LPP AVRVEE VNY Sbjct: 137 AGGNTFEAWKANAFVETAKDPLSTFAADVDTASYTVSRRYLVNGQLPPASAVRVEEFVNY 196 Query: 160 FPSDWDIKDKQSIPASKPIPFAMRYELAPAPWNEQRTLLKVDILAKDRKSEELPASNLVF 219 F + + + FA+ E AP+P++ +R L+V + K + ++LVF Sbjct: 197 FKFRYAPPETGA--------FAVHLEGAPSPFDAKRHFLRVGVQGKVVSRSQRKPAHLVF 248 Query: 220 LIDTSGSMISDERLPLIQSSLKLLVKELREQDNIAIVTYAGDSRIALPSISGSHKAEINA 279 L+DTSGSM S+++LPL + ++K+ VK L E D +AIVTYAG++R LP + I+A Sbjct: 249 LVDTSGSMHSEDKLPLAREAIKVAVKNLNENDTVAIVTYAGNTRDVLPPTPATDAKSIHA 308 Query: 280 AIDSLDAEGSTNGGAGLELAYQQATKGFIKGGINRILLATDGDFNVGID-DPKSIESMVK 338 A+DSL A G T G+G+ELAY+ A K ++R+++ TDGD N+G + ++ + Sbjct: 309 ALDSLTAGGGTAMGSGMELAYRHAVKKASGSVVSRVVVLTDGDANIGRNVSANAMLDSIH 368 Query: 339 KQRESGVTLSTFGVGNSNYNEAMMVRIADVGNGNYSYIDTLSEAQKVLNSEMRQMLITVA 398 K GVTL+T G G NY + +M ++AD GNGN Y+D+L EA+KV +++ L +A Sbjct: 369 KYTAEGVTLTTVGFGMGNYRDDLMEKLADKGNGNCFYVDSLREAKKVFETQLTGTLEVIA 428 Query: 399 KDVKAQIEFNPAWVTEYRQIGYEKRQLRVEHFNNDNVDAGDIGAGKHITLLFELTLNGQK 458 KDVK Q+EFNPA V YR +GYE R + F ND VDAG+IGAG ++T ++E+ L G+ Sbjct: 429 KDVKFQVEFNPAAVRRYRLVGYENRDVADHDFRNDKVDAGEIGAGHNVTAVYEVELTGE- 487 Query: 459 ASIDKLRYAPDNKLAKSDKTKELAWLKIRWKYPQGKESQLVEF-----PLGPTINAPSED 513 T+ LA +++R K P G E+ EF L T+ S D Sbjct: 488 ------------------ATEALATVRVRAKAPNGTEASEREFRFERTKLRDTLAQASPD 529 Query: 514 MRFRAAVAAYGQKLRGSEYLNNTSWQQIKQWAQQAKGEDPQGYRAEFIRLIELADGVTDI 573 RF AVAA LR S S ++ A+ A D R EF+RL+ A + Sbjct: 530 FRFAVAVAATADVLRDSPSAEGWSLATAEKLAEGATEGDAD--RKEFVRLVTQARALKGA 587 Query: 574 S 574 S Sbjct: 588 S 588 >UniRef50_C7N770 Uncharacterized protein containing a von Willebrand factor type A (VWA) domain n=1 Tax=Slackia heliotrinireducens DSM 20476 RepID=C7N770_SLAHD Length = 629 Score = 433 bits (1113), Expect = e-119, Method: Composition-based stats. Identities = 188/531 (35%), Positives = 269/531 (50%), Gaps = 22/531 (4%) Query: 46 QAAIKEAEQSAAAAKALAQQEVQQYSDKQALQGRLQEAPTFARAAKAKATHIANPGTARY 105 A + AE AA+ +A Y+ L EA A + + T Y Sbjct: 111 NAEMPVAETKAASEDTMAG-SANSYAPDGGLAYETDEAYETFDTLDEGAP-MEDFNTEEY 168 Query: 106 QQFDDNPVKQVAQNPLATFSLDVDTGSYANVRRFLNQGLL---PPPDAVRVEEIVNYFPS 162 ++N PL+T S DVDT SY N+RR +N G P AVR+EE++NYF Sbjct: 169 AAIEENGFVSTVTRPLSTCSADVDTASYCNLRRMINDGYSLDEIPDGAVRIEEMLNYFHY 228 Query: 163 DWDIKDKQSIPASKPIPFAMRYELAPAPWNEQRTLLKVDILAKDRKSEELPASNLVFLID 222 D + + FA+R E A PWN+Q LL + A D+ SNLVFLID Sbjct: 229 DSGEPEGNDL-------FAVRAESARCPWNDQTQLLVMTFTASDKAQTASKGSNLVFLID 281 Query: 223 TSGSMISDERLPLIQSSLKLLVKELREQDNIAIVTYAGDSRIALPSISGSHKAEINAAID 282 SGSM ++L L++ S L++ L D ++IVTYA + L SG +I A++ Sbjct: 282 ISGSMDEPDKLDLLKDSFGTLLENLGPNDRVSIVTYAAGEDVLLEGASGDDTRKIMRALN 341 Query: 283 SLDAEGSTNGGAGLELAYQQATKGFIKGGINRILLATDGDFNVGIDDPKSIESMVKKQRE 342 L+A+GSTNG AGLE+AY+ A + +I+GG+NRI++A+DGD NVGI + V+++RE Sbjct: 342 RLEADGSTNGEAGLEMAYEVAERNYIEGGVNRIVMASDGDLNVGITSESDLYDFVEEKRE 401 Query: 343 SGVTLSTFGVGNSNYNEAMMVRIADVGNGNYSYIDTLSEAQKVLNSEMRQMLITVAKDVK 402 +GV LS G G+ NY + M +AD GNG Y YID + EA++VL ++ + VA DVK Sbjct: 402 TGVYLSVLGFGSGNYKDTKMETLADHGNGTYHYIDCVEEAERVLGEDLTANFVPVADDVK 461 Query: 403 AQIEFNPAWVTEYRQIGYEKRQLRVEHFNNDNVDAGDIGAGKHITLLFELTLNGQK---A 459 Q+EFNPA V YR IGYE R + E F N+ DA ++GAG T+ +EL L A Sbjct: 462 LQVEFNPAQVKAYRLIGYENRAMADEDFLNEAADAAEVGAGAQFTVAYELVLADSDYDVA 521 Query: 460 SIDKLRYAPDNKLAKSDKTKELAWLKIRWKYPQGKE---SQLVEFPLGPTINAPSEDMRF 516 + L+Y + A T E +R+K + SQ + +PS+D F Sbjct: 522 DVPDLKYGSG-EAAGDSSTDEWLTCSMRYKAVDDDKAVRSQDLVVGADSQTESPSDDWVF 580 Query: 517 RAAVAAYGQKLRGSEYLNNTSWQQIKQWAQQAKGEDPQGYRAEFIRLIELA 567 ++V +G SE+ + + D R F L+E+A Sbjct: 581 ASSVIEFGMIASDSEFAEGLDTGDVLDQLDTIRLNDE---RQGFYDLVEIA 628 >UniRef50_C7PR69 von Willebrand factor type A n=1 Tax=Chitinophaga pinensis DSM 2588 RepID=C7PR69_CHIPD Length = 588 Score = 418 bits (1075), Expect = e-115, Method: Composition-based stats. Identities = 176/536 (32%), Positives = 260/536 (48%), Gaps = 38/536 (7%) Query: 31 QQPSTPTEQQVLAAQQAAIKEAEQSAAAAKALAQQEVQQYSDKQALQGRLQEAPTFARAA 90 +Q S ++L A AE + AKA+A++ S+ + F Sbjct: 80 KQLSAGAYSRMLVQMNAQEDPAEDAVKKAKAIARERSSNGSNPNYGNALMGTRAFFD--- 136 Query: 91 KAKATHIANPGTARYQQFDDNPVKQVAQNPLATFSLDVDTGSYANVRRFLNQGLLPPPDA 150 Y +N + F++DVD +Y+N+RRF+ P +A Sbjct: 137 ------------ETYGTLYENKFIAAETQIPSLFAVDVDRAAYSNIRRFVKLKERIPANA 184 Query: 151 VRVEEIVNYFPSDWDIKDKQSIPASKPIPFAMRYELAPAPWNEQRTLLKVDILAKDRKSE 210 VR+EE+VNYF + + A+ A PW E LL++ + K + Sbjct: 185 VRIEEMVNYFHYSYPLP-------PVGQTLAIYSNYATCPWAEDHRLLQIAVRGKSVNLD 237 Query: 211 ELPASNLVFLIDTSGSMISDERLPLIQSSLKLLVKELREQDNIAIVTYAGDSRIALPSIS 270 LP SNLVFLID SGSM +LPL+Q++ ++LV LR D++AIV YAG + LPS Sbjct: 238 SLPPSNLVFLIDVSGSMAMPNKLPLLQAAFRILVNNLRSNDHVAIVAYAGVPGVILPSTP 297 Query: 271 GSHKAEINAAIDSLDAEGSTNGGAGLELAYQQATKGFIKGGINRILLATDGDFNVGIDDP 330 GS K++I AID L A G+T G A ++LAYQ A + FIK G NR++LATDGDFNVG Sbjct: 298 GSAKSKILNAIDYLSAGGATAGEAAIKLAYQIAEENFIKEGNNRVILATDGDFNVGQTSD 357 Query: 331 KSIESMVKKQRESGVTLSTFGVGNSNYNEAMMVRIADVGNGNYSYIDTLSEAQKVLNSEM 390 +E ++ ++E+GV L+ G G NY ++ + ++ GNGN++YID L EA K+ E Sbjct: 358 HDMEQLILGKKETGVLLTCLGFGMKNYKDSKLETLSSKGNGNFAYIDNLEEASKIFAREF 417 Query: 391 RQMLITVAKDVKAQIEFNPAWVTEYRQIGYEKRQLRVEHFNNDNVDAGDIGAGKHITLLF 450 L TVA+DV+A + FNP V YR IGYE + ++ + + Sbjct: 418 GSTLFTVARDVQADVVFNPRTVKSYRLIGYENKVIKDDDSASQIGGG------------- 464 Query: 451 ELTLNGQKASIDKLRYAPDNKLAKSDKTKELAWLKIRWKYPQGKESQLVEFP-LGPTINA 509 + ++ P L +D L R + + P + T Sbjct: 465 --IIGAGHCAVAIYEIVPQKGLMPADSMLAAVHLAYRETTDTTIKRLFYKVPDIFTTFQQ 522 Query: 510 PSEDMRFRAAVAAYGQKLRGSEYLNNTSWQQIKQWAQQAKGEDPQGYRAEFIRLIE 565 S+D RF +AVA G LR S Y + S + A+++ G+DP GYR EFI L++ Sbjct: 523 SSDDFRFASAVALMGMLLRKSGYKGSGSCDMVMDIARRSLGDDPGGYRREFITLLK 578 >UniRef50_B4DA43 von Willebrand factor type A n=1 Tax=Chthoniobacter flavus Ellin428 RepID=B4DA43_9BACT Length = 883 Score = 418 bits (1074), Expect = e-115, Method: Composition-based stats. Identities = 180/497 (36%), Positives = 268/497 (53%), Gaps = 18/497 (3%) Query: 23 PENKESQQQQPSTPTEQQVLAAQQAAIKEAEQSAAAAKALAQQEVQQYSDKQALQGRLQE 82 P +P + + EA+ S A A + + ++ G Sbjct: 241 PVGVPLDSAEPQGGAALSKAVTPKDKLAEADASKAMPVAAWARVRRGFAATSGGIGDNSY 300 Query: 83 APTFARAAKAKATHIANPGTARYQQFDDNPVKQVAQNPLATFSLDVDTGSYANVRRFLNQ 142 KA+ + +N V +NPL+TFS+DVDT SYA VRR+LN Sbjct: 301 GLDDRGGIADKAS-----NANSFDTLTENAFLNVPENPLSTFSIDVDTASYAIVRRYLND 355 Query: 143 GLLPPPDAVRVEEIVNYFPSDWDIKDKQSIPASKPIPFAMRYELAPAPWNEQRTLLKVDI 202 LPP AVR+EE++NYFP D+ + PF+ E+A PW + L++V + Sbjct: 356 NHLPPTGAVRIEELLNYFPYDYPQPQGAA-------PFSATMEVATCPWAPEHRLVRVGL 408 Query: 203 LAKDRKSEELPASNLVFLIDTSGSMISDERLPLIQSSLKLLVKELREQDNIAIVTYAGDS 262 ++ +E P SNLVFLID SGSM +LPL+Q LLV++L +D ++IVTYA + Sbjct: 409 KGREIPKDERPPSNLVFLIDVSGSMNMPNKLPLLQKCFSLLVEQLGPKDRVSIVTYASGT 468 Query: 263 RIALPSISGSHKAEINAAIDSLDAEGSTNGGAGLELAYQQATKGFIKGGINRILLATDGD 322 ++ L K + AID L A G T+G +G++LAY+ A + FI GG NR++LATDGD Sbjct: 469 KLVLEPT--QDKEAMQTAIDGLHAGGGTHGSSGIDLAYRMAQQSFIPGGTNRVILATDGD 526 Query: 323 FNVGIDDPKSIESMVKKQRESGVTLSTFGVGNSNYNEAMMVRIADVGNGNYSYIDTLSEA 382 +N+GI + + SM+ ++ +SGV L+ G G N ++M+V++AD GNG+Y+YIDT EA Sbjct: 527 WNIGITNQSELLSMITRKAKSGVFLTVLGFGLDNLKDSMLVKLADHGNGHYAYIDTEQEA 586 Query: 383 QKVLNSEMRQMLITVAKDVKAQIEFNPAWVTEYRQIGYEKRQLRVEHFNNDNVDAGDIGA 442 +KV ++ L+T+AKDVK Q+EFNP V+ YR +GYEKR L E FNND DAG+IGA Sbjct: 587 RKVFVDQLSSTLVTIAKDVKIQVEFNPVQVSSYRLVGYEKRLLAKEDFNNDKKDAGEIGA 646 Query: 443 GKHITLLFELTLNGQK----ASIDKLRYAPDNKLAKSDKTKELAWLKIRWKYPQGKESQL 498 G +T L+E+ G++ A +D+L+Y + +K +I+ + + Sbjct: 647 GHTVTALYEVVPVGKERPEIAKVDELKYQRIPRAVPVEKPTPQRETEIQEREKLPSAAPA 706 Query: 499 VEFPLGPTINAPSEDMR 515 P+ D R Sbjct: 707 AAPVPAAKAETPAADGR 723 Score = 115 bits (287), Expect = 6e-24, Method: Composition-based stats. Identities = 46/134 (34%), Positives = 69/134 (51%), Gaps = 6/134 (4%) Query: 442 AGKHITLLFELTLNGQKASIDKLRYAPDNKLAKSDKTKELAWLKIRWKYPQGKESQLVEF 501 H T++ + L + + KE+ LK+R+K P G++S+L+EF Sbjct: 753 HAPHKTIIVDADRGNNTTQASAL---AEPSPLPASVRKEMLTLKLRYKEPDGEKSKLLEF 809 Query: 502 PL---GPTINAPSEDMRFRAAVAAYGQKLRGSEYLNNTSWQQIKQWAQQAKGEDPQGYRA 558 PL G T S D F AAVA+YG LR S+YL +WQ + +WA++ G D GYR Sbjct: 810 PLTDPGTTWEKSSPDFHFAAAVASYGMLLRDSKYLGEATWQSVVEWAREGLGADKHGYRT 869 Query: 559 EFIRLIELADGVTD 572 EF+ L++ A + Sbjct: 870 EFLSLLDRARAMKQ 883 >UniRef50_A9NFD9 Surface-anchored VWFA domain protein n=1 Tax=Acholeplasma laidlawii PG-8A RepID=A9NFD9_ACHLI Length = 486 Score = 413 bits (1061), Expect = e-113, Method: Composition-based stats. Identities = 154/476 (32%), Positives = 248/476 (52%), Gaps = 31/476 (6%) Query: 101 GTARYQQFDDNPVKQVAQNPLATFSLDVDTGSYANVRRFLNQGLLPPPDAVRVEEIVNYF 160 +Q+ +NP V+ N + SL +T SY+ +R +N G +AVR+EE+VN+F Sbjct: 39 NDDEHQEIIENPFIDVSVNNKSNISLSANTASYSFIRSQINSGRAVDRNAVRIEEMVNFF 98 Query: 161 PSDWDIKDKQSIPASKPIPFAMRYELAPAPWNEQRTLLKVDILAKDRKSEELPASNLVFL 220 +++ + F + EL PWN + LL + + K ++P+ N+V L Sbjct: 99 NYNYNQPE-------TDKTFGFKSELIQTPWNNETHLLLIGLETKQVDLGDIPS-NIVIL 150 Query: 221 IDTSGSMISDERLPLIQSSLKLLVKELREQDNIAIVTYAGDSRIALPSISGSHKAEINAA 280 +D SGSM + +L L + +++LL+++++ D I++VTY+ ++ S A + + Sbjct: 151 LDVSGSMSATNKLSLAKKAMELLIEQMKPNDVISLVTYSSGEKVVFKGKSIDDMAYMTSQ 210 Query: 281 IDSLDAEGSTNGGAGLELAYQQATKGFIKGGINRILLATDGDFNVGIDDPKSIESMVKKQ 340 I L A GST G GL++AY+ A + FI+GG NRI+LATDGDFNVGI + + ++ Sbjct: 211 IRLLKASGSTAGKKGLDMAYKVAEEYFIEGGNNRIILATDGDFNVGISSTDMLIEYISEK 270 Query: 341 RESGVTLSTFGVGNSNYNEAMMVRIADVGNGNYSYIDTLSEAQKVLNSEMRQMLITVAKD 400 RESG+ S +G G N+ + + R+A GNG Y YID + A+K + +L TVA+D Sbjct: 271 RESGIYFSAYGFGYGNFKDEKLERVAKAGNGTYHYIDDIISARKAFVDNIDGVLYTVARD 330 Query: 401 VKAQIEFNPAWVTEYRQIGYEKRQLRVEHFNNDNVDAGDIGAGKHITLLFELTLNGQKAS 460 KAQI F+ + V EYR IGYE RQL + F++ DAG+IG G +T ++EL LN Sbjct: 331 AKAQIVFDASAVLEYRLIGYENRQLTDDEFDDGTTDAGEIGTGLQVTAIYELKLN----- 385 Query: 461 IDKLRYAPDNKLAKSDKTKELAWLKIRWK-YPQGKESQLVEF--PLGPTINAPSEDMRFR 517 + ++ L IR+K + ++QL E L PS D +F Sbjct: 386 ---------------EGASDVGSLTIRYKNHDITDDTQLEEAFTVLNAINENPSVDAKFI 430 Query: 518 AAVAAYGQKLRGSEYLNNTSWQQIKQWAQQAKGEDPQGYRAEFIRLIELADGVTDI 573 ++V +G L S+Y + + + + YR +FI ++ +T Sbjct: 431 SSVVEFGLILMDSKYKVDADLGAVLERIETETYNLEDYYRNDFIDVLNTYKDMTTS 486 >UniRef50_A7C0I1 von Willebrand factor type A domain protein n=5 Tax=Bacteria RepID=A7C0I1_9GAMM Length = 367 Score = 377 bits (967), Expect = e-103, Method: Composition-based stats. Identities = 183/366 (50%), Positives = 253/366 (69%), Gaps = 7/366 (1%) Query: 212 LPASNLVFLIDTSGSMISDERLPLIQSSLKLLVKELREQDNIAIVTYAGDSRIALPSISG 271 +P +NLVFL+D SGSM S+ +L L++S+LKLL +L E+D +++V YAG + + L G Sbjct: 1 MPPANLVFLVDVSGSMRSNHKLALLKSALKLLSNQLTEKDKVSLVVYAGAAGVVLEPTPG 60 Query: 272 SHKAEINAAIDSLDAEGSTNGGAGLELAYQQATKGFIKGGINRILLATDGDFNVGIDDPK 331 +IN A++ L A GST+G AG+ LAY A + FIK GINRILLATDGDFNVG D + Sbjct: 61 HQSVKINGALERLTAGGSTHGSAGIHLAYNLAEQAFIKNGINRILLATDGDFNVGTVDFE 120 Query: 332 SIESMVKKQRESGVTLSTFGVGNSNYNEAMMVRIADVGNGNYSYIDTLSEAQKVLNSEMR 391 +++++V+++R+SG++L+T G G NYN+ +M ++AD GNGNY+YIDTL+EAQKVL EM Sbjct: 121 ALKNLVEEKRKSGISLTTLGFGRGNYNDQLMEQLADAGNGNYAYIDTLNEAQKVLVDEMS 180 Query: 392 QMLITVAKDVKAQIEFNPAWVTEYRQIGYEKRQLRVEHFNNDNVDAGDIGAGKHITLLFE 451 L T+AKDVK QIEFNPA V EYR IGYE R L+ E F+ND VDAG+IGAG +T L+E Sbjct: 181 STLNTIAKDVKIQIEFNPAIVAEYRLIGYENRLLKREDFSNDKVDAGEIGAGHTVTALYE 240 Query: 452 LTLNGQKAS-IDKLRYAPDNKLAKSDK--TKELAWLKIRWKYPQGKESQLVEFPLGPT-- 506 + L G ++ LRY+ + + KS+ ELA+L++R+K P SQL+E+P+ Sbjct: 241 MALVGSGGQRLESLRYSQNQDVPKSNDNQNNELAFLRLRYKAPNSDTSQLLEWPMMRQDI 300 Query: 507 --INAPSEDMRFRAAVAAYGQKLRGSEYLNNTSWQQIKQWAQQAKGEDPQGYRAEFIRLI 564 +E RF AAVAA+GQ+LRG +YL S+ I A+ A+G DP GYR E I+L+ Sbjct: 301 LETVDTNERFRFAAAVAAFGQQLRGGKYLEQFSYDNILNLARDARGNDPFGYRGELIKLV 360 Query: 565 ELADGV 570 LA + Sbjct: 361 NLAKSL 366 >UniRef50_B4D1N7 Autotransporter-associated beta strand repeat protein n=1 Tax=Chthoniobacter flavus Ellin428 RepID=B4D1N7_9BACT Length = 1545 Score = 353 bits (906), Expect = 9e-96, Method: Composition-based stats. Identities = 150/562 (26%), Positives = 257/562 (45%), Gaps = 47/562 (8%) Query: 21 PQPENKESQQQQPSTPTEQQVLAA---QQAAIKEAEQSAAAAKALAQQEVQQYSDKQALQ 77 P+ + P+TP ++ + SA K+ + ++ D+ L Sbjct: 1018 PKDLIPSAPDASPNTPIDRDTVKNWLIANGLTFNGNASALYIKSTNRLVIRNTQDQLDLV 1077 Query: 78 GRLQEAPTFARAAKAKATHIANPGTARYQQFDDNPVKQVAQNPLATFSLDVDTGSYANVR 137 R+ +A R KAK T P P Q + N +TFSL+V S+ Sbjct: 1078 DRIVKADAKEREDKAKETVPTAP--------IPQPEVQTSANAFSTFSLNVSDVSFKLAA 1129 Query: 138 RFLNQGLLPPPDAVRVEEIVNYFPSDWDIKDKQSIPASKPIPFAMRYELAPAPWNEQRTL 197 L QG +P P +VR EE +N F +D + P + P A E A P+ + R L Sbjct: 1130 ASLEQGHMPDPASVRSEEFINAFDY----RDPEPSPGA---PLAFVTERARYPFAQNRDL 1182 Query: 198 LKVDILAKDRKSEELPASNLVFLIDTSGSMISDERLPLIQSSLKLLVKELREQDNIAIVT 257 L+ + + N+V L+D SGSM +R+ +++ +L +L K L+ QD ++IVT Sbjct: 1183 LRFAVKTAAAGRQPGRPLNIVLLLDRSGSMERADRVNIVREALSVLAKHLQPQDKLSIVT 1242 Query: 258 YAGDSRIALPSISGSHKAEINAAIDSLDAEGSTNGGAGLELAYQQATKGFIKGGINRILL 317 +A + +++G ++ A ++ + EG TN A L+LAY+ A F NR++L Sbjct: 1243 FARTPHLWADAVAGDKVHDVIARVNEITPEGGTNLEAALDLAYETAHHHFAVDSTNRVIL 1302 Query: 318 ATDGDFNVGIDDPKSIESMVKKQRESGVTLSTFGVGNSNYNEAMMVRIADVGNGNYSYID 377 TDG N+G +P ++ V+ QR+ G+ L FG+G YN+ ++ ++ +G Y +I+ Sbjct: 1303 FTDGAANLGDVNPDALTKKVEAQRKQGIALDCFGIGWEGYNDDLLEQLTRNADGRYGFIN 1362 Query: 378 TLSEAQKVLNSEMRQMLITVAKDVKAQIEFNPAWVTEYRQIGYEKRQLRVEHFNNDNVDA 437 T +A +++ L A DVK Q+EFNP V YRQIGY QL E F +++V+A Sbjct: 1363 TPEDAAANFATQIAGALQVAASDVKVQVEFNPHRVKTYRQIGYATHQLTKEQFRDNSVNA 1422 Query: 438 GDIGAGKHITLLFELTLNGQKASIDKLRYAPDNKLAKSDKTKELAWLKIRWKYPQGKESQ 497 IGA + L+ + ++ +LA + +R++ P + + Sbjct: 1423 AQIGAAESGNALYVVEVDPHGEG-------------------DLATVHVRFRVPGTSDYR 1463 Query: 498 LVEFPLG-----PTINAPSEDMRFRAAVAAYGQKLRGSEYLNNTSWQQIKQWAQQ---AK 549 E+P+ P + S +R A +A+ + L S Y + ++ Sbjct: 1464 EHEWPVPFAGEVPPLEQASSALRLAGAASAFSEMLAASPYATEVTSDRLLNILNGVPPIY 1523 Query: 550 GEDPQGYRAEFIRLIELADGVT 571 G DP+ + E+ +I A ++ Sbjct: 1524 GADPRPTKLEW--MIRQARSLS 1543 >UniRef50_A3TQT5 Putative secreted protein n=1 Tax=Janibacter sp. HTCC2649 RepID=A3TQT5_9MICO Length = 533 Score = 350 bits (898), Expect = 8e-95, Method: Composition-based stats. Identities = 150/443 (33%), Positives = 224/443 (50%), Gaps = 25/443 (5%) Query: 62 LAQQEVQQYSDKQALQGRLQEAPTFARAAKAKATHIANP-GTARYQQFDDNPVKQVAQNP 120 QE S + +A + A ++ P + + + P Sbjct: 38 PTAQENDSASGTSSSSAVGGQASSGVAQPFPAAPNVPGPLEDNTFVDAGTSGFIDTRERP 97 Query: 121 LATFSLDVDTGSYANVRRFLNQGLLPPPDAVRVEEIVNYFPSDWDIKDKQSIPASKPIPF 180 +TF++DVD GS+ R L+ G LPPP++VR EE VN F S + K + Sbjct: 98 RSTFAVDVDGGSFRVARSLLHDGHLPPPESVRPEEWVNSFDSGFPAPRKDDLELQSD--- 154 Query: 181 AMRYELAPAPWNEQRTLLKVDILAKDRKSEELPASNLVFLIDTSGSMISDERLPLIQSSL 240 + + ++ L+++ + ++ E L ++DTSGSM ERL L++SSL Sbjct: 155 ----QARASSEDDGTRLVRIGLQGREVDVREWQPVALTMVVDTSGSMDIRERLGLVKSSL 210 Query: 241 KLLVKELREQDNIAIVTYAGDSRIALPSISGSHKAEINAAIDSLDAEGSTNGGAGLELAY 300 LL + LR D IAIVTY D+ L I AAID L+A GSTN AGL L Y Sbjct: 211 ALLAENLRPDDTIAIVTYQTDATPLLEPTPVRDTDTILAAIDRLEAGGSTNLEAGLLLGY 270 Query: 301 QQATKGFIKGGINRILLATDGDFNVGIDDPKSIESMVKKQRESGVTLSTFGVGNSNYNEA 360 QA + + +G N +LLA+DG NVG+ D + + ++ G+ L T G G NY++ Sbjct: 271 DQAREAYKQGATNVVLLASDGVANVGVTDGGRLATAIRDNGRRGIHLVTVGYGMGNYSDH 330 Query: 361 MMVRIADVGNGNYSYIDTLSEAQKVLNSEMRQMLITVAKDVKAQIEFNPAWVTEYRQIGY 420 +M ++AD G+G Y YIDT EA+K+ ++R L VAKD K Q+EF+P V+ YR IGY Sbjct: 331 LMEQLADQGDGFYEYIDTFEEARKLFVEDLRATLTPVAKDAKIQVEFDPRTVSAYRLIGY 390 Query: 421 EKRQLRVEHFNNDNVDAGDIGAGKHITLLFELTLNGQKASIDKLRYAPDNKLAKSDKTKE 480 E R L + F+ND VDAG++GAG +T L+E+ A++D+ Sbjct: 391 ENRALSDDDFDNDAVDAGEVGAGHKVTALYEVRPT-----------------AQADEGDA 433 Query: 481 LAWLKIRWKYPQGKESQLVEFPL 503 L +++RW+ G+E + PL Sbjct: 434 LGTVRVRWRSVDGEEQREDSLPL 456 >UniRef50_D2BAS2 von Willebrand factor type A domain protein n=12 Tax=Actinomycetales RepID=D2BAS2_STRRD Length = 490 Score = 343 bits (881), Expect = 7e-93, Method: Composition-based stats. Identities = 140/503 (27%), Positives = 227/503 (45%), Gaps = 37/503 (7%) Query: 75 ALQGRLQEAPTFARAAKAKATHIANPGTARYQQFDDNPVKQVAQNPLATFSLDVDTGSYA 134 A G + + + +P Q+ + A + ++TF+LDVDT SY Sbjct: 19 AACGGSGSSRPASEPRNNPGNAVPSPAQD--QESGAARERDAAADQISTFALDVDTASYG 76 Query: 135 NVRRFLNQGLLPPPDAVRVEEIVNYFPSDWDIKDKQSIPASKPIPFAMRYELAPAPWNEQ 194 +R L +G LP P +R EE VN F D+ F + + A P N Sbjct: 77 YAKRILQEGRLPEPGQIRPEEFVNSFRQDYKEP--------GDDGFTVHMDGARMPEN-G 127 Query: 195 RTLLKVDILAKDRKSEELPASNLVFLIDTSGSMISDERLPLIQSSLKLLVKELREQDNIA 254 L++V + + + E +NL F++D SGSM RL L++ +L LV +L D ++ Sbjct: 128 TALIRVGLQTRKAEPEARRPANLTFVVDVSGSMGEPGRLDLVREALHKLVDQLGPGDQVS 187 Query: 255 IVTYAGDSRIALPSISGSHKAEINAAIDSLDAEGSTNGGAGLELAYQQATKGFIKGGINR 314 IV ++ +R+ L + + +++AAID L E STN GL Y +A + F NR Sbjct: 188 IVAFSTQARLVLSMTPATGRDQLHAAIDRLGVEDSTNLETGLTAGYAEAARAFRPAATNR 247 Query: 315 ILLATDGDFNVGIDDPKSIESMVKKQRESGVTLSTFGVGNSNYNEAMMVRIADVGNGNYS 374 ++L +DG N G + I V + +TL GVG +Y + +M ++AD G+G Sbjct: 248 VILLSDGLANTGDTTWQGILDRVAESAGRQITLLCVGVGR-DYGDQLMEQLADNGDGAAV 306 Query: 375 YIDTLSEAQKVLNSEMRQMLITVAKDVKAQIEFNPAWVTEYRQIGYEKRQLRVEHFNNDN 434 Y+ + +A+KV ++ L A+D KAQ+ FNP+ V YR IGYE RQ+ E F +D Sbjct: 307 YVSSADDARKVFVEQLATNLDLRARDAKAQVVFNPSAVESYRLIGYENRQIAAEDFRDDT 366 Query: 435 VDAGDIGAGKHITLLFELTLNGQKASIDKLRYAPDNKLAKSDKTKELAWLKIRWKYPQGK 494 D G+IG G +T L+ + L +S + +LA +RW+ P + Sbjct: 367 KDGGEIGPGHSVTALYGVRL-------------------RSGASGQLATATVRWQDPDTR 407 Query: 495 ----ESQLVEFP--LGPTINAPSEDMRFRAAVAAYGQKLRGSEYLNNTSWQQIKQWAQQA 548 S+ +E S + AA+ + LR E + +++ A + Sbjct: 408 GPGETSRSLESADLSASVWRESSPRFQVDVVAAAFAEYLRTREEIAGVGARELAGHATRL 467 Query: 549 KGEDPQGYRAEFIRLIELADGVT 571 E LI+ A ++ Sbjct: 468 AASTEDAAVTELAGLIDRAVSLS 490 >UniRef50_A6DLI7 von Willebrand factor type A domain protein n=1 Tax=Lentisphaera araneosa HTCC2155 RepID=A6DLI7_9BACT Length = 1078 Score = 332 bits (852), Expect = 2e-89, Method: Composition-based stats. Identities = 138/557 (24%), Positives = 260/557 (46%), Gaps = 38/557 (6%) Query: 20 GPQPENKESQQQQPSTPTEQQ-VLAAQQAAIKEAEQSAAAAKALAQQEVQQYSDKQALQG 78 G + ESQ Q +E V + + IK + + + +Q ++ +K+ Sbjct: 553 GDYRKYLESQGFQFDAGSEVDFVESVNRLVIKNTPEQNSKIASHLEQVNERAQEKKKESQ 612 Query: 79 RLQEAPTFARAAKAKATHIANPGTARYQQFDDNPVKQVAQNPLATFSLDVDTGSYANVRR 138 + +E ++ ++ +A P ++ D + L+TF++DVDT SY R Sbjct: 613 KARERLKQLKSQQSNLPSVALPSPLFFEGMID-----AKETNLSTFAIDVDTASYTAARS 667 Query: 139 FLNQGLLPPPDAVRVEEIVNYFPSDWDIKDKQSIPASKPIPFAMRYELAPAPWNEQRTLL 198 + G VR+EE +N F + + K++ F + EL+ LL Sbjct: 668 EIRAGRKVEASHVRIEEFINNFDYHYSVPKKEA--------FKIDSELSDHKVYAGVKLL 719 Query: 199 KVDILAKDRKSEELPASNLVFLIDTSGSMISDERLPLIQSSLKLLVKELREQDNIAIVTY 258 +V + + ++ + F+ID SGSM ++ RLPLIQ +L + K + + D + I++ Sbjct: 720 RVGVQGQRLGADSQKPGSYTFVIDNSGSMAAENRLPLIQKTLPNMFKAMNQDDEVTILSC 779 Query: 259 AGDSRIALPSISGSHKAEINAAIDSLDAEGSTNGGAGLELAYQQATKGFIKGGINRILLA 318 G I+ S+ +++ A+ +++A N G+E AY+ A + F G +NR++L Sbjct: 780 EGGVTNLANRITASNHSQLETAVKNIEAGTVANLSVGIEEAYKLAAQNFRSGAVNRVILL 839 Query: 319 TDGDFNVGIDDPKSIESMVKKQRESGVTLSTFGVGNSNYNEAMMVRIADVGNGNYSYIDT 378 +DG ++G + + + V + R+ G+ + GVG+ +Y+++ + +A+ G+G Y + D+ Sbjct: 840 SDGIASLGEKEAQEVLKTVSQYRKQGIGNTVIGVGSEDYDDSFLETLANKGDGVYYFGDS 899 Query: 379 LSEAQKVLNSEMRQMLITVAKDVKAQIEFNPAWVTEYRQIGYEKRQLRVEHFNNDNVDAG 438 + +L + T+A+DVK Q+EFNP V YR +GYEKR+L + F ND VDAG Sbjct: 900 KEQMNDILVNNFEASFKTIARDVKIQLEFNPQAVRSYRLLGYEKRRLANKDFRNDKVDAG 959 Query: 439 DIGAGKHITLLFELTLNGQKASIDKLRYAPDNKLAKSDKTKELAWLKIRWKY----PQGK 494 +IGAG+ +T L+EL +N ++ + +L + +R+K + Sbjct: 960 EIGAGQSVTALYELVVN------------------ENTQEAKLGAVNLRYKNLENELVTE 1001 Query: 495 ESQLVEFPLGPTINAPSEDMRFRAAVAAYGQKLRGSEYLNNTSWQQIKQWAQQAKGEDPQ 554 ++ ++ + MR A + + L+ + + Q+ + E PQ Sbjct: 1002 INKEIKAQQNVSFPESQSSMRLAWCAATFARLLKNNG-KGELRFDQLAAEVDKILLERPQ 1060 Query: 555 GYR-AEFIRLIELADGV 570 + EF LI + Sbjct: 1061 DQKIQEFKDLIIRCQSL 1077 >UniRef50_C1RGW7 Uncharacterized protein containing a von Willebrand factor type A (VWA) domain n=1 Tax=Cellulomonas flavigena DSM 20109 RepID=C1RGW7_9CELL Length = 500 Score = 322 bits (824), Expect = 3e-86, Method: Composition-based stats. Identities = 131/486 (26%), Positives = 223/486 (45%), Gaps = 38/486 (7%) Query: 88 RAAKAKATHIANPGTARYQQFDDNPVKQVAQNPLATFSLDVDTGSYANVRRFLNQGLLPP 147 + + T + A++ L+TF+LDVDTG+Y R + QG Sbjct: 45 GPYQEDLPYPEPGPT----GPTAAGMTDPARDALSTFALDVDTGAYTRFRDAVRQGFSVD 100 Query: 148 PDAVRVEEIVNYFPSDWDIKDKQSIPASKPIPFAMRYELAPAPWNEQRTLLKVDILAKDR 207 P VR EE VNYF D++ + + + P+ L++V I + Sbjct: 101 PFGVRTEEFVNYFAQDYEPPAEG---------LGVSIDATALPFRPDHRLVRVGISSAPA 151 Query: 208 KSEELPASNLVFLIDTSGSMISDERLPLIQSSLKLLVKELREQDNIAIVTYAGDSRIALP 267 + ++LV ++D SGSM ++ + +L+ LV LR D +A+V Y+ ++ + L Sbjct: 152 SAVSRADADLVLVVDCSGSMDEAGKMETTKYALRTLVSSLRRTDRVAMVCYSTEADVYLE 211 Query: 268 SISGSHKAEINAAIDSLDAEGSTNGGAGLELAYQQATKGFIKGGINRILLATDGDFNVGI 327 + + + AAID L STN AGL L Y A +G + R++L +DG NVG Sbjct: 212 PTPVAEREGVLAAIDRLAPRDSTNAAAGLALGYDLAMSMRTEGRLTRVVLVSDGVANVGE 271 Query: 328 DDPKSIESMVKKQRESGVTLSTFGVGNSNYNEAMMVRIADVGNGNYSYIDTLSEAQKVLN 387 DP+ I + + Q ++G++L + GVG + YN+ ++ ++AD G+G + Y+D +EA++V Sbjct: 272 TDPEGILARISSQAKAGISLISVGVGITTYNDHLLEQLADQGDGWHVYVDGEAEAERVFA 331 Query: 388 SEMRQMLITVAKDVKAQIEFNPAWVTEYRQIGYEKRQLRVEHFNNDNVDAGDIGAGKHIT 447 + + L+ D +AQ+EF+PA V YR +GYE R + E F ND VD G++ AG+ T Sbjct: 332 TGLTGSLVVAGTDARAQVEFDPAQVAGYRLLGYENRAVADEDFRNDAVDGGEVFAGRSTT 391 Query: 448 LLFELTLNGQKASIDKLRYAPDNKLAKSDKTKELAWLKIRWKY----PQGKESQLVEFPL 503 L+E+ + + +R+ P +++ L Sbjct: 392 ALYEVAMR------------------EGAGDGAFVRATVRYLDDDGRPVERDASLSRDDC 433 Query: 504 GPTINAPSEDMRFRAAVAAYGQKLRGSEYLNNTSWQQIKQWAQQAKGEDPQGYRA--EFI 561 + S +R VA L + + ++ A+ G G RA E + Sbjct: 434 AASPREASPRLRQDLVVALLTDHLTDGPWSQEIAPADVRAEARTLLGVL-DGDRAVQELV 492 Query: 562 RLIELA 567 L++ A Sbjct: 493 ELVDRA 498 >UniRef50_B5JNR2 von Willebrand factor type A domain protein n=1 Tax=Verrucomicrobiae bacterium DG1235 RepID=B5JNR2_9BACT Length = 923 Score = 310 bits (795), Expect = 8e-83, Method: Composition-based stats. Identities = 124/552 (22%), Positives = 230/552 (41%), Gaps = 36/552 (6%) Query: 27 ESQQQQPSTPTEQQVLAAQQAAIKEAEQSAAAAKALAQQ--EVQQYSDKQALQGRLQEAP 84 +++ + P T + + + + + + + A Sbjct: 400 DARPRLPFQNTANSPTPSSPSLAADTFNPEPITENPPSSFNLAEGLDNPNLVAASTANAT 459 Query: 85 TFARAAKAKATHIANPGTARYQQFDDNPVKQVAQNPLATFSLDVDTGSYANVRRFLNQGL 144 A NP Q + P A +P +TFSL+V SY +L Q + Sbjct: 460 QTAPNNDEPLPRTQNPPPTT--QLSEYPESNTATDPQSTFSLNVSDVSYRLTEAYLAQNV 517 Query: 145 LPPPDAVRVEEIVNYFPSDWDIKDKQSIPASKPIPFAMRYELAPAPWNEQRTLLKVDILA 204 PP +R EE VN F P ++ I F +E A P+ R +L+ + Sbjct: 518 RPPAGTLRTEEFVNAFDYGDPTP-----PVARKIGF--TWERAHWPFAHDRDVLRFSLQT 570 Query: 205 KDRKSEELPASNLVFLIDTSGSMISDERLPLIQSSLKLLVKELREQDNIAIVTYAGDSRI 264 +L IDTSGSM +R+ ++ S L L E+D ++IV++ R+ Sbjct: 571 AAHGRASSQPLHLTLAIDTSGSMSRPDRVDIVNSLATALQSNLTEKDRLSIVSFDRQPRL 630 Query: 265 ALPSISGSHKAEINAAIDSLDAEGSTNGGAGLELAYQQATKGFIKGGINRILLATDGDFN 324 L S + + + L+ +G T+ + L+L+YQ A + F + INR++L TDG N Sbjct: 631 VLDGQSVTAETNLATLATQLNPQGGTDLESALQLSYQTAQRHFQENAINRVILITDGAAN 690 Query: 325 VGIDDPKSIESMVKKQRESGVTLSTFGVGNSNYNEAMMVRIADVGNGNYSYIDTLSEAQK 384 +G + + + + V + R G+ L FG+G +++ + ++ G+G Y ++ + +A Sbjct: 691 LGNTNAEQLRTTVTENRIRGIALDCFGIGFDGHDDTFLESLSRNGDGRYRFLRSPEDAAL 750 Query: 385 VLNSEMRQMLITVAKDVKAQIEFNPAWVTEYRQIGYEKRQLRVEHFNNDNVDAGDIGAGK 444 L ++ +L A DVK Q+EFNP V Y+Q+GY++ Q+ + F N+ VDA ++ A + Sbjct: 751 ELGPKLAGLLRPAAYDVKVQVEFNPTRVETYQQLGYQQHQIADQDFRNNAVDAAELAATE 810 Query: 445 HITLLFELTLNGQKASIDKLRYAPDNKLAKSDKTKELAWLKIRWKYPQGKESQLVEF--- 501 L+ + D +L +++R++ + + + + Sbjct: 811 SGNALYLAKVLP-------------------DGRGDLGLVRVRFRDAESGAYEELSWNLP 851 Query: 502 --PLGPTINAPSEDMRFRAAVAAYGQKLRGSEYLNNTSWQQIKQWAQQAKGEDPQGYRAE 559 P ++ S +R + AA+ L S + + + + A P R + Sbjct: 852 YKANAPELDQASPSLRLASIAAAFAALLNESPLAHAITHSDLYELAAPLPQHFPTQTRVQ 911 Query: 560 FIR-LIELADGV 570 +R LI A + Sbjct: 912 TLRTLINQARRL 923 >UniRef50_B1ZQD5 von Willebrand factor type A n=1 Tax=Opitutus terrae PB90-1 RepID=B1ZQD5_OPITP Length = 859 Score = 300 bits (769), Expect = 9e-80, Method: Composition-based stats. Identities = 119/462 (25%), Positives = 200/462 (43%), Gaps = 33/462 (7%) Query: 114 KQVAQNPLATFSLDVDTGSYANVRRFLNQGLLPPPDAVRVEEIVNYFPSDWDIKDKQSIP 173 A+ P++TFSL V S+ + L +G +P P +R EE N F P Sbjct: 420 VSTAKEPVSTFSLHVSDVSFQLAQAALARGEMPDPQRIRPEEFYNAFDYG------DPTP 473 Query: 174 ASKPIPFAMRYELAPAPWNEQRTLLKVDILAKDRKSEELPASNLVFLIDTSGSMISDERL 233 AS A R E A P +QR L+++ + NL L+DTSGSM +R Sbjct: 474 ASADK-IACRIEQAAHPLLQQRNLVRIAMKVPAAGRGAGQPLNLTVLLDTSGSMERTDRA 532 Query: 234 PLIQSSLKLLVKELREQDNIAIVTYAGDSRIALPSISGSHKAEINAAIDSLDAEGSTNGG 293 ++++L +L L D + ++ +A R+ S++G ++ + G TN Sbjct: 533 TSVRAALGVLASLLTPDDRVTLIGFARQPRLLAESLAGDQARQLVDLASTTPFTGGTNLE 592 Query: 294 AGLELAYQQATKGFIKGGINRILLATDGDFNVGIDDPKSIESMVKKQRESGVTLSTFGVG 353 A L LA + A + NRI+L TDG N+G DP + + ++ R+ G+ GVG Sbjct: 593 AALSLAGELARRHHNAAAQNRIVLITDGAANLGNADPAQLATRIETLRQQGIAFDACGVG 652 Query: 354 NSNYNEAMMVRIADVGNGNYSYIDTLSEAQKVLNSEMRQMLITVAKDVKAQIEFNPAWVT 413 ++A++ + G+G Y +D A ++ A+++K Q+ FNPA V Sbjct: 653 TDGLDDAVLEALTRKGDGRYYVLDAPENADAGFARQLAGAFRPAAENIKVQVRFNPARVA 712 Query: 414 EYRQIGYEKRQLRVEHFNNDNVDAGDIGAGKHITLLFELTLNGQKASIDKLRYAPDNKLA 473 YR IG+E+ +LR + F ND VDA ++ A + L+++ + Q Sbjct: 713 SYRLIGFEQHRLREQDFRNDQVDAAELAAEESAVALYQVEVLPQGEG------------- 759 Query: 474 KSDKTKELAWLKIRWKYPQG-----KESQLVEFPLGPTINAPSEDMRFRAAVAAYGQKLR 528 EL + R++ P + ++ P P S ++ A +KLR Sbjct: 760 ------ELGDVFARFRDPATGAMIERSWTMLHEPRAPAFERASPSLQLAGVAALVAEKLR 813 Query: 529 GSEYLNNTSWQQIKQWAQQAKGEDPQGYRAEFIRLIELADGV 570 G E ++ + +G Q R + +LI + D + Sbjct: 814 GGEAAGQIHLNELAGVVNRLRGHYGQNLRVQ--QLIAMFDQL 853 >UniRef50_Q9M1S2 Putative uncharacterized protein T5N23_140 n=5 Tax=Arabidopsis thaliana RepID=Q9M1S2_ARATH Length = 676 Score = 262 bits (670), Expect = 2e-68, Method: Composition-based stats. Identities = 90/473 (19%), Positives = 194/473 (41%), Gaps = 36/473 (7%) Query: 106 QQFDDNPVKQVAQN-PLATFSLDVDTGSYANVR------RFLNQGLLPPPDAV----RVE 154 ++ + P+++ + + P F + + + R R + QG P P R+E Sbjct: 120 AKWKEIPIQKPSLDLPYYPFDRCNNDAAISLFRCLPPSQRAITQGH-PEPATFDDDERLE 178 Query: 155 EIVNYFPSDWDIKDKQSIPASKPIPFAMRYELAPAPWNE-----QRTLLKVDILAKDRKS 209 E + F + ++ K++ + + + E++ P ++ + + Sbjct: 179 EQI-VFDGETEVLKKENRDYVRMMDMKVYPEVSAVPQSKSCENFDVLVHLKAVTGDQISQ 237 Query: 210 EELPASNLVFLIDTSGSMISDERLPLIQSSLKLLVKELREQDNIAIVTYAGDSRIALP-- 267 +LV ++D SGSM +L L++ ++ +++ L D ++++ ++ +R P Sbjct: 238 YRRAPIDLVTVLDISGSM-GGTKLALLKRAMGFVIQNLGSSDRLSVIAFSSTARRLFPLT 296 Query: 268 SISGSHKAEINAAIDSLDAEGSTNGGAGLELAYQQATKGFIKGGINRILLATDGDFNVGI 327 +S + + A++SL A G TN GL + + + I+L +DG Sbjct: 297 RMSDAGRQLALQAVNSLVANGGTNIVDGLRKGAKVMEDRLERNSVASIILLSDGRDTYTT 356 Query: 328 DDPKSIESMVKKQRESGVTLSTFGVGNSNYNEAMMVRIADVGNGNYSYIDTLSEAQKVLN 387 + P ++ Q +++ +FG G S+++ ++M +++V G +S+I++ S Q L Sbjct: 357 NHPDPSYKVMLPQ----ISVHSFGFG-SDHDASVMHSVSEVSGGTFSFIESESVIQDALA 411 Query: 388 SEMRQMLITVAKDVKAQIE-FNPA-WVTEYRQIGYEKRQLRVEHFNNDNVDAGDIGAGKH 445 + +L ++++ +IE +P ++ + Y L ++ VD GD+ A + Sbjct: 412 QCIGGLLSVAVQELRVEIEGVSPNVRLSSIKAGSY--SSLVTGDGHSGLVDLGDLYADEE 469 Query: 446 ITLLFELTLN---GQKASIDKLRYAPDNKLAKSDKTKELAWLKI-RWKYPQGKESQLVEF 501 L + + + KLR N L K T E L+I R +Y ++ +E Sbjct: 470 RDFLVSINIPVEEDGHTPLLKLRCLYINPLTKEITTLESHVLQIRRPEYVAEEKVVPIEV 529 Query: 502 PLGPTINAPSEDMRFRAAVAAYGQKLRGSEYLNNTSWQQIKQWAQQAKGEDPQ 554 +E M +A +G + + N + AK D Sbjct: 530 VRQRNRFLAAEAMAQARTLAEHGDLEAAVKAIENFRL--VLAETVAAKSCDRF 580 >UniRef50_C5WYU9 Putative uncharacterized protein Sb01g047460 n=3 Tax=Andropogoneae RepID=C5WYU9_SORBI Length = 698 Score = 249 bits (637), Expect = 2e-64, Method: Composition-based stats. Identities = 71/341 (20%), Positives = 146/341 (42%), Gaps = 35/341 (10%) Query: 208 KSEELPASNLVFLIDTSGSMISDERLPLIQSSLKLLVKELREQDNIAIVTYAGDSRIALP 267 + +LV ++D SGSM + ++ L+++++ +++ L D ++++ ++ +R P Sbjct: 233 SASSRAPLDLVTVLDVSGSM-AGTKIALLKNAMSFVIQTLGPNDRLSVIAFSSTARRLFP 291 Query: 268 --SISGSHKAEINAAIDSLDAEGSTNGGAGLELAYQQATKGFIKGGINRILLATDGDFNV 325 ++ + + + A+ SL A G TN GL+ + +K + I+L +DG Sbjct: 292 LRRMTLAGRQQALQAVSSLVASGGTNIADGLKKGAKVIEDRRLKNPVCSIILLSDGQDTY 351 Query: 326 GIDDPKSIESM-------VKKQRESGVTLSTFGVGNSNYNEAMMVRIADVGNGNYSYIDT 378 + +++ + V + TFG G S+++ A M IA++ +G +S+ID Sbjct: 352 TLPSDRNLLDYSALVPPSILPGTGHHVQIHTFGFG-SDHDSAAMHAIAEISSGTFSFIDA 410 Query: 379 LSEAQKVLNSEMRQMLITVAKDVKAQIEFNPAWV--TEYRQIGYEKRQLRVEHFNNDNVD 436 Q + +L V K+++ IE V T + GY + E+ +VD Sbjct: 411 EGSIQDGFAQCIGGLLSVVVKEMRLGIECVDNGVLLTSIKSGGYTSQV--AENGRGGSVD 468 Query: 437 AGDIGAGKHITLLFELTLNGQKASIDKLRYAPDNKLAKSDKTKELAWLKIRWKYPQGKES 496 GD+ A + L L + + ++ + A + + E+ ++R + P+ Sbjct: 469 IGDLYADEERGFLLTLHVPPAQGQTVLIKPSCTYHDAITMENIEVHGEEVRIQRPE---- 524 Query: 497 QLVEFPLGPTIN------APSEDM----------RFRAAVA 521 V+ + P + +EDM F AVA Sbjct: 525 HHVDCKMSPEVEREWHRVQATEDMSAARAAAEVGAFSQAVA 565 >UniRef50_Q10RY0 Os03g0142500 protein n=2 Tax=Oryza sativa RepID=Q10RY0_ORYSJ Length = 694 Score = 247 bits (631), Expect = 7e-64, Method: Composition-based stats. Identities = 94/457 (20%), Positives = 172/457 (37%), Gaps = 47/457 (10%) Query: 106 QQFDDNPVKQV-------AQNPLATFSLDVDTGSYANVRRFLN--QGLLPPPDAVRVEEI 156 ++ + P + + ++T + D G + VRR + G L AV Sbjct: 119 AEWKELPFQGTQPGDTAYGRARVSTVNWPQDEGQMSVVRRLSHGYSGNLQQQLAVFRTPE 178 Query: 157 VNYFPSDWDIKDKQSIPASKPIPFAMRYELAPAPW------NEQRTLLKVDILAKDRKSE 210 + F D +I D QS E+ +E+R + + I K KS Sbjct: 179 ASIFNDDENI-DPQSETVDDHNAVTNSVEIKTYSEFPAIQKSERRKVFAILIHLKAPKSL 237 Query: 211 E----LPASNLVFLIDTSGSMISDERLPLIQSSLKLLVKELREQDNIAIVTYAGDSRIAL 266 + +LV ++D SGSM S +L L++ ++ +++ L D +++V ++ ++ Sbjct: 238 DSVSSRAPLDLVTVLDVSGSM-SGIKLSLLKRAMSFVIQTLGPNDRLSVVAFSSTAQRLF 296 Query: 267 P--SISGSHKAEINAAIDSLDAEGSTNGGAGLELAYQQATKGFIKGGINRILLATDGDFN 324 P ++ + + + AI SL A G TN L+ + K ++ I+L +DG Sbjct: 297 PLRRMTLTGRQQALQAISSLVASGGTNIADALKKGAKVVKDRRRKNPVSSIILLSDGQDT 356 Query: 325 VG-------IDDPKSIESMVKKQRESGVTLSTFGVGNSNYNEAMMVRIADVGNGNYSYID 377 I+ + + V + TFG G ++++ A M IA+ NG +S+ID Sbjct: 357 HSFLSGEADINYSILVPPSILPGTSHHVQIHTFGFG-TDHDSAAMHAIAETSNGTFSFID 415 Query: 378 TLSEAQKVLNSEMRQMLITVAKDVKAQIEFNPAWVT--EYRQIGYEKRQLRVEHFNNDNV 435 Q M +L V KD++ IE V+ + Y + E + V Sbjct: 416 AEGSIQDAFAQCMGGLLSVVVKDMRLCIECIDEGVSLTSIKSGSYASQVAGNE--RSGLV 473 Query: 436 DAGDIGAGKHITLLFELTLNGQKASIDKLRYAPDNKLAKSDKTKELAW-----------L 484 D GD+ A + L L + ++ A + + +L + Sbjct: 474 DIGDLYADEERGFLVTLHVPAAHGQTVLIKPKCTYLDAITMENVQLDGEEVIIQRPAYCV 533 Query: 485 KIRWKYPQGKESQLVEFPL-GPTINAPSEDMRFRAAV 520 +E V+ + +ED F AV Sbjct: 534 DCTMSPEVEREWHRVQATEDMSAARSAAEDGSFSQAV 570 >UniRef50_Q9LN03 T6D22.13 n=2 Tax=Arabidopsis thaliana RepID=Q9LN03_ARATH Length = 641 Score = 244 bits (622), Expect = 1e-62, Method: Composition-based stats. Identities = 64/303 (21%), Positives = 132/303 (43%), Gaps = 24/303 (7%) Query: 206 DRKSEELPASNLVFLIDTSGSMISDERLPLIQSSLKLLVKELREQDNIAIVTYAGDSRIA 265 D +L+ ++D SGSM ++ L+++++ +++ L E D +++++++ +R Sbjct: 194 DDARRARAPLDLITVLDVSGSMDG-VKMELMKNAMSFVIQNLGETDRLSVISFSSMARRL 252 Query: 266 LPS--ISGSHKAEINAAIDSLDAEGSTNGGAGLELAYQQATKGFIKGGINRILLATDGDF 323 P +S + K A++SL A+G TN GL++ + K ++ ++L +DG Sbjct: 253 FPLRLMSETGKQAAMQAVNSLVADGGTNIAEGLKIGARVIEGRRWKNPVSGMMLLSDGQD 312 Query: 324 N-----VGIDDPKSIESMVKKQRESGVTLSTFGVGNSNYNEAMMVRIADVGNGNYSYIDT 378 N G+ ES++ + + TFG G S+++ +M I++V +G +S+I+T Sbjct: 313 NFTFSHAGVRLRTDYESLLPSS--CRIPIHTFGFG-SDHDAELMHTISEVSSGTFSFIET 369 Query: 379 LSEAQKVLNSEMRQMLITVAKDVKAQIEFNPAWVTEYRQIGYEKRQLRV-EHFNNDNVDA 437 + Q + +L V + +IE + I + R+ +D Sbjct: 370 ETVIQDAFAQCIGGLLSVVILEQVVEIECIHEQGLKISSIKAGSYRSRIAPDARTATIDV 429 Query: 438 GDIGAGKHITLLFELTLNGQKASIDKLRYAPDNKLAKSDKTKELAWLKIRWKYPQGKESQ 497 GD+ A + L L + + ++ L ++ +K P KE Sbjct: 430 GDMYAEEERDFLVLLEIPCCDNG------------SGESESLSLLKVRCVYKDPVTKEIV 477 Query: 498 LVE 500 VE Sbjct: 478 HVE 480 >UniRef50_C5YHY2 Putative uncharacterized protein Sb07g005010 n=2 Tax=Sorghum bicolor RepID=C5YHY2_SORBI Length = 567 Score = 241 bits (615), Expect = 6e-62, Method: Composition-based stats. Identities = 78/416 (18%), Positives = 157/416 (37%), Gaps = 26/416 (6%) Query: 161 PSDWDIKDKQSIPASKPIPFAMRYEL-APAPWN-EQRTLLKVDILAKDRKSEELPASNLV 218 P D Q A+ + + PA R V + AK +LV Sbjct: 24 PLDRPTAPPQGPAANGGRGLVLSTQCEFPAVGRFTSRDRFAVLVHAKAPSDVSRAPLDLV 83 Query: 219 FLIDTSGSMISDERLPLIQSSLKLLVKELREQDNIAIVTYAGDSRIA--LPSISGSHKAE 276 ++D S SM E+L L++ ++ ++ +L D +++VT++ D+ L +S + KA Sbjct: 84 TVLDVSDSMKG-EKLALLKQAMCFVIDQLGPADRLSVVTFSNDASRLTRLARMSDAGKAS 142 Query: 277 INAAIDSLDAEGSTNGGAGLELAYQQATKGFIKGGINRILLATDGDFNVGIDDPKS---- 332 A++SL +G TN G+ +A + K + ++L +DG N G + Sbjct: 143 AKIAVESLAVQGFTNIKQGIHVAAEVLAGRREKNVVAGMILLSDGHDNCGGTSVRPDGTK 202 Query: 333 --------IESMVKKQRESGVTLSTFGVGNSNYNEAMMVRIADVGNGNYSYIDTLSEAQK 384 ++ + TFG G S ++ M +A+ G +S++ + Q Sbjct: 203 SYVNLVPPSLTVAAGSSRPAAPIHTFGFGTS-HDAGAMHAVAEATGGTFSFVGDEAAIQD 261 Query: 385 VLNSEMRQMLITVAKDVKAQIEFNPAWVTEYRQIGYEKRQLRVEHFNNDNVDAGDIGAGK 444 + +L ++ + + V + + +D G++ G+ Sbjct: 262 SFARCVGGLLSVAVQEARVAVTCLHRGVHVQQVKSGAYVSHVGADGHAATIDVGELYDGE 321 Query: 445 HITLLFELTLNGQKASIDKLRYAPDNKLAKSDKTKELAWLKIRWKYPQGKESQLVEFPLG 504 L + + +++ + R + + T + A + +L P Sbjct: 322 ERRFLVLVHVPRARSTEEVTRLIKASCTYREAATGQAARKVAAPAAVVQRPLELATLP-A 380 Query: 505 PTINAPSEDMRFRAA-------VAAYGQKLRGSEYLNNTSWQQIKQWAQQAKGEDP 553 P+++ E +R AA AA G + G+ + + + ++Q A A G DP Sbjct: 381 PSLDVERERVRLAAAEDIAAARTAADGGQNAGAARILESRLKAVEQSAPGAAGNDP 436 >UniRef50_C7PTX8 von Willebrand factor type A n=1 Tax=Chitinophaga pinensis DSM 2588 RepID=C7PTX8_CHIPD Length = 462 Score = 240 bits (613), Expect = 9e-62, Method: Composition-based stats. Identities = 82/300 (27%), Positives = 145/300 (48%), Gaps = 10/300 (3%) Query: 197 LLKVDILAKDRKSEE-LPASNLVFLIDTSGSMISDERLPLIQSSLKLLVKELREQDNIAI 255 L V+I + ++ + N+ ++D SGSM S +++ + + K L+ +L D+++I Sbjct: 62 YLYVNIKGGEGEASKPRVPLNISLVLDRSGSM-SGDKIKYARQAAKFLIDQLNSTDHLSI 120 Query: 256 VTYAGDSRIALPSISGSHKAEINAAIDSLDAEGSTNGGAGLELAYQQATKGFIKGGINRI 315 V Y + PS S +K + AAID + GSTN G+ Y Q +G +NR+ Sbjct: 121 VNYDDRVEVTSPSQSVKNKEALKAAIDKIHDRGSTNLSGGMLEGYTQVKSTRKEGYVNRV 180 Query: 316 LLATDGDFNVGIDDPKSIESMVK-KQRESGVTLSTFGVGNSNYNEAMMVRIADVGNGNYS 374 LL TDG N GI DP ++ + + K +E G+ LSTFGVG ++YNE ++ +A+ G NY Sbjct: 181 LLLTDGLANQGITDPLELKRLAENKYKEDGIALSTFGVG-ADYNEDLLTMLAENGRANYY 239 Query: 375 YIDTLSEAQKVLNSEMRQMLITVAKDVKAQIEFNPAWVTEYRQIGYEKRQLRVEHFNNDN 434 +ID+ + ++ E++ +L VA++ A+I P + GY Sbjct: 240 FIDSPDKIPQIFAGELKGLLSVVAQNAWAEISI-PQDMECTYVYGYPYEV----KGGKVL 294 Query: 435 VDAGDIGAGKHITLLFELTLNGQKASIDKLRYAPD-NKLAKSDKTKELAWLKIRWKYPQG 493 V D+ A +L +L G + + ++ ++ KE ++I+ + Sbjct: 295 VRFNDLYANDEKAILIKLKSKGTYTTNLRFDCTVGYTNVSSFEQVKESKPVQIKMTSDKE 354 >UniRef50_Q3A188 von Willebrand factor type A domain protein n=1 Tax=Pelobacter carbinolicus DSM 2380 RepID=Q3A188_PELCD Length = 442 Score = 240 bits (612), Expect = 1e-61, Method: Composition-based stats. Identities = 72/328 (21%), Positives = 144/328 (43%), Gaps = 24/328 (7%) Query: 193 EQRTLLKVDILAKD-RKSEELPASNLVFLIDTSGSMISDERLPLIQSSLKLLVKELREQD 251 Q+T++K+ + A ++ + P NL ++D SGSM S ++ + + V+ L + D Sbjct: 43 AQKTVIKIALDAPRAPRTAQRPPVNLALVLDRSGSM-SGNKIAKAREAAIEAVRRLSDGD 101 Query: 252 NIAIVTYAGDSRIALPSISGSHKAEINAAIDSLDAEGSTNGGAGLELAYQQATKGFIKGG 311 ++V Y +P+ S +I A I + GST + + K Sbjct: 102 LFSLVVYDDSVETLVPAQPVSDIGDIEARIRRIRPGGSTALFGAVSQGAAEVRKHSDAPY 161 Query: 312 INRILLATDGDFNVGIDDPKSIESMVKKQRESGVTLSTFGVGNSNYNEAMMVRIADVGNG 371 +NR++L +DG NVG P + + + G++++T GVG +++NE +M ++A+ +G Sbjct: 162 VNRVVLLSDGLANVGPSRPADLARLGAALLKEGISVTTVGVG-TDFNEDLMTQLAERSDG 220 Query: 372 NYSYIDTLSEAQKVLNSEMRQMLITVAKDVKAQIEFNPAWVTEYRQIGYEKRQLRVEHFN 431 N+ ++++ + ++ +E+ +L VA+ V IE P V R IG E Sbjct: 221 NHYFVESSRDLPRIFAAELGDVLSVVARKVVISIEC-PQGVKPLRVIGREGSI----KGQ 275 Query: 432 NDNVDAGDIGAGKHITLLFELTLNGQKASIDKLRYAPDNKLAKSDKTKELAWLKIRWKYP 491 V + G+ L E+ + +A ++ +LA + R++ Sbjct: 276 RVEVRMNQLYGGQEKYALVEVEVPASRA----------------NQKLDLARVDCRYRNA 319 Query: 492 QGKESQLVEFPLGPTINAPSEDMRFRAA 519 + + + SE++R A+ Sbjct: 320 LTDTGESSTAMVQTRFSKRSEEVRKAAS 347 >UniRef50_A1ZUW0 Von Willebrand factor, type A n=1 Tax=Microscilla marina ATCC 23134 RepID=A1ZUW0_9SPHI Length = 425 Score = 238 bits (606), Expect = 5e-61, Method: Composition-based stats. Identities = 71/286 (24%), Positives = 144/286 (50%), Gaps = 9/286 (3%) Query: 204 AKDRKSEELPASNLVFLIDTSGSMISDERLPLIQSSLKLLVKELREQDNIAIVTYAGDSR 263 K + +E N+ ++D SGSM S ++L ++ ++ ++ L+ D ++IV Y + Sbjct: 34 GKAPEKQERIPLNISLVVDRSGSM-SGDKLNYVKKAVDFVIDNLKSDDVLSIVQYDDEID 92 Query: 264 IALPSISGSHKAEINAAIDSLDAEGSTNGGAGLELAYQQATKGFIKGGINRILLATDGDF 323 + S ++K ++ + + A TN G+ Y Q G +NR+LL +DG Sbjct: 93 VVASSAKVTNKKALHEKVKGIQARNMTNLSGGMMEGYAQVKSTQSNGYVNRVLLLSDGLA 152 Query: 324 NVGIDDPKSIESMVKKQ-RESGVTLSTFGVGNSNYNEAMMVRIADVGNGNYSYIDTLSEA 382 N GI P+ ++ + +K+ RE+G+ LSTFGVG S++NE +M +++ G NY +ID + Sbjct: 153 NAGITAPEQLQQIAQKKFREAGIALSTFGVG-SDFNEVLMTNLSEYGGANYYFIDMPDKI 211 Query: 383 QKVLNSEMRQMLITVAKDVKAQIEFNPAWVTEYRQIGYEKRQLRVEHFNNDNVDAGDIGA 442 ++ E+ +L VA++ ++ F +++ + G+ + +V+ D+ A Sbjct: 212 PQIFAQELEGLLSVVAQNTTLEVVFPQSYLKCTQVYGFPANISPDK----VSVNFNDVVA 267 Query: 443 GKHITLL--FELTLNGQKASIDKLRYAPDNKLAKSDKTKELAWLKI 486 + +L FE+ + + K R D+ + K ++ + L++ Sbjct: 268 EEEKAVLIKFEVIRTPDEPFVLKTRLQYDDVIDKMERITDELDLRM 313 >UniRef50_A4J6Q3 von Willebrand factor, type A n=1 Tax=Desulfotomaculum reducens MI-1 RepID=A4J6Q3_DESRM Length = 416 Score = 236 bits (602), Expect = 2e-60, Method: Composition-based stats. Identities = 81/297 (27%), Positives = 138/297 (46%), Gaps = 9/297 (3%) Query: 190 PWNEQRTLLKVDILAKDRKSEELPASNLVFLIDTSGSMISDERLPLIQSSLKLLVKELRE 249 P N+Q L V + A + +E P NL F+ID SGSM + E+L + ++ V L Sbjct: 17 PGNKQVAYLMVKLTAPKQVEKERPVQNLSFVIDRSGSM-AGEKLDYTKKAVAFAVGHLSP 75 Query: 250 QDNIAIVTYAGDSRIALPSISGSHKAEINAAIDSLDAEGSTNGGAGLELAYQQATKGFIK 309 QD ++V + + S ++K + A++S+ GSTN G+ L ++ + Sbjct: 76 QDYCSVVAFDDMVTMVASSHQVANKDALKMAVESIYPGGSTNLSGGMLLGVREVKLAHKE 135 Query: 310 GGINRILLATDGDFNVGIDDPKSIESMVKKQRESGVTLSTFGVGNSNYNEAMMVRIADVG 369 INR+LL TDG NVG+ D ++ ++ GV LSTFG+G ++ E ++ + + G Sbjct: 136 NQINRVLLLTDGMANVGVTDHSALVEKSREMAAGGVNLSTFGLGE-DFEEDLLQAMVEAG 194 Query: 370 NGNYSYIDTLSEAQKVLNSEMRQMLITVAKDVKAQIEFNPAWVTEYRQIGYEKRQLRVEH 429 GN+ YI+ + + E+ +L VA+++ +++ G E Sbjct: 195 GGNFYYIEKPDQIPGIFEQELTGLLSIVAQNLSVKVKPGQG----VSITGVLGYPFSSEE 250 Query: 430 FNNDNVDAGDIGAGKHITLLFELTLNGQKASIDKLRYAPDNKLAKSDKTKELAWLKI 486 V+ DI +G+ LL EL ++ KL + + A K+ L LK Sbjct: 251 G--VTVNLPDIYSGESKLLLLELLISPLTEGNHKL-ISVELDYADVRKSLALVNLKA 304 >UniRef50_D1HHA4 Whole genome shotgun sequence of line PN40024, scaffold_125.assembly12x (Fragment) n=1 Tax=Vitis vinifera RepID=D1HHA4_VITVI Length = 630 Score = 236 bits (601), Expect = 2e-60, Method: Composition-based stats. Identities = 63/285 (22%), Positives = 131/285 (45%), Gaps = 16/285 (5%) Query: 194 QRTLLKVDILAK----DRKSEELPASNLVFLIDTSGSMISDERLPLIQSSLKLLVKELRE 249 + + V I A D + +LV ++D SGSM + +L L++ ++ L++ L Sbjct: 180 RTFAVLVGIKAPALLDDAHLLDRAPIDLVAVLDVSGSM-AGSKLSLLKRAVCFLIQNLGP 238 Query: 250 QDNIAIVTYAGDSRIALP--SISGSHKAEINAAIDSLDAEGSTNGGAGLELAYQQATKGF 307 D ++IV+++ +R P +S + + AI+SL + G TN GL+ + + Sbjct: 239 SDRLSIVSFSSTARRIFPLRRMSDNGREAAGLAINSLTSSGGTNIVEGLKKGVRVLEERS 298 Query: 308 IKGGINRILLATDGDFNVGIDDPK------SIESMVKKQRESGVTLSTFGVGNSNYNEAM 361 + + I+L +DG D+ S ++ R++ + + TFG G S+++ Sbjct: 299 EQNPVASIILLSDGKDTYNCDNVNRRQTSHCASSNPRQGRQAIIPVHTFGFG-SDHDSTA 357 Query: 362 MVRIADVGNGNYSYIDTLSEAQKVLNSEMRQMLITVAKDVKAQIE-FNPAWVTEYRQIGY 420 M I+D G +S+I++++ Q + +L VA++++ ++ +P E G Sbjct: 358 MHAISDESGGTFSFIESVATVQDAFAMCIGGLLSVVAQELRLTVKSVSPGVHIESIPSGK 417 Query: 421 EKRQLRVEHFNNDNVDAGDIGAGKHITLLFELTLNGQKASIDKLR 465 ++ + +D GD+ A + L LT+ ++ + R Sbjct: 418 YLSEI-CDQGQQGVIDVGDLYAEEGKEFLIYLTVPELSSAEGEER 461 >UniRef50_Q9FF49 Retroelement pol polyprotein-like n=34 Tax=Magnoliophyta RepID=Q9FF49_ARATH Length = 704 Score = 235 bits (600), Expect = 3e-60, Method: Composition-based stats. Identities = 62/318 (19%), Positives = 133/318 (41%), Gaps = 10/318 (3%) Query: 212 LPASNLVFLIDTSGSMISDERLPLIQSSLKLLVKELREQDNIAIVTYAGDSRIALPS--I 269 +LV ++D SGSM + +L L++ ++ +++ L D +++++++ +R P + Sbjct: 248 RAPVDLVTVLDVSGSM-AGTKLALLKRAMGFVIQNLGPFDRLSVISFSSTARRNFPLRLM 306 Query: 270 SGSHKAEINAAIDSLDAEGSTNGGAGLELAYQQATKGFIKGGINRILLATDGDFNVGIDD 329 + + K E A++SL + G TN GL+ + K ++ I+L +DG + Sbjct: 307 TETGKQEALQAVNSLVSNGGTNIAEGLKKGARVLIDRRFKNPVSSIVLLSDGQDTYTMTS 366 Query: 330 PKSIES------MVKKQRESGVTLSTFGVGNSNYNEAMMVRIADVGNGNYSYIDTLSEAQ 383 P + K+ + + + FG G ++++ ++M IA+ G +S+I++ + Q Sbjct: 367 PNGSRGTDYKALLPKEINGNRIPVHAFGFG-ADHDASLMHSIAENSGGTFSFIESETVIQ 425 Query: 384 KVLNSEMRQMLITVAKDVKAQIEFNPAWVTEYRQIGYEKRQLRVEHFNNDNVDAGDIGAG 443 + +L V +++ IE + R + ++ GD+ A Sbjct: 426 DAFAQCIGGLLSVVVQELCVTIECMHHLLRIGSVKAGSYRFDNGPNSRTGSIAVGDLYAE 485 Query: 444 KHITLLFELTLNGQKASIDKLRYAPDNKLAKSDKTKELAWLKIRWKYPQGKESQLVEFPL 503 + L L + D + + K TKE L + + + E Sbjct: 486 EERNFLVNLDIPIVDGVSDVMSLLKVQCVYKDPVTKETVNLNNSGEVKILRPIVMTERRP 545 Query: 504 GPTINAPSEDMRFRAAVA 521 ++ + +R RAA A Sbjct: 546 VVSVEVDRQRIRLRAAEA 563 >UniRef50_Q67LZ3 Putative uncharacterized protein n=1 Tax=Symbiobacterium thermophilum RepID=Q67LZ3_SYMTH Length = 414 Score = 233 bits (595), Expect = 1e-59, Method: Composition-based stats. Identities = 88/342 (25%), Positives = 147/342 (42%), Gaps = 20/342 (5%) Query: 189 APWNEQRTLLKVDILAKDRKSEELPASNLVFLIDTSGSMISDERLPLIQSSLKLLVKELR 248 +P + LL + E P NL ++D SGSM L + +L+ LV ++ Sbjct: 17 SPSGGEVYLLVTVKAPRMPAPEGRPPLNLAAVVDRSGSMAGAA-LYFTKQALRFLVDQMA 75 Query: 249 EQDNIAIVTYAGDSRIALPSISGSHKAEINAAIDSLDAEGSTNGGAGLELAYQQATKGFI 308 E+D +AIVTY + PS K + +D + A G+TN GL QQ Sbjct: 76 EEDRLAIVTYDDQVHVPFPSQPVVQKDAVRLLVDGITAGGTTNLSGGLATGMQQIRPHAG 135 Query: 309 KGGINRILLATDGDFNVGIDDPKSIESMVKKQRESGVTLSTFGVGNSNYNEAMMVRIADV 368 G ++R+LL TDG NVG+ DP + + RE G+ +ST GVG +++E ++V +A+ Sbjct: 136 PGRVSRVLLMTDGLANVGVTDPDVLAGWARAWREKGLAVSTMGVGP-HFSEDLLVALAEA 194 Query: 369 GNGNYSYIDTLSEAQKVLNSEMRQMLITVAKDVKAQIEFNPAWVTEYRQIGYEKRQLRVE 428 G GN+ YI + ++ E+ +L + + IE + V +GY + + Sbjct: 195 GGGNFHYIANPDQIPRIFQEELHGLLQVAVQGLHLIIE-TESGVAVSGVLGYRSQGTPLR 253 Query: 429 HFNNDNVDAGDIGAGKHITLLFELT----LNGQKASIDKLRYAPDNKLAKSDKTKELAWL 484 + D+ AG+ +L L+ G K L Y P + + L Sbjct: 254 ----AALSLPDLYAGEVKHVLVRLSVAAPPAGGKLGRVALHYLPAAAGGRPGTLEADVSL 309 Query: 485 KIRWKYPQGKESQLVEFPLGPTINAPSEDMRFRAAVAAYGQK 526 ++ ++L E P + +R A AA+ + Sbjct: 310 EV-----TDDPARLGEPPDETVMRQ----LRLSQAGAAWDEA 342 >UniRef50_UPI00017450FB von Willebrand factor type A domain protein n=1 Tax=Verrucomicrobium spinosum DSM 4136 RepID=UPI00017450FB Length = 424 Score = 232 bits (591), Expect = 4e-59, Method: Composition-based stats. Identities = 72/333 (21%), Positives = 143/333 (42%), Gaps = 17/333 (5%) Query: 195 RTLLKVDILAKDRK-SEELPASNLVFLIDTSGSMISDERLPLIQSSLKLLVKELREQDNI 253 T LKV + ++ + S + N+ +ID SGSM +++ + + K + L D + Sbjct: 19 TTYLKVGLTGQELEASAKRAPVNVTIVIDKSGSM-GGDKMVHAREAAKQALDRLGAGDMV 77 Query: 254 AIVTYAGDSRIALPSISGSHKAEINAAIDSLDAEGSTNGGAGLELAYQQATKGFIKGGIN 313 ++V Y + P+ + + + AAID + A GST +G+ ++ + +N Sbjct: 78 SVVAYDDAVSLISPATDLTDRDRVKAAIDRIQAGGSTALFSGISKGAEELRRNKRPNQVN 137 Query: 314 RILLATDGDFNVGIDDPKSIESMVKKQRESGVTLSTFGVGNSNYNEAMMVRIADVGNGNY 373 R++L +DG NVG P+ + + + G+T++T G+G YNE +M +A +GN+ Sbjct: 138 RVVLLSDGMANVGPSSPQDLGRLGASLAKEGITVTTLGLGLG-YNEDLMTELALRSDGNH 196 Query: 374 SYIDTLSEAQKVLNSEMRQMLITVAKDVKAQIEFNPAWVTEYRQIGYEKRQLRVEHFNND 433 ++I+ + +E +L VA+ ++ +++ V R +G E H + Sbjct: 197 AFIENSQNLAGIFQTEFGDILSVVAQRIRVRVQCAEG-VRPVRVLGREADI----HGQDV 251 Query: 434 NVDAGDIGAGKHITLLFELTLN----GQKASIDKLRYAPDNKLAKSDKTKELAWLKIRWK 489 ++ I A +H LL E+ + A + N L + + R Sbjct: 252 ELEMNQIYARQHKYLLLEVEIPEGVADTDAPVATAEVISVNALTGIENRSLTRSVARRVN 311 Query: 490 YP-----QGKESQLVEFPLGPTINAPSEDMRFR 517 P ++ LVE + + ++ Sbjct: 312 DPALIANTVNKAVLVEVARAISTEKEALAVKLS 344 >UniRef50_A9FTM1 Putative uncharacterized protein n=1 Tax=Sorangium cellulosum 'So ce 56' RepID=A9FTM1_SORC5 Length = 535 Score = 231 bits (588), Expect = 8e-59, Method: Composition-based stats. Identities = 107/431 (24%), Positives = 182/431 (42%), Gaps = 47/431 (10%) Query: 132 SYANVRRFLNQGLLPPPDAVRVEEIVNYFPSDWDIKDKQSIPASKPIPFAMRYELAPAPW 191 S +VR L G P P VR E +NY+ D+ D+ +R E P Sbjct: 130 SPVHVRELLRSGRAPEPWQVRTYEFLNYYRIDYAPPDEGE----------LRVEPQIEPG 179 Query: 192 NEQRTL-LKVDILAKDRKSEELPASNLVFLIDTSGSMISDERLPLIQSSLKLLVKELREQ 250 E + L++ + + D S P + F++DTSGSM E + +++++ + L E Sbjct: 180 EEAGSYALQIGVRSYDPPSPRRP-IAVTFVLDTSGSMDG-EPMAREKATVRAVAASLSEG 237 Query: 251 DNIAIVTYAGDSRIALPS--ISGSHKAEINAAIDSLDAEGSTNGGAGLELAYQQATKGFI 308 D + +VT+ + + L + G + AA D+L A G T+ +GL + YQ A + F Sbjct: 238 DVVNMVTWNTQNSVILSGHVVDGPDDPALLAAADALSASGGTDLESGLRVGYQLAQEHFE 297 Query: 309 KGGINRILLATDGDFNVGIDDPKSIESMVKKQRESGVTLSTFGVGNS-NYNEAMMVRIAD 367 +G INR++L +DG NVG+ + I + + + L G G + YN+ +M + D Sbjct: 298 EGRINRVILVSDGGANVGVTSEELIALHAEDADQEAIYLVGVGTGPALGYNDVLMDAVTD 357 Query: 368 VGNGNYSYIDTLSEAQKVLNSEMRQMLITVAKDVKAQIEFNPAWVTEYRQIGYEKRQLRV 427 G G Y Y+D EA + +++ A+ V Q+E W + + E+ Sbjct: 358 KGRGAYVYLDDEDEAFHMFRDRFAEVMEVAARGV--QVELTMPWYFKMEKFYGEEYSTNP 415 Query: 428 EHFNNDNVDAGDIGAGKHITLLFELTLNGQKASIDKLRYAPDNKLAKSDKTKELAWLKIR 487 E ++ GD ++F + G + ++ ++ R Sbjct: 416 EEVEPQHLAPGD-------AMIFSQLVRGCDPGV--------------INDEDTLTVRAR 454 Query: 488 WKYPQGKESQLV--EFPLGPTINAPSEDMRFRAAVAAYGQKLRG--SEYLNNTSWQQIKQ 543 W+ P E++ V E L E + A+ AY + L+ SE L+ Q I Sbjct: 455 WQTPLTHEAKEVSREATLAELAAGSKEQLVKGKAIVAYAEALKAGTSEALHAAREQVIAA 514 Query: 544 WAQQAKGEDPQ 554 A G DP+ Sbjct: 515 NA----GGDPE 521 >UniRef50_D2LQW0 von Willebrand factor type A n=1 Tax=Bacillus cellulosilyticus DSM 2522 RepID=D2LQW0_BACS4 Length = 282 Score = 229 bits (583), Expect = 3e-58, Method: Composition-based stats. Identities = 63/244 (25%), Positives = 115/244 (47%), Gaps = 4/244 (1%) Query: 177 PIPFAMRYELAPAPWNEQRTLLKVDILAKDRKSEELPASNLVFLIDTSGSMISDERLPLI 236 + FA +YE P E LL V++ K E NL L+D SGSM S E L Sbjct: 6 KLSFAHQYENVPCKGKEAAYLL-VELTGAKVKHTERSPINLSLLLDRSGSM-SGEPLRYC 63 Query: 237 QSSLKLLVKELREQDNIAIVTYAGDSRIALPSISGSHKAEINAAIDSLDAEGSTNGGAGL 296 + + ++ +L ++D +++V + + +HK + I ++ G TN GL Sbjct: 64 KEACNFVINQLTDKDILSVVVFDDQVETIIEPQKVTHKDLLKEYIQRIETRGITNLSGGL 123 Query: 297 ELAYQQATKGFIKGGINRILLATDGDFNVGIDDPKSIESMVKKQRESGVTLSTFGVGNSN 356 Q K +K +NR++L +DG N GI D +++ + + +G+ +ST GV + + Sbjct: 124 IQGCQHVLKQEVKNYVNRVILLSDGQANAGITDKEALVKLADDYQSAGLVISTLGV-SEH 182 Query: 357 YNEAMMVRIADVGNGNYSYIDTLSEAQKVLNSEMRQMLITVAKDVKAQIEFNPAWVTEYR 416 ++E ++ +AD G GN+ +I+ + + E+ +L + +++ I V Sbjct: 183 FDEELLEGVADSGRGNFHFINEVENIPSIFEQELDGLLNVIGQNITLNI-LPKKGVRITN 241 Query: 417 QIGY 420 GY Sbjct: 242 VFGY 245 >UniRef50_B9GK57 Predicted protein (Fragment) n=2 Tax=Populus trichocarpa RepID=B9GK57_POPTR Length = 595 Score = 229 bits (583), Expect = 3e-58, Method: Composition-based stats. Identities = 96/474 (20%), Positives = 179/474 (37%), Gaps = 65/474 (13%) Query: 127 DVDTGSYANVRRFLNQGLL----PPPDAVRVEEIVNYFPSDWDIKDKQSIPASKPIPFAM 182 DV + NV F G L P V +E ++F D + D S P A+ Sbjct: 56 DVPFQAPKNVPSFQRSGSLHAYVPNASPVHIEP--DHFSDDELVPDVSQGQPSSSRPHAI 113 Query: 183 RYELAP------APWNEQRTLLKVDILAKDRK---SEELPASNLVFLIDTSGSMISDERL 233 + P A + + + V +LA ++V ++D SGSM +L Sbjct: 114 TVKTLPEYPAVSASESFSKFGVLVRVLAPPLDNTLPHHRAPIDIVNVLDVSGSMAG--KL 171 Query: 234 PLIQSSLKLLVKELREQDNIAIVTYAGDSRIALP--SISGSHKAEINAAIDSLDAEGSTN 291 L++ ++ +++ L D ++IVT++ +R LP ++SGS + + + ++SL A G TN Sbjct: 172 ILLKRAVNFIIQNLGPSDRLSIVTFSSSARRILPLRTMSGSGREDAISVVNSLSATGGTN 231 Query: 292 GGAGLELAYQQATKGFIKGGINRILLATDGDFNVGIDDPKSIESMV----------KKQR 341 AGL + + + I+L +DG +E + ++ R Sbjct: 232 IVAGLRKGVRVLEERRQHNSVASIILLSDGCDTQSHSTHNRLEYLKLIFPSNNASGEESR 291 Query: 342 ESGVTLSTFGVGNSNYNEAMMVRIADVGNGNYSYIDTLSEAQKVLNSEMRQMLITVAKDV 401 + + TFG G +++ A M I+DV G +S+I+++ Q + + VA+DV Sbjct: 292 QPTFPIHTFGFGL-DHDSAAMHAISDVSGGTFSFIESIDILQDAFARCIGGLTSIVARDV 350 Query: 402 KAQIEFNPAWVTEYRQIGYEKRQLRVEHFNNDNVDAGDIGAGKHITLLFELTLN--GQKA 459 + ++ V + + + +D GD+ A + L L++ Sbjct: 351 QLKVRSASPGVQILSTPSGRHKNKIFDQGHQATIDIGDLYAEEEKEFLVFLSIPVFPAVD 410 Query: 460 SIDKLRYAPDNKLAKSDKTK--------ELAWLKIR-------------WKYPQGKESQL 498 + L P ++ K E ++IR + + + L Sbjct: 411 GEEMLENMPLVDVSGFQKDSVSTDTVEVEGERVEIRRPQFLSSTDWVPCLEVDRQRNRLL 470 Query: 499 VEFPLGPTINAPSEDMRFRAAVAAYGQKLRGSEYLNNTSWQQIKQWAQQAKGED 552 V + T A G L+G++ L + A G+D Sbjct: 471 VTETIAKTQRM-----------AEMGD-LKGAQALLAEQLSTLLSTASAQAGDD 512 >UniRef50_A5B5Z1 Putative uncharacterized protein n=2 Tax=Vitis vinifera RepID=A5B5Z1_VITVI Length = 686 Score = 228 bits (580), Expect = 6e-58, Method: Composition-based stats. Identities = 73/342 (21%), Positives = 143/342 (41%), Gaps = 48/342 (14%) Query: 194 QRTLLKVDILAK----DRKSEELPASNLVFLIDTSGSMISDERLPLIQSSLKLLVKELRE 249 + + V I A D + +LV ++D SGSM + +L L++ ++ L++ L Sbjct: 178 RTFAVLVGIKAPALLDDAHLLDRAPIDLVAVLDVSGSM-AGSKLSLLKRAVCFLIQNLGP 236 Query: 250 QDNIAIVTYAGDSRIALP--SISGSHKAEINAAIDSLDAEGSTNGGAGLELAYQQATKGF 307 D ++IV+++ +R P +S + + AI+SL + G TN GL+ + + Sbjct: 237 SDRLSIVSFSSTARRIFPLRRMSDNGREAAGLAINSLXSSGGTNIVEGLKKGVRVLEERS 296 Query: 308 IKGGINRILLATDGDFNVGID-------------DPKSI--------ESMVKKQRESG-- 344 + + I+L +DG D +P+ + S+ + RESG Sbjct: 297 EQNPVASIILLSDGKDTYNCDNVNRRQTSHCASSNPRQVLEYLNLLPASICPRNRESGDE 356 Query: 345 -----VTLSTFGVGNSNYNEAMMVRIADVGNGNYSYIDTLSEAQKVLNSEMRQMLITVAK 399 + + TFG G S+++ M I+D G +S+I++++ Q + +L VA+ Sbjct: 357 GRQAIIPVHTFGFG-SDHDSTAMHAISDESGGTFSFIESVAXVQDAFAMCIGGLLSVVAQ 415 Query: 400 DVKAQIE-FNPAWVTEYRQIGYEKRQLRVEHFNNDNVDAGDIGAGKHITLLFELTLNGQK 458 +++ ++ +P E G ++ + +D GD+ A + L LT+ Sbjct: 416 ELRLTVKSVSPGVHIESIPSGKYLSEI-CDQGQQGVIDVGDLYAEEGKEFLIYLTVPELS 474 Query: 459 ASIDKLRYAPDNKLAKSDKTKELAWLKIRWKYPQGKESQLVE 500 ++ + R L + +K KE VE Sbjct: 475 SAEGEERVKRTT----------LLDVMCSYKDSVSKEVVQVE 506 >UniRef50_Q7G2L9 Os10g0464500 protein n=3 Tax=Magnoliophyta RepID=Q7G2L9_ORYSJ Length = 719 Score = 228 bits (580), Expect = 6e-58, Method: Composition-based stats. Identities = 64/327 (19%), Positives = 125/327 (38%), Gaps = 30/327 (9%) Query: 197 LLKVDILAKDRKSEELPASNLVFLIDTSGSMISDERLPLIQSSLKLLVKELREQDNIAIV 256 L+ + + +LV ++D S SM + +L L++ ++ +++ L D +++V Sbjct: 242 LIHLKAPSSPATVTSRAPIDLVTVLDVSWSM-AGTKLALLKRAMSFVIQALGPGDRLSVV 300 Query: 257 TYAGDSRIALP--SISGSHKAEINAAIDSLDAEGSTNGGAGLELAYQQATKGFIKGGINR 314 T++ +R P ++ S + + SL A+G TN L A + + + Sbjct: 301 TFSSSARRLFPLRKMTESGRQRALQRVSSLVADGGTNIADALRKAARVMEDRRERNPVCS 360 Query: 315 ILLATDGDFNVGIDDPKSIESMVKK---------------QRESGVTLSTFGVGNSNYNE 359 I+L +DG + P+ + V + FG G ++++ Sbjct: 361 IVLLSDGRDTYTVPVPRGGGGGGDQPDYAVLVPSSLLPGGGSARHVQVHAFGFG-ADHDS 419 Query: 360 AMMVRIADVGNGNYSYIDTLSEAQKVLNSEMRQMLITVAKDVKAQIEFNPAWV--TEYRQ 417 M IA++ G +S+ID Q + +L VA++++ +E V T R Sbjct: 420 PAMHSIAEMSGGTFSFIDAAGSIQDAFAQCIGGLLSVVAQELRLSVECGDDGVLLTSVRS 479 Query: 418 IGYEKRQLRVEHFNNDNVDAGDIGAGKHITLLFELTLNGQKASIDKLRYAPDNKLAKSDK 477 GY VD GD+ A + L + + A+ + +S Sbjct: 480 GGYASHV--DGDGRGGFVDVGDLYADEERDFLVTVRVP---AARGVSALITPSCTYRSTA 534 Query: 478 TKELAWLKIRWKYPQGKESQLVEFPLG 504 T E +R + V+ P+G Sbjct: 535 TME----TVRVGGDTVTVPRTVDAPVG 557 >UniRef50_Q8H924 Os10g0464900 protein n=3 Tax=Oryza sativa RepID=Q8H924_ORYSJ Length = 646 Score = 228 bits (580), Expect = 7e-58, Method: Composition-based stats. Identities = 60/337 (17%), Positives = 124/337 (36%), Gaps = 27/337 (8%) Query: 214 ASNLVFLIDTSGSMISDERLPLIQSSLKLLVKELREQDNIAIVTYAGDSRIA--LPSISG 271 +LV ++D SGSM+ +L L++ ++ ++ L D + +++++ + L ++ Sbjct: 173 PLDLVTVLDVSGSMVG-NKLALLKQAMGFVIDNLGPGDRLCVISFSSGASRLMRLSRMTD 231 Query: 272 SHKAEINAAIDSLDAEGSTNGGAGLELAYQQATKGFIKGGINRILLATDGDFNVGIDD-- 329 + KA A+ SL A G TN GA L A + + + ++L +DG + Sbjct: 232 AGKAHAKRAVGSLSARGGTNIGAALRKAAKVLDDRLYRNAVESVILLSDGQDTYTVPPRG 291 Query: 330 -------------PKSIESMVKKQRESGVTLSTFGVGNSNYNEAMMVRIADVGNGNYSYI 376 P + + + TFG G +++ A M IA+V G +S+I Sbjct: 292 GYDRDANYDALVPPSLVRADAGGGGGRAPPVHTFGFGK-DHDAAAMHTIAEVTGGTFSFI 350 Query: 377 DTLSEAQKVLNSEMRQMLITVAKDVKAQIEFNPAWVTEYRQIGYEKRQLRVEHFNNDNVD 436 + + Q + +L ++++ + V + + VD Sbjct: 351 ENEAAIQDGFAQCIGGLLSVAVQELRLDVACVDTGVRVTAVKSGRYKSHIEDDGRAAKVD 410 Query: 437 AGDIGAGKHITLLFELTLNGQKASIDKLR-------YAPDNKLAKSDKTKELAWLKIRW- 488 G++ A + + L + + A D Y + + + +R Sbjct: 411 VGELYADEERSFLLFVVVPRAPAWDDVTHLIEVSCSYRDMETGRTTSVAGDEEAVVLRPS 470 Query: 489 KYPQGKESQLVEFPLGPTINAPSEDMRFRAAVAAYGQ 525 + G + VE +D+ A A G+ Sbjct: 471 RAESGVAERSVEVDRELVRVEAIDDIALARAAAERGE 507 >UniRef50_A9GUK8 Putative uncharacterized protein yfbK n=1 Tax=Sorangium cellulosum 'So ce 56' RepID=A9GUK8_SORC5 Length = 521 Score = 227 bits (579), Expect = 9e-58, Method: Composition-based stats. Identities = 92/410 (22%), Positives = 168/410 (40%), Gaps = 42/410 (10%) Query: 133 YANVRRFLNQGLLPPPDAVRVEEIVNYFPSDWDIKDKQSIPASKPIPFAMRYELAPAPWN 192 + R+ L G +P PD + D+ + M ++ +P Sbjct: 52 FGLFRQILEDGEIPGPDTLDDVGFFAEHKLDYPAATCGEDVCMHGLLGIMGNMISGSPC- 110 Query: 193 EQRTLLKVDILAK-DRKSEELPASNLVFLIDTSGSMISDERLPLIQSSLKLLVKELREQD 251 TL+++ + + D + E P +LV +DTSGSM D + +++ L ++ L+ D Sbjct: 111 ---TLIQIGMNSPVDLGALERPPLHLVIAVDTSGSMEGD-PIAYVRAGLVEMIDALQPTD 166 Query: 252 NIAIVTYAGDSRIALPSISGSHKAEINAAIDSLDAEGSTNGGAGLELAYQQATKGFIKGG 311 I++V Y+ + + L GS + + A + L A GSTN GL AY A + Sbjct: 167 RISLVRYSDAAEVVLEQAEGSDREALTEAFEGLTARGSTNLYEGLFTAYALAEQHLDPAW 226 Query: 312 INRILLATDGDFNVGIDDPKSIESMVKKQRESGVTLSTFGVGNSNYNEAMMVRIADVGNG 371 NR++ +DG G+ P+ + S+ E G+ L+ GVG + ++ M I++VG G Sbjct: 227 QNRVIFLSDGVATAGLTSPQRLVSLAAGYAEKGIGLTAIGVG-AEFDVDAMRGISEVGAG 285 Query: 372 NYSYIDTLSEAQKVLNSEMRQMLITVAKDVKAQIEFNPAWVTEYRQIGYEKRQLRVEHFN 431 N+ +++ ++V E++ L+ +A DV+ + +V + Sbjct: 286 NFYFLEDPKAVEEVFAEEVKTFLVPLALDVELDVAVGDGYVV-------------RGAYG 332 Query: 432 NDNVDAGDIGAGKHITLLFELTLNGQKASIDK-------------LRYAPDNKLAKSDKT 478 + G+ G HI LF L G+ ++ + L P + Sbjct: 333 TNGWQGGERGGAVHIPSLF---LAGRTSAAEPVGSGRRGGGGAILLELVPKPDQRGVEDP 389 Query: 479 KELAWLKIRWKYPQGKESQL----VEFPLGPTINAPSEDMRFRAAVAAYG 524 + + L + W++P E+ +E P P +AP E F G Sbjct: 390 RAVGSLALSWRHPLTGEAHAQEVDIEAPSAP--DAPPEAGYFSGDTVEKG 437 >UniRef50_A9V370 Predicted protein n=1 Tax=Monosiga brevicollis RepID=A9V370_MONBE Length = 471 Score = 226 bits (575), Expect = 3e-57, Method: Composition-based stats. Identities = 84/388 (21%), Positives = 163/388 (42%), Gaps = 15/388 (3%) Query: 171 SIPASKPIPFAMRYELAPAPWNEQRTLLKVDILAKDRKSEELPASNLVFLIDTSGSMISD 230 PA + ++ A E + + A D + E PA +LV +ID SGSM + Sbjct: 14 EAPAPLQLDVRPLWQYAEIGARESSAYISCRLTAPDFEPVERPAIDLVAVIDVSGSM-AG 72 Query: 231 ERLPLIQSSLKLLVKELREQDNIAIVTYAGDSRIA--LPSISGSHKAEINAAIDSLDAEG 288 ++L ++QS+L+ L++ L++ D A+VT+ D + L ++ +HK A + L A Sbjct: 73 QKLKMVQSTLEFLMRNLKDTDRFALVTFDSDVKTVFDLRPMTTAHKEACLADVQKLRAGS 132 Query: 289 STNGGAGLELAYQ-QATKGFIKGGINRILLATDGDFNVGIDDPKSIESMVKKQR--ESGV 345 TN GL + +G KG ++ ILL TDG N G+ D + ++ Sbjct: 133 CTNLSGGLFRGVELMQQRGATKGAVSSILLMTDGIANEGVRDKDDMCRALRGLMGPAPDY 192 Query: 346 TLSTFGVGNSNYNEAMMVRIADVGNGNYSYIDTLSEAQKVLNSEMRQMLITVAKDVKAQI 405 T+ TFG G ++NE M+ ++++ GNG Y +I++ + + +L A++++ ++ Sbjct: 193 TIYTFGYGK-DHNENMLRQLSETGNGMYYFIESNDIIPESFGDCLGGLLSVFAQNIEVKL 251 Query: 406 EFNPAWVTEYRQIGYEKRQLRVEHFNNDNVDAGDIGAGKHITLLFELTLNGQKASIDKLR 465 T +++ ++ + GDI + + +LFE+ L + Sbjct: 252 SAVHPEAT-IKRVCMQRAATLADDRRTATFTIGDIQSEEVKEVLFEVNLPLVELHDAVAE 310 Query: 466 YAPDNKLAKSD-KTKELAWLKIRWKYPQGKESQLVEFPLGPTINAPSEDMRFRAA--VAA 522 ++D L+ ++ E P + +E + A VA Sbjct: 311 AGTATAYFRADVSYLNLSSSSFEKHAATFSTARPAEVP---AVREANEAVTDAIARFVAV 367 Query: 523 YGQKLRGSEYLNNTSWQQIKQWAQQAKG 550 G + + +++ Q +G Sbjct: 368 DGME-EARRLAEAGKFDAVRERLQDVRG 394 >UniRef50_Q7ULL3 Putative uncharacterized protein n=1 Tax=Rhodopirellula baltica RepID=Q7ULL3_RHOBA Length = 484 Score = 225 bits (573), Expect = 4e-57, Method: Composition-based stats. Identities = 69/321 (21%), Positives = 144/321 (44%), Gaps = 24/321 (7%) Query: 191 WNEQRTLLKVDILAKD-RKSEELPASNLVFLIDTSGSMISDERLPLIQSSLKLLVKELRE 249 +Q L++ + + + +EE P N+ ++D SGSM S ++L + + + + L + Sbjct: 60 GEKQTNHLRIALTGFELKSAEERPPVNVCLVLDHSGSM-SGQKLARAKEAAEAAIDRLSD 118 Query: 250 QDNIAIVTYAGDSRIALPSISGSHKAEINAAIDSLDAEGSTNGGAGLELAYQQATKGFIK 309 D +++V Y + + +P+ + ++ I I + A ST AG+ + K Sbjct: 119 DDIVSVVLYDSNVTVLVPATKATDRSSIKQKIRGIQAGSSTALFAGVSKGAAEVRKFLAD 178 Query: 310 GGINRILLATDGDFNVGIDDPKSIESMVKKQRESGVTLSTFGVGNSNYNEAMMVRIADVG 369 +NR++L +DG NVG P+ +E + + + +++ST G+G+ YNE +MV +A VG Sbjct: 179 EQVNRVILLSDGLANVGPKSPQELEGLGRSLMKEAISVSTLGLGSG-YNEDLMVALASVG 237 Query: 370 NGNYSYIDTLSEAQKVLNSEMRQMLITVAKDVKAQIEFNPAWVTEYRQIGYEKRQLRVEH 429 GN+++I+ V N E +L VA + + ++ + + V R IG E Sbjct: 238 GGNHAFIEDADSLVSVFNQEFDGLLSVVANEFEIVVKLDES-VRPVRMIGSEGDI----E 292 Query: 430 FNNDNVDAGDIGAGKHITLLFELTLNGQKASIDKLRYAPDNKLAKSDKTKELAWLKIRWK 489 + + A + + E ++ D T++LA + ++++ Sbjct: 293 GQTIRIPLAQLYANQERYFIVETEVSPG----------------TEDSTRDLAEVTVQYR 336 Query: 490 YPQGKESQLVEFPLGPTINAP 510 Q + + + + + Sbjct: 337 NLQTETKEKLTSSIQVRFSDE 357 >UniRef50_C5WYV0 Putative uncharacterized protein Sb01g047470 n=3 Tax=Sorghum bicolor RepID=C5WYV0_SORBI Length = 686 Score = 221 bits (564), Expect = 5e-56, Method: Composition-based stats. Identities = 70/327 (21%), Positives = 132/327 (40%), Gaps = 38/327 (11%) Query: 204 AKDRKSEELPASNLVFLIDTSGSMISDERLPLIQSSLKLLVKELREQDNIAIVTYAGDSR 263 A + +LV ++D SGSM D +L L++ ++ ++ L D +++V+++ +R Sbjct: 155 AAGDRDAPRAPLDLVTVLDVSGSMRWD-KLALVKQAMGFVIGSLGPHDRLSVVSFSSGAR 213 Query: 264 IA--LPSISGSHKAEINAAIDSLDAEGSTNGGAGLELAYQQATKGFIKGGINRILLATDG 321 L +S + K+ A++SL A G TN GL A + + + ++ ++L +DG Sbjct: 214 RVTRLLRMSHTGKSLATEAVESLRAGGGTNIAEGLRTAAKVLGERRHRNAVSSVILLSDG 273 Query: 322 DFNVGIDD--------------PKSIESMVKKQRESGVTLSTFGVGNSNYNEAMMVRIAD 367 N + P S E + TFG GN +++ A M +A+ Sbjct: 274 HDNYSMPRRARGGVPPNYEVLVPPSFVPGTASTGEGSAPIHTFGFGN-DHDAAAMHVVAE 332 Query: 368 VGNGNYSYIDTLSEAQKVLNSEMRQMLITVAKDVKAQIEFNPAWV--TEYRQIGYEKRQL 425 G +S+I+ + Q + +L VA++ + I V + + YE R Sbjct: 333 ATGGTFSFIENEAVIQDAFAQCIGGLLTVVAQEARVAIACGHPGVRISSVKSGRYESRV- 391 Query: 426 RVEHFNNDNVDAGDIGAGKHITLLFELTLNGQKASIDKLRYAPDNKLAKSDKTKELAWLK 485 E + ++ G++ A + L LT+ +A+ + LK Sbjct: 392 -DEDGRSASIAVGELYADEERRFLLFLTVPPVEAT----------------DGDDTLLLK 434 Query: 486 IRWKYPQGKESQLVEFPLGPTINAPSE 512 R Y + V+ T+ A E Sbjct: 435 ARCSYREAAGGTHVDVTAEDTVVARPE 461 >UniRef50_C1D2W8 Putative uncharacterized protein n=1 Tax=Deinococcus deserti VCD115 RepID=C1D2W8_DEIDV Length = 418 Score = 219 bits (557), Expect = 3e-55, Method: Composition-based stats. Identities = 73/311 (23%), Positives = 136/311 (43%), Gaps = 7/311 (2%) Query: 182 MRYELAPAPWNEQRTLLKVDILAKDRKSEELPASNLVFLIDTSGSMISDERLPLIQSSLK 241 +R L L++V + + P NL F+ID SGSM S L + + + Sbjct: 11 LRAGLTAGQTTTLTLLIRVHPAPVTTQVSQRPPLNLAFVIDRSGSM-SGLPLQMAKQAAI 69 Query: 242 LLVKELREQDNIAIVTYAGDSRIALPSISGSHKAEINAAIDSLDAEGSTNGGAGLELAYQ 301 V++ R D +++V + + +PS + + + AI ++D GSTN G Sbjct: 70 AAVRQARPDDRVSVVAFDDRVDVIVPSQLATSREAVIQAIGTIDDRGSTNLHGGWLEGAT 129 Query: 302 QATKGFIKGGINRILLATDGDFNVGIDDPKSIESMVKKQRESGVTLSTFGVGNSNYNEAM 361 Q + G +NR++L +DG NVG+ D + I V+ E G++ +T G+G S+Y+E + Sbjct: 130 QVAQHLTPGALNRVILLSDGQANVGVTDRREIARQVRGLTERGISTTTIGLG-SHYDEEL 188 Query: 362 MVRIADVGNGNYSYIDTLSEAQKVLNSEMRQMLITVAKDVKAQIEFNPAWVTEYRQIGYE 421 ++ IA+ G+GN+ +++ S E++ + T + V +E NP + Sbjct: 189 LLAIANAGDGNFEHVEDPSRLPTFFEEELQGLTRTTGRIVSLGLEPNPEHGVLVSDV--- 245 Query: 422 KRQLRVEHFNNDNVDAGDIGAGKHITLLFELTLNGQKASIDKLRYAPDNKLAKSDKTKEL 481 F + ++ G+ + +L LT+ Q + +LA + Sbjct: 246 LNDFERNSF--GRLQLPNLVGGQPVDVLATLTVPPQPQRGGQTVGVTRVRLAWTGTDGAR 303 Query: 482 AWLKIRWKYPQ 492 L+ + P Sbjct: 304 RKLRAQLDLPV 314 >UniRef50_Q9U7P4 Putative uncharacterized protein (Fragment) n=1 Tax=Eufolliculina uhligi RepID=Q9U7P4_9CILI Length = 494 Score = 219 bits (557), Expect = 3e-55, Method: Composition-based stats. Identities = 63/348 (18%), Positives = 140/348 (40%), Gaps = 26/348 (7%) Query: 167 KDKQSIPASKPIPFAMRYELAPAPWNEQRTLLKVDILAKDRKSEE-LPASNLVFLIDTSG 225 ++ + FA Y L +P Q +++ + + SE ++V +ID SG Sbjct: 41 PTVGAVDIAAYGVFAFNY-LQLSPEKAQEIPCTINLESPAQTSEASRSGVDIVCVIDVSG 99 Query: 226 SMISDERLPLIQSSLKLLVKELREQDNIAIVTYAGDSRIA--LPSISGSHKAEINAAIDS 283 SM E++ L+Q++L +V+ L D I +++++ D+ L +S K ++ + I Sbjct: 100 SM-QGEKIQLVQTTLNFMVERLSPADRICLISFSNDATKISRLVQMSPKGKKQLKSMIPR 158 Query: 284 LDAEGSTNGGAGLELAYQQATKGFIKGGINRILLATDGDFNVGIDDPKSIESMVKKQ-RE 342 L A G TN GLE Q + ++ I+L +DG N G + ++ + Sbjct: 159 LVASGGTNIVGGLEYGLQALRQRRTINQLSSIILLSDGQDNNGTTVLQRAKATMDSIVIR 218 Query: 343 SGVTLSTFGVGNSNYNEAMMVRIADVGNGNYSYIDTLSEAQKVLNSEMRQMLITVAKDVK 402 ++ TFG G+ ++ ++ +A+ NG + Y+ + + +++ VA ++ Sbjct: 219 DDYSVHTFGYGHG-HDSTLLNALAEPKNGAFYYVKDEETIATAFANCLGELMSVVADQIE 277 Query: 403 AQIEFNPAWVTEYRQIGYEKRQLRVEHFNNDNVDAGDIGAGKHITLLFELTLNGQKASID 462 ++ P I + ++ + + +G +F + Sbjct: 278 VKLMTQPTE------IPFSLSKVYSNSGDTVFT-LPPLMSGDKKEAVFLVQ--------- 321 Query: 463 KLRYAPDNKLAKSDKTKELAWLKIRWKYPQGKESQLVEFPLGPTINAP 510 + P + +S + K++++ E+P+ + P Sbjct: 322 ---FDPTTQRVESGHRIQPICFKLKYRIVSNGNIVEQEYPIWLRVENP 366 >UniRef50_Q235T9 von Willebrand factor type A domain containing protein n=5 Tax=Tetrahymena thermophila RepID=Q235T9_TETTH Length = 703 Score = 218 bits (556), Expect = 3e-55, Method: Composition-based stats. Identities = 72/368 (19%), Positives = 155/368 (42%), Gaps = 28/368 (7%) Query: 201 DILAKDRKSEELPASNLVFLIDTSGSMISDERLPLIQSSLKLLVKELREQDNIAIVTYAG 260 D+ K E P +L+ +ID SGSM ++ +++++ L++ L E D ++++T+ Sbjct: 197 DMEVKSNPLEGRPNLDLICVIDNSGSMNDFSKIENVKNTILQLLEMLNENDRLSLITFNT 256 Query: 261 DSRIA--LPSISGSHKAEINAAIDSLDAEGSTNGGAGLELAYQQATKGFIKGGINRILLA 318 ++ L +++ +K + S+ A+G T+ G+E+A+Q K ++ I L Sbjct: 257 KAKQLCGLKNVNNQNKKSLQTITKSIKADGGTDIIRGIEIAFQILQSRKQKNSVSSIFLL 316 Query: 319 TDGDFNVGIDDPKSIE-SMVKKQRESGVTLSTFGVGNSNYNEAMMVRIADVGNGNYSYID 377 +DG N+ K++ + K+ +E T+ +FG GN +++ +M +IA + +G++ +++ Sbjct: 317 SDGQDNLADAGIKNLLKTTYKQLQEESFTIHSFGFGN-DHDGPLMQKIAQIKDGSFYFVE 375 Query: 378 TLSEAQKVLNSEMRQMLITVAKDVKAQIE-------FNPAWVTEYRQIGYEKRQLRVEHF 430 + + + + VA+D+ +IE F + Y Y + Sbjct: 376 KNDQVDEFFIDALGGLFSVVAQDLTIKIEINRQNELFQKFFKNSYISKTYGHMWKIINQN 435 Query: 431 NNDNVDAGDIGAGKHITLLFELTLNGQKASIDKLRYAPDNKLAKSDKTKELAWLKIRWKY 490 ++ I +G +FELT+ + L ++ E+ +++ + Sbjct: 436 QELRININQIFSGVSKDFIFELTVPKSE----------IKDLQDFERNLEIINVQLTARP 485 Query: 491 PQGKESQLVEFPLGPTINAPSEDMRFRAAVAAYGQKLRGSEYLNNTSWQQIKQWAQQAKG 550 L E L T+ +E VA + E+ N + + + K Sbjct: 486 VDSMLQTLKESKLVLTLFTDNEQ------VAQDSEINDKVEF-NYIRVKAAQAIEEAIKY 538 Query: 551 EDPQGYRA 558 D Y Sbjct: 539 ADQNQYNQ 546 >UniRef50_UPI0001926ED6 PREDICTED: similar to inter-alpha trypsin inhibitor, heavy chain 3, partial n=1 Tax=Hydra magnipapillata RepID=UPI0001926ED6 Length = 464 Score = 218 bits (555), Expect = 5e-55, Method: Composition-based stats. Identities = 65/309 (21%), Positives = 135/309 (43%), Gaps = 26/309 (8%) Query: 174 ASKPIPFAMRYELAPAPWNEQRTLL--------KVDILAKDRKSEELPASNLVFLIDTSG 225 K + A E P+ E+ + + + +++ + +LV +ID SG Sbjct: 2 NDKKLTVACSTEYKDYPFKEKLDIWTLISLKAPSLGMTLDEKEHRKRAPIDLVVVIDKSG 61 Query: 226 SMISDERLPLIQSSLKLLVKELREQDNIAIVTYAGDSRIA--LPSISGSHKAEINAAIDS 283 SM + E+L L++ +L+ +V +L E+D + ++T+ + L ++ +K + I Sbjct: 62 SM-AGEKLALVKKTLEFVVSQLNEKDRLCLITFDTSVYLDFKLTPMTPMNKYQTLKIIKD 120 Query: 284 LDAEGSTNGGAGLELAYQQATKG--FIKGGINRILLATDGDFN-VGIDDPKSIESMVKKQ 340 + TN GL + K + +LL TDG N G+ + S K Sbjct: 121 ISPGSMTNLCGGLMKGLCEVIDRADEEKNEVASVLLFTDGFANKGGLTNIYCSSSQTAKY 180 Query: 341 -------RESGVTLSTFGVGNSNYNEAMMVRIADVGNGNYSYIDTLSEAQKVLNSEMRQM 393 + + ++ TFG G SN+N M+ I+D G+G Y YI+ + + + + Sbjct: 181 TIGIVGPKTADASIYTFGFG-SNHNAQMLKEISDAGSGMYYYIENVDMIAEAFGQCLGGL 239 Query: 394 LITVAKDVKAQIEFNPAWVTEYRQIGYEKRQLRVEHFNNDNVDAGDIGAGKHITLLFELT 453 L TVA+ ++ +I +++ Q + ++ ++ GD+ + + ++ EL+ Sbjct: 240 LSTVAQGIQVEIMMENK--VSIKKV--HSNQPTEKQGSSIKINMGDLQSEESRDVVLELS 295 Query: 454 LNGQKASID 462 ++ + D Sbjct: 296 IDSLDSPTD 304 >UniRef50_A9QZI4 von Willebrand factor type A domain protein n=26 Tax=Gammaproteobacteria RepID=A9QZI4_YERPG Length = 472 Score = 216 bits (551), Expect = 1e-54, Method: Composition-based stats. Identities = 77/360 (21%), Positives = 149/360 (41%), Gaps = 29/360 (8%) Query: 192 NEQRTLLKVDILAKDRKSEELPASNLVFLIDTSGSMISDERLPLIQSSLKLLVKELREQD 251 +E + LK+ + + S NL +ID S SM S ER+ + L V L D Sbjct: 73 SEDKNYLKISLTGFNLDSTRRSPINLALVIDRSTSM-SGERIEKAREEAILAVNMLNITD 131 Query: 252 NIAIVTYAGDSRIALPSISGSHKAEINAAIDS-LDAEGSTNGGAGLELAYQQATKGFIKG 310 +++V Y + + +P+ + K + A+I + G T AG+ + Q K + Sbjct: 132 TLSVVAYDNHAEVIIPATKVTDKPALIASIQQHIHPRGMTALFAGVSMGIGQVDKHLNRE 191 Query: 311 GINRILLATDGDFNVGIDDPKSIESMVKKQRESGVTLSTFGVGNSNYNEAMMVRIADVGN 370 +NRI+L +DG N G + + + + G+ ++T G+G +YNE +M IA + Sbjct: 192 QVNRIILISDGQANTGPTSISELSDLARMAAKKGIAITTIGLGQ-DYNEDLMTAIAGYSD 250 Query: 371 GNYSYIDTLSEAQKVLNSEMRQMLITVAKDVKAQIEFNPAWVTEYRQIGYEKRQLRVEHF 430 GN++++ ++ +K E + ++ VA+D+ QI+ V R +G + L Sbjct: 251 GNHTFVANSADLEKAFTKEFQDVMSVVAQDIVVQIKTGDK-VKPVRLLGRDGDIL----G 305 Query: 431 NNDNVDAGDIGAGKHITLLFELTLNGQKASIDKLRYAPDNKLAKSDKTKELAWLKIRWKY 490 N NV + + + +L E+ + + K+LA + I + Sbjct: 306 NTVNVKLNQLYSNQEKYILLEVIP----------------EKGTDKQQKDLADVSISYLN 349 Query: 491 PQGKESQLVEFPLGPTINAPSEDMRFRAAVAAYGQKLRGSE-YLNNTSWQQIKQWAQQAK 549 K+ + + + + E + AV + L SE + + + + Sbjct: 350 LSSKKQDQINERVTVSYSQSVEKVN--DAVQE--EVLAESEIQKTALANDEAIKLIDAGR 405 >UniRef50_A0LPD4 von Willebrand factor, type A n=1 Tax=Syntrophobacter fumaroxidans MPOB RepID=A0LPD4_SYNFM Length = 479 Score = 215 bits (547), Expect = 4e-54, Method: Composition-based stats. Identities = 81/315 (25%), Positives = 136/315 (43%), Gaps = 25/315 (7%) Query: 143 GLLPPPDAVRVEEIVNYFPSDWDIKDKQSIPASKPIPFAMRYELAPAPWNEQRTLLKVDI 202 G PP + VN S I+DK + + A+ E AP Sbjct: 37 GRKPPDPGLARAGTVNL--SGRLIQDKVHMGGDGTVTVALTLECDRAPGGNV-------- 86 Query: 203 LAKDRKSEELPASNLVFLIDTSGSMISDERLPLIQSSLKLLVKELREQDNIAIVTYAGDS 262 E ++V ++D SGSM +L + ++ L+ L E D A+V+Y+ Sbjct: 87 -------EARRELDMVVVMDRSGSMADAGKLTHARQAVLNLLSRLSETDRFALVSYSDHV 139 Query: 263 RI--ALPSISGSHKAEINAAIDSLDAEGSTNGGAGLELAYQQATKGFIKGGINRILLATD 320 + L I+ +++A + + + G+TN G GL+ Q + G ++R++L +D Sbjct: 140 QRHGGLLPITPANRATLERIVRGIQPGGATNLGGGLQEGISQLAELQQNGRLSRLILISD 199 Query: 321 GDFNVGIDDPKSIESMVKKQRESGVTLSTFGVGNSNYNEAMMVRIADVGNGNYSYIDTLS 380 G N G+ DP ++ +M E G +ST GVG ++NE +M IAD G GNY+++++ S Sbjct: 200 GLANRGVTDPSALGTMASVAAERGYAVSTVGVGL-DFNEHLMTSIADKGAGNYTFMESAS 258 Query: 381 EAQKVLNSEMRQMLITVAKDVKAQIEFNPAWVTEYRQIGYEKRQLRVEHFNNDNVDAGDI 440 +V + E R VA V+ + +P +T GY GD+ Sbjct: 259 AFAQVFDKEFRDAGTVVASSVEVHVPLSPG-MTLVHAAGYPIEV----GEGRAVFRPGDL 313 Query: 441 GAGKHITLLFELTLN 455 G+ L L + Sbjct: 314 RFGQSRKLFLTLRIP 328 >UniRef50_C5YMJ6 Putative uncharacterized protein Sb07g002215 (Fragment) n=1 Tax=Sorghum bicolor RepID=C5YMJ6_SORBI Length = 423 Score = 214 bits (546), Expect = 5e-54, Method: Composition-based stats. Identities = 75/336 (22%), Positives = 134/336 (39%), Gaps = 65/336 (19%) Query: 214 ASNLVFLIDTSGSMISDERLPLIQSSLKLLVKELREQDNIAIVTYAGDSRIA--LPSISG 271 +LV ++D SGSM + +++ ++ ++ L+ L D +++V ++ D+R L +S Sbjct: 124 PLDLVTVLDVSGSM-AGKKMERVKRAMGFLIDNLGSDDRLSVVAFSTDARRIIRLTRMSD 182 Query: 272 SHKAEINAAIDSLDAEGSTNGGAGLELAYQQATKGFIKGGINRILLATDGDFNVGIDDPK 331 KA A++SL A GSTN GL++A K + ++L +DG N + Sbjct: 183 DGKAAAKRAVESLAASGSTNIRGGLDVAAMVLDGRRHKNAVASVILLSDGQDNQSMHHEY 242 Query: 332 SIESMVKK------------------QRESG-----VTLSTFGVGNSNYNEAMMVRIADV 368 S V K QR +G VT+ TFG G +++ A M I++V Sbjct: 243 LPTSWVPKHSPAFSKGGYDVLVPPSFQRTAGGDHRCVTVHTFGFGI-DHDAAAMHYISEV 301 Query: 369 GNGNYSYIDTLSEAQKVLNSEMRQMLITVAKDVKAQIEFNPAWVTEYRQIGYEKRQLRVE 428 +S+I+ + Q + +L + + +E + +IG+ R + Sbjct: 302 TGSTFSFIENHAVIQDAFARCIGGLLSVAVQKARISLECGASTPAYESRIGWGGRAV--- 358 Query: 429 HFNNDNVDAGDIGAGKHITLLFELTLNGQKASIDKLRYAPDNKLAKSDKTKELAW--LKI 486 VD G++ A + L + + ELA K+ Sbjct: 359 -----TVDVGELYADEERRFLLFVAVPRAH------------------SMDELATRLFKV 395 Query: 487 RWKYPQGKESQLVEFPLGPTINAPSEDMR--FRAAV 520 R Y ++ P G +++ ED R AV Sbjct: 396 RCTY--------LDTPTGQSVDVAGEDARPDLAFAV 423 >UniRef50_Q6ZFR4 Zinc finger (C3HC4-type RING finger) protein family-like n=8 Tax=Oryza sativa RepID=Q6ZFR4_ORYSJ Length = 703 Score = 214 bits (546), Expect = 6e-54, Method: Composition-based stats. Identities = 62/313 (19%), Positives = 127/313 (40%), Gaps = 20/313 (6%) Query: 214 ASNLVFLIDTSGSMISDERLPLIQSSLKLLVKELREQDNIAIVTYAGDSRIA--LPSISG 271 + +LV ++D SGSM +L L++ ++ L L D +A+V+++ +R L +S Sbjct: 268 SVDLVTVLDVSGSMEG-YKLALLKRAMGL----LGPGDRLAVVSFSYSARRVIRLTRMSE 322 Query: 272 SHKAEINAAIDSLDAEGSTNGGAGLELAYQQATKGFIKGGINRILLATDGDFNV------ 325 KA +A++SL A+G TN GL A + + + ++L +DG N Sbjct: 323 GGKASAKSAVESLHADGCTNILEGLVEAAKVFDGRRYRNAVASVILLSDGQDNYNVNGGW 382 Query: 326 GIDDPKSIESMV----KKQRESGVTLSTFGVGNSNYNEAMMVRIADVGNGNYSYIDTLSE 381 G + K+ +V K+ + + + TFG G ++++ + M IA+ G +S+I+ + Sbjct: 383 GASNSKNYSVLVPPSFKRSGDRRLPVHTFGFG-TDHDASAMHTIAEETGGTFSFIENQAV 441 Query: 382 AQKVLNSEMRQMLITVAKDVKAQIEFNPAWVTEYRQIGYEKRQLRVEHFNNDNVDAGDIG 441 Q + +L ++ + I A V + +VD G++ Sbjct: 442 VQDAFAQCIGGLLSVPVQEARIAITCPHAAVRVRSVNSGRYDSVIDGDGRAASVDVGELY 501 Query: 442 AGKHITLLFELTLNGQKASIDKLRYAPDNKLAKSDKTKELAWLKIRWKYPQGKESQLVEF 501 A + L + + A D + + +++ + + + V Sbjct: 502 ADEERRFLVFVDVPAAGAGEDVTELIKVSCTYRDTASRQQMVVAG--EDAVVQRPAEVST 559 Query: 502 PLGPTINAPSEDM 514 P++ E Sbjct: 560 STEPSMEVERERF 572 >UniRef50_D0LNY0 von Willebrand factor type A n=1 Tax=Haliangium ochraceum DSM 14365 RepID=D0LNY0_HALO1 Length = 808 Score = 213 bits (541), Expect = 2e-53, Method: Composition-based stats. Identities = 89/474 (18%), Positives = 174/474 (36%), Gaps = 56/474 (11%) Query: 119 NPLATFSLD--VDTGSYANVRRFLNQGLLPPPDAVRVEEIVNYFPSDWDIKDKQSIPASK 176 P FS D T + V+ + G P P RV E +NY +D + + Sbjct: 367 QPYFYFSYDDSASTAAVELVKYGVANGERPHPSLARVWEFLNY--ETFDSASYEELGDRF 424 Query: 177 PIPFAMRYELAPAPWNEQRTLLKVDILAKDRKSEELPASNLVFLIDTSGSM--------- 227 + M + LL ++ + EE P + + FL+D SGSM Sbjct: 425 RVSMGMVSRPSLTQDGAVDYLLGANVTVPNLTREERPHAVVTFLVDISGSMAEYSPTVDA 484 Query: 228 -ISDERLPLIQSSLKLLVKELREQDNIAIVTYAGDSRIALPS----ISGSHKAEINAAID 282 + R+ +++ L V L+ D + +V++ ++I L + ++ Sbjct: 485 GGAPTRMDIVREGLWKAVSALKPGDIVNVVSFDDAAQIELERGEIRPGAATPRPYLRSVL 544 Query: 283 SLDAEGSTNGGAGLELAYQQATKGFIKGGINRILLATDGDFNVGIDDPKSIESMVKKQRE 342 L G TN AG+E+AY+ A + + INR+++ TD N G DP I V + Sbjct: 545 RLLPRGGTNLSAGIEVAYRVARRNYDPYRINRVIILTDAYANRGSIDPSLIGDHVLIGDD 604 Query: 343 SGVTLSTFGVGNSNYNEAMMVRIADVGNGNYSYIDTLSEAQKVLNSEMRQMLITVAKDVK 402 G+ S GVG ++NE + + DVG G Y + T +A + +L A+DV+ Sbjct: 605 EGIHFSGLGVG-YDFNEDFLNTLTDVGRGTYFSLITERDAARAFGERFVSLLAVAARDVR 663 Query: 403 AQIEFNPAWVTEYRQIGYEKRQLRVEHFNNDNVDAGDIGAGKHITLLFELTLNGQKASID 462 ++++ + E E + D E+ + Sbjct: 664 FRLDYP---------VEMEHTSSASEELSRDPR---------------EVQPTNFSYNSS 699 Query: 463 KLRYAPDNKLAKSDKTKELAWLKIRWKYPQGKE--SQLVEFPLGPTINAPSEDMRFRAAV 520 + + + L + + P ++++ + + +E++ A+ Sbjct: 700 QYFFETFRADESVEADASRFRLSVSYTDPVTGTGHVRVLDRSVEQLLGRETENIAAAEAI 759 Query: 521 AAYGQKLRGSEYLNNT-SWQQIKQWAQ----QAKGEDPQGYRAEFIRLIELADG 569 ++ + +++++ Q +G Y F +L+ +G Sbjct: 760 HSF------VRFSGEYLTYEEVDARLQSYSESQRGPLFYEYVELFEQLVATING 807 >UniRef50_A6GDG5 Putative lipoprotein n=1 Tax=Plesiocystis pacifica SIR-1 RepID=A6GDG5_9DELT Length = 486 Score = 213 bits (541), Expect = 2e-53, Method: Composition-based stats. Identities = 88/401 (21%), Positives = 173/401 (43%), Gaps = 39/401 (9%) Query: 131 GSYANVRRFLNQGLLPPPDAVRVEEIVNYFPSDWDIKDKQSIPASKPIPFAMRYELAPAP 190 S R G + +R E +NY+ ++ PA+ P ++ +L Sbjct: 82 ASPVMTREVAPYGFYTTFEWIRPWEFLNYYSFEY--------PAADPGDLSVHVDLRSK- 132 Query: 191 WNEQRTLLKVDILAKDRKSEELPASNLVFLIDTSGSMISDERLPLIQSSLKLLVKELREQ 250 +E R L++ + ++ E N+ ++D S SM + ++++ + + LRE Sbjct: 133 -DEGRFQLQIGVASEIVSPSERLPMNITLVLDESTSMTG-APMYAMKATARAIAGSLREG 190 Query: 251 DNIAIVTYAGDSRIALPS--ISGSHKAEINAAIDSLDAEGSTNGGAGLELAYQQATKGFI 308 D I++V+++ + + L S ++GS+ A + ID+++ G T+ AGLE Y A F Sbjct: 191 DVISLVSWSNSNNVRLASHAVAGSNDATLLDTIDAIEPGGGTDLHAGLEQGYALAQANFS 250 Query: 309 KGGINRILLATDGDFNVGIDDPKSIESMVKKQRESGVTLSTFGVGNSN-YNEAMMVRIAD 367 INR++L +DG N+G D + I M + + G+ + GVG+ YN+ +M + D Sbjct: 251 ADRINRVVLVSDGGANLGFTDAELIAQMAELEDGEGIYMVGVGVGDVGRYNDELMDTVTD 310 Query: 368 VGNGNYSYIDTLSEAQKVLNSEMRQMLITVAKDVKAQIEFNPAWVTEYRQIGYEKRQLRV 427 G G +I +EA+++ + A+DV+ ++ P G+E + Sbjct: 311 QGKGASVFIPNEAEAERMFGERFMSTMGVAARDVRVELSLPP---------GFEIVRFSG 361 Query: 428 EHFNNDNVDAGDIGAGKHITLLFELTLNGQKASIDKLRYAPDNKLAKSDKTKELAWLKIR 487 E F++D + + ++F L + L + +R Sbjct: 362 EEFSDDPSEIEPQHLAPNDAMVFYQELETCAPEL--------------ATEDALLGVVVR 407 Query: 488 WKYPQGKESQL--VEFPLGPTINAPSEDMRFRAAVAAYGQK 526 W+ P K+++ VE+ + A S + A+ +Y + Sbjct: 408 WREPFSKQARERAVEYAFADLLGAESPMLDKGQAILSYAEV 448 >UniRef50_C0PS55 Putative uncharacterized protein n=1 Tax=Picea sitchensis RepID=C0PS55_PICSI Length = 829 Score = 212 bits (539), Expect = 4e-53, Method: Composition-based stats. Identities = 64/330 (19%), Positives = 132/330 (40%), Gaps = 28/330 (8%) Query: 212 LPASNLVFLIDTSGSMISDERLPLIQSSLKLLVKELREQDNIAIVTYAGDSRIA--LPSI 269 +LV ++D SGSM S +L L++ ++ ++ L +D +++V ++ ++ L + Sbjct: 355 RAPIDLVTVLDVSGSM-SGTKLALLKRAMAFVISNLSPEDRLSVVVFSSTAKRVFSLKRM 413 Query: 270 SGSHKAEINAAIDSLDAEGSTNGGAGLELAYQQATKGFIKGGINRILLATDGDFNVGIDD 329 + + N ++ L G TN GL + + + I+L +DG + Sbjct: 414 TPDGQRAANRVVERLLCTGGTNIAEGLRKGAKVLEDRRQRNPVASIMLLSDGQDTYSLSS 473 Query: 330 PKSIESMVKKQRES----------GVTLSTFGVGNSNYNEAMMVRIADVGNGNYSYIDTL 379 + +QR S + + FG G +++ A M I++V G +S+I Sbjct: 474 RGVVLFPSDEQRRSARQSTRYGHVQIPVHAFGFGV-DHDAATMHAISEVSGGTFSFIQAE 532 Query: 380 SEAQKVLNSEMRQMLITVAKDVKAQI-EFNPAWVTEYRQIGYEKRQLRVEHFNNDNVDAG 438 S Q + +L V +DV+ + + + YE E ++ V+ G Sbjct: 533 SLVQDAFAQCIGGLLSVVVQDVRVTVSACAGTKLKSFHAGSYET--CVAEDGSHGTVNLG 590 Query: 439 DIGAGKHITLLFELTLNGQKASIDKLRYAP------DNKLAKSDKTKELAWLKIRWKYPQ 492 D+ A + +L EL K++ + + D +S ++E ++ +R + Sbjct: 591 DLYAEEERDILVELKFPAVKSASNPMNLISVGCFFKDPVSQRSFHSREQSFSILRPESTD 650 Query: 493 GKESQLVEFPLGPTINAPSEDMRFRAAVAA 522 G + L + +R A+A Sbjct: 651 G-----LPVALNLEVEKERNRLRTAQAIAE 675 >UniRef50_C1XMC3 Uncharacterized protein containing a von Willebrand factor type A (VWA) domain n=1 Tax=Meiothermus ruber DSM 1279 RepID=C1XMC3_MEIRU Length = 412 Score = 211 bits (538), Expect = 5e-53, Method: Composition-based stats. Identities = 74/324 (22%), Positives = 132/324 (40%), Gaps = 18/324 (5%) Query: 167 KDKQSIPASKPIPFAMRYELAPAPWNEQRTLLKVDILAKDRKSEELPASNLVFLIDTSGS 226 D ++ ++ Q+ LL++ + E P NL ++D SGS Sbjct: 3 PDSSPNARPHLDLIPLKPGVSATRPTRQQVLLRIHTPTPQARP-ERPLLNLALVLDRSGS 61 Query: 227 MISDERLPLIQSSLKLLVKELREQDNIAIVTYAGDSRIALPSISGSH-KAEINAAIDSLD 285 M +L + + V L +D +A+V Y + +PS + +A I I ++ Sbjct: 62 M-GGSKLKYTKEAAIYAVHNLLPEDRVAVVIYDDAVEVLVPSTPVADGRAAIANLIRTIR 120 Query: 286 AEGSTNGGAGLELAYQQATKGFIKGGINRILLATDGDFNVGIDDPKSIESMVKKQRESGV 345 GST AG Q G +NR++L +DG N G +P I V++ GV Sbjct: 121 TGGSTALHAGWLEGATQVAAYQEAGRLNRVVLLSDGLANRGETNPGVIAEQVRELARRGV 180 Query: 346 TLSTFGVGNSNYNEAMMVRIADVGNGNYSYIDTLSEAQKVLNSEMRQMLITVAKDVKAQI 405 + ST GVG +YNE +M +AD G GNY +I++ ++ ++ E+ + T+ V+ + Sbjct: 181 STSTLGVGL-DYNEDLMTTMADAGEGNYYFIESPADLPRIFAQELAGLAGTLGTRVRLWL 239 Query: 406 EFNPAWVTEYRQIGYEKRQLRVEHFNNDNVDA---GDIGAGKHITLLFELTLNGQKASID 462 G R D A ++ AG + L EL + + Sbjct: 240 RP-----------GDGSRAWLFNDLEQDPSGAYVLPNLVAGIPLEFLLELEAPAGREASL 288 Query: 463 KLRYAPDNKLAKSDKTKELAWLKI 486 +L + + ++ + + L + Sbjct: 289 RLELDWETPEGQRERLEAVLRLPV 312 >UniRef50_C5WZE3 Putative uncharacterized protein Sb01g019910 n=1 Tax=Sorghum bicolor RepID=C5WZE3_SORBI Length = 704 Score = 211 bits (537), Expect = 6e-53, Method: Composition-based stats. Identities = 57/303 (18%), Positives = 120/303 (39%), Gaps = 22/303 (7%) Query: 212 LPASNLVFLIDTSGSMISDERLPLIQSSLKLLVKELREQDNIAIVTYAGDSRIALP--SI 269 +LV ++D S SM +L L++ +++ +++ L D +++V ++ + P + Sbjct: 231 RAPLDLVTVLDVSRSMSGP-KLALLKRAMRFVIENLEPSDRLSVVAFSSSACRLFPLRKM 289 Query: 270 SGSHKAEINAAIDSLDAEGSTNGGAGLELAYQQATKGFIKGGINRILLATDGDFNV---- 325 + + + A+DSL A+G TN GL A + + + I+L +DG + Sbjct: 290 TAFGQQQSQQAVDSLVADGGTNIAEGLRKAARVVEDRQARNPVCSIILLSDGVDSHNLPP 349 Query: 326 --GIDDPKSIESMVKKQR----ESGVTLSTFGVG---NSNYNEAMMVRIADVGNGNYSYI 376 G +V + E V + FG+G + +++ M +A + +G +S+I Sbjct: 350 RDGSAPEPDYAPLVPRSILPGSEHHVPIHAFGLGMDHDHDHDSRAMHAVAQMSSGTFSFI 409 Query: 377 DTL-SEAQKVLNSEMRQML--ITVAKDVKAQIEFNPAWVTEYRQIGYEKRQLRVEHFNND 433 D + S Q L + +L VA++ + +E V Sbjct: 410 DMVGSSIQDALAQCIGGLLSVSVVAQETRLSVECADQGVLLTSIKSGSYASGVDGDGRGG 469 Query: 434 NVDAGDIGAGKHITLLFELTLNGQKASIDKLRYAPDNKLAKSDKTKELA---WLKIRWKY 490 V G + A + L + + + S +R A + + + + +R ++ Sbjct: 470 FVHVGRLYADEERDFLVTVRVPPSRVSTALVRPLCTYHDAVTAEMVRVGGDPVMLLRPEF 529 Query: 491 PQG 493 P Sbjct: 530 PVS 532 >UniRef50_Q6ZED8 Slr7060 protein n=2 Tax=Chroococcales RepID=Q6ZED8_SYNY3 Length = 588 Score = 211 bits (536), Expect = 8e-53, Method: Composition-based stats. Identities = 67/363 (18%), Positives = 137/363 (37%), Gaps = 18/363 (4%) Query: 187 APAPWNEQRTLLKVDILAKDRKSEE--LPASNLVFLIDTSGSMISDERLPLIQSSLKLLV 244 A L + I + + P+ NL F+ID SGSM ++ + ++ + Sbjct: 14 AVCSERAVTLDLIIRITPPSPPAMDQPRPSLNLGFVIDRSGSMEGHNKITYARQAVCYAI 73 Query: 245 KELREQDNIAIVTYAGDSRIALPSISGSHKAEINAAIDSLDAEGSTNGGAGLELAYQQAT 304 +L D++++ + + +PS KA+ + ++ G T+ G Q + Sbjct: 74 DQLSPGDHLSVTIFDDQVQTLIPSTLVKDKAQFKRLVQGINPGGCTDLHGGWLQGGIQVS 133 Query: 305 KGFIKGGINRILLATDGDFNVGIDDPKSIESMVKKQRESGVTLSTFGVGNSNYNEAMMVR 364 + +NRI+L +DG N G +P I + V + G + +T G+G+ +YNE ++ Sbjct: 134 QNL-SAELNRIILLSDGLANRGETNPDIIATDVHGLAQRGASTTTLGLGD-DYNEDLLEA 191 Query: 365 IADVGNGNYSYIDTLSEAQKVLNSEMRQMLITVAKDVKAQIEFNPAWVTEYRQIGYEKRQ 424 +A G+GNY Y+ + + E++ + T V +E Sbjct: 192 MARSGDGNYYYVADAEQLPTIFERELQGLAATYGNGVTLTATSQAGVQVLDLLNDFELD- 250 Query: 425 LRVEHFNNDNVDAGDIGAGKHITLLFELTLNGQKASIDKLRYAPDNKLAKSDKTKELAWL 484 N ++ G I ++ L + +I + + L+ D ++ L Sbjct: 251 ------NQGRYQLPNLIYGDSIDVVVRLKVP----AIKEEQILGTVTLSWLDGERQKQTL 300 Query: 485 KIRWKYPQGKESQLVEFPLGPTINAPSEDMRFRAAVAAYGQKLRGSEYLNNTSWQQIKQW 544 + + P + + P + M A +++ +Y Q+ Q Sbjct: 301 MVNLQLPVLTKVEFEALPSNQEVQQQVALMMSARAKKEAMERVDRGDYGGA---GQVLQE 357 Query: 545 AQQ 547 A+ Sbjct: 358 ARA 360 >UniRef50_Q2BJ22 Putative uncharacterized protein n=1 Tax=Neptuniibacter caesariensis RepID=Q2BJ22_9GAMM Length = 445 Score = 211 bits (536), Expect = 9e-53, Method: Composition-based stats. Identities = 57/266 (21%), Positives = 124/266 (46%), Gaps = 8/266 (3%) Query: 194 QRTLLKVDILA-KDRKSEELPASNLVFLIDTSGSMISDERLPLIQSSLKLLVKELREQDN 252 + +K+ + K +++ +N+ ++D SGSM ++L + + + + L + D Sbjct: 48 HKAFIKISLEGHKLEQTQARIPANIAIVLDKSGSM-QGDKLFRAKEAAIMAINRLSQNDI 106 Query: 253 IAIVTYAGDSRIALPSISGSHKAEINAAIDSLDAEGSTNGGAGLELAYQQATKGFIKGGI 312 +++V+Y + +P+ S I AI+ + A G+T AG+ + K + Sbjct: 107 VSVVSYDSRVNVVVPATKVSDTNTIARAINRIQANGNTALFAGVSKGANELRKFLDLNKV 166 Query: 313 NRILLATDGDFNVGIDDPKSIESMVKKQRESGVTLSTFGVGNSNYNEAMMVRIADVGNGN 372 NR++L +DG N+G P + + + G++++T G+G YNE +M ++A +GN Sbjct: 167 NRVILLSDGLANIGPSTPNELGKLGLSLAKEGMSVTTIGLGLG-YNEDLMTQLAGFSDGN 225 Query: 373 YSYIDTLSEAQKVLNSEMRQMLITVAKDVKAQIEFNPAWVTEYRQIGYEKRQLRVEHFNN 432 +++++ + +V E +L VA+ V I VT R +G E + + Sbjct: 226 HAFVENADDLARVFQYEFGDVLSVVAQGVDIHIRCLNG-VTPLRVLGREADI----NGSQ 280 Query: 433 DNVDAGDIGAGKHITLLFELTLNGQK 458 + + + ++ E+ + Q+ Sbjct: 281 VRTRLNQLYSEQEKFIILEVEVPSQQ 306 >UniRef50_Q21PJ3 von Willebrand factor, type A n=1 Tax=Saccharophagus degradans 2-40 RepID=Q21PJ3_SACD2 Length = 763 Score = 210 bits (535), Expect = 1e-52, Method: Composition-based stats. Identities = 76/377 (20%), Positives = 148/377 (39%), Gaps = 33/377 (8%) Query: 45 QQAAIKEAEQSAAAAKALAQQEVQQYSDKQALQGRLQEAPTFARAAKAKATHIANPGTAR 104 + +A AE++ + A Q + T + H+ +P Sbjct: 223 ENSAPDTAEKTPTSIDLAAGGYGWQSFNP--------IIHTQKPTPQVPDAHLISPPMVL 274 Query: 105 YQ-QFDDNPVKQVAQNPLATFSLDVDTG-SYANVRRFLNQGLL--PPPDAVRVEEIVNYF 160 Q Q+ D +Q ++ AT S+ +D G + AN+ +Q + PP A VE Sbjct: 275 AQGQYGDGQYEQTGKDNRATISIQLDAGFNVANIESLYHQITINKPPSSAYNVELTNGST 334 Query: 161 PSDWDIKDKQSIPASKPIPFAMRYELAPAPWNEQRTLLKVDILAKDRKSEELPASNLVFL 220 D D + AS A+ E E LL + ++ + + ++VF+ Sbjct: 335 LMDRDFVLQWRATASSAPQAAVFKE---TLAGEDYLLLMLLPPQGQQQHTQSLSRDIVFV 391 Query: 221 IDTSGSMISDERLPLIQSSLKLLVKELREQDNIAIVTYAGDSRIALP---SISGSHKAEI 277 +DTSGSM + + SL+ ++ L D I+ + S + S+ Sbjct: 392 VDTSGSM-QGTSIQQAKRSLQFALRGLNPSDTFNIIEFDTSFSRFRSRPVSATASNVQAA 450 Query: 278 NAAIDSLDAEGSTNGGAGLELAYQQA--------TKGFIKGGINRILLATDGDFNVGIDD 329 + +++L+A+ T A LE A+ Q + +++ TDG + + Sbjct: 451 VSWVNNLNADNGTEMYAALEEAFDQLASINPNGTENSKSSNNLQQVVFITDGA----VGN 506 Query: 330 PKSIESMVKKQRESGVTLSTFGVGNSNYNEAMMVRIADVGNGNYSYIDTLSEAQKVLNSE 389 +++ S++ + R + L T +G++ N M + A G G +I +E +N+ Sbjct: 507 EQALLSLIHR-RLNNARLFTVAIGSAP-NSYFMRKAAQFGKGANVFIGDTAEVTHKMNAL 564 Query: 390 MRQMLITVAKDVKAQIE 406 + ++ T+ D+ Q Sbjct: 565 LSKLKTTLVSDINVQWP 581 >UniRef50_Q8H923 Putative uncharacterized protein OSJNBa0071K18.17 n=5 Tax=Poaceae RepID=Q8H923_ORYSJ Length = 606 Score = 209 bits (532), Expect = 2e-52, Method: Composition-based stats. Identities = 62/309 (20%), Positives = 128/309 (41%), Gaps = 24/309 (7%) Query: 214 ASNLVFLIDTSGSMISDERLPLIQSSLKLLVKELREQDNIAIVTYAGDS--RIALPSISG 271 +LV ++D SGSM + +L L++ ++ ++ L D + +V+++ ++ R L +S Sbjct: 142 PLDLVTVLDVSGSM-AGRKLALVKKAMGFVIDNLGPADRLCVVSFSTEASRRTRLLRMSE 200 Query: 272 SHKAEINAAIDSLDAEGSTNGGAGLELAYQQATKGFIKGGINRILLATDGDFNV------ 325 KA A++SL + +TN G GL +A + K ++ ++L +DG + Sbjct: 201 VGKATAKRAVESLVDDSATNIGDGLRVAGRVLGDRRHKNAVSSVILLSDGKDSYVVPRRG 260 Query: 326 -GIDDPKSIESMVKKQRESG--VTLSTFGVGNSNYNEAMMVRIADVGNGNYSYIDTLSEA 382 G+ + G + TFG G ++++ A M IA+ G +S+++ + Sbjct: 261 NGMSYMDLVPPSFASSGGRGQLAPIHTFGFG-ADHDAAAMNTIAESTGGTFSFVENEAAI 319 Query: 383 QKVLNSEMRQMLITVAKDVKAQIEF-NPAW-VTEYRQIGYEKRQLRVEHFNNDNVDAGDI 440 Q + +L +D + + +P V E + YE R +V+ G++ Sbjct: 320 QDSFAQCIGGLLSVAVQDARIAVACSSPGVLVREIKSGRYESRV--DADGRAASVEVGEL 377 Query: 441 GAGKHITLLFELTLNGQKASIDKLRYAPDNKLAKSDKTKELAWLKIRWKYPQGKESQLVE 500 A + L + + +A+ D + + + + R E +V Sbjct: 378 YADEERRFLLFINVPIAEATEDATQLIKLSCTYRD-------TVTGRTIDVAAGEDAVVR 430 Query: 501 FPLGPTINA 509 PL + Sbjct: 431 RPLEVSAAD 439 >UniRef50_UPI00006CDDCC von Willebrand factor type A domain containing protein n=1 Tax=Tetrahymena thermophila RepID=UPI00006CDDCC Length = 2138 Score = 209 bits (531), Expect = 3e-52, Method: Composition-based stats. Identities = 58/276 (21%), Positives = 122/276 (44%), Gaps = 11/276 (3%) Query: 212 LPASNLVFLIDTSGSMISDERLPLIQSSLKLLVKELREQDNIAIVTYAGDSRIALPSISG 271 +L+ +IDTSGSM + + L L++ +L LV L+ D I ++ ++ +++ P +S Sbjct: 1443 RFPIDLICVIDTSGSM-NGQPLDLLKETLLFLVDLLQTGDRICLIQFSTNAQRLTPLLSI 1501 Query: 272 SHKAEINAA---IDSLDAEGSTNGGAGLELAYQQATKGFIKGGINRILLATDGD---FNV 325 K I + I+ L A+G TN G++LA+ + K I + L +DG Sbjct: 1502 ESKDNIKSIKNEINRLVAKGGTNICQGMQLAFDVLKQRRYKNPITSVFLLSDGLNDGAEN 1561 Query: 326 GIDDPKSIESMVKKQRESGVTLSTFGVGNSNYNEAMMVRIADVGNGNYSYIDTLSEAQKV 385 I D + + E T+ TFG G +++ +M +I+ + +GN+ YI + + Sbjct: 1562 KIRDLLKQLNFYQNYNEENFTIQTFGFGK-DHDPNLMDKISQLMDGNFYYIGDIHRIDEC 1620 Query: 386 LNSEMRQMLITVAKDVKAQIEFNPAWVTEYRQI-GYEKRQLRVEHFNNDNVDAGDIGAGK 444 + + ++++V ++ + R + Y E + +++ + +G Sbjct: 1621 FIDALGGLFSVISQNVSINVQVPQEMREQIRIVKTYGDIWHTKEPYYEYSININQLLSGV 1680 Query: 445 HITLLFELTLNGQKASIDKLRYAPDNKLAKSDKTKE 480 + E+ L I+ L+ + + ++ E Sbjct: 1681 SKDYILEMEL--DSKLIEDLQCIHEIHSEEEIQSSE 1714 >UniRef50_Q1D9B7 von Willebrand factor type A domain protein n=3 Tax=Cystobacterineae RepID=Q1D9B7_MYXXD Length = 476 Score = 206 bits (525), Expect = 2e-51, Method: Composition-based stats. Identities = 82/338 (24%), Positives = 148/338 (43%), Gaps = 19/338 (5%) Query: 190 PWNEQRTLLKVDILAKDRKSEELPASNLVFLIDTSGSMISDERLPLIQSSLKLLVKELRE 249 P D+ + NL +ID SGSM S +L + + + L+ L + Sbjct: 70 PAGTSEVFATFDLSGAQVPGAQRSPVNLALVIDRSGSM-SGYKLAQAKQAARHLIGLLND 128 Query: 250 QDNIAIVTYAGDSRIALPSI--SGSHKAEINAAIDSLDAEGSTNGGAGLELAYQQATKGF 307 QD +AI+ Y D + +LPS+ + +++ + +D + EG TN GAGL Q + Sbjct: 129 QDRLAIIHYGSDVK-SLPSLEATAANRERMFQYVDGIWDEGGTNIGAGLSAGRYQLSTAQ 187 Query: 308 IKGGINRILLATDGDFNVGIDDPKSIESMVKKQRESGVTLSTFGVGNSNYNEAMMVRIAD 367 G+NR++L +DG G+ + + M ++ R +G+TLS GVG +++NE +M A+ Sbjct: 188 RTYGVNRLILMSDGQPTEGLTADEELTRMARELRATGLTLSAIGVG-TDFNEDLMQAFAE 246 Query: 368 VGNGNYSYIDTLSEAQKVLNSEMRQMLITVAKDVKAQIEFNPAWVTEYRQIGYEKRQLRV 427 G G Y +++ ++ + +++Q TVA+ V P + +GY Q Sbjct: 247 YGAGAYGFLEDAAQLSTLFQKDLQQAGTTVARGVTMTFTLPPGT-SLGEVLGYRASQ--- 302 Query: 428 EHFNNDNVDAGDIGAGKHITLLFELTLNGQKASIDKLRYAPDNKLAKSDKTKELAWL-KI 486 N +V D AG+ ++ L + G S+ + D KLA +D ++ + Sbjct: 303 -SGNQVHVSLPDFSAGQLERVVVRLNVTGD--SVGRTARVLDLKLAYTDLIRDAEVANEA 359 Query: 487 RWKYPQGKESQLV------EFPLGPTINAPSEDMRFRA 518 + V E + + +M+ A Sbjct: 360 SLSAVVTNRREEVLARQDREATVYAARARSAVNMQKAA 397 >UniRef50_A6GAI6 Putative uncharacterized protein n=1 Tax=Plesiocystis pacifica SIR-1 RepID=A6GAI6_9DELT Length = 560 Score = 206 bits (523), Expect = 2e-51, Method: Composition-based stats. Identities = 88/427 (20%), Positives = 164/427 (38%), Gaps = 45/427 (10%) Query: 151 VRVEEIVNYFPSDWDIKDKQSIPASKPIPFAMRYELAPAPWNEQRTLLKVDILAKDRKSE 210 +R E +NY+ D+D A + +E R +++ + ++ E Sbjct: 161 IRTWEFMNYYGFDYDPA------ADGELSVYAAMNPIEGEGDEARFQMQIGVASELMTPE 214 Query: 211 ELPASNLVFLIDTSGSMISDERLPLIQSSLKLLVKELREQDNIAIVTYAGDSRIALP--S 268 E P N+ ++DTSGSM + + L++ + + + +L+ D ++I + + L + Sbjct: 215 ERPPMNVTLVLDTSGSM-AGTPIELLRETSRAIAAQLKLGDTVSICEWDTSNDWTLAGYA 273 Query: 269 ISGSHKAEINAAIDSLDAEGSTNGGAGLELAYQQATKGFIKGGINRILLATDGDFNVGID 328 ++G + + I+ + G TN GLE Y+ A + INR++L +DG N GI Sbjct: 274 VTGPNDELLLEKINDVVHGGGTNLYGGLESGYELAQMVYDPDAINRLVLISDGGANAGIT 333 Query: 329 DPKSIESMVKKQRESGVTLSTFGVGN-SNYNEAMMVRIADVGNGNYSYIDTLSEAQKVLN 387 D I G+ L GV + +YN+ +M + D G G ++ + E Sbjct: 334 DLDLIAENAAYGGSDGIYLVGVGVDDPDDYNDELMDAVTDAGKGASVFMPSEEEVWTTFG 393 Query: 388 SEMRQMLITVAKDVKAQIEFNPAWVTEYRQIGYEKRQLRVEHFNNDNVDAGDIGAGKHIT 447 ++ A++V+ Q++ P G+E + E + D + + T Sbjct: 394 DNFESVMAIAAREVQVQLDMPP---------GFEVVKFSGEEISGDPKEVEPQNLAPNDT 444 Query: 448 LLFELTLNGQKASIDKLRYAPDNKLAKSDKTKELAWLKIRWKYPQGKESQLV--EFPLGP 505 ++F + LA D + + W P ESQ + + +G Sbjct: 445 MVFYQQVETCAP-----------DLAGEDAE---VTVTVTWDDPWTFESQELAQTWTIGE 490 Query: 506 TINAPSEDMRFRAAVAAYGQKLRG--SEYLNNTSWQ------QIKQWAQQAKGEDPQGYR 557 + AA+ AY L+ Y N+ AQ A D G Sbjct: 491 LTGMDQALLLKGAAILAYTDALKAFKQAYTNDQKAAALQPALDALALAQTA--NDTDGDL 548 Query: 558 AEFIRLI 564 E +++ Sbjct: 549 IEIGQIL 555 >UniRef50_D0LL92 von Willebrand factor type A n=1 Tax=Haliangium ochraceum DSM 14365 RepID=D0LL92_HALO1 Length = 430 Score = 206 bits (523), Expect = 2e-51, Method: Composition-based stats. Identities = 87/365 (23%), Positives = 148/365 (40%), Gaps = 17/365 (4%) Query: 170 QSIPASKPIPFAMRY--ELAPAPWNEQRTLLKVDILAKDRKSEELPASNLVFLIDTSGSM 227 Q PA + A+ + P N + L V + +L +ID SGSM Sbjct: 4 QPTPAQRAGSVAVTVTPQYDLLPSNARELNLMVRLEGTGDAPATRAPLDLALVIDRSGSM 63 Query: 228 ISDERLPLIQSSLKLLVKELREQDNIAIVTYAGDSRIALPSISGSH--KAEINAAIDSLD 285 S ++L ++++ L++ L+ +D I +V+Y+ D + L + E A+ +L Sbjct: 64 -SGDKLSDVKTAALELLETLQPEDTITLVSYSSDVSMHLMRTRADDAGQREARRALLALQ 122 Query: 286 AEGSTNGGAGLELAYQQATKGFIKGGINRILLATDGDFNVGIDDPKSIESMVKKQRESGV 345 A G T G GL A + + ++ ++L +DG N G P + + +GV Sbjct: 123 ARGGTALGPGLFRALEALEGASDRTRMSHLMLFSDGIANAGEVRPSVLGARAAGAFGAGV 182 Query: 346 TLSTFGVGNSNYNEAMMVRIADVGNGNYSYIDTLSEAQKVLNSEMRQMLITVAKDVKAQI 405 ++ST GVG +YNE +M R+AD G G Y +I +L+ EM+ ++ TVA+ V + Sbjct: 183 SVSTMGVGV-DYNEDLMTRLADQGGGRYHFIQDSEAIASILDDEMKGLVATVARGVTMDL 241 Query: 406 EFNPAWVTEYRQIGYEKRQLRVEHFNNDNVDAGDIGAGKHITLLFELTL----NGQKASI 461 V R GY E + G +GAG+ +L + L K + Sbjct: 242 -TRAEGVGTVRVFGYASE----ESAGRVHTRVGSLGAGQTRAILVRIDLLSDATADKRPL 296 Query: 462 DKLRYAPDNKLAKSDKTKELAWLKIRWKYP--QGKESQLVEFPLGPTINAPSEDMRFRAA 519 L D+ ++ L I + S+ + + + M A Sbjct: 297 GHLHIEFDDVSDDGERKSVDVPLSIAHTDDIAAARASEHKDVTVRVAEIESAASMELAAQ 356 Query: 520 VAAYG 524 A G Sbjct: 357 AAGRG 361 >UniRef50_Q22SJ4 von Willebrand factor type A domain containing protein n=8 Tax=Tetrahymena thermophila RepID=Q22SJ4_TETTH Length = 646 Score = 206 bits (523), Expect = 2e-51, Method: Composition-based stats. Identities = 53/300 (17%), Positives = 130/300 (43%), Gaps = 16/300 (5%) Query: 210 EELPASNLVFLIDTSGSMISDERLPLIQSSLKLLVKELREQDNIAIVTYAGDSRIA--LP 267 + P+ +LV +ID SGSM E++ ++++L L+ L D ++++ + + L Sbjct: 203 QSRPSIDLVCVIDNSGSM-QGEKIQNVKTTLLQLLDMLNSNDRLSLILFNSYPTLLCNLR 261 Query: 268 SISGSHKAEINAAIDSLDAEGSTNGGAGLELAYQQATKGFIKGGINRILLATDGDFNVGI 327 + + I + I+S+ A+G T+ +G+ +A+ K ++ I L +DG N Sbjct: 262 KVDDENTPNIQSIINSITADGGTDINSGMLMAFNILQKRQFFNPVSSIFLLSDGQDNGAD 321 Query: 328 DDPKSIESMVKKQRESGVTLSTFGVGNSNYNEAMMVRIADVGNGNYSYIDTLSEAQKVLN 387 + K + + + ++ +FG G S+++ +M RI + +GN+ Y++ +++ + Sbjct: 322 EKIKKYINSNQSLKNECFSIHSFGFG-SDHDGPLMNRICQLKDGNFYYVEKINQVDEFFV 380 Query: 388 SEMRQMLITVAKDVKAQIEFNPAWVTEYRQI--------GYEKRQLRVEHFNNDNVDAGD 439 + + VA+++ +I N +++ Y ++ + Sbjct: 381 DALGGLFSVVAQEILIEINLN-RQDKNFQKYFSNCKVSKTYGDMWKCIKQDEIYQIKINQ 439 Query: 440 IGAGKHITLLFELTLNGQKASIDKLRYAPDNKLAKSDKTKELAWLKIRWKYPQGKESQLV 499 + +G + E+ + Q+ + + + L + I++ K+S L+ Sbjct: 440 LFSGVSKDFIMEIVVPKQEVKMLE---DFERNLEIVKGQLTAIPVDIQYTTKIVKDSNLI 496 >UniRef50_A0C9G5 Chromosome undetermined scaffold_16, whole genome shotgun sequence n=2 Tax=Paramecium tetraurelia RepID=A0C9G5_PARTE Length = 648 Score = 205 bits (522), Expect = 3e-51, Method: Composition-based stats. Identities = 63/260 (24%), Positives = 117/260 (45%), Gaps = 10/260 (3%) Query: 214 ASNLVFLIDTSGSMISDERLPLIQSSLKLLVKELREQDNIAIVTYAGDSRIALP--SISG 271 +L+ +ID SGSM +++ +Q SL L+ L E+D + ++T+ G ++ P +++ Sbjct: 227 GIDLLCVIDKSGSMEG-KKIASVQQSLVQLLDFLSEKDRLCLITFDGSAQRLTPLKTLTQ 285 Query: 272 SHKAEINAAIDSLDAEGSTNGGAGLELAYQQATKGFIKGGINRILLATDGDFNVGIDDPK 331 +K AI S+ A G TN G E+A+ Q + +K + I L +DG Sbjct: 286 DNKNYFKKAIYSIRASGQTNIAKGTEIAFNQIQQRKMKNQVTSIFLLSDGQD----QGAA 341 Query: 332 SIESMVKKQRESGVTLSTFGVGNSNYNEAMMVRIADVGNGNYSYIDTLSEAQKVLNSEMR 391 K E VT+ +FG G S+++ A+M +I VG G++ YI+ + + + Sbjct: 342 EYIQRQKDVVEDIVTIHSFGYG-SDHDAALMSKICKVGQGSFYYIEDVKLLDEFFADALG 400 Query: 392 QMLITVAKDVKAQIEFNPAWVTEYRQI--GYEKRQLRVEHFNNDNVDAGDIGAGKHITLL 449 ++ +A+ V+ I+ P + +I Y ++E + I + + Sbjct: 401 RLSSALAEKVQIDIKCAPFIPFQDIKIQKTYGDMWKQIEQERLYQIKIPQIASDSRKDYV 460 Query: 450 FELTLNGQKASIDKLRYAPD 469 FE+ L I + P Sbjct: 461 FEIALPPYSEQILDEQRVPQ 480 >UniRef50_A0E9B3 Chromosome undetermined scaffold_84, whole genome shotgun sequence n=1 Tax=Paramecium tetraurelia RepID=A0E9B3_PARTE Length = 603 Score = 205 bits (522), Expect = 3e-51, Method: Composition-based stats. Identities = 62/249 (24%), Positives = 112/249 (44%), Gaps = 7/249 (2%) Query: 211 ELPASNLVFLIDTSGSMISDERLPLIQSSLKLLVKELREQDNIAIVTYAGDSRIALPSI- 269 + P +LV ++D SGSMI ++ L++ SL+ L+K L +D I I+ + + I I Sbjct: 116 DRPPIDLVCVVDVSGSMIG-RKINLVKDSLRYLMKILGPEDRICIIVFTTVAHIVTSFIR 174 Query: 270 -SGSHKAEINAAIDSLDAEGSTNGGAGLELAYQQATKGFIKGGINRILLATDGDFNVGID 328 + +K + AI L STN G+ A K ++ I L +DG + Sbjct: 175 NTQENKPLLKKAILELKGLASTNISDGMNKALWMLKNRKYKNPVSCIFLLSDGQDDYKGA 234 Query: 329 DPKSIESMVKKQRESGVTLSTFGVGNSNYNEAMMVRIADVGNGNYSYIDTLSEAQKVLNS 388 + + + + + E + TFG G +++ +M +IA GN+ YID +++A Sbjct: 235 EQRVFDQLQLLKIEEKFVIHTFGYGQ-DHDAYVMNQIAKYREGNFYYIDNINKASDYFIL 293 Query: 389 EMRQMLITVAKDVKAQIEFNPAWVTEYRQIGYEKRQLRVEHFNNDNVDAGDIGAGKHITL 448 M ML A++V ++ N + + G E + + NN + + G+ Sbjct: 294 AMSGMLSIYAQNVSINLKSNDCEI--VKAFG-EGQVWYKQDANNYKIQLNYLLEGESKDF 350 Query: 449 LFELTLNGQ 457 +FEL + Sbjct: 351 VFELFVKED 359 >UniRef50_B0SHY6 Anti-sigma factor antagonist n=2 Tax=Leptospira biflexa serovar Patoc RepID=B0SHY6_LEPBA Length = 550 Score = 204 bits (520), Expect = 6e-51, Method: Composition-based stats. Identities = 76/317 (23%), Positives = 131/317 (41%), Gaps = 17/317 (5%) Query: 192 NEQRTLLKVDILAKDRKSEELPASNLVFLIDTSGSMISDERLPLIQSSLKLLVKELREQD 251 E LL+ A + EE + ID S SM E++ + + LV L D Sbjct: 20 QENHLLLRFRTPA-NPNVEERKPLVIGLAIDKSWSMKG-EKMEAVIDASCALVNWLTRHD 77 Query: 252 NIAIVTYAGDSRIALPSISGSHKAEINAAIDSLDAEGSTNGGAGLELAYQQATKGFIKGG 311 ++IV Y+ D ++ P + K + I ++ STN G A + + I Sbjct: 78 AVSIVAYSADVQLIQPVTHLTEKVSVTDKIRNIQVATSTNLSGGWLSALKSLNQSKIPNA 137 Query: 312 INRILLATDGDFNVGIDDPKSIESMVKKQRESGVTLSTFGVGNSNYNEAMMVRIADVGNG 371 R+LL TDG+ GI D +++ ++ G++ +T GVGN ++NE M+V IA G G Sbjct: 138 YKRVLLLTDGNPTSGIKDKEALVTIAADHLSMGISTTTIGVGN-DFNEEMLVEIAKAGGG 196 Query: 372 NYSYIDTLSEAQKVLNSEMRQMLITVAKDVKAQIEFNPAWVTEYRQIGYEKRQLRVEHFN 431 N+ YID A + E + A+ + +++ P +Q+ E +E F+ Sbjct: 197 NFYYIDNPENASDIFFEEFGDIGALYAQAIDVELQLAPG--VRLKQVLSETSHQVMEEFD 254 Query: 432 NDNVDA------------GDIGAGKHITLLFELTLNGQKASIDKLRYAPDNKLAKSDKTK 479 DA GD+ A L+ L ++ + + + K Sbjct: 255 EFLGDAKTISRQKINLQLGDLRADDLRNLVLRLEIDDRVNETNTPFCEVNVSYYNLLKQN 314 Query: 480 ELAWLKIRWKYPQGKES 496 L +K + + +GK + Sbjct: 315 VLESVKQTFSFERGKNT 331 >UniRef50_B4W304 von Willebrand factor type A domain protein (Fragment) n=2 Tax=Cyanobacteria RepID=B4W304_9CYAN Length = 538 Score = 204 bits (519), Expect = 8e-51, Method: Composition-based stats. Identities = 85/384 (22%), Positives = 154/384 (40%), Gaps = 29/384 (7%) Query: 187 APAPWNEQRTLLKVDILAKDRKSEELPASNLVFLIDTSGSMISDERLPLIQSSLKLLVKE 246 A +P + + + S P NL ++D SGSM + L + + L+ Sbjct: 14 ADSPSTVDLLITFQGSESSQQTSSRRP-LNLSLVLDRSGSM-AGAPLRYAIQAAQNLIDY 71 Query: 247 LREQDNIAIVTYAGDSRIALPSISGSHKAEINAAIDSLDAEGSTNGGAGLELAYQQATKG 306 L D +++V Y + + +P +A + A I + A G TN G L Q Sbjct: 72 LTADDFVSVVIYDDTAEVIIPPQLVGDQAALKAKIGKIRARGCTNLSGGWLLGCSQVQAN 131 Query: 307 FIKGGINRILLATDGDFNVGIDDPKSIESMVKKQRESGVTLSTFGVGNSNYNEAMMVRIA 366 INR+LL TDG N GI DP+ + ++ E+ + +T G GN +NE +++ +A Sbjct: 132 QSPERINRVLLLTDGLANYGIKDPQVLTKTALEKAEADIVTTTLGFGNY-FNEDLLINMA 190 Query: 367 DVGNGNYSYIDTLSEAQKVLNSEMRQMLITVAKDVKAQIEFNPAWVTEYRQIGYEKRQLR 426 + GN+ +I + +A +V EM ++ VA++++ +++ Y Q+ Sbjct: 191 NAARGNFYFIQSPDDASQVFEIEMESLVSVVAQNLRVRLQPEEFVKVGEILNNYRFSQV- 249 Query: 427 VEHFNNDNVDAGDIGAGKHITLLFELTLNGQK----ASIDKLRYAPDNKLAKS---DKTK 479 N V GD+ + L LT++ + +I + Y + S + Sbjct: 250 ---GNTIEVVLGDVYGVEQKPLAIPLTVSPRSQSGMTTIATVTYDYQTIVEGSIQDISNQ 306 Query: 480 ELAWLKIRWKY----------PQGKESQLVEFPLGPTINAPSEDMRFRAAVAAYGQKLRG 529 L + + + SQL L + ++ F+ AV KLR Sbjct: 307 FSITLTVGSEAEAKNIQPDPQVLEQTSQLRIAKLKDEAVSLADKGDFQQAVV----KLRE 362 Query: 530 S-EYLNNTSWQQIKQWAQQAKGED 552 + E L + + A++ D Sbjct: 363 TIEELKRKTLDEFFDIAEEIAQLD 386 >UniRef50_B9SJS6 Protein binding protein, putative n=6 Tax=Magnoliophyta RepID=B9SJS6_RICCO Length = 540 Score = 202 bits (514), Expect = 3e-50, Method: Composition-based stats. Identities = 83/414 (20%), Positives = 163/414 (39%), Gaps = 36/414 (8%) Query: 170 QSIPASKPIPFAMRYELAP---APWNEQRTLLKVDILAKDRKSEELPASNLVFLIDTSGS 226 +S P +P ++ AP E + + +++ D S P +LV ++D S S Sbjct: 52 RSRPTPPIVPARVKLRSINNDMAPLEESKLKVMLELTGGDSSSYGRPGLDLVAVLDVSRS 111 Query: 227 MISDERLPLIQSSLKLLVKELREQDNIAIVTYAGDSRIALP--SISGSHKAEINAAIDSL 284 M D ++ +++++ ++K+L D ++IVT++G + P +G + E I+ L Sbjct: 112 MEGD-KMEKMKTAMLFIIKKLGPTDRLSIVTFSGGANRLCPLRQTTGKSQEEFENLINGL 170 Query: 285 DAEGSTNGGAGLELAYQQATKGFIKG-GINRILLATDGDFNVGIDDPKSIESMVKKQRES 343 +A+G+TN AGL+ A + G + I+L +DG+ N G D Sbjct: 171 NADGATNITAGLQTALKVLKGRSFNGERVVGIMLMSDGEQNAGSD--------ATGVSVG 222 Query: 344 GVTLSTFGVGNSNYNEAMMVRIADVG-NGNYSYIDTLSEAQKVLNSEMRQMLITVAKDVK 402 V + TFG G N+ + IA G +S + + K + +L V +D+K Sbjct: 223 NVPIHTFGFGI-NHEPKGLKAIAHNSIGGTFSDVQNIDSLTKAFAQCLAGLLTVVVQDLK 281 Query: 403 AQIEFNPAWVTEYRQIGYEKRQLRVEHF-NNDNVDAGDIGAGKHITLLFELTLN----GQ 457 I P ++ +Q+ + + V GD+ + + ++ +L L G Sbjct: 282 MTIA-QPKDESKIQQVSAGSYPQTRDDVAGSVTVTFGDLYSKEVRKVIVDLLLPSASKGW 340 Query: 458 KASIDKLRYAPDNKLAKSDKTKELAWLKIRWKYPQGKESQLVEFPLGPTINAPSEDMRFR 517 ++ ++ YA + + ++ P +E V T + M+ Sbjct: 341 GGNVLEITYAYSTRGKLFEAPPATLTVRRTVASPVQEERPEVIT--EETRLRTAGMMKEA 398 Query: 518 AAVAAYGQKLRGSEYLNNTSWQQIKQWAQQAKGEDPQG-----YRAEFIRLIEL 566 A+A Y + + + +D R E +L++L Sbjct: 399 RAMAD-----NNKLYDARDKLVEAENLLEDVV-DDGHNPVIEMLRLELQQLLKL 446 >UniRef50_Q1DFU7 von Willebrand factor type A domain protein n=2 Tax=Cystobacterineae RepID=Q1DFU7_MYXXD Length = 422 Score = 201 bits (510), Expect = 8e-50, Method: Composition-based stats. Identities = 82/411 (19%), Positives = 168/411 (40%), Gaps = 24/411 (5%) Query: 171 SIPASKPIPFAMRYELAPAPWNEQRTLLKVDILAKDRKSEELPASNLVFLIDTSGSMISD 230 S P + A + A +++ A+ ++ + +L ++D SGSM + Sbjct: 3 SKPEDGALNMAGKLSGAYVQTGPSEAFAWMELKARPAETGQRVPVSLALVLDRSGSM-NG 61 Query: 231 ERLPLIQSSLKLLVKELREQDNIAIVTYAGDSRIA-LPSISGSHKAEINAAIDSLDAEGS 289 ++L + + LV+ L+ +D +A + Y D R+ ++ + E+ I L +GS Sbjct: 62 QKLADARRAATELVQRLKPEDRLAFIDYGTDVRVQPSRRMTEEAREELLTLISGLQDDGS 121 Query: 290 TNGGAGLELAYQQATKGFIKGGINRILLATDGDFNVGIDDPKSIESMVKKQRESGVTLST 349 TN L+ A + ++R +L +DG GI + V++ R G+T+S Sbjct: 122 TNISGALDAAANALRPHMREYRVSRAILLSDGQPTTGIVSEPGLLDQVRQLRRDGITVSA 181 Query: 350 FGVGNSNYNEAMMVRIADVGNGNYSYIDTLSEAQKVLNSEMRQMLITVAKDVKAQIEFNP 409 GVG +Y E +M +A+ G G +ID + +V + E+ Q TVA+ V+ +++ P Sbjct: 182 LGVGR-DYQETLMRGMAEQGGGFSGFIDDSARLAEVFSRELDQATSTVARMVELRLDVPP 240 Query: 410 AWVTEYRQIGYEKRQLRVEHFNNDNVDAGDIGAGKHITLLFELTLNGQKAS------IDK 463 V + +G N V D+ + + + ++ +LTLN + + + Sbjct: 241 E-VQDVEVMGMA----SFREGNVLKVPLYDMASAQTVRVMAKLTLNTSRTAGALALLNAR 295 Query: 464 LRYAPDNKLAKSDKTKELAWLKIRWKYPQGKESQLVEFPLGPTINAPSEDMRFRAAVAAY 523 + Y + ++ +L ++ + +E E + ++ M VAA Sbjct: 296 VHYVDVARDLPTETALQL-TAEVTSDLDRVREYLDKEVRVHAVRAMGTQHM-----VAAA 349 Query: 524 GQKLRGSEYLNNTSWQQIKQWAQQAKGEDPQGYRAEFIRLIELADGVTDIS 574 + RG++ ++ + A+ G E + + S Sbjct: 350 EEMKRGNKAKASS----LLDNARTIFGTSADALSGELADVRRSQAALAGAS 396 >UniRef50_A9F1H2 Family membership n=1 Tax=Sorangium cellulosum 'So ce 56' RepID=A9F1H2_SORC5 Length = 607 Score = 200 bits (509), Expect = 1e-49, Method: Composition-based stats. Identities = 70/272 (25%), Positives = 117/272 (43%), Gaps = 10/272 (3%) Query: 190 PWNEQRTLLKVDILAKDRKS-EELPASNLVFLIDTSGSMISDERLPLIQSSLKLLVKELR 248 P LL+V+I + + +L +ID SGSM S E+L L + + ++ L+ Sbjct: 12 PAEPSERLLRVEITVPRPEGGQARKPVHLSLVIDRSGSM-SGEKLRLALEAARQAIRTLQ 70 Query: 249 EQDNIAIVTYAGDSRIALPSI--SGSHKAEINAAIDSLDAEGSTNGGAGLELAYQQATKG 306 D ++VT+ + +PS + + AA+D++ A G+T+ G G + Sbjct: 71 PGDRFSVVTFDHQVEVPIPSTDATPGARLRAEAALDTVIARGNTDLGGGWLRGCAEVGAH 130 Query: 307 FIKGGINRILLATDGDFNVGIDDPKSIESMVKKQRESGVTLSTFGVGNSNYNEAMMVRIA 366 + I R+LL TDG N GI P + S + QR VT ST G+G +NE ++ R++ Sbjct: 131 LPEDAIGRVLLLTDGQANHGITSPDELTSRARSQRLRRVTTSTIGLGEG-FNEFLLGRLS 189 Query: 367 DVGNGNYSYIDTLSEAQKVLNSEMRQMLITVAKDVKAQIEFNPAWVTEYRQIGYEKRQLR 426 + G GN+ + E + E+ ++L VA+D I P V Y Sbjct: 190 EEGGGNFYFAARADELPGFVGREIGEVLSVVARDAALVIR-APGGVEVESLNDYP----C 244 Query: 427 VEHFNNDNVDAGDIGAGKHITLLFELTLNGQK 458 + G + G + +F L Sbjct: 245 TRDGGTCSFSLGSLPGGMVLAPMFRLRFPAGA 276 >UniRef50_D2VKS7 von Willebrand factor type A domain-containing protein n=2 Tax=Naegleria gruberi RepID=D2VKS7_NAEGR Length = 923 Score = 200 bits (508), Expect = 2e-49, Method: Composition-based stats. Identities = 57/246 (23%), Positives = 110/246 (44%), Gaps = 14/246 (5%) Query: 174 ASKPIPFAMRYELAPAPWNEQRTL-LKVDILAK---DRKSEELPASNLVFLIDTSGSMIS 229 S I F E P+ + L + + +E +LV ++D SGSM + Sbjct: 646 TSNKIVFNGHCEYEAIPFETECDLYCMATLQGPCFEQQAQKERKGVDLVLVVDKSGSM-A 704 Query: 230 DERLPLIQSSLKLLVKELREQDNIAIVTYAGDSRI--ALPSISGSHKAEINAAIDSLDAE 287 ++L +++S+L +V +L+E+D +AIV + + L + K + ++ Sbjct: 705 GQKLDMVKSTLSFMVDQLKEKDRVAIVEFDTQVKTNLDLTKMDIEGKKKAKQVSSAISPG 764 Query: 288 GSTNGGAGLELAYQQ-ATKGFIKGGINRILLATDGDFNVGIDDPKSIESMVKKQRES--- 343 TN L + + A++ K + ++L TDG N G+ I ++ + Sbjct: 765 SCTNLSGALFTSLKLLASRQQEKNEVTSVILFTDGLANRGLISTNEILQNMQDLMDELLS 824 Query: 344 --GVTLSTFGVGNSNYNEAMMVRIADVGNGNYSYIDTLSEAQKVLNSEMRQMLITVAKDV 401 VT+ TFG G + + M+ IA GNG Y Y++T + K + + ++ V +++ Sbjct: 825 TSNVTIHTFGFGQ-DTDANMLTSIAQKGNGLYDYLETADDIPKAFGNVIGNLVSVVGQNI 883 Query: 402 KAQIEF 407 K +I+ Sbjct: 884 KIRIQP 889 >UniRef50_C7RNW6 von Willebrand factor type A n=1 Tax=Candidatus Accumulibacter phosphatis clade IIA str. UW-1 RepID=C7RNW6_9PROT Length = 452 Score = 199 bits (507), Expect = 2e-49, Method: Composition-based stats. Identities = 81/388 (20%), Positives = 150/388 (38%), Gaps = 26/388 (6%) Query: 193 EQRTLLKVDILAKDRKSEE---LPASNLVFLIDTSGSMISDERLPLIQSSLKLLVKELRE 249 Q+ + + + A D + E +L +ID SGSM ++ + K + +L Sbjct: 22 AQKLPVLIRVQAPDPLATEKKARKPYHLALVIDRSGSMSGPPLAEAVRCA-KHIADQLEP 80 Query: 250 QDNIAIVTYAGDSRIALPSISGSHKAEINAAIDSLDAEGSTNGGAGLELAYQQATKGFIK 309 D ++V + + +P + ++ A+ + + GSTN G + + Sbjct: 81 TDIASLVVFDDRVQTLVPPRPVGDRQALHLALSRVHSGGSTNLHGGWQAGADGLLPAAGQ 140 Query: 310 GGINRILLATDGDFNVG-IDDPKSIESMVKKQRESGVTLSTFGVGNSNYNEAMMVRIADV 368 + R++L +DG+ NVG I DP I ++ + E GV+ ST+G+G S++NE +MV +A Sbjct: 141 AALARVILLSDGNANVGEITDPAGIAALCAQAAERGVSTSTYGLG-SHFNEDLMVEMAKR 199 Query: 369 GNGNYSYIDTLSEAQKVLNSEMRQMLITVAKDVKAQIEFNPAWVTEYRQIGYEKRQLRVE 428 G GN+ Y DT ++ + +E + A+ V+ + P I L Sbjct: 200 GGGNHYYGDTAADLFEPFAAEFDFISALCARHVRLSLAAAPGVG-----IRLLNDYLVEG 254 Query: 429 HFNNDNVDAGDIGAGKHITLLFELTLNGQKASIDKLRYAPDNKLAKSDKTKELAWLKIRW 488 + DI G L EL + A + A + + + LA+ Sbjct: 255 DAGFPVIRLPDIAFGAEAWALVELQIPVGLAGEGAGQLLQAAVTASTPEGEPLAFADATL 314 Query: 489 KYPQGKES---QLVEFPLGPTINAPSEDMRF---RAAVAAYGQKL---------RGSEYL 533 P + L+ PL + + + A A +G R Sbjct: 315 TLPAMSPAAWETLLSDPLVLSRQSELAAGKLLEQARAAAEHGDWNVVERLLAEARQRFAD 374 Query: 534 NNTSWQQIKQWAQQAKGEDPQGYRAEFI 561 + ++ A+ A+ D +R E + Sbjct: 375 QPWLIEVLESMAELARSRDTARFRKEAL 402 >UniRef50_B8HNU4 von Willebrand factor type A n=4 Tax=Chroococcales RepID=B8HNU4_CYAP4 Length = 421 Score = 197 bits (501), Expect = 1e-48, Method: Composition-based stats. Identities = 78/383 (20%), Positives = 151/383 (39%), Gaps = 36/383 (9%) Query: 189 APWNEQRTLLKVDILAKDRKSEEL-PASNLVFLIDTSGSMISDERLPLIQSSLKLLVKEL 247 +P + L++ + A + S E NL ++D SGSM + + L ++ + + LV L Sbjct: 15 SPTRVSQRQLEISVAAIAQASGERNAPLNLGLILDHSGSM-AGQPLETVKRAAQKLVDRL 73 Query: 248 REQDNIAIVTYAGDSRIALPSISGSHKAEINAAIDSLDAEGSTNGGAGLELAYQQATKGF 307 D +A++ + +++ +P+ + + +I I L A G T GL+L + Sbjct: 74 LPSDRLAVIVFDHVAKVLIPNQPVTDRDKIKTRISHLAAMGGTAIDEGLQLGLTELIAAK 133 Query: 308 IKGGINRILLATDGDFNVGIDDPKSIESMVKKQRESGVTLSTFGVGNSNYNEAMMVRIAD 367 G I++I L TDG+ G + + ++ + +TL+T G G ++N+ ++ +IAD Sbjct: 134 A-GAISQIFLLTDGENEHG--NNSRCLQLAEEAAKENITLNTLGFG-YHWNQDVLEQIAD 189 Query: 368 VGNGNYSYIDTLSEAQKVLNSEMRQMLITVAKDVKAQIEFNPAW----VTEYRQIGYEKR 423 G+ +I+ + Q++ + + +P + Q+ Sbjct: 190 AAGGSLMFIEYPQDVLIGFERLFNQIISVGFTNAHLHLSLSPGVRLANLKPIAQVAPATI 249 Query: 424 QLRVE-HFNNDNVDAGDIGAGKHITLLFELTLNGQKASIDKLRYAPDNKLAKSDKTKELA 482 L N V GD+ TLL L ++ + L P+ LA Sbjct: 250 DLPHGMEGNTAIVRLGDLLTDTPRTLLANLYIDPPTVA-HPLPSPPET----------LA 298 Query: 483 WLKIRWKYPQGKESQLVEFPLGPTI-------NAPSEDMRFRA-AVAAYGQ------KLR 528 L+IR+ P + + L P+ + P+ ++ A+A Y Q KL Sbjct: 299 TLQIRYDDPTQEHTGLRSEPIAVLAHRCADQESQPNPQVQKSILALAKYRQTQLAEAKLH 358 Query: 529 GSEYLNNTSWQQIKQWAQQAKGE 551 + + I G+ Sbjct: 359 QGDRAGAATMLHIAAQTALQMGD 381 >UniRef50_C7YL43 Putative uncharacterized protein n=2 Tax=Nectriaceae RepID=C7YL43_NECH7 Length = 764 Score = 197 bits (501), Expect = 1e-48, Method: Composition-based stats. Identities = 66/371 (17%), Positives = 131/371 (35%), Gaps = 46/371 (12%) Query: 156 IVNYFPSDWDIKDKQSIPASKPIPFAM-----RYELAPAPWNEQRTLLKVDILAKDRKSE 210 + ++ P++ P + L P P R L V + S Sbjct: 22 FWSTLTGGKKSEEIVGGPSAIQPPVVIASDDATLHLEPVP---DRKGLIVKVQPPTAPSA 78 Query: 211 ELP--ASNLVFLIDTSGSMISDER--------------LPLIQSSLKLLVKELREQDNIA 254 E+P ++V +ID SGSM L L + + + +++ + E D + Sbjct: 79 EIPHVPCDIVLVIDVSGSMAGAAPVPGEETNESTGLSILDLTKHAARTIIETMNESDRLG 138 Query: 255 IVTYAGDSRIALP--SISGSHKAEINAAIDSLDAEGSTNGGAGLELAYQQATKGFIKGGI 312 IVT+A +++ P S++ +K + S+ +TN GL + K + Sbjct: 139 IVTFASKAKVVQPLLSMTSENKERSRGNVTSMRPIDATNLWHGLLEGIK-LFKNVKSSNV 197 Query: 313 NRILLATDGDFNVGIDDPKSIESMVKKQRESGVTLSTFGVGNSNYNEAMMVRIADVGNGN 372 I++ TDG N ++ ++ + ++ TFG G + ++ IA++G GN Sbjct: 198 PAIMVLTDGMPNH-MNPAAGFVPKLRAMGQLPASIHTFGFG-YHLRSGLLKSIAEIGGGN 255 Query: 373 YSYIDTLSEAQKVLNSEMRQMLITVAKDVKA------QIEFNPAWVTEYRQIGYEKRQLR 426 Y++I V + + T A ++E T Q + Sbjct: 256 YAFIPDAGMIGTVFVHAVANLQSTFATRAVLKLTYSKELELEETTGTSVEQQPPQPVNGS 315 Query: 427 VEHFNNDNVDAGDIGAGKHITLLFELTLNGQKASIDKLRYAPDNKLAKSDKTKELAWLKI 486 + ++ G+I G+ + + + L + D + L I Sbjct: 316 DDGEMELTLNLGNIQYGQSRDIFLRVN-----------NLSKLESLKEKDASSSLVNASI 364 Query: 487 RWKYPQGKESQ 497 + P G+ + Sbjct: 365 AYLKPGGEPTT 375 >UniRef50_Q8YW34 All1782 protein n=5 Tax=Cyanobacteria RepID=Q8YW34_ANASP Length = 615 Score = 196 bits (497), Expect = 3e-48, Method: Composition-based stats. Identities = 60/259 (23%), Positives = 113/259 (43%), Gaps = 5/259 (1%) Query: 206 DRKSEELPASNLVFLIDTSGSMISDERLPLIQSSLKLLVKELREQDNIAIVTYAGDSRIA 265 + NL +ID SGSM + + + +V +L +D +++V Y Sbjct: 30 EIPESPRRNLNLSLVIDRSGSMAGAALHHAL-KAAESVVDQLEPKDILSVVVYDDAVDTV 88 Query: 266 LPSISGSHKAEINAAIDSLDAEGSTNGGAGLELAYQQATKGFIKGGINRILLATDGDFNV 325 +P + K + +I + A G TN G + INR+LL TDG N+ Sbjct: 89 VPPQPVTDKPALKKSIRQVRAGGITNLSGGWLKGCEYVKHQLDPQKINRVLLLTDGHANM 148 Query: 326 GIDDPKSIESMVKKQRESGVTLSTFGVGNSNYNEAMMVRIADVGNGNYSYIDTLSEAQKV 385 GI DPK + + ++ E G+T +T G +NE +++ +A NGN+ +I ++ EA +V Sbjct: 149 GIQDPKILTATSTQKAEEGITTTTLGFAQG-FNEDLLIGMARAANGNFYFIQSIDEAAEV 207 Query: 386 LNSEMRQMLITVAKDVKAQIEFNPAWVTEYRQIGYEKRQLRVEHFNNDNVDAGDIGAGKH 445 + E+ + V +++K +E +T + ++ + G++ G+ Sbjct: 208 FSIELDSLRSVVGQNLKVTLELADG-ITLVDTLSLA--KVSQNEAGQPVITLGELYEGED 264 Query: 446 ITLLFELTLNGQKASIDKL 464 L L ++ + L Sbjct: 265 KLLGLSLMISSAQVGELPL 283 >UniRef50_C4XPW8 Putative uncharacterized protein n=1 Tax=Desulfovibrio magneticus RS-1 RepID=C4XPW8_DESMR Length = 439 Score = 195 bits (496), Expect = 3e-48, Method: Composition-based stats. Identities = 63/259 (24%), Positives = 109/259 (42%), Gaps = 10/259 (3%) Query: 205 KDRKSEELPASNLVFLIDTSGSMISDERLPLIQSSLKLLVKELREQDNIAIVTYAGDSRI 264 + +++E NL ID SGSM + L + +V +L+ D ++++ Y Sbjct: 35 PEGETKERTRLNLALAIDRSGSM-AGRPLEEAKRCASFVVDKLKNTDRVSLIAYDSSIET 93 Query: 265 ALPSISGSHKAEINAAIDSLDAEGSTNGGAGLELAYQQATKGFIKGGINRILLATDGDFN 324 +PS+ KA + AI+ +D G TN G +Q + I+RI+L +DG N Sbjct: 94 RVPSVKVEDKAIFHRAIEGIDDGGCTNLHGGWLKGAEQISPYIDPSTISRIILLSDGQAN 153 Query: 325 VGIDDPKSIESMVKKQRESGVTLSTFGVGNSNYNEAMMVRIADVGNGNYSYIDTLSEAQK 384 G+ D I ++ ++GVT ST+G+G SN+NE +M+ +A G GN Y T + Sbjct: 154 EGLTDEAEIFKQCRELADAGVTTSTYGLG-SNFNETLMIGMAKNGQGNSYYGRTADDLMD 212 Query: 385 VLNSEMRQMLITVAKDVKAQIEFNPAWVTEYRQIGYEKRQLRVEHFNNDNVDAGDIGAGK 444 E+ + AK V+A I + + E + + D+ Sbjct: 213 PFQEELSLLEALFAKQVRASISASAGILFEI--------LNKYSTDRQGKIQLPDLAYEG 264 Query: 445 HITLLFELTLNGQKASIDK 463 +L + + Sbjct: 265 EAWMLVRCFIPRTQTGEGD 283 >UniRef50_B7G6X2 Predicted protein n=1 Tax=Phaeodactylum tricornutum CCAP 1055/1 RepID=B7G6X2_PHATR Length = 523 Score = 195 bits (496), Expect = 4e-48, Method: Composition-based stats. Identities = 67/308 (21%), Positives = 118/308 (38%), Gaps = 51/308 (16%) Query: 200 VDILAKDRKSEE---LPASNLVFLIDTSGSMISDERLPLIQSSLKLLVKELREQDNIAIV 256 I A+ E+ +L+ ++D SGSM +L L + +L +L++ L+ QD ++ Sbjct: 50 ASIHARTMPKEDEDCRTPIDLIVVLDVSGSMTG-NKLKLCKKTLTMLLRVLQTQDRFGLI 108 Query: 257 TYAGDSRIALPS--ISGSHKAEINAAIDSLDAEGSTNGGAGLELAYQQATKGFIKGGINR 314 ++ D+R+ P+ +S +KA I SL G TN A L LA Q+ + Sbjct: 109 SFGSDARVEFPAQAMSKQNKASALQKIQSLTTRGCTNMSAALGLAVQELKIIEKSNPVRS 168 Query: 315 ILLATDGDFNVGIDDPKSIESM-------------------------------------- 336 + TDG N GI D + S+ Sbjct: 169 LFFLTDGLANEGISDLDGLVSLTRNCLLPSDNPSNVLNSEVMIAECLDDLATSQHQITRL 228 Query: 337 ----VKKQRESGVTLSTFGVGNSNYNEAMMVRIADVG-NGNYSYIDTLSEAQKVLNSEMR 391 ++ + +TL TFG G ++N A++ +AD G Y +I+ S + + Sbjct: 229 PVAEIESVCRAPITLHTFGYGR-DHNAALLESLADTTQGGAYYFIEDDSNVGSAFGNALG 287 Query: 392 QMLITVAKDVKAQIEFN-PAWVTEYRQIGYEKRQLRVEHFNNDNVDAGDIGAGKHITLLF 450 ++ VA++ I A R + Q + V GD A + ++F Sbjct: 288 GIMSIVAQNAVLTIRLPSEAEARGARIVEVYHDQAIKRENDIYTVSLGDFYAEESRDVIF 347 Query: 451 ELTLNGQK 458 ++ L Sbjct: 348 KMELTKPA 355 >UniRef50_Q55874 Uncharacterized protein sll0103 n=16 Tax=Cyanobacteria RepID=Y103_SYNY3 Length = 420 Score = 194 bits (493), Expect = 8e-48, Method: Composition-based stats. Identities = 87/370 (23%), Positives = 153/370 (41%), Gaps = 41/370 (11%) Query: 193 EQRTLLKVDILAKDRKSEELPASNLVFLIDTSGSMISDERLPLIQSSLKLLVKELREQDN 252 + L++ + AK + NL ++D SGSM + L ++S+ L+ L E D Sbjct: 20 TSQRQLRIAVAAKADDHDRRLPLNLCLVLDHSGSMDG-QPLETVKSAALGLIDRLEEDDR 78 Query: 253 IAIVTYAGDSRIALPSISGSHKAEINAAIDSLDAEGSTNGGAGLELAYQQATKGFIKGGI 312 ++++ + ++I + + + A I AI+ L AEG T GL+L Q+A KG + + Sbjct: 79 LSVIAFDHRAKIVIENQQVRNGAAIAKAIERLKAEGGTAIDEGLKLGIQEAAKGK-EDRV 137 Query: 313 NRILLATDGDFNVGIDDPKSIESMVKKQRESGVTLSTFGVGNSNYNEAMMVRIADVGNGN 372 + I L TDG+ G D + + +T+ T G G+ ++N+ ++ IA G+ Sbjct: 138 SHIFLLTDGENEHG--DNDRCLKLGTVASDYKLTVHTLGFGD-HWNQDVLEAIAASAQGS 194 Query: 373 YSYIDTLSEAQKVLNSEMRQMLITVAKDVKAQIEFNP----AWVTEYRQIGYEKRQLRVE 428 SYI+ SEA ++M + +E P A V Q+ E L V+ Sbjct: 195 LSYIENPSEALHTFRQLFQRMSNVGLTNAHLLLELAPQAHLAIVKPVAQVSPETMDLTVQ 254 Query: 429 -HFNNDNVDAGDIGAGKHITLLFELTLNGQKASIDKLRYAPDNKLAKSDKTKELAWLKIR 487 + V GD+ + LL L L+ + ++IR Sbjct: 255 NQGAIEEVRLGDLMTDQERVLLLNLYLDQLLPGQHV-----------------IGQVQIR 297 Query: 488 WKYPQGKESQLVEFPLGPTIN-----APSEDMRFRAAV---AAYGQ------KLRGSEYL 533 + P ++ L+ PL TI PS D++ + ++ A Y Q KL+ + Sbjct: 298 YDDPASGQTNLLSDPLPLTIQVQTQYQPSTDVQVQESILTLAKYRQTQIAETKLKAGDRQ 357 Query: 534 NNTSWQQIKQ 543 + Q Sbjct: 358 GAATMLQTAA 367 >UniRef50_Q231J4 von Willebrand factor type A domain containing protein n=1 Tax=Tetrahymena thermophila SB210 RepID=Q231J4_TETTH Length = 520 Score = 193 bits (490), Expect = 2e-47, Method: Composition-based stats. Identities = 65/313 (20%), Positives = 137/313 (43%), Gaps = 21/313 (6%) Query: 192 NEQRTLLKVDILAKDRKSEEL-------PASNLVFLIDTSGSMISDERLPLIQSSLKLLV 244 N Q V + AKD ++++ +L+F+IDTSGSM +++ L++ S+ ++ Sbjct: 66 NNQIIPGVVSVEAKDFDADQVKKDKVRYQPLDLIFVIDTSGSM-QGKKIELVKKSILQVL 124 Query: 245 KELREQDNIAIVTYAGDSRIALPSI--SGSHKAEINAAIDSLDAEGSTNGGAGLELAYQQ 302 ++ D I++V + +++ L + + K +I +D L A G T G G++ A+ Sbjct: 125 HIIQGDDRISLVGFNSQAKVLLELTQLTKNSKKKIQKTVDELQAGGGTQIGFGMQKAFDI 184 Query: 303 ATKGFIKGGINRILLATDGDFNVGIDDPKSIESMVKKQRESGVTLSTFGVGNSNYNEAMM 362 + + I L +DG N G + M + + E + FG G+ +++ + Sbjct: 185 IKERTNSKNLASIFLLSDGQDNCGFSQTQHF--MNQSKIEYPFCIDCFGFGD-DHDSLTL 241 Query: 363 VRIADVGNGNYSYIDTLSEAQKVLNSEMRQMLITVAKDVKAQIEFNPA----WVTEYRQI 418 +I + G +++I +S+ + + VA++VK + F +T + Sbjct: 242 SKINQLQQGTFNFIRDISQIDDAFTIILAGIKTFVAQNVKISVNFGNTELMNGITVSKTY 301 Query: 419 GYEKRQLRVEHFNNDNVDAGDIGAGKHITLLFELTLNGQKASI-DKLRYAPDNKLAKSDK 477 G E ++++ + + + + AG+ +FEL + + + D+ RY Sbjct: 302 GSEWKKIQDKQY---EIQLNHLMAGRSKDFVFELQIPQFEMKLTDQQRYQIIGSAKIKAN 358 Query: 478 TKELAWLKIRWKY 490 + KI K Sbjct: 359 SLNQGKKKITKKA 371 >UniRef50_A0CDA1 Chromosome undetermined scaffold_17, whole genome shotgun sequence n=1 Tax=Paramecium tetraurelia RepID=A0CDA1_PARTE Length = 604 Score = 193 bits (490), Expect = 2e-47, Method: Composition-based stats. Identities = 66/353 (18%), Positives = 147/353 (41%), Gaps = 37/353 (10%) Query: 176 KPIPFAMRYELAPAPWNEQRTL-LKVDILAKDRKSEELP--------------------- 213 + I F + E +N+++ L V I AK+ ++P Sbjct: 124 ENIEFYLTSEYKTLRFNDRKALSCLVGIKAKEVHQPQVPQQIEGDAQNQFNMDQQQHSKV 183 Query: 214 ASNLVFLIDTSGSMISDERLPLIQSSLKLLVKELREQDNIAIVTYAGDSRIA--LPSISG 271 +L+ +ID SGSM S E++ +++ +L +L+ L +D + ++ + + L ++ Sbjct: 184 GVDLLCVIDRSGSM-SGEKIEMVKQTLNILLNFLGPKDRLCLIQFDDTCQRLTNLRRVTD 242 Query: 272 SHKAEINAAIDSLDAEGSTNGGAGLELAYQQATKGFIKGGINRILLATDGDFNVGIDDPK 331 +K + I + A G T G G ++A +Q + I + +DG I + Sbjct: 243 ENKTYYSDIISKIYANGGTVIGLGTQMALKQIKYRKSVNNVTAIFVLSDGQDEAAISSLQ 302 Query: 332 SIESMVKKQRESGVTLSTFGVGNSNYNEAMMVRIADVGNGNYSYIDTLSEAQKVLNSEMR 391 + K+ +T+ +FG G S+++ +M +I+++G G++ +++ +S + + Sbjct: 303 KQLAYYKQ----TLTIHSFGFG-SDHDAKLMTKISNLGKGSFYFVNNISLLDEFFVDALG 357 Query: 392 QMLITVAKDVKAQIE---FNPAWVTEYRQIGYEKRQLRVEHFNNDNVDAGDIGAGKHITL 448 + V D+ +E NP + Y + +L ++ ++ + G+ Sbjct: 358 ALTSMVVTDISINLENIMQNPYDAISLSKF-YGQEKLNYQN-KTIHLKIPYLAEGQRRDF 415 Query: 449 LFELTLNGQKASIDKLRYAPDNKLAK--SDKTKELAWLKIRWKYPQGKESQLV 499 +F+L + ++ A+ S +T+E + ESQL+ Sbjct: 416 VFQLNIPYINNNVQSENVVMFKASARITSSQTRETIEKQAELVIRFVDESQLI 468 >UniRef50_D0LWF9 von Willebrand factor type A n=1 Tax=Haliangium ochraceum DSM 14365 RepID=D0LWF9_HALO1 Length = 419 Score = 192 bits (488), Expect = 3e-47, Method: Composition-based stats. Identities = 79/372 (21%), Positives = 158/372 (42%), Gaps = 36/372 (9%) Query: 197 LLKVDILAKDRKSEELPASNLVFLIDTSGSMISDERLPLIQSSLKLLVKELREQDNIAIV 256 L V I A+ +S NL +ID S SM RL + + +V++L E+D ++++ Sbjct: 17 FLLVRIEAQATESSARMPVNLALVIDRSSSMRGP-RLASAIVAARQVVEQLDERDRLSVI 75 Query: 257 TYAGDSRIALPSISGSH--KAEINAAIDSLDAEGSTNGGAGLELAYQQATKGFIKGGINR 314 + +R +S + + + A+ L TN AG++ + GF++G ++R Sbjct: 76 AFDATARTIFGPMSVTDEARQTLEQALAGLRTGVGTNLAAGMKKGAEAVRSGFVRGALSR 135 Query: 315 ILLATDGDFNVGIDDPKSIESMVKKQRESGVTLSTFGVGNSNYNEAMMVRIADVGNGNYS 374 ++L TDG ++GI D + ++ +K+ + GVT++T G+G +++ ++ +A G G + Sbjct: 136 LVLLTDGQPSLGITDNDRLCALAQKEADRGVTITTMGLGQG-FDDELLADLAHSGRGGFH 194 Query: 375 YIDTLSEAQKVLNSEMRQMLITVAKDVKAQIEFNPAWVTEYRQIGYEKRQLRVEHFNNDN 434 Y+ + ++ E+ + A + +I PA + ++ + ++ Sbjct: 195 YLASAADIPGAFGRELSGVFAIAAT--QTEIGLRPAQQIDAAEVLHRLPSRPLDDG--LA 250 Query: 435 VDAGDIGAGKHITLLFELTLNGQKASIDKLRYAPDNKLAKSDKTKELAWLKIRWKYPQGK 494 V+ G++ AG +LF L+ +++ + R +S + P Sbjct: 251 VELGELAAGTPRQVLFRLS---RRSGDIEARCGTLTVTYRSSEG-----------TPGDA 296 Query: 495 ESQLVEFPLGPTINAPS----EDMRFRAAVA--------AYGQKLRGSEYLNNTSWQQIK 542 +E P P E MR A A A G LR L+ + Sbjct: 297 HLLGIEVPAQPDPAHRRIIALERMRLAVASAVDVAWARRASGDSLRALGALSEIKLE--V 354 Query: 543 QWAQQAKGEDPQ 554 ++++G DP Sbjct: 355 SQLKESEGADPD 366 >UniRef50_A6GC99 Putative uncharacterized protein n=1 Tax=Plesiocystis pacifica SIR-1 RepID=A6GC99_9DELT Length = 546 Score = 192 bits (488), Expect = 3e-47, Method: Composition-based stats. Identities = 70/294 (23%), Positives = 125/294 (42%), Gaps = 30/294 (10%) Query: 212 LPASNLVFLIDTSGSMISDERLPLIQSSLKLLVKELREQDNIAIVTYAGDSRIA--LPSI 269 P +L ++D SGSM ++L + + LV L EQD + +++Y L + Sbjct: 128 RPGLDLAIVLDRSGSM-GGDKLRFAKQAGLDLVNRLDEQDRVTLISYDDTVTPLSNLQRV 186 Query: 270 SGSHKAEINAAIDSLDAEGSTNGGAGLELAYQQATKGFIKG----------GINRILLAT 319 + + + G+T G L + Q+ G + ++L + Sbjct: 187 DDDGIEVLRRQLLDIQVGGTTALGPALFMGLQRLAAPEPFGPQTRTEARHDRLRHVILLS 246 Query: 320 DGDFNVGIDDPKSIESMVKKQRESGVTLSTFGVGNSNYNEAMMVRIADVGNGNYSYIDTL 379 DG NVG P+ I V + GV++ST G+G +YNE +M RIAD G G Y +I+ Sbjct: 247 DGIANVGETRPEVIGGRVAEHFGGGVSVSTLGMGL-DYNEDLMTRIADEGGGRYHFIEDA 305 Query: 380 SEAQKVLNSEMRQMLITVAKDVKAQIEFNPAWVTEYRQIGYEKRQLRVEHFNNDNVDAGD 439 +L E+ + TVA +V + P GY + ++ + G Sbjct: 306 ESIPAMLGDELAGLTATVASEVDSVFATLPGTDVT-EVYGYTQTV----AGSDTTIRVGF 360 Query: 440 IGAGKHITLLFELTLNGQKASIDKLRYAPDNKLAKSDKTKELAWLKIRWKYPQG 493 +GAG+ ++ L L+ ++ +AP ++ EL +++R++ G Sbjct: 361 LGAGQSREIVVNLRLDPEQVR----GWAPGERV-------ELGEVEVRYRLVTG 403 >UniRef50_Q22ST4 von Willebrand factor type A domain containing protein n=1 Tax=Tetrahymena thermophila SB210 RepID=Q22ST4_TETTH Length = 648 Score = 192 bits (487), Expect = 4e-47, Method: Composition-based stats. Identities = 59/300 (19%), Positives = 121/300 (40%), Gaps = 15/300 (5%) Query: 212 LPASNLVFLIDTSGSMISDERLPLIQSSLKLLVKELREQDNIAIVTY--AGDSRIALPSI 269 +LV +ID SGSM E++ L++ +L ++ + D I IV + +GD + + Sbjct: 219 RQTVDLVVVIDKSGSMEG-EKIQLVKETLVKIINLMSSMDRICIVCFNESGDRPLTFTRV 277 Query: 270 SGSHKAEINAAIDSLDAEGSTNGGAGLELAYQQATKGFIKGGINRILLATDGDFNVGIDD 329 + +K + I + A G TN G+ A + K + ILL +DG Sbjct: 278 TDENKQTLLNLIQQIYAGGGTNISEGINHALKAIQNRKFKNNVTSILLLSDGQDTKAYTR 337 Query: 330 PKSIESMVKKQRESGVTLSTFGVGNSNYNEAMMVRIADVGNGNYSYIDTLSEAQKVLNSE 389 K+ K Q + + T G G +++ ++ ++D+ NG ++++ ++ + Sbjct: 338 VKAYID--KYQIKDAFNIETIGFGE-DHDPKLLRTLSDLRNGTFNFMQDVNYLDTAFINI 394 Query: 390 MRQMLITVAKDVKAQIEF-NPAWVTEYRQIGYEKRQLRVEHFNNDNVDAGDIGAGKHITL 448 M+ TVA+++K ++F P +R ++ +I +G Sbjct: 395 FAGMISTVAQNIKVGVKFTPPEQFKNFRISKVFGDNWTKVSETQYEINLINILSGVSKDF 454 Query: 449 LFELTLNGQKASIDKL--------RYAPDNKLAKSDKTKELAWLKIRWKYPQGKESQLVE 500 +FE+ + + + A+S +KE A + ++ +VE Sbjct: 455 VFEMEADAFDEQTESTITDENSIFNFITAELTAQSVSSKEEANKQKTFQLQLIDTRSMVE 514 >UniRef50_B5JPY1 von Willebrand factor type A domain protein n=1 Tax=Verrucomicrobiae bacterium DG1235 RepID=B5JPY1_9BACT Length = 632 Score = 191 bits (486), Expect = 5e-47, Method: Composition-based stats. Identities = 78/320 (24%), Positives = 144/320 (45%), Gaps = 12/320 (3%) Query: 136 VRRFLNQGLLPPPDAVRVEEIVNY--FPSDWDIKDKQSIPASKPIPFAMRYELAPAPWNE 193 +R +++G++P P + E + + P D K+ + A +E A P + Sbjct: 62 LRNLIDEGIIPSPASFTAEGLFSEHDLPIGGDAKEGWLFDIASQ---ATSFESAAQPKVD 118 Query: 194 QRTLLKVDILAKDRKSEELPASNLVFLIDTSGSMISDERLPLIQSSLKLLVKELREQDNI 253 L + D + + NLV ++D SGSM S + L L++ SL+ +V +L D + Sbjct: 119 ILAQLGF-VSGIDATTFKPAPLNLVAVVDKSGSM-SGDPLELVRKSLRQVVSQLGSDDQL 176 Query: 254 AIVTYAGDSRIALPS--ISGSHKAEINAAIDSLDAEGSTNGGAGLELAYQQATKGFIK-G 310 +IV Y + I L S ++ +I A+ID + + GST AGLEL YQ A + Sbjct: 177 SIVLYGSSTHIHLEPTKTSTENRDQIIASIDRIQSHGSTAMEAGLELGYQVARQSADAFV 236 Query: 311 GINRILLATDGDFNVGIDDPKSIESMVKKQRESGVTLSTFGVGNSNYNEAMMVRIADVGN 370 G R++L TD NVG D +M + +S + L+T GVG ++ + +I+ V Sbjct: 237 GKTRVMLFTDERPNVGRTDATGFMAMAESGSKSDIGLTTIGVGV-HFGAELAEKISSVRG 295 Query: 371 GNYSYIDTLSEAQKVLNSEMRQMLITVAKDVKAQIEFNPAWVTEYRQIGYEKRQLRVEHF 430 GN + D + E+ M++ +A D+ ++ + G + Sbjct: 296 GNLFFFDDDESMETTFRKELDTMVLELAYDMSLKVTPAEGFFLSG-LYGIPGDAVTWADD 354 Query: 431 NNDNVDAGDIGAGKHITLLF 450 + +++ + A ++ ++ Sbjct: 355 GSLSLEIATLFASRNKGAIY 374 >UniRef50_A5UYN7 von Willebrand factor, type A n=1 Tax=Roseiflexus sp. RS-1 RepID=A5UYN7_ROSS1 Length = 459 Score = 191 bits (484), Expect = 8e-47, Method: Composition-based stats. Identities = 82/386 (21%), Positives = 154/386 (39%), Gaps = 20/386 (5%) Query: 143 GLLPPPDAVRVEEIVNYFPSDWDIKDKQSIPASKPIPFAMRYELAPAPWNEQRTLLKVDI 202 G L DA R +Y + ++ + P + + +PA + L Sbjct: 28 GELKYEDAARTASGGSYLTAQLELDQVYAPPGQNVDRYLLLTLCSPAKVPPEHAL----- 82 Query: 203 LAKDRKSEELPASNLVFLIDTSGSMISDERLPLIQSSLKLLVKELREQDNIAIVTYAGDS 262 + + P +LV ++D SGSM S +L + +L+ + L++ D ++VT++ Sbjct: 83 ----PREQHRPPLHLVAVLDVSGSM-SGTKLASAKEALRQALHFLQDGDVFSLVTFSDQV 137 Query: 263 RIALPSISGSHKA--EINAAIDSLDAEGSTNGGAGLELAYQQATKGFIKGGINRILLATD 320 + L + S + + ++ +D + A G T GL K + +LL +D Sbjct: 138 QTHLKAESYAQRKRDKMENLLDEIRASGMTALDGGLAQGIDLGQK--KRQATTLVLLLSD 195 Query: 321 GDFNVGIDDPKSIESMVKKQRESGVTLSTFGVGNSNYNEAMMVRIADVGNGNYSYIDTLS 380 G NVG D + I +K R+SG+ +ST GVG +YNEA+MV IA+ G G + +I S Sbjct: 196 GQANVGETDLEKIGLRAQKARQSGLIVSTLGVGL-DYNEALMVEIANQGGGRFYHIQEGS 254 Query: 381 EAQKVLNSEMRQMLITVAKDVKAQIEFNPAWVTEYRQIGYEKRQLRVEHFNNDNVDAGDI 440 + L E+ + A+ V+ + + Y + + GD+ Sbjct: 255 QIPAALMQELGSAAMLAARQVEVEFDLPSGAALVSLTALYPLEMVNSRPL----LKVGDL 310 Query: 441 GAGKHITLLFELTLNGQKASIDKLRYAPDNKLAKSDKTKELAWLKIRWKYPQGKESQLVE 500 + + LTL A + +T LA + ++ + ++ + Sbjct: 311 LPDVRVEIPLRLTLYPHAAGERFSVSGGVHHQTPRGQTLGLALNAVSVRFVEQRQFEERP 370 Query: 501 FPLGPTINAPSEDMRFRAAVAAYGQK 526 + P + E R A + + + Sbjct: 371 GYVAPVMERVLE-FRRAAHLLEFARL 395 >UniRef50_A6FSG0 Putative uncharacterized protein n=1 Tax=Roseobacter sp. AzwK-3b RepID=A6FSG0_9RHOB Length = 444 Score = 191 bits (484), Expect = 9e-47, Method: Composition-based stats. Identities = 73/352 (20%), Positives = 130/352 (36%), Gaps = 16/352 (4%) Query: 204 AKDRKSEELPASNLVFLIDTSGSMISDERLPLIQSSLKLLVKELREQDNIAIVTYAGDSR 263 A ++E P NL ++D S SM + L + + +V LR D +AIV + + Sbjct: 32 APVTETEPRPPLNLALVLDRSSSMRG-QPLHEAKRAADQIVAGLRPSDRLAIVAFDNATE 90 Query: 264 IALPSISGSHKAEINAAIDSLDAEGSTNGGAGLELAYQQATKGFIKGGINRILLATDGDF 323 + AA+ + A G T G L +Q+ G R+ L +DG Sbjct: 91 VMFSGGPRGDGQAARAALSRIHARGMTALHDGWLLGVEQSIAMREAGTPARVFLLSDGVA 150 Query: 324 NVGIDDPKSIESMVKKQRESGVTLSTFGVGNSNYNEAMMVRIADVGNGNYSYIDTLSEAQ 383 NVG+ D +I + + E G+T ST G+G +NE +M +A G GN Y +T + Q Sbjct: 151 NVGLTDASAIAADCTRMAEHGITTSTCGLGMG-FNEDLMAEMARAGRGNAYYGETAEDLQ 209 Query: 384 KVLNSEMRQMLITVAKDVKAQIEFNPAWVTEYRQIGYEKRQLRVEHFNNDNVDAGDIGAG 443 E + A+ ++ ++ G E R L + V D+ Sbjct: 210 DPFEQEFDLLRNICARGLRLRLSAGA---------GVEMRVLNQYPERDGEVLLPDLAFD 260 Query: 444 KHITLLFELTLNGQKASIDKLRYAPDNKLAKSDKTKELAWLKIRWKYPQGKESQLVEFPL 503 + EL + ++ ++ + P+ S Sbjct: 261 GEAWAMVELQFDESDEQPGDRLLLSAEVTGQTPDGDAISDGPASLRLPRLPASAFDMITQ 320 Query: 504 GPTINAPSEDMRFRAAVAAYGQKLRGSEYLNNTS-WQQIKQWAQQAKGEDPQ 554 + A + ++R A ++ R + ++ Q + AQ+ GE+ Sbjct: 321 EEIVQARARELR----AADLQERARLAARRHDWDQVQTLIDEAQRESGENAW 368 >UniRef50_C1GWG1 von Willebrand factor type A domain containing protein n=1 Tax=Paracoccidioides brasiliensis Pb01 RepID=C1GWG1_PARBA Length = 773 Score = 190 bits (482), Expect = 2e-46, Method: Composition-based stats. Identities = 78/414 (18%), Positives = 150/414 (36%), Gaps = 48/414 (11%) Query: 184 YELAPAPWNEQRTL-LKVDILAKDRKSEELPASNLVFLIDTSGSM----------ISDER 232 + P +++ +L L + K ++V ID SGSM S +R Sbjct: 42 VGVQLHPLSDKNSLILSIHPPLHPEKDIRHVPCDIVLCIDVSGSMQLSAPLPTTDESGKR 101 Query: 233 -------LPLIQSSLKLLVKELREQDNIAIVTYAGDSRIA--LPSISGSHKAEINAAIDS 283 L L + + + +++ L E D + +VT++ D+ +A + + ++K A+++ Sbjct: 102 EETGLSVLDLTKHAARTIIETLNENDRLGVVTFSNDAEVAYKISHMDDTNKKAALEAVEA 161 Query: 284 LDAEGSTNGGAGLELAYQQATK-GFIKGGINRILLATDGDFNVGIDDPK---SIESMVKK 339 L STN GL+L K + + + TDG N + ++++ Sbjct: 162 LQPLASTNLWHGLKLGLSVLGKVDLRPQNVQALYVLTDGQPNHMCPRQGYVPKLRPILER 221 Query: 340 QRESGVTLSTFGVGNSNYNEAMMVRIADVGNGNYSYIDTLSEAQKVLNSEMRQMLITVAK 399 Q++ + TFG G + ++ IA+VG G YS+I V + + T A Sbjct: 222 QKDRLPLIHTFGFG-YDIRSGLLQSIAEVGGGTYSFIPDAGMIGTVFVHAIANLYTTFAT 280 Query: 400 DVKAQIEFNPAWVTEYRQIGYEKRQLRVEHFNNDN---VDAGDIGAGKHITLLFEL-TLN 455 + + + + G + L E D V G++ G+ L+ + + Sbjct: 281 QARVLLRTS-GSAELVQDEGSKTGLLLDEMSAKDGDIIVTVGNLQYGQSRDLVLRIKNVT 339 Query: 456 GQK-ASIDKLRYAPDNKLAKSDKTKELAWLKIRWKYPQGKESQ-LVEFPLGPTINAPSED 513 G A+ L Y + ++L ++K S + T Sbjct: 340 GNASAAQATLTYNFQGSVKSVISNEQLLS---QYKSLPAHVSSYHLSRARICTFLRSIYP 396 Query: 514 MR------------FRAAVAAYGQKLRGSEYLNNTSWQQIKQWAQQAKGEDPQG 555 +R A A ++ ++ L T + GEDP+G Sbjct: 397 LRQDYEYMYLDANGLEKARAELDIVIKETKKLGYTDIAN-ASLVRDLAGEDPEG 449 >UniRef50_A9F2Q0 Putative uncharacterized protein n=1 Tax=Sorangium cellulosum 'So ce 56' RepID=A9F2Q0_SORC5 Length = 521 Score = 188 bits (477), Expect = 5e-46, Method: Composition-based stats. Identities = 70/276 (25%), Positives = 124/276 (44%), Gaps = 11/276 (3%) Query: 183 RYELAPAPWNEQRTLLKVDILAKDRKSEELPASNLVFLIDTSGSMISDERLPLIQSSLKL 242 R A P + Q T L ++ + L +NL +ID SGSM RL + Sbjct: 74 RVGHARLPRSAQETFLMFEVRGDGSPARSLAQANLSLVIDRSGSMKGT-RLTNAVQAATT 132 Query: 243 LVKELREQDNIAIVTYAGDSRIALPSIS--GSHKAEINAAIDSLDAEGSTNGGAGLELAY 300 V L + D +++VT+ + + +P + + I A++ + G T G+E Sbjct: 133 AVSRLNDGDVVSVVTFDTRTSVVVPPTTVGPETRGRILASVRGISLGGDTCISCGIEEGL 192 Query: 301 QQATKGFIKGGINRILLATDGDFNVGIDDPKSIESMVKKQRESGVTLSTFGVGNSNYNEA 360 G G++R+L+ +DGD N G+ D +M ++ R+ GV ++T GV + +YNE Sbjct: 193 SLL--GQTSAGVSRMLVLSDGDANHGVRDVPGFRAMAQRARDRGVAITTIGV-DVDYNEK 249 Query: 361 MMVRIADVGNGNYSYIDTLSEAQKVLNSEMRQMLITVAKDVKAQIEFNPAWVTEYRQIGY 420 ++ IA NG + +++ + ++ +E Q+ +VA + I+ P V R Sbjct: 250 ILSAIALDSNGRHYFVENDAALARIFEAEAEQLTTSVASGAELAIDLAPG-VELDRVFDR 308 Query: 421 EKRQLRVEHFNNDNVDAGDIGAGKHITLLFELTLNG 456 R + V G AG+ T+L ++ L G Sbjct: 309 SFR----RAGDQVIVPLGAFAAGEVKTVLLKVRLGG 340 >UniRef50_A0EHG0 Chromosome undetermined scaffold_97, whole genome shotgun sequence n=2 Tax=Paramecium tetraurelia RepID=A0EHG0_PARTE Length = 533 Score = 188 bits (477), Expect = 6e-46, Method: Composition-based stats. Identities = 58/291 (19%), Positives = 122/291 (41%), Gaps = 26/291 (8%) Query: 142 QGLLP----PPDAVRVEEIVNYFPSDWDIKDKQSIPASKPIPFAMRYEL---------AP 188 +G +P P ++++ + S+ + + + AM+Y L + Sbjct: 29 EGRVPIVGQNPSRFNDDDMITVYQSNNNKLNYGRNSLGQGYMTAMKYNLKDNISIQASSH 88 Query: 189 APWNEQRTLLKVDILAKD-----RKSEE--LPASNLVFLIDTSGSMISDERLPLIQSSLK 241 N+Q L + I + D ++ +E +LV LID SGSM E++ L++ +LK Sbjct: 89 TLMNQQNAALMITIKSNDILLINQRGQECVRQGVDLVCLIDHSGSM-QGEKIKLVRKTLK 147 Query: 242 LLVKELREQDNIAIVTYAGDSRIA--LPSISGSHKAEINAAIDSLDAEGSTNGGAGLELA 299 ++ L+ D + ++ + L ++ + + AI SL A G T+ G G+++A Sbjct: 148 QMLTFLQPCDRLCLIMFDCKVYRLTRLMRVTQENVQKFRVAISSLQARGGTDIGNGMKMA 207 Query: 300 YQQATKGFIKGGINRILLATDGDFNVGIDDPKSIESMVKKQRESGVTLSTFGVGNSNYNE 359 K ++ I L +DG + + + +++ T+ TFG G + Sbjct: 208 LSILKHRKYKNPVSAIFLLSDGVDEGA--EERVRDDLIQYNIRDSFTIKTFGFGR-DCCP 264 Query: 360 AMMVRIADVGNGNYSYIDTLSEAQKVLNSEMRQMLITVAKDVKAQIEFNPA 410 +M IA G + ++ L+ + + ++ VA V+ ++ + Sbjct: 265 KIMSEIAHYKEGQFYFVPNLTNIDECFAEALGGLVSVVANHVQLSVQPMHS 315 >UniRef50_C5FLY1 U-box domain containing protein n=1 Tax=Microsporum canis CBS 113480 RepID=C5FLY1_NANOT Length = 748 Score = 186 bits (473), Expect = 2e-45, Method: Composition-based stats. Identities = 69/411 (16%), Positives = 148/411 (36%), Gaps = 60/411 (14%) Query: 193 EQRTLLKVDILAKDRKSEELP--ASNLVFLIDTSGSMISDERLP---------------L 235 + + V I + +++P ++V +ID S SM S +P L Sbjct: 46 PNKNSMVVSIQPPLKPKDDVPHVPCDIVLVIDISASMNSAAPIPTGESGGEDTGLSILDL 105 Query: 236 IQSSLKLLVKELREQDNIAIVTYAGDSRIA--LPSISGSHKAEINAAIDSLDAEGSTNGG 293 + + K +++ L E D +A+VT+ + R+A L +S +K+++ AAID L STN Sbjct: 106 TKHAAKTIIQTLNENDRLAVVTFCTEIRVAFELEFMSEENKSKVLAAIDCLHGISSTNLW 165 Query: 294 AGLELAYQQATKGFIKGGINRILLATDGDFNVGIDDPKSIESMVKKQRESG------VTL 347 G++ + +G + +L+ TDG N + + + + + Sbjct: 166 HGIKEGLKVLATNSTQGNVQALLVLTDGAPNHMCPAQGYVPKLRQTLLDHRDLTGSLPLI 225 Query: 348 STFGVGNSNYNEAMMVRIADVGNGNYSYIDTLSEAQKVLNSEMRQMLITVAKDVKAQIEF 407 TFG G ++ IA++G G +++I V + + T + Sbjct: 226 HTFGFGYY-LRSPLLQSIAEIGGGTFAFIPDAGMIGTVFVHAVANLYSTFTPQANLLLHG 284 Query: 408 NPAWVTEYRQIGYEKRQLRVEHFNNDNVDAGDIGAGKHITLLF----------------- 450 + +L ++ G + G+ ++ Sbjct: 285 DSLTTFTVDLGSKPGLELENPTGKGVSLRLGSLQYGQSRDIIIHYDKSSKNRTVNIQGKL 344 Query: 451 ELTLNGQKASIDKLRYAPDNKLAKSDKTKELAWLK------IRWKYPQGKESQLVEFPLG 504 T+ G + + + ++++ + ++ +R +P ++ +FP Sbjct: 345 NYTVGGDIKTAEVHKLVHIDEVSLPRSVYDYHLMRSTICTLLRKFHPLEDTAENYQFPTA 404 Query: 505 PTINAPSEDMRFRAAVAAYGQKLRGSEYLNNTSWQQIKQWAQQAKGEDPQG 555 + S+ + A + + L S+ N + + I GE P G Sbjct: 405 NLGDIRSKIEKLAAEI----EALGHSDEYNQSLLKDIA-------GEPPHG 444 >UniRef50_Q22N58 von Willebrand factor type A domain containing protein n=1 Tax=Tetrahymena thermophila SB210 RepID=Q22N58_TETTH Length = 669 Score = 186 bits (473), Expect = 2e-45, Method: Composition-based stats. Identities = 66/308 (21%), Positives = 126/308 (40%), Gaps = 27/308 (8%) Query: 202 ILAKDRKSEELPASNLVFLIDTSGSMI---------SDER------LPLIQSSLKLLVKE 246 + E N+V L+D S SM ++ L L++ ++K + Sbjct: 20 VSVIPPDDLERHPCNIVCLVDGSLSMGSKLVIHQKNGGKKESDMTTLDLVKHTVKTIASS 79 Query: 247 LREQDNIAIVTYAGDSRI--ALPSISGSHKAEINAAIDSLDAEGSTNGGAGLELAYQQAT 304 L QD +A+V ++ S+I L + K ID + A G TN GL+ + + Sbjct: 80 LNPQDRLALVGFSTHSKIYFELTEMDDQGKNVAFTEIDKMWAGGQTNIWGGLQDSLEVIK 139 Query: 305 KGFIKGGINRILLATDGDFNVGIDDPKSIESMVKKQRESG----VTLSTFGVGNSNYNEA 360 KGF I L TDG + M+++ +E ++ TFG GN + + Sbjct: 140 KGFRPNQNVCIFLFTDGRPT--MIPAIGHVEMLRRWKEQHPAIQFSIFTFGFGN-DLDTD 196 Query: 361 MMVRIADVGNGNYSYIDTLSEAQKVLNSEMRQMLITVAKDVKAQIEFNPAWVTEYRQIGY 420 +M+ ++ NG +S+I S V ++ + +L T+A +V ++ + + + + Sbjct: 197 LMLELSQEQNGIFSFISDSSMLGTVFSNALANILSTMANNVHLNLQLSEGYTFDGEGVMQ 256 Query: 421 EKRQLRVEHFNNDNVD--AGDIGAGKHITLLFELTLNGQKASID-KLRYAPDNKLAKSDK 477 + + N +D G I G+ L+ ++ + D K+ Y KL D Sbjct: 257 SQSFQAKKTNKNTTLDLNLGLIRYGQTKDLILKILPQQNRKLSDVKITYTLKYKLFVGDN 316 Query: 478 TKELAWLK 485 +++ + Sbjct: 317 PQDIETVS 324 >UniRef50_B6TZ81 Protein binding protein n=9 Tax=Magnoliophyta RepID=B6TZ81_MAIZE Length = 516 Score = 186 bits (471), Expect = 3e-45, Method: Composition-based stats. Identities = 76/392 (19%), Positives = 151/392 (38%), Gaps = 34/392 (8%) Query: 189 APWNEQRTLLKVDILAKDRKSEELPASNLVFLIDTSGSMISDERLPLIQSSLKLLVKELR 248 AP E + +++ D S + +LV ++D SGSM E++ +++++K +VK+L Sbjct: 37 APLEENTQKVLLELTGGDSTS-DRSGLDLVAVLDVSGSM-QGEKIEKMKTAMKFVVKKLS 94 Query: 249 EQDNIAIVTYAGDSRIALP--SISGSHKAEINAAIDSLDAEGSTNGGAGLELAYQQATKG 306 D ++IVT+ + P ++ + ++ ID+L G+TN GL+ + Sbjct: 95 SIDRLSIVTFLDTANRICPLQQVTEDSQPQLLKLIDALQPGGNTNISDGLQTGLKVLADR 154 Query: 307 -FIKGGINRILLATDGDFNVGIDDPKSIESMVKKQRESGVTLSTFGVGNSNYNEAMMVRI 365 G + ++L +DG N G + V + TFG G ++Y+ ++ + Sbjct: 155 KLSSGRVVGVMLMSDGQQNRGEP--------AANVKIGNVPVYTFGFG-ADYDPTVLNAV 205 Query: 366 ADVG-NGNYSYIDTLSEAQKVLNSEMRQMLITVAKDVKAQIEFNPAWVTEYRQIGYEKRQ 424 A G +S ++ ++ + + +L V +D+ + T + Q Sbjct: 206 ARNSMGGTFSVVNDVNLLSMAFSQCLAGLLTVVVQDLTVTVARIEDESTIQKVAAGNYPQ 265 Query: 425 LRVEHFNNDNVDAGDIGAGKHITLLFELTLN--GQKASIDKLRYAPDNKLAKSDKTKELA 482 + V GD+ + + ++ +L L D L K A A Sbjct: 266 TPDADAGSVTVAFGDLYSKEVRKVIVDLLLPAIDSDRGADILEVTYSYKTAGKLFDAPPA 325 Query: 483 WLKIRWKYPQGKESQLVEFPLGPTINAPSEDMRFRAAV-AAYGQKLRGSEYLNNTSWQQI 541 + +R S P ++ +E+ R + A + + + L + Sbjct: 326 TVTVR-------RSGTAFPADDPPVDVQTEEARLKTATMIQQARTMADGKKLGDAR--DK 376 Query: 542 KQWAQQAKGE-----DP--QGYRAEFIRLIEL 566 AQ A + DP R E L++L Sbjct: 377 LAEAQNALEDVVAQSDPLLDALRTELQELLKL 408 >UniRef50_C9SWV9 U-box domain containing protein n=1 Tax=Verticillium albo-atrum VaMs.102 RepID=C9SWV9_VERA1 Length = 662 Score = 185 bits (470), Expect = 4e-45, Method: Composition-based stats. Identities = 52/310 (16%), Positives = 112/310 (36%), Gaps = 39/310 (12%) Query: 173 PASKPIPFAMRYELAPAPWNEQRTLLKVDILAKDRKSEELPASNLVFLIDTSGSMISDER 232 P + ++ E P + + + R ++V +ID SGSM Sbjct: 59 PLASRDGLLVKVEPPTTP--------REALQSGKRIP--RAPCDIVLVIDVSGSMDDAAP 108 Query: 233 ----------------LPLIQSSLKLLVKELREQDNIAIVTYAGDSRIALPS--ISGSHK 274 L L + + + +++ L E+D + IV + ++++ L ++ +K Sbjct: 109 APVIPGQKDENTGLSILDLTKHAARTILETLDERDRLGIVAFTTNAKVILSLVEMNPDNK 168 Query: 275 AEINAAIDSLDAEGSTNGGAGLELAYQQATK-GFIKGGINRILLATDGDFNVGIDDPKSI 333 I++L TN G+ + + G + +++ TDG N G I Sbjct: 169 VSAKDKIENLQPLNGTNMWHGITEGIKLFSDCDSSSGRVPAMMVLTDGLPNSGCPRLGYI 228 Query: 334 ESMVKKQRESGVTLSTFGVGNSNYNEAMMVRIADVGNGNYSYIDTLSEAQKVLNSEMRQM 393 + + + T+ TFG G + ++ IA++G GNY++I V + + Sbjct: 229 PKL-RDMGQLPATIHTFGFG-YHIRSGLLKSIAEIGGGNYAFIPDAGMIGTVFVHAVANL 286 Query: 394 LITVAKDVKAQIEFNPAWVTEYRQIGYEKRQLRVEHF-------NNDNVDAGDIGAGKHI 446 T A + + P+ + +G + + + G+I G+ Sbjct: 287 QSTFANRATLTLTY-PSELAIQESVGDSVEKQAPTELAGYLTPTSQLTISLGNIQYGQSR 345 Query: 447 TLLFELTLNG 456 + + + Sbjct: 346 DVYLRASPDA 355 >UniRef50_Q8LQ58 Os01g0640200 protein n=9 Tax=Poaceae RepID=Q8LQ58_ORYSJ Length = 589 Score = 185 bits (469), Expect = 4e-45, Method: Composition-based stats. Identities = 66/324 (20%), Positives = 131/324 (40%), Gaps = 24/324 (7%) Query: 183 RYELAPAPWNEQRTLLKVDILAKDRKSEELPASNLVFLIDTSGSMISDERLPLIQSSLKL 242 +Y A L +++ S + +LV +ID SGSM D R+ ++++L+ Sbjct: 37 KYHNDVASMAPHDQELLLELRG-SSSSTDRAGLDLVAVIDVSGSMDGD-RIDKVKTALQF 94 Query: 243 LVKELREQDNIAIVTYAGDSRIALP--SISGSHKAEINAAIDSLDAEGSTNGGAGLELAY 300 ++++L + D + IVT+ ++ P ++ + +AE+ A +D L A G TN GLE Sbjct: 95 VIRKLSDLDRLCIVTFCTNATRLCPLRFVTAAAQAELKALVDGLKAYGDTNMKGGLETGM 154 Query: 301 QQAT-KGFIKGGINRILLATDGDFNVGIDDPKSIESMVKKQRESGVTLSTFGVGNSNYNE 359 + G ++L +DG N G D + V + TF G S ++ Sbjct: 155 SVVDGRSLAAGRAVSVMLMSDGYQNHGGD--------ARDVHLKNVPVYTFSFGAS-HDS 205 Query: 360 AMMVRIADVG-NGNYSYIDTLSEAQKVLNSEMRQMLITVAKDVKAQIEFNPAWVTEYRQI 418 ++ IA G ++Y+ + + + +L +A+D++ + VT R + Sbjct: 206 NLLEAIARKSLGGTFNYVADSANLTGPFSQLLGGLLTIIAQDLELTVTRFHGEVTIKRVV 265 Query: 419 ---GYEKRQLRVEHFNNDNVDAGDIGAGKHITLLFELTL------NGQKASIDKLRYAPD 469 Q ++ V G + + + ++ L L A++ +Y Sbjct: 266 WVDAGTYPQTTASDGSSVTVSFGTLYSAEARRVIVYLALADKTASPPYDANVCLAQYRFT 325 Query: 470 NKLAKSDKTKELAWLKIRWKYPQG 493 + + +L +K R G Sbjct: 326 FQAQQVTSNPDLITIKRRPSAAPG 349 >UniRef50_Q8YP40 Alr4360 protein n=15 Tax=Cyanobacteria RepID=Q8YP40_ANASP Length = 427 Score = 184 bits (468), Expect = 6e-45, Method: Composition-based stats. Identities = 72/335 (21%), Positives = 137/335 (40%), Gaps = 38/335 (11%) Query: 194 QRTLLKVDILAKDRKSEELPASNLVFLIDTSGSMISDERLPLIQSSLKLLVKELREQDNI 253 + L + I A + E+ NL ++D SGSM + L ++ +++ L+ L+ D I Sbjct: 21 SQRQLAISISAVAEQFEQNLPLNLCLILDQSGSMHG-QPLKMVVEAVEKLLDRLQPGDRI 79 Query: 254 AIVTYAGDSRIALPSISGSHKAEINAAI-DSLDAEGSTNGGAGLELAYQQATKGFIKGGI 312 ++V +AG + + +P+ + I I L A G T GL+ + KG +G + Sbjct: 80 SVVAFAGSATVIIPNQIVENPESIKTQIRKKLQASGGTVIAEGLQQGITELMKG-TRGAV 138 Query: 313 NRILLATDGD---------FNVGIDDPKSIESMVKKQRESGVTLSTFGVGNSNYNEAMMV 363 ++ L TDG + +G DD + KK + +T++T G GN N+N+ ++ Sbjct: 139 SQAFLLTDGHGEDSLKIWKWEIGPDDSRRCLEFAKKAAKINLTINTLGFGN-NWNQDLLE 197 Query: 364 RIADVGNGNYSYIDTLSEAQKVLNSEMRQMLITVAKDVKAQIEFNP----AWVTEYRQIG 419 IAD G G ++I+ +A N ++ + + P A + Q+ Sbjct: 198 TIADAGGGTLAHIERPEQAVHHFNRLFTRVQSVGLTNAYLTLSLAPQVRLAELRPIAQVA 257 Query: 420 YEKRQLRVEHFNNDN--VDAGDIGAGKHITLLFELTLNGQKASIDKLRYAPDNKLAKSDK 477 + +L VE + + V GD+ + +L L L Sbjct: 258 PDIIELPVEPEADGSFIVRLGDLMKDINRVVLANLYLGKLPEGQQV-------------- 303 Query: 478 TKELAWLKIRWKYPQGKESQLVE--FPLGPTINAP 510 + +++R+ P E L+ +P+ + Sbjct: 304 ---IGNVQVRYDNPSLNEEGLLSQTWPIYANVMQA 335 >UniRef50_A6R161 Predicted protein n=3 Tax=Onygenales RepID=A6R161_AJECN Length = 759 Score = 184 bits (467), Expect = 8e-45, Method: Composition-based stats. Identities = 61/339 (17%), Positives = 120/339 (35%), Gaps = 35/339 (10%) Query: 174 ASKPIPFAMRYELAPAPWNEQRTLLKVDILAKDRKSEELP--ASNLVFLIDTSGSMISDE 231 A + P + +L P P + + + +E+P ++V ID S SM S Sbjct: 34 AGERSPNEVGVQLHPLP---DTNSMILSVHPPLHPEKEMPHVPCDIVLCIDVSYSMQSSA 90 Query: 232 RLP-----------------LIQSSLKLLVKELREQDNIAIVTYAGDSRIA--LPSISGS 272 LP L + + + +++ L E D + IV ++ ++ + + ++ S Sbjct: 91 PLPTTDESGEREETGLSVLDLTKHAARTIIETLNENDRLGIVAFSTEAEVVYEISKMNES 150 Query: 273 HKAEINAAIDSLDAEGSTNGGAGLELAYQQAT-KGFIKGGINRILLATDGDFNVGIDDPK 331 K A+++L STN GL+L + + + + + TDG N Sbjct: 151 SKKAALKAVEALKPLSSTNLWHGLKLGLKAFENERHTPQSVQALYVLTDGMPNHMCPKQG 210 Query: 332 SIESM--VKKQRESGVT-LSTFGVGNSNYNEAMMVRIADVGNGNYSYIDTLSEAQKVLNS 388 + + + + + + TFG G N ++ IA+VG G +++I V Sbjct: 211 YVTKLRPILQLLGHRMPMIHTFGFG-YNIRSGLLQAIAEVGGGTFAFIPDAGMIGTVFVH 269 Query: 389 EMRQMLITVAKDVKAQIEFNPAWVTEYRQIGYEKRQLRVEHFNND--NVDAGDIGAGKHI 446 + + T A K + + L E + V G + G+ Sbjct: 270 AIANLYTTFATQAKVTFRTSGSVTLAQDLGSKTGLGLHEESTRDSNLTVAIGTLQYGQSR 329 Query: 447 TLLFEL----TLNGQKASIDKLRYAPDNKLAKSDKTKEL 481 L+ + T + L Y L +++ Sbjct: 330 DLVIRMKNATTAATPSMAQATLTYQFQGCLKSVVADEQV 368 >UniRef50_A0DZ93 Chromosome undetermined scaffold_7, whole genome shotgun sequence n=1 Tax=Paramecium tetraurelia RepID=A0DZ93_PARTE Length = 522 Score = 184 bits (466), Expect = 1e-44, Method: Composition-based stats. Identities = 63/310 (20%), Positives = 120/310 (38%), Gaps = 31/310 (10%) Query: 129 DTGSYANVRRFLN------QGLLP----------PPDAVRVEEIVNYFPSDWDIKDKQSI 172 D SYAN + + +G P DA+ V I N + + Sbjct: 9 DNDSYANTTKAIVINEQFIEGERPAICQEHGKFNDDDAIDV-VITNESNYGRKSLSQNYM 67 Query: 173 PAS-----KPIPFAMRYELAPAPWNEQRTLLKVDILAKDRKSEELPASNLVFLIDTSGSM 227 + + + Y P + + + + K++ +L+ LID SGSM Sbjct: 68 KQANYVLQDNVELKLSYSGLPTQGTQA---VLLSVQTKNQAITIRQGIDLICLIDHSGSM 124 Query: 228 ISDERLPLIQSSLKLLVKELREQDNIAIVTYAGDSRIA--LPSISGSHKAEINAAIDSLD 285 S E++ L++ SLK L+K L+ D + ++ + + L + + + AID+++ Sbjct: 125 -SGEKMHLVKKSLKHLLKMLQPNDRLCLIEFDDQNYRLTRLMRATQENMYKFLIAIDTIE 183 Query: 286 AEGSTNGGAGLELAYQQATKGFIKGGINRILLATDGDFNVGIDDPKSIESMVKKQRESGV 345 A G+T+ G +++A K I I L +DG+ + + K + Sbjct: 184 ANGATDIGNAMKMALSILKHRRFKNPIASIFLLSDGEDEGAAG--RVWNDIQSKNIKEPF 241 Query: 346 TLSTFGVGNSNYNEAMMVRIADVGNGNYSYIDTLSEAQKVLNSEMRQMLITVAKDVKAQI 405 T++TFG G + +M IA G + YI +S+ + + +A + I Sbjct: 242 TINTFGFGR-DCCPKIMSEIAHFKEGQFYYISEISKIDECFFEALGGEASVIAYNTHITI 300 Query: 406 EFNPAWVTEY 415 + + Sbjct: 301 SCKKNTIKKV 310 >UniRef50_A0C051 Chromosome undetermined scaffold_14, whole genome shotgun sequence n=3 Tax=Paramecium tetraurelia RepID=A0C051_PARTE Length = 636 Score = 184 bits (466), Expect = 1e-44, Method: Composition-based stats. Identities = 47/258 (18%), Positives = 114/258 (44%), Gaps = 9/258 (3%) Query: 208 KSEELPASNLVFLIDTSGSMISDERLPLIQSSLKLLVKELREQDNIAIVTYAGDSRIALP 267 + + +L+ LID SGSMI ++ ++++SL +L++ L + D + ++T+ D+ P Sbjct: 153 QKNQRVGVDLICLIDISGSMIG-VKIEMVKASLIVLLQFLGDNDRLQLITFDNDAHRLTP 211 Query: 268 --SISGSHKAEINAAIDSLDAEGSTNGGAGLELAYQQATKGFIKGGINRILLATDGDFNV 325 +++ +K+ I + A G ++A+ Q + + L +DG Sbjct: 212 LKTVTNQNKSYFTQIIKQIKANGGNRISEATKMAFYQLKSRKYINNVTSVFLLSDGVDYT 271 Query: 326 GIDDPKSIESMVKKQRESGVTLSTFGVGNSNYNEAMMVRIADVGNGNYSYIDTLSEAQKV 385 + I+++ + TL TFG G +++ MM ++ ++ +G++ ++ ++ + Sbjct: 272 YPEVKNQIQTVNEV-----FTLHTFGFGE-DHDAQMMTQLCNLKSGSFYFVQDVTLLDEF 325 Query: 386 LNSEMRQMLITVAKDVKAQIEFNPAWVTEYRQIGYEKRQLRVEHFNNDNVDAGDIGAGKH 445 + ++ V + ++ + + + QI + + N + I +G Sbjct: 326 FADALGGLISVVGEQLEITLSSSAPPPYQDIQISKTYGNMWQKKGNQYYITQPQIASGSR 385 Query: 446 ITLLFELTLNGQKASIDK 463 +FEL L + I+ Sbjct: 386 KDYVFELALPKFEGKIED 403 >UniRef50_D0LJL4 Myxococcales GC_trans_RRR domain protein n=1 Tax=Haliangium ochraceum DSM 14365 RepID=D0LJL4_HALO1 Length = 602 Score = 182 bits (463), Expect = 3e-44, Method: Composition-based stats. Identities = 85/424 (20%), Positives = 165/424 (38%), Gaps = 67/424 (15%) Query: 137 RRFLNQGLLPPPDAVRVEEIVNYFPSDWDIKDKQSIPASKPIPFAMRYELAP-----APW 191 R L G +P + F + + PA P ++ A A Sbjct: 54 RSILEAGGIPAASTLDAAGF---FAEHY----VEMPPADCGQPLCLQAMSARGNEWTANG 106 Query: 192 NEQRTLLKVDILAK-DRKSEELPASNLVFLIDTSGSMISDERLPLIQSSLKLLVKELREQ 250 E++ +L V + E +LV ++DTSGSM +D R+ ++ L LLV + E Sbjct: 107 VEEQIVLAVAMNTPIGPDDIEPRPLDLVVVVDTSGSMATDARMDYVRQGLHLLVDAVDED 166 Query: 251 DNIAIVTYAGDSRI--ALPSISG---------------------------------SHKA 275 D +A+V+Y + + LP++ + ++ Sbjct: 167 DRLALVSYQSFAEVHAELPALPVEETPEEPTEPTDPVGEPTDPPADPDEDPVDEREAWRS 226 Query: 276 EINAAIDSLDAEGSTNGGAGLELAYQQATKGFI--KGGINRILLATDGDFNVGIDDPKSI 333 E++A +D+L G TN GLE ++ A + + R++L +DG GI D SI Sbjct: 227 EMHALVDTLQPGGGTNIYEGLERGFEIAKEARVNHPDRAQRVILLSDGLATEGITDSASI 286 Query: 334 ESMVKKQRESGVTLSTFGVGNSNYNEAMMVRIADVGNGNYSYIDTLSEAQKVLNSEMRQM 393 ++ + E G+ L+T GVG S +N +M +A+ G GN+ +++ ++V E+ Sbjct: 287 IALSEAFIEGGMGLTTVGVGAS-FNVELMRGLAERGAGNFYFVEDPEAVREVFTEELDYF 345 Query: 394 LITVAKDVKAQIEFNPAWVTEYRQIGYEKRQLRVEHFNNDNVDAGDIGAGKHITLLFELT 453 +A V ++ + +G +L N+ ++ + + + Sbjct: 346 AEPLATAVSIEVRTTDGYG-LGEVVG---TRLWSTEGNSGSMYLPAVFVASRKS-----S 396 Query: 454 LNGQ---KASIDKLRYAPDNKLAKSDKTKELAWLKIRWK----YPQGKESQLVEFPLGPT 506 G+ + + + P + ++ A + +R+ P ++SQ E + Sbjct: 397 APGEYGGRRGGGGMLFLPLYPSIDTGFSEAAALVTLRYSAADGAPGSEQSQTTEVIIPAR 456 Query: 507 INAP 510 A Sbjct: 457 FGAS 460 >UniRef50_A0C5K4 Chromosome undetermined scaffold_150, whole genome shotgun sequence n=1 Tax=Paramecium tetraurelia RepID=A0C5K4_PARTE Length = 611 Score = 182 bits (462), Expect = 3e-44, Method: Composition-based stats. Identities = 60/282 (21%), Positives = 127/282 (45%), Gaps = 15/282 (5%) Query: 181 AMRYELAPAPWNEQRTLLKVDILAKDRKSE---ELPASNLVFLIDTSGSMISDERLPLIQ 237 ++R + + + + I K+ ++E +L+ LID S SM D + +++ Sbjct: 153 SLRTSCKVSNYKSEYIPAMISIKTKENQTEMTERTIGIDLICLIDKSMSMSGDN-INMVK 211 Query: 238 SSLKLLVKELREQDNIAIVTYAGDSRIALPS--ISGSHKAEINAAIDSLDAEGSTNGGAG 295 SL LL+ L EQD + I+T+ ++ P ++ +K A I + AEG T + Sbjct: 212 KSLLLLLDFLGEQDRLQIITFNEHAQRLTPLKCLTEKNKQYFQAVISQISAEGLTKISSA 271 Query: 296 LELAYQQATKGFIKGGINRILLATDGDFNVGIDDPKSIESMVKKQRESGVTLSTFGVGNS 355 +A++Q + + + + L +DG D I ++ +E T+STFG G+ Sbjct: 272 TYIAFKQLKEKVYRNNVTSVFLLSDGHDG---DALFEISDQIRHVKEV-FTISTFGFGD- 326 Query: 356 NYNEAMMVRIADVGNGNYSYIDTLSEAQKVLNSEMRQMLITVAKDVKAQIEFNPAWVTEY 415 +++ MM I+++ NGN+ Y+ ++ + + ++ +A+ ++ + + Sbjct: 327 DHDAQMMTSISNLKNGNFYYVKDITLLDEFFAHALGGIVSVIAEQIQISLSLTLTKPLQD 386 Query: 416 RQIG--YEKRQLRVEHFNNDNVDAGDIGAGKHITLLFELTLN 455 QI Y + EH ++ + +G +FE+ + Sbjct: 387 VQISKTYGNMWKKREHA--YEINIPQLASGTRKDFVFEIQIP 426 >UniRef50_UPI00006CAF43 von Willebrand factor type A domain containing protein n=1 Tax=Tetrahymena thermophila RepID=UPI00006CAF43 Length = 631 Score = 181 bits (459), Expect = 7e-44, Method: Composition-based stats. Identities = 52/204 (25%), Positives = 101/204 (49%), Gaps = 4/204 (1%) Query: 205 KDRKSEELPASNLVFLIDTSGSMISDERLPLIQSSLKLLVKELREQDNIAIVTYAGDSRI 264 + ++ E +L+ +ID SGSM S ++ L++ SLK L+K + E D I ++++ +I Sbjct: 133 QPKEQSERVPMDLICVIDDSGSM-SGKKAQLVRKSLKYLLKIMNENDRICLISFDSVEKI 191 Query: 265 ALPSISG--SHKAEINAAIDSLDAEGSTNGGAGLELAYQQATKGFIKGGINRILLATDGD 322 P + +K+E+ AI ++ GSTN AG+E K I + L +DG Sbjct: 192 LTPFLRNNLENKSELKKAIKNIVGRGSTNIEAGMEAGLWMIKNRKEKNPITCMFLLSDGQ 251 Query: 323 FNVGIDDPKSIESMVKKQRESGVTLSTFGVGNSNYNEAMMVRIADVGNGNYSYIDTLSEA 382 + D + + + + ++T+G G ++++ M IA+ G Y YI+ + + Sbjct: 252 DDSPQVDLRVQKLIQSYDIQDTFIVNTYGYG-ADHDATQMRNIAETHKGGYYYIEDVKKV 310 Query: 383 QKVLNSEMRQMLITVAKDVKAQIE 406 + + +L V +DV+ +I+ Sbjct: 311 SEWFVLSISGLLSAVGEDVRIRIK 334 >UniRef50_Q7S708 Predicted protein n=1 Tax=Neurospora crassa RepID=Q7S708_NEUCR Length = 766 Score = 181 bits (458), Expect = 9e-44, Method: Composition-based stats. Identities = 60/346 (17%), Positives = 124/346 (35%), Gaps = 46/346 (13%) Query: 179 PFAMRYELAPAPWNEQRTLLKVDILAKDRKSEELP-------ASNLVFLIDTSGSMISDE 231 P A + E+ P P + LL+V R LP ++V ID SGSM +D Sbjct: 29 PVAPKLEIHPLPSHTSGLLLRV---IPPRSPPNLPDPNFHHVPCDIVLAIDVSGSMSADA 85 Query: 232 R---------------------LPLIQSSLKLLVKELREQDNIAIVTYAGDSRIALP--S 268 L L++ + + +V L D + IVT++ ++++ P Sbjct: 86 PVPTTASADYTNEQPEHNGLSVLDLVKHAARTIVSTLNSSDRLGIVTFSTEAKVLQPLMP 145 Query: 269 ISGSHKAEINAAIDSLDAEGSTNGGAGLELAYQQATKGFIKGGINRILLATDGDFNVGID 328 ++ +K + + + +TN G+ + G + +++ TDG N + Sbjct: 146 MTALNKKKTERNLGGMQPFSATNLWGGIVEGLKLFDGQ--SGRMPALMVLTDGMPNH-MC 202 Query: 329 DPKSIESMVKKQRESGVTLSTFGVGNSNYNEAMMVRIADVGNGNYSYIDTLSEAQKVLNS 388 + + ++ + TFG G S ++ +A++G G YS+I V Sbjct: 203 PAQGYVAKLRAMETLPAAIHTFGFGYS-LRSGLLKSVAEIGGGGYSFIPDAGMIGTVFVH 261 Query: 389 EMRQMLITVAKDVKAQI---------EFNPAWVTEYRQIGYEKRQLRVEHFNNDNVDAGD 439 + + T A +V ++ E V + + EK + + ++ Sbjct: 262 SVANLQSTFANNVVLRLTYPKYLGLEETTGESVDKVESVQLEKGDVDPDSSMQLTLNLST 321 Query: 440 IGAGKHITLLFELTLNGQKASIDKLRYAPDNKLAKSDKTKELAWLK 485 + G+ + Q+A D + + + + + Sbjct: 322 LQYGQSRDIFLRYDSKAQEAIADGFDFESPPSVLATLDYQHFTNIT 367 >UniRef50_A6G857 Putative uncharacterized protein n=1 Tax=Plesiocystis pacifica SIR-1 RepID=A6G857_9DELT Length = 540 Score = 181 bits (458), Expect = 1e-43, Method: Composition-based stats. Identities = 77/396 (19%), Positives = 153/396 (38%), Gaps = 38/396 (9%) Query: 203 LAKDRKSEELPASNLVFLIDTSGSMISDERLPLIQSSLKLLVKELREQDNIAIVTYAGDS 262 D + P NL +D S SM E + ++ L + ++L +D + +V + ++ Sbjct: 146 TPIDPAELDRPPLNLTIAVDLSKSMEG-EPIDRVRQGLLQMREQLEPEDRVTLVGFGDEA 204 Query: 263 RIALPSISGSHKAEINAAIDSLDAEGSTNGGAGLELAYQQATKGFIKGGINRILLATDGD 322 ++ + + E+ AI +L GSTN AGL A++Q +G NR+LL +DG Sbjct: 205 QVIVENA-DKDSVELATAIAALVPWGSTNLYAGLRTAFEQTDLYAQEGWQNRVLLVSDGV 263 Query: 323 FNVGIDDPKSIESMVKKQRESGVTLSTFGVGNSNYNEAMMVRIADVGNGNYSYIDTLSEA 382 GI + IE + + G L+T G+GN +++ +M ++++G+G++ Y++ Sbjct: 264 PTTGIVNSDKIEGLAEAWSGMGYGLTTVGIGN-DFDIELMRNLSELGSGSFYYVEDPDAV 322 Query: 383 QKVLNSEMRQMLITVAKDVKAQIEFNPAWVTEYRQIGYEKRQLRVEHFNNDNVDAGDIGA 442 +V + E++ + +A+DV + + R + K+ N +D + Sbjct: 323 IEVFSEEVQAFTVPLAEDVIIDATVFEGY--DLRAVYGTKQVETW--GNQALIDIPILQI 378 Query: 443 GKHITLLFELTLNGQKASIDKLRYAPDNKLAKSDKTKELAWLKIRWKYPQG----KESQL 498 + + L + E+ L+ + P ++ Sbjct: 379 AHREGA-----SDNENGRRGGGGAMIFELLPTGESPGEVGRLEFEYTVPGTEEVVEQVVE 433 Query: 499 VEFPLGPTINAPSEDMRFRAAV----------AAYGQK---LRGSEYLNNTSWQQ----- 540 + PLGP P + AV + +Y + + Sbjct: 434 ISSPLGP-WELPEDGFFEADAVEKSFVMLNIYVGFEMAASRAAAGDYAGALTVLEPLVLS 492 Query: 541 IKQWAQQAKGEDPQG---YRAEFIRLIELADGVTDI 573 ++ W + ED + Y FI +E G T+ Sbjct: 493 VEDWLSANEDEDIEDDLFYINLFIDNLEAQGGATNS 528 >UniRef50_D2VHB8 Predicted protein n=1 Tax=Naegleria gruberi RepID=D2VHB8_NAEGR Length = 755 Score = 180 bits (457), Expect = 1e-43, Method: Composition-based stats. Identities = 62/297 (20%), Positives = 123/297 (41%), Gaps = 25/297 (8%) Query: 181 AMRYELAPAPWNEQRTLLKVDILAKDRKSEELPASNLVFLIDTSGSMI------------ 228 ++ + +E + + V + + NLV ++D SGSM Sbjct: 99 SLNLTIVSKQVSESKRRIHVKVSPPTGG--QRQPCNLVCILDVSGSMGSSAEDLSSSNEN 156 Query: 229 -SDERLPLIQSSLKLLVKELREQDNIAIVTYAGDSRIALP--SISGSHKAEINAAIDSLD 285 RL L++ S++ L++ + E+D I+++ ++ +R+ LP + K + ++ L Sbjct: 157 TGFSRLDLVKHSVRTLIELMNEKDQISLIPFSDSARMELPLTKMDAVGKKKAIEKLEHLG 216 Query: 286 AEGSTNGGAGLELAYQQATKGFIKGGIN-RILLATDGDFNVGIDDPKSIESMVKKQRESG 344 EGSTN GL L + + + N ++L TDG+ N I+ P+ I ++K + Sbjct: 217 PEGSTNVWDGLRLGMESSLNNPLCAKTNTCLILFTDGEPN--INPPRGIVPTLEKYIKEH 274 Query: 345 ---VTLSTFGVGNSNYNEAMMVRIADVGNGNYSYIDTLSEAQKVLNSEMRQMLITVAKDV 401 T+ +FG G S + A++ IA G+G YSYI S + M +L T + Sbjct: 275 PLNSTIHSFGFGYS-LDSALLKDIAMNGSGAYSYIPDCSMVGTTFVNMMSNILCTAVRRA 333 Query: 402 KAQIEFNPAWVTEYRQIGYEKRQLRVEHFNNDNVDAGDIGAGKHITLLFELTLNGQK 458 + I + G + + + G + + + ++ ++ Sbjct: 334 ELVISSMNGAKISH-VYGSSQNGNNSTNEKQFTISMGGVQFQQSRDYIIDVDMHANN 389 >UniRef50_D0LD98 von Willebrand factor type A n=1 Tax=Gordonia bronchialis DSM 43247 RepID=D0LD98_GORB4 Length = 423 Score = 180 bits (457), Expect = 1e-43, Method: Composition-based stats. Identities = 74/320 (23%), Positives = 131/320 (40%), Gaps = 27/320 (8%) Query: 192 NEQRTLLKVDILAKDRKSEELPASNLVFLIDTSGSMISDERLPLIQSSLKLLVKELREQD 251 + + ++I+A + K + + L ++D SGSM L Q +L ++ +L +D Sbjct: 16 SADEVTVLLEIVAPEGKVTDRAPAALQVVLDRSGSMSGP-PLAGAQRALAGVIGQLDPRD 74 Query: 252 NIAIVTYAGDSRIALPSISGSHKAEINAAIDSLDAEGSTNGGAGLELAYQQATKGFIKGG 311 +VT+ D+++ LP+ + KA A+ S+ G T+ +G Q+ + G Sbjct: 75 VFGVVTFDDDAQVVLPAAPLADKARAVDAVGSIVPGGCTDLSSGYLRGLQELRRATASAG 134 Query: 312 IN--RILLATDGDFNVGIDDPKSIESMVKKQRESGVTLSTFGVGNSNYNEAMMVRIADVG 369 I +L+ +DG N GI D S+ K G+ ST G G Y+E ++ IA G Sbjct: 135 IRGGTVLVISDGHVNRGIRDLDEFASITAKAAADGIITSTLGYGRG-YDETLLSAIARSG 193 Query: 370 NGNYSYIDTLSEAQKVLNSEMRQMLITVAKDVKAQIEFNPAWVTEYRQIGYEKRQLRVEH 429 NGN+ + D A + E+ +L A+ V + + VTE L Sbjct: 194 NGNHVFADDPDAAGAAIAGEVDGLLSKSAQAVTLTVRYIQK-VTELSLY----NDLPAHQ 248 Query: 430 FNNDNV--DAGDIGAGKHITLLFELTLNGQKASIDKLRYAPDNKLAKSDKTKELAWLKIR 487 + V + GD+ A + LL + ++ S +LA L++R Sbjct: 249 IGDGEVMIELGDLYAAEARKLLLRMKVD----------------ALASLGLAQLAMLELR 292 Query: 488 WKYPQGKESQLVEFPLGPTI 507 + V P+ + Sbjct: 293 YVQTATLTEHTVTLPISVNV 312 >UniRef50_A5UTA6 von Willebrand factor, type A n=6 Tax=Chloroflexi (class) RepID=A5UTA6_ROSS1 Length = 425 Score = 180 bits (456), Expect = 2e-43, Method: Composition-based stats. Identities = 76/369 (20%), Positives = 141/369 (38%), Gaps = 27/369 (7%) Query: 214 ASNLVFLIDTSGSMISDERLPLIQSSLKLLVKELREQDNIAIVTYAGDSRIALPSISGSH 273 NL ++D S SM ERL ++ + +V +L D ++V + + + +P+ Sbjct: 44 PLNLCLVLDRSSSMRG-ERLMQVKEAAARIVDQLGPDDYFSLVVFNDRADVVIPAQRAIK 102 Query: 274 KAEINAAIDSLDAEGSTNGGAGLELAYQQATKGFIKGGINRILLATDGDFNVGIDDPKSI 333 K+++ AAI ++A G T GL LA Q+ + F+ GI+R++L TDG D Sbjct: 103 KSDLKAAIAQIEAAGGTEMAQGLALALQEVQRPFLTRGISRLILLTDGRT---YGDESRC 159 Query: 334 ESMVKKQRESGVTLSTFGVGNSNYNEAMMVRIADVGNGNYSYIDTLSEAQKVLNSEMRQM 393 + ++ + G+ L+ G+G + +NE ++ + N YI T + KV E++++ Sbjct: 160 VEIARRGQSRGIGLTALGIG-TEWNEDLLETMTASENSRAQYIATAQDVVKVFADEVKRL 218 Query: 394 LITVAKDVKAQIEFNPAWVT-EYRQIGY--EKRQLRVEHFNNDNVDAGDIGAGKHITLLF 450 A+ V +E P + Q+ + E + GD L Sbjct: 219 HAIFAQQVHLSLETRPGALIRSLDQVRPFIAPITIVEEEERRWAANLGDWPDTGVQGFLI 278 Query: 451 ELTLNGQKASIDK-----LRYAPDNKLAKSDKTKELAWLKIRWKYPQGKES--------- 496 E+ + LRY + +EL + +R Sbjct: 279 EVVVPALPVGDYPVLKLTLRYHLPAANLRDQVREELIRISMRPAAEVTHRVDATVKHWLE 338 Query: 497 QLVEFPLGPTINAPSEDMRFRAA-----VAAYGQKLRGSEYLNNTSWQQIKQWAQQAKGE 551 +LV + L + + + R A +A G L +T Q+ + + Sbjct: 339 RLVAYRLQASAWKHAAEGRLAEASERLHMAGTRMLNVGDTALAHTLQQEATRILRSGAAS 398 Query: 552 DPQGYRAEF 560 + R F Sbjct: 399 EEGRKRIRF 407 >UniRef50_Q24C76 von Willebrand factor type A domain containing protein n=2 Tax=Tetrahymena thermophila RepID=Q24C76_TETTH Length = 670 Score = 180 bits (456), Expect = 2e-43, Method: Composition-based stats. Identities = 55/271 (20%), Positives = 106/271 (39%), Gaps = 22/271 (8%) Query: 203 LAKDRKSEELPASNLVFLIDTSGSMISDER-----------------LPLIQSSLKLLVK 245 L + SN+ ++D SGSM S+ + L +++ S+K++V Sbjct: 21 LVPPQNYSSRTNSNICCVVDVSGSMSSEAKIINQSSQKSDENYSLSILDVVKHSIKMIVN 80 Query: 246 ELREQDNIAIVTYAGDSRIA--LPSISGSHKAEINAAIDSLDAEGSTNGGAGLELAYQQA 303 L +D ++IVT++ + + L ++ S+K I++L EG T GL A Sbjct: 81 TLGSEDYLSIVTFSDSANVLFDLLPMNDSNKTMAIEKIENLSTEGGTELWKGLNSALNIL 140 Query: 304 TKGFIKGGINRILLATDGDFNVGIDDPKSIESMVKKQRESGVTLSTFGVGNSNYNEAMMV 363 I L TDG D + + + T++TFG +S+ N +M Sbjct: 141 LNNKTPNTNQSIFLLTDGQPTDSGIDTN-LVKFKQAYPKLNCTINTFGF-SSSSNSELMN 198 Query: 364 RIADVGNGNYSYIDTLSEAQKVLNSEMRQMLITVAKDVKAQIEFNPAW-VTEYRQIGYEK 422 +IA NG +S+I S + + L + I +T + + Sbjct: 199 KIAMEYNGMFSFIPDASFIATAFANALANTLTVYTNNCLLHITTLEGSNLTLHPAHSFLV 258 Query: 423 RQLRVEHFNNDNVDAGDIGAGKHITLLFELT 453 +++ + +D G + + +F++ Sbjct: 259 KKMNSQGQQEILIDVGVVNTQQTRDFIFKIN 289 >UniRef50_A6G9E8 von Willebrand factor type A domain protein n=1 Tax=Plesiocystis pacifica SIR-1 RepID=A6G9E8_9DELT Length = 532 Score = 179 bits (455), Expect = 2e-43, Method: Composition-based stats. Identities = 78/329 (23%), Positives = 134/329 (40%), Gaps = 26/329 (7%) Query: 214 ASNLVFLIDTSGSMISDERLPLIQSSLKLLVKELREQDNIAIVTYAGDSRIALPSISGSH 273 NL +ID SGSM +R + ++ LR+ D +++V+Y + +P + Sbjct: 134 PLNLAIVIDHSGSMKG-QRERNALDAAAGMISRLRDGDTVSVVSYNTKAHTIVPVTTLDA 192 Query: 274 KAEINAAIDSLD------AEGSTNGGAGLELAYQQATKGFIKGGINRILLATDGDFNVGI 327 + + I L G+T G+E Q GI+R+LL +DG+ N G+ Sbjct: 193 RNR-DRVISDLRVGVASRPSGNTCVSCGVEAGLQTLQGRRP--GIDRMLLLSDGEANRGV 249 Query: 328 DDPKSIESMVKKQRESGVTLSTFGVGNSNYNEAMMVRIADVGNGNYSYIDTLSEAQKVLN 387 D I + ++ R GV++S+ GV + +YNE +M IA NG + + +T S + + Sbjct: 250 RDEPGIRRLAREARNRGVSISSIGV-DVDYNEVLMSAIAREANGRHYFSETGSNLDAIFD 308 Query: 388 SEMRQMLITVAKDVKAQIEFNPA-WVTEYRQIGYEKRQLRVEHFNNDNVDAGDIGAGKHI 446 E+ ++ +AKD + +E P V E Y++ RV V G AG+ Sbjct: 309 QELDSLIQAIAKDGQVIVELAPGVRVVEVFDRSYQQVDRRV------IVPMGTFSAGEDK 362 Query: 447 TLLFELTLNGQKASIDKLRYAPD--NKLAKSDKTKELAWLKIRWKYPQGKESQLVEFPLG 504 TLL L + L + + L+ + + L + S L L Sbjct: 363 TLLMRLEVPPSPEGSRPLAHVSLSYDDLSTREPGECFGELATVMTATPAEVSPLDAIVLA 422 Query: 505 PTINAPSE------DMRFRAAVAAYGQKL 527 + + + F A Q+L Sbjct: 423 RLTKSETAKTLQQSNALFALGRADEAQRL 451 >UniRef50_B8AE57 Putative uncharacterized protein n=1 Tax=Oryza sativa Indica Group RepID=B8AE57_ORYSI Length = 585 Score = 179 bits (454), Expect = 3e-43, Method: Composition-based stats. Identities = 77/373 (20%), Positives = 140/373 (37%), Gaps = 43/373 (11%) Query: 174 ASKPIPFAMRYELAPAPWNEQRTLLKVDILAKDRKSEELP---ASNLVFLIDTSGSMISD 230 ++ P+ + L P +V + + +L ++V ++D SGSM Sbjct: 2 SADPVKVSTTTMLPTIPRGHTNKDFRVLLRVEAPPMADLKGHVPIDVVAVLDVSGSMGDP 61 Query: 231 -------------ERLPLIQSSLKLLVKELREQDNIAIVTYAGDS----RIALPSISGSH 273 RL +++ ++K ++++L + D ++IV + L +ISG+ Sbjct: 62 AMASSDFEKNKPPSRLDVLKEAMKFIIRKLDDGDRLSIVAFNDRPVKEYSTGLLNISGNG 121 Query: 274 KAEINAAIDSLDAEGSTNGGAGLELAYQQA--TKGFIKGGINRILLATDGDFNVGID-DP 330 + +D L+A G T LE A + G + + ILL TDGD G Sbjct: 122 RRIAEKKVDWLEARGGTALMPALEEAIRVLDCRPGDSRNSVGFILLLTDGDDTSGFRWSR 181 Query: 331 KSIESMVKKQRESGVTLSTFGVGNSNYNEAMMVRIADVGNGNYSYIDTLSE--AQKVLNS 388 I V K + TFG+G ++ +EA++ IA G YS++D + L Sbjct: 182 DVINGAVGKY-----PVHTFGLGAAHSSEALLH-IAQESRGTYSFVDDENMDKIAGALAV 235 Query: 389 EMRQMLITVAKDVKAQI---EFNPAWVTEYRQIGYEKRQLRVEHFNNDNVDAGDIGAGKH 445 + + A D + + E + A + GYE R + V G + AG+ Sbjct: 236 CIGGVKTVAAVDTRVSVRVAELSGARIERIDSCGYESRVACG--GASGEVVVGVLYAGEV 293 Query: 446 ITLLFELTLNGQKASIDKLRYAPDNKLAKSDKT-----KELAWLKIRWKYPQGKESQLVE 500 + + L L AS+ L + D E L + + Y + ++ + Sbjct: 294 KSFIIHLHLP--AASVSSLECGYCDAATTCDHHCPRRRHEQRLLDVGYSYRRAPDASAIS 351 Query: 501 FPLGPTINAPSED 513 E+ Sbjct: 352 IVGRGVFVQRPEE 364 >UniRef50_Q2GTB7 Putative uncharacterized protein n=1 Tax=Chaetomium globosum RepID=Q2GTB7_CHAGB Length = 777 Score = 178 bits (452), Expect = 4e-43, Method: Composition-based stats. Identities = 69/352 (19%), Positives = 125/352 (35%), Gaps = 39/352 (11%) Query: 181 AMRYELAPAPWNEQRTLLKVDILAKDRKSE---ELPASNLVFLIDTSGSMISDER----- 232 +++ +L P +R L V I +LV ID SGSM + Sbjct: 36 SLQLQLHPFSSEHERGGLIVKIQPPREPENADLHHVPCDLVLSIDISGSMADEAPAPSKP 95 Query: 233 ------------LPLIQSSLKLLVKELREQDNIAIVTYAGDSRIALPSISGSHKAEINAA 280 + L++ + + +V L +D + IVT+ S++ +P +KA+ Sbjct: 96 GGEAGEDTGLRVIDLVKHAARTIVATLDSRDRLGIVTFTNRSKVGIPPY--ENKAKTLEN 153 Query: 281 IDSLDAEGSTNGGAGLELAYQQATKGF--IKGGINRILLATDGDFNVGIDDPKSIESMVK 338 I+S++ STN G+ ++ G + +L+ TDG N + PK M++ Sbjct: 154 IESMEPFSSTNMWHGIRDGLSLFSEAEGGSTGRVPALLVLTDGMPNY-MCPPKGYVPMLR 212 Query: 339 KQRESGVTLSTFGVGNSNYNEAMMVRIADVGNGNYSYIDTLSEAQKVLNSEMRQMLITVA 398 T+ TFG G ++ IA+VG GNYS+I V + + T A Sbjct: 213 SMEPLPATIHTFGFG-YELRSGLLKSIAEVGGGNYSFIPDAGMLGTVFIHAVAHLQSTFA 271 Query: 399 KDVKAQIEFNPAWVTEYRQIG--------YEKRQLRVEHFNNDNVDAGDIGAGKHITLLF 450 + K ++ + P+++ G E E + + +I G+ Sbjct: 272 NNAKLRLTY-PSYLKLEEMTGEAVGRQEPVELEGDVPESMTSLTIPIDNIQYGQSRD--- 327 Query: 451 ELTLNGQKASIDKLRYAPDNKLAKSDKTKELAWLKIRWKYPQGKESQLVEFP 502 + L + + N E ++S P Sbjct: 328 -ICLRYGNLAAAIQKEGGANPPPAITAVLEYQHFSPTVHQAVAQQSPFASTP 378 >UniRef50_B8F8Z6 von Willebrand factor type A n=1 Tax=Desulfatibacillum alkenivorans AK-01 RepID=B8F8Z6_DESAA Length = 480 Score = 178 bits (451), Expect = 6e-43, Method: Composition-based stats. Identities = 67/259 (25%), Positives = 118/259 (45%), Gaps = 12/259 (4%) Query: 207 RKSEELPASNLVFLIDTSGSMISDERLPLIQSSLKLLVKELREQDNIAIVTYAGDS--RI 264 + + ++V ++D SGSM +++ ++++K LV+ LR QD ++VTY+ Sbjct: 85 PEQTKTKPVDMVIVLDRSGSM-GGQKVRDAKAAVKGLVEGLRSQDRFSLVTYSNSVNGGD 143 Query: 265 ALPSISGSHKAEINAAIDSLDAEGSTNGGAGLELAYQQATKGFIKGGINRILLATDGDFN 324 L ++ + +N +DS+ A G TN G GLE + +++L +DG N Sbjct: 144 GLHYLTADKRNSLNWMVDSIPAGGGTNLGGGLEKGVGVLRAYGAPDRMGKVILISDGQAN 203 Query: 325 VGIDDPKSIESMVKKQRESGV--TLSTFGVGNSNYNEAMMVRIADVGNGNYSYIDTLSEA 382 G+ DP + +M R+ G+ +++T G+G ++NE +M +AD G G Y Y++ + Sbjct: 204 QGVTDPNQLAAMAA-LRDDGLVYSVTTVGIGQ-DFNEQLMATVADGGRGRYYYLENPGDF 261 Query: 383 QKVLNSEMRQMLITVAKDVKAQIEFNPAWVTEYRQIGYEKRQLRVEHFNNDNVDAGDIGA 442 V E A + + P VT GY + N + G + + Sbjct: 262 LAVFQEEANWTRAVAASALSIHLPL-PKGVTAVSANGYP----VINKENGAFISPGALLS 316 Query: 443 GKHITLLFELTLNGQKASI 461 G+ TL L N A I Sbjct: 317 GQSRTLYIRLHANDDAAEI 335 >UniRef50_B8G546 von Willebrand factor type A n=6 Tax=Chloroflexi (class) RepID=B8G546_CHLAD Length = 418 Score = 178 bits (451), Expect = 6e-43, Method: Composition-based stats. Identities = 60/278 (21%), Positives = 119/278 (42%), Gaps = 9/278 (3%) Query: 186 LAPAPWNEQRTLLKVDILAKDRKSEELPASNLVFLIDTSGSMISDERLPLIQSSLKLLVK 245 P Q L V+ +A + LP NL F++D SGSM +L ++++ + +++ Sbjct: 15 PVPTSSTPQVVYLLVEAVAPASPTSALP-LNLCFVLDRSGSM-QGAKLESMKAATRRVIE 72 Query: 246 ELREQDNIAIVTYAGDSRIALPSISGSHKAEINAAIDSLDAEGSTNGGAGLELAYQQATK 305 LR D AIV + + +P+ ++ + AA++++ G T G++ A + K Sbjct: 73 LLRPHDVAAIVIFDDTVQTLIPATPVGDRSALLAAVETITEAGGTAMSLGMQAAQTELQK 132 Query: 306 GFIKGGINRILLATDGDFNVGIDDPKSIESMVKKQRESGVTLSTFGVGNSNYNEAMMVRI 365 I+R+LL TDG D + + ++GV ++ G+G + +NE ++ I Sbjct: 133 HLGPDRISRMLLLTDGQT---WGDEPICRDLARTLGQAGVRITALGLG-TEWNEQLLDDI 188 Query: 366 ADVGNGNYSYIDTLSEAQKVLNSEMRQMLITVAKDVKAQIEF-NPAWVTEYRQIGYEKRQ 424 A +G YI ++ + +++ VA D + + ++ Sbjct: 189 AAASDGYSDYIADPAQIETFFQQAVKEAQAVVATDARLLLRLVRDVTPRAIYRVKPVIAN 248 Query: 425 LRVEHFNNDNVDA--GDIGAGKHITLLFELTLNGQKAS 460 L + + V GD+ G+ +L +L L + Sbjct: 249 LGYQPIGDAAVAVRLGDLVGGQPAAVLLDLMLPPRTRG 286 >UniRef50_A9AXC2 von Willebrand factor type A n=6 Tax=Chloroflexi (class) RepID=A9AXC2_HERA2 Length = 421 Score = 177 bits (448), Expect = 1e-42, Method: Composition-based stats. Identities = 58/286 (20%), Positives = 121/286 (42%), Gaps = 9/286 (3%) Query: 188 PAPWNEQRTLLKVDILAKDRKSEELPASNLVFLIDTSGSMISDERLPLIQSSLKLLVKEL 247 PA +Q L +DI A + N+ F++D SGSM D ++ ++ + + + + Sbjct: 17 PALQTQQVVYLLLDITATPAVAHVQMPVNVSFVLDHSGSMKGD-KMRCVREATQRALGLM 75 Query: 248 REQDNIAIVTYAGDSRIALPSISGSHKAEINAAIDSLDAEGSTNGGAGLELAYQQATKGF 307 QD +++V + + + + A + A + + G T LE A + + Sbjct: 76 GPQDIVSVVIFDHRRETIISAQPVRNVAALQAEVGKIKDAGGTKIAPALEAALNEIRRSQ 135 Query: 308 IKGGINRILLATDGDFNVGIDDPKSIESMVKKQRESGVTLSTFGVGNSNYNEAMMVRIAD 367 I+RI+L TDG + + + ++ ++ V L+ GVG+ ++NE +++ +A+ Sbjct: 136 NANTISRIILLTDGQ----TEGERDCLRLAEEIGKASVPLTALGVGD-DWNEDLLIEMAN 190 Query: 368 VGNGNYSYIDTLSEAQKVLNSEMRQMLITVAKDVKAQIEFNPA-WVTEYRQIGYEKRQLR 426 G Y ++ ++Q V ++ + F Q+ +QL Sbjct: 191 RSGGVAEYFSNPNDIASFFQGAVQQAQSAVVQNSALTLRFVQGVEPRALWQVTPLIQQLP 250 Query: 427 VEHFNN--DNVDAGDIGAGKHITLLFELTLNGQKASIDKLRYAPDN 470 ++ V GDI +H +L E+ ++ ++A +L N Sbjct: 251 YRPISDRAVGVSLGDISKDEHRMVLIEMLVDPKQAGQYRLGQIEVN 296 >UniRef50_A3DLZ3 von Willebrand factor, type A n=1 Tax=Staphylothermus marinus F1 RepID=A3DLZ3_STAMF Length = 416 Score = 177 bits (448), Expect = 1e-42, Method: Composition-based stats. Identities = 69/292 (23%), Positives = 127/292 (43%), Gaps = 11/292 (3%) Query: 213 PASNLVFLIDTSGSMISDERLPLIQSSLKLLVKELREQDNIAIVTYAGDSRIALPSISGS 272 P + +IDTS SM E++ + + L+ LR++D + + +AG L + + Sbjct: 37 PPIAFLIVIDTSYSMDG-EKIFRAKQAALRLLDILRDKDYVGVYGFAGKFYKVLEPVPAT 95 Query: 273 HKAEINAAIDSLDAEGSTNGGAGLELAYQQATKGFIKGGIN--RILLATDGDFNVGIDDP 330 ++ E+ AI L TN L+ ++ K G I+ RI+ TDG+ G P Sbjct: 96 NRNEVEKAIIGLKLGSGTNIYDTLKKLVEETKKVLESGAISLVRIIFITDGEPTTGQKKP 155 Query: 331 KSIESMVKKQRESGVTLSTFGVGNSNYNEAMMVRIADVGNGNYSYIDTLSEAQKVLNSEM 390 + I M KK RE+G + GVG + YNE ++ R+A V NG + ++ + +K+++ Sbjct: 156 EKILEMAKKLREAGASALIIGVG-TEYNEKLLSRMAMVLNGEFEHVSDPASLEKLISEYA 214 Query: 391 RQMLITVAKDVKAQIEFNPAWVTEYRQIGYEKRQLRVEHFNNDNVDAGDIGAGKHITLLF 450 + AK+V +P + + Y V+ GDI + I ++ Sbjct: 215 KSTQEISAKNVAVLFRLSPGFRVDIYNRPYNNIP------EGVEVEIGDIYYREIIDIVG 268 Query: 451 ELTLNGQKASIDKLRYAPDNKLAKSDKTKELAW-LKIRWKYPQGKESQLVEF 501 ++T + + + + +E A + IR K +E+ V+ Sbjct: 269 DITTPPLLIGEAHIGEIQISYVNPETEEQEFATPIPIRIKVKPLEEASTVKV 320 >UniRef50_UPI00006CF36E U-box domain containing protein n=1 Tax=Tetrahymena thermophila RepID=UPI00006CF36E Length = 790 Score = 176 bits (447), Expect = 2e-42, Method: Composition-based stats. Identities = 65/357 (18%), Positives = 137/357 (38%), Gaps = 62/357 (17%) Query: 199 KVDILAKDRKSEELPASNLVFLIDTSGSMISDER---------------LPLIQSSLKLL 243 +V I K + ++ A ++ +ID SGSM + + L L++ S+K + Sbjct: 120 QVKISIKTPEGQQRSACDICCVIDVSGSMSDEAKIKNSKGDIESNGLTILDLVKHSVKTI 179 Query: 244 VKELREQDNIAIVTYAGDSR--IALPSISGSHKAEINAAIDSLDAEGSTNGGAGLELAYQ 301 + L E+D +++V + ++ L ++ + + ++ L STN G+ A + Sbjct: 180 INNLDERDRLSLVAFHTNAYKITDLTPMNENGRNHAIKELEKLIPLDSTNIWDGIYQALE 239 Query: 302 QATKGFIKG--------GINRILLATDGDFNVGIDDPKSIESMVKKQRESG---VTLSTF 350 G + ++ILL TDG NV P+ M+KK +E ++STF Sbjct: 240 VVKAGQQQSIQKGEQRVAFSQILLFTDGQPNV--IPPRGHLPMLKKYKEENDVNCSISTF 297 Query: 351 GVGNSNYNEAMMVRIADVGNGNYSYIDTLSEAQKVLNSEMRQMLITVAKDVKAQIEFNPA 410 G G N + ++ ++A G G++++I V + + ++ T+A D IE + Sbjct: 298 GFG-YNLDSELLDQLAIEGRGSFAFIPDGQFVGTVFVNALSNLMTTLAVDAVLCIENSNG 356 Query: 411 WVTEYRQIGYEK--------RQLRVEHFNNDN----VDAGDIGAGKHITLLFELTLNGQK 458 E I E+ L + + ++ G + G+ ++ + Sbjct: 357 AQFEEVLIEEEQAKNILNKETVLGNYDYQRCSWGLNINIGTLQYGQSKDIVVTM------ 410 Query: 459 ASIDKLRYAPDNKLAKSDKTKELAWLKIRWKYPQGKES-QLVEFPLGPTINAPSEDM 514 K ++ K ++++ + + + +E M Sbjct: 411 ------------KNVNNNSNKPYITATLKYRTSSTHKQPEEISASSSDISQQENEVM 455 >UniRef50_B5YMD8 Predicted protein n=1 Tax=Thalassiosira pseudonana CCMP1335 RepID=B5YMD8_THAPS Length = 868 Score = 176 bits (447), Expect = 2e-42, Method: Composition-based stats. Identities = 75/365 (20%), Positives = 136/365 (37%), Gaps = 33/365 (9%) Query: 168 DKQSIPASKPIPFAMRYELAPAPWNEQRTLLKVDILAKDRKSEE---LPASNLVFLIDTS 224 + + + A E E + I A+D + ++V +D S Sbjct: 87 QQGIREETLQLSVAPHRESIGLQSGEFTGQICATIKARDLPQRDSFARSPIDIVVALDVS 146 Query: 225 GSMISDERLPLIQSSLKLLVKELREQDNIAIVTYAGDSRIALPSISGS--HKAEINAAID 282 GSM E+L L + +L LL++EL D A+++++ D+ I +P + +K + AID Sbjct: 147 GSM-RVEKLDLCKETLHLLLRELHHDDRFALISFSEDAVIEVPMQKVNERNKQQALHAID 205 Query: 283 SLDAEGSTNGGAGLELAYQQATKGFIKGGINRILLATDGDFNVGIDDPKSIESM----VK 338 L +G TN + + LA Q + + L TDG+ N G + + + V+ Sbjct: 206 RLSVKGRTNIASAVSLAAQVVNGVAEPNKVRSVFLLTDGNANTGYTEAIDLVKLTSIFVE 265 Query: 339 KQRESG---VTLSTFGVGNSNYNEAMMVRIADVG-NGNYSYIDTLSEAQKVLNSEMRQM- 393 R ++L TFG G ++ ++ +A G++ + S+ + + Sbjct: 266 ANRNPHTPPISLHTFGYGPEP-DQKLLRGMAMATSGGSFYSVRDNSQVSSAFGDAIGGIL 324 Query: 394 -------LITVAKDVKAQIEFNPAWVTEYRQIGYEKRQLRVEHFNND--NVDAGDIGAGK 444 L VA++V I I R+E + V GD+ A + Sbjct: 325 SLALYSVLSVVAQNVIVTISVPSESAKCGAGI-VAIHDDRMEEIGDGVFQVSLGDLYAEE 383 Query: 445 HITLLFELTLNGQKASIDKLRYAPDNKLAKSDKTKELAWLKIRWKYPQG-------KESQ 497 +LFE TL + +P + L ++ R P S Sbjct: 384 CRDILFETTLVYPTKKPPQSNASPIFYPHALVELSYLDTIRHRSIAPISFLAGIARPNSN 443 Query: 498 LVEFP 502 + +P Sbjct: 444 EISWP 448 >UniRef50_A8J0D9 Flagellar associated protein n=1 Tax=Chlamydomonas reinhardtii RepID=A8J0D9_CHLRE Length = 4349 Score = 176 bits (445), Expect = 3e-42, Method: Composition-based stats. Identities = 57/263 (21%), Positives = 103/263 (39%), Gaps = 24/263 (9%) Query: 208 KSEELPASNLVFLIDTSGSMISDERLPLIQSSLKLLVKELREQDNIAIVTYAGDSRIALP 267 + ++ L ++D SGSM ER+ L++ + L+ +L D + IV+Y+ R +P Sbjct: 967 EVKQRAHVALTCVLDRSGSM-GGERIELVRETCHFLIDQLTADDYLGIVSYSNTVREDVP 1025 Query: 268 --SISGSHKAEINAAIDSLDAEGSTNGGAGLELAYQQATKGFIKGG-------------- 311 ++ + + I SL G T AGLE +Q + Sbjct: 1026 LLRMTPEARRLAHTMISSLTLHGGTALYAGLEAGVKQQMAAASELKALAAAAGGGSDSSR 1085 Query: 312 -INRILLATDGDFNVGIDDPKSIE---SMVKKQRESGVTLSTFGVGNSNYNEAMMVRIAD 367 ++ L TDG G I + ++ + +T+ TFG G+ +++ ++ +A+ Sbjct: 1086 IVHSCFLFTDGQATTGPCTVNEIMGQMTSLQSPADQNITVHTFGFGD-DHSVELLQGVAE 1144 Query: 368 VGNGNYSYIDTLSEAQKVLNSEMRQMLITVAKDVKAQIEFNPAW-VTEYRQIGYEKRQLR 426 +G Y YI + + +L VAKDV+ I P + +R G Sbjct: 1145 AQSGVYYYISCADDIPSGFGDALGGLLAVVAKDVRVSIRTKPGIKLAAFRSGGRVVGATG 1204 Query: 427 VEHFNNDNVDAGDIGA-GKHITL 448 ++ A GA G Sbjct: 1205 AASGARNSAAATTPGAKGHAAPF 1227 >UniRef50_B8BII0 Putative uncharacterized protein n=1 Tax=Oryza sativa Indica Group RepID=B8BII0_ORYSI Length = 585 Score = 176 bits (445), Expect = 3e-42, Method: Composition-based stats. Identities = 80/362 (22%), Positives = 130/362 (35%), Gaps = 55/362 (15%) Query: 190 PWNEQRTLLKVDILAKDRKSEELPASNLVFLIDTSGSM------ISDERLPLIQSSLKLL 243 P NE+R V + E +LV ++D SGSM RL L++ ++K++ Sbjct: 21 PSNEERKEWPVLVHVVAPAKTERFPIDLVAVLDVSGSMTKATSMHGWTRLDLVKGAMKMV 80 Query: 244 VKELREQDNIAIVTYAGDS----RIALPSISGSHKAEINAAIDSLDAEGSTNGGAGLELA 299 +L D +AIV + G L ++ +A+ NA ++ L A G T L+ A Sbjct: 81 TNKLGAGDRLAIVPFNGKVVAAGATRLMEMTTKGRADANAKVNQLKAGGDTKFLPALKHA 140 Query: 300 YQQATKGFIKGGINR---ILLATDGDFNVGIDDPKSIESMVKKQRESGVTL--STFGVGN 354 R I L +DG N +DD + GV TFG+ Sbjct: 141 SGLLDSRPAGDKQYRPGFIFLLSDGQDNGVLDD-----------KLGGVRYPAHTFGMCQ 189 Query: 355 SNYNEAMMVRIADVGNGNYSYIDT-LSEAQKVLNSEMRQMLITVAKDVKAQIEFNPAWVT 413 S N MV IA G+Y ID LS + L + + VA + + Q+ Sbjct: 190 SRCNPKSMVHIATATKGSYHPIDDKLSNVAQALAVFLSGITSAVAVNARVQLHVADNSGV 249 Query: 414 EYRQIGYEKRQLRVEHFNN-----DNVDAGDIGAGKHITLLFELTLNGQKASIDKLRYAP 468 +I +E N ++ G + A + + L + P Sbjct: 250 LINKIDSGAYDKTIESGNGKASSKGTINVGVLSAEEDKKFIVYLDV-------------P 296 Query: 469 DNKLAKSDKTKELAWLKIRWKYPQG--------KESQLVEFPLGPTINAPSED--MRFRA 518 + A++ + L + + P G + S VE P + D + + Sbjct: 297 KLENAQAKPPQLLLTVAGEYSTPAGGRKVENMEESSVQVERPAPAGGATKTGDHLVTWSE 356 Query: 519 AV 520 AV Sbjct: 357 AV 358 >UniRef50_A9FM70 Putative uncharacterized protein n=1 Tax=Sorangium cellulosum 'So ce 56' RepID=A9FM70_SORC5 Length = 507 Score = 175 bits (443), Expect = 4e-42, Method: Composition-based stats. Identities = 64/288 (22%), Positives = 128/288 (44%), Gaps = 10/288 (3%) Query: 204 AKDRKSEELPASNLVFLIDTSGSMISDERLPLIQSSLKLLVKELREQDNIAIVTYAGDSR 263 A+ + + + +V L+D SGSM ++ +++ + V L + D +++ ++A ++ Sbjct: 111 ARAARGQPRAPAAVVLLVDASGSMQGP-KMENARAAAQAFVDRLPDGDLVSVASFADTAQ 169 Query: 264 IALPSISG--SHKAEINAAIDSLDAEGSTNGGAGLELAYQQATKGFIKGGINRILLATDG 321 + S + + AI +L +GSTN AGL+LA Q A + R++L +DG Sbjct: 170 ARVAPTVLGRSTRPAVARAIAALGPDGSTNLFAGLKLAEQHALAAPSTHAVRRVVLISDG 229 Query: 322 DFNVGIDDPKSIESMVKKQRESGVTLSTFGVGNSNYNEAMMVRIADVGNGNYSYIDTLSE 381 N+G P + ++ ++ GV +++ GVG ++Y+E + +A +G ++ E Sbjct: 230 QANIGPSSPDILGALAQRGAAHGVQITSIGVG-ADYDERTLNALAVGSSGRLYHLTEARE 288 Query: 382 AQKVLNSEMRQMLITVAKDVKAQIEFNPAWVTEYRQIGYEKRQLRVEHFNNDNVDAGDIG 441 VL E+ + T A +E PA E + E+ + + V G + Sbjct: 289 MSSVLERELALLQTTAATGA--FVEIVPAPGVELLDVPNERTERSGDALR---VLLGTMF 343 Query: 442 AGKHITLLFELTLNGQKASIDKLRYAPDN-KLAKSDKTKELAWLKIRW 488 G+H +L + A L + + A+S + + R+ Sbjct: 344 GGQHREMLVRARVTAPAAGSHPLASVRLHFRDAESGNLPRVQEVVARY 391 >UniRef50_B6HQM8 Pc22g11730 protein n=17 Tax=Leotiomyceta RepID=B6HQM8_PENCW Length = 1029 Score = 174 bits (442), Expect = 6e-42, Method: Composition-based stats. Identities = 61/327 (18%), Positives = 131/327 (40%), Gaps = 29/327 (8%) Query: 214 ASNLVFLIDTSGSMISDERLPLIQSSLKLLVKELREQDNIAIVTY-AGDSRIALPSISGS 272 +LV +I S SM ++ L++ +LK LV+ L +D + +VT+ + + L ++ Sbjct: 513 PLDLVVVIPVSSSM-QGLKITLLRDALKFLVQNLGPRDRMGLVTFGSSGGGVPLVGMTTK 571 Query: 273 HKAEINAAIDSLDAEGS----TNGGAGLELAYQQATKGFIKGGINRILLATDGDFNVGID 328 A + ++S+ G + G +A + ++ ILL +D I Sbjct: 572 SWAGWSKILESIRPVGQKSLRADVVEGANVAMDLLMQRKFNNPVSTILLISD----SSIS 627 Query: 329 DPKSIESMVKKQRESGVTLSTFGVGNSNYNEAMMVRIADVGNGNYSYIDTLSEAQKVLNS 388 DP+S++ +V + + VT+ +FG+G + + M+ ++ G+YSY+ ++ + Sbjct: 628 DPESVDFVVSRAEAAKVTIHSFGLGLT-HKPDTMIELSTRTKGSYSYVKDWMMLRECVAG 686 Query: 389 EMRQMLITVAKDVKAQIEFNPAWVTEYRQIGYEKRQLRVEHFNNDNVDAGDIGAGKHITL 448 + + T ++VK ++ ++ +I + + GD+ G + Sbjct: 687 CLGALQTTSHQNVKLKLRLPEGSPAKFVKISGALHTTKRATGKDAEAALGDLRFGDKRDV 746 Query: 449 LFELTLNGQKASIDKLRYAPDNKLAKSDKTKELAWLKIRWKYPQGKESQLVEFPLGPTIN 508 L +L + A+ D + P L + G E +++ Sbjct: 747 LVQLVIQPDNATQDNMPQDPWESLVSGLEALGGCS--------DGDEGRVLSV------- 791 Query: 509 APSEDMRFRAAVAAYGQKLRGSEYLNN 535 E++ A YG LR ++ Sbjct: 792 ---EEVPLIQADLTYGDLLRDGHLTHS 815 >UniRef50_C5GK44 U-box domain-containing protein n=2 Tax=Ajellomyces dermatitidis RepID=C5GK44_AJEDR Length = 766 Score = 174 bits (441), Expect = 9e-42, Method: Composition-based stats. Identities = 60/354 (16%), Positives = 127/354 (35%), Gaps = 38/354 (10%) Query: 164 WDIKDKQSIPASKPIPFAMRYELAPAPWNEQRTLLKVDILAKDRKSEELPASNLVFLIDT 223 + A + P + +L P P L L +++ +P ++V ID Sbjct: 24 RSRPSTGTNIAGERNPNEVAVQLHPLPDTNSMILSVHPPLHPEKELRHVP-CDIVLCIDI 82 Query: 224 SGSMISDERLP-----------------LIQSSLKLLVKELREQDNIAIVTYAGDSRIA- 265 S SM S LP L + + + +++ L + D + +V ++ D+ + Sbjct: 83 SYSMSSSAPLPTTDDSGKPEDTGLSVLDLTKHAARTIIETLNDNDRLGVVAFSTDAEVVY 142 Query: 266 -LPSISGSHKAEINAAIDSLDAEGSTNGGAGLELAYQQATK-GFIKGGINRILLATDGDF 323 + +++ +K A+++L STN GL+L+ + + I + + + TDG + Sbjct: 143 KISNMNEDNKKAALKAVEALWPLSSTNLWHGLKLSLEALEEVTPIPQNVQALYILTDGMY 202 Query: 324 NVGIDDPKSIESMVKKQRESGVT----------LSTFGVGNSNYNEAMMVRIADVGNGNY 373 + + + +S V+ + TFG G ++ I++VG G Y Sbjct: 203 RIVRSRVPHANASKFRHAKSYVSKAGQKDRLPMIHTFGFGYY-IRSGLLQAISEVGGGTY 261 Query: 374 SYIDTLSEAQKVLNSEMRQMLITVAKDVKAQIEFNPAWVT---EYRQIGYEKRQLRVEHF 430 S+I V + + T A I + + E + G + E Sbjct: 262 SFIPDAGMIGTVFVHAIANLYTTFATQAMISIRTSGSVEIAQDEGSKTGLGLYEESTEDG 321 Query: 431 NNDNVDAGDIGAGKHITLLFELT--LNGQKASIDKLRYAPDNKLAKSDKTKELA 482 V G + G+ ++ + + + L Y ++++ Sbjct: 322 ALA-VTVGSLQYGQSRDVIIRMKNATSKPSTAQATLTYNFQGNAKSVASSEQIL 374 >UniRef50_B8AXM1 Putative uncharacterized protein n=1 Tax=Oryza sativa Indica Group RepID=B8AXM1_ORYSI Length = 614 Score = 173 bits (438), Expect = 2e-41, Method: Composition-based stats. Identities = 58/279 (20%), Positives = 119/279 (42%), Gaps = 24/279 (8%) Query: 192 NEQRTLLKVDILAKDR-KSEELPASNLVFLIDTSGSMISDERLPLIQSSLKLLVKELREQ 250 ++ + + + A + ++V ++D SGSM ERL ++ ++++ + +L Sbjct: 6 RQENFPVLIQVTAPPVLEGTARAGVDVVAVLDVSGSMEG-ERLEHVKEAMEIFIGKLGPD 64 Query: 251 DNIAIVTYAGDSRIA--LPSISGSHKAEINAAIDSLDAEGSTNGGAGLELAYQQATKGF- 307 D +++V++A R L +S +A +D L A+GSTN GA L Sbjct: 65 DRLSVVSFATSVRRLTELTYMSEQGRAVAKEIVDGLVADGSTNMGAALLEGAMILRDRKG 124 Query: 308 ----IKGGINRILLATDGDFNVGIDDPKSIESMVKKQRESGVTLSTFGVGNSNYNEAMMV 363 G + ++ +DG + + + K+ TFG+G S++N +M Sbjct: 125 ARDESNGRVGCMMFLSDG----------TNDEIYKEDISGEFPAHTFGLG-SDHNPNVMR 173 Query: 364 RIADVGNGNYSYID-TLSEAQKVLNSEMRQMLITVAKDVKAQIEFNPA-WVTEYRQIGYE 421 IAD + YS+++ +++ + + + + VA V+ + + V+ R GY Sbjct: 174 HIADETSATYSFVNRNIADIKGAFDLFISGLTSVVATAVRVTVRAHAGAAVSSIRSGGYA 233 Query: 422 KRQLRVEHFNNDNVDAGDIGAGKHITLLFELTLNGQKAS 460 R + +D D+ AG+ + LT+ + Sbjct: 234 HRVAA--DRLSGAIDIHDMYAGERKCFVVYLTVAEGRGG 270 >UniRef50_Q237Q6 von Willebrand factor type A domain containing protein n=1 Tax=Tetrahymena thermophila SB210 RepID=Q237Q6_TETTH Length = 713 Score = 171 bits (433), Expect = 7e-41, Method: Composition-based stats. Identities = 68/386 (17%), Positives = 143/386 (37%), Gaps = 35/386 (9%) Query: 199 KVDILAKDRKSEELPASNLVFLIDTSGSMI--------------SDERLPLIQSSLKLLV 244 +V I K + ++++ ++D SGSM L +++ SL +V Sbjct: 48 QVRIQILSPKGKSKVSNSICCVVDVSGSMGSRAVTKQSGGNSELGYSVLDIVKHSLNTIV 107 Query: 245 KELREQDNIAIVTYAGDSRIAL--PSISGSHKAEINAAIDSLDAEGSTNGGAGLELAYQQ 302 + L E D ++VT++ +S++ ++ S+ I+ + STN AG+E +Q Sbjct: 108 QNLDEGDEFSMVTFSDNSKLVCNYQQMTESNIKSSVDLINQCQPDASTNIWAGIEQGLEQ 167 Query: 303 ATKGFIKGGINRILLATDGDFNVGIDDPKSIESMVKKQRESGV-----TLSTFGVGNSNY 357 K ++++ TDG NV + P+ I + + + +++TFG G Sbjct: 168 MQNDSNKNKNQQLIVLTDGQPNV--NPPRGILTTLNNFYNKNIISPKPSINTFGFGYY-L 224 Query: 358 NEAMMVRIADVGNGNYSYIDTLSEAQKVLNSEMRQMLITVAKD-VKAQIEFNPAWVTEYR 416 + ++ IA G YS+I S + + + M T A + V N Sbjct: 225 DSHLLFNIAQDCQGIYSFIPDSSFVGTIFTNSIASMQSTFATNAVLVFKPLNKNAQLNLS 284 Query: 417 QIGYEKRQLRVEHFNNDNVDAGDIGAGKHITLLFELTLNGQKASIDKLRYAPDNKLAKSD 476 QI + + V+ G++ + ++F+ L+ + + K + Sbjct: 285 QIKS-NFKTYLNKEGEYIVELGNLFFDQSKDIIFQQDLHSELLRDFSVEVKYYTKDTNNF 343 Query: 477 KTKELAWL--KIRWKYPQGKESQLVEFPLGPTINAPSEDMRFRAAVAAY------GQKLR 528 + +I E + + + T+ + + + + R Sbjct: 344 QLSHRMHKAEQIHNANKLEFEQEALRLEVVKTVQQCKDSSQKSQKLVEDTINLLKASQFR 403 Query: 529 GSEYLNNTSWQQIKQWAQQAKGEDPQ 554 EY+ N + ++ +QA D Sbjct: 404 EDEYIQNL-CKDMEDQVKQAVSRDDY 428 >UniRef50_C5CDX4 von Willebrand factor type A n=1 Tax=Kosmotoga olearia TBF 19.5.1 RepID=C5CDX4_KOSOT Length = 730 Score = 171 bits (433), Expect = 7e-41, Method: Composition-based stats. Identities = 53/218 (24%), Positives = 91/218 (41%), Gaps = 5/218 (2%) Query: 191 WNEQRTLLKVDI-LAKDRKSEELPASNLVFLIDTSGSMISDERLPLIQSSLKLLVKELRE 249 W+E + L R+ E + ++VF++D SGSM S +++ + +L +++ L E Sbjct: 249 WDEADRRGYFLLTLVPPREPERIIPKDIVFILDISGSM-SGQKIEKAKLALLQVLQMLHE 307 Query: 250 QDNIAIVTYAGDSRIALPSI-SGSHKAEINAAIDSLDAEGSTNGGAGLELAYQQATKGFI 308 D +I+T+ + + S + E A+ + A G TN L + Sbjct: 308 GDRFSIITFNNEVNNLTERLLPFSDRTEWYPAVKQIMAGGMTNIHDALLEGIEVLGTQST 367 Query: 309 KGGINRILLATDGDFNVGIDDPKSIESMVKKQRE-SGVTLSTFGVGNSNYNEAMMVRIAD 367 +L TDG GI D +I K + V L FGVG + N ++ +A+ Sbjct: 368 DDRYKVVLFLTDGAPTEGITDIGTIIRDSTKLAKVRDVHLFVFGVG-YDVNAELLDELAE 426 Query: 368 VGNGNYSYIDTLSEAQKVLNSEMRQMLITVAKDVKAQI 405 G G YI E + + R + V +V +I Sbjct: 427 KGGGKVKYIVENEEIDEKVLELYRMIETPVMSNVHLEI 464 >UniRef50_A0EFJ5 Chromosome undetermined scaffold_93, whole genome shotgun sequence n=2 Tax=Paramecium tetraurelia RepID=A0EFJ5_PARTE Length = 610 Score = 171 bits (432), Expect = 8e-41, Method: Composition-based stats. Identities = 72/348 (20%), Positives = 141/348 (40%), Gaps = 29/348 (8%) Query: 144 LLPPPDAVRVEEI----VNYFPSDWDIKDKQSIPASKPIPFAMRYELAPAPWNEQRTLLK 199 + PDA++VE + +N P I Q S+ +P ++ + + +QR Sbjct: 73 EITDPDALQVELLNSVHLNVLPRQKAI---QVQEYSQILPVVLQIQSLKSQLKKQRA--- 126 Query: 200 VDILAKDRKSEELPASNLVFLIDTSGSMISDERLPLIQSSLKLLVKELREQDNIAIVTYA 259 +L+ ++D SGSM + E++ L+Q+SL+ + K L+ D +A+VT+ Sbjct: 127 --------------NIDLMCVVDVSGSM-NGEKIKLVQNSLRYIQKILKPTDRLALVTFG 171 Query: 260 GDSRIALPSIS--GSHKAEINAAIDSLDAEGSTNGGAGLELAYQQATKGFIKGGINRILL 317 + I L +K +I AI + STN +G+ L + K + + + Sbjct: 172 TQAGINLQWTRNIAENKKKIKKAIKDIKIRDSTNIASGVALGLRMIRDRKFKNPVTSMFV 231 Query: 318 ATDGDFNVGIDDPKSIESMVKKQRESGVTLSTFGVGNSNYNEAMMVRIADVGNGNYSYID 377 +DG + D + +++ + + +T++TFG G S+++ +M IA++ G + YID Sbjct: 232 LSDGVDDDRGADLRCQQALHQYNIQDTLTINTFGYG-SDHDAKVMNNIANLKGGQFVYID 290 Query: 378 TLSEAQKVLNSEMRQMLITVAKDVKAQIEFNPAWVTEYRQIGYEKRQLRVEHFNNDNVDA 437 + + M ML AK+V ++ + G + ++ + Sbjct: 291 QIQRVSEHFILAMSGMLSVKAKNVILTVKQLNNEFKLSKIFGDDFLWNKISE-TEFQLTL 349 Query: 438 GDIGAGKHITLLFELTLNGQKASIDKLRYAPDNKLAKSDKTKELAWLK 485 + E+ + G K L L A+ K Sbjct: 350 NYLVDEDKKEFALEIEIPGFKQQELVLENIMQIDLQGVFIGLNTAFKK 397 >UniRef50_Q22SJ7 von Willebrand factor type A domain containing protein n=1 Tax=Tetrahymena thermophila SB210 RepID=Q22SJ7_TETTH Length = 642 Score = 169 bits (429), Expect = 2e-40, Method: Composition-based stats. Identities = 40/199 (20%), Positives = 91/199 (45%), Gaps = 4/199 (2%) Query: 212 LPASNLVFLIDTSGSMISDERLPLIQSSLKLLVKELREQDNIAIVTYAGDSRIA--LPSI 269 P+ +LV +I+ S SM E++ ++++L L++ L D +++V + L + Sbjct: 195 RPSIDLVCVINNSESMHG-EKILNVKNTLLYLLEMLNSNDRLSLVLSNNNPTTLFDLKYL 253 Query: 270 SGSHKAEINAAIDSLDAEGSTNGGAGLELAYQQATKGFIKGGINRILLATDGDFNVGIDD 329 +K ++ I+++ +TN + A+ + ++ I L +DG + Sbjct: 254 DEKNKQDLKRIINNISITQNTNITKSMIKAFNILQFRQSQNKVSSIFLLSDGVDSSAEKQ 313 Query: 330 PKSIESMVKKQRESGVTLSTFGVGNSNYNEAMMVRIADVGNGNYSYIDTLSEAQKVLNSE 389 ++ S + + + +FG G + + M+ +I + NGN+ YI +++ + Sbjct: 314 IQNYISSQQSLQNKNFAIHSFGYGF-DQDAEMINKICSLKNGNFYYIQNMNQVDQYFADV 372 Query: 390 MRQMLITVAKDVKAQIEFN 408 + L VA+D+ +I N Sbjct: 373 LGGTLTAVAQDITIEISLN 391 >UniRef50_A9G8C3 Putative uncharacterized protein n=2 Tax=Sorangium cellulosum 'So ce 56' RepID=A9G8C3_SORC5 Length = 907 Score = 169 bits (429), Expect = 2e-40, Method: Composition-based stats. Identities = 89/456 (19%), Positives = 170/456 (37%), Gaps = 39/456 (8%) Query: 62 LAQQEVQQYSDKQALQGRLQEAPTFARAAKAKATHIANPGTARYQQFDDNPVKQVAQNPL 121 A Q + P AR+A A A D N P Sbjct: 370 GAGAGSGQGFGSGHGRAGASSPPVSARSAAAYAPEPEV-------ALDPNGRFATTYRP- 421 Query: 122 ATFSLDVDTGSYANVRRFLNQGLLPPPDAVRVEEIVNYFPSDWDIKDKQSIPASKPIPFA 181 G A L +G++P + E+V + + +P + Sbjct: 422 -------GGGHLAAFEAALARGVVPAAER----ELVGDVAARYAP----EVPLALDKALG 466 Query: 182 MRYELAPAPWNEQRTLLKVDILAKDRKSEEL--PASNLVFLIDTSGSMISDERLPLIQSS 239 +R +L A + + + + P ++ ++DTSGSM + + + + Sbjct: 467 LRADLERAALGPGGGAFHLRLALRSAAAAAAARPHLSVHLVLDTSGSM-AGAPIDSARRA 525 Query: 240 LKLLVKELREQDNIAIVTYAGDSRIALPSIS-GSHKAEINAAIDSLDAEGSTNGGAGLEL 298 + LV L D+ ++ T++ D+ + + G +A I AI+ L G TN GAGL L Sbjct: 526 AQALVDRLAPADDFSLTTFSSDAEVVIEDGPVGPRRAAIRRAIEGLREGGGTNIGAGLSL 585 Query: 299 AYQQATK-GFIKGGINRILLATDGDFNVGIDDPKSIESMVKKQRESGVTLSTFGVGNSNY 357 Y QA++ G + + +LL +DG G+ + + + + G+ S G+G+ ++ Sbjct: 586 GYAQASRPGIPEDAVRVVLLVSDGRATSGLTHSERLAWLALDAFQRGIQTSALGLGD-DF 644 Query: 358 NEAMMVRIADVGNGNYSYIDTLSEAQKVLNSEMRQMLITVAKDVKAQIEFNPAWVTEYRQ 417 + +M IA G G Y Y+ + L++E+ + L VA V+ ++ A V R Sbjct: 645 DGQLMSAIASDGAGGYYYLRHPEQIAPALSTELDKRLDPVATAVEVRVRLK-AGVDLLRA 703 Query: 418 IGYEKRQLRVEHFNNDNVDAGDIGAGKHITLLFELTLNGQK---------ASIDKLRYAP 468 G + A D+ A + + + + A D+ Sbjct: 704 YGSRRLDGAEASRVRAQEIAADVAAQSRDRITADRRDDAEGGMRFFMPVFARDDRHALLL 763 Query: 469 DNKLAKSDKTKELAWLKIRWKYPQGKESQLVEFPLG 504 + +A +++++K K++ + E P+ Sbjct: 764 KLSAGPGAGKRAVATVELKYKDRLAKKNVIEEIPIE 799 >UniRef50_C1I2R0 von Willebrand factor n=1 Tax=Clostridium sp. 7_2_43FAA RepID=C1I2R0_9CLOT Length = 960 Score = 169 bits (428), Expect = 3e-40, Method: Composition-based stats. Identities = 51/221 (23%), Positives = 94/221 (42%), Gaps = 14/221 (6%) Query: 202 ILAKDRKSEELPASNLVFLIDTSGSM----ISDERLPLIQSSLKLLVKELREQDNIAIVT 257 + R E+PA ++ +ID SGSM +L L + + ++ LRE D I+++ Sbjct: 393 VYMDKRGKNEVPAISINLIIDKSGSMSAEGGGVSKLTLAKEAAMKALENLREVDEISVIA 452 Query: 258 YAGDSRIALPSISGSHKAEINAAIDSLDAEGSTNGGAGLELAYQQATKGFIKGGINRILL 317 + +P K I I + G T+ LE Y + I +L Sbjct: 453 FDDTYDEVVPLQKVGDKEAIKELISGIQIRGGTSIYPALEQGYNM--QMQSSAKIKHTIL 510 Query: 318 ATDGDFNVGIDDPKSIESMVKKQRESGVTLSTFGVGNSNYNEAMMVRIADVGNGNYSYID 377 TDG G+D+ ++++ ++ +TLST VG N ++ ++A +G G Y D Sbjct: 511 LTDGQDGYGLDN---YATLLQNFIDNNITLSTVAVGEG-ANAGLLNQLASIGKGRSYYTD 566 Query: 378 TLSEAQKVLNSEMRQMLITVAKDVKAQIEFNPAWVTEYRQI 418 ++ ++ E+ +L A EF P ++ + + Sbjct: 567 IYTDIPRIFAKEV--LLS--AGTYIINEEFTPKILSNHEIL 603 Score = 58.5 bits (140), Expect = 6e-07, Method: Composition-based stats. Identities = 35/147 (23%), Positives = 60/147 (40%), Gaps = 20/147 (13%) Query: 190 PWNEQRT--------LLKVDILAKDRKSEELPASNL--VFLIDTSGSMISDERLPLIQSS 239 PW++ + + ILA + L N+ VFL+D S S E + Sbjct: 30 PWSKNEIAIFISRIVVFTLLILAFGNITINLKGRNISTVFLLDVSESASDFE--ESGKDF 87 Query: 240 LKLLVKELREQDNIAIVTYAGDSRIALPSISGSHKAEINAAIDSLDAEGSTNGGAGLELA 299 + ++ + + +V + +S+I +K + +ID +TN +E A Sbjct: 88 ISTAIESMPRGNKAGVVLFGDNSKID----KVLNKKKEYKSIDEKPVVTATNIQEAVESA 143 Query: 300 YQQATKGFIKGGINRILLATDGDFNVG 326 F +GG RI+L TDG+ N G Sbjct: 144 LGL----FERGGSKRIVLITDGEENQG 166 >UniRef50_B2AQN8 Predicted CDS Pa_4_3600 n=1 Tax=Podospora anserina RepID=B2AQN8_PODAN Length = 648 Score = 169 bits (428), Expect = 3e-40, Method: Composition-based stats. Identities = 55/272 (20%), Positives = 102/272 (37%), Gaps = 56/272 (20%) Query: 206 DRKSEELPASNLVFLIDTSGSMISDER----------------LPLIQSSLKLLVKELRE 249 D + +LV ID SGSM +D L L++ + K +++ L + Sbjct: 62 DLRERNHVPLDLVLSIDVSGSMGADAPVPAKNGTEGEHYGLSVLDLVRHAAKTILETLDD 121 Query: 250 QDNIAIVTYAGDSRIA--LPSISGSHKAEINAAIDSLDAEGSTNGGAGLELAYQQA---- 303 D + IVT++ S++ L ++ ++KA+I +D+L TN G+ Sbjct: 122 HDRLGIVTFSTSSKVVRELTYMTPANKAKILKQLDALQPLSMTNLWHGIRDGLSLFNNNL 181 Query: 304 -----TKGFIKGGINRILLATDGDFNVGIDDPKSIESMVKKQRESGVTLSTFGVGNSNYN 358 + G + +L+ TDG N + + + +++ ++ TFG G S Sbjct: 182 KAVNDRRNPGSGRVPALLVLTDGMPNHQCPN-QGYVAKLRQWSTLPASIHTFGFGYS-LR 239 Query: 359 EAMMVRIADVGNGNYSYIDTLSEAQKVLNSEMRQMLITVAKDVKAQIEFNPAWVTEYRQI 418 ++ IA+VG GNYS+I T V+ Q F+P+ Sbjct: 240 SGLLKSIAEVGGGNYSFIPDAGMI-------------TTGDAVEKQQPFSPSG------- 279 Query: 419 GYEKRQLRVEHFNNDNVDAGDIGAGKHITLLF 450 + G++ G+ + Sbjct: 280 -------DDSPNKTLYISLGNLQYGQSREIYL 304 >UniRef50_B1HPN0 Putative uncharacterized protein n=2 Tax=Bacillaceae RepID=B1HPN0_LYSSC Length = 825 Score = 169 bits (427), Expect = 3e-40, Method: Composition-based stats. Identities = 69/317 (21%), Positives = 133/317 (41%), Gaps = 24/317 (7%) Query: 195 RTLLKVDILAKDRKSEELPASNLVFLIDTSGSMISDERLPLIQSSLKLLVKELREQDNIA 254 TLL V++ K + E+LP+ LV ++D SGSM S +L L + + V+ LR++D + Sbjct: 349 ETLLPVEMEIKGK--EQLPSLGLVIVLDRSGSM-SGSKLELAKEAAARSVEMLRDEDTLG 405 Query: 255 IVTYAGDSRIALPSISGSHKAEINAAIDSLDAEGSTNGGAGLELAYQQATKGFIKGGINR 314 + + + + ++K E I S+ G T L AY+ ++ Sbjct: 406 FIAFDDRPWEIIETGPLNNKEEAVDTILSVTPGGGTEIYGSLAKAYENLADMKLQ--RKH 463 Query: 315 ILLATDGDFNVGIDDPKSIESMVKKQRESGVTLSTFGVGNSNYNEAMMVRIADVGNGNYS 374 I+L TDG P + + ++++ +++G+TLST +G + + ++ ++++G+G + Sbjct: 464 IILLTDGQ-----SQPGNYDDLIEQGKDNGITLSTVAIGQ-DADANLLEALSEMGSGRFY 517 Query: 375 YIDTLSEAQKVLNSEMRQMLITVAKDVKAQIEFNPAWVTEYRQIGYEKRQLRVEHFNNDN 434 + +L+ E + T IE NP + Y G+ L N Sbjct: 518 NVIDEQTIPSILSRETAMISRTY-------IEDNPFYPVLYNAGGW--NTLFANGVPQMN 568 Query: 435 VDAGDIGAGKHITLLFELTLNGQKASIDKLRYAPDNKLA-KSDKTKELAWLKIRWKYPQG 493 G A + T++ E + + + +Y A SD T + + RW+ Sbjct: 569 AYIGT-TAKQGATVVAESE--KEDPVLAQWQYGLGKTFAFTSDSTGKWSGDWARWQDWGT 625 Query: 494 KESQLVEFPLGPTINAP 510 L+ L + Sbjct: 626 FWQTLISQMLPSYNDVA 642 Score = 47.4 bits (111), Expect = 0.002, Method: Composition-based stats. Identities = 27/151 (17%), Positives = 54/151 (35%), Gaps = 20/151 (13%) Query: 217 LVFLIDTSGSMISDERLPLIQSSLKLLVKELREQ-DN--IAIVTYAGDSRIALPSISGSH 273 +V+L+D S SM E ++ + L+ + D + +++ + ++ + Sbjct: 28 IVYLVDRSASMNGTED-----EMVQFIQDSLQSKKDEQLAGLYSFSSTLQTEA-IMTKTL 81 Query: 274 KAEINAAIDSLDAEGSTNGGAGLELAYQQATKGFIKGGINRILLATDGDFNVGIDDPKSI 333 K + A TN L+LA T R++L TDG+ G S Sbjct: 82 KE--VPKFTEIKATDQTNIEQSLQLA----TGIIDPKKATRLVLLTDGNETKG-----SA 130 Query: 334 ESMVKKQRESGVTLSTFGVGNSNYNEAMMVR 364 K + S +++ N+ + Sbjct: 131 LDFATKFKGSNISVDVVPFSQPVVNDVSLKS 161 >UniRef50_A6CIG8 Putative uncharacterized protein n=1 Tax=Bacillus sp. SG-1 RepID=A6CIG8_9BACI Length = 931 Score = 169 bits (427), Expect = 3e-40, Method: Composition-based stats. Identities = 58/227 (25%), Positives = 108/227 (47%), Gaps = 17/227 (7%) Query: 195 RTLLKVDILAKDRKSEELPASNLVFLIDTSGSMISDERLPLIQSSLKLLVKELREQDNIA 254 LL VD+ K +K ELP+ +V ++D SGSM + ++ L + + + LRE+D + Sbjct: 390 EKLLPVDMDLKGKK--ELPSLGMVIVLDRSGSM-AGYKIQLAKEAAIRSAELLREKDTLG 446 Query: 255 IVTYAGDSRIALPSISGSHKAEINAAIDSLDAEGSTNGGAGLELAYQQATKGFIKGGINR 314 + + + + K ++ I+ L + G TN LELAY+Q T ++ Sbjct: 447 FIAFDDRPWQIIDTEPIKDKEKVIEKINGLTSGGGTNIFPSLELAYEQLTP--LELQRKH 504 Query: 315 ILLATDGDFNVGIDDPKSIESMVKKQRESGVTLSTFGVGNSNYNEAMMVRIADVGNGNYS 374 I+L TDG P + +++ +E+ +TLST +G + + ++ ++D G G + Sbjct: 505 IILLTDGQ---SATSPD-YLTTIQEGKENNITLSTVAIGEGS-DSVLLEELSDEGGGRFY 559 Query: 375 YIDTLSEAQKVLNSEMRQMLITVAKDVKAQIEFNPAWVTEYRQIGYE 421 ++ S +L+ E +L T + IE +P + T G+ Sbjct: 560 DVNDSSTIPSILSRE--TVLTT-----RTYIEDDPFYPTVIDASGFT 599 Score = 58.5 bits (140), Expect = 6e-07, Method: Composition-based stats. Identities = 29/139 (20%), Positives = 59/139 (42%), Gaps = 14/139 (10%) Query: 212 LPASNLVFLIDTSGSMISDERLPLIQSSLKLLVKELREQDNIAIVTYAGDSRIALPSISG 271 L ++ V++ D S S+ + ++ ++ VKE++ +D +V+ AG S + + Sbjct: 63 LKGNSTVYVADLSDSLDNQR--EHLRQTIDQAVKEMKREDKFGVVS-AGGSAVVERPLKE 119 Query: 272 SHKAEINAAIDSLDAEGSTNGGAGLELAYQQATKGFIKGGINRILLATDGDFNVGIDDPK 331 +A + L+ ST+ +GL L R++L +DG+ N G Sbjct: 120 KSEAGV-QFRSQLETY-STDISSGLRLGGSLI----PSYTNGRVVLLSDGNENTG----- 168 Query: 332 SIESMVKKQRESGVTLSTF 350 ++ G+T+ F Sbjct: 169 DAVKQAAYLKQQGLTVDVF 187 >UniRef50_Q7UNM0 Putative uncharacterized protein n=1 Tax=Rhodopirellula baltica RepID=Q7UNM0_RHOBA Length = 900 Score = 169 bits (427), Expect = 4e-40, Method: Composition-based stats. Identities = 52/213 (24%), Positives = 96/213 (45%), Gaps = 13/213 (6%) Query: 202 ILAKDRKSEELPASNLVFLIDTSGSMISDERLPLIQSSLKLLVKELREQDNIAIVTYAGD 261 + + K E P+ ++ +ID SGSM +++ L + + + V+ L +D I ++ + GD Sbjct: 450 VRSNFEKEREKPSLAMMLVIDKSGSM-GGQKIELAKDAAQAAVELLGPKDAIGVIAFDGD 508 Query: 262 SRIALPSISGSHKAEINAAIDSLDAEGSTNGGAGLELAYQQATKGFIKGGINRILLATDG 321 S S S + I+ AI +++A G TN + AY+ K + ++L TDG Sbjct: 509 SYTVSELRSTSDRGAISDAISTIEASGGTNMYPAMADAYEALLGATAK--LKHVILMTDG 566 Query: 322 DFNVGIDDPKSIESMVKKQRESGVTLSTFGVGNSNYNEAMMVRIADVGNGNYSYIDTLSE 381 + P + + S +TLST +G + +E ++ +A +G G Y + D Sbjct: 567 -----VSSPGDFQGVAGDMSASRITLSTVALGQGS-SEDLLEELAQIGGGRYYFCDDPQS 620 Query: 382 AQKVLNSEMRQMLITVAKDVKAQIEFNPAWVTE 414 +V E + +K ++ F P V Sbjct: 621 VPQVFAKE----TVEASKSAINELPFVPQLVRP 649 >UniRef50_B4UFP8 von Willebrand factor type A n=1 Tax=Anaeromyxobacter sp. K RepID=B4UFP8_ANASK Length = 480 Score = 168 bits (425), Expect = 6e-40, Method: Composition-based stats. Identities = 60/342 (17%), Positives = 128/342 (37%), Gaps = 14/342 (4%) Query: 182 MRYELAPAPWNEQRTLLKVDILAKDRKSEELPASNLVFLIDTSGSMISDERLPLIQSSLK 241 + YE + L+ + E ++ ++D SGSM E+L S+ Sbjct: 7 LTYEKVRFDEAKDAHLVVSLVAPHGNARAERSPVCVIPVLDVSGSMHG-EKLHFATQSIM 65 Query: 242 LLVKELREQDNIAIVTYAGDSRIALPS--ISGSHKAEINAAIDSLDAEGSTNGGAGLELA 299 LV L D +V ++ + ++ K + A+ L +TN GL Sbjct: 66 KLVDHLAPGDFCGVVVFSTEVETLAAPTEMTQDRKDALKVALGRLRPRHNTNLAGGLLAG 125 Query: 300 YQQATKGFIKGGIN-RILLATDGDFNVGI-DDPKSIESMVKKQRESGVTLSTFGVGNSNY 357 A + G+ R++L TDG N G P+ + ++++ + ++S FG G+ + Sbjct: 126 LDHAKVTKVPDGMPVRVILFTDGLANEGPATSPEGLCALLEANLGT-ASVSAFGYGD-DA 183 Query: 358 NEAMMVRIADVGNGNYSYIDTLSEAQKVLNSEMRQMLITVAKDVKAQIEFNPAWVTEYRQ 417 ++ ++ ++ +G GNY+Y+ + +A E+ +L T A+ + ++ P + + Sbjct: 184 DQELLRELSTLGRGNYAYVRSPEDALTAFARELGGLLSTYAQ--RIEVRVAPCEGAQLTE 241 Query: 418 IGYEKRQLRVEHFNNDNVDAGDIGAGKHITLLFELTLNGQKASIDKLRYAPDNKLAKSDK 477 + + + + D+ + L+ L + +D + ++ Sbjct: 242 VVSDVD--ARDEGGTAVIRVPDLLVDEVRHLVLGARLAPRPVPLDAPVAVAEIEVIFERV 299 Query: 478 TKELAWLKIRWKYPQGKESQLVEFPLGPTINAPSEDMRFRAA 519 IR + + VE + P D A Sbjct: 300 ENGRV---IREQPACKASVRFVEAADAQAVPTPGVDEVVAVA 338 >UniRef50_A6DLI0 Putative uncharacterized protein n=1 Tax=Lentisphaera araneosa HTCC2155 RepID=A6DLI0_9BACT Length = 833 Score = 167 bits (424), Expect = 8e-40, Method: Composition-based stats. Identities = 56/280 (20%), Positives = 114/280 (40%), Gaps = 20/280 (7%) Query: 202 ILAKDRKSEELPASNLVFLIDTSGSMISDERLPLIQSSLKLLVKELREQDNIAIVTYAGD 261 + ++ K +E P+ LV +ID SGSM + + + L + + K + L +D + ++ + G Sbjct: 399 VTSRYEKEKEQPSLALVLVIDKSGSM-NGQPIVLAREASKAAAELLSSRDQVGVIAFDGS 457 Query: 262 SRIALPSISGSHKAEINAAIDSLDAEGSTNGGAGLELAYQQATKGFIKGGINRILLATDG 321 +++ S ++K E+ + ID + A G TN + + G I +++ +DG Sbjct: 458 AKLVTDLTSAANKGEVLSQIDGIGAGGGTNLYPAMVMGRDML--GIASAKIKHMIVLSDG 515 Query: 322 DFNVGIDDPKSIESMVKKQRESGVTLSTFGVGNSNYNEAMMVRIADVGNGNYSYIDTLSE 381 G E + + + GVT+ST +G + +M IA +GNG + E Sbjct: 516 QSQGG-----DFEGISSELAQMGVTISTVSLGQGAAVD-LMAAIAQIGNGRAYVTNNAEE 569 Query: 382 AQKVLNSEMRQMLITVAKDVKAQIEFNPAWVTEYRQIGYEKRQLRVEHFNNDNVDAGDIG 441 ++ E + ++ I+ P + Y L+ + + + G + Sbjct: 570 MPRIFTKE-------TMEASRSAIKEEPFAPIKIDDSDY----LQGINIDETPLLLGYVM 618 Query: 442 AGKHITLLFELTLNGQKASIDKLRYAPDNKLAKSDKTKEL 481 + +L + RY +A + T +L Sbjct: 619 TKVKASAQVQLLTETGDPLLASGRYGLGQSVAFTSDTTDL 658 >UniRef50_Q2QSE5 Os12g0431700 protein n=3 Tax=Oryza sativa RepID=Q2QSE5_ORYSJ Length = 524 Score = 167 bits (423), Expect = 1e-39, Method: Composition-based stats. Identities = 70/375 (18%), Positives = 139/375 (37%), Gaps = 38/375 (10%) Query: 212 LPASNLVFLIDTSGSMISDERLPLIQSSLKLLVKELREQDNIAIVTYAGDSRIA--LPSI 269 +LV ++D SGSM ++ ++ +L+ ++ +L D ++IVT+ ++ L ++ Sbjct: 58 REGLDLVAVVDVSGSMRG-HKIESVKKALQFVIMKLTPVDRLSIVTFESSAKRLTKLRAM 116 Query: 270 SGSHKAEINAAIDSLDAEGSTNGGAGLELAYQQATKG-FIKGGINRILLATDGDFNVGID 328 + + E++ + SL A G T+ AGL+L F + I L +DG Sbjct: 117 TQDFRGELDGIVKSLIANGGTDIKAGLDLGLAVLADRVFTESRTANIFLMSDGKLEGKTS 176 Query: 329 -DPKSIESMVKKQRESGVTLSTFGVGNSNYNEAMMVRIADVG-NGNYSYIDTLSEAQKVL 386 DP + V++ TFG G+ + ++ IA G YS + + Sbjct: 177 GDPTQVNP-------GEVSVYTFGFGHGT-DHQLLTDIAKNSPGGTYSTVPDGTNLSAPF 228 Query: 387 NSEMRQMLITVAKDVKAQI--EFNPAWVTEYRQIGYEKRQLRVEHFNNDNVDAGDIGAGK 444 + + ++ VA+DV+ + + + + + + G + +G+ Sbjct: 229 ATLLGGLVTVVAQDVRLTLTPKTADGDLDKMEVADGTDYTQTTDAKGEITIKFGTLFSGE 288 Query: 445 HITLLFELTLNGQ------KASIDKLRYAPDNKLAKSDKTKELAWLKIRWKYPQGKESQL 498 + TLN A++ R++ + A + K P + Sbjct: 289 TRKVAVNFTLNESPDTEEYNATLAVARHSYAAQEAPQPAQNIVRLRKPEPTTPGSDDGIE 348 Query: 499 VEFPLGPTINAPSEDMRFRAAVAAYGQKLRGSEYLNNTSWQQIKQWAQQAKGE--DPQGY 556 + D+ +L + L + + I AQ A G+ G Sbjct: 349 ERSVQAEVVRRRHADL------IGKASELANGQKLGDAR-ETIMD-AQNALGDILLDDGD 400 Query: 557 R------AEFIRLIE 565 R AE +RL+E Sbjct: 401 RMVNALQAELLRLLE 415 >UniRef50_C5Z1W1 Putative uncharacterized protein Sb10g030210 n=1 Tax=Sorghum bicolor RepID=C5Z1W1_SORBI Length = 607 Score = 166 bits (421), Expect = 2e-39, Method: Composition-based stats. Identities = 71/342 (20%), Positives = 136/342 (39%), Gaps = 43/342 (12%) Query: 189 APWNEQRTLLKVDILAKDRKSEELPASNLVFLIDTSGSMISDERLPLIQSSLKLLVKELR 248 AP + +D+ + + A +LV ++D SGSM RL ++S+++ ++K+L Sbjct: 80 APLGASTVRVLLDVSSSSSTAG-RAALDLVVVLDVSGSMRDFGRLDKLKSAMRFIIKKLA 138 Query: 249 EQDNIAIVTYAGDSRIALP--SISGSHKAEINAAIDSLDAEGSTNGGAGLELAYQQATKG 306 D +++VT+ G + P ++S + +D L A G TN AGL++ Q Sbjct: 139 PMDRLSVVTFNGGATRECPLRAMSEDAVPVLTDIVDGLVARGGTNIEAGLKMGLQVLDGR 198 Query: 307 FIKG-GINRILLATDGDFNVGIDDPKSIESMVKKQRESGVTLSTFGVGNSNYNEAMMVRI 365 G ++L +DG+ N G + + + T G SN + ++ ++ Sbjct: 199 RYTGARTAGVILMSDGEQNSG--------DATRVRNPQNYPVYTLSFG-SNADMNLLQKL 249 Query: 366 ADVGNGNYSYIDTLS--EAQKVLNSEMRQMLITVAKDVKAQIEFNPAWVTE--------- 414 A G G Y+ + V + M +L V +D+ + A V Sbjct: 250 A-GGGGTYNPVLDSGGMSMLDVFSQLMAGLLTVVVRDLYLILSKPAAVVVAATHPDDHDL 308 Query: 415 ---YRQIGYEKRQLRVEHFNNDNVDAGDIGAGKHITLLFELTLNGQKASIDKLRYAPDNK 471 + + RQ V GD+ +G+ ++ EL+L + Sbjct: 309 DKIVKVDPGDFRQETDAQSGTVTVKFGDLFSGEVRKVVVELSL---------------RE 353 Query: 472 LAKSDKTKELAWLKIRWKYPQGKESQLVEFPLGPTINAPSED 513 + SD E+ +++ + QG+ +LV L + + D Sbjct: 354 TSSSDYDAEILDVEVSYPNEQGERKKLVGQTLHVKRTSTASD 395 >UniRef50_Q2QZH3 Os11g0687100 protein n=79 Tax=Eukaryota RepID=Q2QZH3_ORYSJ Length = 633 Score = 166 bits (420), Expect = 2e-39, Method: Composition-based stats. Identities = 71/361 (19%), Positives = 129/361 (35%), Gaps = 46/361 (12%) Query: 190 PWNEQRTLLKVDILAKDRKSEEL---PASNLVFLIDTSGSMI------------SDERLP 234 P + +V + + + +L ++V ++D SGSM RL Sbjct: 42 PRGQTNKDFQVLLRVEAPPAADLNSHVPLDVVAVLDVSGSMNDPVAAASPKSNLQGSRLD 101 Query: 235 LIQSSLKLLVKELREQDNIAIVTYAGDS----RIALPSISGSHKAEINAAIDSLDAEGST 290 ++++S+K ++++L + D ++IV + L +SG ++ ID L A G T Sbjct: 102 VLKASMKFVIRKLADGDRLSIVAFNDGPVKEYSSGLLDVSGDGRSIAGKKIDRLQARGGT 161 Query: 291 NGGAGLELAYQQA--TKGFIKGGINRILLATDGDFNVGID-DPKSIESMVKKQRESGVTL 347 LE A + +G + + ILL TDGD G +I V K + Sbjct: 162 ALMPALEEAVKILDERQGSSRNHVGFILLLTDGDDTTGFRWTRDAIHGAVFKY-----PV 216 Query: 348 STFGVGNSNYNEAMMVRIADVGNGNYSYIDT--LSEAQKVLNSEMRQMLITVAKDVKAQI 405 TFG+G S+ EA++ IA G YS++D L+ L + + A D + + Sbjct: 217 HTFGLGASHDPEALLH-IAQGSRGTYSFVDDDNLANIAGALAVCLGGLKTVAAVDTRVSL 275 Query: 406 EFNP-----AWVTEYRQIGYEKRQLRVEHFNNDNVDAGDIGAGKHITLLFELTLNGQKAS 460 + A + GYE + V G + G+ + L + ++ Sbjct: 276 KAAELSGGGARIVRVDSGGYESSVACG--GASGEVVVGVLYTGEVKNFVVHLHVPAASST 333 Query: 461 IDKLRYAPDNKLAKSDK---------TKELAWLKIRWKYPQGKESQLVEFPLGPTINAPS 511 + ++L + + + G S V Sbjct: 334 TLTFSSVECGGYYDAATVCDHCHHRHQQQLLAVGYSYSHAPGAASAAVSVEGHGVFVERP 393 Query: 512 E 512 E Sbjct: 394 E 394 >UniRef50_C6VUB8 von Willebrand factor type A n=1 Tax=Dyadobacter fermentans DSM 18053 RepID=C6VUB8_DYAFD Length = 935 Score = 165 bits (418), Expect = 4e-39, Method: Composition-based stats. Identities = 98/407 (24%), Positives = 176/407 (43%), Gaps = 53/407 (13%) Query: 29 QQQQPSTPTEQQVLAAQQAAIKEAEQSAAAAKALAQQEVQQ--------YSDKQALQGRL 80 + Q T+ A +A ++ Q+ + Y D A GR Sbjct: 537 RDYQAGEKTKMPETANLEADSRQLIADEYKNMKGLQRYGRSNGLCPYTPYEDLGANAGRF 596 Query: 81 QE-APTFARAAKAKATHIANPGTARYQQ---FDDNPVKQVAQNPL-ATFSL-DVDTGSYA 134 E + AA A H P Y ++ N ++++ L T + DV +A Sbjct: 597 AEKVQQYKPAAPGYAAHPYEPFYYFYNNELVYEYNNFIELSKAGLLKTINQPDV----FA 652 Query: 135 NVRRFLNQGLLPPPDAVRVEEIVNYFPSDWDIKD-------KQSIPASKPIPFAM----- 182 R ++PP +VEE P+ +D +PA++ P Sbjct: 653 FRRTKPQNAVVPPA---KVEEPKVEKPAGRPAEDIAQVKTNSAEMPAAQQSPVRNVRPLT 709 Query: 183 ---------------RYELAPAPWNEQRTLLKVDILAKDRKSEELPASNLVFLIDTSGSM 227 + + + R + +N+V L+D S SM Sbjct: 710 NEAAPSAPAAQATRDTVYVERVRVDTVYVDRNAQLQNVTRSLDGFAPNNMVLLLDVSSSM 769 Query: 228 ISDERLPLIQSSLKLLVKELREQDNIAIVTYAGDSRIALPSISGSHKAEINAAIDSLDAE 287 S ++PL++ S+K L+ +R +D I+IV Y+G +R+ L SG+ +EI+ ID L ++ Sbjct: 770 NSPYKMPLLKRSIKSLLTLVRPEDMISIVLYSGKARVVLKPTSGAKASEISRMIDLLQSD 829 Query: 288 GSTNGGAGLELAYQQATKGFIKGGINRILLATDGDFNVGIDDPKSIESMVKKQRESGVTL 347 G T+G G++LAY+ A K +I+GG NRI+LATDG+F V + M+++ V L Sbjct: 830 GDTDGNEGIKLAYKTANKQYIRGGNNRIVLATDGEFPVS----DEVMDMIRQNARQDVYL 885 Query: 348 STFGVGNSNYNEAMMVRIADVGNGNYSYIDTLS-EAQKVLNSEMRQM 393 S F G + + +++++G G+Y+++ S + Q +L ++ +++ Sbjct: 886 SIFTFGRHEHTGQKLKKLSELGMGSYAHVTDASADLQLILEAQAKKL 932 >UniRef50_D2R3Y3 von Willebrand factor type A n=1 Tax=Pirellula staleyi DSM 6068 RepID=D2R3Y3_9PLAN Length = 776 Score = 164 bits (415), Expect = 9e-39, Method: Composition-based stats. Identities = 70/389 (17%), Positives = 144/389 (37%), Gaps = 38/389 (9%) Query: 140 LNQGLLPPPDAVRVEEIVNYFPSDWDIKDKQSIPASKPIP-FAMRYELAPAPWNEQRTLL 198 L L P + VE+ + + + + P A + A +P + +LL Sbjct: 307 LKAARLVRPSQIAVEDFLAAMDYRFPVPQAGLALRTAAGPSIASLTDRAASPG-PRSSLL 365 Query: 199 KVDILAKDRKSEELPASNLVFLIDTSGSMISDERLPLIQSSLKLLVKELREQDNIAIVTY 258 + I + P+ +++ +D S SM R+ ++S+L ++R+ D ++IV + Sbjct: 366 LLGIQGANIPKTA-PSRHMIVAVDVSSSMHRQGRMQQVRSALDKFTSQMRDGDQLSIVAF 424 Query: 259 AGDSRIALPSISGSHKAEINAAIDSLDAEGSTNGGAGLELAYQQATKG-----FIKGGIN 313 S + + + S A +D TN +GL+ + A + Sbjct: 425 RDVSEVLVERATASEAQSAVAMLDLPVVVSGTNLASGLQQSLLLAMQAPGDATATPPSAT 484 Query: 314 RILLATDGDFNVGIDDPKSIESMVKKQRESGVTLSTFGVGNSNYNEAM-----------M 362 +++ TDG + + ++ + + + VGN N + + Sbjct: 485 SVVVITDGTPEWSHATVQQLHALAADAAQQRIEMHVALVGN-NARAQLAAERDSLGTLAL 543 Query: 363 VRIADVGNGNYSYIDTLSEAQKVLNSEMRQMLITVAKDVKAQIEFNPAWVTEYRQIGYEK 422 +++ + G ID+ +L + +A + + +I+FNP V+ YR +G+ Sbjct: 544 DKLSSLLAGEVHAIDSSRNLYALLTDTLAGGSAVLASEARLRIDFNPQVVSAYRLLGHGA 603 Query: 423 RQLRVEHFNNDNVDAGDIGAGKHITLLFELTLNGQKASIDKLRYAPDNKLAKSDKTKELA 482 L V A +I AG+ +L EL L + + ++A Sbjct: 604 TALA--DVRPAEVSA-EIRAGETAVVLVELWLAVD---------------SGQSSSDDVA 645 Query: 483 WLKIRWKYPQGKESQLVEFPLGPTINAPS 511 + W +P ++ V + APS Sbjct: 646 TAHLSWLHPATGQATEVRQRVSRLQIAPS 674 >UniRef50_A6GBY0 Putative uncharacterized protein n=1 Tax=Plesiocystis pacifica SIR-1 RepID=A6GBY0_9DELT Length = 996 Score = 164 bits (414), Expect = 1e-38, Method: Composition-based stats. Identities = 50/228 (21%), Positives = 103/228 (45%), Gaps = 12/228 (5%) Query: 191 WNEQRTLLKVDILAKDRKSEELPASNLVFLIDTSGSMISDERLPLIQSSLKLLVKELREQ 250 W + + + E P L+ +ID SGSM S +RL L++ + + + L Sbjct: 504 WGGSTIEQVLPVRFSGERQREQPTLALILVIDKSGSMSSGDRLDLVKEAARATARTLDPS 563 Query: 251 DNIAIVTYAGDSRIALPSISGSHKAEINAAIDSLDAEGSTNGGAGLELAYQQATKGFIKG 310 D I ++ + ++ + +++ I+++I L A G TN L AY Q K Sbjct: 564 DEIGVIAFDNSPQVLVRLQPAANRLRISSSIRRLSAGGGTNAMPALREAYLQLAGS--KA 621 Query: 311 GINRILLATDGDFNVGIDDPKSIESMVKKQRESGVTLSTFGVGNSNYNEAMMVRIADVGN 370 + ++L +DG+ I +++ R+S +T+S+ GVG+ + ++R+A+ G Sbjct: 622 LVKHVILLSDGE-----SPENGINALLGDMRQSDITVSSVGVGDG-AGKDFLIRVAERGR 675 Query: 371 GNYSYIDTLSEAQKVLNSEMRQMLITVAKDVKAQIEFNPAWVTEYRQI 418 G Y Y + ++ ++ + E R+ V ++ + P + + Sbjct: 676 GRYFYSEDGTDVPRIFSREARE----VKRNALVERGLYPRVAKPVQLL 719 Score = 45.4 bits (106), Expect = 0.007, Method: Composition-based stats. Identities = 25/169 (14%), Positives = 63/169 (37%), Gaps = 25/169 (14%) Query: 195 RTLLKVDILAKDRKSEELPAS---NLVFLIDTSGSMISDERLPLIQSSLKLL-------- 243 R L+ + + + +VF++D S S I D +L + +++ Sbjct: 143 RGLVVMAVALALAQPSLRSPIRGKTVVFVVDVSES-IDDSQLAAAEQAVREAAELAASEA 201 Query: 244 ---VKELREQDNIAIVTYAGDSRIALPSISGSHKAEINAAIDSLDAEGSTNGGAGLELAY 300 +++ ++ + ++TYAG +R+ + + + D+ + A Sbjct: 202 ELGIEK-EDRTRVRVITYAGRARLLELEAGEAGELSLPRDPDN-------AMASDHASAL 253 Query: 301 QQATKGFIKGGINRILLATDGDFNVGIDDPKSIESMVKKQRESGVTLST 349 + A R++L TD + + + + + + GV++ T Sbjct: 254 RLAEALLDPDTEGRVVLMTD--ATGDLAEREGLGQAIFDLEDRGVSVHT 300 >UniRef50_UPI00016986EC hypothetical protein Epers_34925 n=1 Tax=Endoriftia persephone 'Hot96_1+Hot96_2' RepID=UPI00016986EC Length = 196 Score = 163 bits (412), Expect = 2e-38, Method: Composition-based stats. Identities = 85/174 (48%), Positives = 122/174 (70%), Gaps = 7/174 (4%) Query: 397 VAKDVKAQIEFNPAWVTEYRQIGYEKRQLRVEHFNNDNVDAGDIGAGKHITLLFELTLNG 456 +AKDVK QIEFNP +VTEYR +GYE R L E FNND VDAG+IGAG I+ L+E++L+G Sbjct: 1 MAKDVKIQIEFNPEYVTEYRLVGYENRLLAREDFNNDKVDAGEIGAGHSISALYEISLSG 60 Query: 457 QKAS-IDKLRYAPDNKLAKSDKTKELAWLKIRWKYPQGKESQLVEFPLGPT-----INAP 510 I+ LRYA +++ K+ ELA+LK+R+K+P ES+L+E P+ + +++ Sbjct: 61 STGQRIEPLRYA-QGQVSSVGKSDELAFLKLRFKHPGETESELIETPILASQIVQDLSSS 119 Query: 511 SEDMRFRAAVAAYGQKLRGSEYLNNTSWQQIKQWAQQAKGEDPQGYRAEFIRLI 564 S+D RF AAVAA+GQ LRG +YL + + ++ + A+ A+G DP G R EF+RL+ Sbjct: 120 SDDFRFAAAVAAFGQSLRGGKYLKDMGYDEMIELARGARGNDPDGERVEFVRLL 173 >UniRef50_Q25545 Putative uncharacterized protein (Fragment) n=1 Tax=Naegleria fowleri RepID=Q25545_NAEFO Length = 357 Score = 162 bits (411), Expect = 2e-38, Method: Composition-based stats. Identities = 50/244 (20%), Positives = 100/244 (40%), Gaps = 22/244 (9%) Query: 273 HKAEINAAIDSLDAEGSTNGGAGLELAYQQATKGFIKGGINRILLATDGDFNVGIDDPKS 332 K + ++ A TN GL + + I ILL TDG N GI + Sbjct: 1 GKQKAKQVAKNIHAGTCTNLSGGLFEGLRLIKQRTTCNEITSILLFTDGLANEGITNTSE 60 Query: 333 IE----SMVKKQRESGVTLSTFGVGNSNYNEAMMVRIADVGNGNYSYIDTLSEAQKVLNS 388 I + + ++ +T TFG G S+ + M+ IA GNG Y +++ + + K + Sbjct: 61 IVSKMNTTIHEEIRKQITCFTFGFG-SDTDANMLTSIAQAGNGLYYFLNNVDDIPKAFGN 119 Query: 389 EMRQMLITVAKDVKAQIEFNPAWVTEYRQIGYEKRQLRVEHFNNDNVDAGDIGAGKHITL 448 + ++ VA+++K +I P + +++ R+ + + GDI + + L Sbjct: 120 VIGGLVSVVAQNIKVKIM--PNSNVKLKKVFTTFRKTDLSGGTGCEIAVGDIYSEEKKDL 177 Query: 449 LFELTLNGQKASIDKLRYAPDNKLAKSDKTKELAWLKIRWKYPQGKESQLVEFPLGPTIN 508 +F LT+ + +ELA L++++ +++ EF + Sbjct: 178 VFVLTVPALNGPV---------------AMQELAKLQLQYFNVITTDNEGFEFEIIVNRQ 222 Query: 509 APSE 512 + + Sbjct: 223 SSQD 226 >UniRef50_Q24FW2 von Willebrand factor type A domain containing protein n=1 Tax=Tetrahymena thermophila SB210 RepID=Q24FW2_TETTH Length = 1074 Score = 162 bits (411), Expect = 3e-38, Method: Composition-based stats. Identities = 43/258 (16%), Positives = 103/258 (39%), Gaps = 22/258 (8%) Query: 206 DRKSEELPASNLVFLIDTSGSMISDERLPLIQSSLKLLVKELREQDNIAIVTYAGDSR-I 264 D K+ + P +L+ ++D SGSM E++ +++ +L L+ +L E+D + +V + + Sbjct: 355 DAKAYQRPPIDLICVMDNSGSMHG-EKINMLKETLLYLIDQLDEKDRLGLVLFNSEVTFR 413 Query: 265 ALPSISGSHKAEINAAIDSLDAEGSTNGGAGLELAYQQATKGFIKGGINRILLATDGDFN 324 + S+ ++K ++ I + A+G T+ G+ A++ + + L +DG + Sbjct: 414 PMKSMDTTNKLKLKQYISDIRAQGGTDINLGMTEAFKFIKTRKYCNPVTSVFLLSDGLDS 473 Query: 325 VGIDDPKSIESMVKKQRESGVTLSTFGVGNSNYNEAMMVRIADVGNGNYSYIDTLSEAQK 384 D + ++ +++ FG G +++ +M + I + + Sbjct: 474 KAQD--RVAVTLKNMSINEQFSINCFGFGR-DHDPILMNQ-----------IKKIDQVDM 519 Query: 385 VLNSEMRQMLITVAKDVKAQIEFNPAW---VTEYRQIGYEKRQLRVEHFNND---NVDAG 438 + + + +DV +++ V R G L + N + Sbjct: 520 FFVDALGGLFSVIGQDVLIKVKSVKELSKDVNIVRNYGDMWHLLTDQDTNGGWEYCIKLN 579 Query: 439 DIGAGKHITLLFELTLNG 456 + G + EL + Sbjct: 580 HLLLGTSKDYMCELFIPA 597 >UniRef50_A1ZRP2 Von Willebrand factor type A domain protein n=1 Tax=Microscilla marina ATCC 23134 RepID=A1ZRP2_9SPHI Length = 1088 Score = 162 bits (410), Expect = 3e-38, Method: Composition-based stats. Identities = 70/177 (39%), Positives = 104/177 (58%), Gaps = 5/177 (2%) Query: 216 NLVFLIDTSGSMISDERLPLIQSSLKLLVKELREQDNIAIVTYAGDSRIALPSISGSHKA 275 NL+ L+D SGSM S ++LPL++ S K L+ +R QD+++IV YAGD+ I L S S++ Sbjct: 913 NLMLLLDVSGSMSSKDKLPLLKESFKYLISIMRPQDDVSIVIYAGDAAIVLKPTSASNQE 972 Query: 276 EINAAIDSLDAEGSTNGGAGLELAYQQATKGFIKGGINRILLATDGDFNVGIDDPKSIES 335 +INA ID L + G TN AG +LAY+ +K F +GG NRI+LATDG+F + K I Sbjct: 973 QINAVIDKLRSRGKTNVKAGFKLAYKWMSKNFKEGGNNRIILATDGEFPIS----KYIYK 1028 Query: 336 MVKKQRESGVTLSTFGVGNSNYNEAMMVRIADVGNGNYSYIDTLSEAQKVLNSEMRQ 392 +V+K+ G+ LS F G+ + ++ G GNY + + L E + Sbjct: 1029 LVEKRATKGINLSVFSFGSMTKKFETLEKLVAKGKGNYEQV-NARNVKYKLVKEAQS 1084 Score = 139 bits (349), Expect = 4e-31, Method: Composition-based stats. Identities = 52/137 (37%), Positives = 87/137 (63%), Gaps = 6/137 (4%) Query: 216 NLVFLIDTSGSMISDERLPLIQSSLKLLVKELREQDNIAIVTYAGDSRIALPSISGSHKA 275 NL+ L+D SGSM ++ LP+++S+LK LV +R +D +++V + ++++ L S +KA Sbjct: 686 NLMLLLDVSGSMKNE--LPMLKSALKYLVNIMRPEDKVSVVVFGSEAKLMLRPTSAKYKA 743 Query: 276 EINAAIDSLDAEGSTNGGAGLELAYQQATKGFIKGGINRILLATDGDFNVGIDDPKSIES 335 +I AID+L + G TNG AGL+LAYQ + NRI+LA+DG+F++ K + Sbjct: 744 QIMQAIDTLKSSGRTNGEAGLKLAYQWIQNNYKNNNNNRIILASDGEFSIS----KGLYQ 799 Query: 336 MVKKQRESGVTLSTFGV 352 M++++ E + LS F Sbjct: 800 MIEQKAEESIALSVFSF 816 >UniRef50_B9XQ17 Vault protein inter-alpha-trypsin domain protein n=1 Tax=bacterium Ellin514 RepID=B9XQ17_9BACT Length = 806 Score = 162 bits (410), Expect = 3e-38, Method: Composition-based stats. Identities = 48/217 (22%), Positives = 103/217 (47%), Gaps = 8/217 (3%) Query: 191 WNEQRTLLKVDILAKDRKSEELPASNLVFLIDTSGSMISDERLPLIQSSLKLLVKELREQ 250 +E L + D K++++ + ++VF++DTSGSM S +++ + +L+ V+ L + Sbjct: 288 GDEDGYFLLLASPGVDAKAKQIVSKDVVFVLDTSGSM-SGKKMEQAKKALQFCVESLNDG 346 Query: 251 DNIAIVTYAGDSRIA---LPSISGSHKAEINAAIDSLDAEGSTNGGAGLELAYQQATKGF 307 D I+ ++ +S L ++S ++ + I +L A G T L+ A +K Sbjct: 347 DRFEIIRFSTESEPLFDKLAAVSKENREKAGDFIKNLKAMGGTAIDEALKKALSLESK-- 404 Query: 308 IKGGINRILLATDGDFNVGIDDPKSIESMVKKQRESGVTLSTFGVGNSNYNEAMMVRIAD 367 +G ++ TDG VG D I ++++ + + FG+G ++ N ++ RIA+ Sbjct: 405 -EGRPFVVVFLTDGLPTVGTTDEDQILKGMQERNKEKRRIFCFGIG-TDVNTHLLDRIAE 462 Query: 368 VGNGNYSYIDTLSEAQKVLNSEMRQMLITVAKDVKAQ 404 Y+ + + ++S ++ V + K + Sbjct: 463 ETRAFSQYVLPEEDLEVKVSSFFSKINEPVLANPKLK 499 >UniRef50_C7P2A9 von Willebrand factor type A n=2 Tax=Halobacteriaceae RepID=C7P2A9_HALMD Length = 393 Score = 162 bits (409), Expect = 4e-38, Method: Composition-based stats. Identities = 52/308 (16%), Positives = 121/308 (39%), Gaps = 9/308 (2%) Query: 181 AMRYELAPAPWNEQRTLLKVDILAKDRKSEELPASNLVFLIDTSGSMISDERLPLIQSSL 240 ++ + T + +I + + E ++ IDTSGSM D + + Sbjct: 3 SIETSVNRPNVPADGTTVTAEIDVEPGEQETDVRRHIALCIDTSGSMEGDN-IKRARDGA 61 Query: 241 KLLVKELREQDNIAIVTYAGDSRIALPSI--SGSHKAEINAAIDSLDAEGSTNGGAGLEL 298 + L ++D ++IV + ++ + LP+ S + ++ L A G T+ GL+ Sbjct: 62 AWVFGLLADEDYVSIVAFDTEATVILPATRWSDLDRQTAMDHVEELTAGGGTDMYNGLKA 121 Query: 299 AYQQATKGFI-KGGINRILLATDGDFNVGIDDPKSIESMVKKQRESGVTLSTFGVGNSNY 357 A + + + R+LL +DG N P E + + ++G+ + + G+G ++Y Sbjct: 122 AKETLSSSATGPDTVKRLLLLSDGKDN--ERTPDEFEGLAEAIDDAGIRIQSAGIG-TDY 178 Query: 358 NEAMMVRIADVGNGNYSYIDTLSEAQKVLNSEMRQMLITVAKDVKAQIEFNPAWVTE--Y 415 NEA + + G G +++++ + + + Q VA D ++ P Y Sbjct: 179 NEATIRTLGTAGRGTWTHLEAPGDIEDFFGEAVEQAGSVVAPDAHLDLDVAPGVEVSEVY 238 Query: 416 RQIGYEKRQLRVEHFNNDNVDAGDIGAGKHITLLFELTLNGQKASIDKLRYAPDNKLAKS 475 R + + N V D+ + ++ ++ ++ +++ Sbjct: 239 RALPQAQEVSPEWEANATRVKLPDLIERESQRVVLKIHAPPREPGSEEVLADVQLSARGD 298 Query: 476 DKTKELAW 483 + ++ Sbjct: 299 TASDQIGV 306 >UniRef50_C0Z595 Putative uncharacterized protein n=1 Tax=Brevibacillus brevis NBRC 100599 RepID=C0Z595_BREBN Length = 947 Score = 161 bits (407), Expect = 7e-38, Method: Composition-based stats. Identities = 45/201 (22%), Positives = 91/201 (45%), Gaps = 15/201 (7%) Query: 193 EQRTLLKVDILAKDRKSEELPASNLVFLIDTSGSMISD----ERLPLIQSSLKLLVKELR 248 E+ + +D+ K E+LP+ L +ID SGSM SD +++ L + + + Sbjct: 388 EEALPVHMDLKGK----EQLPSLGLQLVIDKSGSMSSDARGADKMALAREAAIRATTMMN 443 Query: 249 EQDNIAIVTYAGDSRIALPSISGSHKAEINAAIDSLDAEGSTNGGAGLELAYQQATKGFI 308 QD I ++ + + S + EI I + A+G T+ L+L Y++ Sbjct: 444 AQDYIGVIAFDDTPWDVVAPQSVTKLDEIQQQISRIQADGGTDIFPALQLGYERVKAMNT 503 Query: 309 KGGINRILLATDGDFNVGIDDPKSIESMVKKQRESGVTLSTFGVGNSNYNEAMMVRIADV 368 + ++L TDG + E ++++ +T+ST +G+ + + ++ IA++ Sbjct: 504 Q--RKHVILLTDGQSAL----DDDYEGLLQQMTAENITVSTVALGD-DSDRGLLEMIAEL 556 Query: 369 GNGNYSYIDTLSEAQKVLNSE 389 G G Y + + K+ + E Sbjct: 557 GKGRYYFANDAESIPKIFSKE 577 Score = 57.8 bits (138), Expect = 1e-06, Method: Composition-based stats. Identities = 30/154 (19%), Positives = 59/154 (38%), Gaps = 16/154 (10%) Query: 212 LPASNLVFLIDTSGSMISDERLPLIQSSLKLLVKELREQDNIAIVTYAGDSRIALPSISG 271 + A +VF++D S SM D P + S L+ V + + D A++ ++ + P Sbjct: 63 VQAKTIVFVVDRSASMKDD---PRVLSFLREAVGQKQAADKYAVIAIGAEAAVDQPMTIR 119 Query: 272 SHKAEINAAIDSLDAEGSTNGGAGLELAYQQATKGFIKGGINRILLATDGDFNVGIDDPK 331 + ++ +TN G+ LA +++L TDG G Sbjct: 120 QEVQPLGVDVNR----NATNLAEGIRLASAMI----PTNARGKVVLLTDGLETSG----- 166 Query: 332 SIESMVKKQRESGVTLSTFGVGNSNYNEAMMVRI 365 + RE G+ + + N +E ++ + Sbjct: 167 DAARQTRLARERGIAVEAVSLQQPNGDEVVLTSV 200 >UniRef50_Q15NW6 Vault protein inter-alpha-trypsin n=1 Tax=Pseudoalteromonas atlantica T6c RepID=Q15NW6_PSEA6 Length = 701 Score = 160 bits (404), Expect = 1e-37, Method: Composition-based stats. Identities = 46/220 (20%), Positives = 100/220 (45%), Gaps = 20/220 (9%) Query: 209 SEELPASNLVFLIDTSGSMISDERLPLIQSSLKLLVKELREQDNIAIVTYAGDSRIAL-- 266 ++++P+ +VFL+DTSGSM + E + + ++ + +LR +DN+ I+ + + Sbjct: 297 AQQMPSREVVFLLDTSGSM-AGESIVQAKRAVDFALTQLRPEDNVNIIQFNDAPQALWKR 355 Query: 267 -PSISGSHKAEINAAIDSLDAEGSTNGGAGLELAYQQATKGFI------KGGINRILLAT 319 + H + SL A+G T L LA + + + +++ T Sbjct: 356 AMPATAKHIQRARNWVASLHADGGTEMAPALTLALNKPSLHRDDSDLLGSHKLRQVVFIT 415 Query: 320 DGDFNVGIDDPKSIESMVKKQRESGVTLSTFGVGNSNYNEAMMVRIADVGNGNYSYIDTL 379 DG + + ++ S+++ + L T G+G++ N M + A G G ++YI + Sbjct: 416 DG----SVSNEDALMSLIESKLADN-RLFTIGIGSAP-NSYFMTQAAQAGRGTFTYIGDI 469 Query: 380 SEAQKVLNSEMRQMLITVAKDVKAQI----EFNPAWVTEY 415 + Q + + ++ V +D+ + EF P+ + + Sbjct: 470 QQVQHKMTALFNKLTRPVMQDIHIEFARETEFYPSVIPDL 509 >UniRef50_Q2SQR4 Uncharacterized protein containing a von Willebrand factor type A (VWA) domain n=1 Tax=Hahella chejuensis KCTC 2396 RepID=Q2SQR4_HAHCH Length = 733 Score = 160 bits (404), Expect = 2e-37, Method: Composition-based stats. Identities = 73/414 (17%), Positives = 146/414 (35%), Gaps = 49/414 (11%) Query: 99 NPGTARYQQFDDNPVKQVAQNPLATFSLDVDTGSYANVRRFLNQGLLPPP-DAV--RVEE 155 P T + + VA+ L T S A +R LN GL ++V R++ Sbjct: 235 APATNQVADAPEITPPTVAKEDLLTPSHQ------ARIRVHLNPGLPVESIESVTHRIQW 288 Query: 156 IVNYFPSDWDIKDKQSIPASKPIPFAMRYELAPAPWNE-------QRTLLKVDILAKDRK 208 + ++ + +P K R P ++ ++ Sbjct: 289 TQQTNGYEVSLESNKDVPMDKDFTLTWRVRQGSEPEAALFKEIVGDDVYAQLLLMPPQFS 348 Query: 209 SEELP-ASNLVFLIDTSGSMISDERLPLIQSSLKLLVKELREQDNIAIVTYAGDSRIALP 267 E L L++++DTSGSM + + ++ + L +D ++ + +R P Sbjct: 349 DEGLSLPRELIWVVDTSGSMEG-VSIQQARDAVLQALDTLTPRDRFNVIEFNSHARKLFP 407 Query: 268 SISGSHKAEINAA---IDSLDAEGSTNGGAGLELAYQQATKGFIKGGINRILLATDGDFN 324 + + + A + L A+G T L+ A + +G + +++ TDG Sbjct: 408 QAVPAQERALQQARRFVRGLKADGGTEIAEALDRA---LSDAAPEGYVRQVVFLTDGSVG 464 Query: 325 VGIDDPKSIESMVKKQRESGVTLSTFGVGNSNYNEAMMVRIADVGNGNYSYIDTLSEAQK 384 + K I+ + R L T G+G S N M + A G G YS+I+ +E Sbjct: 465 NELALFKQIDQQLGDSR-----LFTVGIGPSP-NRFFMRKAAQFGRGAYSHINDTAEVSD 518 Query: 385 VLNSEMRQMLITVAKDVKAQIEFNPAWVTEYRQIGYEKRQLRVEHFNNDNVDAGDIGAGK 444 + + +DV+ ++ L E + V D+ G+ Sbjct: 519 KIAELTAALRQPALRDVRLDVQ----------------SALNAEVYP---VAIPDLYRGE 559 Query: 445 HITLLFELTLNGQKASIDKLRYAPDNKLAKSDKTKELAWLKIRWKYPQGKESQL 498 + LLF++ + + L + ++ + L ++ P ++ Sbjct: 560 PVQLLFKVEDGAAASELPASIQGYGVGLLQDEQPLWMRSLDLQQAAPSQGVARA 613 >UniRef50_UPI0001AEB92D inter-alpha-trypsin inhibitor domain-containing protein n=1 Tax=Alteromonas macleodii ATCC 27126 RepID=UPI0001AEB92D Length = 586 Score = 159 bits (403), Expect = 2e-37, Method: Composition-based stats. Identities = 55/250 (22%), Positives = 105/250 (42%), Gaps = 20/250 (8%) Query: 162 SDWDIKDKQSIPASKPIPFAMRYELAPAPWNEQRTLLKVDILAKDRKSEELPA--SNLVF 219 D DI + S A+ E Q V ++ KS++L ++ F Sbjct: 264 MDRDIWLEWQPSPSSAPQAAIFTES-----KGQHDYALVMLMPPQVKSQDLQDFDRDITF 318 Query: 220 LIDTSGSMISDERLPLIQSSLKLLVKELREQDNIAIVTYAGDSRIALPSI---SGSHKAE 276 +IDTSGSM + + SL+L + L E+D +V + D+ + + +K Sbjct: 319 VIDTSGSM-GGRPIVDAKESLQLAIDRLSEKDRFNVVAFNNDTTRLFETSVEGTTRNKQY 377 Query: 277 INAAIDSLDAEGSTNGGAGLELAYQQATKGFIKGGINRILLATDGDFNVGIDDPKSIESM 336 + L+A G T L A + + K I +++ TDG + + ++ S Sbjct: 378 ARDFVKHLNAGGGTEMAPALNAALK---RTTTKDFIKQVVFITDGA----VGNEAALFSQ 430 Query: 337 VKKQRESGVTLSTFGVGNSNYNEAMMVRIADVGNGNYSYIDTLSEAQKVLNSEMRQMLIT 396 +K + L T G+G++ N M R A G G+Y ++ ++ ++ ++S + ++ Sbjct: 431 IKNEL-GDARLFTVGIGSAP-NSYFMTRAAQFGLGSYVFVRNTADIKQQMDSLLYKLESP 488 Query: 397 VAKDVKAQIE 406 V D+ + Sbjct: 489 VLSDLSLTLP 498 >UniRef50_A6WMD3 Vault protein inter-alpha-trypsin domain protein n=11 Tax=Shewanella RepID=A6WMD3_SHEB8 Length = 772 Score = 159 bits (402), Expect = 3e-37, Method: Composition-based stats. Identities = 52/232 (22%), Positives = 107/232 (46%), Gaps = 18/232 (7%) Query: 192 NEQRTLLKVDILAKDRKSEELP-ASNLVFLIDTSGSMISDERLPLIQSSLKLLVKELREQ 250 NE L + + K KS + L+ +IDTSGSM + + + +++L +K L+ + Sbjct: 368 NEDNYSLVMVLPPKVEKSTQPSLPRELILVIDTSGSM-AGDSIVQAKNALLYALKGLKPE 426 Query: 251 DNIAIVTYAGD----SRIALPSISGSHKAEINAAIDSLDAEGSTNGGAGLELAYQQATKG 306 D+ I+ + S LP+ S S+ + + L A+G T L+ A ++ Sbjct: 427 DSFNIIEFNSSLSLLSATPLPATS-SNLSRARQFVSRLQADGGTEMALALDAALPKSLGS 485 Query: 307 FIKGGI---NRILLATDGDFNVGIDDPKSIESMVKKQRESGVTLSTFGVGNSNYNEAMMV 363 + +++ TDG + + +++ +++ Q L T G+G++ N M Sbjct: 486 VSPDAVQPLRQVIFMTDG----SVGNEQALFDLIRYQIGES-RLFTVGIGSAP-NSHFMQ 539 Query: 364 RIADVGNGNYSYIDTLSEAQKVLNSEMRQMLITVAKDVKAQIEFNPAWVTEY 415 R A++G G ++YI + E +++ + ++ V D Q+ ++ V +Y Sbjct: 540 RAAELGRGTFTYIGKVDEVDAKISALLSKIQYPVLTD--IQVRYDDGSVPDY 589 >UniRef50_UPI0001788F71 von Willebrand factor type A n=1 Tax=Geobacillus sp. Y412MC10 RepID=UPI0001788F71 Length = 1007 Score = 159 bits (402), Expect = 3e-37, Method: Composition-based stats. Identities = 52/220 (23%), Positives = 96/220 (43%), Gaps = 20/220 (9%) Query: 193 EQRTLLKVDILAKDRKSEELPASNLVFLIDTSGSMISDERLPLIQSSLKLLVKELREQDN 252 E+ + +++ K E+P+ L+ +ID SGSM ++ L + S V+ +R +D Sbjct: 389 EKALPVSMELEGKR----EIPSLGLILVIDRSGSMDG-NKIELAKESAMRTVELMRAKDT 443 Query: 253 IAIVTYAGDSRIALPSISGSHKAEINAAIDSLDAEGSTNGGAGLELAYQQATKGFIKGGI 312 + +V + +P K E+ ++I S+ + G TN + A ++ K I Sbjct: 444 VGVVAFDDQPWWVVPPQKLGDKEEVLSSIQSIPSAGGTNIYPAVSSALEEMLK--IDAQR 501 Query: 313 NRILLATDGDF--NVGIDDPKSIESMVKKQRESGVTLSTFGVGNSNYNEAMMVRIADVGN 370 I+L TDG N G D + E+ +T+S+ VG + + ++ +AD Sbjct: 502 RHIILMTDGQSAMNSGYQD------LTDTMVENKITMSSVAVGM-DADTNLLQSLADAAK 554 Query: 371 GNYSYIDTLSEAQKVLNSEMRQMLITVAKDVKAQIEFNPA 410 G Y +++ + V + E + AK F PA Sbjct: 555 GRYYFVEDETTLPAVFSREAVML----AKSYIVDKPFVPA 590 >UniRef50_A3QDW1 Vault protein inter-alpha-trypsin domain protein n=1 Tax=Shewanella loihica PV-4 RepID=A3QDW1_SHELP Length = 776 Score = 159 bits (402), Expect = 3e-37, Method: Composition-based stats. Identities = 56/267 (20%), Positives = 100/267 (37%), Gaps = 36/267 (13%) Query: 202 ILAKDRKSEELPASNLVFLIDTSGSMISDERLPLIQSSLKLLVKELREQDNIAIVTYAGD 261 ++ K+ L +IDTSGSM D + +S++ + L QD ++ + Sbjct: 389 LMPPQDKARVRLPRELTLVIDTSGSMTGD-SIAQAKSAILNALAGLGSQDTFNVIAFDSS 447 Query: 262 SRIALP---SISGSHKAEINAAIDSLDAEGSTNGGAGLELAYQQ-------ATKGFIKGG 311 R P S + ++ + N + SL+A+G T L A Q + Sbjct: 448 VRSLSPVALSATAANLGKANLFVQSLEADGGTEMAPALLRALSQPESGVSSISSAVKPER 507 Query: 312 INRILLATDGDFNVGIDDPKSIESMVKKQRESGVTLSTFGVGNSNYNEAMMVRIADVGNG 371 + +++ TDG I + + +QR L T G+G + N M R A G G Sbjct: 508 LKQVVFITDGAVGNEASLFALIAANIGRQR-----LFTVGIGAAP-NGYFMERAARAGRG 561 Query: 372 NYSYIDTLSEAQKVLNSEMRQMLITVAKDVKAQIEFNPAWVTEYRQIGYEKRQLRVEHFN 431 Y+Y+ +SE + + ++ DV + + + +Y Sbjct: 562 TYTYVGKISEVDAKIGELLEKIESPQISDVTLTL--DDGSIPDY---------------- 603 Query: 432 NDNVDAGDIGAGKHITLLFELTLNGQK 458 V GD+ A + I + LT + Sbjct: 604 -WPVQIGDLYAHEPIMVALRLTPAQRS 629 >UniRef50_B5JSY7 Inter-alpha-trypsin inhibitor domain protein n=1 Tax=gamma proteobacterium HTCC5015 RepID=B5JSY7_9GAMM Length = 670 Score = 158 bits (400), Expect = 4e-37, Method: Composition-based stats. Identities = 73/394 (18%), Positives = 143/394 (36%), Gaps = 41/394 (10%) Query: 29 QQQQPSTPTEQQVLAAQQAAIKEAEQSAAAAKALAQQEVQQYSDKQALQGRLQEAPTFAR 88 Q + V + + A + + +Y++ A+ GR F+ Sbjct: 136 QDAKKQGKRAALVEQQRPNLFTSKVANIAPGETI--HVELRYTEALAIDGR-----EFSL 188 Query: 89 AAKAKATHIANPGTARYQQFDDNPVKQVA----QNPLATFSLDVDTG---------SYAN 135 T +P + + + P+ + + LA ++D+D G S+ Sbjct: 189 RLPTTMTSRFHPQESSIKPVEQGPIVPSSAVGQSSHLADITVDIDGGWPIQNIESPSHPF 248 Query: 136 VRRFLNQGLLPPPDAVRVEEIVNYFPSDWDIKDKQSIPASKPIPFAMRYELAPAPWNEQR 195 V R L +G + + D D+ + + A+ E + + Sbjct: 249 VERSLGRGYRVHMGS----SFSDKVAMDQDVVLRWQLDPVASASGAVFSE----EYKGEH 300 Query: 196 TLLKVDILAKDRKSEELPASNLVFLIDTSGSMISDERLPLIQSSLKLLVKELREQDNIAI 255 L + + S +VF+IDTSGSM + +R+ + +L V+ L D + Sbjct: 301 YALVMLRTPDEMTSGPRMPREVVFVIDTSGSM-AGQRMYHAKQALSQAVERLSPDDRFNV 359 Query: 256 VTYAGDSRIALPSISGSHKAEINAAID---SLDAEGSTNGGAGLELAYQQATKGFIKGGI 312 V + S+ + + A++ L G T +E A + Sbjct: 360 VEFNNQHSRLFSSMRSASAINVKQALNWVGRLQGGGGTMMLPAVEDALSV---RSDPAYL 416 Query: 313 NRILLATDGDFNVGIDDPKSIESMVKKQRESGVTLSTFGVGNSNYNEAMMVRIADVGNGN 372 +++L TD + + I +V++QR+ G L T G+G S N ++ + A VG G+ Sbjct: 417 RQVILITD----ASVGNEAEILRVVERQRK-GARLFTVGIGVSP-NSYLLRKAAQVGQGD 470 Query: 373 YSYIDTLSEAQKVLNSEMRQMLITVAKDVKAQIE 406 Y YI + E + + ++ V K + + Sbjct: 471 YVYIASGQEVKARMQRLFAKLENPVLKQLNIDLP 504 >UniRef50_Q1YZ74 Inter-alpha-trypsin inhibitor domain protein n=1 Tax=Photobacterium profundum 3TCK RepID=Q1YZ74_PHOPR Length = 714 Score = 158 bits (399), Expect = 6e-37, Method: Composition-based stats. Identities = 64/311 (20%), Positives = 137/311 (44%), Gaps = 24/311 (7%) Query: 203 LAKDRKSEELPASNLVFLIDTSGSMISDERLPLIQSSLKLLVKELREQDNIAIVTYAGDS 262 S L ++ F++D SGSM E + + +L+ +++L+ +D+ IVT+ ++ Sbjct: 321 QVNSTTSSALFHQSVTFVLDISGSMYG-ESIEQAKQALRYGLQQLQPEDSFNIVTFNHEA 379 Query: 263 RI---ALPSISGSHKAEINAAIDSLDAEGSTNGGAGLELAYQQATKGFIKGG--INRILL 317 + L ++ S +D LDA+G T A L+ A+ T + +N+I+ Sbjct: 380 MLYSEQLLPVTSSTITRALRFVDGLDADGGTEMAAALKAAFSIKTHDQLNSTRWLNQIVF 439 Query: 318 ATDGDFNVGIDDPKSIESMVKKQRESGVTLSTFGVGNSNYNEAMMVRIADVGNGNYSYID 377 TDG + + ++ ++++Q L T G+G++ N M R A G G Y+YI Sbjct: 440 ITDG----SVGNESALFDLIEQQLVDR-RLFTVGIGSAP-NSYFMTRAAMKGKGTYTYIG 493 Query: 378 TLSEAQKVLNSEMRQMLITVAKDVKA------QIEFNPAWVTEYRQIGYEKRQLRVEHFN 431 + E + ++ V +D+K +++ P V + Y++ L+V F Sbjct: 494 DVKEVNTKMRLLFSKISQPVMRDIKLAWSDGRSVDYWPNPVPDL----YQQEPLQV-SFK 548 Query: 432 NDNVDAGDIGAGKHITLLFELTLNGQKA-SIDKLRYAPDNKLAKSDKTKELAWLKIRWKY 490 + A I G+ + + ++ + +ID+ + P L +++ +++ Sbjct: 549 IPDNAANLIITGQQVDHEWRQDVDIHQGLAIDEKQPQPRIGLDIIWARNQISSIQMNPAI 608 Query: 491 PQGKESQLVEF 501 ++ + +E Sbjct: 609 SMDEKKKHIEL 619 >UniRef50_Q2B7C3 Putative uncharacterized protein n=1 Tax=Bacillus sp. NRRL B-14911 RepID=Q2B7C3_9BACI Length = 920 Score = 157 bits (398), Expect = 7e-37, Method: Composition-based stats. Identities = 57/219 (26%), Positives = 103/219 (47%), Gaps = 17/219 (7%) Query: 195 RTLLKVDILAKDRKSEELPASNLVFLIDTSGSMISDERLPLIQSSLKLLVKELREQDNIA 254 LL V++ K +K E+P+ L+ ++D SGSM + +L L + + V+ LRE+D + Sbjct: 385 EKLLPVNMDIKGKK--EMPSLGLMIVMDRSGSM-AGSKLELAKEAAARSVELLREKDTLG 441 Query: 255 IVTYAGDSRIALPSISGSHKAEINAAIDSLDAEGSTNGGAGLELAYQQATKGFIKGGINR 314 + + + + + K + I S+ G T LE AY++ +K Sbjct: 442 FIAFDDRPWVIVETGPLEDKKDAVDKIGSVTPGGGTEIFTSLEKAYEELEN--LKLQRKH 499 Query: 315 ILLATDGDFNVGIDDPKSIESMVKKQRESGVTLSTFGVGNSNYNEAMMVRIADVGNGNYS 374 I+L TDG D ESM++ +E+ +TLST +G S+ + ++ +A +G G + Sbjct: 500 IILLTDGQSARSTD----YESMIETGKENNITLSTVALG-SDADRNLLEELAGLGAGRFY 554 Query: 375 YIDTLSEAQKVLNSEMRQMLITVAKDVKAQIEFNPAWVT 413 + S +L+ E +A + IE NP + + Sbjct: 555 DVTDSSVIPSILSRE-----TVMAT--RTYIEDNPFYPS 586 >UniRef50_B1KDL1 LPXTG-motif cell wall anchor domain protein n=1 Tax=Shewanella woodyi ATCC 51908 RepID=B1KDL1_SHEWM Length = 739 Score = 157 bits (398), Expect = 9e-37, Method: Composition-based stats. Identities = 54/228 (23%), Positives = 99/228 (43%), Gaps = 14/228 (6%) Query: 193 EQRTLLKVDILAKDRKSEELPASNLVFLIDTSGSMISDERLPLIQSSLKLLVKELREQDN 252 E L + + D+K + + L+ +IDTSGSM S + + +L + L+ +D Sbjct: 341 EDDYALLMLLPPSDQKQDVSISRELILVIDTSGSM-SGASIAQAKRALNYALAGLKAKDT 399 Query: 253 IAIVTYAGDSRIALP---SISGSHKAEINAAIDSLDAEGSTNGGAGLELAYQQ--ATKGF 307 ++ + + P + + N + SL A G T L A + T+ Sbjct: 400 FNVIEFNSNVGSLSPYSLPATAKNIGLANQYVRSLKANGGTEMQLALNAALDKGTETEAL 459 Query: 308 IKGGINRILLATDGDFNVGIDDPKSIESMVKKQRESGVTLSTFGVGNSNYNEAMMVRIAD 367 + ++L TDG + D +S+ ++K++ L T G+G++ N M R A+ Sbjct: 460 GSERLRQVLFMTDG----SVGDEQSLFHLIKQKIGES-RLFTLGIGSAP-NSHFMRRAAE 513 Query: 368 VGNGNYSYIDTLSEAQKVLNSEMRQMLITVAKDVKAQIEFNPAWVTEY 415 G G ++YI L E Q + S + Q+ D+K + + V +Y Sbjct: 514 FGRGTFTYIGKLDEVQSKIESLLYQIERPQLTDIKLR--YADNRVPDY 559 >UniRef50_B4X134 Vault protein inter-alpha-trypsin n=1 Tax=Alcanivorax sp. DG881 RepID=B4X134_9GAMM Length = 657 Score = 157 bits (397), Expect = 1e-36, Method: Composition-based stats. Identities = 76/453 (16%), Positives = 158/453 (34%), Gaps = 60/453 (13%) Query: 24 ENKESQQQQPSTPTEQQVLAAQQAAIKEAEQSAAAAKALAQQEVQQYSDKQ-----ALQG 78 K +Q + S V + A + A + + + Q + L+ Sbjct: 112 AEKTYRQARASGKKASLVSQQRPNLFTTAVANIAPGETVQVELHYQQTLNVDGHRFQLRL 171 Query: 79 RLQEAPTFARAAKAKATHIANPGTARYQQFDDNPVKQVAQNPLATFSLDVDTG------- 131 L P F +A T + A A+ +D+D G Sbjct: 172 PLTLTPRFTPPTEAPHTLDSLLRNTVAAPGG------TADAGTASVHIDLDAGARLATLG 225 Query: 132 SYANVRRFLNQGLLPPPDAVRVEEIVNYFPSDWDIKDKQSIPASKPIPFAMRYELAPAPW 191 S ++ + G + D D+ + + +E Sbjct: 226 SPSHAIHYQRHGRR-----YTITPKAGAIAMDRDLLLNWELEDTGEPLVTRFHEEIDG-- 278 Query: 192 NEQRTLLKVDILAKDRKSEELPASNLVFLIDTSGSMISDERLPLIQSSLKLLVKELREQD 251 + L + + K + LP L F+ID+SGSM + ++SL L ++ L+ D Sbjct: 279 --EHYALLMVVPPKTGQVTALPRETL-FIIDSSGSM-GGAPMRQAKASLHLALQRLKPGD 334 Query: 252 NIAIVTYAGDSRIALPS---ISGSHKAEINAAIDSLDAEGSTNGGAGLELAYQQATKGFI 308 I + + + +S + + + +D L A G T+ L ++ Sbjct: 335 RFNITDFDSQHTLLFETPVTVSDNSRQQAQDFVDGLQASGGTHMLPALSA---TLSQPAS 391 Query: 309 KGGINRILLATDGDFNVGIDDPKSIESMVKKQRESGVTLSTFGVGNSNYNEAMMVRIADV 368 G + +++ TDG + + I + +Q L T G+G++ N M R A Sbjct: 392 DGYLRQVIFITDGA----VGNESGIFRALHQQLGE-ARLFTVGIGSAP-NSHFMTRAAQF 445 Query: 369 GNGNYSYIDTLSEAQKVLNSEMRQMLITVAKDVKAQIEFNPAWVTEYRQIGYEKRQLRVE 428 G G+++YI+ ++ Q+ +++ R++ + ++ Q++ V E Sbjct: 446 GRGSFTYINDQNQVQQGMDTLFRRLESPLMRN--LQVQLPDGIVAE-------------- 489 Query: 429 HFNNDNVDAGDIGAGKHITLLFELTLNGQKASI 461 D+ AG+ + + +L+ Q+ ++ Sbjct: 490 ---RWPQKLPDLYAGEPLLVAMKLSAPPQQITV 519 >UniRef50_C0HF51 Putative uncharacterized protein n=1 Tax=Zea mays RepID=C0HF51_MAIZE Length = 459 Score = 156 bits (394), Expect = 2e-36, Method: Composition-based stats. Identities = 59/294 (20%), Positives = 117/294 (39%), Gaps = 21/294 (7%) Query: 176 KPIPFAMRYELAPAPWNEQRTLLKVDILAKDRKSEELPASNLVFLIDTSGSMISDERLPL 235 I A Y E + V + A + ++V ++D SGSM +L Sbjct: 5 GDIDLAKAYHHVTVSMREHTEKVMVKLTAPHTGKGDTAPLDIVVVLDISGSMRGT-KLEH 63 Query: 236 IQSSL-KLLVKELR-EQDNIAIVTYAGDSRIA--LPSISGSHKAEINAAIDSLDAEGSTN 291 ++ ++ + ++++L D +AI+T+ + L S+ + A ++ L A G TN Sbjct: 64 MKHAMTRFIIEKLGIRGDRLAIITFESKAHKVFDLSSMLPDQVKKAVAVVEGLKAGGDTN 123 Query: 292 GGAGLELAYQQA-TKGFIKGGINRILLATDGDFNVGIDDPKSIESMVKKQRESGVTLSTF 350 AGLE T+ + I L +DG NV D +++ V + ++ TF Sbjct: 124 IKAGLEAGLDVLKTRRGHSHNASCIFLMSDGHENV--DKARTLLDRVGEH-----SVVTF 176 Query: 351 GVGNSNYNEAMMVRIADVG-NGNYSYI---DTLSEAQKVLNSEMRQMLITVAKDVKAQIE 406 G G + +E ++ IA G Y ++ + ++ K + D+K + Sbjct: 177 GFGEKS-DEQLLYDIAYHSHAGTYHHVREKEDENQLMKAFA-FLAIYRSISMLDLKVTVS 234 Query: 407 FN-PAWVTEYRQIGYEKRQLRVEHFNN-DNVDAGDIGAGKHITLLFELTLNGQK 458 + A R + + ++ H + V GD+ + +L ++ L + Sbjct: 235 AHKEAAGAIIRGVDPCRYRVDGPHGDGSFTVHFGDLAREESRRILVDVQLPQVQ 288 >UniRef50_B9MS10 von Willebrand factor type A n=2 Tax=Anaerocellum thermophilum DSM 6725 RepID=B9MS10_ANATD Length = 1188 Score = 156 bits (394), Expect = 3e-36, Method: Composition-based stats. Identities = 47/172 (27%), Positives = 83/172 (48%), Gaps = 10/172 (5%) Query: 215 SNLVFLIDTSGSMISDERLPLIQSSLKLLVKELREQDNIAIVTYAGDSRIALPSISGSHK 274 +LVF++D+SGSM ++ + + K V L + D A+V + + P + Sbjct: 498 IDLVFVLDSSGSMSWNDPNGYRKIAAKSFVDALIQGDRAAVVDFDNFGYLLQPLTTDF-- 555 Query: 275 AEINAAIDSLDAEGSTNGGAGLELAYQQATKGFIKGGINRILLATDGDFNVGIDDPKSIE 334 + AID +D+ G TN G+ +A QQ + I I+L TDG+ G D Sbjct: 556 QAVKNAIDRIDSWGGTNIAEGIRIANQQLISRSSEDRIKVIILLTDGE---GYYD----N 608 Query: 335 SMVKKQRESGVTLSTFGVGNSNYNEAMMVRIADVGNGNYSYIDTLSEAQKVL 386 ++ + + +G+T+ T G+G S +E ++ IA G Y + + S+ +V Sbjct: 609 NLTTEAKNNGITIYTIGLGTS-VDENLLRDIATQTGGMYFPVSSASQLPQVF 659 >UniRef50_A8H5J9 LPXTG-motif cell wall anchor domain n=1 Tax=Shewanella pealeana ATCC 700345 RepID=A8H5J9_SHEPA Length = 789 Score = 156 bits (394), Expect = 3e-36, Method: Composition-based stats. Identities = 56/252 (22%), Positives = 100/252 (39%), Gaps = 26/252 (10%) Query: 181 AMRYELAPAPWNEQRTLLKVDILAKDRKSEELPASN--LVFLIDTSGSMISDERLPLIQS 238 A E + P V ++ ++ + + L+ +IDTSGSM D + ++ Sbjct: 363 ASSSEASTEPQTASEKYGLVMLMPPQGAEQQPSSIHRELILVIDTSGSMSGDAII-QAKT 421 Query: 239 SLKLLVKELREQDNIAIVTYAGDSRI---ALPSISGSHKAEINAAIDSLDAEGSTNGGAG 295 +LK + LR D IV + D S + + A+ I+ L+A G T Sbjct: 422 ALKYALAGLRPTDKFNIVQFNSDVDKWSGMAMSATPYNLAQAQNYINRLEANGGTEMSIA 481 Query: 296 LELAYQQAT------------KGFIKGGINRILLATDGDFNVGIDDPKSIESMVKKQRES 343 + A T + ++L TDG + + + +++ Q Sbjct: 482 INAALNIETVTDKETGTELDNNDLGSNLLRQVLFITDGA----VSNESMLFELIEAQLGD 537 Query: 344 GVTLSTFGVGNSNYNEAMMVRIADVGNGNYSYIDTLSEAQKVLNSEMRQMLITVAKDVKA 403 L T G+G++ N M R A +G G Y+YI L E + + S ++++ DV Sbjct: 538 S-RLFTIGIGSAP-NAHFMQRAAQLGRGTYTYIGKLDEVNQKVVSLLKKIEKPQVTDV-- 593 Query: 404 QIEFNPAWVTEY 415 + F+ V +Y Sbjct: 594 DLRFSDGSVPDY 605 >UniRef50_B5JCH3 Putative uncharacterized protein (Fragment) n=1 Tax=Octadecabacter antarcticus 307 RepID=B5JCH3_9RHOB Length = 197 Score = 155 bits (393), Expect = 3e-36, Method: Composition-based stats. Identities = 61/201 (30%), Positives = 100/201 (49%), Gaps = 16/201 (7%) Query: 39 QQVLAAQQAAIKEAEQSAAAAKALAQQEVQQYSDKQALQGRLQEAPTFARAAKAKATHIA 98 V+ I++ + + A A Q++ + + + A + H+ Sbjct: 6 AVVMEETATHIRDEDATLVPQAAPAPQQLTRAAPAGNDMAGNLR--SMAEPSNDGFVHVL 63 Query: 99 NPGTARYQQFDD-------NPVKQVAQNPLATFSLDVDTGSYANVRRFLNQGLLPPPDAV 151 G+ Y+++D+ NP+K + P++TFS+DVDT +YA +R L +G LPP DAV Sbjct: 64 RDGSTFYEEYDETFANDTPNPLKITSDEPVSTFSIDVDTAAYALIRSSLTRGQLPPTDAV 123 Query: 152 RVEEIVNYFPSDWDIKDKQSIPASKPIPFAMRYELAPAPWNEQRTLLKVDILAKDRKSEE 211 R+EE++NYFP + + ++ PF + PWN L+ + I + E+ Sbjct: 124 RIEEMINYFPYAYPAPEGEA-------PFRPTINVFETPWNADTQLVHIGIQGEMPAIED 176 Query: 212 LPASNLVFLIDTSGSMISDER 232 P NLVFLIDTSGSM S ++ Sbjct: 177 RPPLNLVFLIDTSGSMESADK 197 >UniRef50_B8CPU4 Von Willebrand factor, type A n=1 Tax=Shewanella piezotolerans WP3 RepID=B8CPU4_SHEPW Length = 710 Score = 155 bits (393), Expect = 3e-36, Method: Composition-based stats. Identities = 55/262 (20%), Positives = 103/262 (39%), Gaps = 15/262 (5%) Query: 157 VNYFPSDWDIKDKQSIPASKPIPFAMRYELAPAPWNEQRTLLKVDILAKDRKSEELPASN 216 +N+ P + E Q +L+ + + L Sbjct: 292 LNWRPIVGSAPKAAIFSQQGKTHVS-DLESKATAAQPQYSLVMLLPPQDKMRLSALAPRE 350 Query: 217 LVFLIDTSGSMISDERLPLIQSSLKLLVKELREQDNIAIVTYAGDSRIALPS---ISGSH 273 L+ +IDTSGSM S E + ++S+ + L QD+ I+ + + + S + Sbjct: 351 LILVIDTSGSM-SGEAIEQAKASIIYALAGLSAQDSFNILQFNSNVYALSDTPLNASAKN 409 Query: 274 KAEINAAIDSLDAEGSTNGGAGLELAYQQATKGFIKGGINRILLATDGDFNVGIDDPKSI 333 A + L A G T L+ A Q + + ++L TDG + + + Sbjct: 410 IGRAQAYVQRLQANGGTEMSLALDKALSQQDAN--RERLRQVLFITDGA----VGNEPQL 463 Query: 334 ESMVKKQRESGVTLSTFGVGNSNYNEAMMVRIADVGNGNYSYIDTLSEAQKVLNSEMRQM 393 + ++ Q + L T G+G++ N M R A++G G Y+YI SE + + + + ++ Sbjct: 464 FTQIRNQLQQS-RLFTIGIGDAP-NAHFMQRAAELGRGTYTYIGKQSEVKSKMVAMLDKL 521 Query: 394 LITVAKDVKAQIEFNPAWVTEY 415 DV ++ F V +Y Sbjct: 522 EKPTVTDV--EVHFADGSVPDY 541 >UniRef50_A8IJ40 Predicted protein n=1 Tax=Chlamydomonas reinhardtii RepID=A8IJ40_CHLRE Length = 434 Score = 155 bits (393), Expect = 3e-36, Method: Composition-based stats. Identities = 54/216 (25%), Positives = 93/216 (43%), Gaps = 36/216 (16%) Query: 208 KSEELPASNLVFLIDTSGSMISDERLPLIQSSLKLLVKELREQDNIAIVTYAGDSRIALP 267 ++ L ++D SGSM S ER+ L++ + L+ +L D + IV+Y+G R +P Sbjct: 163 DVKQRAHVALTCVLDRSGSM-SGERIALVRETCHFLIDQLTPDDYLGIVSYSGGVRADVP 221 Query: 268 --SISGSHKAEINAAIDSLDAEGSTNGGAGLELAYQQATKGFIKGGINRILLATDGDFNV 325 ++ + + +A +D+L+A+GST GL +Q + TD Sbjct: 222 LLRMTPAARGLAHAMVDALEADGSTALYDGLVAGVRQQMEAEAP---------TD----- 267 Query: 326 GIDDPKSIESMVKKQRESGVTLSTFGVGNSNYNEAMMVRIADVGNGNYSYIDTLSEAQKV 385 VT+ TFG G + E ++ +AD +G Y YI + + Sbjct: 268 -----------------QHVTVHTFGFGAGHSVE-LLQAVADAQSGVYYYISCVDDIPSG 309 Query: 386 LNSEMRQMLITVAKDVKAQIEFNPAW-VTEYRQIGY 420 + +L VAKDV+ + P +T +R G Sbjct: 310 FGDALGGLLAVVAKDVRVGVRAAPGINLTAFRSGGR 345 >UniRef50_UPI00016C377F protein containing a von Willebrand factor type A domain n=1 Tax=Gemmata obscuriglobus UQM 2246 RepID=UPI00016C377F Length = 821 Score = 155 bits (393), Expect = 3e-36, Method: Composition-based stats. Identities = 48/281 (17%), Positives = 108/281 (38%), Gaps = 14/281 (4%) Query: 190 PWNEQRTLLKVDILAKDRKSEELPASNLVFLIDTSGSMISDERLPLIQSSLKLLVKELRE 249 P + I + ++ A +LV ++DTS SM SD ++ + ++K + +L+ Sbjct: 246 PIQTEDGYFMFLISPQVEAEKKRVARDLVLVLDTSSSM-SDIKMQQAKKAVKFCLSQLQP 304 Query: 250 QDNIAIVTYAGDSRIALPSISGSHKAEI---NAAIDSLDAEGSTNGGAGLELAYQQATKG 306 +D +V ++ + ++ + ID L G T L A A + Sbjct: 305 EDRFGVVRFSTTVTKFRSELVAANTDYLDLATKWIDGLKTSGGTAIWPALNDAL--AMRS 362 Query: 307 FIKGGINRILLATDGDFNVGIDDPKSIESMVKKQRESGVTLSTFGVGNSNYNEAMMVRIA 366 ++ TDG V + I V + + TFGVG+ + N AM+ ++A Sbjct: 363 SDPSRPFTMVFFTDGQPTVDETNADKIVKNVLAKNTGNTRIFTFGVGD-DVNAAMLDQLA 421 Query: 367 DVGNGNYSYIDTLSEAQKVLNSEMRQMLITVAKDVKA----QIEFNPAWVTEYRQIGYEK 422 D +Y+ + + ++ ++ V DV+ ++ + + + + Sbjct: 422 DSTRAVSTYVREAEDIEVKVSGLYAKISNPVLTDVQLATSENVQLHEIYPPKLPDLFQGT 481 Query: 423 RQLRVEHFNNDN---VDAGDIGAGKHITLLFELTLNGQKAS 460 + + + + + + + + L++E G+ S Sbjct: 482 QLVVIGRYTGEGPSVIRLTGLVGKERQELVYEFNFPGKTES 522 >UniRef50_C7NN24 von Willebrand factor type A n=1 Tax=Halorhabdus utahensis DSM 12940 RepID=C7NN24_HALUD Length = 592 Score = 155 bits (392), Expect = 4e-36, Method: Composition-based stats. Identities = 76/432 (17%), Positives = 158/432 (36%), Gaps = 58/432 (13%) Query: 130 TGSYANVRRFLNQGLLPPPDAVRVEEIVNYFPSDWDIKDKQSIPASKPIPFAMRYELAPA 189 + A+ RR + + LP PD++ VE + + +D + +A P Sbjct: 106 ASNVADFRRNVEEEYLPLPDSLPVEGLF--YNYYFDTGGTGECSSLFCPSYATAITADPL 163 Query: 190 PWNEQRTLLKVDILAKDRKSEELPASNLVFLIDTSGSMISD------------------- 230 + R D + E ++V ++D SGSM S Sbjct: 164 GESTGRYFTVGLNSTLDTSTFERKRLDVVIVLDISGSMGSQFDQYYYDRFGNRHTVEEGD 223 Query: 231 --ERLPLIQSSLKLLVKELREQDNIAIVTYAGDSRIA--LPSISGSHKAEINAAI-DSLD 285 ++ + + +L L ++L D + +V + + +A L + + I I + ++ Sbjct: 224 SRSKMAVAKDALVALTEQLHPDDRVGVVLFNNEPTVAKPLRDVETTDMDAIRGHIREDIE 283 Query: 286 AEGSTNGGAGLELAYQQATKGFIKGGI---NRILLATDGDFNVGIDDPKSIESMVKKQRE 342 A G TN G+ A + R ++ TD N G D ++++ + E Sbjct: 284 AGGGTNIADGMAEAADMLGEYADSDPTEAETRQIVITDAMPNTGQTDDQALQDRLAGYAE 343 Query: 343 SGVTLSTFGVGNSNYNEAMMVRIADVGNGNYSYIDTLSEAQKVLNSEMRQMLITVAKDVK 402 G+ S GVG ++N ++ I V NY + + + + L E M+ + D+ Sbjct: 344 DGIHTSFVGVGV-DFNPELVDEITAVRGANYRSVHSAEDFETYLGEEFEYMVTPLVYDLS 402 Query: 403 AQIEFNPAWVTEYRQIGYEKRQLRVEHFNNDNVDAGDI-----GAGKHITLLFELTLNGQ 457 +++ A + G E ++ + + G+ + + L+G+ Sbjct: 403 VELDAADAEIATV--YG----STAAEDATDELLSVNTLFPSPKSDGETRGGVVLVKLDGE 456 Query: 458 KASIDKLRYAPDNKLAKSDKTKELAWLKIRWKYPQGKESQLVEFPLGPTINAPSEDMRFR 517 + LR + +++ +D+T ++ +EFP P + +R Sbjct: 457 ASGDMTLRASWEDRSGSTDET-----------------TRTIEFPEEPPEYFANTGIRKA 499 Query: 518 AAVAAYGQKLRG 529 +A Y L+ Sbjct: 500 VLLARYADLLKN 511 >UniRef50_Q2QW82 Zinc finger family protein, putative, expressed n=1 Tax=Oryza sativa Japonica Group RepID=Q2QW82_ORYSJ Length = 529 Score = 155 bits (391), Expect = 5e-36, Method: Composition-based stats. Identities = 66/369 (17%), Positives = 125/369 (33%), Gaps = 74/369 (20%) Query: 214 ASNLVFLIDTSGSMISDERLPLIQSSLKLLVKELREQDNIAIVTYAGDSRIALPSISGSH 273 +LV ++D S SM D +A+ Sbjct: 129 PLDLVTVLDVSTSMTG---------------------DKLAL----------------DG 151 Query: 274 KAEINAAIDSLDAEGSTNGGAGLELAYQQATKGFIKGGINRILLATDGDFNV-------- 325 KA A+D+L A G+TN GL++ + + ++L +DG N Sbjct: 152 KATAKRAVDALVANGNTNIRDGLDVDAKVLDGRRHTDAVASVILLSDGQDNQTMGYRGRF 211 Query: 326 GIDDPKSIESMV----------KKQRESGVTLSTFGVGNSNYNEAMMVRIADVGNGNYSY 375 + D K+ + E + FG G ++++ A M I+++ G +S+ Sbjct: 212 HMTDFKAAATSYDVLVPPSFTRAGGGERCAPVHAFGFG-TDHDAAAMHSISEITGGTFSF 270 Query: 376 IDTLSEAQKVLNSEMRQMLITVAKDVKAQIEFNPAWVTEYRQIGYEKRQLRVE-HFNNDN 434 I+ L+ Q + +L A++ + +E V R + + + R++ Sbjct: 271 IENLAVIQDTFARCIGGLLSVAAQNARISVECLDPGV-RVRAVKSGRYESRIDAEGRAAT 329 Query: 435 VDAGDIGAGKHITLLFELTLNGQKASIDKLRYAPDNKLAKSDKTKELAW--------LKI 486 VD G++ A + L L + + D E + + + Sbjct: 330 VDVGELYADEERRFLLLLDVPRADGDAAVATRLASVRCTYRDTATEQSVDVAGEEDAVVL 389 Query: 487 RWKYPQG-KESQLVEFPLGPTINAPSEDMRFRAAVA---AYGQKLRGSEYLNNTSWQQIK 542 R G S VE ++D+ A A AYG+ R + + + + Sbjct: 390 RPAVATGVAPSMEVELERERVRLEAADDIALARAAAERGAYGEAAR----ILDARREALS 445 Query: 543 QWAQQAKGE 551 + A A G+ Sbjct: 446 RSAPAASGD 454 >UniRef50_C1HBZ8 von Willebrand and RING finger domain-containing protein n=3 Tax=Paracoccidioides brasiliensis RepID=C1HBZ8_PARBA Length = 1068 Score = 154 bits (390), Expect = 7e-36, Method: Composition-based stats. Identities = 57/357 (15%), Positives = 124/357 (34%), Gaps = 36/357 (10%) Query: 137 RRFLNQGLLPPPDAVRVEEIVNYFPSDWDIKDKQSIPASKPIPFAM-RYELAPAPWNEQR 195 RR L P D + ++D+ + + + R + + Sbjct: 392 RRALLDLHNPDADRSLALPRNHNQDYNYDLDNSGTEEDDYRTHQTIKRVSSTTSSYGGGT 451 Query: 196 TLLKVDILAKDRKSEELP--------------ASNLVFLIDTSGSMISDERLPLIQSSLK 241 + ++ +LV +I S SM ++ L++ +L+ Sbjct: 452 RSNNTALTDYTNSIRDVTTISTTSTIPPTFHIPLDLVVVIPVSSSM-QGLKISLLRDTLR 510 Query: 242 LLVKELREQDNIAIVTYAGDSRIALPSISGSHKA-----EINAAI-----DSLDAEGSTN 291 LV L +D + +VT+ G S +P + + K I AI SL A+ Sbjct: 511 FLVANLGPRDRMGLVTF-GSSGGGVPLVGMTTKTWGGWPAILGAIRPVGQKSLRAD---- 565 Query: 292 GGAGLELAYQQATKGFIKGGINRILLATDGDFNVGIDDPKSIESMVKKQRESGVTLSTFG 351 G +A + I ILL +D G ++ +V + + V + +FG Sbjct: 566 VVEGANVAMDLLMQRRSSNPIATILLISDSSMGEGES----VDFVVSRAEAAKVGIHSFG 621 Query: 352 VGNSNYNEAMMVRIADVGNGNYSYIDTLSEAQKVLNSEMRQMLITVAKDVKAQIEFNPAW 411 +G + + M+ ++ +Y+Y+ ++ + + + T ++ K ++ Sbjct: 622 LGLT-HKPDTMIELSSRTKASYTYVKDWMMLRECVAGCLGLLQSTSHQNAKLKLRLPEGS 680 Query: 412 VTEYRQIGYEKRQLRVEHFNNDNVDAGDIGAGKHITLLFELTLNGQKASIDKLRYAP 468 ++ +I + + GD+ G +L +L + + + L P Sbjct: 681 PAKFVKISGALHTTKRAAGRDAEAALGDLRFGDKRDILVQLVIAPDTTTQEHLPQDP 737 >UniRef50_Q083T9 Vault protein inter-alpha-trypsin domain protein n=1 Tax=Shewanella frigidimarina NCIMB 400 RepID=Q083T9_SHEFN Length = 722 Score = 154 bits (389), Expect = 8e-36, Method: Composition-based stats. Identities = 51/231 (22%), Positives = 101/231 (43%), Gaps = 17/231 (7%) Query: 183 RYELAPAPWNEQRTLLKVDILA-KDRKSEELPASNLVFLIDTSGSMISDERLPLIQSSLK 241 + P P + L + + + + L A L+ +IDTSGSM S + + + +L+ Sbjct: 311 TAQRQPNPVDNNMYSLVMLMPPSVEVSEQHLIARELILVIDTSGSM-SGQSITQAKQALQ 369 Query: 242 LLVKELREQDNIAIVTYAGDSRIA-LPSISGSHKA--EINAAIDSLDAEGSTNGGAGLEL 298 + LR+ D+ I+ + D + +S + + + N I SLDA+G T + L+ Sbjct: 370 FALAGLRDIDSFNIIEFNSDVTMLSATPLSANSRNIGKANRFIQSLDADGGTEMRSALQT 429 Query: 299 AYQQATKG------FIKGGINRILLATDGDFNVGIDDPKSIESMVKKQRESGVTLSTFGV 352 A + + + +++ TDG + + + ++ Q L T G+ Sbjct: 430 ALVDSVQQDSDQTDAHSEMLRQVIFMTDGA----VGNEHELYQLINDQLGDS-RLFTVGI 484 Query: 353 GNSNYNEAMMVRIADVGNGNYSYIDTLSEAQKVLNSEMRQMLITVAKDVKA 403 G++ N M R A +G G ++YI SE Q+ + + ++ V ++ Sbjct: 485 GSAP-NSDFMRRAATMGRGTFTYIGNESEVQQKIEQLLNKIEQPVLTNIGL 534 >UniRef50_D2R2I7 von Willebrand factor type A n=2 Tax=Planctomycetaceae RepID=D2R2I7_9PLAN Length = 786 Score = 154 bits (389), Expect = 1e-35, Method: Composition-based stats. Identities = 61/269 (22%), Positives = 103/269 (38%), Gaps = 15/269 (5%) Query: 144 LLPPPDAVRVE-EIVNYFPSDWDIKDKQSIPASKPIPFAMRYELAPAPWNEQRTLLKVDI 202 P V+ E NY P D + + P A L+ P N + Sbjct: 239 KRPDEKHATVKFEASNYLP----TTDFRLLYDVGDAPLAASV-LSYRPDNSDEGFFLMLA 293 Query: 203 LAKDRKSE-ELPASNLVFLIDTSGSMISDERLPLIQSSLKLLVKELREQDNIAIVTYAGD 261 + E +L ++F++D SGSM +++ + +++ ++ L E D IV Y Sbjct: 294 SPNHSQGEVDLTKKTVIFVVDRSGSM-QGKKIEQAREAMRYVLNNLHEGDTFNIVAYDST 352 Query: 262 S---RIALPSISGSHKAEINAAIDSLDAEGSTNGGAGLELAYQQATKGFIKGGINRILLA 318 + L + + A +D L A GSTN L+ A+ T N IL Sbjct: 353 VESFKPELQKFDDATRKSALAYVDGLYAGGSTNISGALDSAFAMLTG---SDRPNYILFL 409 Query: 319 TDGDFNVGIDDPKSIESMVKKQRESGVTLSTFGVGNSNYNEAMMVRIADVGNGNYSYIDT 378 TDG G + I + K++ + FGVG + N ++ R++ G Y+ Sbjct: 410 TDGLPTAGETNEGKIVELAKQKNVHRARMINFGVG-YDVNSRLLDRMSRENFGQSQYVRP 468 Query: 379 LSEAQKVLNSEMRQMLITVAKDVKAQIEF 407 + ++ +M V DVK I+ Sbjct: 469 DENLEASVSRLYSKMSSPVLTDVKVSIDI 497 >UniRef50_C1YR26 Uncharacterized protein containing a von Willebrand factor type A (VWA) domain n=1 Tax=Nocardiopsis dassonvillei subsp. dassonvillei DSM 43111 RepID=C1YR26_NOCDA Length = 505 Score = 153 bits (386), Expect = 2e-35, Method: Composition-based stats. Identities = 63/290 (21%), Positives = 117/290 (40%), Gaps = 24/290 (8%) Query: 215 SNLVFLIDTSGSMISDERLPLIQSSLKLLVKELREQDNIAIVTYAGDSRIALPSISGSHK 274 + L ++D SGSM RL +L LV+ L DN +V++ +R+ +P K Sbjct: 40 ATLQVVLDRSGSMGGG-RLDGAVRALLSLVERLAPSDNFGLVSFNDQARVEVPCGPLEDK 98 Query: 275 AEINAAIDSLDAEGSTNGGAGLELAYQQATKGFIKGGINRILLATDGDFNVGIDDPKSIE 334 A + I L A G T+ +GL Q+A + +LL +DG N G+ D + Sbjct: 99 ARVRRLISGLHASGGTDLSSGLLRGVQEARRA-GADRGGTLLLISDGHANQGVTDHDLLR 157 Query: 335 SMVKKQRESGVTLSTFGVGNSNYNEAMMVRIADVGNGNYSYIDTLSEAQKVLNSEMRQML 394 + GVT ++ G G Y+E ++ +AD G G+ + + A ++ E +L Sbjct: 158 QVAADAYAHGVTTTSLGYGLG-YDEELLGAVADGGAGSALFAEDPDTAGGLIAREAEYLL 216 Query: 395 ITVAKDVKAQIEFNPAWVTEYRQIGYEKRQLRVEHFNNDNVDAGDIGAGKHITLLFELTL 454 A+ V ++ P + +G E R+ + V+ GD + Sbjct: 217 AKTAQAVSLRVPSGP-LLRSVSVVG-EMPSHRLADGS-VVVELGDFHS------------ 261 Query: 455 NGQKASIDKLRYAPDNKLAKSDKTKELAWLKIRWKYPQGKESQLVEFPLG 504 ++ R ++ +A L++ + P +++ V + Sbjct: 262 ------GERRRLLLRLEVRGLSAPGAVAALEVAYADPATLDTRTVSLSVE 305 >UniRef50_B9GN58 Predicted protein (Fragment) n=3 Tax=rosids RepID=B9GN58_POPTR Length = 705 Score = 152 bits (385), Expect = 2e-35, Method: Composition-based stats. Identities = 46/258 (17%), Positives = 91/258 (35%), Gaps = 33/258 (12%) Query: 187 APAPWNEQRTLLKVDILAKDRKSEELPASNLVFLIDTSGSMISDERLPLIQSSLKLLVKE 246 AP P T + A +L+ ++D S SM +L +++ +++L++ Sbjct: 303 APPPLPSLTTRNSSNSTASLLDPSRRAPIDLITVLDVSASMTG-AKLQMLKRAMRLVISS 361 Query: 247 LREQDNIAIVTYAGDSRIALP--SISGSHKAEINAAIDSLDAEGSTNGGAGLELAYQQAT 304 L D ++IV ++ + LP ++ + + ID L ++ G L A + Sbjct: 362 LGSADRLSIVAFSSSPKRLLPLKRMTPNGQRSARRIIDRLVCGQGSSVGEALRKATKVLE 421 Query: 305 KGFIKGGINRILLATDGDFNVGIDDPKSIESMVKKQRESGVTLSTFGVGNSNYNEAMMVR 364 + + I+L +DG IE + + +FG G S N Sbjct: 422 DRRERNPVASIMLLSDGQDERSSTRFAHIE----------IPVHSFGFGQSGGNSQ---- 467 Query: 365 IADVGNGNYSYIDTLSEAQKVLNSEMRQMLITVAKDVKAQIEFNPAWVTEYRQIGYEKRQ 424 A+ + +L V +D++ Q+ F + + Sbjct: 468 ---------------EPAEDAFAKCVGGLLSVVVQDLRIQLGFA-SSSAPAEIVAVYPCN 511 Query: 425 LRVEHFNNDNVDAGDIGA 442 R + +V GD+ A Sbjct: 512 SRPNVLGSGSVRLGDLYA 529 >UniRef50_A7HVH6 Vault protein inter-alpha-trypsin domain protein n=1 Tax=Parvibaculum lavamentivorans DS-1 RepID=A7HVH6_PARL1 Length = 755 Score = 152 bits (384), Expect = 4e-35, Method: Composition-based stats. Identities = 93/527 (17%), Positives = 177/527 (33%), Gaps = 50/527 (9%) Query: 24 ENKESQQQQPSTPTEQQVLAAQQAAIKEAEQSAAAAKALAQQEVQQYSDKQALQGRLQEA 83 E ++Q + S +Q+ + + QQ V++ D+ +L+ + A Sbjct: 147 EEAKAQGHKASLVEQQRPNVFTNSVANIGPGETIIVQIEYQQTVRRDGDRFSLRFPMVVA 206 Query: 84 PTFARAAKAKATHIANPGT---ARYQQFDDNPVKQVAQ--------NPLATFSLDVDTG- 131 P + PG Q +N ++Q NP++ +L +D G Sbjct: 207 PRYTPKTADPQLVDFAPGGGWGEVRQSEPENDLEQPPVLHPAQGQINPVS-LALSLDAGF 265 Query: 132 -----SYANVRRFLNQGLLPPPDAVRVEEIVNYFPSDWDIKDKQSIPASKPIPFAMRYEL 186 S + + LN+ EE+ P++ D + A+K A+ E Sbjct: 266 ALGDISSTHHKIALNRDGKQKATLKLAEEL---TPANKDFELVWKPAAAKAPAAALFRER 322 Query: 187 APAPWNEQRTLLKVDILAKDRKSEELPASNLVFLIDTSGSMISDERLPLIQSSLKLLVKE 246 NE L+ + + + E P + F+ID SGSM + + SL + Sbjct: 323 V---GNEDYLLVMLTPPSGSVQPEAKPREAI-FVIDNSGSMSGP-SMVQAKESLLWALDR 377 Query: 247 LREQDNIAIVTYAGDSRIALP---SISGSHKAEINAAIDSLDAEGSTNGGAGLELAYQQA 303 L+ D ++ + + P G + A + SL+A G T L Sbjct: 378 LKPGDTFNVIRFDDTLTVLFPDAVPAHGENLAVAKKFVKSLEANGGTEMLPALRA--SLI 435 Query: 304 TKGFIKG-GINRILLATDGDFNVGIDDPKSIESMVKKQRESGVTLSTFGVGNSNYNEAMM 362 + G + +I+ TDG I + + + L T G+G++ N M Sbjct: 436 DRNVNDGTRLRQIVFLTDGA----ISNEAELFHEITSNLGRS-RLFTVGIGSAP-NSYFM 489 Query: 363 VRIADVGNGNYSYIDTLSEAQKVLNSEMRQMLITVAKDVKAQI------EFNPAWVTEYR 416 R ++ G G +++I +E + + ++ V ++ A E P V + Sbjct: 490 TRASEAGRGTFTHIGKETEVTERMAELFEKLQNPVMTNITATWPDGRTTESWPNPVPDLY 549 Query: 417 QIGYEKRQLRVEHFNNDNVDAGDIGAGKHITLLFELTLNGQKASIDKLRYAPDNKLAKSD 476 + R+ GD+ AG+ + +L + I KL ++D Sbjct: 550 KGEPVVLSARMPKATGTLTLKGDV-AGEPWEVRMQLNAGETRPGIGKLWARNKIGQLEAD 608 Query: 477 KTKELAWLK-----IRWKYPQGKESQLVEFPLGPTINAPSEDMRFRA 518 W K +R S+ + + + Sbjct: 609 AQIAGDWEKHDAEILRVALDHNLVSRRTSLVAVDVTPSRPAGEKLAS 655 >UniRef50_Q09E12 Inter-alpha-inhibitor H4 heavy chain, putative n=1 Tax=Stigmatella aurantiaca DW4/3-1 RepID=Q09E12_STIAU Length = 540 Score = 151 bits (381), Expect = 8e-35, Method: Composition-based stats. Identities = 49/195 (25%), Positives = 86/195 (44%), Gaps = 7/195 (3%) Query: 206 DRKSEELPASNLVFLIDTSGSMISDERLPLIQSSLKLLVKELREQDNIAIVTYAGDSRIA 265 + + E+ A + F+IDTSGSM R+ + + +LK V L QD +V ++ D Sbjct: 61 EVSASEIAAKRVTFVIDTSGSM-QGSRMQIAKDALKYCVTRLNPQDTFNVVRFSTDVEAL 119 Query: 266 LPSISGSHKAEINAA---IDSLDAEGSTNGGAGLELAYQQATKGFIKGGINRILLATDGD 322 P++ + I A ++ L+A G T L Q + ++ TDG Sbjct: 120 FPALKSAQPENIQKAVAFVEQLEAIGGTAIDEALVRGLQ--DNDGKSSAPHLLMFITDGQ 177 Query: 323 FNVGIDDPKSIESMVKKQRESGVTLSTFGVGNSNYNEAMMVRIADVGNGNYSYIDTLSEA 382 +G D +I K R++ L TFGVG + N ++ R++ G G ++ E Sbjct: 178 PTIGETDEGAIAQHAKDGRKAKTRLFTFGVGE-DLNARLLDRLSSDGAGTSDFVRDGKEF 236 Query: 383 QKVLNSEMRQMLITV 397 + ++S ++ V Sbjct: 237 ETKISSFYDKVSNPV 251 >UniRef50_A8FW78 Vault protein inter-alpha-trypsin domain protein n=1 Tax=Shewanella sediminis HAW-EB3 RepID=A8FW78_SHESH Length = 770 Score = 150 bits (380), Expect = 9e-35, Method: Composition-based stats. Identities = 51/236 (21%), Positives = 91/236 (38%), Gaps = 25/236 (10%) Query: 185 ELAPAPWNEQRTLLKVDILAKDRKSEELPASNLVFLIDTSGSMISDERLPLIQSSLKLLV 244 EL P + L + + KS + L+ +IDTSGSM S + + ++K + Sbjct: 347 ELNAKP--DADYALVMLLPPSLEKSRNRVSRELILVIDTSGSM-SGSAMEQAKKAMKYAL 403 Query: 245 KELREQDNIAIVTYAGDSRIALP---SISGSHKAEINAAIDSLDAEGSTNGGAGLELAYQ 301 L D ++ + S + N + SL ++G T LE A Sbjct: 404 AGLGSDDTFNVIEFNSKVSSLSKGPIPASTKNIEMANRFVHSLTSDGGTEMALALEHALG 463 Query: 302 QATKGFI-------------KGGINRILLATDGDFNVGIDDPKSIESMVKKQRESGVTLS 348 Q + G + ++L TDG + + + ++K R L Sbjct: 464 QESGGSSWQETGLQGKDEESTSRLRQVLFMTDGA----VGNEAELFKLIK-YRIGKSRLF 518 Query: 349 TFGVGNSNYNEAMMVRIADVGNGNYSYIDTLSEAQKVLNSEMRQMLITVAKDVKAQ 404 T G+G++ N M R A+ G G ++YI L E Q+ + + ++ D++ Sbjct: 519 TLGIGSAP-NSHFMQRAAEFGRGTFTYIGDLDEVQEKIQGLLYKIEHPQITDIELH 573 >UniRef50_Q2QZN5 Putative uncharacterized protein n=1 Tax=Oryza sativa Japonica Group RepID=Q2QZN5_ORYSJ Length = 553 Score = 150 bits (380), Expect = 1e-34, Method: Composition-based stats. Identities = 69/320 (21%), Positives = 114/320 (35%), Gaps = 49/320 (15%) Query: 226 SMISDERLPLIQSSLKLLVKELREQDNIAIVTYAGDS----RIALPSISGSHKAEINAAI 281 SM RL L++ ++K++ +L D +AIV + G L ++ +A+ NA + Sbjct: 6 SMHGWTRLDLVKGAMKMVTNKLGAGDRLAIVPFNGKVVAAGATRLMEMTTKGRADANAKV 65 Query: 282 DSLDAEGSTNGGAGLELAYQQATKGFIKGGINR---ILLATDGDFNVGIDDPKSIESMVK 338 + L A G T L+ A R I L +DG N +DD Sbjct: 66 NQLKAGGDTKFLPALKHASGLLDSRPAGDKQYRPGFIFLLSDGQDNGVLDD--------- 116 Query: 339 KQRESGVTL--STFGVGNSNYNEAMMVRIADVGNGNYSYIDT-LSEAQKVLNSEMRQMLI 395 + GV TFG+ S N MV IA G+Y ID LS + L + + Sbjct: 117 --KLGGVRYPAHTFGMCQSRCNPKSMVHIATATKGSYHPIDDKLSNVAQALAVFLSGITS 174 Query: 396 TVAKDVKAQIEFNPAWVTEYRQIGYEKRQLRVEHFNN-----DNVDAGDIGAGKHITLLF 450 VA + + Q+ +I +E N ++ G + A + + Sbjct: 175 AVAVNARVQLHVADNSGVLINKIDSGAYDKTIESGNGKASSKGTINVGVLSAEEDKKFIV 234 Query: 451 ELTLNGQKASIDKLRYAPDNKLAKSDKTKELAWLKIRWKYPQG--------KESQLVEFP 502 L + P + A++ + L + + P G + S VE P Sbjct: 235 YLDV-------------PKLENAQAKPPQLLLTVAGEYSTPAGGRKVENMEESSVQVERP 281 Query: 503 LGPTINAPSED--MRFRAAV 520 + D + + AV Sbjct: 282 APAGGATKTGDHLVTWSEAV 301 >UniRef50_D0LTR8 von Willebrand factor type A n=1 Tax=Haliangium ochraceum DSM 14365 RepID=D0LTR8_HALO1 Length = 903 Score = 150 bits (379), Expect = 1e-34, Method: Composition-based stats. Identities = 50/240 (20%), Positives = 92/240 (38%), Gaps = 18/240 (7%) Query: 208 KSEELPASNLVFLIDTSGSMISDERLPLIQSSLKLLVKELREQDNIAIVTYAGDSRIALP 267 K E P + ++D SGSM S ++ + S + + L D I +V + + Sbjct: 451 KQREQPHVAIALVVDRSGSM-SGLKIEAAKESARATAEVLSPSDLITVVAFDNQPTTIVR 509 Query: 268 SISGSHKAEINAAIDSLDAEGSTNGGAGLELAYQQATKGFIKGGINRILLATDGDFNVGI 327 S++ I I L A G TN L AY+ K + +++ +DG Sbjct: 510 LQRASNRMRIATDIARLQAGGGTNIYPALREAYEILQGANAK--VKHVIVLSDGQAPY-- 565 Query: 328 DDPKSIESMVKKQRESGVTLSTFGVGNSNYNEAMMVRIADVGNGNYSYIDTLSEAQKVLN 387 I + ++ R + +T+S G+G+++ N ++ I D G+G D L+ ++ Sbjct: 566 ---DGIADLCQEMRSARITVSAVGIGDADRN--LLNLITDNGDGRLYMTDDLAALPRIFM 620 Query: 388 SEMRQML------ITVAKDVKAQIEFNPAWVTEYRQI--GYEKRQLRVEHFNNDNVDAGD 439 E + V V ++E E + GY + + D G+ Sbjct: 621 KETTEAQRSALVESPVRAHVLKRVEMIEGTGVENAPLLRGYVTTKPKPRSEVILVSDLGE 680 >UniRef50_A1U6Y4 Vault protein inter-alpha-trypsin domain protein n=2 Tax=Marinobacter RepID=A1U6Y4_MARAV Length = 712 Score = 150 bits (378), Expect = 2e-34, Method: Composition-based stats. Identities = 94/530 (17%), Positives = 168/530 (31%), Gaps = 75/530 (14%) Query: 23 PENKESQQQQPSTPTEQQVLAAQQAAIKEAE-------------QSAAAAKALAQQEVQQ 69 E + Q QP Q A+QA K A + A + + + Q Sbjct: 137 GERRIVGQLQPRAQARQNYEKAKQAGQKAATVEQNRPNLFTSRIANIAPGEEVTVEVQYQ 196 Query: 70 YSDKQALQGRLQEAPTFARAAKAKATHIANPGTARYQQFDDNPVKQVAQNPLATFSLDVD 129 PT +A P +A + + + ++ F++ D Sbjct: 197 QPVNYRHGEFELRLPTTLTPRYMPGAPVATPASAWQSGWSLPTTQVADADEISPFTVLPD 256 Query: 130 TGSYANVRRFLN---QGLLP--------PPDAVRVEEI-------VNYFPSDWDIKDKQS 171 + + R + + LP P V +E D D+ + Sbjct: 257 DVNPGSHRATIQLDIEAGLPVDEVTSPSHPLQVELEGSRATVSPEQGQILMDRDVIVRW- 315 Query: 172 IPASKPIPFAMRYELAPAPWNEQRTLLKVDILAKDRKSEELPASNLVFLIDTSGSMISDE 231 PA P L W + L+ + + + ++ L+F+IDTSGSM + E Sbjct: 316 RPADNQAP---TAALFRQQWQGEDFLMAMVM--PPATTGQVLRRELLFVIDTSGSM-AGE 369 Query: 232 RLPLIQSSLKLLVKELREQDNIAIVTYAGDSRIALPSISGSH---KAEINAAIDSLDAEG 288 + +S+L + LR D ++ + + ++ A + L A+G Sbjct: 370 SIRQARSALLRGLDTLRPGDRFNVIQFNSQAHALYTQPVPANGHYLARARDYVQDLTADG 429 Query: 289 STNGGAGLELAYQQATKGFIKGGINRILLATDGDFNVGIDDPKSIESMVKKQRESGVTLS 348 T L LA G + +++ TDG + + ++ ++ L Sbjct: 430 GTEMAGALSLAMGM-DGSESSGHVQQMVFMTDGA----VGNESALFDQIRTGL-GNRRLF 483 Query: 349 TFGVGNSNYNEAMMVRIADVGNGNYSYIDTLSEAQKVLNSEMRQMLITVAKDVKAQIEFN 408 T +G++ N + A G G Y+ + + +E K L M V DV+ Q N Sbjct: 484 TVAIGSAP-NMHFLREAARWGRGQYTAVHSAAEVDKALGKLFAAMEAPVMTDVEVQWPGN 542 Query: 409 PAWVTEYRQIGYEKRQLRVEHFNNDNVDAGDIGAGKHITLLF-------ELTLNGQKASI 461 A + GD+ G+ + + ELT++G+ Sbjct: 543 AAQPVPAK--------------------PGDLFHGQPLLQVVRGAPSEGELTVSGRLPGG 582 Query: 462 DKLRYAPDNKLAKSDKTKELAWLKIRWKYPQGKESQLVEFPLGPTINAPS 511 R + D A K + W + R P I S Sbjct: 583 RSWRTSLDLASAAPGKGLDRQWARGRIDAVMDSARLAGTEPDEAAIVELS 632 >UniRef50_B7RYC9 Vault protein inter-alpha-trypsin n=1 Tax=marine gamma proteobacterium HTCC2148 RepID=B7RYC9_9GAMM Length = 686 Score = 149 bits (377), Expect = 2e-34, Method: Composition-based stats. Identities = 64/387 (16%), Positives = 139/387 (35%), Gaps = 28/387 (7%) Query: 31 QQPSTPTEQQVLAAQQAAIKEAEQSAAAAKALAQQEVQQYSDKQALQGRLQEAPTFA--- 87 + T + V + + + + + + V+ + Q + + P+ Sbjct: 137 GEKITVRLEYVQSVEHRSGRFSLRLPTTITPRYMPGVETETANQDMNENVAVTPSHGWAW 196 Query: 88 RAAKAKATHIANPGTARYQQFDDNPVKQVAQNPLATFSLDVDTGSY-ANVRRFLNQGLLP 146 + H+ +P Q D P+ ++ S +D G A++ ++ L Sbjct: 197 PTDQVTDAHLISPLQYFAQGSDSAPLNRIK------ISARLDMGMPLASIDSPYHEIALS 250 Query: 147 PPDAV-RVEEIVNYFPSDWDIKDKQSIPASKPIPFAMRYELAPAPWNEQRTLLKVDILAK 205 V V+ D D + S + A E +Q L + + Sbjct: 251 RRAGVYSVKLAQGSAEMDRDFVLQWSAASGSLPGAAFFTERVD----DQYYGLLMLVPPA 306 Query: 206 DRKSEELPASNLVFLIDTSGSMISDERLPLIQSSLKLLVKELREQDNIAIVTYAGDSRIA 265 +++ E +VF++DTSGSM + + SL ++ L D ++ + R Sbjct: 307 SQRAAETVPREIVFVVDTSGSM-GGVSIKQAKGSLTRALRHLGPNDRFNVIEFNSSHRAL 365 Query: 266 LP---SISGSHKAEINAAIDSLDAEGSTNGGAGLELAYQ---QATKGFIKGGINRILLAT 319 S + + + L+A G T L+LA + + + + +++ T Sbjct: 366 FQHAVPASHHNLQLASEYVRHLEASGGTEMMPALQLALKLPGAQDELRPEPALRQVIFIT 425 Query: 320 DGDFNVGIDDPKSIESMVKKQRESGVTLSTFGVGNSNYNEAMMVRIADVGNGNYSYIDTL 379 DG + + ++ + G L T G+G++ N M + A+ G G ++YI + Sbjct: 426 DGA----VGNESALFEHIVDSL-GGSRLFTVGIGSAP-NAWFMRKAAEYGRGTFTYIGDV 479 Query: 380 SEAQKVLNSEMRQMLITVAKDVKAQIE 406 +E + +++ + VA + Sbjct: 480 AEVGEKMDALFLNLTRPVATHLNVDWP 506 >UniRef50_C4WI90 Poly [ADP-ribose] polymerase 4 n=1 Tax=Ochrobactrum intermedium LMG 3301 RepID=C4WI90_9RHIZ Length = 777 Score = 149 bits (375), Expect = 4e-34, Method: Composition-based stats. Identities = 81/492 (16%), Positives = 168/492 (34%), Gaps = 64/492 (13%) Query: 37 TEQQVLAAQQAAIKEAEQSAAAAKALAQQEVQQYSDKQALQGRLQEAPTFARAAKAKATH 96 + LA ++ +++ A V Q + Q G+ +A T Sbjct: 218 QQAVRLADERFSLRVPLVVAPRYNPNIASPVVQKVEMQNGWGKSSDAGKPDPYNAPIVTP 277 Query: 97 IANPGTARYQQFDDNPVKQVAQNPLATFSLDVDTGSY-ANVRRFLNQGLLPPPDAVRVEE 155 + P R NP++ S+++ G V ++ + + E Sbjct: 278 LTPPAELR-------------TNPVS-ISVELKPGFPLGKVESLYHKVRIETTNDATREI 323 Query: 156 IV-NYFPSDWDIKDKQSIPASKPIPFAMRYELAPAPWNEQRTLLKVDILAKDRKSEELPA 214 + +D D + S A+ + E + + + S + Sbjct: 324 TLDGTAAADRDFVLEWSAVANDAPQVGLFREHVG-----KDDYVLAYVTPPAVASAKKAQ 378 Query: 215 SNLVFLIDTSGSMISDERLPLIQSSLKLLVKELREQDNIAIVTYAGDSRIALPS---ISG 271 +VF+ID SGSM + ++SL + L+ D ++ + S Sbjct: 379 REVVFVIDNSGSM-GGTSIEQAKASLDYALSHLQPGDRFNVIRFDDTLTRFFEVSVEASQ 437 Query: 272 SHKAEINAAIDSLDAEGSTNGGAGLELAYQQATKGFIKGGINRILLATDGDFNVGIDDPK 331 + A + SL+A+G T L A + +G G+ +I+ TDG+ I + + Sbjct: 438 QNIASARHFVMSLEAQGGTAMLPALHAALDDSHQG---NGLRQIVFLTDGE----ISNEQ 490 Query: 332 SIESMVKKQRESGVTLSTFGVGNSNYNEAMMVRIADVGNGNYSYIDTLSEAQKVLNSEMR 391 + + +R + G+G + N +M A++G G +++I + +E + + + Sbjct: 491 QLLDAIAARRGRS-RIFMVGIGTAP-NSYLMNHAAELGRGTFTHIGSAAEVDERMRALFD 548 Query: 392 QMLITVAKDVKAQIEFNPAWVTEYRQIGYEKRQLRVEHFNNDNVDAGDIGAGKHITLLFE 451 ++ D+KA F+ V+ I D+ G+ + + Sbjct: 549 KLENPAVTDLKAN--FSEKNVSMTPSI------------------LPDLYRGEPLVIAAR 588 Query: 452 LTLNGQ----KASIDKLRYAPDNKLAKSDKTKELAWLKIRWKYPQGKESQLVEFPLGPTI 507 + + ID + + L ++ K + ++ L R K VE LG Sbjct: 589 MGKAAGNLVIEGQIDGRPWTVNLPLDQAMKAEGISKLWARRKIDDA----EVELTLGKIS 644 Query: 508 NAPSED--MRFR 517 + +R Sbjct: 645 QDAANARILRLA 656 >UniRef50_B5ZY26 Vault protein inter-alpha-trypsin domain protein n=11 Tax=Rhizobium RepID=B5ZY26_RHILW Length = 794 Score = 149 bits (375), Expect = 4e-34, Method: Composition-based stats. Identities = 51/287 (17%), Positives = 105/287 (36%), Gaps = 36/287 (12%) Query: 215 SNLVFLIDTSGSMISDERLPLIQSSLKLLVKELREQDNIAIVTYAG---DSRIALPSISG 271 +VF+ID SGSM + + SL L + L D ++ + D L + + Sbjct: 354 REVVFVIDNSGSMSGP-SIEQAKQSLALAISRLTPNDRFNVIRFDDTMTDYFKGLVAATP 412 Query: 272 SHKAEINAAIDSLDAEGSTNGGAGLELAYQQATKGFIKGGINRILLATDGDFNVGIDDPK 331 ++ + A + L A+G T LE A + G + +++ TDG I + + Sbjct: 413 DNREKAIAYVRGLPADGGTEMLPALEDALRN-QGPVATGALRQVVFLTDGA----IGNEQ 467 Query: 332 SIESMVKKQRESGVTLSTFGVGNSNYNEAMMVRIADVGNGNYSYIDTLSEAQKVLNSEMR 391 + + R + T G+G++ N M + A++G G ++ I + + + Sbjct: 468 QLFQEITANR-GDARVFTVGIGSAP-NTYFMTKAAEIGRGTFTQIGSTDQVASRMGELFA 525 Query: 392 QMLITVAKDVKAQIEFNPAWVTEYRQIGYEKRQLRVEHFNNDNVDAGDIGAGKHITLLFE 451 ++ D+ A E + E + D+ +G+ + L E Sbjct: 526 KLQNPAMTDIAANFE-----------------GIAAEDITPN--PMPDLYSGEPVVLTAE 566 Query: 452 LTLNGQKASIDKL------RYAPDNKLAKSDKTKELAWLKIRWKYPQ 492 L + ++ + + +A + ++ L R K Sbjct: 567 LPGDKPAGKLEIIGKTGDQPWRVQMDIANAADGNGISKLWARRKIDD 613 >UniRef50_A6X8G3 LPXTG-motif cell wall anchor domain protein n=11 Tax=Rhizobiales RepID=A6X8G3_OCHA4 Length = 750 Score = 148 bits (374), Expect = 4e-34, Method: Composition-based stats. Identities = 43/212 (20%), Positives = 93/212 (43%), Gaps = 13/212 (6%) Query: 196 TLLKVDILAKDRKSEELPASNLVFLIDTSGSMISDERLPLIQSSLKLLVKELREQDNIAI 255 + + S + ++F+ID SGSM + ++SL + +L+ D + Sbjct: 333 DYVLAYVTPPALASPKKVQREVIFVIDNSGSM-GGTSIEQAKASLDYALSQLQPGDRFNV 391 Query: 256 VTYAGDSRIALPSISGSHKAEINAA---IDSLDAEGSTNGGAGLELAYQQATKGFIKGGI 312 + + +++ I +A + SL+A+G T L A + +G G+ Sbjct: 392 IRFDDTLTKFFEDSVDANQENIASARRFVTSLEAQGGTEMLPALHAALDDSNQG---NGL 448 Query: 313 NRILLATDGDFNVGIDDPKSIESMVKKQRESGVTLSTFGVGNSNYNEAMMVRIADVGNGN 372 +I+ TDG+ I + + + V +R + G+G++ N +M R A++G G Sbjct: 449 RQIVFLTDGE----ISNEQQLLDAVAARRGRS-RIFMVGIGSAP-NSYLMNRAAELGRGT 502 Query: 373 YSYIDTLSEAQKVLNSEMRQMLITVAKDVKAQ 404 +++I + +E + + + ++ D+KA Sbjct: 503 FTHIGSAAEVDERMRALFDKLENPAVTDLKAN 534 >UniRef50_Q14624 35 kDa inter-alpha-trypsin inhibitor heavy chain H4 n=38 Tax=Eutheria RepID=ITIH4_HUMAN Length = 930 Score = 148 bits (373), Expect = 7e-34, Method: Composition-based stats. Identities = 45/197 (22%), Positives = 89/197 (45%), Gaps = 13/197 (6%) Query: 214 ASNLVFLIDTSGSMISDERLPLIQSSLKLLVKELREQDNIAIVTYAGDSRIALP---SIS 270 N+VF+ID SGSM S ++ + +L ++ +L +D ++ ++ ++ P S Sbjct: 272 PKNVVFVIDKSGSM-SGRKIQQTREALIKILDDLSPRDQFNLIVFSTEATQWRPSLVPAS 330 Query: 271 GSHKAEINAAIDSLDAEGSTNGGAGLELAYQQA-----TKGFIKGGINRILLATDGDFNV 325 + + + + A G TN + +A Q + +G ++ I+L TDGD V Sbjct: 331 AENVNKARSFAAGIQALGGTNINDAMLMAVQLLDSSNQEERLPEGSVSLIILLTDGDPTV 390 Query: 326 GIDDPKSIESMVKKQRESGVTLSTFGVGNSNYNEAMMVRIADVGNGNYSYI-DTLSEAQK 384 G +P+SI++ V++ +L G G + + A + ++A G I + A + Sbjct: 391 GETNPRSIQNNVREAVSGRYSLFCLGFGF-DVSYAFLEKLALDNGGLARRIHEDSDSALQ 449 Query: 385 V--LNSEMRQMLITVAK 399 + E+ L+T Sbjct: 450 LQDFYQEVANPLLTAVT 466 >UniRef50_UPI000174639B hypothetical protein VspiD_18615 n=1 Tax=Verrucomicrobium spinosum DSM 4136 RepID=UPI000174639B Length = 868 Score = 147 bits (372), Expect = 9e-34, Method: Composition-based stats. Identities = 42/187 (22%), Positives = 79/187 (42%), Gaps = 9/187 (4%) Query: 202 ILAKDRKSEELPASNLVFLIDTSGSMISDERLPLIQSSLKLLVKELREQDNIAIVTYAGD 261 + K EE +S L +ID SGSM S E+L + +S+ + L D+I + + + Sbjct: 400 VRLKAPDEEEKQSSALALVIDRSGSM-SGEKLEMAKSAAIATAEVLTRNDSIGVYAFDSE 458 Query: 262 SRIALPSISGSHKAEINAAIDSLDAEGSTNGGAGLELAYQQATKGFIKGGINRILLATDG 321 + + +P + + + I L + G TN A + K I +++ TDG Sbjct: 459 AHVVVPMTRLTSSSAVAGQIAGLTSGGGTNLHPAFTEARNALQR--TKAKIKHMIILTDG 516 Query: 322 DFNVGIDDPKSIESMVKKQRESGVTLSTFGVGNSNYNEAMMVRIADVGNGNYSYIDTLSE 381 + E++ + R GVT+ST +G+ + ++ IA +G G + Sbjct: 517 Q-----TSGQGYEALASQCRAEGVTISTVAIGDGAHV-GLLQAIASLGGGKSYTTLDAAN 570 Query: 382 AQKVLNS 388 ++ Sbjct: 571 IVRIFTQ 577 >UniRef50_B0TQ23 LPXTG-motif cell wall anchor domain n=1 Tax=Shewanella halifaxensis HAW-EB4 RepID=B0TQ23_SHEHH Length = 850 Score = 147 bits (372), Expect = 9e-34, Method: Composition-based stats. Identities = 98/493 (19%), Positives = 167/493 (33%), Gaps = 76/493 (15%) Query: 44 AQQAAIKEAEQSAAAAKALAQQEVQQYS----DKQALQGRLQEAPTFARAAKAKATHIAN 99 Q + + +S A A+ + E + + + Q R+ TF A +I + Sbjct: 272 KQSNELIDINKSVYAHSAVVKAEDEALESEALESRERQNRVSMTVTFD--AAMPIENIVS 329 Query: 100 PGTARYQQFDDNPVKQVAQNPLATFSLD-VDT-----GSYANVRRFLNQGLLPPPDAVRV 153 P +N QV+ + A + D V T GS F QG A +V Sbjct: 330 PYHGISINMVENAAAQVSLDNYAVANRDFVLTWQPVQGSEPTAAVFSQQGKTHAELASQV 389 Query: 154 EEIVNYFPSDWD----IKDKQSIPASKPIPFAMRYELAPAPWNEQRTLLKVDILAKDRKS 209 F Q P S+ + + E+ L+ + Sbjct: 390 TAGDTSFNQGGAKSKLSPQSQPEPQSQLQVQDSKQQTLSKKALEKYALVMLMPPQGSDDE 449 Query: 210 EELPASNLVFLIDTSGSMISDERLPLIQSSLKLLVKELREQDNIAIVTYAGDSRI---AL 266 A LV +IDTSGSM D + +S+LK + LR QD+ ++ + + Sbjct: 450 SSSIARELVLVIDTSGSMSGDAII-QAKSALKYALAGLRPQDSFNVLQFNSTVERWSRHV 508 Query: 267 PSISGSHKAEINAAIDSLDAEGSTNGGAGLELAYQQATKGFIKG---------------- 310 + + I+ L A+G T L+ A + Sbjct: 509 MPATAINLGRAQNYINGLQADGGTEMSLALDAALTKLDNDRGHNSKPVHDDDRYQSSNET 568 Query: 311 -------GINRILLATDGDFNVGIDDPKSIESMVKKQRESGVTLSTFGVGNSNYNEAMMV 363 + ++L TDG + + + +K Q L T G+G++ N M Sbjct: 569 LEQSAATPLRQVLFITDGA----VANESRLFEQIKNQLGES-RLFTIGIGSAP-NAHFMQ 622 Query: 364 RIADVGNGNYSYIDTLSEAQKVLNSEMRQMLITVAKDVKAQIEFNPAWVTEYRQIGYEKR 423 R A+VG G Y+YI L E + + S + ++ DV ++ F+ V +Y Sbjct: 623 RAAEVGRGTYTYIGKLDEVNQKVVSLLEKIEKPQVTDV--ELHFSDGSVPDY-------- 672 Query: 424 QLRVEHFNNDNVDAGDIGAGKHITLLFE--------LTLNGQKASIDKLRYAPDNKLAKS 475 V D+ A + + + L + GQ A R P N A+ Sbjct: 673 ---------WPVRIPDLYAHEPVLVALRIPSYVSDDLLIQGQLAGQFWQRRLPLNSAAQV 723 Query: 476 DKTKELAWLKIRW 488 + ++ L + W Sbjct: 724 NDLEQAKGLDLIW 736 >UniRef50_A5UW94 von Willebrand factor, type A n=2 Tax=Roseiflexus RepID=A5UW94_ROSS1 Length = 452 Score = 147 bits (371), Expect = 1e-33, Method: Composition-based stats. Identities = 58/253 (22%), Positives = 104/253 (41%), Gaps = 23/253 (9%) Query: 224 SGSMISDER------LPLIQSSLKLLVKELREQDNIAIVTYAGDSRIALPSISGSHKAEI 277 SGS+ + R L + +L +V+ L D +++V +A + + +P + GS + + Sbjct: 91 SGSVPQEVRKAASSALDHVVHALHTVVERLDRNDRLSLVVFADHALLLIPGMVGSDRVTL 150 Query: 278 NAAIDS---LDAEGSTNGGAGLELAYQQATKGFIKGGINRILLATDGDFNVGIDDPKSIE 334 AI+ LD TN G+ LA Q NR+LL TDG DP + Sbjct: 151 VRAIERLPGLDLGDGTNLADGIALALNQIRANRDARRANRVLLLTDGF----TRDPAACL 206 Query: 335 SMVKKQRESGVTLSTFGVGNSNYNEAMMVRIADVGNGNYSYIDTLSEAQKVLNSEMRQML 394 ++ + + + ++T G+G + + ++ IAD GN ++ S + +++E+ Sbjct: 207 TLADQAADEHIAITTIGLG-GEFQDDLLTGIADRSGGNALFLKRASAIPRAISAELESAR 265 Query: 395 ITVAKDVKAQIEFNPAWVTEYRQIGYEKRQLRV--EHFNNDNVDA-----GDIGAGKHIT 447 V I P R++ + L + E D GD+ AG +T Sbjct: 266 AAALPGV--DIAIAPMRGVMLRRVTRTRPVLAILAEPTGTGASDVVSVLLGDLPAGSPVT 323 Query: 448 LLFELTLNGQKAS 460 LL E + Sbjct: 324 LLLEFLVPAANPG 336 >UniRef50_A6FXN3 Putative uncharacterized protein n=1 Tax=Plesiocystis pacifica SIR-1 RepID=A6FXN3_9DELT Length = 416 Score = 147 bits (371), Expect = 1e-33, Method: Composition-based stats. Identities = 72/369 (19%), Positives = 133/369 (36%), Gaps = 49/369 (13%) Query: 217 LVFLIDTSGSMISDERLPLIQSSLKLLVKELREQDNIAIVTYAGDSRIALPSI--SGSHK 274 +V ++DTS SM D + +++ LV L E D+ A+V + + + +PS + + Sbjct: 1 MVLVVDTSASMKGDA-IEGAKAAAMELVDGLAEGDSFALVVFHSRAEVLMPSTVINEDSR 59 Query: 275 AEINAAIDSLDAEGSTNGGAGLELAYQQAT----------------KGFIKGGINRILLA 318 A + I+++ A G+T+ GL+ A Q + R++L Sbjct: 60 AAARSKIETMQAWGTTDLAGGLQQALAQLQVAQNIVGAGGSTGAQSGAPDPTVLERVVLL 119 Query: 319 TDGDFNVGIDDPKSIESMVKKQRESGVTLSTFGVGNSNYNEAMMVRIADVGNGNYSYIDT 378 DG N D +I S V + G ++ G G Y+E ++ +A+ +G++ ++D Sbjct: 120 GDGVPN----DASTIPSTVGQLAARGTQITALGYGI-EYDETLLASLAEQTHGSFRFVDD 174 Query: 379 LSEAQKVLNSEMRQMLITVAKDVKAQIEFNPAWVTEYRQIGYEKRQLRVEHFNNDNVDAG 438 + E+ + TVA D++ + P V IG R + V G Sbjct: 175 PEAVASLFRDEVLDIERTVANDLRLSVGLGPG-VQLLEVIG---RPAAWAGGSKLEVQLG 230 Query: 439 DIGAGKHITLLFELTLNGQKASIDKLRYAPDNKLAKSDKTKELAWLKIRWKYPQGKESQL 498 + G+ ++ L + T EL+ L++ + Q Sbjct: 231 SLSEGQSHDIMLRLAVGPHS----------------EGATVELSDLELSFADVYTGTGQR 274 Query: 499 VEFPLGPTINAPSEDMRFRAAVAAYGQKLRGSEYLNNTSWQQIKQWAQQAKGEDPQGYRA 558 E D + LR + Q QA+ D QG + Sbjct: 275 FERDFLSVEATADAD---AVEAGEQPELLRIGARAR--TSAATLQVLAQARSGDTQGAKR 329 Query: 559 EFIRLIELA 567 ++ A Sbjct: 330 SLREAVDWA 338 >UniRef50_B0UK93 LPXTG-motif cell wall anchor domain protein n=1 Tax=Methylobacterium sp. 4-46 RepID=B0UK93_METS4 Length = 761 Score = 147 bits (371), Expect = 1e-33, Method: Composition-based stats. Identities = 89/517 (17%), Positives = 166/517 (32%), Gaps = 66/517 (12%) Query: 20 GPQPENKESQQQQPSTPTEQQVLAAQQAAIKEAEQSAAAAKALAQQEVQQYSDKQALQGR 79 G E ++ T + + + ++ A Q + R Sbjct: 185 GRAAALTEQERPNLFTTSVANIGPGETVLVQIAFQQPVRLSGG----THALRLPLVVAPR 240 Query: 80 LQEAPTFARAAKAKATHIANPGTARYQQFDDNPVKQVAQNPLATFSLDVDTGSYANVRRF 139 AP + A P AR +P NP+ +R Sbjct: 241 YSPAPGLLQPAAEGPARDPVPDRARIAPPVLDPAVHGPVNPVTL---------TVTLRAG 291 Query: 140 LNQGLLPPPD-AVRVEEIVNYFPSDWDIKDKQSIPASKPIPFAMRYELAPA--------- 189 G + A+RVEE D P FA+ + AP+ Sbjct: 292 FPLGTVESATHAIRVEET----GPDSRRVTLADGPVPADRDFALTWRAAPSAAPAVGLFR 347 Query: 190 -PWNEQRTLLKVDILAKDRKSEELPASNLVFLIDTSGSMISDERLPLIQSSLKLLVKELR 248 E LL V + + ++ + F+ID SGSM + + ++SL + + L Sbjct: 348 ERVGEDEYLLAV-VTPPEGRAPARRPREVTFVIDNSGSM-AGASMRQAKASLLVALDRLG 405 Query: 249 EQDNIAIVTYAGDSRIALP---SISGSHKAEINAAIDSLDAEGSTNGGAGLELAYQQATK 305 D ++ + + P +H+ + +L+A G T L A A Sbjct: 406 PADRFNVIRFDDTMDLLFPAPVPADEAHRDAARRFVAALEARGGTEMLPPLRAAL--ADP 463 Query: 306 GFIKG-GINRILLATDGDFNVGIDDPKSIESMVKKQRESGVTLSTFGVGNSNYNEAMMVR 364 +G + +I+ TDG I + + I S + R L G+G++ N +M Sbjct: 464 HPEEGDRVRQIVFLTDGA----IGNEEQIFSAISAGRGRS-RLFMIGIGSAP-NGHLMTH 517 Query: 365 IADVGNGNYSYIDTLSEAQKVLNSEMRQMLITVAKDVKAQIEFNPAWVTEYRQIGYEKRQ 424 A++G G+Y+ I T+ + + + ++ V D+ A P R Sbjct: 518 AAELGGGSYTAIGTIDQVAERTAELLAKLESPVVTDLAAAFS-EPGVEATPRL------- 569 Query: 425 LRVEHFNNDNVDAGDIGAGKHITLLFELTLNGQ----KASIDKLRYAPDNKLAKSDKTKE 480 D+ G+ + L L + I + + LA++ + Sbjct: 570 ------------LPDLYRGEPVVLAARLREATGTLTLRGRIGEAPWQQVLTLAEAREGSG 617 Query: 481 LAWLKIRWKYPQGKESQLVEFPLGPTINAPSEDMRFR 517 ++ L R K + + ++L +A + Sbjct: 618 ISKLWARAKIGEAETARLTGRMSAEAADAAILRLALA 654 >UniRef50_A9WKF3 von Willebrand factor type A n=3 Tax=Chloroflexus RepID=A9WKF3_CHLAA Length = 446 Score = 147 bits (371), Expect = 1e-33, Method: Composition-based stats. Identities = 60/240 (25%), Positives = 105/240 (43%), Gaps = 15/240 (6%) Query: 232 RLPLIQSSLKLLVKELREQDNIAIVTYAGDSRIALPSISGSHKAEINAAIDSLDA---EG 288 + +L L++ L D + ++ A D+ + I GS +AE+ AAI L A Sbjct: 98 PIDYTTHALHSLIERLDHNDRLGLIACASDAIVLASGIPGSRRAELVAAIARLPALRLGE 157 Query: 289 STNGGAGLELAYQQATKGFIKGGINRILLATDGDFNVGIDDPKSIESMVKKQRESGVTLS 348 +TN GL+LA Q + RI+L TDG D ++ ++ G++LS Sbjct: 158 TTNLAQGLQLALAQFVAA-DDATVRRIVLITDGF----TTDQTLCLTLAREAAARGISLS 212 Query: 349 TFGVGNSNYNEAMMVRIADVGNGNYSYIDTLSEAQKVLNSEMRQMLITVAKDVKAQIEFN 408 T G+G S + E ++ ++AD+ G S++ ++ ++ +E+ T A+ + Q Sbjct: 213 TIGLGGS-FEEHLLTQLADLSGGRASFVYDAADIPAIIAAELESARQTTAQALTLQCNL- 270 Query: 409 PAWVTEYRQIGYEK-----RQLRVEHFNNDNVDAGDIGAGKHITLLFELTLNGQKASIDK 463 P V+ R I L EH + GD+ G+ + LL E + A + Sbjct: 271 PQTVSLRRIIRLTPALTVLNPLSTEHGRRLTIHLGDLRHGEEVRLLVEFLVAPGAAGQQR 330 >UniRef50_D2V0V6 Predicted protein n=1 Tax=Naegleria gruberi RepID=D2V0V6_NAEGR Length = 502 Score = 147 bits (370), Expect = 1e-33, Method: Composition-based stats. Identities = 77/375 (20%), Positives = 139/375 (37%), Gaps = 60/375 (16%) Query: 214 ASNLVFLIDTSGSMIS-------DERLPLIQSSLKLLVKE-LREQDNIAIVTYAGDSRIA 265 N+ ++D SGSM +L +S+++ LV L +D I ++TY+ + Sbjct: 72 PVNICLVLDISGSMDEPLKNRSKGSKLTACKSAIRELVTNFLTYKDTIHLITYSDSPKTV 131 Query: 266 LPSISGSHKAEIN-AAIDSLDAEGSTNGGAGLELAYQQATKGFIKGGINRILLATDGDFN 324 + +K +N ID + EGSTN + L A G I +DG N Sbjct: 132 F---TEKNKESVNLNDIDKISTEGSTNIASALHSAVDLLHNSNAPG-TKLIAFFSDGQCN 187 Query: 325 VGIDDPKSIESMV-------KKQRESGVTLSTFGVGNSNYNEAMMVRIADVGNGNYSYID 377 VG + S + + ++ + +S++GVG S+Y+E + IA G G Y Y++ Sbjct: 188 VGETNLNIFGSGLLKKLKDYSEGKDDQIHISSYGVG-SDYDELWLQAIARTGKGEYYYLE 246 Query: 378 TLSEAQKVLNSEMRQMLITVAKDVKAQIE-FNPAWVTEYRQIGYEKRQLRVEHFNNDNVD 436 + A+ +++ + K + N V + N D Sbjct: 247 DETYAKDAFERSLKKYKYQIGKKFNVTVSGLNGVTVKSF----------------NRKSD 290 Query: 437 AGDIGAGKHITLLF--ELTLNGQKASIDKLRYAPDNKLAKSDKTKELAWLKIRWKYPQGK 494 + +G + LF +L + A L + D L LK+ + Y Sbjct: 291 LNSLISGVSLGGLFCRDLRFIEGVLVYNPQNSATLEALKQLDSNNGLPMLKVDYSYISNT 350 Query: 495 ESQLVEFPLGPTINAPSEDMRFRAAVAAYGQKLR---GSEYLNNTSWQ-QIKQWAQQAKG 550 + Q+V+ RF + + +L+ S N+ + Q I A Sbjct: 351 DEQIVKSVNYIH-------TRFASNTSELSTQLKLYETSAERNDFTVQNSILSIAD---- 399 Query: 551 EDPQGYRAEFIRLIE 565 + +F +L+E Sbjct: 400 -----LQVQFNQLLE 409 >UniRef50_A4C730 Putative uncharacterized protein n=1 Tax=Pseudoalteromonas tunicata D2 RepID=A4C730_9GAMM Length = 684 Score = 147 bits (370), Expect = 1e-33, Method: Composition-based stats. Identities = 51/262 (19%), Positives = 103/262 (39%), Gaps = 19/262 (7%) Query: 192 NEQRTLLKVDILAKDRKSEELPASNLVFLIDTSGSMISDERLPLIQSSLKLLVKELREQD 251 N+ L+ + + + + + ++F+IDTSGSM E L +S+L + L QD Sbjct: 307 NDDYALVMLMPPSDEFIAAQRLPREVIFVIDTSGSMHG-ESLEQAKSALFFALANLDPQD 365 Query: 252 NIAIVTYAGDSRIALPSISGSHKAEINAA---IDSLDAEGSTNGGAGLELAYQQATKGFI 308 + I+ + ++ I A + L A+G T G E Q Sbjct: 366 SFNIIEFNSKVNALNAQALPANDFNIRRARNFVYGLKADGGTEIGLAFE---QVLDNSEH 422 Query: 309 KGGINRILLATDGDFNVGIDDPKSIESMVKKQRESGVTLSTFGVGNSNYNEAMMVRIADV 368 + +I+ TDG I + + + +K + T G+G++ N M R A + Sbjct: 423 ADYLRQIVFLTDG----SISNETEVFAQIKGSLGDS-RIFTIGIGSAP-NSYFMTRAATL 476 Query: 369 GNGNYSYIDTLSEAQKVLNSEMRQMLITVAKDVKAQ------IEFNPAWVTEYRQIGYEK 422 G G +++I +++ Q+ + + Q+ K++ ++F P + + Sbjct: 477 GRGTFTFIGDVTDVQRTMKNLFVQLANAALKELIITDENGDALDFWPKPIADLYFNQAMM 536 Query: 423 RQLRVEHFNNDNVDAGDIGAGK 444 +++ N G G+ Sbjct: 537 VAIKLNAGQNQINVRGQQAFGQ 558 >UniRef50_D0LP28 von Willebrand factor type A n=1 Tax=Haliangium ochraceum DSM 14365 RepID=D0LP28_HALO1 Length = 523 Score = 146 bits (369), Expect = 2e-33, Method: Composition-based stats. Identities = 95/396 (23%), Positives = 153/396 (38%), Gaps = 31/396 (7%) Query: 135 NVRRFLNQGLLPPPDAVRVEEIVNY--FPSDWDIKDKQSIPASKPIPFAMRYELAPAPWN 192 R + +G +PP +A VE + + P D D + +APA Sbjct: 58 FARDLIAEGRVPPAEAFLVEAMFSEHDLPVAGDACDSM-------LCLRSSLAVAPALDG 110 Query: 193 EQRTLLKVDILAK-DRKSEELPASNLVFLIDTSGSMISDERLPLI------QSSLKLLVK 245 L+V + + D + E P+ +V +D SGSM + ++ L LV Sbjct: 111 TPTGWLQVGMSSTIDPATFERPSLTIVATVDVSGSMGWGYADDQVSAGSLTRNLLGALVD 170 Query: 246 ELREQDNIAIVTYAGDSRIALPSISGSHKAEINAAIDSLDAEGSTNGGAGLELAYQQATK 305 +L +D IAIVTY AL S K EI+ AID L GSTN AGL+ AY A++ Sbjct: 171 QLGPEDRIAIVTYGSRVDTALTLRSAGQKDEIHTAIDKLSEAGSTNMEAGLQRAYAIASE 230 Query: 306 GFIKGGI--NRILLATDGDFNVGIDDPKSIESMVKKQRESGVTLSTFGVGNSNYNEAMMV 363 G RI+L TD NVG E+M + +SGV L+ FG+ + +M Sbjct: 231 AAADGETDSTRIMLFTDVQPNVGATGASQFEAMASEGADSGVGLTVFGL-GLGLGQELMT 289 Query: 364 RIADVGNGNYSYIDTLSEAQKVLNSEMRQMLITVAKDVKAQIEFNPAWVTEYRQIGYEKR 423 ++ + GN + +++ + + +A D++ + P ++ G+ + Sbjct: 290 AMSHLRGGNAFSLTRHESVGELIEDDWPWLASPIAYDLEVALA-APEGLSIRESYGFPEG 348 Query: 424 QLRVEHFNNDNVDAGDIGAGKHITLLFELTLNGQKASIDKLRYAPDNKLAKSDKTKELAW 483 F V + + LL L + D + D L Sbjct: 349 SEESAGFEVSTV----FLSKRKGALLISLQPGDAASEEDGTEGDGADAGEGEDAASALDS 404 Query: 484 LKI----RWKYPQGKESQLVEFPLGPTINAPSEDMR 515 + R+ P G+ VE L + + D R Sbjct: 405 FAVSGLLRYTTPAGEP---VENTLSASYAGEALDAR 437 >UniRef50_B4VT64 Vault protein inter-alpha-trypsin n=9 Tax=Cyanobacteria RepID=B4VT64_9CYAN Length = 1037 Score = 146 bits (368), Expect = 2e-33, Method: Composition-based stats. Identities = 56/256 (21%), Positives = 110/256 (42%), Gaps = 21/256 (8%) Query: 202 ILAKDRKSEELPASNLVFLIDTSGSMISDERLPLIQSSLKLLVKELREQDNIAIVTYAGD 261 I A + + E+ ++VFL+DTSGS S + + ++ ++ L QD I+ +A Sbjct: 652 IPAIEYQQNEIVPKDVVFLVDTSGS-QSGSPIVQSKELMRQFIQGLNPQDTFTIIDFANS 710 Query: 262 SRIALP---SISGSHKAEINAAIDSLDAEGSTNGGAGLELAYQQATKGFIKGGINRILLA 318 + + + ++ + I+ LDA G T G++ G + ++L Sbjct: 711 TTQLSDKPLANTPQNRKKALNYINRLDANGGTELMNGIDTVLN--FPAAPAGRLRSVVLL 768 Query: 319 TDGDFNVGIDDPKSIESMVKKQRESGVTLSTFGVGNSNYNEAMMVRIADVGNGNYSYIDT 378 TDG I D + I + ++ + + G L +FGVG+S N ++ R+A++G G + Sbjct: 769 TDGL----IGDDEQIIAEIRDRLKPGNRLYSFGVGSST-NRFLIERLAELGRGTAEVVP- 822 Query: 379 LSEAQKVLNSE-MRQMLITVAKDVKAQI-------EFNPAWVTEYRQIGYEKRQLRVEHF 430 +E+ +V+ E +++ V +++ EF P V + R Sbjct: 823 PNESAEVVAQEFFQEINNPVLTNIQVSWEGTGNAPEFYPQKVRDLFANQPLVVFGRKGDR 882 Query: 431 NNDNVDA-GDIGAGKH 445 N + G + G+ Sbjct: 883 TNGKLKISGTVAGGQP 898 >UniRef50_B4VXM6 Vault protein inter-alpha-trypsin n=1 Tax=Microcoleus chthonoplastes PCC 7420 RepID=B4VXM6_9CYAN Length = 928 Score = 145 bits (365), Expect = 5e-33, Method: Composition-based stats. Identities = 51/208 (24%), Positives = 91/208 (43%), Gaps = 11/208 (5%) Query: 202 ILAKDRKSEELPASNLVFLIDTSGSMISDERLPLIQSSLKLLVKELREQDNIAIVTYAGD 261 I A + +L ++VFLIDTSGS S E L Q ++ + L D I+ ++ Sbjct: 413 IPAIEYNPHQLVPKDVVFLIDTSGS-QSGEPLNKCQELMRRFINGLNPHDTFTIIDFSDT 471 Query: 262 SRIALP---SISGSHKAEINAAIDSLDAEGSTNGGAGLELAYQQATKGFIKGGINRILLA 318 +R P + + ++ I+ L+A G T G++ G + I+L Sbjct: 472 TRQLSPVPLANTVQNRNSAMNYINQLNASGGTQLRRGIQAVLNFPE--VDPGRLRSIVLL 529 Query: 319 TDGDFNVGIDDPKSIESMVKKQRESGVTLSTFGVGNSNYNEAMMVRIADVGNGNYSYIDT 378 TDG I + I + V++ + G L +FG G+S N ++ RIA++G G + Sbjct: 530 TDGY----IGNENQILAEVQRHLKLGNRLHSFGAGSS-VNRFLLNRIAEIGRGISRIVRY 584 Query: 379 LSEAQKVLNSEMRQMLITVAKDVKAQIE 406 ++V Q+ V +++ + Sbjct: 585 DEPTEEVAEQFFGQINNPVLTNIQLYWQ 612 >UniRef50_A3W9L9 Putative uncharacterized protein n=2 Tax=Erythrobacter RepID=A3W9L9_9SPHN Length = 740 Score = 145 bits (365), Expect = 6e-33, Method: Composition-based stats. Identities = 51/245 (20%), Positives = 99/245 (40%), Gaps = 33/245 (13%) Query: 208 KSEELPASNLVFLIDTSGSMISDERLPLIQSSLKLLVKELREQDNIAIVTYAGDSRIALP 267 + E P ++F+ID SGSM + E +P + SL ++ LR QD ++ + Sbjct: 337 RVGEAPPREMIFVIDNSGSM-AGESMPAARRSLLYALETLRPQDRFNVIRFDDTMTELFA 395 Query: 268 S---ISGSHKAEINAAIDSLDAEGSTNGGAGLELAYQQATKGFIKGGINRILLATDGDFN 324 S S S+ A +L A G T L A + +++ TDG Sbjct: 396 SAVQASDSNIAAAKTFTHNLMANGGTEMLPALRAA---LRDRAPDERVRQVIFLTDGA-- 450 Query: 325 VGIDDPKSIESMVKKQRESGVTLSTFGVGNSNYNEAMMVRIADVGNGNYSYIDTLSEAQK 384 + + + + + R+ + G+G++ N +M R+A+ G G ++++ EA+ Sbjct: 451 --LSNEADMMEEINRNRKDS-RVFMVGIGSAP-NTYLMRRMAEAGRGTFTHVGMGEEAED 506 Query: 385 VLNSEMRQMLITVAKDVKAQIEFNPAWVTEYRQIGYEKRQLRVEHFNNDNVDAGDIGAGK 444 + + ++ + VA + A +E + + D D+ AG+ Sbjct: 507 QMQRLLDRLSLPVATGLTANVE--------------------GGNIDFAPRDLPDLYAGE 546 Query: 445 HITLL 449 + LL Sbjct: 547 PLVLL 551 >UniRef50_P79263 Inter-alpha-trypsin inhibitor heavy chain H4 n=3 Tax=Theria RepID=ITIH4_PIG Length = 921 Score = 144 bits (364), Expect = 7e-33, Method: Composition-based stats. Identities = 55/281 (19%), Positives = 112/281 (39%), Gaps = 28/281 (9%) Query: 207 RKSEELPASNLVFLIDTSGSMISDERLPLIQSSLKLLVKELREQDNIAIVTYAGDS-RIA 265 + N++F+IDTSGSM ++ + +L ++ +L +D +V+++G++ R Sbjct: 263 PEVWSAIPKNVIFVIDTSGSMRG-RKIQQTREALIKILGDLGSRDQFNLVSFSGEAPRRR 321 Query: 266 LPSISGSHKAEINAAIDSLDAEGSTNGGAGLELAYQQATKGFIKG-----GINRILLATD 320 + S + E + + A+G TN + +A Q + + + I+L TD Sbjct: 322 AVAASAENVEEAKSYAAEIHAQGGTNINDAMLMAVQLLERANREELLPARSVTFIILLTD 381 Query: 321 GDFNVGIDDPKSIESMVKKQRESGVTLSTFGVGNSNYNEAMMVRIADVGNGNYSYI-DTL 379 GD VG +P I+ V++ + +L G G + A + ++A G I + Sbjct: 382 GDPTVGETNPSKIQKNVREAIDGQHSLFCLGFGF-DVPYAFLEKMALENGGLARRIYEDS 440 Query: 380 SEAQKV--LNSEMRQMLITVAKDVKAQIEFNPAWVTEY-----RQIGYEKRQLRVEHFNN 432 A ++ E+ L+ + E+ V E R + + Sbjct: 441 DSALQLEDFYQEVANPLLRL-----VAFEYPSNAVEEVTQDNFRLFFKGSELVVAGKLRD 495 Query: 433 DNVDA------GDIGAGKHITLLFELTLNGQKASIDKLRYA 467 + D G + +++T + E + Q+A +Y Sbjct: 496 QSPDVLSAKVRGQLHM-ENVTFVMESRVAEQEAEFLSPKYI 535 >UniRef50_Q8YNZ7 Alr4412 protein n=6 Tax=Cyanobacteria RepID=Q8YNZ7_ANASP Length = 820 Score = 144 bits (363), Expect = 1e-32, Method: Composition-based stats. Identities = 53/208 (25%), Positives = 93/208 (44%), Gaps = 11/208 (5%) Query: 202 ILAKDRKSEELPASNLVFLIDTSGSMISDERLPLIQSSLKLLVKELREQDNIAIVTYAGD 261 I A + +++ ++VFLIDTSGS + L Q ++ + L D +IV ++ Sbjct: 286 IPAIQYRQDQVVPKDVVFLIDTSGSQMG-APLMQCQELMRRFINGLNPDDTFSIVDFSDT 344 Query: 262 SRIALP---SISGSHKAEINAAIDSLDAEGSTNGGAGLELAYQQATKGFIKGGINRILLA 318 +R P + + ++ I+ L A G T G+ G + I+L Sbjct: 345 TRQLSPVPLANNAQNRTRAINYINQLSANGGTEMLRGIRAVLN--FPVTDPGRLRSIVLL 402 Query: 319 TDGDFNVGIDDPKSIESMVKKQRESGVTLSTFGVGNSNYNEAMMVRIADVGNGNYSYIDT 378 TDG I + I + V++ +SG L +FG G+S N ++ RIA++G G I Sbjct: 403 TDGY----IGNENQILAEVQQHLKSGNRLYSFGAGSS-VNRFLLNRIAELGRGIAQIIRH 457 Query: 379 LSEAQKVLNSEMRQMLITVAKDVKAQIE 406 ++++ RQ+ V ++ Q E Sbjct: 458 DEPTDEIVDKFYRQINNPVLANINLQWE 485 >UniRef50_Q2QZN4 von Willebrand factor type A domain containing protein n=2 Tax=Oryza sativa Japonica Group RepID=Q2QZN4_ORYSJ Length = 574 Score = 144 bits (362), Expect = 1e-32, Method: Composition-based stats. Identities = 67/427 (15%), Positives = 145/427 (33%), Gaps = 69/427 (16%) Query: 182 MRYELAPAPWNEQRTLLKVDILAKDRKSEELPASNLVFLIDTSGSMISDE---------R 232 +R +A + +L K E+ +LV ++D SGSM +E R Sbjct: 71 IRAAIARDQRKDDFEVLVTVEAPKVVAPEKRAPIDLVAVLDVSGSMNKEEFVRGKHMSSR 130 Query: 233 LPLIQSSLKLLVKELREQDNIAIVTYAGDS--RIALPSISGSHKAEINAAIDSLDAEGST 290 L L++ ++K ++K +R+ D +AIV++ L S + ++ +D L A G+T Sbjct: 131 LDLLKIAMKYIIKLVRDADRLAIVSFNHAVVSEYGLTRNSADSRKKLENLVDKLKASGNT 190 Query: 291 NGGAGLELAYQQATKGFIKG--------------------GINRILLATDGDFNVGID-- 328 + L+ A + IK + ILL +DG Sbjct: 191 DFRPALKKAVEDMNIQNIKNSSAYNNFQILDGRGKEEKKKRVGFILLLSDGVDQFQYSRI 250 Query: 329 -----------DPKSIESMVKKQRESGVTLSTFGVGNSNYNEAMMVRIADVGNGNYSYI- 376 D + +M++K + TFG +++++ + +I+ + G YS++ Sbjct: 251 NWEKVAKSTDVDHSEVGAMLRKYA-----VHTFGF-SASHDPVPLRQISALSYGLYSFVC 304 Query: 377 DTLSEAQKVLNSEMRQMLITVAKDVKAQIEF---------NPAWVTEYRQIGYEKRQLRV 427 L + + + VA +++ ++ P + GYE + + Sbjct: 305 KNLDNITEAFARCLGGLRTVVAAEIRVDLKPKSSDKQQQQQPVLIKSIDSGGYESQVI-- 362 Query: 428 EHFNNDNVDAGDIGAGKHITLLFELTLNGQKASIDKLRYAPDNKLAKSDKTKELAWLKIR 487 + + + + + L + A+ + + ++I+ Sbjct: 363 GGGTSGKILIPVLYVDEVKKFIVHLKVPKVSATTVN-NQQEILTADGDANSVDGKTVRIK 421 Query: 488 W-----KYPQGKESQLVEFPLGPTINAPSEDMRFRAAVAAYGQKLRGSEYLNNTSWQQIK 542 + P Q P + + V Q+ R ++ + + Sbjct: 422 EHKLAIRRPPEVVDQADLRPAPQVVEQVVV-FKLLDMVPKTFQQRREDDHKGVKNTKVAV 480 Query: 543 QWAQQAK 549 Q Q+ Sbjct: 481 QLLQRNM 487 >UniRef50_Q2R0C4 Expressed protein n=2 Tax=Oryza sativa Japonica Group RepID=Q2R0C4_ORYSJ Length = 629 Score = 144 bits (362), Expect = 1e-32, Method: Composition-based stats. Identities = 70/418 (16%), Positives = 140/418 (33%), Gaps = 38/418 (9%) Query: 138 RFLNQGLLPPPDAVRVEEIVNYFPSDWDIKDKQSIPASKPIPFAMRYELAPAPWNEQRTL 197 + L P ++ + P+ + S AS + + + P E Sbjct: 40 SLIISSELVPTLLFALDLQIQLGPTMYINLAPVSALASDKVQLSTFPRVDAIPRRECHPR 99 Query: 198 LKVDIL-AKDRKSEELPASNLVFLIDTS---GSMISDERLPLIQSSLKLLVKELREQDNI 253 L V + A + +LV L+D S G RL L++ ++ L++ L D + Sbjct: 100 LPVLVRVAVPATAARRAPVDLVTLLDISCGGGGGAPARRLDLLRKAMDLVIGNLGADDRL 159 Query: 254 AIVTYAGDS--RIALPSISGSHKAEINAAIDSLDAEGSTNGGAGLELAYQQATKG---FI 308 AIV + L +S + + + SL G T L A + Sbjct: 160 AIVPFHSSVVDATGLLEMSVEGRGVASRKVQSLAVAGGTKLFPALNAAVEILEARCWEAK 219 Query: 309 KGGINRILLATDGDFNVGIDDPKSIESMVKKQRESGVTLSTFGVGNSNYNEAMMVRIADV 368 + + ++L +DGD ++ ++ + FG ++ + +AD Sbjct: 220 RERVGAVVLISDGDD----------RTIFREAINPRYPVHAFGF-RGAHDARAVHHVADH 268 Query: 369 GNGNYSYIDTLSE-AQKVLNSEMRQMLITVAKDVKAQIEFNP---AWVTEYRQIGYEKRQ 424 +G Y +D + + +R++ VA D + + A + + G + R Sbjct: 269 TSGVYGVLDDEHDRVTDAFAACVRRVTSVVAVDAQVDLTCGAYSRASLLAVERSG-DHRA 327 Query: 425 LRVEHFNNDNVDAGDIGAGKHITLLFELTLNGQKASIDKLRY---------APDNKLAKS 475 E + + AG + AG L + ++ + A K Sbjct: 328 HVDEDRRSGFIYAGALCAGDVKNFLVYVDVDREADGGGVTELLTAHGTYMDAARRKETTV 387 Query: 476 DKTKELAWLKIRWKYPQGKESQLVEFPLGPTINAPS---EDMRFRAAVAAYGQKLRGS 530 + +A ++ R K P E T+ + + + + + AA +LR Sbjct: 388 HLDERMAVVQRRDKVPDVSRDVAAELVRVDTVKMVAVVLDRFKDKGSAAA-AMELREG 444 >UniRef50_B4DPQ4 cDNA FLJ60769, highly similar to Inter-alpha-trypsin inhibitor heavy chain H3 n=11 Tax=Tetrapoda RepID=B4DPQ4_HUMAN Length = 698 Score = 143 bits (360), Expect = 2e-32, Method: Composition-based stats. Identities = 53/256 (20%), Positives = 105/256 (41%), Gaps = 18/256 (7%) Query: 177 PIPFAMRYELAPAPWNEQRTLLKVDILAKDRKSEELPASNLVFLIDTSGSMISDERLPLI 236 F + Y++ + + + + + N+ F+ID SGSM + +L Sbjct: 245 NGDFTITYDVNRESPGNVQIVNGYFVHFFAPQGLPVVPKNVAFVIDISGSM-AGRKLEQT 303 Query: 237 QSSLKLLVKELREQDNIAIVTYAGDSRI---ALPSISGSHKAEINAAIDSLDAEGSTNGG 293 + +L ++++++E+D + ++GD L + + E + S++ +G TN Sbjct: 304 KEALLRILEDMKEEDYLNFTLFSGDVSTWKEHLVQATPENLQEARTFVKSMEDKGMTNIN 363 Query: 294 AGLELAYQQATKGFIKGGINR-----ILLATDGDFNVGIDDPKSIESMVKKQRESGVTLS 348 GL K + I +++ TDGD NVG P+ I+ V+ L Sbjct: 364 DGLLRGISMLNKAREEHRIPERSTSIVIMLTDGDANVGESRPEKIQENVRNAIGGKFPLY 423 Query: 349 TFGVGNSNYNEAMMVRIADVGNGNYSYI--DTLSEAQ-KVLNSEMRQMLITVAKDVKAQI 405 G GN N N + +A +G I D+ ++ Q + E+ L+T ++ Sbjct: 424 NLGFGN-NLNYNFLENMALENHGFARRIYEDSDADLQLQGFYEEVANPLLTG-----VEM 477 Query: 406 EFNPAWVTEYRQIGYE 421 E+ + + Q Y+ Sbjct: 478 EYPENAILDLTQNTYQ 493 >UniRef50_Q6PGW2 Zgc:112265 protein (Fragment) n=15 Tax=Clupeocephala RepID=Q6PGW2_DANRE Length = 927 Score = 143 bits (360), Expect = 2e-32, Method: Composition-based stats. Identities = 64/299 (21%), Positives = 119/299 (39%), Gaps = 22/299 (7%) Query: 119 NPLATFSLDVD-----TGSYANVRRFLNQGLLPPPDAVRV-----EEIVNYFP-SDWDIK 167 P+A F +DV S+ V+ LN G L AV+ + V ++P D K Sbjct: 169 QPVADFKIDVHIQENPGISFLEVKGDLNTGDL--ASAVKTTRADKDAWVTFYPTRDQQTK 226 Query: 168 DKQSIPASKPIPFAMRYELAPA-PWNEQRTLLKVDILAKDRKSEELPASNLVFLIDTSGS 226 + Y++ P E + + N+VF+ID SGS Sbjct: 227 CTNCAENGLNGDLIITYDVNRGNPKGEVQISNGYFVHYFAPSDVPHIPKNVVFIIDRSGS 286 Query: 227 MISDERLPLIQSSLKLLVKELREQDNIAIVTYA---GDSRIALPSISGSHKAEINAAIDS 283 M ++ +S+L ++K+L E D+ ++T+ R L + +++ + + Sbjct: 287 MHG-RKIRQTRSALLTILKDLDEDDHFGLITFDAEIDFWRRELLQATKANRENAESFVKR 345 Query: 284 LDAEGSTNGGAGLELAYQQATKGFIKGGINRILLATDGDFNVGIDDPKSIESMVKKQRES 343 + G+TN + + KG + ++L TDGD G + + I + VK+ S Sbjct: 346 IQDRGATNINDAVLAGVDMINRNPRKGTASILILLTDGDPTAGETNIEKIMANVKEAIGS 405 Query: 344 GVTLSTFGVGNSNYNEAMMVRIADVGNGNYSYI--DTLSEAQ-KVLNSEMRQMLITVAK 399 L G G + N + +++ N I D+ ++ Q + E+ L+T + Sbjct: 406 KFPLYCLGFG-YDVNFDFLTKMSLENNAVARRIYEDSDADIQLQGFYDEVAVPLLTDIQ 463 >UniRef50_Q3IHK0 Putative uncharacterized protein n=2 Tax=Alteromonadales RepID=Q3IHK0_PSEHT Length = 664 Score = 143 bits (360), Expect = 2e-32, Method: Composition-based stats. Identities = 44/216 (20%), Positives = 88/216 (40%), Gaps = 13/216 (6%) Query: 192 NEQRTLLKVDILAKDRKSEELPASNLVFLIDTSGSMISDERLPLIQSSLKLLVKELREQD 251 E+ L + A + + + A VF++DTSGSM + + +++L + L D Sbjct: 299 GERYGLAMLMPPADNFIATQRLARETVFVVDTSGSMHG-QSMEQAKNALFYALSLLDSND 357 Query: 252 NIAIVTYAGDSRIALPS---ISGSHKAEINAAIDSLDAEGSTNGGAGLELAYQQATKGFI 308 + I+ + + SG + I L A+G T L+ Sbjct: 358 SFNIIGFDNVVTLMSDKPLVASGFNLRRAERFIYGLQADGGTEIQGALDA---VLDGSQF 414 Query: 309 KGGINRILLATDGDFNVGIDDPKSIESMVKKQRESGVTLSTFGVGNSNYNEAMMVRIADV 368 G + +++ TDG + + ++ ++ + L T G+G++ N M R ADV Sbjct: 415 DGFVRQVIFLTDG----SVSNEDALFKSIQAKLGDS-RLFTVGIGSAP-NSFFMRRAADV 468 Query: 369 GNGNYSYIDTLSEAQKVLNSEMRQMLITVAKDVKAQ 404 G G++++I + SE Q + ++ ++ Sbjct: 469 GKGSFTFIGSTSEVQPKMQQLFDKLAHPAITNLALS 504 >UniRef50_A3ZR58 Putative uncharacterized protein n=1 Tax=Blastopirellula marina DSM 3645 RepID=A3ZR58_9PLAN Length = 1032 Score = 142 bits (359), Expect = 3e-32, Method: Composition-based stats. Identities = 37/195 (18%), Positives = 89/195 (45%), Gaps = 9/195 (4%) Query: 207 RKSEELPASNLVFLIDTSGSMISDERLPLIQSSLKLLVKELREQDNIAIVTYAGDSRIAL 266 R ++ +P L+ ++D SGSM E++ + Q + ++ + D ++ + ++ + Sbjct: 449 RDAKVVPVGALMLVLDKSGSM-QGEKMQMTQGAALAAIRAMGAADFAGVIGFDSQAQRIV 507 Query: 267 PSISGSHKAEINAAIDSLDAEGSTNGGAGLELAYQQATKGFIKGGINRILLATDGDFNVG 326 P + A + L A G TN G+ L ++ + G+ +++ +DG Sbjct: 508 PIRKVDNPGMFVAQVRKLSASGGTNMTPGVALGFRDLQN--VDAGVKHMIVLSDGQ---- 561 Query: 327 IDDPKSIESMVKKQRESGVTLSTFGVGNSNYNEAMMVRIADVGNGNYSYIDTLSEAQKVL 386 +P ++ + ++ G+T+S VG S+ ++ +M +A G G + ++ ++ Sbjct: 562 -TEPGNVAQIASDMKKMGMTVSAVAVG-SDADQKLMATVARNGGGKFYAVNNPKAIPRIF 619 Query: 387 NSEMRQMLITVAKDV 401 E R++ + K+ Sbjct: 620 MREARRVAQPLVKEA 634 Score = 55.1 bits (131), Expect = 8e-06, Method: Composition-based stats. Identities = 27/133 (20%), Positives = 54/133 (40%), Gaps = 11/133 (8%) Query: 217 LVFLIDTSGSMISDERLPLIQSSLKLLVKELRE--QDNIAIVTYAGDSRIALPSISGSHK 274 +++L+D S S+ SD+R +++ ++ + RE D + ++ + I P Sbjct: 84 VIYLLDQSQSIPSDQRQAMVEYVVREVSAHRREEQGDRVGVIVFGDQPAIEFPPTDAPLP 143 Query: 275 AEINAAIDSLDAEGSTNGGAGLELAYQQATKGFIKGGINRILLATDGDFNVGIDDPKSIE 334 A +L TN ++LA RI++ +DG+ +G P Sbjct: 144 PLKRLAALALVETDETNLAGAMQLA----QASLQSDSAGRIVIVSDGNQTIGDALP---- 195 Query: 335 SMVKKQRESGVTL 347 + + SGV + Sbjct: 196 -IARSLAASGVGI 207 >UniRef50_A1S752 Inter-alpha-trypsin inhibitor domain protein n=1 Tax=Shewanella amazonensis SB2B RepID=A1S752_SHEAM Length = 753 Score = 142 bits (359), Expect = 3e-32, Method: Composition-based stats. Identities = 56/274 (20%), Positives = 106/274 (38%), Gaps = 23/274 (8%) Query: 140 LNQGLLPPPDAVRVEEIVNYFPSDWDIKDKQSIP----ASKPIPFAMRYELAPAPWNEQR 195 +N+G P V + + + + + A + + N Sbjct: 326 INEGSEP------VADFLVQPGYSYPAAVESANTQYGTAQDKAKQDEYVRVDFSQGNYSH 379 Query: 196 TLLKVDILAKDRKSEELPASNLVFLIDTSGSMISDERLPLIQSSLKLLVKELREQDNIAI 255 LL + A LV +IDTSGSM D + +S+L + L QD+ I Sbjct: 380 GLLTF--MPPQPNLANRLARELVLVIDTSGSMAGD-SMVQARSALIHALGGLGPQDSFNI 436 Query: 256 VTYAGDSRIALP---SISGSHKAEINAAIDSLDAEGSTNGGAGLELAYQQ-ATKGFIKGG 311 + ++ D+R P + + + SL+A+G T + LELA + + Sbjct: 437 IAFSSDARPLWPDAKPATAFNLGAAQQFVRSLEADGGTEMASALELALKTPSVVDEDTKR 496 Query: 312 INRILLATDGDFNVGIDDPKSIESMVKKQRESGVTLSTFGVGNSNYNEAMMVRIADVGNG 371 + ++L TDG N ++ +++++ R L +G + N M R A G G Sbjct: 497 LRQVLFITDGAVNG----EDALFNLIER-RLGTSRLFPVAIGAAP-NGYFMSRAAAAGRG 550 Query: 372 NYSYIDTLSEAQKVLNSEMRQMLITVAKDVKAQI 405 ++++I E + +N + ++ V D+ Sbjct: 551 SFTFIGHGGEVAEKMNQLLSRIEHPVVSDLSVTW 584 >UniRef50_Q01UI0 von Willebrand factor, type A n=1 Tax=Candidatus Solibacter usitatus Ellin6076 RepID=Q01UI0_SOLUE Length = 837 Score = 142 bits (358), Expect = 3e-32, Method: Composition-based stats. Identities = 47/222 (21%), Positives = 94/222 (42%), Gaps = 13/222 (5%) Query: 192 NEQRTLLKVDILAKDRKSEELPASNLVFLIDTSGSMISDERLPLIQSSLKLLVKELREQD 251 + L+ + AK + +V +ID S SM ++ L + + +V+ LR D Sbjct: 370 GKPEDALERSLPAKLAPPRSPEGTAVVLIIDKSSSMEG-RKIELARLAAIGVVENLRPID 428 Query: 252 NIAIVTYAGDSRIALPSISGSHKAEINAAIDSLDAEGSTNGGAGLELAYQQATKGFIKGG 311 ++ ++ + + A+P +A I I + +G T L AYQ+ Sbjct: 429 SVGVLIFDNSFQWAVPIRKAEDRATIKKLISGITPDGGTQIAPALTEAYQRILPQT--AM 486 Query: 312 INRILLATDGDFNVGIDDPKSIESMVKKQRESGVTLSTFGVGNSNYNEAMMVRIADVGNG 371 I+L TDG I + ++ K+ + + VT+ST G+G + N A + ++A +G Sbjct: 487 YKHIVLLTDG-----ISEEGDSMTLTKEAQANHVTISTVGLGQ-DVNRAFLEKVASNADG 540 Query: 372 NYSYIDTLSEAQKVLNSEMRQMLITVAKDVKAQIEFNPAWVT 413 +++ S +++L ++ + A + P V Sbjct: 541 KAYFLNDPSGLEQLLLRDVEEHTGVTA----VEKAITPKVVK 578 >UniRef50_UPI0000E105CF vault protein inter-alpha-trypsin n=1 Tax=Glaciecola sp. HTCC2999 RepID=UPI0000E105CF Length = 757 Score = 142 bits (358), Expect = 4e-32, Method: Composition-based stats. Identities = 49/192 (25%), Positives = 83/192 (43%), Gaps = 12/192 (6%) Query: 216 NLVFLIDTSGSMISDERLPLIQSSLKLLVKELREQDNIAIVTYAGDSRIALPS---ISGS 272 N +F++D+SGSM L +++ V L E D IV + ++R Sbjct: 391 NTIFVLDSSGSMHGTA-LTQAIDAIREGVSYLTEHDTFNIVDFDSEARALWRQSQFADEV 449 Query: 273 HKAEINAAIDSLDAEGSTNGGAGLELAYQQATKGFIKGGINRILLATDGDFNVGIDDPKS 332 KAE + +D++G TN L L+ Q G+ +++ TDG N + + Sbjct: 450 SKAEAMRFLRHVDSDGGTNMQDALALSLTQLLDSST--GLTQVIFVTDGSIN----NERE 503 Query: 333 IESMVKKQRESGVTLSTFGVGNSNYNEAMMVRIADVGNGNYSYIDTLSEAQKVLNSEMRQ 392 + + +Q L T G+G + N M A +G G Y+YID L+E Q + Q Sbjct: 504 LLKQIAEQLGDK-RLFTVGIGAAP-NSHFMEYAAMLGKGTYTYIDDLTEIQPKMAYLFSQ 561 Query: 393 MLITVAKDVKAQ 404 + + D++ Sbjct: 562 LRSPMITDIQLT 573 >UniRef50_B0CG18 von Willebrand factor type A domain protein, putative n=5 Tax=Cyanobacteria RepID=B0CG18_ACAM1 Length = 708 Score = 142 bits (358), Expect = 4e-32, Method: Composition-based stats. Identities = 60/231 (25%), Positives = 103/231 (44%), Gaps = 19/231 (8%) Query: 202 ILAKDRKSEELPASNLVFLIDTSGSMISDERLPLIQSSLKLLVKELREQDNIAIVTYAGD 261 I A KS ++ ++VFLIDTSGS + + + + +L D +I+ ++ Sbjct: 328 IPALKYKSNQIVPKDVVFLIDTSGSQSGP-PIVQSRKLMTQFLDKLNPNDTFSIINFSNT 386 Query: 262 SRIALP---SISGSHKAEINAAIDSLDAEGSTNGGAGLELAYQQATKGFIKGGINRILLA 318 + P + + +++ + I LDA G T G+ A G + ++L Sbjct: 387 TSKLSPKPLANTPANRKKALEYIKKLDANGGTELMNGINTV--AAFPPAPDGRLRSVVLL 444 Query: 319 TDGDFNVGIDDPKSIESMVKKQRESGVTLSTFGVGNSNYNEAMMVRIADVGNGNYSYIDT 378 TDG I D ++I + V+ + + G + FGVG S N ++ R+A+VG G + Sbjct: 445 TDGL----IGDDETIIAAVRDRLKPGNRIYPFGVGFST-NRFLLDRLAEVGRGTVEVVAP 499 Query: 379 LSEAQKVLNSEMRQMLITVAKDVKAQIEFNPAWVTEYRQIGYEKRQLRVEH 429 A+KV + + T+ K V IE +WV + G + LRV Sbjct: 500 KDSAEKV----AAKFVQTINKPVLTDIEV--SWVGPGK--GPDIYPLRVPD 542 >UniRef50_Q8PU63 Putative chloride channel n=1 Tax=Methanosarcina mazei RepID=Q8PU63_METMA Length = 1004 Score = 142 bits (358), Expect = 4e-32, Method: Composition-based stats. Identities = 48/197 (24%), Positives = 92/197 (46%), Gaps = 11/197 (5%) Query: 215 SNLVFLIDTSGSMISDERLPLIQSSLKLLVKELREQDNIAIVTYAGDSRI--ALPSISGS 272 +N++ +ID SGSM S + ++S L + + +D +V+++ +R L +++ Sbjct: 313 ANVMLVIDRSGSM-SGSPISSAKNSANLFIDYMEAEDMAGVVSFSSSARYDYHLATLTPE 371 Query: 273 HKAEINAAIDSLDAEGSTNGGAGLELAYQQATKGFIKGGINRILLATDGDFNVGIDDPKS 332 K I I+S+ A G T G+G+ I+L +DG N G + Sbjct: 372 VKNSIKQKINSIYASGVTAIGSGMRYGLNDLLNYGDPNNPWAIVLLSDGYQNSGENPNNV 431 Query: 333 IESMVKKQRESGVTLSTFGVGNSNYNEAMMVRIADVGNGNYSYIDTLSEAQKVLNSEMRQ 392 I S+ + S + + T G+G + ++ ++ IAD G Y Y T S+ Q++ N + + Sbjct: 432 IPSI----KASNIQVYTVGLGPA-VDQKLLGNIADQTGGKYYYSPTDSQLQEIYNDIVGK 486 Query: 393 ML---ITVAKDVKAQIE 406 ++ ++VK I Sbjct: 487 IIGWKTVFKRNVKMFIH 503 Searching..................................................done Results from round 3 Score E Sequences producing significant alignments: (bits) Value Sequences used in model and found again: UniRef50_P76481 Uncharacterized protein yfbK n=38 Tax=Enterobact... 601 e-170 UniRef50_A3D1E9 von Willebrand factor, type A n=8 Tax=Shewanella... 488 e-136 UniRef50_B8F8Z5 von Willebrand factor type A n=1 Tax=Desulfatiba... 487 e-136 UniRef50_C7PNZ7 von Willebrand factor type A n=5 Tax=Bacteroidet... 474 e-132 UniRef50_Q4KKB4 von Willebrand factor type A domain protein n=20... 473 e-132 UniRef50_A4C5H3 Von Willebrand factor type A domain protein n=1 ... 472 e-131 UniRef50_C6VVX3 von Willebrand factor type A n=17 Tax=Bacteria R... 469 e-130 UniRef50_C6M483 von Willebrand factor type A domain protein n=1 ... 467 e-130 UniRef50_A5WCP1 von Willebrand factor, type A n=2 Tax=Moraxellac... 465 e-129 UniRef50_C5BTM6 von Willebrand factor type A domain protein n=1 ... 462 e-128 UniRef50_C8SEV7 von Willebrand factor type A n=4 Tax=Rhizobiales... 459 e-127 UniRef50_B1KPQ5 von Willebrand factor type A n=8 Tax=Gammaproteo... 458 e-127 UniRef50_B0CCM8 von Willebrand factor, type A domain protein n=4... 455 e-126 UniRef50_B0T5X0 von Willebrand factor type A n=1 Tax=Caulobacter... 454 e-126 UniRef50_A0NVX5 Putative uncharacterized protein n=1 Tax=Labrenz... 453 e-126 UniRef50_UPI00016C54C8 hypothetical protein GobsU_21830 n=1 Tax=... 453 e-126 UniRef50_A1ZHE2 Von Willebrand factor type A domain protein n=1 ... 452 e-125 UniRef50_C1AC65 Putative uncharacterized protein n=1 Tax=Gemmati... 443 e-122 UniRef50_C8N8N5 von Willebrand factor type A domain protein n=2 ... 440 e-122 UniRef50_A3PN61 von Willebrand factor, type A n=9 Tax=Rhodobacte... 440 e-122 UniRef50_C0YQB8 von Willebrand factor, type A n=2 Tax=Flavobacte... 438 e-121 UniRef50_A1ZU67 Von Willebrand factor type A domain protein n=1 ... 436 e-121 UniRef50_A9KS19 von Willebrand factor type A n=1 Tax=Clostridium... 436 e-120 UniRef50_A4BJU7 Putative uncharacterized protein n=1 Tax=Reineke... 436 e-120 UniRef50_C6BAR1 von Willebrand factor type A n=7 Tax=Alphaproteo... 434 e-120 UniRef50_C0CVB5 Putative uncharacterized protein n=1 Tax=Clostri... 434 e-120 UniRef50_Q2N8R4 Von Willebrand factor type A domain protein n=2 ... 433 e-120 UniRef50_B1ZYN3 von Willebrand factor type A n=2 Tax=Bacteria Re... 433 e-119 UniRef50_UPI000185CB41 protein containing von Willebrand factor ... 431 e-119 UniRef50_A9DBX8 Putative uncharacterized protein n=1 Tax=Hoeflea... 431 e-119 UniRef50_Q7UP85 Putative uncharacterized protein n=1 Tax=Rhodopi... 429 e-118 UniRef50_Q28U54 von Willebrand factor type A n=3 Tax=Rhodobacter... 429 e-118 UniRef50_A3ZT14 Putative uncharacterized protein n=1 Tax=Blastop... 429 e-118 UniRef50_Q21MJ3 von Willebrand factor, type A n=5 Tax=Proteobact... 428 e-118 UniRef50_A5F9T1 von Willebrand factor, type A n=9 Tax=Bacteria R... 428 e-118 UniRef50_UPI0001C375AE von Willebrand factor type A n=1 Tax=Rumi... 417 e-115 UniRef50_C0Z5D5 Putative uncharacterized protein n=1 Tax=Breviba... 416 e-114 UniRef50_A9IU52 Putative lipoprotein n=3 Tax=Bordetella RepID=A9... 413 e-113 UniRef50_A9AWD1 von Willebrand factor type A n=1 Tax=Herpetosiph... 410 e-113 UniRef50_B4WCU1 von Willebrand factor type A domain protein n=2 ... 409 e-112 UniRef50_D0XSR4 von Willebrand factor type A n=1 Tax=Brevundimon... 404 e-111 UniRef50_A9GVG2 Putative uncharacterized protein n=1 Tax=Sorangi... 402 e-110 UniRef50_A8SXC8 Putative uncharacterized protein n=2 Tax=Clostri... 397 e-109 UniRef50_B0UJ22 von Willebrand factor type A n=1 Tax=Methylobact... 393 e-108 UniRef50_Q1CVN5 von Willebrand factor type A domain protein n=2 ... 389 e-106 UniRef50_A6GHE4 von Willebrand factor, type A n=1 Tax=Plesiocyst... 389 e-106 UniRef50_Q1D6F9 von Willebrand factor type A domain protein n=2 ... 379 e-103 UniRef50_C7N770 Uncharacterized protein containing a von Willebr... 369 e-100 UniRef50_C7PR69 von Willebrand factor type A n=1 Tax=Chitinophag... 360 9e-98 UniRef50_A9NFD9 Surface-anchored VWFA domain protein n=1 Tax=Ach... 359 2e-97 UniRef50_B4DA43 von Willebrand factor type A n=1 Tax=Chthoniobac... 357 5e-97 UniRef50_C8WPE6 von Willebrand factor type A n=1 Tax=Eggerthella... 355 3e-96 UniRef50_A7C0I1 von Willebrand factor type A domain protein n=5 ... 332 3e-89 UniRef50_B4D1N7 Autotransporter-associated beta strand repeat pr... 323 1e-86 UniRef50_A6DLI7 von Willebrand factor type A domain protein n=1 ... 319 1e-85 UniRef50_A3TQT5 Putative secreted protein n=1 Tax=Janibacter sp.... 311 5e-83 UniRef50_D2BAS2 von Willebrand factor type A domain protein n=12... 307 5e-82 UniRef50_Q9M1S2 Putative uncharacterized protein T5N23_140 n=5 T... 304 5e-81 UniRef50_C1RGW7 Uncharacterized protein containing a von Willebr... 296 2e-78 UniRef50_B5JNR2 von Willebrand factor type A domain protein n=1 ... 293 8e-78 UniRef50_Q10RY0 Os03g0142500 protein n=2 Tax=Oryza sativa RepID=... 287 6e-76 UniRef50_C5WYU9 Putative uncharacterized protein Sb01g047460 n=3... 279 2e-73 UniRef50_B9GK57 Predicted protein (Fragment) n=2 Tax=Populus tri... 272 2e-71 UniRef50_Q9FF49 Retroelement pol polyprotein-like n=34 Tax=Magno... 267 8e-70 UniRef50_D1HHA4 Whole genome shotgun sequence of line PN40024, s... 261 5e-68 UniRef50_B1ZQD5 von Willebrand factor type A n=1 Tax=Opitutus te... 256 2e-66 UniRef50_Q9LN03 T6D22.13 n=2 Tax=Arabidopsis thaliana RepID=Q9LN... 251 6e-65 UniRef50_Q7G2L9 Os10g0464500 protein n=3 Tax=Magnoliophyta RepID... 251 7e-65 UniRef50_C5YHY2 Putative uncharacterized protein Sb07g005010 n=2... 249 2e-64 UniRef50_A5B5Z1 Putative uncharacterized protein n=2 Tax=Vitis v... 249 2e-64 UniRef50_Q8H924 Os10g0464900 protein n=3 Tax=Oryza sativa RepID=... 246 2e-63 UniRef50_Q235T9 von Willebrand factor type A domain containing p... 243 2e-62 UniRef50_C7PTX8 von Willebrand factor type A n=1 Tax=Chitinophag... 241 4e-62 UniRef50_Q6ZED8 Slr7060 protein n=2 Tax=Chroococcales RepID=Q6ZE... 238 5e-61 UniRef50_A9V370 Predicted protein n=1 Tax=Monosiga brevicollis R... 237 1e-60 UniRef50_A9GUK8 Putative uncharacterized protein yfbK n=1 Tax=So... 236 2e-60 UniRef50_A4J6Q3 von Willebrand factor, type A n=1 Tax=Desulfotom... 235 3e-60 UniRef50_Q3A188 von Willebrand factor type A domain protein n=1 ... 235 4e-60 UniRef50_C0PS55 Putative uncharacterized protein n=1 Tax=Picea s... 231 6e-59 UniRef50_B9SJS6 Protein binding protein, putative n=6 Tax=Magnol... 231 6e-59 UniRef50_A1ZUW0 Von Willebrand factor, type A n=1 Tax=Microscill... 230 1e-58 UniRef50_C5WZE3 Putative uncharacterized protein Sb01g019910 n=1... 230 2e-58 UniRef50_UPI00017450FB von Willebrand factor type A domain prote... 230 2e-58 UniRef50_C5WYV0 Putative uncharacterized protein Sb01g047470 n=3... 229 2e-58 UniRef50_UPI00006CDDCC von Willebrand factor type A domain conta... 226 1e-57 UniRef50_Q67LZ3 Putative uncharacterized protein n=1 Tax=Symbiob... 226 2e-57 UniRef50_Q6ZFR4 Zinc finger (C3HC4-type RING finger) protein fam... 225 4e-57 UniRef50_C1D2W8 Putative uncharacterized protein n=1 Tax=Deinoco... 224 9e-57 UniRef50_C1GWG1 von Willebrand factor type A domain containing p... 222 3e-56 UniRef50_A7HVH6 Vault protein inter-alpha-trypsin domain protein... 221 5e-56 UniRef50_Q21PJ3 von Willebrand factor, type A n=1 Tax=Saccharoph... 221 6e-56 UniRef50_Q9U7P4 Putative uncharacterized protein (Fragment) n=1 ... 221 6e-56 UniRef50_Q22SJ4 von Willebrand factor type A domain containing p... 220 1e-55 UniRef50_UPI0001926ED6 PREDICTED: similar to inter-alpha trypsin... 218 4e-55 UniRef50_Q1DFU7 von Willebrand factor type A domain protein n=2 ... 218 6e-55 UniRef50_Q2BJ22 Putative uncharacterized protein n=1 Tax=Neptuni... 217 1e-54 UniRef50_A6G857 Putative uncharacterized protein n=1 Tax=Plesioc... 216 2e-54 UniRef50_A1U6Y4 Vault protein inter-alpha-trypsin domain protein... 216 2e-54 UniRef50_Q7ULL3 Putative uncharacterized protein n=1 Tax=Rhodopi... 216 2e-54 UniRef50_Q8H923 Putative uncharacterized protein OSJNBa0071K18.1... 216 2e-54 UniRef50_A9QZI4 von Willebrand factor type A domain protein n=26... 216 2e-54 UniRef50_C7YL43 Putative uncharacterized protein n=2 Tax=Nectria... 215 3e-54 UniRef50_C7RNW6 von Willebrand factor type A n=1 Tax=Candidatus ... 215 3e-54 UniRef50_B6TZ81 Protein binding protein n=9 Tax=Magnoliophyta Re... 215 3e-54 UniRef50_A0E9B3 Chromosome undetermined scaffold_84, whole genom... 215 3e-54 UniRef50_B4X134 Vault protein inter-alpha-trypsin n=1 Tax=Alcani... 215 4e-54 UniRef50_C1XMC3 Uncharacterized protein containing a von Willebr... 214 8e-54 UniRef50_B5JSY7 Inter-alpha-trypsin inhibitor domain protein n=1... 214 8e-54 UniRef50_B8HNU4 von Willebrand factor type A n=4 Tax=Chroococcal... 213 2e-53 UniRef50_Q1YZ74 Inter-alpha-trypsin inhibitor domain protein n=1... 213 2e-53 UniRef50_C5YMJ6 Putative uncharacterized protein Sb07g002215 (Fr... 212 3e-53 UniRef50_A0C9G5 Chromosome undetermined scaffold_16, whole genom... 212 3e-53 UniRef50_C4WI90 Poly [ADP-ribose] polymerase 4 n=1 Tax=Ochrobact... 212 3e-53 UniRef50_Q1D9B7 von Willebrand factor type A domain protein n=3 ... 211 4e-53 UniRef50_D2LQW0 von Willebrand factor type A n=1 Tax=Bacillus ce... 211 4e-53 UniRef50_Q22ST4 von Willebrand factor type A domain containing p... 211 9e-53 UniRef50_A6R161 Predicted protein n=3 Tax=Onygenales RepID=A6R16... 210 1e-52 UniRef50_A0LPD4 von Willebrand factor, type A n=1 Tax=Syntrophob... 210 2e-52 UniRef50_A9FTM1 Putative uncharacterized protein n=1 Tax=Sorangi... 209 2e-52 UniRef50_A6X8G3 LPXTG-motif cell wall anchor domain protein n=11... 209 3e-52 UniRef50_D0LL92 von Willebrand factor type A n=1 Tax=Haliangium ... 208 4e-52 UniRef50_A0DZ93 Chromosome undetermined scaffold_7, whole genome... 208 7e-52 UniRef50_Q2SQR4 Uncharacterized protein containing a von Willebr... 206 1e-51 UniRef50_Q8LQ58 Os01g0640200 protein n=9 Tax=Poaceae RepID=Q8LQ5... 206 2e-51 UniRef50_Q231J4 von Willebrand factor type A domain containing p... 206 2e-51 UniRef50_B4W304 von Willebrand factor type A domain protein (Fra... 205 3e-51 UniRef50_A0CDA1 Chromosome undetermined scaffold_17, whole genom... 205 3e-51 UniRef50_UPI0001AEB92D inter-alpha-trypsin inhibitor domain-cont... 205 4e-51 UniRef50_D0LNY0 von Willebrand factor type A n=1 Tax=Haliangium ... 205 4e-51 UniRef50_B7G6X2 Predicted protein n=1 Tax=Phaeodactylum tricornu... 205 4e-51 UniRef50_A9F1H2 Family membership n=1 Tax=Sorangium cellulosum '... 205 5e-51 UniRef50_C5FLY1 U-box domain containing protein n=1 Tax=Microspo... 205 5e-51 UniRef50_A5UYN7 von Willebrand factor, type A n=1 Tax=Roseiflexu... 204 6e-51 UniRef50_A6FSG0 Putative uncharacterized protein n=1 Tax=Roseoba... 204 8e-51 UniRef50_B5YMD8 Predicted protein n=1 Tax=Thalassiosira pseudona... 204 8e-51 UniRef50_Q55874 Uncharacterized protein sll0103 n=16 Tax=Cyanoba... 204 9e-51 UniRef50_C9SWV9 U-box domain containing protein n=1 Tax=Verticil... 203 1e-50 UniRef50_C4XPW8 Putative uncharacterized protein n=1 Tax=Desulfo... 203 2e-50 UniRef50_A0EHG0 Chromosome undetermined scaffold_97, whole genom... 202 3e-50 UniRef50_B8AE57 Putative uncharacterized protein n=1 Tax=Oryza s... 202 3e-50 UniRef50_B5ZY26 Vault protein inter-alpha-trypsin domain protein... 202 3e-50 UniRef50_B7RYC9 Vault protein inter-alpha-trypsin n=1 Tax=marine... 201 4e-50 UniRef50_C5EGH1 von Willebrand factor n=2 Tax=Clostridiales RepI... 201 5e-50 UniRef50_B0UK93 LPXTG-motif cell wall anchor domain protein n=1 ... 201 9e-50 UniRef50_A6GAI6 Putative uncharacterized protein n=1 Tax=Plesioc... 200 9e-50 UniRef50_Q237Q6 von Willebrand factor type A domain containing p... 200 1e-49 UniRef50_Q8YW34 All1782 protein n=5 Tax=Cyanobacteria RepID=Q8YW... 199 2e-49 UniRef50_B0SHY6 Anti-sigma factor antagonist n=2 Tax=Leptospira ... 199 2e-49 UniRef50_A0C051 Chromosome undetermined scaffold_14, whole genom... 199 3e-49 UniRef50_Q7S708 Predicted protein n=1 Tax=Neurospora crassa RepI... 198 5e-49 UniRef50_A5UTA6 von Willebrand factor, type A n=6 Tax=Chloroflex... 198 5e-49 UniRef50_C5GK44 U-box domain-containing protein n=2 Tax=Ajellomy... 198 5e-49 UniRef50_Q6PGW2 Zgc:112265 protein (Fragment) n=15 Tax=Clupeocep... 198 7e-49 UniRef50_B6HQM8 Pc22g11730 protein n=17 Tax=Leotiomyceta RepID=B... 197 1e-48 UniRef50_D2VKS7 von Willebrand factor type A domain-containing p... 195 4e-48 UniRef50_B8BII0 Putative uncharacterized protein n=1 Tax=Oryza s... 195 5e-48 UniRef50_Q15NW6 Vault protein inter-alpha-trypsin n=1 Tax=Pseudo... 195 5e-48 UniRef50_Q2GTB7 Putative uncharacterized protein n=1 Tax=Chaetom... 194 7e-48 UniRef50_Q22N58 von Willebrand factor type A domain containing p... 194 1e-47 UniRef50_B0TQ23 LPXTG-motif cell wall anchor domain n=1 Tax=Shew... 194 1e-47 UniRef50_Q86UX2 Inter-alpha-trypsin inhibitor heavy chain H5 n=4... 194 1e-47 UniRef50_A0C5K4 Chromosome undetermined scaffold_150, whole geno... 193 1e-47 UniRef50_B8CPU4 Von Willebrand factor, type A n=1 Tax=Shewanella... 193 1e-47 UniRef50_D1KBY4 Putative uncharacterized protein n=2 Tax=Proteob... 193 1e-47 UniRef50_D2VHB8 Predicted protein n=1 Tax=Naegleria gruberi RepI... 193 2e-47 UniRef50_Q24C76 von Willebrand factor type A domain containing p... 193 2e-47 UniRef50_C3YPR2 Putative uncharacterized protein (Fragment) n=1 ... 192 3e-47 UniRef50_D0LWF9 von Willebrand factor type A n=1 Tax=Haliangium ... 192 3e-47 UniRef50_Q2QZH3 Os11g0687100 protein n=79 Tax=Eukaryota RepID=Q2... 192 4e-47 UniRef50_A6GC99 Putative uncharacterized protein n=1 Tax=Plesioc... 191 4e-47 UniRef50_A3DLZ3 von Willebrand factor, type A n=1 Tax=Staphyloth... 191 5e-47 UniRef50_B9XQ17 Vault protein inter-alpha-trypsin domain protein... 191 5e-47 UniRef50_A6GDG5 Putative lipoprotein n=1 Tax=Plesiocystis pacifi... 191 8e-47 UniRef50_C5Z1W1 Putative uncharacterized protein Sb10g030210 n=1... 191 8e-47 UniRef50_C1HBZ8 von Willebrand and RING finger domain-containing... 191 9e-47 UniRef50_C5CDX4 von Willebrand factor type A n=1 Tax=Kosmotoga o... 190 1e-46 UniRef50_B8AXM1 Putative uncharacterized protein n=1 Tax=Oryza s... 189 2e-46 UniRef50_A0LHW4 Vault protein inter-alpha-trypsin domain protein... 189 2e-46 UniRef50_Q8YP40 Alr4360 protein n=15 Tax=Cyanobacteria RepID=Q8Y... 189 2e-46 UniRef50_UPI00016C377F protein containing a von Willebrand facto... 189 3e-46 UniRef50_UPI00006CAF43 von Willebrand factor type A domain conta... 189 3e-46 UniRef50_Q3IHK0 Putative uncharacterized protein n=2 Tax=Alterom... 188 4e-46 UniRef50_UPI00006A1915 Transmembrane protein 110. n=2 Tax=Xenopu... 188 4e-46 UniRef50_UPI00006CF36E U-box domain containing protein n=1 Tax=T... 188 5e-46 UniRef50_C7NN24 von Willebrand factor type A n=1 Tax=Halorhabdus... 188 6e-46 UniRef50_D0LJL4 Myxococcales GC_trans_RRR domain protein n=1 Tax... 188 6e-46 UniRef50_B8G546 von Willebrand factor type A n=6 Tax=Chloroflexi... 187 9e-46 UniRef50_B5JPY1 von Willebrand factor type A domain protein n=1 ... 187 9e-46 UniRef50_Q498Q0 Inter-alpha (Globulin) inhibitor H3 n=6 Tax=Clup... 187 9e-46 UniRef50_A3QDW1 Vault protein inter-alpha-trypsin domain protein... 187 1e-45 UniRef50_B1HPN0 Putative uncharacterized protein n=2 Tax=Bacilla... 186 2e-45 UniRef50_Q2QSE5 Os12g0431700 protein n=3 Tax=Oryza sativa RepID=... 186 2e-45 UniRef50_P79263 Inter-alpha-trypsin inhibitor heavy chain H4 n=3... 186 2e-45 UniRef50_A4C730 Putative uncharacterized protein n=1 Tax=Pseudoa... 185 4e-45 UniRef50_A6WMD3 Vault protein inter-alpha-trypsin domain protein... 184 5e-45 UniRef50_A8FW78 Vault protein inter-alpha-trypsin domain protein... 184 7e-45 UniRef50_UPI00016E8A41 UPI00016E8A41 related cluster n=3 Tax=Tak... 184 8e-45 UniRef50_B4DPQ4 cDNA FLJ60769, highly similar to Inter-alpha-try... 184 9e-45 UniRef50_A8H5J9 LPXTG-motif cell wall anchor domain n=1 Tax=Shew... 184 1e-44 UniRef50_C7R936 Vault protein inter-alpha-trypsin domain protein... 183 2e-44 UniRef50_A9F2Q0 Putative uncharacterized protein n=1 Tax=Sorangi... 182 3e-44 UniRef50_Q14624 35 kDa inter-alpha-trypsin inhibitor heavy chain... 182 3e-44 UniRef50_UPI000155CC23 PREDICTED: similar to ITI-like protein n=... 182 4e-44 UniRef50_A0EFJ5 Chromosome undetermined scaffold_93, whole genom... 182 4e-44 UniRef50_A3W9L9 Putative uncharacterized protein n=2 Tax=Erythro... 181 5e-44 UniRef50_A8U1E2 Putative uncharacterized protein n=1 Tax=alpha p... 181 5e-44 UniRef50_D0LD98 von Willebrand factor type A n=1 Tax=Gordonia br... 180 1e-43 UniRef50_A9AXC2 von Willebrand factor type A n=6 Tax=Chloroflexi... 180 1e-43 UniRef50_Q4SBF6 Chromosome 11 SCAF14674, whole genome shotgun se... 180 1e-43 UniRef50_C6VUB8 von Willebrand factor type A n=1 Tax=Dyadobacter... 180 1e-43 UniRef50_Q47YR5 Von Willebrand factor type A domain protein n=2 ... 179 2e-43 UniRef50_Q083T9 Vault protein inter-alpha-trypsin domain protein... 179 2e-43 UniRef50_Q0AMP5 Vault protein inter-alpha-trypsin domain protein... 179 3e-43 UniRef50_B1KDL1 LPXTG-motif cell wall anchor domain protein n=1 ... 179 3e-43 UniRef50_UPI00017B0D26 UPI00017B0D26 related cluster n=2 Tax=Tet... 179 3e-43 UniRef50_A1VI76 Vault protein inter-alpha-trypsin domain protein... 179 3e-43 UniRef50_Q2R0C4 Expressed protein n=2 Tax=Oryza sativa Japonica ... 178 4e-43 UniRef50_Q562D1 LOC594926 protein (Fragment) n=18 Tax=Euteleosto... 178 4e-43 UniRef50_Q5RHF3 Novel protein similar to vertebrate inter-alpha ... 178 5e-43 UniRef50_A6G9E8 von Willebrand factor type A domain protein n=1 ... 178 5e-43 UniRef50_B4VT64 Vault protein inter-alpha-trypsin n=9 Tax=Cyanob... 178 5e-43 UniRef50_Q2QZN4 von Willebrand factor type A domain containing p... 178 5e-43 UniRef50_UPI000180CCF8 PREDICTED: similar to PK-120 n=1 Tax=Cion... 178 7e-43 UniRef50_D2R2I7 von Willebrand factor type A n=2 Tax=Planctomyce... 178 7e-43 UniRef50_Q24FW2 von Willebrand factor type A domain containing p... 178 7e-43 UniRef50_B4UFP8 von Willebrand factor type A n=1 Tax=Anaeromyxob... 177 9e-43 UniRef50_Q64D90 Cell surface protein n=8 Tax=environmental sampl... 177 1e-42 UniRef50_B2AQN8 Predicted CDS Pa_4_3600 n=1 Tax=Podospora anseri... 177 1e-42 UniRef50_A6G2V8 von Willebrand factor, type A n=1 Tax=Plesiocyst... 177 1e-42 UniRef50_C0HF51 Putative uncharacterized protein n=1 Tax=Zea may... 177 1e-42 UniRef50_A6DLI0 Putative uncharacterized protein n=1 Tax=Lentisp... 177 1e-42 UniRef50_C7P2A9 von Willebrand factor type A n=2 Tax=Halobacteri... 176 2e-42 UniRef50_A6G415 von Willebrand factor, type A n=1 Tax=Plesiocyst... 176 2e-42 UniRef50_D0KVI6 Vault protein inter-alpha-trypsin domain protein... 176 2e-42 UniRef50_P19823 Inter-alpha-trypsin inhibitor heavy chain H2 n=4... 175 4e-42 UniRef50_B9GN58 Predicted protein (Fragment) n=3 Tax=rosids RepI... 175 5e-42 UniRef50_A9FM70 Putative uncharacterized protein n=1 Tax=Sorangi... 174 7e-42 UniRef50_C1I2R0 von Willebrand factor n=1 Tax=Clostridium sp. 7_... 174 8e-42 UniRef50_B0CG18 von Willebrand factor type A domain protein, put... 174 9e-42 UniRef50_B9MS10 von Willebrand factor type A n=2 Tax=Anaerocellu... 174 1e-41 UniRef50_Q22SJ7 von Willebrand factor type A domain containing p... 174 1e-41 UniRef50_A3ZU57 Putative lipoprotein n=1 Tax=Blastopirellula mar... 173 1e-41 UniRef50_UPI0001744662 Vault protein inter-alpha-trypsin domain ... 173 2e-41 UniRef50_Q0AV90 Putative uncharacterized protein n=1 Tax=Syntrop... 173 2e-41 UniRef50_A1ZFT4 Von Willebrand factor, type A n=1 Tax=Microscill... 172 3e-41 UniRef50_B4VXM6 Vault protein inter-alpha-trypsin n=1 Tax=Microc... 172 3e-41 UniRef50_Q8YNZ7 Alr4412 protein n=6 Tax=Cyanobacteria RepID=Q8YN... 172 4e-41 UniRef50_B9XLE8 Vault protein inter-alpha-trypsin domain protein... 171 5e-41 UniRef50_B8A860 Putative uncharacterized protein n=1 Tax=Oryza s... 171 7e-41 UniRef50_B8F8Z6 von Willebrand factor type A n=1 Tax=Desulfatiba... 171 9e-41 Sequences not found previously or not previously below threshold: >UniRef50_P76481 Uncharacterized protein yfbK n=38 Tax=Enterobacteriaceae RepID=YFBK_ECOLI Length = 575 Score = 601 bits (1550), Expect = e-170, Method: Composition-based stats. Identities = 575/575 (100%), Positives = 575/575 (100%) Query: 1 MRNKNIIMLLMSSLILSGCGPQPENKESQQQQPSTPTEQQVLAAQQAAIKEAEQSAAAAK 60 MRNKNIIMLLMSSLILSGCGPQPENKESQQQQPSTPTEQQVLAAQQAAIKEAEQSAAAAK Sbjct: 1 MRNKNIIMLLMSSLILSGCGPQPENKESQQQQPSTPTEQQVLAAQQAAIKEAEQSAAAAK 60 Query: 61 ALAQQEVQQYSDKQALQGRLQEAPTFARAAKAKATHIANPGTARYQQFDDNPVKQVAQNP 120 ALAQQEVQQYSDKQALQGRLQEAPTFARAAKAKATHIANPGTARYQQFDDNPVKQVAQNP Sbjct: 61 ALAQQEVQQYSDKQALQGRLQEAPTFARAAKAKATHIANPGTARYQQFDDNPVKQVAQNP 120 Query: 121 LATFSLDVDTGSYANVRRFLNQGLLPPPDAVRVEEIVNYFPSDWDIKDKQSIPASKPIPF 180 LATFSLDVDTGSYANVRRFLNQGLLPPPDAVRVEEIVNYFPSDWDIKDKQSIPASKPIPF Sbjct: 121 LATFSLDVDTGSYANVRRFLNQGLLPPPDAVRVEEIVNYFPSDWDIKDKQSIPASKPIPF 180 Query: 181 AMRYELAPAPWNEQRTLLKVDILAKDRKSEELPASNLVFLIDTSGSMISDERLPLIQSSL 240 AMRYELAPAPWNEQRTLLKVDILAKDRKSEELPASNLVFLIDTSGSMISDERLPLIQSSL Sbjct: 181 AMRYELAPAPWNEQRTLLKVDILAKDRKSEELPASNLVFLIDTSGSMISDERLPLIQSSL 240 Query: 241 KLLVKELREQDNIAIVTYAGDSRIALPSISGSHKAEINAAIDSLDAEGSTNGGAGLELAY 300 KLLVKELREQDNIAIVTYAGDSRIALPSISGSHKAEINAAIDSLDAEGSTNGGAGLELAY Sbjct: 241 KLLVKELREQDNIAIVTYAGDSRIALPSISGSHKAEINAAIDSLDAEGSTNGGAGLELAY 300 Query: 301 QQATKGFIKGGINRILLATDGDFNVGIDDPKSIESMVKKQRESGVTLSTFGVGNSNYNEA 360 QQATKGFIKGGINRILLATDGDFNVGIDDPKSIESMVKKQRESGVTLSTFGVGNSNYNEA Sbjct: 301 QQATKGFIKGGINRILLATDGDFNVGIDDPKSIESMVKKQRESGVTLSTFGVGNSNYNEA 360 Query: 361 MMVRIADVGNGNYSYIDTLSEAQKVLNSEMRQMLITVAKDVKAQIEFNPAWVTEYRQIGY 420 MMVRIADVGNGNYSYIDTLSEAQKVLNSEMRQMLITVAKDVKAQIEFNPAWVTEYRQIGY Sbjct: 361 MMVRIADVGNGNYSYIDTLSEAQKVLNSEMRQMLITVAKDVKAQIEFNPAWVTEYRQIGY 420 Query: 421 EKRQLRVEHFNNDNVDAGDIGAGKHITLLFELTLNGQKASIDKLRYAPDNKLAKSDKTKE 480 EKRQLRVEHFNNDNVDAGDIGAGKHITLLFELTLNGQKASIDKLRYAPDNKLAKSDKTKE Sbjct: 421 EKRQLRVEHFNNDNVDAGDIGAGKHITLLFELTLNGQKASIDKLRYAPDNKLAKSDKTKE 480 Query: 481 LAWLKIRWKYPQGKESQLVEFPLGPTINAPSEDMRFRAAVAAYGQKLRGSEYLNNTSWQQ 540 LAWLKIRWKYPQGKESQLVEFPLGPTINAPSEDMRFRAAVAAYGQKLRGSEYLNNTSWQQ Sbjct: 481 LAWLKIRWKYPQGKESQLVEFPLGPTINAPSEDMRFRAAVAAYGQKLRGSEYLNNTSWQQ 540 Query: 541 IKQWAQQAKGEDPQGYRAEFIRLIELADGVTDISQ 575 IKQWAQQAKGEDPQGYRAEFIRLIELADGVTDISQ Sbjct: 541 IKQWAQQAKGEDPQGYRAEFIRLIELADGVTDISQ 575 >UniRef50_A3D1E9 von Willebrand factor, type A n=8 Tax=Shewanella RepID=A3D1E9_SHEB5 Length = 642 Score = 488 bits (1255), Expect = e-136, Method: Composition-based stats. Identities = 238/578 (41%), Positives = 345/578 (59%), Gaps = 24/578 (4%) Query: 5 NIIMLLMSSLILSGCGPQPENKESQQQQPSTPTEQQVLAAQQAAIKEAEQSAAAAKALAQ 64 N LL+ ++ L+ CG + E +Q + QV + +QA +++A + A A Sbjct: 45 NTAALLLVAVSLTACGGKGAEVEHRQAEQQAEQRHQVASQRQAEMRDAAKVEMARVAAPM 104 Query: 65 QEVQQYSDKQALQGRLQEAPTFARAAKAKATHIANPGTARYQQFDDNPVKQVAQNPLATF 124 Q + + AP A +++Q N + + P++TF Sbjct: 105 QMSSNGAVMG-----MSIAPMPRDYAAIPL------AQNKFEQQVQNGIMVAGEIPVSTF 153 Query: 125 SLDVDTGSYANVRRFLNQGLLPPPDAVRVEEIVNYFPSDWDIKDKQSIPASKPIPFAMRY 184 +DVDTGSYA +RR L +G LP VRVEE++NYF D+ +PA PF++ Sbjct: 154 FIDVDTGSYATLRRMLREGRLPEKGTVRVEEMLNYFAYDYP------LPAKNAAPFSVTT 207 Query: 185 ELAPAPWNEQRTLLKVDILAKDRKSEELPASNLVFLIDTSGSMISDERLPLIQSSLKLLV 244 ELAP+P+N+ LL++ + D +L ASNLVFL+D SGSM S ++LPL+Q++LKLL Sbjct: 208 ELAPSPYNDDMMLLRIGLKGYDLPKSQLGASNLVFLLDVSGSMASADKLPLLQTALKLLT 267 Query: 245 KELREQDNIAIVTYAGDSRIALPSISGSHKAEINAAIDSLDAEGSTNGGAGLELAYQQAT 304 +L QD ++IV YAG + + L +SG+ + A++ L A GS NGG G+ AYQ A Sbjct: 268 AQLSAQDKVSIVVYAGAAGVVLDGVSGNDTQTLTYALEQLSAGGSINGGQGITQAYQLAK 327 Query: 305 KGFIKGGINRILLATDGDFNVGIDDPKSIESMVKKQRESGVTLSTFGVGNSNYNEAMMVR 364 K FI GINR++LATDGDFNVG+ D + ++++K+++ G+ L+T G G NYN+ +M + Sbjct: 328 KHFIPNGINRVILATDGDFNVGVTDFDDLIALIEKEKDHGIGLTTLGFGLGNYNDQLMEQ 387 Query: 365 IADVGNGNYSYIDTLSEAQKVLNSEMRQMLITVAKDVKAQIEFNPAWVTEYRQIGYEKRQ 424 +AD GNGNY+YIDTL+EA+KVL E+ L T+AKDVK Q+EFNPA V+EYR IGYE R Sbjct: 388 LADKGNGNYAYIDTLNEARKVLVDELSSTLFTIAKDVKVQVEFNPALVSEYRLIGYENRA 447 Query: 425 LRVEHFNNDNVDAGDIGAGKHITLLFELT-LNGQKASIDKLRYAPDNKLAKSDKTK-ELA 482 L E FNN VDAG+IGAG +T L+EL + DKLRY D + K ++ E+A Sbjct: 448 LAREDFNNYKVDAGEIGAGHTVTALYELRYVEAGNRMNDKLRYGVDAQTGKEKYSRKEIA 507 Query: 483 WLKIRWKYPQGKESQLVEFPL-----GPTINAPSEDMRFRAAVAAYGQKLRGSEYLNNTS 537 +LK+R+K P +SQL+ +P+ + S+D RF AAVA GQ L GS YL+ Sbjct: 508 FLKLRYKLPAQTQSQLLSYPIRLDQSVKQLEQASDDFRFAAAVAGLGQLLNGSHYLHQFD 567 Query: 538 WQQIKQWAQQAKGEDPQGYRAEFIRLIELADGVTDISQ 575 + ++ A+ A G+DP GYR EF++L+E A + +Q Sbjct: 568 YTKLSLLARSALGDDPFGYRHEFVQLMETAAAIEQSNQ 605 >UniRef50_B8F8Z5 von Willebrand factor type A n=1 Tax=Desulfatibacillum alkenivorans AK-01 RepID=B8F8Z5_DESAA Length = 558 Score = 487 bits (1254), Expect = e-136, Method: Composition-based stats. Identities = 210/541 (38%), Positives = 305/541 (56%), Gaps = 18/541 (3%) Query: 41 VLAAQQAAIKEAEQSAAAAKALAQQEVQQYSDKQALQGRLQEAPTFARAA-----KAKAT 95 V A AA+ S A Y+ K A A A Sbjct: 23 VCLAGSAAVSFTSCSPKTVNEDAMTGYSGYTSKSTSAEPSMSAAKPCPAPKSEQRYAYYC 82 Query: 96 HIANPGTARYQQFDDNPVKQVAQNPLATFSLDVDTGSYANVRRFLNQGLLPPPDAVRVEE 155 + + T Y + K +PL+TFS+DVDT SY+NVRRFL+ G +PP DAVR+EE Sbjct: 83 RVPDYNTEEYAPIREGGFKSPLYDPLSTFSIDVDTASYSNVRRFLSYGNMPPVDAVRIEE 142 Query: 156 IVNYFPSDWDIKDKQSIPASKPIPFAMRYELAPAPWNEQRTLLKVDILAKDRKSEELPAS 215 ++NYF D+ Q PF++ E++ PWN L+ V + + +++ S Sbjct: 143 MINYFHYDYPQPKGQ-------DPFSITMEMSQCPWNRDNMLVHVGLQGRCLDYKDVKPS 195 Query: 216 NLVFLIDTSGSMISDERLPLIQSSLKLLVKELREQDNIAIVTYAGDSRIALPSISGSHKA 275 NLVFL+D SGSM S+ +LPL++ S+++LVKEL D ++IVTYAG + + LPS S +K Sbjct: 196 NLVFLLDVSGSMNSENKLPLVKRSMEMLVKELGAGDRVSIVTYAGSAGLVLPSTSARNKR 255 Query: 276 EINAAIDSLDAEGSTNGGAGLELAYQQATKGFIKGGINRILLATDGDFNVGIDDPKSIES 335 +I A+D L+A GST GG G+ELAY+ A + I G NR++L TDGDFNVG+ + Sbjct: 256 KIITALDRLEAGGSTAGGEGIELAYRVAWENLIPEGNNRVILCTDGDFNVGVSSTPELVR 315 Query: 336 MVKKQRESGVTLSTFGVGNSNYNEAMMVRIADVGNGNYSYIDTLSEAQKVLNSEMRQMLI 395 M++++R +G+ L+ G G NY + M I++ GNGN+ YID+ EA KV +MR + Sbjct: 316 MIEEKRRAGIYLTICGFGMGNYKDEKMEAISNAGNGNFYYIDSRREAHKVFVQDMRANMF 375 Query: 396 TVAKDVKAQIEFNPAWVTEYRQIGYEKRQLRVEHFNNDNVDAGDIGAGKHITLLFELTLN 455 T+AKDVK Q+EFNP V++YR +GYE R L E FNND DAG+IG G +T L+E+ Sbjct: 376 TLAKDVKIQVEFNPGRVSQYRLVGYENRLLAAEDFNNDLKDAGEIGPGHSVTALYEIVPA 435 Query: 456 G---QKASIDKLRYAPDNKLAKSDKTKELAWLKIRWKYPQGKESQLVEFPLGPT---INA 509 G +D L+Y + + + E+ +K R+K P+ S+L+ L + Sbjct: 436 GLGMGAQRVDPLKYQESEPVPELRNSNEILTIKFRYKNPEENRSRLITRVLDESSMEFGD 495 Query: 510 PSEDMRFRAAVAAYGQKLRGSEYLNNTSWQQIKQWAQQAKGEDPQGYRAEFIRLIELADG 569 S+D RF AAVA +G LR S Y + +W QI+ A+++ G D GYRAEFI+L++ Sbjct: 496 TSDDFRFSAAVAGWGMLLRNSSYADRLTWGQIQSMAEESVGPDEMGYRAEFIKLVKTCRE 555 Query: 570 V 570 + Sbjct: 556 L 556 >UniRef50_C7PNZ7 von Willebrand factor type A n=5 Tax=Bacteroidetes RepID=C7PNZ7_CHIPD Length = 639 Score = 474 bits (1220), Expect = e-132, Method: Composition-based stats. Identities = 210/526 (39%), Positives = 304/526 (57%), Gaps = 11/526 (2%) Query: 51 EAEQSAAAAKALAQQEVQQYSDKQALQGRLQEAPTFARAAKAKATHIANPGTARYQQFDD 110 + ++ + + + K + A + + + T Y ++ Sbjct: 116 DPVNTSLQEDVVVNAMIVPEAPKTVAGSPVVNAYMKSASPAFYGSRAPQFNTEDYSPVNE 175 Query: 111 NPVKQVAQNPLATFSLDVDTGSYANVRRFLNQGLLPPPDAVRVEEIVNYFPSDWDIKDKQ 170 N VA +PL+TFS+DVD SY+NVRRFLN+G +PP DAVRVEE++NYF + Sbjct: 176 NRFHTVASDPLSTFSIDVDRASYSNVRRFLNEGNMPPVDAVRVEEMINYFDYKYSN---- 231 Query: 171 SIPASKPIPFAMRYELAPAPWNEQRTLLKVDILAKDRKSEELPASNLVFLIDTSGSMISD 230 + P A+R ++A PWN L+++ + KD + LP SNLVFLID SGSM Sbjct: 232 ---PTGNTPVAVRTDMAICPWNTAHQLVRIALKGKDVAKDNLPPSNLVFLIDVSGSMSDA 288 Query: 231 ERLPLIQSSLKLLVKELREQDNIAIVTYAGDSRIALPSISGSHKAEINAAIDSLDAEGST 290 ++LPL++ + KLLV +LR D +AIV YAG + + LPS SG HK I A+D L+A GST Sbjct: 289 KKLPLVKQAFKLLVNQLRPVDRVAIVVYAGAAGLVLPSTSGDHKTAILDALDKLEAGGST 348 Query: 291 NGGAGLELAYQQATKGFIKGGINRILLATDGDFNVGIDDPKSIESMVKKQRESGVTLSTF 350 GG G++LAY+ AT+ +K G NR+++ATDGDFNVG ++ +++K+RE G+ LS Sbjct: 349 AGGEGVQLAYKTATEYLLKSGNNRVIIATDGDFNVGPSSDGELQRIIEKKREKGIFLSVL 408 Query: 351 GVGNSNYNEAMMVRIADVGNGNYSYIDTLSEAQKVLNSEMRQMLITVAKDVKAQIEFNPA 410 G G NY + + +AD GNGNY+YID EA++ +E L T+AKDVK Q+EFNP Sbjct: 409 GFGMGNYKDNKLELLADKGNGNYAYIDNFEEARRTFATEFGGTLFTIAKDVKLQVEFNPK 468 Query: 411 WVTEYRQIGYEKRQLRVEHFNNDNVDAGDIGAGKHITLLFELTLNGQKASIDKL-RYAPD 469 +V YR +GYE R L E FN+D DAGD+GAG +T L+E+ G + + Sbjct: 469 YVQSYRLVGYENRLLNNEDFNDDKKDAGDMGAGHTVTALYEVVPVGVQTGQPAVDPLKYQ 528 Query: 470 NKLAKSDKTKELAWLKIRWKYPQGKESQLVEFPLG---PTINAPSEDMRFRAAVAAYGQK 526 S E+ +K+R+K P SQL+ L I+A ED R AVA +G Sbjct: 529 QNQPVSGDNTEVLTVKLRYKNPADTSSQLISQVLHWKRQDISAAPEDFRMATAVADFGLL 588 Query: 527 LRGSEYLNNTSWQQIKQWAQQAKGEDPQGYRAEFIRLIELADGVTD 572 LR SE+ N S++Q+ + A A+G D +GYRAEFI+L++ A +++ Sbjct: 589 LRNSEHKGNASYEQVLKLAGNARGTDEEGYRAEFIQLVKKAQLISN 634 >UniRef50_Q4KKB4 von Willebrand factor type A domain protein n=20 Tax=Proteobacteria RepID=Q4KKB4_PSEF5 Length = 582 Score = 473 bits (1217), Expect = e-132, Method: Composition-based stats. Identities = 253/565 (44%), Positives = 349/565 (61%), Gaps = 26/565 (4%) Query: 12 SSLILSGCGPQPENKESQQQQPSTPTEQQVLAAQQAAIKEAEQSAAAAKALAQQEVQQYS 71 L ++GCG + + + + A + +A + ++ Sbjct: 19 LLLAVAGCGVSSKPESAAGSSTQGALQAAPQAQYEVQHADATMA------------KRAV 66 Query: 72 DKQALQGRLQEAPTFARAAKAKATHIANPGTARYQQFDDNPVKQVAQNPLATFSLDVDTG 131 L + + + A + +YQ+ DNP+ VA+ P++TFS DVDTG Sbjct: 67 HPMRLSAPMPAPISSRDSLVAGYR---DEPREQYQKLPDNPIHSVAEAPVSTFSADVDTG 123 Query: 132 SYANVRRFLNQGLLPPPDAVRVEEIVNYFPSDWDIKDKQSIPASKPIPFAMRYELAPAPW 191 +YANVRR LNQG LPP AVR+EE+VNYFP D+ + + PF + ELAP+PW Sbjct: 124 AYANVRRLLNQGSLPPEGAVRLEELVNYFPYDYAL-------PTDGSPFGVTTELAPSPW 176 Query: 192 NEQRTLLKVDILAKDRKSEELPASNLVFLIDTSGSMISDERLPLIQSSLKLLVKELREQD 251 N LL++ I A DR EL +NLVFL+D SGSM E LPL++S+LKLLV +LR+QD Sbjct: 177 NPHTRLLRIGIKASDRAVAELAPANLVFLVDVSGSMDRREGLPLVKSTLKLLVDQLRDQD 236 Query: 252 NIAIVTYAGDSRIALPSISGSHKAEINAAIDSLDAEGSTNGGAGLELAYQQATKGFIKGG 311 +++V YAG+SR+ L SG KA+I AID L A GST G +G++LAYQ A +GFI G Sbjct: 237 RVSLVVYAGESRVVLEPTSGRDKAKIRTAIDQLTAGGSTAGASGIQLAYQMAQQGFIDQG 296 Query: 312 INRILLATDGDFNVGIDDPKSIESMVKKQRESGVTLSTFGVGNSNYNEAMMVRIADVGNG 371 INRILLATDGDFNVG+ D S+++M ++R+SGV+L+T G G NYNE +M ++AD G+G Sbjct: 297 INRILLATDGDFNVGVSDFDSLKAMAAEKRKSGVSLTTLGFGVDNYNEHLMEQLADAGDG 356 Query: 372 NYSYIDTLSEAQKVLNSEMRQMLITVAKDVKAQIEFNPAWVTEYRQIGYEKRQLRVEHFN 431 NY+YID L EA+KVL ++ L VAKDVK Q+EFNPA V+EYR +GYE R L+ E F+ Sbjct: 357 NYAYIDNLREARKVLVDQLSSTLAVVAKDVKLQVEFNPAQVSEYRLLGYENRALKREDFS 416 Query: 432 NDNVDAGDIGAGKHITLLFELTLNGQKASIDKLRYAPDNKLAKSDKTKELAWLKIRWKYP 491 ND VDAG+IGAG +T L+E+ G+K ++ LRYA +S K ELA L++R+K P Sbjct: 417 NDKVDAGEIGAGHTVTALYEIVPAGEKGWLEPLRYAQAKAPQQSGKQGELAMLRLRYKAP 476 Query: 492 QGKESQLVEFPLGP----TINAPSEDMRFRAAVAAYGQKLRGSEYLNNTSWQQIKQWAQQ 547 +G S+L+E P+ ++ A S D+RF AAVAA+ Q+L+ Y N S + AQ Sbjct: 477 EGGSSRLIERPISAQQPGSLAAASPDLRFAAAVAAFSQQLKDGRYTGNFSLADTVKLAQG 536 Query: 548 AKGEDPQGYRAEFIRLIELADGVTD 572 AKG DP G R EF++L+ELA + Sbjct: 537 AKGADPYGLRGEFVQLVELAQSLQT 561 >UniRef50_A4C5H3 Von Willebrand factor type A domain protein n=1 Tax=Pseudoalteromonas tunicata D2 RepID=A4C5H3_9GAMM Length = 608 Score = 472 bits (1214), Expect = e-131, Method: Composition-based stats. Identities = 232/600 (38%), Positives = 340/600 (56%), Gaps = 40/600 (6%) Query: 6 IIMLLMSSLILSGCGPQPENKESQQQQPSTPTEQQVLAAQQAAIKEAEQSAAAAKALAQQ 65 I+ L +L+GC Q +N +Q + + EQ+ + +Q+ Sbjct: 15 SILTLTIISLLAGCQGQQQNTSAQSDEQAAVIEQKNTQREAQTQTNTSAEVQTKSKSSQE 74 Query: 66 EVQQYSDKQALQGRLQEAPTFARAAKAKATHIAN------------------------PG 101 +++ + + A A + H Sbjct: 75 NEIRFNHPRIENVAGLNHESLAPAQMQQVRHRIGAVYPPMPMPPILPPKPMPPQFENEQA 134 Query: 102 TARYQQFDDNPVKQVAQNPLATFSLDVDTGSYANVRRFLNQGLLPPPDAVRVEEIVNYFP 161 Y + + NPVKQV P++TFS+DVDTGSY+N RR + G PP DAVR E +NYF Sbjct: 135 RENYLKNEQNPVKQVMLEPVSTFSIDVDTGSYSNSRRMIKMGKRPPADAVREEAFINYFD 194 Query: 162 SDWDIKDKQSIPASKPIPFAMRYELAPAPWNEQRTLLKVDILAKDRKSEELPASNLVFLI 221 + S P S PF + E+APAPWN QR LLK+ I D + EL A+NLVFL+ Sbjct: 195 YHY------SAPKSLETPFNVHTEVAPAPWNNQRQLLKIGIKGFDIEKAELKAANLVFLL 248 Query: 222 DTSGSMISDERLPLIQSSLKLLVKELREQDNIAIVTYAGDSRIALPSISGSHKAEINAAI 281 D SGSM + ++LPL++SSL +L K+L E D++AIV YAG + + LP+ G+ I+ A+ Sbjct: 249 DVSGSMNAPDKLPLLKSSLTMLTKQLDENDSVAIVVYAGAAGLVLPATKGNEYQVISNAL 308 Query: 282 DSLDAEGSTNGGAGLELAYQQATKGFIKGGINRILLATDGDFNVGIDDPKSIESMVKKQR 341 ++L A GSTNG G+ELAYQ A++ F K GINR++LATDGDFNVG+ +++ ++ +R Sbjct: 309 NNLSAGGSTNGAQGIELAYQIASQNFKKEGINRVILATDGDFNVGMSSVDALKKLIANKR 368 Query: 342 ESGVTLSTFGVGNSNYNEAMMVRIADVGNGNYSYIDTLSEAQKVLNSEMRQMLITVAKDV 401 ++G+ L+T G G NYN+ +M ++A++GNG ++YIDT++EA+KVL E+ + +AKDV Sbjct: 369 KTGIALTTLGFGQGNYNDGLMEQLANIGNGQHAYIDTINEARKVLVDELSSTMQIIAKDV 428 Query: 402 KAQIEFNPAWVTEYRQIGYEKRQLRVEHFNNDNVDAGDIGAGKHITLLFELTLNGQKA-S 460 K Q+EFNPA V EYR IGY+ R L+ E FNND VDAG++GAG +T L+E+TL A Sbjct: 429 KIQVEFNPAQVAEYRLIGYQNRLLKQEDFNNDTVDAGELGAGHTVTALYEITLANSPAKQ 488 Query: 461 IDKLRYAPDNKLAK----SDKTKELAWLKIRWKYPQGKESQLVEFPLGPT-----INAPS 511 ID LRY ++ S ELA++K+R+K P S+L+ + + S Sbjct: 489 IDDLRYQTPQQMPTNSNFSSAQDELAYVKLRYKAPNSDVSKLMSQAIFASETQSQFAQAS 548 Query: 512 EDMRFRAAVAAYGQKLRGSEYLNNTSWQQIKQWAQQAKGEDPQGYRAEFIRLIELADGVT 571 +D +F A VA + KL+G +Y +QQ+ A KG+DP GYR EFI+L+ A + Sbjct: 549 QDFQFAATVAGFADKLKGEKYTGLWQYQQLIDVAVANKGDDPFGYRNEFIQLLRTAAELE 608 >UniRef50_C6VVX3 von Willebrand factor type A n=17 Tax=Bacteria RepID=C6VVX3_DYAFD Length = 625 Score = 469 bits (1207), Expect = e-130, Method: Composition-based stats. Identities = 215/530 (40%), Positives = 320/530 (60%), Gaps = 15/530 (2%) Query: 53 EQSAAAAKALAQQEVQQYSDKQALQGRLQEAPTFARAAKAKATHIANPGTARYQQFDDNP 112 Q++ K +A++E++ ++ A + ++ T Y+ ++N Sbjct: 94 TQASVKKKDIAREELKDSEQPVMVRKMFTFDMAAAPSTHSETILAMPQATESYKPINENG 153 Query: 113 VKQVAQNPLATFSLDVDTGSYANVRRFLNQGLLPPPDAVRVEEIVNYFPSDWDIKDKQSI 172 V Q P+ TFS+DVD +Y+NVRRFLN G +PP DAVR+EE++NYF D+ Sbjct: 154 FLSVGQQPVTTFSVDVDRAAYSNVRRFLNNGQMPPEDAVRIEEMINYFDYDYPQ------ 207 Query: 173 PASKPIPFAMRYELAPAPWNEQRTLLKVDILAKDRKSEELPASNLVFLIDTSGSMISDER 232 P A+ E +PWN L+ + + AK +E L ASNLVFLID SGSM + Sbjct: 208 -PRGEHPVAIVAETTDSPWNPGLKLVHIGLQAKTVSAENLSASNLVFLIDVSGSMNEANK 266 Query: 233 LPLIQSSLKLLVKELREQDNIAIVTYAGDSRIALPSISGSHKAEINAAIDSLDAEGSTNG 292 LPL++ + KLL +LR +D I+IV YAG + + L SGS K I A+D L+A GST G Sbjct: 267 LPLLKQAFKLLADQLRVEDKISIVAYAGSAGMVLAPTSGSEKKTIKDALDKLEAGGSTAG 326 Query: 293 GAGLELAYQQATKGFIKGGINRILLATDGDFNVGIDDPKSIESMVKKQRESGVTLSTFGV 352 G G+ELAY A K F+ G NR++LATDGDFNVGI + ++ +++++R++G+ LS G Sbjct: 327 GEGIELAYDLAKKHFLPKGNNRVILATDGDFNVGISNESELQKLIEEKRKAGIFLSVMGF 386 Query: 353 GNSNYNEAMMVRIADVGNGNYSYIDTLSEAQKVLNSEMRQMLITVAKDVKAQIEFNPAWV 412 G NY ++ + +AD GNGNY+YID + EA+KV E L T+AKDVK QIEFNPA V Sbjct: 387 GMGNYKDSHVETLADKGNGNYAYIDNIQEARKVFVQEFGGTLFTIAKDVKIQIEFNPAHV 446 Query: 413 TEYRQIGYEKRQLRVEHFNNDNVDAGDIGAGKHITLLFELTLNGQK----ASIDKLRYAP 468 YR IGYE R LR + FN+D DAGD+G+G +T ++E+ +G K A+ D L+Y P Sbjct: 447 QAYRLIGYENRALRNDEFNDDRKDAGDMGSGHTVTAIYEIVPSGVKSPYVATTDALKYQP 506 Query: 469 DNKLAKSDKTKELAWLKIRWKYPQGKESQLVEFPLGPT---INAPSEDMRFRAAVAAYGQ 525 N A KE+ +K+R+K P ++S+L + P+ T + S ++RF +AVA +G Sbjct: 507 GNA-ATGSSNKEMMTIKVRYKQPDSEKSKLFDLPVPATTVAFDQCSANLRFASAVAEFGL 565 Query: 526 KLRGSEYLNNTSWQQIKQWAQQAKGEDPQGYRAEFIRLIELADGVTDISQ 575 LRGSE+ + S+ + + A+ A G+D +GYR+EF++L+++A + + Sbjct: 566 LLRGSEFKGSASYADVIRRARAAFGKDEEGYRSEFVQLVKVAQSLDGSQE 615 >UniRef50_C6M483 von Willebrand factor type A domain protein n=1 Tax=Neisseria sicca ATCC 29256 RepID=C6M483_NEISI Length = 538 Score = 467 bits (1202), Expect = e-130, Method: Composition-based stats. Identities = 238/521 (45%), Positives = 328/521 (62%), Gaps = 14/521 (2%) Query: 57 AAAKALAQQEVQQYSDKQALQGRLQEAPTFARAAKAKATHIANPGTARYQQFDDNPVKQV 116 AA A + S + LQ A + A A+ T RYQ D PVK V Sbjct: 23 AALAACSGPLEHSSSSPEGLQSPPNAALSTAAVAEENLPL--AENTERYQDQPDQPVKSV 80 Query: 117 AQNPLATFSLDVDTGSYANVRRFLNQGLLPPPDAVRVEEIVNYFPSDWDIKDKQSIPASK 176 AQ P++TFS+DVDTGSYANVRRFL G PP DAVR+EEIVNYFP ++ + + Sbjct: 81 AQEPVSTFSIDVDTGSYANVRRFLTNGEQPPKDAVRIEEIVNYFPYNYPL-------PTD 133 Query: 177 PIPFAMRYELAPAPWNEQRTLLKVDILAKDRKSEELPASNLVFLIDTSGSMISDERLPLI 236 PFA+ E +PW + L+K+ I A+D ++LP +NLVFL+D SGSM + +LPL+ Sbjct: 134 NRPFAVHTETIDSPWQPEAKLIKIGIQAQDTAKKDLPPANLVFLVDVSGSMDEENKLPLV 193 Query: 237 QSSLKLLVKELREQDNIAIVTYAGDSRIALPSISGSHKAEINAAIDSLDAEGSTNGGAGL 296 Q +L++L ++LR QD + ++TYA + LP SG+ K I +AID L A G+T+G + L Sbjct: 194 QKTLRILTQQLRPQDKVTLITYASGEDLVLPPTSGADKETILSAIDKLRAGGATDGESAL 253 Query: 297 ELAYQQATKGFIKGGINRILLATDGDFNVGIDDPKSIESMVKKQRESGVTLSTFGVGNSN 356 ++AY+QA K F+ GINRILLATDGDFNVG+ D ++++SMV ++R+SGV+LST G G N Sbjct: 254 QMAYEQAQKAFVPNGINRILLATDGDFNVGVSDTETLKSMVAEKRKSGVSLSTLGFGMGN 313 Query: 357 YNEAMMVRIADVGNGNYSYIDTLSEAQKVLNSEMRQMLITVAKDVKAQIEFNPAWVTEYR 416 YNE MM +IAD G+GNYSYID EA+KVL ++ L TVA+DVK Q+EFNPA V EYR Sbjct: 314 YNEDMMEQIADAGDGNYSYIDNEKEAKKVLQQQLTSTLATVAQDVKIQVEFNPATVKEYR 373 Query: 417 QIGYEKRQLRVEHFNNDNVDAGDIGAGKHITLLFELTLNGQKASIDKLRYAPDNKLAKSD 476 +GY R LR E FNND VDAGDIGAG +T L+E+ G++ +++ RY A Sbjct: 374 LVGYTNRTLRNEDFNNDRVDAGDIGAGHSVTALYEIIPQGKQGWLEESRY--QKAPAAKG 431 Query: 477 KTKELAWLKIRWKYPQGKESQLVEFPLG---PTINAPSEDMRFRAAVAAYGQKLRGSEYL 533 E A++K+R+K P K+SQL++ + ++ +D A+Y Q LRG EY Sbjct: 432 SKNEYAFVKVRYKLPGQKDSQLMQQAVPVGSKPLDQADKDTLLALTAASYAQALRGGEYN 491 Query: 534 NNTSWQQIKQWAQQAKGEDPQGYRAEFIRLIELADGVTDIS 574 SW+ I+ AQ+ +G+DP G ++EF++L+ A G S Sbjct: 492 GKLSWRDIENMAQKVQGDDPFGLKSEFLQLVRTAAGHGGTS 532 >UniRef50_A5WCP1 von Willebrand factor, type A n=2 Tax=Moraxellaceae RepID=A5WCP1_PSYWF Length = 571 Score = 465 bits (1197), Expect = e-129, Method: Composition-based stats. Identities = 254/568 (44%), Positives = 357/568 (62%), Gaps = 24/568 (4%) Query: 7 IMLLMSSLILSGCGPQPENKESQQQQPSTPTEQQVLAAQQAAIKEAEQSAAAAKALAQQE 66 ++ ++S +++ C PQ + + + +T E A+Q+ +A A E Sbjct: 23 LLSVLSLSVITACAPQSKITPASDKASTTTLE----TAEQSIQADAAAPVVVMATPAMAE 78 Query: 67 VQQYSDKQALQGRLQEAPTFARAAKAKATHIANPGTARYQQFDDNPVKQVAQNPLATFSL 126 +Q S K R+ P+ ++A Y + + N V ++ AT S+ Sbjct: 79 SRQLS-KMTTNARIMPPPSQG--------YMAPKQQENYAEIEPNAVNATSEQAFATLSI 129 Query: 127 DVDTGSYANVRRFLNQGLLPPPDAVRVEEIVNYFPSDWDIKDKQSIPASKPIPFAMRYEL 186 D DTGSYANVRRFLNQG LPP DAVRVEE++NYF D+ KQ+ PF + E+ Sbjct: 130 DTDTGSYANVRRFLNQGQLPPKDAVRVEELINYFNYDFTAAKKQA-----NAPFLVSTEV 184 Query: 187 APAPWNEQRTLLKVDILAKDR--KSEELPASNLVFLIDTSGSMISDERLPLIQSSLKLLV 244 +PW+ ++KV I A+D ++ P +NLVFL+D SGSM ++++L L +SSLK+L Sbjct: 185 VNSPWHPTNQIVKVGIKAEDLLTAKQKQPPANLVFLVDVSGSMDTEDKLQLAKSSLKMLT 244 Query: 245 KELREQDNIAIVTYAGDSRIALPSISGSHKAEINAAIDSLDAEGSTNGGAGLELAYQQAT 304 K+LR QD+I ++TYAG++++ LPS G+ +I AID+L A GSTNG A ++LAYQQAT Sbjct: 245 KQLRAQDSITLITYAGNTKVVLPSTPGNQTQKILNAIDNLTASGSTNGEAAIKLAYQQAT 304 Query: 305 KGFIKGGINRILLATDGDFNVGIDDPKSIESMVKKQRESGVTLSTFGVGNSNYNEAMMVR 364 + F K GINRIL+ TDGDFNVG+ K + +++ R+ G++LST G G NYN+ MM + Sbjct: 305 EHFKKDGINRILMLTDGDFNVGVSSVKDMLQIIRSNRDKGISLSTLGFGQGNYNDHMMEQ 364 Query: 365 IADVGNGNYSYIDTLSEAQKVLNSEMRQMLITVAKDVKAQIEFNPAWVTEYRQIGYEKRQ 424 +AD GNGNYSYID+LSEA+KVL EM TVAKDVK Q+EFNPA V+E+R IGYE R Sbjct: 365 VADNGNGNYSYIDSLSEAKKVLIDEMSATFNTVAKDVKIQLEFNPAAVSEWRLIGYENRV 424 Query: 425 LRVEHFNNDNVDAGDIGAGKHITLLFELTLNGQKASIDKLRYAPDNKLAKSDKTKELAWL 484 L E FNNDNVDAG++GAGK + LFE+T GQK ++ RY N A S +EL +L Sbjct: 425 LAKEDFNNDNVDAGELGAGKSVVALFEVTPVGQKGLLEPSRY--QNSAAVSGNNRELGFL 482 Query: 485 KIRWKYPQGKESQLVEFPLGPTINAPSEDMRFRAAVAAYGQKLRGSEYLNNTSWQQIKQW 544 KIR+K PQ ++SQL+ FP+ + A S D F AVA YGQ L GS+Y+N+ S+ Q+++ Sbjct: 483 KIRYKAPQAEKSQLLSFPIANRVTAASADTNFALAVAGYGQLLTGSKYVNDLSYSQLQRL 542 Query: 545 AQQAKGE--DPQGYRAEFIRLIELADGV 570 A+ D G R+EFI+L+ LA+ + Sbjct: 543 AKSGAQSPIDSSGSRSEFIKLVSLAEAL 570 >UniRef50_C5BTM6 von Willebrand factor type A domain protein n=1 Tax=Teredinibacter turnerae T7901 RepID=C5BTM6_TERTT Length = 689 Score = 462 bits (1189), Expect = e-128, Method: Composition-based stats. Identities = 231/572 (40%), Positives = 333/572 (58%), Gaps = 22/572 (3%) Query: 3 NKNIIMLLMSSLILSGCGPQPENKES--QQQQPSTPTEQQVLAAQQAAIKEAEQSAAAAK 60 K ++ + + + P+PE + Q+ + + L + +AA ++ AA Sbjct: 129 GKEVVEEVKVTGMRESLQPEPETQRHALQEYDAISAKDIGALPSHEAARNLQRFASPAAS 188 Query: 61 ALAQQEVQQYSDKQALQGRLQEAPTFARAAKAKATHIANPGTARYQQFDDNPVKQVAQNP 120 + A++EV + + R + + + NP+K + P Sbjct: 189 SEAKREVLMKREASSFMPR--------KPDLEPPHQLETADRDHFDTVATNPIKVTREEP 240 Query: 121 LATFSLDVDTGSYANVRRFLNQGLLPPPDAVRVEEIVNYFPSDWDIKDKQSIPASKPIPF 180 ++TFS+DVDT SY+ VRR LN+G LP AVR+EE+VNYFP D+ +P++ PF Sbjct: 241 VSTFSIDVDTASYSFVRRQLNRGQLPQKAAVRLEEMVNYFPYDYP------LPSAATAPF 294 Query: 181 AMRYELAPAPWNEQRTLLKVDILAKDRKSEELPASNLVFLIDTSGSMISDERLPLIQSSL 240 + PAPWN+ + L+ + I A P +NLVFL+D SGSM S ++LPL++ S+ Sbjct: 295 KPTITVIPAPWNQAKRLVHIGIKALPLAHP--PKANLVFLLDVSGSMGSPDKLPLVKQSM 352 Query: 241 KLLVKELREQDNIAIVTYAGDSRIALPSISGSHKAEINAAIDSLDAEGSTNGGAGLELAY 300 +LL+ L+ D ++IV YAG + L + + +I AA+D L+A GST G G+ELAY Sbjct: 353 ELLLSGLQPTDTVSIVVYAGAAGTVLEPTPVAEQQKILAALDRLNAGGSTAGAQGIELAY 412 Query: 301 QQATKGFIKGGINRILLATDGDFNVGIDDPKSIESMVKKQRESGVTLSTFGVGNSNYNEA 360 Q A + + +NRI+LATDGDFNVGI DP+ ++ V+++R +G+ LS G G+ NYN+A Sbjct: 413 QLAEANYQRDAVNRIILATDGDFNVGIADPEQLKGYVERKRANGIELSILGFGSGNYNDA 472 Query: 361 MMVRIADVGNGNYSYIDTLSEAQKVLNSEMRQMLITVAKDVKAQIEFNPAWVTEYRQIGY 420 +M ++A GNG +YIDTLSEAQKVL + L TVAKDVK Q+EFNPA V EYR +GY Sbjct: 473 LMQQLAQNGNGVAAYIDTLSEAQKVLVEQASGTLFTVAKDVKIQVEFNPATVAEYRLLGY 532 Query: 421 EKRQLRVEHFNNDNVDAGDIGAGKHITLLFELTLNGQKAS-IDKLRYAPDNKLAKSDKTK 479 E R L+ E FNND VDAG+IGAG +T ++E+T G KA+ ID RYA +DK Sbjct: 533 ETRALKREDFNNDAVDAGEIGAGHTVTAIYEITPAGSKAALIDSQRYAAAKIATDTDKAS 592 Query: 480 ELAWLKIRWKYPQGKESQLVEFPLGPTINAPSEDMR---FRAAVAAYGQKLRGSEYLNNT 536 E +LKIR+K P G ESQL+ P+ ++ +R F AAVA + Q L+ YL N Sbjct: 593 EYGFLKIRYKQPGGSESQLISAPIPVAMDTTQTQLREAQFGAAVAGFAQWLKDPRYLGNW 652 Query: 537 SWQQIKQWAQQAKGEDPQGYRAEFIRLIELAD 568 S + AQ KG+DP GYR EF++L+ A Sbjct: 653 SLDDALKLAQANKGDDPYGYRTEFVQLVRKAK 684 >UniRef50_C8SEV7 von Willebrand factor type A n=4 Tax=Rhizobiales RepID=C8SEV7_9RHIZ Length = 718 Score = 459 bits (1181), Expect = e-127, Method: Composition-based stats. Identities = 232/569 (40%), Positives = 311/569 (54%), Gaps = 25/569 (4%) Query: 23 PENKESQQQQPSTPTEQQVLAAQQAAIKEAEQSAAAAKALAQQEVQQYSDKQALQGRLQ- 81 S PT LA Q A + A A A++ Q+ + + G Sbjct: 154 DALVAPASPPKSEPTVAGGLAQQNAQGQVAPAEPAPARSGGQRVIMSLTPPPQADGTTSR 213 Query: 82 -----------EAPTFARAAKAKATHIANPGTARYQQFDDNPVKQVAQNPLATFSLDVDT 130 P A A R Q F NPV ++P++TFS+DVDT Sbjct: 214 IARMPAAESKLMTPQQPATAPADQIAPQEENRDRVQDFKTNPVHAALEDPVSTFSIDVDT 273 Query: 131 GSYANVRRFLNQGLLPPPDAVRVEEIVNYFPSDWDIKDKQSIPASKPIPFAMRYELAPAP 190 SY+ VRR L +G +P D VRVEE++NYFP D P S PF + P P Sbjct: 274 ASYSFVRRSLKEGFVPQADTVRVEEMINYFPYD------WKGPDSASTPFNSTVSVMPTP 327 Query: 191 WNEQRTLLKVDILAKDRKSEELPASNLVFLIDTSGSMISDERLPLIQSSLKLLVKELREQ 250 WN L+ V I D K E P +NLVFLID SGSM ++LPL++S+ +LLV +L+ Sbjct: 328 WNTHTKLMHVAIKGFDVKPTEQPKANLVFLIDVSGSMDEPDKLPLLKSAFRLLVSKLKAD 387 Query: 251 DNIAIVTYAGDSRIALPSISGSHKAEINAAIDSLDAEGSTNGGAGLELAYQQATKGFIKG 310 D I+IVTYAGD+ L + K +I AID+L GST G AG++ AY+ A + FIK Sbjct: 388 DTISIVTYAGDAGTVLMPTKIAEKDKILNAIDNLQPGGSTAGEAGIKEAYKLAQQSFIKD 447 Query: 311 GINRILLATDGDFNVGIDDPKSIESMVKKQRESGVTLSTFGVGNSNYNEAMMVRIADVGN 370 G+NR++LATDGDFNVG D ++ +++++R++GV LS FG G N N+ MM IA GN Sbjct: 448 GVNRVMLATDGDFNVGQTDDDDLKRLIEQERKTGVFLSVFGFGRGNLNDEMMQTIAQNGN 507 Query: 371 GNYSYIDTLSEAQKVLNSEMRQMLITVAKDVKAQIEFNPAWVTEYRQIGYEKRQLRVEHF 430 G +YIDTL+EA+KVL + L T+AKDVK Q+EFNP V+EYR IGYE R L E F Sbjct: 508 GTAAYIDTLAEAEKVLVEDASSTLFTIAKDVKIQVEFNPDKVSEYRLIGYETRALNREDF 567 Query: 431 NNDNVDAGDIGAGKHITLLFELTLNGQKA-SIDKLRYAPD-NKLAKSDKTKELAWLKIRW 488 NND VDAGDIG+G +T ++E+T G ID LRY E A++KIR+ Sbjct: 568 NNDRVDAGDIGSGHSVTAIYEITPKGGGGEQIDPLRYGQAGVNNGGVANADEYAFVKIRY 627 Query: 489 KYPQGKESQLVEFPL-----GPTINAPSEDMRFRAAVAAYGQKLRGSEYLNNTSWQQIKQ 543 K P S+L+ P+ + + S D RF AVAA+GQKLR + + +I + Sbjct: 628 KLPNEDVSKLITTPVTSANEVASFDQASTDQRFSVAVAAFGQKLRDEDATAKFGYDKIME 687 Query: 544 WAQQAKGEDPQGYRAEFIRLIELADGVTD 572 A A+G DP GYR+EF+ L+ LA + Sbjct: 688 IATAARGADPFGYRSEFLSLVRLASALGG 716 >UniRef50_B1KPQ5 von Willebrand factor type A n=8 Tax=Gammaproteobacteria RepID=B1KPQ5_SHEWM Length = 640 Score = 458 bits (1179), Expect = e-127, Method: Composition-based stats. Identities = 229/562 (40%), Positives = 343/562 (61%), Gaps = 21/562 (3%) Query: 25 NKESQQQQPSTPTEQQVLAAQ----QAAIKEAEQSAAAAKALAQ---QEVQQYSDKQALQ 77 ES S+P +Q + A EA Q + + + Q ++ + Sbjct: 60 KSESNPIAGSSPNQQSIDTASGIGGTEVQSEASQGEVSVRETYRAAKQASERMKSMKVHS 119 Query: 78 GRLQEAPTFARAAKAKATH-IANPGTARYQQFDDNPVKQVAQNPLATFSLDVDTGSYANV 136 A + ++ + ++++ N + + P++TFS+DVDTGSY+ + Sbjct: 120 RPESFALMGLPSRPSQDIYLPELQNRDKFERQVANGIMVAGEIPVSTFSIDVDTGSYSTL 179 Query: 137 RRFLNQGLLPPPDAVRVEEIVNYFPSDWDIKDKQSIPASKPIPFAMRYELAPAPWNEQRT 196 RR +N G+LP VRVEE++NYF + P + PF++ ELAP+P+N + Sbjct: 180 RRSINHGVLPERGTVRVEELINYFAYQYP------APDAGEQPFSVNTELAPSPYNPHKM 233 Query: 197 LLKVDILAKDRKSEELPASNLVFLIDTSGSMISDERLPLIQSSLKLLVKELREQDNIAIV 256 LL++ + +++ +L AS LVFL+D SGSM S ++LPL++++LK+L ++L E D I+IV Sbjct: 234 LLRIGLKGFEKEKADLGASQLVFLLDVSGSMSSQDKLPLLKNALKMLSQQLDEGDRISIV 293 Query: 257 TYAGDSRIALPSISGSHKAEINAAIDSLDAEGSTNGGAGLELAYQQATKGFIKGGINRIL 316 YAG S + L + G+ I+ A+D L A GSTNGGAG+ELAYQ A K FI GG+NR++ Sbjct: 294 VYAGASGVVLDGVKGNDTLAISQALDKLKAGGSTNGGAGIELAYQLAQKHFIAGGVNRVI 353 Query: 317 LATDGDFNVGIDDPKSIESMVKKQRESGVTLSTFGVGNSNYNEAMMVRIADVGNGNYSYI 376 LATDGDFNVG+ D +++E M++++R+ G+ L+T G G NYN+ +M ++AD GNG+Y+YI Sbjct: 354 LATDGDFNVGVSDQQALEDMIEEKRKQGIALTTLGFGQGNYNDHLMEQLADKGNGHYAYI 413 Query: 377 DTLSEAQKVLNSEMRQMLITVAKDVKAQIEFNPAWVTEYRQIGYEKRQLRVEHFNNDNVD 436 DTL+EA+KVL E+ L+T+AKDVK QIEFNPA V+EYR IGYE R L E FNND VD Sbjct: 414 DTLNEARKVLVDEISATLLTIAKDVKVQIEFNPALVSEYRLIGYENRALNREDFNNDKVD 473 Query: 437 AGDIGAGKHITLLFELT-LNGQKASIDKLRYAPDNKLAKSD-KTKELAWLKIRWKYPQGK 494 AG+IGAG +T L+EL+ ++ + D LRY D K K ELA+LK+R+K + Sbjct: 474 AGEIGAGHRVTALYELSFVDSPNQANDVLRYGLDIKTGKEKYSRDELAYLKLRYKPIGQE 533 Query: 495 ESQLVEFPLGPT-----INAPSEDMRFRAAVAAYGQKLRGSEYLNNTSWQQIKQWAQQAK 549 +S+L+ +P+ + S+D RF AAVA +GQ + S YL++ + ++ AQ A Sbjct: 534 KSKLISYPVLTSTAINEFAQASDDFRFAAAVAGFGQLINHSHYLHDMDYAKVSDIAQAAM 593 Query: 550 GEDPQGYRAEFIRLIELADGVT 571 GED GYR EF++L + A + Sbjct: 594 GEDSFGYRHEFVQLTKTAGLLA 615 >UniRef50_B0CCM8 von Willebrand factor, type A domain protein n=4 Tax=Cyanobacteria RepID=B0CCM8_ACAM1 Length = 686 Score = 455 bits (1171), Expect = e-126, Method: Composition-based stats. Identities = 221/567 (38%), Positives = 322/567 (56%), Gaps = 23/567 (4%) Query: 22 QPENKESQQQQPSTPTEQQVLAAQQAAIKEAEQSAAAAKALAQQEVQQYSDKQALQGRLQ 81 E + +Q++ + + + A A A+ Q + Sbjct: 117 ATELSQVRQRRKPSNRRFGISPRRPRPTGLPPALTKAQPAPAETAAQSQFSRDQSGRMKS 176 Query: 82 EAPTFARAAKAKATH---------IANPGTARYQQFDDNPVKQVAQNPLATFSLDVDTGS 132 AP A A T Y++ ++NP + PL+TFS+DVDT S Sbjct: 177 VAPPAGLAPPAPEPRFQDKDRLHLPGTFNTEDYKRINENPFFLPQRTPLSTFSIDVDTAS 236 Query: 133 YANVRRFLNQGLLPPPDAVRVEEIVNYFPSDWDIKDKQSIPASKPIPFAMRYELAPAPWN 192 Y+NVRRF+ QG LPP DAVR+EE++NYF + PF++ E+A APWN Sbjct: 237 YSNVRRFIRQGQLPPKDAVRLEELINYFDYGY-------ASPKGDQPFSVSTEVATAPWN 289 Query: 193 EQRTLLKVDILAKDRKSEELPASNLVFLIDTSGSMISDERLPLIQSSLKLLVKELREQDN 252 Q L+ + + K+ + E+ SNLVFLID SGSM +L L++ SL LLV +L+ +D Sbjct: 290 NQHKLVHIGLKGKELEKEQ--PSNLVFLIDVSGSMKRPNKLALVKKSLCLLVHQLKPEDR 347 Query: 253 IAIVTYAGDSRIALPSISGSHKAEINAAIDSLDAEGSTNGGAGLELAYQQATKGFIKGGI 312 +++V YAG + I LPS G+ KA I AID L+A GST G AG+++AY A + F+K G Sbjct: 348 VSLVVYAGRAGIVLPSTPGTQKATIMNAIDRLEAGGSTAGAAGIKMAYDMAERHFLKNGN 407 Query: 313 NRILLATDGDFNVGIDDPKSIESMVKKQRESGVTLSTFGVGNSNYNEAMMVRIADVGNGN 372 NR++LATDGDFNVG +E +++++R+ GV L+ G G NY + M +A+ GNGN Sbjct: 408 NRVILATDGDFNVGQSSDAELERLIEQKRDRGVFLTVLGYGTGNYKDNKMELLANKGNGN 467 Query: 373 YSYIDTLSEAQKVLNSEMRQMLITVAKDVKAQIEFNPAWVTEYRQIGYEKRQLRVEHFNN 432 Y+YIDTL EAQKVL +++R L T+AKDVK Q+EFNP V YR IGYE R LR + FN+ Sbjct: 468 YAYIDTLLEAQKVLVNDLRGTLFTIAKDVKIQVEFNPGKVQAYRLIGYENRLLRDQDFND 527 Query: 433 DNVDAGDIGAGKHITLLFELTLNGQKASIDKLRYAPDNKLAKSDKTK--ELAWLKIRWKY 490 D DAG+IG+G IT L+E+ G K+ ++ P + + +L LK+R+K Sbjct: 528 DRKDAGEIGSGHTITALYEVIPTGVKSDVELPDIDPLKFQKPTASSNSSDLMNLKLRYKQ 587 Query: 491 PQGKESQLVEFPLGP---TINAPSEDMRFRAAVAAYGQKLRGSEYLNNTSWQQIKQWAQQ 547 P G +SQL+ + +I + +++++F AAVA YG LR S+Y ++ Q+ A Q Sbjct: 588 PTGSKSQLISTAIADKNRSIQSATDNLKFSAAVAMYGMVLRDSDYKGKATFNQVLDLADQ 647 Query: 548 AKGEDPQGYRAEFIRLIELADGVTDIS 574 AKG+DPQGYR F++L+E + + Sbjct: 648 AKGKDPQGYRMAFMQLVERSQTLQQAK 674 >UniRef50_B0T5X0 von Willebrand factor type A n=1 Tax=Caulobacter sp. K31 RepID=B0T5X0_CAUSK Length = 592 Score = 454 bits (1168), Expect = e-126, Method: Composition-based stats. Identities = 227/564 (40%), Positives = 311/564 (55%), Gaps = 20/564 (3%) Query: 18 GCGPQPENKESQQQQPSTPTEQQVLAAQQAAIKEAEQSAAAAKALAQQEVQQYSDKQALQ 77 C + S +++ + + A Sbjct: 41 ACAALGYAAPAPDGYDSVVVTATKRTSREQRLTSRARPVIATPG----LTPPPPPPPPSP 96 Query: 78 GRLQEAPTFARAAKAKATHIANP--GTARYQQFDDNPVKQVAQNPLATFSLDVDTGSYAN 135 A +FA + A + A P T +Y NPVK+VA+ P++TFS+DVDT +YAN Sbjct: 97 PPPPAAYSFAAPSPVVAPNFAPPIRDTEKYPGAAANPVKRVAEEPVSTFSIDVDTAAYAN 156 Query: 136 VRRFLNQGLLPPPDAVRVEEIVNYFPSDWDIKDKQSIPASKPIPFAMRYELAPAPWNEQR 195 VRRFLN+G PP DA+RVEE++NYF + + P ++ PF + P+PW++ R Sbjct: 157 VRRFLNEGAAPPHDALRVEELINYFDYGY------ARPTAQEPPFKPTVTVVPSPWSQDR 210 Query: 196 TLLKVDILAKDRKSEELPASNLVFLIDTSGSMISDERLPLIQSSLKLLVKELREQDNIAI 255 L+ + + P NLVFLIDTSGSM +RLPL + +L +L+ +LR QD +++ Sbjct: 211 QLMHIGVQGYATPRAGQPPLNLVFLIDTSGSMSGPDRLPLAKKALNVLIDQLRPQDRVSM 270 Query: 256 VTYAGDSRIALPSISGSHKAEINAAIDSLDAEGSTNGGAGLELAYQQATKGFIKGGINRI 315 V YAG + L G K ++ A+ +L + GST GG GLELAY A + +NR+ Sbjct: 271 VAYAGSAGAVLSPTDGKSKLKMRCALTALRSGGSTAGGQGLELAYALARQNLDPKAVNRV 330 Query: 316 LLATDGDFNVGIDDPKSIESMVKKQRESGVTLSTFGVGNSNYNEAMMVRIADVGNGNYSY 375 +L TDGDFNVGI DP ++ V QR+SGV LS +G G NYN+ MM +A GNG +Y Sbjct: 331 ILMTDGDFNVGIADPTRLKDFVADQRKSGVYLSVYGFGRGNYNDTMMQALAQNGNGTAAY 390 Query: 376 IDTLSEAQKVLNSEMRQMLITVAKDVKAQIEFNPAWVTEYRQIGYEKRQLRVEHFNNDNV 435 +D L EA+K+L + L +A DVK Q+EFNPA V+EYR IGYE R L E FNND V Sbjct: 391 VDGLQEARKLLRDDFDSALFPIADDVKIQVEFNPAKVSEYRLIGYETRLLNREDFNNDQV 450 Query: 436 DAGDIGAGKHITLLFELTLNGQKASIDKLRYAPDNKLAKSDKTKELAWLKIRWKYPQGKE 495 DAG+IG+G +T ++E+T G K S D LRY A ELA+LKIR+K P G Sbjct: 451 DAGEIGSGAAVTAIYEITPVGAKPSSDPLRYGAKPSPATGGS--ELAFLKIRYKPPGGST 508 Query: 496 SQLVEFPL-----GPTINAPSEDMRFRAAVAAYGQKLRGSEYL-NNTSWQQIKQWAQQAK 549 S+L+E P+ ++ A E RF AVAAYGQKLRG ++ + W + AQ A+ Sbjct: 509 SKLIERPIGAGDMHASLAAAPEATRFAVAVAAYGQKLRGDPWVDASFDWDAVTALAQGAR 568 Query: 550 GEDPQGYRAEFIRLIELADGVTDI 573 GEDP G RAEF++L A V Sbjct: 569 GEDPYGLRAEFVQLTRAAKDVKGS 592 >UniRef50_A0NVX5 Putative uncharacterized protein n=1 Tax=Labrenzia aggregata IAM 12614 RepID=A0NVX5_9RHOB Length = 608 Score = 453 bits (1166), Expect = e-126, Method: Composition-based stats. Identities = 229/575 (39%), Positives = 322/575 (56%), Gaps = 15/575 (2%) Query: 3 NKNIIMLLMSSLILSGCGPQPENKESQQQQPSTPTEQQVLAAQQAAIKEAEQSAAAAKAL 62 K + ++ ++ G + + E + A+ A Q A A Sbjct: 36 GKKLTDTTEAAGRMTSSGKPDGDASVSTAELPKQEETHTVVAEIARPVATPQPAPAPALP 95 Query: 63 AQQEVQQYSDKQALQGRLQEAPTFARAAKAKATHIANPG-----TARYQQFDDNPVKQVA 117 +Q + L A + + A P R+ + NP+++ + Sbjct: 96 QKQRSRSDGAGGGLMTFSSGAGGAVLNSGIQLEPPAMPAVQLEDRERFASAEANPLRRTS 155 Query: 118 QNPLATFSLDVDTGSYANVRRFLNQGLLPPPDAVRVEEIVNYFPSDWDIKDKQSIPASKP 177 +P++TFS+DVDT SY+ VR L+ G LP PDAVRVEE+VNYF ++ + P Sbjct: 156 ADPVSTFSVDVDTASYSYVRSTLSGGRLPNPDAVRVEEMVNYFDYNYPV------PEKGG 209 Query: 178 IPFAMRYELAPAPWNEQRTLLKVDILAKDRKSEELPASNLVFLIDTSGSMISDERLPLIQ 237 PF+ + PWNE L++V I ++LP+ NLVFLIDTSGSM +LPL+Q Sbjct: 210 HPFSTNVSVVDTPWNEHTKLMQVGIQGYKVPLDDLPSQNLVFLIDTSGSMADANKLPLLQ 269 Query: 238 SSLKLLVKELREQDNIAIVTYAGDSRIALPSISGSHKAEINAAIDSLDAEGSTNGGAGLE 297 S +LL+ LR++D +AIVTYAG S + L + K I I++L + GST G GL+ Sbjct: 270 QSFRLLLSSLRDEDEVAIVTYAGSSGVLLEPTKVADKTRILEKINALTSGGSTAGHEGLK 329 Query: 298 LAYQQATKGFIKGGINRILLATDGDFNVGIDDPKSIESMVKKQRESGVTLSTFGVGNSNY 357 AY A G RI+LATDGDFNVG+ DP S++ V +QRE+G LS G G NY Sbjct: 330 GAYALAETMTGDGEQTRIILATDGDFNVGLSDPDSLKRYVAEQRENGTALSVLGFGRGNY 389 Query: 358 NEAMMVRIADVGNGNYSYIDTLSEAQKVLNSEMRQMLITVAKDVKAQIEFNPAWVTEYRQ 417 N+ +M +A G G +YIDTLSEA+KVL ++ + +A+DVK Q+EFNP V EYR Sbjct: 390 NDELMQTLAQNGQGVAAYIDTLSEARKVLVDQVVSSISMIAQDVKIQVEFNPETVAEYRL 449 Query: 418 IGYEKRQLRVEHFNNDNVDAGDIGAGKHITLLFELTLNGQKAS-IDKLRYAPDN--KLAK 474 IGYE R LR E F ND VDAGDIGAG ++T L+E+T G A LRY P + + Sbjct: 450 IGYETRALRTEDFKNDKVDAGDIGAGHNVTALYEITPVGSPAEKFSDLRYGPKEKIEAVR 509 Query: 475 SDKTKELAWLKIRWKYPQGKESQLVEFPL-GPTINAPSEDMRFRAAVAAYGQKLRGSEYL 533 + ELA++K+R+K P KES LVE P+ T+ P + F A+VAA+GQKL+G++YL Sbjct: 510 TSYAGELAFVKLRYKLPGDKESTLVETPVMEDTVGIPKSETLFAASVAAFGQKLKGTDYL 569 Query: 534 NNTSWQQIKQWAQQAKGEDPQGYRAEFIRLIELAD 568 + ++ I++ A KG DP GYR+EF+ L+ LAD Sbjct: 570 GDWDFKAIEKLASDNKGTDPFGYRSEFLTLVRLAD 604 >UniRef50_UPI00016C54C8 hypothetical protein GobsU_21830 n=1 Tax=Gemmata obscuriglobus UQM 2246 RepID=UPI00016C54C8 Length = 638 Score = 453 bits (1165), Expect = e-126, Method: Composition-based stats. Identities = 207/559 (37%), Positives = 307/559 (54%), Gaps = 11/559 (1%) Query: 21 PQPENKESQQQQPSTPTEQQVLAAQQAAIKEAEQSAAAAKALAQQEVQQYSDKQALQGRL 80 PQ + ++ P A + E A + + Sbjct: 82 PQGDIAPIGKKGEHAPGPALPGLPSPAPLAEPAMPAGPDALGKRIAASSAPFASVVASGG 141 Query: 81 QEAPTFARAAKAKATHIA--NPGTARYQQFDDNPVKQVAQNPLATFSLDVDTGSYANVRR 138 AA A Y ++ +N + L+TFS DV+T SYANVRR Sbjct: 142 GAFGGVRGAANKPAPRDGYNAQNAEAYGRYQENEFRSPLVAALSTFSADVNTASYANVRR 201 Query: 139 FLNQGLLPPPDAVRVEEIVNYFPSDWDIKDKQSIPASKPIPFAMRYELAPAPWNEQRTLL 198 LN+G LPP AV + E VNYFP + + P + P A E+ P PWN + LL Sbjct: 202 MLNEGTLPPASAVFLAEFVNYFPYSY------APPPAGADPVAFHVEMGPCPWNAKHHLL 255 Query: 199 KVDILAKDRKSEELPASNLVFLIDTSGSMISDERLPLIQSSLKLLVKELREQDNIAIVTY 258 +V + A +E+LP NLVFL+DTSGSM + RLPL+Q SL+LLV++L E+D +++VTY Sbjct: 256 RVGVQAHQIPAEKLPPRNLVFLVDTSGSMQQENRLPLVQKSLELLVEKLTEKDRVSVVTY 315 Query: 259 AGDSRIALPSISGSHKAEINAAIDSLDAEGSTNGGAGLELAYQQATKGFIKGGINRILLA 318 AGDSR+ALP SG+ K I + L A G TNG G++ AYQ A F+ GG+NR++L Sbjct: 316 AGDSRVALPPTSGADKKAILDVVTGLQANGGTNGEGGIKKAYQFARDTFLDGGVNRVILC 375 Query: 319 TDGDFNVGIDDPKSIESMVKKQRESGVTLSTFGVGNSNYNEAMMVRIADVGNGNYSYIDT 378 TDGDFNVG+ D + ++++QR+S V L+ G G NY + + +A+ GNG+++YIDT Sbjct: 376 TDGDFNVGVVDNGELVKLIEEQRKSKVFLTVLGYGMGNYKDDRLKELANHGNGHHAYIDT 435 Query: 379 LSEAQKVLNSEMRQMLITVAKDVKAQIEFNPAWVTEYRQIGYEKRQLRVEHFNNDNVDAG 438 L EA+KV + L+ VAKDVK QI+FNPA V YR +GYE R L+ E F ND DAG Sbjct: 436 LDEAKKVFVEQ-GGALVCVAKDVKFQIDFNPAKVNAYRLVGYENRLLKDEDFKNDAKDAG 494 Query: 439 DIGAGKHITLLFELTLNGQKASIDKLRYA-PDNKLAKSDKTKELAWLKIRWKYPQGKESQ 497 D+G+G +T+L+E+ G K + ++ + K ++ + E +K+R+K+P S+ Sbjct: 495 DVGSGHQVTVLYEIVPPGVKVDLPEVDASKYQKKDVPANASDEWLTVKMRYKHPDEDVSK 554 Query: 498 LVEFP-LGPTINAPSEDMRFRAAVAAYGQKLRGSEYLNNTSWQQIKQWAQQAKGEDPQGY 556 + G S+D RF AAVA++G LR S++ ++ + + AQ A G DP + Sbjct: 555 ELTAAHKGAVAKELSDDFRFAAAVASFGMLLRDSKFKGAMTYAGVLEEAQGALGADPNNH 614 Query: 557 RAEFIRLIELADGVTDISQ 575 R +F+ L+ A ++++ + Sbjct: 615 RKQFLELVRRAKELSNVQK 633 >UniRef50_A1ZHE2 Von Willebrand factor type A domain protein n=1 Tax=Microscilla marina ATCC 23134 RepID=A1ZHE2_9SPHI Length = 704 Score = 452 bits (1163), Expect = e-125, Method: Composition-based stats. Identities = 212/574 (36%), Positives = 323/574 (56%), Gaps = 8/574 (1%) Query: 1 MRNKNIIMLLMSSLILSGCGPQPENKESQQQQPSTPTEQQVLAAQQAAIKEAEQSAAAAK 60 MR K + + + + + + +P T Q++ Q + Sbjct: 127 MRQKADTLASKLDTQVVWLSKEFIDLDLPKFEPLTKVYQKLSFYQAFFAHDHLGEVLVTL 186 Query: 61 ALAQQEVQQYSDKQALQGRLQEAPTFARAAKAKATHIANPGTARYQQFDDNPVKQVAQNP 120 Q V +Y + + + + + + A A P RY +N QV QNP Sbjct: 187 TTFQATVLRYQSEALKKLGAGDLSSDLKFDRKAAFRNAFPEGERYATIYENQFYQVGQNP 246 Query: 121 LATFSLDVDTGSYANVRRFLNQGLLPPPDAVRVEEIVNYFPSDWDIKDKQSIPAS--KPI 178 L+TFS+DVD SY+NVRRF+N G P +AVRVEE++NYF D+ + Sbjct: 247 LSTFSIDVDNASYSNVRRFVNDGQPLPKNAVRVEEMINYFEYDYPQPTPTKDKEGKLQTH 306 Query: 179 PFAMRYELAPAPWNEQRTLLKVDILAKDRKSEELPASNLVFLIDTSGSMISDERLPLIQS 238 PF++ E PWN LL++ + ++ +++ +NLVFL+D SGSM S+++LPL++ Sbjct: 307 PFSVNTEYGTCPWNPHHKLLQIGLQGENLQTKNASPANLVFLVDASGSMDSEDKLPLLKR 366 Query: 239 SLKLLVKELRE-QDNIAIVTYAGDSRIALPSISGSHKAEINAAIDSLDAEGSTNGGAGLE 297 S K+L+K+L + + IAIV YAG S + LP+ S SH+ +I A++++++ GST GG G+E Sbjct: 367 SFKVLLKQLTDSRTKIAIVAYAGASGLVLPATSVSHREKILTALENIESGGSTAGGEGIE 426 Query: 298 LAYQQATKGFIKGGINRILLATDGDFNVGIDDPKSIESMVKKQRESGVTLSTFGVGNSNY 357 LAY+ A + FI GG NR++LATDGDFNVG+ + + ++ +R+SGV L+ G G N Sbjct: 427 LAYKIAQQAFIAGGNNRVILATDGDFNVGLSSDEELMQLISNKRKSGVYLTCLGFGTGNL 486 Query: 358 NEAMMVRIADVGNGNYSYIDTLSEAQKVLNSEMRQMLITVAKDVKAQIEFNPAWVTEYRQ 417 N++MM ++ + GNGNY YID ++EA+KVL + L +AKDVK Q+EFNPA V YR Sbjct: 487 NDSMMEKLTNAGNGNYYYIDGINEAKKVLAKNLTGTLYAIAKDVKIQLEFNPARVKSYRL 546 Query: 418 IGYEKRQLRVEHFNNDNVDAGDIGAGKHITLLFELTL----NGQKASIDKLRYAPDNKLA 473 +GYE R L+ F ND VDAG++G G +T L+E+ A L+Y + Sbjct: 547 VGYENRVLKHRDFKNDQVDAGELGVGHTVTALYEIVPVNRTQPMLADEIPLKYQTTQIDS 606 Query: 474 KSDKTKELAWLKIRWKYPQGKESQLVEFPLG-PTINAPSEDMRFRAAVAAYGQKLRGSEY 532 + EL +K+R+K P+ +S+L+E + + S + +F VAA+G +LR S Y Sbjct: 607 AALANNELVTIKLRYKRPKENKSRLIEKVVKNKLVTQTSNNFKFATTVAAFGMRLRNSPY 666 Query: 533 LNNTSWQQIKQWAQQAKGEDPQGYRAEFIRLIEL 566 + NTS+QQI W Q AK D GYR EF+ L++ Sbjct: 667 VGNTSYQQIYSWGQYAKSVDSNGYRREFLELVKK 700 >UniRef50_C1AC65 Putative uncharacterized protein n=1 Tax=Gemmatimonas aurantiaca T-27 RepID=C1AC65_GEMAT Length = 642 Score = 443 bits (1138), Expect = e-122, Method: Composition-based stats. Identities = 227/579 (39%), Positives = 326/579 (56%), Gaps = 19/579 (3%) Query: 2 RNKNIIMLLMSSLILSGCGPQPENKESQQQQPSTPTEQQVLAAQQAAIKEAEQSAAAAKA 61 R K ++ L S +++G P Q+ T + A+Q A + A A Sbjct: 67 RLKQSVIAL--SGVVTGMKPTVTGVAEQKADNFNRTRETANASQGAEVTRTAAPAIAPSP 124 Query: 62 LAQQEVQQYSDKQALQGRLQEAPTFARAAKAKATHIANP---GTARYQQFDDNPVKQVAQ 118 Q +++ +P A + +A+ +Y + +DNP V Sbjct: 125 APQTRGVAGGMARSVGMPAPASPRRAASDEARPPRPYPGQPGNREQYDRIEDNPFLGVTG 184 Query: 119 NPLATFSLDVDTGSYANVRRFLNQGLLPPPDAVRVEEIVNYFPSDWDIKDKQSIPASKPI 178 NPL+TFS+DVD SY N RRFL G PP DAVR+EE++NYFP + Sbjct: 185 NPLSTFSIDVDRASYGNARRFLQDGQRPPADAVRIEELINYFPY-------ELREPRGND 237 Query: 179 PFAMRYELAPAPWNEQRTLLKVDILAKDRKSEELPASNLVFLIDTSGSMISDERLPLIQS 238 P A+ E+ APW + L+++ + ++ ++ LP +NLVFLID SGSM S ++LPL++ Sbjct: 238 PVAITTEVTTAPWQPRHQLVRIALQSRRIETASLPPNNLVFLIDVSGSMQSPDKLPLVKQ 297 Query: 239 SLKLLVKELREQDNIAIVTYAGDSRIALPSISGSHKAEINAAIDSLDAEGSTNGGAGLEL 298 SL+LLV ++R QD +AIV YAG + + LPS SG K I AI+ L+A GST GGAG+EL Sbjct: 298 SLRLLVDQMRPQDRVAIVAYAGAAGLVLPSTSGDEKETIIQAIERLEAGGSTAGGAGIEL 357 Query: 299 AYQQATKGFIKGGINRILLATDGDFNVGIDDPKSIESMVKKQRESGVTLSTFGVGNSNYN 358 AY+ A + F+ G NR++LA+DGDFNVG+ +E +++++R G L+ G G NY Sbjct: 358 AYRTAREHFMDHGNNRVILASDGDFNVGVSSDGELERLIERKRTEGTYLTILGFGTGNYQ 417 Query: 359 EAMMVRIADVGNGNYSYIDTLSEAQKVLNSEMRQMLITVAKDVKAQIEFNPAWVTEYRQI 418 +A M ++A GNGNY Y+D ++EA+K+L EM L+TVA DVK Q+EFNP V YR I Sbjct: 418 DAKMEKLAKRGNGNYGYVDDIAEARKMLVREMGATLLTVANDVKLQVEFNPRRVQAYRLI 477 Query: 419 GYEKRQLRVEHFNNDNVDAGDIGAGKHITLLFELTLNG-------QKASIDKLRYAPDNK 471 GYE R LR E F +D DAGD+GAG +T L+E+ G Q + Sbjct: 478 GYEDRLLRTEDFTDDRKDAGDLGAGHQVTALYEIVPVGVQGTVRLQDTEARRYEPVTGEA 537 Query: 472 LAKSDKTKELAWLKIRWKYPQGKESQLVEFPLGPTINAPSEDMRFRAAVAAYGQKLRGSE 531 + + + EL ++K+R+K P S+L+ P+ S+DMRF ++VAA+G LR S Sbjct: 538 RSSTATSDELLFVKLRYKRPGESTSRLITHPVPARTVRGSDDMRFASSVAAFGMLLRESP 597 Query: 532 YLNNTSWQQIKQWAQQAKGEDPQGYRAEFIRLIELADGV 570 Y NTS Q+ + A+ A GED GYRAEFIRL+E + Sbjct: 598 YAGNTSAAQVLEQARAALGEDDGGYRAEFIRLVERYRSI 636 >UniRef50_C8N8N5 von Willebrand factor type A domain protein n=2 Tax=Gammaproteobacteria RepID=C8N8N5_9GAMM Length = 563 Score = 440 bits (1132), Expect = e-122, Method: Composition-based stats. Identities = 231/570 (40%), Positives = 328/570 (57%), Gaps = 19/570 (3%) Query: 6 IIMLLMSSLILSGCGPQPENKESQQQQPSTPTEQQVLAAQQAAIKEAEQSAAAAKALAQQ 65 +I+ L+++ +S ++ +S + P E +A A A A L+ + Sbjct: 6 LILALLAASGISHAAGLCDDLDS----DAPPVEYAARSAPVLQKAAASPQAQAIPDLSNR 61 Query: 66 EVQQYSDKQALQGRLQEAPTFARAAKAKATHIANPGTARYQQFDDNPVKQVAQNPLATFS 125 + + AR + RY D NPV +V+ P++TFS Sbjct: 62 LGVNMPVPAGANPQWALNKSVARMGIRGT--VEVQNRERYAHSDANPVHRVSDAPVSTFS 119 Query: 126 LDVDTGSYANVRRFL-NQGLLPPPDAVRVEEIVNYFPSDWDIKDKQSIPASKPIPFAMRY 184 +DVDTGSY+N+RR L + LPP DAVRVEEI+NYF + + PFA+ Sbjct: 120 IDVDTGSYSNIRRMLTRENRLPPADAVRVEEILNYFAYGYPLPQ-------DGKPFAVHT 172 Query: 185 ELAPAPWNEQRTLLKVDILAKDRKSEELPASNLVFLIDTSGSMISDERLPLIQSSLKLLV 244 + +PW L+++ I A D E+ P +NLVFLIDTSGSM ++LPL++ ++ Sbjct: 173 QTVDSPWQADAKLIRIAIQAADLAPEKRPPANLVFLIDTSGSMDDPDKLPLVKKTVCHFA 232 Query: 245 KELREQDNIAIVTYAGDSRIALPSISGSHKAEINAAIDSLDAEGSTNGGAGLELAYQQAT 304 + LR D I+++TY+G + LP +G K I AA+ L A G+T GG L +AY A Sbjct: 233 EALRADDRISLITYSGSTAEILPPTAGDQKETIIAALKPLRAHGATAGGEALRMAYDAAA 292 Query: 305 KGFIKGGINRILLATDGDFNVGIDDPKSIESMVKKQRESGVTLSTFGVGNSNYNEAMMVR 364 K + K GINRILLATDGDFNVGI DP ++++ V +R+SG++L+T G G+ NYN+ MM + Sbjct: 293 KNYRKDGINRILLATDGDFNVGISDPATLKNYVADKRKSGISLTTLGYGSGNYNDEMMEQ 352 Query: 365 IADVGNGNYSYIDTLSEAQKVLNSEMRQMLITVAKDVKAQIEFNPAWVTEYRQIGYEKRQ 424 +AD G+GNYSYID+ +EA+KVL ++ L TVA+D+K Q+EFNPA V EYR +GYE R Sbjct: 353 LADAGDGNYSYIDSEAEAKKVLVRQLTSTLATVARDIKIQLEFNPAAVKEYRLVGYENRL 412 Query: 425 LRVEHFNNDNVDAGDIGAGKHITLLFELTLNGQKASIDKLRYAPDNKLAKSDKTKELAWL 484 LR E FNND VDAGDIGAG +IT L+E+ G+ +D Y N A S K E WL Sbjct: 413 LREEDFNNDRVDAGDIGAGHNITALYEIIPQGKTGWLDARHY--QNAPAASGKADEYGWL 470 Query: 485 KIRWKYPQGKESQLVEFPLGP---TINAPSEDMRFRAAVAAYGQKLRGSEYLNNTSWQQI 541 K+R+K P+ ++SQL+E P+ + E RF A A+Y Q L+G +Y W I Sbjct: 471 KLRYKAPESEQSQLIEQPIAAKSIPLADAEEATRFAIAAASYAQALKGGKYNGALDWAGI 530 Query: 542 KQWAQQAKGEDPQGYRAEFIRLIELADGVT 571 + AQ A+G DP RA ++LIE A ++ Sbjct: 531 LRLAQAAQGSDPYDERAGLLQLIEKARELS 560 >UniRef50_A3PN61 von Willebrand factor, type A n=9 Tax=Rhodobacteraceae RepID=A3PN61_RHOS1 Length = 651 Score = 440 bits (1131), Expect = e-122, Method: Composition-based stats. Identities = 213/567 (37%), Positives = 309/567 (54%), Gaps = 16/567 (2%) Query: 14 LILSGCGPQPENKESQQQQPSTPTEQQVLAAQQAAIKEAEQSAAAAKALAQQEVQQYSDK 73 L L P E Q P P L A A AE + A A + + + Sbjct: 91 LALVVVMPNARLAEPPQTAPDAPEADARLTAAPEAGGGAETAGAPVPAEPRARSAEGAAP 150 Query: 74 QALQGRLQEAPTFAR---------AAKAKATHIANPGTARYQQFDDNPVKQVAQNPLATF 124 Q AA+A A + + + DNP++ A++P++TF Sbjct: 151 QTFAADEAMPMAAPPAPDLALSKQAAEAPARALPQGDSEAFANAPDNPLRVTAEDPVSTF 210 Query: 125 SLDVDTGSYANVRRFLNQGLLPPPDAVRVEEIVNYFPSDWDIKDKQSIPASKPIPFAMRY 184 S+DVDT SYA +R L G LPP +AVR+EE++NYFP D+ P + PF Sbjct: 211 SIDVDTASYAILRSSLRAGQLPPREAVRIEEMINYFPYDYP------APENGTPPFRPTL 264 Query: 185 ELAPAPWNEQRTLLKVDILAKDRKSEELPASNLVFLIDTSGSMISDERLPLIQSSLKLLV 244 + PWN + L+ V + + E+ P NLVFLIDTSGSM +LPL++ S L++ Sbjct: 265 SITRTPWNPETRLVHVALQGRMPAIEDRPPLNLVFLIDTSGSMQDPAKLPLLKQSFGLML 324 Query: 245 KELREQDNIAIVTYAGDSRIALPSISGSHKAEINAAIDSLDAEGSTNGGAGLELAYQQAT 304 LR +D +AIVTYAG + L + + ++ I +A+D LDA GST G GL LAY+ A+ Sbjct: 325 GRLRPEDQVAIVTYAGSAGEVLAPTAANQRSTILSALDRLDAGGSTAGDEGLALAYRTAS 384 Query: 305 KGFIKGGINRILLATDGDFNVGIDDPKSIESMVKKQRESGVTLSTFGVGNSNYNEAMMVR 364 + G + R++LATDGDFN+GI DP+ + +V +R++GV LS G G N ++A M Sbjct: 385 EMAGAGEVTRVVLATDGDFNLGISDPEELARLVAHERDTGVYLSVLGFGRGNLDDATMQA 444 Query: 365 IADVGNGNYSYIDTLSEAQKVLNSEMRQMLITVAKDVKAQIEFNPAWVTEYRQIGYEKRQ 424 +A GNG +YID+L+EAQKVL ++ L +A DVK Q+E++PA V EYR IGYE R Sbjct: 445 LAQNGNGQAAYIDSLNEAQKVLVDQLSGALFPIADDVKVQVEWSPARVAEYRLIGYETRG 504 Query: 425 LRVEHFNNDNVDAGDIGAGKHITLLFELTLNGQKASIDKLRYAPDNKLAKSDKTKELAWL 484 LR E F ND VDAG+IGAG +T ++E+T + + + EL +L Sbjct: 505 LRREDFANDRVDAGEIGAGHSVTAIYEIT-PVDSPARLTDPLRYGAEPPEGAHGDELGFL 563 Query: 485 KIRWKYPQGKESQLVEFPLGPTINAPSEDMRFRAAVAAYGQKLRGSEYLNNTSWQQIKQW 544 ++R+K P S L++ P+ + SED+RF A+A +G+ LRGS+ L W + Sbjct: 564 RLRYKAPGESTSTLIDTPIPDMLTEASEDVRFSTAIAGFGELLRGSDKLGAWGWDEAIAL 623 Query: 545 AQQAKGEDPQGYRAEFIRLIELADGVT 571 A A+G DP GYR E ++L+ LA+ ++ Sbjct: 624 ADGARGADPFGYRVEAVQLMRLAESLS 650 >UniRef50_C0YQB8 von Willebrand factor, type A n=2 Tax=Flavobacteriaceae RepID=C0YQB8_9FLAO Length = 800 Score = 438 bits (1125), Expect = e-121, Method: Composition-based stats. Identities = 197/480 (41%), Positives = 286/480 (59%), Gaps = 12/480 (2%) Query: 97 IANPGTARYQQFDDNPVKQVAQNPLATFSLDVDTGSYANVRRFLNQGLLPPPDAVRVEEI 156 Y F +NP + PL+TFS+DVD SY+NVRR +N G + +AVR+EE+ Sbjct: 327 PVTQNNESYDAFVENPFELTRNQPLSTFSIDVDNASYSNVRRMINNGQVVDKNAVRIEEM 386 Query: 157 VNYFPSDWDIKDKQSIPASKPIPFAMRYELAPAPWNEQRTLLKVDILAKDRKSEELPASN 216 VNYF D+ ++ PF++ E + APWN + LLK+ + K+ ++LPASN Sbjct: 387 VNYFKYDYPQPKNEN-------PFSINTEYSDAPWNPKHKLLKIGLQGKNLPMDKLPASN 439 Query: 217 LVFLIDTSGSMISDERLPLIQSSLKLLVKELREQDNIAIVTYAGDSRIALPSISGSHKAE 276 LVFLID SGSM + +LPL++SS K+L+ +LR +D + IV YAG + + LP S K + Sbjct: 440 LVFLIDVSGSMSDENKLPLLKSSFKVLLNQLRPKDKVGIVVYAGSAGMVLPPTSAGEKDK 499 Query: 277 INAAIDSLDAEGSTNGGAGLELAYQQATKGFIKGGINRILLATDGDFNVGIDDPKSIESM 336 I A+D L A GST GGAG+ELAY+ A + F+K G NR+++ATDGDFNVG ++++ Sbjct: 500 IIEALDRLQAGGSTAGGAGIELAYKLAQENFVKEGNNRVIIATDGDFNVGTSSISDLKTL 559 Query: 337 VKKQRESGVTLSTFGVGNSNYNEAMMVRIADVGNGNYSYIDTLSEAQKVLNSEMRQMLIT 396 ++ +R+SGV L+ G G NY + + +AD GNGNY+YID + EA K L E + Sbjct: 560 IEDRRKSGVFLTCLGFGMGNYKDNTLETLADKGNGNYAYIDNMQEANKFLGKEFAGSMYA 619 Query: 397 VAKDVKAQIEFNPAWVTEYRQIGYEKRQLRVEHFNNDNVDAGDIGAGKHITLLFELTLNG 456 +AKD+K QIEFNP +V YR IGYE R+L+ E F ND +DAG++G+G +T L+E+ Sbjct: 620 IAKDMKIQIEFNPEYVKSYRLIGYENRKLKNEDFTNDKIDAGELGSGHTVTALYEVIPAN 679 Query: 457 QKASIDKLR--YAPDNKLAKSDKTKELAWLKIRWKYPQGKESQLVEFPLGPT---INAPS 511 + + ELA +K R+K P G S+ + + + I++ S Sbjct: 680 VNSDFAPKESDLKYSQNTSSKGFGDELATIKFRYKKPDGDTSREITQVVKNSDNRISSAS 739 Query: 512 EDMRFRAAVAAYGQKLRGSEYLNNTSWQQIKQWAQQAKGEDPQGYRAEFIRLIELADGVT 571 D +F ++VA +G LR SE + I+ A+Q K +D +GYR+EFIRLIE V Sbjct: 740 PDFKFASSVAWFGLVLRNSELITKKDLSDIENLAKQGKNKDEEGYRSEFIRLIESYKTVQ 799 >UniRef50_A1ZU67 Von Willebrand factor type A domain protein n=1 Tax=Microscilla marina ATCC 23134 RepID=A1ZU67_9SPHI Length = 552 Score = 436 bits (1122), Expect = e-121, Method: Composition-based stats. Identities = 191/480 (39%), Positives = 294/480 (61%), Gaps = 11/480 (2%) Query: 97 IANPGTARYQQFDDNPVKQVAQNPLATFSLDVDTGSYANVRRFLNQGLLPPPDAVRVEEI 156 ++ P + ++N V PL+TFS+DVD SY+ R+ +N G LP +VR+EE Sbjct: 79 VSPPKIKEKKPANENTFLSVKTAPLSTFSIDVDNASYSRARKSINNGQLPSTSSVRLEEF 138 Query: 157 VNYFPSDWDIKDKQSIPASKPIPFAMRYELAPAPWNEQRTLLKVDILAKDRKSEELPASN 216 +NYF + + Q PF++ E+A PWN + L+ + + K S +L SN Sbjct: 139 INYFNYQYKQPEGQH-------PFSVNTEVAKCPWNPKNHLVHIGLQGKRLDSRKLKLSN 191 Query: 217 LVFLIDTSGSMISDERLPLIQSSLKLLVKELREQDNIAIVTYAGDSRIALPSISGSHKAE 276 LVFLID SGSM + ++LPL++ + K+LV L E+D +AIV YAG++ + LP+ G+ K + Sbjct: 192 LVFLIDVSGSMSAPDKLPLLRKAFKMLVNNLGEEDRVAIVVYAGNAGLVLPATQGTDKQK 251 Query: 277 INAAIDSLDAEGSTNGGAGLELAYQQATKGFIKGGINRILLATDGDFNVGIDDPKSIESM 336 I A+D L + GST GGAG++LAY+ A + FIK G NRI+LATDGDFN+G ++++++ Sbjct: 252 IMEALDKLQSGGSTAGGAGIKLAYKIAKQNFIKEGNNRIILATDGDFNLGASSDQAMQNL 311 Query: 337 VKKQRESGVTLSTFGVGNSNYNEAMMVRIADVGNGNYSYIDTLSEAQKVLNSEMRQMLIT 396 ++++R+ GV ++ G+G NY ++ M IAD GNGNY Y+D L+EA KV +++ L T Sbjct: 312 IEEKRKEGVFITVLGLGMGNYRDSKMEIIADKGNGNYYYLDNLNEAYKVFGKDLKGTLFT 371 Query: 397 VAKDVKAQIEFNPAWVTEYRQIGYEKRQLRVEHFNNDNVDAGDIGAGKHITLLFELTLNG 456 +AKDVK Q+EFN A V YR IGYE R L F +D DAG+IGAG +T L+E+TL+ Sbjct: 372 IAKDVKIQVEFNSAVVKSYRLIGYENRLLANRDFRDDTKDAGEIGAGHTVTALYEVTLHS 431 Query: 457 QKASID-KLRYAPDNKLAKSDKTKELAWLKIRWKYPQGKESQLVEFPLG---PTINAPSE 512 ++ P N A ++L +++R+K P+G + +++ S Sbjct: 432 NPQTVAVDQNQIPANFQATQFNNQQLMNVRLRYKKPEGSTGIETSQIIAANHQSVDETSH 491 Query: 513 DMRFRAAVAAYGQKLRGSEYLNNTSWQQIKQWAQQAKGEDPQGYRAEFIRLIELADGVTD 572 + RF AAVA++G L+ S+Y +T++Q + A+ +KG+D YRAEFI L++ A +T Sbjct: 492 NFRFSAAVASFGMLLKNSQYKGSTTFQTVLTLAKGSKGKDMNQYRAEFIDLVQKASQITT 551 >UniRef50_A9KS19 von Willebrand factor type A n=1 Tax=Clostridium phytofermentans ISDg RepID=A9KS19_CLOPH Length = 551 Score = 436 bits (1121), Expect = e-120, Method: Composition-based stats. Identities = 188/472 (39%), Positives = 276/472 (58%), Gaps = 11/472 (2%) Query: 101 GTARYQQFDDNPVKQVAQNPLATFSLDVDTGSYANVRRFLNQGLLPPPDAVRVEEIVNYF 160 T Y + + +PL+TFS DVDT SY+N+RR L +G AVR+EE++NYF Sbjct: 88 STEEYNAVIEQGYQSTKNHPLSTFSADVDTASYSNIRRMLKEGRRVDTGAVRIEEMLNYF 147 Query: 161 PSDWDIKDKQSIPASKPIPFAMRYELAPAPWNEQRTLLKVDILAKDRKSEELPASNLVFL 220 D+ + + PF + EL+ PWN L I + + SNLVFL Sbjct: 148 NYDYKLPE-------GDSPFGITTELSDCPWNPDTKLFLAGIQTEKIDFSKSAPSNLVFL 200 Query: 221 IDTSGSMISDERLPLIQSSLKLLVKELREQDNIAIVTYAGDSRIALPSISGSHKAEINAA 280 ID SGSM+ +++LPL+Q + LL + L E+D I+IVTYAG+ + L G+ K +I A Sbjct: 201 IDVSGSMMDEDKLPLVQRAFLLLTENLTEKDRISIVTYAGNDTVVLSGAKGNQKEKIQNA 260 Query: 281 IDSLDAEGSTNGGAGLELAYQQATKGFIKGGINRILLATDGDFNVGIDDPKSIESMVKKQ 340 I L+A GST G G+E AYQ A + +I+GG NR++LATDGD NVG+ + ++++++ Sbjct: 261 ITELEAGGSTFGSKGIETAYQLAMENYIEGGNNRVILATDGDLNVGVTSESELTNLIEEK 320 Query: 341 RESGVTLSTFGVGNSNYNEAMMVRIADVGNGNYSYIDTLSEAQKVLNSEMRQMLITVAKD 400 R+SGV LS G G N + M +AD GNGNY+YID+L EA+KVL EM L+TVA D Sbjct: 321 RKSGVALSVLGFGTGNIKDNKMEALADHGNGNYAYIDSLMEARKVLVEEMGATLVTVAGD 380 Query: 401 VKAQIEFNPAWVTEYRQIGYEKRQLRVEHFNNDNVDAGDIGAGKHITLLFELTLNGQKAS 460 VK Q+EFNPA V YR +GY+ R L E FN+D DAG++GAG +T+L+EL L K Sbjct: 381 VKFQVEFNPAKVKGYRLLGYDNRLLATEDFNDDTKDAGEVGAGHSVTVLYELVLEDSKME 440 Query: 461 IDKLRYAPDNKLAKSDKTKELAWLKIRWKYPQGKESQLVEFPLG--PTINAPSEDMRFRA 518 I + ++ EL + IR+K P +S L+ P+G + ++++ F Sbjct: 441 IPETELKYTT-TEPTNMVDELLTVNIRYKKPGKDKSILMSEPVGINQLADTRTDNLAFAT 499 Query: 519 AVAAYGQKLRGSEYLNNTSWQQIKQWAQQAKGEDPQGYRAEFIRLIELADGV 570 AVA +G L+ SEY + ++ ++ ++ + YRAEF +L++LA + Sbjct: 500 AVAEFGLLLKDSEYKGDATFSKVLSRLEETNYK-QDEYRAEFYQLVKLAKDI 550 >UniRef50_A4BJU7 Putative uncharacterized protein n=1 Tax=Reinekea blandensis MED297 RepID=A4BJU7_9GAMM Length = 555 Score = 436 bits (1121), Expect = e-120, Method: Composition-based stats. Identities = 203/566 (35%), Positives = 301/566 (53%), Gaps = 21/566 (3%) Query: 9 LLMSSLILSGCGPQPENKESQQQQPSTPTEQQVLAAQQAAIKEAEQSAAAAKALAQQEVQ 68 ++++L+L CG Q + P+ E +A ++ A + ++ EVQ Sbjct: 9 SVVATLMLLSCGTQTTDGALTD--PAVLQEPVHTEETRAIETDSADQVFLAASKSRVEVQ 66 Query: 69 QYSDKQALQGRLQEAPTFARAAKAKATHIANPGTARYQQFDDNPVKQVAQNPLATFSLDV 128 + P+ ++ Y + +P++QVA +P++TFS DV Sbjct: 67 E----------SYVLPSSTPIIPMPNPPVS-ENRENYPKTPISPIRQVATDPVSTFSTDV 115 Query: 129 DTGSYANVRRFLNQGLLPPPDAVRVEEIVNYFPSDWDIKDKQSIPASKPIPFAMRYELAP 188 DT SY N RRFLNQG+ PP D++RVEE +NYF P + P + E Sbjct: 116 DTASYTNARRFLNQGMRPPADSIRVEEFINYFDYALP------APDTTNTPIQISTERTQ 169 Query: 189 APWNEQRTLLKVDILAKDRKSEELPASNLVFLIDTSGSMISDERLPLIQSSLKLLVKELR 248 PWN Q L++V + + + LP NLVFL+D SGSM S ++LPL+Q S LLV +LR Sbjct: 170 TPWNPQTELVRVSLQSYRSDFKTLPPLNLVFLLDVSGSMNSPDKLPLMQRSFNLLVSQLR 229 Query: 249 EQDNIAIVTYAGDSRIALPSISGSHKAEINAAIDSLDAEGSTNGGAGLELAYQQATKGFI 308 QD +AI YAG S + L SG KA+IN AI+ L A G T+G AG+ LAY A ++ Sbjct: 230 PQDRVAIAVYAGQSGVVLEPTSGDQKAQINQAINQLRAGGGTHGSAGIHLAYDLAQANYL 289 Query: 309 KGGINRILLATDGDFNVGIDDPKSIESMVKKQRESGVTLSTFGVGNSNYNEAMMVRIADV 368 GINRI + TDGDFNVG ++++++++RE+GV LS G G NYN+A+M +++ Sbjct: 290 PDGINRIFIGTDGDFNVGTTSLTELKALIERKREAGVFLSVLGFGTGNYNDALMEELSNH 349 Query: 369 GNGNYSYIDTLSEAQKVLNSEMRQMLITVAKDVKAQIEFNPAWVTEYRQIGYEKRQLRVE 428 GNG Y+D+ EA+K+ +++ L TVAKDVK QIEFNPA V EYR IGY+ R L E Sbjct: 350 GNGTAYYLDSYQEARKLFATQLAATLQTVAKDVKIQIEFNPAQVAEYRLIGYDNRLLARE 409 Query: 429 HFNNDNVDAGDIGAGKHITLLFELTLNGQKASIDKLRYAPDNKLAKSDKTKELAWLKIRW 488 FNND +DAG++G+G +T L+E+ + + E+A++K R+ Sbjct: 410 DFNNDAIDAGEMGSGHAVTALYEI-VRRDSEFRFSDPLRYQDDDLSDTVGGEIAFVKARY 468 Query: 489 KYPQGKESQLVEFPLGPT-INAPSEDMRFRAAVAAYGQKLRGSEYLNNTSWQQIKQWAQQ 547 K P S+L+ + T + + S+ VA + + LRGS YL + S + Sbjct: 469 KLPDEAHSRLLSQAITDTPMQSSSQRQALAIGVAGFAEILRGSPYLRDWSINDAIDYIGP 528 Query: 548 AKGEDPQGYRAEFIRLIELADGVTDI 573 + ED GYR E + L+ + Sbjct: 529 SLQEDRWGYRQELVTLMRNLQSMEHA 554 >UniRef50_C6BAR1 von Willebrand factor type A n=7 Tax=Alphaproteobacteria RepID=C6BAR1_RHILS Length = 706 Score = 434 bits (1117), Expect = e-120, Method: Composition-based stats. Identities = 217/563 (38%), Positives = 317/563 (56%), Gaps = 21/563 (3%) Query: 20 GPQPENKESQQQQPSTPTEQQVLAAQQAAIKEAEQSAAAAKALAQQEVQQYSDKQALQGR 79 P ++ Q + +A AA A +Q+++ A Sbjct: 154 APALAKPDASQTSEYDANAALTNKPEGSAAALGATKRAAPAAPGIVPQRQFAEPMA---- 209 Query: 80 LQEAPTFARAAKAKATHIANPGTARYQQFDDNPVKQVAQNPLATFSLDVDTGSYANVRRF 139 AP+ A+ + +P R+ NP+K VA +P++TFS DVD+ SYA VRR Sbjct: 210 -AIAPSPVPPAEGRMQMQLDPNRERFANAAANPIKSVATDPVSTFSADVDSASYAFVRRS 268 Query: 140 LNQGLLPPPDAVRVEEIVNYFPSDWDIKDKQSIPASKPIPFAMRYELAPAPWNEQRTLLK 199 L G +P P +VRVEE++NYFP D P + PF + P PWN L+ Sbjct: 269 LTGGAMPDPLSVRVEEMINYFPYD------WPGPNNADQPFKATVTVMPTPWNRDTELMH 322 Query: 200 VDILAKDRKSEELPASNLVFLIDTSGSMISDERLPLIQSSLKLLVKELREQDNIAIVTYA 259 V I D P +NLVFLID SGSM ++LPL++S+ +L+V L+ D ++IVTYA Sbjct: 323 VAIKGYDIAPATTPRANLVFLIDVSGSMDEPDKLPLLKSAFRLMVNRLKADDTVSIVTYA 382 Query: 260 GDSRIALPSISGSHKAEINAAIDSLDAEGSTNGGAGLELAYQQATKGFIKGGINRILLAT 319 G++ L + K++I +AID L+ GST G G+E AY A +GF+K G+NR++LAT Sbjct: 383 GNAGTVLAPTRVAEKSKILSAIDRLEPGGSTGGAEGIEAAYDLAKQGFVKDGVNRVMLAT 442 Query: 320 DGDFNVGIDDPKSIESMVKKQRESGVTLSTFGVGNSNYNEAMMVRIADVGNGNYSYIDTL 379 DGDFNVG ++ +++++R+ G+ L+ G G N N+++M +A GNG+ +YIDTL Sbjct: 443 DGDFNVGPSSDGDLKRIIEEKRKDGIFLTVLGFGRGNLNDSLMQTLAQNGNGSAAYIDTL 502 Query: 380 SEAQKVLNSEMRQMLITVAKDVKAQIEFNPAWVTEYRQIGYEKRQLRVEHFNNDNVDAGD 439 +EAQK L E L +A DVK Q+EFNP + EYR IGYE R L E FNND VDAGD Sbjct: 503 AEAQKTLVEEAGSTLFPIASDVKFQVEFNPERIAEYRLIGYETRALNREDFNNDRVDAGD 562 Query: 440 IGAGKHITLLFELTLNGQKASI-DKLRYAPDNKLAKSDKT----KELAWLKIRWKYPQGK 494 IG+G +T ++E+T G A + D LRY +K+ ELA++K+R+K P Sbjct: 563 IGSGHSVTAIYEITPKGSPAVMNDDLRYGAADKVPAEASDSAHHGELAFVKMRYKRPGED 622 Query: 495 ESQLVEFPLGP-----TINAPSEDMRFRAAVAAYGQKLRGSEYLNNTSWQQIKQWAQQAK 549 +S L+ P+ T++A +D+RF AVAA+GQKL ++ S+Q I A ++ Sbjct: 623 KSALITTPVNDGNAVATVDAAPQDVRFSVAVAAFGQKLSHVAAVDTYSYQAIADLAAASR 682 Query: 550 GEDPQGYRAEFIRLIELADGVTD 572 G D GYR++F+ L+ LADG++ Sbjct: 683 GTDTFGYRSDFLGLVRLADGLSQ 705 >UniRef50_C0CVB5 Putative uncharacterized protein n=1 Tax=Clostridium asparagiforme DSM 15981 RepID=C0CVB5_9CLOT Length = 556 Score = 434 bits (1116), Expect = e-120, Method: Composition-based stats. Identities = 207/595 (34%), Positives = 308/595 (51%), Gaps = 67/595 (11%) Query: 2 RNKNIIMLLM----SSLILSGCGPQPENKESQQQQPSTPTEQQVLAAQQAAIKEAEQSAA 57 R K + + L+ + +LSGCG + + ++ TE +V +AE + Sbjct: 3 RGKQLTIGLLMCALLAGLLSGCG-------AGGGKTASATEAEV---------KAEAGSY 46 Query: 58 AAKALAQQEVQQYSDKQALQGRLQEAPTFARAAKAKATHIANPGTARYQQFDDNPVKQVA 117 A++ +A Q + +A L T Y +N VA Sbjct: 47 ASETMAAQSQWDGAVMEAEGPPLSH------------------NTEEYNYIAENAFLAVA 88 Query: 118 QNPLATFSLDVDTGSYANVRRFLNQGLLPPPDAVRVEEIVNYFPSDWDIKDKQSIPASKP 177 PL+TF+ DVDT SYAN+RR + +G P DAVR+EE++NYF D+ ++ Sbjct: 89 NAPLSTFAADVDTASYANLRRKILEGNEVPADAVRIEEMLNYFTYDYP-------EPTED 141 Query: 178 IPFAMRYELAPAPWNEQRTLLKVDILAKDRKSEELPASNLVFLIDTSGSMISDERLPLIQ 237 PF++ + PWNE LL++ + A+ E SNLVFLID SGSM S ++L L++ Sbjct: 142 EPFSVTTYIGDCPWNENHKLLQIGLQAEKPDLENQKPSNLVFLIDVSGSMESADKLGLVK 201 Query: 238 SSLKLLVKELREQDNIAIVTYAGDSRIALPSISGSHKAEINAAIDSLDAEGSTNGGAGLE 297 + LL + LR +D ++IVTYA + L +SG KA I AI++L A GST+G G+E Sbjct: 202 RAFLLLTENLRPEDTVSIVTYASSDTVVLDGVSGEEKAAIMTAIENLTAGGSTDGSKGIE 261 Query: 298 LAYQQATKGFIKGGINRILLATDGDFNVGIDDPKSIESMVKKQRESGVTLSTFGVGNSNY 357 AY+ A + F K G NR++LATDGD N+G+ + +++K++ESGV LS G G N Sbjct: 262 TAYRLAEEHFQKDGNNRVILATDGDLNLGLTSEGDLTRLIQKKKESGVFLSVMGFGTGNI 321 Query: 358 NEAMMVRIADVGNGNYSYIDTLSEAQKVLNSEMRQMLITVAKDVKAQIEFNPAWVTEYRQ 417 + M +AD GNG Y+Y+D+L EA++VL E+ L TVAKDVK Q+EFNPA V YR Sbjct: 322 KDNKMEALADNGNGQYAYVDSLMEAKRVLVEELGGTLFTVAKDVKLQVEFNPAKVKGYRL 381 Query: 418 IGYEKRQLRVEHFNNDNVDAGDIGAGKHITLLFELTLNGQKASIDKLRYA---------- 467 IGYE R + F++D D G+IGAG +T L+EL G + ++ Sbjct: 382 IGYENRLMEARDFDDDAKDGGEIGAGHRVTALYELVPAGSDEDLGEVELKYGAGNVAAAE 441 Query: 468 ----------PDNKLAKSDKTKELAWLKIRWKYPQGKESQLVEFPLGPTI--NAPSEDMR 515 P E LK+R+K P G++S+L+E+P+ + DMR Sbjct: 442 NGENGGAEARPAEGAPAPGADSEWLTLKVRYKEPDGEQSRLLEYPVDDSAVCRELPPDMR 501 Query: 516 FRAAVAAYGQKLRGSEYLNNTSWQQIKQWAQQAKGEDPQGYRAEFIRLIELADGV 570 F + VA G LR SEY +S++ I ++ G Y+ EF+ L++ + Sbjct: 502 FASCVAQTGMLLRDSEYAGGSSYKAIAAELERIDGLRGDPYKEEFLYLVKRLAAM 556 >UniRef50_Q2N8R4 Von Willebrand factor type A domain protein n=2 Tax=Erythrobacter RepID=Q2N8R4_ERYLH Length = 580 Score = 433 bits (1114), Expect = e-120, Method: Composition-based stats. Identities = 231/580 (39%), Positives = 332/580 (57%), Gaps = 22/580 (3%) Query: 1 MRNKNIIMLLMSSLILSGCGPQPENKE------SQQQQPSTPTEQQVLAAQQAAIKEAEQ 54 MR + M ++ L+ C Q E S+ + S + A Sbjct: 1 MRIIRFATVSMLAVALASCASQSSEGERIVVTGSKADRSSDASPPPPPPPPPPPPSPAYA 60 Query: 55 SAAAAKALAQQEVQQYSDKQALQGRLQEAPTFARAAKAKATHIANPGTARYQQFDDNPVK 114 + A + + + G+ EA RY + +PVK Sbjct: 61 AQQAVVVSGSRIASEAAVAPDTSGQPAEAAGREYRYVMPVIVPQPEDRERYDGEEVSPVK 120 Query: 115 QVAQNPLATFSLDVDTGSYANVRRFLNQGLLPPPDAVRVEEIVNYFPSDWDIKDKQSIPA 174 A PL+TFS+DVDTG+YAN RRFL+QG +PP AVR EE +NYF D+D P Sbjct: 121 IAAVEPLSTFSVDVDTGAYANARRFLSQGQMPPKAAVRTEEFINYFRYDYD------RPQ 174 Query: 175 SKPIPFAMRYELAPAPWNEQRTLLKVDILAKDRKSEELPASNLVFLIDTSGSMISDERLP 234 + PF + ++ A PWNE L+++ + D + E P +NLVFL+D SGSM ++LP Sbjct: 175 DRSQPFTVNFDAARTPWNEDTRLIRIGLAGYDIERSERPPANLVFLMDVSGSMGRPDKLP 234 Query: 235 LIQSSLKLLVKELREQDNIAIVTYAGDSRIALPSISGSHKAEINAAIDSLDAEGSTNGGA 294 L++++L L EL+ QD ++IV YAG + + L + +I AA++ L A GST GGA Sbjct: 235 LVKTALAGLAGELQPQDKVSIVVYAGAAGLVLEPT--NDTRKIRAALNQLQAGGSTAGGA 292 Query: 295 GLELAYQQATKGFIKGGINRILLATDGDFNVGIDDPKSIESMVKKQRESGVTLSTFGVGN 354 G++LAYQ A FI+GG+NR++LATDGDFNVG+ ++ M++K+R+SG+TL+T G G Sbjct: 293 GIQLAYQIAEDNFIEGGVNRVILATDGDFNVGVSSRDALIEMIEKKRDSGITLTTLGFGT 352 Query: 355 SNYNEAMMVRIADVGNGNYSYIDTLSEAQKVLNSEMRQMLITVAKDVKAQIEFNPAWVTE 414 NYNEAMM +IA+ GNGNY+YID+ EA+KVL EM L T+AKDVK Q+EFNPA +++ Sbjct: 353 GNYNEAMMEQIANHGNGNYAYIDSALEAKKVLGDEMSSTLFTIAKDVKIQVEFNPAVISQ 412 Query: 415 YRQIGYEKRQLRVEHFNNDNVDAGDIGAGKHITLLFELTLNGQKASIDKLRYAPDNKLAK 474 YR IGYE R LR E F+ND VDAGDIGAG +T ++E+ G K I LRY A Sbjct: 413 YRLIGYENRALRDEDFDNDAVDAGDIGAGHQVTAIYEVVPVGTKGWIPPLRYGDRPAQAA 472 Query: 475 SDKTKELAWLKIRWKYPQGKESQLVEFPLGPTINA----PSEDMRFRAAVAAYGQKLRGS 530 S++ +E A++K+R+K P G+ S+L+++ L + P D F +AVA +GQKLRG Sbjct: 473 SERAEEAAYVKLRYKMPDGETSKLIDYVLPASTLRTATMPRGDFAFASAVAGFGQKLRGD 532 Query: 531 EYLNNTSWQQIKQWAQQAKGEDPQGYRAEFIRLIELADGV 570 L + ++ + + A G +R EF++L LA + Sbjct: 533 PMLGDFAYDDLARLA----GTQQDFWRQEFVKLTSLAGSM 568 >UniRef50_B1ZYN3 von Willebrand factor type A n=2 Tax=Bacteria RepID=B1ZYN3_OPITP Length = 792 Score = 433 bits (1112), Expect = e-119, Method: Composition-based stats. Identities = 227/575 (39%), Positives = 314/575 (54%), Gaps = 19/575 (3%) Query: 17 SGCGPQPENKESQQQQPSTPTEQQVLAAQQAAIKEAEQSAAAAKALAQQEVQQYSDKQAL 76 S P+ + + T++QV A+ A Q+ A A+ Sbjct: 217 SFAAPESTAFGALARAELRETQRQVRQARAQKKDAAMQALLVANEEPAALSSFPGQAPAM 276 Query: 77 QGRLQEAPTFARAAKAKATHIANPGTARYQQFDDNPVKQVAQNPLATFSLDVDTGSYANV 136 G + + + H T Y+ ++ ++PL+TF+ DVDT SYANV Sbjct: 277 DGYIASTTFAGIGTRVRGDHRQAMNTEAYRFLRESDFLSAREHPLSTFAADVDTASYANV 336 Query: 137 RRFLNQGLLPPPDAVRVEEIVNYFPSDWDIKDKQSIP---ASKPIPFAMRYELAPAPWNE 193 RRFL +G LPP DAVR+EE+VNYFP + + A PFA E+A APW Sbjct: 337 RRFLREGRLPPADAVRIEELVNYFPYRYAAPGRVRDEGVAAPGEAPFAAALEVAAAPWAA 396 Query: 194 QRTLLKVDILAKDRKSEELPASNLVFLIDTSGSMISDERLPLIQSSLKLLVKELREQDNI 253 Q L+++ + AKD A+NLVFL+D SGSM +L L+Q S++LL+ L+ +D + Sbjct: 397 QHRLVRIGLKAKDAAVSGRAAANLVFLLDVSGSMDQPNKLRLVQESMRLLLGRLQPEDRV 456 Query: 254 AIVTYAGDSRIALPSISGSHKAEINAAIDSLDAEGSTNGGAGLELAYQQATKGFIKGGIN 313 AIVTYAG+S +ALPS + + EI AID L A GSTNG GL+LAY A F+ G+N Sbjct: 457 AIVTYAGNSGLALPSTPVARQREILDAIDELRAGGSTNGAMGLQLAYDIAKANFVANGVN 516 Query: 314 RILLATDGDFNVGIDDPKSIESMVKKQRESGVTLSTFGVGNSNYNEAMMVRIADVGNGNY 373 R++L TDGDFNVG+ + +++++ +SGV L+ G G N +AM+ +IAD GNG+Y Sbjct: 517 RVILCTDGDFNVGVTSEGELVRLIEEKAKSGVFLTVLGFGMGNLKDAMLQQIADRGNGSY 576 Query: 374 SYIDTLSEAQKVLNSEMRQMLITVAKDVKAQIEFNPAWVTEYRQIGYEKRQLRVEHFNND 433 YIDT EA+K+L ++ L+TVAKDVK Q+EFNPA V YR IGYEKR L E F ND Sbjct: 577 GYIDTRREAEKLLVQQVSGTLLTVAKDVKLQVEFNPAKVARYRLIGYEKRLLNQEDFAND 636 Query: 434 NVDAGDIGAGKHITLLFELTLNGQKASIDKLRYAPDNKLAKSDK-------------TKE 480 +DAG+IGAG +T L+E+ G K + P+++ E Sbjct: 637 KIDAGEIGAGHTVTALYEIIPVGAKDAEVTEETEPEDRRYTYSSAAPSAVEKRTLAHADE 696 Query: 481 LAWLKIRWKYPQGKESQLVEFPLGP---TINAPSEDMRFRAAVAAYGQKLRGSEYLNNTS 537 L LK+R+K P S +EFPL SED RF +AVAA+G LR S Y + Sbjct: 697 LLTLKVRYKQPTALLSTRLEFPLKDDGGNFAQASEDFRFASAVAAFGMILRDSPYKGVAT 756 Query: 538 WQQIKQWAQQAKGEDPQGYRAEFIRLIELADGVTD 572 + WA A +DP GYRAEF+ L++ A +T Sbjct: 757 LDDVIAWANAATSDDPGGYRAEFVELVKQARLLTQ 791 >UniRef50_UPI000185CB41 protein containing von Willebrand factor n=1 Tax=Capnocytophaga sputigena ATCC 33612 RepID=UPI000185CB41 Length = 550 Score = 431 bits (1108), Expect = e-119, Method: Composition-based stats. Identities = 222/528 (42%), Positives = 311/528 (58%), Gaps = 18/528 (3%) Query: 55 SAAAAKALAQQEVQQYSDKQALQGRLQEAPTFARAAKAKATHIANPGT------ARYQQF 108 A A + + + + + A + + + P Y++ Sbjct: 29 PPAPATTVEELTANKMASPETPPPPPPPPAYDAVVEEMEIANSEEPSQQQLRSNETYKEI 88 Query: 109 DDNPVKQVAQNPLATFSLDVDTGSYANVRRFLNQGLLPPPDAVRVEEIVNYFPSDWDIKD 168 +NP VAQ P+ TFS DVD SYAN+RR L G LPP DA+R+EE++NYF D+ Sbjct: 89 SENPFVAVAQQPVTTFSADVDRASYANLRRMLGYGQLPPKDAIRIEEMINYFDYDYPAPT 148 Query: 169 KQSIPASKPIPFAMRYELAPAPWNEQRTLLKVDILAKDRKSEELPASNLVFLIDTSGSMI 228 K++ P + ELAP PWN + LL++ + AK + P SN+VFLID SGSM Sbjct: 149 KEA-----TSPLRVTPELAPTPWNPEHLLLRIGLQAKKLDLAQAPPSNIVFLIDVSGSMD 203 Query: 229 SDERLPLIQSSLKLLVKELREQDNIAIVTYAGDSRIALPSISGSHKAEINAAIDSLDAEG 288 +LPL++SS KLL+ +L+ D +AIVTYA +++AL S + +I +D+L A G Sbjct: 204 EPNKLPLLKSSFKLLLTQLKPTDRVAIVTYASGTKVALSSTPVKERQKIEKVLDNLYASG 263 Query: 289 STNGGAGLELAYQQATKGFIKGGINRILLATDGDFNVGIDDPKSIESMVKKQRESGVTLS 348 ST+G +G++LAY++A K FIK G NRI+LATDGDFNVGI +P+ +E ++KQRESG+ +S Sbjct: 264 STSGSSGIQLAYKEAQKNFIKNGNNRIILATDGDFNVGISNPRELEKFIEKQRESGIYMS 323 Query: 349 TFGVGNSNYNEAMMVRIADVGNGNYSYIDTLSEAQKVLNSEMRQMLITVAKDVKAQIEFN 408 G G NY + M IAD GNGNY+YID L+EA+KVL +E ML VAKDVK QIEFN Sbjct: 324 VLGFGMGNYRDDMAETIADKGNGNYAYIDDLTEAKKVLVNEFSGMLFAVAKDVKLQIEFN 383 Query: 409 PAWVTEYRQIGYEKRQLRVEHFNNDNVDAGDIGAGKHITLLFELTLNGQKASIDKLRYAP 468 P +V EY+ IGYE R L E F +D DAG+IGAG +T L+EL + + LRY Sbjct: 384 PKYVKEYKLIGYENRMLANEDFTDDKKDAGEIGAGHTVTALYEL-IPSEGKVAQNLRYQ- 441 Query: 469 DNKLAKSDKTKELAWLKIRWKYPQGKESQLVEFPLGPTIN-----APSEDMRFRAAVAAY 523 +L + K EL +LKIR+K P+ K+++ VE S D RF A+VA + Sbjct: 442 TKELNEKGKGNELGFLKIRYKDPKVKDAKSVEVTEPLLFAKKSLNETSVDFRFAASVAEF 501 Query: 524 GQKLRGSEYLNNTSWQQIKQWAQQAKGEDPQGYRAEFIRLIELADGVT 571 G LRG+ ++ Q+ + A A G+D +GYR EF+RL++ A + Sbjct: 502 GILLRGNSNKAQATYDQVVELANGAIGKDEEGYRKEFVRLVKSAKLLA 549 >UniRef50_A9DBX8 Putative uncharacterized protein n=1 Tax=Hoeflea phototrophica DFL-43 RepID=A9DBX8_9RHIZ Length = 668 Score = 431 bits (1107), Expect = e-119, Method: Composition-based stats. Identities = 229/555 (41%), Positives = 318/555 (57%), Gaps = 28/555 (5%) Query: 38 EQQVLAAQQAAIKEAEQSAAAAKALAQQEVQQYSDKQALQGRLQEAP------------- 84 + LA QQ + + A++ A + R P Sbjct: 118 KPLPLAPQQEEQELVAAAPLPEVAVSPALKSSRQANDAARQRFTGQPVGGLAGQGLAGQI 177 Query: 85 ---TFARAAKAKATHIANPGTARYQQFDDNPVKQVAQNPLATFSLDVDTGSYANVRRFLN 141 + A A A R + FD N V+ VA+ P++TFS DVDT SYA VRR L Sbjct: 178 DGESLRGADGANPAPGAEAERDRVEGFDSNGVRSVAEYPVSTFSADVDTASYAMVRRALK 237 Query: 142 QGLLPPPDAVRVEEIVNYFPSDWDIKDKQSIPASKPIPFAMRYELAPAPWNEQRTLLKVD 201 QG++P P VR+EE+VNYF D+ P S PF + P PWN LL + Sbjct: 238 QGVMPDPRTVRIEEMVNYFNYDYP------APESVETPFRATVTVTPTPWNANTRLLHIG 291 Query: 202 ILAKDRKSEELPASNLVFLIDTSGSMISDERLPLIQSSLKLLVKELREQDNIAIVTYAGD 261 + D K P +NLV L+D SGSM ++LPL++S+ +LL+++L +D ++IVTYAGD Sbjct: 292 VKGYDVKPAARPQANLVLLVDVSGSMQETDKLPLLKSAFRLLIQKLEPEDTVSIVTYAGD 351 Query: 262 SRIALPSISGSHKAEINAAIDSLDAEGSTNGGAGLELAYQQATKGFIKGGINRILLATDG 321 + L S KA+I A+D L GST G AG+E AY+ A K + GG+NR+LLATDG Sbjct: 352 AGTVLEPTPASDKAKILDALDDLRPGGSTAGAAGIEEAYRLAEKARVNGGVNRVLLATDG 411 Query: 322 DFNVGIDDPKSIESMVKKQRESGVTLSTFGVGNSNYNEAMMVRIADVGNGNYSYIDTLSE 381 DFNVG D +++S+++++RESGV LS FG G NYN+ +M +A GNG +YIDTL+E Sbjct: 412 DFNVGASDDDALKSLIEEKRESGVFLSIFGFGQGNYNDQLMQTLAQNGNGVAAYIDTLAE 471 Query: 382 AQKVLNSEMRQMLITVAKDVKAQIEFNPAWVTEYRQIGYEKRQLRVEHFNNDNVDAGDIG 441 A+K L E L +A DVK QIEFNP + EYRQIG+E R L E FNND VDAG+IG Sbjct: 472 AEKTLAQEATASLFPIASDVKFQIEFNPETIAEYRQIGFETRALSREDFNNDQVDAGEIG 531 Query: 442 AGKHITLLFELTLNGQKASIDK-LRYAPDNKLAKSDKTKELAWLKIRWKYPQGKESQLVE 500 +G +T ++E+T G A ++ LRY + ++ E A+LKIR K P +ES L E Sbjct: 532 SGHTVTAIYEVTPVGSPAILNSDLRYGAETPVSDVAHGDEFAFLKIRAKVPGEEESSLTE 591 Query: 501 FPLGP-----TINAPSEDMRFRAAVAAYGQKLRGSEYLNNTSWQQIKQWAQQAKGEDPQG 555 P+ + +A +D+RF AVAA+ QKLR + +N + I+ A A+GEDP G Sbjct: 592 IPVMKDAELTSFSAAPQDVRFSIAVAAFAQKLRRIQQVNGFGFDAIESIASDARGEDPFG 651 Query: 556 YRAEFIRLIELADGV 570 YR+EF++L+ LA+G+ Sbjct: 652 YRSEFLQLVRLANGL 666 >UniRef50_Q7UP85 Putative uncharacterized protein n=1 Tax=Rhodopirellula baltica RepID=Q7UP85_RHOBA Length = 885 Score = 429 bits (1103), Expect = e-118, Method: Composition-based stats. Identities = 229/573 (39%), Positives = 323/573 (56%), Gaps = 42/573 (7%) Query: 35 TPTEQQVLAAQQAAIKEAEQSAAAAKALAQQEVQQYSDKQALQGRLQE---------APT 85 P ++ A ++ + + A + +Q +Q D +A +GR E APT Sbjct: 314 APKDEAPTAPREPSAGKPVVGDFAVAPVPEQLGRQQFDFRASRGRTLERQLGETEELAPT 373 Query: 86 FARAAKAKATHI---ANPGT--ARYQQFDDNPVKQVAQNPLATFSLDVDTGSYANVRRFL 140 R A T PG +++ +N ++VA + L+TFS+DVDT SYA VR +L Sbjct: 374 SDRLAILPPTPDGEGQGPGMSGDKFEPIQENEFRRVADDALSTFSIDVDTASYAKVRSYL 433 Query: 141 NQGLLPPPDAVRVEEIVNYFPSDWDIKDKQSIPASKPIPFAMRYELAPAPWNEQRTLLKV 200 +G LP PD+VR+EE++NYF + A P+PF+ +A PWNE L++V Sbjct: 434 QRGQLPRPDSVRIEELINYFDYQY-----TPPSAEDPVPFSSAMAVASCPWNENNRLVRV 488 Query: 201 DILAKDRKSEELPASNLVFLIDTSGSMISDERLPLIQSSLKLLVKELREQDNIAIVTYAG 260 I AKD +E P NLVFLIDTSGSM +LPL+ +K+L+ +L+ +D +AIV YAG Sbjct: 489 GIQAKDIDRKERPRCNLVFLIDTSGSMKRPNKLPLVIEGMKVLLDQLKNRDRVAIVVYAG 548 Query: 261 DSRIALPSISGSHKAEINAAIDSLDAEGSTNGGAGLELAYQQATKGFIKGGINRILLATD 320 S + L S K +I A+ +L A GSTNGGAGL+LAYQ A + FI+ G+NR++L +D Sbjct: 549 SSGLVLDSTPVKQKKKIIRALSALSAGGSTNGGAGLQLAYQTARENFIEDGVNRVILCSD 608 Query: 321 GDFNVGIDDPKSIESMVKKQRESGVTLSTFGVGNSNYNEAMMVRIADVGNGNYSYIDTLS 380 GDFNVG+ + + +Q +SG L+ G G N+N+AMM RI++ G GNY+++DT++ Sbjct: 609 GDFNVGMTGTDQLVAEATRQSKSGTELTVLGFGMGNHNDAMMERISNSGAGNYAFVDTIA 668 Query: 381 EAQKVLNSEMRQMLITVAKDVKAQIEFNPAWVTEYRQIGYEKRQLRVEHFNNDNVDAGDI 440 EA+KVL ++ L TVAKDVK QIEFNPA V+ YR IGYE R L E FN+D VDAG+I Sbjct: 669 EAKKVLADQVAGTLFTVAKDVKIQIEFNPAVVSAYRLIGYENRVLAKEDFNDDKVDAGEI 728 Query: 441 GAGKHITLLFELTLNGQ-----KASIDKLRYAPD---------------NKLAKSDKTKE 480 GAG +T L+E+ G+ +D L+Y P K + TKE Sbjct: 729 GAGHRVTALYEIAPVGKLPDSIAPDVDPLKYQPSGEENPDSQEANEPRVPKDSDESATKE 788 Query: 481 LAWLKIRWKYPQGKESQLVEFPLGPT---INAPSEDMRFRAAVAAYGQKLRGSEYLNNTS 537 + LKIR K PQG S+ + FPL D +F AVA +G +LR S + + Sbjct: 789 ILTLKIRHKPPQGDVSEKLAFPLVNESVPFQEADTDFQFAVAVAVFGMQLRNSTHAGTWT 848 Query: 538 WQQIKQWAQQAKGEDPQGYRAEFIRLIELADGV 570 + A AKG+D G RAEF+ L A+ + Sbjct: 849 MDDVIATATNAKGDDEHGLRAEFLELARTAERL 881 >UniRef50_Q28U54 von Willebrand factor type A n=3 Tax=Rhodobacterales RepID=Q28U54_JANSC Length = 686 Score = 429 bits (1102), Expect = e-118, Method: Composition-based stats. Identities = 215/548 (39%), Positives = 303/548 (55%), Gaps = 12/548 (2%) Query: 32 QPSTPTEQQVLAAQQAAIKEAEQSAAAAKALAQQEVQQYSDKQALQGRLQEAPTFARAAK 91 P A A + + +AL ++ Q R + Sbjct: 143 APEADVVADAEAPSLAPMVLPAPAPMVREALGGLADLDHAGDGVAQIRRPIQGLTLYSDG 202 Query: 92 AKATHIA------NPGTARYQQFDDNPVKQVAQNPLATFSLDVDTGSYANVRRFLNQGLL 145 HI P + DDNP++ VA +P++TFS+DVDT SYA +R LN+G L Sbjct: 203 GPQNHIGTGDLALAPLPEDFANADDNPLRVVADDPVSTFSIDVDTASYALLRSTLNRGAL 262 Query: 146 PPPDAVRVEEIVNYFPSDWDIKDKQSIPASKPIPFAMRYELAPAPWNEQRTLLKVDILAK 205 P PDAVR+EE+VNYFP D+ + A PF ++ PWN L+ + I Sbjct: 263 PAPDAVRIEEMVNYFPYDYP-----APTADDISPFRPNVQVFETPWNPDTQLVHIGIQGD 317 Query: 206 DRKSEELPASNLVFLIDTSGSMISDERLPLIQSSLKLLVKELREQDNIAIVTYAGDSRIA 265 E+ P NLVFLIDTSGSM +LPL+ S +L++ L +D +AIVTYAG + +A Sbjct: 318 LPVVEDRPPLNLVFLIDTSGSMNDPAKLPLLIQSFRLMLNRLSPEDEVAIVTYAGSAGVA 377 Query: 266 LPSISGSHKAEINAAIDSLDAEGSTNGGAGLELAYQQATKGFIKGGINRILLATDGDFNV 325 L + S A INAA+ +L A GSTNG GLE AY+ A + + G ++R+LLATDGDFNV Sbjct: 378 LEPTAASDTATINAALTTLQAGGSTNGVGGLEEAYRLAGEMMVDGEVSRVLLATDGDFNV 437 Query: 326 GIDDPKSIESMVKKQRESGVTLSTFGVGNSNYNEAMMVRIADVGNGNYSYIDTLSEAQKV 385 G+ D ++E + +QR++G+ LS G G N + M +A GNG SYIDTL EAQ+V Sbjct: 438 GLSDAGALEDYIAEQRDTGIYLSVLGFGRGNLQDDTMQALAQNGNGTASYIDTLHEAQRV 497 Query: 386 LNSEMRQMLITVAKDVKAQIEFNPAWVTEYRQIGYEKRQLRVEHFNNDNVDAGDIGAGKH 445 L ++ L +A D+K Q+EFNP + EYR IGYE R L E F ND VDAGDIGAG Sbjct: 498 LVDQLAGALYPIADDLKVQVEFNPDVIAEYRLIGYETRALAREDFANDAVDAGDIGAGHS 557 Query: 446 ITLLFELTLNGQKAS-IDKLRYAPDNKLAKSDKTKELAWLKIRWKYPQGKESQLVEFPLG 504 +T ++E+T G A + LRY D ++ EL ++ +RWK P ESQL++FP+ Sbjct: 558 VTAIYEVTPVGSPAVLVAPLRYTADEGAPEAAFGDELGFISLRWKEPGADESQLIDFPIA 617 Query: 505 PTINAPSEDMRFRAAVAAYGQKLRGSEYLNNTSWQQIKQWAQQAKGEDPQGYRAEFIRLI 564 + P + +F AA+A +GQ LRGS+++ + + A +G D GYR E ++L+ Sbjct: 618 NAVADPGTEAQFAAAIAGFGQLLRGSDFVADWDYADAIALANANRGMDEFGYRTEAVQLM 677 Query: 565 ELADGVTD 572 LA ++ Sbjct: 678 RLAQSLSG 685 >UniRef50_A3ZT14 Putative uncharacterized protein n=1 Tax=Blastopirellula marina DSM 3645 RepID=A3ZT14_9PLAN Length = 616 Score = 429 bits (1102), Expect = e-118, Method: Composition-based stats. Identities = 215/579 (37%), Positives = 308/579 (53%), Gaps = 40/579 (6%) Query: 24 ENKESQQQQPSTPTEQQVLAAQQAAIKEAEQSAAAAKALAQQEVQQYSD-----KQALQG 78 S E + + A + +A A + + + + A Q Sbjct: 46 AEAVYADNNKSVKQEAKPASELSAVASQPVPAARAVEMNRDRVAGREKEAGKVRSDARQD 105 Query: 79 RLQEAPTFARAAKAKATHIANPGT-----------------ARYQQFDDNPVKQVAQNPL 121 RL PT +R + + A ++ ++NP + VA PL Sbjct: 106 RLATLPTESRRLGIEQPNAAPGFMPQLDGIAGHGEGPGVGGDKFAYVENNPFRAVADEPL 165 Query: 122 ATFSLDVDTGSYANVRRFLNQ-GLLPPPDAVRVEEIVNYFPSDWDIKDKQSIPASKPIPF 180 +TFS+DVDT SY+ +R +L LPP AVRVEE++NYF D+ + PF Sbjct: 166 STFSIDVDTASYSKIRSYLIDYHQLPPQGAVRVEELINYFTYDY-------ATPTDQKPF 218 Query: 181 AMRYELAPAPWNEQRTLLKVDILAKDRKSEELPASNLVFLIDTSGSMISDERLPLIQSSL 240 A E A PWN + L+++ I K+ + E PASNLVFL+D SGSM + +LPL++ + Sbjct: 219 AANVEAAACPWNAEHRLVRIGIKGKEIANAERPASNLVFLLDVSGSMNNARKLPLLKQGM 278 Query: 241 KLLVKELREQDNIAIVTYAGDSRIALPSISGSHKAEINAAIDSLDAEGSTNGGAGLELAY 300 KLLV +L E D +AIV YAG + + L S +G K+ I A+D L A GSTNGG G+ELAY Sbjct: 279 KLLVDQLGENDKVAIVVYAGAAGMVLNSTNGDDKSTIMEALDRLQAGGSTNGGQGIELAY 338 Query: 301 QQATKGFIKGGINRILLATDGDFNVGIDDPKSIESMVKKQRESGVTLSTFGVGNSNYNEA 360 Q AT+ FIKGG+NR++L TDGDFNVG+ + +M + +SGV LS G G N+N+A Sbjct: 339 QAATENFIKGGVNRVILCTDGDFNVGVTSTSDLVTMAADKAKSGVFLSVMGFGTGNHNDA 398 Query: 361 MMVRIADVGNGNYSYIDTLSEAQKVLNSEMRQMLITVAKDVKAQIEFNPAWVTEYRQIGY 420 MM ++ NGNY++IDT++EA+KVL +M L T+AKDVK QIEFNP V YR +GY Sbjct: 399 MMEELSGKANGNYAFIDTITEAKKVLVEQMSGTLTTIAKDVKIQIEFNPTKVAAYRLVGY 458 Query: 421 EKRQLRVEHFNNDNVDAGDIGAGKHITLLFELTL-----NGQKASIDKLRYAPDNKLAKS 475 E R L E FN+D DAG+IGAG +T +E+ A +D L+Y + + Sbjct: 459 ENRLLANEDFNDDKKDAGEIGAGHCVTAFYEIVPASVESPVTTAKVDDLKYQATRDVTPA 518 Query: 476 DKTKELAWLKIRWKYPQGKESQLVEFPLGPT---INAPSEDMRFRAAVAAYGQKLRGSEY 532 + EL LKIR+K P ES L+ + + S D +F A VA +G LR + Sbjct: 519 ADSDELLTLKIRYKQPDEDESSLISVGVKDSGNRFAQASGDFQFAAGVAMFGMLLRAGDQ 578 Query: 533 LNNTSWQQIKQWAQQAKGEDPQGYRAEFIRLIELADGVT 571 + +I + G+D YR EF+++++ A + Sbjct: 579 DAKVNLDEITELVSNNVGDDS--YRGEFLKIVQAAKTLK 615 >UniRef50_Q21MJ3 von Willebrand factor, type A n=5 Tax=Proteobacteria RepID=Q21MJ3_SACD2 Length = 708 Score = 428 bits (1101), Expect = e-118, Method: Composition-based stats. Identities = 214/549 (38%), Positives = 309/549 (56%), Gaps = 15/549 (2%) Query: 33 PSTPTEQQVLAAQQAAIKEAEQSAAAAKALAQQEVQQYSDKQALQGRLQEAPTFARAAKA 92 PS E+ V+ +++ E+ + + A +Q + A R A Sbjct: 163 PSAALEEVVVTGMRSSAAESAKLSKKPAASQRQVSAIRAQDIGALPDQSNAVALQRIAGM 222 Query: 93 KAT-----HIANPGTARYQQFDDNPVKQVAQNPLATFSLDVDTGSYANVRRFLNQGLLPP 147 A G +++ ++N VK VA+ P++TFS+DVDT SY+ VRR LN G LP Sbjct: 223 PVDGDTIVAPAPQGNDKFEHVEENSVKSVAEAPVSTFSIDVDTASYSFVRRQLNSGYLPE 282 Query: 148 PDAVRVEEIVNYFPSDWDIKDKQSIPASKPIPFAMRYELAPAPWNEQRTLLKVDILAKDR 207 DA+R EE++NYF ++ +P+ PF + +PW + + L+ + + D Sbjct: 283 KDAIRAEELINYFDYNYP------LPSDSTAPFKPNITVIDSPWAKGKKLVHIGLKGYDI 336 Query: 208 KSEELPASNLVFLIDTSGSMISDERLPLIQSSLKLLVKELREQDNIAIVTYAGDSRIALP 267 ++ P +NLVFL+D SGSM S ++LPL++ S+++L+ L D +AIV YAG + L Sbjct: 337 APDQKPRTNLVFLLDVSGSMNSQDKLPLVKQSMEMLLSTLNPDDTVAIVVYAGAAGTVLE 396 Query: 268 SISGSHKAEINAAIDSLDAEGSTNGGAGLELAYQQATKGFIKGGINRILLATDGDFNVGI 327 K +I +A+ L A GST GGAG+ LAY A F K +NR++LATDGDFNVG Sbjct: 397 PTPAKDKQKILSAMQRLQAGGSTAGGAGIALAYDLAEANFDKKAVNRVILATDGDFNVGS 456 Query: 328 DDPKSIESMVKKQRESGVTLSTFGVGNSNYNEAMMVRIADVGNGNYSYIDTLSEAQKVLN 387 + ++++ V+++RE G+ LS G G NYN+ +M +A GNG +YIDT+SEAQKVL Sbjct: 457 TNNETLQGFVERKREKGIFLSVLGFGQGNYNDHLMQTLAQNGNGVAAYIDTVSEAQKVLV 516 Query: 388 SEMRQMLITVAKDVKAQIEFNPAWVTEYRQIGYEKRQLRVEHFNNDNVDAGDIGAGKHIT 447 E L +AKDVK Q+EFNPA V EYR IGYE R L E FNND VDAGDIGAG +T Sbjct: 517 QEASSSLFPIAKDVKIQVEFNPATVAEYRLIGYETRALNREDFNNDAVDAGDIGAGHTVT 576 Query: 448 LLFELTLNGQKASIDKLRYAPDNKLAKSDKTKELAWLKIRWKYPQGKESQLVEFPL---- 503 ++E+T G A + + AK+ E + K+R+K P S+L+E P+ Sbjct: 577 AIYEITPVGSSAVLIDESRYAQKEKAKAPTNAEYGFFKLRYKLPSEDTSRLIEAPILQQQ 636 Query: 504 GPTINAPSEDMRFRAAVAAYGQKLRGSEYLNNTSWQQIKQWAQQAKGEDPQGYRAEFIRL 563 +++ F AVAAY QKL+GS +LN S+ I AQ +KG D GYR EF++L Sbjct: 637 PLVPAELMQEVNFSVAVAAYAQKLKGSNFLNKYSYHDIIALAQASKGSDEYGYRTEFVQL 696 Query: 564 IELADGVTD 572 + A+ D Sbjct: 697 VRKAELADD 705 >UniRef50_A5F9T1 von Willebrand factor, type A n=9 Tax=Bacteria RepID=A5F9T1_FLAJ1 Length = 709 Score = 428 bits (1100), Expect = e-118, Method: Composition-based stats. Identities = 205/482 (42%), Positives = 297/482 (61%), Gaps = 14/482 (2%) Query: 95 THIANPGTARYQQFDDNPVKQVAQNPLATFSLDVDTGSYANVRRFLNQGLLPPPDAVRVE 154 + P Y F +N + PL+TFS+DVD SY N+RRFLN G P DAVRVE Sbjct: 233 PNPTLPTQEDYDTFVENAFESPKTAPLSTFSIDVDNASYTNIRRFLNSGQEVPKDAVRVE 292 Query: 155 EIVNYFPSDWDIKDKQSIPASKPIPFAMRYELAPAPWNEQRTLLKVDILAKDRKSEELPA 214 E+VN+F ++ + PF++ E + +PWN Q +LK+ + K+ + +LP+ Sbjct: 293 EMVNFFKYNYPQPKNEH-------PFSINTEYSDSPWNSQNKILKIGLQGKNIATNDLPS 345 Query: 215 SNLVFLIDTSGSMISDERLPLIQSSLKLLVKELREQDNIAIVTYAGDSRIALPSISGSHK 274 SNLVFLID SGSM +LPL++ S+K+LV ELR D ++IV YAG + + LP SG+ K Sbjct: 346 SNLVFLIDVSGSMEDMNKLPLLKQSMKILVNELRPTDKVSIVVYAGAAGMVLPPTSGNEK 405 Query: 275 AEINAAIDSLDAEGSTNGGAGLELAYQQATKGFIKGGINRILLATDGDFNVGIDDPKSIE 334 I A+D L+A GST GGAG+ELAY+ AT+ FIKGG NR++LATDGDFNVG +E Sbjct: 406 KTIIKALDQLEAGGSTAGGAGIELAYKIATENFIKGGNNRVILATDGDFNVGSSSNSDME 465 Query: 335 SMVKKQRESGVTLSTFGVGNSNYNEAMMVRIADVGNGNYSYIDTLSEAQKVLNSEMRQML 394 +++++R++GV L+ G G NY ++ M +AD GNGNY+YID + EA + L E + + Sbjct: 466 KLIEEKRKTGVFLTCLGYGMGNYKDSKMEILADKGNGNYAYIDNIQEANRFLGKEFKGSM 525 Query: 395 ITVAKDVKAQIEFNPAWVTEYRQIGYEKRQLRVEHFNNDNVDAGDIGAGKHITLLFELTL 454 +AKDVK QIEFNP V YR IGYE R+LR E F ND +DAG++G+ +T L+E+ Sbjct: 526 FAIAKDVKIQIEFNPKQVQAYRLIGYENRKLRPEDFKNDAIDAGELGSNHTVTALYEIIP 585 Query: 455 NGQKASIDKLRYA----PDNKLAKSDKTKELAWLKIRWKYPQGKESQLVEFPL---GPTI 507 G K+ ++ + ++ + ELA +K R+K P G +S + + ++ Sbjct: 586 AGVKSDFLNVQPDDLKYTKTETNSANYSNELATIKFRYKKPDGDKSIEMVQVINTKSVSL 645 Query: 508 NAPSEDMRFRAAVAAYGQKLRGSEYLNNTSWQQIKQWAQQAKGEDPQGYRAEFIRLIELA 567 + S+D +F AVA +G KLR S+ + + S + I + AQQ D GY+AEFIRL+E + Sbjct: 646 DQASDDFKFSTAVAWFGLKLRDSKLITDKSSESIAELAQQGMSFDKGGYKAEFIRLVETS 705 Query: 568 DG 569 + Sbjct: 706 EQ 707 >UniRef50_UPI0001C375AE von Willebrand factor type A n=1 Tax=Ruminococcus flavefaciens FD-1 RepID=UPI0001C375AE Length = 550 Score = 417 bits (1072), Expect = e-115, Method: Composition-based stats. Identities = 185/526 (35%), Positives = 283/526 (53%), Gaps = 15/526 (2%) Query: 51 EAEQSAAAAKALAQQEVQQYSDKQALQGRLQE--APTFARAAKAKATHIANPGTARYQQF 108 + + ++ A + ++ +D++ + Y+ + Sbjct: 30 DTASGSYKEESSAYEYYEESADEEFAPEYFSTDDYAPEGDYYYWEEEPELPSANEEYKGY 89 Query: 109 DDNPVKQVAQNPLATFSLDVDTGSYANVRRFLNQGLLPPPDAVRVEEIVNYFPSDWDIKD 168 + K PL+TFS DVDT SY NVRR + + P DAVR+EE +NYF D+ + Sbjct: 90 TEAGFKDTKSEPLSTFSADVDTASYTNVRRLIENRNIVPEDAVRIEEFINYFDYDYPQPE 149 Query: 169 KQSIPASKPIPFAMRYELAPAPWNEQRTLLKVDILAKDRKSEELPASNLVFLIDTSGSMI 228 S F E+A PWN L+ V I K+ + +E P SNLVFLID+SGSM Sbjct: 150 DGSA-------FGRYVEIADCPWNRDHKLMMVGIQGKELQQQETPPSNLVFLIDSSGSMN 202 Query: 229 SDERLPLIQSSLKLLVKELREQDNIAIVTYAGDSRIALPSISGSHKAEINAAIDSLDAEG 288 S ++LPL+QS+ +L ++L + D I+IVTYAG S + L GS+ EI + S+ A G Sbjct: 203 SYDKLPLVQSAFSMLAEQLDKNDRISIVTYAGSSAVLLDGEKGSNTDEILEQLYSITASG 262 Query: 289 STNGGAGLELAYQQATKGFIKGGINRILLATDGDFNVGIDDPKSIESMVKKQRESGVTLS 348 STNG G++ AY+ A + FIKGG NR++LATDGD NVG + + +++ +R++G+ LS Sbjct: 263 STNGEGGIKTAYELAEEHFIKGGNNRVILATDGDLNVGASSEEELTRLIETKRDNGIYLS 322 Query: 349 TFGVGNSNYNEAMMVRIADVGNGNYSYIDTLSEAQKVLNSEMRQMLITVAKDVKAQIEFN 408 G G NY +A M +AD GNGN+SYID+ EA++VL EM L T+AKDVK Q+EFN Sbjct: 323 VLGFGEGNYKDARMEALADNGNGNFSYIDSEDEAERVLVQEMSGTLYTIAKDVKIQVEFN 382 Query: 409 PAWVTEYRQIGYEKRQLRVEHFNNDNVDAGDIGAGKHITLLFELTLN--GQKASIDKLRY 466 P+ V+ YR IGY+ R + E F +D DAG++G+G +T L+E+ + G L + Sbjct: 383 PSQVSSYRLIGYDNRLMNAEDFLDDTKDAGEVGSGHSVTALYEIEMADTGDSYHGVPLEF 442 Query: 467 APDNKLAKSDKTK--ELAWLKIRWKYPQGKESQLVE--FPLGPTINAPSEDMRFRAAVAA 522 A ++ ++ E+ L I +K P G E++ + + ++PS M+ A A Sbjct: 443 ASEHDSIPAENNGRSEICKLSIAYKTPVGNENRNTSDLYSMENYSSSPSNSMKLAQAAAG 502 Query: 523 YGQKLRGSEYLNNTSWQQIKQWAQQAKGEDPQGYRAEFIRLIELAD 568 +G LR S+Y + + + Q K D + +++ AD Sbjct: 503 FGMVLRNSDYKGDADFDTVLDILDQLKVNDNDKINELYGLILDAAD 548 >UniRef50_C0Z5D5 Putative uncharacterized protein n=1 Tax=Brevibacillus brevis NBRC 100599 RepID=C0Z5D5_BREBN Length = 513 Score = 416 bits (1068), Expect = e-114, Method: Composition-based stats. Identities = 172/524 (32%), Positives = 275/524 (52%), Gaps = 38/524 (7%) Query: 42 LAAQQAAIKEAEQSAAAAKALAQQEVQQYSDKQALQGRLQEAPTFARAAKAKATHIANPG 101 ++ + + +E + ++ Q + Q + + P+ K + P Sbjct: 20 CSSSEQFVSRSESGNKPSASVEQGQSNQVASSPS-------PPSQLADYALKKSGDPLPN 72 Query: 102 TARYQQFDDNPVKQVAQNPLATFSLDVDTGSYANVRRFLNQGLLPPPDAVRVEEIVNYFP 161 ++ + N A++ L+TF+ DVDT SY +R F+ G LPP +AVRVEE +N+FP Sbjct: 73 DMYFKDYGTNQFVSTAKDRLSTFAADVDTASYTIMRHFIKDGNLPPAEAVRVEEFINFFP 132 Query: 162 SDWDIKDKQSIPASKPIPFAMRYELAPAPWNEQRTLLKVDILAKDRKSEELPASNLVFLI 221 + + PA FA++ + P+P+ + ++++ I K+ +E +NLVF+I Sbjct: 133 TSY--------PAPTNQTFAIQADSGPSPFQKNLQIVRIGIKGKELSPKERKPANLVFVI 184 Query: 222 DTSGSMISDERLPLIQSSLKLLVKELREQDNIAIVTYAGDSRIALPSISGSHKAEINAAI 281 D SGSM + RL L++ SL +LV +L+ D++ IV Y + R+ LP S K I +AI Sbjct: 185 DVSGSMNQENRLELVKKSLHVLVDQLQPTDSVGIVVYGSEGRVLLPPTSTEDKQAILSAI 244 Query: 282 DSLDAEGSTNGGAGLELAYQQATKGFIKGGINRILLATDGDFNVGIDDPKSIESMVKKQR 341 D L EGSTN GL L Y+ A + F INR++L +DG NVG + I ++ Sbjct: 245 DELQPEGSTNAEQGLVLGYEMAARSFKPPAINRVILCSDGVANVGETGAEGILRSIEDYA 304 Query: 342 ESGVTLSTFGVGNSNYNEAMMVRIADVGNGNYSYIDTLSEAQKVLNSEMRQMLITVAKDV 401 + LS+FG G NYN+ MM ++A+ G G+Y+YIDT SEA+++ + L T+A+DV Sbjct: 305 RKDIYLSSFGFGMGNYNDVMMEQLANKGEGSYAYIDTFSEARRIFTESLTGTLQTIARDV 364 Query: 402 KAQIEFNPAWVTEYRQIGYEKRQLRVEHFNNDNVDAGDIGAGKHITLLFELTLNGQKASI 461 K Q+EF+P V YR IGYE R +R E F ND DAG+IGAG +T L+E+ L + Sbjct: 365 KIQVEFDPKKVDSYRLIGYENRDVRDEDFRNDKTDAGEIGAGHSVTALYEVKLASPVHA- 423 Query: 462 DKLRYAPDNKLAKSDKTKELAWLKIRWKYPQGKESQLVEFPLGPTINAPSEDMRFRAAVA 521 EL +++R+ + ++ + + P+ + S D+ F AAVA Sbjct: 424 ------------------ELGTVRVRYHHASTQKVEEISEPV-KVQSTLSPDVTFLAAVA 464 Query: 522 AYGQKLRGSEYLNNTSWQQIKQWAQQAKGEDPQGYRAEFIRLIE 565 YG+ LR S Y +S + + A+ + EF+RL++ Sbjct: 465 EYGEILRESPYAERSSLADVLKLAEATA---TGEEQLEFVRLVK 505 >UniRef50_A9IU52 Putative lipoprotein n=3 Tax=Bordetella RepID=A9IU52_BORPD Length = 582 Score = 413 bits (1061), Expect = e-113, Method: Composition-based stats. Identities = 251/557 (45%), Positives = 338/557 (60%), Gaps = 13/557 (2%) Query: 17 SGCGPQPENKESQQQQPSTPTEQQVLAAQQAAIKEAEQSAAAAKALAQQEVQQYSDKQAL 76 S + + V +A A + A +QY+ Sbjct: 30 SAADAARALTGAGKAGNPGTVPPAVPSAPPAPPAAEADAGAPRARPGAALTRQYA--PQA 87 Query: 77 QGRLQEAPTFARAAKAKATHIANPGTARYQQFDDNPVKQVAQNPLATFSLDVDTGSYANV 136 A + A Y ++ DNPV + P++TF DVDTGSY NV Sbjct: 88 YSAQPAAVSLLPAPSGYYAPPQAEERENYARYRDNPVVAAQEQPVSTFGADVDTGSYTNV 147 Query: 137 RRFLNQGLLPPPDAVRVEEIVNYFPSDWDIKDKQSIPASKPIPFAMRYELAPAPWNEQRT 196 RR LN+G LPPPDAVR EE +NYF + P S+ PF++ E++ APWN QR Sbjct: 148 RRLLNEGRLPPPDAVRAEEFINYFDYGYAT------PDSRQQPFSIITEVSAAPWNPQRQ 201 Query: 197 LLKVDILAKDRKSEELPASNLVFLIDTSGSMISDERLPLIQSSLKLLVKELREQDNIAIV 256 LLK+ I +++PA+NLVFL+DTSGSM ++LPLI+ +LK LV +LR QD +AIV Sbjct: 202 LLKIGIQGYRVAPQDIPAANLVFLVDTSGSMAERDKLPLIKGALKQLVAQLRPQDRVAIV 261 Query: 257 TYAGDSRIALPSISGSHKAEINAAIDSLDAEGSTNGGAGLELAYQQATKGFIKGGINRIL 316 TYAG + + L S G KA INAAID L A GSTNGGAGL+LAY QA KGF+KGG+NRIL Sbjct: 262 TYAGQASMTLDSTPGDQKARINAAIDELRAAGSTNGGAGLDLAYAQAAKGFVKGGVNRIL 321 Query: 317 LATDGDFNVGIDDPKSIESMVKKQRESGVTLSTFGVGNSNYNEAMMVRIADVGNGNYSYI 376 LA+DGDFNVG D + ++ + +QR+ G+ L+T GVG N+N+A+ +++AD GNG+Y Y+ Sbjct: 322 LASDGDFNVGATDLEDLKDKIARQRQGGIALTTLGVGGGNFNDALAMQLADAGNGSYHYL 381 Query: 377 DTLSEAQKVLNSEMRQMLITVAKDVKAQIEFNPAWVTEYRQIGYEKRQLRVEHFNNDNVD 436 D+L EA+KVL ++M L+T+A+DVK Q+EFNPA V EYR IGYEKR L E FNND VD Sbjct: 382 DSLREARKVLAAQMSSTLLTIARDVKIQVEFNPAVVAEYRLIGYEKRALAREDFNNDRVD 441 Query: 437 AGDIGAGKHITLLFELT-LNGQKASIDKLRYAPDNKLAKSDKTKELAWLKIRWKYPQGKE 495 AG+IGAG ++T L+E+T L A +D LRY A + ELA++++R+K P + Sbjct: 442 AGEIGAGANVTALYEITPLAAGGARLDPLRYG--KPAADAGPADELAFVRVRYKLPGASD 499 Query: 496 SQLVEF--PLGPTINAPSEDMRFRAAVAAYGQKLRGSEYLNNTSWQQIKQWAQQAKGEDP 553 SQLVE P A ++ MR AA AA+ Q LRG +YL++ S QI A+ A+G+DP Sbjct: 500 SQLVEQAVPRADARAAGTDGMRRAAAAAAFAQWLRGGKYLDDYSPAQIAALARGARGDDP 559 Query: 554 QGYRAEFIRLIELADGV 570 G AE L+E+A G+ Sbjct: 560 HGLNAELAALVEMAAGL 576 >UniRef50_A9AWD1 von Willebrand factor type A n=1 Tax=Herpetosiphon aurantiacus ATCC 23779 RepID=A9AWD1_HERA2 Length = 610 Score = 410 bits (1054), Expect = e-113, Method: Composition-based stats. Identities = 172/590 (29%), Positives = 291/590 (49%), Gaps = 44/590 (7%) Query: 1 MRNKNIIMLLMSSLILSGCGPQPENKESQQQQPSTPTEQQVLAAQQAAIKEAEQSAAAAK 60 MR K +++ +LI+S CG + Q + + +A + + A + Sbjct: 47 MRLKRS-SIVLIALIISACGGEASLPTINPQPQRPAPQPRPTSAADDQSAQWPTAEATSV 105 Query: 61 ALAQQEVQQYSDKQALQGRLQEAPTFARAAKAKATHIANPGTAR-------------YQQ 107 A A Q + + A T +P ++ Sbjct: 106 APAPQPMPTQAADAGQPVPNPAAGKPLVDTWELPTQPIDPNPNYAYEQDQEIFDSMYFKN 165 Query: 108 FDDNPVKQVAQNPLATFSLDVDTGSYANVRRFLNQGLLPPPDAVRVEEIVNYFPSDWDIK 167 + NP + +PL+TF++D+D+ SY+ +R +NQGLLPP D+VRVEE +N F ++ Sbjct: 166 YGTNPFVRTETDPLSTFAMDIDSASYSLMRSSINQGLLPPADSVRVEEYLNAFDYEY--- 222 Query: 168 DKQSIPASKPIPFAMRYELAPAPWN-EQRTLLKVDILAKDRKSEELPASNLVFLIDTSGS 226 P + FA+ E+AP+P+ L+++ I A+ + + + L F+IDTSGS Sbjct: 223 -----PQPEDGDFAIYSEVAPSPFGGPNYELVQIGIQARSIEVADRKPAALTFVIDTSGS 277 Query: 227 MISDERLPLIQSSLKLLVKELREQDNIAIVTYAGDSRIALPSISGSHKAEINAAIDSLDA 286 M D RL +++++L L +L D++AIV + R+ L SG ++ +I AI+SL+ Sbjct: 278 MAQDNRLEMVKNALIYLAGQLEPDDSLAIVAFNDGMRVVLNPTSGENQMDIITAINSLEP 337 Query: 287 EGSTNGGAGLELAYQQATKGFIKGGINRILLATDGDFNVGIDDPKSIESMVKKQRESGVT 346 GSTN AGL ++ A + F GINRILL +DG N G+ +P + + ++ ++GV Sbjct: 338 AGSTNAEAGLYKGFELAWQAFKPEGINRILLCSDGVANSGMTEPSQLLATFQQYLDAGVQ 397 Query: 347 LSTFGVGNSNYNEAMMVRIADVGNGNYSYIDTLSEAQKVLNSEMRQMLITVAKDVKAQIE 406 LST+GVG NYN+ ++ ++AD G+GNY+Y D+ EAQ++ ++ L T+ ++ K Q+ Sbjct: 398 LSTYGVGMGNYNDILLEQLADKGDGNYAYFDSADEAQRLFGEQLTGSLQTIGREAKIQVN 457 Query: 407 FNPAWVTEYRQIGYEKRQLRVEHFNNDNVDAGDIGAGKHITLLFELTLNGQKAS---IDK 463 F+P V YR IGYE R + F ND+VD G++GAG +T L+E+ + Sbjct: 458 FDPNVVKRYRLIGYENRAVADSDFRNDSVDGGEVGAGHSVTALYEIKRHPDAQGPIAQVN 517 Query: 464 LRYAPDNKLAKSDKTKELAWLKIRWKYPQGKESQLVEFPLGPTINAPSEDMRFRAAVAAY 523 +RY + A +++ ++ +I + + S M +VA Y Sbjct: 518 IRYISMDTNAPVEESLNISTAQI-----------------HSSFDRASARMHLATSVAEY 560 Query: 524 GQKLRGSEYLNNTSWQQIKQWAQQAKGEDPQGYRA-EFIRLIELADGVTD 572 + LR S + N T + A++A + P A EF+ L+ A+ + Sbjct: 561 AELLRHSRWNNGTDILDVLDLAEEAALDLPNNQSAVEFVTLLRRAEQMHQ 610 >UniRef50_B4WCU1 von Willebrand factor type A domain protein n=2 Tax=Caulobacteraceae RepID=B4WCU1_9CAUL Length = 613 Score = 409 bits (1050), Expect = e-112, Method: Composition-based stats. Identities = 193/570 (33%), Positives = 284/570 (49%), Gaps = 22/570 (3%) Query: 17 SGCGPQPENKESQQQQPSTPTEQQVLAAQQAAIKEAEQSAAAAKALAQQEVQQYSDKQAL 76 + C P + T A + + Sbjct: 44 AACAPLGFVLSPPGEPMQHLTSPVRPTPIPAPTQSRIEGVPPPPPPPPPPPPPAPPAPMR 103 Query: 77 QGRLQEAPTFARAAKAKAT-------HIANPGTARYQQFDDNPVKQVAQNPLATFSLDVD 129 + AP A T A T Y NPVK+ A P++TFS+DVD Sbjct: 104 PAVVAAAPGQAPLNAVVVTGSRIMPGAPAPSDTETYPDATPNPVKRTADQPVSTFSIDVD 163 Query: 130 TGSYANVRRFLNQGLLPPPDAVRVEEIVNYFPSDWDIKDKQSIPASKPIPFAMRYELAPA 189 T +Y+NVRRF+++G PP DAVRVEE++N F + + P S PFA+ + + Sbjct: 164 TAAYSNVRRFIDEGRSPPADAVRVEELINAFDYGY------ARPTSLARPFAITTAVVAS 217 Query: 190 PWNEQ-----RTLLKVDILAKDRKSEELPASNLVFLIDTSGSMISDERLPLIQSSLKLLV 244 PW + R ++ + + + E NL FL+D SGSM S ++L L + ++ L + Sbjct: 218 PWAPRTERGGRQIVHIGLQGYELPQGEQRPLNLTFLVDVSGSMRSPDKLDLAKQAMNLAI 277 Query: 245 KELREQDNIAIVTYAGDSRIALPSISGSHKAEINAAIDSLDAEGSTNGGAGLELAYQQAT 304 LR QD +++ YA + L G K ++ A+ SL A G T G G+ AY QA Sbjct: 278 DRLRPQDTLSVTYYAEGAGTTLQPTPGDQKLKMRCAVASLRASGGTAGATGMTNAYDQAQ 337 Query: 305 KGFIKGGINRILLATDGDFNVGIDDPKSIESMVKKQRESGVTLSTFGVGNSNYNEAMMVR 364 F + +NRIL+ TDGDFNVG+ D K +E V ++R +GV LS +G G NY +A M Sbjct: 338 ASFARDKVNRILMFTDGDFNVGVTDNKRLEDYVAEKRGTGVYLSVYGFGRGNYQDARMQT 397 Query: 365 IADVGNGNYSYIDTLSEAQKVLNSEMRQMLITVAKDVKAQIEFNPAWVTEYRQIGYEKRQ 424 IA GNG +Y+ L +A+++ + +A DVK Q+EFNPA V E+R IGYE R Sbjct: 398 IAQAGNGVAAYVGDLRDARRLFGPMFDKGAFPIADDVKIQVEFNPARVAEWRLIGYETRL 457 Query: 425 LRVEHFNNDNVDAGDIGAGKHITLLFELTLNGQKASIDKLRY-APDNKLAKSDKTKELAW 483 L F ND +DAG++G+G +T L+E+T G + + RY + D E+ + Sbjct: 458 LNEADFANDRIDAGEVGSGASVTALYEITPVGGPTQVPERRYPDNRIGVGGGDPNGEIGF 517 Query: 484 LKIRWKYPQGKESQLVEFPL--GPTINAPSEDMRFRAAVAAYGQKLRGSEYL-NNTSWQQ 540 +++R+K P G S L++ PL P E R+ AVAA+GQKLR ++ + W Q Sbjct: 518 IQVRYKQPGGSRSDLIQQPLTSRAAGAQPPEATRWALAVAAFGQKLRNDPWMSADYGWDQ 577 Query: 541 IKQWAQQAKGEDPQGYRAEFIRLIELADGV 570 + AQ A+GEDP G RAEF++L+ A + Sbjct: 578 VLAQAQGARGEDPWGDRAEFVQLVRAARDL 607 >UniRef50_D0XSR4 von Willebrand factor type A n=1 Tax=Brevundimonas subvibrioides ATCC 15264 RepID=D0XSR4_9CAUL Length = 625 Score = 404 bits (1038), Expect = e-111, Method: Composition-based stats. Identities = 192/560 (34%), Positives = 289/560 (51%), Gaps = 19/560 (3%) Query: 23 PENKESQQQQPSTPTEQQVLAAQQAAIKEAEQSAAAAKALAQQEVQQYSDKQALQGRLQE 82 + P Q + A + Sbjct: 67 GTRTGAYAPPPPVTLPIQPTTKEDRVAYGAPPPPPPPPPPPPPAPAALVRPPVVVTNAA- 125 Query: 83 APTFARAAKAKATHIANPGTARYQQFDDNPVKQVAQNPLATFSLDVDTGSYANVRRFLNQ 142 T T RY NPV++VA P++TFS+DVDT +YANVRRF+++ Sbjct: 126 GQTTVDGVVVPGRPGTRVDTERYPDATPNPVRRVADEPVSTFSIDVDTAAYANVRRFISE 185 Query: 143 GLLPPPDAVRVEEIVNYFPSDWDIKDKQSIPASKPIPFAMRYELAPAPWNEQ-----RTL 197 G PP DAVRVEE++NYF + + P PFA+ +A +PW+ R + Sbjct: 186 GQTPPRDAVRVEEMINYFDYGY------ARPGRADEPFAVSTAVAASPWSANAGAGGRQI 239 Query: 198 LKVDILAKDRKSEELPASNLVFLIDTSGSMISDERLPLIQSSLKLLVKELREQDNIAIVT 257 + + + + + E NL F++D SGSM S ++L L Q ++ L++ LR +D +A+ Sbjct: 240 VHIGLQGYELPAGERRPLNLTFMVDVSGSMQSPDKLGLAQQTMNLIIDRLRPEDRVAVTY 299 Query: 258 YAGDSRIALPSISGSHKAEINAAIDSLDAEGSTNGGAGLELAYQQATKGFIKGGINRILL 317 YA D A+ GS K ++ A+ +L+A GST G G+ AY+QA F +NRIL+ Sbjct: 300 YASDVGTAVGPTPGSEKLKLRCAVAALNAGGSTAGAQGMVNAYEQAEAAFSPDKVNRILM 359 Query: 318 ATDGDFNVGIDDPKSIESMVKKQRESGVTLSTFGVGNSNYNEAMMVRIADVGNGNYSYID 377 TDGDFNVG+ D + +E V +R +G+ LS +G G NY +A M IA GNG +Y+D Sbjct: 360 FTDGDFNVGVTDDRRLEDYVADKRGTGIYLSVYGFGRGNYQDARMQTIAQAGNGVAAYVD 419 Query: 378 TLSEAQKVLNSEMRQMLITVAKDVKAQIEFNPAWVTEYRQIGYEKRQLRVEHFNNDNVDA 437 L EA+ + + +A DVK Q+EFNPA V+EYR IGYE R L E F ND +DA Sbjct: 420 DLDEARCLFGPAFDRGAFPIADDVKIQVEFNPARVSEYRLIGYETRLLNEEDFANDAIDA 479 Query: 438 GDIGAGKHITLLFELTLNGQKASIDKLRYAPDNK-LAKSDKTKELAWLKIRWKYPQGKES 496 G++G+G +T L+E+T G + I + RY + A D T E+ ++++R+K P S Sbjct: 480 GEVGSGASVTALYEITPVGGASQIPERRYEANRAGDAGGDPTGEIGFVQVRYKLPGQPTS 539 Query: 497 QLVEFPLGPTINAP-----SEDMRFRAAVAAYGQKLRGSEYLN-NTSWQQIKQWAQQAKG 550 +L++ P+ T + P E R+ AVA +GQ+LRG ++ + I AQ +G Sbjct: 540 RLIQQPISGTTDGPGSARLPEATRWAMAVAGFGQRLRGDPWMGADFDTAAILDLAQGVRG 599 Query: 551 EDPQGYRAEFIRLIELADGV 570 EDP G RA F++++ A+ + Sbjct: 600 EDPYGDRAAFVQMVRAAESL 619 >UniRef50_A9GVG2 Putative uncharacterized protein n=1 Tax=Sorangium cellulosum 'So ce 56' RepID=A9GVG2_SORC5 Length = 656 Score = 402 bits (1033), Expect = e-110, Method: Composition-based stats. Identities = 169/483 (34%), Positives = 263/483 (54%), Gaps = 31/483 (6%) Query: 98 ANPGTARYQQFDDNPVKQVAQNPLATFSLDVDTGSYANVRRFLNQGLLPPPDAVRVEEIV 157 G+ Y+ + NPV+ A++ L+TF++DVDT SYA RR + G LPP AVR EE + Sbjct: 190 EPQGSETYRDYGVNPVEDPAKDRLSTFAIDVDTASYAIARRKIMDGALPPYQAVRAEEFL 249 Query: 158 NYFPSDWDIKDKQSIPASKPIPFAMRYELAPAPWNEQRTLLKVDILAKDRKSEELPASNL 217 NYF + + PFA+ AP+P+ L++V + K +E +L Sbjct: 250 NYFDYGYA--------SPAAGPFAVHLAAAPSPFTSGHHLVRVAVQGKRVPVKERTPVHL 301 Query: 218 VFLIDTSGSMISDERLPLIQSSLKLLVKELREQDNIAIVTYAGDSRIALPSISGSHKAEI 277 V+L+DTSGSM S +++ L + SLK+L L+ D +A+ TYAG R L K +I Sbjct: 302 VYLVDTSGSMQSPDKIELAKKSLKMLTDTLKPGDTVALCTYAGSVREVLAPTGIESKGKI 361 Query: 278 NAAIDSLDAEGSTNGGAGLELAYQQATKGFIKGGINRILLATDGDFNVGIDDPKSIESMV 337 AA+ L A GST +G++LAY A + +KG +NR+++ +DGD NVG I + Sbjct: 362 LAALADLTAGGSTAMSSGIDLAYSLAERTLVKGHVNRVIVLSDGDANVGPTSHDEILKTI 421 Query: 338 KKQRESGVTLSTFGVGNSNYNEAMMVRIADVGNGNYSYIDTLSEAQKVLNSEMRQMLITV 397 K+ R+ G+TLST G G NY + MM ++A+ G+GNY+YID+ ++A++V + ++ ML + Sbjct: 422 KRARDKGITLSTVGFGQGNYKDLMMEQLANQGDGNYAYIDSEAQARRVFSEQVGGMLQVI 481 Query: 398 AKDVKAQIEFNPAWVTEYRQIGYEKRQLRVEHFNNDNVDAGDIGAGKHITLLFELTLNGQ 457 A+DVK Q+EF+P++V YR IGYE R + F ND VDAG+IGAG +T ++++ L Sbjct: 482 ARDVKIQVEFDPSFVKSYRLIGYENRDVADRDFRNDKVDAGEIGAGHSVTAIYDVEL--- 538 Query: 458 KASIDKLRYAPDNKLAKSDKTKELAWLKIRWKYPQGKESQLVEFP------LGPTINAPS 511 A + +++R K P G + + PT +A Sbjct: 539 ------------KAPAPKGEGAAPIVVRLRHKAPLGSNTAEETLVKMAPGAIAPTFDAAP 586 Query: 512 EDMRFRAAVAAYGQKLRGSEYLNNTSWQQIKQWAQQAKGEDPQGYRAEFIRLIELADGVT 571 D RF +AVA + + LR S + + I++ A+ A +G + EFI +I A + Sbjct: 587 ADFRFASAVAGFAEVLRHSPHARSWRLADIEKIARAAASS--KGDQQEFIGIIRRAGALA 644 Query: 572 DIS 574 + Sbjct: 645 NGK 647 >UniRef50_A8SXC8 Putative uncharacterized protein n=2 Tax=Clostridiales RepID=A8SXC8_9FIRM Length = 612 Score = 397 bits (1019), Expect = e-109, Method: Composition-based stats. Identities = 183/499 (36%), Positives = 257/499 (51%), Gaps = 27/499 (5%) Query: 98 ANPGTARYQQFDDNPVKQVAQNPLATFSLDVDTGSYANVRRFLNQGLLPPPDAVRVEEIV 157 T Y +N PL+TF+ D DT SY+NVR ++ G LPP AVR+EE++ Sbjct: 117 VAYDTREYDSMTENGFVSTVDRPLSTFAADRDTASYSNVRSYIESGSLPPDGAVRIEEML 176 Query: 158 NYFPSDWDIKDKQSIPASKPIPFAMRYELAPAPWNEQRTLLKVDILAKDRKSEELPASNL 217 NYF D+ K F++ E + PWN+ L+ V I + + SNL Sbjct: 177 NYFTYDYRKK------PEDGEKFSIYTEYSDCPWNKDTKLMMVGINTDEIDFGDKKPSNL 230 Query: 218 VFLIDTSGSMISDERLPLIQSSLKLLVKELREQDNIAIVTYAGDSRIALPSISGSHKAEI 277 VFLIDTSGSM D +LPL+Q S +L + L E D ++IVTYAG+ + L GS + I Sbjct: 231 VFLIDTSGSMYDDNKLPLVQQSFAMLAENLDENDRVSIVTYAGEDTVVLSGTPGSEQYTI 290 Query: 278 NAAIDSLDAEGSTNGGAGLELAYQQATKGFIKGGINRILLATDGDFNVGIDDPKSIESMV 337 + A+ ++ AEG TNGG + AY+ A K FI GG NR++LATDGD NVG+ + ++ Sbjct: 291 SEALSNMTAEGCTNGGDAIITAYELAEKNFINGGNNRVILATDGDLNVGLTSESDLVDLI 350 Query: 338 -KKQRESGVTLSTFGVGNSNYNEAMMVRIADVGNGNYSYIDTLSEAQKVLNSEMRQMLIT 396 ++++E+ + LS G G N + + +AD G+G+Y++ID+ EA+KVL EM L T Sbjct: 351 TEEKKENNIFLSVLGFGTDNLKDNKLEALADNGDGSYAFIDSAYEAKKVLVDEMGGTLNT 410 Query: 397 VAKDVKAQIEFNPAWVTEYRQIGYEKRQLRVEHFNNDNVDAGDIGAGKHITLLFELTLNG 456 VAKDVK Q+EFNP V YRQIGYE R L F ND VD G+IGAG +T+L+E+ G Sbjct: 411 VAKDVKFQLEFNPTNVKGYRQIGYENRALADADFANDAVDGGEIGAGHMVTVLYEIVPAG 470 Query: 457 QKASIDKLRYAP------------------DNKLAKSDKTKELAWLKIRWKYPQGKESQL 498 + + D + + ELA + IR+K P G +S L Sbjct: 471 SDFEVPAANHKYGENINQVNTAESTSQDLRDKSDSAENYAGELATVNIRYKDPDGDKSNL 530 Query: 499 VEFPLGPTI--NAPSEDMRFRAAVAAYGQKLRGSEYLNNTSWQQIKQWAQQAKGEDPQGY 556 V + S DM +AVAAYG L+ SEY + G Sbjct: 531 VSCVVKTDSYNGGMSADMSAASAVAAYGMLLKNSEYAGAADLDMVLSLVSGKTGSSSDSD 590 Query: 557 RAEFIRLIELADGVTDISQ 575 ++ + AD V + Sbjct: 591 DIIDMQWQDFADMVRQTQK 609 >UniRef50_B0UJ22 von Willebrand factor type A n=1 Tax=Methylobacterium sp. 4-46 RepID=B0UJ22_METS4 Length = 654 Score = 393 bits (1010), Expect = e-108, Method: Composition-based stats. Identities = 206/523 (39%), Positives = 280/523 (53%), Gaps = 23/523 (4%) Query: 58 AAKALAQQEVQQYSDKQALQGRLQE--APTFARAAKAKATHIANP-GTARYQQFDDNPVK 114 A +A A + +Q + + ARAA A + P G R+ + + Sbjct: 119 AGEADAGRTLQAFRSSGGFRFEASPRGPAAMARAAGETAPVPSEPVGRDRFANAPEGGFR 178 Query: 115 QVAQNPLATFSLDVDTGSYANVRRFLNQGLLPPPDAVRVEEIVNYFPSDWDIKDKQSIPA 174 + P++T SL VDT SY VR LN+ LPPP AVR EE++NYFP + PA Sbjct: 179 ITREAPVSTVSLGVDTASYGIVRDALNRNHLPPPAAVRTEELINYFPYAYP------APA 232 Query: 175 SKPIPFAMRYELAPAPWNEQRTLLKVDILAKDRKSEELPASNLVFLIDTSGSMISDERLP 234 S PF + + P+PW E R LL + I E P +NLVFL+DTSGSM + RLP Sbjct: 233 SPDAPFRVTASVFPSPWAEGRKLLHIGIRGYAVAPAERPPANLVFLVDTSGSMAAPNRLP 292 Query: 235 LIQSSLKLLVKELREQDNIAIVTYAGDSRIALPSISGSHKAEINAAIDSLDAEGSTNGGA 294 L++ SL +L+ L +D +A+V YAG+ L I AAI++L A GST GG Sbjct: 293 LVKQSLAMLLTTLDARDRVALVAYAGEVGTVLEPTPAGEAGRILAAIETLQAHGSTAGGE 352 Query: 295 GLELAYQQATKGFIKGGINRILLATDGDFNVGIDDPKSIESMVKKQRESGVTLSTFGVGN 354 G+ AY A + F +NR++LATDGDFNVGI + V ++R G+ LS G G Sbjct: 353 GIRQAYALAARHFDPKAVNRVILATDGDFNVGITGRDELTGFVARERRKGIFLSVLGFGM 412 Query: 355 SNYNEAMMVRIADVGNGNYSYIDTLSEAQKVLNSEMRQMLITVAKDVKAQIEFNPAWVTE 414 N N+A+M +A GNG ++IDT EA+KVL E LI +A+DVK Q+EFNPA V E Sbjct: 413 GNLNDALMQALAKDGNGVAAHIDTAQEARKVLVEEATSTLIPIARDVKIQVEFNPATVAE 472 Query: 415 YRQIGYEKRQLRVEHFNNDNVDAGDIGAGKHITLLFELTLNGQKASIDKLRYAPDNKLAK 474 YR IGYE R L F ND DAG++G+G+ +T L+E+ K LRYAP Sbjct: 473 YRLIGYETRPLDRADFANDEADAGEVGSGQTVTALYEIVPADGKRVTGDLRYAPHEAAPA 532 Query: 475 SDKTKELAWLKIRWKYPQGKESQLVEFPLGPTINAPSEDMRFRAAVAA---------YGQ 525 ++ A + IR+K P +ES L+E P+GP A RF A +GQ Sbjct: 533 PAS-RDYAHVAIRFKRPDARESTLIETPVGPEGEAA----RFAEAPQEARFAAAVAAFGQ 587 Query: 526 KLRGSEYLNNTSWQQIKQWAQQAKGEDPQGYRAEFIRLIELAD 568 LRG ++ S + + A A+G+DP GYRAEF+ L+ A Sbjct: 588 ILRGGKHTGRFSLDDVIRIAAPARGDDPFGYRAEFLGLVRAAK 630 >UniRef50_Q1CVN5 von Willebrand factor type A domain protein n=2 Tax=Cystobacterineae RepID=Q1CVN5_MYXXD Length = 700 Score = 389 bits (999), Expect = e-106, Method: Composition-based stats. Identities = 183/552 (33%), Positives = 272/552 (49%), Gaps = 41/552 (7%) Query: 29 QQQQPSTPTEQQVLAAQQAAIKEAEQSAAAAKALAQQEVQQYSDKQALQGRLQEAPTFAR 88 + + A + A+ +Q + K R Sbjct: 157 PGGRTGATRVYEPPNAMSRPHGVSLNGPPASTLPSQPLGRPGPPKPQSAPRF-VGRDVEP 215 Query: 89 AAKAKATHIANPGTARYQQFDDNPVKQVAQNPLATFSLDVDTGSYANVRRFLNQGLLPPP 148 A A A +P +Q + NP + +TFS+D D+ SY R +L +G LP Sbjct: 216 PAPAVAPAPVSPFHMYFQGYGVNPTINTEEERFSTFSVDTDSASYTLTRAYLERGSLPNE 275 Query: 149 DAVRVEEIVNYFPSDWDIKDKQSIPASKPIPFAMRYELAPAPWNEQRTLLKVDILAKDRK 208 AVRVEE VN F + PF+++ E P+P + ++ V + A++ Sbjct: 276 QAVRVEEFVNTFDYGYAH--------QGSAPFSVQVEGFPSPVRKGYHVVHVGVKAREVS 327 Query: 209 SEELPASNLVFLIDTSGSMISDERLPLIQSSLKLLVKELREQDNIAIVTYAGDSRIALPS 268 + S+LVF+ID SGSM + RL L++ +L LLV EL E+D ++IV Y +R+ L Sbjct: 328 RPQRKPSHLVFVIDVSGSMNLENRLGLVKRALHLLVNELDERDQVSIVVYGSTARLVLEP 387 Query: 269 ISGSHKAEINAAIDSLDAEGSTNGGAGLELAYQQATKGFIKGGINRILLATDGDFNVGID 328 S H I AAIDSL EGSTN AGLE+ Y A ++GGINR++L +DG N G+ Sbjct: 388 TSAVHAHIIRAAIDSLHTEGSTNAQAGLEMGYSLAASHLVEGGINRVILCSDGVANTGLT 447 Query: 329 DPKSIESMVKKQRESGVTLSTFGVGNSNYNEAMMVRIADVGNGNYSYIDTLSEAQKVLNS 388 D SI ++ + G+TLST G G NYN+ +M R++ VG GNY+Y+D + EA ++ Sbjct: 448 DANSIWERIRARAAKGITLSTVGFGMGNYNDVLMERLSQVGEGNYAYVDRIEEAHRIFVR 507 Query: 389 EMRQMLITVAKDVKAQIEFNPAWVTEYRQIGYEKRQLRVEHFNNDNVDAGDIGAGKHITL 448 ++ L VAKDVK Q+EF+P V+ YR +GYE R L E F +D VDAG+IGAG +T Sbjct: 508 DLTGTLQVVAKDVKLQMEFDPKAVSHYRLLGYENRMLTKEQFADDRVDAGEIGAGHAVTA 567 Query: 449 LFELTLNGQKASIDKLRYAPDNKLAKSDKTKELAWLKIRWKYPQGKESQLVEFPLGPTI- 507 L+E+ L AS L+IR+K P+G +S+L+E PL ++ Sbjct: 568 LYEVKLTEPSAS--------------------FGTLRIRYKAPEGGDSKLIEKPLPSSVL 607 Query: 508 ----NAPSEDMRFRAAVAAYGQKLRGSEYLNNTSWQQI----KQWAQQAKGEDPQGYRAE 559 + R AA+ +KLRGS ++ ++ + ++ Q K D AE Sbjct: 608 RPAYGRAAPPTRLSYVAAAFAEKLRGSYWVRPLTYDALFSFWEEIGQPLKARD---DVAE 664 Query: 560 FIRLIELADGVT 571 LI+ A + Sbjct: 665 LGALIQKARALD 676 >UniRef50_A6GHE4 von Willebrand factor, type A n=1 Tax=Plesiocystis pacifica SIR-1 RepID=A6GHE4_9DELT Length = 785 Score = 389 bits (998), Expect = e-106, Method: Composition-based stats. Identities = 178/466 (38%), Positives = 270/466 (57%), Gaps = 29/466 (6%) Query: 113 VKQVAQNPLATFSLDVDTGSYANVRRFLNQGLLPPPDAVRVEEIVNYFPSDWDIKDKQSI 172 ++ +TFS+DVDT SYA+VR+ L G +P P +VR EE++NYF + + Sbjct: 255 FVATGEDRKSTFSIDVDTASYASVRQSLRNGWMPDPGSVRTEEMINYFDYGYVAPSGGA- 313 Query: 173 PASKPIPFAMRYELAPAPWNEQRTLLKVDILAK---DRKSEELPASNLVFLIDTSGSMIS 229 PFA+ E+ P PW L+++ + A +++EL NLVFL+D SGSM S Sbjct: 314 ----GAPFAVHTEVGPCPWAPDHRLVQIGVQATRELPAQAQELRTRNLVFLLDVSGSMSS 369 Query: 230 DERLPLIQSSLKLLVKELREQDNIAIVTYAGDSRIALPSISGSHKAEINAAIDSLDAEGS 289 +LPLI+ LV++L +D+++IV YAG + + LP SG K I A+D L+A G Sbjct: 370 RGKLPLIKHGFTQLVEQLGAEDHVSIVVYAGAAGVVLPPTSGDQKETILGALDRLEAGGG 429 Query: 290 TNGGAGLELAYQQATKGFIKGGINRILLATDGDFNVGIDDPKSIESMVKKQRESGVTLST 349 TNG AG+ AY+ A F+ GG+NR++L TDGDFNVG+ D ++ +++++RESGV LS Sbjct: 430 TNGSAGIVEAYELAQANFVDGGVNRVILGTDGDFNVGLSDHDALVELIEQKRESGVFLSV 489 Query: 350 FGVGNSNYNEAMMVRIADVGNGNYSYIDTLSEAQKVLNSEMRQMLITVAKDVKAQIEFNP 409 GVG +Y++ +M ++AD GNGNY+++D EA+KVL E+ L T+AKDVK Q+ FNP Sbjct: 490 LGVG-GHYDDELMEQLADHGNGNYAFLDGKREAEKVLVEEIGGTLTTIAKDVKVQVAFNP 548 Query: 410 AWVTEYRQIGYEKRQLRVEHFNNDNVDAGDIGAGKHITLLFELTLNGQKASIDKLRYAPD 469 VT++R I Y+ R+L FN+D DAG+IG G ++T L+E+ + Sbjct: 549 EQVTKHRLIAYQNRRLAHRDFNDDTKDAGEIGVGHNVTALYEIIPADE------------ 596 Query: 470 NKLAKSDKTKELAWLKIRWKYPQGKESQLVEFPLGP---TINAPSEDMRFRAAVAAYGQK 526 ++ ++ L L++R+K P G S V + +++ S+D RF AAVA +G+ Sbjct: 597 -----AEASEALMSLELRYKKPDGHRSTKVTTSVRDAGRSLDQNSDDFRFAAAVAGFGES 651 Query: 527 LRGSEYLNNTSWQQIKQWAQQAKGEDPQGYRAEFIRLIELADGVTD 572 L G + ++ + AQ A GED + R EF+ L A Sbjct: 652 LAGRRPDASWNYADTLELAQGALGEDARCLRHEFLELAWRAGMAEG 697 >UniRef50_Q1D6F9 von Willebrand factor type A domain protein n=2 Tax=Cystobacterineae RepID=Q1D6F9_MYXXD Length = 592 Score = 379 bits (973), Expect = e-103, Method: Composition-based stats. Identities = 205/602 (34%), Positives = 305/602 (50%), Gaps = 65/602 (10%) Query: 9 LLMSSLILSGC---GPQPENKESQQQQPSTPTEQQVLAAQQAAIKEAEQSAA-------- 57 L+ + L C P + + S S + A + A++++A Sbjct: 16 ALLVASTLPACHNRSPAADERPSLGAAQSVARDDDAAHAPEREEYVADRASAEHSVAAPA 75 Query: 58 ------------AAKALAQQEVQQYSDKQALQGRLQEAPT-------FARAAKAKATHIA 98 A+A A Q ++ S +A R + P ++K A Sbjct: 76 PAAPPASALAGPVARAPAPQAAKKVSLGKAELHRREPRPMKPSADALAGAPLESKPQDAA 135 Query: 99 NPGTARYQQFDDNPVKQVAQNPLATFSLDVDTGSYANVRRFLNQGLLPPPDAVRVEEIVN 158 G ++ + N + A++PL+TF+ DVDT SY RR+L G LPP AVRVEE VN Sbjct: 136 PAGGNTFEAWKANAFVETAKDPLSTFAADVDTASYTVSRRYLVNGQLPPASAVRVEEFVN 195 Query: 159 YFPSDWDIKDKQSIPASKPIPFAMRYELAPAPWNEQRTLLKVDILAKDRKSEELPASNLV 218 YF + + + FA+ E AP+P++ +R L+V + K + ++LV Sbjct: 196 YFKFRYAPPETGA--------FAVHLEGAPSPFDAKRHFLRVGVQGKVVSRSQRKPAHLV 247 Query: 219 FLIDTSGSMISDERLPLIQSSLKLLVKELREQDNIAIVTYAGDSRIALPSISGSHKAEIN 278 FL+DTSGSM S+++LPL + ++K+ VK L E D +AIVTYAG++R LP + I+ Sbjct: 248 FLVDTSGSMHSEDKLPLAREAIKVAVKNLNENDTVAIVTYAGNTRDVLPPTPATDAKSIH 307 Query: 279 AAIDSLDAEGSTNGGAGLELAYQQATKGFIKGGINRILLATDGDFNVGID-DPKSIESMV 337 AA+DSL A G T G+G+ELAY+ A K ++R+++ TDGD N+G + ++ + Sbjct: 308 AALDSLTAGGGTAMGSGMELAYRHAVKKASGSVVSRVVVLTDGDANIGRNVSANAMLDSI 367 Query: 338 KKQRESGVTLSTFGVGNSNYNEAMMVRIADVGNGNYSYIDTLSEAQKVLNSEMRQMLITV 397 K GVTL+T G G NY + +M ++AD GNGN Y+D+L EA+KV +++ L + Sbjct: 368 HKYTAEGVTLTTVGFGMGNYRDDLMEKLADKGNGNCFYVDSLREAKKVFETQLTGTLEVI 427 Query: 398 AKDVKAQIEFNPAWVTEYRQIGYEKRQLRVEHFNNDNVDAGDIGAGKHITLLFELTLNGQ 457 AKDVK Q+EFNPA V YR +GYE R + F ND VDAG+IGAG ++T ++E+ L G Sbjct: 428 AKDVKFQVEFNPAAVRRYRLVGYENRDVADHDFRNDKVDAGEIGAGHNVTAVYEVELTG- 486 Query: 458 KASIDKLRYAPDNKLAKSDKTKELAWLKIRWKYPQGKESQLVEFP-----LGPTINAPSE 512 + T+ LA +++R K P G E+ EF L T+ S Sbjct: 487 ------------------EATEALATVRVRAKAPNGTEASEREFRFERTKLRDTLAQASP 528 Query: 513 DMRFRAAVAAYGQKLRGSEYLNNTSWQQIKQWAQQAKGEDPQGYRAEFIRLIELADGVTD 572 D RF AVAA LR S S ++ A+ A D R EF+RL+ A + Sbjct: 529 DFRFAVAVAATADVLRDSPSAEGWSLATAEKLAEGATEGDAD--RKEFVRLVTQARALKG 586 Query: 573 IS 574 S Sbjct: 587 AS 588 >UniRef50_C7N770 Uncharacterized protein containing a von Willebrand factor type A (VWA) domain n=1 Tax=Slackia heliotrinireducens DSM 20476 RepID=C7N770_SLAHD Length = 629 Score = 369 bits (946), Expect = e-100, Method: Composition-based stats. Identities = 177/525 (33%), Positives = 254/525 (48%), Gaps = 18/525 (3%) Query: 51 EAEQSAAAAKALAQQEVQQYSDKQALQGRLQEAPTFARAAKAKATHIANPGTARYQQFDD 110 AA + + + + T Y ++ Sbjct: 114 MPVAETKAASEDTMAGSANSYAPDGGLAYETDEAYETFDTLDEGAPMEDFNTEEYAAIEE 173 Query: 111 NPVKQVAQNPLATFSLDVDTGSYANVRRFLNQGLL---PPPDAVRVEEIVNYFPSDWDIK 167 N PL+T S DVDT SY N+RR +N G P AVR+EE++NYF D Sbjct: 174 NGFVSTVTRPLSTCSADVDTASYCNLRRMINDGYSLDEIPDGAVRIEEMLNYFHYD---- 229 Query: 168 DKQSIPASKPIPFAMRYELAPAPWNEQRTLLKVDILAKDRKSEELPASNLVFLIDTSGSM 227 S FA+R E A PWN+Q LL + A D+ SNLVFLID SGSM Sbjct: 230 ---SGEPEGNDLFAVRAESARCPWNDQTQLLVMTFTASDKAQTASKGSNLVFLIDISGSM 286 Query: 228 ISDERLPLIQSSLKLLVKELREQDNIAIVTYAGDSRIALPSISGSHKAEINAAIDSLDAE 287 ++L L++ S L++ L D ++IVTYA + L SG +I A++ L+A+ Sbjct: 287 DEPDKLDLLKDSFGTLLENLGPNDRVSIVTYAAGEDVLLEGASGDDTRKIMRALNRLEAD 346 Query: 288 GSTNGGAGLELAYQQATKGFIKGGINRILLATDGDFNVGIDDPKSIESMVKKQRESGVTL 347 GSTNG AGLE+AY+ A + +I+GG+NRI++A+DGD NVGI + V+++RE+GV L Sbjct: 347 GSTNGEAGLEMAYEVAERNYIEGGVNRIVMASDGDLNVGITSESDLYDFVEEKRETGVYL 406 Query: 348 STFGVGNSNYNEAMMVRIADVGNGNYSYIDTLSEAQKVLNSEMRQMLITVAKDVKAQIEF 407 S G G+ NY + M +AD GNG Y YID + EA++VL ++ + VA DVK Q+EF Sbjct: 407 SVLGFGSGNYKDTKMETLADHGNGTYHYIDCVEEAERVLGEDLTANFVPVADDVKLQVEF 466 Query: 408 NPAWVTEYRQIGYEKRQLRVEHFNNDNVDAGDIGAGKHITLLFELTLNGQKASIDKLRYA 467 NPA V YR IGYE R + E F N+ DA ++GAG T+ +EL L + + Sbjct: 467 NPAQVKAYRLIGYENRAMADEDFLNEAADAAEVGAGAQFTVAYELVLADSDYDVADVPDL 526 Query: 468 P--DNKLAKSDKTKELAWLKIRWKYPQGKE---SQLVEFPLGPTINAPSEDMRFRAAVAA 522 + A T E +R+K + SQ + +PS+D F ++V Sbjct: 527 KYGSGEAAGDSSTDEWLTCSMRYKAVDDDKAVRSQDLVVGADSQTESPSDDWVFASSVIE 586 Query: 523 YGQKLRGSEYLNNTSWQQIKQWAQQAKGEDPQGYRAEFIRLIELA 567 +G SE+ + + D R F L+E+A Sbjct: 587 FGMIASDSEFAEGLDTGDVLDQLDTIRLNDE---RQGFYDLVEIA 628 >UniRef50_C7PR69 von Willebrand factor type A n=1 Tax=Chitinophaga pinensis DSM 2588 RepID=C7PR69_CHIPD Length = 588 Score = 360 bits (923), Expect = 9e-98, Method: Composition-based stats. Identities = 176/547 (32%), Positives = 263/547 (48%), Gaps = 38/547 (6%) Query: 30 QQQPSTPTEQQVLAAQQAAIKEAEQSAAAAKALAQQEVQQYSDKQALQGRLQEAPTFARA 89 +Q S ++L A AE + AKA+A++ S+ + F Sbjct: 79 PKQLSAGAYSRMLVQMNAQEDPAEDAVKKAKAIARERSSNGSNPNYGNALMGTRAFFD-- 136 Query: 90 AKAKATHIANPGTARYQQFDDNPVKQVAQNPLATFSLDVDTGSYANVRRFLNQGLLPPPD 149 Y +N + F++DVD +Y+N+RRF+ P + Sbjct: 137 -------------ETYGTLYENKFIAAETQIPSLFAVDVDRAAYSNIRRFVKLKERIPAN 183 Query: 150 AVRVEEIVNYFPSDWDIKDKQSIPASKPIPFAMRYELAPAPWNEQRTLLKVDILAKDRKS 209 AVR+EE+VNYF + + A+ A PW E LL++ + K Sbjct: 184 AVRIEEMVNYFHYSYPL-------PPVGQTLAIYSNYATCPWAEDHRLLQIAVRGKSVNL 236 Query: 210 EELPASNLVFLIDTSGSMISDERLPLIQSSLKLLVKELREQDNIAIVTYAGDSRIALPSI 269 + LP SNLVFLID SGSM +LPL+Q++ ++LV LR D++AIV YAG + LPS Sbjct: 237 DSLPPSNLVFLIDVSGSMAMPNKLPLLQAAFRILVNNLRSNDHVAIVAYAGVPGVILPST 296 Query: 270 SGSHKAEINAAIDSLDAEGSTNGGAGLELAYQQATKGFIKGGINRILLATDGDFNVGIDD 329 GS K++I AID L A G+T G A ++LAYQ A + FIK G NR++LATDGDFNVG Sbjct: 297 PGSAKSKILNAIDYLSAGGATAGEAAIKLAYQIAEENFIKEGNNRVILATDGDFNVGQTS 356 Query: 330 PKSIESMVKKQRESGVTLSTFGVGNSNYNEAMMVRIADVGNGNYSYIDTLSEAQKVLNSE 389 +E ++ ++E+GV L+ G G NY ++ + ++ GNGN++YID L EA K+ E Sbjct: 357 DHDMEQLILGKKETGVLLTCLGFGMKNYKDSKLETLSSKGNGNFAYIDNLEEASKIFARE 416 Query: 390 MRQMLITVAKDVKAQIEFNPAWVTEYRQIGYEKRQLRVEHFNNDNVDAGDIGAGKHITLL 449 L TVA+DV+A + FNP V YR IGYE + ++ + + Sbjct: 417 FGSTLFTVARDVQADVVFNPRTVKSYRLIGYENKVIKDDDSASQIGGG------------ 464 Query: 450 FELTLNGQKASIDKLRYAPDNKLAKSDKTKELAWLKIRWKYPQGKESQLVEFP-LGPTIN 508 + ++ P L +D L R + + P + T Sbjct: 465 ---IIGAGHCAVAIYEIVPQKGLMPADSMLAAVHLAYRETTDTTIKRLFYKVPDIFTTFQ 521 Query: 509 APSEDMRFRAAVAAYGQKLRGSEYLNNTSWQQIKQWAQQAKGEDPQGYRAEFIRLIELAD 568 S+D RF +AVA G LR S Y + S + A+++ G+DP GYR EFI L++ Sbjct: 522 QSSDDFRFASAVALMGMLLRKSGYKGSGSCDMVMDIARRSLGDDPGGYRREFITLLKDLK 581 Query: 569 GVTDISQ 575 ++ + Sbjct: 582 KSKNLEK 588 >UniRef50_A9NFD9 Surface-anchored VWFA domain protein n=1 Tax=Acholeplasma laidlawii PG-8A RepID=A9NFD9_ACHLI Length = 486 Score = 359 bits (921), Expect = 2e-97, Method: Composition-based stats. Identities = 154/476 (32%), Positives = 248/476 (52%), Gaps = 31/476 (6%) Query: 101 GTARYQQFDDNPVKQVAQNPLATFSLDVDTGSYANVRRFLNQGLLPPPDAVRVEEIVNYF 160 +Q+ +NP V+ N + SL +T SY+ +R +N G +AVR+EE+VN+F Sbjct: 39 NDDEHQEIIENPFIDVSVNNKSNISLSANTASYSFIRSQINSGRAVDRNAVRIEEMVNFF 98 Query: 161 PSDWDIKDKQSIPASKPIPFAMRYELAPAPWNEQRTLLKVDILAKDRKSEELPASNLVFL 220 +++ + F + EL PWN + LL + + K ++P+ N+V L Sbjct: 99 NYNYNQPET-------DKTFGFKSELIQTPWNNETHLLLIGLETKQVDLGDIPS-NIVIL 150 Query: 221 IDTSGSMISDERLPLIQSSLKLLVKELREQDNIAIVTYAGDSRIALPSISGSHKAEINAA 280 +D SGSM + +L L + +++LL+++++ D I++VTY+ ++ S A + + Sbjct: 151 LDVSGSMSATNKLSLAKKAMELLIEQMKPNDVISLVTYSSGEKVVFKGKSIDDMAYMTSQ 210 Query: 281 IDSLDAEGSTNGGAGLELAYQQATKGFIKGGINRILLATDGDFNVGIDDPKSIESMVKKQ 340 I L A GST G GL++AY+ A + FI+GG NRI+LATDGDFNVGI + + ++ Sbjct: 211 IRLLKASGSTAGKKGLDMAYKVAEEYFIEGGNNRIILATDGDFNVGISSTDMLIEYISEK 270 Query: 341 RESGVTLSTFGVGNSNYNEAMMVRIADVGNGNYSYIDTLSEAQKVLNSEMRQMLITVAKD 400 RESG+ S +G G N+ + + R+A GNG Y YID + A+K + +L TVA+D Sbjct: 271 RESGIYFSAYGFGYGNFKDEKLERVAKAGNGTYHYIDDIISARKAFVDNIDGVLYTVARD 330 Query: 401 VKAQIEFNPAWVTEYRQIGYEKRQLRVEHFNNDNVDAGDIGAGKHITLLFELTLNGQKAS 460 KAQI F+ + V EYR IGYE RQL + F++ DAG+IG G +T ++EL LN + Sbjct: 331 AKAQIVFDASAVLEYRLIGYENRQLTDDEFDDGTTDAGEIGTGLQVTAIYELKLNEGAS- 389 Query: 461 IDKLRYAPDNKLAKSDKTKELAWLKIRWK-YPQGKESQLVE--FPLGPTINAPSEDMRFR 517 ++ L IR+K + ++QL E L PS D +F Sbjct: 390 -------------------DVGSLTIRYKNHDITDDTQLEEAFTVLNAINENPSVDAKFI 430 Query: 518 AAVAAYGQKLRGSEYLNNTSWQQIKQWAQQAKGEDPQGYRAEFIRLIELADGVTDI 573 ++V +G L S+Y + + + + YR +FI ++ +T Sbjct: 431 SSVVEFGLILMDSKYKVDADLGAVLERIETETYNLEDYYRNDFIDVLNTYKDMTTS 486 >UniRef50_B4DA43 von Willebrand factor type A n=1 Tax=Chthoniobacter flavus Ellin428 RepID=B4DA43_9BACT Length = 883 Score = 357 bits (917), Expect = 5e-97, Method: Composition-based stats. Identities = 181/508 (35%), Positives = 269/508 (52%), Gaps = 18/508 (3%) Query: 23 PENKESQQQQPSTPTEQQVLAAQQAAIKEAEQSAAAAKALAQQEVQQYSDKQALQGRLQE 82 P +P + + EA+ S A A + + ++ G Sbjct: 241 PVGVPLDSAEPQGGAALSKAVTPKDKLAEADASKAMPVAAWARVRRGFAATSGGIGDNSY 300 Query: 83 APTFARAAKAKATHIANPGTARYQQFDDNPVKQVAQNPLATFSLDVDTGSYANVRRFLNQ 142 KA++ + +N V +NPL+TFS+DVDT SYA VRR+LN Sbjct: 301 GLDDRGGIADKASNA-----NSFDTLTENAFLNVPENPLSTFSIDVDTASYAIVRRYLND 355 Query: 143 GLLPPPDAVRVEEIVNYFPSDWDIKDKQSIPASKPIPFAMRYELAPAPWNEQRTLLKVDI 202 LPP AVR+EE++NYFP D+ PF+ E+A PW + L++V + Sbjct: 356 NHLPPTGAVRIEELLNYFPYDYPQPQ-------GAAPFSATMEVATCPWAPEHRLVRVGL 408 Query: 203 LAKDRKSEELPASNLVFLIDTSGSMISDERLPLIQSSLKLLVKELREQDNIAIVTYAGDS 262 ++ +E P SNLVFLID SGSM +LPL+Q LLV++L +D ++IVTYA + Sbjct: 409 KGREIPKDERPPSNLVFLIDVSGSMNMPNKLPLLQKCFSLLVEQLGPKDRVSIVTYASGT 468 Query: 263 RIALPSISGSHKAEINAAIDSLDAEGSTNGGAGLELAYQQATKGFIKGGINRILLATDGD 322 ++ L K + AID L A G T+G +G++LAY+ A + FI GG NR++LATDGD Sbjct: 469 KLVLEPT--QDKEAMQTAIDGLHAGGGTHGSSGIDLAYRMAQQSFIPGGTNRVILATDGD 526 Query: 323 FNVGIDDPKSIESMVKKQRESGVTLSTFGVGNSNYNEAMMVRIADVGNGNYSYIDTLSEA 382 +N+GI + + SM+ ++ +SGV L+ G G N ++M+V++AD GNG+Y+YIDT EA Sbjct: 527 WNIGITNQSELLSMITRKAKSGVFLTVLGFGLDNLKDSMLVKLADHGNGHYAYIDTEQEA 586 Query: 383 QKVLNSEMRQMLITVAKDVKAQIEFNPAWVTEYRQIGYEKRQLRVEHFNNDNVDAGDIGA 442 +KV ++ L+T+AKDVK Q+EFNP V+ YR +GYEKR L E FNND DAG+IGA Sbjct: 587 RKVFVDQLSSTLVTIAKDVKIQVEFNPVQVSSYRLVGYEKRLLAKEDFNNDKKDAGEIGA 646 Query: 443 GKHITLLFELTL----NGQKASIDKLRYAPDNKLAKSDKTKELAWLKIRWKYPQGKESQL 498 G +T L+E+ + A +D+L+Y + +K +I+ + + Sbjct: 647 GHTVTALYEVVPVGKERPEIAKVDELKYQRIPRAVPVEKPTPQRETEIQEREKLPSAAPA 706 Query: 499 VEFPLGPTINAPSEDMRFRAAVAAYGQK 526 P+ D R A A + Sbjct: 707 AAPVPAAKAETPAADGRTGAEKPATEEI 734 Score = 101 bits (252), Expect = 6e-20, Method: Composition-based stats. Identities = 46/155 (29%), Positives = 70/155 (45%), Gaps = 12/155 (7%) Query: 421 EKRQLRVEHFNNDNVDAGDIGAGKHITLLFELTLNGQKASIDKLRYAPDNKLAKSDKTKE 480 + A H T++ + L + + KE Sbjct: 738 PAPGAAPADGRTVTMHAP------HKTIIVDADRGNNTTQASAL---AEPSPLPASVRKE 788 Query: 481 LAWLKIRWKYPQGKESQLVEFPLGP---TINAPSEDMRFRAAVAAYGQKLRGSEYLNNTS 537 + LK+R+K P G++S+L+EFPL T S D F AAVA+YG LR S+YL + Sbjct: 789 MLTLKLRYKEPDGEKSKLLEFPLTDPGTTWEKSSPDFHFAAAVASYGMLLRDSKYLGEAT 848 Query: 538 WQQIKQWAQQAKGEDPQGYRAEFIRLIELADGVTD 572 WQ + +WA++ G D GYR EF+ L++ A + Sbjct: 849 WQSVVEWAREGLGADKHGYRTEFLSLLDRARAMKQ 883 >UniRef50_C8WPE6 von Willebrand factor type A n=1 Tax=Eggerthella lenta DSM 2243 RepID=C8WPE6_EGGLE Length = 555 Score = 355 bits (911), Expect = 3e-96, Method: Composition-based stats. Identities = 178/559 (31%), Positives = 261/559 (46%), Gaps = 28/559 (5%) Query: 33 PSTPTEQQVLAAQQAAIKEAEQSAAAAKALAQQEVQQY------SDKQALQGRLQEAPTF 86 S + ++ A A ++ A + Q Q S+ A+ L E + Sbjct: 3 ASNRSRGRLAAGSIAFAVLIAGASLAGCSPDGQAGDQLGSAASESEIMAIGSALSETAST 62 Query: 87 ARAAKAKATHIANPGTARYQQFDDNPVKQVAQNPLATFSLDVDTGSYANVRRFLNQGLLP 146 + GT Y+ D+ A +PL+T S DVDT SY N+RR + Q P Sbjct: 63 CPPPYPYVPSPSPGGTEEYRALDEPGFLSPATSPLSTLSADVDTASYCNLRRMVAQRYAP 122 Query: 147 ---PPDAVRVEEIVNYFPSDWDIKDKQSIPASKPIPFAMRYELAPAPWNEQRTLLKVDIL 203 P AVR EE++NYF + F + +++ PWN+Q LL + Sbjct: 123 AVVPAGAVRTEELLNYFDYAYP-------EPVGSDLFGVSAQMSDCPWNDQTKLLVMGFA 175 Query: 204 AKDRKSEELPASNLVFLIDTSGSMISDERLPLIQSSLKLLVKELREQDNIAIVTYAGDSR 263 + +NLVFLID SGSM ++LPL++ S LV+ L E+D +++VTYA R Sbjct: 176 TEKDGDASPTGANLVFLIDVSGSMDDPDKLPLVKDSFAALVEGLTERDRVSVVTYASGER 235 Query: 264 IALPSISGSHKAEINAAIDSLDAEGSTNGGAGLELAYQQATKGFIKGGINRILLATDGDF 323 + L + G K I A+DSL AEGSTNG AGLE AY+ A FI+GG+NR+++A+DGD Sbjct: 236 VLLEGVPGDDKRRIMRAVDSLVAEGSTNGEAGLEQAYRLAESSFIEGGVNRVVMASDGDL 295 Query: 324 NVGIDDPKSIESMVKKQRESGVTLSTFGVGNSNYNEAMMVRIADVGNGNYSYIDTLSEAQ 383 NVGI + V+++RE+GV LS G G+ NY + M +AD GNG Y YID EA+ Sbjct: 296 NVGISSESELHDFVEQKRETGVYLSVLGFGSGNYKDNKMETLADHGNGAYHYIDCAEEAR 355 Query: 384 KVLNSEMRQMLITVAKDVKAQIEFNPAWVTEYRQIGYEKRQLRVEHFNNDNVDAGDIGAG 443 +VL +R L+ +A DVK Q+EFNP V YR IGYE R L E F + DAG++GAG Sbjct: 356 RVLGRNLRANLVPLADDVKIQVEFNPDRVKGYRLIGYENRALADEEFRD---DAGEVGAG 412 Query: 444 KHITLLFELTLNGQKASIDKLRYAPDNKLAK-------SDKTKELAWLKIRWKYPQGKES 496 T+ +E+ G + + E +R++ E+ Sbjct: 413 HAFTVAYEIVPAGSAFEVGASASKYGSDADDRQDGRRSEANGGEWLTCTMRYRPAGTVEA 472 Query: 497 QLVEFPLGPT--INAPSEDMRFRAAVAAYGQKLRGSEYLNNTSWQQIKQWAQQAKGEDPQ 554 + + P+ D F AAV G L S + + + + + D Q Sbjct: 473 VEQALVVDDESCTDDPNGDWTFAAAVIECGMALHRSPHAGAATLESARDLLASCELTDQQ 532 Query: 555 GYRAEFIRLIELADGVTDI 573 + + +G Sbjct: 533 QGFETLLADLARQEGAHGS 551 >UniRef50_A7C0I1 von Willebrand factor type A domain protein n=5 Tax=Bacteria RepID=A7C0I1_9GAMM Length = 367 Score = 332 bits (850), Expect = 3e-89, Method: Composition-based stats. Identities = 181/365 (49%), Positives = 252/365 (69%), Gaps = 7/365 (1%) Query: 213 PASNLVFLIDTSGSMISDERLPLIQSSLKLLVKELREQDNIAIVTYAGDSRIALPSISGS 272 P +NLVFL+D SGSM S+ +L L++S+LKLL +L E+D +++V YAG + + L G Sbjct: 2 PPANLVFLVDVSGSMRSNHKLALLKSALKLLSNQLTEKDKVSLVVYAGAAGVVLEPTPGH 61 Query: 273 HKAEINAAIDSLDAEGSTNGGAGLELAYQQATKGFIKGGINRILLATDGDFNVGIDDPKS 332 +IN A++ L A GST+G AG+ LAY A + FIK GINRILLATDGDFNVG D ++ Sbjct: 62 QSVKINGALERLTAGGSTHGSAGIHLAYNLAEQAFIKNGINRILLATDGDFNVGTVDFEA 121 Query: 333 IESMVKKQRESGVTLSTFGVGNSNYNEAMMVRIADVGNGNYSYIDTLSEAQKVLNSEMRQ 392 ++++V+++R+SG++L+T G G NYN+ +M ++AD GNGNY+YIDTL+EAQKVL EM Sbjct: 122 LKNLVEEKRKSGISLTTLGFGRGNYNDQLMEQLADAGNGNYAYIDTLNEAQKVLVDEMSS 181 Query: 393 MLITVAKDVKAQIEFNPAWVTEYRQIGYEKRQLRVEHFNNDNVDAGDIGAGKHITLLFEL 452 L T+AKDVK QIEFNPA V EYR IGYE R L+ E F+ND VDAG+IGAG +T L+E+ Sbjct: 182 TLNTIAKDVKIQIEFNPAIVAEYRLIGYENRLLKREDFSNDKVDAGEIGAGHTVTALYEM 241 Query: 453 TLNGQKAS-IDKLRYAPDNKL--AKSDKTKELAWLKIRWKYPQGKESQLVEFPLGP---- 505 L G ++ LRY+ + + + ++ ELA+L++R+K P SQL+E+P+ Sbjct: 242 ALVGSGGQRLESLRYSQNQDVPKSNDNQNNELAFLRLRYKAPNSDTSQLLEWPMMRQDIL 301 Query: 506 TINAPSEDMRFRAAVAAYGQKLRGSEYLNNTSWQQIKQWAQQAKGEDPQGYRAEFIRLIE 565 +E RF AAVAA+GQ+LRG +YL S+ I A+ A+G DP GYR E I+L+ Sbjct: 302 ETVDTNERFRFAAAVAAFGQQLRGGKYLEQFSYDNILNLARDARGNDPFGYRGELIKLVN 361 Query: 566 LADGV 570 LA + Sbjct: 362 LAKSL 366 >UniRef50_B4D1N7 Autotransporter-associated beta strand repeat protein n=1 Tax=Chthoniobacter flavus Ellin428 RepID=B4D1N7_9BACT Length = 1545 Score = 323 bits (828), Expect = 1e-86, Method: Composition-based stats. Identities = 148/566 (26%), Positives = 253/566 (44%), Gaps = 47/566 (8%) Query: 18 GCGPQPENKESQQQQPSTPTEQQVLAAQ---QAAIKEAEQSAAAAKALAQQEVQQYSDKQ 74 P+ + P+TP ++ + SA K+ + ++ D+ Sbjct: 1015 WTIPKDLIPSAPDASPNTPIDRDTVKNWLIANGLTFNGNASALYIKSTNRLVIRNTQDQL 1074 Query: 75 ALQGRLQEAPTFARAAKAKATHIANPGTARYQQFDDNPVKQVAQNPLATFSLDVDTGSYA 134 L R+ +A R KAK T P P Q + N +TFSL+V S+ Sbjct: 1075 DLVDRIVKADAKEREDKAKETVPTAP--------IPQPEVQTSANAFSTFSLNVSDVSFK 1126 Query: 135 NVRRFLNQGLLPPPDAVRVEEIVNYFPSDWDIKDKQSIPASKPIPFAMRYELAPAPWNEQ 194 L QG +P P +VR EE +N F + P A E A P+ + Sbjct: 1127 LAAASLEQGHMPDPASVRSEEFINAFDYRDPEPSPGA-------PLAFVTERARYPFAQN 1179 Query: 195 RTLLKVDILAKDRKSEELPASNLVFLIDTSGSMISDERLPLIQSSLKLLVKELREQDNIA 254 R LL+ + + N+V L+D SGSM +R+ +++ +L +L K L+ QD ++ Sbjct: 1180 RDLLRFAVKTAAAGRQPGRPLNIVLLLDRSGSMERADRVNIVREALSVLAKHLQPQDKLS 1239 Query: 255 IVTYAGDSRIALPSISGSHKAEINAAIDSLDAEGSTNGGAGLELAYQQATKGFIKGGINR 314 IVT+A + +++G ++ A ++ + EG TN A L+LAY+ A F NR Sbjct: 1240 IVTFARTPHLWADAVAGDKVHDVIARVNEITPEGGTNLEAALDLAYETAHHHFAVDSTNR 1299 Query: 315 ILLATDGDFNVGIDDPKSIESMVKKQRESGVTLSTFGVGNSNYNEAMMVRIADVGNGNYS 374 ++L TDG N+G +P ++ V+ QR+ G+ L FG+G YN+ ++ ++ +G Y Sbjct: 1300 VILFTDGAANLGDVNPDALTKKVEAQRKQGIALDCFGIGWEGYNDDLLEQLTRNADGRYG 1359 Query: 375 YIDTLSEAQKVLNSEMRQMLITVAKDVKAQIEFNPAWVTEYRQIGYEKRQLRVEHFNNDN 434 +I+T +A +++ L A DVK Q+EFNP V YRQIGY QL E F +++ Sbjct: 1360 FINTPEDAAANFATQIAGALQVAASDVKVQVEFNPHRVKTYRQIGYATHQLTKEQFRDNS 1419 Query: 435 VDAGDIGAGKHITLLFELTLNGQKASIDKLRYAPDNKLAKSDKTKELAWLKIRWKYPQGK 494 V+A IGA + L+ + ++ +LA + +R++ P Sbjct: 1420 VNAAQIGAAESGNALYVVEVDPHGEG-------------------DLATVHVRFRVPGTS 1460 Query: 495 ESQLVEFPLG-----PTINAPSEDMRFRAAVAAYGQKLRGSEYLNNTSWQQIKQWAQQ-- 547 + + E+P+ P + S +R A +A+ + L S Y + ++ Sbjct: 1461 DYREHEWPVPFAGEVPPLEQASSALRLAGAASAFSEMLAASPYATEVTSDRLLNILNGVP 1520 Query: 548 -AKGEDPQGYRAEFIRLIELADGVTD 572 G DP+ + E+ +I A ++ Sbjct: 1521 PIYGADPRPTKLEW--MIRQARSLSG 1544 >UniRef50_A6DLI7 von Willebrand factor type A domain protein n=1 Tax=Lentisphaera araneosa HTCC2155 RepID=A6DLI7_9BACT Length = 1078 Score = 319 bits (818), Expect = 1e-85, Method: Composition-based stats. Identities = 138/557 (24%), Positives = 257/557 (46%), Gaps = 38/557 (6%) Query: 20 GPQPENKESQQQQPSTPTEQQ-VLAAQQAAIKEAEQSAAAAKALAQQEVQQYSDKQALQG 78 G + ESQ Q +E V + + IK + + + +Q ++ +K+ Sbjct: 553 GDYRKYLESQGFQFDAGSEVDFVESVNRLVIKNTPEQNSKIASHLEQVNERAQEKKKESQ 612 Query: 79 RLQEAPTFARAAKAKATHIANPGTARYQQFDDNPVKQVAQNPLATFSLDVDTGSYANVRR 138 + +E ++ ++ +A P ++ D + L+TF++DVDT SY R Sbjct: 613 KARERLKQLKSQQSNLPSVALPSPLFFEGMID-----AKETNLSTFAIDVDTASYTAARS 667 Query: 139 FLNQGLLPPPDAVRVEEIVNYFPSDWDIKDKQSIPASKPIPFAMRYELAPAPWNEQRTLL 198 + G VR+EE +N F + + K++ F + EL+ LL Sbjct: 668 EIRAGRKVEASHVRIEEFINNFDYHYSVPKKEA--------FKIDSELSDHKVYAGVKLL 719 Query: 199 KVDILAKDRKSEELPASNLVFLIDTSGSMISDERLPLIQSSLKLLVKELREQDNIAIVTY 258 +V + + ++ + F+ID SGSM ++ RLPLIQ +L + K + + D + I++ Sbjct: 720 RVGVQGQRLGADSQKPGSYTFVIDNSGSMAAENRLPLIQKTLPNMFKAMNQDDEVTILSC 779 Query: 259 AGDSRIALPSISGSHKAEINAAIDSLDAEGSTNGGAGLELAYQQATKGFIKGGINRILLA 318 G I+ S+ +++ A+ +++A N G+E AY+ A + F G +NR++L Sbjct: 780 EGGVTNLANRITASNHSQLETAVKNIEAGTVANLSVGIEEAYKLAAQNFRSGAVNRVILL 839 Query: 319 TDGDFNVGIDDPKSIESMVKKQRESGVTLSTFGVGNSNYNEAMMVRIADVGNGNYSYIDT 378 +DG ++G + + + V + R+ G+ + GVG+ +Y+++ + +A+ G+G Y + D+ Sbjct: 840 SDGIASLGEKEAQEVLKTVSQYRKQGIGNTVIGVGSEDYDDSFLETLANKGDGVYYFGDS 899 Query: 379 LSEAQKVLNSEMRQMLITVAKDVKAQIEFNPAWVTEYRQIGYEKRQLRVEHFNNDNVDAG 438 + +L + T+A+DVK Q+EFNP V YR +GYEKR+L + F ND VDAG Sbjct: 900 KEQMNDILVNNFEASFKTIARDVKIQLEFNPQAVRSYRLLGYEKRRLANKDFRNDKVDAG 959 Query: 439 DIGAGKHITLLFELTLNGQKASIDKLRYAPDNKLAKSDKTKELAWLKIRWKYPQGKESQL 498 +IGAG+ +T L+EL +N + +L + +R+K + + Sbjct: 960 EIGAGQSVTALYELVVNENT------------------QEAKLGAVNLRYKNLENELVTE 1001 Query: 499 VEFPLGP----TINAPSEDMRFRAAVAAYGQKLRGSEYLNNTSWQQIKQWAQQAKGEDPQ 554 + + + MR A + + L+ + + Q+ + E PQ Sbjct: 1002 INKEIKAQQNVSFPESQSSMRLAWCAATFARLLKNNG-KGELRFDQLAAEVDKILLERPQ 1060 Query: 555 GYR-AEFIRLIELADGV 570 + EF LI + Sbjct: 1061 DQKIQEFKDLIIRCQSL 1077 >UniRef50_A3TQT5 Putative secreted protein n=1 Tax=Janibacter sp. HTCC2649 RepID=A3TQT5_9MICO Length = 533 Score = 311 bits (796), Expect = 5e-83, Method: Composition-based stats. Identities = 160/514 (31%), Positives = 243/514 (47%), Gaps = 37/514 (7%) Query: 62 LAQQEVQQYSDKQALQGRLQEAPTFARAAKAKATHIANPGTA-RYQQFDDNPVKQVAQNP 120 QE S + +A + A ++ P + + + P Sbjct: 38 PTAQENDSASGTSSSSAVGGQASSGVAQPFPAAPNVPGPLEDNTFVDAGTSGFIDTRERP 97 Query: 121 LATFSLDVDTGSYANVRRFLNQGLLPPPDAVRVEEIVNYFPSDWDIKDKQSIPASKPIPF 180 +TF++DVD GS+ R L+ G LPPP++VR EE VN F S + PA + Sbjct: 98 RSTFAVDVDGGSFRVARSLLHDGHLPPPESVRPEEWVNSFDSGF--------PAPRKDDL 149 Query: 181 AMRYELAPAPWNEQ-RTLLKVDILAKDRKSEELPASNLVFLIDTSGSMISDERLPLIQSS 239 ++ + A A + L+++ + ++ E L ++DTSGSM ERL L++SS Sbjct: 150 ELQSDQARASSEDDGTRLVRIGLQGREVDVREWQPVALTMVVDTSGSMDIRERLGLVKSS 209 Query: 240 LKLLVKELREQDNIAIVTYAGDSRIALPSISGSHKAEINAAIDSLDAEGSTNGGAGLELA 299 L LL + LR D IAIVTY D+ L I AAID L+A GSTN AGL L Sbjct: 210 LALLAENLRPDDTIAIVTYQTDATPLLEPTPVRDTDTILAAIDRLEAGGSTNLEAGLLLG 269 Query: 300 YQQATKGFIKGGINRILLATDGDFNVGIDDPKSIESMVKKQRESGVTLSTFGVGNSNYNE 359 Y QA + + +G N +LLA+DG NVG+ D + + ++ G+ L T G G NY++ Sbjct: 270 YDQAREAYKQGATNVVLLASDGVANVGVTDGGRLATAIRDNGRRGIHLVTVGYGMGNYSD 329 Query: 360 AMMVRIADVGNGNYSYIDTLSEAQKVLNSEMRQMLITVAKDVKAQIEFNPAWVTEYRQIG 419 +M ++AD G+G Y YIDT EA+K+ ++R L VAKD K Q+EF+P V+ YR IG Sbjct: 330 HLMEQLADQGDGFYEYIDTFEEARKLFVEDLRATLTPVAKDAKIQVEFDPRTVSAYRLIG 389 Query: 420 YEKRQLRVEHFNNDNVDAGDIGAGKHITLLFELTLNGQKASIDKLRYAPDNKLAKSDKTK 479 YE R L + F+ND VDAG++GAG +T L+E+ A++D+ Sbjct: 390 YENRALSDDDFDNDAVDAGEVGAGHKVTALYEVRP-----------------TAQADEGD 432 Query: 480 ELAWLKIRWKYPQGKESQLVEFPLGPTINAPS--------EDMRFRAAVAAYGQKLRGSE 531 L +++RW+ G+E + PL V G+ + Sbjct: 433 ALGTVRVRWRSVDGEEQREDSLPLTLGDAESPTGALGVAAAVADLAQLVKGGGESMSTQP 492 Query: 532 YLN-NTSWQQIKQWAQQAKGEDPQGYRAEFIRLI 564 +T+ ++ +D G R E ++ Sbjct: 493 RGESSTTLANLRDRVAALVEQDAPGAR-ELADVL 525 >UniRef50_D2BAS2 von Willebrand factor type A domain protein n=12 Tax=Actinomycetales RepID=D2BAS2_STRRD Length = 490 Score = 307 bits (787), Expect = 5e-82, Method: Composition-based stats. Identities = 140/503 (27%), Positives = 227/503 (45%), Gaps = 37/503 (7%) Query: 75 ALQGRLQEAPTFARAAKAKATHIANPGTARYQQFDDNPVKQVAQNPLATFSLDVDTGSYA 134 A G + + + +P Q+ + A + ++TF+LDVDT SY Sbjct: 19 AACGGSGSSRPASEPRNNPGNAVPSPAQD--QESGAARERDAAADQISTFALDVDTASYG 76 Query: 135 NVRRFLNQGLLPPPDAVRVEEIVNYFPSDWDIKDKQSIPASKPIPFAMRYELAPAPWNEQ 194 +R L +G LP P +R EE VN F D+ F + + A P N Sbjct: 77 YAKRILQEGRLPEPGQIRPEEFVNSFRQDYK--------EPGDDGFTVHMDGARMPEN-G 127 Query: 195 RTLLKVDILAKDRKSEELPASNLVFLIDTSGSMISDERLPLIQSSLKLLVKELREQDNIA 254 L++V + + + E +NL F++D SGSM RL L++ +L LV +L D ++ Sbjct: 128 TALIRVGLQTRKAEPEARRPANLTFVVDVSGSMGEPGRLDLVREALHKLVDQLGPGDQVS 187 Query: 255 IVTYAGDSRIALPSISGSHKAEINAAIDSLDAEGSTNGGAGLELAYQQATKGFIKGGINR 314 IV ++ +R+ L + + +++AAID L E STN GL Y +A + F NR Sbjct: 188 IVAFSTQARLVLSMTPATGRDQLHAAIDRLGVEDSTNLETGLTAGYAEAARAFRPAATNR 247 Query: 315 ILLATDGDFNVGIDDPKSIESMVKKQRESGVTLSTFGVGNSNYNEAMMVRIADVGNGNYS 374 ++L +DG N G + I V + +TL GVG +Y + +M ++AD G+G Sbjct: 248 VILLSDGLANTGDTTWQGILDRVAESAGRQITLLCVGVGR-DYGDQLMEQLADNGDGAAV 306 Query: 375 YIDTLSEAQKVLNSEMRQMLITVAKDVKAQIEFNPAWVTEYRQIGYEKRQLRVEHFNNDN 434 Y+ + +A+KV ++ L A+D KAQ+ FNP+ V YR IGYE RQ+ E F +D Sbjct: 307 YVSSADDARKVFVEQLATNLDLRARDAKAQVVFNPSAVESYRLIGYENRQIAAEDFRDDT 366 Query: 435 VDAGDIGAGKHITLLFELTLNGQKASIDKLRYAPDNKLAKSDKTKELAWLKIRWKYPQG- 493 D G+IG G +T L+ + L +S + +LA +RW+ P Sbjct: 367 KDGGEIGPGHSVTALYGVRL-------------------RSGASGQLATATVRWQDPDTR 407 Query: 494 ---KESQLVEFP--LGPTINAPSEDMRFRAAVAAYGQKLRGSEYLNNTSWQQIKQWAQQA 548 + S+ +E S + AA+ + LR E + +++ A + Sbjct: 408 GPGETSRSLESADLSASVWRESSPRFQVDVVAAAFAEYLRTREEIAGVGARELAGHATRL 467 Query: 549 KGEDPQGYRAEFIRLIELADGVT 571 E LI+ A ++ Sbjct: 468 AASTEDAAVTELAGLIDRAVSLS 490 >UniRef50_Q9M1S2 Putative uncharacterized protein T5N23_140 n=5 Tax=Arabidopsis thaliana RepID=Q9M1S2_ARATH Length = 676 Score = 304 bits (779), Expect = 5e-81, Method: Composition-based stats. Identities = 86/470 (18%), Positives = 187/470 (39%), Gaps = 30/470 (6%) Query: 106 QQFDDNPVKQVAQN-PLATFSLDVDTGSYANVR------RFLNQGLLPPPDAVRVEEIVN 158 ++ + P+++ + + P F + + + R R + QG P P +E + Sbjct: 120 AKWKEIPIQKPSLDLPYYPFDRCNNDAAISLFRCLPPSQRAITQGH-PEPATFDDDERLE 178 Query: 159 Y---FPSDWDIKDKQSIPASKPIPFAMRYELAPAPWNE-----QRTLLKVDILAKDRKSE 210 F + ++ K++ + + + E++ P ++ + + Sbjct: 179 EQIVFDGETEVLKKENRDYVRMMDMKVYPEVSAVPQSKSCENFDVLVHLKAVTGDQISQY 238 Query: 211 ELPASNLVFLIDTSGSMISDERLPLIQSSLKLLVKELREQDNIAIVTYAGDSRIALPS-- 268 +LV ++D SGSM +L L++ ++ +++ L D ++++ ++ +R P Sbjct: 239 RRAPIDLVTVLDISGSM-GGTKLALLKRAMGFVIQNLGSSDRLSVIAFSSTARRLFPLTR 297 Query: 269 ISGSHKAEINAAIDSLDAEGSTNGGAGLELAYQQATKGFIKGGINRILLATDGDFNVGID 328 +S + + A++SL A G TN GL + + + I+L +DG + Sbjct: 298 MSDAGRQLALQAVNSLVANGGTNIVDGLRKGAKVMEDRLERNSVASIILLSDGRDTYTTN 357 Query: 329 DPKSIESMVKKQRESGVTLSTFGVGNSNYNEAMMVRIADVGNGNYSYIDTLSEAQKVLNS 388 P ++ Q +++ +FG G S+++ ++M +++V G +S+I++ S Q L Sbjct: 358 HPDPSYKVMLPQ----ISVHSFGFG-SDHDASVMHSVSEVSGGTFSFIESESVIQDALAQ 412 Query: 389 EMRQMLITVAKDVKAQIEFNPAWVTEYRQIGYEKRQLRVEHFNNDNVDAGDIGAGKHITL 448 + +L ++++ +IE V L ++ VD GD+ A + Sbjct: 413 CIGGLLSVAVQELRVEIEGVSPNVRLSSIKAGSYSSLVTGDGHSGLVDLGDLYADEERDF 472 Query: 449 LFELTLNGQKASIDK---LRYAPDNKLAKSDKTKELAWLKI-RWKYPQGKESQLVEFPLG 504 L + + ++ LR N L K T E L+I R +Y ++ +E Sbjct: 473 LVSINIPVEEDGHTPLLKLRCLYINPLTKEITTLESHVLQIRRPEYVAEEKVVPIEVVRQ 532 Query: 505 PTINAPSEDMRFRAAVAAYGQKLRGSEYLNNTSWQQIKQWAQQAKGEDPQ 554 +E M +A +G + + N + AK D Sbjct: 533 RNRFLAAEAMAQARTLAEHGDLEAAVKAIENFRL--VLAETVAAKSCDRF 580 >UniRef50_C1RGW7 Uncharacterized protein containing a von Willebrand factor type A (VWA) domain n=1 Tax=Cellulomonas flavigena DSM 20109 RepID=C1RGW7_9CELL Length = 500 Score = 296 bits (757), Expect = 2e-78, Method: Composition-based stats. Identities = 128/510 (25%), Positives = 218/510 (42%), Gaps = 32/510 (6%) Query: 63 AQQEVQQYSDKQALQGRLQEAPTFARAAKAKATHIANPGTARYQQFDDNPVKQVAQNPLA 122 + + A++ L+ Sbjct: 16 GVAVALGACSAGGSADGTADWEGSGAYQPGPYQEDLPYPEPGPTGPTAAGMTDPARDALS 75 Query: 123 TFSLDVDTGSYANVRRFLNQGLLPPPDAVRVEEIVNYFPSDWDIKDKQSIPASKPIPFAM 182 TF+LDVDTG+Y R + QG P VR EE VNYF D++ + + Sbjct: 76 TFALDVDTGAYTRFRDAVRQGFSVDPFGVRTEEFVNYFAQDYEPPAEG---------LGV 126 Query: 183 RYELAPAPWNEQRTLLKVDILAKDRKSEELPASNLVFLIDTSGSMISDERLPLIQSSLKL 242 + P+ L++V I + + ++LV ++D SGSM ++ + +L+ Sbjct: 127 SIDATALPFRPDHRLVRVGISSAPASAVSRADADLVLVVDCSGSMDEAGKMETTKYALRT 186 Query: 243 LVKELREQDNIAIVTYAGDSRIALPSISGSHKAEINAAIDSLDAEGSTNGGAGLELAYQQ 302 LV LR D +A+V Y+ ++ + L + + + AAID L STN AGL L Y Sbjct: 187 LVSSLRRTDRVAMVCYSTEADVYLEPTPVAEREGVLAAIDRLAPRDSTNAAAGLALGYDL 246 Query: 303 ATKGFIKGGINRILLATDGDFNVGIDDPKSIESMVKKQRESGVTLSTFGVGNSNYNEAMM 362 A +G + R++L +DG NVG DP+ I + + Q ++G++L + GVG + YN+ ++ Sbjct: 247 AMSMRTEGRLTRVVLVSDGVANVGETDPEGILARISSQAKAGISLISVGVGITTYNDHLL 306 Query: 363 VRIADVGNGNYSYIDTLSEAQKVLNSEMRQMLITVAKDVKAQIEFNPAWVTEYRQIGYEK 422 ++AD G+G + Y+D +EA++V + + L+ D +AQ+EF+PA V YR +GYE Sbjct: 307 EQLADQGDGWHVYVDGEAEAERVFATGLTGSLVVAGTDARAQVEFDPAQVAGYRLLGYEN 366 Query: 423 RQLRVEHFNNDNVDAGDIGAGKHITLLFELTLNGQKASIDKLRYAPDNKLAKSDKTKELA 482 R + E F ND VD G++ AG+ T L+E+ + Sbjct: 367 RAVADEDFRNDAVDGGEVFAGRSTTALYEVAMREGAG------------------DGAFV 408 Query: 483 WLKIRWKY----PQGKESQLVEFPLGPTINAPSEDMRFRAAVAAYGQKLRGSEYLNNTSW 538 +R+ P +++ L + S +R VA L + + Sbjct: 409 RATVRYLDDDGRPVERDASLSRDDCAASPREASPRLRQDLVVALLTDHLTDGPWSQEIAP 468 Query: 539 QQIKQWAQQAKGE-DPQGYRAEFIRLIELA 567 ++ A+ G D E + L++ A Sbjct: 469 ADVRAEARTLLGVLDGDRAVQELVELVDRA 498 >UniRef50_B5JNR2 von Willebrand factor type A domain protein n=1 Tax=Verrucomicrobiae bacterium DG1235 RepID=B5JNR2_9BACT Length = 923 Score = 293 bits (751), Expect = 8e-78, Method: Composition-based stats. Identities = 121/552 (21%), Positives = 225/552 (40%), Gaps = 36/552 (6%) Query: 27 ESQQQQPSTPTEQQVLAAQQAAIKEAEQSAAAAKALAQ--QEVQQYSDKQALQGRLQEAP 84 +++ + P T + + + + + + + A Sbjct: 400 DARPRLPFQNTANSPTPSSPSLAADTFNPEPITENPPSSFNLAEGLDNPNLVAASTANAT 459 Query: 85 TFARAAKAKATHIANPGTARYQQFDDNPVKQVAQNPLATFSLDVDTGSYANVRRFLNQGL 144 A NP Q + P A +P +TFSL+V SY +L Q + Sbjct: 460 QTAPNNDEPLPRTQNPPPTT--QLSEYPESNTATDPQSTFSLNVSDVSYRLTEAYLAQNV 517 Query: 145 LPPPDAVRVEEIVNYFPSDWDIKDKQSIPASKPIPFAMRYELAPAPWNEQRTLLKVDILA 204 PP +R EE VN F +E A P+ R +L+ + Sbjct: 518 RPPAGTLRTEEFVNAFDYGDPT-------PPVARKIGFTWERAHWPFAHDRDVLRFSLQT 570 Query: 205 KDRKSEELPASNLVFLIDTSGSMISDERLPLIQSSLKLLVKELREQDNIAIVTYAGDSRI 264 +L IDTSGSM +R+ ++ S L L E+D ++IV++ R+ Sbjct: 571 AAHGRASSQPLHLTLAIDTSGSMSRPDRVDIVNSLATALQSNLTEKDRLSIVSFDRQPRL 630 Query: 265 ALPSISGSHKAEINAAIDSLDAEGSTNGGAGLELAYQQATKGFIKGGINRILLATDGDFN 324 L S + + + L+ +G T+ + L+L+YQ A + F + INR++L TDG N Sbjct: 631 VLDGQSVTAETNLATLATQLNPQGGTDLESALQLSYQTAQRHFQENAINRVILITDGAAN 690 Query: 325 VGIDDPKSIESMVKKQRESGVTLSTFGVGNSNYNEAMMVRIADVGNGNYSYIDTLSEAQK 384 +G + + + + V + R G+ L FG+G +++ + ++ G+G Y ++ + +A Sbjct: 691 LGNTNAEQLRTTVTENRIRGIALDCFGIGFDGHDDTFLESLSRNGDGRYRFLRSPEDAAL 750 Query: 385 VLNSEMRQMLITVAKDVKAQIEFNPAWVTEYRQIGYEKRQLRVEHFNNDNVDAGDIGAGK 444 L ++ +L A DVK Q+EFNP V Y+Q+GY++ Q+ + F N+ VDA ++ A + Sbjct: 751 ELGPKLAGLLRPAAYDVKVQVEFNPTRVETYQQLGYQQHQIADQDFRNNAVDAAELAATE 810 Query: 445 HITLLFELTLNGQKASIDKLRYAPDNKLAKSDKTKELAWLKIRWKYPQGKESQLVEF--- 501 L+ + D +L +++R++ + + + + Sbjct: 811 SGNALYLAKVLP-------------------DGRGDLGLVRVRFRDAESGAYEELSWNLP 851 Query: 502 --PLGPTINAPSEDMRFRAAVAAYGQKLRGSEYLNNTSWQQIKQWAQQAKGEDPQGYRAE 559 P ++ S +R + AA+ L S + + + + A P R + Sbjct: 852 YKANAPELDQASPSLRLASIAAAFAALLNESPLAHAITHSDLYELAAPLPQHFPTQTRVQ 911 Query: 560 FIR-LIELADGV 570 +R LI A + Sbjct: 912 TLRTLINQARRL 923 >UniRef50_Q10RY0 Os03g0142500 protein n=2 Tax=Oryza sativa RepID=Q10RY0_ORYSJ Length = 694 Score = 287 bits (735), Expect = 6e-76, Method: Composition-based stats. Identities = 86/448 (19%), Positives = 165/448 (36%), Gaps = 31/448 (6%) Query: 106 QQFDDNPVKQV-------AQNPLATFSLDVDTGSYANVRRFLN--QGLLPPPDAVRVEEI 156 ++ + P + + ++T + D G + VRR + G L AV Sbjct: 119 AEWKELPFQGTQPGDTAYGRARVSTVNWPQDEGQMSVVRRLSHGYSGNLQQQLAVFRTPE 178 Query: 157 VNYFPSDWDIKDKQSIPASKPI-----PFAMRYELAPAPWNEQRTLLKVDILAKDRKSEE 211 + F D +I + E +E+R + + I K KS + Sbjct: 179 ASIFNDDENIDPQSETVDDHNAVTNSVEIKTYSEFPAIQKSERRKVFAILIHLKAPKSLD 238 Query: 212 ----LPASNLVFLIDTSGSMISDERLPLIQSSLKLLVKELREQDNIAIVTYAGDSRIALP 267 +LV ++D SGSM +L L++ ++ +++ L D +++V ++ ++ P Sbjct: 239 SVSSRAPLDLVTVLDVSGSMSGI-KLSLLKRAMSFVIQTLGPNDRLSVVAFSSTAQRLFP 297 Query: 268 S--ISGSHKAEINAAIDSLDAEGSTNGGAGLELAYQQATKGFIKGGINRILLATDGDFNV 325 ++ + + + AI SL A G TN L+ + K ++ I+L +DG Sbjct: 298 LRRMTLTGRQQALQAISSLVASGGTNIADALKKGAKVVKDRRRKNPVSSIILLSDGQDTH 357 Query: 326 GIDDPKS-------IESMVKKQRESGVTLSTFGVGNSNYNEAMMVRIADVGNGNYSYIDT 378 ++ + + V + TFG G +++ A M IA+ NG +S+ID Sbjct: 358 SFLSGEADINYSILVPPSILPGTSHHVQIHTFGFGT-DHDSAAMHAIAETSNGTFSFIDA 416 Query: 379 LSEAQKVLNSEMRQMLITVAKDVKAQIEFNPAWVTEYRQIGYEKRQLRVEHFNNDNVDAG 438 Q M +L V KD++ IE V+ + + VD G Sbjct: 417 EGSIQDAFAQCMGGLLSVVVKDMRLCIECIDEGVSLTSIKSGSYASQVAGNERSGLVDIG 476 Query: 439 DIGAGKHITLLFELTLNGQKASIDKLRYAPDNKLAKSDKTKELAWLKIRWKYPQGKESQL 498 D+ A + L L + ++ A + + +L ++ + P Sbjct: 477 DLYADEERGFLVTLHVPAAHGQTVLIKPKCTYLDAITMENVQLDGEEVIIQRPAYCVDCT 536 Query: 499 VEFPLGPTI--NAPSEDMRFRAAVAAYG 524 + + +EDM + A G Sbjct: 537 MSPEVEREWHRVQATEDMSAARSAAEDG 564 >UniRef50_C5WYU9 Putative uncharacterized protein Sb01g047460 n=3 Tax=Andropogoneae RepID=C5WYU9_SORBI Length = 698 Score = 279 bits (714), Expect = 2e-73, Method: Composition-based stats. Identities = 89/495 (17%), Positives = 182/495 (36%), Gaps = 43/495 (8%) Query: 106 QQFDDNPVKQVAQN-----PLATFSLDVDTGSYANVRRFLN--QGLLPPPDAVRVEEIVN 158 ++ + P Q A + + D G A VRR + G L Sbjct: 115 AEWKELPGAQPADANYGRARVNPLNWPQDEGHMAVVRRLSHTYSGNLQEHLPFFRTLEAG 174 Query: 159 YFPSDWDIKDK-----QSIPASKPIPFAMRYELAPAPWNEQRTLLKVDILAKDRKSEE-- 211 F D I + + + + E + + + + I + KS Sbjct: 175 IFNDDEHIDLQSDMNDEHNAITGSVKIKAYSEFPAIEQSVTKEIFAILIHLRAPKSSHSA 234 Query: 212 --LPASNLVFLIDTSGSMISDERLPLIQSSLKLLVKELREQDNIAIVTYAGDSRIALPS- 268 +LV ++D SGSM ++ L+++++ +++ L D ++++ ++ +R P Sbjct: 235 SSRAPLDLVTVLDVSGSMAG-TKIALLKNAMSFVIQTLGPNDRLSVIAFSSTARRLFPLR 293 Query: 269 -ISGSHKAEINAAIDSLDAEGSTNGGAGLELAYQQATKGFIKGGINRILLATDGDFNVGI 327 ++ + + + A+ SL A G TN GL+ + +K + I+L +DG + Sbjct: 294 RMTLAGRQQALQAVSSLVASGGTNIADGLKKGAKVIEDRRLKNPVCSIILLSDGQDTYTL 353 Query: 328 DDPKSIESM-------VKKQRESGVTLSTFGVGNSNYNEAMMVRIADVGNGNYSYIDTLS 380 +++ + V + TFG G S+++ A M IA++ +G +S+ID Sbjct: 354 PSDRNLLDYSALVPPSILPGTGHHVQIHTFGFG-SDHDSAAMHAIAEISSGTFSFIDAEG 412 Query: 381 EAQKVLNSEMRQMLITVAKDVKAQIEFNPAWVTEYRQIGYEKRQLRVEHFNNDNVDAGDI 440 Q + +L V K+++ IE V E+ +VD GD+ Sbjct: 413 SIQDGFAQCIGGLLSVVVKEMRLGIECVDNGVLLTSIKSGGYTSQVAENGRGGSVDIGDL 472 Query: 441 GAGKHITLLFELTLNGQKASIDKLRYAPDNKLAKSDKTKELAWLKIRWKYPQGKESQLVE 500 A + L L + + ++ + A + + E+ ++R + P+ + Sbjct: 473 YADEERGFLLTLHVPPAQGQTVLIKPSCTYHDAITMENIEVHGEEVRIQRPEHHVDCKMS 532 Query: 501 FPLGPTI--NAPSEDM----------RFRAAVAAYGQKLR--GSEYLNNTSWQ--QIKQW 544 + +EDM F AVA + R S+ ++ Q + Sbjct: 533 PEVEREWHRVQATEDMSAARAAAEVGAFSQAVAILEARRRILESQAAQSSDNQCLALMTE 592 Query: 545 AQQAKGEDPQGYRAE 559 ++ + R E Sbjct: 593 LREMQERVENRRRYE 607 >UniRef50_B9GK57 Predicted protein (Fragment) n=2 Tax=Populus trichocarpa RepID=B9GK57_POPTR Length = 595 Score = 272 bits (696), Expect = 2e-71, Method: Composition-based stats. Identities = 92/463 (19%), Positives = 173/463 (37%), Gaps = 43/463 (9%) Query: 127 DVDTGSYANVRRFLNQGLL----PPPDAVRVEEIVNYFPSDWDIKDKQSIPASKPIPFAM 182 DV + NV F G L P V +E ++F D + D S P A+ Sbjct: 56 DVPFQAPKNVPSFQRSGSLHAYVPNASPVHIEP--DHFSDDELVPDVSQGQPSSSRPHAI 113 Query: 183 RY----ELAPAPWNEQRTLLKVDILAKDRK-----SEELPASNLVFLIDTSGSMISDERL 233 E +E + V + ++V ++D SGSM +L Sbjct: 114 TVKTLPEYPAVSASESFSKFGVLVRVLAPPLDNTLPHHRAPIDIVNVLDVSGSMAG--KL 171 Query: 234 PLIQSSLKLLVKELREQDNIAIVTYAGDSRIALPS--ISGSHKAEINAAIDSLDAEGSTN 291 L++ ++ +++ L D ++IVT++ +R LP +SGS + + + ++SL A G TN Sbjct: 172 ILLKRAVNFIIQNLGPSDRLSIVTFSSSARRILPLRTMSGSGREDAISVVNSLSATGGTN 231 Query: 292 GGAGLELAYQQATKGFIKGGINRILLATDGDFNVGID--DPKSIESMV--------KKQR 341 AGL + + + I+L +DG + ++ ++ R Sbjct: 232 IVAGLRKGVRVLEERRQHNSVASIILLSDGCDTQSHSTHNRLEYLKLIFPSNNASGEESR 291 Query: 342 ESGVTLSTFGVGNSNYNEAMMVRIADVGNGNYSYIDTLSEAQKVLNSEMRQMLITVAKDV 401 + + TFG G +++ A M I+DV G +S+I+++ Q + + VA+DV Sbjct: 292 QPTFPIHTFGFGL-DHDSAAMHAISDVSGGTFSFIESIDILQDAFARCIGGLTSIVARDV 350 Query: 402 KAQIEFNPAWVTEYRQIGYEKRQLRVEHFNNDNVDAGDIGAGKHITLLFELTLN--GQKA 459 + ++ V + + + +D GD+ A + L L++ Sbjct: 351 QLKVRSASPGVQILSTPSGRHKNKIFDQGHQATIDIGDLYAEEEKEFLVFLSIPVFPAVD 410 Query: 460 SIDKLRYAPDNKLAKSDKTK--------ELAWLKIRW--KYPQGKESQLVEFPLGPTINA 509 + L P ++ K E ++IR +E Sbjct: 411 GEEMLENMPLVDVSGFQKDSVSTDTVEVEGERVEIRRPQFLSSTDWVPCLEVDRQRNRLL 470 Query: 510 PSEDMRFRAAVAAYGQKLRGSEYLNNTSWQQIKQWAQQAKGED 552 +E + +A G L+G++ L + A G+D Sbjct: 471 VTETIAKTQRMAEMGD-LKGAQALLAEQLSTLLSTASAQAGDD 512 >UniRef50_Q9FF49 Retroelement pol polyprotein-like n=34 Tax=Magnoliophyta RepID=Q9FF49_ARATH Length = 704 Score = 267 bits (682), Expect = 8e-70, Method: Composition-based stats. Identities = 82/480 (17%), Positives = 177/480 (36%), Gaps = 43/480 (8%) Query: 106 QQFDDNPVKQVAQNPLA---TFSLDVDTG--SYANVRRFLNQGLLPPPDAVRVEEIVNY- 159 ++++ P++ P + D S R Q PD +RV I N Sbjct: 117 AKWNEIPIQSPNAKPKSGVKPIGRPRDDAWMSIPPRRSSPIQ-YTSRPDCLRVSSIFNTE 175 Query: 160 ---FPSDWD--IKDKQSIPASKPIPFAMRYELAPAPWNEQ--------RTLLKVDILAKD 206 F D +D+ + E+ P + + +++ A Sbjct: 176 PAVFNDDEALEHQDRSAESGLDKPGVTGTLEVKTYPEISEVVRSVSFKDFAVLINLKAPT 235 Query: 207 RKSEE-------LPASNLVFLIDTSGSMISDERLPLIQSSLKLLVKELREQDNIAIVTYA 259 +LV ++D SGSM +L L++ ++ +++ L D +++++++ Sbjct: 236 SSKSSSNPSSSSRAPVDLVTVLDVSGSMAG-TKLALLKRAMGFVIQNLGPFDRLSVISFS 294 Query: 260 GDSRIALPS--ISGSHKAEINAAIDSLDAEGSTNGGAGLELAYQQATKGFIKGGINRILL 317 +R P ++ + K E A++SL + G TN GL+ + K ++ I+L Sbjct: 295 STARRNFPLRLMTETGKQEALQAVNSLVSNGGTNIAEGLKKGARVLIDRRFKNPVSSIVL 354 Query: 318 ATDGDFNVGIDDPK-----SIESMV-KKQRESGVTLSTFGVGNSNYNEAMMVRIADVGNG 371 +DG + P ++++ K+ + + + FG G ++++ ++M IA+ G Sbjct: 355 LSDGQDTYTMTSPNGSRGTDYKALLPKEINGNRIPVHAFGFG-ADHDASLMHSIAENSGG 413 Query: 372 NYSYIDTLSEAQKVLNSEMRQMLITVAKDVKAQIEFNPAWVTEYRQIGYEKRQLRVEHFN 431 +S+I++ + Q + +L V +++ IE + R + Sbjct: 414 TFSFIESETVIQDAFAQCIGGLLSVVVQELCVTIECMHHLLRIGSVKAGSYRFDNGPNSR 473 Query: 432 NDNVDAGDIGAGKHITLLFELTLNGQKASIDKLRYAPDNKLAKSDKTKELAWLKIRWKYP 491 ++ GD+ A + L L + D + + K TKE L + Sbjct: 474 TGSIAVGDLYAEEERNFLVNLDIPIVDGVSDVMSLLKVQCVYKDPVTKETVNLNNSGEVK 533 Query: 492 QGKESQLVEFPLGPTINAPSEDMRFRAAVA-AYGQKLRGSEYLNNTSWQQIKQWAQQAKG 550 + + E ++ + +R RAA A + + L + + +G Sbjct: 534 ILRPIVMTERRPVVSVEVDRQRIRLRAAEAISEARVLAERG-----DLTEAVSVLETCRG 588 >UniRef50_D1HHA4 Whole genome shotgun sequence of line PN40024, scaffold_125.assembly12x (Fragment) n=1 Tax=Vitis vinifera RepID=D1HHA4_VITVI Length = 630 Score = 261 bits (667), Expect = 5e-68, Method: Composition-based stats. Identities = 85/439 (19%), Positives = 167/439 (38%), Gaps = 36/439 (8%) Query: 127 DVDTGSYANVRRFLNQGLLPPPDAVRVEEIVNYFPSDWDIKDKQSIPASKPIPFAMRYEL 186 DV + AN+ G+ D E +V D S+ + + EL Sbjct: 115 DVPFQAPANIGDPQCNGMGQAHD----EPLVVNSAESTDPTSLVSLSRPQLVTVKALPEL 170 Query: 187 APAPWNEQ--RTLLKVDILAK----DRKSEELPASNLVFLIDTSGSMISDERLPLIQSSL 240 +E + V I A D + +LV ++D SGSM +L L++ ++ Sbjct: 171 PAISASESFRTFAVLVGIKAPALLDDAHLLDRAPIDLVAVLDVSGSMAG-SKLSLLKRAV 229 Query: 241 KLLVKELREQDNIAIVTYAGDSRIALPS--ISGSHKAEINAAIDSLDAEGSTNGGAGLEL 298 L++ L D ++IV+++ +R P +S + + AI+SL + G TN GL+ Sbjct: 230 CFLIQNLGPSDRLSIVSFSSTARRIFPLRRMSDNGREAAGLAINSLTSSGGTNIVEGLKK 289 Query: 299 AYQQATKGFIKGGINRILLATDGDFNVGIDDPKS------IESMVKKQRESGVTLSTFGV 352 + + + + I+L +DG D+ S ++ R++ + + TFG Sbjct: 290 GVRVLEERSEQNPVASIILLSDGKDTYNCDNVNRRQTSHCASSNPRQGRQAIIPVHTFGF 349 Query: 353 GNSNYNEAMMVRIADVGNGNYSYIDTLSEAQKVLNSEMRQMLITVAKDVKAQIEFNPAWV 412 G S+++ M I+D G +S+I++++ Q + +L VA++++ ++ V Sbjct: 350 G-SDHDSTAMHAISDESGGTFSFIESVATVQDAFAMCIGGLLSVVAQELRLTVKSVSPGV 408 Query: 413 TEYRQIGYEKRQLRVEHFNNDNVDAGDIGAGKHITLLFELTLNGQKASIDKLRYAPDNKL 472 + + +D GD+ A + L LT+ ++ + R Sbjct: 409 HIESIPSGKYLSEICDQGQQGVIDVGDLYAEEGKEFLIYLTVPELSSAEGEERVKRTT-- 466 Query: 473 AKSDKTKELAWLKIRWKYPQGKESQLVEFPLGPTINAPSEDMRFRAAVAAYGQKLRGSEY 532 L + +K K E +E + +A G L G++ Sbjct: 467 --------LLDVMCSYKDSVSK-----EVDRQRNRLWVAEGIAEAQRMAETG-NLEGAKA 512 Query: 533 LNNTSWQQIKQWAQQAKGE 551 + + A G+ Sbjct: 513 VLAHRRSTLLSSASAQAGD 531 >UniRef50_B1ZQD5 von Willebrand factor type A n=1 Tax=Opitutus terrae PB90-1 RepID=B1ZQD5_OPITP Length = 859 Score = 256 bits (653), Expect = 2e-66, Method: Composition-based stats. Identities = 116/470 (24%), Positives = 194/470 (41%), Gaps = 36/470 (7%) Query: 110 DNPVKQVAQNPLATFSLDVDTGSYANVRRFLNQGLLPPPDAVRVEEIVNYFPSDWDIKDK 169 A+ P++TFSL V S+ + L +G +P P +R EE N F Sbjct: 416 PQAEVSTAKEPVSTFSLHVSDVSFQLAQAALARGEMPDPQRIRPEEFYNAFDYGDPT--- 472 Query: 170 QSIPASKPIPFAMRYELAPAPWNEQRTLLKVDILAKDRKSEELPASNLVFLIDTSGSMIS 229 + A R E A P +QR L+++ + NL L+DTSGSM Sbjct: 473 ----PASADKIACRIEQAAHPLLQQRNLVRIAMKVPAAGRGAGQPLNLTVLLDTSGSMER 528 Query: 230 DERLPLIQSSLKLLVKELREQDNIAIVTYAGDSRIALPSISGSHKAEINAAIDSLDAEGS 289 +R ++++L +L L D + ++ +A R+ S++G ++ + G Sbjct: 529 TDRATSVRAALGVLASLLTPDDRVTLIGFARQPRLLAESLAGDQARQLVDLASTTPFTGG 588 Query: 290 TNGGAGLELAYQQATKGFIKGGINRILLATDGDFNVGIDDPKSIESMVKKQRESGVTLST 349 TN A L LA + A + NRI+L TDG N+G DP + + ++ R+ G+ Sbjct: 589 TNLEAALSLAGELARRHHNAAAQNRIVLITDGAANLGNADPAQLATRIETLRQQGIAFDA 648 Query: 350 FGVGNSNYNEAMMVRIADVGNGNYSYIDTLSEAQKVLNSEMRQMLITVAKDVKAQIEFNP 409 GVG ++A++ + G+G Y +D A ++ A+++K Q+ FNP Sbjct: 649 CGVGTDGLDDAVLEALTRKGDGRYYVLDAPENADAGFARQLAGAFRPAAENIKVQVRFNP 708 Query: 410 AWVTEYRQIGYEKRQLRVEHFNNDNVDAGDIGAGKHITLLFELTLNGQKASIDKLRYAPD 469 A V YR IG+E+ +LR + F ND VDA ++ A + L+++ + Q Sbjct: 709 ARVASYRLIGFEQHRLREQDFRNDQVDAAELAAEESAVALYQVEVLPQGEG--------- 759 Query: 470 NKLAKSDKTKELAWLKIRWKYPQGKESQLVEF-----PLGPTINAPSEDMRFRAAVAAYG 524 EL + R++ P + P P S ++ A Sbjct: 760 ----------ELGDVFARFRDPATGAMIERSWTMLHEPRAPAFERASPSLQLAGVAALVA 809 Query: 525 QKLRGSEYLNNTSWQQIKQWAQQAKGEDPQGYRAE-----FIRLIELADG 569 +KLRG E ++ + +G Q R + F +L A Sbjct: 810 EKLRGGEAAGQIHLNELAGVVNRLRGHYGQNLRVQQLIAMFDQLRRRAGE 859 >UniRef50_Q9LN03 T6D22.13 n=2 Tax=Arabidopsis thaliana RepID=Q9LN03_ARATH Length = 641 Score = 251 bits (640), Expect = 6e-65, Method: Composition-based stats. Identities = 80/398 (20%), Positives = 157/398 (39%), Gaps = 32/398 (8%) Query: 163 DWDIKDKQSIPASKPIPFAMRYELAPA--PWNEQRTLLKVDILAK---DRKSEELPASNL 217 D I + + + E++ P + + V + A+ D +L Sbjct: 146 DTQIHSDGHRSDHQALEIKLFPEVSALAKPVSRADFAVLVHLKAEGVSDDARRARAPLDL 205 Query: 218 VFLIDTSGSMISDERLPLIQSSLKLLVKELREQDNIAIVTYAGDSRIALPS--ISGSHKA 275 + ++D SGSM ++ L+++++ +++ L E D +++++++ +R P +S + K Sbjct: 206 ITVLDVSGSMDG-VKMELMKNAMSFVIQNLGETDRLSVISFSSMARRLFPLRLMSETGKQ 264 Query: 276 EINAAIDSLDAEGSTNGGAGLELAYQQATKGFIKGGINRILLATDGDFN-----VGIDDP 330 A++SL A+G TN GL++ + K ++ ++L +DG N G+ Sbjct: 265 AAMQAVNSLVADGGTNIAEGLKIGARVIEGRRWKNPVSGMMLLSDGQDNFTFSHAGVRLR 324 Query: 331 KSIESMVKKQRESGVTLSTFGVGNSNYNEAMMVRIADVGNGNYSYIDTLSEAQKVLNSEM 390 ES++ + + TFG G S+++ +M I++V +G +S+I+T + Q + Sbjct: 325 TDYESLLPSSCR--IPIHTFGFG-SDHDAELMHTISEVSSGTFSFIETETVIQDAFAQCI 381 Query: 391 RQMLITVAKDVKAQIEFNPAWVTEYRQIGYEKRQLR-VEHFNNDNVDAGDIGAGKHITLL 449 +L V + +IE + I + R +D GD+ A + L Sbjct: 382 GGLLSVVILEQVVEIECIHEQGLKISSIKAGSYRSRIAPDARTATIDVGDMYAEEERDFL 441 Query: 450 FELTLNGQKASIDK--------LRYAPDNKLAKSDKTKELAWLKI-RWKYPQGKESQLVE 500 L + + +R + + K E L I R GKE +E Sbjct: 442 VLLEIPCCDNGSGESESLSLLKVRCVYKDPVTKEIVHVESGELSIQRPMKLTGKEVVSIE 501 Query: 501 FPLGPTINAPSEDMRFRAAVAAYGQK------LRGSEY 532 S+ M +A G LR E Sbjct: 502 VDRQLNRFLVSQAMSEARVLADGGDLSGAVGILRNRER 539 >UniRef50_Q7G2L9 Os10g0464500 protein n=3 Tax=Magnoliophyta RepID=Q7G2L9_ORYSJ Length = 719 Score = 251 bits (640), Expect = 7e-65, Method: Composition-based stats. Identities = 78/445 (17%), Positives = 156/445 (35%), Gaps = 30/445 (6%) Query: 153 VEEIVNYFPSDWDIKDKQSIPASKPIPFAMRYELAPAPWNEQRTLLKVDILAKDRKS-EE 211 E++V D + D+ + + +A + + E+ +L + Sbjct: 197 AEDVVGTQDVDSIVADEMAPASVGITTYAAFPAMEESVMVEEFAVLIHLKAPSSPATVTS 256 Query: 212 LPASNLVFLIDTSGSMISDERLPLIQSSLKLLVKELREQDNIAIVTYAGDSRIALPS--I 269 +LV ++D S SM +L L++ ++ +++ L D +++VT++ +R P + Sbjct: 257 RAPIDLVTVLDVSWSMAG-TKLALLKRAMSFVIQALGPGDRLSVVTFSSSARRLFPLRKM 315 Query: 270 SGSHKAEINAAIDSLDAEGSTNGGAGLELAYQQATKGFIKGGINRILLATDGDFNVGIDD 329 + S + + SL A+G TN L A + + + I+L +DG + Sbjct: 316 TESGRQRALQRVSSLVADGGTNIADALRKAARVMEDRRERNPVCSIVLLSDGRDTYTVPV 375 Query: 330 PKSIESMVKK---------------QRESGVTLSTFGVGNSNYNEAMMVRIADVGNGNYS 374 P+ + V + FG G ++++ M IA++ G +S Sbjct: 376 PRGGGGGGDQPDYAVLVPSSLLPGGGSARHVQVHAFGFG-ADHDSPAMHSIAEMSGGTFS 434 Query: 375 YIDTLSEAQKVLNSEMRQMLITVAKDVKAQIEFNPAWVTEYRQIGYEKRQLRVEHFNNDN 434 +ID Q + +L VA++++ +E V Sbjct: 435 FIDAAGSIQDAFAQCIGGLLSVVAQELRLSVECGDDGVLLTSVRSGGYASHVDGDGRGGF 494 Query: 435 VDAGDIGAGKHITLLFELTLNGQKASIDKLRYAPDNKLAKSDKT----KELAWLKIRWKY 490 VD GD+ A + L + + + + + + + +T + + Sbjct: 495 VDVGDLYADEERDFLVTVRVPAARGVSALITPSCTYRSTATMETVRVGGDTVTVPRTVDA 554 Query: 491 PQGKESQLVEFPLGPTINAPSEDMRFRAAVAAYGQKLRGSEYLNNTSWQQIKQWAQQAKG 550 P G + E +EDM A A RG L + + + Sbjct: 555 PVGYDGMSPEVERELHRVQATEDMAAARAAAE-----RGDFELAAAILDERRGVLESRAD 609 Query: 551 EDPQGYRAEFIRLIELADGVTDISQ 575 +DPQ A L E+ D V + Sbjct: 610 DDPQSV-ALAAELREMQDRVETRQR 633 >UniRef50_C5YHY2 Putative uncharacterized protein Sb07g005010 n=2 Tax=Sorghum bicolor RepID=C5YHY2_SORBI Length = 567 Score = 249 bits (636), Expect = 2e-64, Method: Composition-based stats. Identities = 79/445 (17%), Positives = 162/445 (36%), Gaps = 28/445 (6%) Query: 156 IVNYFPSDWDIKDKQSIPASKPIPFAMRYELA--PAPWNEQRTLLKVDILAKDRKSEELP 213 + P D Q A+ + + R V + AK Sbjct: 19 FQDDEPLDRPTAPPQGPAANGGRGLVLSTQCEFPAVGRFTSRDRFAVLVHAKAPSDVSRA 78 Query: 214 ASNLVFLIDTSGSMISDERLPLIQSSLKLLVKELREQDNIAIVTYAGDSRIA--LPSISG 271 +LV ++D S SM E+L L++ ++ ++ +L D +++VT++ D+ L +S Sbjct: 79 PLDLVTVLDVSDSMKG-EKLALLKQAMCFVIDQLGPADRLSVVTFSNDASRLTRLARMSD 137 Query: 272 SHKAEINAAIDSLDAEGSTNGGAGLELAYQQATKGFIKGGINRILLATDGDFNVGIDDPK 331 + KA A++SL +G TN G+ +A + K + ++L +DG N G + Sbjct: 138 AGKASAKIAVESLAVQGFTNIKQGIHVAAEVLAGRREKNVVAGMILLSDGHDNCGGTSVR 197 Query: 332 S------------IESMVKKQRESGVTLSTFGVGNSNYNEAMMVRIADVGNGNYSYIDTL 379 ++ + TFG G S ++ M +A+ G +S++ Sbjct: 198 PDGTKSYVNLVPPSLTVAAGSSRPAAPIHTFGFGTS-HDAGAMHAVAEATGGTFSFVGDE 256 Query: 380 SEAQKVLNSEMRQMLITVAKDVKAQIEFNPAWVTEYRQIGYEKRQLRVEHFNNDNVDAGD 439 + Q + +L ++ + + V + + +D G+ Sbjct: 257 AAIQDSFARCVGGLLSVAVQEARVAVTCLHRGVHVQQVKSGAYVSHVGADGHAATIDVGE 316 Query: 440 IGAGKHITLLFELTLNGQKASIDKLRYAPDNKLAKSDKTKELAWLKIRWKYPQGKESQLV 499 + G+ L + + +++ + R + + T + A + +L Sbjct: 317 LYDGEERRFLVLVHVPRARSTEEVTRLIKASCTYREAATGQAARKVAAPAAVVQRPLELA 376 Query: 500 EFPLGPTINAPSEDMRFRAA-------VAAYGQKLRGSEYLNNTSWQQIKQWAQQAKGED 552 P P+++ E +R AA AA G + G+ + + + ++Q A A G D Sbjct: 377 TLP-APSLDVERERVRLAAAEDIAAARTAADGGQNAGAARILESRLKAVEQSAPGAAGND 435 Query: 553 P--QGYRAEFIRLIELADGVTDISQ 575 P + + E L + Q Sbjct: 436 PTCEAIKEELRDLSARVGDRAEYQQ 460 >UniRef50_A5B5Z1 Putative uncharacterized protein n=2 Tax=Vitis vinifera RepID=A5B5Z1_VITVI Length = 686 Score = 249 bits (635), Expect = 2e-64, Method: Composition-based stats. Identities = 91/524 (17%), Positives = 180/524 (34%), Gaps = 70/524 (13%) Query: 85 TFARAAKAKATHIANPGTARYQQFDDNPVKQVAQNPLATFSLDVDTGSYANVRRFLNQGL 144 ++K + A N + Q +P D + N++ Q Sbjct: 68 QLCPICRSKWRDVPFQAPANIGDPQCNGMGQARVSPFHPPPEDFHGQTPRNLQPXSPQSP 127 Query: 145 LPPPDAVRVEEIVNYF-PSDWDIKDKQSIPASKPIPFAMRYELAPAPWNEQRTLLKVDIL 203 P + +VN +D S P + + A + + + V I Sbjct: 128 EPRHFSDDEPLVVNSAESTDPTSLVSLSRPQLVTVKALPEWPAISASESFRTFAVLVGIK 187 Query: 204 AK----DRKSEELPASNLVFLIDTSGSMISDERLPLIQSSLKLLVKELREQDNIAIVTYA 259 A D + +LV ++D SGSM +L L++ ++ L++ L D ++IV+++ Sbjct: 188 APALLDDAHLLDRAPIDLVAVLDVSGSMAG-SKLSLLKRAVCFLIQNLGPSDRLSIVSFS 246 Query: 260 GDSRIALPS--ISGSHKAEINAAIDSLDAEGSTNGGAGLELAYQQATKGFIKGGINRILL 317 +R P +S + + AI+SL + G TN GL+ + + + + I+L Sbjct: 247 STARRIFPLRRMSDNGREAAGLAINSLXSSGGTNIVEGLKKGVRVLEERSEQNPVASIIL 306 Query: 318 ATDGDFNV-------------GIDDPKSIESMVK--------KQRESG-------VTLST 349 +DG +P+ + + + RESG + + T Sbjct: 307 LSDGKDTYNCDNVNRRQTSHCASSNPRQVLEYLNLLPASICPRNRESGDEGRQAIIPVHT 366 Query: 350 FGVGNSNYNEAMMVRIADVGNGNYSYIDTLSEAQKVLNSEMRQMLITVAKDVKAQIEFNP 409 FG G S+++ M I+D G +S+I++++ Q + +L VA++++ ++ Sbjct: 367 FGFG-SDHDSTAMHAISDESGGTFSFIESVAXVQDAFAMCIGGLLSVVAQELRLTVKSVS 425 Query: 410 AWVTEYRQIGYEKRQLRVEHFNNDNVDAGDIGAGKHITLLFELTLNGQKASIDKLRYAPD 469 V + + +D GD+ A + L LT+ ++ + R Sbjct: 426 PGVHIESIPSGKYLSEICDQGQQGVIDVGDLYAEEGKEFLIYLTVPELSSAEGEERVKRT 485 Query: 470 NKLAKSDKTKELAWLKIRWKYPQGKESQLVEFPLGPTINAP------------------- 510 L + +K KE VE Sbjct: 486 T----------LLDVMCSYKDSVSKEVVQVECERVEIRRPEVLSPMDMIVCLEVDRQRNR 535 Query: 511 ---SEDMRFRAAVAAYGQKLRGSEYLNNTSWQQIKQWAQQAKGE 551 +E + +A G L G++ + + A G+ Sbjct: 536 LWVAEGIAEAQRMAETG-NLEGAKAVLAHRRSTLLSSASAQAGD 578 >UniRef50_Q8H924 Os10g0464900 protein n=3 Tax=Oryza sativa RepID=Q8H924_ORYSJ Length = 646 Score = 246 bits (628), Expect = 2e-63, Method: Composition-based stats. Identities = 60/343 (17%), Positives = 126/343 (36%), Gaps = 27/343 (7%) Query: 214 ASNLVFLIDTSGSMISDERLPLIQSSLKLLVKELREQDNIAIVTYAGDSRIA--LPSISG 271 +LV ++D SGSM+ +L L++ ++ ++ L D + +++++ + L ++ Sbjct: 173 PLDLVTVLDVSGSMVG-NKLALLKQAMGFVIDNLGPGDRLCVISFSSGASRLMRLSRMTD 231 Query: 272 SHKAEINAAIDSLDAEGSTNGGAGLELAYQQATKGFIKGGINRILLATDGDFNVGIDD-- 329 + KA A+ SL A G TN GA L A + + + ++L +DG + Sbjct: 232 AGKAHAKRAVGSLSARGGTNIGAALRKAAKVLDDRLYRNAVESVILLSDGQDTYTVPPRG 291 Query: 330 -------------PKSIESMVKKQRESGVTLSTFGVGNSNYNEAMMVRIADVGNGNYSYI 376 P + + + TFG G +++ A M IA+V G +S+I Sbjct: 292 GYDRDANYDALVPPSLVRADAGGGGGRAPPVHTFGFGK-DHDAAAMHTIAEVTGGTFSFI 350 Query: 377 DTLSEAQKVLNSEMRQMLITVAKDVKAQIEFNPAWVTEYRQIGYEKRQLRVEHFNNDNVD 436 + + Q + +L ++++ + V + + VD Sbjct: 351 ENEAAIQDGFAQCIGGLLSVAVQELRLDVACVDTGVRVTAVKSGRYKSHIEDDGRAAKVD 410 Query: 437 AGDIGAGKHITLLFELTLNGQKASIDKLRYAPDNKLAKSDKTKELAWLKIRWKYPQGKES 496 G++ A + + L + + A D + + +T + + + S Sbjct: 411 VGELYADEERSFLLFVVVPRAPAWDDVTHLIEVSCSYRDMETGRTTSVAGDEEAVVLRPS 470 Query: 497 QL--------VEFPLGPTINAPSEDMRFRAAVAAYGQKLRGSE 531 + VE +D+ A A G+ +E Sbjct: 471 RAESGVAERSVEVDRELVRVEAIDDIALARAAAERGEYAEAAE 513 >UniRef50_Q235T9 von Willebrand factor type A domain containing protein n=5 Tax=Tetrahymena thermophila RepID=Q235T9_TETTH Length = 703 Score = 243 bits (619), Expect = 2e-62, Method: Composition-based stats. Identities = 72/368 (19%), Positives = 154/368 (41%), Gaps = 28/368 (7%) Query: 201 DILAKDRKSEELPASNLVFLIDTSGSMISDERLPLIQSSLKLLVKELREQDNIAIVTYAG 260 D+ K E P +L+ +ID SGSM ++ +++++ L++ L E D ++++T+ Sbjct: 197 DMEVKSNPLEGRPNLDLICVIDNSGSMNDFSKIENVKNTILQLLEMLNENDRLSLITFNT 256 Query: 261 DSRIA--LPSISGSHKAEINAAIDSLDAEGSTNGGAGLELAYQQATKGFIKGGINRILLA 318 ++ L +++ +K + S+ A+G T+ G+E+A+Q K ++ I L Sbjct: 257 KAKQLCGLKNVNNQNKKSLQTITKSIKADGGTDIIRGIEIAFQILQSRKQKNSVSSIFLL 316 Query: 319 TDGDFNVGIDDPKSIESMV-KKQRESGVTLSTFGVGNSNYNEAMMVRIADVGNGNYSYID 377 +DG N+ K++ K+ +E T+ +FG GN +++ +M +IA + +G++ +++ Sbjct: 317 SDGQDNLADAGIKNLLKTTYKQLQEESFTIHSFGFGN-DHDGPLMQKIAQIKDGSFYFVE 375 Query: 378 TLSEAQKVLNSEMRQMLITVAKDVKAQIE-------FNPAWVTEYRQIGYEKRQLRVEHF 430 + + + + VA+D+ +IE F + Y Y + Sbjct: 376 KNDQVDEFFIDALGGLFSVVAQDLTIKIEINRQNELFQKFFKNSYISKTYGHMWKIINQN 435 Query: 431 NNDNVDAGDIGAGKHITLLFELTLNGQKASIDKLRYAPDNKLAKSDKTKELAWLKIRWKY 490 ++ I +G +FELT+ + L ++ E+ +++ + Sbjct: 436 QELRININQIFSGVSKDFIFELTVPKSE----------IKDLQDFERNLEIINVQLTARP 485 Query: 491 PQGKESQLVEFPLGPTINAPSEDMRFRAAVAAYGQKLRGSEYLNNTSWQQIKQWAQQAKG 550 L E L T+ +E VA + E+ N + + + K Sbjct: 486 VDSMLQTLKESKLVLTLFTDNEQ------VAQDSEINDKVEF-NYIRVKAAQAIEEAIKY 538 Query: 551 EDPQGYRA 558 D Y Sbjct: 539 ADQNQYNQ 546 >UniRef50_C7PTX8 von Willebrand factor type A n=1 Tax=Chitinophaga pinensis DSM 2588 RepID=C7PTX8_CHIPD Length = 462 Score = 241 bits (616), Expect = 4e-62, Method: Composition-based stats. Identities = 86/352 (24%), Positives = 158/352 (44%), Gaps = 13/352 (3%) Query: 197 LLKVDILAKDRKSEE-LPASNLVFLIDTSGSMISDERLPLIQSSLKLLVKELREQDNIAI 255 L V+I + ++ + N+ ++D SGSM +++ + + K L+ +L D+++I Sbjct: 62 YLYVNIKGGEGEASKPRVPLNISLVLDRSGSMSG-DKIKYARQAAKFLIDQLNSTDHLSI 120 Query: 256 VTYAGDSRIALPSISGSHKAEINAAIDSLDAEGSTNGGAGLELAYQQATKGFIKGGINRI 315 V Y + PS S +K + AAID + GSTN G+ Y Q +G +NR+ Sbjct: 121 VNYDDRVEVTSPSQSVKNKEALKAAIDKIHDRGSTNLSGGMLEGYTQVKSTRKEGYVNRV 180 Query: 316 LLATDGDFNVGIDDPKSIESMVK-KQRESGVTLSTFGVGNSNYNEAMMVRIADVGNGNYS 374 LL TDG N GI DP ++ + + K +E G+ LSTFGVG ++YNE ++ +A+ G NY Sbjct: 181 LLLTDGLANQGITDPLELKRLAENKYKEDGIALSTFGVG-ADYNEDLLTMLAENGRANYY 239 Query: 375 YIDTLSEAQKVLNSEMRQMLITVAKDVKAQIEFNPAWVTEYRQIGYEKRQLRVEHFNNDN 434 +ID+ + ++ E++ +L VA++ A+I P + GY Sbjct: 240 FIDSPDKIPQIFAGELKGLLSVVAQNAWAEISI-PQDMECTYVYGYPYEV----KGGKVL 294 Query: 435 VDAGDIGAGKHITLLFELTLNGQKASIDKLRYAPD-NKLAKSDKTKELAWLKIRWKYPQG 493 V D+ A +L +L G + + ++ ++ KE ++I+ + Sbjct: 295 VRFNDLYANDEKAILIKLKSKGTYTTNLRFDCTVGYTNVSSFEQVKESKPVQIKMTSDKE 354 Query: 494 KESQLVEFPLGPTIN--APSEDMRFRAAVAAYGQKLRGSEYLNNTSWQQIKQ 543 + + + + ++ A GQ + + Q +K Sbjct: 355 LVASGEDKEVQEMLALFESTQRFDDIMADVDKGQY-DAARKKGQAAVQVLKD 405 >UniRef50_Q6ZED8 Slr7060 protein n=2 Tax=Chroococcales RepID=Q6ZED8_SYNY3 Length = 588 Score = 238 bits (606), Expect = 5e-61, Method: Composition-based stats. Identities = 67/363 (18%), Positives = 137/363 (37%), Gaps = 18/363 (4%) Query: 187 APAPWNEQRTLLKVDILAKDRKSEE--LPASNLVFLIDTSGSMISDERLPLIQSSLKLLV 244 A L + I + + P+ NL F+ID SGSM ++ + ++ + Sbjct: 14 AVCSERAVTLDLIIRITPPSPPAMDQPRPSLNLGFVIDRSGSMEGHNKITYARQAVCYAI 73 Query: 245 KELREQDNIAIVTYAGDSRIALPSISGSHKAEINAAIDSLDAEGSTNGGAGLELAYQQAT 304 +L D++++ + + +PS KA+ + ++ G T+ G Q + Sbjct: 74 DQLSPGDHLSVTIFDDQVQTLIPSTLVKDKAQFKRLVQGINPGGCTDLHGGWLQGGIQVS 133 Query: 305 KGFIKGGINRILLATDGDFNVGIDDPKSIESMVKKQRESGVTLSTFGVGNSNYNEAMMVR 364 + +NRI+L +DG N G +P I + V + G + +T G+G+ +YNE ++ Sbjct: 134 QNL-SAELNRIILLSDGLANRGETNPDIIATDVHGLAQRGASTTTLGLGD-DYNEDLLEA 191 Query: 365 IADVGNGNYSYIDTLSEAQKVLNSEMRQMLITVAKDVKAQIEFNPAWVTEYRQIGYEKRQ 424 +A G+GNY Y+ + + E++ + T V +E Sbjct: 192 MARSGDGNYYYVADAEQLPTIFERELQGLAATYGNGVTLTATSQAGVQVLDLLNDFELD- 250 Query: 425 LRVEHFNNDNVDAGDIGAGKHITLLFELTLNGQKASIDKLRYAPDNKLAKSDKTKELAWL 484 N ++ G I ++ L + +I + + L+ D ++ L Sbjct: 251 ------NQGRYQLPNLIYGDSIDVVVRLKVP----AIKEEQILGTVTLSWLDGERQKQTL 300 Query: 485 KIRWKYPQGKESQLVEFPLGPTINAPSEDMRFRAAVAAYGQKLRGSEYLNNTSWQQIKQW 544 + + P + + P + M A +++ +Y Q+ Q Sbjct: 301 MVNLQLPVLTKVEFEALPSNQEVQQQVALMMSARAKKEAMERVDRGDYGGA---GQVLQE 357 Query: 545 AQQ 547 A+ Sbjct: 358 ARA 360 >UniRef50_A9V370 Predicted protein n=1 Tax=Monosiga brevicollis RepID=A9V370_MONBE Length = 471 Score = 237 bits (604), Expect = 1e-60, Method: Composition-based stats. Identities = 83/388 (21%), Positives = 158/388 (40%), Gaps = 11/388 (2%) Query: 169 KQSIPASKPIPFAMRYELAPAPWNEQRTLLKVDILAKDRKSEELPASNLVFLIDTSGSMI 228 PA + ++ A E + + A D + E PA +LV +ID SGSM Sbjct: 12 TAEAPAPLQLDVRPLWQYAEIGARESSAYISCRLTAPDFEPVERPAIDLVAVIDVSGSMA 71 Query: 229 SDERLPLIQSSLKLLVKELREQDNIAIVTYAGDSRIA--LPSISGSHKAEINAAIDSLDA 286 ++L ++QS+L+ L++ L++ D A+VT+ D + L ++ +HK A + L A Sbjct: 72 G-QKLKMVQSTLEFLMRNLKDTDRFALVTFDSDVKTVFDLRPMTTAHKEACLADVQKLRA 130 Query: 287 EGSTNGGAGLELAYQQATKG-FIKGGINRILLATDGDFNVGIDDPKSIESMVKKQR--ES 343 TN GL + + KG ++ ILL TDG N G+ D + ++ Sbjct: 131 GSCTNLSGGLFRGVELMQQRGATKGAVSSILLMTDGIANEGVRDKDDMCRALRGLMGPAP 190 Query: 344 GVTLSTFGVGNSNYNEAMMVRIADVGNGNYSYIDTLSEAQKVLNSEMRQMLITVAKDVKA 403 T+ TFG G ++NE M+ ++++ GNG Y +I++ + + +L A++++ Sbjct: 191 DYTIYTFGYGK-DHNENMLRQLSETGNGMYYFIESNDIIPESFGDCLGGLLSVFAQNIEV 249 Query: 404 QIEFNPAWVTEYRQIGYEKRQLRVEHFNNDNVDAGDIGAGKHITLLFELTLNGQKASIDK 463 ++ T +++ ++ + GDI + + +LFE+ L + Sbjct: 250 KLSAVHPEAT-IKRVCMQRAATLADDRRTATFTIGDIQSEEVKEVLFEVNLPLVELHDAV 308 Query: 464 LRYAPDNKLAKSD-KTKELAWLKIRWKYPQGKESQLVEFPLGPTINAPSEDMRFRAAVAA 522 ++D L+ ++ E P N D VA Sbjct: 309 AEAGTATAYFRADVSYLNLSSSSFEKHAATFSTARPAEVPAVREANEAVTD-AIARFVAV 367 Query: 523 YGQKLRGSEYLNNTSWQQIKQWAQQAKG 550 G + + +++ Q +G Sbjct: 368 DGME-EARRLAEAGKFDAVRERLQDVRG 394 >UniRef50_A9GUK8 Putative uncharacterized protein yfbK n=1 Tax=Sorangium cellulosum 'So ce 56' RepID=A9GUK8_SORC5 Length = 521 Score = 236 bits (601), Expect = 2e-60, Method: Composition-based stats. Identities = 86/418 (20%), Positives = 158/418 (37%), Gaps = 18/418 (4%) Query: 113 VKQVAQNPLATFSLDVDTGSYANVRRFLNQGLLPPPDAVRVEEIVNYFPSDWDIKDKQSI 172 A P + + R+ L G +P PD + D+ Sbjct: 32 FAAPADPPGSVGVSQGGAQDFGLFRQILEDGEIPGPDTLDDVGFFAEHKLDYPAATCGED 91 Query: 173 PASKPIPFAMRYELAPAPWNEQRTLLKVDILAKDRKSEELPASNLVFLIDTSGSMISDER 232 + M ++ +P + D + E P +LV +DTSGSM + Sbjct: 92 VCMHGLLGIMGNMISGSP---CTLIQIGMNSPVDLGALERPPLHLVIAVDTSGSMEG-DP 147 Query: 233 LPLIQSSLKLLVKELREQDNIAIVTYAGDSRIALPSISGSHKAEINAAIDSLDAEGSTNG 292 + +++ L ++ L+ D I++V Y+ + + L GS + + A + L A GSTN Sbjct: 148 IAYVRAGLVEMIDALQPTDRISLVRYSDAAEVVLEQAEGSDREALTEAFEGLTARGSTNL 207 Query: 293 GAGLELAYQQATKGFIKGGINRILLATDGDFNVGIDDPKSIESMVKKQRESGVTLSTFGV 352 GL AY A + NR++ +DG G+ P+ + S+ E G+ L+ GV Sbjct: 208 YEGLFTAYALAEQHLDPAWQNRVIFLSDGVATAGLTSPQRLVSLAAGYAEKGIGLTAIGV 267 Query: 353 GNSNYNEAMMVRIADVGNGNYSYIDTLSEAQKVLNSEMRQMLITVAKDVKAQIEFNPAWV 412 G + ++ M I++VG GN+ +++ ++V E++ L+ +A DV+ + +V Sbjct: 268 G-AEFDVDAMRGISEVGAGNFYFLEDPKAVEEVFAEEVKTFLVPLALDVELDVAVGDGYV 326 Query: 413 TEYRQIGYEKRQLRVEHFNNDNVD----AGDIGAGKHITLLFELTLNGQKASIDKLRYAP 468 R E ++ AG A + + + L P Sbjct: 327 --VRGAYGTNGWQGGERGGAVHIPSLFLAGRTSAAEPVG-----SGRRGGGGAILLELVP 379 Query: 469 DNKLAKSDKTKELAWLKIRWKYPQGKESQLVEFPLGPTI--NAPSEDMRFRAAVAAYG 524 + + + L + W++P E+ E + +AP E F G Sbjct: 380 KPDQRGVEDPRAVGSLALSWRHPLTGEAHAQEVDIEAPSAPDAPPEAGYFSGDTVEKG 437 >UniRef50_A4J6Q3 von Willebrand factor, type A n=1 Tax=Desulfotomaculum reducens MI-1 RepID=A4J6Q3_DESRM Length = 416 Score = 235 bits (600), Expect = 3e-60, Method: Composition-based stats. Identities = 87/381 (22%), Positives = 157/381 (41%), Gaps = 15/381 (3%) Query: 189 APWNEQRTLLKVDILAKDRKSEELPASNLVFLIDTSGSMISDERLPLIQSSLKLLVKELR 248 P N+Q L V + A + +E P NL F+ID SGSM E+L + ++ V L Sbjct: 16 LPGNKQVAYLMVKLTAPKQVEKERPVQNLSFVIDRSGSMAG-EKLDYTKKAVAFAVGHLS 74 Query: 249 EQDNIAIVTYAGDSRIALPSISGSHKAEINAAIDSLDAEGSTNGGAGLELAYQQATKGFI 308 QD ++V + + S ++K + A++S+ GSTN G+ L ++ Sbjct: 75 PQDYCSVVAFDDMVTMVASSHQVANKDALKMAVESIYPGGSTNLSGGMLLGVREVKLAHK 134 Query: 309 KGGINRILLATDGDFNVGIDDPKSIESMVKKQRESGVTLSTFGVGNSNYNEAMMVRIADV 368 + INR+LL TDG NVG+ D ++ ++ GV LSTFG+G ++ E ++ + + Sbjct: 135 ENQINRVLLLTDGMANVGVTDHSALVEKSREMAAGGVNLSTFGLGE-DFEEDLLQAMVEA 193 Query: 369 GNGNYSYIDTLSEAQKVLNSEMRQMLITVAKDVKAQIEFNPAWVTEYRQIGYEKRQLRVE 428 G GN+ YI+ + + E+ +L VA+++ +++ G E Sbjct: 194 GGGNFYYIEKPDQIPGIFEQELTGLLSIVAQNLSVKVKPGQG----VSITGVLGYPFSSE 249 Query: 429 HFNNDNVDAGDIGAGKHITLLFELTLNGQKASIDKLRYAPDNKLAKSDKTKELAWLKIRW 488 V+ DI +G+ LL EL ++ KL + + A K+ L LK Sbjct: 250 EG--VTVNLPDIYSGESKLLLLELLISPLTEGNHKL-ISVELDYADVRKSLALVNLKAEL 306 Query: 489 KY-----PQGKESQLVEFPLGPTINAPSEDMRFRAAVAAYGQKLRGSEYLNNTSWQQIKQ 543 + ++ +E + ++ +A G + S + +++ Sbjct: 307 SINASAEIGDEPAENIEVIKQVELFRCAQAKEEAIRLADQGD-FQASRLVLENQLYKLQS 365 Query: 544 WAQQAKGEDPQGYRAEFIRLI 564 D E + Sbjct: 366 LGACLDSSDLNMEVNELQENL 386 >UniRef50_Q3A188 von Willebrand factor type A domain protein n=1 Tax=Pelobacter carbinolicus DSM 2380 RepID=Q3A188_PELCD Length = 442 Score = 235 bits (599), Expect = 4e-60, Method: Composition-based stats. Identities = 76/379 (20%), Positives = 149/379 (39%), Gaps = 33/379 (8%) Query: 204 AKDRKSEELPASNLVFLIDTSGSMISDERLPLIQSSLKLLVKELREQDNIAIVTYAGDSR 263 + ++ + P NL ++D SGSM ++ + + V+ L + D ++V Y Sbjct: 55 PRAPRTAQRPPVNLALVLDRSGSMSG-NKIAKAREAAIEAVRRLSDGDLFSLVVYDDSVE 113 Query: 264 IALPSISGSHKAEINAAIDSLDAEGSTNGGAGLELAYQQATKGFIKGGINRILLATDGDF 323 +P+ S +I A I + GST + + K +NR++L +DG Sbjct: 114 TLVPAQPVSDIGDIEARIRRIRPGGSTALFGAVSQGAAEVRKHSDAPYVNRVVLLSDGLA 173 Query: 324 NVGIDDPKSIESMVKKQRESGVTLSTFGVGNSNYNEAMMVRIADVGNGNYSYIDTLSEAQ 383 NVG P + + + G++++T GVG ++NE +M ++A+ +GN+ ++++ + Sbjct: 174 NVGPSRPADLARLGAALLKEGISVTTVGVGT-DFNEDLMTQLAERSDGNHYFVESSRDLP 232 Query: 384 KVLNSEMRQMLITVAKDVKAQIEFNPAWVTEYRQIGYEKRQLRVEHFNNDNVDAGDIGAG 443 ++ +E+ +L VA+ V IE P V R IG E V + G Sbjct: 233 RIFAAELGDVLSVVARKVVISIEC-PQGVKPLRVIGREGSI----KGQRVEVRMNQLYGG 287 Query: 444 KHITLLFELTLNGQKASIDKLRYAPDNKLAKSDKTKELAWLKIRWKYPQGKESQLVEFPL 503 + L E+ + +A+ +LA + R++ + + Sbjct: 288 QEKYALVEVEVPASRANQ----------------KLDLARVDCRYRNALTDTGESSTAMV 331 Query: 504 GPTINAPSEDMRFRA------AVAAYGQKLRGSE----YLNNTSWQQIKQWAQQAKGEDP 553 + SE++R A AV + Y + + Q+++ Sbjct: 332 QTRFSKRSEEVRKAASKDVQKAVVENEMAVARDRALNLYNAGRKPEAARVLRQKSQSLQE 391 Query: 554 QGYRAEFIRLIELADGVTD 572 Q F L + A + D Sbjct: 392 QNAILGFDDLAQEAGQLQD 410 >UniRef50_C0PS55 Putative uncharacterized protein n=1 Tax=Picea sitchensis RepID=C0PS55_PICSI Length = 829 Score = 231 bits (588), Expect = 6e-59, Method: Composition-based stats. Identities = 78/475 (16%), Positives = 164/475 (34%), Gaps = 51/475 (10%) Query: 141 NQGLLPPPDAVRVEEIVNYFPSDWDIKDKQSIPASKPIPFAMRYELAPAPWNEQRTLLKV 200 N+G E D + ++ ++ + A+ E+ E V Sbjct: 264 NEGQTSLLTDDDREFEFKGLFVDHEGTSDRAAGNARKMRIALYPEVEAVAAGEACENFTV 323 Query: 201 DILAKDRKSEE--------------------LPASNLVFLIDTSGSMISDERLPLIQSSL 240 + K + E +LV ++D SGSM +L L++ ++ Sbjct: 324 LVHVKAPSASEASKKQNYEDCEGNMVKDPGCRAPIDLVTVLDVSGSMSG-TKLALLKRAM 382 Query: 241 KLLVKELREQDNIAIVTYAGDSRIALPS--ISGSHKAEINAAIDSLDAEGSTNGGAGLEL 298 ++ L +D +++V ++ ++ ++ + N ++ L G TN GL Sbjct: 383 AFVISNLSPEDRLSVVVFSSTAKRVFSLKRMTPDGQRAANRVVERLLCTGGTNIAEGLRK 442 Query: 299 AYQQATKGFIKGGINRILLATDGDFNVGIDDPKSIESMVKKQRES----------GVTLS 348 + + + I+L +DG + + +QR S + + Sbjct: 443 GAKVLEDRRQRNPVASIMLLSDGQDTYSLSSRGVVLFPSDEQRRSARQSTRYGHVQIPVH 502 Query: 349 TFGVGNSNYNEAMMVRIADVGNGNYSYIDTLSEAQKVLNSEMRQMLITVAKDVKAQIE-F 407 FG G +++ A M I++V G +S+I S Q + +L V +DV+ + Sbjct: 503 AFGFGV-DHDAATMHAISEVSGGTFSFIQAESLVQDAFAQCIGGLLSVVVQDVRVTVSAC 561 Query: 408 NPAWVTEYRQIGYEKRQLRVEHFNNDNVDAGDIGAGKHITLLFELTLNGQKASIDKLRYA 467 + + YE E ++ V+ GD+ A + +L EL K++ + + Sbjct: 562 AGTKLKSFHAGSYET--CVAEDGSHGTVNLGDLYAEEERDILVELKFPAVKSASNPMNLI 619 Query: 468 P------DNKLAKSDKTKELAWLKIRWKYPQGKESQLVEFPLGPTINAPSEDMRFRAAVA 521 D +S ++E ++ +R + G + L + +R A+A Sbjct: 620 SVGCFFKDPVSQRSFHSREQSFSILRPESTDG-----LPVALNLEVEKERNRLRTAQAIA 674 Query: 522 AYGQKLRGSEYLNNTSW--QQIKQWAQQAKGEDPQGYRAEFIRLIELADGVTDIS 574 + + Q G+ E + E+ +T+I Sbjct: 675 EARTLADQGDMSGAQRVLQNAKIELQQTRTGDHSLSLALE-AEITEIQARMTNIQ 728 >UniRef50_B9SJS6 Protein binding protein, putative n=6 Tax=Magnoliophyta RepID=B9SJS6_RICCO Length = 540 Score = 231 bits (588), Expect = 6e-59, Method: Composition-based stats. Identities = 82/425 (19%), Positives = 160/425 (37%), Gaps = 34/425 (8%) Query: 161 PSDWDIKDKQSIPASKPIPFAMRYEL---APAPWNEQRTLLKVDILAKDRKSEELPASNL 217 D + +S P +P ++ AP E + + +++ D S P +L Sbjct: 43 NDDDEKIVTRSRPTPPIVPARVKLRSINNDMAPLEESKLKVMLELTGGDSSSYGRPGLDL 102 Query: 218 VFLIDTSGSMISDERLPLIQSSLKLLVKELREQDNIAIVTYAGDSRIALPS--ISGSHKA 275 V ++D S SM +++ +++++ ++K+L D ++IVT++G + P +G + Sbjct: 103 VAVLDVSRSMEG-DKMEKMKTAMLFIIKKLGPTDRLSIVTFSGGANRLCPLRQTTGKSQE 161 Query: 276 EINAAIDSLDAEGSTNGGAGLELAYQQATKGFIKG-GINRILLATDGDFNVGIDDPKSIE 334 E I+ L+A+G+TN AGL+ A + G + I+L +DG+ N G D Sbjct: 162 EFENLINGLNADGATNITAGLQTALKVLKGRSFNGERVVGIMLMSDGEQNAGSD------ 215 Query: 335 SMVKKQRESGVTLSTFGVGNSNYNEAMMVRIADVG-NGNYSYIDTLSEAQKVLNSEMRQM 393 V + TFG G N+ + IA G +S + + K + + Sbjct: 216 --ATGVSVGNVPIHTFGFGI-NHEPKGLKAIAHNSIGGTFSDVQNIDSLTKAFAQCLAGL 272 Query: 394 LITVAKDVKAQIEFNPAWVTEYRQIGYEKRQLRVEHFNNDNVDAGDIGAGKHITLLFELT 453 L V +D+K I + Q R + + V GD+ + + ++ +L Sbjct: 273 LTVVVQDLKMTIAQPKDESKIQQVSAGSYPQTRDDVAGSVTVTFGDLYSKEVRKVIVDLL 332 Query: 454 LNGQKAS----IDKLRYAPDNKLAKSDKTKELAWLKIRWKYPQGKESQLVEFPLGPTINA 509 L + ++ YA + + ++ P +E V T Sbjct: 333 LPSASKGWGGNVLEITYAYSTRGKLFEAPPATLTVRRTVASPVQEERPEVIT--EETRLR 390 Query: 510 PSEDMRFRAAVAAYGQKLRGSEYLNNTSWQQIKQWAQQAKGEDPQG-----YRAEFIRLI 564 + M+ A+A Y + + + +D R E +L+ Sbjct: 391 TAGMMKEARAMAD-----NNKLYDARDKLVEAENLLEDVV-DDGHNPVIEMLRLELQQLL 444 Query: 565 ELADG 569 +L Sbjct: 445 KLMKS 449 >UniRef50_A1ZUW0 Von Willebrand factor, type A n=1 Tax=Microscilla marina ATCC 23134 RepID=A1ZUW0_9SPHI Length = 425 Score = 230 bits (586), Expect = 1e-58, Method: Composition-based stats. Identities = 70/347 (20%), Positives = 153/347 (44%), Gaps = 12/347 (3%) Query: 204 AKDRKSEELPASNLVFLIDTSGSMISDERLPLIQSSLKLLVKELREQDNIAIVTYAGDSR 263 K + +E N+ ++D SGSM ++L ++ ++ ++ L+ D ++IV Y + Sbjct: 34 GKAPEKQERIPLNISLVVDRSGSMSG-DKLNYVKKAVDFVIDNLKSDDVLSIVQYDDEID 92 Query: 264 IALPSISGSHKAEINAAIDSLDAEGSTNGGAGLELAYQQATKGFIKGGINRILLATDGDF 323 + S ++K ++ + + A TN G+ Y Q G +NR+LL +DG Sbjct: 93 VVASSAKVTNKKALHEKVKGIQARNMTNLSGGMMEGYAQVKSTQSNGYVNRVLLLSDGLA 152 Query: 324 NVGIDDPKSIESMVKKQ-RESGVTLSTFGVGNSNYNEAMMVRIADVGNGNYSYIDTLSEA 382 N GI P+ ++ + +K+ RE+G+ LSTFGVG S++NE +M +++ G NY +ID + Sbjct: 153 NAGITAPEQLQQIAQKKFREAGIALSTFGVG-SDFNEVLMTNLSEYGGANYYFIDMPDKI 211 Query: 383 QKVLNSEMRQMLITVAKDVKAQIEFNPAWVTEYRQIGYEKRQLRVEHFNNDNVDAGDIGA 442 ++ E+ +L VA++ ++ F +++ + G+ + +V+ D+ A Sbjct: 212 PQIFAQELEGLLSVVAQNTTLEVVFPQSYLKCTQVYGFPANISPDK----VSVNFNDVVA 267 Query: 443 GKHITLLFELTLNGQKASIDKLRYAPDNKLAKSDKTKELAWLKIRWKYPQGKESQLVEFP 502 + +L + + + + + E ++ + + Sbjct: 268 EEEKAVLIKFEV--IRTPDEPFVLKTRLQYDDVIDKMERITDELDLRMELTTDEHAYRAG 325 Query: 503 LGPTINAPSEDMRFRAAVAAYGQKLRGSEYLNNTSWQQIKQWAQQAK 549 + + E A + Q ++ ++ + +QI A+ + Sbjct: 326 INAKV---HEQTALFTANDLFEQAIKVADGRDFEKAKQIIAQAKASL 369 >UniRef50_C5WZE3 Putative uncharacterized protein Sb01g019910 n=1 Tax=Sorghum bicolor RepID=C5WZE3_SORBI Length = 704 Score = 230 bits (585), Expect = 2e-58, Method: Composition-based stats. Identities = 75/443 (16%), Positives = 157/443 (35%), Gaps = 29/443 (6%) Query: 145 LPPPDAVRVEEIVNYFPSDWDIKDKQSIPASKPIPFAMRYELAPAPWNEQRTLLKVDILA 204 +P P +E + + + + F E +++ + + + Sbjct: 169 IPEPAVFNDDEQIETAVGGGHYEIPPLLEITTYTEFPAIQESVA----QEQFAILIHLRV 224 Query: 205 KDRKSEELPASNLVFLIDTSGSMISDERLPLIQSSLKLLVKELREQDNIAIVTYAGDSRI 264 +LV ++D S SM +L L++ +++ +++ L D +++V ++ + Sbjct: 225 PTWVRT-RAPLDLVTVLDVSRSMSGP-KLALLKRAMRFVIENLEPSDRLSVVAFSSSACR 282 Query: 265 ALPS--ISGSHKAEINAAIDSLDAEGSTNGGAGLELAYQQATKGFIKGGINRILLATDGD 322 P ++ + + A+DSL A+G TN GL A + + + I+L +DG Sbjct: 283 LFPLRKMTAFGQQQSQQAVDSLVADGGTNIAEGLRKAARVVEDRQARNPVCSIILLSDGV 342 Query: 323 FNV------GIDDPKSIESMVKKQR----ESGVTLSTFGVG---NSNYNEAMMVRIADVG 369 + G +V + E V + FG+G + +++ M +A + Sbjct: 343 DSHNLPPRDGSAPEPDYAPLVPRSILPGSEHHVPIHAFGLGMDHDHDHDSRAMHAVAQMS 402 Query: 370 NGNYSYIDTL-SEAQKVLNSEMRQMLITV--AKDVKAQIEFNPAWVTEYRQIGYEKRQLR 426 +G +S+ID + S Q L + +L A++ + +E V Sbjct: 403 SGTFSFIDMVGSSIQDALAQCIGGLLSVSVVAQETRLSVECADQGVLLTSIKSGSYASGV 462 Query: 427 VEHFNNDNVDAGDIGAGKHITLLFELTLNGQKASIDKLRYAPDNKLAKSDKTKELAWLKI 486 V G + A + L + + + S +R A + + + + Sbjct: 463 DGDGRGGFVHVGRLYADEERDFLVTVRVPPSRVSTALVRPLCTYHDAVTAEMVRVGGDPV 522 Query: 487 ---RWKYPQGKESQLVEFPLGPTINAPSEDMRFRAAVAAYGQKLRGSEYLNNTSWQQIKQ 543 R ++P L + +EDM A A G R + L ++ Sbjct: 523 MLLRPEFPVSAGVSL-QVERERHRVRATEDMAAAQAAAEDGDYARAASILATRRVL-LES 580 Query: 544 WAQQAKGEDPQGYRAEFIRLIEL 566 A + + Q AE + E Sbjct: 581 CASLSWDQQTQALVAELREMQER 603 >UniRef50_UPI00017450FB von Willebrand factor type A domain protein n=1 Tax=Verrucomicrobium spinosum DSM 4136 RepID=UPI00017450FB Length = 424 Score = 230 bits (585), Expect = 2e-58, Method: Composition-based stats. Identities = 78/378 (20%), Positives = 157/378 (41%), Gaps = 21/378 (5%) Query: 182 MRYELAPAP---WNEQRTLLKVDILAKDRK-SEELPASNLVFLIDTSGSMISDERLPLIQ 237 + ELA + T LKV + ++ + S + N+ +ID SGSM +++ + Sbjct: 3 LSVELAHPQILAGRKMTTYLKVGLTGQELEASAKRAPVNVTIVIDKSGSM-GGDKMVHAR 61 Query: 238 SSLKLLVKELREQDNIAIVTYAGDSRIALPSISGSHKAEINAAIDSLDAEGSTNGGAGLE 297 + K + L D +++V Y + P+ + + + AAID + A GST +G+ Sbjct: 62 EAAKQALDRLGAGDMVSVVAYDDAVSLISPATDLTDRDRVKAAIDRIQAGGSTALFSGIS 121 Query: 298 LAYQQATKGFIKGGINRILLATDGDFNVGIDDPKSIESMVKKQRESGVTLSTFGVGNSNY 357 ++ + +NR++L +DG NVG P+ + + + G+T++T G+G Y Sbjct: 122 KGAEELRRNKRPNQVNRVVLLSDGMANVGPSSPQDLGRLGASLAKEGITVTTLGLGLG-Y 180 Query: 358 NEAMMVRIADVGNGNYSYIDTLSEAQKVLNSEMRQMLITVAKDVKAQIEFNPAWVTEYRQ 417 NE +M +A +GN+++I+ + +E +L VA+ ++ +++ V R Sbjct: 181 NEDLMTELALRSDGNHAFIENSQNLAGIFQTEFGDILSVVAQRIRVRVQCAEG-VRPVRV 239 Query: 418 IGYEKRQLRVEHFNNDNVDAGDIGAGKHITLLFELTLNGQ----KASIDKLRYAPDNKLA 473 +G E H + ++ I A +H LL E+ + A + N L Sbjct: 240 LGREADI----HGQDVELEMNQIYARQHKYLLLEVEIPEGVADTDAPVATAEVISVNALT 295 Query: 474 KSDKTKELAWLKIRWKYP-----QGKESQLVEFPLGPTINAPSEDMRFRAAVAAYGQKLR 528 + + R P ++ LVE + + ++ + + + Sbjct: 296 GIENRSLTRSVARRVNDPALIANTVNKAVLVEVARAISTEKEALAVKLSDE-GRHEEATK 354 Query: 529 GSEYLNNTSWQQIKQWAQ 546 + S + K+ A Sbjct: 355 AYNAACSWSLSEAKRLAS 372 >UniRef50_C5WYV0 Putative uncharacterized protein Sb01g047470 n=3 Tax=Sorghum bicolor RepID=C5WYV0_SORBI Length = 686 Score = 229 bits (584), Expect = 2e-58, Method: Composition-based stats. Identities = 71/382 (18%), Positives = 140/382 (36%), Gaps = 45/382 (11%) Query: 167 KDKQSIPASKPIPFAMRYELAPAPWNEQRTLLKVDILAKDR-----------KSEELPAS 215 +Q+ +S + E + + + V + K + Sbjct: 107 NQRQATASSGMLAVKTHVEFSAVARDSSQDHFAVLVHVKAPGVIVNEAAAGDRDAPRAPL 166 Query: 216 NLVFLIDTSGSMISDERLPLIQSSLKLLVKELREQDNIAIVTYAGDSRIA--LPSISGSH 273 +LV ++D SGSM ++L L++ ++ ++ L D +++V+++ +R L +S + Sbjct: 167 DLVTVLDVSGSMRW-DKLALVKQAMGFVIGSLGPHDRLSVVSFSSGARRVTRLLRMSHTG 225 Query: 274 KAEINAAIDSLDAEGSTNGGAGLELAYQQATKGFIKGGINRILLATDGDFNVGIDD---- 329 K+ A++SL A G TN GL A + + + ++ ++L +DG N + Sbjct: 226 KSLATEAVESLRAGGGTNIAEGLRTAAKVLGERRHRNAVSSVILLSDGHDNYSMPRRARG 285 Query: 330 ----------PKSIESMVKKQRESGVTLSTFGVGNSNYNEAMMVRIADVGNGNYSYIDTL 379 P S E + TFG GN +++ A M +A+ G +S+I+ Sbjct: 286 GVPPNYEVLVPPSFVPGTASTGEGSAPIHTFGFGN-DHDAAAMHVVAEATGGTFSFIENE 344 Query: 380 SEAQKVLNSEMRQMLITVAKDVKAQIEFNPAWVTEYRQIGYEKRQLRVEHFNNDNVDAGD 439 + Q + +L VA++ + I V E + ++ G+ Sbjct: 345 AVIQDAFAQCIGGLLTVVAQEARVAIACGHPGVRISSVKSGRYESRVDEDGRSASIAVGE 404 Query: 440 IGAGKHITLLFELTLNGQKASIDKLRYAPDNKLAKSDKTKELAWLKIRWKYPQGKESQLV 499 + A + L LT+ +A+ + LK R Y + V Sbjct: 405 LYADEERRFLLFLTVPPVEAT----------------DGDDTLLLKARCSYREAAGGTHV 448 Query: 500 EFPLGPTINAPSEDMRFRAAVA 521 + T+ A E A Sbjct: 449 DVTAEDTVVARPEHAADAERSA 470 >UniRef50_UPI00006CDDCC von Willebrand factor type A domain containing protein n=1 Tax=Tetrahymena thermophila RepID=UPI00006CDDCC Length = 2138 Score = 226 bits (577), Expect = 1e-57, Method: Composition-based stats. Identities = 55/276 (19%), Positives = 124/276 (44%), Gaps = 11/276 (3%) Query: 212 LPASNLVFLIDTSGSMISDERLPLIQSSLKLLVKELREQDNIAIVTYAGDSRIALPSIS- 270 +L+ +IDTSGSM + L L++ +L LV L+ D I ++ ++ +++ P +S Sbjct: 1443 RFPIDLICVIDTSGSMNG-QPLDLLKETLLFLVDLLQTGDRICLIQFSTNAQRLTPLLSI 1501 Query: 271 --GSHKAEINAAIDSLDAEGSTNGGAGLELAYQQATKGFIKGGINRILLATDGDFNVGID 328 + I I+ L A+G TN G++LA+ + K I + L +DG + + Sbjct: 1502 ESKDNIKSIKNEINRLVAKGGTNICQGMQLAFDVLKQRRYKNPITSVFLLSDGLNDGAEN 1561 Query: 329 DPKSIESMV---KKQRESGVTLSTFGVGNSNYNEAMMVRIADVGNGNYSYIDTLSEAQKV 385 + + + + E T+ TFG G +++ +M +I+ + +GN+ YI + + Sbjct: 1562 KIRDLLKQLNFYQNYNEENFTIQTFGFGK-DHDPNLMDKISQLMDGNFYYIGDIHRIDEC 1620 Query: 386 LNSEMRQMLITVAKDVKAQIEFNPAWVTEYRQI-GYEKRQLRVEHFNNDNVDAGDIGAGK 444 + + ++++V ++ + R + Y E + +++ + +G Sbjct: 1621 FIDALGGLFSVISQNVSINVQVPQEMREQIRIVKTYGDIWHTKEPYYEYSININQLLSGV 1680 Query: 445 HITLLFELTLNGQKASIDKLRYAPDNKLAKSDKTKE 480 + E+ L+ + I+ L+ + + ++ E Sbjct: 1681 SKDYILEMELDSKL--IEDLQCIHEIHSEEEIQSSE 1714 >UniRef50_Q67LZ3 Putative uncharacterized protein n=1 Tax=Symbiobacterium thermophilum RepID=Q67LZ3_SYMTH Length = 414 Score = 226 bits (575), Expect = 2e-57, Method: Composition-based stats. Identities = 86/338 (25%), Positives = 145/338 (42%), Gaps = 12/338 (3%) Query: 189 APWNEQRTLLKVDILAKDRKSEELPASNLVFLIDTSGSMISDERLPLIQSSLKLLVKELR 248 +P + LL + E P NL ++D SGSM L + +L+ LV ++ Sbjct: 17 SPSGGEVYLLVTVKAPRMPAPEGRPPLNLAAVVDRSGSMAG-AALYFTKQALRFLVDQMA 75 Query: 249 EQDNIAIVTYAGDSRIALPSISGSHKAEINAAIDSLDAEGSTNGGAGLELAYQQATKGFI 308 E+D +AIVTY + PS K + +D + A G+TN GL QQ Sbjct: 76 EEDRLAIVTYDDQVHVPFPSQPVVQKDAVRLLVDGITAGGTTNLSGGLATGMQQIRPHAG 135 Query: 309 KGGINRILLATDGDFNVGIDDPKSIESMVKKQRESGVTLSTFGVGNSNYNEAMMVRIADV 368 G ++R+LL TDG NVG+ DP + + RE G+ +ST GVG +++E ++V +A+ Sbjct: 136 PGRVSRVLLMTDGLANVGVTDPDVLAGWARAWREKGLAVSTMGVGP-HFSEDLLVALAEA 194 Query: 369 GNGNYSYIDTLSEAQKVLNSEMRQMLITVAKDVKAQIEFNPAWVTEYRQIGYEKRQLRVE 428 G GN+ YI + ++ E+ +L + + IE + V +GY + + Sbjct: 195 GGGNFHYIANPDQIPRIFQEELHGLLQVAVQGLHLIIE-TESGVAVSGVLGYRSQGTPLR 253 Query: 429 HFNNDNVDAGDIGAGKHITLLFELTLNGQKASIDKLRYAPDNKLAKSDKTKELAWLKIRW 488 + D+ AG+ +L L++ A R A A + + Sbjct: 254 ----AALSLPDLYAGEVKHVLVRLSVAAPPAGGKLGRVALHYLPAAAGGRPGTLEADVSL 309 Query: 489 KYPQGKESQLVEFPLGPTINAPSEDMRFRAAVAAYGQK 526 + ++L E P + +R A AA+ + Sbjct: 310 E-VTDDPARLGEPPDETVMRQ----LRLSQAGAAWDEA 342 >UniRef50_Q6ZFR4 Zinc finger (C3HC4-type RING finger) protein family-like n=8 Tax=Oryza sativa RepID=Q6ZFR4_ORYSJ Length = 703 Score = 225 bits (572), Expect = 4e-57, Method: Composition-based stats. Identities = 76/444 (17%), Positives = 159/444 (35%), Gaps = 32/444 (7%) Query: 131 GSYANVRRFLNQGLLPPPDAVRVEEIVNYFPSDWDIKDKQSIPASKPIPFAMRYELAPAP 190 + A RR +G + E + + + + E Sbjct: 182 AAGAITRRVTEEGHAYDDN--EPVESPASRGGGEPGGGEAAANDGELVVIKTHCEFPAIA 239 Query: 191 WNEQRTLLKVDILAKDRK-----SEELPASNLVFLIDTSGSMISDERLPLIQSSLKLLVK 245 + R V + K + + +LV ++D SGSM +L L++ ++ L Sbjct: 240 RSTPRDNFAVLLHVKAPSIAAEAAPARASVDLVTVLDVSGSMEGY-KLALLKRAMGL--- 295 Query: 246 ELREQDNIAIVTYAGDSRIA--LPSISGSHKAEINAAIDSLDAEGSTNGGAGLELAYQQA 303 L D +A+V+++ +R L +S KA +A++SL A+G TN GL A + Sbjct: 296 -LGPGDRLAVVSFSYSARRVIRLTRMSEGGKASAKSAVESLHADGCTNILEGLVEAAKVF 354 Query: 304 TKGFIKGGINRILLATDGDFNV------GIDDPKSIESMV----KKQRESGVTLSTFGVG 353 + + ++L +DG N G + K+ +V K+ + + + TFG G Sbjct: 355 DGRRYRNAVASVILLSDGQDNYNVNGGWGASNSKNYSVLVPPSFKRSGDRRLPVHTFGFG 414 Query: 354 NSNYNEAMMVRIADVGNGNYSYIDTLSEAQKVLNSEMRQMLITVAKDVKAQIEFNPAWVT 413 +++ + M IA+ G +S+I+ + Q + +L ++ + I A V Sbjct: 415 T-DHDASAMHTIAEETGGTFSFIENQAVVQDAFAQCIGGLLSVPVQEARIAITCPHAAVR 473 Query: 414 EYRQIGYEKRQLRVEHFNNDNVDAGDIGAGKHITLLFELTLNGQKASIDKLRYAPDNKLA 473 + +VD G++ A + L + + A D + Sbjct: 474 VRSVNSGRYDSVIDGDGRAASVDVGELYADEERRFLVFVDVPAAGAGEDVTELIKVSCTY 533 Query: 474 KSDKTKELAWLKI------RWKYPQGKESQLVEFPLGPTINAPSEDMRFRAAVAAYGQKL 527 + +++ + R +E +ED+ A G Sbjct: 534 RDTASRQQMVVAGEDAVVQRPAEVSTSTEPSMEVERERFCVEATEDIAAAREAAERGAY- 592 Query: 528 RGSEYLNNTSWQQIKQWAQQAKGE 551 ++ + + + + + A++ G+ Sbjct: 593 AAAKAILDRRQEALARSARRLAGD 616 >UniRef50_C1D2W8 Putative uncharacterized protein n=1 Tax=Deinococcus deserti VCD115 RepID=C1D2W8_DEIDV Length = 418 Score = 224 bits (570), Expect = 9e-57, Method: Composition-based stats. Identities = 76/361 (21%), Positives = 148/361 (40%), Gaps = 8/361 (2%) Query: 182 MRYELAPAPWNEQRTLLKVDILAKDRKSEELPASNLVFLIDTSGSMISDERLPLIQSSLK 241 +R L L++V + + P NL F+ID SGSM S L + + + Sbjct: 11 LRAGLTAGQTTTLTLLIRVHPAPVTTQVSQRPPLNLAFVIDRSGSM-SGLPLQMAKQAAI 69 Query: 242 LLVKELREQDNIAIVTYAGDSRIALPSISGSHKAEINAAIDSLDAEGSTNGGAGLELAYQ 301 V++ R D +++V + + +PS + + + AI ++D GSTN G Sbjct: 70 AAVRQARPDDRVSVVAFDDRVDVIVPSQLATSREAVIQAIGTIDDRGSTNLHGGWLEGAT 129 Query: 302 QATKGFIKGGINRILLATDGDFNVGIDDPKSIESMVKKQRESGVTLSTFGVGNSNYNEAM 361 Q + G +NR++L +DG NVG+ D + I V+ E G++ +T G+G S+Y+E + Sbjct: 130 QVAQHLTPGALNRVILLSDGQANVGVTDRREIARQVRGLTERGISTTTIGLG-SHYDEEL 188 Query: 362 MVRIADVGNGNYSYIDTLSEAQKVLNSEMRQMLITVAKDVKAQIEFNPAWVTEYRQIGYE 421 ++ IA+ G+GN+ +++ S E++ + T + V +E NP + + Sbjct: 189 LLAIANAGDGNFEHVEDPSRLPTFFEEELQGLTRTTGRIVSLGLEPNPEHGVLVSDVLND 248 Query: 422 KRQLRVEHFNNDNVDAGDIGAGKHITLLFELTLNGQKASIDKLRYAPDNKLAKSDKTKEL 481 + + ++ G+ + +L LT+ Q + +LA + Sbjct: 249 FERNSF-----GRLQLPNLVGGQPVDVLATLTVPPQPQRGGQTVGVTRVRLAWTGTDGAR 303 Query: 482 AWLKIRWKYPQGKESQLVEFPLGPTINAPSEDMRFRAAVAAYGQKL-RGSEYLNNTSWQQ 540 L+ + P P + ++ + + RG ++ Sbjct: 304 RKLRAQLDLPVMAADAYHTLQEDPAVLEAQALLQAARYKREAVEAIDRGDRTTARQRYRD 363 Query: 541 I 541 I Sbjct: 364 I 364 >UniRef50_C1GWG1 von Willebrand factor type A domain containing protein n=1 Tax=Paracoccidioides brasiliensis Pb01 RepID=C1GWG1_PARBA Length = 773 Score = 222 bits (566), Expect = 3e-56, Method: Composition-based stats. Identities = 76/433 (17%), Positives = 148/433 (34%), Gaps = 48/433 (11%) Query: 164 WDIKDKQSIPASKPIPFAMRYELAPAPWNEQRTLLKVDILAKDRKSEELPASNLVFLIDT 223 + A P + +L P +L + K ++V ID Sbjct: 24 RPRLSTSTTIAEAQNPDEVGVQLHPLSDKNS-LILSIHPPLHPEKDIRHVPCDIVLCIDV 82 Query: 224 SGSMI-----------------SDERLPLIQSSLKLLVKELREQDNIAIVTYAGDSRIA- 265 SGSM L L + + + +++ L E D + +VT++ D+ +A Sbjct: 83 SGSMQLSAPLPTTDESGKREETGLSVLDLTKHAARTIIETLNENDRLGVVTFSNDAEVAY 142 Query: 266 -LPSISGSHKAEINAAIDSLDAEGSTNGGAGLELAYQQATK-GFIKGGINRILLATDGDF 323 + + ++K A+++L STN GL+L K + + + TDG Sbjct: 143 KISHMDDTNKKAALEAVEALQPLASTNLWHGLKLGLSVLGKVDLRPQNVQALYVLTDGQP 202 Query: 324 NVGIDDPK---SIESMVKKQRESGVTLSTFGVGNSNYNEAMMVRIADVGNGNYSYIDTLS 380 N + ++++Q++ + TFG G + ++ IA+VG G YS+I Sbjct: 203 NHMCPRQGYVPKLRPILERQKDRLPLIHTFGFG-YDIRSGLLQSIAEVGGGTYSFIPDAG 261 Query: 381 EAQKVLNSEMRQMLITVAKDVKAQIEF---NPAWVTEYRQIGYEKRQLRVEHFNNDNVDA 437 V + + T A + + E + G ++ + + V Sbjct: 262 MIGTVFVHAIANLYTTFATQARVLLRTSGSAELVQDEGSKTGLLLDEMSAKDG-DIIVTV 320 Query: 438 GDIGAGKHITLLFEL-TLNGQK-ASIDKLRYAPDNKLAKSDKTKELAWLKIRWKYPQGKE 495 G++ G+ L+ + + G A+ L Y + ++L ++K Sbjct: 321 GNLQYGQSRDLVLRIKNVTGNASAAQATLTYNFQGSVKSVISNEQLLS---QYKSLPAHV 377 Query: 496 SQ-LVEFPLGPTINAPSEDMR------------FRAAVAAYGQKLRGSEYLNNTSWQQIK 542 S + T +R A A ++ ++ L T Sbjct: 378 SSYHLSRARICTFLRSIYPLRQDYEYMYLDANGLEKARAELDIVIKETKKLGYTDIANA- 436 Query: 543 QWAQQAKGEDPQG 555 + GEDP+G Sbjct: 437 SLVRDLAGEDPEG 449 >UniRef50_A7HVH6 Vault protein inter-alpha-trypsin domain protein n=1 Tax=Parvibaculum lavamentivorans DS-1 RepID=A7HVH6_PARL1 Length = 755 Score = 221 bits (563), Expect = 5e-56, Method: Composition-based stats. Identities = 91/527 (17%), Positives = 171/527 (32%), Gaps = 48/527 (9%) Query: 23 PENKESQQQQPSTPTEQQVLAAQQAAIKEAEQSAAAAKALAQQEVQQYSDKQALQGRLQE 82 E ++Q + S +Q+ + + QQ V++ D+ +L+ + Sbjct: 146 YEEAKAQGHKASLVEQQRPNVFTNSVANIGPGETIIVQIEYQQTVRRDGDRFSLRFPMVV 205 Query: 83 APTFARAAKAKATHIANPGTARYQ--------QFDDNPVKQVAQNPLATFSL--DVDTG- 131 AP + PG + + PV AQ + SL +D G Sbjct: 206 APRYTPKTADPQLVDFAPGGGWGEVRQSEPENDLEQPPVLHPAQGQINPVSLALSLDAGF 265 Query: 132 -----SYANVRRFLNQGLLPPPDAVRVEEIVNYFPSDWDIKDKQSIPASKPIPFAMRYEL 186 S + + LN+ EE+ P++ D + A+K A+ E Sbjct: 266 ALGDISSTHHKIALNRDGKQKATLKLAEELT---PANKDFELVWKPAAAKAPAAALFRE- 321 Query: 187 APAPWNEQRTLLKVDILAKDRKSEELPASNLVFLIDTSGSMISDERLPLIQSSLKLLVKE 246 + LL + E +F+ID SGSM + + SL + Sbjct: 322 ---RVGNEDYLLVMLTPPSGSVQPEAKPREAIFVIDNSGSMSGPS-MVQAKESLLWALDR 377 Query: 247 LREQDNIAIVTYAGDSRIALP---SISGSHKAEINAAIDSLDAEGSTNGGAGLELAYQQA 303 L+ D ++ + + P G + A + SL+A G T L + Sbjct: 378 LKPGDTFNVIRFDDTLTVLFPDAVPAHGENLAVAKKFVKSLEANGGTEMLPALRASL--I 435 Query: 304 TKGFIKG-GINRILLATDGDFNVGIDDPKSIESMVKKQRESGVTLSTFGVGNSNYNEAMM 362 + G + +I+ TDG I + + + L T G+G++ N M Sbjct: 436 DRNVNDGTRLRQIVFLTDGA----ISNEAELFHEITSNLGRS-RLFTVGIGSAP-NSYFM 489 Query: 363 VRIADVGNGNYSYIDTLSEAQKVLNSEMRQMLITVAKDVKAQI------EFNPAWVTEYR 416 R ++ G G +++I +E + + ++ V ++ A E P V + Sbjct: 490 TRASEAGRGTFTHIGKETEVTERMAELFEKLQNPVMTNITATWPDGRTTESWPNPVPDLY 549 Query: 417 QIGYEKRQLRVEHFNNDNVDAGDIGAGKHITLLFELTLNGQKASIDKLRYAPDNKLAKSD 476 + R+ GD+ AG+ + +L + I KL ++D Sbjct: 550 KGEPVVLSARMPKATGTLTLKGDV-AGEPWEVRMQLNAGETRPGIGKLWARNKIGQLEAD 608 Query: 477 KTKELAWLK-----IRWKYPQGKESQLVEFPLGPTINAPSEDMRFRA 518 W K +R S+ + + + Sbjct: 609 AQIAGDWEKHDAEILRVALDHNLVSRRTSLVAVDVTPSRPAGEKLAS 655 >UniRef50_Q21PJ3 von Willebrand factor, type A n=1 Tax=Saccharophagus degradans 2-40 RepID=Q21PJ3_SACD2 Length = 763 Score = 221 bits (563), Expect = 6e-56, Method: Composition-based stats. Identities = 87/502 (17%), Positives = 178/502 (35%), Gaps = 44/502 (8%) Query: 45 QQAAIKEAEQSAAAAKALAQQEVQQYSDKQALQGRLQEAPTFARAAKAKATHIANPGTAR 104 + +A AE++ + A Q + T + H+ +P Sbjct: 223 ENSAPDTAEKTPTSIDLAAGGYGWQSFNPIIH--------TQKPTPQVPDAHLISPPMVL 274 Query: 105 YQQ-FDDNPVKQVAQNPLATFSLDVDTG-SYANVRRFLNQGLL--PPPDAVRVEEIVNYF 160 Q + D +Q ++ AT S+ +D G + AN+ +Q + PP A VE Sbjct: 275 AQGQYGDGQYEQTGKDNRATISIQLDAGFNVANIESLYHQITINKPPSSAYNVELTNGST 334 Query: 161 PSDWDIKDKQSIPASKPIPFAMRYELAPAPWNEQRTLLKVDILAKDRKSEELPASNLVFL 220 D D + AS A+ E E LL + ++ + + ++VF+ Sbjct: 335 LMDRDFVLQWRATASSAPQAAVFKETLA---GEDYLLLMLLPPQGQQQHTQSLSRDIVFV 391 Query: 221 IDTSGSMISDERLPLIQSSLKLLVKELREQDNIAIVTYAGDSRIALPS---ISGSHKAEI 277 +DTSGSM + + SL+ ++ L D I+ + + S+ Sbjct: 392 VDTSGSMQGTS-IQQAKRSLQFALRGLNPSDTFNIIEFDTSFSRFRSRPVSATASNVQAA 450 Query: 278 NAAIDSLDAEGSTNGGAGLELAYQQA--------TKGFIKGGINRILLATDGDFNVGIDD 329 + +++L+A+ T A LE A+ Q + +++ TDG + + Sbjct: 451 VSWVNNLNADNGTEMYAALEEAFDQLASINPNGTENSKSSNNLQQVVFITDGA----VGN 506 Query: 330 PKSIESMVKKQRESGVTLSTFGVGNSNYNEAMMVRIADVGNGNYSYIDTLSEAQKVLNSE 389 +++ S++ + R + L T +G++ N M + A G G +I +E +N+ Sbjct: 507 EQALLSLIHR-RLNNARLFTVAIGSAP-NSYFMRKAAQFGKGANVFIGDTAEVTHKMNAL 564 Query: 390 MRQMLITVAKDVKAQI----EFNPAWVTEYRQIGYEKRQLRVEHFNNDNVDAGDIGAGKH 445 + ++ T+ D+ Q E P + + G +D A + Sbjct: 565 LSKLKTTLVSDINVQWPQQSEVYPQRIPDLYA-GEPLLLAAKTSGAMGTIDISGNTALQP 623 Query: 446 ITLLFELTLNGQKASIDKLRYAPDNKLAKSDKTKELAWLKIRWKYPQGKESQLVEFP--- 502 + + + ++ + KT+ +R + + + P Sbjct: 624 WQSQLTINPYHNNSGVAQVWAKSKIDALEDSKTEGANPQDVRKQVVDVALTHALITPYTS 683 Query: 503 ---LGPTINAPSEDMRFRAAVA 521 + ++ P+ AVA Sbjct: 684 FVAVEELVSRPAHQPVQTQAVA 705 >UniRef50_Q9U7P4 Putative uncharacterized protein (Fragment) n=1 Tax=Eufolliculina uhligi RepID=Q9U7P4_9CILI Length = 494 Score = 221 bits (563), Expect = 6e-56, Method: Composition-based stats. Identities = 69/422 (16%), Positives = 165/422 (39%), Gaps = 32/422 (7%) Query: 166 IKDKQSIPASKPIPFAMRYELAPAPWNEQRTLLKVDILAKDRKSE-ELPASNLVFLIDTS 224 ++ + FA Y L +P Q +++ + + SE ++V +ID S Sbjct: 40 EPTVGAVDIAAYGVFAFNY-LQLSPEKAQEIPCTINLESPAQTSEASRSGVDIVCVIDVS 98 Query: 225 GSMISDERLPLIQSSLKLLVKELREQDNIAIVTYAGDSRIA--LPSISGSHKAEINAAID 282 GSM E++ L+Q++L +V+ L D I +++++ D+ L +S K ++ + I Sbjct: 99 GSMQG-EKIQLVQTTLNFMVERLSPADRICLISFSNDATKISRLVQMSPKGKKQLKSMIP 157 Query: 283 SLDAEGSTNGGAGLELAYQQATKGFIKGGINRILLATDGDFNVGIDDPKSIESMVKK-QR 341 L A G TN GLE Q + ++ I+L +DG N G + ++ + Sbjct: 158 RLVASGGTNIVGGLEYGLQALRQRRTINQLSSIILLSDGQDNNGTTVLQRAKATMDSIVI 217 Query: 342 ESGVTLSTFGVGNSNYNEAMMVRIADVGNGNYSYIDTLSEAQKVLNSEMRQMLITVAKDV 401 ++ TFG G+ ++ ++ +A+ NG + Y+ + + +++ VA + Sbjct: 218 RDDYSVHTFGYGHG-HDSTLLNALAEPKNGAFYYVKDEETIATAFANCLGELMSVVADQI 276 Query: 402 KAQIEFNPAWVTEYRQIGYEKRQLRVEHFNNDNVDAGDIGAGKHITLLFELTLNGQKASI 461 + ++ P + Y + + +G +F + + + Sbjct: 277 EVKLMTQPTEIPFSLSKVYSNS-------GDTVFTLPPLMSGDKKEAVFLVQFDPTTQRV 329 Query: 462 D---KLRYAPDNKLAKSDKTKELAW----LKIRWKYPQ-GKESQLVEFPLGPTINAPSED 513 + +++ + + + +R + P+ +E + L + D Sbjct: 330 ESGHRIQPICFKLKYRIVSNGNIVEQEYPIWLRVENPEYEEEIVIDSDVLVNFYRMKAAD 389 Query: 514 MRFRAAVAAYGQKLRGSEYLNNTSWQQIKQWAQQAKGEDPQGYRAEFIRLIELADGVTDI 573 + +A+ A + ++ L + +++ + E ++ + AD + I Sbjct: 390 IMKQASKLADRNQFEAAKELLTSGNKELLDSVVAS---------HEIVQAL-AADLLRSI 439 Query: 574 SQ 575 ++ Sbjct: 440 NE 441 >UniRef50_Q22SJ4 von Willebrand factor type A domain containing protein n=8 Tax=Tetrahymena thermophila RepID=Q22SJ4_TETTH Length = 646 Score = 220 bits (561), Expect = 1e-55, Method: Composition-based stats. Identities = 86/565 (15%), Positives = 200/565 (35%), Gaps = 47/565 (8%) Query: 41 VLAAQQAAIKEAEQSAAAAKALAQQEVQQYSDKQALQGRLQEAPTFARAAKAKATHIANP 100 Q +IK + + LA++ S + Q + KA Sbjct: 15 PQKKQSVSIKTEAKKISQPSTLAKKTTSSISKNLSNQVKTSPQVQLISPVIKKAVLQPQS 74 Query: 101 GTARYQQFDDNPVKQVAQNPLATFSLDVDTGSYANVRRFLNQGLLPPPDAVRVEEIVNYF 160 + Q NPV+ P + + +++ + + + Sbjct: 75 VVVQKQPLAQNPVQAKTIQPKTVVPKSLTSIQKSDI--SIEKQEKIQKLDTKTMLQEQEI 132 Query: 161 PSDWDIKDKQSIPASKPIPFAMRYELAPAPW-----NEQRTLLKVDILAKDRKS------ 209 D K + S + + +E+ NEQ + + + K + S Sbjct: 133 KPDLQQMVKDAKKPSYDLEKGLTFEIKTLNKHFQFNNEQDCNIPIMVSVKTQDSTNDILE 192 Query: 210 ----------EELPASNLVFLIDTSGSMISDERLPLIQSSLKLLVKELREQDNIAIVTYA 259 + P+ +LV +ID SGSM E++ ++++L L+ L D ++++ + Sbjct: 193 EQKEQVKQVEQSRPSIDLVCVIDNSGSMQG-EKIQNVKTTLLQLLDMLNSNDRLSLILFN 251 Query: 260 GDSRIA--LPSISGSHKAEINAAIDSLDAEGSTNGGAGLELAYQQATKGFIKGGINRILL 317 + L + + I + I+S+ A+G T+ +G+ +A+ K ++ I L Sbjct: 252 SYPTLLCNLRKVDDENTPNIQSIINSITADGGTDINSGMLMAFNILQKRQFFNPVSSIFL 311 Query: 318 ATDGDFNVGIDDPKSIESMVKKQRESGVTLSTFGVGNSNYNEAMMVRIADVGNGNYSYID 377 +DG N + K + + + ++ +FG G S+++ +M RI + +GN+ Y++ Sbjct: 312 LSDGQDNGADEKIKKYINSNQSLKNECFSIHSFGFG-SDHDGPLMNRICQLKDGNFYYVE 370 Query: 378 TLSEAQKVLNSEMRQMLITVAKDVKAQIEFNPAWVTEYRQI--------GYEKRQLRVEH 429 +++ + + + VA+++ +I N +++ Y ++ Sbjct: 371 KINQVDEFFVDALGGLFSVVAQEILIEINLN-RQDKNFQKYFSNCKVSKTYGDMWKCIKQ 429 Query: 430 FNNDNVDAGDIGAGKHITLLFELTLNGQKASIDKLRYAPDNKLAKSDKTKELAWLKIRWK 489 + + +G + E+ + Q+ + + + L + I++ Sbjct: 430 DEIYQIKINQLFSGVSKDFIMEIVVPKQEVKMLE---DFERNLEIVKGQLTAIPVDIQYT 486 Query: 490 YPQGKESQLV--------EFPLGPTINAPSEDMRFRAAVAAYGQKLRGSEYLNNTSWQQI 541 K+S L+ P+ IN E R A + Q+ Sbjct: 487 TKIVKDSNLILTLFLQNETVPVDSEINDAVEFNYLRVQAAEAMEVAINFSEKQQYDQGQV 546 Query: 542 KQWAQQAKGEDPQGYRAEFIRLIEL 566 + +K E+ E +++++ Sbjct: 547 VLKSILSKIENSHPKNKEKVQILKK 571 >UniRef50_UPI0001926ED6 PREDICTED: similar to inter-alpha trypsin inhibitor, heavy chain 3, partial n=1 Tax=Hydra magnipapillata RepID=UPI0001926ED6 Length = 464 Score = 218 bits (555), Expect = 4e-55, Method: Composition-based stats. Identities = 80/407 (19%), Positives = 156/407 (38%), Gaps = 35/407 (8%) Query: 175 SKPIPFAMRYELAPAPWNEQRTLLKVD--------ILAKDRKSEELPASNLVFLIDTSGS 226 K + A E P+ E+ + + + +++ + +LV +ID SGS Sbjct: 3 DKKLTVACSTEYKDYPFKEKLDIWTLISLKAPSLGMTLDEKEHRKRAPIDLVVVIDKSGS 62 Query: 227 MISDERLPLIQSSLKLLVKELREQDNIAIVTYAGDSRIA--LPSISGSHKAEINAAIDSL 284 M E+L L++ +L+ +V +L E+D + ++T+ + L ++ +K + I + Sbjct: 63 MAG-EKLALVKKTLEFVVSQLNEKDRLCLITFDTSVYLDFKLTPMTPMNKYQTLKIIKDI 121 Query: 285 DAEGSTNGGAGLELAYQQATKGFI--KGGINRILLATDGDFN-VGIDDPKSIESMVKKQ- 340 TN GL + K + +LL TDG N G+ + S K Sbjct: 122 SPGSMTNLCGGLMKGLCEVIDRADEEKNEVASVLLFTDGFANKGGLTNIYCSSSQTAKYT 181 Query: 341 ------RESGVTLSTFGVGNSNYNEAMMVRIADVGNGNYSYIDTLSEAQKVLNSEMRQML 394 + + ++ TFG G SN+N M+ I+D G+G Y YI+ + + + +L Sbjct: 182 IGIVGPKTADASIYTFGFG-SNHNAQMLKEISDAGSGMYYYIENVDMIAEAFGQCLGGLL 240 Query: 395 ITVAKDVKAQIEFNPAWVTEYRQIGYEKRQLRVEHFNNDNVDAGDIGAGKHITLLFELTL 454 TVA+ ++ +I + Q + ++ ++ GD+ + + ++ EL++ Sbjct: 241 STVAQGIQVEIMMEN----KVSIKKVHSNQPTEKQGSSIKINMGDLQSEESRDVVLELSI 296 Query: 455 NGQKASID-------KLRYAPDNKLAKSDKTKELAWLKIRWKYPQGKESQLVEFPLGPTI 507 + + D KL Y L L+ KES ++ Sbjct: 297 DSLDSPTDCQTLFNIKLNYFNVINECLESSNAVLTVLRPEKCEDHNKESSNIDLNKQVLR 356 Query: 508 NAPSEDMRFRAAVAAYGQKLRGSEYLNNTSWQQIKQWAQQAKGEDPQ 554 ++ M A G +++L + ED Q Sbjct: 357 INVTKAMNNAYEKAEEGDFSGAADFLEEQLL--VLNSVSSRLAEDDQ 401 >UniRef50_Q1DFU7 von Willebrand factor type A domain protein n=2 Tax=Cystobacterineae RepID=Q1DFU7_MYXXD Length = 422 Score = 218 bits (554), Expect = 6e-55, Method: Composition-based stats. Identities = 82/411 (19%), Positives = 167/411 (40%), Gaps = 24/411 (5%) Query: 171 SIPASKPIPFAMRYELAPAPWNEQRTLLKVDILAKDRKSEELPASNLVFLIDTSGSMISD 230 S P + A + A +++ A+ ++ + +L ++D SGSM Sbjct: 3 SKPEDGALNMAGKLSGAYVQTGPSEAFAWMELKARPAETGQRVPVSLALVLDRSGSMNG- 61 Query: 231 ERLPLIQSSLKLLVKELREQDNIAIVTYAGDSRIA-LPSISGSHKAEINAAIDSLDAEGS 289 ++L + + LV+ L+ +D +A + Y D R+ ++ + E+ I L +GS Sbjct: 62 QKLADARRAATELVQRLKPEDRLAFIDYGTDVRVQPSRRMTEEAREELLTLISGLQDDGS 121 Query: 290 TNGGAGLELAYQQATKGFIKGGINRILLATDGDFNVGIDDPKSIESMVKKQRESGVTLST 349 TN L+ A + ++R +L +DG GI + V++ R G+T+S Sbjct: 122 TNISGALDAAANALRPHMREYRVSRAILLSDGQPTTGIVSEPGLLDQVRQLRRDGITVSA 181 Query: 350 FGVGNSNYNEAMMVRIADVGNGNYSYIDTLSEAQKVLNSEMRQMLITVAKDVKAQIEFNP 409 GVG +Y E +M +A+ G G +ID + +V + E+ Q TVA+ V+ +++ P Sbjct: 182 LGVGR-DYQETLMRGMAEQGGGFSGFIDDSARLAEVFSRELDQATSTVARMVELRLDVPP 240 Query: 410 AWVTEYRQIGYEKRQLRVEHFNNDNVDAGDIGAGKHITLLFELTLNGQKAS------IDK 463 V + +G N V D+ + + + ++ +LTLN + + + Sbjct: 241 -EVQDVEVMGMAS----FREGNVLKVPLYDMASAQTVRVMAKLTLNTSRTAGALALLNAR 295 Query: 464 LRYAPDNKLAKSDKTKELAWLKIRWKYPQGKESQLVEFPLGPTINAPSEDMRFRAAVAAY 523 + Y + ++ +L ++ + +E E + ++ M VAA Sbjct: 296 VHYVDVARDLPTETALQL-TAEVTSDLDRVREYLDKEVRVHAVRAMGTQHM-----VAAA 349 Query: 524 GQKLRGSEYLNNTSWQQIKQWAQQAKGEDPQGYRAEFIRLIELADGVTDIS 574 + RG++ ++ + A+ G E + + S Sbjct: 350 EEMKRGNKAKASS----LLDNARTIFGTSADALSGELADVRRSQAALAGAS 396 >UniRef50_Q2BJ22 Putative uncharacterized protein n=1 Tax=Neptuniibacter caesariensis RepID=Q2BJ22_9GAMM Length = 445 Score = 217 bits (552), Expect = 1e-54, Method: Composition-based stats. Identities = 71/394 (18%), Positives = 157/394 (39%), Gaps = 35/394 (8%) Query: 194 QRTLLKVDILAKDRKSEE-LPASNLVFLIDTSGSMISDERLPLIQSSLKLLVKELREQDN 252 + +K+ + + + +N+ ++D SGSM ++L + + + + L + D Sbjct: 48 HKAFIKISLEGHKLEQTQARIPANIAIVLDKSGSMQG-DKLFRAKEAAIMAINRLSQNDI 106 Query: 253 IAIVTYAGDSRIALPSISGSHKAEINAAIDSLDAEGSTNGGAGLELAYQQATKGFIKGGI 312 +++V+Y + +P+ S I AI+ + A G+T AG+ + K + Sbjct: 107 VSVVSYDSRVNVVVPATKVSDTNTIARAINRIQANGNTALFAGVSKGANELRKFLDLNKV 166 Query: 313 NRILLATDGDFNVGIDDPKSIESMVKKQRESGVTLSTFGVGNSNYNEAMMVRIADVGNGN 372 NR++L +DG N+G P + + + G++++T G+G YNE +M ++A +GN Sbjct: 167 NRVILLSDGLANIGPSTPNELGKLGLSLAKEGMSVTTIGLGLG-YNEDLMTQLAGFSDGN 225 Query: 373 YSYIDTLSEAQKVLNSEMRQMLITVAKDVKAQIEFNPAWVTEYRQIGYEKRQLRVEHFNN 432 +++++ + +V E +L VA+ V I VT R +G E + + Sbjct: 226 HAFVENADDLARVFQYEFGDVLSVVAQGVDIHIRCLNG-VTPLRVLGREADI----NGSQ 280 Query: 433 DNVDAGDIGAGKHITLLFELTLNGQKASIDKLRYAPDNKLAKSDKTKELAWLKIRWKYPQ 492 + + + ++ E+ + Q + T EL + + + Sbjct: 281 VRTRLNQLYSEQEKFIILEVEVPSQ----------------QDKATVELVDVDVNYLDLF 324 Query: 493 GKESQLVEFPLGPTINAPSEDMRFR------AAVAAYGQKLRGSEYLNNTSWQQIKQ--- 543 K S+ + + + +++ A A + + I+ Sbjct: 325 NKRSEKLNTSVTAAFSRSETEVKDAIDTQTYEAAAEQVANELNRKAVQRRDQGDIEGAKS 384 Query: 544 -WAQQAKGEDPQGYRAEFIRLIEL-ADGVTDISQ 575 + A+ D G +L+E A+ + D + Sbjct: 385 ILKESAQYLDNLGQSLASPKLLEQKAEALQDAEE 418 >UniRef50_A6G857 Putative uncharacterized protein n=1 Tax=Plesiocystis pacifica SIR-1 RepID=A6G857_9DELT Length = 540 Score = 216 bits (550), Expect = 2e-54, Method: Composition-based stats. Identities = 83/466 (17%), Positives = 169/466 (36%), Gaps = 41/466 (8%) Query: 133 YANVRRFLNQGLLPPPDAVRVEEIVNYFPSDWDIKDKQSIPASKPIPFAMRYELAPAPWN 192 + R L +G +P P+ + + + Sbjct: 79 FGLFREILLEGGIPGPETIDDMGFFAEHQIELPEPTCGEDVCIHG---KLGVMGNMINGG 135 Query: 193 EQRTLLKVDILAKDRKSEELPASNLVFLIDTSGSMISDERLPLIQSSLKLLVKELREQDN 252 ++ D + P NL +D S SM E + ++ L + ++L +D Sbjct: 136 NCTVVVVGMNTPIDPAELDRPPLNLTIAVDLSKSMEG-EPIDRVRQGLLQMREQLEPEDR 194 Query: 253 IAIVTYAGDSRIALPSISGSHKAEINAAIDSLDAEGSTNGGAGLELAYQQATKGFIKGGI 312 + +V + ++++ + + E+ AI +L GSTN AGL A++Q +G Sbjct: 195 VTLVGFGDEAQVIVENA-DKDSVELATAIAALVPWGSTNLYAGLRTAFEQTDLYAQEGWQ 253 Query: 313 NRILLATDGDFNVGIDDPKSIESMVKKQRESGVTLSTFGVGNSNYNEAMMVRIADVGNGN 372 NR+LL +DG GI + IE + + G L+T G+GN +++ +M ++++G+G+ Sbjct: 254 NRVLLVSDGVPTTGIVNSDKIEGLAEAWSGMGYGLTTVGIGN-DFDIELMRNLSELGSGS 312 Query: 373 YSYIDTLSEAQKVLNSEMRQMLITVAKDVKAQIEFNPAWVTEYRQIGYEKRQLRVEHFNN 432 + Y++ +V + E++ + +A+DV + + R + K+ N Sbjct: 313 FYYVEDPDAVIEVFSEEVQAFTVPLAEDVIIDATVFEGY--DLRAVYGTKQVETW--GNQ 368 Query: 433 DNVDAGDIGAGKHITLLFELTLNGQKASIDKLRYAPDNKLAKSDKTKELAWLKIRWKYPQ 492 +D + E + + L + E+ L+ + P Sbjct: 369 ALIDIPILQIAHR-----EGASDNENGRRGGGGAMIFELLPTGESPGEVGRLEFEYTVPG 423 Query: 493 G----KESQLVEFPLGPTINAPSEDMRFRAAV----------AAYGQ---KLRGSEYLNN 535 ++ + PLGP P + AV + + +Y Sbjct: 424 TEEVVEQVVEISSPLGP-WELPEDGFFEADAVEKSFVMLNIYVGFEMAASRAAAGDYAGA 482 Query: 536 TSWQQ-----IKQWAQQAKGEDPQG---YRAEFIRLIELADGVTDI 573 + + ++ W + ED + Y FI +E G T+ Sbjct: 483 LTVLEPLVLSVEDWLSANEDEDIEDDLFYINLFIDNLEAQGGATNS 528 >UniRef50_A1U6Y4 Vault protein inter-alpha-trypsin domain protein n=2 Tax=Marinobacter RepID=A1U6Y4_MARAV Length = 712 Score = 216 bits (550), Expect = 2e-54, Method: Composition-based stats. Identities = 76/513 (14%), Positives = 154/513 (30%), Gaps = 50/513 (9%) Query: 21 PQPENKESQQQQPSTPTEQQVLAAQQAAIKEAEQSAAAAKALAQQEVQQYSDKQALQGRL 80 + ++ + + V + + A + + + Q Sbjct: 148 RAQARQNYEKAKQAGQKAATVEQNRPNLFTSRIANIAPGEEVTVEVQYQQPVNYRHGEFE 207 Query: 81 QEAPTFARAAKAKATHIANPGTARYQQFDDNPVKQVAQNPLATFSLDVDTGSYANVRRFL 140 PT +A P +A + + + ++ F++ D + + R + Sbjct: 208 LRLPTTLTPRYMPGAPVATPASAWQSGWSLPTTQVADADEISPFTVLPDDVNPGSHRATI 267 Query: 141 N----QGLLPPPDAVRVEEIVNYFPSDWDIKDKQSIPASKPIPFAMRYELAPAPWNE--- 193 GL + + +R+ A Sbjct: 268 QLDIEAGLPVDEVTSPSHPLQVELEGSRATVSPEQGQILMDRDVIVRWRPADNQAPTAAL 327 Query: 194 -----QRTLLKVDILAKDRKSEELPASNLVFLIDTSGSMISDERLPLIQSSLKLLVKELR 248 Q + ++ + ++ L+F+IDTSGSM + + +S+L + LR Sbjct: 328 FRQQWQGEDFLMAMVMPPATTGQVLRRELLFVIDTSGSMAGES-IRQARSALLRGLDTLR 386 Query: 249 EQDNIAIVTYAGDSRIALPSISGSH---KAEINAAIDSLDAEGSTNGGAGLELAYQQATK 305 D ++ + + ++ A + L A+G T L LA Sbjct: 387 PGDRFNVIQFNSQAHALYTQPVPANGHYLARARDYVQDLTADGGTEMAGALSLAMGM-DG 445 Query: 306 GFIKGGINRILLATDGDFNVGIDDPKSIESMVKKQRESGVTLSTFGVGNSNYNEAMMVRI 365 G + +++ TDG + + ++ ++ + L T +G++ N + Sbjct: 446 SESSGHVQQMVFMTDGA----VGNESALFDQIRTGLGNR-RLFTVAIGSAP-NMHFLREA 499 Query: 366 ADVGNGNYSYIDTLSEAQKVLNSEMRQMLITVAKDVKAQIEFNPAWVTEYRQIGYEKRQL 425 A G G Y+ + + +E K L M V DV+ Q N A + Sbjct: 500 ARWGRGQYTAVHSAAEVDKALGKLFAAMEAPVMTDVEVQWPGNAAQPVPAKP-------- 551 Query: 426 RVEHFNNDNVDAGDIGAGKHITLLF-------ELTLNGQKASIDKLRYAPDNKLAKSDKT 478 GD+ G+ + + ELT++G+ R + D A K Sbjct: 552 ------------GDLFHGQPLLQVVRGAPSEGELTVSGRLPGGRSWRTSLDLASAAPGKG 599 Query: 479 KELAWLKIRWKYPQGKESQLVEFPLGPTINAPS 511 + W + R P I S Sbjct: 600 LDRQWARGRIDAVMDSARLAGTEPDEAAIVELS 632 >UniRef50_Q7ULL3 Putative uncharacterized protein n=1 Tax=Rhodopirellula baltica RepID=Q7ULL3_RHOBA Length = 484 Score = 216 bits (550), Expect = 2e-54, Method: Composition-based stats. Identities = 79/430 (18%), Positives = 168/430 (39%), Gaps = 36/430 (8%) Query: 130 TGSYANVRRFLNQGLLPPPDAVRVEEIVNYFPSDWDIKDKQSIPASKPIPFAMRYELAPA 189 T R ++ P +R ++ + Q+ + + Sbjct: 2 TSEPCPARSLVHS--CPRSFPMRFSLLMLLVAAMTATPLHQASAEQVKLDVRL-VHPVMK 58 Query: 190 PWNEQRTLLKVDILAKDRKS-EELPASNLVFLIDTSGSMISDERLPLIQSSLKLLVKELR 248 +Q L++ + + KS EE P N+ ++D SGSM ++L + + + + L Sbjct: 59 AGEKQTNHLRIALTGFELKSAEERPPVNVCLVLDHSGSMSG-QKLARAKEAAEAAIDRLS 117 Query: 249 EQDNIAIVTYAGDSRIALPSISGSHKAEINAAIDSLDAEGSTNGGAGLELAYQQATKGFI 308 + D +++V Y + + +P+ + ++ I I + A ST AG+ + K Sbjct: 118 DDDIVSVVLYDSNVTVLVPATKATDRSSIKQKIRGIQAGSSTALFAGVSKGAAEVRKFLA 177 Query: 309 KGGINRILLATDGDFNVGIDDPKSIESMVKKQRESGVTLSTFGVGNSNYNEAMMVRIADV 368 +NR++L +DG NVG P+ +E + + + +++ST G+G+ YNE +MV +A V Sbjct: 178 DEQVNRVILLSDGLANVGPKSPQELEGLGRSLMKEAISVSTLGLGSG-YNEDLMVALASV 236 Query: 369 GNGNYSYIDTLSEAQKVLNSEMRQMLITVAKDVKAQIEFNPAWVTEYRQIGYEKRQLRVE 428 G GN+++I+ V N E +L VA + + ++ V R IG E Sbjct: 237 GGGNHAFIEDADSLVSVFNQEFDGLLSVVANEFEIVVKL-DESVRPVRMIGSEGDI---- 291 Query: 429 HFNNDNVDAGDIGAGKHITLLFELTLNGQKASIDKLRYAPDNKLAKSDKTKELAWLKIRW 488 + + A + + E ++ D T++LA + +++ Sbjct: 292 EGQTIRIPLAQLYANQERYFIVETEVSPG----------------TEDSTRDLAEVTVQY 335 Query: 489 KYPQGKESQLVEFPLGPTINAPSEDMRFRAAVAAYG--------QKLRGSEYLNNTS-WQ 539 + Q + + + + + + + + Y + R + L + + Sbjct: 336 RNLQTETKEKLTSSIQVRFSDEEKIVEEAKDMEVYAYCSLQITTELNREATALRDAGQVK 395 Query: 540 QIKQWAQQAK 549 + + Q Sbjct: 396 EAQSLLNQNA 405 >UniRef50_Q8H923 Putative uncharacterized protein OSJNBa0071K18.17 n=5 Tax=Poaceae RepID=Q8H923_ORYSJ Length = 606 Score = 216 bits (549), Expect = 2e-54, Method: Composition-based stats. Identities = 59/312 (18%), Positives = 124/312 (39%), Gaps = 20/312 (6%) Query: 214 ASNLVFLIDTSGSMISDERLPLIQSSLKLLVKELREQDNIAIVTYAGDS--RIALPSISG 271 +LV ++D SGSM +L L++ ++ ++ L D + +V+++ ++ R L +S Sbjct: 142 PLDLVTVLDVSGSMAGR-KLALVKKAMGFVIDNLGPADRLCVVSFSTEASRRTRLLRMSE 200 Query: 272 SHKAEINAAIDSLDAEGSTNGGAGLELAYQQATKGFIKGGINRILLATDGDFNVGI---D 328 KA A++SL + +TN G GL +A + K ++ ++L +DG + + Sbjct: 201 VGKATAKRAVESLVDDSATNIGDGLRVAGRVLGDRRHKNAVSSVILLSDGKDSYVVPRRG 260 Query: 329 DPKSIESMV------KKQRESGVTLSTFGVGNSNYNEAMMVRIADVGNGNYSYIDTLSEA 382 + S +V R + TFG G ++++ A M IA+ G +S+++ + Sbjct: 261 NGMSYMDLVPPSFASSGGRGQLAPIHTFGFG-ADHDAAAMNTIAESTGGTFSFVENEAAI 319 Query: 383 QKVLNSEMRQMLITVAKDVKAQIEFNPAWVTEYRQIGYEKRQLRVEHFNNDNVDAGDIGA 442 Q + +L +D + + + V +V+ G++ A Sbjct: 320 QDSFAQCIGGLLSVAVQDARIAVACSSPGVLVREIKSGRYESRVDADGRAASVEVGELYA 379 Query: 443 GKHITLLFELTLNGQKASIDKLRYAPDNKLAKSDKTKELAWLKIRWKYPQGKESQLVEFP 502 + L + + +A+ D + + + + R E +V P Sbjct: 380 DEERRFLLFINVPIAEATEDATQLIKLSCTYRD-------TVTGRTIDVAAGEDAVVRRP 432 Query: 503 LGPTINAPSEDM 514 L + M Sbjct: 433 LEVSAADQEVSM 444 >UniRef50_A9QZI4 von Willebrand factor type A domain protein n=26 Tax=Gammaproteobacteria RepID=A9QZI4_YERPG Length = 472 Score = 216 bits (549), Expect = 2e-54, Method: Composition-based stats. Identities = 74/362 (20%), Positives = 146/362 (40%), Gaps = 29/362 (8%) Query: 192 NEQRTLLKVDILAKDRKSEELPASNLVFLIDTSGSMISDERLPLIQSSLKLLVKELREQD 251 +E + LK+ + + S NL +ID S SM ER+ + L V L D Sbjct: 73 SEDKNYLKISLTGFNLDSTRRSPINLALVIDRSTSMSG-ERIEKAREEAILAVNMLNITD 131 Query: 252 NIAIVTYAGDSRIALPSISGSHKAEINAAIDS-LDAEGSTNGGAGLELAYQQATKGFIKG 310 +++V Y + + +P+ + K + A+I + G T AG+ + Q K + Sbjct: 132 TLSVVAYDNHAEVIIPATKVTDKPALIASIQQHIHPRGMTALFAGVSMGIGQVDKHLNRE 191 Query: 311 GINRILLATDGDFNVGIDDPKSIESMVKKQRESGVTLSTFGVGNSNYNEAMMVRIADVGN 370 +NRI+L +DG N G + + + + G+ ++T G+G +YNE +M IA + Sbjct: 192 QVNRIILISDGQANTGPTSISELSDLARMAAKKGIAITTIGLGQ-DYNEDLMTAIAGYSD 250 Query: 371 GNYSYIDTLSEAQKVLNSEMRQMLITVAKDVKAQIEFNPAWVTEYRQIGYEKRQLRVEHF 430 GN++++ ++ +K E + ++ VA+D+ QI+ V R +G + L Sbjct: 251 GNHTFVANSADLEKAFTKEFQDVMSVVAQDIVVQIKTGDK-VKPVRLLGRDGDIL----G 305 Query: 431 NNDNVDAGDIGAGKHITLLFELTLNGQKASIDKLRYAPDNKLAKSDKTKELAWLKIRWKY 490 N NV + + + +L E+ + K+LA + I + Sbjct: 306 NTVNVKLNQLYSNQEKYILLEVIPEKG----------------TDKQQKDLADVSISYLN 349 Query: 491 PQGKESQLVEFPLGPTINAPSEDMRFRAAVAAYGQKLRGSE-YLNNTSWQQIKQWAQQAK 549 K+ + + + + E + + L SE + + + + Sbjct: 350 LSSKKQDQINERVTVSYSQSVEKVNDAV----QEEVLAESEIQKTALANDEAIKLIDAGR 405 Query: 550 GE 551 + Sbjct: 406 KD 407 >UniRef50_C7YL43 Putative uncharacterized protein n=2 Tax=Nectriaceae RepID=C7YL43_NECH7 Length = 764 Score = 215 bits (548), Expect = 3e-54, Method: Composition-based stats. Identities = 62/369 (16%), Positives = 128/369 (34%), Gaps = 40/369 (10%) Query: 155 EIVNYFPSDWDIKDKQSIPASKPIPFAMRYELAPAPWNE--QRTLLKVDILAKDRKSEEL 212 + ++ P++ P + + A R L V + S E+ Sbjct: 21 PFWSTLTGGKKSEEIVGGPSAIQPPVVIASDDATLHLEPVPDRKGLIVKVQPPTAPSAEI 80 Query: 213 P--ASNLVFLIDTSGSMISD--------------ERLPLIQSSLKLLVKELREQDNIAIV 256 P ++V +ID SGSM L L + + + +++ + E D + IV Sbjct: 81 PHVPCDIVLVIDVSGSMAGAAPVPGEETNESTGLSILDLTKHAARTIIETMNESDRLGIV 140 Query: 257 TYAGDSRIALPS--ISGSHKAEINAAIDSLDAEGSTNGGAGLELAYQQATKGFIKGGINR 314 T+A +++ P ++ +K + S+ +TN GL + + Sbjct: 141 TFASKAKVVQPLLSMTSENKERSRGNVTSMRPIDATNLWHGLLEGIKLFKNVKSSN-VPA 199 Query: 315 ILLATDGDFNVGIDDPKSIESMVKKQRESGVTLSTFGVGNSNYNEAMMVRIADVGNGNYS 374 I++ TDG N ++ ++ + ++ TFG G + ++ IA++G GNY+ Sbjct: 200 IMVLTDGMPNH-MNPAAGFVPKLRAMGQLPASIHTFGFG-YHLRSGLLKSIAEIGGGNYA 257 Query: 375 YIDTLSEAQKVLNSEMRQMLITVAKDVKA------QIEFNPAWVTEYRQIGYEKRQLRVE 428 +I V + + T A ++E T Q + + Sbjct: 258 FIPDAGMIGTVFVHAVANLQSTFATRAVLKLTYSKELELEETTGTSVEQQPPQPVNGSDD 317 Query: 429 HFNNDNVDAGDIGAGKHITLLFELTLNGQKASIDKLRYAPDNKLAKSDKTKELAWLKIRW 488 ++ G+I G+ + + + L + D + L I + Sbjct: 318 GEMELTLNLGNIQYGQSRDIFLRVN-----------NLSKLESLKEKDASSSLVNASIAY 366 Query: 489 KYPQGKESQ 497 P G+ + Sbjct: 367 LKPGGEPTT 375 >UniRef50_C7RNW6 von Willebrand factor type A n=1 Tax=Candidatus Accumulibacter phosphatis clade IIA str. UW-1 RepID=C7RNW6_9PROT Length = 452 Score = 215 bits (548), Expect = 3e-54, Method: Composition-based stats. Identities = 82/387 (21%), Positives = 148/387 (38%), Gaps = 26/387 (6%) Query: 194 QRTLLKVDILAKDRKSEE---LPASNLVFLIDTSGSMISDERLPLIQSSLKLLVKELREQ 250 Q+ + + + A D + E +L +ID SGSM L K + +L Sbjct: 23 QKLPVLIRVQAPDPLATEKKARKPYHLALVIDRSGSMSGP-PLAEAVRCAKHIADQLEPT 81 Query: 251 DNIAIVTYAGDSRIALPSISGSHKAEINAAIDSLDAEGSTNGGAGLELAYQQATKGFIKG 310 D ++V + + +P + ++ A+ + + GSTN G + + Sbjct: 82 DIASLVVFDDRVQTLVPPRPVGDRQALHLALSRVHSGGSTNLHGGWQAGADGLLPAAGQA 141 Query: 311 GINRILLATDGDFNVG-IDDPKSIESMVKKQRESGVTLSTFGVGNSNYNEAMMVRIADVG 369 + R++L +DG+ NVG I DP I ++ + E GV+ ST+G+G S++NE +MV +A G Sbjct: 142 ALARVILLSDGNANVGEITDPAGIAALCAQAAERGVSTSTYGLG-SHFNEDLMVEMAKRG 200 Query: 370 NGNYSYIDTLSEAQKVLNSEMRQMLITVAKDVKAQIEFNPAWVTEYRQIGYEKRQLRVEH 429 GN+ Y DT ++ + +E + A+ V+ + P I L Sbjct: 201 GGNHYYGDTAADLFEPFAAEFDFISALCARHVRLSLAAAPGVG-----IRLLNDYLVEGD 255 Query: 430 FNNDNVDAGDIGAGKHITLLFELTLNGQKASIDKLRYAPDNKLAKSDKTKELAWLKIRWK 489 + DI G L EL + A + A + + + LA+ Sbjct: 256 AGFPVIRLPDIAFGAEAWALVELQIPVGLAGEGAGQLLQAAVTASTPEGEPLAFADATLT 315 Query: 490 YPQGKES---QLVEFPLGPTINAPSEDMRF---RAAVAAYGQKL---------RGSEYLN 534 P + L+ PL + + + A A +G R Sbjct: 316 LPAMSPAAWETLLSDPLVLSRQSELAAGKLLEQARAAAEHGDWNVVERLLAEARQRFADQ 375 Query: 535 NTSWQQIKQWAQQAKGEDPQGYRAEFI 561 + ++ A+ A+ D +R E + Sbjct: 376 PWLIEVLESMAELARSRDTARFRKEAL 402 >UniRef50_B6TZ81 Protein binding protein n=9 Tax=Magnoliophyta RepID=B6TZ81_MAIZE Length = 516 Score = 215 bits (548), Expect = 3e-54, Method: Composition-based stats. Identities = 76/415 (18%), Positives = 155/415 (37%), Gaps = 34/415 (8%) Query: 169 KQSIPASKPIPFAMRYELAPAPWNEQRTLLKVDILAKDRKSEELPASNLVFLIDTSGSMI 228 + P + AP E + +++ D + + +LV ++D SGSM Sbjct: 17 PRPTPIVPGRVQLVSKNNNMAPLEENTQKVLLELTGGD-STSDRSGLDLVAVLDVSGSMQ 75 Query: 229 SDERLPLIQSSLKLLVKELREQDNIAIVTYAGDSRIALPS--ISGSHKAEINAAIDSLDA 286 E++ +++++K +VK+L D ++IVT+ + P ++ + ++ ID+L Sbjct: 76 G-EKIEKMKTAMKFVVKKLSSIDRLSIVTFLDTANRICPLQQVTEDSQPQLLKLIDALQP 134 Query: 287 EGSTNGGAGLELAYQQATKGF-IKGGINRILLATDGDFNVGIDDPKSIESMVKKQRESGV 345 G+TN GL+ + G + ++L +DG N G + V Sbjct: 135 GGNTNISDGLQTGLKVLADRKLSSGRVVGVMLMSDGQQNRGEP--------AANVKIGNV 186 Query: 346 TLSTFGVGNSNYNEAMMVRIADVG-NGNYSYIDTLSEAQKVLNSEMRQMLITVAKDVKAQ 404 + TFG G ++Y+ ++ +A G +S ++ ++ + + +L V +D+ Sbjct: 187 PVYTFGFG-ADYDPTVLNAVARNSMGGTFSVVNDVNLLSMAFSQCLAGLLTVVVQDLTVT 245 Query: 405 IEFNPAWVTEYRQIGYEKRQLRVEHFNNDNVDAGDIGAGKHITLLFELTLNG--QKASID 462 + T + Q + V GD+ + + ++ +L L D Sbjct: 246 VARIEDESTIQKVAAGNYPQTPDADAGSVTVAFGDLYSKEVRKVIVDLLLPAIDSDRGAD 305 Query: 463 KLRYAPDNKLAKSDKTKELAWLKIRWKYPQGKESQLVEFPLGPTINAPSEDMRFRAA-VA 521 L K A A + +R S P ++ +E+ R + A + Sbjct: 306 ILEVTYSYKTAGKLFDAPPATVTVR-------RSGTAFPADDPPVDVQTEEARLKTATMI 358 Query: 522 AYGQKLRGSEYLNNTSWQQIKQWAQQAKGE-----DP--QGYRAEFIRLIELADG 569 + + + L + AQ A + DP R E L++L Sbjct: 359 QQARTMADGKKLGDAR--DKLAEAQNALEDVVAQSDPLLDALRTELQELLKLMKS 411 >UniRef50_A0E9B3 Chromosome undetermined scaffold_84, whole genome shotgun sequence n=1 Tax=Paramecium tetraurelia RepID=A0E9B3_PARTE Length = 603 Score = 215 bits (548), Expect = 3e-54, Method: Composition-based stats. Identities = 71/353 (20%), Positives = 141/353 (39%), Gaps = 15/353 (4%) Query: 176 KPIPFAMRYELAPAPWNEQRTLLKVDILAKDRKSEELPASNLVFLIDTSGSMISDERLPL 235 + + + E P N+ + K + + P +LV ++D SGSMI ++ L Sbjct: 81 QNLDIRLYPERDPIKNNKFVPAVLSLKTKKVSNNLDRPPIDLVCVVDVSGSMIGR-KINL 139 Query: 236 IQSSLKLLVKELREQDNIAIVTYAGDSRIALPSI--SGSHKAEINAAIDSLDAEGSTNGG 293 ++ SL+ L+K L +D I I+ + + I I + +K + AI L STN Sbjct: 140 VKDSLRYLMKILGPEDRICIIVFTTVAHIVTSFIRNTQENKPLLKKAILELKGLASTNIS 199 Query: 294 AGLELAYQQATKGFIKGGINRILLATDGDFNVGIDDPKSIESMVKKQRESGVTLSTFGVG 353 G+ A K ++ I L +DG + + + + + + E + TFG G Sbjct: 200 DGMNKALWMLKNRKYKNPVSCIFLLSDGQDDYKGAEQRVFDQLQLLKIEEKFVIHTFGYG 259 Query: 354 NSNYNEAMMVRIADVGNGNYSYIDTLSEAQKVLNSEMRQMLITVAKDVKAQIEFNPAWVT 413 +++ +M +IA GN+ YID +++A M ML A++V ++ N + Sbjct: 260 Q-DHDAYVMNQIAKYREGNFYYIDNINKASDYFILAMSGMLSIYAQNVSINLKSNDCEIV 318 Query: 414 EYRQIGYEKRQLRVEHFNNDNVDAGDIGAGKHITLLFELTLNGQKASIDKLRYAPDNKLA 473 + E + + NN + + G+ +FEL + Y ++ Sbjct: 319 ---KAFGEGQVWYKQDANNYKIQLNYLLEGESKDFVFELFVKED--------YTLNHINL 367 Query: 474 KSDKTKELAWLKIRWKYPQGKESQLVEFPLGPTINAPSEDMRFRAAVAAYGQK 526 + + +L + ++ + + + G + + A A G+ Sbjct: 368 QIEINGQLLKQNLEFRKDSNFQINVSQEKEGQLNEHVEINYQRAKAGHAIGEA 420 >UniRef50_B4X134 Vault protein inter-alpha-trypsin n=1 Tax=Alcanivorax sp. DG881 RepID=B4X134_9GAMM Length = 657 Score = 215 bits (547), Expect = 4e-54, Method: Composition-based stats. Identities = 76/489 (15%), Positives = 161/489 (32%), Gaps = 63/489 (12%) Query: 22 QPENKESQQQQPSTPTEQQVLAAQQAAIKEAEQSAAAAKALAQQEVQQYSDKQ-----AL 76 K +Q + S V + A + A + + + Q + L Sbjct: 110 AEAEKTYRQARASGKKASLVSQQRPNLFTTAVANIAPGETVQVELHYQQTLNVDGHRFQL 169 Query: 77 QGRLQEAPTFARAAKAKATHIANPGTARYQQFDDNPVKQVAQNPLATFSLDVDTG----- 131 + L P F +A T + A A+ +D+D G Sbjct: 170 RLPLTLTPRFTPPTEAPHTLDSLLRNTVAAPGG------TADAGTASVHIDLDAGARLAT 223 Query: 132 --SYANVRRFLNQGLLPPPDAVRVEEIVNYFPSDWDIKDKQSIPASKPIPFAMRYELAPA 189 S ++ + G + D D+ + + +E Sbjct: 224 LGSPSHAIHYQRHGRR-----YTITPKAGAIAMDRDLLLNWELEDTGEPLVTRFHE---- 274 Query: 190 PWNEQRTLLKVDILAKDRKSEELPASNLVFLIDTSGSMISDERLPLIQSSLKLLVKELRE 249 + + ++ +F+ID+SGSM + ++SL L ++ L+ Sbjct: 275 -EIDGEHYALLMVVPPKTGQVTALPRETLFIIDSSGSM-GGAPMRQAKASLHLALQRLKP 332 Query: 250 QDNIAIVTYAGDSRIALP---SISGSHKAEINAAIDSLDAEGSTNGGAGLELAYQQATKG 306 D I + + ++S + + + +D L A G T+ L ++ Sbjct: 333 GDRFNITDFDSQHTLLFETPVTVSDNSRQQAQDFVDGLQASGGTHMLPALSA---TLSQP 389 Query: 307 FIKGGINRILLATDGDFNVGIDDPKSIESMVKKQRESGVTLSTFGVGNSNYNEAMMVRIA 366 G + +++ TDG + + I + +Q L T G+G++ N M R A Sbjct: 390 ASDGYLRQVIFITDGA----VGNESGIFRALHQQLGE-ARLFTVGIGSAP-NSHFMTRAA 443 Query: 367 DVGNGNYSYIDTLSEAQKVLNSEMRQMLITVAKDVKAQIEFNPAWVTEYRQIGYEKRQLR 426 G G+++YI+ ++ Q+ +++ R++ + ++ Q++ V E Sbjct: 444 QFGRGSFTYINDQNQVQQGMDTLFRRLESPLMRN--LQVQLPDGIVAE------------ 489 Query: 427 VEHFNNDNVDAGDIGAGKHITLLFELTLNGQK---ASIDKLRYAPDNKLAKSDKTKELAW 483 D+ AG+ + + +L+ Q+ + + L ++ A Sbjct: 490 -----RWPQKLPDLYAGEPLLVAMKLSAPPQQITVSGYSTQHWQQPVTLPTNNNHPGTAS 544 Query: 484 LKIRWKYPQ 492 L R K Sbjct: 545 LWARRKIAD 553 >UniRef50_C1XMC3 Uncharacterized protein containing a von Willebrand factor type A (VWA) domain n=1 Tax=Meiothermus ruber DSM 1279 RepID=C1XMC3_MEIRU Length = 412 Score = 214 bits (544), Expect = 8e-54, Method: Composition-based stats. Identities = 77/367 (20%), Positives = 144/367 (39%), Gaps = 18/367 (4%) Query: 166 IKDKQSIPASKPIPFAMRYELAPAPWNEQRTLLKVDILAKDRKSEELPASNLVFLIDTSG 225 D ++ ++ Q+ LL++ + E P NL ++D SG Sbjct: 2 HPDSSPNARPHLDLIPLKPGVSATRPTRQQVLLRIHTPTPQARP-ERPLLNLALVLDRSG 60 Query: 226 SMISDERLPLIQSSLKLLVKELREQDNIAIVTYAGDSRIALPSISGSH-KAEINAAIDSL 284 SM +L + + V L +D +A+V Y + +PS + +A I I ++ Sbjct: 61 SM-GGSKLKYTKEAAIYAVHNLLPEDRVAVVIYDDAVEVLVPSTPVADGRAAIANLIRTI 119 Query: 285 DAEGSTNGGAGLELAYQQATKGFIKGGINRILLATDGDFNVGIDDPKSIESMVKKQRESG 344 GST AG Q G +NR++L +DG N G +P I V++ G Sbjct: 120 RTGGSTALHAGWLEGATQVAAYQEAGRLNRVVLLSDGLANRGETNPGVIAEQVRELARRG 179 Query: 345 VTLSTFGVGNSNYNEAMMVRIADVGNGNYSYIDTLSEAQKVLNSEMRQMLITVAKDVKAQ 404 V+ ST GVG +YNE +M +AD G GNY +I++ ++ ++ E+ + T+ V+ Sbjct: 180 VSTSTLGVGL-DYNEDLMTTMADAGEGNYYFIESPADLPRIFAQELAGLAGTLGTRVRLW 238 Query: 405 IEFNPAWVTEYRQIGYEKRQLRVEHFNNDNVDAGDIGAGKHITLLFELTLNGQKASIDKL 464 + +E + ++ AG + L EL + + +L Sbjct: 239 LRPGDG--------SRAWLFNDLEQDPSGAYVLPNLVAGIPLEFLLELEAPAGREASLRL 290 Query: 465 RYAPDNKLAKSDKTKELAWLKIRWKYPQGKESQLVEFPLGPTINAPSEDMRFRAAVAAYG 524 + + ++ + + L P +E++ P + A + + A Sbjct: 291 ELDWETPEGQRERLEAVLRL------PVLEEAEFERLQPHPDVAAMTAKLEATRARQRAM 344 Query: 525 QKLRGSE 531 + L + Sbjct: 345 EALADGD 351 >UniRef50_B5JSY7 Inter-alpha-trypsin inhibitor domain protein n=1 Tax=gamma proteobacterium HTCC5015 RepID=B5JSY7_9GAMM Length = 670 Score = 214 bits (544), Expect = 8e-54, Method: Composition-based stats. Identities = 85/526 (16%), Positives = 165/526 (31%), Gaps = 70/526 (13%) Query: 22 QPENKESQQQQPSTPTEQQVLAAQQAAIKEAEQSAAAAKALAQQEVQQYSDKQALQGRLQ 81 Q + V + + A + + + + + Sbjct: 129 AQAKAIYQDAKKQGKRAALVEQQRPNLFTSKVANIAPGETIHVELRYTEALAIDGREFSL 188 Query: 82 EAPTFARAAKAKATHIANPGTARYQQFDDNPVKQVAQ----NPLATFSLDVDTG------ 131 PT T +P + + + P+ + + LA ++D+D G Sbjct: 189 RLPT-------TMTSRFHPQESSIKPVEQGPIVPSSAVGQSSHLADITVDIDGGWPIQNI 241 Query: 132 ---SYANVRRFLNQGLLPPPDAVRVEEIVNYFPSDWDIKDKQSIPASKPIPFAMRYELAP 188 S+ V R L +G + + D D+ + + A+ E Sbjct: 242 ESPSHPFVERSLGRGYRVHMGS----SFSDKVAMDQDVVLRWQLDPVASASGAVFSE--- 294 Query: 189 APWNEQRTLLKVDILAKDRKSEELPASNLVFLIDTSGSMISDERLPLIQSSLKLLVKELR 248 + + L + + S +VF+IDTSGSM +R+ + +L V+ L Sbjct: 295 -EYKGEHYALVMLRTPDEMTSGPRMPREVVFVIDTSGSMAG-QRMYHAKQALSQAVERLS 352 Query: 249 EQDNIAIVTYAGDSRIALPSI---SGSHKAEINAAIDSLDAEGSTNGGAGLELAYQQATK 305 D +V + S+ S + + + L G T +E A Sbjct: 353 PDDRFNVVEFNNQHSRLFSSMRSASAINVKQALNWVGRLQGGGGTMMLPAVEDALSV--- 409 Query: 306 GFIKGGINRILLATDGDFNVGIDDPKSIESMVKKQRESGVTLSTFGVGNSNYNEAMMVRI 365 + +++L TD + + I +V++QR+ G L T G+G S N ++ + Sbjct: 410 RSDPAYLRQVILITD----ASVGNEAEILRVVERQRK-GARLFTVGIGVSP-NSYLLRKA 463 Query: 366 ADVGNGNYSYIDTLSEAQKVLNSEMRQMLITVAKDVKAQIEFNPAWVTEYRQIGYEKRQL 425 A VG G+Y YI + E + + ++ V K + I+ E Sbjct: 464 AQVGQGDYVYIASGQEVKARMQRLFAKLENPVLK--QLNIDLPEGAEAEVWPNP------ 515 Query: 426 RVEHFNNDNVDAGDIGAGKHITLLFELTLNGQK---ASIDKLRYAPDNKLAKSDKTKELA 482 D+ G+ + L +L + + L + + + Sbjct: 516 -----------LPDLYHGRPLYLAMKLDKPIDHLVLKAQTDRLWQQRIALPEPTEATGIH 564 Query: 483 WLKIRWKYPQ-------GKESQLVEFPLGPTINAPSEDMRFRAAVA 521 L R K GK + V + + + + VA Sbjct: 565 TLWAREKIAAQLDGLRQGKTQEEVRQAVLAVALEHAVMSPYTSFVA 610 >UniRef50_B8HNU4 von Willebrand factor type A n=4 Tax=Chroococcales RepID=B8HNU4_CYAP4 Length = 421 Score = 213 bits (542), Expect = 2e-53, Method: Composition-based stats. Identities = 77/383 (20%), Positives = 148/383 (38%), Gaps = 36/383 (9%) Query: 189 APWNEQRTLLKVDILAKDRKSEEL-PASNLVFLIDTSGSMISDERLPLIQSSLKLLVKEL 247 +P + L++ + A + S E NL ++D SGSM + L ++ + + LV L Sbjct: 15 SPTRVSQRQLEISVAAIAQASGERNAPLNLGLILDHSGSMAG-QPLETVKRAAQKLVDRL 73 Query: 248 REQDNIAIVTYAGDSRIALPSISGSHKAEINAAIDSLDAEGSTNGGAGLELAYQQATKGF 307 D +A++ + +++ +P+ + + +I I L A G T GL+L + Sbjct: 74 LPSDRLAVIVFDHVAKVLIPNQPVTDRDKIKTRISHLAAMGGTAIDEGLQLGLTELIAAK 133 Query: 308 IKGGINRILLATDGDFNVGIDDPKSIESMVKKQRESGVTLSTFGVGNSNYNEAMMVRIAD 367 G I++I L TDG+ G + + ++ + +TL+T G G ++N+ ++ +IAD Sbjct: 134 A-GAISQIFLLTDGENEHG--NNSRCLQLAEEAAKENITLNTLGFG-YHWNQDVLEQIAD 189 Query: 368 VGNGNYSYIDTLSEAQKVLNSEMRQMLITVAKDVKAQIEFNPAW----VTEYRQIGYEKR 423 G+ +I+ + Q++ + + +P + Q+ Sbjct: 190 AAGGSLMFIEYPQDVLIGFERLFNQIISVGFTNAHLHLSLSPGVRLANLKPIAQVAPATI 249 Query: 424 QLRVE-HFNNDNVDAGDIGAGKHITLLFELTLNGQKASIDKLRYAPDNKLAKSDKTKELA 482 L N V GD+ TLL L ++ P + LA Sbjct: 250 DLPHGMEGNTAIVRLGDLLTDTPRTLLANLYIDP-----------PTVAHPLPSPPETLA 298 Query: 483 WLKIRWKYPQGKESQLVEFPLGPTI-------NAPSEDMRFRA-AVAAYGQ------KLR 528 L+IR+ P + + L P+ + P+ ++ A+A Y Q KL Sbjct: 299 TLQIRYDDPTQEHTGLRSEPIAVLAHRCADQESQPNPQVQKSILALAKYRQTQLAEAKLH 358 Query: 529 GSEYLNNTSWQQIKQWAQQAKGE 551 + + I G+ Sbjct: 359 QGDRAGAATMLHIAAQTALQMGD 381 >UniRef50_Q1YZ74 Inter-alpha-trypsin inhibitor domain protein n=1 Tax=Photobacterium profundum 3TCK RepID=Q1YZ74_PHOPR Length = 714 Score = 213 bits (541), Expect = 2e-53, Method: Composition-based stats. Identities = 81/501 (16%), Positives = 183/501 (36%), Gaps = 43/501 (8%) Query: 24 ENKESQQQQPSTPTEQQVLAAQQAAIKEAEQSAAAAKALAQQEVQQYSDKQALQGRLQEA 83 K+ + Q + V + + A +++ + Q + Sbjct: 139 AKKQYEAAQQAGVKASLVEQHRPNIFSTQVANIAPDESVTVEIEYQEAVLYRDGEFSLRF 198 Query: 84 PTFARAAKAKATHI-ANPGTARYQQFDDNPVKQVAQNPLATFSLDVDTGSYANVRRFLNQ 142 PT + P F ++ + +P FSL++D N + Sbjct: 199 PTVVAPRYIPVVPLNKVPDVNEITPF----LRDLQDDPTLPFSLNIDL----NAGLPIAV 250 Query: 143 GLLPPPDAVRVEEIVNYFP--------SDWDIKDKQSIPASKPIPFAMRYELAPAPWNEQ 194 P + + +++ +D D+ A+ A+ + Sbjct: 251 INTPSHAFTQQKLSEDHYILSLIQPDIADRDVVLSWRPKATDLPSTALFTQHVEGQGYGL 310 Query: 195 --RTLLKVDILAKDRKSEELPASNLVFLIDTSGSMISDERLPLIQSSLKLLVKELREQDN 252 +V+ S L ++ F++D SGSM + + + +L+ +++L+ +D+ Sbjct: 311 LLTMPPQVNHQVNSTTSSALFHQSVTFVLDISGSMYGES-IEQAKQALRYGLQQLQPEDS 369 Query: 253 IAIVTYAGDS---RIALPSISGSHKAEINAAIDSLDAEGSTNGGAGLELAYQQATKGFIK 309 IVT+ ++ L ++ S +D LDA+G T A L+ A+ T + Sbjct: 370 FNIVTFNHEAMLYSEQLLPVTSSTITRALRFVDGLDADGGTEMAAALKAAFSIKTHDQLN 429 Query: 310 GG--INRILLATDGDFNVGIDDPKSIESMVKKQRESGVTLSTFGVGNSNYNEAMMVRIAD 367 +N+I+ TDG + + ++ ++++Q L T G+G++ N M R A Sbjct: 430 STRWLNQIVFITDG----SVGNESALFDLIEQQLVDR-RLFTVGIGSAP-NSYFMTRAAM 483 Query: 368 VGNGNYSYIDTLSEAQKVLNSEMRQMLITVAKDVKA------QIEFNPAWVTEYRQIGYE 421 G G Y+YI + E + ++ V +D+K +++ P V + Q Sbjct: 484 KGKGTYTYIGDVKEVNTKMRLLFSKISQPVMRDIKLAWSDGRSVDYWPNPVPDLYQQEPL 543 Query: 422 KRQLRVEHFNNDNVDAGDIGAGKHITLLFELTLNGQKA-SIDKLRYAPDNKLAKSDKTKE 480 + ++ A I G+ + + ++ + +ID+ + P L + Sbjct: 544 QVSFKIPDN-----AANLIITGQQVDHEWRQDVDIHQGLAIDEKQPQPRIGLDIIWARNQ 598 Query: 481 LAWLKIRWKYPQGKESQLVEF 501 ++ +++ ++ + +E Sbjct: 599 ISSIQMNPAISMDEKKKHIEL 619 >UniRef50_C5YMJ6 Putative uncharacterized protein Sb07g002215 (Fragment) n=1 Tax=Sorghum bicolor RepID=C5YMJ6_SORBI Length = 423 Score = 212 bits (540), Expect = 3e-53, Method: Composition-based stats. Identities = 77/415 (18%), Positives = 141/415 (33%), Gaps = 75/415 (18%) Query: 147 PPDAVRVEEIVNYFPSDWDIKDKQSIP-------ASKPIPFAMRYELAPAPWNEQRTLLK 199 + +E + Y D P A+ + E R Sbjct: 41 EAPSAAIEPM--YGDDDPVEPTPAHAPLGGTGRSAAGLLVLKTHCEYPALAKATARDGFG 98 Query: 200 VDILAKDRKSEELP-----------ASNLVFLIDTSGSMISDERLPLIQSSLKLLVKELR 248 V + AK S P +LV ++D SGSM +++ ++ ++ L+ L Sbjct: 99 VLVHAKAPSSVAAPESSAAAAASRAPLDLVTVLDVSGSMAG-KKMERVKRAMGFLIDNLG 157 Query: 249 EQDNIAIVTYAGDSRIA--LPSISGSHKAEINAAIDSLDAEGSTNGGAGLELAYQQATKG 306 D +++V ++ D+R L +S KA A++SL A GSTN GL++A Sbjct: 158 SDDRLSVVAFSTDARRIIRLTRMSDDGKAAAKRAVESLAASGSTNIRGGLDVAAMVLDGR 217 Query: 307 FIKGGINRILLATDGDFNVGIDDPKSIESMVKK------------------QRESG---- 344 K + ++L +DG N + S V K QR +G Sbjct: 218 RHKNAVASVILLSDGQDNQSMHHEYLPTSWVPKHSPAFSKGGYDVLVPPSFQRTAGGDHR 277 Query: 345 -VTLSTFGVGNSNYNEAMMVRIADVGNGNYSYIDTLSEAQKVLNSEMRQMLITVAKDVKA 403 VT+ TFG G +++ A M I++V +S+I+ + Q + +L + + Sbjct: 278 CVTVHTFGFGI-DHDAAAMHYISEVTGSTFSFIENHAVIQDAFARCIGGLLSVAVQKARI 336 Query: 404 QIEFNPAWVTEYRQIGYEKRQLRVEHFNNDNVDAGDIGAGKHITLLFELTLNGQKASIDK 463 +E + +IG+ VD G++ A + L + + + + Sbjct: 337 SLECGASTPAYESRIGWG--------GRAVTVDVGELYADEERRFLLFVAVPRAHSMDEL 388 Query: 464 LRYAPDNKLAKSDKTKELAWLKIRWKYPQGKESQLVEFPLGPTINAPSEDMRFRA 518 ++ + +S V D+ F Sbjct: 389 ATRLFK--------------VRCTYLDTPTGQSVDV------AGEDARPDLAFAV 423 >UniRef50_A0C9G5 Chromosome undetermined scaffold_16, whole genome shotgun sequence n=2 Tax=Paramecium tetraurelia RepID=A0C9G5_PARTE Length = 648 Score = 212 bits (540), Expect = 3e-53, Method: Composition-based stats. Identities = 69/322 (21%), Positives = 138/322 (42%), Gaps = 19/322 (5%) Query: 162 SDWDIKDKQSIPASKPIPFAMRYELAPAPWNE---QRTLLKVDILAKDRKSEEL------ 212 + + ++ + I ++ + + QR V I+ KD + Sbjct: 166 YQYHKQVGLNMNLEELIDLRVQAQYDYCKLKKSESQRLPAMVSIITKDIEQYVKNNSSIE 225 Query: 213 PASNLVFLIDTSGSMISDERLPLIQSSLKLLVKELREQDNIAIVTYAGDSRIALPS--IS 270 +L+ +ID SGSM +++ +Q SL L+ L E+D + ++T+ G ++ P ++ Sbjct: 226 AGIDLLCVIDKSGSMEG-KKIASVQQSLVQLLDFLSEKDRLCLITFDGSAQRLTPLKTLT 284 Query: 271 GSHKAEINAAIDSLDAEGSTNGGAGLELAYQQATKGFIKGGINRILLATDGDFNVGIDDP 330 +K AI S+ A G TN G E+A+ Q + +K + I L +DG + Sbjct: 285 QDNKNYFKKAIYSIRASGQTNIAKGTEIAFNQIQQRKMKNQVTSIFLLSDGQDQGAAEYI 344 Query: 331 KSIESMVKKQRESGVTLSTFGVGNSNYNEAMMVRIADVGNGNYSYIDTLSEAQKVLNSEM 390 + + +V+ VT+ +FG G S+++ A+M +I VG G++ YI+ + + + Sbjct: 345 QRQKDVVEDI----VTIHSFGYG-SDHDAALMSKICKVGQGSFYYIEDVKLLDEFFADAL 399 Query: 391 RQMLITVAKDVKAQIEFNPAWVTEYRQI--GYEKRQLRVEHFNNDNVDAGDIGAGKHITL 448 ++ +A+ V+ I+ P + +I Y ++E + I + Sbjct: 400 GRLSSALAEKVQIDIKCAPFIPFQDIKIQKTYGDMWKQIEQERLYQIKIPQIASDSRKDY 459 Query: 449 LFELTLNGQKASIDKLRYAPDN 470 +FE+ L I + P Sbjct: 460 VFEIALPPYSEQILDEQRVPQV 481 >UniRef50_C4WI90 Poly [ADP-ribose] polymerase 4 n=1 Tax=Ochrobactrum intermedium LMG 3301 RepID=C4WI90_9RHIZ Length = 777 Score = 212 bits (539), Expect = 3e-53, Method: Composition-based stats. Identities = 82/520 (15%), Positives = 174/520 (33%), Gaps = 65/520 (12%) Query: 23 PENKESQQQQPSTPTEQQVLAAQQAAIKEAEQSAAAAKALAQQEVQQYSDKQALQGRLQE 82 E +S+ ++ + +Q+ A + QQ V+ ++ +L+ L Sbjct: 177 YEKAKSEGKKATLIEQQRPNVFTNAVANIGPHEKVVIQIEYQQAVRLADERFSLRVPLVV 236 Query: 83 APTFARAAKAKATHIANPGTARYQQFDDN--------------PVKQVAQNPLATFSLDV 128 AP + + + D P ++ NP++ S+++ Sbjct: 237 APRYNPNIASPVVQKVEMQNGWGKSSDAGKPDPYNAPIVTPLTPPAELRTNPVS-ISVEL 295 Query: 129 DTGSY-ANVRRFLNQGLLPPPDAVRVEEIV-NYFPSDWDIKDKQSIPASKPIPFAMRYEL 186 G V ++ + + E + +D D + S A+ + E Sbjct: 296 KPGFPLGKVESLYHKVRIETTNDATREITLDGTAAADRDFVLEWSAVANDAPQVGLFREH 355 Query: 187 APAPWNEQRTLLKVDILAKDRKSEELPASNLVFLIDTSGSMISDERLPLIQSSLKLLVKE 246 + + + S + +VF+ID SGSM + ++SL + Sbjct: 356 V-----GKDDYVLAYVTPPAVASAKKAQREVVFVIDNSGSM-GGTSIEQAKASLDYALSH 409 Query: 247 LREQDNIAIVTYAGDSRIALP---SISGSHKAEINAAIDSLDAEGSTNGGAGLELAYQQA 303 L+ D ++ + S + A + SL+A+G T L A Sbjct: 410 LQPGDRFNVIRFDDTLTRFFEVSVEASQQNIASARHFVMSLEAQGGTAMLPALHAA---L 466 Query: 304 TKGFIKGGINRILLATDGDFNVGIDDPKSIESMVKKQRESGVTLSTFGVGNSNYNEAMMV 363 G+ +I+ TDG+ I + + + + R + G+G + N +M Sbjct: 467 DDSHQGNGLRQIVFLTDGE----ISNEQQLLDAIA-ARRGRSRIFMVGIGTAP-NSYLMN 520 Query: 364 RIADVGNGNYSYIDTLSEAQKVLNSEMRQMLITVAKDVKAQIEFNPAWVTEYRQIGYEKR 423 A++G G +++I + +E + + + ++ D+KA F+ V+ I Sbjct: 521 HAAELGRGTFTHIGSAAEVDERMRALFDKLENPAVTDLKAN--FSEKNVSMTPSI----- 573 Query: 424 QLRVEHFNNDNVDAGDIGAGKHITLLFELTLNGQ----KASIDKLRYAPDNKLAKSDKTK 479 D+ G+ + + + + ID + + L ++ K + Sbjct: 574 -------------LPDLYRGEPLVIAARMGKAAGNLVIEGQIDGRPWTVNLPLDQAMKAE 620 Query: 480 ELAWLKIRWKYPQGKESQLVEFPLGPTINAPSED--MRFR 517 ++ L R K VE LG + +R Sbjct: 621 GISKLWARRKIDDA----EVELTLGKISQDAANARILRLA 656 >UniRef50_Q1D9B7 von Willebrand factor type A domain protein n=3 Tax=Cystobacterineae RepID=Q1D9B7_MYXXD Length = 476 Score = 211 bits (538), Expect = 4e-53, Method: Composition-based stats. Identities = 74/366 (20%), Positives = 136/366 (37%), Gaps = 9/366 (2%) Query: 176 KPIPFAMRYELAPAPWNEQRTLLKVDILAKDRKSEELPASNLVFLIDTSGSMISDERLPL 235 + R P D+ + NL +ID SGSM +L Sbjct: 56 GSLTLTSRLSHPYVPAGTSEVFATFDLSGAQVPGAQRSPVNLALVIDRSGSMSGY-KLAQ 114 Query: 236 IQSSLKLLVKELREQDNIAIVTYAGDSRIALPS-ISGSHKAEINAAIDSLDAEGSTNGGA 294 + + + L+ L +QD +AI+ Y D + + +++ + +D + EG TN GA Sbjct: 115 AKQAARHLIGLLNDQDRLAIIHYGSDVKSLPSLEATAANRERMFQYVDGIWDEGGTNIGA 174 Query: 295 GLELAYQQATKGFIKGGINRILLATDGDFNVGIDDPKSIESMVKKQRESGVTLSTFGVGN 354 GL Q + G+NR++L +DG G+ + + M ++ R +G+TLS GVG Sbjct: 175 GLSAGRYQLSTAQRTYGVNRLILMSDGQPTEGLTADEELTRMARELRATGLTLSAIGVGT 234 Query: 355 SNYNEAMMVRIADVGNGNYSYIDTLSEAQKVLNSEMRQMLITVAKDVKAQIEFNPAWVTE 414 ++NE +M A+ G G Y +++ ++ + +++Q TVA+ V P + Sbjct: 235 -DFNEDLMQAFAEYGAGAYGFLEDAAQLSTLFQKDLQQAGTTVARGVTMTFTLPPGT-SL 292 Query: 415 YRQIGYEKRQLRVEHFNNDNVDAGDIGAGKHITLLFELTLNGQKASIDKLRYAPDNKLAK 474 +GY + N +V D AG+ ++ L + G Sbjct: 293 GEVLGYR----ASQSGNQVHVSLPDFSAGQLERVVVRLNVTGDSVGRTARVLDLKLAYTD 348 Query: 475 SDKTKELAWLKIRWKYPQGKESQLVEFPLGPTINAPSEDMRFRAAVAAYGQKLRGSEYLN 534 + E+A + + V + R + + L Sbjct: 349 LIRDAEVAN-EASLSAVVTNRREEVLARQDREATVYAARARSAVNMQKAAEALSEGRKDE 407 Query: 535 NTSWQQ 540 + Q Sbjct: 408 AKLYLQ 413 >UniRef50_D2LQW0 von Willebrand factor type A n=1 Tax=Bacillus cellulosilyticus DSM 2522 RepID=D2LQW0_BACS4 Length = 282 Score = 211 bits (538), Expect = 4e-53, Method: Composition-based stats. Identities = 62/263 (23%), Positives = 119/263 (45%), Gaps = 8/263 (3%) Query: 177 PIPFAMRYELAPAPWNEQRTLLKVDILAKDRKSEELPASNLVFLIDTSGSMISDERLPLI 236 + FA +YE P ++ L V++ K E NL L+D SGSM E L Sbjct: 6 KLSFAHQYENVPCK-GKEAAYLLVELTGAKVKHTERSPINLSLLLDRSGSMSG-EPLRYC 63 Query: 237 QSSLKLLVKELREQDNIAIVTYAGDSRIALPSISGSHKAEINAAIDSLDAEGSTNGGAGL 296 + + ++ +L ++D +++V + + +HK + I ++ G TN GL Sbjct: 64 KEACNFVINQLTDKDILSVVVFDDQVETIIEPQKVTHKDLLKEYIQRIETRGITNLSGGL 123 Query: 297 ELAYQQATKGFIKGGINRILLATDGDFNVGIDDPKSIESMVKKQRESGVTLSTFGVGNSN 356 Q K +K +NR++L +DG N GI D +++ + + +G+ +ST GV + + Sbjct: 124 IQGCQHVLKQEVKNYVNRVILLSDGQANAGITDKEALVKLADDYQSAGLVISTLGV-SEH 182 Query: 357 YNEAMMVRIADVGNGNYSYIDTLSEAQKVLNSEMRQMLITVAKDVKAQIEFNPAWVTEYR 416 ++E ++ +AD G GN+ +I+ + + E+ +L + +++ I V Sbjct: 183 FDEELLEGVADSGRGNFHFINEVENIPSIFEQELDGLLNVIGQNITLNI-LPKKGVRITN 241 Query: 417 QIGYEKRQLRVEHFNNDNVDAGD 439 GY + ++ GD Sbjct: 242 VFGYNYNS----DEDAVDLTLGD 260 >UniRef50_Q22ST4 von Willebrand factor type A domain containing protein n=1 Tax=Tetrahymena thermophila SB210 RepID=Q22ST4_TETTH Length = 648 Score = 211 bits (536), Expect = 9e-53, Method: Composition-based stats. Identities = 60/330 (18%), Positives = 123/330 (37%), Gaps = 18/330 (5%) Query: 207 RKSEELPASNLVFLIDTSGSMISDERLPLIQSSLKLLVKELREQDNIAIVTYAGDSRIAL 266 ++ +LV +ID SGSM E++ L++ +L ++ + D I IV + L Sbjct: 214 LQTLGRQTVDLVVVIDKSGSMEG-EKIQLVKETLVKIINLMSSMDRICIVCFNESGDRPL 272 Query: 267 PSI--SGSHKAEINAAIDSLDAEGSTNGGAGLELAYQQATKGFIKGGINRILLATDGDFN 324 + +K + I + A G TN G+ A + K + ILL +DG Sbjct: 273 TFTRVTDENKQTLLNLIQQIYAGGGTNISEGINHALKAIQNRKFKNNVTSILLLSDGQDT 332 Query: 325 VGIDDPKSIESMVKKQRESGVTLSTFGVGNSNYNEAMMVRIADVGNGNYSYIDTLSEAQK 384 K+ K Q + + T G G +++ ++ ++D+ NG ++++ ++ Sbjct: 333 KAYTRVKAYID--KYQIKDAFNIETIGFGE-DHDPKLLRTLSDLRNGTFNFMQDVNYLDT 389 Query: 385 VLNSEMRQMLITVAKDVKAQIEF-NPAWVTEYRQIGYEKRQLRVEHFNNDNVDAGDIGAG 443 + M+ TVA+++K ++F P +R ++ +I +G Sbjct: 390 AFINIFAGMISTVAQNIKVGVKFTPPEQFKNFRISKVFGDNWTKVSETQYEINLINILSG 449 Query: 444 KHITLLFELTLNGQKASIDKL--------RYAPDNKLAKSDKTKELAWLKIRWKYPQGKE 495 +FE+ + + + A+S +KE A + ++ Sbjct: 450 VSKDFVFEMEADAFDEQTESTITDENSIFNFITAELTAQSVSSKEEANKQKTFQLQLIDT 509 Query: 496 SQLVEFPLGPTINAPSE---DMRFRAAVAA 522 +VE E + A+ Sbjct: 510 RSMVEGVEQEEDGEVVENYYRVLSAQAIQE 539 >UniRef50_A6R161 Predicted protein n=3 Tax=Onygenales RepID=A6R161_AJECN Length = 759 Score = 210 bits (535), Expect = 1e-52, Method: Composition-based stats. Identities = 65/365 (17%), Positives = 122/365 (33%), Gaps = 31/365 (8%) Query: 146 PPPDAVRVEEIVNYFPSDWDIKDKQSIPASKPIPFAMRYELAPAPWNEQRTLLKVDILAK 205 P + EI++ + A + P + +L P P +L V Sbjct: 6 PAASSEDDFEIIDDQIPIRPRLSTSTTIAGERSPNEVGVQLHPLPDTNSM-ILSVHPPLH 64 Query: 206 DRKSEELPASNLVFLIDTSGSMISD-----------------ERLPLIQSSLKLLVKELR 248 K ++V ID S SM S L L + + + +++ L Sbjct: 65 PEKEMPHVPCDIVLCIDVSYSMQSSAPLPTTDESGEREETGLSVLDLTKHAARTIIETLN 124 Query: 249 EQDNIAIVTYAGDSRIALP--SISGSHKAEINAAIDSLDAEGSTNGGAGLELAYQQAT-K 305 E D + IV ++ ++ + ++ S K A+++L STN GL+L + + Sbjct: 125 ENDRLGIVAFSTEAEVVYEISKMNESSKKAALKAVEALKPLSSTNLWHGLKLGLKAFENE 184 Query: 306 GFIKGGINRILLATDGDFNVGIDDPKSIESM--VKKQRESGVT-LSTFGVGNSNYNEAMM 362 + + + TDG N + + + + + + TFG G N ++ Sbjct: 185 RHTPQSVQALYVLTDGMPNHMCPKQGYVTKLRPILQLLGHRMPMIHTFGFG-YNIRSGLL 243 Query: 363 VRIADVGNGNYSYIDTLSEAQKVLNSEMRQMLITVAKDVKAQIEFNPAWVTEYRQIGYEK 422 IA+VG G +++I V + + T A K + + Sbjct: 244 QAIAEVGGGTFAFIPDAGMIGTVFVHAIANLYTTFATQAKVTFRTSGSVTLAQDLGSKTG 303 Query: 423 RQLRVEHFN--NDNVDAGDIGAGKHITLLFEL----TLNGQKASIDKLRYAPDNKLAKSD 476 L E N V G + G+ L+ + T + L Y L Sbjct: 304 LGLHEESTRDSNLTVAIGTLQYGQSRDLVIRMKNATTAATPSMAQATLTYQFQGCLKSVV 363 Query: 477 KTKEL 481 +++ Sbjct: 364 ADEQV 368 >UniRef50_A0LPD4 von Willebrand factor, type A n=1 Tax=Syntrophobacter fumaroxidans MPOB RepID=A0LPD4_SYNFM Length = 479 Score = 210 bits (533), Expect = 2e-52, Method: Composition-based stats. Identities = 91/416 (21%), Positives = 161/416 (38%), Gaps = 27/416 (6%) Query: 137 RRFLNQGLLPPPDAVRVEEIVNYFPSDWDIKDKQSIPASKPIPFAMRYELAPAPWNEQRT 196 R G PP + VN S I+DK + + A+ E AP Sbjct: 31 RLAAAPGRKPPDPGLARAGTVNL--SGRLIQDKVHMGGDGTVTVALTLECDRAPGGNVEA 88 Query: 197 LLKVDILAKDRKSEELPASNLVFLIDTSGSMISDERLPLIQSSLKLLVKELREQDNIAIV 256 ++V ++D SGSM +L + ++ L+ L E D A+V Sbjct: 89 ---------------RRELDMVVVMDRSGSMADAGKLTHARQAVLNLLSRLSETDRFALV 133 Query: 257 TYAGDSRIA--LPSISGSHKAEINAAIDSLDAEGSTNGGAGLELAYQQATKGFIKGGINR 314 +Y+ + L I+ +++A + + + G+TN G GL+ Q + G ++R Sbjct: 134 SYSDHVQRHGGLLPITPANRATLERIVRGIQPGGATNLGGGLQEGISQLAELQQNGRLSR 193 Query: 315 ILLATDGDFNVGIDDPKSIESMVKKQRESGVTLSTFGVGNSNYNEAMMVRIADVGNGNYS 374 ++L +DG N G+ DP ++ +M E G +ST GVG ++NE +M IAD G GNY+ Sbjct: 194 LILISDGLANRGVTDPSALGTMASVAAERGYAVSTVGVGL-DFNEHLMTSIADKGAGNYT 252 Query: 375 YIDTLSEAQKVLNSEMRQMLITVAKDVKAQIEFNPAWVTEYRQIGYEKRQLRVEHFNNDN 434 ++++ S +V + E R VA V+ + +P +T GY Sbjct: 253 FMESASAFAQVFDKEFRDAGTVVASSVEVHVPLSPG-MTLVHAAGYPIEV----GEGRAV 307 Query: 435 VDAGDIGAGKHITLLFELTLNGQKASIDKLRYAPDNKLAKSDKTKELA--WLKIRWKYPQ 492 GD+ G+ L L + + + + + L++ Sbjct: 308 FRPGDLRFGQSRKLFLTLRIPVGEEKTWDIGAISAHYRSGERAYTASLPQPLRVACVRDP 367 Query: 493 GKESQLVEFPLGPTINAPSEDMRFRAAVAAYGQKLRGSEYLNNTSWQQIKQWAQQA 548 + ++ + R R VAA ++ + E + +Q A A Sbjct: 368 SEAMASIDKAQWEDKVIREDYNRLREEVAASVKEGKREEAVQQIDRYVTRQQAANA 423 >UniRef50_A9FTM1 Putative uncharacterized protein n=1 Tax=Sorangium cellulosum 'So ce 56' RepID=A9FTM1_SORC5 Length = 535 Score = 209 bits (532), Expect = 2e-52, Method: Composition-based stats. Identities = 118/568 (20%), Positives = 207/568 (36%), Gaps = 61/568 (10%) Query: 6 IIMLLMSSLILSGCGPQPENKESQQQQPSTPTEQQVLAAQQAAIKEAEQSAAAAKALAQQ 65 + +L+ L+ CG Q + E ++ + A + Sbjct: 15 VSAVLLGGAGLAACG-QGSSPEPPFAPFDNGAGSSTGSSGSSGSSGTGSFGTGAGSTGAG 73 Query: 66 EVQQYSDKQALQGRLQEAPTFARAAKAKATHIANPGTARYQQFDDNPVKQVAQNPLATFS 125 + + P + G Q + Sbjct: 74 SGSGGGAGAVDGAQAEPPPAPSDEDAGAPPAFTCEGLDSSQPVVL----------YLSAD 123 Query: 126 LDVDTGSYANVRRFLNQGLLPPPDAVRVEEIVNYFPSDWDIKDKQSIPASKPIPFAMRYE 185 S +VR L G P P VR E +NY+ D+ D+ +R E Sbjct: 124 DSNSMSSPVHVRELLRSGRAPEPWQVRTYEFLNYYRIDYAPPDEGE----------LRVE 173 Query: 186 LAPAPWNEQRTL-LKVDILAKDRKSEELPASNLVFLIDTSGSMISDERLPLIQSSLKLLV 244 P E + L++ + + D S + F++DTSGSM E + +++++ + Sbjct: 174 PQIEPGEEAGSYALQIGVRSYDPPSPRR-PIAVTFVLDTSGSMDG-EPMAREKATVRAVA 231 Query: 245 KELREQDNIAIVTYAGDSRIALPS--ISGSHKAEINAAIDSLDAEGSTNGGAGLELAYQQ 302 L E D + +VT+ + + L + G + AA D+L A G T+ +GL + YQ Sbjct: 232 ASLSEGDVVNMVTWNTQNSVILSGHVVDGPDDPALLAAADALSASGGTDLESGLRVGYQL 291 Query: 303 ATKGFIKGGINRILLATDGDFNVGIDDPKSIESMVKKQRESGVTLSTFGVGNSN---YNE 359 A + F +G INR++L +DG NVG+ + I + + + L GVG YN+ Sbjct: 292 AQEHFEEGRINRVILVSDGGANVGVTSEELIALHAEDADQEAIYL--VGVGTGPALGYND 349 Query: 360 AMMVRIADVGNGNYSYIDTLSEAQKVLNSEMRQMLITVAKDVKAQIEFNPAWVTEYRQIG 419 +M + D G G Y Y+D EA + +++ A+ V+ ++ W + + Sbjct: 350 VLMDAVTDKGRGAYVYLDDEDEAFHMFRDRFAEVMEVAARGVQVELTMP--WYFKMEKFY 407 Query: 420 YEKRQLRVEHFNNDNVDAGDIGAGKHITLLFELTLNGQKASIDKLRYAPDNKLAKSDKTK 479 E+ E ++ GD ++F + G + + Sbjct: 408 GEEYSTNPEEVEPQHLAPGD-------AMIFSQLVRGCDPGV--------------INDE 446 Query: 480 ELAWLKIRWKYPQGKESQLV--EFPLGPTINAPSEDMRFRAAVAAYGQKLRG--SEYLNN 535 + ++ RW+ P E++ V E L E + A+ AY + L+ SE L+ Sbjct: 447 DTLTVRARWQTPLTHEAKEVSREATLAELAAGSKEQLVKGKAIVAYAEALKAGTSEALHA 506 Query: 536 TSWQQIKQWAQQAKGEDPQGYRAEFIRL 563 Q I A A G+ AE I L Sbjct: 507 AREQVI---AANAGGDPELDEIAELIPL 531 >UniRef50_A6X8G3 LPXTG-motif cell wall anchor domain protein n=11 Tax=Rhizobiales RepID=A6X8G3_OCHA4 Length = 750 Score = 209 bits (531), Expect = 3e-52, Method: Composition-based stats. Identities = 74/519 (14%), Positives = 161/519 (31%), Gaps = 66/519 (12%) Query: 22 QPENKESQQQQPSTPTEQQVLAAQQAAIKEAEQSAAAAKALAQQEVQQYSDKQALQGRLQ 81 Q + +Q + + + A + + + Q Q + + + + Sbjct: 144 QKAREIYEQAKSEGKKATLIEQQRPNVFTNAVANIGPHEKVVIQIEYQQAVRLSDERFSL 203 Query: 82 EAPTFARAAKAKA-THIANPGTARYQQFDDN-----------PVKQVAQNPLA------T 123 P + + P+ P A T Sbjct: 204 RVPLVVAPRYNPDNASPVVQEVEIKNGWGKSRDTGKPDTYNTPLVTPLAPPTALRTNPVT 263 Query: 124 FSLDVDTGSY-ANVRRFLNQGLLPPPDAVRVEEIV-NYFPSDWDIKDKQSIPASKPIPFA 181 S+ + G V ++ + + E + +D D + S AS Sbjct: 264 ISVKLKAGFPLGKVESLFHKVRIDTTNDATREITLDGAAAADRDFVLEWSAVASDAPQVG 323 Query: 182 MRYELAPAPWNEQRTLLKVDILAKDRKSEELPASNLVFLIDTSGSMISDERLPLIQSSLK 241 + E + + + S + ++F+ID SGSM + ++SL Sbjct: 324 LFREHI-----GKDDYVLAYVTPPALASPKKVQREVIFVIDNSGSM-GGTSIEQAKASLD 377 Query: 242 LLVKELREQDNIAIVTYAGDSRIALP---SISGSHKAEINAAIDSLDAEGSTNGGAGLEL 298 + +L+ D ++ + + + A + SL+A+G T L Sbjct: 378 YALSQLQPGDRFNVIRFDDTLTKFFEDSVDANQENIASARRFVTSLEAQGGTEMLPALHA 437 Query: 299 AYQQATKGFIKGGINRILLATDGDFNVGIDDPKSIESMVKKQRESGVTLSTFGVGNSNYN 358 A G+ +I+ TDG+ I + + + V R + G+G++ N Sbjct: 438 A---LDDSNQGNGLRQIVFLTDGE----ISNEQQLLDAVA-ARRGRSRIFMVGIGSAP-N 488 Query: 359 EAMMVRIADVGNGNYSYIDTLSEAQKVLNSEMRQMLITVAKDVKAQIEFNPAWVTEYRQI 418 +M R A++G G +++I + +E + + + ++ D+KA +T Sbjct: 489 SYLMNRAAELGRGTFTHIGSAAEVDERMRALFDKLENPAVTDLKANFSEKNVSMTPSL-- 546 Query: 419 GYEKRQLRVEHFNNDNVDAGDIGAGKHITLLFELTLN----GQKASIDKLRYAPDNKLAK 474 D+ G+ + + + + ID + + L + Sbjct: 547 ------------------LPDLYRGEPLVIAARMGKAIGNLAIEGQIDGRPWTVNLPLDQ 588 Query: 475 SDKTKELAWLKIRWKYPQGKESQLVEFPLGPTINAPSED 513 + + ++ L R K VE LG ++ Sbjct: 589 AMNAEGISKLWARRKIDDA----EVELTLGKISQDAADA 623 >UniRef50_D0LL92 von Willebrand factor type A n=1 Tax=Haliangium ochraceum DSM 14365 RepID=D0LL92_HALO1 Length = 430 Score = 208 bits (530), Expect = 4e-52, Method: Composition-based stats. Identities = 84/420 (20%), Positives = 155/420 (36%), Gaps = 17/420 (4%) Query: 166 IKDKQSIPASKPIPFAMRYELAPAPWNEQRTLLKVDILAKDRKSEELPASNLVFLIDTSG 225 + + + + P N + L V + +L +ID SG Sbjct: 2 TAQPTPAQRAGSVAVTVTPQYDLLPSNARELNLMVRLEGTGDAPATRAPLDLALVIDRSG 61 Query: 226 SMISDERLPLIQSSLKLLVKELREQDNIAIVTYAGDSRIALPSISGSH--KAEINAAIDS 283 SM ++L ++++ L++ L+ +D I +V+Y+ D + L + E A+ + Sbjct: 62 SMSG-DKLSDVKTAALELLETLQPEDTITLVSYSSDVSMHLMRTRADDAGQREARRALLA 120 Query: 284 LDAEGSTNGGAGLELAYQQATKGFIKGGINRILLATDGDFNVGIDDPKSIESMVKKQRES 343 L A G T G GL A + + ++ ++L +DG N G P + + + Sbjct: 121 LQARGGTALGPGLFRALEALEGASDRTRMSHLMLFSDGIANAGEVRPSVLGARAAGAFGA 180 Query: 344 GVTLSTFGVGNSNYNEAMMVRIADVGNGNYSYIDTLSEAQKVLNSEMRQMLITVAKDVKA 403 GV++ST GVG +YNE +M R+AD G G Y +I +L+ EM+ ++ TVA+ V Sbjct: 181 GVSVSTMGVGV-DYNEDLMTRLADQGGGRYHFIQDSEAIASILDDEMKGLVATVARGVTM 239 Query: 404 QIEFNPAWVTEYRQIGYEKRQLRVEHFNNDNVDAGDIGAGKHITLLFELTLNGQKASIDK 463 + V R GY E + G +GAG+ +L + L + + Sbjct: 240 DLTRAEG-VGTVRVFGYASE----ESAGRVHTRVGSLGAGQTRAILVRIDLLSDATADKR 294 Query: 464 LRYAPDNKLAKSDKTKELAWLKIRWKYPQGKE------SQLVEFPLGPTINAPSEDMRFR 517 + E + + + S+ + + + M Sbjct: 295 PLGHLHIEFDDVSDDGERKSVDVPLSIAHTDDIAAARASEHKDVTVRVAEIESAASMELA 354 Query: 518 AAVAAYGQ--KLRGSEYLNNTSWQQIKQWAQQAKGEDPQGYRAEFIRLIELADGVTDISQ 575 A A G R ++ + A A + E + A+G + + Sbjct: 355 AQAAGRGDFGHARNEIAGAIGKLERRRAQAPSAALDKQIADLREAESEVAGAEGSAEERK 414 >UniRef50_A0DZ93 Chromosome undetermined scaffold_7, whole genome shotgun sequence n=1 Tax=Paramecium tetraurelia RepID=A0DZ93_PARTE Length = 522 Score = 208 bits (528), Expect = 7e-52, Method: Composition-based stats. Identities = 77/444 (17%), Positives = 154/444 (34%), Gaps = 60/444 (13%) Query: 129 DTGSYANVRRFL------NQGLLP----------PPDAVRVEEIVNYFPSDWDIKDKQSI 172 D SYAN + + +G P DA+ V I N + + Sbjct: 9 DNDSYANTTKAIVINEQFIEGERPAICQEHGKFNDDDAIDVV-ITNESNYGRKSLSQNYM 67 Query: 173 PA-----SKPIPFAMRYELAPAPWNEQRTLLKVDILAKDRKSEELPASNLVFLIDTSGSM 227 + + Y P + + + K++ +L+ LID SGSM Sbjct: 68 KQANYVLQDNVELKLSYSGLPTQG---TQAVLLSVQTKNQAITIRQGIDLICLIDHSGSM 124 Query: 228 ISDERLPLIQSSLKLLVKELREQDNIAIVTYAGDSRIA--LPSISGSHKAEINAAIDSLD 285 E++ L++ SLK L+K L+ D + ++ + + L + + + AID+++ Sbjct: 125 SG-EKMHLVKKSLKHLLKMLQPNDRLCLIEFDDQNYRLTRLMRATQENMYKFLIAIDTIE 183 Query: 286 AEGSTNGGAGLELAYQQATKGFIKGGINRILLATDGDFNVGIDDPKSIESMVKKQRESGV 345 A G+T+ G +++A K I I L +DG+ + + K + Sbjct: 184 ANGATDIGNAMKMALSILKHRRFKNPIASIFLLSDGEDEGAAG--RVWNDIQSKNIKEPF 241 Query: 346 TLSTFGVGNSNYNEAMMVRIADVGNGNYSYIDTLSEAQKVLNSEMRQMLITVAKDVKAQI 405 T++TFG G + +M IA G + YI +S+ + + +A + I Sbjct: 242 TINTFGFGR-DCCPKIMSEIAHFKEGQFYYISEISKIDECFFEALGGEASVIAYNTHITI 300 Query: 406 EFNPAWVTEYRQIGYEKRQLRVEHFNNDNVDAGDIGAGKHITLLFELTLNGQKASIDKLR 465 + + Y + + ++ + G + E ++ Sbjct: 301 SCKKNTIKKV----YGDKWALNLQEQSFSIYQPQLQFGVRKDFIVETSVP---------- 346 Query: 466 YAPDNKLAKSDKTKELAWLKIRWKYPQGKESQLVEFPLGPTINAPSEDMRFRAAV----- 520 E+ +K+ + E +E + N +E + F+ + Sbjct: 347 ---------IGMMDEIITVKMHTDSVETSERFTIEQFINIVPNMVAEQIVFQEVMSHYYR 397 Query: 521 AAYGQKLRGSEYLN-NTSWQQIKQ 543 + R + L N +QQ + Sbjct: 398 VQAAETFRNALNLGYNLQYQQAQA 421 >UniRef50_Q2SQR4 Uncharacterized protein containing a von Willebrand factor type A (VWA) domain n=1 Tax=Hahella chejuensis KCTC 2396 RepID=Q2SQR4_HAHCH Length = 733 Score = 206 bits (525), Expect = 1e-51, Method: Composition-based stats. Identities = 75/519 (14%), Positives = 155/519 (29%), Gaps = 77/519 (14%) Query: 22 QPENKESQQQQPSTPTEQQVLAAQQAAIKEAEQSAAAAKALAQQEVQQYSDKQALQGRLQ 81 Q K QQ + V + + + + Q + Sbjct: 130 QKAEKVYQQAKADGKRAAVVHQQRPNLFTHKVANIGPGEMITLTLRYQQNISYRSGAFSL 189 Query: 82 EAPTFARAAKAKATHIAN----------------------------PGTARYQQFDDNPV 113 P + P T + + Sbjct: 190 ALPLTVTPRYIPGSSATPAWSEEDKTRLRNELRSHERTLINDSGWAPATNQVADAPEITP 249 Query: 114 KQVAQNPLATFSLDVDTGSYANVRRFLNQGLLPPP-DAV--RVEEIVNYFPSDWDIKDKQ 170 VA+ L T S A +R LN GL ++V R++ + ++ + Sbjct: 250 PTVAKEDLLTPSHQ------ARIRVHLNPGLPVESIESVTHRIQWTQQTNGYEVSLESNK 303 Query: 171 SIPASKPIPFAMRYELAPAPWNE-------QRTLLKVDILAKDRKSEELP-ASNLVFLID 222 +P K R P ++ ++ E L L++++D Sbjct: 304 DVPMDKDFTLTWRVRQGSEPEAALFKEIVGDDVYAQLLLMPPQFSDEGLSLPRELIWVVD 363 Query: 223 TSGSMISDERLPLIQSSLKLLVKELREQDNIAIVTYAGDSRIALP---SISGSHKAEINA 279 TSGSM + + ++ + L +D ++ + +R P + Sbjct: 364 TSGSMEG-VSIQQARDAVLQALDTLTPRDRFNVIEFNSHARKLFPQAVPAQERALQQARR 422 Query: 280 AIDSLDAEGSTNGGAGLELAYQQATKGFIKGGINRILLATDGDFNVGIDDPKSIESMVKK 339 + L A+G T L+ A + +G + +++ TDG + + ++ + + Sbjct: 423 FVRGLKADGGTEIAEALDRA---LSDAAPEGYVRQVVFLTDG----SVGNELALFKQIDQ 475 Query: 340 QRESGVTLSTFGVGNSNYNEAMMVRIADVGNGNYSYIDTLSEAQKVLNSEMRQMLITVAK 399 Q L T G+G S N M + A G G YS+I+ +E + + + Sbjct: 476 QLGDS-RLFTVGIGPSP-NRFFMRKAAQFGRGAYSHINDTAEVSDKIAELTAALRQPALR 533 Query: 400 DVKAQIEFNPAWVTEYRQIGYEKRQLRVEHFNNDNVDAGDIGAGKHITLLFELTLNGQKA 459 DV+ ++ V D+ G+ + LLF++ + Sbjct: 534 DVRLDVQSALNAEV-------------------YPVAIPDLYRGEPVQLLFKVEDGAAAS 574 Query: 460 SIDKLRYAPDNKLAKSDKTKELAWLKIRWKYPQGKESQL 498 + L + ++ + L ++ P ++ Sbjct: 575 ELPASIQGYGVGLLQDEQPLWMRSLDLQQAAPSQGVARA 613 >UniRef50_Q8LQ58 Os01g0640200 protein n=9 Tax=Poaceae RepID=Q8LQ58_ORYSJ Length = 589 Score = 206 bits (524), Expect = 2e-51, Method: Composition-based stats. Identities = 72/394 (18%), Positives = 150/394 (38%), Gaps = 25/394 (6%) Query: 183 RYELAPAPWNEQRTLLKVDILAKDRKSEELPASNLVFLIDTSGSMISDERLPLIQSSLKL 242 +Y A L +++ S + +LV +ID SGSM +R+ ++++L+ Sbjct: 37 KYHNDVASMAPHDQELLLELRG-SSSSTDRAGLDLVAVIDVSGSMDG-DRIDKVKTALQF 94 Query: 243 LVKELREQDNIAIVTYAGDSRIALPS--ISGSHKAEINAAIDSLDAEGSTNGGAGLELAY 300 ++++L + D + IVT+ ++ P ++ + +AE+ A +D L A G TN GLE Sbjct: 95 VIRKLSDLDRLCIVTFCTNATRLCPLRFVTAAAQAELKALVDGLKAYGDTNMKGGLETGM 154 Query: 301 QQATKGF-IKGGINRILLATDGDFNVGIDDPKSIESMVKKQRESGVTLSTFGVGNSNYNE 359 G ++L +DG N G D + V + TF G S ++ Sbjct: 155 SVVDGRSLAAGRAVSVMLMSDGYQNHGGD--------ARDVHLKNVPVYTFSFGAS-HDS 205 Query: 360 AMMVRIADVG-NGNYSYIDTLSEAQKVLNSEMRQMLITVAKDVKAQIEFNPAWVTEYRQI 418 ++ IA G ++Y+ + + + +L +A+D++ + VT R + Sbjct: 206 NLLEAIARKSLGGTFNYVADSANLTGPFSQLLGGLLTIIAQDLELTVTRFHGEVTIKRVV 265 Query: 419 ---GYEKRQLRVEHFNNDNVDAGDIGAGKHITLLFELTL------NGQKASIDKLRYAPD 469 Q ++ V G + + + ++ L L A++ +Y Sbjct: 266 WVDAGTYPQTTASDGSSVTVSFGTLYSAEARRVIVYLALADKTASPPYDANVCLAQYRFT 325 Query: 470 NKLAKSDKTKELAWLKIRWKYPQGKESQLVEFPLGPTINAPSEDMRFRAAVAAYGQKLRG 529 + + +L +K R G + ++ +R +A K+ Sbjct: 326 FQAQQVTSNPDLITIKRRPSAAPGAARKPQPVENELARRQHADMIRAARDMAE-ANKMED 384 Query: 530 SEYLNNTSWQQIKQWAQQAKGEDPQGYRAEFIRL 563 + + + +++ QA E +L Sbjct: 385 ARNKLEEARKALEENFNQAANPTVAMLLEELRQL 418 >UniRef50_Q231J4 von Willebrand factor type A domain containing protein n=1 Tax=Tetrahymena thermophila SB210 RepID=Q231J4_TETTH Length = 520 Score = 206 bits (524), Expect = 2e-51, Method: Composition-based stats. Identities = 62/311 (19%), Positives = 131/311 (42%), Gaps = 17/311 (5%) Query: 192 NEQRTLLKVDILAKDRKSEELP-------ASNLVFLIDTSGSMISDERLPLIQSSLKLLV 244 N Q V + AKD ++++ +L+F+IDTSGSM +++ L++ S+ ++ Sbjct: 66 NNQIIPGVVSVEAKDFDADQVKKDKVRYQPLDLIFVIDTSGSMQG-KKIELVKKSILQVL 124 Query: 245 KELREQDNIAIVTYAGDSRIALPSI--SGSHKAEINAAIDSLDAEGSTNGGAGLELAYQQ 302 ++ D I++V + +++ L + + K +I +D L A G T G G++ A+ Sbjct: 125 HIIQGDDRISLVGFNSQAKVLLELTQLTKNSKKKIQKTVDELQAGGGTQIGFGMQKAFDI 184 Query: 303 ATKGFIKGGINRILLATDGDFNVGIDDPKSIESMVKKQRESGVTLSTFGVGNSNYNEAMM 362 + + I L +DG N G + + + + E + FG G+ +++ + Sbjct: 185 IKERTNSKNLASIFLLSDGQDNCGFSQTQHFMN--QSKIEYPFCIDCFGFGD-DHDSLTL 241 Query: 363 VRIADVGNGNYSYIDTLSEAQKVLNSEMRQMLITVAKDVKAQIEFNPAWVTEYRQIG--Y 420 +I + G +++I +S+ + + VA++VK + F + + Y Sbjct: 242 SKINQLQQGTFNFIRDISQIDDAFTIILAGIKTFVAQNVKISVNFGNTELMNGITVSKTY 301 Query: 421 EKRQLRVEHFNNDNVDAGDIGAGKHITLLFELTLNGQKAS-IDKLRYAPDNKLAKSDKTK 479 +++ + + AG+ +FEL + + D+ RY + Sbjct: 302 GSEWKKIQD-KQYEIQLNHLMAGRSKDFVFELQIPQFEMKLTDQQRYQIIGSAKIKANSL 360 Query: 480 ELAWLKIRWKY 490 KI K Sbjct: 361 NQGKKKITKKA 371 >UniRef50_B4W304 von Willebrand factor type A domain protein (Fragment) n=2 Tax=Cyanobacteria RepID=B4W304_9CYAN Length = 538 Score = 205 bits (522), Expect = 3e-51, Method: Composition-based stats. Identities = 78/381 (20%), Positives = 150/381 (39%), Gaps = 28/381 (7%) Query: 191 WNEQRTLLKVDILAKD--RKSEELPASNLVFLIDTSGSMISDERLPLIQSSLKLLVKELR 248 + L + + +++ NL ++D SGSM L + + L+ L Sbjct: 15 DSPSTVDLLITFQGSESSQQTSSRRPLNLSLVLDRSGSMAG-APLRYAIQAAQNLIDYLT 73 Query: 249 EQDNIAIVTYAGDSRIALPSISGSHKAEINAAIDSLDAEGSTNGGAGLELAYQQATKGFI 308 D +++V Y + + +P +A + A I + A G TN G L Q Sbjct: 74 ADDFVSVVIYDDTAEVIIPPQLVGDQAALKAKIGKIRARGCTNLSGGWLLGCSQVQANQS 133 Query: 309 KGGINRILLATDGDFNVGIDDPKSIESMVKKQRESGVTLSTFGVGNSNYNEAMMVRIADV 368 INR+LL TDG N GI DP+ + ++ E+ + +T G GN +NE +++ +A+ Sbjct: 134 PERINRVLLLTDGLANYGIKDPQVLTKTALEKAEADIVTTTLGFGNY-FNEDLLINMANA 192 Query: 369 GNGNYSYIDTLSEAQKVLNSEMRQMLITVAKDVKAQIEFNPAWVTEYRQIGYEKRQLRVE 428 GN+ +I + +A +V EM ++ VA++++ +++ Y Q+ Sbjct: 193 ARGNFYFIQSPDDASQVFEIEMESLVSVVAQNLRVRLQPEEFVKVGEILNNYRFSQV--- 249 Query: 429 HFNNDNVDAGDIGAGKHITLLFELTLNGQK----ASIDKLRYAPDNKLA---KSDKTKEL 481 N V GD+ + L LT++ + +I + Y + + + Sbjct: 250 -GNTIEVVLGDVYGVEQKPLAIPLTVSPRSQSGMTTIATVTYDYQTIVEGSIQDISNQFS 308 Query: 482 AWLKIRWKY----------PQGKESQLVEFPLGPTINAPSEDMRFRAAVAAYGQKLRGSE 531 L + + + SQL L + ++ F+ AV + + E Sbjct: 309 ITLTVGSEAEAKNIQPDPQVLEQTSQLRIAKLKDEAVSLADKGDFQQAVVKLRETI---E 365 Query: 532 YLNNTSWQQIKQWAQQAKGED 552 L + + A++ D Sbjct: 366 ELKRKTLDEFFDIAEEIAQLD 386 >UniRef50_A0CDA1 Chromosome undetermined scaffold_17, whole genome shotgun sequence n=1 Tax=Paramecium tetraurelia RepID=A0CDA1_PARTE Length = 604 Score = 205 bits (522), Expect = 3e-51, Method: Composition-based stats. Identities = 52/299 (17%), Positives = 123/299 (41%), Gaps = 11/299 (3%) Query: 206 DRKSEELPASNLVFLIDTSGSMISDERLPLIQSSLKLLVKELREQDNIAIVTYAGDSRIA 265 D++ +L+ +ID SGSM E++ +++ +L +L+ L +D + ++ + + Sbjct: 176 DQQQHSKVGVDLLCVIDRSGSMSG-EKIEMVKQTLNILLNFLGPKDRLCLIQFDDTCQRL 234 Query: 266 --LPSISGSHKAEINAAIDSLDAEGSTNGGAGLELAYQQATKGFIKGGINRILLATDGDF 323 L ++ +K + I + A G T G G ++A +Q + I + +DG Sbjct: 235 TNLRRVTDENKTYYSDIISKIYANGGTVIGLGTQMALKQIKYRKSVNNVTAIFVLSDGQD 294 Query: 324 NVGIDDPKSIESMVKKQRESGVTLSTFGVGNSNYNEAMMVRIADVGNGNYSYIDTLSEAQ 383 I + + K+ T+ +FG G S+++ +M +I+++G G++ +++ +S Sbjct: 295 EAAISSLQKQLAYYKQTL----TIHSFGFG-SDHDAKLMTKISNLGKGSFYFVNNISLLD 349 Query: 384 KVLNSEMRQMLITVAKDVKAQIE-FNPAWVTEYRQIGYEKRQLRVEHFNNDNVDAGDIGA 442 + + + V D+ +E + ++ ++ + Sbjct: 350 EFFVDALGALTSMVVTDISINLENIMQNPYDAISLSKFYGQEKLNYQNKTIHLKIPYLAE 409 Query: 443 GKHITLLFELTLNGQKASIDKLRYAPDNKLAK--SDKTKELAWLKIRWKYPQGKESQLV 499 G+ +F+L + ++ A+ S +T+E + ESQL+ Sbjct: 410 GQRRDFVFQLNIPYINNNVQSENVVMFKASARITSSQTRETIEKQAELVIRFVDESQLI 468 >UniRef50_UPI0001AEB92D inter-alpha-trypsin inhibitor domain-containing protein n=1 Tax=Alteromonas macleodii ATCC 27126 RepID=UPI0001AEB92D Length = 586 Score = 205 bits (521), Expect = 4e-51, Method: Composition-based stats. Identities = 80/485 (16%), Positives = 151/485 (31%), Gaps = 55/485 (11%) Query: 34 STPTEQQVLAAQQAAIKEAEQSAAAAKALAQQEVQQYSDKQALQGRLQEAPTFARAAKAK 93 S +Q+ Q A Q+V K P ++ Sbjct: 122 SLTEQQRPNLFTQQVANIAPGEEIMVTLQYVQQVDYRDGKFTFHLPTTLTPRYSPGIPLN 181 Query: 94 ATHIANPGTARYQQFDDNPVKQVAQNPLATFSLDVDTGSYANVRRFLNQGLLPPPDAVRV 153 + + + + + F + + G LN GL R Sbjct: 182 QFNENIEAEISGTGWGEPTDQVPDARAITPFMREGNEGPQLTFNATLNTGLTLNSVTSRN 241 Query: 154 ------EEIVNYF--------PSDWDIKDKQSIPASKPIPFAMRYELAPAPWNEQRTLLK 199 E NY D DI + S A+ E + L+ Sbjct: 242 HRVNWSESTGNYLVTFNQSNIKMDRDIWLEWQPSPSSAPQAAIFTE---SKGQHDYALVM 298 Query: 200 VDILAKDRKSEELPASNLVFLIDTSGSMISDERLPLIQSSLKLLVKELREQDNIAIVTYA 259 + + + ++ F+IDTSGSM + + SL+L + L E+D +V + Sbjct: 299 LMPPQVKSQDLQDFDRDITFVIDTSGSM-GGRPIVDAKESLQLAIDRLSEKDRFNVVAFN 357 Query: 260 GDSRIALPSI---SGSHKAEINAAIDSLDAEGSTNGGAGLELAYQQATKGFIKGGINRIL 316 D+ + + +K + L+A G T L A + K I +++ Sbjct: 358 NDTTRLFETSVEGTTRNKQYARDFVKHLNAGGGTEMAPALNAA---LKRTTTKDFIKQVV 414 Query: 317 LATDGDFNVGIDDPKSIESMVKKQRESGVTLSTFGVGNSNYNEAMMVRIADVGNGNYSYI 376 TDG + + ++ S +K + L T G+G++ N M R A G G+Y ++ Sbjct: 415 FITDGA----VGNEAALFSQIKNEL-GDARLFTVGIGSAP-NSYFMTRAAQFGLGSYVFV 468 Query: 377 DTLSEAQKVLNSEMRQMLITVAKDVKAQIEFNPAWVTEYRQIGYEKRQLRVEHFNNDNVD 436 ++ ++ ++S + ++ V D+ + A E Sbjct: 469 RNTADIKQQMDSLLYKLESPVLSDLSLTLPAGYAQSAEIYPS-----------------K 511 Query: 437 AGDIGAGKHITLLFELTLNGQKAS------IDKLRYAPDNKLAKSDKTKELAWLKIRWKY 490 D+ AG LL + L + L ++ +A + + Sbjct: 512 IPDLYAGVP--LLLNVKLPHNAGTSGKITLQGTLGRQAGHRSRVDPFGSNIANVDRFYST 569 Query: 491 PQGKE 495 P + Sbjct: 570 PWRSD 574 >UniRef50_D0LNY0 von Willebrand factor type A n=1 Tax=Haliangium ochraceum DSM 14365 RepID=D0LNY0_HALO1 Length = 808 Score = 205 bits (521), Expect = 4e-51, Method: Composition-based stats. Identities = 91/473 (19%), Positives = 176/473 (37%), Gaps = 54/473 (11%) Query: 119 NPLATFSLD--VDTGSYANVRRFLNQGLLPPPDAVRVEEIVNYFPSDWDIKDKQSIPASK 176 P FS D T + V+ + G P P RV E +NY +D + + Sbjct: 367 QPYFYFSYDDSASTAAVELVKYGVANGERPHPSLARVWEFLNY--ETFDSASYEELGDRF 424 Query: 177 PIPFAMRYELAPAPWNEQRTLLKVDILAKDRKSEELPASNLVFLIDTSGSM--------- 227 + M + LL ++ + EE P + + FL+D SGSM Sbjct: 425 RVSMGMVSRPSLTQDGAVDYLLGANVTVPNLTREERPHAVVTFLVDISGSMAEYSPTVDA 484 Query: 228 -ISDERLPLIQSSLKLLVKELREQDNIAIVTYAGDSRIALPSIS----GSHKAEINAAID 282 + R+ +++ L V L+ D + +V++ ++I L + ++ Sbjct: 485 GGAPTRMDIVREGLWKAVSALKPGDIVNVVSFDDAAQIELERGEIRPGAATPRPYLRSVL 544 Query: 283 SLDAEGSTNGGAGLELAYQQATKGFIKGGINRILLATDGDFNVGIDDPKSIESMVKKQRE 342 L G TN AG+E+AY+ A + + INR+++ TD N G DP I V + Sbjct: 545 RLLPRGGTNLSAGIEVAYRVARRNYDPYRINRVIILTDAYANRGSIDPSLIGDHVLIGDD 604 Query: 343 SGVTLSTFGVGNSNYNEAMMVRIADVGNGNYSYIDTLSEAQKVLNSEMRQMLITVAKDVK 402 G+ S GVG ++NE + + DVG G Y + T +A + +L A+DV+ Sbjct: 605 EGIHFSGLGVG-YDFNEDFLNTLTDVGRGTYFSLITERDAARAFGERFVSLLAVAARDVR 663 Query: 403 AQIEFNPAWVTEYRQIGYEKRQLRVEHFNNDNVDAGDIGAGKHITLLFELTLNGQKASID 462 ++++ E +L + E+ + Sbjct: 664 FRLDYP----VEMEHTSSASEELSRDPR--------------------EVQPTNFSYNSS 699 Query: 463 KLRYAPDNKLAKSDKTKELAWLKIRWKYPQGKE--SQLVEFPLGPTINAPSEDMRFRAAV 520 + + + L + + P ++++ + + +E++ A+ Sbjct: 700 QYFFETFRADESVEADASRFRLSVSYTDPVTGTGHVRVLDRSVEQLLGRETENIAAAEAI 759 Query: 521 AAYGQKLRGSEYLNNTSWQQIKQWAQ----QAKGEDPQGYRAEFIRLIELADG 569 ++ + EYL +++++ Q +G Y F +L+ +G Sbjct: 760 HSFVRF--SGEYL---TYEEVDARLQSYSESQRGPLFYEYVELFEQLVATING 807 >UniRef50_B7G6X2 Predicted protein n=1 Tax=Phaeodactylum tricornutum CCAP 1055/1 RepID=B7G6X2_PHATR Length = 523 Score = 205 bits (521), Expect = 4e-51, Method: Composition-based stats. Identities = 70/333 (21%), Positives = 124/333 (37%), Gaps = 51/333 (15%) Query: 194 QRTLLKVDILAKDRKSEE---LPASNLVFLIDTSGSMISDERLPLIQSSLKLLVKELREQ 250 I A+ E+ +L+ ++D SGSM +L L + +L +L++ L+ Q Sbjct: 44 STNHFCASIHARTMPKEDEDCRTPIDLIVVLDVSGSMTG-NKLKLCKKTLTMLLRVLQTQ 102 Query: 251 DNIAIVTYAGDSRIALPS--ISGSHKAEINAAIDSLDAEGSTNGGAGLELAYQQATKGFI 308 D ++++ D+R+ P+ +S +KA I SL G TN A L LA Q+ Sbjct: 103 DRFGLISFGSDARVEFPAQAMSKQNKASALQKIQSLTTRGCTNMSAALGLAVQELKIIEK 162 Query: 309 KGGINRILLATDGDFNVGIDDPKSIESM-------------------------------- 336 + + TDG N GI D + S+ Sbjct: 163 SNPVRSLFFLTDGLANEGISDLDGLVSLTRNCLLPSDNPSNVLNSEVMIAECLDDLATSQ 222 Query: 337 ----------VKKQRESGVTLSTFGVGNSNYNEAMMVRIADVG-NGNYSYIDTLSEAQKV 385 ++ + +TL TFG G ++N A++ +AD G Y +I+ S Sbjct: 223 HQITRLPVAEIESVCRAPITLHTFGYGR-DHNAALLESLADTTQGGAYYFIEDDSNVGSA 281 Query: 386 LNSEMRQMLITVAKDVKAQIEFN-PAWVTEYRQIGYEKRQLRVEHFNNDNVDAGDIGAGK 444 + + ++ VA++ I A R + Q + V GD A + Sbjct: 282 FGNALGGIMSIVAQNAVLTIRLPSEAEARGARIVEVYHDQAIKRENDIYTVSLGDFYAEE 341 Query: 445 HITLLFELTLNGQKASIDKLRYAPDNKLAKSDK 477 ++F++ L + + LA +D Sbjct: 342 SRDVIFKMELTKPAFTSTLPVAHVEVTLAYTDT 374 >UniRef50_A9F1H2 Family membership n=1 Tax=Sorangium cellulosum 'So ce 56' RepID=A9F1H2_SORC5 Length = 607 Score = 205 bits (521), Expect = 5e-51, Method: Composition-based stats. Identities = 88/388 (22%), Positives = 149/388 (38%), Gaps = 17/388 (4%) Query: 180 FAMRYELAPAPWNEQRTLLKVDILAKDRKSEE-LPASNLVFLIDTSGSMISDERLPLIQS 238 F R + P LL+V+I + + +L +ID SGSM E+L L Sbjct: 2 FNARPDRPWLPAEPSERLLRVEITVPRPEGGQARKPVHLSLVIDRSGSMSG-EKLRLALE 60 Query: 239 SLKLLVKELREQDNIAIVTYAGDSRIALPSI--SGSHKAEINAAIDSLDAEGSTNGGAGL 296 + + ++ L+ D ++VT+ + +PS + + AA+D++ A G+T+ G G Sbjct: 61 AARQAIRTLQPGDRFSVVTFDHQVEVPIPSTDATPGARLRAEAALDTVIARGNTDLGGGW 120 Query: 297 ELAYQQATKGFIKGGINRILLATDGDFNVGIDDPKSIESMVKKQRESGVTLSTFGVGNSN 356 + + I R+LL TDG N GI P + S + QR VT ST G+G Sbjct: 121 LRGCAEVGAHLPEDAIGRVLLLTDGQANHGITSPDELTSRARSQRLRRVTTSTIGLGEG- 179 Query: 357 YNEAMMVRIADVGNGNYSYIDTLSEAQKVLNSEMRQMLITVAKDVKAQIEFNPAWVTEYR 416 +NE ++ R+++ G GN+ + E + E+ ++L VA+D I P V Sbjct: 180 FNEFLLGRLSEEGGGNFYFAARADELPGFVGREIGEVLSVVARDAALVIR-APGGVEVES 238 Query: 417 QIGYEKRQLRVEHFNNDNVDAGDIGAGKHITLLFELTLNGQKASIDKLRYAPDNKLAKSD 476 Y + G + G + +F L + +A +D Sbjct: 239 LNDYP----CTRDGGTCSFSLGSLPGGMVLAPMFRLRFPAGAIAE-----TLVIDVALAD 289 Query: 477 KTKELAWLKIRWKYPQGKESQLVEFPLGPTINAPSEDMRFRAAVAAYGQKLRGSEYLNNT 536 + L+ +P+ E Q P P + + + A + Sbjct: 290 RDGALSKEPWTLAWPRVAEEQARAQPADPEVLRAAAALDAARARRVALGLNDDRLWKEAA 349 Query: 537 S-WQQIKQWAQQAKGEDPQGYRAEFIRL 563 S + DP+ RAE RL Sbjct: 350 SALTDAASRLRGYADGDPE-IRAEADRL 376 >UniRef50_C5FLY1 U-box domain containing protein n=1 Tax=Microsporum canis CBS 113480 RepID=C5FLY1_NANOT Length = 748 Score = 205 bits (520), Expect = 5e-51, Method: Composition-based stats. Identities = 67/411 (16%), Positives = 145/411 (35%), Gaps = 60/411 (14%) Query: 193 EQRTLLKVDILAKDRKSEELP--ASNLVFLIDTSGSMI---------------SDERLPL 235 + + V I + +++P ++V +ID S SM L L Sbjct: 46 PNKNSMVVSIQPPLKPKDDVPHVPCDIVLVIDISASMNSAAPIPTGESGGEDTGLSILDL 105 Query: 236 IQSSLKLLVKELREQDNIAIVTYAGDSRIALPS--ISGSHKAEINAAIDSLDAEGSTNGG 293 + + K +++ L E D +A+VT+ + R+A +S +K+++ AAID L STN Sbjct: 106 TKHAAKTIIQTLNENDRLAVVTFCTEIRVAFELEFMSEENKSKVLAAIDCLHGISSTNLW 165 Query: 294 AGLELAYQQATKGFIKGGINRILLATDGDFNVGIDDPKSIESMVKKQRESG------VTL 347 G++ + +G + +L+ TDG N + + + + + Sbjct: 166 HGIKEGLKVLATNSTQGNVQALLVLTDGAPNHMCPAQGYVPKLRQTLLDHRDLTGSLPLI 225 Query: 348 STFGVGNSNYNEAMMVRIADVGNGNYSYIDTLSEAQKVLNSEMRQMLITVAKDVKAQIEF 407 TFG G ++ IA++G G +++I V + + T + Sbjct: 226 HTFGFGYY-LRSPLLQSIAEIGGGTFAFIPDAGMIGTVFVHAVANLYSTFTPQANLLLHG 284 Query: 408 NPAWVTEYRQIGYEKRQLRVEHFNNDNVDAGDIGAGKHITLL-----------------F 450 + +L ++ G + G+ ++ Sbjct: 285 DSLTTFTVDLGSKPGLELENPTGKGVSLRLGSLQYGQSRDIIIHYDKSSKNRTVNIQGKL 344 Query: 451 ELTLNGQKASIDKLRYAPDNKLAKSDKTKELAWLK------IRWKYPQGKESQLVEFPLG 504 T+ G + + + ++++ + ++ +R +P ++ +FP Sbjct: 345 NYTVGGDIKTAEVHKLVHIDEVSLPRSVYDYHLMRSTICTLLRKFHPLEDTAENYQFPTA 404 Query: 505 PTINAPSEDMRFRAAVAAYGQKLRGSEYLNNTSWQQIKQWAQQAKGEDPQG 555 + S+ + A + + L S+ N + + I GE P G Sbjct: 405 NLGDIRSKIEKLAAEI----EALGHSDEYNQSLLKDIA-------GEPPHG 444 >UniRef50_A5UYN7 von Willebrand factor, type A n=1 Tax=Roseiflexus sp. RS-1 RepID=A5UYN7_ROSS1 Length = 459 Score = 204 bits (519), Expect = 6e-51, Method: Composition-based stats. Identities = 80/386 (20%), Positives = 152/386 (39%), Gaps = 20/386 (5%) Query: 143 GLLPPPDAVRVEEIVNYFPSDWDIKDKQSIPASKPIPFAMRYELAPAPWNEQRTLLKVDI 202 G L DA R +Y + ++ + P + + +PA + L Sbjct: 28 GELKYEDAARTASGGSYLTAQLELDQVYAPPGQNVDRYLLLTLCSPAKVPPEHAL----- 82 Query: 203 LAKDRKSEELPASNLVFLIDTSGSMISDERLPLIQSSLKLLVKELREQDNIAIVTYAGDS 262 + + P +LV ++D SGSM +L + +L+ + L++ D ++VT++ Sbjct: 83 ----PREQHRPPLHLVAVLDVSGSMSG-TKLASAKEALRQALHFLQDGDVFSLVTFSDQV 137 Query: 263 RIALPSISGSHKA--EINAAIDSLDAEGSTNGGAGLELAYQQATKGFIKGGINRILLATD 320 + L + S + + ++ +D + A G T GL + +LL +D Sbjct: 138 QTHLKAESYAQRKRDKMENLLDEIRASGMTALDGGLAQGIDL--GQKKRQATTLVLLLSD 195 Query: 321 GDFNVGIDDPKSIESMVKKQRESGVTLSTFGVGNSNYNEAMMVRIADVGNGNYSYIDTLS 380 G NVG D + I +K R+SG+ +ST GVG +YNEA+MV IA+ G G + +I S Sbjct: 196 GQANVGETDLEKIGLRAQKARQSGLIVSTLGVGL-DYNEALMVEIANQGGGRFYHIQEGS 254 Query: 381 EAQKVLNSEMRQMLITVAKDVKAQIEFNPAWVTEYRQIGYEKRQLRVEHFNNDNVDAGDI 440 + L E+ + A+ V+ + + Y + + GD+ Sbjct: 255 QIPAALMQELGSAAMLAARQVEVEFDLPSGAALVSLTALYPLEMVNSRP----LLKVGDL 310 Query: 441 GAGKHITLLFELTLNGQKASIDKLRYAPDNKLAKSDKTKELAWLKIRWKYPQGKESQLVE 500 + + LTL A + +T LA + ++ + ++ + Sbjct: 311 LPDVRVEIPLRLTLYPHAAGERFSVSGGVHHQTPRGQTLGLALNAVSVRFVEQRQFEERP 370 Query: 501 FPLGPTINAPSEDMRFRAAVAAYGQK 526 + P + E R A + + + Sbjct: 371 GYVAPVMERVLE-FRRAAHLLEFARL 395 >UniRef50_A6FSG0 Putative uncharacterized protein n=1 Tax=Roseobacter sp. AzwK-3b RepID=A6FSG0_9RHOB Length = 444 Score = 204 bits (519), Expect = 8e-51, Method: Composition-based stats. Identities = 73/357 (20%), Positives = 130/357 (36%), Gaps = 16/357 (4%) Query: 199 KVDILAKDRKSEELPASNLVFLIDTSGSMISDERLPLIQSSLKLLVKELREQDNIAIVTY 258 V A ++E P NL ++D S SM + L + + +V LR D +AIV + Sbjct: 27 IVAPSAPVTETEPRPPLNLALVLDRSSSMRG-QPLHEAKRAADQIVAGLRPSDRLAIVAF 85 Query: 259 AGDSRIALPSISGSHKAEINAAIDSLDAEGSTNGGAGLELAYQQATKGFIKGGINRILLA 318 + + AA+ + A G T G L +Q+ G R+ L Sbjct: 86 DNATEVMFSGGPRGDGQAARAALSRIHARGMTALHDGWLLGVEQSIAMREAGTPARVFLL 145 Query: 319 TDGDFNVGIDDPKSIESMVKKQRESGVTLSTFGVGNSNYNEAMMVRIADVGNGNYSYIDT 378 +DG NVG+ D +I + + E G+T ST G+G +NE +M +A G GN Y +T Sbjct: 146 SDGVANVGLTDASAIAADCTRMAEHGITTSTCGLGMG-FNEDLMAEMARAGRGNAYYGET 204 Query: 379 LSEAQKVLNSEMRQMLITVAKDVKAQIEFNPAWVTEYRQIGYEKRQLRVEHFNNDNVDAG 438 + Q E + A+ ++ ++ E R L + V Sbjct: 205 AEDLQDPFEQEFDLLRNICARGLRLRLSAGAGV---------EMRVLNQYPERDGEVLLP 255 Query: 439 DIGAGKHITLLFELTLNGQKASIDKLRYAPDNKLAKSDKTKELAWLKIRWKYPQGKESQL 498 D+ + EL + ++ ++ + P+ S Sbjct: 256 DLAFDGEAWAMVELQFDESDEQPGDRLLLSAEVTGQTPDGDAISDGPASLRLPRLPASAF 315 Query: 499 VEFPLGPTINAPSEDMRFRAAVAAYGQKLRGSEYLNNTS-WQQIKQWAQQAKGEDPQ 554 + A + ++R A ++ R + ++ Q + AQ+ GE+ Sbjct: 316 DMITQEEIVQARARELR----AADLQERARLAARRHDWDQVQTLIDEAQRESGENAW 368 >UniRef50_B5YMD8 Predicted protein n=1 Tax=Thalassiosira pseudonana CCMP1335 RepID=B5YMD8_THAPS Length = 868 Score = 204 bits (518), Expect = 8e-51, Method: Composition-based stats. Identities = 84/418 (20%), Positives = 153/418 (36%), Gaps = 37/418 (8%) Query: 147 PPDAVRVEEIVNYFPSD---WDIKDKQSIPASKPIPFAMRYELAPAPWNEQRTLLKVDIL 203 AV EE F + + + + + + A E E + I Sbjct: 63 AEGAVAQEEEAVDFAMEMLAREHEQQGIREETLQLSVAPHRESIGLQSGEFTGQICATIK 122 Query: 204 AKDRKSEE---LPASNLVFLIDTSGSMISDERLPLIQSSLKLLVKELREQDNIAIVTYAG 260 A+D + ++V +D SGSM E+L L + +L LL++EL D A+++++ Sbjct: 123 ARDLPQRDSFARSPIDIVVALDVSGSMR-VEKLDLCKETLHLLLRELHHDDRFALISFSE 181 Query: 261 DSRIALP--SISGSHKAEINAAIDSLDAEGSTNGGAGLELAYQQATKGFIKGGINRILLA 318 D+ I +P ++ +K + AID L +G TN + + LA Q + + L Sbjct: 182 DAVIEVPMQKVNERNKQQALHAIDRLSVKGRTNIASAVSLAAQVVNGVAEPNKVRSVFLL 241 Query: 319 TDGDFNVGIDDPKSIESM----VKKQRESG---VTLSTFGVGNSNYNEAMMVRIADVG-N 370 TDG+ N G + + + V+ R ++L TFG G ++ ++ +A Sbjct: 242 TDGNANTGYTEAIDLVKLTSIFVEANRNPHTPPISLHTFGYGPEP-DQKLLRGMAMATSG 300 Query: 371 GNYSYIDTLSEAQKVLNSEMRQM--------LITVAKDVKAQIEFNPAWVTEYRQIGYEK 422 G++ + S+ + + L VA++V I I Sbjct: 301 GSFYSVRDNSQVSSAFGDAIGGILSLALYSVLSVVAQNVIVTISVPSESAKCGAGI-VAI 359 Query: 423 RQLRVEHFNND--NVDAGDIGAGKHITLLFELTLNGQKASIDKLRYAPDNKLAKSDKTKE 480 R+E + V GD+ A + +LFE TL + +P + Sbjct: 360 HDDRMEEIGDGVFQVSLGDLYAEECRDILFETTLVYPTKKPPQSNASPIFYPHALVELSY 419 Query: 481 LAWLKIRWKYPQG-------KESQLVEFPLGPTINAPSEDMRFRAAVAAYGQKLRGSE 531 L ++ R P S + +P + +R A+ Q + E Sbjct: 420 LDTIRHRSIAPISFLAGIARPNSNEISWP-NQYVAVQWLRVRTAKAIRQAEQLAKDGE 476 >UniRef50_Q55874 Uncharacterized protein sll0103 n=16 Tax=Cyanobacteria RepID=Y103_SYNY3 Length = 420 Score = 204 bits (518), Expect = 9e-51, Method: Composition-based stats. Identities = 89/394 (22%), Positives = 158/394 (40%), Gaps = 41/394 (10%) Query: 177 PIPFAMRYELAPAPWNEQRTLLKVDILAKDRKSEELPASNLVFLIDTSGSMISDERLPLI 236 + + + A + L++ + AK + NL ++D SGSM + L + Sbjct: 4 DLSLLLSDQNLDAGAPTSQRQLRIAVAAKADDHDRRLPLNLCLVLDHSGSMDG-QPLETV 62 Query: 237 QSSLKLLVKELREQDNIAIVTYAGDSRIALPSISGSHKAEINAAIDSLDAEGSTNGGAGL 296 +S+ L+ L E D ++++ + ++I + + + A I AI+ L AEG T GL Sbjct: 63 KSAALGLIDRLEEDDRLSVIAFDHRAKIVIENQQVRNGAAIAKAIERLKAEGGTAIDEGL 122 Query: 297 ELAYQQATKGFIKGGINRILLATDGDFNVGIDDPKSIESMVKKQRESGVTLSTFGVGNSN 356 +L Q+A KG + ++ I L TDG+ G D + + +T+ T G G+ + Sbjct: 123 KLGIQEAAKGK-EDRVSHIFLLTDGENEHG--DNDRCLKLGTVASDYKLTVHTLGFGD-H 178 Query: 357 YNEAMMVRIADVGNGNYSYIDTLSEAQKVLNSEMRQMLITVAKDVKAQIEFNP----AWV 412 +N+ ++ IA G+ SYI+ SEA ++M + +E P A V Sbjct: 179 WNQDVLEAIAASAQGSLSYIENPSEALHTFRQLFQRMSNVGLTNAHLLLELAPQAHLAIV 238 Query: 413 TEYRQIGYEKRQLRV-EHFNNDNVDAGDIGAGKHITLLFELTLNGQKASIDKLRYAPDNK 471 Q+ E L V + V GD+ + LL L L+ + Sbjct: 239 KPVAQVSPETMDLTVQNQGAIEEVRLGDLMTDQERVLLLNLYLDQLLPGQHVIGQ----- 293 Query: 472 LAKSDKTKELAWLKIRWKYPQGKESQLVEFPLGPTIN-----APSEDMRFRAAV---AAY 523 ++IR+ P ++ L+ PL TI PS D++ + ++ A Y Sbjct: 294 ------------VQIRYDDPASGQTNLLSDPLPLTIQVQTQYQPSTDVQVQESILTLAKY 341 Query: 524 GQ------KLRGSEYLNNTSWQQIKQWAQQAKGE 551 Q KL+ + + Q G+ Sbjct: 342 RQTQIAETKLKAGDRQGAATMLQTAAKTALQMGD 375 >UniRef50_C9SWV9 U-box domain containing protein n=1 Tax=Verticillium albo-atrum VaMs.102 RepID=C9SWV9_VERA1 Length = 662 Score = 203 bits (517), Expect = 1e-50, Method: Composition-based stats. Identities = 47/305 (15%), Positives = 106/305 (34%), Gaps = 32/305 (10%) Query: 182 MRYELAPAPWNEQRTLLKVDILAKDRKSEE-----LPASNLVFLIDTSGSMI-------- 228 + + P + + + ++V +ID SGSM Sbjct: 53 LTLSVHPLASRDGLLVKVEPPTTPREALQSGKRIPRAPCDIVLVIDVSGSMDDAAPAPVI 112 Query: 229 --------SDERLPLIQSSLKLLVKELREQDNIAIVTYAGDSRIALPS--ISGSHKAEIN 278 L L + + + +++ L E+D + IV + ++++ L ++ +K Sbjct: 113 PGQKDENTGLSILDLTKHAARTILETLDERDRLGIVAFTTNAKVILSLVEMNPDNKVSAK 172 Query: 279 AAIDSLDAEGSTNGGAGLELAYQQATK-GFIKGGINRILLATDGDFNVGIDDPKSIESMV 337 I++L TN G+ + + G + +++ TDG N G I + Sbjct: 173 DKIENLQPLNGTNMWHGITEGIKLFSDCDSSSGRVPAMMVLTDGLPNSGCPRLGYIPKL- 231 Query: 338 KKQRESGVTLSTFGVGNSNYNEAMMVRIADVGNGNYSYIDTLSEAQKVLNSEMRQMLITV 397 + + T+ TFG G + ++ IA++G GNY++I V + + T Sbjct: 232 RDMGQLPATIHTFGFG-YHIRSGLLKSIAEIGGGNYAFIPDAGMIGTVFVHAVANLQSTF 290 Query: 398 AKDVKAQIEFN------PAWVTEYRQIGYEKRQLRVEHFNNDNVDAGDIGAGKHITLLFE 451 A + + + + + + + + G+I G+ + Sbjct: 291 ANRATLTLTYPSELAIQESVGDSVEKQAPTELAGYLTPTSQLTISLGNIQYGQSRDVYLR 350 Query: 452 LTLNG 456 + + Sbjct: 351 ASPDA 355 >UniRef50_C4XPW8 Putative uncharacterized protein n=1 Tax=Desulfovibrio magneticus RS-1 RepID=C4XPW8_DESMR Length = 439 Score = 203 bits (515), Expect = 2e-50, Method: Composition-based stats. Identities = 82/380 (21%), Positives = 143/380 (37%), Gaps = 36/380 (9%) Query: 193 EQRTLLKVDILAKDRKS---EELPASNLVFLIDTSGSMISDERLPLIQSSLKLLVKELRE 249 + + V I A + +E NL ID SGSM L + +V +L+ Sbjct: 20 DNTLDVLVRIQAPNTPEGETKERTRLNLALAIDRSGSMAGR-PLEEAKRCASFVVDKLKN 78 Query: 250 QDNIAIVTYAGDSRIALPSISGSHKAEINAAIDSLDAEGSTNGGAGLELAYQQATKGFIK 309 D ++++ Y +PS+ KA + AI+ +D G TN G +Q + Sbjct: 79 TDRVSLIAYDSSIETRVPSVKVEDKAIFHRAIEGIDDGGCTNLHGGWLKGAEQISPYIDP 138 Query: 310 GGINRILLATDGDFNVGIDDPKSIESMVKKQRESGVTLSTFGVGNSNYNEAMMVRIADVG 369 I+RI+L +DG N G+ D I ++ ++GVT ST+G+G SN+NE +M+ +A G Sbjct: 139 STISRIILLSDGQANEGLTDEAEIFKQCRELADAGVTTSTYGLG-SNFNETLMIGMAKNG 197 Query: 370 NGNYSYIDTLSEAQKVLNSEMRQMLITVAKDVKAQIEFNPAWVTEYRQIGYEKRQLRVEH 429 GN Y T + E+ + AK V+A I + + E + Sbjct: 198 QGNSYYGRTADDLMDPFQEELSLLEALFAKQVRASISASAGILFEI--------LNKYST 249 Query: 430 FNNDNVDAGDIGAGKHITLLFELTLNGQKASIDKLRYAPDNKLAKSDKTKELAWLKIRWK 489 + D+ +L + + +L ++ ++ Sbjct: 250 DRQGKIQLPDLAYEGEAWMLVRCFIPRTQTGEG-----------DGTDLIDLFSVEFTYQ 298 Query: 490 YPQGKESQLVEFPLGPTINAPSEDMRFR-AAVAAYGQKLRGSEYLNNTSWQQIKQWAQQA 548 G+ F + P A +V +R + L Q+ Q A Sbjct: 299 DLNGE-----SFNIPPIKLALPSLPAKAWESVTEDSLTVRRANELEAADIQEKAQQA-AK 352 Query: 549 KGEDPQGYRAEFIRLIELAD 568 +G+ P+ R L++ A Sbjct: 353 RGDWPEVKR-----LLKNAK 367 >UniRef50_A0EHG0 Chromosome undetermined scaffold_97, whole genome shotgun sequence n=2 Tax=Paramecium tetraurelia RepID=A0EHG0_PARTE Length = 533 Score = 202 bits (514), Expect = 3e-50, Method: Composition-based stats. Identities = 62/358 (17%), Positives = 135/358 (37%), Gaps = 27/358 (7%) Query: 142 QGLLP----PPDAVRVEEIVNYFPSDWDIKDKQSIPASKPIPFAMRYEL---------AP 188 +G +P P ++++ + S+ + + + AM+Y L + Sbjct: 29 EGRVPIVGQNPSRFNDDDMITVYQSNNNKLNYGRNSLGQGYMTAMKYNLKDNISIQASSH 88 Query: 189 APWNEQRTLLKVDILAKDR-------KSEELPASNLVFLIDTSGSMISDERLPLIQSSLK 241 N+Q L + I + D + +LV LID SGSM E++ L++ +LK Sbjct: 89 TLMNQQNAALMITIKSNDILLINQRGQECVRQGVDLVCLIDHSGSMQG-EKIKLVRKTLK 147 Query: 242 LLVKELREQDNIAIVTYAGDSRIALPSI--SGSHKAEINAAIDSLDAEGSTNGGAGLELA 299 ++ L+ D + ++ + + + + + AI SL A G T+ G G+++A Sbjct: 148 QMLTFLQPCDRLCLIMFDCKVYRLTRLMRVTQENVQKFRVAISSLQARGGTDIGNGMKMA 207 Query: 300 YQQATKGFIKGGINRILLATDGDFNVGIDDPKSIESMVKKQRESGVTLSTFGVGNSNYNE 359 K ++ I L +DG + + +++ T+ TFG G + Sbjct: 208 LSILKHRKYKNPVSAIFLLSDGVDEGAEERVRD--DLIQYNIRDSFTIKTFGFGR-DCCP 264 Query: 360 AMMVRIADVGNGNYSYIDTLSEAQKVLNSEMRQMLITVAKDVKAQIEFNPAWVTEYRQIG 419 +M IA G + ++ L+ + + ++ VA V+ ++ + + ++ Sbjct: 265 KIMSEIAHYKEGQFYFVPNLTNIDECFAEALGGLVSVVANHVQLSVQPMHSNKVQIKK-A 323 Query: 420 YEKRQLRVEHFNNDNVDAGDIGAGKHITLLFELTLNGQKASIDKLRYAPDNKLAKSDK 477 Y + + + +G +FE+ + + + DK Sbjct: 324 YGDKWTYDSWKGVFTLYQPHLLSGVRKDYIFEVADYKTSGKQEIRVLLQADPVEGGDK 381 >UniRef50_B8AE57 Putative uncharacterized protein n=1 Tax=Oryza sativa Indica Group RepID=B8AE57_ORYSI Length = 585 Score = 202 bits (513), Expect = 3e-50, Method: Composition-based stats. Identities = 75/374 (20%), Positives = 138/374 (36%), Gaps = 39/374 (10%) Query: 174 ASKPIPFAMRYELAPAPWNEQRTLLKVDILAKDRKSEELP---ASNLVFLIDTSGSMISD 230 ++ P+ + L P +V + + +L ++V ++D SGSM Sbjct: 2 SADPVKVSTTTMLPTIPRGHTNKDFRVLLRVEAPPMADLKGHVPIDVVAVLDVSGSMGDP 61 Query: 231 -------------ERLPLIQSSLKLLVKELREQDNIAIVTYAGDS----RIALPSISGSH 273 RL +++ ++K ++++L + D ++IV + L +ISG+ Sbjct: 62 AMASSDFEKNKPPSRLDVLKEAMKFIIRKLDDGDRLSIVAFNDRPVKEYSTGLLNISGNG 121 Query: 274 KAEINAAIDSLDAEGSTNGGAGLELAYQQA--TKGFIKGGINRILLATDGDFNVG-IDDP 330 + +D L+A G T LE A + G + + ILL TDGD G Sbjct: 122 RRIAEKKVDWLEARGGTALMPALEEAIRVLDCRPGDSRNSVGFILLLTDGDDTSGFRWSR 181 Query: 331 KSIESMVKKQRESGVTLSTFGVGNSNYNEAMMVRIADVGNGNYSYIDTLSE--AQKVLNS 388 I V K + TFG+G ++ +EA++ IA G YS++D + L Sbjct: 182 DVINGAVGKY-----PVHTFGLGAAHSSEALLH-IAQESRGTYSFVDDENMDKIAGALAV 235 Query: 389 EMRQMLITVAKDVKAQI---EFNPAWVTEYRQIGYEKRQLRVEHFNNDNVDAGDIGAGKH 445 + + A D + + E + A + GYE R + V G + AG+ Sbjct: 236 CIGGVKTVAAVDTRVSVRVAELSGARIERIDSCGYESRVACG--GASGEVVVGVLYAGEV 293 Query: 446 ITLLFELTLNGQKASIDKLRYAPDNKLAKSDKTK---ELAWLKIRWKYPQGKESQLVEFP 502 + + L L S + Y + E L + + Y + ++ + Sbjct: 294 KSFIIHLHLPAASVSSLECGYCDAATTCDHHCPRRRHEQRLLDVGYSYRRAPDASAISIV 353 Query: 503 LGPTINAPSEDMRF 516 E+ Sbjct: 354 GRGVFVQRPEEEVL 367 >UniRef50_B5ZY26 Vault protein inter-alpha-trypsin domain protein n=11 Tax=Rhizobium RepID=B5ZY26_RHILW Length = 794 Score = 202 bits (513), Expect = 3e-50, Method: Composition-based stats. Identities = 74/530 (13%), Positives = 165/530 (31%), Gaps = 60/530 (11%) Query: 21 PQPENKESQQQQPSTPTEQQVLAAQQAAIKEAEQSAAAAKALAQQEVQQYSDKQALQGRL 80 Q + +Q + + + + + + Q Q + Q+ Sbjct: 145 RQEAREIYEQAKAEGKKTALLEQQRPNIFTNQVANIGPGEEIVVQIEYQQTVHQSGGEFS 204 Query: 81 QEAPTFARAAKAKAT----------------HIANPGTARYQQFDDNPVKQVAQNPLATF 124 P A + + +P + NP++ Sbjct: 205 LRFPMVVAPRYNPAPIVQTVEFNNGAGFATPRDPLDNREKIEAPALDPRENAKINPVS-L 263 Query: 125 SLDVDTGSY-ANVRRFLNQGLLPPPDAVRVEEIV--NYFPSDWDIKDKQSIPASKPIPFA 181 ++D+ G V+ ++ + + + P+D D + K Sbjct: 264 TVDLKAGFPLGEVKSSFHEVDIRQDGDQERTISLKGDAVPADKDFELTWQAAPGKLPSAG 323 Query: 182 MRYELAPAPWNEQRTLLKVDILAKDRKSEELPASNLVFLIDTSGSMISDERLPLIQSSLK 241 + E+ + L V + +VF+ID SGSM + + SL Sbjct: 324 LFREVKD---GKTCLLAFVTPPTAPDAAAPPAKREVVFVIDNSGSMSGPS-IEQAKQSLA 379 Query: 242 LLVKELREQDNIAIVTYAG---DSRIALPSISGSHKAEINAAIDSLDAEGSTNGGAGLEL 298 L + L D ++ + D L + + ++ + A + L A+G T LE Sbjct: 380 LAISRLTPNDRFNVIRFDDTMTDYFKGLVAATPDNREKAIAYVRGLPADGGTEMLPALED 439 Query: 299 AYQQATKGFIKGGINRILLATDGDFNVGIDDPKSIESMVKKQRESGVTLSTFGVGNSNYN 358 A + G + +++ TDG I + + + + R + T G+G++ N Sbjct: 440 ALRN-QGPVATGALRQVVFLTDGA----IGNEQQLFQEITANR-GDARVFTVGIGSAP-N 492 Query: 359 EAMMVRIADVGNGNYSYIDTLSEAQKVLNSEMRQMLITVAKDVKAQIEFNPAWVTEYRQI 418 M + A++G G ++ I + + + ++ D+ A E Sbjct: 493 TYFMTKAAEIGRGTFTQIGSTDQVASRMGELFAKLQNPAMTDIAANFE------------ 540 Query: 419 GYEKRQLRVEHFNNDNVDAGDIGAGKHITLLFELTLNGQKASID------KLRYAPDNKL 472 + E + D+ +G+ + L EL + ++ + + Sbjct: 541 -----GIAAEDITPNP--MPDLYSGEPVVLTAELPGDKPAGKLEIIGKTGDQPWRVQMDI 593 Query: 473 AKSDKTKELAWLKIRWKYPQGKESQLVEFPLGPTINAPSEDMRFRAAVAA 522 A + ++ L R K E++ E ++ E + + + Sbjct: 594 ANAADGNGISKLWARRKIDD-FEARAYERQDPAGLDKDIETVALAHHLVS 642 >UniRef50_B7RYC9 Vault protein inter-alpha-trypsin n=1 Tax=marine gamma proteobacterium HTCC2148 RepID=B7RYC9_9GAMM Length = 686 Score = 201 bits (512), Expect = 4e-50, Method: Composition-based stats. Identities = 76/488 (15%), Positives = 165/488 (33%), Gaps = 61/488 (12%) Query: 26 KESQQQQPSTPTEQQVLAAQQAAIKEAEQSAAAAKALAQQEVQQYSDKQALQGRLQEAPT 85 + T + V + + + + + + V+ + Q + + P+ Sbjct: 132 ANIAPGEKITVRLEYVQSVEHRSGRFSLRLPTTITPRYMPGVETETANQDMNENVAVTPS 191 Query: 86 FA---RAAKAKATHIANPGTARYQQFDDNPVKQVAQNPLATFSLDVDTGSY-ANVRRFLN 141 + H+ +P Q D P+ ++ S +D G A++ + Sbjct: 192 HGWAWPTDQVTDAHLISPLQYFAQGSDSAPLNRIK------ISARLDMGMPLASIDSPYH 245 Query: 142 QGLLPPPDAV-RVEEIVNYFPSDWDIKDKQSIPASKPIPFAMRYELAPAPWNEQRTLLKV 200 + L V V+ D D + S + A E +Q L + Sbjct: 246 EIALSRRAGVYSVKLAQGSAEMDRDFVLQWSAASGSLPGAAFFTERVD----DQYYGLLM 301 Query: 201 DILAKDRKSEELPASNLVFLIDTSGSMISDERLPLIQSSLKLLVKELREQDNIAIVTYAG 260 + +++ E +VF++DTSGSM + + SL ++ L D ++ + Sbjct: 302 LVPPASQRAAETVPREIVFVVDTSGSM-GGVSIKQAKGSLTRALRHLGPNDRFNVIEFNS 360 Query: 261 DSRIAL---PSISGSHKAEINAAIDSLDAEGSTNGGAGLELAYQQA---TKGFIKGGINR 314 R S + + + L+A G T L+LA + + + + + Sbjct: 361 SHRALFQHAVPASHHNLQLASEYVRHLEASGGTEMMPALQLALKLPGAQDELRPEPALRQ 420 Query: 315 ILLATDGDFNVGIDDPKSIESMVKKQRESGVTLSTFGVGNSNYNEAMMVRIADVGNGNYS 374 ++ TDG + + ++ + G L T G+G++ N M + A+ G G ++ Sbjct: 421 VIFITDGA----VGNESALFEHIVDSL-GGSRLFTVGIGSAP-NAWFMRKAAEYGRGTFT 474 Query: 375 YIDTLSEAQKVLNSEMRQMLITVAKDVKAQIEFNPAWVTEYRQIGYEKRQLRVEHFNNDN 434 YI ++E + +++ + VA + + + Sbjct: 475 YIGDVAEVGEKMDALFLNLTRPVATHLNVDWP---------------------DGVDVWP 513 Query: 435 VDAGDIGAGKHITLLFEL--TLN----------GQKASIDKLRYAPDNKLAKSDKTKELA 482 D+ AG+ + + +L L G+ KL++ + ++ +A Sbjct: 514 ARTPDLYAGEPLLIAVKLGDELPIDGLKIQGLLGETEWSTKLQFPAALNPLSMNNSEGVA 573 Query: 483 WLKIRWKY 490 L R K Sbjct: 574 TLWARKKI 581 >UniRef50_C5EGH1 von Willebrand factor n=2 Tax=Clostridiales RepID=C5EGH1_9FIRM Length = 681 Score = 201 bits (512), Expect = 5e-50, Method: Composition-based stats. Identities = 67/410 (16%), Positives = 134/410 (32%), Gaps = 30/410 (7%) Query: 22 QPENKESQQQQPSTPTEQQVLAAQQAAIKEAEQSAAAAKALAQQEVQQYSDKQALQGRLQ 81 + +E + + + + + + Y++ AL Sbjct: 122 EEAKQEFDAAKSEGKSASLLEQQRPNVFTMNVANIMPGDTV--NIELHYTEMIALSEGSY 179 Query: 82 EAPTFARAAKAKATHIANPGTARYQQFDDNPVKQVAQNPLATFSLDVDTGSYANVRRFLN 141 E F + + + Q+ +P ++ P T+ + V + + Sbjct: 180 EF-VFPAVVGPRYSSPSPDREEDGNQWVASPYQEGGAVPKGTYDIAVSLSTGVPI----- 233 Query: 142 QGLLPPPDAVRVEEIVNYFPSDWDIKDKQSIPASKPIPFAMRYELA--------PAPWNE 193 G++ + +E S I K F +RY+LA E Sbjct: 234 TGIVSSSHKINIE---QSADSSAHITLKDPADYGGNRDFILRYQLAGQTVNSGLMLNTGE 290 Query: 194 QRTLLKVDILAKDRKSEE-LPASNLVFLIDTSGSMISDERLPLIQSSLKLLVKELREQDN 252 + + + +R E +P +F++D SGSM L + ++ +V LRE D Sbjct: 291 KENFFLLMVQPPERVPAEAIPPREYIFVLDVSGSMFGY-PLDTAKELIRNMVSNLRETDT 349 Query: 253 IAIVTYAGDSRIALP---SISGSHKAEINAAIDSLDAEGSTNGGAGLELAYQQATKGFIK 309 ++ ++ D+ + + I+ G T LE A Sbjct: 350 FNLILFSNDAIRMSARSLPATDENVERAINLINRQKGGGGTELAPALEKAVGIPMDSGAG 409 Query: 310 GGINRILLATDGDFNVGIDDPKSIESMVKKQRESGVTLSTFGVGNSNYNEAMMVRIADVG 369 +++ TDG + D ++I +V ++ + +FG+G S N ++ IA G Sbjct: 410 SVSRSVVVITDGY----MSDEQAIFDIVAGNLDT-TSFFSFGIGTS-VNRYLIEGIARTG 463 Query: 370 NGNYSYIDTLSEAQKVLNSEMRQMLITVAKDVKAQIEFNPAWVTEYRQIG 419 G + SE+ + V DV+ + A+ E I Sbjct: 464 GGESFVVTDSSESADTARLFDTYIQSPVLTDVQVDYDGFDAYDVEPTAIP 513 >UniRef50_B0UK93 LPXTG-motif cell wall anchor domain protein n=1 Tax=Methylobacterium sp. 4-46 RepID=B0UK93_METS4 Length = 761 Score = 201 bits (510), Expect = 9e-50, Method: Composition-based stats. Identities = 80/519 (15%), Positives = 160/519 (30%), Gaps = 55/519 (10%) Query: 20 GPQPENKESQQQQPSTPTEQQVLAAQQAAIKEAEQSAAAAKALAQQEVQQYSD------- 72 + + + + + + + + + Q Q Sbjct: 170 AREAARTAYEAARETGRAAALTEQERPNLFTTSVANIGPGETVLVQIAFQQPVRLSGGTH 229 Query: 73 ----KQALQGRLQEAPTFARAAKAKATHIANPGTARYQQFDDNPVKQVAQNPLATFSLDV 128 + R AP + A P AR +P NP+ T ++ + Sbjct: 230 ALRLPLVVAPRYSPAPGLLQPAAEGPARDPVPDRARIAPPVLDPAVHGPVNPV-TLTVTL 288 Query: 129 DTGSY-ANVRRFLNQGLLPP--PDAVRVEEIVNYFPSDWDIKDKQSIPASKPIPFAMRYE 185 G V + + PD+ RV P+D D S + E Sbjct: 289 RAGFPLGTVESATHAIRVEETGPDSRRVTLADGPVPADRDFALTWRAAPSAAPAVGLFRE 348 Query: 186 LAPAPWNEQRTLLKVDILAKDRKSEELPASNLVFLIDTSGSMISDERLPLIQSSLKLLVK 245 + L + + ++ + F+ID SGSM + ++SL + + Sbjct: 349 RV-----GEDEYLLAVVTPPEGRAPARRPREVTFVIDNSGSMAGAS-MRQAKASLLVALD 402 Query: 246 ELREQDNIAIVTYAGDSRIALP---SISGSHKAEINAAIDSLDAEGSTNGGAGLELAYQQ 302 L D ++ + + P +H+ + +L+A G T L A Sbjct: 403 RLGPADRFNVIRFDDTMDLLFPAPVPADEAHRDAARRFVAALEARGGTEMLPPLRAALAD 462 Query: 303 ATKGFIKGGINRILLATDGDFNVGIDDPKSIESMVKKQRESGVTLSTFGVGNSNYNEAMM 362 + +I+ TDG I + + I S + R L G+G++ N +M Sbjct: 463 -PHPEEGDRVRQIVFLTDGA----IGNEEQIFSAISAGRGRS-RLFMIGIGSAP-NGHLM 515 Query: 363 VRIADVGNGNYSYIDTLSEAQKVLNSEMRQMLITVAKDVKAQIEFNPAWVTEYRQIGYEK 422 A++G G+Y+ I T+ + + + ++ V D+ A P R Sbjct: 516 THAAELGGGSYTAIGTIDQVAERTAELLAKLESPVVTDLAAAFS-EPGVEATPRL----- 569 Query: 423 RQLRVEHFNNDNVDAGDIGAGKHITLLFELTLNGQ----KASIDKLRYAPDNKLAKSDKT 478 D+ G+ + L L + I + + LA++ + Sbjct: 570 --------------LPDLYRGEPVVLAARLREATGTLTLRGRIGEAPWQQVLTLAEAREG 615 Query: 479 KELAWLKIRWKYPQGKESQLVEFPLGPTINAPSEDMRFR 517 ++ L R K + + ++L +A + Sbjct: 616 SGISKLWARAKIGEAETARLTGRMSAEAADAAILRLALA 654 >UniRef50_A6GAI6 Putative uncharacterized protein n=1 Tax=Plesiocystis pacifica SIR-1 RepID=A6GAI6_9DELT Length = 560 Score = 200 bits (509), Expect = 9e-50, Method: Composition-based stats. Identities = 88/450 (19%), Positives = 165/450 (36%), Gaps = 45/450 (10%) Query: 129 DTGSYANVRR-FLNQGLLPPPDA-VRVEEIVNYFPSDWDIKDKQSIPASKPIPFAMRYEL 186 S VR LN G +R E +NY+ D+D A + Sbjct: 137 SMSSPVQVRDWVLNYGGNSLSGFPIRTWEFMNYYGFDYD------PAADGELSVYAAMNP 190 Query: 187 APAPWNEQRTLLKVDILAKDRKSEELPASNLVFLIDTSGSMISDERLPLIQSSLKLLVKE 246 +E R +++ + ++ EE P N+ ++DTSGSM + L++ + + + + Sbjct: 191 IEGEGDEARFQMQIGVASELMTPEERPPMNVTLVLDTSGSMAG-TPIELLRETSRAIAAQ 249 Query: 247 LREQDNIAIVTYAGDSRIALP--SISGSHKAEINAAIDSLDAEGSTNGGAGLELAYQQAT 304 L+ D ++I + + L +++G + + I+ + G TN GLE Y+ A Sbjct: 250 LKLGDTVSICEWDTSNDWTLAGYAVTGPNDELLLEKINDVVHGGGTNLYGGLESGYELAQ 309 Query: 305 KGFIKGGINRILLATDGDFNVGIDDPKSIESMVKKQRESGVTLSTFGVGN-SNYNEAMMV 363 + INR++L +DG N GI D I G+ L GV + +YN+ +M Sbjct: 310 MVYDPDAINRLVLISDGGANAGITDLDLIAENAAYGGSDGIYLVGVGVDDPDDYNDELMD 369 Query: 364 RIADVGNGNYSYIDTLSEAQKVLNSEMRQMLITVAKDVKAQIEFNPAWVTEYRQIGYEKR 423 + D G G ++ + E ++ A++V+ Q++ P G+E Sbjct: 370 AVTDAGKGASVFMPSEEEVWTTFGDNFESVMAIAAREVQVQLDMPP---------GFEVV 420 Query: 424 QLRVEHFNNDNVDAGDIGAGKHITLLFELTLNGQKASIDKLRYAPDNKLAKSDKTKEL-A 482 + E +G E+ + + Y A ++ Sbjct: 421 KFSGEEI-----------SGDPK----EVEPQNLAPNDTMVFYQQVETCAPDLAGEDAEV 465 Query: 483 WLKIRWKYPQGKESQLV--EFPLGPTINAPSEDMRFRAAVAAYGQKLRG------SEYLN 534 + + W P ESQ + + +G + AA+ AY L+ ++ Sbjct: 466 TVTVTWDDPWTFESQELAQTWTIGELTGMDQALLLKGAAILAYTDALKAFKQAYTNDQKA 525 Query: 535 NTSWQQIKQWAQQAKGEDPQGYRAEFIRLI 564 + A D G E +++ Sbjct: 526 AALQPALDALALAQTANDTDGDLIEIGQIL 555 >UniRef50_Q237Q6 von Willebrand factor type A domain containing protein n=1 Tax=Tetrahymena thermophila SB210 RepID=Q237Q6_TETTH Length = 713 Score = 200 bits (509), Expect = 1e-49, Method: Composition-based stats. Identities = 67/386 (17%), Positives = 142/386 (36%), Gaps = 35/386 (9%) Query: 199 KVDILAKDRKSEELPASNLVFLIDTSGSMI--------------SDERLPLIQSSLKLLV 244 +V I K + ++++ ++D SGSM L +++ SL +V Sbjct: 48 QVRIQILSPKGKSKVSNSICCVVDVSGSMGSRAVTKQSGGNSELGYSVLDIVKHSLNTIV 107 Query: 245 KELREQDNIAIVTYAGDSRIALP--SISGSHKAEINAAIDSLDAEGSTNGGAGLELAYQQ 302 + L E D ++VT++ +S++ ++ S+ I+ + STN AG+E +Q Sbjct: 108 QNLDEGDEFSMVTFSDNSKLVCNYQQMTESNIKSSVDLINQCQPDASTNIWAGIEQGLEQ 167 Query: 303 ATKGFIKGGINRILLATDGDFNVGIDDPKSIESMVKKQRESGV-----TLSTFGVGNSNY 357 K ++++ TDG NV + P+ I + + + +++TFG G Sbjct: 168 MQNDSNKNKNQQLIVLTDGQPNV--NPPRGILTTLNNFYNKNIISPKPSINTFGFGYY-L 224 Query: 358 NEAMMVRIADVGNGNYSYIDTLSEAQKVLNSEMRQMLITVAKDVKAQI-EFNPAWVTEYR 416 + ++ IA G YS+I S + + + M T A + N Sbjct: 225 DSHLLFNIAQDCQGIYSFIPDSSFVGTIFTNSIASMQSTFATNAVLVFKPLNKNAQLNLS 284 Query: 417 QIGYEKRQLRVEHFNNDNVDAGDIGAGKHITLLFELTLNGQKASIDKLRYAPDNKLAKSD 476 QI + + V+ G++ + ++F+ L+ + + K + Sbjct: 285 QI-KSNFKTYLNKEGEYIVELGNLFFDQSKDIIFQQDLHSELLRDFSVEVKYYTKDTNNF 343 Query: 477 KTKELAWL--KIRWKYPQGKESQLVEFPLGPTINAPSEDMRFRAAVAAY------GQKLR 528 + +I E + + + T+ + + + + R Sbjct: 344 QLSHRMHKAEQIHNANKLEFEQEALRLEVVKTVQQCKDSSQKSQKLVEDTINLLKASQFR 403 Query: 529 GSEYLNNTSWQQIKQWAQQAKGEDPQ 554 EY+ N + ++ +QA D Sbjct: 404 EDEYIQNL-CKDMEDQVKQAVSRDDY 428 >UniRef50_Q8YW34 All1782 protein n=5 Tax=Cyanobacteria RepID=Q8YW34_ANASP Length = 615 Score = 199 bits (506), Expect = 2e-49, Method: Composition-based stats. Identities = 74/370 (20%), Positives = 141/370 (38%), Gaps = 11/370 (2%) Query: 206 DRKSEELPASNLVFLIDTSGSMISDERLPLIQSSLKLLVKELREQDNIAIVTYAGDSRIA 265 + NL +ID SGSM + + +V +L +D +++V Y Sbjct: 30 EIPESPRRNLNLSLVIDRSGSMAGAAL-HHALKAAESVVDQLEPKDILSVVVYDDAVDTV 88 Query: 266 LPSISGSHKAEINAAIDSLDAEGSTNGGAGLELAYQQATKGFIKGGINRILLATDGDFNV 325 +P + K + +I + A G TN G + INR+LL TDG N+ Sbjct: 89 VPPQPVTDKPALKKSIRQVRAGGITNLSGGWLKGCEYVKHQLDPQKINRVLLLTDGHANM 148 Query: 326 GIDDPKSIESMVKKQRESGVTLSTFGVGNSNYNEAMMVRIADVGNGNYSYIDTLSEAQKV 385 GI DPK + + ++ E G+T +T G +NE +++ +A NGN+ +I ++ EA +V Sbjct: 149 GIQDPKILTATSTQKAEEGITTTTLGFAQG-FNEDLLIGMARAANGNFYFIQSIDEAAEV 207 Query: 386 LNSEMRQMLITVAKDVKAQIEFNPAWVTEYRQIGYEKRQLRVEHFNNDNVDAGDIGAGKH 445 + E+ + V +++K +E ++ + G++ G+ Sbjct: 208 FSIELDSLRSVVGQNLKVTLELADGITLVDTL---SLAKVSQNEAGQPVITLGELYEGED 264 Query: 446 ITLLFELTLNGQKASIDKLRYAPDNKLAKSDKTKELAWLKIRWKYPQGKESQLVEFPLGP 505 L L+L A + +L + A + + + K + E L Sbjct: 265 K--LLGLSLMISSAQVGELPLMRLHYSADVVQNDIIQSVSGT-TDVITKVGTVEESALAS 321 Query: 506 TINAPSEDMRFRAAVA-AYGQKLR--GSEYLNNTSWQQIKQWAQQAKGEDPQGYRAEFIR 562 + + + R A A +L G + + + Q+ + + E + Sbjct: 322 SSHIILDLSRLTIAKAKETALELAEHGQHQAAEKTLRDLVQYLRDQGLNENFEIAEEIDQ 381 Query: 563 LIELADGVTD 572 L A + Sbjct: 382 LEYFAGRIAQ 391 >UniRef50_B0SHY6 Anti-sigma factor antagonist n=2 Tax=Leptospira biflexa serovar Patoc RepID=B0SHY6_LEPBA Length = 550 Score = 199 bits (506), Expect = 2e-49, Method: Composition-based stats. Identities = 73/315 (23%), Positives = 129/315 (40%), Gaps = 16/315 (5%) Query: 194 QRTLLKVDILAKDRKSEELPASNLVFLIDTSGSMISDERLPLIQSSLKLLVKELREQDNI 253 + LL + EE + ID S SM E++ + + LV L D + Sbjct: 21 ENHLLLRFRTPANPNVEERKPLVIGLAIDKSWSMKG-EKMEAVIDASCALVNWLTRHDAV 79 Query: 254 AIVTYAGDSRIALPSISGSHKAEINAAIDSLDAEGSTNGGAGLELAYQQATKGFIKGGIN 313 +IV Y+ D ++ P + K + I ++ STN G A + + I Sbjct: 80 SIVAYSADVQLIQPVTHLTEKVSVTDKIRNIQVATSTNLSGGWLSALKSLNQSKIPNAYK 139 Query: 314 RILLATDGDFNVGIDDPKSIESMVKKQRESGVTLSTFGVGNSNYNEAMMVRIADVGNGNY 373 R+LL TDG+ GI D +++ ++ G++ +T GVGN ++NE M+V IA G GN+ Sbjct: 140 RVLLLTDGNPTSGIKDKEALVTIAADHLSMGISTTTIGVGN-DFNEEMLVEIAKAGGGNF 198 Query: 374 SYIDTLSEAQKVLNSEMRQMLITVAKDVKAQIEFNPAWVTEYRQIGYEKRQLRVEHFNN- 432 YID A + E + A+ + +++ P +Q+ E +E F+ Sbjct: 199 YYIDNPENASDIFFEEFGDIGALYAQAIDVELQLAPG--VRLKQVLSETSHQVMEEFDEF 256 Query: 433 -----------DNVDAGDIGAGKHITLLFELTLNGQKASIDKLRYAPDNKLAKSDKTKEL 481 N+ GD+ A L+ L ++ + + + K L Sbjct: 257 LGDAKTISRQKINLQLGDLRADDLRNLVLRLEIDDRVNETNTPFCEVNVSYYNLLKQNVL 316 Query: 482 AWLKIRWKYPQGKES 496 +K + + +GK + Sbjct: 317 ESVKQTFSFERGKNT 331 >UniRef50_A0C051 Chromosome undetermined scaffold_14, whole genome shotgun sequence n=3 Tax=Paramecium tetraurelia RepID=A0C051_PARTE Length = 636 Score = 199 bits (505), Expect = 3e-49, Method: Composition-based stats. Identities = 52/309 (16%), Positives = 123/309 (39%), Gaps = 11/309 (3%) Query: 157 VNYFPSDWDIKDKQSIPASKPIPFAMRYELAPAPWNEQRTLLKVDILAKDRKSEELPASN 216 N D + K + + + + P L + + + + Sbjct: 104 QNTNKYDLNEKLYFEVRSLYKMGKLLNSRTQYLPGIVSIKALDQAVT--QNQKNQRVGVD 161 Query: 217 LVFLIDTSGSMISDERLPLIQSSLKLLVKELREQDNIAIVTYAGDSRIALPS--ISGSHK 274 L+ LID SGSMI ++ ++++SL +L++ L + D + ++T+ D+ P ++ +K Sbjct: 162 LICLIDISGSMIG-VKIEMVKASLIVLLQFLGDNDRLQLITFDNDAHRLTPLKTVTNQNK 220 Query: 275 AEINAAIDSLDAEGSTNGGAGLELAYQQATKGFIKGGINRILLATDGDFNVGIDDPKSIE 334 + I + A G ++A+ Q + + L +DG + I+ Sbjct: 221 SYFTQIIKQIKANGGNRISEATKMAFYQLKSRKYINNVTSVFLLSDGVDYTYPEVKNQIQ 280 Query: 335 SMVKKQRESGVTLSTFGVGNSNYNEAMMVRIADVGNGNYSYIDTLSEAQKVLNSEMRQML 394 ++ TL TFG G +++ MM ++ ++ +G++ ++ ++ + + ++ Sbjct: 281 TV-----NEVFTLHTFGFGE-DHDAQMMTQLCNLKSGSFYFVQDVTLLDEFFADALGGLI 334 Query: 395 ITVAKDVKAQIEFNPAWVTEYRQIGYEKRQLRVEHFNNDNVDAGDIGAGKHITLLFELTL 454 V + ++ + + + QI + + N + I +G +FEL L Sbjct: 335 SVVGEQLEITLSSSAPPPYQDIQISKTYGNMWQKKGNQYYITQPQIASGSRKDYVFELAL 394 Query: 455 NGQKASIDK 463 + I+ Sbjct: 395 PKFEGKIED 403 >UniRef50_Q7S708 Predicted protein n=1 Tax=Neurospora crassa RepID=Q7S708_NEUCR Length = 766 Score = 198 bits (503), Expect = 5e-49, Method: Composition-based stats. Identities = 70/451 (15%), Positives = 149/451 (33%), Gaps = 62/451 (13%) Query: 178 IPFAMRYELAPAPWNEQRTLLKVDILAKDRKSEE----LPASNLVFLIDTSGSM------ 227 P A + E+ P P + LL+V + ++V ID SGSM Sbjct: 28 RPVAPKLEIHPLPSHTSGLLLRVIPPRSPPNLPDPNFHHVPCDIVLAIDVSGSMSADAPV 87 Query: 228 ---------------ISDERLPLIQSSLKLLVKELREQDNIAIVTYAGDSRIALP--SIS 270 L L++ + + +V L D + IVT++ ++++ P ++ Sbjct: 88 PTTASADYTNEQPEHNGLSVLDLVKHAARTIVSTLNSSDRLGIVTFSTEAKVLQPLMPMT 147 Query: 271 GSHKAEINAAIDSLDAEGSTNGGAGLELAYQQATKGFIKGGINRILLATDGDFNVGIDDP 330 +K + + + +TN G+ + G + +++ TDG N + Sbjct: 148 ALNKKKTERNLGGMQPFSATNLWGGIVEGLKLFDGQ--SGRMPALMVLTDGMPNH-MCPA 204 Query: 331 KSIESMVKKQRESGVTLSTFGVGNSNYNEAMMVRIADVGNGNYSYIDTLSEAQKVLNSEM 390 + + ++ + TFG G S ++ +A++G G YS+I V + Sbjct: 205 QGYVAKLRAMETLPAAIHTFGFGYS-LRSGLLKSVAEIGGGGYSFIPDAGMIGTVFVHSV 263 Query: 391 RQMLITVAKDVKAQIEFNP---------AWVTEYRQIGYEKRQLRVEHFNNDNVDAGDIG 441 + T A +V ++ + V + + EK + + ++ + Sbjct: 264 ANLQSTFANNVVLRLTYPKYLGLEETTGESVDKVESVQLEKGDVDPDSSMQLTLNLSTLQ 323 Query: 442 AGKHITLLFELTLNGQKASIDKLRYAPDNKLAKSDKTKELAWLKIRWKYPQGKESQLVEF 501 G+ + Q+A D + + + + + E + Sbjct: 324 YGQSRDIFLRYDSKAQEAIADGFDFESPPSVLATLDYQHFTNI----TNTVVSECNDIFR 379 Query: 502 PLGPTINAPSEDMRF---RAAVAAY--------------GQKLRGSEYLNNTSWQQIKQW 544 P + R+A+ ++ + + S + ++ Sbjct: 380 PNPQVKQLTPAQTAYHISRSALISFLFSLYTLRPDREHQPRFFKDSFATSLQTFLSTLTA 439 Query: 545 AQQAKGEDPQGYRAEFIRLIELADGVTDISQ 575 AQ A DP R+ LI + D +Q Sbjct: 440 AQPAFASDPHC-RSLVQDLIGSTSAINDANQ 469 >UniRef50_A5UTA6 von Willebrand factor, type A n=6 Tax=Chloroflexi (class) RepID=A5UTA6_ROSS1 Length = 425 Score = 198 bits (503), Expect = 5e-49, Method: Composition-based stats. Identities = 73/369 (19%), Positives = 139/369 (37%), Gaps = 27/369 (7%) Query: 214 ASNLVFLIDTSGSMISDERLPLIQSSLKLLVKELREQDNIAIVTYAGDSRIALPSISGSH 273 NL ++D S SM + + ++ + +V +L D ++V + + + +P+ Sbjct: 44 PLNLCLVLDRSSSMRGERLM-QVKEAAARIVDQLGPDDYFSLVVFNDRADVVIPAQRAIK 102 Query: 274 KAEINAAIDSLDAEGSTNGGAGLELAYQQATKGFIKGGINRILLATDGDFNVGIDDPKSI 333 K+++ AAI ++A G T GL LA Q+ + F+ GI+R++L TDG D Sbjct: 103 KSDLKAAIAQIEAAGGTEMAQGLALALQEVQRPFLTRGISRLILLTDGRT---YGDESRC 159 Query: 334 ESMVKKQRESGVTLSTFGVGNSNYNEAMMVRIADVGNGNYSYIDTLSEAQKVLNSEMRQM 393 + ++ + G+ L+ G+G +NE ++ + N YI T + KV E++++ Sbjct: 160 VEIARRGQSRGIGLTALGIGT-EWNEDLLETMTASENSRAQYIATAQDVVKVFADEVKRL 218 Query: 394 LITVAKDVKAQIEFNPAW-VTEYRQIGY--EKRQLRVEHFNNDNVDAGDIGAGKHITLLF 450 A+ V +E P + Q+ + E + GD L Sbjct: 219 HAIFAQQVHLSLETRPGALIRSLDQVRPFIAPITIVEEEERRWAANLGDWPDTGVQGFLI 278 Query: 451 ELTLNGQKASIDK-----LRYAPDNKLAKSDKTKELAWLKIRWKYPQGKES--------- 496 E+ + LRY + +EL + +R Sbjct: 279 EVVVPALPVGDYPVLKLTLRYHLPAANLRDQVREELIRISMRPAAEVTHRVDATVKHWLE 338 Query: 497 QLVEFPLGPTINAPSEDMRFRAA-----VAAYGQKLRGSEYLNNTSWQQIKQWAQQAKGE 551 +LV + L + + + R A +A G L +T Q+ + + Sbjct: 339 RLVAYRLQASAWKHAAEGRLAEASERLHMAGTRMLNVGDTALAHTLQQEATRILRSGAAS 398 Query: 552 DPQGYRAEF 560 + R F Sbjct: 399 EEGRKRIRF 407 >UniRef50_C5GK44 U-box domain-containing protein n=2 Tax=Ajellomyces dermatitidis RepID=C5GK44_AJEDR Length = 766 Score = 198 bits (503), Expect = 5e-49, Method: Composition-based stats. Identities = 59/354 (16%), Positives = 121/354 (34%), Gaps = 36/354 (10%) Query: 164 WDIKDKQSIPASKPIPFAMRYELAPAPWNEQRTLLKVDILAKDRKSEELPASNLVFLIDT 223 + A + P + +L P P +L V K ++V ID Sbjct: 24 RSRPSTGTNIAGERNPNEVAVQLHPLPDTNSM-ILSVHPPLHPEKELRHVPCDIVLCIDI 82 Query: 224 SGSMISD-----------------ERLPLIQSSLKLLVKELREQDNIAIVTYAGDSRIA- 265 S SM S L L + + + +++ L + D + +V ++ D+ + Sbjct: 83 SYSMSSSAPLPTTDDSGKPEDTGLSVLDLTKHAARTIIETLNDNDRLGVVAFSTDAEVVY 142 Query: 266 -LPSISGSHKAEINAAIDSLDAEGSTNGGAGLELAYQQATK-GFIKGGINRILLATDGD- 322 + +++ +K A+++L STN GL+L+ + + I + + + TDG Sbjct: 143 KISNMNEDNKKAALKAVEALWPLSSTNLWHGLKLSLEALEEVTPIPQNVQALYILTDGMY 202 Query: 323 -------FNVGIDDPKSIESMVKKQRESG--VTLSTFGVGNSNYNEAMMVRIADVGNGNY 373 + + +S V K + + TFG G ++ I++VG G Y Sbjct: 203 RIVRSRVPHANASKFRHAKSYVSKAGQKDRLPMIHTFGFGYY-IRSGLLQAISEVGGGTY 261 Query: 374 SYIDTLSEAQKVLNSEMRQMLITVAKDVKAQIEFNPAWVTEYRQIGYEKRQLRVEHFNND 433 S+I V + + T A I + + + L E + Sbjct: 262 SFIPDAGMIGTVFVHAIANLYTTFATQAMISIRTSGSVEIAQDEGSKTGLGLYEESTEDG 321 Query: 434 --NVDAGDIGAGKHITLLFELT--LNGQKASIDKLRYAPDNKLAKSDKTKELAW 483 V G + G+ ++ + + + L Y ++++ Sbjct: 322 ALAVTVGSLQYGQSRDVIIRMKNATSKPSTAQATLTYNFQGNAKSVASSEQILS 375 >UniRef50_Q6PGW2 Zgc:112265 protein (Fragment) n=15 Tax=Clupeocephala RepID=Q6PGW2_DANRE Length = 927 Score = 198 bits (502), Expect = 7e-49, Method: Composition-based stats. Identities = 61/298 (20%), Positives = 116/298 (38%), Gaps = 15/298 (5%) Query: 119 NPLATFSLDVD-----TGSYANVRRFLNQGLL---PPPDAVRVEEIVNYFPS-DWDIKDK 169 P+A F +DV S+ V+ LN G L + V ++P+ D K Sbjct: 169 QPVADFKIDVHIQENPGISFLEVKGDLNTGDLASAVKTTRADKDAWVTFYPTRDQQTKCT 228 Query: 170 QSIPASKPIPFAMRYELAPA-PWNEQRTLLKVDILAKDRKSEELPASNLVFLIDTSGSMI 228 + Y++ P E + + N+VF+ID SGSM Sbjct: 229 NCAENGLNGDLIITYDVNRGNPKGEVQISNGYFVHYFAPSDVPHIPKNVVFIIDRSGSMH 288 Query: 229 SDERLPLIQSSLKLLVKELREQDNIAIVTYA---GDSRIALPSISGSHKAEINAAIDSLD 285 ++ +S+L ++K+L E D+ ++T+ R L + +++ + + + Sbjct: 289 GR-KIRQTRSALLTILKDLDEDDHFGLITFDAEIDFWRRELLQATKANRENAESFVKRIQ 347 Query: 286 AEGSTNGGAGLELAYQQATKGFIKGGINRILLATDGDFNVGIDDPKSIESMVKKQRESGV 345 G+TN + + KG + ++L TDGD G + + I + VK+ S Sbjct: 348 DRGATNINDAVLAGVDMINRNPRKGTASILILLTDGDPTAGETNIEKIMANVKEAIGSKF 407 Query: 346 TLSTFGVGNSNYNEAMMVRIADVGNGNYSYIDTLSEAQKVLNSEMRQMLITVAKDVKA 403 L G G + N + +++ N I S+A L ++ + + D++ Sbjct: 408 PLYCLGFG-YDVNFDFLTKMSLENNAVARRIYEDSDADIQLQGFYDEVAVPLLTDIQL 464 >UniRef50_B6HQM8 Pc22g11730 protein n=17 Tax=Leotiomyceta RepID=B6HQM8_PENCW Length = 1029 Score = 197 bits (500), Expect = 1e-48, Method: Composition-based stats. Identities = 61/329 (18%), Positives = 130/329 (39%), Gaps = 29/329 (8%) Query: 214 ASNLVFLIDTSGSMISDERLPLIQSSLKLLVKELREQDNIAIVTYAGDSR-IALPSISGS 272 +LV +I S SM ++ L++ +LK LV+ L +D + +VT+ + L ++ Sbjct: 513 PLDLVVVIPVSSSMQG-LKITLLRDALKFLVQNLGPRDRMGLVTFGSSGGGVPLVGMTTK 571 Query: 273 HKAEINAAIDSLDAEGS----TNGGAGLELAYQQATKGFIKGGINRILLATDGDFNVGID 328 A + ++S+ G + G +A + ++ ILL +D I Sbjct: 572 SWAGWSKILESIRPVGQKSLRADVVEGANVAMDLLMQRKFNNPVSTILLISD----SSIS 627 Query: 329 DPKSIESMVKKQRESGVTLSTFGVGNSNYNEAMMVRIADVGNGNYSYIDTLSEAQKVLNS 388 DP+S++ +V + + VT+ +FG+G + + M+ ++ G+YSY+ ++ + Sbjct: 628 DPESVDFVVSRAEAAKVTIHSFGLGLT-HKPDTMIELSTRTKGSYSYVKDWMMLRECVAG 686 Query: 389 EMRQMLITVAKDVKAQIEFNPAWVTEYRQIGYEKRQLRVEHFNNDNVDAGDIGAGKHITL 448 + + T ++VK ++ ++ +I + + GD+ G + Sbjct: 687 CLGALQTTSHQNVKLKLRLPEGSPAKFVKISGALHTTKRATGKDAEAALGDLRFGDKRDV 746 Query: 449 LFELTLNGQKASIDKLRYAPDNKLAKSDKTKELAWLKIRWKYPQGKESQLVEFPLGPTIN 508 L +L + A+ D + P L + G E +++ Sbjct: 747 LVQLVIQPDNATQDNMPQDPWESLVSGLEALGGCS--------DGDEGRVLSV------- 791 Query: 509 APSEDMRFRAAVAAYGQKLRGSEYLNNTS 537 E++ A YG LR ++ Sbjct: 792 ---EEVPLIQADLTYGDLLRDGHLTHSPR 817 >UniRef50_D2VKS7 von Willebrand factor type A domain-containing protein n=2 Tax=Naegleria gruberi RepID=D2VKS7_NAEGR Length = 923 Score = 195 bits (496), Expect = 4e-48, Method: Composition-based stats. Identities = 56/281 (19%), Positives = 112/281 (39%), Gaps = 14/281 (4%) Query: 139 FLNQGLLPPPDAVRVEEIVNYFPSDWDIKDKQSIPASKPIPFAMRYELAPAPWNEQ-RTL 197 ++ ++P + + S I F E P+ + Sbjct: 611 PIDHAVIPSTSNTSLTNSSQQDNQQEQPTIMNTRSTSNKIVFNGHCEYEAIPFETECDLY 670 Query: 198 LKVDILAK---DRKSEELPASNLVFLIDTSGSMISDERLPLIQSSLKLLVKELREQDNIA 254 + + +E +LV ++D SGSM ++L +++S+L +V +L+E+D +A Sbjct: 671 CMATLQGPCFEQQAQKERKGVDLVLVVDKSGSMAG-QKLDMVKSTLSFMVDQLKEKDRVA 729 Query: 255 IVTYAGDSRI--ALPSISGSHKAEINAAIDSLDAEGSTNGGAGLELAYQQATKGF-IKGG 311 IV + + L + K + ++ TN L + + K Sbjct: 730 IVEFDTQVKTNLDLTKMDIEGKKKAKQVSSAISPGSCTNLSGALFTSLKLLASRQQEKNE 789 Query: 312 INRILLATDGDFNVGIDDPKSIESMVKKQRES-----GVTLSTFGVGNSNYNEAMMVRIA 366 + ++L TDG N G+ I ++ + VT+ TFG G + + M+ IA Sbjct: 790 VTSVILFTDGLANRGLISTNEILQNMQDLMDELLSTSNVTIHTFGFGQ-DTDANMLTSIA 848 Query: 367 DVGNGNYSYIDTLSEAQKVLNSEMRQMLITVAKDVKAQIEF 407 GNG Y Y++T + K + + ++ V +++K +I+ Sbjct: 849 QKGNGLYDYLETADDIPKAFGNVIGNLVSVVGQNIKIRIQP 889 >UniRef50_B8BII0 Putative uncharacterized protein n=1 Tax=Oryza sativa Indica Group RepID=B8BII0_ORYSI Length = 585 Score = 195 bits (495), Expect = 5e-48, Method: Composition-based stats. Identities = 77/378 (20%), Positives = 131/378 (34%), Gaps = 51/378 (13%) Query: 188 PAPWNEQRTLLKVDILAKDRKSEELPASNLVFLIDTSGSM------ISDERLPLIQSSLK 241 P NE+R V + E +LV ++D SGSM RL L++ ++K Sbjct: 19 AIPSNEERKEWPVLVHVVAPAKTERFPIDLVAVLDVSGSMTKATSMHGWTRLDLVKGAMK 78 Query: 242 LLVKELREQDNIAIVTYAGDS----RIALPSISGSHKAEINAAIDSLDAEGSTNGGAGLE 297 ++ +L D +AIV + G L ++ +A+ NA ++ L A G T L+ Sbjct: 79 MVTNKLGAGDRLAIVPFNGKVVAAGATRLMEMTTKGRADANAKVNQLKAGGDTKFLPALK 138 Query: 298 LAYQQATKGFIKGGINR---ILLATDGDFNVGIDDPKSIESMVKKQRESGVTLSTFGVGN 354 A R I L +DG N +DD K TFG+ Sbjct: 139 HASGLLDSRPAGDKQYRPGFIFLLSDGQDNGVLDD---------KLGGVRYPAHTFGMCQ 189 Query: 355 SNYNEAMMVRIADVGNGNYSYIDT-LSEAQKVLNSEMRQMLITVAKDVKAQIEFNPAWVT 413 S N MV IA G+Y ID LS + L + + VA + + Q+ Sbjct: 190 SRCNPKSMVHIATATKGSYHPIDDKLSNVAQALAVFLSGITSAVAVNARVQLHVADNSGV 249 Query: 414 EYRQIGYEKRQLRVEHFNN-----DNVDAGDIGAGKHITLLFELTLNGQKASIDKLRYAP 468 +I +E N ++ G + A + + L + Sbjct: 250 LINKIDSGAYDKTIESGNGKASSKGTINVGVLSAEEDKKFIVYLDVP------------- 296 Query: 469 DNKLAKSDKTKELAWLKIRWKYPQGKESQLVEFPLGPTINAPSEDMRFRAAVAAYGQKLR 528 + A++ + L + + P G + VE ++ A G + Sbjct: 297 KLENAQAKPPQLLLTVAGEYSTPAGG--RKVENMEESSVQVERP--------APAGGATK 346 Query: 529 GSEYLNNTSWQQIKQWAQ 546 ++L S + + + Sbjct: 347 TGDHLVTWSEAVMVEMVR 364 >UniRef50_Q15NW6 Vault protein inter-alpha-trypsin n=1 Tax=Pseudoalteromonas atlantica T6c RepID=Q15NW6_PSEA6 Length = 701 Score = 195 bits (495), Expect = 5e-48, Method: Composition-based stats. Identities = 80/504 (15%), Positives = 173/504 (34%), Gaps = 59/504 (11%) Query: 24 ENKESQQQQPSTPTEQQVLAAQQAAIKEAEQSAAAAKALAQQEVQQYSDKQALQGRLQEA 83 E ++Q ++ S + + + + QQ V +L+ + Sbjct: 98 EQAKTQGRKASLIEQHRPNLFTNTIANIGPNESVSITIEYQQVVGFDEQTFSLRFPMTIT 157 Query: 84 PTFARAAK-AKATHIANPGTARYQQFDD-NPVKQVAQNPLATFSLDVDTGS-YANVRRFL 140 P ++ K+T Q + + A P L V+ S +A + Sbjct: 158 PRYSPNNATDKSTVTTVNTQGWGQSVTAISQQIKTADEPANPIRLSVELDSGFALTADDI 217 Query: 141 NQGLLPPPDAVRVEEIVNYFPSDWDIKDKQSIPASKPIPFAMRYELAPAPWNEQ------ 194 P + S + I+ Q A++ + L+ AP Sbjct: 218 TSEHHPINIS-----QQGEKNSGYHIELAQEHIANQDFALTWQPALSDAPSAAHFSETQG 272 Query: 195 -RTLLKVDILAKDRKS---------EELPASNLVFLIDTSGSMISDERLPLIQSSLKLLV 244 V + + + +++P+ +VFL+DTSGSM + + + ++ + Sbjct: 273 KYRYGLVMLTPPVQDAYHSTGGAVAQQMPSREVVFLLDTSGSMAGES-IVQAKRAVDFAL 331 Query: 245 KELREQDNIAIVTYAGDSRIALP---SISGSHKAEINAAIDSLDAEGSTNGGAGLELAYQ 301 +LR +DN+ I+ + + + H + SL A+G T L LA Sbjct: 332 TQLRPEDNVNIIQFNDAPQALWKRAMPATAKHIQRARNWVASLHADGGTEMAPALTLALN 391 Query: 302 QATKGFIKG------GINRILLATDGDFNVGIDDPKSIESMVKKQRESGVTLSTFGVGNS 355 + + + +++ TDG + + ++ S+++ + L T G+G++ Sbjct: 392 KPSLHRDDSDLLGSHKLRQVVFITDG----SVSNEDALMSLIESKLADN-RLFTIGIGSA 446 Query: 356 NYNEAMMVRIADVGNGNYSYIDTLSEAQKVLNSEMRQMLITVAKDVKAQI----EFNPAW 411 N M + A G G ++YI + + Q + + ++ V +D+ + EF P+ Sbjct: 447 P-NSYFMTQAAQAGRGTFTYIGDIQQVQHKMTALFNKLTRPVMQDIHIEFARETEFYPSV 505 Query: 412 VTEYR-----QIGYEKRQLRVEHFNNDNVDAGDIGAGKHITLLFELTLNGQKASIDKLRY 466 + + I Y +E+ + ++T ++ + Sbjct: 506 IPDLYAAQPLVIHYRVPVTHLENGMQL----------QQPDNALKVTGWQSARAVSSKPW 555 Query: 467 APDNKLAKSDKTKELAWLKIRWKY 490 + L+ + K L R K Sbjct: 556 SIRMPLSPTTKRAGLGVAWAREKI 579 >UniRef50_Q2GTB7 Putative uncharacterized protein n=1 Tax=Chaetomium globosum RepID=Q2GTB7_CHAGB Length = 777 Score = 194 bits (493), Expect = 7e-48, Method: Composition-based stats. Identities = 71/379 (18%), Positives = 129/379 (34%), Gaps = 38/379 (10%) Query: 175 SKPIPFAMRYELAPAPWNEQRTLLKVDILAKDRKS---EELPASNLVFLIDTSGSMISDE 231 K +++ +L P +R L V I +LV ID SGSM + Sbjct: 30 PKKPSNSLQLQLHPFSSEHERGGLIVKIQPPREPENADLHHVPCDLVLSIDISGSMADEA 89 Query: 232 R-----------------LPLIQSSLKLLVKELREQDNIAIVTYAGDSRIALPSISGSHK 274 + L++ + + +V L +D + IVT+ S++ +P +K Sbjct: 90 PAPSKPGGEAGEDTGLRVIDLVKHAARTIVATLDSRDRLGIVTFTNRSKVGIPP--YENK 147 Query: 275 AEINAAIDSLDAEGSTNGGAGLELAYQQATKGF--IKGGINRILLATDGDFNVGIDDPKS 332 A+ I+S++ STN G+ ++ G + +L+ TDG N + PK Sbjct: 148 AKTLENIESMEPFSSTNMWHGIRDGLSLFSEAEGGSTGRVPALLVLTDGMPNY-MCPPKG 206 Query: 333 IESMVKKQRESGVTLSTFGVGNSNYNEAMMVRIADVGNGNYSYIDTLSEAQKVLNSEMRQ 392 M++ T+ TFG G ++ IA+VG GNYS+I V + Sbjct: 207 YVPMLRSMEPLPATIHTFGFG-YELRSGLLKSIAEVGGGNYSFIPDAGMLGTVFIHAVAH 265 Query: 393 MLITVAKDVKAQIEFNP-------AWVTEYRQIGYEKRQLRVEHFNNDNVDAGDIGAGKH 445 + T A + K ++ + RQ E E + + +I G+ Sbjct: 266 LQSTFANNAKLRLTYPSYLKLEEMTGEAVGRQEPVELEGDVPESMTSLTIPIDNIQYGQS 325 Query: 446 ITLLFELTLNGQKASIDKLRYAPDNKLAKSDKTKELAWLKIRWKYPQGKESQLVEFPLGP 505 + + + N E ++S P Sbjct: 326 RDICLRY----GNLAAAIQKEGGANPPPAITAVLEYQHFSPTVHQAVAQQSPFASTP-SL 380 Query: 506 TINAPSEDMRFRAAVAAYG 524 + + + A +A + Sbjct: 381 HPDEIAYHLSRAALIAFFA 399 >UniRef50_Q22N58 von Willebrand factor type A domain containing protein n=1 Tax=Tetrahymena thermophila SB210 RepID=Q22N58_TETTH Length = 669 Score = 194 bits (492), Expect = 1e-47, Method: Composition-based stats. Identities = 66/314 (21%), Positives = 125/314 (39%), Gaps = 23/314 (7%) Query: 194 QRTLLKVDILAKDRKSEELPASNLVFLIDTSGSMISD---------------ERLPLIQS 238 +T V + E N+V L+D S SM S L L++ Sbjct: 12 SQTKNYVRVSVIPPDDLERHPCNIVCLVDGSLSMGSKLVIHQKNGGKKESDMTTLDLVKH 71 Query: 239 SLKLLVKELREQDNIAIVTYAGDSRIALPS--ISGSHKAEINAAIDSLDAEGSTNGGAGL 296 ++K + L QD +A+V ++ S+I + K ID + A G TN GL Sbjct: 72 TVKTIASSLNPQDRLALVGFSTHSKIYFELTEMDDQGKNVAFTEIDKMWAGGQTNIWGGL 131 Query: 297 ELAYQQATKGFIKGGINRILLATDGDFNV--GIDDPKSIESMVKKQRESGVTLSTFGVGN 354 + + + KGF I L TDG + I + + ++ ++ TFG GN Sbjct: 132 QDSLEVIKKGFRPNQNVCIFLFTDGRPTMIPAIGHVEMLRRWKEQHPAIQFSIFTFGFGN 191 Query: 355 SNYNEAMMVRIADVGNGNYSYIDTLSEAQKVLNSEMRQMLITVAKDVKAQIEFNPAWVTE 414 + + +M+ ++ NG +S+I S V ++ + +L T+A +V ++ + + + Sbjct: 192 -DLDTDLMLELSQEQNGIFSFISDSSMLGTVFSNALANILSTMANNVHLNLQLSEGYTFD 250 Query: 415 YRQIGYEKRQLRVEHFNNDNVD--AGDIGAGKHITLLFELTLNGQKASIDK-LRYAPDNK 471 + + + N +D G I G+ L+ ++ + D + Y K Sbjct: 251 GEGVMQSQSFQAKKTNKNTTLDLNLGLIRYGQTKDLILKILPQQNRKLSDVKITYTLKYK 310 Query: 472 LAKSDKTKELAWLK 485 L D +++ + Sbjct: 311 LFVGDNPQDIETVS 324 >UniRef50_B0TQ23 LPXTG-motif cell wall anchor domain n=1 Tax=Shewanella halifaxensis HAW-EB4 RepID=B0TQ23_SHEHH Length = 850 Score = 194 bits (492), Expect = 1e-47, Method: Composition-based stats. Identities = 99/510 (19%), Positives = 169/510 (33%), Gaps = 75/510 (14%) Query: 54 QSAAAAKALAQQEVQQYSDKQALQGRLQEAPTFARAAKAKATHIANPGTARYQQFDDNPV 113 +A KA + + + + Q R+ TF A +I +P +N Sbjct: 286 AHSAVVKAEDEALESEALESRERQNRVSMTVTFD--AAMPIENIVSPYHGISINMVENAA 343 Query: 114 KQVAQNPLATFSLD-VDT-----GSYANVRRFLNQGLLPPPDAVRVEEIVNYFPSDWD-- 165 QV+ + A + D V T GS F QG A +V F Sbjct: 344 AQVSLDNYAVANRDFVLTWQPVQGSEPTAAVFSQQGKTHAELASQVTAGDTSFNQGGAKS 403 Query: 166 --IKDKQSIPASKPIPFAMRYELAPAPWNEQRTLLKVDILAKDRKSEELPASNLVFLIDT 223 Q P S+ + + E+ L+ + A LV +IDT Sbjct: 404 KLSPQSQPEPQSQLQVQDSKQQTLSKKALEKYALVMLMPPQGSDDESSSIARELVLVIDT 463 Query: 224 SGSMISDERLPLIQSSLKLLVKELREQDNIAIVTYAGDSRIA---LPSISGSHKAEINAA 280 SGSM D + +S+LK + LR QD+ ++ + + + + Sbjct: 464 SGSMSGDAII-QAKSALKYALAGLRPQDSFNVLQFNSTVERWSRHVMPATAINLGRAQNY 522 Query: 281 IDSLDAEGSTNGGAGLELAYQQATKGFIKG-----------------------GINRILL 317 I+ L A+G T L+ A + + ++L Sbjct: 523 INGLQADGGTEMSLALDAALTKLDNDRGHNSKPVHDDDRYQSSNETLEQSAATPLRQVLF 582 Query: 318 ATDGDFNVGIDDPKSIESMVKKQRESGVTLSTFGVGNSNYNEAMMVRIADVGNGNYSYID 377 TDG + + + +K Q L T G+G++ N M R A+VG G Y+YI Sbjct: 583 ITDGA----VANESRLFEQIKNQLGES-RLFTIGIGSAP-NAHFMQRAAEVGRGTYTYIG 636 Query: 378 TLSEAQKVLNSEMRQMLITVAKDVKAQIEFNPAWVTEYRQIGYEKRQLRVEHFNNDNVDA 437 L E + + S + ++ DV+ F+ V +Y V Sbjct: 637 KLDEVNQKVVSLLEKIEKPQVTDVELH--FSDGSVPDY-----------------WPVRI 677 Query: 438 GDIGAGKHITLLFE--------LTLNGQKASIDKLRYAPDNKLAKSDKTKE---LAWLKI 486 D+ A + + + L + GQ A R P N A+ + ++ L + Sbjct: 678 PDLYAHEPVLVALRIPSYVSDDLLIQGQLAGQFWQRRLPLNSAAQVNDLEQAKGLDLIWA 737 Query: 487 RWKYPQGKESQLVEFPLGPTINAPSEDMRF 516 R + + S+ + M+F Sbjct: 738 RKQIAALELSKQTANKERIEKQITAIAMKF 767 >UniRef50_Q86UX2 Inter-alpha-trypsin inhibitor heavy chain H5 n=40 Tax=Euteleostomi RepID=ITIH5_HUMAN Length = 942 Score = 194 bits (492), Expect = 1e-47, Method: Composition-based stats. Identities = 72/415 (17%), Positives = 147/415 (35%), Gaps = 42/415 (10%) Query: 146 PPPDAV--RVEEIVNYFPSDWDIKDKQSIPASKPIPFAMRYELAPAPW-NEQRTLLKVDI 202 PPP V + E N ++ + F +RY++ + + L + Sbjct: 222 PPPSTVINQNETFANIIFKPTVVQQARIAQNGILGDFIIRYDVNREQSIGDIQVLNGYFV 281 Query: 203 LAKDRKSEELPASNLVFLIDTSGSMISDERLPLIQSSLKLLVKELREQDNIAIVTYAGDS 262 K N+VF++D+S SM+ +L + +L ++ +LR QD +I+ ++ Sbjct: 282 HYFAPKDLPPLPKNVVFVLDSSASMVG-TKLRQTKDALFTILHDLRPQDRFSIIGFSNRI 340 Query: 263 RIA---LPSISGSHKAEINAAIDSLDAEGSTNGGAGLELAYQQATKGFIKGG-----INR 314 ++ L S++ + I + G T+ L+ A + K G ++ Sbjct: 341 KVWKDHLISVTPDSIRDGKVYIHHMSPTGGTDINGALQRAIRLLNKYVAHSGIGDRSVSL 400 Query: 315 ILLATDGDFNVGIDDPKSIESMVKKQRESGVTLSTFGVGNSNYNEAMMVRIADVGNGNYS 374 I+ TDG VG I + ++ V + T G+GN + + ++ +++ G Sbjct: 401 IVFLTDGKPTVGETHTLKILNNTREAARGQVCIFTIGIGN-DVDFRLLEKLSLENCGLTR 459 Query: 375 YIDTLSEAQKVLNSEMRQMLITVAKDVKAQIEFNPAWVTEYRQIGYEKRQLRVE------ 428 + +A L ++ + D++ + P+ V + + + E Sbjct: 460 RVHEEEDAGSQLIGFYDEIRTPLLSDIRID--YPPSSVVQATKTLFPNYFNGSEIIIAGK 517 Query: 429 --HFNNDNVDAGDIGAGKHITLLF---ELTLNGQKASIDKLRYAPDNKLAKSDKTKELAW 483 D++ ++ A + ++ + QKA D +P T + Sbjct: 518 LVDRKLDHLHV-EVTASNSKKFIILKTDVPVRPQKAGKDVTG-SPRPGGDGEGDTNHIER 575 Query: 484 LKIRWKYPQGKESQLVEFPLGPTINAPSEDMR-FRAAVAA--------YGQKLRG 529 L +L+ L E +R A+A KLRG Sbjct: 576 LW-----SYLTTKELLSSWLQSDDEPEKERLRQRAQALAVSYRFLTPFTSMKLRG 625 >UniRef50_A0C5K4 Chromosome undetermined scaffold_150, whole genome shotgun sequence n=1 Tax=Paramecium tetraurelia RepID=A0C5K4_PARTE Length = 611 Score = 193 bits (491), Expect = 1e-47, Method: Composition-based stats. Identities = 63/339 (18%), Positives = 141/339 (41%), Gaps = 22/339 (6%) Query: 181 AMRYELAPAPWNEQRTLLKVDILAKDRKSE---ELPASNLVFLIDTSGSMISDERLPLIQ 237 ++R + + + + I K+ ++E +L+ LID S SM D + +++ Sbjct: 153 SLRTSCKVSNYKSEYIPAMISIKTKENQTEMTERTIGIDLICLIDKSMSMSGDN-INMVK 211 Query: 238 SSLKLLVKELREQDNIAIVTYAGDSRIALPS--ISGSHKAEINAAIDSLDAEGSTNGGAG 295 SL LL+ L EQD + I+T+ ++ P ++ +K A I + AEG T + Sbjct: 212 KSLLLLLDFLGEQDRLQIITFNEHAQRLTPLKCLTEKNKQYFQAVISQISAEGLTKISSA 271 Query: 296 LELAYQQATKGFIKGGINRILLATDGDFNVGIDDPKSIESMVKKQRESGVTLSTFGVGNS 355 +A++Q + + + + L +DG + + VK+ T+STFG G+ Sbjct: 272 TYIAFKQLKEKVYRNNVTSVFLLSDGHDGDALFEISDQIRHVKEV----FTISTFGFGD- 326 Query: 356 NYNEAMMVRIADVGNGNYSYIDTLSEAQKVLNSEMRQMLITVAKDVKAQIEFNPAWVTEY 415 +++ MM I+++ NGN+ Y+ ++ + + ++ +A+ ++ + + Sbjct: 327 DHDAQMMTSISNLKNGNFYYVKDITLLDEFFAHALGGIVSVIAEQIQISLSLTLTKPLQD 386 Query: 416 RQIGYEKRQLRVEHFNNDNVDAGDIGAGKHITLLFELTLNGQKASIDKLRYAPDNKLAKS 475 QI + + + ++ + +G +FE+ + + I Sbjct: 387 VQISKTYGNMWKKREHAYEINIPQLASGTRKDFVFEIQIPYIYSKIQDQERVVK------ 440 Query: 476 DKTKELAWLKIRWKYPQGKESQLVEFPLGPTINAPSEDM 514 + +++ K P E L T+ +E++ Sbjct: 441 -----VLEARLKLKDPLSGEIIQKSAALNLTLFNENENV 474 >UniRef50_B8CPU4 Von Willebrand factor, type A n=1 Tax=Shewanella piezotolerans WP3 RepID=B8CPU4_SHEPW Length = 710 Score = 193 bits (491), Expect = 1e-47, Method: Composition-based stats. Identities = 81/498 (16%), Positives = 167/498 (33%), Gaps = 55/498 (11%) Query: 25 NKESQQQQPSTPTEQQVLAAQQAAIKEAEQSAAAAKALAQQEVQQYSDKQALQGRLQEAP 84 + Q ++ S +++ A + Q+ V + + +L+ + AP Sbjct: 146 EAKKQGKKASLLQQKRPNIFSAEVANLAPGETLIVELNYQELVHYDNGEFSLRFPMVVAP 205 Query: 85 TFARAAKAKATHIANPGTARYQQFDDNPVKQVAQNPLATFSLDV--DTGSY-ANVRRFLN 141 + + A Y ++ + ++V D G + + Sbjct: 206 RYKPRGEKSALSNMAAEVNSYVASTLGDLQGSESERVNLVDIEVTLDAGMPIGEINSPYH 265 Query: 142 QGLLPPPDAVRVEEIVNYFPSDWDIKDKQSIPASKPIPFAMR----------YELAPAPW 191 Q + + + + ++ D A+ E Sbjct: 266 QIEVSSNGDSQAQIQLTAAKANSDFVLNWRPIVGSAPKAAIFSQQGKTHVSDLESKATAA 325 Query: 192 NEQRTLLKVDILAKDRKSEELPASNLVFLIDTSGSMISDERLPLIQSSLKLLVKELREQD 251 Q +L+ + + L L+ +IDTSGSM E + ++S+ + L QD Sbjct: 326 QPQYSLVMLLPPQDKMRLSALAPRELILVIDTSGSMSG-EAIEQAKASIIYALAGLSAQD 384 Query: 252 NIAIVTYAGDSRIALPS---ISGSHKAEINAAIDSLDAEGSTNGGAGLELAYQQATKGFI 308 + I+ + + + S + A + L A G T L+ A Q + Sbjct: 385 SFNILQFNSNVYALSDTPLNASAKNIGRAQAYVQRLQANGGTEMSLALDKALSQ--QDAN 442 Query: 309 KGGINRILLATDGDFNVGIDDPKSIESMVKKQRESGVTLSTFGVGNSNYNEAMMVRIADV 368 + + ++L TDG + + + + ++ Q + L T G+G++ N M R A++ Sbjct: 443 RERLRQVLFITDGA----VGNEPQLFTQIRNQLQQS-RLFTIGIGDAP-NAHFMQRAAEL 496 Query: 369 GNGNYSYIDTLSEAQKVLNSEMRQMLITVAKDVKAQIEFNPAWVTEYRQIGYEKRQLRVE 428 G G Y+YI SE + + + + ++ DV+ F V +Y Sbjct: 497 GRGTYTYIGKQSEVKSKMVAMLDKLEKPTVTDVEVH--FADGSVPDY------------- 541 Query: 429 HFNNDNVDAGDIGAGKHITLLF--------ELTLNGQKASIDKLRYAPDNKLAKSDKTKE 480 D+ A + I + EL ++GQ A + + S + K Sbjct: 542 ----WPASIPDLYAHEPIMVAMKLPSFSDKELVVSGQLAGQY---WQQSLVIENSAEAKG 594 Query: 481 LAWLKIRWKYPQGKESQL 498 L + R + + S+ Sbjct: 595 LDLIWARKQIAALELSKE 612 >UniRef50_D1KBY4 Putative uncharacterized protein n=2 Tax=Proteobacteria RepID=D1KBY4_9GAMM Length = 682 Score = 193 bits (491), Expect = 1e-47, Method: Composition-based stats. Identities = 63/488 (12%), Positives = 158/488 (32%), Gaps = 49/488 (10%) Query: 22 QPENKESQQQQPSTPTEQQVLAAQQAAIKEAEQSAAAAKAL-----AQQEVQQYSDKQAL 76 + K Q + + V + + + + QQ V+ +D+ ++ Sbjct: 132 REAKKIYNQAKKAGKKVSLVEQQRPNIFTTKVANIGPGETITIAIEYQQAVRIDNDQFSI 191 Query: 77 QGRLQEAPTFARAAKAKATHIANPGTARYQQFDDN----PVKQVAQNPLATFSLDVDTG- 131 + + + K A A + D P + S+++ G Sbjct: 192 RFPMVVGERYIPGKKINTQPNALGNKANTHRVKDASKIAPPTDTNADRPVAISINLKAGF 251 Query: 132 SYANVRRFLNQGLLPPPDAVRVEEIVNYFPSDWDIKDKQSIPASKPIPFAMRYELAPAPW 191 ++ +Q + D + + ++ D + + A+ + Sbjct: 252 DTDSIISPYHQISIVETDKLTKHISLKNTQANRDFELTWQAHKTLTPTLALFTQ---QKG 308 Query: 192 NEQRTLLKVDILAKDRKSEELPASNLVFLIDTSGSMISDERLPLIQSSLKLLVKELREQD 251 ++ +L A + + ++F+ID+ + + ++L + L+ D Sbjct: 309 DDHYLMLMATPPADEVFKQSHTPREVIFIIDS-SGSMMGSSMEQATNALIQAINRLKPTD 367 Query: 252 NIAIVTYAGDSRIALP---SISGSHKAEINAAIDSLDAEGSTNGGAGLELAYQQATKGFI 308 I+ + D + +K L A G T ++ A +K Sbjct: 368 RFNIIDFDSDFEVLFDTAIPAIDMNKRHGIRFAKHLVASGGTEPLEAIKFAL--LSKDED 425 Query: 309 KGG-INRILLATDGDFNVGIDDPKSIESMVKKQRESGVTLSTFGVGNSNYNEAMMVRIAD 367 + +++ TDG + + K + V++ + T G+G++ N+ +M ++A+ Sbjct: 426 SDKYLRQVIFLTDGQ----VGNEKELFRAVQQNIDDD-RFFTIGIGSAP-NDYLMTKMAE 479 Query: 368 VGNGNYSYIDTLSEAQKVLNSEMRQMLITVAKDVKAQIEFNPAWVTEYRQIGYEKRQLRV 427 G G ++YI + E + + ++ D+ Sbjct: 480 YGKGAFTYIGDIDEVEVKMGELFSKLESPAMTDININFPI-------------------D 520 Query: 428 EHFNNDNVDAGDIGAGKHITLLFELTLNGQKASIDKLR----YAPDNKLAKSDKTKELAW 483 + + D+ G+ IT +++L K +I ++ + ++ T+ + Sbjct: 521 INADQALGSIADLYKGEAITAVYKLNAIPNKLTISGNTANGIFSKSISINANNSTEGINV 580 Query: 484 LKIRWKYP 491 L R K Sbjct: 581 LWARRKID 588 >UniRef50_D2VHB8 Predicted protein n=1 Tax=Naegleria gruberi RepID=D2VHB8_NAEGR Length = 755 Score = 193 bits (490), Expect = 2e-47, Method: Composition-based stats. Identities = 67/348 (19%), Positives = 138/348 (39%), Gaps = 26/348 (7%) Query: 131 GSYANVRRFLNQGLLPPPDAVRVE-EIVNYFPSDWDIKDKQSIPASKPIPFAMRYELAPA 189 + R +++ LL A+R E + + + + + ++ + Sbjct: 48 ATSPLTREAMSKELLVSNIALRNTIEQLVHGNTPIVVNRVTPLVEQDIEDHSLNLTIVSK 107 Query: 190 PWNEQRTLLKVDILAKDRKSEELPASNLVFLIDTSGSMI-------------SDERLPLI 236 +E + ++ + + NLV ++D SGSM RL L+ Sbjct: 108 QVSESKR--RIHVKVSPPTGGQRQPCNLVCILDVSGSMGSSAEDLSSSNENTGFSRLDLV 165 Query: 237 QSSLKLLVKELREQDNIAIVTYAGDSRIALPSISGS--HKAEINAAIDSLDAEGSTNGGA 294 + S++ L++ + E+D I+++ ++ +R+ LP K + ++ L EGSTN Sbjct: 166 KHSVRTLIELMNEKDQISLIPFSDSARMELPLTKMDAVGKKKAIEKLEHLGPEGSTNVWD 225 Query: 295 GLELAYQQATKGFIKGGIN-RILLATDGDFNVGIDDPKSIESMVKKQRESGV---TLSTF 350 GL L + + + N ++L TDG+ N I+ P+ I ++K + T+ +F Sbjct: 226 GLRLGMESSLNNPLCAKTNTCLILFTDGEPN--INPPRGIVPTLEKYIKEHPLNSTIHSF 283 Query: 351 GVGNSNYNEAMMVRIADVGNGNYSYIDTLSEAQKVLNSEMRQMLITVAKDVKAQIEFNPA 410 G G S + A++ IA G+G YSYI S + M +L T + + I Sbjct: 284 GFGYS-LDSALLKDIAMNGSGAYSYIPDCSMVGTTFVNMMSNILCTAVRRAELVISSMNG 342 Query: 411 WVTEYRQIGYEKRQLRVEHFNNDNVDAGDIGAGKHITLLFELTLNGQK 458 + G + + + G + + + ++ ++ Sbjct: 343 AKISH-VYGSSQNGNNSTNEKQFTISMGGVQFQQSRDYIIDVDMHANN 389 >UniRef50_Q24C76 von Willebrand factor type A domain containing protein n=2 Tax=Tetrahymena thermophila RepID=Q24C76_TETTH Length = 670 Score = 193 bits (490), Expect = 2e-47, Method: Composition-based stats. Identities = 60/326 (18%), Positives = 122/326 (37%), Gaps = 24/326 (7%) Query: 195 RTLLKVDILAKDRKSEELPASNLVFLIDTSGSMISD-----------------ERLPLIQ 237 + + L + SN+ ++D SGSM S+ L +++ Sbjct: 13 KDDFLMISLVPPQNYSSRTNSNICCVVDVSGSMSSEAKIINQSSQKSDENYSLSILDVVK 72 Query: 238 SSLKLLVKELREQDNIAIVTYAGDSRIALP--SISGSHKAEINAAIDSLDAEGSTNGGAG 295 S+K++V L +D ++IVT++ + + ++ S+K I++L EG T G Sbjct: 73 HSIKMIVNTLGSEDYLSIVTFSDSANVLFDLLPMNDSNKTMAIEKIENLSTEGGTELWKG 132 Query: 296 LELAYQQATKGFIKGGINRILLATDGDFNVGIDDPKSIESMVKKQRESGVTLSTFGVGNS 355 L A I L TDG D + + + T++TFG +S Sbjct: 133 LNSALNILLNNKTPNTNQSIFLLTDGQPTDSGIDTN-LVKFKQAYPKLNCTINTFGF-SS 190 Query: 356 NYNEAMMVRIADVGNGNYSYIDTLSEAQKVLNSEMRQMLITVAKDVKAQIEFNPAW-VTE 414 + N +M +IA NG +S+I S + + L + I +T Sbjct: 191 SSNSELMNKIAMEYNGMFSFIPDASFIATAFANALANTLTVYTNNCLLHITTLEGSNLTL 250 Query: 415 YRQIGYEKRQLRVEHFNNDNVDAGDIGAGKHITLLFELT--LNGQKASIDKLRYAPDNKL 472 + + +++ + +D G + + +F++ + +++ Sbjct: 251 HPAHSFLVKKMNSQGQQEILIDVGVVNTQQTRDFIFKINNLPKQLDRTYAQVKLTYQQSF 310 Query: 473 AKSDKTKELAWLKIRWKYPQGKESQL 498 + +D K + I+ Q +ES+ Sbjct: 311 SLNDYVKNIPSQTIQKSLIQREESEE 336 >UniRef50_C3YPR2 Putative uncharacterized protein (Fragment) n=1 Tax=Branchiostoma floridae RepID=C3YPR2_BRAFL Length = 863 Score = 192 bits (488), Expect = 3e-47, Method: Composition-based stats. Identities = 74/398 (18%), Positives = 143/398 (35%), Gaps = 37/398 (9%) Query: 140 LNQGLLPPPDAVRVEEIVNYFPSDW-DIKDKQSIPASKPIPFAMRYELAP-APWNEQRTL 197 G L + R + D++ + P+ F +RY++ + + + Sbjct: 158 YGSGELEGVEIARPSPNRAHIQYRPTDMEQMRMSPSGISGDFLVRYDVKRDLSVGDIQIV 217 Query: 198 LKVDILAKDRKSEELPASNLVFLIDTSGSMISDERLPLIQSSLKLLVKELREQDNIAIVT 257 + + N+VF+ID SGSM ++ + ++ ++K+LR+ D ++ Sbjct: 218 NGYFVHYFAPSGLPVVPKNIVFIIDKSGSM-GGTKMRQTKQAMNTILKDLRDHDRFNVMP 276 Query: 258 YAGDSRIALP----SISGSHKAEINAAID-SLDAEGSTNGGAGLELAYQQATKGFIKGGI 312 ++ S + P + + + S++A G TN + A + Sbjct: 277 FSYSSTMWRPNEMVLATRENIESARTYVRRSINAGGGTNINQAIIDAADLLRRVTDDQPN 336 Query: 313 N-----RILLATDGDFNVGIDDPKSIESMVKKQRESGVTLSTFGVGNSNYNEAMMVRIAD 367 + I+ TDG +VG P++I VK V+L G G + + + ++A Sbjct: 337 SPRSASLIIFLTDGLPSVGESKPRNIMVNVKNAIREQVSLFCLGFGK-DVDFPFLEKMAL 395 Query: 368 VGNGNYSYIDTLSEAQKVLNSEMRQMLITVAKDVKAQIEFNPAWVTEYRQIGYEKR---- 423 G I S+A L ++ + DV Q+ + VT+ + + Sbjct: 396 ENRGLARRIYEDSDAALQLKGFYDEVATPLLFDV--QMRYPENLVTDLTPVDFNTYFDGS 453 Query: 424 ------QLRVEHFNNDNVDAGDIGAGKHIT--LLFELTLNGQKASIDKLRYAPDNKLAKS 475 +L E +V G + L E +N L+Y N+ A Sbjct: 454 ELVVAGRLSNEVGRTLDVSV----VGNTVDSQLTLEKEVNVTAPHKALLKYGILNRQAVG 509 Query: 476 DKTKEL-AWLKIRWKYPQGKESQLVEFPLGPTINAPSE 512 D T+ L A+L I+ + +L+ P T Sbjct: 510 DFTQRLWAYLTIKHYL----KQRLIATPEEKTNLTAKA 543 >UniRef50_D0LWF9 von Willebrand factor type A n=1 Tax=Haliangium ochraceum DSM 14365 RepID=D0LWF9_HALO1 Length = 419 Score = 192 bits (487), Expect = 3e-47, Method: Composition-based stats. Identities = 74/380 (19%), Positives = 149/380 (39%), Gaps = 12/380 (3%) Query: 197 LLKVDILAKDRKSEELPASNLVFLIDTSGSMISDERLPLIQSSLKLLVKELREQDNIAIV 256 L V I A+ +S NL +ID S SM RL + + +V++L E+D ++++ Sbjct: 17 FLLVRIEAQATESSARMPVNLALVIDRSSSMRGP-RLASAIVAARQVVEQLDERDRLSVI 75 Query: 257 TYAGDSRIALPSISGSH--KAEINAAIDSLDAEGSTNGGAGLELAYQQATKGFIKGGINR 314 + +R +S + + + A+ L TN AG++ + GF++G ++R Sbjct: 76 AFDATARTIFGPMSVTDEARQTLEQALAGLRTGVGTNLAAGMKKGAEAVRSGFVRGALSR 135 Query: 315 ILLATDGDFNVGIDDPKSIESMVKKQRESGVTLSTFGVGNSNYNEAMMVRIADVGNGNYS 374 ++L TDG ++GI D + ++ +K+ + GVT++T G+G +++ ++ +A G G + Sbjct: 136 LVLLTDGQPSLGITDNDRLCALAQKEADRGVTITTMGLGQG-FDDELLADLAHSGRGGFH 194 Query: 375 YIDTLSEAQKVLNSEMRQMLITVAKDVKAQIEFNPAWVTEYRQIGYEKRQLRVEHFNNDN 434 Y+ + ++ E+ + A + + R L Sbjct: 195 YLASAADIPGAFGRELSGVFAIAATQTEIGLRPAQQIDAAEVLHRLPSRPLDDG----LA 250 Query: 435 VDAGDIGAGKHITLLFELTLNGQKASIDKLRYAPDNKLAKSDKTKELAWLKIRWKYPQGK 494 V+ G++ AG +LF L+ + ++ P Sbjct: 251 VELGELAAGTPRQVLFRLSRRSGDIEARCGTLTVTYRSSEGTPGDAHLLGIEVPAQPDPA 310 Query: 495 ESQLVEFPLGPTINAPSEDMRFRAAVAAYGQKLRGSEYLNNTSWQQIKQWAQQAKGEDPQ 554 +++ A + D+ A A G LR L+ + ++++G DP Sbjct: 311 HRRIIALERMRLAVASAVDV--AWARRASGDSLRALGALSEIKLE--VSQLKESEGADPD 366 Query: 555 GYRAEFIRLIELADGVTDIS 574 + E V S Sbjct: 367 ALDVLLRDIGEAESAVVKSS 386 >UniRef50_Q2QZH3 Os11g0687100 protein n=79 Tax=Eukaryota RepID=Q2QZH3_ORYSJ Length = 633 Score = 192 bits (487), Expect = 4e-47, Method: Composition-based stats. Identities = 64/364 (17%), Positives = 120/364 (32%), Gaps = 39/364 (10%) Query: 182 MRYELAPAPWNEQRTLLKVDILAKDRKSEELPASNLVFLIDTSGSMI------------S 229 + + N+ +L ++V ++D SGSM Sbjct: 37 IFPTIPRGQTNKDFQVLLRVEAPPAADLNSHVPLDVVAVLDVSGSMNDPVAAASPKSNLQ 96 Query: 230 DERLPLIQSSLKLLVKELREQDNIAIVTYAG----DSRIALPSISGSHKAEINAAIDSLD 285 RL ++++S+K ++++L + D ++IV + + L +SG ++ ID L Sbjct: 97 GSRLDVLKASMKFVIRKLADGDRLSIVAFNDGPVKEYSSGLLDVSGDGRSIAGKKIDRLQ 156 Query: 286 AEGSTNGGAGLELAYQQATKGFIKGG--INRILLATDGDFNVG-IDDPKSIESMVKKQRE 342 A G T LE A + + + ILL TDGD G +I V K Sbjct: 157 ARGGTALMPALEEAVKILDERQGSSRNHVGFILLLTDGDDTTGFRWTRDAIHGAVFKY-- 214 Query: 343 SGVTLSTFGVGNSNYNEAMMVRIADVGNGNYSYIDTL--SEAQKVLNSEMRQMLITVAKD 400 + TFG+G S ++ ++ IA G YS++D + L + + A D Sbjct: 215 ---PVHTFGLGAS-HDPEALLHIAQGSRGTYSFVDDDNLANIAGALAVCLGGLKTVAAVD 270 Query: 401 VKAQI---EFNPAWVTEYRQIGYEKRQLRVEHFNNDNVDAGDIGAGKHITLLFELTLNGQ 457 + + E + R + V G + G+ + L + Sbjct: 271 TRVSLKAAELSGGGARIVRVDSGGYESSVACGGASGEVVVGVLYTGEVKNFVVHLHVPAA 330 Query: 458 KASIDKLRYAPDNKLAKSD---------KTKELAWLKIRWKYPQGKESQLVEFPLGPTIN 508 ++ + ++L + + + G S V Sbjct: 331 SSTTLTFSSVECGGYYDAATVCDHCHHRHQQQLLAVGYSYSHAPGAASAAVSVEGHGVFV 390 Query: 509 APSE 512 E Sbjct: 391 ERPE 394 >UniRef50_A6GC99 Putative uncharacterized protein n=1 Tax=Plesiocystis pacifica SIR-1 RepID=A6GC99_9DELT Length = 546 Score = 191 bits (486), Expect = 4e-47, Method: Composition-based stats. Identities = 100/495 (20%), Positives = 171/495 (34%), Gaps = 43/495 (8%) Query: 91 KAKATHIANPGTARYQQFDDNPVKQVAQNPLATFSLDVDTGSYANVRRFLNQGLLPPPDA 150 A P T+ P+ T D G+ + + P A Sbjct: 4 AVAAPSPRAPLTSLAALVGLCACTPTNAAPIDTQGPDSGQGATLTLAEPEPEADAQPGAA 63 Query: 151 VRVEEIVNYFPSDWDIKDKQSIPASKPIPFAM-----RYELAPAPWNEQRTLLKVDILAK 205 EE P D + P+ +A + + A Sbjct: 64 ---EEPETKAPQRPDSLGEFVRAHDGPVHLEFISQYGYVHVAEDAGQPFEMPAIIRLSAD 120 Query: 206 DRKSE-ELPASNLVFLIDTSGSMISDERLPLIQSSLKLLVKELREQDNIAIVTYAGDSRI 264 D + P +L ++D SGSM ++L + + LV L EQD + +++Y Sbjct: 121 DEAGQGPRPGLDLAIVLDRSGSM-GGDKLRFAKQAGLDLVNRLDEQDRVTLISYDDTVTP 179 Query: 265 A--LPSISGSHKAEINAAIDSLDAEGSTNGGAGLELAYQQATKGFI----------KGGI 312 L + + + + G+T G L + Q+ + Sbjct: 180 LSNLQRVDDDGIEVLRRQLLDIQVGGTTALGPALFMGLQRLAAPEPFGPQTRTEARHDRL 239 Query: 313 NRILLATDGDFNVGIDDPKSIESMVKKQRESGVTLSTFGVGNSNYNEAMMVRIADVGNGN 372 ++L +DG NVG P+ I V + GV++ST G+G +YNE +M RIAD G G Sbjct: 240 RHVILLSDGIANVGETRPEVIGGRVAEHFGGGVSVSTLGMGL-DYNEDLMTRIADEGGGR 298 Query: 373 YSYIDTLSEAQKVLNSEMRQMLITVAKDVKAQIEFNPAWVTEYRQIGYEKRQLRVEHFNN 432 Y +I+ +L E+ + TVA +V + P GY + ++ Sbjct: 299 YHFIEDAESIPAMLGDELAGLTATVASEVDSVFATLPGTDVT-EVYGYTQTVA----GSD 353 Query: 433 DNVDAGDIGAGKHITLLFELTLNGQKASIDKLRYAPDNKLAKSDKTKELAWLKIRWKYPQ 492 + G +GAG+ ++ L L+ ++ + + EL +++R++ Sbjct: 354 TTIRVGFLGAGQSREIVVNLRLDPEQ-----------VRGWAPGERVELGEVEVRYRLVT 402 Query: 493 GKESQLVEFPLGPTINAPSEDMRFRAAVAAYGQKLRGSEYLNNTSWQQIKQWAQQAKGED 552 G E P + A AA + +E + + Q A Sbjct: 403 GAT----EGGPAPMRSLAVPATVLVAHDAAQARASERTEVTVRAAEVAAARHIQAASVAV 458 Query: 553 PQGYRAEFIRLIELA 567 QG AE RL+E A Sbjct: 459 DQGDFAEAQRLLESA 473 >UniRef50_A3DLZ3 von Willebrand factor, type A n=1 Tax=Staphylothermus marinus F1 RepID=A3DLZ3_STAMF Length = 416 Score = 191 bits (486), Expect = 5e-47, Method: Composition-based stats. Identities = 76/342 (22%), Positives = 142/342 (41%), Gaps = 13/342 (3%) Query: 193 EQRTLLKVDILAKDRKSEELPASNLVFLIDTSGSMISDERLPLIQSSLKLLVKELREQDN 252 E R + +L+ + P + +IDTS SM E++ + + L+ LR++D Sbjct: 17 EGREDIIPFVLSVEGVYSAHPPIAFLIVIDTSYSMDG-EKIFRAKQAALRLLDILRDKDY 75 Query: 253 IAIVTYAGDSRIALPSISGSHKAEINAAIDSLDAEGSTNGGAGLELAYQQATKGFIKGGI 312 + + +AG L + +++ E+ AI L TN L+ ++ K G I Sbjct: 76 VGVYGFAGKFYKVLEPVPATNRNEVEKAIIGLKLGSGTNIYDTLKKLVEETKKVLESGAI 135 Query: 313 N--RILLATDGDFNVGIDDPKSIESMVKKQRESGVTLSTFGVGNSNYNEAMMVRIADVGN 370 + RI+ TDG+ G P+ I M KK RE+G + GVG YNE ++ R+A V N Sbjct: 136 SLVRIIFITDGEPTTGQKKPEKILEMAKKLREAGASALIIGVGT-EYNEKLLSRMAMVLN 194 Query: 371 GNYSYIDTLSEAQKVLNSEMRQMLITVAKDVKAQIEFNPAWVTEYRQIGYEKRQLRVEHF 430 G + ++ + +K+++ + AK+V +P + + Y VE Sbjct: 195 GEFEHVSDPASLEKLISEYAKSTQEISAKNVAVLFRLSPGFRVDIYNRPYNNIPEGVE-- 252 Query: 431 NNDNVDAGDIGAGKHITLLFELTLNGQKASIDKLRYAPDNKLAKSDKTKELAW-LKIRWK 489 V+ GDI + I ++ ++T + + + + +E A + IR K Sbjct: 253 ----VEIGDIYYREIIDIVGDITTPPLLIGEAHIGEIQISYVNPETEEQEFATPIPIRIK 308 Query: 490 YPQGKESQLVEFPLGPTINAPSEDMRFRAAVAAYGQKLRGSE 531 +E+ V+ +R + +K + + Sbjct: 309 VKPLEEASTVKVDEKVLAEVR--MIRTATKLQETLEKRKAKD 348 >UniRef50_B9XQ17 Vault protein inter-alpha-trypsin domain protein n=1 Tax=bacterium Ellin514 RepID=B9XQ17_9BACT Length = 806 Score = 191 bits (486), Expect = 5e-47, Method: Composition-based stats. Identities = 65/372 (17%), Positives = 144/372 (38%), Gaps = 28/372 (7%) Query: 117 AQNPLATFSLDVDTGSYANVRRFLNQGLLPPPDAVRVEEIVNYFPSDWDIKDKQSIPASK 176 + P+ + S+ V+ S +R L P + + N ++ + A Sbjct: 213 SSKPIKSVSVKVNVES----KRPLKTIYSPSHEVEVKRDGSNRATVGYEA-SEVKPDADL 267 Query: 177 PIPFAMRYE------LAPAPWNEQRTLLKVDILAKDRKSEELPASNLVFLIDTSGSMISD 230 + FA + +A +E L + D K++++ + ++VF++DTSGSM Sbjct: 268 QLYFAPEKDEIGVNLMAYKTGDEDGYFLLLASPGVDAKAKQIVSKDVVFVLDTSGSMSG- 326 Query: 231 ERLPLIQSSLKLLVKELREQDNIAIVTYAGDSRIA---LPSISGSHKAEINAAIDSLDAE 287 +++ + +L+ V+ L + D I+ ++ +S L ++S ++ + I +L A Sbjct: 327 KKMEQAKKALQFCVESLNDGDRFEIIRFSTESEPLFDKLAAVSKENREKAGDFIKNLKAM 386 Query: 288 GSTNGGAGLELAYQQATKGFIKGGINRILLATDGDFNVGIDDPKSIESMVKKQRESGVTL 347 G T L+ A +G ++ TDG VG D I ++++ + + Sbjct: 387 GGTAIDEALKKALSL---ESKEGRPFVVVFLTDGLPTVGTTDEDQILKGMQERNKEKRRI 443 Query: 348 STFGVGNSNYNEAMMVRIADVGNGNYSYIDTLSEAQKVLNSEMRQMLITVAKDVKA---- 403 FG+G + N ++ RIA+ Y+ + + ++S ++ V + K Sbjct: 444 FCFGIGT-DVNTHLLDRIAEETRAFSQYVLPEEDLEVKVSSFFSKINEPVLANPKLKFTA 502 Query: 404 QIEFNPAWVTEYRQIGYEKRQL----RVEHFNNDNVDAGDIGAGKHITLLFELTLNGQKA 459 I + + + ++ + ++ V GD+ G ++L Sbjct: 503 DIRTTKMYPSPLPDLFKGEQLVLVGRYSGKGSSAAVIDGDV-NGDKKKFTYDLNFPEHAD 561 Query: 460 SIDKLRYAPDNK 471 D + + Sbjct: 562 EHDFIPRLWATR 573 >UniRef50_A6GDG5 Putative lipoprotein n=1 Tax=Plesiocystis pacifica SIR-1 RepID=A6GDG5_9DELT Length = 486 Score = 191 bits (484), Expect = 8e-47, Method: Composition-based stats. Identities = 90/441 (20%), Positives = 181/441 (41%), Gaps = 39/441 (8%) Query: 129 DTGSYANVRRFLNQGLLPPPDAVRVEEIVNYFPSDWDIKDKQSIPASKPIPFAMRYELAP 188 S R G + +R E +NY+ ++ PA+ P ++ +L Sbjct: 80 SMASPVMTREVAPYGFYTTFEWIRPWEFLNYYSFEY--------PAADPGDLSVHVDLR- 130 Query: 189 APWNEQRTLLKVDILAKDRKSEELPASNLVFLIDTSGSMISDERLPLIQSSLKLLVKELR 248 +E R L++ + ++ E N+ ++D S SM + ++++ + + LR Sbjct: 131 -SKDEGRFQLQIGVASEIVSPSERLPMNITLVLDESTSMTG-APMYAMKATARAIAGSLR 188 Query: 249 EQDNIAIVTYAGD--SRIALPSISGSHKAEINAAIDSLDAEGSTNGGAGLELAYQQATKG 306 E D I++V+++ R+A +++GS+ A + ID+++ G T+ AGLE Y A Sbjct: 189 EGDVISLVSWSNSNNVRLASHAVAGSNDATLLDTIDAIEPGGGTDLHAGLEQGYALAQAN 248 Query: 307 FIKGGINRILLATDGDFNVGIDDPKSIESMVKKQRESGVTLSTFGVGNSN-YNEAMMVRI 365 F INR++L +DG N+G D + I M + + G+ + GVG+ YN+ +M + Sbjct: 249 FSADRINRVVLVSDGGANLGFTDAELIAQMAELEDGEGIYMVGVGVGDVGRYNDELMDTV 308 Query: 366 ADVGNGNYSYIDTLSEAQKVLNSEMRQMLITVAKDVKAQIEFNPAWVTEYRQIGYEKRQL 425 D G G +I +EA+++ + A+DV+ ++ P G+E + Sbjct: 309 TDQGKGASVFIPNEAEAERMFGERFMSTMGVAARDVRVELSLPP---------GFEIVRF 359 Query: 426 RVEHFNNDNVDAGDIGAGKHITLLFELTLNGQKASIDKLRYAPDNKLAKSDKTKELAWLK 485 E F++D + + ++F L + L + Sbjct: 360 SGEEFSDDPSEIEPQHLAPNDAMVFYQELETCAPELATE--------------DALLGVV 405 Query: 486 IRWKYPQGKESQL--VEFPLGPTINAPSEDMRFRAAVAAYGQKLRGSEYLNNTSWQQIKQ 543 +RW+ P K+++ VE+ + A S + A+ +Y + + + Sbjct: 406 VRWREPFSKQARERAVEYAFADLLGAESPMLDKGQAILSYAEVFATGLGALEQAEADLAA 465 Query: 544 WAQQAKGEDPQGYRAEFIRLI 564 Q G+ + ++ Sbjct: 466 AEQALPGDSDLAEIRSVLDML 486 >UniRef50_C5Z1W1 Putative uncharacterized protein Sb10g030210 n=1 Tax=Sorghum bicolor RepID=C5Z1W1_SORBI Length = 607 Score = 191 bits (484), Expect = 8e-47, Method: Composition-based stats. Identities = 78/405 (19%), Positives = 149/405 (36%), Gaps = 37/405 (9%) Query: 184 YELAPAPWNEQRTLLKVDILAKDRKSEELPASNLVFLIDTSGSMISDERLPLIQSSLKLL 243 Y AP + +D+ + A +LV ++D SGSM RL ++S+++ + Sbjct: 75 YYPKEAPLGASTVRVLLDVS-SSSSTAGRAALDLVVVLDVSGSMRDFGRLDKLKSAMRFI 133 Query: 244 VKELREQDNIAIVTYAGDSRIALPS--ISGSHKAEINAAIDSLDAEGSTNGGAGLELAYQ 301 +K+L D +++VT+ G + P +S + +D L A G TN AGL++ Q Sbjct: 134 IKKLAPMDRLSVVTFNGGATRECPLRAMSEDAVPVLTDIVDGLVARGGTNIEAGLKMGLQ 193 Query: 302 QATKGFIKG-GINRILLATDGDFNVGIDDPKSIESMVKKQRESGVTLSTFGVGNSNYNEA 360 G ++L +DG+ N G + + + T G SN + Sbjct: 194 VLDGRRYTGARTAGVILMSDGEQNSG--------DATRVRNPQNYPVYTLSFG-SNADMN 244 Query: 361 MMVRIADVGNGNYSYIDTLS--EAQKVLNSEMRQMLITVAKDVKAQIEFNPAWVTE---- 414 ++ ++A G G Y+ + V + M +L V +D+ + A V Sbjct: 245 LLQKLAG-GGGTYNPVLDSGGMSMLDVFSQLMAGLLTVVVRDLYLILSKPAAVVVAATHP 303 Query: 415 --------YRQIGYEKRQLRVEHFNNDNVDAGDIGAGKHITLLFELTLNGQKASIDKLRY 466 + + RQ V GD+ +G+ ++ EL+L +S Sbjct: 304 DDHDLDKIVKVDPGDFRQETDAQSGTVTVKFGDLFSGEVRKVVVELSLRETSSSDYDAEI 363 Query: 467 APDNKLAKSDKTKELAWLKIRWKYPQGKESQLVEFPLGPTINAPSEDMRFRAAV-AAYGQ 525 +++ + + + + G T +E +R + AV + Sbjct: 364 LDVEVSYPNEQGERKKLVGQTLHVKRTSTA---SDNGGTTPELEAELLRRQHAVSIRAAR 420 Query: 526 KLRGSEYLNN--TSWQQIKQWAQQAKGEDP---QGYRAEFIRLIE 565 L E L Q+ ++ + E R E +L++ Sbjct: 421 LLADDEKLGAARARLQKAQRELEGVLEETNPMVAVLRTELQQLMD 465 >UniRef50_C1HBZ8 von Willebrand and RING finger domain-containing protein n=3 Tax=Paracoccidioides brasiliensis RepID=C1HBZ8_PARBA Length = 1068 Score = 191 bits (484), Expect = 9e-47, Method: Composition-based stats. Identities = 63/454 (13%), Positives = 147/454 (32%), Gaps = 55/454 (12%) Query: 115 QVAQNPLATFSLDV-----------DTGSYANVRRFLNQGLLPPPDAVRVEEIVNYFPSD 163 V + P+ T +L V RR L P D + + Sbjct: 359 SVPEEPILTLNLSVAELPCFHLQFQSRNQLELWRRALLDLHNPDADRSLALPRNHNQDYN 418 Query: 164 WDIKDKQSIPASKPIPFAM-RYELAPAPWNEQRTLLKVDILAKDRKSEEL---------- 212 +D+ + + + R + + + ++ Sbjct: 419 YDLDNSGTEEDDYRTHQTIKRVSSTTSSYGGGTRSNNTALTDYTNSIRDVTTISTTSTIP 478 Query: 213 ----PASNLVFLIDTSGSMISDERLPLIQSSLKLLVKELREQDNIAIVTYAGDSR-IALP 267 +LV +I S SM ++ L++ +L+ LV L +D + +VT+ + L Sbjct: 479 PTFHIPLDLVVVIPVSSSMQG-LKISLLRDTLRFLVANLGPRDRMGLVTFGSSGGGVPLV 537 Query: 268 SISGSHKAEINAAIDSLDAEGS----TNGGAGLELAYQQATKGFIKGGINRILLATDGDF 323 ++ A + ++ G + G +A + I ILL +D Sbjct: 538 GMTTKTWGGWPAILGAIRPVGQKSLRADVVEGANVAMDLLMQRRSSNPIATILLISD--- 594 Query: 324 NVGIDDPKSIESMVKKQRESGVTLSTFGVGNSNYNEAMMVRIADVGNGNYSYIDTLSEAQ 383 + + +S++ +V + + V + +FG+G + + M+ ++ +Y+Y+ + Sbjct: 595 -SSMGEGESVDFVVSRAEAAKVGIHSFGLGLT-HKPDTMIELSSRTKASYTYVKDWMMLR 652 Query: 384 KVLNSEMRQMLITVAKDVKAQIEFNPAWVTEYRQIGYEKRQLRVEHFNNDNVDAGDIGAG 443 + + + + T ++ K ++ ++ +I + + GD+ G Sbjct: 653 ECVAGCLGLLQSTSHQNAKLKLRLPEGSPAKFVKISGALHTTKRAAGRDAEAALGDLRFG 712 Query: 444 KHITLLFELTLNGQKASIDKLRYAPDNKLAKSDKTKELAWLKIRWKYPQGKESQLVEFPL 503 +L +L + + + L P + E I + + + Sbjct: 713 DKRDILVQLVIAPDTTTQEHLPQDPWESIV---SGLEALSGPI-----DSDDQRTISV-- 762 Query: 504 GPTINAPSEDMRFRAAVAAYGQKLRGSEYLNNTS 537 E++ A YG LR ++ Sbjct: 763 --------EEVPLLQADLTYGDILRDGHLTHSPR 788 >UniRef50_C5CDX4 von Willebrand factor type A n=1 Tax=Kosmotoga olearia TBF 19.5.1 RepID=C5CDX4_KOSOT Length = 730 Score = 190 bits (483), Expect = 1e-46, Method: Composition-based stats. Identities = 57/303 (18%), Positives = 108/303 (35%), Gaps = 13/303 (4%) Query: 191 WNEQRTLLKVDILAKDRKSEELP-ASNLVFLIDTSGSMISDERLPLIQSSLKLLVKELRE 249 W+E + + E ++VF++D SGSM +++ + +L +++ L E Sbjct: 249 WDEADRRGYFLLTLVPPREPERIIPKDIVFILDISGSMSG-QKIEKAKLALLQVLQMLHE 307 Query: 250 QDNIAIVTYAGDSRIALPSISG-SHKAEINAAIDSLDAEGSTNGGAGLELAYQQATKGFI 308 D +I+T+ + + S + E A+ + A G TN L + Sbjct: 308 GDRFSIITFNNEVNNLTERLLPFSDRTEWYPAVKQIMAGGMTNIHDALLEGIEVLGTQST 367 Query: 309 KGGINRILLATDGDFNVGIDDPKSIE-SMVKKQRESGVTLSTFGVGNSNYNEAMMVRIAD 367 +L TDG GI D +I K + V L FGVG + N ++ +A+ Sbjct: 368 DDRYKVVLFLTDGAPTEGITDIGTIIRDSTKLAKVRDVHLFVFGVG-YDVNAELLDELAE 426 Query: 368 VGNGNYSYIDTLSEAQKVLNSEMRQMLITVAKDVKAQIEFNPAWVTEYRQIGYEKRQLRV 427 G G YI E + + R + V +V +I I Y + Sbjct: 427 KGGGKVKYIVENEEIDEKVLELYRMIETPVMSNVHLEINGTD--------ISYVLPKGPY 478 Query: 428 EHFNNDNVDAGDIGAGKHITLLFELTLNGQKASIDKLRYAPDNKLAKSDKTKELAWLKIR 487 F+ + + + + + I + + L ++ +A L + Sbjct: 479 TLFSGTALRISGVYYHEGVVNVTLSWEEKLNGVIVQNKIKYKFDLTRNSSFPFIATLWAQ 538 Query: 488 WKY 490 + Sbjct: 539 KRI 541 >UniRef50_B8AXM1 Putative uncharacterized protein n=1 Tax=Oryza sativa Indica Group RepID=B8AXM1_ORYSI Length = 614 Score = 189 bits (481), Expect = 2e-46, Method: Composition-based stats. Identities = 64/348 (18%), Positives = 135/348 (38%), Gaps = 29/348 (8%) Query: 192 NEQRTLLKVDILAKDR-KSEELPASNLVFLIDTSGSMISDERLPLIQSSLKLLVKELREQ 250 ++ + + + A + ++V ++D SGSM ERL ++ ++++ + +L Sbjct: 6 RQENFPVLIQVTAPPVLEGTARAGVDVVAVLDVSGSMEG-ERLEHVKEAMEIFIGKLGPD 64 Query: 251 DNIAIVTYAGDSRIALPS--ISGSHKAEINAAIDSLDAEGSTNGGAGLELAYQQATKGF- 307 D +++V++A R +S +A +D L A+GSTN GA L Sbjct: 65 DRLSVVSFATSVRRLTELTYMSEQGRAVAKEIVDGLVADGSTNMGAALLEGAMILRDRKG 124 Query: 308 ----IKGGINRILLATDGDFNVGIDDPKSIESMVKKQRESGVTLSTFGVGNSNYNEAMMV 363 G + ++ +DG + + K+ TFG+G S++N +M Sbjct: 125 ARDESNGRVGCMMFLSDGTND----------EIYKEDISGEFPAHTFGLG-SDHNPNVMR 173 Query: 364 RIADVGNGNYSYID-TLSEAQKVLNSEMRQMLITVAKDVKAQIEFNPA-WVTEYRQIGYE 421 IAD + YS+++ +++ + + + + VA V+ + + V+ R GY Sbjct: 174 HIADETSATYSFVNRNIADIKGAFDLFISGLTSVVATAVRVTVRAHAGAAVSSIRSGGYA 233 Query: 422 KRQLRVEHFNNDNVDAGDIGAGKHITLLFELTLNGQKAS----IDKLRYAPDNKLAKSDK 477 R + +D D+ AG+ + LT+ + + + + Sbjct: 234 HRVAA--DRLSGAIDIHDMYAGERKCFVVYLTVAEGRGGRKKRLLTVGGSYRRSSDVMSS 291 Query: 478 TKELAWLKIRWKYPQGKESQLVEFPLGPTINAPSEDMRFRAAVAAYGQ 525 L +++ P+ S + + A ++ VAA Sbjct: 292 QLMLRDVEVSVVRPRWWCS-AAGLAIHGEVAAELARIKLEDGVAAIAD 338 >UniRef50_A0LHW4 Vault protein inter-alpha-trypsin domain protein n=3 Tax=Deltaproteobacteria RepID=A0LHW4_SYNFM Length = 812 Score = 189 bits (481), Expect = 2e-46, Method: Composition-based stats. Identities = 70/480 (14%), Positives = 143/480 (29%), Gaps = 35/480 (7%) Query: 21 PQPENKESQQQQPSTPTEQQVLAAQQAAIKEAEQSAAAAKALAQQEVQQYSDKQALQGRL 80 + ++ +Q + + + + + + + + Sbjct: 140 REEARRDYEQAKSQGKSASLLEQQRPNVFQMNVANIMPGDEIKTELTYNELLSPTEGVYE 199 Query: 81 QEAPTFARAAKAKATHIANPGTARYQQFDDNPVKQVAQNPLATFSLDVDTGSYANVRRFL 140 PT + P + ++ NP + P +F++ T N + Sbjct: 200 FVYPTVVGPRYSNQPAAGAPASE---KWVQNPYLHEKEPPTYSFNI---TARL-NAGLPI 252 Query: 141 NQGLLPPPDAVRVEEIVNYFPSDWDIKDKQSIPASKPIPFAMRYELAPA-----PWNEQR 195 + P + E D +K + + + E + ++ Sbjct: 253 REITCPSHETAIRYEGQTRASVDLGANEKFGGNRDFVLKYRLSGESIDSGLLLYRGKDEN 312 Query: 196 TLLKVDILAKDRKSEELPASNLVFLIDTSGSMISDERLPLIQSSLKLLVKELREQDNIAI 255 L K +PA +F++D SGSM L + + L L+ L+ D + Sbjct: 313 FFLLTVQPPKRVVEAAIPAREYIFIVDVSGSMHGF-PLEISKRLLTDLIGGLKPTDCFNV 371 Query: 256 VTYAGDSRIALP---SISGSHKAEINAAIDSLDAEGSTNGGAGLELAYQQATKGFIKGGI 312 + ++GDS + S + I G T L+ A K +G Sbjct: 372 MLFSGDSTVMAERSVPASADNVRRAVEMIGRRQGGGGTELLPALKKALSLPRK---EGVS 428 Query: 313 NRILLATDGDFNVGIDDPKSIESMVKKQRESGVTLSTFGVGNSNYNEAMMVRIADVGNGN 372 +++ATDG V + +++ FG+G S N ++ +A G G Sbjct: 429 RSMVIATDGFVTV----EEEAFELIRSHI-GDANFFPFGIGTS-VNRMLIEGMARAGAGE 482 Query: 373 YSYIDTLSEAQKVLNSEMRQMLITVAKDVKAQI------EFNPAWVTEYRQIGYEKRQLR 426 I EA R + + +VKA + PA + + + Sbjct: 483 PFVITRPDEAPAGAEKFRRYIQSPLLTNVKADFGAFGVYDVEPAGIPDVLAERPVVIFGK 542 Query: 427 VEHFNNDNVDA-GDIGAGKHITLLFELTLNGQKASIDKLRYAPDNKLAKSDKTKELAWLK 485 + + G GAG+ + + G + A+ + L+ Sbjct: 543 WRGEPSGKITVKGSTGAGE---FAKTVDVGGSRPEEGNSVLKYLWARARITALTDRNQLR 599 >UniRef50_Q8YP40 Alr4360 protein n=15 Tax=Cyanobacteria RepID=Q8YP40_ANASP Length = 427 Score = 189 bits (481), Expect = 2e-46, Method: Composition-based stats. Identities = 72/335 (21%), Positives = 137/335 (40%), Gaps = 38/335 (11%) Query: 194 QRTLLKVDILAKDRKSEELPASNLVFLIDTSGSMISDERLPLIQSSLKLLVKELREQDNI 253 + L + I A + E+ NL ++D SGSM + L ++ +++ L+ L+ D I Sbjct: 21 SQRQLAISISAVAEQFEQNLPLNLCLILDQSGSMHG-QPLKMVVEAVEKLLDRLQPGDRI 79 Query: 254 AIVTYAGDSRIALPSISGSHKAEINAAI-DSLDAEGSTNGGAGLELAYQQATKGFIKGGI 312 ++V +AG + + +P+ + I I L A G T GL+ + KG +G + Sbjct: 80 SVVAFAGSATVIIPNQIVENPESIKTQIRKKLQASGGTVIAEGLQQGITELMKG-TRGAV 138 Query: 313 NRILLATDGD---------FNVGIDDPKSIESMVKKQRESGVTLSTFGVGNSNYNEAMMV 363 ++ L TDG + +G DD + KK + +T++T G GN N+N+ ++ Sbjct: 139 SQAFLLTDGHGEDSLKIWKWEIGPDDSRRCLEFAKKAAKINLTINTLGFGN-NWNQDLLE 197 Query: 364 RIADVGNGNYSYIDTLSEAQKVLNSEMRQMLITVAKDVKAQIEFNP----AWVTEYRQIG 419 IAD G G ++I+ +A N ++ + + P A + Q+ Sbjct: 198 TIADAGGGTLAHIERPEQAVHHFNRLFTRVQSVGLTNAYLTLSLAPQVRLAELRPIAQVA 257 Query: 420 YEKRQLRVEHFNNDN--VDAGDIGAGKHITLLFELTLNGQKASIDKLRYAPDNKLAKSDK 477 + +L VE + + V GD+ + +L L L Sbjct: 258 PDIIELPVEPEADGSFIVRLGDLMKDINRVVLANLYLGKLPEGQQV-------------- 303 Query: 478 TKELAWLKIRWKYPQGKESQLVE--FPLGPTINAP 510 + +++R+ P E L+ +P+ + Sbjct: 304 ---IGNVQVRYDNPSLNEEGLLSQTWPIYANVMQA 335 >UniRef50_UPI00016C377F protein containing a von Willebrand factor type A domain n=1 Tax=Gemmata obscuriglobus UQM 2246 RepID=UPI00016C377F Length = 821 Score = 189 bits (479), Expect = 3e-46, Method: Composition-based stats. Identities = 52/281 (18%), Positives = 101/281 (35%), Gaps = 14/281 (4%) Query: 186 LAPAPWNEQRTLLKVDILAKDRKSEELPASNLVFLIDTSGSMISDERLPLIQSSLKLLVK 245 L P + I + ++ A +LV ++DTS SM SD ++ + ++K + Sbjct: 242 LVYKPIQTEDGYFMFLISPQVEAEKKRVARDLVLVLDTSSSM-SDIKMQQAKKAVKFCLS 300 Query: 246 ELREQDNIAIVTYAGDSRIALPSISGSHKAE---INAAIDSLDAEGSTNGGAGLELAYQQ 302 +L+ +D +V ++ + ++ ID L G T L A Sbjct: 301 QLQPEDRFGVVRFSTTVTKFRSELVAANTDYLDLATKWIDGLKTSGGTAIWPALNDAL-- 358 Query: 303 ATKGFIKGGINRILLATDGDFNVGIDDPKSIESMVKKQRESGVTLSTFGVGNSNYNEAMM 362 A + ++ TDG V + I V + + TFGVG+ + N AM+ Sbjct: 359 AMRSSDPSRPFTMVFFTDGQPTVDETNADKIVKNVLAKNTGNTRIFTFGVGD-DVNAAML 417 Query: 363 VRIADVGNGNYSYIDTLSEAQKVLNSEMRQMLITVAKDVKAQ-------IEFNPAWVTEY 415 ++AD +Y+ + + ++ ++ V DV+ E P + + Sbjct: 418 DQLADSTRAVSTYVREAEDIEVKVSGLYAKISNPVLTDVQLATSENVQLHEIYPPKLPDL 477 Query: 416 RQIGYEKRQLRVEHFNNDNVDAGDIGAGKHITLLFELTLNG 456 Q R + + + L++E G Sbjct: 478 FQGTQLVVIGRYTGEGPSVIRLTGLVGKERQELVYEFNFPG 518 >UniRef50_UPI00006CAF43 von Willebrand factor type A domain containing protein n=1 Tax=Tetrahymena thermophila RepID=UPI00006CAF43 Length = 631 Score = 189 bits (479), Expect = 3e-46, Method: Composition-based stats. Identities = 61/296 (20%), Positives = 128/296 (43%), Gaps = 12/296 (4%) Query: 206 DRKSEELPASNLVFLIDTSGSMISDERLPLIQSSLKLLVKELREQDNIAIVTYAGDSRIA 265 ++ E +L+ +ID SGSM + L++ SLK L+K + E D I ++++ +I Sbjct: 134 PKEQSERVPMDLICVIDDSGSMSGKKA-QLVRKSLKYLLKIMNENDRICLISFDSVEKIL 192 Query: 266 LPSI--SGSHKAEINAAIDSLDAEGSTNGGAGLELAYQQATKGFIKGGINRILLATDGDF 323 P + + +K+E+ AI ++ GSTN AG+E K I + L +DG Sbjct: 193 TPFLRNNLENKSELKKAIKNIVGRGSTNIEAGMEAGLWMIKNRKEKNPITCMFLLSDGQD 252 Query: 324 NVGIDDPKSIESMVKKQRESGVTLSTFGVGNSNYNEAMMVRIADVGNGNYSYIDTLSEAQ 383 + D + + + + ++T+G G ++++ M IA+ G Y YI+ + + Sbjct: 253 DSPQVDLRVQKLIQSYDIQDTFIVNTYGYG-ADHDATQMRNIAETHKGGYYYIEDVKKVS 311 Query: 384 KVLNSEMRQMLITVAKDVKAQIEFNPAWVTE--YRQIGYEKRQLRVEHFNND--NVDAGD 439 + + +L V +DV+ +I+ + T+ Q+ + + + + Sbjct: 312 EWFVLSISGLLSAVGEDVRIRIKSQNSQQTKLAITQVYGGDQLWVKQDYEKGVFEIYLPH 371 Query: 440 IGAGKHITLLFELTLNGQKASIDKLRYAPDNKLAKSDKTKELAWLKIRWKYPQGKE 495 + G + +F+L N ++ + LA + E+ +K + + Sbjct: 372 LVIGDNKFFVFDLQFNSEQFKLTDQN----RSLAALEAELEIKAVKEEYIIRKNDT 423 >UniRef50_Q3IHK0 Putative uncharacterized protein n=2 Tax=Alteromonadales RepID=Q3IHK0_PSEHT Length = 664 Score = 188 bits (478), Expect = 4e-46, Method: Composition-based stats. Identities = 66/487 (13%), Positives = 147/487 (30%), Gaps = 50/487 (10%) Query: 23 PENKESQQQQPSTPTEQQVLAAQQAAIKEAEQSAAAAKALAQQEVQQYSDKQALQGRLQE 82 K+ + + + V + + A + + + Q + Sbjct: 119 AAEKKYAEAKQAGKQAALVRQQRANMFITNVANIAPGEQVIIELEYQEIIDYSSGTFTVR 178 Query: 83 APTFARAAKAKATHIANPGTARYQQFDDNP---------VKQVAQNPLATFSLDVDTG-- 131 P + + + P ++ P + F+L++D Sbjct: 179 FPGTITPRYHVTQGEIDINKESQKPTNSLPHGWLSPVYSTQKNDDKPSSQFNLNLDIDVG 238 Query: 132 -SYANVRRFLNQGLLPPPDAVRVEEIVNYFPS-DWDIKDKQSIPASKPIPFAMRYELAPA 189 ++ + + + +N + + D + + A E Sbjct: 239 LELVDINSKFHNVNIQNTAFGQYSIELNEQNALNRDFVLEFKPLQKEQAQAAFFTEQFEN 298 Query: 190 PWNEQRTLLKVDILAKDRKSEELPASNLVFLIDTSGSMISDERLPLIQSSLKLLVKELRE 249 E+ L + A + + + A VF++DTSGSM + +++L + L Sbjct: 299 --GERYGLAMLMPPADNFIATQRLARETVFVVDTSGSMHGQS-MEQAKNALFYALSLLDS 355 Query: 250 QDNIAIVTYAGDSRIALPS---ISGSHKAEINAAIDSLDAEGSTNGGAGLELAYQQATKG 306 D+ I+ + + SG + I L A+G T L+ Sbjct: 356 NDSFNIIGFDNVVTLMSDKPLVASGFNLRRAERFIYGLQADGGTEIQGALDA---VLDGS 412 Query: 307 FIKGGINRILLATDGDFNVGIDDPKSIESMVKKQRESGVTLSTFGVGNSNYNEAMMVRIA 366 G + +++ TDG + + ++ ++ + L T G+G++ N M R A Sbjct: 413 QFDGFVRQVIFLTDG----SVSNEDALFKSIQAKLGDS-RLFTVGIGSAP-NSFFMRRAA 466 Query: 367 DVGNGNYSYIDTLSEAQKVLNSEMRQMLITVAKDVKAQIEFNPAWVTEYRQIGYEKRQLR 426 DVG G++++I + SE Q + ++ ++ Sbjct: 467 DVGKGSFTFIGSTSEVQPKMQQLFDKLAHPAITNLAL-------------------SDEN 507 Query: 427 VEHFNNDNVDAGDIGAGKHITLLFELTLNGQ---KASIDKLRYAPDNKLAKSDKTKELAW 483 + D+ + I + +L + + + K +A Sbjct: 508 GNSLDFWPSPLPDLYFNEPIMVAIKLNNASNVILNGQTAQGPISINLNTQAGSNAKGIAK 567 Query: 484 LKIRWKY 490 L R K Sbjct: 568 LWARQKI 574 >UniRef50_UPI00006A1915 Transmembrane protein 110. n=2 Tax=Xenopus (Silurana) tropicalis RepID=UPI00006A1915 Length = 728 Score = 188 bits (478), Expect = 4e-46, Method: Composition-based stats. Identities = 59/353 (16%), Positives = 122/353 (34%), Gaps = 20/353 (5%) Query: 162 SDWDIKDKQSIPASKPIPFAMRYELAPAPW-NEQRTLLKVDILAKDRKSEELPASNLVFL 220 D + F ++Y++ + + S + N+VF+ Sbjct: 194 MDQQRTCPECTETLLDGDFLIKYDVKRENVAGNIQISNGYFVHYFAPASLQKVPKNVVFV 253 Query: 221 IDTSGSMISDERLPLIQSSLKLLVKELREQDNIAIVTYAGDSRIA---LPSISGSHKAEI 277 ID SGSM +++ + ++ +L E+D+ I+ + L + + Sbjct: 254 IDHSGSMHG-QKIKQTYEAFLKILADLPEEDHFGILIFDDKVDKWQNTLVKAVPDNIIKA 312 Query: 278 NAAIDSLDAEGSTNGGAGLELAYQQATKGF-----IKGGINRILLATDGDFNVGIDDPKS 332 + + A G T+ L A + K + IL +DG+ G+ + Sbjct: 313 KQFVSKISARGGTDINKALLAAVKMLKNTSRNKLLPKISTSIILFLSDGEPTSGVTNHNE 372 Query: 333 IESMVKKQRESGVTLSTFGVGNSNYNEAMMVRIADVGNGNYSYIDTLSEAQKVLNSEMRQ 392 I + VKK E TL G GN + + + ++A G I S+A L + Sbjct: 373 IINNVKKANERQTTLYCLGFGN-DVDFNFLEKMALENGGLARRIYEDSDAALQLQGFYNE 431 Query: 393 MLITVAKDVKAQ---IEFNPAWVTEYRQIGYEKRQLRVEHFNNDNVDAGDIGAGKHITLL 449 + + +V+ Q +R + H NNDN++ + G+ + Sbjct: 432 VANPLLLNVQLQYLDHSVGDVTQNNFRHYYQGSEIVVAGHINNDNLEC--LITGQGVEEQ 489 Query: 450 FELTLNGQKASID----KLRYAPDNKLAKSDKTKELAWLKIRWKYPQGKESQL 498 F + + ++ +L+Y + + + L + + E + Sbjct: 490 FSVNVQTNITEVEEATKELQYIFGDFTERLWAYLTIEQLLTQHRISAQGEDKE 542 >UniRef50_UPI00006CF36E U-box domain containing protein n=1 Tax=Tetrahymena thermophila RepID=UPI00006CF36E Length = 790 Score = 188 bits (477), Expect = 5e-46, Method: Composition-based stats. Identities = 63/357 (17%), Positives = 134/357 (37%), Gaps = 62/357 (17%) Query: 199 KVDILAKDRKSEELPASNLVFLIDTSGSM---------------ISDERLPLIQSSLKLL 243 +V I K + ++ A ++ +ID SGSM L L++ S+K + Sbjct: 120 QVKISIKTPEGQQRSACDICCVIDVSGSMSDEAKIKNSKGDIESNGLTILDLVKHSVKTI 179 Query: 244 VKELREQDNIAIVTYAGDSR--IALPSISGSHKAEINAAIDSLDAEGSTNGGAGLELAYQ 301 + L E+D +++V + ++ L ++ + + ++ L STN G+ A + Sbjct: 180 INNLDERDRLSLVAFHTNAYKITDLTPMNENGRNHAIKELEKLIPLDSTNIWDGIYQALE 239 Query: 302 QATK------GFIKGGI--NRILLATDGDFNVGIDDPKSIESMVKKQRESG---VTLSTF 350 + + ++ILL TDG NV P+ M+KK +E ++STF Sbjct: 240 VVKAGQQQSIQKGEQRVAFSQILLFTDGQPNV--IPPRGHLPMLKKYKEENDVNCSISTF 297 Query: 351 GVGNSNYNEAMMVRIADVGNGNYSYIDTLSEAQKVLNSEMRQMLITVAKDVKAQIEFNPA 410 G G N + ++ ++A G G++++I V + + ++ T+A D IE + Sbjct: 298 GFG-YNLDSELLDQLAIEGRGSFAFIPDGQFVGTVFVNALSNLMTTLAVDAVLCIENSNG 356 Query: 411 WVTEYRQIGYEKRQLRVEHFN------------NDNVDAGDIGAGKHITLLFELTLNGQK 458 E I E+ + + N++ G + G+ ++ + Sbjct: 357 AQFEEVLIEEEQAKNILNKETVLGNYDYQRCSWGLNINIGTLQYGQSKDIVVTMK----- 411 Query: 459 ASIDKLRYAPDNKLAKSDKTKELAWLKIRWKYPQGKES-QLVEFPLGPTINAPSEDM 514 ++ K ++++ + + + +E M Sbjct: 412 -------------NVNNNSNKPYITATLKYRTSSTHKQPEEISASSSDISQQENEVM 455 >UniRef50_C7NN24 von Willebrand factor type A n=1 Tax=Halorhabdus utahensis DSM 12940 RepID=C7NN24_HALUD Length = 592 Score = 188 bits (477), Expect = 6e-46, Method: Composition-based stats. Identities = 74/432 (17%), Positives = 156/432 (36%), Gaps = 58/432 (13%) Query: 130 TGSYANVRRFLNQGLLPPPDAVRVEEIVNYFPSDWDIKDKQSIPASKPIPFAMRYELAPA 189 + A+ RR + + LP PD++ VE + + +D + +A P Sbjct: 106 ASNVADFRRNVEEEYLPLPDSLPVEGLF--YNYYFDTGGTGECSSLFCPSYATAITADPL 163 Query: 190 PWNEQRTLLKVDILAKDRKSEELPASNLVFLIDTSGSMIS-------------------- 229 + R D + E ++V ++D SGSM S Sbjct: 164 GESTGRYFTVGLNSTLDTSTFERKRLDVVIVLDISGSMGSQFDQYYYDRFGNRHTVEEGD 223 Query: 230 -DERLPLIQSSLKLLVKELREQDNIAIVTYAGDSRIALPS--ISGSHKAEINAAI-DSLD 285 ++ + + +L L ++L D + +V + + +A P + + I I + ++ Sbjct: 224 SRSKMAVAKDALVALTEQLHPDDRVGVVLFNNEPTVAKPLRDVETTDMDAIRGHIREDIE 283 Query: 286 AEGSTNGGAGLELAYQQATKGFIKGGI---NRILLATDGDFNVGIDDPKSIESMVKKQRE 342 A G TN G+ A + R ++ TD N G D ++++ + E Sbjct: 284 AGGGTNIADGMAEAADMLGEYADSDPTEAETRQIVITDAMPNTGQTDDQALQDRLAGYAE 343 Query: 343 SGVTLSTFGVGNSNYNEAMMVRIADVGNGNYSYIDTLSEAQKVLNSEMRQMLITVAKDVK 402 G+ S GVG ++N ++ I V NY + + + + L E M+ + D+ Sbjct: 344 DGIHTSFVGVGV-DFNPELVDEITAVRGANYRSVHSAEDFETYLGEEFEYMVTPLVYDLS 402 Query: 403 AQIEFNPAWVTEYRQIGYEKRQLRVEHFNNDNVDAGDIG-----AGKHITLLFELTLNGQ 457 +++ A + E ++ + + G+ + + L+G+ Sbjct: 403 VELDAADAEIATV------YGSTAAEDATDELLSVNTLFPSPKSDGETRGGVVLVKLDGE 456 Query: 458 KASIDKLRYAPDNKLAKSDKTKELAWLKIRWKYPQGKESQLVEFPLGPTINAPSEDMRFR 517 + LR + +++ +D + ++ +EFP P + +R Sbjct: 457 ASGDMTLRASWEDRSGSTD-----------------ETTRTIEFPEEPPEYFANTGIRKA 499 Query: 518 AAVAAYGQKLRG 529 +A Y L+ Sbjct: 500 VLLARYADLLKN 511 >UniRef50_D0LJL4 Myxococcales GC_trans_RRR domain protein n=1 Tax=Haliangium ochraceum DSM 14365 RepID=D0LJL4_HALO1 Length = 602 Score = 188 bits (476), Expect = 6e-46, Method: Composition-based stats. Identities = 81/425 (19%), Positives = 156/425 (36%), Gaps = 67/425 (15%) Query: 136 VRRFLNQGLLPPPDAVRVEEIVNYFPSDWDIKDKQSIPASKPIPFAMRYELAP-----AP 190 R L G +P + + PA P ++ A A Sbjct: 53 FRSILEAGGIPAASTLDAAGFFAEH-------YVEMPPADCGQPLCLQAMSARGNEWTAN 105 Query: 191 WNEQRTLLKVDI-LAKDRKSEELPASNLVFLIDTSGSMISDERLPLIQSSLKLLVKELRE 249 E++ +L V + E +LV ++DTSGSM +D R+ ++ L LLV + E Sbjct: 106 GVEEQIVLAVAMNTPIGPDDIEPRPLDLVVVVDTSGSMATDARMDYVRQGLHLLVDAVDE 165 Query: 250 QDNIAIVTYAGDSRIALPSI-----------------------------------SGSHK 274 D +A+V+Y + + + + Sbjct: 166 DDRLALVSYQSFAEVHAELPALPVEETPEEPTEPTDPVGEPTDPPADPDEDPVDEREAWR 225 Query: 275 AEINAAIDSLDAEGSTNGGAGLELAYQQATKGFI--KGGINRILLATDGDFNVGIDDPKS 332 +E++A +D+L G TN GLE ++ A + + R++L +DG GI D S Sbjct: 226 SEMHALVDTLQPGGGTNIYEGLERGFEIAKEARVNHPDRAQRVILLSDGLATEGITDSAS 285 Query: 333 IESMVKKQRESGVTLSTFGVGNSNYNEAMMVRIADVGNGNYSYIDTLSEAQKVLNSEMRQ 392 I ++ + E G+ L+T GVG S +N +M +A+ G GN+ +++ ++V E+ Sbjct: 286 IIALSEAFIEGGMGLTTVGVGAS-FNVELMRGLAERGAGNFYFVEDPEAVREVFTEELDY 344 Query: 393 MLITVAKDVKAQIEFNPAWVTEYRQIGYEKRQLRVEHFNNDNVDAGDIGAGKHITLLFEL 452 +A V ++ + L N+ ++ + + Sbjct: 345 FAEPLATAVSIEVRTTDGYGLGEVVGTR----LWSTEGNSGSMYLPAVFVASRKS----- 395 Query: 453 TLNGQ---KASIDKLRYAPDNKLAKSDKTKELAWLKIRWK----YPQGKESQLVEFPLGP 505 + G+ + + + P + ++ A + +R+ P ++SQ E + Sbjct: 396 SAPGEYGGRRGGGGMLFLPLYPSIDTGFSEAAALVTLRYSAADGAPGSEQSQTTEVIIPA 455 Query: 506 TINAP 510 A Sbjct: 456 RFGAS 460 >UniRef50_B8G546 von Willebrand factor type A n=6 Tax=Chloroflexi (class) RepID=B8G546_CHLAD Length = 418 Score = 187 bits (475), Expect = 9e-46, Method: Composition-based stats. Identities = 72/394 (18%), Positives = 151/394 (38%), Gaps = 18/394 (4%) Query: 175 SKPIPFAMRYELAPAPWNEQRTLLKVDILAKDR-KSEELPASNLVFLIDTSGSMISDERL 233 S + ++ P P + ++ + + A NL F++D SGSM +L Sbjct: 2 SASVTLRCQWGRTPVPTSSTPQVVYLLVEAVAPASPTSALPLNLCFVLDRSGSMQG-AKL 60 Query: 234 PLIQSSLKLLVKELREQDNIAIVTYAGDSRIALPSISGSHKAEINAAIDSLDAEGSTNGG 293 ++++ + +++ LR D AIV + + +P+ ++ + AA++++ G T Sbjct: 61 ESMKAATRRVIELLRPHDVAAIVIFDDTVQTLIPATPVGDRSALLAAVETITEAGGTAMS 120 Query: 294 AGLELAYQQATKGFIKGGINRILLATDGDFNVGIDDPKSIESMVKKQRESGVTLSTFGVG 353 G++ A + K I+R+LL TDG D + + ++GV ++ G+G Sbjct: 121 LGMQAAQTELQKHLGPDRISRMLLLTDGQTW---GDEPICRDLARTLGQAGVRITALGLG 177 Query: 354 NSNYNEAMMVRIADVGNGNYSYIDTLSEAQKVLNSEMRQMLITVAKDVKAQIEFN-PAWV 412 +NE ++ IA +G YI ++ + +++ VA D + + Sbjct: 178 T-EWNEQLLDDIAAASDGYSDYIADPAQIETFFQQAVKEAQAVVATDARLLLRLVRDVTP 236 Query: 413 TEYRQIGYEKRQLRVEHFNND--NVDAGDIGAGKHITLLFELTLNGQKAS-----IDKLR 465 ++ L + + V GD+ G+ +L +L L + +L Sbjct: 237 RAIYRVKPVIANLGYQPIGDAAVAVRLGDLVGGQPAAVLLDLMLPPRTRGRFRIAQAELH 296 Query: 466 YAPDNKLAKSDKTKELAWLKIRWKYPQG---KESQLVEFPLGPTINAPSEDMRFRAAVAA 522 P ++ +++ +++ P+ LVE + + A Sbjct: 297 LTPVDQRSETVIKQDILLDVADQAGPESYVPDVMNLVERVTAFKLQTRALSEAASGNTAG 356 Query: 523 YGQKLRGSE-YLNNTSWQQIKQWAQQAKGEDPQG 555 QKLR + L + ++ Q QG Sbjct: 357 ATQKLRAAATRLLDLGELELAAKMNQQAATLEQG 390 >UniRef50_B5JPY1 von Willebrand factor type A domain protein n=1 Tax=Verrucomicrobiae bacterium DG1235 RepID=B5JPY1_9BACT Length = 632 Score = 187 bits (475), Expect = 9e-46, Method: Composition-based stats. Identities = 82/373 (21%), Positives = 159/373 (42%), Gaps = 24/373 (6%) Query: 136 VRRFLNQGLLPPPDAVRVEEIVNY--FPSDWDIKDKQSIPASKPIPFAMRYELAPAPWNE 193 +R +++G++P P + E + + P D K+ + A +E A P + Sbjct: 62 LRNLIDEGIIPSPASFTAEGLFSEHDLPIGGDAKEGWLFDIASQ---ATSFESAAQPKVD 118 Query: 194 QRTLLKVDILAKDRKSEELPASNLVFLIDTSGSMISDERLPLIQSSLKLLVKELREQDNI 253 L + D + + NLV ++D SGSM + L L++ SL+ +V +L D + Sbjct: 119 ILAQLGF-VSGIDATTFKPAPLNLVAVVDKSGSMSG-DPLELVRKSLRQVVSQLGSDDQL 176 Query: 254 AIVTYAGDSRIALPS--ISGSHKAEINAAIDSLDAEGSTNGGAGLELAYQQATKGFIK-G 310 +IV Y + I L S ++ +I A+ID + + GST AGLEL YQ A + Sbjct: 177 SIVLYGSSTHIHLEPTKTSTENRDQIIASIDRIQSHGSTAMEAGLELGYQVARQSADAFV 236 Query: 311 GINRILLATDGDFNVGIDDPKSIESMVKKQRESGVTLSTFGVGNSNYNEAMMVRIADVGN 370 G R++L TD NVG D +M + +S + L+T GVG ++ + +I+ V Sbjct: 237 GKTRVMLFTDERPNVGRTDATGFMAMAESGSKSDIGLTTIGVGV-HFGAELAEKISSVRG 295 Query: 371 GNYSYIDTLSEAQKVLNSEMRQMLITVAKDVKAQIEFNPAWVTEYRQIGYEKRQLRVEHF 430 GN + D + E+ M++ +A D+ ++ + G + Sbjct: 296 GNLFFFDDDESMETTFRKELDTMVLELAYDMSLKVTPAEGFFLSG-LYGIPGDAVTWADD 354 Query: 431 NNDNVDAGDIGAGKHITLLFELTLNGQKASIDKLRYAPDNKLAKSDKTKELAWLKIRWKY 490 + +++ + A ++ ++ L + + + L + +A +I ++ Sbjct: 355 GSLSLEIATLFASRNKGAIY-LGFDRTAQATESLPHTSL-----------IATAEISYQQ 402 Query: 491 PQGKESQLVEFPL 503 ++ P+ Sbjct: 403 ADNNLTRQSSLPI 415 >UniRef50_Q498Q0 Inter-alpha (Globulin) inhibitor H3 n=6 Tax=Clupeocephala RepID=Q498Q0_DANRE Length = 892 Score = 187 bits (475), Expect = 9e-46, Method: Composition-based stats. Identities = 66/392 (16%), Positives = 139/392 (35%), Gaps = 25/392 (6%) Query: 36 PTEQQVLAAQQAAIKEAEQSAAAAKALAQQEVQQYSDKQALQGRLQEAPTFARAAKAKAT 95 P + + + + K AQQE Q + G + K T Sbjct: 68 PKNAFISQFRMTIDGKTYDAIVKEKEEAQQEYSQAVSRGESAGLVSAVGRTLEEFKTSVT 127 Query: 96 HIANPGTARYQQFDDNPVKQVAQ----------NPLATFSLDVD-----TGSYANVRRFL 140 AN +++ +++ + P+A F +D+ S V+ L Sbjct: 128 VAANSKVTFELTYEELLKRRLGKYKLLINAQPMQPVADFKIDIHIHESAGISLLEVKGGL 187 Query: 141 NQGLL---PPPDAVRVEEIVNYFPSDWDIKDKQSIPASK-PIPFAMRYELAPA-PWNEQR 195 N L + + V ++P+ KD + + Y++ + + Sbjct: 188 NTKDLANAVTTTRAQEDAWVKFYPTRDQQKDCDDCTKNGLNGNLVIMYDVERVKQSGDFK 247 Query: 196 TLLKVDILAKDRKSEELPASNLVFLIDTSGSMISDERLPLIQSSLKLLVKELREQDNIAI 255 + + N+VF+ID SGSM ++ + ++ ++ +L + D + Sbjct: 248 VANGYFVHYFAPTDVQRIPKNVVFIIDQSGSMQG-NKIEQTRMAMLRILSDLAKDDYFGL 306 Query: 256 VTYAGDSRIALP---SISGSHKAEINAAIDSLDAEGSTNGGAGLELAYQQATKGFIKGGI 312 +T++ + P + + E + + + G+T+ + A + +G Sbjct: 307 ITFSSHIQAWKPELLKATAENVEEAKTFVKQIRSGGATDINGAVLNAVNMINQYTQEGSA 366 Query: 313 NRILLATDGDFNVGIDDPKSIESMVKKQRESGVTLSTFGVGNSNYNEAMMVRIADVGNGN 372 + ++L TDGD G+ +P +I+ VK L G G N + +++ NG Sbjct: 367 SILILLTDGDPTSGVTNPVTIQQNVKTAIGGKYPLYCLGFGF-NVRFEFLEKMSLENNGA 425 Query: 373 YSYIDTLSEAQKVLNSEMRQMLITVAKDVKAQ 404 I S+A L ++ I + +V+ Sbjct: 426 ARRIYEDSDADLQLQGFYEEVAIPLLTNVQLN 457 >UniRef50_A3QDW1 Vault protein inter-alpha-trypsin domain protein n=1 Tax=Shewanella loihica PV-4 RepID=A3QDW1_SHELP Length = 776 Score = 187 bits (475), Expect = 1e-45, Method: Composition-based stats. Identities = 80/534 (14%), Positives = 162/534 (30%), Gaps = 86/534 (16%) Query: 24 ENKESQQQQPSTPTEQQVLAAQQAAIKEAEQSAAAAKALAQQEVQQYSDKQALQGRLQEA 83 + +Q + S V + + +AL + Q + Sbjct: 169 ARQTFEQAKASGKKASLVSQQRPNIFTSEVANLGPGEALVVEIAYQQQVRYQDGEFSLRF 228 Query: 84 PTFARAAKAKATHIANPGTARYQQFD-------------------------------DNP 112 PT + + A + P Sbjct: 229 PTAITPRYFPKGQVPDLEQASVNDIQGLNVLNESTQSDEQKLYLDVVLDAGMAISRLETP 288 Query: 113 VKQVAQNPLATFSLDVD-TGSYANVRRFLNQ-----GLLPPPDAVRVEEIVNYFPSDWDI 166 Q+ Q L + ++ T + R FL + P + F + Sbjct: 289 YHQMRQTQLGGTKIGLNLTANLRPDRDFLLKWRPLLAEQPSAVMFAQLGKTHEFKTHEFK 348 Query: 167 KDKQSIPASKPIPFAMR-----YELAPAPWNEQRTLLKVDILAKDRKSEELPASNLVFLI 221 ++ + + E + + + V ++ K+ L +I Sbjct: 349 NEESLASSQAHNDQVVSEANHPAEAQASDKEAKDSYALVMLMPPQDKARVRLPRELTLVI 408 Query: 222 DTSGSMISDERLPLIQSSLKLLVKELREQDNIAIVTYAGDSRIALP---SISGSHKAEIN 278 DTSGSM + + +S++ + L QD ++ + R P S + ++ + N Sbjct: 409 DTSGSMTG-DSIAQAKSAILNALAGLGSQDTFNVIAFDSSVRSLSPVALSATAANLGKAN 467 Query: 279 AAIDSLDAEGSTNGGAGLELAYQQ-------ATKGFIKGGINRILLATDGDFNVGIDDPK 331 + SL+A+G T L A Q + + +++ TDG + + Sbjct: 468 LFVQSLEADGGTEMAPALLRALSQPESGVSSISSAVKPERLKQVVFITDGA----VGNEA 523 Query: 332 SIESMVKKQRESGVTLSTFGVGNSNYNEAMMVRIADVGNGNYSYIDTLSEAQKVLNSEMR 391 S+ +++ L T G+G + N M R A G G Y+Y+ +SE + + Sbjct: 524 SLFALIAANIGRQ-RLFTVGIGAAP-NGYFMERAARAGRGTYTYVGKISEVDAKIGELLE 581 Query: 392 QMLITVAKDVKAQIEFNPAWVTEYRQIGYEKRQLRVEHFNNDNVDAGDIGAGKHITLLFE 451 ++ DV ++ + +Y V GD+ A + I + Sbjct: 582 KIESPQISDVTLTLD--DGSIPDY-----------------WPVQIGDLYAHEPIMVALR 622 Query: 452 LTLNGQ--------KASIDKLRYAPDNKLAKSDKTKELAWLKIRWKYPQGKESQ 497 LT + +D + L DK + + + R + + S+ Sbjct: 623 LTPAQRSRSDALIISGDLDGKNWQRRLALTGGDKPRGIDLIWARNQIASLQLSK 676 >UniRef50_B1HPN0 Putative uncharacterized protein n=2 Tax=Bacillaceae RepID=B1HPN0_LYSSC Length = 825 Score = 186 bits (472), Expect = 2e-45, Method: Composition-based stats. Identities = 72/374 (19%), Positives = 142/374 (37%), Gaps = 42/374 (11%) Query: 196 TLLKVDILAKDRKSEELPASNLVFLIDTSGSMISDERLPLIQSSLKLLVKELREQDNIAI 255 + + + + E+LP+ LV ++D SGSM S +L L + + V+ LR++D + Sbjct: 348 IETLLPVEMEIKGKEQLPSLGLVIVLDRSGSM-SGSKLELAKEAAARSVEMLRDEDTLGF 406 Query: 256 VTYAGDSRIALPSISGSHKAEINAAIDSLDAEGSTNGGAGLELAYQQATKGFIKGGINRI 315 + + + + ++K E I S+ G T L AY+ ++ I Sbjct: 407 IAFDDRPWEIIETGPLNNKEEAVDTILSVTPGGGTEIYGSLAKAYENLADMKLQ--RKHI 464 Query: 316 LLATDGDFNVGIDDPKSIESMVKKQRESGVTLSTFGVGNSNYNEAMMVRIADVGNGNYSY 375 +L TDG P + + ++++ +++G+TLST +G + + ++ ++++G+G + Sbjct: 465 ILLTDGQ-----SQPGNYDDLIEQGKDNGITLSTVAIGQ-DADANLLEALSEMGSGRFYN 518 Query: 376 IDTLSEAQKVLNSEMRQMLITVAKDVKAQIEFNPAWVTEYRQIGYEKRQLRVEHFNNDNV 435 + +L+ E + T +D NP + Y G+ L N Sbjct: 519 VIDEQTIPSILSRETAMISRTYIED-------NPFYPVLYNAGGW--NTLFANGVPQMNA 569 Query: 436 DAGDIGAGKHITLLFELTLNGQKASIDKLRYAPDNKLA-KSDKTKELAWLKIRWKYPQGK 494 G A + T++ E + + + +Y A SD T + + RW+ Sbjct: 570 YIGT-TAKQGATVVAESE--KEDPVLAQWQYGLGKTFAFTSDSTGKWSGDWARWQDWGTF 626 Query: 495 ESQLVEFPLGPTINAP------------------SEDMRFRAAVAAYGQKLRGSEYLNNT 536 L+ L + AV G++L L Sbjct: 627 WQTLISQMLPSYNDVAYDVRVESDGSFMITDPTNEAAFLDIVAVNEAGEEL--DTQLETI 684 Query: 537 SWQQIKQWAQQAKG 550 S Q++ Q G Sbjct: 685 SASQVRAVVQAEPG 698 Score = 54.4 bits (129), Expect = 1e-05, Method: Composition-based stats. Identities = 27/150 (18%), Positives = 54/150 (36%), Gaps = 20/150 (13%) Query: 217 LVFLIDTSGSMISDERLPLIQSSLKLLVKELREQ-DN--IAIVTYAGDSRIALPSISGSH 273 +V+L+D S SM E ++ + L+ + D + +++ + +I Sbjct: 28 IVYLVDRSASMNGTED-----EMVQFIQDSLQSKKDEQLAGLYSFSS--TLQTEAIMTKT 80 Query: 274 KAEINAAIDSLDAEGSTNGGAGLELAYQQATKGFIKGGINRILLATDGDFNVGIDDPKSI 333 E+ + + A TN L+LA R++L TDG+ G S Sbjct: 81 LKEVPKFTE-IKATDQTNIEQSLQLATGII----DPKKATRLVLLTDGNETKG-----SA 130 Query: 334 ESMVKKQRESGVTLSTFGVGNSNYNEAMMV 363 K + S +++ N+ + Sbjct: 131 LDFATKFKGSNISVDVVPFSQPVVNDVSLK 160 >UniRef50_Q2QSE5 Os12g0431700 protein n=3 Tax=Oryza sativa RepID=Q2QSE5_ORYSJ Length = 524 Score = 186 bits (472), Expect = 2e-45, Method: Composition-based stats. Identities = 70/375 (18%), Positives = 139/375 (37%), Gaps = 38/375 (10%) Query: 212 LPASNLVFLIDTSGSMISDERLPLIQSSLKLLVKELREQDNIAIVTYAGDSRIA--LPSI 269 +LV ++D SGSM ++ ++ +L+ ++ +L D ++IVT+ ++ L ++ Sbjct: 58 REGLDLVAVVDVSGSMRG-HKIESVKKALQFVIMKLTPVDRLSIVTFESSAKRLTKLRAM 116 Query: 270 SGSHKAEINAAIDSLDAEGSTNGGAGLELAYQQATKG-FIKGGINRILLATDGDFNVGID 328 + + E++ + SL A G T+ AGL+L F + I L +DG Sbjct: 117 TQDFRGELDGIVKSLIANGGTDIKAGLDLGLAVLADRVFTESRTANIFLMSDGKLEGKTS 176 Query: 329 -DPKSIESMVKKQRESGVTLSTFGVGNSNYNEAMMVRIADVG-NGNYSYIDTLSEAQKVL 386 DP + V++ TFG G+ + ++ IA G YS + + Sbjct: 177 GDPTQV-------NPGEVSVYTFGFGHGT-DHQLLTDIAKNSPGGTYSTVPDGTNLSAPF 228 Query: 387 NSEMRQMLITVAKDVKAQI--EFNPAWVTEYRQIGYEKRQLRVEHFNNDNVDAGDIGAGK 444 + + ++ VA+DV+ + + + + + + G + +G+ Sbjct: 229 ATLLGGLVTVVAQDVRLTLTPKTADGDLDKMEVADGTDYTQTTDAKGEITIKFGTLFSGE 288 Query: 445 HITLLFELTLNGQ------KASIDKLRYAPDNKLAKSDKTKELAWLKIRWKYPQGKESQL 498 + TLN A++ R++ + A + K P + Sbjct: 289 TRKVAVNFTLNESPDTEEYNATLAVARHSYAAQEAPQPAQNIVRLRKPEPTTPGSDDGIE 348 Query: 499 VEFPLGPTINAPSEDMRFRAAVAAYGQKLRGSEYLNNTSWQQIKQWAQQAKGE--DPQGY 556 + D+ +L + L + + I AQ A G+ G Sbjct: 349 ERSVQAEVVRRRHADL------IGKASELANGQKLGDAR-ETIMD-AQNALGDILLDDGD 400 Query: 557 R------AEFIRLIE 565 R AE +RL+E Sbjct: 401 RMVNALQAELLRLLE 415 >UniRef50_P79263 Inter-alpha-trypsin inhibitor heavy chain H4 n=3 Tax=Theria RepID=ITIH4_PIG Length = 921 Score = 186 bits (472), Expect = 2e-45, Method: Composition-based stats. Identities = 74/517 (14%), Positives = 167/517 (32%), Gaps = 48/517 (9%) Query: 19 CGPQPENKESQQQQPSTPTEQQVLAAQQAAIKEAEQSAAAAKALAQQEVQQYSDKQALQG 78 Q + + + + +Q + + AA E + Sbjct: 98 AAAQEQYSAVARGESAGLVRATGRKTEQFQVAVSVAPAAKVTFELVYEELLARHLGVYEL 157 Query: 79 RLQEAPTFARAAKAKATHIANPGTARYQQFDDNPVKQVAQNPLATFSLDVDTGSYANVRR 138 L+ P HI P + + ++ T S + Sbjct: 158 LLKIQPQQLVKHLQMDIHIFEPQGISFLE-TESTFMTNELAEALTISQNKTKA------- 209 Query: 139 FLNQGLLPPPDAVRVEEIVNYFPSDWDIKDKQSIPASKPIPFAMRYELAPA-PWNEQRTL 197 +R + ++ K + F +RY++ + Sbjct: 210 -----------HIRFKPTLSQ-----QQKSPEQQETVLDGNFIVRYDVNRTVTGGSIQIE 253 Query: 198 LKVDILAKDRKSEELPASNLVFLIDTSGSMISDERLPLIQSSLKLLVKELREQDNIAIVT 257 + + N++F+IDTSGSM ++ + +L ++ +L +D +V+ Sbjct: 254 NGYFVHYFAPEVWSAIPKNVIFVIDTSGSMRGR-KIQQTREALIKILGDLGSRDQFNLVS 312 Query: 258 YAGDS-RIALPSISGSHKAEINAAIDSLDAEGSTNGGAGLELAYQQATKGFIKG-----G 311 ++G++ R + S + E + + A+G TN + +A Q + + Sbjct: 313 FSGEAPRRRAVAASAENVEEAKSYAAEIHAQGGTNINDAMLMAVQLLERANREELLPARS 372 Query: 312 INRILLATDGDFNVGIDDPKSIESMVKKQRESGVTLSTFGVGNSNYNEAMMVRIADVGNG 371 + I+L TDGD VG +P I+ V++ + +L G G + A + ++A G Sbjct: 373 VTFIILLTDGDPTVGETNPSKIQKNVREAIDGQHSLFCLGFGF-DVPYAFLEKMALENGG 431 Query: 372 NYSYIDTLSEAQKVLNSEMRQMLITVAKDVKAQIEFN---PAWVTEYRQIGYEKRQLRVE 428 I S++ L +++ + + V + N +R + Sbjct: 432 LARRIYEDSDSALQLEDFYQEVANPLLRLVAFEYPSNAVEEVTQDNFRLFFKGSELVVAG 491 Query: 429 HFNNDNVDA------GDIGAGKHITLLFELTLNGQKASIDKLRYAPDNKLAKSDKTKEL- 481 + + D G + +++T + E + Q+A +Y + + + + Sbjct: 492 KLRDQSPDVLSAKVRGQLHM-ENVTFVMESRVAEQEAEFLSPKYIFHSFMERLWAYLTIQ 550 Query: 482 ----AWLKIRWKYPQGKESQLVEFPLGPTINAPSEDM 514 + + E++ + L + P M Sbjct: 551 QLLAQTVSASDAEKKALEARALSLSLNYSFVTPLTSM 587 >UniRef50_A4C730 Putative uncharacterized protein n=1 Tax=Pseudoalteromonas tunicata D2 RepID=A4C730_9GAMM Length = 684 Score = 185 bits (469), Expect = 4e-45, Method: Composition-based stats. Identities = 63/453 (13%), Positives = 141/453 (31%), Gaps = 43/453 (9%) Query: 22 QPENKESQQQQPSTPTEQQVLAAQQAAIKEAEQSAAAAKALAQQEVQQYSDKQALQGRLQ 81 + + + + V + + A + + V Q Sbjct: 119 KQAEQAYEVAKQQGKKASLVSQKRPNLFVSQVANIEAGQTVEVTLVYQQLLHYEQGEFTV 178 Query: 82 EAPTFARAAKAKATHIANPGTA----RYQQFDDNPVKQVAQNPLA--------------T 123 P + + + P + A + Sbjct: 179 RFPMLVAPRYQPKRLVFDADLQGNWPSASEITSAPFIDELETQSAKAVGQGMSDAKIKQS 238 Query: 124 FSLDVDTG-SYANVRRFLNQG--LLPPPDAVRVEEIVNYFPSDWDIKDKQSIPASKPIPF 180 S+ ++ G + ++ L + +V ++ D + I Sbjct: 239 VSIALNLGFELDTIMSPYHEINQQLIGNNHYQVSLKQGTTFANRDFVLRVKPKNQAAIQA 298 Query: 181 AMRYELAPAPWNEQRTLLKVDILAKDRKSEELPASNLVFLIDTSGSMISDERLPLIQSSL 240 A+ E N+ L+ + + + + + ++F+IDTSGSM + L +S+L Sbjct: 299 AVFKEHF---ENDDYALVMLMPPSDEFIAAQRLPREVIFVIDTSGSMHGES-LEQAKSAL 354 Query: 241 KLLVKELREQDNIAIVTYAGDSRIA---LPSISGSHKAEINAAIDSLDAEGSTNGGAGLE 297 + L QD+ I+ + + + + L A+G T G E Sbjct: 355 FFALANLDPQDSFNIIEFNSKVNALNAQALPANDFNIRRARNFVYGLKADGGTEIGLAFE 414 Query: 298 LAYQQATKGFIKGGINRILLATDGDFNVGIDDPKSIESMVKKQRESGVTLSTFGVGNSNY 357 Q + +I+ TDG I + + + +K + T G+G++ Sbjct: 415 ---QVLDNSEHADYLRQIVFLTDG----SISNETEVFAQIKGSLGDS-RIFTIGIGSAP- 465 Query: 358 NEAMMVRIADVGNGNYSYIDTLSEAQKVLNSEMRQMLITVAKDVKAQ------IEFNPAW 411 N M R A +G G +++I +++ Q+ + + Q+ K++ ++F P Sbjct: 466 NSYFMTRAATLGRGTFTFIGDVTDVQRTMKNLFVQLANAALKELIITDENGDALDFWPKP 525 Query: 412 VTEYRQIGYEKRQLRVEHFNNDNVDAGDIGAGK 444 + + +++ N G G+ Sbjct: 526 IADLYFNQAMMVAIKLNAGQNQINVRGQQAFGQ 558 >UniRef50_A6WMD3 Vault protein inter-alpha-trypsin domain protein n=11 Tax=Shewanella RepID=A6WMD3_SHEB8 Length = 772 Score = 184 bits (468), Expect = 5e-45, Method: Composition-based stats. Identities = 72/528 (13%), Positives = 165/528 (31%), Gaps = 87/528 (16%) Query: 22 QPENKESQQQQPSTPTEQQVLAAQQAAIKEAEQSAAAAKALAQQEVQQYSDKQALQGRLQ 81 + +Q + V + + A + L + Q S K Sbjct: 154 AEAKQIFEQAKAEGKRASLVSQERPNMFTTEVANLAPQEELIVEISYQESIKYEDGLFSL 213 Query: 82 EAPTFARAAKAKATHIANPGTARYQQFDDNPVK---------------------QVAQNP 120 P + + ++P Sbjct: 214 RFPLVVAPRYIPGLTSDASASDSNNPIAQSSQSSRVTSSQVFDADLIVAPVRDGASGRDP 273 Query: 121 LATFSLDVDTG---SYANVRRFLNQGLLPPPDAVRVE-EIVNYFPSDWDIKDKQ------ 170 + + V A++ + L ++ V+ + P++ D + Sbjct: 274 VLKADIQVLLAKGVDKASIESPYHDIKLKQTNSGAVDVSLAQRVPANRDFVLQWRVQQGT 333 Query: 171 -------------SIPASKPIPFAMRYELAPAPWNEQRTLLKVDILAKDRKSEELP--AS 215 P + E + A + V +L + P Sbjct: 334 SPMAWVFNQQGKTHKPDGDNLSQD-TLETSKANGMNEDNYSLVMVLPPKVEKSTQPSLPR 392 Query: 216 NLVFLIDTSGSMISDERLPLIQSSLKLLVKELREQDNIAIVTYAGDSRIALP---SISGS 272 L+ +IDTSGSM + + +++L +K L+ +D+ I+ + + + S Sbjct: 393 ELILVIDTSGSMAG-DSIVQAKNALLYALKGLKPEDSFNIIEFNSSLSLLSATPLPATSS 451 Query: 273 HKAEINAAIDSLDAEGSTNGGAGLELAYQQATKGFIKGGI---NRILLATDGDFNVGIDD 329 + + + L A+G T L+ A ++ + +++ TDG + + Sbjct: 452 NLSRARQFVSRLQADGGTEMALALDAALPKSLGSVSPDAVQPLRQVIFMTDG----SVGN 507 Query: 330 PKSIESMVKKQRESGVTLSTFGVGNSNYNEAMMVRIADVGNGNYSYIDTLSEAQKVLNSE 389 +++ +++ Q L T G+G++ N M R A++G G ++YI + E +++ Sbjct: 508 EQALFDLIRYQIGES-RLFTVGIGSAP-NSHFMQRAAELGRGTFTYIGKVDEVDAKISAL 565 Query: 390 MRQMLITVAKDVKAQIEFNPAWVTEYRQIGYEKRQLRVEHFNNDNVDAGDIGAGKHITLL 449 + ++ V D Q+ ++ V +Y D+ G+ + + Sbjct: 566 LSKIQYPVLTD--IQVRYDDGSVPDY-----------------WPSPIADLYRGEPVLVS 606 Query: 450 F--------ELTLNGQKASIDKLRYAPDNKLAKSDKTKELAWLKIRWK 489 EL ++G++ + + + T + A L + W Sbjct: 607 LKRSAREPQELVISGRQGHKNWQQSLSLQDNSAGLITNQGAGLDLLWA 654 >UniRef50_A8FW78 Vault protein inter-alpha-trypsin domain protein n=1 Tax=Shewanella sediminis HAW-EB3 RepID=A8FW78_SHESH Length = 770 Score = 184 bits (467), Expect = 7e-45, Method: Composition-based stats. Identities = 77/512 (15%), Positives = 151/512 (29%), Gaps = 98/512 (19%) Query: 23 PENKESQQQQPSTPTEQQVLAAQQAAIKEAEQSAAAAKALAQQEVQQYSDKQALQGRLQE 82 K +Q + + V + K + + + L + Q Sbjct: 131 EAKKIYEQAKSAGKRASLVEHNRPNIFKTSVANLGPDELLVVEISYQELVDYDEGEFSLR 190 Query: 83 APTFARAAKAKATHIANPGTARYQQFDDNPVK-------QVAQNPLATFSLDVDTGSYA- 134 P P + Q F + + + L+ + SY Sbjct: 191 FPMVVNPRYYPQGGSKQPDDFQNQGFQYQDFQYRGLTGDESWLDKLSAMEASLRGESYGV 250 Query: 135 --------NVRRFLNQGLLPPP-----DAVRVEEIVNY---------FPSDWDIKDKQSI 172 N+R L+ G+ + + N ++ D + Sbjct: 251 RSHNAPTVNIRVNLDAGVELSEITSAYHTIDKTPLDNTGYQIHLASQVAANRDFVLRWKP 310 Query: 173 PASKPIPFAMR------------------------YELAPAPWNEQRTLLKVDILAKDRK 208 A A+ EL P + L + + K Sbjct: 311 VAGSEPTAAVFAQKGQTYSSSSTQKNSSEQHVKSDPELNAKPDAD--YALVMLLPPSLEK 368 Query: 209 SEELPASNLVFLIDTSGSMISDERLPLIQSSLKLLVKELREQDNIAIVTYAGDSRIALP- 267 S + L+ +IDTSGSM S + + ++K + L D ++ + Sbjct: 369 SRNRVSRELILVIDTSGSM-SGSAMEQAKKAMKYALAGLGSDDTFNVIEFNSKVSSLSKG 427 Query: 268 --SISGSHKAEINAAIDSLDAEGSTNGGAGLELAYQQA-------------TKGFIKGGI 312 S + N + SL ++G T LE A Q + Sbjct: 428 PIPASTKNIEMANRFVHSLTSDGGTEMALALEHALGQESGGSSWQETGLQGKDEESTSRL 487 Query: 313 NRILLATDGDFNVGIDDPKSIESMVKKQRESGVTLSTFGVGNSNYNEAMMVRIADVGNGN 372 ++L TDG + + + ++K + L T G+G++ N M R A+ G G Sbjct: 488 RQVLFMTDGA----VGNEAELFKLIKYRIGKS-RLFTLGIGSAP-NSHFMQRAAEFGRGT 541 Query: 373 YSYIDTLSEAQKVLNSEMRQMLITVAKDVKAQIEFNPAWVTEYRQIGYEKRQLRVEHFNN 432 ++YI L E Q+ + + ++ D++ + + ++ Sbjct: 542 FTYIGDLDEVQEKIQGLLYKIEHPQITDIELH--YTDGTIPDF----------------- 582 Query: 433 DNVDAGDIGAGKHITLLFELTLNGQKASIDKL 464 D+ A + + + ++ + AS DKL Sbjct: 583 WPATIPDLYAEEPLLVAIKMPSDKYAASQDKL 614 >UniRef50_UPI00016E8A41 UPI00016E8A41 related cluster n=3 Tax=Takifugu rubripes RepID=UPI00016E8A41 Length = 945 Score = 184 bits (467), Expect = 8e-45, Method: Composition-based stats. Identities = 47/278 (16%), Positives = 108/278 (38%), Gaps = 13/278 (4%) Query: 138 RFLNQGLLPPPDAVRVE-EIVNYFPSDWDIKDKQSIPASKPIPFAMRYELAPAPW-NEQR 195 R + LP ++ E + S ++ + F +RY++ + + Sbjct: 219 RASGRPQLPVTTEIKKEKNMCRITFSPNIVQQARIATNGLLGDFVIRYDVQRDLGIGDIQ 278 Query: 196 TLLKVDILAKDRKSEELPASNLVFLIDTSGSMISDERLPLIQSSLKLLVKELREQDNIAI 255 L + K N+VF+IDTS SM+ +++ + +L ++ +LR D+ Sbjct: 279 VLNGHFVHYFAPKDLPAVPKNVVFVIDTSASMLG-KKIRQTKEALFTILGDLRPGDHFNF 337 Query: 256 VTYAGDSRIALP----SISGSHKAEINAAIDSLDAEGSTNGGAGLELAYQQAT-----KG 306 ++++ ++ P ++ ++ + I L G TN + ++ + Sbjct: 338 ISFSSRVKVWQPGRLVPVTPNNVRDAKKFIFMLPTSGGTNINSAIQTGSSLLQDYLSAQD 397 Query: 307 FIKGGINRILLATDGDFNVGIDDPKSIESMVKKQRESGVTLSTFGVGNSNYNEAMMVRIA 366 ++ I+ TDG VG +I + + + T G+GN + + ++ R+A Sbjct: 398 ASPNSVSLIIFLTDGQPTVGEVQSVTILGNTRSAVQGKFCIFTIGIGN-DVDYRLLERMA 456 Query: 367 DVGNGNYSYIDTLSEAQKVLNSEMRQMLITVAKDVKAQ 404 G I ++A +L ++ + D++ Sbjct: 457 LDNCGMMRRIPEEADASSMLKGFYDEIGTPLLSDIRIN 494 >UniRef50_B4DPQ4 cDNA FLJ60769, highly similar to Inter-alpha-trypsin inhibitor heavy chain H3 n=11 Tax=Tetrapoda RepID=B4DPQ4_HUMAN Length = 698 Score = 184 bits (466), Expect = 9e-45, Method: Composition-based stats. Identities = 53/268 (19%), Positives = 103/268 (38%), Gaps = 12/268 (4%) Query: 163 DWDIKDKQSIPASKPIPFAMRYELAPAPWNEQRTLLKVDILAKDRKSEELPASNLVFLID 222 D + F + Y++ + + + + + N+ F+ID Sbjct: 231 DQQRSCPTCTDSLLNGDFTITYDVNRESPGNVQIVNGYFVHFFAPQGLPVVPKNVAFVID 290 Query: 223 TSGSMISDERLPLIQSSLKLLVKELREQDNIAIVTYAGDSRIA---LPSISGSHKAEINA 279 SGSM +L + +L ++++++E+D + ++GD L + + E Sbjct: 291 ISGSMAGR-KLEQTKEALLRILEDMKEEDYLNFTLFSGDVSTWKEHLVQATPENLQEART 349 Query: 280 AIDSLDAEGSTNGGAGLELAYQQATKGFIKGGINR-----ILLATDGDFNVGIDDPKSIE 334 + S++ +G TN GL K + I +++ TDGD NVG P+ I+ Sbjct: 350 FVKSMEDKGMTNINDGLLRGISMLNKAREEHRIPERSTSIVIMLTDGDANVGESRPEKIQ 409 Query: 335 SMVKKQRESGVTLSTFGVGNSNYNEAMMVRIADVGNGNYSYIDTLSEAQKVLNSEMRQML 394 V+ L G GN N N + +A +G I S+A L ++ Sbjct: 410 ENVRNAIGGKFPLYNLGFGN-NLNYNFLENMALENHGFARRIYEDSDADLQLQGFYEEVA 468 Query: 395 ITVAKDVKAQIEFNPAWVTEYRQIGYEK 422 + V ++E+ + + Q Y+ Sbjct: 469 NPLLTGV--EMEYPENAILDLTQNTYQH 494 >UniRef50_A8H5J9 LPXTG-motif cell wall anchor domain n=1 Tax=Shewanella pealeana ATCC 700345 RepID=A8H5J9_SHEPA Length = 789 Score = 184 bits (466), Expect = 1e-44, Method: Composition-based stats. Identities = 75/415 (18%), Positives = 144/415 (34%), Gaps = 55/415 (13%) Query: 126 LDVDTGSYANVRRFLNQGLLPPPDAVRVEEIVNYFPSDWDIKDKQSIPASKPIPFAMRYE 185 ++VD LN+ + D V + + + S A E Sbjct: 311 INVDMAESGGAIVSLNRDAIANKDFVLTWKPIQ---GNEPTAAVFSQIGKTHTSQASSSE 367 Query: 186 LAPAPWNEQRTLLKVDILAKDRKSEELPASN--LVFLIDTSGSMISDERLPLIQSSLKLL 243 + P V ++ ++ + + L+ +IDTSGSM D + +++LK Sbjct: 368 ASTEPQTASEKYGLVMLMPPQGAEQQPSSIHRELILVIDTSGSMSGDAII-QAKTALKYA 426 Query: 244 VKELREQDNIAIVTYAGDSRIAL---PSISGSHKAEINAAIDSLDAEGSTNGGAGLELAY 300 + LR D IV + D S + + A+ I+ L+A G T + A Sbjct: 427 LAGLRPTDKFNIVQFNSDVDKWSGMAMSATPYNLAQAQNYINRLEANGGTEMSIAINAAL 486 Query: 301 ------------QQATKGFIKGGINRILLATDGDFNVGIDDPKSIESMVKKQRESGVTLS 348 + + ++L TDG + + + +++ Q L Sbjct: 487 NIETVTDKETGTELDNNDLGSNLLRQVLFITDGA----VSNESMLFELIEAQLGDS-RLF 541 Query: 349 TFGVGNSNYNEAMMVRIADVGNGNYSYIDTLSEAQKVLNSEMRQMLITVAKDVKAQIEFN 408 T G+G++ N M R A +G G Y+YI L E + + S ++++ DV + F+ Sbjct: 542 TIGIGSAP-NAHFMQRAAQLGRGTYTYIGKLDEVNQKVVSLLKKIEKPQVTDV--DLRFS 598 Query: 409 PAWVTEYRQIGYEKRQLRVEHFNNDNVDAGDIGAGKHITLLFELTLNGQKASIDKLR--- 465 V +Y V D+ A + LL + L +S ++ Sbjct: 599 DGSVPDY-----------------WPVRIPDLYAHEP--LLVAVKLPSYASSDLLVQGLL 639 Query: 466 ----YAPDNKLAKSDKTKELAWLKIRWKYPQGKESQLVEFPLGPTINAPSEDMRF 516 + ++ +++ K L + R + + S+ + M F Sbjct: 640 AGQFWQRRLSVSATEQAKGLDLVWARKQIAALELSKQAANRERIEKQITAIAMNF 694 >UniRef50_C7R936 Vault protein inter-alpha-trypsin domain protein n=1 Tax=Kangiella koreensis DSM 16069 RepID=C7R936_KANKD Length = 689 Score = 183 bits (464), Expect = 2e-44, Method: Composition-based stats. Identities = 70/508 (13%), Positives = 155/508 (30%), Gaps = 71/508 (13%) Query: 22 QPENKESQQQQPSTPTEQQVLAAQQAAIKEAEQSAAAAKALA-----QQEVQQYSDKQAL 76 Q + ++ + V + + A +++ QQ V ++ Sbjct: 111 QQAKRTYEKAKAEGKRTSLVEQIRDNLFTTKLANIAPGESITIHIEFQQLVHNDGTHFSM 170 Query: 77 QGRLQEAPTFARAAK--------------------AKATHIANPGTARYQQFDDNPVKQV 116 + L P + ++ ++ P + Sbjct: 171 RMPLGITPRYKPVDATHEFSNNDNNDDYNYFSSDDSETVYLDTPAHHSAELSPAFTQFNT 230 Query: 117 AQNPLATFSLDVD---TGSYANVRRFLNQGLLPPPDAVRVEEIVNYFPSDWDIKDKQSIP 173 P S+ V + + ++ ++ N ++ D Sbjct: 231 GSQPDRPVSVTVHLNPGFDLSLLESPYHKMNTQQTGGQYTIQLENPAQAERDFVLNWQPQ 290 Query: 174 ASKPIPFAMRYELAPAPWNEQRTLLKVDILAKDRKSEELPASNLVFLIDTSGSMISDERL 233 + A+ E + + +L + D ++ ++F+ID+ +S E + Sbjct: 291 LGQQPKVALFSE---SYDDHNYHVLMMLPPTHDLVQQKTQPREMIFVIDS-SGSMSGESM 346 Query: 234 PLIQSSLKLLVKELREQDNIAIVTYAGDSRIALP---SISGSHKAEINAAIDSLDAEGST 290 + L + +L D I+ + D+ + S+ + +L+A+G T Sbjct: 347 QQAKQGLYYALSQLSINDTFNIIDFDNDANKLFDEAVPATLSNLEMAKYFVATLEADGGT 406 Query: 291 NGGAGLELAYQQATKGFIKGGINRILLATDGDFNVGIDDPKSIESMVKKQRESGVTLSTF 350 + LA + +++ TDG I + + I M++ Q + L T Sbjct: 407 EIAKAINLALD----KPDSSLLRQVVFLTDG----SIGNERQIFQMIENQLGNN-RLFTI 457 Query: 351 GVGNSNYNEAMMVRIADVGNGNYSYIDTLSEAQKVLNSEMRQMLITVAKDVKA------Q 404 G+G + N M + A+ G G ++YI SE Q L +++ +++ Q Sbjct: 458 GIGAAP-NSYFMSKAANYGRGTFTYIGKASEVQTKLEQLFKKLRYPALENLSLESKYSDQ 516 Query: 405 IEFNPAWVTEYRQIGYEKRQLRVEHFNNDNVDAGDIGAGKHITLLFELTLNGQKASIDKL 464 +E P + + R+ ++V K D Sbjct: 517 LELYPGRLRDLYLGEPLFVSYRIPKGVTNSVQV--------------------KGQADAY 556 Query: 465 RYAPDNKLAKSDKTKELAWLKIRWKYPQ 492 ++ + K K +A L R K Sbjct: 557 DWSFKLPPVTNGKDKGIARLWARMKIDA 584 >UniRef50_A9F2Q0 Putative uncharacterized protein n=1 Tax=Sorangium cellulosum 'So ce 56' RepID=A9F2Q0_SORC5 Length = 521 Score = 182 bits (462), Expect = 3e-44, Method: Composition-based stats. Identities = 78/362 (21%), Positives = 146/362 (40%), Gaps = 18/362 (4%) Query: 165 DIKDKQSIPASKPIPFAMRYELAPAPWNEQRTLLKVDILAKDRKSEELPASNLVFLIDTS 224 D + R A P + Q T L ++ + L +NL +ID S Sbjct: 56 AAVDPSRFTTGSRLMLEGRVGHARLPRSAQETFLMFEVRGDGSPARSLAQANLSLVIDRS 115 Query: 225 GSMISDERLPLIQSSLKLLVKELREQDNIAIVTYAGDSRIALPSIS--GSHKAEINAAID 282 GSM RL + V L + D +++VT+ + + +P + + I A++ Sbjct: 116 GSMKG-TRLTNAVQAATTAVSRLNDGDVVSVVTFDTRTSVVVPPTTVGPETRGRILASVR 174 Query: 283 SLDAEGSTNGGAGLELAYQQATKGFIKGGINRILLATDGDFNVGIDDPKSIESMVKKQRE 342 + G T G+E G G++R+L+ +DGD N G+ D +M ++ R+ Sbjct: 175 GISLGGDTCISCGIEEGLSLL--GQTSAGVSRMLVLSDGDANHGVRDVPGFRAMAQRARD 232 Query: 343 SGVTLSTFGVGNSNYNEAMMVRIADVGNGNYSYIDTLSEAQKVLNSEMRQMLITVAKDVK 402 GV ++T GV + +YNE ++ IA NG + +++ + ++ +E Q+ +VA + Sbjct: 233 RGVAITTIGV-DVDYNEKILSAIALDSNGRHYFVENDAALARIFEAEAEQLTTSVASGAE 291 Query: 403 AQIEFNPAWVTEYRQIGYEKRQLRVEHFNNDNVDAGDIGAGKHITLLFELTLNG------ 456 I+ P V R R + V G AG+ T+L ++ L G Sbjct: 292 LAIDLAPG-VELDRVFDRSFR----RAGDQVIVPLGAFAAGEVKTVLLKVRLGGLGGDAR 346 Query: 457 QKASIDKLRYAPDNKLAKSDKTKELAWLKIRWKYPQGKESQLVEFPLGPTINAPSEDMRF 516 + + + + +A++D + L + S+L G + + Sbjct: 347 GDSPVADVGLTYRDLVARTDA-RCAGKLGVAIADRDADASELDALVAGRVQRSETAAALK 405 Query: 517 RA 518 +A Sbjct: 406 QA 407 >UniRef50_Q14624 35 kDa inter-alpha-trypsin inhibitor heavy chain H4 n=38 Tax=Eutheria RepID=ITIH4_HUMAN Length = 930 Score = 182 bits (462), Expect = 3e-44, Method: Composition-based stats. Identities = 51/265 (19%), Positives = 105/265 (39%), Gaps = 13/265 (4%) Query: 165 DIKDKQSIPASKPIPFAMRYELAPA-PWNEQRTLLKVDILAKDRKSEELPASNLVFLIDT 223 K + +RY++ A + + + N+VF+ID Sbjct: 222 QQKSPEQQETVLDGNLIIRYDVDRAISGGSIQIENGYFVHYFAPEGLTTMPKNVVFVIDK 281 Query: 224 SGSMISDERLPLIQSSLKLLVKELREQDNIAIVTYAGDSRIALP---SISGSHKAEINAA 280 SGSM ++ + +L ++ +L +D ++ ++ ++ P S + + + Sbjct: 282 SGSMSGR-KIQQTREALIKILDDLSPRDQFNLIVFSTEATQWRPSLVPASAENVNKARSF 340 Query: 281 IDSLDAEGSTNGGAGLELAYQQA-----TKGFIKGGINRILLATDGDFNVGIDDPKSIES 335 + A G TN + +A Q + +G ++ I+L TDGD VG +P+SI++ Sbjct: 341 AAGIQALGGTNINDAMLMAVQLLDSSNQEERLPEGSVSLIILLTDGDPTVGETNPRSIQN 400 Query: 336 MVKKQRESGVTLSTFGVGNSNYNEAMMVRIADVGNGNYSYIDTLSEAQKVLNSEMRQMLI 395 V++ +L G G + + A + ++A G I S++ L +++ Sbjct: 401 NVREAVSGRYSLFCLGFGF-DVSYAFLEKLALDNGGLARRIHEDSDSALQLQDFYQEVAN 459 Query: 396 TVAKDVKAQIEFNPAWVTEYRQIGY 420 + V E+ V E Q + Sbjct: 460 PLLTAV--TFEYPSNAVEEVTQNNF 482 >UniRef50_UPI000155CC23 PREDICTED: similar to ITI-like protein n=3 Tax=Amniota RepID=UPI000155CC23 Length = 1374 Score = 182 bits (461), Expect = 4e-44, Method: Composition-based stats. Identities = 57/385 (14%), Positives = 135/385 (35%), Gaps = 29/385 (7%) Query: 142 QGLLPPPDAVRVEEIVNYF-PSDWDIKDKQSIPASKPIPFAMRYELAPAPW-NEQRTLLK 199 + +PP + E + + + F ++Y+++ + + Sbjct: 243 EADIPPSTKIEKGEKCARIIFTPTPQEQAAYSSSGIMGDFVVQYDVSMKDIIGDVQIYNG 302 Query: 200 VDILAKDRKSEELPASNLVFLIDTSGSMISDERLPLIQSSLKLLVKELREQDNIAIVTYA 259 + + N+VF+ID SGSM ++ + ++ +++ +L D IVT++ Sbjct: 303 YFVHYFAPRGLPPVQKNVVFVIDVSGSMFG-TKMKQTKKAMHVILNDLHHDDYFNIVTFS 361 Query: 260 GDSRIA----LPSISGSHKAEINAAIDSLDAEGSTNGGAGLELAYQQATKGFIK------ 309 + + + ++ ++A+G T+ A L +A + + Sbjct: 362 DAVSVWKASGSIQATPPNIKSAKVYVNKMEADGWTDINAALLVAASVFNQSTGETGRGKG 421 Query: 310 -GGINRILLATDGDFNVGIDDPKSIESMVKKQRESGVTLSTFGVGNSNYNEAMMVRIADV 368 I I+ TDG+ G+ I S K+ + ++L G+ + + +M R++ Sbjct: 422 LKKIPLIIFLTDGEATAGVTVASRILSNAKQSLKGNISLFGLAFGD-DADYHLMRRLSLE 480 Query: 369 GNGNYSYIDTLSEAQKVLNSEMRQMLITVAKDVKAQIEFNPAWVTEYRQIGYEKRQLRVE 428 G I ++A L ++ + D ++ + + V + Q + E Sbjct: 481 NRGVARRIYEDADATLQLKGFYDEIASPLLYD--IELTYLDSSVQDVTQNLFPNYFGGSE 538 Query: 429 HFNNDNVDAG--DIGAGKHITLLFELTLNGQKASIDKLR--YAPDNKLAKSDKTKELAW- 483 V G D+ E + G + + + L++ ++ + W Sbjct: 539 LVVAGKVKPGVRDLHVQMTAQNYKEQVVVGNDITTNSTEAAFGCAEDLSQIERFVQRLWA 598 Query: 484 -------LKIRWKYPQGKESQLVEF 501 L+ R+K +L+ Sbjct: 599 YFTIQDLLQARFKANDTANRRLLSE 623 >UniRef50_A0EFJ5 Chromosome undetermined scaffold_93, whole genome shotgun sequence n=2 Tax=Paramecium tetraurelia RepID=A0EFJ5_PARTE Length = 610 Score = 182 bits (461), Expect = 4e-44, Method: Composition-based stats. Identities = 80/422 (18%), Positives = 161/422 (38%), Gaps = 23/422 (5%) Query: 166 IKDKQSIPASKPIPFAMRYELAPAPWNE--------QRTLLKVDILA-KDRKSEELPASN 216 I + + P + + L P + Q + + I + K + ++ + Sbjct: 70 IDQEITDPDALQVELLNSVHLNVLPRQKAIQVQEYSQILPVVLQIQSLKSQLKKQRANID 129 Query: 217 LVFLIDTSGSMISDERLPLIQSSLKLLVKELREQDNIAIVTYAGDS--RIALPSISGSHK 274 L+ ++D SGSM E++ L+Q+SL+ + K L+ D +A+VT+ + + +K Sbjct: 130 LMCVVDVSGSMNG-EKIKLVQNSLRYIQKILKPTDRLALVTFGTQAGINLQWTRNIAENK 188 Query: 275 AEINAAIDSLDAEGSTNGGAGLELAYQQATKGFIKGGINRILLATDGDFNVGIDDPKSIE 334 +I AI + STN +G+ L + K + + + +DG + D + + Sbjct: 189 KKIKKAIKDIKIRDSTNIASGVALGLRMIRDRKFKNPVTSMFVLSDGVDDDRGADLRCQQ 248 Query: 335 SMVKKQRESGVTLSTFGVGNSNYNEAMMVRIADVGNGNYSYIDTLSEAQKVLNSEMRQML 394 ++ + + +T++TFG G S+++ +M IA++ G + YID + + M ML Sbjct: 249 ALHQYNIQDTLTINTFGYG-SDHDAKVMNNIANLKGGQFVYIDQIQRVSEHFILAMSGML 307 Query: 395 ITVAKDVKAQIEFNPAWVTEYRQIGYEKRQLRVEHFNNDNVDAGDIGAGKHITLLFELTL 454 AK+V ++ + +I + + + E+ + Sbjct: 308 SVKAKNVILTVKQLNNE-FKLSKIFGDDFLWNKISETEFQLTLNYLVDEDKKEFALEIEI 366 Query: 455 NGQKASIDKLRYAPDNKLAKSDKTKELAWLKIRWKYPQGKESQLVEFPLGPTINAPSEDM 514 G K L L A+ K + +EF P E + Sbjct: 367 PGFKQQELVLENIMQIDLQGVFIGLNTAFKK--------SSNLELEFSKTEVQYQPVELV 418 Query: 515 RFRAAVAAYGQKLRGSEYLNNT-SWQQIKQWAQQAKGEDPQGYRAEFIRLIELADGVTDI 573 A G ++ ++ L N+ + Q Q Q E + +L+ + + DI Sbjct: 419 EVNYLRAKAGDRIGQAKELANSKKYDQSIQLLNQMIDEIENSLFKDSKQLVVVLKDLNDI 478 Query: 574 SQ 575 Q Sbjct: 479 KQ 480 >UniRef50_A3W9L9 Putative uncharacterized protein n=2 Tax=Erythrobacter RepID=A3W9L9_9SPHN Length = 740 Score = 181 bits (460), Expect = 5e-44, Method: Composition-based stats. Identities = 71/524 (13%), Positives = 160/524 (30%), Gaps = 55/524 (10%) Query: 21 PQPENKESQQQQPSTPTEQQVLAAQQAAIKEAEQSAAAAKALAQQEVQQYSDKQALQGRL 80 + + ++ + + V + + + + + + Q Q Q Sbjct: 141 REEAREIYEEAKANGQKAGLVEQHRPNVFRNSIANVGPGETVLVQIEFQAPIAQVAGDYS 200 Query: 81 QEAPTFARAAKAKATHIANPGTARYQQFDDNPVKQVAQNPLATFSLDVDTGSYANVRRFL 140 P + AN +R + + + ++ L Sbjct: 201 LRMPLVVGPRYIAGSANANGTLSRASLLAGSADLIAPTADPNMVARAGGGLNPVSITVNL 260 Query: 141 NQGLLPPPDAVRVEEIVNYFPSDWDIKDKQSIPASKPIPFAMRYELAP--------APWN 192 + G P + + F +R+ + + Sbjct: 261 DPGFAPEAISSPYHAVSVRGSGSTRTVTLADGAVPANRDFELRWSASGDAPMLGLFKQRH 320 Query: 193 EQRTLLKVDILAKDRKSE-ELPASNLVFLIDTSGSMISDERLPLIQSSLKLLVKELREQD 251 + + I + E P ++F+ID SGSM + +P + SL ++ LR QD Sbjct: 321 GELEYVMATITPPALERVGEAPPREMIFVIDNSGSMAGES-MPAARRSLLYALETLRPQD 379 Query: 252 NIAIVTYAGDSRIAL---PSISGSHKAEINAAIDSLDAEGSTNGGAGLELAYQQATKGFI 308 ++ + S S+ A +L A G T L A Sbjct: 380 RFNVIRFDDTMTELFASAVQASDSNIAAAKTFTHNLMANGGTEMLPALRAA---LRDRAP 436 Query: 309 KGGINRILLATDGDFNVGIDDPKSIESMVKKQRESGVTLSTFGVGNSNYNEAMMVRIADV 368 + +++ TDG + + + + + R+ + G+G++ N +M R+A+ Sbjct: 437 DERVRQVIFLTDGA----LSNEADMMEEINRNRKDS-RVFMVGIGSAP-NTYLMRRMAEA 490 Query: 369 GNGNYSYIDTLSEAQKVLNSEMRQMLITVAKDVKAQIEFNPAWVTEYRQIGYEKRQLRVE 428 G G ++++ EA+ + + ++ + VA + A +E Sbjct: 491 GRGTFTHVGMGEEAEDQMQRLLDRLSLPVATGLTANVE--------------------GG 530 Query: 429 HFNNDNVDAGDIGAGKHITLLFELTLNGQKAS----IDKLRYAPDNKLAKSDKTKELAWL 484 + + D D+ AG+ + LL + I+ +R+ L + + +A L Sbjct: 531 NIDFAPRDLPDLYAGEPLVLLGRTRHLEGTLTVSGMIEGVRWTRSIDLKDASDSDVVAKL 590 Query: 485 KIRWKYPQGKESQLVEFPLGPTINAPSEDMRFRAAVAAYGQKLR 528 S+ + + + R ++ G Sbjct: 591 WA---------SRRIAEVEAERWSGETSHERADTSIEELGMAFH 625 >UniRef50_A8U1E2 Putative uncharacterized protein n=1 Tax=alpha proteobacterium BAL199 RepID=A8U1E2_9PROT Length = 683 Score = 181 bits (460), Expect = 5e-44, Method: Composition-based stats. Identities = 79/491 (16%), Positives = 149/491 (30%), Gaps = 43/491 (8%) Query: 22 QPENKESQQQQPSTPTEQQVLAAQQAAIKEAEQSAAAAKALAQQEVQQYSDKQALQGRLQ 81 Q + + + V + + + + Q S + Sbjct: 124 QQALETYKTAKKEGKRAALVEQDRPNLFNTSLANIPPHGNIIVNIEYQQSVQWDGGVFRL 183 Query: 82 EAPTFARAAKAKATHIANPGTARYQQFDDNPVKQVAQNPLATFSLDVDTG---SYANVRR 138 P K A + P A + + D G S + Sbjct: 184 RYPMSITPRY-KPEEPAKYLLESKADVSSGWTILPDERPQA-VAYEEDGGTVESKTTLEV 241 Query: 139 FLNQGLLP---------------PPDAVRVEEIVNYFPSDWDIKDKQSIPASKPIPFAMR 183 L+ G + + SD D + + A+ + Sbjct: 242 SLSPGFQVGQVQSLFHGVIENASSDGTILLSLDGGAVRSDRDFVLEWTPAATSIPTASFF 301 Query: 184 YELAPAPWNEQRTLLKVDILAKDRKSEELPASNLVFLIDTSGSMISDERLPLIQSSLKLL 243 E A +Q + +L K ++F+ID SGSM E L ++SL Sbjct: 302 SESA-----DQGDYGLLTLLPPTLKDWGNMTREVIFVIDVSGSMKG-EPLRAAKASLTSG 355 Query: 244 VKELREQDNIAIVTYAGDSRIALP---SISGSHKAEINAAIDSLDAEGSTNGGAGLELAY 300 ++ L D +V + + SG ID L A G T A ELA Sbjct: 356 IEGLGRNDTFNVVAFNNKAAAFYDAPVRASGKFHRAALKVIDGLKAGGGTEMAAAFELAL 415 Query: 301 QQATKGFIKGGINRILLATDGDFNVGIDDPKSIESMVKKQRESGVTLSTFGVGNSNYNEA 360 Q + +++ TDG + + ++ + +K + + L T G+G++ N Sbjct: 416 QM---PGDPDRLQQVVFITDGA----VSNEAALFNQIKGELGAR-RLFTVGIGSAP-NTF 466 Query: 361 MMVRIADVGNGNYSYIDTLSEAQKVLNSEMRQMLITVAKDVKAQIE----FNPAWVTEYR 416 M A G G Y+YI S A++V+ ++ +++ + E P + + Sbjct: 467 FMEEAARFGRGTYTYIGDTSSAERVMRDLFTKISFPALTNIEVRGEGVEDITPGTIPDLY 526 Query: 417 QIGYEKRQLRVEHFNNDNVDAGDIGAGKHITLLFELTLNGQKASIDKLRYAPDNKLAKSD 476 +++ N G +G + L+ G ++ I + Sbjct: 527 AGEPLSIAMKLRQGNKAITVQGRLGDTEWKR-TVTLSPAGNQSGIRVDWARKRIADWQRA 585 Query: 477 KTKELAWLKIR 487 L ++R Sbjct: 586 GYLGLDKDRVR 596 >UniRef50_D0LD98 von Willebrand factor type A n=1 Tax=Gordonia bronchialis DSM 43247 RepID=D0LD98_GORB4 Length = 423 Score = 180 bits (457), Expect = 1e-43, Method: Composition-based stats. Identities = 78/375 (20%), Positives = 140/375 (37%), Gaps = 19/375 (5%) Query: 181 AMRYELAPAPWNEQRTLLKVDILAKDRKSEELPASNLVFLIDTSGSMISDERLPLIQSSL 240 A+ ++ + + ++I+A + K + + L ++D SGSM L Q +L Sbjct: 6 ALDVDVVAH-ESADEVTVLLEIVAPEGKVTDRAPAALQVVLDRSGSMSGP-PLAGAQRAL 63 Query: 241 KLLVKELREQDNIAIVTYAGDSRIALPSISGSHKAEINAAIDSLDAEGSTNGGAGLELAY 300 ++ +L +D +VT+ D+++ LP+ + KA A+ S+ G T+ +G Sbjct: 64 AGVIGQLDPRDVFGVVTFDDDAQVVLPAAPLADKARAVDAVGSIVPGGCTDLSSGYLRGL 123 Query: 301 QQATKGFIKGGIN--RILLATDGDFNVGIDDPKSIESMVKKQRESGVTLSTFGVGNSNYN 358 Q+ + GI +L+ +DG N GI D S+ K G+ ST G G Y+ Sbjct: 124 QELRRATASAGIRGGTVLVISDGHVNRGIRDLDEFASITAKAAADGIITSTLGYGRG-YD 182 Query: 359 EAMMVRIADVGNGNYSYIDTLSEAQKVLNSEMRQMLITVAKDVKAQIEFNPAWVTEYRQI 418 E ++ IA GNGN+ + D A + E+ +L A+ V + + Sbjct: 183 ETLLSAIARSGNGNHVFADDPDAAGAAIAGEVDGLLSKSAQAVTLTVRY---IQKVTELS 239 Query: 419 GYEKRQLRVEHFNNDNVDAGDIGAGKHITLLFELTLNGQKA-SIDKLRYAPDNKLAKSDK 477 Y ++ GD+ A + LL + ++ + + +L + + Sbjct: 240 LYNDLPAHQIGDGEVMIELGDLYAAEARKLLLRMKVDALASLGLAQLAMLELRYVQTATL 299 Query: 478 TKELAWLKIRWKYPQGKE-SQLVEFPLGPTINAPSEDMRFRAAVAAYGQKLRGSEYLNNT 536 T+ L I G E V P + E KLR S Sbjct: 300 TEHTVTLPISVNVVPGDELGTRVPDPTVVSEKLYQE---------GQADKLRASRAYEAG 350 Query: 537 SWQQIKQWAQQAKGE 551 A + Sbjct: 351 DLGAGNALLATASSK 365 >UniRef50_A9AXC2 von Willebrand factor type A n=6 Tax=Chloroflexi (class) RepID=A9AXC2_HERA2 Length = 421 Score = 180 bits (457), Expect = 1e-43, Method: Composition-based stats. Identities = 67/392 (17%), Positives = 144/392 (36%), Gaps = 29/392 (7%) Query: 188 PAPWNEQRTLLKVDILAKDRKSEELPASNLVFLIDTSGSMISDERLPLIQSSLKLLVKEL 247 PA +Q L +DI A + N+ F++D SGSM +++ ++ + + + + Sbjct: 17 PALQTQQVVYLLLDITATPAVAHVQMPVNVSFVLDHSGSMKG-DKMRCVREATQRALGLM 75 Query: 248 REQDNIAIVTYAGDSRIALPSISGSHKAEINAAIDSLDAEGSTNGGAGLELAYQQATKGF 307 QD +++V + + + + A + A + + G T LE A + + Sbjct: 76 GPQDIVSVVIFDHRRETIISAQPVRNVAALQAEVGKIKDAGGTKIAPALEAALNEIRRSQ 135 Query: 308 IKGGINRILLATDGDFNVGIDDPKSIESMVKKQRESGVTLSTFGVGNSNYNEAMMVRIAD 367 I+RI+L TDG + + ++ ++ V L+ GVG+ ++NE +++ +A+ Sbjct: 136 NANTISRIILLTDGQTEG----ERDCLRLAEEIGKASVPLTALGVGD-DWNEDLLIEMAN 190 Query: 368 VGNGNYSYIDTLSEAQKVLNSEMRQMLITVAKDVKAQIEFNPA-WVTEYRQIGYEKRQLR 426 G Y ++ ++Q V ++ + F Q+ +QL Sbjct: 191 RSGGVAEYFSNPNDIASFFQGAVQQAQSAVVQNSALTLRFVQGVEPRALWQVTPLIQQLP 250 Query: 427 VEHFNN--DNVDAGDIGAGKHITLLFELTLNGQKASIDKLRYAPDNKLAKSDKTKELAWL 484 ++ V GDI +H +L E+ ++ ++A +L N ++ Sbjct: 251 YRPISDRAVGVSLGDISKDEHRMVLIEMLVDPKQAGQYRLGQIEVNYDIPQM---QVVGE 307 Query: 485 KIRWKYPQGKESQLVEFPLGPTINAPSEDMRFRAAVAAYGQKLRGSEYLNNTSWQQIKQW 544 K R+ + D V + + + ++ Sbjct: 308 KARYDVMLNFVA----------------DPAQATGVVPQVMNIVEKVSAHKLQTRALEDL 351 Query: 545 AQQAKGEDPQGYRAEFIRLIELAD-GVTDISQ 575 A+ G Q + RL+ + + Q Sbjct: 352 AEGNIGAATQKLQGAVTRLLNQGETELAQTMQ 383 >UniRef50_Q4SBF6 Chromosome 11 SCAF14674, whole genome shotgun sequence. (Fragment) n=16 Tax=Euteleostomi RepID=Q4SBF6_TETNG Length = 1039 Score = 180 bits (457), Expect = 1e-43, Method: Composition-based stats. Identities = 48/273 (17%), Positives = 99/273 (36%), Gaps = 13/273 (4%) Query: 177 PIPFAMRYELAPAPW-NEQRTLLKVDILAKDRKSEELPASNLVFLIDTSGSMISDERLPL 235 F ++Y++ + + + K N+VF+ID SGSM ++ Sbjct: 394 DGDFIIKYDVNRENDLGDIQIANGYFVHFFAPKDLPRLPKNVVFVIDMSGSMSG-TKMQQ 452 Query: 236 IQSSLKLLVKELREQDNIAIVTYAGDSRIA---LPSISGSHKAEINAAIDSLDAEGSTNG 292 + ++ ++++L +D+ I+ + + L + + E + ++ + G T+ Sbjct: 453 TREAMLKILEDLDPEDHFGIILFDHRIQFWNTSLSKATKENIDEAMVYVKAIQSYGGTDI 512 Query: 293 GAGLELAYQQATKGFIKGGIN-----RILLATDGDFNVGIDDPKSIESMVKKQRESGVTL 347 A + A + + I+L TDGD N G I+ VK ++L Sbjct: 513 NAPVLKAVDMLKEDRKAKRLPEKSIDMIILLTDGDPNSGESRIPVIQENVKAAIGGQMSL 572 Query: 348 STFGVGNSNYNEAMMVRIADVGNGNYSYIDTLSEAQKVLNSEMRQMLITVAKDVKAQIEF 407 + G GN + + ++ NG I S+A L ++ + DV + + Sbjct: 573 FSLGFGN-DVKYPFLDVMSRENNGLARRIYEGSDAALQLQGFYDEVSSPLLLDV--DLRY 629 Query: 408 NPAWVTEYRQIGYEKRQLRVEHFNNDNVDAGDI 440 V + + E + DI Sbjct: 630 PDNAVDSLTTNQFSQLFNGSEIVVAGRLKDNDI 662 Score = 54.0 bits (128), Expect = 2e-05, Method: Composition-based stats. Identities = 18/100 (18%), Positives = 34/100 (34%), Gaps = 10/100 (10%) Query: 177 PIPFAMRYELAPAPW-NEQRTLLKVDILAKDRKSEELPASNLVFLIDTSGSMISDERLPL 235 F ++Y++ + + + K N+VF+ID SGSM + Sbjct: 294 DGDFIIKYDVNRENDLGDIQIANGYFVHFFAPKDLPRLPKNVVFVIDMSGSMSGTKMQQE 353 Query: 236 IQSSLKLLVKELREQDNIAIVTYAGDSRIALPSISGSHKA 275 + + L + R D G +RI+ + Sbjct: 354 AHRAARSL--QKRSTD-------GGTARISFSPTIEQQRK 384 >UniRef50_C6VUB8 von Willebrand factor type A n=1 Tax=Dyadobacter fermentans DSM 18053 RepID=C6VUB8_DYAFD Length = 935 Score = 180 bits (456), Expect = 1e-43, Method: Composition-based stats. Identities = 98/408 (24%), Positives = 176/408 (43%), Gaps = 53/408 (12%) Query: 28 SQQQQPSTPTEQQVLAAQQAAIKEAEQSAAAAKALAQQEVQQ--------YSDKQALQGR 79 + Q T+ A +A ++ Q+ + Y D A GR Sbjct: 536 VRDYQAGEKTKMPETANLEADSRQLIADEYKNMKGLQRYGRSNGLCPYTPYEDLGANAGR 595 Query: 80 LQEA-PTFARAAKAKATHIANPGTARYQQ---FDDNPVKQVAQNPL-ATFSL-DVDTGSY 133 E + AA A H P Y ++ N ++++ L T + DV + Sbjct: 596 FAEKVQQYKPAAPGYAAHPYEPFYYFYNNELVYEYNNFIELSKAGLLKTINQPDV----F 651 Query: 134 ANVRRFLNQGLLPPPDAVRVEEIVNYFPSDWDIKD-------KQSIPASKPIPFAM---- 182 A R ++PP +VEE P+ +D +PA++ P Sbjct: 652 AFRRTKPQNAVVPPA---KVEEPKVEKPAGRPAEDIAQVKTNSAEMPAAQQSPVRNVRPL 708 Query: 183 ----------------RYELAPAPWNEQRTLLKVDILAKDRKSEELPASNLVFLIDTSGS 226 + + + R + +N+V L+D S S Sbjct: 709 TNEAAPSAPAAQATRDTVYVERVRVDTVYVDRNAQLQNVTRSLDGFAPNNMVLLLDVSSS 768 Query: 227 MISDERLPLIQSSLKLLVKELREQDNIAIVTYAGDSRIALPSISGSHKAEINAAIDSLDA 286 M S ++PL++ S+K L+ +R +D I+IV Y+G +R+ L SG+ +EI+ ID L + Sbjct: 769 MNSPYKMPLLKRSIKSLLTLVRPEDMISIVLYSGKARVVLKPTSGAKASEISRMIDLLQS 828 Query: 287 EGSTNGGAGLELAYQQATKGFIKGGINRILLATDGDFNVGIDDPKSIESMVKKQRESGVT 346 +G T+G G++LAY+ A K +I+GG NRI+LATDG+F V + M+++ V Sbjct: 829 DGDTDGNEGIKLAYKTANKQYIRGGNNRIVLATDGEFPVS----DEVMDMIRQNARQDVY 884 Query: 347 LSTFGVGNSNYNEAMMVRIADVGNGNYSYIDTLS-EAQKVLNSEMRQM 393 LS F G + + +++++G G+Y+++ S + Q +L ++ +++ Sbjct: 885 LSIFTFGRHEHTGQKLKKLSELGMGSYAHVTDASADLQLILEAQAKKL 932 >UniRef50_Q47YR5 Von Willebrand factor type A domain protein n=2 Tax=cellular organisms RepID=Q47YR5_COLP3 Length = 786 Score = 179 bits (454), Expect = 2e-43, Method: Composition-based stats. Identities = 72/528 (13%), Positives = 158/528 (29%), Gaps = 75/528 (14%) Query: 22 QPENKESQQQQPSTPTEQQVLAAQQAAIKEAEQSAAAAKALAQQEVQQYSDKQALQGRLQ 81 + Q+ + V + + A + + Sbjct: 160 KAAKAIYQKAKKKGRKASLVEQQRPNLFTNKIANIPAQSTVVVTLKFIMPVSFSQGKFNL 219 Query: 82 EAPTFARAAKAKATHIANPGTARYQQFDDNPVKQVA---------QNPLAT------FSL 126 P + + + + + + + ++T F + Sbjct: 220 RLPLALTDRYQPRSTSNSFNESSGHSPEYSSERSPSNFTHDLTNVSANISTKSLPEPFKI 279 Query: 127 DVDTGSYANVRRF----------LNQGLLPPPDAVRVEEIVNYFPSDWDIKDKQSI---- 172 S +VR LN G +P V + + +Q+ Sbjct: 280 STTRSSTTHVRSVARSQSSINIVLNSG-IPITSIVSDSHKIQSRDLSSKLNSEQNAYFIT 338 Query: 173 ----PASKPIPFAMRYELAPA---------PWNEQRTLLKVDILAKDRKSEELPASNLVF 219 F + ++L + + ++ ++ A +++F Sbjct: 339 LDKTQVISNKTFDLTWQLIASNQPQVSSFTQEISGEHYTLLTFFPPEKAVAQVIARDIIF 398 Query: 220 LIDTSGSMISDERLPLIQSSLKLLVKELREQDNIAIVTYAGDSRIALPSI---SGSHKAE 276 +IDTSGSM + + +SSL+L + +L +D+ I+ + D+ + P S + ++ Sbjct: 399 IIDTSGSMQAGS-MEQAKSSLQLALLQLNNKDSFNIIAFDNDTELLFPVTHMASAHNISK 457 Query: 277 INAAIDSLDAEGSTNGGAGLELAYQQATK-GFIKGGINRILLATDGDFNVGIDDPKSIES 335 ID L A G T L A I +I+ TDG + + + Sbjct: 458 AQQFIDGLSANGGTEMYRPLSNALMMKKDKTQSSKAIRQIVFITDGA----VANEFELMQ 513 Query: 336 MVKKQRESGVTLSTFGVGNSNYNEAMMVRIADVGNGNYSYIDTLSEAQKVLNSEMRQMLI 395 ++ + L T G+G + N M + A G G+Y +I SE Q+ ++ M ++ Sbjct: 514 LLNTA-QGDFRLYTVGIGAAP-NGYFMKKAAQFGRGSYVFIQNKSEVQRKMSHFMTKISQ 571 Query: 396 TVAKDVKAQIEFNPAWVTEYRQIGYEKRQLRVEHFNNDNVDAGDIGAGKHITLLFELTLN 455 ++ ++ E D+ G+ + + + Sbjct: 572 PALTNIALTLDNQIHQHVEV-----------------YPKKIPDLYFGEPLQIALKSQFP 614 Query: 456 GQKASIDKLRYAPDNKLAKSDKTKE----LAWLKIRWKYPQGKESQLV 499 + + ++ ++ L R K +S +V Sbjct: 615 ISSVQLTAETVSTPFYQQLIIDDRQPSKGISSLWARRKIESLVDSLIV 662 >UniRef50_Q083T9 Vault protein inter-alpha-trypsin domain protein n=1 Tax=Shewanella frigidimarina NCIMB 400 RepID=Q083T9_SHEFN Length = 722 Score = 179 bits (454), Expect = 2e-43, Method: Composition-based stats. Identities = 84/538 (15%), Positives = 177/538 (32%), Gaps = 55/538 (10%) Query: 24 ENKESQQQQPSTPTEQQVLAAQQAAIKEAEQSAAAAKALAQQEVQQYSDKQALQGRLQEA 83 EN + Q +Q S +Q+ + QQ+V+ +L+ L Sbjct: 135 ENAKKQGKQASLLQQQRRNLFTSDVANLGPHEQLVVEISYQQKVEYRDGLFSLRFPLAIT 194 Query: 84 PTFARAAKAKATHIANPGTARYQQFDDNPVKQVAQNPLATFSLDV--DTGSYANVRRFLN 141 P + A + + A + ++D + S ++ + Sbjct: 195 PRYNPQADRTTEQPLLAMPSSANTATSAKHVRPALDVKMQVNIDAGFELTSLDSLYHPIK 254 Query: 142 QGLLPPPDAVRVEEIVNYFPSDWDIKDKQSIPASKPIPFAMRYE---------------- 185 Q + +V +D D + A Y+ Sbjct: 255 QSNVGNHYSVN---FAGKQIADRDFVLQWQANVGAVPKAATFYQTGKTHLADNSDERSET 311 Query: 186 --LAPAPWNEQRTLLKVDILA-KDRKSEELPASNLVFLIDTSGSMISDERLPLIQSSLKL 242 P P + L + + + + L A L+ +IDTSGSM + + +L+ Sbjct: 312 AQRQPNPVDNNMYSLVMLMPPSVEVSEQHLIARELILVIDTSGSMSGQS-ITQAKQALQF 370 Query: 243 LVKELREQDNIAIVTYAGDSRIALP---SISGSHKAEINAAIDSLDAEGSTNGGAGLELA 299 + LR+ D+ I+ + D + S + + + N I SLDA+G T + L+ A Sbjct: 371 ALAGLRDIDSFNIIEFNSDVTMLSATPLSANSRNIGKANRFIQSLDADGGTEMRSALQTA 430 Query: 300 Y-QQATKGFI-----KGGINRILLATDGDFNVGIDDPKSIESMVKKQRESGVTLSTFGVG 353 + + +++ TDG + + + ++ Q L T G+G Sbjct: 431 LVDSVQQDSDQTDAHSEMLRQVIFMTDGA----VGNEHELYQLINDQLGDS-RLFTVGIG 485 Query: 354 NSNYNEAMMVRIADVGNGNYSYIDTLSEAQKVLNSEMRQMLITVAKDVKAQI------EF 407 ++ N M R A +G G ++YI SE Q+ + + ++ V ++ ++ Sbjct: 486 SAP-NSDFMRRAATMGRGTFTYIGNESEVQQKIEQLLNKIEQPVLTNIGLYYLDGSVPDY 544 Query: 408 NPAWVTEYRQIGYEKRQLRVEHFNNDNVDAGDIGAGKHITLLFELTLNGQKASIDKLRY- 466 P +++ Q ++ + G++ + N ID L Sbjct: 545 WPTTISDLYQNEPLWVSIKSASHQQQPIIVSGSINGQYWQQQLDFEENQAAKGIDLLWAN 604 Query: 467 -------APDNKLAKSDKTKELAWLKIRWKYPQGKES-QLVEFPLGPTINAPSEDMRF 516 + ++ K++ L + + + S V+ + D++ Sbjct: 605 AQITSLELYKDNASRDRVNKQVEALGLLYHLVTSQTSLVAVDVTPVNPLVDNPIDVQL 662 >UniRef50_Q0AMP5 Vault protein inter-alpha-trypsin domain protein n=1 Tax=Maricaulis maris MCS10 RepID=Q0AMP5_MARMM Length = 740 Score = 179 bits (454), Expect = 3e-43, Method: Composition-based stats. Identities = 72/500 (14%), Positives = 143/500 (28%), Gaps = 53/500 (10%) Query: 21 PQPENKESQQQQPSTPTEQQVLAAQQAAIKEAEQSAAAAKALAQQEVQQYSDKQALQGRL 80 Q + + + + V + + + + + Q Q + Sbjct: 139 RQAARRTYEAARANGQRASLVEQERPNMFTTSVANIGPGETIIVQFEYQDVARFVDGRFQ 198 Query: 81 QEAPTFARAAKAKATHIANPGTARYQQFDDNPVKQVAQNPLA-----------TFSLDVD 129 P + D P + T + D+D Sbjct: 199 LTQPLGLTPRYIPDGGDFQMVSTDSSSVPDASRITPPVMPASLEPRDQLRLPVTITADLD 258 Query: 130 TG-SYANVRRFLNQG--LLPPPDAVRVEEIVNYFPSDWDIKDKQSIPASKPIPFAMRYEL 186 G + + + R+ P++ D A+ E Sbjct: 259 AGYALGEIASLYHATLVERRSDGTARISLADGPIPANRDFVLTWRAADPSEASAALFIE- 317 Query: 187 APAPWNEQRTLLKVDILAKDR-KSEELPASNLVFLIDTSGSMISDERLPLIQSSLKLLVK 245 W + LL + + A +F+ID SGSM + +++L ++ Sbjct: 318 ---EWQGETYLLAQILPPAELGADTPRRARETIFVIDNSGSM-GGASMRQARAALITALQ 373 Query: 246 ELREQDNIAIVTYAGDSRIALPS---ISGSHKAEINAAIDSLDAEGSTNGGAGLELAYQQ 302 L D ++ + P S + A L+A+G T L A + Sbjct: 374 RLEPGDRFNVIRFDNTMEQVFPQAVDASPDNVATALTFARRLEAQGGTVMLPALNAALRD 433 Query: 303 ATKGFIKGGINRILLATDGDFNVGIDDPKSIESMVKKQRESGVTLSTFGVGNSNYNEAMM 362 + + +I+ TDG I + + + ++ L G+G++ N M Sbjct: 434 TSPD-DDSRVRQIVFLTDGA----IGNEAELFAAIEAGLGRS-RLFPVGIGSAP-NGYFM 486 Query: 363 VRIADVGNGNYSYIDTLSEAQKVLNSEMRQMLITVAKDVKAQIEFNPAWVTEYRQIGYEK 422 R A +G G + I +SE + + + V D F ++E Sbjct: 487 SRAARLGRGTSTQIGQVSEVEARMEELFTALERPVMTD--LDALFPEGALSEI------- 537 Query: 423 RQLRVEHFNNDNVDAGDIGAGKHITLLFELTLNGQ----KASIDKLRYAPDNKLAKSDKT 478 D+ G+ +TL L + R+ LA + + Sbjct: 538 ----------WPAPLPDLYYGEPVTLTARLASRNGNMVIEGETAGARWRETLSLADAHEG 587 Query: 479 KELAWLKIRWKYPQGKESQL 498 +A L R + +E++ Sbjct: 588 HGIATLFGRNRIRALEETRF 607 >UniRef50_B1KDL1 LPXTG-motif cell wall anchor domain protein n=1 Tax=Shewanella woodyi ATCC 51908 RepID=B1KDL1_SHEWM Length = 739 Score = 179 bits (454), Expect = 3e-43, Method: Composition-based stats. Identities = 71/473 (15%), Positives = 149/473 (31%), Gaps = 73/473 (15%) Query: 24 ENKESQQQQPSTPTEQQVLAAQQAAIKEAEQSAAAAKALAQQEVQQYSDKQALQGRLQEA 83 K+ + + + V + + + +AL + Q Sbjct: 138 AKKQFEMAKKAGKRASLVSQSTPNIFTTSVANLGPGEALVVEISYQELVSYKKGEFSLRF 197 Query: 84 PTFARAAKAKATHIANPGTARYQQFDDNPVKQVAQNPLATFS--LDVDTGSYANVRRFLN 141 P + + + + F+ + +D + ++ L+ Sbjct: 198 PMVVNPRYFSPLSMQEQKSRSVSFANTLATIENGTA----FNDHIGIDAAAKVSIEVELD 253 Query: 142 QG------LLPPPDAVRVEEIVNYFPS-------DWDIKDKQSIP------------ASK 176 G P + +++ D D K Sbjct: 254 AGVELGLISSPYHKVSHTQLSGSHYRVSLTGVKADRDFVLNWRPQLDTKPVAAVFSQQGK 313 Query: 177 PIPFAMRYELAPAPWN-----------EQRTLLKVDILAKDRKSEELPASNLVFLIDTSG 225 + + ++ P N E L + + D+K + + L+ +IDTSG Sbjct: 314 THSLSSKAQVEPTDSNASTKADSNKAVEDDYALLMLLPPSDQKQDVSISRELILVIDTSG 373 Query: 226 SMISDERLPLIQSSLKLLVKELREQDNIAIVTYAGDSRIALP---SISGSHKAEINAAID 282 SM + + +L + L+ +D ++ + + P + + N + Sbjct: 374 SMSGAS-IAQAKRALNYALAGLKAKDTFNVIEFNSNVGSLSPYSLPATAKNIGLANQYVR 432 Query: 283 SLDAEGSTNGGAGLELAYQ--QATKGFIKGGINRILLATDGDFNVGIDDPKSIESMVKKQ 340 SL A G T L A T+ + ++L TDG + D +S+ ++K++ Sbjct: 433 SLKANGGTEMQLALNAALDKGTETEALGSERLRQVLFMTDG----SVGDEQSLFHLIKQK 488 Query: 341 RESGVTLSTFGVGNSNYNEAMMVRIADVGNGNYSYIDTLSEAQKVLNSEMRQMLITVAKD 400 L T G+G++ N M R A+ G G ++YI L E Q + S + Q+ D Sbjct: 489 IGES-RLFTLGIGSAP-NSHFMRRAAEFGRGTFTYIGKLDEVQSKIESLLYQIERPQLTD 546 Query: 401 VKAQIEFNPAWVTEYRQIGYEKRQLRVEHFNNDNVDAGDIGAGKHITLLFELT 453 +K + + V +Y D+ A + + + ++ Sbjct: 547 IKLR--YADNRVPDYWPAM-----------------IPDLYAEEPLLVAIKMN 580 >UniRef50_UPI00017B0D26 UPI00017B0D26 related cluster n=2 Tax=Tetraodon nigroviridis RepID=UPI00017B0D26 Length = 856 Score = 179 bits (454), Expect = 3e-43, Method: Composition-based stats. Identities = 57/405 (14%), Positives = 131/405 (32%), Gaps = 19/405 (4%) Query: 166 IKDKQSIPASKPIPFAMRYELAPAPW-NEQRTLLKVDILAKDRKSEELPASNLVFLIDTS 224 ++ + F +RY++ + + L + K N+VF+IDTS Sbjct: 177 VQQARIATNGLLGDFVVRYDVQRDMGIGDVQVLDGHFVHYFAPKDLPAVPKNVVFVIDTS 236 Query: 225 GSMISDERLPLIQSSLKLLVKELREQDNIAIVTYAGDSRIALP----SISGSHKAEINAA 280 SM+ +++ + +L ++ +LR D ++++ R+ P + S + Sbjct: 237 ASMLG-KKMRQTKEALLTILGDLRPADRFNFISFSSRIRVWQPGRLVPATPSAVRDAKKF 295 Query: 281 IDSLDAEGSTNGGAGLELAYQQATKGFI-----KGGINRILLATDGDFNVGIDDPKSIES 335 + L G T+ ++ ++ I+ TDG VG P +I Sbjct: 296 VVMLPTSGGTDIDGAIQTGSSLLRDHLSGRDAGPNSVSLIIFLTDGQPTVGEVRPGAILG 355 Query: 336 MVKKQRESGVTLSTFGVGNSNYNEAMMVRIADVGNGNYSYIDTLSEAQKVLNSEMRQMLI 395 + + T G+G+ + + ++ R+A G I ++A +L ++ Sbjct: 356 NARAAVRDKFCIFTIGMGD-DVDYRLLERMALDNCGMMRRIPEEADASSMLKGFYDEIGT 414 Query: 396 TVAKDVKAQIEFNPA-WVTEYRQIGYEKRQLRVEHFNNDNVDAGDIGAGKH-----ITLL 449 + D++ + +VT++ Y V N A + ++ Sbjct: 415 PLLSDIRVNYTQDSVQYVTQHLFTNYFNGSELVIAGKLSNQSAPSLHVQVTASNSDKSIT 474 Query: 450 FELTLNGQKASIDKLRYAPDNKLAKSDKTKELAW-LKIRWKYPQGKESQLVEFPLGPTIN 508 E + + + ++ + + + L+ R + +E + Sbjct: 475 LETDVPLRHRQGETEKHVKAGFVERVWGFLSVKEGLRSRLRSQTSRERERHIQRATNLSL 534 Query: 509 APSEDMRFRAAVAAYGQKLRGSEYLNNTSWQQIKQWAQQAKGEDP 553 V L + + A + G P Sbjct: 535 TYHFLTPLTNLVVEKPGALADGAAVPPPTTADEVPEALEGAGSRP 579 >UniRef50_A1VI76 Vault protein inter-alpha-trypsin domain protein n=3 Tax=Burkholderiales RepID=A1VI76_POLNA Length = 701 Score = 179 bits (454), Expect = 3e-43, Method: Composition-based stats. Identities = 72/489 (14%), Positives = 146/489 (29%), Gaps = 37/489 (7%) Query: 22 QPENKESQQQQPSTPTEQQVLAAQQAAIKEAEQSAAAAKALAQQEVQQYSDKQALQGRLQ 81 Q E + T + + + + + L Sbjct: 131 QQARIEYDAARNEGKTAALLEQHLPNVFEMNVANILPGDDVKVELRYTE-----LLVPQS 185 Query: 82 EAPTFARAAKAKATHIANPGTARYQQFDDNPVKQVAQNPLATFSLDVDTGSYANVRRFLN 141 A F + + + Q+ P Q F + V + ++ + Sbjct: 186 GAYQFVFPTVVGPRYNSAQSSQALAQWVAQPFLPAGQASATAFDIKVKLATPIGIKEVSS 245 Query: 142 QGLLPPPDAVRVEEIVNYFPS------DWDIKDKQSIPASKPIPFAMRYELAPAPWNEQR 195 E S + D + + M Y+ P Sbjct: 246 HSHSIDVTKDGDERAAVSLRSGDKPGNNRDFILDYRLAGERIESGVMLYQGTPGNGASGE 305 Query: 196 TLLKVDI-LAKDRKSEELPASNLVFLIDTSGSMISDERLPLIQSSLKLLVKELREQDNIA 254 I K ++ + + +F++D SGSM L ++ ++ L+ +LR D Sbjct: 306 NFFLAMIEPPKQVAAQAISPRDYIFVVDISGSMHGF-PLDTAKTLMRELIGKLRPSDTFN 364 Query: 255 IVTYAGDSRIALP---SISGSHKAEINAAIDSLDAEGSTNGGAGLELAYQQATKGFIKGG 311 ++ ++G +R P + ++ + ID + G T L+ Y + Sbjct: 365 VLLFSGSNRFLSPASVPATQANIEQAVRTIDEMGGGGGTELIPALKRVY---AEPKAADV 421 Query: 312 INRILLATDGDFNVGIDDPKSIESMVKKQRESGVTLSTFGVGNSNYNEAMMVRIADVGNG 371 +++ TDG V + +V++ L +FG+G+S N +M +A G G Sbjct: 422 SRTVVVVTDGFVTV----EREAFELVRRNLSQ-ANLFSFGIGSS-VNRHLMEGLARAGMG 475 Query: 372 NYSYIDTLSEAQKVLNSEMRQMLITVAKDVKAQIEFNPAWVTEYRQIGYEKRQLRVEHFN 431 I S+A+ R + V VK + E G + + Sbjct: 476 EPFIITEPSQARAQAERFRRLIESPVLTSVKLRFE------------GLDVYDVEPAQLP 523 Query: 432 NDNVDAGDIGAGKHITLLFELTLNGQKASIDKLRYAPDNKLAKSDKTKELAWLKIRWKYP 491 + + + GK + +++ R A T L +L R + Sbjct: 524 DVLGERPVVVFGKWRGTPAGQVIIDGQSATGPYRNAVTVPAQNGQNTAALRYLWARHRIA 583 Query: 492 QGKESQLVE 500 + + +E Sbjct: 584 SLSDQEALE 592 >UniRef50_Q2R0C4 Expressed protein n=2 Tax=Oryza sativa Japonica Group RepID=Q2R0C4_ORYSJ Length = 629 Score = 178 bits (452), Expect = 4e-43, Method: Composition-based stats. Identities = 64/417 (15%), Positives = 134/417 (32%), Gaps = 36/417 (8%) Query: 138 RFLNQGLLPPPDAVRVEEIVNYFPSDWDIKDKQSIPASKPIPFAMRYELAPAPWNEQRTL 197 + L P ++ + P+ + S AS + + + P E Sbjct: 40 SLIISSELVPTLLFALDLQIQLGPTMYINLAPVSALASDKVQLSTFPRVDAIPRRECHPR 99 Query: 198 LKVDIL-AKDRKSEELPASNLVFLIDTSGSMISDER---LPLIQSSLKLLVKELREQDNI 253 L V + A + +LV L+D S L L++ ++ L++ L D + Sbjct: 100 LPVLVRVAVPATAARRAPVDLVTLLDISCGGGGGAPARRLDLLRKAMDLVIGNLGADDRL 159 Query: 254 AIVTYAGDS--RIALPSISGSHKAEINAAIDSLDAEGSTNGGAGLELAYQQATKG---FI 308 AIV + L +S + + + SL G T L A + Sbjct: 160 AIVPFHSSVVDATGLLEMSVEGRGVASRKVQSLAVAGGTKLFPALNAAVEILEARCWEAK 219 Query: 309 KGGINRILLATDGDFNVGIDDPKSIESMVKKQRESGVTLSTFGVGNSNYNEAMMVRIADV 368 + + ++L +DGD ++ ++ + FG ++ + +AD Sbjct: 220 RERVGAVVLISDGDD----------RTIFREAINPRYPVHAFGF-RGAHDARAVHHVADH 268 Query: 369 GNGNYSYIDTLSE-AQKVLNSEMRQMLITVAKDVKAQIEFNPAWVTEYRQI--GYEKRQL 425 +G Y +D + + +R++ VA D + + + + R Sbjct: 269 TSGVYGVLDDEHDRVTDAFAACVRRVTSVVAVDAQVDLTCGAYSRASLLAVERSGDHRAH 328 Query: 426 RVEHFNNDNVDAGDIGAGKHITLLFELTLNGQKASIDKLRYAPDNKLAKSDK-------- 477 E + + AG + AG L + ++ + + Sbjct: 329 VDEDRRSGFIYAGALCAGDVKNFLVYVDVDREADGGGVTELLTAHGTYMDAARRKETTVH 388 Query: 478 -TKELAWLKIRWKYPQGKESQLVEFPLGPTINAPS---EDMRFRAAVAAYGQKLRGS 530 + +A ++ R K P E T+ + + + + + AA +LR Sbjct: 389 LDERMAVVQRRDKVPDVSRDVAAELVRVDTVKMVAVVLDRFKDKGSAAA-AMELREG 444 >UniRef50_Q562D1 LOC594926 protein (Fragment) n=18 Tax=Euteleostomi RepID=Q562D1_XENTR Length = 895 Score = 178 bits (452), Expect = 4e-43, Method: Composition-based stats. Identities = 51/365 (13%), Positives = 124/365 (33%), Gaps = 23/365 (6%) Query: 162 SDWDIKDKQSIPASKPIPFAMRYELAPAPWNEQRTLLKVDILAKDRKSEELPASNLVFLI 221 D F + Y++ + + + + N++F+I Sbjct: 222 IDQQRSCSNCSTTQLDGDFTVTYDVNRETPGNIQVVNGYFVHFFAPSKLKEVPKNIIFII 281 Query: 222 DTSGSMISDERLPLIQSSLKLLVKELREQDNIAIVTYAGDSRIA---LPSISGSHKAEIN 278 D S SMI ++ + +L ++ +++E D+ V + I L + + Sbjct: 282 DRSISMIG-LKMQQTKEALLKILDDVKEHDHFNFVIFDWGVEIWEQSLVKATPENLNRAK 340 Query: 279 AAIDSLDAEGSTNGGAGLELAYQQATKGFIKGGINR-----ILLATDGDFNVGIDDPKSI 333 A + +L +G TN L A + + + I+ TDG + G + I Sbjct: 341 AYVRNLYPKGWTNINDALLSAISLLDQAHDARSVPKRSASLIIFMTDGQPSTGERNLDKI 400 Query: 334 ESMVKKQRESGVTLSTFGVGNSNYNEAMMVRIADVGNGNYSYIDTLSEAQKVLNSEMRQM 393 + + +L + G G + + +++ +G I S+A + ++ Sbjct: 401 QENARNAIRGKYSLYSLGFGVG-VDYPFLEKLSLENSGVARRIYEESDAALQMEGFYDEV 459 Query: 394 LITVAKDVKAQIEFNPAWVTEYRQIGYEKRQLRVEHFNNDNVDAGDI--------GAGKH 445 D++ Q + +++ Q ++ E + D+ G+ Sbjct: 460 ANPTLMDIELQ--YPENAISDVTQNKFKHYFDGSEIVVAGRITDNDLNWLTADVTAEGEE 517 Query: 446 ITLLFE--LTLNGQKASIDKLRYAPDNKLAKSDKTKEL-AWLKIRWKYPQGKESQLVEFP 502 TL + + + ++++ +Y + + + L+ R P ++ L Sbjct: 518 DTLNYTHSAKVQEESEAVEQQQYIFGDFTERLWAYLTIQQLLEKRISAPANEKGNLTARA 577 Query: 503 LGPTI 507 L ++ Sbjct: 578 LALSL 582 >UniRef50_Q5RHF3 Novel protein similar to vertebrate inter-alpha (Globulin) inhibitor H5 (ITIH5) (Fragment) n=2 Tax=Danio rerio RepID=Q5RHF3_DANRE Length = 906 Score = 178 bits (452), Expect = 5e-43, Method: Composition-based stats. Identities = 65/414 (15%), Positives = 141/414 (34%), Gaps = 33/414 (7%) Query: 146 PPPDAV--RVEEIVNYFPSDWDIKDKQSIPASKPIPFAMRYELAPAPW-NEQRTLLKVDI 202 PP + + E + + + F + Y++ + + + Sbjct: 180 PPVSTIVKQNETFCKISFTPNIAQQAKIATNGMLGEFVVHYDVERETGIGDIQVQDGHFV 239 Query: 203 LAKDRKSEELPASNLVFLIDTSGSMISDERLPLIQSSLKLLVKELREQDNIAIVTYAGDS 262 + + N+VF+IDTS SM+ ++ + +L ++ ELR DN VT++ Sbjct: 240 HYFAPRDLPVVPKNVVFVIDTSASMLG-TKMKQTKQALFTIINELRPNDNFNFVTFSNRI 298 Query: 263 RIALP----SISGSHKAEINAAIDSLDAEGSTNGGAGLELAYQQATKGFIK------GGI 312 R+ P ++ + I + G T+ G++ + + Sbjct: 299 RVWQPGKLVPVTPISIRDAKKFIYMISVTGGTDINGGIQTGSALLSDYLSSKDESHHHSV 358 Query: 313 NRILLATDGDFNVGIDDPKSIESMVKKQRESGVTLSTFGVGNSNYNEAMMVRIADVGNGN 372 + I+ TDG VG+ +I S K + L T G+G+ + + ++ R++ G Sbjct: 359 SLIIFLTDGRPTVGVLQSPTIISNTKTAVQEKFCLFTIGMGD-DVDYRLLERMSLDNCGT 417 Query: 373 YSYIDTLSEAQKVLNSEMRQMLITVAKDVKAQIEFNPAWVTEYRQIGYEKRQLRVE---- 428 I ++A +L ++ + D ++E++ V Q + E Sbjct: 418 MRRIPEDADASLMLKGFYDEIGTPLLSD--IRVEYSEDAVEYITQNLFTNYFNGSELIVA 475 Query: 429 ---HFNNDNVDAGDIGAGKHITLLFELTLN-GQKASIDKLRYAPDNKLAKSDKTKELAW- 483 +D++ + +++ E ++ ++ + R A +KSD E W Sbjct: 476 GKLTNRSDSLHVQVTASNSDRSIIMEKDVSIQEREQETERRLAEAEAGSKSDGYVERLWG 535 Query: 484 -------LKIRWKYPQGKESQLVEFPLGPTINAPSEDMRFRAAVAAYGQKLRGS 530 LK R + E + + + + L Sbjct: 536 FLSVKDGLKGRLRSQTSGERENYTQHATNLSLTYNFLTPLTQMIVEKPEVLADG 589 >UniRef50_A6G9E8 von Willebrand factor type A domain protein n=1 Tax=Plesiocystis pacifica SIR-1 RepID=A6G9E8_9DELT Length = 532 Score = 178 bits (451), Expect = 5e-43, Method: Composition-based stats. Identities = 82/385 (21%), Positives = 145/385 (37%), Gaps = 26/385 (6%) Query: 205 KDRKSEELPASNLVFLIDTSGSMISDERLPLIQSSLKLLVKELREQDNIAIVTYAGDSRI 264 NL +ID SGSM + ++ LR+ D +++V+Y + Sbjct: 125 DAVARTSAEPLNLAIVIDHSGSMKGQRE-RNALDAAAGMISRLRDGDTVSVVSYNTKAHT 183 Query: 265 ALPSISGSHKAEINAAIDSLDAE------GSTNGGAGLELAYQQATKGFIKGGINRILLA 318 +P + + + I L G+T G+E Q GI+R+LL Sbjct: 184 IVPVTTLDARNR-DRVISDLRVGVASRPSGNTCVSCGVEAGLQTLQGRRP--GIDRMLLL 240 Query: 319 TDGDFNVGIDDPKSIESMVKKQRESGVTLSTFGVGNSNYNEAMMVRIADVGNGNYSYIDT 378 +DG+ N G+ D I + ++ R GV++S+ GV + +YNE +M IA NG + + +T Sbjct: 241 SDGEANRGVRDEPGIRRLAREARNRGVSISSIGV-DVDYNEVLMSAIAREANGRHYFSET 299 Query: 379 LSEAQKVLNSEMRQMLITVAKDVKAQIEFNPAWVTEYRQIGYEKRQLRVEHFNNDNVDAG 438 S + + E+ ++ +AKD + +E P V +Q+ V G Sbjct: 300 GSNLDAIFDQELDSLIQAIAKDGQVIVELAPG-VRVVEVFDRSYQQVD----RRVIVPMG 354 Query: 439 DIGAGKHITLLFELTLNGQKASIDKLRYAPD--NKLAKSDKTKELAWLKIRWKYPQGKES 496 AG+ TLL L + L + + L+ + + L + S Sbjct: 355 TFSAGEDKTLLMRLEVPPSPEGSRPLAHVSLSYDDLSTREPGECFGELATVMTATPAEVS 414 Query: 497 QLVEFPLGPTINAPSE------DMRFRAAVAAYGQKLRGSEYLNNTSWQQIKQWAQQAKG 550 L L + + + F A Q+L E+L + K ++A G Sbjct: 415 PLDAIVLARLTKSETAKTLQQSNALFALGRADEAQRLVE-EHLGTVRERHRKTRKKKASG 473 Query: 551 EDPQGYRAEFIRLI-ELADGVTDIS 574 + +F + A + S Sbjct: 474 GLIDPFNRDFDEAFNDQASVLEGAS 498 >UniRef50_B4VT64 Vault protein inter-alpha-trypsin n=9 Tax=Cyanobacteria RepID=B4VT64_9CYAN Length = 1037 Score = 178 bits (451), Expect = 5e-43, Method: Composition-based stats. Identities = 83/553 (15%), Positives = 174/553 (31%), Gaps = 67/553 (12%) Query: 21 PQPENKESQQQQPSTPTEQQVLAAQQAAIKEAEQSAAAAKALAQQEVQQYSDKQALQGRL 80 Q + ++ + + T + + ++ + +++ S K Sbjct: 445 RQEAKQIYEKAKKAGKTAGLLEQERANVFTQSLANIKPGESIQVTIRYTDSLKFEGGDYE 504 Query: 81 QEAPTFARAAKAKATHIANPG---------------------------TARYQQFDDNPV 113 P + + T + NP Sbjct: 505 FAFPMVVAPRYTAGNSVGSAKAPTTNSVGSKHFSASSAKAPTTNKTLMTNVAYAAEVNPP 564 Query: 114 KQVAQNPLATFSLDV--DTGSY-ANVRRFLNQGLLPP-PDAVRVEEIVNYFPSDWDIKDK 169 + V D G ++VR + VRVE + D+ + Sbjct: 565 IAPPGRSGHDIDVTVEIDAGVPISSVRSPSHPVTTQQTSSTVRVELADQETIPNKDLILR 624 Query: 170 QSIPASKPIPFAMRYELAPAPWNEQRTLLKVDILAKDRKSEELPASNLVFLIDTSGSMIS 229 + + + I A + + E+ ++VFL+DTSGS S Sbjct: 625 YQVAGADTQ-----ATVLTQADERGGHFATYLIPAIEYQQNEIVPKDVVFLVDTSGS-QS 678 Query: 230 DERLPLIQSSLKLLVKELREQDNIAIVTYAGDSRIAL--PSI-SGSHKAEINAAIDSLDA 286 + + ++ ++ L QD I+ +A + P + ++ + I+ LDA Sbjct: 679 GSPIVQSKELMRQFIQGLNPQDTFTIIDFANSTTQLSDKPLANTPQNRKKALNYINRLDA 738 Query: 287 EGSTNGGAGLELAYQQATKGFIKGGINRILLATDGDFNVGIDDPKSIESMVKKQRESGVT 346 G T G++ G + ++L TDG I D + I + ++ + + G Sbjct: 739 NGGTELMNGIDTVLNF--PAAPAGRLRSVVLLTDGL----IGDDEQIIAEIRDRLKPGNR 792 Query: 347 LSTFGVGNSNYNEAMMVRIADVGNGNYSYIDTLSEAQKVLNSEMRQMLITVAKDVKAQI- 405 L +FGVG+S N ++ R+A++G G + A+ V +++ V +++ Sbjct: 793 LYSFGVGSST-NRFLIERLAELGRGTAEVVPPNESAEVVAQEFFQEINNPVLTNIQVSWE 851 Query: 406 ------EFNPAWVTEYRQIGYEKRQLRVEHFNNDNVDA-GDIGAGKHITLLFELTLNGQK 458 EF P V + R N + G + G+ ++ + + Sbjct: 852 GTGNAPEFYPQKVRDLFANQPLVVFGRKGDRTNGKLKISGTVAGGQPYETSLDVNFDEVR 911 Query: 459 ASIDKLRYAPDNKLAKSDKTKELAWLKIRWKYPQGKESQLVEFPLGPTINAPSEDMRFRA 518 + + A +K G+E+ V + T ++ + Sbjct: 912 GNGAIAQLW------------GRARIKALMNQMYGRETPEVVEAVTDTALNYRLMSKYTS 959 Query: 519 AVAAYGQKLRGSE 531 VA + S+ Sbjct: 960 FVAVTEEIRVDSK 972 >UniRef50_Q2QZN4 von Willebrand factor type A domain containing protein n=2 Tax=Oryza sativa Japonica Group RepID=Q2QZN4_ORYSJ Length = 574 Score = 178 bits (451), Expect = 5e-43, Method: Composition-based stats. Identities = 70/428 (16%), Positives = 149/428 (34%), Gaps = 67/428 (15%) Query: 182 MRYELAPAPWNEQRTLLKVDILAKDRKSEELPASNLVFLIDTSGSMISD---------ER 232 +R +A + +L K E+ +LV ++D SGSM + R Sbjct: 71 IRAAIARDQRKDDFEVLVTVEAPKVVAPEKRAPIDLVAVLDVSGSMNKEEFVRGKHMSSR 130 Query: 233 LPLIQSSLKLLVKELREQDNIAIVTYAGDS--RIALPSISGSHKAEINAAIDSLDAEGST 290 L L++ ++K ++K +R+ D +AIV++ L S + ++ +D L A G+T Sbjct: 131 LDLLKIAMKYIIKLVRDADRLAIVSFNHAVVSEYGLTRNSADSRKKLENLVDKLKASGNT 190 Query: 291 NGGAGLELAYQQAT--------------------KGFIKGGINRILLATDGDFNVGID-- 328 + L+ A + K K + ILL +DG Sbjct: 191 DFRPALKKAVEDMNIQNIKNSSAYNNFQILDGRGKEEKKKRVGFILLLSDGVDQFQYSRI 250 Query: 329 -----------DPKSIESMVKKQRESGVTLSTFGVGNSNYNEAMMVRIADVGNGNYSYI- 376 D + +M++K + TFG +++++ + +I+ + G YS++ Sbjct: 251 NWEKVAKSTDVDHSEVGAMLRKYA-----VHTFGF-SASHDPVPLRQISALSYGLYSFVC 304 Query: 377 DTLSEAQKVLNSEMRQMLITVAKDVKAQIEF---------NPAWVTEYRQIGYEKRQLRV 427 L + + + VA +++ ++ P + GYE + + Sbjct: 305 KNLDNITEAFARCLGGLRTVVAAEIRVDLKPKSSDKQQQQQPVLIKSIDSGGYESQVI-- 362 Query: 428 EHFNNDNVDAGDIGAGKHITLLFELTLNGQKAS----IDKLRYAPDNKLAKSDKTKELAW 483 + + + + + L + A+ ++ A + + KT + Sbjct: 363 GGGTSGKILIPVLYVDEVKKFIVHLKVPKVSATTVNNQQEILTADGDANSVDGKTVRIKE 422 Query: 484 LKIRWKYPQGKESQLVEFPLGPTINAPSEDMRFRAAVAAYGQKLRGSEYLNNTSWQQIKQ 543 K+ + P Q P + + V Q+ R ++ + + Q Sbjct: 423 HKLAIRRPPEVVDQADLRPAPQVVEQVVV-FKLLDMVPKTFQQRREDDHKGVKNTKVAVQ 481 Query: 544 WAQQAKGE 551 Q+ E Sbjct: 482 LLQRNMDE 489 >UniRef50_UPI000180CCF8 PREDICTED: similar to PK-120 n=1 Tax=Ciona intestinalis RepID=UPI000180CCF8 Length = 864 Score = 178 bits (450), Expect = 7e-43, Method: Composition-based stats. Identities = 46/270 (17%), Positives = 101/270 (37%), Gaps = 14/270 (5%) Query: 151 VRVEEIVNYFPSDWDIKDKQSIPASKPIPFAMRYELAPAP-WNEQRTLLKVDILAKDRKS 209 +R E Y + +++I + F + Y++ E + + Sbjct: 233 IRRSETQAYVSYRPTREQQRNIRRRSDLSFLVNYDVTREELGGEILIKDGYFVHFFAPTN 292 Query: 210 EELPASNLVFLIDTSGSMISDERLPLIQSSLKLLVKELREQDNIAIVTYAGDSRIALP-- 267 + +VF+ID SGSM ++ + +L+ ++ +L E D I+T++ + + P Sbjct: 293 LPVIPKKVVFVIDVSGSMSG-HKIVQTKEALRTILDDLNEIDQFNIITFSSTTNVWHPNE 351 Query: 268 --SISGSHKAEINAAIDSLDAEGSTNGGAGLELAYQQATKGFIKGGIN------RILLAT 319 ++ ++ + S+ A G TN A Q + N ++L T Sbjct: 352 MVDVNPTNIRNAKKHVRSMYARGGTNFNAAALDGIQLL-ETISSNRTNTLEEASMMILLT 410 Query: 320 DGDFNVGIDDPKSIESMVKKQRESGVTLSTFGVGNSNYNEAMMVRIADVGNGNYSYIDTL 379 DG VG+ ++I ++++ ++ G G + + + +IA G I Sbjct: 411 DGQPTVGVTGNEAIRRNIRERVNGRYSIFCLGFGQ-HLDHEFLDQIASENKGLSRKIYND 469 Query: 380 SEAQKVLNSEMRQMLITVAKDVKAQIEFNP 409 ++A L ++ + V + Sbjct: 470 ADAALQLKDFYDEVASPLLAHVIMRYTGPD 499 >UniRef50_D2R2I7 von Willebrand factor type A n=2 Tax=Planctomycetaceae RepID=D2R2I7_9PLAN Length = 786 Score = 178 bits (450), Expect = 7e-43, Method: Composition-based stats. Identities = 80/448 (17%), Positives = 143/448 (31%), Gaps = 44/448 (9%) Query: 144 LLPPPDAVRVE-EIVNYFPSDWDIKDKQSIPASKPIPFAMRYELAPAPWNEQRTLLKVDI 202 P V+ E NY P D + + P A L+ P N + Sbjct: 239 KRPDEKHATVKFEASNYLP----TTDFRLLYDVGDAPLAASV-LSYRPDNSDEGFFLMLA 293 Query: 203 LA-KDRKSEELPASNLVFLIDTSGSMISDERLPLIQSSLKLLVKELREQDNIAIVTYAGD 261 + +L ++F++D SGSM +++ + +++ ++ L E D IV Y Sbjct: 294 SPNHSQGEVDLTKKTVIFVVDRSGSMQG-KKIEQAREAMRYVLNNLHEGDTFNIVAYDST 352 Query: 262 SRIALPSI---SGSHKAEINAAIDSLDAEGSTNGGAGLELAYQQATKGFIKGGINRILLA 318 P + + + A +D L A GSTN L+ A+ T N IL Sbjct: 353 VESFKPELQKFDDATRKSALAYVDGLYAGGSTNISGALDSAFAMLTGS---DRPNYILFL 409 Query: 319 TDGDFNVGIDDPKSIESMVKKQRESGVTLSTFGVGNSNYNEAMMVRIADVGNGNYSYIDT 378 TDG G + I + K++ + FGVG + N ++ R++ G Y+ Sbjct: 410 TDGLPTAGETNEGKIVELAKQKNVHRARMINFGVG-YDVNSRLLDRMSRENFGQSQYVRP 468 Query: 379 LSEAQKVLNSEMRQMLITVAKDVKAQIEF-------------NPAWVTEYRQIGYEKRQL 425 + ++ +M V DVK I+ P V + Sbjct: 469 DENLEASVSRLYSKMSSPVLTDVKVSIDIEGAGDSSSAVNRMYPKQVMDIFSGEQLVIAG 528 Query: 426 RVEHFNNDNVDAGDIGAGKHITL-----LFELTLNGQKASIDKLRYAPDNKLAKSDKTKE 480 R + N + G+ E +++ ++KL + + Sbjct: 529 RYKKSGNAKITLSGKLKGEDKKFDFPASFVEKSIDQTHGFVEKLWAMRRIGEIIDEIDLK 588 Query: 481 LAWLKIRWKYPQGKESQLVEFPLGPTINAPSEDMRFRAAVAAYGQKLRGSEYLNNTSWQQ 540 ++ + + P A + L+ + Sbjct: 589 GMNDELVKELVALSTKHGILTPYTSF---------LADDQAKPSELADSRRNLDRANLS- 638 Query: 541 IKQWAQQAKGEDPQGYRAEFIRLIELAD 568 + QA G+ RAE +L E A Sbjct: 639 -LKQLDQAGGQSGFAQRAEKKQLQEAAG 665 >UniRef50_Q24FW2 von Willebrand factor type A domain containing protein n=1 Tax=Tetrahymena thermophila SB210 RepID=Q24FW2_TETTH Length = 1074 Score = 178 bits (450), Expect = 7e-43, Method: Composition-based stats. Identities = 60/440 (13%), Positives = 153/440 (34%), Gaps = 46/440 (10%) Query: 158 NYFPSDWDIKDKQSIPASKPIPFAMRYELAPAPWNEQRTLLKVDILAK---DRKSEELPA 214 + F ++ D Q + I Q+ + + + K D K+ + P Sbjct: 304 SSFNPNFISTDHQMLENQLEINIQSTQNYIQLFEQSQQIPVMISLNTKGNFDAKAYQRPP 363 Query: 215 SNLVFLIDTSGSMISDERLPLIQSSLKLLVKELREQDNIAIVTYAGDSR-IALPSISGSH 273 +L+ ++D SGSM E++ +++ +L L+ +L E+D + +V + + + S+ ++ Sbjct: 364 IDLICVMDNSGSMHG-EKINMLKETLLYLIDQLDEKDRLGLVLFNSEVTFRPMKSMDTTN 422 Query: 274 KAEINAAIDSLDAEGSTNGGAGLELAYQQATKGFIKGGINRILLATDGDFNVGIDDPKSI 333 K ++ I + A+G T+ G+ A++ + + L +DG + Sbjct: 423 KLKLKQYISDIRAQGGTDINLGMTEAFKFIKTRKYCNPVTSVFLLSDGLD--SKAQDRVA 480 Query: 334 ESMVKKQRESGVTLSTFGVGNSNYNEAMMVRIADVGNGNYSYIDTLSEAQKVLNSEMRQM 393 ++ +++ FG G +++ +M +I + + + + Sbjct: 481 VTLKNMSINEQFSINCFGFGR-DHDPILMNQI-----------KKIDQVDMFFVDALGGL 528 Query: 394 LITVAKDVKAQIEFNPAW---VTEYRQIGYEKRQLRVEHFNND---NVDAGDIGAGKHIT 447 + +DV +++ V R G L + N + + G Sbjct: 529 FSVIGQDVLIKVKSVKELSKDVNIVRNYGDMWHLLTDQDTNGGWEYCIKLNHLLLGTSKD 588 Query: 448 LLFELTLN----------GQKASIDKLRYAPDNKLAKSDKTKELAWLKIRWKYPQGKESQ 497 + EL + G + + + ++ + K A L ++ + Sbjct: 589 YMCELFIPAFVNQNFVQNGAEMAFVIVEITAFTIASQPQRIKRQAILNLQVYSQNAVIPK 648 Query: 498 LVEFPLGPTINAPSEDMRFRAAVAAYGQKLRGSEYL--NNTSWQQIKQWAQQAKGEDPQG 555 ++ + ++ A+ ++ + Y ++++ + G Sbjct: 649 QIKDVYNKEVQMNFLRVKGAKALQDCIEEAQNKRYKQGQQILAERLEDIERAQLGG---- 704 Query: 556 YRAEFI----RLIELADGVT 571 +AE + LIE + Sbjct: 705 -QAELLMLRNDLIESRQALQ 723 >UniRef50_B4UFP8 von Willebrand factor type A n=1 Tax=Anaeromyxobacter sp. K RepID=B4UFP8_ANASK Length = 480 Score = 177 bits (449), Expect = 9e-43, Method: Composition-based stats. Identities = 61/360 (16%), Positives = 137/360 (38%), Gaps = 16/360 (4%) Query: 182 MRYELAPAPWNEQRTLLKVDILAKDRKSEELPASNLVFLIDTSGSMISDERLPLIQSSLK 241 + YE + L+ + E ++ ++D SGSM E+L S+ Sbjct: 7 LTYEKVRFDEAKDAHLVVSLVAPHGNARAERSPVCVIPVLDVSGSMHG-EKLHFATQSIM 65 Query: 242 LLVKELREQDNIAIVTYAGDSRIALPS--ISGSHKAEINAAIDSLDAEGSTNGGAGLELA 299 LV L D +V ++ + ++ K + A+ L +TN GL Sbjct: 66 KLVDHLAPGDFCGVVVFSTEVETLAAPTEMTQDRKDALKVALGRLRPRHNTNLAGGLLAG 125 Query: 300 YQQATKGFIKGGIN-RILLATDGDFNVGI-DDPKSIESMVKKQRESGVTLSTFGVGNSNY 357 A + G+ R++L TDG N G P+ + ++++ + ++S FG G+ + Sbjct: 126 LDHAKVTKVPDGMPVRVILFTDGLANEGPATSPEGLCALLEANLGT-ASVSAFGYGD-DA 183 Query: 358 NEAMMVRIADVGNGNYSYIDTLSEAQKVLNSEMRQMLITVAKDVKAQIEFNPAWVTEYRQ 417 ++ ++ ++ +G GNY+Y+ + +A E+ +L T A+ ++ ++ + Sbjct: 184 DQELLRELSTLGRGNYAYVRSPEDALTAFARELGGLLSTYAQRIEVRVAPCEGA----QL 239 Query: 418 IGYEKRQLRVEHFNNDNVDAGDIGAGKHITLLFELTLNGQKASIDKLRYAPD-----NKL 472 + + D+ + L+ L + +D + ++ Sbjct: 240 TEVVSDVDARDEGGTAVIRVPDLLVDEVRHLVLGARLAPRPVPLDAPVAVAEIEVIFERV 299 Query: 473 AKSDKTKELAWLKIRWKYPQGKESQLVEFPLGPTINAPSEDMRFRAAVAAYGQKLRGSEY 532 +E K ++ + ++Q V P + A ++ +R + ++ R E Sbjct: 300 ENGRVIREQPACKASVRFVEAADAQAVPTPGVDEVVAVAQLVRKQIEAEEAARQGRYREA 359 >UniRef50_Q64D90 Cell surface protein n=8 Tax=environmental samples RepID=Q64D90_9ARCH Length = 1359 Score = 177 bits (448), Expect = 1e-42, Method: Composition-based stats. Identities = 77/489 (15%), Positives = 161/489 (32%), Gaps = 48/489 (9%) Query: 36 PTEQQVLAAQQAAIKEAEQSAAAAKALAQQEVQQYSDKQALQGRLQEAPTFARAAKAKAT 95 P ++ + A Q V + ++ + + P A Sbjct: 781 PGAERTGSFTYNWTYTPPGDNITICADYQNTVTELNETNNCRSKTLLYPPPTSGAGITHE 840 Query: 96 HIANPGTARYQQFDDNPVKQVAQNPLATFSL-DVDTGSYANVRRFLNQGLLPPPDAVRVE 154 + P + + + + K + +T DV+ N R + LP P + E Sbjct: 841 EVYCPSPSYFSGYSTSFSKNIG---FSTGGAKDVN-----NFRENIGNDYLPLPTDITYE 892 Query: 155 EIVNYFPSDWDIKDKQSIPASKPIPFAMRYELAPAPWNEQ-RTLLKVDILAKDRKSE-EL 212 + D+ + + Y L+ P +E L V + + +S+ + Sbjct: 893 GLF----YDYYFDTGEKAECQNLFCPSYSYALSKDPVSEVLGYYLSVGLNSGIIESDFQR 948 Query: 213 PASNLVFLIDTSGSMISD---------------ERLPLIQSS--------LKLLVKELRE 249 NL ++D SGSM S + S + L+ L + Sbjct: 949 KKLNLALVLDISGSMGSSFDEYYYDRFGNHVAVNDTEDAEKSKIEIAAAAIVALLDHLED 1008 Query: 250 QDNIAIVTYAGDSRIALP--SISGSHKAEINAAIDSLDAEGSTNGGAGLELA---YQQAT 304 D + +V + + +A P + + ++ + + A G T AG+++A Y + Sbjct: 1009 DDRLGLVLFNTGAELAEPVSLVGAKNMQKLKGDVLEISATGGTRLSAGMQMATELYDEFL 1068 Query: 305 KGFIKGGINRILLATDGDFNVGIDDPKSIESMVKKQRESGVTLSTFGVGNSNYNEAMMVR 364 + NRI+ TD N G +S+ M++ V + G+G ++N ++ Sbjct: 1069 EVNQSEYENRIIFLTDAMPNSGQTSEESLLGMIEANANKNVYTTFIGIGV-DFNTELVEY 1127 Query: 365 IADVGNGNYSYIDTLSEAQKVLNSEMRQMLITVAKDVKAQIEFNPAWVTEYRQIGYEKRQ 424 I + NY + + ++ ++ ++ E M+ + D Q+ + A + G + Sbjct: 1128 ITKIRGANYYSVHSATQFKERMDDEFEYMVTPLVFD--LQLNSDAAGYEIEKVYGSPEAD 1185 Query: 425 LRVEHFNNDNVDAGDIGAG-KHITLLFELTLNGQKASIDKLRYAPDNKLAKSDKTKELAW 483 G + L L L + L+ + + A Sbjct: 1186 EATGEIMKVKTLFPSKKEGNETRGGLVLLKLRRISP-ENSLKLNVSYEDRNGVSGSDEAT 1244 Query: 484 LKIRWKYPQ 492 + + K P Sbjct: 1245 VVLEEKEPD 1253 >UniRef50_B2AQN8 Predicted CDS Pa_4_3600 n=1 Tax=Podospora anserina RepID=B2AQN8_PODAN Length = 648 Score = 177 bits (448), Expect = 1e-42, Method: Composition-based stats. Identities = 63/328 (19%), Positives = 119/328 (36%), Gaps = 72/328 (21%) Query: 166 IKDKQSIPASKPIPFAMRYE-----------LAPAPWNEQRTLLKVDILAKDRKSEEL-- 212 + + P++ +R L P + +L K+ + E+L Sbjct: 6 LCNWGRTPSASNSTLPLRTREKEELPRSQSMLQIHPLETEDGVLIKIDPPKEPELEDLRE 65 Query: 213 ---PASNLVFLIDTSGSMISD----------------ERLPLIQSSLKLLVKELREQDNI 253 +LV ID SGSM +D L L++ + K +++ L + D + Sbjct: 66 RNHVPLDLVLSIDVSGSMGADAPVPAKNGTEGEHYGLSVLDLVRHAAKTILETLDDHDRL 125 Query: 254 AIVTYAGDSRIA--LPSISGSHKAEINAAIDSLDAEGSTNGGAGLELAYQQA-------- 303 IVT++ S++ L ++ ++KA+I +D+L TN G+ Sbjct: 126 GIVTFSTSSKVVRELTYMTPANKAKILKQLDALQPLSMTNLWHGIRDGLSLFNNNLKAVN 185 Query: 304 -TKGFIKGGINRILLATDGDFNVGIDDPKSIESMVKKQRESGVTLSTFGVGNSNYNEAMM 362 + G + +L+ TDG N + + + ++ ++ TFG G S ++ Sbjct: 186 DRRNPGSGRVPALLVLTDGMPNHQCPNQGYVAKL-RQWSTLPASIHTFGFGYS-LRSGLL 243 Query: 363 VRIADVGNGNYSYIDTLSEAQKVLNSEMRQMLITVAKDVKAQIEFNPAWVTEYRQIGYEK 422 IA+VG GNYS+I T V+ Q F+P+ + Y Sbjct: 244 KSIAEVGGGNYSFIPDAGMI-------------TTGDAVEKQQPFSPSGDDSPNKTLY-- 288 Query: 423 RQLRVEHFNNDNVDAGDIGAGKHITLLF 450 + G++ G+ + Sbjct: 289 ------------ISLGNLQYGQSREIYL 304 >UniRef50_A6G2V8 von Willebrand factor, type A n=1 Tax=Plesiocystis pacifica SIR-1 RepID=A6G2V8_9DELT Length = 877 Score = 177 bits (448), Expect = 1e-42, Method: Composition-based stats. Identities = 78/551 (14%), Positives = 165/551 (29%), Gaps = 73/551 (13%) Query: 21 PQPENKESQQQQPSTPTEQQVLAAQQAAIKEAEQSAAAAKALAQ-----QEVQQYSDKQA 75 + + + + + + + + + + + Q ++Q + Sbjct: 160 REDAQQTYEDAKKAGKAAGLLEQERPNIFTQRVANIPPGQTIEVSMHVVQPLEQEDGRYE 219 Query: 76 LQGRLQEAPTFARAAKAKATHIANPGTARYQQFDDNPVKQVAQNPLATFSLDVDTGSYAN 135 L P F PG + + V ++ T ++ Sbjct: 220 LVLPTVVGPRFIPGTPLAQHRQPAPGENTGIAPNTDEVPDASRITAPVVPEGFTTCAHVE 279 Query: 136 VRRFLNQGLLPPPDAVRVEEIVNYFPSDWDIK----DKQSIPASKPIPFAMRYEL----- 186 ++ GL P + I D D P F + ++L Sbjct: 280 ASVVIDTGLRPRRIQSKFHGIDIMRSGDVAAIELDADSDGAPVVANRDFVVSWDLGRDQP 339 Query: 187 -------APAPWNEQRTLLKVDILAKDRKSEELPASNLVFLIDTSGSMISDERLPLIQSS 239 P + E+ A LVF++D SGSM + + Sbjct: 340 KAAIVAQPPTSEGGDGYFTLTVQPPEQVADEQAVARELVFVVDNSGSM-GGLPMDTAKGL 398 Query: 240 LKLLVKELREQDNIAIVTYAGDSRIA---LPSISGSHKAEINAAIDSLDAEGSTNGGAGL 296 ++ +K++R D ++ ++ + L + + +D++ G T G+ Sbjct: 399 MRKALKDIRPDDTFTVLRFSESASGLSNKLLPATQDNIEAGVDYVDAMQGMGGTQMTEGI 458 Query: 297 ELAYQQATKGFIKGGINRILLATDGDFNVGIDDPKSIESMVKKQRESGVTLSTFGVGNSN 356 + A + + ++ TDG I + ++I ++ L + GVG + Sbjct: 459 KAALRV---PHDPDRLRVVMFLTDGY----IGNEQAIFELIDDNI-GDARLFSLGVGGAP 510 Query: 357 YNEAMMVRIADVGNGNYSYIDTLSEAQKVLNSEMRQMLITVAKDVKAQIEFNPAWVTEYR 416 N ++ +A VG G +Y A V+ ++ V DV+ Sbjct: 511 -NRYLLDGMASVGRGAVTYAGYDEPADPVIERFYERVATPVLTDVEIDW----------- 558 Query: 417 QIGYEKRQLRVEHFNNDNVDAGDIGAGKHITLLFELTLNGQKASIDKLRYAPDNKLAKSD 476 + L VE + D+ AG+ IT+ + + A++ Sbjct: 559 ------QGLAVEEVYPGKI--PDLFAGQPITVF-------GRYAGAPTGEIVIKAKARTA 603 Query: 477 KTKELAWLKIRWKYPQGKESQLVEFPLGPTINAPSEDMRFRAAVAAYGQKLRGSEY--LN 534 + E L + + + + + V G D A G +R + L Sbjct: 604 EGVETIELPVHFDVAKADDVEGV----GSVWARTKID-------ALMGYPMRPDPWSPLG 652 Query: 535 NTSWQQIKQWA 545 + Q + A Sbjct: 653 KETKQAVIDVA 663 >UniRef50_C0HF51 Putative uncharacterized protein n=1 Tax=Zea mays RepID=C0HF51_MAIZE Length = 459 Score = 177 bits (448), Expect = 1e-42, Method: Composition-based stats. Identities = 69/391 (17%), Positives = 144/391 (36%), Gaps = 42/391 (10%) Query: 176 KPIPFAMRYELAPAPWNEQRTLLKVDILAKDRKSEELPASNLVFLIDTSGSMISDERLPL 235 I A Y E + V + A + ++V ++D SGSM +L Sbjct: 5 GDIDLAKAYHHVTVSMREHTEKVMVKLTAPHTGKGDTAPLDIVVVLDISGSMRG-TKLEH 63 Query: 236 IQSSL-KLLVKELR-EQDNIAIVTYAGDSRIA--LPSISGSHKAEINAAIDSLDAEGSTN 291 ++ ++ + ++++L D +AI+T+ + L S+ + A ++ L A G TN Sbjct: 64 MKHAMTRFIIEKLGIRGDRLAIITFESKAHKVFDLSSMLPDQVKKAVAVVEGLKAGGDTN 123 Query: 292 GGAGLELAYQQAT-KGFIKGGINRILLATDGDFNVGIDDPKSIESMVKKQRESGVTLSTF 350 AGLE + + I L +DG NV D +++ V + ++ TF Sbjct: 124 IKAGLEAGLDVLKTRRGHSHNASCIFLMSDGHENV--DKARTLLDRVGEH-----SVVTF 176 Query: 351 GVGNSNYNEAMMVRIADVGN-GNYSYI---DTLSEAQKVLNSEMRQMLITVAKDVKAQIE 406 G G + +E ++ IA + G Y ++ + ++ K + D+K + Sbjct: 177 GFGEKS-DEQLLYDIAYHSHAGTYHHVREKEDENQLMKAFA-FLAIYRSISMLDLKVTVS 234 Query: 407 FN-PAWVTEYRQIGYEKRQLRVEHF-NNDNVDAGDIGAGKHITLLFELTLNGQKASIDKL 464 + A R + + ++ H + V GD+ + +L ++ L Sbjct: 235 AHKEAAGAIIRGVDPCRYRVDGPHGDGSFTVHFGDLAREESRRILVDVQLP--------- 285 Query: 465 RYAPDNKLAKSDKTKELAWLKIRWKYPQGKESQ---LVEFPLGPTINAPSEDMRFRAAVA 521 ++ K K + +K + P+ + ++ + + +A Sbjct: 286 ------QVQHEQKDKPVIHVKYEYSSPKKVKVTNTLDIKMNRVQNPGQAAMNPNVSRELA 339 Query: 522 AYGQK--LRGS-EYLNNTSWQQIKQWAQQAK 549 Q LR + N + K ++A+ Sbjct: 340 RRAQVEHLRKVMDLANKKNLGAAKDEVERAR 370 >UniRef50_A6DLI0 Putative uncharacterized protein n=1 Tax=Lentisphaera araneosa HTCC2155 RepID=A6DLI0_9BACT Length = 833 Score = 177 bits (448), Expect = 1e-42, Method: Composition-based stats. Identities = 56/283 (19%), Positives = 113/283 (39%), Gaps = 20/283 (7%) Query: 199 KVDILAKDRKSEELPASNLVFLIDTSGSMISDERLPLIQSSLKLLVKELREQDNIAIVTY 258 + + ++ K +E P+ LV +ID SGSM + + L + + K + L +D + ++ + Sbjct: 396 VLPVTSRYEKEKEQPSLALVLVIDKSGSMNG-QPIVLAREASKAAAELLSSRDQVGVIAF 454 Query: 259 AGDSRIALPSISGSHKAEINAAIDSLDAEGSTNGGAGLELAYQQATKGFIKGGINRILLA 318 G +++ S ++K E+ + ID + A G TN + + G I +++ Sbjct: 455 DGSAKLVTDLTSAANKGEVLSQIDGIGAGGGTNLYPAMVMGRDML--GIASAKIKHMIVL 512 Query: 319 TDGDFNVGIDDPKSIESMVKKQRESGVTLSTFGVGNSNYNEAMMVRIADVGNGNYSYIDT 378 +DG G E + + + GVT+ST +G +M IA +GNG + Sbjct: 513 SDGQSQGG-----DFEGISSELAQMGVTISTVSLGQGAA-VDLMAAIAQIGNGRAYVTNN 566 Query: 379 LSEAQKVLNSEMRQMLITVAKDVKAQIEFNPAWVTEYRQIGYEKRQLRVEHFNNDNVDAG 438 E ++ E + ++ I+ P + Y L+ + + + G Sbjct: 567 AEEMPRIFTKE-------TMEASRSAIKEEPFAPIKIDDSDY----LQGINIDETPLLLG 615 Query: 439 DIGAGKHITLLFELTLNGQKASIDKLRYAPDNKLAKSDKTKEL 481 + + +L + RY +A + T +L Sbjct: 616 YVMTKVKASAQVQLLTETGDPLLASGRYGLGQSVAFTSDTTDL 658 >UniRef50_C7P2A9 von Willebrand factor type A n=2 Tax=Halobacteriaceae RepID=C7P2A9_HALMD Length = 393 Score = 176 bits (446), Expect = 2e-42, Method: Composition-based stats. Identities = 55/327 (16%), Positives = 127/327 (38%), Gaps = 12/327 (3%) Query: 181 AMRYELAPAPWNEQRTLLKVDILAKDRKSEELPASNLVFLIDTSGSMISDERLPLIQSSL 240 ++ + T + +I + + E ++ IDTSGSM D + + Sbjct: 3 SIETSVNRPNVPADGTTVTAEIDVEPGEQETDVRRHIALCIDTSGSMEGDN-IKRARDGA 61 Query: 241 KLLVKELREQDNIAIVTYAGDSRIALPSI--SGSHKAEINAAIDSLDAEGSTNGGAGLEL 298 + L ++D ++IV + ++ + LP+ S + ++ L A G T+ GL+ Sbjct: 62 AWVFGLLADEDYVSIVAFDTEATVILPATRWSDLDRQTAMDHVEELTAGGGTDMYNGLKA 121 Query: 299 AYQQATKGFI-KGGINRILLATDGDFNVGIDDPKSIESMVKKQRESGVTLSTFGVGNSNY 357 A + + + R+LL +DG N P E + + ++G+ + + G+G +Y Sbjct: 122 AKETLSSSATGPDTVKRLLLLSDGKDN--ERTPDEFEGLAEAIDDAGIRIQSAGIGT-DY 178 Query: 358 NEAMMVRIADVGNGNYSYIDTLSEAQKVLNSEMRQMLITVAKDVKAQIEFNPAWVTE--Y 415 NEA + + G G +++++ + + + Q VA D ++ P Y Sbjct: 179 NEATIRTLGTAGRGTWTHLEAPGDIEDFFGEAVEQAGSVVAPDAHLDLDVAPGVEVSEVY 238 Query: 416 RQIGYEKRQLRVEHFNNDNVDAGDIGAGKHITLLFELTLNGQKASIDKLRYAPDNKLAKS 475 R + + N V D+ + ++ ++ ++ +++ Sbjct: 239 RALPQAQEVSPEWEANATRVKLPDLIERESQRVVLKIHAPPREPGSEEVLADVQLSARGD 298 Query: 476 DKTKELAWLKIRWKYPQGKESQLVEFP 502 + ++ + + Q K ++ E Sbjct: 299 TASDQIG---VEYTDEQEKLAEHNESV 322 >UniRef50_A6G415 von Willebrand factor, type A n=1 Tax=Plesiocystis pacifica SIR-1 RepID=A6G415_9DELT Length = 877 Score = 176 bits (446), Expect = 2e-42, Method: Composition-based stats. Identities = 77/489 (15%), Positives = 152/489 (31%), Gaps = 51/489 (10%) Query: 24 ENKESQQQQPSTPTEQQVLAAQQAAIKEAEQSAAAAKALAQQEVQQ------YSDKQALQ 77 + + +++ Q+ A + + Q + Q + + Sbjct: 186 ADAREAGHTAALLEQERPNVFTQSVTNVAPGESVEVEVQYVQTLTQDGGNYEFVFPMVVG 245 Query: 78 GRLQEAPTFARAAKAKATHIANPGTARYQQFD------DNPVKQVAQNPLATFSLDVDTG 131 R T A A A + I GT Q +P T + Sbjct: 246 PRFSPPGTSAEAHAAVSPPIVGEGTRTGHDVSLSMTVAAGGKVQRWDSPTHTVVGSETSD 305 Query: 132 SYANVRRFLNQGLLPPPDAVRVEEIVNYFPSDWDIKDKQSIPASKPIPFAMRYELAPAPW 191 +A R +Q LP D V QS A +L+ + Sbjct: 306 GFAL--RLADQKTLPNRDFVVRW---------------QSTAAQAKATAYFGPQLSQSVA 348 Query: 192 NEQRTLLKVDILAKDRKSEELPA-SNLVFLIDTSGSMISDERLPLIQSSLKLLVKELREQ 250 Q + + + L ++F+ID SGSM S L L + +L+ + LR Sbjct: 349 GAQPGHFTLVVEPPQSDLDSLVGQREMIFVIDRSGSM-SGVPLALAKQTLREALSHLRPV 407 Query: 251 DNIAIVTYAGDSRIALPSISGSHKAE---INAAIDSLDAEGSTNGGAGLELAYQQATKGF 307 D ++++ + + + +++ ID L A G T ++ A + Sbjct: 408 DTFNVISFESSTAMLYEAAVPANEQNLVHAERFIDGLQAGGGTMMSGAVDAA---LSPEI 464 Query: 308 IKGGINRILLATDGDFNVGIDDPKSIESMVKKQRESG--VTLSTFGVGNSNYNEAMMVRI 365 G + TDG + + + ++V+ ++G + G+G+S N ++ + Sbjct: 465 GLGRHRYVFFVTDGFISNEDEIARQASALVRAADKAGQRARVFGMGIGSSP-NRELLASL 523 Query: 366 ADVGNGNYSYIDTLSEAQKVLNSEMRQMLITVAKDVKAQIEFNPAWVTEYRQIG----YE 421 + G G Y + ++ + + R + V D+ + P Y Sbjct: 524 SKAGKGRYLAVGNREHPREAVEAYTRMVDSAVLTDIHIDWDGLPVQAVFPSADTLPDLYA 583 Query: 422 KRQLRVEHFNNDNVDAGDIGAGKHITLLFELTLNGQKASIDKLRYAPDNKLAKSDKTKEL 481 +D++D + + + L A P A + + L Sbjct: 584 SHSTVWVGRYDDSLDPAQLAQAQPV-------LRATVAGTQTTVELPITVAASPEDDRTL 636 Query: 482 AWLKIRWKY 490 A L R + Sbjct: 637 ATLWAREQI 645 >UniRef50_D0KVI6 Vault protein inter-alpha-trypsin domain protein n=1 Tax=Halothiobacillus neapolitanus c2 RepID=D0KVI6_HALNC Length = 671 Score = 176 bits (446), Expect = 2e-42, Method: Composition-based stats. Identities = 78/515 (15%), Positives = 157/515 (30%), Gaps = 50/515 (9%) Query: 28 SQQQQPSTPTEQQVLAAQQAAIKEAEQSAAAAKALAQQEVQQYSDKQALQGRLQEAPTFA 87 + + + + + + + PT Sbjct: 123 YEVAKQAGKHAALLEQKRPNVFMMNVANIMPGDTVELVLQYSELLIPDDGVYQLVYPTVV 182 Query: 88 RAAKAKATHIANPGTARYQQFDDNPVKQVAQNPLATFSLDVDTGSYANVRRFLNQGLLPP 147 A P + ++ NP + + + T + + + L Sbjct: 183 GPRYGGDPIRATP----HNRWIANPYAKDNTDGSNP--AQIKTDIHVRIASPIPISDLRS 236 Query: 148 PDAVRVEEIVN--YFPSDWDIKDKQSIPASKPIPFA-----MRYELAPAPWNEQRTLLKV 200 V +N D + + + F + L WN + L + Sbjct: 237 AQHKIVTHWLNDKSAEISLDPSETHTGNRDFILSFRLQGAKINSGLMTYEWNGEHYFLMM 296 Query: 201 DILAKDRKSEELPASNLVFLIDTSGSMISDERLPLIQSSLKLLVKELREQDNIAIVTYAG 260 K E+ +F++D SGSM L ++ L+ L+ Q+ I+ ++G Sbjct: 297 AQPPKRVAPTEVMKREYLFVVDVSGSMYGF-PLNTASDLMRELLSSLKPQETFNILFFSG 355 Query: 261 DSRIALP---SISGSHKAEINAAIDSLDAEGSTNGGAGLELAYQQATKGFIKGGINRILL 317 SR+ P + + + S+ G T L+ A+ + I++ Sbjct: 356 GSRVLSPTPLQATPENLQRAMTMMRSIQGGGGTELLPALKTAFAM---PRTEDTARSIVV 412 Query: 318 ATDGDFNVGIDDPKSIESMVKKQRESGVTLSTFGVGNSNYNEAMMVRIADVGNGNYSYID 377 TDG +V + ++K+ S L FG+G+S N +M +A G G I Sbjct: 413 ITDGYVDV----ERQAYDLIKQNLNS-TNLFAFGIGSS-VNRYLMESMAHAGQGEPFIIT 466 Query: 378 TLSEAQKVLNSEMRQMLITVAKDVKAQ---IEFNPAWVTEYRQIGYEKRQLRVEHFNN-- 432 ++ V R + V +K + +E +E + E+ + + N Sbjct: 467 GPNDVPGVGARFRRYVDAPVLSHIKIRGNGVELYDTEPSEIPVMLAERPIVIFGKYRNAQ 526 Query: 433 --DNVDA-GDIGAGKHITLLF-----------ELTLNGQKASIDKLRYAPDNKLAKSDKT 478 ++ G G++ L + L + +L Y D + Sbjct: 527 PGATLELTGTRATGEYRATLSLDDSNGQADKNQAELLPVLWARQRLMYLSDLQGDDDAHR 586 Query: 479 KELAWLKIRWKYPQGKES-----QLVEFPLGPTIN 508 E+ L +R+ S + + P G T + Sbjct: 587 DEIIRLGLRYSLLTRYTSFVAVDETISNPNGNTTD 621 >UniRef50_P19823 Inter-alpha-trypsin inhibitor heavy chain H2 n=40 Tax=Euteleostomi RepID=ITIH2_HUMAN Length = 946 Score = 175 bits (444), Expect = 4e-42, Method: Composition-based stats. Identities = 48/269 (17%), Positives = 104/269 (38%), Gaps = 13/269 (4%) Query: 169 KQSIPASKPIPFAMRYELAPA-PWNEQRTLLKVDILAKDRKSEELPASNLVFLIDTSGSM 227 + + Y++ E + + + N++F+ID SGSM Sbjct: 262 PNCRETAVDGELVVLYDVKREEKAGELEVFNGYFVHFFAPDNLDPIPKNILFVIDVSGSM 321 Query: 228 ISDERLPLIQSSLKLLVKELREQDNIAIVTYAGDSRIA---LPSISGSHKAEINAAIDSL 284 ++ ++K ++ +LR +D+ +++ + + R L S + + A+ I+ + Sbjct: 322 WG-VKMKQTVEAMKTILDDLRAEDHFSVIDFNQNIRTWRNDLISATKTQVADAKRYIEKI 380 Query: 285 DAEGSTNGGAGLELAYQQATKG-----FIKGGINRILLATDGDFNVGIDDPKSIESMVKK 339 G TN L A + ++ I+L +DGD VG I+ VK+ Sbjct: 381 QPSGGTNINEALLRAIFILNEANNLGLLDPNSVSLIILVSDGDPTVGELKLSKIQKNVKE 440 Query: 340 QRESGVTLSTFGVGNSNYNEAMMVRIADVGNGNYSYIDTLSEAQKVLNSEMRQMLITVAK 399 + ++L + G+G + + + R+++ +G I + L Q+ + + Sbjct: 441 NIQDNISLFSLGMGF-DVDYDFLKRLSNENHGIAQRIYGNQDTSSQLKKFYNQVSTPLLR 499 Query: 400 DVKAQIEFNPAWVTEYRQIGYEKRQLRVE 428 +V Q + VT+ Q + E Sbjct: 500 NV--QFNYPHTSVTDVTQNNFHNYFGGSE 526 >UniRef50_B9GN58 Predicted protein (Fragment) n=3 Tax=rosids RepID=B9GN58_POPTR Length = 705 Score = 175 bits (443), Expect = 5e-42, Method: Composition-based stats. Identities = 73/450 (16%), Positives = 145/450 (32%), Gaps = 80/450 (17%) Query: 156 IVNYFPSDWDIKDKQSIPASKPIPFAMRYELAPAP--------WNEQRTLLKVDILAKDR 207 +FP+ K + F+ ++ P + + + + A Sbjct: 247 FQGFFPTHSTSVVKSDEVSINDRDFSRNVQVRLLPEVAVISVGRGYETYAVALRVKAPPP 306 Query: 208 -----------------KSEELPASNLVFLIDTSGSMISDERLPLIQSSLKLLVKELREQ 250 +L+ ++D S SM +L +++ +++L++ L Sbjct: 307 LPSLTTRNSSNSTASLLDPSRRAPIDLITVLDVSASMTG-AKLQMLKRAMRLVISSLGSA 365 Query: 251 DNIAIVTYAGDSRIALPS--ISGSHKAEINAAIDSLDAEGSTNGGAGLELAYQQATKGFI 308 D ++IV ++ + LP ++ + + ID L ++ G L A + Sbjct: 366 DRLSIVAFSSSPKRLLPLKRMTPNGQRSARRIIDRLVCGQGSSVGEALRKATKVLEDRRE 425 Query: 309 KGGINRILLATDGDFNVGIDDPKSIESMVKKQRESGVTLSTFGVGNSNYNEAMMVRIADV 368 + + I+L +DG IE + + +FG G S N Sbjct: 426 RNPVASIMLLSDGQDERSSTRFAHIE----------IPVHSFGFGQSGGNSQ-------- 467 Query: 369 GNGNYSYIDTLSEAQKVLNSEMRQMLITVAKDVKAQIEFNPAWVTEYRQIGYEKRQLRVE 428 A+ + +L V +D++ Q+ F + Y R Sbjct: 468 -----------EPAEDAFAKCVGGLLSVVVQDLRIQLGFASSSAPAEIVAVYPCNS-RPN 515 Query: 429 HFNNDNVDAGDIGAGKHITLLFELTLNGQKASIDKLRYAPDNKLAKSDKTKELAWLKIRW 488 + +V GD+ A + LL EL + ++++ L K T+E+ + Sbjct: 516 VLGSGSVRLGDLYAEEERELLVELRVP--QSAVGSHHVMSARCLYKDPATQEVVY----- 568 Query: 489 KYPQGKESQLVEFP-----LGPTINAPSEDMRFRAAVAAYGQKLRGSEYLNNTSW---QQ 540 +S LV P GP I S A+A + + +E+ + + Sbjct: 569 ---DRDQSLLVPRPHALPSTGPKIQHLSNLFITTRALAEARRLVEHNEFTSAHHLLVSSR 625 Query: 541 IKQWAQQAKGEDPQGYRAEFIRLIELADGV 570 D R E ELA+ + Sbjct: 626 ALILQSSLISADEYVRRLEA----ELAEQM 651 >UniRef50_A9FM70 Putative uncharacterized protein n=1 Tax=Sorangium cellulosum 'So ce 56' RepID=A9FM70_SORC5 Length = 507 Score = 174 bits (441), Expect = 7e-42, Method: Composition-based stats. Identities = 73/370 (19%), Positives = 147/370 (39%), Gaps = 13/370 (3%) Query: 188 PAPWNEQRTLLKVDILAKDRKSEELPASNLVFLIDTSGSMISDERLPLIQSSLKLLVKEL 247 P Q + A+ + + + +V L+D SGSM ++ +++ + V L Sbjct: 95 PGTRETQLGVWIDVPAARAARGQPRAPAAVVLLVDASGSMQGP-KMENARAAAQAFVDRL 153 Query: 248 REQDNIAIVTYAGDSRIALPSIS--GSHKAEINAAIDSLDAEGSTNGGAGLELAYQQATK 305 + D +++ ++A ++ + S + + AI +L +GSTN AGL+LA Q A Sbjct: 154 PDGDLVSVASFADTAQARVAPTVLGRSTRPAVARAIAALGPDGSTNLFAGLKLAEQHALA 213 Query: 306 GFIKGGINRILLATDGDFNVGIDDPKSIESMVKKQRESGVTLSTFGVGNSNYNEAMMVRI 365 + R++L +DG N+G P + ++ ++ GV +++ GVG ++Y+E + + Sbjct: 214 APSTHAVRRVVLISDGQANIGPSSPDILGALAQRGAAHGVQITSIGVG-ADYDERTLNAL 272 Query: 366 ADVGNGNYSYIDTLSEAQKVLNSEMRQMLITVAKDVKAQIEFNPAWVTEYRQIGYEKRQL 425 A +G ++ E VL E+ + T A +I P +R Sbjct: 273 AVGSSGRLYHLTEAREMSSVLERELALLQTTAATGAFVEIVPAPGVELLDVPNERTER-- 330 Query: 426 RVEHFNNDNVDAGDIGAGKHITLLFELTLNGQKASIDKLRYAPDN-KLAKSDKTKELAWL 484 + V G + G+H +L + A L + + A+S + + Sbjct: 331 ---SGDALRVLLGTMFGGQHREMLVRARVTAPAAGSHPLASVRLHFRDAESGNLPRVQEV 387 Query: 485 KIRW---KYPQGKESQLVEFPLGPTINAPSEDMRFRAAVAAYGQKLRGSEYLNNTSWQQI 541 R+ P + E + M+ AA +E + ++ Sbjct: 388 VARYDVTSDPALVAAHQNEKTQTIAAVMEAGMMQLDAAQEVSAGNFGAAEAKLAAAEDRL 447 Query: 542 KQWAQQAKGE 551 +Q A +A+ + Sbjct: 448 RQQAARARSD 457 >UniRef50_C1I2R0 von Willebrand factor n=1 Tax=Clostridium sp. 7_2_43FAA RepID=C1I2R0_9CLOT Length = 960 Score = 174 bits (441), Expect = 8e-42, Method: Composition-based stats. Identities = 62/296 (20%), Positives = 117/296 (39%), Gaps = 17/296 (5%) Query: 129 DTGSYANVRRFLNQGLLPPPDAVRVEEIVNYFPSDWDIKDKQSIPASKPIPFAMRYELAP 188 + S + + L + D V +++ N F + + K I F A Sbjct: 323 PSASPSTLNELLEYKSIVLND-VHRDDLSNGFMDNIEAYVKDYG--GGLITFGGEDSYAL 379 Query: 189 APWNEQRTLLKVDILAKDRKSEELPASNLVFLIDTSGSM----ISDERLPLIQSSLKLLV 244 + + + + R E+PA ++ +ID SGSM +L L + + + Sbjct: 380 GGYKDTSLEKVLPVYMDKRGKNEVPAISINLIIDKSGSMSAEGGGVSKLTLAKEAAMKAL 439 Query: 245 KELREQDNIAIVTYAGDSRIALPSISGSHKAEINAAIDSLDAEGSTNGGAGLELAYQQAT 304 + LRE D I+++ + +P K I I + G T+ LE Y Sbjct: 440 ENLREVDEISVIAFDDTYDEVVPLQKVGDKEAIKELISGIQIRGGTSIYPALEQGYNMQM 499 Query: 305 KGFIKGGINRILLATDGDFNVGIDDPKSIESMVKKQRESGVTLSTFGVGNSNYNEAMMVR 364 + K I +L TDG G+D+ ++++ ++ +TLST VG N ++ + Sbjct: 500 QSSAK--IKHTILLTDGQDGYGLDN---YATLLQNFIDNNITLSTVAVGEG-ANAGLLNQ 553 Query: 365 IADVGNGNYSYIDTLSEAQKVLNSEMRQMLITVAKDVKAQIEFNPAWVTEYRQIGY 420 +A +G G Y D ++ ++ E+ +L A EF P ++ + + Sbjct: 554 LASIGKGRSYYTDIYTDIPRIFAKEV--LLS--AGTYIINEEFTPKILSNHEILAG 605 Score = 62.9 bits (151), Expect = 3e-08, Method: Composition-based stats. Identities = 32/134 (23%), Positives = 55/134 (41%), Gaps = 12/134 (8%) Query: 197 LLKVDILAKDRKSEELPASNL--VFLIDTSGSMISDERLPLIQSSLKLLVKELREQDNIA 254 + + ILA + L N+ VFL+D S S E + + ++ + + Sbjct: 45 VFTLLILAFGNITINLKGRNISTVFLLDVSESASDFE--ESGKDFISTAIESMPRGNKAG 102 Query: 255 IVTYAGDSRIALPSISGSHKAEINAAIDSLDAEGSTNGGAGLELAYQQATKGFIKGGINR 314 +V + +S+I +K + +ID +TN +E A + GG R Sbjct: 103 VVLFGDNSKID----KVLNKKKEYKSIDEKPVVTATNIQEAVESALGLFER----GGSKR 154 Query: 315 ILLATDGDFNVGID 328 I+L TDG+ N G Sbjct: 155 IVLITDGEENQGDI 168 >UniRef50_B0CG18 von Willebrand factor type A domain protein, putative n=5 Tax=Cyanobacteria RepID=B0CG18_ACAM1 Length = 708 Score = 174 bits (441), Expect = 9e-42, Method: Composition-based stats. Identities = 81/471 (17%), Positives = 159/471 (33%), Gaps = 39/471 (8%) Query: 21 PQPENKESQQQQPSTPTEQQVLAAQQAAIKEAEQSAAAAKALAQQEVQQYSDKQALQGRL 80 Q + + + T + + ++ + + + S + Sbjct: 145 RQEAKQIYETAKQEGKTAALLEQERANLFTQSLANIVPGETIEVVIRYTNSLEFEGGDYE 204 Query: 81 QEAPTFARAAKAKATHI-ANPGTARYQQFD--DNPVKQVAQNPLATFSLDVDTGSYANVR 137 PT I A T R P+ +Q S+ V+ + +R Sbjct: 205 FVFPTVVGPRYIPGDQIDAAGNTTRVADAAKITPPLLPPSQRSGNDISITVNLDAGVPIR 264 Query: 138 RFLNQGLLPPPDAVRVEEIVNYFPSDWDIKDKQSIPASKPIPFAMRYELAPAPWNEQRTL 197 L P + + E+ + I +K I + + L Sbjct: 265 -NLRSPSHPILTSKKGEQTQVKLANQTTIPNKDLILRYQVASKQTQATLLTQSDQRGGHF 323 Query: 198 LKVDILAKDRKSEELPASNLVFLIDTSGSMISDERLPLIQSSLKLLVKELREQDNIAIVT 257 I A KS ++ ++VFLIDTSGS + + + + +L D +I+ Sbjct: 324 ATYLIPALKYKSNQIVPKDVVFLIDTSGSQSGP-PIVQSRKLMTQFLDKLNPNDTFSIIN 382 Query: 258 YAGDSRIALP---SISGSHKAEINAAIDSLDAEGSTNGGAGLELAYQQATKGFIKGGINR 314 ++ + P + + +++ + I LDA G T G+ A G + Sbjct: 383 FSNTTSKLSPKPLANTPANRKKALEYIKKLDANGGTELMNGINTV--AAFPPAPDGRLRS 440 Query: 315 ILLATDGDFNVGIDDPKSIESMVKKQRESGVTLSTFGVGNSNYNEAMMVRIADVGNGNYS 374 ++L TDG I D ++I + V+ + + G + FGVG S N ++ R+A+VG G Sbjct: 441 VVLLTDGL----IGDDETIIAAVRDRLKPGNRIYPFGVGFST-NRFLLDRLAEVGRGTVE 495 Query: 375 YIDTLSEAQKVLNSEMRQMLITVAKDVKAQIEFNPAWVTEYRQIGYEKRQLRVEHFNNDN 434 + A+KV ++ + V D++ P + Sbjct: 496 VVAPKDSAEKVAAKFVQTINKPVLTDIEVSW-VGPGKGPDI-----------------YP 537 Query: 435 VDAGDIGAGKHITLLFELTLNGQKASIDKLRYAPDNKLAKSDKTKELAWLK 485 + D+ A + L L+G+K + ++A ++ +K Sbjct: 538 LRVPDLFANQP------LVLHGRKQDSQSGKLKITGRIAGGKSYEQELDVK 582 >UniRef50_B9MS10 von Willebrand factor type A n=2 Tax=Anaerocellum thermophilum DSM 6725 RepID=B9MS10_ANATD Length = 1188 Score = 174 bits (440), Expect = 1e-41, Method: Composition-based stats. Identities = 48/186 (25%), Positives = 86/186 (46%), Gaps = 10/186 (5%) Query: 215 SNLVFLIDTSGSMISDERLPLIQSSLKLLVKELREQDNIAIVTYAGDSRIALPSISGSHK 274 +LVF++D+SGSM ++ + + K V L + D A+V + + P + Sbjct: 498 IDLVFVLDSSGSMSWNDPNGYRKIAAKSFVDALIQGDRAAVVDFDNFGYLLQPLTTDF-- 555 Query: 275 AEINAAIDSLDAEGSTNGGAGLELAYQQATKGFIKGGINRILLATDGDFNVGIDDPKSIE 334 + AID +D+ G TN G+ +A QQ + I I+L TDG+ G D Sbjct: 556 QAVKNAIDRIDSWGGTNIAEGIRIANQQLISRSSEDRIKVIILLTDGE---GYYD----N 608 Query: 335 SMVKKQRESGVTLSTFGVGNSNYNEAMMVRIADVGNGNYSYIDTLSEAQKVLNSEMRQML 394 ++ + + +G+T+ T G+G S +E ++ IA G Y + + S+ +V + Sbjct: 609 NLTTEAKNNGITIYTIGLGTS-VDENLLRDIATQTGGMYFPVSSASQLPQVFKRITEIVT 667 Query: 395 ITVAKD 400 + D Sbjct: 668 EPIDTD 673 >UniRef50_Q22SJ7 von Willebrand factor type A domain containing protein n=1 Tax=Tetrahymena thermophila SB210 RepID=Q22SJ7_TETTH Length = 642 Score = 174 bits (440), Expect = 1e-41, Method: Composition-based stats. Identities = 47/262 (17%), Positives = 106/262 (40%), Gaps = 11/262 (4%) Query: 209 SEELPASNLVFLIDTSGSMISDERLPLIQSSLKLLVKELREQDNIAIVTYAGDSRIALPS 268 P+ +LV +I+ S SM ++ L ++++L L++ L D +++V + Sbjct: 192 KNSRPSIDLVCVINNSESMHGEKILN-VKNTLLYLLEMLNSNDRLSLVLSNNNPTTLFDL 250 Query: 269 --ISGSHKAEINAAIDSLDAEGSTNGGAGLELAYQQATKGFIKGGINRILLATDGDFNVG 326 + +K ++ I+++ +TN + A+ + ++ I L +DG + Sbjct: 251 KYLDEKNKQDLKRIINNISITQNTNITKSMIKAFNILQFRQSQNKVSSIFLLSDGVDSSA 310 Query: 327 IDDPKSIESMVKKQRESGVTLSTFGVGNSNYNEAMMVRIADVGNGNYSYIDTLSEAQKVL 386 ++ S + + + +FG G + + M+ +I + NGN+ YI +++ + Sbjct: 311 EKQIQNYISSQQSLQNKNFAIHSFGYGF-DQDAEMINKICSLKNGNFYYIQNMNQVDQYF 369 Query: 387 NSEMRQMLITVAKDVKAQIEFNPAWVTEYRQIG-------YEKRQLRVEHFNNDNVDAGD 439 + L VA+D+ +I N + Y + ++ + Sbjct: 370 ADVLGGTLTAVAQDITIEISLNQQDKNFQKYFSNCRVSKTYGEAWKCIKKDEIYQIKINH 429 Query: 440 IGAGKHITLLFELTLNGQKASI 461 + G + ELT+ QK I Sbjct: 430 LRQGASQDFIMELTIPKQKVKI 451 >UniRef50_A3ZU57 Putative lipoprotein n=1 Tax=Blastopirellula marina DSM 3645 RepID=A3ZU57_9PLAN Length = 671 Score = 173 bits (439), Expect = 1e-41, Method: Composition-based stats. Identities = 94/591 (15%), Positives = 195/591 (32%), Gaps = 72/591 (12%) Query: 12 SSLILSGCGPQPENKES----QQQQPSTPTEQQVLAAQQAAIKEAEQSAAAAKALAQQEV 67 S+ +LS GP+P + E Q + P + + + A A+++ + A LA Sbjct: 112 SAGLLSWLGPEPADGEHLDLPQGESPDSSSVRSETVAWNASLQ--PRPAPLDPQLAHLAS 169 Query: 68 QQYSDKQALQGRLQEAPTFARAAKAKATH--IANPGTARYQQF---------------DD 110 +D A ++ P AR + P T + ++ ++ Sbjct: 170 DLPTDLMADAFLMRWKPLGARPLIDALPREILRAPATDQQAEYFSLVPGFDRQFLLREEE 229 Query: 111 NPVKQVAQNPLATFSLD-VDTGSYANVRRFLNQGLLPPPDAVRVEEIVNYFPSDWDIKDK 169 P + L + + V GS+ + G P +R+EE+ Sbjct: 230 QPFISLPNPQLNSITPPLVTAGSWREAASQM-SGESGVPQPIRLEEL--------GALGA 280 Query: 170 QSIPASKPIPFAMRYELAPAPW-NEQRTLLKVDILAKDRKSEELPASNLVFLIDTSGSMI 228 + A+R PA + N +L+V ++A + P ++L ID + M Sbjct: 281 ALFIETGRQEIALRTAAGPAVFANSSVQMLQVGMVAGSADIVDRPPTHLTIAIDLTSDMQ 340 Query: 229 SDERLPLIQSSLKLLVKELREQDNIAIVTYAGDSRIALPSISGSHKAEINAAIDSLDAEG 288 R ++ +L L+ + D +++V S + + + + L+ Sbjct: 341 RSGRWISVRQALGNLIDRMSPYDTVSLVCIDEFSHTLVEDATSTDSLAWRETLQQLEPND 400 Query: 289 STNGGAGLELAYQQATKGFIKGGINR-ILLATDGDFNVGIDDPKSIESMVKKQRESGVTL 347 S G+ + A + G I R +++ +D +G D + ++ ++ V Sbjct: 401 SDCLAEGIRFSSAVALRTSSFGDIRRPLIVISDRLDRLGDADRRELQPLIDDANSERVDY 460 Query: 348 STFGVGNSNYNEAMMVRIADVGN--GNYSYIDTLSEAQKVLNSEMRQMLITVAKDVKAQI 405 F + ++ I+ G+ D + L S + +A+D++ + Sbjct: 461 HWFAI--EAHDRY--EAISQFWRDHGSVFTGDHERSILRALESVVYGKSTRLAEDIRLTV 516 Query: 406 EFNPAWVTEYRQIGYEKRQLRVEHFNNDNVDAGDIGAGKHITLLFELTLNGQKASIDKLR 465 +NP V YR +G + + + LFEL L Sbjct: 517 HWNPKSVASYRLVGCHVEAAALGAGKAPL----QLHGDEAAATLFELVLTP--------- 563 Query: 466 YAPDNKLAKSDKTKELAWLKIRWKYPQGKESQLVEFPL-----GPTINAPSEDMRFRAAV 520 D E+A ++ W+ P + + + P+ A ++ Sbjct: 564 ----------DGPNEVARIEATWRDPATSKVKKEIQVVSRLQFAPSWEASPLSLQAAQLA 613 Query: 521 AAYGQKLRGSEYLNNTSWQQIKQWAQQAKGEDPQGYRAEFI---RLIELAD 568 G +R S ++ + A + EF +L+ + Sbjct: 614 VQAGGLIRESYFVRQRGGDAAELSALLGRANRQLAGHPEFAYLEQLVRATE 664 >UniRef50_UPI0001744662 Vault protein inter-alpha-trypsin domain protein n=1 Tax=Verrucomicrobium spinosum DSM 4136 RepID=UPI0001744662 Length = 679 Score = 173 bits (438), Expect = 2e-41, Method: Composition-based stats. Identities = 65/406 (16%), Positives = 142/406 (34%), Gaps = 26/406 (6%) Query: 24 ENKESQQQQPSTPTEQQVLAAQQAAIKEAEQSAAAAKALAQQEVQQYSDKQALQGRLQEA 83 ++ + + + + + + + + + + + Sbjct: 124 AKATYEKAKSQNKSASLLEEHRPNVFEMSVANILPGEEVKVSLHYSEKLLPSNRVYEFVF 183 Query: 84 PTFARAAKAKATHIANPGTARYQQFDDNPVKQVAQNPLATFSLDVDTGS---YANVRRFL 140 PT + + G + + +NP ATF+L+++ + ++ Sbjct: 184 PTVVGP-----RYQSGSGNGTGEAWMENPYVATGSVGAATFALNLELQAGMPLQSITSPS 238 Query: 141 NQGLL----PPPDAVRVEEIVNYFPSDWDIKDKQSIPASKPIPFAMRYELAPAPWNEQRT 196 +QGL+ + + ++ D+ ++ + S + +P E Sbjct: 239 HQGLMQFSGKSSASFTLSPSADHANHDFVLRYQLSGREVATGLLLHQAPAGSSPEAESFF 298 Query: 197 LLKVDILAKDRKSEELPASNLVFLIDTSGSMISDERLPLIQSSLKLLVKELREQDNIAIV 256 LL V AK ++ + P + +F++D SGSM + + + L+K L D I+ Sbjct: 299 LLNVQPPAKW-EAGQTPPRDYLFVLDVSGSMNGF-PIETSKRLMSDLLKGLNPGDTFNIL 356 Query: 257 TYAGDSRIALP---SISGSHKAEINAAIDSLDAEGSTNGGAGLELAYQQATKGFIKGGIN 313 +A DS + P + + + + G T L+ A + + Sbjct: 357 HFASDSAVLSPKPLAATPENIHLATKDLSRHRGNGGTELLPALQRALATPREVGVS---R 413 Query: 314 RILLATDGDFNVGIDDPKSIESMVKKQRESGVTLSTFGVGNSNYNEAMMVRIADVGNGNY 373 I++ TDG + K +V+K+ + TFG+G + N ++ +A G G+ Sbjct: 414 SIVILTDGYVTI----EKEAFRLVRKEL-QNANVFTFGIGTA-VNRWLIEGLAHAGQGDP 467 Query: 374 SYIDTLSEAQKVLNSEMRQMLITVAKDVKAQIEFNPAWVTEYRQIG 419 + + +A + V DV+ E A+ TE I Sbjct: 468 FVVLSEKDAAAAAERFREYISRPVLTDVQVTYEGFDAYETEPASIP 513 >UniRef50_Q0AV90 Putative uncharacterized protein n=1 Tax=Syntrophomonas wolfei subsp. wolfei str. Goettingen RepID=Q0AV90_SYNWW Length = 776 Score = 173 bits (437), Expect = 2e-41, Method: Composition-based stats. Identities = 72/524 (13%), Positives = 164/524 (31%), Gaps = 51/524 (9%) Query: 21 PQPENKESQQQQPSTPTEQQVLAAQQAAIKEAEQSAAAAKALAQQEVQQYSDKQALQGRL 80 KE Q S + + + + + + A + + K Sbjct: 79 RDQAFKEYDQAIRSGDSSILLESVRPNVFQVSLGQIDADEEVEIAISYFQEIKNIDTEMR 138 Query: 81 QEAPTFARAAKAKATH----IANPGTARYQQFDDNPVKQVAQNPL---ATFSLDVDTGSY 133 P I + D AT SL V + Sbjct: 139 ISIPMLLAPRFIPGKPLGKKIGPGRAEPTDRVPDADFISPPIGETGYRATLSLHVHNNTP 198 Query: 134 -ANVRRFLNQGLLPPPD--AVRVEEIVNYFPSDWDIKDKQSIPASKPIPFAMRYELAPAP 190 ++++ ++ + D + + N + D + + Sbjct: 199 ISSIKSPSHKIRIDRMDEYSATITLQENNTRMNRDFVLN------LKLDGETVPRIIYWK 252 Query: 191 WNEQRTLLKVDILAKDRKSEELPASNLVFLIDTSGSMISDERLPLIQSSLKLLVKELREQ 250 + + + E+ +FLID S SM +++ ++++ ++ L E Sbjct: 253 NPKDEYFACITYTPELPIIEQRQPKEYIFLIDISRSMEG-KKIEHAADAIQICLRNLDEG 311 Query: 251 DNIAIVTYAGDSRIALPSISGSHKAEINAA---IDSLDAEGSTNGGAGLELAYQQATKGF 307 D+ ++ + ++ P ++ ++ A + +L A G TN ++LA ++A Sbjct: 312 DSFNLLAFESENHAFAPKSLPYNQENLDKASAWVKNLHAMGGTNILPAVQLALKEAGDQQ 371 Query: 308 IKGGINRILLATDGDFNVGIDDPKSIESMVKKQRESGVTLSTFGVGNSNYNEAMMVRIAD 367 ++LATDG + + I + V+K R + L + G+ ++ N + +IA+ Sbjct: 372 -----KVVILATDGQ----VGNENEIINYVRK-RNQNLCLFSLGI-DTAVNSYFINQIAE 420 Query: 368 VGNGNYSYIDTLSEAQKVLNSEMRQMLITVAKDVKAQI---------EFNPAWVTEYRQI 418 GNG + ++ + ++ T +V + E P+ + + Sbjct: 421 AGNGCAEFSYPGESLEEKMLRHFARINATSMDNVTFSLPNISAYDWAETPPSRLYDMEPY 480 Query: 419 GYEKRQLRVEH----------FNNDNVDAGDIGAGKHITLLFELTLNGQKASIDKLRYAP 468 + R + I ++ +L +L + ++ Sbjct: 481 THLIRLAAPPQEELLITGDCCGQKMVLKVDQIIKIENAEILEKLWAKRKITQLETY-LQT 539 Query: 469 DNKLAKSDKTKELAWLKIRWKYPQGKESQLVEFPLGPTINAPSE 512 N S +E+ L R+ S + EF ++ E Sbjct: 540 GNPRRASGTKEEIVSLSERYHILSTLTSFIAEFERKDKLSGIPE 583 >UniRef50_A1ZFT4 Von Willebrand factor, type A n=1 Tax=Microscilla marina ATCC 23134 RepID=A1ZFT4_9SPHI Length = 827 Score = 172 bits (436), Expect = 3e-41, Method: Composition-based stats. Identities = 67/491 (13%), Positives = 149/491 (30%), Gaps = 42/491 (8%) Query: 24 ENKESQQQQPSTPTEQQVLAAQQAAIKEAEQSAAAAKALAQQEVQQYSDKQALQGRLQEA 83 + + + T + + + + + + Sbjct: 121 ARAQYEAAKKQGKTASLLEQHRPNVFQMNVANILPKDLIKVELHYTELLVPTDGVYEFSY 180 Query: 84 PTFARAAKAKATHIANPGTARYQQFDDNPVKQVAQNPLATFSLDVDTGSYANVRRFLNQG 143 PT + ++ NP + P TF ++ + +++ Sbjct: 181 PTVVGPRYSDTPAAKATAGE---KWVKNPYLKEGSKPNYTFDINTTINAGMPIQQMACTS 237 Query: 144 LLPPPDA----------VRVEEIVNYFPSDWDIKDKQSIPASKPIPFAMRYELAPA---- 189 + + E+ + S S + + E+A Sbjct: 238 HKVNVNYQDKSTGVIKLKKSEKFGGNRDYIVRYRLAGSKIQSGLLLYEGENEVASGKEED 297 Query: 190 PWNEQRTLLKVDILAKDRKSEELPASNLVFLIDTSGSMISDERLPLIQSSLKLLVKELRE 249 N ++ L + K K+ ++P VF++D SGSM L + + LK L+ +LR Sbjct: 298 NENAEKFFLMMMQPPKAPKNSQIPPREYVFIVDVSGSMHGF-PLSVSKRLLKNLIGKLRP 356 Query: 250 QDNIAIVTYAGDSRIALP---SISGSHKAEINAAIDSLDAEGSTNGGAGLELAYQQATKG 306 +D ++ + +++ P + ++ + ID G T L+ A Sbjct: 357 KDKFNVMLFESSNQMMSPESMEATQANIQKAFGVIDQQRGGGGTRLLPALKKALAF---K 413 Query: 307 FIKGGINRILLATDGDFNVGIDDPKSIESMVKKQRESGVTLSTFGVGNSNYNEAMMVRIA 366 K ++ TDG V K +++ L FG+G+S N ++ +A Sbjct: 414 QTKDYSRSFVVVTDGYVTV----EKEAFDLIRNNLNR-ANLFAFGIGSS-VNRFLIEGMA 467 Query: 367 DVGNGNYSYIDTLSEAQKVLNSEMRQMLITVAKDVKAQIEFNPAWVTEYRQIGYEKRQLR 426 G G + +EA + V ++K + + Q+ + Sbjct: 468 RAGMGEPFIVTHGTEADVKAEKFRNYIQNPVLTNIKIK--------YDGFQVYDTEPWAV 519 Query: 427 VEHFNNDNVDAGDIGAGKHITLLFELTLNGQKASIDKLRYAPDNKLAKSDKTKELAWLKI 486 + F + GK ++T+ G + A + + + +L Sbjct: 520 PDVFAERPIIVYGKYKGKPTG---KITVTGL-SGNKTYSKTIKVSSATQENNQAIRYLWA 575 Query: 487 RWKYPQGKESQ 497 R + + + Sbjct: 576 RERIKLHDDYR 586 >UniRef50_B4VXM6 Vault protein inter-alpha-trypsin n=1 Tax=Microcoleus chthonoplastes PCC 7420 RepID=B4VXM6_9CYAN Length = 928 Score = 172 bits (436), Expect = 3e-41, Method: Composition-based stats. Identities = 79/470 (16%), Positives = 148/470 (31%), Gaps = 55/470 (11%) Query: 21 PQPENKESQQQQPSTPTEQQVLAAQQAAIKEAEQSAAAAKALAQQEVQQYSDKQALQGRL 80 Q +Q + T + + ++ + + + S K Sbjct: 216 RQEAVAIYEQAKKQGRTAGLLEQERANIFTQSLANIQPGEQIDVIIRYTDSLKFTGGSYE 275 Query: 81 QEAPTFARAAKAKATHI--------ANPGTARYQQFDD---------NPVKQVAQNPLAT 123 P A T I + P + D P+ Sbjct: 276 FVFPMVVGARYIPGTTIDENTLGGGSAPAPMTLNKDTDLVPDASRLNAPILPPGTRSGHN 335 Query: 124 FSLDVDTGSYANVRRFLNQGLLPPPDAVRVEEIVNYFPSDWDIKDKQSIPASKPIPFAMR 183 ++ VD + ++ P +++E +D +R Sbjct: 336 INVTVDIEAGVEIKEV-----HSPSHQIQIERQDQGMRVTLSRRDTIP-----NKDLILR 385 Query: 184 YELAPAPWNE---------QRTLLKVDILAKDRKSEELPASNLVFLIDTSGSMISDERLP 234 Y++A I A + +L ++VFLIDTSGS S E L Sbjct: 386 YQVAGDRTQTTVLSQADTRGGHFAVYLIPAIEYNPHQLVPKDVVFLIDTSGS-QSGEPLN 444 Query: 235 LIQSSLKLLVKELREQDNIAIVTYAGDSRIALP---SISGSHKAEINAAIDSLDAEGSTN 291 Q ++ + L D I+ ++ +R P + + ++ I+ L+A G T Sbjct: 445 KCQELMRRFINGLNPHDTFTIIDFSDTTRQLSPVPLANTVQNRNSAMNYINQLNASGGTQ 504 Query: 292 GGAGLELAYQQATKGFIKGGINRILLATDGDFNVGIDDPKSIESMVKKQRESGVTLSTFG 351 G++ G + I+L TDG I + I + V++ + G L +FG Sbjct: 505 LRRGIQAVLNFPE--VDPGRLRSIVLLTDGY----IGNENQILAEVQRHLKLGNRLHSFG 558 Query: 352 VGNSNYNEAMMVRIADVGNGNYSYIDTLSEAQKVLNSEMRQMLITVAKDVKAQIE----- 406 G+S N ++ RIA++G G + ++V Q+ V +++ + Sbjct: 559 AGSS-VNRFLLNRIAEIGRGISRIVRYDEPTEEVAEQFFGQINNPVLTNIQLYWQGEGEP 617 Query: 407 --FNPAWVTEYRQIGYEKRQLRVEHFNNDNVDA-GDIGAGKHITLLFELT 453 PA + + ++ G I G+ FEL Sbjct: 618 PIIYPATPPDLFAEQPLVLFGCKKDALPGRLEVTGTIAGGQQYQQSFELN 667 >UniRef50_Q8YNZ7 Alr4412 protein n=6 Tax=Cyanobacteria RepID=Q8YNZ7_ANASP Length = 820 Score = 172 bits (435), Expect = 4e-41, Method: Composition-based stats. Identities = 79/457 (17%), Positives = 142/457 (31%), Gaps = 50/457 (10%) Query: 21 PQPENKESQQQQPSTPTEQQVLAAQQAAIKEAEQSAAAAKALAQQEVQQYSDKQALQGRL 80 Q + +Q + T + + ++ + + + S K Sbjct: 89 RQEAQQIYEQAKQQGRTAGLLEQERDNIFTQSLANIQPGEQIDVIIRYSESLKFTAGNYE 148 Query: 81 QEAPTFARAAKAKATHIANPGTARYQQFDDNPVKQVAQNPLATFSLDVDTGSYANVRRFL 140 P I G A P+ Q + V GS N Sbjct: 149 FVFPMVVAPRYIPGIPI--EGNASGVGSATAPMTQNQDTDI------VPDGSRLNAPILP 200 Query: 141 NQGLLPPPDAVRVE-----------EIVNYFPSDWDIKDKQSIPASKP----IPFAMRYE 185 + P V +E + + K A +RY+ Sbjct: 201 SGMRSPHDINVTIEIDAGVKVQNIQSPSHQVQISYAEKQVLVKLAGGDTIPNKDLILRYQ 260 Query: 186 LAPAPWNE---------QRTLLKVDILAKDRKSEELPASNLVFLIDTSGSMISDERLPLI 236 +A I A + +++ ++VFLIDTSGS + + Sbjct: 261 VAGESTQATVLSQADERGGHFALYLIPAIQYRQDQVVPKDVVFLIDTSGSQMGAPLM-QC 319 Query: 237 QSSLKLLVKELREQDNIAIVTYAGDSRIALP---SISGSHKAEINAAIDSLDAEGSTNGG 293 Q ++ + L D +IV ++ +R P + + ++ I+ L A G T Sbjct: 320 QELMRRFINGLNPDDTFSIVDFSDTTRQLSPVPLANNAQNRTRAINYINQLSANGGTEML 379 Query: 294 AGLELAYQQATKGFIKGGINRILLATDGDFNVGIDDPKSIESMVKKQRESGVTLSTFGVG 353 G+ G + I+L TDG I + I + V++ +SG L +FG G Sbjct: 380 RGIRAVLNF--PVTDPGRLRSIVLLTDGY----IGNENQILAEVQQHLKSGNRLYSFGAG 433 Query: 354 NSNYNEAMMVRIADVGNGNYSYIDTLSEAQKVLNSEMRQMLITVAKDVKAQIE------- 406 +S N ++ RIA++G G I ++++ RQ+ V ++ Q E Sbjct: 434 SS-VNRFLLNRIAELGRGIAQIIRHDEPTDEIVDKFYRQINNPVLANINLQWEGDGDSPI 492 Query: 407 FNPAWVTEYRQIGYEKRQLRVEHFNNDNVDAGDIGAG 443 P + + + I AG Sbjct: 493 IYPCNPPDLFAEQPLVLFGKKPDARGGKLHITGIVAG 529 >UniRef50_B9XLE8 Vault protein inter-alpha-trypsin domain protein n=1 Tax=bacterium Ellin514 RepID=B9XLE8_9BACT Length = 723 Score = 171 bits (434), Expect = 5e-41, Method: Composition-based stats. Identities = 62/471 (13%), Positives = 152/471 (32%), Gaps = 26/471 (5%) Query: 23 PENKESQQQQPSTPTEQQVLAAQQAAIKEAEQSAAAAKALAQQEVQQYSDKQALQGRLQE 82 + QQ + + + +++ + K + + Sbjct: 200 EAEQIYQQAKSQGYVASLLTEERPNIFRQSVANIEPGKQIDVNIRYFQTLAYVDGWYEFV 259 Query: 83 APTFARAAKAKAT------HIANPGTARYQQFDDNPVKQVAQNPLATFSLDVDTGSYANV 136 P + +A Q + + ++ +L +D + ++ Sbjct: 260 FPMVVGPRYNPSHITNGVGAVARNQGGTSGQSTEVQYLKPSERSGHEIALHLDVDAGVSI 319 Query: 137 RRFLNQGLLPPPDAVRVEEIVNYFPSDWDIKDKQSIPASKPIPFAMRYELAPAPWNEQRT 196 F + E++ I +K + + ++ Sbjct: 320 EEFSCVTHKISKTSTTPEQLSVDLSEGDRIPNKDFVLRYRIAGERIKSNFMVHRDERGGY 379 Query: 197 LLKVDILAKDRKSEELPASNLVFLIDTSGSMISDERLPLIQSSLKLLVKELREQDNIAIV 256 + K+ +VF++D SGSM E + +++++ +K+L+ D+ I+ Sbjct: 380 FTMMLYPPKELGQLGRAPMEMVFVLDCSGSMSG-EPIAQAKAAIRHALKQLQPGDSFQII 438 Query: 257 TYAGDSRIALPS---ISGSHKAEINAAIDSLDAEGSTNGGAGLELAYQQATKGFIKGGIN 313 ++ + + + + A +++L+++G T G++ A + Sbjct: 439 NFSEHASQLGAKPLEATPENIRKGLAYVEALNSDGPTEMIEGIKAALDF---PHDPERLR 495 Query: 314 RILLATDGDFNVGIDDPKSIESMVKKQRESGVTLSTFGVGNSNYNEAMMVRIADVGNGNY 373 + TDG I + I + V ++ + + +FGVG+ N ++ +A +G G Sbjct: 496 FVCFLTDGF----IGNEAEILAAVHERIGAS-RIFSFGVGS--CNRYLLDHLAKMGGGAV 548 Query: 374 SYIDTLSEAQKVLNSEMRQMLITVAKDVKAQI------EFNPAWVTEYRQIGYEKRQLRV 427 +++ KV++ ++ D+K E P + + R Sbjct: 549 AHLGLHDNGAKVMDDFFERVSHPAMTDIKVDWGNLQVSEVYPQQMPDLFVGRPVILTGRF 608 Query: 428 EHFNNDNVDAGDIGAGKHITLLFELTLNGQKASIDKLRYAPDNKLAKSDKT 478 N N+ + I L F +TL + +A +D+T Sbjct: 609 SGANTANIRVTGKAGVQPIELNFPVTLEDSAGNALPSVWARAKIAELADRT 659 >UniRef50_B8A860 Putative uncharacterized protein n=1 Tax=Oryza sativa Indica Group RepID=B8A860_ORYSI Length = 1128 Score = 171 bits (433), Expect = 7e-41, Method: Composition-based stats. Identities = 61/386 (15%), Positives = 125/386 (32%), Gaps = 36/386 (9%) Query: 169 KQSIPASKPIPFAMRYELAPAPWNEQRTLLKVDIL-AKDRKSEELPASNLVFLIDTSGSM 227 S AS + + + P E L V + A + +LV L+D S Sbjct: 570 PVSALASDKVQLSTFPRVDAIPRRECHPRLPVLVRVAVPATAARRAPVDLVTLLDISCGG 629 Query: 228 ISDER---LPLIQSSLKLLVKELREQDNIAIVTYAGDS--RIALPSISGSHKAEINAAID 282 L L++ ++ L++ L D +AIV + L +S + + + Sbjct: 630 GGGAPARRLDLLRKAMDLVIGNLGADDRLAIVPFHSSVVDATGLLEMSVEGRGVASRKVQ 689 Query: 283 SLDAEGSTNGGAGLELAYQQATKG---FIKGGINRILLATDGDFNVGIDDPKSIESMVKK 339 SL G T L A + + + ++L +DGD ++ ++ Sbjct: 690 SLAVAGGTKLFPALNAAVEILEARCWEAKRERVGAVVLISDGDD----------RTIFRE 739 Query: 340 QRESGVTLSTFGVGNSNYNEAMMVRIADVGNGNYSYIDTLSE-AQKVLNSEMRQMLITVA 398 + FG ++ + +AD +G Y +D + + +R++ VA Sbjct: 740 AINPRYPVHAFGF-RGAHDARAVHHVADHTSGVYGVLDDEHDRVTDAFAACVRRVTSVVA 798 Query: 399 KDVKAQIEFNPAWVTEYRQI--GYEKRQLRVEHFNNDNVDAGDIGAGKHITLLFELTLNG 456 D + + + + R E + + AG + AG L + ++ Sbjct: 799 VDAQVDLTCGAYSRASLLAVERSGDHRAHVDEDRRSGFIYAGALCAGDVKNFLVYVDVDR 858 Query: 457 QKASIDKLRYAPDNKLAKSDK---------TKELAWLKIRWKYPQGKESQLVEFPLGPTI 507 + + + +A ++ R K P E T+ Sbjct: 859 EADGGGVTELLTAHGTYMDAARRKETTVHLDERMAVVQRRDKVPDVSRDVAAELVRVDTV 918 Query: 508 NAPS---EDMRFRAAVAAYGQKLRGS 530 + + + + + AA +LR Sbjct: 919 KMVAVVLDRFKDKGSAAA-AMELREG 943 Score = 143 bits (361), Expect = 2e-32, Method: Composition-based stats. Identities = 65/363 (17%), Positives = 125/363 (34%), Gaps = 48/363 (13%) Query: 197 LLKVDILAKDRKSEELPASNLVFLIDTS--GSMISDERLPLIQSSLKLLVKELREQDNIA 254 L++V S E +LV ++D S G + R+ L++ ++ ++ +L E D +A Sbjct: 35 LVRVVAPPPAAASSERAPIDLVAVLDVSCCGGLGPVNRMDLLKKAMGFVIDKLGEHDRLA 94 Query: 255 IVTYAGDSRIA----LPSISGSHKAEINAAID-SLDAEGSTNGGAGLELAYQQATKGF-- 307 +V + IA L ++ + E + SL G L+ A Sbjct: 95 VVPVQASAAIAEKHDLVEMNAEGRKEATRMVQSSLTVTGENKLSTALKKAATILEGRKDH 154 Query: 308 IKGGINRILLATDGDFNVGIDDPKSIESMVKKQRESGVTLSTFGVGNSNYNEAMMVRIAD 367 K I+L +DGD S++ ++ FG +N M RIA+ Sbjct: 155 DKKRPGFIVLISDGDD----------ASVLNDAMNLNCSVHAFGF-RDAHNARAMHRIAN 203 Query: 368 VGNGNYSYIDTLSE-AQKVLNSEMRQMLITVAKDVKAQIEFNPAWVTEYRQIGYEKRQLR 426 G Y ++ + + + + VA D + + + A T + E + + Sbjct: 204 TSAGTYGILNDGHDGLADAFVTSVGNITSIVAVDAEVSVSCSGAESTAAKLTAIESGRFK 263 Query: 427 VE---HFNNDNVDAGDIGAGKHITLLFELTLNGQKASIDKLRYAPDNKLAKSDKTKELAW 483 + + AG + AG + L + D+L + P + Sbjct: 264 HDINGGGKRGTIQAGALQAGAVRSFLVYVD----NVGDDELEHLPS-----------MLT 308 Query: 484 LKIRWKYPQGKESQLVEFPLGPTINAPSEDMRFRAAVAAYGQKLRGSEYLNNTSWQQIKQ 543 + ++++ SQ A E R A V +R + + +I + Sbjct: 309 VGVQYEDRSTTTSQ-----NAAENQAGREMARRTAQVV----VVRDGDEHSRLVAAEIVR 359 Query: 544 WAQ 546 A Sbjct: 360 VAA 362 >UniRef50_B8F8Z6 von Willebrand factor type A n=1 Tax=Desulfatibacillum alkenivorans AK-01 RepID=B8F8Z6_DESAA Length = 480 Score = 171 bits (432), Expect = 9e-41, Method: Composition-based stats. Identities = 67/260 (25%), Positives = 117/260 (45%), Gaps = 12/260 (4%) Query: 206 DRKSEELPASNLVFLIDTSGSMISDERLPLIQSSLKLLVKELREQDNIAIVTYAGDSRIA 265 + + ++V ++D SGSM +++ ++++K LV+ LR QD ++VTY+ Sbjct: 84 APEQTKTKPVDMVIVLDRSGSM-GGQKVRDAKAAVKGLVEGLRSQDRFSLVTYSNSVNGG 142 Query: 266 --LPSISGSHKAEINAAIDSLDAEGSTNGGAGLELAYQQATKGFIKGGINRILLATDGDF 323 L ++ + +N +DS+ A G TN G GLE + +++L +DG Sbjct: 143 DGLHYLTADKRNSLNWMVDSIPAGGGTNLGGGLEKGVGVLRAYGAPDRMGKVILISDGQA 202 Query: 324 NVGIDDPKSIESMVKKQRESG--VTLSTFGVGNSNYNEAMMVRIADVGNGNYSYIDTLSE 381 N G+ DP + +M R+ G +++T G+G ++NE +M +AD G G Y Y++ + Sbjct: 203 NQGVTDPNQLAAMAA-LRDDGLVYSVTTVGIGQ-DFNEQLMATVADGGRGRYYYLENPGD 260 Query: 382 AQKVLNSEMRQMLITVAKDVKAQIEFNPAWVTEYRQIGYEKRQLRVEHFNNDNVDAGDIG 441 V E A + + P VT GY + N + G + Sbjct: 261 FLAVFQEEANWTRAVAASALSIHLPL-PKGVTAVSANGYP----VINKENGAFISPGALL 315 Query: 442 AGKHITLLFELTLNGQKASI 461 +G+ TL L N A I Sbjct: 316 SGQSRTLYIRLHANDDAAEI 335 Database: uniref50.fasta Posted date: Mar 8, 2010 10:38 AM Number of letters in database: 1,040,396,356 Number of sequences in database: 3,077,464 Lambda K H 0.305 0.125 0.295 Lambda K H 0.267 0.0386 0.140 Matrix: BLOSUM62 Gap Penalties: Existence: 11, Extension: 1 Number of Hits to DB: 2,599,307,714 Number of Sequences: 3077464 Number of extensions: 104060919 Number of successful extensions: 479719 Number of sequences better than 1.0e-01: 250 Number of HSP's better than 0.1 without gapping: 1605 Number of HSP's successfully gapped in prelim test: 6126 Number of HSP's that attempted gapping in prelim test: 454159 Number of HSP's gapped (non-prelim): 15326 length of query: 575 length of database: 1,040,396,356 effective HSP length: 134 effective length of query: 441 effective length of database: 628,016,180 effective search space: 276955135380 effective search space used: 276955135380 T: 11 A: 40 X1: 16 ( 7.0 bits) X2: 38 (14.6 bits) X3: 64 (24.7 bits) S1: 42 (21.5 bits) S2: 96 (41.7 bits)