BLASTP 2.2.22 [Sep-27-2009] Reference: Altschul, Stephen F., Thomas L. Madden, Alejandro A. Schaffer, Jinghui Zhang, Zheng Zhang, Webb Miller, and David J. Lipman (1997), "Gapped BLAST and PSI-BLAST: a new generation of protein database search programs", Nucleic Acids Res. 25:3389-3402. Reference for compositional score matrix adjustment: Altschul, Stephen F., John C. Wootton, E. Michael Gertz, Richa Agarwala, Aleksandr Morgulis, Alejandro A. Schaffer, and Yi-Kuo Yu (2005) "Protein database searches using compositionally adjusted substitution matrices", FEBS J. 272:5101-5109. Reference for composition-based statistics starting in round 2: Schaffer, Alejandro A., L. Aravind, Thomas L. Madden, Sergei Shavirin, John L. Spouge, Yuri I. Wolf, Eugene V. Koonin, and Stephen F. Altschul (2001), "Improving the accuracy of PSI-BLAST protein database searches with composition-based statistics and other refinements", Nucleic Acids Res. 29:2994-3005. Query= batch____ (483 letters) Database: uniref50.fasta 3,077,464 sequences; 1,040,396,356 total letters Searching..................................................done Results from round 1 Score E Sequences producing significant alignments: (bits) Value UniRef50_C6DJF9 Protein viaA n=166 Tax=Bacteria RepID=VIAA_PECCP 642 0.0 UniRef50_C4LBX1 von Willebrand factor type A n=1 Tax=Tolumonas a... 352 2e-95 UniRef50_A4SJ14 von Willebrand factor type A domain protein n=2 ... 315 2e-84 UniRef50_B8K8A8 von Willebrand factor, type A n=2 Tax=Vibrio Rep... 293 1e-77 UniRef50_Q5E077 Stimulator of RavA ATPase activity, VWA domain c... 291 3e-77 UniRef50_A6FIY2 Uncharacterized protein containing a von Willebr... 273 8e-72 UniRef50_C8SZ21 Protein viaA (VWA domain protein interacting wit... 267 8e-70 UniRef50_A6CWE2 Putative uncharacterized protein yieM n=3 Tax=Vi... 252 3e-65 UniRef50_D2U0I8 Putative uncharacterized protein n=1 Tax=Arsenop... 249 2e-64 UniRef50_A3US96 Putative uncharacterized protein (Fragment) n=1 ... 227 8e-58 UniRef50_Q1ZC32 Putative uncharacterized protein (Fragment) n=1 ... 205 4e-51 UniRef50_B3WV50 Protein ViaA n=5 Tax=Enterobacteriaceae RepID=B3... 165 4e-39 UniRef50_A2SS27 von Willebrand factor, type A n=1 Tax=Methanocor... 163 1e-38 UniRef50_C1Q9X6 Uncharacterized protein containing a von Willebr... 154 7e-36 UniRef50_Q6LJM7 Putative uncharacterized protein n=1 Tax=Photoba... 150 7e-35 UniRef50_C0QY03 von Willebrand factor type A (VWA) domain contai... 150 9e-35 UniRef50_C6M593 Putative uncharacterized protein n=1 Tax=Neisser... 149 3e-34 UniRef50_C1Q8W4 Uncharacterized protein containing a von Willebr... 148 6e-34 UniRef50_Q14PC7 Hypothetical two-component regulator system yiem... 141 6e-32 UniRef50_A6C7T1 Putative uncharacterized protein n=1 Tax=Plancto... 135 4e-30 UniRef50_A1SXM0 von Willebrand factor, type A n=2 Tax=Alteromona... 135 5e-30 UniRef50_A4Y9K4 von Willebrand factor, type A n=3 Tax=Shewanella... 133 2e-29 UniRef50_A6DQA4 Putative uncharacterized protein n=1 Tax=Lentisp... 123 2e-26 UniRef50_D1YZJ4 Putative uncharacterized protein n=1 Tax=Methano... 123 2e-26 UniRef50_C3XEK2 Putative uncharacterized protein n=1 Tax=Helicob... 121 6e-26 UniRef50_Q8EW10 Putative uncharacterized protein MYPE3970 n=1 Ta... 119 2e-25 UniRef50_Q0W1N2 Putative uncharacterized protein n=1 Tax=uncultu... 114 1e-23 UniRef50_B9KEB5 Putative uncharacterized protein n=1 Tax=Campylo... 112 4e-23 UniRef50_Q46D40 Putative uncharacterized protein n=3 Tax=Methano... 111 5e-23 UniRef50_Q466I6 Putative uncharacterized protein n=3 Tax=Methano... 111 5e-23 UniRef50_C3XJE4 Putative uncharacterized protein (Fragment) n=1 ... 111 7e-23 UniRef50_C4KA81 von Willebrand factor type A (VWA) domain-contai... 110 8e-23 UniRef50_Q5LDB9 Putative uncharacterized protein n=11 Tax=Bacter... 109 2e-22 UniRef50_A4S4M8 Predicted protein n=4 Tax=Mamiellales RepID=A4S4... 108 6e-22 UniRef50_C9KWG4 Putative uncharacterized protein n=1 Tax=Bactero... 103 1e-20 UniRef50_A6L4M8 Putative uncharacterized protein n=9 Tax=Bactero... 96 2e-18 UniRef50_D0Z403 Putative uncharacterized protein n=1 Tax=Photoba... 96 3e-18 UniRef50_A7V9J4 Putative uncharacterized protein n=7 Tax=Bactero... 91 1e-16 UniRef50_A8IX54 Predicted protein n=1 Tax=Chlamydomonas reinhard... 91 1e-16 UniRef50_D0LHL0 von Willebrand factor type A n=1 Tax=Haliangium ... 89 4e-16 UniRef50_C0ZE04 Putative uncharacterized protein n=1 Tax=Breviba... 88 8e-16 UniRef50_A0KNK1 Putative uncharacterized protein n=1 Tax=Aeromon... 84 9e-15 UniRef50_A2SLM6 Uncharacterized protein containing a von Willebr... 81 1e-13 UniRef50_B1L0Y8 von Willebrand factor type A domain protein n=10... 80 2e-13 UniRef50_A8IQV8 Predicted protein (Fragment) n=1 Tax=Chlamydomon... 79 3e-13 UniRef50_Q58221 Uncharacterized protein MJ0811 n=4 Tax=Methanoca... 77 1e-12 UniRef50_B7DQJ9 von Willebrand factor type A n=1 Tax=Alicyclobac... 75 8e-12 UniRef50_C6J3I5 Putative uncharacterized protein n=1 Tax=Paeniba... 71 1e-10 UniRef50_A8ZLC2 Putative uncharacterized protein n=3 Tax=Acaryoc... 70 2e-10 UniRef50_B8C4H1 Predicted protein n=1 Tax=Thalassiosira pseudona... 68 7e-10 UniRef50_C8SB00 von Willebrand factor type A n=1 Tax=Ferroglobus... 68 9e-10 UniRef50_D1XLN5 von Willebrand factor type A n=12 Tax=Actinomyce... 67 1e-09 UniRef50_Q2IEM5 VWA containing CoxE-like n=2 Tax=Anaeromyxobacte... 59 4e-07 UniRef50_Q9YD81 Putative uncharacterized protein n=1 Tax=Aeropyr... 57 2e-06 UniRef50_D1PED6 Putative uncharacterized protein n=1 Tax=Prevote... 57 2e-06 UniRef50_Q60384 Uncharacterized protein MJ0077 n=3 Tax=Methanoca... 56 2e-06 UniRef50_D0LUP3 Uncharacterized protein containing a von Willebr... 53 2e-05 UniRef50_C9RHJ4 von Willebrand factor type A n=1 Tax=Methanocald... 52 4e-05 UniRef50_A2BM85 Conserved archaeal protein n=1 Tax=Hyperthermus ... 50 2e-04 UniRef50_A7VY69 Putative uncharacterized protein n=1 Tax=Clostri... 50 2e-04 UniRef50_A7KV72 Putative metalloprotein chaperonin subunit n=1 T... 49 3e-04 UniRef50_D2RGP5 von Willebrand factor type A n=1 Tax=Archaeoglob... 49 5e-04 UniRef50_A3DPE5 von Willebrand factor, type A n=2 Tax=Desulfuroc... 47 0.001 UniRef50_C3NM85 von Willebrand factor type A n=14 Tax=Sulfolobac... 45 0.007 UniRef50_Q2RZG0 Putative uncharacterized protein n=5 Tax=Bactero... 43 0.029 >UniRef50_C6DJF9 Protein viaA n=166 Tax=Bacteria RepID=VIAA_PECCP Length = 492 Score = 642 bits (1655), Expect = 0.0, Method: Compositional matrix adjust. Identities = 307/487 (63%), Positives = 389/487 (79%), Gaps = 5/487 (1%) Query: 1 MLTLDTLNVMLAVSEEGLIEEMIIALLASPQLAVFFEKFPRLKAAITDDVPRWREALRSR 60 M+TL++L ++L++ E L++++++ LLA+PQLA FFEK+P LK+A+ +D+P W+E L+ R Sbjct: 1 MITLESLEMLLSIDENELLDDLVVTLLATPQLAFFFEKYPSLKSALLNDLPHWKETLKQR 60 Query: 61 LKDARVPPELTEEVMCYQQSQLLSTPQFIVQLPQILDLLHRLNSPWAEQARQLVDA---- 116 L+ +VPP+L +E CYQ+SQ + F +LP I+D L + SP+ QA QL+ A Sbjct: 61 LRTTQVPPDLEKEFSCYQRSQSIDNQAFQTRLPAIMDTLSNVESPFLTQASQLITAPERT 120 Query: 117 -NSTITSALHTLFLQRWRLSLIVQATTLNQQLLEEEREQLLSEVQERMTLSGQLEPILAD 175 +TS LH LFLQRWRLSL +Q +L+QQL+E+ERE LL E+Q+R+TLSG+LEPILA+ Sbjct: 121 LGQKVTSGLHALFLQRWRLSLTLQTVSLHQQLMEQEREILLDELQQRLTLSGKLEPILAE 180 Query: 176 NNTAAGRLWDMSAGQLKRGDYQLIVKYGEFLNEQPELKRLAEQLGRSREAKSIPRNDAQM 235 N AAGRLWD+SA Q + D + ++ +G FL QP L++LAE+LGRSRE KSI +A Sbjct: 181 NENAAGRLWDLSAAQRIQTDPRPLLDFGAFLQRQPALQKLAERLGRSRETKSILTQEAPK 240 Query: 236 ETFRTMVREPATVPEQVDGLQQSDDILRLLPPELATLGITELEYEFYRRLVEKQLLTYRL 295 E FR VREPATVPEQV G+ QSDDILRL+P EL TLGI+ELEYEFYRRL+E +LLTYRL Sbjct: 241 EAFRVSVREPATVPEQVSGVHQSDDILRLMPTELVTLGISELEYEFYRRLLEHRLLTYRL 300 Query: 296 HGESWREKVIERPVVHKDYDEQPRGPFIVCVDTSGSMGGFNEQCAKAFCLALMRIALAEN 355 GESWREK+ ERPVVH+ ++QPRGPFIVCVDTSGSMGGFNE+CAKAFCLALMRIALA+N Sbjct: 301 QGESWREKITERPVVHQQNEQQPRGPFIVCVDTSGSMGGFNERCAKAFCLALMRIALADN 360 Query: 356 RRCYIMLFSTEIVRYELSGPQGIEQAIRFLSQQFRGGTDLASCFRAIMERLQSREWFDAD 415 RRCYIMLFST +V+YEL+ G+EQAIRFLSQ FRGGTD+++C A+++++ W DAD Sbjct: 361 RRCYIMLFSTGVVKYELTSADGLEQAIRFLSQSFRGGTDMSACLSALLDKMDDALWHDAD 420 Query: 416 AVVISDFIAQRLPDDVTSKVKELQRVHQHRFHAVAMSAHGKPGIMRIFDHIWRFDTGMRS 475 AVVISDFIAQRLPD+V +KVK Q QHRFHAVAMS HGKPGIM IFDHIWRFDTG++S Sbjct: 421 AVVISDFIAQRLPDEVVNKVKSRQTQLQHRFHAVAMSDHGKPGIMHIFDHIWRFDTGLKS 480 Query: 476 RLLRRWR 482 RL+RRW+ Sbjct: 481 RLMRRWQ 487 >UniRef50_C4LBX1 von Willebrand factor type A n=1 Tax=Tolumonas auensis DSM 9187 RepID=C4LBX1_TOLAT Length = 479 Score = 352 bits (903), Expect = 2e-95, Method: Compositional matrix adjust. Identities = 185/487 (37%), Positives = 307/487 (63%), Gaps = 13/487 (2%) Query: 1 MLTLDTLNVMLAVSEEGLIEEMIIALLASPQLAVFFEKFPRLKAAITDDVPRWREALRSR 60 M+ L TL+++L+++E ++++++ +++SPQ++ F + P + + V +W +++ ++ Sbjct: 1 MVDLQTLSLLLSINETQMVQDLVSTVMSSPQVSQFMHEHPLFFKNVQEHVQQWSQSIPAQ 60 Query: 61 LKDARVPPELTEEVMCYQQSQLLSTPQFIVQLPQILDLLHRL-NSPWAEQARQLVDANST 119 LK+ VP +L +E + Y +Q LS QF Q DLL +L +S + A+ L+ S Sbjct: 61 LKNIPVPDDLQQEYILYLDAQGLSAEQFT---QQSADLLVQLQHSDFHTDAQNLLLTLSQ 117 Query: 120 ITSALHT---LFLQRWRLSLIVQATTLNQQLLEEEREQLLSEVQERMTLSGQLEPILADN 176 + H LF+Q+WR L+ Q +L E+ERE++L +++ RM ++G+L+ LA Sbjct: 118 ANA--HNRKQLFIQKWREHLVSQVLSLEIIFAEQERERMLQQLELRMQVAGELDETLAPQ 175 Query: 177 NTAAGRLWDMSAGQLKRGDYQLIVKYGEFLNEQPELKRLAEQLGRS-REAKSIPRNDAQM 235 + G+LWD++A L +G+ L Y FL PELK++A+ LGR+ + S ++ Sbjct: 176 H--PGKLWDLTATHLLQGNSSLFRHYASFLTHNPELKKIADALGRAATQDSSAEEQINRV 233 Query: 236 ETFRTMVREPATVPEQVDGLQQSDDILRLLPPELATLGITELEYEFYRRLVEKQLLTYRL 295 ET + VP+ + G+ QS+++ RL+ E L ELE FY++L E++LL Y+ Sbjct: 234 ETAEWQTVQHEQVPDDLVGIHQSNELNRLISSETVLLTEPELETVFYKQLAERRLLNYQF 293 Query: 296 HGESWREKVIERPVVHKDYDEQPRGPFIVCVDTSGSMGGFNEQCAKAFCLALMRIALAEN 355 G+S + + + +GPFIVC+DTSGSM G+ E CAK FC AL++IAL+EN Sbjct: 294 MGQSRSLETVMSEQRTFGETQDTKGPFIVCIDTSGSMSGYPEDCAKGFCFALLQIALSEN 353 Query: 356 RRCYIMLFSTEIVRYELSGPQGIEQAIRFLSQQFRGGTDLASCFRAIMERLQSREWFDAD 415 R C IMLFST++V YEL+GP+G+++A+ FL F+GGTDL C + +M +Q + +AD Sbjct: 354 RACVIMLFSTDVVTYELTGPEGLQEALNFLGCSFKGGTDLEPCMQQVMHYMQQARFSNAD 413 Query: 416 AVVISDFIAQRLPDDVTSKVKELQRVHQHRFHAVAMSAHGKPGIMRIFDHIWRFDTGMRS 475 AVV+SDFIAQRL + + ++++R + +RF+AV++S HGKP +M+IFD++W+FDT + Sbjct: 414 AVVLSDFIAQRLSVETEQQAQQIKR-NGNRFNAVSLSRHGKPALMKIFDNVWKFDTSLSG 472 Query: 476 RLLRRWR 482 R+LR+ R Sbjct: 473 RVLRKVR 479 >UniRef50_A4SJ14 von Willebrand factor type A domain protein n=2 Tax=Aeromonas RepID=A4SJ14_AERS4 Length = 484 Score = 315 bits (808), Expect = 2e-84, Method: Compositional matrix adjust. Identities = 184/476 (38%), Positives = 285/476 (59%), Gaps = 12/476 (2%) Query: 1 MLTLDTLNVMLAVSEEGLIEEMIIALLASPQLAVFFEKFPRLKAAITDDVPRWREALRSR 60 M+ + T++ +LA+SE ++ EM++ALLAS Q++ F ++ + RWR + Sbjct: 1 MIEIGTMSALLAISEGEMVSEMVVALLASTQISRFIRIGKAQGRSLKQRLQRWRHQVNDT 60 Query: 61 LKDARVPPELTEEVMCYQQSQLLSTPQFIVQLPQILDLLHRLNSPWAEQARQLVDANSTI 120 + VPP L +E + YQ LS + + +LP +L L R S +A++ RQL A+ + Sbjct: 61 IAHTPVPPVLEQEFLLYQHFISLSLARLVAELPTLLSALER-GSDFADEGRQL--AHQLV 117 Query: 121 ---TSALHTLFLQRWRLSLIVQATTLNQQLLEEEREQLLSEVQERMTLSGQLEPILADNN 177 T L L++WR SL+ L Q+L E ER +L E++E++ S +LE +L Sbjct: 118 DHPTEGARRLMLEKWRASLVGALLRLQQELAEAERLRLQQELEEQIGASEELEQVLDPQR 177 Query: 178 TAAGRLWDMSAGQLKRGDYQLIVKYGEFLNEQPELKRLAEQLGRSREAKSIPRNDAQMET 237 AG LW+++ G+ + LI +Y L ++P L+ +A+ +GRS Sbjct: 178 RTAGGLWNLAQGRWQPASLVLIRQYAAMLRKEPMLQEIADSMGRSLHDSEQ--LQRPQPP 235 Query: 238 FRTMVREPA---TVPEQVDGLQQSDDILRLLPPELATLGITELEYEFYRRLVEKQLLTYR 294 T+++EP VP+ + G+ ++D++R+LP E LG+ ELE EFYRR +E++LL+Y+ Sbjct: 236 QPTLIQEPVLSDDVPDDLVGIHPANDLMRMLPSEAVMLGVPELELEFYRRYLERRLLSYQ 295 Query: 295 LHGESWREKVIERPVVHKDYDEQPRGPFIVCVDTSGSMGGFNEQCAKAFCLALMRIALAE 354 G R +++ R D + QP GP IVC+DTSGSMGG+ EQCAKA LAL+++AL E Sbjct: 296 ARGTLPRHQLLPRTTDRGDQELQPMGPVIVCIDTSGSMGGYPEQCAKALALALLQLALTE 355 Query: 355 NRRCYIMLFSTEIVRYELSGPQGIEQAIRFLSQQFRGGTDLASCFRAIMERLQSREWFDA 414 RRC++MLFST++ +EL+ G+++A RFL+ F GGTDL C A +++LQ+ + A Sbjct: 356 QRRCFVMLFSTDVATFELTDANGLDEAQRFLAMTFNGGTDLLPCLSATLQQLQAPGFELA 415 Query: 415 DAVVISDFIAQRLPDDVTSKVKELQRVHQHRFHAVAMSAHGKPGIMRIFDHIWRFD 470 D +VISDFIAQRLP + ++ + QR RF+AVAMS H KP ++R+FD W D Sbjct: 416 DVLVISDFIAQRLPASLV-ELMDRQRGRGTRFNAVAMSRHAKPALLRVFDKSWLLD 470 >UniRef50_B8K8A8 von Willebrand factor, type A n=2 Tax=Vibrio RepID=B8K8A8_VIBPA Length = 481 Score = 293 bits (749), Expect = 1e-77, Method: Compositional matrix adjust. Identities = 159/487 (32%), Positives = 278/487 (57%), Gaps = 12/487 (2%) Query: 1 MLTLDTLNVMLAVSEEGLIEEMIIALLASPQLAVFFEKFPRLKAAITDDVPRWREALRSR 60 ML D LN+ L +++ G+I+ + L+A Q+ E +K+++ + + +WR +++ R Sbjct: 1 MLGADGLNLALMIADSGIIDTAVNDLMARSQMMAVAEN-RGVKSSVKNHLLKWRGSVKKR 59 Query: 61 LKDARVPPELTEEVMCYQQSQLLSTPQFIVQLPQILDLLHRLNSPWAEQARQLVDANSTI 120 + +E+ YQ+ QF Q+P+++ L +S + QAR+L++ N + Sbjct: 60 ITKVCETERFQQELALYQEVIYWDEAQFFEQIPEVIKKL-EWHSAFYLQARRLMEKNKGV 118 Query: 121 TSALH-TLFLQRWRLSLIVQATTLNQQLLEEEREQLLSEVQERMTLSGQLEPILADNNTA 179 + + F +W SL LE +E++L ++ +RM ++ + + + Sbjct: 119 NNPMFPHYFCDQWYESLSDAIRQAQLTELEANKEKVLKDLYQRMETMKNMDKVTEEGDEG 178 Query: 180 A-GRLWDMSAGQLKRGDYQLIVKYGEFLNEQPELKRLAEQLGRSREAKSIPR-NDAQMET 237 + GRLWDM++ +L + D ++ ++ EFL + L+ +AE+LGR P N A +E Sbjct: 179 SVGRLWDMASARLSKTDLTVMKRHAEFLKKNQGLQEIAEKLGRMASQVDDPDLNKAPLEE 238 Query: 238 FRTMVREPATVPEQVDGLQQSDDILRLLPPELATLGITELEYEFYRRLVEKQLLTYRLHG 297 + + + + + G+ + DD+ +LLP E L ELE FY+ LV+K+L+ Y++ G Sbjct: 239 PQIVEEKSDKATDDIVGIHEGDDLNKLLPNETMFLAYPELEVVFYKHLVDKRLMNYKMQG 298 Query: 298 ESWREKVIERPVVHKDYDEQ---PRGPFIVCVDTSGSMGGFNEQCAKAFCLALMRIALAE 354 +S + + + K + Q +GPFI+CVD SGSM GF EQCAKA ALM+IALAE Sbjct: 299 KS---RTLRKVRAQKPDNAQVDVEKGPFIICVDASGSMSGFPEQCAKAMAYALMQIALAE 355 Query: 355 NRRCYIMLFSTEIVRYELSGPQGIEQAIRFLSQQFRGGTDLASCFRAIMERLQSREWFDA 414 +R CY++LFS+E + YEL+ G+ +A FL+ F GGTDL ++ + ++ +A Sbjct: 356 DRDCYVILFSSEQITYELTKQDGLREASDFLTYSFHGGTDLEPVLMKSIDLMTGDKYRNA 415 Query: 415 DAVVISDFIAQRLPDDVTSKVKELQRVHQHRFHAVAMSAHGKPGIMRIFDHIWRFDTGMR 474 D VVISDFIA + +++ +KV EL+ H++RFHA+++S +G P +M +FDH W + + Sbjct: 416 DMVVISDFIAPKQSEEMIAKVDELKE-HKNRFHAISLSKYGNPELMTMFDHCWSYHPNLM 474 Query: 475 SRLLRRW 481 R++++W Sbjct: 475 GRIMKKW 481 >UniRef50_Q5E077 Stimulator of RavA ATPase activity, VWA domain containing n=61 Tax=Vibrionales RepID=Q5E077_VIBF1 Length = 482 Score = 291 bits (746), Expect = 3e-77, Method: Compositional matrix adjust. Identities = 158/480 (32%), Positives = 270/480 (56%), Gaps = 5/480 (1%) Query: 1 MLTLDTLNVMLAVSEEGLIEEMIIALLASPQLAVFFEKFPRLKAAITDDVPRWREALRSR 60 ML D LN+++ V+E G+I+ I +L+ PQ + P +K I + + +WR ++ + Sbjct: 1 MLGADALNLVMMVAESGMIDSSIAEILSRPQFLTAAKSNPNIKPTIKNHILKWRGKVKHK 60 Query: 61 LKDARVPPELTEEVMCYQQSQLLSTPQFIVQLPQILDLLHRLNSPWAEQARQLVDANSTI 120 + + +E+ YQ S +F ++ +I+ L + +S + +A+QL + N + Sbjct: 61 MTKVCETERIQDELALYQDVIHWSENEFYQRIDEIISKL-KWHSAFYVEAKQLANDNKGL 119 Query: 121 TSALHT-LFLQRWRLSLIVQATTLNQQLLEEEREQLLSEVQERMTLSGQLEPILADNNTA 179 + + F +RW SL L+E++E+LL+++ +R+ +E + A+ + A Sbjct: 120 MNPMFPRFFCERWYQSLSDAIKKAQLSELKEDKEKLLADLYQRIETLKTMESVTAEGDEA 179 Query: 180 -AGRLWDMSAGQLKRGDYQLIVKYGEFLNEQPELKRLAEQLGR-SREAKSIPRNDAQMET 237 G+LWDM++ +L + + ++ + FL + L+ +A +LGR + EA+ ++ A E Sbjct: 180 QIGKLWDMASAKLTKSNVDIMKLHARFLKKNKGLQDIASKLGRMANEAEHSDKSQAMAEE 239 Query: 238 FRTMVREPATVPEQVDGLQQSDDILRLLPPELATLGITELEYEFYRRLVEKQLLTYRLHG 297 + + + V + + G+ + DD+ RLLP E L ELE FY+ L++K+L+ YR+ G Sbjct: 240 VKVVEEKSDFVTDDIVGVHEGDDLSRLLPNETLFLSHPELEVIFYQHLIDKRLMNYRMQG 299 Query: 298 ESWREKVIERPVVHKDYDEQPRGPFIVCVDTSGSMGGFNEQCAKAFCLALMRIALAENRR 357 + + + +GPF+VCVD SGSM GF EQCAKA LM+IALAE R Sbjct: 300 ADRKLRKVTTQSRAASNALIEKGPFVVCVDASGSMSGFPEQCAKALAYGLMQIALAEERD 359 Query: 358 CYIMLFSTEIVRYELSGPQGIEQAIRFLSQQFRGGTDLASCFRAIMERLQSREWFDADAV 417 CY++LFST+ + YELS G+++ FLS +F GGTDL ++ + ++ +AD V Sbjct: 360 CYVILFSTQQITYELSKQDGLKEVADFLSYKFHGGTDLEPVLEKSIQLMHGDKYKNADLV 419 Query: 418 VISDFIAQRLPDDVTSKVKELQRVHQHRFHAVAMSAHGKPGIMRIFDHIWRFDTGMRSRL 477 V+SDFIA + + + V +L++ HQ+RFHAV +S +G P +M +FDH W + M RL Sbjct: 420 VLSDFIAPTHSEKIDAMVGDLKK-HQNRFHAVCLSKYGNPALMAMFDHTWAYHPSMLGRL 478 >UniRef50_A6FIY2 Uncharacterized protein containing a von Willebrand factor type A(VWA) domain n=1 Tax=Moritella sp. PE36 RepID=A6FIY2_9GAMM Length = 469 Score = 273 bits (699), Expect = 8e-72, Method: Compositional matrix adjust. Identities = 154/441 (34%), Positives = 250/441 (56%), Gaps = 12/441 (2%) Query: 42 LKAAITDDVPRWREALRSRLKDARVPPELTEEVMCYQQSQLLSTPQFIVQLPQILDLLHR 101 +K A DD W++ + S L D +P L+ E+ + ++LLS F ++ IL + + Sbjct: 32 IKLAYIDD---WKQQIMSLLADMPLPAGLSNEIHLCETARLLSPSNFRNKVEGILSKI-K 87 Query: 102 LNSPWAEQARQLVDANSTI-TSALHTLFLQRWRLSLIVQATTLNQQLLEEEREQLLSEVQ 160 S + + N ++ + +FL W+ ++ + +L+EE+REQLL E+ Sbjct: 88 AESAFYNTGLTIYQQNRSMPDNVFFAVFLDSWQQAIELLLYQEQSRLIEEKREQLLIELA 147 Query: 161 ERMTLSGQLEPILADNNTAAG-RLWDMSAGQLKRGDYQLIVKYGEFLNEQPELKRLAEQL 219 ER QLE +L + G RLWD++ G+L D +L+ +Y L + ++K++A +L Sbjct: 148 EREETIEQLEDVLDSDLLCNGERLWDLAKGKLTHLDTKLLQRYAVNLRKNKDVKKIASEL 207 Query: 220 GRSREAKSIPRNDAQMETFRTMVREPA---TVPEQVDGLQQSDDILRLLPPELATLGITE 276 GR A P ++ T V + + VP+ + G+ SD+I R+L E L E Sbjct: 208 GRMALAHINPEETPN--SYETWVLDNSYQDNVPDDMQGVTYSDEISRMLQTEAVNLTFPE 265 Query: 277 LEYEFYRRLVEKQLLTYRLHGESWREKVIERPVVHKDYDEQPRGPFIVCVDTSGSMGGFN 336 LE FY+R +E+ LLTY+ G + K + + D DEQ GPFI+CVD+S SM GF Sbjct: 266 LEIIFYKRYIERHLLTYQYQGALQQYKKVTQYRDITDADEQTGGPFIICVDSSTSMHGFP 325 Query: 337 EQCAKAFCLALMRIALAENRRCYIMLFSTEIVRYELSGPQGIEQAIRFLSQQFRGGTDLA 396 E AK+ C AL++IA + R+CY+M+FS E++ + ++ + + FLS FRGGTDL Sbjct: 326 ELTAKSICYALLQIAFEQRRQCYLMMFSNEVITFPVTQSTSLSTMLTFLSSSFRGGTDLQ 385 Query: 397 SCFRAIMERLQSREWFDADAVVISDFIAQRLPDDVTSKVKELQRVHQHRFHAVAMSAHGK 456 +E + S ++ +AD +VISDFIAQ+LP V KV+ + + ++R+HA+++S+ G Sbjct: 386 PVIEKSLELMSSAQYKNADTIVISDFIAQKLPTHVADKVRAI-KAQKNRYHAISLSSQGN 444 Query: 457 PGIMRIFDHIWRFDTGMRSRL 477 P +M+IFDH+WR+ G+ RL Sbjct: 445 PELMKIFDHVWRYSAGLTGRL 465 >UniRef50_C8SZ21 Protein viaA (VWA domain protein interacting with AAA ATPase) n=7 Tax=Enterobacteriaceae RepID=C8SZ21_KLEPR Length = 184 Score = 267 bits (682), Expect = 8e-70, Method: Compositional matrix adjust. Identities = 132/173 (76%), Positives = 154/173 (89%) Query: 2 LTLDTLNVMLAVSEEGLIEEMIIALLASPQLAVFFEKFPRLKAAITDDVPRWREALRSRL 61 +TLD LNVMLAVSEEG+IEEM++ALLASPQLAVFFEKFPRLK I D+PRWREA+R+RL Sbjct: 1 MTLDMLNVMLAVSEEGMIEEMLLALLASPQLAVFFEKFPRLKNIIAADIPRWREAVRARL 60 Query: 62 KDARVPPELTEEVMCYQQSQLLSTPQFIVQLPQILDLLHRLNSPWAEQARQLVDANSTIT 121 K+ +PP+L EV YQQ+QLLST QFIVQLPQIL LH+L SP+A QA++LVD N+T T Sbjct: 61 KEVNIPPDLDAEVQTYQQAQLLSTSQFIVQLPQILGKLHQLQSPFAAQAQKLVDDNATFT 120 Query: 122 SALHTLFLQRWRLSLIVQATTLNQQLLEEEREQLLSEVQERMTLSGQLEPILA 174 ALHTLFLQRWRLSL+VQAT+LNQQLL+EER+QLLSEVQERMTLSGQL+P+LA Sbjct: 121 PALHTLFLQRWRLSLVVQATSLNQQLLDEERDQLLSEVQERMTLSGQLDPVLA 173 >UniRef50_A6CWE2 Putative uncharacterized protein yieM n=3 Tax=Vibrio RepID=A6CWE2_9VIBR Length = 497 Score = 252 bits (643), Expect = 3e-65, Method: Compositional matrix adjust. Identities = 145/483 (30%), Positives = 262/483 (54%), Gaps = 6/483 (1%) Query: 1 MLTLDTLNVMLAVSEEGLIEEMIIALLASPQLAVFFEKFPRLKAAITDDVPRWREALRSR 60 ML D+LN+ + V+E G+I+ + ++ L LK ++T + +W ++++ + Sbjct: 1 MLGADSLNLAMMVAESGIIDSAVRDIMQQTDLLAMGSD-EGLKQSLTASMAKWSKSVKRK 59 Query: 61 LKDARVPPELTEEVMCYQQSQLLSTPQFIVQLPQILDLLHRLNSPWAEQARQLV-DANST 119 L + L E+ YQ++ L+ +F QL Q+++ L +S + +ARQL D ++ Sbjct: 60 LVKGQETESLQSELELYQRAVYLTEQEFDDQLSQLIEQLPE-DSHFLPKARQLASDIDAY 118 Query: 120 ITSALHTLFLQRWRLSLIVQATTLNQQLLEEEREQLLSEVQERMTLSGQLEPILADNNTA 179 S F ++W SL + QQ +++++ + L ++ +++ +E + Sbjct: 119 PRSLFARQFCKQWYESLKQAVESKQQQTVDQQKSKFLKQMYQKIDTLKDMENLQEGGEQG 178 Query: 180 A-GRLWDMSAGQLKRGDYQLIVKYGEFLNEQPELKRLAEQLGRSREAKSIPRNDAQMETF 238 GRLWD++ +L + D++ I + E+L ELK +A++LGR E P + + Sbjct: 179 KLGRLWDLAGAELTKQDWRHIERTAEYLENNQELKHIADKLGRMAEEVDAPELNKALSHD 238 Query: 239 RTMVREPAT-VPEQVDGLQQSDDILRLLPPELATLGITELEYEFYRRLVEKQLLTYRLHG 297 +V E + + G+ +S+DI LLP E L ELE FY+ LVEK+LLTY+ G Sbjct: 239 EVVVEEKTDFATDDIVGIHESNDINNLLPNETMYLAYPELETIFYQHLVEKRLLTYKSEG 298 Query: 298 ESWREKVIERPVVHKDYDEQPRGPFIVCVDTSGSMGGFNEQCAKAFCLALMRIALAENRR 357 + + + P ++ GP ++ +D SGSM G E+ AKA +LM++A + R Sbjct: 299 KQRTVRQLHSPKTATGEADKETGPMLIAIDVSGSMQGAPEKSAKAIAYSLMKMAAQQQRE 358 Query: 358 CYIMLFSTEIVRYELSGPQGIEQAIRFLSQQFRGGTDLASCFRAIMERLQSREWFDADAV 417 C+++LFS+ + Y+L+G G+++A FLS F+GGTDL +E +Q ++ +AD + Sbjct: 359 CHVILFSSTFISYDLTGTTGLKEASDFLSYTFKGGTDLGKVLNHAVELMQGEQYKNADLL 418 Query: 418 VISDFIAQRLPDDVTSKVKELQRVHQHRFHAVAMSAHGKPGIMRIFDHIWRFDTGMRSRL 477 VISDFIA + +V KV+ L R +RFH++ +S +G P ++ +FD WR+ + + Sbjct: 419 VISDFIAPKQEQEVVEKVESL-RGRYNRFHSLCLSKYGNPEVLGLFDTQWRYHPSLVGQF 477 Query: 478 LRR 480 +++ Sbjct: 478 IKK 480 >UniRef50_D2U0I8 Putative uncharacterized protein n=1 Tax=Arsenophonus nasoniae RepID=D2U0I8_9ENTR Length = 330 Score = 249 bits (636), Expect = 2e-64, Method: Compositional matrix adjust. Identities = 135/312 (43%), Positives = 214/312 (68%), Gaps = 3/312 (0%) Query: 1 MLTLDTLNVMLAVSEEGLIEEMIIALLASPQLAVFFEKFPRLKAAITDDVPRWREALRSR 60 ML + T++++L+++E LIEE+++ LLA+PQL +FFEK+P LK+ + +D+ W++ L + Sbjct: 1 MLNIATIDMLLSINELELIEEIVLTLLATPQLVIFFEKYPNLKSILLNDLLAWKKNLYRQ 60 Query: 61 LKDARVPPELTEEVMCYQQSQLLSTPQFIVQLPQILDLLHRLNSPWAEQARQLVDANSTI 120 L++ VP +LTEE YQQ+ + T +F LP ++ L + S + ++A L + S Sbjct: 61 LQETLVPIKLTEEFALYQQNLAIDTTKFFSNLPVTINKLTEIASTFVQEANYLQERISH- 119 Query: 121 TSALHTLFLQRWRLSLIVQATTLNQQLLEEEREQLLSEVQERMTLSGQLEPILADNNTAA 180 A +LF+QRWRL+LI++ TT N+ LLE E+EQLL+E+++R+ L+G L +N + Sbjct: 120 DPAGQSLFIQRWRLNLIIEVTTFNKLLLEREKEQLLAELEQRLKLTGNLIETFNQDNHSV 179 Query: 181 GRLWDMSAGQLKR--GDYQLIVKYGEFLNEQPELKRLAEQLGRSREAKSIPRNDAQMETF 238 G+LWD+S G L + + QL+++Y FL +QPEL++LAE LGR + K + +E+ Sbjct: 180 GKLWDISKGVLTQSSNNIQLLIQYSHFLQQQPELEKLAELLGRRQSLKPKQKQQQMLESI 239 Query: 239 RTMVREPATVPEQVDGLQQSDDILRLLPPELATLGITELEYEFYRRLVEKQLLTYRLHGE 298 ++ + P +PEQ+ G+ +DILRLLP ELA LG+ ELE+EFYR+LVEKQLLTYRL G+ Sbjct: 240 ISVEKIPDQIPEQISGINHGNDILRLLPSELALLGLEELEFEFYRKLVEKQLLTYRLQGD 299 Query: 299 SWREKVIERPVV 310 +W+++ I RP + Sbjct: 300 NWQQRKILRPAI 311 >UniRef50_A3US96 Putative uncharacterized protein (Fragment) n=1 Tax=Vibrio splendidus 12B01 RepID=A3US96_VIBSP Length = 403 Score = 227 bits (578), Expect = 8e-58, Method: Compositional matrix adjust. Identities = 136/399 (34%), Positives = 224/399 (56%), Gaps = 7/399 (1%) Query: 1 MLTLDTLNVMLAVSEEGLIEEMIIALLASPQLAVFFEKFPRLKAAITDDVPRWREALRSR 60 ML D LN+ L V++ G+I+ + L+A Q+ + E +K ++ + + +WR ++ R Sbjct: 1 MLGADGLNLALMVADSGIIDTAMNDLIARSQVMMAAEN-KGVKTSVKNHLVKWRGKVKKR 59 Query: 61 LKDARVPPELTEEVMCYQQSQLLSTPQFIVQLPQILDLLHRLNSPWAEQARQLVDANSTI 120 + EE+ YQ+ PQF ++ ++ L +S + QAR+L++ N + Sbjct: 60 VTKVCETDRFQEEIALYQEVIYWDEPQFFDEIDSVIKKL-EWHSAFYLQARRLMENNKGV 118 Query: 121 TSALH-TLFLQRWRLSLIVQATTLNQQLLEEEREQLLSEVQERMTLSGQLEPIL-ADNNT 178 +A+ F +W SL LE +E++L+++ +RM ++ + + + Sbjct: 119 YNAMFPHYFCDQWYQSLSDAIKQAQVTELETSKEKVLADLYQRMETMKNMDKVTESGDEG 178 Query: 179 AAGRLWDMSAGQLKRGDYQLIVKYGEFLNEQPELKRLAEQLGRSREAKSIPR-NDAQMET 237 + GRLWDM++ +L + D ++ ++ EFLN+ L+ +AE+LGR + P + A +E Sbjct: 179 SVGRLWDMASAKLSKTDLTIMKRHAEFLNKHKGLQEIAEKLGRMASEEDDPSLHKAPVEE 238 Query: 238 FRTMVREPATVPEQVDGLQQSDDILRLLPPELATLGITELEYEFYRRLVEKQLLTYRLHG 297 + + + + + G+ +SDD+ ++LP E L ELE FY+ L +K+LL+YR G Sbjct: 239 LQMVEEKSDEAVDDIVGIHESDDLNKMLPNETMFLAYPELEVIFYKHLADKRLLSYRSQG 298 Query: 298 ESWR-EKVIERPVVHKDYDEQPRGPFIVCVDTSGSMGGFNEQCAKAFCLALMRIALAENR 356 +S KV + K+ D + +GPFIVCVD SGSM GF EQ AKA ALM+IALAE R Sbjct: 299 KSRTLRKVKAQKPDSKNVDIE-KGPFIVCVDASGSMSGFPEQSAKAMAYALMQIALAEER 357 Query: 357 RCYIMLFSTEIVRYELSGPQGIEQAIRFLSQQFRGGTDL 395 CY++LFS+E + YEL+ G+ +A FLS F GGTDL Sbjct: 358 DCYVILFSSEQITYELTRQDGLREASDFLSYSFHGGTDL 396 >UniRef50_Q1ZC32 Putative uncharacterized protein (Fragment) n=1 Tax=Psychromonas sp. CNPT3 RepID=Q1ZC32_9GAMM Length = 328 Score = 205 bits (521), Expect = 4e-51, Method: Compositional matrix adjust. Identities = 115/328 (35%), Positives = 191/328 (58%), Gaps = 8/328 (2%) Query: 156 LSEVQERMTLSGQLEPILAD-NNTAAGRLWDMSAGQLKRGDYQLIVKYGEFLNEQPELKR 214 L ++ +R +L I A+ N + RLWDM+ +L + + Q + + + L++ EL++ Sbjct: 1 LKDLYQRQETISKLTEIDANINPQNSMRLWDMAKAKLTKINVQTLKRTAKLLSKHSELQK 60 Query: 215 LAEQLGRSREAKSIP-RNDAQMETFRTMVREPATVPEQVD--GLQQSDDILRLLPPELAT 271 +A+QLGR P N ++ + R ++E +T P D G++QS D+ RLLP EL Sbjct: 61 IADQLGRMANQHDDPCLNRTEVHSRR--IKE-STSPFTGDIVGIKQSADLERLLPIELMF 117 Query: 272 LGITELEYEFYRRLVEKQLLTYRLHGESWREKVIERPVVHKDYDEQPRGPFIVCVDTSGS 331 L +EL+ FY+ L+EK+L TY+ + + I + EQ +GPFI+ +D SGS Sbjct: 118 LSDSELDVLFYKNLIEKRLSTYQQQNKHNEFEQITQFKQQPKKAEQDKGPFIIAIDASGS 177 Query: 332 MGGFNEQCAKAFCLALMRIALAENRRCYIMLFSTEIVRYELSGPQGIEQAIRFLSQQFRG 391 M G E+CAKAF LM+IALA+NR CY++LFS + + YELS G+ + + FLS F G Sbjct: 178 MMGSAEKCAKAFAYGLMKIALAQNRECYVILFSAQQITYELSNQHGLSEILNFLSYSFHG 237 Query: 392 GTDLASCFRAIMERLQSREWFDADAVVISDFIAQRLPDDVTSKVKELQRVHQHRFHAVAM 451 GTDL S + + +++ ++ +AD +VISDFI + K+ +L+ +RFHA+++ Sbjct: 238 GTDLTSVLESAFKVMETEKYKNADLIVISDFITPPMSSKTIDKLNKLKE-KSNRFHALSL 296 Query: 452 SAHGKPGIMRIFDHIWRFDTGMRSRLLR 479 S + ++ +FD W+++ + + R Sbjct: 297 SRYQNTEVLALFDKNWQYNPSKLANIKR 324 >UniRef50_B3WV50 Protein ViaA n=5 Tax=Enterobacteriaceae RepID=B3WV50_SHIDY Length = 78 Score = 165 bits (417), Expect = 4e-39, Method: Compositional matrix adjust. Identities = 77/78 (98%), Positives = 78/78 (100%) Query: 406 LQSREWFDADAVVISDFIAQRLPDDVTSKVKELQRVHQHRFHAVAMSAHGKPGIMRIFDH 465 +QSREWFDADAVVISDFIAQRLPDDVTSKVKELQRVHQHRFHAVAMSAHGKPGIMRIFDH Sbjct: 1 MQSREWFDADAVVISDFIAQRLPDDVTSKVKELQRVHQHRFHAVAMSAHGKPGIMRIFDH 60 Query: 466 IWRFDTGMRSRLLRRWRR 483 IWRFDTGMRSRLLRRWRR Sbjct: 61 IWRFDTGMRSRLLRRWRR 78 >UniRef50_A2SS27 von Willebrand factor, type A n=1 Tax=Methanocorpusculum labreanum Z RepID=A2SS27_METLZ Length = 492 Score = 163 bits (412), Expect = 1e-38, Method: Compositional matrix adjust. Identities = 99/328 (30%), Positives = 170/328 (51%), Gaps = 9/328 (2%) Query: 148 LEEEREQLLSEVQERMTLSGQLEPILADNNTAAGRLWDMSAGQLKRGDYQLIVKYGEFLN 207 ++ R+QLL ++E Q++ + G WD+S G+L D ++ K+ ++L Sbjct: 149 IQNRRKQLLQNLKEWFETIQQMKEVFEALGVDTGVFWDLSVGKLSAQDISVLKKWADYLK 208 Query: 208 EQPELKRLAEQLGRSREAKSIPRNDAQMETFRTMVREPAT-VPEQVDGLQQSDDILRLLP 266 +++ L E +GR + + + T + V++P E++ G++ D+ ++P Sbjct: 209 YDEKIRELCELMGRLHKEQQSHHTEIINSTIQYHVKKPDVHSNEEIIGIKFGRDLENIIP 268 Query: 267 PELATLGITELEYEFYRRLVEKQLLTYRLHGESWREKVIE---RPVVHKDYDEQPRGPFI 323 ELA L E+ F + VE +L+ + G + ++IE + V+ D DE+ GP I Sbjct: 269 QELALLSDPEVTLLFDLKYVENRLMCFSKQG--YITEIIEENMQETVNVD-DEEKMGPII 325 Query: 324 VCVDTSGSMGGFNEQCAKAFCLALMRIALAENRRCYIMLFSTEIVRYELSGPQGIEQAIR 383 +CVDTSGSM G E AKA L+L A+++ R CY++ FST I + + P+GI I Sbjct: 326 ICVDTSGSMSGAPENIAKALTLSLASRAISQKRNCYLINFSTSINTLDFTPPKGIHDLIN 385 Query: 384 FLSQQFRGGTDLASCFRAIMERLQSREWFDADAVVISDFIAQRLPDDVTSKVKELQRVHQ 443 FL F GGTD+A + + ++ AD +VISDF+ L D+ K+ Q+ + Sbjct: 386 FLKMSFHGGTDVAPALYEGIRMMSESDYKKADLLVISDFVIYGLSSDIVPLCKK-QKQEE 444 Query: 444 HRFHAVAMSAHGKPGIMR-IFDHIWRFD 470 +RF A+ + + G + +FD W +D Sbjct: 445 NRFFALCIGSFGTQRVEDGVFDQSWTYD 472 >UniRef50_C1Q9X6 Uncharacterized protein containing a von Willebrand factor type A (VWA) domain n=2 Tax=Brachyspira RepID=C1Q9X6_9SPIR Length = 529 Score = 154 bits (389), Expect = 7e-36, Method: Compositional matrix adjust. Identities = 107/362 (29%), Positives = 183/362 (50%), Gaps = 15/362 (4%) Query: 119 TITSALHTLFLQRWRLSLIVQATTLNQQLLEEEREQLLSEVQERMTLSG---QLEPILAD 175 T A+ L W+ SL + + +++ RE+ +++E + +LE L + Sbjct: 157 TDIKAIRKSVLDNWKNSLDNKYIDWSLNEIDKFREEFFKQIKEFLDYLKDIMELENALGE 216 Query: 176 NNTAAGRLWDMSAGQLKRGDYQLIVKYGEFLNEQPELKRLAEQLGRSREAKSIPRNDAQM 235 G L+D+S G L + D + I + + +K+L + LGR + + R + + Sbjct: 217 ---ETGSLFDLSLGNLLKRDIEYIKQLANLIKSNENIKKLCDMLGRFVKEEESYRIEKVL 273 Query: 236 --ETFRTMVREPATVPEQVDGLQQSDDILRLLPPELATLGITELEYEFYRRLVEKQLLTY 293 ETF T VR+ + E V G+ S DI +LP E L LE F + E +LLT+ Sbjct: 274 RKETFHTSVRDINSEDEIV-GITYSRDIHNILPQEKLLLAEGVLETLFGVKYFENRLLTF 332 Query: 294 RLHG--ESWREKVIERPVVHKDYDEQPRGPFIVCVDTSGSMGGFNEQCAKAFCLALMRIA 351 + G + + +++IE + K ++ +GP I+CVDTSGSM G E AKA L L A Sbjct: 333 KKEGYTDYYYDEMIEDEM--KVVEDDKKGPIIICVDTSGSMSGVPETVAKAVTLYLASRA 390 Query: 352 LAENRRCYIMLFSTEIVRYELSGPQGIEQAIRFLSQQFRGGTDLASCFRAIMERLQSREW 411 + + R CY++ FST+I +L+ P ++ I FL F GGTD R ++ + + + Sbjct: 391 MKQKRNCYLINFSTQIETMDLTYPNTMDNLIEFLRLSFNGGTDAVPALRHAIKTMNTENY 450 Query: 412 FDADAVVISDFIAQRLPDDVTSKVKELQRVHQHRFHAVAMSAHGKPGIMR-IFDHIWRFD 470 +D + ISDF+ DD K+ E Q+ +++RF+++ + + + IFD+ W +D Sbjct: 451 KKSDLLFISDFVFNGFTDD-DYKLAEAQKKNENRFYSLIIGSTPLFNVKNSIFDYNWCYD 509 Query: 471 TG 472 + Sbjct: 510 SS 511 >UniRef50_Q6LJM7 Putative uncharacterized protein n=1 Tax=Photobacterium profundum RepID=Q6LJM7_PHOPR Length = 492 Score = 150 bits (380), Expect = 7e-35, Method: Compositional matrix adjust. Identities = 116/416 (27%), Positives = 204/416 (49%), Gaps = 31/416 (7%) Query: 73 EVMCYQ----QSQLLSTPQF-IVQLPQILDLLHRLNSPWAEQARQLVDANSTITSALHTL 127 +++ YQ Q+QL S ++ + +L + D +++LN +Q R++ T H Sbjct: 77 DIISYQRFISQAQLPSDKKYWLKELTVLDDKVNKLN----QQKRKI-----TAIKTKHNH 127 Query: 128 FLQRWRLSLIVQATTLNQQLLEEEREQLLSEVQERMTLSGQLEPILADNNTAAGRLWDMS 187 L WR + + + + +E+ LSE+ + + L ++ G L D S Sbjct: 128 LLTHWRKQYDKAHSKWQLEAIRQFQEKFLSELNDWLEQIKILSEVVESLGLEPGYLLDFS 187 Query: 188 AGQLKRGDYQLIVKYGEFLNEQPELKRLAEQLGRSREAKSIPRNDAQMETFRTMVREPAT 247 G+L D + + K+ E+L +K L E LG+ R+ + +D ++ET + + P Sbjct: 188 EGKLTLSDVEKLKKWAEYLPNDEGVKSLCEMLGKLRQ---VTLSD-KIETIKKTINMPEM 243 Query: 248 V-----PEQVDGLQQSDDILRLLPPELATLGITELEYEFYRRLVEKQLLTYRLHGES--W 300 V +++ GL+ D+ +LP ELA + E F + +E L+ + + G S Sbjct: 244 VFDGDSKQEIVGLKLGKDLEHVLPSELALMSDPETSILFDLKYLESSLMCFDMAGISIDH 303 Query: 301 REKVIERPVVHKDYDEQPRGPFIVCVDTSGSMGGFNEQCAKAFCLALMRIALAENRRCYI 360 E+V+E+ + +D +GP ++C+DTSGSM G E AKA L L A E R CY+ Sbjct: 304 AEQVVEQSIQKED----KKGPMVICIDTSGSMHGSPETIAKALSLYLTTQAKKEQRDCYL 359 Query: 361 MLFSTEIVRYELSGPQGIEQAIRFLSQQFRGGTDLASCFRAIMERLQSREWFDADAVVIS 420 + ST I +LS + + FL + F GGTD+A R + +++ + +AD ++IS Sbjct: 360 INISTSIEILDLSQGYSLSSLLTFLQKSFHGGTDVAPAMRHGINIMKNDAYENADMLIIS 419 Query: 421 DFIAQRLPDDVTSKVKELQRVHQHRFHAVAM-SAHGKPGIMRIFDHIWRFDTGMRS 475 DF+ LP+D V E QR+ +RF+++ + +A + FD W ++ S Sbjct: 420 DFVMSSLPNDCLELV-EQQRIKGNRFYSLCIGNAFMTNRLKTHFDSEWVYNPSNSS 474 >UniRef50_C0QY03 von Willebrand factor type A (VWA) domain containing protein n=1 Tax=Brachyspira hyodysenteriae WA1 RepID=C0QY03_BRAHW Length = 467 Score = 150 bits (380), Expect = 9e-35, Method: Compositional matrix adjust. Identities = 89/334 (26%), Positives = 178/334 (53%), Gaps = 14/334 (4%) Query: 148 LEEEREQLLSEVQERMTLSGQLEPILADNNTAAGRLWDMSAGQLKRGDYQLIVKYGEFLN 207 +++ R++ +S+++ + L +L+ + G LWD G+L+ D L+ ++ +F+N Sbjct: 140 VKDRRDKFISDIESWINLLKKLKYMSNILRIKTGVLWDFRVGELEEADISLLKRWVDFIN 199 Query: 208 EQPELKRLAEQLGRSREAKSIPRNDAQMETFRTMVREPATVPEQVDGLQQSDDILRLLPP 267 E +++ + + +GR + + +N T+ ++ ++ E++ G+ + DI ++P Sbjct: 200 EYKDIEVICDSIGRRIDIEKSLKNVEFKNTYSNTNKKISS-KEEIVGIYFAKDIENVIPE 258 Query: 268 ELATLGITELEYEFYRRLVEKQLLTYRLHGESWREKVIERPVVHKDYDEQPRGPFIVCVD 327 EL+ L E E F + +E +L+ + + + ER + K + +G I+C+D Sbjct: 259 ELSLLCNEESEKLFKLKYIENRLMCFDKSAYVFND---ERDNIVKAGYREGKGDMIICID 315 Query: 328 TSGSMGGFNEQCAKAFCLALMRIALAENRRCYIMLFSTEIVRYELSGPQGIEQAIRFLSQ 387 TSGSM G NE AKA ++ AL+ENR Y++ FSTEI + + GIE I+FL Sbjct: 316 TSGSMKGINEYIAKATMFKMVMQALSENRNAYLINFSTEIYTCKFTKENGIEDLIKFLKL 375 Query: 388 QFRGGTDLASCFRAIMERLQSREWFDADAVVISDFIAQRLPDDVTSKVKELQRVHQHRFH 447 + GG+D+ + + + +AD +V+SDFI + +P+++ + + Q+ + +++ Sbjct: 376 SYHGGSDIYKALYEANRMMNTSSFRNADVLVLSDFIMEDMPNNLVTMCSK-QKNNGNKYF 434 Query: 448 AVAMSAHGK----PGIMRIFDHIWRF--DTGMRS 475 AV++ GK ++F+ W F D G++S Sbjct: 435 AVSI---GKFPFGYSYRKVFNRHWIFDIDNGLKS 465 >UniRef50_C6M593 Putative uncharacterized protein n=1 Tax=Neisseria sicca ATCC 29256 RepID=C6M593_NEISI Length = 482 Score = 149 bits (375), Expect = 3e-34, Method: Compositional matrix adjust. Identities = 99/353 (28%), Positives = 179/353 (50%), Gaps = 11/353 (3%) Query: 127 LFLQRWRLSLIVQATTLNQQLLEEEREQLLSEVQERMTLSGQLEPILADNNTAAGRLWDM 186 L ++W+ L + + + R++LL+++++ + + QL L G Sbjct: 118 LLTEKWQQQLDQAKAQWQVEQINQLRQELLTQLKQELEVVKQLSQQLEQLGFGIGD---- 173 Query: 187 SAGQLKRGDYQLIVKYGEFLNEQPELKRLAEQLGRSREAKSIPRNDAQMETFRTMVREPA 246 G L D + + ++ +L + +++AE LG+ R+ + + + +T ++ P Sbjct: 174 DIGNLTPQDIEEMKRWLNYLTQDKNAQQIAELLGKMRQIEQSEKIEQVKQT--VYIQNPQ 231 Query: 247 ---TVPEQVDGLQQSDDILRLLPPELATLGITELEYEFYRRLVEKQLLTYRLHGESWREK 303 E++ GL+ D+ +LP ELA + E F + +E +L+ + L G ++ + Sbjct: 232 IDINSREEIIGLRLGKDLEYVLPSELALMADEETSILFDLKFLESKLMCFELQGMTYCDA 291 Query: 304 VIERPVVHKDYDEQPRGPFIVCVDTSGSMGGFNEQCAKAFCLALMRIALAENRRCYIMLF 363 E V K +++ GP I+CVDTSGSM G E AKA L L A +ENR C+++ F Sbjct: 292 PTEIIVEQKSQEDEKPGPMILCVDTSGSMNGLPENIAKAMALFLGTKAKSENRSCFVINF 351 Query: 364 STEIVRYELSGPQGIEQAIRFLSQQFRGGTDLASCFRAIMERLQSREWFDADAVVISDFI 423 ST I +EL+ GI I FL Q F GGTD A R ++ ++ + AD ++ISDF+ Sbjct: 352 STGIETFELTSKTGISNLIAFLRQSFHGGTDAAPALRHALKMMEQESYQKADLLMISDFV 411 Query: 424 AQRLPDDVTSKVKELQRVHQHRFHAVAMS-AHGKPGIMRIFDHIWRFDTGMRS 475 LPDD+ + + E+QR ++F+++ + A + FD W ++ +++ Sbjct: 412 MNGLPDDLLASI-EIQRETGNQFNSLVIGDAFMSKRLKTHFDREWIYNPNVQT 463 >UniRef50_C1Q8W4 Uncharacterized protein containing a von Willebrand factor type A (VWA) domain n=1 Tax=Brachyspira murdochii DSM 12563 RepID=C1Q8W4_9SPIR Length = 478 Score = 148 bits (373), Expect = 6e-34, Method: Compositional matrix adjust. Identities = 91/328 (27%), Positives = 173/328 (52%), Gaps = 13/328 (3%) Query: 148 LEEEREQLLSEVQERMTLSGQLEPILADNNTAAGRLWDMSAGQLKRGDYQLIVKYGEFLN 207 ++E R++ +S+++ ++L +L + G LWD G+L+ D L+ ++ EF+N Sbjct: 150 VKERRDKFISDIESWISLIKKLRYMSNILRIKTGVLWDFRVGELEENDISLLNRWVEFIN 209 Query: 208 E-QPELKRLAEQLGRSREAKSIPRNDAQMETFRTMVREPATVPEQVDGLQQSDDILRLLP 266 + + E++ + + +G+ + + +N T+ + + + E++ G+ + DI ++P Sbjct: 210 KYKKEIETICDSIGKRVDIEKALKNIEFKNTY-SYTNKKISSKEEIVGIYFAKDIENVVP 268 Query: 267 PELATLGITELEYEFYRRLVEKQLLTYRLHGESWREKVIERPVVHKDYDEQPRGPFIVCV 326 EL+ L + E F + +E +L+ + + E + +V Y E +G I+C+ Sbjct: 269 EELSLLCDEDSEKLFKLKYIENRLMCFDKSAYVFNEN--DFDIVKAGYKE-GKGDMIICI 325 Query: 327 DTSGSMGGFNEQCAKAFCLALMRIALAENRRCYIMLFSTEIVRYELSGPQGIEQAIRFLS 386 DTSGSM G +E AKA ++ AL+ENR Y++ FSTEI + GIE I+FL Sbjct: 326 DTSGSMKGTSEYIAKAIMFKMVMQALSENRNAYLINFSTEIYTCRFTKNNGIEDLIKFLK 385 Query: 387 QQFRGGTDLASCFRAIMERLQSREWFDADAVVISDFIAQRLPDDVTSKVKELQRVHQHRF 446 + GG+D+ + + + +AD +V+SDFI + +PD++ K+ QR + ++F Sbjct: 386 LSYHGGSDIYKALYEANRVMNTSSFKNADVLVLSDFIMEDMPDNLV-KICSNQRNNGNKF 444 Query: 447 HAVAMSAHGK----PGIMRIFDHIWRFD 470 AV++ GK ++F+ W FD Sbjct: 445 FAVSI---GKFPFGYSYKKVFNRHWIFD 469 >UniRef50_Q14PC7 Hypothetical two-component regulator system yiem receptor component protein n=1 Tax=Spiroplasma citri RepID=Q14PC7_SPICI Length = 519 Score = 141 bits (355), Expect = 6e-32, Method: Compositional matrix adjust. Identities = 105/383 (27%), Positives = 194/383 (50%), Gaps = 14/383 (3%) Query: 97 DLLHRLNSPWAEQAR----QLVDANSTITSALHTLFLQRWRLSLIVQATTLNQQLLEEER 152 D L ++NSP+ ++ + + T+ L F+ W LI + +EE R Sbjct: 96 DFLFKVNSPFYDRLNYYRYEFNKKQNNNTNMLFRDFIGIWESILIKRINDYRFAKIEELR 155 Query: 153 EQLLSEVQERMTLSGQLEPILADNNTAAGRLWDMSAGQLKRG-DYQLIVKYGEFLNEQPE 211 + + ++ ++ + + +L G++W+ + +LK+G + I K+ +FL P Sbjct: 156 TKFMQDLYNKVEIYNKANSLLKTVWNFFGKIWNPT--ELKKGVNMSAIDKFAKFLETNPA 213 Query: 212 LKRLAEQLGRSREAKSIPRNDAQMETFRTMVREP-ATVPEQVDGLQQSDDILRLLPPELA 270 + +A LGR + ++ E +P + PE++ G +S D+ + EL Sbjct: 214 IMEIATLLGRFQGESNLIEQRILEEIVMDYEWKPIGSSPEEIIGATESKDLEHMFAAELV 273 Query: 271 TLGITELEYEFYRRLVEKQLLTYRLHGESW--REKVIERPVVHKDYDEQPRGPFIVCVDT 328 L L+Y FY++ +E +L T+ + +E++ R + + Y + +GP I+ +DT Sbjct: 274 LLKDPVLKYIFYKKYIEGKLTTFEFLSQDKVPKEQIKLRTI--ETYVPEEKGPIILSIDT 331 Query: 329 SGSMGGFNEQCAKAFCLALMRIALAENRRCYIMLFSTEIVRYELSG-PQGIEQAIRFLSQ 387 S SM G EQ AKA LA+ +IAL E+R CY++ FS + Y LS + + I FLS+ Sbjct: 332 SSSMRGSPEQIAKALALAIAKIALGEHRPCYMINFSKSLDVYNLSSLKDSLPKLIEFLSK 391 Query: 388 QFRGGTDLASCFRAIMERLQSREWFDADAVVISDFIAQRLPDDVTSKVKELQRVHQHRFH 447 F G T++ + + S E+F+AD ++ISDF+ L ++ +K+ L++ ++RFH Sbjct: 392 SFAGDTNVEPALEHTLTVMDSNEYFNADLLLISDFLTSDLSPELITKINLLKQ-RRNRFH 450 Query: 448 AVAMSAHGKPGIMRIFDHIWRFD 470 A+ + G + IF++ W +D Sbjct: 451 AIVIGTMGAENVETIFNNAWIYD 473 >UniRef50_A6C7T1 Putative uncharacterized protein n=1 Tax=Planctomyces maris DSM 8797 RepID=A6C7T1_9PLAN Length = 313 Score = 135 bits (339), Expect = 4e-30, Method: Compositional matrix adjust. Identities = 83/249 (33%), Positives = 133/249 (53%), Gaps = 11/249 (4%) Query: 184 WDMSAGQLKRGDYQLIVKYGEFLNEQPELKRLAEQLGRSREAKSIPRNDAQMETFRTMVR 243 WD+SAG K + + KY + ++ P+++++ E+LGR + ++ + + + F ++ R Sbjct: 13 WDLSAGIFKYRGWGDLKKYRDLIDRIPQIRQMIEELGRLQASEEMDDDPTYADAFNSLRR 72 Query: 244 --------EPATVPEQVDGLQQSDDILRLLPPELATLGITELEYEFYRRLVEKQLLTYRL 295 E Q G+++S D +R++P E L+ ++ +L E+ LLTYR+ Sbjct: 73 TTEEQREVEHPLARHQAQGIERSADFMRMIPSEAMLRRRPGLKRLWHAKLAERGLLTYRV 132 Query: 296 HGESWREKVIERPVVHKDYDEQPRGPFIVCVDTSGSMGGFNEQCAKAFCLALMRIALAEN 355 G E ++ RGP IVCVDTSGSM G E AKA L RIA AE Sbjct: 133 RGTYVDRVSTEVEEQQPQSKKRIRGPIIVCVDTSGSMSGRPEAVAKALTLEACRIAHAEQ 192 Query: 356 RRCYIMLF--STEIVRYELS-GPQGIEQAIRFLSQQFRGGTDLASCFRAIMERLQSREWF 412 R C + F S + V +ELS P G++ + FL+ F GGTD+++ F + RL++ EW Sbjct: 193 RPCLLFSFSGSGQYVEHELSLSPDGLQSLLEFLTMNFDGGTDISTPFEKALARLRTAEWE 252 Query: 413 DADAVVISD 421 AD +++SD Sbjct: 253 RADILLVSD 261 >UniRef50_A1SXM0 von Willebrand factor, type A n=2 Tax=Alteromonadales RepID=A1SXM0_PSYIN Length = 529 Score = 135 bits (339), Expect = 5e-30, Method: Compositional matrix adjust. Identities = 99/358 (27%), Positives = 171/358 (47%), Gaps = 15/358 (4%) Query: 126 TLFLQRWRLSLIVQATTLNQQLLEEEREQLLSEVQERMTLSGQLEPILADNNTAAGRLWD 185 T L WR + + L+ + +++ L +++E + L + D G D Sbjct: 157 THLLGEWRKQIEQKRVEWELNLIHKLQQKFLEKMEEWLRYLSALINSIDDIGFDLGYFLD 216 Query: 186 MSAGQLKRGDYQLIVKYGEFLNEQPELKRLAEQLGRSREAKSIPRNDAQMETFRTMVREP 245 S G+L D + I K+ ++ + L + LG+ R+ + +D ++E ++ P Sbjct: 217 FSKGELSESDIEQIKKWLNYIQNDKGAQLLCDLLGKIRQ---VSHSD-KIEIANKIIDVP 272 Query: 246 A-----TVPEQVDGLQQSDDILRLLPPELATLGITELEYEFYRRLVEKQLLTYRLHG--E 298 + E++ G++ ++ +LP E A L F + +E +L+ + + G Sbjct: 273 SQYIDSNSKEEIVGIKLGQELEHVLPSEFALLSDPSTSILFDLKYIESRLMCFDMVGIQN 332 Query: 299 SWREKVIERPVVHKDYDEQPRGPFIVCVDTSGSMGGFNEQCAKAFCLALMRIALAENRRC 358 S + IE V ++ E +GP ++CVDTSGSM G E AKA L L A E R C Sbjct: 333 SVDQIEIEEEVTVQE--ENTKGPMVICVDTSGSMHGSPEAIAKAVTLFLSSTAQKEKRDC 390 Query: 359 YIMLFSTEIVRYELSGPQGIEQAIRFLSQQFRGGTDLASCFRAIMERLQSREWFDADAVV 418 Y++ FST I +LSG I+ I FL + F GGTD+A ++ +Q+ + +AD ++ Sbjct: 391 YLINFSTSIETLDLSGNYSIKTLIDFLRKSFHGGTDVAPAINHGLKVMQNDTYENADMLI 450 Query: 419 ISDFIAQRLPDDVTSKVKELQRVHQHRFHAVAM-SAHGKPGIMRIFDHIWRFDTGMRS 475 ISDF+ LPD + L R +RF+++ + +A + IFD W ++ S Sbjct: 451 ISDFVMSYLPDKTVKNIGVL-RESGNRFYSLCIGNAFMSNRLSAIFDREWIYNPATTS 507 >UniRef50_A4Y9K4 von Willebrand factor, type A n=3 Tax=Shewanella RepID=A4Y9K4_SHEPC Length = 528 Score = 133 bits (334), Expect = 2e-29, Method: Compositional matrix adjust. Identities = 94/308 (30%), Positives = 147/308 (47%), Gaps = 21/308 (6%) Query: 161 ERMTLSGQLEPILADNNTAAGRLWDMSAGQLKRGDYQLIVKYGEFLNEQPELKRLAEQLG 220 ER+ + +LE + D G WD+S G + + +V+ + + + P+L+ + E LG Sbjct: 196 ERLEIWQELEEVFTDLGLLTGLGWDLSQGLFQSHGWMNLVRLQKIVKQIPQLREVIETLG 255 Query: 221 RSREAKSIPRNDAQMETFRTMVREPATV-----PEQVDGLQQSDDILRLLPPELATLGIT 275 ++ + P + + + R V P + G+ +SD I R+LP E A G Sbjct: 256 SMKDTEGEPIIEEIISRMSVIFRHEVEVTTPLVPMETKGITRSDSISRMLPQEAAFFGHP 315 Query: 276 ELEYEFYRRLVEKQLLTYRLHGESWREKVIERPVVHKDYD-----EQPRGPFIVCVDTSG 330 L+ ++ R E LL+Y + G +V E+ +K+ + RGP IVC+DTSG Sbjct: 316 VLKKLWHARRAEHALLSYAVEGTELITEVTEQEQENKENKAGNKVNRNRGPMIVCLDTSG 375 Query: 331 SMGGFNEQCAKAFCLALMRIALAENRRCYIMLFST--EIVRYELSGPQ-GIEQAIRFLSQ 387 SM G E AKA L + +A E R C++ LF + E+ EL+ + G+EQ I FLS Sbjct: 376 SMQGTPENVAKALVLQCISVAKKEKRACFVYLFGSKGEVKEMELTPDKAGLEQMILFLSM 435 Query: 388 QFRGGTDLASCFRAIMERLQSREWFDADAVVISDFIAQRLPDDVTSKVKELQRVHQHRFH 447 F GGTD+ +ER ++W AD +++S D S L R +R Sbjct: 436 SFGGGTDVEGPLNMALERSDEKQWQQADILLVS--------DGEFSVSSGLSRKISNRKE 487 Query: 448 AVAMSAHG 455 MS HG Sbjct: 488 QRGMSVHG 495 >UniRef50_A6DQA4 Putative uncharacterized protein n=1 Tax=Lentisphaera araneosa HTCC2155 RepID=A6DQA4_9BACT Length = 479 Score = 123 bits (308), Expect = 2e-26, Method: Compositional matrix adjust. Identities = 98/384 (25%), Positives = 180/384 (46%), Gaps = 24/384 (6%) Query: 99 LHRLNSPWAEQARQLVDANSTITSALHTLFLQRWRLSLIVQATTLNQQLLEEEREQLLSE 158 L L S W ++ Q+ D + + L + +W SL +T Q+ L+ + + E Sbjct: 90 LPDLKSYWNQELSQINDQPGNV-NVLPQFLISQWHKSLRDLQSTWKQERLDNLQNETQQE 148 Query: 159 VQERMT----LSGQLEPILAD-----NNTAAGRLWDMSAGQLKRGDYQLIVKYGEFLNEQ 209 + + M ++ +LE + D + + +L + + + K+ L + Sbjct: 149 MNDWMDNLNDIADELEKLDLDPEDVFDFASGAGAGGDGPSELSIQNLETLKKWLGTLKKD 208 Query: 210 PELKRLAEQLGRSREAKSIPRNDAQMETFRTMVREPATVPEQVDGLQQSDDILRLLPPEL 269 P +K L + LG+ ++AK N + + + E++ G++ S ++ LLP EL Sbjct: 209 PGIKDLCKLLGKLKQAK---LNKIKRSRTTSSSVSSSNSCEEISGIKFSKELEHLLPSEL 265 Query: 270 ATLGITELEYEFYRRLVEKQLLTYRLHG--ESWREKVIERPVVHKDYDEQPRGPFIVCVD 327 A L E E F + E +L+ + + G +++ IE P ++ +GP I+ VD Sbjct: 266 ALLTDPETEIIFDLKYAESRLMGFDMSGIQTVSKKEEIEMP-------DEEQGPMIIAVD 318 Query: 328 TSGSMGGFNEQCAKAFCLALMRIALAENRRCYIMLFSTEIVRYELSGPQGIEQAIRFLSQ 387 TSGSM G E AKA L + + ++ ENR CY++ FST+I +L+G + ++FL Sbjct: 319 TSGSMYGAPETTAKAITLYMAKTSMKENRNCYVIEFSTKIKTIDLAGSNRLSALMKFLEM 378 Query: 388 QFRGGTDLASCFRAIMERLQSREWFDADAVVISDFIAQRLPDDVTSKVKELQRVHQHRFH 447 F GGTD+ E + + +AD +V+SDFI L + K+++ + + F+ Sbjct: 379 SFNGGTDVEPAIEHGTEVMNQEGYRNADMLVVSDFILNDLEPPLVDKIQQ-AKAKNNSFY 437 Query: 448 AVAMSAHGKPGIMR-IFDHIWRFD 470 ++ + H R FD W ++ Sbjct: 438 SLCIGDHFHSHKNREYFDRKWVYN 461 >UniRef50_D1YZJ4 Putative uncharacterized protein n=1 Tax=Methanocella paludicola SANAE RepID=D1YZJ4_METPS Length = 506 Score = 123 bits (308), Expect = 2e-26, Method: Compositional matrix adjust. Identities = 85/290 (29%), Positives = 139/290 (47%), Gaps = 25/290 (8%) Query: 181 GRLWDMSAGQLKRGDYQLIVKYGEFLNEQPELKRLAEQLGRSREAKSIPRNDAQMETFRT 240 GR WD S +L R + + KY + + +LK++ + +GR +ME Sbjct: 215 GRGWDRSMLELHRVYFANLHKYSKIVERNEDLKKILDTIGR-----------IEMEYGSR 263 Query: 241 MVREPATVPEQVDGLQQSDDILRLLPPELATLGITELEYEFYRRLVEKQLLTYRLHGESW 300 + + +V + S D+ +LP E L L F+ +EK+LLTY L G +W Sbjct: 264 RLSLSSYSHSEVYSVTTSGDLQHMLPVESVKLQDETLRNLFFAHWMEKKLLTYELKGVNW 323 Query: 301 REKVIERPVVHKDYDEQPRGPFIVCVDTSGSMGGFNEQCAKAFCLALMRIALAENRRCYI 360 D ++ RGP + VDTSGSM G E AK+ LAL+R + E+R + Sbjct: 324 -----------TDDSKKNRGPMVAMVDTSGSMHGDPEIVAKSIILALVRRMMKESRDVKV 372 Query: 361 MLFSTEIVRYELSGPQGIEQA---IRFLSQQFRGGTDLASCFRAIMERLQSREWFDADAV 417 LFS+E +E+ + A + FLS F GGTD + R +E L+ +++ +AD + Sbjct: 373 YLFSSEGQTHEIEITDNKKMATEFLDFLSYTFEGGTDFDTALREGVESLKKKQYVNADIL 432 Query: 418 VISDFIAQRLPDDVTSKVKELQRVHQHRFHAVAMSAHGKPGIMRIFDHIW 467 I+D ++ V S +++++R + R + + GI R DHI+ Sbjct: 433 FITDGLSVVNDKYVISGLEQMKRENGTRLFTIIVGNDNAGGIDRFSDHIF 482 >UniRef50_C3XEK2 Putative uncharacterized protein n=1 Tax=Helicobacter bilis ATCC 43879 RepID=C3XEK2_9HELI Length = 493 Score = 121 bits (303), Expect = 6e-26, Method: Compositional matrix adjust. Identities = 109/376 (28%), Positives = 176/376 (46%), Gaps = 22/376 (5%) Query: 108 EQARQLVDANSTITSALHTLFLQRWRLSLIVQATTLNQQLLEEEREQL----LSEVQERM 163 E+ +Q D +T L +L + LS + A L +L+E ++ L L + + Sbjct: 109 EKLKQTRDYKQRLTDYLESLLETKEFLSNLGGAGELFSGVLDEMKQGLDVSNLGDEAYQN 168 Query: 164 TLSGQLEPILADNNTAAGRLWDMSAGQLKRGDYQLIVKYGEFLNEQPELKRLAEQLGR-- 221 L GQ + G+ D AG R D I +Y + LK +AE LGR Sbjct: 169 KLKGQRIEM------PGGKGTDNGAGIRNRIDINTIKQYFNTIQNSKALKEIAELLGRLE 222 Query: 222 SREAKSIPRNDAQMETFR-TMVREPATVPEQVDGLQQSDDILRLLPPELATLGITELEYE 280 E +S + +++++ T V E++ G+ D+ LLP ELA L LE Sbjct: 223 KEEEESEIQKIKELKSYSYTQVIPTKRYKEEICGVTLGRDLENLLPQELAMLEDETLELL 282 Query: 281 FYRRLVEKQLLTYRLHG------ESWREKVIERPVVHKDYDEQPRGPFIVCVDTSGSMGG 334 F + ++ +L + G E+ E E + E+ G I+CVDTSGSM G Sbjct: 283 FDLKYIQNRLFCFEKQGYHSITQEAQEEIEKEIETKKQKKREKNEGAIIICVDTSGSMYG 342 Query: 335 FNEQCAKAFCLALMRIALAENRRCYIMLFSTEIVRYELSGPQGIEQAIRFLSQQFRGGTD 394 E AKA L L A + R CY++ FS I ELSG G+ + ++FL F GGTD Sbjct: 343 NPEYIAKALTLFLATKANTQKRACYLINFSIGIETMELSGKGGMAKLMQFLEMSFGGGTD 402 Query: 395 LASCFRAIMERLQSREWFDADAVVISDFIAQRLPDDVTSKVKELQRVHQHRFHAVAMSAH 454 +A +A ++ +Q ++ +D +VISD +P+D+ +++ QR ++F+ + + + Sbjct: 403 VAPALKAGLKTMQQDDFKKSDLIVISDGGFGYIPNDLEKQMQN-QRQKDNKFYLLDI--N 459 Query: 455 GKPGIMRIFDHIWRFD 470 G G FD W ++ Sbjct: 460 GNSGKKTFFDKHWIYN 475 >UniRef50_Q8EW10 Putative uncharacterized protein MYPE3970 n=1 Tax=Mycoplasma penetrans RepID=Q8EW10_MYCPE Length = 488 Score = 119 bits (299), Expect = 2e-25, Method: Compositional matrix adjust. Identities = 81/291 (27%), Positives = 147/291 (50%), Gaps = 16/291 (5%) Query: 183 LWDMSAGQLKRGD-YQLIVKYGEFLNEQPELKRLAEQLGR-SREAKSIPRNDAQMETFRT 240 LWD+S+ + + ++ + Y + + LK+ +G +E I +N+ ++ Sbjct: 119 LWDLSSIEERETQLFKAVENYFNLVKDDENLKKFIRMIGTFMQENLEIEKNEKELFL--- 175 Query: 241 MVREPATVPEQVDGLQQSDDILRLLPPELATLGITELEYEFYRRLVEKQLLTYRLHGESW 300 +P++ L QS D+ L+P E+A L ELE F + +E++LLTY+L G Sbjct: 176 -----ENIPQETFALYQSSDLNNLIPNEIAQLDDPELEIIFLKNFIEQKLLTYQLWG--- 227 Query: 301 REKVIERPVVHKDYDEQPRGPFIVCVDTSGSMGGFNEQCAKAFCLALMRIALAENRRCYI 360 E+ I+ V K D RGP +C+DTSGSM E +KA L +R + Sbjct: 228 IEREIQEEWVIKQRDIGERGPLFICLDTSGSMRNMKEVLSKALTLVFVRELEKMDINVVF 287 Query: 361 MLFSTEIVRYELSGPQGIEQAIRF-LSQQFRGGTDLASCFRAIMERLQSREWFDADAVVI 419 + FS E Y+L + ++++ L + F GG+D+ I + +++ A+ +++ Sbjct: 288 IPFSMEAKFYDLYDSKFKLKSVKMNLRKSFYGGSDIEKLVDLIDSVIYKKKYERANILIM 347 Query: 420 SDFIAQRLPDDVTSKVKELQRVHQHRFHAVAMSAH-GKPGIMRIFDHIWRF 469 SDFI ++LP +K+K+L+ + H+ H++ +S K + IF+ WR+ Sbjct: 348 SDFIFKKLPKKAVNKLKKLKH-NGHKLHSLTISDQIYKNNLFDIFNTNWRY 397 >UniRef50_Q0W1N2 Putative uncharacterized protein n=1 Tax=uncultured methanogenic archaeon RC-I RepID=Q0W1N2_UNCMA Length = 477 Score = 114 bits (284), Expect = 1e-23, Method: Compositional matrix adjust. Identities = 82/290 (28%), Positives = 135/290 (46%), Gaps = 22/290 (7%) Query: 179 AAGRLWDMSAGQLKRGDYQLIVKYGEFLNEQPELKRLAEQLGRSREAKSIPRNDAQMETF 238 A GR WD + + + D I +Y + + P+L +L E LGRS E ++T Sbjct: 200 AGGRGWDYAMIEQHKDDLYNIKRYSDIVRRNPDLMKLIEDLGRSSEG---------LDTG 250 Query: 239 RTMVREPATVPEQVDGLQQSDDILRLLPPELATLGITELEYEFYRRLVEKQLLTYRLHGE 298 V + +V + S D+ LLP EL L + L+Y F+ R +E +LLTY L Sbjct: 251 SGKVLHSGRL--EVHSIVTSSDLYYLLPSELIKLQDSILQYLFFARWIEGKLLTYHL--- 305 Query: 299 SWREKVIERPVVHKDYDEQPRGPFIVCVDTSGSMGGFNEQCAKAFCLALMRIALAENRRC 358 P D + +GP I VDTSGSM G A+A LA +R+ L R+ Sbjct: 306 -------TDPGKSDTGDCKRKGPVIALVDTSGSMDGIPGILARAVTLATVRMFLQRGRKI 358 Query: 359 YIMLFSTEIVRYELSGPQGIEQA-IRFLSQQFRGGTDLASCFRAIMERLQSREWFDADAV 417 ++LFS+ E+ P+G + FL F GGTD + +A + L++R++ AD + Sbjct: 359 RVVLFSSVGQLDEIDLPEGSTPGFLEFLRSSFGGGTDFNTALKAGLGALKARQYASADIM 418 Query: 418 VISDFIAQRLPDDVTSKVKELQRVHQHRFHAVAMSAHGKPGIMRIFDHIW 467 ++D +++ + + + L+ + V + G+ I D ++ Sbjct: 419 FVTDGMSRITDEALIEDWRRLKEASGSQIFTVIVGNDQAGGLEDISDRVY 468 >UniRef50_B9KEB5 Putative uncharacterized protein n=1 Tax=Campylobacter lari RM2100 RepID=B9KEB5_CAMLR Length = 474 Score = 112 bits (279), Expect = 4e-23, Method: Compositional matrix adjust. Identities = 74/292 (25%), Positives = 142/292 (48%), Gaps = 17/292 (5%) Query: 184 WDMSAGQLKRGDYQLIVKYGEFLNEQPELKRLAEQLGRSREAKSIP-RNDAQMETFRTMV 242 + + G K+ + I+ Y E + LK + + LG+SR + +ND+ + Sbjct: 185 YGNNTGCEKKLSIKSIIDYFEIIKNNHALKEICDLLGKSRNDDNKEGKNDSNLNN--NAQ 242 Query: 243 REPATVPEQVDGLQQSDDILRLLPPELATLGITELEYEFYRRLVEKQLLTYRLHGESWRE 302 + E++ G+ ++ LL EL L +LE F + +E +L + G Sbjct: 243 KTSKESKEEIKGVILGRNLEELLAQELGLLNDEDLENLFVLKYLENRLFCFEKQG----- 297 Query: 303 KVIERPVVHKDYDEQPRGPFIVCVDTSGSMGGFNEQCAKAFCLALMRIALAENRRCYIML 362 ++K + + +G I+CVD+SGSM G E AK +++ AL E CY++ Sbjct: 298 ------YINKMQNHKNKGAIIICVDSSGSMDGQPEIIAKGITYYMVKKALKEKSACYLIN 351 Query: 363 FSTEIVRYELSGPQGIEQAIRFLSQQFRGGTDLASCFRAIMERLQSREWFDADAVVISDF 422 FST+ E+ +G+++ FL F GGTD++ + ++++Q + +D +VISD Sbjct: 352 FSTKTKCEEIDLSKGMKKLFDFLCFSFNGGTDVSIALKEGVKKMQEDGFERSDLLVISDG 411 Query: 423 IAQRLPDDVTSKVKELQRVHQHRFHAVAMSAHGKPGIMRIFDHIWRFDTGMR 474 + + + ++ E QR +++F+ + + +G + IFD W++DT + Sbjct: 412 FFGDIDNKILKQM-EKQREQENKFYLLDI--NGCDKVKTIFDKHWKYDTSTK 460 >UniRef50_Q46D40 Putative uncharacterized protein n=3 Tax=Methanosarcina RepID=Q46D40_METBF Length = 612 Score = 111 bits (278), Expect = 5e-23, Method: Compositional matrix adjust. Identities = 81/291 (27%), Positives = 139/291 (47%), Gaps = 24/291 (8%) Query: 181 GRLWDMSAGQLKRGDYQLIVKYGEFLNEQPELKRLAEQLGRSREAKSIPRNDAQMETFRT 240 GR WD S L R + + KY L + + + EQ+GR ++E Sbjct: 315 GRAWDYSLKALHREYFGNLEKYAALLRKSSAIHEILEQVGR-----------IELEYGSK 363 Query: 241 MVREPATVPEQVDGLQQSDDILRLLPPELATLGITELEYEFYRRLVEKQLLTYRLHGESW 300 + +V + S D+ LLP E L L+ +FY ++E +LLTY+L GE+W Sbjct: 364 KLSLSPYSKSEVHSVTFSGDLRTLLPAETVKLKNPLLKRKFYADMLEGKLLTYQLKGENW 423 Query: 301 REKVIERPVVHKDYDEQPRGPFIVCVDTSGSMGGFNEQCAKAFCLALMRIALAENRRCYI 360 + + +GP + VDTS SM G E AKA LA+ R L ENR + Sbjct: 424 NSDSAGK---------KRKGPVVALVDTSASMRGSPELLAKAVVLAVTRRMLTENRDVKV 474 Query: 361 MLFST--EIVRYELSGPQGI-EQAIRFLSQQFRGGTDLASCFRAIMERLQSREWFD-ADA 416 +LFS+ + V EL+ + + E+ + FL F GGTD + RA ++ +++ + F+ AD Sbjct: 475 ILFSSKWQTVEIELTNKKRMGEEFLEFLKFTFGGGTDFNTALRAGLKAMKNEKAFEGADL 534 Query: 417 VVISDFIAQRLPDDVTSKVKELQRVHQHRFHAVAMSAHGKPGIMRIFDHIW 467 + ++D ++ + + E++ + R ++ + + G+ +I DH + Sbjct: 535 LFLTDGYSELSEKPLIREWNEIKAERRARIFSLIIGNYDAGGLQQISDHTY 585 >UniRef50_Q466I6 Putative uncharacterized protein n=3 Tax=Methanosarcina RepID=Q466I6_METBF Length = 562 Score = 111 bits (278), Expect = 5e-23, Method: Compositional matrix adjust. Identities = 85/329 (25%), Positives = 169/329 (51%), Gaps = 34/329 (10%) Query: 154 QLLSEVQERMTLSGQLEPILADNNTAAGRLWDMSAGQLKRGDY----QLIVKYGEFLNEQ 209 + ++E++E + L L + N W S +LK+ + +++ Y F + Sbjct: 233 EFVTEMEENLELFDTLTLLFPQRN------WSYSVKELKKEPFYVQLKMLKNYSTFFEKS 286 Query: 210 PELKRLAEQLGRSREAKSIPRNDAQMETFRTMVREPATVPEQVDGLQQSDDILRLLPPEL 269 P+LK++ + +GR RE P + ++ F +++ ++ SD I LLP E Sbjct: 287 PDLKKIMDFIGR-REFDP-PSDRIRLSPFGK---------DRIQTVRFSDSINNLLPMEA 335 Query: 270 ATLGITELEYEFYRRLVEKQLLTYRLHGESWREKVIERPVVHKDYDEQPRGPFIVCVDTS 329 A L L+ +FY ++E +LL+Y+ G+ + P + +PRGP IV VDTS Sbjct: 336 AKLLNPSLKKKFYADMLEGKLLSYQFLGKHYTGP----PRI------KPRGPMIVLVDTS 385 Query: 330 GSMGGFNEQCAKAFCLALMRIALAENRRCYIMLF--STEIVRYELSGPQGI-EQAIRFLS 386 GSM G + AK+ LA+ ++ L++ R ++LF +++ + ELS + + E+ + FL Sbjct: 386 GSMHGAPQTLAKSAVLAMAKLMLSQQRDMKVILFASTSQHLEIELSSRKKMSEKFLNFLL 445 Query: 387 QQFRGGTDLASCFRAIMERLQSREWFDADAVVISDFIAQRLPDDVTSKVKELQRVHQHRF 446 F GGTD + + ++ L+ +++ AD + I+D ++ + V ++ +E ++ + + Sbjct: 446 YTFGGGTDFNTALASGLKSLKEKDFQGADLLFITDGKSEVSDELVLARWEEAKKKYNAKV 505 Query: 447 HAVAMSAHGKPGIMRIFDHIWRFDTGMRS 475 +++ + + G G+ I D+I+ + M S Sbjct: 506 YSLIVGSSGAGGLSEISDYIYFVEMEMDS 534 >UniRef50_C3XJE4 Putative uncharacterized protein (Fragment) n=1 Tax=Helicobacter bilis ATCC 43879 RepID=C3XJE4_9HELI Length = 429 Score = 111 bits (277), Expect = 7e-23, Method: Compositional matrix adjust. Identities = 79/269 (29%), Positives = 132/269 (49%), Gaps = 14/269 (5%) Query: 154 QLLSEVQERMTLSGQLEPILADNNTAAGRLWDMSAGQLKRGDYQLIVKYGEFLNEQPELK 213 Q+L+E ++ + QL P + + +D S G + +++ I + + + +L+ Sbjct: 173 QILNEAKQSFDMK-QLCPNTYEESDIETNGYDYSKGHKRFINFKEINAFIKHIQTSKDLR 231 Query: 214 RLAEQLGRSREAKSIPRNDAQME-TFRTMVREPATVPEQVDGLQQSDDILRLLPPELATL 272 ++A LGR E + + ++ + +T + E++ G+ D+ LLP ELA L Sbjct: 232 KIAALLGREEENGNKKIEHSSIDQSIKTHNHK-----EEMSGVTLGRDLANLLPQELAML 286 Query: 273 GITELEYEFYRRLVEKQLLTYRLHGESWREKVIERPVVHKDYDEQPRGPFIVCVDTSGSM 332 LE F + ++ +L + G +K E + K+ G I+CVDTS SM Sbjct: 287 KDENLELLFNLKYIQNRLFCFEKQGYETIQK--EHYKMAKN-----EGAMIICVDTSSSM 339 Query: 333 GGFNEQCAKAFCLALMRIALAENRRCYIMLFSTEIVRYELSGPQGIEQAIRFLSQQFRGG 392 G E AKA L L A +NR CY++ FST+I ELSG I FL+ F GG Sbjct: 340 SGNREYLAKAITLFLATKASMQNRACYLINFSTDIETMELSGKDNARNLINFLAMSFNGG 399 Query: 393 TDLASCFRAIMERLQSREWFDADAVVISD 421 TD+A + ++++Q + +D +VISD Sbjct: 400 TDVAPALKEGLKKMQEDSFKQSDLIVISD 428 >UniRef50_C4KA81 von Willebrand factor type A (VWA) domain-containing protein n=1 Tax=Thauera sp. MZ1T RepID=C4KA81_THASP Length = 581 Score = 110 bits (276), Expect = 8e-23, Method: Compositional matrix adjust. Identities = 100/346 (28%), Positives = 149/346 (43%), Gaps = 37/346 (10%) Query: 157 SEVQERMTLSGQLEPILADNNTAAGRLWDMSAGQLKRGDYQLIVKYGEFLNEQPELKRLA 216 E+ E + G L +L DN WD G L+ D++ +++ + PEL R+ Sbjct: 175 GEIDELVGAFGDLGDLL-DNAR-----WDALRGLLRSTDWREVLRIRALIEGLPELARIL 228 Query: 217 EQLGR----------SREAKSIPRNDAQMETFRTMVREPATVPEQVDGLQQSDDILRLLP 266 LGR SR ++ + + VR P +P + G+Q+S I R+LP Sbjct: 229 RALGRACPTDEDAESSRALHAVVEHTEIQRSVSHRVRVP-DLPGETRGVQRSGRIARMLP 287 Query: 267 PELATLGITELEYEFYRRLVEKQLLTYRLHGESWREKVIERPVVHKDYDEQP-----RGP 321 E LG L ++ R E+ LL Y + + PV+ P +GP Sbjct: 288 AEATLLGHPRLRLVWHARRAERTLLAYEDDDHLQEDCLRPAPVLRPSQRPAPARRLEQGP 347 Query: 322 FIVCVDTSGSMGGFNEQCAKAFCLALMRIALAENRRCYIMLFST--EIVRYELS-GPQGI 378 +VCVDTSGSM G E AKA L +R A A R C + F E+V EL G+ Sbjct: 348 MLVCVDTSGSMQGGAEAVAKAVVLEAVRCAHARRRACRVYAFGGPDEVVEMELGVDVDGV 407 Query: 379 EQAIRFLSQQFRGGTDLASCFRAIMERLQSREWFDADAVVISDFIAQRLPDDVTSKVKEL 438 + RFL Q F GGTD+ + + RL W AD ++ SD P + ++V+ Sbjct: 408 GRLARFLGQGFGGGTDICAPLERALARLDEAGWQLADLLIASDGEFGATP-ALAARVEAA 466 Query: 439 QRVHQHRFHAVAMSAHGKPGIMRIFDHI-WRFDTGMRSRLLRRWRR 483 +R R + + G++ + D I W +R WRR Sbjct: 467 RRERGLRVQGILIGDRETIGLLELADDIHW----------VRDWRR 502 >UniRef50_Q5LDB9 Putative uncharacterized protein n=11 Tax=Bacteroides RepID=Q5LDB9_BACFN Length = 419 Score = 109 bits (273), Expect = 2e-22, Method: Compositional matrix adjust. Identities = 71/260 (27%), Positives = 129/260 (49%), Gaps = 13/260 (5%) Query: 199 IVKYGEFLNEQPELKRLAEQLGRSREAKSIPRNDAQMETFRTMVRE---PATVPEQVDGL 255 + Y E P ++ L + LG+ K + + + RE + G+ Sbjct: 152 LFHYDEIAKNHPAIRELTKILGKQHYGK-----EKKFRMVAGIHREQIITHATKSDITGV 206 Query: 256 QQSDDILRLLPPELATLGITELEYEFYRRLVEKQLLTYRLHGESWREKVIERPVVHKDYD 315 + +D+ LLP E L L+ F+ R +K+L + + ++ + + + Sbjct: 207 CEGNDLNSLLPIEYCYLSDPALQPLFFERFNKKKLQMMDYESKD-QHRIKDIKIQGNEIV 265 Query: 316 EQPRGPFIVCVDTSGSMGGFNEQCAKAFCLALMRIALAENRRCYIMLFSTEIVRYELSG- 374 E+ GPFI+CVDTSGSM G E+ K+ LA+ + ++R+CY++ FS +I E+ Sbjct: 266 EEQSGPFIICVDTSGSMSGEREEFVKSAILAIAELTEQQDRKCYLINFSNDIACIEIERL 325 Query: 375 PQGIEQAIRFLSQQFRGGTDLASCFRAIMERLQSREWFDADAVVISDFIAQRLPDDVTSK 434 Q I++ FL Q F GGTDL + L+++ + +AD V++SDF L ++++ + Sbjct: 326 GQNIQELANFLCQSFHGGTDLTPALLHAIYILKTKSYRNADLVMMSDFEMPPLNEELSEE 385 Query: 435 VKELQRVHQHRFHAVAMSAH 454 +K ++ Q++ H A+S H Sbjct: 386 IK---KIKQNKTHLYALSVH 402 >UniRef50_A4S4M8 Predicted protein n=4 Tax=Mamiellales RepID=A4S4M8_OSTLU Length = 535 Score = 108 bits (269), Expect = 6e-22, Method: Compositional matrix adjust. Identities = 93/339 (27%), Positives = 156/339 (46%), Gaps = 18/339 (5%) Query: 145 QQLLEEEREQLLSEVQERMTLSGQLEPILADNNTAAGRLWDMSAGQLKRGDYQLIVKYGE 204 +L+EE +EQ + + + E + D+ +D++ G ++ ++ + + Sbjct: 194 SRLMEEFKEQWEPAMDKLDKAAKAFEGLDLDDLADGPEGFDLTRGLWQQTGWKELDSLRK 253 Query: 205 FLNEQPELKRLAEQLGRSREAKSIPRNDAQMETFR---TMVREPATVPEQVDGLQQSDDI 261 L + EL+ + LGR + R Q E +VR P PEQ GL +SDD+ Sbjct: 254 KLQDLKELRDMVRSLGRGSGRGPLRRAPRQRERQGFPIGLVRSPME-PEQTSGLCRSDDL 312 Query: 262 LRLLPPELATLG--ITELEYEFYRRLVEKQLLTYRLHGESWREKVIERPVVHKDYDEQPR 319 R++P E+ L + + + R E+ LL+Y G W E E V + ++ +P Sbjct: 313 SRMMPSEMVLLASSLPQARLLHFARRAERTLLSYERVG--WSE---EPAVTVEGFETRPA 367 Query: 320 ---GPFIVCVDTSGSMGGFNEQCAKAFCLALMRIALAENRRCYIMLFSTEIVRYELS--- 373 GP IVC+DTSGSM G E AKA L MR + ++ R CY+ FS EL Sbjct: 368 AECGPIIVCLDTSGSMMGARETVAKAMVLECMRQSRSQQRACYLYSFSGPGDCQELELKL 427 Query: 374 GPQGIEQAIRFLSQQFRGGTDLASCFRAIMERLQSREWFDADAVVISDFIAQRLPDDVTS 433 G+ + FLS F GGTD+ F + RL EW +AD ++++D + + + + Sbjct: 428 NAAGLYGLLEFLSGSFHGGTDVDEPFNRALARLNEAEWSNADILLVTDGEIKPPDETLIA 487 Query: 434 KVKELQRVHQHRFHAVAMSAHGKPGIMR-IFDHIWRFDT 471 + E + + H + + G ++ I H+ F + Sbjct: 488 NLNEAKEEMGLKVHGLLVGDAGNAEVVESICTHVHAFKS 526 >UniRef50_C9KWG4 Putative uncharacterized protein n=1 Tax=Bacteroides finegoldii DSM 17565 RepID=C9KWG4_9BACE Length = 454 Score = 103 bits (258), Expect = 1e-20, Method: Compositional matrix adjust. Identities = 75/294 (25%), Positives = 137/294 (46%), Gaps = 20/294 (6%) Query: 184 WDMSAGQLKRGDYQLIVKYGEFLNEQPELKRLAEQLGRSREAKSIPRNDAQMETFRTMVR 243 ++ S G+ DY+++ +Y + ELK + +GR +E Q E T+++ Sbjct: 171 FENSVGRSNMKDYRMVSRYRTISVKYKELKEIVSCMGREKE---------QAEELDTLIK 221 Query: 244 E--PATVPEQV-----DGLQQSDDILRLLPPELATLGITELEYEFYRRLVEKQLLTYRLH 296 + P T+ V G+++ +D+ L+P E+A L E F+ + +QL + Sbjct: 222 QYIPETLSASVAHSDIHGVEEGNDLQALMPTEVALLAEFATEDLFFMKYAMRQLQLFSNR 281 Query: 297 GESWREKVIERPVVHKDYDEQPRGPFIVCVDTSGSMGGFNEQCAKAFCLALMRIALAENR 356 +S R+K + + Q +GP IV +DTSGSM G E AKA L + ++A ++R Sbjct: 282 SDSVRKK--QESQTKRREPRQIKGPMIVAIDTSGSMSGKAESIAKALLLEITQMAKKQHR 339 Query: 357 RCYIMLFSTEIVRYELSGPQGIEQAIRFLSQQFRGGTDLASCFRAIMERLQSREWFDADA 416 +C+++ FS + + ++ F+ F GGTD + + L + AD Sbjct: 340 KCFLLSFSVRAQALDTAHSGNWKKVREFMVSHFSGGTDGEEMLKTALHTLTQENYLMADV 399 Query: 417 VVISDFIAQRLPDDVTSKVKELQRVHQHRFHAVAMSAHGKPGIMRIFDHIWRFD 470 ++ISDF S++++ Q RF+ + + +G + D +WR D Sbjct: 400 LIISDFEFDFCCKPTESRIRKEQE-RGVRFYGLQI-GNGVNVYEELLDKVWRLD 451 >UniRef50_A6L4M8 Putative uncharacterized protein n=9 Tax=Bacteroides RepID=A6L4M8_BACV8 Length = 453 Score = 96.3 bits (238), Expect = 2e-18, Method: Compositional matrix adjust. Identities = 67/246 (27%), Positives = 122/246 (49%), Gaps = 10/246 (4%) Query: 196 YQLIVKYGEFLNEQPELKRLAEQLGRSREAKSIPRNDAQMETFRTMVREPATVPEQVDGL 255 Y+ ++ Y + P +++LA LG+ + + + ++ R + P + + G+ Sbjct: 189 YRRMLPYETVMKRNPAIRQLARLLGKKHRDQQKYDSLSGVDKKRLIRHSPHS---DITGV 245 Query: 256 QQSDDILRLLPPELATLGITELEYEFYRRLVEKQLLTYRLHGESWREKVIERPVVHKDYD 315 +D+ LLP E L L F R EK+L + ++ K E PV + Sbjct: 246 TLGNDLNSLLPVEYCYLADDALRAVFMERYAEKRLQLF-----DYQSKETE-PVKDDKHK 299 Query: 316 EQPRGPFIVCVDTSGSMGGFNEQCAKAFCLALMRIALAENRRCYIMLFSTEIVRYELSG- 374 +GP+I+CVDTSGSM G E +K+ LA+ ++ +R+CY++ FS E V + Sbjct: 300 VSGQGPYIICVDTSGSMQGNREILSKSAILAIAQLTEKTHRKCYVINFSDEAVSLLIEDL 359 Query: 375 PQGIEQAIRFLSQQFRGGTDLASCFRAIMERLQSREWFDADAVVISDFIAQRLPDDVTSK 434 + + + FL+++F GGTD+ R + ++ ++D V+ISDF L ++ + Sbjct: 360 GRDMPRLAEFLNKRFDGGTDIEPALREAAHIINGNDFRESDIVLISDFEMPPLSRNLMEQ 419 Query: 435 VKELQR 440 VK ++R Sbjct: 420 VKVIKR 425 >UniRef50_D0Z403 Putative uncharacterized protein n=1 Tax=Photobacterium damselae subsp. damselae CIP 102761 RepID=D0Z403_LISDA Length = 543 Score = 95.5 bits (236), Expect = 3e-18, Method: Compositional matrix adjust. Identities = 61/181 (33%), Positives = 96/181 (53%), Gaps = 25/181 (13%) Query: 251 QVDGLQQSDDILRLLPPELATLGITELEYEFYRRLVEKQLLTYRLHGESWREKVIERPVV 310 +V G +S DI R+LP +LA L +LEY FY +L+E L TY+L G Sbjct: 328 EVYGTHKSADISRVLPSDLALLENEDLEYLFYAKLLESNLSTYKLLGH------------ 375 Query: 311 HKDYD-----EQPRGPFIVCVDTSGSMGGFNEQCAKAFCLALMRIALAENRRCYIMLFST 365 H D++ E+ +GP + C+DTSGSM G A+A LA+ I E R Y++LF + Sbjct: 376 HIDFEKENDTEEDKGPIVTCLDTSGSMSGIPILKARALLLAIHSIITKEKRELYVLLFGS 435 Query: 366 EIVRYEL----SGPQGIEQAIRFLSQQFRGGTDLASCFRAIMERLQSREWFD-ADAVVIS 420 EL + G+ + F+ ++F GGTD + + + ++ +E F+ AD ++I+ Sbjct: 436 RGQVKELYLSETSSSGL---LPFICKEFSGGTDFETPLKRAINIIEHKEKFNKADILMIT 492 Query: 421 D 421 D Sbjct: 493 D 493 >UniRef50_A7V9J4 Putative uncharacterized protein n=7 Tax=Bacteroides RepID=A7V9J4_BACUN Length = 495 Score = 90.9 bits (224), Expect = 1e-16, Method: Compositional matrix adjust. Identities = 79/296 (26%), Positives = 140/296 (47%), Gaps = 33/296 (11%) Query: 184 WDMSAGQLKRGDYQLIVKYGEFLNEQPELKRLAEQLGRSREAKSIPRNDAQMETFRTMVR 243 W M +G D++ I K PE+ ++A ++GR A R ++ Sbjct: 213 WGMMSGLWNTVDFERIRKIVRIQRSCPEIVKVARKMGRM--ADDEGREQIRVAEGNVYKM 270 Query: 244 EPATVPEQVDGLQQSDDILRLLPPELATLGITELEYEFYRRLVEKQLLTYRLHGESWR-- 301 E ++ + + G+ +D+ LLP ELA +ELE F + + ++L T+R E + Sbjct: 271 EHSSKCD-ILGISTGNDLNALLPIELAHSADSELEDLFVYKYLTRKLQTFRYKSEIMQPA 329 Query: 302 EKVIERPVVHKDYDEQPRGPFIVCVDTSGSMGGFNEQCAKAFCLALMRIALAENRRCYIM 361 ++ +P +P+GP IVC+DTSGSM G E+ A + + L+ IA + R C+++ Sbjct: 330 RRIETKPA-------RPKGPMIVCLDTSGSMAGKPEKIAHSLLIKLLEIADRQRRNCFLI 382 Query: 362 LFSTEIVRYELSGPQGIEQAIRFLSQQFRGGTD----LASCFRAIMERLQSREWFDADAV 417 FS I ++ + + + F S+ G TD L + FR + E +E+ +AD + Sbjct: 383 AFSVSIQPIDVRKERA--RLLEFFSKTACGDTDATRMLEATFRLLKE---GKEYMNADVL 437 Query: 418 VISDFIAQRLPDDVTSKVKELQRVHQHRFHAVAMSAHGKPGIMR-----IFDHIWR 468 +SDF ++P + ++E++R + H + GI FD I+R Sbjct: 438 WVSDF---KIPHSSPAFMEEIRRCREAGTHFYGLQI----GITDNEWTPFFDRIYR 486 >UniRef50_A8IX54 Predicted protein n=1 Tax=Chlamydomonas reinhardtii RepID=A8IX54_CHLRE Length = 604 Score = 90.5 bits (223), Expect = 1e-16, Method: Compositional matrix adjust. Identities = 87/304 (28%), Positives = 135/304 (44%), Gaps = 28/304 (9%) Query: 180 AGRLWDMSAGQLKRGDYQLIVKYGEFLNEQPELKRLAEQLGRSREAKSIPRNDAQMETFR 239 G +D+ KR + + + L E EL+ L LGR + R Q F Sbjct: 188 GGDDFDLQGSIWKRAGWSQLDELRRKLEELKELRDLVRSLGRGGGWGPLRRAPVQ---FL 244 Query: 240 TMVREPATV-----PEQVDGLQQSDDILRLLPPELATLG----ITELEYEFYRRLVEKQL 290 + P + ++ GL +SDDI RLLP E A L + + + FY ++ EK L Sbjct: 245 DLNARPGLLRTVLEAQETRGLTRSDDISRLLPAEAALLARGRVVRQAKLLFYAKMAEKAL 304 Query: 291 LTYRLHGESWREKVI----ERPVVHKDYDEQPRGPFIVCVDTSGSMGGFNEQCAKAFCLA 346 TY G W E ER + D RGP ++CVDTSGSM G E AKA L Sbjct: 305 QTYERDG--WGEYPTQIEPERREIRPTAD---RGPILLCVDTSGSMRGARETVAKALALE 359 Query: 347 LMRIALAENRRCYIMLFS--TEIVRYELS-GPQGIEQAIRFLSQQFRGGTDLASCFRAIM 403 MR A + R C++ FS E+ EL+ + + FL + F GG+D + + Sbjct: 360 CMRAARQQERGCFVFAFSGPAEVREIELNMDAASVNNLLEFLEKMFNGGSDFNEPVKRCL 419 Query: 404 ERLQSREWFDADAVVISDFIAQRLPDDVTSKVKELQRVHQHRFHAVAMSAHGK----PGI 459 +RL +W ++D +++SD ++ + K+ + R H + + + K P + Sbjct: 420 DRLTDAKWANSDILLVSDGELRQPAPAIMRKLAGAKEALGLRVHGLVVGSPEKKRADPAV 479 Query: 460 MRIF 463 +R Sbjct: 480 LRAL 483 >UniRef50_D0LHL0 von Willebrand factor type A n=1 Tax=Haliangium ochraceum DSM 14365 RepID=D0LHL0_HALO1 Length = 509 Score = 89.0 bits (219), Expect = 4e-16, Method: Compositional matrix adjust. Identities = 86/284 (30%), Positives = 127/284 (44%), Gaps = 49/284 (17%) Query: 200 VKYGEFLNEQPELKRLAEQLGRSREAKSIPRNDAQMETFRTMVREPATVPEQVDGLQQSD 259 ++ GE L +LK LA+ +G RE R R +VR P + E G Sbjct: 240 LELGERLMRSRKLKLLAKLVGAFREVAFEARR-------RRVVRTPQVMHEVGRGAH--- 289 Query: 260 DILRLLPPELATLGI----TELEYEFYRRLVEKQLLTYRLHGESWREKVIERPVVHKDYD 315 + RLLP EL LG+ L EF RRLVE +LL Y L G S Sbjct: 290 -LDRLLPSEL--LGLPRHRGALHREFVRRLVEGELLEYELRGAS---------------- 330 Query: 316 EQPRGPFIVCVDTSGSMGGFNEQCAKAFCLALMRIALAENRRCYIMLFST--EIVRYELS 373 RGP +VCVD SGSM G E AKA L L IA E RRC ++FS+ + EL Sbjct: 331 --SRGPMVVCVDGSGSMQGTKEIWAKAVALTLTEIARRERRRCLAIVFSSGHALFEVELL 388 Query: 374 GPQG--------IEQAIRFLSQQF-RGGTDLASCFRAIMERLQSREWFDADAVVISDFIA 424 G +G ++ + ++ F GGTD R + + + D V I+D A Sbjct: 389 GAKGRSNVRAPMLDDNVLAFAEHFPGGGTDFEPPMRRALAAVSEGNYRRGDIVFITDGQA 448 Query: 425 QRLPDDVTSKVKELQRVHQHRFHA--VAMSAHGKPGIMRIFDHI 466 Q + +++ + + + ++ H+ R V ++ + ++R D + Sbjct: 449 Q-VSENLIADITKARKKHRFRVRGILVDVADSDRGSLLRFCDEV 491 >UniRef50_C0ZE04 Putative uncharacterized protein n=1 Tax=Brevibacillus brevis NBRC 100599 RepID=C0ZE04_BREBN Length = 572 Score = 87.8 bits (216), Expect = 8e-16, Method: Compositional matrix adjust. Identities = 77/289 (26%), Positives = 130/289 (44%), Gaps = 30/289 (10%) Query: 169 LEPILADNNTAAGRLWDMSAGQLKRGDYQLIVKYGEFLNEQPELKRLAEQLGRSREAKSI 228 +E + + A R W G+L+R ++ +K+ E L P+L ++GR + Sbjct: 267 VEEVFTASQRFANRSWGHELGKLRRQSFEQYLKWIEKLKRHPDLVAFLNEVGRQVHRFRV 326 Query: 229 PRNDAQMETFRTMVREPATVPEQVDGLQQSDDILRLLPPELATLGITELEYEFYRRLVEK 288 R + + + F PE+ L+QS DI +LP E L + E F + +E+ Sbjct: 327 KRKEIRSKHF----------PEEYYDLRQSGDIAHMLPGEAVLLADPDFENYFMLKWLEQ 376 Query: 289 QLLTYRLHGESWREKVIERPVVHKDYDEQPRGPFIVCVDTSGSMGGFNEQCAKAFCLALM 348 +L+TY G W +E P P+GP I +DTS SM G + A+ F + Sbjct: 377 KLMTYDTSG--W----VEEP---------PKGPVICMLDTSHSMRGSKLRLAQIFIMTFA 421 Query: 349 RIALAENRRCYIMLFST--EIVRYELSGPQGIEQAIRFLSQQ-FRGGTDLASCFRAIMER 405 +++ E R ++LF EI L + A L+Q F GGT + + +E Sbjct: 422 ALSMLEKRDFILLLFGAKGEIKEQPLYHKKPDWPAFYGLAQMAFGGGTHFDAPMKRAIEL 481 Query: 406 LQSRE-WFDADAVVISDFIAQRLPDDVTSKVKELQRVHQHRFHAVAMSA 453 ++ + W AD V+++D I P V K+ L + Q R H++ + + Sbjct: 482 VEKEQAWRGADFVMVTDGIGGISP-YVQEKLIFLGQHKQVRLHSLIVGS 529 >UniRef50_A0KNK1 Putative uncharacterized protein n=1 Tax=Aeromonas hydrophila subsp. hydrophila ATCC 7966 RepID=A0KNK1_AERHH Length = 552 Score = 84.3 bits (207), Expect = 9e-15, Method: Compositional matrix adjust. Identities = 86/325 (26%), Positives = 146/325 (44%), Gaps = 33/325 (10%) Query: 153 EQLLSEVQERMTLSGQLEPILADNNTAAGR----LWDMSAGQLKRGDYQL----IVKYGE 204 E++ E+++++T GQ IL D +T + L S D +L I + Sbjct: 247 EEIPDEMRKKLTGYGQ---ILGDIDTTYEKSKALLLACSGANFSYNDLKLCKDDIEPLAK 303 Query: 205 FLNEQPELKRLAEQLGRSREAKSIPRNDAQMETFRTMVREPATVPEQVDGLQQSDDILRL 264 L + +K L ++GR+ + E + R P +V G +S+D+ R+ Sbjct: 304 QLQQNHAIKELTYKMGRAYIS----------EEKKKQARIPHASKSEVHGTHRSEDLARV 353 Query: 265 LPPELATLGITELEYEFYRRLVEKQLLTYRLHGESWREKVIERPVVHKDYDEQPRGPFIV 324 LP EL L LE FY R +E+ L+TY L G + + +++ GP + Sbjct: 354 LPTELLNLEDEALETLFYARFLERNLMTYELQGTTCTSG------EQLELEQKRTGPVVA 407 Query: 325 CVDTSGSMGGFNEQCAKAFCLALMRIALAENRRCYIMLF--STEIVRYELSGPQGIEQAI 382 C+DTSGSM G A+A LA+ + E R +++LF + E+ Y + + Sbjct: 408 CLDTSGSMSGAPLLKARALLLAVSAVLQQEARSLHVVLFGDNGELREYAIHEENSASGLL 467 Query: 383 RFLSQQFRGGTDLASCFRAIMERLQ-SREWFDADAVVISDFIAQRLPDDVTSKVKELQRV 441 FL Q F GGTD + E ++ ++E+ AD ++ISD L DD ++ +++ Sbjct: 468 HFLRQGFGGGTDFETPLNRACEIIRDAKEYEKADILMISDGDCV-LSDDYIEHLQTRKKI 526 Query: 442 HQHRFHAVAMSAHGKPGIMRIFDHI 466 ++V HG+ R D + Sbjct: 527 LDCSIYSVL--CHGQRVADRFSDEV 549 >UniRef50_A2SLM6 Uncharacterized protein containing a von Willebrand factor type A (VWA) domain n=2 Tax=Burkholderiales Genera incertae sedis RepID=A2SLM6_METPP Length = 493 Score = 80.9 bits (198), Expect = 1e-13, Method: Compositional matrix adjust. Identities = 74/250 (29%), Positives = 112/250 (44%), Gaps = 31/250 (12%) Query: 248 VPEQVDGLQQSDDILRLLPPELATLGITELEYEFYRRLVEKQLLTYRLHGESWREKVI-- 305 P ++ G++ ++ R+LP E A L L + RL E +L+ W E+ + Sbjct: 221 APGEILGVRPGRNLARMLPSEAAQLRHPLLHKLWRARLAEARLMV-------WDEEAVLF 273 Query: 306 -ERP--------VVHKDYDEQPRGPFIVCVDTSGSMGGFNEQCAKAFCLALMRIALAENR 356 +RP RGP +VC+DTSGSM G EQ AKA L R A E R Sbjct: 274 DQRPGGATPLRAAAQAAPPPLARGPMLVCIDTSGSMRGAPEQLAKAVVLQAARTAHRERR 333 Query: 357 RCYIMLF--STEIVRYELS-GPQGIEQAIRFLSQQFRGGTDLASCFRAIMERLQSREWFD 413 C ++ F + E++ +EL+ P G++ + F+ Q F GGTDLA+ + + S W Sbjct: 334 ACQLIAFGGAGELLTHELALTPAGLDALLDFIGQAFDGGTDLAAPLAHAVAAVHSARWQQ 393 Query: 414 ADAVVISDFIAQRLPDDVTSKVKELQRVHQHRFHAVAMSAHGKPGIMRIFDHIWRFDTGM 473 AD +++SD P + QR H R V + G++ + D I Sbjct: 394 ADLLLVSDGEFGCTPATLALLDGARQR-HGLRVQGVLVGDRETMGLLEVCDAI------- 445 Query: 474 RSRLLRRWRR 483 +R WRR Sbjct: 446 --HWVRDWRR 453 >UniRef50_B1L0Y8 von Willebrand factor type A domain protein n=10 Tax=Clostridium RepID=B1L0Y8_CLOBM Length = 578 Score = 79.7 bits (195), Expect = 2e-13, Method: Compositional matrix adjust. Identities = 66/253 (26%), Positives = 116/253 (45%), Gaps = 38/253 (15%) Query: 212 LKRLAEQLGRSREAKSIPRNDAQMETFRTMVREPATVPEQVDGLQQSDDILRLLPPELAT 271 LK L++ +GR +E+ + R ++ A + ++ +DI+ LP E Sbjct: 322 LKELSDIIGRFKES--------ALRDQRNKHKDGAVA---IKSVRIGNDIIHTLPSEKML 370 Query: 272 LGITELEYEFYRRLVEKQLLTYRLHGESWREKVIERPVVHKDYDEQPRGPFIVCVDTSGS 331 L + EFYR+ +KQLL Y L + + +GP ++C+D S S Sbjct: 371 LINETTKKEFYRKFNQKQLLQYELESDKLK----------------AKGPMVICIDMSSS 414 Query: 332 MGGFNEQCAKAFCLALMRIALAENRRCYIMLFSTE-----IVRYELSGPQGI-EQAIRFL 385 M G E+ +KA +AL+ IA + R +LF+ + I+ + P+ I + A RF Sbjct: 415 MKGIKEKWSKAVAIALLEIAQQQKRNFAAILFNEDATEPIIIEKDKKEPEKILDIAERFD 474 Query: 386 SQQFRGGTDLASCFRAIMERLQSREWFDADAVVISDFIAQRLPDDVTSKVKELQRVHQHR 445 GGT + + +E ++ ++ AD V I+D + P D +K +L+ + + Sbjct: 475 G----GGTLFETPLQKALEVIEQSKFKKADIVFITDGHSYTHP-DFINKFNKLKDEKEFK 529 Query: 446 FHAVAMSAHGKPG 458 +V + A GK G Sbjct: 530 VLSVLIYAGGKIG 542 >UniRef50_A8IQV8 Predicted protein (Fragment) n=1 Tax=Chlamydomonas reinhardtii RepID=A8IQV8_CHLRE Length = 411 Score = 79.3 bits (194), Expect = 3e-13, Method: Compositional matrix adjust. Identities = 43/105 (40%), Positives = 62/105 (59%), Gaps = 3/105 (2%) Query: 320 GPFIVCVDTSGSMGGFNEQCAKAFCLALMRIALAENRRCYIMLFS--TEIVRYELS-GPQ 376 GP I+C+DTSGSM G E AKA L +R A + R+CY+ FS E+ +LS Sbjct: 210 GPIILCLDTSGSMRGARETVAKALALECLRGAHRQRRQCYLYAFSGPNEVQELQLSVDVD 269 Query: 377 GIEQAIRFLSQQFRGGTDLASCFRAIMERLQSREWFDADAVVISD 421 ++Q + FLS F GGTD+ + + +ERL EW AD ++++D Sbjct: 270 SLDQLLAFLSCSFMGGTDVDAPLKLSLERLAKAEWAQADILMVTD 314 >UniRef50_Q58221 Uncharacterized protein MJ0811 n=4 Tax=Methanocaldococcus RepID=Y811_METJA Length = 439 Score = 77.4 bits (189), Expect = 1e-12, Method: Compositional matrix adjust. Identities = 56/170 (32%), Positives = 84/170 (49%), Gaps = 21/170 (12%) Query: 260 DILRLLPPELATLGITELEYEFYRRLVEKQLLTYRLHGESWREKVIERPVVHKDYDEQPR 319 D+ LLP E+ L L Y+F RR V+K+LL Y + + E+ + Sbjct: 228 DLKHLLPKEIVNLSDEILYYDFLRRFVDKKLLIYDIQNKL----------------EKQK 271 Query: 320 GPFIVCVDTSGSMGGFNEQCAKAFCLALMRIALAENRRCYIMLFSTEIVRYELS-GPQGI 378 GP I+ +D SGSM G E KA L+++ IA ENR Y + F + VR+E P+ I Sbjct: 272 GPIIILLDHSGSMYGDREIWGKAVALSIIEIAKRENRDIYYIAFD-DGVRFEKKINPKTI 330 Query: 379 --EQAIRFLSQQFRGGTDLASCFRAIMERLQSREWF-DADAVVISDFIAQ 425 ++ I S F GGT+ M ++ E F +AD ++I+D A+ Sbjct: 331 TFDEIIEIASLYFGGGTNFIMPLNRAMSIIKEHETFKNADILLITDGYAE 380 >UniRef50_B7DQJ9 von Willebrand factor type A n=1 Tax=Alicyclobacillus acidocaldarius LAA1 RepID=B7DQJ9_9BACL Length = 484 Score = 74.7 bits (182), Expect = 8e-12, Method: Compositional matrix adjust. Identities = 51/177 (28%), Positives = 79/177 (44%), Gaps = 20/177 (11%) Query: 249 PEQVDGLQQSDDILRLLPPELATLGITELEYEFYRRLVEKQLLTYRLHGESWREKVIERP 308 P ++ + DD+ +LP EL L E EF +R E++LL Y L G Sbjct: 261 PTEIVNITMGDDLANVLPSELLLLADPATEDEFIQRFAERRLLQYDLRG----------- 309 Query: 309 VVHKDYDEQPRGPFIVCVDTSGSMGGFNEQCAKAFCLALMRIALAENRRCYIMLFST--E 366 ++ + +GP +VC+D SGS G E K LAL+ IA E R ++ F++ E Sbjct: 310 -----FEREGQGPIVVCIDESGSTAGMVEMWEKGIALALLAIARREKRAFAVVHFASAHE 364 Query: 367 IVRYELSGPQGIE--QAIRFLSQQFRGGTDLASCFRAIMERLQSREWFDADAVVISD 421 I + P+ + ++ F GGTD S R + + + D V I+D Sbjct: 365 IFVQKWLRPKDASPTELVQMAQHFFNGGTDFESPLREAVRIMDEAAFQKGDIVFITD 421 >UniRef50_C6J3I5 Putative uncharacterized protein n=1 Tax=Paenibacillus sp. oral taxon 786 str. D14 RepID=C6J3I5_9BACL Length = 409 Score = 70.9 bits (172), Expect = 1e-10, Method: Compositional matrix adjust. Identities = 60/211 (28%), Positives = 93/211 (44%), Gaps = 22/211 (10%) Query: 215 LAEQLGRSREAKSIPRNDAQMETFRTMV-REPATVPEQVDGLQQSDDILRLLPPELATLG 273 LAE+L ++ K I + +M+ R +G++Q + I +LLP EL T Sbjct: 156 LAERLSHDKKMKDIAKWAGRMKVIANQKQRSKHKDAINRNGIKQGNSIEQLLPMELGTYA 215 Query: 274 ITELEYEFYRRLVEKQLLTYRLHGESWREKVIERPVVHKDYDEQPRGPFIVCVDTSGSMG 333 + +F RR VE Q L Y G ++ +GP I+C+D SGSM Sbjct: 216 SPITKMDFLRRYVEGQTLQYDTKGP----------------EQLGKGPIILCLDQSGSMS 259 Query: 334 GFNEQCAKAFCLALMRIALAENRRCYIMLFSTEIVR---YELSGPQGIEQAIRFLSQQFR 390 G + +K F LALM IA + R + FS+ YE G ++ I+ + Sbjct: 260 G-QDTISKGFALALMSIARKQRRDFAWIPFSSHAAAPLIYE-RGTIVVQDMIQLATIFLG 317 Query: 391 GGTDLASCFRAIMERLQSREWFDADAVVISD 421 GGT RA + ++ + AD V ++D Sbjct: 318 GGTSFEPPLRAASQVIEQSRFNQADIVFVTD 348 >UniRef50_A8ZLC2 Putative uncharacterized protein n=3 Tax=Acaryochloris marina MBIC11017 RepID=A8ZLC2_ACAM1 Length = 483 Score = 70.1 bits (170), Expect = 2e-10, Method: Compositional matrix adjust. Identities = 69/255 (27%), Positives = 114/255 (44%), Gaps = 40/255 (15%) Query: 179 AAGRLWDMSAGQLKRGDYQLIVKYGEFLNEQPELKRLAEQLGRSREAKSIPRNDAQMET- 237 A G W +G +K + ++P+LK++ G + E + R Q ET Sbjct: 198 ALGMSWGNESGDRNPTPTGEKLKLAALIEQRPQLKKILALAGNALETANRKRRQHQTETG 257 Query: 238 FRTMVREPATVPEQVDGLQQSDDILRLLPPELATLGITELEYEFYRRLVEKQLLTYRLHG 297 + +V G+ +D+ ++LP ELA L + + FYR +E QL L Sbjct: 258 YGELV-----------GITTGNDVSQILPQELARLSDSRQKLSFYRDFLEGQLFQNDLQA 306 Query: 298 ESWREKVIERPVVHKDYDEQPRGPFIVCVDTSGSMGGFNEQC-AKAFCLALMRIALAENR 356 +E+ +GP ++C+D SGSM N +KA +AL+++A + R Sbjct: 307 P----------------EEKGKGPMVICLDCSGSMVKGNRFLWSKALIVALVKLANEQER 350 Query: 357 RCYIMLFSTEIVRYE---LSGPQGIEQAIRFL-SQQFRGGTDLASCF---RAIMERLQSR 409 ++LF E + Y+ + I+Q IR L + GGT+ R I+E Q Sbjct: 351 VVSLVLF--ESICYDPIYFHPREDIDQLIRLLVTSPTDGGTEFQRPLEQARDIIE--QDE 406 Query: 410 EWFDADAVVISDFIA 424 ++ +AD V I+D IA Sbjct: 407 DYSEADIVFITDGIA 421 >UniRef50_B8C4H1 Predicted protein n=1 Tax=Thalassiosira pseudonana RepID=B8C4H1_THAPS Length = 1141 Score = 68.2 bits (165), Expect = 7e-10, Method: Compositional matrix adjust. Identities = 60/208 (28%), Positives = 86/208 (41%), Gaps = 34/208 (16%) Query: 206 LNEQPELKRLAEQLGRSREAKSI------PRNDAQMETFRTMVREPATVPEQVDGLQQSD 259 L+ P+LK L +LG+ AK PR + V P V GL +S Sbjct: 238 LSMMPKLKDLLARLGQRPSAKGKDVRKFRPRKRSNSRDDMMGVEIDPLDPTSVSGLTRSG 297 Query: 260 DILRLLPPELATL--GITELEYEFYRRLVEKQLLTYRLHGESWREKVIERPVVHKDYDEQ 317 + +LP E L + L + F + E +LL + P Sbjct: 298 SLTTMLPSEAVLLRSSMKSLRWLFLAKKAESKLL-------------VSLPSASG----- 339 Query: 318 PRGPFIVCVDTSGSMGGFNEQCAKAFCLALMRIALAENRRCYIMLFSTEIVRYELSG--- 374 GP I+C+DTS SM G E AKA LA + A ++ R C ++ FS+ E SG Sbjct: 340 --GPLIICLDTSWSMSGARESLAKAVVLASVSAANSQGRECRVVSFSSANNAVE-SGSIK 396 Query: 375 --PQGIEQAIRFLSQQFRGGTDLASCFR 400 G+ + + FLS F GGTD+ + Sbjct: 397 CDSDGVRKLLDFLSYSFGGGTDVTGALK 424 >UniRef50_C8SB00 von Willebrand factor type A n=1 Tax=Ferroglobus placidus DSM 10642 RepID=C8SB00_FERPL Length = 469 Score = 67.8 bits (164), Expect = 9e-10, Method: Compositional matrix adjust. Identities = 106/449 (23%), Positives = 188/449 (41%), Gaps = 72/449 (16%) Query: 52 RWREALRSRLKDA--RVPPELTEEVMCYQQS--QLLS----TPQFIVQLPQILDLLHRLN 103 R E ++ ++K+A + PP +T+ + + Q L P+F V + +++ Sbjct: 32 RIVEKVKDKVKEAIPQFPPLVTDTFNIFHKPDPQFLDDSQIAPEFRVNKRVLEKIMNTDT 91 Query: 104 SPWAEQARQLVDANSTITSALHTLFLQ---RWRLSLIVQATTLNQQLLE-------EERE 153 ++ L D NS I +A+ T L + +L I + T QQL E+ + Sbjct: 92 FSELKETTTLDDVNSAIATAILTERLYEELKSKLGEIKEHTEKIQQLRNQLPGKSGEDVK 151 Query: 154 QLLSEVQERMTLSGQLEPILADN-----------------NTAAGRLWDMSAGQLKRGDY 196 Q L +++E S L+ I+ N + G+ + D Sbjct: 152 QALQQIEEH---SRALQGIVTQGAVSVAVRKAQEEFEKVQNAMVALGFGNEPGKPVQVDP 208 Query: 197 QLIVKYGEFLNEQPELKRLAEQLGRSREAKSIPRNDAQMETFRTMVREPATVPEQVDGLQ 256 + +K L LK++ E LG+ R ++ ++ A+ + ++M+ ++ + Sbjct: 209 ETAIKLASELKSNERLKKMVELLGKMR---NLLKSTAKAKPRKSML--------ELHSIT 257 Query: 257 QSDDILRLLPPELATLGITELEYEFYRRLVEKQLLTYRLHGESWREKVIERPVVHKDYDE 316 +I RLLP E+ L + + F R E +LL Y L K ++ Sbjct: 258 SGREIERLLPSEI--LKLRKYRVVFLRDYYEGRLLHYDL----------------KRREK 299 Query: 317 QPRGPFIVCVDTSGSMGGFNEQCAKAFCLALMRIALAENRRCYIMLFSTEIVRYELSGPQ 376 + +GP ++ +D SGSM G EQ AKA LA + IA+ E R I+ F I ++ Q Sbjct: 300 ESKGPIVIALDLSGSMSGAKEQWAKAVSLATIDIAVKERRPWAIIAFDAGIKDVKVFRKQ 359 Query: 377 -GIEQAIRFLSQQFRGGTDLASCFRAIMERLQS-REWFDADAVVISDFIAQRLPDDVTSK 434 E + + GGT+ + M+ ++ RE+ AD + ISD ++ + + Sbjct: 360 PKPEDVLGIMRIGASGGTNFEKPLKEAMKIVEDCREFTKADILFISDGDC-KVGWEFLEE 418 Query: 435 VKELQRVHQHRFHAVAMSAHGKPGIMRIF 463 +R R V +S G P IMR+F Sbjct: 419 FTRFKRRRNVRVTGVLIS--GIPRIMRMF 445 >UniRef50_D1XLN5 von Willebrand factor type A n=12 Tax=Actinomycetales RepID=D1XLN5_9ACTO Length = 538 Score = 67.4 bits (163), Expect = 1e-09, Method: Compositional matrix adjust. Identities = 74/269 (27%), Positives = 105/269 (39%), Gaps = 42/269 (15%) Query: 182 RLWDMSAGQLKRGDYQLIVKYGEFLNEQPELKRLAEQLGRSREAKSIPRNDAQMETFRTM 241 R W ++ G+L+R + + E L L R AE +GR R+ R Sbjct: 246 RAWGVAPGELERMPFDERARLAERLR-TGRLARWAELIGRFRQMADGER----------- 293 Query: 242 VREPATVPEQVDGLQQSDDILRLLPPELATLGITELEYEFYRRLVEKQLLTYRLHGESWR 301 R ++ G+ DD+ R++P ELA LG+ L F R +L+ Y GE Sbjct: 294 ARRVENATGELVGVTLGDDLSRVIPSELANLGLPGLRAVFAARYAAGELMLYDTQGEQTT 353 Query: 302 EKVIERPVVHKDYDEQPRGPFIVCVDTSGSM-----GGFN-EQCAKAFCLALMRIALAEN 355 K G + CVDTS SM GG E AKA LAL+ A Sbjct: 354 GK----------------GAVVACVDTSHSMYEAGPGGVTREAWAKACALALLDQARHGG 397 Query: 356 RRCYIMLFST----EIVRYELSGPQGIEQAIRFLSQQFRGGTDLASCFRAIMERLQSREW 411 R +LFS ++ R+ P + + F GGT + A + L+ E Sbjct: 398 RDFVGILFSAADKLQVFRFPAGRPADTARVLDFAETFLGGGTSYQTPLTAAADLLE--EE 455 Query: 412 FDADAVVISDFIAQRLPDDVTSKVKELQR 440 FDA A D + + DD +E R Sbjct: 456 FDATARTRGDIVM--ITDDECGVTEEWMR 482 >UniRef50_Q2IEM5 VWA containing CoxE-like n=2 Tax=Anaeromyxobacter dehalogenans RepID=Q2IEM5_ANADE Length = 430 Score = 58.9 bits (141), Expect = 4e-07, Method: Compositional matrix adjust. Identities = 65/220 (29%), Positives = 99/220 (45%), Gaps = 34/220 (15%) Query: 206 LNEQPELKRLAEQLGRSREAKSIPRNDAQMETFRTMVREPATVPEQVDGLQQSDDILRLL 265 L L+R+A GR + + R R V+ A ++V ++Q D+ R L Sbjct: 180 LKGDERLRRIAALAGRFKRIAAAKR--------RHRVKHGA---DEVTDVEQGADLGRAL 228 Query: 266 PPELATLGITELEYEFYRRLVEKQLLTYRLHGESWREKVIERPVVHKDYDEQPRGPFIVC 325 P ELA L L +F R L+E + L YRL G + K GP +V Sbjct: 229 PVELAKLSHRLLRLDFLRALLEGRSLQYRLEGTATLGK----------------GPLVVL 272 Query: 326 VDTSGSMGGFNEQCAKAFCLALMRIALAENRRCYIMLFSTEIVRYE--LSGPQGIEQAIR 383 +D SGSM G + A A LAL+ A E RR + +L V++E + + + + Sbjct: 273 LDKSGSMDGPRDVWATAVALALLDQAQRE-RRTFALLGFDARVKFEAVVKPSEALPEDGL 331 Query: 384 FLSQQFRGGTDLASCFRAIME--RLQSREWFDADAVVISD 421 F+S GGT++A+ R +E R AD V+++D Sbjct: 332 FVSCC--GGTEIAAAVRRGLEIIRTHPGALGKADLVLVTD 369 >UniRef50_Q9YD81 Putative uncharacterized protein n=1 Tax=Aeropyrum pernix RepID=Q9YD81_AERPE Length = 463 Score = 56.6 bits (135), Expect = 2e-06, Method: Compositional matrix adjust. Identities = 65/220 (29%), Positives = 97/220 (44%), Gaps = 40/220 (18%) Query: 253 DGLQQSDDILRLLPPELATLGITELEYEFYRRLVEKQLLTYRLHGESWREKVIERPVVHK 312 DGL+ D+ R+ +L I EY F+ +LL YR KV++ Sbjct: 252 DGLEYGSDLERIHYSQL----ILPDEY-FWASFSSSKLLLYR--------KVLD------ 292 Query: 313 DYDEQPRGPFIVCVDTSGSMGGFNEQCAKAFCLALMRIALAENRRCYIMLFSTEIVRYEL 372 RGP V +D SGSM G A+A +AL R +LAENRR F + V Y Sbjct: 293 ----SSRGPIYVLLDKSGSMVGAKIDWARAVAVALFRRSLAENRRFSARFFDS--VTYPA 346 Query: 373 ------SGPQGIEQAIRFLSQ-QFRGGTDLASCFRAI---MERLQSREWFDADAVVISDF 422 S P+ + +++L+ + GGTD+ + + + R E +D V+I+D Sbjct: 347 IHLRPRSKPRDFLELVKYLAAVKAGGGTDITAAIKTAADDISRTPRGEQRISDIVLITDG 406 Query: 423 IAQRLPDDVTSKVKELQRVHQHRFHAVAMSAHGKPGIMRI 462 RL DV V++ + R H V + H P + RI Sbjct: 407 -EDRLNIDV---VEDSLKRSDARLHTVIIQGH-NPYLKRI 441 >UniRef50_D1PED6 Putative uncharacterized protein n=1 Tax=Prevotella copri DSM 18205 RepID=D1PED6_9BACT Length = 549 Score = 56.6 bits (135), Expect = 2e-06, Method: Compositional matrix adjust. Identities = 43/159 (27%), Positives = 70/159 (44%), Gaps = 10/159 (6%) Query: 207 NEQPELKRLAEQLGRSREAKSIPRNDAQMETFRTMVREPATVPEQVDGLQQSDDILRLLP 266 ++ PE+K + ++GR +A R M + ++G+ DD+ LLP Sbjct: 249 DKYPEIKEIVAKMGRVADANGKDRLTIASGVEMKMEHSAGS---DIEGITVGDDLNSLLP 305 Query: 267 PELATLGITELEYEFYRRLVEKQLLTYRLHGESWREKVIERPVVHKDYDEQPR-GPFIVC 325 ELA ++E F + ++L T+R E + +P + R GP IVC Sbjct: 306 LELAQYSDEDMEGLFIYKYRTRRLQTFRYKSE------MSKPSRKLGFTHASRKGPMIVC 359 Query: 326 VDTSGSMGGFNEQCAKAFCLALMRIALAENRRCYIMLFS 364 +DTS SM G E+ + L A R C+++ FS Sbjct: 360 LDTSASMYGTPERISSTLISLLEETAEDLERDCFLIDFS 398 >UniRef50_Q60384 Uncharacterized protein MJ0077 n=3 Tax=Methanocaldococcus RepID=Y077_METJA Length = 382 Score = 56.2 bits (134), Expect = 2e-06, Method: Compositional matrix adjust. Identities = 42/143 (29%), Positives = 71/143 (49%), Gaps = 15/143 (10%) Query: 320 GPFIVCVDTSGSMGGFNEQCAKAFCLALMRIALAENRRCYIMLFST---EIVRYELSGPQ 376 G F+VC+D SGSM G E AKA L LM I+L N+R +LF +I YE Sbjct: 224 GDFVVCLDLSGSMRGNKEIWAKAIALCLMDISLKRNKRYISILFDDGVRDIKIYE--KKV 281 Query: 377 GIEQAIRFLSQQFRGGTDLASCFRAIMERLQSREWFDADAVVISDFIAQRLPDDVTSKVK 436 ++ + F S + GGT+ R ++ F+ D V I+D + + + K+K Sbjct: 282 SFDEILEFASVFYGGGTNFEKPLREALK-------FNGDIVFITDGECE-VSLEFLEKIK 333 Query: 437 ELQRVHQHRFHAVAMSAHGKPGI 459 E ++ + + +++ ++ KP + Sbjct: 334 EEKQRRKIKIYSICINT--KPTV 354 >UniRef50_D0LUP3 Uncharacterized protein containing a von Willebrand factor type A (VWA) domain-like protein n=1 Tax=Haliangium ochraceum DSM 14365 RepID=D0LUP3_HALO1 Length = 536 Score = 53.1 bits (126), Expect = 2e-05, Method: Compositional matrix adjust. Identities = 60/241 (24%), Positives = 91/241 (37%), Gaps = 29/241 (12%) Query: 184 WDMSAGQLKRGDYQLIVKYGEFLNEQPELKRLAEQLGRSREAKSIPRNDAQMETFRTMVR 243 W G R + + + L E P ++++ E GR E R+ Sbjct: 251 WGREPGDFGRLPIEEFQRLSQVLRETPSVRKIVELAGRWSELLKPRLKRGHSPRGRS--- 307 Query: 244 EPATVPEQVDGLQQSDDILRLLPPELATLGITELEYEFYRRLVEKQLLTYRLHGESWREK 303 ++ G+ + RL EL L L +L E++ L + L G Sbjct: 308 -------ELVGVTLGGGLERLCATELIKLRHPALRRVLLGQLAERRALVHELRGP----- 355 Query: 304 VIERPVVHKDYDEQPRGPFIVCVDTSGSMGGFNEQCAKAFCLALMRIALAENRRCYIMLF 363 D RGP I+ VDTSGSM G AK+ LAL + R ++ F Sbjct: 356 -----------DVLGRGPMILVVDTSGSMHGARMTMAKSLMLALALHCWEQRRPLRVLTF 404 Query: 364 ST--EIVRYELSGPQGIEQAI-RFLSQQFRGGTDLASCFRAIMERLQSREWFDADAVVIS 420 E+ E++ + + + LS F GGTD + E + R W ADAV ++ Sbjct: 405 GAPGEMHESEVAVDEPFWTRLEQCLSVAFGGGTDFDGPLLRVCEIVGERPWRRADAVFLT 464 Query: 421 D 421 D Sbjct: 465 D 465 >UniRef50_C9RHJ4 von Willebrand factor type A n=1 Tax=Methanocaldococcus vulcanius M7 RepID=C9RHJ4_METVM Length = 383 Score = 52.4 bits (124), Expect = 4e-05, Method: Compositional matrix adjust. Identities = 42/165 (25%), Positives = 70/165 (42%), Gaps = 25/165 (15%) Query: 258 SDDILRLLPPELATLGITELEYEFYRRLVEKQLLTYRLHGESWREKVIERPVVHKDYDEQ 317 + + LL E L + RR E +LL Y K+++ H Sbjct: 180 GNSLTNLLSCEYKNFTDEMLFVDLLRRYNENKLLNY---------KILDNIKNH------ 224 Query: 318 PRGPFIVCVDTSGSMGGFNEQCAKAFCLALMRIALAENRRCYIMLFSTEIVRYELSGPQ- 376 G F++C+D SGSM G E AKA L L+ +L +RC +++F + ++ Sbjct: 225 --GDFVICLDLSGSMRGNKEIWAKAVSLCLIEASLKRGKRCVVIIFDDGVRETKIFEKNI 282 Query: 377 GIEQAIRFLSQQFRGGTDLASCFRAIMERLQSREWFDADAVVISD 421 ++ + F S + GGT+ R ++ F+ D V I+D Sbjct: 283 HFKEVLDFASVFYGGGTNFEKPLREALK-------FNGDVVFITD 320 >UniRef50_A2BM85 Conserved archaeal protein n=1 Tax=Hyperthermus butylicus DSM 5456 RepID=A2BM85_HYPBU Length = 439 Score = 50.4 bits (119), Expect = 2e-04, Method: Compositional matrix adjust. Identities = 70/249 (28%), Positives = 110/249 (44%), Gaps = 46/249 (18%) Query: 226 KSIPRNDAQMETFRTMVREPATVPEQVDGLQQSDDILRLLPPELATLGITELEYEFYRRL 285 K+I +A + T + +R P ++DG + DI R++ ELA T+L F + Sbjct: 206 KTIESTEAYIRTRK--IRSPRG---ELDGYELGSDIERVVASELAL--PTDL---FLLKF 255 Query: 286 VEKQLLTYRLHGESWREKVIERPVVHKDYDEQPRGPFIVCVDTSGSMGGFNEQCAKAFCL 345 E+ LL Y+ VV ++Y G F V +D SGSM G AKA L Sbjct: 256 AERNLLLYK-------------KVVSEEY-----GKFYVLLDKSGSMMGMKIIWAKAVAL 297 Query: 346 ALMRIALAENRRCYIMLFSTEIVRYELSGPQGIE--QAIRFLSQQFR----GGTDLA--- 396 AL + A+ E R YI F + I L P+ + ++ L R GGTD+ Sbjct: 298 ALAQRAIREKREFYIRFFDS-IPYPPLYIPKRVHGRDVVKLLEYVARIRANGGTDITRAI 356 Query: 397 -SCFRAIMERLQSREWFDADAVVISDFIAQRLPDDVTSKVKELQRVHQHRFHAVAMSAHG 455 + I +LQ + +D ++I+D + D + + L +V+ R H V +S + Sbjct: 357 LTAVDDIATKLQRSKV--SDIILITDGEDKIAIDTIR---RSLNKVNA-RLHTVMISGNN 410 Query: 456 KPGIMRIFD 464 P + I D Sbjct: 411 -PDLRAISD 418 >UniRef50_A7VY69 Putative uncharacterized protein n=1 Tax=Clostridium leptum DSM 753 RepID=A7VY69_9CLOT Length = 515 Score = 50.1 bits (118), Expect = 2e-04, Method: Compositional matrix adjust. Identities = 63/223 (28%), Positives = 100/223 (44%), Gaps = 38/223 (17%) Query: 205 FLNEQPELKRLAEQLGRSREAKSIPRNDAQMETFRTMVREPATVPEQVDGLQQSDDILRL 264 ++ +L+ +A LGR RE + R ++ E+ D L +DI Sbjct: 258 YVKNSKQLQEIARLLGRYRELIADKRKNSY----------SYGRGEKYD-LTTGNDITNC 306 Query: 265 LPPELATLGITELEYEFYRRLVEKQLLTYRLHGESWREKVIERPVVHKDYDEQPRGPFIV 324 L ELA LG+ E E F RR +K+L+ YR +R V K RG IV Sbjct: 307 LSSELALLGMAETEILFMRRYEQKRLMQYR-----------KRTAVVK-----GRGDMIV 350 Query: 325 CVDTSG---SMGGFNEQCAKAFCLALMRIALAENRRCYIMLF-STEIVRYELSGPQGI-- 378 +D SG S+ G+ + A AL+ IA + R+ ++ F S + +R +L P Sbjct: 351 LIDESGSTRSVAGWAKALAL----ALLDIASRDGRKFAMVHFASADRIRTDLFEPGHYTP 406 Query: 379 EQAIRFLSQQFRGGTDLASCFRAIMERLQSREWFDADAVVISD 421 E ++ Q F GGT+ + + + RL + +AD +I+D Sbjct: 407 EDVMKAAEQFFGGGTNFEAPLKEAL-RLMENGYENADITIITD 448 >UniRef50_A7KV72 Putative metalloprotein chaperonin subunit n=1 Tax=Bacillus phage 0305phi8-36 RepID=A7KV72_9CAUD Length = 553 Score = 49.3 bits (116), Expect = 3e-04, Method: Compositional matrix adjust. Identities = 43/171 (25%), Positives = 68/171 (39%), Gaps = 20/171 (11%) Query: 255 LQQSDDILRLLPPELATLGITELEYEFYRRLVEKQLLTYRLHGESWREKVIERPVVHKDY 314 ++ +D+ R+ P L L +F + EKQL Y+ G +KV Sbjct: 315 IETGNDLSRVTPTSLMKLASPATRNQFMKEFSEKQLQLYKKDG---IKKV---------- 361 Query: 315 DEQPRGPFIVCVDTSGSMGGFNEQCAKAFCLALMRIALAENRRCYIMLFSTEIVRYELSG 374 RGP I+ D SGSM G + + A LA++ +A E R + + +IV + Sbjct: 362 ---GRGPIIIDHDKSGSMRGNKDDWSTALTLAMLEVAQKEKRNFGYIPYQHQIVASHVKN 418 Query: 375 -PQG---IEQAIRFLSQQFRGGTDLASCFRAIMERLQSREWFDADAVVISD 421 P G + + GGT + L+S + D V I+D Sbjct: 419 IPAGELDPDDIMDIAELDSSGGTTFMPVLDESIRCLESDRYKKGDIVFITD 469 >UniRef50_D2RGP5 von Willebrand factor type A n=1 Tax=Archaeoglobus profundus DSM 5631 RepID=D2RGP5_ARCPR Length = 430 Score = 48.9 bits (115), Expect = 5e-04, Method: Compositional matrix adjust. Identities = 35/120 (29%), Positives = 61/120 (50%), Gaps = 7/120 (5%) Query: 320 GPFIVCVDTSGSMGGFNEQCAKAFCLALMRIALAENRRCYIMLFSTEIVRYELSGPQGIE 379 G + + VD SGSM G A++ LA+ R+A + RR ++ F + + LS P + Sbjct: 272 GAYYILVDKSGSMVGEKTVWARSVALAIYRMASLKRRRYFLRFFDKK-THHLLSDPHEVV 330 Query: 380 QAIRFLSQQFRGGTDLASCFRAIMERLQSREWFD--ADAVVISDFIAQRLPDDVTSKVKE 437 AI L + GGTD+ + R ++ L R D V+I+D + + +D++ +K+ Sbjct: 331 DAI--LKVKSNGGTDITNALRTAVKDLVERGLSDLTNTIVIITD--GEDVVEDLSKDLKK 386 >UniRef50_A3DPE5 von Willebrand factor, type A n=2 Tax=Desulfurococcaceae RepID=A3DPE5_STAMF Length = 443 Score = 47.4 bits (111), Expect = 0.001, Method: Compositional matrix adjust. Identities = 49/178 (27%), Positives = 81/178 (45%), Gaps = 37/178 (20%) Query: 254 GLQQSDDILRLLPPELATLGITELEYEFYRRLVEKQLLTYRLHGESWREKVIERPVVHKD 313 G + DI R++P LA EL FY R +E +LL Y +K++ Sbjct: 236 GYELGKDIERIVPSALALP--DEL---FYLRFLENRLLLY--------QKMLS------- 275 Query: 314 YDEQPRGPFIVCVDTSGSMGGFNEQCAKAFCLALMRIALAENRRCYIMLFSTEIVRYELS 373 Q +GP V +D SGSM G AKA L+L A+ E+R Y F + + Y L+ Sbjct: 276 ---QGKGPLYVLLDKSGSMDGIKMTWAKAVALSLYMRAVREHREFYFRFFDS--IPYPLA 330 Query: 374 G------PQGIEQAIRFLSQ-QFRGGTDLASCFRAIMERLQS---REWFDADAVVISD 421 + + I ++++ + GGTD++ +++ RE +D ++I+D Sbjct: 331 KISRRPRASNVLKLIDYIARVRGSGGTDISKAIITACNDIRTGSVRE--TSDIIIITD 386 >UniRef50_C3NM85 von Willebrand factor type A n=14 Tax=Sulfolobaceae RepID=C3NM85_SULIN Length = 452 Score = 44.7 bits (104), Expect = 0.007, Method: Compositional matrix adjust. Identities = 46/161 (28%), Positives = 71/161 (44%), Gaps = 32/161 (19%) Query: 254 GLQQSDDILRLLPPELATLGITELEYEFYRRLVEKQLLTYRLHGESWREKVIERPVVHKD 313 G ++ DI R+L ELA + FY +L E QLL Y+ + RE + Sbjct: 245 GYEEGSDIERILYSELALPDML-----FYLKLAEGQLLLYQ---KQIRETL--------- 287 Query: 314 YDEQPRGPFIVCVDTSGSMGGFNEQCAKAFCLALMRIALAENRRCYIMLFSTEIVRYEL- 372 GP + +D SGSM G AKA LAL A ENR Y+ F + Y L Sbjct: 288 ------GPIYLLLDKSGSMDGEKILWAKAVALALYSRAKRENRDFYLRFFDN--IPYPLI 339 Query: 373 -----SGPQGIEQAIRFLSQ-QFRGGTDLASCFRAIMERLQ 407 + + I + + ++ + + GGTD++ + E ++ Sbjct: 340 KVQKNAKSKDIIKMVEYIGKIRGGGGTDISRSIISACEDIK 380 >UniRef50_Q2RZG0 Putative uncharacterized protein n=5 Tax=Bacteroidetes/Chlorobi group RepID=Q2RZG0_SALRD Length = 371 Score = 42.7 bits (99), Expect = 0.029, Method: Compositional matrix adjust. Identities = 34/121 (28%), Positives = 52/121 (42%), Gaps = 15/121 (12%) Query: 312 KDYDEQPRGPFIVCVDTS-----GSMGGFNEQCAKAFCLALMRIALAENRRCYIMLFSTE 366 K Y+E+ ++CVD S GS G + A C + A+ N ++LFS E Sbjct: 149 KVYEEEREQTVMLCVDVSGSENFGSQGKLKREVAAEICAVIAFSAVQNNDTVGLLLFSDE 208 Query: 367 IVRYELSGPQGIEQAIRFLSQQFRG-----GTDLASCFRAIMERLQSREWFDADAVVISD 421 R+ G G +R + + F GTD+ R ++ LQ R V++SD Sbjct: 209 TERFVRPG-SGRRHVLRCIRELFTAEPESIGTDIGGALRRVLRILQRRSIL----VLVSD 263 Query: 422 F 422 F Sbjct: 264 F 264 Searching..................................................done Results from round 2 Score E Sequences producing significant alignments: (bits) Value Sequences used in model and found again: UniRef50_C6DJF9 Protein viaA n=166 Tax=Bacteria RepID=VIAA_PECCP 528 e-148 UniRef50_B8K8A8 von Willebrand factor, type A n=2 Tax=Vibrio Rep... 523 e-147 UniRef50_Q5E077 Stimulator of RavA ATPase activity, VWA domain c... 515 e-144 UniRef50_A6CWE2 Putative uncharacterized protein yieM n=3 Tax=Vi... 472 e-131 UniRef50_C4LBX1 von Willebrand factor type A n=1 Tax=Tolumonas a... 468 e-130 UniRef50_A3US96 Putative uncharacterized protein (Fragment) n=1 ... 429 e-118 UniRef50_A4SJ14 von Willebrand factor type A domain protein n=2 ... 428 e-118 UniRef50_A6FIY2 Uncharacterized protein containing a von Willebr... 412 e-113 UniRef50_Q6LJM7 Putative uncharacterized protein n=1 Tax=Photoba... 366 1e-99 UniRef50_A1SXM0 von Willebrand factor, type A n=2 Tax=Alteromona... 364 4e-99 UniRef50_A2SS27 von Willebrand factor, type A n=1 Tax=Methanocor... 361 3e-98 UniRef50_C6M593 Putative uncharacterized protein n=1 Tax=Neisser... 345 3e-93 UniRef50_C1Q9X6 Uncharacterized protein containing a von Willebr... 343 7e-93 UniRef50_Q14PC7 Hypothetical two-component regulator system yiem... 330 7e-89 UniRef50_A6DQA4 Putative uncharacterized protein n=1 Tax=Lentisp... 328 2e-88 UniRef50_Q1ZC32 Putative uncharacterized protein (Fragment) n=1 ... 319 2e-85 UniRef50_C0QY03 von Willebrand factor type A (VWA) domain contai... 318 2e-85 UniRef50_C1Q8W4 Uncharacterized protein containing a von Willebr... 314 5e-84 UniRef50_C3XEK2 Putative uncharacterized protein n=1 Tax=Helicob... 313 8e-84 UniRef50_A4S4M8 Predicted protein n=4 Tax=Mamiellales RepID=A4S4... 280 8e-74 UniRef50_D2U0I8 Putative uncharacterized protein n=1 Tax=Arsenop... 280 9e-74 UniRef50_A4Y9K4 von Willebrand factor, type A n=3 Tax=Shewanella... 278 3e-73 UniRef50_C4KA81 von Willebrand factor type A (VWA) domain-contai... 274 7e-72 UniRef50_D1YZJ4 Putative uncharacterized protein n=1 Tax=Methano... 267 6e-70 UniRef50_B9KEB5 Putative uncharacterized protein n=1 Tax=Campylo... 263 8e-69 UniRef50_C8SB00 von Willebrand factor type A n=1 Tax=Ferroglobus... 260 1e-67 UniRef50_A6C7T1 Putative uncharacterized protein n=1 Tax=Plancto... 260 1e-67 UniRef50_Q466I6 Putative uncharacterized protein n=3 Tax=Methano... 258 4e-67 UniRef50_Q0W1N2 Putative uncharacterized protein n=1 Tax=uncultu... 254 5e-66 UniRef50_Q46D40 Putative uncharacterized protein n=3 Tax=Methano... 253 9e-66 UniRef50_C3XJE4 Putative uncharacterized protein (Fragment) n=1 ... 248 5e-64 UniRef50_Q5LDB9 Putative uncharacterized protein n=11 Tax=Bacter... 245 2e-63 UniRef50_A6L4M8 Putative uncharacterized protein n=9 Tax=Bactero... 243 9e-63 UniRef50_C9KWG4 Putative uncharacterized protein n=1 Tax=Bactero... 237 6e-61 UniRef50_A0KNK1 Putative uncharacterized protein n=1 Tax=Aeromon... 237 9e-61 UniRef50_A7V9J4 Putative uncharacterized protein n=7 Tax=Bactero... 234 6e-60 UniRef50_Q8EW10 Putative uncharacterized protein MYPE3970 n=1 Ta... 224 5e-57 UniRef50_A8IX54 Predicted protein n=1 Tax=Chlamydomonas reinhard... 219 1e-55 UniRef50_A2SLM6 Uncharacterized protein containing a von Willebr... 216 1e-54 UniRef50_C0ZE04 Putative uncharacterized protein n=1 Tax=Breviba... 213 1e-53 UniRef50_B7DQJ9 von Willebrand factor type A n=1 Tax=Alicyclobac... 210 7e-53 UniRef50_Q58221 Uncharacterized protein MJ0811 n=4 Tax=Methanoca... 204 6e-51 UniRef50_D0LUP3 Uncharacterized protein containing a von Willebr... 204 7e-51 UniRef50_C6J3I5 Putative uncharacterized protein n=1 Tax=Paeniba... 202 3e-50 UniRef50_D1XLN5 von Willebrand factor type A n=12 Tax=Actinomyce... 201 4e-50 UniRef50_D0Z403 Putative uncharacterized protein n=1 Tax=Photoba... 200 8e-50 UniRef50_D0LHL0 von Willebrand factor type A n=1 Tax=Haliangium ... 199 2e-49 UniRef50_C8SZ21 Protein viaA (VWA domain protein interacting wit... 195 3e-48 UniRef50_B1L0Y8 von Willebrand factor type A domain protein n=10... 193 1e-47 UniRef50_A8ZLC2 Putative uncharacterized protein n=3 Tax=Acaryoc... 190 8e-47 UniRef50_B8C4H1 Predicted protein n=1 Tax=Thalassiosira pseudona... 189 2e-46 UniRef50_A7KV72 Putative metalloprotein chaperonin subunit n=1 T... 178 3e-43 UniRef50_C9RHJ4 von Willebrand factor type A n=1 Tax=Methanocald... 177 5e-43 UniRef50_D1PED6 Putative uncharacterized protein n=1 Tax=Prevote... 177 8e-43 UniRef50_A7VY69 Putative uncharacterized protein n=1 Tax=Clostri... 167 1e-39 UniRef50_Q2IEM5 VWA containing CoxE-like n=2 Tax=Anaeromyxobacte... 165 3e-39 UniRef50_Q60384 Uncharacterized protein MJ0077 n=3 Tax=Methanoca... 163 1e-38 UniRef50_A2BM85 Conserved archaeal protein n=1 Tax=Hyperthermus ... 148 4e-34 UniRef50_A8IQV8 Predicted protein (Fragment) n=1 Tax=Chlamydomon... 143 1e-32 UniRef50_A3DPE5 von Willebrand factor, type A n=2 Tax=Desulfuroc... 139 3e-31 UniRef50_Q9YD81 Putative uncharacterized protein n=1 Tax=Aeropyr... 137 1e-30 UniRef50_B3WV50 Protein ViaA n=5 Tax=Enterobacteriaceae RepID=B3... 115 3e-24 UniRef50_D2RGP5 von Willebrand factor type A n=1 Tax=Archaeoglob... 110 1e-22 Sequences not found previously or not previously below threshold: UniRef50_C3NM85 von Willebrand factor type A n=14 Tax=Sulfolobac... 137 1e-30 UniRef50_Q3V4Q4 Putative VWFA domain-containing protein ORF892 n... 107 8e-22 UniRef50_UPI00003C852B hypothetical protein Faci_06871 n=1 Tax=F... 93 2e-17 UniRef50_Q6KZN8 Putative uncharacterized protein n=1 Tax=Picroph... 87 1e-15 UniRef50_A4WJJ3 Putative uncharacterized protein n=5 Tax=Thermop... 75 5e-12 UniRef50_A6G7V2 von Willebrand factor, type A n=1 Tax=Plesiocyst... 74 2e-11 UniRef50_A6CIG8 Putative uncharacterized protein n=1 Tax=Bacillu... 73 2e-11 UniRef50_A0P2E4 Von Willebrand factor type A like domain n=1 Tax... 71 7e-11 UniRef50_D1A557 Vault protein inter-alpha-trypsin domain protein... 71 8e-11 UniRef50_UPI0001AEB92D inter-alpha-trypsin inhibitor domain-cont... 71 9e-11 UniRef50_Q09E12 Inter-alpha-inhibitor H4 heavy chain, putative n... 71 1e-10 UniRef50_D0KVI6 Vault protein inter-alpha-trypsin domain protein... 70 1e-10 UniRef50_D0LTR8 von Willebrand factor type A n=1 Tax=Haliangium ... 70 1e-10 UniRef50_B9XQ17 Vault protein inter-alpha-trypsin domain protein... 69 3e-10 UniRef50_A8M9M1 von Willebrand factor type A n=1 Tax=Caldivirga ... 69 3e-10 UniRef50_B2HK18 Conserved membrane protein n=3 Tax=Mycobacterium... 69 3e-10 UniRef50_Q55874 Uncharacterized protein sll0103 n=16 Tax=Cyanoba... 69 4e-10 UniRef50_B5JQC2 Vault protein inter-alpha-trypsin n=1 Tax=Verruc... 69 4e-10 UniRef50_A0DZ93 Chromosome undetermined scaffold_7, whole genome... 68 9e-10 UniRef50_B7AA98 von Willebrand factor type A n=3 Tax=Thermus Rep... 67 1e-09 UniRef50_B9RR85 Inter-alpha-trypsin inhibitor heavy chain, putat... 67 1e-09 UniRef50_UPI000174639B hypothetical protein VspiD_18615 n=1 Tax=... 66 2e-09 UniRef50_A3W9L9 Putative uncharacterized protein n=2 Tax=Erythro... 66 3e-09 UniRef50_A1VI76 Vault protein inter-alpha-trypsin domain protein... 66 3e-09 UniRef50_B5JSY7 Inter-alpha-trypsin inhibitor domain protein n=1... 65 4e-09 UniRef50_B1HPN0 Putative uncharacterized protein n=2 Tax=Bacilla... 65 5e-09 UniRef50_C5EGH1 von Willebrand factor n=2 Tax=Clostridiales RepI... 65 5e-09 UniRef50_Q22SJ4 von Willebrand factor type A domain containing p... 65 6e-09 UniRef50_C7PW75 Vault protein inter-alpha-trypsin domain protein... 65 6e-09 UniRef50_C4DQN3 Uncharacterized protein containing a von Willebr... 65 7e-09 UniRef50_A0LHW4 Vault protein inter-alpha-trypsin domain protein... 65 7e-09 UniRef50_Q114A2 von Willebrand factor, type A n=3 Tax=Cyanobacte... 65 8e-09 UniRef50_Q80UW6 Parp4 protein (Fragment) n=11 Tax=Eukaryota RepI... 64 1e-08 UniRef50_Q22ST4 von Willebrand factor type A domain containing p... 64 1e-08 UniRef50_Q1DE81 von Willebrand factor type A domain protein n=2 ... 64 1e-08 UniRef50_Q7UEC5 60-kDa SS-A/Ro ribonucleoprotein homolog n=3 Tax... 64 2e-08 UniRef50_B8CPU4 Von Willebrand factor, type A n=1 Tax=Shewanella... 64 2e-08 UniRef50_B3QTN9 Vault protein inter-alpha-trypsin domain protein... 64 2e-08 UniRef50_Q8TY27 Uncharacterized conserved protein n=1 Tax=Methan... 63 2e-08 UniRef50_C0CQI8 Putative uncharacterized protein n=1 Tax=Blautia... 63 2e-08 UniRef50_A3MT69 VWA containing CoxE family protein n=4 Tax=Pyrob... 63 2e-08 UniRef50_Q6A0B1 MKIAA0177 protein (Fragment) n=4 Tax=Murinae Rep... 62 3e-08 UniRef50_Q2B7C3 Putative uncharacterized protein n=1 Tax=Bacillu... 62 4e-08 UniRef50_C7N1C6 Uncharacterized protein n=1 Tax=Slackia heliotri... 62 4e-08 UniRef50_B2HDT6 Putative uncharacterized protein n=3 Tax=Mycobac... 62 6e-08 UniRef50_UPI000180BC4A PREDICTED: similar to predicted protein n... 62 6e-08 UniRef50_B5W7H4 von Willebrand factor type A n=1 Tax=Arthrospira... 62 6e-08 UniRef50_UPI0001788F71 von Willebrand factor type A n=1 Tax=Geob... 61 8e-08 UniRef50_Q0AMP5 Vault protein inter-alpha-trypsin domain protein... 61 8e-08 UniRef50_A2E6Y7 von Willebrand factor type A domain containing p... 61 8e-08 UniRef50_A1ZFT4 Von Willebrand factor, type A n=1 Tax=Microscill... 61 1e-07 UniRef50_Q2SQR4 Uncharacterized protein containing a von Willebr... 61 1e-07 UniRef50_A0EHG0 Chromosome undetermined scaffold_97, whole genom... 61 1e-07 UniRef50_B4X134 Vault protein inter-alpha-trypsin n=1 Tax=Alcani... 60 1e-07 UniRef50_A6DLI0 Putative uncharacterized protein n=1 Tax=Lentisp... 60 2e-07 UniRef50_A1RSU5 von Willebrand factor, type A n=5 Tax=Thermoprot... 60 2e-07 UniRef50_A7RKA1 Predicted protein n=2 Tax=Nematostella vectensis... 60 2e-07 UniRef50_A1SAA4 Uncharacterized protein containing a von Willebr... 60 3e-07 UniRef50_Q60ED8 Von Willebrand factor type A domain containing p... 59 3e-07 UniRef50_UPI00016C377F protein containing a von Willebrand facto... 59 3e-07 UniRef50_B0UK93 LPXTG-motif cell wall anchor domain protein n=1 ... 59 3e-07 UniRef50_Q9UKK3 Poly [ADP-ribose] polymerase 4 n=14 Tax=Eutheria... 59 3e-07 UniRef50_UPI00006CDDCC von Willebrand factor type A domain conta... 59 3e-07 UniRef50_Q10JU7 Von Willebrand factor type A domain containing p... 59 3e-07 UniRef50_A6GBY0 Putative uncharacterized protein n=1 Tax=Plesioc... 59 3e-07 UniRef50_D2VKS7 von Willebrand factor type A domain-containing p... 59 4e-07 UniRef50_A4XHD9 von Willebrand factor, type A n=2 Tax=Clostridia... 59 4e-07 UniRef50_Q7MCW9 Uncharacterized protein n=2 Tax=Vibrio vulnificu... 59 4e-07 UniRef50_Q54DU5 von Willebrand factor A domain-containing protei... 59 4e-07 UniRef50_UPI0000E47A28 PREDICTED: hypothetical protein n=3 Tax=S... 59 4e-07 UniRef50_Q7UNM0 Putative uncharacterized protein n=1 Tax=Rhodopi... 59 4e-07 UniRef50_Q8H924 Os10g0464900 protein n=3 Tax=Oryza sativa RepID=... 59 5e-07 UniRef50_A8U1E2 Putative uncharacterized protein n=1 Tax=alpha p... 59 5e-07 UniRef50_B2UZB2 von Willebrand factor type A domain protein n=1 ... 59 5e-07 UniRef50_A4YGU7 von Willebrand factor, type A n=12 Tax=Sulfoloba... 59 5e-07 UniRef50_Q09DT2 Inter-alpha-trypsin inhibitor family heavy chain... 59 6e-07 UniRef50_UPI0000E49DB4 PREDICTED: similar to poly (ADP-ribose) p... 59 6e-07 UniRef50_UPI0000D9E789 PREDICTED: similar to poly (ADP-ribose) p... 59 6e-07 UniRef50_B9L896 von Willebrand factor, type A n=1 Tax=Nautilia p... 58 6e-07 UniRef50_C5CDX4 von Willebrand factor type A n=1 Tax=Kosmotoga o... 58 7e-07 UniRef50_A7HVH6 Vault protein inter-alpha-trypsin domain protein... 58 8e-07 UniRef50_Q54CQ8 von Willebrand factor A domain-containing protei... 58 9e-07 UniRef50_B1KDL1 LPXTG-motif cell wall anchor domain protein n=1 ... 58 1e-06 UniRef50_D2VY88 Predicted protein n=1 Tax=Naegleria gruberi RepI... 57 1e-06 UniRef50_C0D9H7 Putative uncharacterized protein n=1 Tax=Clostri... 57 1e-06 UniRef50_A1S752 Inter-alpha-trypsin inhibitor domain protein n=1... 57 1e-06 UniRef50_Q7G2L9 Os10g0464500 protein n=3 Tax=Magnoliophyta RepID... 57 1e-06 UniRef50_D0LST5 Putative uncharacterized protein n=1 Tax=Haliang... 57 1e-06 UniRef50_A6X8G3 LPXTG-motif cell wall anchor domain protein n=11... 57 1e-06 UniRef50_D1HHA4 Whole genome shotgun sequence of line PN40024, s... 57 1e-06 UniRef50_A6G415 von Willebrand factor, type A n=1 Tax=Plesiocyst... 57 1e-06 UniRef50_B9GVZ4 Predicted protein n=4 Tax=rosids RepID=B9GVZ4_POPTR 57 1e-06 UniRef50_Q4SBF6 Chromosome 11 SCAF14674, whole genome shotgun se... 57 2e-06 UniRef50_C4ZKE8 von Willebrand factor type A n=2 Tax=Thauera sp.... 57 2e-06 UniRef50_UPI0000E105CF vault protein inter-alpha-trypsin n=1 Tax... 57 2e-06 UniRef50_A5GQG5 Protoporphyrin IX Mg-chelatase subunit ChlD n=3 ... 57 2e-06 UniRef50_UPI0001C38CFE von Willebrand factor, type A n=1 Tax=Art... 57 2e-06 UniRef50_A5B5Z1 Putative uncharacterized protein n=2 Tax=Vitis v... 57 2e-06 UniRef50_B7G6X2 Predicted protein n=1 Tax=Phaeodactylum tricornu... 57 2e-06 UniRef50_Q55G98 von Willebrand factor A domain-containing protei... 57 2e-06 UniRef50_Q9FF49 Retroelement pol polyprotein-like n=34 Tax=Magno... 57 2e-06 UniRef50_A3ZR58 Putative uncharacterized protein n=1 Tax=Blastop... 57 2e-06 UniRef50_D2W4Q3 Predicted protein n=1 Tax=Naegleria gruberi RepI... 56 2e-06 UniRef50_UPI00006A1915 Transmembrane protein 110. n=2 Tax=Xenopu... 56 2e-06 UniRef50_Q47YR5 Von Willebrand factor type A domain protein n=2 ... 56 2e-06 UniRef50_Q083T9 Vault protein inter-alpha-trypsin domain protein... 56 3e-06 UniRef50_UPI0000F2DDBB PREDICTED: similar to Inter-alpha (globul... 56 3e-06 UniRef50_UPI0001BCB742 hypothetical protein FperA3_08546 n=1 Tax... 56 3e-06 UniRef50_A6Q208 von Willebrand factor type A domain protein n=1 ... 56 3e-06 UniRef50_Q22HH7 von Willebrand factor type A domain containing p... 56 3e-06 UniRef50_Q54MG4 von Willebrand factor A domain-containing protei... 56 3e-06 UniRef50_B5ZY26 Vault protein inter-alpha-trypsin domain protein... 56 3e-06 UniRef50_C7P2A9 von Willebrand factor type A n=2 Tax=Halobacteri... 56 3e-06 UniRef50_A3QDW1 Vault protein inter-alpha-trypsin domain protein... 56 3e-06 UniRef50_A0D1M1 Chromosome undetermined scaffold_34, whole genom... 56 4e-06 UniRef50_Q021L5 von Willebrand factor, type A n=1 Tax=Candidatus... 56 4e-06 UniRef50_O26551 Magnesium chelatase subunit ChlI n=1 Tax=Methano... 56 4e-06 UniRef50_C0PS55 Putative uncharacterized protein n=1 Tax=Picea s... 55 4e-06 UniRef50_Q54DV3 von Willebrand factor A domain-containing protei... 55 4e-06 UniRef50_Q15NW6 Vault protein inter-alpha-trypsin n=1 Tax=Pseudo... 55 4e-06 UniRef50_C4WI90 Poly [ADP-ribose] polymerase 4 n=1 Tax=Ochrobact... 55 4e-06 UniRef50_C1YR26 Uncharacterized protein containing a von Willebr... 55 4e-06 UniRef50_A3ZTC3 Putative uncharacterized protein n=1 Tax=Blastop... 55 4e-06 UniRef50_Q9LN03 T6D22.13 n=2 Tax=Arabidopsis thaliana RepID=Q9LN... 55 4e-06 UniRef50_A2E1S5 von Willebrand factor type A domain containing p... 55 4e-06 UniRef50_B9KV79 Putative uncharacterized protein n=1 Tax=Rhodoba... 55 5e-06 UniRef50_B4VT64 Vault protein inter-alpha-trypsin n=9 Tax=Cyanob... 55 6e-06 UniRef50_UPI00016C09D7 hypothetical protein Epulo_01596 n=1 Tax=... 55 6e-06 UniRef50_A9V370 Predicted protein n=1 Tax=Monosiga brevicollis R... 55 6e-06 UniRef50_D2VDM1 Predicted protein n=1 Tax=Naegleria gruberi RepI... 55 7e-06 UniRef50_C6WL97 VWA containing CoxE family protein n=1 Tax=Actin... 55 7e-06 UniRef50_A9WI94 von Willebrand factor type A n=2 Tax=Chloroflexu... 55 7e-06 UniRef50_Q24FW2 von Willebrand factor type A domain containing p... 55 7e-06 UniRef50_Q21PJ3 von Willebrand factor, type A n=1 Tax=Saccharoph... 55 7e-06 UniRef50_Q86UX2 Inter-alpha-trypsin inhibitor heavy chain H5 n=4... 55 7e-06 UniRef50_A6DST2 Putative uncharacterized protein n=1 Tax=Lentisp... 55 7e-06 UniRef50_C0Z8R3 Hypothetical membrane protein n=1 Tax=Brevibacil... 55 8e-06 UniRef50_A4C730 Putative uncharacterized protein n=1 Tax=Pseudoa... 55 8e-06 UniRef50_UPI0000E80A5E PREDICTED: similar to calcium-activated c... 55 9e-06 UniRef50_Q67LZ3 Putative uncharacterized protein n=1 Tax=Symbiob... 55 9e-06 UniRef50_C5FHM8 von Willebrand factor type A domain-containing p... 54 9e-06 UniRef50_P76396 Uncharacterized protein yegL n=64 Tax=cellular o... 54 9e-06 UniRef50_A2BMB7 Putative uncharacterized protein n=1 Tax=Hyperth... 54 9e-06 UniRef50_Q3IHK0 Putative uncharacterized protein n=2 Tax=Alterom... 54 9e-06 UniRef50_Q986Q0 Mlr7258 protein n=2 Tax=Alphaproteobacteria RepI... 54 9e-06 UniRef50_A8FW78 Vault protein inter-alpha-trypsin domain protein... 54 1e-05 UniRef50_Q1INP4 von Willebrand factor, type A n=1 Tax=Candidatus... 54 1e-05 UniRef50_UPI0001744662 Vault protein inter-alpha-trypsin domain ... 54 1e-05 UniRef50_D2V048 Predicted protein n=2 Tax=Naegleria gruberi RepI... 54 1e-05 UniRef50_Q1YZ74 Inter-alpha-trypsin inhibitor domain protein n=1... 54 1e-05 UniRef50_Q10RY0 Os03g0142500 protein n=2 Tax=Oryza sativa RepID=... 54 1e-05 UniRef50_A5UYN7 von Willebrand factor, type A n=1 Tax=Roseiflexu... 54 1e-05 UniRef50_A6G857 Putative uncharacterized protein n=1 Tax=Plesioc... 54 1e-05 UniRef50_A9U149 Predicted protein n=1 Tax=Physcomitrella patens ... 54 1e-05 UniRef50_UPI00016C3857 hypothetical protein GobsU_16534 n=1 Tax=... 54 1e-05 UniRef50_Q1NTK1 Von Willebrand factor, type A n=2 Tax=delta prot... 54 1e-05 UniRef50_B1HSQ9 Putative uncharacterized protein n=2 Tax=Bacilla... 54 1e-05 UniRef50_D1YYY2 Putative uncharacterized protein n=1 Tax=Methano... 54 1e-05 UniRef50_Q9U7P4 Putative uncharacterized protein (Fragment) n=1 ... 54 2e-05 UniRef50_UPI0000ECD6E7 Poly [ADP-ribose] polymerase 4 (EC 2.4.2.... 54 2e-05 UniRef50_UPI0001BC3853 von Willebrand factor type A n=1 Tax=Buty... 54 2e-05 UniRef50_B5HZU2 VWA domain-containing protein n=1 Tax=Streptomyc... 54 2e-05 UniRef50_B7FTA2 Predicted protein n=3 Tax=Bacillariophyta RepID=... 54 2e-05 UniRef50_B9HP09 Predicted protein n=13 Tax=cellular organisms Re... 54 2e-05 UniRef50_Q6VPP3 Parturition-related protein PRP3 n=6 Tax=Eutheri... 54 2e-05 UniRef50_Q1D9B7 von Willebrand factor type A domain protein n=3 ... 54 2e-05 UniRef50_C7FPD9 Uncharacterized protein n=2 Tax=environmental sa... 54 2e-05 UniRef50_Q1DFU7 von Willebrand factor type A domain protein n=2 ... 54 2e-05 UniRef50_Q9M1S2 Putative uncharacterized protein T5N23_140 n=5 T... 53 2e-05 UniRef50_A6WMD3 Vault protein inter-alpha-trypsin domain protein... 53 2e-05 UniRef50_UPI00016C38A3 LPXTG-motif cell wall anchor domain prote... 53 2e-05 UniRef50_B5ZN80 von Willebrand factor type A n=8 Tax=Rhizobiales... 53 2e-05 UniRef50_Q2QSE5 Os12g0431700 protein n=3 Tax=Oryza sativa RepID=... 53 2e-05 UniRef50_A0DIJ2 Chromosome undetermined scaffold_52, whole genom... 53 2e-05 UniRef50_B6XSC0 Putative uncharacterized protein n=3 Tax=Bifidob... 53 2e-05 UniRef50_B6HQ22 Pc22g19800 protein n=1 Tax=Penicillium chrysogen... 53 2e-05 UniRef50_UPI00006CDA4D Glutathionylspermidine synthase family pr... 53 2e-05 UniRef50_Q74N17 NEQ403 n=1 Tax=Nanoarchaeum equitans RepID=Q74N1... 53 2e-05 UniRef50_Q5K267 Putative uncharacterized protein (Fragment) n=1 ... 53 3e-05 UniRef50_A1S119 von Willebrand factor, type A n=1 Tax=Thermofilu... 53 3e-05 UniRef50_A8J3X6 Flagellar associated protein (Fragment) n=1 Tax=... 53 3e-05 UniRef50_B7RYC9 Vault protein inter-alpha-trypsin n=1 Tax=marine... 53 3e-05 UniRef50_Q11Y10 Possible outer membrane protein n=1 Tax=Cytophag... 53 3e-05 UniRef50_Q23JA0 von Willebrand factor type A domain containing p... 53 3e-05 UniRef50_B5IF69 von Willebrand factor type A domain protein n=2 ... 53 3e-05 UniRef50_B1ZQD5 von Willebrand factor type A n=1 Tax=Opitutus te... 53 3e-05 UniRef50_C5F2Q3 Phage protein (Fragment) n=5 Tax=Bacteria RepID=... 53 3e-05 >UniRef50_C6DJF9 Protein viaA n=166 Tax=Bacteria RepID=VIAA_PECCP Length = 492 Score = 528 bits (1361), Expect = e-148, Method: Composition-based stats. Identities = 307/487 (63%), Positives = 389/487 (79%), Gaps = 5/487 (1%) Query: 1 MLTLDTLNVMLAVSEEGLIEEMIIALLASPQLAVFFEKFPRLKAAITDDVPRWREALRSR 60 M+TL++L ++L++ E L++++++ LLA+PQLA FFEK+P LK+A+ +D+P W+E L+ R Sbjct: 1 MITLESLEMLLSIDENELLDDLVVTLLATPQLAFFFEKYPSLKSALLNDLPHWKETLKQR 60 Query: 61 LKDARVPPELTEEVMCYQQSQLLSTPQFIVQLPQILDLLHRLNSPWAEQARQLVDAN--- 117 L+ +VPP+L +E CYQ+SQ + F +LP I+D L + SP+ QA QL+ A Sbjct: 61 LRTTQVPPDLEKEFSCYQRSQSIDNQAFQTRLPAIMDTLSNVESPFLTQASQLITAPERT 120 Query: 118 --STITSALHTLFLQRWRLSLIVQATTLNQQLLEEEREQLLSEVQERMTLSGQLEPILAD 175 +TS LH LFLQRWRLSL +Q +L+QQL+E+ERE LL E+Q+R+TLSG+LEPILA+ Sbjct: 121 LGQKVTSGLHALFLQRWRLSLTLQTVSLHQQLMEQEREILLDELQQRLTLSGKLEPILAE 180 Query: 176 NNTAAGRLWDMSAGQLKRGDYQLIVKYGEFLNEQPELKRLAEQLGRSREAKSIPRNDAQM 235 N AAGRLWD+SA Q + D + ++ +G FL QP L++LAE+LGRSRE KSI +A Sbjct: 181 NENAAGRLWDLSAAQRIQTDPRPLLDFGAFLQRQPALQKLAERLGRSRETKSILTQEAPK 240 Query: 236 ETFRTMVREPATVPEQVDGLQQSDDILRLLPPELATLGITELEYEFYRRLVEKQLLTYRL 295 E FR VREPATVPEQV G+ QSDDILRL+P EL TLGI+ELEYEFYRRL+E +LLTYRL Sbjct: 241 EAFRVSVREPATVPEQVSGVHQSDDILRLMPTELVTLGISELEYEFYRRLLEHRLLTYRL 300 Query: 296 HGESWREKVIERPVVHKDYDEQPRGPFIVCVDTSGSMGGFNEQCAKAFCLALMRIALAEN 355 GESWREK+ ERPVVH+ ++QPRGPFIVCVDTSGSMGGFNE+CAKAFCLALMRIALA+N Sbjct: 301 QGESWREKITERPVVHQQNEQQPRGPFIVCVDTSGSMGGFNERCAKAFCLALMRIALADN 360 Query: 356 RRCYIMLFSTEIVRYELSGPQGIEQAIRFLSQQFRGGTDLASCFRAIMERLQSREWFDAD 415 RRCYIMLFST +V+YEL+ G+EQAIRFLSQ FRGGTD+++C A+++++ W DAD Sbjct: 361 RRCYIMLFSTGVVKYELTSADGLEQAIRFLSQSFRGGTDMSACLSALLDKMDDALWHDAD 420 Query: 416 AVVISDFIAQRLPDDVTSKVKELQRVHQHRFHAVAMSAHGKPGIMRIFDHIWRFDTGMRS 475 AVVISDFIAQRLPD+V +KVK Q QHRFHAVAMS HGKPGIM IFDHIWRFDTG++S Sbjct: 421 AVVISDFIAQRLPDEVVNKVKSRQTQLQHRFHAVAMSDHGKPGIMHIFDHIWRFDTGLKS 480 Query: 476 RLLRRWR 482 RL+RRW+ Sbjct: 481 RLMRRWQ 487 >UniRef50_B8K8A8 von Willebrand factor, type A n=2 Tax=Vibrio RepID=B8K8A8_VIBPA Length = 481 Score = 523 bits (1348), Expect = e-147, Method: Composition-based stats. Identities = 157/484 (32%), Positives = 275/484 (56%), Gaps = 6/484 (1%) Query: 1 MLTLDTLNVMLAVSEEGLIEEMIIALLASPQLAVFFEKFPRLKAAITDDVPRWREALRSR 60 ML D LN+ L +++ G+I+ + L+A Q+ E +K+++ + + +WR +++ R Sbjct: 1 MLGADGLNLALMIADSGIIDTAVNDLMARSQMMAVAEN-RGVKSSVKNHLLKWRGSVKKR 59 Query: 61 LKDARVPPELTEEVMCYQQSQLLSTPQFIVQLPQILDLLHRLNSPWAEQARQLVDANSTI 120 + +E+ YQ+ QF Q+P+++ L +S + QAR+L++ N + Sbjct: 60 ITKVCETERFQQELALYQEVIYWDEAQFFEQIPEVIKKL-EWHSAFYLQARRLMEKNKGV 118 Query: 121 TSALH-TLFLQRWRLSLIVQATTLNQQLLEEEREQLLSEVQERMTLSGQLEPILADNNTA 179 + + F +W SL LE +E++L ++ +RM ++ + + + Sbjct: 119 NNPMFPHYFCDQWYESLSDAIRQAQLTELEANKEKVLKDLYQRMETMKNMDKVTEEGDEG 178 Query: 180 A-GRLWDMSAGQLKRGDYQLIVKYGEFLNEQPELKRLAEQLGRSREAKSIPR-NDAQMET 237 + GRLWDM++ +L + D ++ ++ EFL + L+ +AE+LGR P N A +E Sbjct: 179 SVGRLWDMASARLSKTDLTVMKRHAEFLKKNQGLQEIAEKLGRMASQVDDPDLNKAPLEE 238 Query: 238 FRTMVREPATVPEQVDGLQQSDDILRLLPPELATLGITELEYEFYRRLVEKQLLTYRLHG 297 + + + + + G+ + DD+ +LLP E L ELE FY+ LV+K+L+ Y++ G Sbjct: 239 PQIVEEKSDKATDDIVGIHEGDDLNKLLPNETMFLAYPELEVVFYKHLVDKRLMNYKMQG 298 Query: 298 ESWREKVIERPVVHKDYDEQPRGPFIVCVDTSGSMGGFNEQCAKAFCLALMRIALAENRR 357 +S + + + +GPFI+CVD SGSM GF EQCAKA ALM+IALAE+R Sbjct: 299 KSRTLRKVRAQKPDNAQVDVEKGPFIICVDASGSMSGFPEQCAKAMAYALMQIALAEDRD 358 Query: 358 CYIMLFSTEIVRYELSGPQGIEQAIRFLSQQFRGGTDLASCFRAIMERLQSREWFDADAV 417 CY++LFS+E + YEL+ G+ +A FL+ F GGTDL ++ + ++ +AD V Sbjct: 359 CYVILFSSEQITYELTKQDGLREASDFLTYSFHGGTDLEPVLMKSIDLMTGDKYRNADMV 418 Query: 418 VISDFIAQRLPDDVTSKVKELQRVHQHRFHAVAMSAHGKPGIMRIFDHIWRFDTGMRSRL 477 VISDFIA + +++ +KV EL+ H++RFHA+++S +G P +M +FDH W + + R+ Sbjct: 419 VISDFIAPKQSEEMIAKVDELK-EHKNRFHAISLSKYGNPELMTMFDHCWSYHPNLMGRI 477 Query: 478 LRRW 481 +++W Sbjct: 478 MKKW 481 >UniRef50_Q5E077 Stimulator of RavA ATPase activity, VWA domain containing n=61 Tax=Vibrionales RepID=Q5E077_VIBF1 Length = 482 Score = 515 bits (1327), Expect = e-144, Method: Composition-based stats. Identities = 158/480 (32%), Positives = 269/480 (56%), Gaps = 5/480 (1%) Query: 1 MLTLDTLNVMLAVSEEGLIEEMIIALLASPQLAVFFEKFPRLKAAITDDVPRWREALRSR 60 ML D LN+++ V+E G+I+ I +L+ PQ + P +K I + + +WR ++ + Sbjct: 1 MLGADALNLVMMVAESGMIDSSIAEILSRPQFLTAAKSNPNIKPTIKNHILKWRGKVKHK 60 Query: 61 LKDARVPPELTEEVMCYQQSQLLSTPQFIVQLPQILDLLHRLNSPWAEQARQLVDANSTI 120 + + +E+ YQ S +F ++ +I+ L + +S + +A+QL + N + Sbjct: 61 MTKVCETERIQDELALYQDVIHWSENEFYQRIDEIISKL-KWHSAFYVEAKQLANDNKGL 119 Query: 121 TSALH-TLFLQRWRLSLIVQATTLNQQLLEEEREQLLSEVQERMTLSGQLEPILADNNTA 179 + + F +RW SL L+E++E+LL+++ +R+ +E + A+ + A Sbjct: 120 MNPMFPRFFCERWYQSLSDAIKKAQLSELKEDKEKLLADLYQRIETLKTMESVTAEGDEA 179 Query: 180 -AGRLWDMSAGQLKRGDYQLIVKYGEFLNEQPELKRLAEQLGRSR-EAKSIPRNDAQMET 237 G+LWDM++ +L + + ++ + FL + L+ +A +LGR EA+ ++ A E Sbjct: 180 QIGKLWDMASAKLTKSNVDIMKLHARFLKKNKGLQDIASKLGRMANEAEHSDKSQAMAEE 239 Query: 238 FRTMVREPATVPEQVDGLQQSDDILRLLPPELATLGITELEYEFYRRLVEKQLLTYRLHG 297 + + + V + + G+ + DD+ RLLP E L ELE FY+ L++K+L+ YR+ G Sbjct: 240 VKVVEEKSDFVTDDIVGVHEGDDLSRLLPNETLFLSHPELEVIFYQHLIDKRLMNYRMQG 299 Query: 298 ESWREKVIERPVVHKDYDEQPRGPFIVCVDTSGSMGGFNEQCAKAFCLALMRIALAENRR 357 + + + +GPF+VCVD SGSM GF EQCAKA LM+IALAE R Sbjct: 300 ADRKLRKVTTQSRAASNALIEKGPFVVCVDASGSMSGFPEQCAKALAYGLMQIALAEERD 359 Query: 358 CYIMLFSTEIVRYELSGPQGIEQAIRFLSQQFRGGTDLASCFRAIMERLQSREWFDADAV 417 CY++LFST+ + YELS G+++ FLS +F GGTDL ++ + ++ +AD V Sbjct: 360 CYVILFSTQQITYELSKQDGLKEVADFLSYKFHGGTDLEPVLEKSIQLMHGDKYKNADLV 419 Query: 418 VISDFIAQRLPDDVTSKVKELQRVHQHRFHAVAMSAHGKPGIMRIFDHIWRFDTGMRSRL 477 V+SDFIA + + + V +L++ HQ+RFHAV +S +G P +M +FDH W + M RL Sbjct: 420 VLSDFIAPTHSEKIDAMVGDLKK-HQNRFHAVCLSKYGNPALMAMFDHTWAYHPSMLGRL 478 >UniRef50_A6CWE2 Putative uncharacterized protein yieM n=3 Tax=Vibrio RepID=A6CWE2_9VIBR Length = 497 Score = 472 bits (1214), Expect = e-131, Method: Composition-based stats. Identities = 145/483 (30%), Positives = 262/483 (54%), Gaps = 6/483 (1%) Query: 1 MLTLDTLNVMLAVSEEGLIEEMIIALLASPQLAVFFEKFPRLKAAITDDVPRWREALRSR 60 ML D+LN+ + V+E G+I+ + ++ L LK ++T + +W ++++ + Sbjct: 1 MLGADSLNLAMMVAESGIIDSAVRDIMQQTDLLAMGSD-EGLKQSLTASMAKWSKSVKRK 59 Query: 61 LKDARVPPELTEEVMCYQQSQLLSTPQFIVQLPQILDLLHRLNSPWAEQARQLV-DANST 119 L + L E+ YQ++ L+ +F QL Q+++ L +S + +ARQL D ++ Sbjct: 60 LVKGQETESLQSELELYQRAVYLTEQEFDDQLSQLIEQLPE-DSHFLPKARQLASDIDAY 118 Query: 120 ITSALHTLFLQRWRLSLIVQATTLNQQLLEEEREQLLSEVQERMTLSGQLEPILADNNTA 179 S F ++W SL + QQ +++++ + L ++ +++ +E + Sbjct: 119 PRSLFARQFCKQWYESLKQAVESKQQQTVDQQKSKFLKQMYQKIDTLKDMENLQEGGEQG 178 Query: 180 -AGRLWDMSAGQLKRGDYQLIVKYGEFLNEQPELKRLAEQLGRSREAKSIPRNDAQMETF 238 GRLWD++ +L + D++ I + E+L ELK +A++LGR E P + + Sbjct: 179 KLGRLWDLAGAELTKQDWRHIERTAEYLENNQELKHIADKLGRMAEEVDAPELNKALSHD 238 Query: 239 RTMVREPAT-VPEQVDGLQQSDDILRLLPPELATLGITELEYEFYRRLVEKQLLTYRLHG 297 +V E + + G+ +S+DI LLP E L ELE FY+ LVEK+LLTY+ G Sbjct: 239 EVVVEEKTDFATDDIVGIHESNDINNLLPNETMYLAYPELETIFYQHLVEKRLLTYKSEG 298 Query: 298 ESWREKVIERPVVHKDYDEQPRGPFIVCVDTSGSMGGFNEQCAKAFCLALMRIALAENRR 357 + + + P ++ GP ++ +D SGSM G E+ AKA +LM++A + R Sbjct: 299 KQRTVRQLHSPKTATGEADKETGPMLIAIDVSGSMQGAPEKSAKAIAYSLMKMAAQQQRE 358 Query: 358 CYIMLFSTEIVRYELSGPQGIEQAIRFLSQQFRGGTDLASCFRAIMERLQSREWFDADAV 417 C+++LFS+ + Y+L+G G+++A FLS F+GGTDL +E +Q ++ +AD + Sbjct: 359 CHVILFSSTFISYDLTGTTGLKEASDFLSYTFKGGTDLGKVLNHAVELMQGEQYKNADLL 418 Query: 418 VISDFIAQRLPDDVTSKVKELQRVHQHRFHAVAMSAHGKPGIMRIFDHIWRFDTGMRSRL 477 VISDFIA + +V KV+ L R +RFH++ +S +G P ++ +FD WR+ + + Sbjct: 419 VISDFIAPKQEQEVVEKVESL-RGRYNRFHSLCLSKYGNPEVLGLFDTQWRYHPSLVGQF 477 Query: 478 LRR 480 +++ Sbjct: 478 IKK 480 >UniRef50_C4LBX1 von Willebrand factor type A n=1 Tax=Tolumonas auensis DSM 9187 RepID=C4LBX1_TOLAT Length = 479 Score = 468 bits (1203), Expect = e-130, Method: Composition-based stats. Identities = 182/484 (37%), Positives = 304/484 (62%), Gaps = 7/484 (1%) Query: 1 MLTLDTLNVMLAVSEEGLIEEMIIALLASPQLAVFFEKFPRLKAAITDDVPRWREALRSR 60 M+ L TL+++L+++E ++++++ +++SPQ++ F + P + + V +W +++ ++ Sbjct: 1 MVDLQTLSLLLSINETQMVQDLVSTVMSSPQVSQFMHEHPLFFKNVQEHVQQWSQSIPAQ 60 Query: 61 LKDARVPPELTEEVMCYQQSQLLSTPQFIVQLPQILDLLHRLNSPWAEQARQLVDANSTI 120 LK+ VP +L +E + Y +Q LS QF Q +L L +S + A+ L+ S Sbjct: 61 LKNIPVPDDLQQEYILYLDAQGLSAEQFTQQSADLLVQLQ--HSDFHTDAQNLLLTLSQA 118 Query: 121 TSALH-TLFLQRWRLSLIVQATTLNQQLLEEEREQLLSEVQERMTLSGQLEPILADNNTA 179 + LF+Q+WR L+ Q +L E+ERE++L +++ RM ++G+L+ LA + Sbjct: 119 NAHNRKQLFIQKWREHLVSQVLSLEIIFAEQERERMLQQLELRMQVAGELDETLAPQH-- 176 Query: 180 AGRLWDMSAGQLKRGDYQLIVKYGEFLNEQPELKRLAEQLGRSR-EAKSIPRNDAQMETF 238 G+LWD++A L +G+ L Y FL PELK++A+ LGR+ + S ++ET Sbjct: 177 PGKLWDLTATHLLQGNSSLFRHYASFLTHNPELKKIADALGRAATQDSSAEEQINRVETA 236 Query: 239 RTMVREPATVPEQVDGLQQSDDILRLLPPELATLGITELEYEFYRRLVEKQLLTYRLHGE 298 + VP+ + G+ QS+++ RL+ E L ELE FY++L E++LL Y+ G+ Sbjct: 237 EWQTVQHEQVPDDLVGIHQSNELNRLISSETVLLTEPELETVFYKQLAERRLLNYQFMGQ 296 Query: 299 SWREKVIERPVVHKDYDEQPRGPFIVCVDTSGSMGGFNEQCAKAFCLALMRIALAENRRC 358 S + + + +GPFIVC+DTSGSM G+ E CAK FC AL++IAL+ENR C Sbjct: 297 SRSLETVMSEQRTFGETQDTKGPFIVCIDTSGSMSGYPEDCAKGFCFALLQIALSENRAC 356 Query: 359 YIMLFSTEIVRYELSGPQGIEQAIRFLSQQFRGGTDLASCFRAIMERLQSREWFDADAVV 418 IMLFST++V YEL+GP+G+++A+ FL F+GGTDL C + +M +Q + +ADAVV Sbjct: 357 VIMLFSTDVVTYELTGPEGLQEALNFLGCSFKGGTDLEPCMQQVMHYMQQARFSNADAVV 416 Query: 419 ISDFIAQRLPDDVTSKVKELQRVHQHRFHAVAMSAHGKPGIMRIFDHIWRFDTGMRSRLL 478 +SDFIAQRL + + ++++R + +RF+AV++S HGKP +M+IFD++W+FDT + R+L Sbjct: 417 LSDFIAQRLSVETEQQAQQIKR-NGNRFNAVSLSRHGKPALMKIFDNVWKFDTSLSGRVL 475 Query: 479 RRWR 482 R+ R Sbjct: 476 RKVR 479 >UniRef50_A3US96 Putative uncharacterized protein (Fragment) n=1 Tax=Vibrio splendidus 12B01 RepID=A3US96_VIBSP Length = 403 Score = 429 bits (1103), Expect = e-118, Method: Composition-based stats. Identities = 132/405 (32%), Positives = 220/405 (54%), Gaps = 5/405 (1%) Query: 1 MLTLDTLNVMLAVSEEGLIEEMIIALLASPQLAVFFEKFPRLKAAITDDVPRWREALRSR 60 ML D LN+ L V++ G+I+ + L+A Q+ + E +K ++ + + +WR ++ R Sbjct: 1 MLGADGLNLALMVADSGIIDTAMNDLIARSQVMMAAEN-KGVKTSVKNHLVKWRGKVKKR 59 Query: 61 LKDARVPPELTEEVMCYQQSQLLSTPQFIVQLPQILDLLHRLNSPWAEQARQLVDANSTI 120 + EE+ YQ+ PQF ++ ++ L +S + QAR+L++ N + Sbjct: 60 VTKVCETDRFQEEIALYQEVIYWDEPQFFDEIDSVIKKL-EWHSAFYLQARRLMENNKGV 118 Query: 121 TSALH-TLFLQRWRLSLIVQATTLNQQLLEEEREQLLSEVQERMTLSGQLEPILADNNTA 179 +A+ F +W SL LE +E++L+++ +RM ++ + + Sbjct: 119 YNAMFPHYFCDQWYQSLSDAIKQAQVTELETSKEKVLADLYQRMETMKNMDKVTESGDEG 178 Query: 180 A-GRLWDMSAGQLKRGDYQLIVKYGEFLNEQPELKRLAEQLGRSREAKSIPR-NDAQMET 237 + GRLWDM++ +L + D ++ ++ EFLN+ L+ +AE+LGR + P + A +E Sbjct: 179 SVGRLWDMASAKLSKTDLTIMKRHAEFLNKHKGLQEIAEKLGRMASEEDDPSLHKAPVEE 238 Query: 238 FRTMVREPATVPEQVDGLQQSDDILRLLPPELATLGITELEYEFYRRLVEKQLLTYRLHG 297 + + + + + G+ +SDD+ ++LP E L ELE FY+ L +K+LL+YR G Sbjct: 239 LQMVEEKSDEAVDDIVGIHESDDLNKMLPNETMFLAYPELEVIFYKHLADKRLLSYRSQG 298 Query: 298 ESWREKVIERPVVHKDYDEQPRGPFIVCVDTSGSMGGFNEQCAKAFCLALMRIALAENRR 357 +S + ++ + +GPFIVCVD SGSM GF EQ AKA ALM+IALAE R Sbjct: 299 KSRTLRKVKAQKPDSKNVDIEKGPFIVCVDASGSMSGFPEQSAKAMAYALMQIALAEERD 358 Query: 358 CYIMLFSTEIVRYELSGPQGIEQAIRFLSQQFRGGTDLASCFRAI 402 CY++LFS+E + YEL+ G+ +A FLS F GGTDL Sbjct: 359 CYVILFSSEQITYELTRQDGLREASDFLSYSFHGGTDLEPVLMKS 403 >UniRef50_A4SJ14 von Willebrand factor type A domain protein n=2 Tax=Aeromonas RepID=A4SJ14_AERS4 Length = 484 Score = 428 bits (1101), Expect = e-118, Method: Composition-based stats. Identities = 183/474 (38%), Positives = 282/474 (59%), Gaps = 8/474 (1%) Query: 1 MLTLDTLNVMLAVSEEGLIEEMIIALLASPQLAVFFEKFPRLKAAITDDVPRWREALRSR 60 M+ + T++ +LA+SE ++ EM++ALLAS Q++ F ++ + RWR + Sbjct: 1 MIEIGTMSALLAISEGEMVSEMVVALLASTQISRFIRIGKAQGRSLKQRLQRWRHQVNDT 60 Query: 61 LKDARVPPELTEEVMCYQQSQLLSTPQFIVQLPQILDLLHRLNSPWAEQARQLVDAN-ST 119 + VPP L +E + YQ LS + + +LP +L L R S +A++ RQL Sbjct: 61 IAHTPVPPVLEQEFLLYQHFISLSLARLVAELPTLLSALER-GSDFADEGRQLAHQLVDH 119 Query: 120 ITSALHTLFLQRWRLSLIVQATTLNQQLLEEEREQLLSEVQERMTLSGQLEPILADNNTA 179 T L L++WR SL+ L Q+L E ER +L E++E++ S +LE +L Sbjct: 120 PTEGARRLMLEKWRASLVGALLRLQQELAEAERLRLQQELEEQIGASEELEQVLDPQRRT 179 Query: 180 AGRLWDMSAGQLKRGDYQLIVKYGEFLNEQPELKRLAEQLGRSREAKSIPRNDAQMETFR 239 AG LW+++ G+ + LI +Y L ++P L+ +A+ +GRS + Sbjct: 180 AGGLWNLAQGRWQPASLVLIRQYAAMLRKEPMLQEIADSMGRS--LHDSEQLQRPQPPQP 237 Query: 240 TMVREP---ATVPEQVDGLQQSDDILRLLPPELATLGITELEYEFYRRLVEKQLLTYRLH 296 T+++EP VP+ + G+ ++D++R+LP E LG+ ELE EFYRR +E++LL+Y+ Sbjct: 238 TLIQEPVLSDDVPDDLVGIHPANDLMRMLPSEAVMLGVPELELEFYRRYLERRLLSYQAR 297 Query: 297 GESWREKVIERPVVHKDYDEQPRGPFIVCVDTSGSMGGFNEQCAKAFCLALMRIALAENR 356 G R +++ R D + QP GP IVC+DTSGSMGG+ EQCAKA LAL+++AL E R Sbjct: 298 GTLPRHQLLPRTTDRGDQELQPMGPVIVCIDTSGSMGGYPEQCAKALALALLQLALTEQR 357 Query: 357 RCYIMLFSTEIVRYELSGPQGIEQAIRFLSQQFRGGTDLASCFRAIMERLQSREWFDADA 416 RC++MLFST++ +EL+ G+++A RFL+ F GGTDL C A +++LQ+ + AD Sbjct: 358 RCFVMLFSTDVATFELTDANGLDEAQRFLAMTFNGGTDLLPCLSATLQQLQAPGFELADV 417 Query: 417 VVISDFIAQRLPDDVTSKVKELQRVHQHRFHAVAMSAHGKPGIMRIFDHIWRFD 470 +VISDFIAQRLP + + + QR RF+AVAMS H KP ++R+FD W D Sbjct: 418 LVISDFIAQRLPASLVELM-DRQRGRGTRFNAVAMSRHAKPALLRVFDKSWLLD 470 >UniRef50_A6FIY2 Uncharacterized protein containing a von Willebrand factor type A(VWA) domain n=1 Tax=Moritella sp. PE36 RepID=A6FIY2_9GAMM Length = 469 Score = 412 bits (1060), Expect = e-113, Method: Composition-based stats. Identities = 156/466 (33%), Positives = 258/466 (55%), Gaps = 14/466 (3%) Query: 17 GLIEEMIIALLASPQLAVFFEKFPRLKAAITDDVPRWREALRSRLKDARVPPELTEEVMC 76 L+++ ++ + A ++ +K A DD W++ + S L D +P L+ E+ Sbjct: 13 QLVDDALLDITAHDRVTS------DIKLAYIDD---WKQQIMSLLADMPLPAGLSNEIHL 63 Query: 77 YQQSQLLSTPQFIVQLPQILDLLHRLNSPWAEQARQLVDANSTI-TSALHTLFLQRWRLS 135 + ++LLS F ++ IL + + S + + N ++ + +FL W+ + Sbjct: 64 CETARLLSPSNFRNKVEGILSKI-KAESAFYNTGLTIYQQNRSMPDNVFFAVFLDSWQQA 122 Query: 136 LIVQATTLNQQLLEEEREQLLSEVQERMTLSGQLEPILADNNTAAG-RLWDMSAGQLKRG 194 + + +L+EE+REQLL E+ ER QLE +L + G RLWD++ G+L Sbjct: 123 IELLLYQEQSRLIEEKREQLLIELAEREETIEQLEDVLDSDLLCNGERLWDLAKGKLTHL 182 Query: 195 DYQLIVKYGEFLNEQPELKRLAEQLGRSREAKSIPRNDAQM-ETFRTMVREPATVPEQVD 253 D +L+ +Y L + ++K++A +LGR A P ET+ VP+ + Sbjct: 183 DTKLLQRYAVNLRKNKDVKKIASELGRMALAHINPEETPNSYETWVLDNSYQDNVPDDMQ 242 Query: 254 GLQQSDDILRLLPPELATLGITELEYEFYRRLVEKQLLTYRLHGESWREKVIERPVVHKD 313 G+ SD+I R+L E L ELE FY+R +E+ LLTY+ G + K + + D Sbjct: 243 GVTYSDEISRMLQTEAVNLTFPELEIIFYKRYIERHLLTYQYQGALQQYKKVTQYRDITD 302 Query: 314 YDEQPRGPFIVCVDTSGSMGGFNEQCAKAFCLALMRIALAENRRCYIMLFSTEIVRYELS 373 DEQ GPFI+CVD+S SM GF E AK+ C AL++IA + R+CY+M+FS E++ + ++ Sbjct: 303 ADEQTGGPFIICVDSSTSMHGFPELTAKSICYALLQIAFEQRRQCYLMMFSNEVITFPVT 362 Query: 374 GPQGIEQAIRFLSQQFRGGTDLASCFRAIMERLQSREWFDADAVVISDFIAQRLPDDVTS 433 + + FLS FRGGTDL +E + S ++ +AD +VISDFIAQ+LP V Sbjct: 363 QSTSLSTMLTFLSSSFRGGTDLQPVIEKSLELMSSAQYKNADTIVISDFIAQKLPTHVAD 422 Query: 434 KVKELQRVHQHRFHAVAMSAHGKPGIMRIFDHIWRFDTGMRSRLLR 479 KV+ ++ ++R+HA+++S+ G P +M+IFDH+WR+ G+ RL + Sbjct: 423 KVRAIK-AQKNRYHAISLSSQGNPELMKIFDHVWRYSAGLTGRLKK 467 >UniRef50_Q6LJM7 Putative uncharacterized protein n=1 Tax=Photobacterium profundum RepID=Q6LJM7_PHOPR Length = 492 Score = 366 bits (939), Expect = 1e-99, Method: Composition-based stats. Identities = 109/414 (26%), Positives = 191/414 (46%), Gaps = 19/414 (4%) Query: 69 ELTEEVMCYQ----QSQLLSTPQF-IVQLPQILDLLHRLNSPWAEQARQLVDANSTITSA 123 +++ YQ Q+QL S ++ + +L + D +++LN +Q R++ T Sbjct: 73 RFKNDIISYQRFISQAQLPSDKKYWLKELTVLDDKVNKLN----QQKRKI-----TAIKT 123 Query: 124 LHTLFLQRWRLSLIVQATTLNQQLLEEEREQLLSEVQERMTLSGQLEPILADNNTAAGRL 183 H L WR + + + + +E+ LSE+ + + L ++ G L Sbjct: 124 KHNHLLTHWRKQYDKAHSKWQLEAIRQFQEKFLSELNDWLEQIKILSEVVESLGLEPGYL 183 Query: 184 WDMSAGQLKRGDYQLIVKYGEFLNEQPELKRLAEQLGRSREAKSIPRNDAQMETFRTMVR 243 D S G+L D + + K+ E+L +K L E LG+ R+ + + +T Sbjct: 184 LDFSEGKLTLSDVEKLKKWAEYLPNDEGVKSLCEMLGKLRQVTLSDKIETIKKTINMPEM 243 Query: 244 E-PATVPEQVDGLQQSDDILRLLPPELATLGITELEYEFYRRLVEKQLLTYRLHGESWRE 302 +++ GL+ D+ +LP ELA + E F + +E L+ + + G S Sbjct: 244 VFDGDSKQEIVGLKLGKDLEHVLPSELALMSDPETSILFDLKYLESSLMCFDMAGISIDH 303 Query: 303 KVIERPVVHKDYDEQPRGPFIVCVDTSGSMGGFNEQCAKAFCLALMRIALAENRRCYIML 362 E+ V E +GP ++C+DTSGSM G E AKA L L A E R CY++ Sbjct: 304 --AEQVVEQSIQKEDKKGPMVICIDTSGSMHGSPETIAKALSLYLTTQAKKEQRDCYLIN 361 Query: 363 FSTEIVRYELSGPQGIEQAIRFLSQQFRGGTDLASCFRAIMERLQSREWFDADAVVISDF 422 ST I +LS + + FL + F GGTD+A R + +++ + +AD ++ISDF Sbjct: 362 ISTSIEILDLSQGYSLSSLLTFLQKSFHGGTDVAPAMRHGINIMKNDAYENADMLIISDF 421 Query: 423 IAQRLPDDVTSKVKELQRVHQHRFHAVAMSA-HGKPGIMRIFDHIWRFDTGMRS 475 + LP+D V++ QR+ +RF+++ + + FD W ++ S Sbjct: 422 VMSSLPNDCLELVEQ-QRIKGNRFYSLCIGNAFMTNRLKTHFDSEWVYNPSNSS 474 >UniRef50_A1SXM0 von Willebrand factor, type A n=2 Tax=Alteromonadales RepID=A1SXM0_PSYIN Length = 529 Score = 364 bits (934), Expect = 4e-99, Method: Composition-based stats. Identities = 97/391 (24%), Positives = 173/391 (44%), Gaps = 9/391 (2%) Query: 93 PQILDLLHRLNSPWA----EQARQLVDANSTITSAL--HTLFLQRWRLSLIVQATTLNQQ 146 P++ + + +A + + L + + L T L WR + + Sbjct: 118 PKVENKHSKDQRYFAMVSTSKRKTLQNKVTAALDPLLTRTHLLGEWRKQIEQKRVEWELN 177 Query: 147 LLEEEREQLLSEVQERMTLSGQLEPILADNNTAAGRLWDMSAGQLKRGDYQLIVKYGEFL 206 L+ + +++ L +++E + L + D G D S G+L D + I K+ ++ Sbjct: 178 LIHKLQQKFLEKMEEWLRYLSALINSIDDIGFDLGYFLDFSKGELSESDIEQIKKWLNYI 237 Query: 207 NEQPELKRLAEQLGRSREAKSIPRNDAQMETFRTMVR-EPATVPEQVDGLQQSDDILRLL 265 + L + LG+ R+ + + + + + E++ G++ ++ +L Sbjct: 238 QNDKGAQLLCDLLGKIRQVSHSDKIEIANKIIDVPSQYIDSNSKEEIVGIKLGQELEHVL 297 Query: 266 PPELATLGITELEYEFYRRLVEKQLLTYRLHGESWREKVIERPVVHKDYDEQPRGPFIVC 325 P E A L F + +E +L+ + + G IE +E +GP ++C Sbjct: 298 PSEFALLSDPSTSILFDLKYIESRLMCFDMVGIQNSVDQIEIEEEVTVQEENTKGPMVIC 357 Query: 326 VDTSGSMGGFNEQCAKAFCLALMRIALAENRRCYIMLFSTEIVRYELSGPQGIEQAIRFL 385 VDTSGSM G E AKA L L A E R CY++ FST I +LSG I+ I FL Sbjct: 358 VDTSGSMHGSPEAIAKAVTLFLSSTAQKEKRDCYLINFSTSIETLDLSGNYSIKTLIDFL 417 Query: 386 SQQFRGGTDLASCFRAIMERLQSREWFDADAVVISDFIAQRLPDDVTSKVKELQRVHQHR 445 + F GGTD+A ++ +Q+ + +AD ++ISDF+ LPD + L R +R Sbjct: 418 RKSFHGGTDVAPAINHGLKVMQNDTYENADMLIISDFVMSYLPDKTVKNIGVL-RESGNR 476 Query: 446 FHAVAMSA-HGKPGIMRIFDHIWRFDTGMRS 475 F+++ + + IFD W ++ S Sbjct: 477 FYSLCIGNAFMSNRLSAIFDREWIYNPATTS 507 >UniRef50_A2SS27 von Willebrand factor, type A n=1 Tax=Methanocorpusculum labreanum Z RepID=A2SS27_METLZ Length = 492 Score = 361 bits (927), Expect = 3e-98, Method: Composition-based stats. Identities = 99/367 (26%), Positives = 172/367 (46%), Gaps = 3/367 (0%) Query: 107 AEQARQLVDANSTITSALHTLFLQRWRLSLIVQATTLNQQLLEEEREQLLSEVQERMTLS 166 + + ++ S + + WR Q + ++ R+QLL ++E Sbjct: 108 NQTGERKDESIIENLSLIRRNEQEAWRKEYEKQLLEWQLEEIQNRRKQLLQNLKEWFETI 167 Query: 167 GQLEPILADNNTAAGRLWDMSAGQLKRGDYQLIVKYGEFLNEQPELKRLAEQLGRSREAK 226 Q++ + G WD+S G+L D ++ K+ ++L +++ L E +GR + + Sbjct: 168 QQMKEVFEALGVDTGVFWDLSVGKLSAQDISVLKKWADYLKYDEKIRELCELMGRLHKEQ 227 Query: 227 SIPRNDAQMETFRTMVREPAT-VPEQVDGLQQSDDILRLLPPELATLGITELEYEFYRRL 285 + T + V++P E++ G++ D+ ++P ELA L E+ F + Sbjct: 228 QSHHTEIINSTIQYHVKKPDVHSNEEIIGIKFGRDLENIIPQELALLSDPEVTLLFDLKY 287 Query: 286 VEKQLLTYRLHGESWREKVIERPVVHKDYDEQPRGPFIVCVDTSGSMGGFNEQCAKAFCL 345 VE +L+ + G DE+ GP I+CVDTSGSM G E AKA L Sbjct: 288 VENRLMCFSKQGYITEIIEENMQETVNVDDEEKMGPIIICVDTSGSMSGAPENIAKALTL 347 Query: 346 ALMRIALAENRRCYIMLFSTEIVRYELSGPQGIEQAIRFLSQQFRGGTDLASCFRAIMER 405 +L A+++ R CY++ FST I + + P+GI I FL F GGTD+A + Sbjct: 348 SLASRAISQKRNCYLINFSTSINTLDFTPPKGIHDLINFLKMSFHGGTDVAPALYEGIRM 407 Query: 406 LQSREWFDADAVVISDFIAQRLPDDVTSKVKELQRVHQHRFHAVAMSAHGKPGIMR-IFD 464 + ++ AD +VISDF+ L D+ K+ Q+ ++RF A+ + + G + +FD Sbjct: 408 MSESDYKKADLLVISDFVIYGLSSDIVPLCKK-QKQEENRFFALCIGSFGTQRVEDGVFD 466 Query: 465 HIWRFDT 471 W +D Sbjct: 467 QSWTYDP 473 >UniRef50_C6M593 Putative uncharacterized protein n=1 Tax=Neisseria sicca ATCC 29256 RepID=C6M593_NEISI Length = 482 Score = 345 bits (884), Expect = 3e-93, Method: Composition-based stats. Identities = 112/451 (24%), Positives = 203/451 (45%), Gaps = 10/451 (2%) Query: 29 SPQLAVFFEKFPRLKAAITDDVPRWREALRSRLKDARVPPELTEEVMCYQQSQLLSTPQF 88 F++ L I + W + L R P + EE + Q L Sbjct: 19 QTDYQDLFKQHSWLGGQIQQRLFGWAHQTKLDLWQ-RSPFAIHEENLKNHQQTGLFNQSP 77 Query: 89 IVQLPQILDL--LHRLNSPWAEQARQLVDANSTITSALHTLFLQRWRLSLIVQATTLNQQ 146 + + L + W ++ Q S + L ++W+ L + Sbjct: 78 VDDYQRFCKLTGIPFEKDFWQKELAQSKQVKSHKQNLPLKLLTEKWQQQLDQAKAQWQVE 137 Query: 147 LLEEEREQLLSEVQERMTLSGQLEPILADNNTAAGRLWDMSAGQLKRGDYQLIVKYGEFL 206 + + R++LL+++++ + + QL L G G L D + + ++ +L Sbjct: 138 QINQLRQELLTQLKQELEVVKQLSQQLEQLGFGIGD----DIGNLTPQDIEEMKRWLNYL 193 Query: 207 NEQPELKRLAEQLGRSREAKSIPRNDAQMETFRTMVREPA-TVPEQVDGLQQSDDILRLL 265 + +++AE LG+ R+ + + + +T + E++ GL+ D+ +L Sbjct: 194 TQDKNAQQIAELLGKMRQIEQSEKIEQVKQTVYIQNPQIDINSREEIIGLRLGKDLEYVL 253 Query: 266 PPELATLGITELEYEFYRRLVEKQLLTYRLHGESWREKVIERPVVHKDYDEQPRGPFIVC 325 P ELA + E F + +E +L+ + L G ++ + E V K +++ GP I+C Sbjct: 254 PSELALMADEETSILFDLKFLESKLMCFELQGMTYCDAPTEIIVEQKSQEDEKPGPMILC 313 Query: 326 VDTSGSMGGFNEQCAKAFCLALMRIALAENRRCYIMLFSTEIVRYELSGPQGIEQAIRFL 385 VDTSGSM G E AKA L L A +ENR C+++ FST I +EL+ GI I FL Sbjct: 314 VDTSGSMNGLPENIAKAMALFLGTKAKSENRSCFVINFSTGIETFELTSKTGISNLIAFL 373 Query: 386 SQQFRGGTDLASCFRAIMERLQSREWFDADAVVISDFIAQRLPDDVTSKVKELQRVHQHR 445 Q F GGTD A R ++ ++ + AD ++ISDF+ LPDD+ + + E+QR ++ Sbjct: 374 RQSFHGGTDAAPALRHALKMMEQESYQKADLLMISDFVMNGLPDDLLASI-EIQRETGNQ 432 Query: 446 FHAVAMSA-HGKPGIMRIFDHIWRFDTGMRS 475 F+++ + + FD W ++ +++ Sbjct: 433 FNSLVIGDAFMSKRLKTHFDREWIYNPNVQT 463 >UniRef50_C1Q9X6 Uncharacterized protein containing a von Willebrand factor type A (VWA) domain n=2 Tax=Brachyspira RepID=C1Q9X6_9SPIR Length = 529 Score = 343 bits (880), Expect = 7e-93, Method: Composition-based stats. Identities = 101/372 (27%), Positives = 177/372 (47%), Gaps = 5/372 (1%) Query: 107 AEQARQLVDANSTITSALHTLFLQRWRLSLIVQATTLNQQLLEEEREQLLSEVQERMTLS 166 + + + T A+ L W+ SL + + +++ RE+ +++E + Sbjct: 145 NQDSEDIKIQKDTDIKAIRKSVLDNWKNSLDNKYIDWSLNEIDKFREEFFKQIKEFLDYL 204 Query: 167 GQLEPILADNNTAAGRLWDMSAGQLKRGDYQLIVKYGEFLNEQPELKRLAEQLGRSREAK 226 + + G L+D+S G L + D + I + + +K+L + LGR + + Sbjct: 205 KDIMELENALGEETGSLFDLSLGNLLKRDIEYIKQLANLIKSNENIKKLCDMLGRFVKEE 264 Query: 227 SIPRNDA--QMETFRTMVREPATVPEQVDGLQQSDDILRLLPPELATLGITELEYEFYRR 284 R + + ETF T VR+ +++ G+ S DI +LP E L LE F + Sbjct: 265 ESYRIEKVLRKETFHTSVRDI-NSEDEIVGITYSRDIHNILPQEKLLLAEGVLETLFGVK 323 Query: 285 LVEKQLLTYRLHGESWREKVIERPVVHKDYDEQPRGPFIVCVDTSGSMGGFNEQCAKAFC 344 E +LLT++ G + K ++ +GP I+CVDTSGSM G E AKA Sbjct: 324 YFENRLLTFKKEGYTDYYYDEMIEDEMKVVEDDKKGPIIICVDTSGSMSGVPETVAKAVT 383 Query: 345 LALMRIALAENRRCYIMLFSTEIVRYELSGPQGIEQAIRFLSQQFRGGTDLASCFRAIME 404 L L A+ + R CY++ FST+I +L+ P ++ I FL F GGTD R ++ Sbjct: 384 LYLASRAMKQKRNCYLINFSTQIETMDLTYPNTMDNLIEFLRLSFNGGTDAVPALRHAIK 443 Query: 405 RLQSREWFDADAVVISDFIAQRLPDDVTSKVKELQRVHQHRFHAVAMSAHGKPGIMR-IF 463 + + + +D + ISDF+ DD K+ E Q+ +++RF+++ + + + IF Sbjct: 444 TMNTENYKKSDLLFISDFVFNGFTDD-DYKLAEAQKKNENRFYSLIIGSTPLFNVKNSIF 502 Query: 464 DHIWRFDTGMRS 475 D+ W +D+ S Sbjct: 503 DYNWCYDSSRGS 514 >UniRef50_Q14PC7 Hypothetical two-component regulator system yiem receptor component protein n=1 Tax=Spiroplasma citri RepID=Q14PC7_SPICI Length = 519 Score = 330 bits (846), Expect = 7e-89, Method: Composition-based stats. Identities = 104/447 (23%), Positives = 207/447 (46%), Gaps = 21/447 (4%) Query: 55 EALRSRLKDARVPPELTEEVMCYQQSQLLSTPQFIVQLPQILDLLHRLNSPWAEQAR--- 111 + + + ++ + +E+ + ++ D L ++NSP+ ++ Sbjct: 54 HQIENEIMKIKLDSNIEKEIYLFHWIKVNGYDGLKKNYASTQDFLFKVNSPFYDRLNYYR 113 Query: 112 -QLVDANSTITSALHTLFLQRWRLSLIVQATTLNQQLLEEEREQLLSEVQERMTLSGQLE 170 + + T+ L F+ W LI + +EE R + + ++ ++ + + Sbjct: 114 YEFNKKQNNNTNMLFRDFIGIWESILIKRINDYRFAKIEELRTKFMQDLYNKVEIYNKAN 173 Query: 171 PILADNNTAAGRLWDMSAGQLKR-GDYQLIVKYGEFLNEQPELKRLAEQLGRSREAKSIP 229 +L G++W+ +LK+ + I K+ +FL P + +A LGR + ++ Sbjct: 174 SLLKTVWNFFGKIWN--PTELKKGVNMSAIDKFAKFLETNPAIMEIATLLGRFQGESNLI 231 Query: 230 RNDAQMETFRTMVREP-ATVPEQVDGLQQSDDILRLLPPELATLGITELEYEFYRRLVEK 288 E +P + PE++ G +S D+ + EL L L+Y FY++ +E Sbjct: 232 EQRILEEIVMDYEWKPIGSSPEEIIGATESKDLEHMFAAELVLLKDPVLKYIFYKKYIEG 291 Query: 289 QLLTYRLHGESWREKVIERPVVHKDYDEQPRGPFIVCVDTSGSMGGFNEQCAKAFCLALM 348 +L T+ + K + + Y + +GP I+ +DTS SM G EQ AKA LA+ Sbjct: 292 KLTTFEFLSQDKVPKEQIKLRTIETYVPEEKGPIILSIDTSSSMRGSPEQIAKALALAIA 351 Query: 349 RIALAENRRCYIMLFSTEIVRYELSG-PQGIEQAIRFLSQQFRGGTDLASCFRAIMERLQ 407 +IAL E+R CY++ FS + Y LS + + I FLS+ F G T++ + + Sbjct: 352 KIALGEHRPCYMINFSKSLDVYNLSSLKDSLPKLIEFLSKSFAGDTNVEPALEHTLTVMD 411 Query: 408 SREWFDADAVVISDFIAQRLPDDVTSKVKELQRVHQHRFHAVAMSAHGKPGIMRIFDHIW 467 S E+F+AD ++ISDF+ L ++ +K+ L++ +RFHA+ + G + IF++ W Sbjct: 412 SNEYFNADLLLISDFLTSDLSPELITKINLLKQRR-NRFHAIVIGTMGAENVETIFNNAW 470 Query: 468 RFDT-----------GMRSRLLRRWRR 483 +D + ++++++ + Sbjct: 471 IYDPRDPFASELIIASLTGQIVKKYDK 497 >UniRef50_A6DQA4 Putative uncharacterized protein n=1 Tax=Lentisphaera araneosa HTCC2155 RepID=A6DQA4_9BACT Length = 479 Score = 328 bits (842), Expect = 2e-88, Method: Composition-based stats. Identities = 97/388 (25%), Positives = 176/388 (45%), Gaps = 20/388 (5%) Query: 98 LLHRLNSPWAEQARQLVDANSTITSALHTLFLQRWRLSLIVQATTLNQQLLEEEREQLLS 157 L L S W ++ Q+ D + + L + +W SL +T Q+ L+ + + Sbjct: 89 QLPDLKSYWNQELSQINDQPGNV-NVLPQFLISQWHKSLRDLQSTWKQERLDNLQNETQQ 147 Query: 158 EVQERMTLSGQLEPILADNNTAAGRLWDMS---------AGQLKRGDYQLIVKYGEFLNE 208 E+ + M + L + ++D + +L + + + K+ L + Sbjct: 148 EMNDWMDNLNDIADELEKLDLDPEDVFDFASGAGAGGDGPSELSIQNLETLKKWLGTLKK 207 Query: 209 QPELKRLAEQLGRSREAKSIPRNDAQMETFRTMVREPATVPEQVDGLQQSDDILRLLPPE 268 P +K L + LG+ ++AK N + + + E++ G++ S ++ LLP E Sbjct: 208 DPGIKDLCKLLGKLKQAK---LNKIKRSRTTSSSVSSSNSCEEISGIKFSKELEHLLPSE 264 Query: 269 LATLGITELEYEFYRRLVEKQLLTYRLHGESWREKVIERPVVHKDYDEQPRGPFIVCVDT 328 LA L E E F + E +L+ + + G K E + ++ +GP I+ VDT Sbjct: 265 LALLTDPETEIIFDLKYAESRLMGFDMSGIQTVSKKEEIEM-----PDEEQGPMIIAVDT 319 Query: 329 SGSMGGFNEQCAKAFCLALMRIALAENRRCYIMLFSTEIVRYELSGPQGIEQAIRFLSQQ 388 SGSM G E AKA L + + ++ ENR CY++ FST+I +L+G + ++FL Sbjct: 320 SGSMYGAPETTAKAITLYMAKTSMKENRNCYVIEFSTKIKTIDLAGSNRLSALMKFLEMS 379 Query: 389 FRGGTDLASCFRAIMERLQSREWFDADAVVISDFIAQRLPDDVTSKVKELQRVHQHRFHA 448 F GGTD+ E + + +AD +V+SDFI L + K+++ + + F++ Sbjct: 380 FNGGTDVEPAIEHGTEVMNQEGYRNADMLVVSDFILNDLEPPLVDKIQQAK-AKNNSFYS 438 Query: 449 VAMSAHGKPGIMR-IFDHIWRFDTGMRS 475 + + H R FD W ++ S Sbjct: 439 LCIGDHFHSHKNREYFDRKWVYNPDGSS 466 >UniRef50_Q1ZC32 Putative uncharacterized protein (Fragment) n=1 Tax=Psychromonas sp. CNPT3 RepID=Q1ZC32_9GAMM Length = 328 Score = 319 bits (817), Expect = 2e-85, Method: Composition-based stats. Identities = 110/325 (33%), Positives = 184/325 (56%), Gaps = 2/325 (0%) Query: 156 LSEVQERMTLSGQLEPILADNN-TAAGRLWDMSAGQLKRGDYQLIVKYGEFLNEQPELKR 214 L ++ +R +L I A+ N + RLWDM+ +L + + Q + + + L++ EL++ Sbjct: 1 LKDLYQRQETISKLTEIDANINPQNSMRLWDMAKAKLTKINVQTLKRTAKLLSKHSELQK 60 Query: 215 LAEQLGRSREAKSIPRNDAQMETFRTMVREPATVPEQVDGLQQSDDILRLLPPELATLGI 274 +A+QLGR P + R + + + G++QS D+ RLLP EL L Sbjct: 61 IADQLGRMANQHDDPCLNRTEVHSRRIKESTSPFTGDIVGIKQSADLERLLPIELMFLSD 120 Query: 275 TELEYEFYRRLVEKQLLTYRLHGESWREKVIERPVVHKDYDEQPRGPFIVCVDTSGSMGG 334 +EL+ FY+ L+EK+L TY+ + + I + EQ +GPFI+ +D SGSM G Sbjct: 121 SELDVLFYKNLIEKRLSTYQQQNKHNEFEQITQFKQQPKKAEQDKGPFIIAIDASGSMMG 180 Query: 335 FNEQCAKAFCLALMRIALAENRRCYIMLFSTEIVRYELSGPQGIEQAIRFLSQQFRGGTD 394 E+CAKAF LM+IALA+NR CY++LFS + + YELS G+ + + FLS F GGTD Sbjct: 181 SAEKCAKAFAYGLMKIALAQNRECYVILFSAQQITYELSNQHGLSEILNFLSYSFHGGTD 240 Query: 395 LASCFRAIMERLQSREWFDADAVVISDFIAQRLPDDVTSKVKELQRVHQHRFHAVAMSAH 454 L S + + +++ ++ +AD +VISDFI + K+ +L+ +RFHA+++S + Sbjct: 241 LTSVLESAFKVMETEKYKNADLIVISDFITPPMSSKTIDKLNKLK-EKSNRFHALSLSRY 299 Query: 455 GKPGIMRIFDHIWRFDTGMRSRLLR 479 ++ +FD W+++ + + R Sbjct: 300 QNTEVLALFDKNWQYNPSKLANIKR 324 >UniRef50_C0QY03 von Willebrand factor type A (VWA) domain containing protein n=1 Tax=Brachyspira hyodysenteriae WA1 RepID=C0QY03_BRAHW Length = 467 Score = 318 bits (816), Expect = 2e-85, Method: Composition-based stats. Identities = 89/355 (25%), Positives = 179/355 (50%), Gaps = 6/355 (1%) Query: 117 NSTITSALHTLFLQRWRLSLIVQATTLNQQLLEEEREQLLSEVQERMTLSGQLEPILADN 176 NS + L + W + + +++ R++ +S+++ + L +L+ + Sbjct: 109 NSDNLNILKKNIISVWEKTYNKKNNDWLVSTVKDRRDKFISDIESWINLLKKLKYMSNIL 168 Query: 177 NTAAGRLWDMSAGQLKRGDYQLIVKYGEFLNEQPELKRLAEQLGRSREAKSIPRNDAQME 236 G LWD G+L+ D L+ ++ +F+NE +++ + + +GR + + +N Sbjct: 169 RIKTGVLWDFRVGELEEADISLLKRWVDFINEYKDIEVICDSIGRRIDIEKSLKNVEFKN 228 Query: 237 TFRTMVREPATVPEQVDGLQQSDDILRLLPPELATLGITELEYEFYRRLVEKQLLTYRLH 296 T+ ++ + E++ G+ + DI ++P EL+ L E E F + +E +L+ + Sbjct: 229 TYSNTNKKI-SSKEEIVGIYFAKDIENVIPEELSLLCNEESEKLFKLKYIENRLMCFDKS 287 Query: 297 GESWREKVIERPVVHKDYDEQPRGPFIVCVDTSGSMGGFNEQCAKAFCLALMRIALAENR 356 + + ER + K + +G I+C+DTSGSM G NE AKA ++ AL+ENR Sbjct: 288 AYVFND---ERDNIVKAGYREGKGDMIICIDTSGSMKGINEYIAKATMFKMVMQALSENR 344 Query: 357 RCYIMLFSTEIVRYELSGPQGIEQAIRFLSQQFRGGTDLASCFRAIMERLQSREWFDADA 416 Y++ FSTEI + + GIE I+FL + GG+D+ + + + +AD Sbjct: 345 NAYLINFSTEIYTCKFTKENGIEDLIKFLKLSYHGGSDIYKALYEANRMMNTSSFRNADV 404 Query: 417 VVISDFIAQRLPDDVTSKVKELQRVHQHRFHAVAMSAHGKP-GIMRIFDHIWRFD 470 +V+SDFI + +P+++ + + Q+ + +++ AV++ ++F+ W FD Sbjct: 405 LVLSDFIMEDMPNNLVTMCSK-QKNNGNKYFAVSIGKFPFGYSYRKVFNRHWIFD 458 >UniRef50_C1Q8W4 Uncharacterized protein containing a von Willebrand factor type A (VWA) domain n=1 Tax=Brachyspira murdochii DSM 12563 RepID=C1Q8W4_9SPIR Length = 478 Score = 314 bits (804), Expect = 5e-84, Method: Composition-based stats. Identities = 101/445 (22%), Positives = 208/445 (46%), Gaps = 19/445 (4%) Query: 38 KFPRLKAAITDDVPRWREALRSRLKDARVPPELTEEVMCYQQSQLLSTPQFIVQLPQILD 97 RL+ + + +W++ L + + E E+ + + + I D Sbjct: 32 NEKRLEEELEVKITKWKKDLNTFTNENNPYDENKTELEISLKKLKVLKNKNIK--TTKED 89 Query: 98 LLHRLNS----------PWAEQARQLVDANSTITSALHTLFLQRWRLSLIVQATTLNQQL 147 +++ LNS + + D+ + + + L W+ + + Sbjct: 90 IINDLNSFSILSDDSSVEFWKDKLHNSDSLNINLNIIKNNILSSWQKTYNKKNNEWLVST 149 Query: 148 LEEEREQLLSEVQERMTLSGQLEPILADNNTAAGRLWDMSAGQLKRGDYQLIVKYGEFLN 207 ++E R++ +S+++ ++L +L + G LWD G+L+ D L+ ++ EF+N Sbjct: 150 VKERRDKFISDIESWISLIKKLRYMSNILRIKTGVLWDFRVGELEENDISLLNRWVEFIN 209 Query: 208 EQP-ELKRLAEQLGRSREAKSIPRNDAQMETFRTMVREPATVPEQVDGLQQSDDILRLLP 266 + E++ + + +G+ + + +N T+ ++ + E++ G+ + DI ++P Sbjct: 210 KYKKEIETICDSIGKRVDIEKALKNIEFKNTYSYTNKKI-SSKEEIVGIYFAKDIENVVP 268 Query: 267 PELATLGITELEYEFYRRLVEKQLLTYRLHGESWREKVIERPVVHKDYDEQPRGPFIVCV 326 EL+ L + E F + +E +L+ + + E + + K ++ +G I+C+ Sbjct: 269 EELSLLCDEDSEKLFKLKYIENRLMCFDKSAYVFNENDFD---IVKAGYKEGKGDMIICI 325 Query: 327 DTSGSMGGFNEQCAKAFCLALMRIALAENRRCYIMLFSTEIVRYELSGPQGIEQAIRFLS 386 DTSGSM G +E AKA ++ AL+ENR Y++ FSTEI + GIE I+FL Sbjct: 326 DTSGSMKGTSEYIAKAIMFKMVMQALSENRNAYLINFSTEIYTCRFTKNNGIEDLIKFLK 385 Query: 387 QQFRGGTDLASCFRAIMERLQSREWFDADAVVISDFIAQRLPDDVTSKVKELQRVHQHRF 446 + GG+D+ + + + +AD +V+SDFI + +PD++ K+ QR + ++F Sbjct: 386 LSYHGGSDIYKALYEANRVMNTSSFKNADVLVLSDFIMEDMPDNLV-KICSNQRNNGNKF 444 Query: 447 HAVAMSAHGKP-GIMRIFDHIWRFD 470 AV++ ++F+ W FD Sbjct: 445 FAVSIGKFPFGYSYKKVFNRHWIFD 469 >UniRef50_C3XEK2 Putative uncharacterized protein n=1 Tax=Helicobacter bilis ATCC 43879 RepID=C3XEK2_9HELI Length = 493 Score = 313 bits (802), Expect = 8e-84, Method: Composition-based stats. Identities = 118/482 (24%), Positives = 198/482 (41%), Gaps = 45/482 (9%) Query: 29 SPQLAVFFEKFPRLKAAITDDVPRWREAL---RSRLKDARVPPELTE---EVMCYQ---- 78 LA E++ + ++ D P +E + R +L + L + E Y Sbjct: 7 QEHLAKAHEEYQAQQDSLQDSHPFKKEEVAHHRRKLTPTQDIKTLKDDIKEFDTYNKKHK 66 Query: 79 -------------QSQLLSTPQFIVQLPQILDLLHRLNSPWAEQARQLVDANSTITSALH 125 L Q+ + + + + E+ +Q D +T L Sbjct: 67 GTLDSLLNKQEQNDITLTDKRQYTLNAWEQMLKSKKNEYIEQEKLKQTRDYKQRLTDYLE 126 Query: 126 TLFLQRWRLSLIVQATTLNQQLLEEEREQL----LSEVQERMTLSGQLEPILADNNTAAG 181 +L + LS + A L +L+E ++ L L + + L GQ + G Sbjct: 127 SLLETKEFLSNLGGAGELFSGVLDEMKQGLDVSNLGDEAYQNKLKGQRIEM------PGG 180 Query: 182 RLWDMSAGQLKRGDYQLIVKYGEFLNEQPELKRLAEQLGRSREAKSIPRNDAQMETFR-- 239 + D AG R D I +Y + LK +AE LGR + + E Sbjct: 181 KGTDNGAGIRNRIDINTIKQYFNTIQNSKALKEIAELLGRLEKEEEESEIQKIKELKSYS 240 Query: 240 -TMVREPATVPEQVDGLQQSDDILRLLPPELATLGITELEYEFYRRLVEKQLLTYRLHGE 298 T V E++ G+ D+ LLP ELA L LE F + ++ +L + G Sbjct: 241 YTQVIPTKRYKEEICGVTLGRDLENLLPQELAMLEDETLELLFDLKYIQNRLFCFEKQGY 300 Query: 299 SWREKVIERPVVH------KDYDEQPRGPFIVCVDTSGSMGGFNEQCAKAFCLALMRIAL 352 + + + + E+ G I+CVDTSGSM G E AKA L L A Sbjct: 301 HSITQEAQEEIEKEIETKKQKKREKNEGAIIICVDTSGSMYGNPEYIAKALTLFLATKAN 360 Query: 353 AENRRCYIMLFSTEIVRYELSGPQGIEQAIRFLSQQFRGGTDLASCFRAIMERLQSREWF 412 + R CY++ FS I ELSG G+ + ++FL F GGTD+A +A ++ +Q ++ Sbjct: 361 TQKRACYLINFSIGIETMELSGKGGMAKLMQFLEMSFGGGTDVAPALKAGLKTMQQDDFK 420 Query: 413 DADAVVISDFIAQRLPDDVTSKVKELQRVHQHRFHAVAMSAHGKPGIMRIFDHIWRFDTG 472 +D +VISD +P+D+ +++ QR ++F+ + + +G G FD W ++ Sbjct: 421 KSDLIVISDGGFGYIPNDLEKQMQN-QRQKDNKFYLLDI--NGNSGKKTFFDKHWIYNAQ 477 Query: 473 MR 474 + Sbjct: 478 TQ 479 >UniRef50_A4S4M8 Predicted protein n=4 Tax=Mamiellales RepID=A4S4M8_OSTLU Length = 535 Score = 280 bits (716), Expect = 8e-74, Method: Composition-based stats. Identities = 90/336 (26%), Positives = 151/336 (44%), Gaps = 12/336 (3%) Query: 145 QQLLEEEREQLLSEVQERMTLSGQLEPILADNNTAAGRLWDMSAGQLKRGDYQLIVKYGE 204 +L+EE +EQ + + + E + D+ +D++ G ++ ++ + + Sbjct: 194 SRLMEEFKEQWEPAMDKLDKAAKAFEGLDLDDLADGPEGFDLTRGLWQQTGWKELDSLRK 253 Query: 205 FLNEQPELKRLAEQLGRSREAKSIPRNDAQMETFR---TMVREPATVPEQVDGLQQSDDI 261 L + EL+ + LGR + R Q E +VR P PEQ GL +SDD+ Sbjct: 254 KLQDLKELRDMVRSLGRGSGRGPLRRAPRQRERQGFPIGLVRSPME-PEQTSGLCRSDDL 312 Query: 262 LRLLPPELATLG--ITELEYEFYRRLVEKQLLTYRLHGESWREKVIERPVVHKDYDEQPR 319 R++P E+ L + + + R E+ LL+Y G W E+ + Sbjct: 313 SRMMPSEMVLLASSLPQARLLHFARRAERTLLSYERVG--WSEEPAVTVEGFETRPAAEC 370 Query: 320 GPFIVCVDTSGSMGGFNEQCAKAFCLALMRIALAENRRCYIMLFSTEIVRYELS---GPQ 376 GP IVC+DTSGSM G E AKA L MR + ++ R CY+ FS EL Sbjct: 371 GPIIVCLDTSGSMMGARETVAKAMVLECMRQSRSQQRACYLYSFSGPGDCQELELKLNAA 430 Query: 377 GIEQAIRFLSQQFRGGTDLASCFRAIMERLQSREWFDADAVVISDFIAQRLPDDVTSKVK 436 G+ + FLS F GGTD+ F + RL EW +AD ++++D + + + + + Sbjct: 431 GLYGLLEFLSGSFHGGTDVDEPFNRALARLNEAEWSNADILLVTDGEIKPPDETLIANLN 490 Query: 437 ELQRVHQHRFHAVAMSAHGKPGIMR-IFDHIWRFDT 471 E + + H + + G ++ I H+ F + Sbjct: 491 EAKEEMGLKVHGLLVGDAGNAEVVESICTHVHAFKS 526 >UniRef50_D2U0I8 Putative uncharacterized protein n=1 Tax=Arsenophonus nasoniae RepID=D2U0I8_9ENTR Length = 330 Score = 280 bits (716), Expect = 9e-74, Method: Composition-based stats. Identities = 135/316 (42%), Positives = 215/316 (68%), Gaps = 3/316 (0%) Query: 1 MLTLDTLNVMLAVSEEGLIEEMIIALLASPQLAVFFEKFPRLKAAITDDVPRWREALRSR 60 ML + T++++L+++E LIEE+++ LLA+PQL +FFEK+P LK+ + +D+ W++ L + Sbjct: 1 MLNIATIDMLLSINELELIEEIVLTLLATPQLVIFFEKYPNLKSILLNDLLAWKKNLYRQ 60 Query: 61 LKDARVPPELTEEVMCYQQSQLLSTPQFIVQLPQILDLLHRLNSPWAEQARQLVDANSTI 120 L++ VP +LTEE YQQ+ + T +F LP ++ L + S + ++A L + S Sbjct: 61 LQETLVPIKLTEEFALYQQNLAIDTTKFFSNLPVTINKLTEIASTFVQEANYLQERISH- 119 Query: 121 TSALHTLFLQRWRLSLIVQATTLNQQLLEEEREQLLSEVQERMTLSGQLEPILADNNTAA 180 A +LF+QRWRL+LI++ TT N+ LLE E+EQLL+E+++R+ L+G L +N + Sbjct: 120 DPAGQSLFIQRWRLNLIIEVTTFNKLLLEREKEQLLAELEQRLKLTGNLIETFNQDNHSV 179 Query: 181 GRLWDMSAGQLKR--GDYQLIVKYGEFLNEQPELKRLAEQLGRSREAKSIPRNDAQMETF 238 G+LWD+S G L + + QL+++Y FL +QPEL++LAE LGR + K + +E+ Sbjct: 180 GKLWDISKGVLTQSSNNIQLLIQYSHFLQQQPELEKLAELLGRRQSLKPKQKQQQMLESI 239 Query: 239 RTMVREPATVPEQVDGLQQSDDILRLLPPELATLGITELEYEFYRRLVEKQLLTYRLHGE 298 ++ + P +PEQ+ G+ +DILRLLP ELA LG+ ELE+EFYR+LVEKQLLTYRL G+ Sbjct: 240 ISVEKIPDQIPEQISGINHGNDILRLLPSELALLGLEELEFEFYRKLVEKQLLTYRLQGD 299 Query: 299 SWREKVIERPVVHKDY 314 +W+++ I RP + + Sbjct: 300 NWQQRKILRPAIKYER 315 >UniRef50_A4Y9K4 von Willebrand factor, type A n=3 Tax=Shewanella RepID=A4Y9K4_SHEPC Length = 528 Score = 278 bits (712), Expect = 3e-73, Method: Composition-based stats. Identities = 92/327 (28%), Positives = 156/327 (47%), Gaps = 15/327 (4%) Query: 161 ERMTLSGQLEPILADNNTAAGRLWDMSAGQLKRGDYQLIVKYGEFLNEQPELKRLAEQLG 220 ER+ + +LE + D G WD+S G + + +V+ + + + P+L+ + E LG Sbjct: 196 ERLEIWQELEEVFTDLGLLTGLGWDLSQGLFQSHGWMNLVRLQKIVKQIPQLREVIETLG 255 Query: 221 RSREAKSIPRNDAQMETFRTMVRE-----PATVPEQVDGLQQSDDILRLLPPELATLGIT 275 ++ + P + + + R VP + G+ +SD I R+LP E A G Sbjct: 256 SMKDTEGEPIIEEIISRMSVIFRHEVEVTTPLVPMETKGITRSDSISRMLPQEAAFFGHP 315 Query: 276 ELEYEFYRRLVEKQLLTYRLHGESWREKVIERPVVHKDYD-----EQPRGPFIVCVDTSG 330 L+ ++ R E LL+Y + G +V E+ +K+ + RGP IVC+DTSG Sbjct: 316 VLKKLWHARRAEHALLSYAVEGTELITEVTEQEQENKENKAGNKVNRNRGPMIVCLDTSG 375 Query: 331 SMGGFNEQCAKAFCLALMRIALAENRRCYIMLFST--EIVRYELS-GPQGIEQAIRFLSQ 387 SM G E AKA L + +A E R C++ LF + E+ EL+ G+EQ I FLS Sbjct: 376 SMQGTPENVAKALVLQCISVAKKEKRACFVYLFGSKGEVKEMELTPDKAGLEQMILFLSM 435 Query: 388 QFRGGTDLASCFRAIMERLQSREWFDADAVVISDFIAQRLPDDVTSKVKELQRVHQHRFH 447 F GGTD+ +ER ++W AD +++SD + ++ K+ + H Sbjct: 436 SFGGGTDVEGPLNMALERSDEKQWQQADILLVSDGEFS-VSSGLSRKISNRKEQRGMSVH 494 Query: 448 AVAMSAHGKPGIMRIFDHIWRFDTGMR 474 V + P + +I + + +F + + Sbjct: 495 GVVIGGRLSP-MDKICEPLHQFSSWLD 520 >UniRef50_C4KA81 von Willebrand factor type A (VWA) domain-containing protein n=1 Tax=Thauera sp. MZ1T RepID=C4KA81_THASP Length = 581 Score = 274 bits (700), Expect = 7e-72, Method: Composition-based stats. Identities = 97/345 (28%), Positives = 147/345 (42%), Gaps = 35/345 (10%) Query: 157 SEVQERMTLSGQLEPILADNNTAAGRLWDMSAGQLKRGDYQLIVKYGEFLNEQPELKRLA 216 E+ E + G L +L + WD G L+ D++ +++ + PEL R+ Sbjct: 175 GEIDELVGAFGDLGDLLDNAR------WDALRGLLRSTDWREVLRIRALIEGLPELARIL 228 Query: 217 EQLGR----------SREAKSIPRNDAQMETFRTMVREPATVPEQVDGLQQSDDILRLLP 266 LGR SR ++ + + VR P +P + G+Q+S I R+LP Sbjct: 229 RALGRACPTDEDAESSRALHAVVEHTEIQRSVSHRVRVPD-LPGETRGVQRSGRIARMLP 287 Query: 267 PELATLGITELEYEFYRRLVEKQLLTYRLHGESWREKVIERPVVHKDYDEQP-----RGP 321 E LG L ++ R E+ LL Y + + PV+ P +GP Sbjct: 288 AEATLLGHPRLRLVWHARRAERTLLAYEDDDHLQEDCLRPAPVLRPSQRPAPARRLEQGP 347 Query: 322 FIVCVDTSGSMGGFNEQCAKAFCLALMRIALAENRRCYIMLFS--TEIVRYELS-GPQGI 378 +VCVDTSGSM G E AKA L +R A A R C + F E+V EL G+ Sbjct: 348 MLVCVDTSGSMQGGAEAVAKAVVLEAVRCAHARRRACRVYAFGGPDEVVEMELGVDVDGV 407 Query: 379 EQAIRFLSQQFRGGTDLASCFRAIMERLQSREWFDADAVVISDFIAQRLPDDVTSKVKEL 438 + RFL Q F GGTD+ + + RL W AD ++ SD P + ++V+ Sbjct: 408 GRLARFLGQGFGGGTDICAPLERALARLDEAGWQLADLLIASDGEFGATP-ALAARVEAA 466 Query: 439 QRVHQHRFHAVAMSAHGKPGIMRIFDHIWRFDTGMRSRLLRRWRR 483 +R R + + G++ + D I +R WRR Sbjct: 467 RRERGLRVQGILIGDRETIGLLELADDI---------HWVRDWRR 502 >UniRef50_D1YZJ4 Putative uncharacterized protein n=1 Tax=Methanocella paludicola SANAE RepID=D1YZJ4_METPS Length = 506 Score = 267 bits (683), Expect = 6e-70, Method: Composition-based stats. Identities = 99/410 (24%), Positives = 172/410 (41%), Gaps = 55/410 (13%) Query: 92 LPQILDLLHRLNSPWAEQARQLVDANSTIT----SALHTLFLQRWRLSLIVQATTLNQQL 147 L QI D+L S + + ++L A+ L W + ++ Sbjct: 98 LEQIFDMLDDF-SRFEPELKKLARGKMAFYYQQFKAILESTLDLWHRRTSKETPRSTRRA 156 Query: 148 LEEERE---------------QLLSEVQERMTLSGQLEPILADNNTA----------AGR 182 +E+ + + L + + LSG + + + GR Sbjct: 157 AQEKVDIVERVSRFENDKASRKFLDLLAGSVLLSGIMGQVSNVEDHLESLEMLSLLYPGR 216 Query: 183 LWDMSAGQLKRGDYQLIVKYGEFLNEQPELKRLAEQLGRSREAKSIPRNDAQMETFRTMV 242 WD S +L R + + KY + + +LK++ + +GR +ME + Sbjct: 217 GWDRSMLELHRVYFANLHKYSKIVERNEDLKKILDTIGR-----------IEMEYGSRRL 265 Query: 243 REPATVPEQVDGLQQSDDILRLLPPELATLGITELEYEFYRRLVEKQLLTYRLHGESWRE 302 + +V + S D+ +LP E L L F+ +EK+LLTY L G +W Sbjct: 266 SLSSYSHSEVYSVTTSGDLQHMLPVESVKLQDETLRNLFFAHWMEKKLLTYELKGVNWT- 324 Query: 303 KVIERPVVHKDYDEQPRGPFIVCVDTSGSMGGFNEQCAKAFCLALMRIALAENRRCYIML 362 D ++ RGP + VDTSGSM G E AK+ LAL+R + E+R + L Sbjct: 325 ----------DDSKKNRGPMVAMVDTSGSMHGDPEIVAKSIILALVRRMMKESRDVKVYL 374 Query: 363 FSTEIVRYELSGPQGIE---QAIRFLSQQFRGGTDLASCFRAIMERLQSREWFDADAVVI 419 FS+E +E+ + + + FLS F GGTD + R +E L+ +++ +AD + I Sbjct: 375 FSSEGQTHEIEITDNKKMATEFLDFLSYTFEGGTDFDTALREGVESLKKKQYVNADILFI 434 Query: 420 SDFIAQRLPDDVTSKVKELQRVHQHRFHAVAMSAHGKPGIMRIFDHIWRF 469 +D ++ V S +++++R + R + + GI R DHI+ Sbjct: 435 TDGLSVVNDKYVISGLEQMKRENGTRLFTIIVGNDNAGGIDRFSDHIFIL 484 >UniRef50_B9KEB5 Putative uncharacterized protein n=1 Tax=Campylobacter lari RM2100 RepID=B9KEB5_CAMLR Length = 474 Score = 263 bits (673), Expect = 8e-69, Method: Composition-based stats. Identities = 87/452 (19%), Positives = 185/452 (40%), Gaps = 25/452 (5%) Query: 31 QLAVFFEKFPRLKAAITDD-VPRWREALRSRLKDARVPPELTEEVMCYQQ----SQLLST 85 QL V+ +K + I + + +RE + K + E Y + L S Sbjct: 26 QLHVYIKKMKGFRLFIEEKKILSYREIIARTQKKIILKDL--TEFRKYNRKNIDFILDSL 83 Query: 86 PQFIVQLPQILDLLHR-LNSPWAEQARQLVDA-NSTITSALHTLFLQRWRLSLIVQATTL 143 ++ D L + N ++ ++ + + L + + +++ Sbjct: 84 ENTNKSQKELRDFLLKIWNEELNKKIKKYEEKFLKHYNKQVCILMEKIEKENILGSENCE 143 Query: 144 NQQLLEEEREQLLSEVQERMT-LSGQLEPILADNNTAAGRLWDMSAGQLKRGDYQLIVKY 202 + + + + + + + + + G K+ + I+ Y Sbjct: 144 GNKAISVSKNDFFETNANNIENILENFKLEFIKDFHDKNPYYGNNTGCEKKLSIKSIIDY 203 Query: 203 GEFLNEQPELKRLAEQLGRSREAKSIPRNDAQMETFRTMVREPATVPEQVDGLQQSDDIL 262 E + LK + + LG+SR + + + E++ G+ ++ Sbjct: 204 FEIIKNNHALKEICDLLGKSRNDDNKEGKND-SNLNNNAQKTSKESKEEIKGVILGRNLE 262 Query: 263 RLLPPELATLGITELEYEFYRRLVEKQLLTYRLHGESWREKVIERPVVHKDYDEQPRGPF 322 LL EL L +LE F + +E +L + G ++K + + +G Sbjct: 263 ELLAQELGLLNDEDLENLFVLKYLENRLFCFEKQGY-----------INKMQNHKNKGAI 311 Query: 323 IVCVDTSGSMGGFNEQCAKAFCLALMRIALAENRRCYIMLFSTEIVRYELSGPQGIEQAI 382 I+CVD+SGSM G E AK +++ AL E CY++ FST+ E+ +G+++ Sbjct: 312 IICVDSSGSMDGQPEIIAKGITYYMVKKALKEKSACYLINFSTKTKCEEIDLSKGMKKLF 371 Query: 383 RFLSQQFRGGTDLASCFRAIMERLQSREWFDADAVVISDFIAQRLPDDVTSKVKELQRVH 442 FL F GGTD++ + ++++Q + +D +VISD + + + ++++ QR Sbjct: 372 DFLCFSFNGGTDVSIALKEGVKKMQEDGFERSDLLVISDGFFGDIDNKILKQMEK-QREQ 430 Query: 443 QHRFHAVAMSAHGKPGIMRIFDHIWRFDTGMR 474 +++F+ + + +G + IFD W++DT + Sbjct: 431 ENKFYLLDI--NGCDKVKTIFDKHWKYDTSTK 460 >UniRef50_C8SB00 von Willebrand factor type A n=1 Tax=Ferroglobus placidus DSM 10642 RepID=C8SB00_FERPL Length = 469 Score = 260 bits (664), Expect = 1e-67, Method: Composition-based stats. Identities = 112/488 (22%), Positives = 189/488 (38%), Gaps = 74/488 (15%) Query: 31 QLAVFFEKFP-RLKAAITDDVP------RWREALRSRLKDA--RVPPELTEEVMCYQ--Q 79 L +EK P +K DD+ R E ++ ++K+A + PP +T+ + Sbjct: 4 DLLKIYEKVPYTVKKDRLDDMLFGRHRDRIVEKVKDKVKEAIPQFPPLVTDTFNIFHKPD 63 Query: 80 SQLLST----PQFIVQLPQILDLLHRLNSPWAEQARQLVDANSTITSALHTLFLQ---RW 132 Q L P+F V + +++ ++ L D NS I +A+ T L + Sbjct: 64 PQFLDDSQIAPEFRVNKRVLEKIMNTDTFSELKETTTLDDVNSAIATAILTERLYEELKS 123 Query: 133 RLSLIVQATTLNQQLLE-------EEREQLLSEVQERMTLS--------------GQLEP 171 +L I + T QQL E+ +Q L +++E E Sbjct: 124 KLGEIKEHTEKIQQLRNQLPGKSGEDVKQALQQIEEHSRALQGIVTQGAVSVAVRKAQEE 183 Query: 172 ILADNNTAAGRLWDMSAGQLKRGDYQLIVKYGEFLNEQPELKRLAEQLGRSREAKSIPRN 231 N + G+ + D + +K L LK++ E LG+ R Sbjct: 184 FEKVQNAMVALGFGNEPGKPVQVDPETAIKLASELKSNERLKKMVELLGKMRNLLK---- 239 Query: 232 DAQMETFRTMVREPATVPEQVDGLQQSDDILRLLPPELATLGITELEYEFYRRLVEKQLL 291 T +P ++ + +I RLLP E+ L + F R E +LL Sbjct: 240 -------STAKAKPRKSMLELHSITSGREIERLLPSEILKLR--KYRVVFLRDYYEGRLL 290 Query: 292 TYRLHGESWREKVIERPVVHKDYDEQPRGPFIVCVDTSGSMGGFNEQCAKAFCLALMRIA 351 Y L K +++ +GP ++ +D SGSM G EQ AKA LA + IA Sbjct: 291 HYDL----------------KRREKESKGPIVIALDLSGSMSGAKEQWAKAVSLATIDIA 334 Query: 352 LAENRRCYIMLFSTEIVRYELSGPQG-IEQAIRFLSQQFRGGTDLASCFRAIMERLQS-R 409 + E R I+ F I ++ Q E + + GGT+ + M+ ++ R Sbjct: 335 VKERRPWAIIAFDAGIKDVKVFRKQPKPEDVLGIMRIGASGGTNFEKPLKEAMKIVEDCR 394 Query: 410 EWFDADAVVISDFIAQRLPDDVTSKVKELQRVHQHRFHAVAMSAHGKPGIMRIF-DHIWR 468 E+ AD + ISD ++ + + +R R V +S G P IMR+F D ++ Sbjct: 395 EFTKADILFISDGDC-KVGWEFLEEFTRFKRRRNVRVTGVLIS--GIPRIMRMFCDEVFA 451 Query: 469 FDTGMRSR 476 + + Sbjct: 452 LKERLDDK 459 >UniRef50_A6C7T1 Putative uncharacterized protein n=1 Tax=Planctomyces maris DSM 8797 RepID=A6C7T1_9PLAN Length = 313 Score = 260 bits (663), Expect = 1e-67, Method: Composition-based stats. Identities = 85/304 (27%), Positives = 148/304 (48%), Gaps = 12/304 (3%) Query: 172 ILADNNTAAGRLWDMSAGQLKRGDYQLIVKYGEFLNEQPELKRLAEQLGRSREAKSIPRN 231 + + WD+SAG K + + KY + ++ P+++++ E+LGR + ++ + + Sbjct: 1 MFGPLDRLLSPGWDLSAGIFKYRGWGDLKKYRDLIDRIPQIRQMIEELGRLQASEEMDDD 60 Query: 232 DAQMETFRTMVR--------EPATVPEQVDGLQQSDDILRLLPPELATLGITELEYEFYR 283 + F ++ R E Q G+++S D +R++P E L+ ++ Sbjct: 61 PTYADAFNSLRRTTEEQREVEHPLARHQAQGIERSADFMRMIPSEAMLRRRPGLKRLWHA 120 Query: 284 RLVEKQLLTYRLHGESWREKVIERPVVHKDYDEQPRGPFIVCVDTSGSMGGFNEQCAKAF 343 +L E+ LLTYR+ G E ++ RGP IVCVDTSGSM G E AKA Sbjct: 121 KLAERGLLTYRVRGTYVDRVSTEVEEQQPQSKKRIRGPIIVCVDTSGSMSGRPEAVAKAL 180 Query: 344 CLALMRIALAENRRCYIMLF--STEIVRYELS-GPQGIEQAIRFLSQQFRGGTDLASCFR 400 L RIA AE R C + F S + V +ELS P G++ + FL+ F GGTD+++ F Sbjct: 181 TLEACRIAHAEQRPCLLFSFSGSGQYVEHELSLSPDGLQSLLEFLTMNFDGGTDISTPFE 240 Query: 401 AIMERLQSREWFDADAVVISDFIAQRL-PDDVTSKVKELQRVHQHRFHAVAMSAHGKPGI 459 + RL++ EW AD +++SD + D + + + ++ R + + + + Sbjct: 241 KALARLRTAEWERADILLVSDGAFSKSQVDALKPALDDAKKRLGLRVSGLLVGNYSSGPM 300 Query: 460 MRIF 463 + Sbjct: 301 NSLC 304 >UniRef50_Q466I6 Putative uncharacterized protein n=3 Tax=Methanosarcina RepID=Q466I6_METBF Length = 562 Score = 258 bits (658), Expect = 4e-67, Method: Composition-based stats. Identities = 92/429 (21%), Positives = 190/429 (44%), Gaps = 48/429 (11%) Query: 68 PELTEEVMCYQQSQLLSTPQFIVQLPQILDLLHR------------LNSPWAEQA--RQL 113 P + + + + +S +S +++ L +D + R + + R Sbjct: 133 PLIYKILERFSESTSISGTEYLKDLDTEMDEILRQFEEILKETLLMWGNSGYAELPGRNF 192 Query: 114 VDANSTITSALHTLFLQRWRLSLIVQATTLNQQLLEEEREQLLSEVQERMTLSGQLEPIL 173 + SAL L + + + L + ++E++E + L L + Sbjct: 193 QKMKNFGESALVDAILAFMQKGGYQEFLEKVMEGLYRRMNEFVTEMEENLELFDTLTLLF 252 Query: 174 ADNNTAAGRLWDMSAGQLKRGDY----QLIVKYGEFLNEQPELKRLAEQLGRSREAKSIP 229 N W S +LK+ + +++ Y F + P+LK++ + +GR Sbjct: 253 PQRN------WSYSVKELKKEPFYVQLKMLKNYSTFFEKSPDLKKIMDFIGRR------- 299 Query: 230 RNDAQMETFRTMVREPATVPEQVDGLQQSDDILRLLPPELATLGITELEYEFYRRLVEKQ 289 + + +R +++ ++ SD I LLP E A L L+ +FY ++E + Sbjct: 300 ----EFDPPSDRIRLSPFGKDRIQTVRFSDSINNLLPMEAAKLLNPSLKKKFYADMLEGK 355 Query: 290 LLTYRLHGESWREKVIERPVVHKDYDEQPRGPFIVCVDTSGSMGGFNEQCAKAFCLALMR 349 LL+Y+ G+ + +PRGP IV VDTSGSM G + AK+ LA+ + Sbjct: 356 LLSYQFLGKHYTGPP----------RIKPRGPMIVLVDTSGSMHGAPQTLAKSAVLAMAK 405 Query: 350 IALAENRRCYIMLFSTEIVRYEL---SGPQGIEQAIRFLSQQFRGGTDLASCFRAIMERL 406 + L++ R ++LF++ E+ S + E+ + FL F GGTD + + ++ L Sbjct: 406 LMLSQQRDMKVILFASTSQHLEIELSSRKKMSEKFLNFLLYTFGGGTDFNTALASGLKSL 465 Query: 407 QSREWFDADAVVISDFIAQRLPDDVTSKVKELQRVHQHRFHAVAMSAHGKPGIMRIFDHI 466 + +++ AD + I+D ++ + V ++ +E ++ + + +++ + + G G+ I D+I Sbjct: 466 KEKDFQGADLLFITDGKSEVSDELVLARWEEAKKKYNAKVYSLIVGSSGAGGLSEISDYI 525 Query: 467 WRFDTGMRS 475 + + M S Sbjct: 526 YFVEMEMDS 534 >UniRef50_Q0W1N2 Putative uncharacterized protein n=1 Tax=uncultured methanogenic archaeon RC-I RepID=Q0W1N2_UNCMA Length = 477 Score = 254 bits (649), Expect = 5e-66, Method: Composition-based stats. Identities = 84/307 (27%), Positives = 141/307 (45%), Gaps = 22/307 (7%) Query: 164 TLSGQLEPILADNNTAAGRLWDMSAGQLKRGDYQLIVKYGEFLNEQPELKRLAEQLGRSR 223 L+ +P+ + A GR WD + + + D I +Y + + P+L +L E LGRS Sbjct: 185 ELADSADPLELMSLLAGGRGWDYAMIEQHKDDLYNIKRYSDIVRRNPDLMKLIEDLGRSS 244 Query: 224 EAKSIPRNDAQMETFRTMVREPATVPEQVDGLQQSDDILRLLPPELATLGITELEYEFYR 283 E ++T V + +V + S D+ LLP EL L + L+Y F+ Sbjct: 245 EG---------LDTGSGKVLHSGRL--EVHSIVTSSDLYYLLPSELIKLQDSILQYLFFA 293 Query: 284 RLVEKQLLTYRLHGESWREKVIERPVVHKDYDEQPRGPFIVCVDTSGSMGGFNEQCAKAF 343 R +E +LLTY L P D + +GP I VDTSGSM G A+A Sbjct: 294 RWIEGKLLTYHL----------TDPGKSDTGDCKRKGPVIALVDTSGSMDGIPGILARAV 343 Query: 344 CLALMRIALAENRRCYIMLFSTEIVRYELSGPQG-IEQAIRFLSQQFRGGTDLASCFRAI 402 LA +R+ L R+ ++LFS+ E+ P+G + FL F GGTD + +A Sbjct: 344 TLATVRMFLQRGRKIRVVLFSSVGQLDEIDLPEGSTPGFLEFLRSSFGGGTDFNTALKAG 403 Query: 403 MERLQSREWFDADAVVISDFIAQRLPDDVTSKVKELQRVHQHRFHAVAMSAHGKPGIMRI 462 + L++R++ AD + ++D +++ + + + L+ + V + G+ I Sbjct: 404 LGALKARQYASADIMFVTDGMSRITDEALIEDWRRLKEASGSQIFTVIVGNDQAGGLEDI 463 Query: 463 FDHIWRF 469 D ++ Sbjct: 464 SDRVYIL 470 >UniRef50_Q46D40 Putative uncharacterized protein n=3 Tax=Methanosarcina RepID=Q46D40_METBF Length = 612 Score = 253 bits (647), Expect = 9e-66, Method: Composition-based stats. Identities = 95/412 (23%), Positives = 179/412 (43%), Gaps = 41/412 (9%) Query: 69 ELTEEVMCYQQSQLLSTPQFIVQLPQILDLLHRLNSPWAE---QARQLVDANSTITSALH 125 LT+E Q+S L F + + + S + + + D + T+ Sbjct: 202 SLTQENSLPQESSLPQESSFQQENSLEQESSFQQESSLQDPEHENWENPDIQAGNTAESE 261 Query: 126 TL------FLQRWRLSLIVQATTLNQQLLEEEREQLLSEVQERMTLSGQLEPILADNNTA 179 L F+ + + ++ ++ + + E+L+ +++ + + L + Sbjct: 262 RLASMTLNFMSSEKAGEV--LDSVIEESIAAKIEELIPVLEDHLEMLEILSMLF------ 313 Query: 180 AGRLWDMSAGQLKRGDYQLIVKYGEFLNEQPELKRLAEQLGRSREAKSIPRNDAQMETFR 239 GR WD S L R + + KY L + + + EQ+GR ++E Sbjct: 314 PGRAWDYSLKALHREYFGNLEKYAALLRKSSAIHEILEQVGR-----------IELEYGS 362 Query: 240 TMVREPATVPEQVDGLQQSDDILRLLPPELATLGITELEYEFYRRLVEKQLLTYRLHGES 299 + +V + S D+ LLP E L L+ +FY ++E +LLTY+L GE+ Sbjct: 363 KKLSLSPYSKSEVHSVTFSGDLRTLLPAETVKLKNPLLKRKFYADMLEGKLLTYQLKGEN 422 Query: 300 WREKVIERPVVHKDYDEQPRGPFIVCVDTSGSMGGFNEQCAKAFCLALMRIALAENRRCY 359 W ++ +GP + VDTS SM G E AKA LA+ R L ENR Sbjct: 423 WN---------SDSAGKKRKGPVVALVDTSASMRGSPELLAKAVVLAVTRRMLTENRDVK 473 Query: 360 IMLFST--EIVRYELSGPQGI-EQAIRFLSQQFRGGTDLASCFRAIMERLQ-SREWFDAD 415 ++LFS+ + V EL+ + + E+ + FL F GGTD + RA ++ ++ + + AD Sbjct: 474 VILFSSKWQTVEIELTNKKRMGEEFLEFLKFTFGGGTDFNTALRAGLKAMKNEKAFEGAD 533 Query: 416 AVVISDFIAQRLPDDVTSKVKELQRVHQHRFHAVAMSAHGKPGIMRIFDHIW 467 + ++D ++ + + E++ + R ++ + + G+ +I DH + Sbjct: 534 LLFLTDGYSELSEKPLIREWNEIKAERRARIFSLIIGNYDAGGLQQISDHTY 585 >UniRef50_C3XJE4 Putative uncharacterized protein (Fragment) n=1 Tax=Helicobacter bilis ATCC 43879 RepID=C3XJE4_9HELI Length = 429 Score = 248 bits (632), Expect = 5e-64, Method: Composition-based stats. Identities = 84/398 (21%), Positives = 150/398 (37%), Gaps = 51/398 (12%) Query: 65 RVPPELTEEVMCYQQ--SQLLSTPQFIVQLPQILDLLHRLNSPWAEQARQLVDANSTITS 122 P + EE+ Q+ + + + + + + T Sbjct: 43 NFNPFMQEELALNQRKNIESCDDIEAQQDIQAQDLQAFERFNATHKAMLDSMLHQQTDIE 102 Query: 123 ALHTLFLQRWRLSLIVQATTLNQQLLEEEREQLLSEVQERMTLS---------------- 166 + L W L + + + E+ +EQ + +++ + Sbjct: 103 SARKYALAYWDNLLRDKKQKWLESMKEKLKEQYIQAIKDFLDYLLSILETLLRVYGLCGI 162 Query: 167 ----------------------GQLEPILADNNTAAGRLWDMSAGQLKRGDYQLIVKYGE 204 QL P + + +D S G + +++ I + + Sbjct: 163 KKPNNDLALAQILNEAKQSFDMKQLCPNTYEESDIETNGYDYSKGHKRFINFKEINAFIK 222 Query: 205 FLNEQPELKRLAEQLGRSREAKSIPRNDAQMETFRTMVREPATVPEQVDGLQQSDDILRL 264 + +L+++A LGR E + + ++ + E++ G+ D+ L Sbjct: 223 HIQTSKDLRKIAALLGREEENGNKKIEHSSID----QSIKTHNHKEEMSGVTLGRDLANL 278 Query: 265 LPPELATLGITELEYEFYRRLVEKQLLTYRLHGESWREKVIERPVVHKDYDEQPRGPFIV 324 LP ELA L LE F + ++ +L + G +K + + G I+ Sbjct: 279 LPQELAMLKDENLELLFNLKYIQNRLFCFEKQGYETIQKEHYKMA-------KNEGAMII 331 Query: 325 CVDTSGSMGGFNEQCAKAFCLALMRIALAENRRCYIMLFSTEIVRYELSGPQGIEQAIRF 384 CVDTS SM G E AKA L L A +NR CY++ FST+I ELSG I F Sbjct: 332 CVDTSSSMSGNREYLAKAITLFLATKASMQNRACYLINFSTDIETMELSGKDNARNLINF 391 Query: 385 LSQQFRGGTDLASCFRAIMERLQSREWFDADAVVISDF 422 L+ F GGTD+A + ++++Q + +D +VISD Sbjct: 392 LAMSFNGGTDVAPALKEGLKKMQEDSFKQSDLIVISDG 429 >UniRef50_Q5LDB9 Putative uncharacterized protein n=11 Tax=Bacteroides RepID=Q5LDB9_BACFN Length = 419 Score = 245 bits (626), Expect = 2e-63, Method: Composition-based stats. Identities = 79/369 (21%), Positives = 154/369 (41%), Gaps = 16/369 (4%) Query: 101 RLNSPWAEQARQLVDANSTITSALHTLFLQRWRLSLIVQATTLNQQLLEEEREQLLSEVQ 160 LN + + S L LF +W L + + E+ Sbjct: 65 DLNLQYYIDRFHTLKKRSKEWKHLRNLFFDKWYHLLANNEYNYQIERINNLCERF----- 119 Query: 161 ERMTLSGQLEPILADNNTAAGRLWDMSAGQLKRGDYQLIVKYGEFLNEQPELKRLAEQLG 220 L + L A +W + + + + Y E P ++ L + LG Sbjct: 120 --YRLQKNIADQLPQRGNAR-LMWLLRT---HQELAKQLFHYDEIAKNHPAIRELTKILG 173 Query: 221 RSREAKSIPRNDAQMETFRTMVREPATVPEQVDGLQQSDDILRLLPPELATLGITELEYE 280 + + + + G+ + +D+ LLP E L L+ Sbjct: 174 KQHY--GKEKKFRMVAGIHREQIITHATKSDITGVCEGNDLNSLLPIEYCYLSDPALQPL 231 Query: 281 FYRRLVEKQLLTYRLHGESWREKVIERPVVHKDYDEQPRGPFIVCVDTSGSMGGFNEQCA 340 F+ R +K+L + + ++ + + + E+ GPFI+CVDTSGSM G E+ Sbjct: 232 FFERFNKKKLQMMDYESKD-QHRIKDIKIQGNEIVEEQSGPFIICVDTSGSMSGEREEFV 290 Query: 341 KAFCLALMRIALAENRRCYIMLFSTEIVRYELSG-PQGIEQAIRFLSQQFRGGTDLASCF 399 K+ LA+ + ++R+CY++ FS +I E+ Q I++ FL Q F GGTDL Sbjct: 291 KSAILAIAELTEQQDRKCYLINFSNDIACIEIERLGQNIQELANFLCQSFHGGTDLTPAL 350 Query: 400 RAIMERLQSREWFDADAVVISDFIAQRLPDDVTSKVKELQRVHQHRFHAVAMSAHGKPGI 459 + L+++ + +AD V++SDF L ++++ ++K++++ H +A+++ + Sbjct: 351 LHAIYILKTKSYRNADLVMMSDFEMPPLNEELSEEIKKIKQNKTH-LYALSVHKQSENTY 409 Query: 460 MRIFDHIWR 468 + + + W Sbjct: 410 LNVCNKFWF 418 >UniRef50_A6L4M8 Putative uncharacterized protein n=9 Tax=Bacteroides RepID=A6L4M8_BACV8 Length = 453 Score = 243 bits (621), Expect = 9e-63, Method: Composition-based stats. Identities = 84/410 (20%), Positives = 160/410 (39%), Gaps = 35/410 (8%) Query: 69 ELTEEVMCYQQSQLLSTPQFIVQLPQILDL------LHRLNSPWAEQARQLVDANSTITS 122 T E + S+ L F+ L Q+ L LN + + + Sbjct: 66 RYTPEWEAFYSSEHLPDMAFLQYLKQMRGAFKKRYELAELNIDYYISLLENASLLRGEGA 125 Query: 123 ALHTLFLQRWRLSLIVQATTLNQQLLEEEREQLLSEVQERMTLSGQLEPILADNNTAAGR 182 FL +W L + + E + ++ +G Sbjct: 126 RTKEFFLDKWHQLLTRKEYDYQYMHINSLCEGF--------------DLLIRKQGKESGN 171 Query: 183 LWDMSAGQLKRGDY----QLIVKYGEFLNEQPELKRLAEQLGRSREAKSIPRNDAQMETF 238 S + +Y + ++ Y + P +++LA LG+ + + + ++ Sbjct: 172 KLLGSRMEWLLHNYPDLYRRMLPYETVMKRNPAIRQLARLLGKKHRDQQKYDSLSGVDKK 231 Query: 239 RTMVREPATVPEQVDGLQQSDDILRLLPPELATLGITELEYEFYRRLVEKQLLTYRLHGE 298 R + P + + G+ +D+ LLP E L L F R EK+L + + Sbjct: 232 RLIRHSPHS---DITGVTLGNDLNSLLPVEYCYLADDALRAVFMERYAEKRLQLFDYQSK 288 Query: 299 SWREKVIERPVVHKDYDEQPRGPFIVCVDTSGSMGGFNEQCAKAFCLALMRIALAENRRC 358 PV + +GP+I+CVDTSGSM G E +K+ LA+ ++ +R+C Sbjct: 289 E------TEPVKDDKHKVSGQGPYIICVDTSGSMQGNREILSKSAILAIAQLTEKTHRKC 342 Query: 359 YIMLFSTEIVRYELSG-PQGIEQAIRFLSQQFRGGTDLASCFRAIMERLQSREWFDADAV 417 Y++ FS E V + + + + FL+++F GGTD+ R + ++ ++D V Sbjct: 343 YVINFSDEAVSLLIEDLGRDMPRLAEFLNKRFDGGTDIEPALREAAHIINGNDFRESDIV 402 Query: 418 VISDFIAQRLPDDVTSKVKELQRVHQHRFHAVAMSAHGKPGIMRIFDHIW 467 +ISDF L ++ +VK ++R + F + + + + + W Sbjct: 403 LISDFEMPPLSRNLMEQVKVIKR-RKTSFFGLVFGNKPEMEYLNLCERYW 451 >UniRef50_C9KWG4 Putative uncharacterized protein n=1 Tax=Bacteroides finegoldii DSM 17565 RepID=C9KWG4_9BACE Length = 454 Score = 237 bits (605), Expect = 6e-61, Method: Composition-based stats. Identities = 79/370 (21%), Positives = 146/370 (39%), Gaps = 38/370 (10%) Query: 101 RLNSPWAEQARQLVDANSTITSALHTLFLQRWRLSLIVQATTLNQQLLEEEREQLLSEVQ 160 LN +L + + L W L+ + Q+ L + Q Sbjct: 120 ELNVDGYNSLLKLDEFDK---DILFQKICDEWSLAYREKILEEKQRYLNSSKIQF----- 171 Query: 161 ERMTLSGQLEPILADNNTAAGRLWDMSAGQLKRGDYQLIVKYGEFLNEQPELKRLAEQLG 220 + S G+ DY+++ +Y + ELK + +G Sbjct: 172 ------------------------ENSVGRSNMKDYRMVSRYRTISVKYKELKEIVSCMG 207 Query: 221 RSREAKSIPRNDAQMETFRTMVREPATVPEQVDGLQQSDDILRLLPPELATLGITELEYE 280 R +E + T+ + + G+++ +D+ L+P E+A L E Sbjct: 208 REKEQAEELDTLIKQYIPETL--SASVAHSDIHGVEEGNDLQALMPTEVALLAEFATEDL 265 Query: 281 FYRRLVEKQLLTYRLHGESWREKVIERPVVHKDYDEQPRGPFIVCVDTSGSMGGFNEQCA 340 F+ + +QL + +S R+K + + Q +GP IV +DTSGSM G E A Sbjct: 266 FFMKYAMRQLQLFSNRSDSVRKK--QESQTKRREPRQIKGPMIVAIDTSGSMSGKAESIA 323 Query: 341 KAFCLALMRIALAENRRCYIMLFSTEIVRYELSGPQGIEQAIRFLSQQFRGGTDLASCFR 400 KA L + ++A ++R+C+++ FS + + ++ F+ F GGTD + Sbjct: 324 KALLLEITQMAKKQHRKCFLLSFSVRAQALDTAHSGNWKKVREFMVSHFSGGTDGEEMLK 383 Query: 401 AIMERLQSREWFDADAVVISDFIAQRLPDDVTSKVKELQRVHQHRFHAVAMSAHGKPGIM 460 + L + AD ++ISDF S++++ Q RF+ + + +G Sbjct: 384 TALHTLTQENYLMADVLIISDFEFDFCCKPTESRIRKEQ-ERGVRFYGLQIG-NGVNVYE 441 Query: 461 RIFDHIWRFD 470 + D +WR D Sbjct: 442 ELLDKVWRLD 451 >UniRef50_A0KNK1 Putative uncharacterized protein n=1 Tax=Aeromonas hydrophila subsp. hydrophila ATCC 7966 RepID=A0KNK1_AERHH Length = 552 Score = 237 bits (604), Expect = 9e-61, Method: Composition-based stats. Identities = 86/340 (25%), Positives = 144/340 (42%), Gaps = 27/340 (7%) Query: 135 SLIVQATTLNQQLLEEEREQLLSEVQERMTLSGQ-LEPILADNNTAAGRLWDMSAGQLKR 193 +L + E E++ E+++++T GQ L I + L S Sbjct: 229 ALRAELAECKAYEAELADEEIPDEMRKKLTGYGQILGDIDTTYEKSKALLLACSGANFSY 288 Query: 194 GDYQL----IVKYGEFLNEQPELKRLAEQLGRSREAKSIPRNDAQMETFRTMVREPATVP 249 D +L I + L + +K L ++GR+ E + R P Sbjct: 289 NDLKLCKDDIEPLAKQLQQNHAIKELTYKMGRAY----------ISEEKKKQARIPHASK 338 Query: 250 EQVDGLQQSDDILRLLPPELATLGITELEYEFYRRLVEKQLLTYRLHGESWREKVIERPV 309 +V G +S+D+ R+LP EL L LE FY R +E+ L+TY L G + Sbjct: 339 SEVHGTHRSEDLARVLPTELLNLEDEALETLFYARFLERNLMTYELQGTTCTS------G 392 Query: 310 VHKDYDEQPRGPFIVCVDTSGSMGGFNEQCAKAFCLALMRIALAENRRCYIMLFST--EI 367 + +++ GP + C+DTSGSM G A+A LA+ + E R +++LF E+ Sbjct: 393 EQLELEQKRTGPVVACLDTSGSMSGAPLLKARALLLAVSAVLQQEARSLHVVLFGDNGEL 452 Query: 368 VRYELSGPQGIEQAIRFLSQQFRGGTDLASCFRAIMERL-QSREWFDADAVVISDFIAQR 426 Y + + FL Q F GGTD + E + ++E+ AD ++ISD Sbjct: 453 REYAIHEENSASGLLHFLRQGFGGGTDFETPLNRACEIIRDAKEYEKADILMISDGDC-V 511 Query: 427 LPDDVTSKVKELQRVHQHRFHAVAMSAHGKPGIMRIFDHI 466 L DD ++ +++ ++V HG+ R D + Sbjct: 512 LSDDYIEHLQTRKKILDCSIYSVL--CHGQRVADRFSDEV 549 >UniRef50_A7V9J4 Putative uncharacterized protein n=7 Tax=Bacteroides RepID=A7V9J4_BACUN Length = 495 Score = 234 bits (596), Expect = 6e-60, Method: Composition-based stats. Identities = 76/378 (20%), Positives = 151/378 (39%), Gaps = 13/378 (3%) Query: 92 LPQILDLLHRLNSPWAEQARQLVDANSTITSALHTLFLQRWRLSLIVQATTLNQQLLEEE 151 L QI D + + W + ++ ++ + Sbjct: 121 LQQISDKYREYGFDSRFYRSHFGTEGGYADDEVWEKMVDDWEDAFQLKMHEEKEKEIAFR 180 Query: 152 REQLLSEVQERMTLSGQLEPILADNNTAAGRLWDMSAGQLKRGDYQLIVKYGEFLNEQPE 211 ++ L ++ + + + + W M +G D++ I K PE Sbjct: 181 KDALERRLRSNLKDIPEYIRQNRVDKDEFFQTWGMMSGLWNTVDFERIRKIVRIQRSCPE 240 Query: 212 LKRLAEQLGRSREAKSIPRNDAQMETFRTMVREPATVPEQVDGLQQSDDILRLLPPELAT 271 + ++A ++GR + + R ++ E ++ + G+ +D+ LLP ELA Sbjct: 241 IVKVARKMGRMADDEG--REQIRVAEGNVYKMEHSSKC-DILGISTGNDLNALLPIELAH 297 Query: 272 LGITELEYEFYRRLVEKQLLTYRLHGESWREKVIERPVVHKDYDEQPRGPFIVCVDTSGS 331 +ELE F + + ++L T+R ++++ + +P+GP IVC+DTSGS Sbjct: 298 SADSELEDLFVYKYLTRKLQTFRYKS-----EIMQPARRIETKPARPKGPMIVCLDTSGS 352 Query: 332 MGGFNEQCAKAFCLALMRIALAENRRCYIMLFSTEIVRYELSGPQGIEQAIRFLSQQFRG 391 M G E+ A + + L+ IA + R C+++ FS I ++ + + + F S+ G Sbjct: 353 MAGKPEKIAHSLLIKLLEIADRQRRNCFLIAFSVSIQPIDVRKERA--RLLEFFSKTACG 410 Query: 392 GTDLASCFRAIMERLQS-REWFDADAVVISDFIAQRLPDDVTSKVKELQRVHQHRFHAVA 450 TD A L+ +E+ +AD + +SDF +++ R F+ + Sbjct: 411 DTDATRMLEATFRLLKEGKEYMNADVLWVSDFKIPHSSPAFMEEIRRC-REAGTHFYGLQ 469 Query: 451 MSAHGKPGIMRIFDHIWR 468 + FD I+R Sbjct: 470 IGITDN-EWTPFFDRIYR 486 >UniRef50_Q8EW10 Putative uncharacterized protein MYPE3970 n=1 Tax=Mycoplasma penetrans RepID=Q8EW10_MYCPE Length = 488 Score = 224 bits (571), Expect = 5e-57, Method: Composition-based stats. Identities = 83/305 (27%), Positives = 150/305 (49%), Gaps = 16/305 (5%) Query: 169 LEPILADNNTAAGRLWDMSAGQLKRGD-YQLIVKYGEFLNEQPELKRLAEQLGR-SREAK 226 LE + LWD+S+ + + ++ + Y + + LK+ +G +E Sbjct: 105 LEEFPQPEDYDLSILWDLSSIEERETQLFKAVENYFNLVKDDENLKKFIRMIGTFMQENL 164 Query: 227 SIPRNDAQMETFRTMVREPATVPEQVDGLQQSDDILRLLPPELATLGITELEYEFYRRLV 286 I +N+ ++ +P++ L QS D+ L+P E+A L ELE F + + Sbjct: 165 EIEKNEKELFL--------ENIPQETFALYQSSDLNNLIPNEIAQLDDPELEIIFLKNFI 216 Query: 287 EKQLLTYRLHGESWREKVIERPVVHKDYDEQPRGPFIVCVDTSGSMGGFNEQCAKAFCLA 346 E++LLTY+L G E+ I+ V K D RGP +C+DTSGSM E +KA L Sbjct: 217 EQKLLTYQLWG---IEREIQEEWVIKQRDIGERGPLFICLDTSGSMRNMKEVLSKALTLV 273 Query: 347 LMRIALAENRRCYIMLFSTEIVRYELSGPQGIEQAIRF-LSQQFRGGTDLASCFRAIMER 405 +R + + FS E Y+L + ++++ L + F GG+D+ I Sbjct: 274 FVRELEKMDINVVFIPFSMEAKFYDLYDSKFKLKSVKMNLRKSFYGGSDIEKLVDLIDSV 333 Query: 406 LQSREWFDADAVVISDFIAQRLPDDVTSKVKELQRVHQHRFHAVAMSAH-GKPGIMRIFD 464 + +++ A+ +++SDFI ++LP +K+K+L+ + H+ H++ +S K + IF+ Sbjct: 334 IYKKKYERANILIMSDFIFKKLPKKAVNKLKKLKH-NGHKLHSLTISDQIYKNNLFDIFN 392 Query: 465 HIWRF 469 WR+ Sbjct: 393 TNWRY 397 >UniRef50_A8IX54 Predicted protein n=1 Tax=Chlamydomonas reinhardtii RepID=A8IX54_CHLRE Length = 604 Score = 219 bits (559), Expect = 1e-55, Method: Composition-based stats. Identities = 82/306 (26%), Positives = 128/306 (41%), Gaps = 11/306 (3%) Query: 163 MTLSGQLEPILADNNTA-AGRLWDMSAGQLKRGDYQLIVKYGEFLNEQPELKRLAEQLGR 221 + G+ G +D+ KR + + + L E EL+ L LGR Sbjct: 170 VETLGRAGRAFEGLEALLGGDDFDLQGSIWKRAGWSQLDELRRKLEELKELRDLVRSLGR 229 Query: 222 SREAKSIPRNDAQMETFRTM--VREPATVPEQVDGLQQSDDILRLLPPELATLGI----T 275 + R Q + ++ GL +SDDI RLLP E A L Sbjct: 230 GGGWGPLRRAPVQFLDLNARPGLLRTVLEAQETRGLTRSDDISRLLPAEAALLARGRVVR 289 Query: 276 ELEYEFYRRLVEKQLLTYRLHGESWREKVIERPVVHKDYDEQPRGPFIVCVDTSGSMGGF 335 + + FY ++ EK L TY G IE P + RGP ++CVDTSGSM G Sbjct: 290 QAKLLFYAKMAEKALQTYERDGWGEYPTQIE-PERREIRPTADRGPILLCVDTSGSMRGA 348 Query: 336 NEQCAKAFCLALMRIALAENRRCYIMLFS--TEIVRYELS-GPQGIEQAIRFLSQQFRGG 392 E AKA L MR A + R C++ FS E+ EL+ + + FL + F GG Sbjct: 349 RETVAKALALECMRAARQQERGCFVFAFSGPAEVREIELNMDAASVNNLLEFLEKMFNGG 408 Query: 393 TDLASCFRAIMERLQSREWFDADAVVISDFIAQRLPDDVTSKVKELQRVHQHRFHAVAMS 452 +D + ++RL +W ++D +++SD ++ + K+ + R H + + Sbjct: 409 SDFNEPVKRCLDRLTDAKWANSDILLVSDGELRQPAPAIMRKLAGAKEALGLRVHGLVVG 468 Query: 453 AHGKPG 458 + K Sbjct: 469 SPEKKR 474 >UniRef50_A2SLM6 Uncharacterized protein containing a von Willebrand factor type A (VWA) domain n=2 Tax=Burkholderiales Genera incertae sedis RepID=A2SLM6_METPP Length = 493 Score = 216 bits (551), Expect = 1e-54, Method: Composition-based stats. Identities = 92/355 (25%), Positives = 148/355 (41%), Gaps = 30/355 (8%) Query: 149 EEEREQLLSEV-----QERMTL---SGQLEPILADNNTAAGRLWDMSAGQLKRGDYQLIV 200 +E RE ++E+ E L L +L D A D G+L R ++Q Sbjct: 109 DEPRETAIAEMVAAFRAEWTLLHADWEHLLALLQDLGELAALQRDALRGRLARREWQAAQ 168 Query: 201 KYGEFLNEQPELKRLAEQLGRSREAKSIPRNDAQME-----TFRTMVREPATVPEQVDGL 255 + L P L L LGR ++ P+ + + P ++ G+ Sbjct: 169 QLAALLTRNPALVALIASLGRGLPREAPPQPAPTAPGRARVLGQLVETRLPDAPGEILGV 228 Query: 256 QQSDDILRLLPPELATLGITELEYEFYRRLVEKQLLTYRLHGESWREKV----IERPVVH 311 + ++ R+LP E A L L + RL E +L+ + + ++ R Sbjct: 229 RPGRNLARMLPSEAAQLRHPLLHKLWRARLAEARLMVWDEEAVLFDQRPGGATPLRAAAQ 288 Query: 312 KDYDEQPRGPFIVCVDTSGSMGGFNEQCAKAFCLALMRIALAENRRCYIMLFS--TEIVR 369 RGP +VC+DTSGSM G EQ AKA L R A E R C ++ F E++ Sbjct: 289 AAPPPLARGPMLVCIDTSGSMRGAPEQLAKAVVLQAARTAHRERRACQLIAFGGAGELLT 348 Query: 370 YELS-GPQGIEQAIRFLSQQFRGGTDLASCFRAIMERLQSREWFDADAVVISDFIAQRLP 428 +EL+ P G++ + F+ Q F GGTDLA+ + + S W AD +++SD P Sbjct: 349 HELALTPAGLDALLDFIGQAFDGGTDLAAPLAHAVAAVHSARWQQADLLLVSDGEFGCTP 408 Query: 429 DDVTSKVKELQRVHQHRFHAVAMSAHGKPGIMRIFDHIWRFDTGMRSRLLRRWRR 483 + + + ++ H R V + G++ + D I +R WRR Sbjct: 409 ATL-ALLDGARQRHGLRVQGVLVGDRETMGLLEVCDAI---------HWVRDWRR 453 >UniRef50_C0ZE04 Putative uncharacterized protein n=1 Tax=Brevibacillus brevis NBRC 100599 RepID=C0ZE04_BREBN Length = 572 Score = 213 bits (543), Expect = 1e-53, Method: Composition-based stats. Identities = 94/436 (21%), Positives = 172/436 (39%), Gaps = 54/436 (12%) Query: 28 ASPQLAVFFEKFPRLKAAITDD-VPRWREALRSRLKDARVPPELTEEVMCYQQSQLLSTP 86 QL+ + + K + + + + + + + P + EV+ Q Sbjct: 138 QQAQLSELLTEKQKAKLELVGYTLQQGKRVVEDKQEAMDTKPLVRAEVLSLQNRITELQE 197 Query: 87 QFIVQLPQ---ILDLLHRLNSPWAEQARQLVDANSTITSALHTLF--LQRWRLSLIVQAT 141 + VQ + +L + +L ++ +QL A+ L +W + Sbjct: 198 EMKVQFTKRTKLLQKVKKLEGELVQREKQLDRLQKQEKEAIAAFEKELGQWLEQSLKATL 257 Query: 142 TLNQQLLEEEREQLLSEVQERMTLSGQLEPILADNNTAAGRLWDMSAGQLKRGDYQLIVK 201 + +E TL +E + + A R W G+L+R ++ +K Sbjct: 258 S----------------TEELDTLF--VEEVFTASQRFANRSWGHELGKLRRQSFEQYLK 299 Query: 202 YGEFLNEQPELKRLAEQLGRSREAKSIPRNDAQMETFRTMVREPATVPEQVDGLQQSDDI 261 + E L P+L ++GR + R + + + F PE+ L+QS DI Sbjct: 300 WIEKLKRHPDLVAFLNEVGRQVHRFRVKRKEIRSKHF----------PEEYYDLRQSGDI 349 Query: 262 LRLLPPELATLGITELEYEFYRRLVEKQLLTYRLHGESWREKVIERPVVHKDYDEQPRGP 321 +LP E L + E F + +E++L+TY G +E P+GP Sbjct: 350 AHMLPGEAVLLADPDFENYFMLKWLEQKLMTYDTSGWV---------------EEPPKGP 394 Query: 322 FIVCVDTSGSMGGFNEQCAKAFCLALMRIALAENRRCYIMLFST--EIVRYELSGPQGIE 379 I +DTS SM G + A+ F + +++ E R ++LF EI L + Sbjct: 395 VICMLDTSHSMRGSKLRLAQIFIMTFAALSMLEKRDFILLLFGAKGEIKEQPLYHKKPDW 454 Query: 380 QAIRFLSQQ-FRGGTDLASCFRAIMERLQSRE-WFDADAVVISDFIAQRLPDDVTSKVKE 437 A L+Q F GGT + + +E ++ + W AD V+++D I P V K+ Sbjct: 455 PAFYGLAQMAFGGGTHFDAPMKRAIELVEKEQAWRGADFVMVTDGIGGISP-YVQEKLIF 513 Query: 438 LQRVHQHRFHAVAMSA 453 L + Q R H++ + + Sbjct: 514 LGQHKQVRLHSLIVGS 529 >UniRef50_B7DQJ9 von Willebrand factor type A n=1 Tax=Alicyclobacillus acidocaldarius LAA1 RepID=B7DQJ9_9BACL Length = 484 Score = 210 bits (535), Expect = 7e-53, Method: Composition-based stats. Identities = 82/465 (17%), Positives = 156/465 (33%), Gaps = 79/465 (16%) Query: 13 VSEEGLIEEMIIALLASPQL------AVFFEKFPRLKAAITDDVPRWREALRSRLKDARV 66 + E + + L + Q+ L+ A+ DDV + + + A+ Sbjct: 69 IMESLMRQSAWKNLRQTTQMDEYSAALGALHLRESLEEALPDDVRQAARQVEELARQAQ- 127 Query: 67 PPELTEEVMCYQQSQLLSTPQFIVQLPQILDLLHRLNSPWAEQARQLVDANSTITSALHT 126 +Q++ + P D L +QA +L+ T Sbjct: 128 --------HLLEQAEAY--EEVADGHPSARDEAETLR----QQAAELMQTLQRATDQFEQ 173 Query: 127 LFLQRWRLSLIVQATTLNQQLL-EEEREQLLSEVQERMTLSGQLEPILADNNTAAGRLWD 185 F + R+ L +E ++ + Sbjct: 174 AF-------------DAQSGSIGRALRQALEQAAEEAQDTQRAMQ------------AFG 208 Query: 186 MSAGQLKRGDYQLIVKYGEFLNEQPELKRLAEQLGRSREAKSIPRNDAQMETFRTMVREP 245 + G K + ++ L P ++ +A G + R + + Sbjct: 209 VGTGDGKPVSGKERLELAHILQTNPHVREIARMAGGMQMMALNKRKNRTLHP-------- 260 Query: 246 ATVPEQVDGLQQSDDILRLLPPELATLGITELEYEFYRRLVEKQLLTYRLHGESWREKVI 305 P ++ + DD+ +LP EL L E EF +R E++LL Y L G Sbjct: 261 ---PTEIVNITMGDDLANVLPSELLLLADPATEDEFIQRFAERRLLQYDLRG-------- 309 Query: 306 ERPVVHKDYDEQPRGPFIVCVDTSGSMGGFNEQCAKAFCLALMRIALAENRRCYIMLFST 365 ++ + +GP +VC+D SGS G E K LAL+ IA E R ++ F++ Sbjct: 310 --------FEREGQGPIVVCIDESGSTAGMVEMWEKGIALALLAIARREKRAFAVVHFAS 361 Query: 366 --EIVRYELSGPQGIE--QAIRFLSQQFRGGTDLASCFRAIMERLQSREWFDADAVVISD 421 EI + P+ + ++ F GGTD S R + + + D V I+D Sbjct: 362 AHEIFVQKWLRPKDASPTELVQMAQHFFNGGTDFESPLREAVRIMDEAAFQKGDIVFITD 421 Query: 422 FIAQRLPDDVTSKVKELQRVHQHRFHAVAMSAHGKPGIMRIFDHI 466 ++ + + + ++ + +V + + + D I Sbjct: 422 GESRVSDEFLHGEYARVKSEKAFQVISVVIG-YDDRSVRPFSDAI 465 >UniRef50_Q58221 Uncharacterized protein MJ0811 n=4 Tax=Methanocaldococcus RepID=Y811_METJA Length = 439 Score = 204 bits (519), Expect = 6e-51, Method: Composition-based stats. Identities = 74/390 (18%), Positives = 160/390 (41%), Gaps = 39/390 (10%) Query: 85 TPQFIVQLPQILDLLHRLNSPWAEQARQLVDANSTITSALHTLFLQRWRLSLIVQATTLN 144 +F + + + + ++ +L + N+ + + F +++ +L + Sbjct: 62 EEKFKINKAILEGAIKNIEYEKSKLLTELDEVNAGTATIM---FCEKFFENLKLAKLNKE 118 Query: 145 QQLL--EEEREQLLSEVQE--RMTLSGQLEPILA-DNNTAAGRLWDMSAGQLKRGDYQLI 199 + E + E L +++E + T+ E + A + G K + Sbjct: 119 LKKFASEGKGEGLEDKLKEIAKNTMKDIAEEVSEVIQGFNAVENFGKGEGDKKLLSPEDR 178 Query: 200 VKYGEFLNEQPELKRLAEQLGRSREAKSIPRNDAQMETFRTMVREPATVPEQVDGLQQSD 259 +K + + + +++ + ++LG+ R + +++ ++ ++ + Sbjct: 179 IKLADKILQNKKIREIVKKLGKLRLLA--------INEYKSKIKH---YSGEIYSTKIGR 227 Query: 260 DILRLLPPELATLGITELEYEFYRRLVEKQLLTYRLHGESWREKVIERPVVHKDYDEQPR 319 D+ LLP E+ L L Y+F RR V+K+LL Y + + E+ + Sbjct: 228 DLKHLLPKEIVNLSDEILYYDFLRRFVDKKLLIYDIQNKL----------------EKQK 271 Query: 320 GPFIVCVDTSGSMGGFNEQCAKAFCLALMRIALAENRRCYIMLFSTEIVRYELSGPQGI- 378 GP I+ +D SGSM G E KA L+++ IA ENR Y + F + + P+ I Sbjct: 272 GPIIILLDHSGSMYGDREIWGKAVALSIIEIAKRENRDIYYIAFDDGVRFEKKINPKTIT 331 Query: 379 -EQAIRFLSQQFRGGTDLASCFRAIMERLQSRE-WFDADAVVISDFIAQRLPDDVTSKVK 436 ++ I S F GGT+ M ++ E + +AD ++I+D A+ + D + Sbjct: 332 FDEIIEIASLYFGGGTNFIMPLNRAMSIIKEHETFKNADILLITDGYAE-VNDVFLKEFD 390 Query: 437 ELQRVHQHRFHAVAMSAHGKPGIMRIFDHI 466 + + + + +V + + I D + Sbjct: 391 KFKNEYNAKLISVFVETFPTETLKAISDEV 420 >UniRef50_D0LUP3 Uncharacterized protein containing a von Willebrand factor type A (VWA) domain-like protein n=1 Tax=Haliangium ochraceum DSM 14365 RepID=D0LUP3_HALO1 Length = 536 Score = 204 bits (519), Expect = 7e-51, Method: Composition-based stats. Identities = 71/336 (21%), Positives = 116/336 (34%), Gaps = 40/336 (11%) Query: 150 EEREQLLSEVQERMTLSGQLEPILADNNTAAGRLWDMSAGQLKRGDYQLIVKYGEFLNEQ 209 + R L E ++ + W G R + + + L E Sbjct: 219 DLRAALQQACAELAEALRAVDEVAEAMQEC--DTWGREPGDFGRLPIEEFQRLSQVLRET 276 Query: 210 PELKRLAEQLGRSREAKSIPRNDAQMETFRTMVREPATVPEQVDGLQQSDDILRLLPPEL 269 P ++++ E GR E R+ ++ G+ + RL EL Sbjct: 277 PSVRKIVELAGRWSELLKPRLKRGHSPRGRS----------ELVGVTLGGGLERLCATEL 326 Query: 270 ATLGITELEYEFYRRLVEKQLLTYRLHGESWREKVIERPVVHKDYDEQPRGPFIVCVDTS 329 L L +L E++ L + L G D RGP I+ VDTS Sbjct: 327 IKLRHPALRRVLLGQLAERRALVHELRGP----------------DVLGRGPMILVVDTS 370 Query: 330 GSMGGFNEQCAKAFCLALMRIALAENRRCYIMLFST--EIVRYELSGPQGIEQAIR-FLS 386 GSM G AK+ LAL + R ++ F E+ E++ + + LS Sbjct: 371 GSMHGARMTMAKSLMLALALHCWEQRRPLRVLTFGAPGEMHESEVAVDEPFWTRLEQCLS 430 Query: 387 QQFRGGTDLASCFRAIMERLQSREWFDADAVVISDFIAQRLPDDVTSKVKELQRVHQHRF 446 F GGTD + E + R W ADAV ++D + + +++ + Sbjct: 431 VAFGGGTDFDGPLLRVCEIVGERPWRRADAVFLTDGEC-CVAEATRAQLARTRARVALNI 489 Query: 447 HAVAMSAHGKPGIMRIFDHIWRFDTGMRSRLLRRWR 482 V + G+ + D +R +R R WR Sbjct: 490 IGVLVGRG--RGLDGVADIAYR------ARDGRGWR 517 >UniRef50_C6J3I5 Putative uncharacterized protein n=1 Tax=Paenibacillus sp. oral taxon 786 str. D14 RepID=C6J3I5_9BACL Length = 409 Score = 202 bits (513), Expect = 3e-50, Method: Composition-based stats. Identities = 81/410 (19%), Positives = 148/410 (36%), Gaps = 45/410 (10%) Query: 66 VPPELTEEVMCYQQSQLLSTPQFIVQLPQILDLLHRLNSPWAEQA-----RQLVDANSTI 120 VP L ++ Q + ++ DL L + + ++ V + + Sbjct: 19 VPEGLEMNHQLMERVMSDEGYQEFREFTRLDDLAAALGTTKYSETVLGWVKEQVQRDQNL 78 Query: 121 TSALHTLFLQRWRLSLI--VQATTLNQQLLEEEREQLLSEVQERMTLSGQLEPILADNNT 178 AL + S ++ Q E + L QE ++ +L Sbjct: 79 ADALQNYMNGKAGASQEASEALSSALNQNGNELSKMLAKAAQEATEAKENVKSLLGGMQA 138 Query: 179 AAGRLWDMSAGQLKRGDYQLIVKYGEFLNEQPELKRLAEQLGRSREAKSIPRNDAQMETF 238 +G +LK+ + + E L+ ++K +A+ GR + + + + Sbjct: 139 GSGES------ELKKVPLKDQLILAERLSHDKKMKDIAKWAGRMKVIANQKQRSKHKDAI 192 Query: 239 RTMVREPATVPEQVDGLQQSDDILRLLPPELATLGITELEYEFYRRLVEKQLLTYRLHGE 298 +G++Q + I +LLP EL T + +F RR VE Q L Y G Sbjct: 193 NR------------NGIKQGNSIEQLLPMELGTYASPITKMDFLRRYVEGQTLQYDTKGP 240 Query: 299 SWREKVIERPVVHKDYDEQPRGPFIVCVDTSGSMGGFNEQCAKAFCLALMRIALAENRRC 358 ++ +GP I+C+D SGSM G + +K F LALM IA + R Sbjct: 241 ----------------EQLGKGPIILCLDQSGSMSG-QDTISKGFALALMSIARKQRRDF 283 Query: 359 YIMLFSTEIVRYELSGPQGI--EQAIRFLSQQFRGGTDLASCFRAIMERLQSREWFDADA 416 + FS+ + I + I+ + GGT RA + ++ + AD Sbjct: 284 AWIPFSSHAAAPLIYERGTIVVQDMIQLATIFLGGGTSFEPPLRAASQVIEQSRFNQADI 343 Query: 417 VVISDFIAQRLPDDVTSKVKELQRVHQHRFHAVAMSAHGKPGIMRIFDHI 466 V ++D + + + EL+ ++ + G+ D I Sbjct: 344 VFVTDGESH-VSERFLQSWNELKSKKGFSVLSLLLGRESIQGVEGFSDRI 392 >UniRef50_D1XLN5 von Willebrand factor type A n=12 Tax=Actinomycetales RepID=D1XLN5_9ACTO Length = 538 Score = 201 bits (512), Expect = 4e-50, Method: Composition-based stats. Identities = 72/302 (23%), Positives = 111/302 (36%), Gaps = 47/302 (15%) Query: 182 RLWDMSAGQLKRGDYQLIVKYGEFLNEQPELKRLAEQLGRSREAKSIPRNDAQMETFRTM 241 R W ++ G+L+R + + E L L R AE +GR R+ R Sbjct: 246 RAWGVAPGELERMPFDERARLAERLR-TGRLARWAELIGRFRQMADGERA---------- 294 Query: 242 VREPATVPEQVDGLQQSDDILRLLPPELATLGITELEYEFYRRLVEKQLLTYRLHGESWR 301 R ++ G+ DD+ R++P ELA LG+ L F R +L+ Y GE Sbjct: 295 -RRVENATGELVGVTLGDDLSRVIPSELANLGLPGLRAVFAARYAAGELMLYDTQGEQTT 353 Query: 302 EKVIERPVVHKDYDEQPRGPFIVCVDTSGSMGGF------NEQCAKAFCLALMRIALAEN 355 +G + CVDTS SM E AKA LAL+ A Sbjct: 354 ----------------GKGAVVACVDTSHSMYEAGPGGVTREAWAKACALALLDQARHGG 397 Query: 356 RRCYIMLFST----EIVRYELSGPQGIEQAIRFLSQQFRGGTDLASCFRAIMERLQSR-- 409 R +LFS ++ R+ P + + F GGT + A + L+ Sbjct: 398 RDFVGILFSAADKLQVFRFPAGRPADTARVLDFAETFLGGGTSYQTPLTAAADLLEEEFD 457 Query: 410 --EWFDADAVVISDFIAQRLPDDVTSKVKELQRVHQHRFHAVAMSAHGKPG----IMRIF 463 D V+I+D + ++ +R R VA+ A G + + Sbjct: 458 ATARTRGDIVMITDDECG-VTEEWMRGWIGAKRRLDFRVFGVAVGAPLAAGTGSVLEALC 516 Query: 464 DH 465 D+ Sbjct: 517 DN 518 >UniRef50_D0Z403 Putative uncharacterized protein n=1 Tax=Photobacterium damselae subsp. damselae CIP 102761 RepID=D0Z403_LISDA Length = 543 Score = 200 bits (509), Expect = 8e-50, Method: Composition-based stats. Identities = 79/340 (23%), Positives = 135/340 (39%), Gaps = 38/340 (11%) Query: 144 NQQLLEEEREQLLSE------VQERMTLSGQLEPILADNNTAA----GRLWDMSAGQLKR 193 +Q LE +++ E ++ QL + A G + + +LK Sbjct: 222 SQSELERYSKEIEDENLPASALKNIDQYKTQLAKLKYAGEKATKLSKGLGYSFTYNELK- 280 Query: 194 GDYQLIV--KYGEFLNEQPELKRLAEQLGRSREAKSIPRNDAQMETFRTMVREPATVPEQ 251 D + I + L + + LGR A + + + Sbjct: 281 -DIKDIDFSSLAQELASNQAITDIVTTLGR-----------AYISEKTNHKQVKRINTNE 328 Query: 252 VDGLQQSDDILRLLPPELATLGITELEYEFYRRLVEKQLLTYRLHGESWREKVIERPVVH 311 V G +S DI R+LP +LA L +LEY FY +L+E L TY+L G + Sbjct: 329 VYGTHKSADISRVLPSDLALLENEDLEYLFYAKLLESNLSTYKLLGHHIDFE-------K 381 Query: 312 KDYDEQPRGPFIVCVDTSGSMGGFNEQCAKAFCLALMRIALAENRRCYIMLFSTEIVRYE 371 ++ E+ +GP + C+DTSGSM G A+A LA+ I E R Y++LF + E Sbjct: 382 ENDTEEDKGPIVTCLDTSGSMSGIPILKARALLLAIHSIITKEKRELYVLLFGSRGQVKE 441 Query: 372 LSGPQ-GIEQAIRFLSQQFRGGTDLASCFRAIMERLQSREWFD-ADAVVISDFIAQRLPD 429 L + + F+ ++F GGTD + + + ++ +E F+ AD ++I+D + D Sbjct: 442 LYLSETSSSGLLPFICKEFSGGTDFETPLKRAINIIEHKEKFNKADILMITDGECN-VSD 500 Query: 430 DVTSKVKELQRVHQHRFHAVAMSAHGKPG---IMRIFDHI 466 + + + V + D I Sbjct: 501 NFQRMLVAKKSQLDFSVQTVICTGSFANTHQVTDGFSDRI 540 >UniRef50_D0LHL0 von Willebrand factor type A n=1 Tax=Haliangium ochraceum DSM 14365 RepID=D0LHL0_HALO1 Length = 509 Score = 199 bits (505), Expect = 2e-49, Method: Composition-based stats. Identities = 89/329 (27%), Positives = 139/329 (42%), Gaps = 45/329 (13%) Query: 153 EQLLSEVQERMTLSGQLEPILADNNTAAGRLWDMSAGQLKRGDYQLIVKYGEFLNEQPEL 212 +++ S++ + + + P + R + +G+ + ++ GE L +L Sbjct: 193 DKITSDLDDTIGRKVSVLPDQLEQGEDLRRSMGLGSGREGQVGAAERLELGERLMRSRKL 252 Query: 213 KRLAEQLGRSREAKSIPRNDAQMETFRTMVREPATVPEQVDGLQQSDDILRLLPPELATL 272 K LA+ +G RE R R +VR P + E G + RLLP EL L Sbjct: 253 KLLAKLVGAFREVAFEARR-------RRVVRTPQVMHEVGRGAH----LDRLLPSELLGL 301 Query: 273 --GITELEYEFYRRLVEKQLLTYRLHGESWREKVIERPVVHKDYDEQPRGPFIVCVDTSG 330 L EF RRLVE +LL Y L G S RGP +VCVD SG Sbjct: 302 PRHRGALHREFVRRLVEGELLEYELRGAS------------------SRGPMVVCVDGSG 343 Query: 331 SMGGFNEQCAKAFCLALMRIALAENRRCYIMLFSTE--IVRYELSGPQG---------IE 379 SM G E AKA L L IA E RRC ++FS+ + EL G +G + Sbjct: 344 SMQGTKEIWAKAVALTLTEIARRERRRCLAIVFSSGHALFEVELLGAKGRSNVRAPMLDD 403 Query: 380 QAIRFLSQQFRGGTDLASCFRAIMERLQSREWFDADAVVISDFIAQRLPDDVTSKVKELQ 439 + F GGTD R + + + D V I+D AQ + +++ + + + + Sbjct: 404 NVLAFAEHFPGGGTDFEPPMRRALAAVSEGNYRRGDIVFITDGQAQ-VSENLIADITKAR 462 Query: 440 RVHQHRFHA--VAMSAHGKPGIMRIFDHI 466 + H+ R V ++ + ++R D + Sbjct: 463 KKHRFRVRGILVDVADSDRGSLLRFCDEV 491 >UniRef50_C8SZ21 Protein viaA (VWA domain protein interacting with AAA ATPase) n=7 Tax=Enterobacteriaceae RepID=C8SZ21_KLEPR Length = 184 Score = 195 bits (496), Expect = 3e-48, Method: Composition-based stats. Identities = 132/173 (76%), Positives = 154/173 (89%) Query: 2 LTLDTLNVMLAVSEEGLIEEMIIALLASPQLAVFFEKFPRLKAAITDDVPRWREALRSRL 61 +TLD LNVMLAVSEEG+IEEM++ALLASPQLAVFFEKFPRLK I D+PRWREA+R+RL Sbjct: 1 MTLDMLNVMLAVSEEGMIEEMLLALLASPQLAVFFEKFPRLKNIIAADIPRWREAVRARL 60 Query: 62 KDARVPPELTEEVMCYQQSQLLSTPQFIVQLPQILDLLHRLNSPWAEQARQLVDANSTIT 121 K+ +PP+L EV YQQ+QLLST QFIVQLPQIL LH+L SP+A QA++LVD N+T T Sbjct: 61 KEVNIPPDLDAEVQTYQQAQLLSTSQFIVQLPQILGKLHQLQSPFAAQAQKLVDDNATFT 120 Query: 122 SALHTLFLQRWRLSLIVQATTLNQQLLEEEREQLLSEVQERMTLSGQLEPILA 174 ALHTLFLQRWRLSL+VQAT+LNQQLL+EER+QLLSEVQERMTLSGQL+P+LA Sbjct: 121 PALHTLFLQRWRLSLVVQATSLNQQLLDEERDQLLSEVQERMTLSGQLDPVLA 173 >UniRef50_B1L0Y8 von Willebrand factor type A domain protein n=10 Tax=Clostridium RepID=B1L0Y8_CLOBM Length = 578 Score = 193 bits (490), Expect = 1e-47, Method: Composition-based stats. Identities = 78/388 (20%), Positives = 161/388 (41%), Gaps = 42/388 (10%) Query: 84 STPQFIVQLPQILDLLHRLNSPWAEQARQLVDANSTITSALHTLFLQRWRLSLIVQATTL 143 + +V++ + D + L S ++ + ++ TS L ++ + + + Sbjct: 184 TLSDLLVEMQEAEDRIKDL-SQEKQELEENIENLKNNTSDLSEEDMKNKIEQIDEELENM 242 Query: 144 NQQ---LLEEEREQLLSEVQERMTLSGQLEPILAD------NNTAAGRLWDMS--AGQLK 192 +Q L EE ++L ++ LS ++ + T+ + W + + Sbjct: 243 EKQADNLEEELSDKLEKSEEDIENLSKEMAEAFNEAEKEVREATSYVKDWGLGDKPNKSS 302 Query: 193 RGDYQLIVKYGEFLNEQPELKRLAEQLGRSREAKSIPRNDAQMETFRTMVREPATVPEQV 252 + + V+ E + + +LK L++ +GR +E+ R + + Sbjct: 303 KISFSDKVEALERIRKSKKLKELSDIIGRFKESA-----------LRDQRNKHKDGAVAI 351 Query: 253 DGLQQSDDILRLLPPELATLGITELEYEFYRRLVEKQLLTYRLHGESWREKVIERPVVHK 312 ++ +DI+ LP E L + EFYR+ +KQLL Y L + Sbjct: 352 KSVRIGNDIIHTLPSEKMLLINETTKKEFYRKFNQKQLLQYELESD-------------- 397 Query: 313 DYDEQPRGPFIVCVDTSGSMGGFNEQCAKAFCLALMRIALAENRRCYIMLFSTEIVRYEL 372 + +GP ++C+D S SM G E+ +KA +AL+ IA + R +LF+ + + Sbjct: 398 --KLKAKGPMVICIDMSSSMKGIKEKWSKAVAIALLEIAQQQKRNFAAILFNEDATEPII 455 Query: 373 --SGPQGIEQAIRFLSQQFRGGTDLASCFRAIMERLQSREWFDADAVVISDFIAQRLPDD 430 + E+ + + GGT + + +E ++ ++ AD V I+D + P D Sbjct: 456 IEKDKKEPEKILDIAERFDGGGTLFETPLQKALEVIEQSKFKKADIVFITDGHSYTHP-D 514 Query: 431 VTSKVKELQRVHQHRFHAVAMSAHGKPG 458 +K +L+ + + +V + A GK G Sbjct: 515 FINKFNKLKDEKEFKVLSVLIYAGGKIG 542 >UniRef50_A8ZLC2 Putative uncharacterized protein n=3 Tax=Acaryochloris marina MBIC11017 RepID=A8ZLC2_ACAM1 Length = 483 Score = 190 bits (483), Expect = 8e-47, Method: Composition-based stats. Identities = 88/405 (21%), Positives = 149/405 (36%), Gaps = 54/405 (13%) Query: 75 MCYQQSQLLSTPQFIVQLPQILDLLHRLNSPWAEQARQLVDANSTITSALHTLFLQRWRL 134 Y+Q + S Q I +L + + E Q + A S F Q+ Sbjct: 113 NLYRQLEEASDEQEIDELDTLRAQARQAKRDGREDLFQQLQAQGQAMSQAAQAFAQK--- 169 Query: 135 SLIVQATTLNQQLLEEEREQLLSEVQERMTLSGQLEPILADNNTAAGRLWDMSAGQLKRG 194 + + E E EVQE+ L G W +G Sbjct: 170 -----LEEKEGEGIGESVEDAEGEVQEKKDELEAL-----------GMSWGNESGDRNPT 213 Query: 195 DYQLIVKYGEFLNEQPELKRLAEQLGRSREAKSIPRNDAQMETFRTMVREPATVPEQVDG 254 +K + ++P+LK++ G + E + R Q ET ++ G Sbjct: 214 PTGEKLKLAALIEQRPQLKKILALAGNALETANRKRRQHQTETGY----------GELVG 263 Query: 255 LQQSDDILRLLPPELATLGITELEYEFYRRLVEKQLLTYRLHGESWREKVIERPVVHKDY 314 + +D+ ++LP ELA L + + FYR +E QL L Sbjct: 264 ITTGNDVSQILPQELARLSDSRQKLSFYRDFLEGQLFQNDLQAP---------------- 307 Query: 315 DEQPRGPFIVCVDTSGSM-GGFNEQCAKAFCLALMRIALAENRRCYIMLFSTEIVRYE-- 371 +E+ +GP ++C+D SGSM G +KA +AL+++A + R ++LF E + Y+ Sbjct: 308 EEKGKGPMVICLDCSGSMVKGNRFLWSKALIVALVKLANEQERVVSLVLF--ESICYDPI 365 Query: 372 -LSGPQGIEQAIRFLSQQ-FRGGTDLASCFRAIMERLQSRE-WFDADAVVISDFIAQRLP 428 + I+Q IR L GGT+ + ++ E + +AD V I+D IA Sbjct: 366 YFHPREDIDQLIRLLVTSPTDGGTEFQRPLEQARDIIEQDEDYSEADIVFITDGIAPLSS 425 Query: 429 DDVTSKVKELQRVHQHRFHAVAMSAHG-KPGIMRIFDHIWRFDTG 472 + L+++ F G + + W D Sbjct: 426 VFLQEYSDSLEKLKTDLFLLEIEPKQGWSNELRTLASQSWVIDAN 470 >UniRef50_B8C4H1 Predicted protein n=1 Tax=Thalassiosira pseudonana RepID=B8C4H1_THAPS Length = 1141 Score = 189 bits (480), Expect = 2e-46, Method: Composition-based stats. Identities = 78/370 (21%), Positives = 135/370 (36%), Gaps = 60/370 (16%) Query: 139 QATTLNQQLLEEEREQLLSEV-QER---MTLSGQLEPILADN----NTAAGRLWDMSAGQ 190 + L+ LE+ E L + +E + L+ + + + + + G Sbjct: 163 EYEPLSADELEQLAESLTGTLSEEWGGVVQGVSLLDKVFGYDHNLLDLKGDDGFGLQDGI 222 Query: 191 LKRGDYQLIVKYGEFLNEQPELKRLAEQLGRSREAKSI------PRNDAQMETFRTMVRE 244 + +Q I L+ P+LK L +LG+ AK PR + V Sbjct: 223 WQHNGWQPIPDLQRRLSMMPKLKDLLARLGQRPSAKGKDVRKFRPRKRSNSRDDMMGVEI 282 Query: 245 PATVPEQVDGLQQSDDILRLLPPELATL--GITELEYEFYRRLVEKQLLTYRLHGESWRE 302 P V GL +S + +LP E L + L + F + E +LL Sbjct: 283 DPLDPTSVSGLTRSGSLTTMLPSEAVLLRSSMKSLRWLFLAKKAESKLLV---------- 332 Query: 303 KVIERPVVHKDYDEQPRGPFIVCVDTSGSMGGFNEQCAKAFCLALMRIALAENRRCYIML 362 GP I+C+DTS SM G E AKA LA + A ++ R C ++ Sbjct: 333 ----------SLPSASGGPLIICLDTSWSMSGARESLAKAVVLASVSAANSQGRECRVVS 382 Query: 363 FSTEIVRYELS----GPQGIEQAIRFLSQQFRGGTDLASCFRAIM--------------- 403 FS+ E G+ + + FLS F GGTD+ + + Sbjct: 383 FSSANNAVESGSIKCDSDGVRKLLDFLSYSFGGGTDVTGALKYALILAAHLLIAFHIYLP 442 Query: 404 --ERLQSREWFDADAVVISDFIA--QRLPDDVTSKVKELQRVHQHRFHAVAMSAHGKPGI 459 E L++ +D +++SD + + V +K++ L+ H + + P + Sbjct: 443 KMETLETD-LASSDLLLVSDGEIPNPPVSNVVFAKLEALRLQTGMEIHGLLVGKRESPAL 501 Query: 460 MRIFDHIWRF 469 + + F Sbjct: 502 SSLCTEVHDF 511 >UniRef50_A7KV72 Putative metalloprotein chaperonin subunit n=1 Tax=Bacillus phage 0305phi8-36 RepID=A7KV72_9CAUD Length = 553 Score = 178 bits (452), Expect = 3e-43, Method: Composition-based stats. Identities = 69/388 (17%), Positives = 129/388 (33%), Gaps = 51/388 (13%) Query: 94 QILDLLHRLNSPWAEQARQLVDANSTITSALHTLFLQRWRLSLIVQATTLNQQLLEEE-- 151 +++D L++ + +L + L R + + Q LE+E Sbjct: 165 ELIDQLNKARDA-QNRVDELNEKGGPGGKGLTQ------REAEELARLQQQIQDLEDEID 217 Query: 152 -----REQLLSEVQERMTLSGQLEPILADNNTAAGRLWDMSAGQLK-RGDYQLIVKYGEF 205 +E++ +++ M + + W + R K E Sbjct: 218 LNKSGQEEMKQGMEQAMEQASKKAFEEVREVRDTMESWGLDGTSSTMRISIDRRKKAIER 277 Query: 206 LNEQPELKRLAEQLGRSREAKSIPRNDAQMETFRTMVREPATVPEQVDGLQQSDDILRLL 265 + P L L + +GR + + R P V ++ +D+ R+ Sbjct: 278 IRRSPRLNNLTDLVGRMKAIALQKKTQ----------RTPDG--HSVRTIETGNDLSRVT 325 Query: 266 PPELATLGITELEYEFYRRLVEKQLLTYRLHGESWREKVIERPVVHKDYDEQPRGPFIVC 325 P L L +F + EKQL Y+ G + RGP I+ Sbjct: 326 PTSLMKLASPATRNQFMKEFSEKQLQLYKKDG----------------IKKVGRGPIIID 369 Query: 326 VDTSGSMGGFNEQCAKAFCLALMRIALAENRRCYIMLFSTEIVRYELSG----PQGIEQA 381 D SGSM G + + A LA++ +A E R + + +IV + + Sbjct: 370 HDKSGSMRGNKDDWSTALTLAMLEVAQKEKRNFGYIPYQHQIVASHVKNIPAGELDPDDI 429 Query: 382 IRFLSQQFRGGTDLASCFRAIMERLQSREWFDADAVVISDFIAQRLPDDVTSKVKELQRV 441 + GGT + L+S + D V I+D + D+ + K+ + Sbjct: 430 MDIAELDSSGGTTFMPVLDESIRCLESDRYKKGDIVFITDGDCG-ITDEWLKEFKKKKEQ 488 Query: 442 HQHRFHAVAMSAHG---KPGIMRIFDHI 466 Q V ++ G + + + D I Sbjct: 489 LQFNVLTVLINLDGGASRATVEKFSDQI 516 >UniRef50_C9RHJ4 von Willebrand factor type A n=1 Tax=Methanocaldococcus vulcanius M7 RepID=C9RHJ4_METVM Length = 383 Score = 177 bits (450), Expect = 5e-43, Method: Composition-based stats. Identities = 70/395 (17%), Positives = 147/395 (37%), Gaps = 61/395 (15%) Query: 71 TEEVMCYQQSQLLSTPQFIVQLPQILDLLHRLNSPWAEQARQLVDANSTITSALHTLFLQ 130 E++ Y + + + I +++ + L + S I + F + Sbjct: 28 DTEIIFYLFFKY--EVELLDDSEIIRKIINDRKFKHIKTITTLDENYSIIAT---EFFCE 82 Query: 131 RWRLSLIVQATTLNQQLLEEEREQLLSEVQERMTLSGQLEPILADNNTAAGRLWDMSAGQ 190 +++ +L E+ RE+ LS++ + + +E I + G Sbjct: 83 KFK------------ELKEKSREEDLSDIFDELESY--MENISYSLGC-----FGSGCGY 123 Query: 191 LKRGDYQLIVKYGEFLNEQPELKRLAEQLGRSREAKSIPRNDAQMETFRTMVREPATVPE 250 D ++ E L + +LK + LG R + + R Sbjct: 124 RSYTDPTKKLELAEKLLKNKKLKEFIKLLGTFRRI-----------SLKKAKRRIKHFSG 172 Query: 251 QVDGLQQSDDILRLLPPELATLGITELEYEFYRRLVEKQLLTYRLHGESWREKVIERPVV 310 + + + LL E L + RR E +LL Y++ Sbjct: 173 EKYSTTCGNSLTNLLSCEYKNFTDEMLFVDLLRRYNENKLLNYKIL-------------- 218 Query: 311 HKDYDEQPRGPFIVCVDTSGSMGGFNEQCAKAFCLALMRIALAENRRCYIMLFSTEIVRY 370 + + G F++C+D SGSM G E AKA L L+ +L +RC +++F + Sbjct: 219 ---DNIKNHGDFVICLDLSGSMRGNKEIWAKAVSLCLIEASLKRGKRCVVIIFDDGVRET 275 Query: 371 ELSGPQ-GIEQAIRFLSQQFRGGTDLASCFRAIMERLQSREWFDADAVVISDFIAQRLPD 429 ++ ++ + F S + GGT+ R ++ F+ D V I+D + +P Sbjct: 276 KIFEKNIHFKEVLDFASVFYGGGTNFEKPLREALK-------FNGDVVFITDGECE-IPL 327 Query: 430 DVTSKVKELQRVHQHRFHAVAMSAHGKPGIMRIFD 464 D+ +++K+ + + + +++ ++ + I D Sbjct: 328 DMLNEIKKEKEKKEIKIYSLCINTKPTITLKNISD 362 >UniRef50_D1PED6 Putative uncharacterized protein n=1 Tax=Prevotella copri DSM 18205 RepID=D1PED6_9BACT Length = 549 Score = 177 bits (449), Expect = 8e-43, Method: Composition-based stats. Identities = 67/386 (17%), Positives = 137/386 (35%), Gaps = 52/386 (13%) Query: 128 FLQRWRLSLIVQATTLNQQLLEEEREQLLSEVQERMTLSGQLEPILADNNTAAGRLWDMS 187 ++ W++ + Q + + + + + M + + A + W++ Sbjct: 170 LVRDWKVCIDHQILNKLKDFISLRQNNFETGLVRMMDQITRNMKTKGVSEQRAVQAWELM 229 Query: 188 AGQLKRGDYQLIVKYGEFLNEQPELKRLAEQLGRSREAKSIPRNDAQMETFRTMVREPAT 247 +++ + + ++ PE+K + ++GR +A R M + Sbjct: 230 TNGWTETEFERRLNQVKIQDKYPEIKEIVAKMGRVADANGKDRLTIASGVEMKME---HS 286 Query: 248 VPEQVDGLQQSDDILRLLPPELATLGITELEYEFYRRLVEKQLLTYRLHGESWREKVIER 307 ++G+ DD+ LLP ELA ++E F + ++L T+R E + + Sbjct: 287 AGSDIEGITVGDDLNSLLPLELAQYSDEDMEGLFIYKYRTRRLQTFRYKSE------MSK 340 Query: 308 PVVHKDYDEQP-RGPFIVCVDTSGSMGGFNEQCAKAFCLALMRIALAENRRCYIMLFSTE 366 P + +GP IVC+DTS SM G E+ + L A R C+++ FS Sbjct: 341 PSRKLGFTHASRKGPMIVCLDTSASMYGTPERISSTLISLLEETAEDLERDCFLIDFSVS 400 Query: 367 IVRYELSGPQGIEQA----IRFLSQ------------------------------QFRGG 392 +L + E+ I + GG Sbjct: 401 TRAIDLMAKRKAERLKRIGITMMESAEADASPSDGDGQAHTGRGIRRQPTTTHLPFIGGG 460 Query: 393 TDLASCFRAIMERLQSRE--WFDADAVVISDFIAQRLPDDVTSKVKELQRVHQHRFHAVA 450 T + + L + + +AD + I+DF+ P + S+ +E + RF+ + Sbjct: 461 TSAKKMMTQMFDLLDNDGLHYVNADVLWITDFLIPDPPQQLLSRFREYK-ETGTRFYGIR 519 Query: 451 M---SAHGKPGIMRIFDHIW--RFDT 471 + F+ I+ R+ Sbjct: 520 IVRDDDKEPNSWKEYFNQIYTIRYRP 545 >UniRef50_A7VY69 Putative uncharacterized protein n=1 Tax=Clostridium leptum DSM 753 RepID=A7VY69_9CLOT Length = 515 Score = 167 bits (422), Expect = 1e-39, Method: Composition-based stats. Identities = 66/295 (22%), Positives = 113/295 (38%), Gaps = 37/295 (12%) Query: 178 TAAGRLWDMSAGQLKRGDYQLIVKYGEFLNEQPELKRLAEQLGRSREAKSIPRNDAQMET 237 W S + + + ++ +L+ +A LGR RE + R ++ Sbjct: 233 HTIMEAWGSS--NEEMRNIPMNQTLLNYVKNSKQLQEIARLLGRYRELIADKRKNSY--- 287 Query: 238 FRTMVREPATVPEQVDGLQQSDDILRLLPPELATLGITELEYEFYRRLVEKQLLTYRLHG 297 + + L +DI L ELA LG+ E E F RR +K+L+ YR Sbjct: 288 --------SYGRGEKYDLTTGNDITNCLSSELALLGMAETEILFMRRYEQKRLMQYRKR- 338 Query: 298 ESWREKVIERPVVHKDYDEQPRGPFIVCVDTSGSMGGFNEQCAKAFCLALMRIALAENRR 357 + RG IV +D SGS AKA LAL+ IA + R+ Sbjct: 339 ---------------TAVVKGRGDMIVLIDESGSTRSVA-GWAKALALALLDIASRDGRK 382 Query: 358 CYIMLF-STEIVRYELSGPQGI--EQAIRFLSQQFRGGTDLASCFRAIMERLQSREWFDA 414 ++ F S + +R +L P E ++ Q F GGT+ + + + + + +A Sbjct: 383 FAMVHFASADRIRTDLFEPGHYTPEDVMKAAEQFFGGGTNFEAPLKEALRLM-ENGYENA 441 Query: 415 DAVVISDFIAQRLPDDVTSKVKELQRVHQHRFHAVAMSAHGKPG--IMRIFDHIW 467 D +I+D L + T + + + + + G G + D I+ Sbjct: 442 DITIITDGECS-LSEIFTEEFHKKTAACKATVTGILLDKGGTCGKSLEPFCDKIY 495 >UniRef50_Q2IEM5 VWA containing CoxE-like n=2 Tax=Anaeromyxobacter dehalogenans RepID=Q2IEM5_ANADE Length = 430 Score = 165 bits (418), Expect = 3e-39, Method: Composition-based stats. Identities = 62/258 (24%), Positives = 101/258 (39%), Gaps = 35/258 (13%) Query: 199 IVKYGEFLNEQPELKRLAEQLGRSREAKSIPRNDAQMETFRTMVREPATVPEQVDGLQQS 258 + L L+R+A GR + + R R V+ A ++V ++Q Sbjct: 173 VRSLAARLKGDERLRRIAALAGRFKRIAAAKR--------RHRVKHGA---DEVTDVEQG 221 Query: 259 DDILRLLPPELATLGITELEYEFYRRLVEKQLLTYRLHGESWREKVIERPVVHKDYDEQP 318 D+ R LP ELA L L +F R L+E + L YRL G + Sbjct: 222 ADLGRALPVELAKLSHRLLRLDFLRALLEGRSLQYRLEGTAT----------------LG 265 Query: 319 RGPFIVCVDTSGSMGGFNEQCAKAFCLALMRIALAENRRCYIMLFSTEIVRYELSGP-QG 377 +GP +V +D SGSM G + A A LAL+ A E R ++ F + + P + Sbjct: 266 KGPLVVLLDKSGSMDGPRDVWATAVALALLDQAQRERRTFALLGFDARVKFEAVVKPSEA 325 Query: 378 IEQAIRFLSQQFRGGTDLASCFRAIMERL--QSREWFDADAVVISDFIAQRLPDDVTSKV 435 + + F+S GGT++A+ R +E + AD V+++D Sbjct: 326 LPEDGLFVSCC--GGTEIAAAVRRGLEIIRTHPGALGKADLVLVTDG---GSDASEAGAF 380 Query: 436 KELQRVHQHRFHAVAMSA 453 +E + + Sbjct: 381 RESAAALGVTILGLGIGV 398 >UniRef50_Q60384 Uncharacterized protein MJ0077 n=3 Tax=Methanocaldococcus RepID=Y077_METJA Length = 382 Score = 163 bits (413), Expect = 1e-38, Method: Composition-based stats. Identities = 68/396 (17%), Positives = 138/396 (34%), Gaps = 63/396 (15%) Query: 71 TEEVMCYQQSQLLSTPQFIVQLPQILDLLHRLNSPWAEQARQLVDANSTITSALHTLFLQ 130 E++ Y + + + + I ++ + L + S I + F + Sbjct: 27 ETEIVFYLFFKY--EVEILTETDLIKKIVRDRRFKNVKSITTLDENYSLIAT---EFFCE 81 Query: 131 RWRLSLIVQATTLNQQLLEEEREQLLSEVQERMTLSGQLEPILADNNTAAGRLWDMSAGQ 190 + L ++ EE+ +LL E++ M G Sbjct: 82 K--------LKELKEKGREEDISELLDELESYMENITSSFSSFGSGE-----------GY 122 Query: 191 LKRGDYQLIVKYGEFLNEQPELKRLAEQLGRSREAK-SIPRNDAQMETFRTMVREPATVP 249 D + ++ E L + +LK + LG+ + + + + Sbjct: 123 KSYTDPKKKLELTEKLLKNNKLKEFMKVLGKFKRMAIKKYKTKIKHFSGEKYSINLGNNL 182 Query: 250 EQVDGLQQSDDILRLLPPELATLGITELEYEFYRRLVEKQLLTYRLHGESWREKVIERPV 309 + E L + RR E + L Y++ Sbjct: 183 INLLS------------SEYKNFAEEILFVDLLRRYNENKPLNYKIL------------- 217 Query: 310 VHKDYDEQPRGPFIVCVDTSGSMGGFNEQCAKAFCLALMRIALAENRRCYIMLFSTEIVR 369 + + G F+VC+D SGSM G E AKA L LM I+L N+R +LF + Sbjct: 218 ----ENNENCGDFVVCLDLSGSMRGNKEIWAKAIALCLMDISLKRNKRYISILFDDGVRD 273 Query: 370 YELSGPQ-GIEQAIRFLSQQFRGGTDLASCFRAIMERLQSREWFDADAVVISDFIAQRLP 428 ++ + ++ + F S + GGT+ R ++ F+ D V I+D + + Sbjct: 274 IKIYEKKVSFDEILEFASVFYGGGTNFEKPLREALK-------FNGDIVFITDGECE-VS 325 Query: 429 DDVTSKVKELQRVHQHRFHAVAMSAHGKPGIMRIFD 464 + K+KE ++ + + +++ ++ + +I D Sbjct: 326 LEFLEKIKEEKQRRKIKIYSICINTKPTVSLRQISD 361 >UniRef50_A2BM85 Conserved archaeal protein n=1 Tax=Hyperthermus butylicus DSM 5456 RepID=A2BM85_HYPBU Length = 439 Score = 148 bits (374), Expect = 4e-34, Method: Composition-based stats. Identities = 72/337 (21%), Positives = 126/337 (37%), Gaps = 56/337 (16%) Query: 137 IVQATTLNQQL-LEEEREQLLSEVQERMTLSGQLEPILADNNTAAGRLWDMSAGQLKRGD 195 A Q L E RE + ++ ++ Q + + +AG Sbjct: 129 SKSAADAEQGLNAENIRESVRKALETARDVAQQAKELTNLAMR-------FTAGNASMLS 181 Query: 196 YQLIVKYGEFLNEQPELKRLAEQLGRSREAKSIPRNDAQMETFRTMVREPATVPEQVDGL 255 +++ L ++K L E L K+I +A + T R+ + ++DG Sbjct: 182 LDDVIQDVINLARNTDVKVLLEAL------KTIESTEAYIRT-----RKIRSPRGELDGY 230 Query: 256 QQSDDILRLLPPELATLGITELEYEFYRRLVEKQLLTYRLHGESWREKVIERPVVHKDYD 315 + DI R++ ELA F + E+ LL Y+ Sbjct: 231 ELGSDIERVVASELALPTD-----LFLLKFAERNLLLYKK------------------VV 267 Query: 316 EQPRGPFIVCVDTSGSMGGFNEQCAKAFCLALMRIALAENRRCYIMLFSTEIVRYELSGP 375 + G F V +D SGSM G AKA LAL + A+ E R YI F + I L P Sbjct: 268 SEEYGKFYVLLDKSGSMMGMKIIWAKAVALALAQRAIREKREFYIRFFDS-IPYPPLYIP 326 Query: 376 QGIE--QAIRFLSQ----QFRGGTDLASCFRAIMERLQSREWFD--ADAVVISDFIAQRL 427 + + ++ L + GGTD+ ++ + ++ +D ++I+D + Sbjct: 327 KRVHGRDVVKLLEYVARIRANGGTDITRAILTAVDDIATKLQRSKVSDIILITDGEDKIA 386 Query: 428 PDDVTSKVKELQRVHQHRFHAVAMSAHGKPGIMRIFD 464 D + + ++ R H V +S + P + I D Sbjct: 387 IDTIRRSLNKV----NARLHTVMISGN-NPDLRAISD 418 >UniRef50_A8IQV8 Predicted protein (Fragment) n=1 Tax=Chlamydomonas reinhardtii RepID=A8IQV8_CHLRE Length = 411 Score = 143 bits (360), Expect = 1e-32, Method: Composition-based stats. Identities = 47/218 (21%), Positives = 79/218 (36%), Gaps = 37/218 (16%) Query: 291 LTYRLHGESWREKVIERPVVHKDYDEQPRGPFIVCVDTSGSMGGFNEQCAKAFCLALMRI 350 + + ++ + GP I+C+DTSGSM G E AKA L +R Sbjct: 181 MAFDDLNCWLEDEPARVTSRMEIRPAAEMGPIILCLDTSGSMRGARETVAKALALECLRG 240 Query: 351 ALAENRRCYIMLFSTEIVRYELS---GPQGIEQAIRFLSQQFRGGTDLASCFRAIMERLQ 407 A + R+CY+ FS EL ++Q + FLS F GGTD+ + + +ERL Sbjct: 241 AHRQRRQCYLYAFSGPNEVQELQLSVDVDSLDQLLAFLSCSFMGGTDVDAPLKLSLERLA 300 Query: 408 SREWFDADAVVISDFIAQRLPDDVTSK--------------------------------- 434 EW AD ++++D D + Sbjct: 301 KAEWAQADILMVTDGEIPNPDDKIIQTSSLPSRTTRPPPAAAAAAAAAAAAAAAAAAAPQ 360 Query: 435 -VKELQRVHQHRFHAVAMSAHGKPGIMRIFDHIWRFDT 471 + H + +S+H + ++ + F + Sbjct: 361 AISRAHEEMGLEVHGLLVSSHVTEAMRKLCTDVHVFKS 398 >UniRef50_A3DPE5 von Willebrand factor, type A n=2 Tax=Desulfurococcaceae RepID=A3DPE5_STAMF Length = 443 Score = 139 bits (349), Expect = 3e-31, Method: Composition-based stats. Identities = 70/373 (18%), Positives = 132/373 (35%), Gaps = 52/373 (13%) Query: 102 LNSPWAEQARQLVDANSTITSALHTLFLQRWRLSLIVQATTLNQQLLEEEREQLLSEVQE 161 L S + R +S ++S ++F+ ++ ++ E+E E Sbjct: 92 LKSSFIHDIRSKTVVDSLMSSIAASIFISEFKQLENERSFGNATSNRRGEQEGREDEKAI 151 Query: 162 RMTLSGQLEPILADNNTAA---GRLWDMSAGQLKRGDYQLIVKYGEFLNEQPELKRLAEQ 218 R + + + D A + G + Y+ L E++++ E Sbjct: 152 RRNVEKAIANTMRDVENAKKLRMLIEGERPGTVSIMAYEEYGPELIRLARNVEVRKILEI 211 Query: 219 LGRSREAKSIPRNDAQMETFRTMVREPATVPEQVDGLQQSDDILRLLPPELATLGITELE 278 L I + + + + ++ G + DI R++P LA Sbjct: 212 L------AGIKPWNINIPERKQRFKH-----GELMGYELGKDIERIVPSALALPDE---- 256 Query: 279 YEFYRRLVEKQLLTYRLHGESWREKVIERPVVHKDYDEQPRGPFIVCVDTSGSMGGFNEQ 338 FY R +E +LL Y+ Q +GP V +D SGSM G Sbjct: 257 -LFYLRFLENRLLLYQKM------------------LSQGKGPLYVLLDKSGSMDGIKMT 297 Query: 339 CAKAFCLALMRIALAENRRCYIMLFSTEIVRYELSG------PQGIEQAIRF-LSQQFRG 391 AKA L+L A+ E+R Y F + + Y L+ + + I + + G Sbjct: 298 WAKAVALSLYMRAVREHREFYFRFFDS--IPYPLAKISRRPRASNVLKLIDYIARVRGSG 355 Query: 392 GTDLASCFRAIMERLQSREWFD-ADAVVISDFIAQRLPDDVTSKVKELQRVHQHRFHAVA 450 GTD++ +++ + +D ++I+D + + VT + R R +V Sbjct: 356 GTDISKAIITACNDIRTGSVRETSDIIIITDGVDRIAEQLVTYNL----RKANARLISVM 411 Query: 451 MSAHGKPGIMRIF 463 + K + I Sbjct: 412 IMGDNK-SLKNIS 423 >UniRef50_C3NM85 von Willebrand factor type A n=14 Tax=Sulfolobaceae RepID=C3NM85_SULIN Length = 452 Score = 137 bits (344), Expect = 1e-30, Method: Composition-based stats. Identities = 70/367 (19%), Positives = 132/367 (35%), Gaps = 58/367 (15%) Query: 108 EQARQLVDANSTITSALHTLFLQRWRLSLIVQATTLNQQ---LLEEEREQLLSEVQERMT 164 ++ Q ++ L+ L Q T Q +L + E+ +S+ E Sbjct: 117 KKTSQSMEEREAAEEILNGLMKGSSSKEGKEQKNTNQQSMEKVLRQAHEKAMSKAIEDAN 176 Query: 165 LSGQLEPILADNNTAAGRLWDMSAGQLKRGDYQLIVKYGEFLNEQPELKRLAEQLGRSRE 224 ++ I+ N G + + +++ E+K++ E L + Sbjct: 177 SVRNMQKIVGGNGAGTGSVLTFEG------EIHEVLRLA----RNTEIKKILEFLSGIPK 226 Query: 225 AKSIPRNDAQMETFRTMVREPATVPEQVDGLQQSDDILRLLPPELATLGITELEYEFYRR 284 SI + R ++ G ++ DI R+L ELA + FY + Sbjct: 227 LGSITKR-----------RTTRFSKGELYGYEEGSDIERILYSELALP-----DMLFYLK 270 Query: 285 LVEKQLLTYRLHGESWREKVIERPVVHKDYDEQPRGPFIVCVDTSGSMGGFNEQCAKAFC 344 L E QLL Y+ + GP + +D SGSM G AKA Sbjct: 271 LAEGQLLLYQKQ------------------IRETLGPIYLLLDKSGSMDGEKILWAKAVA 312 Query: 345 LALMRIALAENRRCYIMLFST----EIVRYELSGPQGIEQAIRFLSQ-QFRGGTDLASCF 399 LAL A ENR Y+ F I + + + I + + ++ + + GGTD++ Sbjct: 313 LALYSRAKRENRDFYLRFFDNIPYPLIKVQKNAKSKDIIKMVEYIGKIRGGGGTDISRSI 372 Query: 400 RAIMERLQSREWFD-ADAVVISDFIAQRLPDDVTSKVKELQRVHQHRFHAVAMSAHGKPG 458 + E ++ ++ ++++D + V +KE + +V + Sbjct: 373 ISACEDIKEGHVKGVSEIILLTDGEDKIAETTVRRSLKEA----NSQLISVMI-RGDNAD 427 Query: 459 IMRIFDH 465 + R+ D Sbjct: 428 LRRVSDE 434 >UniRef50_Q9YD81 Putative uncharacterized protein n=1 Tax=Aeropyrum pernix RepID=Q9YD81_AERPE Length = 463 Score = 137 bits (344), Expect = 1e-30, Method: Composition-based stats. Identities = 56/225 (24%), Positives = 90/225 (40%), Gaps = 39/225 (17%) Query: 253 DGLQQSDDILRLLPPELATLGITELEYEFYRRLVEKQLLTYRLHGESWREKVIERPVVHK 312 DGL+ D+ R+ +L F+ +LL YR Sbjct: 252 DGLEYGSDLERIHYSQLILPDE-----YFWASFSSSKLLLYRK----------------- 289 Query: 313 DYDEQPRGPFIVCVDTSGSMGGFNEQCAKAFCLALMRIALAENRRCYIMLFST----EIV 368 + RGP V +D SGSM G A+A +AL R +LAENRR F + I Sbjct: 290 -VLDSSRGPIYVLLDKSGSMVGAKIDWARAVAVALFRRSLAENRRFSARFFDSVTYPAIH 348 Query: 369 RYELSGPQGIEQAIRFL-SQQFRGGTDLASCFRAI---MERLQSREWFDADAVVISDFIA 424 S P+ + +++L + + GGTD+ + + + R E +D V+I+D Sbjct: 349 LRPRSKPRDFLELVKYLAAVKAGGGTDITAAIKTAADDISRTPRGEQRISDIVLITDGED 408 Query: 425 QRLPDDVTSKVKELQRVHQHRFHAVAMSAHGKPGIMRIFDHIWRF 469 + V++ + R H V + H P + RI +R+ Sbjct: 409 RLN----IDVVEDSLKRSDARLHTVIIQGH-NPYLKRIS---YRY 445 >UniRef50_B3WV50 Protein ViaA n=5 Tax=Enterobacteriaceae RepID=B3WV50_SHIDY Length = 78 Score = 115 bits (288), Expect = 3e-24, Method: Composition-based stats. Identities = 77/78 (98%), Positives = 78/78 (100%) Query: 406 LQSREWFDADAVVISDFIAQRLPDDVTSKVKELQRVHQHRFHAVAMSAHGKPGIMRIFDH 465 +QSREWFDADAVVISDFIAQRLPDDVTSKVKELQRVHQHRFHAVAMSAHGKPGIMRIFDH Sbjct: 1 MQSREWFDADAVVISDFIAQRLPDDVTSKVKELQRVHQHRFHAVAMSAHGKPGIMRIFDH 60 Query: 466 IWRFDTGMRSRLLRRWRR 483 IWRFDTGMRSRLLRRWRR Sbjct: 61 IWRFDTGMRSRLLRRWRR 78 >UniRef50_D2RGP5 von Willebrand factor type A n=1 Tax=Archaeoglobus profundus DSM 5631 RepID=D2RGP5_ARCPR Length = 430 Score = 110 bits (275), Expect = 1e-22, Method: Composition-based stats. Identities = 72/429 (16%), Positives = 147/429 (34%), Gaps = 56/429 (13%) Query: 56 ALRSRLKDARVPPELTEEVMCYQQSQLLSTPQFIVQLPQILDLLHRLNSPWAEQARQLVD 115 L ++ + +P +L + +Q QL++T + + L + E+AR+ + Sbjct: 16 QLIAQRIEGYLPKDLQQVYRTCEQIQLIATDCYFLHYSLYPFLRSKNEDDVLEEARKFLQ 75 Query: 116 --------ANSTITSALHTLFLQRWRLSLIVQATTLNQQLLEEEREQLLSEVQERMTLSG 167 + + L + ++L L+ E E ++ + Sbjct: 76 DYISSDRYQKIKMLTTLDDEMSLAYSIALAKAVIGKVLGLIRLEHENPFDNLKAYV--VW 133 Query: 168 QLEPILADNNTAAGRLWDMSAGQ---LKRGDYQLIVKYGEFLNEQPELKRLAEQLGRSRE 224 + ++ A + + D + ++ E+ K++ + Sbjct: 134 AMRAMVEGGEIPAILDYATEYAEEMVRNANDVRELIGGKRAGKEEGTFKKVLDLAEHMLY 193 Query: 225 AKSIPRNDAQMETFRTMV-------REPATVPEQVDGLQQSDDILRLLPPELATLGITEL 277 K + + + + + ++ E++ G + I R L ELA Sbjct: 194 VKFMRDIVSFSKKLFSHIPKATYILKKRGRFGEELSGYSLTKRIDRALVRELALP----- 248 Query: 278 EYEFYRRLVEKQLLTYRLHGESWREKVIERPVVHKDYDEQPRGPFIVCVDTSGSMGGFNE 337 E F ++ + LT K+ G + + VD SGSM G Sbjct: 249 EELFLKKFSGEGFLT-------------------KEKLSIAEGAYYILVDKSGSMVGEKT 289 Query: 338 QCAKAFCLALMRIALAENRRCYIMLFSTEIVRYELSGPQGIEQAIRFLSQQFRGGTDLAS 397 A++ LA+ R+A + RR ++ F + LS P + AI L + GGTD+ + Sbjct: 290 VWARSVALAIYRMASLKRRRYFLRFFDKKTHHL-LSDPHEVVDAI--LKVKSNGGTDITN 346 Query: 398 CFRAIMERLQSREWFD--ADAVVISDFIAQRLPDDVTSKVKELQRVHQHRFHAVAMSAHG 455 R ++ L R D V+I+D +DV + + + +V + Sbjct: 347 ALRTAVKDLVERGLSDLTNTIVIITDG------EDVVEDLSKDLKKANANLISVMIQG-E 399 Query: 456 KPGIMRIFD 464 + I D Sbjct: 400 NETLKSISD 408 >UniRef50_Q3V4Q4 Putative VWFA domain-containing protein ORF892 n=1 Tax=Acidianus two-tailed virus RepID=Y892_ATV Length = 892 Score = 107 bits (268), Expect = 8e-22, Method: Composition-based stats. Identities = 61/386 (15%), Positives = 138/386 (35%), Gaps = 63/386 (16%) Query: 108 EQARQLVDANSTITSALHTLFLQRWRLSLIVQATTLNQQLLEEEREQLLSEVQERMTLSG 167 ++ ++ S + + L + + I + T +Q + E +E+ E++ + Sbjct: 553 QEINSILQTLSQLRDTVQQLRNYDFYSNAIDERTFDDQNVSEHRKEK---EIENLYDTAN 609 Query: 168 QLEPILADNNTAAGRLWDMSAGQLKRGDYQLIVKYGEFLNEQPELKRLAEQLGRSREAKS 227 ++ +L D N + G Q Y L G + L G Sbjct: 610 NVQKLLKDLNVSEGEQ--NKILQELVTQYSLRRLLGNIATKYKGLLDFVRNKG------- 660 Query: 228 IPRNDAQMETFRTMVREPATVPEQVDGLQQSDDILRLLPPELATLGITELEYEFYRRLVE 287 ++++E+ + P++ DD+ RL E A L + F + Sbjct: 661 ----ESEVESRHGSGKGPSS--------TMGDDLSRLFIKEYAKLSNPIQQKMFLLDYLN 708 Query: 288 KQLLTYRLHGESWREKVIERPVVHKDYDEQPRGPFIVCVDTSGSMGGFNEQCAKAFCLAL 347 L ++ +E+ +G F+ +D+SGSM G A A L Sbjct: 709 GALSIHK-------------------SEEKKQGDFLFVIDSSGSMEGNKIATALAIPLVT 749 Query: 348 MRIALAENRRCYIMLFSTEIVRYELSGPQGIEQAIRFLSQQFRGGTDLASCFRAIMERLQ 407 + R + FS E + + I + S +F GGT++ S ++ + Sbjct: 750 YKK-YKGKRNILVETFSDE--PSPIYNIKNIANVLG--SMKF-GGTNIGSAVLYALKNID 803 Query: 408 S------REWFDA-----DAVVISDFIAQRLPDDVTSKVKELQRVHQHRF--HAVAMSAH 454 R+ ++ ++++D + +PDD+ ++ L++ ++ + + + Sbjct: 804 KPDSDYDRKLRESLRKTRTLILLTDGEDE-IPDDIAREINSLKKKNKVELLCYGIDLGER 862 Query: 455 GKPGIMRIFDHIWRFDTGMRSRLLRR 480 G + I D ++ + ++ + Sbjct: 863 GLKTLKEICDEVYAVGSNNFGNIVLK 888 >UniRef50_UPI00003C852B hypothetical protein Faci_06871 n=1 Tax=Ferroplasma acidarmanus fer1 RepID=UPI00003C852B Length = 420 Score = 92.8 bits (229), Expect = 2e-17, Method: Composition-based stats. Identities = 60/390 (15%), Positives = 136/390 (34%), Gaps = 67/390 (17%) Query: 86 PQFIVQLPQILDLLHRLNSPWAEQA-RQLVDANSTITSALHTLFLQRWRLSLIVQATTLN 144 +F+ + ++ + E+A QL D S + S L + + Sbjct: 67 NEFMESVRLRMEYITETKDFKQERAYTQLNDRLSMLYSINFMKALNENAKKNQPRNGNSS 126 Query: 145 QQLLEEEREQLLSEVQERMTLSGQLEPILADNNTAAGRLWDMSAGQLKRGDYQLIVKYGE 204 ++ E+ + +++ ++ ++E I+ D N G+ + + ++ + Sbjct: 127 NAPDQKTIEKSMEGASKKVEMAHEIEKIVKDKNPGGN------IGKKEGMSVESLIDLTD 180 Query: 205 FLNEQPELKRLAEQLGRSREAKSIPRNDAQMETFRTMVREPATVPEQVDGLQQSDDILRL 264 + ++ + + +PR +M +F ++ G ++ I + Sbjct: 181 KAMKVDNADKILTLANKLIDI--MPRYTKKMRSFSN--------TGELAGYYKTRHISNV 230 Query: 265 LPPELATLGITELEYEFYRRLV------EKQLLTYRLHGESWREKVIERPVVHKDYDEQP 318 L ELA FY +L+ EK+L++ Sbjct: 231 LSRELAMPDE-----IFYSKLINGFTGKEKRLMS-------------------------- 259 Query: 319 RGPFIVCVDTSGSMG-GFNEQCAKAFCLALMRIALAENRRCYIMLFSTEIVRYELSGPQG 377 G + V +D SGSM G +++ LAL RIA + R+ Y F ++L Sbjct: 260 PGSYYVLLDKSGSMYEGDKTLWSRSVALALFRIARSRGRKYYFRFFDN--KPHDLLNS-P 316 Query: 378 IEQAIRFLSQQFRGGTDLASCFRAIMERLQSREWFD--ADAVVISDFIAQRLPDDVTSKV 435 + L+ + GT + + + LQ + ++I+D + D Sbjct: 317 FDVVENILTVEANKGTCIECALKTALRDLQDTKIRSETNTIIIITDGEDKVNMQDYF--- 373 Query: 436 KELQRVHQHRFHAVAMSAHGKPGIMRIFDH 465 ++ ++ + V ++ + G+ +I Sbjct: 374 ---RKENETKLITVMINGY-NEGLKKISTE 399 >UniRef50_Q6KZN8 Putative uncharacterized protein n=1 Tax=Picrophilus torridus RepID=Q6KZN8_PICTO Length = 417 Score = 87.4 bits (215), Expect = 1e-15, Method: Composition-based stats. Identities = 64/357 (17%), Positives = 129/357 (36%), Gaps = 52/357 (14%) Query: 111 RQLVDANSTITSALHTLFLQRW-RLSLIVQATTLNQQLLEEEREQLLSEVQERMTLSGQL 169 L D S I S L + + +++ + +E E+ L E + + + Sbjct: 88 SYLNDKVSMIYSISFVKALGDEIKKAESSGRGSMDGKKAQEIIERALKESERIGDRARDV 147 Query: 170 EPILADNNTAAGRLWDMSAGQLKRGDYQLIVKYGEFLNEQPELKRLAEQLGRSREAKSIP 229 +L N G+L D G L ++ + + + +++ Sbjct: 148 NNLLK-GNNPGGKLADKKDGTL-----DNVLDLTDKIIKVDNSEKIITM----------A 191 Query: 230 RNDAQMETFRTMVREPATVPEQVDGLQQSDDILRLLPPELATLGITELEYEFYRRLVEKQ 289 N + T V + ++ G ++ +IL +P E+A FY +L Sbjct: 192 TNLIDIMPKFTRVMKNKNNLGELGGYYKTRNILHAIPREVAMPDE-----IFYSKLA--- 243 Query: 290 LLTYRLHGESWREKVIERPVVHKDYDEQPRGPFIVCVDTSGSMG-GFNEQCAKAFCLALM 348 G + REKVI G + + +D SGSM G +++ LAL Sbjct: 244 ------SGFTAREKVINSE-----------GSYYILLDKSGSMYEGTKTVWSRSVALALY 286 Query: 349 RIALAENRRCYIMLFSTEIVRYELSGPQGIEQAIRFLSQQFRGGTDLASCFRAIMERLQS 408 R+A + R+ ++ F + L+ P I L+ + GT + ++ ++S Sbjct: 287 RLATIKKRKYFLRFFDNKPHEV-LTRPYDIIN--NILTVEANKGTCIECAITTAIDDIKS 343 Query: 409 REWFD-ADAVVISDFIAQRLPDDVTSKVKELQRVHQHRFHAVAMSAHGKPGIMRIFD 464 D ++I+D + ++K + + + R +V ++ + I D Sbjct: 344 NRHLDTNTIIIITDGEDHVNKN----QLKLMLKKYNIRLISVMINGS-NDDLKNISD 395 >UniRef50_A4WJJ3 Putative uncharacterized protein n=5 Tax=Thermoproteaceae RepID=A4WJJ3_PYRAR Length = 431 Score = 75.5 bits (184), Expect = 5e-12, Method: Composition-based stats. Identities = 53/362 (14%), Positives = 113/362 (31%), Gaps = 68/362 (18%) Query: 104 SPWAEQARQLVDANSTITSALHTLFLQRWRLSLIVQATTLNQQLLEEEREQLLSEVQERM 163 S + ++ N ++ + L R SL+ + + E++++ ++ E Sbjct: 77 SDVYHEISKISRYNYQVSKSASVKLL-RAYNSLLSRIERGAVEGFEDQKQDF-RDLSENQ 134 Query: 164 TLSGQLEPIL----------ADNNTAAGRLWDMSAGQLKRG---DYQLIVKYGEFLNEQP 210 L ++ +L + + G+ D I Y L + Sbjct: 135 QLRNEISNLLRFYMGNVRNIEKLRKSMTKALGNEVGKETAELLFDID-IDPYRARLAKI- 192 Query: 211 ELKRLAEQLGRSREAKSIPRNDAQMETFRTMVREPATVPEQVDGLQQSDDILRLLPPELA 270 L+ L E L +E + + R ++ D+ + L+ Sbjct: 193 -LESLVEMLSAVKEEVDQGDVQERRGVISGVTR-----------IRTYSDLQK--ATNLS 238 Query: 271 TLGITELEYEFYRRLVEKQLLTYRLHGESWREKVIERPVVHKDYDEQPRGPFIVCVDTSG 330 + F +L K L Y L ++ + VD SG Sbjct: 239 KAIYLQSRELFGYKLATKSLSIYDLALDTRDR-------------------VYLLVDKSG 279 Query: 331 SM-----------GGFNEQCAKAFCLALMRIALAENRRCYIMLFSTEIVRYELSGPQGIE 379 SM A A +A+M+ + ++ F ++V ++ + I Sbjct: 280 SMFYSLYDGVAMDMTQKITWATALAIAVMKKSKRT-----VLRFFDQMVYPPITNVKDII 334 Query: 380 QAIRFLSQQFRGGTDLASCFRAIMERLQSREWFDADAVVISDFIAQRLPDDVTSKVKELQ 439 +++ L GGTD+ + + + + + V+I+D + +V K Sbjct: 335 RSL--LRVLPLGGTDITAAVHTAVRDAKQQSLHNYKLVIITDGEDDMIHPEVLKMAKTAF 392 Query: 440 RV 441 R Sbjct: 393 RE 394 >UniRef50_A6G7V2 von Willebrand factor, type A n=1 Tax=Plesiocystis pacifica SIR-1 RepID=A6G7V2_9DELT Length = 820 Score = 73.6 bits (179), Expect = 2e-11, Method: Composition-based stats. Identities = 32/146 (21%), Positives = 61/146 (41%), Gaps = 10/146 (6%) Query: 321 PFIVCVDTSGSMGGFNEQCAKAFCLALMRIALAENRRCYIMLFSTEIVRYELSG----PQ 376 IV +DTSGSM G A+A AL+R +R ++ FS+ + R+ + Sbjct: 303 DLIVLLDTSGSMRGEPLAHAQAVTEALIRSLRDRDR-LELVEFSSRVRRWSQAPASMSAA 361 Query: 377 GIEQAIRFL-SQQFRGGTDLASCFRAIMERLQSREWFDADAVVISDFIAQRLPDDVTSKV 435 E+A+R++ + + GGT + A + L+ ++I+D + + + Sbjct: 362 KREEALRWVGALRASGGTHMRDGILAALASLRPEAQRQ--ILLITDGLIAF--ESEIVQA 417 Query: 436 KELQRVHQHRFHAVAMSAHGKPGIMR 461 R R H + + + + R Sbjct: 418 ARQHRPPGCRVHTLGIGSSVNRSLTR 443 >UniRef50_A6CIG8 Putative uncharacterized protein n=1 Tax=Bacillus sp. SG-1 RepID=A6CIG8_9BACI Length = 931 Score = 73.2 bits (178), Expect = 2e-11, Method: Composition-based stats. Identities = 33/173 (19%), Positives = 66/173 (38%), Gaps = 10/173 (5%) Query: 298 ESWREKVIERPVVHKDYDEQPRGPFIVCVDTSGSMGGFNEQCAKAFCLALMRIALAENRR 357 ++ EK++ + K E P ++ +D SGSM G+ Q AK + + L E Sbjct: 386 KTPIEKLLPVDMDLKGKKELPSLGMVIVLDRSGSMAGYKIQLAKEAAIRSAEL-LREKDT 444 Query: 358 CYIMLFSTE----IVRYELSGPQGIEQAIRFLSQQFRGGTDLASCFRAIMERLQSREWFD 413 + F I + + + + I L+ GGT++ E+L E Sbjct: 445 LGFIAFDDRPWQIIDTEPIKDKEKVIEKINGLTS--GGGTNIFPSLELAYEQLTPLELQR 502 Query: 414 ADAVVISDFIAQRLPDDVTSKVKELQRVHQHRFHAVAMS-AHGKPGIMRIFDH 465 ++++D + PD +T+ + + + VA+ + + D Sbjct: 503 KHIILLTDGQSATSPDYLTTIQEG--KENNITLSTVAIGEGSDSVLLEELSDE 553 >UniRef50_A0P2E4 Von Willebrand factor type A like domain n=1 Tax=Labrenzia aggregata IAM 12614 RepID=A0P2E4_9RHOB Length = 772 Score = 71.3 bits (173), Expect = 7e-11, Method: Composition-based stats. Identities = 36/175 (20%), Positives = 74/175 (42%), Gaps = 13/175 (7%) Query: 293 YRLHGESWREKVIERPVVHKDYDEQPRGPFIVCVDTSGSMGGFNEQCAKAFCLALMRIAL 352 Y G + +IE P + + R + +DTSGSM G + +K F A ++ AL Sbjct: 338 YEADGGGYFSLLIEPPKLPAEDMIGQR-ELVFVLDTSGSMSGQPIEASKTFMTAAIK-AL 395 Query: 353 AENRRCYIMLFSTEIVRYE----LSGPQGIEQAIRFLS-QQFRGGTDLASCFRAIMERLQ 407 + I+ FS + ++ L+ + ++A++F++ GGT++ A ++ Q Sbjct: 396 RPDDYFRILHFSNDTSQFAGQAVLATERNKQKALKFVADLSAGGGTEINQAVNAAFDQAQ 455 Query: 408 SREWFDADAVVISDFIAQRLPDDVTSKVKEL-QRVHQHRFHAVAMSAHGKPGIMR 461 V ++D D + +K + R+ + R +A + ++ Sbjct: 456 PDNTTRI-VVFLTDGYIG----DEATVIKSIANRIGKARIYAFGVGNSVNRFLLD 505 >UniRef50_D1A557 Vault protein inter-alpha-trypsin domain protein n=2 Tax=Streptosporangineae RepID=D1A557_THECD Length = 795 Score = 71.3 bits (173), Expect = 8e-11, Method: Composition-based stats. Identities = 40/193 (20%), Positives = 67/193 (34%), Gaps = 23/193 (11%) Query: 275 TELEYEFYRRLVEKQ--------LLTYRLHGESWREKVIERPVVHKDYDEQPRGPFIVCV 326 L+ +F RL + LT GES + P + +PR ++ + Sbjct: 249 ERLDRDFVLRLAYGRPEQAAASVTLTPDAEGESGTFTLTVLPP-SERCAPRPR-DVVILL 306 Query: 327 DTSGSMGGFNEQCAKAFCLALMRIALAENRRCYIMLFSTEIVR--------YELSGPQGI 378 D SGSM G+ A+ ++ +R ++ F + R + Sbjct: 307 DRSGSMHGWKMVAARRAAARIVDTLTGRDR-FAVLSFDDMVERPAGLDGGLSPATDRNRF 365 Query: 379 EQAIRFLSQQFRGGTDLASCFRAIMERLQSREWFDADAVVISDFIAQRLPDDVTSKVKEL 438 Q RGGT+LA+ R L D V+I+D D + + + Sbjct: 366 RAVEHLAGLQARGGTELAAPLREGAALLDDAG-RDRVLVLITDGQVGN-EDQLLALIDPF 423 Query: 439 QRVHQHRFHAVAM 451 + R HAV + Sbjct: 424 L--NGLRIHAVGI 434 >UniRef50_UPI0001AEB92D inter-alpha-trypsin inhibitor domain-containing protein n=1 Tax=Alteromonas macleodii ATCC 27126 RepID=UPI0001AEB92D Length = 586 Score = 71.3 bits (173), Expect = 9e-11, Method: Composition-based stats. Identities = 34/176 (19%), Positives = 67/176 (38%), Gaps = 15/176 (8%) Query: 294 RLHGESWREKVIERPVVHKDYDEQP-RGPFIVCVDTSGSMGGFNEQCAK-AFCLALMRIA 351 G+ V+ P K D Q +DTSGSMGG AK + LA+ R++ Sbjct: 287 ESKGQHDYALVMLMPPQVKSQDLQDFDRDITFVIDTSGSMGGRPIVDAKESLQLAIDRLS 346 Query: 352 LAENRRCYIMLFSTEIVRYELSGPQGIEQAIRFLSQQ-----FRGGTDLASCFRAIMERL 406 + ++ F+ + R + +G + ++ GGT++A A ++R Sbjct: 347 EKDR--FNVVAFNNDTTRLFETSVEGTTRNKQYARDFVKHLNAGGGTEMAPALNAALKRT 404 Query: 407 QSREWFDADAVVISDFIAQRLPDDVTSKVKELQRVHQ-HRFHAVAMSAHGKPGIMR 461 ++++ V I+D + + +++ R V + + M Sbjct: 405 TTKDFIK-QVVFITDGAVG----NEAALFSQIKNELGDARLFTVGIGSAPNSYFMT 455 >UniRef50_Q09E12 Inter-alpha-inhibitor H4 heavy chain, putative n=1 Tax=Stigmatella aurantiaca DW4/3-1 RepID=Q09E12_STIAU Length = 540 Score = 70.9 bits (172), Expect = 1e-10, Method: Composition-based stats. Identities = 32/179 (17%), Positives = 64/179 (35%), Gaps = 8/179 (4%) Query: 290 LLTYRLHGESWREKVIERPVVHKDYDEQPRGPFIVCVDTSGSMGGFNEQCAKAFCLALMR 349 LLTY+ E + P E +DTSGSM G Q AK + Sbjct: 40 LLTYKQADEPGYFIALIAPKTEVSASEIAAKRVTFVIDTSGSMQGSRMQIAKDALKYCVT 99 Query: 350 IALAENRRCYIMLFSTEIV----RYELSGPQGIEQAIRFLSQ-QFRGGTDLASCFRAIME 404 ++ ++ FST++ + + P+ I++A+ F+ Q + GGT + ++ Sbjct: 100 RLNPQD-TFNVVRFSTDVEALFPALKSAQPENIQKAVAFVEQLEAIGGTAIDEALVRGLQ 158 Query: 405 RLQSREWFDADAVVISDFI--AQRLPDDVTSKVKELQRVHQHRFHAVAMSAHGKPGIMR 461 + + I+D + ++ + R + R + ++ Sbjct: 159 DNDGKSSAPHLLMFITDGQPTIGETDEGAIAQHAKDGRKAKTRLFTFGVGEDLNARLLD 217 >UniRef50_D0KVI6 Vault protein inter-alpha-trypsin domain protein n=1 Tax=Halothiobacillus neapolitanus c2 RepID=D0KVI6_HALNC Length = 671 Score = 70.5 bits (171), Expect = 1e-10, Method: Composition-based stats. Identities = 39/179 (21%), Positives = 70/179 (39%), Gaps = 15/179 (8%) Query: 290 LLTYRLHGESWREKVIERPVVHKDYDEQPRGPFIVCVDTSGSMGGFNEQCAKAFCLALMR 349 L+TY +GE + + + P + R V VD SGSM GF A L+ Sbjct: 282 LMTYEWNGEHYFLMMAQPPKRVAPTEVMKREYLFV-VDVSGSMYGFPLNTASDLMRELLS 340 Query: 350 IALAENRRCYIMLFSTEIVRYELSG----PQGIEQAIRFL-SQQFRGGTDLASCFRAIME 404 +L I+ FS + P+ +++A+ + S Q GGT+L + Sbjct: 341 -SLKPQETFNILFFSGGSRVLSPTPLQATPENLQRAMTMMRSIQGGGGTELLPALKTAFA 399 Query: 405 RLQSREWFDADAVVISDFIAQRLPDDVTSKVKELQRVHQHR--FHAVAMSAHGKPGIMR 461 ++ + + VVI+D DV + +L + + + A + + +M Sbjct: 400 MPRTEDTARS-IVVITDGYV-----DVERQAYDLIKQNLNSTNLFAFGIGSSVNRYLME 452 >UniRef50_D0LTR8 von Willebrand factor type A n=1 Tax=Haliangium ochraceum DSM 14365 RepID=D0LTR8_HALO1 Length = 903 Score = 70.5 bits (171), Expect = 1e-10, Method: Composition-based stats. Identities = 37/173 (21%), Positives = 60/173 (34%), Gaps = 7/173 (4%) Query: 295 LHGESWREKVIERPVVHKDYDEQPRGPFIVCVDTSGSMGGFNEQCAKAFCLALMRIALAE 354 G EK++ + EQP + VD SGSM G + AK A + Sbjct: 434 YQGT-RIEKIMPVRFDSEKQREQPHVAIALVVDRSGSMSGLKIEAAKESARATAEVLSPS 492 Query: 355 NRRCYIMLFSTEIVRY--ELSGPQGIEQAIRFLSQQFRGGTDLASCFRAIMERLQSREWF 412 + ++ F + + A Q GGT++ R E LQ Sbjct: 493 D-LITVVAFDNQPTTIVRLQRASNRMRIATDIARLQAGGGTNIYPALREAYEILQGANAK 551 Query: 413 DADAVVISDFIAQRLPDDVTSKVKELQRVHQHRFHAVAMSAHGKPGIMRIFDH 465 +V+SD A P D + + + R + AV + + + I D+ Sbjct: 552 VKHVIVLSDGQA---PYDGIADLCQEMRSARITVSAVGIGDADRNLLNLITDN 601 >UniRef50_B9XQ17 Vault protein inter-alpha-trypsin domain protein n=1 Tax=bacterium Ellin514 RepID=B9XQ17_9BACT Length = 806 Score = 69.3 bits (168), Expect = 3e-10, Method: Composition-based stats. Identities = 37/180 (20%), Positives = 67/180 (37%), Gaps = 9/180 (5%) Query: 289 QLLTYRLHGESWREKVIERPVVHKDYDEQPRGPFIVCVDTSGSMGGFNEQCAKAFCLALM 348 L+ Y+ E ++ P V + + +DTSGSM G + AK L Sbjct: 281 NLMAYKTGDEDGYFLLLASPGVDAKAKQIVSKDVVFVLDTSGSMSGKKMEQAKK-ALQFC 339 Query: 349 RIALAENRRCYIMLFSTEIVR----YELSGPQGIEQAIRFLS-QQFRGGTDLASCFRAIM 403 +L + R I+ FSTE + E+A F+ + GGT + + + Sbjct: 340 VESLNDGDRFEIIRFSTESEPLFDKLAAVSKENREKAGDFIKNLKAMGGTAIDEALKKAL 399 Query: 404 ERLQSREWFDADAVVISDFIAQRLPDDVTSKVKELQRVHQH--RFHAVAMSAHGKPGIMR 461 L+S+E V ++D + D +K +Q ++ R + ++ Sbjct: 400 S-LESKEGRPFVVVFLTDGLPTVGTTDEDQILKGMQERNKEKRRIFCFGIGTDVNTHLLD 458 >UniRef50_A8M9M1 von Willebrand factor type A n=1 Tax=Caldivirga maquilingensis IC-167 RepID=A8M9M1_CALMQ Length = 474 Score = 69.3 bits (168), Expect = 3e-10, Method: Composition-based stats. Identities = 53/345 (15%), Positives = 116/345 (33%), Gaps = 56/345 (16%) Query: 152 REQLLSEVQERMTLSGQLEPIL-----ADNNTAAGRLWDMSAGQLKRGDYQLIVKY---- 202 + QL + +GQL ++ A ++D+ G + ++ + + Sbjct: 119 KSQLSPNNDSKDGGTGQLINQELGTGNESSDDVANVIYDVFYGSVGTMNFINLAQLLNMF 178 Query: 203 ----GEFLNEQPELKRLAEQL---------GR-----SREAKSIPRNDAQMETFR-TMVR 243 + L++LA L G+ + ++ R R + Sbjct: 179 VNPMAHITEKVKVLRKLARYLASYGLLPHQGKGGSRVFKALDNVAREPTIGNALRVSRFT 238 Query: 244 EPATVPEQVDGLQQSDDILRLLPPELATLGITELEYEFYRRLVEKQLLTYRLHGESWREK 303 E + P + G+++ +G + ++K + + + Sbjct: 239 EHSNYPTYITGVREYR------------IGDPAYRID-----LDKTSMNM-VRKTFLNKP 280 Query: 304 VIERPVVHKDYDEQPRGPFIVCVDTSGSM---GG--FNEQCAKAFCLALMRIALAENRRC 358 + R +V ++Y + ++C+DTSGSM G AK + +R N R Sbjct: 281 MSTRDIVVREYADVKLMDIVLCLDTSGSMKEFSGAYMKMDIAKEAIVKYIRYLSRTNDRL 340 Query: 359 YIMLFS--TEIVRYELSGPQGIEQAIRFLSQQF-RGGTDLASCFRAIMERLQSREWFDAD 415 ++LF+ +I+ S + I + + GGT++A+ L + + Sbjct: 341 SMVLFNFRADILWGPHSVKKYINEMEEMSRYIYPGGGTNIANALEKARIILSKSNYPNKH 400 Query: 416 AVVISDFIAQRLPDDVTSKVKELQRVHQHRFHAVAMSAHGKPGIM 460 + I+D + VK R VA+ + ++ Sbjct: 401 IICITDGRTVNASSCIKEAVK--LRRMGVTLSTVAVGDNSDFDLL 443 >UniRef50_B2HK18 Conserved membrane protein n=3 Tax=Mycobacterium RepID=B2HK18_MYCMM Length = 983 Score = 69.3 bits (168), Expect = 3e-10, Method: Composition-based stats. Identities = 34/202 (16%), Positives = 69/202 (34%), Gaps = 20/202 (9%) Query: 273 GITELEYEFYRRLVEKQL-LTYRLHGESWREKVIERPVVHKDYDEQPRGPFIVCVDTSGS 331 L ++ + + L L G+ ++ P ++ +D S S Sbjct: 253 RDFVLRLDYDAQELASSLVLVPDADGDEGTYQLTVLPPAGVAAPRPRH--LVLVLDRSRS 310 Query: 332 MGGFNEQCAKAFCLALMRIALAENRRCYIMLFSTEIVRYELSGPQGIEQAIR-------- 383 M G+ A+ ++ +++R ++ F I Y + P G+ +A Sbjct: 311 MAGWKMTAARRAASRIVDALTSDDR-FAVLTFDDGI-EYPVGLPAGLTEASDRHRYRAVE 368 Query: 384 -FLSQQFRGGTDLASCFRAIMERLQSREWFDAD---AVVISDFIAQRLPDDVTSKVKELQ 439 + RG T++ + R + L + D D ++ISD + +L Sbjct: 369 HLARVEARGDTEMLAPLRRALALLGREQVADTDDAVLILISDGQVGNEDQLLQELSGDLG 428 Query: 440 RVHQHRFHAVAMSAHGKPGIMR 461 R R H + + G +R Sbjct: 429 R---VRLHTIGVDEAVNAGFLR 447 >UniRef50_Q55874 Uncharacterized protein sll0103 n=16 Tax=Cyanobacteria RepID=Y103_SYNY3 Length = 420 Score = 68.9 bits (167), Expect = 4e-10, Method: Composition-based stats. Identities = 32/172 (18%), Positives = 66/172 (38%), Gaps = 10/172 (5%) Query: 297 GESWREKVIERPVVHKDYDEQPRGPFIVCV--DTSGSMGGFNEQCAKAFCLALMRIALAE 354 G ++ + V K D R P +C+ D SGSM G + K+ L L+ + Sbjct: 17 GAPTSQRQLRIAVAAKADDHDRRLPLNLCLVLDHSGSMDGQPLETVKSAALGLIDRLEED 76 Query: 355 NRRCYIMLFSTE----IVRYELSGPQGIEQAIRFLSQQFRGGTDLASCFRAIM-ERLQSR 409 +R ++ F I ++ I +AI + GGT + + + E + + Sbjct: 77 DR-LSVIAFDHRAKIVIENQQVRNGAAIAKAIE--RLKAEGGTAIDEGLKLGIQEAAKGK 133 Query: 410 EWFDADAVVISDFIAQRLPDDVTSKVKELQRVHQHRFHAVAMSAHGKPGIMR 461 E + +++D + +D K+ + ++ H + H ++ Sbjct: 134 EDRVSHIFLLTDGENEHGDNDRCLKLGTVASDYKLTVHTLGFGDHWNQDVLE 185 >UniRef50_B5JQC2 Vault protein inter-alpha-trypsin n=1 Tax=Verrucomicrobiae bacterium DG1235 RepID=B5JQC2_9BACT Length = 808 Score = 68.9 bits (167), Expect = 4e-10, Method: Composition-based stats. Identities = 37/196 (18%), Positives = 71/196 (36%), Gaps = 16/196 (8%) Query: 280 EFYRRLVEK---QL--LTYRLHGESWREKVIERPVVHKDYDEQPRGPFIVCVDTSGSMGG 334 FY L E +L LTYR + + ++ + + F+ +D SGSM G Sbjct: 237 VFYYMLEENLPGRLEVLTYRENEDKPGTFMMVMTPGVDLHPLEGGADFVFALDVSGSMQG 296 Query: 335 FNEQCAKAFCLALMRIALAENRRCYIMLFSTEIVRYE-----LSGPQGIEQAIRFLSQQF 389 A A+ ++ E+R ++ F+ + E R Sbjct: 297 KLHTLASGVKKAIGQL-KPEDR-FRVVAFNNTAFDLNRGWVSATEANLRETFARLDQLNS 354 Query: 390 RGGTDLASCFRAIMERLQSREWFDADAVVISDFIAQRLPDDVTSKVKELQRVHQHRFHAV 449 GGT++ + +ERL + A ++++D + + D +L RF+ Sbjct: 355 NGGTNVYAGVHLALERLDAD--RVATLILVTDGVTNQGIVD-PKAFYKLMHKQDLRFYGF 411 Query: 450 AMSAHGKPGIMR-IFD 464 + +M+ + D Sbjct: 412 LLGNSSNWPLMQLMCD 427 >UniRef50_A0DZ93 Chromosome undetermined scaffold_7, whole genome shotgun sequence n=1 Tax=Paramecium tetraurelia RepID=A0DZ93_PARTE Length = 522 Score = 67.8 bits (164), Expect = 9e-10, Method: Composition-based stats. Identities = 32/177 (18%), Positives = 61/177 (34%), Gaps = 8/177 (4%) Query: 291 LTYRLHGESWREKVIERPVVHKDYDEQPRG-PFIVCVDTSGSMGGFNEQCAKAFCLALMR 349 L+Y + V+ +G I +D SGSM G K L++ Sbjct: 82 LSYSGLPTQGTQAVLLSVQTKNQAITIRQGIDLICLIDHSGSMSGEKMHLVKKSLKHLLK 141 Query: 350 IALAENRRCYIMLFSTEIVRYELSGPQGIEQAIRFL----SQQFRGGTDLASCFRAIMER 405 + +R C ++ F + R E +FL + + G TD+ + + + Sbjct: 142 MLQPNDRLC-LIEFDDQNYRLTRLMRATQENMYKFLIAIDTIEANGATDIGNAMKMALSI 200 Query: 406 LQSREWFD--ADAVVISDFIAQRLPDDVTSKVKELQRVHQHRFHAVAMSAHGKPGIM 460 L+ R + + A ++SD + V + ++ + P IM Sbjct: 201 LKHRRFKNPIASIFLLSDGEDEGAAGRVWNDIQSKNIKEPFTINTFGFGRDCCPKIM 257 >UniRef50_B7AA98 von Willebrand factor type A n=3 Tax=Thermus RepID=B7AA98_THEAQ Length = 706 Score = 67.4 bits (163), Expect = 1e-09, Method: Composition-based stats. Identities = 38/157 (24%), Positives = 57/157 (36%), Gaps = 6/157 (3%) Query: 282 YRRLVEKQLLTYRLHGESWREKVIERPVVHKDYDEQPRGP-FIVCVDTSGSMGGFNEQCA 340 Y R L T G + P +G ++ +D SGSM G A Sbjct: 268 YLRRGGGLLFTATPKGLFFGGWDRALPEDLPLKPLGRKGAALVLVLDVSGSMEGEKLAMA 327 Query: 341 KAFCLALMRIALAENRRCYIMLFSTEIVRY----ELSGPQGIEQAIRFLSQQFRGGTDLA 396 A L L+R A E+ ++LFS+ ++ E LS + GGT L Sbjct: 328 VAGALELVRSAAPED-YLGVVLFSSSPRVLFPPRPMTAQGKKEAESLLLSLRAGGGTVLG 386 Query: 397 SCFRAIMERLQSREWFDADAVVISDFIAQRLPDDVTS 433 FR + LQ +V+SD I + + + Sbjct: 387 GAFREALRLLQDVPVERKALLVLSDGIIFDPKEPILA 423 >UniRef50_B9RR85 Inter-alpha-trypsin inhibitor heavy chain, putative n=1 Tax=Ricinus communis RepID=B9RR85_RICCO Length = 755 Score = 67.4 bits (163), Expect = 1e-09, Method: Composition-based stats. Identities = 29/151 (19%), Positives = 62/151 (41%), Gaps = 13/151 (8%) Query: 322 FIVCVDTSGSMGGFN-EQCAKAFCLALMRIALAENRRCYIMLFSTEIVRY----ELSGPQ 376 + VD SGSM G E A AL ++ + I+ F+ E + EL+ + Sbjct: 327 IVFIVDISGSMEGKPLEGMKNAMSGALAKLNPKD--SFNIIAFNGETYLFSSLMELATEK 384 Query: 377 GIEQAIRFLSQQF--RGGTDLASCFRAIMERLQSREWFDADAVVISDFIAQRLPDDVTSK 434 +E+A+ +++ F GGT+++ ME + + + +++D + + Sbjct: 385 TVERAVEWMNLNFIAGGGTNISVPLNQAMEMVSNTQGSLPVIFLVTDGAVED-ERHICDS 443 Query: 435 VKELQRVHQH---RFHAVAMSAHGKPGIMRI 462 +K+ R R + + + +R+ Sbjct: 444 MKKYVRGKGAICPRIYTFGIGTYCNHYFLRM 474 >UniRef50_UPI000174639B hypothetical protein VspiD_18615 n=1 Tax=Verrucomicrobium spinosum DSM 4136 RepID=UPI000174639B Length = 868 Score = 66.2 bits (160), Expect = 2e-09, Method: Composition-based stats. Identities = 31/165 (18%), Positives = 58/165 (35%), Gaps = 10/165 (6%) Query: 301 REKVIERPVVHKDYDEQPRGPFIVCVDTSGSMGGFNEQCAKAFCLALMRIALAENRRCYI 360 E+V+ + D +E+ + +D SGSM G + AK+ +A + L N + Sbjct: 394 IEEVLPVRLKAPDEEEKQSSALALVIDRSGSMSGEKLEMAKSAAIATAEV-LTRNDSIGV 452 Query: 361 MLFSTEIVRY----ELSGPQGIEQAIRFLSQQFRGGTDLASCFRAIMERLQSREWFDADA 416 F +E L+ + A + GGT+L F LQ + Sbjct: 453 YAFDSEAHVVVPMTRLTSSSAV--AGQIAGLTSGGGTNLHPAFTEARNALQRTKAKIKHM 510 Query: 417 VVISDFIAQRLPDDVTSKVKELQRVHQHRFHAVAMSAHGKPGIMR 461 ++++D + R VA+ G+++ Sbjct: 511 IILTDGQTSGQG---YEALASQCRAEGVTISTVAIGDGAHVGLLQ 552 >UniRef50_A3W9L9 Putative uncharacterized protein n=2 Tax=Erythrobacter RepID=A3W9L9_9SPHN Length = 740 Score = 66.2 bits (160), Expect = 3e-09, Method: Composition-based stats. Identities = 33/174 (18%), Positives = 58/174 (33%), Gaps = 12/174 (6%) Query: 294 RLHGESWREKVIERPVVHKDYDEQPRGPFIVCVDTSGSMGGFNEQCAKAFCLALMRIALA 353 + HGE P + E P I +D SGSM G + A+ L + Sbjct: 318 QRHGELEYVMATITPPALERVGEAPPREMIFVIDNSGSMAGESMPAARRSLLYALETLRP 377 Query: 354 ENRRCYIMLFSTEIVRYELSGPQGIEQAIRFLSQQFR-----GGTDLASCFRAIMERLQS 408 ++R ++ F + S Q + I GGT++ RA + Sbjct: 378 QDR-FNVIRFDDTMTELFASAVQASDSNIAAAKTFTHNLMANGGTEMLPALRAALRDRAP 436 Query: 409 REWFDADAVVISDFIAQRLPDDVTSKVKELQR-VHQHRFHAVAMSAHGKPGIMR 461 E + ++D + ++E+ R R V + + +MR Sbjct: 437 DERVR-QVIFLTDGALS----NEADMMEEINRNRKDSRVFMVGIGSAPNTYLMR 485 >UniRef50_A1VI76 Vault protein inter-alpha-trypsin domain protein n=3 Tax=Burkholderiales RepID=A1VI76_POLNA Length = 701 Score = 65.9 bits (159), Expect = 3e-09, Method: Composition-based stats. Identities = 35/200 (17%), Positives = 66/200 (33%), Gaps = 17/200 (8%) Query: 273 GITELEYEFYRRLVEKQLLTYR------LHGESWREKVIERPVVHKDYDEQPRGPFIVCV 326 L+Y +E ++ Y+ GE++ +IE P PR +I V Sbjct: 274 RDFILDYRLAGERIESGVMLYQGTPGNGASGENFFLAMIEPPKQVAAQAISPR-DYIFVV 332 Query: 327 DTSGSMGGFNEQCAKAFCLALMRIALAENRRCYIMLFSTEIV-----RYELSGPQGIEQA 381 D SGSM GF AK L+ + ++LFS + + Sbjct: 333 DISGSMHGFPLDTAKTLMRELIGKLRPSD-TFNVLLFSGSNRFLSPASVPATQANIEQAV 391 Query: 382 IRFLSQQFRGGTDLASCFRAIMERLQSREWFDADAVVISDFIAQRLPDDVTSKVKELQRV 441 GGT+L + + ++ + VV++D + + L + Sbjct: 392 RTIDEMGGGGGTELIPALKRVYAEPKAADVSR-TVVVVTDGFVTVEREAFELVRRNLSQA 450 Query: 442 HQHRFHAVAMSAHGKPGIMR 461 + + + + +M Sbjct: 451 N---LFSFGIGSSVNRHLME 467 >UniRef50_B5JSY7 Inter-alpha-trypsin inhibitor domain protein n=1 Tax=gamma proteobacterium HTCC5015 RepID=B5JSY7_9GAMM Length = 670 Score = 65.5 bits (158), Expect = 4e-09, Method: Composition-based stats. Identities = 33/173 (19%), Positives = 62/173 (35%), Gaps = 11/173 (6%) Query: 294 RLHGESWREKVIERPVVHKDYDEQPRGPFIVCVDTSGSMGGFNEQCAKAFCLALMRIALA 353 GE + ++ P PR + +DTSGSM G AK + Sbjct: 295 EYKGEHYALVMLRTPDEMTSGPRMPR-EVVFVIDTSGSMAGQRMYHAKQALSQAVERLSP 353 Query: 354 ENRRCYIMLFSTEIVR----YELSGPQGIEQAIRFL-SQQFRGGTDLASCFRAIMERLQS 408 ++R ++ F+ + R + ++QA+ ++ Q GGT + + Sbjct: 354 DDR-FNVVEFNNQHSRLFSSMRSASAINVKQALNWVGRLQGGGGTMMLPAVEDALSVRSD 412 Query: 409 REWFDADAVVISDFIAQRLPDDVTSKVKELQRVHQHRFHAVAMSAHGKPGIMR 461 + ++I+D A + +V E QR R V + ++R Sbjct: 413 PAYLR-QVILITD--ASVGNEAEILRVVERQR-KGARLFTVGIGVSPNSYLLR 461 >UniRef50_B1HPN0 Putative uncharacterized protein n=2 Tax=Bacillaceae RepID=B1HPN0_LYSSC Length = 825 Score = 65.5 bits (158), Expect = 5e-09, Method: Composition-based stats. Identities = 29/166 (17%), Positives = 60/166 (36%), Gaps = 6/166 (3%) Query: 298 ESWREKVIERPVVHKDYDEQPRGPFIVCVDTSGSMGGFNEQCAKAFCLALMRIALAENRR 357 ++ E ++ + K ++ P ++ +D SGSM G + AK + + E+ Sbjct: 345 KTPIETLLPVEMEIKGKEQLPSLGLVIVLDRSGSMSGSKLELAKEAAARSVEMLRDED-T 403 Query: 358 CYIMLFSTEIVR-YELSGPQGIEQAIR-FLSQQFRGGTDLASCFRAIMERLQSREWFDAD 415 + F E E+A+ LS GGT++ E L + Sbjct: 404 LGFIAFDDRPWEIIETGPLNNKEEAVDTILSVTPGGGTEIYGSLAKAYENLADMKLQRKH 463 Query: 416 AVVISDFIAQRLPDDVTSKVKELQRVHQHRFHAVAMSAHGKPGIMR 461 ++++D +Q D + E + + VA+ ++ Sbjct: 464 IILLTDGQSQPGNYD---DLIEQGKDNGITLSTVAIGQDADANLLE 506 >UniRef50_C5EGH1 von Willebrand factor n=2 Tax=Clostridiales RepID=C5EGH1_9FIRM Length = 681 Score = 65.5 bits (158), Expect = 5e-09, Method: Composition-based stats. Identities = 41/188 (21%), Positives = 67/188 (35%), Gaps = 23/188 (12%) Query: 255 LQQSDDILRLLPPELATLGITELE---YEFYRRL------VEKQLLTYRLHGESWREKVI 305 ++QS D TL +F R V L+ E++ ++ Sbjct: 245 IEQSAD-----SSAHITLKDPADYGGNRDFILRYQLAGQTVNSGLMLNTGEKENFFLLMV 299 Query: 306 ERPVVHKDYDEQPRGPFIVCVDTSGSMGGFNEQCAKAFCLALMRIALAENRRCYIMLFST 365 + P PR +I +D SGSM G+ AK ++ L E ++LFS Sbjct: 300 QPPERVPAEAIPPR-EYIFVLDVSGSMFGYPLDTAKELIRNMVSN-LRETDTFNLILFSN 357 Query: 366 EIVRYELSG----PQGIEQAIRFL-SQQFRGGTDLASCFRAIMERLQSREWFDA--DAVV 418 + +R + +E+AI + Q+ GGT+LA + VV Sbjct: 358 DAIRMSARSLPATDENVERAINLINRQKGGGGTELAPALEKAVGIPMDSGAGSVSRSVVV 417 Query: 419 ISDFIAQR 426 I+D Sbjct: 418 ITDGYMSD 425 >UniRef50_Q22SJ4 von Willebrand factor type A domain containing protein n=8 Tax=Tetrahymena thermophila RepID=Q22SJ4_TETTH Length = 646 Score = 65.1 bits (157), Expect = 6e-09, Method: Composition-based stats. Identities = 35/181 (19%), Positives = 68/181 (37%), Gaps = 14/181 (7%) Query: 294 RLHGESWREKVIERPVVHKDYDEQPRGPFIVCVDTSGSMGGFNEQCAKAFCLALMRIALA 353 + + ++ V + +P + +D SGSM G Q K L L+ + + Sbjct: 182 KTQDSTNDILEEQKEQVKQVEQSRPSIDLVCVIDNSGSMQGEKIQNVKTTLLQLLDMLNS 241 Query: 354 ENRRCYIMLFST------EIVRYELSGPQGIEQAIRFLSQQFRGGTDLASCFRAIMERLQ 407 +R ++LF++ + + + I+ I S GGTD+ S LQ Sbjct: 242 NDR-LSLILFNSYPTLLCNLRKVDDENTPNIQSIIN--SITADGGTDINSGMLMAFNILQ 298 Query: 408 SREWFD--ADAVVISDFIAQRLPDDVTSKVKELQ--RVHQHRFHAVAMS-AHGKPGIMRI 462 R++F+ + ++SD + + + Q + H+ H P + RI Sbjct: 299 KRQFFNPVSSIFLLSDGQDNGADEKIKKYINSNQSLKNECFSIHSFGFGSDHDGPLMNRI 358 Query: 463 F 463 Sbjct: 359 C 359 >UniRef50_C7PW75 Vault protein inter-alpha-trypsin domain protein n=1 Tax=Catenulispora acidiphila DSM 44928 RepID=C7PW75_CATAD Length = 1033 Score = 65.1 bits (157), Expect = 6e-09, Method: Composition-based stats. Identities = 27/135 (20%), Positives = 49/135 (36%), Gaps = 13/135 (9%) Query: 326 VDTSGSMGGFNEQCAKAFCLALMRIALAENRRCYIMLFSTEIVR--------YELSGPQG 377 +D SGSMGG+ A+ ++ AE+R ++ F ++ E + Sbjct: 323 LDRSGSMGGWKMTAARRAAARIVDTLTAEDR-FAVLTFDDQMETPDGLPTGLSEATDRHR 381 Query: 378 IEQAIRFLSQQFRGGTDLASCFRAIMERLQSRE-WFDADAVVISDFIAQRLPDDVTSKVK 436 + RGGT++ R L D ++I+D +T+ Sbjct: 382 FRAVQHLATVDARGGTEMEPPLRRAATLLSDDNPDRDRVLILITDGQVGNEDRLLTTLSP 441 Query: 437 ELQRVHQHRFHAVAM 451 +L + R H V + Sbjct: 442 KLTHI---RVHTVGI 453 >UniRef50_C4DQN3 Uncharacterized protein containing a von Willebrand factor type A (VWA) domain n=1 Tax=Stackebrandtia nassauensis DSM 44728 RepID=C4DQN3_9ACTO Length = 831 Score = 64.7 bits (156), Expect = 7e-09, Method: Composition-based stats. Identities = 37/204 (18%), Positives = 64/204 (31%), Gaps = 21/204 (10%) Query: 275 TELEYEFYRRL-------VEKQLLTYRLHGESWREKVIERPVVHKDYDEQPRGPFIVCVD 327 + +F R V LLT E + D +V +D Sbjct: 252 ERVNRDFILRFDYGESGDVAGSLLTAPDENEPTSGTFQLTAIPPSDLPRARPRDVVVLLD 311 Query: 328 TSGSMGGFNEQCAKAFCLALMRIALAENRRCYIMLFSTEIVRYELSGPQGIE-------- 379 SGSMGG+ A+ ++ + +R + F T + E P G+ Sbjct: 312 RSGSMGGWKMVAARRAAARIVDTLSSADR-FAVRCFDTAMTSPEGLDPNGLSAGTDRNRF 370 Query: 380 -QAIRFLSQQFRGGTDLASCFRAIMERLQSREW-FDADAVVISDFIAQRLPDDVTSKVKE 437 + RGGTD+ ++ L + E D ++++D + Sbjct: 371 RAVEHLAGTETRGGTDILKPLSTAVDLLTAGEKGRDRVIILVTDGQVGNEDQILRELTG- 429 Query: 438 LQRVHQHRFHAVAMSAHGKPGIMR 461 R+ R H V + G + Sbjct: 430 --RLSGMRVHVVGIDKAVNAGFLH 451 >UniRef50_A0LHW4 Vault protein inter-alpha-trypsin domain protein n=3 Tax=Deltaproteobacteria RepID=A0LHW4_SYNFM Length = 812 Score = 64.7 bits (156), Expect = 7e-09, Method: Composition-based stats. Identities = 36/195 (18%), Positives = 70/195 (35%), Gaps = 19/195 (9%) Query: 279 YEFYRRL------VEKQLLTYRLHGESWREKVIERPVVHKDYDEQPRGPFIVCVDTSGSM 332 +F + ++ LL YR E++ ++ P + R +I VD SGSM Sbjct: 286 RDFVLKYRLSGESIDSGLLLYRGKDENFFLLTVQPPKRVVEAAIPAR-EYIFIVDVSGSM 344 Query: 333 GGFNEQCAKAFCLALMRIALAENRRCY-IMLFSTEIVRY-ELSGPQGIEQAIRFLSQ--- 387 GF + +K L+ C+ +MLFS + E S P + R + Sbjct: 345 HGFPLEISKRLLTDLIGGLKPT--DCFNVMLFSGDSTVMAERSVPASADNVRRAVEMIGR 402 Query: 388 -QFRGGTDLASCFRAIMERLQSREWFDADAVVISDFIAQRLPDDVTSKVKELQRVHQHRF 446 Q GGT+L + + + + V+ +D + ++ ++ + F Sbjct: 403 RQGGGGTELLPALKKALSLPRKEGVSRS-MVIATDG-FVTVEEEAFELIRS--HIGDANF 458 Query: 447 HAVAMSAHGKPGIMR 461 + ++ Sbjct: 459 FPFGIGTSVNRMLIE 473 >UniRef50_Q114A2 von Willebrand factor, type A n=3 Tax=Cyanobacteria RepID=Q114A2_TRIEI Length = 1204 Score = 64.7 bits (156), Expect = 8e-09, Method: Composition-based stats. Identities = 32/185 (17%), Positives = 64/185 (34%), Gaps = 21/185 (11%) Query: 295 LHGESWREKVIERPVVHKDYDEQPRGPFIVCVDTSGSMGGFNEQ-CAKAFCLALMRIALA 353 L E P + + R P I+ +DTS SM G + + + Sbjct: 606 LEPEFVENPEQRLPEPEFVENPENRCPIILLLDTSYSMSGEAITELNQGVKIFQASVKED 665 Query: 354 E----NRRCYIMLFSTEIVRYE--LSGPQGIEQAIRFLSQQFRGGTDLASCFRAIMERLQ 407 E ++ F++EI + ++ + I + + G T + +E L+ Sbjct: 666 ELASLRVEIAVITFNSEIEVVQDFVTVDKFIPKTLE-----ASGVTHMGKAIEKALELLE 720 Query: 408 SRE--WFDAD-------AVVISDFIAQRLPDDVTSKVKELQRVHQHRFHAVAMSAHGKPG 458 R+ + ++D +I+D D K++E + + F AV + Sbjct: 721 KRKQDYKNSDIQYYRPWIFLITDGQPTDTWQDAAKKIEEAETNRKLLFFAVGVRDADMET 780 Query: 459 IMRIF 463 + I Sbjct: 781 LSEIS 785 >UniRef50_Q80UW6 Parp4 protein (Fragment) n=11 Tax=Eukaryota RepID=Q80UW6_MOUSE Length = 498 Score = 64.3 bits (155), Expect = 1e-08, Method: Composition-based stats. Identities = 53/277 (19%), Positives = 94/277 (33%), Gaps = 24/277 (8%) Query: 194 GDYQLIVKYGEFLNEQPE---LKRL-AEQLGRSREAKSIPRNDAQMETFRTMVREPATVP 249 +Q E L + E +K + AEQ + +P + + +R+ +T Sbjct: 124 APWQQDKALNENLQDTVETIRIKEIGAEQSFSLAMSIEMPYMIEFISSDTHELRQKSTDC 183 Query: 250 EQVDGLQQSDDILRLLPPELATLGITELEYEFYRRLVEKQLLTYRLHGESWREKVIERPV 309 + V + + L L + + EK+ V + + Sbjct: 184 KAVVSTVEGSSLDSGGFSLHIGLRDAYLPRMWVEKHPEKE--------SEACMLVFQPEL 235 Query: 310 VHKDYDEQPRGPFIVCVDTSGSMGGFNEQCAKAFCLALMRIALAENRRCYIMLFSTEIVR 369 D + + I+C+D S SM G AK L + + L E ++ IM F T Sbjct: 236 ADVLPDLRGKNEVIICLDCSSSMEGVTFTQAKQVALYALSL-LGEEQKVNIMQFGTGYKE 294 Query: 370 YELSGPQGIEQ---AIRFL--SQQFRGGTDLASCFRAIMERLQSREWFDADAVVISDFIA 424 S P+ I A F+ + G TD R + S + + ++ISD Sbjct: 295 L-FSYPKCITDSKMATEFIMSAAPSMGNTDFWKVLRYLSLLYPSEGFRN--ILLISDGHL 351 Query: 425 QRLPDDVTSKVKELQRVHQHRFHAVAMSAHGKPGIMR 461 Q + + +Q R A+ + I+R Sbjct: 352 QSESLTLQLVKRNIQH---TRVFTCAVGSTANRHILR 385 >UniRef50_Q22ST4 von Willebrand factor type A domain containing protein n=1 Tax=Tetrahymena thermophila SB210 RepID=Q22ST4_TETTH Length = 648 Score = 64.3 bits (155), Expect = 1e-08, Method: Composition-based stats. Identities = 23/146 (15%), Positives = 59/146 (40%), Gaps = 5/146 (3%) Query: 321 PFIVCVDTSGSMGGFNEQCAKAFCLALMRIALAENRRCYIMLFSTEIVRYELSG--PQGI 378 +V +D SGSM G Q K + ++ + + +R C + + + + Sbjct: 223 DLVVVIDKSGSMEGEKIQLVKETLVKIINLMSSMDRICIVCFNESGDRPLTFTRVTDENK 282 Query: 379 EQAIRFLSQQF-RGGTDLASCFRAIMERLQSREWFDA--DAVVISDFIAQRLPDDVTSKV 435 + + + Q + GGT+++ ++ +Q+R++ + +++SD + V + + Sbjct: 283 QTLLNLIQQIYAGGGTNISEGINHALKAIQNRKFKNNVTSILLLSDGQDTKAYTRVKAYI 342 Query: 436 KELQRVHQHRFHAVAMSAHGKPGIMR 461 + Q + P ++R Sbjct: 343 DKYQIKDAFNIETIGFGEDHDPKLLR 368 >UniRef50_Q1DE81 von Willebrand factor type A domain protein n=2 Tax=Myxococcales RepID=Q1DE81_MYXXD Length = 860 Score = 63.5 bits (153), Expect = 1e-08, Method: Composition-based stats. Identities = 44/255 (17%), Positives = 86/255 (33%), Gaps = 32/255 (12%) Query: 213 KRLAEQLGRSREAKSIPRNDAQMETFRTMVREPATVPEQVDGLQQSDDILRLLPPELATL 272 L LGR +S T VR + E + D+ + +L Sbjct: 191 MDLLVDLGREVVVESPSHAITTTREEGTRVRVGFSRGE----VSLDRDL-------VLSL 239 Query: 273 GITELEYEFYRRLVEKQLLTYRL-HGESWREKVIERPVVHKDYDEQPRGPFIVCVDTSGS 331 + F L+T+R G + P + P+ + VD SGS Sbjct: 240 RSPDSSAVF------TPLVTHRKGEGGPGTFALTVVPDLLALASAPPKQEVVFVVDVSGS 293 Query: 332 MGGFNEQCAKAFCLALMRIALAENRRCYIMLFSTEIVRYELSGPQGIEQAIR-----FLS 386 M G + A+A L L L E R ++ F ++ ++ + + Sbjct: 294 MAGESLPQAQA-ALRLCLRHLREGDRFNVIAFENRFQSFQPEPVPFTQRTLEEADRWVAA 352 Query: 387 QQFRGGTDLASCFRAIMERLQSREWFDADAVVISDFIAQRLPDDVTSKVKELQRVHQHRF 446 GGT+L + RA ++ D V+++D + + + ++ + R Sbjct: 353 LNADGGTELLAPMRAAVQAAP-----DGVIVLLTDGQVGNEAEILRAVLEARKT---ARV 404 Query: 447 HAVAMSAHGKPGIMR 461 ++ + + ++R Sbjct: 405 YSFGIGTNVSDVLLR 419 >UniRef50_Q7UEC5 60-kDa SS-A/Ro ribonucleoprotein homolog n=3 Tax=Planctomycetaceae RepID=Q7UEC5_RHOBA Length = 552 Score = 63.5 bits (153), Expect = 2e-08, Method: Composition-based stats. Identities = 32/146 (21%), Positives = 54/146 (36%), Gaps = 25/146 (17%) Query: 320 GPFIVCVDTSGSM---------GG--FNEQCAKAFCLALMRIALAENRRCYIMLFSTEIV 368 GP I+ +DTSGSM G +C L I L N ++ F T+ Sbjct: 348 GPVIIGLDTSGSMGCPVTGNRGRGGTSKMRCVDVAALFAAAI-LRRNPDSVVIPFDTQAY 406 Query: 369 RYELSGPQGIEQAIRFLSQQFRGGTDLASCFRAIMERLQSREWFDADAVVISD------- 421 + ++ I LS+ GGTD + F R + + A V++SD Sbjct: 407 KVKVDPSDTILSLSARLSKYGGGGTDCSLPFVEANTRYAKQAF--AGIVLVSDNESWITS 464 Query: 422 ----FIAQRLPDDVTSKVKELQRVHQ 443 Q V ++ ++ ++ + Sbjct: 465 GRRYGYGQNGSTGVMTQWEKFKKTQR 490 >UniRef50_B8CPU4 Von Willebrand factor, type A n=1 Tax=Shewanella piezotolerans WP3 RepID=B8CPU4_SHEPW Length = 710 Score = 63.5 bits (153), Expect = 2e-08, Method: Composition-based stats. Identities = 30/146 (20%), Positives = 57/146 (39%), Gaps = 11/146 (7%) Query: 322 FIVCVDTSGSMGGFNEQCAKA-FCLALMRIALAENRRCYIMLFSTEIVRYELSG----PQ 376 I+ +DTSGSM G + AKA AL ++ + I+ F++ + + + Sbjct: 351 LILVIDTSGSMSGEAIEQAKASIIYALAGLSAQD--SFNILQFNSNVYALSDTPLNASAK 408 Query: 377 GIEQAIRFL-SQQFRGGTDLASCFRAIMERLQSREWFDADAVVISDFIAQRLPDDVTSKV 435 I +A ++ Q GGT+++ + + + + I+D P T Sbjct: 409 NIGRAQAYVQRLQANGGTEMSLALDKALSQQDANRERLRQVLFITDGAVGNEPQLFTQIR 468 Query: 436 KELQRVHQHRFHAVAMSAHGKPGIMR 461 +LQ Q R + + M+ Sbjct: 469 NQLQ---QSRLFTIGIGDAPNAHFMQ 491 >UniRef50_B3QTN9 Vault protein inter-alpha-trypsin domain protein n=2 Tax=Bacteria RepID=B3QTN9_CHLT3 Length = 837 Score = 63.5 bits (153), Expect = 2e-08, Method: Composition-based stats. Identities = 40/186 (21%), Positives = 65/186 (34%), Gaps = 11/186 (5%) Query: 273 GITELEYEFYRRLVEKQLLTYRLHGESWREKVIERPVVHKDYDEQPRGPFIVCVDTSGSM 332 L Y ++ LL + E++ ++ P + R + VD SGSM Sbjct: 290 RDYILRYRLAGNQIQSGLLLFEGEKENFFLATVQPPKRVTEKMIPNREYIYI-VDVSGSM 348 Query: 333 GGFNEQCAKAFCLALMRIALAENRRCYIMLFSTEIVRY-ELSGP---QGIEQAIRFLS-Q 387 G +K L+ ++LFS E S P + IE+A L + Sbjct: 349 FGQPIAISKELMKKLLGRLRPTE-TFNLLLFSGGSKLLSEKSLPATDKNIEKAFYALENE 407 Query: 388 QFRGGTDLASCFRAIMERLQSREWFDADAVVISDFIAQRLPDDVTSKVKELQRVHQHRFH 447 GGT+L + L +E VVI+D + + K L + + Sbjct: 408 HGGGGTELLRALNRALG-LPKKEAGSRTFVVITDGYVSFEVETFETIRKNLNKAN---LF 463 Query: 448 AVAMSA 453 AV + Sbjct: 464 AVGIGN 469 >UniRef50_Q8TY27 Uncharacterized conserved protein n=1 Tax=Methanopyrus kandleri RepID=Q8TY27_METKA Length = 447 Score = 63.2 bits (152), Expect = 2e-08, Method: Composition-based stats. Identities = 31/166 (18%), Positives = 61/166 (36%), Gaps = 6/166 (3%) Query: 303 KVIERPVVHKDYDEQPRGPFIVCVDTSGSMGGFNEQCAKAFCLALMRIALAENRRCYIML 362 K + + +++ + + +D+SGSM G + A AL EN + Sbjct: 254 KNVPLTLYRREFRNKDEPKVAILLDSSGSMSGDKMEVAATLAAALFETVGIEN--IGLWA 311 Query: 363 FSTEIVRY-ELSGPQGIEQAIR-FLSQQFRGGTDLASCFRAIMERLQSREWFDADAVVIS 420 F +E+ + + + I L GGTD +++ L++ ++ + I+ Sbjct: 312 FRSEVHQLKDFEEVINRRKLIEKILGIPAGGGTDPVKPLIKVLDSLENVDYDKCKIIYIT 371 Query: 421 DFIAQRLPDDVTSKVKELQRVHQHRFHAVAMSAHGKPGIMRIFDHI 466 D I + DD L +A+ + + IF I Sbjct: 372 DAIF--MHDDFIRIRNLLSDRDDVELYALLIKDEFEHTGPTIFKRI 415 >UniRef50_C0CQI8 Putative uncharacterized protein n=1 Tax=Blautia hydrogenotrophica DSM 10507 RepID=C0CQI8_9FIRM Length = 393 Score = 63.2 bits (152), Expect = 2e-08, Method: Composition-based stats. Identities = 27/136 (19%), Positives = 52/136 (38%), Gaps = 11/136 (8%) Query: 321 PFIVCVDTSGSMGGFNEQCAKAFCLALMRIALAENRRCYIMLFSTEIVRYEL--SGPQGI 378 ++ +D SGSM G E C +A AL+ ++R + S + EL +G Sbjct: 110 DMVLLLDGSGSMQGKKEPCVQA-TEALLEQMDEQSRAQAVAFASCVLGNTELLPLDEEGR 168 Query: 379 EQAIRFLSQQ-FRGGTDLASCFRAIMERLQSREW--FDADAVVISDFIAQRLPDDVTSKV 435 E I+F+ GGT+ + L+ ++ +++SD + Sbjct: 169 ETLIKFVEGTDIIGGTEFGQPLTFALNSLEEKKETGRIQAVILLSDGEGP-----FPETL 223 Query: 436 KELQRVHQHRFHAVAM 451 +E + + + M Sbjct: 224 EEEYKEKDVVLYTIRM 239 >UniRef50_A3MT69 VWA containing CoxE family protein n=4 Tax=Pyrobaculum RepID=A3MT69_PYRCJ Length = 415 Score = 63.2 bits (152), Expect = 2e-08, Method: Composition-based stats. Identities = 59/318 (18%), Positives = 91/318 (28%), Gaps = 35/318 (11%) Query: 163 MTLSGQLEPILADNNTAAGRLWDMSAGQLKRGDYQLIVKYGEFLNEQPE--------LKR 214 + L E L GR W G+ + + ++ L P K+ Sbjct: 89 VRLLRAFESYLLSLERR-GRAW---FGRGSQEAWVEAMRQLRRLFGDPADVSELHRVFKK 144 Query: 215 LAEQLGRSREAKSIPRNDAQMETFRTMVREPATVPEQVDGLQQSDDILRLLPPELATLGI 274 L E LGR R + R D + + E Sbjct: 145 LGEVLGRGRRGDPASLALSTASDPRRARLASLLAKAVDLSALLGDPLGDVGRVEGPGAEF 204 Query: 275 TELEYEFYRRLVEKQLLTYR---LHGESWREKVIERPVVHKDYDEQPRGP--FIVCVDTS 329 F R K+ + Y G + RG + VD S Sbjct: 205 EVAHGSFARV---KRAVAYARALFVGALPVFLHKAASSTLPVRRPRARGDSGVFLLVDKS 261 Query: 330 GSMG---GFNEQCAKAFCLALMRIALAENRRCYIMLFSTEIVRYELSGPQGIEQAIRFLS 386 GSM G E+ A A AL L +R + F E+ R E + + LS Sbjct: 262 GSMYSAVGGVEKIALATAFALA--VLKRYKRARLRFFDVEVHRVE-----ELGDLVDVLS 314 Query: 387 QQ-FRGGTDLASCFRAIMERLQSREWFDADAVVISDFIAQRLPDDVTSKVKELQRVHQHR 445 + GGTD++ A E VV++D + + + R Sbjct: 315 RAWAGGGTDISRAVEAAAEEAARERLRGYSLVVVTDGEDDAFSPAAAREARAVFRE---- 370 Query: 446 FHAVAMSAHGKPGIMRIF 463 V + G+ ++F Sbjct: 371 VLFVVVGERRLSGVRQVF 388 >UniRef50_Q6A0B1 MKIAA0177 protein (Fragment) n=4 Tax=Murinae RepID=Q6A0B1_MOUSE Length = 1269 Score = 62.4 bits (150), Expect = 3e-08, Method: Composition-based stats. Identities = 53/277 (19%), Positives = 94/277 (33%), Gaps = 24/277 (8%) Query: 194 GDYQLIVKYGEFLNEQPE---LKRL-AEQLGRSREAKSIPRNDAQMETFRTMVREPATVP 249 +Q E L + E +K + AEQ + +P + + +R+ +T Sbjct: 761 APWQQDKALNENLQDTVETIRIKEIGAEQSFSLAMSIEMPYMIEFISSDTHELRQKSTDC 820 Query: 250 EQVDGLQQSDDILRLLPPELATLGITELEYEFYRRLVEKQLLTYRLHGESWREKVIERPV 309 + V + + L L + + EK+ V + + Sbjct: 821 KAVVSTVEGSSLDSGGFSLHIGLRDAYLPRMWVEKHPEKE--------SEACMLVFQPEL 872 Query: 310 VHKDYDEQPRGPFIVCVDTSGSMGGFNEQCAKAFCLALMRIALAENRRCYIMLFSTEIVR 369 D + + I+C+D S SM G AK L + + L E ++ IM F T Sbjct: 873 ADVLPDLRGKNEVIICLDCSSSMEGVTFTQAKQVALYALSL-LGEEQKVNIMQFGTGYKE 931 Query: 370 YELSGPQGIEQ---AIRFL--SQQFRGGTDLASCFRAIMERLQSREWFDADAVVISDFIA 424 S P+ I A F+ + G TD R + S + + ++ISD Sbjct: 932 L-FSYPKCITDSKMATEFIMSAAPSMGNTDFWKVLRYLSLLYPSEGFRN--ILLISDGHL 988 Query: 425 QRLPDDVTSKVKELQRVHQHRFHAVAMSAHGKPGIMR 461 Q + + +Q R A+ + I+R Sbjct: 989 QSESLTLQLVKRNIQH---TRVFTCAVGSTANRHILR 1022 >UniRef50_Q2B7C3 Putative uncharacterized protein n=1 Tax=Bacillus sp. NRRL B-14911 RepID=Q2B7C3_9BACI Length = 920 Score = 62.4 bits (150), Expect = 4e-08, Method: Composition-based stats. Identities = 33/169 (19%), Positives = 62/169 (36%), Gaps = 11/169 (6%) Query: 298 ESWREKVIERPVVHKDYDEQPR-GPFIVCVDTSGSMGGFNEQCAKAFCLALMRIALAENR 356 ++ EK++ + K E P G IV D SGSM G + AK + + L E Sbjct: 381 KTPIEKLLPVNMDIKGKKEMPSLGLMIVM-DRSGSMAGSKLELAKEAAARSVEL-LREKD 438 Query: 357 RCYIMLFSTE----IVRYELSGPQGIEQAIRFLSQQFRGGTDLASCFRAIMERLQSREWF 412 + F + L + I S GGT++ + E L++ + Sbjct: 439 TLGFIAFDDRPWVIVETGPLEDKKDAVDKIG--SVTPGGGTEIFTSLEKAYEELENLKLQ 496 Query: 413 DADAVVISDFIAQRLPDDVTSKVKELQRVHQHRFHAVAMSAHGKPGIMR 461 ++++D + R D + E + + VA+ + ++ Sbjct: 497 RKHIILLTDGQSARSTDY--ESMIETGKENNITLSTVALGSDADRNLLE 543 >UniRef50_C7N1C6 Uncharacterized protein n=1 Tax=Slackia heliotrinireducens DSM 20476 RepID=C7N1C6_SLAHD Length = 744 Score = 62.4 bits (150), Expect = 4e-08, Method: Composition-based stats. Identities = 24/133 (18%), Positives = 47/133 (35%), Gaps = 7/133 (5%) Query: 322 FIVCVDTSGSMGGFNEQCAKAFCLALMRIALAENRRCYIMLFSTEIVRYELSGPQGIEQA 381 ++ +DTSGSM G K + ++ + + + E A Sbjct: 382 VVLALDTSGSMDGEPLNETKTATREFASTIFKSDADVCLVSYDSSAR--NVIDSTDNEYA 439 Query: 382 IRFL--SQQFRGGTDLASCFRAIMERLQSREWFDADAVVISDFIAQR--LPDDVTSKVKE 437 ++ GGT++ R ERL+ V++SD A + DD+ + E Sbjct: 440 LKAAVRDLSAGGGTNIEDALRVSYERLEGSGSDKRIIVLMSDGEANEGLVGDDLIAYANE 499 Query: 438 LQRVHQHRFHAVA 450 ++ + + Sbjct: 500 IKDD-GVTIYTLG 511 >UniRef50_B2HDT6 Putative uncharacterized protein n=3 Tax=Mycobacterium RepID=B2HDT6_MYCMM Length = 772 Score = 61.6 bits (148), Expect = 6e-08, Method: Composition-based stats. Identities = 35/151 (23%), Positives = 58/151 (38%), Gaps = 15/151 (9%) Query: 321 PFIVCVDTSGSMGGFNEQCAKAFCLALMRIALAENRRCYIMLFSTEIVRYELSGPQGIEQ 380 +V +D SGSMGG+ A+ ++ + A +R C ++ F I + P G+ Sbjct: 309 DVVVVLDRSGSMGGWKMVAARRAAGRIVDMLDAGDRFC-VLAFDDRIETPP-AMPDGLVP 366 Query: 381 AIR---FLSQQ------FRGGTDLASCFRAIMERL-QSREWFDADAVVISDFIAQRLPDD 430 A F + RGGT +A +E L S E A V+++D Sbjct: 367 ASDRNRFAASSWLGSLRSRGGTVMAQPLTNAVEMLADSGEDRQASVVLVTDGQISGEDHL 426 Query: 431 VTSKVKELQRVHQHRFHAVAMSAHGKPGIMR 461 + S + R R + V + G + Sbjct: 427 LRSLAPVVGR---TRIYCVGVDRAVNAGFLE 454 >UniRef50_UPI000180BC4A PREDICTED: similar to predicted protein n=1 Tax=Ciona intestinalis RepID=UPI000180BC4A Length = 1038 Score = 61.6 bits (148), Expect = 6e-08, Method: Composition-based stats. Identities = 32/180 (17%), Positives = 61/180 (33%), Gaps = 33/180 (18%) Query: 307 RPVVHKDYDEQPRGPFIVCVDTSGSMGGFNEQCAKAFCLALMRIALAENRRCYIMLFSTE 366 RP + +P+ ++ +D SGSMG N AK +++ ++R +M FS+ Sbjct: 179 RPWYVQANVPKPK-QIVIVIDKSGSMGVTNMNLAKEAAKSVVNTLNPQDR-FAVMAFSSI 236 Query: 367 IVRYELS--------------GPQGIEQAIRFL-SQQFRGGTDLASCFRAIMERLQSR-- 409 V ++ + PQ ++ F+ + GGT+ A + Q Sbjct: 237 FVPFQSTVASDQCFATTFADASPQNKKKVEDFVDTISSGGGTNYAPALQKAFSFFQQEPS 296 Query: 410 ----EWFDAD-------AVVISDFIAQRLPDDVTSKVKELQRVHQHRFHAVAMSAHGKPG 458 D + +SD I + S + +V + +G Sbjct: 297 VSDFNIKKIDPSEIDRVILFMSDGIPNDPGSTILSAQIRANEQLNN---SVIILTYGLGN 353 >UniRef50_B5W7H4 von Willebrand factor type A n=1 Tax=Arthrospira maxima CS-328 RepID=B5W7H4_SPIMA Length = 488 Score = 61.6 bits (148), Expect = 6e-08, Method: Composition-based stats. Identities = 24/155 (15%), Positives = 52/155 (33%), Gaps = 4/155 (2%) Query: 308 PVVHKDYDEQPRGPFIVCVDTSGSMGGFNEQCAKAFCLALMRIALAENRRCYIMLFSTEI 367 + +P+ ++ +DTSGSM G + + + + ++ FS+ Sbjct: 40 TRQPPGFLTKPK-AVVLLIDTSGSMSGQKLREVQTAASEFVSRQNLKRHDLAVVEFSSRA 98 Query: 368 VRYELSGPQGIEQAIRFLSQQFRGGTDLASCFRAIMERLQSREWFDADAVVISDFIAQRL 427 E RGGT+L+ F LQ+ + + ++ +D + Sbjct: 99 SVVADFTRNETELQQAIARLSARGGTNLSEGFNLATSVLQNSD-RTPNILLFTDGVPNNP 157 Query: 428 PDDVTSKVKELQRVHQHRFHAVAMSAHGKPGIMRI 462 P + + + + R AV + + Sbjct: 158 P--MAASIAQQIRASGINLVAVGTGDAQINYLTAL 190 >UniRef50_UPI0001788F71 von Willebrand factor type A n=1 Tax=Geobacillus sp. Y412MC10 RepID=UPI0001788F71 Length = 1007 Score = 61.2 bits (147), Expect = 8e-08, Method: Composition-based stats. Identities = 25/168 (14%), Positives = 67/168 (39%), Gaps = 9/168 (5%) Query: 298 ESWREKVIERPVVHKDYDEQPRGPFIVCVDTSGSMGGFNEQCAKAFCLALMRIALAENRR 357 ++ EK + + + E P I+ +D SGSM G + AK + + + A++ Sbjct: 385 KTPIEKALPVSMELEGKREIPSLGLILVIDRSGSMDGNKIELAKESAMRTVELMRAKD-T 443 Query: 358 CYIMLFSTE----IVRYELSGPQGIEQAIRFLSQQFRGGTDLASCFRAIMERLQSREWFD 413 ++ F + + +L + + +I+ S GGT++ + +E + + Sbjct: 444 VGVVAFDDQPWWVVPPQKLGDKEEVLSSIQ--SIPSAGGTNIYPAVSSALEEMLKIDAQR 501 Query: 414 ADAVVISDFIAQRLPDDVTSKVKELQRVHQHRFHAVAMSAHGKPGIMR 461 ++++D + + + ++ +VA+ +++ Sbjct: 502 RHIILMTDGQSAMNSGY--QDLTDTMVENKITMSSVAVGMDADTNLLQ 547 >UniRef50_Q0AMP5 Vault protein inter-alpha-trypsin domain protein n=1 Tax=Maricaulis maris MCS10 RepID=Q0AMP5_MARMM Length = 740 Score = 61.2 bits (147), Expect = 8e-08, Method: Composition-based stats. Identities = 31/177 (17%), Positives = 61/177 (34%), Gaps = 10/177 (5%) Query: 290 LLTYRLHGESWREKVIERPVVHKDYDEQPRGPFIVCVDTSGSMGGFNEQCAKAFCLALMR 349 L GE++ I P + I +D SGSMGG + + A+A + ++ Sbjct: 314 LFIEEWQGETYLLAQILPPAELGADTPRRARETIFVIDNSGSMGGASMRQARAALITALQ 373 Query: 350 IALAENRRCYIMLFST---EIVRYEL-SGPQGIEQAIRFL-SQQFRGGTDLASCFRAIME 404 +R ++ F ++ + + P + A+ F + +GGT + A + Sbjct: 374 RLEPGDR-FNVIRFDNTMEQVFPQAVDASPDNVATALTFARRLEAQGGTVMLPALNAALR 432 Query: 405 RLQSREWFDA-DAVVISDFIAQRLPDDVTSKVKELQRVHQHRFHAVAMSAHGKPGIM 460 + V ++D + + L R R V + + M Sbjct: 433 DTSPDDDSRVRQIVFLTDGAIGNEAELFAAIEAGLGR---SRLFPVGIGSAPNGYFM 486 >UniRef50_A2E6Y7 von Willebrand factor type A domain containing protein n=4 Tax=Trichomonas vaginalis RepID=A2E6Y7_TRIVA Length = 720 Score = 61.2 bits (147), Expect = 8e-08, Method: Composition-based stats. Identities = 30/156 (19%), Positives = 60/156 (38%), Gaps = 10/156 (6%) Query: 311 HKDYDEQPRGPFIVCVDTSGSMGGFNEQCAKAFCLALMRIALAENRRCYIMLFSTEIVR- 369 + + + F +D SGSM G + AK FCL ++ +L R I+ F Sbjct: 232 QFEGKVEQKSEFYFIIDCSGSMSGSRIENAK-FCLNILIHSLPIGCRFSIIQFGNSYKEV 290 Query: 370 ---YELSGPQGIEQAIRFLSQQFR-GGTDLASCFRAIMERLQSREWFDADAVVISDFIAQ 425 + S GGTD+ S + ++ + + +++D Sbjct: 291 VSICDYSNKNVKYAMSAIARINADMGGTDILSPLEYVFKKKLGKGFIRK-IFLLTDGEVH 349 Query: 426 RLPDDVTSKVKELQRVHQHRFHAVAMSAHGKPGIMR 461 D + S+V+ + +R A+ + + PG+++ Sbjct: 350 N-SDMICSRVQ--KERENNRIFAIGLGSGADPGLIK 382 >UniRef50_A1ZFT4 Von Willebrand factor, type A n=1 Tax=Microscilla marina ATCC 23134 RepID=A1ZFT4_9SPHI Length = 827 Score = 60.9 bits (146), Expect = 1e-07, Method: Composition-based stats. Identities = 35/193 (18%), Positives = 66/193 (34%), Gaps = 22/193 (11%) Query: 286 VEKQLLTYRLHGESWREKVIERPVVHKDY---DEQPRGP---------FIVCVDTSGSMG 333 ++ LL Y E K + K + + P+ P ++ VD SGSM Sbjct: 277 IQSGLLLYEGENEVASGKEEDNENAEKFFLMMMQPPKAPKNSQIPPREYVFIVDVSGSMH 336 Query: 334 GFNEQCAKAFCLALMRIALAENRRCYIMLFSTEIV-----RYELSGPQGIEQAIRFLSQQ 388 GF +K L+ L + +MLF + E + + Q+ Sbjct: 337 GFPLSVSKRLLKNLIGK-LRPKDKFNVMLFESSNQMMSPESMEATQANIQKAFGVIDQQR 395 Query: 389 FRGGTDLASCFRAIMERLQSREWFDADAVVISDFIAQRLPDDVTSKVKELQRVHQHRFHA 448 GGT L + + Q++++ + VV++D + L R + A Sbjct: 396 GGGGTRLLPALKKALAFKQTKDYSRS-FVVVTDGYVTVEKEAFDLIRNNLNRAN---LFA 451 Query: 449 VAMSAHGKPGIMR 461 + + ++ Sbjct: 452 FGIGSSVNRFLIE 464 >UniRef50_Q2SQR4 Uncharacterized protein containing a von Willebrand factor type A (VWA) domain n=1 Tax=Hahella chejuensis KCTC 2396 RepID=Q2SQR4_HAHCH Length = 733 Score = 60.9 bits (146), Expect = 1e-07, Method: Composition-based stats. Identities = 34/174 (19%), Positives = 64/174 (36%), Gaps = 12/174 (6%) Query: 294 RLHGESWREKVIERPVVHKDYDEQPRGPFIVCVDTSGSMGGFNEQCAKAFCLALMRIALA 353 + G+ +++ P D I VDTSGSM G + Q A+ L + Sbjct: 330 EIVGDDVYAQLLLMPPQFSDEGLSLPRELIWVVDTSGSMEGVSIQQARDAVLQALDTLTP 389 Query: 354 ENRRCYIMLF-STEIVRYELSGP---QGIEQAIRFLS-QQFRGGTDLASCFRAIMERLQS 408 +R ++ F S + + P + ++QA RF+ + GGT++A + Sbjct: 390 RDR-FNVIEFNSHARKLFPQAVPAQERALQQARRFVRGLKADGGTEIAEALDRALSDAAP 448 Query: 409 REWFDADAVVISDFIAQRLPDDVTSKVKELQRVHQ-HRFHAVAMSAHGKPGIMR 461 + V ++D + + K++ + R V + MR Sbjct: 449 EGYVR-QVVFLTDGSVG----NELALFKQIDQQLGDSRLFTVGIGPSPNRFFMR 497 >UniRef50_A0EHG0 Chromosome undetermined scaffold_97, whole genome shotgun sequence n=2 Tax=Paramecium tetraurelia RepID=A0EHG0_PARTE Length = 533 Score = 60.9 bits (146), Expect = 1e-07, Method: Composition-based stats. Identities = 26/159 (16%), Positives = 62/159 (38%), Gaps = 8/159 (5%) Query: 309 VVHKDYDEQPRG-PFIVCVDTSGSMGGFNEQCAKAFCLALMRIALAENRRCYIMLFSTEI 367 + + + +G + +D SGSM G + + ++ +R C +++F ++ Sbjct: 110 INQRGQECVRQGVDLVCLIDHSGSMQGEKIKLVRKTLKQMLTFLQPCDRLC-LIMFDCKV 168 Query: 368 VR----YELSGPQGIEQAIRFLSQQFRGGTDLASCFRAIMERLQSREWFD--ADAVVISD 421 R ++ + + S Q RGGTD+ + + + L+ R++ + + ++SD Sbjct: 169 YRLTRLMRVTQENVQKFRVAISSLQARGGTDIGNGMKMALSILKHRKYKNPVSAIFLLSD 228 Query: 422 FIAQRLPDDVTSKVKELQRVHQHRFHAVAMSAHGKPGIM 460 + + + V + + P IM Sbjct: 229 GVDEGAEERVRDDLIQYNIRDSFTIKTFGFGRDCCPKIM 267 >UniRef50_B4X134 Vault protein inter-alpha-trypsin n=1 Tax=Alcanivorax sp. DG881 RepID=B4X134_9GAMM Length = 657 Score = 60.5 bits (145), Expect = 1e-07, Method: Composition-based stats. Identities = 36/174 (20%), Positives = 63/174 (36%), Gaps = 14/174 (8%) Query: 294 RLHGESWREKVIERPVVHKDYDEQPRGPFIVCVDTSGSMGGFNEQCAKAFCLALMRIALA 353 + GE + ++ P + FI+ D+SGSMGG + AKA L L L Sbjct: 275 EIDGEHYALLMVVPPKTGQVTALPRETLFII--DSSGSMGGAPMRQAKA-SLHLALQRLK 331 Query: 354 ENRRCYIMLFSTE---IVRYELSGPQGI-EQAIRFL-SQQFRGGTDLASCFRAIMERLQS 408 R I F ++ + ++ +QA F+ Q GGT + A + + S Sbjct: 332 PGDRFNITDFDSQHTLLFETPVTVSDNSRQQAQDFVDGLQASGGTHMLPALSATLSQPAS 391 Query: 409 REWFDADAVVISDFIAQRLPDDVTSKVKELQRVHQ-HRFHAVAMSAHGKPGIMR 461 + + I+D + + + L + R V + + M Sbjct: 392 DGYLR-QVIFITDGAVG----NESGIFRALHQQLGEARLFTVGIGSAPNSHFMT 440 >UniRef50_A6DLI0 Putative uncharacterized protein n=1 Tax=Lentisphaera araneosa HTCC2155 RepID=A6DLI0_9BACT Length = 833 Score = 60.5 bits (145), Expect = 2e-07, Method: Composition-based stats. Identities = 26/166 (15%), Positives = 56/166 (33%), Gaps = 6/166 (3%) Query: 298 ESWREKVIERPVVHKDYDEQPRGPFIVCVDTSGSMGGFNEQCAKAFCLALMRIALAENRR 357 ++ E+V+ ++ EQP ++ +D SGSM G A+ A + + + + Sbjct: 390 KTPIEEVLPVTSRYEKEKEQPSLALVLVIDKSGSMNGQPIVLAREASKAAAELLSSRD-Q 448 Query: 358 CYIMLFSTEIV-RYELSGPQGIEQAI-RFLSQQFRGGTDLASCFRAIMERLQSREWFDAD 415 ++ F +L+ + + + GGT+L + L Sbjct: 449 VGVIAFDGSAKLVTDLTSAANKGEVLSQIDGIGAGGGTNLYPAMVMGRDMLGIASAKIKH 508 Query: 416 AVVISDFIAQRLPDDVTSKVKELQRVHQHRFHAVAMSAHGKPGIMR 461 +V+SD +Q + V++ +M Sbjct: 509 MIVLSDGQSQGGD---FEGISSELAQMGVTISTVSLGQGAAVDLMA 551 >UniRef50_A1RSU5 von Willebrand factor, type A n=5 Tax=Thermoproteaceae RepID=A1RSU5_PYRIL Length = 358 Score = 60.1 bits (144), Expect = 2e-07, Method: Composition-based stats. Identities = 39/131 (29%), Positives = 56/131 (42%), Gaps = 16/131 (12%) Query: 321 PFIVCVDTSGSMG---GF--NEQCAK-AFCLALMRIALAENRRCYIMLFSTEIVRYELSG 374 P + +D SGSM G + AK A L ++A ++LF+T+ Sbjct: 200 PIYIALDVSGSMKEYIGALTKLKVAKNAIARYLHQMAHLRG-LVSLVLFNTDADFMWTPH 258 Query: 375 PQG--IEQAIRFLSQQFR-GGTDLASCFRAIMERLQSREWFDADAVVISDFIAQRLPDDV 431 P + I L + GGT+LAS E LQS E D V+I+D PD V Sbjct: 259 PVNIYLRDMIEILKYIYAMGGTELASAL----ELLQSHEISR-DIVIITDGRTHD-PDKV 312 Query: 432 TSKVKELQRVH 442 + K +R+H Sbjct: 313 LNLAKRFKRLH 323 >UniRef50_A7RKA1 Predicted protein n=2 Tax=Nematostella vectensis RepID=A7RKA1_NEMVE Length = 1128 Score = 59.7 bits (143), Expect = 2e-07, Method: Composition-based stats. Identities = 46/323 (14%), Positives = 105/323 (32%), Gaps = 55/323 (17%) Query: 146 QLLEEEREQLLSEVQERMTLSGQLEPILADNNTAAGRLWDMSAGQLKRGDYQLIVKYGEF 205 +L + + +++Q ++T++ +++ + + + + S + + D + + KY Sbjct: 68 SILNDLATRFANKLQTKVTIARKIKDAVEVSYAKSATV--TSRTECCKADTRWL-KYDSR 124 Query: 206 LNEQPELKRLAEQLGRSREAKSIPRNDAQMETFRTMVREPATVPEQVDGLQQSDDILRLL 265 + L + + + + D ++T + + T+ Q G ++ L Sbjct: 125 FRTKVNLDEMCVIISGAASSNPKQLQDNVLQTMKQNIENNPTLTWQYFGSEEG------L 178 Query: 266 PPELATLGITELEYEFYRRLVEKQLLTYRLHGESWREKVIERPVVHKDYDEQPRGPFIVC 325 + + + R RP + QP+ I+ Sbjct: 179 YTNYPMIRDSSSCSSYDPRY---------------------RPWYVEAASPQPK-DVILV 216 Query: 326 VDTSGSMGGFNEQCAKAFCLALMRIALAENRRCYIMLFSTEIVRYELSGPQGIEQ----- 380 VD SGSMGG AK ++ +R + F + + R +++ ++ Sbjct: 217 VDYSGSMGGSRLPIAKEAAKTVLDTLNPRDR-VAFLAFESGVRRVKVTSGDAKDEKCFES 275 Query: 381 --------AIRFLSQQ-----FRGGTDLASCFRAIMERLQSREWFDAD-----AVVISDF 422 I L + GGT A F A + L + ++D Sbjct: 276 SLAKASPVNIDILKKFLDGEYASGGTMYAIAFNAAFDILDKYYKEKNTTRRPVILFMTDG 335 Query: 423 IAQRLPDDVTSKVKELQRVHQHR 445 P + + VK + + Sbjct: 336 APNDDPGTILNTVKTRNQGLSTK 358 >UniRef50_A1SAA4 Uncharacterized protein containing a von Willebrand factor type A (VWA) domain n=1 Tax=Shewanella amazonensis SB2B RepID=A1SAA4_SHEAM Length = 713 Score = 59.7 bits (143), Expect = 3e-07, Method: Composition-based stats. Identities = 38/193 (19%), Positives = 65/193 (33%), Gaps = 16/193 (8%) Query: 279 YEFYRRLVEK-----QLLTYRLHGESWREKVIERPVVHKDYDEQPRG-PFIVCVDTSGSM 332 FY RL E ++TYR S + V D +G ++ +D SGSM Sbjct: 284 IVFYWRLQEGLPGRVDMVTYRDPKVSTKGTVKLTFTPGDDLGPVTQGRDWVFVLDKSGSM 343 Query: 333 GGFNEQCAKAFCLALMRIALAENRRCYIMLFSTEIVRYELS----GPQGIEQAIRFLS-Q 387 G + L ++ + I+LF + I QA+ ++ Sbjct: 344 NGKYATLVEGVRQGLGKLPAQDR--FRIILFDESTQEFSKGFVPVDSNNINQALAWVEGI 401 Query: 388 QFRGGTDLASCFRAIMERLQSREWFDADAVVISDFIAQRLPDDVTSKVKELQRVHQHRFH 447 GTDL + + L + V+I+D +A + + EL + R Sbjct: 402 SPGNGTDLYQGLKRALTPLDAD--RSTGVVLITDGVAN-VGVTEKRRFLELMQQQDVRLF 458 Query: 448 AVAMSAHGKPGIM 460 M ++ Sbjct: 459 TFIMGNSANTPLL 471 >UniRef50_Q60ED8 Von Willebrand factor type A domain containing protein n=6 Tax=Poaceae RepID=Q60ED8_ORYSJ Length = 801 Score = 59.3 bits (142), Expect = 3e-07, Method: Composition-based stats. Identities = 30/161 (18%), Positives = 55/161 (34%), Gaps = 23/161 (14%) Query: 326 VDTSGSMGGFNEQCAKAFCLALMRIALAENRRCYIMLFSTEIV----RYELSGPQGIEQA 381 +DTSGSM G + K + L + I+ F+ E+ E + IE A Sbjct: 338 IDTSGSMQGKPLESVKNAMYTTLSE-LVQGDYFNIITFNDELHSFSSCLEQVNEKTIENA 396 Query: 382 IRFLSQQF--RGGTDLASCFRAIMERLQSREWFDADAVVISDFIAQRLPDDVTSKVKELQ 439 +++ F GGTD+ + L + +++D + ++ VKE Sbjct: 397 REWVNTNFIAEGGTDIMHPLSEAIALLSNSHNALPQIFLVTDGSVED-ERNICRTVKEQL 455 Query: 440 RVHQHR---------------FHAVAMSAHGKPGIMRIFDH 465 + + +++ GK FD Sbjct: 456 ATRGSKSPRISTFGLGSYCNHYFLRMLASIGKGHYDAAFDT 496 >UniRef50_UPI00016C377F protein containing a von Willebrand factor type A domain n=1 Tax=Gemmata obscuriglobus UQM 2246 RepID=UPI00016C377F Length = 821 Score = 59.3 bits (142), Expect = 3e-07, Method: Composition-based stats. Identities = 25/180 (13%), Positives = 55/180 (30%), Gaps = 12/180 (6%) Query: 291 LTYRLHGESWREKVIERPVVHKDYDEQPRGPFIVCVDTSGSMGGFNEQCAKAFCLALMRI 350 L Y+ + + ++ ++ +DTS SM Q AK + Sbjct: 242 LVYKPIQTEDGYFMFLISPQVEAEKKRVARDLVLVLDTSSSMSDIKMQQAKKAVKFCLSQ 301 Query: 351 ALAENRRCYIMLFSTEIVRYELSGPQGIEQAIRFLSQ-----QFRGGTDLASCFRAIMER 405 E+R ++ FST + ++ + ++ + GGT + + Sbjct: 302 LQPEDR-FGVVRFSTTVTKFRSELVAANTDYLDLATKWIDGLKTSGGTAIWPALNDALAM 360 Query: 406 LQSREWFDADAVVISDF---IAQRLPDDVTSKVKELQRVHQH-RFHAVAMSAHGKPGIMR 461 S V +D + + D + V L + + R + ++ Sbjct: 361 RSSDPSRPFTMVFFTDGQPTVDETNADKIVKNV--LAKNTGNTRIFTFGVGDDVNAAMLD 418 >UniRef50_B0UK93 LPXTG-motif cell wall anchor domain protein n=1 Tax=Methylobacterium sp. 4-46 RepID=B0UK93_METS4 Length = 761 Score = 59.3 bits (142), Expect = 3e-07, Method: Composition-based stats. Identities = 35/181 (19%), Positives = 61/181 (33%), Gaps = 23/181 (12%) Query: 294 RLHGESWREKVIERPVVHKDYDEQPRGPFIVCVDTSGSMGGFNEQCAKAFCLALMRIALA 353 GE + P + +PR +D SGSM G + + AKA L + Sbjct: 348 ERVGEDEYLLAVVTPPEGRAPARRPR-EVTFVIDNSGSMAGASMRQAKASLLVALDRLGP 406 Query: 354 ENRRCYIMLFSTEIVRYELSGPQGI-------EQAIRF-LSQQFRGGTDLASCFRAIMER 405 +R ++ F +L P + + A RF + + RGGT++ RA + Sbjct: 407 ADR-FNVIRFDD---TMDLLFPAPVPADEAHRDAARRFVAALEARGGTEMLPPLRAALAD 462 Query: 406 LQSREWFDAD----AVVISDFIAQRLPDDVTSKVKELQRVHQHRFHAVAMSAHGKPGIMR 461 E D V ++D ++ R R + + + +M Sbjct: 463 PHPEE---GDRVRQIVFLTDGAIGNEEQIFSAISAGRGR---SRLFMIGIGSAPNGHLMT 516 Query: 462 I 462 Sbjct: 517 H 517 >UniRef50_Q9UKK3 Poly [ADP-ribose] polymerase 4 n=14 Tax=Eutheria RepID=PARP4_HUMAN Length = 1724 Score = 59.3 bits (142), Expect = 3e-07, Method: Composition-based stats. Identities = 34/146 (23%), Positives = 60/146 (41%), Gaps = 12/146 (8%) Query: 322 FIVCVDTSGSMGGFNEQCAKAFCLALMRIALAENRRCYIMLFSTEIVRYELSGPQGIEQ- 380 I+C+D S SM G AK L + + + E ++ I+ F T S P+ I Sbjct: 877 VIICLDCSSSMEGVTFLQAKQIALHALSL-VGEKQKVNIIQFGTGYKEL-FSYPKHITSN 934 Query: 381 --AIRFL--SQQFRGGTDLASCFRAIMERLQSREWFDADAVVISDFIAQRLPDDVTSKVK 436 A F+ + G TD R + +R + +++SD Q + +T ++ Sbjct: 935 TAAAEFIMSATPTMGNTDFWKTLRYLSLLYPARGSRN--ILLVSDGHLQ--DESLTLQLV 990 Query: 437 ELQRVHQHRFHAVAMSAHGKPGIMRI 462 + R H R A + + ++RI Sbjct: 991 KRSRPH-TRLFACGIGSTANRHVLRI 1015 >UniRef50_UPI00006CDDCC von Willebrand factor type A domain containing protein n=1 Tax=Tetrahymena thermophila RepID=UPI00006CDDCC Length = 2138 Score = 59.3 bits (142), Expect = 3e-07, Method: Composition-based stats. Identities = 29/153 (18%), Positives = 53/153 (34%), Gaps = 13/153 (8%) Query: 321 PFIVCVDTSGSMGGFNEQCAKAFCLALMRIALAENRRCYIMLFSTEIVR----YELSGPQ 376 I +DTSGSM G K L L+ + +R C ++ FST R + Sbjct: 1447 DLICVIDTSGSMNGQPLDLLKETLLFLVDLLQTGDRIC-LIQFSTNAQRLTPLLSIESKD 1505 Query: 377 GIEQAI-RFLSQQFRGGTDLASCFRAIMERLQSREWFD--ADAVVISDFIAQRLPDDVTS 433 I+ +GGT++ + + L+ R + + ++SD + + + Sbjct: 1506 NIKSIKNEINRLVAKGGTNICQGMQLAFDVLKQRRYKNPITSVFLLSDGLNDGAENKIRD 1565 Query: 434 KVKELQ-----RVHQHRFHAVAMSAHGKPGIMR 461 +K+L P +M Sbjct: 1566 LLKQLNFYQNYNEENFTIQTFGFGKDHDPNLMD 1598 >UniRef50_Q10JU7 Von Willebrand factor type A domain containing protein, expressed n=17 Tax=Poaceae RepID=Q10JU7_ORYSJ Length = 680 Score = 59.3 bits (142), Expect = 3e-07, Method: Composition-based stats. Identities = 30/161 (18%), Positives = 55/161 (34%), Gaps = 23/161 (14%) Query: 326 VDTSGSMGGFNEQCAKAFCLALMRIALAENRRCYIMLFSTEIV----RYELSGPQGIEQA 381 +DTSGSM G + K + L + I+ F+ E+ E + IE A Sbjct: 338 IDTSGSMQGKPLESVKNAMYTTLSE-LVQGDYFNIITFNDELHSFSSCLEQVNEKTIENA 396 Query: 382 IRFLSQQF--RGGTDLASCFRAIMERLQSREWFDADAVVISDFIAQRLPDDVTSKVKELQ 439 +++ F GGTD+ + L + +++D + ++ VKE Sbjct: 397 REWVNTNFIAEGGTDIMHPLSEAIALLSNSHNALPQIFLVTDGSVED-ERNICRTVKEQL 455 Query: 440 RVHQHR---------------FHAVAMSAHGKPGIMRIFDH 465 + + +++ GK FD Sbjct: 456 ATRGSKSPRISTFGLGSYCNHYFLRMLASIGKGHYDAAFDT 496 >UniRef50_A6GBY0 Putative uncharacterized protein n=1 Tax=Plesiocystis pacifica SIR-1 RepID=A6GBY0_9DELT Length = 996 Score = 59.3 bits (142), Expect = 3e-07, Method: Composition-based stats. Identities = 38/203 (18%), Positives = 70/203 (34%), Gaps = 21/203 (10%) Query: 296 HGESWREKVIERPVVHKDYDEQPRGPFIVCVDTSGSMG-GFNEQCAKAFCLALMRIALAE 354 G S E+V+ + EQP I+ +D SGSM G K A R Sbjct: 504 WGGSTIEQVLPVRFSGERQREQPTLALILVIDKSGSMSSGDRLDLVKEAARATARTLDPS 563 Query: 355 NRRCYIMLFSTE----IVRYELSGPQGIEQAIRFLSQQFRGGTDLASCFRAIMERLQSRE 410 + ++ F + + I +IR GGT+ R +L + Sbjct: 564 D-EIGVIAFDNSPQVLVRLQPAANRLRISSSIR--RLSAGGGTNAMPALREAYLQLAGSK 620 Query: 411 WFDADAVVISDFIAQRLPDDVTSKVKELQRVHQHRFHAVAMSAH-GKPGIMRIFDH---I 466 +++SD + P++ + + R +V + GK ++R+ + Sbjct: 621 ALVKHVILLSDGES---PENGINALLGDMRQSDITVSSVGVGDGAGKDFLIRVAERGRGR 677 Query: 467 WRFD------TGMRSRLLRRWRR 483 + + + SR R +R Sbjct: 678 YFYSEDGTDVPRIFSREAREVKR 700 >UniRef50_D2VKS7 von Willebrand factor type A domain-containing protein n=2 Tax=Naegleria gruberi RepID=D2VKS7_NAEGR Length = 923 Score = 58.9 bits (141), Expect = 4e-07, Method: Composition-based stats. Identities = 28/162 (17%), Positives = 66/162 (40%), Gaps = 16/162 (9%) Query: 293 YRLHGESWREKVIERPVVHKDYDEQPRG-PFIVCVDTSGSMGGFNEQCAKAFCLALMRIA 351 + + + ++ P + ++ +G ++ VD SGSM G K+ ++ Sbjct: 663 FETECDLYCMATLQGPCFEQQAQKERKGVDLVLVVDKSGSMAGQKLDMVKSTLSFMVDQ- 721 Query: 352 LAENRRCYIMLFSTEIVR------YELSGPQGIEQAIRFLSQQFRGGTDLASCFRAIMER 405 L E R I+ F T++ ++ G + +Q +S T+L+ ++ Sbjct: 722 LKEKDRVAIVEFDTQVKTNLDLTKMDIEGKKKAKQVSSAISPGSC--TNLSGALFTSLKL 779 Query: 406 LQSREWFDAD---AVVISDFIAQR---LPDDVTSKVKELQRV 441 L SR+ + ++ +D +A R +++ +++L Sbjct: 780 LASRQQEKNEVTSVILFTDGLANRGLISTNEILQNMQDLMDE 821 >UniRef50_A4XHD9 von Willebrand factor, type A n=2 Tax=Clostridia RepID=A4XHD9_CALS8 Length = 909 Score = 58.9 bits (141), Expect = 4e-07, Method: Composition-based stats. Identities = 25/172 (14%), Positives = 65/172 (37%), Gaps = 12/172 (6%) Query: 297 GESWREKVIERPVVHKDYDEQPRGPFIVCVDTSGSMGG------FNEQCAKAFCLALMRI 350 S EK++ + K+ +++ ++ +D SGSMGG + AK+ ++ Sbjct: 383 SNSVLEKMLPVKMQLKNKEKERNVAVVLVIDHSGSMGGSNLRNINKLEIAKSAAAKMIDH 442 Query: 351 ALAENRRCYIMLFSTEIV-RYELSGPQGIEQAIRFLS-QQFRGGTDLASCFRAIMERLQS 408 + + ++ F + + + I +S Q GGT + + L+ Sbjct: 443 LESSD-SVGVIAFDHNFYWASKFGKLKSKNEVIENISTIQVGGGTAIIPPLTEAVNLLKK 501 Query: 409 REWFDADAVVISDFIAQRLPDDVTSKVKELQRVHQHRFHAVAMSAHGKPGIM 460 + D V+++D + + + + + + + + + + I+ Sbjct: 502 SKAKDKVIVLLTDGYGEEGGYEYPASIA---KRNNIKITTIGVGSSINAPIL 550 >UniRef50_Q7MCW9 Uncharacterized protein n=2 Tax=Vibrio vulnificus RepID=Q7MCW9_VIBVY Length = 688 Score = 58.9 bits (141), Expect = 4e-07, Method: Composition-based stats. Identities = 29/171 (16%), Positives = 60/171 (35%), Gaps = 11/171 (6%) Query: 296 HGESWREKVIERPVVHKDYDEQPRGPFIVCVDTSGSMGGFNEQCAKAFCLALMRIALAEN 355 E K+ P +Q R ++ +D SGSM G + + L ++ + Sbjct: 286 QSERGTIKLTFTPGDDLSAIQQGR-DWVFVLDKSGSMSGKHATLTEGVKRGLGKLPSGDR 344 Query: 356 RRCYIMLFSTEIVRYE----LSGPQGIEQAIRFLSQ-QFRGGTDLASCFRAIMERLQSRE 410 I++F + + QAI ++Q GGT+L + L S Sbjct: 345 --FRILMFDNRVQEITNGFIAVNQNNVTQAIETINQIATGGGTNLYDALERAVSGLDSDR 402 Query: 411 WFDADAVVISDFIAQRLPDDVTSKVKELQRVHQHRFHAVAMSAHGKPGIMR 461 ++++D +A + + +L + + R + M ++ Sbjct: 403 TTG--IILVTDGVAN-VGVTEKKQFLKLMQRYDVRLYTFIMGNSANTPLLE 450 >UniRef50_Q54DU5 von Willebrand factor A domain-containing protein DDB_G0292028 n=1 Tax=Dictyostelium discoideum RepID=Y2028_DICDI Length = 932 Score = 58.9 bits (141), Expect = 4e-07, Method: Composition-based stats. Identities = 26/147 (17%), Positives = 56/147 (38%), Gaps = 11/147 (7%) Query: 314 YDEQPRGPFIVCVDTSGSMGGFNEQCAKAFCLALMRIALAENRRCYIMLFSTEIVRYELS 373 + + FI +D SGSM G + +K L + +L EN + I+ F + + + Sbjct: 335 DEVDQKSEFIFVLDCSGSMSGKPIEKSK-MALEICMRSLNENSKFNIVCFGSNFNKLFET 393 Query: 374 GPQGIEQAIRFLSQQFR------GGTDLASCFRAIMERLQSREWFDADAVVISDFIAQRL 427 ++ ++ S+ GGT+L I+ + E+ +++D Sbjct: 394 SKHYNDETLQKASEYINRIDANLGGTELLEPIVDILSKESDPEFPR-QVFILTDGEISNR 452 Query: 428 PDDVTSKVKELQRVHQHRFHAVAMSAH 454 D + V + + R + ++ Sbjct: 453 -DKLIDYVG--KEANTTRIFTYGIGSY 476 >UniRef50_UPI0000E47A28 PREDICTED: hypothetical protein n=3 Tax=Strongylocentrotus purpuratus RepID=UPI0000E47A28 Length = 1262 Score = 58.9 bits (141), Expect = 4e-07, Method: Composition-based stats. Identities = 38/192 (19%), Positives = 67/192 (34%), Gaps = 22/192 (11%) Query: 282 YRRLVEKQLLTYRLHGESWREKVIERPVVHKDYDEQPRG--------------PFIVCVD 327 + +L +LL Y ES + + R + G IV VD Sbjct: 969 HLQLQAHELLEYEQQVESVLRRYLRRLQWLLGASRRTFGTVVEKRVCICWCCLAVIVLVD 1028 Query: 328 TSGSMGGFNEQCAKAFCLALMRIALAENRRCYIMLFSTEIV--RYELSGPQGIE--QAIR 383 TSGSM E K + EN I+ FS +I R + P + A+R Sbjct: 1029 TSGSMVTHMEDLKKDLVALIWDQFKRENISFNIVRFSADIEPWRSHIVEPTDADCNDAVR 1088 Query: 384 FLSQQFRGGTDLASCFRAIMERLQSREWFDADAVVISDFIAQRLPDDVTSKVKELQRVHQ 443 ++S G +C + + DA +++D V ++ ++ V Sbjct: 1089 WVSSFVPAG---NTCTLEALSEAFREKEVDA-IYLLTDGKPDSSTSRVFREIAQVNTVRG 1144 Query: 444 HRFHAVAMSAHG 455 + H ++ + + Sbjct: 1145 VKVHTISFNCND 1156 >UniRef50_Q7UNM0 Putative uncharacterized protein n=1 Tax=Rhodopirellula baltica RepID=Q7UNM0_RHOBA Length = 900 Score = 58.9 bits (141), Expect = 4e-07, Method: Composition-based stats. Identities = 26/135 (19%), Positives = 53/135 (39%), Gaps = 6/135 (4%) Query: 297 GESWREKVIERPVVHKDYDEQPRGP---FIVCVDTSGSMGGFNEQCAKAFCLALMRIALA 353 G +R ++ E V +++++ P ++ +D SGSMGG + AK A + + L Sbjct: 437 GGYYRTQIEEILPVRSNFEKEREKPSLAMMLVIDKSGSMGGQKIELAKDAAQAAVEL-LG 495 Query: 354 ENRRCYIMLFSTEIVRY-ELSGPQGIEQAIRFLS-QQFRGGTDLASCFRAIMERLQSREW 411 ++ F + EL +S + GGT++ E L Sbjct: 496 PKDAIGVIAFDGDSYTVSELRSTSDRGAISDAISTIEASGGTNMYPAMADAYEALLGATA 555 Query: 412 FDADAVVISDFIAQR 426 ++++D ++ Sbjct: 556 KLKHVILMTDGVSSP 570 >UniRef50_Q8H924 Os10g0464900 protein n=3 Tax=Oryza sativa RepID=Q8H924_ORYSJ Length = 646 Score = 58.9 bits (141), Expect = 5e-07, Method: Composition-based stats. Identities = 23/108 (21%), Positives = 42/108 (38%), Gaps = 7/108 (6%) Query: 321 PFIVCVDTSGSMGGFNEQCAKAFCLALMRIALAENRRCYIMLFSTEIVRY----ELSGPQ 376 + +D SGSM G K ++ +R C ++ FS+ R ++ Sbjct: 175 DLVTVLDVSGSMVGNKLALLKQAMGFVIDNLGPGDRLC-VISFSSGASRLMRLSRMTDAG 233 Query: 377 GIEQAIRFLSQQFRGGTDLASCFRAIMERLQSREWFDA--DAVVISDF 422 S RGGT++ + R + L R + +A +++SD Sbjct: 234 KAHAKRAVGSLSARGGTNIGAALRKAAKVLDDRLYRNAVESVILLSDG 281 >UniRef50_A8U1E2 Putative uncharacterized protein n=1 Tax=alpha proteobacterium BAL199 RepID=A8U1E2_9PROT Length = 683 Score = 58.5 bits (140), Expect = 5e-07, Method: Composition-based stats. Identities = 30/175 (17%), Positives = 58/175 (33%), Gaps = 26/175 (14%) Query: 322 FIVCVDTSGSMGGFNEQCAKAFCLALMRIALAENRR--CYIMLFSTEIVRY---ELSGPQ 376 I +D SGSM G + AKA +L R ++ F+ + + + Sbjct: 330 VIFVIDVSGSMKGEPLRAAKA---SLTSGIEGLGRNDTFNVVAFNNKAAAFYDAPVRASG 386 Query: 377 GIE----QAIRFLSQQFRGGTDLASCFRAIMERLQSREWFDADAVVISDFIAQRLPDDVT 432 + I L GGT++A+ F ++ + V I+D + Sbjct: 387 KFHRAALKVIDGLK--AGGGTEMAAAFELALQ-MPGDPDRLQQVVFITDG----AVSNEA 439 Query: 433 SKVKELQRVHQH-RFHAVAMSAHGKPGIMRIF------DHIWRFDTGMRSRLLRR 480 + +++ R V + + M + + DT R++R Sbjct: 440 ALFNQIKGELGARRLFTVGIGSAPNTFFMEEAARFGRGTYTYIGDTSSAERVMRD 494 >UniRef50_B2UZB2 von Willebrand factor type A domain protein n=1 Tax=Clostridium botulinum E3 str. Alaska E43 RepID=B2UZB2_CLOBA Length = 984 Score = 58.5 bits (140), Expect = 5e-07, Method: Composition-based stats. Identities = 26/110 (23%), Positives = 45/110 (40%), Gaps = 5/110 (4%) Query: 318 PRGPFIVCVDTSGSMGGFNEQCAKAFCLALMRIALA-ENRRCYIMLFSTEIVRYELSGPQ 376 P+ ++ +DTSGSM + K + + N I+ +ST Y L+ Sbjct: 88 PKKEIVLVLDTSGSMKDSKIKKMKNAAMEFVNKIKKIPNLDIDIVTYSTSGYTY-LNNGN 146 Query: 377 GIEQAIRFL-SQQFRGGTDLASCFRAIMERLQSREWFDAD--AVVISDFI 423 E ++ + S + GGT+ R L + +AD V +SD + Sbjct: 147 TEEDLLKIINSIKADGGTNTGEGLRKANYILDLEKNKNADKSIVFMSDGM 196 >UniRef50_A4YGU7 von Willebrand factor, type A n=12 Tax=Sulfolobaceae RepID=A4YGU7_METS5 Length = 383 Score = 58.5 bits (140), Expect = 5e-07, Method: Composition-based stats. Identities = 24/141 (17%), Positives = 54/141 (38%), Gaps = 7/141 (4%) Query: 322 FIVCVDTSGSMGGFNEQCAKAFCLALMRIALAENRRCYIMLFSTEI-VRYELSGPQGIEQ 380 +IV +DTSGSM G + AK + L++ + + + + FS+ + + E P+ + Sbjct: 40 YIVLLDTSGSMDGLKIESAKKGAIELLKR-IPQGNKVSFVTFSSRVNIVREFVDPEDLT- 97 Query: 381 AIRFLSQQFRGGTDLASCFRAIMERLQSREWFDADAVVISDFIAQRLPDDVTSKVKELQR 440 S G T + L ++ + ++++D D K + Sbjct: 98 -AEISSLSAGGQTAFFTALLTAFN-LHNKHGIPSYVILLTDG--NPTDDTNVETYKRIAI 153 Query: 441 VHQHRFHAVAMSAHGKPGIMR 461 + + + + I++ Sbjct: 154 PNGVQTISFGLGDDYNETILK 174 >UniRef50_Q09DT2 Inter-alpha-trypsin inhibitor family heavy chain-related protein-hypothetical secreted or membrane-associated protein containing vWFA domain n=1 Tax=Stigmatella aurantiaca DW4/3-1 RepID=Q09DT2_STIAU Length = 843 Score = 58.5 bits (140), Expect = 6e-07, Method: Composition-based stats. Identities = 31/177 (17%), Positives = 65/177 (36%), Gaps = 14/177 (7%) Query: 290 LLTYRLHGESWREKVIERPVVHKDYDEQPRGPFIVCVDTSGSMGGFNEQCAKAFCLALMR 349 L+T+RL + + P + R + VDTSGSM G + A+ L L Sbjct: 217 LVTHRLGEKPGTFALTVVPDLLGLATGPKRQEVVFVVDTSGSMEGESLPQAQG-ALRLCL 275 Query: 350 IALAENRRCYIMLFSTEIVRYELSGPQGIEQAIR-----FLSQQFRGGTDLASCFRAIME 404 L E R I+ F T + ++ + + + GGT+L A ++ Sbjct: 276 RHLREGDRFNIIAFDTSFQSFAPQPAVFTQKTLEQADRWVAALRANGGTELLQPMLAAVQ 335 Query: 405 RLQSREWFDADAVVISDFIAQRLPDDVTSKVKELQRVHQHRFHAVAMSAHGKPGIMR 461 V+++D + + + ++ + R ++ + + +++ Sbjct: 336 AAPEGV-----VVLLTDGQVGNEAEILQAVLRARKT---ARIYSFGIGTNVSDALLK 384 >UniRef50_UPI0000E49DB4 PREDICTED: similar to poly (ADP-ribose) polymerase family, member 4 n=2 Tax=Strongylocentrotus purpuratus RepID=UPI0000E49DB4 Length = 1119 Score = 58.5 bits (140), Expect = 6e-07, Method: Composition-based stats. Identities = 31/148 (20%), Positives = 61/148 (41%), Gaps = 16/148 (10%) Query: 322 FIVCVDTSGSMGGFNEQCAKAFCLALMRIALAENRRCYIMLFSTEIVR-YELSGPQGIEQ 380 ++ +D S SM G +Q AK C ++ +L E R ++ F T+ + P G Q Sbjct: 751 VVLLLDCSTSMKGEPKQDAKKIC-KMILQSLPEKSRFNVITFGTDFTELFPTVEPVGQRQ 809 Query: 381 AIRFLSQ-----QFRGGTDLASCFR--AIMERLQSREWFDADAVVISDFIAQRLPDDVTS 433 + L G ++ R +++ + S + +++SD L ++ + Sbjct: 810 LLEALEFIEGARSVGGSSEAWRPLRSLSLLPMMNSAR----NVLLVSDGH---LTNEKLT 862 Query: 434 KVKELQRVHQHRFHAVAMSAHGKPGIMR 461 + H +R A+S+ G I+R Sbjct: 863 LEIASKYKHVNRIFTCAVSSAGNRHILR 890 >UniRef50_UPI0000D9E789 PREDICTED: similar to poly (ADP-ribose) polymerase family, member 4, partial n=7 Tax=Euteleostomi RepID=UPI0000D9E789 Length = 741 Score = 58.5 bits (140), Expect = 6e-07, Method: Composition-based stats. Identities = 35/148 (23%), Positives = 59/148 (39%), Gaps = 14/148 (9%) Query: 322 FIVCVDTSGSMGGFNEQCAKAFCLALMRIALAENRRCYIMLFSTEIVRYELSGPQGIEQA 381 I+C+D S SM G AK L + + + E + I+ F T++ + I+Q Sbjct: 426 VIICLDCSSSMEGVTFLQAKQIALHALSL-VGEKHKVNIIQFGTDVSVM-TANGDCIQQG 483 Query: 382 IRFLSQQF-------RGGTDLASCFRAIMERLQSREWFDADAVVISDFIAQRLPDDVTSK 434 LS F G TD R + +R + +++SD Q + +T + Sbjct: 484 EHNLSLHFLQSATPTMGNTDFWKTLRYLSLLYPARGSRN--ILLVSDGHLQ--DESLTLQ 539 Query: 435 VKELQRVHQHRFHAVAMSAHGKPGIMRI 462 + + R H R A + I+RI Sbjct: 540 LVKRSRPH-TRLFACGIGPTANRHILRI 566 >UniRef50_B9L896 von Willebrand factor, type A n=1 Tax=Nautilia profundicola AmH RepID=B9L896_NAUPA Length = 288 Score = 58.2 bits (139), Expect = 6e-07, Method: Composition-based stats. Identities = 24/147 (16%), Positives = 61/147 (41%), Gaps = 14/147 (9%) Query: 322 FIVCVDTSGSMGG-FNEQCAKAFCLALMRIALAENRRCYIMLFST-EIVRYELS-GPQGI 378 ++ +DTSGSM AKA L + +N +++F + L+ + Sbjct: 78 IVIDLDTSGSMAEFNKIDAAKAVSLDFAK--KRKNDALGLVVFGNIAYIASPLTFDKKTF 135 Query: 379 EQAIRFLSQQFRGGTDLASCFRAIMERLQSREWFDAD----AVVISDFIAQRLPDDVTSK 434 E ++ + GG + + L S + +A+ ++++D + + Sbjct: 136 EDILKRIYVSIAGG---KTAIYDAL-FLSSNLFKNANGEKIIILLTDGMDNMSITPLDVV 191 Query: 435 VKELQRVHQHRFHAVAMSAHGKPGIMR 461 +K+L++ H + +++A+ +++ Sbjct: 192 IKKLKKEH-IKVYSIAIGGDADLSVLK 217 >UniRef50_C5CDX4 von Willebrand factor type A n=1 Tax=Kosmotoga olearia TBF 19.5.1 RepID=C5CDX4_KOSOT Length = 730 Score = 58.2 bits (139), Expect = 7e-07, Method: Composition-based stats. Identities = 25/188 (13%), Positives = 61/188 (32%), Gaps = 18/188 (9%) Query: 286 VEKQLLTYRLHGESWREKVIERPVVHKDYDEQPRGPFIVCVDTSGSMGGFNEQCAKAFCL 345 + ++ Y + ++ + P+ + +D SGSM G + AK L Sbjct: 241 LSSSIMNYWDEADRRGYFLLTLVPPREPERIIPK-DIVFILDISGSMSGQKIEKAKLALL 299 Query: 346 ALMRIALAENRRCYIMLFSTEI-----VRYELSGPQGIEQAIRFLSQQFRGGTDLASCFR 400 ++++ +R I+ F+ E+ S A++ G T++ Sbjct: 300 QVLQMLHEGDR-FSIITFNNEVNNLTERLLPFSDRTEWYPAVK--QIMAGGMTNIHDALL 356 Query: 401 AIMERL----QSREWFDADAVVISDFI-AQRLPD--DVTSKVKELQRVHQHRFHAVAMSA 453 +E L + + ++D + + D + +L +V + Sbjct: 357 EGIEVLGTQSTDDRYKV--VLFLTDGAPTEGITDIGTIIRDSTKLAKVRDVHLFVFGVGY 414 Query: 454 HGKPGIMR 461 ++ Sbjct: 415 DVNAELLD 422 >UniRef50_A7HVH6 Vault protein inter-alpha-trypsin domain protein n=1 Tax=Parvibaculum lavamentivorans DS-1 RepID=A7HVH6_PARL1 Length = 755 Score = 57.8 bits (138), Expect = 8e-07, Method: Composition-based stats. Identities = 36/178 (20%), Positives = 63/178 (35%), Gaps = 11/178 (6%) Query: 290 LLTYRLHGESWREKVIERPVVHKDYDEQPRGPFIVCVDTSGSMGGFNEQCAKAFCLALMR 349 L R+ E + ++ P + +PR V +D SGSM G + AK L + Sbjct: 318 LFRERVGNEDYLLVMLTPPSGSVQPEAKPREAIFV-IDNSGSMSGPSMVQAKESLLWALD 376 Query: 350 IALAENRRCYIMLFSTEI-VRYELSGPQGIEQAI---RFL-SQQFRGGTDLASCFRAIM- 403 L ++ F + V + + P E +F+ S + GGT++ RA + Sbjct: 377 R-LKPGDTFNVIRFDDTLTVLFPDAVPAHGENLAVAKKFVKSLEANGGTEMLPALRASLI 435 Query: 404 ERLQSREWFDADAVVISDFIAQRLPDDVTSKVKELQRVHQHRFHAVAMSAHGKPGIMR 461 +R + V ++D + L R R V + + M Sbjct: 436 DRNVNDGTRLRQIVFLTDGAISNEAELFHEITSNLGR---SRLFTVGIGSAPNSYFMT 490 >UniRef50_Q54CQ8 von Willebrand factor A domain-containing protein DDB_G0292740 n=1 Tax=Dictyostelium discoideum RepID=Y2740_DICDI Length = 910 Score = 57.8 bits (138), Expect = 9e-07, Method: Composition-based stats. Identities = 27/167 (16%), Positives = 51/167 (30%), Gaps = 12/167 (7%) Query: 294 RLHGESWREKVIERPVVHKDYDEQPRGPFIVCVDTSGSMGGFNEQCAKAFCLALMRIALA 353 + + D +G FI +D SGSM G A+ L ++ +L Sbjct: 319 EKSSYAIALNFFPKFESINKEDIYQKGEFIFLIDCSGSMSGNPIDSARR-ALEIIIRSLN 377 Query: 354 ENRRCYIMLFSTEIVR--YELSGPQGIEQAIRFLSQQFR-----GGTDLASCFRAIMERL 406 E + I F + + E S + GGT+L + I+ + Sbjct: 378 EQCKFNIYCFGSGFNKAFQEGSRKYDDDSLAVVNRYVSNISANLGGTELLQPIKDILSKE 437 Query: 407 QSREWFDADAVVISDFIAQRLPDDVTSKVKELQRVHQHRFHAVAMSA 453 E+ +++D + V + R + + Sbjct: 438 IDPEYPR-QIFILTDGAVSDRSK-LIEFVS--KESKTTRIFTYGIGS 480 >UniRef50_B1KDL1 LPXTG-motif cell wall anchor domain protein n=1 Tax=Shewanella woodyi ATCC 51908 RepID=B1KDL1_SHEWM Length = 739 Score = 57.8 bits (138), Expect = 1e-06, Method: Composition-based stats. Identities = 28/150 (18%), Positives = 56/150 (37%), Gaps = 15/150 (10%) Query: 322 FIVCVDTSGSMGGFNEQCAK-AFCLALMRIALAENRRCYIMLFSTEIVRYELSGPQGIEQ 380 I+ +DTSGSM G + AK A AL L ++ F++ + + Sbjct: 365 LILVIDTSGSMSGASIAQAKRALNYALA--GLKAKDTFNVIEFNSNVGSLSPYSLPATAK 422 Query: 381 AIRFL-----SQQFRGGTDLASCFRAIMERLQSREWFDAD----AVVISDFIAQRLPDDV 431 I S + GGT++ A +++ E ++ + ++D + Sbjct: 423 NIGLANQYVRSLKANGGTEMQLALNAALDKGTETEALGSERLRQVLFMTDGSVGD-EQSL 481 Query: 432 TSKVKELQRVHQHRFHAVAMSAHGKPGIMR 461 +K Q++ + R + + + MR Sbjct: 482 FHLIK--QKIGESRLFTLGIGSAPNSHFMR 509 >UniRef50_D2VY88 Predicted protein n=1 Tax=Naegleria gruberi RepID=D2VY88_NAEGR Length = 1082 Score = 57.4 bits (137), Expect = 1e-06, Method: Composition-based stats. Identities = 30/155 (19%), Positives = 54/155 (34%), Gaps = 17/155 (10%) Query: 322 FIVCVDTSGSMGGFN-EQCAKAFCLALMRIALAENRRCYIMLFSTEIVRYELSGPQGIEQ 380 FI D SGSM G EQ + + C ++ + + ++ G ++ Sbjct: 131 FIALADVSGSMQGRPWEQVCTSLKHFAQQSFNNPAIICRMVAYESSAKEIDMKGT--LQS 188 Query: 381 AIRFLSQQF-RGGTDLASCFRAIMERLQSRE--------WFDADAVVISDFIA---QRLP 428 IR + F GGTD AS F+ + + + ++D P Sbjct: 189 IIRNIETAFTGGGTDFASAFQLACTIITRESGQDRENLPFGNVVITFLTDGEDFSKVGKP 248 Query: 429 DDVTSKVKELQRVH--QHRFHAVAMSAHGKPGIMR 461 + +E+ RV+ H V +H ++ Sbjct: 249 GGLQYLSEEINRVYRGDITIHTVGFGSHHNLELLD 283 >UniRef50_C0D9H7 Putative uncharacterized protein n=1 Tax=Clostridium asparagiforme DSM 15981 RepID=C0D9H7_9CLOT Length = 1360 Score = 57.4 bits (137), Expect = 1e-06, Method: Composition-based stats. Identities = 25/147 (17%), Positives = 58/147 (39%), Gaps = 9/147 (6%) Query: 322 FIVCVDTSGSMGGFNEQCAKAFCLALMRIALAENRRCYIMLFSTEIV-RYELS-GPQGIE 379 ++C D SGSM G + ++A +++ +++ N R ++LF++ + + + P I Sbjct: 551 MLLCCDVSGSMQGRPIEDSRAAVISMAE-SMSGNARLGVILFNSSVQGLTDFTVQPDVIR 609 Query: 380 QAIRFLSQQFRGGTDLASCFRAIMERLQSRE-WFDADAVVISDFIAQR--LPDDVTSKVK 436 S GGT++ +E VV+SD +++ + + Sbjct: 610 STAE--SMTANGGTNIFDTVVHGLESFPKNGPEVLNTLVVMSDGQENNAHSAEEIQTAIG 667 Query: 437 ELQRVHQHRFHAVAMSAH-GKPGIMRI 462 + + H + + + + I Sbjct: 668 QAAKDKSILVHCLGLGSEVDANYLQTI 694 >UniRef50_A1S752 Inter-alpha-trypsin inhibitor domain protein n=1 Tax=Shewanella amazonensis SB2B RepID=A1S752_SHEAM Length = 753 Score = 57.4 bits (137), Expect = 1e-06, Method: Composition-based stats. Identities = 32/148 (21%), Positives = 61/148 (41%), Gaps = 14/148 (9%) Query: 322 FIVCVDTSGSMGGFNEQCAK-AFCLALMRIALAENRRCYIMLFSTEIVRY-ELSGPQ--- 376 ++ +DTSGSM G + A+ A AL + + I+ FS++ + P Sbjct: 399 LVLVIDTSGSMAGDSMVQARSALIHALGGLGPQD--SFNIIAFSSDARPLWPDAKPATAF 456 Query: 377 GIEQAIRFL-SQQFRGGTDLASCFRAIME---RLQSREWFDADAVVISDFIAQRLPDDVT 432 + A +F+ S + GGT++AS ++ + + I+D D + Sbjct: 457 NLGAAQQFVRSLEADGGTEMASALELALKTPSVVDEDTKRLRQVLFITDGAVNG-EDALF 515 Query: 433 SKVKELQRVHQHRFHAVAMSAHGKPGIM 460 + ++ +R+ R VA+ A M Sbjct: 516 NLIE--RRLGTSRLFPVAIGAAPNGYFM 541 >UniRef50_Q7G2L9 Os10g0464500 protein n=3 Tax=Magnoliophyta RepID=Q7G2L9_ORYSJ Length = 719 Score = 57.4 bits (137), Expect = 1e-06, Method: Composition-based stats. Identities = 23/108 (21%), Positives = 41/108 (37%), Gaps = 7/108 (6%) Query: 321 PFIVCVDTSGSMGGFNEQCAKAFCLALMRIALAENRRCYIMLFSTEIVRY----ELSGPQ 376 + +D S SM G K ++ + AL R ++ FS+ R +++ Sbjct: 261 DLVTVLDVSWSMAGTKLALLKR-AMSFVIQALGPGDRLSVVTFSSSARRLFPLRKMTESG 319 Query: 377 GIEQAIRFLSQQFRGGTDLASCFRAIMERLQSREWFD--ADAVVISDF 422 R S GGT++A R ++ R + V++SD Sbjct: 320 RQRALQRVSSLVADGGTNIADALRKAARVMEDRRERNPVCSIVLLSDG 367 >UniRef50_D0LST5 Putative uncharacterized protein n=1 Tax=Haliangium ochraceum DSM 14365 RepID=D0LST5_HALO1 Length = 497 Score = 57.4 bits (137), Expect = 1e-06, Method: Composition-based stats. Identities = 34/179 (18%), Positives = 68/179 (37%), Gaps = 24/179 (13%) Query: 254 GLQQSDDILRLLPPELATLGITELEYEFYRRLVEKQLLTYRLHGESWREKVIERPVVHKD 313 +++ +I L+ ELA F +++++K+LL Y E E+ ++ Sbjct: 265 SIERKGNIDSLMLSELAY-----DREIFEQKVLDKELLYYAHEREREEEQRLQ------- 312 Query: 314 YDEQPRGPFIVCVDTSGSMGGFNEQCAKAFCLALMRIALAENRRCYIMLFSTEIVRYELS 373 + VD+S SM G + A+ L L++ E ++ F + + Sbjct: 313 ---------YILVDSSASMRGQRQVFARGLALTLIKKLSLEGDEVWMRFFDSRLHELVKV 363 Query: 374 GPQGIEQAIRFLSQQFRGGTDLASCFRAI---MERLQSREWFDADAVVISDFIAQRLPD 429 G G LS + G + + FR + + RL+ + +I+ P+ Sbjct: 364 GRSGQVPVPYLLSFRSERGRNYSRVFRQLGLELTRLRRDQNRRVMVYIITHGQCHVAPE 422 >UniRef50_A6X8G3 LPXTG-motif cell wall anchor domain protein n=11 Tax=Rhizobiales RepID=A6X8G3_OCHA4 Length = 750 Score = 57.4 bits (137), Expect = 1e-06, Method: Composition-based stats. Identities = 30/145 (20%), Positives = 56/145 (38%), Gaps = 10/145 (6%) Query: 322 FIVCVDTSGSMGGFNEQCAKAFCLALMRIALAENRRCYIMLFSTEI-VRYELS---GPQG 377 I +D SGSMGG + + AKA + +R ++ F + +E S + Sbjct: 354 VIFVIDNSGSMGGTSIEQAKASLDYALSQLQPGDR-FNVIRFDDTLTKFFEDSVDANQEN 412 Query: 378 IEQAIRFL-SQQFRGGTDLASCFRAIMERLQSREWFDADAVVISDFIAQRLPDDVTSKVK 436 I A RF+ S + +GGT++ A ++ V ++D + ++ Sbjct: 413 IASARRFVTSLEAQGGTEMLPALHAALDDSNQGNGLR-QIVFLTDGE---ISNEQQLLDA 468 Query: 437 ELQRVHQHRFHAVAMSAHGKPGIMR 461 R + R V + + +M Sbjct: 469 VAARRGRSRIFMVGIGSAPNSYLMN 493 >UniRef50_D1HHA4 Whole genome shotgun sequence of line PN40024, scaffold_125.assembly12x (Fragment) n=1 Tax=Vitis vinifera RepID=D1HHA4_VITVI Length = 630 Score = 57.4 bits (137), Expect = 1e-06, Method: Composition-based stats. Identities = 31/135 (22%), Positives = 52/135 (38%), Gaps = 9/135 (6%) Query: 305 IERPVVHKDYDEQPRGPF--IVCVDTSGSMGGFNEQCAKAFCLALMRIALAENRRCYIML 362 I+ P + D R P + +D SGSM G K L++ +R I+ Sbjct: 188 IKAPALLDDAHLLDRAPIDLVAVLDVSGSMAGSKLSLLKRAVCFLIQNLGPSDR-LSIVS 246 Query: 363 FSTEIVR-YELSG-PQGIEQAIRFL--SQQFRGGTDLASCFRAIMERLQSREWFD--ADA 416 FS+ R + L +A S GGT++ + + L+ R + A Sbjct: 247 FSSTARRIFPLRRMSDNGREAAGLAINSLTSSGGTNIVEGLKKGVRVLEERSEQNPVASI 306 Query: 417 VVISDFIAQRLPDDV 431 +++SD D+V Sbjct: 307 ILLSDGKDTYNCDNV 321 >UniRef50_A6G415 von Willebrand factor, type A n=1 Tax=Plesiocystis pacifica SIR-1 RepID=A6G415_9DELT Length = 877 Score = 57.0 bits (136), Expect = 1e-06, Method: Composition-based stats. Identities = 31/156 (19%), Positives = 57/156 (36%), Gaps = 25/156 (16%) Query: 322 FIVCVDTSGSMGGFNEQCAKAFCLALMRIALAENRR---CYIMLF-STEIVRYELSGPQG 377 I +D SGSM G AK +R AL+ R ++ F S+ + YE + P Sbjct: 375 MIFVIDRSGSMSGVPLALAK----QTLREALSHLRPVDTFNVISFESSTAMLYEAAVPAN 430 Query: 378 IEQAI---RFL-SQQFRGGTDLASCFRAIMER---LQSREWFDADAVVISDFIAQRLPDD 430 + + RF+ Q GGT ++ A + L + ++D D+ Sbjct: 431 EQNLVHAERFIDGLQAGGGTMMSGAVDAALSPEIGLGRHRY----VFFVTDG-FISNEDE 485 Query: 431 VTSKVKELQRV-----HQHRFHAVAMSAHGKPGIMR 461 + + L R + R + + + ++ Sbjct: 486 IARQASALVRAADKAGQRARVFGMGIGSSPNRELLA 521 >UniRef50_B9GVZ4 Predicted protein n=4 Tax=rosids RepID=B9GVZ4_POPTR Length = 757 Score = 57.0 bits (136), Expect = 1e-06, Method: Composition-based stats. Identities = 32/167 (19%), Positives = 60/167 (35%), Gaps = 19/167 (11%) Query: 322 FIVCVDTSGSMGGFNEQCAKAFCLALMRIALAENRRCYIMLFSTEIV----RYELSGPQG 377 I +D SGSM G + AK L+ ++ E+ I+ F + E + + Sbjct: 333 VIFIIDISGSMKGGPFESAKNGLLSSLQKLNPED-SFNIIAFKMDTYLFSSVMEQATEEA 391 Query: 378 IEQAIRFL--SQQFRGGTDLASCFRAIMERLQSREWFDADAVVISDFIAQRLPDDVTSKV 435 I +A R+L GGT++ + ++ L +I+D + D+ + V Sbjct: 392 IIEATRWLNDKLTADGGTNILGPLKQAIKLLAETTNSIPVIFLITDGAVED-ERDICNFV 450 Query: 436 KELQRVHQ---HRFHAVAMSAHGKPGIMRI--------FDHIWRFDT 471 K R + + +R+ FD + D+ Sbjct: 451 KGYLPSGGSISLRISTFGIGTYCNHHFLRMLAQIGRGHFDTAYDADS 497 >UniRef50_Q4SBF6 Chromosome 11 SCAF14674, whole genome shotgun sequence. (Fragment) n=16 Tax=Euteleostomi RepID=Q4SBF6_TETNG Length = 1039 Score = 57.0 bits (136), Expect = 2e-06, Method: Composition-based stats. Identities = 28/168 (16%), Positives = 60/168 (35%), Gaps = 14/168 (8%) Query: 322 FIVCVDTSGSMGGFNEQCAKAFCLALMRIALAENRRCYIMLFSTEIVRYELSGPQGIEQA 381 + +D SGSM G Q + L ++ E+ I+LF I + S + ++ Sbjct: 435 VVFVIDMSGSMSGTKMQQTREAMLKILEDLDPED-HFGIILFDHRIQFWNTSLSKATKEN 493 Query: 382 IRFL-----SQQFRGGTDLASCFRAIMERLQSRE------WFDAD-AVVISDFIAQRLPD 429 I + Q GGTD+ + ++ L+ D ++++D Sbjct: 494 IDEAMVYVKAIQSYGGTDINAPVLKAVDMLKEDRKAKRLPEKSIDMIILLTDGDPNSGES 553 Query: 430 DVTSKVKELQRVHQHRFHAVAMSAHGKPGIMRIFDHIWRFDTGMRSRL 477 + + ++ + ++ G D + R + G+ R+ Sbjct: 554 RIPVIQENVKAAIGGQMSLFSLG-FGNDVKYPFLDVMSRENNGLARRI 600 >UniRef50_C4ZKE8 von Willebrand factor type A n=2 Tax=Thauera sp. MZ1T RepID=C4ZKE8_THASP Length = 840 Score = 57.0 bits (136), Expect = 2e-06, Method: Composition-based stats. Identities = 30/154 (19%), Positives = 56/154 (36%), Gaps = 24/154 (15%) Query: 324 VCVDTSGSMGGFNEQCAKAFCLALMRIALAENRRCYIMLFSTEIV--------RYELSGP 375 + VD SGSM G + A+ A++ L E R + F + + + Sbjct: 271 ILVDCSGSMQGDSIAAARRALQAIIA-GLREGERFSLSRFGSTVEHRSRALWRTSAATRQ 329 Query: 376 QGIEQAIRFLSQQFRGGTDLASCFRAIMERLQSRE-----WFDA---DAVVISDFIAQRL 427 G A++ Q GGT++ + + + E A D ++I+D + Sbjct: 330 AGQRWAMQL--QADLGGTEMENALASTLALAGDAEPSPGTEEGAAAVDLLLITDGQIHAI 387 Query: 428 PDDVTSKVKELQRVHQHRFHAVAMSAHGKPGIMR 461 V + R +R V + + G++R Sbjct: 388 DRTV-----KRARALGNRIFVVGIGSAPAEGVLR 416 >UniRef50_UPI0000E105CF vault protein inter-alpha-trypsin n=1 Tax=Glaciecola sp. HTCC2999 RepID=UPI0000E105CF Length = 757 Score = 57.0 bits (136), Expect = 2e-06, Method: Composition-based stats. Identities = 29/151 (19%), Positives = 52/151 (34%), Gaps = 21/151 (13%) Query: 323 IVCVDTSGSMGGFNEQCAKAFCLALMRIA-----LAENRRCYIMLFSTEIVRY----ELS 373 I +D+SGSM G A A+ I L E+ I+ F +E + + Sbjct: 393 IFVLDSSGSMHGT------ALTQAIDAIREGVSYLTEHDTFNIVDFDSEARALWRQSQFA 446 Query: 374 GPQGIEQAIRFLSQ-QFRGGTDLASCFRAIMERLQSREWFDADAVVISDFIAQRLPDDVT 432 +A+RFL GGT++ + +L + ++D ++ Sbjct: 447 DEVSKAEAMRFLRHVDSDGGTNMQDALALSLTQLLDSSTGLTQVIFVTDG----SINNER 502 Query: 433 SKVKELQRVHQH-RFHAVAMSAHGKPGIMRI 462 +K++ R V + A M Sbjct: 503 ELLKQIAEQLGDKRLFTVGIGAAPNSHFMEY 533 >UniRef50_A5GQG5 Protoporphyrin IX Mg-chelatase subunit ChlD n=3 Tax=cellular organisms RepID=A5GQG5_SYNR3 Length = 653 Score = 57.0 bits (136), Expect = 2e-06, Method: Composition-based stats. Identities = 31/134 (23%), Positives = 47/134 (35%), Gaps = 12/134 (8%) Query: 298 ESWREKVIERPVVHKDYDEQPRGPFIV-CVDTSGSMGGFNEQCAKAFCLALMRIALAENR 356 E R+ +++ + ++ G ++ VD SGSM Q AK L L+ A Sbjct: 435 EPHRKVIVQDGDLRAKQLQRKAGALVIFLVDASGSMALNRMQSAKGAVLRLLTEAYENRD 494 Query: 357 RCYIMLFSTEIVRYELSGPQGIEQAIRFLSQQFRGGTDLASCFRAIM--------ERLQS 408 ++ F E L + I A R L GG S + LQ+ Sbjct: 495 EVALIPFRGEQAEVLLPPTRSITAAKRRLETMACGG---GSPLAHGLAQAARVGNNALQT 551 Query: 409 REWFDADAVVISDF 422 E V I+D Sbjct: 552 GELSQVVVVAITDG 565 >UniRef50_UPI0001C38CFE von Willebrand factor, type A n=1 Tax=Arthrospira platensis str. Paraca RepID=UPI0001C38CFE Length = 489 Score = 56.6 bits (135), Expect = 2e-06, Method: Composition-based stats. Identities = 23/137 (16%), Positives = 46/137 (33%), Gaps = 5/137 (3%) Query: 321 PFIVCVDTSGSMGGFNEQCAKAFCLALMRIALAENRRCYIMLFSTEIVRYELSGPQGIEQ 380 ++ +DTSGSM G + + + ++ FS+ E Sbjct: 53 AVVMLIDTSGSMSGSKLPEVQRAASEFVSRQNLKRDDLAVVEFSSRASVVADFTRDEREL 112 Query: 381 AIRFLSQQFRGGTDLASCFRAIMERLQSREWFDADAVVISDFIAQRLPDDVTSKVKELQR 440 GGT+L+ F LQ+ + + ++ +D P++ Q+ Sbjct: 113 QQAIARLSAWGGTNLSEGFNLATSVLQNSD-RPGNILLFTDGE----PNNRRMAASIAQQ 167 Query: 441 VHQHRFHAVAMSAHGKP 457 + + VA+ P Sbjct: 168 IRASGINLVAVGTGDAP 184 >UniRef50_A5B5Z1 Putative uncharacterized protein n=2 Tax=Vitis vinifera RepID=A5B5Z1_VITVI Length = 686 Score = 56.6 bits (135), Expect = 2e-06, Method: Composition-based stats. Identities = 31/135 (22%), Positives = 52/135 (38%), Gaps = 9/135 (6%) Query: 305 IERPVVHKDYDEQPRGPF--IVCVDTSGSMGGFNEQCAKAFCLALMRIALAENRRCYIML 362 I+ P + D R P + +D SGSM G K L++ +R I+ Sbjct: 186 IKAPALLDDAHLLDRAPIDLVAVLDVSGSMAGSKLSLLKRAVCFLIQNLGPSDR-LSIVS 244 Query: 363 FSTEIVR-YELSG-PQGIEQAIRFL--SQQFRGGTDLASCFRAIMERLQSREWFD--ADA 416 FS+ R + L +A S GGT++ + + L+ R + A Sbjct: 245 FSSTARRIFPLRRMSDNGREAAGLAINSLXSSGGTNIVEGLKKGVRVLEERSEQNPVASI 304 Query: 417 VVISDFIAQRLPDDV 431 +++SD D+V Sbjct: 305 ILLSDGKDTYNCDNV 319 >UniRef50_B7G6X2 Predicted protein n=1 Tax=Phaeodactylum tricornutum CCAP 1055/1 RepID=B7G6X2_PHATR Length = 523 Score = 56.6 bits (135), Expect = 2e-06, Method: Composition-based stats. Identities = 30/160 (18%), Positives = 58/160 (36%), Gaps = 9/160 (5%) Query: 285 LVEKQLLTYRLHGESWREKVIERPVVHKDYDEQPRGPF--IVCVDTSGSMGGFNEQCAKA 342 L E+ ++ + DE R P IV +D SGSM G + K Sbjct: 31 LAERDIIGIDSEVSTNHFCASIHARTMPKEDEDCRTPIDLIVVLDVSGSMTGNKLKLCKK 90 Query: 343 FCLALMRIALAENRRCYIMLFSTEIV----RYELSGPQGIEQAIRFLSQQFRGGTDLASC 398 L+R+ ++R ++ F ++ +S + S RG T++++ Sbjct: 91 TLTMLLRVLQTQDR-FGLISFGSDARVEFPAQAMSKQNKASALQKIQSLTTRGCTNMSAA 149 Query: 399 FRAIMERLQSREWFDA--DAVVISDFIAQRLPDDVTSKVK 436 ++ L+ E + ++D +A D+ V Sbjct: 150 LGLAVQELKIIEKSNPVRSLFFLTDGLANEGISDLDGLVS 189 >UniRef50_Q55G98 von Willebrand factor A domain-containing protein DDB_G0267758 n=1 Tax=Dictyostelium discoideum RepID=Y7758_DICDI Length = 878 Score = 56.6 bits (135), Expect = 2e-06, Method: Composition-based stats. Identities = 26/151 (17%), Positives = 54/151 (35%), Gaps = 10/151 (6%) Query: 309 VVHKDYDEQPRGPFIVCVDTSGSMGGFNEQCAKAFCLALMRIALAENRRCYIMLFSTEIV 368 K D + FI +D SGSM G + K ++R R ++ F + Sbjct: 304 KDIKIEDMNQKSEFIFLIDCSGSMVGEPMRKVKRAMEIIIRSLNENQHRVNVVCFGSSFK 363 Query: 369 RYELSGPQGIEQAIRFLSQQFR------GGTDLASCFRAIMERLQSREWFDADAVVISDF 422 + ++ + LS+ + GGT+L + + I+ + E+ +++D Sbjct: 364 KVFKVSRDYNDETLECLSKYIQSIEANLGGTELLTPIKNILSSPPNPEYPR-QLFILTDG 422 Query: 423 IAQRLPDDVTSKVKELQRVHQHRFHAVAMSA 453 A D + + + + R + Sbjct: 423 EAP-HRDKIIHYLS--KESNTTRIFTYGIGD 450 >UniRef50_Q9FF49 Retroelement pol polyprotein-like n=34 Tax=Magnoliophyta RepID=Q9FF49_ARATH Length = 704 Score = 56.6 bits (135), Expect = 2e-06, Method: Composition-based stats. Identities = 33/189 (17%), Positives = 66/189 (34%), Gaps = 17/189 (8%) Query: 289 QLLTYRLHGESWREKVIERPVVHKDYDEQPRGPF--IVCVDTSGSMGGFNEQCAKAFCLA 346 + ++++ K + R P + +D SGSM G K Sbjct: 218 RSVSFKDFAVLINLKAPTSSKSSSNPSSSSRAPVDLVTVLDVSGSMAGTKLALLKRAMGF 277 Query: 347 LMRIALAENRRCYIMLFSTEIVR---YELSGPQGIEQAIRFL-SQQFRGGTDLASCFRAI 402 +++ +R ++ FS+ R L G ++A++ + S GGT++A + Sbjct: 278 VIQNLGPFDR-LSVISFSSTARRNFPLRLMTETGKQEALQAVNSLVSNGGTNIAEGLKKG 336 Query: 403 MERLQSREWFD--ADAVVISDFIAQRL------PDDVTSKVKELQRVHQHR--FHAVAMS 452 L R + + + V++SD K + ++ +R HA Sbjct: 337 ARVLIDRRFKNPVSSIVLLSDGQDTYTMTSPNGSRGTDYKALLPKEINGNRIPVHAFGFG 396 Query: 453 AHGKPGIMR 461 A +M Sbjct: 397 ADHDASLMH 405 >UniRef50_A3ZR58 Putative uncharacterized protein n=1 Tax=Blastopirellula marina DSM 3645 RepID=A3ZR58_9PLAN Length = 1032 Score = 56.6 bits (135), Expect = 2e-06, Method: Composition-based stats. Identities = 29/163 (17%), Positives = 58/163 (35%), Gaps = 8/163 (4%) Query: 302 EKVIERPVVHKDYDEQPRGPFIVCVDTSGSMGGFNEQCAKAFCLALMRIALAENRRCY-I 360 E+ +D P G ++ +D SGSM G Q + LA +R A + Sbjct: 439 EEASPVRFTIRDAKVVPVGALMLVLDKSGSMQGEKMQMTQGAALAAIRAMGAA--DFAGV 496 Query: 361 MLFSTEIVRY-ELSGPQGIEQAIRFLSQ-QFRGGTDLASCFRAIMERLQSREWFDADAVV 418 + F ++ R + + + + GGT++ LQ+ + +V Sbjct: 497 IGFDSQAQRIVPIRKVDNPGMFVAQVRKLSASGGTNMTPGVALGFRDLQNVDAGVKHMIV 556 Query: 419 ISDFIAQRLPDDVTSKVKELQRVHQHRFHAVAMSAHGKPGIMR 461 +SD + +++ + AVA+ + +M Sbjct: 557 LSDGQTEPGN---VAQIASDMKKMGMTVSAVAVGSDADQKLMA 596 >UniRef50_D2W4Q3 Predicted protein n=1 Tax=Naegleria gruberi RepID=D2W4Q3_NAEGR Length = 454 Score = 56.2 bits (134), Expect = 2e-06, Method: Composition-based stats. Identities = 22/127 (17%), Positives = 43/127 (33%), Gaps = 2/127 (1%) Query: 322 FIVCVDTSGSMGGFNEQCAKAFCLALMRIALAENRRCYIMLFSTEIVRYELSGPQGIEQA 381 ++ +D SGSM G AK L + ++ + T Y+L + Sbjct: 41 IVIALDVSGSMRGQGIDQAKIAISNLFEQVVDTP-DVVLITYDTSAELYDLRKKPAETRQ 99 Query: 382 IRFLSQQFRGGTDLASCFRAIMERLQSREWFDADAVVISDFIAQRLPDDVTSKVKELQRV 441 Q GGTD F AI + + +D ++++++V Sbjct: 100 STLEQIQAGGGTDFTCVFEAISNLDMFNRQSEVAILFFTDGQ-DGSSHKREKAIEQMKKV 158 Query: 442 HQHRFHA 448 + + + Sbjct: 159 LETKTQS 165 >UniRef50_UPI00006A1915 Transmembrane protein 110. n=2 Tax=Xenopus (Silurana) tropicalis RepID=UPI00006A1915 Length = 728 Score = 56.2 bits (134), Expect = 2e-06, Method: Composition-based stats. Identities = 29/172 (16%), Positives = 71/172 (41%), Gaps = 16/172 (9%) Query: 322 FIVCVDTSGSMGGFN-EQCAKAFCLALMRIALAENRRCYIMLFSTEIVRYE----LSGPQ 376 + +D SGSM G +Q +AF L + ++ I++F ++ +++ + P Sbjct: 250 VVFVIDHSGSMHGQKIKQTYEAFLKILADLPEEDH--FGILIFDDKVDKWQNTLVKAVPD 307 Query: 377 GIEQAIRFLSQ-QFRGGTDLASCFRAIMERLQS-------REWFDADAVVISDFIAQRLP 428 I +A +F+S+ RGGTD+ A ++ L++ + + + +SD Sbjct: 308 NIIKAKQFVSKISARGGTDINKALLAAVKMLKNTSRNKLLPKISTSIILFLSDGEPTSGV 367 Query: 429 DDVTSKVKELQRVHQHRFHAVAMSAHGKPGIMRIFDHIWRFDTGMRSRLLRR 480 + + +++ ++ + + G + + + G+ R+ Sbjct: 368 TNHNEIINNVKKANERQTTLYCLG-FGNDVDFNFLEKMALENGGLARRIYED 418 >UniRef50_Q47YR5 Von Willebrand factor type A domain protein n=2 Tax=cellular organisms RepID=Q47YR5_COLP3 Length = 786 Score = 56.2 bits (134), Expect = 2e-06, Method: Composition-based stats. Identities = 26/150 (17%), Positives = 54/150 (36%), Gaps = 14/150 (9%) Query: 321 PFIVCVDTSGSMGGFNEQCAKAFCLALMRIALAENRRCYIMLFSTEIV----RYELSGPQ 376 I +DTSGSM + + AK+ + ++ I+ F + ++ Sbjct: 395 DIIFIIDTSGSMQAGSMEQAKSSLQLALLQLNNKD-SFNIIAFDNDTELLFPVTHMASAH 453 Query: 377 GIEQAIRFL-SQQFRGGTDLASCFRAIMERLQSREWFDA----DAVVISDFIAQRLPDDV 431 I +A +F+ GGT++ + ++ + + V I+D A ++ Sbjct: 454 NISKAQQFIDGLSANGGTEMYRPLSNAL-MMKKDKTQSSKAIRQIVFITDG-AVANEFEL 511 Query: 432 TSKVKELQRVHQHRFHAVAMSAHGKPGIMR 461 + Q R + V + A M+ Sbjct: 512 MQLLNTAQ--GDFRLYTVGIGAAPNGYFMK 539 >UniRef50_Q083T9 Vault protein inter-alpha-trypsin domain protein n=1 Tax=Shewanella frigidimarina NCIMB 400 RepID=Q083T9_SHEFN Length = 722 Score = 56.2 bits (134), Expect = 3e-06, Method: Composition-based stats. Identities = 30/153 (19%), Positives = 60/153 (39%), Gaps = 17/153 (11%) Query: 322 FIVCVDTSGSMGGFNEQCAKAFCLALMRIALAENRRCYIMLFSTEIVRYELS----GPQG 377 I+ +DTSGSM G + AK L L + I+ F++++ + + Sbjct: 346 LILVIDTSGSMSGQSITQAKQ-ALQFALAGLRDIDSFNIIEFNSDVTMLSATPLSANSRN 404 Query: 378 IEQAIRFL-SQQFRGGTDLASCFRAIMERLQSREWFDAD--------AVVISDFIAQRLP 428 I +A RF+ S GGT++ S + + ++ D + ++D A Sbjct: 405 IGKANRFIQSLDADGGTEMRSALQTALVDSVQQDSDQTDAHSEMLRQVIFMTDG-AVGNE 463 Query: 429 DDVTSKVKELQRVHQHRFHAVAMSAHGKPGIMR 461 ++ + + ++ R V + + MR Sbjct: 464 HELYQLIND--QLGDSRLFTVGIGSAPNSDFMR 494 >UniRef50_UPI0000F2DDBB PREDICTED: similar to Inter-alpha (globulin) inhibitor H4 (plasma Kallikrein-sensitive glycoprotein) n=1 Tax=Monodelphis domestica RepID=UPI0000F2DDBB Length = 819 Score = 56.2 bits (134), Expect = 3e-06, Method: Composition-based stats. Identities = 23/143 (16%), Positives = 57/143 (39%), Gaps = 13/143 (9%) Query: 322 FIVCVDTSGSMGGFNEQCAKAFCLALMRIALAENRRCYIMLFSTEIVRYE----LSGPQG 377 + +D SGSM G + KA + ++ E+ ++ FS + R++ L+ + Sbjct: 263 IVFLIDKSGSMAGRKIKKTKAALIKILDDLKPED-HFNMITFSGHVTRWKPELVLALDEH 321 Query: 378 IEQAIRFLSQQ-FRGGTDLASCFRAIMERLQSREWFD-------ADAVVISDFIAQRLPD 429 +++A FLS G T++ A + L + ++++D + Sbjct: 322 LKEAKTFLSNTPALGVTNVNGAVLAAVSMLDESNKKKELPEGSVSMIILLTDGDSTEGET 381 Query: 430 DVTSKVKELQRVHQHRFHAVAMS 452 + + ++ + ++H + Sbjct: 382 KLQKIHENVKAAIRGQYHLFCLG 404 >UniRef50_UPI0001BCB742 hypothetical protein FperA3_08546 n=1 Tax=Fusobacterium periodonticum ATCC 33693 RepID=UPI0001BCB742 Length = 536 Score = 56.2 bits (134), Expect = 3e-06, Method: Composition-based stats. Identities = 49/330 (14%), Positives = 111/330 (33%), Gaps = 36/330 (10%) Query: 112 QLVDANSTITSALHTLFLQRW--------RLSLIVQATTLNQQLLEEEREQLLSEVQERM 163 L+ L +W + + L E+E ++LS++++R+ Sbjct: 131 YLMMDIKNYNENKPVSLLAKWLPSIKTHNKKNYFAIKLAKKLNLTEKEYRKILSKLRDRL 190 Query: 164 TLSGQLEPILADNNTAAGRLWDMSAGQLKRGDYQLIVKYGEFLNEQPEL--KRLAEQLGR 221 + + + +VKY + E+ K E+L Sbjct: 191 NIVEKHITNKE-----------YEKIDYISVPSKAMVKYRSLFFTKDEIRFKEFIEEL-- 237 Query: 222 SREAKSIPRNDAQMETFRTMVREPATVPEQVDGLQQSDDILRLLPPELATLGITELEYEF 281 +++K N+ M F M + + I E I+ L + Sbjct: 238 -KDSKKTKYNNLFMNDFVKMYL--DNLGKIGVNYLYGKTIK-----EAYKNSISNLIKDL 289 Query: 282 YRRLVEKQLLTYRLHGESWREKVIERPVVHKDYDEQPRGPFIVCVDTSGSMGGFNEQCAK 341 + +E + + + G+ K ++ +V DTSGSM G + A Sbjct: 290 SLKELEDRQILLQRFGDEKNLINTMWKKQSKIEFDKN---VLVIADTSGSMQGTPFETAV 346 Query: 342 AFCLALMRIALAENRRCYIMLFSTEIVRYELSGPQGIEQAIRFLSQQFRGGTDLASCFRA 401 + + + + ++ R ++FS++ + Y + + + + G T++ F+ Sbjct: 347 SLAIYISQNNKSDEWRNKFIIFSSDCIEYSYNKNAELTDILDTIPLIV-GNTNIDKVFKK 405 Query: 402 IMERLQSREWFDAD-AVVISDFIAQRLPDD 430 I+ ++ D ++ISD + + Sbjct: 406 ILNDSVEKKLPQLDEVIIISDMEFDAVQNK 435 >UniRef50_A6Q208 von Willebrand factor type A domain protein n=1 Tax=Nitratiruptor sp. SB155-2 RepID=A6Q208_NITSB Length = 305 Score = 56.2 bits (134), Expect = 3e-06, Method: Composition-based stats. Identities = 26/163 (15%), Positives = 62/163 (38%), Gaps = 23/163 (14%) Query: 317 QPRG-PFIVCVDTSGSM----------GGFNEQCAKAFCLALMRIALAENRRCYIMLFST 365 + +G ++ +D SGSM ++ A I+ N +++F + Sbjct: 79 KKKGYDIVLAIDASGSMQEKGFDPTDPQKTKFDVVRSLVKAF--ISKRRNDNIGVVIFGS 136 Query: 366 -EIVRYELS-GPQGIEQAIRFLSQQFRGG-TDLASCFRAIMERLQSREWFDADAVVISDF 422 + L+ + +++ + +L G T + + L+ + ++++D Sbjct: 137 FAYIASPLTFNKEAVKKILDYLDIGVAGSKTAIDDALIESVRLLKESQAKSKIVILLTDG 196 Query: 423 I--AQRLPDDVTSKVKELQRVHQHRFHAVAMSAHGKPGIMRIF 463 I A + P DV K+ + + + + + + K GI F Sbjct: 197 IDTASKTPPDVAVKMA---KKYGVKIYTIGIGD--KRGIDEAF 234 >UniRef50_Q22HH7 von Willebrand factor type A domain containing protein n=3 Tax=Tetrahymena thermophila SB210 RepID=Q22HH7_TETTH Length = 796 Score = 56.2 bits (134), Expect = 3e-06, Method: Composition-based stats. Identities = 31/179 (17%), Positives = 73/179 (40%), Gaps = 18/179 (10%) Query: 298 ESWREKVIERPVVHKDYDEQPRGPFIVCVDTSGSMGGFNEQCAKAFCLALMRIALAENRR 357 + + + ++ E + F++ +D SGSM G N + AK + ++ +L E Sbjct: 258 PNIAQVKKQLFRQAQNEIELMKAEFLLLIDRSGSMVGSNIETAKQALIFFLK-SLPEGSI 316 Query: 358 CYIMLFST--------EIVRYELSGPQGIEQAIRFLSQQFRGGTDLASCFRAIMERLQSR 409 I+ F T + + + I++ +F Q GGT+++ + +M LQ + Sbjct: 317 YNIISFGTNYTVMYPQSVQVNDQNLQDSIDKIEKF--QANMGGTNISQALKYLMYNLQDQ 374 Query: 410 EWFDADAVVISDFIAQRLPDDVTSKVKELQRVHQHR--FHAVAMSAHGKPGIMRIFDHI 466 +I+D Q + E+ + ++ + +A+ + ++ +I + Sbjct: 375 YGLRKKIYIITDGEFQDYQPAL-----EIVKKNKFKCDINALCIGSYEFLYATQILNET 428 >UniRef50_Q54MG4 von Willebrand factor A domain-containing protein DDB_G0285975 n=4 Tax=Dictyostelium discoideum RepID=Y5975_DICDI Length = 917 Score = 56.2 bits (134), Expect = 3e-06, Method: Composition-based stats. Identities = 28/148 (18%), Positives = 56/148 (37%), Gaps = 11/148 (7%) Query: 312 KDYDEQPRGPFIVCVDTSGSMGGFNEQCAKAFCLALMRIALAENRRCYIMLFSTEIVR-- 369 D + FI +D SGSM G + AK L ++ +L EN + I F + + Sbjct: 330 TSDDVNQKSEFIFLIDCSGSMSGEPIKKAKR-ALEIIIRSLNENCKFNIYCFGSRFTKAF 388 Query: 370 --YELSGPQGIEQAIRFLSQQFR--GGTDLASCFRAIMERLQSREWFDADAVVISDFIAQ 425 ++ + +++ ++ + GGT+L R I+ E+ +++D Sbjct: 389 DNSKMYNDETLKEISGYVEKIDADLGGTELLPPIRDILSTESDFEYPR-QLFILTDGEVS 447 Query: 426 RLPDDVTSKVKELQRVHQHRFHAVAMSA 453 D + + V + R + Sbjct: 448 ER-DSLINYV--ATESNNTRIFTYGIGN 472 >UniRef50_B5ZY26 Vault protein inter-alpha-trypsin domain protein n=11 Tax=Rhizobium RepID=B5ZY26_RHILW Length = 794 Score = 55.8 bits (133), Expect = 3e-06, Method: Composition-based stats. Identities = 28/177 (15%), Positives = 59/177 (33%), Gaps = 18/177 (10%) Query: 295 LHGESWREKVIERPVVHKDYDEQPRGPFIVCVDTSGSMGGFNEQCAK-AFCLALMRIALA 353 G++ + P + + +D SGSM G + + AK + LA+ R+ Sbjct: 329 KDGKTCLLAFVTPPTAPDAAAPPAKREVVFVIDNSGSMSGPSIEQAKQSLALAISRLTPN 388 Query: 354 ENRRCYIMLFSTEIVRY----ELSGPQGIEQAIRFLS-QQFRGGTDLASCFRAIMER--- 405 + ++ F + Y + P E+AI ++ GGT++ + Sbjct: 389 DR--FNVIRFDDTMTDYFKGLVAATPDNREKAIAYVRGLPADGGTEMLPALEDALRNQGP 446 Query: 406 LQSREWFDADAVVISDFIAQRLPDDVTSKVKELQRVHQ-HRFHAVAMSAHGKPGIMR 461 + + V ++D + +E+ R V + + M Sbjct: 447 VATGALRQ--VVFLTDGAIG----NEQQLFQEITANRGDARVFTVGIGSAPNTYFMT 497 >UniRef50_C7P2A9 von Willebrand factor type A n=2 Tax=Halobacteriaceae RepID=C7P2A9_HALMD Length = 393 Score = 55.8 bits (133), Expect = 3e-06, Method: Composition-based stats. Identities = 32/176 (18%), Positives = 58/176 (32%), Gaps = 17/176 (9%) Query: 296 HGESWREKVIERPVVHKDYDEQPRGPFIVCVDTSGSMGGFNEQCAKAFCLALMRIALAEN 355 G + ++ P + + +C+DTSGSM G N + A+ + + E+ Sbjct: 16 DGTTVTAEIDVEPGEQETDVRRH---IALCIDTSGSMEGDNIKRARDGAAWVFGLLADED 72 Query: 356 RRCYIMLFSTEIVRY-------ELSGPQGIEQAIRFLSQQFRGGTDLASCFRAIMERLQS 408 I+ F TE +L ++ GGTD+ + +A E L S Sbjct: 73 -YVSIVAFDTEATVILPATRWSDLDRQTAMDHVEEL---TAGGGTDMYNGLKAAKETLSS 128 Query: 409 REWFDAD---AVVISDFIAQRLPDDVTSKVKELQRVHQHRFHAVAMSAHGKPGIMR 461 +++SD D + E R + + +R Sbjct: 129 SATGPDTVKRLLLLSDGKDNERTPDEFEGLAEAIDDAGIRIQSAGIGTDYNEATIR 184 >UniRef50_A3QDW1 Vault protein inter-alpha-trypsin domain protein n=1 Tax=Shewanella loihica PV-4 RepID=A3QDW1_SHELP Length = 776 Score = 55.8 bits (133), Expect = 3e-06, Method: Composition-based stats. Identities = 31/182 (17%), Positives = 59/182 (32%), Gaps = 19/182 (10%) Query: 294 RLHGESWREKVIERPVVHKDYDEQPRGPFIVCVDTSGSMGGFNEQCAKAFCLALMRIALA 353 + V+ P K PR + +DTSGSM G + AK+ L + + Sbjct: 377 DKEAKDSYALVMLMPPQDKARVRLPR-ELTLVIDTSGSMTGDSIAQAKSAILNALAGLGS 435 Query: 354 ENRRCYIMLFSTEIVRY-ELSGPQGIEQAIR---FL-SQQFRGGTDLASCFRAIMERLQS 408 ++ ++ F + + ++ + F+ S + GGT++A + + +S Sbjct: 436 QD-TFNVIAFDSSVRSLSPVALSATAANLGKANLFVQSLEADGGTEMAPALLRALSQPES 494 Query: 409 ---------REWFDADAVVISDFIAQRLPDDVTSKVKELQRVHQHRFHAVAMSAHGKPGI 459 + V I+D + R R V + A Sbjct: 495 GVSSISSAVKPERLKQVVFITDGAVGNEASLFALIAANIGRQ---RLFTVGIGAAPNGYF 551 Query: 460 MR 461 M Sbjct: 552 ME 553 >UniRef50_A0D1M1 Chromosome undetermined scaffold_34, whole genome shotgun sequence n=1 Tax=Paramecium tetraurelia RepID=A0D1M1_PARTE Length = 1460 Score = 55.8 bits (133), Expect = 4e-06, Method: Composition-based stats. Identities = 35/144 (24%), Positives = 55/144 (38%), Gaps = 13/144 (9%) Query: 322 FIVCVDTSGSMGGFN-EQCAKAFCLALMRIALAENRRCYIMLFSTEIVR---YELSGPQG 377 +I+ +D SGSM G E K L I R I+LF+ + YE+ Q Sbjct: 1276 YILILDDSGSMEGAFFEAAKKGLVAFLQEIQKNPESRVTIILFNHQARCVVDYEIPDAQV 1335 Query: 378 IEQAIRFLSQQFRGGTDLASCFRAIMERLQS----REWFDADAVVISDFIAQRLPDDVTS 433 ++ I+F GGTD + +++ + + +D AQ P Sbjct: 1336 QQKEIQFR----GGGTDFDEPLKLAFDKIANNPDFDNFSSHSIFFYTDGQAQY-PTKAME 1390 Query: 434 KVKELQRVHQHRFHAVAMSAHGKP 457 KVK+ + + VA S P Sbjct: 1391 KVKQFPSDKREKIELVACSFEDSP 1414 >UniRef50_Q021L5 von Willebrand factor, type A n=1 Tax=Candidatus Solibacter usitatus Ellin6076 RepID=Q021L5_SOLUE Length = 337 Score = 55.8 bits (133), Expect = 4e-06, Method: Composition-based stats. Identities = 25/130 (19%), Positives = 49/130 (37%), Gaps = 3/130 (2%) Query: 322 FIVCVDTSGSMGGFNEQCAKAFCLALMRIALAENRRCYIMLFSTEIVRYELSGPQGIEQA 381 ++ D SGSM G ++A A + A E+ ++LF+ Q E Sbjct: 112 IVIVFDCSGSM-GPKLAKSRAAVAAFLSSANPED-EFSLVLFNDRAQLVSGFNRQTDELQ 169 Query: 382 IRFLSQQFRGGTDLASCFRAIMERLQSREWFDADAVVISDFIAQRLPDDVTSKVKELQRV 441 + Q +G T L M++++ + +VISD +VK + Sbjct: 170 SKLFYAQSKGRTALLDAIYLAMDQMKHAKHSRKAVLVISDG-GDNCSRYSMREVKNRVKE 228 Query: 442 HQHRFHAVAM 451 + +++ + Sbjct: 229 GDAQIYSIGI 238 >UniRef50_O26551 Magnesium chelatase subunit ChlI n=1 Tax=Methanothermobacter thermautotrophicus str. Delta H RepID=O26551_METTH Length = 591 Score = 55.8 bits (133), Expect = 4e-06, Method: Composition-based stats. Identities = 34/158 (21%), Positives = 60/158 (37%), Gaps = 7/158 (4%) Query: 292 TYRLHGESWREKVIERPVVHKDYDEQPRGPFIVCVDTSGSMGGFNEQ-CAKAFCLALMRI 350 T R + + ++ K + R +I+ +DTS SM + AK L+R Sbjct: 399 TLRKAASAGTGTIEPEHLMEKVRIGKSRALYIIVLDTSSSMRLERKIKFAKTVSWLLLRD 458 Query: 351 ALAENRRCYIMLF---STEIVRYELSGPQGIEQAIRFLSQQFRGGTDLASCFRAIMERLQ 407 + + R ++ F +V S + +E+A+ L G T L R E Sbjct: 459 SYEKRNRIALIAFRGYEANLVVEPTSNLETVEEALEGLRS--GGRTPLTPALRLAAEVAS 516 Query: 408 SREWFDADAVVISDFIAQR-LPDDVTSKVKELQRVHQH 444 S AVVISD + ++ + L+ ++ Sbjct: 517 SSSDEACTAVVISDGRCNVFINSNLEEDMNMLETELRN 554 >UniRef50_C0PS55 Putative uncharacterized protein n=1 Tax=Picea sitchensis RepID=C0PS55_PICSI Length = 829 Score = 55.5 bits (132), Expect = 4e-06, Method: Composition-based stats. Identities = 28/135 (20%), Positives = 52/135 (38%), Gaps = 11/135 (8%) Query: 297 GESWREKVIERPVVHKDYDEQPRGPF--IVCVDTSGSMGGFNEQCAKAFCLALMRIALAE 354 E+ +++ E + D R P + +D SGSM G K ++ E Sbjct: 333 SEASKKQNYEDCEGNMVKDPGCRAPIDLVTVLDVSGSMSGTKLALLKRAMAFVISNLSPE 392 Query: 355 NRRCYIMLFSTEIVRYELSG-----PQGIEQAIRFLSQQFRGGTDLASCFRAIMERLQSR 409 +R ++ ST + L + + + L GGT++A R + L+ R Sbjct: 393 DRLSVVVFSSTAKRVFSLKRMTPDGQRAANRVVERLLCT--GGTNIAEGLRKGAKVLEDR 450 Query: 410 EWFD--ADAVVISDF 422 + A +++SD Sbjct: 451 RQRNPVASIMLLSDG 465 >UniRef50_Q54DV3 von Willebrand factor A domain-containing protein DDB_G0292016 n=1 Tax=Dictyostelium discoideum RepID=Y2016_DICDI Length = 918 Score = 55.5 bits (132), Expect = 4e-06, Method: Composition-based stats. Identities = 24/165 (14%), Positives = 53/165 (32%), Gaps = 9/165 (5%) Query: 294 RLHGESWREKVIERPVVHKDYDEQPRGPFIVCVDTSGSMGGFNEQCAKAFCLALMRIALA 353 + + + FI +D SGSM G + A+ ++R Sbjct: 272 DDKSYATAINFYPSFKNVNPDEVYQKSEFIFLIDCSGSMSGQSINKARRAMEIIIRSLNE 331 Query: 354 ENRR---CYIMLFSTEIVRYELSGPQGIEQAIRFLSQQFR--GGTDLASCFRAIMERLQS 408 +++ C+ F+ + + + +E A F+ + GGT+L I+ Sbjct: 332 QHKVNIYCFGSSFNKVFDKSRVYNDETLEIAGSFVEKISANLGGTELLPPMVDILSSPND 391 Query: 409 REWFDADAVVISDFIAQRLPDDVTSKVKELQRVHQHRFHAVAMSA 453 E+ +++D + + + R + A Sbjct: 392 PEYPR-QVFILTDGEISERDKLIDYV---AKEANTTRIFTYGIGA 432 >UniRef50_Q15NW6 Vault protein inter-alpha-trypsin n=1 Tax=Pseudoalteromonas atlantica T6c RepID=Q15NW6_PSEA6 Length = 701 Score = 55.5 bits (132), Expect = 4e-06, Method: Composition-based stats. Identities = 31/156 (19%), Positives = 58/156 (37%), Gaps = 23/156 (14%) Query: 322 FIVCVDTSGSMGGFNEQCAK-AFCLALMRIALAENRRCYIMLFSTEIVRYEL----SGPQ 376 + +DTSGSM G + AK A AL + L I+ F+ + + Sbjct: 305 VVFLLDTSGSMAGESIVQAKRAVDFALTQ--LRPEDNVNIIQFNDAPQALWKRAMPATAK 362 Query: 377 GIEQAIRF-LSQQFRGGTDLASCFRAIM----------ERLQSREWFDADAVVISDFIAQ 425 I++A + S GGT++A + + L S + V I+D + Sbjct: 363 HIQRARNWVASLHADGGTEMAPALTLALNKPSLHRDDSDLLGSHKLRQ--VVFITDG-SV 419 Query: 426 RLPDDVTSKVKELQRVHQHRFHAVAMSAHGKPGIMR 461 D + S ++ ++ +R + + + M Sbjct: 420 SNEDALMSLIES--KLADNRLFTIGIGSAPNSYFMT 453 >UniRef50_C4WI90 Poly [ADP-ribose] polymerase 4 n=1 Tax=Ochrobactrum intermedium LMG 3301 RepID=C4WI90_9RHIZ Length = 777 Score = 55.5 bits (132), Expect = 4e-06, Method: Composition-based stats. Identities = 30/146 (20%), Positives = 55/146 (37%), Gaps = 10/146 (6%) Query: 322 FIVCVDTSGSMGGFNEQCAKAFCLALMRIALAENRRCYIMLFSTEIVR-YELS---GPQG 377 + +D SGSMGG + + AKA + +R ++ F + R +E+S Q Sbjct: 381 VVFVIDNSGSMGGTSIEQAKASLDYALSHLQPGDR-FNVIRFDDTLTRFFEVSVEASQQN 439 Query: 378 IEQAIRF-LSQQFRGGTDLASCFRAIMERLQSREWFDADAVVISDFIAQRLPDDVTSKVK 436 I A F +S + +GGT + A ++ V ++D + ++ Sbjct: 440 IASARHFVMSLEAQGGTAMLPALHAALDDSHQGNGLR-QIVFLTDGE---ISNEQQLLDA 495 Query: 437 ELQRVHQHRFHAVAMSAHGKPGIMRI 462 R + R V + +M Sbjct: 496 IAARRGRSRIFMVGIGTAPNSYLMNH 521 >UniRef50_C1YR26 Uncharacterized protein containing a von Willebrand factor type A (VWA) domain n=1 Tax=Nocardiopsis dassonvillei subsp. dassonvillei DSM 43111 RepID=C1YR26_NOCDA Length = 505 Score = 55.5 bits (132), Expect = 4e-06, Method: Composition-based stats. Identities = 29/144 (20%), Positives = 52/144 (36%), Gaps = 10/144 (6%) Query: 324 VCVDTSGSMGGFNEQCAKAFCLALMRIALAENRRCYIMLFST----EIVRYELSGPQGIE 379 V +D SGSMGG A L+L+ + ++ F+ E+ L + Sbjct: 44 VVLDRSGSMGGGRLDGAVRALLSLVERLAPSD-NFGLVSFNDQARVEVPCGPLEDKARVR 102 Query: 380 QAIRFLSQQFRGGTDLASCFRAIMERLQSREW-FDADAVVISDFIAQR--LPDDVTSKVK 436 + I L GGTDL+S ++ + ++ISD A + D+ +V Sbjct: 103 RLISGL--HASGGTDLSSGLLRGVQEARRAGADRGGTLLLISDGHANQGVTDHDLLRQVA 160 Query: 437 ELQRVHQHRFHAVAMSAHGKPGIM 460 H ++ ++ Sbjct: 161 ADAYAHGVTTTSLGYGLGYDEELL 184 >UniRef50_A3ZTC3 Putative uncharacterized protein n=1 Tax=Blastopirellula marina DSM 3645 RepID=A3ZTC3_9PLAN Length = 346 Score = 55.5 bits (132), Expect = 4e-06, Method: Composition-based stats. Identities = 28/146 (19%), Positives = 53/146 (36%), Gaps = 13/146 (8%) Query: 322 FIVCVDTSGSMGGFNEQCAKAFCLALMRIALAENRRCYIMLFSTEIVRYELSGPQGIEQA 381 F + D SGSM G K L + E + ++ F ++ V + G + ++ Sbjct: 192 FCIIADCSGSMSGVKLDYVKEEILETVSSLPREA-QFQVIFFQSQAVPFPQKGWRHPKRD 250 Query: 382 IRFLSQQFR-----GGTDLASCFRAIMERLQSREWFDADAV-VISDFIAQRLPDDVTSKV 435 LS+ + GGT+ F ++ DAV ++D + + Sbjct: 251 FNALSEWLKTVGPAGGTNPLPAFEIALKFSPRP-----DAVFFMTDGLFDDNVVGEVKRQ 305 Query: 436 KELQRVHQHRFHAVAMSAHGKPGIMR 461 +L + HA++ +MR Sbjct: 306 NDLSEPK-VKVHAISFMDRSAEPLMR 330 >UniRef50_Q9LN03 T6D22.13 n=2 Tax=Arabidopsis thaliana RepID=Q9LN03_ARATH Length = 641 Score = 55.5 bits (132), Expect = 4e-06, Method: Composition-based stats. Identities = 25/108 (23%), Positives = 48/108 (44%), Gaps = 7/108 (6%) Query: 321 PFIVCVDTSGSMGGFNEQCAKAFCLALMRIALAENRRCYIMLFST---EIVRYELSGPQG 377 I +D SGSM G + K ++ + L E R ++ FS+ + L G Sbjct: 204 DLITVLDVSGSMDGVKMELMKN-AMSFVIQNLGETDRLSVISFSSMARRLFPLRLMSETG 262 Query: 378 IEQAIRFL-SQQFRGGTDLASCFRAIMERLQSREWFD--ADAVVISDF 422 + A++ + S GGT++A + ++ R W + + +++SD Sbjct: 263 KQAAMQAVNSLVADGGTNIAEGLKIGARVIEGRRWKNPVSGMMLLSDG 310 >UniRef50_A2E1S5 von Willebrand factor type A domain containing protein n=2 Tax=Trichomonas vaginalis RepID=A2E1S5_TRIVA Length = 688 Score = 55.5 bits (132), Expect = 4e-06, Method: Composition-based stats. Identities = 30/140 (21%), Positives = 62/140 (44%), Gaps = 16/140 (11%) Query: 322 FIVCVDTSGSMGGFNEQCAKAFCLALMRIALAENRRCYIMLFSTEIVRYEL------SGP 375 F +D SGSM G Q AK CL + +L R I+ F ++ YE+ Sbjct: 236 FYFIIDCSGSMSGSCIQNAK-LCLNIFMHSLPIGCRFSIIKFGSD---YEVALHPCDYTD 291 Query: 376 QGIEQAIRFLSQ--QFRGGTDLASCFRAIMERLQSREWFDADAVVISDFIAQRLPDDVTS 433 + + +A++ L+ GGTD+ S + +ME + + +++D +++ + Sbjct: 292 ENVSEAMKQLNNIDAEMGGTDILSPLKYVMELTPKQGFIK-QVFLLTDGQ-DSNTNELCA 349 Query: 434 KVKELQRVHQHRFHAVAMSA 453 +E + +R ++ + + Sbjct: 350 LAQENR--TNNRIFSIGIGS 367 >UniRef50_B9KV79 Putative uncharacterized protein n=1 Tax=Rhodobacter sphaeroides KD131 RepID=B9KV79_RHOSK Length = 1043 Score = 55.5 bits (132), Expect = 5e-06, Method: Composition-based stats. Identities = 25/119 (21%), Positives = 41/119 (34%), Gaps = 11/119 (9%) Query: 321 PFIVCVDTSGSMGGFNEQCAKAFCLALMRI---ALAENR--RCYIMLF------STEIVR 369 + +D SGSM G KA AL+R ++ +R I+L+ S E Sbjct: 259 AIYITLDVSGSMSGTRMAAQKAGVAALIREIGASVDPDRPNDIRIVLWNAGLAGSIERRN 318 Query: 370 YELSGPQGIEQAIRFLSQQFRGGTDLASCFRAIMERLQSREWFDADAVVISDFIAQRLP 428 E +E + LS GGT+ + F + ++D + Sbjct: 319 MEPDDYTALEDWMLALSNSTSGGTNFNAAFAEASTFFAGGGSKRRIVIFVTDGEPSPVS 377 >UniRef50_B4VT64 Vault protein inter-alpha-trypsin n=9 Tax=Cyanobacteria RepID=B4VT64_9CYAN Length = 1037 Score = 55.1 bits (131), Expect = 6e-06, Method: Composition-based stats. Identities = 22/146 (15%), Positives = 62/146 (42%), Gaps = 8/146 (5%) Query: 321 PFIVCVDTSGSMGGFNEQCAKAFCLALMRIALAENRRCYIMLFS---TEIVRYELSG-PQ 376 + VDTSGS G +K ++ ++ I+ F+ T++ L+ PQ Sbjct: 666 DVVFLVDTSGSQSGSPIVQSKELMRQFIQGLNPQD-TFTIIDFANSTTQLSDKPLANTPQ 724 Query: 377 GIEQAIRFL-SQQFRGGTDLASCFRAIMERLQSREWFDADAVVISDFIAQRLPDDVTSKV 435 ++A+ ++ GGT+L + ++ + V+++D + + + +++ Sbjct: 725 NRKKALNYINRLDANGGTELMNGIDTVLNFPAAPAGRLRSVVLLTDGLIGD-DEQIIAEI 783 Query: 436 KELQRVHQHRFHAVAMSAHGKPGIMR 461 ++ + +R ++ + + ++ Sbjct: 784 RDRLKP-GNRLYSFGVGSSTNRFLIE 808 >UniRef50_UPI00016C09D7 hypothetical protein Epulo_01596 n=1 Tax=Epulopiscium sp. 'N.t. morphotype B' RepID=UPI00016C09D7 Length = 262 Score = 55.1 bits (131), Expect = 6e-06, Method: Composition-based stats. Identities = 25/131 (19%), Positives = 50/131 (38%), Gaps = 17/131 (12%) Query: 326 VDTSGSMGGFNEQ-CAKAF---CLALMRIALAE---NRRCYIMLFSTEIVRYELSGPQGI 378 +DTSGSM G A +AL +A + ++ FST +GP+ + Sbjct: 21 LDTSGSMTGVPIAALNTAMEECTVALKDLAKKNADAKLKIAVLEFSTGAKWVTYNGPESL 80 Query: 379 EQAIRFLSQQFRGGTDLASCFRAIMERLQSREWFDA-------DAVVISDFIAQRLPDDV 431 + + G TD+ + R + +L + + + ++D D+ Sbjct: 81 DDEFEWEHLSAGGVTDIGAALRELDIKLSRNGFLKSMTGALMPVIIFMTDGY---PTDEY 137 Query: 432 TSKVKELQRVH 442 + + EL++ Sbjct: 138 AAALAELRKNR 148 >UniRef50_A9V370 Predicted protein n=1 Tax=Monosiga brevicollis RepID=A9V370_MONBE Length = 471 Score = 55.1 bits (131), Expect = 6e-06, Method: Composition-based stats. Identities = 27/161 (16%), Positives = 58/161 (36%), Gaps = 10/161 (6%) Query: 288 KQLLTYRLHGESWREKVIERPVVHKDYDEQPRGPF--IVCVDTSGSMGGFNEQCAKAFCL 345 + L Y G I + D++ R + +D SGSM G + ++ L Sbjct: 24 RPLWQYAEIGARESSAYISCRLTAPDFEPVERPAIDLVAVIDVSGSMAGQKLKMVQS-TL 82 Query: 346 ALMRIALAENRRCYIMLFSTEIVR-YELSGPQGIEQ---AIRFLSQQFRGGTDLASCFRA 401 + L + R ++ F +++ ++L + + T+L+ Sbjct: 83 EFLMRNLKDTDRFALVTFDSDVKTVFDLRPMTTAHKEACLADVQKLRAGSCTNLSGGLFR 142 Query: 402 IMERLQSREWFD---ADAVVISDFIAQRLPDDVTSKVKELQ 439 +E +Q R + ++++D IA D + L+ Sbjct: 143 GVELMQQRGATKGAVSSILLMTDGIANEGVRDKDDMCRALR 183 >UniRef50_D2VDM1 Predicted protein n=1 Tax=Naegleria gruberi RepID=D2VDM1_NAEGR Length = 754 Score = 55.1 bits (131), Expect = 7e-06, Method: Composition-based stats. Identities = 22/127 (17%), Positives = 44/127 (34%), Gaps = 2/127 (1%) Query: 322 FIVCVDTSGSMGGFNEQCAKAFCLALMRIALAENRRCYIMLFSTEIVRYELSGPQGIEQA 381 ++ +D SGSM G AK L + ++ + T Y+L + Sbjct: 41 IVIALDVSGSMRGQGIDQAKIAISNLFEQVVDIP-DVVLIAYDTSAELYDLRKKPAETRQ 99 Query: 382 IRFLSQQFRGGTDLASCFRAIMERLQSREWFDADAVVISDFIAQRLPDDVTSKVKELQRV 441 Q GGTD F AI + + + +D ++++++V Sbjct: 100 STLEQIQAGGGTDFTCVFEAISKLDMFNSQSEVAILFFTDGQ-DGSSHKREKAIEQMKKV 158 Query: 442 HQHRFHA 448 + + + Sbjct: 159 LETKTQS 165 >UniRef50_C6WL97 VWA containing CoxE family protein n=1 Tax=Actinosynnema mirum DSM 43827 RepID=C6WL97_ACTMD Length = 1295 Score = 54.7 bits (130), Expect = 7e-06, Method: Composition-based stats. Identities = 36/150 (24%), Positives = 57/150 (38%), Gaps = 9/150 (6%) Query: 289 QLLTYRLHGESWREKVIERPVVHKDYDEQPRGPFIVCVDTSGSMGGFNEQCAKAFCLALM 348 L T R + ERPV + I+ VD SGSM + A L Sbjct: 1101 NLATARRDEHGKVVVIPERPVFRSRGRKANDWRLILVVDVSGSME-ASTVWA---ALTAS 1156 Query: 349 RIALAENRRCYIMLFSTEIVRYELSGPQGIEQAIRFLSQQFRGGTDLASCFRAIMERLQS 408 A + + + FST++V + + L + GGT +A R + + Sbjct: 1157 VFAGVRSLTTHFLAFSTQVVDL---SERVADPLSLLLEVKVGGGTHIAGALRHARDLVTV 1213 Query: 409 REWFDADAVVISDFIAQRLPDDVTSKVKEL 438 E V++SDF +T++V+EL Sbjct: 1214 PERTM--VVLVSDFEEGGPVASLTAQVREL 1241 >UniRef50_A9WI94 von Willebrand factor type A n=2 Tax=Chloroflexus RepID=A9WI94_CHLAA Length = 845 Score = 54.7 bits (130), Expect = 7e-06, Method: Composition-based stats. Identities = 26/182 (14%), Positives = 57/182 (31%), Gaps = 12/182 (6%) Query: 291 LTYRLHGESWREKVIERPVVHKDYDEQPRGPFIVC--VDTSGSMGG----FNEQCAKAFC 344 ++ L G + P++ R P + +D S SM AK Sbjct: 364 QSFTLGGYAETPLADALPLLMTPPPRPQRAPVSILFIIDRSASMSATFGISKFDMAKEAA 423 Query: 345 LALMRIALAENRRCYIMLFSTEIV-RYELSGPQGIEQAIR----FLSQQFRGGTDLASCF 399 + + +R ++ F TE + + + GGT++ Sbjct: 424 ILSLTTLQPGDR-VGVLAFDTETIWTVPFRTVGEGVSLVELQDQIATMSLGGGTNIERAL 482 Query: 400 RAIMERLQSREWFDADAVVISDFIAQRLPDDVTSKVKELQRVHQHRFHAVAMSAHGKPGI 459 + L + + AV+++D + ++ E R Q +A+ + + Sbjct: 483 SVGLPALANEPYSTRHAVLLTDGRSYSNNYPRYQQLVETARAAQITLSTIAIGSDSDTEL 542 Query: 460 MR 461 + Sbjct: 543 LN 544 >UniRef50_Q24FW2 von Willebrand factor type A domain containing protein n=1 Tax=Tetrahymena thermophila SB210 RepID=Q24FW2_TETTH Length = 1074 Score = 54.7 bits (130), Expect = 7e-06, Method: Composition-based stats. Identities = 28/146 (19%), Positives = 60/146 (41%), Gaps = 6/146 (4%) Query: 321 PFIVCVDTSGSMGGFNEQCAKAFCLALMRIALAENRRCYIMLFSTEIVRYELS--GPQGI 378 I +D SGSM G K L L+ ++R ++LF++E+ + Sbjct: 365 DLICVMDNSGSMHGEKINMLKETLLYLIDQLDEKDR-LGLVLFNSEVTFRPMKSMDTTNK 423 Query: 379 EQAIRFLS-QQFRGGTDLASCFRAIMERLQSREWFD--ADAVVISDFIAQRLPDDVTSKV 435 + +++S + +GGTD+ + +++R++ + ++SD + + D V + Sbjct: 424 LKLKQYISDIRAQGGTDINLGMTEAFKFIKTRKYCNPVTSVFLLSDGLDSKAQDRVAVTL 483 Query: 436 KELQRVHQHRFHAVAMSAHGKPGIMR 461 K + Q + P +M Sbjct: 484 KNMSINEQFSINCFGFGRDHDPILMN 509 >UniRef50_Q21PJ3 von Willebrand factor, type A n=1 Tax=Saccharophagus degradans 2-40 RepID=Q21PJ3_SACD2 Length = 763 Score = 54.7 bits (130), Expect = 7e-06, Method: Composition-based stats. Identities = 34/182 (18%), Positives = 63/182 (34%), Gaps = 19/182 (10%) Query: 295 LHGESWREKVIERPVVHKDYDEQPRGPFIVCVDTSGSMGGFNEQCAKAFCLALMRIALAE 354 L GE + ++ P + + + + VDTSGSM G + Q AK L L Sbjct: 361 LAGEDYLLLMLLPPQGQQQHTQSLSRDIVFVVDTSGSMQGTSIQQAKR-SLQFALRGLNP 419 Query: 355 NRRCYIMLFSTEIVRYELSGPQGIEQAIRFL-----SQQFRGGTDLASCFRAI---MERL 406 + I+ F T R+ ++ + GT++ + + + Sbjct: 420 SDTFNIIEFDTSFSRFRSRPVSATASNVQAAVSWVNNLNADNGTEMYAALEEAFDQLASI 479 Query: 407 QSREWFDA-------DAVVISDFIAQRLPDDVTSKVKELQRVHQHRFHAVAMSAHGKPGI 459 ++ V I+D A + S + +R++ R VA+ + Sbjct: 480 NPNGTENSKSSNNLQQVVFITDG-AVGNEQALLSLIH--RRLNNARLFTVAIGSAPNSYF 536 Query: 460 MR 461 MR Sbjct: 537 MR 538 >UniRef50_Q86UX2 Inter-alpha-trypsin inhibitor heavy chain H5 n=40 Tax=Euteleostomi RepID=ITIH5_HUMAN Length = 942 Score = 54.7 bits (130), Expect = 7e-06, Method: Composition-based stats. Identities = 20/155 (12%), Positives = 51/155 (32%), Gaps = 17/155 (10%) Query: 322 FIVCVDTSGSMGGFNEQCAKAFCLALMRIALAENRRCYIMLFSTEIVRYE-----LSGPQ 376 + +D+S SM G + K ++ ++R I+ FS I ++ ++ Sbjct: 296 VVFVLDSSASMVGTKLRQTKDALFTILHDLRPQDR-FSIIGFSNRIKVWKDHLISVTPDS 354 Query: 377 GIEQAIRFLSQQFRGGTDLASCFRAIMERLQS-------REWFDADAVVISDF---IAQR 426 + + GGTD+ + + L + + V ++D + + Sbjct: 355 IRDGKVYIHHMSPTGGTDINGALQRAIRLLNKYVAHSGIGDRSVSLIVFLTDGKPTVGET 414 Query: 427 LPDDVTSKVKELQRVHQHRFHAVAMSAHGKPGIMR 461 + + +E R + + ++ Sbjct: 415 HTLKILNNTREAARGQVC-IFTIGIGNDVDFRLLE 448 >UniRef50_A6DST2 Putative uncharacterized protein n=1 Tax=Lentisphaera araneosa HTCC2155 RepID=A6DST2_9BACT Length = 307 Score = 54.7 bits (130), Expect = 7e-06, Method: Composition-based stats. Identities = 35/152 (23%), Positives = 62/152 (40%), Gaps = 18/152 (11%) Query: 324 VCVDTSGSMG---GFNEQCAKAFCLALMRIALAENRRCYIMLFSTE-IVRYELSGPQGIE 379 +C+D+SGSM G + A E + +F TE I ++ Sbjct: 92 ICLDSSGSMRADFGGKNRYEVAMQAVKEFTEYREGDAFGLTVFGTEYINWVPVTKDTSAI 151 Query: 380 QAIR-FL-----SQQFRGGTDLASCFRAIMERLQSREWFDADAVVISDFIAQRLPDDVTS 433 FL S+ F GGT++A R ++L +E D +++SD ++ P+D Sbjct: 152 ALATPFLAPDRMSKWF-GGTNIAKALRGSQQQLLQQEDGDRMIILVSDGVSG-SPNDTVD 209 Query: 434 KVKELQRVHQHRFHAVAM---SAHGKPGIMRI 462 +EL ++ A + S +G P + + Sbjct: 210 MAQEL---RNNKIVAYCIYIGSGNGSPEMNAL 238 >UniRef50_C0Z8R3 Hypothetical membrane protein n=1 Tax=Brevibacillus brevis NBRC 100599 RepID=C0Z8R3_BREBN Length = 424 Score = 54.7 bits (130), Expect = 8e-06, Method: Composition-based stats. Identities = 31/146 (21%), Positives = 63/146 (43%), Gaps = 16/146 (10%) Query: 322 FIVCVDTSGSMGGF-NE-QCAKAFCLALMRIALAENRRCYIMLFSTEIVRY----ELSGP 375 ++ +DTSGSM + Q KA +++ ++ ++ F + ELS Sbjct: 116 IVMVLDTSGSMQSSDPDNQLFKAAA-DMVQRMDSDM-NIAVVTFHDQTNVLQPLTELSSQ 173 Query: 376 QGIEQAIRFLSQQ--FRGGTDLASCFRAIMERLQSREWFDADAVVISDFIAQ-RLPDDVT 432 ++ ++ L Q GGT + +A +++LQ+ + ++ V++SD + +P + Sbjct: 174 SVKDEVVKKLLQFPRTDGGTRIDLALQAGLDQLQANQMANSTVVLMSDGYSDLDVPAALA 233 Query: 433 SKVKELQRVHQHRFHAVAMSAHGKPG 458 + +Q H V MS G Sbjct: 234 PY-----KQNQVIVHTVGMSQIDADG 254 >UniRef50_A4C730 Putative uncharacterized protein n=1 Tax=Pseudoalteromonas tunicata D2 RepID=A4C730_9GAMM Length = 684 Score = 54.7 bits (130), Expect = 8e-06, Method: Composition-based stats. Identities = 26/146 (17%), Positives = 56/146 (38%), Gaps = 12/146 (8%) Query: 322 FIVCVDTSGSMGGFNEQCAKAFCLALMRIALAENRRCYIMLFSTEIVRYEL----SGPQG 377 I +DTSGSM G + + AK+ L L I+ F++++ + Sbjct: 332 VIFVIDTSGSMHGESLEQAKS-ALFFALANLDPQDSFNIIEFNSKVNALNAQALPANDFN 390 Query: 378 IEQAIRFL-SQQFRGGTDLASCFRAIMERLQSREWFDADAVVISDFIAQRLPDDVTSKVK 436 I +A F+ + GGT++ F +++ + ++ V ++D + T Sbjct: 391 IRRARNFVYGLKADGGTEIGLAFEQVLDNSEHADYLR-QIVFLTDG----SISNETEVFA 445 Query: 437 ELQRVHQ-HRFHAVAMSAHGKPGIMR 461 +++ R + + + M Sbjct: 446 QIKGSLGDSRIFTIGIGSAPNSYFMT 471 >UniRef50_UPI0000E80A5E PREDICTED: similar to calcium-activated chloride channel n=2 Tax=Gallus gallus RepID=UPI0000E80A5E Length = 928 Score = 54.7 bits (130), Expect = 9e-06, Method: Composition-based stats. Identities = 27/168 (16%), Positives = 64/168 (38%), Gaps = 19/168 (11%) Query: 324 VCVDTSGSMGGFNEQ--CAKAFCLALMRIALAENRRCYIMLFSTEIVR----YELSGPQG 377 + +D SGSM N A + L++I +R I+ F + +++ Sbjct: 310 LVLDVSGSMNTNNRITNLRTAAEVFLIQIIEIGSR-VGIVTFESSAYEKSPLLQITSVAT 368 Query: 378 IEQAIRFLSQQFRGGTDLASCFRAIMERLQSR--EWFDADAVVISDFIAQRLPDDVTSKV 435 ++ ++ L GGT + + +E + + + ++ V+++D D S Sbjct: 369 RQRLVQNLPTTAGGGTKICAGIEKGLEIITNAIGTTYGSEIVLLTDGE-----DSTMSLC 423 Query: 436 KELQRVHQHRFHAVAMSAHGKPGIMRIFD-----HIWRFDTGMRSRLL 478 +E + H +A+ + + ++ D + S+L+ Sbjct: 424 REKVKESGAIIHTIALGPSAAKELEEFSNITGGLQLYAVDVDVPSKLV 471 >UniRef50_Q67LZ3 Putative uncharacterized protein n=1 Tax=Symbiobacterium thermophilum RepID=Q67LZ3_SYMTH Length = 414 Score = 54.7 bits (130), Expect = 9e-06, Method: Composition-based stats. Identities = 27/141 (19%), Positives = 52/141 (36%), Gaps = 7/141 (4%) Query: 326 VDTSGSMGGFNEQCAKAFCLALMRIALAENRRCYIMLFSTEIVRYELSGPQGIEQAIRFL 385 VD SGSM G K L+ E+R I+ + ++ S P + A+R L Sbjct: 49 VDRSGSMAGAALYFTKQALRFLVDQMAEEDR-LAIVTYDDQVHVPFPSQPVVQKDAVRLL 107 Query: 386 --SQQFRGGTDLASCFRAIMERL--QSREWFDADAVVISDFIA--QRLPDDVTSKVKELQ 439 G T+L+ M+++ + + ++++D +A DV + Sbjct: 108 VDGITAGGTTNLSGGLATGMQQIRPHAGPGRVSRVLLMTDGLANVGVTDPDVLAGWARAW 167 Query: 440 RVHQHRFHAVAMSAHGKPGIM 460 R + + H ++ Sbjct: 168 REKGLAVSTMGVGPHFSEDLL 188 >UniRef50_C5FHM8 von Willebrand factor type A domain-containing protein n=1 Tax=Microsporum canis CBS 113480 RepID=C5FHM8_NANOT Length = 1002 Score = 54.3 bits (129), Expect = 9e-06, Method: Composition-based stats. Identities = 30/165 (18%), Positives = 55/165 (33%), Gaps = 12/165 (7%) Query: 296 HGESWREKVIERPVVHKDYDEQPRGPFIVCVDTSGSMGGFNEQCAKAFCLALMRIALAEN 355 G S I + + + I VD SGSM G + L +L + Sbjct: 299 EGHSAIMIEIPPDFMLESQEPVDDKEIIFLVDRSGSMAGKIHGLISSMQFYL--RSLPMS 356 Query: 356 RRCYIMLFSTEIV----RYELSGPQGIEQAIRFLSQQFR--GGTDLASCFRAIMERLQSR 409 I F + + + +A+ ++S GGTDL + + Sbjct: 357 TLFNICSFGSSYQLLWEQSRAYSEITLNEALYYVSSFSSNLGGTDLLPALEH---VVLQQ 413 Query: 410 EWFDADAVVISDFIAQRLPDDV-TSKVKELQRVHQHRFHAVAMSA 453 D +V++D RL + + ++ + RF A+ + Sbjct: 414 NHSSKDIIVLTDGEVWRLEETIRFVRLTHIVSKKAIRFFALGIGN 458 >UniRef50_P76396 Uncharacterized protein yegL n=64 Tax=cellular organisms RepID=YEGL_ECOLI Length = 219 Score = 54.3 bits (129), Expect = 9e-06, Method: Composition-based stats. Identities = 31/176 (17%), Positives = 58/176 (32%), Gaps = 18/176 (10%) Query: 302 EKVIERPVVHKDYDEQPRGPFIVCVDTSGSMGGFN--EQCAKAFCL--ALMRIALAENR- 356 + I + +PR P I+ +D SGSM G E A L+ LA R Sbjct: 2 SEQITFATSDFASNPEPRCPCILLLDVSGSMNGRPINELNAGLVTFRDELLADPLALKRV 61 Query: 357 RCYIMLFSTEIVRYELSGPQGIEQAIRFLSQQFRGGTDLASCFRAIMERLQSRE------ 410 I+ F V + I F G T + + ++ ++ R+ Sbjct: 62 ELGIVTFGPVHVEQPFTSAANFFPPILFAQ----GDTPMGAAITKALDMVEERKREYRAN 117 Query: 411 ---WFDADAVVISDFIAQRLPDDVTSKVKELQRVHQHRFHAVAMSAHGKPGIMRIF 463 ++ +I+D +KV + + F ++ + + +I Sbjct: 118 GISYYRPWIFLITDGAPTDEWQAAANKVFRGEEDKRFAFFSIGVQGADMKTLAQIS 173 >UniRef50_A2BMB7 Putative uncharacterized protein n=1 Tax=Hyperthermus butylicus DSM 5456 RepID=A2BMB7_HYPBU Length = 662 Score = 54.3 bits (129), Expect = 9e-06, Method: Composition-based stats. Identities = 36/163 (22%), Positives = 56/163 (34%), Gaps = 13/163 (7%) Query: 322 FIVCVDTSGSMGGFNEQCAKAFCLALMRIALAENRRCYIMLFSTEIVRYELSGPQGIEQA 381 ++V +D SGSM + + LA +A A R ++LF +E+ + + + Sbjct: 508 YVVVLDKSGSMR----EYSLTALLASASLAPAITR---LVLFDSEVRVIDRLEQASVPKI 560 Query: 382 IRFL-SQQFRGGTDLASCFRAIMERLQSREWFDADAVVISDFIAQRLPDDVTSKVKELQR 440 I L F G TD+ L + VVISD +V R Sbjct: 561 ISLLFKTSFEGYTDVVRALEESTRGLAPSK-----LVVISDLHQTVPSRKSVDEVLRGMR 615 Query: 441 VHQHRFHAVAMSAHGKPGIMRIFDHIWRFDTGMRSRLLRRWRR 483 R VA RI + + + + R RR Sbjct: 616 TSGWRIAVVAPPTLDPRLARRITGIVKLYTVSKPADVARVIRR 658 >UniRef50_Q3IHK0 Putative uncharacterized protein n=2 Tax=Alteromonadales RepID=Q3IHK0_PSEHT Length = 664 Score = 54.3 bits (129), Expect = 9e-06, Method: Composition-based stats. Identities = 41/210 (19%), Positives = 76/210 (36%), Gaps = 22/210 (10%) Query: 259 DDILRLLPPELATLGITELEYEFYRRLVEKQLLTYRLHGESWREKVIERPVVHKDYDEQP 318 + + R E L + + F+ E GE + ++ P + ++ Sbjct: 269 NALNRDFVLEFKPLQKEQAQAAFFTEQFEN--------GERYGLAMLMPPADNFIATQRL 320 Query: 319 RGPFIVCVDTSGSMGGFNEQCAK-AFCLALMRIALAENRRCYIMLFSTEIVRYE----LS 373 + VDTSGSM G + + AK A AL L N I+ F + ++ Sbjct: 321 ARETVFVVDTSGSMHGQSMEQAKNALFYALS--LLDSNDSFNIIGFDNVVTLMSDKPLVA 378 Query: 374 GPQGIEQAIRFL-SQQFRGGTDLASCFRAIMERLQSREWFDADAVVISDFIAQRLPDDVT 432 + +A RF+ Q GGT++ A+++ Q + + ++D + Sbjct: 379 SGFNLRRAERFIYGLQADGGTEIQGALDAVLDGSQFDGFVR-QVIFLTDG----SVSNED 433 Query: 433 SKVKELQRVHQ-HRFHAVAMSAHGKPGIMR 461 + K +Q R V + + MR Sbjct: 434 ALFKSIQAKLGDSRLFTVGIGSAPNSFFMR 463 >UniRef50_Q986Q0 Mlr7258 protein n=2 Tax=Alphaproteobacteria RepID=Q986Q0_RHILO Length = 415 Score = 54.3 bits (129), Expect = 9e-06, Method: Composition-based stats. Identities = 36/158 (22%), Positives = 61/158 (38%), Gaps = 23/158 (14%) Query: 322 FIVCVDTSGSMGGFNEQCAKAFCLALMRIALAENRRCYIMLFSTEIV--RYELSGPQGIE 379 ++CVD SGSM A A + +L R ++ F T IV ELS P + Sbjct: 228 VVLCVDQSGSMASSVIY---ASIFAAVMASLPVVR-TKLVCFDTAIVDLTEELSDPVEV- 282 Query: 380 QAIRFLSQQFRGGTDLASCFRAIMERLQSREWFDADAVVISDFIAQRLPDDVTSKVKELQ 439 Q GGTD+ +R++ + V+I+D ++ ++ L Sbjct: 283 ----LFGVQLGGGTDINQAVAYCADRIERPT--KSHMVLITDLYEGGNGQELLQRLASLV 336 Query: 440 RVHQHRFHAVAMSAHGKPGIMRIFDHIWRFDTGMRSRL 477 R + +A++ G+PG +D M + Sbjct: 337 RSGVNVVVLLALTDQGRPG----------YDPKMAGSV 364 >UniRef50_A8FW78 Vault protein inter-alpha-trypsin domain protein n=1 Tax=Shewanella sediminis HAW-EB3 RepID=A8FW78_SHESH Length = 770 Score = 54.3 bits (129), Expect = 1e-05, Method: Composition-based stats. Identities = 31/162 (19%), Positives = 60/162 (37%), Gaps = 28/162 (17%) Query: 322 FIVCVDTSGSMGGFNEQCA-KAFCLALMRIALAENRRCYIMLFSTEIVRYEL----SGPQ 376 I+ +DTSGSM G + A KA AL + + ++ F++++ + + Sbjct: 377 LILVIDTSGSMSGSAMEQAKKAMKYALAGLGSDD--TFNVIEFNSKVSSLSKGPIPASTK 434 Query: 377 GIEQAIRFL-SQQFRGGTDLASCFRAIM------ERLQSREWFDAD---------AVVIS 420 IE A RF+ S GGT++A + Q D + ++ Sbjct: 435 NIEMANRFVHSLTSDGGTEMALALEHALGQESGGSSWQETGLQGKDEESTSRLRQVLFMT 494 Query: 421 DFIAQRLPDDVTSKVKELQ-RVHQHRFHAVAMSAHGKPGIMR 461 D + K ++ R+ + R + + + M+ Sbjct: 495 DGAVG----NEAELFKLIKYRIGKSRLFTLGIGSAPNSHFMQ 532 >UniRef50_Q1INP4 von Willebrand factor, type A n=1 Tax=Candidatus Koribacter versatilis Ellin345 RepID=Q1INP4_ACIBL Length = 430 Score = 54.3 bits (129), Expect = 1e-05, Method: Composition-based stats. Identities = 18/130 (13%), Positives = 53/130 (40%), Gaps = 5/130 (3%) Query: 324 VCVDTSGSMGGFNEQCAKAFCLALMRIALAENRRCYIMLFSTEIVRYELSGPQGIEQAIR 383 V +D SGSM A + L++ + E+ +++ F+ + + + + Sbjct: 193 VVIDNSGSMRDKRPAV-NAATINLVKASNPED-EVFVVNFNDD-YYLDQDYTDSVAKLKE 249 Query: 384 FLS-QQFRGGTDL-ASCFRAIMERLQSREWFDADAVVISDFIAQRLPDDVTSKVKELQRV 441 L + RGGT L + + +++ + +++D + + ++++Q+ Sbjct: 250 ALEKYETRGGTALYDAVLASNAHLMKAPKLEKKVLFIVTDGEDDASLNTLEQTIRKVQQE 309 Query: 442 HQHRFHAVAM 451 + + + + Sbjct: 310 NGPTIYTIGI 319 >UniRef50_UPI0001744662 Vault protein inter-alpha-trypsin domain protein n=1 Tax=Verrucomicrobium spinosum DSM 4136 RepID=UPI0001744662 Length = 679 Score = 54.3 bits (129), Expect = 1e-05, Method: Composition-based stats. Identities = 24/152 (15%), Positives = 53/152 (34%), Gaps = 10/152 (6%) Query: 307 RPVVHKDYDEQPRGPFIVCVDTSGSMGGFNEQCAKAFCLALMRIALAENRRCYIMLFSTE 366 +P + + P ++ +D SGSM GF + +K L++ + I+ F+++ Sbjct: 303 QPPAKWEAGQTPPRDYLFVLDVSGSMNGFPIETSKRLMSDLLKGLNPGD-TFNILHFASD 361 Query: 367 IVRYELSGPQGIEQAIRFLSQQF-----RGGTDLASCFRAIMERLQSREWFDADAVVISD 421 + I ++ GGT+L + + + + V+++D Sbjct: 362 SAVLSPKPLAATPENIHLATKDLSRHRGNGGTELLPALQRALATPREVGVSRS-IVILTD 420 Query: 422 FIAQRLPDDVTSKVKELQRVHQHRFHAVAMSA 453 + KELQ + + Sbjct: 421 GYVTIEKEAFRLVRKELQNAN---VFTFGIGT 449 >UniRef50_D2V048 Predicted protein n=2 Tax=Naegleria gruberi RepID=D2V048_NAEGR Length = 1065 Score = 54.3 bits (129), Expect = 1e-05, Method: Composition-based stats. Identities = 29/171 (16%), Positives = 56/171 (32%), Gaps = 9/171 (5%) Query: 300 WREKVIERPVVHKDYDEQPRGPF-IVCVDTSGSMGGFNEQCAKAFCLALMRIALAENRRC 358 + I + + + +G I+ +D SGSM G AK L+ N R Sbjct: 63 RIQLAIRDEFWQQQHTSKNQGKLLIIALDKSGSMAGSGISEAKLALETLLSNVEGCNERI 122 Query: 359 YIMLFSTEIVRYELSGPQGIEQAIRFLSQQFRGGTDLASCFRAIMERLQSREWFDADAVV 418 ++F + +++ + + GGTD +S F+ I+ + Sbjct: 123 LFIVFDSNSELIDMTNMELENKLQVVKKVSAGGGTDFSSVFK-IIRNYGGSLNGQVAIIF 181 Query: 419 ISDFIAQRLPDDVTSKVKELQRVHQH------RFHAVAM-SAHGKPGIMRI 462 +D Q + + + + FH + S H + I Sbjct: 182 FTDGQDQYSSNSTREGSIKSLQERLNTESESYEFHTIGFTSVHDARLLTDI 232 >UniRef50_Q1YZ74 Inter-alpha-trypsin inhibitor domain protein n=1 Tax=Photobacterium profundum 3TCK RepID=Q1YZ74_PHOPR Length = 714 Score = 53.9 bits (128), Expect = 1e-05, Method: Composition-based stats. Identities = 32/147 (21%), Positives = 60/147 (40%), Gaps = 17/147 (11%) Query: 326 VDTSGSMGGFNEQCAKAFCLALMRIALAENRRCYIMLFSTEIVRY-----ELSGPQGIEQ 380 +D SGSM G + + AK ++ E+ I+ F+ E + Y ++ I + Sbjct: 339 LDISGSMYGESIEQAKQALRYGLQQLQPED-SFNIVTFNHEAMLYSEQLLPVTSST-ITR 396 Query: 381 AIRFL-SQQFRGGTDLASCFRAIMER-----LQSREWFDADAVVISDFIAQRLPDDVTSK 434 A+RF+ GGT++A+ +A L S W + V I+D + + Sbjct: 397 ALRFVDGLDADGGTEMAAALKAAFSIKTHDQLNSTRWLN-QIVFITDG-SVGNESALFDL 454 Query: 435 VKELQRVHQHRFHAVAMSAHGKPGIMR 461 ++ Q++ R V + + M Sbjct: 455 IE--QQLVDRRLFTVGIGSAPNSYFMT 479 >UniRef50_Q10RY0 Os03g0142500 protein n=2 Tax=Oryza sativa RepID=Q10RY0_ORYSJ Length = 694 Score = 53.9 bits (128), Expect = 1e-05, Method: Composition-based stats. Identities = 25/108 (23%), Positives = 49/108 (45%), Gaps = 7/108 (6%) Query: 321 PFIVCVDTSGSMGGFNEQCAKAFCLALMRIALAENRRCYIMLFSTEIVR-YELSGP--QG 377 + +D SGSM G K ++ + L N R ++ FS+ R + L G Sbjct: 247 DLVTVLDVSGSMSGIKLSLLKR-AMSFVIQTLGPNDRLSVVAFSSTAQRLFPLRRMTLTG 305 Query: 378 IEQAIRFLSQ-QFRGGTDLASCFRAIMERLQSREWFD--ADAVVISDF 422 +QA++ +S GGT++A + + ++ R + + +++SD Sbjct: 306 RQQALQAISSLVASGGTNIADALKKGAKVVKDRRRKNPVSSIILLSDG 353 >UniRef50_A5UYN7 von Willebrand factor, type A n=1 Tax=Roseiflexus sp. RS-1 RepID=A5UYN7_ROSS1 Length = 459 Score = 53.9 bits (128), Expect = 1e-05, Method: Composition-based stats. Identities = 30/163 (18%), Positives = 53/163 (32%), Gaps = 13/163 (7%) Query: 308 PVVHKDYDEQPRGPF--IVCVDTSGSMGGFNEQCAKAFCLALMRIALAENRRCYIMLFST 365 P H EQ R P + +D SGSM G AK L L + ++ FS Sbjct: 77 PPEHALPREQHRPPLHLVAVLDVSGSMSGTKLASAKE-ALRQALHFLQDGDVFSLVTFSD 135 Query: 366 EIVRYELSGP------QGIEQAIRFLSQQFRGGTDLASCFRAIMERLQSREWFDADAVVI 419 ++ + + +E + + G T L ++ Q + +++ Sbjct: 136 QVQTHLKAESYAQRKRDKMENLLD--EIRASGMTALDGGLAQGIDLGQKKRQATTLVLLL 193 Query: 420 SDFIAQRLPDDVTSKVKELQRVHQHR--FHAVAMSAHGKPGIM 460 SD A D+ Q+ Q + + +M Sbjct: 194 SDGQANVGETDLEKIGLRAQKARQSGLIVSTLGVGLDYNEALM 236 >UniRef50_A6G857 Putative uncharacterized protein n=1 Tax=Plesiocystis pacifica SIR-1 RepID=A6G857_9DELT Length = 540 Score = 53.9 bits (128), Expect = 1e-05, Method: Composition-based stats. Identities = 27/144 (18%), Positives = 49/144 (34%), Gaps = 8/144 (5%) Query: 324 VCVDTSGSMGGFNEQCAKAFCLALMRIALAENRRCYIMLFSTEIVRY-ELSGPQGIEQAI 382 + VD S SM G + L + E+R ++ F E E + +E A Sbjct: 162 IAVDLSKSMEGEPIDRVRQGLLQMREQLEPEDR-VTLVGFGDEAQVIVENADKDSVELAT 220 Query: 383 RFLSQQFRGGTDLASCFRAIMERLQ---SREWFDADAVVISDFI--AQRLPDDVTSKVKE 437 + G T+L + R E+ W + +++SD + + D + E Sbjct: 221 AIAALVPWGSTNLYAGLRTAFEQTDLYAQEGWQNR-VLLVSDGVPTTGIVNSDKIEGLAE 279 Query: 438 LQRVHQHRFHAVAMSAHGKPGIMR 461 + V + +MR Sbjct: 280 AWSGMGYGLTTVGIGNDFDIELMR 303 >UniRef50_A9U149 Predicted protein n=1 Tax=Physcomitrella patens subsp. patens RepID=A9U149_PHYPA Length = 1185 Score = 53.9 bits (128), Expect = 1e-05, Method: Composition-based stats. Identities = 30/151 (19%), Positives = 57/151 (37%), Gaps = 15/151 (9%) Query: 322 FIVCVDTSGSMGGFNEQCA-KAFCLALMRIALAENRRCYIMLFSTEIVR-YELSGPQGIE 379 I VD SGSM G + A +A L L I ++ I+ F + S P E Sbjct: 467 LIFVVDRSGSMQGTPIKQAGQALELFLRSIPCEDH-YFNIIGFGDNHKTLFPKSTPYNEE 525 Query: 380 QAIRFLSQQFR-----GGTDLASCFRAIMERLQSREWFDADAVVISDFIAQRLPDDVTSK 434 + L GGT++ S F I E R +++D + D + Sbjct: 526 TLTKGLRYAQALEADMGGTEMMSAFEEIFE--HRRRDVPTQIFLLTDGEIWDV-DSLIEC 582 Query: 435 VKELQRVHQ----HRFHAVAMSAHGKPGIMR 461 +++ ++ + R ++ + ++ ++ Sbjct: 583 IRDAKKEEKSDNFVRVFSLGIGSNVSHHLVE 613 >UniRef50_UPI00016C3857 hypothetical protein GobsU_16534 n=1 Tax=Gemmata obscuriglobus UQM 2246 RepID=UPI00016C3857 Length = 402 Score = 53.9 bits (128), Expect = 1e-05, Method: Composition-based stats. Identities = 40/146 (27%), Positives = 62/146 (42%), Gaps = 15/146 (10%) Query: 295 LHGESWREKVIERPVVHKDYDEQPRGP--FIVCVDTSGSMGGFNEQCAKAFCLALMRIAL 352 L S + + V+ E+ + P IV VD SGSM A A+M Sbjct: 212 LKNYSAEKGKLLVDQVYFYAAERKKKPWHVIVVVDQSGSM------LESAIFSAVMASIF 265 Query: 353 AE--NRRCYIMLFSTEIVRYELSGPQGIEQAIRFLSQQFRGGTDLASCFRAIMERLQSRE 410 AE + ++LF TE+V +LS G + LS Q GGTD+ E ++ + Sbjct: 266 AELPAVKTSLVLFDTEVV--DLSDQVG-QPVDVLLSIQLGGGTDITKGLMYANELVRQPQ 322 Query: 411 WFDADAVVISDFIAQRLPDDVTSKVK 436 V+I+DF R D+ ++ + Sbjct: 323 --RTIVVLITDFYEGRPEADLVAQTR 346 >UniRef50_Q1NTK1 Von Willebrand factor, type A n=2 Tax=delta proteobacterium MLMS-1 RepID=Q1NTK1_9DELT Length = 771 Score = 53.9 bits (128), Expect = 1e-05, Method: Composition-based stats. Identities = 25/146 (17%), Positives = 53/146 (36%), Gaps = 23/146 (15%) Query: 324 VCVDTSGSMGGFNEQCAKAFCLALMRIALAENRRCYIMLFSTEIVRY-------ELSGPQ 376 + +D SGSM G + AK ++ + E+ C +++F +E+ + + Sbjct: 269 ILLDCSGSMAGDSIAQAKQAISDMLNLLRPED-YCNLIMFGSEVKSVFPCQVAADKTNIT 327 Query: 377 GIEQAIRFLSQQFRGGTDLASCFRAIMERL-----QSREWFDA----DAVVISDFIAQRL 427 + +AIR + GGT++ ++ E A + ++I+D Sbjct: 328 TLRRAIRAIDAD-MGGTEMQKALVETLKMSPIYKPPEVEVVPARISRNILLITDGQVWG- 385 Query: 428 PDDVTSKVKELQRVHQHRFHAVAMSA 453 ++ HR V + Sbjct: 386 ----DKQILRRMAKSDHRVFTVGVGG 407 >UniRef50_B1HSQ9 Putative uncharacterized protein n=2 Tax=Bacillaceae RepID=B1HSQ9_LYSSC Length = 275 Score = 53.9 bits (128), Expect = 1e-05, Method: Composition-based stats. Identities = 35/177 (19%), Positives = 71/177 (40%), Gaps = 14/177 (7%) Query: 287 EKQLLTYRLHGESWREKVIERPVVHKD-------YDEQPRGPFIVCVDTSGSMGGFNEQC 339 E++L R E R+K+ ++ + + +Q GP I+C++ + M ++E C Sbjct: 74 EQRLYYERKWSELRRQKLKQQTTKNGNVFYIPFHDYKQDPGPIIICLEQTTGMEAYSELC 133 Query: 340 AKAFCLALMRIALAENRRCYIMLFSTEI-VRYELSGPQ-GIEQAIRFLSQQFRGGTDLAS 397 K+ L L A ENR YI+ + +I V Y + F+ + +G +L Sbjct: 134 -KSMILPLFMNAHRENRDLYIIPYDRQINVHYRFENGHLNLADFKSFIEYKAKGEAELLP 192 Query: 398 CFRAIMERLQSREW-FDADAVVISDF---IAQRLPDDVTSKVKELQRVHQHRFHAVA 450 + + L + +A+ ++ ++ Q L + E + +V Sbjct: 193 VLQFVRSILHENQLVTEAEVIIFTEGTPIDGQHLVSKQAKVMLEEMKRKYLAEFSVI 249 >UniRef50_D1YYY2 Putative uncharacterized protein n=1 Tax=Methanocella paludicola SANAE RepID=D1YYY2_METPS Length = 716 Score = 53.9 bits (128), Expect = 1e-05, Method: Composition-based stats. Identities = 30/177 (16%), Positives = 68/177 (38%), Gaps = 9/177 (5%) Query: 289 QLLTYRLHGESWREKVIERPVVHKDYDEQPRGPFIVCVDTSGSMGGFNEQCAK-AFCLAL 347 + + + + P KD + G +++ +D SGSM G ++ A+ A L Sbjct: 137 KAMAHEDASGDTYFMAMLAPPASKDVK-KISGEYVILIDHSGSMAGPKKEAAEWAVGKFL 195 Query: 348 MRIALAENRRCYIMLFSTEIVRYE--LSGPQG--IEQAIRFLSQQF-RGGTDLASCFRAI 402 + + + + FS Y L+G G ++ A+ F+ +F GGT++ Sbjct: 196 LGLGPDDW--FTLGAFSNNTRWYSRLLAGATGDTVKNAVEFMKSKFEGGGTEMGVALEQA 253 Query: 403 MERLQSREWFDADAVVISDFIAQRLPDDVTSKVKELQRVHQHRFHAVAMSAHGKPGI 459 ++ + + ++I+D + +E +R + + + A + Sbjct: 254 LDIKRLKGDVSRHVLIITDAEVTDGGRILRLVDRESRRPDRRSISLLCIDAAPNSYL 310 >UniRef50_Q9U7P4 Putative uncharacterized protein (Fragment) n=1 Tax=Eufolliculina uhligi RepID=Q9U7P4_9CILI Length = 494 Score = 53.5 bits (127), Expect = 2e-05, Method: Composition-based stats. Identities = 29/127 (22%), Positives = 51/127 (40%), Gaps = 8/127 (6%) Query: 303 KVIERPVVHKDYDEQPRGPFIVCV-DTSGSMGGFNEQCAKAFCLALMRIALAENRRCYIM 361 I + + G IVCV D SGSM G Q + ++ +R C ++ Sbjct: 71 CTINLESPAQTSEASRSGVDIVCVIDVSGSMQGEKIQLVQTTLNFMVERLSPADRIC-LI 129 Query: 362 LFSTE---IVRYELSGPQGIEQAIRFL-SQQFRGGTDLASCFRAIMERLQSREWFD--AD 415 FS + I R P+G +Q + GGT++ ++ L+ R + + Sbjct: 130 SFSNDATKISRLVQMSPKGKKQLKSMIPRLVASGGTNIVGGLEYGLQALRQRRTINQLSS 189 Query: 416 AVVISDF 422 +++SD Sbjct: 190 IILLSDG 196 >UniRef50_UPI0000ECD6E7 Poly [ADP-ribose] polymerase 4 (EC 2.4.2.30) (PARP-4) (Vault poly(ADP- ribose) polymerase) (VPARP) (193 kDa vault protein) (PARP- related/IalphaI-related H5/proline-rich) (PH5P). n=5 Tax=Tetrapoda RepID=UPI0000ECD6E7 Length = 1691 Score = 53.5 bits (127), Expect = 2e-05, Method: Composition-based stats. Identities = 26/145 (17%), Positives = 49/145 (33%), Gaps = 11/145 (7%) Query: 322 FIVCVDTSGSMGGFNEQCAKAFCLALMRIALAENRRCYIMLFS---TEIVRYELSGPQGI 378 I+ +D S SM G AK L ++ + ++ F +E + + + + Sbjct: 888 IIILLDCSNSMAGSALLQAKQIALHALKQFSSRQ-NVNLIKFGTNFSEFSSFSKNTSKDL 946 Query: 379 EQAIRFLSQQFR--GGTDLASCFRAIMERLQSREWFDADAVVISDFIAQRLPDDVTSKVK 436 F++ G TDL R + S+ + ++ISD Q Sbjct: 947 ASLTEFITSATATMGNTDLWKTLRYLSLLFPSQGHRN--ILLISDGHIQNESVTFQLVKD 1004 Query: 437 ELQRVHQHRFHAVAMSAHGKPGIMR 461 VH R + + ++R Sbjct: 1005 N---VHHTRLFTCGVGSTANRHMLR 1026 >UniRef50_UPI0001BC3853 von Willebrand factor type A n=1 Tax=Butyrivibrio crossotus DSM 2876 RepID=UPI0001BC3853 Length = 623 Score = 53.5 bits (127), Expect = 2e-05, Method: Composition-based stats. Identities = 25/161 (15%), Positives = 63/161 (39%), Gaps = 29/161 (18%) Query: 323 IVCVDTSGSMGGFNEQCAKAFCLALMRIALAENRRCY-------------------IMLF 363 ++ +D SGSM + + L + + CY I+LF Sbjct: 100 VILIDCSGSMRTNDPDFEYSVKNTLYPGSSYQITTCYRKLASKNYVKAQGNDDRTGIVLF 159 Query: 364 STEIVR-YELSGPQGIEQAIRFLSQQF-RGGTDLASCFRAIMERL-QSREWFDADAVVIS 420 ++E EL+ + + + + + + GGT+ + + + L +R + +++S Sbjct: 160 TSEANTVCELTNSEYV--LMNAIDKIYSNGGTNFNNAIKESIRILTNTRNDSEKRILLVS 217 Query: 421 DFIAQRLPDDVTSKVKELQRVHQHRFHAVAMSAHGKPGIMR 461 D ++ ++S V +L + + + V + +++ Sbjct: 218 DGESE-----LSSSVIDLAIENNIKINTVYIGGQNNNELLK 253 >UniRef50_B5HZU2 VWA domain-containing protein n=1 Tax=Streptomyces sviceus ATCC 29083 RepID=B5HZU2_9ACTO Length = 518 Score = 53.5 bits (127), Expect = 2e-05, Method: Composition-based stats. Identities = 26/149 (17%), Positives = 48/149 (32%), Gaps = 11/149 (7%) Query: 326 VDTSGSMGGFNEQCAKAFCLALMRIALAENRRCYIMLFSTEI--VRYELSGPQGIEQAIR 383 +DTSGSM G K LA + E +M F +++ VR + P + Sbjct: 346 LDTSGSMEGDRLDRLKT-ALADLTGDFREREEVTLMPFGSQVKSVRTHVVKPSDPRAGLD 404 Query: 384 FLS-----QQFRGGTDLASCFRAIMERLQSREWFDADAVVISDFI--AQRLPDDVTSKVK 436 + G T + + + L + V+++D A D + Sbjct: 405 AIRDDTSALSADGDTAIYTSLEKAYDHLGAGRDAFTSIVLMTDGENTAGAKARDFDAFYA 464 Query: 437 EL-QRVHQHRFHAVAMSAHGKPGIMRIFD 464 L ++ + + + I D Sbjct: 465 RLGRKARDTPVFPILFGDSDRSELAHIAD 493 >UniRef50_B7FTA2 Predicted protein n=3 Tax=Bacillariophyta RepID=B7FTA2_PHATR Length = 800 Score = 53.5 bits (127), Expect = 2e-05, Method: Composition-based stats. Identities = 32/135 (23%), Positives = 52/135 (38%), Gaps = 6/135 (4%) Query: 297 GESWREKVIERPVVHKDYDEQPRGPFIV-CVDTSGSMGGFNEQCAKAFCLALMRIALAEN 355 + R I++ V + G I+ VD SGSM AK ++L+ A Sbjct: 567 SKEGRGVHIQQSDVRIKKMARKAGSLIIFVVDASGSMALNRMNAAKGAAVSLLTEAYQSR 626 Query: 356 RRCYIMLFSTEIVRYELSGPQGIEQAIRFLSQQ-FRGGTDLASCFRAI----MERLQSRE 410 + ++ F E+ L + I A + L Q GG+ LA + + +S + Sbjct: 627 DKISLIPFQGEMADVLLPPTKSITMARQRLEQMPCGGGSPLAHALQLATLTGINAQKSGD 686 Query: 411 WFDADAVVISDFIAQ 425 V+ISD A Sbjct: 687 VGKVVVVLISDGRAN 701 >UniRef50_B9HP09 Predicted protein n=13 Tax=cellular organisms RepID=B9HP09_POPTR Length = 786 Score = 53.5 bits (127), Expect = 2e-05, Method: Composition-based stats. Identities = 29/139 (20%), Positives = 50/139 (35%), Gaps = 6/139 (4%) Query: 294 RLHGESWREKVIERPVVHKDYDEQPRGPFIV-CVDTSGSMGGFNEQCAKAFCLALMRIAL 352 + R+ +E+ + + G ++ VD SGSM Q AK L L+ + Sbjct: 556 EKDTQKSRKVYVEKTDMRAKRMARKAGALVIFVVDASGSMALNRMQNAKGAALKLLAESY 615 Query: 353 AENRRCYIMLFSTEIVRYELSGPQGIEQAIRFLSQQFRGGTD-----LASCFRAIMERLQ 407 + I+ F + L + I A + L + GG L + R + + Sbjct: 616 TSRDQVAIIPFRGDAAEVLLPPSRSISMARKRLERLPCGGGSPLAHGLTTAVRVGLNAEK 675 Query: 408 SREWFDADAVVISDFIAQR 426 S + V I+D A Sbjct: 676 SGDVGRIMIVAITDGRANI 694 >UniRef50_Q6VPP3 Parturition-related protein PRP3 n=6 Tax=Eutheria RepID=Q6VPP3_RAT Length = 923 Score = 53.5 bits (127), Expect = 2e-05, Method: Composition-based stats. Identities = 33/154 (21%), Positives = 59/154 (38%), Gaps = 20/154 (12%) Query: 323 IVCV--DTSGSMGGFNEQCAK---AFCLALMRIALAENRRCY-IMLF-STEIVRYELSGP 375 IVC+ D SGSM G ++ + A L +I E+R ++ F S+ V+ EL Sbjct: 307 IVCLVLDVSGSM-GSYDRLNRMNQAAKFFLQQIL--ESRSWAGMVHFHSSATVKSELIQI 363 Query: 376 QGI---EQAIRFLSQQFRGGTDLASCFRAIMERLQSREWFDA--DAVVISDFIAQRLPDD 430 Q + L GGT + S R + +++ + D +++SD D Sbjct: 364 NSDVERNQLLETLPTSASGGTSICSGIRTAFQVFKNKGYQTGGNDILLLSDGE-----DS 418 Query: 431 VTSKVKELQRVHQHRFHAVAMSAHGKPGIMRIFD 464 + + H +A+ I + + Sbjct: 419 TAKDCLDEVKDSGAVVHFIALGKAFDQSISNMAN 452 >UniRef50_Q1D9B7 von Willebrand factor type A domain protein n=3 Tax=Cystobacterineae RepID=Q1D9B7_MYXXD Length = 476 Score = 53.5 bits (127), Expect = 2e-05, Method: Composition-based stats. Identities = 25/148 (16%), Positives = 55/148 (37%), Gaps = 10/148 (6%) Query: 324 VCVDTSGSMGGFNEQCAKAFCLALMRIALAENRRCYIMLFSTEIVRYELSGPQGIEQAIR 383 + +D SGSM G+ AK L+ + ++R I+ + +++ S R Sbjct: 99 LVIDRSGSMSGYKLAQAKQAARHLIGLLNDQDR-LAIIHYGSDVKSLP-SLEATAANRER 156 Query: 384 FLSQQFR----GGTDLASCFRAIMERLQSRE--WFDADAVVISDFI--AQRLPDDVTSKV 435 GGT++ + A +L + + + +++SD D+ +++ Sbjct: 157 MFQYVDGIWDEGGTNIGAGLSAGRYQLSTAQRTYGVNRLILMSDGQPTEGLTADEELTRM 216 Query: 436 KELQRVHQHRFHAVAMSAHGKPGIMRIF 463 R A+ + +M+ F Sbjct: 217 ARELRATGLTLSAIGVGTDFNEDLMQAF 244 >UniRef50_C7FPD9 Uncharacterized protein n=2 Tax=environmental samples RepID=C7FPD9_9BACT Length = 836 Score = 53.5 bits (127), Expect = 2e-05, Method: Composition-based stats. Identities = 30/178 (16%), Positives = 65/178 (36%), Gaps = 10/178 (5%) Query: 289 QLLTYRLHGESWREKVIERPVVHKDYDEQPRGPFIVCVDTSGSMGGFNEQCAKAFCLALM 348 ++ +R G++ + P + D D+ VDTSGSM G A+A + Sbjct: 278 RVSAHRKPGQTGTFALTLEPPLKVDPDQVTPKELFFVVDTSGSMMGEPLDKARAAMRYAL 337 Query: 349 RIALAENRRCYIMLFSTEIVRYELSG----PQGIEQAIRFLS-QQFRGGTDLASCFRAIM 403 ++ I+ F++ + P+ + + + F+ +GGT++ + RA + Sbjct: 338 ERMGPDD-TFQIIDFASGVASLAPRPLPNTPENLRKGLAFIEAMTSQGGTEMLAGIRAAL 396 Query: 404 ERLQSREWFDADAVVISDFIAQRLPDDVTSKVKELQRVHQHRFHAVAMSAHGKPGIMR 461 + ++D D+ + Q V Q R + + ++ Sbjct: 397 DGPTPPGRLRI-VAFMTDGYIGN-DGDILDYID--QSVGQARLFSFGVGEDVNRYLLE 450 >UniRef50_Q1DFU7 von Willebrand factor type A domain protein n=2 Tax=Cystobacterineae RepID=Q1DFU7_MYXXD Length = 422 Score = 53.5 bits (127), Expect = 2e-05, Method: Composition-based stats. Identities = 29/146 (19%), Positives = 61/146 (41%), Gaps = 10/146 (6%) Query: 324 VCVDTSGSMGGFNEQCAKAFCLALMRIALAENRRCYIMLFSTEIVRYELSG--PQGIEQA 381 + +D SGSM G A+ L++ E+R + + T++ + E+ Sbjct: 51 LVLDRSGSMNGQKLADARRAATELVQRLKPEDR-LAFIDYGTDVRVQPSRRMTEEAREEL 109 Query: 382 IRFLS-QQFRGGTDLASCFRAIMERLQS--REWFDADAVVISDFI---AQRLPDDVTSKV 435 + +S Q G T+++ A L+ RE+ + A+++SD + +V Sbjct: 110 LTLISGLQDDGSTNISGALDAAANALRPHMREYRVSRAILLSDGQPTTGIVSEPGLLDQV 169 Query: 436 KELQRVHQHRFHAVAMSAHGKPGIMR 461 ++L+R A+ + + +MR Sbjct: 170 RQLRRD-GITVSALGVGRDYQETLMR 194 >UniRef50_Q9M1S2 Putative uncharacterized protein T5N23_140 n=5 Tax=Arabidopsis thaliana RepID=Q9M1S2_ARATH Length = 676 Score = 53.1 bits (126), Expect = 2e-05, Method: Composition-based stats. Identities = 25/124 (20%), Positives = 50/124 (40%), Gaps = 9/124 (7%) Query: 307 RPVVHKDYDEQPRGPF--IVCVDTSGSMGGFNEQCAKAFCLALMRIALAENRRCYIMLFS 364 + V + R P + +D SGSMGG K +++ + +R ++ FS Sbjct: 228 KAVTGDQISQYRRAPIDLVTVLDISGSMGGTKLALLKRAMGFVIQNLGSSDR-LSVIAFS 286 Query: 365 TEIVR-YELSGPQGIEQAIRFL---SQQFRGGTDLASCFRAIMERLQSREWFD--ADAVV 418 + R + L+ + + S GGT++ R + ++ R + A ++ Sbjct: 287 STARRLFPLTRMSDAGRQLALQAVNSLVANGGTNIVDGLRKGAKVMEDRLERNSVASIIL 346 Query: 419 ISDF 422 +SD Sbjct: 347 LSDG 350 >UniRef50_A6WMD3 Vault protein inter-alpha-trypsin domain protein n=11 Tax=Shewanella RepID=A6WMD3_SHEB8 Length = 772 Score = 53.1 bits (126), Expect = 2e-05, Method: Composition-based stats. Identities = 32/174 (18%), Positives = 69/174 (39%), Gaps = 14/174 (8%) Query: 298 ESWREKVIERPVVHKDYDEQPRGPFIVCVDTSGSMGGFNEQCAKAFCLALMRIALAENRR 357 +++ ++ P V K I+ +DTSGSM G + AK L ++ E+ Sbjct: 370 DNYSLVMVLPPKVEKSTQPSLPRELILVIDTSGSMAGDSIVQAKNALLYALKGLKPED-S 428 Query: 358 CYIMLFSTEIVRYELSG----PQGIEQAIRFLS-QQFRGGTDLASCFRAIMERLQSREWF 412 I+ F++ + + + +A +F+S Q GGT++A A + + Sbjct: 429 FNIIEFNSSLSLLSATPLPATSSNLSRARQFVSRLQADGGTEMALALDAALPKSLGSVSP 488 Query: 413 DA-----DAVVISDFIAQRLPDDVTSKVKELQRVHQHRFHAVAMSAHGKPGIMR 461 DA + ++D + + ++ ++ + R V + + M+ Sbjct: 489 DAVQPLRQVIFMTDG-SVGNEQALFDLIRY--QIGESRLFTVGIGSAPNSHFMQ 539 >UniRef50_UPI00016C38A3 LPXTG-motif cell wall anchor domain protein n=1 Tax=Gemmata obscuriglobus UQM 2246 RepID=UPI00016C38A3 Length = 874 Score = 53.1 bits (126), Expect = 2e-05, Method: Composition-based stats. Identities = 31/146 (21%), Positives = 55/146 (37%), Gaps = 12/146 (8%) Query: 322 FIVCVDTSGSMGGFN---EQCAKAFCLALMRIALAENRRCYIMLFSTEIV----RYELSG 374 I+ VD SGSM G A LA L+E+ + LF + R + Sbjct: 284 VILLVDHSGSMSGAKWEAADWAVERFLA----GLSEDDAFSLGLFHSTTKWFGERTRKAT 339 Query: 375 PQGIEQAIRFLSQQFR-GGTDLASCFRAIMERLQSREWFDADAVVISDFIAQRLPDDVTS 433 P+ + A+ FL GGT+L + R +S E ++++D + Sbjct: 340 PENVRAAVEFLKLNRDQGGTELGVALEQALARSRSAETPARHVLILTDAEVTDAGRILRL 399 Query: 434 KVKELQRVHQHRFHAVAMSAHGKPGI 459 E ++ ++ R + + A + Sbjct: 400 ADLESEKPNRRRISVLCIDAAPNAAL 425 >UniRef50_B5ZN80 von Willebrand factor type A n=8 Tax=Rhizobiales RepID=B5ZN80_RHILW Length = 522 Score = 53.1 bits (126), Expect = 2e-05, Method: Composition-based stats. Identities = 26/128 (20%), Positives = 51/128 (39%), Gaps = 16/128 (12%) Query: 324 VCVDTSGSMGGFNE-QCAKAFCLAL-------MRIALAENRRCYIMLFSTEIVRYELSGP 375 +C+D SGSM G E Q KA L + + + R ++ F + ++ Sbjct: 346 LCLDFSGSMQGDGEDQLQKAMRFLLTPDEASKVLVQWSPADRIIVIPFDGSVRNTFMASG 405 Query: 376 QGIEQ---AIRFLSQQFRGGTDLASCFRAIMERLQSRE----WFDADAVVISDFIAQRLP 428 +EQ Q+ GGTD+ +C ++++ + + A V+++D + Sbjct: 406 NPLEQEGLLNEISRQKAGGGTDMYTCAAQALQQIARSDRLSTYLPA-IVIMTDGRSDDQS 464 Query: 429 DDVTSKVK 436 S+ Sbjct: 465 QAFMSEWN 472 >UniRef50_Q2QSE5 Os12g0431700 protein n=3 Tax=Oryza sativa RepID=Q2QSE5_ORYSJ Length = 524 Score = 53.1 bits (126), Expect = 2e-05, Method: Composition-based stats. Identities = 22/111 (19%), Positives = 41/111 (36%), Gaps = 6/111 (5%) Query: 321 PFIVCVDTSGSMGGFN-EQCAKAFCLALMRIALAENRRCYIMLFSTEIVRYELSGPQGIE 379 + VD SGSM G E KA +M++ + S + + + Q Sbjct: 62 DLVAVVDVSGSMRGHKIESVKKALQFVIMKLTPVDRLSIVTFESSAKRLTKLRAMTQDFR 121 Query: 380 QAIRFLSQQ--FRGGTDLASCFRAIMERLQSREW---FDADAVVISDFIAQ 425 + + + GGTD+ + + L R + A+ ++SD + Sbjct: 122 GELDGIVKSLIANGGTDIKAGLDLGLAVLADRVFTESRTANIFLMSDGKLE 172 >UniRef50_A0DIJ2 Chromosome undetermined scaffold_52, whole genome shotgun sequence n=4 Tax=Eukaryota RepID=A0DIJ2_PARTE Length = 2542 Score = 53.1 bits (126), Expect = 2e-05, Method: Composition-based stats. Identities = 32/155 (20%), Positives = 61/155 (39%), Gaps = 12/155 (7%) Query: 300 WREKVIERPVVHKDYDEQPRGPFIVCVDTSGSMGGFNEQCAKAFCLALMRIALAE-NRRC 358 ++ + K+ P+ +I +D SGSM G AK CL + N R Sbjct: 2338 RQDIKFLQKRQFKEVINSPKIHYIFMIDDSGSMSGSPWNTAKNCCLNCLSTIEKNLNARV 2397 Query: 359 YIMLFSTE---IVRYELSGPQGIEQAIRFLSQQFRGGTDLASCFRAIMERL---QSREWF 412 +++F++ + E+ +E+ I+F S G TD S F+ + + Q+ + Sbjct: 2398 SVIIFNSTARIAINCEIVNLVEMEKKIQFNS----GSTDFGSAFQQAYKLIVQHQNDAFQ 2453 Query: 413 DADAVVISDFIAQRLPDDVTSKVKELQRVHQHRFH 447 + + +D A P + E+ + R Sbjct: 2454 KTEVLFYTDGGAAY-PKEQVKLFTEIPDHQKARIF 2487 >UniRef50_B6XSC0 Putative uncharacterized protein n=3 Tax=Bifidobacterium RepID=B6XSC0_9BIFI Length = 1192 Score = 53.1 bits (126), Expect = 2e-05, Method: Composition-based stats. Identities = 22/111 (19%), Positives = 41/111 (36%), Gaps = 11/111 (9%) Query: 321 PFIVCVDTSGSMGGFNEQCAKAFCLALMRIAL---------AENRRCYIMLFSTEIVRYE 371 ++ +D SGSM G + AK AL + L + + ++ FST+ E Sbjct: 492 DIVLVMDKSGSMKGELDNNAKEAANALAKKLLTDKNSTLPSEQQVQMAVVTFSTKA-TIE 550 Query: 372 LSGPQGIEQAIRFLSQQFRGGTDLASCFRAIMERLQSREWFDADAVVISDF 422 + + + + GGT+ + + L R + +SD Sbjct: 551 QNFTTDVLKINNAVEGDPDGGTNWEAALKQA-NILSGRSNVKKHIIFLSDG 600 >UniRef50_B6HQ22 Pc22g19800 protein n=1 Tax=Penicillium chrysogenum Wisconsin 54-1255 RepID=B6HQ22_PENCW Length = 896 Score = 53.1 bits (126), Expect = 2e-05, Method: Composition-based stats. Identities = 28/138 (20%), Positives = 54/138 (39%), Gaps = 11/138 (7%) Query: 322 FIVCVDTSGSMGGFNEQCAKAFCLALMRIALAENRRCYIMLFSTEIVRY----ELSGPQG 377 + VD SGSM A L L +L ++ F + ++S + Sbjct: 279 IVFVVDRSGSMTDNMHTLRSALGLFL--KSLPLGVPFNLISFGSSFEAIWARSKVSTRES 336 Query: 378 IEQAIRFLS--QQFRGGTDLASCFRAIMERLQSREWFDADAVVISDFIAQRLPDDVTSKV 435 +E+A++ Q GGT++ S A +E+ + + +V++D + V V Sbjct: 337 LEEALQHTKNIQADLGGTEILSGLEAAVEKRYQDKVLE--VLVLTDGEVWNQSE-VFDLV 393 Query: 436 KELQRVHQHRFHAVAMSA 453 + + H RF + + Sbjct: 394 NQANQQHSTRFFTLGLGD 411 >UniRef50_UPI00006CDA4D Glutathionylspermidine synthase family protein n=1 Tax=Tetrahymena thermophila RepID=UPI00006CDA4D Length = 1547 Score = 53.1 bits (126), Expect = 2e-05, Method: Composition-based stats. Identities = 29/139 (20%), Positives = 61/139 (43%), Gaps = 13/139 (9%) Query: 322 FIVCVDTSGSMGGFNEQCA-KAFCLALMRIALAENRRCYIMLFSTEIV----RYELSGPQ 376 FI +D SGSM G A +A L L +L + ++ F + + E Q Sbjct: 308 FIFLLDRSGSMSGQPIDRACQALTLFL--KSLPTDSYFNVISFGSSFKLLFPQSEKYNSQ 365 Query: 377 GIEQAIRFLSQQFR--GGTDLASCFRAIMERLQSREWFDADAVVISDFIAQRLPDDVTSK 434 +E+AI +S+ GGT++ + + + + + + + +++D P+ V S Sbjct: 366 SLEKAISNISKYKADLGGTEIYKPLKNVFVQNKIQGY-NKQVFLLTDGEV-DSPEQVISL 423 Query: 435 VKELQRVHQHRFHAVAMSA 453 +++ + R H++ + Sbjct: 424 IRKNNKFS--RVHSIGFGS 440 >UniRef50_Q74N17 NEQ403 n=1 Tax=Nanoarchaeum equitans RepID=Q74N17_NANEQ Length = 216 Score = 53.1 bits (126), Expect = 2e-05, Method: Composition-based stats. Identities = 28/132 (21%), Positives = 61/132 (46%), Gaps = 11/132 (8%) Query: 322 FIVCVDTSGSMGGFNEQCAKAFCLALMRIALAENRRCYIMLFSTEIVRYELSGPQGIEQA 381 +IV +D SGSM G E+ KA +AL I R ++LF+ +I++ + + Sbjct: 73 YIVLLDCSGSMKG--EKFEKALAIAL-SIIYKYKR-VGLILFNNKIIKS-IPPTENKTLL 127 Query: 382 IRFLSQQFRGGTDLASCFRAIMERLQSREWFDADAVVISDFI-AQRLPDDVTSKVKELQR 440 + L R T+++ M+ + + ++ +I+D + D++ V++L Sbjct: 128 VNSLFVIPREKTNISIALEEAMKYAKPK----SEIFIITDAVPTDETVDELIETVRKLA- 182 Query: 441 VHQHRFHAVAMS 452 + + H + ++ Sbjct: 183 LKNIKVHVIGIN 194 >UniRef50_Q5K267 Putative uncharacterized protein (Fragment) n=1 Tax=Guillardia theta RepID=Q5K267_GUITH Length = 197 Score = 52.8 bits (125), Expect = 3e-05, Method: Composition-based stats. Identities = 24/110 (21%), Positives = 45/110 (40%), Gaps = 5/110 (4%) Query: 320 GPFIVCVDTSGSMGGFNEQCAKAFCLALMRIALAENRRCYIMLFSTEIVRYELSGPQGIE 379 G I D SGSM G + A + ++ R ++ FS ++L G I Sbjct: 76 GKTIAMSDVSGSMSGTPMFVSIALGILCSEVSHPAYRDL-VLTFSERPSWHKLQGCTNIV 134 Query: 380 QAIR-FLSQQFRGGTDLASCFRAIMERLQSREWFDADA---VVISDFIAQ 425 ++ + + G TD+ + I+E ++S+ + ++ISD Sbjct: 135 DKVKSLMRADWGGNTDVYKAMKLILELVRSKGLQPDEIPNLLIISDMQFD 184 >UniRef50_A1S119 von Willebrand factor, type A n=1 Tax=Thermofilum pendens Hrk 5 RepID=A1S119_THEPD Length = 327 Score = 52.8 bits (125), Expect = 3e-05, Method: Composition-based stats. Identities = 31/149 (20%), Positives = 52/149 (34%), Gaps = 14/149 (9%) Query: 323 IVCVDTSGSM-----GGFNEQCAKAFCLALMRIALAENRRCYIMLFSTEIVRY--ELSGP 375 ++ VD SGSM GG + A+ L+ + ++ FS IV Sbjct: 103 VLVVDVSGSMEDSIPGGVKIEVARRAATLLVER-MPGGVDVGLLAFSDRIVLSLPPTGDR 161 Query: 376 QGIEQAIRFLSQQFRGGTDLASCFRAIMERLQSREWFDAD--AVVISDFIAQRLPDDVTS 433 + + AI L GGT +A + L+ + F+A V +SD + Sbjct: 162 RRVLDAIESLKP--GGGTMYTYPLQAALSWLKPYKLFNASTLVVFVSDGL--PADAATYR 217 Query: 434 KVKELQRVHQHRFHAVAMSAHGKPGIMRI 462 + R + V + G G + Sbjct: 218 TLLSEFRSLGIPVYTVYIGPGGDEGEREL 246 >UniRef50_A8J3X6 Flagellar associated protein (Fragment) n=1 Tax=Chlamydomonas reinhardtii RepID=A8J3X6_CHLRE Length = 1043 Score = 52.8 bits (125), Expect = 3e-05, Method: Composition-based stats. Identities = 33/142 (23%), Positives = 48/142 (33%), Gaps = 25/142 (17%) Query: 293 YRLHGESWREKV-IERPVVHKDYDEQPRGPFIV-------------------CVDTSGSM 332 YR+ G + + P D PRG F++ +D SGSM Sbjct: 248 YRVWGNDMFLALNVSNPRPPHPADPDPRGAFVLSVAPPAPEFTAPFPRSVVFLLDRSGSM 307 Query: 333 GGFNEQCAKAFCLALMRIALAENRRCYIMLFSTEIVRYELSGPQGIEQAIRFLSQQFRGG 392 G + AKA L +L ++ F E + P G +R S RG Sbjct: 308 SGEPMEFAKA-ALCFGLRSLTPLDTFTVVAFDHEQL---WFTPGGQLSWVR-ASVDARGL 362 Query: 393 TDLASCFRAIMERLQSREWFDA 414 TD+ + + M L A Sbjct: 363 TDIMTPLQTAMRVLSGGGTRIA 384 >UniRef50_B7RYC9 Vault protein inter-alpha-trypsin n=1 Tax=marine gamma proteobacterium HTCC2148 RepID=B7RYC9_9GAMM Length = 686 Score = 52.8 bits (125), Expect = 3e-05, Method: Composition-based stats. Identities = 31/181 (17%), Positives = 63/181 (34%), Gaps = 15/181 (8%) Query: 291 LTYRLHGESWREKVIERPVVHKDYDEQPRGPFIVCVDTSGSMGGFNEQCAKAFCLALMRI 350 T R+ + + ++ P + + PR + VDTSGSMGG + + AK +R Sbjct: 288 FTERVDDQYYGLLMLVPPASQRAAETVPR-EIVFVVDTSGSMGGVSIKQAKGSLTRALRH 346 Query: 351 ALAENRRCYIMLFSTEIVRYELSGPQGIEQAIRFLS-----QQFRGGTDLASCFRAIMER 405 +R ++ F++ ++ S + GGT++ + ++ Sbjct: 347 LGPNDR-FNVIEFNSSHRALFQHAVPASHHNLQLASEYVRHLEASGGTEMMPALQLALKL 405 Query: 406 LQSREWFDAD-----AVVISDFIAQRLPDDVTSKVKELQRVHQHRFHAVAMSAHGKPGIM 460 +++ + + I+D V L R V + + M Sbjct: 406 PGAQDELRPEPALRQVIFITDGAVGNESALFEHIVDSLG---GSRLFTVGIGSAPNAWFM 462 Query: 461 R 461 R Sbjct: 463 R 463 >UniRef50_Q11Y10 Possible outer membrane protein n=1 Tax=Cytophaga hutchinsonii ATCC 33406 RepID=Q11Y10_CYTH3 Length = 1313 Score = 52.8 bits (125), Expect = 3e-05, Method: Composition-based stats. Identities = 31/199 (15%), Positives = 64/199 (32%), Gaps = 15/199 (7%) Query: 273 GITELEYEFYR-RLVEKQLLTYRLHGESWREKVIERPVVHKDYDEQPRGP----FIVCVD 327 ++ +F+ +E Q+ T S E IE+ VV + P ++ +D Sbjct: 39 KYPVVKADFFALDKLENQITTLTRESVSIHENGIEQQVVKVVNPAAVK-PKSISLVLTID 97 Query: 328 TSGSMGGFNEQCAKAFCLALMRIALAENRRCYIMLFSTEIVRYELSGPQGIEQAIRFLSQ 387 S SM AK A++ + C + F+ + + Sbjct: 98 ISESMQKQYMPLAKNAAAAIVNKLPLDISECAVTSFNDVSFINTDFTRDRFKLLQSIQTL 157 Query: 388 QFRGGTDLASCFRA----IMERLQSREWFDADAVVISDFIAQRLPDDVTSKVKELQRVHQ 443 GGTD F ++ L+ + ++D P ++ + K + Sbjct: 158 VPAGGTDYNKGFIKSNAGGLDILKKGLHEKV-LIFLTDGYGDVNPTEIIQQAKSI----G 212 Query: 444 HRFHAVAMSAHGKPGIMRI 462 + + + + + RI Sbjct: 213 AKVYVITLGMSAPEELKRI 231 >UniRef50_Q23JA0 von Willebrand factor type A domain containing protein n=2 Tax=Tetrahymena thermophila SB210 RepID=Q23JA0_TETTH Length = 1049 Score = 52.8 bits (125), Expect = 3e-05, Method: Composition-based stats. Identities = 27/147 (18%), Positives = 63/147 (42%), Gaps = 13/147 (8%) Query: 322 FIVCVDTSGSMGGFNEQCA-KAFCLALMRIALAENRRCYIMLFSTEIVRY----ELSGPQ 376 FI +D SGSM G + A +A L L +L + ++ F + + + Sbjct: 313 FIFLLDRSGSMSGQPIRRACEALTLFL--KSLPNDSYFNVISFGSSFDKLFPSSTKYTSE 370 Query: 377 GIEQAIRFLSQQFR--GGTDLASCFRAIMERLQSREWFDADAVVISDFIAQRLPDDVTSK 434 +E+AI +S+ GGT++ + + + + + + + +++D P V Sbjct: 371 SLEKAILLISKYQADLGGTEIYNPLNNVFVQNKIQGY-NKQIFLLTDGEV-DSPQQVVRL 428 Query: 435 VKELQRVHQHRFHAVAMSAHGKPGIMR 461 +K+ + +R H++ + +++ Sbjct: 429 IKKNNK--YNRVHSIGFGSGADQYLIK 453 >UniRef50_B5IF69 von Willebrand factor type A domain protein n=2 Tax=Aciduliprofundum boonei T469 RepID=B5IF69_9EURY Length = 471 Score = 52.8 bits (125), Expect = 3e-05, Method: Composition-based stats. Identities = 24/155 (15%), Positives = 56/155 (36%), Gaps = 9/155 (5%) Query: 288 KQLLTYRLHGESWREKVIERPVVHKDYDEQPRGPFIVCVDTSGSMGGFNEQCAKAFCLAL 347 K L +Y + E + + + + ++ +D SGSM G + A+ + Sbjct: 282 KTLTSYGIVLPGVYSTKSESIDMGNEREGKSYRDSMIILDCSGSMEGEPFERAREAAYVM 341 Query: 348 MRIALAENRRCYIMLFST----EIVRYELSGPQGIEQAIRFLSQQFRGGTDLASCFRAIM 403 R ++ ++ FS + V Y + + + + RGGT L + + Sbjct: 342 AREIQKAGKKVGLIPFSGYVIEDRVVYPTTDSSRLNDIL--ARIKPRGGTMLQYALQFAL 399 Query: 404 ERLQSREWFDADAVVISDFIAQRLPDDVTSKVKEL 438 + + + + SD R +++ + + Sbjct: 400 QFGKEYYVY---ILSDSDVYDVRSTEELLDEFRGR 431 >UniRef50_B1ZQD5 von Willebrand factor type A n=1 Tax=Opitutus terrae PB90-1 RepID=B1ZQD5_OPITP Length = 859 Score = 52.8 bits (125), Expect = 3e-05, Method: Composition-based stats. Identities = 29/139 (20%), Positives = 52/139 (37%), Gaps = 8/139 (5%) Query: 324 VCVDTSGSMGGFNEQCAKAFCLALMRIALAENRRCYIMLFSTEIVRY-ELSGPQGIEQAI 382 V +DTSGSM + + L ++ L + R ++ F+ + E Q + Sbjct: 518 VLLDTSGSMERTDRATSVRAALGVLASLLTPDDRVTLIGFARQPRLLAESLAGDQARQLV 577 Query: 383 RFLSQQ-FRGGTDLASCFRAI--MERLQSREWFDADAVVISDF---IAQRLPDDVTSKVK 436 S F GGT+L + + R V+I+D + P + ++++ Sbjct: 578 DLASTTPFTGGTNLEAALSLAGELARRHHNAAAQNRIVLITDGAANLGNADPAQLATRIE 637 Query: 437 ELQRVHQHRFHAVAMSAHG 455 L R F A + G Sbjct: 638 TL-RQQGIAFDACGVGTDG 655 >UniRef50_C5F2Q3 Phage protein (Fragment) n=5 Tax=Bacteria RepID=C5F2Q3_9HELI Length = 219 Score = 52.8 bits (125), Expect = 3e-05, Method: Composition-based stats. Identities = 27/133 (20%), Positives = 50/133 (37%), Gaps = 11/133 (8%) Query: 323 IVCVDTSGSMGGFNEQCAKAFCLALMRIALAENRRC--YIMLFSTEIVRYELSGPQGIEQ 380 IV D SGSM G + + IA R + + F + +EL I++ Sbjct: 72 IVACDVSGSMSGNPICISIGLAIY---IAQRNKGRFHNHFIDFCGDSRLHELPDNASIKE 128 Query: 381 AIRF-LSQQFRGGTDLASCF-RAIMERLQSREWFDAD----AVVISDFIAQRLPDDVTSK 434 +S T++ S AI+E L + + ++ISD + Sbjct: 129 LYDLVISSSRDMNTNIESVMVNAILETLIKNKIPKEECPKYVIIISDMEFDMCGKGKKTN 188 Query: 435 VKELQRVHQHRFH 447 ++ ++ +Q R + Sbjct: 189 IEYWKKKYQVRGY 201 Searching..................................................done Results from round 3 Score E Sequences producing significant alignments: (bits) Value Sequences used in model and found again: UniRef50_B8K8A8 von Willebrand factor, type A n=2 Tax=Vibrio Rep... 429 e-118 UniRef50_C6DJF9 Protein viaA n=166 Tax=Bacteria RepID=VIAA_PECCP 427 e-118 UniRef50_Q5E077 Stimulator of RavA ATPase activity, VWA domain c... 418 e-115 UniRef50_C4LBX1 von Willebrand factor type A n=1 Tax=Tolumonas a... 391 e-107 UniRef50_A6CWE2 Putative uncharacterized protein yieM n=3 Tax=Vi... 389 e-106 UniRef50_A4SJ14 von Willebrand factor type A domain protein n=2 ... 361 3e-98 UniRef50_A3US96 Putative uncharacterized protein (Fragment) n=1 ... 349 1e-94 UniRef50_A6FIY2 Uncharacterized protein containing a von Willebr... 337 4e-91 UniRef50_C6M593 Putative uncharacterized protein n=1 Tax=Neisser... 319 2e-85 UniRef50_A2SS27 von Willebrand factor, type A n=1 Tax=Methanocor... 306 1e-81 UniRef50_A1SXM0 von Willebrand factor, type A n=2 Tax=Alteromona... 297 8e-79 UniRef50_C1Q8W4 Uncharacterized protein containing a von Willebr... 292 2e-77 UniRef50_C1Q9X6 Uncharacterized protein containing a von Willebr... 290 8e-77 UniRef50_Q6LJM7 Putative uncharacterized protein n=1 Tax=Photoba... 288 2e-76 UniRef50_C0QY03 von Willebrand factor type A (VWA) domain contai... 282 2e-74 UniRef50_Q14PC7 Hypothetical two-component regulator system yiem... 280 8e-74 UniRef50_A6DQA4 Putative uncharacterized protein n=1 Tax=Lentisp... 275 2e-72 UniRef50_C3XEK2 Putative uncharacterized protein n=1 Tax=Helicob... 272 2e-71 UniRef50_B9KEB5 Putative uncharacterized protein n=1 Tax=Campylo... 258 2e-67 UniRef50_Q1ZC32 Putative uncharacterized protein (Fragment) n=1 ... 256 1e-66 UniRef50_A6L4M8 Putative uncharacterized protein n=9 Tax=Bactero... 242 2e-62 UniRef50_Q466I6 Putative uncharacterized protein n=3 Tax=Methano... 241 3e-62 UniRef50_A7V9J4 Putative uncharacterized protein n=7 Tax=Bactero... 236 1e-60 UniRef50_Q46D40 Putative uncharacterized protein n=3 Tax=Methano... 235 2e-60 UniRef50_D1YZJ4 Putative uncharacterized protein n=1 Tax=Methano... 233 2e-59 UniRef50_A4S4M8 Predicted protein n=4 Tax=Mamiellales RepID=A4S4... 232 3e-59 UniRef50_A4Y9K4 von Willebrand factor, type A n=3 Tax=Shewanella... 231 5e-59 UniRef50_Q5LDB9 Putative uncharacterized protein n=11 Tax=Bacter... 230 7e-59 UniRef50_C4KA81 von Willebrand factor type A (VWA) domain-contai... 230 9e-59 UniRef50_B7DQJ9 von Willebrand factor type A n=1 Tax=Alicyclobac... 229 2e-58 UniRef50_C3XJE4 Putative uncharacterized protein (Fragment) n=1 ... 229 2e-58 UniRef50_D2U0I8 Putative uncharacterized protein n=1 Tax=Arsenop... 226 2e-57 UniRef50_A6C7T1 Putative uncharacterized protein n=1 Tax=Plancto... 221 4e-56 UniRef50_Q0W1N2 Putative uncharacterized protein n=1 Tax=uncultu... 220 1e-55 UniRef50_C8SB00 von Willebrand factor type A n=1 Tax=Ferroglobus... 220 1e-55 UniRef50_C0ZE04 Putative uncharacterized protein n=1 Tax=Breviba... 211 6e-53 UniRef50_C6J3I5 Putative uncharacterized protein n=1 Tax=Paeniba... 204 5e-51 UniRef50_Q58221 Uncharacterized protein MJ0811 n=4 Tax=Methanoca... 203 9e-51 UniRef50_C9KWG4 Putative uncharacterized protein n=1 Tax=Bactero... 203 1e-50 UniRef50_A0KNK1 Putative uncharacterized protein n=1 Tax=Aeromon... 201 5e-50 UniRef50_D0LUP3 Uncharacterized protein containing a von Willebr... 195 3e-48 UniRef50_A8IX54 Predicted protein n=1 Tax=Chlamydomonas reinhard... 194 8e-48 UniRef50_A2SLM6 Uncharacterized protein containing a von Willebr... 192 3e-47 UniRef50_A8ZLC2 Putative uncharacterized protein n=3 Tax=Acaryoc... 185 3e-45 UniRef50_D0LHL0 von Willebrand factor type A n=1 Tax=Haliangium ... 185 3e-45 UniRef50_D0Z403 Putative uncharacterized protein n=1 Tax=Photoba... 185 3e-45 UniRef50_C9RHJ4 von Willebrand factor type A n=1 Tax=Methanocald... 182 2e-44 UniRef50_D1XLN5 von Willebrand factor type A n=12 Tax=Actinomyce... 182 3e-44 UniRef50_D1PED6 Putative uncharacterized protein n=1 Tax=Prevote... 178 4e-43 UniRef50_Q60384 Uncharacterized protein MJ0077 n=3 Tax=Methanoca... 176 2e-42 UniRef50_B1L0Y8 von Willebrand factor type A domain protein n=10... 175 3e-42 UniRef50_Q8EW10 Putative uncharacterized protein MYPE3970 n=1 Ta... 175 4e-42 UniRef50_B8C4H1 Predicted protein n=1 Tax=Thalassiosira pseudona... 169 3e-40 UniRef50_A3DPE5 von Willebrand factor, type A n=2 Tax=Desulfuroc... 166 2e-39 UniRef50_A7KV72 Putative metalloprotein chaperonin subunit n=1 T... 163 1e-38 UniRef50_Q2IEM5 VWA containing CoxE-like n=2 Tax=Anaeromyxobacte... 162 3e-38 UniRef50_C3NM85 von Willebrand factor type A n=14 Tax=Sulfolobac... 156 2e-36 UniRef50_D2RGP5 von Willebrand factor type A n=1 Tax=Archaeoglob... 155 3e-36 UniRef50_C8SZ21 Protein viaA (VWA domain protein interacting wit... 154 1e-35 UniRef50_A2BM85 Conserved archaeal protein n=1 Tax=Hyperthermus ... 150 1e-34 UniRef50_A7VY69 Putative uncharacterized protein n=1 Tax=Clostri... 146 3e-33 UniRef50_B9XQ17 Vault protein inter-alpha-trypsin domain protein... 141 7e-32 UniRef50_Q1DE81 von Willebrand factor type A domain protein n=2 ... 137 6e-31 UniRef50_Q3IHK0 Putative uncharacterized protein n=2 Tax=Alterom... 134 6e-30 UniRef50_UPI00003C852B hypothetical protein Faci_06871 n=1 Tax=F... 134 1e-29 UniRef50_Q0AMP5 Vault protein inter-alpha-trypsin domain protein... 133 1e-29 UniRef50_A3W9L9 Putative uncharacterized protein n=2 Tax=Erythro... 133 1e-29 UniRef50_UPI0001AEB92D inter-alpha-trypsin inhibitor domain-cont... 133 2e-29 UniRef50_D0KVI6 Vault protein inter-alpha-trypsin domain protein... 132 2e-29 UniRef50_C5EGH1 von Willebrand factor n=2 Tax=Clostridiales RepI... 132 2e-29 UniRef50_Q9YD81 Putative uncharacterized protein n=1 Tax=Aeropyr... 132 2e-29 UniRef50_D2R2I7 von Willebrand factor type A n=2 Tax=Planctomyce... 132 3e-29 UniRef50_B5ZY26 Vault protein inter-alpha-trypsin domain protein... 132 4e-29 UniRef50_B5JSY7 Inter-alpha-trypsin inhibitor domain protein n=1... 131 6e-29 UniRef50_A7RKA1 Predicted protein n=2 Tax=Nematostella vectensis... 131 7e-29 UniRef50_B0UK93 LPXTG-motif cell wall anchor domain protein n=1 ... 131 8e-29 UniRef50_B7RYC9 Vault protein inter-alpha-trypsin n=1 Tax=marine... 131 9e-29 UniRef50_Q21PJ3 von Willebrand factor, type A n=1 Tax=Saccharoph... 130 1e-28 UniRef50_B8CPU4 Von Willebrand factor, type A n=1 Tax=Shewanella... 130 1e-28 UniRef50_Q3V4Q4 Putative VWFA domain-containing protein ORF892 n... 130 1e-28 UniRef50_A7HVH6 Vault protein inter-alpha-trypsin domain protein... 129 2e-28 UniRef50_A0P2E4 Von Willebrand factor type A like domain n=1 Tax... 129 2e-28 UniRef50_A6X8G3 LPXTG-motif cell wall anchor domain protein n=11... 129 3e-28 UniRef50_A4C730 Putative uncharacterized protein n=1 Tax=Pseudoa... 129 3e-28 UniRef50_A1VI76 Vault protein inter-alpha-trypsin domain protein... 128 4e-28 UniRef50_B9XLE8 Vault protein inter-alpha-trypsin domain protein... 128 4e-28 UniRef50_D0LTR8 von Willebrand factor type A n=1 Tax=Haliangium ... 127 7e-28 UniRef50_C4WI90 Poly [ADP-ribose] polymerase 4 n=1 Tax=Ochrobact... 127 8e-28 UniRef50_Q6KZN8 Putative uncharacterized protein n=1 Tax=Picroph... 127 9e-28 UniRef50_Q09E12 Inter-alpha-inhibitor H4 heavy chain, putative n... 127 1e-27 UniRef50_A6WMD3 Vault protein inter-alpha-trypsin domain protein... 126 2e-27 UniRef50_Q80UW6 Parp4 protein (Fragment) n=11 Tax=Eukaryota RepI... 126 2e-27 UniRef50_Q15NW6 Vault protein inter-alpha-trypsin n=1 Tax=Pseudo... 126 2e-27 UniRef50_Q2SQR4 Uncharacterized protein containing a von Willebr... 126 3e-27 UniRef50_UPI00016C377F protein containing a von Willebrand facto... 126 3e-27 UniRef50_B2A702 von Willebrand factor type A n=1 Tax=Natranaerob... 125 3e-27 UniRef50_A8IQV8 Predicted protein (Fragment) n=1 Tax=Chlamydomon... 125 4e-27 UniRef50_Q09DT2 Inter-alpha-trypsin inhibitor family heavy chain... 125 4e-27 UniRef50_UPI000174639B hypothetical protein VspiD_18615 n=1 Tax=... 124 6e-27 UniRef50_A6CIG8 Putative uncharacterized protein n=1 Tax=Bacillu... 124 7e-27 UniRef50_B1HPN0 Putative uncharacterized protein n=2 Tax=Bacilla... 124 8e-27 UniRef50_B9RR85 Inter-alpha-trypsin inhibitor heavy chain, putat... 124 9e-27 UniRef50_D2R2E3 TROVE domain protein n=1 Tax=Pirellula staleyi D... 124 1e-26 UniRef50_A3QDW1 Vault protein inter-alpha-trypsin domain protein... 123 1e-26 UniRef50_A6G7V2 von Willebrand factor, type A n=1 Tax=Plesiocyst... 123 2e-26 UniRef50_B4X134 Vault protein inter-alpha-trypsin n=1 Tax=Alcani... 122 2e-26 UniRef50_A7HHW8 Vault protein inter-alpha-trypsin domain protein... 122 2e-26 UniRef50_Q1YZ74 Inter-alpha-trypsin inhibitor domain protein n=1... 122 3e-26 UniRef50_Q6A0B1 MKIAA0177 protein (Fragment) n=4 Tax=Murinae Rep... 122 3e-26 UniRef50_B2HK18 Conserved membrane protein n=3 Tax=Mycobacterium... 122 3e-26 UniRef50_B0CG18 von Willebrand factor type A domain protein, put... 122 4e-26 UniRef50_A0LHW4 Vault protein inter-alpha-trypsin domain protein... 122 4e-26 UniRef50_D1A557 Vault protein inter-alpha-trypsin domain protein... 122 4e-26 UniRef50_A1S752 Inter-alpha-trypsin inhibitor domain protein n=1... 122 4e-26 UniRef50_A1U6Y4 Vault protein inter-alpha-trypsin domain protein... 121 4e-26 UniRef50_Q083T9 Vault protein inter-alpha-trypsin domain protein... 121 5e-26 UniRef50_C7FPD9 Uncharacterized protein n=2 Tax=environmental sa... 121 5e-26 UniRef50_A8U1E2 Putative uncharacterized protein n=1 Tax=alpha p... 121 6e-26 UniRef50_B4VT64 Vault protein inter-alpha-trypsin n=9 Tax=Cyanob... 121 8e-26 UniRef50_A8M9M1 von Willebrand factor type A n=1 Tax=Caldivirga ... 120 1e-25 UniRef50_UPI0001744662 Vault protein inter-alpha-trypsin domain ... 120 1e-25 UniRef50_A6GBY0 Putative uncharacterized protein n=1 Tax=Plesioc... 120 2e-25 UniRef50_B9GVZ4 Predicted protein n=4 Tax=rosids RepID=B9GVZ4_POPTR 119 2e-25 UniRef50_UPI0000D9E789 PREDICTED: similar to poly (ADP-ribose) p... 118 5e-25 UniRef50_B1KDL1 LPXTG-motif cell wall anchor domain protein n=1 ... 117 7e-25 UniRef50_C5CDX4 von Willebrand factor type A n=1 Tax=Kosmotoga o... 117 9e-25 UniRef50_B7AA98 von Willebrand factor type A n=3 Tax=Thermus Rep... 117 1e-24 UniRef50_A1ZFT4 Von Willebrand factor, type A n=1 Tax=Microscill... 117 1e-24 UniRef50_A8FW78 Vault protein inter-alpha-trypsin domain protein... 116 1e-24 UniRef50_Q2B7C3 Putative uncharacterized protein n=1 Tax=Bacillu... 116 2e-24 UniRef50_A9U149 Predicted protein n=1 Tax=Physcomitrella patens ... 116 2e-24 UniRef50_UPI0000E105CF vault protein inter-alpha-trypsin n=1 Tax... 116 3e-24 UniRef50_A8H5J9 LPXTG-motif cell wall anchor domain n=1 Tax=Shew... 116 3e-24 UniRef50_Q7UNM0 Putative uncharacterized protein n=1 Tax=Rhodopi... 116 3e-24 UniRef50_Q4SBF6 Chromosome 11 SCAF14674, whole genome shotgun se... 116 3e-24 UniRef50_A6G2V8 von Willebrand factor, type A n=1 Tax=Plesiocyst... 115 3e-24 UniRef50_C4DQN3 Uncharacterized protein containing a von Willebr... 115 5e-24 UniRef50_Q60ED8 Von Willebrand factor type A domain containing p... 115 5e-24 UniRef50_Q01UI0 von Willebrand factor, type A n=1 Tax=Candidatus... 114 6e-24 UniRef50_Q9UKK3 Poly [ADP-ribose] polymerase 4 n=14 Tax=Eutheria... 114 6e-24 UniRef50_Q22SJ4 von Willebrand factor type A domain containing p... 114 6e-24 UniRef50_A6G415 von Willebrand factor, type A n=1 Tax=Plesiocyst... 114 9e-24 UniRef50_Q10JU7 Von Willebrand factor type A domain containing p... 114 1e-23 UniRef50_Q7UL83 Inter-alpha-trypsin inhibitor family heavy chain... 113 1e-23 UniRef50_A8S006 Putative uncharacterized protein n=1 Tax=Clostri... 113 1e-23 UniRef50_A9SQ90 Predicted protein n=3 Tax=Physcomitrella patens ... 113 2e-23 UniRef50_UPI0001788F71 von Willebrand factor type A n=1 Tax=Geob... 113 2e-23 UniRef50_A0DZ93 Chromosome undetermined scaffold_7, whole genome... 113 2e-23 UniRef50_B5W7H4 von Willebrand factor type A n=1 Tax=Arthrospira... 112 2e-23 UniRef50_Q86UX2 Inter-alpha-trypsin inhibitor heavy chain H5 n=4... 112 2e-23 UniRef50_B3QTN9 Vault protein inter-alpha-trypsin domain protein... 112 2e-23 UniRef50_UPI0001C38CFE von Willebrand factor, type A n=1 Tax=Art... 112 3e-23 UniRef50_UPI00006CDDCC von Willebrand factor type A domain conta... 112 3e-23 UniRef50_Q9FF49 Retroelement pol polyprotein-like n=34 Tax=Magno... 112 4e-23 UniRef50_C3YPR2 Putative uncharacterized protein (Fragment) n=1 ... 112 4e-23 UniRef50_A6DLI0 Putative uncharacterized protein n=1 Tax=Lentisp... 111 5e-23 UniRef50_C5WYU9 Putative uncharacterized protein Sb01g047460 n=3... 111 7e-23 UniRef50_B5JQC2 Vault protein inter-alpha-trypsin n=1 Tax=Verruc... 110 8e-23 UniRef50_Q10RY0 Os03g0142500 protein n=2 Tax=Oryza sativa RepID=... 110 8e-23 UniRef50_B0TQ23 LPXTG-motif cell wall anchor domain n=1 Tax=Shew... 110 1e-22 UniRef50_Q47YR5 Von Willebrand factor type A domain protein n=2 ... 110 1e-22 UniRef50_UPI00006A1915 Transmembrane protein 110. n=2 Tax=Xenopu... 110 1e-22 UniRef50_D1HHA4 Whole genome shotgun sequence of line PN40024, s... 110 1e-22 UniRef50_Q54DU5 von Willebrand factor A domain-containing protei... 110 1e-22 UniRef50_A2E6Y7 von Willebrand factor type A domain containing p... 110 2e-22 UniRef50_Q55874 Uncharacterized protein sll0103 n=16 Tax=Cyanoba... 109 2e-22 UniRef50_A0EHG0 Chromosome undetermined scaffold_97, whole genom... 109 2e-22 UniRef50_Q9U7P4 Putative uncharacterized protein (Fragment) n=1 ... 109 3e-22 UniRef50_A1SAA4 Uncharacterized protein containing a von Willebr... 109 3e-22 UniRef50_UPI0000E80A5E PREDICTED: similar to calcium-activated c... 109 3e-22 UniRef50_A3ZR58 Putative uncharacterized protein n=1 Tax=Blastop... 108 4e-22 UniRef50_Q9LMB7 F14D16.26 n=5 Tax=rosids RepID=Q9LMB7_ARATH 108 4e-22 UniRef50_D1YYY2 Putative uncharacterized protein n=1 Tax=Methano... 108 5e-22 UniRef50_Q6PGW2 Zgc:112265 protein (Fragment) n=15 Tax=Clupeocep... 108 5e-22 UniRef50_Q1D9B7 von Willebrand factor type A domain protein n=3 ... 108 6e-22 UniRef50_UPI00017B0D26 UPI00017B0D26 related cluster n=2 Tax=Tet... 107 7e-22 UniRef50_P79263 Inter-alpha-trypsin inhibitor heavy chain H4 n=3... 107 7e-22 UniRef50_A5UYN7 von Willebrand factor, type A n=1 Tax=Roseiflexu... 107 7e-22 UniRef50_A2E1S5 von Willebrand factor type A domain containing p... 107 7e-22 UniRef50_C7PW75 Vault protein inter-alpha-trypsin domain protein... 107 8e-22 UniRef50_A9WI94 von Willebrand factor type A n=2 Tax=Chloroflexu... 107 8e-22 UniRef50_Q67LZ3 Putative uncharacterized protein n=1 Tax=Symbiob... 107 9e-22 UniRef50_B8HNU4 von Willebrand factor type A n=4 Tax=Chroococcal... 107 1e-21 UniRef50_Q9LN03 T6D22.13 n=2 Tax=Arabidopsis thaliana RepID=Q9LN... 107 1e-21 UniRef50_UPI000180BC4A PREDICTED: similar to predicted protein n... 107 1e-21 UniRef50_B2HDT6 Putative uncharacterized protein n=3 Tax=Mycobac... 107 1e-21 UniRef50_A9V370 Predicted protein n=1 Tax=Monosiga brevicollis R... 107 1e-21 UniRef50_Q23JA0 von Willebrand factor type A domain containing p... 106 2e-21 UniRef50_A5UWS5 von Willebrand factor, type A n=2 Tax=Roseiflexu... 106 2e-21 UniRef50_Q9M1S2 Putative uncharacterized protein T5N23_140 n=5 T... 106 2e-21 UniRef50_C0Z595 Putative uncharacterized protein n=1 Tax=Breviba... 105 2e-21 UniRef50_B8HUC4 von Willebrand factor type A n=7 Tax=Bacteria Re... 105 2e-21 UniRef50_B8G7Y1 von Willebrand factor type A n=3 Tax=Chloroflexu... 105 3e-21 UniRef50_B9GK57 Predicted protein (Fragment) n=2 Tax=Populus tri... 105 3e-21 UniRef50_A3MT69 VWA containing CoxE family protein n=4 Tax=Pyrob... 105 3e-21 UniRef50_Q54DV3 von Willebrand factor A domain-containing protei... 105 3e-21 UniRef50_Q1NTK1 Von Willebrand factor, type A n=2 Tax=delta prot... 105 4e-21 UniRef50_Q114A2 von Willebrand factor, type A n=3 Tax=Cyanobacte... 105 5e-21 UniRef50_UPI00006CDA4D Glutathionylspermidine synthase family pr... 105 5e-21 UniRef50_Q8H924 Os10g0464900 protein n=3 Tax=Oryza sativa RepID=... 104 5e-21 UniRef50_A9FM70 Putative uncharacterized protein n=1 Tax=Sorangi... 104 6e-21 UniRef50_Q7G2L9 Os10g0464500 protein n=3 Tax=Magnoliophyta RepID... 104 6e-21 UniRef50_C7N1C6 Uncharacterized protein n=1 Tax=Slackia heliotri... 104 6e-21 UniRef50_Q3M2E0 von Willebrand factor, type A n=2 Tax=Cyanobacte... 104 6e-21 UniRef50_Q22ST4 von Willebrand factor type A domain containing p... 104 7e-21 UniRef50_Q7MCW9 Uncharacterized protein n=2 Tax=Vibrio vulnificu... 104 8e-21 UniRef50_C3ZT39 Putative uncharacterized protein n=1 Tax=Branchi... 104 9e-21 UniRef50_Q10Z89 von Willebrand factor, type A n=1 Tax=Trichodesm... 104 1e-20 UniRef50_Q54CQ8 von Willebrand factor A domain-containing protei... 103 1e-20 UniRef50_A0LPD4 von Willebrand factor, type A n=1 Tax=Syntrophob... 103 1e-20 UniRef50_UPI0000ECD6E7 Poly [ADP-ribose] polymerase 4 (EC 2.4.2.... 103 1e-20 UniRef50_Q0VTG8 Protein containing a von Willebrand factor type ... 103 1e-20 UniRef50_Q54MG4 von Willebrand factor A domain-containing protei... 103 2e-20 UniRef50_D0LL92 von Willebrand factor type A n=1 Tax=Haliangium ... 103 2e-20 UniRef50_UPI00016C38A3 LPXTG-motif cell wall anchor domain prote... 103 2e-20 UniRef50_Q498Q0 Inter-alpha (Globulin) inhibitor H3 n=6 Tax=Clup... 103 2e-20 UniRef50_Q55G98 von Willebrand factor A domain-containing protei... 103 2e-20 UniRef50_B6HQ22 Pc22g19800 protein n=1 Tax=Penicillium chrysogen... 103 2e-20 UniRef50_D2VKS7 von Willebrand factor type A domain-containing p... 102 3e-20 UniRef50_C0PS55 Putative uncharacterized protein n=1 Tax=Picea s... 102 4e-20 UniRef50_D0IVS5 Putative uncharacterized protein n=3 Tax=Bacteri... 102 4e-20 UniRef50_B4BQC0 von Willebrand factor type A n=2 Tax=Geobacillus... 102 4e-20 UniRef50_UPI0000E4A663 PREDICTED: similar to calcium activated c... 101 5e-20 UniRef50_Q897H0 Membrane-associated protein n=1 Tax=Clostridium ... 101 5e-20 UniRef50_C9RRF6 von Willebrand factor type A n=3 Tax=Bacteria Re... 101 5e-20 UniRef50_A5B5Z1 Putative uncharacterized protein n=2 Tax=Vitis v... 101 6e-20 UniRef50_P76396 Uncharacterized protein yegL n=64 Tax=cellular o... 101 7e-20 UniRef50_A4XHD9 von Willebrand factor, type A n=2 Tax=Clostridia... 101 8e-20 UniRef50_Q24FW2 von Willebrand factor type A domain containing p... 101 8e-20 UniRef50_A4WJJ3 Putative uncharacterized protein n=5 Tax=Thermop... 100 8e-20 UniRef50_D1CCX6 von Willebrand factor type A n=1 Tax=Thermobacul... 100 1e-19 UniRef50_C1XFI8 Mg-chelatase subunit ChlD n=2 Tax=Meiothermus Re... 100 1e-19 UniRef50_C7P2A9 von Willebrand factor type A n=2 Tax=Halobacteri... 100 1e-19 UniRef50_A4YGU7 von Willebrand factor, type A n=12 Tax=Sulfoloba... 100 1e-19 Sequences not found previously or not previously below threshold: UniRef50_Q8YNZ7 Alr4412 protein n=6 Tax=Cyanobacteria RepID=Q8YN... 114 9e-24 UniRef50_B4VXM6 Vault protein inter-alpha-trypsin n=1 Tax=Microc... 109 4e-22 UniRef50_UPI000180CCF8 PREDICTED: similar to PK-120 n=1 Tax=Cion... 108 5e-22 UniRef50_UPI00016E8A41 UPI00016E8A41 related cluster n=3 Tax=Tak... 108 6e-22 UniRef50_Q14624 35 kDa inter-alpha-trypsin inhibitor heavy chain... 107 8e-22 UniRef50_A7NJ01 von Willebrand factor type A n=2 Tax=Roseiflexus... 105 4e-21 UniRef50_Q7SGD8 Predicted protein n=4 Tax=Sordariales RepID=Q7SG... 105 4e-21 UniRef50_UPI000155CC23 PREDICTED: similar to ITI-like protein n=... 105 5e-21 UniRef50_A0LPK8 Vault protein inter-alpha-trypsin domain protein... 104 9e-21 UniRef50_UPI00016DFBC7 UPI00016DFBC7 related cluster n=4 Tax=Tak... 104 1e-20 UniRef50_A9RSX3 Predicted protein n=1 Tax=Physcomitrella patens ... 104 1e-20 UniRef50_A7RNW3 Predicted protein n=3 Tax=Nematostella vectensis... 104 1e-20 UniRef50_C3ZG18 Putative uncharacterized protein n=1 Tax=Branchi... 103 2e-20 UniRef50_Q6UXX5 Inter-alpha-trypsin inhibitor heavy chain H5-lik... 102 4e-20 UniRef50_Q5RHF3 Novel protein similar to vertebrate inter-alpha ... 101 7e-20 UniRef50_A7RTF3 Predicted protein (Fragment) n=1 Tax=Nematostell... 100 1e-19 >UniRef50_B8K8A8 von Willebrand factor, type A n=2 Tax=Vibrio RepID=B8K8A8_VIBPA Length = 481 Score = 429 bits (1103), Expect = e-118, Method: Composition-based stats. Identities = 156/484 (32%), Positives = 275/484 (56%), Gaps = 6/484 (1%) Query: 1 MLTLDTLNVMLAVSEEGLIEEMIIALLASPQLAVFFEKFPRLKAAITDDVPRWREALRSR 60 ML D LN+ L +++ G+I+ + L+A Q+ E +K+++ + + +WR +++ R Sbjct: 1 MLGADGLNLALMIADSGIIDTAVNDLMARSQMMAVAENR-GVKSSVKNHLLKWRGSVKKR 59 Query: 61 LKDARVPPELTEEVMCYQQSQLLSTPQFIVQLPQILDLLHRLNSPWAEQARQLVDANSTI 120 + +E+ YQ+ QF Q+P+++ L +S + QAR+L++ N + Sbjct: 60 ITKVCETERFQQELALYQEVIYWDEAQFFEQIPEVIKKL-EWHSAFYLQARRLMEKNKGV 118 Query: 121 TSALH-TLFLQRWRLSLIVQATTLNQQLLEEEREQLLSEVQERMTLSGQLEPILADNNTA 179 + + F +W SL LE +E++L ++ +RM ++ + + + Sbjct: 119 NNPMFPHYFCDQWYESLSDAIRQAQLTELEANKEKVLKDLYQRMETMKNMDKVTEEGDEG 178 Query: 180 A-GRLWDMSAGQLKRGDYQLIVKYGEFLNEQPELKRLAEQLGRSR-EAKSIPRNDAQMET 237 + GRLWDM++ +L + D ++ ++ EFL + L+ +AE+LGR + N A +E Sbjct: 179 SVGRLWDMASARLSKTDLTVMKRHAEFLKKNQGLQEIAEKLGRMASQVDDPDLNKAPLEE 238 Query: 238 FRTMVREPATVPEQVDGLQQSDDILRLLPPELATLGITELEYEFYRRLVEKQLLTYRLHG 297 + + + + + G+ + DD+ +LLP E L ELE FY+ LV+K+L+ Y++ G Sbjct: 239 PQIVEEKSDKATDDIVGIHEGDDLNKLLPNETMFLAYPELEVVFYKHLVDKRLMNYKMQG 298 Query: 298 ESWREKVIERPVVHKDYDEQPRGPFIVCVDTSGSMGGFNEQCAKAFCLALMRIALAENRR 357 +S + + + +GPFI+CVD SGSM GF EQCAKA ALM+IALAE+R Sbjct: 299 KSRTLRKVRAQKPDNAQVDVEKGPFIICVDASGSMSGFPEQCAKAMAYALMQIALAEDRD 358 Query: 358 CYIMLFSTEIVRYELSGPQGIEQAIRFLSQQFRGGTDLASCFRAIMERLQSREWFDADAV 417 CY++LFS+E + YEL+ G+ +A FL+ F GGTDL ++ + ++ +AD V Sbjct: 359 CYVILFSSEQITYELTKQDGLREASDFLTYSFHGGTDLEPVLMKSIDLMTGDKYRNADMV 418 Query: 418 VISDFIAQRLPDDVTSKVKELQRVHQHRFHAVAMSAHGKPGIMRIFDHIWRFDTGMRSRL 477 VISDFIA + +++ +KV EL + H++RFHA+++S +G P +M +FDH W + + R+ Sbjct: 419 VISDFIAPKQSEEMIAKVDEL-KEHKNRFHAISLSKYGNPELMTMFDHCWSYHPNLMGRI 477 Query: 478 LRRW 481 +++W Sbjct: 478 MKKW 481 >UniRef50_C6DJF9 Protein viaA n=166 Tax=Bacteria RepID=VIAA_PECCP Length = 492 Score = 427 bits (1098), Expect = e-118, Method: Composition-based stats. Identities = 307/487 (63%), Positives = 389/487 (79%), Gaps = 5/487 (1%) Query: 1 MLTLDTLNVMLAVSEEGLIEEMIIALLASPQLAVFFEKFPRLKAAITDDVPRWREALRSR 60 M+TL++L ++L++ E L++++++ LLA+PQLA FFEK+P LK+A+ +D+P W+E L+ R Sbjct: 1 MITLESLEMLLSIDENELLDDLVVTLLATPQLAFFFEKYPSLKSALLNDLPHWKETLKQR 60 Query: 61 LKDARVPPELTEEVMCYQQSQLLSTPQFIVQLPQILDLLHRLNSPWAEQARQLVDAN--- 117 L+ +VPP+L +E CYQ+SQ + F +LP I+D L + SP+ QA QL+ A Sbjct: 61 LRTTQVPPDLEKEFSCYQRSQSIDNQAFQTRLPAIMDTLSNVESPFLTQASQLITAPERT 120 Query: 118 --STITSALHTLFLQRWRLSLIVQATTLNQQLLEEEREQLLSEVQERMTLSGQLEPILAD 175 +TS LH LFLQRWRLSL +Q +L+QQL+E+ERE LL E+Q+R+TLSG+LEPILA+ Sbjct: 121 LGQKVTSGLHALFLQRWRLSLTLQTVSLHQQLMEQEREILLDELQQRLTLSGKLEPILAE 180 Query: 176 NNTAAGRLWDMSAGQLKRGDYQLIVKYGEFLNEQPELKRLAEQLGRSREAKSIPRNDAQM 235 N AAGRLWD+SA Q + D + ++ +G FL QP L++LAE+LGRSRE KSI +A Sbjct: 181 NENAAGRLWDLSAAQRIQTDPRPLLDFGAFLQRQPALQKLAERLGRSRETKSILTQEAPK 240 Query: 236 ETFRTMVREPATVPEQVDGLQQSDDILRLLPPELATLGITELEYEFYRRLVEKQLLTYRL 295 E FR VREPATVPEQV G+ QSDDILRL+P EL TLGI+ELEYEFYRRL+E +LLTYRL Sbjct: 241 EAFRVSVREPATVPEQVSGVHQSDDILRLMPTELVTLGISELEYEFYRRLLEHRLLTYRL 300 Query: 296 HGESWREKVIERPVVHKDYDEQPRGPFIVCVDTSGSMGGFNEQCAKAFCLALMRIALAEN 355 GESWREK+ ERPVVH+ ++QPRGPFIVCVDTSGSMGGFNE+CAKAFCLALMRIALA+N Sbjct: 301 QGESWREKITERPVVHQQNEQQPRGPFIVCVDTSGSMGGFNERCAKAFCLALMRIALADN 360 Query: 356 RRCYIMLFSTEIVRYELSGPQGIEQAIRFLSQQFRGGTDLASCFRAIMERLQSREWFDAD 415 RRCYIMLFST +V+YEL+ G+EQAIRFLSQ FRGGTD+++C A+++++ W DAD Sbjct: 361 RRCYIMLFSTGVVKYELTSADGLEQAIRFLSQSFRGGTDMSACLSALLDKMDDALWHDAD 420 Query: 416 AVVISDFIAQRLPDDVTSKVKELQRVHQHRFHAVAMSAHGKPGIMRIFDHIWRFDTGMRS 475 AVVISDFIAQRLPD+V +KVK Q QHRFHAVAMS HGKPGIM IFDHIWRFDTG++S Sbjct: 421 AVVISDFIAQRLPDEVVNKVKSRQTQLQHRFHAVAMSDHGKPGIMHIFDHIWRFDTGLKS 480 Query: 476 RLLRRWR 482 RL+RRW+ Sbjct: 481 RLMRRWQ 487 >UniRef50_Q5E077 Stimulator of RavA ATPase activity, VWA domain containing n=61 Tax=Vibrionales RepID=Q5E077_VIBF1 Length = 482 Score = 418 bits (1075), Expect = e-115, Method: Composition-based stats. Identities = 156/480 (32%), Positives = 266/480 (55%), Gaps = 5/480 (1%) Query: 1 MLTLDTLNVMLAVSEEGLIEEMIIALLASPQLAVFFEKFPRLKAAITDDVPRWREALRSR 60 ML D LN+++ V+E G+I+ I +L+ PQ + P +K I + + +WR ++ + Sbjct: 1 MLGADALNLVMMVAESGMIDSSIAEILSRPQFLTAAKSNPNIKPTIKNHILKWRGKVKHK 60 Query: 61 LKDARVPPELTEEVMCYQQSQLLSTPQFIVQLPQILDLLHRLNSPWAEQARQLVDANSTI 120 + + +E+ YQ S +F ++ +I+ L + +S + +A+QL + N + Sbjct: 61 MTKVCETERIQDELALYQDVIHWSENEFYQRIDEIISKL-KWHSAFYVEAKQLANDNKGL 119 Query: 121 TSALH-TLFLQRWRLSLIVQATTLNQQLLEEEREQLLSEVQERMTLSGQLEPIL-ADNNT 178 + + F +RW SL L+E++E+LL+++ +R+ +E + + Sbjct: 120 MNPMFPRFFCERWYQSLSDAIKKAQLSELKEDKEKLLADLYQRIETLKTMESVTAEGDEA 179 Query: 179 AAGRLWDMSAGQLKRGDYQLIVKYGEFLNEQPELKRLAEQLGRSR-EAKSIPRNDAQMET 237 G+LWDM++ +L + + ++ + FL + L+ +A +LGR EA+ ++ A E Sbjct: 180 QIGKLWDMASAKLTKSNVDIMKLHARFLKKNKGLQDIASKLGRMANEAEHSDKSQAMAEE 239 Query: 238 FRTMVREPATVPEQVDGLQQSDDILRLLPPELATLGITELEYEFYRRLVEKQLLTYRLHG 297 + + + V + + G+ + DD+ RLLP E L ELE FY+ L++K+L+ YR+ G Sbjct: 240 VKVVEEKSDFVTDDIVGVHEGDDLSRLLPNETLFLSHPELEVIFYQHLIDKRLMNYRMQG 299 Query: 298 ESWREKVIERPVVHKDYDEQPRGPFIVCVDTSGSMGGFNEQCAKAFCLALMRIALAENRR 357 + + + +GPF+VCVD SGSM GF EQCAKA LM+IALAE R Sbjct: 300 ADRKLRKVTTQSRAASNALIEKGPFVVCVDASGSMSGFPEQCAKALAYGLMQIALAEERD 359 Query: 358 CYIMLFSTEIVRYELSGPQGIEQAIRFLSQQFRGGTDLASCFRAIMERLQSREWFDADAV 417 CY++LFST+ + YELS G+++ FLS +F GGTDL ++ + ++ +AD V Sbjct: 360 CYVILFSTQQITYELSKQDGLKEVADFLSYKFHGGTDLEPVLEKSIQLMHGDKYKNADLV 419 Query: 418 VISDFIAQRLPDDVTSKVKELQRVHQHRFHAVAMSAHGKPGIMRIFDHIWRFDTGMRSRL 477 V+SDFIA + + + V +L++ HQ+RFHAV +S +G P +M +FDH W + M RL Sbjct: 420 VLSDFIAPTHSEKIDAMVGDLKK-HQNRFHAVCLSKYGNPALMAMFDHTWAYHPSMLGRL 478 >UniRef50_C4LBX1 von Willebrand factor type A n=1 Tax=Tolumonas auensis DSM 9187 RepID=C4LBX1_TOLAT Length = 479 Score = 391 bits (1004), Expect = e-107, Method: Composition-based stats. Identities = 182/484 (37%), Positives = 304/484 (62%), Gaps = 7/484 (1%) Query: 1 MLTLDTLNVMLAVSEEGLIEEMIIALLASPQLAVFFEKFPRLKAAITDDVPRWREALRSR 60 M+ L TL+++L+++E ++++++ +++SPQ++ F + P + + V +W +++ ++ Sbjct: 1 MVDLQTLSLLLSINETQMVQDLVSTVMSSPQVSQFMHEHPLFFKNVQEHVQQWSQSIPAQ 60 Query: 61 LKDARVPPELTEEVMCYQQSQLLSTPQFIVQLPQILDLLHRLNSPWAEQARQLVDANSTI 120 LK+ VP +L +E + Y +Q LS QF Q +L L +S + A+ L+ S Sbjct: 61 LKNIPVPDDLQQEYILYLDAQGLSAEQFTQQSADLLVQLQ--HSDFHTDAQNLLLTLSQA 118 Query: 121 TSALH-TLFLQRWRLSLIVQATTLNQQLLEEEREQLLSEVQERMTLSGQLEPILADNNTA 179 + LF+Q+WR L+ Q +L E+ERE++L +++ RM ++G+L+ LA + Sbjct: 119 NAHNRKQLFIQKWREHLVSQVLSLEIIFAEQERERMLQQLELRMQVAGELDETLAPQH-- 176 Query: 180 AGRLWDMSAGQLKRGDYQLIVKYGEFLNEQPELKRLAEQLGRSR-EAKSIPRNDAQMETF 238 G+LWD++A L +G+ L Y FL PELK++A+ LGR+ + S ++ET Sbjct: 177 PGKLWDLTATHLLQGNSSLFRHYASFLTHNPELKKIADALGRAATQDSSAEEQINRVETA 236 Query: 239 RTMVREPATVPEQVDGLQQSDDILRLLPPELATLGITELEYEFYRRLVEKQLLTYRLHGE 298 + VP+ + G+ QS+++ RL+ E L ELE FY++L E++LL Y+ G+ Sbjct: 237 EWQTVQHEQVPDDLVGIHQSNELNRLISSETVLLTEPELETVFYKQLAERRLLNYQFMGQ 296 Query: 299 SWREKVIERPVVHKDYDEQPRGPFIVCVDTSGSMGGFNEQCAKAFCLALMRIALAENRRC 358 S + + + +GPFIVC+DTSGSM G+ E CAK FC AL++IAL+ENR C Sbjct: 297 SRSLETVMSEQRTFGETQDTKGPFIVCIDTSGSMSGYPEDCAKGFCFALLQIALSENRAC 356 Query: 359 YIMLFSTEIVRYELSGPQGIEQAIRFLSQQFRGGTDLASCFRAIMERLQSREWFDADAVV 418 IMLFST++V YEL+GP+G+++A+ FL F+GGTDL C + +M +Q + +ADAVV Sbjct: 357 VIMLFSTDVVTYELTGPEGLQEALNFLGCSFKGGTDLEPCMQQVMHYMQQARFSNADAVV 416 Query: 419 ISDFIAQRLPDDVTSKVKELQRVHQHRFHAVAMSAHGKPGIMRIFDHIWRFDTGMRSRLL 478 +SDFIAQRL + + ++++R + +RF+AV++S HGKP +M+IFD++W+FDT + R+L Sbjct: 417 LSDFIAQRLSVETEQQAQQIKR-NGNRFNAVSLSRHGKPALMKIFDNVWKFDTSLSGRVL 475 Query: 479 RRWR 482 R+ R Sbjct: 476 RKVR 479 >UniRef50_A6CWE2 Putative uncharacterized protein yieM n=3 Tax=Vibrio RepID=A6CWE2_9VIBR Length = 497 Score = 389 bits (998), Expect = e-106, Method: Composition-based stats. Identities = 144/483 (29%), Positives = 262/483 (54%), Gaps = 6/483 (1%) Query: 1 MLTLDTLNVMLAVSEEGLIEEMIIALLASPQLAVFFEKFPRLKAAITDDVPRWREALRSR 60 ML D+LN+ + V+E G+I+ + ++ L LK ++T + +W ++++ + Sbjct: 1 MLGADSLNLAMMVAESGIIDSAVRDIMQQTDLLAMGSD-EGLKQSLTASMAKWSKSVKRK 59 Query: 61 LKDARVPPELTEEVMCYQQSQLLSTPQFIVQLPQILDLLHRLNSPWAEQARQLV-DANST 119 L + L E+ YQ++ L+ +F QL Q+++ L +S + +ARQL D ++ Sbjct: 60 LVKGQETESLQSELELYQRAVYLTEQEFDDQLSQLIEQLPE-DSHFLPKARQLASDIDAY 118 Query: 120 ITSALHTLFLQRWRLSLIVQATTLNQQLLEEEREQLLSEVQERMTLSGQLEPILADNNTA 179 S F ++W SL + QQ +++++ + L ++ +++ +E + Sbjct: 119 PRSLFARQFCKQWYESLKQAVESKQQQTVDQQKSKFLKQMYQKIDTLKDMENLQEGGEQG 178 Query: 180 A-GRLWDMSAGQLKRGDYQLIVKYGEFLNEQPELKRLAEQLGRSREAKSIPRNDAQMETF 238 GRLWD++ +L + D++ I + E+L ELK +A++LGR E P + + Sbjct: 179 KLGRLWDLAGAELTKQDWRHIERTAEYLENNQELKHIADKLGRMAEEVDAPELNKALSHD 238 Query: 239 RTMVR-EPATVPEQVDGLQQSDDILRLLPPELATLGITELEYEFYRRLVEKQLLTYRLHG 297 +V + + + G+ +S+DI LLP E L ELE FY+ LVEK+LLTY+ G Sbjct: 239 EVVVEEKTDFATDDIVGIHESNDINNLLPNETMYLAYPELETIFYQHLVEKRLLTYKSEG 298 Query: 298 ESWREKVIERPVVHKDYDEQPRGPFIVCVDTSGSMGGFNEQCAKAFCLALMRIALAENRR 357 + + + P ++ GP ++ +D SGSM G E+ AKA +LM++A + R Sbjct: 299 KQRTVRQLHSPKTATGEADKETGPMLIAIDVSGSMQGAPEKSAKAIAYSLMKMAAQQQRE 358 Query: 358 CYIMLFSTEIVRYELSGPQGIEQAIRFLSQQFRGGTDLASCFRAIMERLQSREWFDADAV 417 C+++LFS+ + Y+L+G G+++A FLS F+GGTDL +E +Q ++ +AD + Sbjct: 359 CHVILFSSTFISYDLTGTTGLKEASDFLSYTFKGGTDLGKVLNHAVELMQGEQYKNADLL 418 Query: 418 VISDFIAQRLPDDVTSKVKELQRVHQHRFHAVAMSAHGKPGIMRIFDHIWRFDTGMRSRL 477 VISDFIA + +V KV+ L R +RFH++ +S +G P ++ +FD WR+ + + Sbjct: 419 VISDFIAPKQEQEVVEKVESL-RGRYNRFHSLCLSKYGNPEVLGLFDTQWRYHPSLVGQF 477 Query: 478 LRR 480 +++ Sbjct: 478 IKK 480 >UniRef50_A4SJ14 von Willebrand factor type A domain protein n=2 Tax=Aeromonas RepID=A4SJ14_AERS4 Length = 484 Score = 361 bits (927), Expect = 3e-98, Method: Composition-based stats. Identities = 180/472 (38%), Positives = 279/472 (59%), Gaps = 4/472 (0%) Query: 1 MLTLDTLNVMLAVSEEGLIEEMIIALLASPQLAVFFEKFPRLKAAITDDVPRWREALRSR 60 M+ + T++ +LA+SE ++ EM++ALLAS Q++ F ++ + RWR + Sbjct: 1 MIEIGTMSALLAISEGEMVSEMVVALLASTQISRFIRIGKAQGRSLKQRLQRWRHQVNDT 60 Query: 61 LKDARVPPELTEEVMCYQQSQLLSTPQFIVQLPQILDLLHRLNSPWAEQARQLVDAN-ST 119 + VPP L +E + YQ LS + + +LP +L L S +A++ RQL Sbjct: 61 IAHTPVPPVLEQEFLLYQHFISLSLARLVAELPTLLSAL-ERGSDFADEGRQLAHQLVDH 119 Query: 120 ITSALHTLFLQRWRLSLIVQATTLNQQLLEEEREQLLSEVQERMTLSGQLEPILADNNTA 179 T L L++WR SL+ L Q+L E ER +L E++E++ S +LE +L Sbjct: 120 PTEGARRLMLEKWRASLVGALLRLQQELAEAERLRLQQELEEQIGASEELEQVLDPQRRT 179 Query: 180 AGRLWDMSAGQLKRGDYQLIVKYGEFLNEQPELKRLAEQLGR-SREAKSIPRNDAQMETF 238 AG LW+++ G+ + LI +Y L ++P L+ +A+ +GR +++ + R T Sbjct: 180 AGGLWNLAQGRWQPASLVLIRQYAAMLRKEPMLQEIADSMGRSLHDSEQLQRPQPPQPTL 239 Query: 239 RTMVREPATVPEQVDGLQQSDDILRLLPPELATLGITELEYEFYRRLVEKQLLTYRLHGE 298 VP+ + G+ ++D++R+LP E LG+ ELE EFYRR +E++LL+Y+ G Sbjct: 240 IQEPVLSDDVPDDLVGIHPANDLMRMLPSEAVMLGVPELELEFYRRYLERRLLSYQARGT 299 Query: 299 SWREKVIERPVVHKDYDEQPRGPFIVCVDTSGSMGGFNEQCAKAFCLALMRIALAENRRC 358 R +++ R D + QP GP IVC+DTSGSMGG+ EQCAKA LAL+++AL E RRC Sbjct: 300 LPRHQLLPRTTDRGDQELQPMGPVIVCIDTSGSMGGYPEQCAKALALALLQLALTEQRRC 359 Query: 359 YIMLFSTEIVRYELSGPQGIEQAIRFLSQQFRGGTDLASCFRAIMERLQSREWFDADAVV 418 ++MLFST++ +EL+ G+++A RFL+ F GGTDL C A +++LQ+ + AD +V Sbjct: 360 FVMLFSTDVATFELTDANGLDEAQRFLAMTFNGGTDLLPCLSATLQQLQAPGFELADVLV 419 Query: 419 ISDFIAQRLPDDVTSKVKELQRVHQHRFHAVAMSAHGKPGIMRIFDHIWRFD 470 ISDFIAQRLP + + + QR RF+AVAMS H KP ++R+FD W D Sbjct: 420 ISDFIAQRLPASLVELM-DRQRGRGTRFNAVAMSRHAKPALLRVFDKSWLLD 470 >UniRef50_A3US96 Putative uncharacterized protein (Fragment) n=1 Tax=Vibrio splendidus 12B01 RepID=A3US96_VIBSP Length = 403 Score = 349 bits (896), Expect = 1e-94, Method: Composition-based stats. Identities = 132/404 (32%), Positives = 220/404 (54%), Gaps = 5/404 (1%) Query: 1 MLTLDTLNVMLAVSEEGLIEEMIIALLASPQLAVFFEKFPRLKAAITDDVPRWREALRSR 60 ML D LN+ L V++ G+I+ + L+A Q+ + E +K ++ + + +WR ++ R Sbjct: 1 MLGADGLNLALMVADSGIIDTAMNDLIARSQVMMAAEN-KGVKTSVKNHLVKWRGKVKKR 59 Query: 61 LKDARVPPELTEEVMCYQQSQLLSTPQFIVQLPQILDLLHRLNSPWAEQARQLVDANSTI 120 + EE+ YQ+ PQF ++ ++ L +S + QAR+L++ N + Sbjct: 60 VTKVCETDRFQEEIALYQEVIYWDEPQFFDEIDSVIKKL-EWHSAFYLQARRLMENNKGV 118 Query: 121 TSALH-TLFLQRWRLSLIVQATTLNQQLLEEEREQLLSEVQERMTLSGQLEPILADNNTA 179 +A+ F +W SL LE +E++L+++ +RM ++ + + Sbjct: 119 YNAMFPHYFCDQWYQSLSDAIKQAQVTELETSKEKVLADLYQRMETMKNMDKVTESGDEG 178 Query: 180 A-GRLWDMSAGQLKRGDYQLIVKYGEFLNEQPELKRLAEQLGRSREAKSIP-RNDAQMET 237 + GRLWDM++ +L + D ++ ++ EFLN+ L+ +AE+LGR + P + A +E Sbjct: 179 SVGRLWDMASAKLSKTDLTIMKRHAEFLNKHKGLQEIAEKLGRMASEEDDPSLHKAPVEE 238 Query: 238 FRTMVREPATVPEQVDGLQQSDDILRLLPPELATLGITELEYEFYRRLVEKQLLTYRLHG 297 + + + + + G+ +SDD+ ++LP E L ELE FY+ L +K+LL+YR G Sbjct: 239 LQMVEEKSDEAVDDIVGIHESDDLNKMLPNETMFLAYPELEVIFYKHLADKRLLSYRSQG 298 Query: 298 ESWREKVIERPVVHKDYDEQPRGPFIVCVDTSGSMGGFNEQCAKAFCLALMRIALAENRR 357 +S + ++ + +GPFIVCVD SGSM GF EQ AKA ALM+IALAE R Sbjct: 299 KSRTLRKVKAQKPDSKNVDIEKGPFIVCVDASGSMSGFPEQSAKAMAYALMQIALAEERD 358 Query: 358 CYIMLFSTEIVRYELSGPQGIEQAIRFLSQQFRGGTDLASCFRA 401 CY++LFS+E + YEL+ G+ +A FLS F GGTDL Sbjct: 359 CYVILFSSEQITYELTRQDGLREASDFLSYSFHGGTDLEPVLMK 402 >UniRef50_A6FIY2 Uncharacterized protein containing a von Willebrand factor type A(VWA) domain n=1 Tax=Moritella sp. PE36 RepID=A6FIY2_9GAMM Length = 469 Score = 337 bits (865), Expect = 4e-91, Method: Composition-based stats. Identities = 157/466 (33%), Positives = 257/466 (55%), Gaps = 14/466 (3%) Query: 17 GLIEEMIIALLASPQLAVFFEKFPRLKAAITDDVPRWREALRSRLKDARVPPELTEEVMC 76 L+++ ++ + A ++ +K A DD W++ + S L D +P L+ E+ Sbjct: 13 QLVDDALLDITAHDRVT------SDIKLAYIDD---WKQQIMSLLADMPLPAGLSNEIHL 63 Query: 77 YQQSQLLSTPQFIVQLPQILDLLHRLNSPWAEQARQLVDAN-STITSALHTLFLQRWRLS 135 + ++LLS F ++ IL + + S + + N S + +FL W+ + Sbjct: 64 CETARLLSPSNFRNKVEGILSKI-KAESAFYNTGLTIYQQNRSMPDNVFFAVFLDSWQQA 122 Query: 136 LIVQATTLNQQLLEEEREQLLSEVQERMTLSGQLEPILADNNTAAG-RLWDMSAGQLKRG 194 + + +L+EE+REQLL E+ ER QLE +L + G RLWD++ G+L Sbjct: 123 IELLLYQEQSRLIEEKREQLLIELAEREETIEQLEDVLDSDLLCNGERLWDLAKGKLTHL 182 Query: 195 DYQLIVKYGEFLNEQPELKRLAEQLGRSREAKSIPRNDAQM-ETFRTMVREPATVPEQVD 253 D +L+ +Y L + ++K++A +LGR A P ET+ VP+ + Sbjct: 183 DTKLLQRYAVNLRKNKDVKKIASELGRMALAHINPEETPNSYETWVLDNSYQDNVPDDMQ 242 Query: 254 GLQQSDDILRLLPPELATLGITELEYEFYRRLVEKQLLTYRLHGESWREKVIERPVVHKD 313 G+ SD+I R+L E L ELE FY+R +E+ LLTY+ G + K + + D Sbjct: 243 GVTYSDEISRMLQTEAVNLTFPELEIIFYKRYIERHLLTYQYQGALQQYKKVTQYRDITD 302 Query: 314 YDEQPRGPFIVCVDTSGSMGGFNEQCAKAFCLALMRIALAENRRCYIMLFSTEIVRYELS 373 DEQ GPFI+CVD+S SM GF E AK+ C AL++IA + R+CY+M+FS E++ + ++ Sbjct: 303 ADEQTGGPFIICVDSSTSMHGFPELTAKSICYALLQIAFEQRRQCYLMMFSNEVITFPVT 362 Query: 374 GPQGIEQAIRFLSQQFRGGTDLASCFRAIMERLQSREWFDADAVVISDFIAQRLPDDVTS 433 + + FLS FRGGTDL +E + S ++ +AD +VISDFIAQ+LP V Sbjct: 363 QSTSLSTMLTFLSSSFRGGTDLQPVIEKSLELMSSAQYKNADTIVISDFIAQKLPTHVAD 422 Query: 434 KVKELQRVHQHRFHAVAMSAHGKPGIMRIFDHIWRFDTGMRSRLLR 479 KV+ + + ++R+HA+++S+ G P +M+IFDH+WR+ G+ RL + Sbjct: 423 KVRAI-KAQKNRYHAISLSSQGNPELMKIFDHVWRYSAGLTGRLKK 467 >UniRef50_C6M593 Putative uncharacterized protein n=1 Tax=Neisseria sicca ATCC 29256 RepID=C6M593_NEISI Length = 482 Score = 319 bits (816), Expect = 2e-85, Method: Composition-based stats. Identities = 112/450 (24%), Positives = 202/450 (44%), Gaps = 10/450 (2%) Query: 29 SPQLAVFFEKFPRLKAAITDDVPRWREALRSRLKDARVPPELTEEVMCYQQSQLLSTPQF 88 F++ L I + W + L R P + EE + Q L Sbjct: 19 QTDYQDLFKQHSWLGGQIQQRLFGWAHQTKLDLWQ-RSPFAIHEENLKNHQQTGLFNQSP 77 Query: 89 IVQLPQILDL--LHRLNSPWAEQARQLVDANSTITSALHTLFLQRWRLSLIVQATTLNQQ 146 + + L + W ++ Q S + L ++W+ L + Sbjct: 78 VDDYQRFCKLTGIPFEKDFWQKELAQSKQVKSHKQNLPLKLLTEKWQQQLDQAKAQWQVE 137 Query: 147 LLEEEREQLLSEVQERMTLSGQLEPILADNNTAAGRLWDMSAGQLKRGDYQLIVKYGEFL 206 + + R++LL+++++ + + QL L G G L D + + ++ +L Sbjct: 138 QINQLRQELLTQLKQELEVVKQLSQQLEQLGFGIGDD----IGNLTPQDIEEMKRWLNYL 193 Query: 207 NEQPELKRLAEQLGRSREAKSIPRNDAQMETFRTMVREPA-TVPEQVDGLQQSDDILRLL 265 + +++AE LG+ R+ + + + +T + E++ GL+ D+ +L Sbjct: 194 TQDKNAQQIAELLGKMRQIEQSEKIEQVKQTVYIQNPQIDINSREEIIGLRLGKDLEYVL 253 Query: 266 PPELATLGITELEYEFYRRLVEKQLLTYRLHGESWREKVIERPVVHKDYDEQPRGPFIVC 325 P ELA + E F + +E +L+ + L G ++ + E V K +++ GP I+C Sbjct: 254 PSELALMADEETSILFDLKFLESKLMCFELQGMTYCDAPTEIIVEQKSQEDEKPGPMILC 313 Query: 326 VDTSGSMGGFNEQCAKAFCLALMRIALAENRRCYIMLFSTEIVRYELSGPQGIEQAIRFL 385 VDTSGSM G E AKA L L A +ENR C+++ FST I +EL+ GI I FL Sbjct: 314 VDTSGSMNGLPENIAKAMALFLGTKAKSENRSCFVINFSTGIETFELTSKTGISNLIAFL 373 Query: 386 SQQFRGGTDLASCFRAIMERLQSREWFDADAVVISDFIAQRLPDDVTSKVKELQRVHQHR 445 Q F GGTD A R ++ ++ + AD ++ISDF+ LPDD+ + + E+QR ++ Sbjct: 374 RQSFHGGTDAAPALRHALKMMEQESYQKADLLMISDFVMNGLPDDLLASI-EIQRETGNQ 432 Query: 446 FHAVAMSAHGKP-GIMRIFDHIWRFDTGMR 474 F+++ + + FD W ++ ++ Sbjct: 433 FNSLVIGDAFMSKRLKTHFDREWIYNPNVQ 462 >UniRef50_A2SS27 von Willebrand factor, type A n=1 Tax=Methanocorpusculum labreanum Z RepID=A2SS27_METLZ Length = 492 Score = 306 bits (784), Expect = 1e-81, Method: Composition-based stats. Identities = 109/452 (24%), Positives = 197/452 (43%), Gaps = 15/452 (3%) Query: 38 KFPRLKAAITDDVPRWREALRSRLKDARVPPELTEEVMCYQ---QSQLLSTPQFIVQLPQ 94 + + + W ++ S L++ ++ + ++ S I L + Sbjct: 27 NDKDFENELNKKLTDWEKSTVSYLENTNPLETHRLNLVSAEWDYRAHGGSKKSIISDLDE 86 Query: 95 ILDLLHRLNSPWAEQA---------RQLVDANSTITSALHTLFLQRWRLSLIVQATTLNQ 145 L +N + E+ + ++ S + + WR Q Sbjct: 87 YKVLQPSVNVKFWEEKIGEVDNQTGERKDESIIENLSLIRRNEQEAWRKEYEKQLLEWQL 146 Query: 146 QLLEEEREQLLSEVQERMTLSGQLEPILADNNTAAGRLWDMSAGQLKRGDYQLIVKYGEF 205 + ++ R+QLL ++E Q++ + G WD+S G+L D ++ K+ ++ Sbjct: 147 EEIQNRRKQLLQNLKEWFETIQQMKEVFEALGVDTGVFWDLSVGKLSAQDISVLKKWADY 206 Query: 206 LNEQPELKRLAEQLGRSREAKSIPRNDAQMETFRTMVREPA-TVPEQVDGLQQSDDILRL 264 L +++ L E +GR + + + T + V++P E++ G++ D+ + Sbjct: 207 LKYDEKIRELCELMGRLHKEQQSHHTEIINSTIQYHVKKPDVHSNEEIIGIKFGRDLENI 266 Query: 265 LPPELATLGITELEYEFYRRLVEKQLLTYRLHGESWREKVIERPVVHKDYDEQPRGPFIV 324 +P ELA L E+ F + VE +L+ + G DE+ GP I+ Sbjct: 267 IPQELALLSDPEVTLLFDLKYVENRLMCFSKQGYITEIIEENMQETVNVDDEEKMGPIII 326 Query: 325 CVDTSGSMGGFNEQCAKAFCLALMRIALAENRRCYIMLFSTEIVRYELSGPQGIEQAIRF 384 CVDTSGSM G E AKA L+L A+++ R CY++ FST I + + P+GI I F Sbjct: 327 CVDTSGSMSGAPENIAKALTLSLASRAISQKRNCYLINFSTSINTLDFTPPKGIHDLINF 386 Query: 385 LSQQFRGGTDLASCFRAIMERLQSREWFDADAVVISDFIAQRLPDDVTSKVKELQRVHQH 444 L F GGTD+A + + ++ AD +VISDF+ L D+ K+ Q+ ++ Sbjct: 387 LKMSFHGGTDVAPALYEGIRMMSESDYKKADLLVISDFVIYGLSSDIVPLCKK-QKQEEN 445 Query: 445 RFHAVAMSAHGKPGIMR-IFDHIWRFDTGMRS 475 RF A+ + + G + +FD W +D S Sbjct: 446 RFFALCIGSFGTQRVEDGVFDQSWTYDPRSGS 477 >UniRef50_A1SXM0 von Willebrand factor, type A n=2 Tax=Alteromonadales RepID=A1SXM0_PSYIN Length = 529 Score = 297 bits (759), Expect = 8e-79, Method: Composition-based stats. Identities = 100/458 (21%), Positives = 189/458 (41%), Gaps = 15/458 (3%) Query: 26 LLASPQLAVFFEKFPRLKAAITDDVPRWREALRSRLKDARVPPELTEEVMCYQQSQLLST 85 LL + ++ + +D + + + + P + + + + Sbjct: 57 LLEKSEQSLLSHNKSNHIESAQEDYKHYTQFVNQ--AKLHLNPYWGKVIHALSEKVSVKE 114 Query: 86 PQFIVQLPQILDLLHRLNSPWA----EQARQLVDANSTITSAL--HTLFLQRWRLSLIVQ 139 P++ + + +A + + L + + L T L WR + + Sbjct: 115 KIE----PKVENKHSKDQRYFAMVSTSKRKTLQNKVTAALDPLLTRTHLLGEWRKQIEQK 170 Query: 140 ATTLNQQLLEEEREQLLSEVQERMTLSGQLEPILADNNTAAGRLWDMSAGQLKRGDYQLI 199 L+ + +++ L +++E + L + D G D S G+L D + I Sbjct: 171 RVEWELNLIHKLQQKFLEKMEEWLRYLSALINSIDDIGFDLGYFLDFSKGELSESDIEQI 230 Query: 200 VKYGEFLNEQPELKRLAEQLGRSREAKSIPRNDAQMETFRTMVREPA-TVPEQVDGLQQS 258 K+ ++ + L + LG+ R+ + + + + E++ G++ Sbjct: 231 KKWLNYIQNDKGAQLLCDLLGKIRQVSHSDKIEIANKIIDVPSQYIDSNSKEEIVGIKLG 290 Query: 259 DDILRLLPPELATLGITELEYEFYRRLVEKQLLTYRLHGESWREKVIERPVVHKDYDEQP 318 ++ +LP E A L F + +E +L+ + + G IE +E Sbjct: 291 QELEHVLPSEFALLSDPSTSILFDLKYIESRLMCFDMVGIQNSVDQIEIEEEVTVQEENT 350 Query: 319 RGPFIVCVDTSGSMGGFNEQCAKAFCLALMRIALAENRRCYIMLFSTEIVRYELSGPQGI 378 +GP ++CVDTSGSM G E AKA L L A E R CY++ FST I +LSG I Sbjct: 351 KGPMVICVDTSGSMHGSPEAIAKAVTLFLSSTAQKEKRDCYLINFSTSIETLDLSGNYSI 410 Query: 379 EQAIRFLSQQFRGGTDLASCFRAIMERLQSREWFDADAVVISDFIAQRLPDDVTSKVKEL 438 + I FL + F GGTD+A ++ +Q+ + +AD ++ISDF+ LPD + + Sbjct: 411 KTLIDFLRKSFHGGTDVAPAINHGLKVMQNDTYENADMLIISDFVMSYLPDKTVKNI-GV 469 Query: 439 QRVHQHRFHAVAMSAHG-KPGIMRIFDHIWRFDTGMRS 475 R +RF+++ + + IFD W ++ S Sbjct: 470 LRESGNRFYSLCIGNAFMSNRLSAIFDREWIYNPATTS 507 >UniRef50_C1Q8W4 Uncharacterized protein containing a von Willebrand factor type A (VWA) domain n=1 Tax=Brachyspira murdochii DSM 12563 RepID=C1Q8W4_9SPIR Length = 478 Score = 292 bits (748), Expect = 2e-77, Method: Composition-based stats. Identities = 100/445 (22%), Positives = 206/445 (46%), Gaps = 19/445 (4%) Query: 38 KFPRLKAAITDDVPRWREALRSRLKDARVPPELTEEVMCYQQSQLLSTPQFIVQLPQILD 97 RL+ + + +W++ L + + E E+ + + + I D Sbjct: 32 NEKRLEEELEVKITKWKKDLNTFTNENNPYDENKTELEISLKKLKVLKNKNIK--TTKED 89 Query: 98 LLHRLNS----------PWAEQARQLVDANSTITSALHTLFLQRWRLSLIVQATTLNQQL 147 +++ LNS + + D+ + + + L W+ + + Sbjct: 90 IINDLNSFSILSDDSSVEFWKDKLHNSDSLNINLNIIKNNILSSWQKTYNKKNNEWLVST 149 Query: 148 LEEEREQLLSEVQERMTLSGQLEPILADNNTAAGRLWDMSAGQLKRGDYQLIVKYGEFLN 207 ++E R++ +S+++ ++L +L + G LWD G+L+ D L+ ++ EF+N Sbjct: 150 VKERRDKFISDIESWISLIKKLRYMSNILRIKTGVLWDFRVGELEENDISLLNRWVEFIN 209 Query: 208 EQP-ELKRLAEQLGRSREAKSIPRNDAQMETFRTMVREPATVPEQVDGLQQSDDILRLLP 266 + E++ + + +G+ + + +N T+ ++ + E++ G+ + DI ++P Sbjct: 210 KYKKEIETICDSIGKRVDIEKALKNIEFKNTYSYTNKKIS-SKEEIVGIYFAKDIENVVP 268 Query: 267 PELATLGITELEYEFYRRLVEKQLLTYRLHGESWREKVIERPVVHKDYDEQPRGPFIVCV 326 EL+ L + E F + +E +L+ + + E + K ++ +G I+C+ Sbjct: 269 EELSLLCDEDSEKLFKLKYIENRLMCFDKSAYVFNEN---DFDIVKAGYKEGKGDMIICI 325 Query: 327 DTSGSMGGFNEQCAKAFCLALMRIALAENRRCYIMLFSTEIVRYELSGPQGIEQAIRFLS 386 DTSGSM G +E AKA ++ AL+ENR Y++ FSTEI + GIE I+FL Sbjct: 326 DTSGSMKGTSEYIAKAIMFKMVMQALSENRNAYLINFSTEIYTCRFTKNNGIEDLIKFLK 385 Query: 387 QQFRGGTDLASCFRAIMERLQSREWFDADAVVISDFIAQRLPDDVTSKVKELQRVHQHRF 446 + GG+D+ + + + +AD +V+SDFI + +PD++ + QR + ++F Sbjct: 386 LSYHGGSDIYKALYEANRVMNTSSFKNADVLVLSDFIMEDMPDNLVK-ICSNQRNNGNKF 444 Query: 447 HAVAMSAHGKPG-IMRIFDHIWRFD 470 AV++ ++F+ W FD Sbjct: 445 FAVSIGKFPFGYSYKKVFNRHWIFD 469 >UniRef50_C1Q9X6 Uncharacterized protein containing a von Willebrand factor type A (VWA) domain n=2 Tax=Brachyspira RepID=C1Q9X6_9SPIR Length = 529 Score = 290 bits (742), Expect = 8e-77, Method: Composition-based stats. Identities = 102/439 (23%), Positives = 193/439 (43%), Gaps = 8/439 (1%) Query: 43 KAAITDDVPRWREALRSRLKDARVPPELTEEVMCYQQSQLLSTPQFIVQLPQILDLLHRL 102 + +I DD+ +++ + S +L + Y + + + ++ + + Sbjct: 78 EKSILDDINHYKK-IDSNADTQFFKSKLEDFKNKYNKKPKIKQTEIKQKIEKEEFDTDKN 136 Query: 103 NS----PWAEQARQLVDANSTITSALHTLFLQRWRLSLIVQATTLNQQLLEEEREQLLSE 158 N + + + T A+ L W+ SL + + +++ RE+ + Sbjct: 137 NYYKETENNQDSEDIKIQKDTDIKAIRKSVLDNWKNSLDNKYIDWSLNEIDKFREEFFKQ 196 Query: 159 VQERMTLSGQLEPILADNNTAAGRLWDMSAGQLKRGDYQLIVKYGEFLNEQPELKRLAEQ 218 ++E + + + G L+D+S G L + D + I + + +K+L + Sbjct: 197 IKEFLDYLKDIMELENALGEETGSLFDLSLGNLLKRDIEYIKQLANLIKSNENIKKLCDM 256 Query: 219 LGRSREAKSIPRNDAQMETFR-TMVREPATVPEQVDGLQQSDDILRLLPPELATLGITEL 277 LGR + + R + + +++ G+ S DI +LP E L L Sbjct: 257 LGRFVKEEESYRIEKVLRKETFHTSVRDINSEDEIVGITYSRDIHNILPQEKLLLAEGVL 316 Query: 278 EYEFYRRLVEKQLLTYRLHGESWREKVIERPVVHKDYDEQPRGPFIVCVDTSGSMGGFNE 337 E F + E +LLT++ G + K ++ +GP I+CVDTSGSM G E Sbjct: 317 ETLFGVKYFENRLLTFKKEGYTDYYYDEMIEDEMKVVEDDKKGPIIICVDTSGSMSGVPE 376 Query: 338 QCAKAFCLALMRIALAENRRCYIMLFSTEIVRYELSGPQGIEQAIRFLSQQFRGGTDLAS 397 AKA L L A+ + R CY++ FST+I +L+ P ++ I FL F GGTD Sbjct: 377 TVAKAVTLYLASRAMKQKRNCYLINFSTQIETMDLTYPNTMDNLIEFLRLSFNGGTDAVP 436 Query: 398 CFRAIMERLQSREWFDADAVVISDFIAQRLPDDVTSKVKELQRVHQHRFHAVAMSAHGKP 457 R ++ + + + +D + ISDF+ DD K+ E Q+ +++RF+++ + + Sbjct: 437 ALRHAIKTMNTENYKKSDLLFISDFVFNGFTDD-DYKLAEAQKKNENRFYSLIIGSTPLF 495 Query: 458 GIMR-IFDHIWRFDTGMRS 475 + IFD+ W +D+ S Sbjct: 496 NVKNSIFDYNWCYDSSRGS 514 >UniRef50_Q6LJM7 Putative uncharacterized protein n=1 Tax=Photobacterium profundum RepID=Q6LJM7_PHOPR Length = 492 Score = 288 bits (738), Expect = 2e-76, Method: Composition-based stats. Identities = 114/474 (24%), Positives = 200/474 (42%), Gaps = 40/474 (8%) Query: 30 PQLAVFFEKFPRLKAAITDDVPRWREALRSRLKDARVPPELT------------------ 71 + + P L AI + + L+ L + E+ Sbjct: 13 SRYQPMIKDNPDLDFAIKQQLSDCTQGLKLELIKSNPYTEIELALLNAELEFDEKAATSV 72 Query: 72 ---EEVMCYQ----QSQLLSTPQF-IVQLPQILDLLHRLNSPWAEQARQLVDANSTITSA 123 +++ YQ Q+QL S ++ + +L + D +++LN T Sbjct: 73 RFKNDIISYQRFISQAQLPSDKKYWLKELTVLDDKVNKLN---------QQKRKITAIKT 123 Query: 124 LHTLFLQRWRLSLIVQATTLNQQLLEEEREQLLSEVQERMTLSGQLEPILADNNTAAGRL 183 H L WR + + + + +E+ LSE+ + + L ++ G L Sbjct: 124 KHNHLLTHWRKQYDKAHSKWQLEAIRQFQEKFLSELNDWLEQIKILSEVVESLGLEPGYL 183 Query: 184 WDMSAGQLKRGDYQLIVKYGEFLNEQPELKRLAEQLGRSREAKSIPRNDAQMETFRTMVR 243 D S G+L D + + K+ E+L +K L E LG+ R+ + + +T Sbjct: 184 LDFSEGKLTLSDVEKLKKWAEYLPNDEGVKSLCEMLGKLRQVTLSDKIETIKKTINMPEM 243 Query: 244 E-PATVPEQVDGLQQSDDILRLLPPELATLGITELEYEFYRRLVEKQLLTYRLHGESWRE 302 +++ GL+ D+ +LP ELA + E F + +E L+ + + G S Sbjct: 244 VFDGDSKQEIVGLKLGKDLEHVLPSELALMSDPETSILFDLKYLESSLMCFDMAGISIDH 303 Query: 303 KVIERPVVHKDYDEQPRGPFIVCVDTSGSMGGFNEQCAKAFCLALMRIALAENRRCYIML 362 E+ V E +GP ++C+DTSGSM G E AKA L L A E R CY++ Sbjct: 304 A--EQVVEQSIQKEDKKGPMVICIDTSGSMHGSPETIAKALSLYLTTQAKKEQRDCYLIN 361 Query: 363 FSTEIVRYELSGPQGIEQAIRFLSQQFRGGTDLASCFRAIMERLQSREWFDADAVVISDF 422 ST I +LS + + FL + F GGTD+A R + +++ + +AD ++ISDF Sbjct: 362 ISTSIEILDLSQGYSLSSLLTFLQKSFHGGTDVAPAMRHGINIMKNDAYENADMLIISDF 421 Query: 423 IAQRLPDDVTSKVKELQRVHQHRFHAVAMSAHG-KPGIMRIFDHIWRFDTGMRS 475 + LP+D V++ QR+ +RF+++ + + FD W ++ S Sbjct: 422 VMSSLPNDCLELVEQ-QRIKGNRFYSLCIGNAFMTNRLKTHFDSEWVYNPSNSS 474 >UniRef50_C0QY03 von Willebrand factor type A (VWA) domain containing protein n=1 Tax=Brachyspira hyodysenteriae WA1 RepID=C0QY03_BRAHW Length = 467 Score = 282 bits (721), Expect = 2e-74, Method: Composition-based stats. Identities = 100/436 (22%), Positives = 206/436 (47%), Gaps = 9/436 (2%) Query: 36 FEKFPRLKAAITDDVPRWREALRSRLKDARVPPELTEEVMCYQQSQLLSTPQFIVQLPQI 95 + RL + + +W++ L + D + E+ + S + Sbjct: 31 IDNDQRLSEELEIKITKWKKDLNVFINDNNPYHDNKTELDIALKKLKNSNIS-REDIFND 89 Query: 96 LDLLHRLNSPWAEQARQLVDANSTITSALHTLFLQRWRLSLIVQATTLNQQLLEEEREQL 155 L+ L+ L+ +L+ NS + L + W + + +++ R++ Sbjct: 90 LNSLNALSIDTNFWKERLL--NSDNLNILKKNIISVWEKTYNKKNNDWLVSTVKDRRDKF 147 Query: 156 LSEVQERMTLSGQLEPILADNNTAAGRLWDMSAGQLKRGDYQLIVKYGEFLNEQPELKRL 215 +S+++ + L +L+ + G LWD G+L+ D L+ ++ +F+NE +++ + Sbjct: 148 ISDIESWINLLKKLKYMSNILRIKTGVLWDFRVGELEEADISLLKRWVDFINEYKDIEVI 207 Query: 216 AEQLGRSREAKSIPRNDAQMETFRTMVREPATVPEQVDGLQQSDDILRLLPPELATLGIT 275 + +GR + + +N T+ ++ + E++ G+ + DI ++P EL+ L Sbjct: 208 CDSIGRRIDIEKSLKNVEFKNTYSNTNKKIS-SKEEIVGIYFAKDIENVIPEELSLLCNE 266 Query: 276 ELEYEFYRRLVEKQLLTYRLHGESWREKVIERPVVHKDYDEQPRGPFIVCVDTSGSMGGF 335 E E F + +E +L+ + + + ER + K + +G I+C+DTSGSM G Sbjct: 267 ESEKLFKLKYIENRLMCFDKSAYVFND---ERDNIVKAGYREGKGDMIICIDTSGSMKGI 323 Query: 336 NEQCAKAFCLALMRIALAENRRCYIMLFSTEIVRYELSGPQGIEQAIRFLSQQFRGGTDL 395 NE AKA ++ AL+ENR Y++ FSTEI + + GIE I+FL + GG+D+ Sbjct: 324 NEYIAKATMFKMVMQALSENRNAYLINFSTEIYTCKFTKENGIEDLIKFLKLSYHGGSDI 383 Query: 396 ASCFRAIMERLQSREWFDADAVVISDFIAQRLPDDVTSKVKELQRVHQHRFHAVAMSAHG 455 + + + +AD +V+SDFI + +P+++ + + Q+ + +++ AV++ Sbjct: 384 YKALYEANRMMNTSSFRNADVLVLSDFIMEDMPNNLVTMCSK-QKNNGNKYFAVSIGKFP 442 Query: 456 KPG-IMRIFDHIWRFD 470 ++F+ W FD Sbjct: 443 FGYSYRKVFNRHWIFD 458 >UniRef50_Q14PC7 Hypothetical two-component regulator system yiem receptor component protein n=1 Tax=Spiroplasma citri RepID=Q14PC7_SPICI Length = 519 Score = 280 bits (716), Expect = 8e-74, Method: Composition-based stats. Identities = 104/450 (23%), Positives = 208/450 (46%), Gaps = 21/450 (4%) Query: 52 RWREALRSRLKDARVPPELTEEVMCYQQSQLLSTPQFIVQLPQILDLLHRLNSPWAEQAR 111 + + + + ++ + +E+ + ++ D L ++NSP+ ++ Sbjct: 51 FYDHQIENEIMKIKLDSNIEKEIYLFHWIKVNGYDGLKKNYASTQDFLFKVNSPFYDRLN 110 Query: 112 ----QLVDANSTITSALHTLFLQRWRLSLIVQATTLNQQLLEEEREQLLSEVQERMTLSG 167 + + T+ L F+ W LI + +EE R + + ++ ++ + Sbjct: 111 YYRYEFNKKQNNNTNMLFRDFIGIWESILIKRINDYRFAKIEELRTKFMQDLYNKVEIYN 170 Query: 168 QLEPILADNNTAAGRLWDMSAGQLKR-GDYQLIVKYGEFLNEQPELKRLAEQLGRSREAK 226 + +L G++W+ +LK+ + I K+ +FL P + +A LGR + Sbjct: 171 KANSLLKTVWNFFGKIWN--PTELKKGVNMSAIDKFAKFLETNPAIMEIATLLGRFQGES 228 Query: 227 SIPRNDAQMETFRTMVREP-ATVPEQVDGLQQSDDILRLLPPELATLGITELEYEFYRRL 285 ++ E +P + PE++ G +S D+ + EL L L+Y FY++ Sbjct: 229 NLIEQRILEEIVMDYEWKPIGSSPEEIIGATESKDLEHMFAAELVLLKDPVLKYIFYKKY 288 Query: 286 VEKQLLTYRLHGESWREKVIERPVVHKDYDEQPRGPFIVCVDTSGSMGGFNEQCAKAFCL 345 +E +L T+ + K + + Y + +GP I+ +DTS SM G EQ AKA L Sbjct: 289 IEGKLTTFEFLSQDKVPKEQIKLRTIETYVPEEKGPIILSIDTSSSMRGSPEQIAKALAL 348 Query: 346 ALMRIALAENRRCYIMLFSTEIVRYELSG-PQGIEQAIRFLSQQFRGGTDLASCFRAIME 404 A+ +IAL E+R CY++ FS + Y LS + + I FLS+ F G T++ + Sbjct: 349 AIAKIALGEHRPCYMINFSKSLDVYNLSSLKDSLPKLIEFLSKSFAGDTNVEPALEHTLT 408 Query: 405 RLQSREWFDADAVVISDFIAQRLPDDVTSKVKELQRVHQHRFHAVAMSAHGKPGIMRIFD 464 + S E+F+AD ++ISDF+ L ++ +K+ L + ++RFHA+ + G + IF+ Sbjct: 409 VMDSNEYFNADLLLISDFLTSDLSPELITKI-NLLKQRRNRFHAIVIGTMGAENVETIFN 467 Query: 465 HIWRFDT-----------GMRSRLLRRWRR 483 + W +D + ++++++ + Sbjct: 468 NAWIYDPRDPFASELIIASLTGQIVKKYDK 497 >UniRef50_A6DQA4 Putative uncharacterized protein n=1 Tax=Lentisphaera araneosa HTCC2155 RepID=A6DQA4_9BACT Length = 479 Score = 275 bits (704), Expect = 2e-72, Method: Composition-based stats. Identities = 104/468 (22%), Positives = 191/468 (40%), Gaps = 34/468 (7%) Query: 29 SPQLAVFFEKFPRLKAAITDDVPRWREALRSRLKDARVPPELTEEVMCYQQSQLLSTP-- 86 E+FP L + + L + + + Q S+ T Sbjct: 12 QNDYGDLLERFPALTREVNRTLEEKAGEWEQTLAEENP---FSSDWNTLQASEKWLTEGK 68 Query: 87 ----QFIVQLPQILD-----LLHRLNSPWAEQARQLVDANSTITSALHTLFLQRWRLSLI 137 + Q + L L S W ++ Q+ D + + L + +W SL Sbjct: 69 LGNDCLKSSVNQFHEFRAKCQLPDLKSYWNQELSQINDQPGNV-NVLPQFLISQWHKSLR 127 Query: 138 VQATTLNQQLLEEEREQLLSEVQERMTLSGQLEPILADNNTAAGRLWDMS---------A 188 +T Q+ L+ + + E+ + M + L + ++D + Sbjct: 128 DLQSTWKQERLDNLQNETQQEMNDWMDNLNDIADELEKLDLDPEDVFDFASGAGAGGDGP 187 Query: 189 GQLKRGDYQLIVKYGEFLNEQPELKRLAEQLGRSREAKSIPRNDAQMETFRTMVREPATV 248 +L + + + K+ L + P +K L + LG+ ++A N + + + Sbjct: 188 SELSIQNLETLKKWLGTLKKDPGIKDLCKLLGKLKQA---KLNKIKRSRTTSSSVSSSNS 244 Query: 249 PEQVDGLQQSDDILRLLPPELATLGITELEYEFYRRLVEKQLLTYRLHGESWREKVIERP 308 E++ G++ S ++ LLP ELA L E E F + E +L+ + + G K E Sbjct: 245 CEEISGIKFSKELEHLLPSELALLTDPETEIIFDLKYAESRLMGFDMSGIQTVSKKEEIE 304 Query: 309 VVHKDYDEQPRGPFIVCVDTSGSMGGFNEQCAKAFCLALMRIALAENRRCYIMLFSTEIV 368 ++ +GP I+ VDTSGSM G E AKA L + + ++ ENR CY++ FST+I Sbjct: 305 -----MPDEEQGPMIIAVDTSGSMYGAPETTAKAITLYMAKTSMKENRNCYVIEFSTKIK 359 Query: 369 RYELSGPQGIEQAIRFLSQQFRGGTDLASCFRAIMERLQSREWFDADAVVISDFIAQRLP 428 +L+G + ++FL F GGTD+ E + + +AD +V+SDFI L Sbjct: 360 TIDLAGSNRLSALMKFLEMSFNGGTDVEPAIEHGTEVMNQEGYRNADMLVVSDFILNDLE 419 Query: 429 DDVTSKVKELQRVHQHRFHAVAMSAHGKPGI-MRIFDHIWRFDTGMRS 475 + K+++ + + F+++ + H FD W ++ S Sbjct: 420 PPLVDKIQQA-KAKNNSFYSLCIGDHFHSHKNREYFDRKWVYNPDGSS 466 >UniRef50_C3XEK2 Putative uncharacterized protein n=1 Tax=Helicobacter bilis ATCC 43879 RepID=C3XEK2_9HELI Length = 493 Score = 272 bits (696), Expect = 2e-71, Method: Composition-based stats. Identities = 116/478 (24%), Positives = 196/478 (41%), Gaps = 37/478 (7%) Query: 29 SPQLAVFFEKFPRLKAAITDDVPRWREAL---RSRLKDARVPPELTE---EVMCYQ---- 78 LA E++ + ++ D P +E + R +L + L + E Y Sbjct: 7 QEHLAKAHEEYQAQQDSLQDSHPFKKEEVAHHRRKLTPTQDIKTLKDDIKEFDTYNKKHK 66 Query: 79 -------------QSQLLSTPQFIVQLPQILDLLHRLNSPWAEQARQLVDANSTITSALH 125 L Q+ + + + + E+ +Q D +T L Sbjct: 67 GTLDSLLNKQEQNDITLTDKRQYTLNAWEQMLKSKKNEYIEQEKLKQTRDYKQRLTDYLE 126 Query: 126 TLFLQRWRLSLIVQATTLNQQLLEEEREQLLSEVQERMTLSGQLEPILADNNTAAGRLWD 185 +L + LS + A L +L+E ++ L +V + Q + G+ D Sbjct: 127 SLLETKEFLSNLGGAGELFSGVLDEMKQGL--DVSNLGDEAYQNKLKGQRIEMPGGKGTD 184 Query: 186 MSAGQLKRGDYQLIVKYGEFLNEQPELKRLAEQLGRSREAKSIPRNDAQMETFR---TMV 242 AG R D I +Y + LK +AE LGR + + E T V Sbjct: 185 NGAGIRNRIDINTIKQYFNTIQNSKALKEIAELLGRLEKEEEESEIQKIKELKSYSYTQV 244 Query: 243 REPATVPEQVDGLQQSDDILRLLPPELATLGITELEYEFYRRLVEKQLLTYRLHGESWRE 302 E++ G+ D+ LLP ELA L LE F + ++ +L + G Sbjct: 245 IPTKRYKEEICGVTLGRDLENLLPQELAMLEDETLELLFDLKYIQNRLFCFEKQGYHSIT 304 Query: 303 KVIERPVVH------KDYDEQPRGPFIVCVDTSGSMGGFNEQCAKAFCLALMRIALAENR 356 + + + + E+ G I+CVDTSGSM G E AKA L L A + R Sbjct: 305 QEAQEEIEKEIETKKQKKREKNEGAIIICVDTSGSMYGNPEYIAKALTLFLATKANTQKR 364 Query: 357 RCYIMLFSTEIVRYELSGPQGIEQAIRFLSQQFRGGTDLASCFRAIMERLQSREWFDADA 416 CY++ FS I ELSG G+ + ++FL F GGTD+A +A ++ +Q ++ +D Sbjct: 365 ACYLINFSIGIETMELSGKGGMAKLMQFLEMSFGGGTDVAPALKAGLKTMQQDDFKKSDL 424 Query: 417 VVISDFIAQRLPDDVTSKVKELQRVHQHRFHAVAMSAHGKPGIMRIFDHIWRFDTGMR 474 +VISD +P+D+ +++ QR ++F+ + + +G G FD W ++ + Sbjct: 425 IVISDGGFGYIPNDLEKQMQN-QRQKDNKFYLLDI--NGNSGKKTFFDKHWIYNAQTQ 479 >UniRef50_B9KEB5 Putative uncharacterized protein n=1 Tax=Campylobacter lari RM2100 RepID=B9KEB5_CAMLR Length = 474 Score = 258 bits (660), Expect = 2e-67, Method: Composition-based stats. Identities = 87/453 (19%), Positives = 185/453 (40%), Gaps = 25/453 (5%) Query: 30 PQLAVFFEKFPRLKAAITDD-VPRWREALRSRLKDARVPPELTEEVMCYQQ----SQLLS 84 QL V+ +K + I + + +RE + K + E Y + L S Sbjct: 25 DQLHVYIKKMKGFRLFIEEKKILSYREIIARTQKKIILKDL--TEFRKYNRKNIDFILDS 82 Query: 85 TPQFIVQLPQILDLLHR-LNSPWAEQARQLVDA-NSTITSALHTLFLQRWRLSLIVQATT 142 ++ D L + N ++ ++ + + L + + +++ Sbjct: 83 LENTNKSQKELRDFLLKIWNEELNKKIKKYEEKFLKHYNKQVCILMEKIEKENILGSENC 142 Query: 143 LNQQLLEEEREQLLSEVQERMT-LSGQLEPILADNNTAAGRLWDMSAGQLKRGDYQLIVK 201 + + + + + + + + + G K+ + I+ Sbjct: 143 EGNKAISVSKNDFFETNANNIENILENFKLEFIKDFHDKNPYYGNNTGCEKKLSIKSIID 202 Query: 202 YGEFLNEQPELKRLAEQLGRSREAKSIPRNDAQMETFRTMVREPATVPEQVDGLQQSDDI 261 Y E + LK + + LG+SR + + + E++ G+ ++ Sbjct: 203 YFEIIKNNHALKEICDLLGKSRNDDNKEGKN-DSNLNNNAQKTSKESKEEIKGVILGRNL 261 Query: 262 LRLLPPELATLGITELEYEFYRRLVEKQLLTYRLHGESWREKVIERPVVHKDYDEQPRGP 321 LL EL L +LE F + +E +L + G ++K + + +G Sbjct: 262 EELLAQELGLLNDEDLENLFVLKYLENRLFCFEKQGY-----------INKMQNHKNKGA 310 Query: 322 FIVCVDTSGSMGGFNEQCAKAFCLALMRIALAENRRCYIMLFSTEIVRYELSGPQGIEQA 381 I+CVD+SGSM G E AK +++ AL E CY++ FST+ E+ +G+++ Sbjct: 311 IIICVDSSGSMDGQPEIIAKGITYYMVKKALKEKSACYLINFSTKTKCEEIDLSKGMKKL 370 Query: 382 IRFLSQQFRGGTDLASCFRAIMERLQSREWFDADAVVISDFIAQRLPDDVTSKVKELQRV 441 FL F GGTD++ + ++++Q + +D +VISD + + + ++++ QR Sbjct: 371 FDFLCFSFNGGTDVSIALKEGVKKMQEDGFERSDLLVISDGFFGDIDNKILKQMEK-QRE 429 Query: 442 HQHRFHAVAMSAHGKPGIMRIFDHIWRFDTGMR 474 +++F+ + + +G + IFD W++DT + Sbjct: 430 QENKFYLLDI--NGCDKVKTIFDKHWKYDTSTK 460 >UniRef50_Q1ZC32 Putative uncharacterized protein (Fragment) n=1 Tax=Psychromonas sp. CNPT3 RepID=Q1ZC32_9GAMM Length = 328 Score = 256 bits (655), Expect = 1e-66, Method: Composition-based stats. Identities = 110/325 (33%), Positives = 184/325 (56%), Gaps = 2/325 (0%) Query: 156 LSEVQERMTLSGQLEPILADNN-TAAGRLWDMSAGQLKRGDYQLIVKYGEFLNEQPELKR 214 L ++ +R +L I A+ N + RLWDM+ +L + + Q + + + L++ EL++ Sbjct: 1 LKDLYQRQETISKLTEIDANINPQNSMRLWDMAKAKLTKINVQTLKRTAKLLSKHSELQK 60 Query: 215 LAEQLGRSREAKSIPRNDAQMETFRTMVREPATVPEQVDGLQQSDDILRLLPPELATLGI 274 +A+QLGR P + R + + + G++QS D+ RLLP EL L Sbjct: 61 IADQLGRMANQHDDPCLNRTEVHSRRIKESTSPFTGDIVGIKQSADLERLLPIELMFLSD 120 Query: 275 TELEYEFYRRLVEKQLLTYRLHGESWREKVIERPVVHKDYDEQPRGPFIVCVDTSGSMGG 334 +EL+ FY+ L+EK+L TY+ + + I + EQ +GPFI+ +D SGSM G Sbjct: 121 SELDVLFYKNLIEKRLSTYQQQNKHNEFEQITQFKQQPKKAEQDKGPFIIAIDASGSMMG 180 Query: 335 FNEQCAKAFCLALMRIALAENRRCYIMLFSTEIVRYELSGPQGIEQAIRFLSQQFRGGTD 394 E+CAKAF LM+IALA+NR CY++LFS + + YELS G+ + + FLS F GGTD Sbjct: 181 SAEKCAKAFAYGLMKIALAQNRECYVILFSAQQITYELSNQHGLSEILNFLSYSFHGGTD 240 Query: 395 LASCFRAIMERLQSREWFDADAVVISDFIAQRLPDDVTSKVKELQRVHQHRFHAVAMSAH 454 L S + + +++ ++ +AD +VISDFI + K+ +L+ +RFHA+++S + Sbjct: 241 LTSVLESAFKVMETEKYKNADLIVISDFITPPMSSKTIDKLNKLKEKS-NRFHALSLSRY 299 Query: 455 GKPGIMRIFDHIWRFDTGMRSRLLR 479 ++ +FD W+++ + + R Sbjct: 300 QNTEVLALFDKNWQYNPSKLANIKR 324 >UniRef50_A6L4M8 Putative uncharacterized protein n=9 Tax=Bacteroides RepID=A6L4M8_BACV8 Length = 453 Score = 242 bits (617), Expect = 2e-62, Method: Composition-based stats. Identities = 86/436 (19%), Positives = 167/436 (38%), Gaps = 37/436 (8%) Query: 43 KAAITDDVPRWREALRSRLKDARVPPELTEEVMCYQQSQLLSTPQFIVQLPQILDL---- 98 + + + V + L++ T E + S+ L F+ L Q+ Sbjct: 42 QRELEEKVLMYYRRTTPSLQE--YYSRYTPEWEAFYSSEHLPDMAFLQYLKQMRGAFKKR 99 Query: 99 --LHRLNSPWAEQARQLVDANSTITSALHTLFLQRWRLSLIVQATTLNQQLLEEEREQLL 156 L LN + + + FL +W L + + E Sbjct: 100 YELAELNIDYYISLLENASLLRGEGARTKEFFLDKWHQLLTRKEYDYQYMHINSLCEGF- 158 Query: 157 SEVQERMTLSGQLEPILADNNTAAGRLWDMSAGQLKRGDY----QLIVKYGEFLNEQPEL 212 + ++ +G S + +Y + ++ Y + P + Sbjct: 159 -------------DLLIRKQGKESGNKLLGSRMEWLLHNYPDLYRRMLPYETVMKRNPAI 205 Query: 213 KRLAEQLGRSREAKSIPRNDAQMETFRTMVREPATVPEQVDGLQQSDDILRLLPPELATL 272 ++LA LG+ + + + ++ R + P + + G+ +D+ LLP E L Sbjct: 206 RQLARLLGKKHRDQQKYDSLSGVDKKRLIRHSPHS---DITGVTLGNDLNSLLPVEYCYL 262 Query: 273 GITELEYEFYRRLVEKQLLTYRLHGESWREKVIERPVVHKDYDEQPRGPFIVCVDTSGSM 332 L F R EK+L + + PV + +GP+I+CVDTSGSM Sbjct: 263 ADDALRAVFMERYAEKRLQLFDYQSKE------TEPVKDDKHKVSGQGPYIICVDTSGSM 316 Query: 333 GGFNEQCAKAFCLALMRIALAENRRCYIMLFSTEIVRYELSG-PQGIEQAIRFLSQQFRG 391 G E +K+ LA+ ++ +R+CY++ FS E V + + + + FL+++F G Sbjct: 317 QGNREILSKSAILAIAQLTEKTHRKCYVINFSDEAVSLLIEDLGRDMPRLAEFLNKRFDG 376 Query: 392 GTDLASCFRAIMERLQSREWFDADAVVISDFIAQRLPDDVTSKVKELQRVHQHRFHAVAM 451 GTD+ R + ++ ++D V+ISDF L ++ +VK ++R F + Sbjct: 377 GTDIEPALREAAHIINGNDFRESDIVLISDFEMPPLSRNLMEQVKVIKRRK-TSFFGLVF 435 Query: 452 SAHGKPGIMRIFDHIW 467 + + + + W Sbjct: 436 GNKPEMEYLNLCERYW 451 >UniRef50_Q466I6 Putative uncharacterized protein n=3 Tax=Methanosarcina RepID=Q466I6_METBF Length = 562 Score = 241 bits (616), Expect = 3e-62, Method: Composition-based stats. Identities = 92/429 (21%), Positives = 190/429 (44%), Gaps = 48/429 (11%) Query: 68 PELTEEVMCYQQSQLLSTPQFIVQLPQILDLLHR------------LNSPWAEQA--RQL 113 P + + + + +S +S +++ L +D + R + + R Sbjct: 133 PLIYKILERFSESTSISGTEYLKDLDTEMDEILRQFEEILKETLLMWGNSGYAELPGRNF 192 Query: 114 VDANSTITSALHTLFLQRWRLSLIVQATTLNQQLLEEEREQLLSEVQERMTLSGQLEPIL 173 + SAL L + + + L + ++E++E + L L + Sbjct: 193 QKMKNFGESALVDAILAFMQKGGYQEFLEKVMEGLYRRMNEFVTEMEENLELFDTLTLLF 252 Query: 174 ADNNTAAGRLWDMSAGQLKRGDY----QLIVKYGEFLNEQPELKRLAEQLGRSREAKSIP 229 R W S +LK+ + +++ Y F + P+LK++ + +GR Sbjct: 253 PQ------RNWSYSVKELKKEPFYVQLKMLKNYSTFFEKSPDLKKIMDFIGR-------- 298 Query: 230 RNDAQMETFRTMVREPATVPEQVDGLQQSDDILRLLPPELATLGITELEYEFYRRLVEKQ 289 + + +R +++ ++ SD I LLP E A L L+ +FY ++E + Sbjct: 299 ---REFDPPSDRIRLSPFGKDRIQTVRFSDSINNLLPMEAAKLLNPSLKKKFYADMLEGK 355 Query: 290 LLTYRLHGESWREKVIERPVVHKDYDEQPRGPFIVCVDTSGSMGGFNEQCAKAFCLALMR 349 LL+Y+ G+ + +PRGP IV VDTSGSM G + AK+ LA+ + Sbjct: 356 LLSYQFLGKHYTG----------PPRIKPRGPMIVLVDTSGSMHGAPQTLAKSAVLAMAK 405 Query: 350 IALAENRRCYIMLFSTEIVRYEL---SGPQGIEQAIRFLSQQFRGGTDLASCFRAIMERL 406 + L++ R ++LF++ E+ S + E+ + FL F GGTD + + ++ L Sbjct: 406 LMLSQQRDMKVILFASTSQHLEIELSSRKKMSEKFLNFLLYTFGGGTDFNTALASGLKSL 465 Query: 407 QSREWFDADAVVISDFIAQRLPDDVTSKVKELQRVHQHRFHAVAMSAHGKPGIMRIFDHI 466 + +++ AD + I+D ++ + V ++ +E ++ + + +++ + + G G+ I D+I Sbjct: 466 KEKDFQGADLLFITDGKSEVSDELVLARWEEAKKKYNAKVYSLIVGSSGAGGLSEISDYI 525 Query: 467 WRFDTGMRS 475 + + M S Sbjct: 526 YFVEMEMDS 534 >UniRef50_A7V9J4 Putative uncharacterized protein n=7 Tax=Bacteroides RepID=A7V9J4_BACUN Length = 495 Score = 236 bits (603), Expect = 1e-60, Method: Composition-based stats. Identities = 78/393 (19%), Positives = 154/393 (39%), Gaps = 13/393 (3%) Query: 77 YQQSQLLSTPQFIVQLPQILDLLHRLNSPWAEQARQLVDANSTITSALHTLFLQRWRLSL 136 + S + L QI D + + W + Sbjct: 106 LEWSMAKRKDGWQALLQQISDKYREYGFDSRFYRSHFGTEGGYADDEVWEKMVDDWEDAF 165 Query: 137 IVQATTLNQQLLEEEREQLLSEVQERMTLSGQLEPILADNNTAAGRLWDMSAGQLKRGDY 196 ++ ++ + ++ L ++ + + + + W M +G D+ Sbjct: 166 QLKMHEEKEKEIAFRKDALERRLRSNLKDIPEYIRQNRVDKDEFFQTWGMMSGLWNTVDF 225 Query: 197 QLIVKYGEFLNEQPELKRLAEQLGRSREAKSIPRNDAQMETFRTMVREPATVPEQVDGLQ 256 + I K PE+ ++A ++GR + + R ++ E ++ + G+ Sbjct: 226 ERIRKIVRIQRSCPEIVKVARKMGRMADDEG--REQIRVAEGNVYKMEHSSKC-DILGIS 282 Query: 257 QSDDILRLLPPELATLGITELEYEFYRRLVEKQLLTYRLHGESWREKVIERPVVHKDYDE 316 +D+ LLP ELA +ELE F + + ++L T+R E +++ + Sbjct: 283 TGNDLNALLPIELAHSADSELEDLFVYKYLTRKLQTFRYKSE-----IMQPARRIETKPA 337 Query: 317 QPRGPFIVCVDTSGSMGGFNEQCAKAFCLALMRIALAENRRCYIMLFSTEIVRYELSGPQ 376 +P+GP IVC+DTSGSM G E+ A + + L+ IA + R C+++ FS I ++ + Sbjct: 338 RPKGPMIVCLDTSGSMAGKPEKIAHSLLIKLLEIADRQRRNCFLIAFSVSIQPIDV--RK 395 Query: 377 GIEQAIRFLSQQFRGGTDLASCFRAIMERLQS-REWFDADAVVISDFIAQRLPDDVTSKV 435 + + F S+ G TD A L+ +E+ +AD + +SDF ++ Sbjct: 396 ERARLLEFFSKTACGDTDATRMLEATFRLLKEGKEYMNADVLWVSDFKIPHSSPAFMEEI 455 Query: 436 KELQRVHQHRFHAVAMSAHGKPGIMRIFDHIWR 468 + R F+ + + FD I+R Sbjct: 456 RRC-REAGTHFYGLQIGITDN-EWTPFFDRIYR 486 >UniRef50_Q46D40 Putative uncharacterized protein n=3 Tax=Methanosarcina RepID=Q46D40_METBF Length = 612 Score = 235 bits (600), Expect = 2e-60, Method: Composition-based stats. Identities = 95/426 (22%), Positives = 181/426 (42%), Gaps = 41/426 (9%) Query: 56 ALRSRLKDARVPPELTEEVMCYQQSQLLSTPQFIVQLPQILDLLHRLNSPWAE---QARQ 112 + LT+E Q+S L F + + + S + + + Sbjct: 189 QENKQENPLEQESSLTQENSLPQESSLPQESSFQQENSLEQESSFQQESSLQDPEHENWE 248 Query: 113 LVDANSTITSALHTL------FLQRWRLSLIVQATTLNQQLLEEEREQLLSEVQERMTLS 166 D + T+ L F+ + + ++ ++ + + E+L+ +++ + + Sbjct: 249 NPDIQAGNTAESERLASMTLNFMSSEKAGEV--LDSVIEESIAAKIEELIPVLEDHLEML 306 Query: 167 GQLEPILADNNTAAGRLWDMSAGQLKRGDYQLIVKYGEFLNEQPELKRLAEQLGRSREAK 226 L + GR WD S L R + + KY L + + + EQ+GR Sbjct: 307 EILSMLF------PGRAWDYSLKALHREYFGNLEKYAALLRKSSAIHEILEQVGR----- 355 Query: 227 SIPRNDAQMETFRTMVREPATVPEQVDGLQQSDDILRLLPPELATLGITELEYEFYRRLV 286 ++E + +V + S D+ LLP E L L+ +FY ++ Sbjct: 356 ------IELEYGSKKLSLSPYSKSEVHSVTFSGDLRTLLPAETVKLKNPLLKRKFYADML 409 Query: 287 EKQLLTYRLHGESWREKVIERPVVHKDYDEQPRGPFIVCVDTSGSMGGFNEQCAKAFCLA 346 E +LLTY+L GE+W ++ +GP + VDTS SM G E AKA LA Sbjct: 410 EGKLLTYQLKGENWN---------SDSAGKKRKGPVVALVDTSASMRGSPELLAKAVVLA 460 Query: 347 LMRIALAENRRCYIMLFST--EIVRYELSGPQGI-EQAIRFLSQQFRGGTDLASCFRAIM 403 + R L ENR ++LFS+ + V EL+ + + E+ + FL F GGTD + RA + Sbjct: 461 VTRRMLTENRDVKVILFSSKWQTVEIELTNKKRMGEEFLEFLKFTFGGGTDFNTALRAGL 520 Query: 404 ERLQSRE-WFDADAVVISDFIAQRLPDDVTSKVKELQRVHQHRFHAVAMSAHGKPGIMRI 462 + +++ + + AD + ++D ++ + + E++ + R ++ + + G+ +I Sbjct: 521 KAMKNEKAFEGADLLFLTDGYSELSEKPLIREWNEIKAERRARIFSLIIGNYDAGGLQQI 580 Query: 463 FDHIWR 468 DH + Sbjct: 581 SDHTYL 586 >UniRef50_D1YZJ4 Putative uncharacterized protein n=1 Tax=Methanocella paludicola SANAE RepID=D1YZJ4_METPS Length = 506 Score = 233 bits (593), Expect = 2e-59, Method: Composition-based stats. Identities = 100/412 (24%), Positives = 172/412 (41%), Gaps = 55/412 (13%) Query: 92 LPQILDLLHRLNSPWAEQARQLVDANSTIT----SALHTLFLQRWRLSLIVQATTLNQQL 147 L QI D+L S + + ++L A+ L W + ++ Sbjct: 98 LEQIFDMLDDF-SRFEPELKKLARGKMAFYYQQFKAILESTLDLWHRRTSKETPRSTRRA 156 Query: 148 LEEERE---------------QLLSEVQERMTLSGQLEPILADNNTA----------AGR 182 +E+ + + L + + LSG + + + GR Sbjct: 157 AQEKVDIVERVSRFENDKASRKFLDLLAGSVLLSGIMGQVSNVEDHLESLEMLSLLYPGR 216 Query: 183 LWDMSAGQLKRGDYQLIVKYGEFLNEQPELKRLAEQLGRSREAKSIPRNDAQMETFRTMV 242 WD S +L R + + KY + + +LK++ + +GR +ME + Sbjct: 217 GWDRSMLELHRVYFANLHKYSKIVERNEDLKKILDTIGR-----------IEMEYGSRRL 265 Query: 243 REPATVPEQVDGLQQSDDILRLLPPELATLGITELEYEFYRRLVEKQLLTYRLHGESWRE 302 + +V + S D+ +LP E L L F+ +EK+LLTY L G +W Sbjct: 266 SLSSYSHSEVYSVTTSGDLQHMLPVESVKLQDETLRNLFFAHWMEKKLLTYELKGVNWT- 324 Query: 303 KVIERPVVHKDYDEQPRGPFIVCVDTSGSMGGFNEQCAKAFCLALMRIALAENRRCYIML 362 D ++ RGP + VDTSGSM G E AK+ LAL+R + E+R + L Sbjct: 325 ----------DDSKKNRGPMVAMVDTSGSMHGDPEIVAKSIILALVRRMMKESRDVKVYL 374 Query: 363 FSTEIVRYELSGPQGIEQA---IRFLSQQFRGGTDLASCFRAIMERLQSREWFDADAVVI 419 FS+E +E+ + A + FLS F GGTD + R +E L+ +++ +AD + I Sbjct: 375 FSSEGQTHEIEITDNKKMATEFLDFLSYTFEGGTDFDTALREGVESLKKKQYVNADILFI 434 Query: 420 SDFIAQRLPDDVTSKVKELQRVHQHRFHAVAMSAHGKPGIMRIFDHIWRFDT 471 +D ++ V S +++++R + R + + GI R DHI+ Sbjct: 435 TDGLSVVNDKYVISGLEQMKRENGTRLFTIIVGNDNAGGIDRFSDHIFILGK 486 >UniRef50_A4S4M8 Predicted protein n=4 Tax=Mamiellales RepID=A4S4M8_OSTLU Length = 535 Score = 232 bits (591), Expect = 3e-59, Method: Composition-based stats. Identities = 87/349 (24%), Positives = 150/349 (42%), Gaps = 10/349 (2%) Query: 134 LSLIVQATTLNQQLLEEEREQLLSEVQERMTLSGQLEPILADNNTAAGRLWDMSAGQLKR 193 + + +L+EE +EQ + + + E + D+ +D++ G ++ Sbjct: 183 ERAKEKNKEIVSRLMEEFKEQWEPAMDKLDKAAKAFEGLDLDDLADGPEGFDLTRGLWQQ 242 Query: 194 GDYQLIVKYGEFLNEQPELKRLAEQLGRSREAKSIPRNDAQMETFRTMV--REPATVPEQ 251 ++ + + L + EL+ + LGR + R Q E + PEQ Sbjct: 243 TGWKELDSLRKKLQDLKELRDMVRSLGRGSGRGPLRRAPRQRERQGFPIGLVRSPMEPEQ 302 Query: 252 VDGLQQSDDILRLLPPELATL--GITELEYEFYRRLVEKQLLTYRLHGESWREKVIERPV 309 GL +SDD+ R++P E+ L + + + R E+ LL+Y G W E+ Sbjct: 303 TSGLCRSDDLSRMMPSEMVLLASSLPQARLLHFARRAERTLLSYERVG--WSEEPAVTVE 360 Query: 310 VHKDYDEQPRGPFIVCVDTSGSMGGFNEQCAKAFCLALMRIALAENRRCYIMLFSTEIVR 369 + GP IVC+DTSGSM G E AKA L MR + ++ R CY+ FS Sbjct: 361 GFETRPAAECGPIIVCLDTSGSMMGARETVAKAMVLECMRQSRSQQRACYLYSFSGPGDC 420 Query: 370 YELS---GPQGIEQAIRFLSQQFRGGTDLASCFRAIMERLQSREWFDADAVVISDFIAQR 426 EL G+ + FLS F GGTD+ F + RL EW +AD ++++D + Sbjct: 421 QELELKLNAAGLYGLLEFLSGSFHGGTDVDEPFNRALARLNEAEWSNADILLVTDGEIKP 480 Query: 427 LPDDVTSKVKELQRVHQHRFHAVAMSAHGKPGIMR-IFDHIWRFDTGMR 474 + + + + E + + H + + G ++ I H+ F + Sbjct: 481 PDETLIANLNEAKEEMGLKVHGLLVGDAGNAEVVESICTHVHAFKSWTA 529 >UniRef50_A4Y9K4 von Willebrand factor, type A n=3 Tax=Shewanella RepID=A4Y9K4_SHEPC Length = 528 Score = 231 bits (589), Expect = 5e-59, Method: Composition-based stats. Identities = 109/472 (23%), Positives = 204/472 (43%), Gaps = 39/472 (8%) Query: 17 GLIEEMIIALLASPQLAVFFEKFPRLKAAITDDVPRWREALRSRLKDARVPPELTEEVMC 76 + ++ II + + +A + + P++ A+ D+ ++ + + Sbjct: 74 EVTQQQIITTVEASGVARYCKDNPQVTDALITDLL-----IKLEGAQLQS-------LAL 121 Query: 77 YQQSQLLSTPQFIVQLPQILDLLHRLNSPWAEQ-ARQLVDANSTITSALHTLFLQRWRLS 135 +Q +++ + + QL + + + +Q A +L D + L W Sbjct: 122 SRQLTIIAENEALEQLRLEFEAIQKKRRSSKKQKAMELND--AQCLDIRIQAELNAWVQL 179 Query: 136 LIVQATTLNQQLLEEEREQLLSEVQERMTLSGQLEPILADNNTAAGRLWDMSAGQLKRGD 195 + L +L + ER+ + +LE + D G WD+S G + Sbjct: 180 VNSAGGQLF---------ELPAIWSERLEIWQELEEVFTDLGLLTGLGWDLSQGLFQSHG 230 Query: 196 YQLIVKYGEFLNEQPELKRLAEQLGRSREAKSIPRNDAQMETFRTMVRE-----PATVPE 250 + +V+ + + + P+L+ + E LG ++ + P + + + R VP Sbjct: 231 WMNLVRLQKIVKQIPQLREVIETLGSMKDTEGEPIIEEIISRMSVIFRHEVEVTTPLVPM 290 Query: 251 QVDGLQQSDDILRLLPPELATLGITELEYEFYRRLVEKQLLTYRLHGESWREKVIERPVV 310 + G+ +SD I R+LP E A G L+ ++ R E LL+Y + G +V E+ Sbjct: 291 ETKGITRSDSISRMLPQEAAFFGHPVLKKLWHARRAEHALLSYAVEGTELITEVTEQEQE 350 Query: 311 HKDYD-----EQPRGPFIVCVDTSGSMGGFNEQCAKAFCLALMRIALAENRRCYIMLFST 365 +K+ + RGP IVC+DTSGSM G E AKA L + +A E R C++ LF + Sbjct: 351 NKENKAGNKVNRNRGPMIVCLDTSGSMQGTPENVAKALVLQCISVAKKEKRACFVYLFGS 410 Query: 366 --EIVRYELSGPQ-GIEQAIRFLSQQFRGGTDLASCFRAIMERLQSREWFDADAVVISDF 422 E+ EL+ + G+EQ I FLS F GGTD+ +ER ++W AD +++SD Sbjct: 411 KGEVKEMELTPDKAGLEQMILFLSMSFGGGTDVEGPLNMALERSDEKQWQQADILLVSDG 470 Query: 423 IAQRLPDDVTSKVKELQRVHQHRFHAVAMSAHGKPGIMRIFDHIWRFDTGMR 474 ++ K+ + H V + P + +I + + +F + + Sbjct: 471 EFSVSS-GLSRKISNRKEQRGMSVHGVVIGGRLSP-MDKICEPLHQFSSWLD 520 >UniRef50_Q5LDB9 Putative uncharacterized protein n=11 Tax=Bacteroides RepID=Q5LDB9_BACFN Length = 419 Score = 230 bits (587), Expect = 7e-59, Method: Composition-based stats. Identities = 82/439 (18%), Positives = 169/439 (38%), Gaps = 33/439 (7%) Query: 32 LAVFFEKFPRLKAAITDDVPRWREALR-SRLKDARVPPELTEEVMCYQQSQLLSTPQFIV 90 ++ + P L+ + +W L D+ + + S ++ Sbjct: 11 MSYYQHTQPSLQEFYSHYATQWEHFYEGHELTDSAF-------LRFLENSAYPLQMKYNR 63 Query: 91 QLPQILDLLHRLNSPWAEQARQLVDANSTITSALHTLFLQRWRLSLIVQATTLNQQLLEE 150 LN + + S L LF +W L + + Sbjct: 64 G---------DLNLQYYIDRFHTLKKRSKEWKHLRNLFFDKWYHLLANNEYNYQIERINN 114 Query: 151 EREQLLSEVQERMTLSGQLEPILADNNTAAGRLWDMSAGQLKRGDYQLIVKYGEFLNEQP 210 E+ + A +W + + + + Y E P Sbjct: 115 LCERFYR--------LQKNIADQLPQRGNARLMWLLRT---HQELAKQLFHYDEIAKNHP 163 Query: 211 ELKRLAEQLGRSREAKSIPRNDAQMETFRTMVREPATVPEQVDGLQQSDDILRLLPPELA 270 ++ L + LG+ + + + G+ + +D+ LLP E Sbjct: 164 AIRELTKILGKQHY--GKEKKFRMVAGIHREQIITHATKSDITGVCEGNDLNSLLPIEYC 221 Query: 271 TLGITELEYEFYRRLVEKQLLTYRLHGESWREKVIERPVVHKDYDEQPRGPFIVCVDTSG 330 L L+ F+ R +K+L + + ++ + + + E+ GPFI+CVDTSG Sbjct: 222 YLSDPALQPLFFERFNKKKLQMMDYESKD-QHRIKDIKIQGNEIVEEQSGPFIICVDTSG 280 Query: 331 SMGGFNEQCAKAFCLALMRIALAENRRCYIMLFSTEIVRYELSG-PQGIEQAIRFLSQQF 389 SM G E+ K+ LA+ + ++R+CY++ FS +I E+ Q I++ FL Q F Sbjct: 281 SMSGEREEFVKSAILAIAELTEQQDRKCYLINFSNDIACIEIERLGQNIQELANFLCQSF 340 Query: 390 RGGTDLASCFRAIMERLQSREWFDADAVVISDFIAQRLPDDVTSKVKELQRVHQHRFHAV 449 GGTDL + L+++ + +AD V++SDF L ++++ ++K++ + ++ +A+ Sbjct: 341 HGGTDLTPALLHAIYILKTKSYRNADLVMMSDFEMPPLNEELSEEIKKI-KQNKTHLYAL 399 Query: 450 AMSAHGKPGIMRIFDHIWR 468 ++ + + + + W Sbjct: 400 SVHKQSENTYLNVCNKFWF 418 >UniRef50_C4KA81 von Willebrand factor type A (VWA) domain-containing protein n=1 Tax=Thauera sp. MZ1T RepID=C4KA81_THASP Length = 581 Score = 230 bits (586), Expect = 9e-59, Method: Composition-based stats. Identities = 95/350 (27%), Positives = 145/350 (41%), Gaps = 30/350 (8%) Query: 153 EQLLSEVQERMTLSGQLEPILADNNTAAGRL-WDMSAGQLKRGDYQLIVKYGEFLNEQPE 211 + ++ ER +L D WD G L+ D++ +++ + PE Sbjct: 164 DSFAADWAERCGEIDELVGAFGDLGDLLDNARWDALRGLLRSTDWREVLRIRALIEGLPE 223 Query: 212 LKRLAEQLGRS----------REAKSIPRNDAQMETFRTMVREPATVPEQVDGLQQSDDI 261 L R+ LGR+ R ++ + + VR P P + G+Q+S I Sbjct: 224 LARILRALGRACPTDEDAESSRALHAVVEHTEIQRSVSHRVRVPDL-PGETRGVQRSGRI 282 Query: 262 LRLLPPELATLGITELEYEFYRRLVEKQLLTYRLHGESWREKVIERPVVHKDYDEQP--- 318 R+LP E LG L ++ R E+ LL Y + + PV+ P Sbjct: 283 ARMLPAEATLLGHPRLRLVWHARRAERTLLAYEDDDHLQEDCLRPAPVLRPSQRPAPARR 342 Query: 319 --RGPFIVCVDTSGSMGGFNEQCAKAFCLALMRIALAENRRCYIMLFS--TEIVRYELS- 373 +GP +VCVDTSGSM G E AKA L +R A A R C + F E+V EL Sbjct: 343 LEQGPMLVCVDTSGSMQGGAEAVAKAVVLEAVRCAHARRRACRVYAFGGPDEVVEMELGV 402 Query: 374 GPQGIEQAIRFLSQQFRGGTDLASCFRAIMERLQSREWFDADAVVISDFIAQRLPDDVTS 433 G+ + RFL Q F GGTD+ + + RL W AD ++ SD P + + Sbjct: 403 DVDGVGRLARFLGQGFGGGTDICAPLERALARLDEAGWQLADLLIASDGEFGATP-ALAA 461 Query: 434 KVKELQRVHQHRFHAVAMSAHGKPGIMRIFDHIWRFDTGMRSRLLRRWRR 483 +V+ +R R + + G++ + D I +R WRR Sbjct: 462 RVEAARRERGLRVQGILIGDRETIGLLELADDIH---------WVRDWRR 502 >UniRef50_B7DQJ9 von Willebrand factor type A n=1 Tax=Alicyclobacillus acidocaldarius LAA1 RepID=B7DQJ9_9BACL Length = 484 Score = 229 bits (584), Expect = 2e-58, Method: Composition-based stats. Identities = 79/464 (17%), Positives = 156/464 (33%), Gaps = 77/464 (16%) Query: 13 VSEEGLIEEMIIALLASPQL------AVFFEKFPRLKAAITDDVPRWREALRSRLKDARV 66 + E + + L + Q+ L+ A+ DDV + + + A+ Sbjct: 69 IMESLMRQSAWKNLRQTTQMDEYSAALGALHLRESLEEALPDDVRQAARQVEELARQAQ- 127 Query: 67 PPELTEEVMCYQQSQLLSTPQFIVQLPQILDLLHRLNSPWAEQARQLVDANSTITSALHT 126 +Q++ + P D L +QA +L+ T Sbjct: 128 --------HLLEQAEAY--EEVADGHPSARDEAETLR----QQAAELMQTLQRATDQFEQ 173 Query: 127 LFLQRWRLSLIVQATTLNQQLLEEEREQLLSEVQERMTLSGQLEPILADNNTAAGRLWDM 186 F + + + R+ L +E ++ + + Sbjct: 174 AFDAQ------------SGSIGRALRQALEQAAEEAQDTQRAMQ------------AFGV 209 Query: 187 SAGQLKRGDYQLIVKYGEFLNEQPELKRLAEQLGRSREAKSIPRNDAQMETFRTMVREPA 246 G K + ++ L P ++ +A G + R + + Sbjct: 210 GTGDGKPVSGKERLELAHILQTNPHVREIARMAGGMQMMALNKRKNRTLHP--------- 260 Query: 247 TVPEQVDGLQQSDDILRLLPPELATLGITELEYEFYRRLVEKQLLTYRLHGESWREKVIE 306 P ++ + DD+ +LP EL L E EF +R E++LL Y L G Sbjct: 261 --PTEIVNITMGDDLANVLPSELLLLADPATEDEFIQRFAERRLLQYDLRG--------- 309 Query: 307 RPVVHKDYDEQPRGPFIVCVDTSGSMGGFNEQCAKAFCLALMRIALAENRRCYIMLFSTE 366 ++ + +GP +VC+D SGS G E K LAL+ IA E R ++ F++ Sbjct: 310 -------FEREGQGPIVVCIDESGSTAGMVEMWEKGIALALLAIARREKRAFAVVHFASA 362 Query: 367 ----IVRYELSGPQGIEQAIRFLSQQFRGGTDLASCFRAIMERLQSREWFDADAVVISDF 422 + ++ + ++ F GGTD S R + + + D V I+D Sbjct: 363 HEIFVQKWLRPKDASPTELVQMAQHFFNGGTDFESPLREAVRIMDEAAFQKGDIVFITDG 422 Query: 423 IAQRLPDDVTSKVKELQRVHQHRFHAVAMSAHGKPGIMRIFDHI 466 ++ + + + ++ + +V + + + D I Sbjct: 423 ESRVSDEFLHGEYARVKSEKAFQVISVVIG-YDDRSVRPFSDAI 465 >UniRef50_C3XJE4 Putative uncharacterized protein (Fragment) n=1 Tax=Helicobacter bilis ATCC 43879 RepID=C3XJE4_9HELI Length = 429 Score = 229 bits (584), Expect = 2e-58, Method: Composition-based stats. Identities = 84/398 (21%), Positives = 149/398 (37%), Gaps = 51/398 (12%) Query: 65 RVPPELTEEVMCYQQ--SQLLSTPQFIVQLPQILDLLHRLNSPWAEQARQLVDANSTITS 122 P + EE+ Q+ + + + + + + T Sbjct: 43 NFNPFMQEELALNQRKNIESCDDIEAQQDIQAQDLQAFERFNATHKAMLDSMLHQQTDIE 102 Query: 123 ALHTLFLQRWRLSLIVQATTLNQQLLEEEREQLLSEVQERMTLS---------------- 166 + L W L + + + E+ +EQ + +++ + Sbjct: 103 SARKYALAYWDNLLRDKKQKWLESMKEKLKEQYIQAIKDFLDYLLSILETLLRVYGLCGI 162 Query: 167 ----------------------GQLEPILADNNTAAGRLWDMSAGQLKRGDYQLIVKYGE 204 QL P + + +D S G + +++ I + + Sbjct: 163 KKPNNDLALAQILNEAKQSFDMKQLCPNTYEESDIETNGYDYSKGHKRFINFKEINAFIK 222 Query: 205 FLNEQPELKRLAEQLGRSREAKSIPRNDAQMETFRTMVREPATVPEQVDGLQQSDDILRL 264 + +L+++A LGR E + + ++ + E++ G+ D+ L Sbjct: 223 HIQTSKDLRKIAALLGREEENGNKKIEHSSID----QSIKTHNHKEEMSGVTLGRDLANL 278 Query: 265 LPPELATLGITELEYEFYRRLVEKQLLTYRLHGESWREKVIERPVVHKDYDEQPRGPFIV 324 LP ELA L LE F + ++ +L + G +K + G I+ Sbjct: 279 LPQELAMLKDENLELLFNLKYIQNRLFCFEKQGYETIQKE-------HYKMAKNEGAMII 331 Query: 325 CVDTSGSMGGFNEQCAKAFCLALMRIALAENRRCYIMLFSTEIVRYELSGPQGIEQAIRF 384 CVDTS SM G E AKA L L A +NR CY++ FST+I ELSG I F Sbjct: 332 CVDTSSSMSGNREYLAKAITLFLATKASMQNRACYLINFSTDIETMELSGKDNARNLINF 391 Query: 385 LSQQFRGGTDLASCFRAIMERLQSREWFDADAVVISDF 422 L+ F GGTD+A + ++++Q + +D +VISD Sbjct: 392 LAMSFNGGTDVAPALKEGLKKMQEDSFKQSDLIVISDG 429 >UniRef50_D2U0I8 Putative uncharacterized protein n=1 Tax=Arsenophonus nasoniae RepID=D2U0I8_9ENTR Length = 330 Score = 226 bits (575), Expect = 2e-57, Method: Composition-based stats. Identities = 136/328 (41%), Positives = 220/328 (67%), Gaps = 4/328 (1%) Query: 1 MLTLDTLNVMLAVSEEGLIEEMIIALLASPQLAVFFEKFPRLKAAITDDVPRWREALRSR 60 ML + T++++L+++E LIEE+++ LLA+PQL +FFEK+P LK+ + +D+ W++ L + Sbjct: 1 MLNIATIDMLLSINELELIEEIVLTLLATPQLVIFFEKYPNLKSILLNDLLAWKKNLYRQ 60 Query: 61 LKDARVPPELTEEVMCYQQSQLLSTPQFIVQLPQILDLLHRLNSPWAEQARQLVDANSTI 120 L++ VP +LTEE YQQ+ + T +F LP ++ L + S + ++A L + S Sbjct: 61 LQETLVPIKLTEEFALYQQNLAIDTTKFFSNLPVTINKLTEIASTFVQEANYLQERISH- 119 Query: 121 TSALHTLFLQRWRLSLIVQATTLNQQLLEEEREQLLSEVQERMTLSGQLEPILADNNTAA 180 A +LF+QRWRL+LI++ TT N+ LLE E+EQLL+E+++R+ L+G L +N + Sbjct: 120 DPAGQSLFIQRWRLNLIIEVTTFNKLLLEREKEQLLAELEQRLKLTGNLIETFNQDNHSV 179 Query: 181 GRLWDMSAGQLKR--GDYQLIVKYGEFLNEQPELKRLAEQLGRSREAKSIPRNDAQMETF 238 G+LWD+S G L + + QL+++Y FL +QPEL++LAE LGR + K + +E+ Sbjct: 180 GKLWDISKGVLTQSSNNIQLLIQYSHFLQQQPELEKLAELLGRRQSLKPKQKQQQMLESI 239 Query: 239 RTMVREPATVPEQVDGLQQSDDILRLLPPELATLGITELEYEFYRRLVEKQLLTYRLHGE 298 ++ + P +PEQ+ G+ +DILRLLP ELA LG+ ELE+EFYR+LVEKQLLTYRL G+ Sbjct: 240 ISVEKIPDQIPEQISGINHGNDILRLLPSELALLGLEELEFEFYRKLVEKQLLTYRLQGD 299 Query: 299 SWREKVIERP-VVHKDYDEQPRGPFIVC 325 +W+++ I RP + ++ + G ++ Sbjct: 300 NWQQRKILRPAIKYERSTLRVSGAAVLL 327 >UniRef50_A6C7T1 Putative uncharacterized protein n=1 Tax=Planctomyces maris DSM 8797 RepID=A6C7T1_9PLAN Length = 313 Score = 221 bits (564), Expect = 4e-56, Method: Composition-based stats. Identities = 82/304 (26%), Positives = 143/304 (47%), Gaps = 12/304 (3%) Query: 172 ILADNNTAAGRLWDMSAGQLKRGDYQLIVKYGEFLNEQPELKRLAEQLGRSREAKSIPRN 231 + + WD+SAG K + + KY + ++ P+++++ E+LGR + ++ + + Sbjct: 1 MFGPLDRLLSPGWDLSAGIFKYRGWGDLKKYRDLIDRIPQIRQMIEELGRLQASEEMDDD 60 Query: 232 DAQMETFRTMVR--------EPATVPEQVDGLQQSDDILRLLPPELATLGITELEYEFYR 283 + F ++ R E Q G+++S D +R++P E L+ ++ Sbjct: 61 PTYADAFNSLRRTTEEQREVEHPLARHQAQGIERSADFMRMIPSEAMLRRRPGLKRLWHA 120 Query: 284 RLVEKQLLTYRLHGESWREKVIERPVVHKDYDEQPRGPFIVCVDTSGSMGGFNEQCAKAF 343 +L E+ LLTYR+ G E ++ RGP IVCVDTSGSM G E AKA Sbjct: 121 KLAERGLLTYRVRGTYVDRVSTEVEEQQPQSKKRIRGPIIVCVDTSGSMSGRPEAVAKAL 180 Query: 344 CLALMRIALAENRRCYIMLFSTEIVRYELS---GPQGIEQAIRFLSQQFRGGTDLASCFR 400 L RIA AE R C + FS E P G++ + FL+ F GGTD+++ F Sbjct: 181 TLEACRIAHAEQRPCLLFSFSGSGQYVEHELSLSPDGLQSLLEFLTMNFDGGTDISTPFE 240 Query: 401 AIMERLQSREWFDADAVVISDFIAQRLP-DDVTSKVKELQRVHQHRFHAVAMSAHGKPGI 459 + RL++ EW AD +++SD + D + + + ++ R + + + + Sbjct: 241 KALARLRTAEWERADILLVSDGAFSKSQVDALKPALDDAKKRLGLRVSGLLVGNYSSGPM 300 Query: 460 MRIF 463 + Sbjct: 301 NSLC 304 >UniRef50_Q0W1N2 Putative uncharacterized protein n=1 Tax=uncultured methanogenic archaeon RC-I RepID=Q0W1N2_UNCMA Length = 477 Score = 220 bits (559), Expect = 1e-55, Method: Composition-based stats. Identities = 83/308 (26%), Positives = 141/308 (45%), Gaps = 22/308 (7%) Query: 164 TLSGQLEPILADNNTAAGRLWDMSAGQLKRGDYQLIVKYGEFLNEQPELKRLAEQLGRSR 223 L+ +P+ + A GR WD + + + D I +Y + + P+L +L E LGRS Sbjct: 185 ELADSADPLELMSLLAGGRGWDYAMIEQHKDDLYNIKRYSDIVRRNPDLMKLIEDLGRSS 244 Query: 224 EAKSIPRNDAQMETFRTMVREPATVPEQVDGLQQSDDILRLLPPELATLGITELEYEFYR 283 E ++T V + +V + S D+ LLP EL L + L+Y F+ Sbjct: 245 EG---------LDTGSGKVLHSGRL--EVHSIVTSSDLYYLLPSELIKLQDSILQYLFFA 293 Query: 284 RLVEKQLLTYRLHGESWREKVIERPVVHKDYDEQPRGPFIVCVDTSGSMGGFNEQCAKAF 343 R +E +LLTY + P D + +GP I VDTSGSM G A+A Sbjct: 294 RWIEGKLLTY----------HLTDPGKSDTGDCKRKGPVIALVDTSGSMDGIPGILARAV 343 Query: 344 CLALMRIALAENRRCYIMLFSTEIVRYELSGPQGIEQ-AIRFLSQQFRGGTDLASCFRAI 402 LA +R+ L R+ ++LFS+ E+ P+G + FL F GGTD + +A Sbjct: 344 TLATVRMFLQRGRKIRVVLFSSVGQLDEIDLPEGSTPGFLEFLRSSFGGGTDFNTALKAG 403 Query: 403 MERLQSREWFDADAVVISDFIAQRLPDDVTSKVKELQRVHQHRFHAVAMSAHGKPGIMRI 462 + L++R++ AD + ++D +++ + + + L+ + V + G+ I Sbjct: 404 LGALKARQYASADIMFVTDGMSRITDEALIEDWRRLKEASGSQIFTVIVGNDQAGGLEDI 463 Query: 463 FDHIWRFD 470 D ++ Sbjct: 464 SDRVYILG 471 >UniRef50_C8SB00 von Willebrand factor type A n=1 Tax=Ferroglobus placidus DSM 10642 RepID=C8SB00_FERPL Length = 469 Score = 220 bits (559), Expect = 1e-55, Method: Composition-based stats. Identities = 110/494 (22%), Positives = 184/494 (37%), Gaps = 74/494 (14%) Query: 30 PQLAVFFEKFP-RLKAAITDDVPRWRE------ALRSRLKDA--RVPPELTEEVMCYQ-- 78 L +EK P +K DD+ R ++ ++K+A + PP +T+ + Sbjct: 3 EDLLKIYEKVPYTVKKDRLDDMLFGRHRDRIVEKVKDKVKEAIPQFPPLVTDTFNIFHKP 62 Query: 79 QSQLLST----PQFIVQLPQILDLLHRLNSPWAEQARQLVDANSTITSALHTLFLQRW-- 132 Q L P+F V + +++ ++ L D NS I +A+ T L Sbjct: 63 DPQFLDDSQIAPEFRVNKRVLEKIMNTDTFSELKETTTLDDVNSAIATAILTERLYEELK 122 Query: 133 -RLSLIVQATTLNQQLLE-------EEREQLLSEVQERMTLS--------------GQLE 170 +L I + T QQL E+ +Q L +++E E Sbjct: 123 SKLGEIKEHTEKIQQLRNQLPGKSGEDVKQALQQIEEHSRALQGIVTQGAVSVAVRKAQE 182 Query: 171 PILADNNTAAGRLWDMSAGQLKRGDYQLIVKYGEFLNEQPELKRLAEQLGRSREAKSIPR 230 N + G+ + D + +K L LK++ E LG+ R Sbjct: 183 EFEKVQNAMVALGFGNEPGKPVQVDPETAIKLASELKSNERLKKMVELLGKMRNLLK--- 239 Query: 231 NDAQMETFRTMVREPATVPEQVDGLQQSDDILRLLPPELATLGITELEYEFYRRLVEKQL 290 T +P ++ + +I RLLP E+ L F R E +L Sbjct: 240 --------STAKAKPRKSMLELHSITSGREIERLLPSEILKLRK--YRVVFLRDYYEGRL 289 Query: 291 LTYRLHGESWREKVIERPVVHKDYDEQPRGPFIVCVDTSGSMGGFNEQCAKAFCLALMRI 350 L Y L K +++ +GP ++ +D SGSM G EQ AKA LA + I Sbjct: 290 LHYDL----------------KRREKESKGPIVIALDLSGSMSGAKEQWAKAVSLATIDI 333 Query: 351 ALAENRRCYIMLFSTEIVRYELSGPQGI-EQAIRFLSQQFRGGTDLASCFRAIMERLQS- 408 A+ E R I+ F I ++ Q E + + GGT+ + M+ ++ Sbjct: 334 AVKERRPWAIIAFDAGIKDVKVFRKQPKPEDVLGIMRIGASGGTNFEKPLKEAMKIVEDC 393 Query: 409 REWFDADAVVISDFIAQRLPDDVTSKVKELQRVHQHRFHAVAMSAHGKPGIMRIF-DHIW 467 RE+ AD + ISD + + + +R R V + G P IMR+F D ++ Sbjct: 394 REFTKADILFISDGDCKVGWE-FLEEFTRFKRRRNVRVTGVLI--SGIPRIMRMFCDEVF 450 Query: 468 RFDTGMRSRLLRRW 481 + + Sbjct: 451 ALKERLDDKAAEAI 464 >UniRef50_C0ZE04 Putative uncharacterized protein n=1 Tax=Brevibacillus brevis NBRC 100599 RepID=C0ZE04_BREBN Length = 572 Score = 211 bits (536), Expect = 6e-53, Method: Composition-based stats. Identities = 94/449 (20%), Positives = 173/449 (38%), Gaps = 54/449 (12%) Query: 16 EGLIEEMIIALLASPQLAVFFEKFPRLKAAITDD-VPRWREALRSRLKDARVPPELTEEV 74 GL ++ QL+ + + K + + + + + + + P + EV Sbjct: 126 TGLKDQQGQEGSQQAQLSELLTEKQKAKLELVGYTLQQGKRVVEDKQEAMDTKPLVRAEV 185 Query: 75 MCYQQSQLLSTPQFIVQLPQ---ILDLLHRLNSPWAEQARQLVDANSTITSAL--HTLFL 129 + Q + VQ + +L + +L ++ +QL A+ L Sbjct: 186 LSLQNRITELQEEMKVQFTKRTKLLQKVKKLEGELVQREKQLDRLQKQEKEAIAAFEKEL 245 Query: 130 QRWRLSLIVQATTLNQQLLEEEREQLLSEVQERMTLSGQLEPILADNNTAAGRLWDMSAG 189 +W + + +E TL +E + + A R W G Sbjct: 246 GQWLEQSLKATLS----------------TEELDTLF--VEEVFTASQRFANRSWGHELG 287 Query: 190 QLKRGDYQLIVKYGEFLNEQPELKRLAEQLGRSREAKSIPRNDAQMETFRTMVREPATVP 249 +L+R ++ +K+ E L P+L ++GR + R + + + F P Sbjct: 288 KLRRQSFEQYLKWIEKLKRHPDLVAFLNEVGRQVHRFRVKRKEIRSKHF----------P 337 Query: 250 EQVDGLQQSDDILRLLPPELATLGITELEYEFYRRLVEKQLLTYRLHGESWREKVIERPV 309 E+ L+QS DI +LP E L + E F + +E++L+TY G Sbjct: 338 EEYYDLRQSGDIAHMLPGEAVLLADPDFENYFMLKWLEQKLMTYDTSGW----------- 386 Query: 310 VHKDYDEQPRGPFIVCVDTSGSMGGFNEQCAKAFCLALMRIALAENRRCYIMLFS--TEI 367 +E P+GP I +DTS SM G + A+ F + +++ E R ++LF EI Sbjct: 387 ----VEEPPKGPVICMLDTSHSMRGSKLRLAQIFIMTFAALSMLEKRDFILLLFGAKGEI 442 Query: 368 VRYELSGPQGIEQAIR-FLSQQFRGGTDLASCFRAIMERLQSRE-WFDADAVVISDFIAQ 425 L + A F GGT + + +E ++ + W AD V+++D I Sbjct: 443 KEQPLYHKKPDWPAFYGLAQMAFGGGTHFDAPMKRAIELVEKEQAWRGADFVMVTDGIGG 502 Query: 426 RLPDDVTSKVKELQRVHQHRFHAVAMSAH 454 P V K+ L + Q R H++ + + Sbjct: 503 ISP-YVQEKLIFLGQHKQVRLHSLIVGSA 530 >UniRef50_C6J3I5 Putative uncharacterized protein n=1 Tax=Paenibacillus sp. oral taxon 786 str. D14 RepID=C6J3I5_9BACL Length = 409 Score = 204 bits (520), Expect = 5e-51, Method: Composition-based stats. Identities = 82/427 (19%), Positives = 151/427 (35%), Gaps = 45/427 (10%) Query: 65 RVPPELTEEVMCYQQSQLLSTPQFIVQLPQILDLLHRLNSPWAEQA-----RQLVDANST 119 VP L ++ Q + ++ DL L + + ++ V + Sbjct: 18 EVPEGLEMNHQLMERVMSDEGYQEFREFTRLDDLAAALGTTKYSETVLGWVKEQVQRDQN 77 Query: 120 ITSALHTLFLQRW--RLSLIVQATTLNQQLLEEEREQLLSEVQERMTLSGQLEPILADNN 177 + AL + ++ Q E + L QE ++ +L Sbjct: 78 LADALQNYMNGKAGASQEASEALSSALNQNGNELSKMLAKAAQEATEAKENVKSLLGGMQ 137 Query: 178 TAAGRLWDMSAGQLKRGDYQLIVKYGEFLNEQPELKRLAEQLGRSREAKSIPRNDAQMET 237 +G +LK+ + + E L+ ++K +A+ GR + + + + Sbjct: 138 AGSGES------ELKKVPLKDQLILAERLSHDKKMKDIAKWAGRMKVIANQKQRSKHKDA 191 Query: 238 FRTMVREPATVPEQVDGLQQSDDILRLLPPELATLGITELEYEFYRRLVEKQLLTYRLHG 297 +G++Q + I +LLP EL T + +F RR VE Q L Y G Sbjct: 192 INR------------NGIKQGNSIEQLLPMELGTYASPITKMDFLRRYVEGQTLQYDTKG 239 Query: 298 ESWREKVIERPVVHKDYDEQPRGPFIVCVDTSGSMGGFNEQCAKAFCLALMRIALAENRR 357 ++ +GP I+C+D SGSM G + +K F LALM IA + R Sbjct: 240 ----------------PEQLGKGPIILCLDQSGSMSGQ-DTISKGFALALMSIARKQRRD 282 Query: 358 CYIMLFSTEIVRYELSGPQGI--EQAIRFLSQQFRGGTDLASCFRAIMERLQSREWFDAD 415 + FS+ + I + I+ + GGT RA + ++ + AD Sbjct: 283 FAWIPFSSHAAAPLIYERGTIVVQDMIQLATIFLGGGTSFEPPLRAASQVIEQSRFNQAD 342 Query: 416 AVVISDFIAQRLPDDVTSKVKELQRVHQHRFHAVAMSAHGKPGIMRIFDHIWRFDTGMRS 475 V ++D + + EL+ ++ + G+ D I R T Sbjct: 343 IVFVTDGESHV-SERFLQSWNELKSKKGFSVLSLLLGRESIQGVEGFSDRIVRASTFEDE 401 Query: 476 RLLRRWR 482 + + + Sbjct: 402 SVYQAFE 408 >UniRef50_Q58221 Uncharacterized protein MJ0811 n=4 Tax=Methanocaldococcus RepID=Y811_METJA Length = 439 Score = 203 bits (517), Expect = 9e-51, Method: Composition-based stats. Identities = 76/455 (16%), Positives = 167/455 (36%), Gaps = 46/455 (10%) Query: 23 IIALLASPQLAVFFEKFPRLKAAITDDVPRWREA---LRSRLKDARVPPELTEEVMCYQQ 79 + ++ + + + + + +L + V + Sbjct: 1 MKNIIKHDAYDKKAYERFLKNSKYLQKLISYYSQYHPIHEKLAEDTFYAFFKYVVEFNEY 60 Query: 80 SQLLSTPQFIVQLPQILDLLHRLNSPWAEQARQLVDANSTITSALHTLFLQRWRLSLIVQ 139 +F + + + + ++ +L + N+ + + F +++ +L + Sbjct: 61 V----EEKFKINKAILEGAIKNIEYEKSKLLTELDEVNAGTATIM---FCEKFFENLKLA 113 Query: 140 ATTLNQQLL--EEEREQLLSEVQE--RMTLSGQLEPILAD-NNTAAGRLWDMSAGQLKRG 194 + E + E L +++E + T+ E + A + G K Sbjct: 114 KLNKELKKFASEGKGEGLEDKLKEIAKNTMKDIAEEVSEVIQGFNAVENFGKGEGDKKLL 173 Query: 195 DYQLIVKYGEFLNEQPELKRLAEQLGRSREAKSIPRNDAQMETFRTMVREPATVPEQVDG 254 + +K + + + +++ + ++LG+ R + ++ Sbjct: 174 SPEDRIKLADKILQNKKIREIVKKLGKLRLLA-----------INEYKSKIKHYSGEIYS 222 Query: 255 LQQSDDILRLLPPELATLGITELEYEFYRRLVEKQLLTYRLHGESWREKVIERPVVHKDY 314 + D+ LLP E+ L L Y+F RR V+K+LL Y + + Sbjct: 223 TKIGRDLKHLLPKEIVNLSDEILYYDFLRRFVDKKLLIYDIQNKL--------------- 267 Query: 315 DEQPRGPFIVCVDTSGSMGGFNEQCAKAFCLALMRIALAENRRCYIMLFSTEIVRYELSG 374 E+ +GP I+ +D SGSM G E KA L+++ IA ENR Y + F + + Sbjct: 268 -EKQKGPIIILLDHSGSMYGDREIWGKAVALSIIEIAKRENRDIYYIAFDDGVRFEKKIN 326 Query: 375 PQGI--EQAIRFLSQQFRGGTDLASCFRAIMERLQSRE-WFDADAVVISDFIAQRLPDDV 431 P+ I ++ I S F GGT+ M ++ E + +AD ++I+D A+ D Sbjct: 327 PKTITFDEIIEIASLYFGGGTNFIMPLNRAMSIIKEHETFKNADILLITDGYAEVN-DVF 385 Query: 432 TSKVKELQRVHQHRFHAVAMSAHGKPGIMRIFDHI 466 + + + + + +V + + I D + Sbjct: 386 LKEFDKFKNEYNAKLISVFVETFPTETLKAISDEV 420 >UniRef50_C9KWG4 Putative uncharacterized protein n=1 Tax=Bacteroides finegoldii DSM 17565 RepID=C9KWG4_9BACE Length = 454 Score = 203 bits (517), Expect = 1e-50, Method: Composition-based stats. Identities = 78/370 (21%), Positives = 142/370 (38%), Gaps = 38/370 (10%) Query: 101 RLNSPWAEQARQLVDANSTITSALHTLFLQRWRLSLIVQATTLNQQLLEEEREQLLSEVQ 160 LN +L + + L W L+ + Q+ L + Q Sbjct: 120 ELNVDGYNSLLKLDEFDK---DILFQKICDEWSLAYREKILEEKQRYLNSSKIQFE---- 172 Query: 161 ERMTLSGQLEPILADNNTAAGRLWDMSAGQLKRGDYQLIVKYGEFLNEQPELKRLAEQLG 220 S G+ DY+++ +Y + ELK + +G Sbjct: 173 -------------------------NSVGRSNMKDYRMVSRYRTISVKYKELKEIVSCMG 207 Query: 221 RSREAKSIPRNDAQMETFRTMVREPATVPEQVDGLQQSDDILRLLPPELATLGITELEYE 280 R +E + T + + G+++ +D+ L+P E+A L E Sbjct: 208 REKEQAEELDTLIKQYIPET--LSASVAHSDIHGVEEGNDLQALMPTEVALLAEFATEDL 265 Query: 281 FYRRLVEKQLLTYRLHGESWREKVIERPVVHKDYDEQPRGPFIVCVDTSGSMGGFNEQCA 340 F+ + +QL + +S R+K + + Q +GP IV +DTSGSM G E A Sbjct: 266 FFMKYAMRQLQLFSNRSDSVRKK--QESQTKRREPRQIKGPMIVAIDTSGSMSGKAESIA 323 Query: 341 KAFCLALMRIALAENRRCYIMLFSTEIVRYELSGPQGIEQAIRFLSQQFRGGTDLASCFR 400 KA L + ++A ++R+C+++ FS + + ++ F+ F GGTD + Sbjct: 324 KALLLEITQMAKKQHRKCFLLSFSVRAQALDTAHSGNWKKVREFMVSHFSGGTDGEEMLK 383 Query: 401 AIMERLQSREWFDADAVVISDFIAQRLPDDVTSKVKELQRVHQHRFHAVAMSAHGKPGIM 460 + L + AD ++ISDF S++++ Q RF+ + + Sbjct: 384 TALHTLTQENYLMADVLIISDFEFDFCCKPTESRIRKEQ-ERGVRFYGLQIGNGVNVY-E 441 Query: 461 RIFDHIWRFD 470 + D +WR D Sbjct: 442 ELLDKVWRLD 451 >UniRef50_A0KNK1 Putative uncharacterized protein n=1 Tax=Aeromonas hydrophila subsp. hydrophila ATCC 7966 RepID=A0KNK1_AERHH Length = 552 Score = 201 bits (511), Expect = 5e-50, Method: Composition-based stats. Identities = 88/377 (23%), Positives = 151/377 (40%), Gaps = 27/377 (7%) Query: 98 LLHRLNSPWAEQARQLVDANSTITSALHTLFLQRWRLSLIVQATTLNQQLLEEEREQLLS 157 L + A+ D + + + +L + E E++ Sbjct: 192 RLWDWDDSLADNPTPEPDWLAQEDHLKNRPLNEEMIAALRAELAECKAYEAELADEEIPD 251 Query: 158 EVQERMTLSGQ-LEPILADNNTAAGRLWDMSAGQLKRGDYQ----LIVKYGEFLNEQPEL 212 E+++++T GQ L I + L S D + I + L + + Sbjct: 252 EMRKKLTGYGQILGDIDTTYEKSKALLLACSGANFSYNDLKLCKDDIEPLAKQLQQNHAI 311 Query: 213 KRLAEQLGRSREAKSIPRNDAQMETFRTMVREPATVPEQVDGLQQSDDILRLLPPELATL 272 K L ++GR+ E + R P +V G +S+D+ R+LP EL L Sbjct: 312 KELTYKMGRAY----------ISEEKKKQARIPHASKSEVHGTHRSEDLARVLPTELLNL 361 Query: 273 GITELEYEFYRRLVEKQLLTYRLHGESWREKVIERPVVHKDYDEQPRGPFIVCVDTSGSM 332 LE FY R +E+ L+TY L G + + +++ GP + C+DTSGSM Sbjct: 362 EDEALETLFYARFLERNLMTYELQGTTCTS------GEQLELEQKRTGPVVACLDTSGSM 415 Query: 333 GGFNEQCAKAFCLALMRIALAENRRCYIMLFST--EIVRYELSGPQGIEQAIRFLSQQFR 390 G A+A LA+ + E R +++LF E+ Y + + FL Q F Sbjct: 416 SGAPLLKARALLLAVSAVLQQEARSLHVVLFGDNGELREYAIHEENSASGLLHFLRQGFG 475 Query: 391 GGTDLASCFRAIMERLQS-REWFDADAVVISDFIAQRLPDDVTSKVKELQRVHQHRFHAV 449 GGTD + E ++ +E+ AD ++ISD L DD ++ +++ ++V Sbjct: 476 GGTDFETPLNRACEIIRDAKEYEKADILMISDGDC-VLSDDYIEHLQTRKKILDCSIYSV 534 Query: 450 AMSAHGKPGIMRIFDHI 466 HG+ R D + Sbjct: 535 L--CHGQRVADRFSDEV 549 >UniRef50_D0LUP3 Uncharacterized protein containing a von Willebrand factor type A (VWA) domain-like protein n=1 Tax=Haliangium ochraceum DSM 14365 RepID=D0LUP3_HALO1 Length = 536 Score = 195 bits (496), Expect = 3e-48, Method: Composition-based stats. Identities = 70/336 (20%), Positives = 116/336 (34%), Gaps = 40/336 (11%) Query: 150 EEREQLLSEVQERMTLSGQLEPILADNNTAAGRLWDMSAGQLKRGDYQLIVKYGEFLNEQ 209 + R L E ++ + W G R + + + L E Sbjct: 219 DLRAALQQACAELAEALRAVDEVAEAMQEC--DTWGREPGDFGRLPIEEFQRLSQVLRET 276 Query: 210 PELKRLAEQLGRSREAKSIPRNDAQMETFRTMVREPATVPEQVDGLQQSDDILRLLPPEL 269 P ++++ E GR E R+ ++ G+ + RL EL Sbjct: 277 PSVRKIVELAGRWSELLKPRLKRGHSPRGRS----------ELVGVTLGGGLERLCATEL 326 Query: 270 ATLGITELEYEFYRRLVEKQLLTYRLHGESWREKVIERPVVHKDYDEQPRGPFIVCVDTS 329 L L +L E++ L + L G RGP I+ VDTS Sbjct: 327 IKLRHPALRRVLLGQLAERRALVHELRGPDVL----------------GRGPMILVVDTS 370 Query: 330 GSMGGFNEQCAKAFCLALMRIALAENRRCYIMLFS--TEIVRYELSGPQGI-EQAIRFLS 386 GSM G AK+ LAL + R ++ F E+ E++ + + + LS Sbjct: 371 GSMHGARMTMAKSLMLALALHCWEQRRPLRVLTFGAPGEMHESEVAVDEPFWTRLEQCLS 430 Query: 387 QQFRGGTDLASCFRAIMERLQSREWFDADAVVISDFIAQRLPDDVTSKVKELQRVHQHRF 446 F GGTD + E + R W ADAV ++D + + +++ + Sbjct: 431 VAFGGGTDFDGPLLRVCEIVGERPWRRADAVFLTDGEC-CVAEATRAQLARTRARVALNI 489 Query: 447 HAVAMSAHGKPGIMRIFDHIWRFDTGMRSRLLRRWR 482 V + G+ + D +R +R R WR Sbjct: 490 IGVLVGRG--RGLDGVADIAYR------ARDGRGWR 517 >UniRef50_A8IX54 Predicted protein n=1 Tax=Chlamydomonas reinhardtii RepID=A8IX54_CHLRE Length = 604 Score = 194 bits (492), Expect = 8e-48, Method: Composition-based stats. Identities = 101/451 (22%), Positives = 170/451 (37%), Gaps = 22/451 (4%) Query: 16 EGLIEEMIIALLASPQLAVFFEKFPRLKAAITDDVPRWREALRSRLKDARVPPELTEEVM 75 + MII +A + +K+P ++ A+ + L E Sbjct: 28 SEPLRTMIIRAMAKLGVGRLCKKYPSVRDALLKSLLETVVKYEKMLAGIEEEAE------ 81 Query: 76 CYQQSQLLSTPQFI--VQLPQILDLLHRLNSPWAEQARQLVDANSTITSA-LHTLFLQRW 132 ++ + L+ F +L D N + S + A + LQ Sbjct: 82 --EREKDLNGNYFKTAAELKAEEDAKRAANRGNHTGDKHNAPVPSYQSPAEVAAARLQAS 139 Query: 133 RLSLIVQATTLNQQLLEEEREQLLSEVQERMTLSGQLEPILADNNTA-AGRLWDMSAGQL 191 + + + + ++L Q + G+ G +D+ Sbjct: 140 IKAAMAAGASGEKATAFALVKELYGTWQGPVETLGRAGRAFEGLEALLGGDDFDLQGSIW 199 Query: 192 KRGDYQLIVKYGEFLNEQPELKRLAEQLGRSREAKSIPRNDAQMETFRTM--VREPATVP 249 KR + + + L E EL+ L LGR + R Q + Sbjct: 200 KRAGWSQLDELRRKLEELKELRDLVRSLGRGGGWGPLRRAPVQFLDLNARPGLLRTVLEA 259 Query: 250 EQVDGLQQSDDILRLLPPELATLGI----TELEYEFYRRLVEKQLLTYRLHGESWREKVI 305 ++ GL +SDDI RLLP E A L + + FY ++ EK L TY G I Sbjct: 260 QETRGLTRSDDISRLLPAEAALLARGRVVRQAKLLFYAKMAEKALQTYERDGWGEYPTQI 319 Query: 306 ERPVVHKDYDEQPRGPFIVCVDTSGSMGGFNEQCAKAFCLALMRIALAENRRCYIMLFST 365 E P + RGP ++CVDTSGSM G E AKA L MR A + R C++ FS Sbjct: 320 E-PERREIRPTADRGPILLCVDTSGSMRGARETVAKALALECMRAARQQERGCFVFAFSG 378 Query: 366 --EIVRYELS-GPQGIEQAIRFLSQQFRGGTDLASCFRAIMERLQSREWFDADAVVISDF 422 E+ EL+ + + FL + F GG+D + ++RL +W ++D +++SD Sbjct: 379 PAEVREIELNMDAASVNNLLEFLEKMFNGGSDFNEPVKRCLDRLTDAKWANSDILLVSDG 438 Query: 423 IAQRLPDDVTSKVKELQRVHQHRFHAVAMSA 453 ++ + K+ + R H + + + Sbjct: 439 ELRQPAPAIMRKLAGAKEALGLRVHGLVVGS 469 >UniRef50_A2SLM6 Uncharacterized protein containing a von Willebrand factor type A (VWA) domain n=2 Tax=Burkholderiales Genera incertae sedis RepID=A2SLM6_METPP Length = 493 Score = 192 bits (487), Expect = 3e-47, Method: Composition-based stats. Identities = 90/355 (25%), Positives = 145/355 (40%), Gaps = 30/355 (8%) Query: 149 EEEREQLLSEVQERMTL--------SGQLEPILADNNTAAGRLWDMSAGQLKRGDYQLIV 200 +E RE ++E+ L +L D A D G+L R ++Q Sbjct: 109 DEPRETAIAEMVAAFRAEWTLLHADWEHLLALLQDLGELAALQRDALRGRLARREWQAAQ 168 Query: 201 KYGEFLNEQPELKRLAEQLGRSREAKSIPRNDAQME-----TFRTMVREPATVPEQVDGL 255 + L P L L LGR ++ P+ + + P ++ G+ Sbjct: 169 QLAALLTRNPALVALIASLGRGLPREAPPQPAPTAPGRARVLGQLVETRLPDAPGEILGV 228 Query: 256 QQSDDILRLLPPELATLGITELEYEFYRRLVEKQLLTYRLHGESWREKV----IERPVVH 311 + ++ R+LP E A L L + RL E +L+ + + ++ R Sbjct: 229 RPGRNLARMLPSEAAQLRHPLLHKLWRARLAEARLMVWDEEAVLFDQRPGGATPLRAAAQ 288 Query: 312 KDYDEQPRGPFIVCVDTSGSMGGFNEQCAKAFCLALMRIALAENRRCYIMLFS--TEIVR 369 RGP +VC+DTSGSM G EQ AKA L R A E R C ++ F E++ Sbjct: 289 AAPPPLARGPMLVCIDTSGSMRGAPEQLAKAVVLQAARTAHRERRACQLIAFGGAGELLT 348 Query: 370 YELS-GPQGIEQAIRFLSQQFRGGTDLASCFRAIMERLQSREWFDADAVVISDFIAQRLP 428 +EL+ P G++ + F+ Q F GGTDLA+ + + S W AD +++SD P Sbjct: 349 HELALTPAGLDALLDFIGQAFDGGTDLAAPLAHAVAAVHSARWQQADLLLVSDGEFGCTP 408 Query: 429 DDVTSKVKELQRVHQHRFHAVAMSAHGKPGIMRIFDHIWRFDTGMRSRLLRRWRR 483 + + ++ H R V + G++ + D I +R WRR Sbjct: 409 ATLA-LLDGARQRHGLRVQGVLVGDRETMGLLEVCDAIH---------WVRDWRR 453 >UniRef50_A8ZLC2 Putative uncharacterized protein n=3 Tax=Acaryochloris marina MBIC11017 RepID=A8ZLC2_ACAM1 Length = 483 Score = 185 bits (470), Expect = 3e-45, Method: Composition-based stats. Identities = 85/410 (20%), Positives = 146/410 (35%), Gaps = 52/410 (12%) Query: 75 MCYQQSQLLSTPQFIVQLPQILDLLHRLNSPWAEQARQLVDANSTITSALHTLFLQRWRL 134 Y+Q + S Q I +L + + E Q + A S F Q+ Sbjct: 113 NLYRQLEEASDEQEIDELDTLRAQARQAKRDGREDLFQQLQAQGQAMSQAAQAFAQK--- 169 Query: 135 SLIVQATTLNQQLLEEEREQLLSEVQERMTLSGQLEPILADNNTAAGRLWDMSAGQLKRG 194 + + E E EVQE+ L G W +G Sbjct: 170 -----LEEKEGEGIGESVEDAEGEVQEKKDELEAL-----------GMSWGNESGDRNPT 213 Query: 195 DYQLIVKYGEFLNEQPELKRLAEQLGRSREAKSIPRNDAQMETFRTMVREPATVPEQVDG 254 +K + ++P+LK++ G + E + R Q ET ++ G Sbjct: 214 PTGEKLKLAALIEQRPQLKKILALAGNALETANRKRRQHQTETGY----------GELVG 263 Query: 255 LQQSDDILRLLPPELATLGITELEYEFYRRLVEKQLLTYRLHGESWREKVIERPVVHKDY 314 + +D+ ++LP ELA L + + FYR +E QL L Sbjct: 264 ITTGNDVSQILPQELARLSDSRQKLSFYRDFLEGQLFQNDLQ----------------AP 307 Query: 315 DEQPRGPFIVCVDTSGSM-GGFNEQCAKAFCLALMRIALAENRRCYIMLFSTEIV-RYEL 372 +E+ +GP ++C+D SGSM G +KA +AL+++A + R ++LF + Sbjct: 308 EEKGKGPMVICLDCSGSMVKGNRFLWSKALIVALVKLANEQERVVSLVLFESICYDPIYF 367 Query: 373 SGPQGIEQAIRFLSQQF-RGGTDLASCFRAIMERL-QSREWFDADAVVISDFIAQRLPDD 430 + I+Q IR L GGT+ + + Q ++ +AD V I+D IA L Sbjct: 368 HPREDIDQLIRLLVTSPTDGGTEFQRPLEQARDIIEQDEDYSEADIVFITDGIA-PLSSV 426 Query: 431 VTSKVKELQRVHQHRFHAVAMSA--HGKPGIMRIFDHIWRFDTGMRSRLL 478 + + + + + + + W D L Sbjct: 427 FLQEYSDSLEKLKTDLFLLEIEPKQGWSNELRTLASQSWVIDANGNFEEL 476 >UniRef50_D0LHL0 von Willebrand factor type A n=1 Tax=Haliangium ochraceum DSM 14365 RepID=D0LHL0_HALO1 Length = 509 Score = 185 bits (469), Expect = 3e-45, Method: Composition-based stats. Identities = 99/466 (21%), Positives = 174/466 (37%), Gaps = 60/466 (12%) Query: 26 LLASPQLAVFFEKFPRLKAAITDDVPRWREALRSRLKDARVPPELTEEVMCYQQSQLLST 85 + A+ +LA + RL A++ R+ L++ L ++ Sbjct: 61 VRAAEELAAAVQIHHRLVTAVSQARDLAALRARTELRENECAALLP---GLVERILTAMK 117 Query: 86 PQFIVQLPQILDLLHRLNSPWAEQARQLVDANSTITSALHTLFLQRWRLSLI---VQATT 142 F + ++L+ + + Q + F R L Sbjct: 118 RDFYIGPQELLEAAEVAHDE--DTLAQREAEREHLRELPEDAFDDDERERLEGDLDGEID 175 Query: 143 LNQQLLEEER-------EQLLSEVQERMTLSGQLEPILADNNTAAGRLWDMSAGQLKRGD 195 ++ ++E R +++ S++ + + + P + R + +G+ + Sbjct: 176 ALRERIDEARARQARVADKITSDLDDTIGRKVSVLPDQLEQGEDLRRSMGLGSGREGQVG 235 Query: 196 YQLIVKYGEFLNEQPELKRLAEQLGRSREAKSIPRNDAQMETFRTMVREPATVPEQVDGL 255 ++ GE L +LK LA+ +G RE R + T P+ + + Sbjct: 236 AAERLELGERLMRSRKLKLLAKLVGAFREVAFEARRRRVVRT-----------PQVMHEV 284 Query: 256 QQSDDILRLLPPELATL--GITELEYEFYRRLVEKQLLTYRLHGESWREKVIERPVVHKD 313 + + RLLP EL L L EF RRLVE +LL Y L G S Sbjct: 285 GRGAHLDRLLPSELLGLPRHRGALHREFVRRLVEGELLEYELRGAS-------------- 330 Query: 314 YDEQPRGPFIVCVDTSGSMGGFNEQCAKAFCLALMRIALAENRRCYIMLFSTEIVRYELS 373 RGP +VCVD SGSM G E AKA L L IA E RRC ++FS+ +E+ Sbjct: 331 ----SRGPMVVCVDGSGSMQGTKEIWAKAVALTLTEIARRERRRCLAIVFSSGHALFEVE 386 Query: 374 -----GPQGI------EQAIRFLSQQFRGGTDLASCFRAIMERLQSREWFDADAVVISDF 422 G + + + F GGTD R + + + D V I+D Sbjct: 387 LLGAKGRSNVRAPMLDDNVLAFAEHFPGGGTDFEPPMRRALAAVSEGNYRRGDIVFITDG 446 Query: 423 IAQRLPDDVTSKVKELQRVHQHRFHAVAMS--AHGKPGIMRIFDHI 466 AQ +++ + + + ++ H+ R + + + ++R D + Sbjct: 447 QAQV-SENLIADITKARKKHRFRVRGILVDVADSDRGSLLRFCDEV 491 >UniRef50_D0Z403 Putative uncharacterized protein n=1 Tax=Photobacterium damselae subsp. damselae CIP 102761 RepID=D0Z403_LISDA Length = 543 Score = 185 bits (469), Expect = 3e-45, Method: Composition-based stats. Identities = 83/378 (21%), Positives = 141/378 (37%), Gaps = 41/378 (10%) Query: 109 QARQLVDANSTITSALHTLFLQRWRLSLIVQATTL---NQQLLEEEREQLLSE------V 159 + + NS S L ++ Q L Q +Q LE +++ E + Sbjct: 184 RLWEQGQGNSISESELKQIYHQLDNTPLDSQTLDAIIHSQSELERYSKEIEDENLPASAL 243 Query: 160 QERMTLSGQLEPILADNNTAA----GRLWDMSAGQLKRGDYQLIV--KYGEFLNEQPELK 213 + QL + A G + + +LK D + I + L + Sbjct: 244 KNIDQYKTQLAKLKYAGEKATKLSKGLGYSFTYNELK--DIKDIDFSSLAQELASNQAIT 301 Query: 214 RLAEQLGRSREAKSIPRNDAQMETFRTMVREPATVPEQVDGLQQSDDILRLLPPELATLG 273 + LGR+ + + +V G +S DI R+LP +LA L Sbjct: 302 DIVTTLGRA-----------YISEKTNHKQVKRINTNEVYGTHKSADISRVLPSDLALLE 350 Query: 274 ITELEYEFYRRLVEKQLLTYRLHGESWREKVIERPVVHKDYDEQPRGPFIVCVDTSGSMG 333 +LEY FY +L+E L TY+L G + ++ E+ +GP + C+DTSGSM Sbjct: 351 NEDLEYLFYAKLLESNLSTYKLLGHHIDFEK-------ENDTEEDKGPIVTCLDTSGSMS 403 Query: 334 GFNEQCAKAFCLALMRIALAENRRCYIMLFSTEIVRYELS-GPQGIEQAIRFLSQQFRGG 392 G A+A LA+ I E R Y++LF + EL + F+ ++F GG Sbjct: 404 GIPILKARALLLAIHSIITKEKRELYVLLFGSRGQVKELYLSETSSSGLLPFICKEFSGG 463 Query: 393 TDLASCFRAIMERL-QSREWFDADAVVISDFIAQRLPDDVTSKVKELQRVHQHRFHAV-A 450 TD + + + + ++ AD ++I+D D+ + + V Sbjct: 464 TDFETPLKRAINIIEHKEKFNKADILMITDGECNV-SDNFQRMLVAKKSQLDFSVQTVIC 522 Query: 451 MSAHGKPG--IMRIFDHI 466 + D I Sbjct: 523 TGSFANTHQVTDGFSDRI 540 >UniRef50_C9RHJ4 von Willebrand factor type A n=1 Tax=Methanocaldococcus vulcanius M7 RepID=C9RHJ4_METVM Length = 383 Score = 182 bits (462), Expect = 2e-44, Method: Composition-based stats. Identities = 71/414 (17%), Positives = 154/414 (37%), Gaps = 64/414 (15%) Query: 68 PELTEEVMCYQQSQLLSTPQFIVQLPQILDLLHRLNSPWAEQARQLVDANSTITSALHTL 127 E E++ Y + + + I +++ + L + S I + Sbjct: 25 SEKDTEIIFYLFFKY--EVELLDDSEIIRKIINDRKFKHIKTITTLDENYSIIAT---EF 79 Query: 128 FLQRWRLSLIVQATTLNQQLLEEEREQLLSEVQERMTLSGQLEPILADNNTAAGRLWDMS 187 F ++++ +L E+ RE+ LS++ + + +E I + Sbjct: 80 FCEKFK------------ELKEKSREEDLSDIFDELESY--MENISYSLGC-----FGSG 120 Query: 188 AGQLKRGDYQLIVKYGEFLNEQPELKRLAEQLGRSREAKSIPRNDAQMETFRTMVREPAT 247 G D ++ E L + +LK + LG R + + R Sbjct: 121 CGYRSYTDPTKKLELAEKLLKNKKLKEFIKLLGTFRRI-----------SLKKAKRRIKH 169 Query: 248 VPEQVDGLQQSDDILRLLPPELATLGITELEYEFYRRLVEKQLLTYRLHGESWREKVIER 307 + + + LL E L + RR E +LL Y++ Sbjct: 170 FSGEKYSTTCGNSLTNLLSCEYKNFTDEMLFVDLLRRYNENKLLNYKI------------ 217 Query: 308 PVVHKDYDEQPRGPFIVCVDTSGSMGGFNEQCAKAFCLALMRIALAENRRCYIMLFSTEI 367 + + G F++C+D SGSM G E AKA L L+ +L +RC +++F + Sbjct: 218 -----LDNIKNHGDFVICLDLSGSMRGNKEIWAKAVSLCLIEASLKRGKRCVVIIFDDGV 272 Query: 368 VRYELSGPQ-GIEQAIRFLSQQFRGGTDLASCFRAIMERLQSREWFDADAVVISDFIAQR 426 ++ ++ + F S + GGT+ R ++ + D V I+D + Sbjct: 273 RETKIFEKNIHFKEVLDFASVFYGGGTNFEKPLREALKF-------NGDVVFITDGECE- 324 Query: 427 LPDDVTSKVKELQRVHQHRFHAVAMSAHGKPGIMRIFD---HIWRFDTGMRSRL 477 +P D+ +++K+ + + + +++ ++ + I D I+ ++ + ++ Sbjct: 325 IPLDMLNEIKKEKEKKEIKIYSLCINTKPTITLKNISDVVLTIYELNSKVAEQI 378 >UniRef50_D1XLN5 von Willebrand factor type A n=12 Tax=Actinomycetales RepID=D1XLN5_9ACTO Length = 538 Score = 182 bits (461), Expect = 3e-44, Method: Composition-based stats. Identities = 73/303 (24%), Positives = 111/303 (36%), Gaps = 47/303 (15%) Query: 181 GRLWDMSAGQLKRGDYQLIVKYGEFLNEQPELKRLAEQLGRSREAKSIPRNDAQMETFRT 240 R W ++ G+L+R + + E L L R AE +GR R+ R Sbjct: 245 MRAWGVAPGELERMPFDERARLAERLRT-GRLARWAELIGRFRQMADGERA--------- 294 Query: 241 MVREPATVPEQVDGLQQSDDILRLLPPELATLGITELEYEFYRRLVEKQLLTYRLHGESW 300 R ++ G+ DD+ R++P ELA LG+ L F R +L+ Y GE Sbjct: 295 --RRVENATGELVGVTLGDDLSRVIPSELANLGLPGLRAVFAARYAAGELMLYDTQGEQT 352 Query: 301 REKVIERPVVHKDYDEQPRGPFIVCVDTSGSM--GG----FNEQCAKAFCLALMRIALAE 354 +G + CVDTS SM G E AKA LAL+ A Sbjct: 353 T----------------GKGAVVACVDTSHSMYEAGPGGVTREAWAKACALALLDQARHG 396 Query: 355 NRRCYIMLFST----EIVRYELSGPQGIEQAIRFLSQQFRGGTDLASCFRAIMERLQSR- 409 R +LFS ++ R+ P + + F GGT + A + L+ Sbjct: 397 GRDFVGILFSAADKLQVFRFPAGRPADTARVLDFAETFLGGGTSYQTPLTAAADLLEEEF 456 Query: 410 ---EWFDADAVVISDFIAQRLPDDVTSKVKELQRVHQHRFHAVAMSAHGKPG----IMRI 462 D V+I+D ++ +R R VA+ A G + + Sbjct: 457 DATARTRGDIVMITDDECGV-TEEWMRGWIGAKRRLDFRVFGVAVGAPLAAGTGSVLEAL 515 Query: 463 FDH 465 D+ Sbjct: 516 CDN 518 >UniRef50_D1PED6 Putative uncharacterized protein n=1 Tax=Prevotella copri DSM 18205 RepID=D1PED6_9BACT Length = 549 Score = 178 bits (451), Expect = 4e-43, Method: Composition-based stats. Identities = 70/425 (16%), Positives = 143/425 (33%), Gaps = 49/425 (11%) Query: 92 LPQILDLLHRLNSPWAEQARQLVDANSTITSALHTLFLQRWRLSLIVQATTLNQQLLEEE 151 L Q+ + L+ + ++ W++ + Q + + Sbjct: 135 LDQLEQEHAEDGFDKPFFLK-LMTGDGASKPENWERLVRDWKVCIDHQILNKLKDFISLR 193 Query: 152 REQLLSEVQERMTLSGQLEPILADNNTAAGRLWDMSAGQLKRGDYQLIVKYGEFLNEQPE 211 + + + M + + A + W++ +++ + + ++ PE Sbjct: 194 QNNFETGLVRMMDQITRNMKTKGVSEQRAVQAWELMTNGWTETEFERRLNQVKIQDKYPE 253 Query: 212 LKRLAEQLGRSREAKSIPRNDAQMETFRTMVREPATVPEQVDGLQQSDDILRLLPPELAT 271 +K + ++GR +A R M + ++G+ DD+ LLP ELA Sbjct: 254 IKEIVAKMGRVADANGKDRLTIASGVEMKME---HSAGSDIEGITVGDDLNSLLPLELAQ 310 Query: 272 LGITELEYEFYRRLVEKQLLTYRLHGESWREKVIERPVVHKDYDEQPRGPFIVCVDTSGS 331 ++E F + ++L T+R E + + +GP IVC+DTS S Sbjct: 311 YSDEDMEGLFIYKYRTRRLQTFRYKSE-----MSKPSRKLGFTHASRKGPMIVCLDTSAS 365 Query: 332 MGGFNEQCAKAFCLALMRIALAENRRCYIMLFSTEIVRYELSGPQGIEQAIRF------- 384 M G E+ + L A R C+++ FS +L + E+ R Sbjct: 366 MYGTPERISSTLISLLEETAEDLERDCFLIDFSVSTRAIDLMAKRKAERLKRIGITMMES 425 Query: 385 -------------------LSQQF--------RGGTDLASCFRAIMERLQSRE--WFDAD 415 + +Q GGT + + L + + +AD Sbjct: 426 AEADASPSDGDGQAHTGRGIRRQPTTTHLPFIGGGTSAKKMMTQMFDLLDNDGLHYVNAD 485 Query: 416 AVVISDFIAQRLPDDVTSKVKELQRVHQHRFHAVAMSAHGK---PGIMRIFDHIWRFDTG 472 + I+DF+ P + S+ +E + RF+ + + F+ I+ Sbjct: 486 VLWITDFLIPDPPQQLLSRFREY-KETGTRFYGIRIVRDDDKEPNSWKEYFNQIYTIRYR 544 Query: 473 MRSRL 477 R Sbjct: 545 PLRRY 549 >UniRef50_Q60384 Uncharacterized protein MJ0077 n=3 Tax=Methanocaldococcus RepID=Y077_METJA Length = 382 Score = 176 bits (446), Expect = 2e-42, Method: Composition-based stats. Identities = 69/395 (17%), Positives = 141/395 (35%), Gaps = 61/395 (15%) Query: 71 TEEVMCYQQSQLLSTPQFIVQLPQILDLLHRLNSPWAEQARQLVDANSTITSALHTLFLQ 130 E++ Y + + + + I ++ + L + S I + F + Sbjct: 27 ETEIVFYLFFKY--EVEILTETDLIKKIVRDRRFKNVKSITTLDENYSLIAT---EFFCE 81 Query: 131 RWRLSLIVQATTLNQQLLEEEREQLLSEVQERMTLSGQLEPILADNNTAAGRLWDMSAGQ 190 + L ++ EE+ +LL E++ M G Sbjct: 82 K--------LKELKEKGREEDISELLDELESYMENITSSFSSFGSGE-----------GY 122 Query: 191 LKRGDYQLIVKYGEFLNEQPELKRLAEQLGRSREAKSIPRNDAQMETFRTMVREPATVPE 250 D + ++ E L + +LK + LG+ + + Sbjct: 123 KSYTDPKKKLELTEKLLKNNKLKEFMKVLGKFKRMAIKKYKT-----------KIKHFSG 171 Query: 251 QVDGLQQSDDILRLLPPELATLGITELEYEFYRRLVEKQLLTYRLHGESWREKVIERPVV 310 + + ++++ LL E L + RR E + L Y++ Sbjct: 172 EKYSINLGNNLINLLSSEYKNFAEEILFVDLLRRYNENKPLNYKI--------------- 216 Query: 311 HKDYDEQPRGPFIVCVDTSGSMGGFNEQCAKAFCLALMRIALAENRRCYIMLFSTEIVRY 370 + + G F+VC+D SGSM G E AKA L LM I+L N+R +LF + Sbjct: 217 --LENNENCGDFVVCLDLSGSMRGNKEIWAKAIALCLMDISLKRNKRYISILFDDGVRDI 274 Query: 371 ELSGPQ-GIEQAIRFLSQQFRGGTDLASCFRAIMERLQSREWFDADAVVISDFIAQRLPD 429 ++ + ++ + F S + GGT+ R ++ + D V I+D + + Sbjct: 275 KIYEKKVSFDEILEFASVFYGGGTNFEKPLREALKF-------NGDIVFITDGECEVSLE 327 Query: 430 DVTSKVKELQRVHQHRFHAVAMSAHGKPGIMRIFD 464 K+KE ++ + + +++ ++ + +I D Sbjct: 328 -FLEKIKEEKQRRKIKIYSICINTKPTVSLRQISD 361 >UniRef50_B1L0Y8 von Willebrand factor type A domain protein n=10 Tax=Clostridium RepID=B1L0Y8_CLOBM Length = 578 Score = 175 bits (444), Expect = 3e-42, Method: Composition-based stats. Identities = 75/394 (19%), Positives = 159/394 (40%), Gaps = 45/394 (11%) Query: 86 PQFIVQLPQILDLLHRLNSPWAEQARQLVDANSTITSALHTLFLQRWRLSLIVQATTLNQ 145 +V++ + D + L S ++ + ++ TS L ++ + + + + Sbjct: 186 SDLLVEMQEAEDRIKDL-SQEKQELEENIENLKNNTSDLSEEDMKNKIEQIDEELENMEK 244 Query: 146 Q---LLEEEREQLLSEVQERMTLSGQLEPILADNNTA------AGRLWDMS--AGQLKRG 194 Q L EE ++L ++ LS ++ + + W + + + Sbjct: 245 QADNLEEELSDKLEKSEEDIENLSKEMAEAFNEAEKEVREATSYVKDWGLGDKPNKSSKI 304 Query: 195 DYQLIVKYGEFLNEQPELKRLAEQLGRSREAKSIPRNDAQMETFRTMVREPATVPEQVDG 254 + V+ E + + +LK L++ +GR +E+ R + + Sbjct: 305 SFSDKVEALERIRKSKKLKELSDIIGRFKESA-----------LRDQRNKHKDGAVAIKS 353 Query: 255 LQQSDDILRLLPPELATLGITELEYEFYRRLVEKQLLTYRLHGESWREKVIERPVVHKDY 314 ++ +DI+ LP E L + EFYR+ +KQLL Y L + Sbjct: 354 VRIGNDIIHTLPSEKMLLINETTKKEFYRKFNQKQLLQYELESDKL-------------- 399 Query: 315 DEQPRGPFIVCVDTSGSMGGFNEQCAKAFCLALMRIALAENRRCYIMLFSTEIVRYEL-- 372 + +GP ++C+D S SM G E+ +KA +AL+ IA + R +LF+ + + Sbjct: 400 --KAKGPMVICIDMSSSMKGIKEKWSKAVAIALLEIAQQQKRNFAAILFNEDATEPIIIE 457 Query: 373 SGPQGIEQAIRFLSQQFRGGTDLASCFRAIMERLQSREWFDADAVVISDFIAQRLPDDVT 432 + E+ + + GGT + + +E ++ ++ AD V I+D + PD Sbjct: 458 KDKKEPEKILDIAERFDGGGTLFETPLQKALEVIEQSKFKKADIVFITDGHSYTHPD-FI 516 Query: 433 SKVKELQRVHQHRFHAVAM---SAHGKPGIMRIF 463 +K +L+ + + +V + G +++F Sbjct: 517 NKFNKLKDEKEFKVLSVLIYAGGKIGNIESLQLF 550 >UniRef50_Q8EW10 Putative uncharacterized protein MYPE3970 n=1 Tax=Mycoplasma penetrans RepID=Q8EW10_MYCPE Length = 488 Score = 175 bits (443), Expect = 4e-42, Method: Composition-based stats. Identities = 88/405 (21%), Positives = 174/405 (42%), Gaps = 22/405 (5%) Query: 76 CYQQSQLLSTPQFIVQLPQILDLLHRL-----NSPWAEQARQLVDANSTITSALHTLFLQ 130 Q + + + L++ +P ++ + LQ Sbjct: 4 LKQANIITELVNLLENSQTTALKLYKFVNGEGTNPTKQEFEKFRKELINSIKRKKENALQ 63 Query: 131 RWRLSLIVQATTLNQQLLEEEREQLLSEVQERMTL---SGQLEPILADNNTAAGRLWDMS 187 + + ++ L+++ L+ + E++ + LE + LWD+S Sbjct: 64 LFPIIKYFESNRLSEEDLQNDGTLFTVPSDEKIAIASPIEVLEEFPQPEDYDLSILWDLS 123 Query: 188 AGQLKRGD-YQLIVKYGEFLNEQPELKRLAEQLGRSREAKSIPRNDAQMETFRTMVREPA 246 + + + ++ + Y + + LK+ +G + + + Sbjct: 124 SIEERETQLFKAVENYFNLVKDDENLKKFIRMIGTFMQENLEIEKNEK-------ELFLE 176 Query: 247 TVPEQVDGLQQSDDILRLLPPELATLGITELEYEFYRRLVEKQLLTYRLHGESWREKVIE 306 +P++ L QS D+ L+P E+A L ELE F + +E++LLTY+L G E+ I+ Sbjct: 177 NIPQETFALYQSSDLNNLIPNEIAQLDDPELEIIFLKNFIEQKLLTYQLWG---IEREIQ 233 Query: 307 RPVVHKDYDEQPRGPFIVCVDTSGSMGGFNEQCAKAFCLALMRIALAENRRCYIMLFSTE 366 V K D RGP +C+DTSGSM E +KA L +R + + FS E Sbjct: 234 EEWVIKQRDIGERGPLFICLDTSGSMRNMKEVLSKALTLVFVRELEKMDINVVFIPFSME 293 Query: 367 IVRYELSGPQGIEQAIRF-LSQQFRGGTDLASCFRAIMERLQSREWFDADAVVISDFIAQ 425 Y+L + ++++ L + F GG+D+ I + +++ A+ +++SDFI + Sbjct: 294 AKFYDLYDSKFKLKSVKMNLRKSFYGGSDIEKLVDLIDSVIYKKKYERANILIMSDFIFK 353 Query: 426 RLPDDVTSKVKELQRVHQHRFHAVAMSAH-GKPGIMRIFDHIWRF 469 +LP +K+K+L+ + H+ H++ +S K + IF+ WR+ Sbjct: 354 KLPKKAVNKLKKLKH-NGHKLHSLTISDQIYKNNLFDIFNTNWRY 397 >UniRef50_B8C4H1 Predicted protein n=1 Tax=Thalassiosira pseudonana RepID=B8C4H1_THAPS Length = 1141 Score = 169 bits (427), Expect = 3e-40, Method: Composition-based stats. Identities = 77/372 (20%), Positives = 130/372 (34%), Gaps = 58/372 (15%) Query: 139 QATTLNQQLLEEEREQLLSEVQER----MTLSGQLEPIL----ADNNTAAGRLWDMSAGQ 190 + L+ LE+ E L + E + L+ + + + + G Sbjct: 163 EYEPLSADELEQLAESLTGTLSEEWGGVVQGVSLLDKVFGYDHNLLDLKGDDGFGLQDGI 222 Query: 191 LKRGDYQLIVKYGEFLNEQPELKRLAEQLGRSREAKSI------PRNDAQMETFRTMVRE 244 + +Q I L+ P+LK L +LG+ AK PR + V Sbjct: 223 WQHNGWQPIPDLQRRLSMMPKLKDLLARLGQRPSAKGKDVRKFRPRKRSNSRDDMMGVEI 282 Query: 245 PATVPEQVDGLQQSDDILRLLPPELATLGIT--ELEYEFYRRLVEKQLLTYRLHGESWRE 302 P V GL +S + +LP E L + L + F + E +LL Sbjct: 283 DPLDPTSVSGLTRSGSLTTMLPSEAVLLRSSMKSLRWLFLAKKAESKLLV---------- 332 Query: 303 KVIERPVVHKDYDEQPRGPFIVCVDTSGSMGGFNEQCAKAFCLALMRIALAENRRCYIML 362 GP I+C+DTS SM G E AKA LA + A ++ R C ++ Sbjct: 333 ----------SLPSASGGPLIICLDTSWSMSGARESLAKAVVLASVSAANSQGRECRVVS 382 Query: 363 FSTEIVRYELS----GPQGIEQAIRFLSQQFRGGTDLASCFRAIMERLQ----------- 407 FS+ E G+ + + FLS F GGTD+ + + Sbjct: 383 FSSANNAVESGSIKCDSDGVRKLLDFLSYSFGGGTDVTGALKYALILAAHLLIAFHIYLP 442 Query: 408 -----SREWFDADAVVISDFIAQRLP--DDVTSKVKELQRVHQHRFHAVAMSAHGKPGIM 460 + +D +++SD P + V +K++ L+ H + + P + Sbjct: 443 KMETLETDLASSDLLLVSDGEIPNPPVSNVVFAKLEALRLQTGMEIHGLLVGKRESPALS 502 Query: 461 RIFDHIWRFDTG 472 + + F Sbjct: 503 SLCTEVHDFLVD 514 >UniRef50_A3DPE5 von Willebrand factor, type A n=2 Tax=Desulfurococcaceae RepID=A3DPE5_STAMF Length = 443 Score = 166 bits (419), Expect = 2e-39, Method: Composition-based stats. Identities = 69/390 (17%), Positives = 135/390 (34%), Gaps = 51/390 (13%) Query: 102 LNSPWAEQARQLVDANSTITSALHTLFLQRWRLSLIVQATTLNQQLLEEEREQLLSEVQE 161 L S + R +S ++S ++F+ ++ ++ E+E E Sbjct: 92 LKSSFIHDIRSKTVVDSLMSSIAASIFISEFKQLENERSFGNATSNRRGEQEGREDEKAI 151 Query: 162 RMTLSGQLEPILADNNTAA---GRLWDMSAGQLKRGDYQLIVKYGEFLNEQPELKRLAEQ 218 R + + + D A + G + Y+ L E++++ E Sbjct: 152 RRNVEKAIANTMRDVENAKKLRMLIEGERPGTVSIMAYEEYGPELIRLARNVEVRKILEI 211 Query: 219 LGRSREAKSIPRNDAQMETFRTMVREPATVPEQVDGLQQSDDILRLLPPELATLGITELE 278 L + R+ ++ G + DI R++P LA Sbjct: 212 LAGIKP-----------WNINIPERKQRFKHGELMGYELGKDIERIVPSALALPDE---- 256 Query: 279 YEFYRRLVEKQLLTYRLHGESWREKVIERPVVHKDYDEQPRGPFIVCVDTSGSMGGFNEQ 338 FY R +E +LL Y+ Q +GP V +D SGSM G Sbjct: 257 -LFYLRFLENRLLLYQK------------------MLSQGKGPLYVLLDKSGSMDGIKMT 297 Query: 339 CAKAFCLALMRIALAENRRCYIMLFSTE----IVRYELSGPQGIEQAIRFL-SQQFRGGT 393 AKA L+L A+ E+R Y F + + + I ++ + GGT Sbjct: 298 WAKAVALSLYMRAVREHREFYFRFFDSIPYPLAKISRRPRASNVLKLIDYIARVRGSGGT 357 Query: 394 DLASCFRAIMERLQSREWFD-ADAVVISDFIAQRLPDDVTSKVKELQRVHQHRFHAVAMS 452 D++ +++ + +D ++I+D + + VT +++ R +V + Sbjct: 358 DISKAIITACNDIRTGSVRETSDIIIITDGVDRIAEQLVTYNLRKA----NARLISVMI- 412 Query: 453 AHGKPGIMRIFDHIW---RFDTGMRSRLLR 479 + I + RF+T +++ Sbjct: 413 MGDNKSLKNISVKYFTVSRFNTKNIIQIVE 442 >UniRef50_A7KV72 Putative metalloprotein chaperonin subunit n=1 Tax=Bacillus phage 0305phi8-36 RepID=A7KV72_9CAUD Length = 553 Score = 163 bits (413), Expect = 1e-38, Method: Composition-based stats. Identities = 77/466 (16%), Positives = 148/466 (31%), Gaps = 60/466 (12%) Query: 16 EGLIEEMIIALLASPQLAVFFEKFPRLKAAITDDVPRWREALRSRLKDARVPPELTEEVM 75 L +M+ L+ S Q K + + A+ + + + ++ Sbjct: 96 SRLNHQMMEGLMESEQYDQL---RKNTKFDVMNS------AIGTEVMQNQAMEKIQYFKQ 146 Query: 76 CYQQSQLLSTPQFIVQLPQILDLLHRLNSPWAEQARQLVDANSTITSALHTLFLQRWRLS 135 Y Q Q Q +++D L++ + +L + L R + Sbjct: 147 QYIQQQQTGEKQDGGDAGELIDQLNKARDA-QNRVDELNEKGGPGGKGLTQ------REA 199 Query: 136 LIVQATTLNQQLLEEE-------REQLLSEVQERMTLSGQLEPILADNNTAAGRLWDMSA 188 + Q LE+E +E++ +++ M + + W + Sbjct: 200 EELARLQQQIQDLEDEIDLNKSGQEEMKQGMEQAMEQASKKAFEEVREVRDTMESWGLDG 259 Query: 189 GQLK-RGDYQLIVKYGEFLNEQPELKRLAEQLGRSREAKSIPRNDAQMETFRTMVREPAT 247 R K E + P L L + +GR + + + Sbjct: 260 TSSTMRISIDRRKKAIERIRRSPRLNNLTDLVGRMKAIALQKKTQRTPD----------- 308 Query: 248 VPEQVDGLQQSDDILRLLPPELATLGITELEYEFYRRLVEKQLLTYRLHGESWREKVIER 307 V ++ +D+ R+ P L L +F + EKQL Y+ G Sbjct: 309 -GHSVRTIETGNDLSRVTPTSLMKLASPATRNQFMKEFSEKQLQLYKKDG---------- 357 Query: 308 PVVHKDYDEQPRGPFIVCVDTSGSMGGFNEQCAKAFCLALMRIALAENRRCYIMLFSTEI 367 + RGP I+ D SGSM G + + A LA++ +A E R + + +I Sbjct: 358 ------IKKVGRGPIIIDHDKSGSMRGNKDDWSTALTLAMLEVAQKEKRNFGYIPYQHQI 411 Query: 368 VRYELSG----PQGIEQAIRFLSQQFRGGTDLASCFRAIMERLQSREWFDADAVVISDFI 423 V + + + GGT + L+S + D V I+D Sbjct: 412 VASHVKNIPAGELDPDDIMDIAELDSSGGTTFMPVLDESIRCLESDRYKKGDIVFITDGD 471 Query: 424 AQRLPDDVTSKVKELQRVHQHRFHAVAM---SAHGKPGIMRIFDHI 466 + D+ + K+ + Q V + + + + D I Sbjct: 472 CG-ITDEWLKEFKKKKEQLQFNVLTVLINLDGGASRATVEKFSDQI 516 >UniRef50_Q2IEM5 VWA containing CoxE-like n=2 Tax=Anaeromyxobacter dehalogenans RepID=Q2IEM5_ANADE Length = 430 Score = 162 bits (409), Expect = 3e-38, Method: Composition-based stats. Identities = 61/291 (20%), Positives = 102/291 (35%), Gaps = 34/291 (11%) Query: 179 AAGRLWDMSAGQLKRGDYQLIVKYGEFLNEQPELKRLAEQLGRSREAKSIPRNDAQMETF 238 G L + + L L+R+A GR + + R Sbjct: 153 RGGLLPGTGTADGVPREQGAVRSLAARLKGDERLRRIAALAGRFKRIAAAKRRHRVKH-- 210 Query: 239 RTMVREPATVPEQVDGLQQSDDILRLLPPELATLGITELEYEFYRRLVEKQLLTYRLHGE 298 ++V ++Q D+ R LP ELA L L +F R L+E + L YRL G Sbjct: 211 ---------GADEVTDVEQGADLGRALPVELAKLSHRLLRLDFLRALLEGRSLQYRLEGT 261 Query: 299 SWREKVIERPVVHKDYDEQPRGPFIVCVDTSGSMGGFNEQCAKAFCLALMRIALAENRRC 358 + +GP +V +D SGSM G + A A LAL+ A E R Sbjct: 262 ATL----------------GKGPLVVLLDKSGSMDGPRDVWATAVALALLDQAQRERRTF 305 Query: 359 YIMLFSTEIVRYELSGPQGIEQAIRFLSQQFRGGTDLASCFRAIMERLQSREWF--DADA 416 ++ F + + P L GGT++A+ R +E +++ AD Sbjct: 306 ALLGFDARVKFEAVVKPSEALP-EDGLFVSCCGGTEIAAAVRRGLEIIRTHPGALGKADL 364 Query: 417 VVISDFIAQRLPDDVTSKVKELQRVHQHRFHAVAMSAHGKPGIMRIFDHIW 467 V+++D +E + + + ++ D + Sbjct: 365 VLVTDG---GSDASEAGAFRESAAALGVTILGLGIGV-EREWLVPWCDEVH 411 >UniRef50_C3NM85 von Willebrand factor type A n=14 Tax=Sulfolobaceae RepID=C3NM85_SULIN Length = 452 Score = 156 bits (394), Expect = 2e-36, Method: Composition-based stats. Identities = 74/398 (18%), Positives = 143/398 (35%), Gaps = 60/398 (15%) Query: 78 QQSQLLSTPQFIVQLPQILDLLHRLNSPWAEQARQLVDANSTITSALHTLFLQRWRLSLI 137 + S + S + + + +L+ L ++ Q ++ L+ L Sbjct: 89 EYSIVNSAVSLALTVSYVQNLIEELER--IKKTSQSMEEREAAEEILNGLMKGSSSKEGK 146 Query: 138 VQATTLNQ---QLLEEEREQLLSEVQERMTLSGQLEPILADNNTAAGRLWDMSAGQLKRG 194 Q T Q ++L + E+ +S+ E ++ I+ N G + Sbjct: 147 EQKNTNQQSMEKVLRQAHEKAMSKAIEDANSVRNMQKIVGGNGAGTGSVLTFEG------ 200 Query: 195 DYQLIVKYGEFLNEQPELKRLAEQLGRSREAKSIPRNDAQMETFRTMVREPATVPEQVDG 254 + +++ E+K++ E L + SI + R ++ G Sbjct: 201 EIHEVLRLA----RNTEIKKILEFLSGIPKLGSITK-----------RRTTRFSKGELYG 245 Query: 255 LQQSDDILRLLPPELATLGITELEYEFYRRLVEKQLLTYRLHGESWREKVIERPVVHKDY 314 ++ DI R+L ELA + FY +L E QLL Y+ Sbjct: 246 YEEGSDIERILYSELALP-----DMLFYLKLAEGQLLLYQKQ------------------ 282 Query: 315 DEQPRGPFIVCVDTSGSMGGFNEQCAKAFCLALMRIALAENRRCYIMLFSTE----IVRY 370 + GP + +D SGSM G AKA LAL A ENR Y+ F I Sbjct: 283 IRETLGPIYLLLDKSGSMDGEKILWAKAVALALYSRAKRENRDFYLRFFDNIPYPLIKVQ 342 Query: 371 ELSGPQGIEQAIRFL-SQQFRGGTDLASCFRAIMERLQSREWFD-ADAVVISDFIAQRLP 428 + + + I + + ++ + GGTD++ + E ++ ++ ++++D + Sbjct: 343 KNAKSKDIIKMVEYIGKIRGGGGTDISRSIISACEDIKEGHVKGVSEIILLTDGEDKIAE 402 Query: 429 DDVTSKVKELQRVHQHRFHAVAMSAHGKPGIMRIFDHI 466 V +KE + +V + + R+ D Sbjct: 403 TTVRRSLKEA----NSQLISVMI-RGDNADLRRVSDEY 435 >UniRef50_D2RGP5 von Willebrand factor type A n=1 Tax=Archaeoglobus profundus DSM 5631 RepID=D2RGP5_ARCPR Length = 430 Score = 155 bits (392), Expect = 3e-36, Method: Composition-based stats. Identities = 72/429 (16%), Positives = 146/429 (34%), Gaps = 56/429 (13%) Query: 56 ALRSRLKDARVPPELTEEVMCYQQSQLLSTPQFIVQLPQILDLLHRLNSPWAEQARQLVD 115 L ++ + +P +L + +Q QL++T + + L + E+AR+ + Sbjct: 16 QLIAQRIEGYLPKDLQQVYRTCEQIQLIATDCYFLHYSLYPFLRSKNEDDVLEEARKFLQ 75 Query: 116 --------ANSTITSALHTLFLQRWRLSLIVQATTLNQQLLEEEREQLLSEVQERMTLSG 167 + + L + ++L L+ E E ++ + Sbjct: 76 DYISSDRYQKIKMLTTLDDEMSLAYSIALAKAVIGKVLGLIRLEHENPFDNLKAYV--VW 133 Query: 168 QLEPILADNNTAAGRLWDMSAGQ---LKRGDYQLIVKYGEFLNEQPELKRLAEQLGRSRE 224 + ++ A + + D + ++ E+ K++ + Sbjct: 134 AMRAMVEGGEIPAILDYATEYAEEMVRNANDVRELIGGKRAGKEEGTFKKVLDLAEHMLY 193 Query: 225 AKSIPRNDAQMETFRTMV-------REPATVPEQVDGLQQSDDILRLLPPELATLGITEL 277 K + + + + + ++ E++ G + I R L ELA Sbjct: 194 VKFMRDIVSFSKKLFSHIPKATYILKKRGRFGEELSGYSLTKRIDRALVRELALP----- 248 Query: 278 EYEFYRRLVEKQLLTYRLHGESWREKVIERPVVHKDYDEQPRGPFIVCVDTSGSMGGFNE 337 E F ++ + LT K+ G + + VD SGSM G Sbjct: 249 EELFLKKFSGEGFLT-------------------KEKLSIAEGAYYILVDKSGSMVGEKT 289 Query: 338 QCAKAFCLALMRIALAENRRCYIMLFSTEIVRYELSGPQGIEQAIRFLSQQFRGGTDLAS 397 A++ LA+ R+A + RR ++ F + LS P + AI L + GGTD+ + Sbjct: 290 VWARSVALAIYRMASLKRRRYFLRFFDKKTHHL-LSDPHEVVDAI--LKVKSNGGTDITN 346 Query: 398 CFRAIMERLQSREWFD--ADAVVISDFIAQRLPDDVTSKVKELQRVHQHRFHAVAMSAHG 455 R ++ L R D V+I+D DV + + + +V + Sbjct: 347 ALRTAVKDLVERGLSDLTNTIVIITDGE------DVVEDLSKDLKKANANLISVMI-QGE 399 Query: 456 KPGIMRIFD 464 + I D Sbjct: 400 NETLKSISD 408 >UniRef50_C8SZ21 Protein viaA (VWA domain protein interacting with AAA ATPase) n=7 Tax=Enterobacteriaceae RepID=C8SZ21_KLEPR Length = 184 Score = 154 bits (388), Expect = 1e-35, Method: Composition-based stats. Identities = 132/175 (75%), Positives = 154/175 (88%) Query: 2 LTLDTLNVMLAVSEEGLIEEMIIALLASPQLAVFFEKFPRLKAAITDDVPRWREALRSRL 61 +TLD LNVMLAVSEEG+IEEM++ALLASPQLAVFFEKFPRLK I D+PRWREA+R+RL Sbjct: 1 MTLDMLNVMLAVSEEGMIEEMLLALLASPQLAVFFEKFPRLKNIIAADIPRWREAVRARL 60 Query: 62 KDARVPPELTEEVMCYQQSQLLSTPQFIVQLPQILDLLHRLNSPWAEQARQLVDANSTIT 121 K+ +PP+L EV YQQ+QLLST QFIVQLPQIL LH+L SP+A QA++LVD N+T T Sbjct: 61 KEVNIPPDLDAEVQTYQQAQLLSTSQFIVQLPQILGKLHQLQSPFAAQAQKLVDDNATFT 120 Query: 122 SALHTLFLQRWRLSLIVQATTLNQQLLEEEREQLLSEVQERMTLSGQLEPILADN 176 ALHTLFLQRWRLSL+VQAT+LNQQLL+EER+QLLSEVQERMTLSGQL+P+LA Sbjct: 121 PALHTLFLQRWRLSLVVQATSLNQQLLDEERDQLLSEVQERMTLSGQLDPVLAKM 175 >UniRef50_A2BM85 Conserved archaeal protein n=1 Tax=Hyperthermus butylicus DSM 5456 RepID=A2BM85_HYPBU Length = 439 Score = 150 bits (378), Expect = 1e-34, Method: Composition-based stats. Identities = 68/357 (19%), Positives = 121/357 (33%), Gaps = 49/357 (13%) Query: 118 STITSALHTLFLQRWRLSLIVQATTLNQQLLEEEREQLLSEVQERMTLSGQLE---PILA 174 S + SA+ L + L + + EQ L+ R ++ LE + Sbjct: 101 SMVASAIFLEALLKELNKLPRPQGGETRSKSAADAEQGLNAENIRESVRKALETARDVAQ 160 Query: 175 DNNTAAGRLWDMSAGQLKRGDYQLIVKYGEFLNEQPELKRLAEQLGRSREAKSIPRNDAQ 234 +AG +++ L ++K L E L + Sbjct: 161 QAKELTNLAMRFTAGNASMLSLDDVIQDVINLARNTDVKVLLEAL-----------KTIE 209 Query: 235 METFRTMVREPATVPEQVDGLQQSDDILRLLPPELATLGITELEYEFYRRLVEKQLLTYR 294 R+ + ++DG + DI R++ ELA F + E+ LL Y+ Sbjct: 210 STEAYIRTRKIRSPRGELDGYELGSDIERVVASELALPTD-----LFLLKFAERNLLLYK 264 Query: 295 LHGESWREKVIERPVVHKDYDEQPRGPFIVCVDTSGSMGGFNEQCAKAFCLALMRIALAE 354 + G F V +D SGSM G AKA LAL + A+ E Sbjct: 265 K------------------VVSEEYGKFYVLLDKSGSMMGMKIIWAKAVALALAQRAIRE 306 Query: 355 NRRCYIMLFSTEIVRY-----ELSGPQGIEQAIRFLSQQFRGGTDLASCFRAIMERLQSR 409 R YI F + + G ++ + GGTD+ ++ + ++ Sbjct: 307 KREFYIRFFDSIPYPPLYIPKRVHGRDVVKLLEYVARIRANGGTDITRAILTAVDDIATK 366 Query: 410 --EWFDADAVVISDFIAQRLPDDVTSKVKELQRVHQHRFHAVAMSAHGKPGIMRIFD 464 +D ++I+D + D + + ++ R H V + + P + I D Sbjct: 367 LQRSKVSDIILITDGEDKIAIDTIRRSLNKV----NARLHTVMI-SGNNPDLRAISD 418 >UniRef50_A7VY69 Putative uncharacterized protein n=1 Tax=Clostridium leptum DSM 753 RepID=A7VY69_9CLOT Length = 515 Score = 146 bits (367), Expect = 3e-33, Method: Composition-based stats. Identities = 79/405 (19%), Positives = 136/405 (33%), Gaps = 54/405 (13%) Query: 68 PELTEEVMCYQQSQLLSTPQFIVQLPQILDLLHRLNSPWAEQARQLVDANSTITSALHTL 127 P + + Y T Q + I D + ++ L Sbjct: 140 PAIDTPELRYLDVIEQLTMQAKKAIQAIYD-------------SRNSQKPASAKKLLFLY 186 Query: 128 FLQRWRLSLIVQATTLNQQLLEEEREQLLSEVQERMTLSGQLEPILADNNTAAGRLWDMS 187 +LS I T Q+ E L + + L W S Sbjct: 187 NRADRKLSQIECLTAKIQKAAVRWAEALEPVIDTALRA--ALHEAFKT--HTIMEAWGSS 242 Query: 188 AGQLKRGDYQLIVKYGEFLNEQPELKRLAEQLGRSREAKSIPRNDAQMETFRTMVREPAT 247 + + + ++ +L+ +A LGR RE + R ++ + Sbjct: 243 --NEEMRNIPMNQTLLNYVKNSKQLQEIARLLGRYRELIADKRKNSY-----------SY 289 Query: 248 VPEQVDGLQQSDDILRLLPPELATLGITELEYEFYRRLVEKQLLTYRLHGESWREKVIER 307 + L +DI L ELA LG+ E E F RR +K+L+ YR Sbjct: 290 GRGEKYDLTTGNDITNCLSSELALLGMAETEILFMRRYEQKRLMQYRKR----------- 338 Query: 308 PVVHKDYDEQPRGPFIVCVDTSGSMGGFNEQCAKAFCLALMRIALAENRRCYIMLF-STE 366 + RG IV +D SGS AKA LAL+ IA + R+ ++ F S + Sbjct: 339 -----TAVVKGRGDMIVLIDESGSTRSVA-GWAKALALALLDIASRDGRKFAMVHFASAD 392 Query: 367 IVRYELSGPQGI--EQAIRFLSQQFRGGTDLASCFRAIMERLQSREWFDADAVVISDFIA 424 +R +L P E ++ Q F GGT+ + + + + + +AD +I+D Sbjct: 393 RIRTDLFEPGHYTPEDVMKAAEQFFGGGTNFEAPLKEALRLM-ENGYENADITIITDGEC 451 Query: 425 QRLPDDVTSKVKELQRVHQHRFHAVAM--SAHGKPGIMRIFDHIW 467 L + T + + + + + + D I+ Sbjct: 452 S-LSEIFTEEFHKKTAACKATVTGILLDKGGTCGKSLEPFCDKIY 495 >UniRef50_B9XQ17 Vault protein inter-alpha-trypsin domain protein n=1 Tax=bacterium Ellin514 RepID=B9XQ17_9BACT Length = 806 Score = 141 bits (354), Expect = 7e-32, Method: Composition-based stats. Identities = 52/354 (14%), Positives = 119/354 (33%), Gaps = 21/354 (5%) Query: 129 LQRWRLSLIVQATTLNQQLLEEEREQLLSEVQERMTLSGQLEPILADNNTAAGRLWDMSA 188 + ++ + + + E+ R+ V++ L + +++ + Sbjct: 124 IDKFSMEIDGKPVQAEILKAEKARDIYEGIVRKMKD--PALMEY-EGRDVLKLKIFPIEP 180 Query: 189 GQLKRGDYQLIVKYGEFLNEQPELKRLAEQLGRSREAKSIPRNDA-QMETFRTMVREPAT 247 KR Y + L L + + + ++ + ++ + Sbjct: 181 NGKKRITLS----YTQVLKLDSGLLNYVLPMNAGKYSSKPIKSVSVKVNVESKRPLKTIY 236 Query: 248 VPEQVDGLQQSDDILRLLPPELATLG-ITELEYEFY--RRLVEKQLLTYRLHGESWREKV 304 P +++ + E + + +L+ F + + L+ Y+ E + Sbjct: 237 SPSHEVEVKRDGSNRATVGYEASEVKPDADLQLYFAPEKDEIGVNLMAYKTGDEDGYFLL 296 Query: 305 IERPVVHKDYDEQPRGPFIVCVDTSGSMGGFNEQCAKAFCLALMRIALAENRRCYIMLFS 364 + P V + + +DTSGSM G + AK + +L + R I+ FS Sbjct: 297 LASPGVDAKAKQIVSKDVVFVLDTSGSMSGKKMEQAKKALQFCVE-SLNDGDRFEIIRFS 355 Query: 365 TEIVR----YELSGPQGIEQAIRFLS-QQFRGGTDLASCFRAIMERLQSREWFDADAVVI 419 TE + E+A F+ + GGT + + + L+S+E V + Sbjct: 356 TESEPLFDKLAAVSKENREKAGDFIKNLKAMGGTAIDEALKKALS-LESKEGRPFVVVFL 414 Query: 420 SDFIAQRLPDDVTSKVKELQRVH--QHRFHAVAMSAHGKPGIM-RIFDHIWRFD 470 +D + D +K +Q + + R + ++ RI + F Sbjct: 415 TDGLPTVGTTDEDQILKGMQERNKEKRRIFCFGIGTDVNTHLLDRIAEETRAFS 468 >UniRef50_Q1DE81 von Willebrand factor type A domain protein n=2 Tax=Myxococcales RepID=Q1DE81_MYXXD Length = 860 Score = 137 bits (346), Expect = 6e-31, Method: Composition-based stats. Identities = 53/346 (15%), Positives = 103/346 (29%), Gaps = 38/346 (10%) Query: 128 FLQRWRLSLIVQATTLNQQLLEEERE-QLLSEVQERMTLSGQLEPILADNNTAAGRLWDM 186 L R ++ E E + L V + P L G Sbjct: 100 LLDEERRNVFTAQVGNLLPYEETVVEVEFLQAVTAEEGSVRWMLPTLVAPRYIPGATTGD 159 Query: 187 SAGQLKRGDYQLIVKYGEFLNEQPEL-----KRLAEQLGRSREAKSIPRNDAQMETFRTM 241 G + + L LGR +S T Sbjct: 160 RTGHGSEEPTAQVPDADRITPPVGNVHYGLRMDLLVDLGREVVVESPSHAITTTREEGTR 219 Query: 242 VREPATVPEQVDGLQQSDDILRLLPPELATLGITELEYEFYRRLVEKQLLTYRL-HGESW 300 VR + + D+ + +L + F L+T+R G Sbjct: 220 VRV-GFSRGE---VSLDRDL-------VLSLRSPDSSAVF------TPLVTHRKGEGGPG 262 Query: 301 REKVIERPVVHKDYDEQPRGPFIVCVDTSGSMGGFNEQCAKAFCLALMRIALAENRRCYI 360 + P + P+ + VD SGSM G + A+A +R E R + Sbjct: 263 TFALTVVPDLLALASAPPKQEVVFVVDVSGSMAGESLPQAQAALRLCLRHL-REGDRFNV 321 Query: 361 MLFSTEIVRYELSG----PQGIEQAIRF-LSQQFRGGTDLASCFRAIMERLQSREWFDAD 415 + F ++ + +E+A R+ + GGT+L + RA ++ Sbjct: 322 IAFENRFQSFQPEPVPFTQRTLEEADRWVAALNADGGTELLAPMRAAVQAAPDG-----V 376 Query: 416 AVVISDFIAQRLPDDVTSKVKELQRVHQHRFHAVAMSAHGKPGIMR 461 V+++D + + + ++ + R ++ + + ++R Sbjct: 377 IVLLTDGQVGNEAEILRAVLEARKT---ARVYSFGIGTNVSDVLLR 419 >UniRef50_Q3IHK0 Putative uncharacterized protein n=2 Tax=Alteromonadales RepID=Q3IHK0_PSEHT Length = 664 Score = 134 bits (337), Expect = 6e-30, Method: Composition-based stats. Identities = 38/212 (17%), Positives = 78/212 (36%), Gaps = 18/212 (8%) Query: 255 LQQSDDILRLLPPELATLGITELEYEFYRRLVEKQLLTYRLHGESWREKVIERPVVHKDY 314 L + + + R E L + + F+ E GE + ++ P + Sbjct: 265 LNEQNALNRDFVLEFKPLQKEQAQAAFFTEQFEN--------GERYGLAMLMPPADNFIA 316 Query: 315 DEQPRGPFIVCVDTSGSMGGFNEQCAKAFCLALMRIALAENRRCYIMLFSTEIVRY---- 370 ++ + VDTSGSM G + + AK + L N I+ F + Sbjct: 317 TQRLARETVFVVDTSGSMHGQSMEQAKNALFYALS-LLDSNDSFNIIGFDNVVTLMSDKP 375 Query: 371 ELSGPQGIEQAIRFL-SQQFRGGTDLASCFRAIMERLQSREWFDADAVVISDFIAQRLPD 429 ++ + +A RF+ Q GGT++ A+++ Q + + ++D + D Sbjct: 376 LVASGFNLRRAERFIYGLQADGGTEIQGALDAVLDGSQFDGFVR-QVIFLTDG-SVSNED 433 Query: 430 DVTSKVKELQRVHQHRFHAVAMSAHGKPGIMR 461 + ++ ++ R V + + MR Sbjct: 434 ALFKSIQ--AKLGDSRLFTVGIGSAPNSFFMR 463 >UniRef50_UPI00003C852B hypothetical protein Faci_06871 n=1 Tax=Ferroplasma acidarmanus fer1 RepID=UPI00003C852B Length = 420 Score = 134 bits (336), Expect = 1e-29, Method: Composition-based stats. Identities = 56/392 (14%), Positives = 126/392 (32%), Gaps = 67/392 (17%) Query: 85 TPQFIVQLPQILDLLHRLNSPWAEQA-RQLVDANSTITSALHTLFLQRWRLSLIVQATTL 143 +F+ + ++ + E+A QL D S + S L + Sbjct: 66 KNEFMESVRLRMEYITETKDFKQERAYTQLNDRLSMLYSINFMKALNENAKKNQPRNGNS 125 Query: 144 NQQLLEEEREQLLSEVQERMTLSGQLEPILADNNTAAGRLWDMSAGQLKRGDYQLIVKYG 203 + ++ E+ + +++ ++ ++E I+ D N G+ + + ++ Sbjct: 126 SNAPDQKTIEKSMEGASKKVEMAHEIEKIVKDKNPGGN------IGKKEGMSVESLIDLT 179 Query: 204 EFLNEQPELKRLAEQLGRSREAKSIPRNDAQMETFRTMVREPATVPEQVDGLQQSDDILR 263 + + ++ + + + ++ G ++ I Sbjct: 180 DKAMKVDNADKILTLANKLIDIMPRYTKK----------MRSFSNTGELAGYYKTRHISN 229 Query: 264 LLPPELATLGITELEYEFYRRLV------EKQLLTYRLHGESWREKVIERPVVHKDYDEQ 317 +L ELA FY +L+ EK+L++ Sbjct: 230 VLSRELAMPDE-----IFYSKLINGFTGKEKRLMS------------------------- 259 Query: 318 PRGPFIVCVDTSGSM-GGFNEQCAKAFCLALMRIALAENRRCYIMLFSTEIVRYELSGPQ 376 G + V +D SGSM G +++ LAL RIA + R+ Y F + S Sbjct: 260 -PGSYYVLLDKSGSMYEGDKTLWSRSVALALFRIARSRGRKYYFRFFDNKPHDLLNSP-- 316 Query: 377 GIEQAIRFLSQQFRGGTDLASCFRAIMERLQSREWFD--ADAVVISDFIAQRLPDDVTSK 434 + L+ + GT + + + LQ + ++I+D + D Sbjct: 317 -FDVVENILTVEANKGTCIECALKTALRDLQDTKIRSETNTIIIITDGEDKVNMQDYFR- 374 Query: 435 VKELQRVHQHRFHAVAMSAHGKPGIMRIFDHI 466 + ++ + V + G+ +I Sbjct: 375 -----KENETKLITVMI-NGYNEGLKKISTEY 400 >UniRef50_Q0AMP5 Vault protein inter-alpha-trypsin domain protein n=1 Tax=Maricaulis maris MCS10 RepID=Q0AMP5_MARMM Length = 740 Score = 133 bits (335), Expect = 1e-29, Method: Composition-based stats. Identities = 31/177 (17%), Positives = 61/177 (34%), Gaps = 10/177 (5%) Query: 290 LLTYRLHGESWREKVIERPVVHKDYDEQPRGPFIVCVDTSGSMGGFNEQCAKAFCLALMR 349 L GE++ I P + I +D SGSMGG + + A+A + ++ Sbjct: 314 LFIEEWQGETYLLAQILPPAELGADTPRRARETIFVIDNSGSMGGASMRQARAALITALQ 373 Query: 350 IALAENRRCYIMLFSTEIVRYEL----SGPQGIEQAIRFL-SQQFRGGTDLASCFRAIM- 403 +R ++ F + + + P + A+ F + +GGT + A + Sbjct: 374 RLEPGDR-FNVIRFDNTMEQVFPQAVDASPDNVATALTFARRLEAQGGTVMLPALNAALR 432 Query: 404 ERLQSREWFDADAVVISDFIAQRLPDDVTSKVKELQRVHQHRFHAVAMSAHGKPGIM 460 + + V ++D + + L R R V + + M Sbjct: 433 DTSPDDDSRVRQIVFLTDGAIGNEAELFAAIEAGLGR---SRLFPVGIGSAPNGYFM 486 >UniRef50_A3W9L9 Putative uncharacterized protein n=2 Tax=Erythrobacter RepID=A3W9L9_9SPHN Length = 740 Score = 133 bits (335), Expect = 1e-29, Method: Composition-based stats. Identities = 39/234 (16%), Positives = 76/234 (32%), Gaps = 17/234 (7%) Query: 240 TMVREPATVPEQVDGLQQSDDILRLLPPELATLGITEL--EYEFYRRLVEKQ----LLTY 293 T+ +P PE + + + TL + +F R L + Sbjct: 257 TVNLDPGFAPEAISSPYHAVSVRGSGSTRTVTLADGAVPANRDFELRWSASGDAPMLGLF 316 Query: 294 RLH-GESWREKVIERPVVHKDYDEQPRGPFIVCVDTSGSMGGFNEQCAKAFCLALMRIAL 352 + GE P + E P I +D SGSM G + A+ L + Sbjct: 317 KQRHGELEYVMATITPPALERVGEAPPREMIFVIDNSGSMAGESMPAARRSLLYALETLR 376 Query: 353 AENRRCYIMLFSTEIVRYELSGPQGIEQAIRFLS-----QQFRGGTDLASCFRAIMERLQ 407 ++R ++ F + S Q + I GGT++ RA + + Sbjct: 377 PQDR-FNVIRFDDTMTELFASAVQASDSNIAAAKTFTHNLMANGGTEMLPALRAALRD-R 434 Query: 408 SREWFDADAVVISDFIAQRLPDDVTSKVKELQRVHQHRFHAVAMSAHGKPGIMR 461 + + + ++D D + ++ ++ R V + + +MR Sbjct: 435 APDERVRQVIFLTDGALSNEAD-MMEEINRNRK--DSRVFMVGIGSAPNTYLMR 485 >UniRef50_UPI0001AEB92D inter-alpha-trypsin inhibitor domain-containing protein n=1 Tax=Alteromonas macleodii ATCC 27126 RepID=UPI0001AEB92D Length = 586 Score = 133 bits (334), Expect = 2e-29, Method: Composition-based stats. Identities = 32/177 (18%), Positives = 64/177 (36%), Gaps = 10/177 (5%) Query: 290 LLTYRLHGESWREKVIERPVVHKDYDEQPRGPFIVCVDTSGSMGGFNEQCAKAFCLALMR 349 + T + ++ P V + +DTSGSMGG AK + Sbjct: 284 IFTESKGQHDYALVMLMPPQVKSQDLQDFDRDITFVIDTSGSMGGRPIVDAKESLQLAID 343 Query: 350 IALAENRRCYIMLFSTEIVRYELSGPQGIEQAIRFLS-----QQFRGGTDLASCFRAIME 404 +E R ++ F+ + R + +G + ++ GGT++A A ++ Sbjct: 344 RL-SEKDRFNVVAFNNDTTRLFETSVEGTTRNKQYARDFVKHLNAGGGTEMAPALNAALK 402 Query: 405 RLQSREWFDADAVVISDFIAQRLPDDVTSKVKELQRVHQHRFHAVAMSAHGKPGIMR 461 R ++++ V I+D + S++K + R V + + M Sbjct: 403 RTTTKDFIK-QVVFITDGAVGNEA-ALFSQIKN--ELGDARLFTVGIGSAPNSYFMT 455 >UniRef50_D0KVI6 Vault protein inter-alpha-trypsin domain protein n=1 Tax=Halothiobacillus neapolitanus c2 RepID=D0KVI6_HALNC Length = 671 Score = 132 bits (333), Expect = 2e-29, Method: Composition-based stats. Identities = 38/202 (18%), Positives = 70/202 (34%), Gaps = 12/202 (5%) Query: 266 PPE-LATLGITELEYEFYRRLVEKQLLTYRLHGESWREKVIERPVVHKDYDEQPRGPFIV 324 P E L + + L+TY +GE + + + P + R ++ Sbjct: 257 PSETHTGNRDFILSFRLQGAKINSGLMTYEWNGEHYFLMMAQPPKRVAPTEVMKR-EYLF 315 Query: 325 CVDTSGSMGGFNEQCAKAFCLALMRIALAENRRCYIMLFSTEIVRYELSG----PQGIEQ 380 VD SGSM GF A L+ + I+ FS + P+ +++ Sbjct: 316 VVDVSGSMYGFPLNTASDLMRELLSSLKPQE-TFNILFFSGGSRVLSPTPLQATPENLQR 374 Query: 381 AIRFL-SQQFRGGTDLASCFRAIMERLQSREWFDADAVVISDFIAQRLPDDVTSKVKELQ 439 A+ + S Q GGT+L + ++ + + VVI+D + L Sbjct: 375 AMTMMRSIQGGGGTELLPALKTAFAMPRTEDTARS-IVVITDGYVDVERQAYDLIKQNLN 433 Query: 440 RVHQHRFHAVAMSAHGKPGIMR 461 + A + + +M Sbjct: 434 STN---LFAFGIGSSVNRYLME 452 >UniRef50_C5EGH1 von Willebrand factor n=2 Tax=Clostridiales RepID=C5EGH1_9FIRM Length = 681 Score = 132 bits (333), Expect = 2e-29, Method: Composition-based stats. Identities = 40/216 (18%), Positives = 72/216 (33%), Gaps = 12/216 (5%) Query: 253 DGLQQSDDILRLLPPELATLGITELEYEFYRRLVEKQLLTYRLHGESWREKVIERPVVHK 312 S I P + L Y+ + V L+ E++ +++ P Sbjct: 247 QSADSSAHITLKDPADYGGNRDFILRYQLAGQTVNSGLMLNTGEKENFFLLMVQPPERVP 306 Query: 313 DYDEQPRGPFIVCVDTSGSMGGFNEQCAKAFCLALMRIALAENRRCYIMLFSTEIVRYE- 371 PR +I +D SGSM G+ AK ++ E ++LFS + +R Sbjct: 307 AEAIPPR-EYIFVLDVSGSMFGYPLDTAKELIRNMVSNL-RETDTFNLILFSNDAIRMSA 364 Query: 372 ---LSGPQGIEQAIRFL-SQQFRGGTDLASCFRAIMERLQSREWF--DADAVVISDFIAQ 425 + + +E+AI + Q+ GGT+LA + VVI+D Sbjct: 365 RSLPATDENVERAINLINRQKGGGGTELAPALEKAVGIPMDSGAGSVSRSVVVITDGYMS 424 Query: 426 RLPDDVTSKVKELQRVHQHRFHAVAMSAHGKPGIMR 461 + V F + + ++ Sbjct: 425 D-EQAIFDIVAGNLDT--TSFFSFGIGTSVNRYLIE 457 >UniRef50_Q9YD81 Putative uncharacterized protein n=1 Tax=Aeropyrum pernix RepID=Q9YD81_AERPE Length = 463 Score = 132 bits (333), Expect = 2e-29, Method: Composition-based stats. Identities = 66/324 (20%), Positives = 119/324 (36%), Gaps = 57/324 (17%) Query: 148 LEEEREQLLSEVQERMTLSGQLEPILADNNTAAGRLWDMSAGQLKRGDYQLIVKYGEFLN 207 L E E+ L V++ + ++ +L+ G + +++ + Sbjct: 168 LREAVEKALESVEKDARAAKNVKQLLSMMGA----------GDTSVLAFDENIEFILRIA 217 Query: 208 EQPELKRLAEQLGRSREAKSIPRNDAQMETFRTMVREPATVPEQVDGLQQSDDILRLLPP 267 + ++ R+ E + + R RE + DGL+ D+ R+ Sbjct: 218 RETDVSRVLENV-----------QGIREIMRRRSRRETRSPKGWFDGLEYGSDLERIHYS 266 Query: 268 ELATLGITELEYEFYRRLVEKQLLTYRLHGESWREKVIERPVVHKDYDEQPRGPFIVCVD 327 +L F+ +LL YR + RGP V +D Sbjct: 267 QLILPDE-----YFWASFSSSKLLLYRK------------------VLDSSRGPIYVLLD 303 Query: 328 TSGSMGGFNEQCAKAFCLALMRIALAENRRCYIMLFST----EIVRYELSGPQGIEQAIR 383 SGSM G A+A +AL R +LAENRR F + I S P+ + ++ Sbjct: 304 KSGSMVGAKIDWARAVAVALFRRSLAENRRFSARFFDSVTYPAIHLRPRSKPRDFLELVK 363 Query: 384 FL-SQQFRGGTDLASCFRAIMERLQ---SREWFDADAVVISDFIAQRLPDDVTSKVKELQ 439 +L + + GGTD+ + + + + E +D V+I+D + V++ Sbjct: 364 YLAAVKAGGGTDITAAIKTAADDISRTPRGEQRISDIVLITDGEDRLN----IDVVEDSL 419 Query: 440 RVHQHRFHAVAMSAHGKPGIMRIF 463 + R H V + H P + RI Sbjct: 420 KRSDARLHTVIIQGH-NPYLKRIS 442 >UniRef50_D2R2I7 von Willebrand factor type A n=2 Tax=Planctomycetaceae RepID=D2R2I7_9PLAN Length = 786 Score = 132 bits (332), Expect = 3e-29, Method: Composition-based stats. Identities = 36/293 (12%), Positives = 102/293 (34%), Gaps = 20/293 (6%) Query: 200 VKYGEFLNEQPELKRLAEQLGRSREAKSIPRNDAQMETFRTMV-REPATVPEQVDGLQQS 258 +++ + L ++ L L + +R + + T + + + P +++ Sbjct: 182 IRFSQLLRQEHRLTDLLIPMATARYTNTPIEKVSLEATIESSIAIKSVYSPTHAVDVKRP 241 Query: 259 DDILRLLPPELATLGITELEYEFYRRLVEKQL----LTYR-LHGESWREKVIERPVVHKD 313 D+ + E + + ++ + + L L+YR + + ++ P + Sbjct: 242 DEKHATVKFEASNY-LPTTDFRLLYDVGDAPLAASVLSYRPDNSDEGFFLMLASPNHSQG 300 Query: 314 YDEQPRGPFIVCVDTSGSMGGFNEQCAKAFCLALMRIALAENRRCYIMLFSTEIVRY--- 370 + + I VD SGSM G + A+ ++ E I+ + + + + Sbjct: 301 EVDLTKKTVIFVVDRSGSMQGKKIEQAREAMRYVLNNLH-EGDTFNIVAYDSTVESFKPE 359 Query: 371 --ELSGPQGIEQAIRFLSQQFRGGTDLASCFRAIMERLQSREWFDADAVVISDFIAQRLP 428 + G T+++ + L + + + ++D + Sbjct: 360 LQKFDDATRKSALAYVDGLYAGGSTNISGALDSAFAMLTGSDRPN-YILFLTDGLPTAGE 418 Query: 429 DDV--TSKVKELQRVHQHRFHAVAMSAHGKPGIMRIFDHIWRFDTGMRSRLLR 479 + ++ + + VH+ R + ++ D + R + G +S+ +R Sbjct: 419 TNEGKIVELAKQKNVHRARMINFGVGYDVNSRLL---DRMSRENFG-QSQYVR 467 >UniRef50_B5ZY26 Vault protein inter-alpha-trypsin domain protein n=11 Tax=Rhizobium RepID=B5ZY26_RHILW Length = 794 Score = 132 bits (331), Expect = 4e-29, Method: Composition-based stats. Identities = 32/219 (14%), Positives = 64/219 (29%), Gaps = 14/219 (6%) Query: 253 DGLQQSDDILRLLPPEL-ATLGITELEYEFYR---RLVEKQLLTYRLHGESWREKVIERP 308 ++Q D R + + A + E + +L L G++ + P Sbjct: 283 VDIRQDGDQERTISLKGDAVPADKDFELTWQAAPGKLPSAGLFREVKDGKTCLLAFVTPP 342 Query: 309 VVHKDYDEQPRGPFIVCVDTSGSMGGFNEQCAKAFCLALMRIALAENRRCYIMLFSTEIV 368 + + +D SGSM G + + AK + +R ++ F + Sbjct: 343 TAPDAAAPPAKREVVFVIDNSGSMSGPSIEQAKQSLALAISRLTPNDR-FNVIRFDDTMT 401 Query: 369 RY----ELSGPQGIEQAIRFLS-QQFRGGTDLASCFRAIM-ERLQSREWFDADAVVISDF 422 Y + P E+AI ++ GGT++ + + V ++D Sbjct: 402 DYFKGLVAATPDNREKAIAYVRGLPADGGTEMLPALEDALRNQGPVATGALRQVVFLTDG 461 Query: 423 IAQRLPDDVTSKVKELQRVHQHRFHAVAMSAHGKPGIMR 461 R V + + M Sbjct: 462 AIGNEQQLFQEITAN---RGDARVFTVGIGSAPNTYFMT 497 >UniRef50_B5JSY7 Inter-alpha-trypsin inhibitor domain protein n=1 Tax=gamma proteobacterium HTCC5015 RepID=B5JSY7_9GAMM Length = 670 Score = 131 bits (329), Expect = 6e-29, Method: Composition-based stats. Identities = 27/176 (15%), Positives = 59/176 (33%), Gaps = 11/176 (6%) Query: 291 LTYRLHGESWREKVIERPVVHKDYDEQPRGPFIVCVDTSGSMGGFNEQCAKAFCLALMRI 350 + GE + ++ P PR + +DTSGSM G AK + Sbjct: 292 FSEEYKGEHYALVMLRTPDEMTSGPRMPR-EVVFVIDTSGSMAGQRMYHAKQALSQAVER 350 Query: 351 ALAENRRCYIMLFSTE----IVRYELSGPQGIEQAIRFL-SQQFRGGTDLASCFRAIMER 405 ++R ++ F+ + + ++QA+ ++ Q GGT + + Sbjct: 351 LSPDDR-FNVVEFNNQHSRLFSSMRSASAINVKQALNWVGRLQGGGGTMMLPAVEDALSV 409 Query: 406 LQSREWFDADAVVISDFIAQRLPDDVTSKVKELQRVHQHRFHAVAMSAHGKPGIMR 461 + ++I+D + + ++ R V + ++R Sbjct: 410 RSDPAYLR-QVILITDASVGNEAEILRVV---ERQRKGARLFTVGIGVSPNSYLLR 461 >UniRef50_A7RKA1 Predicted protein n=2 Tax=Nematostella vectensis RepID=A7RKA1_NEMVE Length = 1128 Score = 131 bits (328), Expect = 7e-29, Method: Composition-based stats. Identities = 57/392 (14%), Positives = 130/392 (33%), Gaps = 60/392 (15%) Query: 94 QILDLLHRLNSPWAEQAR-QLVDANSTIT-SALHTLFLQRWRLSLIVQATTLN-QQLLEE 150 ++ L + S + A +L + + + L T +Q + L ++ L+ +L + Sbjct: 13 LLMLYLIKDTSTTLQTAPGELAKKLAELAINGLGTSEMQGYYDKLTFKSLDLDGNSILND 72 Query: 151 EREQLLSEVQERMTLSGQLEPILADNNTAAGRLWDMSAGQLKRGDYQLIVKYGEFLNEQP 210 + +++Q ++T++ +++ + + + + S + + D + + KY + Sbjct: 73 LATRFANKLQTKVTIARKIKDAVEVSYAKSATV--TSRTECCKADTRWL-KYDSRFRTKV 129 Query: 211 ELKRLAEQLGRSREAKSIPRNDAQMETFRTMVREPATVPEQVDGLQQSDDILRLLPPELA 270 L + + + + D ++T + + T+ Q G ++ L Sbjct: 130 NLDEMCVIISGAASSNPKQLQDNVLQTMKQNIENNPTLTWQYFGSEEG------LYTNYP 183 Query: 271 TLGITELEYEFYRRLVEKQLLTYRLHGESWREKVIERPVVHKDYDEQPRGPFIVCVDTSG 330 + + + R RP + QP+ I+ VD SG Sbjct: 184 MIRDSSSCSSYDPRY---------------------RPWYVEAASPQPK-DVILVVDYSG 221 Query: 331 SMGGFNEQCAKAFCLALMRIALAENRRCYIMLFSTEIVRYELSGPQGIEQ---------- 380 SMGG AK ++ +R + F + + R +++ ++ Sbjct: 222 SMGGSRLPIAKEAAKTVLDTLNPRDR-VAFLAFESGVRRVKVTSGDAKDEKCFESSLAKA 280 Query: 381 ---AIRFLSQQ-----FRGGTDLASCFRAIMERL-----QSREWFDADAVVISDFIAQRL 427 I L + GGT A F A + L + + ++D Sbjct: 281 SPVNIDILKKFLDGEYASGGTMYAIAFNAAFDILDKYYKEKNTTRRPVILFMTDGAPNDD 340 Query: 428 PDDVTSKVKELQRVHQHR--FHAVAMSAHGKP 457 P + + VK + + M P Sbjct: 341 PGTILNTVKTRNQGLSTKADILTFGMGGGISP 372 >UniRef50_B0UK93 LPXTG-motif cell wall anchor domain protein n=1 Tax=Methylobacterium sp. 4-46 RepID=B0UK93_METS4 Length = 761 Score = 131 bits (328), Expect = 8e-29, Method: Composition-based stats. Identities = 32/176 (18%), Positives = 58/176 (32%), Gaps = 11/176 (6%) Query: 294 RLHGESWREKVIERPVVHKDYDEQPRGPFIVCVDTSGSMGGFNEQCAKAFCLALMRIALA 353 GE + P + +PR +D SGSM G + + AKA L + Sbjct: 348 ERVGEDEYLLAVVTPPEGRAPARRPR-EVTFVIDNSGSMAGASMRQAKASLLVALDRLGP 406 Query: 354 ENRRCYIMLFSTEIVRY----ELSGPQGIEQAIRF-LSQQFRGGTDLASCFRAIMERLQS 408 +R ++ F + + + A RF + + RGGT++ RA + Sbjct: 407 ADR-FNVIRFDDTMDLLFPAPVPADEAHRDAARRFVAALEARGGTEMLPPLRAALADPHP 465 Query: 409 REWFD-ADAVVISDFIAQRLPDDVTSKVKELQRVHQHRFHAVAMSAHGKPGIMRIF 463 E V ++D ++ R R + + + +M Sbjct: 466 EEGDRVRQIVFLTDGAIGNEEQIFSAISAGRGR---SRLFMIGIGSAPNGHLMTHA 518 >UniRef50_B7RYC9 Vault protein inter-alpha-trypsin n=1 Tax=marine gamma proteobacterium HTCC2148 RepID=B7RYC9_9GAMM Length = 686 Score = 131 bits (328), Expect = 9e-29, Method: Composition-based stats. Identities = 34/203 (16%), Positives = 72/203 (35%), Gaps = 22/203 (10%) Query: 276 ELEYEFYRRL--VEKQL-----LTYRLHGESWREKVIERPVVHKDYDEQPRGPFIVCVDT 328 E++ +F + L T R+ + + ++ P + + PR + VDT Sbjct: 266 EMDRDFVLQWSAASGSLPGAAFFTERVDDQYYGLLMLVPPASQRAAETVPR-EIVFVVDT 324 Query: 329 SGSMGGFNEQCAKAFCLALMRIALAENRRCYIMLFSTEIVRY----ELSGPQGIEQAIRF 384 SGSMGG + + AK +R +R ++ F++ + ++ A + Sbjct: 325 SGSMGGVSIKQAKGSLTRALRHLGPNDR-FNVIEFNSSHRALFQHAVPASHHNLQLASEY 383 Query: 385 LS-QQFRGGTDLASCFRAIMERLQSREWFD-----ADAVVISDFIAQRLPDDVTSKVKEL 438 + + GGT++ + ++ +++ + I+D V L Sbjct: 384 VRHLEASGGTEMMPALQLALKLPGAQDELRPEPALRQVIFITDGAVGNESALFEHIVDSL 443 Query: 439 QRVHQHRFHAVAMSAHGKPGIMR 461 R V + + MR Sbjct: 444 ---GGSRLFTVGIGSAPNAWFMR 463 >UniRef50_Q21PJ3 von Willebrand factor, type A n=1 Tax=Saccharophagus degradans 2-40 RepID=Q21PJ3_SACD2 Length = 763 Score = 130 bits (326), Expect = 1e-28, Method: Composition-based stats. Identities = 34/182 (18%), Positives = 66/182 (36%), Gaps = 19/182 (10%) Query: 295 LHGESWREKVIERPVVHKDYDEQPRGPFIVCVDTSGSMGGFNEQCAKAFCLALMRIALAE 354 L GE + ++ P + + + + VDTSGSM G + Q AK +R Sbjct: 361 LAGEDYLLLMLLPPQGQQQHTQSLSRDIVFVVDTSGSMQGTSIQQAKRSLQFALRGLNPS 420 Query: 355 NRRCYIMLFSTEIVRYELSG----PQGIEQAIRFL-SQQFRGGTDLASCFRAIMERL--- 406 + I+ F T R+ ++ A+ ++ + GT++ + ++L Sbjct: 421 D-TFNIIEFDTSFSRFRSRPVSATASNVQAAVSWVNNLNADNGTEMYAALEEAFDQLASI 479 Query: 407 QSREWFDA-------DAVVISDFIAQRLPDDVTSKVKELQRVHQHRFHAVAMSAHGKPGI 459 ++ V I+D + S + R++ R VA+ + Sbjct: 480 NPNGTENSKSSNNLQQVVFITDGAVGN-EQALLSLIHR--RLNNARLFTVAIGSAPNSYF 536 Query: 460 MR 461 MR Sbjct: 537 MR 538 >UniRef50_B8CPU4 Von Willebrand factor, type A n=1 Tax=Shewanella piezotolerans WP3 RepID=B8CPU4_SHEPW Length = 710 Score = 130 bits (326), Expect = 1e-28, Method: Composition-based stats. Identities = 28/169 (16%), Positives = 57/169 (33%), Gaps = 9/169 (5%) Query: 298 ESWREKVIERPVVHKDYDEQPRGPFIVCVDTSGSMGGFNEQCAKAFCLALMRIALAENRR 357 + ++ P I+ +DTSGSM G + AKA + + A++ Sbjct: 327 PQYSLVMLLPPQDKMRLSALAPRELILVIDTSGSMSGEAIEQAKASIIYALAGLSAQD-S 385 Query: 358 CYIMLFSTEIVRYELSGPQGIEQAIRFL-----SQQFRGGTDLASCFRAIMERLQSREWF 412 I+ F++ + + + I Q GGT+++ + + + Sbjct: 386 FNILQFNSNVYALSDTPLNASAKNIGRAQAYVQRLQANGGTEMSLALDKALSQQDANRER 445 Query: 413 DADAVVISDFIAQRLPDDVTSKVKELQRVHQHRFHAVAMSAHGKPGIMR 461 + I+D P T +LQ+ R + + M+ Sbjct: 446 LRQVLFITDGAVGNEPQLFTQIRNQLQQ---SRLFTIGIGDAPNAHFMQ 491 >UniRef50_Q3V4Q4 Putative VWFA domain-containing protein ORF892 n=1 Tax=Acidianus two-tailed virus RepID=Y892_ATV Length = 892 Score = 130 bits (326), Expect = 1e-28, Method: Composition-based stats. Identities = 57/386 (14%), Positives = 126/386 (32%), Gaps = 63/386 (16%) Query: 108 EQARQLVDANSTITSALHTLFLQRWRLSLIVQATTLNQQLLEEEREQLLSEVQERMTLSG 167 ++ ++ S + + L + + I + T +Q + E +E+ + + + Sbjct: 553 QEINSILQTLSQLRDTVQQLRNYDFYSNAIDERTFDDQNVSEHRKEKEIENLYDTANNVQ 612 Query: 168 QLEPILADNNTAAGRLWDMSAGQLKRGDYQLIVKYGEFLNEQPELKRLAEQLGRSREAKS 227 +L L + ++ Q Y L G + L G Sbjct: 613 KLLKDLNVSEGEQNKIL-----QELVTQYSLRRLLGNIATKYKGLLDFVRNKG------- 660 Query: 228 IPRNDAQMETFRTMVREPATVPEQVDGLQQSDDILRLLPPELATLGITELEYEFYRRLVE 287 ++++E+ + P++ DD+ RL E A L + F + Sbjct: 661 ----ESEVESRHGSGKGPSS--------TMGDDLSRLFIKEYAKLSNPIQQKMFLLDYLN 708 Query: 288 KQLLTYRLHGESWREKVIERPVVHKDYDEQPRGPFIVCVDTSGSMGGFNEQCAKAFCLAL 347 L ++ +E+ +G F+ +D+SGSM G A A L Sbjct: 709 GALSIHK-------------------SEEKKQGDFLFVIDSSGSMEGNKIATALAIPLVT 749 Query: 348 MRIALAENRRCYIMLFSTEIVRYELSGPQGIEQAIRFLSQQFRGGTDLASCFRAIMERLQ 407 + R + FS E S I+ L GGT++ S ++ + Sbjct: 750 YKK-YKGKRNILVETFSDE-----PSPIYNIKNIANVLGSMKFGGTNIGSAVLYALKNID 803 Query: 408 SREW-----------FDADAVVISDFIAQRLPDDVTSKVKELQRVHQHRFHAVAM--SAH 454 + ++++D +PDD+ ++ L++ ++ + Sbjct: 804 KPDSDYDRKLRESLRKTRTLILLTDGE-DEIPDDIAREINSLKKKNKVELLCYGIDLGER 862 Query: 455 GKPGIMRIFDHIWRFDTGMRSRLLRR 480 G + I D ++ + ++ + Sbjct: 863 GLKTLKEICDEVYAVGSNNFGNIVLK 888 >UniRef50_A7HVH6 Vault protein inter-alpha-trypsin domain protein n=1 Tax=Parvibaculum lavamentivorans DS-1 RepID=A7HVH6_PARL1 Length = 755 Score = 129 bits (325), Expect = 2e-28, Method: Composition-based stats. Identities = 33/178 (18%), Positives = 61/178 (34%), Gaps = 11/178 (6%) Query: 290 LLTYRLHGESWREKVIERPVVHKDYDEQPRGPFIVCVDTSGSMGGFNEQCAKAFCLALMR 349 L R+ E + ++ P + +PR I +D SGSM G + AK L + Sbjct: 318 LFRERVGNEDYLLVMLTPPSGSVQPEAKPREA-IFVIDNSGSMSGPSMVQAKESLLWALD 376 Query: 350 IALAENRRCYIMLFSTEIVRY----ELSGPQGIEQAIRFL-SQQFRGGTDLASCFRAIM- 403 ++ F + + + + A +F+ S + GGT++ RA + Sbjct: 377 RLKP-GDTFNVIRFDDTLTVLFPDAVPAHGENLAVAKKFVKSLEANGGTEMLPALRASLI 435 Query: 404 ERLQSREWFDADAVVISDFIAQRLPDDVTSKVKELQRVHQHRFHAVAMSAHGKPGIMR 461 +R + V ++D + L R R V + + M Sbjct: 436 DRNVNDGTRLRQIVFLTDGAISNEAELFHEITSNLGR---SRLFTVGIGSAPNSYFMT 490 >UniRef50_A0P2E4 Von Willebrand factor type A like domain n=1 Tax=Labrenzia aggregata IAM 12614 RepID=A0P2E4_9RHOB Length = 772 Score = 129 bits (324), Expect = 2e-28, Method: Composition-based stats. Identities = 38/196 (19%), Positives = 77/196 (39%), Gaps = 13/196 (6%) Query: 273 GITELEYEFYRR--LVEKQLLTYRLHGESWREKVIERPVVHKDYDEQPRGPFIVCVDTSG 330 L YE + + Y G + +IE P + + D + + +DTSG Sbjct: 316 RDFVLRYELAAQSDVAAGVSSRYEADGGGYFSLLIEPPKLPAE-DMIGQRELVFVLDTSG 374 Query: 331 SMGGFNEQCAKAFCLALMRIALAENRRCYIMLFSTEIVRY----ELSGPQGIEQAIRFLS 386 SM G + +K F A ++ AL + I+ FS + ++ L+ + ++A++F++ Sbjct: 375 SMSGQPIEASKTFMTAAIK-ALRPDDYFRILHFSNDTSQFAGQAVLATERNKQKALKFVA 433 Query: 387 -QQFRGGTDLASCFRAIMERLQSREWFDADAVVISDFIAQRLPDDVTSKVKELQRVHQHR 445 GGT++ A ++ Q V ++D + S R+ + R Sbjct: 434 DLSAGGGTEINQAVNAAFDQAQPDNTTRI-VVFLTDGYIGDEATVIKSI---ANRIGKAR 489 Query: 446 FHAVAMSAHGKPGIMR 461 +A + ++ Sbjct: 490 IYAFGVGNSVNRFLLD 505 >UniRef50_A6X8G3 LPXTG-motif cell wall anchor domain protein n=11 Tax=Rhizobiales RepID=A6X8G3_OCHA4 Length = 750 Score = 129 bits (324), Expect = 3e-28, Method: Composition-based stats. Identities = 30/177 (16%), Positives = 65/177 (36%), Gaps = 12/177 (6%) Query: 290 LLTYRLHGESWREKVIERPVVHKDYDEQPRGPFIVCVDTSGSMGGFNEQCAKAFCLALMR 349 L + + + + P + ++ + I +D SGSMGG + + AKA + Sbjct: 324 LFREHIGKDDYVLAYVTPPAL--ASPKKVQREVIFVIDNSGSMGGTSIEQAKASLDYALS 381 Query: 350 IALAENRRCYIMLFSTEIVRY----ELSGPQGIEQAIRFL-SQQFRGGTDLASCFRAIME 404 +R ++ F + ++ + + I A RF+ S + +GGT++ A ++ Sbjct: 382 QLQPGDR-FNVIRFDDTLTKFFEDSVDANQENIASARRFVTSLEAQGGTEMLPALHAALD 440 Query: 405 RLQSREWFDADAVVISDFIAQRLPDDVTSKVKELQRVHQHRFHAVAMSAHGKPGIMR 461 V ++D + + R + R V + + +M Sbjct: 441 DSNQGNGLR-QIVFLTDGEISNEQQLLDAV---AARRGRSRIFMVGIGSAPNSYLMN 493 >UniRef50_A4C730 Putative uncharacterized protein n=1 Tax=Pseudoalteromonas tunicata D2 RepID=A4C730_9GAMM Length = 684 Score = 129 bits (323), Expect = 3e-28, Method: Composition-based stats. Identities = 25/172 (14%), Positives = 61/172 (35%), Gaps = 10/172 (5%) Query: 295 LHGESWREKVIERPVVHKDYDEQPRGPFIVCVDTSGSMGGFNEQCAKAFCLALMRIALAE 354 + + ++ P ++ I +DTSGSM G + + AK+ + + Sbjct: 305 FENDDYALVMLMPPSDEFIAAQRLPREVIFVIDTSGSMHGESLEQAKSALFFALANLDPQ 364 Query: 355 NRRCYIMLFSTEIVRY----ELSGPQGIEQAIRFL-SQQFRGGTDLASCFRAIMERLQSR 409 + I+ F++++ + I +A F+ + GGT++ F +++ + Sbjct: 365 D-SFNIIEFNSKVNALNAQALPANDFNIRRARNFVYGLKADGGTEIGLAFEQVLDNSEHA 423 Query: 410 EWFDADAVVISDFIAQRLPDDVTSKVKELQRVHQHRFHAVAMSAHGKPGIMR 461 ++ V ++D + L R + + + M Sbjct: 424 DYLR-QIVFLTDGSISNETEVFAQIKGSL---GDSRIFTIGIGSAPNSYFMT 471 >UniRef50_A1VI76 Vault protein inter-alpha-trypsin domain protein n=3 Tax=Burkholderiales RepID=A1VI76_POLNA Length = 701 Score = 128 bits (322), Expect = 4e-28, Method: Composition-based stats. Identities = 40/200 (20%), Positives = 72/200 (36%), Gaps = 17/200 (8%) Query: 273 GITELEYEFYRRLVEKQLLTYRL------HGESWREKVIERPVVHKDYDEQPRGPFIVCV 326 L+Y +E ++ Y+ GE++ +IE P PR +I V Sbjct: 274 RDFILDYRLAGERIESGVMLYQGTPGNGASGENFFLAMIEPPKQVAAQAISPR-DYIFVV 332 Query: 327 DTSGSMGGFNEQCAKAFCLALMRIALAENRRCYIMLFSTEIVRYELSG----PQGIEQAI 382 D SGSM GF AK L+ + ++LFS + IEQA+ Sbjct: 333 DISGSMHGFPLDTAKTLMRELIGKLRPSD-TFNVLLFSGSNRFLSPASVPATQANIEQAV 391 Query: 383 RFL-SQQFRGGTDLASCFRAIMERLQSREWFDADAVVISDFIAQRLPDDVTSKVKELQRV 441 R + GGT+L + + ++ + VV++D + + L + Sbjct: 392 RTIDEMGGGGGTELIPALKRVYAEPKAADVSR-TVVVVTDGFVTVEREAFELVRRNLSQA 450 Query: 442 HQHRFHAVAMSAHGKPGIMR 461 + + + + +M Sbjct: 451 N---LFSFGIGSSVNRHLME 467 >UniRef50_B9XLE8 Vault protein inter-alpha-trypsin domain protein n=1 Tax=bacterium Ellin514 RepID=B9XLE8_9BACT Length = 723 Score = 128 bits (322), Expect = 4e-28, Method: Composition-based stats. Identities = 33/238 (13%), Positives = 76/238 (31%), Gaps = 22/238 (9%) Query: 231 NDAQMETFRTMVREPATVPEQV-DGLQQSDDILRLLPPELATLGITELEYEFYRRLVEKQ 289 + + + + +T PEQ+ L + D I L Y ++ Sbjct: 318 SIEEFSCVTHKISKTSTTPEQLSVDLSEGDRIPN---------KDFVLRYRIAGERIKSN 368 Query: 290 LLTYRLHGESWREKVIERPVVHKDYDEQPRGPFIVCVDTSGSMGGFNEQCAKAFCLALMR 349 + +R + ++ P P + +D SGSM G AKA ++ Sbjct: 369 FMVHRDERGGYFTMMLYPPKELGQLGRAP-MEMVFVLDCSGSMSGEPIAQAKAAIRHALK 427 Query: 350 IALAENRRCYIMLFSTEIVRY----ELSGPQGIEQAIRFLS-QQFRGGTDLASCFRAIME 404 I+ FS + + P+ I + + ++ G T++ +A ++ Sbjct: 428 QLQP-GDSFQIINFSEHASQLGAKPLEATPENIRKGLAYVEALNSDGPTEMIEGIKAALD 486 Query: 405 RLQSREWFDADAVVISDFIAQRLPDDVTSKVKELQRVHQHRFHAVAMSAHGKPGIMRI 462 E ++D + + + + R+ R + + + ++ Sbjct: 487 FPHDPE-RLRFVCFLTDGFIGNEAEILAAVHE---RIGASRIFSFGVGS-CNRYLLDH 539 >UniRef50_D0LTR8 von Willebrand factor type A n=1 Tax=Haliangium ochraceum DSM 14365 RepID=D0LTR8_HALO1 Length = 903 Score = 127 bits (320), Expect = 7e-28, Method: Composition-based stats. Identities = 36/169 (21%), Positives = 60/169 (35%), Gaps = 6/169 (3%) Query: 299 SWREKVIERPVVHKDYDEQPRGPFIVCVDTSGSMGGFNEQCAKAFCLALMRIALAENRRC 358 + EK++ + EQP + VD SGSM G + AK A + + Sbjct: 437 TRIEKIMPVRFDSEKQREQPHVAIALVVDRSGSMSGLKIEAAKESARATAEVLSPSD-LI 495 Query: 359 YIMLFSTEIVRY--ELSGPQGIEQAIRFLSQQFRGGTDLASCFRAIMERLQSREWFDADA 416 ++ F + + A Q GGT++ R E LQ Sbjct: 496 TVVAFDNQPTTIVRLQRASNRMRIATDIARLQAGGGTNIYPALREAYEILQGANAKVKHV 555 Query: 417 VVISDFIAQRLPDDVTSKVKELQRVHQHRFHAVAMSAHGKPGIMRIFDH 465 +V+SD A P D + + + R + AV + + + I D+ Sbjct: 556 IVLSDGQA---PYDGIADLCQEMRSARITVSAVGIGDADRNLLNLITDN 601 >UniRef50_C4WI90 Poly [ADP-ribose] polymerase 4 n=1 Tax=Ochrobactrum intermedium LMG 3301 RepID=C4WI90_9RHIZ Length = 777 Score = 127 bits (320), Expect = 8e-28, Method: Composition-based stats. Identities = 31/179 (17%), Positives = 62/179 (34%), Gaps = 12/179 (6%) Query: 290 LLTYRLHGESWREKVIERPVVHKDYDEQPRGPFIVCVDTSGSMGGFNEQCAKAFCLALMR 349 L + + + + P V ++ + + +D SGSMGG + + AKA + Sbjct: 351 LFREHVGKDDYVLAYVTPPAV--ASAKKAQREVVFVIDNSGSMGGTSIEQAKASLDYALS 408 Query: 350 IALAENRRCYIMLFSTEIVRY----ELSGPQGIEQAIRF-LSQQFRGGTDLASCFRAIME 404 +R ++ F + R+ + Q I A F +S + +GGT + A ++ Sbjct: 409 HLQPGDR-FNVIRFDDTLTRFFEVSVEASQQNIASARHFVMSLEAQGGTAMLPALHAALD 467 Query: 405 RLQSREWFDADAVVISDFIAQRLPDDVTSKVKELQRVHQHRFHAVAMSAHGKPGIMRIF 463 V ++D + + R + R V + +M Sbjct: 468 DSHQGNGLR-QIVFLTDGEISNEQQLLDAI---AARRGRSRIFMVGIGTAPNSYLMNHA 522 >UniRef50_Q6KZN8 Putative uncharacterized protein n=1 Tax=Picrophilus torridus RepID=Q6KZN8_PICTO Length = 417 Score = 127 bits (319), Expect = 9e-28, Method: Composition-based stats. Identities = 63/384 (16%), Positives = 130/384 (33%), Gaps = 53/384 (13%) Query: 85 TPQFIVQLPQILDLLHRL-NSPWAEQARQLVDANSTITSALHTLFLQRW-RLSLIVQATT 142 F+ + + + L D S I S L + + + Sbjct: 61 NDAFLEAAREKMKKYTETPEFKRVKNYSYLNDKVSMIYSISFVKALGDEIKKAESSGRGS 120 Query: 143 LNQQLLEEEREQLLSEVQERMTLSGQLEPILADNNTAAGRLWDMSAGQLKRGDYQLIVKY 202 ++ + +E E+ L E + + + +L N G+L D G L ++ Sbjct: 121 MDGKKAQEIIERALKESERIGDRARDVNNLLKG-NNPGGKLADKKDGTL-----DNVLDL 174 Query: 203 GEFLNEQPELKRLAEQLGRSREAKSIPRNDAQMETFRTMVREPATVPEQVDGLQQSDDIL 262 + + + +++ N + T V + ++ G ++ +IL Sbjct: 175 TDKIIKVDNSEKIITMA----------TNLIDIMPKFTRVMKNKNNLGELGGYYKTRNIL 224 Query: 263 RLLPPELATLGITELEYEFYRRLVEKQLLTYRLHGESWREKVIERPVVHKDYDEQPRGPF 322 +P E+A FY +L G + REKVI G + Sbjct: 225 HAIPREVAMPDE-----IFYSKLA---------SGFTAREKVINSE-----------GSY 259 Query: 323 IVCVDTSGSM-GGFNEQCAKAFCLALMRIALAENRRCYIMLFSTEIVRYELSGPQGIEQA 381 + +D SGSM G +++ LAL R+A + R+ ++ F + I Sbjct: 260 YILLDKSGSMYEGTKTVWSRSVALALYRLATIKKRKYFLRFFDNKPHEVLTRPYDIIN-- 317 Query: 382 IRFLSQQFRGGTDLASCFRAIMERLQSREWFDA-DAVVISDFIAQRLPDDVTSKVKELQR 440 L+ + GT + ++ ++S D ++I+D + ++K + + Sbjct: 318 -NILTVEANKGTCIECAITTAIDDIKSNRHLDTNTIIIITDGEDHVNKN----QLKLMLK 372 Query: 441 VHQHRFHAVAMSAHGKPGIMRIFD 464 + R +V + + I D Sbjct: 373 KYNIRLISVMI-NGSNDDLKNISD 395 >UniRef50_Q09E12 Inter-alpha-inhibitor H4 heavy chain, putative n=1 Tax=Stigmatella aurantiaca DW4/3-1 RepID=Q09E12_STIAU Length = 540 Score = 127 bits (319), Expect = 1e-27, Method: Composition-based stats. Identities = 35/198 (17%), Positives = 69/198 (34%), Gaps = 12/198 (6%) Query: 290 LLTYRLHGESWREKVIERPVVHKDYDEQPRGPFIVCVDTSGSMGGFNEQCAKAFCLALMR 349 LLTY+ E + P E +DTSGSM G Q AK + Sbjct: 40 LLTYKQADEPGYFIALIAPKTEVSASEIAAKRVTFVIDTSGSMQGSRMQIAKDALKYCVT 99 Query: 350 IALAENRRCYIMLFSTEIV----RYELSGPQGIEQAIRFLS-QQFRGGTDLASCFRAIME 404 ++ ++ FST++ + + P+ I++A+ F+ + GGT + ++ Sbjct: 100 RLNPQD-TFNVVRFSTDVEALFPALKSAQPENIQKAVAFVEQLEAIGGTAIDEALVRGLQ 158 Query: 405 RLQSREWFDADAVVISDFIA--QRLPDDVTSKVKELQRVHQHRFHAVAMSAHGKPGIMRI 462 + + I+D + ++ + R + R + ++ Sbjct: 159 DNDGKSSAPHLLMFITDGQPTIGETDEGAIAQHAKDGRKAKTRLFTFGVGEDLNARLL-- 216 Query: 463 FDHIWRFDTGMRSRLLRR 480 D + D S +R Sbjct: 217 -DRLSS-DGAGTSDFVRD 232 >UniRef50_A6WMD3 Vault protein inter-alpha-trypsin domain protein n=11 Tax=Shewanella RepID=A6WMD3_SHEB8 Length = 772 Score = 126 bits (317), Expect = 2e-27, Method: Composition-based stats. Identities = 30/174 (17%), Positives = 65/174 (37%), Gaps = 14/174 (8%) Query: 298 ESWREKVIERPVVHKDYDEQPRGPFIVCVDTSGSMGGFNEQCAKAFCLALMRIALAENRR 357 +++ ++ P V K I+ +DTSGSM G + AK L ++ E+ Sbjct: 370 DNYSLVMVLPPKVEKSTQPSLPRELILVIDTSGSMAGDSIVQAKNALLYALKGLKPED-S 428 Query: 358 CYIMLFSTEIVRYELSG----PQGIEQAIRFLS-QQFRGGTDLASCFRAIM-----ERLQ 407 I+ F++ + + + +A +F+S Q GGT++A A + Sbjct: 429 FNIIEFNSSLSLLSATPLPATSSNLSRARQFVSRLQADGGTEMALALDAALPKSLGSVSP 488 Query: 408 SREWFDADAVVISDFIAQRLPDDVTSKVKELQRVHQHRFHAVAMSAHGKPGIMR 461 + ++D + ++ ++ + R V + + M+ Sbjct: 489 DAVQPLRQVIFMTDGSVGN-EQALFDLIR--YQIGESRLFTVGIGSAPNSHFMQ 539 >UniRef50_Q80UW6 Parp4 protein (Fragment) n=11 Tax=Eukaryota RepID=Q80UW6_MOUSE Length = 498 Score = 126 bits (316), Expect = 2e-27, Method: Composition-based stats. Identities = 49/276 (17%), Positives = 88/276 (31%), Gaps = 22/276 (7%) Query: 194 GDYQLIVKYGEFLNEQPE---LKRL-AEQLGRSREAKSIPRNDAQMETFRTMVREPATVP 249 +Q E L + E +K + AEQ + +P + + +R+ +T Sbjct: 124 APWQQDKALNENLQDTVETIRIKEIGAEQSFSLAMSIEMPYMIEFISSDTHELRQKSTDC 183 Query: 250 EQVDGLQQSDDILRLLPPELATLGITELEYEFYRRLVEKQLLTYRLHGESWREKVIERPV 309 + V + + L L + + EK+ V + + Sbjct: 184 KAVVSTVEGSSLDSGGFSLHIGLRDAYLPRMWVEKHPEKE--------SEACMLVFQPEL 235 Query: 310 VHKDYDEQPRGPFIVCVDTSGSMGGFNEQCAKAFCLALMRIALAENRRCYIMLFSTEIVR 369 D + + I+C+D S SM G AK L + + E + IM F T Sbjct: 236 ADVLPDLRGKNEVIICLDCSSSMEGVTFTQAKQVALYALSLLGEEQ-KVNIMQFGTGYKE 294 Query: 370 YELSGP--QGIEQAIRFLSQQF--RGGTDLASCFRAIMERLQSREWFDADAVVISDFIAQ 425 + A F+ G TD R + S + + ++ISD Q Sbjct: 295 LFSYPKCITDSKMATEFIMSAAPSMGNTDFWKVLRYLSLLYPSEGFRN--ILLISDGHLQ 352 Query: 426 RLPDDVTSKVKELQRVHQHRFHAVAMSAHGKPGIMR 461 + + +Q R A+ + I+R Sbjct: 353 SESLTLQLVKRNIQH---TRVFTCAVGSTANRHILR 385 >UniRef50_Q15NW6 Vault protein inter-alpha-trypsin n=1 Tax=Pseudoalteromonas atlantica T6c RepID=Q15NW6_PSEA6 Length = 701 Score = 126 bits (316), Expect = 2e-27, Method: Composition-based stats. Identities = 32/189 (16%), Positives = 62/189 (32%), Gaps = 25/189 (13%) Query: 294 RLHGESWREKVIERPVVHKDY--------DEQPRGPFIVCVDTSGSMGGFNEQCAKAFCL 345 G+ V+ P V Y + P + +DTSGSM G + AK Sbjct: 269 ETQGKYRYGLVMLTPPVQDAYHSTGGAVAQQMPSREVVFLLDTSGSMAGESIVQAKRAVD 328 Query: 346 ALMRIALAENRRCYIMLFSTEIVRYEL----SGPQGIEQAIRF-LSQQFRGGTDLASCFR 400 + E+ I+ F+ + + I++A + S GGT++A Sbjct: 329 FALTQLRPED-NVNIIQFNDAPQALWKRAMPATAKHIQRARNWVASLHADGGTEMAPALT 387 Query: 401 AIMERLQSRE--------WFDADAVVISDFIAQRLPDDVTSKVKELQRVHQHRFHAVAMS 452 + + V I+D + D + S ++ ++ +R + + Sbjct: 388 LALNKPSLHRDDSDLLGSHKLRQVVFITDG-SVSNEDALMSLIES--KLADNRLFTIGIG 444 Query: 453 AHGKPGIMR 461 + M Sbjct: 445 SAPNSYFMT 453 >UniRef50_Q2SQR4 Uncharacterized protein containing a von Willebrand factor type A (VWA) domain n=1 Tax=Hahella chejuensis KCTC 2396 RepID=Q2SQR4_HAHCH Length = 733 Score = 126 bits (315), Expect = 3e-27, Method: Composition-based stats. Identities = 33/174 (18%), Positives = 66/174 (37%), Gaps = 12/174 (6%) Query: 294 RLHGESWREKVIERPVVHKDYDEQPRGPFIVCVDTSGSMGGFNEQCAKAFCLALMRIALA 353 + G+ +++ P D I VDTSGSM G + Q A+ L + Sbjct: 330 EIVGDDVYAQLLLMPPQFSDEGLSLPRELIWVVDTSGSMEGVSIQQARDAVLQALDTLTP 389 Query: 354 ENRRCYIMLFSTEIVRY----ELSGPQGIEQAIRFLS-QQFRGGTDLASCFRAIMERLQS 408 +R ++ F++ + + + ++QA RF+ + GGT++A + Sbjct: 390 RDR-FNVIEFNSHARKLFPQAVPAQERALQQARRFVRGLKADGGTEIAEALDRALSDAAP 448 Query: 409 REWFDADAVVISDFIAQRLPDDVTSKVKEL-QRVHQHRFHAVAMSAHGKPGIMR 461 + V ++D + + K++ Q++ R V + MR Sbjct: 449 EGYVR-QVVFLTDGSVG----NELALFKQIDQQLGDSRLFTVGIGPSPNRFFMR 497 >UniRef50_UPI00016C377F protein containing a von Willebrand factor type A domain n=1 Tax=Gemmata obscuriglobus UQM 2246 RepID=UPI00016C377F Length = 821 Score = 126 bits (315), Expect = 3e-27, Method: Composition-based stats. Identities = 34/234 (14%), Positives = 74/234 (31%), Gaps = 14/234 (5%) Query: 239 RTMVREPATVPEQVDGLQQSDDILRLLPPELATLGITELEYEFYRRLVEKQL----LTYR 294 R V+ + V+ +++SD + + L + + FY +K + L Y+ Sbjct: 188 RHAVQNVYSPTHAVNTVRKSDKEVSVTFERKQALLDKDFQ-LFYGH-GDKDIGLSPLVYK 245 Query: 295 LHGESWREKVIERPVVHKDYDEQPRGPFIVCVDTSGSMGGFNEQCAKAFCLALMRIALAE 354 + + ++ ++ +DTS SM Q AK + E Sbjct: 246 PIQTEDGYFMFLISPQVEAEKKRVARDLVLVLDTSSSMSDIKMQQAKKAVKFCLSQLQPE 305 Query: 355 NRRCYIMLFSTEIV----RYELSGPQGIEQAIRFL-SQQFRGGTDLASCFRAIMERLQSR 409 +R ++ FST + + ++ A +++ + GGT + + S Sbjct: 306 DR-FGVVRFSTTVTKFRSELVAANTDYLDLATKWIDGLKTSGGTAIWPALNDALAMRSSD 364 Query: 410 EWFDADAVVISDFIAQRLPDDVTSKVKE--LQRVHQHRFHAVAMSAHGKPGIMR 461 V +D + VK + R + ++ Sbjct: 365 PSRPFTMVFFTDGQPTVDETNADKIVKNVLAKNTGNTRIFTFGVGDDVNAAMLD 418 >UniRef50_B2A702 von Willebrand factor type A n=1 Tax=Natranaerobius thermophilus JW/NM-WN-LF RepID=B2A702_NATTJ Length = 599 Score = 125 bits (314), Expect = 3e-27, Method: Composition-based stats. Identities = 60/412 (14%), Positives = 131/412 (31%), Gaps = 34/412 (8%) Query: 72 EEVMCYQQSQLLSTPQFIVQLPQILDLLHRLNSPWAEQARQLVDANSTITSALHTLFLQR 131 E+ +++ + L S + ++ D+N +S Sbjct: 167 MELRKIYNMKIMPLAKNSQTQEGEDSSLEDYTSSTDSKLKERSDSNREASSLSSDKLNDV 226 Query: 132 WRLSLIVQATTLNQQLLEEE-----REQLLSEVQER--MTLSGQLEPILADNNTAAGRLW 184 +++ + QQ +EE ++ L+ R +T + + + + Sbjct: 227 QQIAESEGKASQAQQTMEELAKGASKQNFLNSASRRKLLTYLKKNRIVKETKDKLYLTPY 286 Query: 185 DMSAGQLKRGDYQLIV-KYGEFLNEQPE-----LKRLAEQLGRSREAKSIPRNDAQMETF 238 +L + I + + P+ +K++ + + + F Sbjct: 287 GQELAELMKRHLPEIDARLRNMVRTMPKKTNHSIKKIMTRQDSGQRKSPFKERGVKKPDF 346 Query: 239 RTMVREPATVPEQVDGLQQSDDILRLLPPELATLGITELEYEFYRRLVEKQLLTYRLHGE 298 V E A G ++ L E L E L E+ T + + Sbjct: 347 GDYVEEIALPETITAGAKR-------LYRET-YLQNKVEESN-KAVLTEQNKQTEQNKQD 397 Query: 299 SWR---EKVIERPVVHKDYDEQPRGPFIVCVDTSGSMGGFNEQCAKAFCLALMRIALAEN 355 + + + + ++ +Q VD SGSMGG Q K F ++ L Sbjct: 398 FKQKSGLTLNPQDIRVRNKKKQTSMNVCFLVDASGSMGGRRMQEVKFFAEHVL---LKGR 454 Query: 356 RRCYIMLFSTEIVRYELSGPQGIEQAIRFL-SQQFRGGTDLASCFRAIMERLQSR--EWF 412 + I+ F + V E+ + ++ L + G T ++ + L+S + Sbjct: 455 DKIAILTFREDNVNVEIPFTRNWDKLRSGLNKIKAFGLTPMSKGIEMARKYLESEVGQQK 514 Query: 413 DADAVVISDF---IAQRLPDDVTSKVKELQRVHQHRFHAVAMSAHGKPGIMR 461 + V+I+D I+ D +K Q++ Q V + ++ Sbjct: 515 NTFLVLITDGLPTISDGGEDPFKETLKAAQKLSQTSIKFVCIGLEPNVKFLK 566 >UniRef50_A8IQV8 Predicted protein (Fragment) n=1 Tax=Chlamydomonas reinhardtii RepID=A8IQV8_CHLRE Length = 411 Score = 125 bits (314), Expect = 4e-27, Method: Composition-based stats. Identities = 48/221 (21%), Positives = 82/221 (37%), Gaps = 37/221 (16%) Query: 291 LTYRLHGESWREKVIERPVVHKDYDEQPRGPFIVCVDTSGSMGGFNEQCAKAFCLALMRI 350 + + ++ + GP I+C+DTSGSM G E AKA L +R Sbjct: 181 MAFDDLNCWLEDEPARVTSRMEIRPAAEMGPIILCLDTSGSMRGARETVAKALALECLRG 240 Query: 351 ALAENRRCYIMLFS--TEIVRYELS-GPQGIEQAIRFLSQQFRGGTDLASCFRAIMERLQ 407 A + R+CY+ FS E+ +LS ++Q + FLS F GGTD+ + + +ERL Sbjct: 241 AHRQRRQCYLYAFSGPNEVQELQLSVDVDSLDQLLAFLSCSFMGGTDVDAPLKLSLERLA 300 Query: 408 SREWFDADAVVISDFIAQRLPDDVT----------------------------------S 433 EW AD ++++D D + Sbjct: 301 KAEWAQADILMVTDGEIPNPDDKIIQTSSLPSRTTRPPPAAAAAAAAAAAAAAAAAAAPQ 360 Query: 434 KVKELQRVHQHRFHAVAMSAHGKPGIMRIFDHIWRFDTGMR 474 + H + +S+H + ++ + F + Sbjct: 361 AISRAHEEMGLEVHGLLVSSHVTEAMRKLCTDVHVFKSWSA 401 >UniRef50_Q09DT2 Inter-alpha-trypsin inhibitor family heavy chain-related protein-hypothetical secreted or membrane-associated protein containing vWFA domain n=1 Tax=Stigmatella aurantiaca DW4/3-1 RepID=Q09DT2_STIAU Length = 843 Score = 125 bits (313), Expect = 4e-27, Method: Composition-based stats. Identities = 33/177 (18%), Positives = 69/177 (38%), Gaps = 14/177 (7%) Query: 290 LLTYRLHGESWREKVIERPVVHKDYDEQPRGPFIVCVDTSGSMGGFNEQCAKAFCLALMR 349 L+T+RL + + P + R + VDTSGSM G + A+ +R Sbjct: 217 LVTHRLGEKPGTFALTVVPDLLGLATGPKRQEVVFVVDTSGSMEGESLPQAQGALRLCLR 276 Query: 350 IALAENRRCYIMLFSTEIVRY----ELSGPQGIEQAIRF-LSQQFRGGTDLASCFRAIME 404 E R I+ F T + + + +EQA R+ + + GGT+L A ++ Sbjct: 277 HL-REGDRFNIIAFDTSFQSFAPQPAVFTQKTLEQADRWVAALRANGGTELLQPMLAAVQ 335 Query: 405 RLQSREWFDADAVVISDFIAQRLPDDVTSKVKELQRVHQHRFHAVAMSAHGKPGIMR 461 V+++D + + + ++ + R ++ + + +++ Sbjct: 336 AAPEG-----VVVLLTDGQVGNEAEILQAVLRARKT---ARIYSFGIGTNVSDALLK 384 >UniRef50_UPI000174639B hypothetical protein VspiD_18615 n=1 Tax=Verrucomicrobium spinosum DSM 4136 RepID=UPI000174639B Length = 868 Score = 124 bits (312), Expect = 6e-27, Method: Composition-based stats. Identities = 59/356 (16%), Positives = 107/356 (30%), Gaps = 39/356 (10%) Query: 139 QATTLNQQLLEEEREQLLSEVQERMTLSGQLEPILADNNTAAGRLWDMSAGQLKRG---- 194 Q +L E QL EV+ M +G L+ ++ +S + Sbjct: 203 QFLPSQSRLHEGAALQLKVEVESTMDGAGLLKLFENGVEVERRKVKVVSGSTVTETFVRH 262 Query: 195 -DYQLIVKYGEFLNE--------QPELKRLAEQLGRSREA-------KSIPRNDAQMETF 238 D + I KY L E L + GR R + A + Sbjct: 263 PDTRNIYKYRAVLEGFAGDAIPANNEALTLVDVRGRLRLLYVEGDMNEGQYLVQAMAKEG 322 Query: 239 RTMVREPATV----PEQVDGLQQSDDILRLLPPELATLGITELEYEFYRRLVEKQLL--- 291 + P+++ G IL +P ++ +L ++ Sbjct: 323 IELELRAPNSIPNTPQELSG--FDGVILSDVPAHQVGETAMVAIRDYVDKLGGGFIMLGG 380 Query: 292 --TYRLHGESWREKVIERPVVHKDYDEQPR--GPFIVCVDTSGSMGGFNEQCAKAFCLAL 347 ++ + G PV K DE+ + + +D SGSM G + AK+ +A Sbjct: 381 PNSFGVGGYYRTPIEEVLPVRLKAPDEEEKQSSALALVIDRSGSMSGEKLEMAKSAAIAT 440 Query: 348 MRIALAENRRCYIMLFSTEIVRYELSGPQGIEQAI--RFLSQQFRGGTDLASCFRAIMER 405 + N + F +E A+ + GGT+L F Sbjct: 441 AEVLTR-NDSIGVYAFDSEAHVVVPMTRLTSSSAVAGQIAGLTSGGGTNLHPAFTEARNA 499 Query: 406 LQSREWFDADAVVISDFIAQRLPDDVTSKVKELQRVHQHRFHAVAMSAHGKPGIMR 461 LQ + ++++D + R VA+ G+++ Sbjct: 500 LQRTKAKIKHMIILTDGQTSGQG---YEALASQCRAEGVTISTVAIGDGAHVGLLQ 552 >UniRef50_A6CIG8 Putative uncharacterized protein n=1 Tax=Bacillus sp. SG-1 RepID=A6CIG8_9BACI Length = 931 Score = 124 bits (311), Expect = 7e-27, Method: Composition-based stats. Identities = 32/171 (18%), Positives = 64/171 (37%), Gaps = 6/171 (3%) Query: 298 ESWREKVIERPVVHKDYDEQPRGPFIVCVDTSGSMGGFNEQCAKAFCLALMRIALAENRR 357 ++ EK++ + K E P ++ +D SGSM G+ Q AK + L E Sbjct: 386 KTPIEKLLPVDMDLKGKKELPSLGMVIVLDRSGSMAGYKIQLAKEAAIRSAE-LLREKDT 444 Query: 358 CYIMLFSTEI-VRYELSGPQGIEQAIRFL-SQQFRGGTDLASCFRAIMERLQSREWFDAD 415 + F + + E+ I + GGT++ E+L E Sbjct: 445 LGFIAFDDRPWQIIDTEPIKDKEKVIEKINGLTSGGGTNIFPSLELAYEQLTPLELQRKH 504 Query: 416 AVVISDFIAQRLPDDVTSKVKELQRVHQHRFHAVAMSAHGKPGIM-RIFDH 465 ++++D + PD +T+ + + + VA+ ++ + D Sbjct: 505 IILLTDGQSATSPDYLTTI--QEGKENNITLSTVAIGEGSDSVLLEELSDE 553 >UniRef50_B1HPN0 Putative uncharacterized protein n=2 Tax=Bacillaceae RepID=B1HPN0_LYSSC Length = 825 Score = 124 bits (311), Expect = 8e-27, Method: Composition-based stats. Identities = 29/166 (17%), Positives = 60/166 (36%), Gaps = 6/166 (3%) Query: 298 ESWREKVIERPVVHKDYDEQPRGPFIVCVDTSGSMGGFNEQCAKAFCLALMRIALAENRR 357 ++ E ++ + K ++ P ++ +D SGSM G + AK + + E+ Sbjct: 345 KTPIETLLPVEMEIKGKEQLPSLGLVIVLDRSGSMSGSKLELAKEAAARSVEMLRDED-T 403 Query: 358 CYIMLFSTEI-VRYELSGPQGIEQAIR-FLSQQFRGGTDLASCFRAIMERLQSREWFDAD 415 + F E E+A+ LS GGT++ E L + Sbjct: 404 LGFIAFDDRPWEIIETGPLNNKEEAVDTILSVTPGGGTEIYGSLAKAYENLADMKLQRKH 463 Query: 416 AVVISDFIAQRLPDDVTSKVKELQRVHQHRFHAVAMSAHGKPGIMR 461 ++++D +Q D + E + + VA+ ++ Sbjct: 464 IILLTDGQSQPGNYD---DLIEQGKDNGITLSTVAIGQDADANLLE 506 Score = 51.7 bits (122), Expect = 6e-05, Method: Composition-based stats. Identities = 12/109 (11%), Positives = 38/109 (34%), Gaps = 9/109 (8%) Query: 316 EQPRGPFIVCVDTSGSMGGFNEQCAKAFCLALMRIALAENRR--CYIMLFSTEIVRYELS 373 + VD S SM G ++ + + ++ + FS+ + + Sbjct: 22 PIKEEQIVYLVDRSASMNGTEDEMVQ----FIQDSLQSKKDEQLAGLYSFSSTLQTEAIM 77 Query: 374 GPQGIEQAIRFLSQQFRGGTDLASCFRAIMERLQSREWFDADAVVISDF 422 + +++ +F + T++ + + ++ V+++D Sbjct: 78 T-KTLKEVPKFTEIKATDQTNIEQSLQLATGIIDPKKATR--LVLLTDG 123 >UniRef50_B9RR85 Inter-alpha-trypsin inhibitor heavy chain, putative n=1 Tax=Ricinus communis RepID=B9RR85_RICCO Length = 755 Score = 124 bits (311), Expect = 9e-27, Method: Composition-based stats. Identities = 27/180 (15%), Positives = 63/180 (35%), Gaps = 11/180 (6%) Query: 298 ESWREKVIERPVVHKDYDEQPRGPFIVCVDTSGSMGGFNEQCAKAFCLALMRIALAENRR 357 + + + + R + VD SGSM G + K + Sbjct: 303 DQRDMFYLYLFPGDQPNMKVFRKEIVFIVDISGSMEGKPLEGMKNAMSGALAKLNP-KDS 361 Query: 358 CYIMLFSTEIVR----YELSGPQGIEQAIRFLSQQ--FRGGTDLASCFRAIMERLQSREW 411 I+ F+ E EL+ + +E+A+ +++ GGT+++ ME + + + Sbjct: 362 FNIIAFNGETYLFSSLMELATEKTVERAVEWMNLNFIAGGGTNISVPLNQAMEMVSNTQG 421 Query: 412 FDADAVVISDFIAQRLPDDVTSKVKELQRVHQH---RFHAVAMSAHGKPGIMRIFDHIWR 468 +++D + + +K+ R R + + + +R+ + R Sbjct: 422 SLPVIFLVTDGAVED-ERHICDSMKKYVRGKGAICPRIYTFGIGTYCNHYFLRMLATVCR 480 >UniRef50_D2R2E3 TROVE domain protein n=1 Tax=Pirellula staleyi DSM 6068 RepID=D2R2E3_9PLAN Length = 531 Score = 124 bits (310), Expect = 1e-26, Method: Composition-based stats. Identities = 51/392 (13%), Positives = 128/392 (32%), Gaps = 31/392 (7%) Query: 80 SQLLSTPQFIVQLPQILDLLHRLNSPWAEQA---RQLVDANSTITSALHTLFLQR-WRLS 135 + ++ +P +L L + SP + ++++D+ + + + + R S Sbjct: 77 AIYSRDRAYLKDVPALLVALLTVKSPELVTSELFQRVIDSPKMLRNFVQIIRSGVVGRKS 136 Query: 136 LIVQATTLNQQLLEEEREQLL-----SEVQERMTLSGQLEPILADNNTAAGRLW----DM 186 L + L ++ L+ +Q L + + P+ + A W + Sbjct: 137 LGSRPKRLVREWLDARSDQALFVGSVGNDPSLGDIVKMVHPVPRTPSREALYAWLIGREY 196 Query: 187 SAGQLKR--GDYQLIVKYGEFLNEQPELKRLAEQLGRSREAKSIPRNDAQMETFRTMVRE 244 G L +Y+ I + + + + EL + + + + Sbjct: 197 EEGALPPIVQEYERIKR--QVIRGKDELPDVPFSM-LTHLPLTPSDWKRIARNASWQTTR 253 Query: 245 PATVPEQVDGLQQSDDILRLLPPELATLGITELEYEFYRRLVEKQLLTYRLHGESWREKV 304 + G+ Q +++R + L + F +L+ L + + Sbjct: 254 MNLNTFERHGVLQDQELVRCIANRLRNPRAIQQARVFPYQLMAAYLNMNETLPAKIKAAL 313 Query: 305 IERPVVHKDYDEQPRGPFIVCVDTSGSM-------GGFNEQCAK--AFCLALMRIALAEN 355 E + + G +V VD SGSM G + + + +N Sbjct: 314 GEAMELAISNVPRIEGKVLVMVDVSGSMRSPATGNRGSATSKVRCIDVAALIAASIVRKN 373 Query: 356 RRCYIMLFSTEIVRYELSGPQGIEQAIRFLSQQFRGGTDLASCFRAIMERLQSREWFDAD 415 ++ FS +++ +L Q + + + L+ GT+ ++ +R+ + Sbjct: 374 PEAEVIPFSDDVILVKLDRQQPVMKQAQQLASLPSVGTNCSAALAHA----NARQLKASM 429 Query: 416 AVVISDFIAQRLPDDVTSKVKELQRVHQHRFH 447 + +SD + + + + Q + Sbjct: 430 VIYVSDNESWIDSQSQWKGHRATETMRQWQLF 461 >UniRef50_A3QDW1 Vault protein inter-alpha-trypsin domain protein n=1 Tax=Shewanella loihica PV-4 RepID=A3QDW1_SHELP Length = 776 Score = 123 bits (309), Expect = 1e-26, Method: Composition-based stats. Identities = 32/182 (17%), Positives = 60/182 (32%), Gaps = 19/182 (10%) Query: 294 RLHGESWREKVIERPVVHKDYDEQPRGPFIVCVDTSGSMGGFNEQCAKAFCLALMRIALA 353 + V+ P K PR + +DTSGSM G + AK+ L + + Sbjct: 377 DKEAKDSYALVMLMPPQDKARVRLPR-ELTLVIDTSGSMTGDSIAQAKSAILNALAGLGS 435 Query: 354 ENRRCYIMLFSTEIVRYEL----SGPQGIEQAIRFL-SQQFRGGTDLASCFRAIMERLQS 408 ++ ++ F + + + + +A F+ S + GGT++A + + +S Sbjct: 436 QD-TFNVIAFDSSVRSLSPVALSATAANLGKANLFVQSLEADGGTEMAPALLRALSQPES 494 Query: 409 ---------REWFDADAVVISDFIAQRLPDDVTSKVKELQRVHQHRFHAVAMSAHGKPGI 459 + V I+D + R R V + A Sbjct: 495 GVSSISSAVKPERLKQVVFITDGAVGNEASLFALIAANIGRQ---RLFTVGIGAAPNGYF 551 Query: 460 MR 461 M Sbjct: 552 ME 553 >UniRef50_A6G7V2 von Willebrand factor, type A n=1 Tax=Plesiocystis pacifica SIR-1 RepID=A6G7V2_9DELT Length = 820 Score = 123 bits (308), Expect = 2e-26, Method: Composition-based stats. Identities = 36/170 (21%), Positives = 68/170 (40%), Gaps = 10/170 (5%) Query: 297 GESWREKVIERPVVHKDYDEQPRGPFIVCVDTSGSMGGFNEQCAKAFCLALMRIALAENR 356 R + P+ IV +DTSGSM G A+A AL+R +L + Sbjct: 279 NSYGRLVLTPPPIEPGREVSAVPRDLIVLLDTSGSMRGEPLAHAQAVTEALIR-SLRDRD 337 Query: 357 RCYIMLFSTEIVRYELSG----PQGIEQAIRFL-SQQFRGGTDLASCFRAIMERLQSREW 411 R ++ FS+ + R+ + E+A+R++ + + GGT + A + L+ Sbjct: 338 RLELVEFSSRVRRWSQAPASMSAAKREEALRWVGALRASGGTHMRDGILAALASLRPEAQ 397 Query: 412 FDADAVVISDFIAQRLPDDVTSKVKELQRVHQHRFHAVAMSAHGKPGIMR 461 ++I+D + + V + R R H + + + + R Sbjct: 398 R--QILLITDGLIAFESEIVQA--ARQHRPPGCRVHTLGIGSSVNRSLTR 443 >UniRef50_B4X134 Vault protein inter-alpha-trypsin n=1 Tax=Alcanivorax sp. DG881 RepID=B4X134_9GAMM Length = 657 Score = 122 bits (307), Expect = 2e-26, Method: Composition-based stats. Identities = 31/176 (17%), Positives = 60/176 (34%), Gaps = 12/176 (6%) Query: 291 LTYRLHGESWREKVIERPVVHKDYDEQPRGPFIVCVDTSGSMGGFNEQCAKAFCLALMRI 350 + GE + ++ P + + +D+SGSMGG + AKA ++ Sbjct: 272 FHEEIDGEHYALLMVVPPKTGQVTALPRET--LFIIDSSGSMGGAPMRQAKASLHLALQR 329 Query: 351 ALAENRRCYIMLFSTEIVRYELSG----PQGIEQAIRFL-SQQFRGGTDLASCFRAIMER 405 +R I F ++ + +QA F+ Q GGT + A + + Sbjct: 330 LKPGDR-FNITDFDSQHTLLFETPVTVSDNSRQQAQDFVDGLQASGGTHMLPALSATLSQ 388 Query: 406 LQSREWFDADAVVISDFIAQRLPDDVTSKVKELQRVHQHRFHAVAMSAHGKPGIMR 461 S + + I+D + ++L + R V + + M Sbjct: 389 PASDGYLR-QVIFITDGAVGNESGIFRALHQQL---GEARLFTVGIGSAPNSHFMT 440 >UniRef50_A7HHW8 Vault protein inter-alpha-trypsin domain protein n=1 Tax=Anaeromyxobacter sp. Fw109-5 RepID=A7HHW8_ANADF Length = 1362 Score = 122 bits (307), Expect = 2e-26, Method: Composition-based stats. Identities = 32/176 (18%), Positives = 60/176 (34%), Gaps = 11/176 (6%) Query: 291 LTYRLHGESWREKVIERPVVHKDYDEQPRGPFIVCVDTSGSMGGFNEQCAKAFCLALMRI 350 L R GE + ++ P +P+ + VD SGSM G +A + Sbjct: 351 LVQREKGEDFLMLFVQPPAGVAPALVRPK-ELVFLVDKSGSMMGAPFDRVRALVARALD- 408 Query: 351 ALAENRRCYIMLFSTEIVRYELSG----PQGIEQAIRFL-SQQFRGGTDLASCFRAIMER 405 A+ + ++ F + P I +A +L S + GGT++ RA + Sbjct: 409 AMGPDDTFQVVAFDGSAQAMSEAPLPATPSAIARAKEWLASLEGGGGTEMLEGVRAALS- 467 Query: 406 LQSREWFDADAVVISDFIAQRLPDDVTSKVKELQRVHQHRFHAVAMSAHGKPGIMR 461 V +D P+ + V+ L + R + + ++ Sbjct: 468 PPEDPRRLRMVVFCTDGFIGNEPE-IIEAVEAL--RGRARVFGFGIGSSVNRYLVE 520 >UniRef50_Q1YZ74 Inter-alpha-trypsin inhibitor domain protein n=1 Tax=Photobacterium profundum 3TCK RepID=Q1YZ74_PHOPR Length = 714 Score = 122 bits (306), Expect = 3e-26, Method: Composition-based stats. Identities = 33/186 (17%), Positives = 64/186 (34%), Gaps = 18/186 (9%) Query: 290 LLTYRLHGESWREKVIERPVVHKDYDEQPRGP-----FIVCVDTSGSMGGFNEQCAKAFC 344 L T + G+ + + P V+ + +D SGSM G + + AK Sbjct: 298 LFTQHVEGQGYGLLLTMPPQVNHQVNSTTSSALFHQSVTFVLDISGSMYGESIEQAKQAL 357 Query: 345 LALMRIALAENRRCYIMLFSTEI----VRYELSGPQGIEQAIRFL-SQQFRGGTDLASCF 399 ++ E+ I+ F+ E + I +A+RF+ GGT++A+ Sbjct: 358 RYGLQQLQPED-SFNIVTFNHEAMLYSEQLLPVTSSTITRALRFVDGLDADGGTEMAAAL 416 Query: 400 RAIMERLQSREWFDA----DAVVISDFIAQRLPDDVTSKVKELQRVHQHRFHAVAMSAHG 455 +A + V I+D + ++ Q++ R V + + Sbjct: 417 KAAFSIKTHDQLNSTRWLNQIVFITDGSVGNES-ALFDLIE--QQLVDRRLFTVGIGSAP 473 Query: 456 KPGIMR 461 M Sbjct: 474 NSYFMT 479 >UniRef50_Q6A0B1 MKIAA0177 protein (Fragment) n=4 Tax=Murinae RepID=Q6A0B1_MOUSE Length = 1269 Score = 122 bits (306), Expect = 3e-26, Method: Composition-based stats. Identities = 49/276 (17%), Positives = 88/276 (31%), Gaps = 22/276 (7%) Query: 194 GDYQLIVKYGEFLNEQPE---LKRL-AEQLGRSREAKSIPRNDAQMETFRTMVREPATVP 249 +Q E L + E +K + AEQ + +P + + +R+ +T Sbjct: 761 APWQQDKALNENLQDTVETIRIKEIGAEQSFSLAMSIEMPYMIEFISSDTHELRQKSTDC 820 Query: 250 EQVDGLQQSDDILRLLPPELATLGITELEYEFYRRLVEKQLLTYRLHGESWREKVIERPV 309 + V + + L L + + EK+ V + + Sbjct: 821 KAVVSTVEGSSLDSGGFSLHIGLRDAYLPRMWVEKHPEKE--------SEACMLVFQPEL 872 Query: 310 VHKDYDEQPRGPFIVCVDTSGSMGGFNEQCAKAFCLALMRIALAENRRCYIMLFSTEIVR 369 D + + I+C+D S SM G AK L + + E + IM F T Sbjct: 873 ADVLPDLRGKNEVIICLDCSSSMEGVTFTQAKQVALYALSLLGEEQ-KVNIMQFGTGYKE 931 Query: 370 YELSGP--QGIEQAIRFLSQQF--RGGTDLASCFRAIMERLQSREWFDADAVVISDFIAQ 425 + A F+ G TD R + S + + ++ISD Q Sbjct: 932 LFSYPKCITDSKMATEFIMSAAPSMGNTDFWKVLRYLSLLYPSEGFRN--ILLISDGHLQ 989 Query: 426 RLPDDVTSKVKELQRVHQHRFHAVAMSAHGKPGIMR 461 + + +Q R A+ + I+R Sbjct: 990 SESLTLQLVKRNIQH---TRVFTCAVGSTANRHILR 1022 >UniRef50_B2HK18 Conserved membrane protein n=3 Tax=Mycobacterium RepID=B2HK18_MYCMM Length = 983 Score = 122 bits (305), Expect = 3e-26, Method: Composition-based stats. Identities = 36/202 (17%), Positives = 69/202 (34%), Gaps = 20/202 (9%) Query: 273 GITELEYEFYRRLVEKQL-LTYRLHGESWREKVIERPVVHKDYDEQPRGPFIVCVDTSGS 331 L ++ + + L L G+ ++ P ++ +D S S Sbjct: 253 RDFVLRLDYDAQELASSLVLVPDADGDEGTYQLTVLPPAGVAAPRPRH--LVLVLDRSRS 310 Query: 332 MGGFNEQCAKAFCLALMRIALAENRRCYIMLFSTEIVRYELSGPQGIEQAIR-------- 383 M G+ A+ ++ AL + R ++ F I Y + P G+ +A Sbjct: 311 MAGWKMTAARRAASRIVD-ALTSDDRFAVLTFDDGI-EYPVGLPAGLTEASDRHRYRAVE 368 Query: 384 -FLSQQFRGGTDLASCFRAIMERLQSREWFDAD---AVVISDFIAQRLPDDVTSKVKELQ 439 + RG T++ + R + L + D D ++ISD + +L Sbjct: 369 HLARVEARGDTEMLAPLRRALALLGREQVADTDDAVLILISDGQVGNEDQLLQELSGDLG 428 Query: 440 RVHQHRFHAVAMSAHGKPGIMR 461 R R H + + G +R Sbjct: 429 R---VRLHTIGVDEAVNAGFLR 447 >UniRef50_B0CG18 von Willebrand factor type A domain protein, putative n=5 Tax=Cyanobacteria RepID=B0CG18_ACAM1 Length = 708 Score = 122 bits (305), Expect = 4e-26, Method: Composition-based stats. Identities = 44/451 (9%), Positives = 118/451 (26%), Gaps = 47/451 (10%) Query: 29 SPQLAVFFEKFPRLKAAITDDVPRWREALRSRLKDARVPPELTEEVMCYQQSQLL----- 83 SP F + K + + L V T ++ Sbjct: 63 SPSTGGLFAEVDGKKLSFPLKQTDVEANISGNLSRVEVKQTFTNPYDRPLEAIYQFPLPE 122 Query: 84 ----STPQFIVQLPQILDLLHRLNSPWAEQARQLVDANSTITSALHTLFLQRWRLSLIVQ 139 + + I ++ ++A+Q+ + L++ R +L Sbjct: 123 DAAVDDMEIRIGNRIIRGVIKER-----QEAKQIYETAKQEGKTAA--LLEQERANL--- 172 Query: 140 ATTLNQQLLEEEREQLLSEVQERM--TLSGQLEPILADNNTAAGRLWDMSAGQLKRGDYQ 197 + + + G + + Sbjct: 173 ---------------FTQSLANIVPGETIEVVIRYTNSLEFEGGDYEFVFPTVVGPRYIP 217 Query: 198 -LIVKYGEFLNEQPELKRLAEQLGRSREAKSIPRNDAQMETFRTMVREPATVPEQVDGLQ 256 + + ++ L + + +R + + + Sbjct: 218 GDQIDAAGNTTRVADAAKITPPLLPPSQRSGNDISITVNLDAGVPIRNLRSPSHPILTSK 277 Query: 257 QSDDILRLLPPELATL-GITELEYEFYRRLVEKQLLTYRLHGESWREKVIERPVVHKDYD 315 + + L + L Y+ + + LLT + + +K Sbjct: 278 KGEQTQVKLANQTTIPNKDLILRYQVASKQTQATLLTQSDQRGGHFATYLIPALKYKSNQ 337 Query: 316 EQPRGPFIVCVDTSGSMGGFNEQCAKAFCLALMRIALAENRRCYIMLFSTEIVRYELSG- 374 P+ + +DTSGS G ++ + N I+ FS + Sbjct: 338 IVPK-DVVFLIDTSGSQSGPPIVQSRKLMTQFLDKLNP-NDTFSIINFSNTTSKLSPKPL 395 Query: 375 ---PQGIEQAIRFLS-QQFRGGTDLASCFRAIMERLQSREWFDADAVVISDFIAQRLPDD 430 P ++A+ ++ GGT+L + + + + V+++D + + Sbjct: 396 ANTPANRKKALEYIKKLDANGGTELMNGINTVAAFPPAPDGRLRSVVLLTDGLIGD-DET 454 Query: 431 VTSKVKELQRVHQHRFHAVAMSAHGKPGIMR 461 + + V++ + +R + + ++ Sbjct: 455 IIAAVRDRLKP-GNRIYPFGVGFSTNRFLLD 484 >UniRef50_A0LHW4 Vault protein inter-alpha-trypsin domain protein n=3 Tax=Deltaproteobacteria RepID=A0LHW4_SYNFM Length = 812 Score = 122 bits (305), Expect = 4e-26, Method: Composition-based stats. Identities = 33/194 (17%), Positives = 68/194 (35%), Gaps = 11/194 (5%) Query: 273 GITELEYEFYRRLVEKQLLTYRLHGESWREKVIERPVVHKDYDEQPRGPFIVCVDTSGSM 332 L+Y ++ LL YR E++ ++ P + R +I VD SGSM Sbjct: 286 RDFVLKYRLSGESIDSGLLLYRGKDENFFLLTVQPPKRVVEAAIPAR-EYIFIVDVSGSM 344 Query: 333 GGFNEQCAKAFCLALMRIALAENRRCYIMLFSTE----IVRYELSGPQGIEQAIRFL-SQ 387 GF + +K L+ + +MLFS + R + + +A+ + + Sbjct: 345 HGFPLEISKRLLTDLIGGLKPTDC-FNVMLFSGDSTVMAERSVPASADNVRRAVEMIGRR 403 Query: 388 QFRGGTDLASCFRAIMERLQSREWFDADAVVISDFIAQRLPDDVTSKVKELQRVHQHRFH 447 Q GGT+L + + + + + +D + ++ + F Sbjct: 404 QGGGGTELLPALKKALSLPRKEGVSRSMVIA-TDGFVTVEEEAF-ELIRS--HIGDANFF 459 Query: 448 AVAMSAHGKPGIMR 461 + ++ Sbjct: 460 PFGIGTSVNRMLIE 473 >UniRef50_D1A557 Vault protein inter-alpha-trypsin domain protein n=2 Tax=Streptosporangineae RepID=D1A557_THECD Length = 795 Score = 122 bits (305), Expect = 4e-26, Method: Composition-based stats. Identities = 41/202 (20%), Positives = 69/202 (34%), Gaps = 23/202 (11%) Query: 275 TELEYEFYRRLVEKQ--------LLTYRLHGESWREKVIERPVVHKDYDEQPRGPFIVCV 326 L+ +F RL + LT GES + P + +PR ++ + Sbjct: 249 ERLDRDFVLRLAYGRPEQAAASVTLTPDAEGESGTFTLTVLPPS-ERCAPRPR-DVVILL 306 Query: 327 DTSGSMGGFNEQCAKAFCLALMRIALAENRRCYIMLFSTEIVR--------YELSGPQGI 378 D SGSM G+ A+ ++ +R ++ F + R + Sbjct: 307 DRSGSMHGWKMVAARRAAARIVDTLTGRDR-FAVLSFDDMVERPAGLDGGLSPATDRNRF 365 Query: 379 EQAIRFLSQQFRGGTDLASCFRAIMERLQSREWFDADAVVISDFIAQRLPDDVTSKVKEL 438 Q RGGT+LA+ R L D V+I+D D + + + Sbjct: 366 RAVEHLAGLQARGGTELAAPLREGAALLDDAG-RDRVLVLITDGQVGN-EDQLLALIDPF 423 Query: 439 QRVHQHRFHAVAMSAHGKPGIM 460 + R HAV + G + Sbjct: 424 L--NGLRIHAVGIDQAVNAGFL 443 >UniRef50_A1S752 Inter-alpha-trypsin inhibitor domain protein n=1 Tax=Shewanella amazonensis SB2B RepID=A1S752_SHEAM Length = 753 Score = 122 bits (305), Expect = 4e-26, Method: Composition-based stats. Identities = 31/173 (17%), Positives = 65/173 (37%), Gaps = 13/173 (7%) Query: 296 HGESWREKVIERPVVHKDYDEQPRGPFIVCVDTSGSMGGFNEQCAKAFCLALMRIALAEN 355 G + P + R ++ +DTSGSM G + A++ + + ++ Sbjct: 374 QGNYSHGLLTFMPPQPNLANRLAR-ELVLVIDTSGSMAGDSMVQARSALIHALGGLGPQD 432 Query: 356 RRCYIMLFSTEIVRY----ELSGPQGIEQAIRFL-SQQFRGGTDLASCFRAIME---RLQ 407 I+ FS++ + + + A +F+ S + GGT++AS ++ + Sbjct: 433 -SFNIIAFSSDARPLWPDAKPATAFNLGAAQQFVRSLEADGGTEMASALELALKTPSVVD 491 Query: 408 SREWFDADAVVISDFIAQRLPDDVTSKVKELQRVHQHRFHAVAMSAHGKPGIM 460 + I+D D + + ++ R+ R VA+ A M Sbjct: 492 EDTKRLRQVLFITDGAVN-GEDALFNLIER--RLGTSRLFPVAIGAAPNGYFM 541 >UniRef50_A1U6Y4 Vault protein inter-alpha-trypsin domain protein n=2 Tax=Marinobacter RepID=A1U6Y4_MARAV Length = 712 Score = 121 bits (304), Expect = 4e-26, Method: Composition-based stats. Identities = 27/178 (15%), Positives = 61/178 (34%), Gaps = 13/178 (7%) Query: 290 LLTYRLHGESWREKVIERPVVHKDYDEQPRGPFIVCVDTSGSMGGFNEQCAKAFCLALMR 349 L + GE + ++ P + + +DTSGSM G + + A++ L + Sbjct: 327 LFRQQWQGEDFLMAMVMPPATTGQVLRR---ELLFVIDTSGSMAGESIRQARSALLRGLD 383 Query: 350 IALAENRRCYIMLFSTEIVRY----ELSGPQGIEQAIRFLS-QQFRGGTDLASCFRAIME 404 +R ++ F+++ + + +A ++ GGT++A M Sbjct: 384 TLRPGDR-FNVIQFNSQAHALYTQPVPANGHYLARARDYVQDLTADGGTEMAGALSLAMG 442 Query: 405 RL-QSREWFDADAVVISDFIAQRLPDDVTSKVKELQRVHQHRFHAVAMSAHGKPGIMR 461 V ++D + +++ R VA+ + +R Sbjct: 443 MDGSESSGHVQQMVFMTDGAVGNES-ALFDQIRTGL--GNRRLFTVAIGSAPNMHFLR 497 >UniRef50_Q083T9 Vault protein inter-alpha-trypsin domain protein n=1 Tax=Shewanella frigidimarina NCIMB 400 RepID=Q083T9_SHEFN Length = 722 Score = 121 bits (304), Expect = 5e-26, Method: Composition-based stats. Identities = 29/175 (16%), Positives = 64/175 (36%), Gaps = 17/175 (9%) Query: 300 WREKVIERPVVHKDYDEQPRGPFIVCVDTSGSMGGFNEQCAKAFCLALMRIALAENRRCY 359 + ++ P V I+ +DTSGSM G + AK + L + Sbjct: 324 YSLVMLMPPSVEVSEQHLIARELILVIDTSGSMSGQSITQAKQALQFALA-GLRDIDSFN 382 Query: 360 IMLFSTEIVRYELS----GPQGIEQAIRFL-SQQFRGGTDLASCFRAIM--------ERL 406 I+ F++++ + + I +A RF+ S GGT++ S + + ++ Sbjct: 383 IIEFNSDVTMLSATPLSANSRNIGKANRFIQSLDADGGTEMRSALQTALVDSVQQDSDQT 442 Query: 407 QSREWFDADAVVISDFIAQRLPDDVTSKVKELQRVHQHRFHAVAMSAHGKPGIMR 461 + + ++D ++ + + ++ R V + + MR Sbjct: 443 DAHSEMLRQVIFMTDGAVGN-EHELYQLIND--QLGDSRLFTVGIGSAPNSDFMR 494 >UniRef50_C7FPD9 Uncharacterized protein n=2 Tax=environmental samples RepID=C7FPD9_9BACT Length = 836 Score = 121 bits (304), Expect = 5e-26, Method: Composition-based stats. Identities = 33/208 (15%), Positives = 69/208 (33%), Gaps = 18/208 (8%) Query: 267 PELATLGIT--------ELEYEFYRRLVEKQLLTYRLHGESWREKVIERPVVHKDYDEQP 318 +L E+ Y ++ +R G++ + P + D D+ Sbjct: 248 STYVSLADPDALPNRTIEVRYMLASTQPVARVSAHRKPGQTGTFALTLEPPLKVDPDQVT 307 Query: 319 RGPFIVCVDTSGSMGGFNEQCAKAFCLALMRIALAENRRCYIMLFSTEIVRYELSG---- 374 VDTSGSM G A+A + + I+ F++ + Sbjct: 308 PKELFFVVDTSGSMMGEPLDKARAAMRYALERMGP-DDTFQIIDFASGVASLAPRPLPNT 366 Query: 375 PQGIEQAIRFLS-QQFRGGTDLASCFRAIMERLQSREWFDADAVVISDFIAQRLPDDVTS 433 P+ + + + F+ +GGT++ + RA ++ ++D D+ Sbjct: 367 PENLRKGLAFIEAMTSQGGTEMLAGIRAALDGPTPPG-RLRIVAFMTDGYIGN-DGDILD 424 Query: 434 KVKELQRVHQHRFHAVAMSAHGKPGIMR 461 + Q V Q R + + ++ Sbjct: 425 YID--QSVGQARLFSFGVGEDVNRYLLE 450 >UniRef50_A8U1E2 Putative uncharacterized protein n=1 Tax=alpha proteobacterium BAL199 RepID=A8U1E2_9PROT Length = 683 Score = 121 bits (303), Expect = 6e-26, Method: Composition-based stats. Identities = 27/172 (15%), Positives = 57/172 (33%), Gaps = 18/172 (10%) Query: 321 PFIVCVDTSGSMGGFNEQCAKAFCLALMRIALAENRRCYIMLFSTEIVRYELSGPQ---- 376 I +D SGSM G + AKA + + N ++ F+ + + + + Sbjct: 329 EVIFVIDVSGSMKGEPLRAAKASLTSGIEGLGR-NDTFNVVAFNNKAAAFYDAPVRASGK 387 Query: 377 -GIEQAIRFLSQQFRGGTDLASCFRAIMERLQSREWFDADAVVISDFIAQRLPDDVTSKV 435 + GGT++A+ F ++ + V I+D + + Sbjct: 388 FHRAALKVIDGLKAGGGTEMAAAFELALQ-MPGDPDRLQQVVFITDGAV----SNEAALF 442 Query: 436 KELQRVHQH-RFHAVAMSAHGKPGIMRIF------DHIWRFDTGMRSRLLRR 480 +++ R V + + M + + DT R++R Sbjct: 443 NQIKGELGARRLFTVGIGSAPNTFFMEEAARFGRGTYTYIGDTSSAERVMRD 494 >UniRef50_B4VT64 Vault protein inter-alpha-trypsin n=9 Tax=Cyanobacteria RepID=B4VT64_9CYAN Length = 1037 Score = 121 bits (302), Expect = 8e-26, Method: Composition-based stats. Identities = 54/458 (11%), Positives = 138/458 (30%), Gaps = 35/458 (7%) Query: 28 ASPQLAVFFEKFPRLKAAITDDVPRWREALRSRLKDARVPPELTEEVMCYQQSQL---LS 84 +S Q F + + + + + V T ++ L Sbjct: 362 SSTQTGGLFATINGQQQVFPLEHTQVATKITGNVSRVEVTQTFTNPFDHPLEAVYKFPLP 421 Query: 85 TPQFIVQLP-QILDLLHRLNSPWAEQARQLVDANSTITSA---LHTLFLQRWRLSLIVQA 140 + + Q+ D + R ++A+Q+ + L + SL Sbjct: 422 DEAAVDDMEIQLDDRIIRGVIKKRQEAKQIYEKAKKAGKTAGLLEQERANVFTQSLANIK 481 Query: 141 TTLNQQL--LEEEREQLLSEVQE---------RMTLSGQLEPILADNNTAAG-RLWDMSA 188 + Q+ + + E R T + A + G + + S+ Sbjct: 482 PGESIQVTIRYTDSLKFEGGDYEFAFPMVVAPRYTAGNSVGSAKAPTTNSVGSKHFSASS 541 Query: 189 GQLKRGDYQLIVKYGEFLNEQPELKRLAEQLGRSREAKSIPRNDAQMETFRTMVREPATV 248 + + L+ P + GRS + ++ V Sbjct: 542 AKAPTTNKTLMTNVAYAAEVNPPI----APPGRSGHDIDVTVEIDAGVPISSVRSPSHPV 597 Query: 249 PEQVDGLQQSDDILRLLPPELATLGITELEYEFYRRLVEKQLLTYRLHGESWREKVIERP 308 Q ++ E L Y+ + +LT + Sbjct: 598 TTQQTSSTVRVELAD---QETIPNKDLILRYQVAGADTQATVLTQADERGGHFATYLIPA 654 Query: 309 VVHKDYDEQPRGPFIVCVDTSGSMGGFNEQCAKAFCLALMRIALAENRRCYIMLFSTEIV 368 + ++ + P+ + VDTSGS G +K ++ ++ I+ F+ Sbjct: 655 IEYQQNEIVPK-DVVFLVDTSGSQSGSPIVQSKELMRQFIQGLNPQD-TFTIIDFANSTT 712 Query: 369 RYE----LSGPQGIEQAIRFL-SQQFRGGTDLASCFRAIMERLQSREWFDADAVVISDFI 423 + + PQ ++A+ ++ GGT+L + ++ + V+++D + Sbjct: 713 QLSDKPLANTPQNRKKALNYINRLDANGGTELMNGIDTVLNFPAAPAGRLRSVVLLTDGL 772 Query: 424 AQRLPDDVTSKVKELQRVHQHRFHAVAMSAHGKPGIMR 461 + + +++++ + +R ++ + + ++ Sbjct: 773 IGD-DEQIIAEIRDRLKP-GNRLYSFGVGSSTNRFLIE 808 >UniRef50_A8M9M1 von Willebrand factor type A n=1 Tax=Caldivirga maquilingensis IC-167 RepID=A8M9M1_CALMQ Length = 474 Score = 120 bits (300), Expect = 1e-25, Method: Composition-based stats. Identities = 54/394 (13%), Positives = 128/394 (32%), Gaps = 58/394 (14%) Query: 103 NSPWAEQARQLVDANSTITSALHTLFLQRWRLSLIVQATTLNQ-QLLEEEREQLLSEVQE 161 E+A + + + + + L+ +Q + ++ ++ Sbjct: 79 KREILEEAFKYALSLVNDLNMRNDDEINTDEEGLVKGNVGKSQLSPNNDSKDGGTGQL-- 136 Query: 162 RMTLSGQLEPILADNNTAAGRLWDMSAGQLKRGDYQLIVKY--------GEFLNEQPELK 213 ++ +L ++ A ++D+ G + ++ + + + L+ Sbjct: 137 ---INQELGTGNESSDDVANVIYDVFYGSVGTMNFINLAQLLNMFVNPMAHITEKVKVLR 193 Query: 214 RLAEQLGR--------------SREAKSIPRNDAQMETFR-TMVREPATVPEQVDGLQQS 258 +LA L + ++ R R + E + P + G+++ Sbjct: 194 KLARYLASYGLLPHQGKGGSRVFKALDNVAREPTIGNALRVSRFTEHSNYPTYITGVREY 253 Query: 259 DDILRLLPPELATLGITELEYEFYRRLVEKQLLTYRLHGESWREKVIERPVVHKDYDEQP 318 +G + ++K + + + + R +V ++Y + Sbjct: 254 R------------IGDPAYRID-----LDKTSMNM-VRKTFLNKPMSTRDIVVREYADVK 295 Query: 319 RGPFIVCVDTSGSM---GGF--NEQCAKAFCLALMRIALAENRRCYIMLFS--TEIVRYE 371 ++C+DTSGSM G AK + +R N R ++LF+ +I+ Sbjct: 296 LMDIVLCLDTSGSMKEFSGAYMKMDIAKEAIVKYIRYLSRTNDRLSMVLFNFRADILWGP 355 Query: 372 LSGPQGIEQAIRFLS-QQFRGGTDLASCFRAIMERLQSREWFDADAVVISDFIAQRLPDD 430 S + I + GGT++A+ L + + + I+D Sbjct: 356 HSVKKYINEMEEMSRYIYPGGGTNIANALEKARIILSKSNYPNKHIICITDGRTVNASSC 415 Query: 431 VTSKVKELQRVHQHRFHAVAMSAHGK-PGIMRIF 463 + VK R VA+ + +MR+ Sbjct: 416 IKEAVK--LRRMGVTLSTVAVGDNSDFDLLMRLS 447 >UniRef50_UPI0001744662 Vault protein inter-alpha-trypsin domain protein n=1 Tax=Verrucomicrobium spinosum DSM 4136 RepID=UPI0001744662 Length = 679 Score = 120 bits (300), Expect = 1e-25, Method: Composition-based stats. Identities = 33/199 (16%), Positives = 68/199 (34%), Gaps = 17/199 (8%) Query: 274 ITELEYEFYRRLVEKQLLTYR------LHGESWREKVIERPVVHKDYDEQPRGPFIVCVD 327 L Y+ R V LL ++ ES+ ++ P + PR ++ +D Sbjct: 265 DFVLRYQLSGREVATGLLLHQAPAGSSPEAESFFLLNVQPPAKWEAGQTPPR-DYLFVLD 323 Query: 328 TSGSMGGFNEQCAKAFCLALMRIALAENRRCYIMLFSTEIVRYELSGPQGIEQAIRFLSQ 387 SGSM GF + +K L++ I+ F+++ + I ++ Sbjct: 324 VSGSMNGFPIETSKRLMSDLLKGLNP-GDTFNILHFASDSAVLSPKPLAATPENIHLATK 382 Query: 388 -----QFRGGTDLASCFRAIMERLQSREWFDADAVVISDFIAQRLPDDVTSKVKELQRVH 442 + GGT+L + + + + V+++D + KELQ + Sbjct: 383 DLSRHRGNGGTELLPALQRALATPREVGVSRS-IVILTDGYVTIEKEAFRLVRKELQNAN 441 Query: 443 QHRFHAVAMSAHGKPGIMR 461 + ++ Sbjct: 442 ---VFTFGIGTAVNRWLIE 457 >UniRef50_A6GBY0 Putative uncharacterized protein n=1 Tax=Plesiocystis pacifica SIR-1 RepID=A6GBY0_9DELT Length = 996 Score = 120 bits (300), Expect = 2e-25, Method: Composition-based stats. Identities = 36/201 (17%), Positives = 67/201 (33%), Gaps = 17/201 (8%) Query: 296 HGESWREKVIERPVVHKDYDEQPRGPFIVCVDTSGSMG-GFNEQCAKAFCLALMRIALAE 354 G S E+V+ + EQP I+ +D SGSM G K A R Sbjct: 504 WGGSTIEQVLPVRFSGERQREQPTLALILVIDKSGSMSSGDRLDLVKEAARATARTLDPS 563 Query: 355 NRRCYIMLFSTEIVRY-ELSGPQGIEQAIRFLS-QQFRGGTDLASCFRAIMERLQSREWF 412 + ++ F L + + GGT+ R +L + Sbjct: 564 D-EIGVIAFDNSPQVLVRLQPAANRLRISSSIRRLSAGGGTNAMPALREAYLQLAGSKAL 622 Query: 413 DADAVVISDFIAQRLPDDVTSKVKELQRVHQHRFHAVAMSA-HGKPGIMRIFDH---IWR 468 +++SD + P++ + + R +V + GK ++R+ + + Sbjct: 623 VKHVILLSDGES---PENGINALLGDMRQSDITVSSVGVGDGAGKDFLIRVAERGRGRYF 679 Query: 469 FD------TGMRSRLLRRWRR 483 + + SR R +R Sbjct: 680 YSEDGTDVPRIFSREAREVKR 700 Score = 42.0 bits (97), Expect = 0.055, Method: Composition-based stats. Identities = 19/148 (12%), Positives = 36/148 (24%), Gaps = 12/148 (8%) Query: 310 VHKDYDEQPRGPFIVCVDTSGSMGGFNEQCAKAFCLALMRI--------ALAENR-RCYI 360 + VD S S+ A+ + E+R R + Sbjct: 156 QPSLRSPIRGKTVVFVVDVSESIDDSQLAAAEQAVREAAELAASEAELGIEKEDRTRVRV 215 Query: 361 MLFSTEIVRYELSGPQGIEQAIRFLSQQFRGGTDLASCFRAIMERLQSREWFDADAVVIS 420 + ++ EL + E ++ D AS R L V+++ Sbjct: 216 ITYAGRARLLELEAGEAGELSLPRDPDNAMAS-DHASALRLAEALLDPDTEGR--VVLMT 272 Query: 421 DFIAQRLPDDVTSKVKELQRVHQHRFHA 448 D + + H Sbjct: 273 DATGDLAEREGLGQAIFDLEDRGVSVHT 300 >UniRef50_B9GVZ4 Predicted protein n=4 Tax=rosids RepID=B9GVZ4_POPTR Length = 757 Score = 119 bits (299), Expect = 2e-25, Method: Composition-based stats. Identities = 37/211 (17%), Positives = 72/211 (34%), Gaps = 22/211 (10%) Query: 278 EYEFYRRLVEKQLLTYRLHGESWREKVIERPVVHKDYDEQPRGPFIVCVDTSGSMGGFNE 337 + F L++ LL + + + + R I +D SGSM G Sbjct: 292 KDLFGGVLLQSPLL---RDIDDRQMFCFYLFPGNNQSMKAFRKEVIFIIDISGSMKGGPF 348 Query: 338 QCAKAFCLALMRIALAENRRCYIMLFSTEIV----RYELSGPQGIEQAIRFL--SQQFRG 391 + AK L+ ++ E+ I+ F + E + + I +A R+L G Sbjct: 349 ESAKNGLLSSLQKLNPED-SFNIIAFKMDTYLFSSVMEQATEEAIIEATRWLNDKLTADG 407 Query: 392 GTDLASCFRAIMERLQSREWFDADAVVISDFIAQRLPDDVTSKVKELQRVHQH---RFHA 448 GT++ + ++ L +I+D + D+ + VK R Sbjct: 408 GTNILGPLKQAIKLLAETTNSIPVIFLITDGAVED-ERDICNFVKGYLPSGGSISLRIST 466 Query: 449 VAMSAHGKPGIMRI--------FDHIWRFDT 471 + + +R+ FD + D+ Sbjct: 467 FGIGTYCNHHFLRMLAQIGRGHFDTAYDADS 497 >UniRef50_UPI0000D9E789 PREDICTED: similar to poly (ADP-ribose) polymerase family, member 4, partial n=7 Tax=Euteleostomi RepID=UPI0000D9E789 Length = 741 Score = 118 bits (296), Expect = 5e-25, Method: Composition-based stats. Identities = 50/280 (17%), Positives = 91/280 (32%), Gaps = 26/280 (9%) Query: 194 GDYQLIVKYGEFLNEQPE---LKRL-AEQLGRSREAKSIPRNDAQMETFRTMVREPATVP 249 +Q E L + E +K + +Q + +P + + +++ T Sbjct: 302 APWQQDKALNENLQDTVEKICIKEIGTKQSFSLTMSIEMPYVIEFIFSDTHELKQKRTDC 361 Query: 250 EQVDGLQQSDDILRLLPPELATLGITELEYEFYRRLVEKQLLTYRLHGESWREKVIERPV 309 + V + + L L + + EK+ V + + Sbjct: 362 KAVISTMEGSSLDSSGFSLHIGLSDAYLPRMWVEKHPEKE--------SEACMLVFQPDL 413 Query: 310 VHKDYDEQPRGPFIVCVDTSGSMGGFNEQCAKAFCLALMRIALAENRRCYIMLFSTEIVR 369 D I+C+D S SM G AK L + + E + I+ F T++ Sbjct: 414 DVHLPDLANESEVIICLDCSSSMEGVTFLQAKQIALHALS-LVGEKHKVNIIQFGTDVSV 472 Query: 370 YELSGPQGIEQAIRFLSQQF-------RGGTDLASCFRAIMERLQSREWFDADAVVISDF 422 + I+Q LS F G TD R + L +++SD Sbjct: 473 M-TANGDCIQQGEHNLSLHFLQSATPTMGNTDFWKTLRY-LSLLYPARGSRN-ILLVSDG 529 Query: 423 IAQRLPDDVTSKVKELQRVHQHRFHAVAMSAHGKPGIMRI 462 Q + +T ++ + R H R A + I+RI Sbjct: 530 HLQ--DESLTLQLVKRSRPH-TRLFACGIGPTANRHILRI 566 >UniRef50_B1KDL1 LPXTG-motif cell wall anchor domain protein n=1 Tax=Shewanella woodyi ATCC 51908 RepID=B1KDL1_SHEWM Length = 739 Score = 117 bits (294), Expect = 7e-25, Method: Composition-based stats. Identities = 28/173 (16%), Positives = 58/173 (33%), Gaps = 13/173 (7%) Query: 298 ESWREKVIERPVVHKDYDEQPRGPFIVCVDTSGSMGGFNEQCAKAFCLALMRIALAENRR 357 E ++ P + D I+ +DTSGSM G + AK + L Sbjct: 341 EDDYALLMLLPPSDQKQDVSISRELILVIDTSGSMSGASIAQAKRALNYALA-GLKAKDT 399 Query: 358 CYIMLFSTEIVRYELSGPQGIEQAIRFL-----SQQFRGGTDLASCFRAIMERLQSRE-- 410 ++ F++ + + I S + GGT++ A +++ E Sbjct: 400 FNVIEFNSNVGSLSPYSLPATAKNIGLANQYVRSLKANGGTEMQLALNAALDKGTETEAL 459 Query: 411 --WFDADAVVISDFIAQRLPDDVTSKVKELQRVHQHRFHAVAMSAHGKPGIMR 461 + ++D + +K Q++ + R + + + MR Sbjct: 460 GSERLRQVLFMTDGSVGD-EQSLFHLIK--QKIGESRLFTLGIGSAPNSHFMR 509 >UniRef50_C5CDX4 von Willebrand factor type A n=1 Tax=Kosmotoga olearia TBF 19.5.1 RepID=C5CDX4_KOSOT Length = 730 Score = 117 bits (293), Expect = 9e-25, Method: Composition-based stats. Identities = 28/186 (15%), Positives = 64/186 (34%), Gaps = 14/186 (7%) Query: 286 VEKQLLTYRLHGESWREKVIERPVVHKDYDEQPRGPFIVCVDTSGSMGGFNEQCAKAFCL 345 + ++ Y + ++ + P+ + +D SGSM G + AK L Sbjct: 241 LSSSIMNYWDEADRRGYFLLTLVPPREPERIIPK-DIVFILDISGSMSGQKIEKAKLALL 299 Query: 346 ALMRIALAENRRCYIMLFSTEI-----VRYELSGPQGIEQAIRFLSQQFRGGTDLASCFR 400 ++++ E R I+ F+ E+ S A++ G T++ Sbjct: 300 QVLQMLH-EGDRFSIITFNNEVNNLTERLLPFSDRTEWYPAVK--QIMAGGMTNIHDALL 356 Query: 401 AIMERL--QSREWFDADAVVISDFIAQRLPDDVTSKVK---ELQRVHQHRFHAVAMSAHG 455 +E L QS + + ++D D+ + ++ +L +V + Sbjct: 357 EGIEVLGTQSTDDRYKVVLFLTDGAPTEGITDIGTIIRDSTKLAKVRDVHLFVFGVGYDV 416 Query: 456 KPGIMR 461 ++ Sbjct: 417 NAELLD 422 >UniRef50_B7AA98 von Willebrand factor type A n=3 Tax=Thermus RepID=B7AA98_THEAQ Length = 706 Score = 117 bits (292), Expect = 1e-24, Method: Composition-based stats. Identities = 40/185 (21%), Positives = 62/185 (33%), Gaps = 10/185 (5%) Query: 282 YRRLVEKQLLTYRLHGESWREKVIERPVVHKDYDEQPRGP-FIVCVDTSGSMGGFNEQCA 340 Y R L T G + P +G ++ +D SGSM G A Sbjct: 268 YLRRGGGLLFTATPKGLFFGGWDRALPEDLPLKPLGRKGAALVLVLDVSGSMEGEKLAMA 327 Query: 341 KAFCLALMRIALAENRRCYIMLFSTEIVRY----ELSGPQGIEQAIRFLSQQFRGGTDLA 396 A L L+R A E+ ++LFS+ ++ E LS + GGT L Sbjct: 328 VAGALELVRSAAPED-YLGVVLFSSSPRVLFPPRPMTAQGKKEAESLLLSLRAGGGTVLG 386 Query: 397 SCFRAIMERLQSREWFDADAVVISDFIAQRLPDDVTSKVKELQRVHQHRFHAVAMSAHGK 456 FR + LQ +V+SD I + + + A+A+ Sbjct: 387 GAFREALRLLQDVPVERKALLVLSDGIIFDPKEPILALAATA----GVEVSALALGPDAD 442 Query: 457 PGIMR 461 + Sbjct: 443 AAFLE 447 >UniRef50_A1ZFT4 Von Willebrand factor, type A n=1 Tax=Microscilla marina ATCC 23134 RepID=A1ZFT4_9SPHI Length = 827 Score = 117 bits (292), Expect = 1e-24, Method: Composition-based stats. Identities = 32/206 (15%), Positives = 65/206 (31%), Gaps = 22/206 (10%) Query: 273 GITELEYEFYRRLVEKQLLTYRLHGE------------SWREKVIERPVVHKDYDEQPRG 320 + Y ++ LL Y E ++ +P + P Sbjct: 264 RDYIVRYRLAGSKIQSGLLLYEGENEVASGKEEDNENAEKFFLMMMQPPKAPKNSQIPPR 323 Query: 321 PFIVCVDTSGSMGGFNEQCAKAFCLALMRIALAENRRCYIMLFSTEIVRYELSGPQGIEQ 380 ++ VD SGSM GF +K L+ + +MLF + + + Sbjct: 324 EYVFIVDVSGSMHGFPLSVSKRLLKNLIGKLRP-KDKFNVMLFESSNQMMSPESMEATQA 382 Query: 381 AIR-----FLSQQFRGGTDLASCFRAIMERLQSREWFDADAVVISDFIAQRLPDDVTSKV 435 I+ Q+ GGT L + + Q++++ + V ++D + Sbjct: 383 NIQKAFGVIDQQRGGGGTRLLPALKKALAFKQTKDYSRSFVV-VTDGYVTVEKEAFDLIR 441 Query: 436 KELQRVHQHRFHAVAMSAHGKPGIMR 461 L R + A + + ++ Sbjct: 442 NNLNRAN---LFAFGIGSSVNRFLIE 464 >UniRef50_A8FW78 Vault protein inter-alpha-trypsin domain protein n=1 Tax=Shewanella sediminis HAW-EB3 RepID=A8FW78_SHESH Length = 770 Score = 116 bits (291), Expect = 1e-24, Method: Composition-based stats. Identities = 28/181 (15%), Positives = 63/181 (34%), Gaps = 24/181 (13%) Query: 301 REKVIERPVVHKDYDEQPRGPFIVCVDTSGSMGGFNEQCAKAFCLALMRIALAENRRCYI 360 V+ P + + I+ +DTSGSM G + AK + + + + Sbjct: 356 YALVMLLPPSLEKSRNRVSRELILVIDTSGSMSGSAMEQAKKAMKYALAGLGS-DDTFNV 414 Query: 361 MLFSTEIVRYEL----SGPQGIEQAIRFL-SQQFRGGTDLASCFRAIMERL--------- 406 + F++++ + + IE A RF+ S GGT++A + + Sbjct: 415 IEFNSKVSSLSKGPIPASTKNIEMANRFVHSLTSDGGTEMALALEHALGQESGGSSWQET 474 Query: 407 ------QSREWFDADAVVISDFIAQRLPDDVTSKVKELQRVHQHRFHAVAMSAHGKPGIM 460 + + ++D + + +K R+ + R + + + M Sbjct: 475 GLQGKDEESTSRLRQVLFMTDGAVGNEAE-LFKLIK--YRIGKSRLFTLGIGSAPNSHFM 531 Query: 461 R 461 + Sbjct: 532 Q 532 >UniRef50_Q2B7C3 Putative uncharacterized protein n=1 Tax=Bacillus sp. NRRL B-14911 RepID=Q2B7C3_9BACI Length = 920 Score = 116 bits (291), Expect = 2e-24, Method: Composition-based stats. Identities = 31/166 (18%), Positives = 64/166 (38%), Gaps = 5/166 (3%) Query: 298 ESWREKVIERPVVHKDYDEQPRGPFIVCVDTSGSMGGFNEQCAKAFCLALMRIALAENRR 357 ++ EK++ + K E P ++ +D SGSM G + AK + L E Sbjct: 381 KTPIEKLLPVNMDIKGKKEMPSLGLMIVMDRSGSMAGSKLELAKEAAARSVE-LLREKDT 439 Query: 358 CYIMLFSTEI-VRYELSGPQGIEQAIRFL-SQQFRGGTDLASCFRAIMERLQSREWFDAD 415 + F V E + + A+ + S GGT++ + E L++ + Sbjct: 440 LGFIAFDDRPWVIVETGPLEDKKDAVDKIGSVTPGGGTEIFTSLEKAYEELENLKLQRKH 499 Query: 416 AVVISDFIAQRLPDDVTSKVKELQRVHQHRFHAVAMSAHGKPGIMR 461 ++++D + R D + E + + VA+ + ++ Sbjct: 500 IILLTDGQSARSTD--YESMIETGKENNITLSTVALGSDADRNLLE 543 Score = 43.2 bits (100), Expect = 0.022, Method: Composition-based stats. Identities = 16/124 (12%), Positives = 38/124 (30%), Gaps = 5/124 (4%) Query: 316 EQPRGPFIVCVDTSGSMGGFNEQCAKAFCLALMRIALAENRRCYIMLFSTEIVRYELSGP 375 P + D S S+ G + + + + ++ + E E S Sbjct: 60 PAPGKTVVFIADRSASVQGREGELLDFIDAGI--QSKGKEDSYAVIS-AGETAAAESSLA 116 Query: 376 QGIEQAIRFLSQQFRGGTDLASCFRAIMERLQSREWFDADAVVISDFIAQRLPDDVTSKV 435 + F + +G T+L + + + V++SD +K+ Sbjct: 117 SMKGEFREFSTDTGKGETNLEAGIQLASTLMPEETPGR--IVLLSDGRETAGSSREAAKL 174 Query: 436 KELQ 439 + + Sbjct: 175 LKNR 178 >UniRef50_A9U149 Predicted protein n=1 Tax=Physcomitrella patens subsp. patens RepID=A9U149_PHYPA Length = 1185 Score = 116 bits (290), Expect = 2e-24, Method: Composition-based stats. Identities = 29/178 (16%), Positives = 56/178 (31%), Gaps = 13/178 (7%) Query: 294 RLHGESWREKVIERPVVHKDYDEQPRGPFIVCVDTSGSMGGFNEQCAKAFCLALMRIALA 353 H + + I VD SGSM G + A +R Sbjct: 439 ERHPRDGTHAITLTFLPRFALRPMSSSELIFVVDRSGSMQGTPIKQAGQALELFLRSIPC 498 Query: 354 ENRRCYIMLFSTEIVRY-ELSGPQGIEQAIRFLSQ-----QFRGGTDLASCFRAIMERLQ 407 E+ I+ F S P E + L GGT++ S F I E Sbjct: 499 EDHYFNIIGFGDNHKTLFPKSTPYNEETLTKGLRYAQALEADMGGTEMMSAFEEIFE--H 556 Query: 408 SREWFDADAVVISDFIAQRLPDDVTSKVKELQRVHQ----HRFHAVAMSAHGKPGIMR 461 R +++D + D + +++ ++ + R ++ + ++ ++ Sbjct: 557 RRRDVPTQIFLLTDGEIWDV-DSLIECIRDAKKEEKSDNFVRVFSLGIGSNVSHHLVE 613 >UniRef50_UPI0000E105CF vault protein inter-alpha-trypsin n=1 Tax=Glaciecola sp. HTCC2999 RepID=UPI0000E105CF Length = 757 Score = 116 bits (289), Expect = 3e-24, Method: Composition-based stats. Identities = 30/187 (16%), Positives = 60/187 (32%), Gaps = 11/187 (5%) Query: 282 YRRLVEKQLLTYRLHGESWREKVIERPVVHKDYDEQPRGPFIVCVDTSGSMGGFNEQCAK 341 Y+ ++ TY L + + + I +D+SGSM G A Sbjct: 354 YQEYIDGT--TYGLIHVVPPVISHDPSSMASTVTPSIQQNTIFVLDSSGSMHGTALTQAI 411 Query: 342 AFCLALMRIALAENRRCYIMLFSTEIVRY----ELSGPQGIEQAIRFLS-QQFRGGTDLA 396 + E+ I+ F +E + + +A+RFL GGT++ Sbjct: 412 DAIREGVSYL-TEHDTFNIVDFDSEARALWRQSQFADEVSKAEAMRFLRHVDSDGGTNMQ 470 Query: 397 SCFRAIMERLQSREWFDADAVVISDFIAQRLPDDVTSKVKELQRVHQHRFHAVAMSAHGK 456 + +L + ++D ++ ++ E ++ R V + A Sbjct: 471 DALALSLTQLLDSSTGLTQVIFVTDGSINN-ERELLKQIAE--QLGDKRLFTVGIGAAPN 527 Query: 457 PGIMRIF 463 M Sbjct: 528 SHFMEYA 534 >UniRef50_A8H5J9 LPXTG-motif cell wall anchor domain n=1 Tax=Shewanella pealeana ATCC 700345 RepID=A8H5J9_SHEPA Length = 789 Score = 116 bits (289), Expect = 3e-24, Method: Composition-based stats. Identities = 25/187 (13%), Positives = 63/187 (33%), Gaps = 23/187 (12%) Query: 294 RLHGESWREKVIERPVVHKDYDEQPRGPFIVCVDTSGSMGGFNEQCAKAFCLALMRIALA 353 + E + ++ P + I+ +DTSGSM G AK + Sbjct: 373 QTASEKYGLVMLMPPQGAEQQPSSIHRELILVIDTSGSMSGDAIIQAKTALKYALAGLRP 432 Query: 354 ENRRCYIMLFSTEIVRYE----LSGPQGIEQAIRFL-SQQFRGGTDLASCFRAIMERLQS 408 + + I+ F++++ ++ + P + QA ++ + GGT+++ A + Sbjct: 433 TD-KFNIVQFNSDVDKWSGMAMSATPYNLAQAQNYINRLEANGGTEMSIAINAALNIETV 491 Query: 409 REWFD--------------ADAVVISDFIAQRLPDDVTSKVKELQRVHQHRFHAVAMSAH 454 + + I+D + ++ ++ R + + + Sbjct: 492 TDKETGTELDNNDLGSNLLRQVLFITDGAVSNES-MLFELIE--AQLGDSRLFTIGIGSA 548 Query: 455 GKPGIMR 461 M+ Sbjct: 549 PNAHFMQ 555 >UniRef50_Q7UNM0 Putative uncharacterized protein n=1 Tax=Rhodopirellula baltica RepID=Q7UNM0_RHOBA Length = 900 Score = 116 bits (289), Expect = 3e-24, Method: Composition-based stats. Identities = 28/175 (16%), Positives = 55/175 (31%), Gaps = 8/175 (4%) Query: 291 LTYRLHGESWREKVIERPVVHKDYDEQPRG--PFIVCVDTSGSMGGFNEQCAKAFCLALM 348 + L G + PV E+ + ++ +D SGSMGG + AK A + Sbjct: 432 QAFGLGGYYRTQIEEILPVRSNFEKEREKPSLAMMLVIDKSGSMGGQKIELAKDAAQAAV 491 Query: 349 RIALAENRRCYIMLFSTEIVRY-ELSGPQGIEQAIRFLS-QQFRGGTDLASCFRAIMERL 406 + ++ F + EL +S + GGT++ E L Sbjct: 492 ELLGP-KDAIGVIAFDGDSYTVSELRSTSDRGAISDAISTIEASGGTNMYPAMADAYEAL 550 Query: 407 QSREWFDADAVVISDFIAQRLPDDVTSKVKELQRVHQHRFHAVAMSAHGKPGIMR 461 ++++D ++ V + VA+ ++ Sbjct: 551 LGATAKLKHVILMTDGVSSPGD---FQGVAGDMSASRITLSTVALGQGSSEDLLE 602 >UniRef50_Q4SBF6 Chromosome 11 SCAF14674, whole genome shotgun sequence. (Fragment) n=16 Tax=Euteleostomi RepID=Q4SBF6_TETNG Length = 1039 Score = 116 bits (289), Expect = 3e-24, Method: Composition-based stats. Identities = 27/178 (15%), Positives = 62/178 (34%), Gaps = 18/178 (10%) Query: 316 EQPRGPFIVCVDTSGSMGGFNEQCAKAFCLALMRIALAENRRCYIMLFSTEIV----RYE 371 + + +D SGSM G Q + L ++ E+ I+LF I Sbjct: 429 PRLPKNVVFVIDMSGSMSGTKMQQTREAMLKILEDLDPED-HFGIILFDHRIQFWNTSLS 487 Query: 372 LSGPQGIEQAIRFLS-QQFRGGTDLASCFRAIMERLQSREWFDA-------DAVVISDFI 423 + + I++A+ ++ Q GGTD+ + ++ L+ ++++D Sbjct: 488 KATKENIDEAMVYVKAIQSYGGTDINAPVLKAVDMLKEDRKAKRLPEKSIDMIILLTDGD 547 Query: 424 AQRLPDDVTSKVKELQRVHQ--HRFHAVAMSAHGKPGIMRIFDHIWRFDTGMRSRLLR 479 + + ++ ++ K + D + R + G+ R+ Sbjct: 548 PNSGESRIPVIQENVKAAIGGQMSLFSLGFGNDVKYPFL---DVMSRENNGLARRIYE 602 Score = 44.4 bits (103), Expect = 0.011, Method: Composition-based stats. Identities = 12/57 (21%), Positives = 20/57 (35%), Gaps = 3/57 (5%) Query: 316 EQPRGPFIVCVDTSGSMGGFNEQC-AKAFCLALMRIALAENRRCYIMLFSTEIVRYE 371 + + +D SGSM G Q A +L + + + FS I + Sbjct: 329 PRLPKNVVFVIDMSGSMSGTKMQQEAHRAARSLQKRSTDGG--TARISFSPTIEQQR 383 >UniRef50_A6G2V8 von Willebrand factor, type A n=1 Tax=Plesiocystis pacifica SIR-1 RepID=A6G2V8_9DELT Length = 877 Score = 115 bits (288), Expect = 3e-24, Method: Composition-based stats. Identities = 23/171 (13%), Positives = 55/171 (32%), Gaps = 10/171 (5%) Query: 296 HGESWREKVIERPVVHKDYDEQPRGPFIVCVDTSGSMGGFNEQCAKAFCLALMRIALAEN 355 G + +P ++ + VD SGSMGG AK ++ + Sbjct: 351 EGGDGYFTLTVQPPEQVADEQAVARELVFVVDNSGSMGGLPMDTAKGLMRKALKDIRP-D 409 Query: 356 RRCYIMLFSTEI----VRYELSGPQGIEQAIRFL-SQQFRGGTDLASCFRAIMERLQSRE 410 ++ FS + + IE + ++ + Q GGT + +A + + Sbjct: 410 DTFTVLRFSESASGLSNKLLPATQDNIEAGVDYVDAMQGMGGTQMTEGIKAALRVPHDPD 469 Query: 411 WFDADAVVISDFIAQRLPDDVTSKVKELQRVHQHRFHAVAMSAHGKPGIMR 461 + ++D + + + + R ++ + ++ Sbjct: 470 -RLRVVMFLTDGYIGN-EQAIFELIDDN--IGDARLFSLGVGGAPNRYLLD 516 >UniRef50_C4DQN3 Uncharacterized protein containing a von Willebrand factor type A (VWA) domain n=1 Tax=Stackebrandtia nassauensis DSM 44728 RepID=C4DQN3_9ACTO Length = 831 Score = 115 bits (287), Expect = 5e-24, Method: Composition-based stats. Identities = 34/204 (16%), Positives = 61/204 (29%), Gaps = 21/204 (10%) Query: 275 TELEYEFYRRL-------VEKQLLTYRLHGESWREKVIERPVVHKDYDEQPRGPFIVCVD 327 + +F R V LLT E + D +V +D Sbjct: 252 ERVNRDFILRFDYGESGDVAGSLLTAPDENEPTSGTFQLTAIPPSDLPRARPRDVVVLLD 311 Query: 328 TSGSMGGFNEQCAKAFCLALMRIALAENRRCYIMLFSTEIVRYE---------LSGPQGI 378 SGSMGG+ A+ ++ + +R + F T + E + Sbjct: 312 RSGSMGGWKMVAARRAAARIVDTLSSADR-FAVRCFDTAMTSPEGLDPNGLSAGTDRNRF 370 Query: 379 EQAIRFLSQQFRGGTDLASCFRAIMERLQ-SREWFDADAVVISDFIAQRLPDDVTSKVKE 437 + RGGTD+ ++ L + D ++++D + Sbjct: 371 RAVEHLAGTETRGGTDILKPLSTAVDLLTAGEKGRDRVIILVTDGQVGNEDQILRELT-- 428 Query: 438 LQRVHQHRFHAVAMSAHGKPGIMR 461 R+ R H V + G + Sbjct: 429 -GRLSGMRVHVVGIDKAVNAGFLH 451 >UniRef50_Q60ED8 Von Willebrand factor type A domain containing protein n=6 Tax=Poaceae RepID=Q60ED8_ORYSJ Length = 801 Score = 115 bits (287), Expect = 5e-24, Method: Composition-based stats. Identities = 26/149 (17%), Positives = 53/149 (35%), Gaps = 11/149 (7%) Query: 323 IVCVDTSGSMGGFNEQCAKAFCLALMRIALAENRRCYIMLFSTEIV----RYELSGPQGI 378 + +DTSGSM G + K + + + I+ F+ E+ E + I Sbjct: 335 VFIIDTSGSMQGKPLESVKNAMYTTLSELV-QGDYFNIITFNDELHSFSSCLEQVNEKTI 393 Query: 379 EQAIRFLSQQ--FRGGTDLASCFRAIMERLQSREWFDADAVVISDFIAQRLPDDVTSKVK 436 E A +++ GGTD+ + L + +++D + ++ VK Sbjct: 394 ENAREWVNTNFIAEGGTDIMHPLSEAIALLSNSHNALPQIFLVTDGSVED-ERNICRTVK 452 Query: 437 ELQRVHQH---RFHAVAMSAHGKPGIMRI 462 E R + ++ +R+ Sbjct: 453 EQLATRGSKSPRISTFGLGSYCNHYFLRM 481 >UniRef50_Q01UI0 von Willebrand factor, type A n=1 Tax=Candidatus Solibacter usitatus Ellin6076 RepID=Q01UI0_SOLUE Length = 837 Score = 114 bits (286), Expect = 6e-24, Method: Composition-based stats. Identities = 19/170 (11%), Positives = 50/170 (29%), Gaps = 6/170 (3%) Query: 294 RLHGESWREKVIERPVVHKDYDEQPRGPFIVCVDTSGSMGGFNEQCAKAFCLALMRIALA 353 G+ P ++ +D S SM G + A+ + ++ Sbjct: 367 DKKGKPEDALERSLPAKLAPPRSPEGTAVVLIIDKSSSMEGRKIELARLAAIGVVENLRP 426 Query: 354 ENRRCYIMLFSTEIVR-YELSGPQGIEQAIRFLS-QQFRGGTDLASCFRAIMERLQSREW 411 +++F + + + +S GGT +A +R+ + Sbjct: 427 -IDSVGVLIFDNSFQWAVPIRKAEDRATIKKLISGITPDGGTQIAPALTEAYQRILPQTA 485 Query: 412 FDADAVVISDFIAQRLPDDVTSKVKELQRVHQHRFHAVAMSAHGKPGIMR 461 V+++D I++ + + + + V + + Sbjct: 486 MYKHIVLLTDGISEEGDSMTLT---KEAQANHVTISTVGLGQDVNRAFLE 532 Score = 54.8 bits (130), Expect = 8e-06, Method: Composition-based stats. Identities = 27/144 (18%), Positives = 46/144 (31%), Gaps = 7/144 (4%) Query: 317 QPRGPFIVCVDTSGSMGGFNEQCAKAFCLALMRIALAENRRCYIMLFSTEIVRYELSGPQ 376 + + +V DTS S+ E A+A + ++ F+ L Sbjct: 55 ESKVAVVVLADTSASVSA--EDLARASAITTEVERGRGRHWTRVIPFARSTRTTALEEKS 112 Query: 377 GIEQAIRFLSQQFRGGTDLASCFRAIMERLQSREWFDADAVVISDFIAQRLPDDVTSKVK 436 R + GTDL + R L + ++ISD V + Sbjct: 113 KEGWRFRHTAGAAGRGTDLETAIRDGSASLPAGMVPR--LLLISDGNENLGS--VARAIW 168 Query: 437 ELQRVHQHRFHAVAMSAHGKPGIM 460 + Q VA++ KPG+ Sbjct: 169 QAQ-QMAIPIDTVALAGRPKPGLR 191 >UniRef50_Q9UKK3 Poly [ADP-ribose] polymerase 4 n=14 Tax=Eutheria RepID=PARP4_HUMAN Length = 1724 Score = 114 bits (286), Expect = 6e-24, Method: Composition-based stats. Identities = 46/277 (16%), Positives = 88/277 (31%), Gaps = 22/277 (7%) Query: 194 GDYQLIVKYGEFLNEQPE---LKRL-AEQLGRSREAKSIPRNDAQMETFRTMVREPATVP 249 +Q E L + E +K + +Q + +P + + +++ T Sbjct: 753 APWQQDKALNENLQDTVEKICIKEIGTKQSFSLTMSIEMPYVIEFIFSDTHELKQKRTDC 812 Query: 250 EQVDGLQQSDDILRLLPPELATLGITELEYEFYRRLVEKQLLTYRLHGESWREKVIERPV 309 + V + + L L + + EK+ V + + Sbjct: 813 KAVISTMEGSSLDSSGFSLHIGLSAAYLPRMWVEKHPEKE--------SEACMLVFQPDL 864 Query: 310 VHKDYDEQPRGPFIVCVDTSGSMGGFNEQCAKAFCLALMRIALAENRRCYIMLFSTEIVR 369 D I+C+D S SM G AK L + + E ++ I+ F T Sbjct: 865 DVDLPDLASESEVIICLDCSSSMEGVTFLQAKQIALHALS-LVGEKQKVNIIQFGTGYKE 923 Query: 370 YELSGPQ--GIEQAIRFL--SQQFRGGTDLASCFRAIMERLQSREWFDADAVVISDFIAQ 425 A F+ + G TD R + L +++SD Q Sbjct: 924 LFSYPKHITSNTAAAEFIMSATPTMGNTDFWKTLRY-LSLLYPARGSRN-ILLVSDGHLQ 981 Query: 426 RLPDDVTSKVKELQRVHQHRFHAVAMSAHGKPGIMRI 462 + +T ++ + R H R A + + ++RI Sbjct: 982 --DESLTLQLVKRSRPH-TRLFACGIGSTANRHVLRI 1015 >UniRef50_Q22SJ4 von Willebrand factor type A domain containing protein n=8 Tax=Tetrahymena thermophila RepID=Q22SJ4_TETTH Length = 646 Score = 114 bits (286), Expect = 6e-24, Method: Composition-based stats. Identities = 34/181 (18%), Positives = 68/181 (37%), Gaps = 14/181 (7%) Query: 294 RLHGESWREKVIERPVVHKDYDEQPRGPFIVCVDTSGSMGGFNEQCAKAFCLALMRIALA 353 + + ++ V + +P + +D SGSM G Q K L L+ + + Sbjct: 182 KTQDSTNDILEEQKEQVKQVEQSRPSIDLVCVIDNSGSMQGEKIQNVKTTLLQLLDMLNS 241 Query: 354 ENRRCYIMLFST------EIVRYELSGPQGIEQAIRFLSQQFRGGTDLASCFRAIMERLQ 407 +R ++LF++ + + + I+ I S GGTD+ S LQ Sbjct: 242 NDR-LSLILFNSYPTLLCNLRKVDDENTPNIQSIIN--SITADGGTDINSGMLMAFNILQ 298 Query: 408 SREWFD--ADAVVISDFIAQRLPDDVTSKVKELQRVHQ--HRFHAVAMSAHGKPGIM-RI 462 R++F+ + ++SD + + + Q + H+ + +M RI Sbjct: 299 KRQFFNPVSSIFLLSDGQDNGADEKIKKYINSNQSLKNECFSIHSFGFGSDHDGPLMNRI 358 Query: 463 F 463 Sbjct: 359 C 359 >UniRef50_Q8YNZ7 Alr4412 protein n=6 Tax=Cyanobacteria RepID=Q8YNZ7_ANASP Length = 820 Score = 114 bits (284), Expect = 9e-24, Method: Composition-based stats. Identities = 27/235 (11%), Positives = 73/235 (31%), Gaps = 9/235 (3%) Query: 232 DAQMETFRTMVREPATVPEQVDGLQQSDDILRLLPPELATLGITELEYEFYRRLVEKQLL 291 +++ + + + + +++L + L Y+ + +L Sbjct: 212 TIEIDAGVKVQNIQSPSHQVQISYAEKQVLVKLAGGDTIPNKDLILRYQVAGESTQATVL 271 Query: 292 TYRLHGESWREKVIERPVVHKDYDEQPRGPFIVCVDTSGSMGGFNEQCAKAFCLALMRIA 351 + + P + D+ + +DTSGS G + + Sbjct: 272 SQADER-GGHFALYLIPAIQYRQDQVVPKDVVFLIDTSGSQMGAPLMQCQELMRRFINGL 330 Query: 352 LAENRRCYIMLFSTEIVRYEL----SGPQGIEQAIRFL-SQQFRGGTDLASCFRAIMERL 406 + I+ FS + + Q +AI ++ GGT++ RA++ Sbjct: 331 NP-DDTFSIVDFSDTTRQLSPVPLANNAQNRTRAINYINQLSANGGTEMLRGIRAVLNFP 389 Query: 407 QSREWFDADAVVISDFIAQRLPDDVTSKVKELQRVHQHRFHAVAMSAHGKPGIMR 461 + V+++D + + L+ +R ++ + ++ Sbjct: 390 VTDPGRLRSIVLLTDGYIGNENQILAEVQQHLK--SGNRLYSFGAGSSVNRFLLN 442 >UniRef50_A6G415 von Willebrand factor, type A n=1 Tax=Plesiocystis pacifica SIR-1 RepID=A6G415_9DELT Length = 877 Score = 114 bits (284), Expect = 9e-24, Method: Composition-based stats. Identities = 25/173 (14%), Positives = 52/173 (30%), Gaps = 13/173 (7%) Query: 298 ESWREKVIERPVVHKDYDEQPRGPFIVCVDTSGSMGGFNEQCAKAFCLALMRIALAENRR 357 + ++ P + I +D SGSM G AK + + Sbjct: 351 QPGHFTLVVEPPQSDLDSLVGQREMIFVIDRSGSMSGVPLALAKQTLREALSHLRPVD-T 409 Query: 358 CYIMLFSTEIVRY----ELSGPQGIEQAIRFL-SQQFRGGTDLASCFRAIMERLQSREWF 412 ++ F + + Q + A RF+ Q GGT ++ A + + Sbjct: 410 FNVISFESSTAMLYEAAVPANEQNLVHAERFIDGLQAGGGTMMSGAVDAALS-PEIGLGR 468 Query: 413 DADAVVISDFIAQRLPDDVTSKVKELQRVHQ-----HRFHAVAMSAHGKPGIM 460 ++D D++ + L R R + + + ++ Sbjct: 469 HRYVFFVTDGFISN-EDEIARQASALVRAADKAGQRARVFGMGIGSSPNRELL 520 >UniRef50_Q10JU7 Von Willebrand factor type A domain containing protein, expressed n=17 Tax=Poaceae RepID=Q10JU7_ORYSJ Length = 680 Score = 114 bits (284), Expect = 1e-23, Method: Composition-based stats. Identities = 26/149 (17%), Positives = 53/149 (35%), Gaps = 11/149 (7%) Query: 323 IVCVDTSGSMGGFNEQCAKAFCLALMRIALAENRRCYIMLFSTEIV----RYELSGPQGI 378 + +DTSGSM G + K + + + I+ F+ E+ E + I Sbjct: 335 VFIIDTSGSMQGKPLESVKNAMYTTLSELV-QGDYFNIITFNDELHSFSSCLEQVNEKTI 393 Query: 379 EQAIRFLSQQ--FRGGTDLASCFRAIMERLQSREWFDADAVVISDFIAQRLPDDVTSKVK 436 E A +++ GGTD+ + L + +++D + ++ VK Sbjct: 394 ENAREWVNTNFIAEGGTDIMHPLSEAIALLSNSHNALPQIFLVTDGSVED-ERNICRTVK 452 Query: 437 ELQRVHQH---RFHAVAMSAHGKPGIMRI 462 E R + ++ +R+ Sbjct: 453 EQLATRGSKSPRISTFGLGSYCNHYFLRM 481 >UniRef50_Q7UL83 Inter-alpha-trypsin inhibitor family heavy chain-related protein-hypothetical secreted or membrane-associated protein containing vWFA domain n=1 Tax=Rhodopirellula baltica RepID=Q7UL83_RHOBA Length = 764 Score = 113 bits (283), Expect = 1e-23, Method: Composition-based stats. Identities = 26/171 (15%), Positives = 57/171 (33%), Gaps = 8/171 (4%) Query: 294 RLHGESWREKVIERPVVHKDYDEQPRGPFIVCVDTSGSMGGFNEQCAKAFCLALMRIALA 353 E + +P + E I+ +DTSGSM G + F ++ Sbjct: 317 ESDAEDGYVMLALQPKWSIEPTEITPREVILVLDTSGSMNGPAISQLRLFADHVLDHLNP 376 Query: 354 ENRRCYIMLFSTEIVRYEL----SGPQGIEQAIRFLS-QQFRGGTDLASCFRAIMERLQS 408 N ++ FS ++ + I+ A +F+ + GGT+L + + Sbjct: 377 -NDEFRVIAFSNRTTAFQPNAVSATDANIQSAKQFVRGLRASGGTNLLPALKLALGGEAD 435 Query: 409 REWFDADAVVISDFIAQRLPDDVTSKVKELQRVHQHRFHAVAMSAHGKPGI 459 ++++D + + +++ R +A A + Sbjct: 436 ESARPRYMILMTDALVGN-DHSILRYLRQ-PEFQDARVFPIAFGAAPNDYL 484 >UniRef50_A8S006 Putative uncharacterized protein n=1 Tax=Clostridium bolteae ATCC BAA-613 RepID=A8S006_9CLOT Length = 692 Score = 113 bits (283), Expect = 1e-23, Method: Composition-based stats. Identities = 62/389 (15%), Positives = 114/389 (29%), Gaps = 22/389 (5%) Query: 92 LPQILDLLHRLNSPWAEQARQLVDANSTITSALHTLFLQRWRLSLIVQATTLNQQLLEEE 151 P + L+ ++ + A + + +EE Sbjct: 276 WPYMPSLIEQVREDLKNKTHHAAQAAEEQLATPPKTLSGTSKPVPSKTPYNHTPSSEDEE 335 Query: 152 REQLLSEVQERMT--LSGQLEPILADNNTAAGRLWDMSAGQLKRGDYQLIVKYGEFLNEQ 209 RE+L S + + + I ++ + S + + + L E+ Sbjct: 336 RERLQSALDYETGRIALEKTDEISEGDSGGTSYDRNFSGSGYVSQAAEDMQRIVSQLAEE 395 Query: 210 PELKRLAEQLGRSREAKSIPRNDAQMETFRT-MVREPATVPEQVDGLQQSDDILRLLPPE 268 R E+L + +S + + + VPE+ + L P Sbjct: 396 SAAIRYEEELSEELQDESNRISYGNIHKGIHIHINRMGYVPEEYR-ISYQKVFPALHPIS 454 Query: 269 LATLGITELEYEFYRRLVEKQL--LTYRLHGESWREKVIERPVVHKDYDEQPRGPFIVC- 325 L + + +L L + + + + +K +G VC Sbjct: 455 -KRLQKQVSQILIDSK-TGGKLDGLPFGRRINARNAVRNDGKLFYKIRLPHEQGDIAVCL 512 Query: 326 -VDTSGSMGG-FNEQCAKAFCLALMRIALAENRRCYIML----FSTEIVRYELSGPQGIE 379 +D SGSM A++ L + +A I + EI Y Q + Sbjct: 513 LIDESGSMSSRDRITKARSAALIIHDFCVALGIPIAIYGHTEDYDVEIYSYAEYDSQDGK 572 Query: 380 QAIRFLSQQFRGGTDLASCFRAIMERLQSREWFDADAVVISDFIA------QRLPDDVTS 433 A R + R G + R ERL +R +ISD + Sbjct: 573 DAYRLMDMSSRCGNRDGAALRFAAERLMTRAEDLKLLFLISDGQPAGDGYYGTAAEADLR 632 Query: 434 KVKELQRVHQHRFHAVAMSAHGKPGIMRI 462 +K+ + A A+ KP I RI Sbjct: 633 GIKQEYSRKGIQLFAAAIGDD-KPNIKRI 660 >UniRef50_A9SQ90 Predicted protein n=3 Tax=Physcomitrella patens subsp. patens RepID=A9SQ90_PHYPA Length = 778 Score = 113 bits (282), Expect = 2e-23, Method: Composition-based stats. Identities = 23/152 (15%), Positives = 51/152 (33%), Gaps = 9/152 (5%) Query: 319 RGPFIVCVDTSGSMGGFNEQCAKAFCLALMRIALAENRRCYIMLFSTEIV----RYELSG 374 + + +D SGSM G A + + E+ I+ F E + E + Sbjct: 345 QRAVVFLLDRSGSMYGDPLNDALQALYSGLESLKPED-SFNIIAFDHETALFSSQMERAN 403 Query: 375 PQGIEQAIRFL--SQQFRGGTDLASCFRAIMERLQSREWFDADAVVISDFIAQRLPDDVT 432 I +A + + RGGTD+ S + + +++ +I+D + Sbjct: 404 SASILRAREWATEKCKARGGTDILSPLQQAFKLVENFPGAVPYVFLITDGAVDNEKNICL 463 Query: 433 SKVKELQR--VHQHRFHAVAMSAHGKPGIMRI 462 + + R + + +++ Sbjct: 464 TMQSRIVELGARAPRISTFGIGHYCNYYFLKM 495 >UniRef50_UPI0001788F71 von Willebrand factor type A n=1 Tax=Geobacillus sp. Y412MC10 RepID=UPI0001788F71 Length = 1007 Score = 113 bits (282), Expect = 2e-23, Method: Composition-based stats. Identities = 24/166 (14%), Positives = 58/166 (34%), Gaps = 5/166 (3%) Query: 298 ESWREKVIERPVVHKDYDEQPRGPFIVCVDTSGSMGGFNEQCAKAFCLALMRIALAENRR 357 ++ EK + + + E P I+ +D SGSM G + AK + + + Sbjct: 385 KTPIEKALPVSMELEGKREIPSLGLILVIDRSGSMDGNKIELAKESAMRTVELM-RAKDT 443 Query: 358 CYIMLFSTEI--VRYELSGPQGIEQAIRFLSQQFRGGTDLASCFRAIMERLQSREWFDAD 415 ++ F + V E S GGT++ + +E + + Sbjct: 444 VGVVAFDDQPWWVVPPQKLGDKEEVLSSIQSIPSAGGTNIYPAVSSALEEMLKIDAQRRH 503 Query: 416 AVVISDFIAQRLPDDVTSKVKELQRVHQHRFHAVAMSAHGKPGIMR 461 ++++D + + + ++ +VA+ +++ Sbjct: 504 IILMTDGQSAMNSGY--QDLTDTMVENKITMSSVAVGMDADTNLLQ 547 >UniRef50_A0DZ93 Chromosome undetermined scaffold_7, whole genome shotgun sequence n=1 Tax=Paramecium tetraurelia RepID=A0DZ93_PARTE Length = 522 Score = 113 bits (282), Expect = 2e-23, Method: Composition-based stats. Identities = 35/222 (15%), Positives = 70/222 (31%), Gaps = 12/222 (5%) Query: 245 PATVPEQVDGLQQSDDILRLLPPELATLGITELEYEFYRRLVEKQLLTYRLHGESWREKV 304 V +S+ + L L+ +L L T + Sbjct: 42 NDDDAIDVVITNESNYGRKSLSQNYMKQANYVLQDNVELKLSYSGLPTQGTQA-----VL 96 Query: 305 IERPVVHKDYDEQPRGPFIVCVDTSGSMGGFNEQCAKAFCLALMRIALAENRRCYIMLFS 364 + ++ + I +D SGSM G K L+++ +R C ++ F Sbjct: 97 LSVQTKNQAITIRQGIDLICLIDHSGSMSGEKMHLVKKSLKHLLKMLQPNDRLC-LIEFD 155 Query: 365 TEIVRYELSGPQGIEQAIRFL----SQQFRGGTDLASCFRAIMERLQSREWFD--ADAVV 418 + R E +FL + + G TD+ + + + L+ R + + A + Sbjct: 156 DQNYRLTRLMRATQENMYKFLIAIDTIEANGATDIGNAMKMALSILKHRRFKNPIASIFL 215 Query: 419 ISDFIAQRLPDDVTSKVKELQRVHQHRFHAVAMSAHGKPGIM 460 +SD + V + ++ + P IM Sbjct: 216 LSDGEDEGAAGRVWNDIQSKNIKEPFTINTFGFGRDCCPKIM 257 >UniRef50_B5W7H4 von Willebrand factor type A n=1 Tax=Arthrospira maxima CS-328 RepID=B5W7H4_SPIMA Length = 488 Score = 112 bits (281), Expect = 2e-23, Method: Composition-based stats. Identities = 25/168 (14%), Positives = 52/168 (30%), Gaps = 3/168 (1%) Query: 295 LHGESWREKVIERPVVHKDYDEQPRGPFIVCVDTSGSMGGFNEQCAKAFCLALMRIALAE 354 L G ++ ++ +DTSGSM G + + + + Sbjct: 26 LLGALIFGEIWLYLTRQPPGFLTKPKAVVLLIDTSGSMSGQKLREVQTAASEFVSRQNLK 85 Query: 355 NRRCYIMLFSTEIVRYELSGPQGIEQAIRFLSQQFRGGTDLASCFRAIMERLQSREWFDA 414 ++ FS+ E RGGT+L+ F LQ+ + Sbjct: 86 RHDLAVVEFSSRASVVADFTRNETELQQAIARLSARGGTNLSEGFNLATSVLQNSD-RTP 144 Query: 415 DAVVISDFIAQRLPDDVTSKVKELQRVHQHRFHAVAMSAHGKPGIMRI 462 + ++ +D + P + + + + R AV + + Sbjct: 145 NILLFTDGVPNNPP--MAASIAQQIRASGINLVAVGTGDAQINYLTAL 190 >UniRef50_Q86UX2 Inter-alpha-trypsin inhibitor heavy chain H5 n=40 Tax=Euteleostomi RepID=ITIH5_HUMAN Length = 942 Score = 112 bits (281), Expect = 2e-23, Method: Composition-based stats. Identities = 56/474 (11%), Positives = 138/474 (29%), Gaps = 58/474 (12%) Query: 37 EKFPRLKAAITDDVPRWREALRSRLKDARVPPELTEEVMCYQQSQLLSTPQFIVQLPQIL 96 E+ + D R + L+ + P +TE +S ++S F ++L Sbjct: 19 EEAQSWGHSSEQDGLRVPRQV-RLLQRLKTKPLMTE---FSVKSTIISRYAFTTVSCRML 74 Query: 97 DLLHRLNSPWAEQARQLVDANSTITSALHTLFLQRWRLSLIVQATTLNQQLLEEEREQLL 156 + + + + T+ + + T ++ + +E+ Sbjct: 75 NRASED-----QDIEFQMQIPAAAFITNFTMLIGD--KVYQGEITEREKKSGDRVKEKRN 127 Query: 157 SEVQERMTLSGQLEPILADNNTAAGRLWDMSAGQLKRGDYQLIVKYGEFLNEQPELKRLA 216 +E ++ A + + +S +L + + + KY ++ +P+ Sbjct: 128 KTTEENGEKGTEIFRASAVIPSKDKAAFFLSYEELLQ---RRLGKYEHSISVRPQ----- 179 Query: 217 EQLGRSREAKSIPRNDAQMETFRTMVREPATVPEQVDGLQQSDDILRLLPPELATLGITE 276 + GR +I + + G + +D P + T Sbjct: 180 QLSGRLSVDVNILESAGIASLEVLPLHNSRQ-----RGSGRGEDDSGPPPSTVINQNETF 234 Query: 277 LEYEFY------RRLVEKQLL--------TYRLHGESWREKVIERPVVHKDYDEQPRGP- 321 F R+ + +L R + + V + + P P Sbjct: 235 ANIIFKPTVVQQARIAQNGILGDFIIRYDVNREQSIGDIQVLNGYFVHYFAPKDLPPLPK 294 Query: 322 -FIVCVDTSGSMGGFNEQCAKAFCLALMRIALAENRRCYIMLFSTEIVRYE-----LSGP 375 + +D+S SM G + K ++ ++R I+ FS I ++ ++ Sbjct: 295 NVVFVLDSSASMVGTKLRQTKDALFTILHDLRPQDR-FSIIGFSNRIKVWKDHLISVTPD 353 Query: 376 QGIEQAIRFLSQQFRGGTDLASCFRAIMERLQS-------REWFDADAVVISDFIAQRLP 428 + + GGTD+ + + L + + V ++D Sbjct: 354 SIRDGKVYIHHMSPTGGTDINGALQRAIRLLNKYVAHSGIGDRSVSLIVFLTDGKPTVGE 413 Query: 429 DDVTSKVK--ELQRVHQHRFHAVAMSAHGKPGIMRIFDHIWRFDTGMRSRLLRR 480 + Q + + ++ + + G+ R+ Sbjct: 414 THTLKILNNTREAARGQVCIFTIGIGNDVDFRLLE---KLSLENCGLTRRVHEE 464 >UniRef50_B3QTN9 Vault protein inter-alpha-trypsin domain protein n=2 Tax=Bacteria RepID=B3QTN9_CHLT3 Length = 837 Score = 112 bits (281), Expect = 2e-23, Method: Composition-based stats. Identities = 33/194 (17%), Positives = 61/194 (31%), Gaps = 11/194 (5%) Query: 273 GITELEYEFYRRLVEKQLLTYRLHGESWREKVIERPVVHKDYDEQPRGPFIVCVDTSGSM 332 L Y ++ LL + E++ ++ P + R + D SGSM Sbjct: 290 RDYILRYRLAGNQIQSGLLLFEGEKENFFLATVQPPKRVTEKMIPNREYIYIV-DVSGSM 348 Query: 333 GGFNEQCAKAFCLALMRIALAENRRCYIMLFSTEIVRYE----LSGPQGIEQAIRFLS-Q 387 G +K L+ ++LFS + + IE+A L + Sbjct: 349 FGQPIAISKELMKKLLGRLRPTE-TFNLLLFSGGSKLLSEKSLPATDKNIEKAFYALENE 407 Query: 388 QFRGGTDLASCFRAIMERLQSREWFDADAVVISDFIAQRLPDDVTSKVKELQRVHQHRFH 447 GGT+L + + V I+D + + K L + + Sbjct: 408 HGGGGTELLRALNRALGLPKKEAGSRTFVV-ITDGYVSFEVETFETIRKNLNKAN---LF 463 Query: 448 AVAMSAHGKPGIMR 461 AV + ++ Sbjct: 464 AVGIGNGVNRFLIE 477 >UniRef50_UPI0001C38CFE von Willebrand factor, type A n=1 Tax=Arthrospira platensis str. Paraca RepID=UPI0001C38CFE Length = 489 Score = 112 bits (280), Expect = 3e-23, Method: Composition-based stats. Identities = 23/168 (13%), Positives = 48/168 (28%), Gaps = 3/168 (1%) Query: 295 LHGESWREKVIERPVVHKDYDEQPRGPFIVCVDTSGSMGGFNEQCAKAFCLALMRIALAE 354 L G ++ ++ +DTSGSM G + + + Sbjct: 27 LLGALIFGEIWLYLTRQPPGFLTKPQAVVMLIDTSGSMSGSKLPEVQRAASEFVSRQNLK 86 Query: 355 NRRCYIMLFSTEIVRYELSGPQGIEQAIRFLSQQFRGGTDLASCFRAIMERLQSREWFDA 414 ++ FS+ E GGT+L+ F LQ+ + Sbjct: 87 RDDLAVVEFSSRASVVADFTRDERELQQAIARLSAWGGTNLSEGFNLATSVLQNSD-RPG 145 Query: 415 DAVVISDFIAQRLPDDVTSKVKELQRVHQHRFHAVAMSAHGKPGIMRI 462 + ++ +D + + + + R AV + + Sbjct: 146 NILLFTDGEPNN--RRMAASIAQQIRASGINLVAVGTGDAPVNYLTAL 191 >UniRef50_UPI00006CDDCC von Willebrand factor type A domain containing protein n=1 Tax=Tetrahymena thermophila RepID=UPI00006CDDCC Length = 2138 Score = 112 bits (279), Expect = 3e-23, Method: Composition-based stats. Identities = 50/382 (13%), Positives = 117/382 (30%), Gaps = 31/382 (8%) Query: 92 LPQILDLLHRLNSPWAEQARQLVDANSTITSALHTLFLQRWRLSLIVQATTLNQQLLEEE 151 +P +L + + QL S Q+ ++ +E+ Sbjct: 1236 IPVLLSVKTEEQMISYQYFDQLDTHPSQKDIHSLHQIDQKSEHQIMENIERQRSLSIEKL 1295 Query: 152 REQLLSEVQERMTLSGQLEPILADNNTAAGRLWDMSAGQLKRGDYQLIVKYGEFLNEQPE 211 E V ++ + + +++ S+G D + L Sbjct: 1296 HE-----VANHDEINENKQDLTHNHD---------SSGYQSDQDQSNNRDITKIL---DG 1338 Query: 212 LKRLAEQLGRSREAKSIPRNDAQMETFRTMVREPATVPEQVDGLQQSDDILRLLPPELAT 271 L + +Q P+ + + ++ + + Q +++ + ++ T Sbjct: 1339 LLTVEKQQTDENTHDKQPKIEEKDQSEGQQEEFEQNETHSLRKISQKKVLIKSIQRKVKT 1398 Query: 272 LGITELEYEFYRRLVEKQLLTYRLHGESWREKVIERPVVHKDYDEQPRGPFIVCVDTSGS 331 + E Q + + S + + + + + + I +DTSGS Sbjct: 1399 NKEKVQKALNEED-KENQTKSQQHRISSNVKNISGQFSLGQLQPMRFPIDLICVIDTSGS 1457 Query: 332 MGGFNEQCAKAFCLALMRIALAENRRCYIMLFSTEIVR----YELSGPQGIEQAIRFL-S 386 M G K L L+ + +R C ++ FST R + I+ + Sbjct: 1458 MNGQPLDLLKETLLFLVDLLQTGDRIC-LIQFSTNAQRLTPLLSIESKDNIKSIKNEINR 1516 Query: 387 QQFRGGTDLASCFRAIMERLQSREWFD--ADAVVISDFIAQRLPDDVTSKVKELQ----- 439 +GGT++ + + L+ R + + ++SD + + + +K+L Sbjct: 1517 LVAKGGTNICQGMQLAFDVLKQRRYKNPITSVFLLSDGLNDGAENKIRDLLKQLNFYQNY 1576 Query: 440 RVHQHRFHAVAMSAHGKPGIMR 461 P +M Sbjct: 1577 NEENFTIQTFGFGKDHDPNLMD 1598 >UniRef50_Q9FF49 Retroelement pol polyprotein-like n=34 Tax=Magnoliophyta RepID=Q9FF49_ARATH Length = 704 Score = 112 bits (279), Expect = 4e-23, Method: Composition-based stats. Identities = 33/189 (17%), Positives = 66/189 (34%), Gaps = 17/189 (8%) Query: 289 QLLTYRLHGESWREKVIERPVVHKDYDEQPRGPF--IVCVDTSGSMGGFNEQCAKAFCLA 346 + ++++ K + R P + +D SGSM G K Sbjct: 218 RSVSFKDFAVLINLKAPTSSKSSSNPSSSSRAPVDLVTVLDVSGSMAGTKLALLKRAMGF 277 Query: 347 LMRIALAENRRCYIMLFSTEIVR---YELSGPQGIEQAIRFL-SQQFRGGTDLASCFRAI 402 +++ +R ++ FS+ R L G ++A++ + S GGT++A + Sbjct: 278 VIQNLGPFDR-LSVISFSSTARRNFPLRLMTETGKQEALQAVNSLVSNGGTNIAEGLKKG 336 Query: 403 MERLQSREWFD--ADAVVISDFIAQ------RLPDDVTSKVKELQRVHQHRF--HAVAMS 452 L R + + + V++SD K + ++ +R HA Sbjct: 337 ARVLIDRRFKNPVSSIVLLSDGQDTYTMTSPNGSRGTDYKALLPKEINGNRIPVHAFGFG 396 Query: 453 AHGKPGIMR 461 A +M Sbjct: 397 ADHDASLMH 405 >UniRef50_C3YPR2 Putative uncharacterized protein (Fragment) n=1 Tax=Branchiostoma floridae RepID=C3YPR2_BRAFL Length = 863 Score = 112 bits (279), Expect = 4e-23, Method: Composition-based stats. Identities = 41/304 (13%), Positives = 97/304 (31%), Gaps = 34/304 (11%) Query: 202 YGEFLNEQPELKRLAEQLGRSREAKSIPRNDAQMETFRTMVREPATVPEQVDGLQQSDDI 261 Y E L + L + + + + + +ET ++ + +++G++ + Sbjct: 113 YEELLQRRLGSYELVLSIRPQQVVRHLKIDVRIIETRDIVMLDNTYGSGELEGVEIARPS 172 Query: 262 LRLLPPELATLGITELEY-------EFYRRL-VEKQLLTYRLHGESWREKVIERPVVHKD 313 + + ++ +F R V++ L + + P Sbjct: 173 PNRAHIQYRPTDMEQMRMSPSGISGDFLVRYDVKRDLSVGDIQIVNGYFVHYFAPSGLPV 232 Query: 314 YDEQPRGPFIVCVDTSGSMGGFNEQCAKAFCLALMRIALAENRRCYIMLFSTEIV----- 368 + + +D SGSMGG + K +++ ++ R +M FS Sbjct: 233 VPK----NIVFIIDKSGSMGGTKMRQTKQAMNTILKDL-RDHDRFNVMPFSYSSTMWRPN 287 Query: 369 RYELSGPQGIEQAIRFL--SQQFRGGTDLASCFRAIMERLQS-------REWFDADAVVI 419 L+ + IE A ++ S GGT++ + L+ + + + Sbjct: 288 EMVLATRENIESARTYVRRSINAGGGTNINQAIIDAADLLRRVTDDQPNSPRSASLIIFL 347 Query: 420 SDFIAQRL---PDDVTSKVKELQRVHQHRFHAVAMSAHGKPGIMRIFDHIWRFDTGMRSR 476 +D + P ++ VK R Q + + + + G+ R Sbjct: 348 TDGLPSVGESKPRNIMVNVKNAIRE-QVSLFCLGFGKDVDFPFLE---KMALENRGLARR 403 Query: 477 LLRR 480 + Sbjct: 404 IYED 407 >UniRef50_A6DLI0 Putative uncharacterized protein n=1 Tax=Lentisphaera araneosa HTCC2155 RepID=A6DLI0_9BACT Length = 833 Score = 111 bits (278), Expect = 5e-23, Method: Composition-based stats. Identities = 27/165 (16%), Positives = 55/165 (33%), Gaps = 6/165 (3%) Query: 298 ESWREKVIERPVVHKDYDEQPRGPFIVCVDTSGSMGGFNEQCAKAFCLALMRIALAENRR 357 ++ E+V+ ++ EQP ++ +D SGSM G A+ A L+ + Sbjct: 390 KTPIEEVLPVTSRYEKEKEQPSLALVLVIDKSGSMNGQPIVLAREASKAAAE-LLSSRDQ 448 Query: 358 CYIMLFSTEIVRY-ELSGPQGIEQAI-RFLSQQFRGGTDLASCFRAIMERLQSREWFDAD 415 ++ F +L+ + + + GGT+L + L Sbjct: 449 VGVIAFDGSAKLVTDLTSAANKGEVLSQIDGIGAGGGTNLYPAMVMGRDMLGIASAKIKH 508 Query: 416 AVVISDFIAQRLPDDVTSKVKELQRVHQHRFHAVAMSAHGKPGIM 460 +V+SD +Q + V++ +M Sbjct: 509 MIVLSDGQSQGGD---FEGISSELAQMGVTISTVSLGQGAAVDLM 550 >UniRef50_C5WYU9 Putative uncharacterized protein Sb01g047460 n=3 Tax=Andropogoneae RepID=C5WYU9_SORBI Length = 698 Score = 111 bits (277), Expect = 7e-23, Method: Composition-based stats. Identities = 23/176 (13%), Positives = 52/176 (29%), Gaps = 16/176 (9%) Query: 301 REKVIERPVVHKDYDEQPRGPFIVCVDTSGSMGGFNEQCAKAFCLALMRIALAENRRCYI 360 + P + + +D SGSM G K +++ +R + Sbjct: 221 ILIHLRAPKSSHSASSRAPLDLVTVLDVSGSMAGTKIALLKNAMSFVIQTLGPNDR-LSV 279 Query: 361 MLFSTEIVRY----ELSGPQGIEQAIRFLSQQFRGGTDLASCFRAIMERLQSREWFDA-- 414 + FS+ R ++ + S GGT++A + + ++ R + Sbjct: 280 IAFSSTARRLFPLRRMTLAGRQQALQAVSSLVASGGTNIADGLKKGAKVIEDRRLKNPVC 339 Query: 415 DAVVISDFIAQR---LPDDVTSKVKEL------QRVHQHRFHAVAMSAHGKPGIMR 461 +++SD ++ + H + H + M Sbjct: 340 SIILLSDGQDTYTLPSDRNLLDYSALVPPSILPGTGHHVQIHTFGFGSDHDSAAMH 395 >UniRef50_B5JQC2 Vault protein inter-alpha-trypsin n=1 Tax=Verrucomicrobiae bacterium DG1235 RepID=B5JQC2_9BACT Length = 808 Score = 110 bits (276), Expect = 8e-23, Method: Composition-based stats. Identities = 40/220 (18%), Positives = 79/220 (35%), Gaps = 19/220 (8%) Query: 256 QQSDDILRLLPPELATLGITELEYEFYRRLVEK---QL--LTYRLHGESWREKVIERPVV 310 ++ ++L + A L ++ FY L E +L LTYR + + ++ Sbjct: 216 TETGELLYRFESQGAILDE---DFVFYYMLEENLPGRLEVLTYRENEDKPGTFMMVMTPG 272 Query: 311 HKDYDEQPRGPFIVCVDTSGSMGGFNEQCAKAFCLALMRIALAENRRCYIMLFSTEIV-- 368 + + F+ +D SGSM G A A+ + L R ++ F+ Sbjct: 273 VDLHPLEGGADFVFALDVSGSMQGKLHTLASGVKKAIGQ--LKPEDRFRVVAFNNTAFDL 330 Query: 369 ---RYELSGPQGIEQAIRFLSQQFRGGTDLASCFRAIMERLQSREWFDADAVVISDFIAQ 425 + E R GGT++ + +ERL + A ++++D + Sbjct: 331 NRGWVSATEANLRETFARLDQLNSNGGTNVYAGVHLALERLDADRV--ATLILVTDGVTN 388 Query: 426 RLPDDVTSKVKELQRVHQHRFHAVAMSAHGKPGIMR-IFD 464 + D +L RF+ + +M+ + D Sbjct: 389 QGIVDP-KAFYKLMHKQDLRFYGFLLGNSSNWPLMQLMCD 427 >UniRef50_Q10RY0 Os03g0142500 protein n=2 Tax=Oryza sativa RepID=Q10RY0_ORYSJ Length = 694 Score = 110 bits (276), Expect = 8e-23, Method: Composition-based stats. Identities = 23/183 (12%), Positives = 52/183 (28%), Gaps = 16/183 (8%) Query: 294 RLHGESWREKVIERPVVHKDYDEQPRGPFIVCVDTSGSMGGFNEQCAKAFCLALMRIALA 353 ++ P + + +D SGSM G K +++ Sbjct: 220 ERRKVFAILIHLKAPKSLDSVSSRAPLDLVTVLDVSGSMSGIKLSLLKRAMSFVIQTLGP 279 Query: 354 ENRRCYIMLFSTEIVRY----ELSGPQGIEQAIRFLSQQFRGGTDLASCFRAIMERLQSR 409 +R ++ FS+ R ++ + S GGT++A + + ++ R Sbjct: 280 NDR-LSVVAFSSTAQRLFPLRRMTLTGRQQALQAISSLVASGGTNIADALKKGAKVVKDR 338 Query: 410 EWFD--ADAVVISDFIAQRLPDDVTSKVKELQ---------RVHQHRFHAVAMSAHGKPG 458 + + +++SD + + H + H Sbjct: 339 RRKNPVSSIILLSDGQDTHSFLSGEADINYSILVPPSILPGTSHHVQIHTFGFGTDHDSA 398 Query: 459 IMR 461 M Sbjct: 399 AMH 401 >UniRef50_B0TQ23 LPXTG-motif cell wall anchor domain n=1 Tax=Shewanella halifaxensis HAW-EB4 RepID=B0TQ23_SHEHH Length = 850 Score = 110 bits (275), Expect = 1e-22, Method: Composition-based stats. Identities = 26/194 (13%), Positives = 67/194 (34%), Gaps = 34/194 (17%) Query: 298 ESWREKVIERPVVHKDYDEQPRGPFIVCVDTSGSMGGFNEQCAKAFCLALMRIALAENRR 357 E + ++ P D ++ +DTSGSM G AK+ + ++ Sbjct: 433 EKYALVMLMPPQGSDDESSSIARELVLVIDTSGSMSGDAIIQAKSALKYALAGLRPQD-S 491 Query: 358 CYIMLFSTEIVRYE----LSGPQGIEQAIRFL-SQQFRGGTDLASCFRAIMERLQSREWF 412 ++ F++ + R+ + + +A ++ Q GGT+++ A + +L + Sbjct: 492 FNVLQFNSTVERWSRHVMPATAINLGRAQNYINGLQADGGTEMSLALDAALTKLDNDRGH 551 Query: 413 DA-------------------------DAVVISDFIAQRLPDDVTSKVKELQRVHQHRFH 447 ++ + I+D + ++K ++ + R Sbjct: 552 NSKPVHDDDRYQSSNETLEQSAATPLRQVLFITDGAVANESR-LFEQIKN--QLGESRLF 608 Query: 448 AVAMSAHGKPGIMR 461 + + + M+ Sbjct: 609 TIGIGSAPNAHFMQ 622 >UniRef50_Q47YR5 Von Willebrand factor type A domain protein n=2 Tax=cellular organisms RepID=Q47YR5_COLP3 Length = 786 Score = 110 bits (275), Expect = 1e-22, Method: Composition-based stats. Identities = 32/179 (17%), Positives = 59/179 (32%), Gaps = 14/179 (7%) Query: 291 LTYRLHGESWREKVIERPVVHKDYDEQPRGPFIVCVDTSGSMGGFNEQCAKAFCLALMRI 350 T + GE + P K + I +DTSGSM + + AK+ + Sbjct: 367 FTQEISGEHYTLLTFFPPE--KAVAQVIARDIIFIIDTSGSMQAGSMEQAKSSLQL-ALL 423 Query: 351 ALAENRRCYIMLFSTEIVRY----ELSGPQGIEQAIRFL-SQQFRGGTDLASCFRAIMER 405 L I+ F + ++ I +A +F+ GGT++ + Sbjct: 424 QLNNKDSFNIIAFDNDTELLFPVTHMASAHNISKAQQFIDGLSANGGTEMYRPLSNALMM 483 Query: 406 LQSREWFDA---DAVVISDFIAQRLPDDVTSKVKELQRVHQHRFHAVAMSAHGKPGIMR 461 + + V I+D A ++ + Q R + V + A M+ Sbjct: 484 KKDKTQSSKAIRQIVFITDG-AVANEFELMQLLNTAQ--GDFRLYTVGIGAAPNGYFMK 539 >UniRef50_UPI00006A1915 Transmembrane protein 110. n=2 Tax=Xenopus (Silurana) tropicalis RepID=UPI00006A1915 Length = 728 Score = 110 bits (275), Expect = 1e-22, Method: Composition-based stats. Identities = 27/179 (15%), Positives = 67/179 (37%), Gaps = 18/179 (10%) Query: 316 EQPRGPFIVCVDTSGSMGGFNEQCAKAFCLALMRIALAENRRCYIMLFSTEIVRYELS-- 373 ++ + +D SGSM G + L ++ E I++F ++ +++ + Sbjct: 244 QKVPKNVVFVIDHSGSMHGQKIKQTYEAFLKILADL-PEEDHFGILIFDDKVDKWQNTLV 302 Query: 374 --GPQGIEQAIRFLSQ-QFRGGTDLASCFRAIMERLQS-------REWFDADAVVISDFI 423 P I +A +F+S+ RGGTD+ A ++ L++ + + + +SD Sbjct: 303 KAVPDNIIKAKQFVSKISARGGTDINKALLAAVKMLKNTSRNKLLPKISTSIILFLSDGE 362 Query: 424 AQRLPDDVTSKVKELQR--VHQHRFHAVAMSAHGKPGIMRIFDHIWRFDTGMRSRLLRR 480 + + +++ Q + + + + + G+ R+ Sbjct: 363 PTSGVTNHNEIINNVKKANERQTTLYCLGFGNDVDFNFLE---KMALENGGLARRIYED 418 >UniRef50_D1HHA4 Whole genome shotgun sequence of line PN40024, scaffold_125.assembly12x (Fragment) n=1 Tax=Vitis vinifera RepID=D1HHA4_VITVI Length = 630 Score = 110 bits (274), Expect = 1e-22, Method: Composition-based stats. Identities = 30/165 (18%), Positives = 53/165 (32%), Gaps = 16/165 (9%) Query: 316 EQPRGPFIVCVDTSGSMGGFNEQCAKAFCLALMRIALAENRRCYIMLFSTEIVRY----E 371 ++ + +D SGSM G K L++ +R I+ FS+ R Sbjct: 201 DRAPIDLVAVLDVSGSMAGSKLSLLKRAVCFLIQNLGPSDR-LSIVSFSSTARRIFPLRR 259 Query: 372 LSGPQGIEQAIRFLSQQFRGGTDLASCFRAIMERLQSREWFD--ADAVVISDFIAQRLPD 429 +S + S GGT++ + + L+ R + A +++SD D Sbjct: 260 MSDNGREAAGLAINSLTSSGGTNIVEGLKKGVRVLEERSEQNPVASIILLSDGKDTYNCD 319 Query: 430 DVTSKVKELQRVHQHR--------FHAVAMSAHGKPGIMR-IFDH 465 +V + R H + M I D Sbjct: 320 NVNRRQTSHCASSNPRQGRQAIIPVHTFGFGSDHDSTAMHAISDE 364 >UniRef50_Q54DU5 von Willebrand factor A domain-containing protein DDB_G0292028 n=1 Tax=Dictyostelium discoideum RepID=Y2028_DICDI Length = 932 Score = 110 bits (274), Expect = 1e-22, Method: Composition-based stats. Identities = 26/152 (17%), Positives = 57/152 (37%), Gaps = 11/152 (7%) Query: 314 YDEQPRGPFIVCVDTSGSMGGFNEQCAKAFCLALMRIALAENRRCYIMLFSTEIVRYELS 373 + + FI +D SGSM G + +K L + +L EN + I+ F + + + Sbjct: 335 DEVDQKSEFIFVLDCSGSMSGKPIEKSK-MALEICMRSLNENSKFNIVCFGSNFNKLFET 393 Query: 374 GPQGIEQAIRFLSQQFR------GGTDLASCFRAIMERLQSREWFDADAVVISDFIAQRL 427 ++ ++ S+ GGT+L I+ + E+ +++D Sbjct: 394 SKHYNDETLQKASEYINRIDANLGGTELLEPIVDILSKESDPEFPR-QVFILTDGEISNR 452 Query: 428 PDDVTSKVKELQRVHQHRFHAVAMSAHGKPGI 459 D + V + + R + ++ + Sbjct: 453 -DKLIDYV--GKEANTTRIFTYGIGSYVDKEL 481 >UniRef50_A2E6Y7 von Willebrand factor type A domain containing protein n=4 Tax=Trichomonas vaginalis RepID=A2E6Y7_TRIVA Length = 720 Score = 110 bits (274), Expect = 2e-22, Method: Composition-based stats. Identities = 30/157 (19%), Positives = 60/157 (38%), Gaps = 10/157 (6%) Query: 310 VHKDYDEQPRGPFIVCVDTSGSMGGFNEQCAKAFCLALMRIALAENRRCYIMLFSTEIVR 369 + + + F +D SGSM G + AK FCL ++ +L R I+ F Sbjct: 231 PQFEGKVEQKSEFYFIIDCSGSMSGSRIENAK-FCLNILIHSLPIGCRFSIIQFGNSYKE 289 Query: 370 Y----ELSGPQGIEQAIRFLSQQFR-GGTDLASCFRAIMERLQSREWFDADAVVISDFIA 424 + S GGTD+ S + ++ + + +++D Sbjct: 290 VVSICDYSNKNVKYAMSAIARINADMGGTDILSPLEYVFKKKLGKGFIRK-IFLLTDGEV 348 Query: 425 QRLPDDVTSKVKELQRVHQHRFHAVAMSAHGKPGIMR 461 D + S+V+ + +R A+ + + PG+++ Sbjct: 349 HN-SDMICSRVQ--KERENNRIFAIGLGSGADPGLIK 382 >UniRef50_Q55874 Uncharacterized protein sll0103 n=16 Tax=Cyanobacteria RepID=Y103_SYNY3 Length = 420 Score = 109 bits (273), Expect = 2e-22, Method: Composition-based stats. Identities = 31/172 (18%), Positives = 64/172 (37%), Gaps = 10/172 (5%) Query: 297 GESWREKVIERPVVHKDYDEQPRGPF--IVCVDTSGSMGGFNEQCAKAFCLALMRIALAE 354 G ++ + V K D R P + +D SGSM G + K+ L L+ E Sbjct: 17 GAPTSQRQLRIAVAAKADDHDRRLPLNLCLVLDHSGSMDGQPLETVKSAALGLIDRL-EE 75 Query: 355 NRRCYIMLFSTEIVRY----ELSGPQGIEQAIRFLSQQFRGGTDLASCFRAIM-ERLQSR 409 + R ++ F ++ I +AI + GGT + + + E + + Sbjct: 76 DDRLSVIAFDHRAKIVIENQQVRNGAAIAKAIE--RLKAEGGTAIDEGLKLGIQEAAKGK 133 Query: 410 EWFDADAVVISDFIAQRLPDDVTSKVKELQRVHQHRFHAVAMSAHGKPGIMR 461 E + +++D + +D K+ + ++ H + H ++ Sbjct: 134 EDRVSHIFLLTDGENEHGDNDRCLKLGTVASDYKLTVHTLGFGDHWNQDVLE 185 >UniRef50_A0EHG0 Chromosome undetermined scaffold_97, whole genome shotgun sequence n=2 Tax=Paramecium tetraurelia RepID=A0EHG0_PARTE Length = 533 Score = 109 bits (273), Expect = 2e-22, Method: Composition-based stats. Identities = 27/150 (18%), Positives = 57/150 (38%), Gaps = 7/150 (4%) Query: 317 QPRGPFIVCVDTSGSMGGFNEQCAKAFCLALMRIALAENRRCYIMLFSTEIVRYELSGPQ 376 + + +D SGSM G + + ++ +R C +++F ++ R Sbjct: 119 RQGVDLVCLIDHSGSMQGEKIKLVRKTLKQMLTFLQPCDRLC-LIMFDCKVYRLTRLMRV 177 Query: 377 GIEQAIRF----LSQQFRGGTDLASCFRAIMERLQSREWFD--ADAVVISDFIAQRLPDD 430 E +F S Q RGGTD+ + + + L+ R++ + + ++SD + + + Sbjct: 178 TQENVQKFRVAISSLQARGGTDIGNGMKMALSILKHRKYKNPVSAIFLLSDGVDEGAEER 237 Query: 431 VTSKVKELQRVHQHRFHAVAMSAHGKPGIM 460 V + + P IM Sbjct: 238 VRDDLIQYNIRDSFTIKTFGFGRDCCPKIM 267 >UniRef50_Q9U7P4 Putative uncharacterized protein (Fragment) n=1 Tax=Eufolliculina uhligi RepID=Q9U7P4_9CILI Length = 494 Score = 109 bits (272), Expect = 3e-22, Method: Composition-based stats. Identities = 28/197 (14%), Positives = 61/197 (30%), Gaps = 14/197 (7%) Query: 274 ITELEYEFYRRLVEKQLLTYRLHGESWREKVIERPVVHKDYDEQPRGPFIVCVDTSGSMG 333 F ++ L+ E +E P + + +D SGSM Sbjct: 47 DIAAYGVFAFNYLQ---LSPEKAQEIPCTINLESPAQTSEASRSG-VDIVCVIDVSGSMQ 102 Query: 334 GFNEQCAKAFCLALMRIALAENRRCYIMLFSTEIV---RYELSGPQGIEQAIRFL-SQQF 389 G Q + ++ +R C ++ FS + R P+G +Q + Sbjct: 103 GEKIQLVQTTLNFMVERLSPADRIC-LISFSNDATKISRLVQMSPKGKKQLKSMIPRLVA 161 Query: 390 RGGTDLASCFRAIMERLQSREWFD--ADAVVISDFIAQRLPDDVTSK---VKELQRVHQH 444 GGT++ ++ L+ R + + +++SD + + + + Sbjct: 162 SGGTNIVGGLEYGLQALRQRRTINQLSSIILLSDGQDNNGTTVLQRAKATMDSIVIRDDY 221 Query: 445 RFHAVAMSAHGKPGIMR 461 H ++ Sbjct: 222 SVHTFGYGHGHDSTLLN 238 >UniRef50_A1SAA4 Uncharacterized protein containing a von Willebrand factor type A (VWA) domain n=1 Tax=Shewanella amazonensis SB2B RepID=A1SAA4_SHEAM Length = 713 Score = 109 bits (272), Expect = 3e-22, Method: Composition-based stats. Identities = 40/194 (20%), Positives = 66/194 (34%), Gaps = 16/194 (8%) Query: 278 EYEFYRRLVEK-----QLLTYRLHGESWREKVIERPVVHKDYDEQPRG-PFIVCVDTSGS 331 + FY RL E ++TYR S + V D +G ++ +D SGS Sbjct: 283 DIVFYWRLQEGLPGRVDMVTYRDPKVSTKGTVKLTFTPGDDLGPVTQGRDWVFVLDKSGS 342 Query: 332 MGGFNEQCAKAFCLALMRIALAENRRCYIMLFSTEIVRYEL----SGPQGIEQAIRFLS- 386 M G + L + L R I+LF + I QA+ ++ Sbjct: 343 MNGKYATLVEGVRQGLGK--LPAQDRFRIILFDESTQEFSKGFVPVDSNNINQALAWVEG 400 Query: 387 QQFRGGTDLASCFRAIMERLQSREWFDADAVVISDFIAQRLPDDVTSKVKELQRVHQHRF 446 GTDL + + L + V+I+D +A + + EL + R Sbjct: 401 ISPGNGTDLYQGLKRALTPLDAD--RSTGVVLITDGVANVGVTE-KRRFLELMQQQDVRL 457 Query: 447 HAVAMSAHGKPGIM 460 M ++ Sbjct: 458 FTFIMGNSANTPLL 471 >UniRef50_UPI0000E80A5E PREDICTED: similar to calcium-activated chloride channel n=2 Tax=Gallus gallus RepID=UPI0000E80A5E Length = 928 Score = 109 bits (271), Expect = 3e-22, Method: Composition-based stats. Identities = 24/200 (12%), Positives = 65/200 (32%), Gaps = 17/200 (8%) Query: 294 RLHGESWREKVIERPVVHKDYDEQPRGPFIVCVDTSGSMGGF-NEQCAKAFCLALMRIAL 352 S ++ + + + +D SGSM + + + Sbjct: 280 DFRNSSVVNSLVPPFETTFELLQTQDRAVSLVLDVSGSMNTNNRITNLRTAAEVFLIQII 339 Query: 353 AENRRCYIMLFSTEIVR----YELSGPQGIEQAIRFLSQQFRGGTDLASCFRAIMERLQS 408 R I+ F + +++ ++ ++ L GGT + + +E + + Sbjct: 340 EIGSRVGIVTFESSAYEKSPLLQITSVATRQRLVQNLPTTAGGGTKICAGIEKGLEIITN 399 Query: 409 --REWFDADAVVISDFIAQRLPDDVTSKVKELQRVHQHRFHAVAMSAHGKPGIMRIFD-- 464 + ++ V+++D D S +E + H +A+ + + Sbjct: 400 AIGTTYGSEIVLLTDGE-----DSTMSLCREKVKESGAIIHTIALGPSAAKELEEFSNIT 454 Query: 465 ---HIWRFDTGMRSRLLRRW 481 ++ D + S+L+ + Sbjct: 455 GGLQLYAVDVDVPSKLVEAF 474 >UniRef50_B4VXM6 Vault protein inter-alpha-trypsin n=1 Tax=Microcoleus chthonoplastes PCC 7420 RepID=B4VXM6_9CYAN Length = 928 Score = 109 bits (271), Expect = 4e-22, Method: Composition-based stats. Identities = 29/236 (12%), Positives = 69/236 (29%), Gaps = 18/236 (7%) Query: 231 NDAQMETFRTMVREPATVPEQVDGLQQSDDILRLLPPELATLGITELEYEFYRRLVEKQL 290 ++ + ++ L + D I L Y+ + + Sbjct: 347 EIKEVHSPSHQIQIERQDQGMRVTLSRRDTIPN---------KDLILRYQVAGDRTQTTV 397 Query: 291 LTYRLHGESWREKVIERPVVHKDYDEQPRGPFIVCVDTSGSMGGFNEQCAKAFCLALMRI 350 L+ + + + + P+ + +DTSGS G + + Sbjct: 398 LSQADTRGGHFAVYLIPAIEYNPHQLVPK-DVVFLIDTSGSQSGEPLNKCQELMRRFING 456 Query: 351 ALAENRRCYIMLFSTEIV---RYELSGP-QGIEQAIRFL-SQQFRGGTDLASCFRAIMER 405 + I+ FS L+ Q A+ ++ GGT L +A++ Sbjct: 457 LNP-HDTFTIIDFSDTTRQLSPVPLANTVQNRNSAMNYINQLNASGGTQLRRGIQAVLNF 515 Query: 406 LQSREWFDADAVVISDFIAQRLPDDVTSKVKELQRVHQHRFHAVAMSAHGKPGIMR 461 + V+++D + + L+ +R H+ + ++ Sbjct: 516 PEVDPGRLRSIVLLTDGYIGNENQILAEVQRHLK--LGNRLHSFGAGSSVNRFLLN 569 >UniRef50_A3ZR58 Putative uncharacterized protein n=1 Tax=Blastopirellula marina DSM 3645 RepID=A3ZR58_9PLAN Length = 1032 Score = 108 bits (270), Expect = 4e-22, Method: Composition-based stats. Identities = 29/167 (17%), Positives = 59/167 (35%), Gaps = 6/167 (3%) Query: 296 HGESWREKVIERPVVHKDYDEQPRGPFIVCVDTSGSMGGFNEQCAKAFCLALMRIALAEN 355 + E+ +D P G ++ +D SGSM G Q + LA +R A+ Sbjct: 433 WANTKLEEASPVRFTIRDAKVVPVGALMLVLDKSGSMQGEKMQMTQGAALAAIR-AMGAA 491 Query: 356 RRCYIMLFSTEIVR-YELSGPQGIEQAIRFLS-QQFRGGTDLASCFRAIMERLQSREWFD 413 ++ F ++ R + + + GGT++ LQ+ + Sbjct: 492 DFAGVIGFDSQAQRIVPIRKVDNPGMFVAQVRKLSASGGTNMTPGVALGFRDLQNVDAGV 551 Query: 414 ADAVVISDFIAQRLPDDVTSKVKELQRVHQHRFHAVAMSAHGKPGIM 460 +V+SD + +++ + AVA+ + +M Sbjct: 552 KHMIVLSDGQTEPGN---VAQIASDMKKMGMTVSAVAVGSDADQKLM 595 Score = 44.4 bits (103), Expect = 0.010, Method: Composition-based stats. Identities = 19/118 (16%), Positives = 35/118 (29%), Gaps = 12/118 (10%) Query: 312 KDYDEQPRGPFIVCVDTSGSMGGFNEQCAKAFCLALMR-----IALAENRRCYIMLFSTE 366 + R I +D S +A ++R + R +++F + Sbjct: 74 QTLQRHDRMTVIYLLDQS---QSIPSDQRQAMVEYVVREVSAHRREEQGDRVGVIVFGDQ 130 Query: 367 IVRYELSGPQGIEQAIRF--LSQQFRGGTDLASCFRAIMERLQSREWFDADAVVISDF 422 + R L+ T+LA + LQS V++SD Sbjct: 131 PAIEFPPTDAPLPPLKRLAALALVETDETNLAGAMQLAQASLQSDSAGR--IVIVSDG 186 >UniRef50_Q9LMB7 F14D16.26 n=5 Tax=rosids RepID=Q9LMB7_ARATH Length = 736 Score = 108 bits (270), Expect = 4e-22, Method: Composition-based stats. Identities = 22/153 (14%), Positives = 48/153 (31%), Gaps = 11/153 (7%) Query: 319 RGPFIVCVDTSGSMGGFNEQCAKAFCLALMRIALAENRRCYIMLFSTEI----VRYELSG 374 + + VD S SM G + K + I+ FS + E Sbjct: 308 KREVVFVVDISKSMTGKPLEDVKNAISTALSKLDP-GDSFNIITFSNDTALFSTSMESVT 366 Query: 375 PQGIEQAIRFLSQQF--RGGTDLASCFRAIMERLQSREWFDADAVVISDFIAQRLPDDVT 432 +E+ I ++++ F GT++ +E L + ++D + + Sbjct: 367 SDAVERGIEWMNKNFVVADGTNMLPPLEKAVEMLSNTRGSIPMIFFVTDGSVED-ERHIC 425 Query: 433 SKVKELQRVHQH---RFHAVAMSAHGKPGIMRI 462 +K+ R H + +++ Sbjct: 426 DVMKKHLASAGSVFPRIHTFGLGVFCNHYFLQM 458 >UniRef50_UPI000180CCF8 PREDICTED: similar to PK-120 n=1 Tax=Ciona intestinalis RepID=UPI000180CCF8 Length = 864 Score = 108 bits (270), Expect = 5e-22, Method: Composition-based stats. Identities = 29/211 (13%), Positives = 67/211 (31%), Gaps = 24/211 (11%) Query: 290 LLTYRLHGESWREKVIERP-----VVHKDYDEQPRGPFIVCVDTSGSMGGFNEQCAKAFC 344 L+ Y + E +++ + + +D SGSM G K Sbjct: 263 LVNYDVTREELGGEILIKDGYFVHFFAPTNLPVIPKKVVFVIDVSGSMSGHKIVQTKEAL 322 Query: 345 LALMRIALAENRRCYIMLFSTEIVRYELS-----GPQGIEQAIRFL-SQQFRGGTDLASC 398 ++ E + I+ FS+ + + P I A + + S RGGT+ + Sbjct: 323 RTILDDLN-EIDQFNIITFSSTTNVWHPNEMVDVNPTNIRNAKKHVRSMYARGGTNFNAA 381 Query: 399 FRAIMERLQSREWFD-------ADAVVISDFIAQRLPDDVTSKVKELQRVHQHR--FHAV 449 ++ L++ + ++++D + + ++ R + Sbjct: 382 ALDGIQLLETISSNRTNTLEEASMMILLTDGQPTVGVTGNEAIRRNIRERVNGRYSIFCL 441 Query: 450 AMSAHGKPGIMRIFDHIWRFDTGMRSRLLRR 480 H + D I + G+ ++ Sbjct: 442 GFGQHLDHEFL---DQIASENKGLSRKIYND 469 >UniRef50_D1YYY2 Putative uncharacterized protein n=1 Tax=Methanocella paludicola SANAE RepID=D1YYY2_METPS Length = 716 Score = 108 bits (269), Expect = 5e-22, Method: Composition-based stats. Identities = 26/174 (14%), Positives = 64/174 (36%), Gaps = 7/174 (4%) Query: 291 LTYRLHGESWREKVIERPVVHKDYDEQPRGPFIVCVDTSGSMGGFNEQCAKAFCLALMRI 350 + + + P KD ++ G +++ +D SGSM G ++ A+ + + Sbjct: 139 MAHEDASGDTYFMAMLAPPASKDV-KKISGEYVILIDHSGSMAGPKKEAAEWAVGKFL-L 196 Query: 351 ALAENRRCYIMLFSTEIVRYEL----SGPQGIEQAIRFLSQQF-RGGTDLASCFRAIMER 405 L + + FS Y + ++ A+ F+ +F GGT++ ++ Sbjct: 197 GLGPDDWFTLGAFSNNTRWYSRLLAGATGDTVKNAVEFMKSKFEGGGTEMGVALEQALDI 256 Query: 406 LQSREWFDADAVVISDFIAQRLPDDVTSKVKELQRVHQHRFHAVAMSAHGKPGI 459 + + ++I+D + +E +R + + + A + Sbjct: 257 KRLKGDVSRHVLIITDAEVTDGGRILRLVDRESRRPDRRSISLLCIDAAPNSYL 310 >UniRef50_Q6PGW2 Zgc:112265 protein (Fragment) n=15 Tax=Clupeocephala RepID=Q6PGW2_DANRE Length = 927 Score = 108 bits (269), Expect = 5e-22, Method: Composition-based stats. Identities = 47/396 (11%), Positives = 116/396 (29%), Gaps = 52/396 (13%) Query: 115 DANSTITSALHTLFLQRWRLSLIVQATTLNQQLLEEEREQLLSEVQERMTLSGQLEPILA 174 F+ ++R+ + + + EE ++Q ++ R +G ++ + Sbjct: 67 QEIQFEVKIPKNAFISKFRMIIEGKTYDGVVKKKEEAQQQY-NKAVSRGESAGLIKSVGR 125 Query: 175 DNNTAAGRLWDMSAGQLKRGDYQLIVKYGEFLNEQPELKRLAEQLGRSREAKSIPRNDAQ 234 + + + ++ + Y E L +LG+ + Sbjct: 126 TLEDFKTSV---TVAANSKVTFE--LTYEELLKR---------RLGKYELLINAQPMQPV 171 Query: 235 ME--TFRTMVREPATVPEQVDGLQQSDDILRLLPPELATLGITELEYEFYRRL------- 285 + + P +V G + D+ + T + FY Sbjct: 172 ADFKIDVHIQENPGISFLEVKGDLNTGDLASAVKT---TRADKDAWVTFYPTRDQQTKCT 228 Query: 286 --VEKQL-----LTYRL-----HGESWREKVIERPVVHKDYDEQPRGPFIVCVDTSGSMG 333 E L +TY + GE + +D SGSM Sbjct: 229 NCAENGLNGDLIITYDVNRGNPKGEVQISNGYFVHYFAPSDVPHIPKNVVFIIDRSGSMH 288 Query: 334 GFNEQCAKAFCLALMRIALAENRRCYIMLFSTEI----VRYELSGPQGIEQAIRFL-SQQ 388 G + ++ L +++ E+ ++ F EI + E A F+ Q Sbjct: 289 GRKIRQTRSALLTILKDL-DEDDHFGLITFDAEIDFWRRELLQATKANRENAESFVKRIQ 347 Query: 389 FRGGTDLASCFRAIMERLQSREWFD--ADAVVISDFIAQRLPDDVTSKVKELQRVHQHRF 446 RG T++ A ++ + + ++++D ++ + ++ +F Sbjct: 348 DRGATNINDAVLAGVDMINRNPRKGTASILILLTDGDPTAGETNIEKIMANVKEAIGSKF 407 Query: 447 --HAVAMSAHGKPGIMRIFDHIWRFDTGMRSRLLRR 480 + + + + + + R+ Sbjct: 408 PLYCLGFGYDVNFDFLT---KMSLENNAVARRIYED 440 >UniRef50_UPI00016E8A41 UPI00016E8A41 related cluster n=3 Tax=Takifugu rubripes RepID=UPI00016E8A41 Length = 945 Score = 108 bits (269), Expect = 6e-22, Method: Composition-based stats. Identities = 22/171 (12%), Positives = 53/171 (30%), Gaps = 19/171 (11%) Query: 322 FIVCVDTSGSMGGFNEQCAKAFCLALMRIALAENRRCYIMLFSTEIVRYELSG-----PQ 376 + +DTS SM G + K ++ + FS+ + ++ P Sbjct: 300 VVFVIDTSASMLGKKIRQTKEALFTILGDLRP-GDHFNFISFSSRVKVWQPGRLVPVTPN 358 Query: 377 GIEQAIRFL-SQQFRGGTDLASCFRAIMERLQS-------REWFDADAVVISDFIAQRLP 428 + A +F+ GGT++ S + LQ + + ++D Sbjct: 359 NVRDAKKFIFMLPTSGGTNINSAIQTGSSLLQDYLSAQDASPNSVSLIIFLTDGQPTVGE 418 Query: 429 DDVTSKVKELQR--VHQHRFHAVAMSAHGKPGIMRIFDHIWRFDTGMRSRL 477 + + + + + + ++ + + GM R+ Sbjct: 419 VQSVTILGNTRSAVQGKFCIFTIGIGNDVDYRLLE---RMALDNCGMMRRI 466 >UniRef50_Q1D9B7 von Willebrand factor type A domain protein n=3 Tax=Cystobacterineae RepID=Q1D9B7_MYXXD Length = 476 Score = 108 bits (269), Expect = 6e-22, Method: Composition-based stats. Identities = 27/174 (15%), Positives = 63/174 (36%), Gaps = 8/174 (4%) Query: 297 GESWREKVIERPVVHKDYDEQPRGPFIVCVDTSGSMGGFNEQCAKAFCLALMRIALAENR 356 G S + ++ + +D SGSM G+ AK L+ + ++R Sbjct: 72 GTSEVFATFDLSGAQVPGAQRSPVNLALVIDRSGSMSGYKLAQAKQAARHLIGLLNDQDR 131 Query: 357 RCYIMLFSTEIVRYEL--SGPQGIEQAIRFL-SQQFRGGTDLASCFRAIMERLQ--SREW 411 I+ + +++ + E+ +++ GGT++ + A +L R + Sbjct: 132 -LAIIHYGSDVKSLPSLEATAANRERMFQYVDGIWDEGGTNIGAGLSAGRYQLSTAQRTY 190 Query: 412 FDADAVVISDFIAQRL--PDDVTSKVKELQRVHQHRFHAVAMSAHGKPGIMRIF 463 +++SD D+ +++ R A+ + +M+ F Sbjct: 191 GVNRLILMSDGQPTEGLTADEELTRMARELRATGLTLSAIGVGTDFNEDLMQAF 244 >UniRef50_UPI00017B0D26 UPI00017B0D26 related cluster n=2 Tax=Tetraodon nigroviridis RepID=UPI00017B0D26 Length = 856 Score = 107 bits (268), Expect = 7e-22, Method: Composition-based stats. Identities = 27/172 (15%), Positives = 56/172 (32%), Gaps = 21/172 (12%) Query: 322 FIVCVDTSGSMGGFNEQCAKAFCLALMRIALAENRRCYIMLFSTEIVRYE-----LSGPQ 376 + +DTS SM G + K L ++ +R + FS+ I ++ + P Sbjct: 229 VVFVIDTSASMLGKKMRQTKEALLTILGDLRPADR-FNFISFSSRIRVWQPGRLVPATPS 287 Query: 377 GIEQAIRFLSQQF-RGGTDLASCFRAIMERLQSREWFD-------ADAVVISDFIAQRL- 427 + A +F+ GGTD+ + L+ + + ++D Sbjct: 288 AVRDAKKFVVMLPTSGGTDIDGAIQTGSSLLRDHLSGRDAGPNSVSLIIFLTDGQPTVGE 347 Query: 428 --PDDVTSKVKELQRVHQHRFHAVAMSAHGKPGIMRIFDHIWRFDTGMRSRL 477 P + + R + M ++ + + GM R+ Sbjct: 348 VRPGAILGNARAAVRDK-FCIFTIGMGDDVDYRLLE---RMALDNCGMMRRI 395 >UniRef50_P79263 Inter-alpha-trypsin inhibitor heavy chain H4 n=3 Tax=Theria RepID=ITIH4_PIG Length = 921 Score = 107 bits (268), Expect = 7e-22, Method: Composition-based stats. Identities = 31/220 (14%), Positives = 70/220 (31%), Gaps = 19/220 (8%) Query: 273 GITELEYEFYRRLVEKQLLTYRLHGESWREKVIERPVVHKDYDEQPRGPFIVCVDTSGSM 332 T L+ F R + +T V + I +DTSGSM Sbjct: 227 QETVLDGNFIVRYDVNRTVTGGSIQIENGYFVHYFAPEVWSAIPKN---VIFVIDTSGSM 283 Query: 333 GGFNEQCAKAFCLALMRIALAENRRCYIMLFSTEI--VRYELSGPQGIEQAIRF-LSQQF 389 G Q + + ++ + + ++ FS E R + + +E+A + Sbjct: 284 RGRKIQQTREALIKILGDLGS-RDQFNLVSFSGEAPRRRAVAASAENVEEAKSYAAEIHA 342 Query: 390 RGGTDLASCFRAIMERLQSREWFD-------ADAVVISDFIAQRLPDDVTSKVKELQRV- 441 +GGT++ ++ L+ + ++++D + + K ++ Sbjct: 343 QGGTNINDAMLMAVQLLERANREELLPARSVTFIILLTDGDPTVGETNPSKIQKNVREAI 402 Query: 442 -HQHRFHAVAMSAHGKPGIMRIFDHIWRFDTGMRSRLLRR 480 QH + + + + G+ R+ Sbjct: 403 DGQHSLFCLGFGFDVPYAFLE---KMALENGGLARRIYED 439 >UniRef50_A5UYN7 von Willebrand factor, type A n=1 Tax=Roseiflexus sp. RS-1 RepID=A5UYN7_ROSS1 Length = 459 Score = 107 bits (268), Expect = 7e-22, Method: Composition-based stats. Identities = 25/170 (14%), Positives = 52/170 (30%), Gaps = 8/170 (4%) Query: 303 KVIERPVVHKDYDEQPRGPFIVCVDTSGSMGGFNEQCAKAFCLALMRIALAENRRCYIML 362 + +P + +D SGSM G AK + L + ++ Sbjct: 74 AKVPPEHALPREQHRPPLHLVAVLDVSGSMSGTKLASAKEALRQAL-HFLQDGDVFSLVT 132 Query: 363 FSTEIVRY---ELSGPQGIEQAIRFL-SQQFRGGTDLASCFRAIMERLQSREWFDADAVV 418 FS ++ + E + ++ L + G T L ++ Q + ++ Sbjct: 133 FSDQVQTHLKAESYAQRKRDKMENLLDEIRASGMTALDGGLAQGIDLGQKKRQATTLVLL 192 Query: 419 ISDFIAQRLPDDVTS--KVKELQRVHQHRFHAVAMSAHGKPGIM-RIFDH 465 +SD A D+ + R + + +M I + Sbjct: 193 LSDGQANVGETDLEKIGLRAQKARQSGLIVSTLGVGLDYNEALMVEIANQ 242 >UniRef50_A2E1S5 von Willebrand factor type A domain containing protein n=2 Tax=Trichomonas vaginalis RepID=A2E1S5_TRIVA Length = 688 Score = 107 bits (268), Expect = 7e-22, Method: Composition-based stats. Identities = 33/238 (13%), Positives = 83/238 (34%), Gaps = 13/238 (5%) Query: 227 SIPRNDAQMETFRTMVREPATVPEQVDGLQQSDDILRLLPPELATLGITELEYEFYRRLV 286 + F V+ + + E ++ ++ + +++ AT E + Sbjct: 144 PFTYQRPENFEFSIHVKTLSELKEIINSVRGTINVIDPHNVIFATKEFPNDESITIETQI 203 Query: 287 EKQLLTYRLHGESWREKVIERPVVHKDYDEQPRGPFIVCVDTSGSMGGFNEQCAKAFCLA 346 + + + + + + + F +D SGSM G Q AK CL Sbjct: 204 KDKDNNIAIWSDGYIAIST---FTYFETKVHSNSEFYFIIDCSGSMSGSCIQNAK-LCLN 259 Query: 347 LMRIALAENRRCYIMLFSTE----IVRYELSGPQGIEQAIRFLSQQF-RGGTDLASCFRA 401 + +L R I+ F ++ + + + E + + GGTD+ S + Sbjct: 260 IFMHSLPIGCRFSIIKFGSDYEVALHPCDYTDENVSEAMKQLNNIDAEMGGTDILSPLKY 319 Query: 402 IMERLQSREWFDADAVVISDFIAQRLPDDVTSKVKELQRVHQHRFHAVAMSAHGKPGI 459 +ME + + +++D +++ + +E +R ++ + + + Sbjct: 320 VMELTPKQGFIK-QVFLLTDGQDSN-TNELCALAQEN--RTNNRIFSIGIGSGADKDL 373 >UniRef50_Q14624 35 kDa inter-alpha-trypsin inhibitor heavy chain H4 n=38 Tax=Eutheria RepID=ITIH4_HUMAN Length = 930 Score = 107 bits (268), Expect = 8e-22, Method: Composition-based stats. Identities = 27/174 (15%), Positives = 62/174 (35%), Gaps = 20/174 (11%) Query: 322 FIVCVDTSGSMGGFNEQCAKAFCLALMRIALAENRRCYIMLFSTEIVRYELS----GPQG 377 + +D SGSM G Q + + ++ + +++FSTE ++ S + Sbjct: 275 VVFVIDKSGSMSGRKIQQTREALIKILDDLSP-RDQFNLIVFSTEATQWRPSLVPASAEN 333 Query: 378 IEQAIRFLS-QQFRGGTDLASCFRAIMERLQS-------REWFDADAVVISDFIA---QR 426 + +A F + Q GGT++ ++ L S E + ++++D + Sbjct: 334 VNKARSFAAGIQALGGTNINDAMLMAVQLLDSSNQEERLPEGSVSLIILLTDGDPTVGET 393 Query: 427 LPDDVTSKVKELQRVHQHRFHAVAMSAHGKPGIMRIFDHIWRFDTGMRSRLLRR 480 P + + V+E + + + + G+ R+ Sbjct: 394 NPRSIQNNVREAVSGRYS-LFCLGFGFDVSYAFLE---KLALDNGGLARRIHED 443 >UniRef50_C7PW75 Vault protein inter-alpha-trypsin domain protein n=1 Tax=Catenulispora acidiphila DSM 44928 RepID=C7PW75_CATAD Length = 1033 Score = 107 bits (268), Expect = 8e-22, Method: Composition-based stats. Identities = 29/172 (16%), Positives = 56/172 (32%), Gaps = 14/172 (8%) Query: 299 SWREKVIERPVVHKDYDEQPRGPFIVCVDTSGSMGGFNEQCAKAFCLALMRIALAENRRC 358 ++ +PR + +D SGSMGG+ A+ ++ AE+R Sbjct: 297 DIGTFLLTVLPPEPTGATRPR-DVALILDRSGSMGGWKMTAARRAAARIVDTLTAEDR-F 354 Query: 359 YIMLFSTEIVR--------YELSGPQGIEQAIRFLSQQFRGGTDLASCFRAIMERL-QSR 409 ++ F ++ E + + RGGT++ R L Sbjct: 355 AVLTFDDQMETPDGLPTGLSEATDRHRFRAVQHLATVDARGGTEMEPPLRRAATLLSDDN 414 Query: 410 EWFDADAVVISDFIAQRLPDDVTSKVKELQRVHQHRFHAVAMSAHGKPGIMR 461 D ++I+D +T+ +L R H V + ++ Sbjct: 415 PDRDRVLILITDGQVGNEDRLLTTLSPKLTH---IRVHTVGIDTAVNAAFLQ 463 >UniRef50_A9WI94 von Willebrand factor type A n=2 Tax=Chloroflexus RepID=A9WI94_CHLAA Length = 845 Score = 107 bits (268), Expect = 8e-22, Method: Composition-based stats. Identities = 28/182 (15%), Positives = 60/182 (32%), Gaps = 12/182 (6%) Query: 291 LTYRLHGESWREKVIERPVVHKDYDEQPRGP--FIVCVDTSGSMGGF----NEQCAKAFC 344 ++ L G + P++ R P + +D S SM AK Sbjct: 364 QSFTLGGYAETPLADALPLLMTPPPRPQRAPVSILFIIDRSASMSATFGISKFDMAKEAA 423 Query: 345 LALMRIALAENRRCYIMLFSTEIVRYELSGPQG-----IEQAIRFLSQQFRGGTDLASCF 399 + + +R ++ F TE + G +E + + GGT++ Sbjct: 424 ILSLTTLQPGDR-VGVLAFDTETIWTVPFRTVGEGVSLVELQDQIATMSLGGGTNIERAL 482 Query: 400 RAIMERLQSREWFDADAVVISDFIAQRLPDDVTSKVKELQRVHQHRFHAVAMSAHGKPGI 459 + L + + AV+++D + ++ E R Q +A+ + + Sbjct: 483 SVGLPALANEPYSTRHAVLLTDGRSYSNNYPRYQQLVETARAAQITLSTIAIGSDSDTEL 542 Query: 460 MR 461 + Sbjct: 543 LN 544 >UniRef50_Q67LZ3 Putative uncharacterized protein n=1 Tax=Symbiobacterium thermophilum RepID=Q67LZ3_SYMTH Length = 414 Score = 107 bits (267), Expect = 9e-22, Method: Composition-based stats. Identities = 30/170 (17%), Positives = 58/170 (34%), Gaps = 7/170 (4%) Query: 297 GESWREKVIERPVVHKDYDEQPRGPFIVCVDTSGSMGGFNEQCAKAFCLALMRIALAENR 356 G V + + +P VD SGSM G K L+ E+R Sbjct: 20 GGEVYLLVTVKAPRMPAPEGRPPLNLAAVVDRSGSMAGAALYFTKQALRFLVDQMAEEDR 79 Query: 357 RCYIMLFSTEIVRYELSGPQGIEQAIRFL--SQQFRGGTDLASCFRAIMERL--QSREWF 412 I+ + ++ S P + A+R L G T+L+ M+++ + Sbjct: 80 -LAIVTYDDQVHVPFPSQPVVQKDAVRLLVDGITAGGTTNLSGGLATGMQQIRPHAGPGR 138 Query: 413 DADAVVISDFIAQRL--PDDVTSKVKELQRVHQHRFHAVAMSAHGKPGIM 460 + ++++D +A DV + R + + H ++ Sbjct: 139 VSRVLLMTDGLANVGVTDPDVLAGWARAWREKGLAVSTMGVGPHFSEDLL 188 >UniRef50_B8HNU4 von Willebrand factor type A n=4 Tax=Chroococcales RepID=B8HNU4_CYAP4 Length = 421 Score = 107 bits (266), Expect = 1e-21, Method: Composition-based stats. Identities = 23/147 (15%), Positives = 53/147 (36%), Gaps = 9/147 (6%) Query: 324 VCVDTSGSMGGFNEQCAKAFCLALMRIALAENRRCYIMLFSTEIVRY----ELSGPQGIE 379 + +D SGSM G + K L+ L +R +++F ++ I+ Sbjct: 46 LILDHSGSMAGQPLETVKRAAQKLVDRLLPSDR-LAVIVFDHVAKVLIPNQPVTDRDKIK 104 Query: 380 QAIRFLSQQFRGGTDLASCFRAIM-ERLQSREWFDADAVVISDFIAQRLPDDVTSKVKEL 438 I L+ GGT + + + E + ++ + +++D + + ++ E Sbjct: 105 TRISHLA--AMGGTAIDEGLQLGLTELIAAKAGAISQIFLLTDGENEHGNNSRCLQLAEE 162 Query: 439 QRVHQHRFHAVAMSAHGKPG-IMRIFD 464 + + H + +I D Sbjct: 163 AAKENITLNTLGFGYHWNQDVLEQIAD 189 >UniRef50_Q9LN03 T6D22.13 n=2 Tax=Arabidopsis thaliana RepID=Q9LN03_ARATH Length = 641 Score = 107 bits (266), Expect = 1e-21, Method: Composition-based stats. Identities = 35/212 (16%), Positives = 70/212 (33%), Gaps = 29/212 (13%) Query: 261 ILRLLPPELATLGITELEYEFYRRLVEKQLLTYRLHGESWREKVIERPVVHKDYDEQPRG 320 + L PE++ L +F L+ + G V + Sbjct: 161 LEIKLFPEVSALAKPVSRADFAV------LVHLKAEG-----------VSDDARRARAPL 203 Query: 321 PFIVCVDTSGSMGGFNEQCAKAFCLALMRIALAENRRCYIMLFSTEIVR---YELSGPQG 377 I +D SGSM G + K +++ +R ++ FS+ R L G Sbjct: 204 DLITVLDVSGSMDGVKMELMKNAMSFVIQNLGETDR-LSVISFSSMARRLFPLRLMSETG 262 Query: 378 IEQAIRFL-SQQFRGGTDLASCFRAIMERLQSREWFDAD--AVVISDFIAQRLPDDVTSK 434 + A++ + S GGT++A + ++ R W + +++SD + Sbjct: 263 KQAAMQAVNSLVADGGTNIAEGLKIGARVIEGRRWKNPVSGMMLLSDGQDNFTFSHAGVR 322 Query: 435 VKE-----LQRVHQHRFHAVAMSAHGKPGIMR 461 ++ L + H + +M Sbjct: 323 LRTDYESLLPSSCRIPIHTFGFGSDHDAELMH 354 >UniRef50_UPI000180BC4A PREDICTED: similar to predicted protein n=1 Tax=Ciona intestinalis RepID=UPI000180BC4A Length = 1038 Score = 107 bits (266), Expect = 1e-21, Method: Composition-based stats. Identities = 34/273 (12%), Positives = 79/273 (28%), Gaps = 35/273 (12%) Query: 223 REAKSIPRNDAQMETFRTMVREPATVPEQVDGLQQSDDILRLLPPELATLGITELEYEFY 282 + + P ++ + R+ + P ++ + + L L+++++ Sbjct: 98 SQLEFDPSFKQKVNYGQVCERKSSNSP--LNSTKLQNGFKNACIQNYQNL--PSLKWQYF 153 Query: 283 RRLVEKQLLTYRLHGESWREKVIERPVVHKDYDEQPRGPFIVCVDTSGSMGGFNEQCAKA 342 L L + + ++ +D SGSMG N AK Sbjct: 154 GSEQGVTTLFPSLRATDCGSFDNRCRPWYVQANVPKPKQIVIVIDKSGSMGVTNMNLAKE 213 Query: 343 FCLALMRIALAENRRCYIMLFSTEIVRYELS--------------GPQGIEQAIRFL-SQ 387 +++ ++R +M FS+ V ++ + PQ ++ F+ + Sbjct: 214 AAKSVVNTLNPQDR-FAVMAFSSIFVPFQSTVASDQCFATTFADASPQNKKKVEDFVDTI 272 Query: 388 QFRGGTDLASCFRAIMERLQSREWF-------------DADAVVISDFIAQRLPDDVTSK 434 GGT+ A + Q D + +SD I + S Sbjct: 273 SSGGGTNYAPALQKAFSFFQQEPSVSDFNIKKIDPSEIDRVILFMSDGIPNDPGSTILSA 332 Query: 435 VKELQRVHQHRFH--AVAMSAHGKPGIMRIFDH 465 + + + + + Sbjct: 333 QIRANEQLNNSVIILTYGLGNADFGVLRNMATN 365 >UniRef50_B2HDT6 Putative uncharacterized protein n=3 Tax=Mycobacterium RepID=B2HDT6_MYCMM Length = 772 Score = 107 bits (266), Expect = 1e-21, Method: Composition-based stats. Identities = 38/202 (18%), Positives = 66/202 (32%), Gaps = 17/202 (8%) Query: 273 GITELEYEFYR-RLVEKQLLTYRLHGESWREKV---IERPVVHKDYDEQPRGPFIVCVDT 328 L + + RL LL G + +V +V +D Sbjct: 257 RDFVLRFRLDQGRLSSSALLVADAAGADATDAEEGTWSLTLVPPAEPSSAPRDVVVVLDR 316 Query: 329 SGSMGGFNEQCAKAFCLALMRIALAENRRCYIMLFSTEIVRYE-------LSGPQGIEQA 381 SGSMGG+ A+ ++ + A +R ++ F I + + A Sbjct: 317 SGSMGGWKMVAARRAAGRIVDMLDAGDR-FCVLAFDDRIETPPAMPDGLVPASDRNRFAA 375 Query: 382 IRFL-SQQFRGGTDLASCFRAIMERLQSREW-FDADAVVISDFIAQRLPDDVTSKVKELQ 439 +L S + RGGT +A +E L A V+++D + S + Sbjct: 376 SSWLGSLRSRGGTVMAQPLTNAVEMLADSGEDRQASVVLVTDGQISGEDHLLRSLAPVVG 435 Query: 440 RVHQHRFHAVAMSAHGKPGIMR 461 R R + V + G + Sbjct: 436 R---TRIYCVGVDRAVNAGFLE 454 >UniRef50_A9V370 Predicted protein n=1 Tax=Monosiga brevicollis RepID=A9V370_MONBE Length = 471 Score = 107 bits (266), Expect = 1e-21, Method: Composition-based stats. Identities = 28/187 (14%), Positives = 62/187 (33%), Gaps = 14/187 (7%) Query: 288 KQLLTYRLHGESWREKVIERPVVHKDYDE--QPRGPFIVCVDTSGSMGGFNEQCAKAFCL 345 + L Y G I + D++ +P + +D SGSM G + ++ Sbjct: 24 RPLWQYAEIGARESSAYISCRLTAPDFEPVERPAIDLVAVIDVSGSMAGQKLKMVQSTLE 83 Query: 346 ALMRIALAENRRCYIMLFSTEIVRY----ELSGPQGIEQAIRFLSQQFRGGTDLASCFRA 401 LMR +R ++ F +++ ++ + T+L+ Sbjct: 84 FLMRNLKDTDR-FALVTFDSDVKTVFDLRPMTTAHKEACLADVQKLRAGSCTNLSGGLFR 142 Query: 402 IMERLQSREWFD---ADAVVISDFIAQRLPDDVTSKVKELQRVHQH----RFHAVAMSAH 454 +E +Q R + ++++D IA D + L+ + + Sbjct: 143 GVELMQQRGATKGAVSSILLMTDGIANEGVRDKDDMCRALRGLMGPAPDYTIYTFGYGKD 202 Query: 455 GKPGIMR 461 ++R Sbjct: 203 HNENMLR 209 >UniRef50_Q23JA0 von Willebrand factor type A domain containing protein n=2 Tax=Tetrahymena thermophila SB210 RepID=Q23JA0_TETTH Length = 1049 Score = 106 bits (265), Expect = 2e-21, Method: Composition-based stats. Identities = 28/155 (18%), Positives = 64/155 (41%), Gaps = 11/155 (7%) Query: 313 DYDEQPRGPFIVCVDTSGSMGGFNEQCAKAFCLALMRIALAENRRCYIMLFSTEIVRY-- 370 D+ R FI +D SGSM G + A L L +L + ++ F + + Sbjct: 304 DHLNSSRSEFIFLLDRSGSMSGQPIRRA-CEALTLFLKSLPNDSYFNVISFGSSFDKLFP 362 Query: 371 --ELSGPQGIEQAIRFLSQQFR--GGTDLASCFRAIMERLQSREWFDADAVVISDFIAQR 426 + +E+AI +S+ GGT++ + + + + + + + +++D Sbjct: 363 SSTKYTSESLEKAILLISKYQADLGGTEIYNPLNNVFVQNKIQGY-NKQIFLLTDGEVD- 420 Query: 427 LPDDVTSKVKELQRVHQHRFHAVAMSAHGKPGIMR 461 P V +K+ + +R H++ + +++ Sbjct: 421 SPQQVVRLIKKNNK--YNRVHSIGFGSGADQYLIK 453 >UniRef50_A5UWS5 von Willebrand factor, type A n=2 Tax=Roseiflexus RepID=A5UWS5_ROSS1 Length = 851 Score = 106 bits (265), Expect = 2e-21, Method: Composition-based stats. Identities = 27/181 (14%), Positives = 56/181 (30%), Gaps = 12/181 (6%) Query: 292 TYRLHGESWREKVIERPVVHKDYDEQPRGP--FIVCVDTSGSMGGF----NEQCAKAFCL 345 ++ L PV R ++ +D S SMG AK + Sbjct: 362 SFTLGAYKNTPLEETLPVEMTPPPRPERSDTTLLLIIDQSASMGPETGISKFTMAKEAAI 421 Query: 346 ALMRIALAENRRCYIMLFSTEIVRY----ELSGPQGIEQAIRFLS-QQFRGGTDLASCFR 400 + +L + R ++ F + + R +S GGTD+ + + Sbjct: 422 -MATESLRQEDRIGVLAFDVSTRWVVDFQPVGVGLSLADVQRRISTLPLGGGTDIYNALQ 480 Query: 401 AIMERLQSREWFDADAVVISDFIAQRLPDDVTSKVKELQRVHQHRFHAVAMSAHGKPGIM 460 + L + AV+++D + + E R +A+ ++ Sbjct: 481 EGLPALAQQPGRVRHAVLLTDGRSFTDDRQAYRMLLEEARSQNITLSTIAIGTDADINLL 540 Query: 461 R 461 + Sbjct: 541 Q 541 >UniRef50_Q9M1S2 Putative uncharacterized protein T5N23_140 n=5 Tax=Arabidopsis thaliana RepID=Q9M1S2_ARATH Length = 676 Score = 106 bits (264), Expect = 2e-21, Method: Composition-based stats. Identities = 28/155 (18%), Positives = 58/155 (37%), Gaps = 9/155 (5%) Query: 313 DYDEQPRGPFIVCVDTSGSMGGFNEQCAKAFCLALMRIALAENRRCYIMLFSTEIVRY-E 371 + + +D SGSMGG K +++ + +R ++ FS+ R Sbjct: 236 SQYRRAPIDLVTVLDISGSMGGTKLALLKRAMGFVIQNLGSSDR-LSVIAFSSTARRLFP 294 Query: 372 LS--GPQGIEQAIRFL-SQQFRGGTDLASCFRAIMERLQSREWFD--ADAVVISDFIAQR 426 L+ G + A++ + S GGT++ R + ++ R + A +++SD Sbjct: 295 LTRMSDAGRQLALQAVNSLVANGGTNIVDGLRKGAKVMEDRLERNSVASIILLSDGRDTY 354 Query: 427 LPDDVTSKVKELQRVHQHRFHAVAMSAHGKPGIMR 461 + K + Q H+ + +M Sbjct: 355 TTNHPDPSYKVML--PQISVHSFGFGSDHDASVMH 387 >UniRef50_C0Z595 Putative uncharacterized protein n=1 Tax=Brevibacillus brevis NBRC 100599 RepID=C0Z595_BREBN Length = 947 Score = 105 bits (263), Expect = 2e-21, Method: Composition-based stats. Identities = 25/174 (14%), Positives = 63/174 (36%), Gaps = 14/174 (8%) Query: 298 ESWREKVIERPVVHKDYDEQPRGPFIVCVDTSGSMGGF-----NEQCAKAFCLALMRIAL 352 ++ E+ + + K ++ P + +D SGSM A+ + + Sbjct: 384 QTPIEEALPVHMDLKGKEQLPSLGLQLVIDKSGSMSSDARGADKMALAREAAIRATTMMN 443 Query: 353 AENRRCYIMLFSTEIVRYELSGPQGIEQA----IRFLSQQFRGGTDLASCFRAIMERLQS 408 A++ ++ F +++ PQ + + + Q GGTD+ + ER+++ Sbjct: 444 AQD-YIGVIAFDD--TPWDVVAPQSVTKLDEIQQQISRIQADGGTDIFPALQLGYERVKA 500 Query: 409 REWFDADAVVISDFIAQRLPDDVTSKVKELQRVHQHRFHAVAMSAHGKPGIMRI 462 ++++D + D + + VA+ G++ + Sbjct: 501 MNTQRKHVILLTDGQSALDDDY--EGLLQQMTAENITVSTVALGDDSDRGLLEM 552 Score = 55.5 bits (132), Expect = 5e-06, Method: Composition-based stats. Identities = 21/134 (15%), Positives = 47/134 (35%), Gaps = 18/134 (13%) Query: 322 FIVCVDTSGSMGGFNEQCAKAFCLALMRIALAEN---RRCYIMLFSTE-IVRYELSGPQG 377 + VD S SM L+ +R A+ + + ++ E V ++ Q Sbjct: 68 IVFVVDRSASMKDDPR------VLSFLREAVGQKQAADKYAVIAIGAEAAVDQPMTIRQE 121 Query: 378 IEQAIRFLSQQFRGGTDLASCFRAIMERLQSREWFDADAVVISDFIAQRLPDDVTSKVKE 437 ++ +++ T+LA R + + V+++D + ++ Sbjct: 122 VQPLGVDVNRNA---TNLAEGIRLASAMIPTN--ARGKVVLLTDGLETSGD---AARQTR 173 Query: 438 LQRVHQHRFHAVAM 451 L R AV++ Sbjct: 174 LARERGIAVEAVSL 187 >UniRef50_B8HUC4 von Willebrand factor type A n=7 Tax=Bacteria RepID=B8HUC4_CYAP4 Length = 254 Score = 105 bits (263), Expect = 2e-21, Method: Composition-based stats. Identities = 34/186 (18%), Positives = 60/186 (32%), Gaps = 18/186 (9%) Query: 310 VHKDYDEQPRGPFIVCVDTSGSMGGFNEQCAKAFC-----LALMRIALAENRRCYIMLFS 364 + +PR P I+ +DTSGSM G Q A L ++ I+ F Sbjct: 42 DDFANNPEPRCPVILLLDTSGSMRGTPIQELNAGVELFRDELLADALASKRVEVAIVGFG 101 Query: 365 TEIVRYELSGPQGIEQAIRFLSQQFRGGTDLASCFRAIMERLQSRE---------WFDAD 415 V + + T L + ++ LQSR+ ++ Sbjct: 102 PVQVIQDFVTADYFNPP----KLRAEADTPLGAAIETALDLLQSRKDTYKANGIAYYRPW 157 Query: 416 AVVISDFIAQRLPDDVTSKVKELQRVHQHRFHAVAMSAHGKPGIMRIFDHIWRFDTGMRS 475 +I+D +VKE + F ++ + + +I +R Sbjct: 158 VFLITDGGPTDHWQTAARRVKEGESKKSFAFFSIGVEGARIDILAQISTRTPLKLKELRF 217 Query: 476 RLLRRW 481 R L +W Sbjct: 218 RDLFQW 223 >UniRef50_B8G7Y1 von Willebrand factor type A n=3 Tax=Chloroflexus RepID=B8G7Y1_CHLAD Length = 914 Score = 105 bits (263), Expect = 3e-21, Method: Composition-based stats. Identities = 27/178 (15%), Positives = 62/178 (34%), Gaps = 17/178 (9%) Query: 296 HGESWREKVIERPVVHKDYDEQPRGPFIVCVDTSGSMGG------FNEQCAKAFCLALMR 349 + E ++ + +E+P ++ +D SGSM A+ Sbjct: 387 WRRTLLEPILPVALDPPLREERPDLALVLVIDRSGSMRELVDDGRTQLDLAREAVYQ-AS 445 Query: 350 IALAENRRCYIMLFSTEIVRYELSGPQ----GIEQAIRFLSQQFRGGTDLASCFRAIMER 405 L + + ++ F + P IE A+ GGT++ S E Sbjct: 446 RGLTQRDQIALIAFDSIADTLLPLQPLPGLFTIEDALS--RLVAGGGTNIRSGIALAAET 503 Query: 406 LQSREWFDADAVVISDFIAQRLPDDVTSKVKELQRVHQHRFHAVAMSAHGKPGIMRIF 463 + + + ++++D +++ D+ + + R A+A+ P + R+ Sbjct: 504 IATSQARIRHVILLTDGVSETEYADLVADL----RAQGITVSAIAIGLDTDPALERVA 557 Score = 42.4 bits (98), Expect = 0.036, Method: Composition-based stats. Identities = 18/109 (16%), Positives = 36/109 (33%), Gaps = 11/109 (10%) Query: 323 IVCVDTSGSMGGFNEQCAKAFCLALMRIAL---AENRRCYIMLFSTEIVRYELSGPQGIE 379 + +D S S+ +A L + A + R +++F ++ P Sbjct: 72 VFLIDASDSIA----PVQRAAILDYLAQAQANADPDDRMAVVVFGARAAVERIAEPPY-- 125 Query: 380 QAIRFLSQQFRGGTDLASCFRAIMERLQSREWFDADAVVISDFIAQRLP 428 R + F T++A + L + V++SD A Sbjct: 126 PLTRIDTPVFSSRTNIADAIELGLALLPAALHQR--LVLLSDGGANEGD 172 >UniRef50_B9GK57 Predicted protein (Fragment) n=2 Tax=Populus trichocarpa RepID=B9GK57_POPTR Length = 595 Score = 105 bits (263), Expect = 3e-21, Method: Composition-based stats. Identities = 35/181 (19%), Positives = 60/181 (33%), Gaps = 21/181 (11%) Query: 303 KVIERPVVHKDYDEQPRGPFIVCVDTSGSMGGFNEQCAKAFCLALMRIALAENRRCYIML 362 +V+ P+ + + + +D SGSM G K +++ +R I+ Sbjct: 138 RVLAPPLDNTLPHHRAPIDIVNVLDVSGSMAG-KLILLKRAVNFIIQNLGPSDR-LSIVT 195 Query: 363 FSTEIVR---YELSGPQGIEQAIRFL-SQQFRGGTDLASCFRAIMERLQSREWFD--ADA 416 FS+ R G E AI + S GGT++ + R + L+ R + A Sbjct: 196 FSSSARRILPLRTMSGSGREDAISVVNSLSATGGTNIVAGLRKGVRVLEERRQHNSVASI 255 Query: 417 VVISDFIAQ--RLPDDVTSKVK----------ELQRVHQHRFHAVAMSAHGKPGIMR-IF 463 +++SD + +K E R H M I Sbjct: 256 ILLSDGCDTQSHSTHNRLEYLKLIFPSNNASGEESRQPTFPIHTFGFGLDHDSAAMHAIS 315 Query: 464 D 464 D Sbjct: 316 D 316 >UniRef50_A3MT69 VWA containing CoxE family protein n=4 Tax=Pyrobaculum RepID=A3MT69_PYRCJ Length = 415 Score = 105 bits (263), Expect = 3e-21, Method: Composition-based stats. Identities = 57/323 (17%), Positives = 90/323 (27%), Gaps = 39/323 (12%) Query: 160 QERMTLSGQLEPILADNNTAAGRLWDMSAGQLKRGDYQLIVKYGEFLNEQPE-------- 211 + + L E L GR W G+ + + ++ L P Sbjct: 86 EAAVRLLRAFESYLLSLERR-GRAW---FGRGSQEAWVEAMRQLRRLFGDPADVSELHRV 141 Query: 212 LKRLAEQLGRSREAKSIPRNDAQMETFRTMVREPATVPEQVDGLQQSDDILRLLPPELAT 271 K+L E LGR R + R D + + E Sbjct: 142 FKKLGEVLGRGRRGDPASLALSTASDPRRARLASLLAKAVDLSALLGDPLGDVGRVEGPG 201 Query: 272 LGITELEYEFYRRLVEKQLLTYRL---HGESWREKVIERPVVHKDYDEQPRGP--FIVCV 326 F R K+ + Y G + RG + V Sbjct: 202 AEFEVAHGSFARV---KRAVAYARALFVGALPVFLHKAASSTLPVRRPRARGDSGVFLLV 258 Query: 327 DTSGSMGG-----FNEQCAKAFCLALMRIALAENRRCYIMLFSTEIVRYELSGPQGIEQA 381 D SGSM A AF LA+++ R F E+ R E + Sbjct: 259 DKSGSMYSAVGGVEKIALATAFALAVLKRYKRARLRF----FDVEVHRVE-----ELGDL 309 Query: 382 IRFLSQ-QFRGGTDLASCFRAIMERLQSREWFDADAVVISDFIAQRLPDDVTSKVKELQR 440 + LS+ GGTD++ A E VV++D + + + R Sbjct: 310 VDVLSRAWAGGGTDISRAVEAAAEEAARERLRGYSLVVVTDGEDDAFSPAAAREARAVFR 369 Query: 441 VHQHRFHAVAMSAHGKPGIMRIF 463 V + G+ ++F Sbjct: 370 E----VLFVVVGERRLSGVRQVF 388 >UniRef50_Q54DV3 von Willebrand factor A domain-containing protein DDB_G0292016 n=1 Tax=Dictyostelium discoideum RepID=Y2016_DICDI Length = 918 Score = 105 bits (262), Expect = 3e-21, Method: Composition-based stats. Identities = 27/172 (15%), Positives = 55/172 (31%), Gaps = 11/172 (6%) Query: 294 RLHGESWREKVIERPVVHKDYDEQPRGPFIVCVDTSGSMGGFNEQCAKAFCLALMRIALA 353 + + + FI +D SGSM G + A+ ++ +L Sbjct: 272 DDKSYATAINFYPSFKNVNPDEVYQKSEFIFLIDCSGSMSGQSINKARRAM-EIIIRSLN 330 Query: 354 ENRRCYIMLFSTEIVRY----ELSGPQGIEQAIRFLSQQFR--GGTDLASCFRAIMERLQ 407 E + I F + + + + +E A F+ + GGT+L I+ Sbjct: 331 EQHKVNIYCFGSSFNKVFDKSRVYNDETLEIAGSFVEKISANLGGTELLPPMVDILSSPN 390 Query: 408 SREWFDADAVVISDFIAQRLPDDVTSKVKELQRVHQHRFHAVAMSAHGKPGI 459 E+ +++D D + V + + R + A + Sbjct: 391 DPEYPR-QVFILTDGEISER-DKLIDYV--AKEANTTRIFTYGIGASVDQEL 438 >UniRef50_A7NJ01 von Willebrand factor type A n=2 Tax=Roseiflexus RepID=A7NJ01_ROSCS Length = 972 Score = 105 bits (262), Expect = 4e-21, Method: Composition-based stats. Identities = 29/184 (15%), Positives = 62/184 (33%), Gaps = 16/184 (8%) Query: 292 TYRLHGESWREKVIERPVVHKDYDEQPRGPF--IVCVDTSGSMGG-------FNEQCAKA 342 ++ G PV+ D + + ++ +D SGSM AK Sbjct: 379 SFGAGGYRRTPLEPVLPVLLDPLDTKQQPDLALVMVIDRSGSMSELVGGSRRNRLDLAKE 438 Query: 343 FCLALMRIALAENRRCYIMLFSTEIV-RYELSGPQGIEQAIRFLSQQF-RGGTDLASCFR 400 + L + +++F L + + R L GGT++ Sbjct: 439 AVYQ-ASLGLTPIDQVGLVVFDDAANWVLPLQRLPSVVEIERALGSFGIGGGTNIRPGIE 497 Query: 401 AIMERLQSREWFDADAVVISDFIAQRLPDDVTSKVKELQRVHQHRFHAVAMSAHGKPGIM 460 + L S + ++++D IA+ D+ +++ R VA+ P ++ Sbjct: 498 QAAQALASADAKVKHVILLTDGIAESNYSDLIAQM----RAAGVTISTVAIGEDANPNLV 553 Query: 461 RIFD 464 + + Sbjct: 554 DVAN 557 Score = 54.8 bits (130), Expect = 7e-06, Method: Composition-based stats. Identities = 24/161 (14%), Positives = 52/161 (32%), Gaps = 10/161 (6%) Query: 285 LVEKQLLTYRLHGESWREKVIERPVVHKDYDEQPRGPF-----IVCVDTSGSMGGFNEQC 339 L ++L +R +I + Q P + VD S SM + Sbjct: 27 LTPRRLAPWRFWSSLVLRSIILLALTLALAGTQIVLPVRELTTVFLVDVSDSMTPAQRER 86 Query: 340 AKAFCLALMRIALAENRRCYIMLFSTEIVRYELSGPQGIEQAIRFLSQQFRGGTDLASCF 399 A + + A+ + +++F + GP G + + R T+L Sbjct: 87 ALQYVNDALA-AMPPGDQAAVVVFGDNALVERAPGPIGPLSRLTSVPITTR--TNLQEAV 143 Query: 400 RAIMERLQSREWFDADAVVISDFIAQRLPDDVTSKVKELQR 440 + + + V+ISD +++ +++ Sbjct: 144 QLGLALFPAETQKR--LVLISDGGENAGRVADAAQLAAIRK 182 >UniRef50_Q7SGD8 Predicted protein n=4 Tax=Sordariales RepID=Q7SGD8_NEUCR Length = 1086 Score = 105 bits (262), Expect = 4e-21, Method: Composition-based stats. Identities = 34/220 (15%), Positives = 71/220 (32%), Gaps = 15/220 (6%) Query: 254 GLQQSDDILRLLPPELATLGITELEYEFYRRLVE----KQLLTYRLHGESWREKVIERPV 309 G D+ LG EL +F ++V + H + ++ + + Sbjct: 224 GAAAGTDMSLQKASATLALGTAELAQDFILQVVATNTGNPIALLETHTDIPHQRALMATL 283 Query: 310 VHKDYDEQPRGPFIVCVDTSGSMGGFNEQCAKAFCLALMRIALAENRRCYIMLFSTEIVR 369 V K R + D SGSMGG + K+ ++ ++ + I F + Sbjct: 284 VPKFNLPSTRPEIVFVCDRSGSMGGARIEGLKSALRIFLK-SIPVGAKFNICSFGSTFEF 342 Query: 370 Y-----ELSGPQGIEQAIRFLS-QQFR-GGTDLASCFRAIMERLQSREWFDADAVVISDF 422 + + A+ ++S GGT++ A E + D + +++D Sbjct: 343 LFSDGSRSYDHESLRLAMDYVSRMDADLGGTEMYQPLEAAFE--KRYNDMDLEVFLLTDG 400 Query: 423 IAQRLPDDVTSKVKELQRVHQ-HRFHAVAMSAHGKPGIMR 461 T K++ R + + ++ Sbjct: 401 EIWNQEHLFTMINKKVSESQGAIRLFTLGIGNDVSHALIE 440 >UniRef50_Q1NTK1 Von Willebrand factor, type A n=2 Tax=delta proteobacterium MLMS-1 RepID=Q1NTK1_9DELT Length = 771 Score = 105 bits (261), Expect = 4e-21, Method: Composition-based stats. Identities = 24/184 (13%), Positives = 57/184 (30%), Gaps = 23/184 (12%) Query: 294 RLHGESWREKVIERPVVHKDYDEQPRGPFIVCVDTSGSMGGFNEQCAKAFCLALMRIALA 353 + ++ + +D SGSM G + AK ++ + Sbjct: 239 ERDPNGGYVALASFSPRLPPSEQPIPTSLAILLDCSGSMAGDSIAQAKQAISDMLNLLRP 298 Query: 354 ENRRCYIMLFSTEIVRY-------ELSGPQGIEQAIRFLSQQFRGGTDLASCFRAIMERL 406 E+ C +++F +E+ + + + +AIR + GGT++ ++ Sbjct: 299 ED-YCNLIMFGSEVKSVFPCQVAADKTNITTLRRAIRAIDAD-MGGTEMQKALVETLKMS 356 Query: 407 QSREWFDADAV---------VISDFIAQRLPDDVTSKVKELQRVHQHRFHAVAMSAHGKP 457 + + + V +I+D ++ HR V + Sbjct: 357 PIYKPPEVEVVPARISRNILLITDGQVWGD-----KQILRRMAKSDHRVFTVGVGGAVCE 411 Query: 458 GIMR 461 + Sbjct: 412 AFLH 415 >UniRef50_Q114A2 von Willebrand factor, type A n=3 Tax=Cyanobacteria RepID=Q114A2_TRIEI Length = 1204 Score = 105 bits (261), Expect = 5e-21, Method: Composition-based stats. Identities = 35/203 (17%), Positives = 69/203 (33%), Gaps = 21/203 (10%) Query: 295 LHGESWREKVIERPVVHKDYDEQPRGPFIVCVDTSGSMGGFNE-QCAKAFCLALMRIALA 353 L E P + + R P I+ +DTS SM G + + + + Sbjct: 606 LEPEFVENPEQRLPEPEFVENPENRCPIILLLDTSYSMSGEAITELNQGVKIFQASVKED 665 Query: 354 E----NRRCYIMLFSTEIVRYELSGPQGIEQAIRFL--SQQFRGGTDLASCFRAIMERLQ 407 E ++ F++EI Q +F+ + + G T + +E L+ Sbjct: 666 ELASLRVEIAVITFNSEIEVV-----QDFVTVDKFIPKTLEASGVTHMGKAIEKALELLE 720 Query: 408 SRE---------WFDADAVVISDFIAQRLPDDVTSKVKELQRVHQHRFHAVAMSAHGKPG 458 R+ ++ +I+D D K++E + + F AV + Sbjct: 721 KRKQDYKNSDIQYYRPWIFLITDGQPTDTWQDAAKKIEEAETNRKLLFFAVGVRDADMET 780 Query: 459 IMRIFDHIWRFDTGMRSRLLRRW 481 + I + G+ + L +W Sbjct: 781 LSEISVCPPKKLNGLDFQSLFKW 803 >UniRef50_UPI000155CC23 PREDICTED: similar to ITI-like protein n=3 Tax=Amniota RepID=UPI000155CC23 Length = 1374 Score = 105 bits (261), Expect = 5e-21, Method: Composition-based stats. Identities = 21/182 (11%), Positives = 50/182 (27%), Gaps = 21/182 (11%) Query: 316 EQPRGPFIVCVDTSGSMGGFNEQCAKAFCLALMRIALAENRRCYIMLFSTEIVRYE---- 371 + + +D SGSM G + K ++ + I+ FS + ++ Sbjct: 314 PPVQKNVVFVIDVSGSMFGTKMKQTKKAMHVILNDLH-HDDYFNIVTFSDAVSVWKASGS 372 Query: 372 --LSGPQGIEQAIRFLSQQFRGGTDLASCFRAIMERLQSREW---------FDADAVVIS 420 + P + + G TD+ + + ++ Sbjct: 373 IQATPPNIKSAKVYVNKMEADGWTDINAALLVAASVFNQSTGETGRGKGLKKIPLIIFLT 432 Query: 421 DFIAQRLPDDVTSKVKELQR--VHQHRFHAVAMSAHGKPGIMRIFDHIWRFDTGMRSRLL 478 D A + + ++ +A +MR + + G+ R+ Sbjct: 433 DGEATAGVTVASRILSNAKQSLKGNISLFGLAFGDDADYHLMR---RLSLENRGVARRIY 489 Query: 479 RR 480 Sbjct: 490 ED 491 >UniRef50_UPI00006CDA4D Glutathionylspermidine synthase family protein n=1 Tax=Tetrahymena thermophila RepID=UPI00006CDA4D Length = 1547 Score = 105 bits (261), Expect = 5e-21, Method: Composition-based stats. Identities = 31/160 (19%), Positives = 65/160 (40%), Gaps = 13/160 (8%) Query: 309 VVHKDYDEQPRGPFIVCVDTSGSMGGFNEQCA-KAFCLALMRIALAENRRCYIMLFSTEI 367 D+ R FI +D SGSM G A +A L L +L + ++ F + Sbjct: 295 QELVDHLNSSRSEFIFLLDRSGSMSGQPIDRACQALTLFL--KSLPTDSYFNVISFGSSF 352 Query: 368 VRY----ELSGPQGIEQAIRFLSQQFR--GGTDLASCFRAIMERLQSREWFDADAVVISD 421 E Q +E+AI +S+ GGT++ + + + + + + + +++D Sbjct: 353 KLLFPQSEKYNSQSLEKAISNISKYKADLGGTEIYKPLKNVFVQNKIQGY-NKQVFLLTD 411 Query: 422 FIAQRLPDDVTSKVKELQRVHQHRFHAVAMSAHGKPGIMR 461 P+ V S +++ + R H++ + ++ Sbjct: 412 GEVD-SPEQVISLIRKNNKF--SRVHSIGFGSGADQYLIN 448 >UniRef50_Q8H924 Os10g0464900 protein n=3 Tax=Oryza sativa RepID=Q8H924_ORYSJ Length = 646 Score = 104 bits (260), Expect = 5e-21, Method: Composition-based stats. Identities = 26/164 (15%), Positives = 47/164 (28%), Gaps = 24/164 (14%) Query: 321 PFIVCVDTSGSMGGFNEQCAKAFCLALMRIALAENRRCYIMLFSTEIVRY----ELSGPQ 376 + +D SGSM G K ++ +R C ++ FS+ R ++ Sbjct: 175 DLVTVLDVSGSMVGNKLALLKQAMGFVIDNLGPGDRLC-VISFSSGASRLMRLSRMTDAG 233 Query: 377 GIEQAIRFLSQQFRGGTDLASCFRAIMERLQSREWFDA--DAVVISDFIAQR-------- 426 S RGGT++ + R + L R + +A +++SD Sbjct: 234 KAHAKRAVGSLSARGGTNIGAALRKAAKVLDDRLYRNAVESVILLSDGQDTYTVPPRGGY 293 Query: 427 ---------LPDDVTSKVKELQRVHQHRFHAVAMSAHGKPGIMR 461 +P + H M Sbjct: 294 DRDANYDALVPPSLVRADAGGGGGRAPPVHTFGFGKDHDAAAMH 337 >UniRef50_A9FM70 Putative uncharacterized protein n=1 Tax=Sorangium cellulosum 'So ce 56' RepID=A9FM70_SORC5 Length = 507 Score = 104 bits (260), Expect = 6e-21, Method: Composition-based stats. Identities = 25/163 (15%), Positives = 48/163 (29%), Gaps = 9/163 (5%) Query: 307 RPVVHKDYDEQPRGPFIVCVDTSGSMGGFNEQCAKAFCLALMRIALAENRRCYIMLFSTE 366 + ++ VD SGSM G + A+A A + + + F+ Sbjct: 109 PAARAARGQPRAPAAVVLLVDASGSMQGPKMENARAAAQAFVDRL-PDGDLVSVASFADT 167 Query: 367 IV----RYELSGPQGIEQAIRFLSQQFRGGTDLASCFRAIME--RLQSREWFDADAVVIS 420 L A + G T+L + + + V+IS Sbjct: 168 AQARVAPTVLGRSTRPAVARAIAALGPDGSTNLFAGLKLAEQHALAAPSTHAVRRVVLIS 227 Query: 421 DFIAQRLPD--DVTSKVKELQRVHQHRFHAVAMSAHGKPGIMR 461 D A P D+ + + H + ++ + A + Sbjct: 228 DGQANIGPSSPDILGALAQRGAAHGVQITSIGVGADYDERTLN 270 >UniRef50_Q7G2L9 Os10g0464500 protein n=3 Tax=Magnoliophyta RepID=Q7G2L9_ORYSJ Length = 719 Score = 104 bits (260), Expect = 6e-21, Method: Composition-based stats. Identities = 32/187 (17%), Positives = 55/187 (29%), Gaps = 24/187 (12%) Query: 298 ESWREKVIERPVVHKDYDEQPRGPFIVCVDTSGSMGGFNEQCAKAFCLALMRIALAENRR 357 E ++ P + + +D S SM G K +++ AL R Sbjct: 238 EFAVLIHLKAPSSPATVTSRAPIDLVTVLDVSWSMAGTKLALLKRAMSFVIQ-ALGPGDR 296 Query: 358 CYIMLFSTEIVRY----ELSGPQGIEQAIRFLSQQFRGGTDLASCFRAIMERLQSREWFD 413 ++ FS+ R +++ R S GGT++A R ++ R + Sbjct: 297 LSVVTFSSSARRLFPLRKMTESGRQRALQRVSSLVADGGTNIADALRKAARVMEDRRERN 356 Query: 414 A--DAVVISDFIAQ-------------RLPDDVT----SKVKELQRVHQHRFHAVAMSAH 454 V++SD PD S + + HA A Sbjct: 357 PVCSIVLLSDGRDTYTVPVPRGGGGGGDQPDYAVLVPSSLLPGGGSARHVQVHAFGFGAD 416 Query: 455 GKPGIMR 461 M Sbjct: 417 HDSPAMH 423 >UniRef50_C7N1C6 Uncharacterized protein n=1 Tax=Slackia heliotrinireducens DSM 20476 RepID=C7N1C6_SLAHD Length = 744 Score = 104 bits (260), Expect = 6e-21, Method: Composition-based stats. Identities = 24/141 (17%), Positives = 45/141 (31%), Gaps = 3/141 (2%) Query: 313 DYDEQPRGPFIVCVDTSGSMGGFNEQCAKAFCLALMRIALAENRRCYIMLFSTEIVRYEL 372 D ++ ++ +DTSGSM G K + ++ + + Sbjct: 373 DPNDASSRHVVLALDTSGSMDGEPLNETKTATREFASTIFKSDADVCLVSYDSSARNVID 432 Query: 373 SGPQGIEQAIRFLSQQFRGGTDLASCFRAIMERLQSREWFDADAVVISDFIAQRL--PDD 430 S GGT++ R ERL+ V++SD A DD Sbjct: 433 STDNEYALKAAVRDLSAGGGTNIEDALRVSYERLEGSGSDKRIIVLMSDGEANEGLVGDD 492 Query: 431 VTSKVKELQRVHQHRFHAVAM 451 + + E+ + + + Sbjct: 493 LIAYANEI-KDDGVTIYTLGF 512 >UniRef50_Q3M2E0 von Willebrand factor, type A n=2 Tax=Cyanobacteria RepID=Q3M2E0_ANAVT Length = 218 Score = 104 bits (260), Expect = 6e-21, Method: Composition-based stats. Identities = 29/193 (15%), Positives = 61/193 (31%), Gaps = 20/193 (10%) Query: 305 IERPVVHKDYDEQPRGPFIVCVDTSGSMGGFNEQ-CAKAFCLA----LMRIALAENRRCY 359 + + + + R P I+ +DTSGSM G Q + + + + Sbjct: 1 MPVGLPEFVENPENRCPVILLLDTSGSMSGQPIQELNRGLATFKEDVIKDSQASLSVEVA 60 Query: 360 IMLFSTEIVRYELSGPQGIEQAIRFLSQQFRGGTDLASCFRAIMERLQSRE--------- 410 I+ F + + + G T + ++ L++R+ Sbjct: 61 IITFGPVRLVQDFVNIDQFTPP----QLEAEGVTPMGEAIEYALDLLETRKSAYKENGIL 116 Query: 411 WFDADAVVISDFIAQRLPDDVTSKVKELQRVHQHRFHAVAMSAHGKPGIMRI--FDHIWR 468 ++ +I+D +VKE + + F V + + +I + Sbjct: 117 YYRPWIFLITDGAPTDYYHLAAQRVKEAEANRRLCFFTVGVQGADFNKLRQIAPAERPPV 176 Query: 469 FDTGMRSRLLRRW 481 G+ R L W Sbjct: 177 ILNGLDFRSLFVW 189 >UniRef50_Q22ST4 von Willebrand factor type A domain containing protein n=1 Tax=Tetrahymena thermophila SB210 RepID=Q22ST4_TETTH Length = 648 Score = 104 bits (260), Expect = 7e-21, Method: Composition-based stats. Identities = 26/169 (15%), Positives = 66/169 (39%), Gaps = 7/169 (4%) Query: 299 SWREKVIERPVVHKDYDEQPRGPFIVCVDTSGSMGGFNEQCAKAFCLALMRIALAENRRC 358 + + + H + +V +D SGSM G Q K + ++ + + +R C Sbjct: 201 DLDYEKLLKHHQHLQTLGRQTVDLVVVIDKSGSMEGEKIQLVKETLVKIINLMSSMDRIC 260 Query: 359 YIMLFS---TEIVRYELSGPQGIEQAIRFLSQ-QFRGGTDLASCFRAIMERLQSREWFD- 413 I+ F+ + + + + + + Q GGT+++ ++ +Q+R++ + Sbjct: 261 -IVCFNESGDRPLTFTRVTDENKQTLLNLIQQIYAGGGTNISEGINHALKAIQNRKFKNN 319 Query: 414 -ADAVVISDFIAQRLPDDVTSKVKELQRVHQHRFHAVAMSAHGKPGIMR 461 +++SD + V + + + Q + P ++R Sbjct: 320 VTSILLLSDGQDTKAYTRVKAYIDKYQIKDAFNIETIGFGEDHDPKLLR 368 >UniRef50_Q7MCW9 Uncharacterized protein n=2 Tax=Vibrio vulnificus RepID=Q7MCW9_VIBVY Length = 688 Score = 104 bits (259), Expect = 8e-21, Method: Composition-based stats. Identities = 38/192 (19%), Positives = 70/192 (36%), Gaps = 18/192 (9%) Query: 282 YRRLVEK---QL--LTYRL--HGESWREKVIERPVVHKDYDEQPRGPFIVCVDTSGSMGG 334 Y RL E +L ++YR E K+ P +Q R ++ +D SGSM G Sbjct: 265 YWRLQEGLPGRLEAVSYRDPQQSERGTIKLTFTPGDDLSAIQQGR-DWVFVLDKSGSMSG 323 Query: 335 FNEQCAKAFCLALMRIALAENRRCYIMLFSTEIVRYE----LSGPQGIEQAIRFLSQQF- 389 + + L + L R I++F + + QAI ++Q Sbjct: 324 KHATLTEGVKRGLGK--LPSGDRFRILMFDNRVQEITNGFIAVNQNNVTQAIETINQIAT 381 Query: 390 RGGTDLASCFRAIMERLQSREWFDADAVVISDFIAQRLPDDVTSKVKELQRVHQHRFHAV 449 GGT+L + L S ++++D +A + + +L + + R + Sbjct: 382 GGGTNLYDALERAVSGLDSDRTTG--IILVTDGVANVGVTE-KKQFLKLMQRYDVRLYTF 438 Query: 450 AMSAHGKPGIMR 461 M ++ Sbjct: 439 IMGNSANTPLLE 450 >UniRef50_A0LPK8 Vault protein inter-alpha-trypsin domain protein n=2 Tax=Deltaproteobacteria RepID=A0LPK8_SYNFM Length = 680 Score = 104 bits (259), Expect = 9e-21, Method: Composition-based stats. Identities = 30/177 (16%), Positives = 56/177 (31%), Gaps = 9/177 (5%) Query: 290 LLTYRLHGESWREKVIERPVVHKDYDEQPRGPFIVCVDTSGSMGGFNEQCAKAFCLALMR 349 L+ YR +S ++ + +D SGSM G ++ Sbjct: 273 LIPYRKGPDSAGTFMVVVTPAASLKRIAEGVDWTFVLDISGSMTGRKITTLIEGVSRVLG 332 Query: 350 IALAENRRCYIMLFSTEIVR----YELSGPQGIEQAIRFLSQ-QFRGGTDLASCFRAIME 404 A +R I+ F+T Y + P+ ++ ++ + Q Q G T L Sbjct: 333 KMSANDR-FRIVTFNTTAADFTGGYVPASPENVQTWMQRVKQIQAGGSTALFDGLDLAYR 391 Query: 405 RLQSREWFDADAVVISDFIAQRLPDDVTSKVKELQRVHQHRFHAVAMSAHGKPGIMR 461 L V+++D + P + L + H R + +M Sbjct: 392 LL--DGERTTGIVLVTDGVCNVGP-TRHDEFLGLLKQHDVRLFTFVIGNSANQPLMD 445 >UniRef50_C3ZT39 Putative uncharacterized protein n=1 Tax=Branchiostoma floridae RepID=C3ZT39_BRAFL Length = 1044 Score = 104 bits (259), Expect = 9e-21, Method: Composition-based stats. Identities = 42/193 (21%), Positives = 66/193 (34%), Gaps = 16/193 (8%) Query: 293 YRLHGESWREKVIERPVVHKDYDEQPRGPFIVCVDTSGSMGGFNEQCAKAFCLALMRIAL 352 + + I P Y + R ++ +DTSGSM G L L Sbjct: 245 FTNGQNPPQNMNIPDPTFEFLYQTELRKE-VLVLDTSGSM-GKKLLFNLRQSLTSHVYNL 302 Query: 353 AENRRCYIMLFSTEIVR----YELSGPQGIEQAIRFLSQQFRGGTDLASCFRAIMERLQS 408 I+ F++E + + + L G T + S + + L + Sbjct: 303 PIGSSLGIVTFNSEATINAPMTVIGNETTRDALVGALPMTTGGKTSIGSGLQEALGLLGN 362 Query: 409 REWFDADAVVISDFIAQRLPDDVTSKVKELQRVHQHRFHAVAMSAHGKPGIMRIFDHIWR 468 ++ISD LP + V RV H H VA+ A G P + ++ Sbjct: 363 DLGR---IILISDGQEDELPH--IADVLPALRVAGHTVHTVAIGADGDPMLEQLS----- 412 Query: 469 FDTGMRSRLLRRW 481 DTG +S RW Sbjct: 413 RDTGGKSFYHTRW 425 >UniRef50_UPI00016DFBC7 UPI00016DFBC7 related cluster n=4 Tax=Takifugu rubripes RepID=UPI00016DFBC7 Length = 883 Score = 104 bits (258), Expect = 1e-20, Method: Composition-based stats. Identities = 37/359 (10%), Positives = 111/359 (30%), Gaps = 44/359 (12%) Query: 139 QATTLNQQLLEEEREQLLSEVQERMT-LSGQLEPILADNNTAAGRLWDMSAGQLKRGDYQ 197 + +++ + + + +E + GQ E ++ + + + + + Sbjct: 89 EKFSVSVNIASNSKVTFVMTYEELLQRTLGQYEIVIRVKPKEPVQEFKIVSNIFEPQGIS 148 Query: 198 LIVKYGEFLNEQPELKRLAEQLGRSREAKSIPRNDAQMETFRTMVREPATVPEQVDGLQQ 257 I + FL L + ++ K + + + + + P ++Q Sbjct: 149 YIDAHATFLTN-----DLLPLVEKTVTDKKVSASTGLIRSRSHTQAHISFSPT----IEQ 199 Query: 258 SDDILRLLPPELATLGITELEYEFYRRLVEKQLLTY-RLHGESWREKVIERPVVHKDYDE 316 ++ +F + + + + P + Sbjct: 200 QRKCPDCPGT--------IIDGDFIIKYDVNRDKNLGDIQIANGYFVHFFAPKDLTRLPK 251 Query: 317 QPRGPFIVCVDTSGSMGGFNEQCAKAFCLALMRIALAENRRCYIMLFSTEIVRYE----L 372 + +D SGSM G Q + + ++ E+ I+ F + + + L Sbjct: 252 ----NVVFVIDRSGSMSGTKMQQIQEAMIKILEDLHPED-HFGIIQFDSSVDSWRNSLSL 306 Query: 373 SGPQGIEQAIRFLSQQFR--GGTDLASCFRAIMERLQSREWFDA-------DAVVISDFI 423 + + I +A+ +++Q T++ + ++ L + ++++D Sbjct: 307 ATEENISEAMAYVNQISHKIQATNINAAVLKAVDMLVTDREAKRLPEKSIDMIILLTDGD 366 Query: 424 A-QRLPDDVTSKVKELQRV---HQHRFHAVAMSAHGKPGIMRIFDHIWRFDTGMRSRLL 478 + + ++E R + + G + D + R + G+ R+ Sbjct: 367 PTTDIGETRIPVIQENVRNAIGGNMSLYGLGFGNDVDYGFL---DVMSRENKGLARRIY 422 >UniRef50_A9RSX3 Predicted protein n=1 Tax=Physcomitrella patens subsp. patens RepID=A9RSX3_PHYPA Length = 1068 Score = 104 bits (258), Expect = 1e-20, Method: Composition-based stats. Identities = 26/178 (14%), Positives = 53/178 (29%), Gaps = 12/178 (6%) Query: 294 RLHGESWREKVIERPVVHKDYDEQPRGPFIVCVDTSGSMGGFNEQCAKAFCLALMRIALA 353 H + I VD SGSM G + A +R Sbjct: 255 ERHPSHGTHAIALTFQPRFALQPLRTSEMIFLVDRSGSMMGTQIKQAGEALELFLRSIPF 314 Query: 354 ENRRCYIMLFSTEIVRYELSGPQGIEQAIRFLSQQF------RGGTDLASCFRAIMERLQ 407 EN I+ F + + + E +++ GGT++A+ F + + + Sbjct: 315 ENHYFNIVGFGSNHNFLFPTSVEYTEDSLKKAVHYAQTIQANMGGTEIANAFFEVFQ--R 372 Query: 408 SREWFDADAVVISDFIAQRLPD---DVTSKVKELQRVH-QHRFHAVAMSAHGKPGIMR 461 R +++D + + V + R + R + + ++ Sbjct: 373 RRRNVPTQIFLLTDGMVWDAEQLTKSIIEAVDDGARNNSPVRVFTLGVGNAVSHHLIE 430 >UniRef50_A7RNW3 Predicted protein n=3 Tax=Nematostella vectensis RepID=A7RNW3_NEMVE Length = 798 Score = 104 bits (258), Expect = 1e-20, Method: Composition-based stats. Identities = 26/180 (14%), Positives = 60/180 (33%), Gaps = 17/180 (9%) Query: 294 RLHGESWREKVIERPVVHKDYDEQPR------GPFIVCVDTSGSMGGFNEQCAKAFCLAL 347 + G + + +++PV ++ + G FI VD SGSM G + A A L L Sbjct: 255 KTRGLAISTEFLQKPVAMVNFVPAFKADDLTCGEFIFVVDRSGSMSGSRIKDA-ARTLQL 313 Query: 348 MRIALAENRRCYIMLFSTEIVRYELSGPQGIEQAIRFLSQQFR------GGTDLASCFRA 401 +L + I+ F + ++ ++ + GGT++ R Sbjct: 314 FLKSLPDGCYFNIVGFGSSYKTLFSKSKTYNDETLKTATNHAAHLAADLGGTEILEPLRW 373 Query: 402 IMERLQSREWFDADAVVISDFIAQRLPDDVTSKVKELQRVHQHRFHAVAMSAHGKPGIMR 461 + + +++D ++ + R + + +++ Sbjct: 374 VYSQSLIEGAPR-QLFLLTDGEVGNTAQVISLVAENA---STARVFSFGIGDGASTELIK 429 >UniRef50_Q10Z89 von Willebrand factor, type A n=1 Tax=Trichodesmium erythraeum IMS101 RepID=Q10Z89_TRIEI Length = 477 Score = 104 bits (258), Expect = 1e-20, Method: Composition-based stats. Identities = 23/194 (11%), Positives = 53/194 (27%), Gaps = 7/194 (3%) Query: 293 YRLHGESWREKVIERPVVHKDYDEQPRGPFIVCVDTSGSMGGFNEQCAKAFCLALMRIAL 352 + G + + ++ +DTS SM G +A + Sbjct: 19 FGGAGCLIAAAIFGEMWLSLTRRPPQPQTVVLLIDTSSSMWGGKLPEVQAAATGFVERQN 78 Query: 353 AENRRCYIMLFSTEIVRYELSGPQGIEQAIRFLSQQFRGGTDLASCFRAIMERLQSREWF 412 I+ FS+ E + GGT+L+ + + L++ Sbjct: 79 LTVNNLAIVEFSSNSQVLTNFDADKTELKQAIANLTPSGGTNLSQGLKTVASLLRNSNTP 138 Query: 413 DADAVVISDFIAQRLPDDVTSKVKELQRVHQHRFHAVAMSAHGKPGIMRIF---DHIWRF 469 + ++ +D + R V + + D ++ Sbjct: 139 N--ILLFTDGQPNDPRAS--KSIAREIREAGINLVTVGTGDANSNYLTSLTENPDLVFFA 194 Query: 470 DTGMRSRLLRRWRR 483 ++G + R + Sbjct: 195 NSGEIDQAFRAAEK 208 >UniRef50_Q54CQ8 von Willebrand factor A domain-containing protein DDB_G0292740 n=1 Tax=Dictyostelium discoideum RepID=Y2740_DICDI Length = 910 Score = 103 bits (257), Expect = 1e-20, Method: Composition-based stats. Identities = 27/173 (15%), Positives = 55/173 (31%), Gaps = 12/173 (6%) Query: 294 RLHGESWREKVIERPVVHKDYDEQPRGPFIVCVDTSGSMGGFNEQCAKAFCLALMRIALA 353 + + D +G FI +D SGSM G A+ L ++ +L Sbjct: 319 EKSSYAIALNFFPKFESINKEDIYQKGEFIFLIDCSGSMSGNPIDSARR-ALEIIIRSLN 377 Query: 354 ENRRCYIMLFSTEIVR-----YELSGPQGIEQAIRFLSQQFR--GGTDLASCFRAIMERL 406 E + I F + + + R++S GGT+L + I+ + Sbjct: 378 EQCKFNIYCFGSGFNKAFQEGSRKYDDDSLAVVNRYVSNISANLGGTELLQPIKDILSKE 437 Query: 407 QSREWFDADAVVISDFIAQRLPDDVTSKVKELQRVHQHRFHAVAMSAHGKPGI 459 E+ +++D + V + + R + + + Sbjct: 438 IDPEYPR-QIFILTDGAVSDRS-KLIEFVSKESKT--TRIFTYGIGSSVDVEL 486 >UniRef50_A0LPD4 von Willebrand factor, type A n=1 Tax=Syntrophobacter fumaroxidans MPOB RepID=A0LPD4_SYNFM Length = 479 Score = 103 bits (257), Expect = 1e-20, Method: Composition-based stats. Identities = 29/161 (18%), Positives = 55/161 (34%), Gaps = 11/161 (6%) Query: 315 DEQPRGPFIVCVDTSGSMGGF-NEQCAKAFCLALMRIALAENRRCYIMLFSTEIVR---Y 370 + + +V +D SGSM A+ L L+ +E R ++ +S + R Sbjct: 87 EARRELDMVVVMDRSGSMADAGKLTHARQAVLNLLSRL-SETDRFALVSYSDHVQRHGGL 145 Query: 371 ELSGPQGIEQAIRFLS-QQFRGGTDLASCFRAIMERLQS--REWFDADAVVISDFIAQRL 427 P R + Q G T+L + + +L + + ++ISD +A R Sbjct: 146 LPITPANRATLERIVRGIQPGGATNLGGGLQEGISQLAELQQNGRLSRLILISDGLANRG 205 Query: 428 --PDDVTSKVKELQRVHQHRFHAVAMSAHGKPGIMR-IFDH 465 + + + V + +M I D Sbjct: 206 VTDPSALGTMASVAAERGYAVSTVGVGLDFNEHLMTSIADK 246 >UniRef50_UPI0000ECD6E7 Poly [ADP-ribose] polymerase 4 (EC 2.4.2.30) (PARP-4) (Vault poly(ADP- ribose) polymerase) (VPARP) (193 kDa vault protein) (PARP- related/IalphaI-related H5/proline-rich) (PH5P). n=5 Tax=Tetrapoda RepID=UPI0000ECD6E7 Length = 1691 Score = 103 bits (257), Expect = 1e-20, Method: Composition-based stats. Identities = 39/279 (13%), Positives = 87/279 (31%), Gaps = 27/279 (9%) Query: 194 GDYQLIVKYGEFLNEQPELKRLA----EQLGRSR--EAKSIPRNDAQMETFRTMVREPAT 247 +Q E N Q +K++ E L + + +P + + ++ ++ T Sbjct: 764 SPWQQDKALNE--NTQDTIKKICVKQVETLKKFSLDMSIEMPYSIESIHSWTHKLKIKKT 821 Query: 248 VPEQVDGLQQSDDILRLLPPELATLGITELEYEFYRRLVEKQLLTYRLHGESWREKVIER 307 + V ++ + + L + + + V + Sbjct: 822 ECKAVIKTVENSSLDSSGFGLDIWISHAYLPRMWVEK--------HPNKNSEACMLVFQP 873 Query: 308 PVVHKDYDEQPRGPFIVCVDTSGSMGGFNEQCAKAFCLALMRIALAENRRCYIMLFS--- 364 +EQ I+ +D S SM G AK L ++ + + ++ F Sbjct: 874 EFEAAFDEEQLSSEIIILLDCSNSMAGSALLQAKQIALHALKQFSS-RQNVNLIKFGTNF 932 Query: 365 TEIVRYELSGPQGIEQAIRFL--SQQFRGGTDLASCFRAIMERLQSREWFDADAVVISDF 422 +E + + + + F+ + G TDL R + S+ + ++ISD Sbjct: 933 SEFSSFSKNTSKDLASLTEFITSATATMGNTDLWKTLRYLSLLFPSQGHRN--ILLISDG 990 Query: 423 IAQRLPDDVTSKVKELQRVHQHRFHAVAMSAHGKPGIMR 461 Q + H R + + ++R Sbjct: 991 HIQNESVTFQLVKDNV---HHTRLFTCGVGSTANRHMLR 1026 >UniRef50_Q0VTG8 Protein containing a von Willebrand factor type A domain n=5 Tax=Gammaproteobacteria RepID=Q0VTG8_ALCBS Length = 698 Score = 103 bits (257), Expect = 1e-20, Method: Composition-based stats. Identities = 42/267 (15%), Positives = 78/267 (29%), Gaps = 27/267 (10%) Query: 205 FLNEQPELKRLAEQLGRSREAKSIPRNDAQMETFRTMVREPATVPEQVDGLQQSDDILRL 264 L + P+ K +QLG + ++ + + Sbjct: 210 RLPKHPQAK--IQQLG-------SQQWQVTLDNRNASTTIKGKGRSEKGDNNFAPPATNG 260 Query: 265 LPPELATLGITELEYEFYRRLVEK-----QLLTYRLHGESWREKVIERPVVHKDYDEQPR 319 PP TL + + Y R + L+ Y+ G+ ++ Sbjct: 261 HPPSAFTL---DQDIVVYWRHQQDLPGSVDLVAYKAPGKDRGTFMLSITPGDDLPPITTG 317 Query: 320 GPFIVCVDTSGSMGGFNEQCAKAFCLALMRIALAENRRCYIMLFSTEIVRY-----ELSG 374 ++ +D SGSM AL + L N R I+LF + + Sbjct: 318 SDWVFVLDISGSMNAKLATLGDGVRQALGK--LRGNDRFRIVLFDDRAEELTSGFVDATP 375 Query: 375 PQGIEQAIRFLSQQFRGGTDLASCFRAIMERLQSREWFDADAVVISDFIAQRLPDDVTSK 434 + + + Q RGGT+L + L + V+++D +A Sbjct: 376 NNIRQYTQKIMQLQSRGGTNLFGGLSLALTPLDAD--RPTGIVLVTDGVANVG-KTRQKD 432 Query: 435 VKELQRVHQHRFHAVAMSAHGKPGIMR 461 +L H R M ++ Sbjct: 433 FIDLLENHDVRLFTFVMGNSANRPMLT 459 >UniRef50_C3ZG18 Putative uncharacterized protein n=1 Tax=Branchiostoma floridae RepID=C3ZG18_BRAFL Length = 806 Score = 103 bits (256), Expect = 2e-20, Method: Composition-based stats. Identities = 25/167 (14%), Positives = 55/167 (32%), Gaps = 11/167 (6%) Query: 299 SWREKVIERPVVHKDYDEQPRGPFIVCVDTSGSMGGFNEQCAKAFCLALMRIALAENRRC 358 + P + ++ G FI +D SGSM G + A+ L ++ +L Sbjct: 253 DHTVMLTFVPDLSREDLVANCGEFIFILDRSGSMSGNKIKNARETLLLFLK-SLPIGCYF 311 Query: 359 YIMLFSTEIVRY----ELSGPQGIEQAIRFLSQQFR--GGTDLASCFRAIMERLQSREWF 412 I+ F + E + ++ A + L + GGT++ + + ++ Sbjct: 312 NIVGFGSTHESLFKGSEKYDNKSLKTACKALGKMEADLGGTEILQPLQYVYKQPPIAGHP 371 Query: 413 DADAVVISDFIAQRLPDDVTSKVKELQRVHQHRFHAVAMSAHGKPGI 459 +++D V + R +V + + Sbjct: 372 R-QLFLLTDGEVWDTQACVREV---AKHADSARCFSVGIGEGASTAL 414 >UniRef50_Q54MG4 von Willebrand factor A domain-containing protein DDB_G0285975 n=4 Tax=Dictyostelium discoideum RepID=Y5975_DICDI Length = 917 Score = 103 bits (256), Expect = 2e-20, Method: Composition-based stats. Identities = 28/168 (16%), Positives = 58/168 (34%), Gaps = 11/168 (6%) Query: 298 ESWREKVIERPVVHKDYDEQPRGPFIVCVDTSGSMGGFNEQCAKAFCLALMRIALAENRR 357 + D + FI +D SGSM G + AK L ++ +L EN + Sbjct: 316 YAVSVNFTPSFSHLTSDDVNQKSEFIFLIDCSGSMSGEPIKKAKR-ALEIIIRSLNENCK 374 Query: 358 CYIMLFSTEIVR----YELSGPQGIEQAIRFLSQQFR--GGTDLASCFRAIMERLQSREW 411 I F + + ++ + +++ ++ + GGT+L R I+ E+ Sbjct: 375 FNIYCFGSRFTKAFDNSKMYNDETLKEISGYVEKIDADLGGTELLPPIRDILSTESDFEY 434 Query: 412 FDADAVVISDFIAQRLPDDVTSKVKELQRVHQHRFHAVAMSAHGKPGI 459 +++D D + + V + R + + Sbjct: 435 PR-QLFILTDGEVSER-DSLINYV--ATESNNTRIFTYGIGNSVDTEL 478 >UniRef50_D0LL92 von Willebrand factor type A n=1 Tax=Haliangium ochraceum DSM 14365 RepID=D0LL92_HALO1 Length = 430 Score = 103 bits (256), Expect = 2e-20, Method: Composition-based stats. Identities = 36/185 (19%), Positives = 61/185 (32%), Gaps = 12/185 (6%) Query: 292 TYRLHGESWREKVIERPVVHKDYDEQPRGPF--IVCVDTSGSMGGFNEQCAKAFCLALMR 349 Y L + RE + + R P + +D SGSM G K L L+ Sbjct: 21 QYDLLPSNARELNLMVRLEGTGDAPATRAPLDLALVIDRSGSMSGDKLSDVKTAALELLE 80 Query: 350 IALAENRRCYIMLFSTEIVRYELSGPQG----IEQAIRFLSQQFRGGTDLASCFRAIMER 405 E+ ++ +S+++ + + E L+ Q RGGT L +E Sbjct: 81 TLQPED-TITLVSYSSDVSMHLMRTRADDAGQREARRALLALQARGGTALGPGLFRALEA 139 Query: 406 LQ--SREWFDADAVVISDFIAQRLP--DDVTSKVKELQRVHQHRFHAVAMSAHGKPGIM- 460 L+ S + ++ SD IA V + + +M Sbjct: 140 LEGASDRTRMSHLMLFSDGIANAGEVRPSVLGARAAGAFGAGVSVSTMGVGVDYNEDLMT 199 Query: 461 RIFDH 465 R+ D Sbjct: 200 RLADQ 204 >UniRef50_UPI00016C38A3 LPXTG-motif cell wall anchor domain protein n=1 Tax=Gemmata obscuriglobus UQM 2246 RepID=UPI00016C38A3 Length = 874 Score = 103 bits (256), Expect = 2e-20, Method: Composition-based stats. Identities = 28/156 (17%), Positives = 57/156 (36%), Gaps = 6/156 (3%) Query: 309 VVHKDYDEQPRGPFIVCVDTSGSMGGFNEQCAKAFCLALMRIALAENRRCYIMLFSTEIV 368 ++ I+ VD SGSM G + A + L+E+ + LF + Sbjct: 271 PPKFADAKKVPREVILLVDHSGSMSGAKWEAADWAVERFLA-GLSEDDAFSLGLFHSTTK 329 Query: 369 RY----ELSGPQGIEQAIRFLSQQFR-GGTDLASCFRAIMERLQSREWFDADAVVISDFI 423 + + P+ + A+ FL GGT+L + R +S E ++++D Sbjct: 330 WFGERTRKATPENVRAAVEFLKLNRDQGGTELGVALEQALARSRSAETPARHVLILTDAE 389 Query: 424 AQRLPDDVTSKVKELQRVHQHRFHAVAMSAHGKPGI 459 + E ++ ++ R + + A + Sbjct: 390 VTDAGRILRLADLESEKPNRRRISVLCIDAAPNAAL 425 >UniRef50_Q498Q0 Inter-alpha (Globulin) inhibitor H3 n=6 Tax=Clupeocephala RepID=Q498Q0_DANRE Length = 892 Score = 103 bits (256), Expect = 2e-20, Method: Composition-based stats. Identities = 22/172 (12%), Positives = 62/172 (36%), Gaps = 9/172 (5%) Query: 316 EQPRGPFIVCVDTSGSMGGFNEQCAKAFCLALMRIALAENRRCYIMLFSTEIVRYE---- 371 ++ + +D SGSM G + + L ++ + ++ FS+ I ++ Sbjct: 263 QRIPKNVVFIIDQSGSMQGNKIEQTRMAMLRILSDLAK-DDYFGLITFSSHIQAWKPELL 321 Query: 372 LSGPQGIEQAIRFLSQQFRGG-TDLASCFRAIMERLQS--REWFDADAVVISDFIAQRLP 428 + + +E+A F+ Q GG TD+ + + +E + ++++D Sbjct: 322 KATAENVEEAKTFVKQIRSGGATDINGAVLNAVNMINQYTQEGSASILILLTDGDPTSGV 381 Query: 429 DDVTSKVKELQRVHQHRFHAVAMSAHGKPGIMRIFDHIWRFDTGMRSRLLRR 480 + + + ++ ++ + + + + G R+ Sbjct: 382 TNPVTIQQNVKTAIGGKYPLYCLGFGFNVRF-EFLEKMSLENNGAARRIYED 432 >UniRef50_Q55G98 von Willebrand factor A domain-containing protein DDB_G0267758 n=1 Tax=Dictyostelium discoideum RepID=Y7758_DICDI Length = 878 Score = 103 bits (256), Expect = 2e-20, Method: Composition-based stats. Identities = 27/158 (17%), Positives = 54/158 (34%), Gaps = 10/158 (6%) Query: 308 PVVHKDYDEQPRGPFIVCVDTSGSMGGFNEQCAKAFCLALMRIALAENRRCYIMLFSTEI 367 K D + FI +D SGSM G + K ++R R ++ F + Sbjct: 303 FKDIKIEDMNQKSEFIFLIDCSGSMVGEPMRKVKRAMEIIIRSLNENQHRVNVVCFGSSF 362 Query: 368 VRYELSGPQGIEQAIRFLSQQFR------GGTDLASCFRAIMERLQSREWFDADAVVISD 421 + ++ + LS+ + GGT+L + + I+ + E+ +++D Sbjct: 363 KKVFKVSRDYNDETLECLSKYIQSIEANLGGTELLTPIKNILSSPPNPEYPR-QLFILTD 421 Query: 422 FIAQRLPDDVTSKVKELQRVHQHRFHAVAMSAHGKPGI 459 A + KE + R + + Sbjct: 422 GEAPHRDKIIHYLSKE---SNTTRIFTYGIGDSVDIDL 456 >UniRef50_B6HQ22 Pc22g19800 protein n=1 Tax=Penicillium chrysogenum Wisconsin 54-1255 RepID=B6HQ22_PENCW Length = 896 Score = 103 bits (256), Expect = 2e-20, Method: Composition-based stats. Identities = 32/174 (18%), Positives = 63/174 (36%), Gaps = 11/174 (6%) Query: 294 RLHGESWREKVIERPVVHKDYDEQPRGPFIVCVDTSGSMGGFNEQCAKAFCLALMRIALA 353 H +K + +V K + VD SGSM A L L +L Sbjct: 251 ETHPTLPNQKALMVSLVPKFSLPPDLSEIVFVVDRSGSMTDNMHTLRSALGLFL--KSLP 308 Query: 354 ENRRCYIMLFSTEIVRY----ELSGPQGIEQAIRFLS-QQFR-GGTDLASCFRAIMERLQ 407 ++ F + ++S + +E+A++ Q GGT++ S A +E + Sbjct: 309 LGVPFNLISFGSSFEAIWARSKVSTRESLEEALQHTKNIQADLGGTEILSGLEAAVE--K 366 Query: 408 SREWFDADAVVISDFIAQRLPDDVTSKVKELQRVHQHRFHAVAMSAHGKPGIMR 461 + + +V++D + V V + + H RF + + ++ Sbjct: 367 RYQDKVLEVLVLTDGEVWNQSE-VFDLVNQANQQHSTRFFTLGLGDSVSHSLIN 419 >UniRef50_D2VKS7 von Willebrand factor type A domain-containing protein n=2 Tax=Naegleria gruberi RepID=D2VKS7_NAEGR Length = 923 Score = 102 bits (254), Expect = 3e-20, Method: Composition-based stats. Identities = 29/184 (15%), Positives = 64/184 (34%), Gaps = 16/184 (8%) Query: 293 YRLHGESWREKVIERPVVHKDYDEQPRG-PFIVCVDTSGSMGGFNEQCAKAFCLALMRIA 351 + + + ++ P + ++ +G ++ VD SGSM G K+ ++ Sbjct: 663 FETECDLYCMATLQGPCFEQQAQKERKGVDLVLVVDKSGSMAGQKLDMVKSTLSFMVDQL 722 Query: 352 LAENRRCYIMLFSTEIVR---YELSGPQGIEQAIRFLS-QQFRGGTDLASCFRAIMERLQ 407 E R I+ F T++ +G ++A + S T+L+ ++ L Sbjct: 723 -KEKDRVAIVEFDTQVKTNLDLTKMDIEGKKKAKQVSSAISPGSCTNLSGALFTSLKLLA 781 Query: 408 SREWFD---ADAVVISDFIAQRL-------PDDVTSKVKELQRVHQHRFHAVAMSAHGKP 457 SR+ ++ +D +A R ++ + EL H Sbjct: 782 SRQQEKNEVTSVILFTDGLANRGLISTNEILQNMQDLMDELLSTSNVTIHTFGFGQDTDA 841 Query: 458 GIMR 461 ++ Sbjct: 842 NMLT 845 >UniRef50_C0PS55 Putative uncharacterized protein n=1 Tax=Picea sitchensis RepID=C0PS55_PICSI Length = 829 Score = 102 bits (253), Expect = 4e-20, Method: Composition-based stats. Identities = 32/185 (17%), Positives = 60/185 (32%), Gaps = 21/185 (11%) Query: 297 GESWREKVIERPVVHKDYDEQPRGPF--IVCVDTSGSMGGFNEQCAKAFCLALMRIALAE 354 E+ +++ E + D R P + +D SGSM G K ++ E Sbjct: 333 SEASKKQNYEDCEGNMVKDPGCRAPIDLVTVLDVSGSMSGTKLALLKRAMAFVISNLSPE 392 Query: 355 NRRCYIMLFSTEIVRY----ELSGPQGIEQAIRFLSQQFRGGTDLASCFRAIMERLQSRE 410 +R +++FS+ R ++ GGT++A R + L+ R Sbjct: 393 DR-LSVVVFSSTAKRVFSLKRMTPDGQRAANRVVERLLCTGGTNIAEGLRKGAKVLEDRR 451 Query: 411 WFD--ADAVVISDFIAQR----------LPDDVTSKVKELQRVHQHRF--HAVAMSAHGK 456 + A +++SD D+ ++ R + HA Sbjct: 452 QRNPVASIMLLSDGQDTYSLSSRGVVLFPSDEQRRSARQSTRYGHVQIPVHAFGFGVDHD 511 Query: 457 PGIMR 461 M Sbjct: 512 AATMH 516 >UniRef50_Q6UXX5 Inter-alpha-trypsin inhibitor heavy chain H5-like protein n=3 Tax=Eutheria RepID=ITH5L_HUMAN Length = 1313 Score = 102 bits (253), Expect = 4e-20, Method: Composition-based stats. Identities = 28/176 (15%), Positives = 57/176 (32%), Gaps = 21/176 (11%) Query: 322 FIVCVDTSGSMGGFNEQCAKAFCLALMRIALAENRRCYIMLFSTEIVRYELSGP-----Q 376 + +D S SM G + K ++ A N I+ FS + ++ G Q Sbjct: 284 VVFVIDVSSSMFGTKMEQTKTAMNVILSDLQA-NDYFNIISFSDTVNVWKAGGSIQATIQ 342 Query: 377 GIEQAIRFLS-QQFRGGTDLASCFRAIMERLQSRE---------WFDADAVVISDFIAQR 426 + A +L + G TD+ S A L + ++D Sbjct: 343 NVHSAKDYLHCMEADGWTDVNSALLAAASVLNHSNQEPGRGPSVGRIPLIIFLTDGEPTA 402 Query: 427 LPDDVTSKVKELQRVHQHRF--HAVAMSAHGKPGIMRIFDHIWRFDTGMRSRLLRR 480 + + +++ HR ++A ++R + + G+ R+ Sbjct: 403 GVTTPSVILSNVRQALGHRVSLFSLAFGDDADFTLLR---RLSLENRGIARRIYED 455 >UniRef50_D0IVS5 Putative uncharacterized protein n=3 Tax=Bacteria RepID=D0IVS5_COMTE Length = 618 Score = 102 bits (253), Expect = 4e-20, Method: Composition-based stats. Identities = 55/291 (18%), Positives = 92/291 (31%), Gaps = 34/291 (11%) Query: 152 REQLLSEVQERMTLSGQLEPILADNNTAAGRLWDMSAGQLKRGDYQLIVKYGEFLNEQPE 211 RE L V ++ L + G L+D L ++ + L + Sbjct: 315 REPFLWNVACDFVINDWLIQMDVGQPPGIGLLYDPELRGLSAE--EVYDRIAGDLRRMRK 372 Query: 212 LKRLA----EQLGRSREAKSIPRNDAQMETFRTMVREPATVPEQVDGLQQSDDILRLLPP 267 L+ LA + L R+ + + ++ F + + D LLP Sbjct: 373 LRTLAGGQGDMLERNVGGQRKAGDYTDIDEFCRTQLGKGLLRHE-------QDARGLLPA 425 Query: 268 EL-----ATLGIT-----ELEYEFYRRL--VEKQLLTYRLHGESWREKVIERPVVHKDYD 315 L A L EL F VE + R+ I RP + D Sbjct: 426 GLIEEIRALLQPPIDWQVELARWFDHHFPPVETRRSYARISRRQSATPDIPRPRIQADSR 485 Query: 316 EQPRGPFIVCVDTSGSMGGFNEQCAKAFCLALMRIALAENRR-CYIMLFSTEIVRYELSG 374 F V +DTSGSM AKA A+ A A++ ++ Sbjct: 486 WLEGRTFGVLLDTSGSM--ERHVLAKAL-GAIASYADAKDVPAVRLICCDAAAYDLGYLP 542 Query: 375 PQGIEQAIRFLSQQFRGGTDLASCFRAIMERLQSREWFDADAVVISDFIAQ 425 I + ++ + RGGT L +++ + ++I+D Sbjct: 543 AADIA---QRIALKGRGGTVLQPGVDLLLQDDDFPKDGP--ILIITDGQCD 588 >UniRef50_B4BQC0 von Willebrand factor type A n=2 Tax=Geobacillus RepID=B4BQC0_9BACI Length = 668 Score = 102 bits (253), Expect = 4e-20, Method: Composition-based stats. Identities = 25/152 (16%), Positives = 49/152 (32%), Gaps = 12/152 (7%) Query: 300 WREKVIERPVVHKDYDEQPRGPFIVCVDTSGSMGGFNEQCAKAFCLALM---RIALAENR 356 R V P +P + +D SGSM Q AK+ A + + N Sbjct: 177 GRLDVTLIPQGGVPAPVRPPIDVVFVMDVSGSMTTMKLQSAKSALQAAVNYFKTNYHPND 236 Query: 357 RCYIMLFSTEIV---RYELSGPQGIEQAIRFL-----SQQFRGGTDLASCFRAIMERLQS 408 R ++ FS ++ + + + GGT+ ++ Sbjct: 237 RFALIPFSDDVKATSVVPFGSKSNVISQLDAILDEGNRLTANGGTNYSAALSLAQSYFND 296 Query: 409 REWFDADAVVISDFIAQRLPDDVTSKVKELQR 440 E + ++D + L + KE+++ Sbjct: 297 PE-RKKYIIFLTDGMPTVLNTTSSITHKEIKK 327 >UniRef50_UPI0000E4A663 PREDICTED: similar to calcium activated chloride channel 1 precursor n=2 Tax=Strongylocentrotus purpuratus RepID=UPI0000E4A663 Length = 1245 Score = 101 bits (252), Expect = 5e-20, Method: Composition-based stats. Identities = 25/174 (14%), Positives = 55/174 (31%), Gaps = 12/174 (6%) Query: 301 REKVIERPVVHKDYDEQPRGPFIVCVDTSGSMG-GFNEQCAKAFCLALMRIALAENRRCY 359 + IE ++ +DTSGSMG + A + + + Sbjct: 504 NDAQIEPSFDLVQASTGDECRVVLVLDTSGSMGTSNRIDKVNSAATAFV-NLVDDGISIG 562 Query: 360 IMLFSTEIVR----YELSGPQGIEQAIRFLSQQFRGGTDLASCFRAIMERLQSREWFDAD 415 I+ F+ +++ + GGT + +E L + AD Sbjct: 563 IVTFTGSPTTRHALTQINTQADRDSLRDIFQLTASGGTCIGCGLEQGLEVLMAHPSGSAD 622 Query: 416 ---AVVISDFIAQRLPDDVTSKVKELQRVHQHRFHAVAMSAHGKPGIMRIFDHI 466 V+++D + + + +++ + R + VA+ + I Sbjct: 623 GGIIVLMTDGQDSGIQNHI---IRQTLQDMGVRVNTVAIGEDAYGELSLIAQET 673 >UniRef50_Q897H0 Membrane-associated protein n=1 Tax=Clostridium tetani RepID=Q897H0_CLOTE Length = 842 Score = 101 bits (252), Expect = 5e-20, Method: Composition-based stats. Identities = 29/183 (15%), Positives = 67/183 (36%), Gaps = 14/183 (7%) Query: 292 TYRLHGESWREKVIERPVVHKDYDEQPRGP--FIVCVDTSGSMGGF-----NEQCAKAFC 344 ++ L + PV +++ +G ++ +D SGSM + AK Sbjct: 379 SFALGSYENTKFEELLPVSCNVKNKRKQGDAGIVLLIDCSGSMDDESGGVKKIELAKQGA 438 Query: 345 LALMRIALAENRRCYIMLFSTEIV-RYELSGPQGIEQAIRFL-SQQFRGGTDLASCFRAI 402 + ++ +E+ I+ FS I + E+ I+ + + +GGT + Sbjct: 439 IETIKALESED-YIGILGFSDTIDWVVPFQKAENKEKLIKEVGKLKPKGGTLIIPGLIEG 497 Query: 403 MERLQSREWFDADAVVISDFIAQRLPDDVTSKVKELQRVHQHRFHAVAMSAHGKPG-IMR 461 ++ L S + ++++D A++ K E + + V + + Sbjct: 498 VKTLSSAKTKVKHMILLTDGQAEKNG---FDKYLENMKKNNMTLSTVGLGEDSDREVLTH 554 Query: 462 IFD 464 + D Sbjct: 555 LSD 557 >UniRef50_C9RRF6 von Willebrand factor type A n=3 Tax=Bacteria RepID=C9RRF6_FIBSS Length = 228 Score = 101 bits (252), Expect = 5e-20, Method: Composition-based stats. Identities = 26/189 (13%), Positives = 57/189 (30%), Gaps = 23/189 (12%) Query: 313 DYDEQPRGPFIVCVDTSGSMGGFNEQCAKAFCLALMRIALAENR-----RCYIMLFSTEI 367 + + R P + +DTSGSM G ++ ++ F Sbjct: 13 ENNPSTRVPVCLVLDTSGSMEGQPISELNEGINCFYDAVRSDETALYAAEIAVVTFGGSA 72 Query: 368 V-RYELSGPQGIEQAIRFLSQQFRGGTDLASCFRAIMERLQSRE---------WFDADAV 417 V + + S + + F GGT + ++ L+ R+ ++ V Sbjct: 73 VLKTDFSTLEHQPDSPNF---FANGGTPMGEAMNMALDLLEKRKGEYKASGVDYYQPWIV 129 Query: 418 VISDFIAQRLPDDVTSKVK---ELQRVHQHRFHAVAMSAHGKPGIMRIF--DHIWRFDTG 472 +++D + V+ E+ + + + + + F G Sbjct: 130 LMTDGKPNGDSSEYARAVQRTCEMIKNRKLTIFPIGIGEDADMNALAAFSPKRSPLKLQG 189 Query: 473 MRSRLLRRW 481 + R W Sbjct: 190 LNFREFFAW 198 >UniRef50_A5B5Z1 Putative uncharacterized protein n=2 Tax=Vitis vinifera RepID=A5B5Z1_VITVI Length = 686 Score = 101 bits (251), Expect = 6e-20, Method: Composition-based stats. Identities = 30/187 (16%), Positives = 53/187 (28%), Gaps = 38/187 (20%) Query: 316 EQPRGPFIVCVDTSGSMGGFNEQCAKAFCLALMRIALAENRRCYIMLFSTEIVRY----E 371 ++ + +D SGSM G K L++ +R I+ FS+ R Sbjct: 199 DRAPIDLVAVLDVSGSMAGSKLSLLKRAVCFLIQNLGPSDR-LSIVSFSSTARRIFPLRR 257 Query: 372 LSGPQGIEQAIRFLSQQFRGGTDLASCFRAIMERLQSREWFD--ADAVVISDFIAQRLPD 429 +S + S GGT++ + + L+ R + A +++SD D Sbjct: 258 MSDNGREAAGLAINSLXSSGGTNIVEGLKKGVRVLEERSEQNPVASIILLSDGKDTYNCD 317 Query: 430 DVTSKVKELQRVHQHR------------------------------FHAVAMSAHGKPGI 459 +V + R H + Sbjct: 318 NVNRRQTSHCASSNPRQVLEYLNLLPASICPRNRESGDEGRQAIIPVHTFGFGSDHDSTA 377 Query: 460 MR-IFDH 465 M I D Sbjct: 378 MHAISDE 384 >UniRef50_Q5RHF3 Novel protein similar to vertebrate inter-alpha (Globulin) inhibitor H5 (ITIH5) (Fragment) n=2 Tax=Danio rerio RepID=Q5RHF3_DANRE Length = 906 Score = 101 bits (251), Expect = 7e-20, Method: Composition-based stats. Identities = 26/176 (14%), Positives = 50/176 (28%), Gaps = 22/176 (12%) Query: 322 FIVCVDTSGSMGGFNEQCAKAFCLALMRIALAENRRCYIMLFSTEIVRYELS-----GPQ 376 + +DTS SM G + K ++ N + FS I ++ P Sbjct: 254 VVFVIDTSASMLGTKMKQTKQALFTIINELRP-NDNFNFVTFSNRIRVWQPGKLVPVTPI 312 Query: 377 GIEQAIRFLSQ-QFRGGTDLASCFRAIMERLQS--------REWFDADAVVISDFIAQRL 427 I A +F+ GGTD+ + L + + ++D Sbjct: 313 SIRDAKKFIYMISVTGGTDINGGIQTGSALLSDYLSSKDESHHHSVSLIIFLTDGRPTVG 372 Query: 428 ---PDDVTSKVKELQRVHQHRFHAVAMSAHGKPGIMRIFDHIWRFDTGMRSRLLRR 480 + S K + + M ++ + + G R+ Sbjct: 373 VLQSPTIISNTKTAVQEK-FCLFTIGMGDDVDYRLLE---RMSLDNCGTMRRIPED 424 >UniRef50_P76396 Uncharacterized protein yegL n=64 Tax=cellular organisms RepID=YEGL_ECOLI Length = 219 Score = 101 bits (251), Expect = 7e-20, Method: Composition-based stats. Identities = 30/191 (15%), Positives = 61/191 (31%), Gaps = 18/191 (9%) Query: 303 KVIERPVVHKDYDEQPRGPFIVCVDTSGSMGGFNEQCAKAFCLALMRIALAE-----NRR 357 + I + +PR P I+ +D SGSM G A + LA+ Sbjct: 3 EQITFATSDFASNPEPRCPCILLLDVSGSMNGRPINELNAGLVTFRDELLADPLALKRVE 62 Query: 358 CYIMLFSTEIVRYELSGPQGIEQAIRFLSQQFRGGTDLASCFRAIMERLQSRE------- 410 I+ F V + I +G T + + ++ ++ R+ Sbjct: 63 LGIVTFGPVHVEQPFTSAANFFPPI----LFAQGDTPMGAAITKALDMVEERKREYRANG 118 Query: 411 --WFDADAVVISDFIAQRLPDDVTSKVKELQRVHQHRFHAVAMSAHGKPGIMRIFDHIWR 468 ++ +I+D +KV + + F ++ + + +I Sbjct: 119 ISYYRPWIFLITDGAPTDEWQAAANKVFRGEEDKRFAFFSIGVQGADMKTLAQISVRQPL 178 Query: 469 FDTGMRSRLLR 479 G++ R L Sbjct: 179 PLQGLQFRELF 189 >UniRef50_A4XHD9 von Willebrand factor, type A n=2 Tax=Clostridia RepID=A4XHD9_CALS8 Length = 909 Score = 101 bits (251), Expect = 8e-20, Method: Composition-based stats. Identities = 25/172 (14%), Positives = 65/172 (37%), Gaps = 12/172 (6%) Query: 297 GESWREKVIERPVVHKDYDEQPRGPFIVCVDTSGSMGG------FNEQCAKAFCLALMRI 350 S EK++ + K+ +++ ++ +D SGSMGG + AK+ ++ Sbjct: 383 SNSVLEKMLPVKMQLKNKEKERNVAVVLVIDHSGSMGGSNLRNINKLEIAKSAAAKMIDH 442 Query: 351 ALAENRRCYIMLFSTEIVR-YELSGPQGIEQAIRFLS-QQFRGGTDLASCFRAIMERLQS 408 + + ++ F + + + I +S Q GGT + + L+ Sbjct: 443 LESSD-SVGVIAFDHNFYWASKFGKLKSKNEVIENISTIQVGGGTAIIPPLTEAVNLLKK 501 Query: 409 REWFDADAVVISDFIAQRLPDDVTSKVKELQRVHQHRFHAVAMSAHGKPGIM 460 + D V+++D + + + + + + + + + + I+ Sbjct: 502 SKAKDKVIVLLTDGYGEEGGYEYPASIA---KRNNIKITTIGVGSSINAPIL 550 >UniRef50_Q24FW2 von Willebrand factor type A domain containing protein n=1 Tax=Tetrahymena thermophila SB210 RepID=Q24FW2_TETTH Length = 1074 Score = 101 bits (251), Expect = 8e-20, Method: Composition-based stats. Identities = 30/151 (19%), Positives = 62/151 (41%), Gaps = 6/151 (3%) Query: 316 EQPRGPFIVCVDTSGSMGGFNEQCAKAFCLALMRIALAENRRCYIMLFSTEI--VRYELS 373 ++P I +D SGSM G K L L+ E R ++LF++E+ + Sbjct: 360 QRPPIDLICVMDNSGSMHGEKINMLKETLLYLIDQL-DEKDRLGLVLFNSEVTFRPMKSM 418 Query: 374 GPQGIEQAIRFLS-QQFRGGTDLASCFRAIMERLQSREWFD--ADAVVISDFIAQRLPDD 430 + +++S + +GGTD+ + +++R++ + ++SD + + D Sbjct: 419 DTTNKLKLKQYISDIRAQGGTDINLGMTEAFKFIKTRKYCNPVTSVFLLSDGLDSKAQDR 478 Query: 431 VTSKVKELQRVHQHRFHAVAMSAHGKPGIMR 461 V +K + Q + P +M Sbjct: 479 VAVTLKNMSINEQFSINCFGFGRDHDPILMN 509 >UniRef50_A4WJJ3 Putative uncharacterized protein n=5 Tax=Thermoproteaceae RepID=A4WJJ3_PYRAR Length = 431 Score = 100 bits (250), Expect = 8e-20, Method: Composition-based stats. Identities = 54/376 (14%), Positives = 112/376 (29%), Gaps = 70/376 (18%) Query: 104 SPWAEQARQLVDANSTITSALHTLFLQRWRLSLIVQATTLNQQLLEEEREQLLSEVQERM 163 S + ++ N ++ + L R SL+ + + E++++ ++ E Sbjct: 77 SDVYHEISKISRYNYQVSKSASVKLL-RAYNSLLSRIERGAVEGFEDQKQDF-RDLSENQ 134 Query: 164 TLSGQLEPIL----------ADNNTAAGRLWDMSAGQLKRGDY--QLIVKYGEFLNEQPE 211 L ++ +L + + G+ I Y L + Sbjct: 135 QLRNEISNLLRFYMGNVRNIEKLRKSMTKALGNEVGKETAELLFDIDIDPYRARLAKI-- 192 Query: 212 LKRLAEQLGRSREAKSIPRNDAQMETFRTMVREPATVPEQVDGLQQSDDILRLLPPELAT 271 L+ L E L +E + + R ++ D+ + L+ Sbjct: 193 LESLVEMLSAVKEEVDQGDVQERRGVISGVTR-----------IRTYSDLQK--ATNLSK 239 Query: 272 LGITELEYEFYRRLVEKQLLTYRLHGESWREKVIERPVVHKDYDEQPRGPFIVCVDTSGS 331 + F +L K L Y L ++ + VD SGS Sbjct: 240 AIYLQSRELFGYKLATKSLSIYDLALDTRDR-------------------VYLLVDKSGS 280 Query: 332 M-----------GGFNEQCAKAFCLALMRIALAENRRCYIMLFSTEIVRYELSGPQGIEQ 380 M A A +A+M+ + ++ F ++V ++ + I Sbjct: 281 MFYSLYDGVAMDMTQKITWATALAIAVMKKSKRT-----VLRFFDQMVYPPITNVKDI-- 333 Query: 381 AIRFLSQQFRGGTDLASCFRAIMERLQSREWFDADAVVISDFIAQRLPDDVTSKVKELQR 440 L GGTD+ + + + + + V+I+D + +V K R Sbjct: 334 IRSLLRVLPLGGTDITAAVHTAVRDAKQQSLHNYKLVIITDGEDDMIHPEVLKMAKTAFR 393 Query: 441 VHQHRFHAVAMSAHGK 456 AV + Sbjct: 394 E----VKAVLVGGTNS 405 >UniRef50_A7RTF3 Predicted protein (Fragment) n=1 Tax=Nematostella vectensis RepID=A7RTF3_NEMVE Length = 756 Score = 100 bits (249), Expect = 1e-19, Method: Composition-based stats. Identities = 29/185 (15%), Positives = 66/185 (35%), Gaps = 23/185 (12%) Query: 283 RRLVEKQLLTYRLHGESWREKVIERPVVHKDYDEQPRGPFIVCVDTSGSMGGFNEQCAKA 342 + +EK L+T + +++ +E G FI +D SGSM G + A+ Sbjct: 248 QDFLEKPLVTLNFMPDFGKQEALET------------GEFIFVIDRSGSMSGDRIKNARE 295 Query: 343 FCLALMRIALAENRRCYIMLFSTEIVRYELSGPQGIEQAIRFLSQQFR------GGTDLA 396 L L +L E+ ++ F + + S + + ++ + GGT++ Sbjct: 296 -TLFLFLKSLPEHCHFNVVGFGSSYEKLFSSSTKYSDSSVNKACNHAKNLEANLGGTEIL 354 Query: 397 SCFRAIMERLQSREWFDADAVVISDFIAQRLPDDVTSKVKELQRVHQHRFHAVAMSAHGK 456 + + + + +++D V + VK + R + Sbjct: 355 EPLKYVFSQPVIKGSPR-QVFLMTDGEVGN-TQQVITLVK--KNSTHARCFTFGIGQGAS 410 Query: 457 PGIMR 461 +++ Sbjct: 411 TALIK 415 >UniRef50_D1CCX6 von Willebrand factor type A n=1 Tax=Thermobaculum terrenum ATCC BAA-798 RepID=D1CCX6_THET1 Length = 918 Score = 100 bits (249), Expect = 1e-19, Method: Composition-based stats. Identities = 32/195 (16%), Positives = 66/195 (33%), Gaps = 28/195 (14%) Query: 293 YRLHGESWR--EKVIERPVVHKDYDEQPRGPFIVCVDTSGSMG-----GFNE-------- 337 + L G E+ + ++ DE+P+ ++ +D SGSM G Sbjct: 376 FALGGYFNTPLEQTLPVDSQIRNPDEEPQVAVVMAIDKSGSMAACHCEGSKLLEQYPGGI 435 Query: 338 ---QCAKAFCLALMRIALAENRRCYIMLFSTE----IVRYELSGPQGIEQAIRFLSQQFR 390 AK + L L N ++ F T + ++ I A + Q Sbjct: 436 PKVDIAKESAI-LSSETLGPNDIFGVVAFDTAPRWVVRPEPVTDKSSI--AEKVAGIQGS 492 Query: 391 GGTDLASCFRAIMERLQSREWFDADAVVISDFIAQRLPDDVTSKVKELQRVHQHRFHAVA 450 GGT++ ++ L + + ++++D + D ++ R H V+ Sbjct: 493 GGTNIYGGLAEAIDSLIKVKAKNKHVILLTDGWSNVGNYD---ELISKARRHGITISTVS 549 Query: 451 MSAHGKPGIMRIFDH 465 + + I + Sbjct: 550 AAGGSAQLLRSIAEK 564 Score = 63.2 bits (152), Expect = 2e-08, Method: Composition-based stats. Identities = 26/131 (19%), Positives = 45/131 (34%), Gaps = 8/131 (6%) Query: 316 EQPRGPFIVCVDTSGSMGGFNEQCAKAFCLALMRIALAENRRCYIMLFSTEIVRYELSGP 375 + + VD S S+G AK F ++A + +++F E + L+ Sbjct: 60 PSHKLGVVFLVDASDSVGPEGIAQAKEFVRKAYQLAGR-DVDLGVVVFGKEPLIDSLTSS 118 Query: 376 QGIEQAIRFLSQQFRGGTDLASCFRAIMERLQSREWFDADAVVISDFIAQRLPDDVTSKV 435 G + FLS+ TD+ S R + V++SD +V Sbjct: 119 DG--KLPDFLSRPDSTATDIPSAMRLAFSMFPADSSKK--IVLLSDGNNNVGD---MQEV 171 Query: 436 KELQRVHQHRF 446 L R+ Sbjct: 172 SRLARMFGVTV 182 >UniRef50_C1XFI8 Mg-chelatase subunit ChlD n=2 Tax=Meiothermus RepID=C1XFI8_MEIRU Length = 722 Score = 100 bits (249), Expect = 1e-19, Method: Composition-based stats. Identities = 34/180 (18%), Positives = 64/180 (35%), Gaps = 11/180 (6%) Query: 290 LLTYRLHG---ESWREKVIERPVVHKDYDEQPRGPFIVCVDTSGSM-GGFNEQCAKAFCL 345 L T G W + + + +E ++ +D SGSM A L Sbjct: 281 LWTATPQGLFFGGWERTSLADSLPVEPVEEPGGVGIVLVLDVSGSMLEDDKLGLAVTGSL 340 Query: 346 ALMRIALAENRRCYIMLFSTEIVRY----ELSGPQGIEQAIRFLSQQFRGGTDLASCFRA 401 L+R A ++ +++FS ++ E LS Q GGT + + Sbjct: 341 ELIRSARPQD-YIGVVVFSDRPRWLFRPRPMTEQGRKEAESLLLSTQAGGGTMIRRAYLE 399 Query: 402 IMERLQSREWFDADAVVISDFIAQRLPDDVTSKVKELQRVHQHRFHAVAMSAHGKPGIMR 461 +E L+ + ++D +A + D+ +E + + VA+ A +R Sbjct: 400 ALEALEQVPTESKQVIALTDGLAADVTPDLFDAAREASPR--IKTNTVAIGADADGRFLR 457 >UniRef50_C7P2A9 von Willebrand factor type A n=2 Tax=Halobacteriaceae RepID=C7P2A9_HALMD Length = 393 Score = 100 bits (249), Expect = 1e-19, Method: Composition-based stats. Identities = 35/170 (20%), Positives = 60/170 (35%), Gaps = 8/170 (4%) Query: 299 SWREKVIERPVVHKDYDEQPRGPFIVCVDTSGSMGGFNEQCAKAFCLALMRIALAENRRC 358 E V + + R +C+DTSGSM G N + A+ A + LA+ Sbjct: 16 DGTTVTAEIDVEPGEQETDVRRHIALCIDTSGSMEGDNIKRARDGA-AWVFGLLADEDYV 74 Query: 359 YIMLFSTEIVR-YELSGPQGIEQ--AIRFL-SQQFRGGTDLASCFRAIMERLQSREWFDA 414 I+ F TE + +++ A+ + GGTD+ + +A E L S Sbjct: 75 SIVAFDTEATVILPATRWSDLDRQTAMDHVEELTAGGGTDMYNGLKAAKETLSSSATGPD 134 Query: 415 DA---VVISDFIAQRLPDDVTSKVKELQRVHQHRFHAVAMSAHGKPGIMR 461 +++SD D + E R + + +R Sbjct: 135 TVKRLLLLSDGKDNERTPDEFEGLAEAIDDAGIRIQSAGIGTDYNEATIR 184 >UniRef50_A4YGU7 von Willebrand factor, type A n=12 Tax=Sulfolobaceae RepID=A4YGU7_METS5 Length = 383 Score = 100 bits (249), Expect = 1e-19, Method: Composition-based stats. Identities = 25/146 (17%), Positives = 54/146 (36%), Gaps = 8/146 (5%) Query: 322 FIVCVDTSGSMGGFNEQCAKAFCLALMRIALAENRRCYIMLFSTEIVRY-ELSGPQGIEQ 380 +IV +DTSGSM G + AK + L++ + + + FS+ + E P+ + Sbjct: 40 YIVLLDTSGSMDGLKIESAKKGAIELLKRI-PQGNKVSFVTFSSRVNIVREFVDPEDLTA 98 Query: 381 AIRFLSQQFRGGTDLASCFRAIMERLQSREWFDADAVVISDFIAQRLPDDVTSKVKELQR 440 S G T + L ++ + ++++D D K + Sbjct: 99 --EISSLSAGGQTAFFTALLTAFN-LHNKHGIPSYVILLTDG--NPTDDTNVETYKRIAI 153 Query: 441 VHQHRFHAVAMSAHGKPGIMR-IFDH 465 + + + + I++ + D Sbjct: 154 PNGVQTISFGLGDDYNETILKSLADR 179 Database: uniref50.fasta Posted date: Mar 8, 2010 10:38 AM Number of letters in database: 1,040,396,356 Number of sequences in database: 3,077,464 Lambda K H 0.309 0.128 0.307 Lambda K H 0.267 0.0391 0.140 Matrix: BLOSUM62 Gap Penalties: Existence: 11, Extension: 1 Number of Hits to DB: 2,183,352,704 Number of Sequences: 3077464 Number of extensions: 75433000 Number of successful extensions: 308344 Number of sequences better than 1.0e-01: 250 Number of HSP's better than 0.1 without gapping: 812 Number of HSP's successfully gapped in prelim test: 4804 Number of HSP's that attempted gapping in prelim test: 300365 Number of HSP's gapped (non-prelim): 7379 length of query: 483 length of database: 1,040,396,356 effective HSP length: 133 effective length of query: 350 effective length of database: 631,093,644 effective search space: 220882775400 effective search space used: 220882775400 T: 11 A: 40 X1: 16 ( 7.1 bits) X2: 38 (14.6 bits) X3: 64 (24.7 bits) S1: 41 (21.2 bits) S2: 95 (41.3 bits)