BLASTP 2.2.22 [Sep-27-2009]

Reference: Altschul, Stephen F., Thomas L. Madden, Alejandro A. Schaffer,
Jinghui Zhang, Zheng Zhang, Webb Miller, and David J. Lipman (1997),
"Gapped BLAST and PSI-BLAST: a new generation of protein database search
programs", Nucleic Acids Res. 25:3389-3402.

Reference for compositional score matrix adjustment: Altschul, Stephen F.,
John C. Wootton, E. Michael Gertz, Richa Agarwala, Aleksandr Morgulis,
Alejandro A. Schaffer, and Yi-Kuo Yu (2005) "Protein database searches
using compositionally adjusted substitution matrices", FEBS J. 272:5101-5109.

Reference for composition-based statistics starting in round 2:
Schaffer, Alejandro A., L. Aravind, Thomas L. Madden, Sergei Shavirin,
John L. Spouge, Yuri I. Wolf, Eugene V. Koonin, and Stephen F. Altschul (2001),
"Improving the accuracy of PSI-BLAST protein database searches with
composition-based statistics and other refinements", Nucleic Acids Res. 29:2994-3005.

Query= batch____
         (427 letters)

Database: uniref50.fasta
           3,077,464 sequences; 1,040,396,356 total letters

Searching..................................................done

Results from round 1

                                                                    Score    E
Sequences producing significant alignments:                         (bits) Value

UniRef50_Q1C7U5 UPF0229 protein YPA_1510 n=195 Tax=Proteobacteri...   751   0.0
UniRef50_Q1QZ69 UPF0229 protein Csal_0882 n=164 Tax=Proteobacter...   447   e-124
UniRef50_A9I2A9 Putative uncharacterized protein n=6 Tax=Burkhol...   397   e-109
UniRef50_P59348 UPF0229 protein bll6755 n=25 Tax=Alphaproteobact...   377   e-103
UniRef50_Q1QPM0 UPF0229 protein Nham_0975 n=24 Tax=Proteobacteri...   367   e-100
UniRef50_B6B8L1 Putative uncharacterized protein n=2 Tax=Rhodoba...   357   3e-97
UniRef50_B5EPS6 Putative uncharacterized protein n=2 Tax=Acidith...   265   3e-69
UniRef50_Q1GJ99 Putative uncharacterized protein n=1 Tax=Ruegeri...   229   2e-58
UniRef50_B9DZE4 UPF0229 protein CKR_0568 n=26 Tax=Clostridiales ...   192   1e-47
UniRef50_Q3J885 Putative uncharacterized protein n=2 Tax=Nitroso...   175   3e-42
UniRef50_UPI00016C05A6 hypothetical protein Epulo_00085 n=1 Tax=...   170   9e-41
UniRef50_A9DKM0 Putative uncharacterized protein (Fragment) n=1 ...   170   1e-40
UniRef50_B9L510 Sporulation protein YhbH n=1 Tax=Thermomicrobium...   153   1e-35
UniRef50_Q896G6 Conserved protein n=1 Tax=Clostridium tetani Rep...   152   3e-35
UniRef50_B7H9V9 UPF0229 protein BCB4264_A0587 n=93 Tax=Bacteria ...   146   1e-33
UniRef50_A9FJ88 Uncharacterized conserved protein involved in st...   140   8e-32
UniRef50_UPI0001913F8A hypothetical protein Salmonellaentericaen...   136   1e-30
UniRef50_A7Z2R4 UPF0229 protein RBAM_009260 n=29 Tax=Firmicutes ...   136   2e-30
UniRef50_C9RD74 Putative uncharacterized protein n=1 Tax=Ammonif...   125   4e-27
UniRef50_A4BNQ7 Putative uncharacterized protein n=1 Tax=Nitroco...   113   2e-23
UniRef50_Q67Q87 Putative uncharacterized protein n=1 Tax=Symbiob...   106   1e-21
UniRef50_C1XT76 Uncharacterized conserved protein n=1 Tax=Meioth...    98   7e-19
UniRef50_Q72KI5 Glycosyltransferase n=3 Tax=Thermus RepID=Q72KI5...    94   7e-18
UniRef50_Q9HRD6 UPF0229 protein VNG_0746C n=11 Tax=Halobacteriac...    84   1e-14
UniRef50_A0MN74 Conserved bacterial protein n=1 Tax=Thermus phag...    77   2e-12
UniRef50_Q747C5 Conserved domain protein n=11 Tax=Deltaproteobac...    69   3e-10
UniRef50_A8TM27 Putative Ser protein kinase n=1 Tax=alpha proteo...    65   4e-09
UniRef50_D2EEA7 Putative uncharacterized protein n=1 Tax=Candida...    53   2e-05

>UniRef50_Q1C7U5 UPF0229 protein YPA_1510 n=195 Tax=Proteobacteria
           RepID=Y1510_YERPA
          Length = 424

 Score = 751 bits (1938), Expect = 0.0, Method: Compositional matrix adjust.
Identities = 360/423 (85%), Positives = 391/423 (92%), Gaps = 1/423 (0%) Query: 1 MTWFIDRRLNGKNKSMVNRQRFLRRYKAQIKQSISEAINKRSVTDVDSGESVSIPTEDIS 60 M +FIDRRLNGKNKSMVNRQRFLRRYK+QIKQSI++AINKRSVTD++SGESVSIP +DI+ Sbjct: 1 MGYFIDRRLNGKNKSMVNRQRFLRRYKSQIKQSIADAINKRSVTDIESGESVSIPIDDIN 60 Query: 61 EPMFHQGRGGLRHRVHPGNDHFVQNDRIERPQGGGGGSGSGQGQASQDGEGQDEFVFQIS 120 EPMFHQG GGLRHRVHPGNDHF+ NDR++RPQGGGGG A +DGEG+DEFVFQIS Sbjct: 61 EPMFHQGNGGLRHRVHPGNDHFITNDRVDRPQGGGGGGSGQG-NAGKDGEGEDEFVFQIS 119 Query: 121 KDEYLDLLFEDLALPNLKQNQQRQLTEYKTHRAGYTANGVPANISVVRSLQNSLARRTAM 180 KDEYLDLLFEDLALPNLK+NQ +QL E+KTHRAGYT+NGVPANISVVRSLQNSLARRTAM Sbjct: 120 KDEYLDLLFEDLALPNLKRNQYKQLAEFKTHRAGYTSNGVPANISVVRSLQNSLARRTAM 179 Query: 181 TAGKRRELHALEENLAIISNSEPAQLLEEERLRKEIAELRAKIERVPFIDTFDLRYKNYE 240 TA KRREL LE L ++ NSEPAQLLEEERLRK I EL+ KI RVPFIDTFDLRYKNYE Sbjct: 180 TASKRRELRELEAALTVLENSEPAQLLEEERLRKAITELKQKIARVPFIDTFDLRYKNYE 239 Query: 241 KRPDPSSQAVMFCLMDVSGSMDQSTKDMAKRFYILLYLFLSRTYKNVEVVYIRHHTQAKE 300 +RP+PSSQAVMFCLMDVSGSMDQ+TKDMAKRFYILLYLFLSRTYKNV+VVYIRHHTQAKE Sbjct: 240 RRPEPSSQAVMFCLMDVSGSMDQATKDMAKRFYILLYLFLSRTYKNVDVVYIRHHTQAKE 299 Query: 301 VDEHEFFYSQETGGTIVSSALKLMDEVVKERYNPAQWNIYAAQASDGDNWADDSPLCHEI 360 VDE EFFYSQETGGTIVSSALKLMDEVV+ERYNPAQWNIYAAQASDGDNWADDSPLCHE+ Sbjct: 300 VDEQEFFYSQETGGTIVSSALKLMDEVVQERYNPAQWNIYAAQASDGDNWADDSPLCHEL 359 Query: 361 LAKKLLPVVRYYSYIEITRRAHQTLWREYEHLQSTFDNFAMQHIRDQDDIYPVFRELFHK 420 LAKK+LPVVRYYSYIEITRRAHQTLWREYE L+ FDNFA+QHIR+ +DIYPVFRELFHK Sbjct: 360 LAKKILPVVRYYSYIEITRRAHQTLWREYEDLEEKFDNFAIQHIREPEDIYPVFRELFHK 419 Query: 421 QNA 423 Q Sbjct: 420 QTV 422 >UniRef50_Q1QZ69 UPF0229 protein Csal_0882 n=164 Tax=Proteobacteria RepID=Y882_CHRSD Length = 426 Score = 447 bits (1150), Expect = e-124, Method: Compositional matrix adjust. 
Identities = 240/430 (55%), Positives = 315/430 (73%), Gaps = 10/430 (2%) Query: 1 MTWFIDRRLNGKNKSMVNRQRFLRRYKAQIKQSISEAINKRSVTDVDSGESVSIPTEDIS 60 MT+FIDRR N KNKS VNRQRFL+RY++ IK+++ EA+N+RS+TD++ GE +SIP +DIS Sbjct: 1 MTYFIDRRANAKNKSAVNRQRFLQRYRSHIKRAVEEAVNRRSITDMERGEKISIPAKDIS 60 Query: 61 EPMFHQGRGGLRHRVHPGNDHFVQNDRIERPQGGGGGSGSGQGQASQDGEGQDEFVFQIS 120 EP+F G GG R V PGN FV+ DR+ R GG G GSG+G AS GEG DEF F +S Sbjct: 61 EPVFQHGPGGARTIVSPGNKEFVEGDRLRR-PGGEGRGGSGEGSASNQGEGMDEFAFSLS 119 Query: 121 KDEYLDLLFEDLALPNLKQNQQRQLTEYKTHRAGYTANGVPANISVVRSLQNSLARRTAM 180 ++E+LD +F+ LALP+L++ Q R L E + RAG T +GVP+ I++VRS++ + ARR M Sbjct: 120 REEFLDFVFDGLALPHLERKQLRDLDEVRPVRAGVTRDGVPSRINIVRSMREAQARRIGM 179 Query: 181 TAGKRRELHALEENLAIISNSEP-----AQLLEEERLRKEIAELRAKIERVPFIDTFDLR 235 A +R L EE L +P A++ E L+ EI L ++E VPFIDT+DLR Sbjct: 180 RAPIKRALREAEEALESEERKDPVLRNPARIGE---LKAEIERLEKRLEAVPFIDTYDLR 236 Query: 236 YKNYEKRPDPSSQAVMFCLMDVSGSMDQSTKDMAKRFYILLYLFLSRTYKNVEVVYIRHH 295 Y N +P PS++AVMFC+MDVSGSM Q KD+AKRF++LLYLFL R Y+ VE+V+IRHH Sbjct: 237 YNNLIDQPQPSNKAVMFCVMDVSGSMTQGHKDIAKRFFLLLYLFLERNYEKVELVFIRHH 296 Query: 296 TQAKEVDEHEFFYSQETGGTIVSSALKLMDEVVKERYNPAQWNIYAAQASDGDNWADDSP 355 T AKEVDE EFFYS+ETGGTIVSSAL L+DE++ +RY+PAQWN+Y AQASDGDNW DDS Sbjct: 297 TAAKEVDEEEFFYSRETGGTIVSSALTLVDEIIAKRYSPAQWNLYVAQASDGDNWDDDSL 356 Query: 356 LCHEILAKKLLPVVRYYSYIEITRRAHQTLWREYEHLQSTFDN-FAMQHIRDQDDIYPVF 414 C ++L L+ ++YY+Y+EIT +HQ LW EYE +Q+ + FAMQ I + DIYPVF Sbjct: 357 TCRDLLMTSLMAKLQYYTYVEITPHSHQALWEEYERVQAAHPSRFAMQQIVEPGDIYPVF 416 Query: 415 RELFHKQNAT 424 R+LF K+ A+ Sbjct: 417 RKLFRKRVAS 426 >UniRef50_A9I2A9 Putative uncharacterized protein n=6 Tax=Burkholderiales RepID=A9I2A9_BORPD Length = 419 Score = 397 bits (1020), Expect = e-109, Method: Compositional matrix adjust. 
Identities = 206/426 (48%), Positives = 285/426 (66%), Gaps = 10/426 (2%) Query: 1 MTWFIDRRLNGKNKSMVNRQRFLRRYKAQIKQSISEAINKRSVTDVDSGESVSIPTEDIS 60 M IDRRLNG+NKS VNR+RFLRRYK QI++++ + + +RS+ D+D G +++P DIS Sbjct: 1 MNSLIDRRLNGRNKSAVNRERFLRRYKDQIRRAVQDLVRERSIEDMDQGGEINLPARDIS 60 Query: 61 EPMFHQGRGGLRHRVHPGNDHFVQNDRIERPQGGGGGSGSGQGQASQDGEGQDEFVFQIS 120 EP F G+GG R VHPGN F + D RP GS G +GE D+F F +S Sbjct: 61 EPHFRHGQGGDRELVHPGNREFAKGDTFPRP----SGSDGEGGSEPGEGESVDQFTFSLS 116 Query: 121 KDEYLDLLFEDLALPNLKQNQQRQLTEYKTHRAGYTANGVPANISVVRSLQNSLARRTAM 180 + E+L+L FEDL LP+L + Q +T+ K RAGYT G P+ +SV R+L+ SL+RR A+ Sbjct: 117 RAEFLNLFFEDLELPHLIRTQLGDVTQKKWQRAGYTTTGSPSLLSVSRTLKASLSRRVAL 176 Query: 181 TAGKRRELHALEENLAIISNSEPAQLLEEERLRKEIAELRAKIERVPFIDTFDLRYKNYE 240 R EL A + L + A E + LR+E+ + ++ R+PF+D DLRY+N Sbjct: 177 GVAARAELEAAQAKLDAAIAAG-APQAEIDALRQEVEDCANRLARLPFLDDLDLRYRNRV 235 Query: 241 KRPDPSSQAVMFCLMDVSGSMDQSTKDMAKRFYILLYLFLSRTYKNVEVVYIRHHTQAKE 300 P ++AVMFCLMDVSGSMD+ KD+AKRF+ LLYLFLSR Y++V+VV+IRH A+E Sbjct: 236 SVAMPMARAVMFCLMDVSGSMDEGKKDLAKRFFTLLYLFLSRKYEHVDVVFIRHTDNAEE 295 Query: 301 VDEHEFFYSQETGGTIVSSALKLMDEVVKERYNPAQWNIYAAQASDGDNWADDSPLCHEI 360 VDE FFY ++GGTIV SAL+LM E+V++RY P+ WN+YAAQASDGD++ D+ Sbjct: 296 VDEQTFFYDPKSGGTIVLSALELMHEIVQQRYPPSAWNVYAAQASDGDSFGADAGKSARF 355 Query: 361 LAKKLLPVVRYYSYIEI--TRRAHQ-TLWREYEHLQSTFDNFAMQHIRDQDDIYPVFREL 417 LA+ LLP RY++YIE+ ++ A + +LW EYE Q T +F M+ I ++ +IYPVF +L Sbjct: 356 LAENLLPATRYFAYIEVPDSQEARKSSLWAEYE--QETAPHFVMRRICERGEIYPVFHDL 413 Query: 418 FHKQNA 423 F K+ A Sbjct: 414 FKKETA 419 >UniRef50_P59348 UPF0229 protein bll6755 n=25 Tax=Alphaproteobacteria RepID=Y6755_BRAJA Length = 427 Score = 377 bits (969), Expect = e-103, Method: Compositional matrix adjust. 
Identities = 188/430 (43%), Positives = 278/430 (64%), Gaps = 22/430 (5%) Query: 4 FIDRRLNGKNKSMVNRQRFLRRYKAQIKQSISEAINKRSVTDVDSGESVSIPTEDISEPM 63 IDRRLN KS+ NRQRFLRR K+ ++ ++ + +R + DV G V+IP + + EP Sbjct: 5 IIDRRLNPGGKSLENRQRFLRRAKSLVQGAVKKTSQERDIKDVLEGGEVTIPLDGMHEPR 64 Query: 64 FHQGRGGLRHRVHPGNDHFVQNDRIERPQGGGGGSGSGQGQASQDGEGQDEFVFQISKDE 123 F + GG R V PGN FV+ D ++R G GS + +G+ +D F F +S+DE Sbjct: 65 FRR-EGGTRDMVLPGNKKFVEGDYLQR-----SGQGSAKDSGPGEGDSEDAFRFVLSRDE 118 Query: 124 YLDLLFEDLALPNLKQNQQRQLTEYKTHRAGYTANGVPANISVVRSLQNSLARRTAMTAG 183 ++DL +DL LP+L + + Q RAGYT +G PANISV R+++ +LARR A+ Sbjct: 119 FVDLFLDDLELPDLAKRKIAQTESEGIQRAGYTTSGSPANISVSRTVKLALARRIALKRP 178 Query: 184 KRRELHALEENLAIISNSEPAQLLEEER--LRKEIAELRAKIERVPFIDTFDLRYKNYEK 241 ++ E+ LE +A ++ E+ER L E+ +L AK +R+PFID D+RY+ +E Sbjct: 179 RKDEIEELEAAIAACTD-------EDERVVLLAELEKLMAKTKRIPFIDPLDIRYRRFET 231 Query: 242 RPDPSSQAVMFCLMDVSGSMDQSTKDMAKRFYILLYLFLSRTYKNVEVVYIRHHTQAKEV 301 P P +QAVMFCLMDVSGSM + KD+AKRFY+LLY+FL R YK+VE+V+IRH +A+EV Sbjct: 232 VPKPVAQAVMFCLMDVSGSMSEHMKDLAKRFYMLLYVFLKRRYKHVEIVFIRHTDRAEEV 291 Query: 302 DEHEFFYSQETGGTIVSSALKLMDEVVKERYNPAQWNIYAAQASDGDNWADDSPLCHEIL 361 DE FFY +GGT+VSSAL+ M ++V+ER+NP+ WNIYAAQASDGDN D L +L Sbjct: 292 DEQTFFYGPASGGTLVSSALQAMHDIVRERFNPSDWNIYAAQASDGDNSYSDGELTGLLL 351 Query: 362 AKKLLPVVRYYSYIEITRR-------AHQTLWREYEHLQSTFDNFAMQHIRDQDDIYPVF 414 K+LPV ++++Y+E+ + +LW YE L+++ +M+ + ++ +I+PVF Sbjct: 352 TDKILPVCQFFAYLEVGESGGSAFDLSDSSLWTLYERLRNSGAPLSMRKVSERSEIFPVF 411 Query: 415 RELFHKQNAT 424 +LF ++ + Sbjct: 412 HDLFQRRETS 421 >UniRef50_Q1QPM0 UPF0229 protein Nham_0975 n=24 Tax=Proteobacteria RepID=Y975_NITHX Length = 439 Score = 367 bits (943), Expect = e-100, Method: Compositional matrix adjust. 
Identities = 186/444 (41%), Positives = 279/444 (62%), Gaps = 28/444 (6%) Query: 1 MTWFIDRRLNGKNKSMVNRQRFLRRYKAQIKQSISEAINKRSVTDVDSGESVSIPTEDIS 60 M FIDRRLN K++S+ NRQRFLRR + ++K+SI + + ++D D ++VSIPT Sbjct: 1 MPIFIDRRLNPKDRSLGNRQRFLRRAREELKRSIRDRVRSGRISDADGEQAVSIPTRSTD 60 Query: 61 EPMFHQGR-GGLRHRVHPGNDHFVQNDRIERPQGGGGGSGSGQGQASQDGEGQDEFVFQI 119 EP F + G R V PGN HFV DR+ +P G G+ + S+D +F F + Sbjct: 61 EPRFEAAKDSGRREHVLPGNKHFVPGDRLRKPGHGAAGTPDPSMKDSED-----DFRFVL 115 Query: 120 SKDEYLDLLFEDLALPNLKQNQQRQLTEYKTHRAGYTANGVPANISVVRSLQNSLARRTA 179 S++E LDL FEDL LP++ + +++ ++ RAG+ A G P NI+V R+++NS RR A Sbjct: 116 SREEVLDLFFEDLELPDMVKLSLKEILAFRPRRAGFAATGSPTNINVGRTMRNSYGRRIA 175 Query: 180 MTAGKRRELHALEENLAIISNSEPAQLLEEERLRKEIAELRAKIER-------VPFIDTF 232 + KR E+ A+ + +A + + + + R+ IA L+A++ER + ++D Sbjct: 176 LKRPKREEVDAIRQEIAELESGSQSPVA-----RQRIAALQAEVERLERKRRLIAYVDPV 230 Query: 233 DLRYKNYEKRPDPSSQAVMFCLMDVSGSMDQSTKDMAKRFYILLYLFLSRTYKNVEVVYI 292 D+R+ +E +P P+++AVMFCLMDVSGSM + KD+AKRF++LL+LFL Y E+V+I Sbjct: 231 DIRFNRFEAQPIPNAKAVMFCLMDVSGSMGEREKDLAKRFFVLLHLFLKCRYDRTEIVFI 290 Query: 293 RHHTQAKEVDEHEFFYSQETGGTIVSSALKLMDEVVKERYNPAQWNIYAAQASDGDNWAD 352 H +A+EV+E FFYS ++GGT+VS+AL+ M ++ ERY ++WNIYAAQASDGDN A Sbjct: 291 SHTHEAQEVNEETFFYSTQSGGTVVSTALEKMHRIIAERYPGSEWNIYAAQASDGDNAAA 350 Query: 353 DSPLCHEILAKKLLPVVRYYSYIEIT--RRAH--------QTLWREYEHLQSTFDNFAMQ 402 DS C +L ++++ + +YY+Y+EI R H +LWR Y + + + NF M Sbjct: 351 DSHRCITLLDEEIMRLCQYYAYVEIIDERERHIFGTTENGTSLWRAYSSVNANWPNFQMT 410 Query: 403 HIRDQDDIYPVFRELFHKQNATAK 426 I D DIYPVFR+LF +Q K Sbjct: 411 RIADAADIYPVFRQLFTRQATAEK 434 >UniRef50_B6B8L1 Putative uncharacterized protein n=2 Tax=Rhodobacterales RepID=B6B8L1_9RHOB Length = 445 Score = 357 bits (917), Expect = 3e-97, Method: Compositional matrix adjust. 
Identities = 193/436 (44%), Positives = 277/436 (63%), Gaps = 21/436 (4%) Query: 1 MTWFIDRRLNGKNKSMVNRQRFLRRYKAQIKQSISEAINKRSVTD----VDSGESVSIPT 56 M FIDRR N K KS+ NRQRFLRR + IK+ + +++ +S+ D GE V+IP Sbjct: 1 MHHFIDRRANPKGKSLGNRQRFLRRARENIKERVDQSVRGKSIQSGSGVPDGGEKVTIPA 60 Query: 57 EDISEP-MFHQGRGGLRHRVHPGNDHFVQNDRIERPQGGGGGSGSGQGQASQDGEGQDEF 115 + EP FH +GGLR V PGN FV D I+RPQGG +G G +AS++G+G+DEF Sbjct: 61 RGLKEPRFFHSSKGGLRRHVLPGNKDFVVGDTIKRPQGG---TGQGGRKASEEGDGEDEF 117 Query: 116 VFQISKDEYLDLLFEDLALPNLKQNQQRQLTEYKTHRAGYTANGVPANISVVRSLQNSLA 175 F ++++EYL++LFE L LP+L + + T RAG T G P N+++VR+++NSL Sbjct: 118 SFTLTQEEYLEILFEGLELPDLVEKATVETETIGTRRAGLTTAGTPNNLNLVRTMRNSLG 177 Query: 176 RRTAMTAGKRRELHALEENLA---IISNSEPAQLLEEERLRKEIAELRAKIERVPFIDTF 232 RR A+ + LEE +A + + P Q E LRK++ + K + V +ID Sbjct: 178 RRIALQRPTTKSQRDLEEQIAELEALDDRTPPQEDFLEALRKKLDGIIRKRKVVGYIDPL 237 Query: 233 DLRYKNYEKRPDPSSQAVMFCLMDVSGSMDQSTKDMAKRFYILLYLFLSRTYKNVEVVYI 292 DLRY + +S+AV+FCLMDVSGSM + KD+AKRF++LL+LFL R Y++ E+V++ Sbjct: 238 DLRYDTFVPEKIRNSRAVVFCLMDVSGSMQEREKDLAKRFFLLLHLFLERCYEHTELVFV 297 Query: 293 RHHTQAKEVDEHEFFYSQETGGTIVSSALKLMDEVVKERYNPAQWNIYAAQASDGDNWAD 352 RH A+EVDE FFY++ETGGTIVS+AL+ M E+++ERY P +WNIY AQASDG+N+ + Sbjct: 298 RHTHHAQEVDEETFFYARETGGTIVSTALEKMKEIIEERYPPDEWNIYGAQASDGENFGN 357 Query: 353 DSPLCHEILAKKLLPVVRYYSYIEITRRAHQT----------LWREYEHLQSTFDNFAMQ 402 DS C ++L LLPV ++Y+Y+EI A + LW+ Y +++ +F MQ Sbjct: 358 DSARCKKLLLNDLLPVSQFYAYVEIVDEAAEMLLNNPEAGEDLWQNYREVKAQAQHFEMQ 417 Query: 403 HIRDQDDIYPVFRELF 418 + IYP+FRE F Sbjct: 418 RVSQPGHIYPIFREFF 433 >UniRef50_B5EPS6 Putative uncharacterized protein n=2 Tax=Acidithiobacillus ferrooxidans RepID=B5EPS6_ACIF5 Length = 434 Score = 265 bits (677), Expect = 3e-69, Method: Compositional matrix adjust. 
Identities = 167/430 (38%), Positives = 259/430 (60%), Gaps = 17/430 (3%) Query: 1 MTWFIDRRLNGKNKSMVNRQRFLRRYKAQIKQSISEAINKRSVTDV-DSGESVSIPTEDI 59 M+ IDRR +G +S N+ R RR +A++K ++ + S+ D+ ++ + VSIPT D+ Sbjct: 1 MSMIIDRRSSG-TRSTANQDRLQRRVRARLKVAVEKMARSGSIEDLANTDQPVSIPTRDL 59 Query: 60 SEPMFHQGRGGLR-HRVHPGNDHFVQNDRIERPQGGGGGSGSGQGQASQDGEGQDEFVFQ 118 EP F + RV PGN + + D I +P+GG + DG G+DE Sbjct: 60 HEPSFRRDLSDTSWERVLPGNKEYQRGDEINKPEGG---GSGKGRAGAPDGLGEDEVAIV 116 Query: 119 ISKDEYLDLLFEDLALPNLKQNQQRQLTEYKTHRAGYTANGVPANISVVRSLQNSLARRT 178 +S DE+LDLLF+ LALPNL++ Q + + RAG+ +G P+ + V R+++ + ARR Sbjct: 117 LSADEFLDLLFDGLALPNLRKMAQGDIQADQWRRAGFIKDGSPSRMHVGRTMRAARARRL 176 Query: 179 AMTAGKRRELHALEENLAIISNSEPAQL-------LEEERLRK---EIAELRAKIERVPF 228 A+ AGKRREL L + ++ +L +E+ERL + +I L KI+ +PF Sbjct: 177 ALRAGKRRELQDLLDARNVLQEEIQGRLAQKQDVSVEQERLSELNHQIDALERKIKAIPF 236 Query: 229 IDTFDLRYKNYEKRPDPSSQAVMFCLMDVSGSMDQSTKDMAKRFYILLYLFLSRTYKNVE 288 ID DLR+ + +++P P + AVMFC+MDVSGSM + KD+AKRF++LLYLFL R Y+ V+ Sbjct: 237 IDEADLRFAHIDQQPHPITNAVMFCVMDVSGSMGEKEKDLAKRFFLLLYLFLHRHYQAVQ 296 Query: 289 VVYIRHHTQAKEVDEHEFFYSQETGGTIVSSALKLMDEVVKERYNPAQWNIYAAQASDGD 348 +V+I+HH+ A E E FF ++E GGT+VS A+ L +E++++R+ P +WN+Y AQ SDGD Sbjct: 297 MVFIKHHSTASECSEQAFFGAREGGGTLVSPAIILSEEIMRQRFPPDRWNVYLAQVSDGD 356 Query: 349 NWADDSPLCHEILAKKLLPVVRYYSYIEITRRAHQTLWREYEHLQSTFDNFAMQHIRDQD 408 N+ D+ + E L L + + Y+E+ R + L R Y+ + F +++ Sbjct: 357 NYFADNAVVEEHLLNLLPRLRNLF-YLEVNRDSESDLLRLYDAIAQDFPELVTARASERE 415 Query: 409 DIYPVFRELF 418 DIYP+FR LF Sbjct: 416 DIYPMFRTLF 425 >UniRef50_Q1GJ99 Putative uncharacterized protein n=1 Tax=Ruegeria sp. TM1040 RepID=Q1GJ99_SILST Length = 300 Score = 229 bits (583), Expect = 2e-58, Method: Compositional matrix adjust. 
Identities = 126/285 (44%), Positives = 185/285 (64%), Gaps = 19/285 (6%) Query: 150 THRAGYTANGVPANISVVRSLQNSLARRTAM---TAGKRRELHALEENLAIISNSEPAQL 206 T RAG T G P N+++VR+++NSL RR A+ + +R+L A L I P+Q Sbjct: 13 TRRAGLTTAGTPNNLNLVRTMRNSLGRRIALQRPSTQTQRDLEAQVAELEEIEARSPSQ- 71 Query: 207 LEEERLRKEIAEL---RAKIERVPFIDTFDLRYKNYEKRPDPSSQAVMFCLMDVSGSMDQ 263 +E L + +A+L + K V +ID DLRY + +S+AV+FCLMDVSGSM + Sbjct: 72 --DELLAELVAKLDGIKRKRRVVGYIDPLDLRYDTFVPEKIRNSRAVVFCLMDVSGSMQE 129 Query: 264 STKDMAKRFYILLYLFLSRTYKNVEVVYIRHHTQAKEVDEHEFFYSQETGGTIVSSALKL 323 KD+AKRF++LL+LFL+R Y++ E+V++RH A+EVDE FFY++ETGGTIVS+AL+ Sbjct: 130 REKDLAKRFFLLLHLFLTRGYEHTEIVFVRHTHYAQEVDEETFFYARETGGTIVSTALEK 189 Query: 324 MDEVVKERYNPAQWNIYAAQASDGDNWADDSPLCHEILAKKLLPVVRYYSYIEITRRAHQ 383 M E++ ERY P +WNIY AQASDG+N+ +DS C +IL ++LLP+ ++Y+Y+EI + Q Sbjct: 190 MKEIIDERYPPDEWNIYGAQASDGENFGNDSVRCRKILTEQLLPMCQFYAYVEIVEESAQ 249 Query: 384 T----------LWREYEHLQSTFDNFAMQHIRDQDDIYPVFRELF 418 LW+ Y ++ +F MQ + + IYP+FRE F Sbjct: 250 MLLDNTEAGEDLWQNYRQVKEACRHFEMQRVSEPGHIYPIFREFF 294 >UniRef50_B9DZE4 UPF0229 protein CKR_0568 n=26 Tax=Clostridiales RepID=Y568_CLOK1 Length = 403 Score = 192 bits (489), Expect = 1e-47, Method: Compositional matrix adjust. 
Identities = 130/410 (31%), Positives = 211/410 (51%), Gaps = 25/410 (6%) Query: 13 NKSMVNRQRFLRRYKAQIKQSISEAINKRSVTDVDSGESVSIPTEDISEPMFHQGRGGLR 72 ++S+ +R+R + + IK ++++ I++ S+ + V IP + I E F G Sbjct: 14 DRSLEDRRRHRQLVEKSIKDNLADIISEESIIGQSKNKKVKIPIKGIKEYQFIYGDNSSG 73 Query: 73 HRVHPGNDHFVQNDRIERPQGGGGGSGSGQGQASQDGEGQDEFVFQISKDEYLDLLFEDL 132 G+ + DRI + G G+ QG +Q EG+D + +++ ++ LD L EDL Sbjct: 74 VGSGDGSQK--KGDRIGKAIKDRDGKGN-QGAGNQ--EGEDMYEIEVTIEDVLDYLMEDL 128 Query: 133 ALPNLKQNQQRQ-LTEYKTHRAGYTANGVPANISVVRSLQNSLARRTAMTAGKRRELHAL 191 LP + + + Q L+ ++GY G+ ++ R++ L R+ T RE+H Sbjct: 129 ELPLMDKKKFSQILSNNSPKKSGYQRKGINPRLAKKRTVVEKLKRQQG-TKRALREIHGE 187 Query: 192 EENLAIISNSEPAQLLEEERLRKEIAELRAKIERVPFIDTFDLRYKNYEKRPDPSSQAVM 251 E S+P L E K R PF DLRY +++P A + Sbjct: 188 LE-------SDPKNKLPENTTIKS---------RFPFKQD-DLRYFRVKRKPKLELNAAI 230 Query: 252 FCLMDVSGSMDQSTKDMAKRFYILLYLFLSRTYKNVEVVYIRHHTQAKEVDEHEFFYSQE 311 C+MD SGSMD + K +A+ F+ +LY F+ Y NVEV +I H T AK V E+EFF+ E Sbjct: 231 ICVMDTSGSMDSTRKFLARSFFFVLYRFIKMKYNNVEVKFISHSTSAKVVTENEFFHKVE 290 Query: 312 TGGTIVSSALKLMDEVVKERYNPAQWNIYAAQASDGDNWADDSPLCHEILAKKLLPVVRY 371 +GGT +SS LK EV++E YNPA WN+Y SDGDNW++D+ L + AK L V Sbjct: 291 SGGTYISSGLKKALEVIEENYNPAYWNVYTFYVSDGDNWSEDNSLALK-CAKDLCKVCNL 349 Query: 372 YSYIEITRRAHQTLWREYEHLQSTFDNFAMQHIRDQDDIYPVFRELFHKQ 421 +SY EI + + + + T +NF + I ++ D++ +++ +K+ Sbjct: 350 FSYAEIIPSPYGSSIKHIFQNKITDNNFTVVTIHEKQDLWKSLKKILNKE 399 >UniRef50_Q3J885 Putative uncharacterized protein n=2 Tax=Nitrosococcus oceani RepID=Q3J885_NITOC Length = 394 Score = 175 bits (443), Expect = 3e-42, Method: Compositional matrix adjust. 
Identities = 126/419 (30%), Positives = 208/419 (49%), Gaps = 49/419 (11%) Query: 13 NKSMVNRQRFLRRYKAQIKQSISEAINKRSVTDVDSGESVSIPTEDISEPMFHQGRGGLR 72 ++S +R R ++ + I+ ++++ + + S+ + +P I E F G+ Sbjct: 17 DRSAKDRLRHRQKVRKAIRDNVADIVAEESIIGQSRDRIIKVPIRGIREYRFVYGQNTPG 76 Query: 73 HRVHPGNDHFVQNDRIERPQGGGGGSGSGQGQASQDGEGQDEFVFQISKDEYLDLLFEDL 132 G+ Q + PQG GG +G D G D + +I+ +E ++++ EDL Sbjct: 77 VGTGQGDSEPGQTVG-QVPQGDGGPGHAG------DRPGMDYYETEITLEELIEIMLEDL 129 Query: 133 ALPNLKQNQQRQ-LTEYKTHRAGYTANGVPANISVVRSLQNSLARRTAMTAGKRRELHAL 191 LP++++ + R+ L+E + R G+ GV ++ RRTA + +RR Sbjct: 130 ELPDMERKRFREVLSERTSKRKGFRRVGVRVHMD---------KRRTAKSRIRRR----- 175 Query: 192 EENLAIISNSEPAQLLEEERLRKEIAELRAKIERVPFIDTFDLRYKNYEKRPDPSSQAVM 251 + S+ + AE R PF D+RY + P S AV+ Sbjct: 176 -----LASDKD--------------AEDNETKHRFPFHRD-DMRYHRLREDMRPQSNAVV 215 Query: 252 FCLMDVSGSMDQSTKDMAKRFYILLYLFLSRTYKNVEVVYIRHHTQAKEVDEHEFFYSQE 311 FC+MD SGSMD K +A+ F+ LLY F+ Y NV+VV+I HHT+A+EV E EFF+ E Sbjct: 216 FCIMDTSGSMDTLKKYLARSFFFLLYQFVRSRYVNVDVVFIAHHTKAREVTEEEFFHKGE 275 Query: 312 TGGTIVSSALKLMDEVVKERYNPAQWNIYAAQASDGDNWADDSPLCHEILAKKLLPVVRY 371 GGT +SS E+++ RY+P+ WNIYA SDGDN+ D+ + A+ L V Sbjct: 276 AGGTFISSGYSKALEIIQNRYHPSLWNIYAFHCSDGDNFDSDNAATLKA-AEVLCQVCNL 334 Query: 372 YSYIEITRRA----HQTLWREYEHLQSTFDNFAMQHIRDQDDIYPVFRELFHKQNATAK 426 + Y EI R T+ + ++ DNF I+ ++DI+P FR+L +++ ++K Sbjct: 335 FGYGEIKPRPSGFYEGTMLDLFRSVR--MDNFQSVLIQRKEDIWPSFRQLLSRESESSK 391 >UniRef50_UPI00016C05A6 hypothetical protein Epulo_00085 n=1 Tax=Epulopiscium sp. 'N.t. morphotype B' RepID=UPI00016C05A6 Length = 368 Score = 170 bits (431), Expect = 9e-41, Method: Compositional matrix adjust. 
Identities = 124/401 (30%), Positives = 190/401 (47%), Gaps = 71/401 (17%) Query: 24 RRYKAQIKQSISEAINK----RSVTDVDSGESVSIPTEDISEPMFHQGRGGLRHRVHPGN 79 +R++ +++SI E I+ S+T+ +G V + +++ E F G V G+ Sbjct: 20 KRHRKLVEKSIRENIDMLIVGESITETAAGNIVKVRIQELPEYRFKFGSS--TEYVAIGD 77 Query: 80 DHFVQNDRIERPQGGGGGSGSGQGQASQDGEGQDEFVFQISKDEYLDLLFEDLALPNLKQ 139 V N++ + + +AS + G D + +I D+ L LLFE L LPNL + Sbjct: 78 GDEVVNEKCDF-----------EMEASNEA-GLDIYESEIVLDDALALLFEQLELPNLYE 125 Query: 140 NQQRQLTEYKTH-RAGYTANGVPANISVVRSLQNSLARRTAMTAGKRRELHALEENLAII 198 + + L + T R+G G+ + R+LQ + R Sbjct: 126 KKFKNLEYFSTQKRSGIKKTGIYPRFAKKRTLQEKIIRN--------------------- 164 Query: 199 SNSEPAQLLEEERLRKEIAELRAKIERVPFIDTFDLRYKNYEKRPDPSSQAVMFCLMDVS 258 + FI+ D+RY++ K+ S AV+ C+MD S Sbjct: 165 -------------------------KNGRFINQ-DIRYQSLAKKQINHSNAVIVCIMDTS 198 Query: 259 GSMDQSTKDMAKRFYILLYLFLSRTYKNVEVVYIRHHTQAKEVDEHEFFYSQETGGTIVS 318 GSM + KDMAK FY LLY F+ Y VE+++I H T AKEV E++FF+ E+GGT +S Sbjct: 199 GSMGTTKKDMAKSFYFLLYQFIKIRYAKVEMIFIAHSTIAKEVTENDFFHKGESGGTYIS 258 Query: 319 SALKLMDEVVKERYNPAQWNIYAAQASDGDNWADDSPLCHEILAKKLLPVVRYYSYIEI- 377 S E++KERY+P WN+Y SDGDNW DD+ L LA +L + YIEI Sbjct: 259 SGYTKALEIIKERYDPRLWNVYTFHCSDGDNWTDDNNLAVS-LANELCSCSNLFGYIEIK 317 Query: 378 TRRAHQTLWREYE-HLQSTFDNFAMQHIRDQDDIYPVFREL 417 T + EY H+ S +NF I + DI+ VF+++ Sbjct: 318 TNNYSSVILNEYNAHITS--NNFLALKIFKKSDIFEVFKKV 356 >UniRef50_A9DKM0 Putative uncharacterized protein (Fragment) n=1 Tax=Shewanella benthica KT99 RepID=A9DKM0_9GAMM Length = 167 Score = 170 bits (430), Expect = 1e-40, Method: Compositional matrix adjust. 
Identities = 102/153 (66%), Positives = 125/153 (81%), Gaps = 1/153 (0%) Query: 1 MTWFIDRRLNGKNKSMVNRQRFLRRYKAQIKQSISEAINKRSVTDVDSGESVSIPTEDIS 60 M FIDRRLN + +S VNRQRF+ RYK QIK+++S+A+ +RSVTDVD GE +SIPT+DIS Sbjct: 16 MANFIDRRLNARGRSTVNRQRFINRYKQQIKKAVSDAVTRRSVTDVDKGERISIPTKDIS 75 Query: 61 EPMFHQGRGGLRHRVHPGNDHFVQNDRIERPQGGGGGSGSGQGQASQDGEGQDEFVFQIS 120 EP FHQG+GG+R RVHPGND F++ D+IERP GGG GSGQG AS GEG D+FVFQIS Sbjct: 76 EPSFHQGQGGIRERVHPGNDQFIKGDKIERPP-GGGSQGSGQGDASNSGEGDDDFVFQIS 134 Query: 121 KDEYLDLLFEDLALPNLKQNQQRQLTEYKTHRA 153 KDEYL+LLFEDL LPNL+ N+ +L EY+ +RA Sbjct: 135 KDEYLELLFEDLELPNLQNNRLNKLVEYQVYRA 167 >UniRef50_B9L510 Sporulation protein YhbH n=1 Tax=Thermomicrobium roseum DSM 5159 RepID=B9L510_THERP Length = 389 Score = 153 bits (387), Expect = 1e-35, Method: Compositional matrix adjust. Identities = 121/420 (28%), Positives = 188/420 (44%), Gaps = 75/420 (17%) Query: 14 KSMVNRQRFLRRYKAQIKQSISEAINKRSVTDVDSGESVSIPTEDISEPMFHQGRGGLRH 73 K +++ R + + K IK+++++ ++++S+ D V +P + E R Sbjct: 19 KGAIDQARHMEKVKEAIKRNLADIVSEQSLITSDGKRVVRVPIRVLEE---------YRF 69 Query: 74 RVHPGNDHFVQNDRIERPQGGGGGSG------------SGQGQASQDGEGQDEFVFQISK 121 R P + V QG GG SG G + D G D + +++ Sbjct: 70 RFDPDSGRQVG-------QGSGGTHVGDVVGRVGGGQRSGDGPQAGDQPGIDYYEAELTI 122 Query: 122 DEYLDLLFEDLALPNLKQNQQRQLTEYKTHRAGYTANGVPANISVVRSLQNSLARRTAMT 181 +E +L+FEDL LPNL++ + R+L +G AN+ R+L+ +L RR A Sbjct: 123 EELSELIFEDLELPNLEEKRLRELESEAVRFTEIRRHGPFANLDKRRTLRENL-RRNAW- 180 Query: 182 AGKRRELHALEENLAIISNSEPAQLLEEERLRKEIAELRAKIERVPFIDTFDLRYKNYEK 241 R R I + + DLR+K +E+ Sbjct: 181 -----------------------------RGRARIGDFANE----------DLRFKTWER 201 Query: 242 RPDPSSQAVMFCLMDVSGSMDQSTKDMAKRFYILLYLFLSRTYKNVEVVYIRHHTQAKEV 301 S AV+ +MDVSGSM K +++ FY + FL Y VE+ +I HH +A+EV Sbjct: 202 DVKRESNAVVIAMMDVSGSMGTFEKYVSRAFYYWMVRFLRTKYDRVEIRFIAHHAEAREV 261 Query: 302 DEHEFFYSQETGGTIVSSALKLMDEVVKERYNPAQWNIYAAQASDGDNWADDSPLCHEIL 361 E EFF E+GGT S+A +L ++++E Y P WNIY SDGDNW D+ C E L Sbjct: 262 
SEEEFFSRGESGGTRASTAYELALQLIRESYPPDSWNIYPFHFSDGDNWPSDNERCRE-L 320 Query: 362 AKKLLPVVRYYSYIEI--TRRAHQ-TLWREYEHLQSTFDNFAMQHIRDQDDIYPVFRELF 418 A++LL + Y EI R +Q TL + + S I ++ D+Y R F Sbjct: 321 AEELLRCANLFGYGEIRQGRYTYQSTLMHTLQRIGSP--KLVTVTITEKADVYQALRRFF 378 >UniRef50_Q896G6 Conserved protein n=1 Tax=Clostridium tetani RepID=Q896G6_CLOTE Length = 386 Score = 152 bits (383), Expect = 3e-35, Method: Compositional matrix adjust. Identities = 115/416 (27%), Positives = 199/416 (47%), Gaps = 43/416 (10%) Query: 13 NKSMVNRQRFLRRYKAQIKQSISEAINKRSVTDVDSGESVSIPTEDISEPMFHQGRGGLR 72 N++ +R+R + IK ++ + + + ++ + +P + + E F + Sbjct: 13 NRAGEDRKRHRELVEKSIKDNLVDVLLQEDISIQKENIKIKVPIKGVKEYEFTYSQNRSF 72 Query: 73 HRVHPGNDHFVQNDRIERPQGGGGGSGSGQGQASQDGEGQDEFVFQISKDEYLDLLFEDL 132 V GN+ Q ++R S G G + + EG+D F +++ +E +F+DL Sbjct: 73 VVVGKGNEKKGQKIALKR------ASEQGGGAGAGEIEGEDIFETEVTIEEIFQSIFDDL 126 Query: 133 ALPNLKQNQ-QRQLTEYKTHRAGYTANGVPANISVVRSLQNSLARRTAMTAGKRRELHAL 191 LPNLK+ + + L + + G+ +G+ ++ RRTA+ KR++ Sbjct: 127 ELPNLKKKKFNKILNDSFKRKKGFKKHGISPRLA---------KRRTAIEKVKRKQAT-- 175 Query: 192 EENLAIISNSEPAQLLEEERLRKEIAELRAKIERVPFIDTFDLRYKNYEKRPDPSSQAVM 251 ++ L ++IAE R PF DLRY + + AV+ Sbjct: 176 -----------------QKVLGRDIAE------RFPFKKD-DLRYSRVKLNKNKEYNAVI 211 Query: 252 FCLMDVSGSMDQSTKDMAKRFYILLYLFLSRTYKNVEVVYIRHHTQAKEVDEHEFFYSQE 311 C+MD S SMDQ K MA+ F+ ++Y F+ Y+ V++ +I H T AKEV E EFF+ E Sbjct: 212 ICIMDTSASMDQMKKYMARSFFFMIYKFIKMKYEEVDICFISHSTTAKEVTEEEFFHKVE 271 Query: 312 TGGTIVSSALKLMDEVVKERYNPAQWNIYAAQASDGDNWADDSPLCHEILAKKLLPVVRY 371 +GGT +SS K E++ RYNP +NIY ASDGDNW +D+ ++ AK+L V Sbjct: 272 SGGTYISSGYKKALEIINTRYNPQIYNIYTFHASDGDNWNEDNDRAVKV-AKELSNVCNL 330 Query: 372 YSYIEITRRAHQTLWREYEHLQSTFDNFAMQHIRDQDDIYPVFRELFHKQNATAKG 427 + YIEI + R + +NF I ++D++ +++ ++ +G Sbjct: 331 FGYIEIMGYGYSNGIRNKYLKEIEKENFIPLIIEKKEDLWRALKDILKQEMREERG 386 >UniRef50_B7H9V9 UPF0229 protein BCB4264_A0587 n=93 Tax=Bacteria RepID=Y587_BACC4 Length = 391 Score = 146 bits (369), Expect = 1e-33, Method: Compositional 
matrix adjust. Identities = 109/415 (26%), Positives = 189/415 (45%), Gaps = 53/415 (12%) Query: 14 KSMVNRQRFLRRYKAQIKQSISEAINKRSVTDVDSGESVSIPTEDISEPMFHQGRGGLRH 73 K ++QR + + IK ++ + + + S+ + + V IP + E +H Sbjct: 21 KGYDDQQRHQEKVQEAIKNNLPDLVTEESIVMSNGKDVVKIPIRSLDEYKIRYNYDKNKH 80 Query: 74 RVHPGNDHFVQNDRIERPQGGGGG-SGSGQGQASQDGEGQDEFVFQISKDEYLDLLFEDL 132 V GN D + R GG G G+GQ + D G+D + ++S E F +L Sbjct: 81 -VGQGNGDSKVGDVVARDGSGGQKQKGPGKGQGAGDAAGEDYYEAEVSILELEQAFFREL 139 Query: 133 ALPNLKQNQQRQLTEYKTHRAGYT---ANGVPANISVVRSLQNSLARRTAMTAGKRRELH 189 LPNLK+ +++ E + + G+ NI R++ S +R AM+ + H Sbjct: 140 ELPNLKR---KEMDENRIEHVEFNDIRKTGLWGNIDKKRTMI-SAYKRNAMSG--KASFH 193 Query: 190 ALEENLAIISNSEPAQLLEEERLRKEIAELRAKIERVPFIDTFDLRYKNYEKRPDPSSQA 249 + + DL+++ + + P S+A Sbjct: 194 PIHQE--------------------------------------DLKFRTWNEVLKPDSKA 215 Query: 250 VMFCLMDVSGSMDQSTKDMAKRFYILLYLFLSRTYKNVEVVYIRHHTQAKEVDEHEFFYS 309 V+ +MD SGSM K MA+ F+ + FL Y+ V++ +I HHT+AK V E EFF Sbjct: 216 VVLAMMDTSGSMGIWEKYMARSFFFWMTRFLRTKYETVDIEFIAHHTEAKVVTEEEFFSK 275 Query: 310 QETGGTIVSSALKLMDEVVKERYNPAQWNIYAAQASDGDNWADDSPLCHEILAKKLLPVV 369 E+GGTI SS K E++ +Y+P ++NIY SDGDN D+ C + L ++L+ Sbjct: 276 GESGGTICSSVYKKALELIDNKYSPDRYNIYPFHFSDGDNLTSDNARCVK-LVEELMKKC 334 Query: 370 RYYSYIEITR-RAHQTLWREYEHLQSTFDNFAMQHIRDQDDIYPVFRELFHKQNA 423 + Y E+ + H TL Y++++ DNF ++ + D++ + F +++ Sbjct: 335 NMFGYGEVNQYNRHSTLMSAYKNIKD--DNFRYYILKQKADVFHAMKSFFREESG 387 >UniRef50_A9FJ88 Uncharacterized conserved protein involved in stress response n=21 Tax=Bacteria RepID=A9FJ88_SORC5 Length = 405 Score = 140 bits (353), Expect = 8e-32, Method: Compositional matrix adjust. 
Identities = 108/407 (26%), Positives = 187/407 (45%), Gaps = 57/407 (14%) Query: 21 RFLRRYKAQIKQSISEAINKRSVTDVDSGESVSIPTEDISEPMFHQG---RGGL-RHRVH 76 RF + + +I++++ + I++ + + VSIP I P F G RGG+ + + Sbjct: 47 RFRQIVRGRIRENLRKYISQGELIGRKGKDLVSIPIPQIDIPRFRFGDKQRGGVGQGDGN 106 Query: 77 PGNDHFVQNDRIERPQGGGGGSGSGQGQASQDGEGQDEFVFQISKDEYLDLLFEDLALPN 136 PG+ P GG GQGQA GEG ++ +E +L E+L LP+ Sbjct: 107 PGD-----------PVGGSDDKQPGQGQAGS-GEGDHLLEVDVTLEELAGILGEELELPD 154 Query: 137 LKQNQQRQLTEYKTHRAGYTANGVPANISVVRSLQNSLARRTAMTAGKRRELHALEENLA 196 ++ + +++ +G G + R+ + +L R Sbjct: 155 IQDKGKSKISNAHDRYSGIRRVGPESLRHFKRTYREALKR-------------------- 194 Query: 197 IISNSEPAQLLEEERLRKEIAELRAKIERVPFIDTFDLRYKNYEKRPDPSSQAVMFCLMD 256 ++ R + + D RY++++ +P + AV+ +MD Sbjct: 195 ---------MISSGTFRPSAPVVVPVPD--------DKRYRSWKTITEPVANAVIIYMMD 237 Query: 257 VSGSMDQSTKDMAKRFYILLYLFLSRTYKNVEVVYIRHHTQAKEVDEHEFFYSQETGGTI 316 VSGSM K++ + + +L+R YK +E +I H A+EVD FF+++E+GGT+ Sbjct: 238 VSGSMGDEQKEIVRIESFWIDAWLTRQYKGLESRFIIHDAIAREVDRDTFFHTRESGGTM 297 Query: 317 VSSALKLMDEVVKERYNPAQWNIYAAQASDGDNWA-DDSPLCHEILAKKLLPVVRYYSY- 374 +SSA KL +++ Y P +WNIY SDGDNW+ DD+ C ++L ++LP V ++Y Sbjct: 298 ISSAYKLCSQIIDNDYPPDEWNIYPFHFSDGDNWSMDDTLSCVDVLKTQILPRVNMFAYG 357 Query: 375 -IEITRRAHQTLWREYEHLQSTFDNFAMQHIRDQDDIYPVFRELFHK 420 +E + Q + EH S D + IRD+D I ++ K Sbjct: 358 QVESPYGSGQFIKDLKEHF-SQDDRVVVSEIRDKDAIVGSIKDFLGK 403 >UniRef50_UPI0001913F8A hypothetical protein Salmonellaentericaenterica_26029 n=2 Tax=Salmonella enterica subsp. enterica serovar Typhi RepID=UPI0001913F8A Length = 88 Score = 136 bits (343), Expect = 1e-30, Method: Compositional matrix adjust. 
Identities = 78/88 (88%), Positives = 82/88 (93%) Query: 145 LTEYKTHRAGYTANGVPANISVVRSLQNSLARRTAMTAGKRRELHALEENLAIISNSEPA 204 LT KTHRAG+T+NGVPANISVVRSLQNSLARRTAMTAGKRRELHALE L IS+SEPA Sbjct: 1 LTSNKTHRAGFTSNGVPANISVVRSLQNSLARRTAMTAGKRRELHALETELETISHSEPA 60 Query: 205 QLLEEERLRKEIAELRAKIERVPFIDTF 232 QLLEEERLR+EIAELRAKIERVPFIDTF Sbjct: 61 QLLEEERLRREIAELRAKIERVPFIDTF 88 >UniRef50_A7Z2R4 UPF0229 protein RBAM_009260 n=29 Tax=Firmicutes RepID=Y926_BACA2 Length = 394 Score = 136 bits (342), Expect = 2e-30, Method: Compositional matrix adjust. Identities = 106/411 (25%), Positives = 184/411 (44%), Gaps = 47/411 (11%) Query: 14 KSMVNRQRFLRRYKAQIKQSISEAINKRSVTDVDSGESVSIPTEDISEPMFHQGRGGLRH 73 K ++QR ++ + IK ++ + + + S+ + + V IP + E +H Sbjct: 21 KGFDDQQRHQKKVQEAIKNNLPDLVTEESIIMSNGKDVVKIPIRSLDEYKIRYNYDKNKH 80 Query: 74 RVHPGNDHFVQNDRIERPQGGGGGSGSGQGQASQDGEGQDEFVFQISKDEYLDLLFEDLA 133 V G+ D + R G G+G+GQ + D G+D + ++S + + LF++L Sbjct: 81 -VGQGDGDSEVGDVVAR-DGADKKQGAGKGQGAGDQAGEDYYEAEVSLMDLEEALFQELE 138 Query: 134 LPNLKQNQQRQLTEYKTHRAGYTANGVPANISVVRSLQNSLARRTAMTAGKRRELHALEE 193 LPNL+Q ++ + G+ NI +RT ++A KR + Sbjct: 139 LPNLQQKERDNIVHTDIEFNDIRKTGLTGNID---------KKRTMLSAYKRNAMTGKPS 189 Query: 194 NLAIISNSEPAQLLEEERLRKEIAELRAKIERVPFIDTFDLRYKNYEKRPDPSSQAVMFC 253 I DL+YK + P S+AV+ Sbjct: 190 FYPIYPE--------------------------------DLKYKTWNDVTKPESKAVVLA 217 Query: 254 LMDVSGSMDQSTKDMAKRFYILLYLFLSRTYKNVEVVYIRHHTQAKEVDEHEFFYSQETG 313 +MD SGSM K MA+ F+ + FL Y+ V++ +I HHT+AK V E +FF E+G Sbjct: 218 MMDTSGSMGVWEKYMARSFFFWMTRFLRTKYETVDIEFIAHHTEAKVVSEEDFFSKGESG 277 Query: 314 GTIVSSALKLMDEVVKERYNPAQWNIYAAQASDGDNWADDSPLCHEILAKKLLPVVRYYS 373 GTI SS + E++ E+Y+PA++NIY SDGDN D+ C + L ++ + Sbjct: 278 GTICSSVYRKSLELIDEKYDPARYNIYPFHFSDGDNLTSDNARCVK-LVNDIMKKSNLFC 336 Query: 374 YIEITR-RAHQTLWREYEHLQSTFDNFAMQHIRDQDDIYPVFRELFHKQNA 423 Y E+ + H TL Y++++ D F ++ + D++ + F + + Sbjct: 337 YGEVNQYNRHSTLMSAYKNVKD--DKFKYYILKQKSDVFQALKSFFKNEES 385 >UniRef50_C9RD74 Putative uncharacterized protein n=1 Tax=Ammonifex 
degensii KC4 RepID=C9RD74_AMMDK Length = 371 Score = 125 bits (313), Expect = 4e-27, Method: Compositional matrix adjust. Identities = 115/406 (28%), Positives = 176/406 (43%), Gaps = 71/406 (17%) Query: 25 RYKAQIKQSISEAINKRSVTDVDSGESVSIPTEDISEPMF----HQ----GRGGLRHRVH 76 + K I+Q + E I + S+ D V IP + E F HQ G+ G Sbjct: 26 KLKEIIRQRLPELITEESLILADDRRKVRIPLRLVEEFRFRFASHQEMLVGQAG----SQ 81 Query: 77 PGNDHFVQNDRIERPQGGGGGSGSGQGQASQDGEGQDEFVFQISKDEYLDLLFEDLALPN 136 PG D + I P G+G + G D + ++S +E +++FE+LALP+ Sbjct: 82 PGTD-----ETIVFPG-------IGRGGGAGTEPGIDYYEAEVSVEEIAEVVFEELALPH 129 Query: 137 LK--QNQQRQLTEYKTHRAGYTANGVPANISVVRSLQNSLARRTAMTAGKRRELHALEEN 194 K R + E A G+ A + R+L N+L R HA E Sbjct: 130 YKPKNTANRGIAE---EWADLRRQGIRACLDRRRTLLNALKR------------HAKEG- 173 Query: 195 LAIISNSEPAQLLEEERLRKEIAELRAKIERVPFIDTFDLRYKNYEKRPDPSSQAVMFCL 254 RK E R + DLR++ + P ++AV+ + Sbjct: 174 ------------------RK--GEFR--------LCPSDLRFRVWRSIESPEARAVVLAM 205 Query: 255 MDVSGSMDQSTKDMAKRFYILLYLFLSRTYKNVEVVYIRHHTQAKEVDEHEFFYSQETGG 314 +D SGSM K +A+ F+ + FL Y NVEVVY+ HHT+A+E EFF E+GG Sbjct: 206 LDTSGSMGPLEKYLARSFFFWMVRFLEANYANVEVVYLAHHTEARETTASEFFRKGESGG 265 Query: 315 TIVSSALKLMDEVVKERYNPAQWNIYAAQASDGDNWADDSPLCHEILAKKLLPVVRYYSY 374 T SS +L ++++ RY P ++NIYA SDGDN D+ C E++ +LL V Y Sbjct: 266 TRCSSVYELALDIIETRYPPTEYNIYAFHFSDGDNLPADNERCMELIG-RLLEVANLVGY 324 Query: 375 IEITRRAHQTLWREYEHLQSTFDNFAMQHIRDQDDIYPVFRELFHK 420 EI T + + + +R++ D+Y + F + Sbjct: 325 GEIEGPYFYTSTLKTVYQSIAHPRLVVVTLRERKDVYRALKAFFAR 370 >UniRef50_A4BNQ7 Putative uncharacterized protein n=1 Tax=Nitrococcus mobilis Nb-231 RepID=A4BNQ7_9GAMM Length = 391 Score = 113 bits (282), Expect = 2e-23, Method: Compositional matrix adjust. 
Identities = 81/320 (25%), Positives = 146/320 (45%), Gaps = 53/320 (16%) Query: 115 FVFQISKDEYLDLLFEDLALPNLKQNQQRQLTEYKTHRAGYTANGVPANISVVRSLQNSL 174 F + D+ LD L+++L LP+LK ++ E R G+ G + + R+++ ++ Sbjct: 117 FALEFQIDDILDWLWDELELPHLKPRLGTRIEEDAYIREGWDRRGARSRLDRRRTMKEAI 176 Query: 175 ARRTAMTAGKRRELHALEENLAIISNSEPAQLLEEERLRKEIAELRAKIERVPFIDTFDL 234 RR+A E +P ++ DL Sbjct: 177 KRRSAQGP-----------------------------------------EAIPIVND-DL 194 Query: 235 RYKNYEKRPDPSSQAVMFCLMDVSGSMDQSTKDMAKRFYILLYLFLSRTYKNVEVVYIRH 294 R++ +R P++ AV F L+DVS SMD+ + +AK F+ + R + +E+V+I H Sbjct: 195 RFRQLARRRRPTTNAVAFFLLDVSSSMDEHCRRLAKTFFFWALQGVRRQFSTIEIVFIAH 254 Query: 295 HTQAKEVDEHEFFYSQETGGTIVSSALKLMDEVVKERYNPAQWNIYAAQASDGDNWADDS 354 +A E +E FF GGT S+A+ ++++ERY+PA +N Y A+DG N+++D Sbjct: 255 TVEAWEFEEENFFRIHGQGGTKSSTAVHKAQQILEERYDPAMYNCYLFYATDGHNFSEDR 314 Query: 355 PLCHEILAKKLLPVVRYYSYIEITRRAHQTL-------WREYEHLQSTFDNFAMQHIRDQ 407 E L +L P++ + Y E++ + H+ L WR ++++ + Sbjct: 315 RRATEALL-RLAPLMNFLGYAEVSHQNHRRLDTEVAGIWRGLGAEGWPVGSYSLTR---E 370 Query: 408 DDIYPVFRELFHKQNATAKG 427 DI+ + F Q A A+ Sbjct: 371 ADIWLAIKAFFTDQAAEAEA 390 >UniRef50_Q67Q87 Putative uncharacterized protein n=1 Tax=Symbiobacterium thermophilum RepID=Q67Q87_SYMTH Length = 395 Score = 106 bits (265), Expect = 1e-21, Method: Compositional matrix adjust. 
Identities = 67/207 (32%), Positives = 103/207 (49%), Gaps = 17/207 (8%) Query: 233 DLRYKNYEKRPDPSSQAVMFCLMDVSGSMDQSTKDMAKRFYILLYLFLSRTYKNVEVVYI 292 DLR++ +++ P + AV+ +MD SGSM K +A+ + FL Y+ V++ ++ Sbjct: 192 DLRFRTWDEAEIPGASAVLIIMMDTSGSMGTGEKYIARSLCHWMVRFLRTRYERVKLHFV 251 Query: 293 RHHTQAKEVDEHEFFYSQETGGTIVSSALKLMDEVVKERYNPAQWNIYAAQASDGDNWAD 352 H T+AKE+DE FF E+GGT SSA + +++ RY P + N+YA SDGDN Sbjct: 252 AHTTEAKEMDEESFFTRGESGGTRCSSAYEYALQLIDRRYPPDRHNLYAFHFSDGDNLIS 311 Query: 353 DSPLCHEILAKKLLPVVRYYSYIEITRRAHQ------------TLWREYEHLQSTFDNFA 400 D+P +L +KLL Y +I + TL+RE + F Sbjct: 312 DNPRAVALL-RKLLERCALVGYGQIETQPQYLSMPYYQPNTLLTLFRE----EIDHPRFV 366 Query: 401 MQHIRDQDDIYPVFRELFHKQNATAKG 427 IRD+ +IY R F + A +G Sbjct: 367 TALIRDRSEIYAALRAFFPRPGAGERG 393 >UniRef50_C1XT76 Uncharacterized conserved protein n=1 Tax=Meiothermus silvanus DSM 9946 RepID=C1XT76_9DEIN Length = 360 Score = 97.8 bits (242), Expect = 7e-19, Method: Compositional matrix adjust. Identities = 81/270 (30%), Positives = 122/270 (45%), Gaps = 43/270 (15%) Query: 116 VFQISKDEYLDLLFEDLALPNLKQNQQRQLTEYKTHRAGYTANGVPANISVVRSLQNSL- 174 V ++ +E+L+L+ E L LP L+ Q G P ++ R SL Sbjct: 91 VAEMDLEEFLELIGEALKLPRLEPKQ-----------GGAVEESSPKYTTLSRRGPESLR 139 Query: 175 -ARRTAMTAGKRRELHALEENLAIISNSEPAQLLEEERLRKEIAELRAKIERVPFIDTFD 233 ARRT A +R A++ + R E L VP D D Sbjct: 140 HARRTLRQALRR----AIQSGI----------------YRPEDPRL------VPERD--D 171 Query: 234 LRYKNYEKRPDPSSQAVMFCLMDVSGSMDQSTKDMAKRFYILLYLFLSRTYKNVEVVYIR 293 RY+ E +P P +QA + +DVSGSM+ + + + ++ + + + Y+ Sbjct: 172 YRYRAPEPKPRPQAQAALVFALDVSGSMEGEQLRLVRILSYWITAWVKKHFPRLSRHYLL 231 Query: 294 HHTQAKEVDEHEFFYSQETGGTIVSSALKLMDEVVKERYNPAQWNIYAAQASDGDNWADD 353 H +A EV E +FF +E GGT +SS +KL +V+ ERY +N Y +DGDNW DD Sbjct: 232 HDAEAWEVSEEDFFRLREGGGTRLSSGIKLAQQVL-ERYPAQLYNRYVYHFTDGDNWQDD 290 Query: 354 SPLCHEILAKKLLPVVRYYSYIEITRRAHQ 383 + E L K LLP + Y Y ++ R Q Sbjct: 291 TAEALETL-KALLPTLSLYGYAQVRSRYGQ 319 >UniRef50_Q72KI5 Glycosyltransferase n=3 Tax=Thermus RepID=Q72KI5_THET2 Length = 
351 Score = 94.4 bits (233), Expect = 7e-18, Method: Compositional matrix adjust. Identities = 101/399 (25%), Positives = 161/399 (40%), Gaps = 64/399 (16%) Query: 21 RFLRRYKAQIKQSISEAINKRSVTDVDSGESVSIPTEDISEPMFHQGRGGLRHRVHPGND 80 RF + ++K+ + E + + + G VSIP + P VH Sbjct: 14 RFKEIVRGEVKKRVREFLTREELFGQVEGRLVSIPLPQLEIPKI----------VHGEPL 63 Query: 81 HFVQNDRIERPQGGGGGSGSGQGQASQDGEGQDEFVFQISKDEYLDLLFEDLALPNLKQN 140 G G V ++ +E+LDL+ E L LP L+ Sbjct: 64 GEGLGLGGP--------------GEEALGPGGHIPVAELELEEFLDLVGEALRLPRLRPK 109 Query: 141 QQRQLTEYKTHRAGYTANGVPANISVVRSLQNSLARRTAMTAGKRRELHALEENLAIISN 200 + ++TE G V R+L+ SL R A+ +G+ R E+ ++ Sbjct: 110 GEGEVTEEALRHTTIARKGPRGLRHVRRTLKESLKR--ALQSGEYR-----PEDPLLVPE 162 Query: 201 SEPAQLLEEERLRKEIAELRAKIERVPFIDTFDLRYKNYEKRPDPSSQAVMFCLMDVSGS 260 E DLRYK ++P P +QAV+ +DVSGS Sbjct: 163 RE------------------------------DLRYKAPRRKPIPHAQAVVLFALDVSGS 192 Query: 261 MDQSTKDMAKRFYILLYLFLSRTYKNVEVVYIRHHTQAKEVDEHEFFYSQETGGTIVSSA 320 M + + K + L++ R + +E Y+ H +A EV E EFF ++E GGT +SSA Sbjct: 193 MREEELKLVKTLSFWITLWIKRHFPRLERRYLLHDAEAWEVPEEEFFKAREGGGTRISSA 252 Query: 321 LKLMDEVVKERYNPAQWNIYAAQASDGDNWADDSPLCHEILAKKLLPVVRYYSYIEITRR 380 L L +E++K Y A +N Y SDG+NW D+P ++LLP + Y Y ++ Sbjct: 253 LLLAEEILKA-YPEAFYNRYLFHFSDGENWQGDTP-LALEALRRLLPSLALYGYAQVEGP 310 Query: 381 AHQT-LWREYEHLQSTFDNFAMQHIRDQDDIYPVFRELF 418 Q E + A+ +R ++D+ R L Sbjct: 311 YGQGHFLEEVREALGGREGVALAAVRGREDLPVALRRLL 349 >UniRef50_Q9HRD6 UPF0229 protein VNG_0746C n=11 Tax=Halobacteriaceae RepID=Y746_HALSA Length = 442 Score = 84.0 bits (206), Expect = 1e-14, Method: Compositional matrix adjust. 
Identities = 55/181 (30%), Positives = 89/181 (49%), Gaps = 16/181 (8%) Query: 250 VMFCLMDVSGSMDQSTKDMAKRFYILLYLFLSRTYKNVEVVYIRHHTQAKEVDEHEFFYS 309 V+ + DVSGSM +S +++ +R + L +L+ Y N E VYI H A EVD +FF Sbjct: 266 VVVNIRDVSGSMRESKRELVERTFTPLDWYLTGKYDNAEFVYIAHDADAWEVDRTDFFGI 325 Query: 310 QETGGTIVSSALKLMDEVVKERYNPAQWNIYAAQASDGDNWADDS-----PLCHEILAKK 364 Q GGT +S+A +L + V+ E Y ++WN Y A DG+N DD+ PL ++I A Sbjct: 326 QSGGGTRISTAYELAENVLDE-YPFSEWNRYVFAAGDGENSHDDTEENVIPLMNDIDAN- 383 Query: 365 LLPVVRYYSYIEITRRAHQTLWREYEHLQSTF---DNFAMQHIRDQDDIYPVFRELFHKQ 421 ++Y+E ++ F DN A+ + + DD+ + + Sbjct: 384 ------LHAYVETQPTDGVQTGTHAGKVRDAFGDTDNVAVTTVTEPDDVMGAIETILSTE 437 Query: 422 N 422 + Sbjct: 438 D 438 >UniRef50_A0MN74 Conserved bacterial protein n=1 Tax=Thermus phage phiYS40 RepID=A0MN74_9CAUD Length = 340 Score = 76.6 bits (187), Expect = 2e-12, Method: Compositional matrix adjust. Identities = 46/130 (35%), Positives = 67/130 (51%), Gaps = 1/130 (0%) Query: 234 LRYKNYEKRPDPSSQAVMFCLMDVSGSMDQSTKDMAKRFYILLYLFLSRTYKNVEVVYIR 293 +RYK+ KR P A+++ D S S+D K K + F+ YKNV + Sbjct: 150 IRYKHLRKREVPIFDAIVYFARDYSASVDDKKKFKIKSTAFWINNFIKYNYKNVTTKFAV 209 Query: 294 HHTQAKEVDEHEFFYSQETGGTIVSSALKLMDEVVKERYNPAQWNIYAAQASDGDNWADD 353 H T+AK V E +FF E G T+ SS +L+ E + RY+ +N Y SDG+N DD Sbjct: 210 HDTKAKFVSEQDFFKLSEGGATLCSSVFELIYEDYR-RYSVDDYNFYLFYFSDGENLPDD 268 Query: 354 SPLCHEILAK 363 +P E++ K Sbjct: 269 NPKLRELVEK 278 >UniRef50_Q747C5 Conserved domain protein n=11 Tax=Deltaproteobacteria RepID=Q747C5_GEOSL Length = 447 Score = 68.9 bits (167), Expect = 3e-10, Method: Compositional matrix adjust. 
Identities = 70/270 (25%), Positives = 127/270 (47%), Gaps = 56/270 (20%) Query: 90 RPQ--GGGGGSGSGQGQASQDGEGQDEFVFQISKDEYLDLLFEDLALPNLKQNQQRQLTE 147 RPQ GG G +G G+G+ G + + + + +L E LPNLK+ + Sbjct: 147 RPQQEGGSGTAGHGEGE----GHELESTAYDLGR-----ILTERFDLPNLKEKGK----- 192 Query: 148 YKTHRAGYTANGVPANISVVRSLQNSLARRTAMTAGKRRELHALEENLAI-----ISNSE 202 K+ + Y+ + N R L ++ + RR LE N+A+ ++ + Sbjct: 193 -KSSLSHYSYDLTDRN----RGFGQILEKKQTL----RR---ILETNIALGTVADVAEID 240 Query: 203 PAQLLEEERLRKEIAELRAKIERVPFIDTFDLRYKNYEKRPDPSSQAVMFCLMDVSGSMD 262 P +L+ R +RV I + +L Y+ SQA++F + D SGSM+ Sbjct: 241 PTRLVISPR------------DRVYRILSRELEYE---------SQALVFFIRDYSGSME 279 Query: 263 QSTKDMAKRFYILLYLFLSRTY-KNVEVVYIRHHTQAKEVDEHEFFYS-QETGGTIVSSA 320 + ++L+Y +L + + VE +I H A+EV + +Y+ + GGT V++A Sbjct: 280 GKATEAVCSQHVLIYSWLLYQFARQVETRFILHDNDAREVPDFYTYYNLRVAGGTRVAAA 339 Query: 321 LKLMDEVVKERYNPAQWNIYAAQASDGDNW 350 ++++E+V++ +NIY +DGD+W Sbjct: 340 YRMVNEIVEKESLARDYNIYVFHGTDGDDW 369 >UniRef50_A8TM27 Putative Ser protein kinase n=1 Tax=alpha proteobacterium BAL199 RepID=A8TM27_9PROT Length = 318 Score = 65.5 bits (158), Expect = 4e-09, Method: Compositional matrix adjust. Identities = 31/66 (46%), Positives = 44/66 (66%), Gaps = 1/66 (1%) Query: 4 FIDRRLNGKNKSMVNRQRFLRRYKAQIKQSISEAINKRSVTDVDSGESVSIPTEDISEPM 63 IDRR N + KS+ NRQRFLRR K Q+ +++ +A +R + DV GE + IPT+ ++EP Sbjct: 230 IIDRRRNSQGKSLANRQRFLRRAKRQVTEAVRQASAERRIRDVADGEQIVIPTDGLNEPR 289 Query: 64 F-HQGR 68 F H R Sbjct: 290 FRHDAR 295 >UniRef50_D2EEA7 Putative uncharacterized protein n=1 Tax=Candidatus Parvarchaeum acidiphilum ARMAN-4 RepID=D2EEA7_9EURY Length = 373 Score = 52.8 bits (125), Expect = 2e-05, Method: Compositional matrix adjust. 
Identities = 42/160 (26%), Positives = 76/160 (47%), Gaps = 17/160 (10%) Query: 233 DLRYKNYEKRPDPSSQAVMFCLMDVSGSMDQSTKDMAKRFYILLYLFLSRTYKN-VEVVY 291 D+RY E++ P+ A + D+SGSM+ + + L+ +L Y++ V++ Y Sbjct: 174 DIRYNLIEEKLIPNLSATFVIMRDISGSMEMYG-EFSATIAGLIEFWLKEKYEHTVKIRY 232 Query: 292 IRHHTQAKEVD---EHEFFYSQETGGTIVSSALKLMDEVV-----------KERYNPAQW 337 + H +A E D +FF +GGT + A KL+ ++ KER + Sbjct: 233 VAHTDEAFEYDPRKREDFFKLSSSGGTAFNPAYKLVIDMTDGASYKSNSPYKERIDYQSE 292 Query: 338 NIYAAQASDGDNWADDSPLCHEILAKKLLPVVRYYSYIEI 377 +++ +DGDN+ + E L KKL P + Y+++ Sbjct: 293 DVFLLHITDGDNYNGEDEAVRETL-KKLFPRLTKVFYLQV 331 Searching..................................................done Results from round 2 Score E Sequences producing significant alignments: (bits) Value Sequences used in model and found again: UniRef50_Q1C7U5 UPF0229 protein YPA_1510 n=195 Tax=Proteobacteri... 530 e-149 UniRef50_Q1QPM0 UPF0229 protein Nham_0975 n=24 Tax=Proteobacteri... 457 e-127 UniRef50_B6B8L1 Putative uncharacterized protein n=2 Tax=Rhodoba... 457 e-127 UniRef50_P59348 UPF0229 protein bll6755 n=25 Tax=Alphaproteobact... 453 e-126 UniRef50_Q1QZ69 UPF0229 protein Csal_0882 n=164 Tax=Proteobacter... 451 e-125 UniRef50_A9I2A9 Putative uncharacterized protein n=6 Tax=Burkhol... 450 e-125 UniRef50_B7H9V9 UPF0229 protein BCB4264_A0587 n=93 Tax=Bacteria ... 393 e-108 UniRef50_B9DZE4 UPF0229 protein CKR_0568 n=26 Tax=Clostridiales ... 382 e-104 UniRef50_A7Z2R4 UPF0229 protein RBAM_009260 n=29 Tax=Firmicutes ... 381 e-104 UniRef50_Q3J885 Putative uncharacterized protein n=2 Tax=Nitroso... 373 e-102 UniRef50_B5EPS6 Putative uncharacterized protein n=2 Tax=Acidith... 370 e-101 UniRef50_B9L510 Sporulation protein YhbH n=1 Tax=Thermomicrobium... 361 3e-98 UniRef50_Q896G6 Conserved protein n=1 Tax=Clostridium tetani Rep... 355 2e-96 UniRef50_A9FJ88 Uncharacterized conserved protein involved in st... 346 1e-93 UniRef50_UPI00016C05A6 hypothetical protein Epulo_00085 n=1 Tax=... 
343 6e-93
UniRef50_C9RD74 Putative uncharacterized protein n=1 Tax=Ammonif... 325 2e-87
UniRef50_Q67Q87 Putative uncharacterized protein n=1 Tax=Symbiob... 297 4e-79
UniRef50_Q1GJ99 Putative uncharacterized protein n=1 Tax=Ruegeri... 296 8e-79
UniRef50_A4BNQ7 Putative uncharacterized protein n=1 Tax=Nitroco... 291 4e-77
UniRef50_C1XT76 Uncharacterized conserved protein n=1 Tax=Meioth... 268 4e-70
UniRef50_Q72KI5 Glycosyltransferase n=3 Tax=Thermus RepID=Q72KI5... 266 1e-69
UniRef50_Q9HRD6 UPF0229 protein VNG_0746C n=11 Tax=Halobacteriac... 230 9e-59
UniRef50_A0MN74 Conserved bacterial protein n=1 Tax=Thermus phag... 200 1e-49
UniRef50_Q747C5 Conserved domain protein n=11 Tax=Deltaproteobac... 165 2e-39
UniRef50_D2EEA7 Putative uncharacterized protein n=1 Tax=Candida... 149 1e-34
UniRef50_A9DKM0 Putative uncharacterized protein (Fragment) n=1 ... 143 1e-32
UniRef50_A8TM27 Putative Ser protein kinase n=1 Tax=alpha proteo... 86 2e-15
UniRef50_UPI0001913F8A hypothetical protein Salmonellaentericaen... 83 1e-14

Sequences not found previously or not previously below threshold:

UniRef50_A4BJU7 Putative uncharacterized protein n=1 Tax=Reineke... 49 4e-04
UniRef50_D2VY88 Predicted protein n=1 Tax=Naegleria gruberi RepI... 48 8e-04
UniRef50_B4DA43 von Willebrand factor type A n=1 Tax=Chthoniobac... 47 0.002
UniRef50_Q55874 Uncharacterized protein sll0103 n=16 Tax=Cyanoba... 46 0.002
UniRef50_C8SCW7 Putative uncharacterized protein n=1 Tax=Ferrogl... 46 0.004
UniRef50_UPI00006CDDCC von Willebrand factor type A domain conta... 46 0.004
UniRef50_O28828 Putative uncharacterized protein n=1 Tax=Archaeo... 45 0.005
UniRef50_D1WZ12 VWA containing CoxE family protein n=13 Tax=Stre... 45 0.006
UniRef50_A2SP98 Putative uncharacterized protein n=1 Tax=Methyli... 45 0.006
UniRef50_C7DHI4 von Willebrand factor type A n=1 Tax=Candidatus ... 44 0.014
UniRef50_D2RGP5 von Willebrand factor type A n=1 Tax=Archaeoglob...
43 0.018 UniRef50_Q6ABM1 Magnesium-chelatase 67 kDa subunit n=4 Tax=Actin... 42 0.045 UniRef50_D2S019 ATPase associated with various cellular activiti... 42 0.056 >UniRef50_Q1C7U5 UPF0229 protein YPA_1510 n=195 Tax=Proteobacteria RepID=Y1510_YERPA Length = 424 Score = 530 bits (1365), Expect = e-149, Method: Composition-based stats. Identities = 362/422 (85%), Positives = 393/422 (93%), Gaps = 1/422 (0%) Query: 1 MTWFIDRRLNGKNKSMVNRQRFLRRYKAQIKQSISEAINKRSVTDVDSGESVSIPTEDIS 60 M +FIDRRLNGKNKSMVNRQRFLRRYK+QIKQSI++AINKRSVTD++SGESVSIP +DI+ Sbjct: 1 MGYFIDRRLNGKNKSMVNRQRFLRRYKSQIKQSIADAINKRSVTDIESGESVSIPIDDIN 60 Query: 61 EPMFHQGRGGLRHRVHPGNDHFVQNDRIERPQGGGGGSGSGQGQASQDGEGQDEFVFQIS 120 EPMFHQG GGLRHRVHPGNDHF+ NDR++RPQGGGGG QG A +DGEG+DEFVFQIS Sbjct: 61 EPMFHQGNGGLRHRVHPGNDHFITNDRVDRPQGGGGGGSG-QGNAGKDGEGEDEFVFQIS 119 Query: 121 KDEYLDLLFEDLALPNLKQNQQRQLTEYKTHRAGYTANGVPANISVVRSLQNSLARRTAM 180 KDEYLDLLFEDLALPNLK+NQ +QL E+KTHRAGYT+NGVPANISVVRSLQNSLARRTAM Sbjct: 120 KDEYLDLLFEDLALPNLKRNQYKQLAEFKTHRAGYTSNGVPANISVVRSLQNSLARRTAM 179 Query: 181 TAGKRRELHALEENLAIISNSEPAQLLEEERLRKEIAELRAKIERVPFIDTFDLRYKNYE 240 TA KRREL LE L ++ NSEPAQLLEEERLRK I EL+ KI RVPFIDTFDLRYKNYE Sbjct: 180 TASKRRELRELEAALTVLENSEPAQLLEEERLRKAITELKQKIARVPFIDTFDLRYKNYE 239 Query: 241 KRPDPSSQAVMFCLMDVSGSMDQSTKDMAKRFYILLYLFLSRTYKNVEVVYIRHHTQAKE 300 +RP+PSSQAVMFCLMDVSGSMDQ+TKDMAKRFYILLYLFLSRTYKNV+VVYIRHHTQAKE Sbjct: 240 RRPEPSSQAVMFCLMDVSGSMDQATKDMAKRFYILLYLFLSRTYKNVDVVYIRHHTQAKE 299 Query: 301 VDEHEFFYSQETGGTIVSSALKLMDEVVKERYNPAQWNIYAAQASDGDNWADDSPLCHEI 360 VDE EFFYSQETGGTIVSSALKLMDEVV+ERYNPAQWNIYAAQASDGDNWADDSPLCHE+ Sbjct: 300 VDEQEFFYSQETGGTIVSSALKLMDEVVQERYNPAQWNIYAAQASDGDNWADDSPLCHEL 359 Query: 361 LAKKLLPVVRYYSYIEITRRAHQTLWREYEHLQSTFDNFAMQHIRDQDDIYPVFRELFHK 420 LAKK+LPVVRYYSYIEITRRAHQTLWREYE L+ FDNFA+QHIR+ +DIYPVFRELFHK Sbjct: 360 LAKKILPVVRYYSYIEITRRAHQTLWREYEDLEEKFDNFAIQHIREPEDIYPVFRELFHK 419 Query: 421 QN 422 Q Sbjct: 420 QT 421 >UniRef50_Q1QPM0 UPF0229 protein Nham_0975 n=24 Tax=Proteobacteria 
RepID=Y975_NITHX Length = 439 Score = 457 bits (1176), Expect = e-127, Method: Composition-based stats. Identities = 182/439 (41%), Positives = 274/439 (62%), Gaps = 18/439 (4%) Query: 1 MTWFIDRRLNGKNKSMVNRQRFLRRYKAQIKQSISEAINKRSVTDVDSGESVSIPTEDIS 60 M FIDRRLN K++S+ NRQRFLRR + ++K+SI + + ++D D ++VSIPT Sbjct: 1 MPIFIDRRLNPKDRSLGNRQRFLRRAREELKRSIRDRVRSGRISDADGEQAVSIPTRSTD 60 Query: 61 EPMFHQGR-GGLRHRVHPGNDHFVQNDRIERPQGGGGGSGSGQGQASQDGEGQDEFVFQI 119 EP F + G R V PGN HFV DR+ +P G G+ + S +D+F F + Sbjct: 61 EPRFEAAKDSGRREHVLPGNKHFVPGDRLRKPGHGAAGTPDPSMKDS-----EDDFRFVL 115 Query: 120 SKDEYLDLLFEDLALPNLKQNQQRQLTEYKTHRAGYTANGVPANISVVRSLQNSLARRTA 179 S++E LDL FEDL LP++ + +++ ++ RAG+ A G P NI+V R+++NS RR A Sbjct: 116 SREEVLDLFFEDLELPDMVKLSLKEILAFRPRRAGFAATGSPTNINVGRTMRNSYGRRIA 175 Query: 180 MTAGKRRELHALEENLAIISNSEPAQLLEEE--RLRKEIAELRAKIERVPFIDTFDLRYK 237 + KR E+ A+ + +A + + + + + L+ E+ L K + ++D D+R+ Sbjct: 176 LKRPKREEVDAIRQEIAELESGSQSPVARQRIAALQAEVERLERKRRLIAYVDPVDIRFN 235 Query: 238 NYEKRPDPSSQAVMFCLMDVSGSMDQSTKDMAKRFYILLYLFLSRTYKNVEVVYIRHHTQ 297 +E +P P+++AVMFCLMDVSGSM + KD+AKRF++LL+LFL Y E+V+I H + Sbjct: 236 RFEAQPIPNAKAVMFCLMDVSGSMGEREKDLAKRFFVLLHLFLKCRYDRTEIVFISHTHE 295 Query: 298 AKEVDEHEFFYSQETGGTIVSSALKLMDEVVKERYNPAQWNIYAAQASDGDNWADDSPLC 357 A+EV+E FFYS ++GGT+VS+AL+ M ++ ERY ++WNIYAAQASDGDN A DS C Sbjct: 296 AQEVNEETFFYSTQSGGTVVSTALEKMHRIIAERYPGSEWNIYAAQASDGDNAAADSHRC 355 Query: 358 HEILAKKLLPVVRYYSYIEI----------TRRAHQTLWREYEHLQSTFDNFAMQHIRDQ 407 +L ++++ + +YY+Y+EI T +LWR Y + + + NF M I D Sbjct: 356 ITLLDEEIMRLCQYYAYVEIIDERERHIFGTTENGTSLWRAYSSVNANWPNFQMTRIADA 415 Query: 408 DDIYPVFRELFHKQNATAK 426 DIYPVFR+LF +Q K Sbjct: 416 ADIYPVFRQLFTRQATAEK 434 >UniRef50_B6B8L1 Putative uncharacterized protein n=2 Tax=Rhodobacterales RepID=B6B8L1_9RHOB Length = 445 Score = 457 bits (1175), Expect = e-127, Method: Composition-based stats. 
Identities = 194/444 (43%), Positives = 280/444 (63%), Gaps = 21/444 (4%) Query: 1 MTWFIDRRLNGKNKSMVNRQRFLRRYKAQIKQSISEAINKRSVTD----VDSGESVSIPT 56 M FIDRR N K KS+ NRQRFLRR + IK+ + +++ +S+ D GE V+IP Sbjct: 1 MHHFIDRRANPKGKSLGNRQRFLRRARENIKERVDQSVRGKSIQSGSGVPDGGEKVTIPA 60 Query: 57 EDISEPMF-HQGRGGLRHRVHPGNDHFVQNDRIERPQGGGGGSGSGQGQASQDGEGQDEF 115 + EP F H +GGLR V PGN FV D I+RPQGG +G G +AS++G+G+DEF Sbjct: 61 RGLKEPRFFHSSKGGLRRHVLPGNKDFVVGDTIKRPQGG---TGQGGRKASEEGDGEDEF 117 Query: 116 VFQISKDEYLDLLFEDLALPNLKQNQQRQLTEYKTHRAGYTANGVPANISVVRSLQNSLA 175 F ++++EYL++LFE L LP+L + + T RAG T G P N+++VR+++NSL Sbjct: 118 SFTLTQEEYLEILFEGLELPDLVEKATVETETIGTRRAGLTTAGTPNNLNLVRTMRNSLG 177 Query: 176 RRTAMTAGKRRELHALEENLAIIS---NSEPAQLLEEERLRKEIAELRAKIERVPFIDTF 232 RR A+ + LEE +A + + P Q E LRK++ + K + V +ID Sbjct: 178 RRIALQRPTTKSQRDLEEQIAELEALDDRTPPQEDFLEALRKKLDGIIRKRKVVGYIDPL 237 Query: 233 DLRYKNYEKRPDPSSQAVMFCLMDVSGSMDQSTKDMAKRFYILLYLFLSRTYKNVEVVYI 292 DLRY + +S+AV+FCLMDVSGSM + KD+AKRF++LL+LFL R Y++ E+V++ Sbjct: 238 DLRYDTFVPEKIRNSRAVVFCLMDVSGSMQEREKDLAKRFFLLLHLFLERCYEHTELVFV 297 Query: 293 RHHTQAKEVDEHEFFYSQETGGTIVSSALKLMDEVVKERYNPAQWNIYAAQASDGDNWAD 352 RH A+EVDE FFY++ETGGTIVS+AL+ M E+++ERY P +WNIY AQASDG+N+ + Sbjct: 298 RHTHHAQEVDEETFFYARETGGTIVSTALEKMKEIIEERYPPDEWNIYGAQASDGENFGN 357 Query: 353 DSPLCHEILAKKLLPVVRYYSYIEI----------TRRAHQTLWREYEHLQSTFDNFAMQ 402 DS C ++L LLPV ++Y+Y+EI A + LW+ Y +++ +F MQ Sbjct: 358 DSARCKKLLLNDLLPVSQFYAYVEIVDEAAEMLLNNPEAGEDLWQNYREVKAQAQHFEMQ 417 Query: 403 HIRDQDDIYPVFRELFHKQNATAK 426 + IYP+FRE F + A+ Sbjct: 418 RVSQPGHIYPIFREFFLPKVKGAQ 441 >UniRef50_P59348 UPF0229 protein bll6755 n=25 Tax=Alphaproteobacteria RepID=Y6755_BRAJA Length = 427 Score = 453 bits (1165), Expect = e-126, Method: Composition-based stats. 
Identities = 186/431 (43%), Positives = 276/431 (64%), Gaps = 18/431 (4%) Query: 3 WFIDRRLNGKNKSMVNRQRFLRRYKAQIKQSISEAINKRSVTDVDSGESVSIPTEDISEP 62 IDRRLN KS+ NRQRFLRR K+ ++ ++ + +R + DV G V+IP + + EP Sbjct: 4 HIIDRRLNPGGKSLENRQRFLRRAKSLVQGAVKKTSQERDIKDVLEGGEVTIPLDGMHEP 63 Query: 63 MFHQGRGGLRHRVHPGNDHFVQNDRIERPQGGGGGSGSGQGQASQDGEGQDEFVFQISKD 122 F + GG R V PGN FV+ D ++R G GS + +G+ +D F F +S+D Sbjct: 64 RFRR-EGGTRDMVLPGNKKFVEGDYLQR-----SGQGSAKDSGPGEGDSEDAFRFVLSRD 117 Query: 123 EYLDLLFEDLALPNLKQNQQRQLTEYKTHRAGYTANGVPANISVVRSLQNSLARRTAMTA 182 E++DL +DL LP+L + + Q RAGYT +G PANISV R+++ +LARR A+ Sbjct: 118 EFVDLFLDDLELPDLAKRKIAQTESEGIQRAGYTTSGSPANISVSRTVKLALARRIALKR 177 Query: 183 GKRRELHALEENLAIISNSEPAQLLEEERLRKEIAELRAKIERVPFIDTFDLRYKNYEKR 242 ++ E+ LE +A ++ + E L E+ +L AK +R+PFID D+RY+ +E Sbjct: 178 PRKDEIEELEAAIAACTDED-----ERVVLLAELEKLMAKTKRIPFIDPLDIRYRRFETV 232 Query: 243 PDPSSQAVMFCLMDVSGSMDQSTKDMAKRFYILLYLFLSRTYKNVEVVYIRHHTQAKEVD 302 P P +QAVMFCLMDVSGSM + KD+AKRFY+LLY+FL R YK+VE+V+IRH +A+EVD Sbjct: 233 PKPVAQAVMFCLMDVSGSMSEHMKDLAKRFYMLLYVFLKRRYKHVEIVFIRHTDRAEEVD 292 Query: 303 EHEFFYSQETGGTIVSSALKLMDEVVKERYNPAQWNIYAAQASDGDNWADDSPLCHEILA 362 E FFY +GGT+VSSAL+ M ++V+ER+NP+ WNIYAAQASDGDN D L +L Sbjct: 293 EQTFFYGPASGGTLVSSALQAMHDIVRERFNPSDWNIYAAQASDGDNSYSDGELTGLLLT 352 Query: 363 KKLLPVVRYYSYIEITRRAH-------QTLWREYEHLQSTFDNFAMQHIRDQDDIYPVFR 415 K+LPV ++++Y+E+ +LW YE L+++ +M+ + ++ +I+PVF Sbjct: 353 DKILPVCQFFAYLEVGESGGSAFDLSDSSLWTLYERLRNSGAPLSMRKVSERSEIFPVFH 412 Query: 416 ELFHKQNATAK 426 +LF ++ + + Sbjct: 413 DLFQRRETSQE 423 >UniRef50_Q1QZ69 UPF0229 protein Csal_0882 n=164 Tax=Proteobacteria RepID=Y882_CHRSD Length = 426 Score = 451 bits (1159), Expect = e-125, Method: Composition-based stats. 
Identities = 236/427 (55%), Positives = 308/427 (72%), Gaps = 4/427 (0%) Query: 1 MTWFIDRRLNGKNKSMVNRQRFLRRYKAQIKQSISEAINKRSVTDVDSGESVSIPTEDIS 60 MT+FIDRR N KNKS VNRQRFL+RY++ IK+++ EA+N+RS+TD++ GE +SIP +DIS Sbjct: 1 MTYFIDRRANAKNKSAVNRQRFLQRYRSHIKRAVEEAVNRRSITDMERGEKISIPAKDIS 60 Query: 61 EPMFHQGRGGLRHRVHPGNDHFVQNDRIERPQGGGGGSGSGQGQASQDGEGQDEFVFQIS 120 EP+F G GG R V PGN FV+ DR+ RP G G G +G AS GEG DEF F +S Sbjct: 61 EPVFQHGPGGARTIVSPGNKEFVEGDRLRRPGGEGRGGSG-EGSASNQGEGMDEFAFSLS 119 Query: 121 KDEYLDLLFEDLALPNLKQNQQRQLTEYKTHRAGYTANGVPANISVVRSLQNSLARRTAM 180 ++E+LD +F+ LALP+L++ Q R L E + RAG T +GVP+ I++VRS++ + ARR M Sbjct: 120 REEFLDFVFDGLALPHLERKQLRDLDEVRPVRAGVTRDGVPSRINIVRSMREAQARRIGM 179 Query: 181 TAGKRRELHALEENLAIISNSEP--AQLLEEERLRKEIAELRAKIERVPFIDTFDLRYKN 238 A +R L EE L +P L+ EI L ++E VPFIDT+DLRY N Sbjct: 180 RAPIKRALREAEEALESEERKDPVLRNPARIGELKAEIERLEKRLEAVPFIDTYDLRYNN 239 Query: 239 YEKRPDPSSQAVMFCLMDVSGSMDQSTKDMAKRFYILLYLFLSRTYKNVEVVYIRHHTQA 298 +P PS++AVMFC+MDVSGSM Q KD+AKRF++LLYLFL R Y+ VE+V+IRHHT A Sbjct: 240 LIDQPQPSNKAVMFCVMDVSGSMTQGHKDIAKRFFLLLYLFLERNYEKVELVFIRHHTAA 299 Query: 299 KEVDEHEFFYSQETGGTIVSSALKLMDEVVKERYNPAQWNIYAAQASDGDNWADDSPLCH 358 KEVDE EFFYS+ETGGTIVSSAL L+DE++ +RY+PAQWN+Y AQASDGDNW DDS C Sbjct: 300 KEVDEEEFFYSRETGGTIVSSALTLVDEIIAKRYSPAQWNLYVAQASDGDNWDDDSLTCR 359 Query: 359 EILAKKLLPVVRYYSYIEITRRAHQTLWREYEHLQSTFD-NFAMQHIRDQDDIYPVFREL 417 ++L L+ ++YY+Y+EIT +HQ LW EYE +Q+ FAMQ I + DIYPVFR+L Sbjct: 360 DLLMTSLMAKLQYYTYVEITPHSHQALWEEYERVQAAHPSRFAMQQIVEPGDIYPVFRKL 419 Query: 418 FHKQNAT 424 F K+ A+ Sbjct: 420 FRKRVAS 426 >UniRef50_A9I2A9 Putative uncharacterized protein n=6 Tax=Burkholderiales RepID=A9I2A9_BORPD Length = 419 Score = 450 bits (1158), Expect = e-125, Method: Composition-based stats. 
Identities = 205/426 (48%), Positives = 281/426 (65%), Gaps = 10/426 (2%) Query: 1 MTWFIDRRLNGKNKSMVNRQRFLRRYKAQIKQSISEAINKRSVTDVDSGESVSIPTEDIS 60 M IDRRLNG+NKS VNR+RFLRRYK QI++++ + + +RS+ D+D G +++P DIS Sbjct: 1 MNSLIDRRLNGRNKSAVNRERFLRRYKDQIRRAVQDLVRERSIEDMDQGGEINLPARDIS 60 Query: 61 EPMFHQGRGGLRHRVHPGNDHFVQNDRIERPQGGGGGSGSGQGQASQDGEGQDEFVFQIS 120 EP F G+GG R VHPGN F + D RP G G GS G+ D+F F +S Sbjct: 61 EPHFRHGQGGDRELVHPGNREFAKGDTFPRPSGSDGEGGSEPGEGES----VDQFTFSLS 116 Query: 121 KDEYLDLLFEDLALPNLKQNQQRQLTEYKTHRAGYTANGVPANISVVRSLQNSLARRTAM 180 + E+L+L FEDL LP+L + Q +T+ K RAGYT G P+ +SV R+L+ SL+RR A+ Sbjct: 117 RAEFLNLFFEDLELPHLIRTQLGDVTQKKWQRAGYTTTGSPSLLSVSRTLKASLSRRVAL 176 Query: 181 TAGKRRELHALEENLAIISNSEPAQLLEEERLRKEIAELRAKIERVPFIDTFDLRYKNYE 240 R EL A + L + A E + LR+E+ + ++ R+PF+D DLRY+N Sbjct: 177 GVAARAELEAAQAKLDAAIAAG-APQAEIDALRQEVEDCANRLARLPFLDDLDLRYRNRV 235 Query: 241 KRPDPSSQAVMFCLMDVSGSMDQSTKDMAKRFYILLYLFLSRTYKNVEVVYIRHHTQAKE 300 P ++AVMFCLMDVSGSMD+ KD+AKRF+ LLYLFLSR Y++V+VV+IRH A+E Sbjct: 236 SVAMPMARAVMFCLMDVSGSMDEGKKDLAKRFFTLLYLFLSRKYEHVDVVFIRHTDNAEE 295 Query: 301 VDEHEFFYSQETGGTIVSSALKLMDEVVKERYNPAQWNIYAAQASDGDNWADDSPLCHEI 360 VDE FFY ++GGTIV SAL+LM E+V++RY P+ WN+YAAQASDGD++ D+ Sbjct: 296 VDEQTFFYDPKSGGTIVLSALELMHEIVQQRYPPSAWNVYAAQASDGDSFGADAGKSARF 355 Query: 361 LAKKLLPVVRYYSYIEITRR---AHQTLWREYEHLQSTFDNFAMQHIRDQDDIYPVFREL 417 LA+ LLP RY++YIE+ +LW EYE Q T +F M+ I ++ +IYPVF +L Sbjct: 356 LAENLLPATRYFAYIEVPDSQEARKSSLWAEYE--QETAPHFVMRRICERGEIYPVFHDL 413 Query: 418 FHKQNA 423 F K+ A Sbjct: 414 FKKETA 419 >UniRef50_B7H9V9 UPF0229 protein BCB4264_A0587 n=93 Tax=Bacteria RepID=Y587_BACC4 Length = 391 Score = 393 bits (1010), Expect = e-108, Method: Composition-based stats. 
Identities = 105/416 (25%), Positives = 182/416 (43%), Gaps = 47/416 (11%) Query: 14 KSMVNRQRFLRRYKAQIKQSISEAINKRSVTDVDSGESVSIPTEDISEPMFHQGRGGLRH 73 K ++QR + + IK ++ + + + S+ + + V IP + E +H Sbjct: 21 KGYDDQQRHQEKVQEAIKNNLPDLVTEESIVMSNGKDVVKIPIRSLDEYKIRYNYDKNKH 80 Query: 74 RVHPGNDHFVQNDRIERPQGGGGGS-GSGQGQASQDGEGQDEFVFQISKDEYLDLLFEDL 132 V GN D + R GG G G+GQ + D G+D + ++S E F +L Sbjct: 81 -VGQGNGDSKVGDVVARDGSGGQKQKGPGKGQGAGDAAGEDYYEAEVSILELEQAFFREL 139 Query: 133 ALPNLKQNQQRQLTEYKTHRAGYTANGVPANISVVRSLQNSLARRTAMTAGKRRELHALE 192 LPNLK+ + + G+ NI R++ ++ R + H + Sbjct: 140 ELPNLKRKEMDENRIEHVEFNDIRKTGLWGNIDKKRTMISAYKRNAMSG---KASFHPIH 196 Query: 193 ENLAIISNSEPAQLLEEERLRKEIAELRAKIERVPFIDTFDLRYKNYEKRPDPSSQAVMF 252 + DL+++ + + P S+AV+ Sbjct: 197 QE--------------------------------------DLKFRTWNEVLKPDSKAVVL 218 Query: 253 CLMDVSGSMDQSTKDMAKRFYILLYLFLSRTYKNVEVVYIRHHTQAKEVDEHEFFYSQET 312 +MD SGSM K MA+ F+ + FL Y+ V++ +I HHT+AK V E EFF E+ Sbjct: 219 AMMDTSGSMGIWEKYMARSFFFWMTRFLRTKYETVDIEFIAHHTEAKVVTEEEFFSKGES 278 Query: 313 GGTIVSSALKLMDEVVKERYNPAQWNIYAAQASDGDNWADDSPLCHEILAKKLLPVVRYY 372 GGTI SS K E++ +Y+P ++NIY SDGDN D+ C + L ++L+ + Sbjct: 279 GGTICSSVYKKALELIDNKYSPDRYNIYPFHFSDGDNLTSDNARCVK-LVEELMKKCNMF 337 Query: 373 SYIEITR-RAHQTLWREYEHLQSTFDNFAMQHIRDQDDIYPVFRELFHKQNATAKG 427 Y E+ + H TL Y++++ DNF ++ + D++ + F +++ Sbjct: 338 GYGEVNQYNRHSTLMSAYKNIKD--DNFRYYILKQKADVFHAMKSFFREESGEKMA 391 >UniRef50_B9DZE4 UPF0229 protein CKR_0568 n=26 Tax=Clostridiales RepID=Y568_CLOK1 Length = 403 Score = 382 bits (982), Expect = e-104, Method: Composition-based stats. 
Identities = 125/413 (30%), Positives = 209/413 (50%), Gaps = 25/413 (6%) Query: 13 NKSMVNRQRFLRRYKAQIKQSISEAINKRSVTDVDSGESVSIPTEDISEPMFHQGRGGLR 72 ++S+ +R+R + + IK ++++ I++ S+ + V IP + I E F G Sbjct: 14 DRSLEDRRRHRQLVEKSIKDNLADIISEESIIGQSKNKKVKIPIKGIKEYQFIYGDNSSG 73 Query: 73 HRVHPGNDHFVQNDRIERPQGGGGGSGSGQGQASQDGEGQDEFVFQISKDEYLDLLFEDL 132 + + DRI + G G+ Q + + EG+D + +++ ++ LD L EDL Sbjct: 74 VGSG--DGSQKKGDRIGKAIKDRDGKGN---QGAGNQEGEDMYEIEVTIEDVLDYLMEDL 128 Query: 133 ALPNLKQNQQRQL-TEYKTHRAGYTANGVPANISVVRSLQNSLARRTAMTAGKRRELHAL 191 LP + + + Q+ + ++GY G+ ++ R++ L R+ G +R L + Sbjct: 129 ELPLMDKKKFSQILSNNSPKKSGYQRKGINPRLAKKRTVVEKLKRQQ----GTKRALREI 184 Query: 192 EENLAIISNSEPAQLLEEERLRKEIAELRAKIERVPFIDTFDLRYKNYEKRPDPSSQAVM 251 L S+P L E K R PF DLRY +++P A + Sbjct: 185 HGEL----ESDPKNKLPENTTIKS---------RFPFKQD-DLRYFRVKRKPKLELNAAI 230 Query: 252 FCLMDVSGSMDQSTKDMAKRFYILLYLFLSRTYKNVEVVYIRHHTQAKEVDEHEFFYSQE 311 C+MD SGSMD + K +A+ F+ +LY F+ Y NVEV +I H T AK V E+EFF+ E Sbjct: 231 ICVMDTSGSMDSTRKFLARSFFFVLYRFIKMKYNNVEVKFISHSTSAKVVTENEFFHKVE 290 Query: 312 TGGTIVSSALKLMDEVVKERYNPAQWNIYAAQASDGDNWADDSPLCHEILAKKLLPVVRY 371 +GGT +SS LK EV++E YNPA WN+Y SDGDNW++D+ L + AK L V Sbjct: 291 SGGTYISSGLKKALEVIEENYNPAYWNVYTFYVSDGDNWSEDNSLALK-CAKDLCKVCNL 349 Query: 372 YSYIEITRRAHQTLWREYEHLQSTFDNFAMQHIRDQDDIYPVFRELFHKQNAT 424 +SY EI + + + + T +NF + I ++ D++ +++ +K+ Sbjct: 350 FSYAEIIPSPYGSSIKHIFQNKITDNNFTVVTIHEKQDLWKSLKKILNKELEE 402 >UniRef50_A7Z2R4 UPF0229 protein RBAM_009260 n=29 Tax=Firmicutes RepID=Y926_BACA2 Length = 394 Score = 381 bits (978), Expect = e-104, Method: Composition-based stats. 
Identities = 103/412 (25%), Positives = 181/412 (43%), Gaps = 47/412 (11%) Query: 14 KSMVNRQRFLRRYKAQIKQSISEAINKRSVTDVDSGESVSIPTEDISEPMFHQGRGGLRH 73 K ++QR ++ + IK ++ + + + S+ + + V IP + E +H Sbjct: 21 KGFDDQQRHQKKVQEAIKNNLPDLVTEESIIMSNGKDVVKIPIRSLDEYKIRYNYDKNKH 80 Query: 74 RVHPGNDHFVQNDRIERPQGGGGGSGSGQGQASQDGEGQDEFVFQISKDEYLDLLFEDLA 133 V G+ D + R G G+G+GQ + D G+D + ++S + + LF++L Sbjct: 81 -VGQGDGDSEVGDVVAR-DGADKKQGAGKGQGAGDQAGEDYYEAEVSLMDLEEALFQELE 138 Query: 134 LPNLKQNQQRQLTEYKTHRAGYTANGVPANISVVRSLQNSLARRTAMTAGKRRELHALEE 193 LPNL+Q ++ + G+ NI R++ ++ R Sbjct: 139 LPNLQQKERDNIVHTDIEFNDIRKTGLTGNIDKKRTMLSAYKRNAMTGKPS--------- 189 Query: 194 NLAIISNSEPAQLLEEERLRKEIAELRAKIERVPFIDTFDLRYKNYEKRPDPSSQAVMFC 253 I DL+YK + P S+AV+ Sbjct: 190 --------------------------------FYPIYPEDLKYKTWNDVTKPESKAVVLA 217 Query: 254 LMDVSGSMDQSTKDMAKRFYILLYLFLSRTYKNVEVVYIRHHTQAKEVDEHEFFYSQETG 313 +MD SGSM K MA+ F+ + FL Y+ V++ +I HHT+AK V E +FF E+G Sbjct: 218 MMDTSGSMGVWEKYMARSFFFWMTRFLRTKYETVDIEFIAHHTEAKVVSEEDFFSKGESG 277 Query: 314 GTIVSSALKLMDEVVKERYNPAQWNIYAAQASDGDNWADDSPLCHEILAKKLLPVVRYYS 373 GTI SS + E++ E+Y+PA++NIY SDGDN D+ C + L ++ + Sbjct: 278 GTICSSVYRKSLELIDEKYDPARYNIYPFHFSDGDNLTSDNARCVK-LVNDIMKKSNLFC 336 Query: 374 YIEITR-RAHQTLWREYEHLQSTFDNFAMQHIRDQDDIYPVFRELFHKQNAT 424 Y E+ + H TL Y++++ D F ++ + D++ + F + + Sbjct: 337 YGEVNQYNRHSTLMSAYKNVKD--DKFKYYILKQKSDVFQALKSFFKNEESG 386 >UniRef50_Q3J885 Putative uncharacterized protein n=2 Tax=Nitrosococcus oceani RepID=Q3J885_NITOC Length = 394 Score = 373 bits (957), Expect = e-102, Method: Composition-based stats. 
Identities = 122/419 (29%), Positives = 204/419 (48%), Gaps = 49/419 (11%) Query: 13 NKSMVNRQRFLRRYKAQIKQSISEAINKRSVTDVDSGESVSIPTEDISEPMFHQGRGGLR 72 ++S +R R ++ + I+ ++++ + + S+ + +P I E F G+ Sbjct: 17 DRSAKDRLRHRQKVRKAIRDNVADIVAEESIIGQSRDRIIKVPIRGIREYRFVYGQNTPG 76 Query: 73 HRVHPGNDHFVQNDRIERPQGGGGGSGSGQGQASQDGEGQDEFVFQISKDEYLDLLFEDL 132 G+ Q + PQG GG + D G D + +I+ +E ++++ EDL Sbjct: 77 VGTGQGDSEPGQTVG-QVPQGDGGPGH------AGDRPGMDYYETEITLEELIEIMLEDL 129 Query: 133 ALPNLKQNQQRQ-LTEYKTHRAGYTANGVPANISVVRSLQNSLARRTAMTAGKRRELHAL 191 LP++++ + R+ L+E + R G+ GV ++ R+ ++ + RR A Sbjct: 130 ELPDMERKRFREVLSERTSKRKGFRRVGVRVHMDKRRTAKSRIRRRLA------------ 177 Query: 192 EENLAIISNSEPAQLLEEERLRKEIAELRAKIERVPFIDTFDLRYKNYEKRPDPSSQAVM 251 + AE R PF D+RY + P S AV+ Sbjct: 178 ---------------------SDKDAEDNETKHRFPFHRD-DMRYHRLREDMRPQSNAVV 215 Query: 252 FCLMDVSGSMDQSTKDMAKRFYILLYLFLSRTYKNVEVVYIRHHTQAKEVDEHEFFYSQE 311 FC+MD SGSMD K +A+ F+ LLY F+ Y NV+VV+I HHT+A+EV E EFF+ E Sbjct: 216 FCIMDTSGSMDTLKKYLARSFFFLLYQFVRSRYVNVDVVFIAHHTKAREVTEEEFFHKGE 275 Query: 312 TGGTIVSSALKLMDEVVKERYNPAQWNIYAAQASDGDNWADDSPLCHEILAKKLLPVVRY 371 GGT +SS E+++ RY+P+ WNIYA SDGDN+ D+ + A+ L V Sbjct: 276 AGGTFISSGYSKALEIIQNRYHPSLWNIYAFHCSDGDNFDSDNAATLKA-AEVLCQVCNL 334 Query: 372 YSYIEITRR----AHQTLWREYEHLQSTFDNFAMQHIRDQDDIYPVFRELFHKQNATAK 426 + Y EI R T+ + ++ DNF I+ ++DI+P FR+L +++ ++K Sbjct: 335 FGYGEIKPRPSGFYEGTMLDLFRSVR--MDNFQSVLIQRKEDIWPSFRQLLSRESESSK 391 >UniRef50_B5EPS6 Putative uncharacterized protein n=2 Tax=Acidithiobacillus ferrooxidans RepID=B5EPS6_ACIF5 Length = 434 Score = 370 bits (949), Expect = e-101, Method: Composition-based stats. 
Identities = 170/437 (38%), Positives = 263/437 (60%), Gaps = 17/437 (3%) Query: 1 MTWFIDRRLNGKNKSMVNRQRFLRRYKAQIKQSISEAINKRSVTD-VDSGESVSIPTEDI 59 M+ IDRR +G +S N+ R RR +A++K ++ + S+ D ++ + VSIPT D+ Sbjct: 1 MSMIIDRRSSG-TRSTANQDRLQRRVRARLKVAVEKMARSGSIEDLANTDQPVSIPTRDL 59 Query: 60 SEPMFHQGRGGLR-HRVHPGNDHFVQNDRIERPQGGGGGSGSGQGQASQDGEGQDEFVFQ 118 EP F + RV PGN + + D I +P+GGG G G + DG G+DE Sbjct: 60 HEPSFRRDLSDTSWERVLPGNKEYQRGDEINKPEGGGSGKGR---AGAPDGLGEDEVAIV 116 Query: 119 ISKDEYLDLLFEDLALPNLKQNQQRQLTEYKTHRAGYTANGVPANISVVRSLQNSLARRT 178 +S DE+LDLLF+ LALPNL++ Q + + RAG+ +G P+ + V R+++ + ARR Sbjct: 117 LSADEFLDLLFDGLALPNLRKMAQGDIQADQWRRAGFIKDGSPSRMHVGRTMRAARARRL 176 Query: 179 AMTAGKRRELHALEENLAIISNSEPAQL-------LEEERLRK---EIAELRAKIERVPF 228 A+ AGKRREL L + ++ +L +E+ERL + +I L KI+ +PF Sbjct: 177 ALRAGKRRELQDLLDARNVLQEEIQGRLAQKQDVSVEQERLSELNHQIDALERKIKAIPF 236 Query: 229 IDTFDLRYKNYEKRPDPSSQAVMFCLMDVSGSMDQSTKDMAKRFYILLYLFLSRTYKNVE 288 ID DLR+ + +++P P + AVMFC+MDVSGSM + KD+AKRF++LLYLFL R Y+ V+ Sbjct: 237 IDEADLRFAHIDQQPHPITNAVMFCVMDVSGSMGEKEKDLAKRFFLLLYLFLHRHYQAVQ 296 Query: 289 VVYIRHHTQAKEVDEHEFFYSQETGGTIVSSALKLMDEVVKERYNPAQWNIYAAQASDGD 348 +V+I+HH+ A E E FF ++E GGT+VS A+ L +E++++R+ P +WN+Y AQ SDGD Sbjct: 297 MVFIKHHSTASECSEQAFFGAREGGGTLVSPAIILSEEIMRQRFPPDRWNVYLAQVSDGD 356 Query: 349 NWADDSPLCHEILAKKLLPVVRYYSYIEITRRAHQTLWREYEHLQSTFDNFAMQHIRDQD 408 N+ D+ + E L L + + Y+E+ R + L R Y+ + F +++ Sbjct: 357 NYFADNAVVEEHLLNLLPRLRNLF-YLEVNRDSESDLLRLYDAIAQDFPELVTARASERE 415 Query: 409 DIYPVFRELFHKQNATA 425 DIYP+FR LF + + Sbjct: 416 DIYPMFRTLFATEETPS 432 >UniRef50_B9L510 Sporulation protein YhbH n=1 Tax=Thermomicrobium roseum DSM 5159 RepID=B9L510_THERP Length = 389 Score = 361 bits (926), Expect = 3e-98, Method: Composition-based stats. 
Identities = 118/415 (28%), Positives = 187/415 (45%), Gaps = 51/415 (12%) Query: 14 KSMVNRQRFLRRYKAQIKQSISEAINKRSVTDVDSGESVSIPTEDISEPMFHQGRGGLRH 73 K +++ R + + K IK+++++ ++++S+ D V +P + E F R Sbjct: 19 KGAIDQARHMEKVKEAIKRNLADIVSEQSLITSDGKRVVRVPIRVLEEYRFRFDPDSGRQ 78 Query: 74 RVHPGNDHFVQNDRIERPQGGGGGSGSGQGQASQDGEGQDEFVFQISKDEYLDLLFEDLA 133 V G+ + GGG SG G + D G D + +++ +E +L+FEDL Sbjct: 79 -VGQGSGG---THVGDVVGRVGGGQRSGDGPQAGDQPGIDYYEAELTIEELSELIFEDLE 134 Query: 134 LPNLKQNQQRQLTEYKTHRAGYTANGVPANISVVRSLQNSLARRTAMTAGKRRELHALEE 193 LPNL++ + R+L +G AN+ R+L+ +L RR A Sbjct: 135 LPNLEEKRLRELESEAVRFTEIRRHGPFANLDKRRTLRENL-RRNAW------------- 180 Query: 194 NLAIISNSEPAQLLEEERLRKEIAELRAKIERVPFIDTFDLRYKNYEKRPDPSSQAVMFC 253 R R I + + DLR+K +E+ S AV+ Sbjct: 181 -----------------RGRARIGDFANE----------DLRFKTWERDVKRESNAVVIA 213 Query: 254 LMDVSGSMDQSTKDMAKRFYILLYLFLSRTYKNVEVVYIRHHTQAKEVDEHEFFYSQETG 313 +MDVSGSM K +++ FY + FL Y VE+ +I HH +A+EV E EFF E+G Sbjct: 214 MMDVSGSMGTFEKYVSRAFYYWMVRFLRTKYDRVEIRFIAHHAEAREVSEEEFFSRGESG 273 Query: 314 GTIVSSALKLMDEVVKERYNPAQWNIYAAQASDGDNWADDSPLCHEILAKKLLPVVRYYS 373 GT S+A +L ++++E Y P WNIY SDGDNW D+ C E LA++LL + Sbjct: 274 GTRASTAYELALQLIRESYPPDSWNIYPFHFSDGDNWPSDNERCRE-LAEELLRCANLFG 332 Query: 374 YIEITR---RAHQTLWREYEHLQSTFDNFAMQHIRDQDDIYPVFRELFHKQNATA 425 Y EI + TL + + S I ++ D+Y R F + Sbjct: 333 YGEIRQGRYTYQSTLMHTLQRIGS--PKLVTVTITEKADVYQALRRFFGPEVGQE 385 >UniRef50_Q896G6 Conserved protein n=1 Tax=Clostridium tetani RepID=Q896G6_CLOTE Length = 386 Score = 355 bits (911), Expect = 2e-96, Method: Composition-based stats. 
Identities = 109/416 (26%), Positives = 192/416 (46%), Gaps = 43/416 (10%) Query: 13 NKSMVNRQRFLRRYKAQIKQSISEAINKRSVTDVDSGESVSIPTEDISEPMFHQGRGGLR 72 N++ +R+R + IK ++ + + + ++ + +P + + E F + Sbjct: 13 NRAGEDRKRHRELVEKSIKDNLVDVLLQEDISIQKENIKIKVPIKGVKEYEFTYSQNRSF 72 Query: 73 HRVHPGNDHFVQNDRIERPQGGGGGSGSGQGQASQDGEGQDEFVFQISKDEYLDLLFEDL 132 V GN+ Q ++R S G G + + EG+D F +++ +E +F+DL Sbjct: 73 VVVGKGNEKKGQKIALKR------ASEQGGGAGAGEIEGEDIFETEVTIEEIFQSIFDDL 126 Query: 133 ALPNLKQNQQRQL-TEYKTHRAGYTANGVPANISVVRSLQNSLARRTAMTAGKRRELHAL 191 LPNLK+ + ++ + + G+ +G+ ++ R+ + R+ A R++ Sbjct: 127 ELPNLKKKKFNKILNDSFKRKKGFKKHGISPRLAKRRTAIEKVKRKQATQKVLGRDI--- 183 Query: 192 EENLAIISNSEPAQLLEEERLRKEIAELRAKIERVPFIDTFDLRYKNYEKRPDPSSQAVM 251 ER PF DLRY + + AV+ Sbjct: 184 -------------------------------AERFPFKKD-DLRYSRVKLNKNKEYNAVI 211 Query: 252 FCLMDVSGSMDQSTKDMAKRFYILLYLFLSRTYKNVEVVYIRHHTQAKEVDEHEFFYSQE 311 C+MD S SMDQ K MA+ F+ ++Y F+ Y+ V++ +I H T AKEV E EFF+ E Sbjct: 212 ICIMDTSASMDQMKKYMARSFFFMIYKFIKMKYEEVDICFISHSTTAKEVTEEEFFHKVE 271 Query: 312 TGGTIVSSALKLMDEVVKERYNPAQWNIYAAQASDGDNWADDSPLCHEILAKKLLPVVRY 371 +GGT +SS K E++ RYNP +NIY ASDGDNW +D+ + +AK+L V Sbjct: 272 SGGTYISSGYKKALEIINTRYNPQIYNIYTFHASDGDNWNEDNDRAVK-VAKELSNVCNL 330 Query: 372 YSYIEITRRAHQTLWREYEHLQSTFDNFAMQHIRDQDDIYPVFRELFHKQNATAKG 427 + YIEI + R + +NF I ++D++ +++ ++ +G Sbjct: 331 FGYIEIMGYGYSNGIRNKYLKEIEKENFIPLIIEKKEDLWRALKDILKQEMREERG 386 >UniRef50_A9FJ88 Uncharacterized conserved protein involved in stress response n=21 Tax=Bacteria RepID=A9FJ88_SORC5 Length = 405 Score = 346 bits (887), Expect = 1e-93, Method: Composition-based stats. 
Identities = 101/405 (24%), Positives = 182/405 (44%), Gaps = 47/405 (11%) Query: 18 NRQRFLRRYKAQIKQSISEAINKRSVTDVDSGESVSIPTEDISEPMFHQGRGGLRHRVHP 77 + RF + + +I++++ + I++ + + VSIP I P F G R V Sbjct: 44 DHGRFRQIVRGRIRENLRKYISQGELIGRKGKDLVSIPIPQIDIPRFRFG-DKQRGGVGQ 102 Query: 78 GNDHFVQNDRIERPQGGGGGSGSGQGQASQDGEGQDEFVFQISKDEYLDLLFEDLALPNL 137 G+ + P GG GQGQ + GEG ++ +E +L E+L LP++ Sbjct: 103 GDGNPGD------PVGGSDDKQPGQGQ-AGSGEGDHLLEVDVTLEELAGILGEELELPDI 155 Query: 138 KQNQQRQLTEYKTHRAGYTANGVPANISVVRSLQNSLARRTAMTAGKRRELHALEENLAI 197 + + +++ +G G + R+ + +L R Sbjct: 156 QDKGKSKISNAHDRYSGIRRVGPESLRHFKRTYREALKR--------------------- 194 Query: 198 ISNSEPAQLLEEERLRKEIAELRAKIERVPFIDTFDLRYKNYEKRPDPSSQAVMFCLMDV 257 ++ R + + D RY++++ +P + AV+ +MDV Sbjct: 195 --------MISSGTFRPSAPVVVPVPD--------DKRYRSWKTITEPVANAVIIYMMDV 238 Query: 258 SGSMDQSTKDMAKRFYILLYLFLSRTYKNVEVVYIRHHTQAKEVDEHEFFYSQETGGTIV 317 SGSM K++ + + +L+R YK +E +I H A+EVD FF+++E+GGT++ Sbjct: 239 SGSMGDEQKEIVRIESFWIDAWLTRQYKGLESRFIIHDAIAREVDRDTFFHTRESGGTMI 298 Query: 318 SSALKLMDEVVKERYNPAQWNIYAAQASDGDNWA-DDSPLCHEILAKKLLPVVRYYSYIE 376 SSA KL +++ Y P +WNIY SDGDNW+ DD+ C ++L ++LP V ++Y + Sbjct: 299 SSAYKLCSQIIDNDYPPDEWNIYPFHFSDGDNWSMDDTLSCVDVLKTQILPRVNMFAYGQ 358 Query: 377 ITRRAHQ-TLWREYEHLQSTFDNFAMQHIRDQDDIYPVFRELFHK 420 + ++ + S D + IRD+D I ++ K Sbjct: 359 VESPYGSGQFIKDLKEHFSQDDRVVVSEIRDKDAIVGSIKDFLGK 403 >UniRef50_UPI00016C05A6 hypothetical protein Epulo_00085 n=1 Tax=Epulopiscium sp. 'N.t. morphotype B' RepID=UPI00016C05A6 Length = 368 Score = 343 bits (880), Expect = 6e-93, Method: Composition-based stats. 
Identities = 119/401 (29%), Positives = 187/401 (46%), Gaps = 63/401 (15%) Query: 18 NRQRFLRRYKAQIKQSISEAINKRSVTDVDSGESVSIPTEDISEPMFHQGRGGLRHRVHP 77 + +R + + I+++I I S+T+ +G V + +++ E F G V Sbjct: 18 DAKRHRKLVEKSIRENIDMLIVGESITETAAGNIVKVRIQELPEYRFKFGSS--TEYVAI 75 Query: 78 GNDHFVQNDRIERPQGGGGGSGSGQGQASQDGEGQDEFVFQISKDEYLDLLFEDLALPNL 137 G+ V N++ + + +AS + G D + +I D+ L LLFE L LPNL Sbjct: 76 GDGDEVVNEKCDF-----------EMEASNEA-GLDIYESEIVLDDALALLFEQLELPNL 123 Query: 138 KQNQQRQLTEYKTH-RAGYTANGVPANISVVRSLQNSLARRTAMTAGKRRELHALEENLA 196 + + + L + T R+G G+ + R+LQ + R Sbjct: 124 YEKKFKNLEYFSTQKRSGIKKTGIYPRFAKKRTLQEKIIRN------------------- 164 Query: 197 IISNSEPAQLLEEERLRKEIAELRAKIERVPFIDTFDLRYKNYEKRPDPSSQAVMFCLMD 256 + FI+ D+RY++ K+ S AV+ C+MD Sbjct: 165 ---------------------------KNGRFINQ-DIRYQSLAKKQINHSNAVIVCIMD 196 Query: 257 VSGSMDQSTKDMAKRFYILLYLFLSRTYKNVEVVYIRHHTQAKEVDEHEFFYSQETGGTI 316 SGSM + KDMAK FY LLY F+ Y VE+++I H T AKEV E++FF+ E+GGT Sbjct: 197 TSGSMGTTKKDMAKSFYFLLYQFIKIRYAKVEMIFIAHSTIAKEVTENDFFHKGESGGTY 256 Query: 317 VSSALKLMDEVVKERYNPAQWNIYAAQASDGDNWADDSPLCHEILAKKLLPVVRYYSYIE 376 +SS E++KERY+P WN+Y SDGDNW DD+ L LA +L + YIE Sbjct: 257 ISSGYTKALEIIKERYDPRLWNVYTFHCSDGDNWTDDNNLAVS-LANELCSCSNLFGYIE 315 Query: 377 ITRRAHQTLWREYEHLQSTFDNFAMQHIRDQDDIYPVFREL 417 I + ++ + T +NF I + DI+ VF+++ Sbjct: 316 IKTNNYSSVILNEYNAHITSNNFLALKIFKKSDIFEVFKKV 356 >UniRef50_C9RD74 Putative uncharacterized protein n=1 Tax=Ammonifex degensii KC4 RepID=C9RD74_AMMDK Length = 371 Score = 325 bits (834), Expect = 2e-87, Method: Composition-based stats. 
Identities = 105/407 (25%), Positives = 168/407 (41%), Gaps = 51/407 (12%) Query: 14 KSMVNRQRFLRRYKAQIKQSISEAINKRSVTDVDSGESVSIPTEDISEPMFHQGRGGLRH 73 K + +R + K I+Q + E I + S+ D V IP + E F Sbjct: 15 KGEEDARRHQEKLKEIIRQRLPELITEESLILADDRRKVRIPLRLVEEFRFRFA-SHQEM 73 Query: 74 RVHPGNDHFVQNDRIERPQGGGGGSGSGQGQASQDGEGQDEFVFQISKDEYLDLLFEDLA 133 V ++ I P G GG + G D + ++S +E +++FE+LA Sbjct: 74 LVGQAGSQPGTDETIVFPGIGRGGGAGTE-------PGIDYYEAEVSVEEIAEVVFEELA 126 Query: 134 LPNLKQNQQRQLTEYKTHRAGYTANGVPANISVVRSLQNSLARRTAMTAGKRRELHALEE 193 LP+ K A G+ A + R+L N+L R G++ E Sbjct: 127 LPHYKPKNTAN-RGIAEEWADLRRQGIRACLDRRRTLLNALKRHA--KEGRKGEFR---- 179 Query: 194 NLAIISNSEPAQLLEEERLRKEIAELRAKIERVPFIDTFDLRYKNYEKRPDPSSQAVMFC 253 + DLR++ + P ++AV+ Sbjct: 180 -----------------------------------LCPSDLRFRVWRSIESPEARAVVLA 204 Query: 254 LMDVSGSMDQSTKDMAKRFYILLYLFLSRTYKNVEVVYIRHHTQAKEVDEHEFFYSQETG 313 ++D SGSM K +A+ F+ + FL Y NVEVVY+ HHT+A+E EFF E+G Sbjct: 205 MLDTSGSMGPLEKYLARSFFFWMVRFLEANYANVEVVYLAHHTEARETTASEFFRKGESG 264 Query: 314 GTIVSSALKLMDEVVKERYNPAQWNIYAAQASDGDNWADDSPLCHEILAKKLLPVVRYYS 373 GT SS +L ++++ RY P ++NIYA SDGDN D+ C E++ +LL V Sbjct: 265 GTRCSSVYELALDIIETRYPPTEYNIYAFHFSDGDNLPADNERCMELIG-RLLEVANLVG 323 Query: 374 YIEITRRAHQTLWREYEHLQSTFDNFAMQHIRDQDDIYPVFRELFHK 420 Y EI T + + + +R++ D+Y + F + Sbjct: 324 YGEIEGPYFYTSTLKTVYQSIAHPRLVVVTLRERKDVYRALKAFFAR 370 >UniRef50_Q67Q87 Putative uncharacterized protein n=1 Tax=Symbiobacterium thermophilum RepID=Q67Q87_SYMTH Length = 395 Score = 297 bits (761), Expect = 4e-79, Method: Composition-based stats. 
Identities = 97/422 (22%), Positives = 170/422 (40%), Gaps = 54/422 (12%) Query: 14 KSMVNRQRFLRRYKAQIKQSISEAINKRSVTDVDSGESVSIPTEDISEPMFHQGRGGLRH 73 + ++++R R + I+ ++++ ++ ++ D + V +P + E F + Sbjct: 18 QGQMDQERHQARIREAIRANLADIVSDEAIIASDGRKVVRLPIRVLREYRFRLDWQK-QP 76 Query: 74 RVHPGNDHFVQNDRIERPQGGGGGSGSGQGQASQDGEGQDEFVFQISKDEYLDLLFEDLA 133 RV + + + RP +G +D F + +E +LLF +L+ Sbjct: 77 RVGEADGPVRPGEPVGRPGRAAEAAGGSGAGDEAG---EDWFETDVPLEELEELLFAELS 133 Query: 134 LPNLKQNQQRQLTEYKTHRAGYTANGVPANISVVRSLQNSLARRTAMTAGKRRELHALEE 193 LP+L+ Q+ LT G+ ANI R+L ++ R + Sbjct: 134 LPHLEPKQEPHLTVLHHEWRDVRRQGLYANIDKKRTLLEAMKRNRLAGRPPLAGIRRE-- 191 Query: 194 NLAIISNSEPAQLLEEERLRKEIAELRAKIERVPFIDTFDLRYKNYEKRPDPSSQAVMFC 253 DLR++ +++ P + AV+ Sbjct: 192 ---------------------------------------DLRFRTWDEAEIPGASAVLII 212 Query: 254 LMDVSGSMDQSTKDMAKRFYILLYLFLSRTYKNVEVVYIRHHTQAKEVDEHEFFYSQETG 313 +MD SGSM K +A+ + FL Y+ V++ ++ H T+AKE+DE FF E+G Sbjct: 213 MMDTSGSMGTGEKYIARSLCHWMVRFLRTRYERVKLHFVAHTTEAKEMDEESFFTRGESG 272 Query: 314 GTIVSSALKLMDEVVKERYNPAQWNIYAAQASDGDNWADDSPLCHEILAKKLLPVVRYYS 373 GT SSA + +++ RY P + N+YA SDGDN D+P +L +KLL Sbjct: 273 GTRCSSAYEYALQLIDRRYPPDRHNLYAFHFSDGDNLISDNPRAVALL-RKLLERCALVG 331 Query: 374 YIEITRRAHQTLWREYE--------HLQSTFDNFAMQHIRDQDDIYPVFRELFHKQNATA 425 Y +I + Y+ + F IRD+ +IY R F + A Sbjct: 332 YGQIETQPQYLSMPYYQPNTLLTLFREEIDHPRFVTALIRDRSEIYAALRAFFPRPGAGE 391 Query: 426 KG 427 +G Sbjct: 392 RG 393 >UniRef50_Q1GJ99 Putative uncharacterized protein n=1 Tax=Ruegeria sp. TM1040 RepID=Q1GJ99_SILST Length = 300 Score = 296 bits (758), Expect = 8e-79, Method: Composition-based stats. 
Identities = 123/293 (41%), Positives = 182/293 (62%), Gaps = 13/293 (4%) Query: 139 QNQQRQLTEYKTHRAGYTANGVPANISVVRSLQNSLARRTAMTAG---KRRELHALEENL 195 + + T RAG T G P N+++VR+++NSL RR A+ +R+L A L Sbjct: 2 EKATVETETIGTRRAGLTTAGTPNNLNLVRTMRNSLGRRIALQRPSTQTQRDLEAQVAEL 61 Query: 196 AIISNSEPAQLLEEERLRKEIAELRAKIERVPFIDTFDLRYKNYEKRPDPSSQAVMFCLM 255 I P+Q L ++ ++ K V +ID DLRY + +S+AV+FCLM Sbjct: 62 EEIEARSPSQDELLAELVAKLDGIKRKRRVVGYIDPLDLRYDTFVPEKIRNSRAVVFCLM 121 Query: 256 DVSGSMDQSTKDMAKRFYILLYLFLSRTYKNVEVVYIRHHTQAKEVDEHEFFYSQETGGT 315 DVSGSM + KD+AKRF++LL+LFL+R Y++ E+V++RH A+EVDE FFY++ETGGT Sbjct: 122 DVSGSMQEREKDLAKRFFLLLHLFLTRGYEHTEIVFVRHTHYAQEVDEETFFYARETGGT 181 Query: 316 IVSSALKLMDEVVKERYNPAQWNIYAAQASDGDNWADDSPLCHEILAKKLLPVVRYYSYI 375 IVS+AL+ M E++ ERY P +WNIY AQASDG+N+ +DS C +IL ++LLP+ ++Y+Y+ Sbjct: 182 IVSTALEKMKEIIDERYPPDEWNIYGAQASDGENFGNDSVRCRKILTEQLLPMCQFYAYV 241 Query: 376 EITRRAHQT----------LWREYEHLQSTFDNFAMQHIRDQDDIYPVFRELF 418 EI + Q LW+ Y ++ +F MQ + + IYP+FRE F Sbjct: 242 EIVEESAQMLLDNTEAGEDLWQNYRQVKEACRHFEMQRVSEPGHIYPIFREFF 294 >UniRef50_A4BNQ7 Putative uncharacterized protein n=1 Tax=Nitrococcus mobilis Nb-231 RepID=A4BNQ7_9GAMM Length = 391 Score = 291 bits (744), Expect = 4e-77, Method: Composition-based stats. 
Identities = 95/421 (22%), Positives = 177/421 (42%), Gaps = 58/421 (13%) Query: 14 KSMVNRQRFLRRYKAQIKQSISEAINKRSVTDVDSGESVSIPTEDISEPMFHQGRGGLRH 73 + + R + + +++ + + + V +V +P + F +R Sbjct: 21 RGTRDWLRHNEKIREAVREQLPDLVAGSDVLSRPDNRTVKVPVRFMEHYRFRLRNPDVRT 80 Query: 74 RVHPGNDHFVQNDRIERPQGGGGGSGSGQGQASQDGEGQDEFVFQISKDEYLDLLFEDLA 133 G R +P + S +GEGQ F + D+ LD L+++L Sbjct: 81 GAGQGKAKPGDVLRPAQP-----ARPGQGKEGSGEGEGQITFALEFQIDDILDWLWDELE 135 Query: 134 LPNLKQNQQRQLTEYKTHRAGYTANGVPANISVVRSLQNSLARRTAMTAGKRRELHALEE 193 LP+LK ++ E R G+ G + + R+++ ++ RR+A Sbjct: 136 LPHLKPRLGTRIEEDAYIREGWDRRGARSRLDRRRTMKEAIKRRSAQG------------ 183 Query: 194 NLAIISNSEPAQLLEEERLRKEIAELRAKIERVPFIDTFDLRYKNYEKRPDPSSQAVMFC 253 E +P ++ DLR++ +R P++ AV F Sbjct: 184 -----------------------------PEAIPIVND-DLRFRQLARRRRPTTNAVAFF 213 Query: 254 LMDVSGSMDQSTKDMAKRFYILLYLFLSRTYKNVEVVYIRHHTQAKEVDEHEFFYSQETG 313 L+DVS SMD+ + +AK F+ + R + +E+V+I H +A E +E FF G Sbjct: 214 LLDVSSSMDEHCRRLAKTFFFWALQGVRRQFSTIEIVFIAHTVEAWEFEEENFFRIHGQG 273 Query: 314 GTIVSSALKLMDEVVKERYNPAQWNIYAAQASDGDNWADDSPLCHEILAKKLLPVVRYYS 373 GT S+A+ ++++ERY+PA +N Y A+DG N+++D E L +L P++ + Sbjct: 274 GTKSSTAVHKAQQILEERYDPAMYNCYLFYATDGHNFSEDRRRATEALL-RLAPLMNFLG 332 Query: 374 YIEITRRAHQTL-------WREYEHLQSTFDNFAMQHIRDQDDIYPVFRELFHKQNATAK 426 Y E++ + H+ L WR ++++ + DI+ + F Q A A+ Sbjct: 333 YAEVSHQNHRRLDTEVAGIWRGLGAEGWPVGSYSLTR---EADIWLAIKAFFTDQAAEAE 389 Query: 427 G 427 Sbjct: 390 A 390 >UniRef50_C1XT76 Uncharacterized conserved protein n=1 Tax=Meiothermus silvanus DSM 9946 RepID=C1XT76_9DEIN Length = 360 Score = 268 bits (684), Expect = 4e-70, Method: Composition-based stats. 
Identities = 91/403 (22%), Positives = 152/403 (37%), Gaps = 54/403 (13%) Query: 18 NRQRFLRRYKAQIKQSISEAINKRSVTDVDSGESVSIPTEDISEPMFHQGRGGLRHRVHP 77 + QRF + ++K+ E + + G+ VSIP + P G + Sbjct: 7 DLQRFKEIVRGEVKKRAREFLTREEYLGSLDGQVVSIPLPQLELPRLQYGHNEMGQG--- 63 Query: 78 GNDHFVQNDRIERPQGGGGGSGSGQGQASQDGEGQDEFVFQISKDEYLDLLFEDLALPNL 137 E G G G G V ++ +E+L+L+ E L LP L Sbjct: 64 -----------EGEGEGQGQGMGGTAGRGGLGPSGHVPVAEMDLEEFLELIGEALKLPRL 112 Query: 138 KQNQQRQLTEYKTHRAGYTANGVPANISVVRSLQNSLARRTAMTAGKRRELHALEENLAI 197 + Q + E + G + R+L+ +L R Sbjct: 113 EPKQGGAVEESSPKYTTLSRRGPESLRHARRTLRQALRRAI------------------- 153 Query: 198 ISNSEPAQLLEEERLRKEIAELRAKIERVPFIDTFDLRYKNYEKRPDPSSQAVMFCLMDV 257 + R E L + + D RY+ E +P P +QA + +DV Sbjct: 154 ----------QSGIYRPEDPRLVPERD--------DYRYRAPEPKPRPQAQAALVFALDV 195 Query: 258 SGSMDQSTKDMAKRFYILLYLFLSRTYKNVEVVYIRHHTQAKEVDEHEFFYSQETGGTIV 317 SGSM+ + + + ++ + + + Y+ H +A EV E +FF +E GGT + Sbjct: 196 SGSMEGEQLRLVRILSYWITAWVKKHFPRLSRHYLLHDAEAWEVSEEDFFRLREGGGTRL 255 Query: 318 SSALKLMDEVVKERYNPAQWNIYAAQASDGDNWADDSPLCHEILAKKLLPVVRYYSYIEI 377 SS +KL +V+ ERY +N Y +DGDNW DD+ E L K LLP + Y Y ++ Sbjct: 256 SSGIKLAQQVL-ERYPAQLYNRYVYHFTDGDNWQDDTAEALETL-KALLPTLSLYGYAQV 313 Query: 378 TRRAHQ-TLWREYEHLQSTFDNFAMQHIRDQDDIYPVFRELFH 419 R Q + + A + ++ + + L Sbjct: 314 RSRYGQGRFIDDLRSHFPSDPALATAELGGRESLPSALKRLLG 356 >UniRef50_Q72KI5 Glycosyltransferase n=3 Tax=Thermus RepID=Q72KI5_THET2 Length = 351 Score = 266 bits (679), Expect = 1e-69, Method: Composition-based stats. 
Identities = 104/403 (25%), Positives = 160/403 (39%), Gaps = 64/403 (15%) Query: 18 NRQRFLRRYKAQIKQSISEAINKRSVTDVDSGESVSIPTEDISEPMFHQGRGGLRHRVHP 77 + RF + ++K+ + E + + + G VSIP + P G Sbjct: 11 DLLRFKEIVRGEVKKRVREFLTREELFGQVEGRLVSIPLPQLEIPKIVHGEPLGEGL--- 67 Query: 78 GNDHFVQNDRIERPQGGGGGSGSGQGQASQDGEGQDEFVFQISKDEYLDLLFEDLALPNL 137 G G G G V ++ +E+LDL+ E L LP L Sbjct: 68 ---------------------GLGGPGEEALGPGGHIPVAELELEEFLDLVGEALRLPRL 106 Query: 138 KQNQQRQLTEYKTHRAGYTANGVPANISVVRSLQNSLARRTAMTAGKRRELHALEENLAI 197 + + ++TE G V R+L+ SL R Sbjct: 107 RPKGEGEVTEEALRHTTIARKGPRGLRHVRRTLKESLKR--------------------- 145 Query: 198 ISNSEPAQLLEEERLRKEIAELRAKIERVPFIDTFDLRYKNYEKRPDPSSQAVMFCLMDV 257 L+ R E L + E DLRYK ++P P +QAV+ +DV Sbjct: 146 --------ALQSGEYRPEDPLLVPERE--------DLRYKAPRRKPIPHAQAVVLFALDV 189 Query: 258 SGSMDQSTKDMAKRFYILLYLFLSRTYKNVEVVYIRHHTQAKEVDEHEFFYSQETGGTIV 317 SGSM + + K + L++ R + +E Y+ H +A EV E EFF ++E GGT + Sbjct: 190 SGSMREEELKLVKTLSFWITLWIKRHFPRLERRYLLHDAEAWEVPEEEFFKAREGGGTRI 249 Query: 318 SSALKLMDEVVKERYNPAQWNIYAAQASDGDNWADDSPLCHEILAKKLLPVVRYYSYIEI 377 SSAL L +E++K Y A +N Y SDG+NW D+PL E L + LP + Y Y ++ Sbjct: 250 SSALLLAEEILKA-YPEAFYNRYLFHFSDGENWQGDTPLALEALRRL-LPSLALYGYAQV 307 Query: 378 TRRAHQT-LWREYEHLQSTFDNFAMQHIRDQDDIYPVFRELFH 419 Q E + A+ +R ++D+ R L Sbjct: 308 EGPYGQGHFLEEVREALGGREGVALAAVRGREDLPVALRRLLG 350 >UniRef50_Q9HRD6 UPF0229 protein VNG_0746C n=11 Tax=Halobacteriaceae RepID=Y746_HALSA Length = 442 Score = 230 bits (586), Expect = 9e-59, Method: Composition-based stats. 
Identities = 93/449 (20%), Positives = 172/449 (38%), Gaps = 58/449 (12%) Query: 17 VNRQRFLRRYKAQIKQSISEAINKRSVTDVDSGESVSIPTEDISEPMFHQGRGGLRHRVH 76 +R+RF + + +Q ++E I + ++V +P + + P F + R V Sbjct: 5 EDRERFHEIGEQR-RQDLAEFIQYGDL-GGSGPDAVRVPIKLVDLPAFEYDQ-LDRGGVG 61 Query: 77 PGNDHFVQNDRIERPQGGGGGSGSGQGQASQDGEGQDEFVFQISKDEYLDLLFEDLALPN 136 G+ D++ P +G G + D E+ +++ +E+ L E L L + Sbjct: 62 QGDVD--PGDQVGEP----DEAGEGDDDEAGDESADHEY-YEMDPEEFAAELDERLGL-D 113 Query: 137 LKQNQQRQLTEYKTHRAGYTANGVPANISVVRSLQNSLARRTAM---------------- 180 L ++ + E + G + + L R+ AM Sbjct: 114 LDPKGKKVVAETEGAFNETARRGPRGTLDFAHLYKQGLKRKIAMDFDEAYVTAALRVDGW 173 Query: 181 ---TAGKRRELHALEENLAIISNSEPAQLLEEERLRKEIAELRAKIE------------- 224 + + A I+ + +++ R + A I+ Sbjct: 174 GVDAVYTWAREQHIPVSRAWIAERARSPSPDDDAGRVVDDAVWASIDAMEAAVDVEPTRT 233 Query: 225 --------RVPFIDTFDLRYKNYEKRPDPSSQAVMFCLMDVSGSMDQSTKDMAKRFYILL 276 RVP D R+++ + V+ + DVSGSM +S +++ +R + L Sbjct: 234 RIRRGGPGRVPLRRE-DERFRHPKVVEHRERNVVVVNIRDVSGSMRESKRELVERTFTPL 292 Query: 277 YLFLSRTYKNVEVVYIRHHTQAKEVDEHEFFYSQETGGTIVSSALKLMDEVVKERYNPAQ 336 +L+ Y N E VYI H A EVD +FF Q GGT +S+A +L + V+ E Y ++ Sbjct: 293 DWYLTGKYDNAEFVYIAHDADAWEVDRTDFFGIQSGGGTRISTAYELAENVLDE-YPFSE 351 Query: 337 WNIYAAQASDGDNWADDSPLCHEILAKKLLPVVRYYSYIEITRRAHQTLWREYEHLQSTF 396 WN Y A DG+N DD+ L + ++Y+E ++ F Sbjct: 352 WNRYVFAAGDGENSHDDTEENVIPLMNDI--DANLHAYVETQPTDGVQTGTHAGKVRDAF 409 Query: 397 ---DNFAMQHIRDQDDIYPVFRELFHKQN 422 DN A+ + + DD+ + ++ Sbjct: 410 GDTDNVAVTTVTEPDDVMGAIETILSTED 438 >UniRef50_A0MN74 Conserved bacterial protein n=1 Tax=Thermus phage phiYS40 RepID=A0MN74_9CAUD Length = 340 Score = 200 bits (508), Expect = 1e-49, Method: Composition-based stats. 
Identities = 86/366 (23%), Positives = 137/366 (37%), Gaps = 75/366 (20%) Query: 16 MVNRQRFLRRYKAQIKQSISEAINKRSVTDVDSGESVSIPTEDISEPMFHQGRGGLRHRV 75 ++ R+L+R + IK + + +N + + + V I + EP F Sbjct: 5 TIDEIRYLKRLENIIKARMQDIVNSNDIIESTPEDKVRIRIPIMDEPYFK---------- 54 Query: 76 HPGNDHFVQNDRIERPQGGGGGSGSGQGQASQDGEGQDEFVFQISKDEYLDLLFEDLALP 135 + G G GSGSG EG E +++ +E +LLFE L LP Sbjct: 55 -----------PVFPGSGAGAGSGSGSEPGEGSEEGDHEIEIELTVEELSELLFEYLGLP 103 Query: 136 NLKQNQQRQLTEYKTHRAGYTANGVPANISVVRSLQNSLARRTAMTAGKRRELHALEENL 195 +K + + + G + G + I RR Sbjct: 104 KIKPKG-SSVEKEEYLIEGISKTGPRSRIH----------RR------------------ 134 Query: 196 AIISNSEPAQLLEEERLRKEIAELRAKIERVPFIDTFDLRYKNYEKRPDPSSQAVMFCLM 255 + +I + + + +RYK+ KR P A+++ Sbjct: 135 ----------------------KTYYEIMKYGYKEDS-IRYKHLRKREVPIFDAIVYFAR 171 Query: 256 DVSGSMDQSTKDMAKRFYILLYLFLSRTYKNVEVVYIRHHTQAKEVDEHEFFYSQETGGT 315 D S S+D K K + F+ YKNV + H T+AK V E +FF E G T Sbjct: 172 DYSASVDDKKKFKIKSTAFWINNFIKYNYKNVTTKFAVHDTKAKFVSEQDFFKLSEGGAT 231 Query: 316 IVSSALKLMDEVVKERYNPAQWNIYAAQASDGDNWADDSPLCHEILAKKLLPVVRYYSYI 375 + SS +L+ E + RY+ +N Y SDG+N DD+P E L +KL +Y Sbjct: 232 LCSSVFELIYEDYR-RYSVDDYNFYLFYFSDGENLPDDNPKLRE-LVEKLSEDFNLIAYG 289 Query: 376 EITRRA 381 E+ Sbjct: 290 EVKSTD 295 >UniRef50_Q747C5 Conserved domain protein n=11 Tax=Deltaproteobacteria RepID=Q747C5_GEOSL Length = 447 Score = 165 bits (418), Expect = 2e-39, Method: Composition-based stats. 
Identities = 77/380 (20%), Positives = 152/380 (40%), Gaps = 53/380 (13%) Query: 23 LRRYKAQIKQSISEAINKRSVTDVDSGES---VSIPTEDISEPMFHQ------GRGGLRH 73 L R + + + + I + +G V +PT + E + H Sbjct: 72 LERDRLREEDGLPRKIRIGKLIKPGAGGKEKIVVVPT-TVEEKLIHDRAPEETEEDESMG 130 Query: 74 RVHPGNDHFVQNDRIERPQGGGGGSGSGQGQASQDGEGQDEFVFQISKDEYLDLLFEDLA 133 G++ + ++ RPQ GG +G G+ +G + + + + +L E Sbjct: 131 GTGDGDEGEIIGEQPVRPQQEGGSGTAGHGEG--EGHELESTAYDLGR-----ILTERFD 183 Query: 134 LPNLKQNQQRQLTEYKTHRAGYTANGVPANISVVRSLQNSLARRTAMTAGKRRELHALEE 193 LPNLK+ ++ + + Y+ + N R L ++ + E + Sbjct: 184 LPNLKEKGKK------SSLSHYSYDLTDRN----RGFGQILEKKQTLRRIL--ETNIALG 231 Query: 194 NLAIISNSEPAQLLEEERLRKEIAELRAKIERVPFIDTFDLRYKNYEKRPDPSSQAVMFC 253 +A ++ +P +L+ R +RV I + +L Y SQA++F Sbjct: 232 TVADVAEIDPTRLVISPR------------DRVYRILSRELEY---------ESQALVFF 270 Query: 254 LMDVSGSMDQSTKDMAKRFYILLYLFLSRTY-KNVEVVYIRHHTQAKEVDEHEFFY-SQE 311 + D SGSM+ + ++L+Y +L + + VE +I H A+EV + +Y + Sbjct: 271 IRDYSGSMEGKATEAVCSQHVLIYSWLLYQFARQVETRFILHDNDAREVPDFYTYYNLRV 330 Query: 312 TGGTIVSSALKLMDEVVKERYNPAQWNIYAAQASDGDNWADDSPLCHEILAKKLLPVVRY 371 GGT V++A ++++E+V++ +NIY +DGD+W + L +++L Sbjct: 331 AGGTRVAAAYRMVNEIVEKESLARDYNIYVFHGTDGDDWDTNGEETIPEL-RRMLAYANR 389 Query: 372 YSYIEITRRAHQTLWREYEH 391 + E E Sbjct: 390 IGVTIAEHTYGSSGNTEVER 409 >UniRef50_D2EEA7 Putative uncharacterized protein n=1 Tax=Candidatus Parvarchaeum acidiphilum ARMAN-4 RepID=D2EEA7_9EURY Length = 373 Score = 149 bits (377), Expect = 1e-34, Method: Composition-based stats. 
Identities = 45/195 (23%), Positives = 86/195 (44%), Gaps = 22/195 (11%) Query: 231 TFDLRYKNYEKRPDPSSQAVMFCLMDVSGSMDQSTKDMAKRFYILLYLFLSRTYKN-VEV 289 D+RY E++ P+ A + D+SGSM+ + + L+ +L Y++ V++ Sbjct: 172 DEDIRYNLIEEKLIPNLSATFVIMRDISGSMEMYG-EFSATIAGLIEFWLKEKYEHTVKI 230 Query: 290 VYIRHHTQAKEVD---EHEFFYSQETGGTIVSSALKLMDEVVK-----------ERYNPA 335 Y+ H +A E D +FF +GGT + A KL+ ++ ER + Sbjct: 231 RYVAHTDEAFEYDPRKREDFFKLSSSGGTAFNPAYKLVIDMTDGASYKSNSPYKERIDYQ 290 Query: 336 QWNIYAAQASDGDNWADDSPLCHEILAKKLLPVVRYYSYIEITRRAHQTLWREYEHLQST 395 +++ +DGDN+ + E L KKL P + Y+++ + Y ++S Sbjct: 291 SEDVFLLHITDGDNYNGEDEAVRETL-KKLFPRLTKVFYLQVGGYSDSF----YNLIKSV 345 Query: 396 FDNFAMQHIRDQDDI 410 + ++ +DI Sbjct: 346 DPE-KLSEVKSGNDI 359 >UniRef50_A9DKM0 Putative uncharacterized protein (Fragment) n=1 Tax=Shewanella benthica KT99 RepID=A9DKM0_9GAMM Length = 167 Score = 143 bits (361), Expect = 1e-32, Method: Composition-based stats. Identities = 101/153 (66%), Positives = 124/153 (81%), Gaps = 1/153 (0%) Query: 1 MTWFIDRRLNGKNKSMVNRQRFLRRYKAQIKQSISEAINKRSVTDVDSGESVSIPTEDIS 60 M FIDRRLN + +S VNRQRF+ RYK QIK+++S+A+ +RSVTDVD GE +SIPT+DIS Sbjct: 16 MANFIDRRLNARGRSTVNRQRFINRYKQQIKKAVSDAVTRRSVTDVDKGERISIPTKDIS 75 Query: 61 EPMFHQGRGGLRHRVHPGNDHFVQNDRIERPQGGGGGSGSGQGQASQDGEGQDEFVFQIS 120 EP FHQG+GG+R RVHPGND F++ D+IER GGG GSGQG AS GEG D+FVFQIS Sbjct: 76 EPSFHQGQGGIRERVHPGNDQFIKGDKIER-PPGGGSQGSGQGDASNSGEGDDDFVFQIS 134 Query: 121 KDEYLDLLFEDLALPNLKQNQQRQLTEYKTHRA 153 KDEYL+LLFEDL LPNL+ N+ +L EY+ +RA Sbjct: 135 KDEYLELLFEDLELPNLQNNRLNKLVEYQVYRA 167 >UniRef50_A8TM27 Putative Ser protein kinase n=1 Tax=alpha proteobacterium BAL199 RepID=A8TM27_9PROT Length = 318 Score = 86.4 bits (212), Expect = 2e-15, Method: Composition-based stats. 
Identities = 32/73 (43%), Positives = 45/73 (61%) Query: 4 FIDRRLNGKNKSMVNRQRFLRRYKAQIKQSISEAINKRSVTDVDSGESVSIPTEDISEPM 63 IDRR N + KS+ NRQRFLRR K Q+ +++ +A +R + DV GE + IPT+ ++EP Sbjct: 230 IIDRRRNSQGKSLANRQRFLRRAKRQVTEAVRQASAERRIRDVADGEQIVIPTDGLNEPR 289 Query: 64 FHQGRGGLRHRVH 76 F LR H Sbjct: 290 FRHDARRLRLDRH 302 >UniRef50_UPI0001913F8A hypothetical protein Salmonellaentericaenterica_26029 n=2 Tax=Salmonella enterica subsp. enterica serovar Typhi RepID=UPI0001913F8A Length = 88 Score = 83.3 bits (204), Expect = 1e-14, Method: Composition-based stats. Identities = 78/88 (88%), Positives = 82/88 (93%) Query: 145 LTEYKTHRAGYTANGVPANISVVRSLQNSLARRTAMTAGKRRELHALEENLAIISNSEPA 204 LT KTHRAG+T+NGVPANISVVRSLQNSLARRTAMTAGKRRELHALE L IS+SEPA Sbjct: 1 LTSNKTHRAGFTSNGVPANISVVRSLQNSLARRTAMTAGKRRELHALETELETISHSEPA 60 Query: 205 QLLEEERLRKEIAELRAKIERVPFIDTF 232 QLLEEERLR+EIAELRAKIERVPFIDTF Sbjct: 61 QLLEEERLRREIAELRAKIERVPFIDTF 88 >UniRef50_A4BJU7 Putative uncharacterized protein n=1 Tax=Reinekea blandensis MED297 RepID=A4BJU7_9GAMM Length = 555 Score = 48.6 bits (114), Expect = 4e-04, Method: Composition-based stats. 
Identities = 45/247 (18%), Positives = 79/247 (31%), Gaps = 26/247 (10%) Query: 191 LEENLAIISNSEPAQLLEEERLRKEIAELRAKIERVPFIDTFDLRYKNYEKRPDPSSQAV 250 +EE + + PA ++ + + L+ + + P Sbjct: 140 VEEFINYFDYALPAPDTTNTPIQISTERTQTPWNPQTELVRVSLQSYRSDFKTLPPLN-- 197 Query: 251 MFCLMDVSGSMDQSTK-DMAKRFYILLYLFLSRTYKNVEVVYIRHHTQAKEVD------- 302 + L+DVSGSM+ K + +R + LL L + VY E Sbjct: 198 LVFLLDVSGSMNSPDKLPLMQRSFNLLVSQLRPQDRVAIAVYAGQSGVVLEPTSGDQKAQ 257 Query: 303 -EHEFFYSQETGGTIVSSALKLMDEVVKERYNPAQWNIYAAQASDGD-NWADDSPLCHEI 360 + GGT S+ + L ++ + Y P N +DGD N S + Sbjct: 258 INQAINQLRAGGGTHGSAGIHLAYDLAQANYLPDGINR-IFIGTDGDFNVGTTSLTELKA 316 Query: 361 LAKKLLPV---VRYYSYIEITRRAHQTLWREYEHLQSTFDNFAMQHIRDQDDIYPVFREL 417 L ++ + + T + L E + + + D Y R+L Sbjct: 317 LIERKREAGVFLSVLGFG--TGNYNDALMEELSNHGNGTAYYL--------DSYQEARKL 366 Query: 418 FHKQNAT 424 F Q A Sbjct: 367 FATQLAA 373 >UniRef50_D2VY88 Predicted protein n=1 Tax=Naegleria gruberi RepID=D2VY88_NAEGR Length = 1082 Score = 47.9 bits (112), Expect = 8e-04, Method: Composition-based stats. Identities = 34/172 (19%), Positives = 64/172 (37%), Gaps = 29/172 (16%) Query: 199 SNSEPAQLLEEERLRKEIAELRAKIERVPFIDTFDLRYKNYEKRPDPSSQAVMFCLMDVS 258 SN + Q+ E+ R E+ L + + + + P P+ + L DVS Sbjct: 82 SNQKVEQVEEKSLFRIEVEHLIDLDDHDIGVAELTIHVNDPSLNPKPT---LFIALADVS 138 Query: 259 GSMDQSTKDMAKRFYILLYLFLSRTYKNVEV--VYIRHHTQAKEVD------------EH 304 GSM + L F +++ N + + + + AKE+D E Sbjct: 139 GSMQGRPWEQVCTS---LKHFAQQSFNNPAIICRMVAYESSAKEIDMKGTLQSIIRNIET 195 Query: 305 EFFYSQETGGTIVSSALKLMDEVVKERYNPAQW-----NIYAAQASDGDNWA 351 F GGT +SA +L ++ + N+ +DG++++ Sbjct: 196 AF----TGGGTDFASAFQLACTIITRESGQDRENLPFGNVVITFLTDGEDFS 243 >UniRef50_B4DA43 von Willebrand factor type A n=1 Tax=Chthoniobacter flavus Ellin428 RepID=B4DA43_9BACT Length = 883 Score = 46.7 bits (109), Expect = 0.002, Method: Composition-based stats. 
Identities = 33/124 (26%), Positives = 51/124 (41%), Gaps = 14/124 (11%) Query: 234 LRYKNYEKRPDPSSQAVMFCLMDVSGSMDQSTKD--MAKRFYILLYLFLSRTYKNVEVV- 290 L+ + K P S + L+DVSGSM+ K + K F +L+ + V +V Sbjct: 408 LKGREIPKDERPPSN--LVFLIDVSGSMNMPNKLPLLQKCFSLLVEQLGPK--DRVSIVT 463 Query: 291 ------YIRHHTQAKEVDEHEFFYSQETGGTIVSSALKLMDEVVKERYNPAQWNIYAAQA 344 + TQ KE + GGT SS + L + ++ + P N A Sbjct: 464 YASGTKLVLEPTQDKEAMQTAIDGLHAGGGTHGSSGIDLAYRMAQQSFIPGGTNRVIL-A 522 Query: 345 SDGD 348 +DGD Sbjct: 523 TDGD 526 >UniRef50_Q55874 Uncharacterized protein sll0103 n=16 Tax=Cyanobacteria RepID=Y103_SYNY3 Length = 420 Score = 46.3 bits (108), Expect = 0.002, Method: Composition-based stats. Identities = 35/180 (19%), Positives = 66/180 (36%), Gaps = 25/180 (13%) Query: 254 LMDVSGSMDQSTKDMAKRFYILLYLFLSRTYKNVEVVYIRHHTQAKEVDEHEFFYSQET- 312 ++D SGSMD + K + L L + V I +AK V E++ + Sbjct: 47 VLDHSGSMDGQPLETVKSAALGLIDRLEED-DRLSV--IAFDHRAKIVIENQQVRNGAAI 103 Query: 313 ----------GGTIVSSALKLMDEVVKERYNPAQWNIYAAQASDGDNWADDSPLCHEI-- 360 GGT + LKL + + +I+ +DG+N D+ C ++ Sbjct: 104 AKAIERLKAEGGTAIDEGLKLGIQEAAKGKEDRVSHIFLL--TDGENEHGDNDRCLKLGT 161 Query: 361 LAKKLLPVVRYYSYIEITRRAHQTLWREYEHLQSTFDNFAMQHIRDQDDIYPVFRELFHK 420 +A V + + +Q + ++ +I + + FR+LF + Sbjct: 162 VASDYKLTVHTLGFGD---HWNQDVLEAIAASAQG----SLSYIENPSEALHTFRQLFQR 214 >UniRef50_C8SCW7 Putative uncharacterized protein n=1 Tax=Ferroglobus placidus DSM 10642 RepID=C8SCW7_FERPL Length = 403 Score = 45.5 bits (106), Expect = 0.004, Method: Composition-based stats. 
Identities = 47/242 (19%), Positives = 90/242 (37%), Gaps = 19/242 (7%) Query: 118 QISKDEYLDLLFEDLALPNLKQNQQRQLTEYKTHRAGYTANGVPANISVVRSLQNSLARR 177 I+ E LD FE+ ++L E G T + R + +A++ Sbjct: 110 DINFKELLDYFFEE---------ALKELIEMGII-EGVTKRFFRRKVKFSRQAERIIAQK 159 Query: 178 TAMTAGKRRELHALEENLAIISNSEPAQLLEEERLRKEIA--ELRAKIERVPFIDTFDLR 235 K + + E +S +L+E + ++ + R + F++R Sbjct: 160 VMKEVSKEAKGYYAESEGETLSYIPGYELVEYDEYLHSYDLIDIPETMIRAAKNEDFEIR 219 Query: 236 YKNYEKR-PDPSSQAVMFCLMDVSGSMDQSTKDMAKRFYILLYLFLSRTYKNVEVVYIRH 294 K+ R P + L+DVS SM A + L + + + + ++EV H Sbjct: 220 EKDIVSRNPKKVGKRHFVMLIDVSDSMRGKKIVGAIEAALALKMSIRKGFDDLEVFVFNH 279 Query: 295 HTQAKEVDEHEFFYSQETGGTIVSSALKLMDEVVKERYNPAQWNIYAAQASDGDNWADDS 354 T+ ++ E + G T ++ ALK ++ + Y +DG+ A + Sbjct: 280 RTE--KIREGDIVNVDVEGRTDIALALKTARNALRGKDGAK----YVILITDGEPTASYN 333 Query: 355 PL 356 PL Sbjct: 334 PL 335 >UniRef50_UPI00006CDDCC von Willebrand factor type A domain containing protein n=1 Tax=Tetrahymena thermophila RepID=UPI00006CDDCC Length = 2138 Score = 45.5 bits (106), Expect = 0.004, Method: Composition-based stats. 
Identities = 49/290 (16%), Positives = 95/290 (32%), Gaps = 55/290 (18%) Query: 135 PNLKQNQQRQLTEYKTHRAGYTANGVPANISVVRSLQNSLARRTAMTAGKRRELHALEEN 194 P +++ Q + + + + IS + L S+ R+ K ++ E+ Sbjct: 1356 PKIEEKDQSEGQQEEFEQN---ETHSLRKISQKKVLIKSIQRKVKTNKEKVQKALNEEDK 1412 Query: 195 LAIISNSEPAQLLEEERLRKEIAELRAKIERVPFIDTFDLRYKNYEKRPDPSSQAVMFCL 254 N +Q K I+ + + P DL C+ Sbjct: 1413 ----ENQTKSQQHRISSNVKNISGQFSLGQLQPMRFPIDL-----------------ICV 1451 Query: 255 MDVSGSMDQSTKDMAKRFYILLYLFLSRTYKNVEVVYIRHHTQAKEVD------------ 302 +D SGSM+ D+ K + L L + + I+ T A+ + Sbjct: 1452 IDTSGSMNGQPLDLLKETLLFLVDLLQTGDR---ICLIQFSTNAQRLTPLLSIESKDNIK 1508 Query: 303 --EHEFFYSQETGGTIVSSALKLMDEVVKE-RYNPAQWNIYAAQASDGDNWADDSPLCHE 359 ++E GGT + ++L +V+K+ RY +++ SDG N ++ Sbjct: 1509 SIKNEINRLVAKGGTNICQGMQLAFDVLKQRRYKNPITSVFLL--SDGLNDGAENK---- 1562 Query: 360 ILAKKLLPVVRYYSYIEITRRAHQTLWREYEHLQSTFDNFAMQHIRDQDD 409 + LL + +Y + + + N M I D Sbjct: 1563 --IRDLLKQLNFYQ----NYNEENFTIQTFGFGKDHDPNL-MDKISQLMD 1605 >UniRef50_O28828 Putative uncharacterized protein n=1 Tax=Archaeoglobus fulgidus RepID=O28828_ARCFU Length = 410 Score = 45.2 bits (105), Expect = 0.005, Method: Composition-based stats. 
Identities = 39/246 (15%), Positives = 84/246 (34%), Gaps = 14/246 (5%) Query: 113 DEFVFQISKDEYLDLLFEDLALPNLKQNQQRQLTEYKTHRAGYTANGVPANISVVRSLQN 172 D ++S + +D F+++ + L++ + E + HR + S+ Sbjct: 109 DISKDELSMSQVVDNFFDEV-VDELQEMGYVEKVETRFHRKIIHYT------AKAESVLA 161 Query: 173 SLARRTAMTAGKRRELHALEENLAIISNSEPAQLLEEERLRKEIAELRAKIERVPFIDTF 232 ++ +R E S ++++ + + + Sbjct: 162 EKVLSLSLQNLDKRSYGEHETEKLGQSIFSSERIVDYDPFTHSYDNIDLVESLIASAMRG 221 Query: 233 DLRYKN---YEKRPDPSSQAVMFCLMDVSGSMDQSTKDMAKRFYILLYLFLSRTYKNVEV 289 ++ ++P + + V L+DVS SM A + L + R E+ Sbjct: 222 EIELNENEMVARQPKHTEKCVYVMLIDVSDSMRGRKIVGAIEAALCLRKAIRRAGSGDEL 281 Query: 290 VYIRHHTQAKEVDEHEFFYSQETGGTIVSSALKLMDEVVKERYNPAQWNIYAAQASDGDN 349 I + +A E+ E E + G T + ALK +++K SDG+ Sbjct: 282 RVIAFNHRAHEIKEGEILNLEARGRTDIGLALKRARKILKGSSGTGV----VFLISDGEP 337 Query: 350 WADDSP 355 + +P Sbjct: 338 TSSYNP 343 >UniRef50_D1WZ12 VWA containing CoxE family protein n=13 Tax=Streptomyces RepID=D1WZ12_9ACTO Length = 1289 Score = 44.8 bits (104), Expect = 0.006, Method: Composition-based stats. 
Identities = 43/271 (15%), Positives = 79/271 (29%), Gaps = 26/271 (9%) Query: 66 QGRGGLRHRVHPGNDHFVQNDRIERPQGGGGGSGSGQGQASQDGEGQDEFVFQISKDEYL 125 G G PG P E + + L Sbjct: 956 HGEGSRGGLSGPGRTGSRGGREPSFPGVREWSEELAALFGPGVREEVLAAAAVTGRQDVL 1015 Query: 126 DLLFEDLALPNLKQNQQRQLTEYKTHRAGYTANGVP-ANISVVRSLQNSLARRTAMTAGK 184 L A P++ +L AG G+P A ++ +R L L Sbjct: 1016 AELDPAAATPSV------ELLRTILRYAG----GLPEARLAALRPLVRHLVDELTRQLTT 1065 Query: 185 RRELHALEENLAIISNSEPAQLLEEERLRKEIAELRAKIERVPFIDTFDLRYKNYEKRPD 244 R LA + +L LR +A R + + +++ R Sbjct: 1066 RLRPALTGTMLARPTRRPGGRLDLPRTLRANLATARRTADGTVQVIPQKPVFRS---RAR 1122 Query: 245 PSSQAVMFCLMDVSGSMDQSTKDMAKRFYILLYLFLSRTYKNVEVVYIRHHTQAKEVDEH 304 S+ + + DVSGSM+ A + L + + ++ T+ ++ H Sbjct: 1123 RSADWRLILVTDVSGSME------ASTIWSALTASVLAGVPTLSTHFLAFSTEVVDLTGH 1176 Query: 305 E------FFYSQETGGTIVSSALKLMDEVVK 329 GGT +++ L+ +++ Sbjct: 1177 VHDPLSLLLEVSVGGGTHIAAGLRHARGLIE 1207 >UniRef50_A2SP98 Putative uncharacterized protein n=1 Tax=Methylibium petroleiphilum PM1 RepID=A2SP98_METPP Length = 791 Score = 44.8 bits (104), Expect = 0.006, Method: Composition-based stats. 
Identities = 65/331 (19%), Positives = 113/331 (34%), Gaps = 34/331 (10%) Query: 70 GLRHRVHPGNDHFVQNDRIERPQGGGGGSGSGQG-QASQDGEGQDEFVFQISKDEYLDLL 128 G +PGN + + G GSG+ A GQD +S ++ L Sbjct: 412 GRGGAPNPGNGNSQSGGQPGDQAGAQDGSGTEDMVPAPTVTHGQDH---VMSTEDLAQAL 468 Query: 129 FEDLALPNLKQNQQRQLTEYKTHRAGYTANGVPANISVVRSLQNSLARRTAMTAGKRREL 188 + + + + +GV + I+ Q + R Sbjct: 469 HDAGVSSDTMAKLGFDDLKKIPEEVKHAKDGVVSAINKASEDQMKVGSRYPGGHLLHYAK 528 Query: 189 HALEENLAIISNSEPAQLLEEERLRKEIAELRAKIERVPFIDTFDLRYKNYEKRPD---- 244 + + + E A E K + + +D D+ +K+ P Sbjct: 529 AQMLDFFKPVLTWEMAHKKLLEACGKGSRYDPTEPWTLYHVDAADMGFKHQRDVPFMGSR 588 Query: 245 ---PSSQAVMFCLMDVSGSMDQSTKDMAKRFYILLYLFLSRTYKNV--EVVYIRHHTQAK 299 + +MF ++D SGS+D + M KRF R + V +V+ T + Sbjct: 589 MPGKEQKPLMFDIIDTSGSVDDA---MLKRFVSEALNQARRVSRGVAPDVLISWADTICR 645 Query: 300 EVDEH-------EFFYS----QETGGTIVSSALKLMDEVVK--ERYNPAQWNIYAA-QAS 345 V E +F GGT +A++ + E+VK + A+ NI A + Sbjct: 646 GVPEFISEKNYKQFLTKGINYGGRGGTNFQAAIENVLEMVKPGSKSGYAKRNIDAICYMT 705 Query: 346 D-GDNWADDSPLCHEIL---AKKLLPVVRYY 372 D GD+ D + L + KKL P++ Sbjct: 706 DSGDSVPDPARLLRKAQECGLKKLPPILFLV 736 >UniRef50_C7DHI4 von Willebrand factor type A n=1 Tax=Candidatus Micrarchaeum acidiphilum ARMAN-2 RepID=C7DHI4_9EURY Length = 705 Score = 43.6 bits (101), Expect = 0.014, Method: Composition-based stats. 
Identities = 54/292 (18%), Positives = 99/292 (33%), Gaps = 26/292 (8%) Query: 94 GGGGSGSGQGQASQDGEGQDEFVFQISKDEYLDLLFEDLALP------NLKQNQQRQLTE 147 G + ++D G +E + S+ E L+LL +++ L NL + R + Sbjct: 372 GDKTGSKSNNELNKDLTGSEEEILNNSRGEILNLL-KNMPLSKYPIIHNLDRTINRTSDQ 430 Query: 148 YKTHR----AGYTANGVPANISVVRSLQNSLARRTAMTAGKRRELHALEENLAIISNSEP 203 + G+ NGV I+V R ++ L +E + Sbjct: 431 SSSKEVFEITGHGKNGV-GAITVYDGPVEQYNERIRPNPKIKKMLEEIELLNNPRRDLTG 489 Query: 204 AQLLEEERLRKEIAELRAKIERVPFIDTFDLRYKNYEKRPDPSSQAVMFCLMDVSGSMDQ 263 +RK I K +I LR+ + A ++ L+D+SGSM Sbjct: 490 GTSGIRPDIRKLIRYEVTKDPA--YISKPYLRH-------IKDAGAEIWMLLDISGSMGG 540 Query: 264 STKDMAKRFYILLYLFLS-RTYKNVEV--VYIRHHTQAKEVDEHEFFYSQETGGTIVSSA 320 + AKR ++ L Y ++ + Y T E D G T A Sbjct: 541 QKINAAKRILGSIHDSLDGSKYVHLRMFGFYGSDGTHVFEFDRKMLMNLAAMGDTPTDIA 600 Query: 321 LKLMDEVVKERYNPAQWNIYAAQASDGDNWADDSPLCHEILAKKLLPVVRYY 372 + +++K+ + + ++ +DGD K + V + Sbjct: 601 IYYAMDLMKK--DKSNFDKTLFIITDGDPNNGQETKNALNSLKNAMKNVNVF 650 >UniRef50_D2RGP5 von Willebrand factor type A n=1 Tax=Archaeoglobus profundus DSM 5631 RepID=D2RGP5_ARCPR Length = 430 Score = 43.2 bits (100), Expect = 0.018, Method: Composition-based stats. Identities = 27/136 (19%), Positives = 53/136 (38%), Gaps = 7/136 (5%) Query: 239 YEKRPDPSSQAVMFCLMDVSGSMDQSTKDMAKRFYILLYLFLSRTYKNVEVVYI---RHH 295 K ++ + L+D SGSM A+ + +Y S + + + HH Sbjct: 262 LTKEKLSIAEGAYYILVDKSGSMVGEKTVWARSVALAIYRMASLKRRRYFLRFFDKKTHH 321 Query: 296 --TQAKEVDEHEFFYSQETGGTIVSSALKLMDEVVKERYNPAQWNIYAAQASDGDNWADD 353 + EV + + GGT +++AL+ + + ER N +DG++ +D Sbjct: 322 LLSDPHEVVD-AILKVKSNGGTDITNALRTAVKDLVERGLSDLTNTIVI-ITDGEDVVED 379 Query: 354 SPLCHEILAKKLLPVV 369 + L+ V+ Sbjct: 380 LSKDLKKANANLISVM 395 >UniRef50_Q6ABM1 Magnesium-chelatase 67 kDa subunit n=4 Tax=Actinomycetales RepID=Q6ABM1_PROAC Length = 654 Score = 42.1 bits (97), Expect = 0.045, Method: Composition-based stats. 
Identities = 30/171 (17%), Positives = 59/171 (34%), Gaps = 13/171 (7%) Query: 168 RSLQNSLARRTAMTAGKRRELHALEENLAIISNSEPAQLLEEERLRKEIAELRAKIERVP 227 R + + RR + RR + + L + ++ + + Sbjct: 380 RFARRACGRRLRTRSNDRRGRYVSARPTDRPDDLALDATLRAAAVHQKSRRATERPDLAV 439 Query: 228 FIDTFDLRYKNYEKRPDPSSQAVMFCLMDVSGSMDQSTKDMAKR---FYILLYLFLSRT- 283 + D R K R + + + ++D SGSM + A + +LL ++ R Sbjct: 440 HVKPIDWRAKVRAGR----AASCVIFVVDASGSMGSRGRMTASKGAVLSLLLDAYVKRDR 495 Query: 284 -----YKNVEVVYIRHHTQAKEVDEHEFFYSQETGGTIVSSALKLMDEVVK 329 ++ + T + EV +H G T +S+ L EVV+ Sbjct: 496 VCLIGFRRDRAEVLVPVTSSVEVAQHGLAELPVGGRTPLSAGLIKACEVVR 546 >UniRef50_D2S019 ATPase associated with various cellular activities AAA_5 n=1 Tax=Haloterrigena turkmenica DSM 5511 RepID=D2S019_9EURY Length = 665 Score = 41.7 bits (96), Expect = 0.056, Method: Composition-based stats. Identities = 41/191 (21%), Positives = 65/191 (34%), Gaps = 16/191 (8%) Query: 168 RSLQNSLARRTAMTAGKRRELHALEENLAIISNSEPAQLLEEERLRKEIAELRAKIERVP 227 R+++ +LARRT R + + + + L + Sbjct: 385 RTMREALARRTPSKVDVRSGRYVRARDSESVDDVAIDATLRAAAPHQPARRETDDSSSGI 444 Query: 228 FIDTFDLRYKNYEKRPDPSSQAVMFCLMDVSGS-MDQSTKDMAKRFYILLYLFLSRTYKN 286 I+ DLR K E+R ++A++ ++D SGS M KR + L R Sbjct: 445 AIEPKDLRQKIRERR----AEALVVFVVDASGSVMSGRQMFETKRGILSLVEDAYRARDR 500 Query: 287 VEVVYIRHHTQAKEVD--------EHEFFYSQETGGTIVSSALKLMDEVVKERYNPAQWN 338 V VV R V+ G T ++ L E+V ER + Sbjct: 501 VAVVVFREEGAFTLVEPTRNLSAARRAVSKLTVGGNTPLAHGLVEAYELV-ERERRRDED 559 Query: 339 IYAA--QASDG 347 +Y SDG Sbjct: 560 LYPLVVLFSDG 570 Searching..................................................done Results from round 3 Score E Sequences producing significant alignments: (bits) Value Sequences used in model and found again: UniRef50_Q1C7U5 UPF0229 protein YPA_1510 n=195 Tax=Proteobacteri... 523 e-147 UniRef50_Q1QPM0 UPF0229 protein Nham_0975 n=24 Tax=Proteobacteri... 463 e-129 UniRef50_B6B8L1 Putative uncharacterized protein n=2 Tax=Rhodoba... 
458  e-127
UniRef50_Q1QZ69 UPF0229 protein Csal_0882 n=164 Tax=Proteobacter...  456  e-127
UniRef50_P59348 UPF0229 protein bll6755 n=25 Tax=Alphaproteobact...  455  e-126
UniRef50_A9I2A9 Putative uncharacterized protein n=6 Tax=Burkhol...  449  e-125
UniRef50_B7H9V9 UPF0229 protein BCB4264_A0587 n=93 Tax=Bacteria ...  420  e-116
UniRef50_A7Z2R4 UPF0229 protein RBAM_009260 n=29 Tax=Firmicutes ...  402  e-110
UniRef50_B9DZE4 UPF0229 protein CKR_0568 n=26 Tax=Clostridiales ...  397  e-109
UniRef50_B9L510 Sporulation protein YhbH n=1 Tax=Thermomicrobium...  390  e-107
UniRef50_Q3J885 Putative uncharacterized protein n=2 Tax=Nitroso...  389  e-107
UniRef50_B5EPS6 Putative uncharacterized protein n=2 Tax=Acidith...  375  e-102
UniRef50_A9FJ88 Uncharacterized conserved protein involved in st...  372  e-101
UniRef50_Q896G6 Conserved protein n=1 Tax=Clostridium tetani Rep...  369  e-100
UniRef50_UPI00016C05A6 hypothetical protein Epulo_00085 n=1 Tax=...  363  8e-99
UniRef50_C9RD74 Putative uncharacterized protein n=1 Tax=Ammonif...  359  2e-97
UniRef50_Q67Q87 Putative uncharacterized protein n=1 Tax=Symbiob...  343  8e-93
UniRef50_A4BNQ7 Putative uncharacterized protein n=1 Tax=Nitroco...  330  5e-89
UniRef50_Q9HRD6 UPF0229 protein VNG_0746C n=11 Tax=Halobacteriac...  310  9e-83
UniRef50_C1XT76 Uncharacterized conserved protein n=1 Tax=Meioth...  306  1e-81
UniRef50_Q1GJ99 Putative uncharacterized protein n=1 Tax=Ruegeri...  305  3e-81
UniRef50_Q72KI5 Glycosyltransferase n=3 Tax=Thermus RepID=Q72KI5...  278  2e-73
UniRef50_A0MN74 Conserved bacterial protein n=1 Tax=Thermus phag...  243  7e-63
UniRef50_Q747C5 Conserved domain protein n=11 Tax=Deltaproteobac...  216  1e-54
UniRef50_A4BJU7 Putative uncharacterized protein n=1 Tax=Reineke...  172  3e-41
UniRef50_D2EEA7 Putative uncharacterized protein n=1 Tax=Candida...  156  2e-36
UniRef50_A9DKM0 Putative uncharacterized protein (Fragment) n=1 ...  139  2e-31
UniRef50_B4DA43 von Willebrand factor type A n=1 Tax=Chthoniobac...  117  5e-25
UniRef50_D2VY88 Predicted protein n=1 Tax=Naegleria gruberi RepI...  114  5e-24
UniRef50_A8TM27 Putative Ser protein kinase n=1 Tax=alpha proteo...   88  6e-16
UniRef50_UPI0001913F8A hypothetical protein Salmonellaentericaen...   81  6e-14

Sequences not found previously or not previously below threshold:

UniRef50_A4C5H3 Von Willebrand factor type A domain protein n=1 ...   98  6e-19
UniRef50_C6M483 von Willebrand factor type A domain protein n=1 ...   96  3e-18
UniRef50_A9DBX8 Putative uncharacterized protein n=1 Tax=Hoeflea...   95  5e-18
UniRef50_C5BTM6 von Willebrand factor type A domain protein n=1 ...   91  8e-17
UniRef50_C6BAR1 von Willebrand factor type A n=7 Tax=Alphaproteo...   90  2e-16
UniRef50_B1KPQ5 von Willebrand factor type A n=8 Tax=Gammaproteo...   87  1e-15
UniRef50_C0YQB8 von Willebrand factor, type A n=2 Tax=Flavobacte...   87  1e-15
UniRef50_A9KS19 von Willebrand factor type A n=1 Tax=Clostridium...   86  2e-15
UniRef50_C8SEV7 von Willebrand factor type A n=4 Tax=Rhizobiales...   86  3e-15
UniRef50_B0T5X0 von Willebrand factor type A n=1 Tax=Caulobacter...   85  4e-15
UniRef50_Q21MJ3 von Willebrand factor, type A n=5 Tax=Proteobact...   84  1e-14
UniRef50_Q2N8R4 Von Willebrand factor type A domain protein n=2 ...   84  1e-14
UniRef50_C6VVX3 von Willebrand factor type A n=17 Tax=Bacteria R...   84  1e-14
UniRef50_Q4KKB4 von Willebrand factor type A domain protein n=20...   83  2e-14
UniRef50_UPI0001C375AE von Willebrand factor type A n=1 Tax=Rumi...   83  2e-14
UniRef50_C7PNZ7 von Willebrand factor type A n=5 Tax=Bacteroidet...   83  2e-14
UniRef50_C0CVB5 Putative uncharacterized protein n=1 Tax=Clostri...   83  3e-14
UniRef50_B8F8Z5 von Willebrand factor type A n=1 Tax=Desulfatiba...   79  4e-13
UniRef50_A3PN61 von Willebrand factor, type A n=9 Tax=Rhodobacte...   77  1e-12
UniRef50_UPI00016C54C8 hypothetical protein GobsU_21830 n=1 Tax=...   76  2e-12
UniRef50_A5WCP1 von Willebrand factor, type A n=2 Tax=Moraxellac...   76  2e-12
UniRef50_Q7UP85 Putative uncharacterized protein n=1 Tax=Rhodopi...   76  3e-12
UniRef50_B4WCU1 von Willebrand factor type A domain protein n=2 ...   76  3e-12
UniRef50_C8N8N5 von Willebrand factor type A domain protein n=2 ...   75  5e-12
UniRef50_C0Z5D5 Putative uncharacterized protein n=1 Tax=Breviba...   75  6e-12
UniRef50_A1ZHE2 Von Willebrand factor type A domain protein n=1 ...   74  8e-12
UniRef50_A9IU52 Putative lipoprotein n=3 Tax=Bordetella RepID=A9...   74  9e-12
UniRef50_Q28U54 von Willebrand factor type A n=3 Tax=Rhodobacter...   74  1e-11
UniRef50_P76481 Uncharacterized protein yfbK n=38 Tax=Enterobact...   74  1e-11
UniRef50_A5F9T1 von Willebrand factor, type A n=9 Tax=Bacteria R...   73  2e-11
UniRef50_A6GHE4 von Willebrand factor, type A n=1 Tax=Plesiocyst...   73  2e-11
UniRef50_B0CCM8 von Willebrand factor, type A domain protein n=4...   73  2e-11
UniRef50_C1AC65 Putative uncharacterized protein n=1 Tax=Gemmati...   73  2e-11
UniRef50_C7PR69 von Willebrand factor type A n=1 Tax=Chitinophag...   73  2e-11
UniRef50_A3D1E9 von Willebrand factor, type A n=8 Tax=Shewanella...   71  6e-11
UniRef50_A1ZU67 Von Willebrand factor type A domain protein n=1 ...   71  1e-10
UniRef50_D0XSR4 von Willebrand factor type A n=1 Tax=Brevundimon...   69  2e-10
UniRef50_A3ZT14 Putative uncharacterized protein n=1 Tax=Blastop...   69  2e-10
UniRef50_A0NVX5 Putative uncharacterized protein n=1 Tax=Labrenz...   69  3e-10
UniRef50_B1ZYN3 von Willebrand factor type A n=2 Tax=Bacteria Re...   69  3e-10
UniRef50_C7N770 Uncharacterized protein containing a von Willebr...   68  5e-10
UniRef50_UPI000185CB41 protein containing von Willebrand factor ...   68  6e-10
UniRef50_B0UJ22 von Willebrand factor type A n=1 Tax=Methylobact...   67  8e-10
UniRef50_A7C0I1 von Willebrand factor type A domain protein n=5 ...   67  1e-09
UniRef50_C8WPE6 von Willebrand factor type A n=1 Tax=Eggerthella...   66  2e-09
UniRef50_UPI00006CDDCC von Willebrand factor type A domain conta...   65  6e-09
UniRef50_A3TQT5 Putative secreted protein n=1 Tax=Janibacter sp....   64  1e-08
UniRef50_A8SXC8 Putative uncharacterized protein n=2 Tax=Clostri...   63  2e-08
UniRef50_A9GVG2 Putative uncharacterized protein n=1 Tax=Sorangi...   63  2e-08
UniRef50_Q1CVN5 von Willebrand factor type A domain protein n=2 ...   63  2e-08
UniRef50_B4D1N7 Autotransporter-associated beta strand repeat pr...   62  5e-08
UniRef50_A9NFD9 Surface-anchored VWFA domain protein n=1 Tax=Ach...   61  7e-08
UniRef50_D2W4Q3 Predicted protein n=1 Tax=Naegleria gruberi RepI...   59  3e-07
UniRef50_D2VDM1 Predicted protein n=1 Tax=Naegleria gruberi RepI...   59  4e-07
UniRef50_A0P2E4 Von Willebrand factor type A like domain n=1 Tax...   57  2e-06
UniRef50_Q55874 Uncharacterized protein sll0103 n=16 Tax=Cyanoba...   54  8e-06
UniRef50_A9AWD1 von Willebrand factor type A n=1 Tax=Herpetosiph...   54  1e-05
UniRef50_A1ZRP2 Von Willebrand factor type A domain protein n=1 ...   54  1e-05
UniRef50_D0MZH7 Putative uncharacterized protein n=1 Tax=Phytoph...   53  2e-05
UniRef50_Q231J4 von Willebrand factor type A domain containing p...   53  2e-05
UniRef50_Q22SJ4 von Willebrand factor type A domain containing p...   53  2e-05
UniRef50_B5JNR2 von Willebrand factor type A domain protein n=1 ...   53  2e-05
UniRef50_C8SCW7 Putative uncharacterized protein n=1 Tax=Ferrogl...   53  2e-05
UniRef50_A1ZFT4 Von Willebrand factor, type A n=1 Tax=Microscill...   53  2e-05
UniRef50_Q9ZGE6 Magnesium-chelatase 67 kDa subunit n=2 Tax=Helio...   53  3e-05
UniRef50_O28828 Putative uncharacterized protein n=1 Tax=Archaeo...   52  3e-05
UniRef50_A8U1E2 Putative uncharacterized protein n=1 Tax=alpha p...   52  5e-05
UniRef50_A0C051 Chromosome undetermined scaffold_14, whole genom...   51  6e-05
UniRef50_B8HNU4 von Willebrand factor type A n=4 Tax=Chroococcal...   51  8e-05
UniRef50_A2SP98 Putative uncharacterized protein n=1 Tax=Methyli...   51  1e-04
UniRef50_B8G546 von Willebrand factor type A n=6 Tax=Chloroflexi...   51  1e-04
UniRef50_D2V0V6 Predicted protein n=1 Tax=Naegleria gruberi RepI...   51  1e-04
UniRef50_D1WZ12 VWA containing CoxE family protein n=13 Tax=Stre...   50  2e-04
UniRef50_D2RGD5 von Willebrand factor type A n=1 Tax=Archaeoglob...   50  2e-04
UniRef50_A0CHZ1 Chromosome undetermined scaffold_185, whole geno...   50  2e-04
UniRef50_Q1D6F9 von Willebrand factor type A domain protein n=2 ...   50  2e-04
UniRef50_A4C730 Putative uncharacterized protein n=1 Tax=Pseudoa...   50  2e-04
UniRef50_A6BYV9 Putative uncharacterized protein n=1 Tax=Plancto...   49  3e-04
UniRef50_Q23KK4 von Willebrand factor type A domain containing p...   49  3e-04
UniRef50_D2S019 ATPase associated with various cellular activiti...   49  3e-04
UniRef50_O26551 Magnesium chelatase subunit ChlI n=1 Tax=Methano...   49  3e-04
UniRef50_Q9U7P4 Putative uncharacterized protein (Fragment) n=1 ...   49  3e-04
UniRef50_Q114A2 von Willebrand factor, type A n=3 Tax=Cyanobacte...   49  3e-04
UniRef50_B9SJS6 Protein binding protein, putative n=6 Tax=Magnol...   49  4e-04
UniRef50_Q23AA2 Putative uncharacterized protein n=1 Tax=Tetrahy...   49  4e-04
UniRef50_A0CDA1 Chromosome undetermined scaffold_17, whole genom...   49  4e-04
UniRef50_A0KNK1 Putative uncharacterized protein n=1 Tax=Aeromon...   48  5e-04
UniRef50_A3PUP3 von Willebrand factor, type A n=1 Tax=Mycobacter...   48  6e-04
UniRef50_B9MS10 von Willebrand factor type A n=2 Tax=Anaerocellu...   48  6e-04
UniRef50_Q97HZ9 Predicted metal-dependent peptidase n=1 Tax=Clos...   48  7e-04
UniRef50_B4X134 Vault protein inter-alpha-trypsin n=1 Tax=Alcani...   48  7e-04
UniRef50_UPI0001C38CFE von Willebrand factor, type A n=1 Tax=Art...   48  7e-04
UniRef50_C9LLI0 Magnesium-chelatase, subunit D/I family n=1 Tax=...   48  8e-04
UniRef50_A6GAI6 Putative uncharacterized protein n=1 Tax=Plesioc...   47  9e-04
UniRef50_B5JCH3 Putative uncharacterized protein (Fragment) n=1 ...   47  0.001
UniRef50_D2BAS2 von Willebrand factor type A domain protein n=12...   47  0.001
UniRef50_Q23FU3 von Willebrand factor type A domain containing p...   47  0.001
UniRef50_A5B5Z1 Putative uncharacterized protein n=2 Tax=Vitis v...   47  0.002
UniRef50_A0EHG0 Chromosome undetermined scaffold_97, whole genom...   47  0.002
UniRef50_A0LHW4 Vault protein inter-alpha-trypsin domain protein...   46  0.002
UniRef50_Q6ABM1 Magnesium-chelatase 67 kDa subunit n=4 Tax=Actin...   46  0.002
UniRef50_UPI0001555D4A PREDICTED: hypothetical protein, partial ...   46  0.003
UniRef50_A0E9B3 Chromosome undetermined scaffold_84, whole genom...   46  0.004
UniRef50_C7DHI4 von Willebrand factor type A n=1 Tax=Candidatus ...   46  0.004
UniRef50_A9B2Y1 VWA containing CoxE family protein n=4 Tax=Bacte...   45  0.004
UniRef50_UPI0000E460BF PREDICTED: similar to inter-alpha-trypsin...   45  0.005
UniRef50_A1AQS2 Protoporphyrin IX magnesium-chelatase n=1 Tax=Pe...   45  0.005
UniRef50_A0CK50 Chromosome undetermined scaffold_2, whole genome...   45  0.006
UniRef50_C4G1K3 Putative uncharacterized protein n=1 Tax=Abiotro...   44  0.008
UniRef50_C6VUB8 von Willebrand factor type A n=1 Tax=Dyadobacter...   44  0.009
UniRef50_C3NM85 von Willebrand factor type A n=14 Tax=Sulfolobac...   44  0.010
UniRef50_A9G8C3 Putative uncharacterized protein n=2 Tax=Sorangi...   44  0.014
UniRef50_A5FL88 Putative uncharacterized protein n=2 Tax=Flavoba...   43  0.020
UniRef50_B4S8S0 Magnesium chelatase ATPase subunit D n=3 Tax=Chl...   43  0.022
UniRef50_B1I3V2 Magnesium chelatase n=4 Tax=cellular organisms R...   42  0.034
UniRef50_C9LTL4 Magnesium-chelatase, subunit D/I family n=1 Tax=...   42  0.048

>UniRef50_Q1C7U5 UPF0229 protein YPA_1510 n=195 Tax=Proteobacteria RepID=Y1510_YERPA
Length = 424
Score = 523 bits (1347), Expect = e-147, Method: Composition-based stats.
Identities = 364/422 (86%), Positives = 395/422 (93%), Gaps = 1/422 (0%) Query: 1 MTWFIDRRLNGKNKSMVNRQRFLRRYKAQIKQSISEAINKRSVTDVDSGESVSIPTEDIS 60 M +FIDRRLNGKNKSMVNRQRFLRRYK+QIKQSI++AINKRSVTD++SGESVSIP +DI+ Sbjct: 1 MGYFIDRRLNGKNKSMVNRQRFLRRYKSQIKQSIADAINKRSVTDIESGESVSIPIDDIN 60 Query: 61 EPMFHQGRGGLRHRVHPGNDHFVQNDRIERPQGGGGGSGSGQGQASQDGEGQDEFVFQIS 120 EPMFHQG GGLRHRVHPGNDHF+ NDR++RPQGGGGG SGQG A +DGEG+DEFVFQIS Sbjct: 61 EPMFHQGNGGLRHRVHPGNDHFITNDRVDRPQGGGGGG-SGQGNAGKDGEGEDEFVFQIS 119 Query: 121 KDEYLDLLFEDLALPNLKQNQQRQLTEYKTHRAGYTANGVPANISVVRSLQNSLARRTAM 180 KDEYLDLLFEDLALPNLK+NQ +QL E+KTHRAGYT+NGVPANISVVRSLQNSLARRTAM Sbjct: 120 KDEYLDLLFEDLALPNLKRNQYKQLAEFKTHRAGYTSNGVPANISVVRSLQNSLARRTAM 179 Query: 181 TAGKRRELHALEENLAIISNSEPAQLLEEERLRKEIAELRAKIERVPFIDTFDLRYKNYE 240 TA KRREL LE L ++ NSEPAQLLEEERLRK I EL+ KI RVPFIDTFDLRYKNYE Sbjct: 180 TASKRRELRELEAALTVLENSEPAQLLEEERLRKAITELKQKIARVPFIDTFDLRYKNYE 239 Query: 241 KRPDPSSQAVMFCLMDVSGSMDQSTKDMAKRFYILLYLFLSRTYKNVEVVYIRHHTQAKE 300 +RP+PSSQAVMFCLMDVSGSMDQ+TKDMAKRFYILLYLFLSRTYKNV+VVYIRHHTQAKE Sbjct: 240 RRPEPSSQAVMFCLMDVSGSMDQATKDMAKRFYILLYLFLSRTYKNVDVVYIRHHTQAKE 299 Query: 301 VDEHEFFYSQETGGTIVSSALKLMDEVVKERYNPAQWNIYAAQASDGDNWADDSPLCHEI 360 VDE EFFYSQETGGTIVSSALKLMDEVV+ERYNPAQWNIYAAQASDGDNWADDSPLCHE+ Sbjct: 300 VDEQEFFYSQETGGTIVSSALKLMDEVVQERYNPAQWNIYAAQASDGDNWADDSPLCHEL 359 Query: 361 LAKKLLPVVRYYSYIEITRRAHQTLWREYEHLQSTFDNFAMQHIRDQDDIYPVFRELFHK 420 LAKK+LPVVRYYSYIEITRRAHQTLWREYE L+ FDNFA+QHIR+ +DIYPVFRELFHK Sbjct: 360 LAKKILPVVRYYSYIEITRRAHQTLWREYEDLEEKFDNFAIQHIREPEDIYPVFRELFHK 419 Query: 421 QN 422 Q Sbjct: 420 QT 421 >UniRef50_Q1QPM0 UPF0229 protein Nham_0975 n=24 Tax=Proteobacteria RepID=Y975_NITHX Length = 439 Score = 463 bits (1192), Expect = e-129, Method: Composition-based stats. 
Identities = 182/439 (41%), Positives = 274/439 (62%), Gaps = 18/439 (4%) Query: 1 MTWFIDRRLNGKNKSMVNRQRFLRRYKAQIKQSISEAINKRSVTDVDSGESVSIPTEDIS 60 M FIDRRLN K++S+ NRQRFLRR + ++K+SI + + ++D D ++VSIPT Sbjct: 1 MPIFIDRRLNPKDRSLGNRQRFLRRAREELKRSIRDRVRSGRISDADGEQAVSIPTRSTD 60 Query: 61 EPMFHQGRG-GLRHRVHPGNDHFVQNDRIERPQGGGGGSGSGQGQASQDGEGQDEFVFQI 119 EP F + G R V PGN HFV DR+ +P G G+ + S +D+F F + Sbjct: 61 EPRFEAAKDSGRREHVLPGNKHFVPGDRLRKPGHGAAGTPDPSMKDS-----EDDFRFVL 115 Query: 120 SKDEYLDLLFEDLALPNLKQNQQRQLTEYKTHRAGYTANGVPANISVVRSLQNSLARRTA 179 S++E LDL FEDL LP++ + +++ ++ RAG+ A G P NI+V R+++NS RR A Sbjct: 116 SREEVLDLFFEDLELPDMVKLSLKEILAFRPRRAGFAATGSPTNINVGRTMRNSYGRRIA 175 Query: 180 MTAGKRRELHALEENLAIISNSEPAQLLEEE--RLRKEIAELRAKIERVPFIDTFDLRYK 237 + KR E+ A+ + +A + + + + + L+ E+ L K + ++D D+R+ Sbjct: 176 LKRPKREEVDAIRQEIAELESGSQSPVARQRIAALQAEVERLERKRRLIAYVDPVDIRFN 235 Query: 238 NYEKRPDPSSQAVMFCLMDVSGSMDQSTKDMAKRFYILLYLFLSRTYKNVEVVYIRHHTQ 297 +E +P P+++AVMFCLMDVSGSM + KD+AKRF++LL+LFL Y E+V+I H + Sbjct: 236 RFEAQPIPNAKAVMFCLMDVSGSMGEREKDLAKRFFVLLHLFLKCRYDRTEIVFISHTHE 295 Query: 298 AKEVDEHEFFYSQETGGTIVSSALKLMDEVVKERYNPAQWNIYAAQASDGDNWADDSPLC 357 A+EV+E FFYS ++GGT+VS+AL+ M ++ ERY ++WNIYAAQASDGDN A DS C Sbjct: 296 AQEVNEETFFYSTQSGGTVVSTALEKMHRIIAERYPGSEWNIYAAQASDGDNAAADSHRC 355 Query: 358 HEILAKKLLPVVRYYSYIEI----------TRRAHQTLWREYEHLQSTFDNFAMQHIRDQ 407 +L ++++ + +YY+Y+EI T +LWR Y + + + NF M I D Sbjct: 356 ITLLDEEIMRLCQYYAYVEIIDERERHIFGTTENGTSLWRAYSSVNANWPNFQMTRIADA 415 Query: 408 DDIYPVFRELFHKQNATAK 426 DIYPVFR+LF +Q K Sbjct: 416 ADIYPVFRQLFTRQATAEK 434 >UniRef50_B6B8L1 Putative uncharacterized protein n=2 Tax=Rhodobacterales RepID=B6B8L1_9RHOB Length = 445 Score = 458 bits (1179), Expect = e-127, Method: Composition-based stats. 
Identities = 194/444 (43%), Positives = 280/444 (63%), Gaps = 21/444 (4%) Query: 1 MTWFIDRRLNGKNKSMVNRQRFLRRYKAQIKQSISEAINKRSVTD----VDSGESVSIPT 56 M FIDRR N K KS+ NRQRFLRR + IK+ + +++ +S+ D GE V+IP Sbjct: 1 MHHFIDRRANPKGKSLGNRQRFLRRARENIKERVDQSVRGKSIQSGSGVPDGGEKVTIPA 60 Query: 57 EDISEPMF-HQGRGGLRHRVHPGNDHFVQNDRIERPQGGGGGSGSGQGQASQDGEGQDEF 115 + EP F H +GGLR V PGN FV D I+RPQ GG+G G +AS++G+G+DEF Sbjct: 61 RGLKEPRFFHSSKGGLRRHVLPGNKDFVVGDTIKRPQ---GGTGQGGRKASEEGDGEDEF 117 Query: 116 VFQISKDEYLDLLFEDLALPNLKQNQQRQLTEYKTHRAGYTANGVPANISVVRSLQNSLA 175 F ++++EYL++LFE L LP+L + + T RAG T G P N+++VR+++NSL Sbjct: 118 SFTLTQEEYLEILFEGLELPDLVEKATVETETIGTRRAGLTTAGTPNNLNLVRTMRNSLG 177 Query: 176 RRTAMTAGKRRELHALEENLAIIS---NSEPAQLLEEERLRKEIAELRAKIERVPFIDTF 232 RR A+ + LEE +A + + P Q E LRK++ + K + V +ID Sbjct: 178 RRIALQRPTTKSQRDLEEQIAELEALDDRTPPQEDFLEALRKKLDGIIRKRKVVGYIDPL 237 Query: 233 DLRYKNYEKRPDPSSQAVMFCLMDVSGSMDQSTKDMAKRFYILLYLFLSRTYKNVEVVYI 292 DLRY + +S+AV+FCLMDVSGSM + KD+AKRF++LL+LFL R Y++ E+V++ Sbjct: 238 DLRYDTFVPEKIRNSRAVVFCLMDVSGSMQEREKDLAKRFFLLLHLFLERCYEHTELVFV 297 Query: 293 RHHTQAKEVDEHEFFYSQETGGTIVSSALKLMDEVVKERYNPAQWNIYAAQASDGDNWAD 352 RH A+EVDE FFY++ETGGTIVS+AL+ M E+++ERY P +WNIY AQASDG+N+ + Sbjct: 298 RHTHHAQEVDEETFFYARETGGTIVSTALEKMKEIIEERYPPDEWNIYGAQASDGENFGN 357 Query: 353 DSPLCHEILAKKLLPVVRYYSYIEI----------TRRAHQTLWREYEHLQSTFDNFAMQ 402 DS C ++L LLPV ++Y+Y+EI A + LW+ Y +++ +F MQ Sbjct: 358 DSARCKKLLLNDLLPVSQFYAYVEIVDEAAEMLLNNPEAGEDLWQNYREVKAQAQHFEMQ 417 Query: 403 HIRDQDDIYPVFRELFHKQNATAK 426 + IYP+FRE F + A+ Sbjct: 418 RVSQPGHIYPIFREFFLPKVKGAQ 441 >UniRef50_Q1QZ69 UPF0229 protein Csal_0882 n=164 Tax=Proteobacteria RepID=Y882_CHRSD Length = 426 Score = 456 bits (1173), Expect = e-127, Method: Composition-based stats. 
Identities = 238/427 (55%), Positives = 310/427 (72%), Gaps = 4/427 (0%) Query: 1 MTWFIDRRLNGKNKSMVNRQRFLRRYKAQIKQSISEAINKRSVTDVDSGESVSIPTEDIS 60 MT+FIDRR N KNKS VNRQRFL+RY++ IK+++ EA+N+RS+TD++ GE +SIP +DIS Sbjct: 1 MTYFIDRRANAKNKSAVNRQRFLQRYRSHIKRAVEEAVNRRSITDMERGEKISIPAKDIS 60 Query: 61 EPMFHQGRGGLRHRVHPGNDHFVQNDRIERPQGGGGGSGSGQGQASQDGEGQDEFVFQIS 120 EP+F G GG R V PGN FV+ DR+ R GG G GSG+G AS GEG DEF F +S Sbjct: 61 EPVFQHGPGGARTIVSPGNKEFVEGDRLRR-PGGEGRGGSGEGSASNQGEGMDEFAFSLS 119 Query: 121 KDEYLDLLFEDLALPNLKQNQQRQLTEYKTHRAGYTANGVPANISVVRSLQNSLARRTAM 180 ++E+LD +F+ LALP+L++ Q R L E + RAG T +GVP+ I++VRS++ + ARR M Sbjct: 120 REEFLDFVFDGLALPHLERKQLRDLDEVRPVRAGVTRDGVPSRINIVRSMREAQARRIGM 179 Query: 181 TAGKRRELHALEENLAIISNSEP--AQLLEEERLRKEIAELRAKIERVPFIDTFDLRYKN 238 A +R L EE L +P L+ EI L ++E VPFIDT+DLRY N Sbjct: 180 RAPIKRALREAEEALESEERKDPVLRNPARIGELKAEIERLEKRLEAVPFIDTYDLRYNN 239 Query: 239 YEKRPDPSSQAVMFCLMDVSGSMDQSTKDMAKRFYILLYLFLSRTYKNVEVVYIRHHTQA 298 +P PS++AVMFC+MDVSGSM Q KD+AKRF++LLYLFL R Y+ VE+V+IRHHT A Sbjct: 240 LIDQPQPSNKAVMFCVMDVSGSMTQGHKDIAKRFFLLLYLFLERNYEKVELVFIRHHTAA 299 Query: 299 KEVDEHEFFYSQETGGTIVSSALKLMDEVVKERYNPAQWNIYAAQASDGDNWADDSPLCH 358 KEVDE EFFYS+ETGGTIVSSAL L+DE++ +RY+PAQWN+Y AQASDGDNW DDS C Sbjct: 300 KEVDEEEFFYSRETGGTIVSSALTLVDEIIAKRYSPAQWNLYVAQASDGDNWDDDSLTCR 359 Query: 359 EILAKKLLPVVRYYSYIEITRRAHQTLWREYEHLQSTFD-NFAMQHIRDQDDIYPVFREL 417 ++L L+ ++YY+Y+EIT +HQ LW EYE +Q+ FAMQ I + DIYPVFR+L Sbjct: 360 DLLMTSLMAKLQYYTYVEITPHSHQALWEEYERVQAAHPSRFAMQQIVEPGDIYPVFRKL 419 Query: 418 FHKQNAT 424 F K+ A+ Sbjct: 420 FRKRVAS 426 >UniRef50_P59348 UPF0229 protein bll6755 n=25 Tax=Alphaproteobacteria RepID=Y6755_BRAJA Length = 427 Score = 455 bits (1170), Expect = e-126, Method: Composition-based stats. 
Identities = 186/431 (43%), Positives = 276/431 (64%), Gaps = 18/431 (4%) Query: 3 WFIDRRLNGKNKSMVNRQRFLRRYKAQIKQSISEAINKRSVTDVDSGESVSIPTEDISEP 62 IDRRLN KS+ NRQRFLRR K+ ++ ++ + +R + DV G V+IP + + EP Sbjct: 4 HIIDRRLNPGGKSLENRQRFLRRAKSLVQGAVKKTSQERDIKDVLEGGEVTIPLDGMHEP 63 Query: 63 MFHQGRGGLRHRVHPGNDHFVQNDRIERPQGGGGGSGSGQGQASQDGEGQDEFVFQISKD 122 F + GG R V PGN FV+ D ++R G GS + +G+ +D F F +S+D Sbjct: 64 RFRR-EGGTRDMVLPGNKKFVEGDYLQR-----SGQGSAKDSGPGEGDSEDAFRFVLSRD 117 Query: 123 EYLDLLFEDLALPNLKQNQQRQLTEYKTHRAGYTANGVPANISVVRSLQNSLARRTAMTA 182 E++DL +DL LP+L + + Q RAGYT +G PANISV R+++ +LARR A+ Sbjct: 118 EFVDLFLDDLELPDLAKRKIAQTESEGIQRAGYTTSGSPANISVSRTVKLALARRIALKR 177 Query: 183 GKRRELHALEENLAIISNSEPAQLLEEERLRKEIAELRAKIERVPFIDTFDLRYKNYEKR 242 ++ E+ LE +A ++ + E L E+ +L AK +R+PFID D+RY+ +E Sbjct: 178 PRKDEIEELEAAIAACTDED-----ERVVLLAELEKLMAKTKRIPFIDPLDIRYRRFETV 232 Query: 243 PDPSSQAVMFCLMDVSGSMDQSTKDMAKRFYILLYLFLSRTYKNVEVVYIRHHTQAKEVD 302 P P +QAVMFCLMDVSGSM + KD+AKRFY+LLY+FL R YK+VE+V+IRH +A+EVD Sbjct: 233 PKPVAQAVMFCLMDVSGSMSEHMKDLAKRFYMLLYVFLKRRYKHVEIVFIRHTDRAEEVD 292 Query: 303 EHEFFYSQETGGTIVSSALKLMDEVVKERYNPAQWNIYAAQASDGDNWADDSPLCHEILA 362 E FFY +GGT+VSSAL+ M ++V+ER+NP+ WNIYAAQASDGDN D L +L Sbjct: 293 EQTFFYGPASGGTLVSSALQAMHDIVRERFNPSDWNIYAAQASDGDNSYSDGELTGLLLT 352 Query: 363 KKLLPVVRYYSYIEITRRAH-------QTLWREYEHLQSTFDNFAMQHIRDQDDIYPVFR 415 K+LPV ++++Y+E+ +LW YE L+++ +M+ + ++ +I+PVF Sbjct: 353 DKILPVCQFFAYLEVGESGGSAFDLSDSSLWTLYERLRNSGAPLSMRKVSERSEIFPVFH 412 Query: 416 ELFHKQNATAK 426 +LF ++ + + Sbjct: 413 DLFQRRETSQE 423 >UniRef50_A9I2A9 Putative uncharacterized protein n=6 Tax=Burkholderiales RepID=A9I2A9_BORPD Length = 419 Score = 449 bits (1155), Expect = e-125, Method: Composition-based stats. 
Identities = 205/426 (48%), Positives = 281/426 (65%), Gaps = 10/426 (2%) Query: 1 MTWFIDRRLNGKNKSMVNRQRFLRRYKAQIKQSISEAINKRSVTDVDSGESVSIPTEDIS 60 M IDRRLNG+NKS VNR+RFLRRYK QI++++ + + +RS+ D+D G +++P DIS Sbjct: 1 MNSLIDRRLNGRNKSAVNRERFLRRYKDQIRRAVQDLVRERSIEDMDQGGEINLPARDIS 60 Query: 61 EPMFHQGRGGLRHRVHPGNDHFVQNDRIERPQGGGGGSGSGQGQASQDGEGQDEFVFQIS 120 EP F G+GG R VHPGN F + D RP G G GS G+ D+F F +S Sbjct: 61 EPHFRHGQGGDRELVHPGNREFAKGDTFPRPSGSDGEGGSEPGEGES----VDQFTFSLS 116 Query: 121 KDEYLDLLFEDLALPNLKQNQQRQLTEYKTHRAGYTANGVPANISVVRSLQNSLARRTAM 180 + E+L+L FEDL LP+L + Q +T+ K RAGYT G P+ +SV R+L+ SL+RR A+ Sbjct: 117 RAEFLNLFFEDLELPHLIRTQLGDVTQKKWQRAGYTTTGSPSLLSVSRTLKASLSRRVAL 176 Query: 181 TAGKRRELHALEENLAIISNSEPAQLLEEERLRKEIAELRAKIERVPFIDTFDLRYKNYE 240 R EL A + L + A E + LR+E+ + ++ R+PF+D DLRY+N Sbjct: 177 GVAARAELEAAQAKLDAAIAAG-APQAEIDALRQEVEDCANRLARLPFLDDLDLRYRNRV 235 Query: 241 KRPDPSSQAVMFCLMDVSGSMDQSTKDMAKRFYILLYLFLSRTYKNVEVVYIRHHTQAKE 300 P ++AVMFCLMDVSGSMD+ KD+AKRF+ LLYLFLSR Y++V+VV+IRH A+E Sbjct: 236 SVAMPMARAVMFCLMDVSGSMDEGKKDLAKRFFTLLYLFLSRKYEHVDVVFIRHTDNAEE 295 Query: 301 VDEHEFFYSQETGGTIVSSALKLMDEVVKERYNPAQWNIYAAQASDGDNWADDSPLCHEI 360 VDE FFY ++GGTIV SAL+LM E+V++RY P+ WN+YAAQASDGD++ D+ Sbjct: 296 VDEQTFFYDPKSGGTIVLSALELMHEIVQQRYPPSAWNVYAAQASDGDSFGADAGKSARF 355 Query: 361 LAKKLLPVVRYYSYIEITRR---AHQTLWREYEHLQSTFDNFAMQHIRDQDDIYPVFREL 417 LA+ LLP RY++YIE+ +LW EYE Q T +F M+ I ++ +IYPVF +L Sbjct: 356 LAENLLPATRYFAYIEVPDSQEARKSSLWAEYE--QETAPHFVMRRICERGEIYPVFHDL 413 Query: 418 FHKQNA 423 F K+ A Sbjct: 414 FKKETA 419 >UniRef50_B7H9V9 UPF0229 protein BCB4264_A0587 n=93 Tax=Bacteria RepID=Y587_BACC4 Length = 391 Score = 420 bits (1079), Expect = e-116, Method: Composition-based stats. 
Identities = 105/416 (25%), Positives = 179/416 (43%), Gaps = 47/416 (11%) Query: 14 KSMVNRQRFLRRYKAQIKQSISEAINKRSVTDVDSGESVSIPTEDISEPMFHQGRGGLRH 73 K ++QR + + IK ++ + + + S+ + + V IP + E +H Sbjct: 21 KGYDDQQRHQEKVQEAIKNNLPDLVTEESIVMSNGKDVVKIPIRSLDEYKIRYNYDKNKH 80 Query: 74 RVHPGNDHFVQNDRIERPQGGGGGS-GSGQGQASQDGEGQDEFVFQISKDEYLDLLFEDL 132 V GN D + R GG G G+GQ + D G+D + ++S E F +L Sbjct: 81 -VGQGNGDSKVGDVVARDGSGGQKQKGPGKGQGAGDAAGEDYYEAEVSILELEQAFFREL 139 Query: 133 ALPNLKQNQQRQLTEYKTHRAGYTANGVPANISVVRSLQNSLARRTAMTAGKRRELHALE 192 LPNLK+ + + G+ NI R++ ++ R Sbjct: 140 ELPNLKRKEMDENRIEHVEFNDIRKTGLWGNIDKKRTMISAYKRNAMSGKAS-------- 191 Query: 193 ENLAIISNSEPAQLLEEERLRKEIAELRAKIERVPFIDTFDLRYKNYEKRPDPSSQAVMF 252 I DL+++ + + P S+AV+ Sbjct: 192 ---------------------------------FHPIHQEDLKFRTWNEVLKPDSKAVVL 218 Query: 253 CLMDVSGSMDQSTKDMAKRFYILLYLFLSRTYKNVEVVYIRHHTQAKEVDEHEFFYSQET 312 +MD SGSM K MA+ F+ + FL Y+ V++ +I HHT+AK V E EFF E+ Sbjct: 219 AMMDTSGSMGIWEKYMARSFFFWMTRFLRTKYETVDIEFIAHHTEAKVVTEEEFFSKGES 278 Query: 313 GGTIVSSALKLMDEVVKERYNPAQWNIYAAQASDGDNWADDSPLCHEILAKKLLPVVRYY 372 GGTI SS K E++ +Y+P ++NIY SDGDN D+ C + L ++L+ + Sbjct: 279 GGTICSSVYKKALELIDNKYSPDRYNIYPFHFSDGDNLTSDNARCVK-LVEELMKKCNMF 337 Query: 373 SYIEITRRA-HQTLWREYEHLQSTFDNFAMQHIRDQDDIYPVFRELFHKQNATAKG 427 Y E+ + H TL Y++++ DNF ++ + D++ + F +++ Sbjct: 338 GYGEVNQYNRHSTLMSAYKNIK--DDNFRYYILKQKADVFHAMKSFFREESGEKMA 391 >UniRef50_A7Z2R4 UPF0229 protein RBAM_009260 n=29 Tax=Firmicutes RepID=Y926_BACA2 Length = 394 Score = 402 bits (1032), Expect = e-110, Method: Composition-based stats. 
Identities = 103/412 (25%), Positives = 181/412 (43%), Gaps = 47/412 (11%) Query: 14 KSMVNRQRFLRRYKAQIKQSISEAINKRSVTDVDSGESVSIPTEDISEPMFHQGRGGLRH 73 K ++QR ++ + IK ++ + + + S+ + + V IP + E +H Sbjct: 21 KGFDDQQRHQKKVQEAIKNNLPDLVTEESIIMSNGKDVVKIPIRSLDEYKIRYNYDKNKH 80 Query: 74 RVHPGNDHFVQNDRIERPQGGGGGSGSGQGQASQDGEGQDEFVFQISKDEYLDLLFEDLA 133 V G+ D + R G G+G+GQ + D G+D + ++S + + LF++L Sbjct: 81 -VGQGDGDSEVGDVVAR-DGADKKQGAGKGQGAGDQAGEDYYEAEVSLMDLEEALFQELE 138 Query: 134 LPNLKQNQQRQLTEYKTHRAGYTANGVPANISVVRSLQNSLARRTAMTAGKRRELHALEE 193 LPNL+Q ++ + G+ NI R++ ++ R Sbjct: 139 LPNLQQKERDNIVHTDIEFNDIRKTGLTGNIDKKRTMLSAYKRNAMTGKPS--------- 189 Query: 194 NLAIISNSEPAQLLEEERLRKEIAELRAKIERVPFIDTFDLRYKNYEKRPDPSSQAVMFC 253 I DL+YK + P S+AV+ Sbjct: 190 --------------------------------FYPIYPEDLKYKTWNDVTKPESKAVVLA 217 Query: 254 LMDVSGSMDQSTKDMAKRFYILLYLFLSRTYKNVEVVYIRHHTQAKEVDEHEFFYSQETG 313 +MD SGSM K MA+ F+ + FL Y+ V++ +I HHT+AK V E +FF E+G Sbjct: 218 MMDTSGSMGVWEKYMARSFFFWMTRFLRTKYETVDIEFIAHHTEAKVVSEEDFFSKGESG 277 Query: 314 GTIVSSALKLMDEVVKERYNPAQWNIYAAQASDGDNWADDSPLCHEILAKKLLPVVRYYS 373 GTI SS + E++ E+Y+PA++NIY SDGDN D+ C + L ++ + Sbjct: 278 GTICSSVYRKSLELIDEKYDPARYNIYPFHFSDGDNLTSDNARCVK-LVNDIMKKSNLFC 336 Query: 374 YIEITRRA-HQTLWREYEHLQSTFDNFAMQHIRDQDDIYPVFRELFHKQNAT 424 Y E+ + H TL Y++++ D F ++ + D++ + F + + Sbjct: 337 YGEVNQYNRHSTLMSAYKNVK--DDKFKYYILKQKSDVFQALKSFFKNEESG 386 >UniRef50_B9DZE4 UPF0229 protein CKR_0568 n=26 Tax=Clostridiales RepID=Y568_CLOK1 Length = 403 Score = 397 bits (1021), Expect = e-109, Method: Composition-based stats. 
Identities = 125/413 (30%), Positives = 208/413 (50%), Gaps = 25/413 (6%) Query: 13 NKSMVNRQRFLRRYKAQIKQSISEAINKRSVTDVDSGESVSIPTEDISEPMFHQGRGGLR 72 ++S+ +R+R + + IK ++++ I++ S+ + V IP + I E F G Sbjct: 14 DRSLEDRRRHRQLVEKSIKDNLADIISEESIIGQSKNKKVKIPIKGIKEYQFIYGDNSSG 73 Query: 73 HRVHPGNDHFVQNDRIERPQGGGGGSGSGQGQASQDGEGQDEFVFQISKDEYLDLLFEDL 132 + DRI + G G+ Q + + EG+D + +++ ++ LD L EDL Sbjct: 74 VGSGD--GSQKKGDRIGKAIKDRDGKGN---QGAGNQEGEDMYEIEVTIEDVLDYLMEDL 128 Query: 133 ALPNLKQNQQRQL-TEYKTHRAGYTANGVPANISVVRSLQNSLARRTAMTAGKRRELHAL 191 LP + + + Q+ + ++GY G+ ++ R++ L R+ G +R L + Sbjct: 129 ELPLMDKKKFSQILSNNSPKKSGYQRKGINPRLAKKRTVVEKLKRQQ----GTKRALREI 184 Query: 192 EENLAIISNSEPAQLLEEERLRKEIAELRAKIERVPFIDTFDLRYKNYEKRPDPSSQAVM 251 L S+P L E K R PF DLRY +++P A + Sbjct: 185 HGEL----ESDPKNKLPENTTIKS---------RFPFKQD-DLRYFRVKRKPKLELNAAI 230 Query: 252 FCLMDVSGSMDQSTKDMAKRFYILLYLFLSRTYKNVEVVYIRHHTQAKEVDEHEFFYSQE 311 C+MD SGSMD + K +A+ F+ +LY F+ Y NVEV +I H T AK V E+EFF+ E Sbjct: 231 ICVMDTSGSMDSTRKFLARSFFFVLYRFIKMKYNNVEVKFISHSTSAKVVTENEFFHKVE 290 Query: 312 TGGTIVSSALKLMDEVVKERYNPAQWNIYAAQASDGDNWADDSPLCHEILAKKLLPVVRY 371 +GGT +SS LK EV++E YNPA WN+Y SDGDNW++D+ L + AK L V Sbjct: 291 SGGTYISSGLKKALEVIEENYNPAYWNVYTFYVSDGDNWSEDNSLALK-CAKDLCKVCNL 349 Query: 372 YSYIEITRRAHQTLWREYEHLQSTFDNFAMQHIRDQDDIYPVFRELFHKQNAT 424 +SY EI + + + + T +NF + I ++ D++ +++ +K+ Sbjct: 350 FSYAEIIPSPYGSSIKHIFQNKITDNNFTVVTIHEKQDLWKSLKKILNKELEE 402 >UniRef50_B9L510 Sporulation protein YhbH n=1 Tax=Thermomicrobium roseum DSM 5159 RepID=B9L510_THERP Length = 389 Score = 390 bits (1001), Expect = e-107, Method: Composition-based stats. 
Identities = 114/417 (27%), Positives = 182/417 (43%), Gaps = 51/417 (12%) Query: 14 KSMVNRQRFLRRYKAQIKQSISEAINKRSVTDVDSGESVSIPTEDISEPMFHQGRGGLRH 73 K +++ R + + K IK+++++ ++++S+ D V +P + E F R Sbjct: 19 KGAIDQARHMEKVKEAIKRNLADIVSEQSLITSDGKRVVRVPIRVLEEYRFRFDPDSGRQ 78 Query: 74 RVHPGNDHFVQNDRIERPQGGGGGSGSGQGQASQDGEGQDEFVFQISKDEYLDLLFEDLA 133 V G+ + GGG SG G + D G D + +++ +E +L+FEDL Sbjct: 79 -VGQGSGG---THVGDVVGRVGGGQRSGDGPQAGDQPGIDYYEAELTIEELSELIFEDLE 134 Query: 134 LPNLKQNQQRQLTEYKTHRAGYTANGVPANISVVRSLQNSLARRTAMTAGKRRELHALEE 193 LPNL++ + R+L +G AN+ R+L+ +L R Sbjct: 135 LPNLEEKRLRELESEAVRFTEIRRHGPFANLDKRRTLRENLRRNA--------------- 179 Query: 194 NLAIISNSEPAQLLEEERLRKEIAELRAKIERVPFIDTFDLRYKNYEKRPDPSSQAVMFC 253 R+ DLR+K +E+ S AV+ Sbjct: 180 --------------------------WRGRARIGDFANEDLRFKTWERDVKRESNAVVIA 213 Query: 254 LMDVSGSMDQSTKDMAKRFYILLYLFLSRTYKNVEVVYIRHHTQAKEVDEHEFFYSQETG 313 +MDVSGSM K +++ FY + FL Y VE+ +I HH +A+EV E EFF E+G Sbjct: 214 MMDVSGSMGTFEKYVSRAFYYWMVRFLRTKYDRVEIRFIAHHAEAREVSEEEFFSRGESG 273 Query: 314 GTIVSSALKLMDEVVKERYNPAQWNIYAAQASDGDNWADDSPLCHEILAKKLLPVVRYYS 373 GT S+A +L ++++E Y P WNIY SDGDNW D+ C E LA++LL + Sbjct: 274 GTRASTAYELALQLIRESYPPDSWNIYPFHFSDGDNWPSDNERCRE-LAEELLRCANLFG 332 Query: 374 YIEITR---RAHQTLWREYEHLQSTFDNFAMQHIRDQDDIYPVFRELFHKQNATAKG 427 Y EI + TL + + S I ++ D+Y R F + Sbjct: 333 YGEIRQGRYTYQSTLMHTLQRIGS--PKLVTVTITEKADVYQALRRFFGPEVGQEVA 387 >UniRef50_Q3J885 Putative uncharacterized protein n=2 Tax=Nitrosococcus oceani RepID=Q3J885_NITOC Length = 394 Score = 389 bits (999), Expect = e-107, Method: Composition-based stats. 
Identities = 119/419 (28%), Positives = 201/419 (47%), Gaps = 49/419 (11%) Query: 13 NKSMVNRQRFLRRYKAQIKQSISEAINKRSVTDVDSGESVSIPTEDISEPMFHQGRGGLR 72 ++S +R R ++ + I+ ++++ + + S+ + +P I E F G+ Sbjct: 17 DRSAKDRLRHRQKVRKAIRDNVADIVAEESIIGQSRDRIIKVPIRGIREYRFVYGQNTPG 76 Query: 73 HRVHPGNDHFVQNDRIERPQGGGGGSGSGQGQASQDGEGQDEFVFQISKDEYLDLLFEDL 132 G+ Q G G G + D G D + +I+ +E ++++ EDL Sbjct: 77 VGTGQGDSEPGQTV-------GQVPQGDGGPGHAGDRPGMDYYETEITLEELIEIMLEDL 129 Query: 133 ALPNLKQNQQRQL-TEYKTHRAGYTANGVPANISVVRSLQNSLARRTAMTAGKRRELHAL 191 LP++++ + R++ +E + R G+ GV ++ R+ ++ + RR A Sbjct: 130 ELPDMERKRFREVLSERTSKRKGFRRVGVRVHMDKRRTAKSRIRRRLA------------ 177 Query: 192 EENLAIISNSEPAQLLEEERLRKEIAELRAKIERVPFIDTFDLRYKNYEKRPDPSSQAVM 251 + AE R PF D+RY + P S AV+ Sbjct: 178 ---------------------SDKDAEDNETKHRFPFHRD-DMRYHRLREDMRPQSNAVV 215 Query: 252 FCLMDVSGSMDQSTKDMAKRFYILLYLFLSRTYKNVEVVYIRHHTQAKEVDEHEFFYSQE 311 FC+MD SGSMD K +A+ F+ LLY F+ Y NV+VV+I HHT+A+EV E EFF+ E Sbjct: 216 FCIMDTSGSMDTLKKYLARSFFFLLYQFVRSRYVNVDVVFIAHHTKAREVTEEEFFHKGE 275 Query: 312 TGGTIVSSALKLMDEVVKERYNPAQWNIYAAQASDGDNWADDSPLCHEILAKKLLPVVRY 371 GGT +SS E+++ RY+P+ WNIYA SDGDN+ D+ + A+ L V Sbjct: 276 AGGTFISSGYSKALEIIQNRYHPSLWNIYAFHCSDGDNFDSDNAATLKA-AEVLCQVCNL 334 Query: 372 YSYIEITRR----AHQTLWREYEHLQSTFDNFAMQHIRDQDDIYPVFRELFHKQNATAK 426 + Y EI R T+ + ++ DNF I+ ++DI+P FR+L +++ ++K Sbjct: 335 FGYGEIKPRPSGFYEGTMLDLFRSVR--MDNFQSVLIQRKEDIWPSFRQLLSRESESSK 391 >UniRef50_B5EPS6 Putative uncharacterized protein n=2 Tax=Acidithiobacillus ferrooxidans RepID=B5EPS6_ACIF5 Length = 434 Score = 375 bits (962), Expect = e-102, Method: Composition-based stats. 
Identities = 166/437 (37%), Positives = 258/437 (59%), Gaps = 17/437 (3%) Query: 1 MTWFIDRRLNGKNKSMVNRQRFLRRYKAQIKQSISEAINKRSVTD-VDSGESVSIPTEDI 59 M+ IDRR +G +S N+ R RR +A++K ++ + S+ D ++ + VSIPT D+ Sbjct: 1 MSMIIDRRSSG-TRSTANQDRLQRRVRARLKVAVEKMARSGSIEDLANTDQPVSIPTRDL 59 Query: 60 SEPMFHQGRGGLR-HRVHPGNDHFVQNDRIERPQGGGGGSGSGQGQASQDGEGQDEFVFQ 118 EP F + RV PGN + + D I +P+GGG G + DG G+DE Sbjct: 60 HEPSFRRDLSDTSWERVLPGNKEYQRGDEINKPEGGGSGK---GRAGAPDGLGEDEVAIV 116 Query: 119 ISKDEYLDLLFEDLALPNLKQNQQRQLTEYKTHRAGYTANGVPANISVVRSLQNSLARRT 178 +S DE+LDLLF+ LALPNL++ Q + + RAG+ +G P+ + V R+++ + ARR Sbjct: 117 LSADEFLDLLFDGLALPNLRKMAQGDIQADQWRRAGFIKDGSPSRMHVGRTMRAARARRL 176 Query: 179 AMTAGKRRELHALEENLAIISNSEPAQLLEE----------ERLRKEIAELRAKIERVPF 228 A+ AGKRREL L + ++ +L ++ L +I L KI+ +PF Sbjct: 177 ALRAGKRRELQDLLDARNVLQEEIQGRLAQKQDVSVEQERLSELNHQIDALERKIKAIPF 236 Query: 229 IDTFDLRYKNYEKRPDPSSQAVMFCLMDVSGSMDQSTKDMAKRFYILLYLFLSRTYKNVE 288 ID DLR+ + +++P P + AVMFC+MDVSGSM + KD+AKRF++LLYLFL R Y+ V+ Sbjct: 237 IDEADLRFAHIDQQPHPITNAVMFCVMDVSGSMGEKEKDLAKRFFLLLYLFLHRHYQAVQ 296 Query: 289 VVYIRHHTQAKEVDEHEFFYSQETGGTIVSSALKLMDEVVKERYNPAQWNIYAAQASDGD 348 +V+I+HH+ A E E FF ++E GGT+VS A+ L +E++++R+ P +WN+Y AQ SDGD Sbjct: 297 MVFIKHHSTASECSEQAFFGAREGGGTLVSPAIILSEEIMRQRFPPDRWNVYLAQVSDGD 356 Query: 349 NWADDSPLCHEILAKKLLPVVRYYSYIEITRRAHQTLWREYEHLQSTFDNFAMQHIRDQD 408 N+ D+ + E L L + + Y+E+ R + L R Y+ + F +++ Sbjct: 357 NYFADNAVVEEHLLNLLPRLRNLF-YLEVNRDSESDLLRLYDAIAQDFPELVTARASERE 415 Query: 409 DIYPVFRELFHKQNATA 425 DIYP+FR LF + + Sbjct: 416 DIYPMFRTLFATEETPS 432 >UniRef50_A9FJ88 Uncharacterized conserved protein involved in stress response n=21 Tax=Bacteria RepID=A9FJ88_SORC5 Length = 405 Score = 372 bits (955), Expect = e-101, Method: Composition-based stats. 
Identities = 101/405 (24%), Positives = 182/405 (44%), Gaps = 47/405 (11%) Query: 18 NRQRFLRRYKAQIKQSISEAINKRSVTDVDSGESVSIPTEDISEPMFHQGRGGLRHRVHP 77 + RF + + +I++++ + I++ + + VSIP I P F G R V Sbjct: 44 DHGRFRQIVRGRIRENLRKYISQGELIGRKGKDLVSIPIPQIDIPRFRFG-DKQRGGVGQ 102 Query: 78 GNDHFVQNDRIERPQGGGGGSGSGQGQASQDGEGQDEFVFQISKDEYLDLLFEDLALPNL 137 G+ + P GG GQGQ + GEG ++ +E +L E+L LP++ Sbjct: 103 GDGNPGD------PVGGSDDKQPGQGQ-AGSGEGDHLLEVDVTLEELAGILGEELELPDI 155 Query: 138 KQNQQRQLTEYKTHRAGYTANGVPANISVVRSLQNSLARRTAMTAGKRRELHALEENLAI 197 + + +++ +G G + R+ + +L R Sbjct: 156 QDKGKSKISNAHDRYSGIRRVGPESLRHFKRTYREALKR--------------------- 194 Query: 198 ISNSEPAQLLEEERLRKEIAELRAKIERVPFIDTFDLRYKNYEKRPDPSSQAVMFCLMDV 257 ++ R + + D RY++++ +P + AV+ +MDV Sbjct: 195 --------MISSGTFRPSAPVVVPVPD--------DKRYRSWKTITEPVANAVIIYMMDV 238 Query: 258 SGSMDQSTKDMAKRFYILLYLFLSRTYKNVEVVYIRHHTQAKEVDEHEFFYSQETGGTIV 317 SGSM K++ + + +L+R YK +E +I H A+EVD FF+++E+GGT++ Sbjct: 239 SGSMGDEQKEIVRIESFWIDAWLTRQYKGLESRFIIHDAIAREVDRDTFFHTRESGGTMI 298 Query: 318 SSALKLMDEVVKERYNPAQWNIYAAQASDGDNWA-DDSPLCHEILAKKLLPVVRYYSYIE 376 SSA KL +++ Y P +WNIY SDGDNW+ DD+ C ++L ++LP V ++Y + Sbjct: 299 SSAYKLCSQIIDNDYPPDEWNIYPFHFSDGDNWSMDDTLSCVDVLKTQILPRVNMFAYGQ 358 Query: 377 ITRRAHQ-TLWREYEHLQSTFDNFAMQHIRDQDDIYPVFRELFHK 420 + ++ + S D + IRD+D I ++ K Sbjct: 359 VESPYGSGQFIKDLKEHFSQDDRVVVSEIRDKDAIVGSIKDFLGK 403 >UniRef50_Q896G6 Conserved protein n=1 Tax=Clostridium tetani RepID=Q896G6_CLOTE Length = 386 Score = 369 bits (948), Expect = e-100, Method: Composition-based stats. 
Identities = 109/416 (26%), Positives = 192/416 (46%), Gaps = 43/416 (10%) Query: 13 NKSMVNRQRFLRRYKAQIKQSISEAINKRSVTDVDSGESVSIPTEDISEPMFHQGRGGLR 72 N++ +R+R + IK ++ + + + ++ + +P + + E F + Sbjct: 13 NRAGEDRKRHRELVEKSIKDNLVDVLLQEDISIQKENIKIKVPIKGVKEYEFTYSQNRSF 72 Query: 73 HRVHPGNDHFVQNDRIERPQGGGGGSGSGQGQASQDGEGQDEFVFQISKDEYLDLLFEDL 132 V GN+ Q ++R S G G + + EG+D F +++ +E +F+DL Sbjct: 73 VVVGKGNEKKGQKIALKR------ASEQGGGAGAGEIEGEDIFETEVTIEEIFQSIFDDL 126 Query: 133 ALPNLKQNQQRQL-TEYKTHRAGYTANGVPANISVVRSLQNSLARRTAMTAGKRRELHAL 191 LPNLK+ + ++ + + G+ +G+ ++ R+ + R+ A R++ Sbjct: 127 ELPNLKKKKFNKILNDSFKRKKGFKKHGISPRLAKRRTAIEKVKRKQATQKVLGRDIA-- 184 Query: 192 EENLAIISNSEPAQLLEEERLRKEIAELRAKIERVPFIDTFDLRYKNYEKRPDPSSQAVM 251 ER PF DLRY + + AV+ Sbjct: 185 --------------------------------ERFPFKKD-DLRYSRVKLNKNKEYNAVI 211 Query: 252 FCLMDVSGSMDQSTKDMAKRFYILLYLFLSRTYKNVEVVYIRHHTQAKEVDEHEFFYSQE 311 C+MD S SMDQ K MA+ F+ ++Y F+ Y+ V++ +I H T AKEV E EFF+ E Sbjct: 212 ICIMDTSASMDQMKKYMARSFFFMIYKFIKMKYEEVDICFISHSTTAKEVTEEEFFHKVE 271 Query: 312 TGGTIVSSALKLMDEVVKERYNPAQWNIYAAQASDGDNWADDSPLCHEILAKKLLPVVRY 371 +GGT +SS K E++ RYNP +NIY ASDGDNW +D+ + +AK+L V Sbjct: 272 SGGTYISSGYKKALEIINTRYNPQIYNIYTFHASDGDNWNEDNDRAVK-VAKELSNVCNL 330 Query: 372 YSYIEITRRAHQTLWREYEHLQSTFDNFAMQHIRDQDDIYPVFRELFHKQNATAKG 427 + YIEI + R + +NF I ++D++ +++ ++ +G Sbjct: 331 FGYIEIMGYGYSNGIRNKYLKEIEKENFIPLIIEKKEDLWRALKDILKQEMREERG 386 >UniRef50_UPI00016C05A6 hypothetical protein Epulo_00085 n=1 Tax=Epulopiscium sp. 'N.t. morphotype B' RepID=UPI00016C05A6 Length = 368 Score = 363 bits (931), Expect = 8e-99, Method: Composition-based stats. 
Identities = 119/401 (29%), Positives = 187/401 (46%), Gaps = 63/401 (15%) Query: 18 NRQRFLRRYKAQIKQSISEAINKRSVTDVDSGESVSIPTEDISEPMFHQGRGGLRHRVHP 77 + +R + + I+++I I S+T+ +G V + +++ E F G V Sbjct: 18 DAKRHRKLVEKSIRENIDMLIVGESITETAAGNIVKVRIQELPEYRFKFGSS--TEYVAI 75 Query: 78 GNDHFVQNDRIERPQGGGGGSGSGQGQASQDGEGQDEFVFQISKDEYLDLLFEDLALPNL 137 G+ V N++ + + +AS + G D + +I D+ L LLFE L LPNL Sbjct: 76 GDGDEVVNEKCDF-----------EMEASNEA-GLDIYESEIVLDDALALLFEQLELPNL 123 Query: 138 KQNQQRQLTEYKTH-RAGYTANGVPANISVVRSLQNSLARRTAMTAGKRRELHALEENLA 196 + + + L + T R+G G+ + R+LQ + R Sbjct: 124 YEKKFKNLEYFSTQKRSGIKKTGIYPRFAKKRTLQEKIIRN------------------- 164 Query: 197 IISNSEPAQLLEEERLRKEIAELRAKIERVPFIDTFDLRYKNYEKRPDPSSQAVMFCLMD 256 + FI+ D+RY++ K+ S AV+ C+MD Sbjct: 165 ---------------------------KNGRFINQ-DIRYQSLAKKQINHSNAVIVCIMD 196 Query: 257 VSGSMDQSTKDMAKRFYILLYLFLSRTYKNVEVVYIRHHTQAKEVDEHEFFYSQETGGTI 316 SGSM + KDMAK FY LLY F+ Y VE+++I H T AKEV E++FF+ E+GGT Sbjct: 197 TSGSMGTTKKDMAKSFYFLLYQFIKIRYAKVEMIFIAHSTIAKEVTENDFFHKGESGGTY 256 Query: 317 VSSALKLMDEVVKERYNPAQWNIYAAQASDGDNWADDSPLCHEILAKKLLPVVRYYSYIE 376 +SS E++KERY+P WN+Y SDGDNW DD+ L LA +L + YIE Sbjct: 257 ISSGYTKALEIIKERYDPRLWNVYTFHCSDGDNWTDDNNLAV-SLANELCSCSNLFGYIE 315 Query: 377 ITRRAHQTLWREYEHLQSTFDNFAMQHIRDQDDIYPVFREL 417 I + ++ + T +NF I + DI+ VF+++ Sbjct: 316 IKTNNYSSVILNEYNAHITSNNFLALKIFKKSDIFEVFKKV 356 >UniRef50_C9RD74 Putative uncharacterized protein n=1 Tax=Ammonifex degensii KC4 RepID=C9RD74_AMMDK Length = 371 Score = 359 bits (920), Expect = 2e-97, Method: Composition-based stats. 
Identities = 106/407 (26%), Positives = 167/407 (41%), Gaps = 51/407 (12%) Query: 14 KSMVNRQRFLRRYKAQIKQSISEAINKRSVTDVDSGESVSIPTEDISEPMFHQGRGGLRH 73 K + +R + K I+Q + E I + S+ D V IP + E F Sbjct: 15 KGEEDARRHQEKLKEIIRQRLPELITEESLILADDRRKVRIPLRLVEEFRFRFA-SHQEM 73 Query: 74 RVHPGNDHFVQNDRIERPQGGGGGSGSGQGQASQDGEGQDEFVFQISKDEYLDLLFEDLA 133 V ++ I P G GG + G D + ++S +E +++FE+LA Sbjct: 74 LVGQAGSQPGTDETIVFPGIGRGGGAGTE-------PGIDYYEAEVSVEEIAEVVFEELA 126 Query: 134 LPNLKQNQQRQLTEYKTHRAGYTANGVPANISVVRSLQNSLARRTAMTAGKRRELHALEE 193 LP+ K A G+ A + R+L N+L R G++ E Sbjct: 127 LPHYKPKNTAN-RGIAEEWADLRRQGIRACLDRRRTLLNALKRHA--KEGRKGEFR---- 179 Query: 194 NLAIISNSEPAQLLEEERLRKEIAELRAKIERVPFIDTFDLRYKNYEKRPDPSSQAVMFC 253 + DLR++ + P ++AV+ Sbjct: 180 -----------------------------------LCPSDLRFRVWRSIESPEARAVVLA 204 Query: 254 LMDVSGSMDQSTKDMAKRFYILLYLFLSRTYKNVEVVYIRHHTQAKEVDEHEFFYSQETG 313 ++D SGSM K +A+ F+ + FL Y NVEVVY+ HHT+A+E EFF E+G Sbjct: 205 MLDTSGSMGPLEKYLARSFFFWMVRFLEANYANVEVVYLAHHTEARETTASEFFRKGESG 264 Query: 314 GTIVSSALKLMDEVVKERYNPAQWNIYAAQASDGDNWADDSPLCHEILAKKLLPVVRYYS 373 GT SS +L ++++ RY P ++NIYA SDGDN D+ C E L +LL V Sbjct: 265 GTRCSSVYELALDIIETRYPPTEYNIYAFHFSDGDNLPADNERCME-LIGRLLEVANLVG 323 Query: 374 YIEITRRAHQTLWREYEHLQSTFDNFAMQHIRDQDDIYPVFRELFHK 420 Y EI T + + + +R++ D+Y + F + Sbjct: 324 YGEIEGPYFYTSTLKTVYQSIAHPRLVVVTLRERKDVYRALKAFFAR 370 >UniRef50_Q67Q87 Putative uncharacterized protein n=1 Tax=Symbiobacterium thermophilum RepID=Q67Q87_SYMTH Length = 395 Score = 343 bits (879), Expect = 8e-93, Method: Composition-based stats. 
Identities = 98/422 (23%), Positives = 170/422 (40%), Gaps = 54/422 (12%) Query: 14 KSMVNRQRFLRRYKAQIKQSISEAINKRSVTDVDSGESVSIPTEDISEPMFHQGRGGLRH 73 + ++++R R + I+ ++++ ++ ++ D + V +P + E F + Sbjct: 18 QGQMDQERHQARIREAIRANLADIVSDEAIIASDGRKVVRLPIRVLREYRFRLDWQK-QP 76 Query: 74 RVHPGNDHFVQNDRIERPQGGGGGSGSGQGQASQDGEGQDEFVFQISKDEYLDLLFEDLA 133 RV + + + RP +G +D F + +E +LLF +L+ Sbjct: 77 RVGEADGPVRPGEPVGRPGRAAEAAGGSGAGDEAG---EDWFETDVPLEELEELLFAELS 133 Query: 134 LPNLKQNQQRQLTEYKTHRAGYTANGVPANISVVRSLQNSLARRTAMTAGKRRELHALEE 193 LP+L+ Q+ LT G+ ANI R+L ++ R Sbjct: 134 LPHLEPKQEPHLTVLHHEWRDVRRQGLYANIDKKRTLLEAMKRNRLAGRP---------- 183 Query: 194 NLAIISNSEPAQLLEEERLRKEIAELRAKIERVPFIDTFDLRYKNYEKRPDPSSQAVMFC 253 + I DLR++ +++ P + AV+ Sbjct: 184 -------------------------------PLAGIRREDLRFRTWDEAEIPGASAVLII 212 Query: 254 LMDVSGSMDQSTKDMAKRFYILLYLFLSRTYKNVEVVYIRHHTQAKEVDEHEFFYSQETG 313 +MD SGSM K +A+ + FL Y+ V++ ++ H T+AKE+DE FF E+G Sbjct: 213 MMDTSGSMGTGEKYIARSLCHWMVRFLRTRYERVKLHFVAHTTEAKEMDEESFFTRGESG 272 Query: 314 GTIVSSALKLMDEVVKERYNPAQWNIYAAQASDGDNWADDSPLCHEILAKKLLPVVRYYS 373 GT SSA + +++ RY P + N+YA SDGDN D+P L +KLL Sbjct: 273 GTRCSSAYEYALQLIDRRYPPDRHNLYAFHFSDGDNLISDNPRAV-ALLRKLLERCALVG 331 Query: 374 YIEITRRAHQTLWREYE--------HLQSTFDNFAMQHIRDQDDIYPVFRELFHKQNATA 425 Y +I + Y+ + F IRD+ +IY R F + A Sbjct: 332 YGQIETQPQYLSMPYYQPNTLLTLFREEIDHPRFVTALIRDRSEIYAALRAFFPRPGAGE 391 Query: 426 KG 427 +G Sbjct: 392 RG 393 >UniRef50_A4BNQ7 Putative uncharacterized protein n=1 Tax=Nitrococcus mobilis Nb-231 RepID=A4BNQ7_9GAMM Length = 391 Score = 330 bits (847), Expect = 5e-89, Method: Composition-based stats. 
Identities = 95/421 (22%), Positives = 177/421 (42%), Gaps = 58/421 (13%) Query: 14 KSMVNRQRFLRRYKAQIKQSISEAINKRSVTDVDSGESVSIPTEDISEPMFHQGRGGLRH 73 + + R + + +++ + + + V +V +P + F +R Sbjct: 21 RGTRDWLRHNEKIREAVREQLPDLVAGSDVLSRPDNRTVKVPVRFMEHYRFRLRNPDVRT 80 Query: 74 RVHPGNDHFVQNDRIERPQGGGGGSGSGQGQASQDGEGQDEFVFQISKDEYLDLLFEDLA 133 G R +P + S +GEGQ F + D+ LD L+++L Sbjct: 81 GAGQGKAKPGDVLRPAQP-----ARPGQGKEGSGEGEGQITFALEFQIDDILDWLWDELE 135 Query: 134 LPNLKQNQQRQLTEYKTHRAGYTANGVPANISVVRSLQNSLARRTAMTAGKRRELHALEE 193 LP+LK ++ E R G+ G + + R+++ ++ RR+A Sbjct: 136 LPHLKPRLGTRIEEDAYIREGWDRRGARSRLDRRRTMKEAIKRRSAQG------------ 183 Query: 194 NLAIISNSEPAQLLEEERLRKEIAELRAKIERVPFIDTFDLRYKNYEKRPDPSSQAVMFC 253 E +P ++ DLR++ +R P++ AV F Sbjct: 184 -----------------------------PEAIPIVND-DLRFRQLARRRRPTTNAVAFF 213 Query: 254 LMDVSGSMDQSTKDMAKRFYILLYLFLSRTYKNVEVVYIRHHTQAKEVDEHEFFYSQETG 313 L+DVS SMD+ + +AK F+ + R + +E+V+I H +A E +E FF G Sbjct: 214 LLDVSSSMDEHCRRLAKTFFFWALQGVRRQFSTIEIVFIAHTVEAWEFEEENFFRIHGQG 273 Query: 314 GTIVSSALKLMDEVVKERYNPAQWNIYAAQASDGDNWADDSPLCHEILAKKLLPVVRYYS 373 GT S+A+ ++++ERY+PA +N Y A+DG N+++D E L +L P++ + Sbjct: 274 GTKSSTAVHKAQQILEERYDPAMYNCYLFYATDGHNFSEDRRRATEALL-RLAPLMNFLG 332 Query: 374 YIEITRRAHQTL-------WREYEHLQSTFDNFAMQHIRDQDDIYPVFRELFHKQNATAK 426 Y E++ + H+ L WR ++++ + DI+ + F Q A A+ Sbjct: 333 YAEVSHQNHRRLDTEVAGIWRGLGAEGWPVGSYSLTR---EADIWLAIKAFFTDQAAEAE 389 Query: 427 G 427 Sbjct: 390 A 390 >UniRef50_Q9HRD6 UPF0229 protein VNG_0746C n=11 Tax=Halobacteriaceae RepID=Y746_HALSA Length = 442 Score = 310 bits (793), Expect = 9e-83, Method: Composition-based stats. 
Identities = 93/449 (20%), Positives = 173/449 (38%), Gaps = 58/449 (12%) Query: 17 VNRQRFLRRYKAQIKQSISEAINKRSVTDVDSGESVSIPTEDISEPMFHQGRGGLRHRVH 76 +R+RF + + +Q ++E I + ++V +P + + P F + R V Sbjct: 5 EDRERFHEIGEQR-RQDLAEFIQYGDL-GGSGPDAVRVPIKLVDLPAFEYDQ-LDRGGVG 61 Query: 77 PGNDHFVQNDRIERPQGGGGGSGSGQGQASQDGEGQDEFVFQISKDEYLDLLFEDLALPN 136 G+ D++ P +G G + D E+ +++ +E+ L E L L + Sbjct: 62 QGDVDP--GDQVGEP----DEAGEGDDDEAGDESADHEY-YEMDPEEFAAELDERLGL-D 113 Query: 137 LKQNQQRQLTEYKTHRAGYTANGVPANISVVRSLQNSLARRTAM---------------- 180 L ++ + E + G + + L R+ AM Sbjct: 114 LDPKGKKVVAETEGAFNETARRGPRGTLDFAHLYKQGLKRKIAMDFDEAYVTAALRVDGW 173 Query: 181 ---TAGKRRELHALEENLAIISNSEPAQLLEEERLRKEIAELRAKIE------------- 224 + + A I+ + +++ R + A I+ Sbjct: 174 GVDAVYTWAREQHIPVSRAWIAERARSPSPDDDAGRVVDDAVWASIDAMEAAVDVEPTRT 233 Query: 225 --------RVPFIDTFDLRYKNYEKRPDPSSQAVMFCLMDVSGSMDQSTKDMAKRFYILL 276 RVP + D R+++ + V+ + DVSGSM +S +++ +R + L Sbjct: 234 RIRRGGPGRVP-LRREDERFRHPKVVEHRERNVVVVNIRDVSGSMRESKRELVERTFTPL 292 Query: 277 YLFLSRTYKNVEVVYIRHHTQAKEVDEHEFFYSQETGGTIVSSALKLMDEVVKERYNPAQ 336 +L+ Y N E VYI H A EVD +FF Q GGT +S+A +L + V+ E Y ++ Sbjct: 293 DWYLTGKYDNAEFVYIAHDADAWEVDRTDFFGIQSGGGTRISTAYELAENVLDE-YPFSE 351 Query: 337 WNIYAAQASDGDNWADDSPLCHEILAKKLLPVVRYYSYIEITRRAHQTLWREYEHLQSTF 396 WN Y A DG+N DD+ L + ++Y+E ++ F Sbjct: 352 WNRYVFAAGDGENSHDDTEENVIPLMNDI--DANLHAYVETQPTDGVQTGTHAGKVRDAF 409 Query: 397 ---DNFAMQHIRDQDDIYPVFRELFHKQN 422 DN A+ + + DD+ + ++ Sbjct: 410 GDTDNVAVTTVTEPDDVMGAIETILSTED 438 >UniRef50_C1XT76 Uncharacterized conserved protein n=1 Tax=Meiothermus silvanus DSM 9946 RepID=C1XT76_9DEIN Length = 360 Score = 306 bits (783), Expect = 1e-81, Method: Composition-based stats. 
Identities = 91/404 (22%), Positives = 152/404 (37%), Gaps = 54/404 (13%) Query: 17 VNRQRFLRRYKAQIKQSISEAINKRSVTDVDSGESVSIPTEDISEPMFHQGRGGLRHRVH 76 + QRF + ++K+ E + + G+ VSIP + P G + Sbjct: 6 RDLQRFKEIVRGEVKKRAREFLTREEYLGSLDGQVVSIPLPQLELPRLQYGHNEMGQG-- 63 Query: 77 PGNDHFVQNDRIERPQGGGGGSGSGQGQASQDGEGQDEFVFQISKDEYLDLLFEDLALPN 136 E G G G G V ++ +E+L+L+ E L LP Sbjct: 64 ------------EGEGEGQGQGMGGTAGRGGLGPSGHVPVAEMDLEEFLELIGEALKLPR 111 Query: 137 LKQNQQRQLTEYKTHRAGYTANGVPANISVVRSLQNSLARRTAMTAGKRRELHALEENLA 196 L+ Q + E + G + R+L+ +L R Sbjct: 112 LEPKQGGAVEESSPKYTTLSRRGPESLRHARRTLRQALRRAI------------------ 153 Query: 197 IISNSEPAQLLEEERLRKEIAELRAKIERVPFIDTFDLRYKNYEKRPDPSSQAVMFCLMD 256 + R E L + + D RY+ E +P P +QA + +D Sbjct: 154 -----------QSGIYRPEDPRLVPERD--------DYRYRAPEPKPRPQAQAALVFALD 194 Query: 257 VSGSMDQSTKDMAKRFYILLYLFLSRTYKNVEVVYIRHHTQAKEVDEHEFFYSQETGGTI 316 VSGSM+ + + + ++ + + + Y+ H +A EV E +FF +E GGT Sbjct: 195 VSGSMEGEQLRLVRILSYWITAWVKKHFPRLSRHYLLHDAEAWEVSEEDFFRLREGGGTR 254 Query: 317 VSSALKLMDEVVKERYNPAQWNIYAAQASDGDNWADDSPLCHEILAKKLLPVVRYYSYIE 376 +SS +KL +V+ ERY +N Y +DGDNW DD+ E L K LLP + Y Y + Sbjct: 255 LSSGIKLAQQVL-ERYPAQLYNRYVYHFTDGDNWQDDTAEALETL-KALLPTLSLYGYAQ 312 Query: 377 ITRRAHQ-TLWREYEHLQSTFDNFAMQHIRDQDDIYPVFRELFH 419 + R Q + + A + ++ + + L Sbjct: 313 VRSRYGQGRFIDDLRSHFPSDPALATAELGGRESLPSALKRLLG 356 >UniRef50_Q1GJ99 Putative uncharacterized protein n=1 Tax=Ruegeria sp. TM1040 RepID=Q1GJ99_SILST Length = 300 Score = 305 bits (780), Expect = 3e-81, Method: Composition-based stats. 
Identities = 123/299 (41%), Positives = 183/299 (61%), Gaps = 13/299 (4%) Query: 139 QNQQRQLTEYKTHRAGYTANGVPANISVVRSLQNSLARRTAMTAG---KRRELHALEENL 195 + + T RAG T G P N+++VR+++NSL RR A+ +R+L A L Sbjct: 2 EKATVETETIGTRRAGLTTAGTPNNLNLVRTMRNSLGRRIALQRPSTQTQRDLEAQVAEL 61 Query: 196 AIISNSEPAQLLEEERLRKEIAELRAKIERVPFIDTFDLRYKNYEKRPDPSSQAVMFCLM 255 I P+Q L ++ ++ K V +ID DLRY + +S+AV+FCLM Sbjct: 62 EEIEARSPSQDELLAELVAKLDGIKRKRRVVGYIDPLDLRYDTFVPEKIRNSRAVVFCLM 121 Query: 256 DVSGSMDQSTKDMAKRFYILLYLFLSRTYKNVEVVYIRHHTQAKEVDEHEFFYSQETGGT 315 DVSGSM + KD+AKRF++LL+LFL+R Y++ E+V++RH A+EVDE FFY++ETGGT Sbjct: 122 DVSGSMQEREKDLAKRFFLLLHLFLTRGYEHTEIVFVRHTHYAQEVDEETFFYARETGGT 181 Query: 316 IVSSALKLMDEVVKERYNPAQWNIYAAQASDGDNWADDSPLCHEILAKKLLPVVRYYSYI 375 IVS+AL+ M E++ ERY P +WNIY AQASDG+N+ +DS C +IL ++LLP+ ++Y+Y+ Sbjct: 182 IVSTALEKMKEIIDERYPPDEWNIYGAQASDGENFGNDSVRCRKILTEQLLPMCQFYAYV 241 Query: 376 EI----------TRRAHQTLWREYEHLQSTFDNFAMQHIRDQDDIYPVFRELFHKQNAT 424 EI A + LW+ Y ++ +F MQ + + IYP+FRE F + Sbjct: 242 EIVEESAQMLLDNTEAGEDLWQNYRQVKEACRHFEMQRVSEPGHIYPIFREFFLPKVKG 300 >UniRef50_Q72KI5 Glycosyltransferase n=3 Tax=Thermus RepID=Q72KI5_THET2 Length = 351 Score = 278 bits (711), Expect = 2e-73, Method: Composition-based stats. 
Identities = 104/404 (25%), Positives = 161/404 (39%), Gaps = 64/404 (15%) Query: 17 VNRQRFLRRYKAQIKQSISEAINKRSVTDVDSGESVSIPTEDISEPMFHQGRGGLRHRVH 76 + RF + ++K+ + E + + + G VSIP + P G Sbjct: 10 RDLLRFKEIVRGEVKKRVREFLTREELFGQVEGRLVSIPLPQLEIPKIVHGEPLGEGL-- 67 Query: 77 PGNDHFVQNDRIERPQGGGGGSGSGQGQASQDGEGQDEFVFQISKDEYLDLLFEDLALPN 136 G G G G V ++ +E+LDL+ E L LP Sbjct: 68 ----------------------GLGGPGEEALGPGGHIPVAELELEEFLDLVGEALRLPR 105 Query: 137 LKQNQQRQLTEYKTHRAGYTANGVPANISVVRSLQNSLARRTAMTAGKRRELHALEENLA 196 L+ + ++TE G V R+L+ SL R Sbjct: 106 LRPKGEGEVTEEALRHTTIARKGPRGLRHVRRTLKESLKR-------------------- 145 Query: 197 IISNSEPAQLLEEERLRKEIAELRAKIERVPFIDTFDLRYKNYEKRPDPSSQAVMFCLMD 256 L+ R E L + E DLRYK ++P P +QAV+ +D Sbjct: 146 ---------ALQSGEYRPEDPLLVPERE--------DLRYKAPRRKPIPHAQAVVLFALD 188 Query: 257 VSGSMDQSTKDMAKRFYILLYLFLSRTYKNVEVVYIRHHTQAKEVDEHEFFYSQETGGTI 316 VSGSM + + K + L++ R + +E Y+ H +A EV E EFF ++E GGT Sbjct: 189 VSGSMREEELKLVKTLSFWITLWIKRHFPRLERRYLLHDAEAWEVPEEEFFKAREGGGTR 248 Query: 317 VSSALKLMDEVVKERYNPAQWNIYAAQASDGDNWADDSPLCHEILAKKLLPVVRYYSYIE 376 +SSAL L +E++K Y A +N Y SDG+NW D+PL E ++LLP + Y Y + Sbjct: 249 ISSALLLAEEILKA-YPEAFYNRYLFHFSDGENWQGDTPLALEA-LRRLLPSLALYGYAQ 306 Query: 377 ITRRAHQT-LWREYEHLQSTFDNFAMQHIRDQDDIYPVFRELFH 419 + Q E + A+ +R ++D+ R L Sbjct: 307 VEGPYGQGHFLEEVREALGGREGVALAAVRGREDLPVALRRLLG 350 >UniRef50_A0MN74 Conserved bacterial protein n=1 Tax=Thermus phage phiYS40 RepID=A0MN74_9CAUD Length = 340 Score = 243 bits (621), Expect = 7e-63, Method: Composition-based stats. 
Identities = 86/366 (23%), Positives = 137/366 (37%), Gaps = 75/366 (20%) Query: 16 MVNRQRFLRRYKAQIKQSISEAINKRSVTDVDSGESVSIPTEDISEPMFHQGRGGLRHRV 75 ++ R+L+R + IK + + +N + + + V I + EP F Sbjct: 5 TIDEIRYLKRLENIIKARMQDIVNSNDIIESTPEDKVRIRIPIMDEPYFK---------- 54 Query: 76 HPGNDHFVQNDRIERPQGGGGGSGSGQGQASQDGEGQDEFVFQISKDEYLDLLFEDLALP 135 + G G GSGSG EG E +++ +E +LLFE L LP Sbjct: 55 -----------PVFPGSGAGAGSGSGSEPGEGSEEGDHEIEIELTVEELSELLFEYLGLP 103 Query: 136 NLKQNQQRQLTEYKTHRAGYTANGVPANISVVRSLQNSLARRTAMTAGKRRELHALEENL 195 +K + + + G + G + I RR Sbjct: 104 KIKPKG-SSVEKEEYLIEGISKTGPRSRIH----------RR------------------ 134 Query: 196 AIISNSEPAQLLEEERLRKEIAELRAKIERVPFIDTFDLRYKNYEKRPDPSSQAVMFCLM 255 + +I + + + +RYK+ KR P A+++ Sbjct: 135 ----------------------KTYYEIMKYGYKED-SIRYKHLRKREVPIFDAIVYFAR 171 Query: 256 DVSGSMDQSTKDMAKRFYILLYLFLSRTYKNVEVVYIRHHTQAKEVDEHEFFYSQETGGT 315 D S S+D K K + F+ YKNV + H T+AK V E +FF E G T Sbjct: 172 DYSASVDDKKKFKIKSTAFWINNFIKYNYKNVTTKFAVHDTKAKFVSEQDFFKLSEGGAT 231 Query: 316 IVSSALKLMDEVVKERYNPAQWNIYAAQASDGDNWADDSPLCHEILAKKLLPVVRYYSYI 375 + SS +L+ E + RY+ +N Y SDG+N DD+P E L +KL +Y Sbjct: 232 LCSSVFELIYEDYR-RYSVDDYNFYLFYFSDGENLPDDNPKLRE-LVEKLSEDFNLIAYG 289 Query: 376 EITRRA 381 E+ Sbjct: 290 EVKSTD 295 >UniRef50_Q747C5 Conserved domain protein n=11 Tax=Deltaproteobacteria RepID=Q747C5_GEOSL Length = 447 Score = 216 bits (549), Expect = 1e-54, Method: Composition-based stats. 
Identities = 76/380 (20%), Positives = 152/380 (40%), Gaps = 53/380 (13%) Query: 23 LRRYKAQIKQSISEAINKRSVTDVDSGES---VSIPTEDISEPMFHQ------GRGGLRH 73 L R + + + + I + +G V +PT + E + H Sbjct: 72 LERDRLREEDGLPRKIRIGKLIKPGAGGKEKIVVVPT-TVEEKLIHDRAPEETEEDESMG 130 Query: 74 RVHPGNDHFVQNDRIERPQGGGGGSGSGQGQASQDGEGQDEFVFQISKDEYLDLLFEDLA 133 G++ + ++ RPQ GG +G G+ +G + + + + +L E Sbjct: 131 GTGDGDEGEIIGEQPVRPQQEGGSGTAGHGE--GEGHELESTAYDLGR-----ILTERFD 183 Query: 134 LPNLKQNQQRQLTEYKTHRAGYTANGVPANISVVRSLQNSLARRTAMTAGKRRELHALEE 193 LPNLK+ ++ + + Y+ + N R L ++ + E + Sbjct: 184 LPNLKEKGKK------SSLSHYSYDLTDRN----RGFGQILEKKQTLRRIL--ETNIALG 231 Query: 194 NLAIISNSEPAQLLEEERLRKEIAELRAKIERVPFIDTFDLRYKNYEKRPDPSSQAVMFC 253 +A ++ +P +L + + +RV I + +L Y SQA++F Sbjct: 232 TVADVAEIDPTRL------------VISPRDRVYRILSRELEY---------ESQALVFF 270 Query: 254 LMDVSGSMDQSTKDMAKRFYILLYLFLSRTY-KNVEVVYIRHHTQAKEVDEHEFFY-SQE 311 + D SGSM+ + ++L+Y +L + + VE +I H A+EV + +Y + Sbjct: 271 IRDYSGSMEGKATEAVCSQHVLIYSWLLYQFARQVETRFILHDNDAREVPDFYTYYNLRV 330 Query: 312 TGGTIVSSALKLMDEVVKERYNPAQWNIYAAQASDGDNWADDSPLCHEILAKKLLPVVRY 371 GGT V++A ++++E+V++ +NIY +DGD+W + L +++L Sbjct: 331 AGGTRVAAAYRMVNEIVEKESLARDYNIYVFHGTDGDDWDTNGEETIPEL-RRMLAYANR 389 Query: 372 YSYIEITRRAHQTLWREYEH 391 + E E Sbjct: 390 IGVTIAEHTYGSSGNTEVER 409 >UniRef50_A4BJU7 Putative uncharacterized protein n=1 Tax=Reinekea blandensis MED297 RepID=A4BJU7_9GAMM Length = 555 Score = 172 bits (435), Expect = 3e-41, Method: Composition-based stats. 
Identities = 46/263 (17%), Positives = 81/263 (30%), Gaps = 26/263 (9%) Query: 175 ARRTAMTAGKRRELHALEENLAIISNSEPAQLLEEERLRKEIAELRAKIERVPFIDTFDL 234 R + +EE + + PA ++ + + L Sbjct: 124 RRFLNQGMRPPADSIRVEEFINYFDYALPAPDTTNTPIQISTERTQTPWNPQTELVRVSL 183 Query: 235 RYKNYEKRPDPSSQAVMFCLMDVSGSMDQSTK-DMAKRFYILLYLFLSRTYKNVEVVYIR 293 + + + P + L+DVSGSM+ K + +R + LL L + VY Sbjct: 184 QSYRSDFKTLPPLN--LVFLLDVSGSMNSPDKLPLMQRSFNLLVSQLRPQDRVAIAVYAG 241 Query: 294 HHTQAKEVD--------EHEFFYSQETGGTIVSSALKLMDEVVKERYNPAQWNIYAAQAS 345 E + GGT S+ + L ++ + Y P N + Sbjct: 242 QSGVVLEPTSGDQKAQINQAINQLRAGGGTHGSAGIHLAYDLAQANYLPDGINR-IFIGT 300 Query: 346 DGD-NWADDSPLCHEILAKKLLPV---VRYYSYIEITRRAHQTLWREYEHLQSTFDNFAM 401 DGD N S + L ++ + + T + L E + + + Sbjct: 301 DGDFNVGTTSLTELKALIERKREAGVFLSVLGFG--TGNYNDALMEELSNHGNGTAYYL- 357 Query: 402 QHIRDQDDIYPVFRELFHKQNAT 424 D Y R+LF Q A Sbjct: 358 -------DSYQEARKLFATQLAA 373 >UniRef50_D2EEA7 Putative uncharacterized protein n=1 Tax=Candidatus Parvarchaeum acidiphilum ARMAN-4 RepID=D2EEA7_9EURY Length = 373 Score = 156 bits (393), Expect = 2e-36, Method: Composition-based stats. 
Identities = 46/204 (22%), Positives = 90/204 (44%), Gaps = 23/204 (11%) Query: 231 TFDLRYKNYEKRPDPSSQAVMFCLMDVSGSMDQSTKDMAKRFYILLYLFLSRTYKN-VEV 289 D+RY E++ P+ A + D+SGSM+ + + L+ +L Y++ V++ Sbjct: 172 DEDIRYNLIEEKLIPNLSATFVIMRDISGSMEMYG-EFSATIAGLIEFWLKEKYEHTVKI 230 Query: 290 VYIRHHTQAKEVD---EHEFFYSQETGGTIVSSALKLMDEVVK-----------ERYNPA 335 Y+ H +A E D +FF +GGT + A KL+ ++ ER + Sbjct: 231 RYVAHTDEAFEYDPRKREDFFKLSSSGGTAFNPAYKLVIDMTDGASYKSNSPYKERIDYQ 290 Query: 336 QWNIYAAQASDGDNWADDSPLCHEILAKKLLPVVRYYSYIEITRRAHQTLWREYEHLQST 395 +++ +DGDN+ + E L KKL P + Y+++ + Y ++S Sbjct: 291 SEDVFLLHITDGDNYNGEDEAVRETL-KKLFPRLTKVFYLQVGGYSDSF----YNLIKSV 345 Query: 396 FDNFAMQHIRDQDDI-YPVFRELF 418 + ++ +DI Y +++ Sbjct: 346 DPE-KLSEVKSGNDISYNNVKKVL 368 >UniRef50_A9DKM0 Putative uncharacterized protein (Fragment) n=1 Tax=Shewanella benthica KT99 RepID=A9DKM0_9GAMM Length = 167 Score = 139 bits (351), Expect = 2e-31, Method: Composition-based stats. Identities = 101/153 (66%), Positives = 124/153 (81%), Gaps = 1/153 (0%) Query: 1 MTWFIDRRLNGKNKSMVNRQRFLRRYKAQIKQSISEAINKRSVTDVDSGESVSIPTEDIS 60 M FIDRRLN + +S VNRQRF+ RYK QIK+++S+A+ +RSVTDVD GE +SIPT+DIS Sbjct: 16 MANFIDRRLNARGRSTVNRQRFINRYKQQIKKAVSDAVTRRSVTDVDKGERISIPTKDIS 75 Query: 61 EPMFHQGRGGLRHRVHPGNDHFVQNDRIERPQGGGGGSGSGQGQASQDGEGQDEFVFQIS 120 EP FHQG+GG+R RVHPGND F++ D+IER GGG GSGQG AS GEG D+FVFQIS Sbjct: 76 EPSFHQGQGGIRERVHPGNDQFIKGDKIER-PPGGGSQGSGQGDASNSGEGDDDFVFQIS 134 Query: 121 KDEYLDLLFEDLALPNLKQNQQRQLTEYKTHRA 153 KDEYL+LLFEDL LPNL+ N+ +L EY+ +RA Sbjct: 135 KDEYLELLFEDLELPNLQNNRLNKLVEYQVYRA 167 >UniRef50_B4DA43 von Willebrand factor type A n=1 Tax=Chthoniobacter flavus Ellin428 RepID=B4DA43_9BACT Length = 883 Score = 117 bits (294), Expect = 5e-25, Method: Composition-based stats. 
Identities = 42/248 (16%), Positives = 77/248 (31%), Gaps = 29/248 (11%) Query: 190 ALEENLAIISNSEPAQLLEEERLRKEIAELRAKIERVPFIDTFDLRYKNYEKRPDPSSQA 249 +EE L P + + L+ + K P S Sbjct: 365 RIEELLNYFPYDYPQPQG-AAPFSATMEVATCPWAPEHRLVRVGLKGREIPKDERPPSN- 422 Query: 250 VMFCLMDVSGSMDQSTKD--MAKRFYILLYLFLSRTYKNVEVV-------YIRHHTQAKE 300 + L+DVSGSM+ K + K F +L+ + V +V + TQ KE Sbjct: 423 -LVFLIDVSGSMNMPNKLPLLQKCFSLLVEQLGPK--DRVSIVTYASGTKLVLEPTQDKE 479 Query: 301 VDEHEFFYSQETGGTIVSSALKLMDEVVKERYNPAQWNIYAAQASDGD-NWADDSPLCHE 359 + GGT SS + L + ++ + P N A+DGD N + Sbjct: 480 AMQTAIDGLHAGGGTHGSSGIDLAYRMAQQSFIPGGTNRVIL-ATDGDWNIGITNQSELL 538 Query: 360 ILAKKLLPV---VRYYSYIEITRRAHQTLWREYEHLQSTFDNFAMQHIRDQDDIYPVFRE 416 + + + + ++ + + + D R+ Sbjct: 539 SMITRKAKSGVFLTVLGFG--LDNLKDSMLVKLADHGNGHYAYI--------DTEQEARK 588 Query: 417 LFHKQNAT 424 +F Q ++ Sbjct: 589 VFVDQLSS 596 >UniRef50_D2VY88 Predicted protein n=1 Tax=Naegleria gruberi RepID=D2VY88_NAEGR Length = 1082 Score = 114 bits (286), Expect = 5e-24, Method: Composition-based stats. 
Identities = 41/238 (17%), Positives = 84/238 (35%), Gaps = 39/238 (16%) Query: 199 SNSEPAQLLEEERLRKEIAELRAKIERVPFIDTFDLRYKNYEKRPDPSSQAVMFCLMDVS 258 SN + Q+ E+ R E+ L + + + + P P+ + L DVS Sbjct: 82 SNQKVEQVEEKSLFRIEVEHLIDLDDHDIGVAELTIHVNDPSLNPKPT---LFIALADVS 138 Query: 259 GSMDQSTKDMAKRFYILLYLFLSRTYKNVEV--VYIRHHTQAKEVD------------EH 304 GSM + L F +++ N + + + + AKE+D E Sbjct: 139 GSMQGRPWEQVCTS---LKHFAQQSFNNPAIICRMVAYESSAKEIDMKGTLQSIIRNIET 195 Query: 305 EFFYSQETGGTIVSSALKLMDEVVKERYNPAQW-----NIYAAQASDGDNWADDS-PLCH 358 F GGT +SA +L ++ + N+ +DG++++ P Sbjct: 196 AF----TGGGTDFASAFQLACTIITRESGQDRENLPFGNVVITFLTDGEDFSKVGKPGGL 251 Query: 359 EILAKKLLPVVR------YYSYIEITRRAHQTLWREYEHLQSTFDNFAMQHIRDQDDI 410 + L++++ V R + + L + + + + D +D+ Sbjct: 252 QYLSEEINRVYRGDITIHTVGFG---SHHNLELLDNIRKVGTIEGAYRYANYDDNNDV 306 >UniRef50_A4C5H3 Von Willebrand factor type A domain protein n=1 Tax=Pseudoalteromonas tunicata D2 RepID=A4C5H3_9GAMM Length = 608 Score = 97.9 bits (242), Expect = 6e-19, Method: Composition-based stats. 
Identities = 39/265 (14%), Positives = 78/265 (29%), Gaps = 26/265 (9%) Query: 175 ARRTAMTAGKRRELHALEENLAIISNSEPAQLLEEERLRKEIAELRAKIERVPFIDTFDL 234 R M + E + A E A + + Sbjct: 170 RRMIKMGKRPPADAVREEAFINYFDYHYSAPKSLETPFNVHTEVAPAPWNNQRQLLKIGI 229 Query: 235 RYKNYEKRPDPSSQAVMFCLMDVSGSMDQSTK-DMAKRFYILLYLFLSRTYKNVEVVY-- 291 + + EK ++ + L+DVSGSM+ K + K +L L VVY Sbjct: 230 KGFDIEKAELKAAN--LVFLLDVSGSMNAPDKLPLLKSSLTMLTKQLDENDSVAIVVYAG 287 Query: 292 ----IRHHTQAKE--VDEHEFFYSQETGGTIVSSALKLMDEVVKERYNPAQWNIYAAQAS 345 + T+ E V + G T + ++L ++ + + N A+ Sbjct: 288 AAGLVLPATKGNEYQVISNALNNLSAGGSTNGAQGIELAYQIASQNFKKEGINRVIL-AT 346 Query: 346 DGD-NWADDSPLCHEILAKKLLP---VVRYYSYIEITRRAHQTLWREYEHLQSTFDNFAM 401 DGD N S + L + + + + L + ++ + + Sbjct: 347 DGDFNVGMSSVDALKKLIANKRKTGIALTTLGFGQ--GNYNDGLMEQLANIGNGQHAYI- 403 Query: 402 QHIRDQDDIYPVFRELFHKQNATAK 426 D R++ + ++ Sbjct: 404 -------DTINEARKVLVDELSSTM 421 >UniRef50_C6M483 von Willebrand factor type A domain protein n=1 Tax=Neisseria sicca ATCC 29256 RepID=C6M483_NEISI Length = 538 Score = 95.6 bits (236), Expect = 3e-18, Method: Composition-based stats. 
Identities = 42/275 (15%), Positives = 91/275 (33%), Gaps = 23/275 (8%) Query: 164 ISVVRSLQNSLARRTAMTAGKRRELHALEENLAIISNSEPAQLLEEERLRKEIAELRAKI 223 I V ++ R ++ +EE + + P + + + Sbjct: 90 IDVDTGSYANVRRFLTNGEQPPKDAVRIEEIVNYFPYNYPLP-TDNRPFAVHTETIDSPW 148 Query: 224 ERVPFIDTFDLRYKNYEKRPDPSSQAVMFCLMDVSGSMDQSTK-DMAKRFYILLYLFLSR 282 + + ++ ++ K+ P + + L+DVSGSMD+ K + ++ +L L Sbjct: 149 QPEAKLIKIGIQAQDTAKKDLPPAN--LVFLVDVSGSMDEENKLPLVQKTLRILTQQLRP 206 Query: 283 TYKNVEVVYIR--------HHTQAKEVDEHEFFYSQETGGTIVSSALKLMDEVVKERYNP 334 K + Y KE + G T SAL++ E ++ + P Sbjct: 207 QDKVTLITYASGEDLVLPPTSGADKETILSAIDKLRAGGATDGESALQMAYEQAQKAFVP 266 Query: 335 AQWNIYAAQASDGD-NWA-DDSPLCHEILAKKLLPVVRYYSYIEITRRAHQTLWREYEHL 392 N A+DGD N D+ ++A+K V + ++ + + Sbjct: 267 NGINRILL-ATDGDFNVGVSDTETLKSMVAEKRKSGVSLSTLGFGMGNYNEDMMEQIADA 325 Query: 393 QSTFDNFAMQHIRDQDDIYPVFRELFHKQNATAKG 427 ++ D +++ +Q + Sbjct: 326 GDGNYSYI--------DNEKEAKKVLQQQLTSTLA 352 >UniRef50_A9DBX8 Putative uncharacterized protein n=1 Tax=Hoeflea phototrophica DFL-43 RepID=A9DBX8_9RHIZ Length = 668 Score = 95.2 bits (235), Expect = 5e-18, Method: Composition-based stats. 
Identities = 40/302 (13%), Positives = 87/302 (28%), Gaps = 35/302 (11%) Query: 146 TEYKTHRAGYTANGVPA---------NISVVRSLQNSLARRTAMTAGKRRELHALEENLA 196 + G+ +NGV + + V + + R +EE + Sbjct: 195 EAERDRVEGFDSNGVRSVAEYPVSTFSADVDTASYAMVRRALKQGVMPDPRTVRIEEMVN 254 Query: 197 IISNSEPAQLLEEERLRKEIAELRAKIERVPFIDTFDLRYKNYEKRPDPSSQAVMFCLMD 256 + PA E R + + ++ + + P + V+ L+D Sbjct: 255 YFNYDYPAPESVETPFRATVTVTPTPWNANTRLLHIGVKGYDVKPAARPQANLVL--LVD 312 Query: 257 VSGSMDQSTK-DMAKRFYILLYLFLSRTYKNVEVVYIRHHTQAKEVDE--------HEFF 307 VSGSM ++ K + K + LL L V Y E Sbjct: 313 VSGSMQETDKLPLLKSAFRLLIQKLEPEDTVSIVTYAGDAGTVLEPTPASDKAKILDALD 372 Query: 308 YSQETGGTIVSSALKLMDEVVKERYNPAQWNIYAAQASDGD-NWADDSPLCHEILAKKLL 366 + G T ++ ++ + ++ N A+DGD N + L ++ Sbjct: 373 DLRPGGSTAGAAGIEEAYRLAEKARVNGGVNRVLL-ATDGDFNVGASDDDALKSLIEEKR 431 Query: 367 PV---VRYYSYIEITRRAHQTLWREYEHLQSTFDNFAMQHIRDQDDIYPVFRELFHKQNA 423 + + + + + L + + + D + ++ Sbjct: 432 ESGVFLSIFGFGQ--GNYNDQLMQTLAQNGNGVAAYI--------DTLAEAEKTLAQEAT 481 Query: 424 TA 425 + Sbjct: 482 AS 483 >UniRef50_C5BTM6 von Willebrand factor type A domain protein n=1 Tax=Teredinibacter turnerae T7901 RepID=C5BTM6_TERTT Length = 689 Score = 91.0 bits (224), Expect = 8e-17, Method: Composition-based stats. 
Identities = 47/286 (16%), Positives = 87/286 (30%), Gaps = 23/286 (8%) Query: 131 DLALPN-LKQNQQRQLTEYKTHRAGYTANGVPAN--ISVVRSLQNSLARRTAMTAGKRRE 187 DL P+ L+ + T+ T + I V + + + R+ ++ Sbjct: 210 DLEPPHQLETADRDHFDTVATNPIKVTREEPVSTFSIDVDTASYSFVRRQLNRGQLPQKA 269 Query: 188 LHALEENLAIISNSEPAQLLEEERLRKEIAELRAKIERVPFIDTFDLRYKNYEKRPDPSS 247 LEE + P + I + A + + + K P + Sbjct: 270 AVRLEEMVNYFPYDYPLPSAATAPFKPTITVIPAPWNQAKRLVHIGI--KALPLAHPPKA 327 Query: 248 QAVMFCLMDVSGSMDQSTK-DMAKRFYILLYLFLSRTYKNVEVVYIRHHTQAKEVD---E 303 + L+DVSGSM K + K+ LL L T VVY E E Sbjct: 328 N--LVFLLDVSGSMGSPDKLPLVKQSMELLLSGLQPTDTVSIVVYAGAAGTVLEPTPVAE 385 Query: 304 H-----EFFYSQETGGTIVSSALKLMDEVVKERYNPAQWNIYAAQASDGD-NWADDSPLC 357 G T + ++L ++ + Y N A+DGD N P Sbjct: 386 QQKILAALDRLNAGGSTAGAQGIELAYQLAEANYQRDAVNRIIL-ATDGDFNVGIADPEQ 444 Query: 358 HEILAKKLLP---VVRYYSYIEITRRAHQTLWREYEHLQSTFDNFA 400 + ++ + + + + L ++ + + Sbjct: 445 LKGYVERKRANGIELSILGFG--SGNYNDALMQQLAQNGNGVAAYI 488 >UniRef50_C6BAR1 von Willebrand factor type A n=7 Tax=Alphaproteobacteria RepID=C6BAR1_RHILS Length = 706 Score = 89.8 bits (221), Expect = 2e-16, Method: Composition-based stats. 
Identities = 39/278 (14%), Positives = 84/278 (30%), Gaps = 19/278 (6%) Query: 137 LKQNQQRQLTEYKTHRAGYTANGVPANISVVRSLQNSLARRTAMTAGKRRELH-ALEENL 195 L N++R + V + V S + RR+ L +EE + Sbjct: 227 LDPNRERFANAAANPIKSVATDPVSTFSADVDSASYAFVRRSLTGGAMPDPLSVRVEEMI 286 Query: 196 AIISNSEPAQLLEEERLRKEIAELRAKIERVPFIDTFDLRYKNYEKRPDPSSQAVMFCLM 255 P ++ + + + R + ++ + P + + L+ Sbjct: 287 NYFPYDWPGPNNADQPFKATVTVMPTPWNRDTELMHVAIKGYDIAPATTPRAN--LVFLI 344 Query: 256 DVSGSMDQSTK-DMAKRFYILLYLFLSRTYKNVEVVYIRHHTQAKEVDE--------HEF 306 DVSGSMD+ K + K + L+ L V Y + Sbjct: 345 DVSGSMDEPDKLPLLKSAFRLMVNRLKADDTVSIVTYAGNAGTVLAPTRVAEKSKILSAI 404 Query: 307 FYSQETGGTIVSSALKLMDEVVKERYNPAQWNIYAAQASDGD-NWADDSPLCHEILAKKL 365 + G T + ++ ++ K+ + N A+DGD N S + + ++ Sbjct: 405 DRLEPGGSTGGAEGIEAAYDLAKQGFVKDGVNRVML-ATDGDFNVGPSSDGDLKRIIEEK 463 Query: 366 LPV---VRYYSYIEITRRAHQTLWREYEHLQSTFDNFA 400 + + + +L + + + Sbjct: 464 RKDGIFLTVLGFG--RGNLNDSLMQTLAQNGNGSAAYI 499 >UniRef50_A8TM27 Putative Ser protein kinase n=1 Tax=alpha proteobacterium BAL199 RepID=A8TM27_9PROT Length = 318 Score = 87.9 bits (216), Expect = 6e-16, Method: Composition-based stats. Identities = 32/73 (43%), Positives = 45/73 (61%) Query: 4 FIDRRLNGKNKSMVNRQRFLRRYKAQIKQSISEAINKRSVTDVDSGESVSIPTEDISEPM 63 IDRR N + KS+ NRQRFLRR K Q+ +++ +A +R + DV GE + IPT+ ++EP Sbjct: 230 IIDRRRNSQGKSLANRQRFLRRAKRQVTEAVRQASAERRIRDVADGEQIVIPTDGLNEPR 289 Query: 64 FHQGRGGLRHRVH 76 F LR H Sbjct: 290 FRHDARRLRLDRH 302 >UniRef50_B1KPQ5 von Willebrand factor type A n=8 Tax=Gammaproteobacteria RepID=B1KPQ5_SHEWM Length = 640 Score = 86.7 bits (213), Expect = 1e-15, Method: Composition-based stats. 
Identities = 48/309 (15%), Positives = 94/309 (30%), Gaps = 28/309 (9%) Query: 130 EDLALPNLKQN-QQRQLTEYKTHRAGYTANGVPANISVVRSLQNSLARRTAMTAGKRREL 188 +D+ LP L+ + + AG + I V ++L R R Sbjct: 135 QDIYLPELQNRDKFERQVANGIMVAGEIPVSTFS-IDVDTGSYSTLRRSINHGVLPERGT 193 Query: 189 HALEENLAIISNSEPAQLLEEERLRKEIAELRAKIERVPFIDTFDLRYKNYEKRPDPSSQ 248 +EE + + PA E+ + + L+ EK +SQ Sbjct: 194 VRVEELINYFAYQYPAPDAGEQPFSVNTELAPSPYNPHKMLLRIGLKGFEKEKADLGASQ 253 Query: 249 AVMFCLMDVSGSMDQSTK-DMAKRFYILLYLFLSRTYKNVEVVYIRHHTQAKE------- 300 + L+DVSGSM K + K +L L + VVY + Sbjct: 254 --LVFLLDVSGSMSSQDKLPLLKNALKMLSQQLDEGDRISIVVYAGASGVVLDGVKGNDT 311 Query: 301 -VDEHEFFYSQETGGTIVSSALKLMDEVVKERYNPAQWNIYAAQASDGD-NWADDSPLCH 358 + G T + ++L ++ ++ + N A+DGD N Sbjct: 312 LAISQALDKLKAGGSTNGGAGIELAYQLAQKHFIAGGVNRVIL-ATDGDFNVGVSDQQAL 370 Query: 359 EILAKKLLP---VVRYYSYIEITRRAHQTLWREYEHLQSTFDNFAMQHIRDQDDIYPVFR 415 E + ++ + + + + L + + + D R Sbjct: 371 EDMIEEKRKQGIALTTLGFGQ--GNYNDHLMEQLADKGNGHYAYI--------DTLNEAR 420 Query: 416 ELFHKQNAT 424 ++ + + Sbjct: 421 KVLVDEISA 429 >UniRef50_C0YQB8 von Willebrand factor, type A n=2 Tax=Flavobacteriaceae RepID=C0YQB8_9FLAO Length = 800 Score = 86.7 bits (213), Expect = 1e-15, Method: Composition-based stats. 
Identities = 48/276 (17%), Positives = 82/276 (29%), Gaps = 27/276 (9%) Query: 164 ISVVRSLQNSLARRTAMTAGKRRELHALEENLAIISNSEPAQLLEEERLRKEIAELRAKI 223 I V + +++ R + +EE + P E A Sbjct: 356 IDVDNASYSNVRRMINNGQVVDKNAVRIEEMVNYFKYDYPQPKNEN-PFSINTEYSDAPW 414 Query: 224 ERVPFIDTFDLRYKNYEKRPDPSSQAVMFCLMDVSGSMDQSTK-DMAKRFYILLYLFLSR 282 + L+ KN P+S + L+DVSGSM K + K + +L L Sbjct: 415 NPKHKLLKIGLQGKNLPMDKLPASN--LVFLIDVSGSMSDENKLPLLKSSFKVLLNQLRP 472 Query: 283 TYKNVEVVY------IRHHTQAKEVDE--HEFFYSQETGGTIVSSALKLMDEVVKERYNP 334 K VVY + T A E D+ Q G T + ++L ++ +E + Sbjct: 473 KDKVGIVVYAGSAGMVLPPTSAGEKDKIIEALDRLQAGGSTAGGAGIELAYKLAQENFVK 532 Query: 335 AQWNIYAAQASDGD-NWADDSPLCHEILAKKLLPV---VRYYSYIEITRRAHQTLWREYE 390 N A+DGD N S + L + + + Sbjct: 533 EGNNRVI-IATDGDFNVGTSSISDLKTLIEDRRKSGVFLTCLGFG--MGNYKDNTLETLA 589 Query: 391 HLQSTFDNFAMQHIRDQDDIYPVFRELFHKQNATAK 426 + + D + K+ A + Sbjct: 590 DKGNGNYAYI--------DNMQEANKFLGKEFAGSM 617 >UniRef50_A9KS19 von Willebrand factor type A n=1 Tax=Clostridium phytofermentans ISDg RepID=A9KS19_CLOPH Length = 551 Score = 86.4 bits (212), Expect = 2e-15, Method: Composition-based stats. 
Identities = 42/273 (15%), Positives = 82/273 (30%), Gaps = 35/273 (12%) Query: 165 SVVRSLQNSLARRTAMTAGKRRELHALEENLAIISNSEPAQLLEEERLRKEIAELRAKIE 224 ++ R L+ RR A + +EE L + + Sbjct: 122 NIRRMLKE--GRRVDTGAVR------IEEMLNYFNYDYKLPEGDS-PFGITTELSDCPWN 172 Query: 225 RVPFIDTFDLRYKNYEKRPDPSSQAVMFCLMDVSGSM-DQSTKDMAKRFYILLYLFLSRT 283 + ++ + + S + L+DVSGSM D+ + +R ++LL L+ Sbjct: 173 PDTKLFLAGIQTEKIDFSKSAPSN--LVFLIDVSGSMMDEDKLPLVQRAFLLLTENLTEK 230 Query: 284 YKNVEVVYIRHHTQA--------KEVDEHEFFYSQETGGTIVSSALKLMDEVVKERYNPA 335 + V Y + T KE ++ + G T S ++ ++ E Y Sbjct: 231 DRISIVTYAGNDTVVLSGAKGNQKEKIQNAITELEAGGSTFGSKGIETAYQLAMENYIEG 290 Query: 336 QWNIYAAQASDGD-NWADDSPLCHEILAKKLLP---VVRYYSYIEITRRAHQTLWREYEH 391 N A+DGD N S L ++ + + T Sbjct: 291 GNNRVIL-ATDGDLNVGVTSESELTNLIEEKRKSGVALSVLGFG--TGNIKDNKMEALAD 347 Query: 392 LQSTFDNFAMQHIRDQDDIYPVFRELFHKQNAT 424 + + D R++ ++ Sbjct: 348 HGNGNYAYI--------DSLMEARKVLVEEMGA 372 >UniRef50_C8SEV7 von Willebrand factor type A n=4 Tax=Rhizobiales RepID=C8SEV7_9RHIZ Length = 718 Score = 85.6 bits (210), Expect = 3e-15, Method: Composition-based stats. Identities = 33/240 (13%), Positives = 75/240 (31%), Gaps = 18/240 (7%) Query: 174 LARRTAMTAGKRRELHALEENLAIISNSEPAQLLEEERLRKEIAELRAKIERVPFIDTFD 233 + R + + +EE + ++ + + Sbjct: 279 VRRSLKEGFVPQADTVRVEEMINYFPYDWKGPDSASTPFNSTVSVMPTPWNTHTKLMHVA 338 Query: 234 LRYKNYEKRPDPSSQAVMFCLMDVSGSMDQSTK-DMAKRFYILLYLFLSRTYKNVEVVY- 291 ++ + + P + + L+DVSGSMD+ K + K + LL L V Y Sbjct: 339 IKGFDVKPTEQPKAN--LVFLIDVSGSMDEPDKLPLLKSAFRLLVSKLKADDTISIVTYA 396 Query: 292 -----IRHHTQAKEVDE--HEFFYSQETGGTIVSSALKLMDEVVKERYNPAQWNIYAAQA 344 + T+ E D+ + Q G T + +K ++ ++ + N A Sbjct: 397 GDAGTVLMPTKIAEKDKILNAIDNLQPGGSTAGEAGIKEAYKLAQQSFIKDGVNRVML-A 455 Query: 345 SDGD-NWADDSPLCHEILAKKLLPV---VRYYSYIEITRRAHQTLWREYEHLQSTFDNFA 400 +DGD N + L ++ + + + + + + + + Sbjct: 456 TDGDFNVGQTDDDDLKRLIEQERKTGVFLSVFGFG--RGNLNDEMMQTIAQNGNGTAAYI 513 >UniRef50_B0T5X0 von Willebrand factor type A n=1 Tax=Caulobacter sp. 
K31 RepID=B0T5X0_CAUSK Length = 592 Score = 85.2 bits (209), Expect = 4e-15, Method: Composition-based stats. Identities = 38/275 (13%), Positives = 80/275 (29%), Gaps = 26/275 (9%) Query: 164 ISVVRSLQNSLARRTAMTAGKRRELHALEENLAIISNSEPAQLLEEERLRKEIAELRAKI 223 I V + ++ R A + +EE + +E + + + + Sbjct: 147 IDVDTAAYANVRRFLNEGAAPPHDALRVEELINYFDYGYARPTAQEPPFKPTVTVVPSPW 206 Query: 224 ERVPFIDTFDLRYKNYEKRPDPSSQAVMFCLMDVSGSMDQSTK-DMAKRFYILLYLFLSR 282 + + ++ + P + L+D SGSM + +AK+ +L L Sbjct: 207 SQDRQLMHIGVQGYATPRAGQPPLN--LVFLIDTSGSMSGPDRLPLAKKALNVLIDQLRP 264 Query: 283 TYKNVEVVYIRH--------HTQAKEVDEHEFFYSQETGGTIVSSALKLMDEVVKERYNP 334 + V Y ++K + G T L+L + ++ +P Sbjct: 265 QDRVSMVAYAGSAGAVLSPTDGKSKLKMRCALTALRSGGSTAGGQGLELAYALARQNLDP 324 Query: 335 AQWNIYAAQASDGD-NWADDSPLCHEILAKKLLP---VVRYYSYIEITRRAHQTLWREYE 390 N +DGD N P + + Y + + T+ + Sbjct: 325 KAVNRVIL-MTDGDFNVGIADPTRLKDFVADQRKSGVYLSVYGFG--RGNYNDTMMQALA 381 Query: 391 HLQSTFDNFAMQHIRDQDDIYPVFRELFHKQNATA 425 + + D R+L +A Sbjct: 382 QNGNGTAAYV--------DGLQEARKLLRDDFDSA 408 >UniRef50_Q21MJ3 von Willebrand factor, type A n=5 Tax=Proteobacteria RepID=Q21MJ3_SACD2 Length = 708 Score = 84.0 bits (206), Expect = 1e-14, Method: Composition-based stats. 
Identities = 34/265 (12%), Positives = 82/265 (30%), Gaps = 26/265 (9%) Query: 174 LARRTAMTAGKRRELHALEENLAIISNSEPAQLLEEERLRKEIAELRAKIERVPFIDTFD 233 + R+ ++ EE + + P + I + + + + Sbjct: 271 VRRQLNSGYLPEKDAIRAEELINYFDYNYPLPSDSTAPFKPNITVIDSPWAKGKKLVHIG 330 Query: 234 LRYKNYEKRPDPSSQAVMFCLMDVSGSMDQSTK-DMAKRFYILLYLFLSRTYKNVEVVYI 292 L+ + P + + L+DVSGSM+ K + K+ +L L+ VVY Sbjct: 331 LKGYDIAPDQKPRTN--LVFLLDVSGSMNSQDKLPLVKQSMEMLLSTLNPDDTVAIVVYA 388 Query: 293 RHHTQAKEVDE--------HEFFYSQETGGTIVSSALKLMDEVVKERYNPAQWNIYAAQA 344 E Q G T + + L ++ + ++ N A Sbjct: 389 GAAGTVLEPTPAKDKQKILSAMQRLQAGGSTAGGAGIALAYDLAEANFDKKAVNRVIL-A 447 Query: 345 SDGD-NWADDSPLCHEILAKKLLPV---VRYYSYIEITRRAHQTLWREYEHLQSTFDNFA 400 +DGD N + + ++ + + + + L + + + Sbjct: 448 TDGDFNVGSTNNETLQGFVERKREKGIFLSVLGFGQ--GNYNDHLMQTLAQNGNGVAAYI 505 Query: 401 MQHIRDQDDIYPVFRELFHKQNATA 425 D +++ ++ +++ Sbjct: 506 --------DTVSEAQKVLVQEASSS 522 >UniRef50_Q2N8R4 Von Willebrand factor type A domain protein n=2 Tax=Erythrobacter RepID=Q2N8R4_ERYLH Length = 580 Score = 83.7 bits (205), Expect = 1e-14, Method: Composition-based stats. 
Identities = 38/261 (14%), Positives = 75/261 (28%), Gaps = 24/261 (9%) Query: 175 ARRTAMTAGKRRELHALEENLAIISNSEPAQLLEEERLRKEIAELRAKIERVPFIDTFDL 234 R + + EE + + R + L Sbjct: 143 RRFLSQGQMPPKAAVRTEEFINYFRYDYDRPQDRSQPFTVNFDAARTPWNEDTRLIRIGL 202 Query: 235 RYKNYEKRPDPSSQAVMFCLMDVSGSMDQSTK-DMAKRFYILLYLFLSRTYKNVEVVYIR 293 + E+ P + + LMDVSGSM + K + K L L K VVY Sbjct: 203 AGYDIERSERPPAN--LVFLMDVSGSMGRPDKLPLVKTALAGLAGELQPQDKVSIVVYAG 260 Query: 294 HHTQAKEVDEH------EFFYSQETGGTIVSSALKLMDEVVKERYNPAQWNIYAAQASDG 347 E Q G T + ++L ++ ++ + N A+DG Sbjct: 261 AAGLVLEPTNDTRKIRAALNQLQAGGSTAGGAGIQLAYQIAEDNFIEGGVNRVIL-ATDG 319 Query: 348 D-NWADDSPLCHEILAKKLLP---VVRYYSYIEITRRAHQTLWREYEHLQSTFDNFAMQH 403 D N S + +K + + T ++ + + + + + Sbjct: 320 DFNVGVSSRDALIEMIEKKRDSGITLTTLGFG--TGNYNEAMMEQIANHGNGNYAYI--- 374 Query: 404 IRDQDDIYPVFRELFHKQNAT 424 D +++ + ++ Sbjct: 375 -----DSALEAKKVLGDEMSS 390 >UniRef50_C6VVX3 von Willebrand factor type A n=17 Tax=Bacteria RepID=C6VVX3_DYAFD Length = 625 Score = 83.7 bits (205), Expect = 1e-14, Method: Composition-based stats. 
Identities = 41/274 (14%), Positives = 85/274 (31%), Gaps = 27/274 (9%) Query: 164 ISVVRSLQNSLARRTAMTAGKRRELHALEENLAIISNSEPAQLLEEERLRKEIAELRAKI 223 + V R+ +++ R + +EE + P E + + Sbjct: 167 VDVDRAAYSNVRRFLNNGQMPPEDAVRIEEMINYFDYDYPQPRGEH-PVAIVAETTDSPW 225 Query: 224 ERVPFIDTFDLRYKNYEKRPDPSSQAVMFCLMDVSGSMDQSTK-DMAKRFYILLYLFLSR 282 + L+ K +S + L+DVSGSM+++ K + K+ + LL L Sbjct: 226 NPGLKLVHIGLQAKTVSAENLSASN--LVFLIDVSGSMNEANKLPLLKQAFKLLADQLRV 283 Query: 283 TYKNVEVVYIRH--------HTQAKEVDEHEFFYSQETGGTIVSSALKLMDEVVKERYNP 334 K V Y K+ + + G T ++L ++ K+ + P Sbjct: 284 EDKISIVAYAGSAGMVLAPTSGSEKKTIKDALDKLEAGGSTAGGEGIELAYDLAKKHFLP 343 Query: 335 AQWNIYAAQASDGD-NWADDSPLCHEILAKKLLPV---VRYYSYIEITRRAHQTLWREYE 390 N A+DGD N + + L ++ + + + Sbjct: 344 KGNNRVIL-ATDGDFNVGISNESELQKLIEEKRKAGIFLSVMGFG--MGNYKDSHVETLA 400 Query: 391 HLQSTFDNFAMQHIRDQDDIYPVFRELFHKQNAT 424 + + D R++F ++ Sbjct: 401 DKGNGNYAYI--------DNIQEARKVFVQEFGG 426 >UniRef50_Q4KKB4 von Willebrand factor type A domain protein n=20 Tax=Proteobacteria RepID=Q4KKB4_PSEF5 Length = 582 Score = 83.3 bits (204), Expect = 2e-14, Method: Composition-based stats. 
Identities = 39/265 (14%), Positives = 72/265 (27%), Gaps = 26/265 (9%) Query: 176 RRTAMTAGKRRELHALEENLAIISNSEPAQLLEEERLRKEIAELRAKIERVPFIDTFDLR 235 RR E E L + A + + + ++ Sbjct: 129 RRLLNQGSLPPEGAVRLEELVNYFPYDYALPTDGSPFGVTTELAPSPWNPHTRLLRIGIK 188 Query: 236 YKNYEKRPDPSSQAVMFCLMDVSGSMDQST-KDMAKRFYILLYLFLSRTYKNVEVVYIRH 294 + + + L+DVSGSMD+ + K LL L + VVY Sbjct: 189 ASDRAVAELAPAN--LVFLVDVSGSMDRREGLPLVKSTLKLLVDQLRDQDRVSLVVYAGE 246 Query: 295 HTQAKEVD--------EHEFFYSQETGGTIVSSALKLMDEVVKERYNPAQWNIYAAQASD 346 E G T +S ++L ++ ++ + N A+D Sbjct: 247 SRVVLEPTSGRDKAKIRTAIDQLTAGGSTAGASGIQLAYQMAQQGFIDQGINRILL-ATD 305 Query: 347 GD-NWAD---DSPLCHEILAKKLLPVVRYYSYIEITRRAHQTLWREYEHLQSTFDNFAMQ 402 GD N DS +K + + ++ L + + Sbjct: 306 GDFNVGVSDFDSLKAMAAEKRKSGVSLTTLGFG--VDNYNEHLMEQLADAGDGNYAYI-- 361 Query: 403 HIRDQDDIYPVFRELFHKQNATAKG 427 D R++ Q ++ Sbjct: 362 ------DNLREARKVLVDQLSSTLA 380 >UniRef50_UPI0001C375AE von Willebrand factor type A n=1 Tax=Ruminococcus flavefaciens FD-1 RepID=UPI0001C375AE Length = 550 Score = 82.5 bits (202), Expect = 2e-14, Method: Composition-based stats. 
Identities = 46/291 (15%), Positives = 87/291 (29%), Gaps = 30/291 (10%) Query: 141 QQRQLTEYKTHRAGYTANGVPANISVVRSLQNS---------LARRTAMTAGKRRELHAL 191 ++ +L GYT G S S ++ + R + + Sbjct: 75 EEPELPSANEEYKGYTEAGFKDTKSEPLSTFSADVDTASYTNVRRLIENRNIVPEDAVRI 134 Query: 192 EENLAIISNSEPAQLLEEERLRKEIAELRAKIERVPFIDTFDLRYKNYEKRPDPSSQAVM 251 EE + P R + R + ++ K +++ P S + Sbjct: 135 EEFINYFDYDYPQPEDGSAFGRY-VEIADCPWNRDHKLMMVGIQGKELQQQETPPSN--L 191 Query: 252 FCLMDVSGSMDQSTK-DMAKRFYILLYLFLSRTYKNVEVVYIRHHT------QAKEVDE- 303 L+D SGSM+ K + + + +L L + + V Y + DE Sbjct: 192 VFLIDSSGSMNSYDKLPLVQSAFSMLAEQLDKNDRISIVTYAGSSAVLLDGEKGSNTDEI 251 Query: 304 -HEFFYSQETGGTIVSSALKLMDEVVKERYNPAQWNIYAAQASDGD-NWADDSPLCHEIL 361 + + +G T +K E+ +E + N A+DGD N S L Sbjct: 252 LEQLYSITASGSTNGEGGIKTAYELAEEHFIKGGNNRVIL-ATDGDLNVGASSEEELTRL 310 Query: 362 AKKLLP---VVRYYSYIEITRRAHQTLWREYEHLQSTFDNFAMQHIRDQDD 409 + + + E + NF+ D+ + Sbjct: 311 IETKRDNGIYLSVLGFGE--GNYKDARMEALADNGNG--NFSYIDSEDEAE 357 >UniRef50_C7PNZ7 von Willebrand factor type A n=5 Tax=Bacteroidetes RepID=C7PNZ7_CHIPD Length = 639 Score = 82.5 bits (202), Expect = 2e-14, Method: Composition-based stats. 
Identities = 42/274 (15%), Positives = 76/274 (27%), Gaps = 27/274 (9%) Query: 164 ISVVRSLQNSLARRTAMTAGKRRELHALEENLAIISNSEPAQLLEEERLRKEIAELRAKI 223 I V R+ +++ R + +EE + + + Sbjct: 191 IDVDRASYSNVRRFLNEGNMPPVDAVRVEEMINYFDYKY-SNPTGNTPVAVRTDMAICPW 249 Query: 224 ERVPFIDTFDLRYKNYEKRPDPSSQAVMFCLMDVSGSMDQSTK-DMAKRFYILLYLFLSR 282 + L+ K+ K P S + L+DVSGSM + K + K+ + LL L Sbjct: 250 NTAHQLVRIALKGKDVAKDNLPPSN--LVFLIDVSGSMSDAKKLPLVKQAFKLLVNQLRP 307 Query: 283 TYKNVEVVYI--------RHHTQAKEVDEHEFFYSQETGGTIVSSALKLMDEVVKERYNP 334 + VVY K + G T ++L + E Sbjct: 308 VDRVAIVVYAGAAGLVLPSTSGDHKTAILDALDKLEAGGSTAGGEGVQLAYKTATEYLLK 367 Query: 335 AQWNIYAAQASDGD-NWADDSPLCHEILAKKLLPV---VRYYSYIEITRRAHQTLWREYE 390 + N A+DGD N S + + +K + + Sbjct: 368 SGNNRVI-IATDGDFNVGPSSDGELQRIIEKKREKGIFLSVLGFG--MGNYKDNKLELLA 424 Query: 391 HLQSTFDNFAMQHIRDQDDIYPVFRELFHKQNAT 424 + + D + R F + Sbjct: 425 DKGNGNYAYI--------DNFEEARRTFATEFGG 450 >UniRef50_C0CVB5 Putative uncharacterized protein n=1 Tax=Clostridium asparagiforme DSM 15981 RepID=C0CVB5_9CLOT Length = 556 Score = 82.5 bits (202), Expect = 3e-14, Method: Composition-based stats. 
Identities = 42/266 (15%), Positives = 75/266 (28%), Gaps = 27/266 (10%) Query: 172 NSLARRTAMTAGKRRELHALEENLAIISNSEPAQLLEEERLRKEIAELRAKIERVPFIDT 231 +L R+ + +EE L + P E+E + Sbjct: 105 ANLRRKILEGNEVPADAVRIEEMLNYFTYDYPEP-TEDEPFSVTTYIGDCPWNENHKLLQ 163 Query: 232 FDLRYKNYEKRPDPSSQAVMFCLMDVSGSMDQSTKD-MAKRFYILLYLFLSRTYKNVEVV 290 L+ + + S + L+DVSGSM+ + K + KR ++LL L V Sbjct: 164 IGLQAEKPDLENQKPSN--LVFLIDVSGSMESADKLGLVKRAFLLLTENLRPEDTVSIVT 221 Query: 291 YIRHHT--------QAKEVDEHEFFYSQETGGTIVSSALKLMDEVVKERYNPAQWNIYAA 342 Y T + K G T S ++ + +E + N Sbjct: 222 YASSDTVVLDGVSGEEKAAIMTAIENLTAGGSTDGSKGIETAYRLAEEHFQKDGNNRVIL 281 Query: 343 QASDGD-NWADDSPLCHEILAKKLLPV---VRYYSYIEITRRAHQTLWREYEHLQSTFDN 398 A+DGD N S L +K + + T + Sbjct: 282 -ATDGDLNLGLTSEGDLTRLIQKKKESGVFLSVMGFG--TGNIKDNKMEALADNGNGQYA 338 Query: 399 FAMQHIRDQDDIYPVFRELFHKQNAT 424 + D + + ++ Sbjct: 339 YV--------DSLMEAKRVLVEELGG 356 >UniRef50_UPI0001913F8A hypothetical protein Salmonellaentericaenterica_26029 n=2 Tax=Salmonella enterica subsp. enterica serovar Typhi RepID=UPI0001913F8A Length = 88 Score = 81.4 bits (199), Expect = 6e-14, Method: Composition-based stats. Identities = 78/88 (88%), Positives = 82/88 (93%) Query: 145 LTEYKTHRAGYTANGVPANISVVRSLQNSLARRTAMTAGKRRELHALEENLAIISNSEPA 204 LT KTHRAG+T+NGVPANISVVRSLQNSLARRTAMTAGKRRELHALE L IS+SEPA Sbjct: 1 LTSNKTHRAGFTSNGVPANISVVRSLQNSLARRTAMTAGKRRELHALETELETISHSEPA 60 Query: 205 QLLEEERLRKEIAELRAKIERVPFIDTF 232 QLLEEERLR+EIAELRAKIERVPFIDTF Sbjct: 61 QLLEEERLRREIAELRAKIERVPFIDTF 88 >UniRef50_B8F8Z5 von Willebrand factor type A n=1 Tax=Desulfatibacillum alkenivorans AK-01 RepID=B8F8Z5_DESAA Length = 558 Score = 78.7 bits (192), Expect = 4e-13, Method: Composition-based stats. 
Identities = 42/319 (13%), Positives = 86/319 (26%), Gaps = 48/319 (15%) Query: 122 DEYLDLLFEDLALPNLKQNQQRQLTEYKTHRAGYTANGVPANISVVRSLQNSLARRTAMT 181 +EY + P + + N+ V +++ Sbjct: 90 EEYAPIREGGFKSPLYDPLSTFSIDVDTASYSNVRRFLSYGNMPPVDAVR---------- 139 Query: 182 AGKRRELHALEENLAIISNSEPAQLLEEERLRKEIAELRAKIERVPFIDTFDLRYKNYEK 241 +EE + P ++ + + R + L+ + + Sbjct: 140 ---------IEEMINYFHYDYPQPKG-QDPFSITMEMSQCPWNRDNMLVHVGLQGRCLDY 189 Query: 242 RPDPSSQAVMFCLMDVSGSMDQSTK-DMAKRFYILLYLFLSRTYKNVEVV-------YIR 293 + S + L+DVSGSM+ K + KR +L L V +V + Sbjct: 190 KDVKPSN--LVFLLDVSGSMNSENKLPLVKRSMEMLVKELGAG-DRVSIVTYAGSAGLVL 246 Query: 294 HHTQAKEVDE--HEFFYSQETGGTIVSSALKLMDEVVKERYNPAQWNIYAAQASDGD-NW 350 T A+ + + G T ++L V E P N +DGD N Sbjct: 247 PSTSARNKRKIITALDRLEAGGSTAGGEGIELAYRVAWENLIPEGNNRVIL-CTDGDFNV 305 Query: 351 A-DDSPLCHEILAKKLLP--VVRYYSYIEITRRAHQTLWREYEHLQSTFDNFAMQHIRDQ 407 +P ++ +K + + + + + Sbjct: 306 GVSSTPELVRMIEEKRRAGIYLTICGFG--MGNYKDEKMEAISNAGNGNFYYI------- 356 Query: 408 DDIYPVFRELFHKQNATAK 426 D ++F + Sbjct: 357 -DSRREAHKVFVQDMRANM 374 >UniRef50_A3PN61 von Willebrand factor, type A n=9 Tax=Rhodobacteraceae RepID=A3PN61_RHOS1 Length = 651 Score = 77.5 bits (189), Expect = 1e-12, Method: Composition-based stats. 
Identities = 37/250 (14%), Positives = 61/250 (24%), Gaps = 18/250 (7%) Query: 164 ISVVRSLQNSLARRTAMTAGKRRELHALEENLAIISNSEPAQLLEEERLRKEIAELRAKI 223 I V + L RE +EE + PA R ++ R Sbjct: 212 IDVDTASYAILRSSLRAGQLPPREAVRIEEMINYFPYDYPAPENGTPPFRPTLSITRTPW 271 Query: 224 ERVPFIDTFDLRYKNYEKRPDPSSQAVMFCLMDVSGSMDQSTK-DMAKRFYILLYLFLSR 282 + L+ + P + L+D SGSM K + K+ + L+ L Sbjct: 272 NPETRLVHVALQGRMPAIEDRPPLN--LVFLIDTSGSMQDPAKLPLLKQSFGLMLGRLRP 329 Query: 283 TYKNVEVVYIRHHTQAKEVDE--------HEFFYSQETGGTIVSSALKLMDEVVKERYNP 334 + V Y + G T L L E Sbjct: 330 EDQVAIVTYAGSAGEVLAPTAANQRSTILSALDRLDAGGSTAGDEGLALAYRTASEMAGA 389 Query: 335 AQWNIYAAQASDGD-NWADDSPLCHEILA---KKLLPVVRYYSYIEITRRAHQTLWREYE 390 + A+DGD N P L + + + + Sbjct: 390 GEVTRVVL-ATDGDFNLGISDPEELARLVAHERDTGVYLSVLGFG--RGNLDDATMQALA 446 Query: 391 HLQSTFDNFA 400 + + Sbjct: 447 QNGNGQAAYI 456 >UniRef50_UPI00016C54C8 hypothetical protein GobsU_21830 n=1 Tax=Gemmata obscuriglobus UQM 2246 RepID=UPI00016C54C8 Length = 638 Score = 76.3 bits (186), Expect = 2e-12, Method: Composition-based stats. 
Identities = 34/266 (12%), Positives = 72/266 (27%), Gaps = 26/266 (9%) Query: 172 NSLARRTAMTAGKRRELHALEENLAIISNSEPAQLLEEERLRKEIAELRAKIERVPFIDT 231 ++ R L E + S + + + + Sbjct: 197 ANVRRMLNEGTLPPASAVFLAEFVNYFPYSYAPPPAGADPVAFHVEMGPCPWNAKHHLLR 256 Query: 232 FDLRYKNYEKRPDPSSQAVMFCLMDVSGSMDQSTK-DMAKRFYILLYLFLSRTYKNVEVV 290 ++ P + L+D SGSM Q + + ++ LL L+ + V Sbjct: 257 VGVQAHQIPAEKLPPRN--LVFLVDTSGSMQQENRLPLVQKSLELLVEKLTEKDRVSVVT 314 Query: 291 YIRHHTQAKEVDEHE--------FFYSQETGGTIVSSALKLMDEVVKERYNPAQWNIYAA 342 Y A Q GGT +K + ++ + N Sbjct: 315 YAGDSRVALPPTSGADKKAILDVVTGLQANGGTNGEGGIKKAYQFARDTFLDGGVNRVIL 374 Query: 343 QASDGD-NWA-DDSPLCHEILAKKLLPV--VRYYSYIEITRRAHQTLWREYEHLQSTFDN 398 +DGD N D+ +++ ++ + Y +E + + Sbjct: 375 -CTDGDFNVGVVDNGELVKLIEEQRKSKVFLTVLGYG--MGNYKDDRLKELANHGNGHHA 431 Query: 399 FAMQHIRDQDDIYPVFRELFHKQNAT 424 + D +++F +Q Sbjct: 432 YI--------DTLDEAKKVFVEQGGA 449 >UniRef50_A5WCP1 von Willebrand factor, type A n=2 Tax=Moraxellaceae RepID=A5WCP1_PSYWF Length = 571 Score = 76.3 bits (186), Expect = 2e-12, Method: Composition-based stats. 
Identities = 40/306 (13%), Positives = 92/306 (30%), Gaps = 31/306 (10%) Query: 137 LKQNQQRQLTEYKTHRAGYTANGVPANISVVRSLQN--SLARRTAMTAGKRRELHALEEN 194 + QQ E + + T+ A +S+ + ++ R ++ +EE Sbjct: 100 MAPKQQENYAEIEPNAVNATSEQAFATLSIDTDTGSYANVRRFLNQGQLPPKDAVRVEEL 159 Query: 195 LAIISNS-EPAQLLEEERLRKEIAELRAKIERVPFIDTFDLRYKNY--EKRPDPSSQAVM 251 + + A+ + + I ++ ++ K+ P + + Sbjct: 160 INYFNYDFTAAKKQANAPFLVSTEVVNSPWHPTNQIVKVGIKAEDLLTAKQKQPPAN--L 217 Query: 252 FCLMDVSGSMDQSTK-DMAKRFYILLYLFLSRTYKNVEVVYIRHHTQAKEVDE------- 303 L+DVSGSMD K +AK +L L + Y + Sbjct: 218 VFLVDVSGSMDTEDKLQLAKSSLKMLTKQLRAQDSITLITYAGNTKVVLPSTPGNQTQKI 277 Query: 304 -HEFFYSQETGGTIVSSALKLMDEVVKERYNPAQWNIYAAQASDGD-NWADDSPLCHEIL 361 + +G T +A+KL + E + N +DGD N S + Sbjct: 278 LNAIDNLTASGSTNGEAAIKLAYQQATEHFKKDGINR-ILMLTDGDFNVGVSSVKDMLQI 336 Query: 362 AKKLLPV---VRYYSYIEITRRAHQTLWREYEHLQSTFDNFAMQHIRDQDDIYPVFRELF 418 + + + + + + + + ++ D +++ Sbjct: 337 IRSNRDKGISLSTLGFGQ--GNYNDHMMEQVADNGNGNYSYI--------DSLSEAKKVL 386 Query: 419 HKQNAT 424 + + Sbjct: 387 IDEMSA 392 >UniRef50_Q7UP85 Putative uncharacterized protein n=1 Tax=Rhodopirellula baltica RepID=Q7UP85_RHOBA Length = 885 Score = 76.0 bits (185), Expect = 3e-12, Method: Composition-based stats. 
Identities = 55/410 (13%), Positives = 109/410 (26%), Gaps = 61/410 (14%) Query: 31 KQSISEAINKRSVTDVDSGESVSIPTEDISEPMFHQGRGGLRHRVHPGNDHFVQNDRIER 90 K A + S G+ P + R R + + + Sbjct: 316 KDEAPTAPREPSAGKPVVGDFAVAPVPE-QLGRQQFDFRASRGRTLE--RQLGETEELA- 371 Query: 91 PQGGGGGSGSGQGQASQDGEGQ--DEFVFQISKDEYLDLLFEDLALPNLKQNQQRQLTEY 148 P G G D+F P +++N+ R++ + Sbjct: 372 PTSDRLAILPPTPDGEGQGPGMSGDKFE------------------P-IQENEFRRVADD 412 Query: 149 KTHRAGYTANGVPANISVVRSLQNSLARRTAMTAGKRRELHALEENLAIISNSEPAQLLE 208 + ++ + A+ + VRS R + +EE + E Sbjct: 413 A--LSTFSIDVDTASYAKVRSYLQR-------GQLPRPDSVRIEELINYFDYQYTPPSAE 463 Query: 209 EE-RLRKEIAELRAKIERVPFIDTFDLRYKNYEKRPDPSSQAVMFCLMDVSGSMDQSTK- 266 + +A + ++ K+ +++ P + L+D SGSM + K Sbjct: 464 DPVPFSSAMAVASCPWNENNRLVRVGIQAKDIDRKERPRCN--LVFLIDTSGSMKRPNKL 521 Query: 267 DMAKRFYILLYLFLSRTYKNVEVVYIRHHTQAKEVDE--------HEFFYSQETGGTIVS 318 + +L L + VVY + G T Sbjct: 522 PLVIEGMKVLLDQLKNRDRVAIVVYAGSSGLVLDSTPVKQKKKIIRALSALSAGGSTNGG 581 Query: 319 SALKLMDEVVKERYNPAQWNIYAAQASDGD-NW---ADDSPLCHEILAKKLLPVVRYYSY 374 + L+L + +E + N SDGD N D + K + + Sbjct: 582 AGLQLAYQTARENFIEDGVNRVIL-CSDGDFNVGMTGTDQLVAEATRQSKSGTELTVLGF 640 Query: 375 IEITRRAHQTLWREYEHLQSTFDNFAMQHIRDQDDIYPVFRELFHKQNAT 424 + + + + F D +++ Q A Sbjct: 641 G--MGNHNDAMMERISNSGAGNYAFV--------DTIAEAKKVLADQVAG 680 >UniRef50_B4WCU1 von Willebrand factor type A domain protein n=2 Tax=Caulobacteraceae RepID=B4WCU1_9CAUL Length = 613 Score = 76.0 bits (185), Expect = 3e-12, Method: Composition-based stats. 
Identities = 41/274 (14%), Positives = 75/274 (27%), Gaps = 29/274 (10%) Query: 164 ISVVRSLQNSLARRTAMTAGKRRELHALEENLAIISNSEPAQLLEEERLRKEIAELRAKI 223 I V + +++ R + +EE + A + + Sbjct: 160 IDVDTAAYSNVRRFIDEGRSPPADAVRVEELINAFDYGYARPTSLARPFAITTAVVASPW 219 Query: 224 ERVPF-----IDTFDLRYKNYEKRPDPSSQAVMFCLMDVSGSMDQSTK-DMAKRFYILLY 277 I L+ + + L+DVSGSM K D+AK+ L Sbjct: 220 APRTERGGRQIVHIGLQGYELPQGEQRPLN--LTFLVDVSGSMRSPDKLDLAKQAMNLAI 277 Query: 278 LFLSRTYKNVEVVYIRH---------HTQAKEVDEHEFFYSQETGGTIVSSALKLMDEVV 328 L + V Y K + +GGT ++ + + Sbjct: 278 DRLRPQ-DTLSVTYYAEGAGTTLQPTPGDQKLKMRCAVASLRASGGTAGATGMTNAYDQA 336 Query: 329 KERYNPAQWNIYAAQASDGD-NWA-DDSPLCHEILAKKLLPVVRYYSYIEITRRAHQTLW 386 + + + N +DGD N D+ + +A+K V Y Sbjct: 337 QASFARDKVNR-ILMFTDGDFNVGVTDNKRLEDYVAEKRGTGVYLSVYGFGRGNYQDARM 395 Query: 387 REYEHLQSTFDNFAMQHIRDQDDIYPVFRELFHK 420 + + + D+ R LF Sbjct: 396 QTIAQAGNGVAAYV-------GDL-RDARRLFGP 421 >UniRef50_C8N8N5 von Willebrand factor type A domain protein n=2 Tax=Gammaproteobacteria RepID=C8N8N5_9GAMM Length = 563 Score = 74.8 bits (182), Expect = 5e-12, Method: Composition-based stats. 
Identities = 38/283 (13%), Positives = 81/283 (28%), Gaps = 36/283 (12%) Query: 158 NGVPANISVVRSLQNSLARRTAMTAGKRRELHALEENLAIISNSEPAQLLEEERLRKEIA 217 G +NI + + +N L + +EE L + P + + Sbjct: 124 TGSYSNIRRMLTRENRL---------PPADAVRVEEILNYFAYGYPLPQ-DGKPFAVHTQ 173 Query: 218 ELRAKIERVPFIDTFDLRYKNYEKRPDPSSQAVMFCLMDVSGSMDQSTK-DMAKRFYILL 276 + + + + ++ + P + + L+D SGSMD K + K+ Sbjct: 174 TVDSPWQADAKLIRIAIQAADLAPEKRPPAN--LVFLIDTSGSMDDPDKLPLVKKTVCHF 231 Query: 277 YLFLSRTYKNVEVVYIRHHTQ--------AKEVDEHEFFYSQETGGTIVSSALKLMDEVV 328 L + + Y + KE + G T AL++ + Sbjct: 232 AEALRADDRISLITYSGSTAEILPPTAGDQKETIIAALKPLRAHGATAGGEALRMAYDAA 291 Query: 329 KERYNPAQWNIYAAQASDGD-NWADDSPLCHEILAKKLLP---VVRYYSYIEITRRAHQT 384 + Y N A+DGD N P + + Y + + Sbjct: 292 AKNYRKDGINRILL-ATDGDFNVGISDPATLKNYVADKRKSGISLTTLGYG--SGNYNDE 348 Query: 385 LWREYEHLQSTFDNFAMQHIRDQDDIYPVFRELFHKQNATAKG 427 + + ++ D +++ +Q + Sbjct: 349 MMEQLADAGDGNYSYI--------DSEAEAKKVLVRQLTSTLA 383 >UniRef50_C0Z5D5 Putative uncharacterized protein n=1 Tax=Brevibacillus brevis NBRC 100599 RepID=C0Z5D5_BREBN Length = 513 Score = 74.8 bits (182), Expect = 6e-12, Method: Composition-based stats. 
Identities = 39/252 (15%), Positives = 76/252 (30%), Gaps = 28/252 (11%) Query: 186 RELHALEENLAIISNSEPAQLLEEERLRKEIAELRAKIERVPFIDTFDLRYKNYEKRPDP 245 E +EE + S PA + + + ++ I ++ K + Sbjct: 119 AEAVRVEEFINFFPTSYPAP--TNQTFAIQADSGPSPFQKNLQIVRIGIKGKELSPKERK 176 Query: 246 SSQAVMFCLMDVSGSMDQSTK-DMAKRFYILLYLFLSRTYKNVEVVYIRH--------HT 296 + + ++DVSGSM+Q + ++ K+ +L L T VVY T Sbjct: 177 PAN--LVFVIDVSGSMNQENRLELVKKSLHVLVDQLQPTDSVGIVVYGSEGRVLLPPTST 234 Query: 297 QAKEVDEHEFFYSQETGGTIVSSALKLMDEVVKERYNPAQWNIYAAQASDG-DNWADDSP 355 + K+ Q G T L L E+ + P N SDG N + Sbjct: 235 EDKQAILSAIDELQPEGSTNAEQGLVLGYEMAARSFKPPAINRVIL-CSDGVANVGETGA 293 Query: 356 LCHEILAKKLLP---VVRYYSYIEITRRAHQTLWREYEHLQSTFDNFAMQHIRDQDDIYP 412 + + + + + + + + + D + Sbjct: 294 EGILRSIEDYARKDIYLSSFGFG--MGNYNDVMMEQLANKGEGSYAYI--------DTFS 343 Query: 413 VFRELFHKQNAT 424 R +F + Sbjct: 344 EARRIFTESLTG 355 >UniRef50_A1ZHE2 Von Willebrand factor type A domain protein n=1 Tax=Microscilla marina ATCC 23134 RepID=A1ZHE2_9SPHI Length = 704 Score = 74.4 bits (181), Expect = 8e-12, Method: Composition-based stats. 
Identities = 34/259 (13%), Positives = 77/259 (29%), Gaps = 27/259 (10%) Query: 164 ISVVRSLQNSLARRTAMTAGKRRELHALEENLAIISNSEPAQLLEEE--------RLRKE 215 I V + +++ R + +EE + P ++ Sbjct: 252 IDVDNASYSNVRRFVNDGQPLPKNAVRVEEMINYFEYDYPQPTPTKDKEGKLQTHPFSVN 311 Query: 216 IAELRAKIERVPFIDTFDLRYKNYEKRPDPSSQAVMFCLMDVSGSMDQSTK-DMAKRFYI 274 + L+ +N + + + + L+D SGSMD K + KR + Sbjct: 312 TEYGTCPWNPHHKLLQIGLQGENLQTKNASPAN--LVFLVDASGSMDSEDKLPLLKRSFK 369 Query: 275 LLYLFLSRTYKNVEV---------VYIRHHTQAKEVDEHEFFYSQETGGTIVSSALKLMD 325 +L L+ + + + V +E + G T ++L Sbjct: 370 VLLKQLTDSRTKIAIVAYAGASGLVLPATSVSHREKILTALENIESGGSTAGGEGIELAY 429 Query: 326 EVVKERYNPAQWNIYAAQASDGD-NWADDSPLCHEILAKKLLP---VVRYYSYIEITRRA 381 ++ ++ + N A+DGD N S L + + T Sbjct: 430 KIAQQAFIAGGNNRVIL-ATDGDFNVGLSSDEELMQLISNKRKSGVYLTCLGFG--TGNL 486 Query: 382 HQTLWREYEHLQSTFDNFA 400 + ++ + + + + Sbjct: 487 NDSMMEKLTNAGNGNYYYI 505 >UniRef50_A9IU52 Putative lipoprotein n=3 Tax=Bordetella RepID=A9IU52_BORPD Length = 582 Score = 74.0 bits (180), Expect = 9e-12, Method: Composition-based stats. 
Identities = 34/262 (12%), Positives = 69/262 (26%), Gaps = 22/262 (8%) Query: 174 LARRTAMTAGKRRELHALEENLAIISNSEPAQLLEEERLRKEIAELRAKIERVPFIDTFD 233 + R + EE + ++ A + Sbjct: 147 VRRLLNEGRLPPPDAVRAEEFINYFDYGYATPDSRQQPFSIITEVSAAPWNPQRQLLKIG 206 Query: 234 LRYKNYEKRPDPSSQAVMFCLMDVSGSMDQSTK-DMAKRFYILLYLFLSRTYKNVEVVYI 292 ++ + P++ + L+D SGSM + K + K L L + V Y Sbjct: 207 IQGYRVAPQDIPAAN--LVFLVDTSGSMAERDKLPLIKGALKQLVAQLRPQDRVAIVTYA 264 Query: 293 RH--------HTQAKEVDEHEFFYSQETGGTIVSSALKLMDEVVKERYNPAQWNIYAAQA 344 K + G T + L L + + N A Sbjct: 265 GQASMTLDSTPGDQKARINAAIDELRAAGSTNGGAGLDLAYAQAAKGFVKGGVNRILL-A 323 Query: 345 SDGD-NWADDSPLCHE-ILAKKLLPVVRYYSYIEITRRAHQTLWREYEHLQSTFDNFAMQ 402 SDGD N + +A++ + + + L + + ++ Sbjct: 324 SDGDFNVGATDLEDLKDKIARQRQGGIALTTLGVGGGNFNDALAMQLADAGNGSYHYL-- 381 Query: 403 HIRDQDDIYPVFRELFHKQNAT 424 D R++ Q ++ Sbjct: 382 ------DSLREARKVLAAQMSS 397 >UniRef50_Q28U54 von Willebrand factor type A n=3 Tax=Rhodobacterales RepID=Q28U54_JANSC Length = 686 Score = 74.0 bits (180), Expect = 1e-11, Method: Composition-based stats. 
Identities = 40/271 (14%), Positives = 74/271 (27%), Gaps = 33/271 (12%) Query: 170 LQNSLARRTAMTAGKRRELHALEENLAIISNSEPAQLLEE-ERLRKEIAELRAKIERVPF 228 L+++L R A + +EE + PA ++ R + Sbjct: 253 LRSTLNR----GALPAPDAVRIEEMVNYFPYDYPAPTADDISPFRPNVQVFETPWNPDTQ 308 Query: 229 IDTFDLRYKNYEKRPDPSSQAVMFCLMDVSGSMDQSTK--DMAKRFYILLYLFLSRTYKN 286 + ++ P + L+D SGSM+ K + + F ++L LS + Sbjct: 309 LVHIGIQGDLPVVEDRPPLN--LVFLIDTSGSMNDPAKLPLLIQSFRLMLNR-LSPEDEV 365 Query: 287 VEVVYIRHHTQAKEVD--------EHEFFYSQETGGTIVSSALKLMDEVVKERYNPAQWN 338 V Y A E Q G T L+ + E + + Sbjct: 366 AIVTYAGSAGVALEPTAASDTATINAALTTLQAGGSTNGVGGLEEAYRLAGEMMVDGEVS 425 Query: 339 IYAAQASDGD-NWADDSPLCHEILAKKLLP---VVRYYSYIEITRRAHQTLWREYEHLQS 394 A+DGD N E + + + + + Sbjct: 426 RVLL-ATDGDFNVGLSDAGALEDYIAEQRDTGIYLSVLGFG--RGNLQDDTMQALAQNGN 482 Query: 395 TFDNFAMQHIRDQDDIYPVFRELFHKQNATA 425 ++ D + + Q A A Sbjct: 483 GTASYI--------DTLHEAQRVLVDQLAGA 505 >UniRef50_P76481 Uncharacterized protein yfbK n=38 Tax=Enterobacteriaceae RepID=YFBK_ECOLI Length = 575 Score = 73.6 bits (179), Expect = 1e-11, Method: Composition-based stats. 
Identities = 45/284 (15%), Positives = 79/284 (27%), Gaps = 24/284 (8%) Query: 158 NGVPANISVVRSLQNSLARRTAMTAGKRRELHALEENLA---IISNSEPAQLLEEERLRK 214 G AN V R L L + + + I + + + Sbjct: 130 TGSYAN--VRRFLNQGL-----LPPPDAVRVEEIVNYFPSDWDIKDKQSIPASKPIPFAM 182 Query: 215 EIAELRAKIERVPFIDTFDLRYKNYEKRPDPSSQAVMFCLMDVSGSM-DQSTKDMAKRFY 273 A + D+ K+ + P+S + L+D SGSM + + Sbjct: 183 RYELAPAPWNEQRTLLKVDILAKDRKSEELPASN--LVFLIDTSGSMISDERLPLIQSSL 240 Query: 274 ILLYLFLSRTYKNVEVVYIRHHTQA--------KEVDEHEFFYSQETGGTIVSSALKLMD 325 LL L V Y A K G T + L+L Sbjct: 241 KLLVKELREQDNIAIVTYAGDSRIALPSISGSHKAEINAAIDSLDAEGSTNGGAGLELAY 300 Query: 326 EVVKERYNPAQWNIYAAQASDGD-NWADDSPLCHEILAKKLLPV-VRYYSYIEITRRAHQ 383 + + + N A+DGD N D P E + KK V ++ ++ Sbjct: 301 QQATKGFIKGGINRILL-ATDGDFNVGIDDPKSIESMVKKQRESGVTLSTFGVGNSNYNE 359 Query: 384 TLWREYEHLQSTFDNFAMQHIRDQDDIYPVFRELFHKQNATAKG 427 + + + ++ Q + R++ K Sbjct: 360 AMMVRIADVGNGNYSYIDTLSEAQKVLNSEMRQMLITVAKDVKA 403 >UniRef50_A5F9T1 von Willebrand factor, type A n=9 Tax=Bacteria RepID=A5F9T1_FLAJ1 Length = 709 Score = 73.3 bits (178), Expect = 2e-11, Method: Composition-based stats. 
Identities = 46/276 (16%), Positives = 78/276 (28%), Gaps = 27/276 (9%) Query: 164 ISVVRSLQNSLARRTAMTAGKRRELHALEENLAIISNSEPAQLLEEERLRKEIAELRAKI 223 I V + ++ R ++ +EE + + P E + Sbjct: 264 IDVDNASYTNIRRFLNSGQEVPKDAVRVEEMVNFFKYNYPQPKNEH-PFSINTEYSDSPW 322 Query: 224 ERVPFIDTFDLRYKNYEKRPDPSSQAVMFCLMDVSGSMDQSTK-DMAKRFYILLYLFLSR 282 I L+ KN PSS + L+DVSGSM+ K + K+ +L L Sbjct: 323 NSQNKILKIGLQGKNIATNDLPSSN--LVFLIDVSGSMEDMNKLPLLKQSMKILVNELRP 380 Query: 283 TYKNVEVVYIRHHTQAKEVDE--------HEFFYSQETGGTIVSSALKLMDEVVKERYNP 334 T K VVY + G T + ++L ++ E + Sbjct: 381 TDKVSIVVYAGAAGMVLPPTSGNEKKTIIKALDQLEAGGSTAGGAGIELAYKIATENFIK 440 Query: 335 AQWNIYAAQASDGD-NWADDSPLCHEILAKKLLPV---VRYYSYIEITRRAHQTLWREYE 390 N A+DGD N S E L ++ + Y + Sbjct: 441 GGNNRVIL-ATDGDFNVGSSSNSDMEKLIEEKRKTGVFLTCLGYG--MGNYKDSKMEILA 497 Query: 391 HLQSTFDNFAMQHIRDQDDIYPVFRELFHKQNATAK 426 + + D K+ + Sbjct: 498 DKGNGNYAYI--------DNIQEANRFLGKEFKGSM 525 >UniRef50_A6GHE4 von Willebrand factor, type A n=1 Tax=Plesiocystis pacifica SIR-1 RepID=A6GHE4_9DELT Length = 785 Score = 73.3 bits (178), Expect = 2e-11, Method: Composition-based stats. Identities = 30/160 (18%), Positives = 48/160 (30%), Gaps = 11/160 (6%) Query: 251 MFCLMDVSGSMDQSTK-DMAKRFYILLYLFLSRTYKNVEVVYIRH--------HTQAKEV 301 + L+DVSGSM K + K + L L VVY KE Sbjct: 357 LVFLLDVSGSMSSRGKLPLIKHGFTQLVEQLGAEDHVSIVVYAGAAGVVLPPTSGDQKET 416 Query: 302 DEHEFFYSQETGGTIVSSALKLMDEVVKERYNPAQWNIYAAQASDGD-NWADDSPLCHEI 360 + GGT S+ + E+ + + N +DGD N Sbjct: 417 ILGALDRLEAGGGTNGSAGIVEAYELAQANFVDGGVNRVIL-GTDGDFNVGLSDHDALVE 475 Query: 361 LAKKLLPVVRYYSYIEITRRAHQTLWREYEHLQSTFDNFA 400 L ++ + S + + L + + F Sbjct: 476 LIEQKRESGVFLSVLGVGGHYDDELMEQLADHGNGNYAFL 515 >UniRef50_B0CCM8 von Willebrand factor, type A domain protein n=4 Tax=Cyanobacteria RepID=B0CCM8_ACAM1 Length = 686 Score = 72.9 bits (177), Expect = 2e-11, Method: Composition-based stats. 
Identities = 49/304 (16%), Positives = 90/304 (29%), Gaps = 22/304 (7%) Query: 130 EDLALPNLKQNQQRQLTEYKTHRAGYTANGVPANISVVRSLQNSLARRTAMTAGKRRELH 189 + L LP + + +I V + +++ R ++ Sbjct: 196 DRLHLPGTFNTEDYKRINENPFFLPQRTPLSTFSIDVDTASYSNVRRFIRQGQLPPKDAV 255 Query: 190 ALEENLAIISNSEPAQLLEEERLRKEIAELRAKIERVPFIDTFDLRYKNYEKRPDPSSQA 249 LEE + + ++ A + L+ K EK S Sbjct: 256 RLEELINYFDYGYASPKGDQ-PFSVSTEVATAPWNNQHKLVHIGLKGKELEKEQ--PSN- 311 Query: 250 VMFCLMDVSGSMDQSTKD-MAKRFYILLYLFLSRTYKNVEVVYIRH--------HTQAKE 300 + L+DVSGSM + K + K+ LL L + VVY K Sbjct: 312 -LVFLIDVSGSMKRPNKLALVKKSLCLLVHQLKPEDRVSLVVYAGRAGIVLPSTPGTQKA 370 Query: 301 VDEHEFFYSQETGGTIVSSALKLMDEVVKERYNPAQWNIYAAQASDGD-NWADDSPLCHE 359 + + G T ++ +K+ ++ + + N A+DGD N S E Sbjct: 371 TIMNAIDRLEAGGSTAGAAGIKMAYDMAERHFLKNGNNRVIL-ATDGDFNVGQSSDAELE 429 Query: 360 ILAKKLLPV---VRYYSYIEITRRAHQTLWREYEHLQSTFDNFAMQHIRDQDDIYPVFR- 415 L ++ + Y T + + + + Q + R Sbjct: 430 RLIEQKRDRGVFLTVLGYG--TGNYKDNKMELLANKGNGNYAYIDTLLEAQKVLVNDLRG 487 Query: 416 ELFH 419 LF Sbjct: 488 TLFT 491 >UniRef50_C1AC65 Putative uncharacterized protein n=1 Tax=Gemmatimonas aurantiaca T-27 RepID=C1AC65_GEMAT Length = 642 Score = 72.9 bits (177), Expect = 2e-11, Method: Composition-based stats. 
Identities = 44/274 (16%), Positives = 79/274 (28%), Gaps = 27/274 (9%) Query: 164 ISVVRSLQNSLARRTAMTAGKRRELHALEENLAIISNSEPAQLLEEERLRKEIAELRAKI 223 I V R+ + R + +EE + + + A Sbjct: 192 IDVDRASYGNARRFLQDGQRPPADAVRIEELINYFPYELREPRG-NDPVAITTEVTTAPW 250 Query: 224 ERVPFIDTFDLRYKNYEKRPDPSSQAVMFCLMDVSGSMDQSTK-DMAKRFYILLYLFLSR 282 + + L+ + E P + + L+DVSGSM K + K+ LL + Sbjct: 251 QPRHQLVRIALQSRRIETASLPPNN--LVFLIDVSGSMQSPDKLPLVKQSLRLLVDQMRP 308 Query: 283 TYKNVEVVYI--------RHHTQAKEVDEHEFFYSQETGGTIVSSALKLMDEVVKERYNP 334 + V Y KE + G T + ++L +E + Sbjct: 309 QDRVAIVAYAGAAGLVLPSTSGDEKETIIQAIERLEAGGSTAGGAGIELAYRTAREHFMD 368 Query: 335 AQWNIYAAQASDGD-NWADDSPLCHEILAKKLL---PVVRYYSYIEITRRAHQTLWREYE 390 N ASDGD N S E L ++ + + T + Sbjct: 369 HGNNRVIL-ASDGDFNVGVSSDGELERLIERKRTEGTYLTILGFG--TGNYQDAKMEKLA 425 Query: 391 HLQSTFDNFAMQHIRDQDDIYPVFRELFHKQNAT 424 + + DDI R++ ++ Sbjct: 426 KRGNGNYGYV-------DDIAEA-RKMLVREMGA 451 >UniRef50_C7PR69 von Willebrand factor type A n=1 Tax=Chitinophaga pinensis DSM 2588 RepID=C7PR69_CHIPD Length = 588 Score = 72.9 bits (177), Expect = 2e-11, Method: Composition-based stats. 
Identities = 48/274 (17%), Positives = 87/274 (31%), Gaps = 27/274 (9%) Query: 164 ISVVRSLQNSLARRTAMTAGKRRELHALEENLAIISNSEPAQLLEEERLRKEIAELRAKI 223 + V R+ +++ R + +EE + S P + + L Sbjct: 160 VDVDRAAYSNIRRFVKLKERIPANAVRIEEMVNYFHYSYPLPPVGQT-LAIYSNYATCPW 218 Query: 224 ERVPFIDTFDLRYKNYEKRPDPSSQAVMFCLMDVSGSMDQSTK-DMAKRFYILLYLFLSR 282 + +R K+ P S + L+DVSGSM K + + + +L L Sbjct: 219 AEDHRLLQIAVRGKSVNLDSLPPSN--LVFLIDVSGSMAMPNKLPLLQAAFRILVNNLRS 276 Query: 283 TYKNVEVVYI--------RHHTQAKEVDEHEFFYSQETGGTIVSSALKLMDEVVKERYNP 334 V Y AK + Y G T +A+KL ++ +E + Sbjct: 277 NDHVAIVAYAGVPGVILPSTPGSAKSKILNAIDYLSAGGATAGEAAIKLAYQIAEENFIK 336 Query: 335 AQWNIYAAQASDGD-NWADDSPLCHEILA---KKLLPVVRYYSYIEITRRAHQTLWREYE 390 N A+DGD N S E L K+ ++ + + + Sbjct: 337 EGNNRVIL-ATDGDFNVGQTSDHDMEQLILGKKETGVLLTCLGFG--MKNYKDSKLETLS 393 Query: 391 HLQSTFDNFAMQHIRDQDDIYPVFRELFHKQNAT 424 + NFA D ++F ++ + Sbjct: 394 SKGNG--NFAYI------DNLEEASKIFAREFGS 419 >UniRef50_A3D1E9 von Willebrand factor, type A n=8 Tax=Shewanella RepID=A3D1E9_SHEB5 Length = 642 Score = 71.3 bits (173), Expect = 6e-11, Method: Composition-based stats. 
Identities = 41/274 (14%), Positives = 75/274 (27%), Gaps = 26/274 (9%) Query: 164 ISVVRSLQNSLARRTAMTAGKRRELHALEENLAIISNSEPAQLLEEERLRKEIAELRAKI 223 I V +L R + +EE L + P + Sbjct: 155 IDVDTGSYATLRRMLREGRLPEKGTVRVEEMLNYFAYDYPLPAKNAAPFSVTTELAPSPY 214 Query: 224 ERVPFIDTFDLRYKNYEKRPDPSSQAVMFCLMDVSGSMDQSTK-DMAKRFYILLYLFLSR 282 + L+ + K +S + L+DVSGSM + K + + LL LS Sbjct: 215 NDDMMLLRIGLKGYDLPKSQLGASN--LVFLLDVSGSMASADKLPLLQTALKLLTAQLSA 272 Query: 283 TYKNVEVVYIRH--------HTQAKEVDEHEFFYSQETGGTIVSSALKLMDEVVKERYNP 334 K VVY + + G + ++ K+ + P Sbjct: 273 QDKVSIVVYAGAAGVVLDGVSGNDTQTLTYALEQLSAGGSINGGQGITQAYQLAKKHFIP 332 Query: 335 AQWNIYAAQASDGD-NWADDSPLCHEILAKKLLPV---VRYYSYIEITRRAHQTLWREYE 390 N A+DGD N L +K + + + L + Sbjct: 333 NGINRVIL-ATDGDFNVGVTDFDDLIALIEKEKDHGIGLTTLGFG--LGNYNDQLMEQLA 389 Query: 391 HLQSTFDNFAMQHIRDQDDIYPVFRELFHKQNAT 424 + + D R++ + ++ Sbjct: 390 DKGNGNYAYI--------DTLNEARKVLVDELSS 415 >UniRef50_A1ZU67 Von Willebrand factor type A domain protein n=1 Tax=Microscilla marina ATCC 23134 RepID=A1ZU67_9SPHI Length = 552 Score = 70.6 bits (171), Expect = 1e-10, Method: Composition-based stats. 
Identities = 44/284 (15%), Positives = 90/284 (31%), Gaps = 18/284 (6%) Query: 128 LFEDLALPNLKQNQQRQLTEYKTHRAGYTANGVPANISVVRSLQNSLARRTAMTAGKRRE 187 L ED++ P +K+ + + + + TA +I V + + + Sbjct: 75 LEEDVSPPKIKEKKPANENTFLSVK---TAPLSTFSIDVDNASYSRARKSINNGQLPSTS 131 Query: 188 LHALEENLAIISNSEPAQLLEEERLRKEIAELRAKIERVPFIDTFDLRYKNYEKRPDPSS 247 LEE + + Q + + + L+ K + R S Sbjct: 132 SVRLEEFINYFNYQY-KQPEGQHPFSVNTEVAKCPWNPKNHLVHIGLQGKRLDSRKLKLS 190 Query: 248 QAVMFCLMDVSGSMDQSTK-DMAKRFYILLYLFLSRTYKNVEVVYIRHHTQAKEVDE--- 303 + L+DVSGSM K + ++ + +L L + VVY + + Sbjct: 191 N--LVFLIDVSGSMSAPDKLPLLRKAFKMLVNNLGEEDRVAIVVYAGNAGLVLPATQGTD 248 Query: 304 -----HEFFYSQETGGTIVSSALKLMDEVVKERYNPAQWNIYAAQASDGD-NWADDSPLC 357 Q G T + +KL ++ K+ + N A+DGD N S Sbjct: 249 KQKIMEALDKLQSGGSTAGGAGIKLAYKIAKQNFIKEGNNRIIL-ATDGDFNLGASSDQA 307 Query: 358 HEILAKKLLPVVRYYSYIEIT-RRAHQTLWREYEHLQSTFDNFA 400 + L ++ + + + + + + + Sbjct: 308 MQNLIEEKRKEGVFITVLGLGMGNYRDSKMEIIADKGNGNYYYL 351 >UniRef50_D0XSR4 von Willebrand factor type A n=1 Tax=Brevundimonas subvibrioides ATCC 15264 RepID=D0XSR4_9CAUL Length = 625 Score = 69.4 bits (168), Expect = 2e-10, Method: Composition-based stats. 
Identities = 35/253 (13%), Positives = 66/253 (26%), Gaps = 19/253 (7%) Query: 164 ISVVRSLQNSLARRTAMTAGKRRELHALEENLAIISNSEPAQLLEEERLRKEIAELRAKI 223 I V + ++ R + R+ +EE + +E A + Sbjct: 169 IDVDTAAYANVRRFISEGQTPPRDAVRVEEMINYFDYGYARPGRADEPFAVSTAVAASPW 228 Query: 224 ERVPF-----IDTFDLRYKNYEKRPDPSSQAVMFCLMDVSGSMDQSTKD-MAKRFYILLY 277 I L+ + ++DVSGSM K +A++ L+ Sbjct: 229 SANAGAGGRQIVHIGLQGYELPAGERRPLN--LTFMVDVSGSMQSPDKLGLAQQTMNLII 286 Query: 278 LFLSRTYKNVEVVYIRHHTQAKEVD--------EHEFFYSQETGGTIVSSALKLMDEVVK 329 L + Y A G T + + E + Sbjct: 287 DRLRPEDRVAVTYYASDVGTAVGPTPGSEKLKLRCAVAALNAGGSTAGAQGMVNAYEQAE 346 Query: 330 ERYNPAQWNIYAAQASDGD-NWA-DDSPLCHEILAKKLLPVVRYYSYIEITRRAHQTLWR 387 ++P + N +DGD N D + +A K + Y + Sbjct: 347 AAFSPDKVNR-ILMFTDGDFNVGVTDDRRLEDYVADKRGTGIYLSVYGFGRGNYQDARMQ 405 Query: 388 EYEHLQSTFDNFA 400 + + Sbjct: 406 TIAQAGNGVAAYV 418 >UniRef50_A3ZT14 Putative uncharacterized protein n=1 Tax=Blastopirellula marina DSM 3645 RepID=A3ZT14_9PLAN Length = 616 Score = 69.4 bits (168), Expect = 2e-10, Method: Composition-based stats. 
Identities = 54/375 (14%), Positives = 105/375 (28%), Gaps = 54/375 (14%) Query: 63 MFHQGRGGLRHRVHPGNDHFVQNDRIERPQGGGGGSGSGQGQASQDGEGQDEFVFQISKD 122 R P + ++ G G G G D+F + Sbjct: 97 KVRSDARQDRLATLPTESRRLGIEQPNAAPGFMPQLDGIAGHGEGPGVGGDKFAYV---- 152 Query: 123 EYLDLLFEDLALPNLKQNQQRQLTEYKTHRAGYTANGVPANISVVRSLQNSLARRTAMTA 182 + N R + + + ++ + A+ S +RS Sbjct: 153 ---------------ENNPFRAVADEP--LSTFSIDVDTASYSKIRSYLIDYH-----QL 190 Query: 183 GKRRELHALEENLAIISNSEPAQLLEEERLRKEIAELRAKIERVPFIDTFDLRYKNYEKR 242 + + +EE + + + A +++ + + ++ K Sbjct: 191 PPQGAVR-VEELINYFTY-DYATPTDQKPFAANVEAAACPWNAEHRLVRIGIKGKEIANA 248 Query: 243 PDPSSQAVMFCLMDVSGSMDQSTK-DMAKRFYILLYLFLSRTYKNVEVVYI--------R 293 P+S + L+DVSGSM+ + K + K+ LL L K VVY Sbjct: 249 ERPASN--LVFLLDVSGSMNNARKLPLLKQGMKLLVDQLGENDKVAIVVYAGAAGMVLNS 306 Query: 294 HHTQAKEVDEHEFFYSQETGGTIVSSALKLMDEVVKERYNPAQWNIYAAQASDGD-NWAD 352 + K Q G T ++L + E + N +DGD N Sbjct: 307 TNGDDKSTIMEALDRLQAGGSTNGGQGIELAYQAATENFIKGGVNRVIL-CTDGDFNVGV 365 Query: 353 DSPLCHEILAKKLLPV---VRYYSYIEITRRAHQTLWREYEHLQSTFDNFAMQHIRDQDD 409 S +A + + T + + E + F D Sbjct: 366 TSTSDLVTMAADKAKSGVFLSVMGFG--TGNHNDAMMEELSGKANGNYAFI--------D 415 Query: 410 IYPVFRELFHKQNAT 424 +++ +Q + Sbjct: 416 TITEAKKVLVEQMSG 430 >UniRef50_A0NVX5 Putative uncharacterized protein n=1 Tax=Labrenzia aggregata IAM 12614 RepID=A0NVX5_9RHOB Length = 608 Score = 69.4 bits (168), Expect = 3e-10, Method: Composition-based stats. 
Identities = 32/258 (12%), Positives = 67/258 (25%), Gaps = 26/258 (10%) Query: 181 TAGKRRELHALEENLAIISNSEPAQLLEEERLRKEIAELRAKIERVPFIDTFDLRYKNYE 240 + +EE + + P ++ + + ++ Sbjct: 181 GRLPNPDAVRVEEMVNYFDYNYPVPEKGGHPFSTNVSVVDTPWNEHTKLMQVGIQGYKVP 240 Query: 241 KRPDPSSQAVMFCLMDVSGSMDQSTK-DMAKRFYILLYLFLSRTYKNVEVVYIRHHTQAK 299 PS + L+D SGSM + K + ++ + LL L + V Y Sbjct: 241 LDDLPSQN--LVFLIDTSGSMADANKLPLLQQSFRLLLSSLRDEDEVAIVTYAGSSGVLL 298 Query: 300 EVDE--------HEFFYSQETGGTIVSSALKLMDEVVKERYNPAQWNIYAAQASDGD-NW 350 E + + G T LK + + + A+DGD N Sbjct: 299 EPTKVADKTRILEKINALTSGGSTAGHEGLKGAYALAETMTGDGEQTRIIL-ATDGDFNV 357 Query: 351 ADDSPLCHEILAKKLLP---VVRYYSYIEITRRAHQTLWREYEHLQSTFDNFAMQHIRDQ 407 P + + + + + L + + Sbjct: 358 GLSDPDSLKRYVAEQRENGTALSVLGFG--RGNYNDELMQTLAQNGQGVAAYI------- 408 Query: 408 DDIYPVFRELFHKQNATA 425 D R++ Q ++ Sbjct: 409 -DTLSEARKVLVDQVVSS 425 >UniRef50_B1ZYN3 von Willebrand factor type A n=2 Tax=Bacteria RepID=B1ZYN3_OPITP Length = 792 Score = 69.0 bits (167), Expect = 3e-10, Method: Composition-based stats. 
Identities = 39/234 (16%), Positives = 70/234 (29%), Gaps = 35/234 (14%) Query: 165 SVVRSLQNSLARRTAMTAGKRRELHALEENLAIISNSEPAQ---------LLEEERLRKE 215 +V R L+ + +EE + A E Sbjct: 335 NVRRFLRE--------GRLPPADAVRIEELVNYFPYRYAAPGRVRDEGVAAPGEAPFAAA 386 Query: 216 IAELRAKIERVPFIDTFDLRYKNYEKRPDPSSQAVMFCLMDVSGSMDQSTK-DMAKRFYI 274 + A + L+ K+ ++ + L+DVSGSMDQ K + + Sbjct: 387 LEVAAAPWAAQHRLVRIGLKAKDAAVSGRAAAN--LVFLLDVSGSMDQPNKLRLVQESMR 444 Query: 275 LLYLFLSRTYKNVEVVYIRHHTQA---------KEVDEHEFFYSQETGGTIVSSALKLMD 325 LL L + V Y + A +E+ + + G T + L+L Sbjct: 445 LLLGRLQPEDRVAIVTYAGNSGLALPSTPVARQREILD-AIDELRAGGSTNGAMGLQLAY 503 Query: 326 EVVKERYNPAQWNIYAAQASDGD-NWADDSPLCHEILAKKLLPV---VRYYSYI 375 ++ K + N +DGD N S L ++ + + Sbjct: 504 DIAKANFVANGVNRVIL-CTDGDFNVGVTSEGELVRLIEEKAKSGVFLTVLGFG 556 >UniRef50_C7N770 Uncharacterized protein containing a von Willebrand factor type A (VWA) domain n=1 Tax=Slackia heliotrinireducens DSM 20476 RepID=C7N770_SLAHD Length = 629 Score = 68.3 bits (165), Expect = 5e-10, Method: Composition-based stats. 
Identities = 50/345 (14%), Positives = 94/345 (27%), Gaps = 27/345 (7%) Query: 75 VHPGNDHFVQNDRIERPQGGGGGSGSGQGQASQDG---EGQDEFVFQISKDEYLDLLFED 131 + G + E P + S + +G + + + + D L E Sbjct: 99 IGVGVGTNLLGSNAEMPVAETKAASEDTMAGSANSYAPDGGLAYETDEAYETF-DTLDEG 157 Query: 132 LALPNLKQNQQRQLTEYKTHRAGYTANGVPANISVVRSLQNSLARRTAMTAGKRRELH-- 189 + + + + E + T + V + +L R Sbjct: 158 APMEDFNTEEYAAIEENGFV-STVTRPLSTCSADVDTASYCNLRRMINDGYSLDEIPDGA 216 Query: 190 -ALEENLAIISNSEPAQLLEEERLRKEIAELRAKIERVPFIDTFDLRYKNYEKRPDPSSQ 248 +EE L + R + + +K S Sbjct: 217 VRIEEMLNYFHYDSGEP-EGNDLFAVRAESARCPWNDQTQLLVMT--FTASDKAQTASKG 273 Query: 249 AVMFCLMDVSGSMDQSTK-DMAKRFYILLYLFLSRTYKNVEVVYIRHHTQAKE---VDEH 304 + + L+D+SGSMD+ K D+ K + L L + V Y E D+ Sbjct: 274 SNLVFLIDISGSMDEPDKLDLLKDSFGTLLENLGPNDRVSIVTYAAGEDVLLEGASGDDT 333 Query: 305 -----EFFYSQETGGTIVSSALKLMDEVVKERYNPAQWNIYAAQASDGD-NWADDSPLCH 358 + G T + L++ EV + Y N ASDGD N S Sbjct: 334 RKIMRALNRLEADGSTNGEAGLEMAYEVAERNYIEGGVNRIVM-ASDGDLNVGITSESDL 392 Query: 359 EILAKKLLP---VVRYYSYIEITRRAHQTLWREYEHLQSTFDNFA 400 ++ + + + T + ++ Sbjct: 393 YDFVEEKRETGVYLSVLGFG--SGNYKDTKMETLADHGNGTYHYI 435 >UniRef50_UPI000185CB41 protein containing von Willebrand factor n=1 Tax=Capnocytophaga sputigena ATCC 33612 RepID=UPI000185CB41 Length = 550 Score = 68.3 bits (165), Expect = 6e-10, Method: Composition-based stats. 
Identities = 47/245 (19%), Positives = 76/245 (31%), Gaps = 19/245 (7%) Query: 165 SVVRSLQNSLARRTAMTAGKRRELHALEENLAIISNSEPAQLLE-EERLRKEIAELRAKI 223 V R+ +L R ++ +EE + PA E LR Sbjct: 107 DVDRASYANLRRMLGYGQLPPKDAIRIEEMINYFDYDYPAPTKEATSPLRVTPELAPTPW 166 Query: 224 ERVPFIDTFDLRYKNYEKRPDPSSQAVMFCLMDVSGSMDQSTK-DMAKRFYILLYLFLSR 282 + L+ K + P S + L+DVSGSMD+ K + K + LL L Sbjct: 167 NPEHLLLRIGLQAKKLDLAQAPPSN--IVFLIDVSGSMDEPNKLPLLKSSFKLLLTQLKP 224 Query: 283 TYKNVEVVYIRHHTQAKEVD--------EHEFFYSQETGGTIVSSALKLMDEVVKERYNP 334 T + V Y A E +G T SS ++L + ++ + Sbjct: 225 TDRVAIVTYASGTKVALSSTPVKERQKIEKVLDNLYASGSTSGSSGIQLAYKEAQKNFIK 284 Query: 335 AQWNIYAAQASDGD-NWADDSPLCHEILAKKLLP---VVRYYSYIEITRRAHQTLWREYE 390 N A+DGD N +P E +K + + + Sbjct: 285 NGNNRIIL-ATDGDFNVGISNPRELEKFIEKQRESGIYMSVLGFG--MGNYRDDMAETIA 341 Query: 391 HLQST 395 + Sbjct: 342 DKGNG 346 >UniRef50_B0UJ22 von Willebrand factor type A n=1 Tax=Methylobacterium sp. 4-46 RepID=B0UJ22_METS4 Length = 654 Score = 67.5 bits (163), Expect = 8e-10, Method: Composition-based stats. 
Identities = 34/268 (12%), Positives = 71/268 (26%), Gaps = 30/268 (11%) Query: 170 LQNSLARRTAMTAGKRRELHALEENLAIISNSEPAQLLEEERLRKEIAELRAKIERVPFI 229 ++++L R + EE + + PA + R + + + Sbjct: 200 VRDALNRN---HLPPPAAVRT-EELINYFPYAYPAPASPDAPFRVTASVFPSPWAEGRKL 255 Query: 230 DTFDLRYKNYEKRPDPSSQAVMFCLMDVSGSMDQSTK-DMAKRFYILLYLFLSRTYKNVE 288 +R P + + L+D SGSM + + K+ +L L + Sbjct: 256 LHIGIRGYAVAPAERPPAN--LVFLVDTSGSMAAPNRLPLVKQSLAMLLTTLDARDRVAL 313 Query: 289 VVYIRHHTQAKEVDE--------HEFFYSQETGGTIVSSALKLMDEVVKERYNPAQWNIY 340 V Y E Q G T ++ + ++P N Sbjct: 314 VAYAGEVGTVLEPTPAGEAGRILAAIETLQAHGSTAGGEGIRQAYALAARHFDPKAVNRV 373 Query: 341 AAQASDGD-NWADDSPLCHEILAKKLLPV---VRYYSYIEITRRAHQTLWREYEHLQSTF 396 A+DGD N + + + + L + + Sbjct: 374 IL-ATDGDFNVGITGRDELTGFVARERRKGIFLSVLGFG--MGNLNDALMQALAKDGNGV 430 Query: 397 DNFAMQHIRDQDDIYPVFRELFHKQNAT 424 D R++ ++ + Sbjct: 431 AAHI--------DTAQEARKVLVEEATS 450 >UniRef50_A7C0I1 von Willebrand factor type A domain protein n=5 Tax=Bacteria RepID=A7C0I1_9GAMM Length = 367 Score = 67.1 bits (162), Expect = 1e-09, Method: Composition-based stats. Identities = 29/165 (17%), Positives = 49/165 (29%), Gaps = 18/165 (10%) Query: 244 DPSSQAVMFCLMDVSGSMDQ-STKDMAKRFYILLYLFLSRTYKNVEVVYIRHHTQAKEVD 302 P + + L+DVSGSM + K LL L+ K VVY E Sbjct: 1 MPPAN--LVFLVDVSGSMRSNHKLALLKSALKLLSNQLTEKDKVSLVVYAGAAGVVLEPT 58 Query: 303 --------EHEFFYSQETGGTIVSSALKLMDEVVKERYNPAQWNIYAAQASDGD-NWADD 353 G T S+ + L + ++ + N A+DGD N Sbjct: 59 PGHQSVKINGALERLTAGGSTHGSAGIHLAYNLAEQAFIKNGINRILL-ATDGDFNVGTV 117 Query: 354 SPLCHEILAKKLLP---VVRYYSYIEITRRAHQTLWREYEHLQST 395 + L ++ + + + L + + Sbjct: 118 DFEALKNLVEEKRKSGISLTTLGFG--RGNYNDQLMEQLADAGNG 160 >UniRef50_C8WPE6 von Willebrand factor type A n=1 Tax=Eggerthella lenta DSM 2243 RepID=C8WPE6_EGGLE Length = 555 Score = 66.3 bits (160), Expect = 2e-09, Method: Composition-based stats. 
Identities = 38/268 (14%), Positives = 72/268 (26%), Gaps = 28/268 (10%) Query: 170 LQNSLARRTAMTAGKRRELHALEENLAIISNSEPAQLLEEERLRKEIAELRAKIERVPFI 229 L+ +A+R A + EE L + P + + + Sbjct: 112 LRRMVAQRYAPAVVPAGAVRT-EELLNYFDYAYPEPVG-SDLFGVSAQMSDCPWNDQTKL 169 Query: 230 DTFDLRYKNYEKRPDPSSQAVMFCLMDVSGSMDQSTK-DMAKRFYILLYLFLSRTYKNVE 288 + + + + A + L+DVSGSMD K + K + L L+ + Sbjct: 170 --LVMGFATEKDGDASPTGANLVFLIDVSGSMDDPDKLPLVKDSFAALVEGLTERDRVSV 227 Query: 289 VVYIRHHTQAKE-VDEHE-------FFYSQETGGTIVSSALKLMDEVVKERYNPAQWNIY 340 V Y E V + G T + L+ + + + N Sbjct: 228 VTYASGERVLLEGVPGDDKRRIMRAVDSLVAEGSTNGEAGLEQAYRLAESSFIEGGVNRV 287 Query: 341 AAQASDGD-NWADDSPLCHEILAKKLLP---VVRYYSYIEITRRAHQTLWREYEHLQSTF 396 ASDGD N S ++ + + + + Sbjct: 288 VM-ASDGDLNVGISSESELHDFVEQKRETGVYLSVLGFG--SGNYKDNKMETLADHGNGA 344 Query: 397 DNFAMQHIRDQDDIYPVFRELFHKQNAT 424 ++ D R + + Sbjct: 345 YHYI--------DCAEEARRVLGRNLRA 364 >UniRef50_UPI00006CDDCC von Willebrand factor type A domain containing protein n=1 Tax=Tetrahymena thermophila RepID=UPI00006CDDCC Length = 2138 Score = 64.8 bits (156), Expect = 6e-09, Method: Composition-based stats. 
Identities = 48/290 (16%), Positives = 94/290 (32%), Gaps = 55/290 (18%) Query: 135 PNLKQNQQRQLTEYKTHRAGYTANGVPANISVVRSLQNSLARRTAMTAGKRRELHALEEN 194 P +++ Q + + + + IS + L S+ R+ K ++ E+ Sbjct: 1356 PKIEEKDQSEGQQEEFEQNE---THSLRKISQKKVLIKSIQRKVKTNKEKVQKALNEEDK 1412 Query: 195 LAIISNSEPAQLLEEERLRKEIAELRAKIERVPFIDTFDLRYKNYEKRPDPSSQAVMFCL 254 N +Q K I+ + + P DL C+ Sbjct: 1413 ----ENQTKSQQHRISSNVKNISGQFSLGQLQPMRFPIDL-----------------ICV 1451 Query: 255 MDVSGSMDQSTKDMAKRFYILLYLFLSRTYKNVEVVYIRHHTQAKEVD------------ 302 +D SGSM+ D+ K + L L + + I+ T A+ + Sbjct: 1452 IDTSGSMNGQPLDLLKETLLFLVDLLQTGDR---ICLIQFSTNAQRLTPLLSIESKDNIK 1508 Query: 303 --EHEFFYSQETGGTIVSSALKLMDEVV-KERYNPAQWNIYAAQASDGDNWADDSPLCHE 359 ++E GGT + ++L +V+ + RY +++ SDG N ++ Sbjct: 1509 SIKNEINRLVAKGGTNICQGMQLAFDVLKQRRYKNPITSVFLL--SDGLNDGAENK---- 1562 Query: 360 ILAKKLLPVVRYYSYIEITRRAHQTLWREYEHLQSTFDNFAMQHIRDQDD 409 + LL + +Y + + + N M I D Sbjct: 1563 --IRDLLKQLNFYQ----NYNEENFTIQTFGFGKDHDPNL-MDKISQLMD 1605 >UniRef50_A3TQT5 Putative secreted protein n=1 Tax=Janibacter sp. HTCC2649 RepID=A3TQT5_9MICO Length = 533 Score = 63.6 bits (153), Expect = 1e-08, Method: Composition-based stats. 
Identities = 42/244 (17%), Positives = 75/244 (30%), Gaps = 23/244 (9%) Query: 192 EENLAIISNSEPAQLLEEERLRKEIAELRAKIERVPFIDTFDLRYKNYEKRPDPSSQAVM 251 EE + + PA ++ L+ + A ++ + + L+ + + R M Sbjct: 131 EEWVNSFDSGFPAPRKDDLELQSDQARASSEDD-GTRLVRIGLQGREVDVREWQPVALTM 189 Query: 252 FCLMDVSGSMDQSTKD-MAKRFYILLYLFLSRTYKNVEVVY------IRHHTQAKEVDE- 303 +D SGSMD + + K LL L V Y + T ++ D Sbjct: 190 V--VDTSGSMDIRERLGLVKSSLALLAENLRPDDTIAIVTYQTDATPLLEPTPVRDTDTI 247 Query: 304 -HEFFYSQETGGTIVSSALKLMDEVVKERYNPAQWNIYAAQASDG-DNWA-DDSPLCHEI 360 + G T + + L L + +E Y N+ ASDG N D Sbjct: 248 LAAIDRLEAGGSTNLEAGLLLGYDQAREAYKQGATNVVLL-ASDGVANVGVTDGGRLATA 306 Query: 361 LAKKLLPVVRYYSYIEITRRAHQTLWREYEHLQSTFDNFAMQHIRDQDDIYPVFRELFHK 420 + + + L + F + D + R+LF + Sbjct: 307 IRDNGRRGIHLVTVGYGMGNYSDHLMEQLADQGDGFYEYI--------DTFEEARKLFVE 358 Query: 421 QNAT 424 Sbjct: 359 DLRA 362 >UniRef50_A8SXC8 Putative uncharacterized protein n=2 Tax=Clostridiales RepID=A8SXC8_9FIRM Length = 612 Score = 63.2 bits (152), Expect = 2e-08, Method: Composition-based stats. 
Identities = 41/272 (15%), Positives = 81/272 (29%), Gaps = 22/272 (8%) Query: 111 GQDEFVFQISKDEYLDLLFEDLALPNLKQNQQRQLTEYKTHRAGYTANGVPANISVVRSL 170 G V S Y ++ ++ ++ +N ++ + + A+ A+ S VRS Sbjct: 101 GDTAMVTDTSNSMYSEVAYDTREYDSMTENGF--VSTVDRPLSTFAADRDTASYSNVRSY 158 Query: 171 QNSLARRTAMTAGKRRELHALEENLAIISNSEPAQLLEEERLRKEIAELRAKIERVPFID 230 S + +EE L + + + E+ + + Sbjct: 159 IES-------GSLPPDGAVRIEEMLNYFTYDYRKKPEDGEKFSIYTEYSDCPWNKDTKLM 211 Query: 231 TFDLRYKNYEKRPDPSSQAVMFCLMDVSGSM-DQSTKDMAKRFYILLYLFLSRTYKNVEV 289 + + S + L+D SGSM D + + ++ + +L L + V Sbjct: 212 MVGINTDEIDFGDKKPSN--LVFLIDTSGSMYDDNKLPLVQQSFAMLAENLDENDRVSIV 269 Query: 290 VYIRHHTQAKEVD--------EHEFFYSQETGGTIVSSALKLMDEVVKERYNPAQWNIYA 341 Y T G T A+ E+ ++ + N Sbjct: 270 TYAGEDTVVLSGTPGSEQYTISEALSNMTAEGCTNGGDAIITAYELAEKNFINGGNNRVI 329 Query: 342 AQASDGD-NWADDSPLCHEILAKKLLPVVRYY 372 A+DGD N S L + + Sbjct: 330 L-ATDGDLNVGLTSESDLVDLITEEKKENNIF 360 >UniRef50_A9GVG2 Putative uncharacterized protein n=1 Tax=Sorangium cellulosum 'So ce 56' RepID=A9GVG2_SORC5 Length = 656 Score = 62.9 bits (151), Expect = 2e-08, Method: Composition-based stats. 
Identities = 38/266 (14%), Positives = 69/266 (25%), Gaps = 34/266 (12%) Query: 175 ARRTAMTAGKRRELHALEENLAIISNSEPAQLLEEERLRKEIAELRAKIERVPFIDTFDL 234 R+ A + EE L + +A + + + Sbjct: 229 RRKIMDGALPPYQAVRAEEFLNYFDYGYASPAA--GPFAVHLAAAPSPFTSGHHLVRVAV 286 Query: 235 RYKNYEKRPDPSSQAVMFCLMDVSGSMDQSTK-DMAKRFYILLYLFLSRTYKNVEVVYIR 293 + K + + L+D SGSM K ++AK+ +L L V Sbjct: 287 QGKRVPVKERTPVH--LVYLVDTSGSMQSPDKIELAKKSLKMLTDTLKPGD---TVALCT 341 Query: 294 HHTQAKEVDE-----------HEFFYSQETGGTIVSSALKLMDEVVKERYNPAQWNIYAA 342 + +EV G T +SS + L + + N Sbjct: 342 YAGSVREVLAPTGIESKGKILAALADLTAGGSTAMSSGIDLAYSLAERTLVKGHVNRVIV 401 Query: 343 QASDGD-NWADDSPLCHEILAKKLLPV---VRYYSYIEITRRAHQTLWREYEHLQSTFDN 398 SDGD N S K+ + + + + + + Sbjct: 402 -LSDGDANVGPTSHDEILKTIKRARDKGITLSTVGFGQ--GNYKDLMMEQLANQGDGNYA 458 Query: 399 FAMQHIRDQDDIYPVFRELFHKQNAT 424 + D R +F +Q Sbjct: 459 YI--------DSEAQARRVFSEQVGG 476 >UniRef50_Q1CVN5 von Willebrand factor type A domain protein n=2 Tax=Cystobacterineae RepID=Q1CVN5_MYXXD Length = 700 Score = 62.9 bits (151), Expect = 2e-08, Method: Composition-based stats. 
Identities = 33/295 (11%), Positives = 82/295 (27%), Gaps = 39/295 (13%) Query: 145 LTEYKTHRAGYTANGVPANISVVRSLQNSLARRTAMTAGKRRELHALEENLAIISNSEPA 204 + + + ++ + A+ ++ R+ + + +EE + Sbjct: 241 INTEEERFSTFSVDTDSASYTLTRAYLER-------GSLPNEQAVRVEEFVNTFDYGYAH 293 Query: 205 QLLEEERLRKEIAELRAKIERVPFIDTFDLRYKNYEKRPDPSSQAVMFCLMDVSGSMDQS 264 Q ++ + + + + ++ + + S + ++DVSGSM+ Sbjct: 294 QG--SAPFSVQVEGFPSPVRKGYHVVHVGVKAREVSRPQRKPSH--LVFVIDVSGSMNLE 349 Query: 265 TKD-MAKRFYILLYLFLSRTYKNVEVVY------IRHHTQA--KEVDEHEFFYSQETGGT 315 + + KR LL L + VVY + T A + G T Sbjct: 350 NRLGLVKRALHLLVNELDERDQVSIVVYGSTARLVLEPTSAVHAHIIRAAIDSLHTEGST 409 Query: 316 IVSSALKLMDEVVKERYNPAQWNIYAAQASDGDNWADDSPLCHEILAKKLLPV------V 369 + L++ + N SDG A+ + +++ + Sbjct: 410 NAQAGLEMGYSLAASHLVEGGINRVIL-CSDG--VANTGLTDANSIWERIRARAAKGITL 466 Query: 370 RYYSYIEITRRAHQTLWREYEHLQSTFDNFAMQHIRDQDDIYPVFRELFHKQNAT 424 + + L + + D +F + Sbjct: 467 STVGFG--MGNYNDVLMERLSQVGEGNYAYV--------DRIEEAHRIFVRDLTG 511 >UniRef50_B4D1N7 Autotransporter-associated beta strand repeat protein n=1 Tax=Chthoniobacter flavus Ellin428 RepID=B4D1N7_9BACT Length = 1545 Score = 61.7 bits (148), Expect = 5e-08, Method: Composition-based stats. 
Identities = 46/256 (17%), Positives = 74/256 (28%), Gaps = 45/256 (17%) Query: 192 EENLAIISNSEPAQLLEEERLRKEIAELRAKIERVPFIDTFDL-RY--KNYEKRPDPSSQ 248 EE + +P + R PF DL R+ K P Sbjct: 1146 EEFINAFDYRDPEPSPGAPL------AFVTERARYPFAQNRDLLRFAVKTAAAGRQPGRP 1199 Query: 249 AVMFCLMDVSGSMDQSTK-DMAKRFYILLYLFLSRTYKNVEVVYIR-------------- 293 + L+D SGSM+++ + ++ + +L L K V + R Sbjct: 1200 LNIVLLLDRSGSMERADRVNIVREALSVLAKHLQPQDKLSIVTFARTPHLWADAVAGDKV 1259 Query: 294 HHTQAK--EVDEHEFFYSQETGGTIVSSALKLMDEVVKERYNPAQWNIYAAQASDG-DNW 350 H A+ E+ GGT + +AL L E + N +DG N Sbjct: 1260 HDVIARVNEITPE--------GGTNLEAALDLAYETAHHHFAVDSTNRVIL-FTDGAANL 1310 Query: 351 ADDSPLCHEILAKKLLPV-VRYYSYIEITRRAHQTLWREYEHLQSTFDNFAMQHIRDQDD 409 D +P + + + + L + F I +D Sbjct: 1311 GDVNPDALTKKVEAQRKQGIALDCFGIGWEGYNDDLLEQLTRNADGRYGF----INTPED 1366 Query: 410 IYPVFRELFHKQNATA 425 F Q A A Sbjct: 1367 ----AAANFATQIAGA 1378 >UniRef50_A9NFD9 Surface-anchored VWFA domain protein n=1 Tax=Acholeplasma laidlawii PG-8A RepID=A9NFD9_ACHLI Length = 486 Score = 61.3 bits (147), Expect = 7e-08, Method: Composition-based stats. 
Identities = 49/298 (16%), Positives = 101/298 (33%), Gaps = 25/298 (8%) Query: 122 DEYLDLLFEDLALPNLKQNQQRQLTEYKTHRAGYTANGVPANISVVRSLQNSLARRTAMT 181 DE + +F D + +N ++ ++ + + A+ S +RS NS Sbjct: 31 DENYNYIFNDDEHQEIIENPFIDVSVNN--KSNISLSANTASYSFIRSQINS-------G 81 Query: 182 AGKRRELHALEENLAIISNSEPAQLLEEERLRKEIAELRAKIERVPFIDTFDLRYKNYEK 241 R +EE + + + ++ + ++ + L K + Sbjct: 82 RAVDRNAVRIEEMVNFFNYNYNQPETDKT-FGFKSELIQTPWNNETHLLLIGLETKQVDL 140 Query: 242 RPDPSSQAVMFCLMDVSGSMDQSTKD-MAKRFYILLYLFLSRT-------YKNVE-VVYI 292 PS+ + L+DVSGSM + K +AK+ LL + Y + E VV+ Sbjct: 141 GDIPSN---IVILLDVSGSMSATNKLSLAKKAMELLIEQMKPNDVISLVTYSSGEKVVFK 197 Query: 293 RHHTQAKEVDEHEFFYSQETGGTIVSSALKLMDEVVKERYNPAQWNIYAAQASDGD-NWA 351 + + +G T L + +V +E + N A+DGD N Sbjct: 198 GKSIDDMAYMTSQIRLLKASGSTAGKKGLDMAYKVAEEYFIEGGNNRIIL-ATDGDFNVG 256 Query: 352 -DDSPLCHEILAKKLLPVVRYYSYIEITRRAHQTLWREYEHLQSTFDNFAMQHIRDQD 408 + + E +++K + + +Y + ++ I + Sbjct: 257 ISSTDMLIEYISEKRESGIYFSAYGFGYGNFKDEKLERVAKAGNGTYHYIDDIISARK 314 >UniRef50_D2W4Q3 Predicted protein n=1 Tax=Naegleria gruberi RepID=D2W4Q3_NAEGR Length = 454 Score = 59.0 bits (141), Expect = 3e-07, Method: Composition-based stats. 
Identities = 38/202 (18%), Positives = 64/202 (31%), Gaps = 30/202 (14%) Query: 234 LRYKNYEKRPDPSSQAVMFCLMDVSGSMDQSTKDMAKRFYILLYLFLSRTYKNVEVVYIR 293 L++ Q + +DVSGSM D AK I + + +VV I Sbjct: 26 LQFDLISNIQRKEKQ--IVIALDVSGSMRGQGIDQAK---IAISNLFEQVVDTPDVVLIT 80 Query: 294 HHTQAK---------EVDEHEFFYSQETGGTIVSSALKLMDEVVKERYNPAQWNI----Y 340 + T A+ E + Q GGT + + + + +N Sbjct: 81 YDTSAELYDLRKKPAETRQSTLEQIQAGGGTDFTCVFEAISNL-------DMFNRQSEVA 133 Query: 341 AAQASDG-DNWADDSPLCHEILAKKLLPVVRYYSYIEI--TRRAHQTLWREYEHLQSTFD 397 +DG D + E + K L + + + I T L + L S Sbjct: 134 ILFFTDGQDGSSHKREKAIEQMKKVLETKTQSFEFHTIGFTSSHDVALLTQITQLGSVQG 193 Query: 398 NFAMQHIRDQDDIYPVFRELFH 419 F ++D ++I L Sbjct: 194 TFQY--VKDANEINQSMENLIG 213 >UniRef50_D2VDM1 Predicted protein n=1 Tax=Naegleria gruberi RepID=D2VDM1_NAEGR Length = 754 Score = 59.0 bits (141), Expect = 4e-07, Method: Composition-based stats. Identities = 38/202 (18%), Positives = 66/202 (32%), Gaps = 30/202 (14%) Query: 234 LRYKNYEKRPDPSSQAVMFCLMDVSGSMDQSTKDMAKRFYILLYLFLSRTYKNVEVVYIR 293 L++ Q + +DVSGSM D AK I + + +VV I Sbjct: 26 LQFDLISNIQRKEKQ--IVIALDVSGSMRGQGIDQAK---IAISNLFEQVVDIPDVVLIA 80 Query: 294 HHTQAK---------EVDEHEFFYSQETGGTIVSSALKLMDEVVKERYNPAQWN----IY 340 + T A+ E + Q GGT + + + ++ +N + Sbjct: 81 YDTSAELYDLRKKPAETRQSTLEQIQAGGGTDFTCVFEAISKL-------DMFNSQSEVA 133 Query: 341 AAQASDG-DNWADDSPLCHEILAKKLLPVVRYYSYIEI--TRRAHQTLWREYEHLQSTFD 397 +DG D + E + K L + + + I T L + L S Sbjct: 134 ILFFTDGQDGSSHKREKAIEQMKKVLETKTQSFEFHTIGFTSSHDVALLTQITQLGSVQG 193 Query: 398 NFAMQHIRDQDDIYPVFRELFH 419 F ++D ++I L Sbjct: 194 TFQY--VKDANEINQSMENLIG 213 >UniRef50_A0P2E4 Von Willebrand factor type A like domain n=1 Tax=Labrenzia aggregata IAM 12614 RepID=A0P2E4_9RHOB Length = 772 Score = 56.7 bits (135), Expect = 2e-06, Method: Composition-based stats. 
Identities = 23/161 (14%), Positives = 51/161 (31%), Gaps = 19/161 (11%) Query: 251 MFCLMDVSGSMDQSTKDMAKRFYILLYLFLSR-------TYKNVEVVYIRHHTQAKEVDE 303 + ++D SGSM + +K F L + N + A E ++ Sbjct: 366 LVFVLDTSGSMSGQPIEASKTFMTAAIKALRPDDYFRILHFSNDTSQFAGQAVLATERNK 425 Query: 304 HEFFY----SQETGGTIVSSALKLMDEVVKERYNPAQWNIYAAQASDGDNWADDSPLCHE 359 + GGT ++ A+ + + P +DG + D + Sbjct: 426 QKALKFVADLSAGGGTEINQAVNAAFDQAQ----PDNTTRIVVFLTDG--YIGDEATVIK 479 Query: 360 ILAKKLLPVVRYYSYIEITRRAHQTLWREYEHLQSTFDNFA 400 +A ++ R Y++ + ++ L + + Sbjct: 480 SIANRI-GKARIYAFG-VGNSVNRFLLDAMATEGRGYARYV 518 >UniRef50_Q55874 Uncharacterized protein sll0103 n=16 Tax=Cyanobacteria RepID=Y103_SYNY3 Length = 420 Score = 54.4 bits (129), Expect = 8e-06, Method: Composition-based stats. Identities = 35/180 (19%), Positives = 62/180 (34%), Gaps = 25/180 (13%) Query: 254 LMDVSGSMDQSTKDMAKRFYILLYLFLSRTYKNVEVVYIRHHTQAKEVDEH--------- 304 ++D SGSMD + K + L L + V I +AK V E+ Sbjct: 47 VLDHSGSMDGQPLETVKSAALGLIDRLEE-DDRLSV--IAFDHRAKIVIENQQVRNGAAI 103 Query: 305 --EFFYSQETGGTIVSSALKLMDEVVKERYNPAQWNIY--AAQASDGDNWADDSPLCHEI 360 + GGT + LKL + + + + +DG+N D+ C + Sbjct: 104 AKAIERLKAEGGTAIDEGLKLGIQEAAK----GKEDRVSHIFLLTDGENEHGDNDRCLK- 158 Query: 361 LAKKLLPVVRYYSYIEITRRAHQTLWREYEHLQSTFDNFAMQHIRDQDDIYPVFRELFHK 420 L V Y T ++ + ++ +I + + FR+LF + Sbjct: 159 ----LGTVASDYKLTVHTLGFGDHWNQDVLEAIAASAQGSLSYIENPSEALHTFRQLFQR 214 >UniRef50_A9AWD1 von Willebrand factor type A n=1 Tax=Herpetosiphon aurantiacus ATCC 23779 RepID=A9AWD1_HERA2 Length = 610 Score = 54.0 bits (128), Expect = 1e-05, Method: Composition-based stats. 
Identities = 42/251 (16%), Positives = 77/251 (30%), Gaps = 23/251 (9%) Query: 186 RELHALEENLAIISNSEPAQLLEEERLRKEIAELRAKIERVPFIDTFDLRYKNYEKRPDP 245 + +EE L P + + E+A + ++ ++ E Sbjct: 206 ADSVRVEEYLNAFDYEYPQPEDGDFAIYSEVAP-SPFGGPNYELVQIGIQARSIEVADRK 264 Query: 246 SSQAVMFCLMDVSGSMDQSTK-DMAKRFYILLYLFLSRTYKNVEVVY------IRHHTQA 298 A + ++D SGSM Q + +M K I L L V + + + T Sbjct: 265 P--AALTFVIDTSGSMAQDNRLEMVKNALIYLAGQLEPDDSLAIVAFNDGMRVVLNPTSG 322 Query: 299 KEVDE--HEFFYSQETGGTIVSSALKLMDEVVKERYNPAQWNIYAAQASDG-DNWADDSP 355 + + + G T + L E+ + + P N SDG N P Sbjct: 323 ENQMDIITAINSLEPAGSTNAEAGLYKGFELAWQAFKPEGINRILL-CSDGVANSGMTEP 381 Query: 356 LCHEILAKKLLPV-VRYYSYIEITRRAHQTLWREYEHLQSTFDNFAMQHIRDQDDIYPVF 414 ++ L V+ +Y + L + N+A D Sbjct: 382 SQLLATFQQYLDAGVQLSTYGVGMGNYNDILLEQLADKGDG--NYAYF------DSADEA 433 Query: 415 RELFHKQNATA 425 + LF +Q + Sbjct: 434 QRLFGEQLTGS 444 >UniRef50_A1ZRP2 Von Willebrand factor type A domain protein n=1 Tax=Microscilla marina ATCC 23134 RepID=A1ZRP2_9SPHI Length = 1088 Score = 53.6 bits (127), Expect = 1e-05, Method: Composition-based stats. Identities = 24/107 (22%), Positives = 40/107 (37%), Gaps = 10/107 (9%) Query: 251 MFCLMDVSGSMDQSTK-DMAKRFYILLYLFLSRTYKNVEVVY------IRHHTQA--KEV 301 + L+DVSGSM K + K + L + V+Y + T A +E Sbjct: 914 LMLLLDVSGSMSSKDKLPLLKESFKYLISIMRPQDDVSIVIYAGDAAIVLKPTSASNQEQ 973 Query: 302 DEHEFFYSQETGGTIVSSALKLMDEVVKERYNPAQWNIYAAQASDGD 348 + G T V + KL + + + + N A+DG+ Sbjct: 974 INAVIDKLRSRGKTNVKAGFKLAYKWMSKNFKEGGNNRIIL-ATDGE 1019 Score = 44.0 bits (102), Expect = 0.012, Method: Composition-based stats. 
Identities = 27/106 (25%), Positives = 40/106 (37%), Gaps = 10/106 (9%) Query: 251 MFCLMDVSGSMDQSTKDMAKRFYILLYLFLSRTYKNVEVVY------IRHHTQAKEVDE- 303 + L+DVSGSM M K L + K VV+ + T AK + Sbjct: 687 LMLLLDVSGSMKN-ELPMLKSALKYLVNIMRPEDKVSVVVFGSEAKLMLRPTSAKYKAQI 745 Query: 304 -HEFFYSQETGGTIVSSALKLMDEVVKERYNPAQWNIYAAQASDGD 348 + +G T + LKL + ++ Y N ASDG+ Sbjct: 746 MQAIDTLKSSGRTNGEAGLKLAYQWIQNNYKNNNNNRIIL-ASDGE 790 >UniRef50_D0MZH7 Putative uncharacterized protein n=1 Tax=Phytophthora infestans T30-4 RepID=D0MZH7_PHYIN Length = 1850 Score = 53.2 bits (126), Expect = 2e-05, Method: Composition-based stats. Identities = 44/294 (14%), Positives = 94/294 (31%), Gaps = 42/294 (14%) Query: 28 AQIKQSISEAINKRSVTDVDSGESV---SIPTEDISEPMFHQGRGGLRHRVHPGNDHFVQ 84 ++ +++E + ++ G V S P + P G+ P ND V Sbjct: 1455 ERMYGTMAELASDGTLELKVDGSRVGGPSTPKTGLDTP--KYGKDD------PNNDPHVG 1506 Query: 85 NDRIERPQGGGGGSGSGQGQASQDGEGQDEFVFQISKDEYLDLLFED------LALPNLK 138 + GG +G G + V Q+S+++ ++ E +A L Sbjct: 1507 GNTWAGGTGGSDTAGLGGRGGPYRLDKGH-PVHQVSQEKKDEVSAEARAKARAMAQEALA 1565 Query: 139 QNQQRQLTEYKTHRAGYTANGVPANISVVRSLQNSLARRTAMTAGKRRELHALEENLAII 198 + R++ + Y + + R +A L A+ + + Sbjct: 1566 EK-LREIDMSEREWETYQ------------TYFKRVERESAQLRAVLANLEAVAQERNWL 1612 Query: 199 SNSEPAQLLEEERLRKEIAELRAKIERVPFIDTFDLRYKNYEKRPDPSSQAVMFCLMDVS 258 + +L + + + + + D ++ M +MDVS Sbjct: 1613 RHQSSGELDDGKL----VDGVAGERLVFKRRGVRDSPFQAPAGHQQEQEPKRMVFVMDVS 1668 Query: 259 GSM------DQSTKDMAKRFYILLYLFLSRTYKNVEVVYIRHHTQAKEVDEHEF 306 GSM D + M + +++ F + ++ H + E+ EF Sbjct: 1669 GSMYRFNGQDSRLERMLETSLMIMESFAGFE-RELDYCIFGHSGDSPEIPFVEF 1721 >UniRef50_Q231J4 von Willebrand factor type A domain containing protein n=1 Tax=Tetrahymena thermophila SB210 RepID=Q231J4_TETTH Length = 520 Score = 53.2 bits (126), Expect = 2e-05, Method: Composition-based stats. 
Identities = 26/136 (19%), Positives = 51/136 (37%), Gaps = 20/136 (14%) Query: 229 IDTFDLRYKNYEKRPDPSSQAVMFCLMDVSGSMDQSTKDMAKRFYILLYLFLSRTYKNVE 288 ++ D +K + ++D SGSM ++ K+ + + + + Sbjct: 76 VEAKDFDADQVKKDKVRYQPLDLIFVIDTSGSMQGKKIELVKKSILQVLHIIQGDDR--- 132 Query: 289 VVYIRHHTQAKEVDE-------------HEFFYSQETGGTIVSSALKLMDEVVKERYNPA 335 + + ++QAK + E Q GGT + ++ +++KER N Sbjct: 133 ISLVGFNSQAKVLLELTQLTKNSKKKIQKTVDELQAGGGTQIGFGMQKAFDIIKERTNSK 192 Query: 336 QWNIY-AAQASDG-DN 349 N+ SDG DN Sbjct: 193 --NLASIFLLSDGQDN 206 >UniRef50_Q22SJ4 von Willebrand factor type A domain containing protein n=8 Tax=Tetrahymena thermophila RepID=Q22SJ4_TETTH Length = 646 Score = 53.2 bits (126), Expect = 2e-05, Method: Composition-based stats. Identities = 23/115 (20%), Positives = 46/115 (40%), Gaps = 15/115 (13%) Query: 251 MFCLMDVSGSMDQSTKDMAKRFYILLYLFLSRTYKNVEVVYIRHHT---QAKEVDEHE-- 305 + C++D SGSM K + L L+ + +++ + T ++VD+ Sbjct: 210 LVCVIDNSGSMQGEKIQNVKTTLLQLLDMLNSNDRLSLILFNSYPTLLCNLRKVDDENTP 269 Query: 306 -----FFYSQETGGTIVSSALKLMDEVVKER--YNPAQWNIYAAQASDGDNWADD 353 GGT ++S + + ++++R +NP SDG + D Sbjct: 270 NIQSIINSITADGGTDINSGMLMAFNILQKRQFFNPVSS---IFLLSDGQDNGAD 321 >UniRef50_B5JNR2 von Willebrand factor type A domain protein n=1 Tax=Verrucomicrobiae bacterium DG1235 RepID=B5JNR2_9BACT Length = 923 Score = 53.2 bits (126), Expect = 2e-05, Method: Composition-based stats. 
Identities = 38/246 (15%), Positives = 69/246 (28%), Gaps = 25/246 (10%) Query: 178 TAMTAGKRRELHALEENLAIISNSEPAQLLEEERLRKEIAELRAKIERVPFIDTFD-LRY 236 A EE + +P + + + PF D LR+ Sbjct: 513 LAQNVRPPAGTLRTEEFVNAFDYGDPTPPVARKI------GFTWERAHWPFAHDRDVLRF 566 Query: 237 K-NYEKRPDPSSQAV-MFCLMDVSGSMDQSTK-DMAKRFYILLYLFLSRTYKNVEVVYIR 293 SSQ + + +D SGSM + + D+ L L+ + V + R Sbjct: 567 SLQTAAHGRASSQPLHLTLAIDTSGSMSRPDRVDIVNSLATALQSNLTEKDRLSIVSFDR 626 Query: 294 HHT---QAKEVDEHEFF-----YSQETGGTIVSSALKLMDEVVKERYNPAQWNIYAAQAS 345 + V GGT + SAL+L + + + N + Sbjct: 627 QPRLVLDGQSVTAETNLATLATQLNPQGGTDLESALQLSYQTAQRHFQENAINRVIL-IT 685 Query: 346 DG-DNWADDSPL-CHEILAKKLLPVVRYYSYIEITRRAHQTLWREYEHLQSTFDNFAMQH 403 DG N + + + + + + + T F Sbjct: 686 DGAANLGNTNAEQLRTTVTENRIRGIALDCFGIGFDGHDDTFLESLSRNGDGRYRF---- 741 Query: 404 IRDQDD 409 +R +D Sbjct: 742 LRSPED 747 >UniRef50_C8SCW7 Putative uncharacterized protein n=1 Tax=Ferroglobus placidus DSM 10642 RepID=C8SCW7_FERPL Length = 403 Score = 52.8 bits (125), Expect = 2e-05, Method: Composition-based stats. 
Identities = 44/242 (18%), Positives = 83/242 (34%), Gaps = 19/242 (7%) Query: 118 QISKDEYLDLLFEDLALPNLKQNQQRQLTEYKTHRAGYTANGVPANISVVRSLQNSLARR 177 I+ E LD FE+ AL L + G T + R + +A++ Sbjct: 110 DINFKELLDYFFEE-ALKELIEMGI---------IEGVTKRFFRRKVKFSRQAERIIAQK 159 Query: 178 TAMTAGKRRELHALEENLAIISNSEPAQLLEEERLRKEIAELRAKIERVPFIDTFDLRYK 237 K + + E +S +L+E + + + D + Sbjct: 160 VMKEVSKEAKGYYAESEGETLSYIPGYELVEYDEYLHSYDLIDIPETMIRAAKNEDFEIR 219 Query: 238 N---YEKRPDPSSQAVMFCLMDVSGSMDQSTKDMAKRFYILLYLFLSRTYKNVEVVYIRH 294 + P + L+DVS SM A + L + + + + ++EV H Sbjct: 220 EKDIVSRNPKKVGKRHFVMLIDVSDSMRGKKIVGAIEAALALKMSIRKGFDDLEVFVFNH 279 Query: 295 HTQAKEVDEHEFFYSQETGGTIVSSALKLMDEVVKERYNPAQWNIYAAQASDGDNWADDS 354 T+ ++ E + G T ++ ALK ++ + Y +DG+ A + Sbjct: 280 RTE--KIREGDIVNVDVEGRTDIALALKTARNALRGKDGAK----YVILITDGEPTASYN 333 Query: 355 PL 356 PL Sbjct: 334 PL 335 >UniRef50_A1ZFT4 Von Willebrand factor, type A n=1 Tax=Microscilla marina ATCC 23134 RepID=A1ZFT4_9SPHI Length = 827 Score = 52.8 bits (125), Expect = 2e-05, Method: Composition-based stats. Identities = 30/183 (16%), Positives = 59/183 (32%), Gaps = 22/183 (12%) Query: 237 KNYEKRPDPSSQAVMFCLMDVSGSMDQSTKDMAKRFYILLYLFLSRT-------YKNVEV 289 K + P + V ++DVSGSM ++KR L L +++ Sbjct: 313 KAPKNSQIPPREYV--FIVDVSGSMHGFPLSVSKRLLKNLIGKLRPKDKFNVMLFESSNQ 370 Query: 290 VYIRHHTQAKEVDEHEFFYS----QETGGTIVSSALKLMDEVVKERYNPAQWNIYAAQAS 345 + +A + + + F + GGT + ALK + + ++ + Sbjct: 371 MMSPESMEATQANIQKAFGVIDQQRGGGGTRLLPALKKALAFKQTK----DYSRSFVVVT 426 Query: 346 DGDNWADDSPLCHEILAKKLLPVVRYYSYIEITRRAHQTLWREYEHLQSTFDNFAMQHIR 405 DG L + L +++ I ++ L F + H Sbjct: 427 DG---YVTVEKEAFDLIRNNLNRANLFAFG-IGSSVNRFLIEGMARAGMGEP-FIVTHGT 481 Query: 406 DQD 408 + D Sbjct: 482 EAD 484 >UniRef50_Q9ZGE6 Magnesium-chelatase 67 kDa subunit n=2 Tax=Heliobacteriaceae RepID=BCHD_HELMO Length = 666 Score = 52.8 bits (125), Expect = 3e-05, Method: Composition-based stats. 
Identities = 44/277 (15%), Positives = 91/277 (32%), Gaps = 35/277 (12%) Query: 108 DGEGQDEFVFQISKDEYLDLLFEDLALPNLKQNQQRQLTEYKTHRAGYTANGVPANISVV 167 + + + + + F+ +P + Q + R G A+G ++ Sbjct: 356 ETPPDEAPKDEQTLQLPEEFFFDAEEVPMEDELLSLQNKVQRQARGG--AHGKQKSLERG 413 Query: 168 RSLQNSLARRTAMTAGKRRELHALEENLAIISNSEPAQLLEEERLRKEIAELRAKIERVP 227 R + A+ + + + Q E + +R Sbjct: 414 RYAR-------ALLPPPGKNSRVAVDATLRAAAPYQRQRRESGQYG----------DRQV 456 Query: 228 FIDTFDLRYKNYEKRPDPSSQAVMFCLMDVSGSMDQSTKDMAK-RFYILLYLFLSRTYKN 286 + D+R K + ++ S A++ ++D SGSM + AK +LL K Sbjct: 457 IVTNSDIRAKQFVRK----SGALIIFVVDASGSMAFNRMSSAKGAVSVLLNEAYVNRDKV 512 Query: 287 VEVVYIRH-------HTQAKEVDEHEFFYSQETGGTIVSSALKLMDEVVKERYNPAQWNI 339 +++ T++ E+ + F GG+ ++ A+ EV + Sbjct: 513 ALIIFRGQQAETLVPPTRSVELAKKRFDQVPVGGGSPLAGAIAQAIEVGVNSIGSDVGQV 572 Query: 340 YAAQASDGD-NWADD---SPLCHEILAKKLLPVVRYY 372 +DG N D P E L +++L + R Sbjct: 573 IITLITDGRGNVPMDPQAGPKNREQLNEEILALSRLV 609 >UniRef50_O28828 Putative uncharacterized protein n=1 Tax=Archaeoglobus fulgidus RepID=O28828_ARCFU Length = 410 Score = 52.5 bits (124), Expect = 3e-05, Method: Composition-based stats. 
Identities = 39/247 (15%), Positives = 84/247 (34%), Gaps = 14/247 (5%) Query: 112 QDEFVFQISKDEYLDLLFEDLALPNLKQNQQRQLTEYKTHRAGYTANGVPANISVVRSLQ 171 D ++S + +D F+++ + L++ + E + HR + S+ Sbjct: 108 GDISKDELSMSQVVDNFFDEV-VDELQEMGYVEKVETRFHRKIIHYT------AKAESVL 160 Query: 172 NSLARRTAMTAGKRRELHALEENLAIISNSEPAQLLEEERLRKEIAELRAKIERVPFIDT 231 ++ +R E S ++++ + + + Sbjct: 161 AEKVLSLSLQNLDKRSYGEHETEKLGQSIFSSERIVDYDPFTHSYDNIDLVESLIASAMR 220 Query: 232 FDLRYKN---YEKRPDPSSQAVMFCLMDVSGSMDQSTKDMAKRFYILLYLFLSRTYKNVE 288 ++ ++P + + V L+DVS SM A + L + R E Sbjct: 221 GEIELNENEMVARQPKHTEKCVYVMLIDVSDSMRGRKIVGAIEAALCLRKAIRRAGSGDE 280 Query: 289 VVYIRHHTQAKEVDEHEFFYSQETGGTIVSSALKLMDEVVKERYNPAQWNIYAAQASDGD 348 + I + +A E+ E E + G T + ALK +++K SDG+ Sbjct: 281 LRVIAFNHRAHEIKEGEILNLEARGRTDIGLALKRARKILKGSSGTG----VVFLISDGE 336 Query: 349 NWADDSP 355 + +P Sbjct: 337 PTSSYNP 343 >UniRef50_A8U1E2 Putative uncharacterized protein n=1 Tax=alpha proteobacterium BAL199 RepID=A8U1E2_9PROT Length = 683 Score = 51.7 bits (122), Expect = 5e-05, Method: Composition-based stats. Identities = 33/181 (18%), Positives = 56/181 (30%), Gaps = 23/181 (12%) Query: 251 MFCLMDVSGSMDQSTKDMAKRFYILLYLFLSRT-------YKNVEVVY----IRHHTQAK 299 + ++DVSGSM AK L R + N + +R + Sbjct: 330 VIFVIDVSGSMKGEPLRAAKASLTSGIEGLGRNDTFNVVAFNNKAAAFYDAPVRASGKFH 389 Query: 300 EVDEHEFFYSQETGGTIVSSALKLMDEVVKERYNPAQWNIYAAQASDGDNWADDSPLCHE 359 + GGT +++A +L ++ +P + +DG A + Sbjct: 390 RAALKVIDGLKAGGGTEMAAAFELALQMPG---DPDRLQQVVF-ITDG---AVSNEAALF 442 Query: 360 ILAKKLLPVVRYYSYIEITRRAHQTLWREYEHLQSTFDNFAMQHIRDQDDIYPVFRELFH 419 K L R ++ I + E +I D V R+LF Sbjct: 443 NQIKGELGARRLFTVG-IGSAPNTFFMEEAARFGRGT----YTYIGDTSSAERVMRDLFT 497 Query: 420 K 420 K Sbjct: 498 K 498 >UniRef50_A0C051 Chromosome undetermined scaffold_14, whole genome shotgun sequence n=3 Tax=Paramecium tetraurelia RepID=A0C051_PARTE Length = 636 Score = 51.3 bits (121), Expect = 6e-05, Method: Composition-based stats. 
Identities = 30/164 (18%), Positives = 57/164 (34%), Gaps = 23/164 (14%) Query: 251 MFCLMDVSGSMDQSTKDMAKRFYILLYLFLSRTYKNVEVVYIRHHTQA------KEVDEH 304 + CL+D+SGSM +M K I+L FL + + I A K V Sbjct: 162 LICLIDISGSMIGVKIEMVKASLIVLLQFLGDNDR---LQLITFDNDAHRLTPLKTVTNQ 218 Query: 305 E-------FFYSQETGGTIVSSALKLM-DEVVKERYNPAQWNIYAAQASDGDNWADDSPL 356 + GG +S A K+ ++ +Y +++ SDG ++ Sbjct: 219 NKSYFTQIIKQIKANGGNRISEATKMAFYQLKSRKYINNVTSVFLL--SDGVDYTYPEVK 276 Query: 357 CHEILAKKLLPVVRYYSYIEITRRAHQTLWREYEHLQSTFDNFA 400 ++ + + + E + + +L+S F Sbjct: 277 NQIQTVNEVF-TLHTFGFGE---DHDAQMMTQLCNLKSGSFYFV 316 >UniRef50_B8HNU4 von Willebrand factor type A n=4 Tax=Chroococcales RepID=B8HNU4_CYAP4 Length = 421 Score = 50.9 bits (120), Expect = 8e-05, Method: Composition-based stats. Identities = 29/180 (16%), Positives = 61/180 (33%), Gaps = 25/180 (13%) Query: 254 LMDVSGSMDQSTKDMAKRFYI-LLYLFLSRT------YKNVEVVYI-RHHTQAKEVDEHE 305 ++D SGSM + KR L+ L + +V V I ++ + Sbjct: 47 ILDHSGSMAGQPLETVKRAAQKLVDRLLPSDRLAVIVFDHVAKVLIPNQPVTDRDKIKTR 106 Query: 306 FFYSQETGGTIVSSALKLMD-EVVKERYNPAQWNIYAAQASDGDNWADDSPLCHEILAKK 364 + GGT + L+L E++ + +DG+N ++ C ++ + Sbjct: 107 ISHLAAMGGTAIDEGLQLGLTELIAAKAGAISQ---IFLLTDGENEHGNNSRCLQLAEEA 163 Query: 365 LLPVVRY----YSYIEITRRAHQTLWREYEHLQSTFDNFAMQHIRDQDDIYPVFRELFHK 420 + + Y +Q + + ++ I D+ F LF++ Sbjct: 164 AKENITLNTLGFGY-----HWNQDVLEQIADAAGG----SLMFIEYPQDVLIGFERLFNQ 214 >UniRef50_A2SP98 Putative uncharacterized protein n=1 Tax=Methylibium petroleiphilum PM1 RepID=A2SP98_METPP Length = 791 Score = 50.9 bits (120), Expect = 1e-04, Method: Composition-based stats. 
Identities = 72/386 (18%), Positives = 130/386 (33%), Gaps = 48/386 (12%) Query: 22 FLRRYKAQIKQSISEAINKRSVTDVDSGESVSIP------TEDISEPMFHQGRGGLRHRV 75 +L K + ++ I ++ + S P +D+ M G Sbjct: 364 YLDMLKPSERVEMAMKILEKILQPQKSNGMPQQPQNGGLTIKDLERAMGRGGAPN----- 418 Query: 76 HPGNDHFVQNDRIERPQGGGGGSGSGQG-QASQDGEGQDEFVFQISKDEYLDLLFE-DLA 133 PGN + + G GSG+ A GQD +S ++ L + ++ Sbjct: 419 -PGNGNSQSGGQPGDQAGAQDGSGTEDMVPAPTVTHGQDH---VMSTEDLAQALHDAGVS 474 Query: 134 LPNLKQNQQRQLTEYKTHRAGYTANGVPANISVVRSLQNSLARRTAMTAGKRRELHALEE 193 + + L + +GV + I+ Q + R + + Sbjct: 475 SDTMAKLGFDDLKKIPEEVKH-AKDGVVSAINKASEDQMKVGSRYPGGHLLHYAKAQMLD 533 Query: 194 NLAIISNSEPAQLLEEERLRKEIAELRAKIERVPFIDTFDLRYKNYEKRPD-------PS 246 + E A E K + + +D D+ +K+ P Sbjct: 534 FFKPVLTWEMAHKKLLEACGKGSRYDPTEPWTLYHVDAADMGFKHQRDVPFMGSRMPGKE 593 Query: 247 SQAVMFCLMDVSGSMDQSTKDMAKRFYILLYLFLSRTYKNV--EVVYIRHHTQAKEVDEH 304 + +MF ++D SGS+D + M KRF R + V +V+ T + V E Sbjct: 594 QKPLMFDIIDTSGSVDDA---MLKRFVSEALNQARRVSRGVAPDVLISWADTICRGVPEF 650 Query: 305 -------EFFYS----QETGGTIVSSALKLMDEVVK--ERYNPAQWNI-YAAQASD-GDN 349 +F GGT +A++ + E+VK + A+ NI +D GD+ Sbjct: 651 ISEKNYKQFLTKGINYGGRGGTNFQAAIENVLEMVKPGSKSGYAKRNIDAICYMTDSGDS 710 Query: 350 WADDSPLCHEIL---AKKLLPVVRYY 372 D + L + KKL P++ Sbjct: 711 VPDPARLLRKAQECGLKKLPPILFLV 736 >UniRef50_B8G546 von Willebrand factor type A n=6 Tax=Chloroflexi (class) RepID=B8G546_CHLAD Length = 418 Score = 50.5 bits (119), Expect = 1e-04, Method: Composition-based stats. 
Identities = 29/183 (15%), Positives = 60/183 (32%), Gaps = 19/183 (10%) Query: 253 CLMDVSGSMDQSTKDMAKRFYILLYLFLSRT--------YKNVEVVYIRHHTQAKEVDEH 304 ++D SGSM + + K + L V+ + + Sbjct: 47 FVLDRSGSMQGAKLESMKAATRRVIELLRPHDVAAIVIFDDTVQTLIPATPVGDRSALLA 106 Query: 305 EFFYSQETGGTIVSSALKLMDEVVKERYNPAQWNIYAAQASDGDNWADDSPLCHEILAKK 364 E GGT +S ++ +++ P + + +DG W D P+C + LA+ Sbjct: 107 AVETITEAGGTAMSLGMQAAQTELQKHLGPDRISRMLL-LTDGQTWG-DEPICRD-LART 163 Query: 365 LLPVVRYYSYIEITRRAHQTLWREYEHLQSTFDNFAMQHIRDQDDIYPVFRELFHKQNAT 424 L + + + ++ L + + ++ I D I F + Sbjct: 164 LGQAGVRITALGLGTEWNEQLLDDIAAASDGYSDY----IADPAQI----ETFFQQAVKE 215 Query: 425 AKG 427 A+ Sbjct: 216 AQA 218 >UniRef50_D2V0V6 Predicted protein n=1 Tax=Naegleria gruberi RepID=D2V0V6_NAEGR Length = 502 Score = 50.5 bits (119), Expect = 1e-04, Method: Composition-based stats. Identities = 35/199 (17%), Positives = 67/199 (33%), Gaps = 32/199 (16%) Query: 182 AGKRRELHALEENLAIISNSEPAQLLEEERLRKEIAELRAKIERVPFIDTFDLRYKNYEK 241 L E L+ + ++ + ++ + E +PF+ L + Sbjct: 2 KVSESALQKFETLLSNFAPQSVNLSVQIDAGHLPTDQIGVQCE-IPFLVRL-LSGNLPPQ 59 Query: 242 RPDPSSQAVM-----FCL-MDVSGSMDQSTKDMAKRFYI---------LLYLFLSRTYKN 286 + + V+ CL +D+SGSMD+ K+ +K + L+ FL+ Sbjct: 60 EEEAETTNVLKTPVNICLVLDISGSMDEPLKNRSKGSKLTACKSAIRELVTNFLTYKD-- 117 Query: 287 VEVVYIRHHTQAKEVDEH---------EFFYSQETGGTIVSSALKLMDEVVKERYNPAQW 337 + I + K V + G T ++SAL +++ P Sbjct: 118 -TIHLITYSDSPKTVFTEKNKESVNLNDIDKISTEGSTNIASALHSAVDLLHNSNAPGT- 175 Query: 338 NIYAAQASDGD-NWADDSP 355 A SDG N + + Sbjct: 176 -KLIAFFSDGQCNVGETNL 193 >UniRef50_D1WZ12 VWA containing CoxE family protein n=13 Tax=Streptomyces RepID=D1WZ12_9ACTO Length = 1289 Score = 50.2 bits (118), Expect = 2e-04, Method: Composition-based stats. 
Identities = 43/291 (14%), Positives = 80/291 (27%), Gaps = 24/291 (8%) Query: 45 DVDSGESVSIPTEDISEPMFHQGRGGLRHRVHPGNDHFVQNDRIERPQGGGGGSGSGQGQ 104 D + + T G G PG P Sbjct: 935 DQLPSGAARLATALDELYGAGHGEGSRGGLSGPGRTGSRGGREPSFPGVREWSEELAALF 994 Query: 105 ASQDGEGQDEFVFQISKDEYLDLLFEDLALPNLKQNQQRQLTEYKTHRAGYTANGVPANI 164 E + + L L A P++ +L AG A + Sbjct: 995 GPGVREEVLAAAAVTGRQDVLAELDPAAATPSV------ELLRTILRYAG---GLPEARL 1045 Query: 165 SVVRSLQNSLARRTAMTAGKRRELHALEENLAIISNSEPAQLLEEERLRKEIAELRAKIE 224 + +R L L R LA + +L LR +A R + Sbjct: 1046 AALRPLVRHLVDELTRQLTTRLRPALTGTMLARPTRRPGGRLDLPRTLRANLATARRTAD 1105 Query: 225 RVPFIDTFDLRYKNYEKRPDPSSQAVMFCLMDVSGSMDQSTKDMAKRFYILLYLFLSRTY 284 + +++ R S+ + + DVSGSM+ A + L + Sbjct: 1106 GTVQVIPQKPVFRS---RARRSADWRLILVTDVSGSME------ASTIWSALTASVLAGV 1156 Query: 285 KNVEVVYIRHHTQAKEVDEHEFFYS------QETGGTIVSSALKLMDEVVK 329 + ++ T+ ++ H GGT +++ L+ +++ Sbjct: 1157 PTLSTHFLAFSTEVVDLTGHVHDPLSLLLEVSVGGGTHIAAGLRHARGLIE 1207 >UniRef50_D2RGD5 von Willebrand factor type A n=1 Tax=Archaeoglobus profundus DSM 5631 RepID=D2RGD5_ARCPR Length = 411 Score = 50.2 bits (118), Expect = 2e-04, Method: Composition-based stats. 
Identities = 38/235 (16%), Positives = 88/235 (37%), Gaps = 20/235 (8%) Query: 118 QISKDEYLDLLFEDLALPNLKQNQQRQLTEYKTHRAGYTANGVPANISVVRSLQNSLARR 177 +S E ++ +E++ + +LK+ + + GY + R+L + + Sbjct: 114 DLSTSELVNYFYEEI-IEDLKKEGYLE----DDYFRGYKFT-----KNAERALSKKIL-Q 162 Query: 178 TAMTAGKRRELHALEENLAIISNSEPAQLLEEERLRKEIAELRAKIERVP-FIDTFDLRY 236 ++ + E +S+ +++E + LR + + V + LR+ Sbjct: 163 LSLQDLTGEDFGEHETEKTGVSSFLKNEIVEYDELRHSYDSIDLQETLVKCALRDPSLRF 222 Query: 237 KNYEKRPDP---SSQAVMFCLMDVSGSMDQSTKDMAKRFYILLYLFLSRTYKNVEVVYIR 293 + + V L+DVS SM A + L + ++ E+ + Sbjct: 223 DERDLVAREGKHMEKCVYVMLIDVSDSMRGRRIVGALESALALRKVIKKS-NMDELHVVA 281 Query: 294 HHTQAKEVDEHEFFYSQETGGTIVSSALKLMDEVVKERYNPAQWNIYAAQASDGD 348 + + +++ + E + G T + ALK E++K+R +DG+ Sbjct: 282 FNHRVRKIKDEEILNLRTRGRTDIGLALKTAREIIKKRRGSG----VIFLITDGE 332 >UniRef50_A0CHZ1 Chromosome undetermined scaffold_185, whole genome shotgun sequence n=1 Tax=Paramecium tetraurelia RepID=A0CHZ1_PARTE Length = 265 Score = 50.2 bits (118), Expect = 2e-04, Method: Composition-based stats. Identities = 43/207 (20%), Positives = 80/207 (38%), Gaps = 24/207 (11%) Query: 163 NISVVRSLQNSLARRTAMTAGKRRELHALEENLAIISNSEPAQLLEEERLRKEIAELRAK 222 N SL +++ TA + ++ + ++P E+ L EI L+ Sbjct: 55 NFGYQHSLGPKYSQQLPQTAISQEIFDDDDQVQTNLVQAKPNMYDLEKELIFEIKTLQKM 114 Query: 223 IE-------RVPFIDTFDLRYKNYEKRPDPSSQAVMFCLMDVSGSMDQSTKDMAKRFYIL 275 I+ ++P I + + + + CL+D S SM+ S + K+ + Sbjct: 115 IKLSKISTQQLPGIISIKTK-DQLNDQDLNRVGVDLICLIDKSSSMNGSKIETVKQSLKV 173 Query: 276 LYLFLSRTYKNVEVVYIRHH---TQAKEVDEHE-------FFYSQETGGTIVSSALKLMD 325 L FLS + +++ H T K + E + GGT +SSA ++ Sbjct: 174 LLTFLSNQDRLQLIIFNTHAKRLTPLKRITEDNKLYFTQMIDQIKSDGGTQISSATQIAI 233 Query: 326 -EVVKERYNPAQWNI-YAAQASDG-DN 349 ++ +Y + N+ SDG DN Sbjct: 234 SQLKGRKY---RNNVSSVFLLSDGQDN 257 >UniRef50_Q1D6F9 von Willebrand factor type A domain protein n=2 Tax=Cystobacterineae RepID=Q1D6F9_MYXXD Length = 592 Score = 49.8 bits (117), Expect = 2e-04, Method: Composition-based stats. 
Identities = 35/265 (13%), Positives = 70/265 (26%), Gaps = 31/265 (11%) Query: 175 ARRTAMTAGKRRELHALEENLAIISNSEPAQLLEEERLRKEIAELRAKIERVPFIDTFDL 234 R +EE + E + + + + Sbjct: 174 RRYLVNGQLPPASAVRVEEFVNYFKFRYAPP--ETGAFAVHLEGAPSPFDAKRHFLRVGV 231 Query: 235 RYKNYEKRPDPSSQAVMFCLMDVSGSMDQSTK-DMAKRFYILLYLFLSRTYKNVEVVYIR 293 + K + A + L+D SGSM K +A+ + L+ V Y Sbjct: 232 QGKVVSRSQRKP--AHLVFLVDTSGSMHSEDKLPLAREAIKVAVKNLNENDTVAIVTYAG 289 Query: 294 HH---------TQAKEVDEHEFFYSQETGGTIVSSALKLMDEVVKERYNPAQWNIYAAQA 344 + T AK + GGT + S ++L ++ + + Sbjct: 290 NTRDVLPPTPATDAKSI-HAALDSLTAGGGTAMGSGMELAYRHAVKKASGSVV-SRVVVL 347 Query: 345 SDGD-----NWADDSPLCHEILAKKLLPVVRYYSYIEITRRAHQTLWREYEHLQSTFDNF 399 +DGD N + + + + K V + L + + + Sbjct: 348 TDGDANIGRNVSAN--AMLDSIHKYTAEGVTLTTVGFGMGNYRDDLMEKLADKGNGNCFY 405 Query: 400 AMQHIRDQDDIYPVFRELFHKQNAT 424 D +++F Q Sbjct: 406 V--------DSLREAKKVFETQLTG 422 >UniRef50_A4C730 Putative uncharacterized protein n=1 Tax=Pseudoalteromonas tunicata D2 RepID=A4C730_9GAMM Length = 684 Score = 49.8 bits (117), Expect = 2e-04, Method: Composition-based stats. Identities = 31/179 (17%), Positives = 60/179 (33%), Gaps = 23/179 (12%) Query: 251 MFCLMDVSGSMDQSTKDMAKR--FYILLYLFLSRTYKNVEVVYIRHHTQAKEVDEHEF-- 306 + ++D SGSM + + AK F+ L L ++ +E + A+ + ++F Sbjct: 332 VIFVIDTSGSMHGESLEQAKSALFFALANLDPQDSFNIIEFNSKVNALNAQALPANDFNI 391 Query: 307 -------FYSQETGGTIVSSALKLMDEVVKERYNPAQWNIYAAQASDGDNWADDSPLCHE 359 + + GGT + A + + + A + +DG + + Sbjct: 392 RRARNFVYGLKADGGTEIGLAFEQVLD----NSEHADYLRQIVFLTDG---SISNETEVF 444 Query: 360 ILAKKLLPVVRYYSYIEITRRAHQTLWREYEHLQSTFDNFAMQHIRDQDDIYPVFRELF 418 K L R ++ I + L F I D D+ + LF Sbjct: 445 AQIKGSLGDSRIFTIG-IGSAPNSYFMTRAATLGRGTFTF----IGDVTDVQRTMKNLF 498 >UniRef50_A6BYV9 Putative uncharacterized protein n=1 Tax=Planctomyces maris DSM 8797 RepID=A6BYV9_9PLAN Length = 1197 Score = 49.4 bits (116), Expect = 3e-04, Method: Composition-based stats. 
Identities = 62/422 (14%), Positives = 120/422 (28%), Gaps = 65/422 (15%) Query: 24 RRYKAQIKQSISEA-INKRSVTDVDSGESVSIPTEDISEP-------MFHQ-------GR 68 R+ + I + + + ++ D + P P + Sbjct: 791 RKGREAIANLFPDFPLRQLNIKDPFAK-----PIRVAEAPGEIPLADRWRLILGVKGCST 845 Query: 69 GGLRHRVHPGNDHFVQNDRIERPQGGG--GGSGSGQGQASQDGEGQDEFVFQISKDEYLD 126 + + + ++R R G G + A E + KD + Sbjct: 846 PKSQQVAGTLDQLYGGSEREGRGLQGDLASDRGGTEAAAPSVREWISDVERLFGKDVCEE 905 Query: 127 LLFEDLA------LPNLKQNQQRQLTEYKTHRAGYTANGVPANISVVRSLQNSLARRTAM 180 +L E L +L R E + ++R L ++ R A Sbjct: 906 VLGEAAVNGRAAVLEHLNHATVRPSVELLEQVLSLRGALSERELGLLRKLARNITERMAK 965 Query: 181 TAGKRRELHALEENLAIISNSEPAQLLEEERLRKEIAELRAKIERVPFIDTFDLRYKNYE 240 R ++A + +L L + K + I L Y+ Sbjct: 966 QLANRLRPALHGLSIARPTRRRSPRLDFARTLNSNLHTAYRKSDGRISIAPTRLVYRLPA 1025 Query: 241 KRPDPSSQAVMFCLMDVSGSMDQSTKD---MAKRFYILLYLFLSRTYKNVEVVYIRHHTQ 297 KR + ++DVSGSM+ S MA F L ++V + TQ Sbjct: 1026 KRQM---DWHLIFVVDVSGSMEASVIYSSMMAAIFSAL---------PAIDVKFFAFSTQ 1073 Query: 298 AKEVD---EHEFFYSQE---TGGTIVSSALKLMDEVVKERYNPAQWNIYAAQASDGDNWA 351 + E E GGT + L+ E + NP++ + +D Sbjct: 1074 VIDFTGRVEDPLSLLMEIQIGGGTHIGLGLRAARESIT---NPSRTLVVL--VTD----F 1124 Query: 352 DDSPLCHEILAKKLL---PVVRYYSYI----EITRRAHQTLWREYEHLQSTFDNFAMQHI 404 ++ E+L++ ++ + E R H + + + Sbjct: 1125 EEGVSVPELLSEVVMLSSSGAKLIGLAALNDEAKPRYHAGTAAAVVQAGMPVAAVSPERL 1184 Query: 405 RD 406 + Sbjct: 1185 AE 1186 >UniRef50_Q23KK4 von Willebrand factor type A domain containing protein n=1 Tax=Tetrahymena thermophila SB210 RepID=Q23KK4_TETTH Length = 1085 Score = 49.4 bits (116), Expect = 3e-04, Method: Composition-based stats. 
Identities = 38/260 (14%), Positives = 88/260 (33%), Gaps = 46/260 (17%) Query: 180 MTAGKRRELHALEENLAIISNSEPAQLLEEERLRKEIAELRAKIERVPFIDTFDLRYKNY 239 + + + + E A +Q + + I L +K + +Y +Y Sbjct: 382 LKQLEEEKAKLIREKSAFWDKKNSSQEARLSQYSQSINSLNSKYPLGKMCSIVEKKYFHY 441 Query: 240 EKRPDPSSQAVMFCLMDVSGSMDQSTKDMAKRFYILLYLFLSRTYKNVEVVYIRHHTQAK 299 + + D SGS + + L Y + YI+ + + Sbjct: 442 ------------YFIQDESGSFSNDHQYAIQGVAQLFNRIKPNDY----ITYIKFDSSSH 485 Query: 300 E---------VDEHEFFYSQE---TGGTIVSSALKLMDEVVKERYNPAQWNIYAAQASDG 347 + + +F + GGT SA + + + ++ +Y+ ++ + +DG Sbjct: 486 VDIPKTLKSSLSQGDFISKIQKCRGGGTNFQSAFQTLLQQIQSKYDQQEYPVVIF-ITDG 544 Query: 348 -DNWADDSPLCHEILAKKLLPVVRYY--SYIEITRRAHQTLWREYEHLQSTFDNFAMQHI 404 DN DS L + +Y Y + + +++ + F+N + Sbjct: 545 QDNTDLDS---IISQITSLCQDIVFYTIGYGSVNEKY-------LKNITNKFNN----TV 590 Query: 405 RDQDDIYPVFRELFHKQNAT 424 ++ +I +LF+ +N Sbjct: 591 GEKKEINGKPVDLFYVKNTP 610 >UniRef50_D2S019 ATPase associated with various cellular activities AAA_5 n=1 Tax=Haloterrigena turkmenica DSM 5511 RepID=D2S019_9EURY Length = 665 Score = 49.4 bits (116), Expect = 3e-04, Method: Composition-based stats. Identities = 38/191 (19%), Positives = 65/191 (34%), Gaps = 16/191 (8%) Query: 168 RSLQNSLARRTAMTAGKRRELHALEENLAIISNSEPAQLLEEERLRKEIAELRAKIERVP 227 R+++ +LARRT R + + + + L + Sbjct: 385 RTMREALARRTPSKVDVRSGRYVRARDSESVDDVAIDATLRAAAPHQPARRETDDSSSGI 444 Query: 228 FIDTFDLRYKNYEKRPDPSSQAVMFCLMDVSGS-MDQSTKDMAKRFYILLYLFLSRTYKN 286 I+ DLR K E+R ++A++ ++D SGS M KR + L R Sbjct: 445 AIEPKDLRQKIRERR----AEALVVFVVDASGSVMSGRQMFETKRGILSLVEDAYRARDR 500 Query: 287 VEVVY--------IRHHTQAKEVDEHEFFYSQETGGTIVSSALKLMDEVV--KERYNPAQ 336 V VV + T+ G T ++ L E+V + R + Sbjct: 501 VAVVVFREEGAFTLVEPTRNLSAARRAVSKLTVGGNTPLAHGLVEAYELVERERRRDEDL 560 Query: 337 WNIYAAQASDG 347 + + SDG Sbjct: 561 YPLVVL-FSDG 570 >UniRef50_O26551 Magnesium chelatase subunit ChlI n=1 Tax=Methanothermobacter thermautotrophicus str. 
Delta H RepID=O26551_METTH Length = 591 Score = 49.0 bits (115), Expect = 3e-04, Method: Composition-based stats. Identities = 31/160 (19%), Positives = 54/160 (33%), Gaps = 15/160 (9%) Query: 240 EKRPDPSSQAVMFCLMDVSGSMDQSTK-DMAKRFYILL--YLFLSRT------YKNVEVV 290 EK S+A+ ++D S SM K AK LL + R ++ E Sbjct: 418 EKVRIGKSRALYIIVLDTSSSMRLERKIKFAKTVSWLLLRDSYEKRNRIALIAFRGYEAN 477 Query: 291 YIRHHTQAKEVDEHEFFYSQETGGTIVSSALKLMDEVVKERYNPAQWNIYAAQASDG--D 348 + T E E + G T ++ AL+L EV + A + SDG + Sbjct: 478 LVVEPTSNLETVEEALEGLRSGGRTPLTPALRLAAEVASSSSDEACTAVVI---SDGRCN 534 Query: 349 NWADDSPLCHEILAKKLLPVVRYYSYIEITRRAHQTLWRE 388 + + + + + L + ++ E Sbjct: 535 VFINSNLEEDMNMLETELRNLNLL-FVNAEPEKRSLGILE 573 >UniRef50_Q9U7P4 Putative uncharacterized protein (Fragment) n=1 Tax=Eufolliculina uhligi RepID=Q9U7P4_9CILI Length = 494 Score = 49.0 bits (115), Expect = 3e-04, Method: Composition-based stats. Identities = 29/182 (15%), Positives = 58/182 (31%), Gaps = 29/182 (15%) Query: 251 MFCLMDVSGSMDQSTKDMAKRFYILLYLFLSRTYKNVEVVYIRHHTQAKEVDE------- 303 + C++DVSGSM + + + LS + + I A ++ Sbjct: 91 IVCVIDVSGSMQGEKIQLVQTTLNFMVERLSPADR---ICLISFSNDATKISRLVQMSPK 147 Query: 304 ------HEFFYSQETGGTIVSSALKLMDEVVKERYNPAQWNIYAAQASDG-DNWADDSPL 356 +GGT + L+ + +++R Q + SDG DN Sbjct: 148 GKKQLKSMIPRLVASGGTNIVGGLEYGLQALRQRRTINQLSSIIL-LSDGQDNNGTTVLQ 206 Query: 357 CHEILAKKLLPVVRY----YSYIEITRRAHQTLWREYEHLQSTFDNFAMQHIRDQDDIYP 412 + ++ Y + Y TL ++ + ++D++ I Sbjct: 207 RAKATMDSIVIRDDYSVHTFGYG---HGHDSTLLNALAEPKNGAFYY----VKDEETIAT 259 Query: 413 VF 414 F Sbjct: 260 AF 261 >UniRef50_Q114A2 von Willebrand factor, type A n=3 Tax=Cyanobacteria RepID=Q114A2_TRIEI Length = 1204 Score = 49.0 bits (115), Expect = 3e-04, Method: Composition-based stats. 
Identities = 30/196 (15%), Positives = 66/196 (33%), Gaps = 26/196 (13%) Query: 191 LEENLAIISNSEPAQLLEEERLRKEIAELRAKIERVPFIDTFDLRYKNYEKRPDPSSQAV 250 L + + Q E + + L F++ + R E +P ++ Sbjct: 577 LPLVVDSYLQEKQKQEAREAQAKAAPERLLEPE----FVENPEQRLPEPEFVENPENRCP 632 Query: 251 MFCLMDVSGSMDQS---TKDMAKRFYILLYLFLSRTYKNVEVVYIRHHTQAKEVDEHEFF 307 + L+D S SM + + + VE+ I +++ + V +F Sbjct: 633 IILLLDTSYSMSGEAITELNQGVKIFQASVKEDELASLRVEIAVITFNSEIEVV--QDFV 690 Query: 308 --------YSQETGGTIVSSALKLMDEVVKER------YNPAQWNIYAAQASDG---DNW 350 + +G T + A++ E++++R + + + +DG D W Sbjct: 691 TVDKFIPKTLEASGVTHMGKAIEKALELLEKRKQDYKNSDIQYYRPWIFLITDGQPTDTW 750 Query: 351 ADDSPLCHEILAKKLL 366 D + E + L Sbjct: 751 QDAAKKIEEAETNRKL 766 >UniRef50_B9SJS6 Protein binding protein, putative n=6 Tax=Magnoliophyta RepID=B9SJS6_RICCO Length = 540 Score = 48.6 bits (114), Expect = 4e-04, Method: Composition-based stats. Identities = 38/209 (18%), Positives = 73/209 (34%), Gaps = 14/209 (6%) Query: 156 TANGVPANISVVRSLQNSLARRTAMTAGKRRELHALEENLAIISNSEPAQLLEEERLRKE 215 G AN+ + L + + + +E + S P + +LR Sbjct: 10 RRRGRRANLLAGGEAEQKLPLVPLLPPPLKMSSNDDDEKIVTRSRPTPPIVPARVKLR-S 68 Query: 216 IAELRAKIERVPFIDTFDLRYKNYEKRPDPSSQAVMFCLMDVSGSMDQSTKDMAKRFYIL 275 I A +E +L + P + ++DVS SM+ + K + Sbjct: 69 INNDMAPLEESKLKVMLELTGGDSSSYGRPGLD--LVAVLDVSRSMEGDKMEKMKTAMLF 126 Query: 276 LYLFLSRTYKNVEVVY---------IRHHT-QAKEVDEHEFFYSQETGGTIVSSALKLMD 325 + L T + V + +R T +++E E+ G T +++ L+ Sbjct: 127 IIKKLGPTDRLSIVTFSGGANRLCPLRQTTGKSQEEFENLINGLNADGATNITAGLQTAL 186 Query: 326 EVVKERYNPAQWNIYAAQASDGD-NWADD 353 +V+K R + + SDG+ N D Sbjct: 187 KVLKGRSFNGERVVGIMLMSDGEQNAGSD 215 >UniRef50_Q23AA2 Putative uncharacterized protein n=1 Tax=Tetrahymena thermophila SB210 RepID=Q23AA2_TETTH Length = 968 Score = 48.6 bits (114), Expect = 4e-04, Method: Composition-based stats. 
Identities = 34/207 (16%), Positives = 66/207 (31%), Gaps = 25/207 (12%) Query: 222 KIERVPFIDTFDLRYKNYEKRPDPSSQAVMFCLMDVSGSMDQSTKDMAKRFYILLYLFLS 281 + +R+ + R K S + ++D SGSM K +A + S Sbjct: 2 EEKRIYYKIPLK-RVKATTTTEKGGSNLHIVGIIDASGSMSSWWKWIA-------EFWNS 53 Query: 282 RTYKNVEVVYIRHHTQAKEVDEH---EFFYSQETGGTIVSSALKLMDEVVKERYNPAQWN 338 + + I A+ + + G T + A K+ + V+ P + Sbjct: 54 ESIPKENLHTITFDGTARHCQSNVLSTRIHDHGGGMTAIPEAFKMFETVLD--SIPVNES 111 Query: 339 IYAAQASDGDNWADDSPLCHEILAKKLL-----PVVRYYSYIEITRRAHQTLWREYEHLQ 393 + A SDG D++ E KKL + + + R + Sbjct: 112 VTAIFISDG---QDNNLNTLEERMKKLKGNHENRKINFICLGIESGFPTFLSMRLRQLYH 168 Query: 394 STFDNFAMQHIRDQDDIYPVFRELFHK 420 +N ++ + Y + F+K Sbjct: 169 QGDENIPALYLIE----YVSEKAFFNK 191 >UniRef50_A0CDA1 Chromosome undetermined scaffold_17, whole genome shotgun sequence n=1 Tax=Paramecium tetraurelia RepID=A0CDA1_PARTE Length = 604 Score = 48.6 bits (114), Expect = 4e-04, Method: Composition-based stats. Identities = 30/164 (18%), Positives = 59/164 (35%), Gaps = 22/164 (13%) Query: 251 MFCLMDVSGSMDQSTKDMAKRFYILLYLFLSRTYKNVEVVY----IRHHTQAKEVDEHEF 306 + C++D SGSM +M K+ +L FL + + + R + DE++ Sbjct: 187 LLCVIDRSGSMSGEKIEMVKQTLNILLNFLGPKDRLCLIQFDDTCQRLTNLRRVTDENKT 246 Query: 307 FY------SQETGGTIVSSALKLMDEVVKERYNPAQWNIY-AAQASDGDNWADDSPLCHE 359 +Y GGT++ ++ + + +Y + N+ SDG + Sbjct: 247 YYSDIISKIYANGGTVIGLGTQMALKQI--KYRKSVNNVTAIFVLSDGQD-----EAAIS 299 Query: 360 ILAKKLL---PVVRYYSYIEITRRAHQTLWREYEHLQSTFDNFA 400 L K+L + +S+ L + +L F Sbjct: 300 SLQKQLAYYKQTLTIHSFG-FGSDHDAKLMTKISNLGKGSFYFV 342 >UniRef50_A0KNK1 Putative uncharacterized protein n=1 Tax=Aeromonas hydrophila subsp. hydrophila ATCC 7966 RepID=A0KNK1_AERHH Length = 552 Score = 48.2 bits (113), Expect = 5e-04, Method: Composition-based stats. 
Identities = 35/224 (15%), Positives = 70/224 (31%), Gaps = 22/224 (9%) Query: 167 VRSLQNSLARRTAMTAGKRRELHALEENLAIISNSEPAQLLEEERLRKEIAELRAKIERV 226 R+ + ++ A + + + + L Sbjct: 319 GRAYISEEKKKQA--RIPHASKSEVHGTHRSEDLARVLPTELLNLEDEALETLFYARFLE 376 Query: 227 PFIDTFDLRYKNYEKRPDPSSQ----AVMFCLMDVSGSMDQSTKDMAKRFYILLYLFLSR 282 + T++L+ + + +D SGSM + A+ + + L + Sbjct: 377 RNLMTYELQGTTCTSGEQLELEQKRTGPVVACLDTSGSMSGAPLLKARALLLAVSAVLQQ 436 Query: 283 TYKNVEVVYIRHHTQAKE--VDEHE-------FFYSQETGGTIVSSALKLMDEVVK--ER 331 +++ VV + + +E + E F GGT + L E+++ + Sbjct: 437 EARSLHVVLFGDNGELREYAIHEENSASGLLHFLRQGFGGGTDFETPLNRACEIIRDAKE 496 Query: 332 YNPAQWNIYAAQASDGDNWADDSPLCHEILAKKLLPVVRYYSYI 375 Y A SDGD D + H KK+L YS + Sbjct: 497 YEKAD----ILMISDGDCVLSDDYIEHLQTRKKIL-DCSIYSVL 535 >UniRef50_A3PUP3 von Willebrand factor, type A n=1 Tax=Mycobacterium sp. JLS RepID=A3PUP3_MYCSJ Length = 233 Score = 48.2 bits (113), Expect = 6e-04, Method: Composition-based stats. Identities = 36/172 (20%), Positives = 59/172 (34%), Gaps = 17/172 (9%) Query: 242 RPDPSSQAVMFCLMDVSGSMDQSTKDMAKRFYILLYLFLSRTY---KNVEVVYIRHHTQA 298 +P + L DVSGSM +R + +L K VEV + T A Sbjct: 12 DANPDPRVACVVLADVSGSMQGEPIAALERGFAAFTRYLQNEVLASKRVEVAVVTFGTVA 71 Query: 299 KE-VDEHEFFYSQ-----ETGGTIVSSALKLMDEVVKER---YNPAQWNIY---AAQASD 346 V E Q +G T +++ + L +++++R Y A Y +D Sbjct: 72 TVLVPMQEARTLQPVAFTASGTTNMAAGIHLALDILEDRKHAYKAAGLQYYRPWILLLTD 131 Query: 347 GD-NWADDSPLCHEILAKKLLPVVRYYSYIEITRRAHQTLWREYEHLQSTFD 397 G N + A + V ++ R +Q L R +S Sbjct: 132 GKPNLDGFDEAVARLNAVESARGVTVFAVGAGPRVDYQQLGR-LSLQRSPAP 182 >UniRef50_B9MS10 von Willebrand factor type A n=2 Tax=Anaerocellum thermophilum DSM 6725 RepID=B9MS10_ANATD Length = 1188 Score = 48.2 bits (113), Expect = 6e-04, Method: Composition-based stats. 
Identities = 21/119 (17%), Positives = 45/119 (37%), Gaps = 18/119 (15%) Query: 251 MFCLMDVSGSM-----DQSTKDMAKRFYILLYLFLSRTYKNVEVV------YIRHH-TQA 298 + ++D SGSM + K AK F L VV Y+ T Sbjct: 500 LVFVLDSSGSMSWNDPNGYRKIAAKSFVDALIQG-----DRAAVVDFDNFGYLLQPLTTD 554 Query: 299 KEVDEHEFFYSQETGGTIVSSALKLMDEVVKERYNPAQWNIYAAQASDGDNWADDSPLC 357 + ++ GGT ++ +++ ++ + R + + + +DG+ + D++ Sbjct: 555 FQAVKNAIDRIDSWGGTNIAEGIRIANQQLISRSSEDRIKVIIL-LTDGEGYYDNNLTT 612 >UniRef50_Q97HZ9 Predicted metal-dependent peptidase n=1 Tax=Clostridium acetobutylicum RepID=Q97HZ9_CLOAB Length = 456 Score = 47.8 bits (112), Expect = 7e-04, Method: Composition-based stats. Identities = 35/215 (16%), Positives = 63/215 (29%), Gaps = 31/215 (14%) Query: 140 NQQRQLTEYKTHRAGYTANGVPANISVVRSLQNSLARRTAMTAGKRRELHALEE--NLAI 197 ++ + G I L+N + R +A +R L++ A Sbjct: 200 KNFDEM-NIHKTWSESYNRGYENQIDE---LKNKIIRNSAKGRIPKRVQEYLDDMNKKAE 255 Query: 198 ISNSEPAQLLEEERLRKEIAELRAKIERVPFIDTFDLRYKNYEKRPDPSSQAVMFCLMDV 257 IS + + + K R P+ DLR K + + +D+ Sbjct: 256 ISWQMYLKKAIGTLPKGYKKTITRKDRRQPY--RMDLRGKLSDHIIK------IVVAIDI 307 Query: 258 SGSMDQSTKDMAKRFYILLYLFLSRTYKNVEVVYIRHHTQAKEV-----DEHEFFYSQET 312 SGSM + D A + KN E+ I + + Sbjct: 308 SGSMTDAEIDAAMTEIFDIL-----KNKNYELTIIECDNIVRRMYRVSKPRDMKKKLDTK 362 Query: 313 GGTIVSSALKLMDEVVKERYNPAQWNIYAAQASDG 347 GGT S + + + + + +DG Sbjct: 363 GGTSFSPVFEYLHK-------NRMEDCFLIYFTDG 390 >UniRef50_B4X134 Vault protein inter-alpha-trypsin n=1 Tax=Alcanivorax sp. DG881 RepID=B4X134_9GAMM Length = 657 Score = 47.8 bits (112), Expect = 7e-04, Method: Composition-based stats. 
Identities = 51/318 (16%), Positives = 89/318 (27%), Gaps = 36/318 (11%) Query: 131 DLALP-NLKQNQQRQLTEYKTH----RAGYTANG--VPANISVVRSLQNSLARRTAMTAG 183 L LP L T R A G A + V ++ AR + + Sbjct: 168 QLRLPLTLTPRFTPPTEAPHTLDSLLRNTVAAPGGTADAGTASVHIDLDAGARLATLGSP 227 Query: 184 KRRELHALEENLAIISNSEPAQLLEEERLRKEIAELRAKIERVPFIDTFD-LRYKNYEKR 242 + I+ A ++ + L E + F + D Y Sbjct: 228 SHAIHYQRHGRRYTITPKAGAIAMDRDLLLNWELEDTGEPLVTRFHEEIDGEHYALLMVV 287 Query: 243 PDPSSQAVMF-----CLMDVSGSMDQSTKDMAKRFYILLYLFLSRTYKNVEVVYIRHHTQ 297 P + Q ++D SGSM + AK L L + + HT Sbjct: 288 PPKTGQVTALPRETLFIIDSSGSMGGAPMRQAKASLHLALQRLKPGDRFNITDFDSQHTL 347 Query: 298 AKEVD----------EHEF-FYSQETGGTIVSSALKLMDEVVKERYNPAQWNIYAAQASD 346 E +F Q +GGT + A L + + + +D Sbjct: 348 LFETPVTVSDNSRQQAQDFVDGLQASGGTHMLPA--LSATLSQPAS--DGYLRQVIFITD 403 Query: 347 GDNWADDSPLCHEILAKKLLPVVRYYSYIEITRRAHQTLWREYEHLQSTFDNFAMQHIRD 406 G + L ++L R ++ + + F + +I D Sbjct: 404 G--AVGNESGIFRALHQQLGE-ARLFTVGI-----GSAPNSHFMTRAAQFGRGSFTYIND 455 Query: 407 QDDIYPVFRELFHKQNAT 424 Q+ + LF + + Sbjct: 456 QNQVQQGMDTLFRRLESP 473 >UniRef50_UPI0001C38CFE von Willebrand factor, type A n=1 Tax=Arthrospira platensis str. Paraca RepID=UPI0001C38CFE Length = 489 Score = 47.8 bits (112), Expect = 7e-04, Method: Composition-based stats. Identities = 24/143 (16%), Positives = 48/143 (33%), Gaps = 22/143 (15%) Query: 243 PDPSSQAVMFCLMDVSGSMDQSTKDMAKRFYILLYLFLSRTYKNVEVVYIRHHTQAKEVD 302 P + + L+D SGSM S +R + K ++ + ++A V Sbjct: 49 TKPQA---VVMLIDTSGSMSGSKLPEVQRAASEFVS--RQNLKRDDLAVVEFSSRASVVA 103 Query: 303 ---------EHEFFYSQETGGTIVSSALKLMDEVVKERYNPAQWNIYAAQASDGDNWADD 353 + GGT +S L V++ P NI +DG+ + Sbjct: 104 DFTRDERELQQAIARLSAWGGTNLSEGFNLATSVLQNSDRPG--NILLF--TDGE---PN 156 Query: 354 SPLCHEILAKKLLPV-VRYYSYI 375 + +A+++ + + Sbjct: 157 NRRMAASIAQQIRASGINLVAVG 179 >UniRef50_C9LLI0 Magnesium-chelatase, subunit D/I family n=1 Tax=Dialister invisus DSM 15470 RepID=C9LLI0_9FIRM Length = 640 Score = 47.8 bits (112), Expect = 8e-04, Method: Composition-based stats. 
Identities = 52/357 (14%), Positives = 109/357 (30%), Gaps = 48/357 (13%) Query: 67 GRGGLRHRVHPGNDHFVQNDRIERPQGGGGGSGSGQGQASQDGEGQDEFVFQISKD-EYL 125 +++ D E P G G G + D D F D E + Sbjct: 296 DPDKNNTEKDQESENNDPGDDGEDPSGNHIVEAMGNG-GNNDESSSDMPEFPQGADDEKV 354 Query: 126 DLLFEDLALPNL-KQNQQRQLTEYKTHRAGYTANGVPANISVVRSLQNSLARRTAMTAGK 184 D + LP L QN+++Q T + + T + R K Sbjct: 355 DSADLHVTLPPLWIQNEKKQFTPKGSGKRHITRS------------DERQGRYVKAGIPK 402 Query: 185 RRELHALEENLAIISNSEPAQLLEEERLRKEIAELRAKIERVPFIDTFDLRYKNYEKRPD 244 + A + + P Q + + I D+R ++R Sbjct: 403 GETHDIAID--ATLRAAAPHQKGRQSNGCAVV------------IRHEDIR---RKEREK 445 Query: 245 PSSQAVMFCLMDVSGSMDQSTKDMAKR---FYILLYLFLSRT------YKNVEVVYIRHH 295 + + L+D SGSM + A + F +L + R ++ + Sbjct: 446 RTGN-IFLFLVDASGSMGARERMKAVKGVVFKMLADAYQKRDRVGMIAFRRDRAEVLLPI 504 Query: 296 TQAKEVDEHEFFYSQETGGTIVSSALKLMDEVVKERYNPA-QWNIYAAQASDG---DNWA 351 T++ E + + G T ++ L ++++ Y + +DG ++ Sbjct: 505 TRSIEFAQKKLAALPTGGKTPLAQGLIKAEDMLDRLYKQDPLQDPVLILITDGRATNSLN 564 Query: 352 DDSPLCHEIL--AKKLLPVVRYYSYIEITRRAHQTLWREYEHLQSTFDNFAMQHIRD 406 ++ + L A+++ + I+ + + + F + I + Sbjct: 565 KNTDPVRDALSEAERIGHRHMLAAVIDTESGFIKLGLAKELAQKMGASYFHVDKISE 621 >UniRef50_A6GAI6 Putative uncharacterized protein n=1 Tax=Plesiocystis pacifica SIR-1 RepID=A6GAI6_9DELT Length = 560 Score = 47.5 bits (111), Expect = 9e-04, Method: Composition-based stats. Identities = 21/119 (17%), Positives = 36/119 (30%), Gaps = 17/119 (14%) Query: 241 KRPDPSSQAVMFCLMDVSGSMDQSTKDMAKRFYILLYLFLSRTYKNVEVV---------- 290 P + +D SGSM + ++ + + L V + Sbjct: 213 PEERPPMNVTLV--LDTSGSMAGTPIELLRETSRAIAAQLKLG-DTVSICEWDTSNDWTL 269 Query: 291 --YIRHHTQAKEVDEHEFFYSQETGGTIVSSALKLMDEVVKERYNPAQWNIYAAQASDG 347 Y E+ + GGT + L+ E+ + Y+P N SDG Sbjct: 270 AGYAV-TGPNDELLLEKINDVVHGGGTNLYGGLESGYELAQMVYDPDAINRLVL-ISDG 326 >UniRef50_B5JCH3 Putative uncharacterized protein (Fragment) n=1 Tax=Octadecabacter antarcticus 307 RepID=B5JCH3_9RHOB Length = 197 Score = 47.1 bits (110), Expect = 0.001, Method: Composition-based stats. 
Identities = 18/97 (18%), Positives = 33/97 (34%), Gaps = 7/97 (7%) Query: 170 LQNSLARRTAMTAGKRRELHALEENLAIISNSEPAQLLEEERLRKEIAELRAKIERVPFI 229 +++SL R + +EE + + PA E R I + Sbjct: 108 IRSSLTR----GQLPPTDAVRIEEMINYFPYAYPAPEGE-APFRPTINVFETPWNADTQL 162 Query: 230 DTFDLRYKNYEKRPDPSSQAVMFCLMDVSGSMDQSTK 266 ++ + P + L+D SGSM+ + K Sbjct: 163 VHIGIQGEMPAIEDRPPLN--LVFLIDTSGSMESADK 197 >UniRef50_D2BAS2 von Willebrand factor type A domain protein n=12 Tax=Actinomycetales RepID=D2BAS2_STRRD Length = 490 Score = 47.1 bits (110), Expect = 0.001, Method: Composition-based stats. Identities = 39/218 (17%), Positives = 70/218 (32%), Gaps = 33/218 (15%) Query: 225 RVPFIDTFDLRYKNYEKRPDPSSQ--AVMFCLMDVSGSMDQSTK-DMAKRFYILLYLFLS 281 R+P T +R ++ +P ++ A + ++DVSGSM + + D+ + L L Sbjct: 122 RMPENGTALIRVGLQTRKAEPEARRPANLTFVVDVSGSMGEPGRLDLVREALHKLVDQLG 181 Query: 282 RTYKNVEVVYIRHHTQAKEV-----------DEHEFFYSQETGGTIVSSALKLMDEVVKE 330 V +V TQA+ V T + + L Sbjct: 182 PG-DQVSIV--AFSTQARLVLSMTPATGRDQLHAAIDRLGVEDSTNLETGLTAGYAEAAR 238 Query: 331 RYNPAQWNIYAAQASDGDNWADDSPLCHEILAKKLLPVV----RYYSYIEITRRAHQTLW 386 + PA N SDG A+ + + ++ + R L Sbjct: 239 AFRPAATNRVIL-LSDG--LANTGDTTWQGILDRVAESAGRQITLLCVG-VGRDYGDQLM 294 Query: 387 REYEHLQSTFDNFAMQHIRDQDDIYPVFRELFHKQNAT 424 + + + DD R++F +Q AT Sbjct: 295 EQLADNGDGAAVY----VSSADD----ARKVFVEQLAT 324 >UniRef50_Q23FU3 von Willebrand factor type A domain containing protein n=1 Tax=Tetrahymena thermophila SB210 RepID=Q23FU3_TETTH Length = 755 Score = 47.1 bits (110), Expect = 0.001, Method: Composition-based stats. 
Identities = 25/146 (17%), Positives = 52/146 (35%), Gaps = 24/146 (16%) Query: 251 MFCLMDVSGSMDQSTKDMAKRFYILLYLFLSRTYKNVEVVYIRHHTQAK------EVDEH 304 + CL+D SGSM + ++ L L + ++ + + AK +V++ Sbjct: 50 IICLIDNSGSMAGKKAQLVRKSLKYLLKILEKGD---QISLVSFSSTAKTLCPLTQVNDE 106 Query: 305 -------EFFYSQETGGTIVSSALKLMDEVVKERYNPAQWNIYAAQASDGDNWADDSPLC 357 GGT V K + +++ R + + +DG+ DS Sbjct: 107 NKQQIKSAIKQINGQGGTFVIPGFKEVTKILNSR-KEQREQTFILLLTDGEFGDIDSGKV 165 Query: 358 HEILAK-------KLLPVVRYYSYIE 376 + + + + P + Y Y + Sbjct: 166 IQNINRLFTQSEIQKTPYIYTYGYGD 191 >UniRef50_A5B5Z1 Putative uncharacterized protein n=2 Tax=Vitis vinifera RepID=A5B5Z1_VITVI Length = 686 Score = 46.7 bits (109), Expect = 0.002, Method: Composition-based stats. Identities = 31/141 (21%), Positives = 52/141 (36%), Gaps = 22/141 (15%) Query: 251 MFCLMDVSGSMDQSTKDMAKRFYILLYLFLSRTYKNVEVVYI----------RHHTQAKE 300 + ++DVSGSM S + KR L L + + V + R +E Sbjct: 205 LVAVLDVSGSMAGSKLSLLKRAVCFLIQNLGPSDRLSIVSFSSTARRIFPLRRMSDNGRE 264 Query: 301 VDEHEFFYSQETGGTIVSSALKLMDEVVKERYNPAQWNIYAAQA--SDG-DNWADDS--- 354 +GGT + LK V++ER ++ N A+ SDG D + D+ Sbjct: 265 AAGLAINSLXSSGGTNIVEGLKKGVRVLEER---SEQNPVASIILLSDGKDTYNCDNVNR 321 Query: 355 ---PLCHEILAKKLLPVVRYY 372 C +++L + Sbjct: 322 RQTSHCASSNPRQVLEYLNLL 342 >UniRef50_A0EHG0 Chromosome undetermined scaffold_97, whole genome shotgun sequence n=2 Tax=Paramecium tetraurelia RepID=A0EHG0_PARTE Length = 533 Score = 46.7 bits (109), Expect = 0.002, Method: Composition-based stats. 
Identities = 23/125 (18%), Positives = 46/125 (36%), Gaps = 16/125 (12%) Query: 251 MFCLMDVSGSMDQSTKDMAKRFYILLYLFLSRTYKNVEVVY---IRHHTQAKEVDEH--- 304 + CL+D SGSM + ++ + FL + +++ + T+ V + Sbjct: 124 LVCLIDHSGSMQGEKIKLVRKTLKQMLTFLQPCDRLCLIMFDCKVYRLTRLMRVTQENVQ 183 Query: 305 ----EFFYSQETGGTIVSSALKLMDEVVKERYNPAQWNIY--AAQASDGDNWADDSPLCH 358 Q GGT + + +K+ ++K R N SDG + + Sbjct: 184 KFRVAISSLQARGGTDIGNGMKMALSILKHR---KYKNPVSAIFLLSDGVDEGAE-ERVR 239 Query: 359 EILAK 363 + L + Sbjct: 240 DDLIQ 244 >UniRef50_A0LHW4 Vault protein inter-alpha-trypsin domain protein n=3 Tax=Deltaproteobacteria RepID=A0LHW4_SYNFM Length = 812 Score = 46.3 bits (108), Expect = 0.002, Method: Composition-based stats. Identities = 31/182 (17%), Positives = 54/182 (29%), Gaps = 20/182 (10%) Query: 252 FCLMDVSGSMDQSTKDMAKRFYILLYLFLSRT-------YKNVEVVYIRHH--TQAKEVD 302 ++DVSGSM +++KR L L T + V A V Sbjct: 335 IFIVDVSGSMHGFPLEISKRLLTDLIGGLKPTDCFNVMLFSGDSTVMAERSVPASADNVR 394 Query: 303 E--HEFFYSQETGGTIVSSALKLMDEVVKERYNPAQWNIYAAQASDGDNWADDSPLCHEI 360 Q GGT + ALK + ++ + A+DG + E Sbjct: 395 RAVEMIGRRQGGGGTELLPALKKALSLPRKE----GVSRSMVIATDG--FVTVEEEAFE- 447 Query: 361 LAKKLLPVVRYYSYIEITRRAHQTLWREYEHLQSTFDNFAMQHIRDQDDIYPVFRELFHK 420 L + + ++ + I ++ L + F + + FR Sbjct: 448 LIRSHIGDANFFPFG-IGTSVNRMLIEGMARAGAGEP-FVITRPDEAPAGAEKFRRYIQS 505 Query: 421 QN 422 Sbjct: 506 PL 507 >UniRef50_Q6ABM1 Magnesium-chelatase 67 kDa subunit n=4 Tax=Actinomycetales RepID=Q6ABM1_PROAC Length = 654 Score = 46.3 bits (108), Expect = 0.002, Method: Composition-based stats. 
Identities = 35/201 (17%), Positives = 67/201 (33%), Gaps = 13/201 (6%) Query: 138 KQNQQRQLTEYKTHRAGYTANGVPANISVVRSLQNSLARRTAMTAGKRRELHALEENLAI 197 +Q + + AG P S R + + RR + RR + Sbjct: 350 DPRKQPSGSGEQVVAAGDPFAVRPLEPSQDRFARRACGRRLRTRSNDRRGRYVSARPTDR 409 Query: 198 ISNSEPAQLLEEERLRKEIAELRAKIERVPFIDTFDLRYKNYEKRPDPSSQAVMFCLMDV 257 + L + ++ + + + D R K R + + + ++D Sbjct: 410 PDDLALDATLRAAAVHQKSRRATERPDLAVHVKPIDWRAKVRAGR----AASCVIFVVDA 465 Query: 258 SGSMDQSTKDMAKR---FYILLYLFLSRT------YKNVEVVYIRHHTQAKEVDEHEFFY 308 SGSM + A + +LL ++ R ++ + T + EV +H Sbjct: 466 SGSMGSRGRMTASKGAVLSLLLDAYVKRDRVCLIGFRRDRAEVLVPVTSSVEVAQHGLAE 525 Query: 309 SQETGGTIVSSALKLMDEVVK 329 G T +S+ L EVV+ Sbjct: 526 LPVGGRTPLSAGLIKACEVVR 546 >UniRef50_UPI0001555D4A PREDICTED: hypothetical protein, partial n=1 Tax=Ornithorhynchus anatinus RepID=UPI0001555D4A Length = 397 Score = 45.9 bits (107), Expect = 0.003, Method: Composition-based stats. Identities = 19/103 (18%), Positives = 39/103 (37%), Gaps = 4/103 (3%) Query: 27 KAQIKQSISEAINKRSVTDVDSGESVSIPTEDISEPMFHQGRGGLRHRVHPGNDHFVQND 86 K +IK + ++ + + ++ V P + +P G + H G Sbjct: 294 KNRIKNNCNDIVTEEDTEELQRKRKV--PRPLLDKPG--DGPVRMSTLTHRGRRPEGSKA 349 Query: 87 RIERPQGGGGGSGSGQGQASQDGEGQDEFVFQISKDEYLDLLF 129 +R +G GS + + D +G D +++ E + LL Sbjct: 350 LEKRVEGEEAGSWAQGPGSGNDIDGGDSKDIRLTLMEEVLLLG 392 >UniRef50_A0E9B3 Chromosome undetermined scaffold_84, whole genome shotgun sequence n=1 Tax=Paramecium tetraurelia RepID=A0E9B3_PARTE Length = 603 Score = 45.5 bits (106), Expect = 0.004, Method: Composition-based stats. 
Identities = 25/155 (16%), Positives = 52/155 (33%), Gaps = 15/155 (9%) Query: 251 MFCLMDVSGSMDQSTKDMAKRFYILLYLFLSRTYK----------NVEVVYIRHHTQAKE 300 + C++DVSGSM ++ K L L + ++ +IR+ + K Sbjct: 122 LVCVVDVSGSMIGRKINLVKDSLRYLMKILGPEDRICIIVFTTVAHIVTSFIRNTQENKP 181 Query: 301 VDEHEFFYSQETGGTIVSSALKLMDEVVKERYNPAQWNIYAAQASDG-DNWADDSPLCHE 359 + + + T +S + ++K R + SDG D++ + Sbjct: 182 LLKKAILELKGLASTNISDGMNKALWMLKNRKYKNPVSC-IFLLSDGQDDYKGAEQRVFD 240 Query: 360 ILAKKLLP---VVRYYSYIEITRRAHQTLWREYEH 391 L + V+ + Y + +Y Sbjct: 241 QLQLLKIEEKFVIHTFGYGQDHDAYVMNQIAKYRE 275 >UniRef50_C7DHI4 von Willebrand factor type A n=1 Tax=Candidatus Micrarchaeum acidiphilum ARMAN-2 RepID=C7DHI4_9EURY Length = 705 Score = 45.5 bits (106), Expect = 0.004, Method: Composition-based stats. Identities = 29/148 (19%), Positives = 56/148 (37%), Gaps = 7/148 (4%) Query: 223 IERVPFIDTFDLRY-KNYEKRPDPSSQAVMFCLMDVSGSMDQSTKDMAKRFYILLYLFLS 281 + + + T D Y R + A ++ L+D+SGSM + AKR ++ L Sbjct: 499 RKLIRYEVTKDPAYISKPYLRHIKDAGAEIWMLLDISGSMGGQKINAAKRILGSIHDSLD 558 Query: 282 -RTYKNVEV--VYIRHHTQAKEVDEHEFFYSQETGGTIVSSALKLMDEVVKERYNPAQWN 338 Y ++ + Y T E D G T A+ +++K+ + + ++ Sbjct: 559 GSKYVHLRMFGFYGSDGTHVFEFDRKMLMNLAAMGDTPTDIAIYYAMDLMKK--DKSNFD 616 Query: 339 IYAAQASDGD-NWADDSPLCHEILAKKL 365 +DGD N ++ L + Sbjct: 617 KTLFIITDGDPNNGQETKNALNSLKNAM 644 >UniRef50_A9B2Y1 VWA containing CoxE family protein n=4 Tax=Bacteria RepID=A9B2Y1_HERA2 Length = 460 Score = 45.1 bits (105), Expect = 0.004, Method: Composition-based stats. 
Identities = 52/351 (14%), Positives = 103/351 (29%), Gaps = 49/351 (13%) Query: 85 NDRIERPQGGGGGSGSGQGQASQDGEGQDEFVFQISKDEYLDLLFEDLALPNLKQNQQRQ 144 + + G G G F +S++E ++ + L +K+ R+ Sbjct: 126 GFQPGALRQSQPGQGQASQPGGLQGGQGVGSGFNLSEEELRQVI-QGLEKDLIKRMALRE 184 Query: 145 LTEYKTHRAGYTANGVPA-------NISVVRSLQNSL---ARRTAMTAGKRRELHALEEN 194 + + A P+ N+L R + ++ L+ Sbjct: 185 VLQD----NRLAAQLTPSMAVVEQLLRDKSHLSGNALINAKRLIKQYVDELADVLRLQVM 240 Query: 195 LAIISNSE----PAQLLEEERLRKEIAELRAKIERVPFIDTFD-LRYKNYEKRPDPSSQA 249 A+ + + P ++ L++ I D L Y+ K+ P Sbjct: 241 QAVSAKIDRSVPPKRVFRNLDLKRTIWRNLTNWNSNEGRLYVDRLYYRQTAKKRTPMRMI 300 Query: 250 VMFCLMDVSGSMDQ---STKDMAKRFYILLYLFLSRTYKNVEVVYIRHHTQAKEVDE--- 303 V+ D SGSM +A F L +V++ I T+ ++ Sbjct: 301 VVV---DQSGSMVDAMVQCTILASIFAGL---------PHVDMHLIAFDTRMLDLTPWVH 348 Query: 304 ---HEFFYSQETGGTIVSSALKLMDEVVKERYNPAQWNIYA---AQASDGDNWADDSPLC 357 +Q GGT ++ AL E ++E P + + D D+ Sbjct: 349 DPFEVLLRTQLGGGTSINEALLFASEKIQE---PRKTAVVLITDFYEGGSDQVLLDTIKA 405 Query: 358 HEILAKKLLPVVRYY--SYIEITRRAHQTLWREYEHLQSTFDNFAMQHIRD 406 +PV Y + L + + ++ I+ Sbjct: 406 MIESGVHFIPVGAVTSSGYFSVNDWFRTKLKEMGRPIFAGSPRKLIEQIKQ 456 >UniRef50_UPI0000E460BF PREDICTED: similar to inter-alpha-trypsin inhibitor heavy chain3 n=5 Tax=Strongylocentrotus purpuratus RepID=UPI0000E460BF Length = 1028 Score = 45.1 bits (105), Expect = 0.005, Method: Composition-based stats. 
Identities = 19/122 (15%), Positives = 45/122 (36%), Gaps = 14/122 (11%) Query: 221 AKIERVPFIDTFDLRYKNYEKRPDPSSQAVMFCLMDVSGSMDQSTKDMAKRFYILLYLFL 280 K + +D + + Y + + P+++ + ++DVSGSM KR + + + Sbjct: 283 KKAGHLEIVDGYFVHYFSPDG--LPNTRKNVIFVIDVSGSMYGQKTRQTKRAFTTILDDV 340 Query: 281 SRTYKNVEVVYIRHHTQAKE------------VDEHEFFYSQETGGTIVSSALKLMDEVV 328 + +++ + +E + GGT + +L E++ Sbjct: 341 RPIDRINIILFSSYAHVWREDQMVEATSDNIAAAKRHVNGLSVGGGTNIYDSLMKAVEIL 400 Query: 329 KE 330 E Sbjct: 401 LE 402 >UniRef50_A1AQS2 Protoporphyrin IX magnesium-chelatase n=1 Tax=Pelobacter propionicus DSM 2379 RepID=A1AQS2_PELPD Length = 617 Score = 45.1 bits (105), Expect = 0.005, Method: Composition-based stats. Identities = 33/195 (16%), Positives = 63/195 (32%), Gaps = 18/195 (9%) Query: 148 YKTHRAGYTANGVPANISVVRSLQNSLARRTAMTAGKRRELHALEENLAIISNSEPAQLL 207 R G P + S + +R A R + S + Sbjct: 332 ESAQREEIMGVGAPFKL-RRLSFRKDRRKRQANGRRTRTRIKGRGGRYVKSLLSSTEHDI 390 Query: 208 EEERLRKEIAELRAKIERVPF--IDTFDLRYKNYEKRPDPSSQAVMFCLMDVSGSMDQST 265 + + A + R I+ DLR++ E+R ++ ++D SGSM Sbjct: 391 AIDATLRACAPFQKARNRQGMLKIEQDDLRFRQRERRM----GHLVLFVVDGSGSMGARQ 446 Query: 266 KDMAKR---FYILLYLFLSRTYKNVEVVY-------IRHHTQAKEVDEHEFFYSQETGGT 315 + M + +LL + + K +V+ + T + E+ G T Sbjct: 447 RMMETKGAVQSLLLDCY-QKRDKVAMIVFRKDRAELVLPPTASVELAARRLAELPVGGKT 505 Query: 316 IVSSALKLMDEVVKE 330 ++S L +V+ Sbjct: 506 PLASGLLKTHRLVRR 520 >UniRef50_A0CK50 Chromosome undetermined scaffold_2, whole genome shotgun sequence n=2 Tax=Paramecium tetraurelia RepID=A0CK50_PARTE Length = 1015 Score = 44.8 bits (104), Expect = 0.006, Method: Composition-based stats. 
 Identities = 31/146 (21%), Positives = 56/146 (38%), Gaps = 18/146 (12%)

Query: 225 RVPFIDTFDLRYKNYEKRPDPSSQAV--MFCLMDVSGSMDQSTKDMAKRFYILLYLFLSR 282
++PFI+ + + A+ + ++D SGSM + L F ++
Sbjct: 10 KIPFIEQSQDQSTKKGDKKGKEGNAILTIIGVIDASGSMSG-CWE-------WLSDFWNQ 61

Query: 283 TYKNVEVVYIRHHTQAKEVDEHEFFYS---QETGGTIVSSALKLMDEVVKERYNPAQWNI 339
+ ++ I T+ K E GGT + A + M+ +++ P Q NI
Sbjct: 62 SIPKENLITITFDTRQKISAEGVLSKRIKDHGGGGTEIVPAFQTMETELQK--VPIQNNI 119

Query: 340 YAAQASDGDNWADDSPLCHEILAKKL 365
SDG D++ + KKL
Sbjct: 120 TVIFISDG---QDNNVRTIDERMKKL 142

>UniRef50_C4G1K3 Putative uncharacterized protein n=1 Tax=Abiotrophia defectiva ATCC 49176 RepID=C4G1K3_ABIDE
 Length = 1659

 Score = 44.4 bits (103), Expect = 0.008, Method: Composition-based stats.
 Identities = 29/201 (14%), Positives = 62/201 (30%), Gaps = 11/201 (5%)

Query: 164 ISVVRSLQNSLARRTAMTAGKRRELHALEENLAIISNSEPAQLLEEERLRKEIAELRAKI 223
S + +++ LAR + + + E++ I++ +
Sbjct: 2 RSERKMVRSLLARFMVLMMVINLLGGINPSAVKAGPDEYYKNGSEKQENGVTISKKVTRY 61

Query: 224 ERVPFIDTFDLRYKNYEKRPDPSSQAVMFCLMDVSGSMDQSTKDMAKRFYILLYLFLSRT 283
+L+ K + + + +MD SGSM+ + + AK+ L
Sbjct: 62 NAADGTYDIELKVKGSTEVVQNNKILDIVLVMDTSGSMEGKSLENAKKAANNFVDKLLPQ 121

Query: 284 YKNVEVVYIR-------HHTQAKEVD--EHEFFYSQETGGTIVSSALKLMDEVVKERYNP 334
NV + + + V ++ + GGT L+ V+ P
Sbjct: 122 NNNVNIGIVSFAEKGEIKSGLTRNVTTLKNAIKGLKADGGTYTQQGLEKAATVLNG--AP 179

Query: 335 AQWNIYAAQASDGDNWADDSP 355
A+ DG+ +
Sbjct: 180 AEHKKVMVVIGDGEPTYANGE 200

>UniRef50_C6VUB8 von Willebrand factor type A n=1 Tax=Dyadobacter fermentans DSM 18053 RepID=C6VUB8_DYAFD
 Length = 935

 Score = 44.4 bits (103), Expect = 0.009, Method: Composition-based stats.
 Identities = 26/108 (24%), Positives = 41/108 (37%), Gaps = 12/108 (11%)

Query: 251 MFCLMDVSGSMDQSTK-DMAKRFYILLYLFLSRT-------YKNVEVVYIRHHTQAK--E 300
M L+DVS SM+ K + KR L + Y V ++ + AK E
Sbjct: 759 MVLLLDVSSSMNSPYKMPLLKRSIKSLLTLVRPEDMISIVLYSGKARVVLKPTSGAKASE 818

Query: 301 VDEHEFFYSQETGGTIVSSALKLMDEVVKERYNPAQWNIYAAQASDGD 348
+ Q G T + +KL + ++Y N A+DG+
Sbjct: 819 ISRM-IDLLQSDGDTDGNEGIKLAYKTANKQYIRGGNNRIVL-ATDGE 864

>UniRef50_C3NM85 von Willebrand factor type A n=14 Tax=Sulfolobaceae RepID=C3NM85_SULIN
 Length = 452

 Score = 44.0 bits (102), Expect = 0.010, Method: Composition-based stats.
 Identities = 28/121 (23%), Positives = 47/121 (38%), Gaps = 14/121 (11%)

Query: 241 KRPDPSSQAVMFCLMDVSGSMDQSTKDMAKRFYILLYL--------FLSRTYKNVEVVYI 292
++ + ++ L+D SGSMD AK + LY F R + N+ I
Sbjct: 280 QKQIRETLGPIYLLLDKSGSMDGEKILWAKAVALALYSRAKRENRDFYLRFFDNIPYPLI 339

Query: 293 RHHTQAKEVD----EHEFFYSQETGGTIVSSALKLMDEVVKERYNPAQWNIYAAQASDGD 348
+ AK D + GGT +S ++ E +KE + I +DG+
Sbjct: 340 KVQKNAKSKDIIKMVEYIGKIRGGGGTDISRSIISACEDIKEGHVKGVSEIILL--TDGE 397

Query: 349 N 349
+
Sbjct: 398 D 398

>UniRef50_A9G8C3 Putative uncharacterized protein n=2 Tax=Sorangium cellulosum 'So ce 56' RepID=A9G8C3_SORC5
 Length = 907

 Score = 43.6 bits (101), Expect = 0.014, Method: Composition-based stats.
 Identities = 34/188 (18%), Positives = 52/188 (27%), Gaps = 17/188 (9%)

Query: 172 NSLARRTAMTAGKRRELHALEENLAIISNSEPAQLLEEERLRKEIAELRAKIERVPFIDT 231
+ A E + + A + P L + LR ++ F
Sbjct: 427 AAFEAALARGVVPAAERELVGDVAARYAPEVPLALDKALGLRADLERAALGPGGGAFHLR 486

Query: 232 FDLRYKNYEKRPDPSSQAVMFCLMDVSGSMDQSTKDMAKRFYILLYLFLSRTYKNVEVVY 291
LR P + +D SGSM + D A+R L L+ +
Sbjct: 487 LALRSAAAAAAARPHLSVHLV--LDTSGSMAGAPIDSARRAAQALVDRLAPADDFSLTTF 544

Query: 292 IRHHTQAKEVDEH------------EFFYSQETGGTIVSSALKLMDEVVKERYNPAQWNI 339
+ A+ V E +E GGT + + L L P
Sbjct: 545 ---SSDAEVVIEDGPVGPRRAAIRRAIEGLREGGGTNIGAGLSLGYAQASRPGIPEDAVR 601

Query: 340 YAAQASDG 347
SDG
Sbjct: 602 VVLLVSDG 609

>UniRef50_A5FL88 Putative uncharacterized protein n=2 Tax=Flavobacterium RepID=A5FL88_FLAJ1
 Length = 1111

 Score = 43.2 bits (100), Expect = 0.020, Method: Composition-based stats.
 Identities = 34/166 (20%), Positives = 54/166 (32%), Gaps = 9/166 (5%)

Query: 65 HQGRGGLRHRVHPG---NDHFVQNDRIERPQGGGGGSGSGQGQASQDGEGQDEFVFQISK 121
G G G N +N + + QG G G +DGEG E + +I K
Sbjct: 928 KSGEGKQGQDSGQGKEGNGKEGKNGQGKNGQGSKEGGKGKDGNEGEDGEGDAEKIMEIYK 987

Query: 122 DE--YLDLLFEDLALPNLKQNQQRQLTEYKTHRAGYTANGVPANISVVRSL---QNSLAR 176
++ + L ++LA L + + + K G N ++ R L Q L
Sbjct: 988 EQVKLREALQKELAKKGLDAQGRSAIEQMKASEKQILNKG-FKNENLQRILNIQQELLKL 1046

Query: 177 RTAMTAGKRRELHALEENLAIISNSEPAQLLEEERLRKEIAELRAK 222
A+ + E N SN A + L +
Sbjct: 1047 NNAVQEQGQDTKRQSETNKTEFSNRSNALPSSLTDYLNSVEILNRQ 1092

>UniRef50_B4S8S0 Magnesium chelatase ATPase subunit D n=3 Tax=Chlorobiaceae RepID=B4S8S0_PROA2
 Length = 619

 Score = 42.8 bits (99), Expect = 0.022, Method: Composition-based stats.
 Identities = 33/185 (17%), Positives = 70/185 (37%), Gaps = 17/185 (9%)

Query: 176 RRTAMTAGKRRELHALEENLAIISNSEPAQLLEEERLRKE--IAELRAKIERVP---FID 230
R A+ + R + + + + L+ ++ + LR + I+
Sbjct: 356 RGEALNNRRGRFVRSQPGEIRGGKVALIPTLISAAPWQESRRLERLRKTGKVSTTGLIIN 415

Query: 231 TFDLRYKNYEKRPDPSSQAVMFCLMDVSGSMDQSTKDMAK--RFYILLYLFLSR------ 282
D++ K + + S + ++D SGSM + AK ++L ++ R
Sbjct: 416 KEDVKVKKFRDK----SGTLFIFIVDASGSMALNRMRQAKGAVSHLLQNAYVHRDQVALI 471

Query: 283 TYKNVEVVYIRHHTQAKEVDEHEFFYSQETGGTIVSSALKLMDEVVKERYNPAQWNIYAA 342
+++ E + +Q+ + + E GGT ++SA+ L E K+ I
Sbjct: 472 SFRGKEAQLLLPPSQSVDRAKRELDVLPTGGGTPLASAIYLAWETAKQARTKGVSQIMFV 531

Query: 343 QASDG 347
+DG
Sbjct: 532 LITDG 536

>UniRef50_B1I3V2 Magnesium chelatase n=4 Tax=cellular organisms RepID=B1I3V2_DESAP
 Length = 670

 Score = 42.4 bits (98), Expect = 0.034, Method: Composition-based stats.
 Identities = 29/174 (16%), Positives = 55/174 (31%), Gaps = 20/174 (11%)

Query: 168 RSLQNSLARRTAMTAGKRRELHALEENLAIISNSEPAQLLEEERLRKEIAELRAKIERVP 227
R L+ RR+ + + E L + LR + + R
Sbjct: 410 RVLRKGSGRRSRTRTPTKAG-----RYVRATLRRERDDLAFDATLRAAAP-FQKQRARDG 463

Query: 228 FIDTFDLRYKNYEKRPDPSSQAVMFCLMDVSGSMDQSTKDMAKR---FYILLYLFLSRTY 284
+ + + R + ++D SGSM + +A + +LL + R
Sbjct: 464 VAVAVESQDIREKVREKRIGN-FLVFVVDASGSMGAQQRMVAAKGAVLSLLLDAYQKR-- 520

Query: 285 KNVEVV-YIRHHTQAK-------EVDEHEFFYSQETGGTIVSSALKLMDEVVKE 330
V +V + H + E+ E G T +++ L EV +
Sbjct: 521 DRVGMVAFKGEHAEVLLPPTNSVELAERRLAELPTGGRTPLAAGLLKAYEVARA 574

>UniRef50_C9LTL4 Magnesium-chelatase, subunit D/I family n=1 Tax=Selenomonas sputigena ATCC 35185 RepID=C9LTL4_9FIRM
 Length = 657

 Score = 41.7 bits (96), Expect = 0.048, Method: Composition-based stats.
 Identities = 50/296 (16%), Positives = 90/296 (30%), Gaps = 53/296 (17%)

Query: 78 GNDHFVQNDRIERPQGGGGGSGSGQGQASQDGEGQDEFVFQISKDEY--LDLLFEDLALP 135
G ++DR E + A D G+D +DE ++ + L+L
Sbjct: 341 GESKEQEDDRGESQEKEAQADEEQASPADGDSGGED-------RDETHSIEAVMARLSL- 392

Query: 136 NLKQNQQRQLTEYKTHRAGYTANGVPANISVVRSLQNSLARRTAMTAGKRRELHALEENL 195
R+ + ++G V ++ R + SL R G+R +L
Sbjct: 393 ------LRETVCVRKGKSG-RRAIVQLDVPAGRPWRTSLPR-----TGRRIDLAFAATLR 440

Query: 196 AIISNSEPAQLLEEERLRKEIAELRAKIERVPFIDTFDLRYKNYEKRPDPSSQAVMFCLM 255
A + +R E + + R + A + L+
Sbjct: 441 AAAPYQRQRHGEQAVVIRAEDLRVWIRARR---------------------ASANILFLV 479

Query: 256 DVSGSMDQSTK-DMAKRFYILLYLFLSRTYKNVEVVYIRHH--------TQAKEVDEHEF 306
D SGSM + M K + L + V ++ R T++ E+ E
Sbjct: 480 DASGSMGAKERMKMVKGAVLALLREAYQKRDRVGLIAFRRTSAETLLPMTRSVELAEKAL 539

Query: 307 FYSQETGGTIVSSALKLMDEVVKERYNPAQWNIYAAQASDG-DNWADDSPLCHEIL 361
G T ++ L +++ E +DG N + E L
Sbjct: 540 RSLPTGGKTPLAEGLAAALKMMDELSRKEGAETVLVLVTDGRTNVSAAGKAKEEAL 595

  Database: uniref50.fasta
    Posted date: Mar 8, 2010 10:38 AM
  Number of letters in database: 1,040,396,356
  Number of sequences in database: 3,077,464

Lambda K H
0.304 0.126 0.315

Lambda K H
0.267 0.0392 0.140

Matrix: BLOSUM62
Gap Penalties: Existence: 11, Extension: 1
Number of Hits to DB: 2,093,419,015
Number of Sequences: 3077464
Number of extensions: 78681452
Number of successful extensions: 608335
Number of sequences better than 1.0e-01: 232
Number of HSP's better than 0.1 without gapping: 97
Number of HSP's successfully gapped in prelim test: 226
Number of HSP's that attempted gapping in prelim test: 606353
Number of HSP's gapped (non-prelim): 1444
length of query: 427
length of database: 1,040,396,356
effective HSP length: 131
effective length of query: 296
effective length of database: 637,248,572
effective search space: 188625577312
effective search space used: 188625577312
T: 11
A: 40
X1: 16 ( 7.0 bits)
X2: 38 (14.6 bits)
X3: 64 (24.7 bits)
S1: 41 (20.9 bits)
S2: 94 (40.9 bits)