BLASTP 2.2.22 [Sep-27-2009] Reference: Altschul, Stephen F., Thomas L. Madden, Alejandro A. Schaffer, Jinghui Zhang, Zheng Zhang, Webb Miller, and David J. Lipman (1997), "Gapped BLAST and PSI-BLAST: a new generation of protein database search programs", Nucleic Acids Res. 25:3389-3402. Reference for compositional score matrix adjustment: Altschul, Stephen F., John C. Wootton, E. Michael Gertz, Richa Agarwala, Aleksandr Morgulis, Alejandro A. Schaffer, and Yi-Kuo Yu (2005) "Protein database searches using compositionally adjusted substitution matrices", FEBS J. 272:5101-5109. Reference for composition-based statistics starting in round 2: Schaffer, Alejandro A., L. Aravind, Thomas L. Madden, Sergei Shavirin, John L. Spouge, Yuri I. Wolf, Eugene V. Koonin, and Stephen F. Altschul (2001), "Improving the accuracy of PSI-BLAST protein database searches with composition-based statistics and other refinements", Nucleic Acids Res. 29:2994-3005. Query= batch____ (203 letters) Database: uniref50.fasta 3,077,464 sequences; 1,040,396,356 total letters Searching..................................................done Results from round 1 Score E Sequences producing significant alignments: (bits) Value UniRef50_P76066 Putative uncharacterized protein ydaW n=7 Tax=Es... 418 e-116 UniRef50_B6IA78 Hypothetical phage protein n=97 Tax=Enterobacter... 345 5e-94 UniRef50_UPI0001BCF478 hypothetical protein EscherichiacoliO157_... 105 6e-22 UniRef50_UPI0001BCF391 hypothetical protein EscherichiacoliO157_... 76 8e-13 UniRef50_B2N5R0 Putative uncharacterized protein n=1 Tax=Escheri... 69 1e-10 >UniRef50_P76066 Putative uncharacterized protein ydaW n=7 Tax=Escherichia coli RepID=YDAW_ECOLI Length = 203 Score = 418 bits (1074), Expect = e-116, Method: Compositional matrix adjust. Identities = 203/203 (100%), Positives = 203/203 (100%) Query: 1 MSSSQELRSNFYREKNLMETVFDALKAMGKATSIELAARLDISREEVLNELWELKKAGFV 60 MSSSQELRSNFYREKNLMETVFDALKAMGKATSIELAARLDISREEVLNELWELKKAGFV Sbjct: 1 MSSSQELRSNFYREKNLMETVFDALKAMGKATSIELAARLDISREEVLNELWELKKAGFV 60 Query: 61 DKSAYTWRVADNNVQQEQPAQAELPEEITTATVAKISECDLTATIEQRGPQTADELATLF 120 DKSAYTWRVADNNVQQEQPAQAELPEEITTATVAKISECDLTATIEQRGPQTADELATLF Sbjct: 61 DKSAYTWRVADNNVQQEQPAQAELPEEITTATVAKISECDLTATIEQRGPQTADELATLF 120 Query: 121 GTTSRKVASTLAMAISKGRLIRVNQGGKFRYCIPGDNLPAEPKAASVSPLWLSASSSACH 180 GTTSRKVASTLAMAISKGRLIRVNQGGKFRYCIPGDNLPAEPKAASVSPLWLSASSSACH Sbjct: 121 GTTSRKVASTLAMAISKGRLIRVNQGGKFRYCIPGDNLPAEPKAASVSPLWLSASSSACH 180 Query: 181 GVLIITVITPSPTKNSATKMPEN 203 GVLIITVITPSPTKNSATKMPEN Sbjct: 181 GVLIITVITPSPTKNSATKMPEN 203 >UniRef50_B6IA78 Hypothetical phage protein n=97 Tax=Enterobacteriaceae RepID=B6IA78_ECOSE Length = 270 Score = 345 bits (886), Expect = 5e-94, Method: Compositional matrix adjust. Identities = 166/168 (98%), Positives = 168/168 (100%) Query: 1 MSSSQELRSNFYREKNLMETVFDALKAMGKATSIELAARLDISREEVLNELWELKKAGFV 60 MSSSQELRSNFYREKNLMETVFDALKAMGKATSIELAARLDISREEVLNELWELKKAGFV Sbjct: 1 MSSSQELRSNFYREKNLMETVFDALKAMGKATSIELAARLDISREEVLNELWELKKAGFV 60 Query: 61 DKSAYTWRVADNNVQQEQPAQAELPEEITTATVAKISECDLTATIEQRGPQTADELATLF 120 DKSAYTWRVADNN+QQEQPAQAELPEEITTATVAKISECDLTATIEQRGPQTADELATLF Sbjct: 61 DKSAYTWRVADNNIQQEQPAQAELPEEITTATVAKISECDLTATIEQRGPQTADELATLF 120 Query: 121 GTTSRKVASTLAMAISKGRLIRVNQGGKFRYCIPGDNLPAEPKAASVS 168 GTTSRKVASTLAMAISKGRLIRVNQGGKFRYCIPGDNLPAEPKAASV+ Sbjct: 121 GTTSRKVASTLAMAISKGRLIRVNQGGKFRYCIPGDNLPAEPKAASVA 168 >UniRef50_UPI0001BCF478 hypothetical protein EscherichiacoliO157_21186 n=1 Tax=Escherichia coli O157:H7 str. FRIK2000 RepID=UPI0001BCF478 Length = 122 Score = 105 bits (263), Expect = 6e-22, Method: Compositional matrix adjust. Identities = 55/117 (47%), Positives = 75/117 (64%), Gaps = 3/117 (2%) Query: 55 KKAGFVDKSAYTWRVA---DNNVQQEQPAQAELPEEITTATVAKISECDLTATIEQRGPQ 111 +K G VDK+ +TW +A + V + Q Q E P+ +T K++ + I Q G + Sbjct: 4 QKNGVVDKTGHTWFLAVEGEAGVTEGQALQPEAPDVVTEEVAPKVTADMMVEFIGQDGAK 63 Query: 112 TADELATLFGTTSRKVASTLAMAISKGRLIRVNQGGKFRYCIPGDNLPAEPKAASVS 168 T +ELA FG ++RKVASTLA+ + GRL RVNQ GKFRYC+ G NLPA+PKAA V+ Sbjct: 64 TCEELAGKFGVSTRKVASTLAVVTATGRLARVNQNGKFRYCMSGGNLPADPKAAPVT 120 >UniRef50_UPI0001BCF391 hypothetical protein EscherichiacoliO157_14042 n=2 Tax=Escherichia coli O157:H7 RepID=UPI0001BCF391 Length = 76 Score = 75.9 bits (185), Expect = 8e-13, Method: Compositional matrix adjust. Identities = 40/75 (53%), Positives = 55/75 (73%), Gaps = 3/75 (4%) Query: 18 METVFDALKAMGKATSIELAARLDISREEVLNELWELKKAGFVDKSAYTWRVA---DNNV 74 METV DALKA+ KA+S +AARL ISREE +NELWELK G VDK+ +TW +A ++ V Sbjct: 1 METVSDALKALKKASSHVVAARLGISREEAVNELWELKINGVVDKTGHTWFLAGEGESRV 60 Query: 75 QQEQPAQAELPEEIT 89 +E+P ++E + +T Sbjct: 61 TEERPVKSEAQDMLT 75 >UniRef50_B2N5R0 Putative uncharacterized protein n=1 Tax=Escherichia coli 53638 RepID=B2N5R0_ECOLX Length = 170 Score = 68.9 bits (167), Expect = 1e-10, Method: Compositional matrix adjust. Identities = 31/50 (62%), Positives = 39/50 (78%) Query: 18 METVFDALKAMGKATSIELAARLDISREEVLNELWELKKAGFVDKSAYTW 67 METVFDALKAMGKA+S E+AARL ++R+E +NELW+LK+ G D W Sbjct: 1 METVFDALKAMGKASSQEVAARLGMTRDEAINELWKLKRRGEADNKGPMW 50 Searching..................................................done Results from round 2 Score E Sequences producing significant alignments: (bits) Value Sequences used in model and found again: UniRef50_P76066 Putative uncharacterized protein ydaW n=7 Tax=Es... 360 2e-98 UniRef50_B6IA78 Hypothetical phage protein n=97 Tax=Enterobacter... 293 2e-78 UniRef50_UPI0001BCF478 hypothetical protein EscherichiacoliO157_... 185 1e-45 UniRef50_UPI0001BCF391 hypothetical protein EscherichiacoliO157_... 111 1e-23 UniRef50_B2N5R0 Putative uncharacterized protein n=1 Tax=Escheri... 95 1e-18 Sequences not found previously or not previously below threshold: UniRef50_A8AHR4 Putative uncharacterized protein n=1 Tax=Citroba... 50 6e-05 UniRef50_B2HDY0 ATP-dependent helicase Lhr n=16 Tax=Actinomyceta... 47 3e-04 UniRef50_Q6D6U1 Probable ATP-dependent helicase n=4 Tax=Proteoba... 42 0.009 UniRef50_C8U7P5 Putative uncharacterized protein n=13 Tax=root R... 40 0.065 >UniRef50_P76066 Putative uncharacterized protein ydaW n=7 Tax=Escherichia coli RepID=YDAW_ECOLI Length = 203 Score = 360 bits (924), Expect = 2e-98, Method: Composition-based stats. Identities = 203/203 (100%), Positives = 203/203 (100%) Query: 1 MSSSQELRSNFYREKNLMETVFDALKAMGKATSIELAARLDISREEVLNELWELKKAGFV 60 MSSSQELRSNFYREKNLMETVFDALKAMGKATSIELAARLDISREEVLNELWELKKAGFV Sbjct: 1 MSSSQELRSNFYREKNLMETVFDALKAMGKATSIELAARLDISREEVLNELWELKKAGFV 60 Query: 61 DKSAYTWRVADNNVQQEQPAQAELPEEITTATVAKISECDLTATIEQRGPQTADELATLF 120 DKSAYTWRVADNNVQQEQPAQAELPEEITTATVAKISECDLTATIEQRGPQTADELATLF Sbjct: 61 DKSAYTWRVADNNVQQEQPAQAELPEEITTATVAKISECDLTATIEQRGPQTADELATLF 120 Query: 121 GTTSRKVASTLAMAISKGRLIRVNQGGKFRYCIPGDNLPAEPKAASVSPLWLSASSSACH 180 GTTSRKVASTLAMAISKGRLIRVNQGGKFRYCIPGDNLPAEPKAASVSPLWLSASSSACH Sbjct: 121 GTTSRKVASTLAMAISKGRLIRVNQGGKFRYCIPGDNLPAEPKAASVSPLWLSASSSACH 180 Query: 181 GVLIITVITPSPTKNSATKMPEN 203 GVLIITVITPSPTKNSATKMPEN Sbjct: 181 GVLIITVITPSPTKNSATKMPEN 203 >UniRef50_B6IA78 Hypothetical phage protein n=97 Tax=Enterobacteriaceae RepID=B6IA78_ECOSE Length = 270 Score = 293 bits (750), Expect = 2e-78, Method: Composition-based stats. Identities = 166/168 (98%), Positives = 168/168 (100%) Query: 1 MSSSQELRSNFYREKNLMETVFDALKAMGKATSIELAARLDISREEVLNELWELKKAGFV 60 MSSSQELRSNFYREKNLMETVFDALKAMGKATSIELAARLDISREEVLNELWELKKAGFV Sbjct: 1 MSSSQELRSNFYREKNLMETVFDALKAMGKATSIELAARLDISREEVLNELWELKKAGFV 60 Query: 61 DKSAYTWRVADNNVQQEQPAQAELPEEITTATVAKISECDLTATIEQRGPQTADELATLF 120 DKSAYTWRVADNN+QQEQPAQAELPEEITTATVAKISECDLTATIEQRGPQTADELATLF Sbjct: 61 DKSAYTWRVADNNIQQEQPAQAELPEEITTATVAKISECDLTATIEQRGPQTADELATLF 120 Query: 121 GTTSRKVASTLAMAISKGRLIRVNQGGKFRYCIPGDNLPAEPKAASVS 168 GTTSRKVASTLAMAISKGRLIRVNQGGKFRYCIPGDNLPAEPKAASV+ Sbjct: 121 GTTSRKVASTLAMAISKGRLIRVNQGGKFRYCIPGDNLPAEPKAASVA 168 >UniRef50_UPI0001BCF478 hypothetical protein EscherichiacoliO157_21186 n=1 Tax=Escherichia coli O157:H7 str. FRIK2000 RepID=UPI0001BCF478 Length = 122 Score = 185 bits (469), Expect = 1e-45, Method: Composition-based stats. Identities = 55/117 (47%), Positives = 75/117 (64%), Gaps = 3/117 (2%) Query: 55 KKAGFVDKSAYTWRVA---DNNVQQEQPAQAELPEEITTATVAKISECDLTATIEQRGPQ 111 +K G VDK+ +TW +A + V + Q Q E P+ +T K++ + I Q G + Sbjct: 4 QKNGVVDKTGHTWFLAVEGEAGVTEGQALQPEAPDVVTEEVAPKVTADMMVEFIGQDGAK 63 Query: 112 TADELATLFGTTSRKVASTLAMAISKGRLIRVNQGGKFRYCIPGDNLPAEPKAASVS 168 T +ELA FG ++RKVASTLA+ + GRL RVNQ GKFRYC+ G NLPA+PKAA V+ Sbjct: 64 TCEELAGKFGVSTRKVASTLAVVTATGRLARVNQNGKFRYCMSGGNLPADPKAAPVT 120 >UniRef50_UPI0001BCF391 hypothetical protein EscherichiacoliO157_14042 n=2 Tax=Escherichia coli O157:H7 RepID=UPI0001BCF391 Length = 76 Score = 111 bits (277), Expect = 1e-23, Method: Composition-based stats. Identities = 40/75 (53%), Positives = 55/75 (73%), Gaps = 3/75 (4%) Query: 18 METVFDALKAMGKATSIELAARLDISREEVLNELWELKKAGFVDKSAYTWRVA---DNNV 74 METV DALKA+ KA+S +AARL ISREE +NELWELK G VDK+ +TW +A ++ V Sbjct: 1 METVSDALKALKKASSHVVAARLGISREEAVNELWELKINGVVDKTGHTWFLAGEGESRV 60 Query: 75 QQEQPAQAELPEEIT 89 +E+P ++E + +T Sbjct: 61 TEERPVKSEAQDMLT 75 >UniRef50_B2N5R0 Putative uncharacterized protein n=1 Tax=Escherichia coli 53638 RepID=B2N5R0_ECOLX Length = 170 Score = 94.8 bits (234), Expect = 1e-18, Method: Composition-based stats. Identities = 40/100 (40%), Positives = 59/100 (59%), Gaps = 3/100 (3%) Query: 18 METVFDALKAMGKATSIELAARLDISREEVLNELWELKKAGFVDKSAYTWRVADNNVQQE 77 METVFDALKAMGKA+S E+AARL ++R+E +NELW+LK+ G D W + + +++ Sbjct: 1 METVFDALKAMGKASSQEVAARLGMTRDEAINELWKLKRRGEADNKGPMWWLT-SGDEEK 59 Query: 78 QPAQAELPEEITTATVAKISECDLTATIEQRGPQTADELA 117 Q E P + + +E + TA E G + + A Sbjct: 60 QAGYRETP--VVDLIPQRGTESEKTARQEASGTENTGQHA 97 >UniRef50_A8AHR4 Putative uncharacterized protein n=1 Tax=Citrobacter koseri ATCC BAA-895 RepID=A8AHR4_CITK8 Length = 228 Score = 49.8 bits (117), Expect = 6e-05, Method: Composition-based stats. Identities = 33/134 (24%), Positives = 53/134 (39%), Gaps = 7/134 (5%) Query: 18 METVFDALKAMGKATSIELAARLDISREEVLNELWELKKAGFVDKSAYTWRVADNNVQQE 77 METV L+ MG+AT E+A RL+I + L L E + G D W + Q + Sbjct: 1 METVIQVLEKMGRATYREVADRLEIDPVDALTMLREQRDQGLCDFGDGGWFLGTVKGQSQ 60 Query: 78 Q-----PAQAELPEEITTATVAKISECDLTATIEQRGPQTADELATLFGTTSRKVASTLA 132 Q P P + + + ++G T LA +R + S L Sbjct: 61 QSTPNAPVHP-APRLKGEEPEP-VDPDVVRQQLREQGAMTTVSLAAAVNRNARGMVSVLR 118 Query: 133 MAISKGRLIRVNQG 146 +G +++ +G Sbjct: 119 ALERQGVVVKNGKG 132 >UniRef50_B2HDY0 ATP-dependent helicase Lhr n=16 Tax=Actinomycetales RepID=B2HDY0_MYCMM Length = 1539 Score = 47.4 bits (111), Expect = 3e-04, Method: Composition-based stats. Identities = 37/133 (27%), Positives = 52/133 (39%), Gaps = 10/133 (7%) Query: 19 ETVFDALKAMGKATSIELAARLDISREEVLNELWE----LKKAGFVDKSAYTWRVADNNV 74 E V D L+ +G T E+AAR D + + E ++A V + +W VA ++ Sbjct: 932 EGVADLLRLLGPLTEDEVAARSDTTGGTDIGGWLEGLRAARRALEVSYAGRSWWVAVEDI 991 Query: 75 QQ-----EQPAQAELPEEITTATVAKISECDLTATIEQRGPQTADELATLFGTTSRKVAS 129 + LP T A ++E L P T E A FG R A Sbjct: 992 GRLRDGVGAAVPVGLPAAFTEAVADPLAE-LLGRYARTHPPFTTAEAAGRFGLGLRVTAD 1050 Query: 130 TLAMAISKGRLIR 142 L S GRL+R Sbjct: 1051 VLGRLASDGRLVR 1063 >UniRef50_Q6D6U1 Probable ATP-dependent helicase n=4 Tax=Proteobacteria RepID=Q6D6U1_ERWCT Length = 1598 Score = 42.4 bits (98), Expect = 0.009, Method: Composition-based stats. Identities = 40/160 (25%), Positives = 63/160 (39%), Gaps = 14/160 (8%) Query: 12 YREKNLMETVFDALKAMGKATSIELAARLDISREEV---LNELWELKK--AGFVDKSAYT 66 YR K E V D L+ +G + E+AARLD +E LN L ++ + + Sbjct: 952 YRAKG-KEGVADLLRELGPLCTQEVAARLDSEADEAAAFLNSLESGRRVIRVMIGAREHW 1010 Query: 67 WRVADNNVQQ---EQPAQAELPEEITTATVAKISECDLTATIEQRGPQTADELATLFGTT 123 + D + P LP +T + + RGP T + A+ FG Sbjct: 1011 ASIEDAGRLRDALGVPLPVGLPAAFLESTADPLRS-LIARFARTRGPFTTGDAASAFGLG 1069 Query: 124 SRKVASTLAMAISKGRLIR----VNQGGKFRYCIPGDNLP 159 + V + L +G+L+ V +G K + GD P Sbjct: 1070 ASVVEAVLRQIAEQGKLLSGRFGVRRGMKSPPQVSGDAGP 1109 >UniRef50_C8U7P5 Putative uncharacterized protein n=13 Tax=root RepID=C8U7P5_ECO10 Length = 129 Score = 39.7 bits (91), Expect = 0.065, Method: Composition-based stats. Identities = 16/50 (32%), Positives = 26/50 (52%) Query: 18 METVFDALKAMGKATSIELAARLDISREEVLNELWELKKAGFVDKSAYTW 67 M V L +GKA++ ++ L+I +V++ LW+LK G V W Sbjct: 1 MNAVLTELNKLGKASAESISKGLNIDLNDVIDTLWKLKNQGVVTVKNGIW 50 Searching..................................................done Results from round 3 Score E Sequences producing significant alignments: (bits) Value Sequences used in model and found again: UniRef50_P76066 Putative uncharacterized protein ydaW n=7 Tax=Es... 336 2e-91 UniRef50_B6IA78 Hypothetical phage protein n=97 Tax=Enterobacter... 265 7e-70 UniRef50_UPI0001BCF478 hypothetical protein EscherichiacoliO157_... 161 1e-38 UniRef50_A8AHR4 Putative uncharacterized protein n=1 Tax=Citroba... 146 5e-34 UniRef50_B2HDY0 ATP-dependent helicase Lhr n=16 Tax=Actinomyceta... 138 1e-31 UniRef50_B2N5R0 Putative uncharacterized protein n=1 Tax=Escheri... 124 3e-27 UniRef50_UPI0001BCF391 hypothetical protein EscherichiacoliO157_... 98 2e-19 Sequences not found previously or not previously below threshold: UniRef50_Q6D6U1 Probable ATP-dependent helicase n=4 Tax=Proteoba... 86 6e-16 UniRef50_Q2J762 DEAD/DEAH box helicase-like n=5 Tax=Actinomyceta... 81 2e-14 UniRef50_C6W9K2 DEAD/H associated domain protein n=3 Tax=Actinom... 77 4e-13 UniRef50_D0L8P3 DEAD/H associated domain protein n=13 Tax=Coryne... 76 9e-13 UniRef50_D0LRE7 DEAD/H associated domain protein n=1 Tax=Haliang... 75 2e-12 UniRef50_A0JSK2 DEAD/H associated domain protein n=5 Tax=Actinom... 74 2e-12 UniRef50_A6W838 DEAD/H associated domain protein n=27 Tax=Bacter... 69 1e-10 UniRef50_C1F7I8 DEAD/DEAH box helicase domain protein n=1 Tax=Ac... 68 1e-10 UniRef50_C5BWN7 DEAD/H associated domain protein n=3 Tax=Microco... 68 3e-10 UniRef50_B8GTJ5 DEAD/DEAH box helicase domain-containing protein... 67 3e-10 UniRef50_A8L6I9 DEAD/H associated domain protein n=13 Tax=Bacter... 67 4e-10 UniRef50_D1VBA8 DEAD/H associated domain protein n=1 Tax=Frankia... 66 9e-10 UniRef50_C7NJ98 ATP dependent helicase, Lhr family n=1 Tax=Kytoc... 65 1e-09 UniRef50_D2AY23 Putative ATP-dependent helicase Lhr n=1 Tax=Stre... 62 1e-08 UniRef50_Q023K9 ATP dependent helicase, Lhr family n=1 Tax=Candi... 60 4e-08 UniRef50_A7HB32 DEAD/H associated domain protein n=2 Tax=Myxococ... 59 9e-08 UniRef50_A9WLQ1 ATP-dependent helicase n=3 Tax=Micrococcaceae Re... 59 1e-07 UniRef50_B9ZKU2 DEAD/H associated domain protein n=1 Tax=Thioalk... 59 1e-07 UniRef50_C8Q5F9 DEAD/H associated domain protein n=2 Tax=Enterob... 58 1e-07 UniRef50_B8HC38 DEAD/H associated domain protein n=1 Tax=Arthrob... 57 4e-07 UniRef50_Q2IJM9 DEAD/DEAH box helicase-like n=9 Tax=Bacteria Rep... 57 4e-07 UniRef50_C7MDW5 ATP dependent helicase, Lhr family n=1 Tax=Brach... 53 9e-06 UniRef50_UPI0001C317F9 DEAD/H associated domain protein n=1 Tax=... 47 4e-04 UniRef50_C8WJH4 DEAD/H associated domain protein n=1 Tax=Eggerth... 45 0.001 UniRef50_Q8NS36 Lhr-like helicases n=2 Tax=Corynebacterium gluta... 44 0.002 UniRef50_C7R3F9 DEAD/H associated domain protein n=1 Tax=Jonesia... 44 0.003 UniRef50_P30015 Probable ATP-dependent helicase lhr n=44 Tax=Ent... 43 0.005 UniRef50_C8U7P5 Putative uncharacterized protein n=13 Tax=root R... 42 0.011 UniRef50_Q1YII8 Possible regulatory protein n=1 Tax=Aurantimonas... 41 0.024 >UniRef50_P76066 Putative uncharacterized protein ydaW n=7 Tax=Escherichia coli RepID=YDAW_ECOLI Length = 203 Score = 336 bits (862), Expect = 2e-91, Method: Composition-based stats. Identities = 203/203 (100%), Positives = 203/203 (100%) Query: 1 MSSSQELRSNFYREKNLMETVFDALKAMGKATSIELAARLDISREEVLNELWELKKAGFV 60 MSSSQELRSNFYREKNLMETVFDALKAMGKATSIELAARLDISREEVLNELWELKKAGFV Sbjct: 1 MSSSQELRSNFYREKNLMETVFDALKAMGKATSIELAARLDISREEVLNELWELKKAGFV 60 Query: 61 DKSAYTWRVADNNVQQEQPAQAELPEEITTATVAKISECDLTATIEQRGPQTADELATLF 120 DKSAYTWRVADNNVQQEQPAQAELPEEITTATVAKISECDLTATIEQRGPQTADELATLF Sbjct: 61 DKSAYTWRVADNNVQQEQPAQAELPEEITTATVAKISECDLTATIEQRGPQTADELATLF 120 Query: 121 GTTSRKVASTLAMAISKGRLIRVNQGGKFRYCIPGDNLPAEPKAASVSPLWLSASSSACH 180 GTTSRKVASTLAMAISKGRLIRVNQGGKFRYCIPGDNLPAEPKAASVSPLWLSASSSACH Sbjct: 121 GTTSRKVASTLAMAISKGRLIRVNQGGKFRYCIPGDNLPAEPKAASVSPLWLSASSSACH 180 Query: 181 GVLIITVITPSPTKNSATKMPEN 203 GVLIITVITPSPTKNSATKMPEN Sbjct: 181 GVLIITVITPSPTKNSATKMPEN 203 >UniRef50_B6IA78 Hypothetical phage protein n=97 Tax=Enterobacteriaceae RepID=B6IA78_ECOSE Length = 270 Score = 265 bits (677), Expect = 7e-70, Method: Composition-based stats. Identities = 166/168 (98%), Positives = 168/168 (100%) Query: 1 MSSSQELRSNFYREKNLMETVFDALKAMGKATSIELAARLDISREEVLNELWELKKAGFV 60 MSSSQELRSNFYREKNLMETVFDALKAMGKATSIELAARLDISREEVLNELWELKKAGFV Sbjct: 1 MSSSQELRSNFYREKNLMETVFDALKAMGKATSIELAARLDISREEVLNELWELKKAGFV 60 Query: 61 DKSAYTWRVADNNVQQEQPAQAELPEEITTATVAKISECDLTATIEQRGPQTADELATLF 120 DKSAYTWRVADNN+QQEQPAQAELPEEITTATVAKISECDLTATIEQRGPQTADELATLF Sbjct: 61 DKSAYTWRVADNNIQQEQPAQAELPEEITTATVAKISECDLTATIEQRGPQTADELATLF 120 Query: 121 GTTSRKVASTLAMAISKGRLIRVNQGGKFRYCIPGDNLPAEPKAASVS 168 GTTSRKVASTLAMAISKGRLIRVNQGGKFRYCIPGDNLPAEPKAASV+ Sbjct: 121 GTTSRKVASTLAMAISKGRLIRVNQGGKFRYCIPGDNLPAEPKAASVA 168 >UniRef50_UPI0001BCF478 hypothetical protein EscherichiacoliO157_21186 n=1 Tax=Escherichia coli O157:H7 str. FRIK2000 RepID=UPI0001BCF478 Length = 122 Score = 161 bits (408), Expect = 1e-38, Method: Composition-based stats. Identities = 55/117 (47%), Positives = 75/117 (64%), Gaps = 3/117 (2%) Query: 55 KKAGFVDKSAYTWRVA---DNNVQQEQPAQAELPEEITTATVAKISECDLTATIEQRGPQ 111 +K G VDK+ +TW +A + V + Q Q E P+ +T K++ + I Q G + Sbjct: 4 QKNGVVDKTGHTWFLAVEGEAGVTEGQALQPEAPDVVTEEVAPKVTADMMVEFIGQDGAK 63 Query: 112 TADELATLFGTTSRKVASTLAMAISKGRLIRVNQGGKFRYCIPGDNLPAEPKAASVS 168 T +ELA FG ++RKVASTLA+ + GRL RVNQ GKFRYC+ G NLPA+PKAA V+ Sbjct: 64 TCEELAGKFGVSTRKVASTLAVVTATGRLARVNQNGKFRYCMSGGNLPADPKAAPVT 120 >UniRef50_A8AHR4 Putative uncharacterized protein n=1 Tax=Citrobacter koseri ATCC BAA-895 RepID=A8AHR4_CITK8 Length = 228 Score = 146 bits (367), Expect = 5e-34, Method: Composition-based stats. Identities = 33/134 (24%), Positives = 53/134 (39%), Gaps = 7/134 (5%) Query: 18 METVFDALKAMGKATSIELAARLDISREEVLNELWELKKAGFVDKSAYTWRVADNNVQQE 77 METV L+ MG+AT E+A RL+I + L L E + G D W + Q + Sbjct: 1 METVIQVLEKMGRATYREVADRLEIDPVDALTMLREQRDQGLCDFGDGGWFLGTVKGQSQ 60 Query: 78 Q-----PAQAELPEEITTATVAKISECDLTATIEQRGPQTADELATLFGTTSRKVASTLA 132 Q P P + + + ++G T LA +R + S L Sbjct: 61 QSTPNAPVHP-APRLKGEEPEP-VDPDVVRQQLREQGAMTTVSLAAAVNRNARGMVSVLR 118 Query: 133 MAISKGRLIRVNQG 146 +G +++ +G Sbjct: 119 ALERQGVVVKNGKG 132 >UniRef50_B2HDY0 ATP-dependent helicase Lhr n=16 Tax=Actinomycetales RepID=B2HDY0_MYCMM Length = 1539 Score = 138 bits (348), Expect = 1e-31, Method: Composition-based stats. Identities = 37/133 (27%), Positives = 52/133 (39%), Gaps = 10/133 (7%) Query: 19 ETVFDALKAMGKATSIELAARLDISREEVLNELWE----LKKAGFVDKSAYTWRVADNNV 74 E V D L+ +G T E+AAR D + + E ++A V + +W VA ++ Sbjct: 932 EGVADLLRLLGPLTEDEVAARSDTTGGTDIGGWLEGLRAARRALEVSYAGRSWWVAVEDI 991 Query: 75 QQ-----EQPAQAELPEEITTATVAKISECDLTATIEQRGPQTADELATLFGTTSRKVAS 129 + LP T A ++E L P T E A FG R A Sbjct: 992 GRLRDGVGAAVPVGLPAAFTEAVADPLAE-LLGRYARTHPPFTTAEAAGRFGLGLRVTAD 1050 Query: 130 TLAMAISKGRLIR 142 L S GRL+R Sbjct: 1051 VLGRLASDGRLVR 1063 >UniRef50_B2N5R0 Putative uncharacterized protein n=1 Tax=Escherichia coli 53638 RepID=B2N5R0_ECOLX Length = 170 Score = 124 bits (310), Expect = 3e-27, Method: Composition-based stats. Identities = 40/100 (40%), Positives = 59/100 (59%), Gaps = 3/100 (3%) Query: 18 METVFDALKAMGKATSIELAARLDISREEVLNELWELKKAGFVDKSAYTWRVADNNVQQE 77 METVFDALKAMGKA+S E+AARL ++R+E +NELW+LK+ G D W + + +++ Sbjct: 1 METVFDALKAMGKASSQEVAARLGMTRDEAINELWKLKRRGEADNKGPMWWL-TSGDEEK 59 Query: 78 QPAQAELPEEITTATVAKISECDLTATIEQRGPQTADELA 117 Q E P + + +E + TA E G + + A Sbjct: 60 QAGYRETP--VVDLIPQRGTESEKTARQEASGTENTGQHA 97 >UniRef50_UPI0001BCF391 hypothetical protein EscherichiacoliO157_14042 n=2 Tax=Escherichia coli O157:H7 RepID=UPI0001BCF391 Length = 76 Score = 97.6 bits (241), Expect = 2e-19, Method: Composition-based stats. Identities = 40/75 (53%), Positives = 55/75 (73%), Gaps = 3/75 (4%) Query: 18 METVFDALKAMGKATSIELAARLDISREEVLNELWELKKAGFVDKSAYTWRVA---DNNV 74 METV DALKA+ KA+S +AARL ISREE +NELWELK G VDK+ +TW +A ++ V Sbjct: 1 METVSDALKALKKASSHVVAARLGISREEAVNELWELKINGVVDKTGHTWFLAGEGESRV 60 Query: 75 QQEQPAQAELPEEIT 89 +E+P ++E + +T Sbjct: 61 TEERPVKSEAQDMLT 75 >UniRef50_Q6D6U1 Probable ATP-dependent helicase n=4 Tax=Proteobacteria RepID=Q6D6U1_ERWCT Length = 1598 Score = 86.0 bits (211), Expect = 6e-16, Method: Composition-based stats. Identities = 41/160 (25%), Positives = 63/160 (39%), Gaps = 14/160 (8%) Query: 12 YREKNLMETVFDALKAMGKATSIELAARLDISREEV---LNELWELKKAGFVDKSAYTWR 68 YR K E V D L+ +G + E+AARLD +E LN L ++ V A Sbjct: 952 YRAKG-KEGVADLLRELGPLCTQEVAARLDSEADEAAAFLNSLESGRRVIRVMIGAREHW 1010 Query: 69 VADNNVQQ-----EQPAQAELPEEITTATVAKISECDLTATIEQRGPQTADELATLFGTT 123 + + + P LP +T + + RGP T + A+ FG Sbjct: 1011 ASIEDAGRLRDALGVPLPVGLPAAFLESTADPL-RSLIARFARTRGPFTTGDAASAFGLG 1069 Query: 124 SRKVASTLAMAISKGRLIR----VNQGGKFRYCIPGDNLP 159 + V + L +G+L+ V +G K + GD P Sbjct: 1070 ASVVEAVLRQIAEQGKLLSGRFGVRRGMKSPPQVSGDAGP 1109 >UniRef50_Q2J762 DEAD/DEAH box helicase-like n=5 Tax=Actinomycetales RepID=Q2J762_FRASC Length = 1567 Score = 81.0 bits (198), Expect = 2e-14, Method: Composition-based stats. Identities = 36/152 (23%), Positives = 61/152 (40%), Gaps = 13/152 (8%) Query: 13 REKNLMETVFDALKAMGKATSIELAARLDISREEVLNELWELKKAGFVDKSAYTWRVADN 72 R +E V D L+ +G T+ E AR + L EL + ++A V + VA Sbjct: 958 RHARDVEGVADLLRMLGDLTTEEAVAR-GATPG-WLAELEQTRRALRVRIAGEERWVAIE 1015 Query: 73 NVQQ-----EQPAQAELPEEITTATVAKISECDLTATIEQRGPQTADELATLFGTTSRKV 127 + + P +P+ T + ++ GP +E A G + V Sbjct: 1016 DAGRLRDALGVPLPVGVPDAFTEPVRDPLG-DLISRYARTHGPFDTEEPAARLGLGTAVV 1074 Query: 128 ASTLAMAISKGRLIRV----NQGGKFRYCIPG 155 A L + GRL+R ++GG+ ++C G Sbjct: 1075 AGVLDGLVRSGRLVRGELHPDRGGE-QWCDAG 1105 >UniRef50_C6W9K2 DEAD/H associated domain protein n=3 Tax=Actinomycetales RepID=C6W9K2_ACTMD Length = 1606 Score = 76.8 bits (187), Expect = 4e-13, Method: Composition-based stats. Identities = 30/129 (23%), Positives = 45/129 (34%), Gaps = 8/129 (6%) Query: 19 ETVFDALKAMGKATSIELAARLDISREEVLNELWELKKAGFVDKSAYTWRVADNNVQQ-- 76 E D L+ +G + E A R R E L EL ++A V + + + + + Sbjct: 950 EGAADLLRFLGDLSEEEAAERG--VRPEWLTELVAARRAIRVRIAGHERVITIEDAGRVR 1007 Query: 77 ---EQPAQAELPEEITTATVAKISECDLTATIEQRGPQTADELATLFGTTSRKVASTLAM 133 +PE T + L GP A + A FG V L Sbjct: 1008 DALGVALPVGVPEVFTEPVADPLG-DLLGRYARTHGPFPAGQAAERFGLGVAVVTGVLER 1066 Query: 134 AISKGRLIR 142 + GRL+R Sbjct: 1067 MAATGRLVR 1075 >UniRef50_D0L8P3 DEAD/H associated domain protein n=13 Tax=Corynebacterineae RepID=D0L8P3_GORB4 Length = 1544 Score = 75.6 bits (184), Expect = 9e-13, Method: Composition-based stats. Identities = 38/170 (22%), Positives = 58/170 (34%), Gaps = 23/170 (13%) Query: 13 REKNLMETVFDALKAMGKATSIELAARLDISREEVLNELWELKKAG---FVDKSAYTWRV 69 R+ E + D L+ +G T+ E+AAR ++ L +L +AG V Sbjct: 923 RQARDAEDIVDLLRWLGPLTTEEVAARYQ--GDDAAAVLADLHRAGPIISVTHQGRPLWA 980 Query: 70 ADNNVQQ-----EQPAQAELPEEITTATVAKISECDLTATIEQRGPQTADELATLFGTTS 124 A + + PA +P T ++ + GP T E A G Sbjct: 981 AVEDTARLRDALGVPAPLGVPAAYTDPVPDPVT-DLVHRYARTHGPFTLTEAAESLGMAV 1039 Query: 125 RKVASTLAMAISKGRLIRVNQGGKFRYCIPGDNLPAEPKAASVSPLWLSA 174 V TL ++ R + GD LPA A + W A Sbjct: 1040 AVVRDTLVRLAAQ------------RKVVEGDFLPAAADDAHPTTQWCHA 1077 >UniRef50_D0LRE7 DEAD/H associated domain protein n=1 Tax=Haliangium ochraceum DSM 14365 RepID=D0LRE7_HALO1 Length = 1523 Score = 74.9 bits (182), Expect = 2e-12, Method: Composition-based stats. Identities = 34/152 (22%), Positives = 59/152 (38%), Gaps = 13/152 (8%) Query: 13 REKNLMETVFDALKAMGKATSIELAARLD---ISREEVLNELWELKKAGFVDKSAYTWRV 69 R ++ + D L+ +G+ EL AR D + E+ L EL ++ + V Sbjct: 950 RRPRHVDALSDLLRRLGELDDDELRARWDAREMPVEDALAELQRARRIARLRVGGRARWV 1009 Query: 70 ADNNVQQ-----EQPAQAELPEEITTATVAKISECDLTATIEQRGPQTADELATLFGTTS 124 A ++ +PE A + E L RGP ++ ++A + Sbjct: 1010 AMEDMGLYRDALGAAPPPGVPEVFLEPVAAPV-ETLLGRFARTRGPFSSAQVAARYALPP 1068 Query: 125 RKVASTLAMAISKGRLIRVNQGGKFRYCIPGD 156 +V + L +G+L+R G FR GD Sbjct: 1069 AQVDALLGALAGQGKLLR----GAFRPGATGD 1096 >UniRef50_A0JSK2 DEAD/H associated domain protein n=5 Tax=Actinomycetales RepID=A0JSK2_ARTS2 Length = 1694 Score = 74.5 bits (181), Expect = 2e-12, Method: Composition-based stats. Identities = 41/197 (20%), Positives = 65/197 (32%), Gaps = 38/197 (19%) Query: 18 METVFDALKAMGKATSIELAARLDISR---------------EEVLN-----------EL 51 ME V D L+ +G T E+AARL+ + E + L Sbjct: 1023 MEGVADLLRLLGPLTPAEVAARLEPAGPPVAEPAPGVEPAPVVEPVGTLVADAAAHLAAL 1082 Query: 52 WELKKAGFVDKSAYTWRVADNNVQQ-----EQPAQAELPEEITTATVAKISECDLTATIE 106 +A V+ A + + P +P + ++ Sbjct: 1083 QRANRALRVNIGGVERFAAVEDAARLRDAIGVPLPMGVPLAFIEPVADPLG-DLVSRYAR 1141 Query: 107 QRGPQTADELATLFGTTSRKVASTLAMAISKGRLIRVNQGGKFR-YCIPGDNLPAEPKAA 165 GP TA E A G V + L + GR++ G+FR + P +P P+ Sbjct: 1142 THGPFTAAEAAARLGLGVAVVGTALKRLAADGRVVE----GEFRPHATPPRQVPETPEPG 1197 Query: 166 SV-SPLWLSASSSACHG 181 +P SAS+ G Sbjct: 1198 EEDAPALPSASTITLAG 1214 >UniRef50_A6W838 DEAD/H associated domain protein n=27 Tax=Bacteria RepID=A6W838_KINRD Length = 1584 Score = 68.7 bits (166), Expect = 1e-10, Method: Composition-based stats. Identities = 34/157 (21%), Positives = 51/157 (32%), Gaps = 27/157 (17%) Query: 19 ETVFDALKAMGKATSIELAARLD-----------ISREEV---LNELWELKKAGFVDKSA 64 E V D L+ +G T+ E+ R + EV L L E ++ V + Sbjct: 959 EDVADLLRVLGPLTTAEVVVRSGRGEPGSQGPAALPEGEVARWLVGLEEARRLIRVRIAG 1018 Query: 65 YTWRVADNNVQQ-----EQPAQAELPEEITTATVAKISECDLTATIEQRGPQTADELATL 119 + + P +PE T ++ RGP T ++A Sbjct: 1019 EEHWAVIEDAGRLRDGLGTPLPVGIPEAFTEPVADPLA-GLFGRYARTRGPFTVLDVAAR 1077 Query: 120 FGTTSRKVASTLAMAISKGRLIRVNQGGKFRYCIPGD 156 FG L S GRL+ G+ R PG Sbjct: 1078 FGVGRAVAHDALRRLQSAGRLVE----GELR---PGG 1107 >UniRef50_C1F7I8 DEAD/DEAH box helicase domain protein n=1 Tax=Acidobacterium capsulatum ATCC 51196 RepID=C1F7I8_ACIC5 Length = 1565 Score = 68.3 bits (165), Expect = 1e-10, Method: Composition-based stats. Identities = 33/144 (22%), Positives = 55/144 (38%), Gaps = 14/144 (9%) Query: 19 ETVFDALKAMGKATSIELAARLDI-SREEVLNELWELKKAGFVDKSAYTWRVADNNVQQ- 76 + + D L +G + ELA RL VL+ L + ++ V ++ +A + + Sbjct: 954 DGIHDLLLRLGDLSREELARRLATPDLLAVLDRLLKARRILEVKIASERRLIAVEDAARY 1013 Query: 77 ----EQPAQAELPEEITTATVAKISECDLTATIEQRGPQTADELATLFGTTSRKVASTLA 132 P + + + + E + GP T E A FG +R V S L Sbjct: 1014 RDALGIPLPPGIAKALLATVEKPVLE-LIRRFARTHGPFTLAEAAQRFGLDARAVESVLR 1072 Query: 133 MAISKGRLIRVNQGGKFRYCIPGD 156 +GR++ G FR PG Sbjct: 1073 TLALEGRVLE----GGFR---PGG 1089 >UniRef50_C5BWN7 DEAD/H associated domain protein n=3 Tax=Micrococcineae RepID=C5BWN7_BEUC1 Length = 1668 Score = 67.6 bits (163), Expect = 3e-10, Method: Composition-based stats. Identities = 30/151 (19%), Positives = 50/151 (33%), Gaps = 14/151 (9%) Query: 19 ETVFDALKAMGKATSIELAARLDISREEV---LNELWELKKAGFVDKSAYTWRVADNNVQ 75 E FD ++ +G + E+A R + E V L +L ++ V + A + Sbjct: 1038 EGAFDLVRRLGPVPAEEVARRA-LDPEAVPGWLADLERQRRIIAVRVAGREQWSAAEDAG 1096 Query: 76 Q-----EQPAQAELPEEITTATVAKISECDLTATIEQRGPQTADELATLFGTTSRKVAST 130 + LP + + + GP TA E+A FG V Sbjct: 1097 RLRDALGTALPLGLPAAVLESVPDPFG-DLVRRHARTHGPFTAAEVAHRFGLAPGAVLPV 1155 Query: 131 LAMAISKGRLIRVNQGGKFRYCIPGDNLPAE 161 L G L G+ R G + ++ Sbjct: 1156 LRTFERDGTLT----AGRLRPAASGASAGSD 1182 >UniRef50_B8GTJ5 DEAD/DEAH box helicase domain-containing protein n=1 Tax=Thioalkalivibrio sp. HL-EbGR7 RepID=B8GTJ5_THISH Length = 1476 Score = 67.2 bits (162), Expect = 3e-10, Method: Composition-based stats. Identities = 33/142 (23%), Positives = 49/142 (34%), Gaps = 13/142 (9%) Query: 23 DALKAMGKATSIELAARLDISREEVLNELWELKKAGFVDKSAYTWRVADNN-----VQQE 77 D L+ +G ELA R L L +A V + +A + Sbjct: 907 DLLRRLGDQHLDELARRCTEDPAPWLETLKHSGRAAAVRIAGEPRFIAAEDAGLYRDALG 966 Query: 78 QPAQAELPEEITTATVAKISECDLTATIEQRGPQTADELATLFGTTSRKVASTLAMAISK 137 LP+ + E L RGP + A FG T +V L +++ Sbjct: 967 VAPPPGLPDAFLVPVPDAL-EQLLRRFARTRGPFVTKQAAARFGLTEAQVIPVLRALLTE 1025 Query: 138 GRLIRVNQGGKFRYCIPGDNLP 159 GRL++ G+ R PG P Sbjct: 1026 GRLVQ----GELR---PGGREP 1040 >UniRef50_A8L6I9 DEAD/H associated domain protein n=13 Tax=Bacteria RepID=A8L6I9_FRASN Length = 1613 Score = 67.2 bits (162), Expect = 4e-10, Method: Composition-based stats. Identities = 39/195 (20%), Positives = 65/195 (33%), Gaps = 29/195 (14%) Query: 13 REKNLMETVFDALKAMGKATSIELAARLDISREEVLNELWELKKAGFVDKSAYTWRVADN 72 R + +E+ D L+A+G T E AR ++ L L + + V Sbjct: 956 RHAHGVESTADLLRALGDLTDEEAGAR-GVAP-SWLAALADAGRVIRVRIGGEARWAPVE 1013 Query: 73 NVQQ-----EQPAQAELPEEITTATVAKISECDLTATIEQRGPQTADELATLFGTTSRKV 127 + + P +P T + ++ GP +A FG V Sbjct: 1014 DAGRLRDALGVPLPMGVPAAFTEPVRDPLG-DLVSRYARTHGPFEPAAVAARFGLGVAVV 1072 Query: 128 ASTLAMAISKGRLIRV----NQGGKFRYCIPG----------DNLPAEPKAASVS----- 168 L + GRL+R ++ G+ ++C G L E +A Sbjct: 1073 VGALERLTAAGRLVRGELHPDRAGE-QWCDAGVLRALRRRSLAALRKEVEAVPTRAMGAF 1131 Query: 169 -PLWLSASSSACHGV 182 P W S ++ HGV Sbjct: 1132 LPAWQSVTTGRGHGV 1146 >UniRef50_D1VBA8 DEAD/H associated domain protein n=1 Tax=Frankia sp. EuI1c RepID=D1VBA8_9ACTO Length = 1675 Score = 65.6 bits (158), Expect = 9e-10, Method: Composition-based stats. Identities = 34/135 (25%), Positives = 49/135 (36%), Gaps = 8/135 (5%) Query: 13 REKNLMETVFDALKAMGKATSIELAARLDISREEVLNELWELKKAGFVDKSAYTWRVADN 72 R +E D L+ +G T+ E AR EE L L +A V + VA Sbjct: 986 RRARDVEGAADLLRVLGDLTTAEALARGAT--EEWLRALAAAGRALEVRIAGEERWVAVE 1043 Query: 73 NVQQ-----EQPAQAELPEEITTATVAKISECDLTATIEQRGPQTADELATLFGTTSRKV 127 + + P +P T + ++ RGP A+ELA FG Sbjct: 1044 DAGRLRDALGVPLPMGVPTAFTEPVRDPLG-DLVSRFTRTRGPFDAEELAARFGLGVSVA 1102 Query: 128 ASTLAMAISKGRLIR 142 L + GRL+R Sbjct: 1103 VGALERLVGAGRLVR 1117 >UniRef50_C7NJ98 ATP dependent helicase, Lhr family n=1 Tax=Kytococcus sedentarius DSM 20547 RepID=C7NJ98_KYTSD Length = 1660 Score = 65.2 bits (157), Expect = 1e-09, Method: Composition-based stats. Identities = 27/135 (20%), Positives = 45/135 (33%), Gaps = 8/135 (5%) Query: 13 REKNLMETVFDALKAMGKATSIELAARL--DISREEVLNELWELKKAGFVDKSAYTWRVA 70 R E + D L+ +G ELAAR + EL ++ V Sbjct: 1049 RRARHAEDLADLLRVLGPLDDAELAARSVPEGDPAAWTAELEGARRIVRVPVGGRDCWAV 1108 Query: 71 DNNVQQ-----EQPAQAELPEEITTATVAKISECDLTATIEQRGPQTADELATLFGTTSR 125 +V + P +P+ + ++ L GP A+ LA +G Sbjct: 1109 VEDVARLRDALGVPVPPGIPQAAMVVSTDPLT-DLLVRHARTHGPFRAEALAERYGLGRA 1167 Query: 126 KVASTLAMAISKGRL 140 V + L + GR+ Sbjct: 1168 VVVAALRELVRSGRV 1182 >UniRef50_D2AY23 Putative ATP-dependent helicase Lhr n=1 Tax=Streptosporangium roseum DSM 43021 RepID=D2AY23_STRRD Length = 1620 Score = 62.2 bits (149), Expect = 1e-08, Method: Composition-based stats. Identities = 35/185 (18%), Positives = 59/185 (31%), Gaps = 29/185 (15%) Query: 19 ETVFDALKAMGKATSIELAARLDISREEV-LNELWELKKAGFVDKSAYTWRVADNNVQQ- 76 E + D L++ G + +++ R + L L ++A V + A + + Sbjct: 1027 EDLADLLRSHGPLLAADVSLR---EGDPAWLTGLESARRAIRVRVAGQEQWAAIEDAARL 1083 Query: 77 ----EQPAQAELPEEITTATVAKISECDLTATIEQRGPQTADELATLFGTTSRKVASTLA 132 P +P + + R P A A FG V TL Sbjct: 1084 RDALGVPLPVGVPHTFLEPVDDPLG-DLVARHARTRTPFPASTAAARFGLGPAVVTDTLR 1142 Query: 133 MAISKGRLIRVNQGGKFR-------YCIPG--------DNLPAEPKAASVSPLWLSASSS 177 GR++ G+FR +C G + VSP L+ + Sbjct: 1143 RLAVSGRVV----SGEFRPGGRGEEWCDAGVLRMLRRRSLARLRKEVEPVSPETLAVFAP 1198 Query: 178 ACHGV 182 A HG+ Sbjct: 1199 AWHGI 1203 >UniRef50_Q023K9 ATP dependent helicase, Lhr family n=1 Tax=Candidatus Solibacter usitatus Ellin6076 RepID=Q023K9_SOLUE Length = 1521 Score = 60.2 bits (144), Expect = 4e-08, Method: Composition-based stats. Identities = 28/134 (20%), Positives = 49/134 (36%), Gaps = 7/134 (5%) Query: 19 ETVFDALKAMGKATSIELAARLDISREEVLNELWELKKAGFVDKSAY--TWRVADNNVQQ 76 + + D L +G + E+AAR E L + ++ Y A Sbjct: 937 DALHDLLLKLGDLSKEEIAARSTGGSLEDLVAARRAVRVRIAGETRYIPVEYAARYRDAV 996 Query: 77 EQPAQAELPEEITTATVAKISECDLTATIEQRGPQTADELATLFGTTSRKVASTLAMAIS 136 P L + ++ + E L GP T E+A +G + +V ++L Sbjct: 997 GTPLPPGLADVFLESSEHPLLE-LLRRYARTHGPFTTAEVAARYGLVAAQVDASLRRQHL 1055 Query: 137 KGRLIRVNQGGKFR 150 G+L+ G+FR Sbjct: 1056 DGKLLE----GEFR 1065 >UniRef50_A7HB32 DEAD/H associated domain protein n=2 Tax=Myxococcales RepID=A7HB32_ANADF Length = 1553 Score = 59.1 bits (141), Expect = 9e-08, Method: Composition-based stats. Identities = 28/139 (20%), Positives = 44/139 (31%), Gaps = 11/139 (7%) Query: 18 METVFDALKAMGKATSIELAARLDISREEVLNELWELKKAGFVDKSAYTWRV------AD 71 ++ + D L +G + ELA R E ++ + R+ A Sbjct: 950 VDGIADLLLRLGDLSREELARRTATPELAAGAERLVAERRAVALRIGGEERLVAVEDAAR 1009 Query: 72 NNVQQEQPAQAELPEEITTATVAKISECDLTATIEQRGPQTADELATLFGTTSRKVASTL 131 P LPE + + E + P T ELA G + L Sbjct: 1010 YRDGLGVPLPPGLPEALLGPVPDAL-EGLAARYARRHAPFTTGELAARLGISRGAAERAL 1068 Query: 132 AMAISKGRLIRVNQGGKFR 150 + +GRL+ G FR Sbjct: 1069 RALLERGRLLE----GAFR 1083 >UniRef50_A9WLQ1 ATP-dependent helicase n=3 Tax=Micrococcaceae RepID=A9WLQ1_RENSM Length = 409 Score = 59.1 bits (141), Expect = 1e-07, Method: Composition-based stats. Identities = 20/109 (18%), Positives = 39/109 (35%), Gaps = 10/109 (9%) Query: 47 VLNELWELKKAGFVDKSAYTWRVADNNVQQ-----EQPAQAELPEEITTATVAKISECDL 101 + EL + +A + + +A + + P +P + + + Sbjct: 2 AITELVKANRAFTSKIAGQSVLLAVEDAARVRDALGLPLPMGIPLAFIEPVLDPLG-DLV 60 Query: 102 TATIEQRGPQTADELATLFGTTSRKVASTLAMAISKGRLIRVNQGGKFR 150 + GP E+A FG V S L + GR+++ G+FR Sbjct: 61 SRYARTHGPFLTSEVAARFGLGVAIVQSALDRLGADGRVVQ----GEFR 105 >UniRef50_B9ZKU2 DEAD/H associated domain protein n=1 Tax=Thioalkalivibrio sp. K90mix RepID=B9ZKU2_9GAMM Length = 1506 Score = 58.7 bits (140), Expect = 1e-07, Method: Composition-based stats. Identities = 24/129 (18%), Positives = 41/129 (31%), Gaps = 6/129 (4%) Query: 19 ETVFDALKAMGKATSIELAARLDISREEVLNELWELKKAGFVDKSAYTWRVADNN----- 73 + + D L+ G T E+AA + L +L +A V + +A + Sbjct: 920 DALHDLLRRRGDLTPDEVAAACAEAPGPWLEQLARSLRAVEVRIAGEPRWIAAEDAALYR 979 Query: 74 VQQEQPAQAELPEEITTATVAKISECDLTATIEQRGPQTADELATLFGTTSRKVASTLAM 133 LP + E L GP +++L +G + L Sbjct: 980 DALGVVPPPGLPAAFLEPPQHPL-EQLLRRYARTHGPFRSEDLGARYGLAPGALLPALES 1038 Query: 134 AISKGRLIR 142 G L+R Sbjct: 1039 LERAGVLLR 1047 >UniRef50_C8Q5F9 DEAD/H associated domain protein n=2 Tax=Enterobacteriaceae RepID=C8Q5F9_9ENTR Length = 1525 Score = 58.3 bits (139), Expect = 1e-07, Method: Composition-based stats. Identities = 36/162 (22%), Positives = 62/162 (38%), Gaps = 22/162 (13%) Query: 18 METVFDALKAMGKATSIELAARL--DISREEVLNELWELKKAGFVDKSAYT-WRVADNNV 74 +E + D L+ +G T ELA R D E L L ++ V + W +A++ Sbjct: 926 IEALSDLLREVGPLTDAELALRYSGDAPLAEGLQALQLAQRIFSVRIADQQRWALAEDAS 985 Query: 75 QQEQPAQAELPEEITTATVAKISE---CDLTATIEQRGPQTADELATLFGTTSRKVASTL 131 + +P+ + A + ++ L TA ELA FG VA L Sbjct: 986 RLRDALGCHVPDTLPPALLQPVTRPLYDLLHRYARTHALFTASELAHHFGLGVAVVADAL 1045 Query: 132 AMAISKGRLIRVNQGGKFRYCIPGDNLPAEPKAASVSPLWLS 173 +G++I+ GD P++ + P W++ Sbjct: 1046 GELRQQGKVIK------------GDFTPSDDE----EPQWVA 1071 >UniRef50_B8HC38 DEAD/H associated domain protein n=1 Tax=Arthrobacter chlorophenolicus A6 RepID=B8HC38_ARTCA Length = 1729 Score = 57.1 bits (136), Expect = 4e-07, Method: Composition-based stats. Identities = 24/122 (19%), Positives = 38/122 (31%), Gaps = 14/122 (11%) Query: 51 LWELKKAGFVDKSAYTWRVADNNVQQ-----EQPAQAELPEEITTATVAKISECDLTATI 105 L +A V+ A + + P +P + ++ Sbjct: 1110 LQRANRAIKVNIGGAERFAAVEDAARLRDAIGVPLPMGVPLAFIEPVADPLG-DLVSRYA 1168 Query: 106 EQRGPQTADELATLFGTTSRKVASTLAMAISKGRLIRVNQGGKFR-YCIPGDNLPAEPKA 164 GP TA E A G V + L GR++ G+FR + P A P+A Sbjct: 1169 RTHGPFTAAEAAARLGLGVAVVGTALKRLADDGRVVE----GEFRPHATPAA---AGPEA 1221 Query: 165 AS 166 Sbjct: 1222 TP 1223 >UniRef50_Q2IJM9 DEAD/DEAH box helicase-like n=9 Tax=Bacteria RepID=Q2IJM9_ANADE Length = 1536 Score = 56.8 bits (135), Expect = 4e-07, Method: Composition-based stats. Identities = 27/138 (19%), Positives = 44/138 (31%), Gaps = 11/138 (7%) Query: 19 ETVFDALKAMGKATSIELAARLDISREEV-LNELWELKKAGFVDKSAYTWRVADNNVQQ- 76 + + D L +G + E+ R L+ L ++A + VA + + Sbjct: 928 DGLADMLLRLGDLSRAEVERRSASPEVAASLDRLVAERRAVPLRVGGEERFVAVEDAARY 987 Query: 77 ----EQPAQAELPEEITTATVAKISECDLTATIEQRGPQTADELATLFGTTSRKVASTLA 132 P +PE + + + + P TA ELA G L Sbjct: 988 RDGLGVPLPPGVPEALLAPVPDALD-GLVARHARRHAPFTAGELAARLGLGRAAAERVLE 1046 Query: 133 MAISKGRLIRVNQGGKFR 150 GRL+ G FR Sbjct: 1047 RLARAGRLLE----GAFR 1060 >UniRef50_C7MDW5 ATP dependent helicase, Lhr family n=1 Tax=Brachybacterium faecium DSM 4810 RepID=C7MDW5_BRAFD Length = 1601 Score = 52.5 bits (124), Expect = 9e-06, Method: Composition-based stats. Identities = 31/138 (22%), Positives = 49/138 (35%), Gaps = 12/138 (8%) Query: 19 ETVFDALKAMGKATSIELAARL--DISREEVLNELWELKKAGFVDKSAYTWRVADNN--- 73 E + DAL+ +G T+ +L+ R + E L +L ++ V + A + Sbjct: 966 EQIADALRRLGPLTADDLSRRTAEGVDLEAELEQLTRARRIISVRLAGREHVAAVEDAGL 1025 Query: 74 --VQQEQPAQAELPEEITTATVAKISECDLTATIEQRGPQTADELATLFGTTSRKVASTL 131 LP+ A V + L RGP A + FG TL Sbjct: 1026 LRDALGTALPPGLPQAHL-APVDRAVPQLLARWARGRGPFPAADAVEAFGLAPGVARGTL 1084 Query: 132 AMAISKGRLIRVNQGGKF 149 ++ RV Q G+F Sbjct: 1085 EQLTAE----RVLQQGEF 1098 >UniRef50_UPI0001C317F9 DEAD/H associated domain protein n=1 Tax=Conexibacter woesei DSM 14684 RepID=UPI0001C317F9 Length = 1480 Score = 47.1 bits (110), Expect = 4e-04, Method: Composition-based stats. Identities = 30/141 (21%), Positives = 48/141 (34%), Gaps = 18/141 (12%) Query: 23 DALKAMGKATSIELAARL--DISREEVLNELWELKKAGFVDKSAYTWRV--ADNNVQQEQ 78 D L+ +G T+ E A R+ + L+ L ++A + V AD + ++ Sbjct: 902 DVLRRVGDLTAAEAAQRVVPGVDAAAALDALASERRAVLLRIGGEQRWVDAADAGMYRDA 961 Query: 79 PA---QAELPEEITTATVAKISECDLTATIEQRGPQTADELATLFGTTSRKVASTLAMAI 135 LPE + L + GP T +E FG L Sbjct: 962 LGVVPPGGLPEAFLADVPDALVR-LLRRYAQTHGPFTTEEPRLRFGVDP---TPALRALE 1017 Query: 136 SKGRLIRVNQGGKFRYCIPGD 156 G L++ G+ R PG Sbjct: 1018 RDGELVQ----GELR---PGG 1031 >UniRef50_C8WJH4 DEAD/H associated domain protein n=1 Tax=Eggerthella lenta DSM 2243 RepID=C8WJH4_EGGLE Length = 1741 Score = 45.2 bits (105), Expect = 0.001, Method: Composition-based stats. Identities = 22/85 (25%), Positives = 31/85 (36%), Gaps = 5/85 (5%) Query: 70 ADNNVQQEQPAQAELPEEITTATVAKISECDLTATIEQRGPQTADELATLFGTTSRKVAS 129 A ++ + A AE + + E L GP +A+ A FG Sbjct: 1149 AQREIEGSRAASAESDG--GEDALHPLDE-LLARYARTHGPFSAEAAAARFGIGVAVARD 1205 Query: 130 TLAMAISKGRLI--RVNQGGKFRYC 152 LA S GRL+ R + G R C Sbjct: 1206 GLARLTSAGRLMQGRFGEDGDGRAC 1230 >UniRef50_Q8NS36 Lhr-like helicases n=2 Tax=Corynebacterium glutamicum RepID=Q8NS36_CORGL Length = 1520 Score = 44.4 bits (103), Expect = 0.002, Method: Composition-based stats. Identities = 27/130 (20%), Positives = 48/130 (36%), Gaps = 3/130 (2%) Query: 5 QELRSNFYREKNLMETVFDALKAMGKATSIELAARLDISREEVLNELWELKKAGFVDKSA 64 Q+LR R E + D+L+ +G EL + ++ + ++ G + A Sbjct: 924 QQLRRQGDRAARNNEELADSLRILGPIPLDELGEHITFENPDLEDRAMTVRINGR-EHLA 982 Query: 65 YTWRVADNNVQQEQPAQAELPEEITTATVAKISECDLTATIEQRGPQTADELATLFGTTS 124 P +P ++ T+ E + + RGP TA++LA FG Sbjct: 983 QVLDAPLLRDALGVPVPPGVPAQV--ETITDALEQLVNRWVRTRGPFTANDLAEAFGLGI 1040 Query: 125 RKVASTLAMA 134 + L A Sbjct: 1041 ATAITALQSA 1050 >UniRef50_C7R3F9 DEAD/H associated domain protein n=1 Tax=Jonesia denitrificans DSM 20603 RepID=C7R3F9_JONDD Length = 1618 Score = 44.4 bits (103), Expect = 0.003, Method: Composition-based stats. Identities = 27/162 (16%), Positives = 50/162 (30%), Gaps = 40/162 (24%) Query: 19 ETVFDALKAMGKATSIELAAR-LDISREE-----------------------------VL 48 E V+D L+ G T L AR + I R++ V+ Sbjct: 974 EDVWDFLQRWGPITVEGLGARWVGIDRDDLVQVDPHHLNRSVDALSCPDPRGVDWARDVV 1033 Query: 49 NELWELKKAGFV---DKSAYTWRVADN------NVQQEQPAQAELPEEITTATVAKISEC 99 EL + ++ V +S VA + +E + + Sbjct: 1034 AELVQSRRVFPVRLMTRSGPREYVASIVDAGRLRDALGVAIPPGIADEWSAPVRDPLG-D 1092 Query: 100 DLTATIEQRGPQTADELATLFGTTSRKVASTLAMAISKGRLI 141 + + P T D++A FG V + ++ G ++ Sbjct: 1093 MVRRYAKTHVPFTVDDVAERFGVGRAVVFDVVRRLVASGVVV 1134 >UniRef50_P30015 Probable ATP-dependent helicase lhr n=44 Tax=Enterobacteriaceae RepID=LHR_ECOLI Length = 1538 Score = 43.3 bits (100), Expect = 0.005, Method: Composition-based stats. Identities = 32/135 (23%), Positives = 51/135 (37%), Gaps = 9/135 (6%) Query: 19 ETVFDALKAMGKATSIELAARLDISREEV---LNELWELKKAGFVDKSAYTWRVADNNVQ 75 E +FD L+ +G T +LA R S EEV L L +K+ S R+A + Sbjct: 939 EGLFDLLRELGPMTVEDLAQRHTGSSEEVASYLENLLAVKRIFPAMISGQE-RLACMDDA 997 Query: 76 Q--EQPAQAELPEEITTATVAKIS---ECDLTATIEQRGPQTADELATLFGTTSRKVAST 130 LPE + + ++S + TA++LA F V Sbjct: 998 ARLRDALGVRLPESLPEIYLHRVSYPLRDLFLRYLRAHALVTAEQLAHEFSLGIAIVEEQ 1057 Query: 131 LAMAISKGRLIRVNQ 145 L +G ++ + Q Sbjct: 1058 LQQLREQGLVMNLQQ 1072 >UniRef50_C8U7P5 Putative uncharacterized protein n=13 Tax=root RepID=C8U7P5_ECO10 Length = 129 Score = 42.1 bits (97), Expect = 0.011, Method: Composition-based stats. Identities = 16/50 (32%), Positives = 26/50 (52%) Query: 18 METVFDALKAMGKATSIELAARLDISREEVLNELWELKKAGFVDKSAYTW 67 M V L +GKA++ ++ L+I +V++ LW+LK G V W Sbjct: 1 MNAVLTELNKLGKASAESISKGLNIDLNDVIDTLWKLKNQGVVTVKNGIW 50 >UniRef50_Q1YII8 Possible regulatory protein n=1 Tax=Aurantimonas manganoxydans SI85-9A1 RepID=Q1YII8_MOBAS Length = 344 Score = 41.0 bits (94), Expect = 0.024, Method: Composition-based stats. Identities = 33/170 (19%), Positives = 56/170 (32%), Gaps = 25/170 (14%) Query: 25 LKAMGKATSIELAARLDISREEVLNELWELKKAGFVDKSAYTWRVADNNVQQEQ------ 78 L A+G T LA RL ISR V L EL++ G V + + Sbjct: 41 LPALGHLTESLLAERLGISRAPVRRALSELEQQGLVKRVKGKGYAVLAGNASARKPPLPG 100 Query: 79 ----PAQAELPEEIT---TATVAKISECDLTATIEQRGPQTADELATLFGTTSRKVASTL 131 PA P + + ++ + + LA +G + + Sbjct: 101 NTASPAHDGSPLQSSPSWERIYGEVEDQVIARISFASWRVNEARLARHYGVSRTVARDVV 160 Query: 132 AMAISKGRLIRVNQGGKFRYCIPGDNLPA--------EP----KAASVSP 169 +G L + ++G + + D++ EP KAA +P Sbjct: 161 GRLQQRGVLRKDDRGRWYAPALTPDHIAELYELRSILEPIALTKAAPHAP 210 Database: uniref50.fasta Posted date: Mar 8, 2010 10:38 AM Number of letters in database: 1,040,396,356 Number of sequences in database: 3,077,464 Lambda K H 0.302 0.120 0.307 Lambda K H 0.267 0.0375 0.140 Matrix: BLOSUM62 Gap Penalties: Existence: 11, Extension: 1 Number of Hits to DB: 960,747,861 Number of Sequences: 3077464 Number of extensions: 31897090 Number of successful extensions: 95273 Number of sequences better than 1.0e-01: 38 Number of HSP's better than 0.1 without gapping: 31 Number of HSP's successfully gapped in prelim test: 23 Number of HSP's that attempted gapping in prelim test: 95201 Number of HSP's gapped (non-prelim): 64 length of query: 203 length of database: 1,040,396,356 effective HSP length: 122 effective length of query: 81 effective length of database: 664,945,748 effective search space: 53860605588 effective search space used: 53860605588 T: 11 A: 40 X1: 16 ( 7.0 bits) X2: 38 (14.6 bits) X3: 64 (24.7 bits) S1: 42 (21.4 bits) S2: 89 (39.0 bits)